LOCUS NODE_263_length_66606_cov_4.981879 66606 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 66606) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 66606) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..66606 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 102..368 /gene="psaK" /locus_tag="DP116_00005" CDS 102..368 /gene="psaK" /locus_tag="DP116_00005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410900.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit PsaK" /protein_id="PRJNA477356:DP116_00005" /translation="MLTSTLLAATFATTPLQWSPVVGIIMILSNIAAIAFGKSTIKYQ SVGPELPSPNLFGGLGLPALLATTAFGHVIGTGIILGLHNLGRL" gene complement(545..916) /locus_tag="DP116_00010" CDS complement(545..916) /locus_tag="DP116_00010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00010" /translation="MKRKHFFHEEDSHPVQLTSADIMLRQQLEYSISRYFYEGCDRNI QDLLSNCRWYVTTDVSTLTLVIECPDQVTNWRVLQKLVTMAALLQQIVSSAKIRVCPP KSQGMPFEMRVDELSVYRDLA" gene complement(941..1809) /locus_tag="DP116_00015" /pseudo CDS complement(941..1809) /locus_tag="DP116_00015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194535.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 2204..2977 /locus_tag="DP116_00020" CDS 2204..2977 /locus_tag="DP116_00020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314840.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4142 domain-containing protein" /protein_id="PRJNA477356:DP116_00020" /translation="MKKFVGGFVGVAGISTFIVFPGMVQANINSSSSNSSNESLVSQN TQQTTSPSGSTTSSPRTTTPNGSLTALDKEFMTKAAQSDQTEIQTSQLALKRSQNKEV KDFAQRMIKEHTDSSQQLKQIAKKKSFTLPKDIGQENKALLTKLTKLNGTNFDQAYMQ GQVQAHTKTLANYQNYLSQGQDPDLSAFANKIAPIVADHQQMAQNMAGGSGTSSSGTS GSGTSGTTTPGSGTSGSGTSGSGTNGTTTPSRGTSGSGR" gene complement(3083..3730) /locus_tag="DP116_00025" /pseudo CDS complement(3083..3730) /locus_tag="DP116_00025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872345.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" gene complement(4432..5376) /locus_tag="DP116_00030" CDS complement(4432..5376) /locus_tag="DP116_00030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="protoheme IX farnesyltransferase" /protein_id="PRJNA477356:DP116_00030" /translation="MIENTATRHHQNFLQVIHSYYQLTKPRIIPLLLMTTASSMWIAS EGNVNPVLLLVTVCGGTFAAASAQTINCIYDRDIDDQMERTRHRPLVSGRVQPSHALI FAIALACLSFTLLAVFVNLLSALLTMSGIVFYVGVYTHLLKRHNPANIVIGGAAGGIP ALVGWAAVTDTLSWAAWLLFAIVFLWTPPHFWALALMIRDDYAKVGVPMLPVVAGNVA ATRQIWVYSLVLIPTTFLLSYPLHVTGAVYTCIALILGTFFIKKAWVLLHNPDDKQLA RSLFLYSIFYMMLLCAAMVVDSLSFTHQLIKAILDLIL" gene complement(5400..5651) /locus_tag="DP116_00035" CDS complement(5400..5651) /locus_tag="DP116_00035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312651.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferrous iron transport protein A" /protein_id="PRJNA477356:DP116_00035" /translation="MFSKFHVKGSSLELLKKGERGIIKFCNIKDEKSLQELISMGITP GTFITVEQHFPCFVIRVGERQLTLTREFAQRIYVRIDDE" gene complement(5840..6139) /locus_tag="DP116_00040" CDS complement(5840..6139) /locus_tag="DP116_00040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_045868657.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_00040" /translation="MATYQVRLINKKEDLDTTIEVDEETTILDAAEENGIDLPFSCKS GACSSCVGKVVEGTIDQSDQSFLDDEQMSKGFALLCVTYPRSDCTIKTNQEPYLV" gene complement(6326..6697) /locus_tag="DP116_00045" CDS complement(6326..6697) /locus_tag="DP116_00045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320632.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="iron-sulfur cluster assembly accessory protein" /protein_id="PRJNA477356:DP116_00045" /translation="MTVTLTEKAEFRLRTLLRGATSETNTPAKGIRISVKDGGCSGYE YGMEVISQPQPDDLVSEQGKVLVYVDAKSAPLLEGLVIDFIEGVMESGFKFINPKATD TCGCGKSFRTADCSSAGTPCS" gene complement(6720..6943) /locus_tag="DP116_00050" /pseudo CDS complement(6720..6943) /locus_tag="DP116_00050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316211.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(6949..7749) /locus_tag="DP116_00055" CDS complement(6949..7749) /locus_tag="DP116_00055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314847.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="protein hesA" /protein_id="PRJNA477356:DP116_00055" /translation="MVDLTPTELERYRRQMMLPNFGETAQKRLKSATVLVTGVGGLGG TAALYLAVAGVGRLILVRGGDLRLDDMNRQILMTHDWVDKPRVFKAKETLQAINPDVQ VEAVHEYVTPENVDSLVQSADMALDCAHNFTERDLLNEACVRWRKPMVEAAMDGMEAY LTTIIPGVTPCLSCLFPEKPDWDRRGFSVLGAVSGTLACLTALEAIKLITGFSQPLLS QLLTIDLTRMEFVKRRSYRDRSCPVCGNSAPWRYSQSQVMETTSVANK" gene complement(7760..8077) /gene="nifW" /locus_tag="DP116_00060" CDS complement(7760..8077) /gene="nifW" /locus_tag="DP116_00060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312647.1" /note="associates with NifD and may protect the nitrogenase Fe-Mo protein from oxidative damage; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase stabilizing/protective protein" /protein_id="PRJNA477356:DP116_00060" /translation="MTWDYDKFKKLEDAEEYLEFFQLPYDQKFVNVNRLHILKKFSQF LKEIDENYTDLSISDRLSKYREALEQAYQVFLESTPQEQKLFKVFNQKPKNVVTLTEI TSD" gene complement(8074..8295) /locus_tag="DP116_00065" CDS complement(8074..8295) /locus_tag="DP116_00065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743467.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00065" /translation="MAQAEETSTIEELKTQIKRLNSKAGQMKMDLHDLAEGLPTDYQQ LMDVASQTYEIYRQLDELKQHLKKLEQEK" gene complement(8363..8839) /locus_tag="DP116_00070" CDS complement(8363..8839) /locus_tag="DP116_00070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872353.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NifX-associated nitrogen fixation protein" /protein_id="PRJNA477356:DP116_00070" /translation="MSFTNNVNGTRPTETINSPFLKAIVQQFRGQDSYGTYRSWSDDL LLKPFIVSKQKKREISIEGEVDLMTQARIMTFYRAVAACIEKETGLLSQVVVDLSHEG FGWALVFSGRLLLVAKSLRDAHRFGFDSLEKLAEEGEKFVQTGVSLAQRFPEVGKI" gene complement(8836..9249) /gene="nifX" /locus_tag="DP116_00075" CDS complement(8836..9249) /gene="nifX" /locus_tag="DP116_00075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320628.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogen fixation protein NifX" /protein_id="PRJNA477356:DP116_00075" /translation="MKIAFTTSDKVHINAHFGSAKEIDVYEISDKGYEFLETLSFDGD LKQDGNEDKVTPKLEALADCKIVYVAAIGGSAAARLIKKGVTPVKARSEEEEIAEILN KLVQTLKGNPPPWLRKALQAKPQSFEDELEEEATV" gene complement(9349..10695) /locus_tag="DP116_00080" CDS complement(9349..10695) /locus_tag="DP116_00080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407195.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase iron-molybdenum cofactor biosynthesis protein NifN" /protein_id="PRJNA477356:DP116_00080" /translation="MAIVTVTNSSVAVNSFKLSQPLGAALALLGLKGMIPLFHGSQGC TAFAKSMLVKHFCESIPLSTTAMTEVSTILGGEENVEQAIVTLAERSKPEIIGLCTTA LTETRGDDMPRFLKEVRDRHPELDHLPIVLVSSPDFKGTLQDGYAAAVESIVKQIAQK SDKQDPSPTQVVILPSSAFTPGDVEEIKEIITAFGLKPIVVPDLSASLDGHLEDSYSA ITACGTSLTELGEIGNSVFTLAFGESMRSAAQILHERFGIPYEVFGELTGLEPVDNFL QALADLSGTSVPEKYRRQRRQLQDAMLDTHFYFGCKRVSLALEPDLLWSTVCFLRSMG AEIHAAVTTTRSPLLEKLPISSVTIGDLEDFEQLAPTSDLLIGNSHTVAVANRLGIPL YRQGFPIFDRVGNGLFTKVGYRGTMQVLFDIGNIFLQAEEARIQHSGSFNVLSSEI" gene complement(10695..12119) /locus_tag="DP116_00085" CDS complement(10695..12119) /locus_tag="DP116_00085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase iron-molybdenum cofactor biosynthesis protein NifE" /protein_id="PRJNA477356:DP116_00085" /translation="MKITQNKINELLTQPGCEHNKSKHGDKKNKSCTQQPKPGAAQGG CAFDGAMIALVPITDAAHLVHGPIACSGNSWGSRGSLSSGPMLYKTGFTTDLDETDVI FGGEKKLYKAILEVENRYKPTAVFVYSTCVTALIGDDIDAVCKAAAKKTGTPVIPVNA PGFIGSKNFGNRLAGEALLDYVVGTAEPEFTTPYDINLIGEYNVAGEMWNVVRLFEKL GIRVLAKMTGDGRYNEICYAHRAKLNVIICSKALLNMATKMEERYGIPYIEESFYGVE DMNRCLRNVAAKLGDPDLQERTEKLIAEETAALDVALAPYREKLKGKRVVLYTGGVKS WSIISAAKDLGMEVVATSTRKSTEEDKARIKKLLGQDGIMLEQESPKELLRIIKEKNA DMLIAGGRNQYTALKARIPFLHINQERHHPYAGYEGMLEMARELYEAVYSPVWEQVRK PAPWEVAESLEQAVSSSSLLQEVH" gene complement(12207..12533) /locus_tag="DP116_00090" CDS complement(12207..12533) /locus_tag="DP116_00090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312641.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase" /protein_id="PRJNA477356:DP116_00090" /translation="MKTITHTHHHQNHHSPTSKKSGSFEILYPLRRLVDGIQVKNARL AHLICQIIPCCCPFERSIKLFGHTFHIPPLCKLNPLYDNFVGLRFRALSYLTDECGED VTKYIC" gene complement(12748..13005) /locus_tag="DP116_00095" /pseudo CDS complement(12748..13005) /locus_tag="DP116_00095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214312.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="nitrogenase molybdenum-iron protein subunit beta" gene 13060..14610 /locus_tag="DP116_00100" CDS 13060..14610 /locus_tag="DP116_00100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312639.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="recombinase family protein" /protein_id="PRJNA477356:DP116_00100" /translation="MDIWAYARVSTDEQNEDEGALIKQMRRLRSAGATQIYYDVESRT SDKRKGLLQLISDINASAPGKVSKLLFIRIDRLTSSSIIFYRLIDALNNKGIQPIALD EPFDMSSIGGELTIDVRLAASKYEVKMLGMRVKKERDTRKANKKPHWNAPLGYVVDGD RYKRDDRPCVCLIENKVEFTRSQLMRFVFDTFLHVGSVSQTTRRLHDVFGIQANALPK PSNEDINLLGEEDEIILENINKSRGNGLNLRYPHTGLKWSVSGLRSILVNPVYAGGTL YNTVVRPKGHRKPFDEWEVTWGTHEDEAIITHEEHERIKSMIRSNRKNRWATEQKYTN PFANLVKCAHCGAAYSRQCKKLVKKKNFVRHHYQCSFYRTGACHNKRMISSDELEKQV ISHLVREAERLATLVEQEVKITEEPPEIKTLRASLKTIDALPSNPAIEKAKVDIRMQI ASAIATTDNNSKQYLIAKERIISAFTNSRYWQGLKSEDKQALLQGCIKKIVVDANTVT AVELLHLH" gene 15068..16165 /locus_tag="DP116_00105" CDS 15068..16165 /locus_tag="DP116_00105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006631740.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_00105" /translation="MTKYHIVLHHSIDLEKITHEAENNKRPRHVMWTIKQRLNATIHT PQGYQSSFADKLLSKIAGSPEHWAMARALSSRLESDDIVFCNSEAGGIQLATLCGFKR NRPKIVVFFHNIDRPRGRLALKLFNLANKIDLFLACSNHQVTFLRDYLNLPASRVQFV WEQTDLKFFAPSPVSPKKSRPMIASVGLEQRDYKTLAEATANLDVDVKISGYSSDALV LKRAFPEKLPDNMSRQFYEWPELVQLYQNADIIVVSVFESNCGAGIQALMEAMACRRP VIVTRTLGLDQYIAGTNAVMMVKPQDVEGLRQAIIYLLNNPQEADALAERGYRLALSR HDSEQYVNYVTKELKQFEEGVITQESYQTVI" gene complement(16426..17493) /locus_tag="DP116_00110" CDS complement(16426..17493) /locus_tag="DP116_00110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321110.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_00110" /translation="MTDLFTPLTIGAVTFRNRIAVSPMCQYSSTDGYANDWHLIHLAS RAVGGAGLVFTEAAAIEPRGRISPQDLGIWLDEHIEPLAKIVRAIHNFGAIAGIQLAH AGRKASTAPPWEGGQVLDTSNGGWHPVLSSSAIPFSENHPIPEALNTEGIQQIINDFV QAAQRSQEAGFKVIEIHAAHGYLLHQFLSPLSNKRNDEYGGSFENRTRLLREVVQAVR EILPSDYPLWVRISATDWVENGWDIEQSIALAEKLNSLGVDLIDTSSGGSVPNAKIPF GAGYQTEFAARIRHEANILTGAVGLITSPEQADQIIRTGQADIVLIGREMLRNPYWAL SAAKKLRQEKFSPVQYERAWL" gene complement(17655..18440) /locus_tag="DP116_00115" CDS complement(17655..18440) /locus_tag="DP116_00115" /inference="COORDINATES: protein motif:HMM:PF01925.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" /protein_id="PRJNA477356:DP116_00115" /translation="MCEEVNLTTAQACFLFVAAVLGGTLTSVASGGGFLLFPALIFIG LPSINANATSMTAGWLGCAVSVAAYRHELSGQQRISLVLGSISLVGGMIGALLLLYTP TDIFDRLIPYLLLLSTLLFTFGGKITTWLHSDLEDSRRSLLTASVIQFFLAIYGGFYG AGVAMLMLATMEMLGMKNIHKMNALKMLLMSCTSGFAVVTYVIAGVVAWQPAVFMMLG TVVGGYGGAYYARKLQPDLVKRFIIIVSFAMTYYFFIRAYYTH" gene complement(18525..18794) /locus_tag="DP116_00120" CDS complement(18525..18794) /locus_tag="DP116_00120" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00120" /translation="MTIYNSIGKVYSNSRLPDLRIVNSLIDLLNLPKKSIIADIGAGT GGYSRANLLRRRFANAEREYSVYAVEPSSVMRSQSVEHAQRSYQT" gene complement(18797..19747) /locus_tag="DP116_00125" CDS complement(18797..19747) /locus_tag="DP116_00125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867791.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-2-hydroxyacid dehydrogenase" /protein_id="PRJNA477356:DP116_00125" /translation="MKLIIPIEAADELKPHLPTDTTFVRADIDGNLDGDAKEAEIYFS WLYYLKPTTLHKVLESAPALRWHHAPNAGVNNILTQKYLERDIILTNGAGVHAIPIAE FVIAYMLSYAKQLLKLHKLQTQQQWQRDFQIEELQDKTLLIIGTGGIGQEIAARAKAF GMRIFGSRRHPQPLANFDKVVGTNEWKALLSEAEYVVIATPLTKETEGLINAEVLQSM RPDAYLINIARGKIVDEPALLKALQESWIAGAALDAMFTEPLPPDSPFWTLPNVFITP HCSAHSSKVKERTLALFLDNLTRYRHGKPLRNVVDKNAGY" gene complement(19807..20754) /locus_tag="DP116_00130" CDS complement(19807..20754) /locus_tag="DP116_00130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749814.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="D-2-hydroxyacid dehydrogenase" /protein_id="PRJNA477356:DP116_00130" /translation="MKVILPVELADDIEPLLPSDITSVRVDPDGNFDGDPSGAEVYLN GFRVKNTTLHKVLAAAPTIRWQHTPSSGVNHILTPTFLSHDIILTNSSGVHAIPIAEF VLNFMLYHAKNVRELQDLQTNHYWNKWLELQELYEKTLLLIGTGNIGQEIALRAKAFG MQIWGSRRNPEPLANFDKIVGANEWKALLPEADYIVIATPLTPETKGLINAETLRLMR PTAYLINIARGAIVDENALLTALREGWIAGAGLDIFESEPLPPESPFWSLPNAFITPH CSALTPQVRSRIVKLFIDNLTSYRNKEKLRNVVNKNVGY" gene complement(20965..21294) /locus_tag="DP116_00135" CDS complement(20965..21294) /locus_tag="DP116_00135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747235.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4Fe-4S ferredoxin" /protein_id="PRJNA477356:DP116_00135" /translation="MIELVSESRCIECNLCVNVCPTNVFDKVPDAPPIIARQSDCQTC FMCELYCPVDALYVAPQVEPLESVDEKSLKETGLLGSYRNNIGWGRDRTNTAKQDSTH LILKQLK" gene complement(21390..23063) /locus_tag="DP116_00140" CDS complement(21390..23063) /locus_tag="DP116_00140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323011.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyridine nucleotide-disulfide oxidoreductase" /protein_id="PRJNA477356:DP116_00140" /translation="MRQSSSTINIDVDPNLQTDVLVIGGGPAGTWAAWSAASAGARVV LVDKGYCGTSGCAAASGNGVWYVPPEPESREAAIASREALGGFLSNRDWMQQVLNQTY TNVNLLAKWGYPFPVDQQQKSYRRSLQGPEYMRLMRKQIKRAGVTILDHSPALQLLVD AEGAVAGATGVNRQTGKQWVVRAGGVIIATGGCAFLSKALGCNVLTGDGYLMAAEAGV EMSGMEFSNAYGISPAFSSVTKTLFYNWATFTYEDGTPIPGAGSQKGRSVIAQTLLTQ PVYAILDQATEEMQAHMRKAQPNFFLPLDRAGIDPFTQRFPVTLRLEGTVRGTGGIRI VDFTCATSVQGLYAAGDAATRELICGGFTGGGSHNAAWAISSGYWAGKSAAEYAQKFA EWGTQRFIQAVGEAGFTNGSQHPSGSSVTYGGKPACSAAHQINTEEVIQATQAEVFPY DRNYFRREKGLTESLHRLNHLWKEIRTSQVDTHNIVRVREAAAMVATARWMYSSALQR KETRGMHKHLDYPEMDANQQHYLTSGGLDQVWVKATPLHQTKVKVSAAA" assembly_gap 23550..23559 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(23653..24693) /locus_tag="DP116_00145" CDS complement(23653..24693) /locus_tag="DP116_00145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321660.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADP(H)-dependent aldo-keto reductase" /protein_id="PRJNA477356:DP116_00145" /translation="MKYNQLGESDLKVSEICLGTMTYGQQNTIEEAHQQLDYAIAQGI NFIDTAEMYPVPPRGETQGKTEAYIGEWLKKQQRNQLIIATKIAGPGRPFKWLRGGNL QINHNNIKQAVDDSLKRLQTDYIDLYQIHWPERYVPTFGQTEYKPDLERETVSIAEQL QAFADVIKAGKIRYLGLSNETPWGVSEFVHIAKQLELPKVISIQNAYNLLNRVFDSAL AEVSRYTNVGLLAYSPLGFGLLSGKYTDEKKPENTRLSLFQGFGQRYLKPNVNEAVAA YVSIARKYNLQPTQLALAFVRSRWFVSSTIIGATSLEQLQENIDSVNVILNQEILAEL DAVHTRYPNPAP" gene 25364..27325 /locus_tag="DP116_00150" CDS 25364..27325 /locus_tag="DP116_00150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412120.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S-layer protein" /protein_id="PRJNA477356:DP116_00150" /translation="MSKFRWNSRWILFLVSLLDMQIILVLNPAKSWADTPEYSSSTDS SISSAVRSTAIVPRGSAQRANAPEQYKASVLQSSNSVMDRFTLISQVPSSVPSLSPIS VPLHPSPGDAGVTPKTSVSELSDGKRSPTPVPANPTLKNDSSKLPNRDATIGQVNSVS QLSDVKPTDWAFAALQSLVERYGVIAGYPDQTFRGNRAMSRYEFAAGLNAALNRITEL LAAGTSDLVRKEDLVTLQTLQNQFGAELAQLRGRVDALEVQNANLEQRQFSTTTKLVG EVETVIGGVLTGNNVVTKRPAPHVITFQDRVRLILNTSFSGTDQLRMTLQAGNIASLG GTRTGIFGTTDGRTSDNASPVYPNNDVYISGLRYQFKPFKSTQVNIFSQSDGAFEIGL SGPINPYFEGSAANAISRFARRNMVYDYGDTGPGIAILQQFGKQWQLGLAYTAINGDN PTPNNGLFSGRYVALGQLSYSTPSQDFRVALTYANTYSPPNTIGQTGTNFGPVIGSNL ANSTVAGRGTVGNLYGIQALYKISPKFALNGWVGYSAHRYLGVGDGQVWDWAVGLAFP DLFQKGSLGGLFVGMEPKLTALSKNVDLGAGLGQVDKDTSLHVEAFYQYQLNDYIAIT PGFIWITAPNSDADNPGSVVGWLRTTFKF" gene complement(27561..27884) /locus_tag="DP116_00155" CDS complement(27561..27884) /locus_tag="DP116_00155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490862.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00155" /translation="MSTANDTFNYWFPLLKEVLPSSLLEQVKKTQVTTFLVNRTQPKR MFEKSKTVCFTTLTTEKRLRENLGFGICDHAPESHLRLKQTLFKHPLIYFNNSKTKSY VLSKK" gene 28007..28372 /locus_tag="DP116_00160" CDS 28007..28372 /locus_tag="DP116_00160" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00160" /translation="MTHNNPEDQMQSNLLAFASAGISVLFEVIKDPFDEIRGHLTGSI LRSWITENRKTSEQSSGCVVSYYPAEQKVEAFIVDEKNESLFTDKGRKIAVVFKAKSL DLELQQLFANQKIVLIPFD" gene 28386..28682 /locus_tag="DP116_00165" CDS 28386..28682 /locus_tag="DP116_00165" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00165" /translation="MFDPLTISLAIAALAVASVVTYLTVTVIRNYLRERRTNQNVNAK PALMVDRLNNGDYSVVTGFFDGNTKVLDSKVWNAQKLDEDLQKFPISKPVIIES" gene 28679..29740 /locus_tag="DP116_00170" CDS 28679..29740 /locus_tag="DP116_00170" /inference="COORDINATES: protein motif:HMM:PF00004.27" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00170" /translation="MKVIIEGVENSSQKEKLRENVAKRIAEVWNVCRENSKIREIIIN CNPGSSNAEHNNGQEKLATFDHLLRMPNNNASILVLPSKQQELINSQLKRWDLIQLVQ KKWGVDKIDPFPRVSLNFVGPPGTGKTLTAHNFASRLNKKIIEVSYADIVSKYFGEAA KNLSALFEFAKANDAVLFIDEAETLLSKRNAAASEGADHAVNSMRSQLLLLIQNTPII CIFASNLVEGYDPAFLSRLTRIDFPLPDENLSERIWETHLIQELPLDSSITANYLATK FKGLTGRQIRQIVIEAAYRAASREHADQTLCPEDFSWAHDLVCSNETVSYSVATGFLE NDKKIVNSKVSDAERLDWE" gene 29730..30488 /locus_tag="DP116_00175" CDS 29730..30488 /locus_tag="DP116_00175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867799.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NADPH-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_00175" /translation="MGNDCISSLLSHHSIRAYLPDTLPPGTLETLVAAAQSASTSSNL QTWSVVAVEDANRKQKLSQLASNQAHIRSCPLFLVWLADLARLTHIAESRGLPHEGLD YLEMFLMAAIDAALAAQNAVVAAESLGLGTVYIGALRNHPESVAEVLDLPPHVFAVFG LCVGYADSTVETAIKPRLPQRAVLHRETYKLTEQDESITFYNQVMKAFYNSQQMNVPG DWTEHSAKRVAFAESLSGRDRLREALKNLGFELR" gene 30966..31886 /gene="ssuB" /locus_tag="DP116_00180" CDS 30966..31886 /gene="ssuB" /locus_tag="DP116_00180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213448.1" /note="part of the ABC type transport system SsuABC for aliphatic sulfonates; with SsuA being the periplasmic substrate-binding subunit, SsuB the ATP-binding subunit and SsuC the permease; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aliphatic sulfonate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_00180" /translation="MKSETRGMHLKLEGLRKSFGKKTVLQDIDLDIQPGEFVAIVGRS GCGKSTMLRLVAGLDSPSGGSVLLDDKYSHHRINPSIRMMFQDARLLPWDRVLANVEL GLVGLNSKVYARQTALQVLRAVGLEDRANEWPAVLSGGQRQRVALARALASQPALLLL DEPLGALDALTRIEMQQLLENLWQEQGFTALLITHDVEEAVVLADRVILIENGQIGLD LEINLPRPRVRGDAVLALTVEKILRRVMGKQEQEVAAGRFTQGDAEAVANGVKHSSAV GNRLHELEPQHSSFLNSPTTEIPKHLEKVS" gene 32007..33191 /locus_tag="DP116_00185" CDS 32007..33191 /locus_tag="DP116_00185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308650.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="PRJNA477356:DP116_00185" /translation="MVQLLVKESKDWIGIASSLLQELSSTAVERDLKAGIPDLEIQRL RESGLLPLVVPKAYGGADGTWIEALKVVQELSQADGSIGQLYGNHLNLTALGHVSGTP EQKERYYRETAQNNLFWANAINTRDTRLKISPEGEDFRVNGVKSFGTGIASADLRVFS AVQDGVEFPLLFVIPKDRSGVVSNQDWDNIGQRRTDSNTFTFHNVLVKKDEILGYPHP PDSAFSTFLGIIAQLTKTYVYLGIAQGAFTAAKQYTTTITKPWITSGVDSATKDPYIQ HHYGELWTELQAAILLADQTAVKVQQAWEKDVKLTTEERGEVAIAVFSAKAFATKAGL NITNRIFEVMGTRSTGTKYGFDRYWRDLRTFTLHDPIDYKFKDIGNWVLNQEFPLITQ YS" gene 33694..34809 /locus_tag="DP116_00190" CDS 33694..34809 /locus_tag="DP116_00190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127547.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aliphatic sulfonates ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_00190" /translation="MQALKDKFESWKRGRITRRTALFALGYSLVLSSTLFSWGTPQNN TQQQATSSTPDAANAATKLISTSANKVVRIVRSKQLTALAVLEKQGNLEKRLQSLGYK VEWPEFAAGPQQLEALNANGLDIALTAESPPVFAQAAGTPLVYLAANSADGKSISLLV PTNSKVKSVKDLKGKKVAFQKASIGHYLLLRALEKEGLKLTDVQSVYLPPADASAAFS QGKVDAWFIWEPFVTRNEQNKIGRVLIDGSNGLRDTNNFFSTTRKFYQENPEVIKVFL DELQKAQVWSKEHPKEIAQLLAPVTQLDVPTLEKMHKKYDFSLVPITNKIITKQQEVA DKWYSLKLIPKKVNVRDGFLSPEEYAKITPKEVLAKQ" gene 34820..35020 /locus_tag="DP116_00195" /pseudo CDS 34820..35020 /locus_tag="DP116_00195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747227.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" gene 35129..35929 /locus_tag="DP116_00200" CDS 35129..35929 /locus_tag="DP116_00200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867804.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aliphatic sulfonate ABC transporter permease SsuC" /protein_id="PRJNA477356:DP116_00200" /translation="MKKYKFKSHFLKNRKIQSLIPWLVPLSIIILWQFFSSIGLIPIR ILPSPLSVVGAAINLAKTGELFRNIGISATRAISGFLLGGSIGFLLGLLNGISPTAEK LLDTSIQMLRNIPNLALIPLVILWFGIGDEARLFLVSLGVMFPIYLNTFHGIRSVDPG LIEMGKVYGLSTWGLFWRIILPGALSSILVGVRFSLGIMWLTLIVAETIAADSGIGYM AMNAREFMQTDVVVLSILLYALFGKLADVIARALENYWLQWNPNYSRS" gene 35954..36364 /gene="fosX" /locus_tag="DP116_00205" CDS 35954..36364 /gene="fosX" /locus_tag="DP116_00205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014794455.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FosX/FosE/FosI family fosfomycin resistance thiol transferase" /protein_id="PRJNA477356:DP116_00205" /translation="MIQGISHITFIVRDLEKMTKFLVSIFHAKEIYSSGEQTFSISKE KFFLINGLWIAIMEGESMPEKTYNHVAFKITEEDYEFYAARVRSLGVDVKEGRTRVEG EGRSLYFYDYDNHLFELHTGTLNQRLQTYQDLSI" gene complement(36516..37604) /locus_tag="DP116_00210" CDS complement(36516..37604) /locus_tag="DP116_00210" /EC_number="4.2.3.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994971.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chorismate synthase" /protein_id="PRJNA477356:DP116_00210" /translation="MGNTFGHLFRITTFGESHGGGVGVVIDGCPPQLEISAEEIQVEL DRRRPGQSKITTPRKETDTCEILSGVFEGKTLGTPISILVRNKDTRSQDYDEMAVKYR PSHADATYDAKYGIRNWQGGGRSSARETIGRVAAGAIAKKILRQVANVEVIGYVKRIK DLEGGVDPNTVTLEQVESNIVRCPDAECAERMIELIEQTGRQGNSIGGVVECVARNVP KGLGEPVFDKLEADIAKGVMSLPASKGFEIGSGFAGTLLTGIEHNDEFYVDENGETRT VTNRSGGIQGGISNGENIILRVAFKPTATIRKEQKTVTNEGEETTLAGKGRHDPCVLP RAVPMVEAMVALVLCDHLLRHHGQCKVL" gene 37856..38836 /locus_tag="DP116_00215" CDS 37856..38836 /locus_tag="DP116_00215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139811.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase subunit CofG" /protein_id="PRJNA477356:DP116_00215" /translation="MPNSHSQKITYSPAYTVVPTYECFNRCAYCNFRTEPGKSEWLTI SQAEKLFKQLHNQDVCEILILSGEVHPLSPRRQAWFQRIYDLCELALAMGFLPHTNAG PLSFEEMQQLKKVNVSMGLMLEQLTPELLNTVHKHAPSKTPEVRLQQLQWAGELKIAF TTGLLLGIGETEQDWWKTLEAIAEIHQRYHHIQEVILQPHSPGHQQTFDAPPFNPHQL PKVISQARQILPPDISIQIPPNLIKDDEWLLACLEAGARDLGGIGPKDEVNPDYPHLP EQELREILHPAGWELVPRLPVYPQYDHWLSVQLQTNIKRWRTFFRQLTVL" gene 38864..40189 /locus_tag="DP116_00220" CDS 38864..40189 /locus_tag="DP116_00220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013190943.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_00220" /translation="MLDTQELLTNLYNAFNPFEPLPAGDPKYVDCQDVRGDVDILQEF GNRIQRADRKTCQLYSGHRGAGKSTELLRLKQYLENRKFYVVYFAADEEDIDSEDAQY TDILLACTRRLLKDLQQFGDASPVLNWLKERWQELKDLAQTPIEFENLKLEAQIAQFA KLTANLRAVPELRQQIRRKIDPHTVTLIKVLNEFLADAKSKLPNGYTQLAVIVDNLDR MVLVKDGENTNHEEIFLDRSEQLKALDCHLIYTAPISMLYSKRATDIRDIYGECLILP MIMVKTHKGEVYEPGLKKVKEVIRKRVRQIEQELPLENGIFDSQQTLERLCLMSGGHV RNLLLLTQNAIGRTEELPISEKAVRRAITQARDDYHRAVENHQWCLLAEVSRSKRIVN DDQYRSLMYNRCLLEYRYLDDDGEMQRWYDIHPLIQGISEFKEAVAKLP" gene 40186..42153 /locus_tag="DP116_00225" CDS 40186..42153 /locus_tag="DP116_00225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00225" /translation="MNPNLSDWESDWDDDLPPEPEEAYQDLVRALKRKSGFGLFFVQC TPAQSDRFIAKLPQDIPQKKIAALRLVEPIDNLYQPVAEFVKNKQVDILLIKGLEYSL YKYEQRTFGEITEGQFSNLTRVPPILNHLNQQRERFRDDFPFCFVFLLRSFSINYLIH RAPDFFDWRSGVFELPTTPEVVEQETSRLLLEGDSEKYFKLSLEKKIEKVLEIQDLLA AKHQTENSQVILLLELGNLLVAAKEYEAAITSYNQAVKCQLDLHEAWYNRGIALDNLE RYEEAIASYDQAVKFQPDDHEAWNSRGYALRNLEQYEEAIASYDQALKIKSDYHEAWY NRGYAQGNLERYEEAIASYKQAVKFQPDYHEAWYSLATALDDLGRYEEAIASYDQALK IKPDYHQAWYNLVTALYDLGRYEEAMAVDRQVWNNWGIALRDIESHQAAIASYQAVKI KPDLHEAWNNRGVVLGNLGRHQEAIVFYDQAVKSKPDDHEAWNNRGYALRNLGRYAEA IASYDQALKFKPDKHEAWNNRGIALLSLGRNEEAIASYDQALKFKPDKHEAWYNRGIA LRHLGRYTEAIASYDQALKIKPDKHEAWYNKARCYILQSNIEQAIENLEKAIHLNPDK CQDWAKNDSDFDSIREDERFLVLIQGQQTGD" gene complement(42360..42773) /locus_tag="DP116_00230" CDS complement(42360..42773) /locus_tag="DP116_00230" /inference="COORDINATES: protein motif:HMM:PF01850.19" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_00230" /translation="MIDKIFLDTNLWIYLYAKNPPEKYQKIERIIKEDLPLIQVSTQV LGEFFHVLTRKNFTSKIDAINIISNLISTFPVQAIDTPQVLKALEINGKYNYSYWDSL IIATALLSDCSMIYSEDMQHNQLVENKVRIINPFL" gene complement(42763..42999) /locus_tag="DP116_00235" CDS complement(42763..42999) /locus_tag="DP116_00235" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00235" /translation="MVKIVQATCTNGELVLSEKLNPELEGKTVQIMIFEQSESRETID SRETKIQEFLARVNKYSFEIPSDYKFNREEIYDR" gene complement(43082..43711) /locus_tag="DP116_00240" CDS complement(43082..43711) /locus_tag="DP116_00240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314496.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_00240" /translation="MVTTRIPSESRVVLRNISWQTFETMLAEMGEDRASRLTYDRGTL EIMTPLLPHEYWNRLIERLIFVLGEELNLEILPTGSTTLKREDLRRGAEPDSSYYIRN EARVRNKTEINLNNDPPPDLVVEVDLTSSSLDRFQIYASLGVPELWRYDEGVLHIYQL QQGEYVECNNSPTFAQLSLIEIPQFLEESQRIGVMGMTRNFRNWVRERI" gene 43869..45665 /gene="hflX" /locus_tag="DP116_00245" CDS 43869..45665 /gene="hflX" /locus_tag="DP116_00245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GTPase HflX" /protein_id="PRJNA477356:DP116_00245" /translation="MLIETIFGNLQGLKSSQLKQIQRLYHQRISGDRITTPEFSQRLA AISTEINQPVCAYLNRRGQVIRVGVGTPRQTQIPPLELPRYGAERLSGIRCIATQLKP EPPNEAALTAMALQRLDALVMLNITGTGFQRRGGGATGYVKEAYLAHLTPQDARALIS SSAMEKVSHLPPAGGENKEGKEQKNNLPYTSWSVSPPMSVDMLTNQDLMELVEGLEAE FRREFVAQDVDTDHDRVLIVGVMTDDTTAQQFQDTLEELARLVDTAGGEVLQMMRQKR SRIHPQTVVGEGKVQEIAVAAQTLGANLIVFDRDLSPTQVRNLELQIGVRVVDRTEVI LDIFAQRAQSRAGKLQVELAQLEYMLPRLAGRGQAMSRLGGGIGTRGPGETKLETERR GIGRRISRLQQEVNQLQAHRERLRQRRQHQEVPSVAIVGYTNAGKSTLLNTLTNAEVY TADQLFATLDPTTRRLVIPHPETDEPQEILVTDTVGFIHELPASLMDAFRATLEEVTE ADALLHLVDLSHPAWLSHIRSVREILSQMPITPGPALVVFNKIDEVDSKNLALAQEEF PLAVFISASERLGLETLRQRLGQLVEYAASYH" gene 45873..46736 /locus_tag="DP116_00250" CDS 45873..46736 /locus_tag="DP116_00250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744594.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00250" /translation="MRRDSIFYKLFQQSPSLLFELLANPPANADNYRFDSVAVKEPKF EIDGVFLPPDTTDAGIVYFSEVQFQKDERLYERLFAESLLYFYRNRVRYSDWQAVVIY PSRSTEQSDTHPYRALLNSEQVHRVYLDELGDIRQLPLLLALMVLTTVAEDQAPEQAR YLLTRTRQEQSQASSRAIIEMITTIMVYKFEQLSRAEVEVMLGITLEQTRVYQEIKEE GRQEGRQEGQQEATVKLIVRLLTKRLGQELSEEMQATISHLPLGVLENLSVALLDFTN LADLQAWLDAQ" gene complement(46788..47957) /locus_tag="DP116_00255" CDS complement(46788..47957) /locus_tag="DP116_00255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015026404.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkanesulfonate monooxygenase" /protein_id="PRJNA477356:DP116_00255" /translation="MFKGDVPRLSTIRTHQRVSEVGWFADLCGGDTDRLGKLDPSRRS NYEHCRDIVLTADALGYKNILLPTSYMLGQEVLPFAGAIASQLQQISLLTAIRTGEIH PPMLARHLSTLDHLLQGRLTVNIINSELPGLVEDPEYRYQRCKEVIQILQQAWTQEKI HHSGEIYRFSLPSYPVKPYQQHGGPLLYFGGLSPGSRDVCAQYCDVFLMWPDTEEGLF ESMQDLSKRAAAYGRRIDFGLRIHVIVRETEEEARLWAKTLISQLDAARGVQLKSRAQ DSRSVGVLKQDTLRTVADSDDYIEGNLWMGIGRARSGCGGALVGNPSQILERLHRYLD MGIRAFIFSGYPLIEESHYFAKLVLPHLPNIRLAELQERLPTGQPVTPLTTGEVR" gene 48403..48528 /locus_tag="DP116_00260" /pseudo CDS 48403..48528 /locus_tag="DP116_00260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015956922.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="phycobiliprotein lyase" gene 49004..49852 /locus_tag="DP116_00265" CDS 49004..49852 /locus_tag="DP116_00265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161131.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_00265" /translation="MPFGPASRLGVSLFEETPPLEWVPGLSEEEAQTLIKAVYRQVLG NAYVMESERLTVPESQFKRGELSVREFVRAVAKSDLYFSRFGDTPRYRFIELNFRHLL GRAPNSYDEMKAHSAILDAGDFEAEIDSYLDSDEYQNTFGENLVPYIRGYKTEALSHM IGFTHTFQLVRGASTSSLKADLAGKSPKLNSLVINATPTPVVPPGTTFRNPPVSSRVR LGVGASEEGKVYRIEVTGYRANAVNRVSKFRRSNQVYLVPFDKLSEEYQRIHRQGGVI ASITPV" gene 49952..50131 /locus_tag="DP116_00270" CDS 49952..50131 /locus_tag="DP116_00270" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00270" /translation="MPQIEDLGREILCRIPLGKQGRLPKGQTLLGVSLMGKRRACTLD TPLVRELGALSQTPE" gene 50321..51076 /locus_tag="DP116_00275" CDS 50321..51076 /locus_tag="DP116_00275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161132.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_00275" /translation="MATIAPIELWSTRDLEDVQAVIRAVYKQVLGNPHVMESERLVTA ESQLKDSTISVRDFVRAVGKSDFYRSRYFEACAPYRFIELNFKHFLGRPPQSQAEISE HIVRCVEKGYDAEIDSYIDSEEYQSAFGENVVPYNRGVKTEVGRSQVTYNRTFALDRG PSQISSAVKSSQLVYAVATNSPNKIKPADVNLGGSGEANKKKFKILVQGSKFDSPRRV STTEYIVPGDRMTPQIQRINRTGAKIVSITEIV" gene complement(51073..51381) /locus_tag="DP116_00280" CDS complement(51073..51381) /locus_tag="DP116_00280" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00280" /translation="MITFIKTVPKEHRPLGRFLCEDGVLRLFCLSTFGRCFLDVPSPQ IENLGREIGQRIVLGKQGRLPFGQTLLGVSPANTHLTARAKLSRIAKNMPVQFLKVDA " gene 51616..52380 /locus_tag="DP116_00285" CDS 51616..52380 /locus_tag="DP116_00285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_00285" /translation="MALWIADAESVELRPNTSEDDLQTIIRAVYRQVLGNAHVMESQR LTSAESLLRNGDITVRGFVRAIAQSELYRSLFFDTSSSYRFIELNFKHLLGRAPVDQT EISRHVLIYNEQGYEAEINSYIDSDEYIQSFGENVVPSSRGNRTQTGIKNVGFNRTFA LDRGFAAYDAAGKNAKLISDVGGNLPTKIKFPATGSGAYSNTGKRFRITVTKGSSNPR MNQGKVTFKVGYNQMSQKIQNIQKTGGKILSITEVA" gene 52416..52664 /locus_tag="DP116_00290" CDS 52416..52664 /locus_tag="DP116_00290" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00290" /translation="MLPLGFPYAGSPVGLSPCSLITVRCVFGNYSLRCSKITSAKALR IFSALWCVFKIGNVLGRKGVVTERFKITGEVANKYPKS" gene 52705..53247 /locus_tag="DP116_00295" CDS 52705..53247 /locus_tag="DP116_00295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019489893.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycobiliprotein lyase" /protein_id="PRJNA477356:DP116_00295" /translation="MDITQFVELSIGRWRSQRSAHHLAFSHFEAVQSVIDIIALSPDD PDVMTLCKSYNIDQSQIVSPFGMSWEGQSDWDEDAQMKGSSILVPVPDPNVPNRGKLL RDQGYAETIAAAGDYHLTEDGTFVLLTTYDRAAAEEKIWFANPNLRFRVSLIKTSGGS GVVTASFASEIRSSSSSVNS" gene 53317..53922 /locus_tag="DP116_00300" CDS 53317..53922 /locus_tag="DP116_00300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161136.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chorismate-binding protein" /protein_id="PRJNA477356:DP116_00300" /translation="MTSAKANDLTTLAQWMAGDFSNYKQSYHKPQQFAHIHIFFRPLP FEFFNAIGFYSEQVYDHDLWSPYRQGVHRLIDEGEQIYIENYSLNDPLLYAGAARELS ILRTITPDCIERRYHCSMIFKRQGEMFQGSVEPGNKCLIERKGCLTYLISDVELTATT WVSLDKGMDVNTHQQVWGSTFGALEFEKRESFAHELPEYRL" gene 53980..54285 /locus_tag="DP116_00305" CDS 53980..54285 /locus_tag="DP116_00305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197234.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CpeR family transcriptional regulator" /protein_id="PRJNA477356:DP116_00305" /translation="MLPPEAEKKMQCWIRSRHLICSGNFFVFESVDYSAVERFSECIA ALGGALLSIEPIGKIWMGDHRQVILYRAKASLHTPHHTLKQYWLKYGGFRTRFDERV" gene 54293..54628 /locus_tag="DP116_00310" /pseudo CDS 54293..54628 /locus_tag="DP116_00310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199293.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="XisI protein" gene 54654..55619 /locus_tag="DP116_00315" CDS 54654..55619 /locus_tag="DP116_00315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130127.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00315" /translation="MSDVNERTDFDSPWKEVLEAYFPQAMQFFFPETVALIDWERPYE FLDKEFQQIAREAEQGKRYADKLVKVWRTQGQELWLLVHVEIQAQPEENFAERMFTYS FRIFDRFHQPAVSLAILCDANRQWRPNSYSYSYPDTRLNFEFGTVKLLDYESRWTELE VSENPFATVVMAHLKTQQTRQQPQERKTWKFSLIRRLYDLGLQEQDIRNLYRFIDWVM ILPKALEAEFWEQFKQFEQERTMRYVTTGERIGYERGKQEGKQEQTQTLILRLLQRRV GELSLEVRSHIQSLTLSQLEELGEALLDFTAMEDLLNWLQANQSE" gene 55740..56567 /locus_tag="DP116_00320" CDS 55740..56567 /locus_tag="DP116_00320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00320" /translation="MIIWTPNQPLQNGRFIIQKVLGSGGFGITYSILEQRTGKLFVLK TLNHIQQIRKDFSERQVKFVQEMTRLARCTHPHIVKFEDVIQEDGLWGMLMEYIDGVD LKTYVDEGGQLSEDEALRYINQIGQALEYVHQQGFLHRDIKPHNIILRRGKQEAVLID FGLARQFSTGEKSISMTSDGTEGYAPIEQYRRKGNFGAYTDVYALAATLYFLLTADAL KAADEEIVSDLRRKYEDEELPPPKQFNPEISHRVNEAILKGMALEPQDRVQTVRKWL" gene 56574..57221 /locus_tag="DP116_00325" CDS 56574..57221 /locus_tag="DP116_00325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357564.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00325" /translation="MPRKVNPPIPQVSISNPQPKIQNQQSIYIATPVVSSKQSSQGIL KWLEGIIPKNSKSDDEIRLITAKMDYTQLRDLLTAGNWKQADEETRRVMLAVARREKE GWLNGEDIDHFPCEDLRTIDQLWVKYSNGRFGFSVQKGIYQSLGGTREYDTKIWEAFA DAVGWRLVFGEEEWWSGDIFYDYRITPKGHLPVGGLWGWLARHPVLCLLLSRRDL" gene 57377..57793 /locus_tag="DP116_00330" CDS 57377..57793 /locus_tag="DP116_00330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409463.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty-acid oxidation protein subunit alpha" /protein_id="PRJNA477356:DP116_00330" /translation="MPAKDIFHNAVKHALEKDGWLITDDPIYLDFGGVEIYIDLGAEK IIAAEREGEKIAVEVKSFIGGSAISQFHTALGQFINYRTALNQEQPERELFLAVPNIT YETFFKLELVQIVIQSQNLKLLIYEPEQEVIERWIR" gene 57781..58116 /locus_tag="DP116_00335" CDS 57781..58116 /locus_tag="DP116_00335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199293.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_00335" /translation="MDKIVQYREIIKRLITDYVNEASTRDDVERQMIFDTEHHHYQLV NVGWRNRHRVYGCVLHFDIKNDKIWLQYNGTEIDFAEELIKQGVPKEDVVLGFHSPFM RQFTEYAVG" gene 58311..58547 /locus_tag="DP116_00340" CDS 58311..58547 /locus_tag="DP116_00340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875934.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00340" /translation="MLKSVEGVFKNGAIELSEVPSDVVESRVIVTFLEAKPVQFTPQI MYFGMFADSNQQSTEEDFKIAEFHGDFGDELDWS" gene 58554..58952 /locus_tag="DP116_00345" CDS 58554..58952 /locus_tag="DP116_00345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950325.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="twitching motility protein PilT" /protein_id="PRJNA477356:DP116_00345" /translation="MKYVIDTHALIWFLEGNPRLGSNAKTILSNPESQLIIPATALAE AVWIVERGRTSISSAIALLSAVNADTRIVVYPLDTNVIQQTINLSAIAEMHDRQIVAT AIVLVNQGETVVLLTCDQNITASGLVTIIW" gene complement(58975..59361) /locus_tag="DP116_00350" /pseudo CDS complement(58975..59361) /locus_tag="DP116_00350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128655.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" gene complement(59351..59593) /locus_tag="DP116_00355" CDS complement(59351..59593) /locus_tag="DP116_00355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310170.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00355" /translation="MKAQEVMGSVDENGTLCLDEPLTVQKHSRVKVIVLFVEDEMDED DEPKESVLNSLRTSLQEAKVGKTKPVSELWDGIDAE" gene 59744..61178 /locus_tag="DP116_00360" /pseudo CDS 59744..61178 /locus_tag="DP116_00360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999080.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" gene complement(61234..61851) /locus_tag="DP116_00365" CDS complement(61234..61851) /locus_tag="DP116_00365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007303712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_00365" /translation="MTVDSLFEQLKHPNPHLRERAMFELAENRDENTIPRLMSVLNDE DVTLRRAAVKTLGVIGVDSVPSLVKSLLNSNNVTVRGSCAKALAQIAVNYPDVPFPTE GLQGLQTAINDPNPVVHIAAVMALGEVGSPAFDILDEALKTTDNVAVAVSVVNALAAS GDDRAIEVLKGLTNDESADSYVRESAVSALSRLEQVINLNARRQQ" gene complement(61848..63140) /locus_tag="DP116_00370" CDS complement(61848..63140) /locus_tag="DP116_00370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycocyanin operon protein Y" /protein_id="PRJNA477356:DP116_00370" /translation="MDKRFSNLFGLTEEQAISLLDTPLDQLAEDDSRYVAASELVNFA TERSINALIRAVQNTDPSLDNRIVRRKSVESLGKLQAKVALSAIRACLADEDCYTVEN AVWAIGEIGTQDSDILEEIAQLLEKPGQTYRVIIHTLAKLDYPPAVERIRKFVDATDK PTASAAISAVCRFTGDYSPIEKVLALLQHPNVYARRLCIQDLIDARYYAAIPTIAQCP VSLVFRLRGIRLLAEAGIPAGAIAFTDIQPSLEQVIFDHPRDLKLVHAYEQTPTLELL IRELYDTDFGRCYLATKTILEAYPDSAGEALMATYAAEAYNDYGAHYHVMKLLGWLKY APGYDLLVEALNNREPQFQKSRAGAAIALGEFGDTSAIPLLKAVLETKIWDLKYAALM ALEKLGDTNAHAIAANDQNWLIRAKATHTPASSNIINP" gene complement(63413..63907) /locus_tag="DP116_00375" CDS complement(63413..63907) /locus_tag="DP116_00375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132847.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bleomycin hydrolase" /protein_id="PRJNA477356:DP116_00375" /translation="MKSVVTTVIAAADAAGRFPSTSDLESVQGSIQRAGARLEAAEKL AGNLDNVAKEAYDASIKKYPYLNEPAEANGTQVKKDKCLRDIKHYMRLIQYSLVVGGT GPLDEWGIAGGREVYRALELPTAPYVEALRFARNRGCAPRDMSAQALVEYNNLLDYVI NSLS" gene complement(63981..64535) /locus_tag="DP116_00380" CDS complement(63981..64535) /locus_tag="DP116_00380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_045870829.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bleomycin hydrolase" /protein_id="PRJNA477356:DP116_00380" /translation="MLDSFSRAVVGADAKTGTLSTGEIGALRGFIAEGNKRLDAVNAI ASNASCIVSDAVAGMICENQGLIQAGGNCYPNRRMAACLRDGEIVLRYITYALLAGDA SVLDDRCLNGLKETYAALGVPTGSTVRAVQIMKASSLAHINDTNTEEYGGKRFRKMGS TQGDCSALTAEAASYFDRVISALS" gene complement(64866..65039) /locus_tag="DP116_00385" CDS complement(64866..65039) /locus_tag="DP116_00385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410163.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycobilisome degradation protein nblA" /protein_id="PRJNA477356:DP116_00385" /translation="MREIPIELSLEQEFSLKTYEQQVQGLNEKQAQGLLLEVLRQLMI KENVIKHLIAQIE" gene complement(65793..66068) /locus_tag="DP116_00390" CDS complement(65793..66068) /locus_tag="DP116_00390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861813.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1049 domain-containing protein" /protein_id="PRJNA477356:DP116_00390" /translation="MKTFANLLISIVIASWVLGVAILSVQNAEPVSLKLLTFQSIQIP VGIVLAFCAGIGIVGVALLQPLWGLAGSEQRNSRLEDETEFFADEEF" gene complement(66341..66521) /locus_tag="DP116_00395" /pseudo CDS complement(66341..66521) /locus_tag="DP116_00395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006279012.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" BASE COUNT 19572 a 14072 c 14147 g 18805 t 10 others ORIGIN 1 attattaatt cacaaacaaa caccggattc aattttttga taattaaatt tacatgaata 61 tcttcgattt tgctcattaa caattttaag gagaaagtca gttgttaaca tctactttac 121 tagcagccac tttcgctaca acacccttgc aatggtctcc tgtagttgga atcattatga 181 tcctgagcaa cattgctgcc attgccttcg gcaaatctac cattaagtat caaagcgttg 241 gtccagaact accttcaccc aacttgtttg gtggtttggg tttacctgca cttttggcaa 301 ctactgcctt tggtcatgtt ataggtactg gtattatttt agggctacat aatcttggta 361 ggctctagat tgttgccgaa cttggaattc gcttgattgt ttttttgcta cataaaagag 421 ccagataaga taactacctg gctctttcat gcttattggc ttcgcttgct tctatcctgc 481 tttgaggatc gagaccttgc atttgacgga taccattaat gtaggctagg caatcacagc 541 catcttaagc caaatctcgg tagactgata gttcatctac tctcatttca aaaggcattc 601 cttgactttt gggaggacag acgcgaattt ttgcactgct gactatttgc tgtagcaatg 661 cagccatcgt taccagtttc tgtaacacac gccaatttgt cacttgatct ggacattcaa 721 tgactaaagt caaagtactc acgtcggtgg tgacgtacca ccgacaatta gacaataaat 781 cttggatgtt gcgatcgcac ccttcataga agtatctgct gatagaatat tcaagttgct 841 gtcgcagcat aatatccgct gatgtcagtt ggacagggtg tgaatcttct tcgtgaaaaa 901 aatgcttcct cttcatctcc ttgtctctca attcctctca ctatttttca atgaccattg 961 catacataaa gtacgtcttt atttctaaaa attcttgatt ttcgtcatca tagaaagcac 1021 caataccact acactgaata ttccaatagc tgttgaatag atacatactt tgtcctaaaa 1081 aactagcaaa ttggattaca gttagatagt ttttgtagtt ggaaataaaa aataaaatga 1141 cagcactatc 
tctagcaaaa gcttgattca gacacaagtc agtgaccttg tgactaaaat 1201 cacctgcttt ttccaaacgc tttcgttaat ataaaccagg caatattcct ttcattcagt 1261 gaagaaccga ataaatctct atctcctcaa atttctctgt tggtattgat ttttctactg 1321 agtgtaagat attccagaaa atttttgcag agataacttt tttctgaaaa cgcctgattt 1381 cacgtctgtt ccaaacagtt tggtaaaatc ttccctggta taagtcaaat tgcggatact 1441 ctagttgttt ttgcaaactc ggagtacagt tgctttatag ccatcttcaa tcaattgatg 1501 agcttcaaaa taatctgttc taagaacgaa aggcactttc agccttaaac gtctgacttt 1561 cttgtcttga aattcataag agatggcata agcagtcaga actctttatt ctcaatgcct 1621 aaatcgccat tgagagcgag tttatcaaca tcaaaataat tgtaagtctc tattgtgtag 1681 ataagttgaa gcggcaatag cacctagatg atctccgtta tctaaaaagg aatacctcac 1741 actccttcat ttctattccc aactagattt ataataaacg caaagaatta aaaaaatgaa 1801 tccaatgata cgattattat atcaattttc aatctgcaaa ccagaatcaa gaaattcatg 1861 aatgagtagt agcagttagt cgcaaatagt agatgataaa aaccatctac tattgcctat 1921 aaacctataa aatgtcaaga tgagcagtag tattaattca tcgctaataa aacctatctt 1981 gataattatt ttcaaaaaat ctagaccgag ttaactagat aaatgaatag aatgattgac 2041 attgattgct gaggaaaaca ttttgcaata gttacaaact atctctatag atggaaggtg 2101 aggatattcc taaggtagaa gttttaacca atcacagtag ttatcttttc atcataagca 2161 atagagacct caaataaata tcatgtacaa tcacaaacga ttaatgaaaa aatttgttgg 2221 cggttttgta ggagttgctg gtataagtac cttcattgtt tttccaggaa tggtacaagc 2281 caatatcaac tctagttctt ctaacagttc taatgagtct cttgtttccc aaaatacgca 2341 acagacaacg agtccttctg gtagtactac gtcttctcct cgtaccacaa ctcctaatgg 2401 ttcgctcact gctttagata aggaatttat gactaaggcg gcgcaaagtg accaaacaga 2461 gatccagaca agtcaactgg cgctcaagcg gtctcaaaac aaagaggtga aagattttgc 2521 acagcgcatg attaaagaac atacagattc gagccaacaa ctgaagcaga tagctaagaa 2581 aaagagtttt acactaccaa aagatattgg tcaggaaaac aaagctttat tgacgaaatt 2641 gaccaagcta aatggtacaa attttgacca agcatatatg caagggcagg ttcaagctca 2701 taccaaaact ttggctaatt accaaaatta cctcagccag ggacaagacc cagatttgag 2761 tgcgtttgca aataagattg ctccaatcgt tgctgatcac caacaaatgg cacagaatat 2821 ggcaggagga tcgggaactt 
ctagctcagg aacttctggc tcaggaactt ctggtacaac 2881 cactcctggc tcaggaactt ctggttcagg aacttctggt tcaggaacga atggtacaac 2941 tactcctagc cgaggaactt ctggctcagg acgttaatca gcaataagca tatggtgggt 3001 actgtctatc tcatgatgaa tgggtagaca gtacctactg atttaccttg ccaaggtttt 3061 atgatcaaaa accttacttc tttcaagcaa caatatgaaa aatatgccaa tgtttctctt 3121 ctcctatagc tgtttttcca ggatgctctt cttcctgtaa catttctatt tcaaaacgtt 3181 gcagtaacgt ttctacttgc tgacgtgtat gagaattctt ggaagtataa attgcccaag 3241 aatcacgcct cccaaacaat tgtccacaga aacgaccacc agaacgtaat gatgaaataa 3301 ttttctccca taagctaggg aaagactccg gtggacaaaa tggtagtgaa aagctggcgt 3361 ttactaaatc tactgattct ggtagcatta catcttgaaa aagaacaacg cgggtttcta 3421 aaaattgacg attaatatct ggacgttcta gtaaacgcgc gatcgcttca gcttcaccat 3481 caagagccaa gactcgccaa cctccacgtc atagttctac tgtatctcgc ccttcgccgc 3541 agcccaaatc aactgcaaac cgtgattgtt ggacaggtgt ttcttgttgg ttaagacttt 3601 cagcgtcaaa acgtgctaaa gcttcaagta aggtatcccg tggtggacga cctataacag 3661 cgttataata tgcagaccaa tcgcgctcaa aaactttact ttcagccttg taattattta 3721 aatctgacat agtatataca agtttggaat ttgaaccact aacccataaa aatagaagta 3781 gatgttatct agaaatcatc tgctctttga ttgaaacaat actatagcaa tcttaagtga 3841 tttgtaaaac tcttctctct gtttcttggc gtccaaagcg gctgtagttt gcatgcacga 3901 aagggaaccg gagtaatacc gtttcacttt aaggttgata caaatagata gcacccgagc 3961 agggggaaag aattagagac caatcatttg tatcaggccc gatcgtgaaa tcgtataact 4021 cactggctcc ccaagggttg agtttaaacg tgtaaaaaac agaagttggc aagcttaagc 4081 agtgattaag atcacagata cgagaattgc cgttaactaa tgattttcta ggaacctaaa 4141 aagctttgtc ctctttccac tcaatcttga taacaattca tatgttatat aggttatcaa 4201 gttactagta attttactga agacaagatt tttttgagaa ggctaatatg aacatgatag 4261 ttgaagttgc aagagacagt agctggcttt ctcctgcaac ctcaatatct ggagcaatca 4321 aattctcagt ttgattttgg atgtgagatg agcaagtagg taaattttta aggaaaaagg 4381 taaagtctat tttatctttt tcctttttct tttttctgtg tgatttacta tctaaagtat 4441 caaatctaaa attgctttta tgagttgatg agtgaaagac aaactatcaa ccaccattgc 4501 agcgcagagc aacatcatat agaaaattga 
gtagagaaat agagaacgtg ctagttgttt 4561 atcatctgga ttgtgcaata gcacccaagc ttttttgatg aaaaaagttc ccagtataag 4621 ggcgatgcag gtgtaaacag cacctgtgac gtgaagcgga taggagagta aaaatgtagt 4681 ggggataaga accaaagagt acacccaaat ctggcgagtc gcagcaacat taccagcaac 4741 aacaggaagc attggtacac caactttggc ataatcatca cggatcatca atgctaaagc 4801 ccagaaatgg ggaggtgtcc ataaaaagac aattgcaaat agcagccatg ctgcccagct 4861 taaagtatct gttaccgcag cccaacctac caacgccggg attcccccag cagcaccacc 4921 aatgacgata tttgcaggat tatggcgttt gagtaagtga gtgtagacgc ccacataaaa 4981 aacaatacca gacattgtta gtagggcact cagcaggttg acaaacacag caagtagggt 5041 gaaggaaaga caagcaagtg cgatcgcaaa tattaaagca tgcgacggct gcacacgacc 5101 agaaaccaaa ggtcggtggc gtgtacgttc catttggtca tcaatgtcgc ggtcatagat 5161 acaattaata gtctgggcac ttgcagcagc aaaagtacca ccacaaacag tgacaagcag 5221 caatactggg ttgacgtttc cctcagatgc aatccacata ctactcgctg ttgtcatgag 5281 gaggaggggg ataatccgag gctttgtgag ttgatagtaa ctgtgaatga cttgtagaaa 5341 gttttgatgg tggcgtgtgg cggtgttctc aatcatgatg cgacgaattc ctcatttgtt 5401 cactcatcat caatgcggac ataaattctt tgagcaaatt cccgagttag tgtcaattgt 5461 ctttctccta ctctgatgac aaaacaagga aagtgctgtt ctacagtgat gaaagttcct 5521 ggtgttatcc ccattgatat cagttcttgg agagattttt cgtccttaat gttgcagaac 5581 ttgatgattc ctcgttctcc ttttttgagc aattctagtg aggaaccttt aacatgaaat 5641 ttagaaaaca ttaataaatt aaatcctaat attcagattc ccgactcctt agagaaatcg 5701 ggaatcttgt tgatcacaaa tgatttaata ttgctatagt cgctttgttt gaaggtttga 5761 gagacgtagt attctacgtc tctacagaaa taaggaaatt gatttcctta cggttgaaaa 5821 gactgcaaca agatatgaac tagaccaagt atggttcttg gttagtctta atcgtgcaat 5881 cagaacgtgg ataagtcaca caaagtagag caaaaccttt agacatttgt tcgtcatcta 5941 aaaagctttg atctgattga tcaattgtac cttcaacaac ttttccgaca cagctagagc 6001 aagcacctga cttacaagaa aaaggcaaat ctataccgtt ttcttcagct gcatctagaa 6061 tagttgtctc ttcatcaact tcaatcgtgg tatctagatc ttctttcttg ttgattaatc 6121 taacttggta tgttgccatt ttttgattct tctcacacgc ggtacagttc ggttttatga 6181 cttcaagggt gtaagaaagt caataattag gggtgtaagg 
gggtaagggg agccagtgcc 6241 gtgggcgggt tccccgactt gaggcacctg gcgttgtagg gctataaaag tcaaaagtaa 6301 aaatcacttt ttactttgac ttggtttagc tacaaggtgt acctgcggaa gaacagtcag 6361 cggttctaaa tgactttcca caaccgcagg tatcagtagc tttggggttg atgaatttaa 6421 acccgctttc catcactccc tcgataaaat cgataaccaa cccttctagc aagggggcac 6481 ttttggcatc aacataaaca agcaccttgc cttgctcgct gactaaatca tctggttggg 6541 gctgacttat gacttccatg ccatactcat agccactgca accaccatct ttgacagaga 6601 tgcggatacc tttagctggt gtattggttt cggaggtcgc accccgcagc aatgtccgca 6661 gacgaaattc tgctttttcc gtcaaagtaa ccgtcatcaa tcctcctaat tagcaaaaat 6721 tagttgttgt tgtgatttaa ggtgtgaatt tgttcagtgg tgatgagtac agccatagca 6781 atcagcgctt tcattttggc acaggtgaaa aaatgaagct tacacccagg caaaataagg 6841 caattaatgt tgctaacagt tgccagatct gatttacgat ttttgcactg acttgaaacg 6901 aagctatggc aaaccccaaa gcaatgaatg aagacaccaa cataacaatt atttgtttgc 6961 aacactggtg gtttccatga cttgagattg cgagtatctc caaggcgcgc tgttaccgca 7021 tacaggacaa gagcgatcgc ggtaagaacg gcgcttaaca aattccatcc gggttagatc 7081 aattgtcaaa agctgcgata acagaggttg actaaaccca gtgatgagct taatcgcctc 7141 caatgcagtc agacaagcta gtgttccaga aacagcaccc agaacagaaa agccacgcct 7201 atcccaatca ggtttttccg gaaataagca agataaacaa ggagtcacac caggaataat 7261 tgtcgtcaag taagcctcca tcccgtccat tgctgcttcc accataggct tacgccagcg 7321 tacgcaagct tcatttaaca gatcgcgctc cgtaaaatta tgagcgcagt caagagccat 7381 gtcagctgat tgcactaagg agtctacatt ttccggggtg acatattcat gaactgcttc 7441 cacttggaca tcaggattga tggcttggag agtttctttt gctttgaaca cccttggctt 7501 gtccacccaa tcgtgcgtca tcaggatctg acgattcatg tcatcaagcc gcaagtcacc 7561 accccggact aaaatcagcc gcccaacgcc agctactgca aggtatagcg ccgccgtacc 7621 gcctaatcct cctacacccg taaccagaac cgtcgctgac ttgaggcgct tctgtgctgt 7681 ttcgccaaaa ttaggaagca tcatttgacg acggtagcgt tccaactcgg taggcgttag 7741 gtctaccatt ttatacctcc taatctgatg tgatttctgt cagcgtgact acatttttcg 7801 gcttctggtt aaacactttg aacagctttt gttcttgggg tgttgattca agaaatacct 7861 gataggcttg ttccagagct tcgcgatatt tacttaatct atctgaaata 
ctgagatcag 7921 tataattttc atcaatttcc ttaagaaatt gagaaaactt tttcaaaata tgtagacgat 7981 tgacattcac gaacttttga tcgtagggga gttgaaaaaa ctctaaatac tcctctgcat 8041 cttcaagttt tttgaattta tcataatccc aggtcatttt tcctgctcca gcttttttag 8101 gtgttgctta agttcgtcta actgacgata aatttcataa gtttgggaag caacatccat 8161 cagttgttga taatctgttg gtaatccctc tgccaaatcg tgcagatcca ttttcatttg 8221 acctgcttta ctattcaggc gttttatctg agttttcagt tcctcaatag tagatgtttc 8281 ttcagcttgt gccatcaaca cctctttatt gcctcaaaaa tgaagagaag aattctttga 8341 ctcttcatta ttcatgattg ttttaaattt taccgacttc cgggaaacgc tgagccaaac 8401 tgacgccagt ttgtacaaac ttttctccct cttctgctaa tttttccaac gagtcaaagc 8461 caaagcgatg tgcatccctt aaagactttg caactaacaa aagacgacca gaaaaaacga 8521 gtgcccagcc aaatccttca tggcttaaat caaccacaac ttgagataaa agacccgttt 8581 ccttttcgat gcaggcagcg acagcacgat aaaatgtcat aattcgtgcc tgagtcatca 8641 ggtcaacttc accctcaatc gaaatttctc gcttcttttg tttgctgaca ataaatggtt 8701 tgagcagcaa atcatccgac caactacgat aagtcccgta actatcttga ccccggaact 8761 gttgcacgat cgccttgaga aaaggtgagt taatggtttc tgttgggcgt gttccgttaa 8821 cattattagt gaaactcata cggttgcttc ctcttccaat tcgtcttcaa aactttgggg 8881 ttttgcctgt aaggctttac gcagccaagg cggaggatta cccttaagag tttgcactaa 8941 tttgttgaga atctcagcaa tctcttcttc ttcagaacgc gccttgactg gagttacacc 9001 cttcttaatt aaccgagctg cggcactacc tccaattgct gcaacataaa caattttgca 9061 gtcagctagt gcctcaagtt tgggtgttac cttatcctca ttaccatctt gtttcaggtc 9121 accgtcaaaa gaaagagttt ctaaaaactc gtaacctttg tcagaaattt catacacatc 9181 aatttctttc gccgagccga aatgagcatt gatatgaact ttatcacttg ttgtaaaggc 9241 aatcttcatg cttaattctc ctctcatcaa gttatgagtg atgagttatg agtagtaaaa 9301 aagcatttac gacaactcat aattcataac tcataattta tttacaagtt aaatttcgga 9361 actcaaaaca ttaaaacttc cagagtgctg aatccttgct tcctcagcct gcaagaaaat 9421 gttcccaata tcaaacaaaa cctgcattgt acctcggtag ccaactttag taaacagacc 9481 attacccacg cggtcaaaaa taggaaagcc ttgacgataa agaggaattc ctaatcgatt 9541 tgcgacagca actgtgtggg agttaccaat gagcaggtca gaggtaggtg cgagttgctc 
9601 gaagtcttcc aaatcaccaa tggtgacact actgatagga agtttttcta acaaggggga 9661 gcgagtcgtc gtcaccgctg catgaatttc agcacccatt gatcttagaa aacaaactgt 9721 tgaccagagt aaatctggtt ccagcgctag agacactcgt ttgcacccaa aataaaagtg 9781 agtgtcgagc atggcatctt gcaactgacg gcgctggcgg cggtatttct ctggtacact 9841 ggtaccactc aggtctgcca atgcttgcaa gaaattatct actggttcta atccagtcag 9901 ttcaccaaac acctcatagg gaatgccaaa acgctcatgt aagatttgcg ccgcactccg 9961 catactctca ccgaaagcta aggtgaatac agaattacca atctcgccca gttctgttag 10021 acttgtccca catgctgtta tagcactgta agaatcttct aagtgaccat ctagtgaagc 10081 agaaagatca ggaacgacaa tcggcttcaa tccaaaagcc gtgataatct cttttatctc 10141 ctccacatct ccaggcgtga aagcagaact cggcaaaatc acgacttgtg tgggagaagg 10201 atcttgcttg tcgctttttt gggcaatttg cttaacgatg ctttcaacag cagcggcata 10261 cccatcctgc aatgtgccct taaaatcagg agacgagact aaaacaattg gtagatgatc 10321 tagttctggg tggcgatcgc gaacttcctt aagaaatcgc ggcatatcat cgcctctagt 10381 ttccgtcagt gcagtggtac acaaaccaat gatttctggc tttgacctct cagctaaggt 10441 aacaatagct tgttctacat tctcttcccc acccaagata gtgctcactt ccgtcatcgc 10501 tgtggtagaa aggggaatcg actcacaaaa atgcttcacc aacatggatt tggcaaaagc 10561 agtacagcct tgggaaccgt ggaatagagg tatcattccc ttcaagccca acaaagccaa 10621 agcagcaccc aaaggttgac tcagcttgaa agaattaaca gcaactgatg agtttgtcac 10681 agtcacgatc gccatcaatg cacctcctgt agaagagatg aggaagaaac tgcttgttca 10741 agagattcag caacttccca cggtgcgggt ttgcgtactt gctcccaaac tgggctataa 10801 acagcttcgt acagttctcg tgccatttcc agcattcctt catagcctgc atacggatga 10861 tggcgttctt ggttgatgtg taggaaagga atccgagctt tcaaagcggt gtattgatta 10921 cgacctccag caatcaacat atcagcgttc ttttccttga tgatccgcag cagctcttta 10981 ggactttctt gttctagcat gatgccatcc tgaccgagca acttcttgat tcgggctttg 11041 tcttcctcag tactctttcg tgtactggta gcaacaactt ccatccccaa gtccttagct 11101 gccgagataa tcgaccaact cttaacacca cctgtataaa gaacaacgcg tttgcctttg 11161 agtttctcgc gataaggagc caaagcgaca tccaaagcag cagtttcttc agcaatcagc 11221 ttttctgtac gctcttgcaa atcgggatca cctaactttg cagcaacgtt 
ccgcagacaa 11281 cggttcatgt cttctacacc gtagaaagac tcttcaatgt aggggatgcc atagcgctct 11341 tccatctttg tcgccatatt aagcagcgct tttgagcaaa tgatcacgtt gagcttggca 11401 cggtgagcgt aacagatttc gttgtaacga ccatcacctg tcattttcgc caaaactcga 11461 atacctaatt tctcaaacag tcgcacaaca ttccacattt ccccggcaac attgtactca 11521 ccgatgaggt taatgtcata gggtgtggtg aattcaggtt cggctgtacc cacaacgtaa 11581 tcaagcaaag cttcaccagc aaggcggtta ccaaaattct tactaccgat gaatcctggt 11641 gcattgacag ggataacagg agttcctgtt ttcttggcag cagctttaca aacggcatct 11701 atatcatcac caattaaggc ggtgacacag gtggagtaga caaaaactgc agttggtttg 11761 tagcgatttt cgacttctag aattgctttg tacagctttt tttcgccacc aaagatgaca 11821 tcggtttcat ccaaatcagt ggtaaagcct gttttgtaca gcataggacc agaggaaaga 11881 ctaccacgac ttccccaaga gttaccagaa caggcgatcg gtccgtggac taaatgagca 11941 gcatccgtaa tgggtactag agcaatcatt gcaccatcaa aggcacaacc cccttgagca 12001 gccccaggct tgggttgctg cgtgcaagac ttgtttttct tgtctccatg cttgctttta 12061 ttgtgttcac acccaggctg agtgagcaac tcgttaattt tattttgggt gattttcatt 12121 tattcgtaca ggatagagga catgaattat tagctgttag ttgtcatttg ttcttttggc 12181 actaaccact aacgaacaaa tcatgactag caaatatatt tcgtaacatc ctctccacac 12241 tcatcagtta gataagataa tgctcgaaaa cgcaatccta caaaattgtc gtacaaggga 12301 ttgagtttac acagtggagg aatatggaag gtatgtccaa ataatttaat acttcgttca 12361 aaggggcaac agcaaggaat tatttggcaa ataagatgag ccagtcgagc attttttact 12421 tgaattccat ccaccagacg gcgcaaagga tataaaatct caaatgaacc tgattttttt 12481 gaagtaggtg aatggtgatt ctggtgatga tgagtgtgag ttatagtttt catattaaat 12541 tactccataa ctcaagaaaa aggtaaaagg aattgggtag agagatgagg gattgctcgt 12601 gggctcattt tagtttttag ttgttaattg ttagttcttt ttaatcaaaa ataaccgcta 12661 actactaact actaaccact aaccaatgcc ttcatccttc tgccttttcc ttttcctttg 12721 ttagcacgtc gcaggtgaat gtaattccta acgaatcaag tcgtaggaaa tatctgtctt 12781 agaaggaatg ttggtgtgtt gatcgatatc ctcgaagatt gtattcacaa cccagttaag 12841 caggttgatc acaccttgat aaccgatggt ggagtagcgg tgcaggtggt gacgatccat 12901 gatgggatag ccaattctca cgaggggaac 
cttgcaatcg cgccacaggt acttaccgta 12961 ggtattaccg atgagtaagt ctacaggctc ggtgaacaac agagaaccaa tgcgttgact 13021 atcaggtaaa atcaaaatat cgcttcgatg gtttgctcta tggacatttg ggcttatgct 13081 agggtttcga cggatgaaca aaatgaagat gagggcgcgt taatcaaaca aatgcgtcgg 13141 ttgcgtagtg ctggggctac acagatatat tacgacgtgg aaagtcgtac tagtgataag 13201 cgcaaaggac ttttgcagtt gatttcggat atcaacgcat ctgcaccagg gaaagtgtcg 13261 aagttgctat ttattcggat tgatcgttta acatcatcat caatcatctt ttatcgtttg 13321 attgatgcgt tgaacaacaa aggtatacaa ccgattgctc tggacgaacc gttcgacatg 13381 tcgagtattg ggggtgagtt aacaatcgat gtgagactgg ctgcatccaa atatgaggtg 13441 aaaatgctgg ggatgcgagt taagaaagag cgcgataccc gcaaagcaaa taagaaaccc 13501 cactggaatg caccgctggg gtatgtggtt gatggggata gatataaacg agatgatcgc 13561 ccgtgtgttt gtttgattga gaataaagtt gagttcacgc gatcgcaatt aatgcgcttt 13621 gtatttgata cttttctaca tgtaggctca gtttcgcaga caactagacg attgcacgat 13681 gtttttggta ttcaggcaaa tgcattaccg aagcccagca acgaagatat taatctgctg 13741 ggagaagagg atgagattat attagaaaat atcaacaaat cccgtggtaa tgggctaaat 13801 cttcggtatc cccacacagg tttgaaatgg tctgtttctg gattgcgttc tattctcgta 13861 aatcctgtct acgcaggggg gacgctttac aacactgtgg ttcgccccaa aggacacagg 13921 aaaccattcg acgaatggga agtgacatgg ggaactcatg aggacgaagc gattattacc 13981 catgaagaac acgaacgcat taagtcaatg atcaggagta accgtaaaaa tcgatgggcg 14041 accgagcaaa aatacactaa cccattcgca aacctagtta agtgtgcaca ttgtggtgct 14101 gcttacagcc gccagtgcaa gaaattggtt aagaaaaaga attttgtcag acaccactac 14161 caatgcagtt tttaccgcac aggggcttgt cacaacaagc gaatgatttc ttctgatgaa 14221 ctggagaagc aagtgatttc gcatctagta cgtgaagccg aacgtctagc gactttagta 14281 gaacaagagg tgaaaataac agaggaacct ccagagatta agacactccg agcatcttta 14341 aagacaatag acgctttacc atcaaatcca gcaattgaga aagcgaaagt tgatattaga 14401 atgcagattg catcggcgat cgcaacaaca gacaacaatt ctaaacaata tttaatcgcc 14461 aaagaacgga ttatcagcgc ttttacaaac tccagatact ggcaagggct aaaatcggag 14521 gataaacaag ctctacttca agggtgcatc aagaaaatag tggtagatgc caacacagtc 14581 actgcggtag 
aattattgca cttgcactaa gcggctttgg acttgcggcg tttggttttt 14641 ttgccctttt gcggttgcag ctgcgagact tgttgctgta gctgtgtctc gcgttcctca 14701 aactcaatca cggtacgggc gactaactct gctaaggttt cacgtttcat ggttgatttt 14761 tatcgcttct ttgacgataa ctggcaggtg aaaaacttct gttagtatta agctttttaa 14821 attagtaaaa agcattaacg cccctaaaaa cctgtctggg caaggatttt atacccctgg 14881 taacagcttc cgtctcatta ctccaaaaaa tctgatagac aaacggtgta agctgtttat 14941 atgatacgta aagttactga tcaactgaag ctaactgcta caaatgctca agtcaaaaag 15001 ccttgttgaa taaccatctt ggcttcatct ttttctttca tgaaagctaa tctttactgt 15061 aatatttatg acgaaatatc acatcgtact ccatcactct attgaccttg agaaaattac 15121 tcacgaagcc gagaacaata aacgtcctag gcatgtaatg tggaccatta agcaaagact 15181 gaatgccacc attcatacac cacaaggata tcaaagttct tttgcagaca aacttctttc 15241 caaaatagca ggttcgccag aacattgggc aatggcacgt gctttatcat ctcgacttga 15301 atcggatgac atagtattct gcaatagtga ggctggaggt attcagttag caacactttg 15361 tgggttcaag cgcaaccgtc ccaaaattgt agtgtttttt cacaatattg atcgacctcg 15421 tggacgctta gcactaaagt tatttaactt ggctaacaaa atcgatttgt ttttagcctg 15481 tagcaaccac caagtgactt ttttacgcga ttacttaaat ctacctgcat ctcgtgttca 15541 gtttgtttgg gagcaaaccg atctcaaatt ctttgcaccc tcgcctgtct caccgaaaaa 15601 gtcgcgtcca atgatcgcta gtgttggctt agaacaacgt gattacaaaa ctcttgcgga 15661 agcaacagca aatctagatg ttgatgtgaa aatcagcggt tactccagcg atgcacttgt 15721 actaaaacga gcttttcctg aaaaattacc tgataatatg tcacgccaat tttacgaatg 15781 gccggagtta gttcagctat atcagaatgc agatattatt gtggttagcg tctttgaaag 15841 taattgtggt gcaggaatac aggcattgat ggaagcaatg gcgtgtaggc gtcctgtgat 15901 agtcactcgc actcttggac tcgatcaata tatagcagga actaatgcgg taatgatggt 15961 taagccacaa gatgttgaag gcttaagaca agctatcatc tatctgttaa ataaccctca 16021 agaagctgat gctttagctg aacgtggtta tcggttagct ctaagtcggc atgatagtga 16081 gcagtacgta aattacgtca caaaagagct taaacagttt gaagaaggag tcataactca 16141 ggagtcatat caaactgtaa tctaaacctt aacaagaacg caccaacgca caacgcagta 16201 gtcagaaagc ttcgactcat ctcgactacg ctcaatgtcc atcgctcagc tttaatattc 
16261 tcacggatgg aatccgaaga aaagttccgt gcgcgggtga gcaaggcagt gcgaagggaa 16321 acgttgtgcc aacttgtagc acctgctctc gcgcccttgg aaacttcggg gggatttaat 16381 atcgaattac tccgacgcaa gcggaaatga tatgatacga cacgattaaa gccaagctcg 16441 ctcgtactga acaggtgaaa atttttcctg tcgcagtttt ttggcagcag ataacgccca 16501 gtaaggattg cgtagcatct ctcgtcctat taaaacaata tctgcctgac ctgtacgaat 16561 aatctgatca gcctgttctg gggatgtaat taaaccgaca gcacctgtga ggatatttgc 16621 ctcgtggcgg attctcgcag caaactcagt ttgataacca gcgccaaagg gaattttggc 16681 gttgggtaca gaaccaccgc tagaagtatc aattaaatct acacctaatg aattcagttt 16741 ctctgctaag gcgatactct gctcaatatc ccagccattt tctacccaat ctgttgcaga 16801 aattcttacc cacaagggat aatctgatgg taagatttct cgcaccgctt gaacgacttc 16861 tcttagcaaa cgagtgcgat tttcaaaact accgccatat tcatcgttgc gtttattact 16921 gaggggtgag agaaattggt gtaagagata accatgagcc gcatggattt caatgacttt 16981 aaacccagct tcttgtgaac gttgagccgc ttgtacgaag tcatttataa tctgttggat 17041 gccttctgtg tttaatgctt cgggaatagg gtggttttcg ctaaaaggaa tagcactact 17101 agaaagtaca ggatgccaac caccgttgga tgtgtctaat acttgtcctc cttcccaagg 17161 tggtgcagta ctggctttcc taccagcatg agcaagttga atacctgcaa tcgccccaaa 17221 gttatgaatt gccctaacaa tttttgccaa tggctcaata tgctcatcca accagatacc 17281 caaatcttgg ggactaatcc gtcctcttgg ttcgatggct gctgcttcag tgaaaactaa 17341 acccgcacca ccaacagcac gacttgccag atgaatcaga tgccaatcat tagcataacc 17401 atctgtactg gaatattgac acattggcga aactgcgatg cggttgcgaa aagttactgc 17461 gccaattgtt aggggtgtaa ataaatctgt catggtaagg gtgaaatctc actaaattga 17521 gtcgtgtgta cgagaacgga ttgctcctga gaatttatgt tgtctgatca aaactttagg 17581 tgtcatctaa ccgcaggact gttgtagaga ttctctcata gaatcgtaaa gttggttaga 17641 cattttggct cgattcaatg tgtataatat gcacgaataa aaaaatagta ggtcatcgca 17701 aaactcacga tgataataaa acgcttcact aaatcgggct gaagtttacg ggcatagtat 17761 gcgccaccgt aaccgccaac cactgtccct aacatcataa aaactgctgg ctgccatgcg 17821 acgactccgg caataacata ggtgacaaca gcaaagccac ttgtacaact catcaacagc 17881 atcttcaggg cattcatctt gtgaatattt ttcattccta 
acatctccat cgtcgctaac 17941 atcaacatag cgacacctgc accgtaaaat cccccataaa tggcaagaaa aaattggatg 18001 actgaggctg tgagtagcga tcgcctagag tcttctaaat cactatgtaa ccaagttgta 18061 atctttccgc cgaaagtaaa tagaagtgta gataaaagta aaaggtaagg aatgaggcga 18121 tcaaagatgt ctgtcggggt gtaaagtagt aacaaagccc caatcattcc acccacgagg 18181 ctgatactgc ctagtacaag agagatacgc tgctgcccgc tcaactcatg acgataagca 18241 gcaacactca cagcgcatcc taaccatcct gctgtcatac tggtagcatt cgcattgatg 18301 gacggtaagc caataaaaat gagtgctgga aataaaagaa agccgccacc actagcaaca 18361 gaagtcaatg ttccacctaa cacagcagcg acgaagagaa aacaagcttg ggcggtcgtc 18421 aaattcacct cctcgcacac aaaaagcgat aaccgacatc aatttctgat aaatttctga 18481 tttctccata tttcgcattc cattttccag ttgacaaatc ttctttaagt ctgataactc 18541 ctttgtgcat gttcgacaga ttgagaacgc attaccgatg aaggttcgac agcgtaaact 18601 gaatattctc gttcagcgtt cgcgaagcgt ctccgcagga gattcgctcg actataacct 18661 cctgttccag caccaatatc agcaataata cttttctttg gtaggttgag taaatcaatt 18721 agagaattta caattctcaa atcaggaaga cgcgaattag agtagacttt gccaatggaa 18781 ttgtagatag tcattattag taaccagcat ttttatccac gacatttcgc aggggctttc 18841 cgtggcgata gcgtgttaaa ttatcgagga aaagtgcaag cgttcgctct tttactttgg 18901 atgaatgagc agaacagtgg ggtgtaataa aaacattcgg taacgtccaa aatggactgt 18961 ctggcggcag tggttccgta aacattgcat ctaaagctgc accagcaatc caactttcct 19021 gcaacgcttt taacagtgca ggttcatcca caattttgcc acgagcaata tttatcagat 19081 aagcatcggg acgcattgat tgcagtactt ctgcattaat caggccctcg gtttcctttg 19141 tgagtggtgt agcaataact acgtattctg cttccgaaag aagtgctttc cattcatttg 19201 tacccactac tttgtcaaag ttcgctaatg gttgcgggtg acgacggctt ccaaaaatcc 19261 gcatcccaaa agctttggca cgggcagcaa tttcttgccc aattccccct gtaccaataa 19321 ttaataatgt cttatcttgc agttcttcaa tttgaaagtc tctttgccat tgctgttgag 19381 tttgtagctt gtgtaatttt agcaattgct tggcataaga caacatataa gcaatgacaa 19441 attcagcaat gggaatcgca tgaactcctg caccgttagt caggatgatg tcacgttcta 19501 ggtatttttg tgtcaggata ttatttacac ccgcattcgg tgcatgatgc caacgcagtg 19561 caggtgcaga ttctaggact 
ttatgcaaag ttgtgggttt aagataataa agccaactga 19621 aatatatttc agcttcttta gcatcaccgt ccagatttcc atcaatatct gcccgaacaa 19681 atgttgtatc ggtaggtaga tgcggcttaa gctcatcagc agcttctatc ggtataatta 19741 gtttcatgat aattgttaat agcccgacgc caaacgatat tagtagtcta attcagcttt 19801 cttgcattag taaccaacat ttttgttcac aacgttacgt agtttttctt tgttccgata 19861 acttgtcaaa ttgtcgatga ataacttcac aatgcggcta cgaacttgtg gtgtcagtgc 19921 tgaacaatga ggtgtgatga atgcattggg caaagaccaa aagggacttt ctggtggaag 19981 tggttctgac tcaaaaatgt ctaatccagc accagcaatc caaccttcac gtagagcagt 20041 cagcaaggca ttttcatcta caattgcacc acgagcaata ttaatcagat aggcagtggg 20101 acgcattaac cgtaatgtct ccgcattaat caagccttta gtttccggtg ttaaaggtgt 20161 ggcaatgact atataatctg cttccggaag aagtgctttc cattcattcg caccgactat 20221 tttatcgaaa ttcgctaatg gttcaggatt tcggcgacta ccccaaattt gcattccgaa 20281 agctttagcg cggagagcaa tttcttgacc aatatttcct gttccaataa gaagtaaagt 20341 tttctcgtac aattcctgta actctaacca cttgttccag tagtgattag tttgtaaatc 20401 ttgcaattcc cggacatttt tggcatgata aagcatgaag ttcagtacaa attccgctat 20461 tggaatagca tgaactcctg agctattggt aagaatgata tcgtgtgaca aaaatgtggg 20521 tgtgagaata tgattcacac cggaactagg tgtatgctgc caacgaattg taggtgctgc 20581 tgctaaaact ttgtgcaggg tagtattttt caccctgaat ccgttgagat aaacttctgc 20641 tccactagga tcaccatcaa aattaccatc aggatctaca cgtacagatg taatatctga 20701 gggtaacaat ggctcaatat catcagcaag ttctacaggt aaaataactt tcataacaga 20761 caaagagtaa tatgatcaca caagaaagtc taaaaatccc ttgtgccctt tctctaatta 20821 aagttcactt ttactgttgt ggcacaatcc aatcattgtc atcatgactg atgcaagaaa 20881 aaccattaag agaaaaatct gagcggaagt ttccgctcag aactttgaga gttcactaag 20941 gaagataaaa cagaagcgag aaagctactt caactgctta agtattagat gagttgaatc 21001 ttgttttgct gtgttagtgc gatcgcgtcc ccaaccgatg ttattgcggt agctgcccaa 21061 aagtccagtt tctttaagtg atttttcgtc cactgactcc aggggttcaa cttgcggtgc 21121 tacatacaaa gcatcgacag gacaatacaa ttcgcacatg aaacaggttt gacaatcact 21181 ctgtctggca ataataggtg gtgcatctgg cactttgtca aagacgtttg tcgggcaaac 21241 
gttaacgcaa agattacatt caatgcaccg cgactcactc actaactcaa tcatgtcgat 21301 tttggattgg ggataagatc ttgggaaaac gcgatttgca aaagcctctt tattgacttt 21361 cttctttgcg tgagtttgaa atcctgagat tatgcagccg cactcacttt gacttttgtc 21421 tggtgtagag gtgttgcttt cacccagact tgatccaaac caccactggt taaatagtgt 21481 tgttggtttg catccatctc tggatagtct aagtgtttgt gcataccgcg agtctcttta 21541 cgttgtaggg cgctgctgta catccatcgg gctgtggcta ccattgcagc ggcttccctc 21601 acccggacaa tgttgtgagt gtctacttga ctagtgcgaa tctctttcca gaggtgattt 21661 aatctgtgca atgattctgt caagcctttt tcgcggcgga aataattgcg atcgtagggg 21721 aagacttcag cctgggttgc ttggataact tcttcggtgt tgatttggtg agcagcactg 21781 caggcgggtt tcccgccgta ggtgactgac gaacccgaag ggtgttgact accatttgtg 21841 aatcccgctt caccgactgc ttgaataaaa cgctgtgttc cccactcagc gaacttctgg 21901 gcatattcag ccgcagattt tccagcccag tatccagaag atatcgccca agctgcattg 21961 tgactaccgc caccagtaaa cccaccacaa attaattctc gcgtagcagc atcgccagcc 22021 gcataaagtc cctgaactga tgtggcgcag gtaaaatcga caatccgaat gccacccgta 22081 ccgcgcactg ttccttctaa gcgcaaagta acagggaagc gttgggtaaa gggatcgata 22141 cctgcgcgat caagaggtaa gaagaaatta ggctgtgcct tccgcatatg tgcctgcatt 22201 tcttcagtcg cctgatccaa gatggcgtaa actggttgtg tcaacaaagt ttgagcaata 22261 accgaacgtc ctttttgcga acccgcacca ggtattggtg tgccatcttc gtaagtaaaa 22321 gttgcccaat tgtaaaacag ggttttggtg acagaggaaa aggcgggaga gatgccgtag 22381 gcattggaaa actccatccc cgacatctca acacccgctt ctgctgccat gaggtatcca 22441 tccccggtca acacattgca accgagtgcc ttgcttaaaa aagcacagcc accagtcgca 22501 atgattactc cacctgcacg cacaacccat tgcttgccag tctgacggtt aacccccgtt 22561 gccccagcaa cagcaccttc tgcgtctacc aatagttgca aggcgggact atgatctaat 22621 attgttaccc ccgcccgttt aatttgcttg cgcatcaggc gcatatattc gggtccctgt 22681 agactccgcc gataggattt ttgctgttga tcaacaggaa agggataacc ccactttgct 22741 aataggttga cgtttgtgta agtttgattc aatacttgct gcatccagtc acggttagag 22801 agaaatcccc ccaaagcttc tcgactcgcg atcgcagctt ctctactttc gggttctggt 22861 ggtacatacc acacaccatt acccgatgct gctgcacatc cactggtacc 
gcagtaaccc 22921 ttatctacca aaacaactct cgcaccagca gatgcagcac tccaagcagc ccaagtacca 22981 gccggaccac cgccaataac tagaacatct gtctgtaaat tgggatcaac atcaatatta 23041 atagtagatg aactctgcct cattgaattc ctcgcctcaa tattcgcttc ttacgtacca 23101 gagaaaaact tggacaataa cacaatgaaa aaactaattg ttaagagcaa ttatggcaat 23161 aagaacaact tattaacgta agtttttgtt gtactagcat ctacactacc attatatacg 23221 gtttgttcgt cagtttacag tatttttctg tttcttaaga ctcgcacaat ttattcagaa 23281 aatcaagggt agtttttata aaacttaaga atcagaatac aaggcttgct tttcgagtat 23341 taatatgtaa atatgcaagt ctgttgaagt taactcagca aaatctcaaa aattaccgtt 23401 ttttacctaa attttgacga gatagggggt tcgcgatctt gcttcagttt cagttggggt 23461 ttaaattacc aactgaaaat atctttcatt ttgcgaaaag tttgcgcgtc cggtgagacc 23521 agtgctgcgg gagggtttcc ctccgcaggn nnnnnnnnng aattttgaga gaagtttgag 23581 cggaggtttc ctccgttcgc gcagcgtctc cgaaggagat tcaaatcttc ggtgattttg 23641 aattttgaat tgttaagggg ctggattggg atagcgtgta tgaactgcat ctaactctgc 23701 taaaatctct tggttgagaa tgacattcac gctgtcaata ttttcttgta gttgttctaa 23761 ggatgtcgca ccaataattg tgctactcac aaaccaccga ctccggacga aagccaaagc 23821 taattgtgta ggctgcaaat tatattttct ggcaattgat acataagctg ctacagcctc 23881 atttacattt ggtttgagat agcgttgtcc aaaaccttga aatagagaca atcgtgtatt 23941 ttctggtttt ttctcgtctg tgtacttccc agataacaaa ccaaatccta aaggactata 24001 agctaataac ccaacatttg tataacgaga aacttctgct aaagctgaat caaatacacg 24061 attcagcaag ttataagcgt tttgaatcga gatcactttg ggcaattcta gctgtttggc 24121 aatatgaaca aactcactca caccccaagg tgtctcatta ctcaagccca gatagcgaat 24181 tttacctgct ttgatgacat ctgcaaatgc ttgtaattgt tcggcaatac tcacagtttc 24241 acgttccaaa tcaggtttgt actctgtctg cccaaaggtg ggaacatagc gttctggcca 24301 gtgaatttgg tacaaatcta tataatctgt ttgtaatcgc ttcagactat catctaccgc 24361 ttgcttaatg ttgttgtggt taatttggag atttcctccc cgcaaccatt taaaaggacg 24421 accgggacct gcaatttttg tggcaatgat gagttggttc cgctgctgtt tttttaacca 24481 ttcgccaatg taagcttcgg tttttccttg agtttctcca cgaggcggta ctggatacat 24541 ttccgcagta tcaataaaat ttatcccttg 
agcaatagca taatctaact gttgatgcgc 24601 ttcctcaatt gtattctgct gtccataagt catggttccc agacaaattt cagaaacctt 24661 gaggtcactc tcgccaagtt ggttgtactt catattcttg aatgaattat cttgattact 24721 tttagcaaaa aaaacgtaac aaaacgaaac tttaacttaa aatacagtaa atataccgtt 24781 taaccggagt taatatctca aaaatggata ttaccagata tttatttcgg aaacaagctt 24841 agatgactaa ttttcaatta tagatcaaat gtagctaaaa atactcacta tatacagttt 24901 ttctcaaaga ctgctatcta gtaagataat caacattaat tcaaccaggt aaccgtattt 24961 tttcaataag gtaatttggt tattgccata gagttatttt ctcttaccta gttttgttac 25021 taggagggga aagttttgat gattttttga ggattttcca gatacaaagc tggagtcatt 25081 ttgtggtggg cttacttcag gcatgctacg cgaagcccga cagggcgcag gcggctatgc 25141 catcattttt tatgagtagc aagcgtacaa gataaattct gactcctaaa tttgcttatt 25201 taagtgctca ttgctggctt tgttctttta aattgatacc tctctagcga agttattact 25261 aagccctttt gactatttag cagaagcatt gtatcaagtg attctgccaa ttttgtctat 25321 gaatttagag caaattcatt cactttttag gttgaggagt gtcatgtcga aatttcgttg 25381 gaattctaga tggatactat ttttggtttc cctgctagat atgcagatca tcttagttct 25441 taatcctgcc aaaagttggg cagatacacc tgaatattct tcatctaccg attcatcaat 25501 ttcatctgcc gtgaggagta ctgcgattgt cccaagggga tccgcccaaa gggcgaacgc 25561 tcctgagcaa tataaagcgt ctgttctcca aagcagcaat tccgtaatgg atcgctttac 25621 gctgatctca caagtaccct catcagtgcc ctcactgtca cctatttctg tccctcttca 25681 cccttcacca ggcgatgcag gggtaactcc aaagacttct gtctctgaac tatcggatgg 25741 gaagcgatcg cctactcctg tccctgctaa ccctacactc aagaatgatt cgtccaagct 25801 accaaaccgc gatgcaacaa tagggcaggt taattctgtc tcccagctct ccgatgtgaa 25861 acccaccgac tgggcatttg ctgcattgca atcgttagta gaacggtatg gtgtgattgc 25921 gggttatcct gaccagactt ttcggggtaa tcgggcaatg agccgttacg agtttgccgc 25981 agggttgaat gcagcactca atcgaattac tgaactgctt gctgcaggaa caagcgattt 26041 ggtcagaaaa gaggacttag ttaccctgca aacactacaa aatcaatttg gagcagaact 26101 ggcacaacta cgcggacgag tagatgcgct agaggtacag aatgcgaatc ttgaacaacg 26161 tcagttttct actacaacca aattagtcgg tgaagttgag actgttattg ggggagttct 26221 gacgggcaat 
aacgtcgtta ccaagagacc tgcacctcat gtcatcacat ttcaagaccg 26281 ggttcgccta attttaaaca ccagttttag cggcaccgat caactacgga tgacacttca 26341 agctggaaat attgcctctt taggaggaac aagaactgga atatttggaa ccactgacgg 26401 gagaacttct gataatgcca gtcctgttta tccaaacaat gacgtttata tctctggact 26461 tcgttatcag tttaaacctt ttaaaagtac tcaagtcaac attttttccc agtccgatgg 26521 agcttttgaa atcggcttga gtggtcctat caacccatat tttgaagggt ctgcggctaa 26581 tgcgatctca cgatttgcac ggcggaatat ggtctatgat tatggagata cgggtcccgg 26641 aattgcaata ctccagcaat tcggtaaaca gtggcaatta gggttagcgt acaccgcaat 26701 taatggtgac aatcctacac ctaacaatgg cttattttca ggcaggtatg tagctctggg 26761 acagttatca tactccaccc ccagtcaaga ttttcgggtg gctttaactt acgccaatac 26821 ctacagccca ccgaacacca taggtcaaac tgggacaaac ttcggtccag taatcgggag 26881 taacctggca aacagtaccg tggctggaag gggaactgta ggtaatcttt acggcataca 26941 agctttgtat aaaattagtc ccaagtttgc acttaatggg tgggtaggtt attcagcaca 27001 ccgctatctg ggggtcggag atggtcaggt ctgggactgg gcggtaggac tggcattccc 27061 tgatcttttt caaaaaggta gtttaggcgg tcttttcgtc ggtatggaac cgaaactgac 27121 tgccttgagt aagaatgtag atttgggagc aggtcttgga caagtggaca aggatacgtc 27181 cctgcacgtt gaggcatttt accaatatca actcaacgat tatattgcga ttacgccagg 27241 tttcatctgg atcaccgcac caaactctga tgccgacaat cctggtagtg tagttggttg 27301 gctgcgtacc acattcaagt tttaacctga agagcggttt tcatatgaat agaccacaaa 27361 tcttagggtg ggcattgctc actaatagtt agatttttgt gacctttggg ctcgttatta 27421 ttgaaagtga tgcgagctac tttttaatct actttagaag agctattaat agttatcact 27481 ggttaatcta agtacttttt attattgaat atctgatgag tattcctaat tacaagaaat 27541 ttgataagtc taatgactta tcattttttt gataacacat atgattttgt tttgctatta 27601 ttgaaataga tgagaggatg tttgaaaagg gtctgcttga gccttaagtg agactcaggg 27661 gcgtgatcgc aaatcccaaa acccagattc tctcgtagtc gtttctcggt agttagggta 27721 gtgaaacaca cggttttaga cttttcaaac atcctcttag gttgcgtcct gtttaccaaa 27781 aaggtagtca cttgtgtttt ttttacctgt tcaagtaagc tcgatggcaa gacttctttg 27841 aggagtggaa accagtagtt aaatgtgtcg tttgcagtag acatctccga aaactaatta 
27901 caagtacttc aaaactcttt gtaaatgtac cccggttgtt cgccctcaca tcatttcaga 27961 agatatctat tgacgttttt tgtaaatatt aagaagactt gtaactatga ctcataataa 28021 tcccgaagac caaatgcagt caaatttgtt agctttcgcc agcgcaggaa tttccgtgtt 28081 gttcgaggta atcaaagatc cctttgatga aatccgaggg catttgaccg gatcaattct 28141 tcgttcctgg ataactgaga accggaaaac ttctgaacaa tctagtggtt gtgtcgtctc 28201 ttactatcct gccgaacaga aagtggaagc ctttatagtt gatgagaaaa atgaaagtct 28261 cttcaccgat aaaggtcgga aaattgctgt agttttcaaa gctaaatctc ttgatctaga 28321 gttacagcag ttatttgcaa atcaaaaaat agttttaatt ccttttgatt agaaggagaa 28381 caaaaatgtt tgatccacta acgatctctc ttgcaattgc tgcgttggca gtagcttcag 28441 ttgtcactta tctaactgta acagttattc gtaattatct tcgggagcgt cgaacaaacc 28501 agaatgtaaa tgctaaacct gctttgatgg tggacagatt gaacaacgga gactattctg 28561 ttgtgactgg cttttttgac ggcaatacaa aggtcttaga tagtaaagtt tggaacgcac 28621 agaaattgga tgaagatctc cagaagtttc ctattagtaa acctgtaatc atagagtcat 28681 gaaagtaata attgaagggg tcgagaactc ctcccagaag gaaaaacttc gcgaaaatgt 28741 agcgaagagg attgctgagg tgtggaatgt gtgcagagaa aactccaaga ttcgcgaaat 28801 aatcatcaac tgtaatcctg gctcatcaaa cgcagaacat aacaatggtc aagaaaagtt 28861 ggcaacattc gatcacctat tacgaatgcc taataacaat gcctcgattc ttgtactgcc 28921 ctccaaacag caagaactta ttaatagcca acttaagcgt tgggatttaa ttcaattagt 28981 tcagaaaaaa tggggagttg ataagattga cccatttccg cgagtgtcgc tgaacttcgt 29041 tggaccacca ggaacaggta aaacgttgac agcgcacaat tttgcgtcta ggctgaacaa 29101 aaaaatcatt gaggtttcat atgctgacat tgtaagtaag tattttggag aggctgctaa 29161 gaatctttcc gcactttttg aatttgctaa ggctaacgac gctgtattgt ttattgatga 29221 agccgaaaca ttgctgtcaa agcgcaatgc agcggcgagt gaaggggctg atcatgcggt 29281 taactccatg cgtagccagt tgctattact tatccaaaat acaccaatca tctgtatttt 29341 tgcatcaaac cttgtagaag gatatgatcc ggcttttctc tctagattaa ctagaatcga 29401 ttttccactg ccagatgaaa atttaagtga acgtatttgg gaaactcatc tcatacaaga 29461 gttaccgctc gactctagca ttacagcaaa ttatctcgcc acaaaattca agggtttaac 29521 aggtcgtcag atacgtcaaa tcgttattga agcagcttat 
cgggctgcta gtcgcgaaca 29581 tgctgaccaa actttgtgtc ctgaggattt ttcctgggct catgatttag tttgtagcaa 29641 cgaaacggta agttattctg ttgcaacggg ctttcttgag aacgataaga aaatcgtcaa 29701 tagtaaagtt tcggacgctg agagattaga ttgggaatga ctgtatcagt tctcttttat 29761 ctcaccattc gattcgtgct tacttgcctg acactttacc accaggaact ctagaaacgc 29821 tagtcgcagc ggcgcagtca gcgtctactt cctcgaatct ccagacgtgg agtgtagtag 29881 cggtagaaga tgcaaaccgc aagcaaaagc tatcacaatt agctagtaat caagcgcata 29941 ttcgatcttg tccgctattt ctagtgtggt tggcagattt agccagactc acccacatcg 30001 ctgaaagtcg cggattaccc catgagggtt tagattatct ggaaatgttc ctgatggcgg 30061 cgattgatgc agcgttggcg gctcaaaatg cggtggtagc ggcggagtct cttggtttgg 30121 gaacggtgta tattggtgcg ttgcgtaacc atccagaaag tgttgcggag gttttggact 30181 taccacccca tgtgtttgca gtttttggat tatgtgtcgg ttatgctgac agcactgtag 30241 agacagcgat taaaccacgc ttacctcagc gggctgtttt gcacagagaa acttataaat 30301 taacagaaca ggatgagtca attacttttt ataaccaggt gatgaaagcc ttctacaatt 30361 cccagcaaat gaatgttcct ggtgattgga cagaacattc tgccaaacga gtggcgtttg 30421 ctgagtcttt atcaggacgc gatcgcttgc gggaagcttt aaaaaacttg gggtttgaat 30481 tgcgataaca gcgtttatcc agacaagcgc gaaatgccaa catcaatcgt agtttggcat 30541 ttttttataa aaagacactt gcctttttcg gctgagtgtg tgagactact agaaacaaaa 30601 tactgtttgc ttacctatta accgtacttt aaaagcaaac atcaatcaat agacaattac 30661 tgggcaggta tcccatgtaa gtgtcttctg tgaaagagtt ttgagattat agacacttaa 30721 tcaaaaaagg agcgaaaaca atgtcaataa gaaagtgttg ttgtgacctt ggctgatgac 30781 attctttttc ggatggttca ctcaagtaag tagtcttgat gaacgcatgt agcggatcaa 30841 agccagaatg actgcattga acagaccata taaatgtatt ttagctggta tttcttttct 30901 gatacgaggt ttctctagcc aatagaggca gttctgatac agggtcaaaa aagggaggtt 30961 ttgcgatgaa atctgaaaca cgtggaatgc atttaaaact cgaagggcta cgaaagtcct 31021 ttgggaagaa aacagttttg caagatattg acttagatat tcagccaggg gaatttgtcg 31081 ccattgtcgg tcggagtgga tgtggtaaaa gtacaatgct gcgtctggtt gctggactag 31141 atagtccgag tggaggtagc gtgttgttag atgacaaata ttcacatcat cgcattaacc 31201 caagtatccg gatgatgttc 
caagatgcgc ggttactacc ttgggatcgg gtattagcaa 31261 atgtggagtt aggcttagta ggtttgaact cgaaagtgta tgcacggcaa acggctttgc 31321 aagtcttgcg tgctgtcggg ctggaagaca gagcgaatga atggcccgcc gtcctttctg 31381 gtggacaacg ccaacgggtg gcattagcaa gagcgttggc aagtcaacct gctttgttac 31441 tgcttgatga acccctggga gcgttggatg cactaacgcg gattgaaatg cagcagttac 31501 ttgagaactt atggcaagag caaggattta ctgcattgct gattacccat gatgtggaag 31561 aagccgttgt gctagcagac cgagtgatat tgattgaaaa cggtcagatt ggcttagatt 31621 tagagattaa tcttccacgt ccacgagtta ggggagatgc ggtacttgct ctgacagtgg 31681 agaagatttt acggcgggtg atgggtaagc aagagcagga ggttgcagca gggagattca 31741 ctcagggcga tgcggaagcc gtcgccaatg gcgttaagca tagctctgcc gtaggcaatc 31801 gcctccatga gttggaacca caacattcat ctttcttgaa ttcacctact acagaaattc 31861 caaaacattt ggagaaagtt tcgtaactat ctcgcttaga acttgcagtt atacgcgaga 31921 tagaaagcta atgcaaaatg tggctcaaga aatattaatt gacaactacc aactcatcaa 31981 aaatcaaaaa aggggaaaat ctaccaatgg tgcaactatt agttaaagaa tcaaaagatt 32041 ggataggaat tgcatcctct cttttacaag aactctccag caccgcagtc gaacgtgatc 32101 tcaaagcagg aattccagat ttagaaatac aaagacttcg tgaaagtggt ttacttcccc 32161 tagtcgtacc aaaagcctat ggtggtgctg acggaacttg gatagaagct ttaaaagttg 32221 tccaagaact atctcaagct gatggttcca ttgggcaatt gtacggcaat catctcaatt 32281 taactgcttt gggtcatgtt tctggaaccc cagaacaaaa agagagatat tatcgagaaa 32341 ctgcccagaa taacttgttt tgggcaaacg ctatcaacac gcgagataca agattgaaaa 32401 ttagcccaga aggtgaagat ttccgcgtta atggcgttaa aagctttggt actggtattg 32461 cttccgccga tttacgggta ttctctgccg tgcaagatgg tgtagaattt cctttattat 32521 ttgtcattcc caaagaccga tctggtgtag tttctaatca ggattgggat aatattgggc 32581 aacgacgaac cgacagcaac acgtttacat ttcacaatgt tttagtgaaa aaagatgaaa 32641 ttttgggata tcctcatcct cctgatagtg cttttagtac atttctggga attatcgctc 32701 aactgactaa aacttatgtg tatttaggaa ttgcccaagg agcatttaca gccgccaaac 32761 aatacactac aactattact aaaccttgga ttacatcagg ggtagacagt gcaacaaaag 32821 acccctacat tcagcatcac tatggtgaat tgtggacaga actgcaagct gctatactgt 32881 
tagctgacca aaccgctgtg aaagtacaac aagcttggga aaaggatgtt aagctgacta 32941 ctgaagaaag aggagaagtg gcgatcgctg ttttctctgc caaagctttt gccaccaaag 33001 ccggattaaa tatcacgaac cgcatttttg aagtcatggg tactcggtca actggtacca 33061 aatatggctt tgaccgttat tggcgagatt tacgtacctt cacacttcac gaccccatag 33121 actataaatt caaagatatc ggaaattggg tacttaacca agaattccct cttataaccc 33181 aatactccta atactatttt tgaaccagag attttggatt ttggatgttg aaagtagtat 33241 caacataaga ttttaaaagt ttagtttttt tcctgagaaa ttttcatgat aaaatcaaaa 33301 tgatataaat caatacgatt cagttaagac aatcgaattt agcgttaact tctgtagaga 33361 cgcactcata taaaagtctt tacacgccta aaattttcac gccccaatcc ttaactgaat 33421 tgtattatga tacaaatcat aagtgttgcc ttggttctcc tctgaatctg agatggcaaa 33481 ttcctaaggt gggtattacc caccttatta cctcaataat taagaagatt aatacattaa 33541 tacttgccat tcaggcgatg gcgatcgcca gtcaaagact tagtaattgt ttttgaatat 33601 atttgctaca cacctctgaa ttatcaacaa tcagatacaa acaagtaagt ttctaaacat 33661 ccagttattt gtcatttttg taatcatcta ctaatgcaag ccttaaaaga taagtttgaa 33721 tcatggaaac gcggacgaat cactcgtcgt accgcattat ttgcccttgg ctacagctta 33781 gtattatcta gtacactttt tagttggggc acgccacaga ataatactca acaacaggca 33841 acttcatcta cacctgatgc agctaacgca gctactaaac ttatatctac tagtgctaat 33901 aaagtagtaa gaattgttcg ttctaaacaa cttactgcac tggcagtttt agaaaagcaa 33961 ggcaatctcg aaaaaagatt acaatctctg ggatataaag tagaatggcc tgaattcgct 34021 gctggaccac aacaattaga agccctaaat gcaaacgggc ttgacattgc cttaaccgct 34081 gaatcaccgc ctgtatttgc ccaagcagcc ggaactcctc ttgtttactt ggctgctaat 34141 tccgctgatg gtaaatcaat ttcgctttta gttcctacca actctaaagt caaaagtgtc 34201 aaggatttga aaggtaaaaa agttgctttt caaaaagcat ctatcggtca ttaccttcta 34261 ctcagggcgt tagaaaaaga aggattaaaa ctcactgatg tacaatcagt ttatctacca 34321 ccagctgatg cgagtgcagc attcagtcaa ggtaaagtgg atgcttggtt tatctgggaa 34381 ccatttgtta ctagaaatga gcaaaataaa atcggtcgtg ttttgataga tggtagtaat 34441 ggtttacggg atacgaataa ttttttctca acaacacgca aattttatca agaaaatccc 34501 gaagtcatta aagtgttttt agacgaacta caaaaagcac aagtttggtc 
aaaagaacac 34561 ccgaaagaaa tagcccaact acttgcacct gtgacacaac ttgatgtacc aactctggag 34621 aaaatgcaca aaaagtatga tttttcattg gtaccaatta ccaataaaat tattacaaaa 34681 caacaggaag tagcagataa atggtatagt ttaaaactca tacccaaaaa ggtgaatgtc 34741 agagacgggt ttttatcacc tgaagaatat gccaaaatca cacctaaaga agtcctagct 34801 aaacagtagc tatctctagg aagtcaaata taaaaaatat tggacaattg atacctacat 34861 tggctatctc tattctacag ctttctgtct cccttctttt gttggagaaa accgccataa 34921 atttgaggct gatttaaaag aatcccttct acaggtagag ccttcaggaa catttacaga 34981 agagttaccc attactgtca ttgcaacttg gaaaagttag tttatcaaaa aacattctac 35041 tagcttagtc aaactttatt aacaaagaca acattatcct atgaaaatta tctcttccac 35101 tcataataat tttccatcaa ggcagacaat gaaaaaatat aaattcaaat cacacttttt 35161 aaaaaatcgc aaaattcagt ctttgattcc ttggttagta cctctgtcga taattatttt 35221 gtggcagttc ttctcttcta taggcttaat tccaatccgg attttacctt cgcctttaag 35281 tgttgtcggt gctgctatta atttagcaaa gacaggcgaa ctatttagaa atatcggcat 35341 tagtgccaca agggcaattt ccggatttct acttggtgga agtattgggt ttctcttggg 35401 tttactcaat ggaatctctc ccactgcgga aaagttatta gatacttcta ttcaaatgct 35461 acgtaatatt cctaatttag cattaatacc tttggtgatt ttgtggtttg gcattggcga 35521 tgaagccaga ttatttcttg tctctttggg tgtcatgttt cccatttatt taaatacttt 35581 tcatggaatt cgtagtgttg atccaggatt aattgagatg ggaaaagttt atggtttaag 35641 tacctggggt ttattctggc gaattattct tcctggggca ttgtcctcta ttttggtggg 35701 tgtccgtttt tctttgggta tcatgtggtt gacactgatt gttgctgaga caatagccgc 35761 tgattctggt attggttaca tggcaatgaa cgccagagaa tttatgcaaa cagatgttgt 35821 tgtattgagc attttgctat acgccttgtt cggtaaatta gcagatgtta tcgccagagc 35881 cttggaaaat tactggttgc aatggaatcc caattattct agatcttaga tatatcgatt 35941 tacagtaatt taaatgattc aaggaattag tcatattaca ttcatcgtca gggacttgga 36001 gaagatgacg aaattccttg tatctatatt tcatgccaag gagatttatt ctagcggtga 36061 gcaaactttc tccatctcca aggaaaaatt cttccttata aacggcttat ggattgccat 36121 catggaagga gagtcaatgc ccgagaaaac ttataaccat gtggctttta aaataaccga 36181 agaggactac gaattctatg ctgctagagt 
taggagtctt ggggtagatg ttaaggaagg 36241 tcgaactcgt gtagaaggag aaggacgttc cttatatttt tacgactacg acaatcattt 36301 gtttgagtta catacaggaa ctttgaatca acgattgcaa acgtatcaag acctgagtat 36361 atagcaatcc tatatgagtt gtgttggcat agcctgcgcg tagcgcataa attgctggag 36421 tccagaaacc cggtttctgg cagaaaccgg gtttctaatt tttcacaaat gatttatgac 36481 tcaccctaca gtctactttt gataaatcaa tcatcctaca acactttaca ctgaccatga 36541 tgccgcaaca gatggtcaca gagcaccaac gctaccatcg cctcaaccat tggcactgca 36601 cgcggtaaca cacaaggatc atggcgtcct tttcctgcta gagttgtttc ttcaccctca 36661 ttggtaactg tcttctgttc ttttctaatc gttgccgttg gcttaaatgc aacacgcaag 36721 atgatatttt caccgttgga aataccgcct tgaattccac cagaacggtt ggttacagtg 36781 cgagtttctc cattttcatc tacataaaac tcgtcgttat gctcaatacc agtgagtagt 36841 gtccccgcaa aaccggaacc aatttcaaag cctttactag ctgggagaga catcacacct 36901 ttagctatat ccgcttccaa cttatcaaaa actggctcac ccaaaccttt cggcacgttt 36961 cgcgctacgc attccaccac gccaccaata gaattacctt gtctgccagt ttgttcaatc 37021 aattcaatca tccgttctgc acattccgca tcgggacagc ggacgatgtt gctttcaact 37081 tgttctaaag taacagtgtt gggatcaaca ccgccttcta aatctttgat acgtttgacg 37141 taaccaataa cttcgacatt ggcgacttga cggaggattt ttttggcgat cgcccccgcc 37201 gcaactcgtc ctatagtctc ccgcgccgat gatctcccgc caccctgcca gttacgaata 37261 ccatatttcg cgtcatatgt cgcatccgca tgagaggggc gatatttaac cgccatctca 37321 tcataatcct gagagcgggt gtctttattc cgtaccagaa tagaaattgg cgttcccagg 37381 gttttccctt caaaaacccc agagagaatt tcgcaagtgt ccgtttcctt gcgaggcgtc 37441 gtaatcttac tttgtccggg acgccttcta tctagttcta cttgaatttc ctcagcagaa 37501 atttctaatt gcggaggaca accatcaatg acaaccccca caccgccgcc gtgggattcg 37561 ccgaaagtgg tgatgcgaaa cagatgccca aaagtgttgc ccataatatt gaggaagtat 37621 cgtcaagcgc cgtattctac ctatagcatt ttctcatcac aaagcgaaaa accagatgac 37681 tgtgatttac ctatttcttt gacttttttt tgcttactta gctcccgttt ttaggctacg 37741 gtgtacacaa gtccttaaat tactaactca ttcctgtaaa acctcaccct gccctgtcgg 37801 gcatccctct ccttattaag gagagggaaa ggtttttacg ccagtcagcc cccatatgcc 37861 aaattctcat 
tcccaaaaaa ttacctacag ccctgcttac acagtcgttc cgacttacga 37921 gtgcttcaat cgctgtgctt attgtaactt tcgcacagaa ccgggtaaga gcgaatggct 37981 aacaatttca caagctgaaa aacttttcaa acaacttcac aaccaagatg tttgtgaaat 38041 cttgatactg agtggcgagg tgcatcctct ttccccacgg cgtcaggcgt ggtttcagcg 38101 gatttatgat ttgtgtgaat tagctttggc tatgggattt ttaccacaca cgaatgctgg 38161 accattgagt tttgaggaaa tgcaacaatt aaagaaagtt aatgtttcaa tggggttaat 38221 gttggaacag ttaacgccag agttgttgaa tacagtgcat aaacatgcgc cgagtaagac 38281 accagaagtg cgtttacaac aattacagtg ggcgggagaa ttaaaaatag cttttaccac 38341 tggcttactg ttaggaattg gggaaacaga acaagattgg tggaaaacat tagaagctat 38401 agctgaaatt catcaacgtt accatcatat tcaagaagtt attctgcaac cccacagtcc 38461 aggacatcag caaacttttg atgcaccacc ttttaaccct catcaattgc caaaagttat 38521 ttctcaggca cgtcaaatat taccaccaga tattagcatc caaattccgc cgaatttaat 38581 caaagatgac gaatggttac tcgcttgttt agaagcaggt gcgagagatt tagggggaat 38641 tggaccaaaa gatgaagtga atcctgatta tccgcatctt ccggagcagg aattgagaga 38701 aatattacac cctgctggct gggagttagt accaagattg cccgtttatc cacagtatga 38761 tcattggttg tcagtccaat tgcagactaa tattaagcga tggcgaacct tttttcgaca 38821 attgaccgtc ctctgataaa aattgcaata ttttttacca cttatgcttg acactcaaga 38881 attactcact aacctctaca acgcctttaa cccatttgaa cccttacccg caggtgatcc 38941 aaagtacgtt gattgtcagg atgtacgcgg tgatgtggat attttacaag agttcggaaa 39001 tcgcatacaa cgagcagata gaaagacttg ccaattgtac tctggacatc ggggggcggg 39061 gaagtctaca gaactactga gattaaagca atatctggaa aatcgcaaat tctatgttgt 39121 ttattttgcg gctgatgaag aagatatcga ctcagaagat gctcaatata cagatattct 39181 cctcgcttgt acccgccgat tgctcaaaga tttacagcaa tttggtgatg ctagtccggt 39241 gctaaactgg ctcaaagaac gctggcaaga attgaaagat ttggcgcaga ctccgataga 39301 gtttgagaac ttgaaattag aagcacaaat cgctcaattt gccaagctaa cagcaaattt 39361 aagagccgtt cccgaattac gccagcagat tcgccgaaaa atcgatcccc acactgtcac 39421 cttaattaag gtgttgaatg aattcctggc agatgcaaaa agcaagctac caaacggcta 39481 tactcaactg gcggtgattg tcgataactt agaccggatg gtattagtta aagatgggga 
39541 aaatacgaac catgaagaaa tttttttaga ccgcagtgaa caactcaaag ctttggattg 39601 ccatttgatt tatactgccc ctatttccat gctgtattct aaacgggcaa cagacatcag 39661 agatatctat ggtgaatgct taattttgcc gatgattatg gtcaaaactc acaaaggaga 39721 agtttatgaa ccaggactca agaaagtaaa agaagtgatt cgtaagcgag tccggcaaat 39781 tgaacaagaa ctcccgttgg aaaacggaat ttttgatagt cagcaaactt tagaaagatt 39841 gtgtttgatg agtggaggtc atgtcagaaa tttgctattg ttaactcaaa acgctattgg 39901 acggactgaa gagttaccaa tttctgaaaa agcagtgcga cgagcaatca ctcaagccag 39961 agatgactat catcgggctg tggaaaatca tcaatggtgt ctgctagcgg aagtctcccg 40021 ttctaaacgc atcgttaatg atgaccagta tcgcagtttg atgtataacc gctgtctttt 40081 agaatatcgc tacttggatg atgatggaga aatgcagcgt tggtatgata ttcacccatt 40141 gattcaagga atctcagaat ttaaagaagc tgtggcgaaa cttccatgaa cccaaattta 40201 agtgattggg aaagtgattg ggatgatgat ttaccccccg aaccagaaga agcttatcaa 40261 gacttagttc gcgcactgaa gcggaaatca ggatttggct tgttttttgt gcaatgtaca 40321 cctgcccaat cagaccgttt cattgccaaa cttccgcagg atattcccca gaaaaaaatt 40381 gcagcgttgc gcttagttga accgattgat aacttgtatc agccggtagc tgagtttgtt 40441 aaaaacaagc aagttgatat tttattgatt aaagggttgg aatattcttt atataagtat 40501 gagcaaagaa cttttggcga aattacagag gggcaattta gtaatttaac tcgtgttcct 40561 ccaattttaa atcatttaaa ccagcagcga gaaaggttta gagatgattt tccattttgc 40621 tttgtttttc tattgcggtc attttcgatt aactatttga ttcaccgcgc cccagatttt 40681 tttgattggc gttcaggggt atttgaatta ccaacaacac cagaagtggt agaacaagaa 40741 acatctcgtt tgctgctgga aggggattct gaaaaatatt ttaaactcag ccttgaaaaa 40801 aagatagaga aagtccttga aattcaagac cttctagcag caaaacatca aacagagaat 40861 agtcaggtaa ttttactact tgaacttgga aatttattag tcgctgctaa ggaatatgaa 40921 gcagccatca catcttacaa ccaagctgtg aaatgtcaac tagacttaca tgaggcttgg 40981 tacaaccggg ggattgcgct ggataattta gaacgatacg aagaagcaat agcatcctat 41041 gaccaagctg tgaaatttca accagacgac cacgaagctt ggaacagccg gggctatgcg 41101 ctgcggaatt tagaacaata cgaagaagca atagcatctt acgaccaagc actgaaaatt 41161 aaatcagact accacgaagc ttggtacaat cggggctatg 
cgcaagggaa tttagaacga 41221 tacgaagaag caatagcatc ttacaagcaa gctgtgaaat ttcaaccaga ctatcacgaa 41281 gcttggtaca gtcttgctac tgcgctggat gatttaggac gatacgaaga agcaatagca 41341 tcttacgatc aagcactgaa aattaaacca gactaccatc aggcttggta caaccttgtg 41401 actgcgctgt atgatttagg aagatacgaa gaagcgatgg ctgttgaccg tcaagtttgg 41461 aacaactggg gtattgcgct gcgggatata gaatcccacc aagcagcaat agcatcttat 41521 caagctgtga aaattaaacc agacttacat gaggcttgga acaatcgggg tgttgtgctg 41581 gggaatttag gacgccacca ggaagcaata gtattctacg accaagctgt gaaaagtaaa 41641 ccagatgatc acgaagcttg gaacaaccgg ggttatgcgc tgcggaattt aggacggtac 41701 gcagaagcaa tagcatctta tgaccaagca ctgaaattta aaccagataa acatgaggct 41761 tggaacaacc gaggcattgc cctgctgtct ttgggacgca acgaagaagc aatagcatct 41821 tacgaccaag cactgaaatt taaaccagat aaacatgaag cttggtacaa ccgggggatt 41881 gccctgcggc atttaggacg atacaccgaa gcgatcgcat cttacgacca agcactcaaa 41941 attaaacctg acaaacatga ggcttggtac aacaaagctc gttgttatat ccttcaaagt 42001 aacattgaac aagcaattga aaacctcgaa aaagcaattc atcttaatcc tgataaatgc 42061 caggattggg caaagaatga ttccgatttt gatagtattc gtgaagatga gcgctttttg 42121 gtgttgattc aaggacagca aacaggcgat tagttaaact cagattttgc acttcaccgg 42181 tgagtacgcc ttgaactcaa gttgagggct gcatagtcaa agtccgttaa aacagactga 42241 aatattatgt ctccagtgag ctttagctta cttgatcttt gagccaagaa atttatttct 42301 tggtggacga aaacgcagac gcaagatttg agaaaagtta ttttagacgt gcgattgcgt 42361 tacaaaaatg gattgataat tcttaccttg ttttctacta actgattgtg ctgcatatct 42421 tctgagtaaa tcatggaaca gtcacttagt aaagctgtag ctatgataag gctatcccaa 42481 taagagtagt tatattttcc atttatttcc aaagctttta aaacttgtgg tgtatcaatt 42541 gcctgaacag gaaaagtgct gatgagattt gaaataatat ttatcgcgtc tattttggaa 42601 gtaaagtttt ttctggttaa aacatggaag aattcaccta aaacttgagt gctcacttgg 42661 attaatggca aatcctcctt gataattcgc tcaattttct gatatttttc tggtggattt 42721 ttggcataaa gataaatcca gagattagta tccagaaaaa ttttatctat catagatttc 42781 ttctcgattg aatttataat cggaagggat ttcaaaagaa tacttattaa ctcgtgctaa 42841 aaattcctga attttggttt 
ccctagaatc tattgtctca cgagactcgc tttgctcaaa 42901 aatcataatc tgtacggttt taccttccag ttcaggattt agcttttcgc taagaactaa 42961 ttcaccattg gtgcaggttg cttgcacaat tttgaccata ggacttgctt ggtgattatg 43021 attgctttct ttagtttaac ccgaagttga gctacttaaa actaaacaat tgtttgatta 43081 cctaaatcct ttccctcacc caattccgaa aattccgcgt cattcccatc actccaattc 43141 tttgactttc ctctaaaaat tgtggaattt caattaaaga caattgtgca aaagtaggcg 43201 aattattgca ctcgacatat tccccttgtt gcagttggta aatgtgcagc acaccttcgt 43261 cataacgcca aagttcagga acacctaaag aagcataaat ctggaatcga tccagagatg 43321 agctggtcaa atctacttct accactaagt cgggtggcgg atcgttattt aaattaattt 43381 ctgttttgtt tctcacccgt gcttcattgc ggatgtagta gctgctatcg ggttcagcac 43441 cacgccgcaa atcttcgcgt ttcaaggttg tggaacctgt aggaagaatt tctaaattca 43501 actcttcccc aagaacaaaa atcagacgtt ctattaatct attccaatat tcatggggca 43561 agagtggcgt cataatttcc agcgtacctc tgtcataagt cagtcgggag gctcggtctt 43621 ctcccatttc tgctagcatg gtttcaaagg tctgccagct aatgtttctt aaaacaaccc 43681 ttgattctga ggggatgcgt gttgttacca ttgctctatc gggtacaggc tcatatcata 43741 caacatagcg ttttatgaat ttaaagtaat caagtaaaaa gcatcctcct taagtaagag 43801 ttttgacaca aaaacagtga gataaaatag aggaagtagg attcggggga ccatattact 43861 ggtcatagat gcttatcgag actatctttg gtaatctcca aggtttaaag tctagtcagc 43921 tgaaacaaat acagcgactg taccaccagc gcatatcggg cgatcgcatc acaacgcctg 43981 agttttccca gcgactggca gcaattagca ctgaaatcaa tcagcctgtg tgtgcttacc 44041 tcaaccgtcg cggacaagtc atccgcgtag gggtaggtac accgcgccaa acgcaaattc 44101 cacctttgga attaccccgt tatggcgcag aacgattgag cggtattcgt tgcatcgcca 44161 ctcagctaaa gccagaaccg cctaatgaag cggcgctgac tgctatggca ttgcaacgat 44221 tggatgcctt agtgatgcta aacattaccg gaacaggatt tcaacgacgc ggtggcggcg 44281 cgacgggcta tgtcaaggaa gcttacttag ctcatttgac accccaagac gctcgcgccc 44341 tgatcagcag ctcagcaatg gaaaaagttt ctcatcttcc acctgcaggc ggggagaata 44401 aagagggtaa agaacaaaag aacaaccttc catacacaag ttggagcgtt tcgccaccca 44461 tgagtgtgga tatgctgaca aatcaggact tgatggagtt ggtggaagga ctcgaagcag 44521 
agttccggcg ggaatttgtc gcccaagatg tagacaccga tcatgatcgc gtcctgattg 44581 ttggtgttat gactgatgat acgacagccc aacaattcca agacaccttg gaagaattgg 44641 cacggttggt ggatacggct gggggagagg tgttgcagat gatgcgacag aagcgatcgc 44701 gcattcatcc tcaaacagtt gttggtgaag gtaaagtcca agaaatcgcc gtagctgctc 44761 aaaccctagg agcaaacctc atcgttttcg accgcgatct ctcacccaca caagttcgta 44821 atttagaatt gcaaattgga gttcgggtag ttgaccgcac tgaagtgatt ttggatatct 44881 ttgctcaacg cgctcaatct cgtgctggta aattgcaagt tgaactcgca cagttagaat 44941 atatgctacc acgactggct ggtcgaggtc aagcaatgtc caggctgggg ggtggtattg 45001 ggactcgtgg tcctggtgaa actaaactag aaacagaacg ccgtggcatt gggcggcgta 45061 tttccagact acaacaagaa gtgaatcaac tacaagcaca tcgggagcgg ttgcgccagc 45121 ggcgacaaca tcaggaggtt ccgtcagtgg ctattgttgg ttacaccaac gctggcaaat 45181 ctaccttgct caatactctc acgaatgctg aggtttacac agcagaccag ttatttgcca 45241 ctctcgaccc caccacacgc cgcttggtga ttccccatcc cgaaacggat gaacctcaag 45301 aaattctggt gacagataca gtagggttta ttcacgaact gcccgcatct ttaatggatg 45361 cgttccgcgc caccctagag gaagtcacag aagctgatgc tttgttgcat ttggtagatt 45421 tgtctcatcc tgcttggttg agtcatattc gctctgtcag agaaattctc tcacaaatgc 45481 ccatcactcc gggtcccgcg ctagttgttt ttaacaagat tgatgaagta gatagcaaaa 45541 acttagcatt agcgcaggaa gagtttcccc tagcggtgtt tatttccgcg agcgagcgtt 45601 tagggttaga aaccttacgt cagcgtctcg gtcaactggt tgaatatgcg gcttcttatc 45661 attaaataaa ttttattctc tgtcagtata ggggtgtggg gtgttagtgc agaataaacg 45721 cagctcctcc acccctaaat tctgccctaa attctgactg atattagggg aagcgatacc 45781 accacatgac aaacttagtc atgaataaaa tttgtggtgt aagtcctaca aagacgtatg 45841 atgaggcata ataaattttc cgcaacgcct ccatgcgtcg agactcaatt ttctacaaac 45901 tatttcaaca atccccaagc ctattatttg aattattggc aaatccccca gccaatgctg 45961 acaattaccg ctttgattca gtggctgtta aggaaccgaa atttgaaatt gacggcgtgt 46021 ttcttccacc tgataccact gatgcaggaa ttgtgtattt cagtgaagtc caatttcaaa 46081 aagatgaaag actctacgaa aggctatttg ccgagtcatt attgtacttc tatcgcaatc 46141 gcgtcagata cagtgattgg caagctgtag tcatctaccc gtcgcgtagt 
actgaacaaa 46201 gcgatactca tccctatcga gcactattga atagcgaaca agtgcatcgt gtttacttgg 46261 atgaattagg agatattcgc caattacctc tgttgttagc cctaatggta ctaactacgg 46321 tagcagaaga ccaagcacca gaacaagcca ggtatttgct caccagaaca cgtcaagaac 46381 agtcccaagc atcaagtcgc gccataatag aaatgataac gacgataatg gtgtacaaat 46441 ttgaacaatt aagccgagcc gaggtggaag ttatgctggg aataacttta gagcaaacga 46501 gagtttatca agaaattaag gaagaaggac gacaagaagg acgacaagaa ggacaacagg 46561 aagcaacagt taaattgatt gttcgactgt tgactaagcg ccttgggcaa gaactttctg 46621 aggaaatgca agcaacaatt tctcatttac ccttaggagt gcttgagaat ctcagcgttg 46681 ccctgttgga ttttaccaac ttggcggatt tacaagcttg gttagatgca cagtaaatca 46741 tacaactgag cgaataaggc aaaagcgcgt tgtaagagcg tgcatcccta tcgtacttct 46801 cctgttgtca gaggtgtcac tggctgacct gtgggtaacc gttcttgaag ctctgcaagt 46861 ctaatattcg gtagatgagg taacacaagc ttggcaaaat agtgagattc ctcaatcagg 46921 ggatagccag agaaaataaa tgctcgaatt cccatatcca agtaacgatg cagtcgttca 46981 agtatctgag atggatttcc cacgagtgct cctccacacc cagaacgcgc tcgtcctatt 47041 cccatccata agttcccctc gatgtagtcg tcagagtcag caactgttcg cagggtatcc 47101 tgtttcagga cgcctacgga gcgagaatct tgtgcgcgag atttcaactg tacaccgcga 47161 gctgcatcga gttgcgatat taaagttttt gcccataaac gcgcctcttc ctcggtttcg 47221 cgtacaatca catgaatgcg gagtccaaaa tcgatacgcc gtccatatgc agcagccctc 47281 tttgaaaggt cctgcatact ttcaaataga ccttcttcag tatcgggcca catcagaaaa 47341 acatcgcagt attgagcaca cacatcgcga gaacctggtg atagaccgcc gaagtaaagc 47401 agtggtccgc catgttgctg atacggtttg actgggtaac tcggtaagct gaagcgataa 47461 atctctcctg aatggtgtat tttttcttgc gtccacgcct gttgaagaat ttggatcact 47521 tctttacacc gttgatagcg atattcaggg tcttcaacaa gaccagggag ttcagaatta 47581 atgatgttta cggtgagccg tccctgtaaa agatggtcta acgtcgagag atgacgtgcg 47641 agcataggag ggtggatctc gcctgtgcgt attgcggtga gtaaactgat ttgttggagt 47701 tgggatgcga tcgcccccgc aaaaggtaac acctcctgac ccaacatata gcttgtaggc 47761 agtagaatgt tcttatagcc caacgcatct gctgtgagca caatatcacg gcaatgttcg 47821 tagttgctgc gccgagaggg gtcaagcttt 
cctaaacgat cagtgtcacc cccgcataga 47881 tcggcaaacc aacccacctc agataccctc tgatgcgtcc taatagtcga aagccgaggt 47941 acatcgcctt taaacatatt ttttttctcc taaataactc tgtaagttgt actcgacttt 48001 agtcatattt agataagacg gctcattgcc agcaaaacta cagaaaaact aggtaataat 48061 aatttatcag taattatcct gaattatcct gaacttataa agaataatta aaactgccat 48121 gaaaactttt gctaatctgc tgatatcgta gtcaacacct aagtctcgtg agttttagct 48181 atcgaccagc tacactcatg tggacgtatg aagaagacag gcacggtaag cgtagagagt 48241 agggggggtc atctacggct gagactcccg acgagtgtcg ctcaaggcgc tgctaggtac 48301 atctccacgc gactagagga caataaaatc tgaaacttct gtgactagcc gattctttta 48361 atctttgcca gttttacttg gacggctttt ttgttcgtcg aactcacgtt gattaaccca 48421 aatctccgca tccgccagat tattaactat cgccgtccaa gcgttggaca acccctacaa 48481 gaagtgctac tggtgggctt tggagttgaa caaaaaacag ttccttaaaa ttctcagaaa 48541 cttgaaaaat gaattaagaa acgttaatac ttcctaagcg cgattgccca aagggcagac 48601 gccagaggcg tatcgcctgc ttgctgacta aagtcctgga tgccacaata gtccatgaaa 48661 atacaaaatc tcctccgtta tgactatggc ggctttcagc aacagcgatt gactgcattt 48721 tgatgatgtc tcagtgatgt caaactgatg taactaatct acacaataaa acttaacgat 48781 tcttaatctt tttggttaca acgttatatg aagattgagg gtgtgcaagg tgtaaaacct 48841 tggtataaag ttaaaaaact cacacgattg agagtttttc tagctcctag agtcgataat 48901 aactctcaac tctaaatttg ttgcttgaaa aagcaattct taatttactt aatcttttga 48961 gataatcact tctcattatg ttgttcacca ggagatttgc ttaatgcctt ttggaccagc 49021 atcacgttta ggagttagcc tatttgagga aacccctcca cttgagtggg ttccaggtct 49081 gtctgaagaa gaagcacaaa ccctcattaa agccgtctac aggcaagtgc tgggtaatgc 49141 gtacgtcatg gaaagtgagc ggctgacagt gccagaatca caattcaaac gtggtgaact 49201 cagtgttcgt gagttcgttc gggctgtagc aaagtccgat ctctacttct cccgttttgg 49261 tgacactccc cgctatcgtt ttatcgagct aaacttccgg catttgctcg gtcgtgcccc 49321 caacagttac gatgaaatga aggctcacag cgccatcctg gatgcgggtg attttgaagc 49381 tgaaatcgac tcctaccttg acagtgacga atatcagaac actttcggtg aaaacctcgt 49441 gccttacatc agaggctaca agaccgaagc attgagccac atgataggct ttacccatac 49501 cttccagttg 
gtgcggggtg cttccactag ctctctcaag gcagatttgg caggtaagtc 49561 tccgaaactc aactcactag tcattaatgc aactccaacc cctgtggttc cacctggaac 49621 cactttccgc aacccgccag tcagttcacg tgttcgtctt ggtgtgggag caagcgaaga 49681 aggtaaagtt taccgcatcg aagtcactgg ctaccgagca aatgcggtta accgagtttc 49741 caagttccgc cgcagcaatc aggtttactt ggtaccgttc gataagcttt cggaagagta 49801 tcaacggatt cacagacagg gcggtgtcat cgctagtatc acgcctgtat agcaaaaatg 49861 gcgatcagcg attagcagtt tgttgataac accagactta gagaataggt aagcccgttg 49921 agccacggcg atgctccttt ggaggagccc tatgccccaa atcgaagact tggggagaga 49981 gattctttgc cgaatccctc taggcaaaca agggcgattg cccaaagggc agacgctgct 50041 aggcgtatcg ctcatgggga aaagacgcgc ttgtacactc gacacccccc ttgttcgaga 50101 gttaggggcg ctttcccaaa ccccagaata agggagcgac ttgttcccaa agctatccta 50161 agcagttgaa actggaagcc ctaacctcta cgcagcaatc atcgagtgcg taccgtaagg 50221 catcgcaaag taggttgggg tacttcagtt gctaatcggt gagtataaag tgagttttga 50281 attctgagca aaacaagtgt taggagtatt cgcgtaaatt atggcaacaa tagcaccaat 50341 agaactctgg tccacccgcg acctggagga tgtacaagca gtcattcggg cagtctacaa 50401 gcaggtttta ggcaatcccc acgtgatgga aagtgagcgc ttggtgactg cagaatcaca 50461 attgaaggat agcactatct ctgtacggga ctttgtccgg gcagttggta agtcggattt 50521 ctaccgctca cgttattttg aagcgtgtgc tccctatcgt tttatagaat tgaatttcaa 50581 gcacttcctc ggtcgtccac cacagtcaca agcggagatt tccgagcaca tcgttcgttg 50641 tgtggaaaag ggatacgacg ctgaaatcga ctcctacatc gatagcgaag agtaccagtc 50701 agcatttggc gaaaatgttg taccctacaa tcgcggggtg aaaacggaag ttgggcgcag 50761 ccaggttacc tataaccgta cgtttgctct ggatcgtggt ccttctcaaa ttagtagcgc 50821 tgttaaatcc tctcaactgg tttatgcagt cgctaccaac agcccgaaca agataaaacc 50881 agcagatgtt aacctcggtg gttcgggtga agcgaacaaa aagaaattca agattttagt 50941 gcagggttcc aagtttgata gccctcgtcg ggtcagcacc accgaataca ttgttcctgg 51001 tgaccggatg acccctcaaa ttcagcggat taaccgcacc ggcgctaaga ttgtcagcat 51061 cactgagatt gtctaggcat caaccttcaa gaattgcaca ggcatgttct ttgcaattct 51121 tgagagcttt gctcgtgctg tgagatgagt atttgcaggc gatacgccta gcagcgtctg 
51181 cccaaagggc agacgccctt gtttgcccaa aacaattctt tgcccaatct ccctccccaa 51241 attttcgatt tgaggagagg gtacgtcaag gaagcatcgc ccaaaagtgc tcagacaaaa 51301 cagtcttaaa acgccatcct cgcacaaaaa tctgcccaag gggcgatgct cctttggaac 51361 tgtcttaata aaagtgatca aaaaaacctt gaacctctgg tttaatgagt tgaaatttcg 51421 ataagattat gctagcaact atgtgcaatg accaattgag ctgttccatt ggatgtcact 51481 catactcatc attcgccata ggaattgaaa aatagctttg aacctattgt ctaatcagct 51541 taccgattaa atcacgagaa tgatttctca actttatcaa actgaacaat aaacacttca 51601 ttaggagaaa ttcccatggc actttggatt gcagatgcag agtctgttga actacgcccc 51661 aacacttcag aggatgatct acagacgatc attcgggcag tgtatcggca agtgctaggc 51721 aatgctcatg ttatggaaag tcagcgcctt accagtgcag aatccttgct gcgcaatggt 51781 gatattaccg ttcgaggatt tgtgagagcc attgcccaat ccgaactata ccgctcgttg 51841 ttttttgata cttcatcctc ctaccgtttc attgaattga acttcaagca tctgttagga 51901 cgcgctcctg ttgatcagac tgagatttct agacatgttc tcatttacaa tgagcagggc 51961 tacgaagcgg agattaactc ctacattgac agtgatgagt atatccaaag ttttggcgaa 52021 aacgtagttc cttcctcacg cggcaatcgt actcaaactg gcattaagaa tgtcggcttt 52081 aatcgcacct tcgctctaga tcgaggattt gctgcctatg atgctgctgg caaaaacgcg 52141 aaattgatta gtgatgtcgg aggtaacctc ccaacaaaaa tcaaattccc agcaactggc 52201 tctggagctt acagtaatac tggcaagcgt ttccggatta ctgttaccaa gggtagttct 52261 aatccccgta tgaatcaggg caaagtcact tttaaagttg gctataacca aatgtcgcag 52321 aagattcaga acatccaaaa gactggcggc aagattctca gcattactga agttgcttag 52381 accataatac agtgatcagt gaacagtgag acagagtgct gcctttgggt ttcccttatg 52441 ctggctctcc cgttgggtta tcaccttgct cactgataac tgttcgctgc gtctttggta 52501 attactccct tcgatgctct aaaattacct ctgccaaagc gttgcgtatt ttctctgcgc 52561 tctggtgcgt ttttaaaatt ggtaatgttt taggtagaaa gggagtagtt acagaacgtt 52621 tcaaaatcac cggcgaagtg gcaaacaaat atccaaaaag ttagtggttg ttctgttgct 52681 cttaaaattc attgcatgaa atcgatggat attacgcagt ttgttgagct ttcaattggg 52741 cgttggcgat cgcaacgtag cgcccaccat ttagcattta gccactttga agctgtgcaa 52801 tcggtgattg atattattgc actctcacct gacgatccgg 
atgtcatgac tctgtgtaag 52861 tcatacaata ttgaccaaag ccagattgtg tctccctttg ggatgtcttg ggagggacag 52921 tcggattggg atgaagacgc acagatgaaa gggagtagta tactcgtacc agttcccgat 52981 cctaatgtcc ctaatcgggg caaattactt agagatcagg gatacgctga aaccatcgca 53041 gcggctggtg actatcacct gacagaagat gggacatttg tcctgctgac gacatatgat 53101 cgagcagcgg ctgaagagaa aatttggttt gccaatccaa acttgcgatt tcgcgtgtca 53161 ctcatcaaaa cgagtggtgg aagtggagtg gtgactgcat cgtttgcctc agaaattcgt 53221 tcgtcgagta gttcagtaaa cagttaacag taaacagtga tcaactgaca actgataatt 53281 gataactgat aattgataac tgataactga tagatcatga cttcagcaaa agccaatgac 53341 ttaacgacgc tggctcagtg gatggcaggc gatttcagca attacaagca atcctaccac 53401 aagcctcagc aatttgctca tatccatatc ttttttcgtc ccttaccctt tgagtttttc 53461 aatgcgattg gtttttactc agaacaggtt tacgatcacg acttatggag tccctatcgt 53521 cagggagtgc atcgattaat tgacgaggga gagcaaattt atatagaaaa ttacagtctg 53581 aacgatccat tgctctatgc aggagcagca cgggaattaa gcattctcag aactattaca 53641 ccagattgta tcgaacgacg atatcactgc tctatgattt tcaaacgaca gggcgagatg 53701 tttcaaggca gtgtcgagcc agggaacaaa tgcttaatcg agcgaaaggg ctgtttaacg 53761 tatctcatca gtgatgtgga gctgacagca acaacgtggg ttagcctgga taaaggtatg 53821 gatgttaata cccatcaaca agtttgggga tccacgtttg gggcgttgga gtttgaaaag 53881 cgggagagtt ttgcacatga acttcccgag tatcgtcttt gaagcaacag ttttatgaac 53941 gccttaggtg agcaaacgga agctggggta aaaataggga tgttaccacc ggaagctgaa 54001 aaaaaaatgc aatgctggat ccggagccga catctgatct gttccggaaa tttttttgtt 54061 tttgaatcag tggattacag tgcagttgaa cgattttcag agtgtattgc agcactggga 54121 ggggcgttat tatccattga acccattggc aaaatctgga tgggcgatca tcgtcaggtg 54181 attctttatc gggcgaaggc aagtttacat acgcctcatc ataccttaaa acaatactgg 54241 ctaaaatacg gtggttttcg cacaaggttt gatgagcgtg tttagtagaa taatggataa 54301 aagaattcaa gatcgggaaa ttatcaaaaa attaattact gattacgtca atgaagcttt 54361 aaccagaaat gacgtggaaa aataaataat ttttgataca gaacacgacc attatcaatt 54421 agtgaatgtt ggttggcgaa atcgccatcg ggtttacggt tgtgtgctgc atttcgatat 54481 taagaatgat aagatttggc 
ttcaatataa tggtacagaa attgattttg cagaggaatt 54541 aatcaaacag ggtgtaccaa aagaggatgt tgttcttggg tttcattcgc cttttatgcg 54601 acagtttaca gaatacgcag ttggttaagc aagacaatcc atacaaacaa ttcatgagcg 54661 atgtcaatga acgtaccgat tttgatagtc cctggaaaga agttttagaa gcttactttc 54721 ctcaagcaat gcaattcttc tttcccgaaa ctgttgcatt gattgattgg gaacgtccct 54781 acgagtttct ggataaagaa tttcaacaaa ttgcccgcga agccgaacaa ggcaaacgat 54841 acgctgataa attagttaaa gtttggcgta cccaaggaca agagctttgg ttattggtgc 54901 atgtcgaaat tcaagcccaa ccggaagaaa actttgcaga gcggatgttt acatacagct 54961 ttcggatttt tgaccgcttc caccagcctg ctgttagttt agcaattttg tgtgacgcta 55021 accgtcaatg gcgaccaaat agctacagtt atagttatcc cgatactcgg ttgaactttg 55081 aatttggaac tgtcaagctt ttggactacg aaagccgttg gacagaactc gaagtaagtg 55141 agaatccttt tgcaaccgtg gtgatggcgc atttgaaaac acaacagaca cgccaacaac 55201 cccaggaacg caaaacttgg aaatttagcc tgattcgccg actctatgat ttaggcttgc 55261 aagagcagga tatccgtaac ctatatcgat ttattgattg ggttatgata ttgccaaagg 55321 cgttggaagc agagttttgg gaacagttca aacaatttga acaggagcgt actatgagat 55381 acgttactac aggtgagcgc attggctatg agcgcggtaa acaggaaggt aaacaggaac 55441 agacacagac gctcattcta cgactattac aaagacgggt gggagaatta tcattagaag 55501 tgcgatcgca catccaatct ctcactcttt ctcagctaga agaacttggt gaagccttgt 55561 tagattttac cgcgatggag gatttgctca attggttgca agcaaatcaa agcgagtagc 55621 accttaccat ataggcttcg ccacatcttg aatcacaaag aatgttgtca ggaaacattt 55681 ttccacgcca gcgtattaag ctcagtatat atattgatct caaaccgcaa gctggaacaa 55741 tgataatttg gacacccaat caaccactgc aaaacggacg ctttatcatt caaaaagtcc 55801 ttggtagcgg tggctttggt attacctaca gtatcctcga acagcgtacg ggtaaattat 55861 ttgtcctgaa aaccctcaac cacattcagc aaataagaaa agacttctcg gaacgacagg 55921 taaaatttgt ccaggagatg acgcggttag cacgatgcac tcacccccat attgtgaaat 55981 ttgaggatgt catccaagaa gatggactgt gggggatgct gatggaatat attgatggcg 56041 tggatttaaa gacttatgtt gatgagggtg ggcaattgtc agaggatgaa gccttacgct 56101 acattaacca aattgggcaa gctttagaat atgtccatca acaaggattc ttacatcggg 56161 
atatcaagcc tcataatatt atcttgcgcc gtggcaaaca agaagcagta ttgattgatt 56221 ttggtcttgc ccgtcagttt agcactggtg aaaaatccat cagcatgacg agtgatggaa 56281 ccgagggtta tgcacccatt gaacagtaca gaagaaaggg caattttggt gcgtatactg 56341 atgtttatgc tttagccgca acgttgtatt ttttgctgac agctgatgcg ctcaaagctg 56401 ctgacgaaga aatagtttca gatcttcgtc gcaagtatga ggatgaagaa ttaccaccgc 56461 caaagcaatt taaccccgag attagccata gagtaaacga ggcgattctc aaagggatgg 56521 cattggaacc acaagataga gtgcagacgg tgcggaagtg gttataattg gtgatgccca 56581 gaaaagtaaa tcctccaatt cctcaagtat caatctccaa tccacaaccg aaaatacaaa 56641 atcaacaatc aatttatatt gcaaccccag tagtaagttc aaaacagtct tctcaaggaa 56701 ttttgaagtg gctagaagga attataccca aaaattcaaa atcagatgat gagattaggt 56761 taatcacagc gaagatggac tacacccaac tgcgtgactt actcacagca ggaaattgga 56821 aacaagctga tgaagaaaca aggcgagtta tgctagcggt agcgaggaga gaaaaagaag 56881 gttggcttaa tggtgaagat attgatcact ttccctgtga agacctccgc acaattgacc 56941 aattgtgggt aaaatacagc aatgggcgct ttggcttctc tgtgcaaaag ggcatttatc 57001 aaagtctggg tggaaccaga gaatacgaca cgaagatttg ggaagctttt gccgacgcag 57061 tcggttggag gttggtgttt ggggaggaag agtggtggag tggcgatatt ttttatgact 57121 acaggatcac acccaaggga cacctccccg ttgggggtct gtgggggtgg ttagcacggc 57181 accctgttct ttgtcttctt ctctcacgtc gagacttgta gaatttaaca tataaggctt 57241 acagattttt attaagattt atatacgagt cactttgggt cgaagttttg agggttctgg 57301 ctttggggtg ttattgatat catatagata atgattttgc acagcgttgc aggtcaattt 57361 tctactaaat ttcactatgc ctgcaaaaga tatctttcac aacgcagtca aacatgcact 57421 tgaaaaagat ggatggttga ttacagacga tccaatatat ttggatttcg gcggagttga 57481 aatttatatt gatttaggtg cagaaaaaat aattgcagca gagagagaag gagaaaaaat 57541 tgccgttgaa gtcaaaagtt ttattggagg ttcagcgatt tctcagtttc acacagcttt 57601 gggacaattt attaactatc gaactgcctt gaatcaagag caaccagaac gagaattatt 57661 tttagctgtt cccaacatca cttatgaaac attttttaaa cttgagctag ttcaaattgt 57721 tatccaatct caaaacctca agctgctgat ttacgaacca gaacaggaag tgattgaacg 57781 atggataaga tagttcaata tcgggaaatt atcaaacgat tgattactga 
ttacgttaat 57841 gaagcttcaa ccagagatga cgtagaaaga caaatgattt ttgatacgga acatcaccat 57901 tatcaattag tgaatgttgg ttggcgaaat cgccatcgag tgtacggttg tgtgctgcat 57961 ttcgatatta agaatgataa gatttggctt caatataatg gtacagaaat tgattttgca 58021 gaggaattaa tcaaacaggg tgtaccaaaa gaggatgttg ttcttgggtt tcattcgcct 58081 tttatgcgac agtttacaga atacgcagtt ggttaagcaa gacacatcat atatcaaaat 58141 ggagaagcca gagatttact ttaagagtgc aaggaagttt aagcaggcgt tgttagatgt 58201 gttgtaacga gtttaaattc cctcacaata gtgtgttcca taatttgaac aggagcgcaa 58261 tatgagctac attattccaa actacattcc aatgagcgag gttagtagta atgttgaagt 58321 cagttgaagg agtttttaaa aacggtgcga ttgagttatc tgaagtacca tccgatgtag 58381 tagaaagccg cgtcatcgtt acttttttag aagcaaagcc agttcaattc actccacaaa 58441 taatgtactt cggtatgttc gctgattcca atcaacaatc aaccgaagaa gatttcaaaa 58501 tagcagaatt tcatggtgat tttggcgatg aattagactg gtcttagata aaaatgaaat 58561 atgttattga tactcatgct ctgatttggt tcctagaagg taatccccgc ttgggaagta 58621 acgcgaaaac aattctttcc aatcctgaga gccaattaat tatacctgct acagctttag 58681 ctgaggctgt ttggattgta gaacgtggac gaacatcgat ttcttcagca attgctttac 58741 tttcagcagt caatgcggac actcgtattg tagtatatcc attagacaca aacgtcattc 58801 aacaaactat caatttatct gctattgcag aaatgcacga ccgacaaatt gttgcaactg 58861 cgatagtttt ggtcaatcaa ggagaaactg ttgttttgtt aacctgtgac caaaatatta 58921 ccgcatcagg attggtaact attatttggt aacttcaaat aatgattttt tcaattattc 58981 ttgatttgct tcagacgaac tattaccacc ttctttttga aactcttcta taactgactt 59041 aatttcctct gcttcaacat ctgcttgttc acttttggag tagattagca ggaggataat 59101 gcttgtaggt gattcaagct aatagatcag gcgataacca ccacttttac ctttttggat 59161 atcactgtta cgaactcgta ccttgaaaac agtgtttcct gttccagata tttgatcgcc 59221 aacaaagtct cctgcttgta gttgtgcaat tatcggctga atatcagatc gaatgtggcg 59281 atacttttta gatagggtgc gtagtcggcg ctggaattca tctgtaaaat taacttgaac 59341 tgagggtggt tcactctgca tcaattccat cccacagttc tgaaacaggt ttagtttttc 59401 caacttttgc ttcttgcaat gaagtgcgaa gactgtttaa cacagattct tttggttcat 59461 catcttcatc catctcatct tccacaaata 
gcacaatcac tttaacgcgg ctatgctttt 59521 gaactgtcag cggttcatcc aaacacagtg taccgttttc gtcaacgctt cccatgactt 59581 cttgtgcttt catttttcct cccaaagtct gtcagcttat cttcttataa tattttgcac 59641 caatctcaag aaagttatta ttactgtata tcgagtctag gatggagcta agaatgacta 59701 taataaaaaa taaaaaccgt aaaaatacat ataaaaatta ccaatgcctg ccaaagtcac 59761 cctcacaatt actcaaggga aatcaccagg acggcaatat actttcgact ctcgcacgac 59821 ctgtataatt ggcagagcaa aagactggaa cccgcaacta ccagatgatg atgatcatct 59881 taccatttcc cgctatcact gcttattaga tatcaacccg ccagatattc gcgtgcggga 59941 ttttggtagt agaaatggaa cttacgtaaa tggtaaaaaa attggacaac gccaatcaaa 60001 gcaaactcca gaagaagctg caaagttgga attaacagag tatgatttgc aagatggaga 60061 tgaaattaaa ctgggtaata cggtgtttgt cgttggagtt taagttgaat tagaatcact 60121 taaaattcct cctttcatcc caggtacagt taatattcat catgtcaacc aacgcccaac 60181 acaagcacca ttcttcttag aattgattaa acgcttgttg ggtttagcag agaaaggtga 60241 cgaaaatctc atagcaattc gtggttatag cattatccaa ctattaggaa aaggtggttg 60301 tggcgaagtt tatttagcgc agcataatca aaccaacaat tttgtcgccc tgaaagttat 60361 gttacctgca gttgctgcaa atgacagggc aatacaaatg tttctccggg agacagaaaa 60421 tacgaaagcg ttgcgacatc ctgatgtcgt gaaattgctt gattatggtt actccgagag 60481 cattttcttt ttcacaatgg agtattgtga aggtggtact gttactgact taatgcagcg 60541 acagggagga aagttatcaa tagatattgc tctcccaatt actctccaag ttttaaacgg 60601 tttagaatac acccacaacg cagaaattcc taatgtcaaa ttaggtaata gtggatttgg 60661 taaaggtagg ggtttggttc accgcgatct caaaccaagt aatatttttc tgggcaatgt 60721 tgatggtaaa ctcacagtaa aaatcggaga ctacggttta tcaaaggcgt ttgatttggc 60781 aggtttaagc ggtgagacca gcgctgcggg agggtttccc gacagccagg cgactggcga 60841 acccggaggg tcaaacttta acaggttata aagcaggtac tcccgttttg atgtgtcgcc 60901 agcaactctt aaattataag cgtgttcaac cagaagtcga tgtgtgggct gctgctgctt 60961 gtctctacta tatgctgaca ggattttttc ctcgtaattt cactgacaga gacctcttga 61021 aagcagtatt agaaaatgac tgcgtaccta ttagccagcg tgatgcaagt atacctcaac 61081 gactggcaga ggtgattgat ttagctttgg tggagaagcc agaaatttac ttcaagagtg 61141 cgacggagtt 
caagcaggcg ttgttggatg tgttgtgata ttaaccactg ttcactgttc 61201 actgttcact gttcactgtt cactgttcac tgttcactgt tgtcgcctag cattcaaatt 61261 aataacttgc tccaaccgag ataaagcact cacagccgac tcccgtacat aagaatcagc 61321 agactcatca ttcgtcaaac ctttcaagac ttctattgct cgatcatctc ccgatgctgc 61381 gagtgcgttg acgactgaca ccgccaccgc tacattatct gtcgttttca gcgcctcatc 61441 caagatatca aaagcaggag aaccaacctc acccaacgcc atcacagctg caatatgaac 61501 cactggattg gggtcattga tggcagtttg caatccttgc aaaccttcag ttggaaaggg 61561 aacatctgga tagtttaccg caatttgtgc cagcgccttt gcacagctac cccggacagt 61621 cacattgttg ctgttgagta atgattttac cagagatggt acgctatcta caccaatgac 61681 acctaaagtc ttgactgctg ctcgacgaag ggtgacatct tcatcattga gaacactcat 61741 cagacgagga atcgtatttt catcacgatt ttcagccaac tcgaacatcg cccgttctcg 61801 caggtggggg ttggggtgtt tcagttgttc aaatagcgaa tctactgtca tgggtttatt 61861 atgttgctgg atgcaggagt gtgagttgct ttggctcgaa tcaaccagtt ttggtcattt 61921 gctgcaattg cgtgagcgtt ggtatcgccg agtttctcca gagccattaa tgcagcatac 61981 ttcagatccc agatttttgt ctccagaacg gctttcaaaa gtggaatagc actcgtatcg 62041 ccgaattcac ctagagcgat cgccgccccc gcccgagatt tctgaaactg aggttcgcga 62101 ttgttcaaag cttcaacaag caaatcgtaa cccggagcat acttgagcca acccaagagt 62161 ttcatcacat ggtaatgtgc tccgtagtca ttgtaagctt cagcggcata ggttgccatc 62221 agggcttcac cagcactatc gggatacgcc tctaatatag tctttgtcgc cagataacaa 62281 cgcccaaaat cagtatcata aagttctcta atcaaaagtt ccagtgttgg agtctgctcg 62341 taagcatgta ccaacttcaa atctcttgga tggtcaaaaa tgacttgctc tagtgagggt 62401 tgaatatctg taaaggcgat cgctcccgct ggaataccag cttctgccaa caaacgaatc 62461 ccgcgcagac gaaacacaag tgataccgga cactgggcaa tagtcggaat agcagcgtaa 62521 taccgagcat cgatcaaatc ttggatacac aaccgacgcg catacacatt cgggtgttgg 62581 agcaaggcaa gcactttctc aatgggcgag tagtccccag tgaagcgaca aacggcagag 62641 atagctgcac ttgctgtggg tttgtcggta gcatccacaa atttacggat gcgttctact 62701 gctggcggat aatctaactt tgccagggta tggataatca ctcgataggt ttgtcctggc 62761 ttttccagca attgagcaat ttcttcgagg atatctgagt cttgggtgcc aatttcacca 
62821 attgcccata ccgcattttc cacggtatag cagtcttcat ctgccaagca ggcgcgaatg 62881 gctgacaaag caaccttggc ttgcaatttc cccagcgact ctactgattt ccgtcgcacg 62941 atgcggttat ccagggaggg gtcggtgttt tgcactgccc gaatcagcgc gttgattgag 63001 cgctctgtcg cgaagtttac caactcggaa gcagccacgt agcgagagtc atcctccgcc 63061 aactggtcta gaggcgtatc caagaggcta atagcttgtt cttcggttaa tccaaaaaga 63121 ttggaaaagc gcttatccat gagttattag ttaacagtta acagttaaca gttaacagtt 63181 atcaatttga agaagaattc agaatacaga attcagtatt cagaaggaag aaagaaagaa 63241 tttagttttc cgacttccga ctccttcact gataactgat gactgataac tgataaaaaa 63301 gatgaacctg gggattctca aaccaatact taacctttct gggcaaagca ctagattaag 63361 aatccccagg actgacgaac ctaagtccaa tcagctatga gagattatga acctatgaga 63421 gggagttgat tacatagtcg aggagattgt tgtactcaac caaagcttga gcagacatgt 63481 cgcgaggagc acaaccacgg ttacgagcga atctcagtgc ttcaacgtaa ggagcggtgg 63541 gcagttctaa agcgcgatag acttcacgac caccagcaat accccactca tccaatggac 63601 cagtaccgcc aactaccaag ctgtactgga tgaggcgcat gtagtgcttg atgtcgcgca 63661 agcacttgtc tttcttaact tgagtgccgt tagcttcagc aggttcgttc aggtaaggat 63721 acttcttgat agaagcatcg taagcttctt tagctacgtt gtcgagattg ccagcgagct 63781 tttcagcagc ttccagacga gcaccagcgc gttggatgct accttgaacg gattctaggt 63841 cagaggtgct ggggaaacga ccagccgcat cagcagccgc gatgactgtg gtcacaactg 63901 atttcatgtt ttggtatctc cagaaataga tatttaagcg tttatttcaa ttcagttcgt 63961 taaattgctt tggctattgc ttagctcaaa gcagaaataa cgcgatcaaa gtagctagct 64021 gcttcagctg ttaaagcaga gcaatcacct tgagtagaac ccattttgcg gaatctctta 64081 ccaccatatt cttctgtatt ggtgtcgttg atgtgagcca gtgaactagc cttcatgatt 64141 tgaacagctc tgactgtgga gccggtgggt acgcccagag ccgcataggt ttctttcaaa 64201 ccattgagac agcgatcatc aagaacagaa gcatcaccag ccaacaaagc gtaggtgata 64261 taacgcagga cgatttcacc atcgcgcaga caagcagcca tgcggcggtt gggatagcag 64321 ttaccaccag cttggatcaa gccttggttt tcgcaaatca tcccagcgac agcatcagaa 64381 acgatacagc tagcattgct ggcgatcgcg ttgacagcat caagacgctt gttaccttca 64441 gcaataaaac cacggagtgc gccaatttca cctgtactta 
gggtaccagt tttagcatct 64501 gctccgacaa cggctctgga aaatgagtcg agcattgaac tctccttgtt ttaaaaattc 64561 ttaccttgtt ttggacaaga catctctgaa gatagaagaa gaggtatgcg agcgtctcag 64621 ttatgagaaa taaagtaata atctgtaaca attgctatat tttgaagtca agtttaagcc 64681 aaggaaagtt ggctcttagc cctttgatag ttatcagttc catcccttgg ctgacgtatt 64741 gtctgaaaag tttaggcaag atatattttg tgattttgta gaggcgtaag cgccttcatg 64801 caggatgctt agtcctgatc tggtttggtt atggacttta gactaaaagc ccatgaacat 64861 ttgaactact ctatctgggc gatcaggtgt ttaatcacat tctctttaat cattaactgc 64921 cgcaggactt ccaaaagcag cccttgagcc tgcttctcat tcagtccttg cacctgttgt 64981 tcgtatgttt tcaggctaaa ctcttgctct aagctaagtt ctatcggaat ttccctcata 65041 attgcctcaa atcgcgtgaa tttaatgaga tgtgaaagaa ccaataaaat tcaaactgga 65101 attgtcattg tcttaaataa atcttttggc agtgactttt ctaagtattt agaaatttgt 65161 cctcttgtta actaatataa caaaaaattg acaatataac cactttggat acttctctca 65221 gatggcaaaa aacatcaacc aatgcggtga aggtttgatc gtataagctg ccccgccttg 65281 tgaagagtgt aggtaattca ctccacataa agttcaagcc aaattatcta aagtttcgtt 65341 aagtttttta aacttagtct aatgaaactt gcattgagaa cagtaacgac tcacgttggt 65401 catgggattt gcacgttcaa acaaatcgca gcgatgcccc agagcaagtg ataggtgcga 65461 ttcgccttgc gatagcttcg cttcacgctc aacagataac cgccagaaga atacctaatt 65521 taaagttcgc ccaagccaag tgccgcaggc atacgctcat gctgagcgcc aagggcgcat 65581 aggaagaaga tataggactc ctccaaagat ttttgctcag ctacctacac ctcaaatagc 65641 cttcttaaga gttccctggt ccctccctac gcaagtcatt tcacccttcg ggttcgccag 65701 atgcctacgg agggaaaccc tcctgcagca ctggtctcac caaatcaaac cagattccca 65761 taggtcatct tagacaggaa acgaagtagt tctcaaaact cctcatcggc gaaaaattca 65821 gtttcgtctt ctaatctcga attcctttgc tcggaaccag caagacccca caggggttgc 65881 agcaatgcta caccaactat ccctatacca gcacaaaaag ctagcactat acctactgga 65941 atttgaatcg actgaaaagt cagaagtttt aaagagactg gttcagcgtt ttgtactgag 66001 aggatagcta ctcccagcac ccatgaagct ataactatcg atatcaggag attagcaaaa 66061 gttttcatgg cagttttaat tattctttat aactacagga taattcagga taattactgt 66121 tatgtcgtct tgtgcttatc 
taagagcttg tagtcagtat ctcttgtctc gtaaggcttc 66181 ttacattgta gctaatagaa cgtgagttcg acgaactcaa aaaatggcag gaggtagggc 66241 aggtaagtct tttggttaaa tataagcgtg atggtttttg tggtaaataa gtatatgcaa 66301 tgctcagtga ataatttttg tgcgacaaac ttagcaacta ttagaccggt aatattaacg 66361 ccccgatgcg taacttgact aacccacaaa ttgtcagaat tatctgttca tatttctctt 66421 gattaaggcg aaatatgaac agaggctact cgaaatatct taaccagacg aatgatatgc 66481 tctacgaaaa tgcgctcggt tgataattct ttgtttgaca gatagtttta cctttcgacc 66541 tccacctgcc taattgacac tcccctgcct aaaggcgagg ggattctaca ttcatcgtca 66601 gaactt // LOCUS NODE_273_length_64699_cov_4.973021 64699 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 64699) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 64699) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..64699 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..3019 /locus_tag="DP116_00400" CDS <1..3019 /locus_tag="DP116_00400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="protoporphyrin IX magnesium chelatase" /protein_id="PRJNA477356:DP116_00400" /translation="LKGGLGGSKIQKPVVGILLYRKHVITKQPYIPQLIRNFQEAGLI PLPIFINGVEGHVAVRDWMTTDYETQQRQQGNVEIASLSKEAVKVDAIVSTIGFPLVG GPAGSMEAGRQVDVAKRILTAKNVPYIVAAPLLIQDIYSWTRQGIGGLQSVVLYALPE LDGAIDTVPLGGLVGEKIYLVPERVQRLIGRVKNWVALRQKPVSERKIAIILYGFPPG YGAVGTAALLNVPRSLLKFLQALKDQGYTVGDLPEDGEELIRKVKEADELKWEDKQDE ENIVPSSVNVRTLEKWLGYRSTSRIEKQWKSLTSTGIKTYGDEFHIGGVQVGNVWIGV QPPLGLQGDPMRLMFERDLTPHPQYAAFYKWLQNEFAADAVVHFGMHGTVEWLPGSPL GNTGYSWSDILLGDLPNLYIYAANNPSESILAKRRGYGVLISHNVPPYGRAGLYKELV TLRDLIAEYREDPQKNYVLKEAICKKIVDTGLDADCPFEDAKRLGIGFTPENIRMFSG HAFDDYLVKLYEYLQVLENRLFSSGLHVLGEKPSEEELAGYLEAYFGNEPQRRRERRE EEEEEKRIRELLGQTTDELTNLVRGLNGEYILPAPGGDLLRDGVGVLPTGRNIHALDP YRMPSPAAFARGREIGQKIIAQHLQENGTYPETVAVMLWGLDAIKTKGESLGILLELV GAEAVKEGTGRIVRYELKPLAEVGHPRIDVLANLSGIFRDSFVNIIELLDDLFERAAE ADEPENQNFIRKHALALKAQGVENVSARLFSNPAGDFGSLVNDRVVDSNWESGDELGD TWKGRNVFSYGRQDKGQARPEVLTQLLQNTSRIVQEIDSVEYGLTDIQEYYANTGGLK KAAEKQRGKKVTTSFVESFSKDTTPRNLDDLLRMEYRSKLLNPKWAEAMANQGSGGAY EISQRMTALIGWGGTADFTDNWVYDQAADTYALDAEMAEKLRKANPEAFRNLVGRMLE AHGRGFWEASDEKLQKLRELYELTDEEIEGVTV" gene 3125..3364 /locus_tag="DP116_00405" CDS 3125..3364 /locus_tag="DP116_00405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002734968.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_00405" /translation="MENSFTAIFEKVDDWYIGYVQELSGANVQERTLEEARESLREVI ELILISNRELAEQKLSGKDVVREKITVKISSRDEN" gene 3343..3531 /locus_tag="DP116_00410" CDS 3343..3531 /locus_tag="DP116_00410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016514546.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="addiction module toxin, HicA family" /protein_id="PRJNA477356:DP116_00410" /translation="MVKRRELIRHLEANGCLLLREGGKHTIYYNPSNNRTSAVPRHTE IVDILAVKICKDLEIPPP" gene complement(3742..5112) /locus_tag="DP116_00415" CDS complement(3742..5112) /locus_tag="DP116_00415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015150092.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_00415" /translation="MYWGEGQLIHNGKYKIERQLGGGGFAVTYQAIHTQLNRRVVIKT PNLSVQNDPDYPKYLKRFKKEAQMLGECCADSHPHIVQVFDFFEEDGRSCLVMQYIPG ESLWQFVQNRGALPETEAVKYIRQIGSALVEVHKKNIFHLDVTPPNIMLSFKPGVSNS GKAVLIDFGIAGDMSPPSTLSRSFGNKAFAPYELVRKGIRHPTVDVYCLAASLYYAVT GQRPTNSFDRKFDNEELVPPQQLVPSLSNGVNQVILQGMALEAKDRPQTMQEWLNLLG SQQAIISNKLSSDMGVDYRNLEKLLKANLWQEADEETTRLMLKVAGREKDGWLDVESI NNFPCTDLSTIDQLWVKYSNAHFGFSVQKRIWQEVGGKPDIDMKIYIHLCKCVGWYRN SGWLNYDELTFNAKSPVGHLPGGLLIWLWISFEVGIERETTETKWWNIISSLSSRLAK CNIQCD" gene complement(5302..6348) /locus_tag="DP116_00420" CDS complement(5302..6348) /locus_tag="DP116_00420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319731.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_00420" /translation="MTWAPGQKLHRDKYEIKRELGRGRIGITYLATNRDGKETVIKTL NPDLLNQLGLEDRNYLESGFLDEAPKLARCQHPNIVLMIESFKEGDLPCIVMEYIQGD NLAKLVKSRGFFPEKEALGYIQQIGQALIEVHKQGFLHRDIKPENIMVRAGTYQAVLI DFDLARGFDSPLTSRGARVDGFTPIELHFNSATQQKRRGAWTDIYSLAATLYVLLTGQ QPVSAINRKDQNKRLTEPKELNNQISDRINNAIIQGMELEPEQRPQTVEDWLKELGLQ TRGFSLPKLLWIKPLWARIIGILTVLGLLAGIISGLDATINLRDKLFPKPLATPTSST TQDTPPSSSLPQKK" gene complement(6446..6781) /locus_tag="DP116_00425" CDS complement(6446..6781) /locus_tag="DP116_00425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742293.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_00425" /translation="MDTLNTYRRIIKDVLIPYTQIPYSHGDIQCKTVFDSENDSYLLI TLGWDGVKRIHGCLVHIDIIDGKVWVQRDDTEDGVTYELVAAGIPKDRIVLGFHPANV RPHTGYAVA" gene complement(6769..7185) /locus_tag="DP116_00430" CDS complement(6769..7185) /locus_tag="DP116_00430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877169.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty-acid synthase" /protein_id="PRJNA477356:DP116_00430" /translation="MPARDIYHDAVKNALLKDSWTITDDPLHLKWGQKDMYVDLGAQR LLAAEQGNKKIAVEIKSFMSPSEMQDLKDAIGGFVMYRAVIGRLEPERTLYLAVRDNI FTALFEEPIGKLLIETENLYLVVFNPNSETIVQWIP" gene complement(7217..7987) /locus_tag="DP116_00435" CDS complement(7217..7987) /locus_tag="DP116_00435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865801.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_00435" /translation="MHIVTTEPVISAQNLNHYFGEGTLRKQALFDINLDIYSGDIIIM TGPSGSGKTTLLTLMGGLRSAQEGSLKILGQEMCSASKKQLRDVRRQIGYIFQAHNLM TFLTARENVRMSLELHDEFLNQDMDGKAISMLESVGLGNRADYYPDSLSGGQKQRVAF AKRSRRELARALISQPRIVLADEPTAALDKKSGRDVVELMQKLAKEQGCTILLVTHDN RILDIAVDAKRLVARHRIIYMEDGHLISDGVDAAAKVG" gene complement(8053..8883) /locus_tag="DP116_00440" CDS complement(8053..8883) /locus_tag="DP116_00440" /inference="COORDINATES: protein motif:HMM:PF01135.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_00440" /translation="MSNDSTNAFNNETRDAWNTNASVWDARMGDNGNDFHQLLIRPAM ERLLEIKPGTRILDIGCGTGLTTRRLASLGAHVVGIDFAEEMITCASKRTQQHETSIE YHVLDATDETALLGLGERSFDAAVSAMVLMDMAEIDSLLRALTKLLRPGGCFVFAVMH PCFNSTHTSMAAEVKDCEGQLVTEYSVKVSGYLQPSTTKGLAIENQPKPQLYFHRPLH VLLGAAFRVGFVLDGLEEPAFPADDSSNSRFYSWSNFTQIPPALVARLRFSDELLRTS " gene complement(9008..9322) /locus_tag="DP116_00445" /pseudo CDS complement(9008..9322) /locus_tag="DP116_00445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195214.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ABC transporter" gene complement(9350..9967) /locus_tag="DP116_00450" CDS complement(9350..9967) /locus_tag="DP116_00450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00450" /translation="MSRQPLLDPNRSYTFSNYFELGFAIDDLVAEFGYSFERKFLHLP QYSGTLDRLLDLKQRIEEVLPYVDLENEATRREILIAPVVTELIHYSRAKLRIEYNIK VDNRLQGNLDYYLRTQTNLIVIEAKQADINRGFTQLATEMIALDRWTDSNQPEILGAV TTGNIWQFGILYRQAQRIEQAINLYRVTEELEVVVRILLAALVNL" gene complement(9990..10361) /locus_tag="DP116_00455" CDS complement(9990..10361) /locus_tag="DP116_00455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197200.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00455" /translation="MTDFSLKKQSFVKNDIPNRLHKLAENLSQIKNLCAEESHQESIL NLAKESRYFIEWTVPDMVQADIDQAAELVDLGRVLTRWLFDWEKIWSDSKEKTKIAQQ AEDWLKRVLEISRSQSESITV" gene complement(10358..10642) /locus_tag="DP116_00460" CDS complement(10358..10642) /locus_tag="DP116_00460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307368.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00460" /translation="MLEEHKFSIKTVVDVERRVIAGGGDMHYDCEQVLLDDGSTQENI WGASFMPVNQKITYDSIVNLRPRQNRSMEILDPNIRERVAQIIIEFLGKL" gene 11024..14427 /locus_tag="DP116_00465" /pseudo CDS 11024..14427 /locus_tag="DP116_00465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316688.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="class I SAM-dependent DNA methyltransferase" gene 14463..14660 /locus_tag="DP116_00470" CDS 14463..14660 /locus_tag="DP116_00470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872499.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2281 domain-containing protein" /protein_id="PRJNA477356:DP116_00470" /translation="MTIKEQITQELEKLPEPLLQEILDFVQFLQAKNQQRNIREITIM SESSLQKDWLKPEEEAAWQDL" gene 14645..14737 /locus_tag="DP116_00475" /pseudo CDS 14645..14737 /locus_tag="DP116_00475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118605.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" gene 14831..15433 /locus_tag="DP116_00480" CDS 14831..15433 /locus_tag="DP116_00480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357841.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00480" /translation="MPYSQFTIDRVKKDFRLTTVEGVRFFPDSIEPVTPSPRLQGILE DLPWAIAVDTEKARSEVIINPVLLEVRRIFNQQISVFSGEEFNVDPSIGLNGVCDFLL CRSPEQLTVEAPAIVIVEAKKSDLKSGLGQCIAEMVAAQQFNEAKAQPITAVYGTVSS GTQWRFLKLEGQTVTIDLMDYPLPPIEQILSFLVWMVKAG" gene 15544..16872 /locus_tag="DP116_00485" CDS 15544..16872 /locus_tag="DP116_00485" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00485" /translation="MPATTPAEVRSHLTDALQLDLVGPTPNDTTYACEVLTQPPSKWY LTGFLVPFGAPPEVRSDDTSNDELDQISNSDVADDAGTPETASARKALFPSSIGLSFL ISSETKELDVTVNWGDYVSKVGETELESTETDSESEQISTELWERIPQAVDVHIILPK DKQKKQLDVPGGSGLQLFVSCRAVRSQNLPKGTRSVSVFLVNYRTHNPNKKRDPSFAF QASLNIRTTKGFVPRPDLRGQNSSDWDEAVASLQYRNDCEFAVGHNVSAIALHSDNKC QEISTTWIPQAEVPRVEPADIAGLELSMEALADAADAQTVRRMINPMVTSYLKWITDQ QAAAPSEPQQTKVATDLLNRARKICDTRSVKPPAYRIALDSKHWMIHWFLRRFKLLTV RSLLPADNASPTTPFGFASFDSTETAKTRTCSPTTSHPKTFHPPNGDRFS" gene 16809..17915 /locus_tag="DP116_00490" CDS 16809..17915 /locus_tag="DP116_00490" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00490" /translation="MLTDNISPEDFPPPKWRPFQLAFLLLNLVSITNPENSDREIVDL LFFPTGGGKTEAYLGLAAFTLVLRRLRDPSIYSAGVSVLMRYTLRLLTLDQLERAATL ICALELQRQHNPKLGNHPFEIGLWVGQSATPNKMGSKRDKDENSAHARTHAYQRDDKA KPKPVPLERCPWCGAELTTNSFQLLPNPDAPTDLRLICVGSKKRADGKPACVFRRNNF LPLIAVDEQIYRRLPCFIIATVDKFANLPWVGQTGALFGKVTHYDDRDGFYGTGDPKI KGRSLEKRLPPPDLVIQDELHLISGPLGTMVGLYETAIDALCSVTNNDKTIRPKIVAS TATVRRADRQIQALCDVLSRYTLAGIMGASKSQN" gene complement(17912..19165) /locus_tag="DP116_00495" CDS complement(17912..19165) /locus_tag="DP116_00495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_00495" /translation="MLITYCYRIKPNPEQVVIMDNWLELLRRHYNYALGQRLDWLRRT RCQIDRCSTVSQPIGEIPDKVDYYTQQSDLKQTKQLFPNYKNIWAESQQVNLQRLDKA WKKWLFPDKSGKRGGRPKFKKSGQMRSFVFPRVNNPKAGAFLENGVLRLSKIGPMPVI MHRPLPVGFNLKQATIVKKADGWYVCLSLEDDTVPSPLPLDEVKSTVGVDVGLKEFLT TSLGETVPASKPYRTAQNHLARQQKFLSRKHKGSNGYKKLQNKIARIHQRVGRVRENF HYNTAHKLVKRYDLIAVEDLNIRGLARTPLGKSILDVAWGSFIHKLEAVAVKCSVHVV KVNPHGTTVDCSNCGAKVPKTLSMRTHECHKCNAVIDRDENAARNILQRALNAVGLMV SAGGGLGDAQPMKPEAWGWNGVQTT" gene 19202..19606 /gene="tnpA" /locus_tag="DP116_00500" CDS 19202..19606 /gene="tnpA" /locus_tag="DP116_00500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136513.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS200/IS605 family transposase" /protein_id="PRJNA477356:DP116_00500" /translation="MYQKGFRSVYSLTAHIVFVTKYRRKVINKEILEKLAQIFTSTCN KWECNLKQFNGESDHVHLLVSYPPHVQLSKLIANLKTVSSRLIRRDYSEYLSKFDKKP VFWTGSYFVASCGGVTIEQLKRYVEQQSSPEN" gene 19749..20453 /locus_tag="DP116_00505" CDS 19749..20453 /locus_tag="DP116_00505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496538.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00505" /translation="MPNIEGEIDELAINVRRAALDFETKWIPAIENKGEGVFISFHKQ VMDQWFKRDTVKKRAEDLNRGFLKWLDSKGIPRDKAAFPGVKYIMLHSLSHLLITAVS LECGYAASAIRERIYASDYGYGILLYTGSSGSEGTLGGLVEVGRHIDYHLGKALALGR LCSNDPVCAQHEPDNPREERFLHGSACHGCLLIAETSCERRNQFLDRALVVDTVEELD AAFFPDDIFPDDLVTS" gene 20912..23860 /locus_tag="DP116_00510" CDS 20912..23860 /locus_tag="DP116_00510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chemotaxis protein" /protein_id="PRJNA477356:DP116_00510" /translation="MTQPSSQKQAEQQLNQSINQPKGLQESAHPQQGRQELSLKTKAI AWAMLVSMVPVLTVGTTTYYFGSQLTAKQISQARLADTTGLAKAELALNKLLSLLLIE TGVTAVLAAAIATILGIRAIRPVLNAAAVSTTMVNRLRRESADSPTSTVSKDELAVLE TNINFVKEQLPDLLWKQEAEAECSQVFHNISRRIRTSLTQEDLLRTTVEEVRKALRID RVVIFRFDSNLDGTFVEESAASDLPKILWTTISDPCFNGGYVEQYRNGRVRAIDDIYQ ANLTDCHIGLLERFAVKSNLIAPILKNDQLFGLLIAHQCFEPRIWQQYEIDLFAQIAT QVGFALDYVKLLERIDTKADQAQAFIDITRRIRESLNEEDVLKATVEETRKALSADRV LVYGFDSNWYGTVIAESVIPGFPKALRAKIKDPCFAEGYVEKYQAGRVQATNNIYEAG LTACHISQLQPFAVRANLVAPILKDDQLFGLLIAHQCSGPRDWQQYEIDLFTQIATQV GFALDHARLLQRIDAEGVRTQLLTDITRRIRESLNEEDVLKTTVNEVRKALGADRVVV YGFDSDWYGTVVAESVVPGFPKALRAKIKDPCFAEGYVEMYQAGRVQATNNIYEAGLS DCYIGQLEPFAVKANLVAPILKDDQLFGLLIAHQCSGPRDWQQYEIDLFAQIATQVGF ALDHARLLYRVEQAYQSAEATSDEQREQKEALQRQVLELLRGSDTAVQTLSGEAKSQV ESLTGAYNQIKTLVDSAMIMVICAQQAELQEQQLSQIVQDGHESIDPILENFCDIQVR VMEAAEKVERLDQPFQKLSHIVSFISNIASQIKLQAMNTVLEASRTPEAGQQFAWLAD EALSQVHQLDASIVEIESLVAEIQTQANEVIPVMEYGAEQAMTGIRLAQETQQKFNQI VIISDQMKKLVEELVHTGPVQAKTSTSASQSILEVASIASKTSEQVMAVAKSLDKLLT IAQDLQEDAE" gene 24057..24343 /locus_tag="DP116_00515" /pseudo CDS 24057..24343 /locus_tag="DP116_00515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309420.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 25174..25419 /locus_tag="DP116_00520" CDS 25174..25419 /locus_tag="DP116_00520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651786.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00520" /translation="MASKLVSVREYTVKAHKRLIHTRVFNFLCKECGIPAKRETYGSR PLYCEQCRPPQPPKKSLMKPQKAKPRPMTYKSKTDLD" gene 25431..26525 /locus_tag="DP116_00525" CDS 25431..26525 /locus_tag="DP116_00525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317889.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00525" /translation="MTTKGKLEPPPQTLLEPLLELRDYYTALTEEYERLWQTARSQLV HVEALLSNWSGVDERDNLRAVVEMLSFAPTPSQQDSLSTTQYQSDVEQQQAIDSELQD DEDQQIDLDDVEQQQAIDSELPDDEDQQIDLDEESSDEENTVVTSPTPSPQKEIPTQN NDHSMPGDIPMLPQYQVLTRMQAIEKILRENAGSVCHIDFVVRSLFGDLEPSVFKIVK SRIQSSLTHGKEKSYWAAVPDEPGCYTLDLSLITPANGKVKSKTIKPQKKKPFLLPKS KRASMLPEYEGKFLIDAICILLQKNSGKIFSVADVITGLYGELNAEQLTEIKTAVHNE LSRGHRIGRFSRVPDKVGYYTWDLSKFRRK" gene complement(26712..28007) /locus_tag="DP116_00530" CDS complement(26712..28007) /locus_tag="DP116_00530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317879.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S1" /protein_id="PRJNA477356:DP116_00530" /translation="MKRIRSKKNQNLINLKIDALDKKINQLGKISEKKSLTHLFLPLI GVGIAFLSGCSINSSRNVQPSRETNTANAQPVQQSATPNNRPLTPTPEDNNFVVAVVN KVEPAVVQINTSRTVRSQVPEILNDPFYQRFFGRQIPAQPQEKVVRGVGSGFVINANG QILTNAHVVNDADTVSVSLSDGRTVEGKVLGQDKLTDIAVVQVPVNNLPTVELAKSQQ VKPGQWAIAIGNPLGLQETVTVGVVSATDRSISDIGASNNRVGYIQTDAAINPGNSGG PLLNARGQVIGMNTAIIQGAQGIGFAIPIDTAQRIAQQLITQGKVEHPFIGIQMVSLT PELKQRINSLSNSNVRVQADQGILIVRVLPGSPADKAGIRPGDVIQQINNQSVTTADT VQQIIDKSGVGANVQMQLQRNDTTVQVTVQPGPRPVNAQ" gene complement(28354..28593) /locus_tag="DP116_00535" CDS complement(28354..28593) /locus_tag="DP116_00535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318387.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00535" /translation="MTQEPRKPINLSVHESADPSVINPNPDKEASGNKNDLNDLNDSV HDEENVDVPIPGTFDDADDSNPVDRQLGIISRTAG" gene complement(28818..31454) /locus_tag="DP116_00540" CDS complement(28818..31454) /locus_tag="DP116_00540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740141.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycogen phosphorylase" /protein_id="PRJNA477356:DP116_00540" /translation="MNNGNGLKKHNGKHIPHWKKHICRTEHAAIQVEDDRTGMDVETL KRAFFDNLYYIQAKDKGWATAHDYYMALAYTVRDRLLHRWLKTVEQTYFKKDVKVVCY LSAEYLIGRQLGKNLTNIGLHDVARQVVQESGYDLYDLMEQEDEPGLGNGGLGRLAAC FLDSLSTLEIPAIGYGIRYEYGIFQQVIRDGTQVEVPDRWLRFGNPWEIPRPDYTMEV KFGGHTEAFTDEQGRYQVRWIPERTVLGTPYDTPMVGYNSNTVNTLRLWSAKASDEFN LQIFNSGDFANAVADKVFSENITKVLYPNDNNYQGRELRLQQQYFFVSCSLQDIIRNF LQHDDNFDNFPNKFALQINDTHPTIGVAELMRLLVDEHQLGWDKAWDITQKTFGFTNH TLLSEALERWSLSIFGRLLPRHLEIIFEINQRFLDEVRAKYPQDIEKVARLSLIEEGA DKRVRMAHLACVGSHAINGVAALHTELLTHDLFRDFYELFPEKFSNKTNGVTPRRWIL VANSKLALHITNKIGKGWIKNLEELKQLEAFVDDQEFRDQWRQIKQENKQDLAEYILQ TNGIHVDPNSLFDIQVKRLHEYKRQLLNLLHVITLYNQIKQNPHADVVPRTVIFAGKA APGYAMAKLIIQLINSVADVVNNDPDVGVRGSVPEASRLKVVFLANYNVALAQRIYPA ADVSEQISTAGKEASGTSNMKFAMNGALTIGTLDGANIEIRDRVGHENFFLFGLTTEQ VSQMKAQGYHPWDYYNSNPQLKQAIDQIASGYFSKGDRDLFKPLVESLKNRDDYLLFA DYQSYIECQQQVSEAYRNQDKWLRMSILNAARTGYFSSDRTIREYARDIWHVQPVPVI LEDDQQENAAIKHRSTTSINSQ" gene complement(31616..32914) /locus_tag="DP116_00545" CDS complement(31616..32914) /locus_tag="DP116_00545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310155.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00545" /translation="MAFDRRLDTFSLAALLVSAHYGLGFLLGTAEKSLTLGAAGSLYT VSLGLGTIALLGLAKFYWKRAEPIWTLLGSAYGDGVKVFVSLMSWSSLIGIEAVQLIS GAFILKVFGIATLPTMVVLAILFAIISLLPMEKAGWILRGLLILNFLALVYGLWVLHG FGDYVRSPIEFASSLKHMSLPTIVGISLSTILLVPIDMKYQQFLLQAKDVNSLYQGCT LAAILLLLLAFLPSTVVVAAHNAGIFPAGIDGKETLPFILAWVGGGTDKPLGIALIIS LLVPALGIGSSILRVQSKTILDFNILPASVWSRLLVTAANALFGLAVALKGGEIVNLI VYFYAAYVGGVFAPFVAFLLAQTGRYNFSKTSVKLSLMTSSFSSISLLLITLINPAFL GFGSVELNIMGIGILSGVLFLLIGEILQKYFLISKVREEA" gene complement(32940..33188) /locus_tag="DP116_00550" CDS complement(32940..33188) /locus_tag="DP116_00550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873449.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00550" /translation="MPIKNQTSLYTDEVNFFPDHQFRLIGECAGKKLLLIGRTKAYND PIVATSQTDEPSQEDLYAYDLYELMKFSHEPVKIHGEI" gene 33778..34131 /locus_tag="DP116_00555" /pseudo CDS 33778..34131 /locus_tag="DP116_00555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998751.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="sodium:proton antiporter" gene 34350..35441 /gene="trpD" /locus_tag="DP116_00560" CDS 34350..35441 /gene="trpD" /locus_tag="DP116_00560" /EC_number="2.4.2.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875302.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="anthranilate phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_00560" /translation="MITANVDNPLDQDSSNWSALLQQLLERQSLSVSQASNLMSGWLK EAIPPVLSGAILAAIQAKGVSAEELLGMVEVLYSQSNKPTQRDSIVGTSPLVDTCGTG GDGASTFNISTAVAFVTAAAGVKVAKHGNRSASGKTGSADVLEALGVNLKASLEKTQE AVSAVGMTFLFAPDWHPALKVIAPLRKTLKVRTVFNLLGPLINPLKPTGQVIGVNSPA LVETFAKVLNQLGTRRAITLHGREKLDEAGLGDKTDLAVLSNQQIHLLELSPQELGLN PAPISELRGGDVEENAEILKAVLQGKGTGPQQDVVALNAAFALYVGEVVPDQGDEYQT FSQAVIVAKEILQSGLAWKKLEQLAQFLK" gene 35869..36798 /locus_tag="DP116_00565" CDS 35869..36798 /locus_tag="DP116_00565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012626019.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_00565" /translation="MLKQFSWNFNNFPLGKKLTVLLLVIFIAGITLSGIALSGILNYK AQDEVSSNGKLLFKTINSVRSYTNDEVNPELEARLGKDEFAAQTIPAYSARKVFEKLR NEDDAYKDFFYKEAMLNPTNPRDKADSVETELIQKFRKEKNLKFLSGFRSFDQEQFYY IARPLAITESSCLRCHSTPDVAPKKMIQIYGTEHGFGWKLNLINGVQIVSIPASQVFQ KANQSFLLVMGIVTIIFAIAIYVANFWLKRYVVQPIKRVVRVAEAVSTGDMDAEFEKV GNDEVGSLVEAFTRMKISLVMAIRSFERYRGGN" gene 37058..38170 /gene="cas6" /locus_tag="DP116_00570" CDS 37058..38170 /gene="cas6" /locus_tag="DP116_00570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311112.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CRISPR-associated endoribonuclease Cas6" /protein_id="PRJNA477356:DP116_00570" /translation="MARKLQNKTIDYKSNLNWSSETELVGLVFEFVTQADASIYPQYT IGLHAWFLDQVRSYDPELSGYLHDGESEKPFTISALDGELVSSGRLVQLSANNSYYWY VTVLSSRVSQWMEQWVQNLPKELNLRNASLQIRSCNIAHAPTTYTQLLNSEHGETVTL KFLSPTSFRRKGHHFPLPMPTNVFHSYLRRWNDFSGMFVDQEAFLAWVDENVLITRHQ LTSMKVLAGKKGAVTGFTGTIEFALTKEASRQPDFCKLFYALGKFAPYCGTGHKTTFG LGQTRLGWSSQAAPEVPDVQSLLARRIEELTDIFKSQRKRTGGDRAEEIASKWATILA RREMGESLQAIAQDLEMPYETVKTYAKLSRRALKSG" gene complement(38220..38864) /locus_tag="DP116_00575" CDS complement(38220..38864) /locus_tag="DP116_00575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00575" /translation="MTMVEFSSESLTDAEWDIAHAIAQTLVKENTDINELGKAVAYLR TVVNKPDATSRFFKYLKTLVSNGRVIGHSGRTSDYYRSIEKACSDSLKSVGNAYTILQ ILGWVSRLMRYYKDAGVPIGEIAINTPSPVESSRQAEIAKVVQSQDFQVDQILEAKVI KINGNKVTYEILGAIKLTEKEPKKASVLQEGQTVKVKIVSMKEDGSIKSVKYCD" gene complement(38867..39931) /locus_tag="DP116_00580" CDS complement(38867..39931) /locus_tag="DP116_00580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311120.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CRISPR-associated protein" /protein_id="PRJNA477356:DP116_00580" /translation="MTGNRPQRPQQPNRSNRPSSTSSSNPELAPKPYEFVSFPKERPN LQRPVGHHKYLSDRLHGTLHLTLKVQTSLHVSTGVVVMGSDIGSRIPLIKTMIQGVDQ KLTIGGSSLKGCIRSVYEAITNSTLAVVTPKYKSQIPTERLPCRNKEELCPASRVFGA LDWQGLLDFNDAKCESTGFSTGFMPSLYRPRPDESSAYFIQGRVAGRKFYYHTIRAID KGQNAGIPVQQAPREYTFTTQLHFKNLLPEELGTLFIVLGQDPKNPIALKVGGGKPIG MGTMTVTVEKILQAQNLKQRYSSYNLNDSDEMTGEKLKKFIQEKIQTAHSRLIQKPQL EELTAVLRYPTDREPPSGMY" gene complement(39928..40455) /locus_tag="DP116_00585" CDS complement(39928..40455) /locus_tag="DP116_00585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00585" /translation="MKPFVGYKKELFSVPQLLEFIPKFISEPSYYFLRWTHEVSGIVE KPPTNDDFPMIEGQMFNHKCELRWKYKRQNTYEVLLLTIADNHDEFIPVKETINKEEK EPWRIEPNNPPYGYPAYAYRPDETRFPKKLIFPESLDIRQEENNKPKLAQRYFIDNET STVQFIALTVEVALP" gene complement(40452..41417) /locus_tag="DP116_00590" CDS complement(40452..41417) /locus_tag="DP116_00590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CRISPR-associated protein" /protein_id="PRJNA477356:DP116_00590" /translation="MHKRFVNHCTIDITLIPDDPILIKSGKEGADPTKPDMEFVETYH AGGRSIYLPGSSLKGAIRAHAERIVRTVGKDKRDSDNWNLLWANDPLNDKYDYLKSKD GKDLPAPEIYKLSSFTDQMFGNTSIASRVRIEDAYPDKSQPLKIEERNGVAIDRVFGS VAVGPFNYQVCTAGEFHTKIHLKNFTLAQLGLIGLVLRDLDDGWFGLGFAKSRGLGTV QVKLNKAVVQYPGCILSEDKQQICALGSEKKWSKTFLLGAGEFLELQEAQLYGFPTQD RQDTPATAQEMDLGFGVQLTWREGTVKDLFERSVRSWSHLLMGAK" gene complement(41449..42069) /locus_tag="DP116_00595" CDS complement(41449..42069) /locus_tag="DP116_00595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197828.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00595" /translation="MAFSSYKTIGEVLKAFQVIYTEANFVGEVEFKIPDYFREDLETM MRDGVVDSSEFAICENLIYPVLKEIWKCYRSKFILWSHHSLNYDEKLSGFPEYILAKR SPLGKVVFDKPYFILVEAKQDNFEAGWAQCLAEMIAAQRLNDEFQITIFGIVSNGDRW QFGKLEAEVFTRNITFYTIQEIDKLFAVVNYVFQQCELQLNNLVAA" gene complement(42072..42974) /locus_tag="DP116_00600" CDS complement(42072..42974) /locus_tag="DP116_00600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197829.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CRISPR-associated RAMP protein" /protein_id="PRJNA477356:DP116_00600" /translation="MFEIFKNRLKITGTLTTITALRISAGRSTEPIGSDLPVIKDALG RPLIPGSSFKGAMRSRLESFLRGINPNFAANPAIEAEWSITNERLNGKNGIKEEVEEE LKQYPEKERNAKRDELLTKKIIDETDLASHLFGSPWLASKFQVRDLTIVPDTWFGQYQ ERDGVAIDRDTETAAEGKLYDFQVVPAGTQFEFQAVVENAEEWELGLLMIGLHQFQTE QIPLGGGRSRGLGVVKLDINEMLWFDYPEDQPQLLLDYLKKLVMGDKKAYEDARDFKD HWVQKLIEHLESKASHKNNTEAKR" gene complement(42980..43627) /locus_tag="DP116_00605" CDS complement(42980..43627) /locus_tag="DP116_00605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006278675.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR02710 family CRISPR-associated protein" /protein_id="PRJNA477356:DP116_00605" /translation="MSLSIESKTELQIQKGIRQAEDELVIWIQVINSRGQIDRNFDSS NGTQGHGYEIVQDLLRNAERRAAQERSDDAVGRLYRALELLAQIRLLKSYGIRTGDVN PQQLPEYLQDEYEKKRSPVKGLIQLSLRSSYELLNQLPNDPIGQFYQESANKIINALE VRNNSLFAHGFQPITSNNYQKVSEVFVNFIQSALTAVIPPKLQLQPPQFPNHLEI" gene complement(43624..45006) /gene="csx10" /locus_tag="DP116_00610" CDS complement(43624..45006) /gene="csx10" /locus_tag="DP116_00610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741239.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CRISPR-associated RAMP protein Csx10" /protein_id="PRJNA477356:DP116_00610" /translation="MKRIKLEITTLSPLAIAGKKPGSVSEAEDHIPGSVIRGAIASQI LQLSNQQTPRIAEQNFTAGGGDFEALFLGDEPAIFQNAYPAVAKIADESNSTKKKPDL ISQVVTDEVMVLPATAVSSKTNSGFKPKGNGVFDTLIDRFCADAYNFPYDPSDPKSLE EKTDARVEPFGGFYSKTNDSEILYKYRSHSTTTRFLTRVGINRRRATSEDDILYSIEV LNESFLENPQARFKNWRPVIFRSSVLVKDAELAKSLENFINQNSHFFRLGGATSRGLG KVIIKAEVEDASADVEARITEFKNKLQQRWELWSVFGKPEKNLLENRTYFTLDLQADA ILTENWRHTTVISPQMLCQAVELNDEFSQLSKEEQEDFLKLEVAYSSYNYRSGWNAAW GLMKDIELVTNRAAVYLFSTKEPGLWMEKLKELEWKGVGDRTSEGFGQVQICNPFHLV LRDELIEDTV" gene complement(45003..45146) /locus_tag="DP116_00615" /pseudo CDS complement(45003..45146) /locus_tag="DP116_00615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864706.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="CRISPR-associated protein Csm3" gene complement(45145..45414) /locus_tag="DP116_00620" /pseudo CDS complement(45145..45414) /locus_tag="DP116_00620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741241.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type III-B CRISPR-associated protein Cas10/Cmr2" gene 45944..46177 /locus_tag="DP116_00625" /pseudo CDS 45944..46177 /locus_tag="DP116_00625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653337.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(46376..47614) /locus_tag="DP116_00630" CDS complement(46376..47614) /locus_tag="DP116_00630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872545.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_00630" /translation="MLPELNPGTSINSRYQIQRLLGQGGFGRTYLAVDTQRFGDYCVL KEFVPANTAEPVLLKSRELFEREAKILYQLNHPQIPKFLAWLTEKQRIFIVQEYIEGD SYSRLLGDRLSIQKKPFSEAEVIQWLLDLLPVLEYIHQRNIIHRDISPDNVMLSRKLS KPVLIDFGVVKQKFTQILAGDSSNPSHSVAGSVVGKVGYSPPEQIRMGRSYPCSDLYA LGVSALVLLTGKMPRLLLDQSFQWQWKSYVKVSDFLTQILEKMLAERPAYRYQSAREV LNLLQSQSSSGGINVTPSHQNVRIHGDPVNKVGLPETKLPKETKESSRKSQKPIQNTE KANELPAENTTSIKPEFVEYCRRELTSFVGPFASFVLEDTLDKNPQITPEQLVEVLVE KIPDQRRGQEFKKRLNLPRH" gene 47641..47841 /locus_tag="DP116_00635" CDS 47641..47841 /locus_tag="DP116_00635" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00635" /translation="MQIEFYSSVSYLPGYTKLQLTTVNRSVLRAPGAQEQRQNCFSHY RNIGNALSSKSMIANKVDCPTA" gene 47939..48208 /locus_tag="DP116_00640" CDS 47939..48208 /locus_tag="DP116_00640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744069.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_00640" /translation="MSSELKAATTAADAQLSNDLPTLKKGLQTEGVRLLQQILILRYK YKITFDANFGDKTEDAVKDFQRKYNLSPDGIVGVKTWRALGVNIA" gene 48979..49797 /locus_tag="DP116_00645" CDS 48979..49797 /locus_tag="DP116_00645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00645" /translation="MNDQEMKGKISRERDYLLQRLWDARVIVSLVLIAVLVRVGYGDN RLAPTQVVNTYDVTTKTNNFIGETVTVRSQPIKKVGLASFTVTDQRLLGGEPVVVVNA SGLAFDLPTDSDTRVQVTGDVRNLDIPNIERDYNLNLQDEFYKDYINKPAIIAKSILL APRVGQITKNPRKYYGTKVAVMGNVDNIQSPVLFTLNESYSLGADNLLVLFVATPKRV INKGQTVGMVGVVRPFVVADIERDYGITWDERVRRQLEADYRNKPVFVADTIYP" gene complement(50152..50637) /locus_tag="DP116_00650" CDS complement(50152..50637) /locus_tag="DP116_00650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999803.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00650" /translation="MKRDIQGKFALKNDDYRSVRSLRLTDDTWRALGVIAECLGLTRA DYLEEIVSRNLLPSITPSEAEHLPSVTPLEGEHLPSITRYEQEIERLKAQVQNLQENN SEMRKGAAFIFIQDVVNFETIRDRILFELKLGRQANGYKTAQKALNRLIAELKLLATR L" gene complement(50744..52423) /locus_tag="DP116_00655" /pseudo CDS complement(50744..52423) /locus_tag="DP116_00655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006458160.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS1182 family transposase" gene complement(53065..54033) /locus_tag="DP116_00660" CDS complement(53065..54033) /locus_tag="DP116_00660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L11 methyltransferase" /protein_id="PRJNA477356:DP116_00660" /translation="MPWIELSLDTTHEAIDWVCTLLAETIDIDDINIVEYTEPNLPHP VDQHPQWTFTIYLYLPLDAQSRTRVEKIVNLLSPLHRTGLTTAIQTSVVEEKPTDANG LNSRVHRIGKRFVVVTPDAPLQSQTADEITLRLKTTLCFGSGLHPATILSLQLLEQYI IPTMNVLDLGSGSGILSVAMAKLGANVLALDNDSIAVEATQDAVRCNGVEQQVRVMKG SLGSGSEMGHWMGGDIIDNVPSIKPTKTFDLIVANILARMHIALADDFQRALRQTDAQ VGLLITAGFTADHEDNVDKALTKAGFEIVDCKRLDEWVAFAHRLVL" gene complement(54047..55195) /locus_tag="DP116_00665" CDS complement(54047..55195) /locus_tag="DP116_00665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873394.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00665" /translation="MQNRLRLLRTQRNWSQAELAQRLGVSRQTINAIEVGKYDPSLSL AFKIAQLFRCPMQFIFLPEERSMFERFTAKAYRAIVLAQGEGERLGHQFVGTEQILIG LIAEGNGLAAKILKSFGVTLEAAQIEVEKIIGRGSGIKGIEYPFTPKGKQVLDFAVEE SQKLGHTHIGTEHLLLGVLMVTDGVAVRVLEKLGINLQNLKQEVLREITSVGIPQVNI APSRDFGSEDNLNDSPPDFTSGEISARFCAFLFSWVEPRKLGHIIGSRGGFQLPNGEI AAPRISFFSRERLKRVPRTYPELVPDLVVEIKSAFDRLISVQQTIQRFLDLGVKVALL IDPDAQTVAVHRLSNGATVLGNGEKLTISELFPEWELAVSEIWPPVFD" gene 55371..55598 /locus_tag="DP116_00670" CDS 55371..55598 /locus_tag="DP116_00670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006455622.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00670" /translation="MRGILFSLDSRPNGDEVWRSRSGCFAASPPLSYTNARLVGASAA ILISFWTKNLLLTIVLRMLIFFCWQWLVQLH" gene complement(55714..56439) /gene="cysE" /locus_tag="DP116_00675" CDS complement(55714..56439) /gene="cysE" /locus_tag="DP116_00675" /EC_number="2.3.1.30" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine O-acetyltransferase" /protein_id="PRJNA477356:DP116_00675" /translation="MLLTDLRTIYERDPAARNWLEVLFCYPGFQAILFHRVAHWLYQR GIPFIPRLISHISRFFTGIEIHPGAKIGTGVFIDHGMGVVIGETAVVGEYSLIYQGVT LGGTGKQTGKRHPTVGNHVVVGAGAKVLGNINIGDHVRIGAGSVVLRDVPSNTTVVGV PGRVTRQAEESTDVLAHNKLRDVEAEVIRALFERVKALEKQVQELEVQPSVHALQVSD SQTNNGKSSSDLMISEFLDGAGI" gene 56705..57160 /locus_tag="DP116_00680" CDS 56705..57160 /locus_tag="DP116_00680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318639.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_00680" /translation="MNSQNYTLLDLSSKVEYALLALLELATNKGKKTPLTMSEMTAKQ PIPERYLEQILTSLRRAGVIQSHRGSRGGFVLAREPWQITLLEIVTLVEGERKDREPS VTSTLERDLVHEIWEQANTASIKVLQNYTLQDLCQQREARLQQGPMYYI" gene complement(57505..57927) /locus_tag="DP116_00685" CDS complement(57505..57927) /locus_tag="DP116_00685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308548.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00685" /translation="MKANAEISEKEKSNKPSSTDKVREHLANERTYLAWMRSGIALMG FGVLIVRLRILRPPLAPQAPGDGWKLGLAFSLVGLLTVMLSTQHYLVVRRDIEEDTYQ PADRLVILSSLAVILLGIGVIYYVFSIPLESLNTVIVE" gene complement(58547..59941) /locus_tag="DP116_00690" CDS complement(58547..59941) /locus_tag="DP116_00690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127643.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="APC family permease" /protein_id="PRJNA477356:DP116_00690" /translation="MTITDETRQKRSAHGLKPNCLSFGEVLAQSFAVIAPTTIPASNI GLIVALSGNGTWLSFVIGLIGLVLVSININQFASRSASPGSLYSYISKGLGPTAGVIC GWSLVLAYLFTGMSVLCGFANFSSVLIGHLGIHPSSITLLALGAGIAWYAAYKDIQLS AIAMLWMEGISLVLIAGLCLLIWAHKGFAIDIPQLTLEGVTPGNLATGLVLVMFGFSG FESATSLGDEAKNPLRTIPRSVMGSAILAGLFFISTTYIEVLGFRDTGMSITKTEEPL GFLSQQIGMGYLGDLIAFGALFSFFACVLGCINPAARIFFTMARHGLFHSRLGTAHSS NRTPHVAVTMCSFITFLVPAVMSLFHIKLFESMGYLGAICSYGFLTVYILISIAAPVY LYRIRKLRVHHLVSSVLAVGFMMIPVLGSVGIPGSTMFPVPEPPYNTLHQLMTEQHSF VVSTRQPRTDVSGF" gene 60450..61418 /locus_tag="DP116_00695" CDS 60450..61418 /locus_tag="DP116_00695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LysR family transcriptional regulator" /protein_id="PRJNA477356:DP116_00695" /translation="MTLEQLRIFLAVAELMHFTRAAEALYITQPAVSAAIQSLEAEYG VRLFHRIGRHIEITDAGKLLQMEAQKVLDQVSLTERGLKELNNLQRGELKLGSSLTIG NYWLPEKISQFKRQYPGIHIDCTLGNAEEICEGTATGFFDFGLVTGDVKPSLKSYLEQ EVVGSDRLQIVVGTSHPWFERTEICPAELLATSWVMREPGSGAQQMFEQALQNWGIQL TELDIVLVLSSSEMVKAVVESGVGAAAIPEVMVKKEIQLSTLHAVYVVERNSGTKLDI VQPVWKLKHRQRFQTRVAIAFEEILTAVESPESTRQNSKVFDSELN" gene 62100..62537 /locus_tag="DP116_00700" CDS 62100..62537 /locus_tag="DP116_00700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_00700" /translation="MELSNKTEYAILSLIALAACYSSGESLQIREIAVRQNIPNRYLE ELLATLRRGGLIKSIRGVKGGYVLAREPRKITLLDAFRCMEEADEDVPNKKSTPTSVE TEVVQEVWQEACEAAYSVLQKYTIHDLYEQRGKRRQMEFMYYI" gene 62898..64628 /locus_tag="DP116_00705" CDS 62898..64628 /locus_tag="DP116_00705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455646.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diflavin flavoprotein A" /protein_id="PRJNA477356:DP116_00705" /translation="MVALTARSENTQSHGRLTMQTVDIGVETIAIRSLDWDRSRFDIE FGLNNGTTYNSFLIQSEKTALIDTSHRKFKQLYLDVLKGLINLSTLDYLIISHTEPDH SGLVKEVLQLAPQVTVVGAKVAIQFLENMIHQPFQSLIVKNGDRLDLGNGHELEFVSA PNLHWPDTIFTYDAKTRILFTCDAFGMHYCDDHTFDEDSELIEADFKYYYDCLMGPNA RSVLSAIKRMEKLDINTIATGHGPLLQHHLSDWVSCYQKWSQEQAKADTLVALFYCED YGSSDQLARAIAHGVQRNGVAVELVDLNTAEPHEVRELVNQATGLVIGMPSQSNQNAH AALSTILAAAHRKQAVGLFESGGREDEPIYPLRNKFQEIGLTEAFPPILIKEPPTHIT EQMCDEAGTDIGQWLTRDRTIKQIKAIDNDLERALGRLSSGLYIITAQKADVTSAMFA SWVMQASMNPLGVAIAVAKDRAIESLLHVGDRFVLNVLEEDNYQSLMKHFLKRFPPGS DRFANIKTYPASNGCPILADALAYMECEVTTRIECSDHWIIYSSVQTGRVAKLDALTA VHHRKVGNHY" BASE COUNT 18433 a 13705 c 13947 g 18614 t ORIGIN 1 tttgaaaggg gggttggggg gatctaaaat ccaaaagcca gtcgtaggaa ttctcctcta 61 tcgcaaacac gttatcacca aacaacctta cataccgcaa ctcattcgca attttcaaga 121 agctggtttg ataccattac ccatctttat taatggtgta gaaggtcatg tcgcggtgcg 181 ggattggatg acaactgact acgaaactca gcaacgacaa caaggaaatg tagaaattgc 241 ttcgctttca aaagaagcag tcaaggttga cgccattgtc tcaacgattg gctttcctct 301 tgtcggtggt ccggctggtt caatggaagc gggaagacaa gttgatgttg caaaacgcat 361 ccttactgct aagaatgtac catacattgt cgccgcacca ttgctgattc aagatattta 421 ttcttggacg cgccaaggta ttggaggatt gcagagtgtt gttttatatg ctttgccaga 481 attggatggg gcgatcgaca ccgttcccct tggtggtttg gtgggagaaa agatttatct 541 ggttcccgaa cgggtgcagc gattaattgg gagagtgaaa aactgggttg ctttgcgaca 601 aaaacctgtc tcggaacgaa agattgcaat tattttgtat gggttcccgc ctggttacgg 661 tgctgtggga acagctgcgt tgttaaatgt accaagaagt ttactgaagt ttcttcaagc 721 attgaaagac caaggttata cggttggaga tttaccggaa gatggagaag agttgattcg 781 caaagttaaa gaagcggatg aactcaaatg ggaagataag caagatgaag aaaatattgt 841 tccctcatca gttaatgtcc gcaccttgga aaaatggcta ggataccgga gcacatcccg 901 gattgagaaa caatggaaat ctctcacgag tactgggatt aagacttacg gcgatgaatt 961 tcatattggc ggtgttcagg ttggaaatgt ttggataggt gtacaaccac cattgggatt 1021 acaaggtgat ccgatgcggt tgatgtttga acgagattta 
actcctcatc cccagtacgc 1081 tgctttttac aaatggttgc aaaatgaatt tgcagctgat gcggttgttc attttgggat 1141 gcacggtact gtagaatggt tgcctggttc tcctttgggg aatacgggat attcttggtc 1201 tgatattttg ttgggagatt tgcctaatct atatatatat gcggcgaata atccttcgga 1261 atcgattctg gcaaagcgtc gcggttatgg ggtgttaatt tctcacaatg tcccacctta 1321 tgggcgtgcg ggtttgtata aggagttggt gacgttgcgg gatttgattg cggagtatcg 1381 agaggatccg cagaagaatt atgtcctgaa ggaagcgatt tgtaagaaga ttgtggatac 1441 tggtttggat gctgattgtc cgtttgagga tgcgaagcgg ttggggattg ggtttacgcc 1501 tgagaatatt cggatgttta gtggtcatgc ttttgatgat tatctggtga agttgtacga 1561 gtatttgcag gttttggaga atcgtctgtt ttcttctggg ttgcatgttt tgggggaaaa 1621 gccaagtgag gaggagttgg cggggtattt ggaggcttat tttggaaacg aaccgcagag 1681 gcgcagagaa cgcagagagg aggaggaaga ggagaagaga attagagagt tgttgggaca 1741 aactactgat gagttgacga atctggtgcg gggtttgaat ggggagtata ttttgcctgc 1801 gcctggtggg gatttgttac gggatggtgt gggtgttttg cctacgggga gaaatattca 1861 tgctttagat ccttatcgta tgccttcgcc tgcggcgttt gcacggggac gggaaattgg 1921 tcaaaagatt atcgcgcagc atctgcaaga gaatgggacg tatccggaga ctgtggcggt 1981 gatgttatgg ggtttggatg cgattaaaac taagggggaa tctcttggta ttcttttgga 2041 gttggttggt gcggaagctg tgaaggaggg gacggggcga attgttcgtt atgagttgaa 2101 gcctttggct gaggtgggac atccccggat tgatgtgttg gcgaatttgt cggggatttt 2161 ccgcgatagt tttgtgaata ttattgagtt gttggatgat ttgtttgaac gggcggctga 2221 ggctgatgaa ccggagaacc agaattttat taggaaacac gctttggctt tgaaggcgca 2281 aggagtggag aatgtttcgg cgaggttgtt ttctaatcct gcgggtgatt ttggttcttt 2341 ggtgaatgat cgtgtggttg atagtaattg ggaatctggg gatgagttgg gggatacttg 2401 gaaaggtcgc aatgtgttta gctatggtag gcaagataaa ggtcaagcta gaccagaagt 2461 gctgactcaa cttttgcaaa acaccagtcg cattgtccaa gaaattgatt ctgtggaata 2521 cggtttgact gatattcaag aatattatgc caatactggt ggtttgaaga aggcggcgga 2581 gaaacaacgc gggaaaaaag tgactactag ttttgtggaa agtttctcga aggatacgac 2641 tccccgaaat ttggatgatt tgttgcggat ggagtatcgc agcaagttgc tgaatcctaa 2701 atgggcggaa gcgatggcga atcagggttc gggtggtgct tatgaaattt 
ctcaacggat 2761 gacggcgttg attggttggg gcggtactgc ggattttact gataattggg tatatgacca 2821 agctgctgat acttatgctt tagatgcaga gatggcggag aaattacgaa aggcgaatcc 2881 ggaagctttt cgtaatcttg tgggcaggat gttggaggcg catgggcgcg gtttttggga 2941 ggcgagtgat gagaagttgc agaagttacg cgaattgtat gagttgacgg atgaggagat 3001 tgaaggtgtg acggtttaga aggtagggtg ggttagccga gagtacggca ccacacaaac 3061 tcacctcttt tgaggctatg atgtcttcaa atgattacac tttcggagtc agagagcaaa 3121 ttatatggaa aactctttca ctgcaatatt tgaaaaagtc gatgactggt atattggcta 3181 cgtccaagag ttatctggcg caaatgtcca agaaagaact ttggaagaag caagggaaag 3241 cctgcgggaa gtcattgagc taattttaat ctcaaatcgg gaattggcag agcaaaaact 3301 atctggtaaa gatgttgtgc gtgaaaagat tacagtcaaa atatcgtcaa gagacgagaa 3361 ttgattcgcc atctggaagc caacggttgt cttttgcttc gagagggtgg aaaacacacc 3421 atttactaca atccatcaaa caatagaaca tcagcagttc caagacatac cgagatagtt 3481 gatattttgg ctgtgaaaat ttgtaaagat ttggagatac ctccacccta gcaatattaa 3541 tagctggtaa gttgaatgac ttaactatta aagatagatt ggagcgatcg cctaatatct 3601 cgtcagacgc taagtaactg tattttacta aattcaaaaa tggcacagaa gttgccttaa 3661 gcaacagcat attttgtatg gggcggaaag ggattgagca agtcaccatc catatcatcc 3721 ctcgaaaagg tttctaccca actaatcgca ctgaatgtta cactttgcaa gtcttgatga 3781 gagagaagat ataatattcc accactttgt ttctgttgtt tctcgttcta ttcctacttc 3841 aaaacttatc cataaccaga tcagtaatcc tcctggaaga tgacctacag gagatttagc 3901 attgaaggta agttcatcat aattcaacca accagagttt ctataccatc cgacgcactt 3961 acaaagatgg atgtaaattt tcatatcaat atcaggttta ccaccaactt cctgccaaat 4021 gcgcttttgt acgctaaagc cgaaatgcgc attgctgtat tttacccaaa gttggtcaat 4081 ggtgcttaag tctgtgcagg gaaagttgtt aatagattca acatcaagcc agccatcctt 4141 ttctcgacca gctactttca gcatcagtct agtagtttcc tcatctgctt cttgccacag 4201 gttggctttg agtagcttct ccagattacg gtaatctacc cccatatctg aacttaattt 4261 attagagata atcgcttgct gggaacccaa taagtttaac cactcctgca tcgtttgagg 4321 acggtctttt gcctctaacg ccatgccttg cagaattacc tgattaacac cattgctgag 4381 actggggacg agttgctggg gtggtactaa ttcctcgtta tcaaacttgc ggtcaaagga 
4441 attggtgggg cgttgcccag ttacagcata atacagggac gcagccaaac aatacacatc 4501 cactgttggg tgacgaatac ctttacgcac taattcataa ggcgcaaatg ccttgtttcc 4561 aaaagatctc gaaagtgtac taggtggaga catatcccca gcaataccaa aatcaatcag 4621 caccgctttg ccagagttag aaacccctgg cttaaaactg agcataatat tgggaggggt 4681 gacatcaaga tggaagatat tttttttatg cacttccacc aaggctgatc ctatttgtcg 4741 gatgtattta acagcttcag tttctggtaa tgcccctcgg ttttgtacaa actgccacaa 4801 actctcgcct ggtatatatt gcatcaccag acaggaacga ccatcttcct caaagaagtc 4861 gaagacttgc acaatatgag gatgggagtc tgcacaacat tctcccagca tttgtgcttc 4921 ttttttaaag cgtttaagat acttgggata gtctggatcg ttttggacac ttaaatttgg 4981 tgttttgatg acgactcggc gattaagctg agtatgtatt gcttggtagg tgacagcaaa 5041 gccgccgccg cccaactgtc tttctatctt gtatttgccg ttgtgtatga gttgcccctc 5101 gccccagtac ataattaaag gtttgtcgtt aactattatt gattttgaca cgccttttgc 5161 ggatatggca ggtgctacag tttgccataa gtacttagaa ctacgcaatg tgccagtgta 5221 catccgcaag cattgatgct ttggcgattt cctatctttc ttttggtatt agttgttatc 5281 agtcagaaaa gcaaaatttt gtcacttttt ctgcgggaga ctggaagatg gtggcgtgtc 5341 ttgtgttgtg gaactggtgg gtgtagctaa tggttttggg aacaatttat cccgtaaatt 5401 tatagtggcg tctaatcctg atattattcc tgccaataac cccaacacag tcaaaatccc 5461 tattatccgt gcccacagag gttttatcca aagtagtttc gggagagaaa aacctcttgt 5521 ttgcaacccc aattctttca accagtcctc tactgtctgg ggacgttgtt ctggttctag 5581 ttccattcct tggattatgg cattgttaat gcgatcgcta atttggttgt tcaattcttt 5641 gggttcagtt aatcttttat tttggtcttt cctgtttatc gcactaactg gctgttgtcc 5701 tgttaataat acatataaag ttgcagcaag agaataaata tccgtccaag caccgcgtcg 5761 tttctgttgt gtggcggagt tgaagtgtag ttcaatgggt gtaaatccgt caactctagc 5821 accgcgactt gtcaacggac tgtcaaatcc tcttgctaag tcaaagtcta tcaaaacagc 5881 ttgatacgta cctgcccgca ccatgatatt ttctggtttg atatcccgat gtaaaaatcc 5941 ttgtttatga acttcgataa gtgcttgtcc aatttgctga atgtaaccga gggcttcttt 6001 ttctggaaaa aatccccgtg atttgacaag tttcgctaaa ttatctcctt gaatatactc 6061 catgactatg catggtaagt ccccttcttt gaaggactct atcataagga caatattggg 6121 
atgttgacat cttgcgagtt taggggcttc atctagaaac cctgattcta agtaattacg 6181 gtcttctaaa ccaagttgat tgagtaaatc gggattcagt gttttaataa cagtttcttt 6241 tccatcccta tttgtggcaa gataagtaat gccaatccgt ccccttccta actctcgttt 6301 tatttcatac ttatcgcggt gtaatttctg tcctggtgcc cacgtcattg gttgatagtc 6361 acaaaagttt gaatttattt tggcacagat atccagggat cagaaatttg tcaattttgg 6421 aacaacctaa ccttcaaccg taaccttaag caactgcata tcctgtatga ggacgaacat 6481 tagctggatg aaatcctagc acaatcctat ctttgggaat tcctgcagct actaattcat 6541 aggtaactcc atcctctgta tcgtcccgtt gcacccaaac cttaccatca ataatatcaa 6601 tgtggactaa acaaccgtga atccgcttaa caccatccca tcctaaagta atgagtaaat 6661 agctatcatt ttcactatcg aagactgttt tacactggat gtctccgtgc gagtaaggaa 6721 tctgtgtata tggaattaat acatctttaa taatgcgtcg ataagtattt aaggtatcca 6781 ttgcacaatc gtctcactgt taggattgaa cacgactaga tagagatttt ctgtctctat 6841 caaaagttta ccaattggct cctcgaacaa agctgtaaat atattgtcgc gcaccgccaa 6901 atacaacgtt ctttcaggtt caagacgccc aataacagcc cggtacatga caaatccgcc 6961 aatagcatct tttaagtctt gcatttccga cggactcata aaacttttaa tttccacagc 7021 tatcttttta tttccctgtt ctgcggctag tagtcgttgt gctcctaaat ccacatacat 7081 atctttctgc ccccatttta aatgcaaagg gtcgtctgta attgtccagc tgtctttaag 7141 caaagcattt ttgacagcat catgatagat atctcttgct ggcatatcaa aattcattta 7201 aatggacatt aactttttaa cccactttcg ctgcagcatc tacaccatcg ctgataagat 7261 gaccatcttc catataaata atgcgatgtc tggcgacaag ccgcttcgcg tctacggcta 7321 tatcaagaat acggttatca tgagtcacca gcaaaattgt acagccttgt tctttcgcta 7381 acttctgcat caattccacg acatcacgcc cagatttttt atcaagtgct gctgtgggtt 7441 catctgctag aactattctt ggttgactga tgagggcgcg ggcgagttcc cttcgggaac 7501 gcttcgcgaa cgccactctc tgcttttgtc cacccgataa actgtctgga taataatctg 7561 cacgatttcc caaacctaca ctctccaaca tactaatagc tttaccatcc atgtcctgat 7621 ttaaaaactc gtcatgcaac tctaacgaca tccgcacgtt ttctcttgct gttagaaatg 7681 tcatgaggtt atgcgcttgg aaaatatagc caatttgacg acgaacgtct cgcagctgct 7741 tcttactcgc actacacatt tcttgtccta aaattttcaa acttccttct tgggcagaac 7801 gcagtcctcc 
cataagggtt aacaaggttg tttttcctga acctgaaggt ccagtcataa 7861 tgataatatc gccagagtaa atatctaaat tgatatcaaa taacgcttgt ttgcgaagcg 7921 taccttcacc aaagtaatga ttgaggtttt gggcagaaat gacgggttca gttgttacta 7981 tgtgcattgg gttatgagaa gaagttattt ttttgatagt cttatttttc taggttatta 8041 ttgactgact gcctaagatg tgcgtagtaa ttcatcagaa aatctcaacc tggcaacaag 8101 cgcaggtgga atttgagtaa agttagacca actgtaaaag cgactattag aagaatcgtc 8161 agcaggaaaa gcaggttctt ccaagccatc aagtacaaac ccaacacgga atgctgcacc 8221 taacaaaacg tgcaatggtc gatggaagta caattggggt tttggttgat tttcaattgc 8281 taatcctttt gtggtgctgg gttgtaagta tccagacact ttgaccgagt attctgttac 8341 tagttgacct tcgcaatcct tgacttctgc agccatactg gtatgtgtac tattgaaaca 8401 aggatgcatc acggcaaaga caaagcagcc tccaggacgc aacaactttg tgagtgcccg 8461 taacagagag tcaatttctg ccatgtccat aagtaccatt gcggatacag cagcatcaaa 8521 ggaacgttcc cctagcccaa gtaaagctgt ctcatctgta gcgtcgagga cgtgatactc 8581 aattgatgtc tcatgctgtt gagttctttt gctggcacaa gtaatcatct cctcggcaaa 8641 atctattccg acgacatgtg cacccaagct agctaatctg cgagttgtca gcccagttcc 8701 acagcctatg tccaaaatac gagtacccgg ttttatctcc agcagtctct ccatcgcagg 8761 acgaatcagc agttgatgaa agtcgttccc gttatctccc atgcgggcgt cccatacact 8821 agcgttggta ttccacgcat cacgagtttc gttgttgaat gcatttgttg agtcatttga 8881 cattgatata ctcccagaac atttctgtga tcacttatat caaatccgtt tatatcagaa 8941 tttatttctc cttcctcttt ctctgcgtac tctgcgtctc tgcggttttt taatcattta 9001 ttactcacta aaacacatcc gcaggatctg cagattgtaa cttcctcatt gcaactgccc 9061 cagaaaaagt acacatgact atcgttatga taaacacagt cattgcacga gtcagtgtca 9121 tcgcaatagg tagcaaagtt gctgcatatg tcacttggta gagtgcgatc gcaatcaaaa 9181 atccaggcac ataacccaat gctgccaaaa gtaacgcttc ttgaaacaaa acacccaaaa 9241 gataaccgtc actatatccc attgccttca atgtggcgta ctctggtaaa tgatccgaaa 9301 catcactata aagaatctgg tgagtggtta gttgttaatt tacgttcact cagagattca 9361 cgagtgcagc tagtaaaatc ctgactacca cttccaattc ttcggtgact cggtaaaggt 9421 tgattgcttg ttcaatacgc tgcgcctgtc ggtacaatat accaaattgc caaatgttac 9481 cagttgttac tgctcccaat 
atttctggtt gattggaatc agtccatcta tcaagggcta 9541 tcatctcagt cgctagttgt gtaaatcccc tatttatatc tgcttgcttg gcttcaatca 9601 caattaaatt ggtctgtgtt cgtaggtaat aatccaaatt accttgaagt cgattatcaa 9661 ctttgatgtt gtattcaatc cgcaatttgg cacgggaata gtggattaat tcggtaacaa 9721 ctggcgcaat aagtatctct cgtcgcgtag cttcgttttc caaatctaca tatggtaata 9781 cttcttcaat tcgttgtttt aaatcaagta atcggtctag tgtaccggaa tattgcggca 9841 aatggagaaa tttccgctcg aatgagtagc caaactctgc aactaaatca tcaattgcaa 9901 atcccaattc aaaataatta ctaaaagtat acgagcgatt tgggtctagc aatggttggc 9961 ggctcatttg attcctccaa gaaagctagc tatacagtaa ttgattcaga ttgcgagcgc 10021 gatatttcta aaactctttt taaccagtct tcggcttgtt gggcgatttt cgttttttct 10081 ttggaatcag accaaatttt ttcccaatca aatagccaac gcgtcaaaac acgccctaaa 10141 tcaaccaatt ctgctgcttg atcaatatct gcctgtacca tgtctggaac agtccattca 10201 ataaaatacc tactttcttt ggcaagattg agtatcgatt cctgatgcga ttcctcggca 10261 cataagtttt taatctgaga taaattttca gccaacttat gcaggcgatt aggaatatca 10321 ttttttacaa agctttgttt ctttaagcta aagtctgtca taattttcct agaaactcga 10381 taatgatttg agcaaccctt tctctaatat tagggtctag tatttccatt gaacgatttt 10441 gacgaggacg aagattaaca atcgagtcgt aagtaatttt ctgattcaca ggcataaagc 10501 ttgcacccca gatattttct tgagtgctac catcatctag caatacctgt tcgcagtcat 10561 aatgcatatc accccctcca gcaataactc ggcgctcaac atcaactaca gttttgatag 10621 aaaacttatg ctcttccagc atttgctcga tttgctctct tgtagcgcgc tcgcgtagca 10681 acagaatcaa ttagctttcc tctattttaa aactatgtct agactccaca ctatactttg 10741 tgttgcaaca tccgaacatt atctgatgaa agttataaag taggaaagaa gcttataccg 10801 attctctata aacttgcact aaataattaa atctctcttc ttgctctttt cttggcgttc 10861 ttggcgttct tggcggtatg cgccaagggc gcacgctacg ctaacgtttt aaaaataaag 10921 tgcatctatg gcactttacg agcaagctat catggagaac tggtatgagt aatcaaagat 10981 acttgatttg agagattaaa ttaaaggaga cacagcagag acaatggcaa tcgaccctga 11041 aatcacaaga catagagagt ggttgggttt tctccaacca gtagggttgg ttgtgtctcc 11101 tccggcgttg gtgaaggcgc aggcggtggt gaatcgcaat gtggtggatt tgcaacaatc 11161 gctgctggct 
gtggttgatg aggatggtag gattgcaaat tttctttctt tcgtggtata 11221 agtattaaat tgggataact ctaacttaat tgaggcatcg acagaatatg aggtagcgtt 11281 acctgattac ggcgaagtcc tcgcgcctac ctatattgtg cctgaaccag atagcgactc 11341 accgctgatg ttggtgcaaa tcatttcccc ggatactgat ttggatgcag ttgcaccttt 11401 gattgctaag tctggtacaa gttggcacgc tagtcctcaa gcaaaatttg agcggttgtt 11461 gcgggagacg caaataccca tcggtttgtt gtgtaagcga ttcaaccacc acgaattcct 11521 gatggtgtgg tgtataggtt gttagaaaag ttactgattt tggaaggaga acggctttct 11581 tatcgggcgc tggatgtgga gcaaattggc tcagtgtatg aaggtatcat gggttttgct 11641 gtggaacaag cgaaaagtcc gagtattgga gtttacagca agcccaaggg gtcaaaggtt 11701 tcgacgacgg tggtggtgga tgtggcagca attttggcgg cgaaatcagg cgataagccg 11761 gaggcttgac gcttcgcgta tcgccctaaa ctattaaaag aatgggcaaa ctgtgaaatt 11821 tccggaaatg ccctcaagga attaaaagca gcacagacat tagaagatgt ggtggcggcg 11881 ctagatcgaa aagtgtcgcg acaaacaccg catattttac ctgtgggttc gctgtacttg 11941 caaccaggag aagaacgcag gcgttctggt tctcactaca caccccgttc cctcaccaag 12001 ccaattgtgg aaacaacact gcgtccggtg ttggaagctt tgggagaacg tcccaccgct 12061 gagcaaattt tatcgttgaa agtttgtgat ttggcgatgg gttccggtgc gtttttggtg 12121 gagacttgtc gtcagttggc agagaagttg gtggaaactt gggagagaaa agagcagggg 12181 agcaggggag cagaggagca ggggggagaa gattctcctg cgatcctccg tctttccacc 12241 cctctgcttt cttccgacga acccctcctt ctcgcgcgtc gcctcgtcgc ccaacgctgt 12301 ttgtatggag tggataaaaa cccgtttgcg gtgaatttgg cgaagttatc tttatggttg 12361 gtgacgctgg cgaaggattt gccgtttaca tttttggatc atgcgctcaa gtgtggagat 12421 tcgctggtag ggttgaggaa agaggagatt ggttcttttg ggaaagatgc ggctccgttg 12481 ttagcactgt tgaaagaaca acttgagcgt gcgcattctt atcggacgga aattcaggcg 12541 ttggataccc gcagtgatgc tgatgatgac caaaagcgga attatctata taaagtagaa 12601 caagaattgc acgaggcgcg gttaacagca gatgtgagaa ttgcggcgtt ctttgaggga 12661 agtaataaca agcagcggga agagagagaa aaagagattg gagaattggt gagaaattgg 12721 cggtatcacc aagcagatac tgagaatttg caggaaattg cgagtaggtt gcggagtggg 12781 gaacgcggga ttattccgtt taactgggat attgaatttc cagaggtatt tgataaggat 
12841 gatagaaata ataatccagg gtttgatgcg atcgttggga atccgccgtt tgcgggaaaa 12901 aataccacaa ttaacgccca tgcacctggt tatcaagatt ggttaaaaga agtttaccca 12961 gaatctcacg gtaattctga tttagtggcg tttttcttcc gtcgtgcttt tgagattttg 13021 cgtcagggtg ggacgcttgg gttaattgct acaaatacta ttgcacaggg tgatactcgt 13081 agcactggac tgcgttggat ttgtcagcaa gggggaacaa tttacaatgc tcaaaaacgg 13141 ttaaaatggc ccggactagc agcagttgtt gtcagcgtgg ttaatatatg tcaaggaaat 13201 tataaacaga caaagctgtt gaatgggcga gaggttcccc gaatttctgc atttttgttt 13261 catgcggggg gaaatgaaaa tccggcggtg ttgctggcga atgctgagaa aagttttatt 13321 ggaagctatg ttttgggaat gggtttcacg tttgatgata gtaatccaga ggcgacaccc 13381 attgcagaaa tgcagcgttt gattgagaaa gattcaaaaa atgcagatag gatttttcct 13441 tatattgggg gagaagaggt aaatagtagt ccaactcacg cgcatcgtcg ttatgttatt 13501 aattttgggg agatgagtga ggatgaggcg aggttgtatc ctgatttgat ggagattgta 13561 gagcagaagg ttaagccaaa gcgattactt gataatcgtg cttcatatag aaaatattgg 13621 tggcaatatg cagaaaaaag agtagatttg tttagagcga tcgccccact taatcgcgtg 13681 ttagtcattt ctcgcatcgg tcaacatggc tcatttacat ttttaccttc aaatatagtt 13741 ttttctgagg gtttagtagt tttcgcactt cccacctact cagccttttg catccttcaa 13801 tcccgcatcc acgaaatttg ggcaagattt ttcggttcat ccttggaaga tagacttcgc 13861 tacaccccca ccgactgctt tgaaaccttc cccttccccg aaaactcgca aaccaacccc 13921 accctagaag ccgcaggcaa agaatactac gaattccgcg ccgccttaat gattcgcaac 13981 aacgaaggac tcaccgacac ttacaaccgc ttccatgatc cagaggaacg cgactctgat 14041 atcctgaaat tgcgcgaact tcacgccgca atggataaag ccgtcctcaa tgcttacgac 14101 tggagtgata ttcctaccga ctgcaccttc ttactagact acaacgatga ggaagacctc 14161 acccccaacg cctctgctta tgaaggagag gggagtagta gacagcggaa aaaaccttgg 14221 cgctaccgtt ggacagaaca agtgcatgat gaagttttag cacgcctcct cgaccttaac 14281 caaaaacgcg cggaagcgga aattcttggc ggtaaggcgg cggaggggaa agcgaaggcg 14341 aagagtgcga aaaagaaaac agctaaaacc aaatctcaaa agcttgttaa agattcgcca 14401 ataataccag ggtttaatgt ggagtaatat taatttaaat tgcctaaata taagcggaga 14461 caatgacaat taaagaacaa attacacaag aactagaaaa 
attacctgaa cccttgttgc 14521 aggaaatttt agatttcgtt cagtttctcc aggcaaaaaa ccagcaacgt aacataagag 14581 aaatcactat tatgagtgaa tcatcactac aaaaagattg gttaaaacca gaggaagaag 14641 ctgcatggca ggatttgtaa aaggtaatgt cgttatcgtt ccctttccat tttcagactt 14701 aactcaaaca aaacgtagac ggaaaattct cttttctctc ttttcttggc gttcttggcg 14761 tctatgtcct gcggacacgc tgcgctaagg cggttgataa tttttacaac tcaaatagga 14821 ctgcgatatc atgccttaca gtcaattcac catcgacaga gtaaaaaaag attttcgcct 14881 gactactgta gagggagttc gcttttttcc agactcgatt gaaccagtga ctccgagtcc 14941 gagactacaa ggaatcttag aagatttacc gtgggctata gctgtagata cagagaaagc 15001 acgttctgaa gtgatcataa atcccgtgct tctggaagtg cggcgcattt ttaatcagca 15061 aatcagcgta ttttccgggg aagaatttaa cgttgatccg agtattggac tcaacggtgt 15121 atgtgacttc ctgctttgtc gttcacccga acaattaacc gtggaagcac ctgcaattgt 15181 gattgttgaa gcgaaaaaat ctgatctcaa atctggactc ggacaatgca ttgcagaaat 15241 ggtagccgca cagcaattta atgaagccaa agcacaaccc ataacagcag tttacggtac 15301 tgtaagcagt ggaacgcaat ggcgatttct caaacttgag ggacaaacag tcacaattga 15361 tttaatggat tatcctcttc ctcctatcga acaaattctt agctttctgg tgtggatggt 15421 gaaagctggt tgatttttac gtcataaatt tttcaagtta tgaattgaca cttgacaaaa 15481 ctttcgtcaa cataagaaat acagttgatt gcttgctgta cctctagtaa acggcgatac 15541 accatgccag ccactactcc agccgaagtg cgatcgcacc tcacagatgc cctccaactt 15601 gacttagtag gaccaacacc taatgatacc acctacgcct gtgaagtcct gacgcagcca 15661 ccttctaagt ggtatctcac tggctttctg gtaccttttg gtgcaccgcc tgaagtgcgt 15721 tctgatgaca caagcaatga tgaacttgac caaattagca acagcgatgt agctgatgat 15781 gctggaactc ctgaaactgc ttccgcaaga aaagcacttt ttccctcttc gattggctta 15841 agctttctta tctcatcaga gactaaagag ttagatgtga cggtgaattg gggcgactac 15901 gtttctaaag ttggcgaaac ggaattagag agtacagaaa ctgacagcga gtcggaacaa 15961 atctcaactg agttatggga acgcattccc caagctgtgg atgtacacat catcttaccc 16021 aaagataagc agaaaaaaca gctagatgtt cctggtggta gtgggttgca gttgtttgtt 16081 tcgtgtcgcg cagtacgttc tcaaaatctt cccaagggta ctcgttcggt ttccgtcttt 16141 ctagttaact atcgcactca 
taaccctaat aaaaaacgag atcccagttt tgctttccaa 16201 gccagtctca acatccgtac caccaagggc tttgttcctc gtcctgactt gcgtggacag 16261 aatagttcag attgggatga agctgtcgct agtttacaat accgcaatga ctgcgagttc 16321 gccgtcggtc ataacgtctc ggcgatcgca cttcactctg ataacaaatg tcaggaaatc 16381 tccacaactt ggataccaca agccgaagtt cccagagttg aaccagcaga catcgcaggg 16441 ttggaacttt caatggaagc actggcggat gctgcagacg cgcaaactgt gcgccggatg 16501 ataaatccaa tggtgacaag ttatctcaag tggataactg accaacaagc cgcagcccca 16561 agcgaaccgc aacaaaccaa agtcgcaact gacttgttga accgtgcgag aaaaatatgc 16621 gatacgcgga gcgtcaagcc tccggcttat cgcatcgccc tggactccaa gcactggatg 16681 atccattggt ttttgaggcg tttcaaattg ctaaccgtgc gatcgctact gcccgcagac 16741 aacgcctcac ccacgacacc cttcgggttc gcaagtttcg actcgacgga aaccgccaag 16801 actcgaactt gctcaccgac aacatctcac ccgaagactt tccacccccc aaatggcgac 16861 cgtttcagtt agcattttta ctcttaaatc tcgttagcat caccaacccc gaaaattctg 16921 atcgggaaat tgtcgattta ctcttcttcc ccacaggtgg cgggaaaacc gaagcatacc 16981 tgggtttagc tgcatttacc cttgtcttac gccgtctgcg agaccccagc atttactctg 17041 caggtgtcag cgtcctgatg cgctacaccc tccgtctcct caccctggat caactcgaac 17101 gcgctgcaac tctcatctgt gctttggaac ttcaacgcca gcataacccc aaacttggca 17161 atcacccctt tgaaattggg ctttgggtcg ggcaaagcgc tactcccaat aaaatgggta 17221 gcaaacgcga caaggatgag aattcagccc acgcccgcac tcatgcttac caacgcgatg 17281 acaaagccaa acctaagcca gtccccctcg aacgctgtcc ttggtgtggt gcagagttaa 17341 caaccaactc gtttcagttg ttaccaaatc ccgatgcacc tactgacttg cgattaattt 17401 gtgttggcag taagaaacgt gcggacggca aaccagcgtg cgtttttaga agaaacaatt 17461 ttctgccctt aattgcagtc gatgaacaga tatatcgcag attaccctgt ttcatcattg 17521 cgacagttga caaatttgcc aatttaccct gggtaggtca aacgggggcg ctttttggta 17581 aggtgacgca ctacgacgac cgagacggat tttatggaac tggtgatcca aaaattaaag 17641 ggcgatcgct cgaaaaaaga ctacccccac ccgacttagt tattcaagac gaattgcacc 17701 tgatttccgg accactggga acgatggtag gattgtacga aactgcaata gatgctttat 17761 gcagcgtcac caacaacgat aaaaccatcc gcccgaaaat cgttgcttcc actgcaaccg 17821 
tccgccgcgc cgatcgccaa atccaagcat tatgtgacgt tctctcccgt tatacccttg 17881 cgggtataat gggagcttct aagagtcaaa attaagttgt ctgtactcca ttccaacccc 17941 aagcttctgg cttcataggc tgggcatccc ctaagcctcc accagcagac accatgagtc 18001 ccacggcgtt taatgctcgt tgcaagatat tacgagccgc gttttcgtcc ctgtcgatga 18061 ccgcattgca tttatggcat tcatgagtac gcatcgacag ggttttaggt actttcgccc 18121 cacaatttga acaatccaca gttgtgccat gaggattaac tttaacaacg tgaacgctgc 18181 atttgactgc cactgcttcc aacttgtgaa tgaaactacc ccaagccaca tctaggattg 18241 atttgcccaa aggtgtacgg gctaaacccc ggatatttaa atcttcaact gcaattaagt 18301 catagcgttt aaccagtttg tgtgccgtat tgtagtgaaa attttctctc acccgcccca 18361 ctcgttggtg aatgcgagca atcttgtttt gcagcttctt gtacccatta gaccctttgt 18421 gcttgcggga taaaaatttt tgttgtcggg ctaaatgatt ttgtgcggtg cggtatggct 18481 tgcttgcggg cactgtttcg cccaatgatg tcgtcagaaa ttcttttaat cctacatcca 18541 cgccaacagt agatttcact tcatctaatg gcaatggtga tggcactgtg tcatcttcta 18601 gggacaagca cacataccaa ccatccgctt ttttgactat cgttgcctgt ttgagattaa 18661 acccgactgg taatggtcga tgcatgatta cgggcattgg gccaatctta gacaaacgca 18721 atactccgtt ctctaaaaaa gcccctgcct tgggattatt gactcgtgga aacacgaaag 18781 aacgcatttg tcccgacttc ttaaacttcg gcctgcctcc ccttttgcca ctcttgtcgg 18841 gaaacaacca tttcttccaa gccttatcca agcgttgcag attgacttgc tgagattctg 18901 cccaaatatt cttgtaattg ggaaacaact gttttgtttg cttcaggtct gactgctgag 18961 tgtagtagtc aactttatct ggtatttccc caatgggttg agaaacagta ctgcaacggt 19021 caatttgaca tctcgtacgc cgcaaccaat caagtctttg ccctaatgcg taattgtagt 19081 gcctacgcaa gagttcgagc cagttgtcca taatgacaac ctgttcggga tttggcttga 19141 ttcggtagca gtaagtgatt agcatagaac tattttagtt tatcgcttta acttctatac 19201 catgtatcaa aaaggtttta ggtcagttta tagtcttacc gctcatattg tttttgtcac 19261 caaataccgc agaaaagtca tcaacaaaga aattcttgag aaattagctc agatatttac 19321 ttcgacttgc aataagtggg aatgtaacct caagcagttt aatggggaat ctgaccacgt 19381 tcacttgctt gtcagttacc cgccacacgt gcaactgagc aaactcattg ccaatctcaa 19441 gacggtttct tctcgattaa ttcgcaggga ttatagtgag tatctgagca 
agttcgacaa 19501 aaaacctgta ttctggactg gttcttattt tgttgccagc tgcggtggtg tcacaattga 19561 gcaattaaag agatacgttg agcaacaatc gtctcccgaa aactaaggat tcgctaccgc 19621 tcaattttat actggcggca attccccacc cagttagcgg actcctacca tatattgatc 19681 gcgttgtgct agtccatcgc ctgcgcgaag tcatagccct aactggtttt acccgctttg 19741 aagccgcaat gcccaatatc gaaggcgaaa tcgacgaatt agcgattaat gtccgtcgcg 19801 ccgcccttga ttttgaaacg aagtggatac ctgcaattga gaataaaggc gaaggtgtct 19861 ttatctcgtt tcataagcaa gttatggacc aatggttcaa gcgcgatacg gtgaaaaagc 19921 gagccgaaga tttaaaccgt ggttttctca aatggcttga tagtaaggga attcctcggg 19981 ataaagccgc atttccaggt gtaaaataca tcatgctgca ctccctatct cacttattga 20041 ttacagcagt gtctcttgaa tgcggctatg ctgcaagtgc catccgtgaa cgcatttatg 20101 caagcgatta tgggtatgga attttacttt ataccggttc ttcgggttca gaagggactt 20161 taggcggctt agtagaagtt ggtagacaca ttgattacca tctcggaaaa gccctagcct 20221 taggaagact ttgctcaaac gacccagttt gcgctcaaca cgaaccggat aaccctcgcg 20281 aagaacgctt tctgcacggt tctgcatgtc acggatgttt gttaattgct gaaacatcct 20341 gcgaacgccg caaccaattt cttgatcgtg cgctggtggt ggacacggtt gaagaattag 20401 acgctgcatt ctttccagat gatatttttc ccgatgatct cgtcactagt taattagcac 20461 cgccaaaagc agtgttttgt atagctactt gctgcacctt aaaaagtgac aaaaattgac 20521 aaatttctat cattcacttg aatcaatcaa taatataata tactctgata gcgtcaaata 20581 ctaaatacat tcaaattctt aactcggcgc atcagaccct tgaggagagg cagaaggcct 20641 aaggagtgaa aagcaaacca tgtcaagctt ttagggttac tttgccgctt gttgtaccga 20701 actctaaatt ttacccatgc ttatggtagc gcgcactcct acacaaaagc cccagataaa 20761 gtcaacagaa ggttgcaact gtaaaaatct acctttgact cagcagcaat ccaaaaggtg 20821 catgactcta cctccccagt tgaggaactg aatatcgcaa ccaacaccta tgtcctcaat 20881 aagtttttgc agcataaagt agatttcgtc tatgactcag ccttcttctc aaaagcaagc 20941 cgagcaacaa ttgaatcagt caattaatca gccaaaaggg cttcaagagt ctgctcatcc 21001 tcagcagggg cgacaggaat tgagtttgaa gaccaaggca atcgcttggg ctatgcttgt 21061 tagtatggtc cccgtgctaa cagtagggac aaccacttac tactttggta gccagttgac 21121 tgccaaacag atttctcaag ctagacttgc 
ggataccaca ggtctggcaa aagctgaact 21181 tgcactaaac aaactactgt cactcctgtt aattgaaacg ggggtgacgg cagtgttggc 21241 agctgcgatc gctacaattt tagggattcg cgccattcgt ccagtcctga atgctgccgc 21301 agtctctacc actatggtga ataggttacg tcgagaaagc gcggattctc ccacctccac 21361 cgtcagcaaa gatgaactgg ctgtcttaga gacaaacatc aacttcgtca aagaacagct 21421 tccggattta ctgtggaaac aagaagccga agccgaatgc tcccaggtgt ttcacaacat 21481 tagccgccgt attcgcacat cgctcactca agaagacctc ctcagaacca ctgttgaaga 21541 agttcgcaaa gccctaagaa ttgaccgcgt ggtcatcttt cgttttgatt ctaacttgga 21601 tggaactttt gtagaagaat ctgcggcatc tgatttgcca aaaatcttat ggactaccat 21661 ctcagacccc tgtttcaacg gagggtatgt cgaacaatac cgaaatggtc gcgttcgtgc 21721 tattgatgat atttatcaag ctaatcttac tgattgtcat attgggttgc tagagcgatt 21781 tgctgtcaag tccaatctca tcgcgcccat tctcaaaaac gaccagttat tcggcttatt 21841 gattgcacat cagtgctttg aaccccgtat ttggcagcag tatgaaattg atttgtttgc 21901 tcagatagct acgcaagtgg gattcgctct tgactatgtt aagctgctag aacggataga 21961 tacaaaagcc gatcaagctc aggcatttat agatataacc cgccgcattc gagaatcgct 22021 caacgaagag gatgtcctaa aagccaccgt tgaagaaacc cgcaaagcac tgagcgctga 22081 ccgagtgctg gtttatggct ttgacagcaa ctggtatgga acagtgattg cagaatcagt 22141 gattccaggt tttccgaaag cgttgcgagc caaaatcaaa gatccctgct ttgctgaggg 22201 ctatgtggaa aagtaccaag caggtcgagt tcaagccact aataacattt atgaagcagg 22261 tttgaccgct tgtcacatta gtcaactcca accctttgct gtcagggcaa atttggttgc 22321 gcccattctc aaagatgacc agctatttgg cttattaatt gcacatcagt gctcaggacc 22381 tcgtgattgg cagcaatatg aaattgattt gtttacccag atagctacgc aagtaggatt 22441 tgctctcgac catgccagac tccttcagcg aatcgatgct gaaggagtgc gaactcaatt 22501 acttacggac ataacccgcc gcattcggga atcgctcaac gaagaagacg tcctcaaaac 22561 caccgttaat gaagttcgca aagcactggg cgctgaccga gtggttgttt atgggtttga 22621 cagcgactgg tatggaactg tggttgcaga atcagttgtt ccaggttttc cgaaagcgtt 22681 gcgagccaaa atcaaagacc cctgttttgc agaaggctat gtagaaatgt accaagcagg 22741 tcgagttcaa gcgaccaata acatttatga agcaggattg agcgattgtt acattggtca 22801 actcgaaccc 
tttgctgtca aggcaaattt ggttgcgccc attctcaaag atgaccagct 22861 atttggctta ttgatagcgc atcagtgctc aggacctcgt gattggcagc aatatgaaat 22921 tgatttattc gctcagatag ctacgcaggt cggatttgct ctcgaccatg ccagactcct 22981 ataccgggta gagcaggcgt accaatctgc tgaagcgacc tctgacgagc agcgtgaaca 23041 aaaagaagca ctccagcgtc aggtgttaga actgctcaga ggtagtgaca ccgctgtcca 23101 aaccctttca ggtgaggcga agagccaggt ggagtccctt acaggtgcct acaatcaaat 23161 taaaacgtta gttgattcgg cgatgataat ggtgatttgt gcccaacaag cagaactcca 23221 agaacagcag ctcagtcaaa tagtacagga tgggcatgag tccatagacc caattttaga 23281 aaacttctgt gatatccaag taagagttat ggaagctgcc gagaaggtcg agcgtctaga 23341 ccaacctttt caaaagcttt cccatatagt gagctttatt agtaatatcg catctcaaat 23401 aaagctccaa gctatgaata cagtacttga agcgagtcga accccagaag ctggtcaaca 23461 atttgcttgg ctcgctgacg aagcactttc tcaagtgcat caattagacg cgagtattgt 23521 agaaatagaa tcactcgttg cagaaattca aacacaagcc aatgaagtta ttcctgttat 23581 ggaatacggg gcagaacagg caatgaccgg gattagattg gcgcaggaaa ctcagcagaa 23641 gttcaatcag attgttatta ttagtgacca aatgaaaaaa ctggtcgagg agcttgtcca 23701 cacgggtcca gtccaagcaa aaacttcaac ctcagcaagc caatctattc tagaagttgc 23761 gagtattgcc agcaagactt cagagcaagt catggcagtg gctaagtctt tagacaagct 23821 gttaacaatc gcacaagatc tgcaggagga tgctgagtga gaaccccact gttggactca 23881 ttgcgctgcc tcaccgtatt cgttattggg tgtgcaagac ggttattata gcgcttggtg 23941 gtgttgggtg caatacatat ctgtagtgaa agtgcatgca ctttcacatt attcgtggtg 24001 atgtatgtga ttcaaatgag aatcgctata atttccaaac ttgtccggaa taaccttcct 24061 acaaactaga acgagatggc gggacgttga tagaaataaa ccgatggttc cctagctcta 24121 aactttgttc taattgtcat taccaaattt cggagttgtc gttagatgta agaacttgga 24181 cttgcccaag ttgtggtact catcatgata gagatggtaa tgctgcaatg aatattagag 24241 cagctatgtt tcagaatgct gtcctcctct gggacggggg aggcgctccg ccaatggaga 24301 ggaagtaaga ccaactcgcg ggcgtaagcc taagatgagg tgagaccagc gctgcaggag 24361 ggtttcccga cagccaggcg actggcgttc gccttcaggc gtgcgctttg cgcatacccg 24421 gagggcattc ctccgtgaag ttggaagccc cgtccgcgcc cgacggcggg gtagttcact 
24481 gagaaaagta aaaatctggc aaaagtctga aaacaatgta tgcaaagctt gacagattta 24541 aacataccgc agaaattatt cggtgatggt tgagctaatc ggtgcattga cggcgtatat 24601 aaactacgct acaagtggca cgaaagatta accggggaaa tctgaaattt gtagaactta 24661 tttttttgtt tgaaatgctg aatattaaaa tgaaccaaca gggaatttgc aatgctttaa 24721 ttaaaatatg tagataattt tgcttctttg tagttaaaag cctgtgttac ttcatagtat 24781 tgattcctac actctgtctg agtgcctcaa agaactcttc aagacaatct tgcagaacga 24841 tggcactctc caaaaaaatg tatccagtgt ttgcctcata ttggcacagt cagtacaaaa 24901 tgctgtgcta gagtgacaac actggcgcta gcgtgaggga taggatatgg tcgtagaaag 24961 gtgtttgggt gtttattata aacccatttc tcatcttcac aaaggatgag aaagcctttt 25021 cgacttgacg aaacggagtt ttgcctcggg tgttgcacaa aaagcaaagc ttaagtcaat 25081 gagtggcaaa aaggtgctgt ttgcgtcata tttatgcaag ttcctctgct ccaatagcta 25141 tattgaccgg acaaaaattc acaaattctg attatggcaa gtaaattagt ctctgttaga 25201 gaatacactg ttaaggcaca taaacgactg attcataccc gggttttcaa ctttctgtgt 25261 aaggaatgcg gtattccggc gaagcgcgag acttacggat ctcgaccttt gtattgtgaa 25321 caatgtcgtc ctcctcaacc acccaaaaaa tcgttaatga agcctcaaaa ggctaaacct 25381 agacccatga cctataaatc taaaactgat ttggattgag ttgacgccat atgactacca 25441 agggaaaact agagccacca ccccaaacct tgctggagcc acttttggaa ttacgggact 25501 actatactgc tcttactgaa gagtatgaac gtttatggca aacggcgcga tcgcaacttg 25561 tacatgtaga agccttgtta tccaactggt ctggtgtcga tgaacgcgac aatctaaggg 25621 ctgtcgttga gatgttgtct tttgcaccca ctcctagcca gcaagactcc ctgtcaacca 25681 ctcaatatca atcggatgtt gaacaacaac aagctatcga ctcagaactc caagacgatg 25741 aagaccaaca gatagatttg gatgatgttg aacaacaaca agctatcgac tcagaactcc 25801 cagacgatga ggaccagcag atagatttag atgaggagtc atcggacgaa gaaaacacag 25861 tagttacatc acccactcca tccccacaaa aggagatacc aactcagaat aacgatcact 25921 ctatgcctgg ggatattcca atgctgcccc agtaccaagt tctcacgcga atgcaagcga 25981 tagagaaaat attgcgagag aatgctggaa gtgtctgtca tatagacttc gttgtgcgct 26041 cactttttgg cgatttagag ccaagtgtgt tcaaaatcgt caaaagtcgc atccagtcct 26101 ctctgacaca tggcaaagaa aagagttact gggcagccgt 
tcccgatgag ccaggatgtt 26161 acaccctgga tttaagcttg ataacaccag caaatggtaa ggtcaaatcc aagacaatca 26221 aacctcaaaa gaagaaaccc ttcctactcc ctaagtcaaa aagggcatca atgctgcctg 26281 aatatgaagg taagtttttg attgatgcca tctgcatact cttgcagaaa aattccggga 26341 aaatcttcag cgtagctgat gtcatcacag gactttatgg agaactcaat gctgaacaac 26401 taacagagat aaaaactgca gtgcataatg aactttctag aggtcatcgg ataggaagat 26461 tctctagagt tcctgacaaa gttggttatt atacttggga tttaagtaaa ttccggagaa 26521 aatagccccc atgtcgcaac ttgagaagtc aaaaggagcc tgcactgtgg aatggcaaat 26581 gcaaaattcg caaatttcaa ctcaccttct tttagctgag ttacaaatac agggcgactc 26641 cgaatgactt ctgcaagcat tgctgatgtt ctggataaaa ctagtgaggc taaagcataa 26701 atccagcaaa gctactgcgc atttactggg cgaggtccag gttgtactgt gacttgcaca 26761 gttgtgtcat tacgttgcag ttgcatttgc acatttgccc caacgccact tttatcaatt 26821 atctgctgca ctgtatccgc agtggtaaca gattgattgt tgatttgttg gatgacatct 26881 cctggacgta ttccggcttt gtcagccgga gaaccaggaa gaacgcggac aataagaata 26941 ccttggtctg cttgcacgcg gacgttacta tttgacaggc tgttaatccg ttgcttaagt 27001 tcaggtgtga gggataccat ctgaatccct atgaagggat gttcaacttt accctgagta 27061 attaattgct gggcaattcg ttgagcagta tcaatgggaa tcgcaaagcc tattccttga 27121 gcgccctgga taatagctgt gttcatccca ataacttgac cacgagcatt tagcagtggt 27181 ccgccggagt ttccaggatt aatggctgca tcagtctgaa tataaccaac ccggttattt 27241 gatgcaccga tatcactaat agaacgatcc gttgcactaa caacgcctac tgttactgtt 27301 tcttgcaaac ctaagggatt accaattgcg atcgcccact gtcctggctt aacttgctga 27361 gattttgcca gttctacagt tgggaggtta tttactggaa cttgcacaac cgcaatatca 27421 gtcaatttgt cttgtcccaa gacttttcct tcaacagtac gaccgtcgga caatgatacc 27481 gatactgtat cagcatcatt gacaacgtga gcatttgtga gaatctgtcc attagcatta 27541 atcacaaaac cagaaccaac accacgcacc actttctctt gaggctgtgc tggtatctgc 27601 cgaccaaaaa accgttgata gaatgggtca tttaagatct ctgggacttg gcttctgaca 27661 gttctggaag tattaatttg cactaccgca ggttcaacct tatttaccac agcaactaca 27721 aagttattat cctctggagt tggtgtcaga ggacggttat ttggagtagc agactgttgg 27781 actggttgag cattagcagt 
attagtctcc cgtgaaggtt gcacgtttct cgatgagttt 27841 attgagcagc cactcaaaaa tgctattcct actccaataa gtggtaggaa taagtgagtt 27901 aacgattttt tttcagagat tttgccgagt tgatttattt ttttatctag tgcatctatc 27961 tttagattga tcaaattttg attcttcttt gagcgtattc ttttcattag actgtgttta 28021 tcgaatatgt caaacagaaa atccctcttt ttctatgctt ataccaactc aagcttcttg 28081 tcatctataa taagaaagct attattaaag aataattagt attgttaaaa gggaacaggc 28141 aacagggaat agggaacagg gtggttatgt gaggggattt aaaacccgtc actatcgctc 28201 gttcttaaga acggggcttt ggatccaagt ggtctcgggg gtagaagctg taagcagcaa 28261 ctatcccaag aggaaaacag cgcgctcagg ttagggtagg gtgtagatgg cttagatcta 28321 tttattattt tcccctaccc tagtaaagtc agcttatcca gcggttcggc ttataatacc 28381 gagctgacga tccacaggat tagagtcgtc agcatcatca aaagtccccg gaatgggaac 28441 atctacgttt tcttcatcat gaacagagtc attcaagtca ttcaagtcat tcttatttcc 28501 agaagcttct ttatctggat tagggttaat tacacttgga tcagcgcttt catgaactga 28561 taaattaatg ggcttgcgtg gttcctgagt catttttgcc tctttgttaa acttgcataa 28621 actgactact tatacaagtt aaaacttggg ctgatgctat tccctctagc aacagacgta 28681 cccttaagga agatatttcc ggtattcaaa tatcaaaagt gatttgaaac catcttcatc 28741 aggacaacct ttcttattct taaccttgac ttcgtaattc gtaattacgt tggtaaatac 28801 gaattacgaa tcactcatta ttgactatta atgctggttg tgctgcgatg ttttatagct 28861 gcattctctt gttgatcatc ttctaagatg acaggtacgg gctgaacgtg ccaaatatct 28921 cgggcgtact cgcgaattgt ccggtctgag gaaaagtagc ccgtgcgcgc tgcgttgaga 28981 attgacatcc tcaaccactt gtcttgattg cgatatgcct cactcacctg ttgttgacac 29041 tcaatgtagg actggtaatc agcaaacagt aaatagtcat cccggttttt gagagattct 29101 accagtggtt tgaataaatc cctgtcacct ttggagaagt aaccggatgc aatctggtca 29161 attgcttgtt tgagttgagg attactgttg taataatccc acggatggta gccttgagct 29221 ttcatttgtg aaacttgttc agtcgtcagt ccaaagagga agaagttttc gtgtccgacg 29281 cgatcgcgaa tttctatatt cgccccatcc agtgtaccaa tcgtcaatgc cccattcatg 29341 gcaaacttca tgttgctcgt accagaggct tccttaccag cagtcgagat ttgttccgaa 29401 acgtcagccg ctggatagat gcgctgtgct aacgctacat tgtagtttgc taagaaaacg 29461 
actttgaggc gagatgcctc cggcacgcta ccgcgaacgc ctacatccgg atcattattc 29521 accacatccg ctactgagtt gatcaactga ataatcaact ttgccattgc ataaccaggt 29581 gcagccttac cagcaaaaat cacagttcgc ggtacaacat ccgcatgagg attctgtttg 29641 atttggttat aaagtgtaat tacgtgcagc aaattcaaaa gctgacgctt atactcgtgc 29701 agtcgcttca cctgtatatc aaacagagaa ttgggatcaa catgaatacc attcgtttgc 29761 aaaatgtact ctgctaagtc ctgtttattc tcctgcttaa tttgtcgcca ttgatcacga 29821 aattcttggt cgtccacaaa agcttctaac tgcttgagtt cttccaaatt tttaatccaa 29881 cccttaccga ttttatttgt tatgtggagg gctagtttgg agttggcgac caaaatccaa 29941 cgacgcgggg tgactccatt ggttttgtta ctgaattttt cgggaaacaa ttcgtagaag 30001 tctcgaaaca agtcatgtgt cagtagttcg gtgtgtaatg ctgcgacacc gttaatagca 30061 tgactaccca cacaagcaag atgagccatt cttacccgtt tgtctgcacc ttcttcaatc 30121 agcgacagtc gggcaacttt ttcaatgtct tgaggatatt tagcacgaac ttcgtccaag 30181 aaacgctggt taatctcaaa aataatctcc aagtgtcgag gtagaagtct gccaaaaatg 30241 cttaaagacc aacgttccaa agcttcagac agcaatgtat gattggtaaa accaaaggtt 30301 ttttgggtga tgtcccaagc tttatcccat cctagctggt gttcgtctac caatagccgc 30361 atgagttcgg ctacaccaat ggtagggtgg gtgtcattaa tttgcagggc aaatttattg 30421 gggaaattgt caaagttgtc gtcgtgttgc aggaagttgc ggatgatatc ttgcagagaa 30481 caggaaacaa agaagtattg ctgctgtaaa cgcagttctc gaccttggta attattgtcg 30541 ttgggataga gaactttggt aatattttca gagaagactt tatcagcaac ggcattggca 30601 aagtctccac tgttgaaaat ctgcaagtta aattcatcac tcgctttggc actccacaaa 30661 cgcagagtgt tcacagtgtt ggagttgtaa ccgaccattg gtgtgtcgta tggtgtccct 30721 aagacagttc tttctggaat ccaacgcact tggtagcgtc cctgctcatc tgtgaaagct 30781 tcggtgtgtc caccaaattt aacttccata gtgtaatcgg gacgaggaat ttcccaaggg 30841 tttccaaaac gtagccagcg atctggaact tccacttgag taccatcgcg aatcacttgc 30901 tgaaagatgc cgtattcgta gcgaataccg tagccaatgg caggaatttc caaagtcgag 30961 agggaatcta gaaagcaagc ggctagtcgt cctaaaccac cgttgcctaa tcctggttca 31021 tcttcctgtt ccattaagtc atacaaatca taccctgact cttgcactac ttgacgggca 31081 acatcatgaa gtccaatatt tgtcaagttt ttgcctagtt gtcgtccaat 
aaggtattct 31141 gcagacaggt agcaaacgac tttgacatcc tttttaaagt aggtttgctc gacagtctta 31201 agccaacggt gcagcaggcg atcgcgtaca gtatacgcca gtgccatata gtagtcgtgt 31261 gctgttgccc aacctttatc ttttgcctga atgtaataga ggttatcaaa gaaagcacgt 31321 ttgagggttt cgacgtccat tcccgtgcga tcgtcctcca cttggatggc ggcgtgttct 31381 gtacgacaaa tgtgcttttt ccaatgagga atgtgcttgc cattgtgttt ttttaagccg 31441 ttgccattat tcatggttac aatttccaaa gagtcgtgac gaggtgaagt taccactgca 31501 actttagaag aacggataga ttttttacct ctaaccaagg aaacaatagc gcaacgaagt 31561 gtcaaattgt tttcaatgaa attttatgag tgacagcatt cttagaaatt ttgttttaag 31621 cttcttctct aaccttggat attaggaagt acttttgtaa tatctcccca attaacaaaa 31681 acaagacccc actcagaatt cctattccca tgatgttcaa ttcaacgcta ccaaatccta 31741 aaaaagcagg atttataagt gttatcagga gcaatgaaat agacgaaaaa ctacttgtca 31801 ttaaagacag cttgacactt gtttttgaga agttatagcg tcccgtttga gcgagcaaaa 31861 aagcaacaaa aggagcaaac acccctccta catatgctgc ataaaaatac acaattaaat 31921 taacaatttc cccacctttc aaagcaacgg caagaccaaa cagagcattt gcagcagtga 31981 cgagtagccg actccaaaca gatgcaggca gaatgttgaa gtccaaaata gtcttgcttt 32041 gcactcgcaa tatgctgctg ccaataccca gcgctggaac cagtaaagat ataatcagag 32101 caatacccaa aggcttgtcg gtaccaccgc caacccaggc taggataaat gggagagttt 32161 ctttaccatc aatgccagca gggaaaattc ctgcgttatg tgccgctaca acaactgttg 32221 aaggtaaaaa agcaagcagt agtagcaaaa ttgcggccaa agtacaccct tgatacagac 32281 tgtttacatc ttttgcttgg agaagaaact gctgatactt catatctata ggtaccagca 32341 gaattgtaga caatgagatg cccactatag ttggcaaact catatgtttg agtgagctag 32401 caaattctat gggcgatcgc acataatcac caaaaccatg taacacccac aatccataaa 32461 ccaaagccaa aaaattcaga atcagtaacc ctcgcaatat ccagcctgct ttttccattg 32521 ggagtaggga aatgattgca aacaggatag ccaaaacaac catcgtcggg agagtagcta 32581 ttccgaagac cttgaggata aatgctccag aaatcaactg aacagcttca atgccaatca 32641 gcgatgacca ggacattagg ctcacaaaaa cttttacccc atctccgtag gcagaaccta 32701 gcaatgtcca aattggctct gctcgtttcc agtaaaattt tgccagtcct aaaagagcta 32761 ttgtacccaa gccaagggag acagtataga 
gactgccagc agcccctaaa gttagggatt 32821 tctcagcggt tcccaaaaga aaccccaaac cgtaatgcgc ggataccagc aaggctgcta 32881 gtgaaaaagt atccagtctg cggtcaaaag ccatattggc atctgctcca caaatttttt 32941 taaatttccc cgtgaatttt aactggttcg tggctaaact tcattaactc gtagagatca 33001 taagcatata aatcttcctg acttggctcg tcagtttgac ttgttgccac aatagggtca 33061 ttgtatgctt tggttctacc aatcaataaa agtttttttc ctgcacactc tccaattaac 33121 ctgaactgat gatctggaaa gaagttgact tcatcagtat ataatgatgt ctgatttttg 33181 attggcatat tttctcctga attctaagaa ttgctgcgac tcgttcaaat tgattttcaa 33241 ccattacggg aaaggatgct gattcttttg agtagaggga attgcaaaaa tagagatgca 33301 cacaattgtg atgttttttg acttttgaat tgacaaaaga tacaggatat tccctaaacg 33361 aaatcaacag cctcttatgt ctacactgcc agtagtgaca gctattctag ttaggagaac 33421 ctagggacaa cggcttgaaa tcagctgatg tcacaggtac tccttccaat gccgacggag 33481 ttagctgacg ggctaagact ggaaggtgtc tctccattca agattcgccc caatacattg 33541 gttcctccgc ttctaaccca ggttagaagt ttttcaacaa ctagactgga ttgaattagg 33601 caagttttac ccatacatta actaaaacta tactgaaacg tccaaacagt caatccttct 33661 tttggtctca cctcaggacg agtgctattt aacaacttat ctggttaata tttacggtga 33721 attgtattaa gaatgctctt atggtggatg cttcggtaat tggtgaagaa gtcattacgg 33781 ttcgatttac cacttgtgga acaatcttta actttggttc ctacctacgc tacatatctt 33841 attgttgaag agttggatgg ttctggagtc attggggttg tgattacaag gttgattttg 33901 cgccattttg gttcacgtat gggcatgaac ccgcgtactt gagttatcgt tattgagttc 33961 tgggagtttc tgacgttttt tgttcactca tttgtctttc tcttgattgg cgatcagata 34021 cggtttgcca gtttggagga aaacttaaca atcattgcag tgacagtggg ggtaatgatt 34081 ttgattcggg cagctgcaat cttgtctttg agtcagttaa gcaatcttct acaagcaaaa 34141 gaaacaccat ttccctgttc cctgttccct gttccctgtt ccctgttccc tgtgtttatt 34201 aggagaagat ttgaaaaaga caggtaaatt cttctacatt tagaatattc gagatatcta 34261 agagacgttt gtttctgata aaatccttac ggtaaagaca tactccgacg taacataacg 34321 gtcataaaag agttgtatct atcgaagtta tgataacagc taatgttgac aatccattag 34381 atcaagactc ctctaattgg tctgccttgt tgcagcaact gttagagcgt caatcacttt 34441 cggtatctca 
agcttctaac ttaatgtctg gttggctcaa agaagccatt ccccctgtgt 34501 tatcgggtgc tattttagca gcgattcaag ccaagggagt gtctgccgaa gagttacttg 34561 gcatggttga ggttctatat tcccaatcta acaaaccaac gcaacgagat tccattgttg 34621 gcacctcacc acttgttgat acttgtggta ctgggggaga tggagcatca acatttaata 34681 tttccactgc tgttgctttt gtgactgcgg ctgctggggt aaaagtggct aagcatggta 34741 atcgttcagc atctggaaaa actggatcag cagacgtgtt agaagctttg ggtgttaatc 34801 ttaaagcaag tctggagaaa actcaagaag ctgtgagcgc agttggcatg actttcttgt 34861 ttgcacctga ttggcatcct gctctcaaag ttatcgctcc tttgcgaaaa actctgaaag 34921 tacgcacagt ttttaatctg ttaggtccat taatcaaccc cttaaaacca acaggacagg 34981 ttattggtgt caactctccg gctttggtag aaacttttgc taaagttcta aatcaactag 35041 ggactcgccg agcaattaca cttcacggac gcgaaaaatt agacgaagca gggttgggag 35101 ataaaactga cttggctgtg ttgtcaaatc aacaaataca cctgctcgaa ctttctccac 35161 aagagttagg cttaaacccc gctcccatta gtgaactacg gggtggggat gtcgaagaga 35221 atgcagaaat tctcaaagct gttcttcaag gtaaagggac tgggccacag caagatgtgg 35281 ttgctctcaa tgcagcgttt gcgctgtatg tgggcgaagt agtaccagat caaggagatg 35341 agtatcaaac tttttctcaa gctgttattg ttgccaagga aattctccaa agtggacttg 35401 cttggaagaa gttggaacag cttgctcaat ttcttaagtg aaagataatc tgcaagaacg 35461 ataacatttg tagagacgtc gcagtgcaac gtctctacac gcctaattcg gataaagcta 35521 tcactgcaat caataccacc tttgaccatg caacttattg gttgttataa aaagtcataa 35581 aaatttataa atcttacgta attgctcgta aatcattcca aatttccgta aaaattcaaa 35641 cggctgttga ggaatagata tagaataaat tccataaaat ttataacaag gaacgttaat 35701 gaaatgaact accagataga aaggaatttt attattgtta caaaaagcat tgctttttaa 35761 atgatttaag catctattgg aattaaagga gtctttctgg gcgcaatact ttgtatcccg 35821 aaagcctaag taaatacgct tgatttatat taatggtttt caagaaatat gctgaaacaa 35881 ttttcctgga attttaacaa ttttccatta ggcaaaaagc taactgtatt actgctcgtt 35941 atatttattg caggaattac tctgagtggt atagctttgt ctgggatctt aaattacaaa 36001 gctcaagatg aagtgagttc aaatggtaaa ttgctgttca agactataaa ctctgtgcgt 36061 tcttacacaa atgatgaagt caatcccgag ttggaagcac gcttaggaaa ggatgaattt 
36121 gcagcacaga ctatccctgc ttactcagca cggaaagttt ttgaaaagct acgaaatgag 36181 gacgatgcct ataaagattt tttctataag gaggcgatgc tgaatccgac aaatcctcgg 36241 gataaagctg atagcgttga gacagaactt atacaaaaat tccgtaagga gaaaaatcta 36301 aaatttttgt caggattccg ttcttttgat caagaacagt tttattacat tgcgcgtcct 36361 ttagccataa ctgaatctag ttgcttgaga tgtcacagca caccagatgt tgcgcctaag 36421 aagatgattc agatttatgg aacagaacat ggatttgggt ggaaattaaa tctgattaat 36481 ggcgttcaga tagtgtctat tcctgctagt caagttttcc agaaagcgaa tcaatcgttt 36541 ttattggtga tgggaattgt gactataatt tttgcgatcg ccatatacgt cgctaacttc 36601 tggctaaagc gatatgttgt acaacccatt aagcgtgtcg tccgtgttgc agaagctgtg 36661 agtactggtg atatggatgc agagtttgaa aaagtgggta acgatgaagt cggcagtttg 36721 gtggaagctt ttacacgcat gaagataagt ttagttatgg caatcaggag ttttgaacgg 36781 taccgtggag gaaattagtg aactatagca atccgatttg attagctgaa tcactcgtag 36841 agagagggaa cagggaacag ggaacaggga acaaggaaca gggaataaag gtgtacctag 36901 ctaagcaaaa atctgcggag gagtcctata tatgtttact gattgttgag aattatagcg 36961 gtttgaaatt gcgtgcaata cagtagagac cgtactatac ggtctctaca caccgttgta 37021 tatttgattc aaacgagaat cgctataaaa atttataatg gctcgcaaat tacaaaacaa 37081 aacaattgac tacaaatcta acttaaactg gtcatcagaa acggaactcg tcgggttagt 37141 atttgagttc gtcacacaag cagatgcttc tatttaccct cagtacacca ttggactgca 37201 cgcttggttt cttgaccaag tgcgttctta tgatccagaa ctctctgggt acctccatga 37261 tggcgagtca gaaaagccgt ttacgatttc tgcattagat ggagaattag tcagcagtgg 37321 taggctagtg caactaagcg ccaacaactc ttactactgg tatgtcacag tgttgtcgag 37381 tcgagtctca caatggatgg aacagtgggt gcaaaatcta ccaaaagaac ttaacttgcg 37441 gaatgcatcc ttacagattc gttcttgcaa tattgctcat gctcccacga cttatactca 37501 actgctaaac tctgaacatg gagaaactgt taccttaaaa tttctcagcc cgaccagttt 37561 ccgtcgcaaa ggtcatcatt tccccttacc aatgccaacg aatgtttttc acagttacct 37621 gcgtcgttgg aatgactttt ctgggatgtt tgttgatcag gaggcgtttt tggcttgggt 37681 ggatgaaaac gttttgatta ctcgtcacca actgacatct atgaaagtcc tagctggtaa 37741 gaaaggtgcg gtaacgggat ttacaggaac aatagaattt 
gctttgacca aagaagcttc 37801 cagacaacca gacttttgca agctatttta tgctttgggt aagtttgcac cttactgcgg 37861 tactggtcat aaaacgacgt ttggattggg tcaaacacgg ttgggttggt cgtcgcaagc 37921 agcaccagaa gtacctgatg tgcaaagctt gttggcaaga cgtattgagg aactgacaga 37981 tatctttaag tcacagcgta agcggactgg aggcgatcgc gcagaagaaa tagcatcaaa 38041 atgggcgact attttggcgc ggcgggaaat gggggagtcg ttgcaggcga tcgcccaaga 38101 tttggagatg ccttatgaaa cggtgaagac ttatgctaag ttaagtcgtc gggctttaaa 38161 atcagggtag tttcatttga tttcgcaatt cactaaaagg ctttgtcttc atatctttct 38221 taatcacaat acttgacact tttaatactc ccatcttcct tcatagacac aattttcacc 38281 ttcactgtct gtccttcctg aagaacacta gcttttttcg gttccttctc agttaactta 38341 attgctccca aaatctcata agtgacctta ttaccattga ttttgatgac ttttgcctct 38401 aatatttggt caacctgaaa gtcttgggat tgaacgactt tggcaatttc tgcttgacgg 38461 gatgactcta caggggacgg ggtgttaatt gcaatttctc caatgggtac accagcatct 38521 ttgtagtaac gcatcagccg agagacccag cctaaaattt gcagtattgt gtaagcattt 38581 ccaacacttt tgagagaatc actacaagct ttctcgatgc tgcggtaata atctgaagtt 38641 ctgccactgt gtccaataac tctaccatta ctcaccaagg tttttaaata cttgaaaaac 38701 cgagaagttg catctggttt gttaacaaca gtacgcagat aagcaactgc tttccccaat 38761 tcgttaatat ctgtattttc ttttactagg gtttgagcga tagcatgagc aatatcccac 38821 tctgcatccg tcaaagattc cgaggaaaat tcaaccatag tcataattag tacattcctg 38881 atggcggttc gcggtcggtt ggatagcgta gtacggctgt aagttcctct agttggggtt 38941 tctgaatcaa acgcgagtgg gctgtttgaa tcttttcttg tataaatttt tttagttttt 39001 ctcctgtcat ctcgtcagaa tcattgaggt tataagacga gtaacgctgt ttcaggtttt 39061 gtgcttgtaa gattttttcc acagtgactg tcattgttcc cataccaatg ggtttcccgc 39121 ctcccacttt taaagcaata ggatttttgg ggtcttgtcc taagacaatg aacaaagttc 39181 ctaactcttc tggtagtaaa ttcttgaagt gcaactgggt tgtgaaagta tactctcttg 39241 gggcttgctg tacaggaatg cctgcatttt gacctttatc aattgccctg atagtatggt 39301 agtaaaactt ccgacctgca actctgcctt gaataaagta tgcgctgctt tcatctggac 39361 gaggacgata taaggaaggc ataaatccgg tggaaaagcc cgtgctttca cacttagcat 39421 cgttgaaatc aagcaacccc 
tgccaatcta acgcaccaaa aacccgactg gctggacaaa 39481 gttcttcttt gttgcgacaa ggtaggcgtt cggtaggtat ttgactttta tacttgggag 39541 taacaactgc tagagtactg ttagtaatcg cttcataaac agaccggatg cagcctttga 39601 gggaactgcc accaattgta agcttttggt caacaccttg tatcattgtt ttaatgagag 39661 gaatgcgact gccaatatcg ctacccatga caacgactcc tgtggagacg tggagagatg 39721 tctggacttt tagggtaaga tgtaaagtac cgtgtaggcg atcgctcaaa tatttatgat 39781 gtccaacagg acgctgtagg ttaggacgtt ccttgggaaa agagacgaat tcataaggtt 39841 tcggtgcaag ttcaggattt gatgaactag tactgctagg acggttagat ctgtttggtt 39901 gttgaggtcg ttgcggacga tttccagtca tggtagcgct acctccacag tcaaagcgat 39961 aaattgcaca gttgaggttt cattatcaat aaaataacgt tgagctaatt ttggcttatt 40021 gttttcttct tgtcgaatat ccaaactctc agggaaaatt agctttttgg gaaaacgagt 40081 ctcatccgga cgatacgcat aagcgggata tccataagga ggattattag gttctatccg 40141 ccaaggttct ttttcttctt tgttaattgt ttctttgact gggataaact catcatgatt 40201 gtcagcaata gttagcagca acacctcata agtattttgg cgcttgtact tccaccgcag 40261 ttcacactta tgattaaaca tttgcccctc aatcatggga aagtcatcat ttgtgggtgg 40321 tttttctaca attccactga cttcatgagt ccaacgtaag aaataatagc taggttcact 40381 tatgaatttt ggaatgaatt ctagcagttg aggtactgag aaaagttctt tcttataacc 40441 aacaaaaggt ttcattttgc ccccatcagt aaatgactcc aagacctaac agaacgctca 40501 aacaaatctt tgactgtacc ttctcgccag gtgagttgca ctccaaaacc caaatccatt 40561 tcttgtgctg tagctggggt atcttgacga tcttgtgtag ggaagccata taattgtgct 40621 tcttgaagtt ctaaaaattc tccagcacct aagaggaaag tcttagacca ttttttctca 40681 ctacccaaag cgcaaatttg ttgtttatct tccgataaaa tacatccagg atattgtaca 40741 acggctttat ttaatttcac ctgtactgta cccaatccgc gagatttagc gaaacccaaa 40801 ccaaaccaac catcatctaa atctcgcaac actaaaccaa ttaaacctaa ctgtgcaaga 40861 gtgaagtttt tcaaatgaat tttggtatgg aattcaccag cagtgcaaac ttgataattg 40921 aatggaccga ctgcgacaga accaaagact cggtcaattg ctaccccatt acgttcttca 40981 attttgagag gttgagattt atcaggataa gcatcttcta ttctgactcg gctggcgatg 41041 gaagtattgc caaacatttg gtctgtaaaa gaagagagtt tgtaaatctc cggtgcaggt 41101 
aagtctttac catctttact tttgagataa tcgtatttat catttaatgg gtcattcgcc 41161 caaagcagat tccagttatc agaatcgcgc ttgtctttac ctaccgtcct aacaattcgt 41221 tctgcatgag cgcgaattgc tccttttaaa ctgcttcctg gcaaatatat agaacgtcct 41281 ccagcatgat aagtttccac aaattccata tccggcttag ttggatctgc accttcttta 41341 ccagatttga tgagaattgg atcatcagga attagggtta tatcaatcgt acagtggttt 41401 acaaatcttt tatgcatgac gaaatttagt tggtaataat ctggaatctt aagcagctac 41461 cagattgttt agctgcaatt cacactgttg aaagacataa ttgacgacag caaagagttt 41521 atctatttct tgaattgtgt aaaacgtgat gtttctcgta aacacttctg cttctagctt 41581 cccgaattgc caacgatcac cattagatac aattccaaat atagttattt gaaattcgtc 41641 attaagccgc tgtgctgcta tcatttctgc taaacattgc gcccatcctg cttcaaagtt 41701 atcttgcttg gcttctacca agatgaagta tggtttgtca aaaacaactt ttcctaaagg 41761 agaacgtttc gctaaaatat actcaggaaa acctgatagt ttttcgtcat agttgagtga 41821 atgatgactc cataaaatga acttactacg gtagcatttc caaatctctt tgaggactgg 41881 ataaatcaga ttttcacaaa tagcaaactc tgaactatca acaacaccat cccgcatcat 41941 ggtttctaag tcttctcgga agtaatcagg tatcttaaat tcaacttcgc ctacgaaatt 42001 agcttcggtg tagataactt gaaatgcttt gagaacttca ccaatagttt tgtagctact 42061 aaaagccata tttacctctt cgcttcagtg ttatttttat gagatgcttt actttcaaga 42121 tgttcaatta acttttgtac ccaatggtct ttaaaatccc gtgcatcttc ataagctttc 42181 ttatctccca taaccaattt tttgagataa tctagtaata actgcggttg gtcttcggga 42241 tagtcaaacc aaagcatctc gttaatgtcc agtttaacaa cacccaaccc gcgcgatcgc 42301 ccaccaccta aaggtatctg ttctgtttga aactggtgca aaccaatcat caacaatcct 42361 aattcccatt cttcagcatt ctctaccaca gcctgaaact caaactgtgt cccagcagga 42421 acgacttgga aatcatagag ttttccctct gcggctgttt ctgtatctct atctatggcg 42481 acaccatctc tttcttgata ttgtccaaac caggtatcag gtactatggt taaatctcgg 42541 acttggaatt tgctggcgag ccagggagaa ccaaaaaggt gagaagctaa atcagtttca 42601 tcgataatct tttttgtcag tagttcatct cttttcgcgt tgcgttcttt ctctgggtat 42661 tgcttcagtt cttcttcgac ttcttctttg ataccattct taccattgag acgttcgttt 42721 gtgattgacc attctgcttc gatagcagga ttagcggcaa aatttgggtt 
gataccacgg 42781 aggaaacttt ccaagcgcga tcgcattgca cccttgaagc tagaacctgg tattaatggt 42841 ctacctaaag catctttaat gacaggtaaa tcagaaccaa tgggttcagt agaacgtcct 42901 gcactaatac gtagtgctgt tattgtagtt agtgtgcctg tgatttttag gcggttttta 42961 aaaatttcga acatagtgtt taaatttcta aatgattggg aaattgtggt ggctgtaatt 43021 gtaatttagg tggaataaca gctgtaagag cagattgaat aaaattgaca aacacttcgc 43081 ttactttctg ataattatta ctagtaatcg gttgaaatcc atgagcaaag agtgagttat 43141 tacgaacttc tagggcatta ataattttgt ttgcgctttc ctgataaaac tgtcctatag 43201 ggtcattagg aagttgattt aaaagctcat aactacttct caaagaaagt tgaatcaatc 43261 ccttaacagg agaacgtttt ttctcatatt catcttgcag atattctgga agttgctgag 43321 gattcacatc acctgttcta ataccataag atttcaacag acgaatctga gctaataatt 43381 ctaaagctct gtaaagtctt cctacagcat catccgaacg ttcttgagca gcacgtcttt 43441 cagcatttcg tagtaagtct tgaacaattt cataaccgtg tccttgtgta ccgttactgg 43501 aatcaaaatt tctatctatc tgtcctctac tattaataac ttgaatccag attactagct 43561 catcttcagc ttgacgtatt cctttttgaa tttgtaactc tgtttttgat tctatagata 43621 aactcataca gtatcctcta ttagctcatc tctcaatact agatgaaatg ggttgcaaat 43681 ttgtacttga ccgaagcctt cagaagtgcg atcgcctact cctttccatt ccaactcctt 43741 taatttctcc atccataaac ctggttcctt tgtactaaat aaatatacag cagctcgatt 43801 ggtgactaat tctatatctt tcattaaccc ccaagctgca ttccagccag aacgatagtt 43861 atagctgctg tacgcgactt ccaatttaag aaaatcctcc tgttcttctt tactaagttg 43921 ggaaaattca tcatttaatt ccacagcttg gcatagcatt tgtggtgaaa tcactgtggt 43981 gtgtcgccag ttttctgtga gaattgcatc ggcttgcagg tcaagggtaa aataggtacg 44041 gttttctaac aagttctttt ctggcttacc aaaaactgac caaagttccc agcgttgctg 44101 aagtttattt ttaaattcag tgattctagc ttcgacatct gctgaagcat cttcaacttc 44161 tgctttgatg ataaccttcc ctagacctcg tgaagtcgca ccacctaaac gaaagaaatg 44221 agaattttga ttgatgaaat tttctagaga cttggctaat tctgcgtctt taactaaaac 44281 tgaacttcgg aaaataacag gacgccagtt tttaaaacgt gcttggggat tttctaaaaa 44341 tgactcgttt aaaacttcaa tgctgtagag aatatcatcc tcagatgtgg cgcgtcggcg 44401 gttaattccc acccttgtta ggaagcgagt 
tgtagtagaa tgactacgat atttatatag 44461 aatttcacta tcgttagttt tgctgtagaa gccaccaaaa ggttcaaccc gtgcatctgt 44521 tttttcttcc agagattttg gatcgctagg gtcgtaagga aaattatatg catctgcaca 44581 gaagcggtca atcagagtat caaaaactcc gttacctttg ggtttaaaac ctgagtttgt 44641 tttagaacta acggctgttg caggtagaac catgacttca tcagtcacaa cctgagaaat 44701 taagtctggc ttctttttag tagaattaga ttcatcagct attttagcga cggctggata 44761 agcattttga aaaatggcag gttcatcacc caagaacaat gcttcaaagt caccaccgcc 44821 tgctgtgaaa ttttgttctg ctatccgtgg tgtttgctga ttggatagtt gtagaatttg 44881 ggatgcgatc gctcctcgaa tcactgaacc aggaatatga tcctccgctt cactcactga 44941 accaggtttt tttccggcga tcgctaacgg agacaaagtg gtaatttcta atttaatccg 45001 cttcatgact tagcctccgg taatagtttc tcccagactg aatcatcttt tgaaaatgtt 45061 tcaagttctt tccatttcag ccaacccagc ccaacagatt tactcccgcc aaacgcatgg 45121 atatgtcgca atcctgctaa tacgatgaag attttccgag cgaattttgc cgtaatatct 45181 gtggggagta tcgtgtaaat attgctcaaa cttgtcaacc caactttcta tagcttcttt 45241 accaggtttc cactcaaaat ctgcctgttc ataccattgt tgagaactat catctctttt 45301 agctatttgt cctaccactc ttttccgtgc ggatgcttct gaaaaccaag gatcaccagg 45361 aagtccttgg cttgaatcgt cgggagaact cttagcttgg agaatagcag aacggctaca 45421 tcaaacttgg gttctcgctt atctccccaa gccaaacacc agctaatagc cgttgtgatg 45481 ttgagttgag aatttgtcat aagtattttc aatcgctgct gttgttcttt attcttccag 45541 ttacaaggat acgtatcagt aatatttcgg aaaaaaaata cctatctttg tatcgatgca 45601 ttacacttga cttccttgat tgcttctaaa tcgttcgggt actcacattc acggtagtaa 45661 tcgtacacca gccaattgtg ctaaaaccca tacagtctgt tgagcaactt gcaatgcagc 45721 gttggtagtt tttaaatcag tcattacact tcacgatgat tgtgctgaac tgcgtgtgat 45781 tgtagcttat aggtggaact ggggtattcc aaatctacga gttagcaaac aaattagcac 45841 aactatgtaa ctattttagg tgaggactac cggaaaatac tgaattttcc cgataattta 45901 aaattgtgaa gtttacccaa ggaaccagca ggagtaaaaa atagtgtacg atgagccacc 45961 accagaccca tttttagccc aactatgtga aggatacacc gaggcggaag ttgttgaaat 46021 acagcggtat atggctgagt gggatgcttc tacttatatc agtgtggctc aaagtattct 46081 agaccatgct 
aacagaaaag aatttgaccc gttgaaatat ctacgcaaag cctacaattt 46141 taataaaaaa cgagcagtac gaattcccaa aattggataa cgtcaggatg gttcagcact 46201 ttgctgaatt ttacttgaca agcaatcgag actacggaca aatgaaagcc tatattcgga 46261 tgctggatta tttgcgcttg ctgataattg aatatctaca aacatatcaa agtccacaac 46321 cacaatctgc ttgatttttc atagtcatct agaaaggaat tttcatgtaa aaaaattaat 46381 ggcgagggag gttaaggcgc tttttaaatt cttgccctct tcgctgatca ggaatttttt 46441 ctactaaaac ttctaccaat tgctctggtg ttatttgtgg atttttatcc aaagtgtctt 46501 caagaacaaa actggcgaaa ggaccaacaa aactagtcag ttctcgccga caatactcca 46561 caaactctgg cttgatgcta gtcgtatttt cggctggcaa ttcgtttgct ttttcagtgt 46621 tttgaatcgg tttttgtgat tttcgagaag actctttagt ttctttaggt agtttagtct 46681 ctggtaatcc aaccttatta acaggatcac catgaattcg tacgttttga tgtgatggtg 46741 tcacattaat accgccagat gaactttgtg actgaagcag attcaaaact tctcttgctg 46801 attgataacg atatgctggt ctttctgcca gcattttttc caatatctga gtgagaaaat 46861 cactaacttt aacgtaagat ttccattgcc actgaaaaga ctgatctagt agcaaacgtg 46921 gcatttttcc agtcagaaga acaagagcag aaacacccag tgcatagagg tcgctacagg 46981 ggtaagagcg tcccatacga atctgttctg gtggagaata acctactttt ccaactactg 47041 aaccagcaac agagtgcgaa ggatttgaag aatcacccgc cagaatttga gtgaattttt 47101 gcttaaccac tccaaagtca atcagtacag gtttagaaag cttacgggaa agcatcacat 47161 tgtctggtga aatatcccga tgaatgatat tgcgttgatg gatatactct aaaactggca 47221 acaaatccaa taaccactga ataacttctg cctcggaaaa tggctttttt tgtatcgaaa 47281 ggcgatcgcc caacagtcga gagtaactat caccttcaat atattcctgg actataaata 47341 tccgctgttt ctcggttaac caagccagaa acttaggaat ctggggatga ttgagttggt 47401 ataaaatttt agcctcacgc tcgaataact cacgggattt gagaagtacg ggttctgctg 47461 tgtttgcagg gacaaattct ttgaggacgc aataatcccc gaaacgttga gtgtcaactg 47521 ctaagtaagt tcttccaaag cctccctgtc caagaagtct ttgaatctga tagcgactgt 47581 taatcgaagt tccggggttt agctctggta gcattagtgc gttgtggttt agtgagtaaa 47641 atgcaaatag aattttattc tagtgtctct tacttgccag gatacaccaa actacaactg 47701 acgaccgtta accgaagcgt attgcgagca cccggtgcac aagagcaacg ccagaattgc 
47761 ttctcacatt atcgcaatat tggcaacgcc ttatctagca aaagcatgat cgcgaataaa 47821 gttgattgtc caactgcata agttacattc gatgacattt actaagaatt tggagggaca 47881 agcctcaata ctttgagtgt gcgtgttctc ccaaaaaaca ttcaaacagg aataaactat 47941 gtcatccgaa cttaaagctg caactactgc tgctgatgcc caactgtcta atgatctgcc 48001 aactctaaaa aaaggtctac aaactgaagg agttagatta ctgcaacaaa ttctgattct 48061 ccgttacaaa tacaaaatta ccttcgacgc caacttcggt gataaaactg aggatgcagt 48121 caaagacttt caacgcaagt ataatctgag tccagatgga attgttggtg taaaaacttg 48181 gcgtgcttta ggcgtaaata ttgcctaacg ggacttagca cactgctact gcgccatgag 48241 taaaacttaa aaaaagcctg ggggataatc ccccaggctt gctcactgct atttgtcttt 48301 gcttttggct tactattaga gttccctttg aaaattccct ttcgatctgc ttttttgtgt 48361 gcgatacgca ttcgcgttcg cgctgcctgt aaaatgtatg ttaaggctac atttttttga 48421 cgttttgggc aatggtgcac agctgccgac cgaaggaggc gatcgccctt ggcgcgccct 48481 atgccccaac caaactccgt ttggttgccc caaatcggag atttggggag aggggttttt 48541 ccaaaacccc tctgggcaaa gaagggcgat cgctcaatcg ttgtatcttg ccatcttgcc 48601 atcttggtac atatgtacca tttcaagctt tttaccattc tcttcagata gttgcactag 48661 ctaagatagc cacctagagt cgccgatagg aagagaagac tctatctaca gtattggcaa 48721 tcaatcttac ttgatattaa ttatattacc tcagcaggaa gagggaatga cacgacatcg 48781 ctaatattaa acttgcatca catcaaacaa aaactttctc ttccttcaga ttaataactt 48841 catgaagaag cgaattttga gtttttgtca aagtctcata cctattgatt tttatggaat 48901 ttgtgtgcta tctgactatc taatattagc ttagtatttt tggacaacaa ataaatttag 48961 aataagaaag tctgaattat gaatgaccaa gagatgaagg gcaaaatcag tcgtgaaaga 49021 gattatttac tgcaaaggct ttgggacgca agagtcattg tatctctagt attgattgca 49081 gtgctggtga gagtaggcta tggcgataat agactagctc ctacacaagt tgttaataca 49141 tatgatgtta ccacaaagac aaacaatttt atcggtgaaa ccgtgacagt gagaagccaa 49201 ccgattaaaa aagtaggttt agcttccttt acggtaactg atcaaagatt attaggcggc 49261 gagcctgtag tagtcgtcaa cgcttcgggt ctggctttcg atttaccgac tgacagtgat 49321 acaagagttc aagttacggg tgatgttcgt aacttagata ttccaaacat tgagcgggat 49381 tataacctca atttgcagga cgaattttat aaagactaca 
tcaataaacc tgcgattatt 49441 gctaagtcaa ttctattagc gccgagagtg ggacaaatta ccaaaaaccc tcgcaagtac 49501 tatggtacaa aggttgcagt tatgggtaat gttgataata tccaaagtcc cgttttgttc 49561 acactcaatg aaagctactc cctaggcgcg gacaatttgt tggttttatt cgttgcaaca 49621 ccaaaacggg tgattaacaa aggtcagaca gtaggcatgg tgggtgttgt tcgtcctttt 49681 gtcgttgcag atattgaacg agactacggc ataacttggg atgagagggt gagacgacag 49741 ttggaagcag attacagaaa caagcctgtg ttcgttgctg acactatata cccataacta 49801 actcttatga ggttaaaccc cgtccttaag gacggggtct ttccttgtat tatctacttg 49861 cttgagtgac ttagaaggcg caggtttacg aggtcgttga aaatagggct tgtcccgtta 49921 cagatgctgt tggcgaaata tttcttaact gacatcttgc accattctca acaagaccta 49981 aaaaaccggg tttttgaaca gtaaattaag gtttgcaagt gtcatcattc tcaagaaacc 50041 cgatttttta ccctggtgca agatctgagt taacatttcc caaccaggaa aaccctatgt 50101 tcgggaagat atggaatttt ggttgggaga gagattgctg gtttgtgttg atcacaggcg 50161 agtcgctaaa agttttagtt ccgcaatcag tctattcagg gctttttgag ctgttttata 50221 accattcgcc tgtcgcccca acttcaattc aaataaaatg cgatcgcgta ttgtctcaaa 50281 gttcacaacg tcttgaataa agatgaaggc tgcccccttt ctcatttctg aattattttc 50341 ttgaagattc tgtacttggg ctttaagtct ctctatctcc tgctcatacc gtgtaatact 50401 tggtaggtgt tctccctcca acggtgtaac actcggtagg tgttccgcct ccgacggtgt 50461 aatacttggt aaaaggttcc tgctgactat ttcttcgaga tagtcagctc tggttaaccc 50521 caaacattca gcgatcacgc ccaatgccct ccaggtatcg tccgttagcc gtagagaacg 50581 cacagaacgg tagtcatcgt ttttgagtgc gaactttccc tgaatatctc ttttcatatg 50641 aagtttcacc atgtcttaca ccggtattgt aaacaacgga ttgggtcggg ttcaaccgtg 50701 tattacatga ttttggtgat tgaaaaaaaa ggtacgtgtt gttttaagcg acggctagtg 50761 cggcaaatcg tgatattctg gtttgggcgt gaggtatacc atctaaccaa gcaatcatac 50821 gaacgatatt aatagcaaca gcggtgattt ggtgctgcaa ccggactttg ggcaggttgc 50881 ggtaacgtgc tttccgtaaa ccaaaagcgt ttattccttg tgataaagtg ccttcgacac 50941 cagcacgggt gttataacgc tctttccact catcagtatc ctgagttgat cggatagttt 51001 gaataatttg gtgttcttct tttggacgca atcttaattt tcttggctca ctttcacttt 51061 ttgatttaat acacaaatgc 
ctaaaatcac aacggcggca gactttggag ggaaacttca 51121 cgttaattcc tgaattgccc caattatcta ttgcaggagt ccaagtcgta cttttttgac 51181 cttgagggca cgtaactgtt ttagcgtccc actcgacact aaatttactg atgtcatacc 51241 caccaggcgt ttttgcttgc caactgacgt ttggacgaac tggaccaata agttctatat 51301 caaaatcgat ttgacttttt atcagtaatt cgctgtctac gtaacctgca tcgacaatat 51361 gttcagatgg tgataagcct tttgattcca aagatacatg aattggttct gtctggtcta 51421 cgtcagataa ttgggcttgg gtcgtctcaa catttgtaat caaatgaaca tcattagcat 51481 cacaagtttc agtaatatgt accttgtaac caatccaggt tgtgctgcgt ttattggcgt 51541 aacgagcatc tgtatcgtaa ggagagtcaa aacggtcgcc tgcttttggt aagtctgctg 51601 cgttgcgcca acgtaatttg ccaggtgtat ctaatggtac atctgtgtag tctatataat 51661 attgatgtag ccaagtttgg cggagaactt caattgctgg aatttgccgc agccattggg 51721 gtgttgttga cgattcgtag atagatgcta atagctgcat accatcatta ccaatagttt 51781 cagcatattc ataccgagcc gcaacacatg cacggtaagc ggtactcatc tacggcacgt 51841 ccgtaacgtt caaaccactc ttgtggaacc caaccacgta accaatctgg tgcgacagtg 51901 gcgaccgcgt ttaatgcagc acgtaaggtt tcggctacgt tttcaaggcg attcaaattc 51961 cgaactgccg ccagcacatg agttgaatca gtgcgctgtt ttcctttgtc ttttataata 52021 cctttttctt gaaactgctt taataaaata tccagtaatt tttgttcaac accccccgct 52081 attagtcgcc ctctaaattc gcatagcacg ctaaaatcaa agccggggtc ggttagttct 52141 agagaaagtg cgtacttcca atcaattcga ccgcgcactg caactgaaac gcttgcctgt 52201 cagtcatgtc gttgatgtat tgcaacacac ataccagagc taatcgccaa gggctaaaag 52261 ctggttgacc tcgtgttgag aatagagctt caaagtcttg atcgcaaaag atagtaccga 52321 tagaatccct catttttata caagtgttac ctttggcaaa ggctaaacga gcgatgtgtg 52381 ctgtttcttc tggaattggg tcgatttctt gaagttgaat cattactatt ctccaccagt 52441 agacacccta acgatacaaa taatcgaacc gtacccagag gtttaatttt ttcgcgtact 52501 cattggcaac tggagtacac aatcacaagt tgaatcttta acttgtaggt attgcctctg 52561 aatgtttggg attaagtaga gccgactacc tggaggaaat agttagaaga aagttttatc 52621 cgtgtaatac atggaaaagc gtcgctctcc aaccgtgtaa tatatggaaa tcctcagagc 52681 tattaccgag tattacgcgg tatgaacagg agatagagac gcttaaagcc caagtataga 52741 
atctcacttc acgtaactct gaaattaagg aacgctcagc cattaccaaa gtgcatgatg 52801 ttgtagatta tgaggcggta cgcgttagcg gagcggcgcg gaacgcgctt cgcattttat 52861 ctgaattgaa attagggcga caggctcatg cttataaaac aacccgaaaa gctttaaata 52921 gaatgattgc cgagctaaaa ctcctaaaat attgttgaga acgattaccg acaaaatctg 52981 tttgaaatag tactgctcaa tgtttttcct tacattgggt tttatgggct caaatgtaac 53041 gaaatatttc gccaacagca tccgttacaa aacaaggcgg tgagcaaatg caacccactc 53101 gtctaatcgt ttacaatcaa cgatctcaaa tcctgctttt gttagagcct tgtcaacatt 53161 atcctcatga tcagcagtaa aacccgctgt aatcagaagt ccgacctgtg catcagtttg 53221 acgcagcgca cgctggaagt catcagcaag agcaatgtgc attcgtgcta gaatatttgc 53281 gacgataagg tcaaaagttt ttgtaggttt gatgcttggc acattgtcga taatgtctcc 53341 acccatccaa tgccccatct cacttccaga ccccaagctt cctttcatca cccttacctg 53401 ctgttctact ccgttacaac gtacggcgtc ctgagtggct tctacagcaa tgctatcatt 53461 gtctagcgct aagacgtttg ccccaagctt cgccattgcc acactaagaa tacctgaacc 53521 tgaacccaag tctaagacat tcatggtggg gatgatatac tgctcaagca gttgaaggct 53581 gagaatggtt gctggatgta aaccgctgcc aaaacagagg gttgtcttca gtctcagagt 53641 gatttcgtct gctgtctggg attgtaatgg tgcatcagga gtcaccacaa cgaaacgttt 53701 cccaatccga tgaactctgg agttgagtcc gtttgcgtct gtgggtttct cctcaaccac 53761 actcgtctgt atcgcagttg ttagtccagt acggtgcaag ggcgagagca aattaacgat 53821 tttttctaca cgtgtcctcg actgggcatc tagtggtaga tataagtaaa tcgtgaacgt 53881 ccattgaggg tgctgatcaa ctgggtgggg taggttaggt tctgtatatt ctacgatatt 53941 aatgtcatca atatcgatag tttctgcaag cagcgtacag acccaatcaa tagcctcgtg 54001 tgttgtatcg aggcttaact ctatccacgg catatttttt ttcttctcaa tcaaatacag 54061 ggggccaaat ttcagagact gcaagttccc actctgggaa caactctgag atagtcagct 54121 tttcaccatt ccctagcact gtcgccccat tgcttaagcg atgaacagcc acagtttgag 54181 catctggatc aatgagtagc gcgactttca ctcccaaatc taagaatctt tgaattgtct 54241 gttgtacaga tataagtcgg tcaaaggcag atttaatctc cacaacgaga tcaggtacta 54301 attcaggata agttcgcggt actcgcttca agcgttcccg tgaaaagaat gaaatacgcg 54361 gcgcagctat ctctccattg ggtaactgaa atccacctct agagccaatg 
atatgtccta 54421 gcttgcgcgg ttctacccaa gaaaacaaaa atgcacaaaa tcgagcgcta atttcaccag 54481 aggtaaaatc aggtggggaa tcattcaagt tatcctcact gccaaagtct ctactaggag 54541 caatatttac ctgtggtatg ccaacagatg tgatttctct gagcacttct tgctttagat 54601 tttgaaggtt tatgcctagc ttttccaaca ccctcactgc taccccatct gtaaccatca 54661 atacacccag taaaaggtgc tctgtaccaa tgtgtgtgtg acctaatttc tgagattctt 54721 caacagcaaa gtccaaaact tgcttacctt taggagtaaa gggatactct atgcctttaa 54781 ttccagaacc acgaccaata attttttcta cctctatttg agccgcttcc aaggtgacac 54841 caaaagactt gagaatttta gcagcaagac cattaccttc tgcaattaat ccaatcaaaa 54901 tctgttcagt acctacaaat tgatgtccta agcgctcacc ttcgccttgt gcaagtacaa 54961 ttgcacggta tgctttggcg gtaaatcttt caaacataga cctttcctct ggtaagaaga 55021 tgaattgcat tggacagcga aacaactggg caattttaaa ggctagagac aaacttgggt 55081 cgtatttacc aacttcaata gcgttgatgg tttgacgact cacgcccaaa cgttgtgcta 55141 attccgcctg tgaccaattg cgttgagttc taagaagacg taaacgattt tgcattgctt 55201 ggctgaaaaa aaactagttt atttgtatag tttttgccta gcctgtatcc tatagtaaac 55261 ttaacttgtc tttatgtcaa gtttacttta ctttaactcc caaagctata tttcataccg 55321 ttttcaggaa atgacaggct agtggtccgc cacattaatt ttgacaggtc gtgaggggaa 55381 tcttgttttc tttggactct cgcccaaatg gtgacgaggt atggcgttcg cgtagcggct 55441 gctttgcagc atcgccgccc ctatcttaca ccaatgctcg tttggtggga gcgagcgctg 55501 caattttgat cagcttttgg acaaagaatt tgctattaac tattgtgttg cgaatgctga 55561 tatttttttg ttggcagtgg cttgtgcaac tgcattaaaa cttgattgaa gtttttactt 55621 gtaagtaaat ttcattcttt gaaaatgcga tcgcagcgtt gcttacagtg ctgcgatcgc 55681 attcatctat gaaatagcga tcgccttcat cacctaaatc cccgcaccat ctaaaaactc 55741 tgaaatcatc aaatcactac tggacttgcc attgttcgtt tgtgagtcgc tgacttgcaa 55801 tgcgtgtaca gaaggttgta cctctaactc ttggacttgt ttttctaaag ctttaacccg 55861 ttcaaataaa gcgcgaatca cttctgcttc cacatcccgc agtttattgt gagccagaac 55921 atcggtgctt tcctcagcct gacgagtcac acgccctgga acccctacga ctgtggtgtt 55981 gctaggcaca tctcgcagca cgactgaacc tgctccaatg cggacatgat ccccaatatt 56041 aatgttaccc aatactttag cacctgctcc 
cacaacaacg tgattgccaa cagtaggatg 56101 gcgctttcct gtctgtttgc cagtaccgcc tagggtaact ccttggtaaa ttaggctgta 56161 ctcacccacc actgctgtct caccaatcac gactcccata ccgtggtcaa taaaaactcc 56221 cgtgccaatt tttgctccag ggtgaatctc aattccagta aaaaaccgac tgatatggga 56281 gatcaaccga ggtatgaagg gaatccccct ttgataaagc cagtgtgcca ctcgatggaa 56341 gagtatggct tgaaaacctg ggtaacagaa taacacttcc agccagttac gggcggctgg 56401 atcacgttca taaatcgttc gcaaatcagt tagcagcata atttgtttct atctcgggaa 56461 gttcataaac aactcaaaat atgcaccgac accaaatctc aacagacaag acaggtgtgt 56521 aagttttgta atcaatcaag ttatttgtga taattgtgtt agcaaaacta cggtaaaacg 56581 ataagtttat gatattttat gagtagatag cgcaatatta ttacatttaa cactgccatg 56641 agaatgaatc tatcataaaa ttacaataaa attatcgacc aacagtagta tattgagtac 56701 aaacttgaat agccaaaact acaccctttt ggatctgtct tccaaagttg aatacgcgct 56761 gctagcactt ttagaattag caaccaacaa gggaaaaaaa actcctctca ctatgagtga 56821 aatgactgct aaacaaccca tacctgagcg ctatctggaa caaattctca ccagtttgcg 56881 gcgagcaggt gtcatacaga gtcatcgtgg ctcgcgagga ggctttgttc tggcgcgtga 56941 accttggcaa atcaccttgc tggagattgt cactttggtg gaaggagaac gtaaagacag 57001 agaaccctct gtgacttcca ctttggaaag ggatttggtt catgagatct gggagcaagc 57061 caacaccgcc tctattaagg ttttacagaa ctatacactc caagatttgt gtcagcaaag 57121 agaggctcgc ttgcagcaag gtccaatgta ttatatttag acattctcgg agttaccata 57181 tgcagatagc aaaaaacatc acagcattag gcagaccgat tcttgtcgtt tgtttcaaag 57241 agattcccaa acagttggag attgtttctc accaaaccag tcacttctgc caaaaactcg 57301 attgaaatga gtaggaggaa acaacaaaag aagtagactc aagtttaatc agagaaacta 57361 tcttggtcga gtcaacagcc acctcacaca ggaatcacgc gattcaatga tgggatttac 57421 caccggatga ccatcctatt ctaacaatcc ttcataaaag gtgcagggtg caggggtact 57481 cctaacactc cacacccagt ctcttcactc aactatgact gtgtttaatg actccagagg 57541 aatactgaag acgtaataga taaccccaat tccaagaagt ataacagcaa ggctagagag 57601 gatgaccaat cggtctgctg gttggtaggt atcctcctca atatcacggc gtacaacaag 57661 ataatgctgt gttgaaagca tgactgttaa caaaccaact aatgagaagg ctaaacctaa 57721 cttccaacca 
tcaccaggtg cttgaggtgc aagaggagga cgaagaatac gcaagcggac 57781 aatcaaaaca ccaaacccca tcaaagcaat tccactcctc atccatgcaa gataggtacg 57841 ttcatttgct aaatgttccc gcactttatc tgttgaacta ggtttatttg acttttcttt 57901 ttctgaaatt tctgcattcg ccttcataaa tttgaatggt agttgcatac aaaacttttt 57961 aaattttttg atgaatttga gtcaaattta cttctgtgaa agtttatttt cctctgtctt 58021 attctatatt ccagatttga caataaaatt taatagtgtt atcttgataa ttgtgaatta 58081 tactggttaa ttagtcaatt ttgactgtat ccaataatac tttttaaaaa tcgatatttg 58141 atgacacaac aatttcaaag agattgaaat gactttgact tgttaagacg gacttgatat 58201 agtaatccta tttgattttt taaaatcgtg catgttcaga ttccgttggg tttcacttcg 58261 ttcaacctga gatattgcat cactttcaat tagcccattt tgaccgcctc ggaattaatt 58321 ccgaggctca cagcccaagt ctactaaagt agacttcaag gcttatgcag tcgtctttag 58381 acgacttttg ctatgagact gggatttaaa tcccaggcgg acgagaatgc aagcttgaga 58441 atcgtgcaag atctcagttc aacccaacct acaaatatta aaagtgtttg acccaactta 58501 gtcaacaaag attaacatct caaaagtcct ctcaataaga ctttcgctag aagccagaaa 58561 catcagttct tggctgtcgg gttgacacaa cgaatgagtg ctgttcagtc attaattggt 58621 gcaaggtatt ataaggaggt tctggcacag ggaacatcgt actcccagga attcccacac 58681 tccctaacac cggaatcatc atgaacccaa ccgccagcac actggatacc agatgatgca 58741 cacgcagttt tctgattctg tacaagtaaa ctggagccgc gattgaaatc aggatataaa 58801 ctgtgagaaa tccataacta cagatagccc ctaaataacc catgctctcg aacaatttga 58861 tgtgaaacag ggacatcacc gctggaacta agaaggtgat gaatgaacac atcgtcactg 58921 caacgtgggg tgtgcggttg gatgagtgcg ccgtacccaa gcgagaatgg aacaatccgt 58981 ggcgtgccat tgtgaagaag attctggcag caggattgat acaacccaac acacaagcaa 59041 agaaactaaa cagcgcgcca aaagctatca aatcgcctag gtaacccatg cctatctgtt 59101 gcgacaagaa acccaacggt tcctctgttt tggtaatcga catacccgtg tcgcgaaaac 59161 ccagtacttc tatataagtt gtggaaatga agaacaaacc agccaaaatc gcactaccca 59221 ttacggatct agggatagtt cgcagtggat ttttagcttc atcacccaat gaggtcgcac 59281 tttcaaagcc agaaaaacca aacatcacca gcactaaccc tgttgctaag ttaccaggag 59341 tgacaccttc tagggttagt tgtggtatgt caattgcaaa acctttatgt gcccagatga 
59401 gaagacacaa tcctgcaatc aggacaagag aaataccttc catccatagc attgcaatcg 59461 cagaaagttg gatatcttta tatgctgcat accaagcaat acccgcaccc aaagctaaca 59521 gggtaatact cgaaggatga atccctaaat gaccaattaa gacactacta aagttggcaa 59581 aaccacacaa cactgacatc cctgtaaaca aatacgccag taccaaactc cagccgcaaa 59641 tcacacctgc tgttggacca agtcctttgc taatgtacga atacaatgaa ccgggtgaag 59701 ctgagcgact ggcaaattga tttatattga tgctgacaag cactaatcca attaagccaa 59761 tcacaaacga cagccaagtg ccatttcccg atagggcaac aattaagccg atgttggatg 59821 ctgggatcgt agtgggagcg atgacggcaa aggattgtgc tagtacttct ccaaatgata 59881 aacagttcgg ctttaagccg tgagcactcc ttttctgtcg agtttcgtca gtaatggtca 59941 tggtaagcaa attttcaaga gtgtagatgt ccgagtaatt ttggtaaact ttgctttaac 60001 gaagtaccat agccgtagca cagtcttcac tttattgctc tgtgtcagta aaagcaaata 60061 ttcttttttg tcaaagtcat agattgaatt tattcatttt tcgatttgat gtcaaataaa 60121 tcactctatt attatttgca tttacagtta acaaatgatg attatcttgg ctgttcttgt 60181 gttgtgataa cacttatggt attgttaact gtacaggtta ttattagctt tattcacaac 60241 tcagtatgtt agaaaaatga acccataaag tcactgctta taccactttg cctgatttat 60301 tactaatatt taggcacata agttacagcc agagatgtta caaaaatttg atttttaagt 60361 acggtagttg cctctaatta ccgtacaata tgaggaagtc atcaaccata gtggtcagac 60421 tcaacaccca aaaagcctta aggttcaaaa tgacgcttga gcagttgcga atttttttgg 60481 cagttgcgga actgatgcac tttacccgcg ctgcagaagc gctttatatc actcaaccag 60541 cagtaagtgc ggcaattcaa agtttggaag cagaatatgg ggtgagacta ttccatagga 60601 ttggtcgcca tatcgaaatt actgatgctg gtaaattgct gcaaatggag gcgcagaaag 60661 ttcttgacca agtttcttta actgaacgag gcttgaaaga actcaataat ctgcaacgcg 60721 gtgaattaaa gctgggatca agtctcacca ttggtaacta ctggttacca gagaaaatta 60781 gccagttcaa gcgccaatat cccggaattc atatcgactg cacgttgggt aatgcagaag 60841 aaatttgcga agggactgcg acaggatttt ttgattttgg tttagtgaca ggagacgtta 60901 aaccgtcact gaagagctat ttggagcaag aagttgtggg gagcgatcgc ctgcaaattg 60961 tggttggtac atctcacccc tggtttgagc ggacagaaat ttgcccagca gaactgctag 61021 caaccagttg ggtgatgcga gaacccggtt ccggggcaca 
gcaaatgttt gagcaagcct 61081 tacaaaattg gggaattcag ctgactgagt tggatattgt tcttgtctta agcagtagcg 61141 aaatggtgaa agcagtagta gaaagcggtg ttggtgctgc tgctattccg gaagtgatgg 61201 tcaaaaaaga aatacagcta tctacacttc acgccgttta tgtggtagaa agaaactctg 61261 gcaccaagct agacattgtt caacctgttt ggaagctcaa acacagacag cgttttcaga 61321 ctcgagttgc gatcgccttt gaagagattt tgaccgctgt ggaaagtcca gagtcaactc 61381 gtcaaaactc aaaagttttc gactcggaac tcaactaata cggagttgca ttcaaagata 61441 tcaggtagtt cttgggacaa ggcagacaag ggagacaaca caagacagta aaatgtatga 61501 acgcaacttg gtattgatcc gagcttgata ttaatcactt ttttttatta aacaaacaaa 61561 taaaaatatt tgattttatt aataatcttc tttatacttg tcgtaagcct acaagcacaa 61621 gattagcttc ttcgctgtaa gctccgcacc aaagtgttgc taaagccgaa attgctaacc 61681 acgcatctgt acctagacaa gtgaagatat aaggttttac tgggaagtgc ggtgaggcag 61741 tatggtcttg gggtctcccc aagtggagta actgccgaac ccgtaagggt cttaaatacg 61801 ctgtcaaaca gctgactaca gccgaccgta tgtaaaatct tcaaggaatg taggtgaatg 61861 gtttttcaat cattcgagtc gagaagggcg cgccttgact ataaaccgcc gtagtgggaa 61921 agtcgagtac cacgaaggaa tttctggaaa gataatttct gccagaagct tacaccgtac 61981 tgcaagcagt cagtgtagga gcgtcaccga gatattgcct gtaatcctaa ttcttatggt 62041 tgggctgaca cccagcttga tcacaggtaa taactaaatg ggtctatatt ttctcgtcta 62101 tggaactctc taataaaact gagtacgcga ttctttccct aatagcgcta gcagcctgct 62161 attctagcgg tgaatcatta caaatccgag aaatagcagt acgacaaaac ataccaaacc 62221 gctatttgga agaacttctg gcgacattaa ggcgtggagg tttaattaag agcatacgcg 62281 gcgtcaaagg tggctatgtt ctggcacgag aaccccgaaa gattacactt ctagatgctt 62341 tccgttgtat ggaggaagca gacgaggatg tgcccaataa aaaatctacc ccgacatcag 62401 tagaaactga agttgttcaa gaggtttggc aagaagcatg tgaggcagct tactctgttc 62461 tacagaagta taccatccat gacctttatg agcaacgagg gaaacgacga cagatggaat 62521 ttatgtacta catctagtct tacgtcgcta atccatctcc tgattaggaa tcgcgaatta 62581 ctaaaaatct tgatttgtag atttgtaggg tgggcactgc atagcttcct atattatggg 62641 cgtggggtga gtggagagtg cccaccctac gttggaggag catgaatcag gcacgcccca 62701 ataccgcttt ttggagcagg 
agaaaagcct aaaatcaatg caaattttaa tgataagaat 62761 agaaaaataa aagtaatttc attattgaaa taactaatga ttagctttag aaattgacat 62821 tttacaattg gagtgctttc caagcagaat gagagttaag aacactcacg ccaaaagatt 62881 gttggaggca caagcgtatg gttgcactca ctgctcgttc tgaaaatact caatcgcacg 62941 gtcgcttgac catgcaaaca gttgatattg gcgttgaaac gatcgctatt cgctctctgg 63001 attgggatcg ctcacgcttc gatatcgagt ttggattgaa caatggtaca acctataatt 63061 cctttctcat tcaaagtgaa aagacagctt taattgacac ctctcatcgc aagtttaagc 63121 agttgtactt agatgtcttg aaaggattga ttaacctctc aactttggac tatctaatta 63181 ttagtcatac tgaaccggat cacagtggac ttgttaaaga ggtgttgcag ttagctcctc 63241 aagtgacggt agtaggagca aaagttgcca tccagttcct ggaaaatatg atacatcagc 63301 catttcaatc gctgatagtt aagaatggcg atcgcctcga tttaggcaac ggacatgagt 63361 tagaattcgt gagtgccccc aatcttcact ggcctgatac tatttttacc tatgatgcta 63421 aaacccgcat cctcttcact tgcgatgctt ttgggatgca ctactgcgac gatcacacct 63481 ttgatgaaga ttcagaatta atcgaagccg attttaaata ttactacgac tgcctgatgg 63541 gtccaaatgc ccgttccgtg ttgtctgcta tcaagcggat ggaaaaatta gatataaata 63601 caatcgccac agggcatgga cctttattac aacatcattt gtcagactgg gtgagttgct 63661 accagaaatg gagtcaggaa caagccaaag cagatactct cgttgccctg ttctactgtg 63721 aagattacgg ctcaagcgac caattggcaa gagcgatcgc tcatggcgtc cagagaaatg 63781 gggtagcggt agaactggta gatttaaata ctgctgaacc tcatgaagtc cgggaattgg 63841 tcaatcaagc gacgggtttg gtgattggta tgccatccca atcgaatcaa aatgcccatg 63901 cagctttaag tactattctc gcagctgcac accgtaagca agcagtcgga ctgtttgaaa 63961 gtggcggtag ggaagatgaa ccaatttatc ccttgcgtaa caagtttcaa gaaatcggtc 64021 tgacggaagc ttttccaccc attttgatca aagaaccacc aacgcacatc accgaacaaa 64081 tgtgtgatga agcggggact gacatcggac agtggttgac tcgcgatcgc acgatcaaac 64141 aaatcaaagc catagataat gatttagaac gggcattggg acggttaagc agtggactct 64201 atattattac cgcccagaaa gcagatgtca ccagtgccat gttcgcctct tgggtgatgc 64261 aagcgagtat gaatcctttg ggagtcgcta ttgcagttgc gaaagacagg gcaattgaat 64321 cattacttca cgtgggcgat cgctttgttc tcaacgtcct agaagaagac aactaccaaa 64381 
gcttaatgaa acacttcctc aagcgtttcc ctcccggttc agatcgattt gcaaacatta 64441 aaacctaccc cgccagcaat ggctgtccga ttttggcaga tgcactagca tatatggaat 64501 gtgaagtcac gacgcgaatc gagtgcagcg atcactggat tatttatagc agcgtccaaa 64561 ctggcagagt ggctaaacta gatgccctca cagccgttca ccatcgtaag gttggtaacc 64621 attattaagt tatcgcgcct aaaaattgca cattgattcc ttataccaat tctccatgag 64681 gatgcactta atgtttatt // LOCUS NODE_304_length_60749_cov_4.99960560749 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 60749) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 60749) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..60749 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1167 /locus_tag="DP116_00710" CDS <1..1167 /locus_tag="DP116_00710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459833.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoglucomutase/phosphomannomutase family protein" /protein_id="PRJNA477356:DP116_00710" /translation="AFSWAAKQQNALGALVITASHNPGKYLGLKVKSAFGGSAPPEVT KQIEALLLQALPPASTPGKIEKFNPWGSYCEALEGKVKIEKIRDAIAAGKLTVFADVM HGAAAGGLAMLLGNEIKEINSNRDPLFGGGAPEPLPKYLSHLFEVMQNHQKTNQGGLA VGLVFDGDSDRIAAVDGDSNFLSSQILIPILIDHLTLRRDFKGEIVKTVSGSDLIPRL AAVHNLSVFETPVGYKYIADRMLEAQVLLGGEESGGIGYGSHIPERDALLSALYVLEA IVESGLDLGDYYRQLQEQTDFTSTYDRIDLPLASMEVRSRLLQQLQTQPLTEIAGKPV IDCQTIDGYKFRLADNSWLMIRFSGTEPVLRLYCEAPTLQQVHQTLAWAKEWAE" gene 1376..1960 /gene="rdgB" /locus_tag="DP116_00715" CDS 1376..1960 /gene="rdgB" /locus_tag="DP116_00715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494653.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="non-canonical purine NTP pyrophosphatase, RdgB/HAM1 family" /protein_id="PRJNA477356:DP116_00715" /translation="MTLLVVATGNPGKLKEMQAYLTDSGWELTLKPEELEIEETGDTF SANACLKASQVALATGNWALADDSGLQVDALDGAPGVYSARYGKTDEERIARVLRELG DTPNRQAQFVCVVAIAHPDGTIALQSEGICPGEILYAPRGSGGFGYDPIFYVQEKQLT FAEMTPELKRSVSHRGKAFASLLQELPHLNNTHC" gene complement(2103..2441) /locus_tag="DP116_00720" CDS complement(2103..2441) /locus_tag="DP116_00720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_00720" /translation="MRKVEAIIRPFKLDEVKIALVNAGIVGMTVSEVRGFGRQKGQTE RYRGSEYTVEFLQKLKVEIVVEDNQVDMVVDKIIAAARTGEIGDGKIFISPVEQVVRI RTGEKNTEAV" gene complement(2960..3550) /locus_tag="DP116_00725" CDS complement(2960..3550) /locus_tag="DP116_00725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873783.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00725" /translation="MRTISSSVHSSTPAANTQAYSPSVPLSVYRDLATELQTVQAKLD TLNAQNQQVVQENQLLRQEIAKAVESVLRLQKVVDSQAKINFHQVSQASSDSRTQTKP PITEPASREQISRPRAFYVGNSQTPVFSQNMEIPSSIPEPVFIEEQEVSYDPYKESEP LRIRGWWLIIGILLIIALGFSAGYLIVRPLFQSHSR" gene complement(3816..4637) /gene="thiD" /locus_tag="DP116_00730" CDS complement(3816..4637) /gene="thiD" /locus_tag="DP116_00730" /EC_number="2.7.1.49" /EC_number="2.7.4.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase" /protein_id="PRJNA477356:DP116_00730" /translation="MNSETTSKVPVALTIAGSDSGGGAGIQADLRTFAFHCVHGTSAI TCVTAQNTLGVMRVDPILPEAVVAQIRTVYEDIEVQAVKTGMLLNKEIITAVAEQVEA LQIHNLVVDPVMVSRTGAQLLDNDAVNTLRHLLIPKAAIVTPNRYEAQILSSLPINSL DDMRAAAQMIHRNVRAKVVLVKGGGMQGNLRGVDIWFDGHKMETLTTKHVETKNSHGT GCTLSAAIAAHLARGCDLITAVRQAKEYVTSALSYALDIGKGQGPVGHFFPLLNK" gene complement(4852..5274) /locus_tag="DP116_00735" CDS complement(4852..5274) /locus_tag="DP116_00735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00735" /translation="MERGLLWLPLLVAFFWLAWQGSQEYQKIEAYRTWAEQFEQAKYD IYAVLGHKGNNITWGKPTPKGPIKLETFSLLDVQSITLLVDFLQVEVEKPPEKGRTIE LEFLFSEPDKTVRVPFTEIPLAAEWGKYLQSQLQSLES" gene 5393..7249 /locus_tag="DP116_00740" CDS 5393..7249 /locus_tag="DP116_00740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997693.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S8" /protein_id="PRJNA477356:DP116_00740" /translation="MRKLILFCLFVIGLSVALFNFINFQGLAAKGEFETIVLNFREDI PKEVLQQDLQAIAQQYNVTPELDNQYWAKENVYIIKGDKQRLKELKKSPFAQVMDFIE PNYIYKIPKPGKATWLGELLDPSQEADEAKPTFKGPNDPYYSKQWNLHNIHVEGAWTQ TKGKDITVAVIDTGVTRVRDLIETEFVPGYDFVNDRVEAKDDNGHGTHVAGTIAQSTN NSYGVAGIAYEAKLMPLKVLSEYGGGTVADIAAAIKFAADNGADVINMSLGGGGESHL MKDAIDYAHRKGVIIIAAAGNENQNSAAYPARYPHVVGVSAFGPDGDKAPYSNFGAGV DISAPGGSDAGKILQETIDPDNKGTAVFMGFQGTSMASPHVAGVAALVKASGVKEPDQ ILEVLKQSARSVQDDSLNYYGAGQLNAEAAVQLATQGQITFQDFFRWLRDSGYLNPGF WIDGGAIALLPKILMVVGSYLLAWFLRVYFPFRWGWNLSWGLITGSSGLFFLKGIYIF DVPQWPFRVLGSSIPELGNTLQGTDAFNPLFASVLIPLGLMGLLLSHPKWKWFAIGSS LGVAACLAVSAVLDPAVWGLGSSILARMFLIVNAVLCFGLARLAVKNEEQPA" gene 7326..7559 /locus_tag="DP116_00745" CDS 7326..7559 /locus_tag="DP116_00745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873348.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00745" /translation="MSITVTGTIERRDIGTGAWALVTDKGDTYEILRGVDKNLLKQGQ KAKVTGLVREDVLTAAMIGPVLEVKSFEVINSP" gene complement(7546..7872) /locus_tag="DP116_00750" CDS complement(7546..7872) /locus_tag="DP116_00750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thioredoxin family protein" /protein_id="PRJNA477356:DP116_00750" /translation="MNLAVIKFSSEDCGICHKMSFYDKKVTEELGLQFIDVKMQDTAT YRKYRQILLTQYPDKSQMGWPTYIICDSPEGEFQILGEVKGGHPKGEFRSRLQEVLNS PASKES" assembly_gap 8218..8227 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 9491..12088 /locus_tag="DP116_00755" CDS 9491..12088 /locus_tag="DP116_00755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131510.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heterocyst differentiation protein" /protein_id="PRJNA477356:DP116_00755" /translation="MTQEFHISVTPVGQNDYLVRTEQVAPGVPLAEELVTWPVADWLT AAGHLMNDPLKSVLQGDAIARNSVNLVALGQQLYNALFQGTLRDSWITAQGIAQNHQQ VLRLRLGLKDTKLARLPWEVMHAGDRPIATGPYIAFSRYQNGIGRTSRLPSTGMPAPA EEGGLKVLMAIASPTDLVRLDLLKQEAIKLQAELHPGESGNYLPHNIELTLLEQPGRE ELTQALEQGRYHVLHYSGHSNIGPNGGEIYLVSSRTGLRETLSGDDLAGLLVNNNIQM AVFNSCLGTYAATSSGGVRDTGERNLTESLVRRGIRSVLAMSERIPDEVALLLTQLFY RNLSHGYPVDLCVSRMRQGLIAAYGSHQLYWALPTLYIQPGFDGYLSPQISLPQGEEL FNEYNPSLKTSAIIYSDQANDASMPLPLEDMLPSSLARDSFDDLDLLGEETWGDLIDE IEYDDPTYEEDSAIVSDLFRQLDQQKTSEQSSMKAELVQFGEDSLDEKEVSGEIASLE NDLGMWEEVREAASYGRQQDADPHELASNSQLSRQQELDLQVDWENSNVGPLTQTTAI RTPTAKPKQGKRRKVSRGSIGVSAIALAGAALCAIVAVVGFNWWSHNQQIPVFSPESQ QESRSTPQANFKTTETKVVSAIATDKLSKGELQIGLEAVEELLNRNLLPNAQAALDAI PEQFADNPSVYFFKGRLAWQLIKTGDNRYSVDDARRYWESALKTEPDSILYNNALGFA YYAQGNLNRANDYWFKALSFAVREQQSKSAASATTFLPSKPVPRDALTAYAGLAIGLY KTANNQPNGKREQYLNEAIKLRQMIIKDDPVNFEIKELTKNWLWTDKTLQDWKTLLQL KSQRSTLQN" gene 12404..13669 /locus_tag="DP116_00760" CDS 12404..13669 /locus_tag="DP116_00760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317678.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_00760" /translation="MSYCLNPNCPNPVNINNEKFCHTCGSKLVLKERYHAIKPIGQGG FGRTFLAVDEDKPSKPRCVIKQFYPQAQGTSTVQKAVELFTQEAMRLDELGKHPQIPE LLAYFTQDDRQYLVQEFIDGLNLAQELEQKGAFNEAQIRQLLNDLLPVLQFCHARQVI HRDIKPENIIRRTTTKSSNGNLVLVDFGAAKEATYTALNRTGTSIGSPEYVAPEQIRG RAIFASDIYSLGVTCVHLLTQRSPFDLYDINNDAWIWQQYLTSPVSNELSRILDKMLQ SIPVRRYQTVEEVLKELNQKPQLAPTPVTPIKPVTPSKPISTPVSTNQATTQIDTELE EMKTLFLQGGKSKSNKGRNVQPQPQTPQSSSSKSKNIQPQPQTPQPSSSKSKIDEELE QLKANPLSFSKSEIDEELEELKAKYKKGE" gene 13990..15492 /locus_tag="DP116_00765" CDS 13990..15492 /locus_tag="DP116_00765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141948.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="betaine-aldehyde dehydrogenase" /protein_id="PRJNA477356:DP116_00765" /translation="MSTLVQQAVGIPLSDQVSRFLTQPLKLLIGGKWVESASGQTFPV YNPATGQVIAHAASGESEDINRAVQAAREAFENGPWSKLTVSERGRLIWKLADLIEAN LEEFAELESLDNGKPISVARVADVPGAVDLFRYMAGWATKIEGNTIPISAGTQYFAYT VREPVGVVGQIIPWNFPLLMAAWKLGPALAAGCTIVLKPAEQTPLSAIRLGELICEAG FPDGVVNIVTGYGETAGAPLAAHPDVDKVAFTGSTEVGRLIVQAAAGNLKKVSLELGG KSPNVVFKDADLEIAIKGAANSIFFNHGQCCNAGSRLYIQQDIFDQVVEGVAAEAKKI KIGPGLDPSTEMGPLISDEQLDRVYGYMQSGFAEGAKAVTGGQLIGEQGYFVEPTVLV DTKQTMKVVQEEIFGPVVTAMPFREVDEVVPLANDSTYGLAAGIWTNDISKAHRLASK LRAGTVWINCYHIFDAALPFGGYKQSGWGRDMGHNALELYTEVKSVCVKL" gene complement(15630..15974) /locus_tag="DP116_00770" CDS complement(15630..15974) /locus_tag="DP116_00770" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00770" /translation="MTLSFSESLFLACFLVPCLWQGMHLVKALPQADNTVQLRIIVGW GASAGGGFPSLGVWRLSNVSAACPQDINQDFQNFVGFHYVQTPLANSCGDALATSRAL PAQRSGSPTYKI" gene complement(16064..17494) /locus_tag="DP116_00775" CDS complement(16064..17494) /locus_tag="DP116_00775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654467.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphogluconate dehydrogenase (NADP(+)-dependent, decarboxylating)" /protein_id="PRJNA477356:DP116_00775" /translation="MTQQSFGVIGLAVMGENIALNVERNGFPIAVYNRSRDKTDKFMA ERAQGRNAKAAFTLEEFVGSLERPRRILIMVQAGKPVDAVISQLKPLLDEGDIIIDGG NSWFEDTDRRTQELEPAGFRFIGMGVSGGEEGALNGPSLMPGGTQSSYQYLEPIFTKI AAQVDDGPCVTYIGPGGSGHYVKMVHNGIEYGDMQLIAEAYDLLKNAAGLDHNQLHEV FSQWNTTDELNSFLIEITSNIFPYIDPDTNLPLVDLIVDSAGQKGTGRWTVQTALELG VSIPTIIAAVNGRIISSYKQERVAASKVLTGPTGKYDGDTKEFINKVRDALYCSKICS YAQGMALLSKASQTYNWDLKLSELARIWKGGCIIRAGFLNKIKKAFNENPALPNLLLA PEFKQTILDRQTAWREVLVTAAKLGIPVPAFSASLDYFDSYRRDRLPQNLTQAQRDYF GAHTYERVDKPGAFHTEWVPITETSV" gene 18028..18801 /locus_tag="DP116_00780" CDS 18028..18801 /locus_tag="DP116_00780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00780" /translation="MLAYILALTVGLGSVALYIAAFFFPEIHRKNDFILSGVGLFYAL VLWIFARGITGGLLLGHVASVALLGWFGWQTLSLRRQLTPKIQQTQIPSTEVVKTNIQ QQVSQLSLPQKLAQLPKAIGSRFSGVKDRAQTAGKKTPEKSKRTQTPVPTAKPVVEIV DERIPVTDQPAVIPPATTDTEAKTPTASAPTEAKTETPPEAVPPNPSSPELLEAAQAH ETEEKTPPPIEEVAPRAALAPPAEAPPGQVPPKNQAEGS" gene 19254..19582 /locus_tag="DP116_00785" /pseudo CDS 19254..19582 /locus_tag="DP116_00785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138459.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS66 family transposase" gene 19563..19880 /locus_tag="DP116_00790" CDS 19563..19880 /locus_tag="DP116_00790" /inference="COORDINATES: protein motif:HMM:PF03050.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00790" /translation="MPPTRDRPYTAQPGYASLPRIGDSGNPILEELSLAVARSAPCSQ GEPRKARSRVELEQLLGTEYRGVNSSDDFSVYNAYCVASHQKCLAHLRRHFLRLMSAS WSG" gene 19880..20203 /locus_tag="DP116_00795" CDS 19880..20203 /locus_tag="DP116_00795" /inference="COORDINATES: protein motif:HMM:PF03050.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00795" /translation="MQRAGNAFVDLIDDVFDNYRQFQHSNDFQQYTSWAKQFKSKLSH ALNTWIPKASAFVLNLLSKLRTSMTAWWFFLDHPEVPPDNNLAERSLRLAVTKRKVSG GSRYN" gene 20417..20656 /locus_tag="DP116_00800" CDS 20417..20656 /locus_tag="DP116_00800" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00800" /translation="MVVFMIQLLRDHLRTLIWFAVGLICVLLLALGLPLTQLASTAQN PNHQSSAHAYTPKNPPGTPYWSAGVTRDSNHLPNF" gene 20797..22791 /locus_tag="DP116_00805" CDS 20797..22791 /locus_tag="DP116_00805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319561.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alkaline phosphatase" /protein_id="PRJNA477356:DP116_00805" /translation="MISWFEKHVRAIAWLTSCLLLFTTLLIEQPAFAQEVSGNGVNVI IMIGDGMGWEPVRAAAITNGGSFYTRGEGRGLNLQKLKGYTYATTYGTTIFDQKSGRF STGNSALDGTNNVTGQSNIRPGFSFKPLPFNPGTDLASGSGGATDPSLGNLTGWEVEK GGPNPWTPATPPPCDITVPVGKSINDVPNRFNCQAEYIKLSLPDSANTAFTLYTGVKS YNNAMVDIFEKPIETILQTARKQGKSTGLLTSVPISHATPGAAEASVNRRTKYDADYP TLDNIIQQSIRPDFNENPDRPDLEDIFLPTVLLGGGHPLDHDNTVNTPGQVGYKEPGT CNYVYIRASTYKELTGKTSLSDAEACKATASSNPNNRYGYRFLERSPNAAKLLLKTSR EIDPNKGERLLGLYGARGQEGNIPVSGADGDYSITGLANFARQSSLYNRTNDAYKRGD IPVNDTDRPLQPGETNEAFIAREVNENPTLGDLTQAALNVLGKDKDGFWLMVEGGDVD WGIHDNNIDNIIGTVLDFDKAVGVAMKWIERNGGWQKNVLIVTADHDHYLTLYPNFPE LLRTKGAKALTYGSETDSASVGHSFGSIPEDKYGWGSHTRRPVPVYYQGRPFKLDKYI GKGYKNYGFDVPGVPNAVDQVHIYKAMYEAITGKEPQDPS" gene 23019..24017 /locus_tag="DP116_00810" CDS 23019..24017 /locus_tag="DP116_00810" /inference="COORDINATES: protein motif:HMM:TIGR01297" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cation transporter" /protein_id="PRJNA477356:DP116_00810" /translation="MLSVKQHLNWSEIAFDRSCTCCDFSRSQNQPQKKIWRLWLVLAL LISVLPAEIGIGLWSHSLSLQADAGHLLSDVGALGLTLLASWLAGRPAAGRATFGHGR VEILAALINGLGLLLIAGFIAWEAIARFQNPEPILSLPLLLGAGLGLIVNGLNLTLLH KHSLDDLNLRAAFLHIVADAFSAVGILVAALVIYWLKWWWIDPVMSLLIACFAGLSAI PLIWSSLEVLLEYAPRFIKPTEVEAALQSFPGVRRVETLRIWAIGCGQTALCAHLRIE PLNAQQQDQLQWELQTHLVEAFGIHESTLQLSSGSVTNLVPLHPLLNRSLVSYIYK" gene 24216..27206 /locus_tag="DP116_00815" CDS 24216..27206 /locus_tag="DP116_00815" /EC_number="6.1.1.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995247.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="isoleucine--tRNA ligase" /protein_id="PRJNA477356:DP116_00815" /translation="MTETGKYKDTVNLPKTNFDMRANAVKREPEIQKFWEENKIYERL SQNNPGELFILHDGPPYANGSLHLGHALNKILKDIINRYQLLRGRKVRYVPGWDCHGL PIELKVLQNMKSAERQNLTPLQLRHKAKDFALSTVNEQRESFKRYGIWGDWEHPYLTL KPEYEAAQVGVFGQMVLKGYIYRGLKPVHWSPSSNTALAEAELEYPEGHTSRSIYVAF PMTGLSETVKPALGEFLTQLGVAIWTTTPWTIPANLAVAVNPDLKYAVVEVEPHPPTP SPQAGRGSEENASHSVQAGRESEEVASPSPLGERGTEGVRFRYLIVAADLVERLSEVL GTQLTVKTTVKGKDLEHSTYRHPLFDRESPIVIGGDYVTTESGTGLVHTAPGHGQEDY IVGQRYGLPILAPVDDNGNFTEEAGQFAGLNVLGDGNQAVIDALTASGSLLKEEAYVH KYPYDWRTKKPTIFRATEQWFASVEGFRDEALKAIATVKWIPAQGENRITPMVAERSD WCISRQRSWGVPIPVFYDEETSEPLLNEETISHVQAIFAEKGSDAWWELSVEELLPEK YRNNGRSYRKGTDTMDVWFDSGSSWAAVAEQRPELRYPADLYLEGSDQHRGWFQSSLL TSVAVNDCAPYKKVLTHGFILDEQGRKQSKSLGNTVEPKVVIEGGKNQKEEPAYGVDV LRLWVSSVDYTSDVPLGKNILKQLTDIRNKIRNTARFLLGNLHDFDPQKDAVPFEELP QLDRYMLHRMTEVFKEVTEAFESFQFFRFFQTIQNFCVVDLSNFYLDAAKDRLYISAP NAFRRRSCQTVLKIALENLARAIAPVLCHMAEDIWQFLPYKTPYKSVFEAGWVQLDEK WRNPELAVVWQQLRQVRTEVNKVLEEARVKKMIGSSLEAKVLLSVVDEQLRSTVKSLN ATQNGIDELRYLFITSQVELLDSPEAVQGLEYKLQSDAWEIGIVNAEGQKCDRCWNYS THVGESAEHPLICERCVSALAGEF" gene complement(27413..28336) /locus_tag="DP116_00820" CDS complement(27413..28336) /locus_tag="DP116_00820" /inference="COORDINATES: 
similar to AA sequence:RefSeq:WP_017317339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ParA family protein" /protein_id="PRJNA477356:DP116_00820" /translation="MGYVIATANMKGGVGKTTLTVNMATCLVKNHNKRVLVLDLDSQI SATLSLMSPVDFAKLRKNKRTLRYLIDDIINPSSRAKLTIRDIIQPQVCNLPGLDLLP GDIDLYDEFVVSQMLHQQAVSLGENEFETIWNRFERVLVGKILEPIRQEYDFIILDCA PGYNLLTRSALATSNYYILPAKPEPLSVVGIQLLERRIAQLKESHEHEAHIDIQMLGI IFTMSNANFLSGRYYKQVMQRVHQDFGDGKICQTQIPVDVNVAKAVDSFMPVVLNNSS TAGSKAFYQLTQELLQKLESAELQKQQKTNL" gene 28847..29671 /locus_tag="DP116_00825" CDS 28847..29671 /locus_tag="DP116_00825" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00825" /translation="MDNLESKIVEILKRGTPLRAVHIADMLGVERREINHYLYSSLKH MVVQDSDYRWSLKTNQISKTQRPSTQAPTSHTKSSQPVRQNNLYSFTKEEIKQDSPLR STPQTKSSQHAKPTSQTPSSQPSKHENLQKFSEKSFQQNGQPQQASTAQTPPLSQPVK QNNPYKVIKTELGQASPEEKVKIIENAFRQEQFRELEDEEINALQSILEQSRREIDIA NTAYTQGKLSTRKNNPIMIAILSVALTLSTLFLISQFIPNLTNQPSPTIPQTKSMQ" gene complement(29691..32303) /locus_tag="DP116_00830" CDS complement(29691..32303) /locus_tag="DP116_00830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454721.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division protein FtsK" /protein_id="PRJNA477356:DP116_00830" /translation="MQYLTQPTEIRAQIAKFASAKTLWLDTEIADWDTYYPRLSLIQV LAEPTNLTSDSAYILDVLNKPDLAAYFINQIIVNPQIEKVFHNASFDLKYLGGKQAPN VTCTLKLARKITREVLQVSNLQLKTLATELCQFSNVDKEEQGSDWGKRPLTQKQLHYA AMDTVYLAAVHRRLLEISNPNTINNIFDMVANGSKSRKKSEQLSLSATKVRVAFECPR LFYLNQRFGGNTLFLPPENPVGIGNTFHQLANDFVRFVSHEPQCTDLFQPTAAQLEIE EIASSLQQIFYEIKFYSYLQEAIKKDSSQAQALFKVWQGLQGLIKHFAQLLIINRRYC SAETLIRDTFITEERKLEYYFNLPDDTQQRIAGEFDCLVYNFEKKRLCVVEFKTYQPA DPSAQLAQVALYSYMLWQKKKVAVDSAVYCVLPEFKEYQYSWEQLENTVHQLIPYKLQ QMRQWLTWEPPLPNSPPLTTQPHLCKICPQQQKCQSYFVEQSQGEESSYEENQQKTVS QDEQTGNNHKQPFNPDEMGENLVKTLQSFGISVDYHGAAVGPAFVRVKLKPQLGVKVN SLLKLSADLQVQLGLANPPLISPQAGYVSIDLPRPDRQVAKFENYIKSQVLPATAPVK IAIGVNLEGQLIEADLSDPNTCHFLVGGTTGSGKSEYLRSLLLSLILRHSPAHLQIAL VDPKRVTFPEFERMQWLYSPVVKESDRAIELMQELVTEMESRYQRFEKAKCADLSTYN QRSPQPLPRIVCIFDEYADFMAEKESRTALEQSIKRLGAMARAAGIHLIIATQRPEAG IVTPIIRSNLPGRVALRTASEADSAIILGGKQTTAARLLGKGDLLYQMGAQLYRLQSL FATDIRLPLSTSHR" gene complement(32321..34396) /locus_tag="DP116_00835" CDS complement(32321..34396) /locus_tag="DP116_00835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_00835" /translation="MATIDQVIKKSLNPFDNPAARNFWEEQEPFPTVESIHQKQLIEI KSVIAQIAQDHETRSLILYGDVGTGKTHFLGQLKEQLNDQAFFVYIEPFSQSDRIWRH ILRYTVDSLVNAPAGQADSQLILWIKSCLSTIEKGLKSEQQKFIDRIKGFFGKTDTER DRQLFIDILKKTIGTTGIYNANEFFGVLYDLTNPDLYSLACEWLKGDDLDEESLKKLK VKQSIDDEDKARGILANFSKVSAKTQPIVLCFDQLDSIARLPDGFLDLQALFNVNSTI YNGRWKNFLIIISIRTENWYNNSKRVQPSDIDRVSIRIPLKRITLEEGEALLASRLYP LHNQANPKPTSLIYPLNRQVLEKVFLSRKATPRDFLTLGKQLFQDYKEWLFRDKQPPQ PKWFDGEVTVETEISWEVIKAEFELLWQQEYQKVQGKNTKIILLAAPDLIWMLQQALE ALQVQEIKPKLISGRYASYSLSYQQPGKRERVGVVWTEDSHMNSFFNMMTACQKAIQQ NLCQTMYLLRAGGVGKPNLVGNQLYQQIFTSTNHRHIKPNLSSVHYLATYHSFVKSVE ANELVLASKTITLQELQTLIRESKILEKCTLLQDLGILSKQETVPEDRNGKKDFRPVK DFLLNLIKTQGYMGVPTLMIQSVNQFSFVKEADVQLLIDLLCQEQKVTIINPKAKLQD QLICFIAKT" gene complement(34451..35302) /locus_tag="DP116_00840" CDS complement(34451..35302) /locus_tag="DP116_00840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197540.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcription factor RcaD" /protein_id="PRJNA477356:DP116_00840" /translation="MDTNELKFLLKLLGCPNYRSSLSASGFKAFKGKDKICQSLGDRE LVDYSREIASVKILPPGNALLKLESAQLPITDQEFKVLEKISKAAGKVSPSKIKVSSL KAPEKEAILKTLSERGLIDAESKIKKTKAEVWLTERALEYLREDYSPKGAATIRLDLL NNYLRFLRKSLHSKPEEVSTSAPTSRESAAVTIINLTDEEILETIRRLDRELGTDNYL PIFHLRQKLQPPLSREEVDKVLYRLEEADQIELSTLAEPRDYTPEQIDAGISQISGGS LFFITAI" gene complement(35627..36574) /locus_tag="DP116_00845" CDS complement(35627..36574) /locus_tag="DP116_00845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase family 2" /protein_id="PRJNA477356:DP116_00845" /translation="MPKVSVCIPTFNRVNLLPFAIESVLQQTYQDFELIVCDDGSQDE TAKLMSQYTDSRIKYIRHQQNIGKSNNMRSGFDAATGEYFIKFDDDDRLTSDFLSRTT AILDKNPNIDFVGTDHWVIDINNIRDDTKTQENSRRWGRANLPEGVVDNLQEVVFVNQ SFQIGATLFRRSTLQELGFMQPNWQNCEDNDLFVRLALAGKKGYYLPELLMEYRFHAQ QQGIDRAIPYLSDKLRYLESYQFEFEKLEKIRQQRLTETQLLLGLRLIEKGETQKGRK LVLAGKSFSPAKAWTGLGLSLLPVRLRSLVFDLVRRVKG" gene complement(36607..37635) /locus_tag="DP116_00850" CDS complement(36607..37635) /locus_tag="DP116_00850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007311449.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methionine synthase" /protein_id="PRJNA477356:DP116_00850" /translation="MEDFGVYTLANDIVYNQLVALLNSIEVNVSPNIPICIIPYDERL ELVKLEVSSRPNVTLFENKSSMQRWEDFYNQVWNAHPQARQLKQGHSRKWYKQSNLLR KMCAFDGNFKRFVFYDADSLAMTSLDRVLEKLDDYDFVFDDWEHGKSSPVAALNFSRI EKAISLPESQVKPLLHCSSFFGSKEGIFGKDELEMLRERLVVNQEFTWINERSWWCDA DLFSYMTFRCDHRPLFNFTLSPNGQDITGNCADADPFININNVLYNQDGLKPIHRLHY MNYPAIDFTRLCQGEDVDIRYKNEFLYYRFLKEPEKRPKQLKPPSIVVKTHRFMQKVK SRIERTIS" gene 38094..39317 /locus_tag="DP116_00855" CDS 38094..39317 /locus_tag="DP116_00855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase family 1" /protein_id="PRJNA477356:DP116_00855" /translation="MNILMLSSTFPYPPTRGGTQVRTFHLLKYLSQHHVITLVTQREP DVTDTEVVELRNCVDNLIVFNRPQDTSKSGRILKKIQRFYSFIQQGTPPSVLNRYSSE MQTWIDNFVEAGKCDVITCEHSVNEIYVRAHFQKHLRTIVNVHSSVYGTCRNQLKTGI SENRFRDKLYLPLLRRYEQRYCSKFSAIVVTTEEDRVQMQEFNPNSEIQVIPNGVDLV SFPYRTYDPNGHRIIFIGAMDNLANIDAVCFFSNEILPEIQKLYPDTTFDIVGSRPAP EVLALKEKPGITVTGRVPSMVEYLHKATVCVVPMRTGFGIKNKTLEAMAAGVPVVASE RGLEGLAVDGSNIPLRALRANQPTEYVTAIRQLFENSQLRAELSRNGRQFVESQFTWE SAGKRYEEVLTNTGL" gene complement(39332..41164) /locus_tag="DP116_00860" CDS complement(39332..41164) /locus_tag="DP116_00860" /inference="COORDINATES: protein motif:HMM:PF13229.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00860" /translation="MKNHKLKRFLFPRQNGTAVPAWQIFGLFMLIFFSISSQTQPGLA SLTSNLFTKLSSFVMVSPNSEQNNFANQRRTLYVSPDGKGNACTSTAPCTLTSAQQYV RSINTSMKTDITVILKNGTYYLSDTFTLGTQDSGFNGHSIIFRAQTPGKVELSGGKVI TGWQTQDGKVFQATVDNQDFRRLFINGVPAIRARQPNTGTYFRVVNWDIPNKRVEVNP SEINRWKKLSEAEMIVFRHWTINRFKVEDFTINGNIASVALQNPGRDLAFMGNTQFLE PELSYYFENAYEFLDDKGEFYLDKEAHTLYYIPRPGEDMSRTTVIAPRLEHILKIAGT AEKPAHNIVFQGIVFQDTNWTAPSQKGFIGGQAGTEVSTKHVWGDTTMIAGVELSYAG NVSFEKCTFRNMSASGVNASTDVENLSIRDSHFENLGGQGIVMDTLLHSEPATATIRN VLINNNTLTALGQDYPNSVGIFAGFVENMIVENNKLWNLPYTGISIGWGWTKDIHRVK SNTIRNNEIYNVMHTLDDGAGIYTLSNQYNTLISDNYIHHVVRSPWAASYPISGVYLD QASGGITVTRNKIDNVLMPIYTHQIQNNNIIEGNWPNAVTDNKS" gene complement(41349..41546) /locus_tag="DP116_00865" CDS complement(41349..41546) /locus_tag="DP116_00865" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00865" /translation="MVGWVKRSATQQQTFCVGFPGVKRQTPTEEPVRSWGLPKWSIWR WKPFCMASAPQPTQFKGFGSN" gene 42334..42618 /locus_tag="DP116_00870" CDS 42334..42618 /locus_tag="DP116_00870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130968.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00870" /translation="MSKSSQLPQANQLSQFSSLELAQALMEKLSISPNDWHRLKSNRN VRASEQVAAALVFLLKDEPQEALLRLQQAALWLDRSISAPPCPTHGEQKR" gene complement(42683..43138) /locus_tag="DP116_00875" CDS complement(42683..43138) /locus_tag="DP116_00875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951442.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_00875" /translation="MITISLRPTGRNWGTISFASTLYLCPILDLLLAEIPGKLQAELR LGLQEALVNAAKHGNNLDPGKRVVVRFSLIDNQYWWVISDQGSGFTPCFTSDVDIDPT EYLPADESENGRGMSILHQIFDQVQWNSKGTELRLCKQIENRNRLALRR" gene complement(43535..44932) /locus_tag="DP116_00880" CDS complement(43535..44932) /locus_tag="DP116_00880" /EC_number="2.1.1.190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407636.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD" /protein_id="PRJNA477356:DP116_00880" /translation="MTKKNWKQGELIDVEIVDLNDAGDGVGRWEERVVFVSDTVPGDR AVVFLVHVKPNYAHGKLKQLLKSSPYRVRPSCIVADKCGGCQWQHIDYQYQLLAKQNQ VIQALERIGGFVQPPVDPVLSTPECLGYRNKATYPVGLSATFTVQAGYYQKGSHQLIN LNQCPVQDPRLNPLLLEVKQDIQKRGWEVYNEQHHTGVVRHLSLRIGRRTGEILLTLV VKDWNLPGIQDQALEWLKRYPQLVGVCLNRNPNRTNAIFGRETHCIAGHSYLREKFAG LEFQVRPDTFFQVHTETAEALLQVVQSELNLQGSEVLLDTYCGIGTLTLPLAKQVRKA MGLELQPEAVEQAILNAKQNDISNVIFQTGAVEKLLPQIEILPDIVLLDPPRKGCERT VIETLLKFQPPRIVYISCKVATLARDLKLLCENGVYNLTRIQPADFFGQTAHVEAAAF LVLSQSDKGTNSFAT" gene 45220..45705 /locus_tag="DP116_00885" CDS 45220..45705 /locus_tag="DP116_00885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194619.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="allophycocyanin" /protein_id="PRJNA477356:DP116_00885" /translation="MTVVSQVILKADDELRYPSSGELNNIKDYLQTGEQRIRIVSTLA ENEKKIVQEATKQLWQKRPDFIAPGGNAYGDKQRALCVRDYGWYLRLITYGILAGDKQ PIEDIGLIGVREMYNSLGVPVPGMVEAINCLKKASLNLLNAEDAAEAAPYFDYIIQAM S" gene 47194..49560 /locus_tag="DP116_00890" CDS 47194..49560 /locus_tag="DP116_00890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315936.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine synthase subunit PurL" /protein_id="PRJNA477356:DP116_00890" /translation="MIPISSAPFSPEEIAAVGLKLEEYEEIVMRLGRHPNKAELGMFG VMWSEHCCYKNSRPLLKQFPTTGPRILVGPGENAGVVDLGDGLRLAFKIESHNHPSAV EPFQGAATGVGGILRDIFTMGARPIAVLNSLRFGSLDDAKTQRLFSGVVAGISHYGNC VGVPTVGGEVYFDPAYSDNPLVNVMALGLMETPEIVKSGASGIGNPVLYVGSTTGRDG MGGASFASAELSDESMDDRPAVQVGDPFLEKSLIEACLEAFKTGAVVAAQDMGAAGIT CSTSEMAAKGDVGIEFDLDKIPVREAGMVPYEYLLSESQERMLFVAHKGREQELIDIF HRWGLHAVVAGSVIAEPIVRILFQGEVAAEIPATALAENTPLYHRELLAQPPEYVRKA WEWTPDSLPKSTFAGIEIQGRLQSWNDILLTLLNTPTIASKRWVYRQYDHQVQNNTVL LPGGADASVIRLRPQEVEVQGKVSNTQSGVAATVDCNPRYVYLDPYEGAKAVVAEAAR NLSCVGAEPLAVTDNLNFGSPEKPIGYWQLAEACRGLAEGCREMTTPVTGGNVSLYNE TLDSQGKPQPIYPTPVVGMVGLIPDLTKICGQAWQVNGDVIYLLGELSSHSTLGGSEY LATIHNTVAGIPPRVDFELERQVQQVCREGIRKGWVRSAHDCAEGGVAVALAECCITG KFGADIQLELPADNNQRWDEVLFGEGGARIIVSVGIEQQETWESLLREQLGDHWQKLG TVGNSEIGLRVLTTGNHTLIKVTIEEMSDRYLNAIERRLNIHNTTPNS" gene 49709..51205 /locus_tag="DP116_00895" CDS 49709..51205 /locus_tag="DP116_00895" /EC_number="2.4.2.14" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315935.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amidophosphoribosyltransferase" /protein_id="PRJNA477356:DP116_00895" /translation="MISCQPDSLDDSQLPPNSANDHENRPDKPEEACGVFGVYAPGED VAKLTYFGLYALQHRGQESAGIATFEGEQVNLHKDMGLVSQVFNESTLRNLPGTLAVG HTRYSTTGSSRKVNAQPAVVETRLGSVALAHNGNLVNTVALREELLKNNCNLVSSTDS EMIAFAIAEEINAGADWQEGCMRAFHRCQGAFSLVIGTSKGIMGVRDPNGIRPLVIGT LGDNPVRYVLSSETCGLDIIGAQYLRDVEPGELVWITEEGLTSFQWSQKSKRKLCIFE MIYFARPDSVMHDESLYSYRMRIGRRLAKESPADADLVIGVPDSGIPAAIGYSQASGI PYAEGLIKNRYVGRTFIQPTQSMRESGIRMKLNPLKDVLFGKRVIIVDDSIVRGTTSR KLVKTLREAGALEVHMRISSPPVTHPCFYGIDTDTQDQLIAATSSVEDIAKLLEVDSL AYLTREGMLQSTREDPESFCSACFTGDYPVSVPEQVKRSKLILEKVSV" gene complement(51247..51528) /locus_tag="DP116_00900" CDS complement(51247..51528) /locus_tag="DP116_00900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878522.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00900" /translation="MFSEVYYLVRSKANGDYLTARPNAEANGYLLLFREHFDALSYLN THAGEVANRFAVEFVPGSQVGGLLKRWGFSGVGIVSDPLLPKIEFLQQS" gene complement(51625..52671) /locus_tag="DP116_00905" CDS complement(51625..52671) /locus_tag="DP116_00905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_00905" /translation="MTTTRISLDASRDWLLKLVTSESFIYVIKRVLQALLTLFLASAL SFIIIQLAPGDYVDTLRQNPKISPERIEELRRQFGLDKSWPEQFGLWVWRIITQGDFG TSFVYQRSVASLLWERIPATLLLAVSSLIVTWAIAIPLGIVAAVKQNQWVDRILQVIS YAGQGFPSFITALLLLIFAQNASPIFPVGDMTSINHTELNWFGRILDIGWHMILPTIA LSVTSFAGLQRITRGELLDVLRQDYIQTARAKGLPENRVIYVHALRNAINPLITLLGF ELASLLGGAFIAEFFFNWPGLGRLTLQAVQAQDLYLVMASLVMSAVLLSVGNLIADLL LKASDPRIRLENLN" gene complement(52862..53380) /locus_tag="DP116_00910" CDS complement(52862..53380) /locus_tag="DP116_00910" /EC_number="2.4.2.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321502.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenine phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_00910" /translation="MNLKSLIRDIPDFPKPGILFRDITTLLRDPEGLRYTIDLFAHKV KDAGLRPEYVVGIESRGFIVGAPLAYQLGTGFIPVRKPGKLPAAVHSIEYALEYGTDG LEVHQDALHPGSRVLIIDDLLATGGTAGATAKLVEKIGCKLEGFGFIIELQDLQGRKH LPDVPILTLVEY" gene 53770..54396 /locus_tag="DP116_00915" CDS 53770..54396 /locus_tag="DP116_00915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457298.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00915" /translation="MNISASITPINSPTPQSVPMILDSLPEPVVEGQGCPRRARLQID LILLAIEALELGGSEAILAFAEELDLKGIVKNRVNLWRMRSTNPMRRAHIRRPLSIME AKALVVIGCYIARRLTVVIRQLLAIHQQMVEKQLPLEQNLRLSNYLERFRSHFKSRMN SRRSGVLALNSDEKLDELAISLLEQLLFCTGTAGMQRFWISLFDGEVE" gene 54393..55883 /locus_tag="DP116_00920" CDS 54393..55883 /locus_tag="DP116_00920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00920" /translation="MNIQRKYSLPNCTLLLEGLSDASRAAHFQELRPELSILVNAECY LSGYTRPLSGGREFFESLVRGVSAYAQEFLSSVPHAEAHNSESELVQFQKIGNNQHRL IVHSDAADEMESYSNNGNRRIQVDLNTVQFFDLVEAIDQFFADSQTLPELTLQLQPVA RRHGGVSQAVIKQAVPASVGVTSLAAAAVAFTMIPTPQVRSPQLNTQQDVNRTTNVNS PATASITPTPTANEQIAANPRFTPTTASSSVVLNSTAATPTPGAAPVVKDLEVLLNTS GEITEASQLRALNRELYNQINPRWAKRSGLNEDLVFRVGVGADGGIVGYKAVNKQAND AVENTPLANLLSNPANRTSSGNEAIAQYRVVFTKAGVLQVSPWRGYTKTPDVVGTKIT DPNAVKDLNQKLYNTIRQNWSTSSAFARDLRYRVAVTKDGVIADYEPLNQPAVDYYRA TPLPKMFQDVYGSNLAPPNNKEPLAHYQVMFKPDGSLEVLPWRGYR" gene complement(55958..58228) /locus_tag="DP116_00925" CDS complement(55958..58228) /locus_tag="DP116_00925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858744.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="penicillin-binding protein" /protein_id="PRJNA477356:DP116_00925" /translation="MSSPQPPQKPQTLLGQITQAVQTIQARVDFSKLALKPNAKVPEL LVQDAGADKAEVYPLLGDRYMLGRSSKSCDIVVRNPVVSQIHLSLSRDSAKRESVFVI KDENSTNGIYRGKRRVNSLELRHGDILTLGPPELAASVRLQYIDPPPWYLRVATWAAY GVGGATALMALVIGVESLKVSVKPLPGATRAPVVVYARDGVTKLREPRTTSHIDMKQL KDFGPFLPTAVVASEDSRYYWHFGVDPLGILRAVLINSKSGDIQQGASTVTQQVARSL FRDYVGRQDSFGRKFREAVVALKLETFYSKDYVLLTYLNRVFLGADTSGFEDASRYYF DKSAKELTLSEAATLVAILPAPNGFDFCGRDSQNKLTTIEYRNRVLKRMLEMGKIKQD EYNRARRSPIEISSKVCDQQLKTTAPYFYSYVFQELEAILGKELATEGNFIIETQLDP AIQKQAEEALRKHISNAGSSFGFSQGAMLTLDSSNGAILAMIGGKDYKSSQFNRAVQA KRQPGSTFKVFTYTTAIEQGISPGNSYSCSPFRWKGFTYKPCRTSAGSLDLATGLALS ENPVALRIAKEVGLDKIVTTAKRLGVKSDLDPVPGLVLGQSVVNVMEMTGAFAAIGNG GVWNRPHAISRILDSSDCRDRNDLKTCRVMYSYDQDPEANKRVLPINIADTMIRLLRG VVTNGTGRSAAIGLGEAGKTGTTDKNVDLWFIGFIPSRRLATGIWLGNDNNAPTSGSS AQAAELWGNYMGKITK" gene 58678..59112 /locus_tag="DP116_00930" CDS 58678..59112 /locus_tag="DP116_00930" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene 
prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00930" /translation="MNGKYLTLIGAALTLALTSNVAVAESAKFLIGQSSGGSSSGSSG EIKLSPRVNKKKFCNDYPLNSRCQEGSASTSSPSDSSTETKKKPSKSTPGGTATPGLA PADPGTTNTPGGTNEIAPPPAGNPANPGTTTPSGSPGSGTRK" gene complement(59349..59822) /locus_tag="DP116_00935" CDS complement(59349..59822) /locus_tag="DP116_00935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipoprotein signal peptidase" /protein_id="PRJNA477356:DP116_00935" /translation="MSVKNLLFWIAAFLAFLLDQLTKYWVVQTFNSGQTQALLPGIFH FTYVTNTGAAFSLLTGKVEWLRWLSLGVSLALIALAWFTVLHFWDQLGYGLILGGAIG NGIDRFVHGYVVDFLDFRLINFPVFNLADVFINIGIICLLIATFQKTPASHRRSR" gene complement(60076..60663) /locus_tag="DP116_00940" CDS complement(60076..60663) /locus_tag="DP116_00940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315926.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="biotin transporter BioY" /protein_id="PRJNA477356:DP116_00940" /translation="MLAASNQLLWSMIGLLLTMGGTFLEAYVATSPLSWSQYGIHAFS LGVTYQIGGVLLAGCLGGKNAGALSQIAYLVMGLTLLPVFADGGGIGYVKVSQFGYLL GFIPGAWICGFFAFKARPRLETLAFSCFCGLLTVHLCGITYLILSYVFQWKGTETLSL MQAMLRYSWFALPGQLAVVCAVVVIAYVLRHLMFY" BASE COUNT 17479 a 13038 c 13257 g 16965 t 10 others ORIGIN 1 gcttttagtt gggcggcgaa gcaacaaaat gccttgggtg cactcgtgat taccgcgagt 61 cataatccag gcaaatattt gggattgaaa gtcaaaagtg cttttggtgg ttctgcgccg 121 ccagaagtca ccaagcagat agaagcactt ttattacaag cattaccacc tgcatcaact 181 ccaggcaaga tagagaagtt caacccctgg ggcagttatt gtgaagcact cgaaggtaaa 241 gtcaagattg agaaaatccg cgatgcgatc gccgcaggca aactcacagt atttgctgat 301 gtgatgcatg gcgcggctgc tggtggattg gcaatgctac ttggcaatga gatcaaagaa 361 attaacagca accgtgatcc gctgtttggt ggcggcgcac cagaaccatt acccaaatac 421 ctttcgcatc tgtttgaggt gatgcaaaat caccaaaaaa cgaatcaagg aggtttagcg 481 gtagggttgg tgtttgacgg ggacagcgat cgcattgctg ctgttgatgg agacagcaat 541 ttcctgagtt cacaaatctt aatcccaata ttaatcgacc acttaaccct acgccgcgat 601 ttcaaaggcg aaatagtcaa aactgtgagt ggttctgact tgattcctcg cctagcagca 661 gtgcacaact tgtcagtttt tgaaacaccc gttggttata aatacattgc tgacagaatg 721 ttagaagcac aagtgttgtt gggtggcgaa gagtcgggag gaattggcta cggaagccat 781 atacctgaac gggatgcact tttatcagca ttgtacgtac tagaagctat cgtggaatct 841 gggctggatt taggtgatta ttatcgtcaa ttgcaagaac aaacagattt cacttccaca 901 tatgatcgca tcgatttacc cctcgccagc atggaagtgc gatcgcgtct tttacaacaa 961 ctgcaaaccc aacctttaac agaaatagct ggaaaaccag tcattgactg tcagacaatt 1021 gatggctaca agttccgctt agccgacaac agttggttaa tgattcgttt tagcggtacc 1081 gaacccgttt tacgcctgta ctgcgaagcc cccacactcc aacaggtaca tcaaactcta 1141 gcttgggcaa aagagtgggc agagtaatat cagtgatcag tgagccagcg cgaatgacgg 1201 ctctccctcc gtaggcgact ggtgagacag cgcgaatgac ggctctccct ccgtaggcga 1261 ctgcgttcgc gaagcgtctc cgaaggagat acccgaaggg cgaacccgaa gggttatcag 1321 ttgtcagtta tcactgttaa ctgttaactg ttaactgtta actgctaact gacttatgac 1381 
attactcgta gtagcgacag gaaatccagg taagttaaaa gaaatgcaag cttacctcac 1441 tgattctggt tgggaattaa ctctcaaacc tgaggaattg gaaatcgagg aaacaggaga 1501 cacctttagt gccaacgcct gcctaaaagc atcccaagtg gcacttgcaa cgggaaactg 1561 ggcgctcgcc gatgattctg gcttgcaagt agatgccctt gatggtgcac caggagtgta 1621 ttctgcacgt tatggcaaaa ctgatgaaga gcgaattgcc agagtgttga gagaattagg 1681 cgatacacca aatcggcaag cccaatttgt ttgtgtggtg gcgatcgctc atccagatgg 1741 tacaattgct ctacaatcag aagggatttg tcctggtgaa atcctctacg cgcctcgtgg 1801 aagtggcggt ttcggctatg atccaatttt ttacgttcaa gaaaagcaat tgacatttgc 1861 tgagatgaca ccagagttga agcgttcagt tagccataga ggcaaagctt ttgcatcttt 1921 gcttcaagag ttgcctcatt taaacaacac tcattgttag ttgttagttg ttagttgtta 1981 gttaaatttc tacctaaata cgaacgacta acgactaacc actaacttct aactatgaaa 2041 tgagtgaaca gtgaacagtt atctgttggt tttggacaac aagtactgat aactgataac 2101 tgttaaacag cttctgtatt cttttcccca gtacgaattc gcacaacttg ttcaactggc 2161 gagatgaaaa ttttgccatc accgatttca ccagtgcgag cagctgcgat aattttatca 2221 acaaccatat caacttggtt gtcttcaaca actatttcca ctttcagttt ttgcaaaaat 2281 tccacagtgt actcagaacc gcgatagcgt tcggtttgac ctttttggcg tccaaaaccc 2341 cgcacctctg aaacagtcat accaacgata ccagcattaa cgagggcaat tttcacctca 2401 tcaagcttaa agggacggat aatagcctct acttttctca tttattttct cccgacttct 2461 acgtttatct atcaaagcag atccagtgtt taacactttc acgcttgctg ttacacattg 2521 cagatttcac ttatgagatt tgttaagatt caagagttat tcttaacctt ggacagtctt 2581 aacaaagaaa actgcaatat aagatgctca tcagctagag agcgtaacct caaataaata 2641 tttaatcact tcttgtcaca cgacttaacg gtagatgacc tagttatcaa agaaaagtaa 2701 aactagtaaa ccaaatccgg atggtcttcc ttttccatta gtcagcacaa gaagcataaa 2761 tatcgtttag ccacacctgt gccagagtaa aaagcagact taaaaagcgg atttggtatt 2821 agcattatct accttaatca tttcttaagt tattttctga ggtttatacg ttcaatttat 2881 cattttttcg tagcttattt aacttgcaaa caaagatgaa ttttgatctg taacaggtta 2941 agttgaagaa tgatttgact taacgactat gactttgaaa taaaggacgt acaatcaaat 3001 acccagcgct aaaaccaaga gcaataatta acaagatacc tattattaac caccagcccc 3061 taattcgtaa 
aggctctgat tccttgtaag gatcatagct gacttcctgt tcttctataa 3121 agactggttc tggaatggaa cttggaattt ccatgttttg ggaaaaaaca ggtgtctggg 3181 aatttcctac ataaaaagca cggggacgag aaatttgctc tctagaagct ggttcagtaa 3241 taggtggttt tgtttgagtt ctagaatcag aagaagcttg ggaaacttga tgaaaattaa 3301 tcttggcttg tgaatcaacc accttttgca agcgcaaaac agactcaaca gctttggcaa 3361 tttcttgccg aagtaattga ttttcttgca cgacttgttg attttgagca ttgagagtat 3421 ctaactttgc ttggacagtt tgcaactctg ttgccaaatc ccgatagaca gaaagcggta 3481 cagaaggaga ataggcttgt gtatttgctg ctggtgtact gctgtgaaca gaacttgaga 3541 ttgttcgcat tggtaaattt caatcaaaat ttaacgtcag tcagctaaaa cgaatttcct 3601 accagagata atcagtctgg tgaaagtatg cccaagaatc atagaaagat attcttgagg 3661 ttaaaaatga ttaatttaca atttagaagg agaaagatac aagtttatca gtaataggat 3721 ataaatttct aaagtatgag atgaaaaagg aatgttggta gaaaagaagt ttaagcgatt 3781 aagcaacttg cgtcattcct tattttgtaa ctcccttact tattcaacag tggaaagaag 3841 tgtcccacag gaccttgtcc tttgccaatg tctaaggcgt aagaaagcgc tgaagtgaca 3901 tactcctttg cctgccgtac tgctgttatc aagtcgcatc ctcgtgcaag atgggcggcg 3961 atcgccgcag agagagtaca accagtaccg tgagaatttt ttgtttctac gtgttttgtt 4021 gtcaaagttt ccatcttgtg tccgtcgaac cagatatcaa caccacgtaa atttccttgc 4081 atacctccac ccttgactaa aacaaccttt gcccgtacat tcctgtgaat catttgagct 4141 gctgctctca tgtcatctag tgaattaatt ggtaaactgc ttaaaatctg agcttcatag 4201 cgatttggtg tcacaatagc agccttagga atgagcaaat gtcgaagagt gttcacagca 4261 tcattatcta gtaattgcgc tcctgtgcga gagaccatca caggatcgac gactaggtta 4321 tgaatttgta aagcctccac ctgttcagca acagcggtga taatttcctt attaagtagc 4381 attcccgttt tcaccgcttg aacttctata tcttcataca cggtgcggat ttgtgccaca 4441 acagcctctg gtagtatagg atcaacccgc atcacgccca aagtattttg tgctgtcacg 4501 caggtaatag cgcttgtacc gtgaacacag tgaaaggcaa aagtgcgtaa atctgcttga 4561 ataccagcac caccaccgct atcagatccg gcgatagtta aagcaacggg caccttggat 4621 gttgtttcgg agttcatagg ttaggggagg gagagaggga gagagggagg gagagaggga 4681 aagagggaaa cttccctact tccccatctg gtgttaatgg cagttgtttt tcgtttttga 4741 acgccagtcg cctcaacggg 
gggaaccccc cttcgggttc gccagtcacc tacggcggga 4801 aacccgcctg cagtgctggt ctcaccgcac ggcgctggct cctttagact tttaactttc 4861 aaggctttgc aattgacttt gcaaatattt gccccattct gcggctaagg gaatctctgt 4921 aaaaggaaca cgtactgttt tgtctggttc agaaaataaa aattccaact caatagtacg 4981 acctttttcg ggtggttttt ctacttctac ttgtaagaag tctaccaaaa gggtgatact 5041 ttgcacatca agcaaagaaa atgtttctaa cttgattggt ccttttgggg ttggtttgcc 5101 ccaagttata ttgttgcctt tgtgacccaa aaccgcgtaa atatcatact tagcttgttc 5161 aaattgttct gcccaagtgc ggtatgcttc tatcttttga tactcctggg aaccttgcca 5221 agccaaccaa aaaaatgcga ctaacagggg taaccataaa agaccacgtt ccatgtttga 5281 aaaattgtcc tccagtttgt ataaaaatcc taagtttact accagatgca ccactaaagg 5341 gcttgtatga gttatggtag tgagcaaact ggcaaaaagc taatggtttg aaatgagaaa 5401 gctgatatta ttttgcttat ttgtcatcgg gctgagcgtt gccttgttta attttattaa 5461 ttttcaagga ctagcagcca aaggggagtt tgagacaata gtgctaaatt ttcgggaaga 5521 tattcccaaa gaggtattgc agcaagacct gcaagctatt gctcaacagt acaatgtcac 5581 tcccgaacta gataatcaat attgggccaa ggagaatgtt tacatcatca agggcgataa 5641 gcagcgactg aaagaattga aaaagtcgcc atttgcccaa gtcatggatt ttattgaacc 5701 aaattatatc tacaaaattc caaaaccagg taaagcaact tggctgggag aattattaga 5761 tccctcgcaa gaggctgacg aagcgaaacc aacattcaaa ggtcccaacg acccctatta 5821 cagcaagcag tggaatctcc acaatattca tgttgaaggc gcgtggactc aaaccaaagg 5881 caaagacatt acggttgcag tgattgatac tggtgtcact cgcgtgcgtg atctcataga 5941 gacagaattt gtccctggct acgactttgt taacgataga gtagaagcca aggacgacaa 6001 tggacatggt actcacgttg ctggtacgat tgcccaatca actaacaaca gctatggcgt 6061 agctggaatt gcctacgaag ccaagctgat gcctctcaag gttctgagtg agtatggggg 6121 tggtactgtt gctgatatcg cagcagcgat taagtttgct gctgataacg gtgcagatgt 6181 gattaatatg agcttaggcg gtggcggtga aagtcacctg atgaaagatg caattgacta 6241 tgcccataga aaaggagtga tcatcattgc tgctgctggt aatgaaaatc agaactctgc 6301 tgcttaccca gcgcgttatc cccatgttgt tggcgtttcg gcatttggtc cagatggcga 6361 taaagcaccc tattctaact ttggtgctgg agtagatatc tccgcccctg gtggtagcga 6421 tgcaggcaag attttgcaag aaaccatcga 
ccctgataac aaaggcacag cagtgtttat 6481 gggcttccag gggacaagca tggcttcgcc ccacgttgct ggtgttgctg cattagtaaa 6541 agcttctggt gtgaaagaac cagaccaaat cctcgaagtc ctcaagcagt cggcacgaag 6601 tgttcaagat gacagtttga actactacgg cgctggacaa ctcaacgcag aagcagcagt 6661 gcaattagcc actcaagggc aaatcacttt ccaagacttc ttccgttggc tacgggatag 6721 cggctatctc aaccccggtt tttggattga tggcggtgct atagcgctgt tacccaagat 6781 tttgatggtc gtcggttcct atctcctcgc ttggttttta cgagtttact tcccgttccg 6841 ctggggctgg aatttatcct ggggattgat cactggtagt tccggattat tttttctcaa 6901 gggaatctat atttttgacg ttccgcaatg gcctttccgt gtgttgggca gttctattcc 6961 ggaattaggc aatacccttc aaggaacaga tgcttttaac ccactgtttg ccagcgtatt 7021 aattccctta ggcttaatgg gactcttgct aagtcatcct aaatggaagt ggtttgccat 7081 tggttcatca ctaggtgtag cagcgtgttt agcggtaagt gcagtattag accctgcagt 7141 ttggggattg ggcagtagta tcttagcccg tatgttcctc attgttaatg ccgtgttatg 7201 tttcggttta gctcgcttag cagtcaaaaa cgaagaacaa ccagcttaaa caccagatgg 7261 ggaagtgagg aagttttcct ccctccctcc ctcatcccct cagtaattag gagcagtgtt 7321 acgatatgag catcacagtt acaggtacca ttgaacgccg tgacattggt actggcgctt 7381 gggcgctagt tactgataag ggtgatacat acgaaatcct tagaggagta gataaaaatt 7441 tactcaaaca aggacaaaaa gcaaaagtca cgggacttgt gcgtgaagat gttctcactg 7501 ctgcaatgat tggtccagtg ctagaagtaa aatcttttga agtgattaac tctccttaga 7561 tgctggtgaa ttgagaactt cttgcaagcg acttctaaat tcgcctttgg gatgaccacc 7621 tttgacctcg cccaaaatct gaaattctcc ttctggcgaa tcacaaataa tgtaggtggg 7681 ccatcccatc tgtgacttat cagggtattg agtcaacaga atctgacgat acttccgata 7741 agtcgccgta tcctgcatct tgacgtcaat aaattgcaaa cccaattcct ctgtcacctt 7801 tttgtcatag aaagacatct tgtggcagat accgcagtct tcggaagaaa acttgatgac 7861 agctaaattc atagatttac ttaactccat aagaagcttc ccactataac ggtactccgt 7921 ttaacggtgc tcgactcgaa tagtgatttt tagaatcccc cctacttgcc ccaagggtta 7981 gctgtggcag gatatcaaaa ttctcgatct attaacactg gcttgtggaa taacatatta 8041 tagaagaata ccaaatatta ccagtcagcg agatcctagt taggtgcaaa ccgcgaaata 8101 tcgctgattt cgtgtcaaaa aaaataaatc cccagcagcg 
agtagccaac tagggaagtg 8161 agttatcagg gtgcatctac caatactatg ttctatgaat aactagggtg ttccggannn 8221 nnnnnnntta tcagggtgca tctaccaata ctatgttcta tgaataacta gggtgttccg 8281 gacaaaaact accaaatgat cgctactttt gtgtcaaaaa aaataaatcc ccagcagcga 8341 gtagccaact agggaagtga gttatcaggg tgcatctata tgtatgatgt tttataaaat 8401 actcaatatt atacttgtct gatttgtaaa aaataaataa aatccagtaa gtggaagccg 8461 actaaaaaag tgagttatca gggtggattg accagttgta tgttttatga ataactaagt 8521 gtttgtcaca aaaagcgcga aaatgagagt gatagttgtg tcagtcagaa aaaatcccct 8581 agcagccgaa agtccactag ggggtcagtt atcagggtgc atctaccaat cctatgttct 8641 ataaagaacg gcttgaaccg gataaattct gcaaaataga ggtaaaaaat attttgcttt 8701 tatctaccat gaaaagaaag cagagggcaa aagaaataag gattttcatc gcgtgattct 8761 gtagtgaaag ttcaagaccc actaaaaata aaaaaaaccc caagtagccg aaagtccact 8821 aggggtgtca gttatcaggg tgcatctacc aatactatgt tctagagata acgactttct 8881 ccggatgatt ttgtatatca gaaataattg agattttttg taaattatca caaaaactca 8941 gagggtgtga acaaatgagc taattatcac gagaaaatta atttggatgt agaggtgctg 9001 ggaaattatt ttcccaagag acccgtaacc ccctactcct agcatgaaaa tgaaatcacc 9061 aaagacagaa aaactcccaa aaggtgttaa cctacatggg agtcaagatc aaggtgcatc 9121 taccagtaag aaagttctgg gctaaactga caagatccgg ataaagttat gaaagaagat 9181 tggttccgaa tttaaaattg tagtacaaga actcaatcta tgggatgtac ttttatgaac 9241 aaataatttt ttctctgcga taccccagag gtttcagcaa ttcaccgaga cttttaacct 9301 agtcggagag aagacagcat gaatgatgac tcgtgggtga tgaatgctta cagctcacac 9361 gctgatgagc aaaacaacac gaatcagaaa caatcagtgc caggatgcat ctaaggtata 9421 agaagaagca aaacatcgac ttgaaacttt cgggtttcaa accagattta ggaggcaaac 9481 aggagcagtt gtgacccagg aatttcacat ttccgtaacc ccagtagggc aaaatgacta 9541 cttggtgcgc acggaacaag tcgcgcctgg ggtgccatta gcagaagaac tggtgacttg 9601 gcctgtggct gattggttga cggctgcggg acatttgatg aatgacccgt tgaagtcggt 9661 gctgcaggga gatgcgatcg caagaaactc cgtcaacttg gtggcgttgg gtcagcaact 9721 ttataatgca ctgtttcaag gcactctcag agatagttgg attacggcac aaggaatagc 9781 ccaaaaccac caacaagtgt tgcgtttgcg gttagggctg aaggatacta 
aattagcacg 9841 tcttccttgg gaagtcatgc atgcaggcga tcgcccgatt gccacaggac cctatatcgc 9901 cttttctcgc taccaaaatg gaattggtag aacgtctcgt ctaccgtcaa caggtatgcc 9961 agcacctgca gaggaaggtg ggctaaaggt attgatggca attgcttctc ccacagatct 10021 ggtgcgtcta gatctcctca aacaagaagc cattaaactc caagccgaac ttcacccagg 10081 cgaaagcggc aattatctcc cgcacaatat agaactgacc ctgttagaac aaccagggcg 10141 agaagagttg acacaagcgt tagaacaagg acgataccat gttctgcact actccggtca 10201 cagtaacata ggtcctaacg gcggagaaat ttatcttgtc agtagcagaa ctggtttaag 10261 ggaaaccctc agtggtgacg acttagcggg cttgttggtc aacaataaca tccaaatggc 10321 agtatttaac tcctgcctgg gaacatatgc agcgacctcc tctggtggcg tcagagacac 10381 aggggaacgt aatttgacag aaagcttggt gaggcgcggc atcaggagtg ttttggcgat 10441 gtcagaacgt attcctgatg aagtggcact actgttaaca cagttatttt accgcaacct 10501 ttctcacgga tatcctgtgg atttgtgcgt gagtcggatg cgtcaaggat tgattgctgc 10561 ctatggttct caccaactat actgggcact accaactctt tatatccagc caggatttga 10621 cggctatctt agtccacaga tatcgttacc tcaaggtgag gagttattta acgagtacaa 10681 tccttctcta aagacgtcag caataattta ctcagatcag gcaaacgacg ccagtatgcc 10741 tttacccctt gaggatatgt taccctcatc tttggcaaga gactcattcg atgatctaga 10801 cttgctaggg gaagagacct ggggcgacct gattgatgaa atcgagtacg atgacccaac 10861 ttatgaagaa gattcagcga ttgtttcaga tttgttccgc cagctagatc aacaaaagac 10921 atctgaacaa tcttccatga aggcagaact tgttcaattt ggggaagata gtctcgacga 10981 aaaagaagtt tcaggtgaaa tagcttcgct ggaaaatgat ctcgggatgt gggaagaagt 11041 ccgtgaagca gcatcttatg gtagacagca ggacgcagat cctcacgaac tcgcctctaa 11101 tagtcaactg agtcgccaac aagagttgga cttgcaagta gactgggaaa actcgaatgt 11161 cgggccatta actcagacga ccgctattag aacaccaaca gccaaaccaa aacaaggcaa 11221 aagacgtaaa gtatcgcggg gtagtatagg agtcagtgct atcgctcttg cgggagcagc 11281 cctttgcgcc atcgtagccg tagtcggttt caactggtgg tcacataatc aacaaatacc 11341 ggtcttttct cctgagtctc aacaagaaag ccgcagcact ccacaagcca attttaaaac 11401 gactgaaact aaagttgtta gtgcgatcgc caccgacaaa ttgagtaagg gggaattaca 11461 aataggactt gaagctgtcg aagaattgct 
taatcgtaat ctacttccta acgctcaagc 11521 cgcccttgac gccattcccg aacagtttgc agacaatcca tcagtctact ttttcaaagg 11581 acgattagca tggcaattga tcaagacagg agacaataga tatagcgtcg atgatgcccg 11641 tcgttattgg gaaagtgcgc tcaaaactga accagactca attttgtata ataatgcttt 11701 aggctttgcc tactatgcac aaggtaacct taaccgagca aatgactatt ggtttaaggc 11761 gttgagtttc gcagtcagag aacagcagag taaatcggct gcttctgcaa caacattttt 11821 accctcaaag ccagtccctc gggatgcttt aacagcttat gctgggttag ctattggctt 11881 gtataaaact gcaaacaacc aaccaaatgg aaaacgcgag caatatctca atgaagccat 11941 caagctgcgt caaatgatca ttaaagacga cccagtcaat tttgagatta aagaattgac 12001 gaaaaattgg ctttggacgg acaaaacttt acaagattgg aaaacactcc tccaactaaa 12061 aagtcagcgt tccacacttc aaaattgacc aagaaagaat tcagaactta gaagtcagta 12121 gtcaaaagga tgaatagtaa aaaagataca aagccattca tgaatttctt taaaaaagaa 12181 aaaatgtatt tcatcaagta ggcgcgtaga attacggcgt cattatttaa agccccgcgc 12241 atttgtttgg gggggatgca ttcctagctg cggagttcta acaactgagt tcttcttaag 12301 atatttttta ctaatattcg ccatcatctt tgaacgtgcc aaaataaaaa catatgaaag 12361 cctagccaac gtatcaaggc tacgcgaaag ccatattcgt cacatgagct attgcttaaa 12421 tcccaattgc cccaaccctg ttaacatcaa taatgaaaaa ttctgccaca cgtgtggttc 12481 caagttagtg ctcaaagagc gctaccatgc catcaaacca ataggacagg gtggttttgg 12541 cagaactttc ttggctgtag atgaggataa gccatccaaa ccacgctgcg tcattaaaca 12601 attctacccc caagctcaag gtaccagcac agtccagaaa gcagtagagt tgtttaccca 12661 agaggcaatg cgcttggatg aattgggcaa gcatccccaa attccagaac tattagctta 12721 ttttactcaa gatgataggc agtatcttgt acaagaattt atcgacggac tgaacttagc 12781 ccaggaatta gaacaaaaag gtgctttcaa tgaagctcaa attcggcaac tgctgaatga 12841 tttattacca gtgctgcaat tttgtcatgc aagacaagtc attcaccgag atatcaagcc 12901 agaaaatatt attcgtcgta caaccacaaa atcaagcaac ggaaatctgg ttttagtaga 12961 ttttggtgct gcaaaagaag cgacttacac agctttaaat cgaacaggaa ccagtattgg 13021 gagtccagaa tatgttgctc ctgaacaaat tagaggaaga gctatttttg ccagtgatat 13081 atacagctta ggtgttacct gtgttcacct gttaactcaa cgctctcctt ttgacttgta 13141 cgatatcaac 
aacgacgctt ggatttggca acaatatctt acaagcccag ttagcaatga 13201 gttgagccgt attttggaca aaatgctaca aagcattccc gttcgacgtt accaaactgt 13261 agaggaagtc ctcaaagagt taaatcaaaa gccgcaatta gccccaacac ccgtaactcc 13321 gatcaaacct gttactccat caaaacctat ttccacacct gtttctacca atcaagcgac 13381 cactcaaata gatacagaat tagaggaaat gaaaactcta tttcttcaag gtggaaaatc 13441 caaaagcaat aaaggtcgta atgttcagcc tcagccacaa acaccccagt cgtcttctag 13501 taaaagcaaa aatattcagc ctcagccaca aacaccccag ccatcttcta gtaaaagcaa 13561 aatagatgaa gaattggaac agttaaaggc taatccgctg tcttttagta aaagcgaaat 13621 agatgaagaa ttggaagaat taaaggctaa gtataaaaaa ggtgaatgat agatagtaca 13681 ttgtcaaaat ttatatagta ttttaacatt ggttttacat aactcaataa gaattatcga 13741 aggaaggctt tctaagccga acttttgtat tacaaataaa gtggggaaaa attgacaaat 13801 tttacttaac cttaaaaaca aaatcgtagg ggagccagtg cgttggacgg gtttcccggc 13861 ttgaagcatc tggcgttggg cattgcccat aaaaggttat gtgatagaca tttgagatat 13921 tggtaagcaa tgcccacctt acagaaaact gctgtaagtg catgacagct tcataaaaag 13981 gagattaata tgtcaacttt agttcaacaa gcagttggaa ttcctctgag cgaccaagtg 14041 tcgaggtttt taactcagcc attgaaacta ctgattgggg gaaaatgggt tgagtctgcc 14101 agtggtcaaa cctttcctgt atataaccca gcaacaggtc aagtcatcgc ccatgctgct 14161 tctggtgaga gcgaagatat taatcgagca gtacaagctg cacgtgaagc ctttgaaaat 14221 ggtccttggt caaaactcac tgtttcagaa cgcggacgac tcatttggaa actagcggac 14281 ttaatagaag caaacctgga agaatttgcc gaattagaat ccctggataa tggtaaacca 14341 atcagtgttg ctcgtgttgc cgatgtgcca ggtgcggttg acttgtttcg ctacatggca 14401 ggatgggcga caaaaattga aggaaatacg attcccatct ctgctgggac tcaatatttt 14461 gcctacacag tgcgcgaacc tgttggagtt gtcggacaaa ttattccttg gaatttccct 14521 ttgctaatgg ctgcgtggaa attaggtccg gctttagctg ctggctgcac aatcgttttg 14581 aaaccagccg aacaaacccc cctgtcagct attcgcttag gcgaattaat ctgtgaagct 14641 ggatttcctg atggtgtggt caacattgtg actggttatg gtgaaactgc aggtgctccc 14701 ctagccgcac atccggatgt tgataaagtt gcttttactg gctcaactga agtgggtaga 14761 ttgattgtcc aagctgctgc tggtaacctc aaaaaggttt ctttagaact aggcggtaaa 
14821 tcacccaacg tcgtgttcaa agatgctgat ctagagatag cgattaaggg cgcagccaat 14881 tctatcttct tcaatcatgg tcaatgttgt aatgctggtt caagacttta cattcaacaa 14941 gatatttttg accaagtggt ggaaggtgtt gctgctgaag ctaagaaaat taaaattgga 15001 ccaggacttg accccagtac agaaatggga ccactgatct ctgatgagca actcgaccga 15061 gtttatggtt atatgcagtc aggttttgct gaaggtgcaa aagccgtaac tggtggacag 15121 ctaattggtg agcaaggcta ttttgtggag cctacagtgc tggttgatac taagcagaca 15181 atgaaggtgg tacaagagga gatatttggt ccggttgtaa cggcgatgcc ttttagagaa 15241 gtggatgagg ttgtcccact ggctaacgat agcacttatg gcttagcggc aggaatctgg 15301 actaacgata tatcgaaagc tcatcgttta gcgtctaaat tgcgtgctgg tacggtttgg 15361 attaactgct atcacatctt tgatgctgct ctgccctttg gcggctacaa gcagtcagga 15421 tggggtcgag acatgggaca caatgcgcta gagctttaca ctgaggtcaa atctgtttgt 15481 gtgaagcttt aggatactgt tagcctctgt gtcattgcgg tataaatcaa tttttatcgt 15541 tcctatgctc tgcatgggaa tgaagtatgg gaggctctgc ctccttatta atcagcttga 15601 gcaatacggt tcagttagag ccaaaaacct taaattttgt aggttgggga gccactgcgt 15661 tgggcgggca atgcccgact tgttgcaaga gcgtctccgc aggagttagc aagtggcgtt 15721 tgaacgtagt gaaacccaac aaaattctga aaatcttggt ttatgtcctg cggacacgct 15781 gcgctaacgt tgctcaaacg ccagacgcca agtgagggaa accctcctcc agcgctggct 15841 ccccaaccta ctattatcct taactgaacc gtattgtcag cttgaggcag agccttgacc 15901 aaatgcattc cttgccagag gcaaggaacg agaaaacaag ctaaaaaaag tgactctgaa 15961 aaactcagag tcactcaaaa ttctcaactg aagttcttga gactaccgta gaatttgaag 16021 gtgtctttgg ttgtctagct tgcagttgtt atgacatgcc atgttataca gatgtttcag 16081 tgatgggtac ccattctgta tggaaggcac cgggcttatc gacacgctcg taggtgtgcg 16141 caccgaagta gtcgcgttga gcttgtgtga ggttttgagg taggcgatcg cgacgatagc 16201 tatcgaagta atccaaagat gcactaaatg ctggcactgg aattcccagt ttggcggctg 16261 ttaccaaaac ttcacgccaa gctgtttgtc tatcaagaat tgtctgcttg aattctgggg 16321 ctaacaggag gttaggtagt gctggatttt cgttaaacgc tttcttaatt ttgttcaaaa 16381 atccagcgcg aatgatacaa ccacctttcc aaatccgcgc caattcactc agtttcaaat 16441 cccaattata ggtttgggaa gctttcgaca gcaacgccat 
tccttgagca taagaacaga 16501 ttttcgagca gtagagcgcg tcacgcacct tgttgataaa ctctttggta tctccgtcat 16561 acttgccagt aggacctgtg agtaccttgg atgctgcgac ccgttcttgt ttgtaggaag 16621 agatgatgcg accattaact gctgcgataa tcgttggtat ggaaacaccc aattccaaag 16681 cagtttgtac agtccaacgt cctgttccct tttgacctgc ggagtcaaca atcaaatcta 16741 ccaaaggtaa atttgtgtct gggtcaatgt aagggaaaat gtttgacgta atttcaatca 16801 aaaacgagtt gagctcgtct gtggtgttcc actggctaaa aacttcatga agttgattat 16861 ggtcaagtcc agcggcattt ttcagtaaat cgtaggcttc tgcaatcagt tgcatatcgc 16921 cgtactcaat accgttgtgt accattttga cgtagtgacc agaaccacca ggaccaatgt 16981 aggtgacaca gggaccatca tcgacttggg cagcaatttt tgtaaaaatt ggctccagat 17041 actggtaaga gctttgagta cctcctggca ttagtgaagg accattgagt gctccttctt 17101 caccaccgct gacgcccata ccaataaacc gaaacccagc gggttctagt tcttgggtgc 17161 gtctatctgt atcttcaaac caagagttac caccatcgat aataatgtcg ccttcgtcca 17221 gcaagggttt gagctgacta atcactgcgt ctacaggttt acctgcttgc accatgatta 17281 aaattcttct gggacgttcc aaggagccaa cgaactcttc taaagtaaaa gcagctttag 17341 cgtttcttcc ttgggcacgc tcagccatga atttatcagt tttgtcgcgc gagcggttgt 17401 aaactgcgat tggaaagccg ttacgttcaa cgtttagagc gatattttca cccataacag 17461 ctagaccaat cacaccaaag ctttgctgtg tcataaaatt ttggctaact cttgcagatt 17521 ctctttctct ttaagggtag tctgagattt ctgcttctct cctaaagaag acattaagac 17581 ttcctaaact cagccgaaat attccaccca cacagataaa aaattttaat ttgtatctga 17641 ttcaacaact ttctggcaaa atttgcaaaa ttgagatatc tagttaaacg atgggggact 17701 agtaccgctg cgcggaagta aaaagtcaaa agtcaaaaag ttgattatgt gggcttttcg 17761 ctgctttgga atggtgaatc cagcgcgcat gaggagggtt tccctcactt ggcgactggt 17821 gtatgcgcaa agcgcacgcc cagagggcta aagccgtcag gcgaaagcag gcatacccga 17881 tagccgtaag gcgtggccgt tcccactagg gtagctttat ttccgccgcg ttggactaga 17941 tactttggac tgagtacttc tactcccaat tcctcatcct caatcccaaa tccctagtga 18001 aatgacacag taaggagtaa ccacaaaatg ctggcatata tcctagcgtt gactgtcggt 18061 cttggaagtg tagcgcttta catagcagct ttcttttttc ctgaaatcca ccgtaagaat 18121 gattttattt tgagtggtgt 
aggactgttc tacgccttgg tgttatggat atttgcccga 18181 ggcattactg ggggtctttt gctgggtcat gtagctagtg tggctctttt aggttggttt 18241 ggctggcaaa cactgtcgtt acgtcgtcaa ctcacaccaa agatacaaca aactcagata 18301 cccagcactg aggtagtaaa aacaaacatt cagcagcagg tttctcagtt gtcgcttcca 18361 caaaaacttg ctcagttacc aaaagctatt ggtagtcgtt tttcgggggt caaagaccga 18421 gcgcaaactg cgggtaaaaa gacacccgaa aagtctaaaa ggacacagac tcctgtaccc 18481 acagccaaac ctgtagttga gattgtcgat gaacgcattc cagtcacaga ccaaccagca 18541 gttatcccgc ctgctaccac tgataccgaa gcaaaaaccc caacagcatc tgctcctacg 18601 gaagcaaaaa ctgagactcc tccagaagca gttccaccga atccttcatc tccggaattg 18661 ctagaagccg ctcaagcaca tgagactgag gaaaaaacac cgcctccgat tgaggaagta 18721 gcaccacgag ctgcacttgc tcctccagca gaagcacctc caggacaagt accaccgaaa 18781 aaccaggctg aaggttcata atagttaaag gcgatatcaa aattctacgt ccacaagtgt 18841 gtccgcactg cggaggaaag tacttcgcgc ctaatatggt taatgtcgaa atccagcaag 18901 tcgcgcagtt agtcgcactc tcctattgaa atcgtcgaat atcatcgcta tcacaatcaa 18961 tgttcgcact gcggaacatc ttgtcctgcc gactggtcag ggttgggtca caacgattga 19021 aatactgcgg tgccgacctg agatgccccg tgtataaata ctcatttttt ccgaaaaatc 19081 tacgtttggg gcatcttagc gcccagagtg tcctccggac acgctgcgcg aacgttggcg 19141 ctaaaccaag gcgcttagta atgaggtgag tccagcgctg caggagggtt tccctcactt 19201 ggcgactggc gaacccgaag ggcaccgcag tttttccggt tttaatgaag cgggagatga 19261 ttccaggaca agatttagag tacgactgca agcgttaatt ggttggttgg gcaattacgg 19321 acacgtacca tatgcgaaaa tccgcgaact gttatttgag ttgggacaaa ttgatatcgg 19381 cgagggaacg ctcgttgcga caaatgaacg tgtcgctgtt gcgattgacc cgccagtcga 19441 agcattgggt acatgggtga aaaacgagca accaaatatt cacgttgatg agacaccttg 19501 gtcagtcaaa ggagtaaagg aatggttgtg ggtcgtcgcc aataaaacgt tttgtttgtt 19561 tcatgccgcc gacacgcgat cgcccgtata ctgcccaacc aggctacgcc tcgttgcccc 19621 gaatcggaga ttcggggaac cctattctgg aagaactttc tttggcagta gcgcgaagcg 19681 cgccctgctc ccaaggggag ccgcgcaagg cgcgatcgcg tgtcgaactc gaacaattat 19741 tgggaactga gtatcgtggg gtcaacagca gtgacgattt tagtgtttac aatgcctact 19801 
gtgttgcaag tcaccaaaag tgtctggcac acttacgacg acattttctg cggttaatga 19861 gcgcgtcctg gtcgggataa tgcaacgagc aggtaatgcg ttcgttgatt tgattgatga 19921 tgtgtttgat aactatcgtc aatttcaaca ctcaaacgac tttcaacagt ataccagttg 19981 ggcgaaacaa ttcaaatcaa agttgagcca tgccttgaat acttggatac caaaagctag 20041 tgcttttgtg ttgaacctgt tatcgaaatt gcggacaagt atgacagcgt ggtggttttt 20101 tcttgatcat ccggaagttc cccctgataa caatttggcg gagcgctcgt tgcgattggc 20161 ggtaacaaaa cgtaaagtca gtggtggttc gcgttacaac taaagatttt acctaagttt 20221 aatcaaacgt tagtgtttga tcaattgaac agttgatatt ctttctgtaa gctttagcaa 20281 gcgtaaaaaa atacacagac ttactgcact cccttgcaat ggcactccgg tgggcacgga 20341 atgtttgagc taacctcaat tgcctgaaat tttcctcagc acagcagagc atcgaaactt 20401 tgtttaatcc gcacgaatgg tggtatttat gattcaactc ctcagagatc atctacgaac 20461 attgatctgg tttgcagttg gcttgatctg tgtgcttctg ttagcgttgg gattgccgct 20521 tactcagtta gcttcaaccg ctcagaaccc caatcatcag agttctgctc atgcatatac 20581 cccaaaaaac ccgcctggca ctccgtattg gtcggctggg gtgaccaggg attcaaacca 20641 cttgccaaat ttttagtggc ttctaggttt taggaacttc caaggtgcga ttcgttgatg 20701 gctgaatcgc actttggaag ttccttagtg atcccgatag attacaattt caacttttga 20761 cgaaacattt ctaaataaat tatcaaggca accccgatga tttcctggtt tgaaaagcat 20821 gtacgagcga tcgcatggct cacaagttgt ctgctccttt tcacgacgct cttgattgaa 20881 cagcctgcct ttgctcaaga ggtaagcggg aatggtgtta acgtcattat catgattggc 20941 gatggaatgg gttgggagcc tgtgcgtgca gctgccatta ccaacggtgg ttctttttac 21001 acccgtggag aaggtcgtgg tctgaattta cagaagctga aaggctacac ctatgcgacg 21061 acctacggta ctaccatttt cgatcagaag agtggtaggt tttctaccgg caactccgca 21121 ttggatggaa ctaacaatgt aaccggtcag agcaacatcc gtcctgggtt ttcctttaag 21181 cctttaccgt ttaatcccgg tactgactta gcaagtggat caggtggtgc aacagaccca 21241 tcgttgggta atcttacagg ttgggaggtt gaaaaaggtg gtcctaatcc ctggacacct 21301 gctacaccgc caccttgcga cattacagta ccagtaggaa aatctattaa cgacgttcct 21361 aatcggttca actgccaggc agaatatatc aagctgagcc taccagattc cgccaacact 21421 gccttcacac tctacacagg agtgaagagc tacaacaacg ccatggtaga 
catctttgaa 21481 aagcctatcg aaacgattct gcaaaccgcg agaaagcaag gcaaatcaac aggtctgtta 21541 acctcagttc ccatcagcca cgcaacgccc ggtgcagccg aagcgtctgt caaccggcgt 21601 accaagtatg atgctgacta tccaaccttg gataacatta tccagcagtc tattagaccg 21661 gatttcaatg aaaacccaga ccgacctgac ctggaggaca ttttcctacc cacggtgctg 21721 ctaggtggtg gtcatcccct cgatcatgac aacacggtca acacaccagg tcaagtgggt 21781 tacaaagagc caggtacttg caactacgtc tacatcagag catctaccta taaggaactg 21841 acaggtaaaa cttctctcag tgatgcagag gcttgcaaag cgactgcttc atcgaatccc 21901 aacaaccgct acggctatcg tttcttagaa cggagtccga atgccgcgaa gcttctccta 21961 aagacttcca gagaaattga tcccaacaag ggtgaacgct tgctgggact ctacggtgct 22021 cgcggacagg agggcaacat tccagtgagt ggtgctgacg gggattacag tattaccggt 22081 cttgccaact ttgcccgtca gtcttctttg tacaaccgta ccaacgatgc ttataagcgg 22141 ggtgatatac cagtcaatga taccgatcgc ccactccagc ccggtgaaac caatgaggct 22201 ttcattgcgc gtgaggtgaa tgagaatcct accttaggag acttgacaca agcggcactc 22261 aatgtgttag gcaaggataa ggacggtttc tggttgatgg tcgaaggcgg cgatgttgat 22321 tggggaattc atgacaacaa catcgataac atcattggta cggtactcga ctttgataag 22381 gctgtaggcg ttgcaatgaa atggatcgag cgtaacggtg gttggcaaaa gaacgttcta 22441 attgtcacgg cagaccatga ccattattta acactgtatc ccaacttccc agaattgctg 22501 agaaccaagg gcgctaaagc acttacctat ggttctgaaa ctgactcggc ttctgtgggt 22561 cacagctttg ggtcaattcc tgaagacaaa tacggttggg gcagccacac tcgtcgtccg 22621 gtgccagttt actatcaggg cagaccgttc aagctggaca agtacatcgg taagggctac 22681 aaaaactacg gatttgatgt tcctggcgtt ccaaatgcag tagaccaggt tcacatttac 22741 aaggcaatgt atgaggctat cactggtaaa gaaccgcaag atccctcata aaaagggctt 22801 gtctgaaatt agagtttccc ccttgtctat catgtccgga taaacacttg taaaaaacta 22861 aaaaccttac caaactcctt tccctctccg caggttcgga gagggaaagg agttttgaat 22921 gagttacaga tacatttatc tgtgttcatc tctggttcat tatttctttg gtgtacctta 22981 cccaattaca aacagctaga aacattagat attacgatat gctttcggtt aagcagcact 23041 tgaattggtc tgaaattgcc tttgatcgct cctgcacttg ttgtgacttt agtcgttcac 23101 aaaatcagcc tcagaaaaag atctggcgat 
tatggctggt gctggcgttg ctaatcagtg 23161 ttctgccagc agaaattggt atcggactgt ggagccatag tctgtcgctc caagctgatg 23221 ccggacacct gttatcggat gtcggtgcgt tggggctaac actgttggca agctggcttg 23281 ccgggcgtcc agcggcgggg cgggcgacct ttggacatgg gcgagttgaa attctggcgg 23341 cgttaataaa tggtttaggg ctgctgttga tcgccgggtt tattgcttgg gaagcgatcg 23401 cacggttcca gaatccagaa cctatcttga gtctgccttt gttgttagga gcaggactgg 23461 ggttgattgt caacggtttg aatttaacgc tgctgcacaa gcacagtctc gacgatttga 23521 atctccgagc agcattcctg cacatcgttg ctgatgcatt tagcgcggtg gggattctgg 23581 tagcagcact ggtgatttac tggctgaaat ggtggtggat tgacccggtc atgagtctat 23641 tgattgcctg ttttgctggt ctaagcgcca ttcccctaat ctggagtagt ctggaggttt 23701 tgctagagta tgctccgcgc ttcatcaagc ctaccgaggt tgaagcagca ttgcaatcgt 23761 ttccgggggt gcgccgggtt gaaacgctaa gaatttgggc gattgggtgc ggtcagactg 23821 ccctgtgcgc ccacttacgt atcgagccgt taaatgctca acagcaggat caattgcaat 23881 gggagttaca gactcattta gtcgaagcgt ttggtatcca cgaatcgacg ctgcaattgt 23941 ctagcggcag tgtcacaaac ttggttcctc tgcatccgct actcaaccgt agcctggttt 24001 catatattta taagtaaccg ggaaatgtca gcaggattaa tcggattact ttaaagaaaa 24061 ctcctaacaa caaagataga gtaccatatg tttaatattc gactatcttc gcaggcatag 24121 ctcgcgtcgc ctgttgaaat gttttgcacc caagtcaaaa accttctcac cgttattcgt 24181 gagaaaaaga cattactatt ccttggcatg aaatcgtgac agaaactgga aaatacaaag 24241 ataccgtaaa tttacccaaa acaaactttg atatgcgagc gaacgctgtg aagcgggagc 24301 cagaaatcca aaagttttgg gaagaaaata aaatttatga acgcctgtcc caaaataatc 24361 caggcgagtt atttatattg cacgatggac caccctacgc aaatggttct ctccatcttg 24421 gtcatgcttt aaataaaatt cttaaagaca ttattaaccg ctatcaactg ctgcgcgggc 24481 gtaaagttcg ttatgtacct ggttgggact gtcacggttt gccgattgaa ctaaaagtgt 24541 tgcaaaatat gaagtcggca gaacgacaaa atttgacgcc tttgcaactg cgtcataaag 24601 caaaagattt tgccctctct acagtcaatg aacagcgtga aagtttcaaa cgctacggga 24661 tttggggtga ttgggaacac ccgtatctaa ctctgaagcc ggaatatgag gcggctcaag 24721 ttggtgtttt cggtcaaatg gtgttaaaag gatacatcta tcgtggtctg aagccagtac 24781 actggagtcc 
gagttctaat acggcactgg cagaagctga attggagtat cctgaaggtc 24841 acacttcgcg tagtatttat gtggctttcc caatgacggg tttatcggag acggtaaaac 24901 ctgcactagg ggaattttta actcagttgg gtgtggctat ctggacaacg acaccttgga 24961 caattcctgc taacttggct gttgcggtaa acccagatct caagtacgcg gtggtggaag 25021 tggaacccca ccccccaacc ccctccccgc aagcagggag ggggagtgaa gaaaatgctt 25081 cccactccgt gcaagcagga agagagagtg aagaagttgc ttccccctct ccccttgggg 25141 agagggggac tgagggggtg aggttccgct acctgattgt tgctgctgat ttggtggaac 25201 gtttatctga ggttctggga actcagctaa cagtaaaaac cacagtcaaa gggaaagatt 25261 tagaacattc cacttaccgt catcctttgt ttgaccgcga aagtccgatt gtcatcggcg 25321 gtgattatgt aacgactgag tcaggtactg gtttggtaca cacagctccc ggacatggtc 25381 aagaagacta catagttggt cagcgctacg gtttaccgat tcttgcgcca gtagatgaca 25441 atggaaattt caccgaggaa gcgggacagt ttgctgggtt gaatgttctg ggtgatggaa 25501 accaggcggt gattgatgct ttaacggcgt cgggttctct gttgaaggag gaagcttatg 25561 ttcacaagta tccttatgac tggcggacaa agaaaccgac gattttccgg gcgacggaac 25621 agtggtttgc ttcagtcgaa ggatttcggg atgaggcact caaggcgatc gccactgtaa 25681 aatggattcc cgcccaaggc gaaaaccgca tcacaccaat ggttgcagaa cgttctgact 25741 ggtgtatctc ccgtcagcgc agttggggtg tcccaattcc tgttttttac gatgaagaaa 25801 ccagcgaacc actgctgaat gaagaaacca tttcccacgt ccaagcaata tttgccgaaa 25861 agggttctga tgcttggtgg gaactttcgg ttgaggagtt attaccagaa aaataccgga 25921 acaacggtcg gtcttaccgt aaggggactg atacgatgga tgtgtggttt gattctggtt 25981 catcttgggc agctgtggca gaacaacgac cagagttgcg ctatcctgct gatttgtatc 26041 tggaaggttc agaccaacat cgcggttggt tccagtcaag tttactcacc agtgtagcgg 26101 ttaacgattg tgcgccgtac aaaaaggtgt tgactcacgg ctttatctta gatgaacaag 26161 ggcgtaagca aagtaagtca ctgggaaaca cggttgaacc gaaagtcgtg attgaaggtg 26221 ggaaaaatca aaaagaagaa cccgcctatg gtgtagacgt gttgcgttta tgggtgtcgt 26281 cagtagatta cacctcggat gttccgcttg gtaaaaatat cctcaagcaa ctgacggata 26341 tcagaaacaa gattcgcaat acggcgcggt tcttgttggg taacttgcac gattttgatc 26401 ctcaaaaaga cgcagtaccg ttcgaggaat taccgcagct tgatcgctat atgctgcacc 
26461 gcatgacgga agtctttaag gaagtgacag aagcgtttga aagtttccaa ttcttccgct 26521 ttttccaaac aattcaaaat ttctgcgtgg ttgatttatc caacttctat ctggatgctg 26581 ctaaggatag actgtacatc agtgcgccaa atgctttccg gcgtcgcagt tgtcaaacgg 26641 tgctgaagat tgctttggag aatttagcac gggcgatcgc ccctgtgtta tgccatatgg 26701 cagaagacat ctggcaattt ctcccgtaca aaacacctta caagtcagtg tttgaagctg 26761 gttgggtgca gttggacgag aagtggcgaa atccagaatt agcagtagtg tggcagcaat 26821 tgcgacaagt ccgcactgag gtgaataaag tgctggaaga agccagagtc aagaaaatga 26881 ttggttcttc cttggaagct aaggtgctac tgtctgtagt agatgagcag ttacgctcta 26941 cagtaaaatc actgaatgca actcagaatg gaatagacga actgcgttat ttgttcatta 27001 cctcacaagt agagttgttg gattctccgg aagcggtgca aggattggag tacaaattgc 27061 agtccgatgc atgggaaatt ggaattgtga acgcagaggg acaaaagtgc gatcgctgct 27121 ggaactactc aactcatgtc ggagagtcag ccgaacatcc cctgatttgt gaacgttgtg 27181 tttctgcttt agcaggagag ttttaacagt tatcagtgag tccagtgctg caggagggtt 27241 tccctccgta ggcatctggc gaacccgata gccgtaaggc gtggcgtcag ccatagggtg 27301 atcagttatc agttatcagt acttgttact cataaccttg ataactgttc actgtgaaat 27361 ccatctcacc aaacttgata actgttccct gttccctgtt aagcgttccc tgctataaat 27421 ttgttttttg ctgcttttga agttcagcag actctagctt ttgcaacaac tcttgagtta 27481 attgatagaa tgcttttgac cctgctgtgc tggaattgtt taaaacaaca ggcataaaac 27541 tgtcaacagc cttagcaaca ttcacatcca ctggtatttg tgtttgacaa atttttccat 27601 ccccaaaatc ttgatgaacg cgctgcatca cttgtttgta atatcttccg ctgagaaagt 27661 tagcgttaga cattgtgaag atgattccca gcatttgtat atctatatgt gcttcatgtt 27721 cgtgactttc tttcaactgg gcgatgcgtc tttccagcaa ttgaatacct accacagata 27781 gtggttctgg tttagcagga agtatatagt aattactagt tgctaaagca ctacgagtca 27841 aaagattata tccaggagcg cagtctaaga tgataaaatc atattcttgc cgaatgggtt 27901 ctaagatttt tccaaccaaa actctttcaa aacgattcca aatcgtttca aattcatttt 27961 cacccaaact gactgcttgt tgatgcagca tttgtgagac gacaaattca tcgtacaagt 28021 cgatatctcc tggcaataaa tccagtccgg ggagattaca aacttgaggc tgaatgatat 28081 ctcgaatcgt cagttttgcc cttgatgagg ggttgataat 
gtcatctatc agatatctca 28141 atgtccgttt gtttttgcga agcttggcaa aatccacagg ggacatcaga ctaagtgtgg 28201 cactaatttg gctatctaag tcaaggacaa gcacccgctt gttatgattt ttaactaaac 28261 aagtcgccat attgacggtg agggtggttt taccaactcc gcccttcata tttgcagtag 28321 caatgacata tcccatcggt taattcctct gataaagcat tctcatctac ttaacgtaaa 28381 tgttttatta tagagagaaa atttttgcga agtttaatat ttttaaatag tcaagagcat 28441 gacctcagtg acgcttcttt aattgaagca gtcaagatta tagcagtgag cggtactttg 28501 gtggggagat aagggtgtaa tctttcttat tttagatcta ttatggttat aatagataca 28561 aagcaattta gcaaagttcg gctcaatagt tttgtttata aattctgaat tcaagctttc 28621 agaaggttta gagaaaaaac gatcctaaaa atgattgata gctttacggt aggtacaagt 28681 taattgagtt acgtaatttt ctttatatca tacagtacta tgatcgtgtc tttctgtctg 28741 tgcagaaggc attactccat gccaattaaa aatatccgga aagcacattg gagcaatgga 28801 atagctatct cagcatcatt agaaaagatg caaatcgggg gctaagatgg ataatttaga 28861 atcgaaaatt gttgaaattt taaagcgtgg tactcctcta agagcagtac atattgctga 28921 tatgcttgga gttgaaagaa gagagataaa ccactaccta tactcttcat taaagcatat 28981 ggttgtgcaa gatagtgact atagatggtc attgaaaaca aatcaaataa gcaagactca 29041 gcgtccctca actcaagcgc caacttctca tactaagtca tcacaacctg taaggcaaaa 29101 taacctgtat agctttacta aagaagagat taaacaagat agtccacttc gatcaactcc 29161 tcaaactaaa tcatcacaac atgccaagcc aacctctcaa actccatcat cacaacctag 29221 taagcacgaa aacctacaga aattttcgga aaaatctttc caacaaaatg gtcaacctca 29281 acaagcttca acggctcaaa ctccaccatt atcacaacct gtaaagcaaa ataatcctta 29341 caaagttatt aaaacagaac ttggtcaagc atctccagaa gaaaaagtaa aaattataga 29401 aaatgctttt aggcaagagc aattccgaga actcgaagat gaagaaatta atgctcttca 29461 atcaatttta gaacagtcta ggcgtgaaat agatatagcc aatacagcat atacacaagg 29521 taaactaagc actcgaaaaa ataatcctat tatgattgca attttatcag ttgcattaac 29581 tctgagtaca cttttcctta tcagtcaatt tataccaaac ttaactaacc aaccttctcc 29641 aacaattccg caaactaagt caatgcaata gttacgctga atcaaccctg ttaccgatgg 29701 gaggttgaga ggggtagccg aatatctgtt gcgaataaac tttgtaagcg gtacaattga 29761 gcgcccattt gataaagcaa 
atcgcctttc cctaataaac gagcagcggt tgtttgttta 29821 cctcccaaga taatagctga gtctgcttca ctagcagttc tcagtgcaac ccttcctggt 29881 aaatttgagc ggataattgg ggtgacaata ccggcttctg ggcgttgagt ggcgataatc 29941 aaatgaattc cggctgctct tgccattgca cctaatcgtt ttatactttg ttctagtgct 30001 gtgcgggatt ctttctccgc catgaaatca gcgtattcat caaagatgca gacaattcta 30061 ggcaaaggct gaggagaacg ttgattataa gtacttaaat cagcgcattt cgctttttca 30121 aaccgttgat agcgcgattc catttctgtt accagttcct gcatcagttc aatagcgcga 30181 tcgctctcct tcacaacagg tgaatacaac cactgcatcc gctcaaactc aggaaatgtc 30241 actcgtttag gatcaacaag agcaatttgt aaatgtgcag gggaatgacg caaaatgaga 30301 ctgaggagga gcgatcgcaa atactcactc ttcccacttc cagtagtccc ccctaccaaa 30361 aagtgacacg tatttggatc agacaaatca gcttctatca gttgtccctc caaatttact 30421 ccaattgcaa ttttcactgg tgctgttgct ggcaaaactt gcgattttat gtaattctca 30481 aacttagcaa cttgtctgtc gggacgaggt aaatcaatac tgacatatcc agcttgaggg 30541 gaaatcagag gcggatttgc taaccccaat tgaacttgca aatctgctga caatttcagc 30601 aaggagttca ccttaacacc aagttgaggc ttaagtttta cccgaacaaa agctggacca 30661 acagccgctc cgtgataatc tacactaata ccaaaagatt gcagagtttt aaccaaattt 30721 tctcccattt catcaggatt aaatggttgc ttatgattgt ttcctgtctg ttcatcctga 30781 ctaactgttt tttgctgatt ctcttcataa gatgattcct caccttgaga ttgttccaca 30841 aagtaacttt gacacttttg ctgctgtgga caaattttgc acaggtgagg ctgggttgtt 30901 aagggtggag aattaggtaa tggtggttcc caagtcaacc attgacgcat ttgctgtaat 30961 ttataaggaa tcaactgatg aaccgtattt tccagttgtt cccaagaata ctgatactct 31021 ttgaattctg gtaaaacaca gtaaaccgct gagtcaacag ctactttttt cttttgccac 31081 agcatataac tataaagcgc aacttgagct aattgtgctg atggatccgc tggttgatag 31141 gttttaaatt ccaccacaca aaggcgcttt ttttcaaaat tgtagactaa gcaatcaaat 31201 tcccctgcta ttcgttgctg cgtgtcatcc ggcagattaa agtagtactc aagcttgcgc 31261 tcttcagtga taaaagtgtc acgaataagt gtttctgcac tgcaatagcg gcgattgatg 31321 attagcaatt gggcaaagtg tttgatgagt ccttgtaatc cttgccaaac tttgaataat 31381 gcttgtgctt gactcgaatc ttttttaata gcttcttgta ggtaagaata aaacttgatt 31441 
tcataaaaaa tctgttgcag actagaagca atttcctcta tctctaattg ggctgctgtt 31501 ggttgaaaca aatctgtaca ttgaggttca tgagaaacaa aacgtacaaa atcattagct 31561 aattgatgaa atgtattacc aattccaaca ggattttctg gaggcaaaaa taatgtattg 31621 cctccaaagc gctgattgag ataaaataag cgtgggcatt caaaagcaac tcgtactttt 31681 gtagcgctta aagataattg ttcagatttt tttcttgatt ttgaaccatt agctaccata 31741 tcaaaaatat tatttatagt attagggtta gaaatttcca ataagcgacg atggacagca 31801 gccagataga ctgtatccat tgcagcatag tgtagttgtt tttgtgtgag aggtcgtttt 31861 ccccaatcgc ttccctgttc ttctttatct acattagaga attgacaaag ttccgttgct 31921 aaagttttca gttgtagatt agagacttgc aaaacttcgc gggtaatttt tcgggctagc 31981 ttcagcgtac aggtaacatt tggtgcttgc ttacctccaa gatattttaa atcaaagctg 32041 gcgttgtgaa atactttttc tatttggggg ttaactataa tttgattaat aaaatatgct 32101 gctaaatcag gtttattcaa tacatcaagg atataagcag agtcactcgt caagtttgtc 32161 ggttcagcta gcacttgaat gagcgacagt ctcggatagt aagtatccca gtcagcgatt 32221 tctgtatcta gccatagggt tttagccgag gcaaatttgg ctatttgcgc tcgtatttca 32281 gttggttgtg tgagatattg cattggcttt caaacaaaaa tcatgtttta gctatgaaac 32341 aaattaattg gtcttgtaac ttagcttttg gattaataat tgtcaccttc tgttcttgac 32401 acaatagatc aatcaagagt tgaacatcag cttctttaac aaaagaaaac tgattaacag 32461 actgtatcat taaagttgga actcccatgt aaccttgagt tttgattaga ttcaacaaga 32521 aatctttgac aggtctgaaa tcttttttcc catttctatc ttctggaaca gtctcttgtt 32581 tagaaagaat tcctaaatcc tgtaataaag tacatttttc taaaatttta gactcacgga 32641 ttaaggtttg tagttcctgt aaagtaatgg ttttacttgc aagcaccaat tcattagctt 32701 ccactgactt cacaaagctg tggtatgtcg ccaaatagtg aacagaagac agattaggct 32761 tgatatgacg atgattagta gatgtgaaaa tctgttggta gagctgattc ccaacaaggt 32821 ttggtttccc tacccctcca gcccggagca agtacatagt ttgacagaga ttttgctgaa 32881 tagctttctg gcaagcagtc atcatattga aaaaactgtt catgtgtgaa tcttctgtcc 32941 agactactcc tacccgttca cgcttaccag gttgttgata actcaaagaa taactagcgt 33001 accttccact tataagtttc ggcttaattt cctgtacttg taatgcttcc aaggcttgtt 33061 gcaacatcca aatgagatcg ggtgcagcta acaagataat tttggtattt 
tttccctgaa 33121 ctttttgata ttcctgctgc catagcaatt caaattctgc tttaattacc tcccaagaaa 33181 tttccgtttc taccgtcacc tcaccatcaa accattttgg ctgaggaggt tgtttatctc 33241 taaatagcca ttccttataa tcctggaata gttgtttccc tagcgtcaga aaatcgcggg 33301 gtgtagcttt gcggctgaga aacacttttt ctaacacctg ccgatttaac gggtaaatta 33361 gggaagtagg tttggggtta gcttgattgt gcaggggata gagtcgagaa gctaagaggg 33421 cttcgccttc ttccagcgtg atgcgtttta gtggaattct gatactaact ctatctatgt 33481 cagagggctg aactcgctta gaattgttat accagttctc tgttctgatg ctgatgataa 33541 tcaggaaatt tttccacctg ccgttataaa ttgtagagtt gacgttgaat aaagcttgta 33601 gatcaagaaa gccatctggt aaacgagcaa tgctatctaa ctgatcgaaa cacaacacaa 33661 tgggctgggt tttcgcagaa actttgctaa agttggctaa aataccccgc gccttgtctt 33721 catcatcaat agattgctta acttttaact ttttcaaact ttcttcatct aaatcatccc 33781 ctttcaacca ttcacaagct agagagtata agtctggatt tgtcaggtca taaagcacac 33841 caaaaaattc attggcattg taaattcctg tggtaccgat ggtcttttta aggatgtcaa 33901 tgaagagttg gcgatcgcgt tcagtgtctg tttttccaaa aaagccttta attctatcaa 33961 taaatttttg ctgctcgctt tttaagccct tctcaatcgt agacaagcag cttttgatcc 34021 agaggattaa ttgggaatct gcttgtcctg ctggagcatt aaccaaacta tcaacggtgt 34081 agcgtaagat gtgccgccag atacggtcgc tctgagaaaa tggctcaatg taaacgaaaa 34141 aagcttggtc atttagctgt tctttgagct gacccaagaa atgagtcttt cccgttccaa 34201 cgtcaccata gagaattagg ctacgggttt catggtcttg ggcaatttgt gctattacac 34261 ttttaatttc gattaattgc ttttgatgaa tagattcaac agtagggaat ggctcttgct 34321 cttcccaaaa gttacgtgct gctgggttat caaatggatt cagcgatttt ttaataactt 34381 gatcaattgt tgccatgcca atatccctgc tgtcttaaag ttagcctgct cataatttag 34441 gtgtatgtcg ctagattgct gtgatgaaaa acaaagagcc accgctgatc tgagaaattc 34501 cagcgtcaat ttgttcagga gtataatcgc gtggttctgc taatgtactc aattcaattt 34561 ggtcagcttc ttccagacga tagagcacct tatctacttc ttcccgtgac agcggaggtt 34621 gtaacttttg ccgcagatgg aagattggca aataattgtc agtacccagt tcccgatcta 34681 atctccggat ggtttctaaa atttcttcgt cagtcaaatt gataattgtt acagcagccg 34741 attctctact ggtaggtgct gaagtagaga 
cttcctcagg cttactatgt agagatttcc 34801 gtaaaaagcg caggtaatta ttcagcaaat ccaatctaat cgttgcagcg cccttcggac 34861 tataatcttc ccgtaaatac tctagtgctc gttcagtcag ccatacttca gctttagtct 34921 ttttgatttt ggattcagcg tcaatcaatc cccgttcact cagagttttc aatatcgcct 34981 ctttctcagg cgctttcaac gacgacactt tgattttgct gggggagact ttaccagcag 35041 ctttacttat cttttctagt accttgaatt cttgatctgt gattggcaac tgagctgatt 35101 cgagtttcaa taaagcgtta ccaggtggta aaattttaac tgaggcaatc tcgcgggaat 35161 agtctactag ttcgcgatcg cctaaacttt gacaaatttt atctttaccc ttaaaagctt 35221 tgaaaccact agcactaagg cttgaccgat aattaggaca tcctaataac ttgagtagaa 35281 actttaactc atttgtatcc atgcggattc ttgcctgtat ttaattcagc accactctag 35341 ctcaagtggc aataattcac accttttctt atcctaacta gatttagggt atttaggcag 35401 ggtgagtgtt atctaaatat caatacatat atatagcagt cctaaataat tcatgaaaat 35461 ctctgtttct tctcttggcg ttcttggcgt cttggcggtt cgataatttc cacaactcaa 35521 ataggattgc tatagcagtc ctaaatcatt tgtaagaaac aagatccccg acttctcact 35581 acaagtcggg gatctgagtt tttcaatttt cacccctcca tctcccctac cccttcactc 35641 ttctcaccaa atcaaacacc aaagaacgca acctcacagg taacagcgac aaccccaacc 35701 cagtccacgc cttagcagga gaaaaagact tccctgctaa aactaacttt cttccttttt 35761 gcgtttcacc cttttcaatt aaccgtaacc ctaacaataa ctgtgtttct gtaagacgct 35821 gttgtctaat tttctccagc ttctcaaact caaattgata actttccaaa tatcgtaact 35881 tgtcagataa ataaggaatc gccctgtcaa tcccttgctg ttgtgcatgg aaacgatatt 35941 ccattaacaa ctctggtaga taatagcctt tctttcctgc caaagccagc cgcacaaata 36001 aatcattgtc ttcgcaattt tgccaattag gctgcataaa tcctaattct tgcaaggtag 36061 aacggcgaaa taaggttgca ccaatttgaa agctttgatt cacaaagaca acttcctgta 36121 aattatcaac aacaccttct ggcaaattcg ccctacccca gcgacgagaa ttttcttgtg 36181 ttttcgtatc atctcggata ttgttaatat caatcaccca gtggtcagtt ccaacaaaat 36241 caatattggg gtttttatcc agaattgctg tggtacgtga taggaaatct gaagttaacc 36301 tgtcatcatc atcaaatttt ataaaatatt cccctgttgc agcatcaaag ccagagcgca 36361 tattattgct tttaccaata ttttgctgat gacggatgta tttgatgcgg ctatctgtat 36421 actgcgacat 
gagtttagct gtctcatcct gagaaccgtc atcacaaaca attaattcaa 36481 aatcttgata agtctgctgg agtacacttt caattgcgaa aggcagcaga ttaacacgat 36541 taaaagtcgg aatgcaaaca ctgactttag gcatttttgt aatagtaggg ttgacaaaaa 36601 ctaaacttaa gaaatagttc tttcaattct actcttaacc ttttgcatga atctatgggt 36661 tttaaccact atactcggtg gcttgagttg cttaggtcgt ttttctggtt ctttcagaaa 36721 tcgatagtat aaaaattcat ttttgtaacg aatatcaacg tcttcgcctt gacacaaccg 36781 agtaaaatca attgcaggat aattcatgta atgcaggcga tgaatcggtt ttaatccatc 36841 ttgattgtag aggacattat taatattgat aaaagggtcg gcgtcggcac aattaccagt 36901 tatgtcttga ccattggggc tgagggtaaa gttaaatagt gggcgatgat cgcatcggaa 36961 tgtcatgtaa ctaaataagt ctgcatcaca ccaccaagac cgctcattaa tccaggtaaa 37021 ctcctggttt accaccaaac gctctcttag catctctagc tcatccttgc caaaaattcc 37081 ctcttttgaa ccgaagaaac tagagcaatg taatagtggt ttgacctggg attctggcaa 37141 ggaaattgcc ttttcaatcc gagaaaaatt gagtgctgca acaggacttg acttgccatg 37201 ctcccagtca tcaaaaacaa agtcgtagtc gtctagcttt tctaaaaccc tatccagcga 37261 agtcattgcc aaactatcgg catcataaaa gacaaatcgt ttaaagttgc catcaaaggc 37321 acacattttc ctcaataaat tactttgttt gtaccatttt cgagagtgac cttgctttag 37381 ctgcctagcc tggggatggg cattccaaac ttgattgtaa aaatcttccc agcgttgcat 37441 tgagcttttg ttctcaaaca aggtgacatt aggtcgagaa cttacctcta gcttcactag 37501 ctctagtctt tcatcatagg gaataataca aatcggaata tttgggctaa cattcacttc 37561 aatgctattt agtaatgcca ctaattgatt ataaacaata tcattggcaa gtgtgtaaac 37621 accaaaatct tccatcatga aatatccttg gtttcttcac gtagtttatc aaaattttaa 37681 atttatcagt cgtcaaatga tgatataaat tgtgattaaa agtcatccta tatgtatagt 37741 aaaccgaaat gagttgtgaa aattcaaaaa ctaaaatacc cgacttttta ccagaagtcg 37801 ggtatcaagt ttgctcacaa atgatttcgg actgctatag caatcctata tgagttgtga 37861 gaatttatcg aaccgccaag acgccaagga cgccaagaaa cagaggagaa aatcttacga 37921 atgatttagg actgctatat atcaccgctt ttgttagtat aggactttta tagaactcct 37981 atttgatttt tacgaagcta ggtacacttt tattgttttt cccctgttaa gcgttccctg 38041 tttcctgtta agcgttccct attccctgtt aagcgttccc tataattagg tttatgaata 
38101 tcttaatgct atcttccaca tttccctatc ctcctacacg aggcggaacc caggtaagga 38161 catttcattt actcaagtat ctcagccaac atcatgttat tacgctggtg actcaacgag 38221 agcctgatgt gacagataca gaagtagtag aattacggaa ttgtgttgat aacttaattg 38281 tttttaatcg ccctcaagac acgagtaaat ctggaagaat actaaaaaaa atccagcgct 38341 tttattcttt tatacaacaa ggaacaccac caagtgtgtt aaaccgctac tcaagcgaga 38401 tgcaaacttg gattgataac tttgtggagg cggggaaatg tgatgttatt acctgtgaac 38461 atagcgtgaa tgaaatttat gtgcgagcgc attttcaaaa acatcttaga actatagtta 38521 atgttcatag ttcagtctac ggtacttgtc gcaaccagct aaaaacgggc atttcggaaa 38581 atagatttcg cgataaactc tatttaccac ttttgcgtcg ttatgaacaa cgctactgct 38641 ctaaattctc ggcaattgtg gtgacaacag aagaagatag ggttcagatg caagaattta 38701 acccaaacag cgaaattcaa gttattccca atggggtaga tttagtttct tttccctacc 38761 gcacttatga tccaaacgga catcgtatca ttttcattgg tgctatggat aatttagcaa 38821 acattgatgc tgtttgtttt tttagcaacg aaatattacc agaaattcaa aagctttacc 38881 ctgatactac ttttgatatt gttggttctc gtcctgcacc agaagtttta gcactcaaag 38941 aaaagcccgg aattactgtt accgggcgtg ttccttctat ggtagaatat ttacacaaag 39001 caacagtctg cgttgtgcct atgcggacgg gatttggtat taaaaataaa actttagagg 39061 cgatggctgc aggtgtgcca gttgtagcaa gtgagcgtgg tttagaagga ctggctgtag 39121 atggttcaaa tataccattg agggcattac gagcaaatca gcctacagaa tacgttacag 39181 ctattcgtca attatttgaa aattcgcaat tgcgagcaga attgtctcgt aacggcagac 39241 aattcgtaga aagtcaattc acttgggaaa gcgcaggtaa acgctatgag gaagtcctga 39301 caaatactgg attgtgattt tataccgttt cttaggactt gttatcggtc acggcgttcg 39361 gccaattacc ctcgataatg ttattatttt gaatctgatg tgtataaatt ggcataagaa 39421 cattatcgat tttgtttctt gtgactgtaa tccctccact tgcctgatcc agataaacac 39481 cggagatcgg atagcttgca gcccagggag agcgaacaac atggtgaata taattatctg 39541 aaatcaaggt attatactgg ttagacaaag tataaatacc cgcaccgtca tcaagggtat 39601 gcatgacgtt gtaaatttcg ttgttcctga tagtgttgct ttttaccctg tggatatcct 39661 tcgtccatcc ccagccaatg ctgatacctg tgtaagggag attccacagt ttattgtttt 39721 caacaatcat gttttcgaca aaaccagcga aaattcctac 
tgaatttgga tagtcctgac 39781 cgagggcggt aagggtattg ttgttgatta acacattgcg aatcgtcgct gttgcaggtt 39841 ccgagtgcaa aagggtatcc ataacaatgc cttgtccccc caggttctcg aaatgactgt 39901 ccctaatgga gaggttttca acatccgtag aagcattgac gcctgaagca ctcatattac 39961 ggaaggtgca tttctcaaag ctgacgtttc ccgcgtaaga cagttctacc cctgcgatca 40021 tcgtcgtatc gccccatacg tgctttgtgc taacttccgt accagcctgg ccaccgataa 40081 atcctttttg cgaaggtgct gtccagttgg tatcctgaaa aacgatgcct tgaaaaacaa 40141 tattatgtgc tggtttttca gcggttccgg caatttttag gatatgctca agacgcgggg 40201 cgataacagt tgttcgggac atgtcttcgc ctgggcgggg tatgtaatat aaggtgtgtg 40261 cttccttatc gagataaaac tctcccttgt cgtcaagaaa ttcataagca ttctcaaaat 40321 aataggagag ttctggttcc aggaattggg tattgcccat aaaagctaga tctctgcctg 40381 gattctgtag cgcgacagaa gcaatgttac cgtttattgt aaagtcttca actttaaatc 40441 tgttgatagt ccagtgtctg aaaactatca tttctgcttc gctgagtttc ttccatcgat 40501 tgatctcgct tgggtttact tccacccttt tattgggtat gtcccaatta acaacacgaa 40561 aataagtgcc agtgtttggt tgtcgggcgc ggatagctgg aacgccattg atgaacagcc 40621 gacggaaatc ctggttatcc accgtagcct gaaaaacctt tccgtcctga gtttgccatc 40681 ccgtaatcac cttaccgcct gaaagttcaa cttttccggg tgtttgggct cgaaaaatga 40741 tagaatgccc gttgaagcca ctgtcctgtg ttcctagggt aaaagtatct gacaggtaat 40801 aggtgccgtt tttcagaatg acggtaatat cggttttcat ggaggtattg atggaacgca 40861 catactgctg cgcacttgtc agggtgcagg gtgctgtgct tgtacaggca ttccccttac 40921 cgtcagggct gacatatagc gttcttctct gattagcaaa gttgttctgc tctgagttgg 40981 gggaaaccat gacaaaactc gacaactttg taaataagtt actcgtaaga ctagccaatc 41041 ctggctgtgt ttggctgctt attgagaaga atatcaacat aaacaaccca aaaatctgcc 41101 atgctggaac ggctgtcccg ttttgtctgg gaaaaaggaa acgcttaagc ttatgattct 41161 tcattcggta aaacttgccg agaaaacgaa catctgtagt tttcccttgg tgcaaaactt 41221 caaaaaactc ggcgaatcgc cttcatcacg agatgccttg ttttcagtcg gacaagtaat 41281 ataagaatcc gatttgattt ggtgaaaaaa tttaagtatc tgtagggtgt gttattgcgc 41341 aatacggttc agttagagcc aaaaccctta aattgcgtag gttggggagc gctagccatg 41401 cagaagggtt tccaacgcca 
gatgctccac ttggggagac cccaagaccg cactggctcc 41461 tccgtaggtg tctggcgttt gacgccagga aacccaacac aaaacgtttg ttgttgggtt 41521 gcgctacgct taacccaacc taccattacc ttaactgaac cgtattgtgt tattgcgtag 41581 cataacgcac caaagcctga tgatcatggt gagccaccag gcataacaca ccctactacg 41641 tactgtcgga caagtaatac tgatacccgc gttgcattca aagatagtag gtgggttgct 41701 tcatctcccc ccatttttaa atgctgtttc tgaacgcaga tttgtataaa aaaaacagaa 41761 ctatcaatgt agcgcttagc gaaaaaaata tttattgtca acgccctacg ttaggggtgt 41821 aagggtataa gggtataagg gtataagggt gtagggggaa ccccatacca caatagagtg 41881 caaaccgcta tggagtgcgt ggcggtctgg gcttgtaccc gctccgggtg aaaggactgt 41941 gcatttcccc tacaccccac acccccacaa cgccaggtgc taccctgcgg gaagccgcct 42001 ccggcgtcta caagtcggga aacccgaacg ccagatacct acggagggaa accctcctgc 42061 agtactggct ccccaacgcg ctggctcccc aagaccgcgc tggctcactt tctcaaaaag 42121 agtaaggtga acagtgcccg cgctagggaa taataagctt ttgaggctac aaaatcgcaa 42181 taattatggc gggtaagatc tcagttttaa cgctagctga accgtattga gttttaacag 42241 acttcggcta ttagactgtg tggttgtgca tagaatgaaa tagccccaga ctatcaccca 42301 caacttcagt actcggcact catgtctgat atcatgtcga aatcaagcca gctacctcaa 42361 gctaatcaac tgagtcagtt tagcagcttg gaactagccc aagccctgat ggagaagctg 42421 agcatatctc ctaacgattg gcatcgtctc aagtctaacc gcaatgttcg cgcaagtgaa 42481 caagtagcag cagctcttgt gtttctcctc aaagatgaac cacaagaagc tttactcaga 42541 ctccagcaag cagcgctttg gttggatcgt tctatttctg ctccaccttg tccaactcac 42601 ggcgagcaaa agagatagcc gaaggtgacg aggatgagcc agatgaggaa gtgttgtcct 42661 ccctcctctg gcttatccgt cttcaccgtc gcaaagcgag tcggtttcga ttttctatct 42721 gtttacaaag cctcaactct gtacctttac tgttccactg aacttggtca aaaatttgat 42781 gaagaataga cataccacga ccattttctg actcatcggc tggtaaatat tctgtaggat 42841 cgatatcaac gtcgctggta aaacatggtg taaaaccgct accttggtct gatatgaccc 42901 accaatactg attatcaatt aaggaaaagc gaacgacaac ccttttccca ggatctagat 42961 tgttaccatg tttggctgca tttaccagag cttcttgaag tcctagccgc agttctgctt 43021 gtaatttgcc cggaatttct gccaaaagca aatctagtat cggacagagg tacaaagttg 43081 
aggcaaaact aattgtgccc caattacgcc ctgtaggacg aagcgagata gtaatcacaa 43141 tagaaacccc gtagctttca gagttagcta gacatcagtt tgctgtcaca ggcaccctga 43201 tagaatttga ggtggttatg ttgccttcaa actttcgtct tcacaactaa attttgaaaa 43261 tgctagggag cattaactct cctttacgac atagagtcac ttacagaaaa aacggcttgt 43321 ttgcaactgt ttaaacagta agcaactgat atgtttttta gtcaaaagag taggttaata 43381 ctcttttacg ccgattcgca tcttagacaa ctcatttcat atttttggga gagtttttgc 43441 aacgtcaatc tctttagaca aaaggaatta tatatttctc ttaatagtat tatagcattg 43501 ttacgatacg ttggacgaca aatttttatt tcagttaagt tgcaaatgag ttagtacctt 43561 tgtccgattg cgatagcaca agaaatgcgg cagcttccac atgagcggtt tgcccaaaga 43621 aatcagccgg ttgtatccgt gtgagattat ataccccgtt ttcacaaagt aatttcaggt 43681 cacgagcaag ggtagcaact ttacaactga tataaacaat acgtgggggt tgaaatttga 43741 gtaaagtttc aataacagta cgttcgcatc ccttacgcgg tggatcaagc aaaacgatgt 43801 ctggcagaat ttcaatttgg ggaagtaatt tctcgacagc cccagtctga aaaatgacat 43861 ttgaaatatc attctgcttt gcatttaaaa tcgcttgttc tactgcctct ggttgcaatt 43921 ctaaccccat tgctttccta acttgtttgg caagtggcaa tgtcaaagtc ccaataccac 43981 aataagtatc cagtaatacc tcagaacctt gtaaattgag ttctgattga acgacctgca 44041 acaatgcttc tgccgtctct gtatgtactt gaaaaaatgt atccggtcgt acttggaatt 44101 ctagtccggc aaatttttct cttaagtaag aatgcccagc aatgcaatga gtttctcgtc 44161 caaaaatggc attcgtacgg tttgggttgc ggttcaaaca aacccccact aactggggat 44221 agcgttttaa ccattccagc gcttggtctt gaattccggg taaattccaa tccttaacca 44281 ccaaagttag taatatttcc cctgtgcggc gaccaatgcg taaactaaga tgacgtacaa 44341 cacctgtgtg gtgctgttcg ttgtaaactt cccaaccccg cttttggata tcctgtttaa 44401 cttccaacag caaaggattt aatcttggat cttgtacggg acattggttc aagttaatca 44461 attggtggct acctttttgg tagtaaccag cttgaaccgt aaaagtggca gataatccca 44521 caggataagt cgctttgtta cggtagccca aacattcagg agtcgaaagc actggatcta 44581 ctggtggttg gacaaaacca ccaatacgtt ccaaagcctg gataacttga ttttgctttg 44641 ctagtaactg gtattgataa tcaatatgct gccattgaca accaccacat ttatcagcca 44701 cgatacagct aggtcggacg cggtagggtg atgatttcaa aagttgtttg 
agtttaccgt 44761 gagcgtagtt aggtttaacg tgtaccaaaa agacaacagc gcgatcgccc ggtacagtgt 44821 cgcttacaaa tacgaccctt tcttcccaac gccctacacc atcacctgca tcattcaaat 44881 ccactatctc aacgtcaatt aactcacctt gtttccaatt ttttttagtc attagtcatt 44941 acttattagc aagcagtgcg cctttggagg ttctggaggt agcgcacggg ggatcccccc 45001 tgagtgtgca acttaccgtt cattaccagc cagtgcaagg cggtgaaacc tcccttgggg 45061 catagtttag accacgggcg tcagccgccc aacaactctt ggtacggaag tgtcttccgc 45121 tggttcccga ttacattgtt tccttgtcct ctcttcactt gtgtcctcaa gactcgttaa 45181 actagcaaat aataattaat tcatgtcatt gcaataatca tgactgtagt aagccaagtt 45241 attctcaaag ccgacgacga actgcgttat cctagcagtg gcgaactcaa caatatcaaa 45301 gactatttgc aaaccggcga gcaacggata cgaattgtct caaccctagc cgaaaatgaa 45361 aaaaagatag ttcaagaagc tactaaacag ctttggcaga agcgtcctga ctttatcgca 45421 cctgggggca atgcatacgg tgacaagcaa cgggctttat gtgttcgtga ctacggatgg 45481 tatttgcgcc taatcactta cggtatacta gctggtgaca aacaaccaat tgaagatatt 45541 ggtttaattg gcgtcaggga aatgtacaat tcccttggcg ttcccgtacc tggaatggta 45601 gaagcaatca actgtctgaa aaaagcctcc cttaacttac tgaatgcaga agacgcggct 45661 gaagcagccc cctactttga ttacatcatc caagcgatgt cgtaagacaa agttagtgag 45721 acttctttga gtggacttgc agactacaaa gtgagaatgc agtcaactca ttctcaatct 45781 caataatacc cacatactga tcatactctc cccacactgg atagtggcta cggcacaatt 45841 tgtgtcaagt tactaacaag ttgtggggaa atttcatttg atagtcaaga gtcatttgtt 45901 actattttta tttctctatt tcctctttta tacaaaaagc cgacagtttt cttgtgaaag 45961 ctgtcggcgt ttagtgtgtt ccctattaat gactcggcta gagttcagtt attagtatta 46021 tgtctattgg cttatataaa acataggatc ttaatcatca gcaccagtga ttctcatcac 46081 aaaactattt ctaataaact gcatagatac ataaatacgc ataatttcat ctttttcata 46141 actcaaaaat taagatacaa ccagcatagc cttttaccta agaaggagca gtttctggat 46201 aatcagaaac tgtattaatt tcgataaatc attttataca cataatagtt attttgggag 46261 catcccaaat gtgtaacttg cacttcagac tggaagccaa gctagtgcag cgcggtagaa 46321 ataactaaac agttataaat gacaaaaagt agatgctata aggattacag ccttatctgg 46381 aggaatggct ttgcccgaca gcataggtgc 
ttttttgtat aagtacaaca aggaaaaagt 46441 gacaaagaaa caacacttat attgtaatat ttttactggt ttcaagtggt agctttcttt 46501 ccgcgtccaa cgcattagta cattggggag ccagtcctac caagaaatgg ggttttctgt 46561 tgtatgtgaa aaatgttaga ggaactttca acagagaaag attcccctaa catttttata 46621 caattccttt tgtgaaaaga gcagattaaa ccacaaattc aatcaattgg gtacgtggga 46681 caattgtgtt gagtagtagt gagcacacta caaaaatcaa attttacaaa ctaatacaag 46741 tatgattgcc catgaaaaaa ccgatgattg taacaatttc gaacttgtta aatcatcact 46801 ttagatacgc gtattgtata tgggaaaact tgtttaccat acttgactta gttaatatta 46861 atggtatgaa aggatttctc ttcttcaaat agtgaaaaca cttatttgat ttttctgaaa 46921 cttaaagcct ctataaaaat ttacgtgttt atacagttgc tcgacaatct ctaccataga 46981 agaataatcg gctaaggtag taggaatacc aatttgtcac gtcttgcaag agacttcagt 47041 aagaatagcg atcgccggag gcgcagctag cttaacgcag cagaacaaag cacgcagcat 47101 ctgacttgag aaaatagaaa accatttgcg tgaataccta cagggcactt aaatcgctaa 47161 aataatgctc tgtccccttg ctgcattatt gccatgatcc ccatttcctc tgctcctttt 47221 tctcctgaag aaatcgcggc tgttggtttg aagctagaag aatatgaaga aatcgtgatg 47281 cgtctggggc gtcatcccaa caaagcggaa ctgggaatgt ttggggtgat gtggtcagaa 47341 cattgctgtt acaagaattc ccgtccgtta ctcaaacagt ttcccacgac aggaccccgc 47401 attcttgttg gacctggcga aaatgctggt gttgtagatt tgggcgatgg tctgcgactt 47461 gcgtttaaga ttgagtcaca taaccacccc tcagcggtcg aaccttttca aggagctgcg 47521 acaggagtag gaggtatctt aagagatatt tttacaatgg gtgcgcgtcc cattgcggtg 47581 ttaaattctc tgcgctttgg ttctctggac gatgcgaaaa cccaaaggct ttttagtggt 47641 gtggttgctg gtatctccca ttacggaaat tgcgtaggtg tccccactgt cggcggtgaa 47701 gtttactttg atcccgctta ttctgacaac cctttggtga atgtgatggc gctgggattg 47761 atggaaacac cagaaattgt gaaatctggc gcgtctggta taggtaaccc cgttctttat 47821 gttggttcta cgaccggacg tgatggtatg ggaggtgcaa gttttgccag tgcagaactc 47881 agtgatgagt ctatggatga tcgtccagcg gtacaagtgg gtgatccttt tttggagaag 47941 tcgctgattg aagcttgctt ggaggcgttt aaaacaggtg cagtcgtcgc cgcacaggat 48001 atgggggctg ctggtattac ctgttctacc tcagaaatgg ctgcaaaagg cgatgtgggg 48061 attgaatttg 
atttagataa gattcctgta cgcgaagcag gaatggttcc gtacgaatat 48121 ctgctttcgg aatctcaaga acggatgctg tttgttgcgc acaagggacg cgaacaagag 48181 ttaattgaca ttttccatcg ttggggactc catgcagttg ttgcaggttc tgtcattgct 48241 gaacccattg tccggatttt attccagggt gaagttgctg cagaaattcc tgcaacggcg 48301 ttggcagaaa atacgcccct atatcaccgg gaattgttgg cccaaccacc ggaatatgtg 48361 cgaaaagctt gggaatggac tcctgattct ctgcctaaga gtacatttgc tggcattgaa 48421 attcaaggac gcttgcaaag ttggaatgac atcctgttga ctttgctgaa tactcctaca 48481 atcgcatcaa aacgttgggt atatcgacag tatgaccacc aggtacagaa taataccgtt 48541 ctgctaccag gtggtgcaga tgcttctgtg attcgcttac gtccacagga agtagaggtt 48601 caggggaaag tttccaatac ccaaagcggc gtagcggcta cggtagactg caatccccgt 48661 tatgtttatc ttgatcccta tgaaggagcg aaggcagttg tcgcagaagc agcacgaaat 48721 cttagctgtg tgggtgcaga acctttagca gttactgata acctgaattt tggtagtcca 48781 gaaaaaccta ttggttactg gcaattagca gaagcttgtc gcggtttggc tgaaggttgt 48841 cgagaaatga caactccagt gactggcgga aatgtctctc tttacaacga aactctcgat 48901 tctcaaggta agccacaacc aatctatcct acccctgttg tgggtatggt aggcttaatc 48961 cctgatttga ccaagatttg tggtcaagct tggcaagtga atggtgatgt aatttatttg 49021 ttgggtgaac tttcttccca ctctacgcta ggaggttcgg aatatttagc cactatccac 49081 aacactgttg ctggaatacc gccacgagtt gattttgaat tggaacgcca agtacaacaa 49141 gtctgtcgcg agggaattcg caaaggttgg gtacgttcgg ctcatgattg tgctgagggt 49201 ggagtcgccg ttgctctggc agaatgttgt ataacgggca aatttggtgc tgatatccaa 49261 ttagaattac cagcagacaa caaccaacgt tgggatgaag ttctttttgg tgaaggtggc 49321 gcacggatta tagtctctgt tggaatagaa caacaagaaa cttgggagag tcttttaaga 49381 gaacaactag gtgatcattg gcaaaaactt ggtacggttg ggaattccga aataggtttg 49441 cgggttttaa caactggtaa tcatacttta ataaaagtta caattgagga aatgagcgat 49501 cgctacctaa acgcgattga aaggcgtttg aacatccata acaccactcc gaattcatag 49561 atagcatctc ctcacccaag agactagcgt tcataggcat aattgcgata ctgtgttaat 49621 ggatggttaa gaaattttaa gatttgtctg caactatcct tagatatatt ttttgcgact 49681 tgacatcctc atcaggagca cacctgccat gatttcctgc caacccgact ctttggatga 
49741 ctctcaattg cccccgaact cagccaacga ccatgaaaat cgtccagata agccagaaga 49801 agcttgcggt gtttttggtg tttatgcacc aggagaagac gttgccaaac taacgtactt 49861 tggattgtac gccctccagc accggggtca agaatcggct ggtattgcca catttgaggg 49921 tgaacaagtc aacttacaca aagacatggg gttggtgtcc caagtcttta atgaatctac 49981 cttgcgaaat ttgcctggaa ctttagctgt tggtcacact cgttattcaa ccactggttc 50041 tagccgcaaa gtgaatgctc aacctgctgt tgtcgaaact cgcttgggtt ctgtcgcgct 50101 ggcacataat ggtaatttag tcaatacagt ggcgctacgc gaggagttgt tgaagaacaa 50161 ctgcaactta gttagcagca ctgactcaga aatgattgct tttgccattg ctgaagaaat 50221 caacgctggt gcagattggc aagagggctg catgcgggct tttcaccgtt gtcagggagc 50281 ttttagttta gttattggta catcaaaggg tattatgggg gttcgcgacc ccaatggcat 50341 tcgtcccctt gtgattggta ccttgggtga taatccagtt cgctatgttc tctcctctga 50401 aacttgtggt ttagacatca ttggtgcgca atacctgcga gatgtggaac caggtgagct 50461 agtttggatt actgaagagg gtttaacttc ctttcaatgg agtcaaaagt ccaagagaaa 50521 gctgtgcatt tttgagatga tttactttgc tcgtcctgat agcgtcatgc atgacgagag 50581 tttgtacagc tatcggatgc gtataggacg gcgactggct aaagaatctc cagctgatgc 50641 tgatttggtg attggtgtcc ctgattctgg aattcccgct gctataggct actctcaagc 50701 ttctggcatt ccttatgctg aaggattgat taagaatcgt tacgttggac gtacttttat 50761 ccagccgaca cagagtatgc gagagtcggg tatccgcatg aaactcaacc ccctcaaaga 50821 tgtgttattt ggtaaacgag tcattattgt agatgattcc attgtgcggg gaactactag 50881 ccgtaagtta gttaaaacct tgcgtgaagc aggtgcactg gaagtgcata tgcgaatttc 50941 ttctccacca gtgactcacc cctgctttta tggtattgat actgataccc aagaccaact 51001 cattgctgct accagttcag tagaagatat tgccaagtta ctagaagtag atagtctggc 51061 ctatctcacc agagaaggaa tgctacagtc aacacgagaa gatccagaaa gcttctgttc 51121 cgcttgcttc acaggagact atccagtttc tgtccctgag caagtgaagc gttctaaatt 51181 gatattggag aaagtatctg tttaggcatt ttagattttg aaccgcagag aaattttatc 51241 tgcggtttag ctttgttgca aaaattcaat ttttggtaat aatgggtcag aaacaatacc 51301 cacgccacta aaaccccaac gtttgagcaa acctcctact tgagaaccag gaacaaactc 51361 tacagcaaag cggtttgcca cctcaccagc atgagtattg 
aggtagctga gggcatcaaa 51421 atgttcgcga aacaacagta aataaccatt agcttcggcg ttgggacgag cggtaagata 51481 atcaccgtta gctttcgacc gtacaagata atagacttca gaaaacatag ttatcagtta 51541 ttagtcacta gtcattagtc atagctatga ctaatgacta aggacagtga atacgagtgt 51601 accgtcgctt gaatttaaaa tgagttaatt gagattttct aagcgaatgc gaggatcact 51661 cgccttgagt agcaaatccg caatcaaatt accaacactg agcaacacag cgctcataac 51721 caaacttgcc atcaccaaat acaaatcttg agcctgcacc gcttgtaaag tcaaccttcc 51781 taaaccaggc caattgaaga aaaattccgc aataaatgca ccacccaata aacttgctaa 51841 ctcgaaacct aataaagtaa tcaaggggtt gattgcatta cgcagagcat gaacatagat 51901 aacacggttt tctggcagtc ctttggcacg agctgtttga atgtaatctt gacgcaggac 51961 atccaataat tcaccacgag taatacgctg taagccagca aagctggtga cacttaaagc 52021 aatagtgggt aaaatcatgt gccagccaat atcgagaatt cgaccaaacc agttgagttc 52081 agtgtgatta atgctagtca tgtcacccac tggaaagatt ggtgaggcgt tttgggcaaa 52141 aatcagcaac aatagggcgg tgataaaact gggaaaccct tgtccagcat agctaattac 52201 ctgcaaaatc cggtcaaccc actgattttg ttttacagct gcgactatac ccaaagggat 52261 ggcgatcgcc caagttacaa tcaaagaaga cactgccaac aacaaagtgg caggtatccg 52321 ttcccacaac aacgacgcca ccgaccgttg ataaacaaaa cttgtgccaa aatccccctg 52381 tgttatgatt cgccacaccc aaagtccaaa ttgctctggc caagatttat ccagaccaaa 52441 ctgtcgtctg agttcctcaa ttctttctgg ggaaatcttt gggttttgcc gtagcgtatc 52501 tacataatcc cctggagcaa gttgaataat gataaacgac aacgccgatg ctaaaaataa 52561 agtcaacagc gcctgcaata cccgcttgat gacataaata aaactctcac tggtgactaa 52621 cttcaaaagc cagtcacgac ttgcatccaa ggaaattcgt gtagtagtca ttagtcaagg 52681 gtcaagggtc aacagtcaag ggtcaagagt taatagtcaa aaatattttg gaatctagac 52741 tattaactta gtagtacatt aaggcggaag tcatctgttt tgttagaagg acacaaaaac 52801 tttatgtgtt aagttttgcg ccttcagcat agctcccaat tgagcctcct gccttcttgt 52861 actagtattc tacaagggtg aggatgggca catctggcag atgttttcgt ccttgtaaat 52921 cctgtagctc gatgataaac ccaaaacctt caagtttgca gccaattttt tcgactaatt 52981 ttgctgttgc acctgcagtt ccaccagttg cgagcaagtc gtctataatc aaaactcggc 53041 ttcctggatg caaagcgtct 
tggtggactt ctaaaccgtc agtaccgtat tccagtgcgt 53101 actcaattga gtgaactgct gctggtaact taccaggttt acggacagga ataaagccag 53161 ttcctaactg ataagcaagc ggtgcgccaa caataaagcc ccgtgactct attcccacca 53221 cgtattctgg tctcagtcca gcatctttta ccttgtgtgc aaacaggtca atcgtgtagc 53281 gcagtccctc tggatcgcgc agtagcgtag tgatatcgcg aaataaaatt cccggtttag 53341 gaaaatcagg aatgtcacga atcagagact ttaaattcat aaaatagaga aaaagtttta 53401 gttccagcag tacaacctac aaccactatc gcccgattgc aagctctcgg ttcaacagga 53461 ggtatgtacc ggaaaaatcg tacactaaaa tgtcatacgc ttggtatgag acctcgaaaa 53521 agtctatcca ttgacgatta ttggaatcta caccaagcta aggatataac gatattaacg 53581 agtgagctaa gaaaacttaa agtttataaa ctttaatcaa aagcaatgcg gaaacctttg 53641 tgtcaaaaga gtaagtaatg tatgtatcag gcaaacttgc taataatgcg atcagtctga 53701 tttttttgat tggtcatcta ttttaaatta gttttactca gtttgttgca accgttaatg 53761 gtatctataa tgaatatttc tgctagtatc acaccaatta acagtccaac tccccagtct 53821 gtgccgatga ttctggactc tttacccgaa cctgtggttg aaggacaggg atgtccccgt 53881 agagctcggt tgcaaataga tcttatacta ctggcaattg aagctttaga gcttggtggc 53941 tctgaagcta ttttggcatt tgctgaagag ttggatctta aaggaattgt taaaaaccga 54001 gtcaatttat ggcgaatgcg tagcaccaac cccatgcgac gagcacacat ccgccgtcct 54061 ctgtctatca tggaagccaa agctttggtg gtcattggtt gctacattgc gcgacggtta 54121 actgttgtca ttcgccagtt gctggcgata catcaacaaa tggttgaaaa acagcttcca 54181 ctggaacaga atttacgtct atctaactat ctagagcgct ttagaagcca ctttaagagc 54241 cgaatgaatt ctcgacgttc tggtgtttta gccttaaatt ccgatgaaaa attagatgag 54301 ctagctataa gtttgttgga acaattatta ttttgtactg gtacagctgg aatgcagcga 54361 ttctggatta gtctttttga tggtgaggtg gaatgaatat tcaacgtaag tatagtttac 54421 ctaattgcac actactgcta gagggtttaa gtgatgcctc aagagcggca cattttcagg 54481 aactgcgccc agaattatct atattggtaa atgcagaatg ctatctatct ggctacacta 54541 gaccactgag tggagggcga gaattttttg aaagtttggt taggggagtc agtgcatatg 54601 cccaggaatt tctgagtagt gtgccacatg ccgaagcgca taactctgag tctgagttag 54661 tgcagttcca aaaaattggc aataaccaac atagattaat tgtgcattcc gatgcagcag 54721 
atgaaatgga gtcttactcc aacaatggaa accggcgaat acaagtcgat ttgaacacgg 54781 tgcagttttt cgatttagtg gaagcaatag accagttttt cgcagacagc caaactttac 54841 cagaattaac attacaatta caaccagttg ccagacgtca tggcggtgtc agtcaagctg 54901 tgatcaagca ggcagtgcct gcgagtgtgg gagtgacgag tttggcagca gcagcagttg 54961 cttttaccat gattccaact ccgcaagttc gttcgccgca actgaacaca cagcaagatg 55021 tcaatagaac tactaatgtt aatagtcctg caacagcatc tatcactcct actccaactg 55081 ctaatgaaca gatagccgca aatcctagat tcacacccac aacagcttct tcctcagtag 55141 tattaaattc tactgcagca actcccacac cgggtgcagc gcctgttgtc aaggatttag 55201 aagtactttt aaatacatct ggagaaatta ccgaagcatc tcaactgcgt gccttgaacc 55261 gtgaattata taaccaaatt aatccacgtt gggcaaagcg ctcaggactg aatgaggatt 55321 tagtctttcg tgtgggagtg ggtgcagatg gtggaattgt tggatacaaa gcagtcaata 55381 agcaagcaaa tgatgcagtg gagaacactc ctttagcaaa cctactttct aatccggcaa 55441 accgcacttc tagtggtaat gaagcgatcg cccaatatag agtagtcttt acaaaagcag 55501 gcgtgctaca ggttagccct tggcggggat acaccaagac accagatgtt gttggtacaa 55561 aaattacaga ccccaacgca gttaaggatt tgaaccaaaa gctttataac actatccgtc 55621 aaaattggag tacttcttct gcctttgcac gagatttgag ataccgagta gcggttacta 55681 aagatggcgt tattgctgac tacgaaccac tcaaccaacc agccgttgac tattatcggg 55741 caacacccct tcctaagatg tttcaagatg tctacggctc aaacttggct cctccaaata 55801 ataaagaacc tctagcacat taccaggtga tgtttaagcc tgacggtagc ctggaggttc 55861 ttccttggcg aggatatcgg tagattgtta ttaggagtca ggaacgagga aataattttc 55921 tctccaaaga ttggtaaatt ctctgttctg actccattta ttttgtaatt ttgcccatgt 55981 aatttcccca taactcagca gcttgagcgc tactgccaga tgtgggggca ttattgtcgt 56041 tgcccaacca aataccagtt gctagccgcc gactaggaat aaagccaata aaccacaagt 56101 caacgttttt atctgttgta ccagttttac ctgcttcccc cagtccgata gcagcactgc 56161 gtccagtacc gtttgtaaca acaccgcgca aaagacgaat cattgtatca gcaatgttga 56221 ttggcagtac tcgtttgttg gcttcgggat cttggtcgta ggagtacatc acacggcaag 56281 tttttaagtc gtttcggtcg cgacaatcac tgctatctag aatccgacta attgcatgtg 56341 gacgattcca cacacctcca ttaccaatag cggcaaatgc acctgtcatt 
tccatgacat 56401 ttacgacgct ttgacctaac actaaaccag gaactgggtc tagatctgac ttcactccta 56461 atcgtttcgc cgttgtcacg attttatcta gccctacttc tttggcaatc ctcaatgcaa 56521 cagggttttc agataatgct agccctgtgg cgagatccaa gctacctgcg cttgtacgac 56581 atggtttgta ggtaaagcct ttccaacgaa aaggggagca ggaatatgaa tttccaggtg 56641 aaatgccttg ctcaatagca gttgtgtagg taaaaacttt gaaggtggaa cctggttgtc 56701 tttttgcctg aaccgcacga ttaaactggc ttgatttata atccttgcct ccgatcattg 56761 caagaatagc tccattgctg gaatcaagtg tcagcattgc tccttgggaa aaaccaaaag 56821 atgagcctgc gttgctaata tgtttacgca atgcttcttc cgcttgtttt tgaattgctg 56881 gatctagctg agtttcaata ataaagttcc cttctgttgc cagttctttt cccaaaatgg 56941 cttctagttc ttgaaacaca taactgtaaa aataaggagc agtcgttttt aactgttggt 57001 cgcaaacttt ggagctgatt tcaatgggcg atcgcctcgc tcggttatac tcatcttgtt 57061 tgattttgcc catctctagc atccgcttca atacacgatt gcggtactca atggttgtga 57121 gcttattctg gctatctcgt ccacaaaaat cgaagccgtt aggagcaggt aaaattgcca 57181 ccagcgttgc tgcctctgac agagttaact cttttgctga cttgtcaaag taatatcgag 57241 aagcatcctc gaaaccagag gtatctgctc ctaagaaaac ccgatttaaa taagtaagta 57301 aaacataatc tttgctgtaa aacgtttcta atttcagggc gacaaccgct tctcggaatt 57361 tacgaccaaa agaatcctgt ctgccaacat aatcgcggaa caaactcctc gcaacttgct 57421 gggtaactgt actagctcct tgctgaatat caccgctctt gctatttatc aatacggctc 57481 gtaaaatccc tagcgggtca actccaaagt gccagtaata acggctatct tcagaagcaa 57541 cgactgcggt gggcaagaaa ggaccaaagt ccttcaactg cttcatgtct atatgagaag 57601 tcgtccgagg ctcacgtaac ttggtgactc catcacgagc gtaaacaacc actggtgctc 57661 gcgtcgctcc aggtaaaggt ttgacagaaa ctttcagcga ttctacgcca ataaccagag 57721 ccatgagagc ggtggctcca cccaccccgt aagctgccca agttgctact ctgagatacc 57781 aaggaggtgg atcgatgtat tgcagtctga ctgaggcggc gagttctgga ggtccgagag 57841 tgagaatatc gccgtgacgt aactctaaag aattgacgcg acgcttaccc cgataaatgc 57901 catttgtgga gttttcatct ttaatgacga aaacagattc tcttttcgca gaatctcgtg 57961 aaagagacag gtgaatttgg ctgacaactg ggttacggac aacaatatcg caggatttgg 58021 aactgcgacc tagcatgtag cgatcgccca 
acaacggata cacctccgcc ttatccgccc 58081 ctgcatcctg caccaatagc tctggtacct tggcgtttgg tttgagtgct agtttcgaga 58141 aatcgactct cgcttgaatt gtttgtactg cttgagttat ttgaccaagc aaagtttgtg 58201 gcttttgagg cggttggggg gaactcatcg gctatttaca ccacatttat tggtaaggaa 58261 tcgggcgcat tgcaatctcg atgctagtat tccaccattg taatcataag cctacgaaga 58321 cgttggtggc atgaaatggt tcccgattca atttgatact tcaaaaaaat atatacaata 58381 actagacttc tgttgttaat tttgtcacat ttttaataga atcgcaggaa ttttcttctt 58441 gctttgtagt ggttattttg tctactttac tctgtttccc aaatgtgatg ttgagacttg 58501 cgatatagac ttttgttacc aaacatttta taaaagtaaa tagaaaaacc ctattttgtt 58561 aacatataag aattttaaaa gcaagaatac atatcttctt ttaggatgat tttttaaaag 58621 ctcatatttt ggtttaatga gcggaagtac tccgttgagg tgttgaggtt aagtaatatg 58681 aacggaaaat atttgaccct tatcggtgca gcactaactc tagctttaac cagcaacgtt 58741 gctgttgccg agtcagccaa attcctgata ggtcaatcta gcggaggttc atcctctgga 58801 tcatcggggg aaatcaaact ttcaccaagg gtaaataaga aaaagttttg caacgattac 58861 cctttgaact cccgctgtca agaaggctca gcatctacat caagcccttc tgatagcagt 58921 actgaaacaa aaaagaaacc ttctaagagt acccctggcg gtacggcaac accaggttta 58981 gcacctgctg accccggcac taccaacact cccgggggta caaatgaaat cgccccaccc 59041 cctgctggta accctgctaa tcctggcact acaactccct ctggttctcc agggtctggt 59101 acgcgaaaat agcccagcac aggcggagta aagaagtagt cttacgcttg gctaagtgtc 59161 gtctcttaaa taatcaatgc cctcagacga gagtctagcg ttaaaaacag aaagtccgtt 59221 catacggact ttttattttt tggtttcacc aaagaagttg tcaacagttt aaagccggga 59281 gtcaacttgg gattttgttg tccccactca ttgcattcgg aattcggaat tcggaattct 59341 ttgacttgtt atcgtgatct acgatgcgaa gcgggagtct tttgaaaggt agcaatcagt 59401 aggcaaataa taccaatatt aataaagaca tctgctaagt taaaaactgg aaagttaatg 59461 agtcgaaaat ccagaaaatc aaccacgtag ccatgaacaa atcggtcaat gccattaccg 59521 atcgctcccc ctaaaattaa gccatagcct agttgatccc aaaaatgcaa cactgtaaac 59581 catgccagtg ctatcaatgc taaactcact cctaaagata gccagcgtaa ccactctacc 59641 ttccctgtta acaaactaaa ggctgcacca gtgttggtaa cgtaggtaaa gtgaaagatt 59701 cccggtaaca 
gggcttgtgt ctgtcctgag ttaaaggttt gcaccaccca gtactttgtc 59761 agttgatcta agagaaaagc taaaaaggca gcaatccaga agagaagatt ttttacactc 59821 atggtgaatt agtcattact cattagtcac tagtcattag tgagccagcg cagcaggagg 59881 atctcccgct ctcacggcga ctggcgtaag cgcaagcgca cgcccagagg gcgaacgcga 59941 acgcgagtac gtgcgctttg cgctcagcgt ctccgttcgc cgcaggcgtc gtgcaggcat 60001 aggagatacc cgtaagggtc attagtcatt agtcaaagaa ctctggacta ctgactgttg 60061 actgtggact catgactagt aaaacatcag gtggcgcagc acgtatgcta ttacgacgac 60121 agcacaaaca acggctagtt gtccagggag tgcaaaccag gagtatctaa gcatagcttg 60181 cattaaagac agagtttctg ttcctttcca ttgaaaaaca tagctgagta tcaaataagt 60241 aataccgcac aagtggacag ttaacaagcc acaaaagcaa ctaaaggcca gagtttctag 60301 tcgaggtcta gctttaaatg cgaaaaagcc acaaatccac gctccaggga taaaaccgag 60361 caaataacca aactgggata ctttaacata accaatgcca ccaccatcag caaatactgg 60421 taacaatgtt aaccccatta ctaaatacgc aatttgcgat agtgcaccag catttttgcc 60481 tcctaaacaa ccggctagca gcaccccgcc aatttgatag gtgacaccca gagaaaaagc 60541 atgaattcca tattgactcc aactcaacgg tgaggtggcg acataagctt ctaaaaaggt 60601 accacccatc gttagcaata agccaatcat agaccatagt aattgattgg atgcagctag 60661 catttacaaa gtactcatgc aaaggactct cgtacaaaca tgtcaatcaa ttaagtatta 60721 ttttattctt ttatttttgt attatcttg // LOCUS NODE_308_length_60555_cov_5.02833160555 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 60555) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 60555) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..60555 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..191) /locus_tag="DP116_00945" CDS complement(<1..191) /locus_tag="DP116_00945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111534.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00945" /translation="MGLGERVNRVIKANLNNIAGNFNQTEGVMFLAGGSVAGAGISAT VGNMGLAGMGTAVGIYTPH" gene 492..2672 /locus_tag="DP116_00950" CDS 492..2672 /locus_tag="DP116_00950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314752.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sodium:proton antiporter" /protein_id="PRJNA477356:DP116_00950" /translation="MQTVVLVLVEVLIVIGLSRLVGLAFRWINQPLVIGEIVAGIMLG PSLFGLVAPGLASTLFPPETIPFLNVLSQVGLIFFMFLIGLELNPKYISGNLEIAVLT SHVSILVPFSLGTLLAVLLYPLVSNASVSFTAFALFLGAAMSITAFPVLARIITENNL QRTRLGTLALTCAAVDDVTAWCLLAVAIAVARTGSIIGAFPTIIASLVYIGLMMTVGR TFLKRLATYYRRTGRLSQLVLALIYMAVVASALITELIGIHLIFGAFLLGAAMPKNAG LVREIAIKTEDFVLIFLLPVFFAYSGLRTEIGLLNKPELWALCLAVLAVAIIGKYVGT YVAARISGIEKREASALGWLMNTRGLTELIVLNIGLSLGVISPLLFTMLVIMALVTTF MTSPLLERTYPKKLIKLDVVDQEPEEIKADTPDGEDFYNRPYRILVPVANPNSQKGLV QLAAAIAINYQQPAVVYPLSLIEFQEDYAFENTPQEANKVIAQRRQQLEELISRLEPL EARSYMHPIVCTSSNVARETARIALLEQANLVLVGWHRPAFSKNRLGGRVGQILTNAP VDVAVFIDRGQERLERLLVPYSANIHDDLALILALRLLINRDTCRLLVLQVVAENQVK NEHSYELDTVMEQLPQSVRDRIDIKIVEAIEPIQAVVAASETVDLTIAGTSRAWGIER QTLGRYTDQLAIECRSSLLITRRHSQISSHITSVLVNEKLEVKR" gene 2824..3303 /locus_tag="DP116_00955" CDS 2824..3303 /locus_tag="DP116_00955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314753.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_00955" /translation="MAISSVLLLFKVVTAYGENSIKAPPAINSNYRLVLSEKLPVCEK SDFLLLNLQQSGIYLNGFLLPANSTKTSALSEKHSLAGKLQNQQLSLSGEVTRHILCN TPNSQTQANIIPVKIQIKLAEKGDLIGKISVSDSSKTIGLTATPQKTNEQSEQPKNH" gene 3688..4809 /locus_tag="DP116_00960" CDS 3688..4809 /locus_tag="DP116_00960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129769.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AMIN domain-containing protein" /protein_id="PRJNA477356:DP116_00960" /translation="MNKELKNKQFFRFCKYLFRVSLFALCTGVLYSKPLSVYGFDVGN SSSVAPLARLEDWRFYPEASQLEISLSAAAQPRYFYLAQPPRIVIDLPGTKLGKIPTK QNFYGAIRSIRVSQLNADVTRIVMDLAPGILIAPTRVQLQPVSRENPTRWVLRPLVVS NGTSLPGNFSSTTISPPLPSNTPDIYNPPQSPGNLPPSIYNNPPQPPGNLPPSIYNNP PQLPGNLPPSVYNNPPQPFSNSPVGVYNPPQPPSNLPPATYNNLPQLPGNPPSTTYNQ QQVPDFSVPPPLTTLPTNNFSSAPSVQVPPLTPNNSSQLPGSSFSPPSLPYQGSYTNN SAPSLGTSNFPIPNLPTGSRSSPNSKVIEFGEPFPNSSR" gene complement(4892..5431) /locus_tag="DP116_00965" CDS complement(4892..5431) /locus_tag="DP116_00965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321344.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3172 domain-containing protein" /protein_id="PRJNA477356:DP116_00965" /translation="MRRKSTGRTVTTPKPSIFQSPMFNFTTMAILGGVFVLGIGIGIA FSSTATFSTSNVASREFIDTKAPNADICVQYGASAMVMDTRLFVTLNPFKVYIAQPSM RPGCVLRSSNMTILEQKKLVTSQQLRECKNRLNNFAYAGDLNGDKPDISCTYENDDAK NFFINQPGATAPGIETERF" gene 5720..6547 /locus_tag="DP116_00970" CDS 5720..6547 /locus_tag="DP116_00970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310544.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_00970" /translation="MVHLQSRSFGKSFKISQFQTLIRDSLTIFWGDWLELRTKLVQII SSGLISPLIYILAFGFGLGSSIKPGSGLSGDYNNYLEFILPGMVALSSMTVSFVGTTF SICGDRLFTKNFEELLLVPVHPLALHIGKMLAGVTRGLMTSLGVILVALVFTRNWNFL NPLFLLILVLSCAVFAGLGVIVGLTVKSLESVGLYNNFIIIPMSFLGATFFDPGTLPT ALKFVVYLLPLTYASIGLRAAACLPLSQFPWYCLPILLVMAIALSLWGAYVFSHQQD" gene complement(6598..7074) /locus_tag="DP116_00975" CDS complement(6598..7074) /locus_tag="DP116_00975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215910.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase" /protein_id="PRJNA477356:DP116_00975" /translation="MALITTGNALIRDLEKFGALGVYVPLEGGFEGRYRRRLRAAGYV TLDLSAKGLGDIAAYLTTVHGVRPPHLGKKSTSTGAAVGYVYYVPPIVSSQLEHLPPK SKGLVLWIIEGQILCDQEVEFLAALPSLEPRVKVVIERGGDRAFRWTPLQKTLLAS" gene complement(7173..8309) /locus_tag="DP116_00980" CDS complement(7173..8309) /locus_tag="DP116_00980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="PRJNA477356:DP116_00980" /translation="MQTAKKIIIIGGGIGGAATALALHRAGLEPVVYERTKELQEVGA GIALWANATHILRNLDLLEDAIHVGYLTTNYQFNSQSGKELVNIAVDGFELPVIGIHR AELHQLLWRNVQEKFVLEQTFKRLEQQEEKIRAHFSSGLSVEGDALIGADGLRSRVRA ALLGEQPPIYRNFKTWRGLTDYIPGGYRPGYIQEFLGRGKSFGFMMLGKERMYWYAAA RAPLAQPDAPSGHKKELETMFQDWFASVPELILATDEANILTTDLYDRAPTQPWSKQN VTLLGDAAHPMLPTMGQGACTALEDAFVVAKCLREQADPITAFQQYESQRFPRTKLIV EQSLRAGKMGELDNPLSLLLRNTFMKLMGVTISNSFKSLHAYRA" gene 8436..11438 /locus_tag="DP116_00985" CDS 8436..11438 /locus_tag="DP116_00985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140771.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="signal transduction protein" /protein_id="PRJNA477356:DP116_00985" /translation="MESVTLTAVVTAIASILSTLPSEAVEKIGENIDDSVWVLRGKLV EKLRQKNKLVSLTGSVEGNEPQQLQQLDYGQGVLELKAAADTDPEIAQVVVEIEAAAR TLDFSYPQQVEKFIHKWFAAIPKTGEQLCTALKEPGKERIQDLVKNPLLLTLLCLNWQ SGDGKLPDTQAGLYQQLVDNFYKWRRAEFATNDHQRQQLNVKLGELAKAAIDKETIRF RLQQDFVSDFLGDADDENSLLKLALNLGWLNCVGIDTDRKPVYAFFHTSFQEYFAAKA IDDWHFFLNHILYHPSLGTYRIFEPQWKQTILLWLGRPEENLKQQKQQFINALVNFKD GIGKWNKYYDKGFYEYRAYILAAAGIAEFRSYSRADRIVAQIVKWVFGEADLIGEEAK LVLQHTDRTKVIAALVQLLQSNRLNDYTRIQVASSLGKIAPGNENAIAALVQLLPLNS LYTTYARYTEIAKSLGEIGTGNEIAISALVQMLLSTNISDIGDYEACKLAAESLGQIG TGNEIAITALLQLLQSPNLKWYNFTRWQAAESLEKIVTANENVIVALVQLLQSNGVDE NTRNWAAKSLGKIGTGNENAIAALVKLLQSTNVADHTRRLAAESLGKIGTGNKVAIAA LVQQLQSTDVDDYTCELAASSLGKIDPGNEIAIAALVKLLQSTNVGDHARRLAAESLG EIDPGNEIAIAALVQLLVSNDVKAWTYTEIAESLRKIDPGNENVITALVQLLQSNDLD DFTRRLAAYSLGKIDPGNEIAIAALVQQVQSTDVDDYTCELAAYSLGIIGTGNENAIA ALVQLLQSTNLSQDTRGEAAESLKKIGTGNENAIVALVQLLLSTNVDHFARMLAVQSL GKIGTGNVIAITALVQVLLSTQLLLSTNVGEGLLRQAAESLGEIGTGNENAIAALVQL LQSSNTNMDDYNYTRIQVANSLIKILQDNKHRFAVVKTLSGYNCYGVFWECAQNMPYP DFYRAWHHSKFATGVVQGLKEIFFTRII" gene 12285..12926 /locus_tag="DP116_00990" CDS 12285..12926 /locus_tag="DP116_00990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748869.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L3" /protein_id="PRJNA477356:DP116_00990" /translation="MSVGIIGTKLGMTQVFDEAGVAIPVTVIQAGPCTITQVKTKQTD GYSAIQVGYGEVKPKALNKPLLGHLAKSSAPALRHLQEYRIDNESEYTLGQQIKADIF SPGQTVDVIGTSIGRGFAGNQKRNNFGRGPMSHGSKNHRAPGSIGAGTTPGRVYPGKR MAGRLGGSRVTIRQLTVVRVDPERNLILIKGSVPGKPGNLVDIVPAIVVGKKS" gene 12991..13623 /locus_tag="DP116_00995" CDS 12991..13623 /locus_tag="DP116_00995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321348.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L4" /protein_id="PRJNA477356:DP116_00995" /translation="MFECVVKNWQGEQVGQKTFDFKVAKEETASHVVHRALVRQMTNA RQGTASTKTRSEVRGGGRKPWRQKGTGRARAGSIRSPLWRGGGVIFGPKPREYNLKMN RKERRLALRTAFASRIDDLIVVEEFSEQIQRPKTKELVGAIARWGSEPENKTLLILSE RTDNVFLSARNVANLKLLGSDQLNVYDLLHADKIVVTASALEKIQEVYSD" gene 13616..13930 /locus_tag="DP116_01000" CDS 13616..13930 /locus_tag="DP116_01000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651195.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L23" /protein_id="PRJNA477356:DP116_01000" /translation="MTRFDPRNLPDLVRRPILTEKATVLMEQNKYTFEVTPKATKPQI RAAIEDLFEVKVEKVNTTRPPRKKKRVGKFLGYKPQYKKAIVTVASGDVDKIRQVLFP EV" gene 13935..14798 /locus_tag="DP116_01005" CDS 13935..14798 /locus_tag="DP116_01005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L2" /protein_id="PRJNA477356:DP116_01005" /translation="MGTRSYRPYTPSTRQVTISDFSEITKTEPEKSLTVSKHRAKGRN NQGRITVRHRGGGHKRLYRIIDFKRDKREIPAIVTAIEYDPNRNARIALVQYEDGEKR YILQSNGLKVGTTVIAGPNSPIENGNALPLSNIPLGTSVHNVELKPGKGGQIVRSAGA TAQVVAKEGSYVTLKLPSGEVRMIRRECYATIGQVGNLDARNLSAGKAGRTRWKGRRP QVRGSVMNPVDHPHGGGEGRAPIGRPGPVTPWGKPALGLKTRKPKKASSKFIVRRRRK SSKRGRGGRES" gene 14900..15178 /locus_tag="DP116_01010" CDS 14900..15178 /locus_tag="DP116_01010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196740.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S19" /protein_id="PRJNA477356:DP116_01010" /translation="MGRSLKKGPFVADHLLTKIERLNDANKKEVIKTWSRASTILPLM VGHTIAVHNGRQHVPIFISDQMVGHKLGEFAPTRTYKGHARSDKKAGR" gene 15305..15667 /locus_tag="DP116_01015" CDS 15305..15667 /locus_tag="DP116_01015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314765.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L22" /protein_id="PRJNA477356:DP116_01015" /translation="MATDTTEVKAIARYVRMSPYKVRRVLDQIRGRSYREALILLEFM PYRSCEPVLKVLRSAAANAEHNAGLDRTELVITQAYADQGPVLKRFQPRAQGRAYQIR KPTCHITVAVAAEPAAAK" gene 15722..16501 /locus_tag="DP116_01020" CDS 15722..16501 /locus_tag="DP116_01020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455520.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S3" /protein_id="PRJNA477356:DP116_01020" /translation="MGQKIHPVGFRLGVTQEHQSRWFADPSRYPELLQEDHKLRQYIE QKLGRYAQNNAGISEVRIDRKADQIDLEVRTARPGVVVGRGGQGIESLRTGLQQLLGS NRQIRINVVEVQRVDADAYLISEYIAQQLERRVSFRRVVRQAIQRAQRAGVQGIKIQV SGRLNGAEIARTESSREGSVPLHTLRADIDYAYCTAKTVYGILGIKVWVFKGEIIPGQ EQTPPPATNRGDRDRDRDRDRPPSRRQQRRRQQFEDRSNEG" gene 16563..16995 /locus_tag="DP116_01025" /pseudo CDS 16563..16995 /locus_tag="DP116_01025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314767.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="50S ribosomal protein L16" assembly_gap 16847..16856 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 16999..17229 /locus_tag="DP116_01030" CDS 16999..17229 /locus_tag="DP116_01030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113423.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L29" /protein_id="PRJNA477356:DP116_01030" /translation="MPLPKISEARVLTDEQLGQEIIAVKKQLFQLRLQQATRQLDKPH LFRHARHRLAQLMTVEAERKRALSSQPAKETE" gene 17236..17484 /locus_tag="DP116_01035" CDS 17236..17484 /locus_tag="DP116_01035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877814.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S17" /protein_id="PRJNA477356:DP116_01035" /translation="MAVKERVGLVVSDKMQKTVVVAIENRAPHPKYGKIVVNTQRYKV HDEDNQCKIGDRVRIQETRPLSKTKRWKVTEILTTKNS" gene 17532..17900 /locus_tag="DP116_01040" CDS 17532..17900 /locus_tag="DP116_01040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L14" /protein_id="PRJNA477356:DP116_01040" /translation="MIQPQTYLNVADNSGARKLMCIRVLGAGNRRYGFIGDRIIAVVK DAQPNMAVKKSDVVEAVIVRTRHNTHRDSGMSIRFDDNAAVIINKDGNPKGTRVFGPV ARELRDKNFTKIVSLAPEVL" gene 17900..18253 /locus_tag="DP116_01045" CDS 17900..18253 /locus_tag="DP116_01045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310483.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L24" /protein_id="PRJNA477356:DP116_01045" /translation="MANKKKDQPRFFKMHVKTGDTVQVIAGKDKGKVGEIIKAIPQES KVLVKGVNIKTKHVKPQQEGESGRIVTQEYPIHSSNVMLYSTKQNVASRISYTFTAEG KKVRMLKKTGEIIDK" gene 18484..19029 /locus_tag="DP116_01050" CDS 18484..19029 /locus_tag="DP116_01050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877817.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L5" /protein_id="PRJNA477356:DP116_01050" /translation="MATRLKTLYQETIVPKLMQQFQYTNIHQVPKLVKVTVNRGLGEA SQNAKALEASLTEIATITGQKPVVTRAKKAIAGFKIRQGMPVGIMVTLRSERMYSFLD RLINLSLPRIRDFRGISPKSFDGRGNYTLGVREQLIFPEIEYDSIDQIRGMDISIITT AKTDEEGRALLKEMGMPFRDQ" gene 19050..19451 /locus_tag="DP116_01055" CDS 19050..19451 /locus_tag="DP116_01055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S8" /protein_id="PRJNA477356:DP116_01055" /translation="MAVNDTIADMLTRIRNANMARHQTTQVPSTKMTRSIAKVLREEG FIGEFEEAGEGVARNLVISLKYKGKNRQPLITTLKRVSKPGLRVYSNRKELPRVLGGI GIAIISTSSGIMTDREARRQNLGGEVLCYVW" gene 19550..20098 /locus_tag="DP116_01060" CDS 19550..20098 /locus_tag="DP116_01060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L6" /protein_id="PRJNA477356:DP116_01060" /translation="MSRIGKRPITVPAKVEVSIDGTKVLVKGPKGELSRDLPPHVIVS KEGEILQVNRRDESRTSRQMHGLCRTLVANMVEGVSKGFQRRLEIQGVGYRAQVQGRN LVLNMGYSHQVQIVPPEGIQFAVENNTNVIISGYDKEVVGNTAAKIRAVRPPEPYKGK GIRYAGEMVRRKAGKTGGKGKK" gene 20101..20463 /locus_tag="DP116_01065" CDS 20101..20463 /locus_tag="DP116_01065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L18" /protein_id="PRJNA477356:DP116_01065" /translation="MKHTRKESRERRRRRIRGKVDGSSDRPRLAIFRSHQHIYAQVID DTNHHTLVAASTLEPDFKSKLASGSNCQASVEVGKLIAVRSLEKGISKVVFDRGGNLY HGRVKALAEAAREAGLDF" gene 20787..21311 /gene="rpsE" /locus_tag="DP116_01070" CDS 20787..21311 /gene="rpsE" /locus_tag="DP116_01070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455542.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S5" /protein_id="PRJNA477356:DP116_01070" /translation="MATGRRKANRTKKEETTWQERVIQIRRVSKVVKGGKKLSFRAIV VVGNERGQVGVGVGKASDVIGAVKKGVADGKKHLIEIPMTKSNSIPHPIDGIGGGAKV IMRPAAPGTGVIAGGAVRTVLELAGVRNILAKQLGSNNPLNNARAAVNALSTLRTFAE VAEDRGIPVENLYI" gene 21711..22157 /locus_tag="DP116_01075" CDS 21711..22157 /locus_tag="DP116_01075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L15" /protein_id="PRJNA477356:DP116_01075" /translation="MRLTDVRPQKGSKKRPRRLGRGVSAGQGASAGKGMRGQKARSGG STRPGFEGGQQPLYRRIPKLKGFPLVNPKKYTTINVEKLASLPANTEVTLTSLKEAGI LTASRGPLKILGDGELSVPLKVQAAAFTGTARSKIEAAGGSCEVLE" gene 22229..23542 /locus_tag="DP116_01080" CDS 22229..23542 /locus_tag="DP116_01080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecY" /protein_id="PRJNA477356:DP116_01080" /translation="MISRDKAPTAQETFMQMAQAAGLRGRLLVTLGILMLIRLGIHLP IPGINRDEFARTLQNNNQVLSFLDIFSGGGLSALGVFALGILPYINASIIIQLLTAAI PALENLQKNEGEAGRRRISQMTRYVSVGWAIIQSVFLASFWLKPFAVNYGPIFVIETA LALVAGSMFVMWASEVITERGVGNGASLLIFVNIVATLPKALGDTIDFVQTGNRETVG RVIVLLLLFLVTIVGIVFVQEGIRRIPIISARRQVGRRIFAEQRNYLPLRLNQGGVMP IIFATAILSLPVLAVNFIKNPEYSRIINTYLVPGGSGAWVYALVYLVSIVFFSYFYSS LIVNPVDVAQNLKKMGSSIPGIRPGKATSEYIERVLNRLTFLGAIFLGLIAIIPTAVE SALNVRTFRGLGATSLLILVGVAIDTSKQIQTYLISQRYEGMVKQ" gene 23542..24096 /locus_tag="DP116_01085" CDS 23542..24096 /locus_tag="DP116_01085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859429.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenylate kinase" /protein_id="PRJNA477356:DP116_01085" /translation="MTRLIFLGPPGAGKGTQAKTLADEWNIPHISTGDILRQGLKDQT PLGVKAQSYMDKGELVPDELVQEMVQERLSQVDTKSGWILDGFPRTVNQAVFLEKLLR ILNQNGEKVVNLDVPDETVIARLLERGRKDDSEEVIRRRLEVYRSETAPLIDFYSSRQ TLVSINGNQSLEEVTAELKKVIAS" gene 24306..24530 /gene="infA" /locus_tag="DP116_01090" CDS 24306..24530 /gene="infA" /locus_tag="DP116_01090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006276978.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="translation initiation factor IF-1" /protein_id="PRJNA477356:DP116_01090" /translation="MSKQDLIEMEGTVTESLPNAMFRVDLDNGFNVLAHISGKIRRNY IKILPGDRVKVELTPYDLTKGRITYRLRKK" gene 24673..24786 /locus_tag="DP116_01095" CDS 24673..24786 /locus_tag="DP116_01095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L36" /protein_id="PRJNA477356:DP116_01095" /translation="MKVRASVKKICEKCNVIRRRGRVMVICVNPKHKQRQG" gene 24889..25284 /locus_tag="DP116_01100" /pseudo CDS 24889..25284 /locus_tag="DP116_01100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310474.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="30S ribosomal protein S13" assembly_gap 25151..25160 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 25341..25739 /locus_tag="DP116_01105" CDS 25341..25739 /locus_tag="DP116_01105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410703.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S11" /protein_id="PRJNA477356:DP116_01105" /translation="MARQQSGKKTGSKKQKRNVPNGIAYIQSTFNNSIVTISDQNGDV ISWASAGSSGFKGAKKGTPFAAQTAAESAARRAIDQGMRQIEVMVSGPGAGRETAIRA LQGAGLEITLIRDITPIPHNGCRPPKRRRV" gene 27068..28015 /locus_tag="DP116_01110" CDS 27068..28015 /locus_tag="DP116_01110" /EC_number="2.7.7.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314784.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit alpha" /protein_id="PRJNA477356:DP116_01110" /translation="MAQFQIECVESSTEDNRSNHSKFVLEPLERGQGTTVGNALRRVL LSNLEGTAVTAVRIAGVTHEFATVVGVREDVLEILMKMKEVILKSYSSQPQIGRLLVT GPTTVTAGHFDLPSEVEVIDPTQYVATVAEGGKLEMEFRIERGKGYRTVERGREEATS LDFLQIDSVFMPVRKVNYSVEETRGDSGIPKDRLLLEIWTNGSLTPQEALSSAATILV DLFNPLKEISLDIPDVGAEVPDDPTAQIPIEELQLSVRAYNCLKRAQVNSVADLLDYT QEDLLEIKNFGQKSAEEVVEALQRRLGITLPTERANKHT" gene 28306..28656 /gene="rplQ" /locus_tag="DP116_01115" CDS 28306..28656 /gene="rplQ" /locus_tag="DP116_01115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748892.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L17" /protein_id="PRJNA477356:DP116_01115" /translation="MRHRCGINKLSKPADQRRALLRSLTTELIRHGRITTTLARAKVL RSEVDKMITLAKVGSLAARRQALGYIYDKQLVHALFEQVPTRYGNRQGGYTRILHTVP RRGDNSKMAIIELV" gene 28729..29556 /locus_tag="DP116_01120" CDS 28729..29556 /locus_tag="DP116_01120" /EC_number="5.4.99.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317577.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA pseudouridine(38-40) synthase TruA" /protein_id="PRJNA477356:DP116_01120" /translation="MLSSHQPTQTCRVALVIQYLGTHFHGWQRQPQQRTVQEEIETAL CHILGHPVTLYGAGRTDAGVHAAAQVAHFNVTSPIPAHKWATILNSYLPKDILIRASA EVSHNWHARFSAIYRRYRYTFYTEKLPNLFASAFSWHYYHEPLDESLILAALEPLIGK HHLAAFHRANSGRQHSWVEVQAAECYRTGPLLYIEIQADGFLYGMVRLLVGMLVQVGS RQLTLTSFTDLWKDQRRQEVKYAAPAHGLCLLRVGYPDFPFPPEIWFETQPKLVFGQ" gene 29598..30059 /locus_tag="DP116_01125" CDS 29598..30059 /locus_tag="DP116_01125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877831.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L13" /protein_id="PRJNA477356:DP116_01125" /translation="MTTDKTYLPPQNTMERDWYVIDATNQRLGRLASEIAMILRGKNK PHYTPHMDTGDFVIVVNAEKVEVTGKKRTQKLYRRHSGRPGGMKTETFDKLQQRLPER IVEHAIKGMLPKNSLGRQLFTKLKVYAGPAHPHAAQQPKEIQIQTIPGVQD" gene 30059..30472 /locus_tag="DP116_01130" CDS 30059..30472 /locus_tag="DP116_01130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455558.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S9" /protein_id="PRJNA477356:DP116_01130" /translation="MQAADNSGRAMYWGTGRRKSAIARVRLVPGTGQMTVNGKPGDLY FQFNPNYISLAKAPLETLGLENEYDILVKAEGGGLTGQSDAVRLGVARALCQLDPENR PPLKIEGYLTRDPRAKERKKYGLHKARKAPQYSKR" gene 30654..30893 /locus_tag="DP116_01135" CDS 30654..30893 /locus_tag="DP116_01135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868067.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L31" /protein_id="PRJNA477356:DP116_01135" /translation="MAKPEIHPQWYPEAKVYCNGQLVMTVGSTKPELHVDVWSGNHPF YTGTQKIIDSEGRVERFMRKYGMMDGQSKGGRRKK" gene 30998..32098 /gene="prfA" /locus_tag="DP116_01140" CDS 30998..32098 /gene="prfA" /locus_tag="DP116_01140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651222.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptide chain release factor 1" /protein_id="PRJNA477356:DP116_01140" /translation="MAETYLLEKLKSVEQTFHELTRRLADPDTAKNPDEYQKIAKARS SLEEVVDTYETWKNAQEEFVGARQVLKESQSDPELQEMAALEVQELEQKIEQLENRLK ILLLPRDPNDDKNIMLEIRAGTGGDEASIWAGDLVRLYSKYADSQSWRVKLVSESLAE MGGFKEAILEIQGDSVYSKLKFEAGVHRVQRVPVTEAGGRVHTSTATVAIMPEVDEVE IHIDPKDIEMSTARSGGAGGQNVNKVETAADLFHKPTGIRIFCTEERSQLQNKERAMQ ILRAKLYEMKLREQQEAVTSMRRSQVGTGSRSEKIRTYNYKDNRATDHRLNQNYSLNA VLEGELEHIIQACISQDQQERLAELAASGANS" gene 32789..34240 /locus_tag="DP116_01145" CDS 32789..34240 /locus_tag="DP116_01145" /EC_number="6.3.4.21" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319329.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nicotinate phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_01145" /translation="MTSFPSLNHASTNNHQQQNTENQELTLSTVDNSLLTDLYQLTMA ACYAGEGVEQRWASFELFVRRLPENFGYLIAMGLAQVLEYLEKYHFSPSGIAALQATG IFANVPESFWSLLGEGRFTGNVWAVPEGTAVFANEPLLRVEAPLWQAQLVETYLLNTI NYQSLIATRAARLRDVASTHATLLEFGTRRAFSPQASLWAARAALAGGLDATSNVLAA LQLGEKPSGTMAHALVMALSAMEGSEDQAFTVFHRYFPGAPLLIDTYDTIAAAQRLSA KVNSGEIQLAGVRLDSGDLVSLSKQVRSLLPDVSIFASGDLDEWEIARLKAAGAQIDG YGLGTRLVTGSAVNGVYKLVEIDGTPVMKHSSGKTTYPGRKQIFRSFEGSQVKADSLG LVTEQGRRYGNEEFPQSSQVPLLQLFVKEGKRVQPLETLAEIRQRTATSVASLPDETR RLDNPVSVKVEISAQLQQLTEKTKNLTPQVQTE" gene 34504..35088 /locus_tag="DP116_01150" CDS 34504..35088 /locus_tag="DP116_01150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nicotinic acid mononucleotide adenylyltransferase" /protein_id="PRJNA477356:DP116_01150" /translation="MKVALFGTSADPPTAGHQAVIRWLSDHFDWVAVWAANNPLKSHQ TPLEHRVAMLHLLILDIDSSKHNIGLEQELSHLRTLETLEKAKKCWSEAEFTLVVGSD LLTQLPRWYHIEDLLQQVQLLVIPRPRYAIDESNLDIVQKLGGKVTVANFIGLDVSST AYRKNGDLSALTPLVVDYINKEHLYKCVPENLIC" gene 35158..35904 /locus_tag="DP116_01155" CDS 35158..35904 /locus_tag="DP116_01155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741589.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NUDIX hydrolase" /protein_id="PRJNA477356:DP116_01155" /translation="MPGHNQRKIVHALKQPPLADFKVGVDNVIFSVDTAQNRLLVLLV MRQQEPFLNSWSLPGTLVRQGESLEDAAYRILSEKIRVKNLYLEQLYTFGGPSRDPRE ASNSDGVRYLSVSYFALVRFEEAELIADGVSGIAWYPVKQLPQLAFDHNKILAYGHRR LRNKLEYSPVAFEVLPEVFTLNDLYQLYTTVLGENFSDYSNFRARLLKLGFLYDTGRK VSRGAGRPASLYKFDAEAFAPFKDKPLVFI" gene 35932..36228 /locus_tag="DP116_01160" CDS 35932..36228 /locus_tag="DP116_01160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493169.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="isochorismate lyase" /protein_id="PRJNA477356:DP116_01160" /translation="MKKPDECSNIKDIRREIDAIDKEVIAALGRRFAYVKAASQFKTS ETGVKAPERFHSMLQERRAWAEVVGLNPDVIEKLYRDLVNYLIDEELKHWQKKT" gene 36252..37955 /locus_tag="DP116_01165" CDS 36252..37955 /locus_tag="DP116_01165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319326.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD+ synthase" /protein_id="PRJNA477356:DP116_01165" /translation="MKIAIAQLNPTIGDLPGNAQKILDVAQKAVAEGARLLLTPELSL CGYPPRDLLLNPSFVQAMDIALQQLTRDLPPELAVLVGTVEENVKAHKTGGKTLFNSI ALLEKGRVKQVFHKRLLPTYDVFDEHRYFEPGLEANYFTLDNVQIGVTICEDLWNDEE FWGKRSYTINPIADLAVLGVDFIANLSASPYSVGKQKSREAMLKYSAVRFQQLILYAN QVGGNDDLIFDGRSFALNRQGEIVSRACGFEEDLRVVEFDEVQRDLHSGSIAPEYESE DEEIWHALVLGLRDYVRKCGFSKVVLGLSGGIDSSLVAAVATAALGKENVLGVLMPSP YSSEHSITDADALAENLGMKTHILQIGELMQDYDKTLAELFAGTEFGLAEENLQSRIR GNLLMAISNKFGHLLLSTGNKSEMAVGYCTLYGDMNGGLAVIADVPKTRVYSICRWLN RNGEIIPQNVLTKAPSAELKPGQVDQDSLPPYDILDDILQRLIHNHESTAEIVAAGHD SATVDRVISLVSRAEFKRRQAPPGLKITDRAFGTGWRMPIASKWTAIKNSYNPVFSVR Q" gene 38010..38840 /locus_tag="DP116_01170" CDS 38010..38840 /locus_tag="DP116_01170" /EC_number="1.17.1.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411217.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4-hydroxy-tetrahydrodipicolinate reductase" /protein_id="PRJNA477356:DP116_01170" /translation="MSNQGPIPVVVIGAAGKMGREVIKAVAQAPDMNLVGAIDTTVEH QGKDAGELAGLSEPLEIPITNQLEPMLAFASGEKQLGVMVDFTHPSSVYDNVRSAIAY GIRPVVGTTGLSPEQIQDLAEFADKASTGCLIIPNFSIGMVLLQQAAVAASQHFDHVE IIELHHNQKADAPSGTAIQTAQMLAGIGKIYNLPLVEETEKLPGARGSTAEEGVRIHS VRLPGLIAHQEVIFGAAGQIYTLRHDTSDRACYMPGVLLAIRKVLQLKSLVYGLEKIL " gene 38953..39642 /locus_tag="DP116_01175" CDS 38953..39642 /locus_tag="DP116_01175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867648.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease" /protein_id="PRJNA477356:DP116_01175" /translation="MIVPLTRQKFEQLIPLIATGPQYKYYWGKFPDFLQRLLISVVAV AVVFVMKVILGIDFGGIIFLLGLIGALYWLWGPVFWASMRNVKSRRCKYSGFLRGRVL DYWIAEELMGKQETVDNKGDLVIVENREKRIHLEIGDDTGFTAEYKAPLRSAYKVIAR GQRAELLVMSNRPDLSTIDEISDIYIPSRDLWVSDYPCIRRDFFTEVSSRLRRDEEDE RPRRRRPRVER" gene complement(39702..40280) /locus_tag="DP116_01180" CDS complement(39702..40280) /locus_tag="DP116_01180" /inference="COORDINATES: protein motif:HMM:PF00145.15" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01180" /translation="MPYLTAWEALCDLPLIDAGEGAEELVHSGNYYNLYQRERRGCRS PGMIFNHRAIKHCDRVQARYAQIPEGGDNQLLPVELRTKKRNVWKLNREKPSRTVTCN HRTDILHPILPRGTTVREAARLQSFDDDYRFFGNLTRKAKWLTQDDQVGNAVAPLLAR ALALHIKSVLAQEELKVQAELVKQTCNHNSNI" gene complement(40312..40728) /locus_tag="DP116_01185" CDS complement(40312..40728) /locus_tag="DP116_01185" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01185" /translation="MADSPAHAFGQKLGEFLEILAEYLLIPVATKHGLYLDRKGPRPA RGNLKKATWTDKYGSKHDLDYVLEGGGSVTKLGDPLTFVEVAWRKGTRHSKNKVQEIQ GAILPLVETNSHTVHSTSSSKGLHCNILSASDAWNP" gene complement(41142..41948) /locus_tag="DP116_01190" CDS complement(41142..41948) /locus_tag="DP116_01190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319322.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TPM domain-containing protein" /protein_id="PRJNA477356:DP116_01190" /translation="MQQSFWKRILAFVTVFFLAGSIWITHSPSAYAYDNPELLPDTPT PVVDLAKSLTSVQEENLVKQLEQFQAETGWKLRVLTQYDRTPGRAVIRFWGLDDKSVL LVADSRGGNILSFSVGDAVYDLLPRTFWIELQTRFGNLYFVREEGEDQAILQAMETVK NCLLQGGCAVVPGLPREQWIFTLITSVIGGVICGFAAQPRREGQVFAWQWALIFSPLW GILFIAFGIGPVVTRTNDWLPLIRNISGFLIGVLVAYLSPIFSQSSSTES" gene complement(42157..42753) /locus_tag="DP116_01195" CDS complement(42157..42753) /locus_tag="DP116_01195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319321.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF948 domain-containing protein" /protein_id="PRJNA477356:DP116_01195" /translation="MIDPLFWLGLSILLVAASLTAVLVAAIPALQELARAARSAEKLF DTLYRELPPTLDAIRMTSLEITDLTDDVSEGVKSAGQVVKQVDQSLDSARKQAQNVQT GTRSFMVGLQAAWKTFTRSKSSRRTVDRLSPSQKAKLTLQERQALRQENRFTPAEGYR TNKNYNDYNDYNVGGGENSFDEQDSHRQKESENWSDKE" gene complement(42768..43115) /locus_tag="DP116_01200" CDS complement(42768..43115) /locus_tag="DP116_01200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873916.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YtxH domain-containing protein" /protein_id="PRJNA477356:DP116_01200" /translation="MSNNRSGSFIGGMMLGATIGALTGLLIAPRTGRETRQLLKKSAS ALPELAEDISTSVQIQADRLSANARSNWDDTLDRLRDAISAGIDASLRESQAMKQQNA QEKSNTIPQHSDS" gene complement(43237..43566) /locus_tag="DP116_01205" CDS complement(43237..43566) /locus_tag="DP116_01205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746671.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01205" /translation="MVTVVVVINTLISLVLLYVAWRVRRLKRRLTRIANIFIAAERSS HAVLYTAPQALYTGQGNINNIRQKDQPARLQIQRLRQILSLVALGQQVWQSNFLRFRA KLVRKRR" gene complement(43712..44083) /locus_tag="DP116_01210" CDS complement(43712..44083) /locus_tag="DP116_01210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409919.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2237 domain-containing protein" /protein_id="PRJNA477356:DP116_01210" /translation="MTEGSNVLGEKLEICCSSPVTGYYRDGFCNTGGMDFGLHVVCAQ VTTEFLEFTKSRGNDLSTPVPEYQFPGLKEGDRWCLCAARWQEALEAGVAPPVILSAT HARALEAVSVDDLKKHALTSS" gene complement(44281..44811) /locus_tag="DP116_01215" CDS complement(44281..44811) /locus_tag="DP116_01215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867948.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_01215" /translation="MMTDSRMKRGQEWLKTLLQLSGLSVDIKGEKETAIAEDEESPEQ DSYWLTIGETNLSPEQIEILTGPDGSVLDAIQYLANSILNLSQLQEEQASYTVELNGY RVRRYAEIRAIAEAAAQQVRSSGQEAEIKSLSSAERRQIHTFLKEFGDLETFSRGKEP HRQLIVRLASMESLRS" gene complement(44811..45956) /locus_tag="DP116_01220" CDS complement(44811..45956) /locus_tag="DP116_01220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320156.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="membrane protein insertase YidC" /protein_id="PRJNA477356:DP116_01220" /translation="MDFGIGFLSNNVMLPIIDLFYSIVPSYGLAIVALTLIIRFALYP LSAGSIRNMRRMRIVQPLMNKRMQEVRERYKDDQQKLNEEMMNVQKEFGNPLAGCLPL LLQMPVLLALFATLRGSPFAGVNYSVNLQVFPGEQIERIQPQAFATPPQNIYISEGEH LKVNAILPGGNKLAVGEHTKIEYQTVEGKPFQALLAEHPDTQLIPEWKITKGEERIKI DAEGNIEALQPGDVTIQGTIPGLAANQGFIFIDALGRVGAIDPDGIIHWDIVAMVILF GITLYVSQILSGQNTTNANPQQDTVNKITPVIFSGMFLFFPLPAGVLMYMVIGNIFQT LQTYILTREPLPEELQKIVDTQEKEEESAKQKALPFEPKSSKKKTTG" gene complement(46174..46563) /locus_tag="DP116_01225" CDS complement(46174..46563) /locus_tag="DP116_01225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458191.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PH domain-containing protein" /protein_id="PRJNA477356:DP116_01225" /translation="MGIREEVYYEGGPHIGDLITSILIGFTIVGLPLTVGAIVRALWL RFRITDRRISVMGGWMGRSRSDVIYSEIVKMAKIPRGIGLWGDMVITLRDGSRLELRA LPKFREIYDYIEEKVIAKNPSYVGAKK" gene complement(46550..46966) /locus_tag="DP116_01230" CDS complement(46550..46966) /locus_tag="DP116_01230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867951.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease P protein component" /protein_id="PRJNA477356:DP116_01230" /translation="MALPKANRLKSRRDFQAVFREGIRRHGSHITLRALRPSPSSKPS CDTAPETVNTNTKHLTPAQIGISISTKVSKRAVVRNRLKRQIAAALYQLLPKMSPGWR IVVVVKPAADIKCVTQQFLQELEQLLEQAEVFNGHS" gene complement(46997..47131) /gene="rpmH" /locus_tag="DP116_01235" CDS complement(46997..47131) /gene="rpmH" /locus_tag="DP116_01235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L34" /protein_id="PRJNA477356:DP116_01235" /translation="MKRTLEGTCRKRKRTSGFRARMRTPDGRNVIRARRKKGRHRLSV " gene complement(47363..47875) /locus_tag="DP116_01240" CDS complement(47363..47875) /locus_tag="DP116_01240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214885.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01240" /translation="MRRLLSTLAVTSCLLAGIPAISWAQSLPGLTIFSGVKGENQLPF RLDFGGQTNGWDRYRLKISNKKMKTAVARFVISYPNYYQGTFDSKEIEVKAKGKKIAL SQVKWDKENHVLEIFPQEPVPAGTDVELILSNVKNPSSGGMFYFNCSIQSPGDVPLSR YIGTWIISIS" gene 48576..49739 /locus_tag="DP116_01245" CDS 48576..49739 /locus_tag="DP116_01245" /EC_number="1.6.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743020.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Re/Si-specific NAD(P)(+) transhydrogenase subunit alpha" /protein_id="PRJNA477356:DP116_01245" /translation="MRIAVAKEIEVCERRVALIPDTVTRLVKQGVEVWVETGAGERAF FSDAAYEAAGAKVITDTATLWGEADILLKVSPPQEREDGRSEVDLLKEGSVLISFLNP LGNPSVAGRLAERKVTAISMEMIPRTTRAQSMDALSSQASIAGYKAVLIGAAALPKYF PMLTTAAGTIAPAKVFIMGAGVAGLQAIATARRLGAIVEAFDIRPAVKEEVQSLGAKF VEVKLEEETTAAGGYAKEISEASKQRTQEVVAEHVKNADVVITTAQVPGRKAPLLVTE EMVAQMKPGSVIVDLAAEQGGNCACTDPGKDIVWNGITIIGPINLPSSLPVHASQLYS KNLSSLIQLLIKDKALNVNFADDIVDAACVTHGGEIRNQRVKDALQALSGVAS" gene 49800..50093 /locus_tag="DP116_01250" CDS 49800..50093 /locus_tag="DP116_01250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P) transhydrogenase subunit alpha" /protein_id="PRJNA477356:DP116_01250" /translation="MTEALIAALFVFVLASFTGFEVINKVPPTLHTPLMSGSNAISGI AVLGAIVASGARETNLSVILGLIAVILAMVNVVGGFLVTDRMLQMFKKKEIKA" gene 50090..51496 /locus_tag="DP116_01255" CDS 50090..51496 /locus_tag="DP116_01255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015081071.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD synthetase" /protein_id="PRJNA477356:DP116_01255" /translation="MSDFLPTGIQLTYLVAASLFILGLKQLGSPATARQGNVVAAVGM LLAIVATMLDQHVLNYEMILVGLAIGSLVGIVVAYKVQMTDMPQMVGLLNGLGGAASA LVAVAEFWRLLGNGEAIPLDANISMLLDVLIGGVTFTGSFVAFAKLQGIISGSPITFP LQQPFNLSLLVAFIAGSAYLIISPHSLPVFLGIVAVSLVLGVMFVIPIGGGDMPVVIS LLNSFSGLAAAAAGFVVMNNMLIIAGALVGASGIILTEIMCKAMNRSLFSVLFSAFGT VTASGGAAGTGGTTDKSVRSIDPEEGAMMLGYARSVVIVPGYGMAVAQAQHNIRELAD QLERMGVDVKYAIHPVAGRMPGHMNVLLAEANVPYEQLHDMDDINPQFEQTDVALVIG ANDVVNPAARSDTSSPIYGMPILEVDRAKQTIVIKRGMSAGFAGVDNDLFYKNKTTML FGSAKDMVGKLVSEVKQL" gene 51733..51987 /locus_tag="DP116_01260" CDS 51733..51987 /locus_tag="DP116_01260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740486.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system ParD family antitoxin" /protein_id="PRJNA477356:DP116_01260" /translation="MNISLKPEHEQFIQSQIQAGRYANAEDVMNEALKLMQAREQRLE ELRQKIAVGKEQIARGEVTDGEIVFAQLQDKINKIAESQR" gene 51984..52283 /locus_tag="DP116_01265" CDS 51984..52283 /locus_tag="DP116_01265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320828.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_01265" /translation="MSNYSFSDEAVKDLNSICEYIAQNNPKAASKLFDAIRQKCKLVS GFPNMGKSYEELSPNLRGFSIEDYIVLYYPREDGIDIARVISGYRDLESMFLEPE" gene 52474..55944 /locus_tag="DP116_01270" CDS 52474..55944 /locus_tag="DP116_01270" /inference="COORDINATES: protein motif:HMM:PF07721.12,HMM:PF13424.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01270" /translation="MGKEQQKLILSQHRYEEQVKKRLGSLGNEPQNILIFYGRAGIGK TDLSKSLFQSLKSNYPVCARLDGEGIFNYGLARKPIESAVVQLRAHLRRGGADLSCFD LAYRIYSAGANPLMVTISPEAANRMDWAEKISNGTDLADAVLEMNPAELAPSLLEGLT ELAKDLLPGGQVLAKLGWFLMQQSPEIWQWWKERGNQNLRELKDCVSPYEILERLPLF LARDLQQYLRRSQHKAVIFIDGYEKLVDKFGRCDWLEELLEQPNPNVLWVIFSERSLN FTKYAHNIPILPLTEAECQAVLQEFGIDEPEICQIIIQASGGIPLYLRLGVETWQDIK KQRQPKLTDFARNLNEVLRQRDIAWQPDERRMWQVLSHCRTWDEALFAKLMSQFQLDN WENRLSQITVSPYVEEAGSGVWRLNQVMQQHLQENQPEDLRKSVNNWLFEYYRAEYQE PELQLTALAEALYHGLESEQPEAATSWFLKQVAVQQEVGRHQAVVSMLQFLVGKNHQL PLAWTLLGKSLVVLGDYEQALEALETARSQWEALQQRESLDAGTMELELANVYLKLER TFDASNAAQKAYRIRTAQLGANESSVAEVLNRQAEIAASQGDYREAVNLSQRALQILQ FHPDTQPLQLAQLKHTAAWLNAYNNNLDAAEKLCQEALEIVKNNAGDEHSLAISCQAS LGDIYQGMGEHKYQKAYEQYQLALDAADISLSPSHPQTLQLLQGLTHLCRRMGEYDAA DEFAERHNAHVQIGNFEETAAAATRLNNLGFSLYKKGEYGKAEPLLKQALQIFLKALG EEHPHTALSLNGLARLYESQGRYNEAEPLYNQALQISLKVLGEEHPNTATSFNNLGWL YESQGRYDEAEPLYNQALQIRLKVLGAEHPHTDISLNSLAGLYESQGRYDEAEPLYNQ ALQIRRKVLEAQHPDTAMSLNILAGLYKSQARYDEAELLYNQALQIRRKVFGAEHPDT ADSLSSLAGLYESQRRYDEAEPLYNQALQISLKVLGAEHPHTAKSLNSLAGLYKSQGR YDEAEPLYNQGLQIFLKALGAEHPHTGRNLNNLAGLYESQGRYDEAEPLYNQALQIWL KVLGEEHPHTAVSLNGLAGLYESQGRYDEAEPLYNQALQIRLKVLGEEHPDTKETESN LNRLRDEMTK" gene complement(56247..56465) /locus_tag="DP116_01275" /pseudo CDS complement(56247..56465) /locus_tag="DP116_01275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948901.1" 
/note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" gene complement(56722..56919) /locus_tag="DP116_01280" CDS complement(56722..56919) /locus_tag="DP116_01280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316763.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01280" /translation="MESPFLTILASVAAVLLVAVTGGVGYLTLAGWRDRRLREDEKRE MRRASSSTTSKSTAPKLKKKK" gene complement(56927..57016) /locus_tag="DP116_01285" CDS complement(56927..57016) /locus_tag="DP116_01285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017322509.1" /note="cytochrome b6-f complex subunit 8; with PetL, PetG and PetM makes up the small subunit of the cytochrome b6-f complex; cytochrome b6-f mediates electron transfer between photosystem II and photosystem I; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome b6-f complex subunit PetN" /protein_id="PRJNA477356:DP116_01285" /translation="MEILTLGWVALLVVFTWSISMVVWGRNGL" gene 57101..57565 /locus_tag="DP116_01290" CDS 57101..57565 /locus_tag="DP116_01290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318055.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01290" /translation="MNKILDPQLMAERIESLKAGILAGLSLMIAFFLTTFVNNLVLAK YFEQLASLAIDSLDLQLLLKLGIAVFCGLLFGVTYRYIIRSDKNPQLKAGGVLAFGLV RCLTQVDVGLSYSRDILPFVVLGVESILWFVLAAFFLDTAIQLGWIKPFQSS" gene 57599..58981 /locus_tag="DP116_01295" CDS 57599..58981 /locus_tag="DP116_01295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316765.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome P450" /protein_id="PRJNA477356:DP116_01295" /translation="MNLPNPVKTPSFLQRIQWVAEPIGYMESAAQQYPDIFSTTVVGS RRPLVFVNHPQTIAEIFTNDRKKFAALSQENRILQPLLGDSSVVMLDGDRHKRQRQLL MPPFHGERMRTYGEIIVNITEKVFSQLPQNQPLSIRTAMQEISLQVILQAVFGLYEGE RCQQLKRVLASTLGVFESPLSSSFLFFPFLQKDLGAWSPWGRFLRQRQQIDKLLYAEI AERRAQDDPNRIDILSLLMSARDEKGKPLTDKELRDELMTLLFAGHETTATAMAWALY WIHHIPQVGEKLLQELGTLGDSPNPIDIARLPYLSAVCNETLRIYPVGFLTFGRVAQE PVEILGHHLESGKVVFGCIYLLHHREDLYPQPKQFKPERFLQRQYSPYEFMPFGGGAR RCIGEALAVFEMKLVLATIVSRYQLALTTNQPEQPQRRGVTLAPSGKVKMIITGERKH QESPKVAASV" gene 58993..59331 /locus_tag="DP116_01300" CDS 58993..59331 /locus_tag="DP116_01300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316766.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbon dioxide-concentrating protein CcmK" /protein_id="PRJNA477356:DP116_01300" /translation="MTLALGMIEVFGVPTAIEAGDAMCKAARITLVGYENTDLGRITV LIRGAVGEVNVAVAAALEAIPRVNGGEVLSHHIIPRPHENLEYVLPIHQSVNIEQFNS YIRFPPPLSA" gene complement(59378..59953) /locus_tag="DP116_01305" CDS complement(59378..59953) /locus_tag="DP116_01305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457668.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_01305" /translation="MNTITIDLKPVIELTDEQFYQLCQKNPDLKFERSAKGELIIMPP TGGETGSHNSEMNADFVIWNRQTKLGKVFDSSTAFKLPNGADRSPDVTWIRQERWDVL TFEQKEKFPPIVPDFVLELMSPSDNLQKTQEKMQEYMENQVKLGWLIDRKTRRVEIYR LGQAVEVLESPTELSGEDILPGFILDLRTVF" gene complement(60164..60520) /locus_tag="DP116_01310" CDS complement(60164..60520) /locus_tag="DP116_01310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015974224.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01310" /translation="MKTNKCSNCYLVGVDLGVANLEGADLAGANLRGARARCGIHLNG ANFNGADLRGFSSNNCIGFDLQGANFRGANLKGANLQYANLRGADLTGVDLSEVNIEG ADIAGAIVPNEAKSPK" BASE COUNT 17359 a 13018 c 13723 g 16435 t 20 others ORIGIN 1 aaatgaggcg tataaattcc cactgctgtg cccataccag ctaaacccat attgccaacc 61 gtagcagaaa ttcccgcacc cgctacactt cctccagcaa gaaacatgac tccttctgtt 121 tggttgaaat tacctgcaat attattcaag tttgctttta taactcgatt cacgcgttcg 181 cccaatccca cagttttctt tccttttgac cttatctaaa gttttgtagc cctatccttt 241 gtcaaccttc atgctcatat ttgatactta cgcaaagcct atgccacttt gttagttggt 301 caattagaca tctggtggaa ataggtgatg caatattaat ccgtaacgag ctttccaacg 361 tacaacagcg cttgttaaca acaattaaca ccacgattta ctataagtaa actagataac 421 gttgactaga tatattaaag ggttataaag aatcatcttt agaacactgg atagtccctt 481 gtgaggaaaa catgcagaca gttgttctcg ttctcgttga agtgctaatt gttatcggac 541 tgtcacggct agtaggactc gccttccggt ggattaacca accattagtc attggagaga 601 ttgttgctgg gattatgctc ggaccatctc tttttggttt ggttgctcca ggattagcat 661 ctaccttgtt tccacccgaa acaattccct tcctgaatgt gttgtctcag gtagggttaa 721 ttttcttcat gtttttgatt gggttagaat taaaccccaa atacatcagc ggcaatttag 781 aaatagcagt attaacatct cacgtcagta ttctagtacc gttttcgtta ggaacgctgc 841 tcgcagtact gctatatcct ctagtttcca atgctagtgt gtcgtttacc gcctttgcgc 901 tgtttctcgg agcagcaatg tcgattactg cttttccggt cttggcacgt 
attattactg 961 agaacaactt gcaaagaaca cgcttgggaa ctttggcgtt gacttgtgct gcagtagatg 1021 atgtcactgc ttggtgcttg ttggcagtgg cgatcgcagt agcacgtaca ggtagtatta 1081 ttggtgcttt tcctaccatt atcgccagct tagtatacat tggcttgatg atgacagtag 1141 ggcgaacttt cttgaaacgc cttgcgacct actaccgtcg cactggacgc ctcagccaat 1201 tggttctagc tttgatttac atggcagtgg ttgcttctgc tctgattacc gaactgattg 1261 gcattcacct tatatttgga gcgtttttac taggggcagc catgccaaag aatgcaggtt 1321 tagtgcggga aatagcaata aaaacagaag attttgtgct gatatttctg ctaccagtgt 1381 tctttgctta tagtggtttg cggacggaaa ttggtttgct gaataagcct gaattatggg 1441 cactgtgctt agctgtgctc gcagtggcaa tcataggtaa atacgttggc acttatgtag 1501 cagcacgtat cagtggtata gagaagcgag aagcctcagc actaggttgg ttgatgaata 1561 ctcgcggctt aactgagctg attgtgctga atattggtct gagtcttgga gtcatttcgc 1621 cgctactatt taccatgctg gtgattatgg cattagtcac aacatttatg acttcgccac 1681 tgttagagcg aacttatccg aaaaagctca tcaagttgga tgtggttgac caagaaccag 1741 aagaaataaa agcagacacc cctgatggtg aagattttta caatcgtcct taccgaattt 1801 tggttccagt cgcaaatccc aatagccaaa aaggtttggt gcagttggct gctgcgatcg 1861 ctatcaacta tcaacaaccc gccgttgttt acccactcag tttgattgaa ttccaagaag 1921 attatgcttt tgaaaatacc ccacaagaag caaataaagt cattgcacag cgtcgccaac 1981 agctagaaga attgatttca cgtctagaac cattagaagc acgctcgtat atgcatccta 2041 ttgtttgcac atcgagcaat gttgcacgag aaacagcacg gattgcctta cttgaacaag 2101 cgaacttagt tttagtagga tggcatcgtc cagcttttag taagaatcgc ttaggtggac 2161 gagtgggtca aattctcacc aatgcaccag tggatgtggc ggtgtttata gatcgaggac 2221 aagagcgttt agaaaggttg ttggttcctt attccgcaaa cattcatgac gatttagcat 2281 tgatacttgc tttgagattg ctaataaatc gtgatacttg ccgattactg gttttacagg 2341 ttgtggcaga gaatcaagtc aaaaatgaac acagttacga gcttgacact gtaatggagc 2401 aattgcctca aagtgtgcgc gatcgcattg atatcaaaat agtcgaagcc atagagccaa 2461 ttcaagctgt agtcgcagcc tctgaaactg ttgatctgac tattgctggt acaagccgcg 2521 cttggggtat cgaacgtcaa accttgggaa gatacacaga tcaactagct attgagtgtc 2581 gctcctcact gctgattact cgtcgccaca gtcaaattag ctctcatatt acttctgttc 
2641 tcgttaacga gaaattagag gtaaaacgtt agatgaatag tgaagaatgc tgaatgatta 2701 attggaaaaa tataaagttt attctgacat cattcggtat tcatcattct cctctgttcc 2761 ctatcacccc ttatctttat aaccatgagt catttcaact tcaaatctct cactttctat 2821 ggtgtggcaa taagttcagt actgcttttg tttaaagttg taacggctta tggagaaaac 2881 agcatcaaag cgccgccagc aattaacagc aactatcgcc tcgtgttgag tgaaaagttg 2941 cctgtctgcg aaaaatcaga ttttctactc ttaaatcttc agcaatcagg aatttacttg 3001 aatggttttt tactgccagc taactccact aaaacttcag cactatctga aaaacattct 3061 ctggcgggga aacttcagaa tcaacagttg agtttatcgg gagaagttac gaggcacatc 3121 ctatgtaata ctccaaattc tcaaacacaa gcaaacatta tcccagttaa aatacaaatc 3181 aaattagctg agaaaggaga tttaataggc aaaataagtg tgagcgatag ttccaaaact 3241 attggattga cagcaacacc acaaaaaact aacgaacaat ctgagcagcc aaagaatcat 3301 tgaacagctc ctaaccccat gtttgagcgt gctaccattt gaccaatgat tccttgatgg 3361 tacaagctgc cagattgatc cgtggaatca attctttaat ccaaaatcca aaatttccaa 3421 gccctcggct tgatccgtgg ggtcaatcca aaatccataa gttaagatcc aaaattgtct 3481 gacacctttg gtagaagcgg gtagtatagc agttgccagg tagattagga cgtgaactaa 3541 tgagaaaata cggacagcac gagacaatca cacgcctatt aagagttccc tgttccctgt 3601 tccctgttcc ctgttccctg ttccctgcta taaaaaaaac agttgtcatg gacaattgtg 3661 gataaattat gtggagtcgt gaggagtatg aacaaggaat taaagaataa acaatttttc 3721 cggttctgca agtatctgtt tcgcgttagc ttgtttgcct tgtgtacagg tgtgctttac 3781 agcaagcccc ttagcgtcta tggttttgat gtgggcaata gtagtagcgt agcgccatta 3841 gcaaggcttg aggattggcg cttttaccca gaggcgtcac aactagaaat ttccctctca 3901 gcagctgcac aacctcgata cttttacctc gcccaaccgc cacgcattgt tatagatttg 3961 cctggtacta aattaggcaa aattcctacc aaacaaaatt tctatggagc catccgaagc 4021 attcgcgttt cccaactcaa cgcagatgtc actcgtattg ttatggattt agcaccagga 4081 atcctcatcg ctccaacaag agtgcaactg caacccgttt ctagagaaaa tcccacccgt 4141 tgggtattac gtcctttggt tgtcagtaac ggtacctctt tacctgggaa tttttcatct 4201 acaactattt ccccaccgct accaagcaac acacctgata tctataatcc cccacaatcg 4261 ccaggcaacc taccacctag catatacaat aatcccccac aaccgccagg caacttacca 4321 
cctagcatat acaataatcc cccacaacta ccaggcaact taccacctag cgtatacaat 4381 aatcccccac agccatttag taactcacca gtgggtgtct ataatcctcc acaaccacca 4441 agcaacttac cacctgcaac gtataacaat ctcccacaac ttcctggcaa cccaccatca 4501 actacctata atcaacaaca agtacctgat ttcagtgtac cgccaccatt aacaactcta 4561 cctacaaata acttttcaag cgcaccatct gtccaagtac cgcctctgac cccgaacaac 4621 tcctctcaac tccctggttc tagttttagc ccaccctctt tgccatatca aggcagttat 4681 acgaacaaca gcgctccttc tttgggtacg tcgaattttc caatcccaaa tctccccact 4741 ggttcacgca gttctcccaa ctctaaagta attgagtttg gtgaaccgtt tcccaactct 4801 tcaagataaa gagcacacaa aagtaaagaa gaacgagaaa agaaagaaca aagtaattct 4861 tttttcttcc ttttcttgtc ttcgttcttt tctagaatct ttctgtttcg attcctgggg 4921 ctgttgctcc tggttggtta atgaagaagt ttttcgcatc atcgttctcg taagtgcagc 4981 taatgtcagg tttgtcaccg ttcaagtcac ccgcgtaggc aaagttgttc aaacgatttt 5041 tgcattctct gagctgctgt gatgtcacca gcttcttttg ttctaaaatt gtcatgttac 5101 tcgaacgcag aacacatcca ggacgcatac tgggctgagc aatgtagact ttaaaagggt 5161 tgagagtcac aaacagtctg gtatccatca ccatagcgct ggctccatac tgcacacaaa 5221 tatcagcatt cggtgctttg gtatcaataa attctcggct agctacgttt gacgtactaa 5281 acgtagccgt ggagctaaaa gcaataccaa tcccaatacc taaaacaaac acccctccca 5341 aaatagccat cgtggtgaaa ttaaacatcg gggattggaa gatagaaggt ttaggagtcg 5401 taactgttct accagtagat tttcgtctca ttgtcgctta gtcaacttta cgctttggac 5461 agtagggagt gaattcaata gctcccctta tttcagtatg ccgattcttg agtctcattg 5521 tctgaaactt tttctttgaa gttcctacca tccaacaaat tcagcaaatc cgctaaacat 5581 gcaggtttaa tttgaagtat tttttgtttt tgattttcat ttcgatctat gagggctttt 5641 tactaccttt ggtagtaaag gtgtcaagaa atgttaactt aaaatttaag aaaatattaa 5701 tacattcagt acaaataaag tggtacacct acaatcgcgg tcctttggca agagtttcaa 5761 aatctcacaa tttcaaactc tcatacggga tagtctcact atcttctggg gagattggct 5821 agagttgcgt acaaaacttg tacaaattat ttcctcagga ttaatctctc cattgatata 5881 cattctagct tttggctttg gtttgggtag ctctatcaaa cctggttcag gactcagtgg 5941 tgactataat aactatttag aatttatttt gccaggaatg gtagcactgt cgtctatgac 6001 agtcagcttt 
gtcgggacaa ctttctccat ctgtggagat agactcttta ccaaaaactt 6061 cgaggaattg ttactcgtcc ccgtgcatcc cctagcgctg catataggaa aaatgctggc 6121 gggcgtgaca cgagggttga tgacttcact gggtgttatt ttagtagcgc tggtgtttac 6181 tcgaaactgg aattttctca atccattgtt tctgctgata ttggtattga gttgtgcagt 6241 ttttgctggt ttgggagtca ttgtagggtt aacggtaaaa tcccttgaat cagtcggact 6301 ttacaacaac tttattatta tccccatgtc gttcttagga gcaacgtttt ttgatccagg 6361 tacattacct acagccttaa aattcgtagt ttacttgtta cccctgactt acgccagcat 6421 tggactacgt gcagcggctt gtttgccatt gtctcagttt ccttggtatt gcttgccaat 6481 tttacttgtg atggcgatcg ccctttccct atggggtgct tacgtatttt ctcatcaaca 6541 ggattaagac ttggcagggg acagagatgg tttgttctct gtcccctgtt ggttgtctta 6601 actagctaga agcgtctttt gcaaaggcgt ccaacggaag gcgcgatcgc cacccctctc 6661 aatcaccact ttcactcgtg gttccaagct gggtaaggcc gctaaaaact caacttcttg 6721 atcgcaaaga atttgccctt cgatgatcca caacaccaag ccctttgatt tgggtgggag 6781 gtgttcgagt tgagagctaa caatcggcgg cacatagtaa acataaccta ctgccgcacc 6841 agtgctagta ctttttttcc caagatgagg aggtctaaca ccatgaactg ttgtgagata 6901 ggcagctata tcgcctaatc cctttgcact cagatcgaga gtgacataac ctgctgcacg 6961 cagtcggcgt ctgtaccgac cttcaaaacc tccttctaga ggtacgtaaa caccaagagc 7021 gccaaatttc tccagatctc gaattaaagc gttgccagtg gtaattagtg ccataagttg 7081 atttttgttc tatcccactt cgatatttct attattattg tttgttaagc atatagatta 7141 agttctacaa attaacttat atttgttttc gctcaagctc tgtaagcatg aagcgacttg 7201 aagctgttac tgatagttac tcccatgagc ttcataaatg tgtttcgtag cagcaagctc 7261 aaaggattat ccaattcacc cattttcccc gcccgcaagg actgctcaac gattaacttt 7321 gtgcgaggaa accgctgtga ctcatactgt tggaaagcag ttattggatc agcttgctcc 7381 cttagacact tagccactac aaaagcatcc tctaatgctg tacatgctcc ctgtcccata 7441 gttggtagca ttggatgtgc ggcatcaccc agtagcgtaa cattttgctt gctccaaggt 7501 tgagtcggag cgcgatcata caaatctgtt gtcagaatat tggcttcatc cgttgctaga 7561 attaattcag gaaccgacgc aaaccaatct tgaaacatgg tttcaagttc ttttttgtga 7621 ccacttggtg catctggctg tgctaaaggt gctcttgcgg ctgcatacca atacatccgt 7681 tccttaccca gcatcataaa 
gccgaagctt ttaccacgcc ctaagaactc ctgaatgtag 7741 ccaggacgat atcctccagg aatataatct gttagaccac gccaggtttt aaaattacga 7801 tagatgggtg gttgttcacc caacagagca gctcgtactc gcgatcgcaa accatcagcc 7861 ccaatcaaag catccccctc aacacttaag ccggaactaa agtgagcgcg aatcttttct 7921 tcttgttgct ccaatcgctt gaaagtttgc tctaaaacaa atttttcctg tacattacgc 7981 cacaatagct gatgtaattc cgctctatga ataccaataa cgggtaactc gaagccatca 8041 acggctatat tgactaattc cttaccgctt tgagaattga attgataatt tgttgtgaga 8101 taaccaacgt ggatcgcgtc ctccaataag tctaaatttc ttaagatatg tgtggcattt 8161 gcccaaagtg caataccagc ccctacctcc tgcaattcct ttgttcgctc atagacaact 8221 ggttccaaac cagccctatg caaagcaagt gcagttgcag caccgccaat tccgccgccg 8281 ataatgatga tcttcttagc tgtttgcatt tttactacta tacagaactt atccaaactt 8341 aagtcaaaac gtaagtcaaa aacactcaat tgctggtctt taatagcatg tgtgacaatt 8401 gaaataattt ctacatcaac ttgtttgtag tgtttatgga atctgtgaca ctgacagctg 8461 ttgttactgc gatcgcttcc atcctttcga cacttccgag tgaagcagtt gagaaaatag 8521 gcgaaaatat tgatgattct gtgtgggtgc tacgtggcaa gttagtagag aaactccgcc 8581 agaagaataa gttagtatcg cttaccggat ctgtagaggg aaatgaaccg caacaattac 8641 aacaattaga ttatggtcaa ggagtactgg aactgaaagc agcagcggac acagatccag 8701 aaattgctca agtagttgtg gaaattgaag cagcagctcg cactttagat ttttcttatc 8761 cacaacaggt agagaaattt attcacaaat ggtttgctgc tattcccaaa acgggggaac 8821 agttatgcac tgcgttgaaa gaaccaggta aagagcgaat tcaggatttg gtgaaaaatc 8881 ctctgctgtt gacgctgctg tgcttgaatt ggcaatcggg agatggcaaa ttaccggata 8941 ctcaagccgg actctatcag caattggttg ataatttcta taagtggaga agagccgaat 9001 ttgctacaaa cgatcaccag cgtcagcaac tcaatgtcaa attaggggaa ttagctaaag 9061 cagcaataga caaagaaaca atccgttttc gcttacagca agattttgtt agtgacttct 9121 tgggagatgc tgatgatgaa aattcgctgt tgaaattggc actaaatctc ggttggttga 9181 actgtgtagg gatagataca gatagaaaac ctgtttacgc tttctttcat acctcatttc 9241 aagagtattt tgctgctaaa gcaattgatg attggcattt ctttctcaac cacattctct 9301 atcatccaag tcttggaact tatcggattt ttgaaccgca atggaagcaa acaatattac 9361 tttggttagg gcgaccagaa gaaaatctca 
aacagcaaaa gcaacaattt atcaatgctt 9421 tagtcaactt taaagacgga attggtaagt ggaataaata ctacgataaa ggattttatg 9481 aatatcgcgc ctatatttta gctgccgcag gaattgctga gtttaggagt tattctagag 9541 ccgataggat agtagcacaa attgtcaagt gggtttttgg tgaggccgat ttgattgggg 9601 aagaagctaa gttagtactg caacacacag accgtacaaa agtgatcgca gctttggtgc 9661 aactgctgca atcaaatcgt ctgaatgact acactcgtat ccaggtagca tctagcttag 9721 gtaaaatcgc tcctggcaat gaaaatgcga tcgctgcctt ggtgcaactg ctgccattaa 9781 atagtttata taccacctac gcacgttata cagagatagc aaaaagctta ggggaaattg 9841 gcacaggcaa tgaaatcgcg atctccgcct tagtgcaaat gctgctatca actaatatat 9901 ctgatatagg tgattatgaa gcttgtaagc tggcagcaga aagcttaggg caaattggta 9961 caggtaatga aattgcgatc accgccttgc tgcaactgct gcaatcacct aatttgaaat 10021 ggtacaactt cacccgttgg caggcagcag aaagcttaga gaaaatcgtc acagccaatg 10081 aaaatgttat cgtagcctta gtgcaactct tgcaatcaaa tggtgtggat gagaacactc 10141 gtaactgggc agcaaaaagc ttagggaaaa tcggcacagg caatgaaaat gcgatcgccg 10201 ctttggtaaa actgctgcaa tcaactaatg tggctgacca cacccgtagg ctggcagcag 10261 aaagcttagg taaaatcggc acaggcaata aagttgcgat cgccgcctta gtgcaacagc 10321 tgcaatcaac tgatgtggat gactacactt gtgaattagc agcatctagc ttagggaaaa 10381 tcgaccctgg caatgaaatt gcgatcgccg ctttggtaaa actgctgcaa tcaactaatg 10441 tgggtgacca cgcccgtagg ctggcagcag aaagcttagg ggaaattgac cctggcaatg 10501 agattgcgat tgctgcctta gtgcaactgc tggtatcaaa tgatgtgaag gcctggactt 10561 atacggagat agcagaaagt ttaagaaaaa tcgaccctgg caatgaaaat gtgataacag 10621 ccctggtaca actcctgcaa tcaaatgatc tggatgattt cacccgtagg ctggcagcat 10681 acagcttagg gaaaattgac cctggcaatg aaattgcgat tgccgcttta gtgcaacagg 10741 tacaatcaac tgatgtggat gactacacct gtgaattagc agcatatagc ttagggataa 10801 tcggcacagg caatgaaaat gcgatcgccg ctttggtaca actgctgcaa tcaactaatc 10861 tgtctcagga cacccgtggt gaggcagcag aaagcttaaa gaaaatcggc acaggcaatg 10921 aaaatgcaat cgtagccttg gtgcaattgc tgctatcaac taatgttgat cattttgccc 10981 gtatgttggc agtgcaaagc ttagggaaaa tcggcacagg aaatgtaatt gcgatcactg 11041 ccttggtgca agtgctgcta 
tcaactcaac tgctgctatc aactaatgtg ggtgagggtc 11101 ttcttagaca agcagcagaa agtttagggg aaattggtac aggcaatgaa aatgcgatcg 11161 ctgccttggt gcaactgctg caatcaagta atactaatat ggatgactac aactacaccc 11221 gtatccaggt agcaaatagc ttaataaaaa ttctacagga taataagcat cgctttgcgg 11281 tagtcaaaac tttaagtggt tataattgct acggtgtctt ttgggaatgc gcccagaata 11341 tgccttaccc cgatttctat cgagcttggc atcactccaa gtttgctact ggtgtggtgc 11401 aagggttaaa ggaaatcttc ttcacaagaa taatttaaat acttttgcag cagcgacgtt 11461 acaaataaag ttcttgtagt aacgataact tgtccattcg ttactaaccc agaagcgtcg 11521 taatccccaa atttagaatt gctgaacact caacaaatta gtagacaaat ataatataga 11581 taagctatat tattagattg tgaccatcca ttaaaataag ttactgtctg actgctaatt 11641 gtcactagta ttagcatcaa aaattgatga ctaaatcccg tctgggataa actagtaaaa 11701 attgatgaaa gactggtaga ctagataaac tgcaaagaca aattaggagt ggttggtaaa 11761 aaatcaatgc caaccttaag aataaactgc attcactcct gaacctcaaa ataatgaggt 11821 aaaatcttga atagttcaaa tgactgttta aggttgttaa tgaagtagga gccgagcagt 11881 catgttggtt tgtagcattg tataaatctc agccaacgga tactgggcag gtcgctgcct 11941 aagtcctact acaacaaaga acgtgccaga gcaatggctg ttgataaagg cattgccgca 12001 caaagcgcaa agcaacataa gccttgcaca aagttcaaga gttattcctg agaaaaagga 12061 agatcttttg gctgttctgg tgacgaatac aaggtaaaca agttcatcaa atactatgga 12121 ctcttggtga agacccaaag aagttgtaac tgtgagtacc taagcagttc gacatcaata 12181 ggtctgtcca aaaaatggga agtcgtcctg aagggcgttc acaagctctg tccttaaaaa 12241 tctgatggca acaaaaagaa aataagtaag gagatccagt aactgtgtct gtaggtatta 12301 tcggcaccaa actgggtatg acccaagtct ttgacgaggc aggagtcgcg attcctgtga 12361 ccgtcattca ggcgggacca tgcaccatta cccaggtaaa aacaaaacag accgacggtt 12421 actccgccat ccaagtaggt tatggcgagg ttaagccaaa ggcgctgaac aaaccattgt 12481 tgggacatct tgccaaatca tcagccccag cattgcgtca cttgcaggag tatcgcatag 12541 ataacgaaag cgaatatacc ctaggtcaac aaattaaagc tgatattttt agtccaggtc 12601 aaaccgtaga cgtgatcggc acaagtattg gtcgcggttt tgcaggtaac caaaaacgta 12661 acaacttcgg tcgaggaccg atgtcgcacg gttcaaaaaa ccatagagca ccaggttcaa 12721 
ttggtgctgg tacaactccc ggtcgtgttt atccaggtaa gcggatggca ggacgtttgg 12781 gtggtagccg cgtcacaatc cgccaactaa ctgtggtacg agtagaccca gagcgcaact 12841 taatacttat caagggaagt gttcctggta agccaggtaa tttagtagat attgtcccgg 12901 caattgtagt tggtaaaaag tcgtaactca ttagtcattt aactctggac taatgactaa 12961 taactaatga ctaatgacta aggattaaaa atgtttgagt gtgtagtcaa aaattggcaa 13021 ggagaacaag tcgggcaaaa aacgttcgac ttcaaggttg ccaaagaaga aacggcatct 13081 catgtcgtgc atcgagcttt ggtaagacaa atgaccaatg ctcgtcaagg aacagctagt 13141 accaaaaccc gttctgaagt cagaggtggc ggtcgcaaac cttggcgaca aaaagggact 13201 ggtcgtgcgc gtgctggttc tattcgttca cctttgtggc gtggtggtgg tgtcatattt 13261 ggaccaaaac cgcgagagta taacctcaag atgaaccgca aagagcggcg tttagctttg 13321 cgaacagctt ttgccagtcg aattgacgat ttgattgtgg tggaagaatt tagcgagcaa 13381 atccagcgtc ccaagactaa ggaattagtg ggagcgatcg cccgttgggg ttcagaacca 13441 gaaaacaaaa ccttattaat cttgtccgaa cgtacagaca acgtcttttt atcagctcgt 13501 aacgtcgcaa atctaaaact gcttggatct gaccagttaa atgtttacga tttgctgcac 13561 gccgacaaga ttgtagttac agcatcagcc ctagaaaaaa ttcaggaggt ctacagtgac 13621 tagatttgac ccccggaacc ttcctgacct tgtgcgtcgc ccaatactaa ctgaaaaagc 13681 gaccgtactc atggagcaaa acaaatacac ttttgaagtg actccaaagg caacaaagcc 13741 acaaattaga gcggcgatag aagatttatt tgaagtaaaa gttgaaaaag tcaacaccac 13801 aagaccacca cgcaaaaaaa agcgtgttgg aaaatttctt ggttacaagc cccaatataa 13861 gaaagccatt gttacagtgg cgtctgggga tgtagataag attagacaag ttctattccc 13921 agaggtttaa aaaaatgggt actcgttctt atcgccctta cacccccagt actcgccaag 13981 ttacgatttc tgacttctcg gaaattacca aaaccgagcc agaaaaatcg cttaccgtat 14041 cgaagcatcg cgccaaaggt agaaataatc aaggacggat tactgttcgc catagaggtg 14101 gtggacacaa acgtctgtat cgcattatcg acttcaaaag agataagcgt gagattccgg 14161 ccattgtgac agctattgaa tacgacccta accgcaacgc gcggattgcc cttgtgcaat 14221 acgaagacgg tgaaaagcgt tacattcttc aatccaacgg tttaaaagta gggacaacag 14281 ttattgccgg cccaaattct ccgatcgaaa atggcaatgc tttacctcta tctaacattc 14341 ccttgggaac cagcgttcac aacgtggaat tgaaaccggg taaaggcggt 
caaatcgttc 14401 gctcagctgg tgcgactgct caagttgtgg caaaagaagg tagttacgta acactcaagt 14461 tgccttctgg agaagtccgg atgatccgtc gtgagtgcta tgccaccatc ggacaagtcg 14521 gtaatcttga tgccagaaac ctgagcgcag gtaaagctgg tagaactcgc tggaaaggtc 14581 gccgtcctca agtcagaggt agtgtgatga acccagttga ccacccacac ggtggtggtg 14641 aaggtagggc acctatcggc agaccaggac ctgtcacacc ttggggtaaa cctgctttag 14701 gtttgaagac acgcaaaccc aagaaagcca gtagcaagtt tattgtgcgc cgtcgccgca 14761 aatcttctaa gcgcggtcgt ggcggtcgtg aatcttaatc agtaccgagt gttgagtcaa 14821 tagaactctt tgactcagca ctccacactt agaacgaaaa ctcattcctt aacacttctt 14881 tactcaaagc taactagtta tgggtcgctc tcttaaaaaa ggtccttttg ttgcggatca 14941 tctcctaaca aaaattgaaa ggctcaatga cgccaataaa aaggaagtta ttaaaacttg 15001 gtcacgggct tcgacaattt tgcccctcat ggtaggtcat accatcgcag ttcataacgg 15061 acgccaacac gttcccattt tcatcagtga ccagatggta ggacataaac tgggtgagtt 15121 cgcacccacg cgcacttata agggtcacgc cagaagtgac aaaaaagcag gtagatagtt 15181 atcagttatt aggaggcact gcgttgggcg ggttccccga cttgaagcat gtgccgttca 15241 ctagtggctt agtagatgac taatgactaa tgactaatca ctaatgacta agacggagaa 15301 aactatggca acagacacga ctgaagtaaa agcgatagcc cgttatgtac gtatgtcacc 15361 ctataaagtg cgtcgcgtac ttgaccaaat aagagggcgg tcatacagag aagcgctcat 15421 tctcctagaa ttcatgccct atcgctcctg cgagccagta ttgaaggttc tcagaagcgc 15481 cgcagccaat gctgaacaca acgctgggct agacagaaca gagcttgtga ttactcaagc 15541 ttacgctgat caaggtccgg tgctcaaacg gttccaacca agggcgcaag gtcgtgctta 15601 ccaaattcgc aagccgacgt gtcatattac tgtagctgtc gctgctgaac ccgccgcagc 15661 taaataaaga aaaaatagcg tacaggtaca cgagaagaat ttttgtttag aggaagcatt 15721 tgtgggacag aagatacatc cagtaggttt tcgcttggga gtcacccaag agcatcaatc 15781 ccgatggttt gctgatccta gccgctatcc agaactttta caagaagacc acaaacttcg 15841 tcagtatata gaacaaaaac tgggtagata cgcacaaaac aacgctggaa tttctgaggt 15901 gcggattgat cgtaaagcag atcaaattga tttagaagta cgtacagctc gacccggcgt 15961 tgtcgtaggt cgaggcggac aaggtattga atctctacgc actggtctcc agcaattgtt 16021 gggaagcaat cgccaaattc gcattaacgt 
tgttgaagtc caacgagttg atgctgatgc 16081 ttacctcatt tctgaataca tcgctcaaca actggaacgc cgcgtttcct tccggcgggt 16141 agtacgccaa gccattcaac gtgcccaacg cgctggcgta caaggaatca aaattcaagt 16201 cagtggacgg ctcaacggcg cagaaattgc ccggacagag tcgagtcgtg aaggtagtgt 16261 tcctttacac accttacggg ctgacattga ttacgcatac tgtacagcaa aaactgttta 16321 cggaattctc gggattaaag tgtgggtgtt taaaggagaa attattccag gacaggaaca 16381 aactcctccc ccagcaacaa atcgcggcga tcgcgaccgt gatcgtgacc gcgatcgccc 16441 cccatctcgc cgtcaacaac gccgtcgcca gcaatttgaa gaccgctcaa atgaaggata 16501 gttattagtc attagctttt gacttgttac cttcactttt gacttttaac ttttgacaca 16561 tcatgctaag tccaagaaga acgaaattcc gcaaacaaca gcgcggacgg atgaatggtc 16621 tagcccatcg gggtagcgac ctgaactttg gggattttgc tttgcaagct ttagaaccct 16681 gctggatcac ctctcgtcaa attgaggctt ctcgtcgagc aatgactcgt tacatccgcc 16741 ggggtggtaa aatctggatt cggatttttc ctgataaacc agtgacaatg cgtccagcag 16801 aaacccggat gggttccggt aagggttcac ctgagttctg ggtagcnnnn nnnnnncgtc 16861 aaaccaggac gaattttatt tgaaattgca ggcgtttccg aagaaattgc tcgcgaagca 16921 atgcgtttgg ctgctttcaa attacccatc aaaactaagt tcattgcgcg ctcacaagag 16981 caatcacagg tgtaggttat gcctcttccc aaaatttcag aagcaagagt tttaactgac 17041 gaacaactag ggcaggagat tatcgctgtc aaaaagcaac tctttcagtt gcgcttgcaa 17101 caagcaacca gacagctaga caaaccccat ctgtttagac acgctcgcca ccgtctggct 17161 caactaatga ctgtagaagc agagcgcaaa cgcgctttat caagtcaacc cgcgaaagaa 17221 acagagtagg agattatggc agtaaaagaa cgagttggct tggtcgtaag cgacaaaatg 17281 caaaaaacgg tggtagtcgc catcgaaaac cgcgcacctc atccaaaata cggcaaaatt 17341 gtcgtaaaca cccagcgcta taaggtgcac gatgaagaca atcagtgcaa aataggcgat 17401 cgcgttcgca ttcaggaaac gcgacccctg agtaaaacta agcgctggaa agtcacagaa 17461 atcctgacta ccaaaaatag ctaaaccata tagttattaa gtagttaaca agacacaaag 17521 ggaaaacgat tgtgattcag ccccaaactt accttaatgt tgcagataat agcggtgccc 17581 gtaaactcat gtgcatccgc gtattaggag caggtaaccg tcgttacggc ttcataggcg 17641 atcgaatcat cgctgttgtt aaagacgctc agccgaacat ggctgtgaaa aagtctgatg 17701 ttgtagaagc 
agtcattgtc cgcacccgtc ataataccca tcgagatagc ggtatgagta 17761 ttcgttttga cgataatgcc gcagtgatca tcaacaaaga tggtaacccc aaaggaacac 17821 gggttttcgg accagtagca cgagaactac gtgataaaaa ctttactaaa attgtttctc 17881 tggctccgga ggtgctgtaa tggcaaacaa aaagaaagat caacctagat ttttcaaaat 17941 gcacgtcaaa accggggata ccgtgcaagt aattgctgga aaagataaag gcaaagttgg 18001 agaaattatc aaagcaattc cccaagaaag taaagtgctt gtcaaaggtg taaatatcaa 18061 aaccaagcac gtcaaacctc agcaagaagg tgaatctggg cgtattgtga ctcaggaata 18121 tccaatccac agttccaacg tcatgctcta ttccaccaag caaaatgttg ccagtcgcat 18181 ctcctacact ttcactgcag aaggtaagaa agttaggatg ctaaagaaaa ctggtgaaat 18241 catagacaaa tagtgacaag ctgcggtgga caagtctggc ggcataaaag agtcaatgtt 18301 tggctaaagt ttcccgactt gtaaacgccc actaaaaggc ttcctcatag cgtaaggtag 18361 cacagcggcg ttaagtggtg aacccagaag atcattggtc attagtaaaa aaacaaaaga 18421 caatagacaa agagtaagga caataatccc tgaccaagac cagggataaa aagaacaaaa 18481 actatggcaa caagactaaa aactttatac caagaaacaa ttgtccccaa actgatgcaa 18541 cagtttcagt acaccaacat tcatcaagtg ccgaagttgg taaaagttac tgttaaccgg 18601 ggtttggggg aagcatctca aaatgcgaaa gcgttggaag cgtctttgac cgagattgcg 18661 acaattactg gtcaaaaacc agttgtgacg cgggcgaaaa aggcgatcgc gggcttcaaa 18721 attcgtcaag gtatgccagt tggtatcatg gtcaccttaa gaagcgaaag aatgtattcc 18781 tttcttgacc gtctcatcaa tctgtcacta ccaagaatcc gggactttcg cggtattagt 18841 cccaaaagct ttgatggtcg cggcaattat actctaggtg tcagagagca actcattttt 18901 ccagaaattg agtatgacag cattgaccaa attcgtggta tggatatttc cattatcacc 18961 acagcaaaga ctgacgaaga gggtcgcgcc ttacttaaag aaatgggaat gccctttcgg 19021 gatcaataag ttcatctaaa gaggaaacga tggcggttaa cgacacaatt gcagatatgc 19081 tcacgcgcat ccgcaatgcg aatatggcgc ggcatcaaac aacacaagtg ccatccacca 19141 aaatgactcg tagcatcgca aaagtgctgc gcgaagaagg ctttattggt gagtttgaag 19201 aagcaggaga aggagtggca cgcaatttgg tgatttcctt gaaatacaag ggcaaaaatc 19261 gccaacccct cattaccacc cttaagcggg tgagtaaacc aggtttgcgt gtttactcca 19321 atagaaaaga actaccaaga gtactaggcg gtatcggtat tgccattatt tccacatcta 
19381 gtggcattat gactgaccgg gaagcacggc gtcagaactt gggtggtgaa gtactgtgct 19441 acgtttggta gtcatgagtc atgagtcatt ggtcatttgt catgagtaaa gaaaagacaa 19501 ataacaatag acttaggaca aataacaata gacttaggac aacaaagtta tgtctcgaat 19561 tggtaaacgc ccaattacag ttcccgccaa ggtggaagta agcattgatg gcacaaaagt 19621 gctagtcaaa ggtccaaaag gcgaactttc tcgcgatttg cctcctcatg tcatagtctc 19681 caaagaagga gaaatattgc aggtcaaccg tcgcgatgaa tctcgcacat cccgccaaat 19741 gcacggttta tgccggactt tggttgccaa tatggttgaa ggagtttcca aaggatttca 19801 acgtcgtttg gaaattcaag gggttggcta ccgggcgcaa gttcaaggtc gcaacctggt 19861 tttgaacatg ggctatagcc atcaagtgca aatcgttcca ccagaaggaa ttcaatttgc 19921 agttgaaaat aatactaacg ttattatcag cggttatgac aaagaagtgg taggcaacac 19981 agcagcgaaa attcgtgctg tgcgcccacc ggagccttat aaaggcaaag gtattcgtta 20041 cgccggtgaa atggtcagac gtaaagctgg taagactggt ggtaagggca agaagtaaaa 20101 atgaaacata ctcgtaaaga atcaagggaa cgtcgccgta gacgcattcg tggtaaagtt 20161 gatggttctt ctgaccgtcc acggttagct atatttcgct ctcaccaaca tatttatgct 20221 caggtgattg atgatactaa tcatcacact ttggtggcag cctcaactct ggaaccagac 20281 ttcaaatcaa aattagcttc aggctcaaac tgccaagctt cggtggaagt tggcaaattg 20341 attgcagttc gctcattaga aaaaggtatc agcaaagtcg tctttgaccg aggtggcaac 20401 ctttaccacg gacgcgtcaa agcactagca gaagcagctc gcgaagctgg tttggatttt 20461 taaataagtc attgctcatg agtcattatg aggcagtgcg gtgggtagtc gccgaggtct 20521 tggtggtttc cacgccactt gcactcaacg aggagacctc caagggcgca gtggctccca 20581 aggaggaact gccgtgcatg gttacccggc ataaaggagc cagtgcggac gccacatgca 20641 gcccggtggg agacccgaag tgcagcagtg gctcgtcttg gggtctcccc aagtggagca 20701 cagccgcgtc acctgccgtt cattagtaaa gaaagacaaa tgaccgagga caaatgacaa 20761 ttggcttcta atacgaggta attaaaatgg caactggtcg tcgtaaagct aatcgtacaa 20821 aaaaagaaga gactacatgg caagagcggg ttatccaaat ccgacgggtg agtaaagtcg 20881 taaaaggagg taaaaaactg agcttccgcg cgatcgtcgt tgtcggtaat gaacgcggtc 20941 aagttggggt gggagtaggc aaagcctcag atgtgattgg tgctgtgaaa aaaggcgtcg 21001 ctgatgggaa aaagcatctc attgaaattc ccatgaccaa 
atccaactct atcccccatc 21061 ctattgatgg aattggtggt ggtgccaagg tcatcatgcg accagcagca cctggtactg 21121 gagtaattgc tggtggtgct gtacgtacag tactggaatt ggcaggagtt cgtaatatcc 21181 tagctaagca acttggttca aacaatcctc tcaataatgc cagagctgca gttaatgccc 21241 tatctaccct gcgaactttt gcagaagtcg ctgaagaccg gggcattcct gttgaaaatc 21301 tttacattta agtcatttgt cttagtcctt tgtttttata gaaatgacaa agaatcctct 21361 gggtatgcct ttggcacgtg tatgtcctgt agcgcactga cacggctatg cccccaagtc 21421 ctgcggaagc ttgccccaac aaagcagcgt ctggaggaag agatacggac accctacgcg 21481 aatgacaaac gccacttgct ttatggctgc tagaccatgc acagcagtcc agttattggg 21541 caaacctcaa gggcacactg cctccctaaa cttcttaagg tatgcgcaca tacactttgc 21601 gcttacgcca gtcaccaagg cgcaccgaaa aacttggact gcagtgctgg actcattacg 21661 gcagtggctc acaaatgata aatgattaaa ttttacacaa ggtttttttt atgagactaa 21721 ccgatgttcg tcctcaaaaa ggctctaaga aacgtccccg tcgtttaggt aggggtgttt 21781 ctgccggtca aggtgctagt gctggtaagg gtatgcgtgg tcaaaaagcc cgttcaggcg 21841 gcagtacacg acccggtttt gaaggtggtc aacagccatt gtaccgccgc atacctaagt 21901 taaagggctt ccctctggtt aatcctaaga agtacactac gattaatgta gaaaagctag 21961 cctccctccc tgcaaataca gaagtcacac tcacctcgtt aaaagaagca ggtatcctta 22021 ctgctagtcg gggacctttg aagattttag gggatgggga attgagcgtt cccctcaagg 22081 tgcaagcggc agcttttaca ggtacagctc gtagcaaaat tgaggcagct ggtgggagtt 22141 gcgaagtttt agagtgagtc acaaagcacg cacctacgac aagcctaccc cagactcgtg 22201 cttcactata ctcgcaaggt agccctctat gatcagtcga gacaaagccc caacggctca 22261 agaaactttt atgcagatgg cgcaagcagc aggtctgcga ggtcggctgc ttgtgaccct 22321 aggtatttta atgttgattc gtctgggcat acatttgccc ataccaggta ttaatcgaga 22381 tgaatttgcc agaacccttc aaaataataa ccaggtatta agctttttgg acatcttttc 22441 cggaggtgga ctttctgctt taggagtctt tgccttaggt attttgccat acatcaacgc 22501 ctcgattatc atccaattgc tcaccgctgc catcccagct ttagagaact tacagaaaaa 22561 tgagggagaa gcaggacggc gcagaatttc ccagatgaca cgctatgttt ctgtgggttg 22621 ggcgattatt caaagtgtct tcttggcatc gttctggctt aaaccgtttg ctgtaaacta 22681 cgggccaatt ttcgttatcg 
agacagcgct ggctctcgta gcaggttcga tgtttgtcat 22741 gtgggcatca gaagtcatta cagagcgtgg cgtaggcaac ggagcatctt tgttgatttt 22801 tgtcaacatt gtggcaacac tcccaaaagc tttaggcgac acaatcgatt ttgtacaaac 22861 cggaaaccgg gaaacagtag gtcgagttat tgtgctgttg ctccttttct tggtcacgat 22921 tgttggtatt gtttttgtgc aagaaggaat ccgtcgtatt ccgattattt cagcacgccg 22981 ccaagttggt agacgaattt ttgcagagca gcgtaattat ttacccctac gactcaatca 23041 agggggcgtg atgccaatta tttttgcgac tgcaattcta agtttgccag tgttggcagt 23101 gaatttcatc aaaaatccag aatattcgag aattattaat acctatctcg tacctggcgg 23161 ttctggggct tgggtctatg cccttgtcta tctagtttct atcgtgttct tcagctactt 23221 ctattcttca ttgatagtca acccagttga tgtagcgcag aacttgaaga agatgggttc 23281 tagcattcca ggtattcgtc cgggtaaagc aacaagcgag tacattgagc gcgtgctgaa 23341 tcgattaact tttttaggcg ctatctttct aggcttaatt gctattatcc ctactgccgt 23401 tgaaagtgct ttaaatgtac gaacctttag ggggctaggt gctacctctt tgttaatcct 23461 ggttggtgtc gcgattgata cgtcaaagca gattcaaacc taccttatct ctcagcgcta 23521 tgaaggaatg gtgaaacagt agtgacacga ctaatcttct tgggaccacc tggagctggt 23581 aaaggaactc aagctaaaac cttagctgac gagtggaata ttcctcatat ttctacaggt 23641 gacatactgc gtcaaggcct caaagaccaa actcctttgg gtgttaaggc tcaaagttat 23701 atggataaag gtgagttggt tcccgacgag cttgtgcagg aaatggtgca ggaacgtctg 23761 agccaggttg atacgaaatc aggttggatt cttgatggct ttccccgtac tgtcaatcaa 23821 gcagtttttt tagaaaaatt actgcgaatc ctcaaccaaa atggtgaaaa ggttgtcaat 23881 ttggatgtgc cagatgaaac tgtgatagca cggttgctgg agcgaggcag aaaagatgat 23941 tcagaagaag tcatccgtcg tcgcttagaa gtttaccgtt ctgaaacggc acccttgatt 24001 gacttttaca gtagtcgcca gactctcgtc agtattaacg gtaatcagtc cctagaagaa 24061 gtcactgctg aattaaaaaa ggtcattgcg tcttagaaca agggagtagg tgcaggttaa 24121 aagggacaaa tttgtcccag atatttctac ctaacatcct attccctgtc accccaagca 24181 gatgtatgta aatcttggat aagatatatt aatgaactgt tgatatattc tccgaaatat 24241 gttcttttgc agactatatt aagtagctgt aacaaactag gaggctgttg agttgaggaa 24301 aaaacttgtc taaacaagat ttgattgaaa tggaaggcac ggtcaccgag tccctgccaa 24361 
atgcaatgtt tcgtgttgat ttagataacg gcttcaatgt gttagctcac atttctggga 24421 agattcgccg taactatatc aagattttac ccggcgatcg cgtcaaagta gagttaacgc 24481 cttacgacct gacaaaaggt agaatcactt atcgactacg taagaaatag ctcattttta 24541 aacgattcat cgcaaggaaa ttttaaagat gtaaaaaatc ataccatagt gattgcgaga 24601 ttaacaacga tttttactga ctaattaatc ataaaatgtt agaattcact gtttggagtc 24661 aagaaaaaag gcatgaaagt cagagcgtct gtcaagaaaa tctgtgaaaa gtgtaacgtg 24721 atcaggcgtc gtggtcgcgt catggtgatt tgtgtgaatc cgaagcataa gcagcgccaa 24781 ggataacaca acgaaaaagt gaaaagagtc atcaggaaaa tttattcaaa gaactgctga 24841 ctaacaaaaa acacactact agctttaagg aatagggaga cacgagttgt ggcacgtatt 24901 gccggagtag accttccacg tgataaacgt gttgaaattg gtctaactta tatctacgga 24961 attggtttat ctctatcaaa agagatatta gcagccacga gtgtcaaccc tgatactcgt 25021 gttaaagacc tgagtgatgc tgacgtagca gctctcaggg gagaaataga agagaattac 25081 caagttgaag gtgacctcag gcgcttggaa gcaatgaaca tcaagcgctt aattgagatt 25141 ggcacttaca nnnnnnnnnn ttacaggggt cgtcgccacc gcatgggctt gccagtgaga 25201 ggacaaagaa ctcggactaa cgccagaact cgtcgaggaa gacgacaaac agtagctggt 25261 aagaagaaag ccccatccaa gtaattgttt ctacgctaac ctacagcaat acttaacctt 25321 aatatttaac caagtgcgat atggcgcgac aacaaagcgg aaagaaaacc ggcagtaaaa 25381 aacaaaaacg caatgttcca aatgggatag cctacatcca gtctactttc aacaatagca 25441 ttgttaccat tagcgatcaa aatggagatg tcatctcctg ggcaagtgct ggatccagcg 25501 gttttaaggg agcaaaaaaa ggtactcctt ttgcagccca gactgcagct gaaagtgcag 25561 cacgtcgagc aatcgaccaa gggatgcgcc aaatagaagt catggtcagc ggtcctggag 25621 ctggtcgaga aacagctatc cgggcgcttc aaggagcagg actggaaatt acattaattc 25681 gggatattac cccgattcct cataatggtt gccgcccacc caagcgccgc cgagtgtaaa 25741 gacaacttat gaagcggtga gtccagcgcg aatgacggtg agaccagtgc tgcgggaggg 25801 tttccctccg caggcatctg gcgaacccga agggctttcc ctcacttggc gactggcgta 25861 tgcgcaaagc gcacgcccac agggctaaag ccgtaaggcg tgcgctttgc gcatacccga 25921 agggtgaggg cgagaagttt gagaaaaccc tggttgcaca aggattctac caaagatctc 25981 cactagggag agttttcttt cgtaccattg agcaacttcc cccggaaaac 
tcaaaacttc 26041 aaaaaacaaa acaccatcac tgcgggttgg ctttttgagg gatcaaactt caccgccatt 26101 aaacttacag tttgataaac ctggtagttt aagagggcat taagtaaatt cgagagggca 26161 cagccgcatg gtgagcgtaa gctccaacag ggtaacccga aggcaagcaa ctagcgaaag 26221 gctttgtcaa cttgaaagga caacttgtac aagttcctcc gagtatgcaa acagccattt 26281 cattaagggc actggtgtac aactgtccgc ccttcccact gtgttgtggt tcggcagctg 26341 ctctgtggca ggtttaccgc accctggtga ctggcgtcag ccctatgccg taggcatagg 26401 gcttagaaca taggagataa tcgcacgcag ttggtgctcg ttgaatgcaa ctggcgaaaa 26461 gcgccctccc tagttcgcgt agcgtctttg caggagagac ggcagttgcc cagtttccac 26521 gccagatggc accgtcggag gaacctccct tcgggtatgc ctgccttcgc cctccggtct 26581 aacgccagtc acctacggcg ggaaaacgcc tcatgcgtgc tggtctcacc gcaacgcact 26641 ggctccccaa taagaacagc cgaaagggtt tctcagcttg tagacgcgct tttcggcatc 26701 ctcaaaggta gcaagtggtc tctcaagctc tggatgtcac ttgcagctaa ttaaggcaaa 26761 atcacccaaa gaacgtctac aaagtagcat accgaagcac agcccaggct cgacttgaag 26821 agctttccct tggacggatt tgccgactct gggcaactgg tgagaccagt gcggcaccga 26881 cgtttccctc ctgccaggtc tgacccttcc gttggggttt cccgatggag aaaaccggtt 26941 aaccagaagg acgtcgtcgg actgcccgtt gaggacacta gtaaaaacgg ttttcacgac 27001 aactcagtgt cactcaaggc aataaatttg tttgctagca agaccttatg ttgagggagg 27061 ctaccccatg gcgcagtttc aaattgaatg tgtagagtca agtactgaag ataataggag 27121 taatcacagt aaatttgttt tagaacctct ggagcgcggt caaggtacga cagttggcaa 27181 cgcactcaga cgggtactac tttctaactt ggagggtaca gctgtcaccg cagttcggat 27241 tgcgggcgtg acacacgagt tcgccaccgt tgtgggtgta cgggaagacg tgctagaaat 27301 cctgatgaaa atgaaggaag ttatccttaa aagctactct tctcaacccc agattggtcg 27361 gttacttgtt acaggcccaa caacagtgac tgcgggacat tttgatttac ctagtgaagt 27421 agaagtcatc gatccgaccc aatatgtagc tactgtcgct gaaggtggca agctggaaat 27481 ggaatttcgg attgaaagag gcaaaggata tcggactgta gagcggggtc gtgaagaagc 27541 tacatcttta gactttctcc aaatcgactc agtatttatg ccagtgcgga aagtcaacta 27601 cagcgttgaa gaaacccgtg gagatagtgg gataccaaaa gacagattgt tactggaaat 27661 ttggacaaat ggtagtctga ccccccaaga 
agcactctcc tctgccgcga ctatactggt 27721 ggatttgttt aacccgttga aagagatctc gctcgacata ccggatgtcg gcgcagaagt 27781 cccagacgac ccaaccgctc agattccaat cgaagagttg caactgtctg tacgagctta 27841 taactgtctc aaacgagcac aagtaaactc tgtcgctgat ttgttggatt acacacaaga 27901 agacctatta gaaataaaaa actttggtca aaagtcagct gaagaagttg ttgaagcctt 27961 acagcgacgc ttggggatca ctttaccaac tgaaagagct aacaaacaca cctaacactg 28021 gttattggtt agtagttagt agttaggaca acaacccttc gagtctgtct atggctaacg 28081 ctcactcgtt ctgtggagac ttgccccaac aaagcagcgt cttgtggagg agatacggca 28141 cacctgtgtc gtatgtcatt tggatacgct ggtttaaagg gtgaacgcca gcgcctaagc 28201 cgaagagaac ctctgccggc gttagactta ccaagggcgc attgactcac caataactaa 28261 taaccaacaa ccaacaacaa ccaactaaca actaacagaa agattatgcg tcaccgttgt 28321 gggattaata aactcagcaa accagctgac caacgtcgcg ctctgttgcg atcgctcacc 28381 actgaattaa ttcgtcatgg acgaattact accactttag cccgagctaa ggtgctacgc 28441 tcagaagtgg acaaaatgat tactcttgcc aaagttggtt ctttagcagc acgtcgtcaa 28501 gccctcggtt acatatacga caaacaattg gttcatgctc tgtttgagca agttcccaca 28561 cggtatggca atcgccaagg gggttacacc cgtatcctgc ataccgtacc gcgtcggggt 28621 gataattcta agatggcaat catcgaactc gtctaaacaa ataaaggcag aaagtagaaa 28681 gataaaattt tcatttgatc tttctaactt ccgcctcaag ttgattttat gttaagtagc 28741 caccagccta cacaaacttg tcgagtcgcc ctggtcattc aatacttggg cactcatttt 28801 catggctggc aacggcaacc acaacaacgc actgtccaag aagagataga aacagctctt 28861 tgtcacatcc taggtcatcc agtgacactg tacggcgcag ggcgaactga tgctggagtt 28921 cacgcagctg ctcaagtggc acattttaat gtcacaagtc cgataccagc tcacaaatgg 28981 gcaaccattc ttaacagcta tttgcccaaa gatatactaa ttcgagcctc cgcagaagtg 29041 agtcacaatt ggcacgctcg ctttagtgct atctatcgac ggtatcgtta cacgttctat 29101 actgaaaagt taccgaactt gtttgccagc gcttttagtt ggcattatta tcacgaacct 29161 ttggatgaat ctctcatact cgcagcactg gaacctttga ttggaaagca tcacttagct 29221 gcttttcacc gcgcaaactc aggacgccag cattcttggg tagaagtaca agcagcagag 29281 tgttatcgca ctggaccact tctttatatt gaaattcagg cagatggatt tttgtacggt 29341 atggtgcggc 
tattagtagg gatgttggta caggttggtt ccagacaact cacactgact 29401 agtttcaccg acctttggaa agatcaacga cgtcaagaag tgaaatatgc cgcacctgct 29461 cacggcttat gcttgttgcg agtcggctat cctgatttcc cgtttccccc agagatttgg 29521 tttgagactc aacccaaatt agtctttggt caataaggac taaagacaaa ggactaaaga 29581 caaaggacta aagacaaatg acaacagaca aaacatatct tccacctcag aatactatgg 29641 agcgcgattg gtacgtcata gacgctacta accaacgcct cggtcgctta gcaagcgaaa 29701 tcgccatgat tctcagaggg aaaaataaac cccactatac ccctcacatg gacacaggtg 29761 atttcgtcat cgttgtaaat gctgaaaaag tcgaagtcac aggtaaaaaa cgcacacaaa 29821 agctttaccg tcgtcattcc ggtcgtcccg gtggaatgaa gaccgaaacc tttgacaagc 29881 tacaacaacg tttgccagaa aggattgtgg aacatgccat caaaggtatg ctaccgaaaa 29941 atagcttggg acggcagttg tttactaagc taaaagttta cgctggacca gctcatcccc 30001 atgcagcaca acaaccaaaa gaaatacaaa ttcagacaat tccaggagta caagattaat 30061 gcaagcagca gataatagcg gtcgtgctat gtattggggt acaggtcgtc gtaaatccgc 30121 gatcgcgcgg gttcgcttgg ttcctggtac tggacaaatg actgtcaatg gcaaaccagg 30181 agacctttac ttccaattca acccgaacta catttccctt gccaaagcgc ctctagaaac 30241 tctaggcttg gaaaacgaat atgacattct tgtcaaagct gaagggggcg gtttaactgg 30301 acaatcagat gcagttcgtc tgggagttgc tcgtgcgctt tgccaactcg acccagaaaa 30361 ccgtccgcct ttaaaaatcg aaggctacct cactcgcgat ccacgagcaa aagagcgcaa 30421 aaaatacggt ttgcacaaag ctcgcaaagc tcctcagtac tccaagcgtt aagcaatttt 30481 ggattttaac cgtagcctgt agcagataaa cgcagataat tttctgtgat attcgcttac 30541 atcagccttt atcagcagtt gctacaatga tcccaaaaat ctaaaatcca aagttctaca 30601 acccctgaaa acctgaaagt tataattggt atagatagac cacggaaaga aaaatggcaa 30661 aaccagagat tcatccccag tggtatccag aagcaaaagt ttactgtaac ggtcaactag 30721 tgatgaccgt tggctctact aagccagaat tgcacgtaga tgtttggtct ggaaaccacc 30781 ccttctacac aggtactcag aagataattg actccgaagg tcgcgtagaa cgctttatgc 30841 gtaaatatgg catgatggat ggtcaatcca aaggcggaag aaggaaaaag tagcggctta 30901 gctttagctg ttaacaacga ccctgcatca gctgggtcga ttgttgtatt taatgctcaa 30961 gcgaatttgc tatttttaag gagtgactgt attcgtcatg gctgaaacat acctgctgga 
31021 gaaactaaaa tccgttgaac aaacttttca tgaattgact cgtcgccttg ccgaccccga 31081 tactgccaaa aatcctgatg agtatcaaaa aatcgcaaag gcacgctctt ctttggaaga 31141 agtggtagat acctacgaaa cctggaaaaa tgcgcaggaa gaattcgtag gggcgcgtca 31201 ggtgctcaaa gaatctcaaa gcgatccaga actgcaagaa atggcagccc tggaagttca 31261 agaacttgag caaaaaatag aacagttaga aaatcgccta aagatattgc tattaccccg 31321 tgaccccaat gatgataaaa acatcatgtt ggaaattcgt gctggtactg gtggtgatga 31381 agcaagtatc tgggcaggtg atttagtacg cttatactca aagtatgctg acagccaaag 31441 ttggcgcgtg aagttagtca gtgagtcctt ggcggaaatg ggcggcttta aagaagccat 31501 attggaaatt caaggcgata gcgtatacag caagctaaag tttgaagctg gagtgcatcg 31561 cgtgcagcgt gtaccagtca ctgaagctgg aggacgggtt cacacgtcaa cagcaactgt 31621 ggcgataatg ccagaggtgg atgaggtaga aatccacatt gatccgaagg atattgaaat 31681 gagcacagct cgttctggtg gtgctggtgg acaaaacgtc aacaaagtgg aaacagccgc 31741 tgacttgttc cacaaaccca ctggtatccg gattttctgt acagaagaac gcagccagtt 31801 gcaaaacaaa gaacgggcaa tgcaaatcct gcgagcaaaa ctgtatgaaa tgaagttgcg 31861 agaacaacag gaagctgtga cttctatgcg gcgatcgcaa gtcggtacag ggtcacgttc 31921 tgaaaagatt cgcacttata actacaaaga taaccgcgcc accgaccacc gcttaaatca 31981 aaactattcg ctcaacgcag ttctagaagg ggaacttgaa cacattattc aagcttgtat 32041 ttctcaagat caacaagaac gcttagcaga actcgctgct tctggcgcta atagttaatt 32101 tttaggctaa taaaaaaaaa gtaggtggaa caggattacc ctctccacct actttaattt 32161 atattggctc tttttagaaa atcatttacc aaatgtctgc tgtctttttg ttataagggt 32221 ctcctgtaaa tggttgtacg ccagtattgg gatatgcatc gtctgatggt gcaaaaatac 32281 gcattgcagc ttcagaaata tattgtgtga ctttagcaat aatctcggag atagccataa 32341 tatcaaaccc ccatttttgt tatcgcttca cttgcgatat gtcttctaat atatctactc 32401 tagttcttca gattatcctt ggctctcatt ggcaatatat ctacattctc cgcaataatc 32461 agtgatagat agttgcaata ctttagctta acaaaacagt tcaagaaaga atcaaggata 32521 agggaatgaa agtgttagta atatgatttt ttatccttca tccttcgttt ggcatagttt 32581 gatatttcac ttcactcttt taggggtacg atcattatta ataagtataa ctattaaaat 32641 tcattaaata atttactaaa ctaataatat agtagccgtc 
aaaacgctag ttccaagcac 32701 tttttgacag tgctcgttca ataattcaca ttatttatgg taaaatttac cataactcaa 32761 agaagtattc ttctaagggc gctatgtcat gactagtttc ccaagtttga accatgcatc 32821 tacaaacaat caccaacaac agaacacaga gaatcaagaa ctgacacttt ctactgttga 32881 taacagcttg ctcacagacc tttaccagtt gacaatggca gcttgttatg ctggggaagg 32941 tgtagaacaa cgatgggcaa gttttgagtt gtttgtcaga cgtttgccag agaattttgg 33001 ctatttgata gctatggggc tggcacaagt attggaatat ttggaaaaat accacttcag 33061 tccttctgga atagcggctt tgcaagcaac aggaattttt gctaatgttc ctgagagttt 33121 ttggtcactg ctaggtgagg gacgttttac aggtaacgtt tgggcggtac ctgagggaac 33181 agcggtgttt gccaatgaac ctctgttgcg agtggaagca cctctttggc aagcgcaact 33241 ggtagaaacc taccttttga ataccataaa ttaccagagt ttgattgcga cacgtgcagc 33301 aagattacgc gatgtggcgt caactcatgc aacattgctg gaatttggaa ccagacgggc 33361 atttagcccg caagcatctt tgtgggcggc gcgtgcagcg ctggctggtg gtttggatgc 33421 aacatcgaac gtgttagcgg cgctacaact gggagaaaaa cccagtggta caatggcgca 33481 cgctttggtt atggcactgt cggcaatgga aggtagcgaa gaccaggcat ttactgtctt 33541 tcatcgttat ttcccaggtg cgcccttatt aattgatact tacgatacta tcgctgctgc 33601 ccagcggttg tctgcaaagg ttaattcagg ggaaatccaa ttagcggggg ttaggttgga 33661 ttctggtgat ttggtgtcat tgtcaaaaca agtgcgatcg cttcttccag atgtgtcaat 33721 ttttgcaagt ggcgacttgg acgagtggga aatcgcacgg cttaaggctg ctggtgcgca 33781 gattgatggt tatggactcg gaacgcgact ggtgacaggt tctgctgtca atggagttta 33841 caaactggtg gaaattgatg ggactccggt gatgaaacac tcgagtggta agacgactta 33901 cccaggacgt aagcagattt ttcgctcttt tgagggaagt caggtgaaag cagactcttt 33961 gggcttggtg actgaacaag ggagaagata tggaaacgaa gaatttcccc aatcttcaca 34021 agtgcctttg ttgcagttgt ttgttaagga aggtaaacgg gtgcaaccgc tagagacttt 34081 ggcagaaatt cgacaaagga ctgccacgtc cgttgctagt ttaccagacg agacgcgacg 34141 tttggataat cccgtatcgg tgaaagtgga gatttctgcc cagttacaac agttgactga 34201 aaagacgaaa aacctaaccc cacaggtaca gacagaatga gaagaaggtg gattgaaaac 34261 tcctaactaa tgacccttcg ggtatctcct gcaccagacg ctgcgcttta gccctttggg 34321 cgtgcgcttt gcgcatacgc 
cagatgcctg gctgtcggga aaacgccaga tgctacaacg 34381 gggggaaccc tccgcagtcg ccacaacggg gggaaccccc gcaaggcgct gctctccgca 34441 acgcactggc tcccctcccg cagcactggt ctcaccaata actaatgact attaactaat 34501 actatgaaag ttgctttgtt tggtacaagt gctgatccac caactgctgg acatcaagcc 34561 gttattcgtt ggctgtcaga ccattttgat tgggtggcag tttgggcggc aaataatcca 34621 ttgaagtcgc atcaaacccc tttggaacat cgggtagcga tgttacattt gttgattttg 34681 gatatagatt catcaaagca taatattggt ttggaacagg aattaagtca cttgagaacc 34741 ctggaaacat tagagaaggc aaaaaaatgt tggtcagagg cggagtttac attagtggta 34801 ggttcagatt tactaactca attgcctcgt tggtatcata ttgaagattt attgcaacaa 34861 gtacaacttt tggttatacc gcgcccaaga tatgcaatag atgaatctaa tttagatatt 34921 gtgcaaaaac tcggggggaa agtcacggtt gccaacttta ttggtttaga tgtgtcctca 34981 acagcttatc gtaaaaatgg cgatctctca gccctaacac ctctggtcgt tgattatatt 35041 aacaaagagc atttgtacaa atgtgtaccc gaaaatttga tttgttaaac tctaagcgct 35101 tccaccaagc aatcccaaat tccaaactcc aaattgcatt caattgaaaa cggctatatg 35161 ccaggacaca accaaagaaa gattgtacat gcgctaaaac aaccaccttt agcagatttc 35221 aaggtcggtg ttgataatgt tattttctct gtagatactg cacaaaatcg actgttggtt 35281 ttattagtga tgagacagca ggagccattt ttaaactctt ggagtcttcc tggtactttg 35341 gtgcgtcaag gtgagtcttt agaagatgct gcttatcgga ttttgtctga aaaaataagg 35401 gtaaaaaacc tttatttaga gcaactttat acctttggag gaccatcacg cgacccccga 35461 gaagcaagca atagtgatgg tgtacgttat ctatcggtga gttactttgc cctcgtgcga 35521 tttgaggaag cagaattgat tgctgatgga gtcagtggaa tagcttggta tccggtgaag 35581 caattgccgc aactagcttt tgaccataac aaaattttgg catacggaca caggcgtttg 35641 cgaaataagt tggagtatag tccagtcgcg tttgaagtgt tgccagaagt gtttactttg 35701 aatgatttat atcagttata tactacagtt ttaggagaaa atttttctga ttattctaat 35761 tttagagcgc gtttactcaa gttaggtttt ttatacgata caggaagaaa ggtgtcgcgt 35821 ggtgctggtc gtccagcgag tttatataaa tttgatgcgg aagcttttgc tccttttaag 35881 gataaacctt tagtttttat ttaaccgtaa agacgcgcaa agcaagaaga tatgaaaaaa 35941 ccagatgagt gttctaatat taaagacatt cgtagagaaa ttgacgcgat tgacaaagaa 36001 
gttattgcgg ctttagggag aagatttgca tacgtgaaag cagcatcaca gttcaaaacc 36061 agtgaaacag gtgtcaaagc cccagaaaga tttcattcta tgttgcagga aagacgtgct 36121 tgggcggaag ttgtaggatt aaacccagat gtcattgaaa agctgtatcg agatttagtc 36181 aattatttga ttgatgaaga attgaaacat tggcaaaaaa agacgtaatt atctataatc 36241 attaatccaa tatgaagata gcgatcgctc aacttaatcc taccattggt gacttgccag 36301 gtaatgctca gaaaattctg gacgtggcac aaaaggctgt tgcagaaggt gcccgtttat 36361 tactgacgcc agaactttcc ttatgtggtt atccaccccg tgatttattg ctgaatccta 36421 gttttgtaca agcaatggac attgccttac aacaactaac aagagatttg cctccagaac 36481 tcgctgtgtt agtagggaca gttgaagaaa atgtaaaagc acacaagact ggcggtaaaa 36541 ctttatttaa tagtatcgct ttattagaaa agggaagagt caagcaagtt tttcacaaac 36601 gccttttgcc aacttatgac gtctttgatg aacatcgcta ttttgaacct ggtttagaag 36661 ctaattattt cactttggat aatgtccaga ttggcgtcac gatttgcgaa gatttatgga 36721 atgatgagga attttggggc aaacgtagtt acaccataaa tccaattgct gacttagcag 36781 ttctgggtgt agattttata gcgaatttgt ctgcttctcc ttatagtgtt ggcaagcaga 36841 agtctcgcga agccatgctg aagtatagcg cagtccgctt tcaacaactg attctctatg 36901 ccaatcaagt cgggggtaac gatgatttaa tttttgatgg tagaagtttt gctttgaatc 36961 gtcagggtga aatagtctct cgtgcctgtg gttttgaaga ggatttgaga gttgtagaat 37021 ttgatgaggt gcaacgagat ttgcactcag gttctatagc accagaatat gaatctgaag 37081 atgaggaaat ttggcacgct ttggttttag gattgcgaga ttatgttcgt aaatgtggtt 37141 tttctaaagt cgtactaggt ttaagtggtg ggatagattc ttcattggtg gcggctgttg 37201 ctaccgcagc actgggtaaa gaaaatgtcc tcggtgttct tatgccttcc ccctacagtt 37261 ctgagcattc tatcaccgat gctgatgcct tggcggaaaa tcttggtatg aaaactcaca 37321 tcctacagat aggggagtta atgcaagact atgacaaaac gttagctgag ttatttgctg 37381 gaacagagtt tggactggca gaggaaaatc ttcaatcccg gattaggggc aatttattga 37441 tggcaatttc taataaattc ggtcatcttc tcctatctac tggtaataag tcagaaatgg 37501 cagttggtta ctgcactctt tacggtgata tgaatggagg attagcagtc attgctgatg 37561 tgccaaaaac tcgtgtttat tcaatttgtc gctggttgaa tcgcaatggt gagattattc 37621 cacaaaatgt cctgaccaaa gcacccagcg cagaactcaa acctggtcaa 
gtggatcaag 37681 attcgctacc accttacgat attttagatg acatcttgca acgcctgatt cacaaccatg 37741 agtctacagc agaaattgtc gctgcgggtc atgactcagc aactgtagac cgagttatca 37801 gtttggtgtc gcgtgcggaa tttaagcggc gacaagcacc ccccggattg aaaattacgg 37861 atcgcgcctt tgggactggt tggcgaatgc ctattgccag taaatggact gctatcaaaa 37921 atagttacaa tccagttttt tctgtacgtc aatagctccc ctgctagatt tacaattcct 37981 ttgatgatgt aaaaagtatt gctcaaatta tgtcaaatca aggtccaatc cctgttgttg 38041 ttattggtgc tgcaggcaaa atgggtcgcg aggttatcaa agcggtggcg caagcaccag 38101 atatgaacct tgtcggtgcg attgatacaa ctgttgaaca tcaaggtaag gatgcagggg 38161 aactggcggg tttaagtgaa cctttggaaa ttccgattac taatcaattg gaaccgatgc 38221 tggcgtttgc atctggtgaa aaacagctag gggtgatggt ggattttact cacccgagtt 38281 ctgtgtatga caatgttcgt agtgcgatcg cctacggtat tcgtccagtt gttggtacca 38341 ctggcttaag tccagaacaa attcaagatt tagcagaatt tgctgataaa gcaagtacag 38401 ggtgtttaat tattcccaac ttttctattg gtatggtact gctacaacaa gcagcagttg 38461 cggcttctca acatttcgac cacgtggaaa ttattgaact gcatcacaac caaaaagctg 38521 atgcccccag cggtacagca attcaaactg ctcagatgct agcaggaata ggtaaaatat 38581 ataacctacc tcttgtggaa gaaacggaga aattaccagg agcaagaggt agtacagccg 38641 aagaaggcgt tagaattcat agtgtgcgtt taccaggact gattgcccat caagaagtga 38701 tttttggtgc tgctggtcaa atttatactt tgagacatga tacgagcgat cgcgcttgct 38761 atatgccagg agtattacta gcaattcgca aagtcctcca gttaaagtcg ttagtatatg 38821 gattagaaaa aattctttaa cagtcaacag taccagccgc agtgaacaat tatcaagcaa 38881 ttgataactg ataactgata actgataact gatttaaaaa attcagcact caccacttag 38941 aagactgcac tcatgatcgt cccactgact cgccagaaat ttgaacaact cattccccta 39001 attgccactg gtccgcagta caaatactac tgggggaaat ttccagactt tttgcaacgg 39061 ctgctcattt ctgtcgtggc ggtagctgtt gttttcgtga tgaaagtcat tctggggatt 39121 gattttggtg gaatcatctt tttgcttggg ttaattggtg ccctttactg gctgtgggga 39181 ccagtgtttt gggcgagtat gcgaaatgtg aaaagccgtc gttgtaagta cagcggtttt 39241 ctccgtggtc gagtgctgga ttattggatt gcagaagagt taatgggtaa acaggaaacc 39301 gttgataaca aaggcgattt ggtgattgtc 
gaaaaccgag aaaaacgcat tcacttagaa 39361 ataggtgatg atacaggatt tacggctgag tataaagcgc cactgcgttc tgcctacaaa 39421 gttattgctc gcggtcagag ggcagaacta ttggtgatgt cgaatcgtcc agatttaagc 39481 actattgatg aaatttcgga tatttatatt cccagtcgcg acttgtgggt gagtgattat 39541 ccttgtatac gacgggattt ctttactgag gtgagtagcc gcctgcgccg tgatgaagaa 39601 gatgaaagac cgcgtcgccg tcgtcctagg gtggagagat aaatacgata gtccctacat 39661 cgctataaat atccttgtag gttcgcattg tggataactg gtcaaatgtt ggagttgtga 39721 ttacaagttt gtttcactag ttcggcttga accttgagtt cttcttgtgc cagtacactt 39781 ttaatatgga gtgcaagcgc acgagctaac aggggtgcaa cagcatttcc aacctgatca 39841 tcttgtgtaa gccacttggc tttacgggta agatttccaa aaaaacgata atcatcatca 39901 aacgattgaa gtcgtgcagc ctcgcgaaca gtcgttcccc taggaagtat tgggtgaaga 39961 atatctgttc ggtgattaca agtcactgtt cgtgaaggct tttcacgatt aagcttccaa 40021 acattgcgct tttttgtccg caactcaact ggcagaagct ggttatcgcc accttcaggt 40081 atttgggcgt aacgcgcctg aactcgatca caatgtttaa tagcacgatg gttgaatatc 40141 attcctggag aacgacatcc ccggcgttcg cgctgatata aattgtagta gttaccacta 40201 tgtactaatt cttctgcgcc ttctccagca tctattaaag gaaggtcaca gagggcttcc 40261 catgctgtta ggtaaggcaa gggagtttca ttaaaaagat gaagctgctg gttaagggtt 40321 ccatgcgtcg cttgcggaaa ggatattgca gtgcaaccct ttggaactgc tagtagaatg 40381 aaccgtatga ctattcgtct ctactaaagg tagaatagcg ccttgaatct cctggacttt 40441 gttttttgaa tgccttgttc ctttgcgcca agcaacctct acaaatgtta ggggatcacc 40501 caattttgta acacttccac caccttctag cacataatct agatcatgct tactgccata 40561 tttatcagtc caggttgctt tcttcaaatt tccccttgca gggcgtggtc ccttcctatc 40621 aaggtatagc ccgtgtttgg tcgctactgg tattagtaga tattcagcaa gtatttccaa 40681 aaactcgcca agtttttgac cgaacgcgtg agcgggtgaa tctgccacag ctatcctcca 40741 atcccaagac tgtatatcca cccctttgta acaccttcgt gactctggaa agaccgtatt 40801 tagggtttag ccgaaattag acaaaaaaat atgctgcgcg tgtcaatcag tatatttctt 40861 gcaaaaagct ggtttaaaga tatttttaac cgcagatgaa cgaggcagtg cgttgctgcg 40921 ccagtgcgaa tgacgggttt cccgctctca cggcatctgg tgagaccagc gctgcaggag 40981 ggtctcccga 
cagaggcgac tggcgaaccc gaagggcgtt ggggttcctc ctttgggtgc 41041 accttgccgt gcagatggca gataaataca gatcatttat ctgtgtgcat ctgtgtttac 41101 ctgcagtttg attttcactc ttgttgactt ttgccagagg tctaggactc agtactagaa 41161 gactgactaa aaatagggga aaggtaagca actaaaacac cgatgagaaa accggaaatg 41221 ttgcgaatca ggggtagcca gtcgtttgtt cgtgttacta ctggtccaat accgaaggcg 41281 ataaataaga ttccccataa aggcgagaag attaaagccc attgccaagc aaaaacttgt 41341 ccttcgcgac gaggttgagc tgcaaatcca caaatgactc caccaataac tgaagtaatc 41401 aaagtgaaaa tccactgttc ccgtggtaat ccaggaacga ctgcgcaacc tccctgaagc 41461 aaacaatttt ttactgtttc catagcttgt aggatggctt ggtcttcgcc ttcttctcgc 41521 acaaagtaca aattgccaaa gcgggtttgt aattcaatcc agaaagtacg tggtagaaga 41581 tcataaacag cgtcacccac actaaagctg agaatgttac cgccacggga atcagcaact 41641 agcaaaacgc ttttgtcatc caaaccccaa aaacgaataa cagctcgacc tggagtacgg 41701 tcatactgag ttaatactcg gagtttccag ccagtttcgg cttgaaattg ttctagttgc 41761 ttaacaagat tttcttcttg aacgctggta agagactttg ctaagtctac aactggggtt 41821 ggggtatcag gtaatagttc cggattgtca taagcgtatg cgctgggaga atgcgttatc 41881 caaatcgacc cagctaggaa aaatactgtc acaaaagcca gaattcgttt ccaaaaactt 41941 tgctgcatga gcttttatat acaaatctca acaggaaaag tgaacaagtt gttttaaaag 42001 agtactaaaa aagtgagact tctttacact ttttaactct aatctaattt aatatctcag 42061 atccagaact cactgagcaa gcttgcctgt aaggttggca tatgaaaata aggacgcaat 42121 gccttgcgcc cctattctca tagagactta cacactctat tctttatcag accaattctc 42181 agattctttc tgacggtgcg aatcctgttc atcaaaactg ttttccccac caccaacgtt 42241 gtagtcattg taatcgttgt aattcttgtt agtgcggtat ccttctgcag gtgtgaagcg 42301 gttctcctgc cttagggctt gccgttcttg taatgtcagc tttgcttttt gagatggcga 42361 tagacggtca actgttcgcc ttgaagattt ggagcgtgtg aaggttttcc aagcagcttg 42421 caaaccaacc ataaaactgc gcgtgcctgt ttgaacgttt tgtgcttgct ttctagcact 42481 atccaaactt tgatcgactt gtttaaccac ttgaccagca cttttaacac cttcactgac 42541 atcatcagtt aagtcagtga tttccaagct agtcatgcga atagcatcga gagtgggtgg 42601 taactctcta taaagagtgt caaataattt ttctgcactg cgagctgctc ttgctaattc 
42661 ctgcaaagct ggtattgccg ccaccaaaac agcagtcaga ctggcggcga ctaggagtat 42721 ggacagtccc agccaaaaca ggggatcaat cactttgata tcttctatta actatctgaa 42781 tgttgaggaa ttgtattgga cttttcttga gcattttgct gcttcattgc ttggctttcg 42841 cgcaaactgg catcaatgcc tgctgaaatt gcgtcccgca gtcggtctaa agtatcatcc 42901 cagttagagc gtgcgttagc agaaaggcgg tctgcttgta tttggacact cgttgaaata 42961 tcttctgcca attctggcaa agcactggca gatttcttca agagttgacg ggtttcacgt 43021 cctgtgcgtg gagcaatgag caacccagtt aaagcaccaa tggttgcgcc tagcatcata 43081 ccgccaataa atgatccaga acggttgtta gacatttttt tgtttctctc cttacaccca 43141 ttctatgtgg gttttcacgc cttgcaaaat ccaaaagatt ttatctttct aaatttagag 43201 agacaattca ttagccagaa aggttgccct tactgcctac cttctctttc tcactaattt 43261 tgctcggaac ctcagaaaat tactctgcca gacttgctgt ccaagtgcaa ccagactcaa 43321 aatctgtcgc agccgctgga tttgtagtcg tgctggttga tctttctgcc gtatattatt 43381 aatattccct tgacctgtgt aaagagcttg tggtgctgta tatagtactg catgggaact 43441 acgttcagca gctataaaga tattagctat tcttgtcaat cgcctcttga gtcgccgcac 43501 tcgccaagcc acgtagagaa gaaccagcga gataagtgtg ttgatgacga caacgactgt 43561 aaccattgtt taattttgtt gcacccactg taattcttgc taaattatga ctctcattaa 43621 agctatgaaa atcagtctga agttgagttt tagccagttt atcagatact cagtgggctg 43681 ctactaaagt caaagaaatt tttagctttc tttaagaaga agttagggca tgttttttca 43741 aatcatccac agaaaccgct tctaaagctc ttgcatgagt tgcgctaagg ataacaggag 43801 gagcaacgcc agcctcaaga gcttcttgcc agcgggctgc acataaacac cagcgatcgc 43861 cttctttcaa accaggaaat tgatattccg gaactggggt actgaggtcg tttcctcgtg 43921 atttagtaaa ttcaagaaat tctgttgtca cttgggcaca aacaacgtgt aacccaaaat 43981 ccataccgcc tgtgttacaa aacccatcgc gataatatcc agtcacggga gaagagcagc 44041 aaatttctag cttttcgccg agtacgttac ttccttctgt catcgccaac tcctttaaat 44101 ttctgtgact gtttaaaagc caagcaaata tattgcgtca attgttcaag ttacgacttt 44161 agtaacctta ctttatgata taagcaattt tcttacccat ttcaaaatcg ctatatttat 44221 gagttatgag tgatgagttg tcacttttgg tgtttactct tcactcttca ctcctaacaa 44281 ttaactcctc agtgattcca ttgatgcaag gcgtacaata 
agttgacgat gcggctcttt 44341 tcctcggctg aaggtttcta agtcaccaaa ctctttcaaa aaagtgtgaa tttgacggcg 44401 ttcagctgaa ctgagtgatt taatttcagc ttcttgacca gaggaacgca cttgctgggc 44461 tgctgcttca gctattgcgc gaatttccgc gtatctcctg acgcggtagc cgtttaattc 44521 aacagtgtaa gaggcttgct cttcctgtag ttgacttaag ttgagaatag aatttgctag 44581 atactgaatc gcatctagca ctgaaccatc gggacccgtt aaaatttcga tttgttcagg 44641 tgataaattg gtttcaccaa tcgtcaacca gtaactatcc tgttctgggg attcctcatc 44701 ctcagcgatg gctgtttctt tctcaccttt tatatcaaca gatagcccac taagttgcag 44761 cagcgttttc aaccactcct gacctcgttt cattcgactg tctgtcatca ttaccctgta 44821 gtctttttct tagaactttt tggttcaaac ggcagcgctt tttgttttgc gctttcttct 44881 tctttctcct gagtgtctac aattttttgt agttcttctg gtaggggttc acgcgtgaga 44941 atgtaggttt gcaatgtttg gaaaatatta ccaatcacca tatacatcaa caccccagct 45001 ggcaggggaa agaacaaaaa catcccagaa aagatgacgg gagtgatttt gtttacagtg 45061 tcctgctgcg ggttggcatt cgtggtgttt tgcccagaaa ggatttggct gacgtaaagg 45121 gtaataccaa acaggataac catagcgaca atatcccagt gaataatgcc gtctggatca 45181 attgcaccaa ctctacccaa ggcatcaata aagataaatc cttgatttgc ggcgagtccc 45241 ggaattgttc cttggattgt aacatccccc ggctgtaagg cttctatatt accctcagca 45301 tcaattttta tcctttcttc ccctttggta attttccatt cggggattaa ctgagtatcg 45361 gggtgttctg ctaatagtgc ctgaaaaggt ttaccctcga cagtttgata ttctatcttc 45421 gtatgttctc ccaccgctaa tttgttgcct ccaggaagga tggcattcac cttaaggtgt 45481 tccccttcag agatatagat gttttgtggg ggagttgcaa aggcttgcgg ttgaattctt 45541 tcgatttgtt cgccaggaaa aacttggaga tttacactat agttgacacc agcaaaaggt 45601 gaaccccgca aagtcgcaaa cagtgccagt aaaactggca tttgtagcag caatggcaaa 45661 caacctgcaa gcgggttgcc aaattctttt tggacgttca tcatttcttc gttcagcttt 45721 tgctgatcgt ccttatatcg ctccctaact tcttgcatcc gcttgttcat caggggttgc 45781 acgatccgca tccgtcgcat attgcgaatt gaaccagcac tcaggggata gagcgcaaag 45841 cggatgatca atgttaatgc tacaatcgcc aatccatagc taggcacaat gctatagaac 45901 aagtctatga ttggcagcat cacgttgttc gagagaaacc cgataccaaa atccattatt 45961 ctgaattcaa cctgagatac 
tgtaaattga catcatctaa tttatctaaa tcatgattta 46021 gattgcgact agcgtggtac tacggatagc cgcacattat atgaattggg gaaataagaa 46081 aaatgagaat aagggagtca cagagtgtgg gagtgtgagg aaatattcct ccctccttcg 46141 ctccctttct cctgtttccc ctgatcctcc ttatcacttc ttagcgccca cgtaactagg 46201 atttttggcg atgacttttt cttcaatgta gtcataaatt tcccgaaact taggaagcgc 46261 ccgtagttcc agacggctac cgtctcttaa ggttataacc atatctcccc acaagccaat 46321 accacggggg attttagcca ttttgacaat ttctgagtaa atgacgtcgc tgcgagaacg 46381 tcccatccaa ccacccatga cggagatcct gcgatctgtg atccgaaagc gtagccacaa 46441 tgccctgaca atcgccccaa ctgtcaatgg gagaccaaca atagtgaatc caatcaatat 46501 acttgtgatt aaatccccaa tatgtggacc accctcataa taaacttctt cacgaatgcc 46561 cattgaacac ctcagcttgt tccaataact gctctaattc ttgcagaaat tgttgggtca 46621 cgcactttat atctgctgct ggtttcacaa caaccacaat ccgccatcct ggtgacatct 46681 tgggcaacaa ctgatacaaa gctgctgcaa tttggcgttt gaggcggtta cgaacaacag 46741 ctcgtttgct aacctttgtg ctaatcgaaa ttcctatttg ggctggagta agatgctttg 46801 tgtttgtgtt tacagtttca ggagccgtat cacaagaagg cttagaggaa ggtgatggtc 46861 gtaatgctct caatgttata tgtgaaccat gacgacggat tccttcccgg aaaactgcct 46921 ggaaatctct tcgagatttt agccgatttg ccttgggcaa tgccacagtt gctatgataa 46981 ttagcttttc tatggtctaa acgctcaaac ggtggcgtcc cttctttctt cttgcccgga 47041 tgacgtttct accatctgga gttcgcattc tagcgcgaaa acccgaggtt ctttttctct 47101 tgcgacaagt accttccagt gttctcttca tgtttttatc ctcttagaca attatcataa 47161 aaaaaagtca caattggcta ttatatcata atgtttgttg tttttatcca gcagtcatgc 47221 taaacccaga aagccgtcat cagtcatttg tctgttgtca atgataacgg caaaactcta 47281 accgttgctg tacaagccag gtagtggtaa tctatttgcc ttgctttgac tcactaatga 47341 ctgatgattc atgagtgata acttaactaa tacttatgat ccacgtgccg atgtagcggg 47401 ataatggcac atcgccggga gactgaattg agcagttaaa atagaacatt cccccagaag 47461 acgggttttt cacgttagag agaatcaact caacatcggt acctgctggt actggttctt 47521 ggggaaaaat ttcaagaacg tgattttctt tatcccactt cacctgcgag agtgcaatct 47581 ttttgccttt agccttgacc tcaatttcct tgctgtcaaa ggttccttgg tagtaattag 47641 
gataagaaat cacaaatcga gcaactgctg ttttcatctt tttgttagaa attttcagtc 47701 tgtatctgtc ccagccatta gtttgaccgc caaaatctaa tcggaacggt agctgatttt 47761 cgcctttgac accactaaat atcgtcaacc ctggcaaact ttgtgcccag cttatggctg 47821 gtattccagc tagcaaacaa cttgtcacgg ctaaagtaga gagtaaacgt cgcatggtta 47881 agctcctgag gcagaaatag gtcttttgtt taaaataact aggacggaat cttcgtaact 47941 aaaactttac tacttaacat ttggatattg ggacgtaaat tgctaatata aagtgtcgtt 48001 ctcagtacaa aagccgtcat ctggctttct gatactaagt gtataaccat aaaaaaattg 48061 atgaatttga agtcactgaa aacacgcttg aaatttgatt ttgttaccaa atatgtatag 48121 atatatataa aatgttataa aaattgaaaa aatgaatatg aaaattagta gttaacttga 48181 acgagattcg ataagataag ttgtaatcaa gtcagggttg actaatatta tttggggata 48241 gtttttgaga atttgactaa tattgcaata tgtaaaaaaa tcagaaaaat tcttgaatcc 48301 ctaaaagaag ttgaagacaa acacgtaaaa tttggggatt gacataggat gtgtaataga 48361 agcccctctt ttcaaataca tgggaatctg gaaaccctgc aagaaccaaa ttgaaacttt 48421 gcgtaggtca ttatcgagtt tgagagcgaa ggtaaagttg attgggcaaa gcaactgcta 48481 atagctgcta cccatgcacg ataagtgcac ctattgcctc ttaagaaaaa tttcacaact 48541 tgccaaaaat aggtattaat cttcaaggag attgcatgag aatagcagtt gccaaagaaa 48601 ttgaagtttg tgagcgtcga gttgctctaa ttcctgacac cgttacccga ttggtaaaac 48661 aaggtgtgga agtgtgggta gaaactggtg caggtgagcg ggctttcttt tctgatgctg 48721 cttatgaagc agcaggggcg aaagttatca ctgataccgc cacattatgg ggtgaagcag 48781 atattctgct taaggtgagt ccacctcaag agcgagaaga tggacgctca gaagttgact 48841 tactaaagga aggatctgta ctcattagct ttctgaatcc tttagggaat ccatccgtag 48901 cagggcgact ggcagaacgt aaggtaactg cgatcagtat ggagatgatc ccccgcacga 48961 ctcgggcaca aagtatggat gctttatcgt cgcaagcatc aattgcaggt tacaaagcgg 49021 ttctaattgg tgcagcagca ttaccaaaat atttcccgat gttgacaaca gcggctggaa 49081 cgatcgcccc cgcgaaagta tttatcatgg gggctggtgt ggctgggttg caggcgatcg 49141 ccaccgccag acgcttggga gcgatcgtcg aagcctttga tattcgtccc gccgtcaaag 49201 aagaagtcca aagtttaggg gcaaaatttg tcgaagtcaa actagaagaa gaaacaaccg 49261 ccgctggagg ctacgccaaa gaaatttctg aagcaagcaa acagcgcact 
caggaagttg 49321 tcgccgaaca cgtcaagaat gctgatgtgg tgattacgac tgcccaagtt cctggtagaa 49381 aagcacctct tttagttact gaagagatgg tagcgcagat gaaaccaggt tcggtgattg 49441 tggatctcgc cgccgaacag ggtggtaact gcgcttgcac tgatcccggc aaagatattg 49501 tgtggaatgg catcactatc attggtccca tcaacttacc atcatcgcta ccagttcatg 49561 ccagccaact ttattccaag aatttgtcgt cgttgataca actgttgatt aaagacaaag 49621 ctttaaatgt caactttgcc gacgacatcg ttgatgcggc ttgtgttacc cacggtggcg 49681 aaattcgtaa ccagcgggtg aaagatgcct tacaagcttt aagcggtgtg gcaagttaat 49741 taattcgtag taagtgcgtg cttactacaa ctcaatttgc ataaggagtt taccctgcga 49801 tgacagaagc attaatcgct gccttgtttg tgtttgtttt ggcatccttt actggatttg 49861 aagttatcaa caaagttcca ccaaccctcc acacaccttt gatgtcaggt tccaacgcca 49921 tttctggaat tgctgtactg ggtgcgatcg tggcttctgg tgctagagag acgaatttat 49981 cagttattct cggtttgatt gccgtgatat tggcaatggt taacgtggtg ggtggcttcc 50041 tagtcacaga cagaatgctg caaatgttca agaaaaagga gattaaggcg tgagcgactt 50101 tttaccaacc gggattcagc taacgtattt agttgctgca tccttattca ttctgggttt 50161 gaaacagctg ggatcacccg cgacagcacg acaaggtaat gttgttgcag ctgtggggat 50221 gctgttggct attgtggcaa caatgctgga tcagcatgtg ttgaactatg aaatgatttt 50281 ggtaggattg gctattggat ctttggttgg tatagtcgtc gcctacaaag tccaaatgac 50341 ggatatgccc caaatggtgg gtttgctcaa cggcttgggt ggtgcagcat ctgcacttgt 50401 tgcggttgct gaattttggc ggttactagg aaacggtgaa gcaatacccc ttgatgccaa 50461 tatctccatg ttgctggatg tgttaattgg tggtgtcacc ttcacaggaa gctttgtagc 50521 ctttgcaaaa ctgcaaggta ttatcagcgg ttccccaatt acatttcctt tgcagcaacc 50581 atttaacctc tcgcttctgg ttgcctttat tgcgggtagt gcttatttaa tcatctcacc 50641 gcacagctta cctgtctttt tgggaattgt tgctgtttct ctagtgttgg gtgtgatgtt 50701 cgtcatcccc atcggtggcg gcgatatgcc tgtggtgatt tccctgttga actcgttttc 50761 cgggttagcg gcggctgctg ctggtttcgt ggtgatgaac aatatgttaa tcatcgctgg 50821 cgcattggtg ggagcatctg ggatcatcct taccgagatt atgtgtaagg cgatgaaccg 50881 ttctctattc agtgtgctgt tcagtgcttt tggtacagtg actgcgtctg gtggtgctgc 50941 tggtactggt ggtacaaccg ataaaagtgt 
ccgcagcatt gatcccgaag aaggcgcaat 51001 gatgttgggt tatgctcgtt ccgtcgtcat tgttcctggt tatggtatgg cggttgcgca 51061 ggcgcagcac aacatacgtg aattggcaga tcagctcgaa cgtatgggcg tggatgtgaa 51121 gtatgcaatt caccctgttg ctggtagaat gccgggacat atgaatgtgt tgctggcgga 51181 agcgaatgta ccttatgagc aactgcacga tatggatgat atcaatcctc agtttgagca 51241 gacggatgtc gctttggtaa ttggcgcaaa tgacgtggtg aatccagcgg cgcggagtga 51301 tacaagtagc ccgatttatg gtatgccaat cttggaagtg gatcgggcga agcagacaat 51361 tgtgattaag cgcggtatga gtgcgggttt tgctggtgtc gataatgact tgttctacaa 51421 gaataaaacg acgatgctct ttggtagtgc gaaggatatg gttggaaagt tggtttctga 51481 agtgaagcaa ctgtagggaa ttgtagatta tggatacagc ataaatccaa aatccctttt 51541 atggatcgtg tataagcttg ctaagggttt gtcctgaggc aagttttttt atagttaaaa 51601 ttaataccag cttagcttct actgcatttt taggaaatcg tatggtttat actacattgt 51661 agttaaagat gacgttgata aaataaatca gcgataaaat ttgaataagt caagcaatta 51721 gggttacgtg aaatgaatat ttccctcaaa cctgagcatg agcagtttat tcaatcccaa 51781 attcaagcag ggagatatgc taatgcagag gatgtgatga acgaagcatt gaaactgatg 51841 caagcaaggg agcaacgttt ggaagaactc cggcaaaaaa tagcggttgg gaaggaacaa 51901 attgctagag gagaggttac ggatggggaa atagtatttg ctcaactgca agataaaatc 51961 aataaaattg ctgagtctca aagatgagta attactcttt ttctgatgaa gcagtcaaag 52021 atttaaattc tatttgtgaa tatattgctc agaataatcc taaagctgcc agtaagcttt 52081 ttgacgcaat tcgtcagaag tgtaaactag tttctggatt tcccaatatg gggaagagtt 52141 atgaggaatt gtcccctaat ttacggggat ttagtattga agattatatt gttttgtact 52201 atccaagaga ggatggaatt gatattgcgc gtgtcattag cggatataga gatttagaat 52261 ccatgttttt ggaaccagaa taaacctgct aaaaagcaca agtaaaactg gattcacttt 52321 tttaacgaac cgccaagacg caaagagcgc caaggaaaga attagagagt tagcttgtct 52381 tgggggagat tataatcctg agacaagctt ttttgtgggt taaaagtaat actagtagaa 52441 aagggctatt gttgattatt taagagggtt gctatgggaa aagagcagca gaagttgatt 52501 ctgtctcagc acaggtatga ggaacaggta aaaaaaaggc tagggagtct tggcaatgag 52561 ccgcaaaata tactaatttt ttatggacga gcaggtattg gcaaaacaga tttatcgaag 52621 tcactgtttc 
aatctttgaa gtcaaattat ccggtgtgcg cccggttgga tggagaggga 52681 attttcaact acggtttggc acgtaaaccg attgaatcag ccgtagttca gctacgagcg 52741 catttgcgtc ggggtggggc tgatctcagt tgctttgatt tggcgtatcg gatttatagt 52801 gcgggagcaa atcctttaat ggtgacaatt tcacctgaag ctgctaaccg gatggattgg 52861 gcagagaaaa tcagcaatgg cacggattta gctgatgctg tgctggaaat gaatcctgct 52921 gaactggctc cttctttgtt agaggggctg acggaactgg cgaaggattt gctacctggt 52981 ggacaagttt tagctaaact gggctggttt ttgatgcagc aatcgccaga aatatggcag 53041 tggtggaaag aacggggtaa tcaaaatttg cgagaattaa aagattgtgt aagcccgtat 53101 gaaattctgg agcgactgcc tttatttctg gcgcgtgact tgcagcaata cctacgccgt 53161 tcccagcaca aagcagtgat ttttatagat ggctatgaaa aattagtaga taaattcggg 53221 cgatgcgatt ggcttgagga actactggaa cagccgaatc caaatgtgct gtgggtgatt 53281 ttttcggagc gatcgcttaa ttttacaaaa tacgcccaca atattcctat cctcccgtta 53341 actgaggcgg aatgtcaagc ggttctgcaa gaatttggta ttgatgagcc tgaaatttgc 53401 caaattatca ttcaggcatc tggaggaatt cctttgtatt tgcggctggg ggtggagacg 53461 tggcaagaca tcaaaaagca gcgtcagccg aaacttacgg actttgcccg gaatttaaat 53521 gaggtgttac gtcagcgaga tatagcttgg cagccagatg aacggcggat gtggcaagtg 53581 ctttctcact gtcgtacttg ggatgaggcg ctgtttgcta agttgatgag tcagtttcag 53641 ttggataact gggaaaatcg cctatctcaa atcaccgtat cgccttatgt tgaggaagct 53701 ggttcaggag tttggcgctt gaatcaagta atgcagcagc atttgcagga gaaccagcca 53761 gaagacttgc gaaaatcagt gaataactgg ttgtttgagt attatcgagc agaataccaa 53821 gaaccagaat tacaattaac cgcgctggca gaagcgcttt atcacgggtt ggaaagcgaa 53881 cagccagaag ccgcaacgag ctggttttta aaacaggtgg cggtgcagca ggaggtgggt 53941 agacatcagg ctgttgtttc tatgctgcaa tttcttgtcg ggaaaaatca tcaattgcct 54001 cttgcttgga cgctgttggg taagtcgctc gttgttttgg gtgattacga gcaagcgttg 54061 gaagctttgg agacagcgcg aagtcaatgg gaagctttgc aacagaggga aagtcttgat 54121 gctggtacta tggagttgga actcgctaat gtttacctca agcttgagcg gacttttgat 54181 gcaagcaacg ctgctcaaaa agcatatcgc atccgcactg ctcaactggg tgcaaatgag 54241 tcttcagttg ctgaggttct caaccgccaa gcagaaattg ctgcaagtca gggcgattat 
54301 cgagaagcgg tgaatctgag tcaacgcgct ttgcaaatac tccagtttca tccagatacc 54361 caaccgctgc aactggctca actgaagcac actgctgctt ggttgaatgc ttacaacaac 54421 aatttggatg cagcagaaaa gctttgccaa gaggcgctgg aaattgttaa gaacaatgct 54481 ggtgacgaac attctctggc tatctcttgc caagcgtctt tgggagatat ttaccaaggg 54541 atgggagagc ataaatacca aaaagcttac gaacagtatc agctagctct tgatgcagca 54601 gatatcagcc ttagtcccag ccatcctcaa acactgcaac tgctgcaagg cttgacgcac 54661 ttgtgccgaa gaatgggaga atacgatgcg gcggatgagt ttgcagaacg tcataatgct 54721 catgttcaaa tcggtaattt tgaagaaacc gctgctgctg ctacaaggct aaacaatctt 54781 ggtttttcgc tttataaaaa aggtgagtac ggtaaagcag aaccactcct gaaacaagcg 54841 ctgcaaatct ttttgaaagc gctgggagaa gaacatcccc acacagcttt aagtctcaac 54901 ggcttggcac gattgtacga atctcaagga cggtacaatg aagccgaacc gttgtacaac 54961 caagccctgc aaatctctct gaaggtgctc ggagaagaac atcccaacac tgccacaagt 55021 ttcaacaact tgggatggtt gtacgaatcc caaggaaggt acgatgaagc cgaaccgttg 55081 tacaaccaag cgctgcaaat ccgtctcaag gtgctcggag ctgaacatcc ccatacagac 55141 atcagtctca acagcttggc aggattgtat gaatcccaag gacggtatga tgaggcggaa 55201 ccattgtaca accaagcgct gcaaatccgt cgcaaggtgc tggaagcaca acatcccgac 55261 acagccatga gtctcaacat cttggcagga ttgtacaaat cccaagcacg gtacgatgaa 55321 gcggaactgt tgtacaacca agcgctgcaa atccgtcgca aggtgtttgg agcggaacat 55381 cccgacacag ccgacagtct cagcagctta gcaggattgt acgaatccca aagacggtac 55441 gatgaggcag aaccgttgta caaccaagct ctgcaaatct ctctgaaggt gctcggagct 55501 gaacatcccc atacagccaa aagtctcaac agcttggcag gattgtacaa atcccaagga 55561 cggtacgatg aggcggaacc gttgtacaac caaggactgc aaattttttt gaaggcgctc 55621 ggagctgaac atccccacac aggcaggaat ctcaacaact tggcaggatt gtatgaatcc 55681 caaggacggt atgatgaggc agaacccctg tacaaccaag cactgcaaat ctggctcaag 55741 gtgctgggag aagaacatcc ccacacagct gtaagtctca acggtttggc aggattgtat 55801 gaatcccaag gacggtacga tgaggcggaa cctctgtaca accaagccct gcaaatccgt 55861 ctcaaggtgc tgggtgaaga acatcccgac actaaggaaa cggaaagcaa cctcaatcgc 55921 ctacgcgatg aaatgacaaa atgatatttc gtccaaacac 
cctgtagaga cgtagccctt 55981 cgggttcgcc agtcgcctgc ggagggaaac cctcccgcag cgctggactc acatgctacg 56041 tctctacatt ttgacgcgag atttcaagga gaattatttt ttcttcttaa gtttaggagc 56101 ggtattagga ctggtgcgct taatttagcg tttttcttcg tctcgcagac ggcgatcgcc 56161 taattctaat ttttcctata ctctcttctc tctgtgtcct ctgcgcctct gcggttaaaa 56221 aataggtatt cttctggtac cgaaggatta acgccatccc caacaacatt atcccaaaca 56281 gcaatttcag cttgatactc tgaatcttct gcatcctgct tgagggctgc aatcatttct 56341 gcttccaaaa tccggcggcg gtgttctgtc agcaacgcat taatataacc actgcggtta 56401 ccctgtgcct gctgatctat aaactggaga atgtcttctt ctaaggtaat tgtgattttg 56461 accatcagta ttacgcaggc gctatttaaa aaagagcggg ttgcgattgc ttccctatgc 56521 cctccgggca cgctgcgcta acggtcgcaa tgactatagt taggttattc gtagctaggg 56581 gaatcttaaa agcccccttt ttaagggggt tgggggatct cttttgcgta agttttacct 56641 aagttttcat aaatacacaa acacggtgta gagacgtagc acgctacgtc tctacatttt 56701 gacgcgagat ttcaagcaga attatttttt cttcttaagt ttaggagcag tgctcttaga 56761 agtggtgcta gaactagcgc gtcgcatttc gcgtttttca tcttctcgca gacggcgatc 56821 gcgccaacct gccagcgtca aataaccaac gcctcccgta actgctacaa gcagcaccgc 56881 agcaactgaa gccaaaatag tcaagaatgg actttccacg attctcttag agtccgttgc 56941 gaccccaaac taccattgaa attgaccaag tgaaaacaac aagtaaagca acccagccga 57001 gtgtcaaaat ttccatagtg ctacttttaa actcaattga aaacgcctct aacattaata 57061 atactagaag cccgttcaga actaaagttt ggattcctca gtgaacaaga ttttagatcc 57121 tcaactgatg gcagaacgca ttgaatccct caaagctggg atactcgcgg gtttgagcct 57181 gatgatagct ttttttctca ccaccttcgt gaataatctg gtgctagcaa aatattttga 57241 gcagctcgcc agtctggcga tcgattcgct agatttacaa ttgttgctca agcttggaat 57301 tgcagttttt tgtggtttgc tctttggcgt cacctaccgc tatatcatcc gctcagataa 57361 aaatcctcaa ttgaaagctg gcggagtgtt agcgtttggc ttagtacgct gtttaaccca 57421 agtcgatgtt gggctgtcat attctagaga tatattgcct tttgtagttc tgggagtcga 57481 aagtattttg tggtttgtgt tggcagcatt ttttcttgat actgccatcc aactcggctg 57541 gattaagcct tttcaatcaa gctaattcat aaaatataca ttttgtcaag ataaacatat 57601 gaatctaccc aatcctgtta 
aaactccgtc ttttttgcaa agaatccaat gggttgctga 57661 acctatagga tatatggaaa gcgcggctca acaatatcct gacattttta gtactacagt 57721 agtcggttct agacgtcctc tggtattcgt gaaccatccc cagacaattg cggaaatttt 57781 taccaacgat agaaagaagt ttgcagccct gagtcaagaa aacagaattt tgcaaccctt 57841 actaggggac agttcagtcg ttatgttgga cggcgatcgc cacaaacgac aacgccaact 57901 cctaatgccc ccctttcacg gggaacggat gcgaacctac ggtgaaatca tcgtcaacat 57961 cactgaaaaa gtctttagcc agctaccaca gaaccaacct ttgtcgattc ggactgcaat 58021 gcaggaaatt tctctgcaag tcattttaca ggctgtcttc ggcttatatg agggagagcg 58081 ttgccaacaa ctcaagcgcg tactagcttc gacgttggga gtttttgaat caccgctcag 58141 ttctagcttc ctgtttttcc ctttcctgca aaaagattta ggggcttgga gtccttgggg 58201 aagatttttg cgccaacgtc agcaaattga taaattgctt tacgctgaaa ttgctgaacg 58261 ccgtgcacaa gatgatccaa atcgcatcga tatcctctca ttgctaatgt cagcacggga 58321 tgaaaagggt aaaccgctga cggataaaga gttacgcgat gagttaatga ctttgttgtt 58381 tgctggacat gaaactacag caacggctat ggcttgggca ttgtattgga ttcaccatat 58441 accacaagtt ggtgaaaaac tcctccaaga actggggact ctcggcgact ccccaaatcc 58501 catagacatt gcacgattac cttatctgag tgctgtttgc aatgaaacct tgcgtattta 58561 ccctgttgga ttcttgacat ttggtagagt cgcgcaagaa cccgttgaga ttctgggaca 58621 tcacttagag tctggtaagg tagtctttgg ttgcatttat cttttgcatc atcgtgaaga 58681 tttatacccg cagcccaagc agtttaagcc agagcgcttt ttgcaacgcc aatattcgcc 58741 ctatgaattt atgccttttg ggggtggtgc ccgtcgctgt attggtgaag ctttggctgt 58801 gtttgaaatg aagcttgttt tggcaactat cgtgtcacgc tatcagcttg cactcactac 58861 taatcaacca gaacaacctc aacgacgagg agtaaccctt gctcctagtg gtaaagtcaa 58921 gatgattatt acaggagaac ggaagcatca agagtctcca aaagtagcag caagtgttta 58981 gagtaagtaa gcatgacttt agcacttggc atgattgaag tttttggcgt tcctacagca 59041 atagaagcgg gggacgccat gtgtaaagcc gcccgtatca ctcttgttgg gtatgaaaat 59101 actgatttag ggcgaattac tgtgcttatt cggggagcag taggtgaggt taatgttgct 59161 gttgcagcag cacttgaggc gataccgcga gtgaatggtg gtgaggtgct ttctcatcac 59221 atcattcctc gtccccatga aaatttagaa tatgttttgc caattcatca atcagtaaat 59281 
attgagcaat tcaattctta tatccgattt ccaccaccat tgtcggcgta aaaactgttg 59341 agtatgtggg ggtgttgagt ttttattgtt cctaagttta aaagacagtc cgcaaatcga 59401 gaataaaacc aggtaaaata tcttcacctg atagttctgt tggtgattct aaaacttcta 59461 ctgcttgccc caaacggtat atttctactc ggcgtgtttt tctatcaatt aaccagccca 59521 gtttaacttg attttccatg tattcttgca ttttttcttg agttttttgt aaattatcag 59581 atggtgacat taactctaaa acaaaatctg ggacgatggg agggaatttt tctttttgtt 59641 caaaggtaag gacatcccat ctttcttgtc ttatccaagt cacatcagga gaacgatcag 59701 caccattagg aagtttaaag gcagttgaag aatcaaaaac ttttccaagc tttgtttgac 59761 gattccaaat cacaaaatca gcattcattt ctgagttatg gcttcctgtt tctcctcctg 59821 ttgggggcat gataataagt tctcccttgg cgcttctttc aaatttgaga tcagggtttt 59881 tttgacaaag ttgataaaat tgttcgtcgg ttaattcaat aacaggtttg aggtcgatgg 59941 taatagtgtt cataagaagt gaggagttaa taagcaattc aaaattcaaa attcaaaaag 60001 aaggaagaaa cgtgccgaca agcatcctac ctgattgtat gcactcttaa ggcaggacta 60061 tttcattcta aacaaggggc tatatctctt gttcatctgc ggtttcttaa ctctttgatg 60121 tactgcaccc aactgaaaac cgatattata aattctgata attttattta ggagatttgg 60181 cttcatttgg cacaatagct cctgctatat ctgctccttc tatattgacc tcagataggt 60241 caacaccagt caagtcagca ccgcgcaggt tagcatactg caggttagca cccttgagat 60301 tagcaccacg gaagttagca ccctgtaagt cgaaaccaat acaattgttg gaactaaaac 60361 cacgcagatc tgcaccgtta aaattagcac cgttaagatg aataccacag cgagcacgag 60421 cacccctcag attagcacca gccaagtcag ccccttccag gtttgcaacc cctagatcaa 60481 ctccaaccag ataacaattt gagcatttat tagtttttat taaagtttta agattcttac 60541 ttgcgtcatc aaaac // LOCUS NODE_328_length_57545_cov_4.74619957545 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 57545) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 57545) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..57545 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 503..1753 
/locus_tag="DP116_01315" CDS 503..1753 /locus_tag="DP116_01315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017804039.1" /note="sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released; the proteins in this cluster is involved expression of genes important stationary phase, nitrogen promoter recognition, and light/dark adaption; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor, RpoD/SigA family" /protein_id="PRJNA477356:DP116_01315" /translation="MPATSFYADAAFNNKQSDPVFDPDITVDETELPIDELDDLEIAS VDSASLGANLNRRSTDLVRLYLQEIGRVRLLGRDEEVSEAQKVQRYLRMRILLSKAAE QGDEVIAPYLRLIETQERLASALGHRPSLERWAGEAGVGLLELKQIIGLGKRRWAEIA KITVEELEKIQSNGLQAKEHMIKANLRLVVSVAKKYQNRGLELLDLVQEGTLGLERAV EKFDPTKGYRFSTYAYWWIRQGITRAIATSSRTIRLPVHITEKLNKIKKAQRKIAQEK GRTPTLEDLAQELDMTPTQVREVLLRVPRSVSLETKVGKDKDTELGELLETDNITPEE TLMRESLQRDLQNLLSDLTSRERDVILMRFGLADGHPYSLAEIGRALDLSRERVRQIE SKALQKLRQPKRRNLIRDYLESLS" gene 2463..2918 /locus_tag="DP116_01320" CDS 2463..2918 /locus_tag="DP116_01320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870366.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional repressor" /protein_id="PRJNA477356:DP116_01320" /translation="MTAYTAASLKAELNERGWRLTPQREVILHIFQELPQGEHLSAED LYERLEAEHEGISLSTIYRTLKLMARMGILRELELGEGHKHYELNQPYPHHHHHLICV RCNTTIEFKNDSILKIGAKTAQKEGFHLLDCQLTIHAVCPKCQRALMPL" gene complement(3052..4212) /locus_tag="DP116_01325" CDS complement(3052..4212) /locus_tag="DP116_01325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell wall-binding protein" /protein_id="PRJNA477356:DP116_01325" /translation="METLAYLHVSSVYEDLPPSELISLSRLFKQTVAPDWKRLSGRAW KYMLPLALGLSVLSCVSSALALEKGDRGPSVRSLQQQLKAAGFYQASVTQLYDTETEA GVRRFQRAAGLDVNGVAGSVTLEKLENWRISNRSSQVTKISTGSVQARRTSTDNSQVI ATSSENPSVTNKRRSSNVLQKGDEGEDVRVLQEQLRIAGFYTGNATTVFGPITEEAVK RFQEAYNLTPDGVVGSVTQAKLPTLSIGYGEDSVSKPPATGDKLRLGDRGEAVRVLQE QLIKAGYLQGEPNGYFGPNTADAVRRFQTDNYLAASGIAGPTTRAKLYSMANDAPTSG DFNVLEIQRRLRDQGFYKGALNGVMGDETKKAIRQAQQFYGISLNNVRSGRF" gene complement(4228..8322) /locus_tag="DP116_01330" CDS complement(4228..8322) /locus_tag="DP116_01330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877309.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobaltochelatase subunit CobN" /protein_id="PRJNA477356:DP116_01330" /translation="MHRTNATSGGWNPQSENIIFLEQTPAPIVFLTSADTDIQTLAAA VPKLPSTFPGLRVVNLLQLQYQISIDSYAEKVLESASVIILRLLGGRSYWAYGLEVVQ EIVERNGTTLIVMPGDDAIDPDFISQSTLPLSTVNQVWQYFNEGGVENFLHALHFICD TCFSTFFNPPSPQVVPRVGLYEWEREQGTLNRERLTGNRERLTGNRERLTGNREKNNS LTRSLPESVTSKVGILFYRAHYLAGNTKVIDALCNALAKQNLTPVPIFVSSLRDVDVQ EELCEFFQPKDTPQIALLLNTTSFSLARLETETPQIDLWQKLDVPVLQVILSGGSIEQ WRVEFQGLSPRDMAMNVALPEVDGRIISRAVSFKTVQSRNSSLETDVVVYEPVGDRIE FVTSLAANWVRLRSKPPQERRVALILANYPNRDGRLANGVGLDTPASCVEILKALQLA GYQVENLPDTGDELISCLTAGVTNDSEGRELRPVLQCVSLAEYEEFFSSLPDAVQKGI GDRWGGVFETNRPSGSPVPHRDGSRQDGGWTHQGAKDAKEEREEDFNFPPSFPVPGIQ LGNIFVGVQPARGYDVDPSLSYHASDLEPTHDYLAFYYWVRQCFGADAVVDVGKHGNL EWLPGKSVALSSECYPEVALGALPHLYPFIVNDPGEGSQAKRRAQAVIVDHLTPPLTR AELYGCLHELENLIDEYYEAQSLDPSRLPMIGDRIRELVVKENLFLDLELKQKAKGIK RKEEFSVSSFDFSVLPSLDGYLCELKESQIRDGLHVFGQCPQGRQLRDLIVAIARHPN RHHIGLTRALAQDFGLDFDPLTTDFSTGLSVQDIQILANKTQHPCRTVGDAVEFLEEQ AAKLVEELQIPNFDFPTPNSSLPTVLDWIRTTLLPSLLQTQQEITNLLHGLDGGYVPS APAGAPTRGRPEVLPTGKNFYSVDIRAIPTETAWDVGRKAAEALIERYVQEEGEYPKT LGLSVWGTATMRTGGDDIAQALALLGVQPVWDGAARRVVDFEILPLSVLGRPRVDVTL 
RISGFFRDAFPNLIDLFDSAVQAVAALDETPEENPLAAQVRQETEYWTQLGLSLPDAR VRSRYRIFGSKPGAYGAGLQGLIEGQNWTDDQDLARAYINWSCYAYSCPPAAKVSTAA GQDNLEFGIRNSKLEVSPSSPSSVQQGRSAPEAFEMRLREMQIVLHNQDNREHDLLDS DDYYQFQGGLTAAIRAVQGKNPQTYFGDHSIPAKPQVRQLKEEIARVYRSRVVNPKWI AGMMRHGYKGAFEMAATVDYLFAYDATAQCVEDYMYQGITEAYLFDPVVSEFVYQKNP YALRDMAERLLEAHQRGLWEDVNTQTLEHLRNIVHQAEAAIEEK" gene 8416..10443 /locus_tag="DP116_01335" CDS 8416..10443 /locus_tag="DP116_01335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749063.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_01335" /translation="MSSSADLSFSRTLPSNVFDQLGAILQQMAQAVGKGALVLTEAVL IPIDIPQEWQTQRFIVVVSEQFSALLVGFPQEMDNQGGQSLDSAVNVRLTFCSEAIAS FVLNLRDFFQRDSHTYQKLEQYCHIATANDVQLQSQFSLLLLKYFLPPLYTENTELSP LTYPHVSVCQPVEQALKKQIAQEQLLNQVTTQIRKSFDLPVIIATAVAQVREFLQLDR LVVYKFEASRVKSKEVTNLSSVSSGHFCIPPQPDYIHWTGFTPNTPQVYPNTGTSSPP SSKSSQQDLSHQTGCIIYEVCASDEIPSVLNNKEENCLARTIQCWEKFSRGLTLAVDD VEKTYVLQECLLNFLREARVRAKLASPIVYEDKLWGLLIAHQCNAPRQWTESEKNLLT SVAEQLAIAIHQTELMRSLTQEKQTLEQRVVERTMALHDALVAAEAASRLKSEFLATV SHELLTPLTYVIGMSSTLLRWSFGELTQRQRDYLQTIHDSGEHLLEMINDILDLSQIE AGKAVLDITEFSLTSIAESTVNALKEKANTQGVKLKLDLQLNTQRDLFTADVRRVQQI LWNLLTNAIKFTPEDGEVILRLWVEDKTAIFQVEDTGIGIPEEQLSLLFEKFHQLDTP YRRRYGGTGLGLALTKQLIELHRGRIEVESTVGVGSIFTVWIPAQGSSSEC" gene 10981..11193 /locus_tag="DP116_01340" CDS 10981..11193 /locus_tag="DP116_01340" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01340" /translation="MPYRILFDNDQFLTFFLVFAGYSIVRLVYAAFLLRDAFYVTYLL LGRIPLQAKACQAALNACSKGTGFSS" gene 11489..12973 /gene="dacB" /locus_tag="DP116_01345" CDS 11489..12973 /gene="dacB" /locus_tag="DP116_01345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318158.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase" /protein_id="PRJNA477356:DP116_01345" /translation="MPKKITFGFLLLFLGTQVGITTKPVSAQQTQVVPTVVPTEATTK SICPAQLGGSIDAVINRPVFNRARWGILVQPLSSTQTLYSRDAQKYFTPASTTKLLTT AAALQQLGTNFRIRTSVYGGGNGVLYVVGRGDPSFTDAQLAVLAKQLKQRGIGRVNQL IADDTYIRGDIVHPSWQWEDIQSDYGAPINSLILNQNVFHIRLLPQSVGQPLKVIWND ISEAAQWQVINQSVTTAENQPTSINVTRDLKGKILRIQGQLALNSKPELVSLPVVSPA EYFSRRFRSTLIAEKIPVLQTFVSSTNGKNEEELANVESPPLSELVAQTNINSNNLFA ESLLRALGFQKPPTENQTSADAGLEVMKATLTQMGVDSTGYSLVDGSGLSRKDLVSPE ALVQTLQAMAKSPAALVYRASLPVAGRSGTLKSRFQNTPAEGIVQAKTGTMGGVVSLA GYINVPKYEPVVFSIMVNQSEQPARVVRQAIDEIVVLLAQLQNC" gene complement(13124..13825) /locus_tag="DP116_01350" CDS complement(13124..13825) /locus_tag="DP116_01350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318159.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01350" /translation="MVSQPLATTPSEIIYPESDGKPMADNTKQFRWIVQIQQNLDWLY ADDPNVFVAGDLLWYPLKGRNTIVTAPDVMVVLGRTKGDRTSYKQWEEDNIPPQVVFE ILSPSNSSTEMDKKLLFYDRYGVQEYYIYDPDKNILRGWLKAEDGLDVIGEIANWVSP RLGIRFDSSGEELQIYRPDGSKFFSYAEVNEQLEQEKQRAEQAQQELEKERQRSQNLE DVLKKYRDRFGDVIE" repeat_region 13880..14062 /inference="COORDINATES: alignment:crt:1.2" /inference="COORDINATES: alignment:pilercr:v1.02" /rpt_family="CRISPR" /rpt_type=direct /rpt_unit_range=13881..13916 /rpt_unit_seq="gtttccatccccgttaggggcgatgtaagtggaaag" gene complement(14569..15375) /locus_tag="DP116_01355" CDS complement(14569..15375) /locus_tag="DP116_01355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454634.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FHA domain-containing protein" /protein_id="PRJNA477356:DP116_01355" /translation="MPNLSTSHTTGQNLELFHLQSNTSFELPPNLPIFRIGKPNEEIA PDINVLALPNADVVSRLHAEIQVEENIYYIIDTGSSNGTFLNSVKLEPKKRYPLNLGD KIDLGQQEKITFIFQQKQNFASTSYSRLTRQPTVLQAQIVQPEMIGNNKQYQVDRPSK LVGLVLMVLGILIISTNTRIGFFIGLPSILLCLAGVIVLSRRHLNRKIGWILIGLGVA IMLFTGNFFASVNLFAILASSAFLFAGYQLFNTGKVLNYSLHSLKGFLKR" gene complement(15815..17641) /locus_tag="DP116_01360" CDS complement(15815..17641) /locus_tag="DP116_01360" /EC_number="2.7.2.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873382.1" /note="catalyzes the formation of 4-phospho-L-aspartate from L-aspartate and ATP; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aspartate kinase" /protein_id="PRJNA477356:DP116_01360" /translation="MALIVQKYGGTSVGSVERIQAVAQRVCKTVKAGHSLVVVLSAMG KTTDGLVKLAYEISQNPNSREMDMLLSTGEQISIALVSMALQELGHPAISLTGAQVGI VTEADHTRARILHIETERVKRHLNQNKVVVVAGFQGITRTEELEITTLGRGGSDTSAV ALAAALHADFCEIYTDVPGILTTDPRLVPEAQLIDEITCNEMLELASLGAKVLHPRAV EIARNYGVPLVVRSSWTDEPGTWVISPAPRPRALVNLEIARSVDDIEFDTNQAKVSML RVPDKPGVAARLFGEIARQNVDVDLIIQSIHEGNTNDIAFTVMAPILKKAEAVAQAIA PVLRSQKATENGEAEVMVLPNIAKVSIAGAGMIGRPGVAAKMFATLAEAGVNIQMIST SEVKVSCVIDNADCDRAVAFLRAAFEIEEDKQTTQQINSKFSSPSSDHPPVRGVALDM KQARLAIRQVQDRPGTAAKIFGLLAEHNISVDMIIQSQRCHMMNGVPTRDIAFTTARI DAQVAQKMLQQSAAEYGWGEVVLDNDIAKVSIVGAGMVGQPGIAATMFEALSQHQINI QMITTSEIKISCVVAQEEGVKALQVIHTAFGLAGSQSIQVPA" gene complement(17674..18615) /locus_tag="DP116_01365" CDS complement(17674..18615) /locus_tag="DP116_01365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410925.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01365" /translation="MLQKLRLNFSSLFGLLLLVLSLWAISHELREYRYQDVLNSLGKI PKRYLSLSILLSNIGYLVMVSYDALGFSYIGRFLALRKIAFTGFISSVLGNTIGFALV TGSAIRYRYYSNWGVSPLAIAQVIAFANFTFWLGVFAASGVIFIFNPLEIPTQIHLFF TDTRPLGVIFLLIVICYLLGSIFIKTPLVIRRKEFRFPSFKIAFAQVVISSIDWMIAA AVLYLLLPMNSVSFLDVLRTYSLAMFAGVVSNVPAGLGVFEIVILHFLSAKLSPVVVL GATLAYRAIYDLLALLIATSLLGFYEIKHNTRNIINR" gene 19075..19935 /locus_tag="DP116_01370" CDS 19075..19935 /locus_tag="DP116_01370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013323480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_01370" /translation="MTNDSFQDALAGSTIHLRRGINLQVCHNSGRTPAIVFLHGGTGN RFNFRSQYEFAQSQGWEVLVYDLAGHGQSSPYPRYSIGRHRRDLQRLLYKLGISSPVL CCHSYGVPIGLEFAQHNCVSGFIAIAGGTHNLAPWWEIPLMKFMAWGGRYLYSLPGVQ AISNFFSTSYRHSVIERFFAECPTPTDFQSYKALEIFWGYDFFARHPLPQNLHIPALI ITGGLDPMFTHQMGNDLARHFVNGTHLHFANAGHLVMAESPELINSAILKYLIGINTQ IPSSSQTSAL" gene 19988..20938 /locus_tag="DP116_01375" CDS 19988..20938 /locus_tag="DP116_01375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410926.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="esterase family protein" /protein_id="PRJNA477356:DP116_01375" /translation="MNCFKYKIVLLLSVLTFCSCANIKSAQAQVQMRPPTLTSVLPPQ PSASDLATLLTYKTETYDSKVMGGSRIYCVVLPPGYDQNQNQHYPVIFLLHGGNGNAD HWFAKGDALTVLQQLYATGKLPPSIIITPDGNDKRGSSRYWDPDYIDGPNGKVSTAIG DELVKVVQNRYRTLTSPDFWAMGGLSSGGWGAMNVGLHNLNNFSILFSHSGYFQDKSG PQNSPITYIKTISPQAKKRLRIYLDAGIEDTEVGLDESKKFNQILSTEKIYNIFHAFP GGHTWNYWHEHLADSLTFVGRQFKISAIIHASDNLGFKKP" gene 20935..21834 /locus_tag="DP116_01380" CDS 20935..21834 /locus_tag="DP116_01380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="esterase family protein" /protein_id="PRJNA477356:DP116_01380" /translation="MKLSKVLIGVAGAIAILTAAGYWYVFILGAPQLDSDPPQQQITT GLKFQLETFNSQAMGTQRQYGVILPPDYHKNLLKRYPVIFLLHGGHDDARAYVDKYRV LKVLHELYRDHKLPPSIVITPDGNDQRGSSPIIDPDYYDGPNGKVGTLIGSELVQVVK SRYRTLENPKFWALGGLSSGGWGAFNIGLRYLKNFNILFSHSGYFTDNSGPQNSPQQI VQQLPVQDRQQLRVYLDAGESDSNLLASTRKFHETLDKLGIENVFYAFPGGHGLSGPD VGWNYFHKHLKDSLSYVGKQFKE" gene 21809..22192 /locus_tag="DP116_01385" CDS 21809..22192 /locus_tag="DP116_01385" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01385" /translation="MWENSLKNNSETPLATTGGTPTPDASVGKPSSSTGSATQWLLRT QNSESISGDHQFPTNKRPRLRFNFGVLDLTNSTSHTAVQNSLLILPTSSIDILLLAQR VRRIYSDNCVLCSFLLKIHSKLSDQ" gene 22189..23853 /locus_tag="DP116_01390" CDS 22189..23853 /locus_tag="DP116_01390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864658.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01390" /translation="MTLDFRTRIGLWSAAFLTGLVGVVNLVSAVTPNLHERNHWLKHF LPFDIRASGHLFAALTGFVLLTLATNLLRRKRIAWLLTIGLLIISIFSHLIKGLDYEE SFLSGVLLMQLLLMRHVFTAKSDRPSVAQGVRVLIAALLFSLAYGTIGFYLLDGKFTE NFSWSDAIAQTFAMFFTDNNDGLKPKSRFGDFFANSIYIIAASTIAYALFMLLRPVFL REPTTVRERQQARDIVEKYGCSSLAAFTLLSDKSYFFSPSGRSVIAYVPKGRGAIALG DPIGPFEDRKEVIVSFQLFCQRNDWYPAFYQTLPNDISLYKSLGFQVLKIGEEGIVDL QTFTLQGKAGKNLRTAINRMTKLGYEVKFYEPPIADELLHQLKTVSDEWLQLVQGSEK KFSLGWFDETYLRECEIVTVQSSHGEIIAFTNIVLEYQLNEVTNDMMRHRKSIENGTM DFLFLSMFQHYKDRSYDSFNIGLSALSGVGKTQESGRLEKVLHYLYKHLERFYNFQGL HAYKDKFHPRWESRYLVFPSLTALPDVVVALVRADSGDRLLDYFKG" gene complement(24326..24694) /locus_tag="DP116_01395" CDS complement(24326..24694) /locus_tag="DP116_01395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868502.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01395" /translation="MYYIPEHPPYFLFVFGLFAALISDIALWGTLKVIVQKWQDEGAE TSGSRLPVKQLSVPFIGITVGLCVFLCCGFEIFGFPPLIAYGVGIPVAFITALLIWLQ LGSMLTFVERQGIQALDLDS" gene complement(24901..26595) /locus_tag="DP116_01400" CDS complement(24901..26595) /locus_tag="DP116_01400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315962.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AarF/ABC1/UbiB kinase family protein" /protein_id="PRJNA477356:DP116_01400" /translation="MNWQTLTQQNHRRYDPMAIARYYRYRPWLAWMRAIKIIWFFAVF ILSLKSDQWQNQEEQNKFKRATQLRQLLTRLGPTFIKVGQALSTRPDLIRKDYLDELI KLQDQLPAFDSALAYKIIETELERPISEIFSSLSSSPVAAASLGQVYRGRLMNGEEVA VKVQRPNLRPILTLDLYLMRWAAGWLAPWLPLNLGHDLTMIVDEFGTKLFEEIDYINE ARNAEKFANNFRDDLQVKIPAIYWRYTNTHVLTLEWINGFKLTETNKIREAGLDPELI IQIGVTSGLQQLLEHGFFHADPHPGNLFAMPDGRMAYIDFGMMDQLNETTKESLVDAL VHLVNKEYNELAKDFVKLGFLSSDTDIHPIVPALEAVLGDAIGKNVGNFNFKTITDQF SELMYEYPFRVPAKFALIIRSLVTQEGIALSLNQNFKILEVAYPYVARRLLTGESPQL RRRLLNLLFKDGRFQWSRLENLIAIARSDTNFDVLPTAQLGLQYLLSDESKFLRRQLA LALTEDDRLHTEEVQRLWNLVKDDLKPTRLFNVAIGVLTEFSREGVAAILPKALKE" gene 27193..28287 /locus_tag="DP116_01405" CDS 27193..28287 /locus_tag="DP116_01405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317893.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01405" /translation="MSITQPLTRRKRMSYTVAFVASVIIALLISCFATPSPGQTSTRD KFLWPFASSSPWNMAIGSSAYYIPANIGKAGYAAADKEYFFQLNNSDPWRPVYSPGAW GEGRCTGTTSMGTWLPIPNDLIIPDATSNPYSTPNNASAFLMPDGKTLIQLEPLARCK TGGDIYGWRYPNVDIYGDGIGGAHFGSGLSSIGGSIRKGELTSDQPIRHALKVVIWGE KYLYYSTSNPGYRWPADRADANAANQYHGKNPSLVQGTLLAIPPNVTEANLDLQTPAV KKLFHALQDYGAYVVDDAGWDAHYFAVEDGVTEEFRNTFGYDFEGSNGSFYEDFMKLF QALYIVDNNSSNSVGGGGTPRVALAPPIGN" gene 28498..29997 /locus_tag="DP116_01410" CDS 28498..29997 /locus_tag="DP116_01410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874597.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="site-2 protease family protein" /protein_id="PRJNA477356:DP116_01410" /translation="MLSSSETPIVATIVLVAFGILGWGFYRARSYGKLGILAWLQSVV LMTPWLLFFGLFAAGIYINIVGVLFLLVVSTGVYIFLGRQLRAGGQDAILRQRATQRL EADALEQASPTNNSNLPEGNAQLKPEVLAIPEEDLNMIKGIFGLDTFFSTETIAYQEG AIFKGNLRGDPEEVHNRLSASLQERLGDKYSLFLVENTDGKPVLIVLPSRNDPRPMTL PQKVFAVVLLVGTIATSLETAGLLLNFDFFANPERFREVLPIGAGILTVLIAHEIGHW LLARRHQIRLSLPYFLPAIQIGSFGAITRFESLLPNRKVLFDIASAGPAAGGIVSLLM LVGGLLLSHKGSLFQLPNEFFSGSILVGTLARVILGSALQSPLVDIHPLVVIGWLGLV ITALNLMPAGQLDGGRIVQAIYGRKIAGRATIATLIVLALVSLVNPLAMYWAIVIVFL QRDLERPSLNEISEPDDARAALGLLALFLMVATLLPLTPGLAGRLGIGG" gene complement(30043..31293) /locus_tag="DP116_01415" CDS complement(30043..31293) /locus_tag="DP116_01415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317920.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycoside hydrolase family 10 protein" /protein_id="PRJNA477356:DP116_01415" /translation="MKKLVKWCNRLLSQEPLAINKQSFFALLVALSMVAAMLLSFPSY AQNTSYRPQTSELRGVWLTNIDSDVLFGRDRLQNSLQSLKNLNFNTVYPTVWNWGWTL YPSKVAQKVIGRSLDPAPGLQGRDILKEIVTVGHQKGLTVIPWFEFGFMAPADSQLAK NRPQWLTSRSDSSKIWKEGTHDRVWLNPFHPEVQQFIQDLVVEIVKNYNIDGIQFDDH FGLPSELGYDAYTVALYKKEHRGQAPSTNPQDAEWVKWRANKITDYMKRVFQAIKATK KNCLVSVAPNPQDFSYKTFLADWQSWERNGLIEELVIQLYRDDLNVFVKELEYPEVKA AQSHIPVSIGIMTGLKAKPISMQQIQTQIQKVRERNFAGVSFFFYETLWNVSGETPQQ RQAGFQKIFPTSVAYPNLLAGWKR" gene 31907..33595 /locus_tag="DP116_01420" CDS 31907..33595 /locus_tag="DP116_01420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315954.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01420" /translation="MKNILKNYTTTKQAQIVTALVLSGILSIGSSLSAIKSAEAAPTN YFPSTANQVLKENIKTNSLPRPVASAILRDLSNREPTHVRKIEIIDYTQRTWRDGCMG LPQPDELCTQALVPGWRVVLSNGSQTWIYHTDTNGRFIRLANPYILADNVPQNFPSYI EDAVLQAASQRLGLPTSRVTIIQAEQRTWNNGCLNLPNSGEACTEALEKGWRVVVKSP EQTLVYHTNTTGSKIRFNKKESEFSEGKLPATVRDAVLRRASEESGLPEKSLSVAASQ PTRWNECDLQSNANPCDSAVSGWQVTVAAGLNRWVFLTDERGSRIQLSRQYSQTPNVN LPRDIAERVLVRASKRLKAPISQLGIIEVQPKQWPDSCLGLADALTSCAAVIVPGWEV IVSDGQQRLVYRVGESGAVFLDEKASPIADDNNSLKPISIPISELPQPLDSGVIFRQI SSGGFTGRTYETVLLNDGRLIRVRIGDINDSERSVRRIPLKQVEKFQQLLERQGDEFK NLSYPAPNGAADYITYTLTNRYGTVKYNDISQKSLPEDLRLIVKAWNRISSRDQ" gene complement(33692..34750) /locus_tag="DP116_01425" CDS complement(33692..34750) /locus_tag="DP116_01425" /inference="COORDINATES: protein motif:HMM:PF13426.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01425" /translation="MSKASRIGNKSCLKFKEVNVAAGGRRFVNLNDFYRQMIAARQRA AELCHRANESFFQEKELLVLGYEELYATLSKLEVAFEELHQINWELACAHQVVQMERQ RYRNLFEFIPNAYLVTDGKGIIQDANKAAAMLLGVQQYFLVGKPLVIFICWQERRAFL SKLLKLHLHHQTQEWEVRLCLRNGDSIDATLTVSIADSWGGKLNTLGWLIRPITERKQ VDPAPWLLSNTVQYATESIIVTEANLDEPGPKIVFVNPACTKITGYTSQEMIGKTPRM LQGPKTERSVLYRLRRNLSQGQFFHGELINYRKDDTEYNMELFCAPIHNERGDITHFF SIQRQIIPIQNSKFKIQN" gene complement(34747..36381) /locus_tag="DP116_01430" CDS complement(34747..36381) /locus_tag="DP116_01430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015187568.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chemotaxis protein CheB" /protein_id="PRJNA477356:DP116_01430" /translation="MPGHDIIVIGASAGGVEALKELVAPLPKDLRAAIFIVVHIFAQS KSFLPDILSRCGSLRATHAKDAEAIEHGHIYVAPPDYHLLVKRGYIRVVQGPKENRSR PAVDPLFRTAAKAYKSRVVGVVLSGTLDDGTAGLIDIKKLGGVAVVQNPDDALFSGMP NNAIEHVDVDYILPVVSIAPLLVHLACEPIPDQGALNMSNEDELEMEPDIVEVDGAGL RNKGLPGISAGLGCPDCGGVLFQLQERNLLQFRCRVGHAFSAASLQAAQAQVQEEALW AAIRSLEERAELMSNMATDARSKNRTKSANLFEAQAQEAQQRSDFIRQALFMGQLPVG ATGTAINQVPNKGTEQELTADKVVVLAAGDGGISALSQILVALPVNFPAAIIVVQHLD TQSDPSLMAIALTDSITLPLKLAQEGERLRAGTIYFAPPNEHLFVTPNGTVCLSQAVL VDFVRPSADLLLESVAASFKHRAIAVVLSGTGNDGALGVQAIHQMGGKVITSDDSTSE FFDMPDAAIATGTVDFVLPVNEIASSLLNLVTEATD" gene complement(36483..37796) /locus_tag="DP116_01435" CDS complement(36483..37796) /locus_tag="DP116_01435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152881.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_01435" /translation="MQKDYGCQQVLLNRSLLVHPVLEYLCTQAAKLTNCGIYYGRQVW FKERRYLGKYDLEKEYKTNRNYQSLHSQCAQQVLRTVAESFKSFYELDKKARKGDLNQ KPRLPNYRKNGLASCTYPKQALKLIDNQIRIPLGQTVNTWFGIDSFTIPAPSNLDFAS IKELRILPRNREFYAEFVYEQSRAKRPRLNEKWALGIDPGINNWLTCVDSSSSGEGFI IDGRHVKSLNRWYNKQISTLKEGKPQGFWSKKLAAITEKRNRQMRDAINKAARIVVDY CLRMKVGNVVFGWNEGQRQEANMGRRNNQAFVQIPTYRLKERIRQLCLRHGINFVETE ESYTSKASFLNHDFLPTFGEKPTSWKPSGRRTKRGLYKSLWYGTINADCNGAANIIRK VDTTLDIDTSRVSRGALTRPTRIKVWVTAKKSVCAAPLKGASGTG" gene 38435..39037 /locus_tag="DP116_01440" CDS 38435..39037 /locus_tag="DP116_01440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858473.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NIL domain-containing protein" /protein_id="PRJNA477356:DP116_01440" /translation="MALSTTLNSTLGHIHIIIPQHYHRQPIVSRLISRYDLIINIASA LLESHAKDDGLFNLEIQGVSQQIEASLSYLQELNVEIVELDFKSIVQENQDKFQILCT SHNFSDIIDGNEKKADSHVVKGQTSRAKFQVCIPKNYQSYPVIAGLVYCYGLTVNISG AVLDTNPENDGWFDLEVWGRRQQIVLGLRYLKELGLQIWL" gene 39100..40452 /locus_tag="DP116_01445" CDS 39100..40452 /locus_tag="DP116_01445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874604.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="selenium-binding protein" /protein_id="PRJNA477356:DP116_01445" /translation="MTHACCGPGYASPEAATKAEREKVLYTIAIYTGSSIVEPDYLAT VDVDPNSPTYAQVIHRLPMPYVGDELHHFGWNACSSCHGDASKSRRFMVIPGQRSSRI HIVDTADIKAPKLHKVIEPEEIKEKTNLTAPHTVHCLADSHVMISMLGDSEGNGPGGF LLLDENFDIAGRWERKADGMRFNYDFWYQPRHNIMVSSEWGAPKTFYPGFDLNDVAAG NYGHQLHFWDWSKHEIIQSFDLGEEGLIPLEVRFHHNPDSTHGYVGAALSSNVWHWHK SNGHWQVEKVIDVPSVEVEGWPIPVPSLITDILISIDDRYIYFSNWLHGDIRQYDISD PSHPKLTGQVWLGGLLGKSSEIQSHKLTGGPQMLQLSLDGKRLYVTNSLFSTWDNQFY PDLAKAGSYLLQIDCDTENGGLKINENFYVDFGKEPAGPSRAHEMRYPGGDSTSDIWV " gene complement(40543..42075) /locus_tag="DP116_01450" CDS complement(40543..42075) /locus_tag="DP116_01450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195319.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SagB-type dehydrogenase domain-containing protein" /protein_id="PRJNA477356:DP116_01450" /translation="MPELRQSIAQHYHERTKYDPQTIASKNKGLNWSKQPVPFKEYKI GSTFDLKPYIQEKAEAFVDEPDKQWWQRLSRLLFCSYGLTAKMPSMGNTVYLRAAPSA GGLYPAEMYVVSRGTVLLPPGVYNYQCRTHSLIAYWESDVWQKLQEACLWHPTLQKTQ LAIIITAVFYRSAWRYEDRAYRRICLDTGHLLGNVELAGAMSQYRPHLIGGFVDEAVN ELLYIDSLHEGAIAVLPLADLLDIKQNIPTGQTALPSATETNYPHIPDGKLLKYFHDC TQILPGTTDKLTSGEVKQEKSLEDKYNFPFCEKISTVSQSIHWGEKLEGLEVTILKRR STRAYSGDDLTFDELKALLDFSYQPQHYVDQGLDHSTDYFDLNLIETFIAVSGVEGLE AGCYYYAPKAQELRQIRFKNFRKELHFLCLGQELGRDAAAIVFHTADLKSAVGQYGDR VYRYLHMDAGHLGQKLNLAAIRLGLGVSGIGGFFDDKVNELLGIPADEVVLYMTTLGR PR" gene 42641..43507 /locus_tag="DP116_01455" CDS 42641..43507 /locus_tag="DP116_01455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_01455" /translation="METKQLGKTGVSVSAIGLGAMPMSISNRPPESQSIDVIHRALDL GITFIDTADSYCKDESDKHHNERLIHQALESYKGDVSHVVVATKGGLMRPNGNWTRNG NPQHLRETIRISFEALGGKKPIDLWQYHAPDPDYTIEESLAPAKEAVDAGMIRFVGVS NFSVEQIKRARDVVDIVSVQNQYNPWQRQPEFDGVLEYCQHESLTFLPWSPYGGSRRH DGLEDIGAIAKLAKEKGVSVYNIVLAWLRAKSPAILPIPGASKTSSIEDTVHAVDVKL SDDEVQRIDREI" gene 43994..44527 /locus_tag="DP116_01460" CDS 43994..44527 /locus_tag="DP116_01460" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01460" /translation="MWVWVIAFFSITAFDGWLIKLSSFVSVPTVIYFAIFPLASIISR RLSKTTWRLVLFVVFSRSILAISLCILHLPKLSPFTIIMYCINILLFAFLLLDVISSR GIVAPALTVILVILLWMIIGWQVVIRLVLYCNLVVIFNKAATRLNKYLNSFNAAMAVL TGTAAFGLALGWMLASL" gene 44608..45600 /locus_tag="DP116_01465" CDS 44608..45600 /locus_tag="DP116_01465" /EC_number="1.1.1.86" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006277981.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ketol-acid reductoisomerase" /protein_id="PRJNA477356:DP116_01465" /translation="MARMYYDEDGNLDLLAQKTIAIIGYGSQGHAHALNLKDSGLNVI VGLYPGSKSAAKAEAAGLTVKNVADAVKAADFIMILLPDEVQKTIYKNEIEPNLEEGN VLLFAHGFNIHFGQVVPPANVDVVMVAPKGPGHLVRRTYEQGEGVPCLFAVFQDASGQ ARDRAMAYAKGIGGTRAGILETTFREETETDLFGEQAVLCGGLSALIKAGFETLVEAG YQPELAYFECLHEVKLIVDLVVEGGLAKMRDSISNTAEYGDYTRGPRIVNEQTKAEMR KILQEIQSGQFAREFVLENQSGKPGFTAMRRQEAEHRVEEVGKDLRAMFSWLKK" gene complement(45766..46035) /locus_tag="DP116_01470" CDS complement(45766..46035) /locus_tag="DP116_01470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860487.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GlsB/YeaQ/YmgE family stress response membrane protein" /protein_id="PRJNA477356:DP116_01470" /translation="MSLIAWVVLGILAGAIAKAIYPGYQGGGILATIVLGIVGAFIGG TLVNVIQTGTLTITSATLTLPGLFVAVLGAMIAIFLYYQFSRRAY" gene 46515..46769 /locus_tag="DP116_01475" CDS 46515..46769 /locus_tag="DP116_01475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873556.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01475" /translation="MQSIKLRSRVGQDGILHLEIPVGIADREIEVMVIYQPLEPSTQQ KTPEELGWTPGFFEQTAGCLQDDPLVRYPQGEYEQREPLE" gene 46766..47173 /locus_tag="DP116_01480" CDS 46766..47173 /locus_tag="DP116_01480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015159896.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_01480" /translation="MIYLLDTNACIVYLNRPVSGVRRRLQSLSPQDIAVCSVVKAELF YGAMKSKNPTRTLALQEAFLNNFVSLPFDDTAARIYSRIRADLAALGTPIGPYDLQIA AIALANNLTLVTHNTGEFSRVEGLQISDWEEEG" gene complement(47455..50094) /gene="clpB" /locus_tag="DP116_01485" CDS complement(47455..50094) /gene="clpB" /locus_tag="DP116_01485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-dependent chaperone ClpB" /protein_id="PRJNA477356:DP116_01485" /translation="MQPTNPNQFTEKAWEAIAHTPDIAKQYQQQQIESEHLMKALLDQ EGLANGVFTKAGVNLQKLRDKSEEFIQTQPKVSGSSSSVYLGRSLDTLLDRADGYRKE LQDEYISIEHLLLGYAKDDRFGKKLYQEFGLDESKLKNIIKQIRGSQKVTDQNPEGKY QSLEKYGRDLTEAARQGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIA EGLAQRIIAGDVPQSLKDRKLIGLDMGALIAGAKFRGEFEERLKAVLKEVTESNGNIV LFIDEIHTVVGAGATQGAMDAGNLLKPMLARGELRCIGATTLDEYRKYIEKDAALERR FQQVYVDQPSVEDTISILRGLKERYEVHHGVKISDSSLVAAATLSSRYISDRFLPDKA IDLVDEAAARLKMEITSKPEELDEIDRKILQLEMEKLSLQKESNAASRERLERIEREL ADLKEDQRTLNTQWQSEKDIITKIQSVKEEIDRVNLEVQQAERDYDLNRAAELKYGKL TSLHRDLEAVETELAQAQRSGKSLLREEVTEADIAEVISKWTGIPISKLVESEKERLL HLEDELHHRVVGQEEAVSAVADAIQRSRAGLADPNRPIASFVFLGPTGVGKTELAKAL ASYMFDTEEALVRIDMSEYMEKHAVSRLIGAPPGYVGYDEGGQLTEAIRRRPYAVILF DEIEKAHPDVFNILLQILDDGRVTDAQGHTVDFKNTVIIMTSNIGSQYILDVSGDDSR YDEMRHRVMEAMRNSFRPEFLNRIDEIIIFHSLQKQELRRIVQLQVARLQQRLSDTSG GLRQRKMSLKLSDAALDFLAEVGYDPVFGARPLKRAIQRELETQIAKAILRGEFNDGD TIFVDIENERLAFKRLPVQVFTS" gene 50733..52088 /locus_tag="DP116_01490" CDS 50733..52088 /locus_tag="DP116_01490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745882.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_01490" /translation="MAFSTLVSVLAIREILLVRLEQEIEKYLTQEVNEFRRLTQGKNP STAQPFGDDVAAIFDVFLSRNIPHENSFLITLLSGKFYKSSPQALPIGLQPNSALIKD WQKLKQLKQGKEFISNHTIYYMAEPILKGKTQGVFVVTYSSSSAHQQVNQAVVVIIQV TIVVLAIASVLAWVVAGRLLAPLSLLIETAHLITESDLSRRIPVQGVDQIAELSITFN EMLDRLQTAFASQRNFINDASHELRTPITIIRGHLELLGDDPQERRETVELVTDELDR MSRFVDDLLLLAKAEQPNFLNLQTVDISLLIGELYTKATALAQRDWRLENKGVGLIVA DRQRLTQAIMNLAQNATQYTSDGDVIALGSEVLNGYAYFWVRDTGVGIAPTDQERIFE RFARGSHSYRRSEGAGLGLSIVRAIATAHGGRVELKSKLGKGSTFTLIIPLDPPFGDS V" gene 52085..52756 /locus_tag="DP116_01495" CDS 52085..52756 /locus_tag="DP116_01495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_01495" /translation="MTDRILIAEDEPRIATFIEKGLRASGFSSAIAKDGHEALSMAQT GDFNLLLLDIGLPGKDGWMVLEELRGQGEQISIIILSARDEVSDKVAGLEGGADDYIT KPFRFEELLARVRARLRDNRLVRRQEETILKTGKIVLNLLTRQIWVGDHLLKLSAREF ILAETFVRHPGQVMSREQLLSRVWGYDYDPGSNVVDVCVGSLRKKLGHDYIETVRGMG YRLRT" gene 53145..54686 /locus_tag="DP116_01500" CDS 53145..54686 /locus_tag="DP116_01500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013793403.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SulP family inorganic anion transporter" /protein_id="PRJNA477356:DP116_01500" /translation="MTIRDFKAKRSIVNDLVASVVVFLVALPLCMGIAIASGVPPELG IITGIVGGIIVGTVAGSPLQVSGPAAGLAVIVWELVQQYGIEMLGPILMLAGLFQLLA GIFKLGQVFRAISPAVIYGMLAGIGVLIFASQFHVMFDSKPSAHGIDNLISIPSQIYK TIFSAQGNNHLIAGIVALITIITLILWEKFKPKRLKLLPGSLIAVVIATAIATVMKLP IQYVNVPDNLIGTIHLPKLENFIGLLKPSVLMEAMAIAFIASAESLLSAAAVDRLHFG PKTNFDRELAAQGFGNMVCGALGALPMTGVIVRSSVNVEAGGKTRLSAIFHGVWLLAL VVAAPSVLNLIPTSCLAAILVVTGYKLVKVENIRKLQQYGRIPVFIYFATLGGILTAD LLFGVLLGLVLSALKLIYKVSHLSIHVLSDENNQRVDVYLDGMATFIRLPYLAKVLEQ ISPGKEIHIHLEMLSYIDHSCLDFLSMWEKQEEKKGSTVVMQRDRLVERYRKPLISGR SHLAA" assembly_gap 55336..55345 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(55547..56671) /locus_tag="DP116_01505" CDS complement(55547..56671) /locus_tag="DP116_01505" /inference="COORDINATES: protein motif:HMM:PF13424.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01505" /translation="MAWFIAQQGNISRALALWEQSLEIIERIGDVDGKAATLNNMAWF IAQQGNIPRALALWEQSLEIKERIGDVGGKAATLNNMAWFIAQQGNIPRALALWKQSL EIIERIGDVDGKAATLNNMAQVIAQQGDIPKAITLWEQSLEIKERIGDVGGKATTLNN MAGVIAQQGDIERAIALWQQSLEIYEQIGDVDGKATTLANMAYVAGETGDKARELDLN LQAALALAQVCAYGDLVTVLGNLGVTAQSNGLVYLAQAMWLTVRIHAPLAKTIQLIHN LYNAVPRGDELQSLLGATALFFCNCRGKGHPQLEELQGLSVRMISDAAVAQGIETQEA FDTWFVQQRLKDPEYFLTRLTQRLEEIVGDGWLFDRSQVS" assembly_gap 56705..56714 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(<56715..>57545) /locus_tag="DP116_01510" CDS complement(<56715..>57545) /locus_tag="DP116_01510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206680.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01510" /translation="PLLSTTSPPAPLLSKERGARDEQRLCLEKLVSLSLVESATTHAT QTAEYRVTTILEFLLEPILSEEEWLTTRQQAARKIYLVWWEKSEKHTEEQGLEIVRLG LLAGEEEIAVSVGDRIVSHWANSSRFVETFQLCDQLLQKFLDYRILGTIARVEEVLGR VEDAVAHYQQALDLCPEDDDTRKAATLNNMAGVIARQGDIERAIALWQQSLEIVERIG DVGGKAATLCNMAGVIAQQGDIERAIALWEQSLEIVERIGDVGGKAATLNNMAWFIAQ " BASE COUNT 16013 a 12563 c 12308 g 16641 t 20 others ORIGIN 1 cccgtaaggg tcttggggtc tccccaagtg gagcatctgg cgttggaaac ccgtcattcg 61 cgctgtctca ctccctcacg ctcttatccc ccacatcata tgactaatca tttttgaagg 121 aaaattcgta aattgtcagg aaagcaccta gagactcatg tagatatgac tatgcattta 181 aagaatcgaa tttttcacta atatgtattt atactttact gcctaaacta cagttaaaat 241 tatgtgaatg ttcaaatttt catcaggtca tttgcaattt aacgccccgt aagacattta 301 aatagctatt ggttcatggg aatgctactg taacggagcc ttcagaggca ctgtgtcaat 361 aaaaaccagt acaggttttt tttcaaaaac cctaaatgga accctcaacc gtgttagttc 421 ttcaaaaatc acgggaacaa tagcaacagt taattagctg accgtaatct attgttacct 481 ggagccattg agacaaatct ttatgccagc aacatctttt tacgcagatg ccgctttcaa 541 taacaaacaa tccgatccag tctttgaccc agatatcacg gttgacgaaa ctgagttgcc 601 gattgatgaa cttgatgatc tggagatagc ttctgttgac tctgcgagct taggtgcaaa 661 tctcaaccgc cgcagcacag atctagtacg tctatacctt caagaaattg gtcgagttcg 721 gttactagga cgggatgaag aagtttcaga agcacaaaaa gtccagcgct atttgcggat 781 gcgaatactt ctgtctaaag ctgcagaaca aggagatgaa gtgatcgcac catatctccg 841 actgattgaa actcaggaac gtttggcatc tgcattagga catcgtcctt ctcttgaacg 901 ttgggctggt gaagctggtg taggtctttt agagcttaag cagattatag gactaggaaa 961 acggcgttgg gctgaaatcg ccaaaataac tgtggaagaa ctggagaaaa ttcaaagcaa 1021 tggactccag gcaaaagaac acatgattaa ggcaaatctt cgtcttgtcg tctccgttgc 1081 gaagaaatat caaaatcgcg gtttggaatt gttggactta gtccaagaag gaactcttgg 1141 tttagaacga gcagttgaga aatttgatcc aaccaagggt tatcgtttta gcacttatgc 1201 ttattggtgg attcgtcaag gaatcacacg ggcgatcgca acttctagcc gcacaatccg 1261 cctacctgtc cacatcacag aaaagctgaa caaaattaaa aaagctcagc 
gtaaaatcgc 1321 acaagaaaaa ggtcgtaccc caactttgga agaccttgct caagagctag acatgacacc 1381 tacacaagtg cgggaagttc ttttgcgagt tcctcgttct gtttctctag agacaaaagt 1441 tggtaaggac aaagacacag agttaggaga actactcgaa actgacaata tcactccaga 1501 agagacatta atgcgagagt ctttacaaag agacttgcaa aatcttctgt cagatttaac 1561 aagccgggag cgtgatgtga ttctgatgcg ctttggtttg gcagatggtc atccttactc 1621 cttagcagaa attggacgtg ctttggattt atcacgggaa cgggtacgac aaatcgagtc 1681 taaagcgttg caaaagctgc gccaaccaaa gcggcgcaat cttatccgcg actacttgga 1741 gtctttaagc tagttagtcg atagtcaata gtcaatggtt accactcaaa aagaagaatt 1801 cggaattcgg aattccgaat tcggaatcca atcagtaagc tcttgaactc atcagtctta 1861 ctgaccacga aatttttgat tcagtgagag cttgaacccc attttcatcc gccgtattcc 1921 tacggcactg ctgtgccgta tgcgcaaagc gcacgaatcg gacctgttca aagcagcgtg 1981 tcctctggac ataggactta cgggtatctt ctccataaga ggctgcttta aacaccagat 2041 acctctgtcg ggagacgtcg ctgcaggact tgttcaccac ttgcacagaa atataccccc 2101 taactcctga ctcctaagta gttactaggt caaaaaactt atacattcta cacataaggc 2161 aagatgcctg ctctaaaaga cttttaacta atggttgttg actcacttta tctccgttcc 2221 caaaactttt cactaactta tggcaagtta ttttcatcta attgcataag aatctttcga 2281 ttcgtctttg ggaacattta tagaggatag gatggggatc gcttgttttt ttacttactg 2341 ctcaaagttt caaaatttac ctttaatatt taataactct tctcaagagg gtttgattat 2401 tgcaaatcag tccaaatttt tgctatattc gcaactaatg gactttgttg agaaaaatta 2461 gtatgacagc ctatacagca gcctcgctaa aggcagaact taatgaacgt ggttggcgtt 2521 taacacccca gcgcgaagtc attctacaca ttttccaaga actgccgcaa ggagaacacc 2581 tcagtgctga agatctttac gaaaggctag aagctgaaca cgaaggaatt agcctatcga 2641 cgatttatcg gactctcaag ttgatggcgc gaatggggat attacgggag ctggaactag 2701 gagaagggca taaacattac gaacttaatc agccctatcc ccatcaccac catcacctga 2761 tttgtgttag gtgcaacaca acaattgagt tcaagaacga ctcgatttta aaaattggag 2821 cgaaaacagc acaaaaagaa ggttttcacc tgcttgattg ccaattaacc atccatgctg 2881 tttgtcccaa gtgccaaagg gcactcatgc cactgtagca cctgggtttt acgattcaag 2941 aaagccctgg ctgcaatgaa atattttggt aaatttcatt agcagctttg aacttgttgt 
3001 tatgggaggt tggcaaactg ttaggtatca acaccaacac ctgagcggat gctaaaagcg 3061 tccgcttctg acattattca gactgatacc gtagaattgt tgtgcttgtc taatagcttt 3121 ctttgtctcg tctcccatca caccattaag agcacctttg taaaaacctt ggtctcgtag 3181 tcgtctttga atctccaaaa cattaaaatc accacttgtg ggagcatcat ttgccatgct 3241 gtacaattta gcgcgggttg ttggaccagc aataccgctt gcggcgaggt aattatctgt 3301 ctggaaccgg cgtacagcat ctgctgtatt aggaccaaag taaccgtttg gctctccttg 3361 taagtatcct gctttaatta attgttcttg aagaactctt acagcctcac cacgatctcc 3421 caagcgaagt ttatctccag tagcaggtgg cttactaact gaatcttctc cataaccaat 3481 agagagtgtt ggaagttttg cttgcgttac tgatccgaca actccatcgg gggttaagtt 3541 ataagcttct tggaaccgct tgacagcttc ttcagtaatt ggaccgaata ccgttgtggc 3601 gttacctgta taaaagcctg caattcgtaa ttgttcttgt agaactctga catcctcacc 3661 ttcatctcct ttttggagaa cgttagaact gcgacgcttg ttagttactg atggattttc 3721 agaactggtt gctatgactt gagagttgtc tgtactggtt cttctggctt gaacacttcc 3781 agtgctgatc tttgtcactt gcgaactcct attagagata cgccaattct ctaatttttc 3841 taaagtgaca gacccagcaa caccgttaac atctaaacca gcagctcttt ggaagcggcg 3901 cacaccagct tctgtctcag tgtcatagag ttgagtgacg gaagcttgat aaaagcctgc 3961 tgctttcaac tgttgttgta aacttctgac agaaggaccg cgatcgcctt tttcaagtgc 4021 taaggcgctg ctcacacaac taagaacaga taaaccgagg gctaaaggta gcatatattt 4081 ccaagcccta ccagacagac gtttccaatc tggtgctact gtttgcttaa acaaacggct 4141 gagggagatt aattcactgg gaggtaagtc ctcgtagacc gatgaaacgt gtaaatacgc 4201 aagggtttcc ataatttgtt acaccattta tttttcttca atagctgctt cagcttgatg 4261 tactatgttt cgtaaatgtt ccagtgtttg tgtatttaca tcctcccata aaccgcgctg 4321 atgtgcttct aacaatcttt cagccatatc acgcaaagca tatgggttct tttgatagac 4381 aaattccgaa acaactggat cgaacaaata tgcctcggtt atcccttgat acatatagtc 4441 ttccacacat tgtgccgtgg catcgtaagc aaataaataa tcaacagtcg ccgccatttc 4501 aaaagcaccc ttataaccgt ggcgcatcat tcctgctatc catttaggat tgacaacccg 4561 agaacgatac actcgtgcga tttcttcttt aagttgacgg acttggggtt tggctggaat 4621 ggaatgatca ccaaaataag tttggggatt ctttccctgt acagcacgta ttgcagctgt 4681 
taaaccaccc tgaaattggt agtaatcatc agaatcaagc aaatcgtgtt cgcgattgtc 4741 ttggttatgt aacactattt gcatttctcg caaccgcatt tcaaatgctt ctggcgcgga 4801 acgtccttgt tgaacggatg acggagatga gggagagact tctaatttgg aattccgtat 4861 tccgaattcc aaattgtctt gtccagctgc tgtggaaact ttggctgcag gaggacaaga 4921 gtaagcatag caactccaat tgatgtaggc acgagctaaa tcttgatcat ctgtccagtt 4981 ttgtccttca attaaacctt ggagtcctgc accataagct cctggcttag aaccaaagat 5041 gcggtagcgc gatcgcactc ttgcgtctgg taaacttaaa ccaagttgtg tccaatactc 5101 agtttcttga cgcacttgcg ccgctagagg attttcttct ggcgtttcgt ccaaagccgc 5161 cactgcttgc acagccgagt cgaacaaatc aatcaagtta ggaaaagcat ccctaaaaaa 5221 accagagatt cgcaacgtca catccacacg cggacgtccc aaaacagaaa gcggtaaaat 5281 ttcaaaatct acaactcgcc gcgccgcacc atcccataca ggctgcacgc caagtaaagc 5341 taacgcctgt gcaatatcat cacctccagt ccgcatcgtg gcggttcccc atacagataa 5401 acccagtgtt ttcggatact caccctcttc ttgtacataa cgctcaataa gtgcctcagc 5461 ggcttttcta cctacatccc aagcagtttc ggttggtata gcgcggatgt caacagaata 5521 aaaattttta ccagttggca ggacttccgg acgtcctcgc gttggtgcgc cagccggggc 5581 acttgggaca tatccaccat cgagtccatg taacaaattg gtaatttctt gctgggtttg 5641 tagtaaagaa gggagaaggg tggtgcgtat ccagtcaagg acagttggga gtgaggaatt 5701 cggagttggg aagtcgaaat tcggaatttg gagttcttct actaatttgg cggcttgttc 5761 ttctaaaaat tcgacagcat cgccgacagt gcgacagggg tgttgagttt tgttagctaa 5821 aatttgaata tcttggactg atagcccagt actaaaatca gtcgtgaggg ggtcaaaatc 5881 taagccaaaa tcttgagcta aagcacgagt cagtccgata tgatggcgat tgggatgacg 5941 tgcgatcgca acaattaaat ctcgtaactg tcgcccttga ggacattgtc caaaaacatg 6001 taatccatca cgaatttgag attccttcaa ttcacaaaga taaccatcaa gtgaaggtaa 6061 aactgaaaaa tcaaaacttg aaacagaaaa ttcttctttt cgcttaatcc cttttgcctt 6121 ttgcttcaat tccaaatcca agaaaagatt ttctttgaca acaagttcgc gtatgcgatc 6181 gccaatcatt ggtaaacgag aaggatccaa actctgagcc tcataatact catcaatcaa 6241 attttccaac tcatgtaaac aaccataaag ttctgcacga gttagaggtg gcgtcagatg 6301 atccacaatc acagcttgag cgcgacgttt cgcctgcgaa ccttcaccag gatcattcac 6361 aataaacgga 
tacaaatgtg gaagcgcccc caaagccact tctggataac actcactcga 6421 caaagccaca cttttacctg gtaaccattc taaatttcca tgttttccaa catcaaccac 6481 tgcatccgcc ccaaaacatt gtctcaccca ataataaaaa gccaaataat cgtgagttgg 6541 ctctaaatca gacgcatgat aactcaaact tggatcaaca tcataacccc gcgctggttg 6601 aactcctaca aaaatgttac caagttgaat tccaggaaca ggaaacgagg gaggaaaatt 6661 aaaatcttct tctctttctt cctttgcgtc ctttgcgcct tggtgagtcc agcccccgtc 6721 ttggcggctt ccgtcacgat gggggactgg cgaacccgaa gggcggttcg tttcaaaaac 6781 ccctccccac ctatcgccaa ttcccttctg cacagcatcc ggtaacgaac taaaaaactc 6841 ctcatactcc gccaaagaaa cacactgcag caccggacgc aactccctac cttccgagtc 6901 attcgtcacc ccagcagtta gacatgaaat caactcatct ccagtgtcag gcaaattttc 6961 cacctgatac ccagccaact gcaaagcctt aagaatttct acacaactcg cgggcgtatc 7021 caacccgaca ccatttgcaa ggcgtccatc acggttcggg taatttgcca aaatgagcgc 7081 cacacgacgt tcttgaggag gttttgagcg caaacgtacc caattggcgg ctaaagaagt 7141 gacaaactca atgcgatcgc ccactggttc ataaaccacg acatcagttt ccaaagaaga 7201 attcctactt tgtaccgtct taaaagacac agcacgactg ataattcgcc catctacctc 7261 tggaagcgcc acattcatcg ccatatcacg gggtgaaagc ccttgaaatt ctaccctcca 7321 ctgttcaatt gaaccaccac tgaggataac ttgcaacacc ggcacatcca atttttgcca 7381 caaatcaatc tgcggtgttt ctgtttccaa ccgggctaaa gaaaaactag tcgtattcag 7441 cagcagcgca atttgcggag tgtctttggg ttgaaaaaac tcacacaact cctcttgaac 7501 atcaacatca cgcaaagaag aaacaaaaat tggaactggt gtgagatttt gctttgccaa 7561 agcattgcac aaagcatcaa tcaccttagt atttcccgcc aaataatgag cacggtagaa 7621 gagaattcct actttagaag tcacagattc agggagtgag cgagtgaggg agttattttt 7681 ctccctgttc cctgttaagc gttccctgtt ccctgttaac cgttccctgt tccctgttaa 7741 gcgttccctg ttaagcgttc cctgttccct ctcccactca tacaacccca cacgaggaac 7801 aacttgcggt gatggaggat taaaaaaagt tgaaaagcaa gtatcacaga taaagtggag 7861 agcgtggaga aaattttcga ctccaccttc gttaaagtat tgccatacct gattcacagt 7921 actcaaaggt agggtagact gagagatgaa atcggggtct attgcgtcat ctcctggcat 7981 tacaatgaga gttgtaccat tacgttccac aatttcctgc acgacttcta agccataagc 8041 ccagtaagag cgtcctccta 
ataggcggag aattatcacg cttgcggact ctaaaacttt 8101 ttcggcatag ctatcaatgc tgatttgata ctgcaattgc agtaagttaa caactcttaa 8161 accaggaaat gtggagggta atttgggtac agcagccgct agagtttgaa tatcggtatc 8221 agcagacgtt agaaatacta tgggagcagg ggtttgttcc agaaaaatga tattttccga 8281 ctgaggattc catcctcccg atgtggcatt agtacgatgc ataatcttgt cttaacgtct 8341 taaactgttt gttaggacac aatacaaaat tttttgagca atagcaaaaa gcccctttta 8401 ctcctgtttg taacaatgtc tagttctgcc gatttgagtt tctctcgaac cttgcctagt 8461 aatgtgtttg atcagcttgg agcaattttg caacagatgg ctcaagcagt aggaaagggg 8521 gctttagtac tgacagaagc tgtgttgatt ccaattgaca tacctcaaga atggcagaca 8581 caacgcttta tcgtagtggt ttctgaacag tttagtgctc tgcttgttgg ttttccacaa 8641 gagatggata accagggggg gcagagtttg gactcagcag tgaatgtcag gttgactttt 8701 tgttcagaag cgatcgcctc ctttgttttg aacttaagag atttctttca acgagattct 8761 catacctacc aaaagcttga acaatattgt cacattgcta ctgctaatga tgtccagctt 8821 caaagtcaat tttcactgtt gttattaaag tattttttac caccattata tactgagaac 8881 acagaattat cacctctaac ttatcctcat gtttctgttt gtcaaccagt tgaacaagct 8941 ctgaaaaaac agattgctca ggagcaactg ttaaatcaag tgacaactca aatccgtaaa 9001 agttttgatt tgcctgtcat tatagcaacg gcagttgcac aagtacgaga gtttttacaa 9061 ttagataggc ttgtagttta caaatttgaa gcatcaagag tcaagagtaa agaagtaaca 9121 aatctctcat ctgtttcatc tgggcacttt tgtattcccc ctcagcctga ctatattcat 9181 tggacgggct ttacgccaaa cacaccgcag gtatacccca atacgggaac ttcaagccct 9241 ccatcatcca aatcctccca gcaagacttg tctcatcaga caggttgtat tatctacgaa 9301 gtctgtgcga gtgatgagat tccgtcagtt ctgaataaca aagaagaaaa ttgcttggcg 9361 cgaacaatcc aatgttggga aaagttcagc agaggcttaa ccttagcagt ggatgatgtg 9421 gaaaaaactt atgttctgca agagtgttta ttgaattttt taagagaagc tagagtccgc 9481 gcaaagctag cttctccaat tgtgtatgag gacaaacttt ggggactgtt gattgctcat 9541 cagtgcaatg cgccacgcca gtggactgaa agtgagaaaa atttgctcac ttctgtagcc 9601 gaacaacttg caattgcgat tcaccaaaca gagttaatgc gatcgctcac tcaagaaaaa 9661 caaaccctag aacaacgcgt tgttgagcgc acaatggcgt tacacgacgc ccttgttgct 9721 gctgaagccg ccagccgcct gaaaagtgaa 
tttctggcta ccgttagcca tgagttactg 9781 acacctttaa cttatgttat cggcatgtct tccactttat tgcgttggtc ttttggcgag 9841 ttgactcaac gccaaaggga ttacctacaa actatccacg atagtggaga acatttatta 9901 gaaatgatca atgacatcct tgacctctct caaattgagg cgggcaaggc agttttagat 9961 atcacagaat tttcattaac aagcatagca gaatctacag ttaacgcgct caaagaaaaa 10021 gcaaatactc agggagtcaa gctcaaactt gatctgcaac tcaacacgca gcgcgatctg 10081 ttcactgctg acgtcaggag agtgcaacaa attttgtgga atctgttaac taatgcgatc 10141 aaattcacac cagaagatgg tgaggtgatt ttacgtcttt gggtagaaga caaaactgcc 10201 atatttcaag tagaggatac tggcatagga attccagaag aacagttatc actgctgttt 10261 gagaaatttc accaacttga tacaccttac cgtcgtcgtt atggaggaac tggattaggt 10321 ttggctctaa cgaaacaact tatagaactt caccgggggc gaattgaagt agaatctact 10381 gtaggtgttg gctcaatttt taccgtttgg ataccagcac aaggttccag tagtgagtgt 10441 tgagttttga gtgcagggaa gcagagggga gcatcccaat gcgtgaaaga ggacgagtag 10501 tgggagcatc ttgctcccaa attttaacca aagcgggcta taagcccgca ctacgagaca 10561 ccacaaattt aaacatttgg atgctccgga agcagagaag ggggacagaa aaaaggctta 10621 tctgaactgt agtagaacat taagttggca ttttggcaac tctgccgatt ttgccaggag 10681 tatcgtaggt gttaatacgt tcttgcacaa aaaatgacac agtacaaggc tcaatccctc 10741 ctcttaacga ttagctactg ggtcttgtac tccacgacag catatgatga gtagtcatat 10801 tgtgtcgtgg ttttcgtatc ctctgggtga tagactagag gatttttttt ggaaccgtca 10861 atatattgac actctcctga ctagaagtca ggggattctt ggttcttcca caaatcttgc 10921 cccaacagga cgtatctagc gcttagtaga ggattcatgt ccccaatcgt aaattccggt 10981 atgccctacc gtattttgtt tgataacgat caatttttga cgtttttttt ggttttcgca 11041 ggttactcaa tcgttagatt ggtttacgca gcatttctac ttagagatgc tttttacgtt 11101 acctacttat tgttaggcag gataccactt caggctaaag cctgtcaagc tgccctgaat 11161 gcctgctcta aaggaacagg gttttcgagc tagcgtcctt ataaaactag gacttacgca 11221 ctttacaaat aggcgctttg tgtggaacag ccccagacat caaatctgta agcgatagaa 11281 gtgccatttt ttcccatata atcccctaag tgaataagtt gatgagtctg acacctataa 11341 aaaagaagat attgttggtt agctcttaat cgcaaacttc cgttcagtaa acaggtataa 11401 cttaacgcac 
ataatcagac tttcgtcaat gcgtaagtcc taaaaacatt cataaagtaa 11461 aaacaattac ttaatattta ttcacataat gccgaaaaaa atcacttttg ggttcctgct 11521 gctattcttg ggaacccaag taggtatcac cactaaacct gtgtcagcac aacaaacaca 11581 agttgtacca acagttgtac ccacagaagc taccacaaaa tcaatttgcc ctgcccaact 11641 gggaggatct atagatgctg tgattaatcg tcctgtattc aatcgggcgc gttggggcat 11701 attagtacaa cctctgtctt ctacccaaac tctttacagt cgggatgccc aaaaatactt 11761 taccccggct tccacgacga aactcctgac aacagctgct gcactacagc aattagggac 11821 aaactttcgc attcgcactt ctgtgtatgg cggtggtaac ggggttttat atgttgtggg 11881 tagaggagat cccagtttca ctgatgcaca attagcagtt ttggcaaagc agttgaaaca 11941 aagggggatt gggagagtca atcagttgat tgctgatgat acttatattc gaggagatat 12001 tgttcacccc tcttggcaat gggaagatat acagtcagat tatggcgcac ctattaatag 12061 cctaatttta aatcaaaatg tttttcatat aagacttctg ccgcaaagtg tgggacaacc 12121 tctcaaagtt atctggaatg atattagtga ggctgcacaa tggcaagtta ttaatcagtc 12181 ggtaacaaca gcagaaaacc aaccaacttc tatcaatgtg actcgcgact tgaaaggaaa 12241 aattttgcga attcaaggac agttggcgct caattctaaa ccggagttag tcagtttacc 12301 tgttgtttct ccggcggaat atttctcacg tcgttttcgt agtactttga tagcagagaa 12361 aattcctgtt ctacagacgt ttgtatcatc aactaatggc aagaatgaag aggaattagc 12421 gaatgtggaa tctcctccac tttcagaact tgtcgcgcaa acaaatatta acagtaataa 12481 tctttttgct gagtcgttgt tgagggcatt aggttttcaa aaaccaccta cagagaatca 12541 aacttctgct gatgctggtt tggaagtcat gaaagcaaca ctaactcaaa tgggagtcga 12601 ttcaacaggc tactctttag tagatggttc ggggttatct cgtaaagact tagtgagtcc 12661 agaggctttg gtacaaactt tgcaggcgat ggcaaaatca ccagcagcgt tggtatatcg 12721 agcatcttta cctgtcgcag gtagaagcgg gactctaaaa agtcgctttc aaaacacacc 12781 tgctgaaggt attgttcaag caaaaactgg tacgatgggc ggtgttgttt ctttggcagg 12841 atatattaat gtgccaaagt acgagccagt tgttttcagt attatggtaa atcaaagcga 12901 acagcctgca agagttgtgc ggcaagctat tgatgaaatt gtagtattgt tagctcaatt 12961 gcaaaattgt taaatatcat ttagggaaca gggaacaggg aacagggaac aggaaaatgt 13021 cctaacaatg gtggcgacag ctataaaaaa cgaggaacct caccccctgc ccctctcctt 
13081 agtaaggaga ggggtgcccg gagggcgggg tgaggttttt aaattactcg atgacatcgc 13141 caaagcgatc gcgatacttt ttcaacacat cctctaggtt ttgtgatcgc tgacgttctt 13201 tttctaattc ctgttgtgct tgttcagcgc gttgcttttc ctgttctagc tgctcgttaa 13261 cttcagcata agagaaaaac ttgctaccat cgggacgata gatttgcaac tcttcgccgg 13321 atgagtcaaa ccggataccc aagcggggac ttacccagtt tgctatttct ccaatcacat 13381 ctagtccatc ctctgctttc aaccagcctc tcaaaatatt cttgtctgga tcgtagatgt 13441 aatattcctg aactccgtag cgatcgtaaa ataaaagctt tttgtccatc tcagtagaac 13501 tgttactggg agagagaatt tcaaagacga cttgcggcgg aatgttgtct tcttcccatt 13561 gtttatatga agtgcgatcg ccctttgttc tgccaagcac taccatcacg tctggtgctg 13621 tgacaatcgt atttcgacct tttagtggat accataacaa gtccccagcc acaaagacat 13681 ttggatcatc agcgtacagc caatcaagat tttgctgaat ctgcacaatc caacggaact 13741 gcttggtatt atctgccatt ggcttgccgt cgctttctgg gtaaataatt tctgatggag 13801 ttgttgccag gggttgagaa accataacca agcttatgac tctggatgct tcaaagttag 13861 cacaatatgg cattagtaat gtttccatcc ccgttagggg cgatgtaagt ggaaagtcta 13921 ctgtaacgtc taccaattct gaatattggg cttgtttcca tccccgttag gggcgatgta 13981 agtggaaagg cttgaaaata atacttcatt gttttgaagt accagagttt ccatccccgt 14041 taggggcgat gtaagtggaa agagttcact tctggaagcc ttactgggag agggttttag 14101 ataccccaat cgacgtactc ctaaaaagga attaaacttt ctgcacaata aatacttttt 14161 agcgtctgaa aagcttactg ggtaagcaat cgacgcatct caacgaagtt acgcggtttt 14221 caaggatcgg aggagatgcg tcgatgaagt tcaacacact cttctaaaaa taaaatatat 14281 cacttatggt tgtcaagttt tttacataat aggcggtttt gtttcaagat ttttcaatca 14341 agtgcacagg gccatacaag ttgcttagag gcacagaaat atcagctgtt gaatcggatc 14401 acgttgagga ggattgtgtt ggcgatcgcc ataataagta ggtcaacata aaaaaacata 14461 aaatacgatt cttgcgattg ctgagtccca aggggacacg ctgcgcgaac gctccactgc 14521 gttttgctcg caatgacatt tcacgtttaa ttaggttgag ctacttaact atctttttaa 14581 gaatcccttc aaagagtgca aactataatt caagacttta cccgtattaa acaattgata 14641 tccagcgaat aagaaagcag atgatgccag gatagcgaaa agattgacgg atgcaaaaaa 14701 attgcctgtg aacagcatga ttgcaactcc caacccgatc 
aaaatccatc ctattttacg 14761 atttagatgg cgtcggctta aaacgataac tcctgcaaga cataacaata tactagggag 14821 acctataaaa aaaccaattc gagtgtttgt agatataatc aaaattccta aaaccatcag 14881 cactaagccg acaagtttac taggacgatc cacttgatat tgtttattgt ttccaatcat 14941 ctcaggctgt acaatctgag cctgaagaac tgtgggttga cgagtcagtc tggaataaga 15001 agtcgaagca aaattttgtt tttgttgaaa aatgaaagtt atcttttctt gttgacctag 15061 gtctatctta tctcctaaat taagaggata gcgctttttt ggctctagtt tgacactatt 15121 aagaaatgtg ccgttggagc ttcccgtatc aataatataa taaatgtttt cttctacctg 15181 aatttctgca tgaagccgag aaacaacatc ggcatttggc aaagctaaaa cattgatgtc 15241 cggtgcaatt tcctcgtttg gcttaccaat ccgaaaaatc ggaagatttg gtggcaattc 15301 aaaggaggtg ttactttgga gatgaaaaag ctctaaattt tgtcctgttg tgtgtgatgt 15361 gctgaggttt ggcataacct gcttttaagg agcaatgttt tgatttatct aaaatatctt 15421 tcttgtctga caacaaagat gcttcggtga tcatcttaaa cttgattgtc ttaattttaa 15481 tacttacaga ttcaaaaaat atttgccaca gataacccct ccccttgcta aggctacggt 15541 gtacacacat ctcacttaaa acctcaccct cgcttttagc ttcgctaaaa tctttccctc 15601 tccgaactcg cggagaggga tgcccgatag ggcagggtga ggtgaaaagc gcgggtgcgt 15661 ctggtacact ggcttttcac ctcaccctcg cttttagcta cgccaaaatc tttccctctc 15721 cttcataagg agagggatgc ccggtagggc agggtgaggt tgggaacgcg ggtgcgtctc 15781 gtaatacaaa aaatctctat ttcaaaacca accatcaagc aggtacttga atactctgac 15841 tacccgcaag tccaaaggct gtatgaatga cttgcaacgc cttcacgcct tcttcttgcg 15901 cgacaacaca gctaatctta atttccgaag tcgtaatcat ttggatatta atttggtgct 15961 gtgatagggc ttcaaacatt gtagccgcga tgcctggttg tccgaccatg cctgctccta 16021 caatactgac tttggcaata tcattatcca gcacgacttc accccatcca tattctgctg 16081 ctgattgttg gagcattttt tgtgctactt gtgcatctat tctggcagta gtgaaggcta 16141 tatctcgtgt cggaactcca ttcatcatat gacagcgttg ggactgaata atcatgtcta 16201 cgctgatatt gtgttctgcc aacaatccaa atatcttcgc cgctgttcct ggacgatcct 16261 gtacttgacg aatagctaga cgcgcttgct tcatatccag ggcgacacct ctgacgggag 16321 gatgatcaga tgagggagag gagaattttg aattgatttg ttgggttgtt tgtttgtctt 16381 cttcaatttc aaaggcggcg 
cgaagaaaag caacagcgcg atcgcaatct gcattatcaa 16441 ttacgcaact cactttcact tcacttgtag aaatcatctg gatattaaca ccagcttctg 16501 caagagtcgc gaacatcttc gctgccacac caggacgccc aatcatcccc gcacctgcaa 16561 tactaacttt agcaatattg ggcagtacca tcacttccgc ttctccattt tctgttgcct 16621 tttgagaacg cagaacaggg gcgatcgctt gtgctacggc ttctgccttt tttaatattg 16681 gtgccatcac ggtaaaggca atgtcattcg tattaccttc gtgaattgat tgaatgatca 16741 aatccacatc cacattctga cgtgctatct ccccaaataa ccgcgctgct acgcctggtt 16801 tatctggaac acgcaacatt gacactttcg cttgattggt atcaaattca atatcatcaa 16861 ctgaacgagc aatttccaaa ttcacaagtg ctcgcggacg tggcgctggt gaaatcaccc 16921 aagttccagg ttcatctgtc caactagagc gtaccactaa aggaacgcca tagttccggg 16981 caatttccac tgcacgagga tgcaagactt ttgcccccaa gcttgcgagt tccagcattt 17041 cattgcaggt gatttcatct atcagttgtg cttctggaac caaacggggg tctgttgtta 17101 aaatccctgg tacgtcagta taaatttcac aaaaatctgc gtgcaatgca gctgccaaag 17161 ccaccgctga ggtatctgaa ccgccacgtc ccaatgttgt aatttccaat tcctctgttc 17221 tggtaatccc ctgaaaccct gctacaacaa caactttgtt ttgattaaga tgtcgcttga 17281 cgcgttctgt ttcaatatgt aaaatccgcg ctcgcgtatg gtctgcttca gtaacaattc 17341 ctacttgcgc tcctgtcaaa gaaatagctg gatgtcccag ttcttgcaaa gccatactca 17401 ctaaagcaat ggatatctgc tcacctgtgg acagcaacat atccatttcc cggctgttgg 17461 gattttgtga gatttcgtat gctagcttaa cgagtccatc tgtggttttc cccatcgccg 17521 aaagtacgac aacaagagag tgtcctgctt ttacagtttt gcacacgcgc tgtgcaacag 17581 cttgaatacg ctctactgaa ccgacagatg taccaccata tttctgaact atgagcgcca 17641 taagttttat gtagtcaatt gtccctagga attctatctg ttaattatat ttctagtatt 17701 gtgtttaatt tcatagaatc ccagtaaact tgttgctatt agcaaagcca ataagtcgta 17761 aattgctctg taagctagcg ttgcacccaa aacgactaca ggagaaagtt tcgcagacag 17821 aaaatgcaaa atcacaatct caaacactcc taacccagca gggacattac ttactacacc 17881 tgcaaacata gcgagtgagt aggttctcaa cacatccaga aaagatacac tattcatagg 17941 aagcagcaaa taaagaactg ctgcagctat catccagtca atgctagaaa tgacgacttg 18001 agcaaaagct atcttgaaag aaggaaaacg aaactctttg cgacgaatga ctaacggtgt 18061 
tttaataaaa atacttccta ataaataaca aataactatt aataggaaaa taacacccag 18121 aggacgtgtg tctgtaaaaa ataaatgtat ttgggtaggg atttccaagg ggttgaagat 18181 aaatattacc ccagatgcgg caaaaactcc taaccaaaaa gtgaaattgg caaaagcaat 18241 aacttgggcg atcgccagtg gtgatactcc ccaattggaa taataacgat aacggatagc 18301 actgccagtt accaaagcaa aaccgatggt gttgccgagt acagagctaa taaacccagt 18361 gaaagcaatt tttcgcagag ccagaaaacg accaatatag ctaaagccaa gagcatcata 18421 cgacaccatc actagatagc ctatatttga caataaaatt gacaaactta agtacctttt 18481 aggaatcttt cctaaagagt tcaacacatc ttggtaacga tactcacgta gttcgtggct 18541 aattgcccat aaagaaagta ccaaaagcag caagccaaac aacgaactga agttgagtcg 18601 gagtttttga agcatagtac aaccagggaa ggggggagac tacagacgtg gagagaccag 18661 aagcagaaaa cgactaatga ctcgacattc gcaaaagttt cattatgttg acataaaatt 18721 gacacatcac tagcgtaagc ttttgactat aggattcaga caataaacga agttaattta 18781 ttgtcactaa attattggct ctatagaaat cccgtttgat ttggtgagac cagtgcttga 18841 tgagagtttc ccgacagagg catctggcgt tagcgcagcg gaaccggagg ttctccccga 18901 agggtaaaat tacttagcgt agggagggaa cgcttaacag ggaacgctta acagtgaaca 18961 attgatcact ggtaactggt aactggtaac tgataaaggt gtacctagtt tcacaaaaat 19021 ctgcacagga gttttatagt tgaccattaa ctaatgacta atgactaata accaatgact 19081 aatgactctt ttcaagatgc cttagctggt tcaacaattc atctgcgacg ggggataaat 19141 ttacaagttt gtcacaattc ggggagaact ccagctatag tttttctgca cggaggaacg 19201 ggaaaccgct ttaactttcg ttctcagtat gagtttgctc aaagtcaagg ttgggaagtt 19261 ctggtgtatg atttggcagg acacggacag tcgagtccct atcctcgcta ctccatcggg 19321 cgacatcgtc gggatttaca gcgactgttg tacaaattgg ggatttcctc accagtgctg 19381 tgttgtcata gctacggagt tcctattggt ttagagtttg cccagcacaa ttgtgtgagt 19441 ggttttattg cgatcgccgg cggaactcat aacttagcac cttggtggga aattccgctg 19501 atgaagttta tggcttgggg cggcagatat ttatactctt tacctggcgt acaagcaatc 19561 tcaaactttt tctcgacttc ttaccgacac agcgtcatag aacggttttt tgccgaatgt 19621 ccaacaccca cagattttca atcctacaaa gccctagaaa tcttttgggg ttatgacttc 19681 tttgcgcgtc accctttgcc gcagaatttg catatccctg ctttgattat 
aaccgggggt 19741 cttgacccca tgtttacaca ccaaatgggg aacgatttag caagacattt cgtcaacggt 19801 actcatttac attttgctaa tgcaggacat ttagttatgg cggagtctcc agagttaatc 19861 aacagtgcta tattgaaata tcttattggg attaatacac aaataccttc gtcttcacag 19921 acttcagctc tataaatttt gtcatttttt acagctttga ttcaattgct atcaatcaat 19981 attcttgatg aattgcttca aatataaaat tgtgctgctg ctttcggtgt tgactttctg 20041 tagctgcgct aacataaaat ctgcacaagc acaagtacag atgcgtccgc caactctaac 20101 ttctgtgtta ccgccacaac caagtgctag cgatttagct accctactaa catataaaac 20161 cgaaacttac gatagcaaag ttatgggagg aagtcgcatt tactgcgttg ttttacctcc 20221 tggctatgat caaaaccaaa atcaacacta tccagttatc tttctcctcc acggtggaaa 20281 cggaaatgct gaccattggt ttgccaaagg agatgctttg acagttctcc aacagcttta 20341 tgcaacaggc aagttgccac ccagtattat tatcacacca gatggtaatg acaaacgtgg 20401 ttctagccgc tactgggatc ccgattatat tgatggcccc aatggcaaag tttccacagc 20461 cattggggat gaactggtga aagttgtgca aaaccgctac cgtacactca ccagtcctga 20521 tttttgggca atgggaggat tatcttctgg aggttggggt gcaatgaatg tgggattgca 20581 taatttaaat aatttttcta tcttatttag tcatagcggt tactttcaag acaagagtgg 20641 accgcaaaat agcccaatca cctatatcaa gaccatttca cctcaagcta aaaaacgctt 20701 aaggatatac ctggatgcag gtatagaaga tactgaagtc ggacttgacg aatccaaaaa 20761 atttaatcaa atactcagta cagaaaaaat ttataatata tttcatgcgt ttcccggtgg 20821 tcacacttgg aactactggc acgaacatct ggcagattcg ctaacatttg tcggaagaca 20881 atttaaaata tcggcgatta tacatgctag tgataatttg ggttttaaaa agccatgaaa 20941 ttatctaaag tcctaattgg cgttgcaggt gcaattgcaa tcctgactgc tgcaggctac 21001 tggtatgtct ttatcttagg tgcgccgcag ctagattcag acccgcctca acaacagatc 21061 actactgggt tgaagtttca actagaaact tttaacagtc aagcaatggg cacacaacgg 21121 caatatggtg ttattttgcc cccagattat cacaagaatc ttttaaaacg ctatcctgtc 21181 atatttttat tacatggcgg tcatgatgat gcccgtgctt atgttgataa atatagagtc 21241 ttaaaagtac tacatgaact ttatagagat cacaaactac cgccttcgat tgtaattaca 21301 ccagatggaa atgatcagcg aggttcgagt cccattatcg accctgatta ttatgatggt 21361 cctaatggta aagtaggaac gttgatcggt 
tcggagttag tacaagtagt caagtcacgc 21421 taccgtacat tagaaaatcc aaagttttgg gcgttgggag gtctatcttc tggaggatgg 21481 ggcgcgttta atattggatt acgctatttg aaaaacttca atattctgtt cagccatagc 21541 ggctatttta ccgataacag tggtccacaa aatagccccc aacaaattgt ccaacaatta 21601 ccagttcagg ataggcagca actacgcgta tatcttgatg ctggagaaag tgactccaat 21661 ttacttgctt ctacccgaaa attccatgaa accttagaca aattaggtat agaaaatgtg 21721 ttttatgcct tccctggagg acatggtttg tcaggtccag atgttggctg gaattacttt 21781 cacaagcatc tcaaagattc gctatcttat gtgggaaaac agtttaaaga ataactcaga 21841 aacgccactt gctacaacgg ggggaacccc aacgccagat gcctctgtcg ggaaaccctc 21901 atcaagcact ggctccgcaa cgcagtggct cctcagaact cagaactcag aatctattag 21961 tggggatcac caattcccca ctaataaaag accacgcctc cgcttcaatt ttggtgttct 22021 tgacttgacc aactctactt cgcacacagc tgtacagaat tccttgttga ttcttcctac 22081 aagttccatc gacatattgc tgttggcgca gcgtgtccgc aggatatatt ctgacaactg 22141 tgttctgtgt tctttcttgt taaaaattca ctcaaaatta agtgaccaat gactcttgat 22201 ttcagaactc ggattgggct ttggagtgca gctttcctca caggtttagt cggagtggta 22261 aacttggtat cagccgtgac gcctaacctg catgaacgaa atcactggtt gaagcacttt 22321 ttaccctttg atattcgcgc tagcggtcat ttatttgcag cactgactgg gtttgttttg 22381 ctaacacttg caacgaattt gttgcgtcga aaacgaattg cttggttact cacgattgga 22441 ttattaatta tttctatctt cagccacttg attaagggac tggactatga agagagtttc 22501 ttgtctggtg ttttgctgat gcaattactc ttgatgcgcc atgtttttac agcaaaatca 22561 gaccgtcctt ctgttgcaca gggagtacgg gtgttaattg cagctttgct gtttagcttg 22621 gcatatggaa caattgggtt ttacttgtta gatggcaaat ttacggaaaa tttcagttgg 22681 agtgatgcca tagcccaaac ttttgctatg ttcttcacag ataataatga cggactgaaa 22741 ccaaaaagcc gatttggaga tttttttgct aattctattt atattattgc agcaagtacg 22801 atcgcttatg cgctgttcat gctattaaga ccagtttttc tacgtgaacc caccactgtt 22861 agagaacgtc agcaagcaag agatattgta gaaaagtatg gatgctcttc gttagcagca 22921 tttacattgt taagtgataa aagttacttt tttagtcctt ctggtcgtag tgtgattgct 22981 tatgtgccta agggacgggg tgcgatcgcc ctaggtgatc ccattggacc atttgaagat 23041 cgtaaagaag 
tgattgtgag tttccagcta ttttgccaac gtaacgactg gtatccagca 23101 ttttaccaaa ctttacccaa tgacattagc ctctacaagt cgctgggatt tcaggtactc 23161 aagattggtg aagaaggaat agttgatctg caaactttca ccttacaagg aaaagcaggt 23221 aaaaacctca gaacagcaat caatcgcatg actaagctgg ggtatgaagt gaaattttac 23281 gaaccaccga ttgctgatga attgttgcat caactcaaaa ctgttagtga cgaatggctg 23341 caattggtgc aaggttccga aaaaaagttt tctctaggtt ggtttgacga aacttatttg 23401 cgagagtgtg aaatcgtaac ggtgcagtct tcccacggtg agattattgc ttttaccaac 23461 attgtattag agtaccaact caacgaagtg actaacgaca tgatgcgaca ccgaaagtcg 23521 attgaaaatg gaacaatgga ctttttgttt ctttctatgt tccagcacta caaggaccga 23581 agttacgata gttttaatat tggtctttct gccctttctg gagtgggtaa aactcaagag 23641 tcgggtcgtt tggagaaagt tttgcactat ctttacaagc atttggagcg attttacaac 23701 ttccaaggct tgcacgcata caaagacaaa tttcaccctc gttgggaatc acgttatctg 23761 gttttcccca gtttaaccgc tttacccgac gttgttgttg cattggttcg tgctgattca 23821 ggcgatcggc ttttagatta ttttaaaggg tgaagatgtg agttgtgtgt taagaaaaac 23881 tcagttatca gaacgtagaa ctcagcttgg acatcccacc ccaaatcaat tcagaaacga 23941 atgtaattgc tccaactggt gatgctacag ttgcaaatat tgagaagagt actcctacaa 24001 taaaagtact ctgacaattt ttgagtgaat tgaagactgt tctggttgtc ataagtttta 24061 tcaagcttac tccaaaaaag ccgcgctttt tattaggact tacgcaaaaa ctcttttaaa 24121 ccctattaac ttcgtgttct ttgtgtcctt tgcggtttat tttttcatta ttttgcgtaa 24181 gtcctgttta tatcaacaag atattgctac atacagcgcg gctagtatta aaatcatcca 24241 gtactaggag tagagcaatg ataaactttt cgtgaagctc tcctaaaatt agataaaaga 24301 tcattagtag tgagcagctc tactgctaag aatctaaatc tagagcttgt atcccttggc 24361 gttctacaaa agttagcata cttcctaact gtaaccaaat cagcaaggca gttataaaag 24421 caactggtat accaactcca taggcaatta agggcggaaa gccaaatatt tcaaaaccgc 24481 aacagagaaa tacacaaaga ccaacagtaa ttcctataaa tggtacagat aactgtttaa 24541 cgggtagacg agacccagaa gtttctgcac cctcatcttg ccatttctgg acaatgactt 24601 tcagcgttcc ccataacgcg atatctgaaa ttaatgctgc aaacaatcca aaaacgaata 24661 gaaaataggg tggatgttca ggaatgtagt acatgaaaac tcctgtggat attgagtacg 
24721 gaaatatagc agttgccagt tatcagttac cagttaccag ttaccagtta tcagttacca 24781 gttaccagtt accagttacc agttaccagt tttcactgtt ccctgttccc tagttccctg 24841 ctatatgagt ttttcattat gaatattaat attttttcat aatttgtaac tcaaaattca 24901 tcattctttt agcgctttcg gtagaatagc tgctactccc tctctggaaa attcggtcaa 24961 aactccaatc gctacgttga ataagcgagt tggttttaag tcatctttga ctaaattcca 25021 caagcgttgg acttcttctg tatgaagacg atcatcttca gtcaaagcaa gcgctaactg 25081 tcgtcggaga aacttgcttt catcagaaag tagatattgc aatcccaatt gagctgtagg 25141 taatacgtca aagttagtat cactgcgggc gatcgcaatc aaattctcca atcgagacca 25201 ctggaatcga ccatctttaa aaagcagatt cagcaaacgt cgccgcagtt gtggagattc 25261 tcccgtcagc aagcgtcttg caacgtaggg gtatgcgact tcaagaattt tgaaattttg 25321 gttgaggctg agggcaattc cttcctgtgt taccagagaa cgaataatca aagcaaactt 25381 agcaggaact cggaaaggat attcatacat caattctgag aactgatcag tgatggtttt 25441 gaagttaaaa ttcccaacgt ttttaccgat cgcatctccc agaactgctt ctagtgctgg 25501 tactattggg tgaatatccg tgtctgaact cagaaaacct agcttgacaa agtctttggc 25561 taattcgttg tactctttat taaccagatg tactagtgca tctaccagac tttcttttgt 25621 ggtttcattt aactgatcca tcatcccgaa gtctatataa gccatacgac catcaggcat 25681 ggcaaataaa ttacctggat gtggatcagc atggaagaac ccatgttcta ggagttgttg 25741 caaaccagag gttacgccaa tttggataat taactctgga tctaaacctg cttcgcggat 25801 tttgtttgtt tcggtgagtt tgaagccgtt aatccactct agggttaaaa catgggtgtt 25861 ggtgtaacgc caatatatag ctggtatttt gacttggaga tcatcgcgga agttatttgc 25921 aaatttctcg gcattgcggg cttcgttgat gtaatctatc tcctcaaata acttggtacc 25981 aaactcgtct acaatcatcg tcagatcatg accgaggttt aagggtaacc aaggggctaa 26041 ccaaccagcc gcccatcgca tcagataaag gtcaagtgta agaattgggc gtaagttggg 26101 gcgctgtact ttcaccgcca cttcttcgcc gttcatcaaa cgaccgcgat aaacttgacc 26161 caaactagct gcagcgacgg gagatgacga taacgagcta aaaatttcgc taattggacg 26221 ttctagttcg gtttcaataa ttttgtaggc gagtgcacta tcaaaagctg gcaattggtc 26281 ttgcagcttg atgagttcgt ccagataatc tttacgtatc aagtctggtc gtgttgacaa 26341 agcttgacca actttaatga aggtgggacc aaggcgagtg 
agcaactgtc gcaattgagt 26401 tgctcgtttg aacttgtttt gttcctcttg attttgccat tggtccgact tgagactaag 26461 aataaagacg gcaaaaaacc aaataatctt tattgcccgc atccaagcga gccaaggacg 26521 gtagcggtag tagcgggcga tcgccattgg atcgtagcgt cgatgatttt gttgtgtaag 26581 ggtctgccag ttcactccct tatcttcctc ataaactgaa aaaggtttta ctttactaaa 26641 gccaaacaag gatacgcaag tcgccataca aatgctttgt aggcatttcc tttgttgttc 26701 cttgtcctga tcgctaatct ttacttaacg ttacttttta tcttttattt tatattaata 26761 cataaaagta tatatactta tatgagcgga gcgacaaact agtaaattca agtaaaaaac 26821 ataaaattat gtgtaatttg ttatttatta atttaagtat ataaaattct ataatcaccg 26881 aacttgaggc tgatttaaaa agattattgt ctttctccgg agaactatta tttgtcttga 26941 taagtataat ataagacttt ataaattttt gcgagagtga tgatgccgtc tatcccaaac 27001 aaagaaaaga gtaaacgact tccaaatcaa aactttatat tcttgtaagt ttttctttaa 27061 aaaaagataa gtttgcaaaa atgtgaacga ctttatgtac gatattacgg gcataattgg 27121 caagcacatt atctgatgaa ccatgccaag aattagcatt attacgggaa ataattgagg 27181 agtggctttt tcatgagcat tactcaacct ttgacaaggc gtaagcggat gtcttacaca 27241 gttgcatttg tcgctagtgt cattattgct ctacttatta gctgttttgc aacccctagt 27301 cctggccaaa cctctacacg tgacaaattt ctttggccat ttgcatcctc atcaccttgg 27361 aatatggcaa ttgggtcgag tgcttattat atccccgcaa acattggaaa agcgggctat 27421 gctgcagcag ataaagaata tttcttccaa ctcaacaata gtgacccgtg gcgtccagtt 27481 tatagtcctg gtgcttgggg tgaaggacgc tgtacaggca ccacatctat gggcacctgg 27541 ctgccaattc caaatgacct gatcattcca gatgccacga gcaacccata tagcaccccc 27601 aacaatgcat cagcctttct gatgccagat gggaaaacct taatacagct tgaacctctt 27661 gctcgctgta aaacaggagg cgacatctac ggttggcgtt atccaaatgt tgatatttac 27721 ggggatggaa tcggcggagc acattttggt tctggtcttt cgtcgattgg tgggtctatt 27781 cgcaaaggtg aactaaccag cgatcaacca attcgtcacg ctttaaaagt tgtcatttgg 27841 ggggaaaaat acctctacta ctctacgtct aatcctgggt atcgctggcc tgcagataga 27901 gcagatgcca atgcagccaa ccaataccac ggtaaaaacc cctctttggt acaggggaca 27961 ctactagcaa ttcctcccaa cgtaacagaa gcaaatctcg acttgcaaac acctgctgtt 28021 aaaaagttat ttcatgcctt 
gcaagattat ggtgcctacg tggttgacga cgcaggctgg 28081 gatgctcact actttgcagt agaggacgga gttacagaag agtttcgcaa cacctttggt 28141 tacgattttg agggatctaa tggttcattt tatgaagact ttatgaagct attccaagcg 28201 ctttacattg ttgataacaa tagctcaaat agcgtcggtg gtggtggaac tccccgtgta 28261 gctcttgctc caccgattgg gaactgatac ggaaataaat ccaacactta aaaactcggt 28321 ttcataatga agccgagttt tatcgcttct catagcacat aacataatat accatttact 28381 gacattcgtt acattaatct ctaatttgcg acagcctcgt tataaattat taatatccac 28441 aaacaactga cttactcagg cttggtttca aaagcaaaag tttaaaaagg ttaaaagatg 28501 ttgagttctt cagaaacacc tattgtcgcc actattgtgc tagtcgcttt cggaatttta 28561 ggttggggct tttatcgcgc cagatcttat ggcaagctgg gaatcctagc ctggttacag 28621 tcggtggtat tgatgactcc ttggttgctg ttttttggtt tgtttgcagc tgggatttac 28681 atcaatatag taggcgtatt gttcttgttg gtcgtatcca ccggagtgta catcttttta 28741 ggaagacagt tacgagcagg gggacaggat gcaatactta ggcaacgggc aactcaaagg 28801 ctagaagctg atgcccttga gcaagctagc cccacaaata attccaatct tccggaagga 28861 aatgcacagt taaaacctga agtcttagcg attcctgaag aagacttgaa catgattaaa 28921 ggtattttcg gccttgatac cttttttagc acagaaacta ttgcttacca ggaaggagct 28981 atttttaaag gaaatctgcg aggagatcca gaagaggttc ataaccgtct gtctgcgagt 29041 ttacaagaac gtttaggtga taaatatagc ctatttttag tggaaaatac agatggtaaa 29101 cctgtcctca ttgtgcttcc cagtcgcaat gacccccgtc cgatgacatt accgcagaaa 29161 gtttttgctg ttgtcttgct cgtgggaaca attgctacaa gcttggaaac tgcggggtta 29221 ctcctgaatt ttgatttctt tgccaaccca gaacgcttcc gagaagttct acccatcggt 29281 gctggtatct taacagtttt gatagctcac gaaatcggtc attggttact cgcccgtcgt 29341 catcagatcc gcctcagctt gccttacttt cttcccgcta tacaaattgg ttctttcggt 29401 gctattaccc gttttgaatc tttactaccc aatcgcaagg tactatttga tattgcctca 29461 gcaggaccag cggctggagg aattgtatct ttattaatgt tggtgggtgg attgctgctt 29521 tctcacaaag gtagtctatt tcaattgcca aatgagtttt tctctggttc aattctagtc 29581 ggaactttgg cgcgagttat tcttggttct gcattacagt cacctttggt ggatatacat 29641 ccacttgtcg tcattggttg gttagggttg gtcattactg ctttaaattt aatgccagca 29701 
ggtcaattag atggtggtcg tattgtccaa gcaatttacg gacgaaaaat tgcaggacgg 29761 gcaacaatag caactttaat tgtgttggcg ttagtgtctc tcgtgaatcc tttagctatg 29821 tattgggcaa ttgtgattgt gtttttacaa cgagatttag aacgtcccag cttgaatgaa 29881 attagcgaac ccgatgatgc acgcgctgct ttgggtcttt tggctttgtt cttgatggtt 29941 gcgactctgc ttcctctaac tcctggttta gctgggcgtt tgggaattgg cggatgatgc 30001 tagtggtgga ctacacagtc caccctccga cttttctact tgttaccgtt tccaaccagc 30061 cagcaggttt ggataagcaa ctgatgttgg gaaaattttc tggaaaccag cttgacgttg 30121 ttgtggtgtt tcaccactga cattccatag ggtttcgtag aaaaagaagg agactccagc 30181 aaagttgcgt tctcgaactt tttgtatttg tgtttggatt tgttgcatag atatgggttt 30241 ggcttttaac ccagtcataa tacctatgct cactggtata tgactttgtg ctgctttcac 30301 ttctggatac tccagttctt tgacaaagac atttagatca tcacggtata gttgtataac 30361 cagttcttca atcagtccgt tccgttccca gctttgccag tctgctaaaa aggttttgta 30421 ggaaaagtcc tgaggattag gcgctacaga gactagacaa tttttcttag tcgctttaat 30481 tgcctggaat acccgtttca tgtagtcagt tattttatta gctctccact ttacccattc 30541 tgcgtcttga ggatttgttg aaggagcttg accacgatgt tcttttttgt acaatgcaac 30601 tgtgtacgcg tcgtatccca attcactagg caagccgaaa tggtcatcaa actgaatacc 30661 gtcgatattg tagtttttaa cgatttcgac aactaaatct tgtataaatt gttgaacttc 30721 agggtgaaag ggatttagcc aaacccggtc gtgggtacct tctttccaaa ttttactact 30781 gtcactgcgg ctggtgagcc attggggacg atttttggca agttgtgagt cagcaggtgc 30841 cataaagcca aactcgaacc agggaatgac tgttaagcct ttttgatgcc caactgtgac 30901 aatttccttg agtatatctc gcccttgcag tccgggggct gggtcaagcg atcgcccaat 30961 aactttttgt gccactttgc tggggtacaa tgtccaaccc cagttccaaa cagtaggata 31021 tacagtgtta aagtttaggt tttttaggct ttgcaaagag ttttggaggc gatcgcgccc 31081 aaaaagcaca tcactatcta tattcgttaa ccacaccccc cgtaactcag atgtctgcgg 31141 acggtaggaa gtattttgag catatgaagg aaatgagagc agcattgccg ctaccatact 31201 taaagctacc aaaagtgcga aaaatgactg tttattaatt gccagaggtt cctgcgatag 31261 aagcctgttg caccacttca ccaatttttt catcatctac ttgttagaaa agttgaattt 31321 tgtatgagaa aactatcaac aatgacttag agcaatgggc agaagttgct 
atagttgctt 31381 atacatagca tgcccgctta gcgatactag ttacatatat tgtttatctt ttttcaagat 31441 tcccaatttc tgccaactgg tcgattctat gtgctaacgt atactcacaa tcgaagagca 31501 taaccgaata agcaggaact attgcgcgtg agcttgtgtt aattctcacc atttagatga 31561 gtggtagtgt gggaaggata tcaatcagta atctttctga gtttttctta tctaaatcac 31621 cactacaggt gactcagtag gatgaagtct tttcacccaa tcaaaatcta gtataaaaac 31681 aaaggaacaa tagacaatca taacagcgtc tattttcaca tctcctcaaa acagaggcgc 31741 tttaattatc gcaaatcctt cttattatga gggataataa acttatttgt tctgcgcaga 31801 gtttctagtt attagtttta gtttcttaac gcatactaat atctattgac taagatacaa 31861 acgttaaatt tactattctt actataattt tctttgccat attactatga aaaacattct 31921 aaaaaactat actactacga aacaagcaca aattgtcaca gcgttagttt tatcgggaat 31981 tttgtctatt ggtagtagtt tatcagcaat aaaaagtgct gaagctgccc ctacaaacta 32041 ttttccttcc acagccaacc aagttttaaa agaaaacatt aaaacaaaca gcttaccgcg 32101 tccagttgct tctgcgatac tacgagattt atctaaccga gaaccaactc atgtgagaaa 32161 aatagaaatc attgactaca ctcaacggac ttggcgtgac ggatgcatgg gtttacctca 32221 gccggatgaa ctttgcactc aagcattagt tcctggttgg cgagttgtcc tctcgaatgg 32281 tagtcaaaca tggatttatc ataccgatac caatgggcgt tttattcgct tagcaaatcc 32341 atatattctg gcagacaatg taccacaaaa ctttccaagt tatatagagg atgctgtttt 32401 gcaagccgca tcccaacgct taggtttacc aacctctcgg gtaactatca ttcaagctga 32461 acagcggact tggaacaatg gctgtttgaa cttaccaaac tcgggtgaag cttgtaccga 32521 ggctttagaa aagggttggc gagtcgttgt caaatcacct gagcaaactt tggtctatca 32581 caccaataca acaggttcca aaatcaggtt taataaaaag gaaagcgaat ttagtgaagg 32641 caagttaccc gcaacagtta gagatgctgt gttgcgtcga gcaagtgaag agtcaggttt 32701 acctgaaaag tcgctgagtg ttgctgcatc tcaaccgact cgatggaatg agtgcgatct 32761 tcagagtaat gccaatcctt gcgattccgc cgtttctggt tggcaagtca cagtagctgc 32821 tgggctaaat cgctgggtat ttctcactga tgagcgtggt tctcgaattc agctttcaag 32881 acagtatagt caaacaccca atgtcaactt acccagagat attgccgaga gagttttggt 32941 acgagcatct aagcgtttaa aagcacctat ttcgcaatta gggataattg aggtacaacc 33001 aaaacaatgg cccgatagtt gtttgggtct 
agcagatgcg ttaacttcat gcgctgctgt 33061 cattgtacca ggttgggagg tcattgtgag cgatggacaa caacgcttgg tttatcgtgt 33121 tggcgaatca ggtgctgttt ttctggatga aaaggctagt cccattgctg atgataataa 33181 ctcgctcaag ccaatttcta tccccataag tgagttacca caacctttag atagcggcgt 33241 catttttcga caaatctcca gtggtggctt tactgggaga acatacgaaa ctgttttgct 33301 taacgacgga cgtctgattc gtgttcggat tggtgatatt aatgattctg aacgtagcgt 33361 tcgccgcatt cctctaaaac aagtggaaaa atttcagcaa ttgcttgaac gccagggcga 33421 tgaattcaag aatctaagtt atccagcacc caacggcgct gctgattaca tcacatacac 33481 tctgaccaac cgctatggta cggttaaata caacgatatt tctcaaaaaa gcttacctga 33541 agatttacgg ctcatagtca aggcttggaa ccggataagc agtcgcgatc agtaggggac 33601 agagaatagg gaatgaaagc accggaaaaa tttgttacct gtaaagtgta ggttgggttt 33661 cattccattc aacccaacct actagcttca cttaattttg aattttgaat tttgaatttt 33721 gaattggtat gatttggcgt tgtattgaga agaaatgcgt gatgtctcct ctttcgttat 33781 gaatgggagc gcaaaacaat tccatgttat attcagtatc atccttgcgg tagtttatta 33841 actcaccgtg aaagaattgt ccctgcgaca ggtttcgacg taatctatat aagactgagc 33901 gttctgtttt aggtccttgc agcatacgcg gtgttttgcc aatcatctct tgggaggtgt 33961 agcctgtgat tttcgtacaa gctgggttga caaagacaat tttaggacct ggttcatcaa 34021 ggtttgcttc agtgacaatg attgattcgg tagcatattg cacagtatta ctaagtagcc 34081 aaggtgccgg atctacttgc ttacgctcag ttatagggcg tattagccac cccaaggtat 34141 tgagtttgcc tccccaagag tcggcaatgg acactgttaa ggttgcgtct atggaatcac 34201 cgttacgtag acaaagacgc acttcccact cctgggtttg gtggtgcagg tgcagcttga 34261 gaagtttgct aagaaaggct cgccgctctt gccagcagat aaatatgacc agtggttttc 34321 ccactaggaa atactgctga acacccagca gcatggcagc agccttgtta gcgtcttgga 34381 tgattccttt accatcagtc accaagtatg cgtttgggat gaactcgaac aaattccggt 34441 agcgctggcg ttccatttgc actacctggt gggcgcaagc aagctcccag ttgatctggt 34501 gtaactcctc gaaagcgact tcaagcttgg ataaggtagc gtaaagttcc tcatagccta 34561 aaactagtaa ctctttttcc tggaaaaatg attcatttgc tctgtggcaa agttcagctg 34621 cacgctgacg cgctgcaatc atctgtcggt aaaaatcgtt caagttcacg aatcgcctcc 34681 ctcctgctgc 
aacattaact tctttaaatt tcagacaact tttattaccg attcttgacg 34741 ctttgctcaa tcggttgctt ccgtcactag gtttaaaagg cttgaagcaa tctcgttcac 34801 tgggaggaca aaatctacag tcccggtagc aatggcagcg tcgggcatat cgaaaaactc 34861 gctagtacta tcatcagagg taataacctt accgcccatc tgatgaatcg cctgcacccc 34921 caaagcgcca tcgttgcctg taccactgag gactacggct attgcccggt gtttgaaact 34981 agctgcaact gactccaaca gcaagtcggc agaagggcgg acaaaatcca cgagtaccgc 35041 ttgcgaaagg caaacagtac cgttcggggt gacgaagagg tgttcgtttg ggggagcaaa 35101 gtaaatcgtt cccgctcgta atcgctctcc ttcttgggca agcttgagtg gtaaagttat 35161 ggagtcggtt agggcgatcg ccatcaagct ggggtcggat tgggtgtcta agtgctgcac 35221 tacaatgatg gctgccggaa agttcactgg taaggcaact aggatctggc ttaaggcgct 35281 tatcccaccg tcacctgccg ccagcaccac cactttgtcg gcagttagct cttgttccgt 35341 tcccttatta ggtacttggt taattgcggt tccagtagcc ccaacaggca attgacccat 35401 gaacagtgct tgtcggatga agtcagagcg ttgttgcgcc tcctgtgcct gtgcttcaaa 35461 aaggtttgct gacttggtgc ggttcttaga gcgcgcatca gtagccatat tagacatcag 35521 ctccgctcgc tcttctaaag aacgaattgc cgcccacaac gcctcctctt ggacctgagc 35581 ttgagccgcc tgcaaactag ctgctgagaa ggcgtgacct acgcggcagc ggaactggag 35641 caaatttctt tcctgcaatt ggaacagaac accgccgcag tctggacagc ccaagcccgc 35701 gcttatccca ggcagtccct tgttccgtaa acctgcccca tccacttcca caatgtcggg 35761 ttccatttcc agttcgtcct cgttagacat attcaaagct ccttggtcgg gtattggttc 35821 acaagctagg tgtactaaaa ggggagctat tgacaccact ggcaggatat aatcaacgtc 35881 tacgtgttcg atggcgttgt tgggcatccc ggaaaacagg gcatcatccg ggttctgaac 35941 gacagccaca ccaccaagct tttttatgtc tataagcccc gcagtaccgt cgtcgagcgt 36001 gcccgatagc accacaccga ccactcggct tttgtacgcc tttgctgccg tgcgaaacag 36061 gggatctacc gccggacgag agcggttctc cttgggacct tgcactacgc gtatgtagcc 36121 gcgtttgacc agcaggtgat agtcaggcgg tgccacgtag atatgcccgt gctctatagc 36181 ctcagcgtct tttgcatgag tcgccctcaa ggaaccacaa cggctgagga tatccggcag 36241 aaagctttta ctttgggcga aaatgtgaac gacgatgaag attgcagccc ttagatcttt 36301 tggcaaaggg gcgaccagtt ccttcagcgc ttccacgccg cctgctgagg ctccaataac 
36361 aataatgtcg tgaccgggca atttgtctct ccttaatata ttgactcaaa aatgatgagg 36421 aatgattgac atctcccctt tctcgcggct cggggattct aaattgatgc gaagcggcag 36481 ggttaacccg taccgctagc cccttttagg ggcgctgcgc aaacgctttt tttagctgtc 36541 acccagactt taatccgggt gggacgtgtc aaagcgcccc tactcactcg gctagtatct 36601 atatccagcg ttgtgtctac tttgcgtata atattggcgg ctccgttgca atctgcgttg 36661 attgttccat accacagaga tttgtacaga ccgcgttttg tgcgtcgccc ggatggcttc 36721 cagcttgtgg gtttttcacc aaaggttggc agaaagtcat ggttaagaaa agacgccttt 36781 gaggtatacg attcctcggt ttcaacaaaa tttatgccgt gtctaaggca taattgccta 36841 atacgttcct ttagtctgta ggtcgggatt tggacaaatg cttgattgtt tctcctcccc 36901 atattggctt cttgtcgctg cccttcattc caaccaaaca caacattacc taccttcatc 36961 ctcaagcagt aatcaacgac aatgcgcgct gctttattga ttgcgtcacg catttgtcta 37021 ttgcgctttt cggtaatagc agctaacttc tttgaccaga agccttgggg ctttccttct 37081 ttgagagtgc tgatttgttt gttgtaccag cgattaagag acttgacgtg acgaccatca 37141 atgatgaagc cttcacccga ggatgagcta tcaacacatg tcagccaatt attaattccg 37201 gggtcgattc ccaaagccca tttctcattt aagcgtggtc gtttagcccg tgactgctca 37261 tacacgaatt cggcgtagaa ttcacggtta cgtggcagaa ttcgtagctc tttgattgat 37321 gcaaaatcga ggtttgacgg agcggggata gtgaaagaat ctatgccaaa ccacgtatta 37381 acggtttgtc ctaaaggaat ccgtatctga ttatcgatta gctttaaagc ttgtttgggg 37441 taagtgcatg aagccaatcc atttttgcgg tagttaggaa gtctcggctt ttgattgaga 37501 tcccctttgc gcgctttctt gtctaactcg tagaaagatt taaaactttc ggcaactgtt 37561 cttaaaactt gttgagcaca ttgggaatga agagactggt aattgcgatt ggttttgtac 37621 tctttttcta gatcgtattt acctaggtat ctgcgctctt tgaaccaaac ttgacgacca 37681 taatatattc cgcagttcgt caactttgct gcttgagtac aaagatactc aaggacgggg 37741 tgaacgagta acgaacgatt taacaatact tgttgacagc cataatcttt ttgcaataca 37801 atcacctttt tcgtatgcaa ttagtataca atgcttctgt tagtaattcc ggaaagattg 37861 acggcaggct aaaccctgct agccccgaag ggggcgctac gcaaacgtgg ctttcatccc 37921 ttgcctaaag tacaaagatg cctttctaag tcaggcatct tttcaagggc tttcatctca 37981 ccccttgtaa aggtggacaa gaattaggtt acagttctca 
accccgatca gatttgtctg 38041 gctgctttca ccaaaataac tgcgactgtt tcacctaact tctagctaaa gaatggactg 38101 agtgtaaagt ttcaaagtgt atctcaatac aaaaagctcc tatgtcggct ttgcgcttac 38161 aaggatggct ggctgaaaac tcttgaggaa cagcaacgta gttgcgtatt ccgtactatg 38221 gttatggcac gctgcgcgaa caggcaaggg atgaaagcca ggaaagtgcc tttaggctac 38281 cgtcttcaga agcagaatta agttatgata aacaacagta accgtatcaa gttactgtat 38341 tctttgccgt tgtcaatttc aactaaggcg atagaaaaaa ctttagtttc tatcagcttc 38401 cgtgaaattg ctgataattt gaggaattag gcagatggca ttatctacaa ctttgaattc 38461 cacattaggt cacatccata ttataattcc tcagcactac catcgtcaac cgattgtctc 38521 gcgattgatt tctcgttacg atttaatcat caatattgca tcggctcttt tagaatctca 38581 tgcaaaagat gacggtttat ttaaccttga aattcaaggg gtttctcagc agatagaagc 38641 aagtcttagc tatcttcaag aactaaatgt agagattgtg gagttagact tcaaaagcat 38701 tgtccaagaa aatcaagata aattccagat tttatgtaca agtcacaatt tcagtgacat 38761 cattgacggc aatgaaaaaa aagctgactc ccatgtagta aaaggacaaa ctagtcgtgc 38821 taaatttcaa gtttgtattc ctaaaaatta tcagtcttat ccagtgattg cagggcttgt 38881 ttattgttat ggattaactg ttaatatttc cggagcagta ttagatacca acccggaaaa 38941 tgacggttgg tttgatttag aggtttgggg tagacgtcag cagattgtgt tgggcttaag 39001 atatttaaaa gaattgggct tacaaatttg gttgtaatta ttgacactat ttaaactgcg 39061 tatcagaatt tttcacaacc tattggagag ataataacga tgactcatgc ttgctgtggt 39121 cccggctacg cttctcctga agcagcaaca aaagcagaac gtgaaaaggt actatatacg 39181 attgcaatct acacaggttc aagtattgtt gaaccagatt atcttgcaac cgtagacgtt 39241 gatcccaact ctcccacata cgcccaagtg attcaccgcc tgccaatgcc ttatgttggc 39301 gatgaactcc accatttcgg ctggaatgct tgtagttctt gtcacggcga tgctagtaaa 39361 tctcggcgtt ttatggtgat tcctggtcaa cgttccagca gaattcacat tgtagataca 39421 gcggatatca aagcaccgaa acttcacaaa gttattgaac cggaggaaat caaagaaaaa 39481 accaatttga cagcccctca taccgtacat tgcctagctg atagtcatgt catgatttcc 39541 atgttgggtg acagcgaagg gaatggtcca ggtggctttt tgttgctgga tgaaaatttt 39601 gacattgctg gacgttggga acgcaaagcc gacggtatgc gtttcaacta cgacttttgg 39661 tatcagccgc gtcacaatat 
catggtgagt agtgagtggg gtgcacccaa aactttctat 39721 cctggctttg accttaatga tgtggctgct gggaattatg gtcaccaact gcatttttgg 39781 gattggtcaa aacacgaaat tatccaaagt tttgacttag gtgaagaagg actcatcccc 39841 ttagaagtgc gctttcacca taaccctgat agcactcacg ggtatgttgg tgctgcactc 39901 agtagtaacg tttggcattg gcataagtcg aacggtcatt ggcaagttga gaaagtgatt 39961 gatgtaccat ccgtggaagt ggaaggctgg cccattcctg taccatcatt gattaccgat 40021 atcctgattt cgattgatga tcgctacatc tatttctcca actggctgca tggtgatatc 40081 cgccaatacg atatcagtga cccctcccat cctaaactga caggtcaagt ttggcttggt 40141 ggtttgttgg gcaaaagtag cgaaattcaa agtcataaac tgactggtgg accgcagatg 40201 ctacaactga gtcttgatgg caaacggctt tatgtcacta attctctgtt tagcacttgg 40261 gacaatcagt tttaccctga cttagccaaa gctggctcat atctattaca aattgattgc 40321 gacacagaaa acgggggact gaaaatcaac gagaatttct acgttgactt tggtaaggaa 40381 ccagctggtc catctcgcgc tcatgagatg cgctatcctg gtggtgattc tacatctgat 40441 atttgggtat aaaagagcat atggcaggca agatgcctgc cacacgaaat atgtattaaa 40501 atatacaaat ttgtacttgt gaatcacgat gcttctagca aactacctcg gtcttcctag 40561 cgtagtcata tataacacaa cttcatccgc aggaatacca agtaattcat tgactttatc 40621 atcaaagaaa ccgccaatac cactgacgcc taaaccgagg cgaatagcag ctagattcag 40681 cttttgtcct aaatgaccag catccatgtg cagataacgg taaacgcgat cgccatactg 40741 acccactgct gattttaaat cggctgtatg aaaaacgatt gctgcggcat ctcgtcctaa 40801 ctcttgtcct aaacagagga aatgtaactc ttttctgaag tttttaaaac gaatttgtcg 40861 taattcttgt gctttgggcg cataataata acagcctgcc tcaagacctt ctaccccaga 40921 aacagcaata aatgtttcta ttaaattcag gtcaaagtaa tcagtcgaat gatccaaacc 40981 ttggtcaaca taatgttgag gttggtaaga aaaatcgagt aaagctttta gttcatcaaa 41041 tgttaaatca tcaccactgt aagcacgggt agaacgccgt ttgagaatgg tcacttccag 41101 tccttctagt ttttctcccc agtggataga ttggctgaca gtagatattt tttcacagaa 41161 agggaagtta tatttatcct ctaaagattt ttcttgtttc acttcacctg aagtgagttt 41221 atccgttgtc cctggtagaa tttgcgtgca gtcatgaaaa tatttcagga gtttaccatc 41281 gggaatatga ggatagtttg tttccgtcgc agagggtaag gctgtttgtc ctgtgggtat 41341 
gttttgcttg atatccagta agtctgctag ggggagaacg gcgatcgcac cttcgtgtaa 41401 agaatcgata tacagcaatt cattcaccgc ctcatctaca aagccaccaa tcaggtgagg 41461 gcgatactga ctcatagcgc cagctaactc gacattacct aataggtgtc ctgtatccaa 41521 acaaattctt cggtaagctc tatcttcata tcgccaagct gaacgataga aaactgctgt 41581 aataataatt gctaactggg tcttttgcaa agtcggatgc cataaacagg cttcttgcag 41641 cttttgccaa acatcacttt cccaataagc aatcagagaa tgagttcgac actggtaatt 41701 gtacactccc ggcggcaaca acacagtgcc acgagaaaca acatacatct ccgcaggata 41761 caagccaccc gcactgggtg cagccctcaa atataccgta tttcccatag aaggcatttt 41821 tgctgtgagt ccgtagctgc aaaacagtag ccgcgatagt ctttgccacc actgcttatc 41881 tggttcatca acaaaagctt ctgctttctc ttggatatag ggcttgaggt caaaagtaga 41941 gccaattttg tactctttga acggcactgg ttgcttagac cagtttaacc ccttattctt 42001 tgaggcgata gtctgcgggt cgtacttagt tcgttcgtgg tagtgctgag cgattgattg 42061 acgtagttct ggcataagag cttaaatatg cttctgctac atatctttgc attctatcgg 42121 atcatcattt agtcttaatt agtacaaata atgagcaaac agaaggttat cagagaataa 42181 acgagtaagg agacaagctg aagaggttgt ttaatttttg atcaaaaatg taagattcca 42241 aattttaatt ttcaaaaatg ttaatttttt atttacaaaa cttctggaag ttatggagaa 42301 gtcaaaaaga agaattcctg agtcagaacg ccagtcgcct gccctgtagc cagtgctgaa 42361 cgcgagtgct cctttttgca aactctgaca ccctaatcaa atctatcaga atgacccgat 42421 tctggtatta cttgactcaa gccactcata atctacccat agatggatga acatattgca 42481 agtatggctg aacaaaaaac taatttcttg cattgttaca gtttgtgtaa ccataatttt 42541 cttttacctc ttgtgccaga ggtaaaaata tagagttttt tttagcgtct tagtgcgaag 42601 ttatcatcca attcataacg ctgaaactaa agcgagaaac atggaaacaa aacagctagg 42661 caaaacaggt gtctccgtaa gtgcgattgg tttaggggct atgccaatgt caataagcaa 42721 tcgtccccca gagtcacagt caattgatgt cattcatcgt gccctggatt tgggtatcac 42781 gttcattgac acagccgact cttactgcaa agatgaaagc gacaagcacc acaatgagcg 42841 attgattcac caagctttag aatcctacaa gggcgatgtg agtcatgttg ttgtagcaac 42901 caaaggcggt ttaatgcgtc ctaatggaaa ttggacacgc aatggtaatc cccagcattt 42961 acgcgaaacc atccgcatca gctttgaggc tttgggcggt aaaaaaccga 
ttgatttgtg 43021 gcaatatcac gcaccagacc cagattatac catagaggag tccctcgcac cagccaaaga 43081 agctgtggat gcgggaatga ttcggtttgt gggagtttcc aacttttctg tagaacaaat 43141 aaaacgggca cgggatgtgg ttgatattgt ctcggtgcaa aatcaataca acccttggca 43201 acgtcagcca gaatttgatg gcgtattgga gtattgtcag catgagagtt tgacatttct 43261 accttggagt ccttatggcg gtagtcgtcg ccatgatggt ttggaagata ttggggcgat 43321 cgccaaactt gccaaagaaa aaggcgtatc agtctataat attgtcctgg cttggttgcg 43381 tgctaaatcg cctgcgatat tgcctattcc tggcgcaagc aaaacttcca gcattgaaga 43441 cacagtacat gctgttgatg tgaaattatc agatgacgaa gtgcaaagaa tagatcgcga 43501 aatctaaaaa cattaccttg ctttgcttag tcatagtaac ccaatacaaa atattttcag 43561 gacaaaatct gggtatttta cctgaatcta aaatacaaaa ttgtatcata tcagctatat 43621 aaaataagct gatatttttt tgttcaagtt cacaaatatt attatctttt ctgactgcca 43681 aaatcttata taatctgctt gacaacgttt tacctcaaga aaaaatcttt aataaaagct 43741 attgtattac tcaattgttc ttaactctgt gtgctgtatt atacgtaaaa aaatactttg 43801 ataaaccgca tgagttcgct atacatgctt gcatcaagag tgaagaatat tttactgata 43861 aaaaatttaa cttaaatcta attaaaaaac tatgttccct tttaaagcaa tgagtaaagc 43921 aacgagaggc aacattaatt tccttaattt ccttaaatac ttttacaaaa ttttcactcg 43981 tccgacatca aaaatgtggg tttgggtaat tgcttttttt agtatcactg cttttgacgg 44041 ctggttaatc aaactatctt cttttgtttc tgtgcctaca gtcatttatt ttgctatttt 44101 tcccttagct tctattattt ctaggcgtct ttccaagact acttggagat tggtactttt 44161 tgtagttttt tcacgttcta ttttagctat tagtttatgt attttacatt tgccaaagct 44221 ttcaccgttc acaataatta tgtattgtat aaacatttta ttattcgctt ttctattgct 44281 tgatgtaata agttctcgtg gaattgtagc tccagcttta acagttattt tggtaatctt 44341 gctgtggatg ataatcggtt ggcaagtggt tatccgttta gttttgtatt gcaatttagt 44401 agtgattttc aacaaagccg ccacaaggtt aaacaaatat ttaaatagtt tcaatgctgc 44461 tatggctgtt ttgactggaa ccgcagcctt tggattagca ctaggatgga tgctggctag 44521 cttatgaatt aggagttttg gtcaagtttt gcggtaagat tataaacttg caataaatca 44581 ggacgcttgc tgactggaga agttattatg gcccggatgt actacgacga agacggtaat 44641 ttagaccttt tggcacaaaa aaccatcgcc 
atcattggct atggttctca aggtcacgcc 44701 cacgccctta atcttaaaga tagtggtctg aatgtgattg tcggactata tccgggtagt 44761 aagtcagcgg cgaaagctga agctgctggc ttaaccgtga aaaatgttgc tgatgctgtt 44821 aaagccgctg actttattat gattttgtta ccagatgagg tgcaaaaaac gatttataaa 44881 aacgaaattg aaccgaattt ggaagaagga aatgttttgt tatttgccca cggcttcaat 44941 attcactttg gtcaagtcgt cccacctgct aacgtggatg tggtgatggt tgcacccaaa 45001 ggaccaggac acttagtacg ccggacttat gaacaagggg aaggtgtacc ttgtttgttt 45061 gcggtgtttc aggatgcgag tggtcaggca cgcgatcgcg caatggcata tgctaaaggt 45121 atcggtggta cccgcgctgg tattctcgaa acaactttcc gcgaggaaac cgaaaccgat 45181 ttgtttggtg aacaagcagt tctgtgcggt ggtttgagtg ctttaattaa ggctggtttt 45241 gaaactttgg tggaagctgg ttatcaaccg gagttagctt attttgaatg tctccatgaa 45301 gtcaaattga ttgttgactt agttgtggaa ggcggtttag caaaaatgcg cgacagcatt 45361 tctaatactg cggaatatgg cgattatact cgtggacctc ggattgtgaa tgaacaaacc 45421 aaggcagaaa tgcgcaaaat tctccaagaa attcaatctg gtcaatttgc acgggaattt 45481 gttttagaaa accagtctgg taaacctgga tttactgcga tgcgtcgtca agaggctgaa 45541 catcgtgttg aggaggtggg taaagattta cgcgctatgt tcagttggct gaagaagtaa 45601 ttaaccgcag atgcactagc tttggacgta gattattatc ttttttagac ttattcccag 45661 gttgaacctg ggaatgtctt tttaaactga aattccttca caagatggta aatgctgcaa 45721 aatgtatttg tcgaatctat tcgtggcaaa ataataaatg attttctagt atgctctgcg 45781 ggaaaattga tagtacaaga aaatagcgat catcgcaccc aacactgcca cgaacaaacc 45841 aggaagagtc aaagttgcag aagttattgt caaagtccca gtttggatta cgttaactaa 45901 tgttccacca atgaaagcac caacaattcc caggactata gtcgctaaga ttccaccacc 45961 ttgataacca ggatagattg ctttagcaat agcaccagct agaataccta aaacgaccca 46021 agcaataaga ctcattgttt tgtttccttt tgtgtttctt actaacactt actatttaaa 46081 ctgatgcaat gtttctattc atcattcatc agaaacagtt tgaacttatc ttcagtaaga 46141 aaacaagagt gatgcaagtc acactttact aagacttagt taattctcat ccgtccaaag 46201 ttataatcct gttgggttag ggagcatccg aaatgtgtaa aatgcccctc acactggaag 46261 tgactggctc acaaacgaag tccggctatt tcctaagccc tatggaaggc ctgcgccgtt 46321 gcaagagcgt 
cttttggagg aaatacaagc acgcactcgc gttcgcggac taattttgag 46381 ccactgtgca aataccgcca atctgaggaa cgtctcatca agccatgagc atctaaacga 46441 agatggaaca gacggaagca ttgcttgagg gtgggaaaat tctagaatga atcaatgaat 46501 tgaagaacaa agcgatgcag agtatcaagt tacgttctcg tgttgggcaa gatggtatct 46561 tgcatctaga aattccagta ggtattgcag acagagaaat agaagttatg gtgatttatc 46621 aaccactaga accatcaaca cagcagaaaa caccagaaga attgggctgg actcccggtt 46681 tttttgagca gacagctggt tgtctgcaag atgatccgtt ggtgcgatac ccccagggtg 46741 aatatgaaca acgggagccg ctagagtgat ttatttgtta gataccaatg cttgcattgt 46801 ttatctaaat cgccctgtgt ctggtgtgcg gcgacgatta caatcactat cgccacaaga 46861 tattgctgtg tgttcagttg tgaaagctga actattttat ggtgcgatga aaagcaagaa 46921 tcctacgcgc actctggcat tacaagaagc ttttctaaat aattttgtat ccttgccttt 46981 tgatgatacc gccgccagaa tctacagtag aattcgtgct gatttggcgg ctttaggcac 47041 tcctattggt ccttatgatt tgcagattgc agctattgct ttggcaaata atttaacatt 47101 agtaacccac aatactggcg aatttagtcg ggtggagggg ctacagattt cggattggga 47161 ggaagagggc taggaactgc gctggactca cctcggcgac tacccacggc gctgcctcgc 47221 ctttgtctga ggttatatca tgtccggata aacacttcta aaatacgaag aaactcaccc 47281 ctcacgccag gtgcaccctc ggaggaaccc caacgccagg tccgtctgtc gggaaaccct 47341 cctgtaggac tggctccgca acgcactggc tctccttaat aaggagaggg gcggggggtg 47401 aggtctttgt tattgtaagt aactaagcga acttgatatt acttatttac ccaattaact 47461 cgtaaacacc tgcactggta agcgcttgaa ggcaagccgc tcattctcga tatccacaaa 47521 gatggtatcc ccatcattaa attcaccacg caagatagct ttggcaatct gagtttctaa 47581 ctcgcgttga attgctcgct tcagcggacg cgctccaaat acaggatcat atcctacttc 47641 tgccaaaaag tcaagagcag catcggaaag tttgagagac atcttgcgtt ggcgaagccc 47701 accggaggta tcgctcaatc tttgctgcag tctagcaact tgcaactgca caattcggcg 47761 caattcttgt ttctgcaaac tgtggaagat gataatttcg tcgatacggt tgaggaactc 47821 tggacggaag ctatttcgca tcgcttccat cacccgatgg cgcatttcat cgtaacgtga 47881 gtcatctcct gacacatcta agatatactg tgaaccgatg ttgctggtca tgatgatgac 47941 ggtgttcttg aagtccactg tatgtccttg agcatcagtg actcgaccat catcgagaat 
48001 ttgcaacaag atgttaaaga catctgggtg tgctttttcg atttcgtcga acaaaatcac 48061 cgcgtaggga cgacgacgaa ttgcttcggt aagttgtccg ccttcatcgt aacccacgta 48121 tcctggaggc gcaccgataa gtctggaaac ggcgtgtttc tccatgtact cggacatatc 48181 aatccgcacc agcgcttctt cggtgtcgaa catataagac gccaaggctt tagcgagttc 48241 agttttacca acacctgttg gaccaaggaa aacaaagctt gcgatcggac gattcggatc 48301 agcaagtcca gcgcgagaac gttgaatagc atctgctaca gcactgactg cttcttcttg 48361 tccaaccacg cggtgatgca gttcatcttc taagtgcagc agtctttctt tctctgattc 48421 caccagcttg ctgatgggaa tccctgtcca cttagaaatg acttccgcaa tgtcagcttc 48481 agtgacttcc tcacgtaata aagattttcc actgcgttgt gcttgagcaa gttcagtttc 48541 tactgcttcc aaatcgcgat gcaaactggt taacttgccg tattttaact cagccgcgcg 48601 gttgaggtcg tagtcgcgtt ctgcttgttg aacttccaag ttgacgcggt caatttcttc 48661 tttaacagac tgaattttcg tgataatgtc tttttcagac tgccattgag tattgagtgt 48721 tctttggtct tctttgagat ctgcaagttc tctttctatt ctttcgagac gttctctgga 48781 cgctgcattg ctttcttttt gcagtgacag cttctccatt tctaattgca gaatcttgcg 48841 gtcgatttcg tcgagttctt cgggtttgga ggtaatctcc attttcaagc gtgctgcagc 48901 ttcgtctact aagtcaatgg ctttgtctgg taagaagcga tcgctaatat accgactcga 48961 caatgtcgca gctgcaacta aagaactatc agaaattttc acaccgtggt gaacttcata 49021 ccgttctttc aaaccgcgca aaatagaaat ggtatcttcg acgctgggtt gatcgacata 49081 aacctgctgg aagcgtcttt ccaaagcagc gtctttttca atatatttac ggtactcatc 49141 aagagttgtc gcaccaatac agcgcaattc accccgcgcc aacatcggtt ttaacaagtt 49201 acccgcgtcc attgcgcctt gagtcgcacc agcaccgaca acggtatgaa tttcatcaat 49261 aaataaaact atattaccgt tagactcggt cacttctttt aagactgctt tgagacgttc 49321 ttcaaattcg cctcggaatt ttgctcccgc aatcaaagca cccatatcta aacctatgag 49381 cttgcggtct ttgagggact ggggtacatc accagctatg atacgttgtg cgagtccttc 49441 ggcgatcgca gttttaccaa cccctggttc accaatcaac acagggttat ttttggtacg 49501 acgcgacaga atttgaatcg tgcggcgaat ttcatcatca cgtccaatca ctgggtcaag 49561 tttaccttga cgggcagctt ctgtcaggtc acgcccatat ttttccagtg attggtattt 49621 accttctgga ttttggtcgg tcactttttg gctcccacga 
atttgtttaa taatattttt 49681 aagcttactt tcgtctaacc caaattcttg gtacagtttc ttgccgaaac ggtcatcttt 49741 agcgtaaccc agcagcaagt gttcaataga aatatattca tcttgcaact ccttacgata 49801 cccgtcggcg cggtcaagaa gagtatccaa gctgcgtcct aagtagacag aactactgct 49861 accagagact ttgggctgag tttgaataaa ttcttcgctt ttgtcgcgca gtttttgcag 49921 attcacaccc gctttggtaa aaacaccgtt ggctagtcct tcttgatcca gcagtgcttt 49981 cattaggtgt tcgctttcaa tctgctgttg ttgatattgt ttagctatat ctggggtgtg 50041 ggcgatcgct tcccaggctt tttctgtaaa ttggtttgga ttagttggtt gcataggctt 50101 aatcgacggg cgaatgactc taggcgcacg caaggtgcag atagaacaag tgtatagttt 50161 gttcttatat cgccattgta aaaacaggag agcaaatcac tggatcggtg ttcacgtctt 50221 tcttcaggag tattaccgtc caatgaatcc tgattgactc caagcaccca ataaatgcaa 50281 acagaacaaa aataagtttt gtttgtatac cgtgattgta caaacagggg cgcaaataac 50341 tggatcggtg ttcgcgtttt tgtagaggag tattaccgcc cattttttac tcattgatta 50401 gctgtttggg gcgtgtcatc ttataaaaag cttatcatac aagagtttaa gtcaaataac 50461 ttttgtgttg caaaaagtga tgttacttta gtaaaagtgt catctgtgtc aactgcctat 50521 agctagaatt actcataaca acattccacc tcacaggaac tggtgtcctc aaagagtttt 50581 tgcaatacaa ccggtattat cggctcatgg ctccaaaaac ctttaccagc agcaatcaaa 50641 atgctcaaat gataaagaaa aaccgatttc ctggatggcg cacaatcttc tttggcgttc 50701 gcacacgcat cctcatctgg tatgttgcgc tgatggcatt ttccacatta gtatctgtct 50761 tagcaatccg tgagattctg cttgtccgac tcgaacagga gattgaaaag tatcttaccc 50821 aggaagtaaa cgagtttaga cgattaactc agggaaaaaa cccaagcacg gctcaacctt 50881 tcggggatga tgttgcagcc atttttgatg tattcttgtc ccgcaacatt ccccatgaaa 50941 actcgttttt gattacgctg ttgagtggga aattctacaa atctagtccg caagctctgc 51001 caattggttt acagcccaat tcagccctta ttaaagattg gcaaaaactc aaacagctca 51061 agcaaggtaa ggaattcatc tcaaatcata ctatctacta tatggctgag ccgatactca 51121 aaggcaaaac tcagggtgtt tttgtggtca cttactctag tagtagcgct catcagcagg 51181 taaaccaggc agttgttgtc attatccagg tgacaatagt tgttctagcc atagcatcgg 51241 tgctggcttg ggtagttgct ggacggttat tggctcccct aagcttactc atcgaaacag 51301 cccacttgat taccgaatct 
gacttatctc gacgtatccc cgtgcaggga gtcgatcaaa 51361 ttgccgaact gagcatcacc ttcaatgaga tgttagatcg tctccagact gcttttgcca 51421 gtcaacgaaa cttcatcaac gatgcaagtc atgaactgcg gacaccaatc acaattatcc 51481 gaggtcattt ggaactactg ggtgacgatc cccaagagcg acgtgaaaca gtggaattag 51541 tgactgatga gcttgatcgc atgagtcgct ttgtcgatga cttactatta cttgctaagg 51601 cagagcaacc aaatttttta aacctacaaa cggtggatat tagtttatta attggggaac 51661 tatatactaa agccacagct ttagctcagc gagactggcg tttggaaaac aagggtgtag 51721 ggctgattgt ggctgaccgc cagcgtctga ctcaggctat catgaacttg gctcagaatg 51781 ctacgcagta caccagtgat ggagatgtca ttgcccttgg ttcagaagtt ttgaacggtt 51841 atgcttattt ctgggtgcgt gacacaggcg ttggcattgc tcctaccgat caagagcgaa 51901 ttttcgagcg ctttgcccgt ggctcccata gttatcgtcg ttctgagggg gctggtttgg 51961 gattgtctat cgtacgagcg atcgcaacag cacatggcgg tcgagtagaa cttaaaagca 52021 aactcggtaa aggttctaca ttcactctta tcattccact agatccaccc tttggagatt 52081 ctgtatgact gatcggattc taattgctga agatgaaccg cgcatagcta ctttcataga 52141 aaaagggtta cgagcgtcgg ggttttcttc cgcaattgcc aaagatggtc acgaagcctt 52201 gagcatggct caaactgggg attttaacct cctgctcctt gacattggtc tgcctggcaa 52261 agatggttgg atggtattgg aagaattacg cggtcagggc gaacagatct ctatcataat 52321 cctctccgct cgtgacgagg taagtgataa agttgccgga ctagaaggtg gcgctgacga 52381 ctacatcaca aaaccctttc ggtttgaaga attgctggca cgggtacggg cgcggttacg 52441 cgacaatcgt ttggttagac gacaggaaga aacgattctt aaaacaggca agattgtgct 52501 aaacctgctg acacgtcaga tttgggttgg ggatcatcta ctcaagttat cagcacgaga 52561 atttatcctg gctgaaacct ttgtgcgtca tccagggcaa gtcatgagcc gagagcaatt 52621 gctaagtcgg gtttggggct atgactacga tcccggttct aatgttgttg atgtctgcgt 52681 tggttccctg cgcaaaaagc taggtcacga ctatatcgaa acagtcagag gtatgggcta 52741 tcggctgcga acgtgaaaat tttctcatca attattcata tacaattcat aagtatttat 52801 aaatataaag aagaactgta acaatcaggg tctgttgatt gtcgcctatt gtcatttttt 52861 caacttcata ccaagttgtg ttcaaatgtg taacttagtg tcaagaaagc agggtgagta 52921 ctatttttga cgacaactta gtatcagttg gtgagcaatg agatgccatt ttgaacgtca 52981 
ggagagtagc tacatactgg ctcaggtagt aggcgcaaaa atttacgttc tgtttgctga 53041 gttcgtgaag atgctggttg aacgatattt ttggaaaaat atattgtgat tactttaact 53101 tttgttgcag tgaaaatact tataaagatt acgagaaaag caagatgaca atcagagatt 53161 tcaaagcaaa acgctccatt gtcaacgact tagtggcgtc cgtagttgtg tttctcgtcg 53221 ctctgccact gtgcatggga attgcgatcg cttctggggt tcctccagag ttgggaatca 53281 tcacaggaat cgttggcgga attattgttg gtactgttgc tggttcaccc ttgcaggtca 53341 gtggaccagc ggctgggctt gctgttatag tctgggaact ggttcaacag tatggcattg 53401 aaatgctagg accaatccta atgctggcgg gtttgttcca gttactagca ggaattttca 53461 agctgggaca ggttttccga gcaatatctc ccgcagtcat ctacgggatg ctagcaggaa 53521 ttggcgtgct tatctttgct tctcaattcc atgtgatgtt tgatagcaag ccaagcgcac 53581 acggaatcga taacttaatt tctataccta gccagatata taagactatc ttttccgccc 53641 aaggcaacaa ccatctcatt gctgggattg tagctttaat tactattatt actctgattc 53701 tctgggaaaa gtttaagcca aaaagattaa aattgctgcc cggttccttg attgctgtcg 53761 tgattgcaac tgcgatcgcc actgtgatga agttgcctat tcagtacgtc aatgtacccg 53821 ataatcttat cggcacgatt cacctgccaa agttagagaa cttcatcggt ctactcaagc 53881 catctgtgct tatggaggcg atggcgatcg cctttatcgc cagtgccgaa agcctactct 53941 cagcagcagc ggtagatcga cttcactttg gaccaaaaac gaattttgac cgcgaactcg 54001 cagcccaagg ctttggcaac atggtttgcg gcgcattagg ggcgctacca atgacaggtg 54061 tgattgtccg cagttctgta aacgtggaag caggaggcaa aaccagactg tctgcgattt 54121 ttcacggagt ctggctgttg gctcttgttg tcgccgcgcc ttctgtactg aatctgattc 54181 ccacctcctg cctcgcagcg attttggtgg tcacaggcta taagctggtg aaagttgaga 54241 atatccgcaa gctgcaacaa tacgggcgca tccctgtatt catctacttt gccaccttag 54301 gcggaattct cacagctgat ctactttttg gcgtgctact cggtcttgtg ttgtcagcac 54361 tcaagctgat ttacaaagtt tcccatctct ctatccatgt cctttcagat gaaaacaacc 54421 agcgcgttga tgtgtatttg gatggtatgg ctacatttat tcggctacct tatcttgcca 54481 aagttcttga gcaaatttct ccaggcaagg aaattcatat tcacctagag atgctgagct 54541 atattgacca ttcttgtctt gactttttgt cgatgtggga aaaacaagag gaaaagaagg 54601 gaagtactgt cgtcatgcag cgggataggt tggtagagcg ctaccgcaaa 
ccgcttattt 54661 ctgggcgatc gcacctagca gcgtagaagt tttcactact caactttcac aaaagaggag 54721 ttcactatga atacgataga tgcagtaatc ttaacattac tgaacactat tgcttgtttt 54781 gctttcccca aacttctgtc tgtcattatg gctcctaaaa agaaacgtac tgcaccaatg 54841 cctactgcca tcagagcaca aaccgaagca taccaaagtt cttgagcaga ttcctcctgg 54901 aaattcatat tcactttctt ctcgttcttc tcgttcccag gctcagcctg ggaatgctat 54961 atcaagaggc tcagcctccc atattcgaca cctttgaaaa tttatcgcct caaatctttc 55021 acctcaccta agtacgctca ggttttgcac aattctcaca ccttgtattc tcgtccgcct 55081 gggattgaaa tcccagtctc ataggcaaag tcatctaaaa gatgactgcg tagtacgctc 55141 ctaagaaaca catttttaaa cccatatata gcaaagcttt tagagattga aggaaaatgt 55201 gtttctttca tatttcactt gcggtacacc gctagttgga ggctcagatc ttgcacaatt 55261 ctcacgtctt gcactctcgt ccgcctggga ttgaaatccc agtctcatag gcaaagtcat 55321 ctaaaagatg actgcnnnnn nnnnnccgcc tgggattgaa atcccagtct cataggcaaa 55381 gtcatctaaa agatgactgc atgagctttt cagtctactt tagtagactt gggctgtgag 55441 ccttggaatt gattccaagg cggttgaaat cagctaatct ttttctcgtt cccagactcc 55501 ggctgggaat gcccatactg aggctcagcc tcacctcaag gagggactac gaaacttgac 55561 tgcgatcaaa caaccaccca tccccaacta tctcctctag gcgttgagtg agtcgggtaa 55621 gaaaatattc cggatctttc aacctttgct ggacaaacca agtgtcaaac gcttcctgcg 55681 tctcaattcc ttgggcaacc gccgcatctg atatcatcct gacactcaat ccctggagtt 55741 cctctaactg cggatgccct ttgcctcggc agttgcaaaa gaatagtgct gttgccccta 55801 gcaaggattg cagttcatca cctcgcggta ctgcgttata caaattatga attagctgaa 55861 ttgtttttgc caagggggcg tgaatcctca ccgtcagcca catagcttga gccaggtaaa 55921 ccaaaccgtt gctttgagcc gtcacgccta aattacctag aactgtcacc aaatcgccat 55981 aagcacaaac ctgtgcaagt gccaaagcgg cttgcagatt taagtctaac tccctagctt 56041 tatctcctgt ttcacctgca acataagcca tatttgctag ggtagtagct ttgccatcga 56101 catcgccaat ctgctcatat atttccaagg attgttgcca aagtgcgatc gccctctcga 56161 tgtctccttg ttgggcgatg actcctgcca tgttgttcag ggtggtagct ttgccaccga 56221 catcgccaat ccgctctttt atttccaagg attgctccca cagtgtgatc gctttgggga 56281 tgtctccctg ttgggcaatt acctgtgcca 
tgttgttgag ggtagcggct ttaccatcga 56341 catcgccaat ccgctcaatt atttccaagg attgtttcca tagtgcaagc gccctaggga 56401 tattcccttg ttgggcgatg aaccatgcca tgttattgag ggttgcagct ttgccaccga 56461 catctccaat ccgctctttt atttccaagg attgttccca tagtgcaagc gccctgggga 56521 tattcccttg ttgggctata aaccatgcca tgttattgag ggtagcggct ttaccatcga 56581 catcgccaat gcgctcaatt atttccaagg attgttccca tagtgcaagc gccctggaga 56641 tattcccttg ttgggcgatg aaccatgcca tgttattgag ggttgcagct ttgccaccga 56701 catcnnnnnn nnnngttggg cgatgaacca tgccatgtta ttgagggttg cagctttgcc 56761 accgacatcg ccaatacgct ctactatttc caaggattgt tcccatagtg cgatcgccct 56821 ctcgatgtct ccttgttggg cgatgactcc tgccatgttg cacagggttg cagctttgcc 56881 accgacatcg ccaatgcgct ctactatttc caacgattgt tgccaaagtg cgatcgccct 56941 ctcaatgtct ccttgtcggg cgatgactcc tgccatgttg ttcagggttg cggcttttct 57001 ggtatcgtca tcttcagggc aaaggtctaa agcttgctga taatgggcaa cagcatcctc 57061 aactctgccc aaaacctctt ctacacgggc gatggttccc aagatgcggt agtccagaaa 57121 cttttgcagt aattgatcac aaagttggaa agtctccaca aaccgggaac tgttcgccca 57181 gtggcttaca atcctatccc caacactaac agcaatttcc tcctcccctg ccagcagtcc 57241 taacctcact atctccagtc cttgctcttc cgtgtgtttc tcagacttct cccaccacac 57301 tagataaatt ttccttgctg cttgctgtcg cgttgtcagc cactcttcct cactcaatat 57361 cggttccagc aagaattcta aaatcgttgt cacacgatat tcagcagttt gggtggcgtg 57421 agtcgttgca gattcaacta agctgaggct tactaatttc tctaaacata acctctgctc 57481 atctcttgct cccctctcct tactaaggag aggggctggg ggtgaggttg tgctaaggag 57541 agggg // LOCUS NODE_330_length_57098_cov_5.17327357098 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 57098) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 57098) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..57098 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 533..1027 /locus_tag="DP116_01515" CDS 533..1027 
/locus_tag="DP116_01515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196913.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_01515" /translation="MKYQQVLVGLVLAIVLFLFPLSAEGVGSSSIRRSTDDAFNGKDF SGQSLIGSEYINVKLKNVNFSNADLRGGVFNGSILEGVNLHGVDFTEGIAYLVAFEGG DFSDAIFTNAMMLRSTFDDVDITGADFTDAVLDRLEVKKLCAKASGVNSKTGVSTRES LQCR" gene complement(1524..1865) /locus_tag="DP116_01520" CDS complement(1524..1865) /locus_tag="DP116_01520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877667.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_01520" /translation="MDRLEQYRQIIRQLLTSHANLESGNSDNNVECQLVFDTEHDHYQ ILDVGWSGLKRVYNCFIHLDIKDSKVWIQRNMTEANLAQELVEMGIPKEDIILGLHPP YKRPYTGYGVA" gene complement(1853..2269) /locus_tag="DP116_01525" CDS complement(1853..2269) /locus_tag="DP116_01525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877666.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty-acid oxidation protein subunit alpha" /protein_id="PRJNA477356:DP116_01525" /translation="MAAKDLFHNAVKQALLKDQWIITADPLTIKIEKVKFEIDLAAEK VLAAQKAGRKIAVEIKSFLNPSAITDFHGALGQFLNYRLALQMSEPNRILYLAVPVDT FESFFQEPFTQEAVKVYQVKLIVYEPLQQVIIKWTD" gene complement(2292..3467) /locus_tag="DP116_01530" CDS complement(2292..3467) /locus_tag="DP116_01530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317720.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_01530" /translation="MIAFIQQLQRRIPLGWLQLSHEKSRLLVALSGIAFADVLMFMQL GFQTALYDSNTRLNRVLQTDIVLVSPKGRNMQNLSTFSRRRLYQASSISGVKSAEALY VSFITWKNPQTRRETSIQLLGFNPEQPAFGLPEVNQQLDKIKLPDTFLFDKGSRGQYQ EAIAQIEQGKTVTTEAENHTITINGLFKLGTSFGAEGNLITSDQNFLRLFPGRQAASI NLGLVYLEPGYDPQQVATALRAYLPNDVKVLTHKEFIQFEENYWQTESPIGFIFGLGV SMGFLVGIIIVYQVLSTDVNAHLKEYATLKAMGYHNLYFLGIIFEEALILAFLGFLPG TIVPLGLYSLTRTATNLPIYMTLTRALVVLMLTIIMCVISGAIATRKLQAADPADMF" gene complement(3464..4771) /locus_tag="DP116_01535" CDS complement(3464..4771) /locus_tag="DP116_01535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317721.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_01535" /translation="MVRDATASGSSFFKHSRRPLTILAAVITFVVAGVTTYRFWQLQS HKSNEIQKSHQTISKIIKVVALGRLEPQGEVIKISAPTSSQESRVGQLLVKEGQEVKT GQVIAILDSKDRLQAAVVKAQQQVKVKQANLAKVQAGAKQGEIEAQKATVERLKAQWE GERIAQEEAIARLKAQSQGDKIAQQATVEKLQAQLNNAQAEYQRNHQLYSNGAISKSS FDSKRLSVDTATQQLKEARAILTRIDDTSSRQISEAQAVLTRINTTLSQQISEAKATL QKIAEVRPVDVEAAKAEVDDAIAQVNQAQKDLQQAYVRTPQNAQVLEIHTRAGEVVSS DGIVEIGQTSQMYAVAEVYQSDISKIRPGQDVRIISDSLPGELQGKVERIGLQVRRQS VINTDPSTNIDARIVEVHIRLDKISSQKAAQFTNLQVKVVISL" gene 4980..5834 /locus_tag="DP116_01540" CDS 4980..5834 /locus_tag="DP116_01540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_01540" /translation="MRKTQANNLDRKLSEEKVDAILAGAMQEFLAHGYAATTMDRVTA AAGVSKTTVYSHFQDKEGLFTALIQQLILEKYYASFNPQKAQLMEGEASIILRHLAFS MLNNIIGDQQVLGLMRLIVGESGRFPELARAFVLNLEKPFLEDLCQFLMSRPELNLPD PEVAARVLVGTLVHFILIEEILHGNDILPIERERLINNLISLLTANQTQKDLPADQYS GTRQKSFRRNRKNSGKFERDYDSEPKRLRSIRLTDTAWEKLAQVAAKHELTRSEMIEI IARDGELT" gene 7969..8472 /locus_tag="DP116_01545" CDS 7969..8472 /locus_tag="DP116_01545" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01545" /translation="MFHRHIVSACLLVSGFAVSVQTAYAQTPPVPSASARVATSYNNP LYSNLPITVFIETVSSNNSPVNSEYTCVGVAPGAIFSQLTCGVPGGSQLTGGSNGSVA TVGPIIVGKNDATICWAGKFLFAPNKYAYQSGCSTPKQIVIDTQAPANPAAPPVLPNL PLPGLSQ" gene 8903..9220 /locus_tag="DP116_01550" CDS 8903..9220 /locus_tag="DP116_01550" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01550" /translation="MLHRKISPLMLLSPFSVPAATASPPEKGNPQSVEMPESTPSLTD LVLPNKDKGITKTEVELLITQAIRDHEFRTTVHAIFMIVVVYTAGFFCGFLLFLSQWT PHH" gene complement(9389..9811) /locus_tag="DP116_01555" CDS complement(9389..9811) /locus_tag="DP116_01555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01555" /translation="MAYWVKIIYERNEYIINLKQITAFCHERNGRITFWLPDSASSIV INRQSNQEDYQKILDYVEHITGLEFEKSFWVKILYERNEYIINLDSISCFRHEPNNGR ITFWLPGSTISVVINPVSNSDSYQKVLEYIQNTTGQTL" gene complement(9983..11620) /locus_tag="DP116_01560" CDS complement(9983..11620) /locus_tag="DP116_01560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877171.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CTP synthase" /protein_id="PRJNA477356:DP116_01560" /translation="MTKFVFVTGGVVSSIGKGIVAASLGRLLKSRDYSVSILKLDPYI NVDPGTMSPFQHGEVFVTQDGAETDLDLGHYERFTDTSMSRLNSVTTGSIYQAVINKE RRGDYNGGTVQVIPHITNEIKERIIRVAKDTNPDVVITEIGGTVGDIESLPFLEAIRQ FRKDVGRINVVYMHVTLMPWIASAGEMKTKPTQHSVKELRSIGIQPDILICRCDRPLP TSLKHKLSEFCDVPVECVIPSQDAKSIYEVPLNLEREGLAQQTLELLNMEQHQPNLAQ WQTLVEKLYSPTHRVEIAIVGKYVQLSDAYLSVVEALRHAAIAMSSELHLRWVNSEQL ETEAAETYLEGVDGIVVPGGFGVRGVDGKIAAIRYARKNEIPFLGLCLGMQCSVIEWG RDVAGLQDANSAEFDPHTKNPVINLLPEQQDVIDLGGTMRLGLYPCRLLPNSLAFKLY QEEVIYERHRHRYEFNNSYRNIFVESGYLISGTSPDGRLVEIIEIANHPFFIACQFHP EFQSSPSAPHPLFKGFIEATIARYYPSSQVQTPLEVF" gene 11823..13751 /locus_tag="DP116_01565" CDS 11823..13751 /locus_tag="DP116_01565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317737.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetylmuramoyl-L-alanine amidase" /protein_id="PRJNA477356:DP116_01565" /translation="MRSLLGLILLSSIVTPSIALAQEQSLKVVFPKTNYQTSAQKIFF LGTAPSSGEVLINNQPVTRSKSGHFAPSFPLQLGENLFTVRYQNQEIQIRVTRLSTQP EVPQGLGFAKDSLTPRVDIAKLPGEQICFSALAPRATPSGSIAPGTASSGSIAAPNVS VSVKLGNQTVSLLPQPQQAQLPANSAALTGQNQPSTQSSAGKYQGCTTVPQFNPQLYG NNIVSGAVAENSARNIDLGKPEFQLTLNGKTITQQGTGKVTILSPEQLEVVEVIADSG VARTGPSTDYSRLTPLPKGTRAAVTGREGEWLRLDYGAWINTKETRPLLGAVPQQSII RSVGYRRLPSMTEMVFPLQVPVPVSVQQGDQTFTLTLHNTTAQTDIIRLDDDPLISRL DWQQLPPYVPGGQPGVQYTFNLKKAQQWGYKLRYDNTSLVLSLRHSPFISSTNTRETR GMFAIQKPLSGIKILLDPGHGGKESGASGPTGYLEKDVNLVVSKLVRDELVKRGAKVV MTREDDKEVSLPDRVAIIDKEEPAIAISIHYNSLPDEGDAEKIKGMAAFWYHPQAHSF AVFMQKYVVSKLGRPSYGVFWDNLALTRPASTPSVLLELGFMSNPNEFEWVTNAQEQK KLAKVIGDGIVEWFRSAR" gene 14351..16183 /locus_tag="DP116_01570" CDS 14351..16183 /locus_tag="DP116_01570" /inference="COORDINATES: protein motif:HMM:PF00059.19" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01570" /translation="MVQTLSVSGTTNYTAAATPQIVAANLTISDPDSNTLNGVSVIIN SNFKTDQDRLGIAGQNGTNGTINNLNWNYDTTRGVLSITGTASNEAYQNALRQVTYSN ISGNPTTAVRSIEFSLGTTLGSAENNRFYEFVSAPNITWTDARSAAANREYFGLKGYL ATITSAAEENFIQGKVQGNGWIGGSDAETEGDWRWVTGPETGTQFWSGGPNGTSVQGR YNNWAPGEPNDLNNNEDYAHIIGNSAIGQAVQGKWNDLPNAVQSGNYVSGGYFIEYGG LQGDPTLQLTGSVSVNVTGNASANARTSKFDFTGDGKPDILWRNSRTDETALWKMNGT TLEESISLPKTFSNAWEIKGQGDFTGDGKVDILWRNSRTGENSIWRMNGTTLDQATLT TSVPDLAWEIKGVSDFTGDGKQDILWRNNRTGENSIWEMDGTALKQSTLLPSADTAWE IKGLADFTGDGKDEILWRNKGTGENAIWQLDGTTLKQSTPLTAYAGDASWDIVGEADF TGDGKVDILWRNYRTGDNAILPMDGTNPQQVISLTPVPDTNWKVEGLADFTNDGKVDI LWRNSSTDETAISRINGSNLEEPLALPKTGSPSWEISFPTSYPV" gene complement(16537..17082) /locus_tag="DP116_01575" CDS complement(16537..17082) /locus_tag="DP116_01575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868936.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2085 domain-containing protein" /protein_id="PRJNA477356:DP116_01575" /translation="MLPGLVFKKSSSTQHFQIRWVSAVADFLLAGMVVGPLTAPFLAA SGLPMLPVIANIIYFMGVHVCPQPEMGVALSPPYIMAVCMRCYGTVTGLLITRLLYGF TGGKGFYWLKQYGWSGAALASVLMMAYPLELAAEVLGWWSFNNYVVTPFGLMTGLAWG LFTIPILHEWRRTPDDEKFGA" gene complement(17175..19286) /locus_tag="DP116_01580" CDS complement(17175..19286) /locus_tag="DP116_01580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877165.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR00300 family protein" /protein_id="PRJNA477356:DP116_01580" /translation="MTSQIRFLICPPHYYDVDYVINPWMEGNIHKSSQDRAVEQWDKL HHILKENAIVDIVPPEKGWPDMVFTANAGLVLGKTVVLSRFLHKERQGEEPYFKQWFE DNGYTVHELPKDLPFEGAGDALLDREGRWLWAGYGFRTELDSHPYLAKWLDIEVLSLR LIDERFYHLDTCFCPLANGYLLYYPPAFDSYSNRLIEMRVPQQKRIAITEADAVNFAC NAVNVDSIVVMNKASEPLKARLAEVGFQVIETPLTEFLKAGGAAKCLTLRVTEPVREE LHANVSVESRVFNLEGHLLDSGLINRALDLIVDNGGSFQVLNFSLGEQRQSTSAAEVK VSAPSHEVMESIISHLIDLGAVDLPQDERDAKLEPVIQAGVAPDDFYVSTIYPTEVRI NGEWVKVQNQRMDGAIAITRTPKGIVAQCKLLRDVEVGEEVVVDVLGIRTVRKTESRE KRNAEEFSFMSSGVSSERRVELVVEQVAWELRKIRDGGGKVVVTAGPVVIHTGGGEHL SKLIREGYVQALLGGNAIAVHDIEQAMMGTSLGVDMKRGVAVRGGHRHHLKVINTIRR YGSIAKAVEAGVIQSGVMYECVRNKVPFCLAGSIRDDGPLPDTEMNLIKAQTEYARLL KGADMILMLSSMLHSIGVGNMTPAGVKMVCVDINPAVVTKLSDRGSIESVGVVTDVGL FLSLLLQQLQKLTSPYTAKVS" gene complement(19443..20198) /locus_tag="DP116_01585" CDS complement(19443..20198) /locus_tag="DP116_01585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316984.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_01585" /translation="MPKEFEEMADYKKKFDKFAQSKYQMNYTPRGAIIAKGVEMVINS KQQRLQILKKVYWEIKKGDIEILMGPSGSGKTTLLSILAGLLTPTAGNVYLLGQEITR MSRTELAKFRRQNIGFIFQDFNLFPALTAIENVETALNVKGIRGKFARKEAQALLEQV GLGDKAKLLPRDLSGGQKQRVAIARALTGRPQVIMADEPTAALDSHSGHLVMELLRGL AKEQGCTVLIVTHDPRILDLADRVAHMEDGVLK" gene complement(20315..21502) /locus_tag="DP116_01590" CDS complement(20315..21502) /locus_tag="DP116_01590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316985.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_01590" /translation="MASIARKNLLEDIPRFLVAQAGILFAVSLVTIQTGLQYGFARSS SQLIDQSRADIWVSSKNMQHLGLTIPIPYERVTKARKVKGVDKAEAVIIDGGLWRELA TDKINSITLVGADPQGMLFDRSNIVEGRFNDLKQSFRFMIDKTNLNSIDLKRLGEVGE INNIPAKLVGFTQGTQSIVFGTLMFTSLETANTYRNYGTQTTASPNNVGTSAKKPVST DQISFVLVRAKRGQNIAKLKRDLEQALPDSRAYTRQEMSKITQDFWQVRSGIGFILGL GAVVGVVVGAVVVSQILYASVTDHIKEFGTLKAMGASDWFIYNVIIEQAIWMAILGYL PGIALCVGVAAWTSTTQGIVILITPVSALIVFGITVLMCVGSAVFAIQKVTRVDPAIV FKG" gene complement(21568..22856) /locus_tag="DP116_01595" /pseudo CDS complement(21568..22856) /locus_tag="DP116_01595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744491.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="glycosyl transferase family 2" assembly_gap 22232..22241 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 22971..24887 /gene="shc" /locus_tag="DP116_01600" CDS 22971..24887 /gene="shc" /locus_tag="DP116_01600" /EC_number="5.4.99.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126635.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="squalene--hopene cyclase" /protein_id="PRJNA477356:DP116_01600" /translation="MQIQDKQTVSRVKDAIAKNQNYLLSIQYPDGYWWAELESNVTIT AEVVLLHKIWGTDRERPLHKVEAYLRSQQRDHGGWELFYGDGGELSTSVEAYMALKLL GVPETDPAMVKARKFILERGGISKTRIFTKLHLALIGCYSWQGIPSLPPWVMLLPDNF VFNIYEMSSWARSSTVPLLIVIDRKPVFVTDPGITLDELYAEGVEQARYELPSNGDWT DLFITLDNAFKLAETLNLVPFREEGIQAAERWILERQEATGDWGGIIPAMLNSLLALR ALDYDAADPIVERGLRAVDNFAIETADTYTVQPCISPVWDTAWVMRALIESGLASDHP AVVRAGEWLLSKQILDYGDWAIKNKKGKPGAWAFEFDNRFYPDVDDTAVVVMALNEVK LPNEKLKAAAIARAVNWIASMQCQPGGWAAFDMDNNQEWLNMIPYGDLKAMIDPNTAD VTARVLEMLGCGNLSIDTRNLERAISYLIREQETEGCWFGRWGVNYIYGTSGVLSALS LIAPEKTQVSIERGAAWLVGCQNSDGGWGETCRSYNDPALKGQGPSTASQTAWAILGL IAAGQATSKFAKLAIEKGINYLLETQQSDGTWYEADFTGTGFPCHFYLKYHLYQQYFP LLALGRYQTISELW" gene 25253..25522 /locus_tag="DP116_01605" CDS 25253..25522 /locus_tag="DP116_01605" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01605" /translation="MKNSPQDLVAVEDLLIAVQITDTDDIVIVQGEAIVDLAEFNLTP QEVDKFEKILKTINGRLAESFQKQFPATSVISEIRKKSQCADAST" gene complement(25651..26094) /locus_tag="DP116_01610" CDS complement(25651..26094) /locus_tag="DP116_01610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_01610" /translation="MSNDSENRHPNDDIEDFQSLGGLRILVVDDDADTCILITFILES YGVQVMTAASALDALEVIGQFEPNLLISDIAMPEVDGYSLMRKVRTLSPPLGGIPAIA VTAMDTQEGRDLALISGFQAYLAKPIEPDDLVIEIAKLITSYHSL" gene 26771..27007 /locus_tag="DP116_01615" CDS 26771..27007 /locus_tag="DP116_01615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141514.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01615" /translation="MITPKMMQLLWAVIESTHVSTLLGFDDAALVQLLLKQFKTQQVL DAQATSRLNTYIKSKLPLIRDTAAGRLSTGQGSY" gene 27073..27339 /locus_tag="DP116_01620" CDS 27073..27339 /locus_tag="DP116_01620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197055.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01620" /translation="MNHWLGIALVAYLMGAALEGVSTASQLSQKVPELAKGTQDNPDE SDAPPTGRWQIFAAIASVSICGACVWPCRLLHRSIKGKQDCQKD" gene 27748..27936 /locus_tag="DP116_01625" CDS 27748..27936 /locus_tag="DP116_01625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_01625" /translation="MRMQGVSLAEAARRLGVSQSTLYVAVQKGQIPTFRRDGRTVVAT GALTEYQIRQRPISDYRL" gene 28290..29624 /locus_tag="DP116_01630" CDS 28290..29624 /locus_tag="DP116_01630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01630" /translation="MSIPFNQKFGLTVSFATLLFIGLANTHLRAQSKTQQLSQQHTAT QNLSHTQQQNSALLQQQIPSASSQENAFIVQQTSDNNLDKQNTRLSEPKSAPTSTYYQ VIDELQKYKFSVQGNQVFTSGKLPTTKVNFNQSDLLTVLVNTRKYYQDYASEDQKVLR TGVLATQGVSVEDILKTLDFMITVLQEDIANNRATRLQDPNFINTNFRVIKWSAYNSP SSTSQKQLRITKYAVFTHPASHKKTSKYNIPIYSLKDNSIAEKFYTKYTKQDVLSGIY ESGGKEFGKVEPLAYLTREGFEEALMEGTILLNFTDGSKALFNVDRSNEMPYLRGVAA TSQKRYWYFRQVDDIKGYGYKIDAKISIKPGVTFAGDVLNIGLGRVIVLEYPKDGRKQ LQLGVLADTGGAFLPNLHQLDFLAGIFQGRKDFGQHTRQLPEYATAYILVKK" gene complement(29649..30365) /locus_tag="DP116_01635" CDS complement(29649..30365) /locus_tag="DP116_01635" /inference="COORDINATES: protein motif:HMM:PF13561.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01635" /translation="MLKEKTILITGATKGIGRALVSRLETHNTICFALVARSSELLGE LKAHLQNQGHQCEVFAGDVAAEPFVVSTVQHCVERFGSIDILVNNAGIGKFGEVEQYS LGEWQELFDTNVTGTFLFSREVVPHMKRQGHGHIVMVASDVSKRVCDGGTAYCATKFA QDAFSMALRKEVRRFGIKVSTIFPGLVDTSFHTQPQGDPAHQGWLSAHDVADAIIYTL SAPSHVVIDELMLHPLIQEY" gene 30861..31823 /locus_tag="DP116_01640" CDS 30861..31823 /locus_tag="DP116_01640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878907.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty acid desaturase" /protein_id="PRJNA477356:DP116_01640" /translation="MFRDTTTRPELLARIADNQHNPRQIISVQELKVLNERSNWKGLV QLAFHLTVTGCSGYLWATNFGNWWLAIPALVIYGFSIACMFAPMHECGHRTVFVNNRL NETVGWCAGLLSFYNSTFDRYYHKWHHLYTRIPGKDPELTEPKPSNLGKYLLIISGLP WWEGKIRGHFRACIGQLDDCPFVPRTARGEVIRSTRLQLAVYAGAIALSFAVRQPWFV LYWLLPLVVGQPILRFLMLAEHTGCTLDANLLTNTRTTLTLWPVRFLTWNMPFHAEHH LYPSIPFHALPKAHKQLSSHFAHIDSGYIKVNWDIVSKQGKSAV" gene complement(31870..33285) /locus_tag="DP116_01645" CDS complement(31870..33285) /locus_tag="DP116_01645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879612.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="1-acyl-sn-glycerol-3-phosphate acyltransferase" /protein_id="PRJNA477356:DP116_01645" /translation="MPHSIQQAQPPLEFIPQHFNPVVYHITKWALPVLLRFRTRPWLP AGIAEVETVNVERLVDLYQQFQAGKIRFLMAFRHPEVDDPLCMMYLLSHAVPKVAHQR GISLQYPIHSHFLYDRGMTLWAGDWLGWFFSGLGGFPIHRGKRLDKVGMRTARDLFAN GQLPMSVAPEGATNGHSEIVSPLEPGVAHMGFWCVEDLLKANRTEEVFIVPIANQYRY INPSWAKLDWLLGKLEADCGKQVQKVGESNLVEREKVFYERLLCLGEHILSQMEQFYA RFYHQSTPVTTQINPSASRNHVLEARLQTVLDNSLQAAEQFFGLESQGTIIERCRRLE SASWDDIHRKDLPNLHALSPMERGLADWIAEEASLRILHMRLAESFVAVTGTYVQQKP TFERFAETSLILFDVIARIKGEKNPARPRLGWRKSRLTVGEPISVTQRWSVYQTSRQA ARGAVNEITQDLQMALEKMIS" gene complement(33509..34384) /locus_tag="DP116_01650" CDS complement(33509..34384) /locus_tag="DP116_01650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01650" /translation="MNIIISRKNVFSLLSLSAMAVLGSGLSAGAQTVDSTSTQQFIKS PSTDAFVNQVNLEVERFYPPAFSPTPVESTAVQDNLSPVSNVNHPQQLPTSTQAANNV VTPVPGTTSTSSAALIDSQNAEVQQSTSQSSKSKVAQADISVDPGRPTRGGSSYIGIG GNIGLGGNSALGDGNFMVISKIGLTNAISVRPAAVIGDNTSILIPVTYDLSFKQLSDP FAAPLPIAPYIGAGAAINTGNGSEVAFLVTGGVDVPITPRFTATAALNAAFFSDTDIG LSIGVGYNFGGLFGS" gene 35704..36492 /locus_tag="DP116_01655" CDS 35704..36492 /locus_tag="DP116_01655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200057.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_01655" /translation="MNKVKILNLEIDNLSKLELLEKLQSGVVFTPNVDHLIKLQEDPE FLQAYSISDYKVCDSQILLYASKFLGTPIKEKISGSDLFPAFYNYHKNNPDIKIFLLG AGIGIASKAQNEINRKTSRNIIVASYSPPFGFEKDEQECQNIINMINSSGATVLVIGV GAPKQEKWVCKYKNMLPHIKIFMALGATIDFEAGNLKRSPKWMSEVGLEWLFRIFCDP KRLWKRYLIDDLPFLRLILKQKLNLYINKEHKKKNPIWQIANKF" gene complement(36519..36875) /locus_tag="DP116_01660" CDS complement(36519..36875) /locus_tag="DP116_01660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200058.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01660" /translation="MINSQTGGQDELGSRFFEGTAGKTVMIGTPPVCEAYTTYFNWSN AVIEIPYDAANVADIIAELDARPDHLNRIRKDNLMNSLLRHDWLYRWEQIIEKVGLNT TPEMLSRKTYLANLAE" gene complement(36899..37975) /locus_tag="DP116_01665" CDS complement(36899..37975) /locus_tag="DP116_01665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120179.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycoside hydrolase family 5 protein" /protein_id="PRJNA477356:DP116_01665" /translation="MARSEAATTIVPPLSTRGSAIIDAKNRVVLLRGVNWFGIEQIHH VPHGLWVRSYKDMLAQMKSLGYNVIRLPYSVQALRSRDISGVDFYIGANKDFQAKTPI EVMDMIIQEAQRQGLFILLDSHTLKDDKIPELWYGDGFTEKDWIDTWTFLAKRYKNQS NVIGADLKNEPHGSASWGTGDRATDWRLAAERAGNEILSVNPKWLILVEGVVGNVPGQ KQPIYAWGANLEGVLKYPVRLKVPKKLVYSPHEYAASHLPWVKEPILPGNLYKRLEIG FHYIATQGIAPIWIGEFGGNQVDTKSKEGIWQRQFVDYVDKKKLSFTYWCWNPNSKDT GGILLDDWKSINTDKQKLLNQVLR" gene complement(38182..39219) /locus_tag="DP116_01670" CDS complement(38182..39219) /locus_tag="DP116_01670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009768997.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_01670" /translation="MHWTIAAPFINQQNLVNEPWLTRNVPGDRHQFHIVPRSKPLGNW HNQKSSVTGFRSWLIYWQHGMEAMKASEGGVITLFPQLPAVIGMQQRMTGKKRIPVVA WLFNVGTCSSGIRQAIAQFSLKHIDHFVVHTRHEVDIYHHWLGIPKERFEFVPYHQPE IPITYEENTTHPFITALGSAHRDFSTFFQAVEKLNLPTVVASSQRALEGLTIPPNVKT PFGIERADCLRLAQEARISVVPLRPLPQVTAAGQVTIVEAMLMGRAIIATRCHGAEDY IQHGETGLLVEPKSVDDLMQAIEMLWNDPALRNRLGQAAKRYAEEHFSCDAAGAALGR ILDKVADAAGM" gene complement(39310..40521) /locus_tag="DP116_01675" CDS complement(39310..40521) /locus_tag="DP116_01675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_01675" /translation="MKITLICHDIPYPPNRGARVEMWRRIKAFSNIGVELQLISWINE PLQPEEIAEIKKYVKEISLIPFKRTLGSLARRAIDLLHYPLEVTSRIVRGKELTTLLS GVRLFHPDVIWLDGIHGGEIADKLSKKLNVPLITRSHNIEYLYYRRLLASTTDFTSKL KRYLSVSHLESFEKDLLKKSTLFYDISVDDFKFWQSQGFRNGRYLPPIVEFPKDNYSE QAHDQISANLVYDVVFLGNLKVDNNVAGIVWFITQVFPIIRSALPTVKVLIAGSNPVK KVKQLCEEHQGVSLSINPASSLAVYQSGRVLINPVLTGSGVSIKSIEMLVSGRPIVST PQGIAGLPEEVRKLFKIAVNAQSFAEEIVRLLSTPLKVNVEQELLESLFGPQVVEGVV SDINKEVFTVN" gene complement(40537..41850) /locus_tag="DP116_01680" CDS complement(40537..41850) /locus_tag="DP116_01680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120176.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="O-antigen ligase family protein" /protein_id="PRJNA477356:DP116_01680" /translation="MHRLSRLAEKVFVVLTLFFSTSALIPILIEKEDSVDASQDPYTP ILFMGVYVVTLFLVIKNRKSFLYVAQKDIWIWLLVGIALASVLWTVAPDLTLRRGVLL LGTTGFGVYLATRYTMREQLELLAWMFGLIILLSFVFAIALPSYGLMTFQEEGAHAGA WRGVIAHKNHLGRLMNVSTIVFLLLCVDNSLYQQKSQQKYKWILWVGFVLSAILIILS TSKTAFVVFLSLTTILQLYKALRGNYNQVIPSVLTVILLVGSIAILLLDNLPVIATAL GRDLTLTGRTDIWGVMFELIWERPLLGYGFNAVWQSWDNEVTAYLWRTLEWQCPYGHN GFMDLLAELGIVGLIVFCISYVTACIRGVMWLRATKVVEGLWPLMYLTFLFLSNVTES TLVATNSIFWILYISIIFSTAVEYEQAKKYNYYLSVMADEEGRIN" gene 42355..44661 /locus_tag="DP116_01685" CDS 42355..44661 /locus_tag="DP116_01685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314646.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="capsular biosynthesis protein" /protein_id="PRJNA477356:DP116_01685" /translation="MPTKEQSLQYLRNSNTILKEPYLFWRRPEGEEENTGFNQVLSVL RRRLGLIASATVIVTTVAIAWTATRTPKYEGKFQILVEPLKSADSELLLLLSETLKQN INEITKQNKTDMDYQALMEVLKSPKLIEPVVKNLKNRYPNISYDQLVGSDVSGKVSPE RVGTLHISRLGKGKDLSRVVEVRYRESNPQKVQYVLEQVSQAYQNYSKEQQQTNLRQG IKFIEQQIPKIQFRVNTLQGQMQVFQKKHNMFNPELQGKQLLNRVDELKTQRIEIERG LAETRSLSASLQKQLDMSENTAIAASALSESPQYQQVLTRLQEVETKIATESTRLTDN NPVMRNLREQRRKLLPLVQQEAKLALGRNGGRDNSQVGVYQNSVRRDLIKQLADTANQ NQALESSLQANQKATAELNQQIQEYPALSRQYANLQRELQVSSDTLNQILTKQEALRV DAAQQDLPWEVITPPTLPRDKKGHLVPVGLNGERNIALGVVGGLLLGTLAAFVLENSQ NVFRDSEEIKRTTKLPVFAVVPFHKELRHPTSVTDKQLPLADQKGKNQFAPQVKAKTQ EYQTTAFTEAFCSLYNRINSLKSQASIHSIVVTSATSGDGKSTVAVNLAKIAAQAGQK VLLVDANLRHPQVHHALGLVNIKGLSEILFLGIDFNDVVGQAPREENLFVITAGDAPQ NPTKLFSSQRMENFVEEAHANYDLIVYDAPHIMGLLDTSILANRVDGVLMVAGLGKTV RPSLHQALEELKTGQVPVLGIVADTIER" gene 44715..46076 /locus_tag="DP116_01690" CDS 44715..46076 /locus_tag="DP116_01690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314645.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipopolysaccharide biosynthesis protein" /protein_id="PRJNA477356:DP116_01690" /translation="MRSIHPVINIIKKKFSSQFIRNVGWLSISEIIYRVLRLGLVVII ARFLSRHDYGLGAIVMTVREFSITFADVGIAAKIIQAEEEELEDLCNSAYWLSWVVFL SLFLIQSIAAFPIAWFYKSPNLILPIIVSGLAYLIWPLSTIQKTLIQRENRLKVIAIT NSLQNLTGSILSAIFAVSGMGVWSLVLPPILAAPMEPLIYYRAHPWRLNTGFTTKNWG QILKFGKNLLGVSLLRTLRNNLDYLLVGRFVVNPGDPEYGIQELGLYFFGFNAGLGIS LSFINAINSAILPHLCAVRSELSELKKSYFNSLKTIRATIVPFVILQSSLAHIYVPIV FGKKWQPAIPIVVLICLSAIPRPFADAASQLLIAVGKPHLDLHWNVLFTILFSIAILI GVNLQVLGVDTSVLSVHLGEHWQIIVVAISVLLVHLVFLPLFTLWATRYVFPKSKKNE ALL" gene 46073..47137 /locus_tag="DP116_01695" CDS 46073..47137 /locus_tag="DP116_01695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740358.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_01695" /translation="MKKVSVIIPVYKVEKYVAATIQSVLDQTYKNFELIIIDDGSPDK SIEICQQFTDNRIKIIRQENRGVAAARNVGIRHAQGEYLAFLDADDLWVSEKLEQHVE HLKNSPAVGVSFCRSSLIDEAGKPLGIYQITKLKEITPLDILCRTPIGNGSVPVIRRE VFEEIAFQDNLYGVVENFYFDDDRKLHPSEDVELWLRIAIKTKWLIEGIPEALTLYRI NSQGFSAQLVKKLHSWETMLEKARAYVPPASMAQLEKIAIAYQLRHLARRAVTLEDGS TAVEFAWRSLSTHWRIILEEPHRTIITLAAAYFLWLMPRQLYHQVQSVALKIAGTIQK RRIQQEELAKKSYSVVLKDV" gene complement(47370..49040) /locus_tag="DP116_01700" CDS complement(47370..49040) /locus_tag="DP116_01700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321643.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_01700" /translation="MSEALFCIENLRVAYPHRSDEEAQWAVNGVSFILQPGERMGLVG ESGCGKSTLGRAAMRLLPPSSQIEGRVTFQGRSVFEMTPEQLRKFRGEAVALIFQDPM TRLDPLMTIGNHCLETLKAHSPQLSTKQAKEKAIATLEKVKIPASRWSQYPHEFSGGM RQRVAIALALLLSPKLIVADEPTTSLDVTVSAQILQELTRLCGEENMALLLISHDLAL VAEYCDRIGVMYNGKMVETGSSQTVFQQPQHEYTQSLLKAALHIQTVDESTSFSDFSS IPVHRGQEVQSTPILRVLELQQHYTLEPNFIERLLKRQNQTIKAVDGINLELYPGEIL GLVGESGCGKSTLSRTILQLIRPTAGKVEFLGQDLTNLSRQQVRASRRQMQMVFQDPH ACLNPAMTVGQSIADPLLIHKLAVPANAKQQVLSMLEKVGLKPPQVYYERYPSDLSGG QQQRVAIARALITRPKLLICDEPVSMLDASVQSQVLDLMLELKEEFELTYLFITHDLW LARFLCDRIAVMNSGKIVEIGRTKQIFANPQHPYTKTLLAAAPLLGRA" gene complement(49467..49817) /locus_tag="DP116_01705" CDS complement(49467..49817) /locus_tag="DP116_01705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01705" /translation="MSLTRDKYHVLKVLLKQLHSDVMTTKVDASEVAQRVRSLQQFFQ QQIVPLVNLDTDSNDESRLQSNQTEMSKQLRLLEIDVMFFQGARQASTAKSRLDAIGD RLTTLINYCNAILQ" gene 50117..52396 /locus_tag="DP116_01710" CDS 50117..52396 /locus_tag="DP116_01710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="(p)ppGpp synthetase" /protein_id="PRJNA477356:DP116_01710" /translation="MSSTVVVTTRIDVTLPEWLQTCLNGSYTVVGTETEDGRRQSDMA LIRQAFEFAYQLHEGQYRKSGEPYICHPVAVAGILRDLGGSADMIAAGFLHDIVEDTD VTIEEIEQRFGKEVGLLVEGVTKLSKINFKSKTESQAENFRRMFLAMAQDIRVIVVKL ADRLHNMRTLEFLRDEKRRAIALETREIFAPLANRSGIWRIKWELEDLAFKYLEPEAY RQIQEYVAEKRTAREERLTKIAETLRTRVEEAGIKCLDMSGRPKHLYSIYLKMQRQNK EFHEIYDLAALRIIVNSNEECYRALAIVHDACRPIPGRFKDYIGLPKPNRYQSLHTGV IGPWGRPLEVQIRTLEMHHVAEYGIAAHWKYKETGDSNIIHWRPSDEKFTWLRQLLEW QNDLKDAQEYLESIKDNLFEDDVYVFTPKGDLVALNPGSTTVDFAYRIHTEVGNHCAG AKVNGRMVPLSTRLQNGDIVEILTQKNGHPSLDWLNFVRTSAAKNRIRQWYKRSRREE NIARGRELLEKELGRSGVENLIKSQPMQIVAERCNYHSMEDLLAALGYGEVTLNLVLN RWREVVKAQQPVADAPLVPTKELTSTTKALRDLTPATSRTTDSPIIGVEGLVHYLAKC CTPIPGEPIIGVVTRGRGISIHRQGCQNLENVECDRLVPVHWNSPGEIYSRPATYPVN IQIEALDRVGILKDILSRLSDHGINVRHAQVKTATSQPALIDLGIEIRDRPQLEQIFT QIKKLSDIINIRRVGQIEE" gene complement(52760..54040) /locus_tag="DP116_01715" CDS complement(52760..54040) /locus_tag="DP116_01715" /EC_number="6.1.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459840.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine--tRNA ligase" /protein_id="PRJNA477356:DP116_01715" /translation="MLDIKLIRENPQLVQERLKTRGGDYDIQPLLELDKQQRELEAKR NQLQARSNEIAKLIPEKIKAGCNPTGPEIQALRENGSSVKAEIGTLEPQEKELSAKIE ELLLTLPNLPSDSTPVGRSEEDNPEVRRWGDEYLPQNPNILPHWEIGEKLGILNVERA VKVAQSRFITLIGAGAKLERALIQFMLDRQIAAGYVEVIPPFLINTESMTATGQLPKF AEESFKCAADDLWLTPTAEVPVTNLYRGETLNFEDLPIYHCAYTPCFRREAGSYGRDM RGLIRLHQFNKVELVKFVHPSTSSEELEKLLNNAEAILQALQLPYRVIELCTGDIGFH SAKTYDIEVWLPSSGKYREISSCSNFLDYQARRGQIRFKESGKKGTQLVHTLNGSGLA VGRTMAAILENYQQEDRTVRIPEALQPYMGREVL" gene complement(54386..54967) /locus_tag="DP116_01720" CDS complement(54386..54967) /locus_tag="DP116_01720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01720" /translation="MSDSSTLRAIAQIFRLTGWVSFWIQLVLGVVSGVILLFAVFSQR GANTSSNPGTGFGAIFAVAGLVALAVGIYIAFRYTRLGLRLESSNPNNRPRKVETVQV VRFAIIVHLVGMLLTLLGAQIIVGTLVTKSLTLPQLGAGVITQIDPSRSIQPLDMFVV QANTNTVTAHFAGLVASIWILYRISKPQSERSS" gene 55525..55872 /locus_tag="DP116_01725" CDS 55525..55872 /locus_tag="DP116_01725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115158.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PadR family transcriptional regulator" /protein_id="PRJNA477356:DP116_01725" /translation="MKLEDIYHFFENPPPTYLCQELAVCYIMYILIPSESYGTELIQR LETEYPTYRLSDTVLYSAIKFLEDQKAITGYWKKLEGRGRPRRMYQVSPEWQFKAQDL ARLWQQYISGRTS" gene 55927..56457 /locus_tag="DP116_01730" CDS 55927..56457 /locus_tag="DP116_01730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878414.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cofactor assembly of complex C subunit B" /protein_id="PRJNA477356:DP116_01730" /translation="MDTATLPSTLLLTFLLSVGLFFFIRASTKDRTQTTQLVCEQDEA TLMPQLREYFQTRSYRVAAVDPKQNQVTFEGIVRPSWFLAVFLTLLAAVGLLCLSLVL SLLFPNLSTVFLGLVLVSPLSGVFYWKKAGKHELVSLKVEAAESDQHSPTQITLTAHR DEITELRRTLGLKSCE" gene complement(56985..>57098) /locus_tag="DP116_01735" CDS complement(56985..>57098) /locus_tag="DP116_01735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129573.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3155 domain-containing protein" /protein_id="PRJNA477356:DP116_01735" /translation="EKGLFGAQYVEENHFLFPSLRVLEAPPGQESLAVASR" BASE COUNT 16619 a 12292 c 11761 g 16416 t 10 others ORIGIN 1 aacggcagtg aaacccaaca tgaacaagga tcaagtgttg ggtttcgtcc ctcaacccaa 61 cctacagata ttataagtgt ttaagcgaac atgatattat ctgaaatttt gataacctga 121 catcttgcat ctggtgacat caaggtaaca aatattacat aaaatttgac tccttctcgt 181 ccgcctagga ctgtaagtcc caggctaata ggcgaagtcc attaaaatgg aacggcactt 241 gctacaacgg ggggaacccc aacgccagac gcctcaagtc gcaaagctcg ccggacggaa 301 tcttccgccc ggcagacttt gcgggaaacc cgcccacggc gctggctccg caacgcagtg 361 cctcctgaaa aaactagaac tgatttttga gtaaatataa acactatttc atagaagcat 421 tagtagttta ctcgtaatgg tggaagatct cagataactt tttactgttc actgttcact 481 gttcactgtt ttaatgaatt gtggcagtgt tgagatttat taagttggca ttatgaagta 541 tcagcaagta ctcgttgggt tggttctagc gatagttctt ttcttgtttc ccctgtcagc 601 agagggagtc ggttcttcga gtattcggcg ttccacagac gatgcattca acggtaagga 661 tttttctggt caaagcttga ttggatctga gtatattaat gtcaagctta agaacgttaa 721 ctttagcaat gctgatttgc gcggtggcgt tttcaatggt tcgatattag agggagtcaa 781 tttgcatggt gtagatttta ctgaaggtat agcctacttg gtcgcttttg agggtggtga 841 tttcagcgat gcaatcttta caaacgcaat gatgttgcgt tctacctttg atgatgtaga 901 tattacaggt gctgatttca ccgatgcagt tctagatcgt ctggaagtta aaaaactttg 961 tgcgaaagca agtggggtga attctaaaac tggtgtttct 
acgcgtgagt ctttgcaatg 1021 ccgataaagt tactggatat tagcttagcc gtcacgatgt ttttgtaagt agaaattttt 1081 ggcgatcgct tccgagtgag cgatcgctct atttatgttc cctcaacctg ggatgaagta 1141 tcatcttgcc atagtaaaaa gtcaaaagtg agccagtgct cgactagggt tccacaggct 1201 acaggagcca gtgcgttggg cgggtttccc gacttgtaga cgccggaggc ggcttcccgg 1261 agggtagcac ctggcgtcaa ctggcgttag cccggagggc gtgccgcagg catacccgca 1321 agggtaaaaa gaaagaactc ttatgctaca agctttttac ctatttcaaa tggtctgttt 1381 ctttccgcgc ccgctgtact agtgcaaata ataatacctg cacgactggg ctgccaacgc 1441 actaattttt accgaggaaa atatttgtta gtaaagcgat acgcctttat acgtctgttg 1501 ctacgagtgc aatcgcagat tatttaagca actccatacc cagtatacgg gcgcttgtaa 1561 ggtggatgca gtcccaagat aatgtcttct ttaggtatgc ccatctcgac taattcttga 1621 gcaagattgg cttctgtcat attgcgttga atccaaactt tactatcctt gatgtctaaa 1681 tgaatgaaac agttataaac ccgtttgagt cccgaccagc caacatctaa aatttggtag 1741 tggtcgtgtt ctgtatcaaa gacaagctgg cactcaacat tgttatctga attaccagac 1801 tcaagatttg catgagatgt caacagttgg cggatgattt gacggtattg ctctagtctg 1861 tccatttaat aattacctgc tgaagtggct cgtatacaat taatttaact tgatatactt 1921 tgactgcttc ctgagtaaat ggttcttgaa aaaatgattc aaatgtatca acaggaactg 1981 ccagatacag aattctattt ggttcgctca tctgtaatgc taggcggtag tttaaaaact 2041 gtcctaatgc cccatgaaaa tcggtaattg ccgaaggatt tagaaagctt ttgatttcaa 2101 ctgcaatctt gcgtcctgct ttttgggccg ccaagacttt ctctgctgct agatcgatct 2161 caaatttcac cttctcgatt ttgatcgtta gcggatctgc tgtaattatc cactgatctt 2221 tgaggagtgc ttgcttgacg gcattatgaa ataggtcttt ggcagccatt atttttgagt 2281 cctttaacat attaaaacat atcagcagga tcagccgctt ggagtttgcg ggtggcgatc 2341 gccccagaaa taacacacat aataatggtc aacatgagca ccactaaagc tcgtgttaat 2401 gtcatgtata tcggcaaatt tgtcgctgtt cttgttaaag aataaagtcc caaaggaact 2461 attgttcctg gaagaaagcc cagaaatgcg aggataagag cttcttcaaa aatgattcct 2521 aaaaagtata aattgtgata acccattgct tttaaggttg cgtattcctt caggtgtgca 2581 ttgacatcgg tagaaagaac ttgataaacg ataataatgc caacaaggaa ccccattgat 2641 acacccaaac caaagataaa accaataggg ctttctgtct gccagtagtt 
ttcctcaaac 2701 tgaataaatt ctttatgagt gaggactttg acatcattgg gtaagtaagc tcttaatgct 2761 gttgcgactt gctgtggatc gtaacctggt tcaagataaa ctaaacccag attaatactc 2821 gctgcttgtc gtccgggaaa cagccgtaaa aaattctgat cactggtaat caggtttccc 2881 tcagccccaa aggaagtccc caatttaaat aagccattga tagtaatggt atgattttct 2941 gcttcggtag taacagtttt accttgttca atttgagcga tcgcttcttg atattgccct 3001 ctagaacctt tgtcaaataa gaaagtatct ggtagcttaa tcttatctaa ctgttggtta 3061 acttctggta gaccaaaagc tggttgctca ggattgaacc caagcagttg aatggaagtt 3121 tcacgacgag tttggggatt tttccaggtg ataaaactca catacaatgc ttctgccgac 3181 tttactcctg atatcgagga agcttggtac agtcgccgcc gggaaaatgt agacagattt 3241 tgcatattac gacctttcgg actgactaaa acaatgtctg tctgcaaaac tcggttgagg 3301 cgagtattac tgtcataaag tgcagtctga aagccaagct gcatgaacat gagcacatca 3361 gcaaaagcaa tacctgacaa tgcgacaagc agacggcttt tttcatgact cagttgcagc 3421 catcccagag gtattcgccg ctgtaactgt tgtatgaatg caatcaaagt gaaatcacca 3481 ccttaacctg tagattggta aactgagcag ctttctggct tgatatctta tcgagtcgga 3541 tatgcacttc cacaattctg gcgtcaatat tagtactagg atcggtattg atgacactct 3601 ggcgacgcac ttgtaagcct atccgctcga cttttccttg cagttcacca ggtagagaat 3661 cactgataat ccgcacatct tgcccaggac ggattttact gatatcactt tggtaaactt 3721 ccgcaacagc atacatttgg ctggtttgcc caatttcaac aatgccatca cttgatacca 3781 cttcccctgc acgtgtgtgt atctccaaca cttgagcatt ttgtggtgtt cgcacgtaag 3841 cttgttgcaa atctttttga gcttgattca cttgtgcgat cgcatcatca acttctgctt 3901 ttgctgcttc cacatccact ggacgcactt cagctatttt ctggagtgtg gctttcgcct 3961 cgctaatttg ttgactcagc gttgtgttaa tccgggtgag aacagcttga gcttcactga 4021 tttgcctgct actggtatca tcaatacggg tgagtattgc tctggcttct ttgagttgct 4081 gtgttgcagt gtctacactc aggcgtttac tatcaaatga agacttagaa attgcaccat 4141 tcgagtagag ttggtgattg cgctgatatt cggcttgagc gttgttgagt tgggcttgca 4201 atttctcaac agttgcttgt tgcgctatct tatcgccttg ggactgtgct ttgagtcggg 4261 cgatcgcctc ctcttgcgct atcctctcac cttcccactg tgcctttaat cgctcaactg 4321 tggctttttg tgcttcaatt tcaccctgtt ttgcacctgc ttgcacttta gcaaggtttg 
4381 cctgtttcac tttcacttgt tgctgtgctt taacgacagc agcttgaaga cgatctttgc 4441 tgtcaagaat agcaatcacc tgacctgttt taacttcctg cccctccttg actaacaatt 4501 gtccaactcg actttcttga ctagatgttg gtgcagaaat tttaataacc tctccctgtg 4561 gctctagcct tcccaaagca acgactttta tgattttaga aatggtttgg tgagattttt 4621 ggatctcatt agacttgtgt gactgtaact gccagaacct gtatgttgtc acccctgcaa 4681 caacaaatgt tatgacagct gccagtattg tcagtgggcg acggctatgc ttaaagaatg 4741 aggaaccgct agctgttgca tcacgcacca taaggtgcct cctgtgggac aaaataaact 4801 aaatagttta attcctcttt ttaaactaaa ctgtacagta caagctgtca agaaaatttt 4861 cgccccagtg cgcgtgttgc atacactcca tatagggggg ggattgttat atccctacag 4921 gaaaccttta ttgcagagtt gcaaacacct atagccggct attagcaaga gatgaaaaaa 4981 tgagaaagac acaagctaat aatttagatc gtaaactgtc agaagaaaaa gtggatgcca 5041 ttctggcagg tgccatgcaa gaatttttgg cacatggcta tgctgcgaca acaatggatc 5101 gggtgacagc agcagcaggc gtatctaaaa caactgttta cagccacttt caagacaagg 5161 aaggactatt tactgccctg atacagcaac ttatactgga aaaatattat gcatcattta 5221 acccacaaaa agctcaattg atggaaggtg aagcatctat catcctgcgt cacttggcat 5281 tcagtatgct aaacaacatt ataggtgacc agcaggtgtt aggattaatg cgcttgatag 5341 ttggtgagtc tggacgcttc ccagagttag cacgagcttt tgttcttaac cttgaaaaac 5401 cattcttgga ggatttgtgt caattcctca tgtcccgtcc tgaactcaac ttacctgatc 5461 cagaagttgc tgcgcgagtc ttggtcggaa ccttagtaca tttcatcttg attgaggaaa 5521 tacttcatgg caatgatatt ttaccaatag agcgggagcg cctgattaac aacctgatta 5581 gtttgttgac tgcaaatcag actcagaagg atttaccagc agatcaatac tcagggacaa 5641 ggcagaaatc tttcaggcgt aaccgtaaga attccggtaa gtttgagcga gattatgact 5701 ctgaaccgaa acgtttaaga tccatcagac tcacagatac agcttgggaa aaactggccc 5761 aagtcgctgc taaacatgaa ttgactcgca gtgagatgat tgagatcatt gctcgtgatg 5821 gtgaactgac ttgaggattc agaatgagtt ctttgaagaa acacttttaa cagggaacag 5881 ggaataggga acagggaaca cccgaacagg gaacagggaa taggggagca gggagatggt 5941 gtttctttcg tatgttgctg gtggttttct acctaagcgt gagaggctca ggtttagcat 6001 gacttgcaaa cttgcgtata tacatggtta aaagaatcag tgatatcaag ttcgggtgat 6061 
tacttataat aaaatttatc aatgtaggtt gggttgagga acgaaaccca acataaactc 6121 aactgcgggt gttgggtaac acgagcgttc aacccaacct acagatatta taagtgttta 6181 accgaacctg atatgagact aagttgcgtt ctatagcagt tgccaggtag gttaggacat 6241 gaactaatga gaaaatacgg acaccacgag actttcaagc ctgttaagag ttccctgcta 6301 taacggagaa cttaatgtga ttaagattgc tgataccaat ttgaaaaaag aatgcgacag 6361 atacacactc ccaaaccctt gttatgtttg gctttcttca ttttgcgaaa agttaagcca 6421 gtcgcttgcg ggggttcccc ccgttgtgcg aactggcgtg cgcgtccggg tttcccggcg 6481 caaacttttc aagacgaatt ttgatagcgc agcgtggcac gaagtgccat attttgaatt 6541 ttgaattggt atgagtccct cccttgcccg ctactttgca tggaacagcg acacactttg 6601 tgcttacaca attgacttgt gctacctgag aattgcaact tgtcagtgtt gactgttcac 6661 ggtttgaacg ttcatcagta tatctgagac atcggtgcgc gaaatcatta cgtagttgta 6721 caaagttcaa tacactattg cgggcgcgat cgcaattatc tgtgtcgctt ttcagctatc 6781 aattaacatg aaacaagtat catgttaaat ttattacttt tgacttgtgt ctacatctgc 6841 atctgttatt tggtagatat ttctagagat agtggtcgta gtagtgtctt ttgtcaacaa 6901 acaagaggga gcttctggaa tgattgagaa gttataaaat catgacattt aagtactatc 6961 acttagtgct ctatttttta acttacttaa attgttcata aaatttgatg gcatctttta 7021 aagggttaat caaaattgag cgcaagcgtt tttcgttgac atacactatc ttctgaagtt 7081 gatttcaggt ttagttaata aagggaattt acttttgctg gggtttcatg aaaattatag 7141 aaaacgggta acctaggcgt gatggcggtt ctagacgtta aaattatcgc cttatcgtat 7201 aaaccacttc tgcgctgcct caaagagtac ataactcgct tcgcttaaat tcaaagtatg 7261 cctgcggcat gctgcgctat caaagtatgc cctatgcctg cggcacgcaa tcatgcgaac 7321 gggcacgctg cgcgtatgcg caaagcgcac gcccaaaggg ctaaagcgaa gcgtctccgc 7381 aggagataca aagttcaaaa taataactct tttttttgaa ttcccgaagg ggcggttgcc 7441 cgttttctac agaatcagtc caaggcttta atgcgtcaaa tttctctcat atctgcaaga 7501 aatgagtact gaataaagtc tatatttgta gtactttttt tactttgtat tttttctgta 7561 aaaatctttg gttaaattaa gtattattac tgatatattt tctactaata aaataataaa 7621 ttattactgg attagttact ttagtaaaaa aacacacatt tttttgaagc gagggatgct 7681 gcatcaaagc gtgagatgtt tacgttgttt tgcgatgtct atggtaacta agtctacaaa 7741 aaatcactgt 
atagacatcg cggaatttca ggacaattaa ctccctaagt tctatatcct 7801 tccgtaaact gcgcggttat gggagaattc aagattcaaa attataagat tgaatttgag 7861 taactttgag cactcatctt acggaggaga tgccaatcgc gcgtagcttg tacgtaagga 7921 tagaaagact taattaacag tcacaaataa tacaactaga ggaatctcat gtttcatcgt 7981 cacattgtct cggcgtgttt gcttgtgtca gggtttgccg tatcagtgca gacagcttat 8041 gctcagaccc ctccggttcc atcagccagc gcccgcgtcg ctacttccta caacaatccg 8101 ctttattcta acctacccat caccgttttc atagagacag taagcagcaa taatagccct 8161 gtaaacagcg agtatacctg cgtcggcgtt gcgcctggag caatcttctc acaactcacc 8221 tgcggtgttc ctggcggatc acaactcact ggcggttcta atggttcggt agcaactgtt 8281 ggtccaatca tagtcggtaa aaacgacgct acaatttgtt gggctggtaa atttttgttc 8341 gcgcccaaca aatacgccta tcaaagtggc tgttccacgc cgaagcagat agtaattgat 8401 acacaagccc ctgccaatcc agcagcacca ccagtcctac cgaacctacc ccttcctggt 8461 ctttctcaat aatcgcagtt caggaaaact tgataaaagc aagactaata ccaatctgcg 8521 ttcagacata acaaccccag tcctctccca atttgagcta tctacaagat gccagtcatg 8581 gatatcaatt tgggagagag gaaaaggagg cagagtaagc ttcgtactac ctttgactgc 8641 aacttattga ggcattctcc cggttaaaaa tcgggagatt cagacaaaaa tgttctgtga 8701 tttttggaga actgctttaa gggtgaatga cgaaagcacg cttatgccct atgtccagag 8761 gacatgctgc gcgtacccct accgggggtg ctttgcgctt acggacgcgc ctacttgctc 8821 gcagtttgtt tgaaaatact tccatgcctc caataagaaa gtttcttact caagcactca 8881 actactagag agtcaaattt tcatgcttca tcgcaagatt tcgccgctta tgcttctatc 8941 accgttctcg gtaccagctg cgacagcatc cccaccagaa aaaggcaatc cccagtcagt 9001 tgagatgcct gagagtacgc ctagcctcac ggacttagtg ctgcctaaca aggacaaagg 9061 cataacaaaa acagaagttg aattacttat tacacaggct attcgagacc acgagtttag 9121 aactactgtc cacgctatat tcatgatagt tgtagtctac acagctgggt tcttttgcgg 9181 tttcttgtta ttcctaagtc agtggacacc gcatcactaa caatacggtt cggttaagct 9241 atcaacgact cagccctcaa caaagtaggg ctgggaaccg cattactgat tctcaaaaat 9301 ctttggaaga ttgctatggc ttgatttatt tcccctgtct acttactttt ctttcaaaaa 9361 gtacatcttc atgcagaaca actataagct ataaagtctg tcccgttgtg ttctgaatgt 9421 attccaaaac tttttgataa 
gaatctgaat tacttacggg gttaatgact actgaaatag 9481 tactacctgg taaccaaaaa gttatcctac cattatttgg ttcatgacga aaacaactaa 9541 tactgtcgag gttaatgata tattcgttcc tttcataaag aatttttacc caaaaggatt 9601 tctcgaattc taaccctgtt atatgttcta cataatctaa aatcttttga tagtcttctt 9661 ggttactttg gcggttaata actattgaac tagcactatc tggtaaccaa aaagttattc 9721 tgccatttcg ttcatgacaa aaagctgtaa tctgcttcaa attgattata tattcatttc 9781 tttcatagat aatcttcacc cagtacgcca caaaattgcc tcaatcctca acaataataa 9841 cggtcagcgc tacactcata aacaatttta gtggtataat ttatagtttg tcattagcca 9901 ttgacaatcg aacaaatgac taatgaacgg cagatgcttt atgccgggga acccgtccac 9961 cgcactgcct cctaatgact atttaaaata cttccagtgg tgtttgtacc tgagaagatg 10021 gataataacg agcaattgtc gcttcgataa aacctttaaa taaaggatga ggtgcactgg 10081 gagacgattg aaattctgga tgaaattggc aagcaataaa gaaggggtga ttggcaattt 10141 cgataatttc tactaggcgt ccgtcgggag aagtcccact aatgagataa ccagactcta 10201 caaagatgtt gcggtaagag ttgttgaact cgtaacgatg tcgatggcgt tcataaatca 10261 cttcttcttg gtaaagtttg aaagccaaag agttaggaag caaacgacaa ggatataaac 10321 ctaagcgcat tgtcccgccc aaatcaatga catcctgttg ttctggcaat aaattaataa 10381 ctggattttt ggtatgtgga tcaaattcag cgctgttagc atcttgtaat ccggcgacat 10441 ctcgtcccca ttctatgact gaacattgca ttcccaagca caaacctaag aatgggattt 10501 cgttttttcg agcatatcta attgcggcaa ttttgccatc taccccccgg acgccaaaac 10561 ctcctgggac tactattcca tccacacctt cgagataggt ttcggctgct tcagtttcta 10621 gttgttctga atttacccaa cgcaggtgca gttcgctact catcgcaatt gctgcatgac 10681 gcaaagcttc cacaactgaa agataagcat cactcagctg cacgtactta cctacaatcg 10741 caatttccac ccgatgagtg ggactgtata atttttctac tagtgtttgc cactgcgcca 10801 aattaggttg gtgttgttcc atgttcagca actctagtgt ttgctgtgct aatccttctc 10861 gttctaaatt cagggggact tcatagatac tcttagcatc ttgagatggg atgacgcatt 10921 ctactggtac atcgcaaaat tcagataatt tgtgttttaa cgatgttggc agtgggcgat 10981 cgcaccgaca aattaaaata tccggttgaa taccaatgga tctaagttct ttcaccgagt 11041 gctgtgttgg cttagttttc atctcacccg ctgaggcaat ccacggcatc aaagtgacgt 11101 gcatatacac 
cacatttatt cgtcctacat ccttcctgaa ttgacgaatt gcttccaaaa 11161 acggtagtga ttcaatatct cctactgtcc caccaatttc tgtaataacc acatcaggat 11221 ttgtatcttt agcaactcta ataatccgtt ctttaatttc gtttgtaatg tggggaataa 11281 cttgaactgt gccaccatta taatcaccac gacgctcttt attaatgacg gcttgataaa 11341 ttgaaccagt cgtcacgctg ttcaaccggg acatagaggt atcagtaaaa cgttcgtaat 11401 gtcccaagtc caaatctgtc tcagcaccat cttgagtcac aaacacttcc ccatgctgaa 11461 aaggactcat ggttcctgga tcgacattaa tataagggtc gagtttgaga attgagacag 11521 aataatctcg tgacttgagc aatcgcccca gacttgctgc tacaattccc ttaccaatac 11581 tagaaacaac accgccagtt acaaatacaa acttagtcat agtattttta attttgcgca 11641 acttctagaa acacatgccg tcattgtgcc acagtcaccc ccttcgggaa ttcaaaattc 11701 aaaattataa atttcaaatt ataaatttaa attataaatt tatttttttg aattcaaccc 11761 ccatgctttt agggggaggt gcttaactcc gctcttgttg tcaattattc ttttattttg 11821 acgtgagatc acttttagga ttaatcttat taagttctat agtcaccccc tctatcgctt 11881 tggctcaaga acaatccctc aaggttgttt ttccgaaaac aaactaccag acttctgcac 11941 agaaaatttt ctttcttggt acagcaccat caagtggaga ggttttgatc aataatcagc 12001 cagtaacccg tagcaagtct ggtcattttg ccccaagttt ccccttacag ttgggagaaa 12061 atctttttac tgtccgctac caaaatcaag aaatccagat tagggtgaca agactttcca 12121 ctcaaccgga agtcccacag ggattaggct ttgcaaaaga ttctctcacg ccaagggtgg 12181 acattgccaa acttccagga gaacagattt gttttagtgc gttagcccct agggcgactc 12241 cctctgggag tatcgcccct ggaaccgctt cctctgggag tattgccgca ccgaatgtga 12301 gtgtttctgt aaaattggga aatcaaactg tttctctttt accccaacct caacaagcac 12361 aactaccagc aaattctgcc gctttaacag ggcaaaacca accctcaacc cagtctagcg 12421 ctggcaaata tcagggttgt acgacagttc ctcagtttaa tcctcaactg tacggtaaca 12481 acattgtctc tggtgctgtt gctgagaatt ctgccagaaa tatagatttg ggtaaaccag 12541 aattccaact cacgctgaat ggtaaaacaa taactcaaca aggaactgga aaagtcacca 12601 tcctctcacc tgaacagttg gaagttgtag aggtcatagc agactctgga gttgcacgca 12661 ctggtccgag tacagattat tctagactaa cacctttacc taaaggcaca cgagcagcag 12721 tcacaggacg cgaaggtgag tggttgcgtt tggactatgg tgcttggatc aatactaagg 
12781 aaactcgtcc tttactagga gcagttcctc aacagagtat catccgcagt gttggctatc 12841 gtagacttcc tagtatgaca gaaatggttt tcccgctgca agttcctgtt cctgtgagtg 12901 tgcagcaagg tgaccagact tttactctaa ctcttcataa taccactgct cagacagata 12961 ttattcgctt agatgatgat cctctaatct ctcgcttaga ttggcaacaa ttaccaccat 13021 acgtcccagg gggacagcca ggagtgcagt acacttttaa cctcaaaaaa gctcaacagt 13081 gggggtataa gctgaggtac gataacacga gtctggtgct atctttacgt cactctcctt 13141 tcatctcctc tactaatact agggaaacaa gggggatgtt cgccatacaa aagccgttat 13201 ctggaatcaa gattttactc gatccaggac atggaggaaa agaatctggt gcatccggac 13261 caacaggtta tctggaaaaa gatgtgaatt tggtggtttc taagttagtg cgcgatgagt 13321 tggtgaagcg aggagcgaaa gtggtgatga cacgggaaga tgataaggaa gtgtcactac 13381 ctgatcgtgt ggcaattatt gataaagaag aaccagcgat cgccatctcc atacattaca 13441 actccctacc cgatgaaggt gatgccgaaa aaatcaaggg aatggcagcc ttttggtatc 13501 atccccaagc tcacagcttc gcagttttta tgcaaaagta tgttgttagc aagttaggac 13561 gaccatccta tggggtcttt tgggataatt tagcactgac acgtccagca agtacaccgt 13621 ctgtcttgct ggaattgggt tttatgagca accctaatga atttgagtgg gtgacgaatg 13681 ctcaggaaca aaagaagttg gcaaaagtca taggtgatgg aattgttgag tggttccgta 13741 gtgcccgctg atgcctaggc taaatatgta cagcgttgga aaacaaggtt attctgggtc 13801 ttttctgaat taatactgat gatagtctaa taagctgttc tatatttaac ttacataatt 13861 tgtcgccgga agctttttat tggcttggct cattctggaa agttagatgt ggttcagctt 13921 agcagtattc aattaggtag aacacacagc aatgctgtgt cgctatcatg tgatttgtat 13981 aaatgacaaa tcacatgtag tgtcagaaaa aaatgcaacg tagacgtgga tgatctcatt 14041 cttaaagttt tcttatacca aagttttagt tctttatgaa gtcatttttt tgaatcaaga 14101 aggcacaata taggtagaat agtcctttgt ggcactatgc gtgatatttt attattttta 14161 ttttaggtaa ttgataatac gttatttaat cttataataa agtgagttga cgatttttcg 14221 tattatttaa tacaatattt tcagcaaaga tatttcctat ataagtagca tctacaacat 14281 aggtattttc cataactatg tagtcaactc aaatcttccc aaaacacttt ttcttaagaa 14341 aggatgagcc atggtgcaaa ctttaagcgt ttctggaaca accaactata cggctgccgc 14401 tactccccaa attgttgccg ccaatttgac tatttctgac 
cctgatagta acactctaaa 14461 tggtgtttcg gttatcatta actctaactt caagacagat caagaccggc ttgggattgc 14521 tggacagaac ggtaccaatg gcactatcaa taatctgaac tggaactacg acacaacaag 14581 aggggtttta agcataactg gtaccgcttc caacgaagct taccagaatg cactgcgcca 14641 agttacttac agtaatatca gtggcaatcc cactacagca gtgcgtagca ttgaattttc 14701 tctggggacg accttaggaa gtgcagaaaa taaccgcttt tacgagtttg tctctgctcc 14761 caatattacc tggacagatg ctcggtctgc agctgctaat cgtgagtact tcggactcaa 14821 aggttattta gcaacgatca cctctgcggc agaagaaaac ttcatccaag gcaaagttca 14881 aggaaacggt tggatcggag gtagcgatgc tgagactgag ggggactggc gttgggtaac 14941 aggaccagaa actggaaccc agttttggag tggaggtcca aatggtactt ctgttcaagg 15001 tcgttacaac aactgggcac ctggtgaacc aaacgacctt aataacaacg aagactacgc 15061 ccacatcatt ggtaactccg cgattggtca agctgtacaa ggtaaatgga atgaccttcc 15121 caacgctgtg cagagtggaa attatgtatc tggcggttac tttatagagt acggtggctt 15181 acaaggcgat ccaactctac aactgactgg tagcgtcagt gttaacgtga ctggaaatgc 15241 ttcagctaat gctaggactt ctaaatttga ttttactggc gacggcaaac cagacattct 15301 gtggcggaac tctcgtactg atgagactgc tctttggaag atgaatggta ccacattaga 15361 agagtcaatc tcacttccga aaacctttag taatgcctgg gaaattaaag gtcaaggaga 15421 ctttacaggt gatggcaaag tcgatatcct gtggcggaac tctcgtactg gcgagaatag 15481 catctggcgc atgaatggta ctaccttaga tcaggcaact ttaactacct cagtccctga 15541 tcttgcctgg gaaattaaag gtgtatcaga ctttacaggt gatggcaaac aagacatcct 15601 gtggcgcaat aaccgcacag gtgaaaattc aatttgggaa atggatggta ctgccctcaa 15661 acaatctacc ttgcttccct cagcagatac tgcttgggaa atcaaaggtc tcgcagactt 15721 tacaggtgat ggcaaagacg aaattctctg gcgcaataag ggtactggcg agaatgcaat 15781 ctggcagttg gatggtacta ccctcaaaca gtctacccca ctgaccgcat atgcaggaga 15841 tgcttcttgg gatattgtcg gtgaggcaga ctttacaggt gatggcaaag tcgatatcct 15901 gtggcgcaac taccgcacag gtgacaacgc tattttgccg atggatggta caaatccaca 15961 acaggttatt tcgttgactc cagtcccgga tactaactgg aaagtcgaag ggctggcaga 16021 ctttacaaat gatggtaagg ttgacattct ctggcgcaac tctagtactg acgagactgc 16081 tatttcgcgg atcaatggtt 
ctaatttaga agaaccgctt gcacttccca aaactggtag 16141 tccttcttgg gaaatcagct tcccaacttc ctacccggtt tgaagataag ttgcataaga 16201 gtgaattgac tctgtgattt aatttcctcg caggtgtggt caatagtttt gtttatcaca 16261 cctgtgagtt tttattttta aacaaatttt aacttcgtta gaaaattaga acagattggc 16321 attggagatt agtcaggctg tattgcctta cgccaccaaa ctcgtttcaa cggttcgcct 16381 gtttgctcat ccaatatttc aagcgtcctt gtcgcaccgc cccgtcgaaa tggaataatt 16441 ttcttaaccg aacagtattg gggtgcaacg gacgttaagc gaacgcgagt gcgtgcgcct 16501 ctggcgctta gctctgccgt aggcgatcgc ttgctgtcac gccccaaact tctcatcatc 16561 tggtgttcgc cgccactcgt gcaaaattgg tatagtaaat aaaccccacg ccaaaccagt 16621 catcaaccca aaaggcgtaa caacgtaatt attaaaactc caccaaccta aaacctctgc 16681 tgccaattcc agtggatacg ccatcatcag aacactagca agcgccgcac cactccaacc 16741 atactgtttt agccagtaaa aacctttacc acctgtaaat ccatataaca gacgtgttat 16801 caataaccct gtcacagtac cgtagcaacg catacacaca gccataatat acggtggtga 16861 taatgctacc cccatctccg gctgcggaca tacatgaacg cccatgaaat aaataatatt 16921 cgcaattacg ggtaacatcg gcaacccaga cgccgccaaa aaaggagcag taagtggtcc 16981 caccaccatc cctgctaata agaaatcagc aactgcactc acccaacgaa tttgaaagtg 17041 ctgagtagaa gaactttttt taaaaactaa tcctggcaac atttctttct tctttcgttt 17101 ttccttggcg tccttggcgt cttggcggtt cgtttttcat acgcaaggtg ggcaatattg 17161 cccaccctac cttcttaact cactttagca gtatacggac tagttaactt ctgcaactgc 17221 tgtagtaaca aactgaggaa caacccaaca tctgttacca caccaactga ttctatagaa 17281 cctctatcgc tcaactttgt caccacagct ggattaatat ccacacaaac catcttcaca 17341 ccagcgggag tcatattccc cacgccgatt gagtgcagca tactcgacag catcaaaatc 17401 atatctgcac ctttgagcag tcgggcgtat tcagtttgtg ctttgatcaa attcatttcg 17461 gtatcgggca aaggaccatc atcgcgaatc gagccagcca gacagaatgg aactttatta 17521 cggacacatt catacatgac gccactctga ataaccccag cctcaacagc tttggcaata 17581 ctaccataac ggcgaatggt attaatgact tttaggtgat ggcggtgtcc accgcgtaca 17641 gccacacccc gcttcatatc cacaccgaga gaagttccca tcatcgcttg ctcaatatcg 17701 tggacggcga tcgcatttcc acccagtaac gcctgcacgt atccttctct aatcagtttc 17761 
gacagatgct caccgcctcc agtgtgaatc accactggcc ccgctgtgac aactacttta 17821 cctccgccat ctcggatttt acgtaattcc caagccactt gttcaacgac aagttccaca 17881 cgtcgctcgc tggaaactcc cgacgacata aagctgaatt cttcagcatt gcgtttttcc 17941 cgtgattctg ttttgcggac ggtgcgaata cctagcacat ccacaaccac ttcttcgccg 18001 acttcaacgt cgcgtagtag tttacactgt gctactatgc ctttgggtgt tcgagtgatg 18061 gcgatcgccc catccatccg ctgattctgt accttcaccc attcaccgtt aatccgcact 18121 tcggtgggat aaatcgtgct aacgtaaaaa tcatcaggag ctacccctgc ttgaattacc 18181 ggctccaatt tggcatcgcg ctcatcttga ggcaagtcta ccgcacccaa atcaatcaga 18241 tgcgatatga tgctttccat cacctcatgg gaaggcgctg agacttttac ctcagctgct 18301 gatgtacttt gtcgctgctc tcccagtgag aaatttagaa cttggaagct accgccgtta 18361 tcaacaatca aatccaaagc ccggttaatc aagccagagt caagcaagtg tccttctagg 18421 ttaaagacgc gactctccac cgacacattc gcatgaagtt cttcccgtac tggttccgtc 18481 acccgtagcg tcaaacattt cgccgcgcca ccagctttga gaaattccgt cagcggtgtt 18541 tcaataactt ggaaaccaac ttctgcaagg cgtgctttca gtggttcact cgccttattc 18601 atcacaacaa tgctgtccac attcacagca ttgcaagcaa agttgaccgc atcagcttct 18661 gtaatcgcta tccgtttttg ttgcggtacc cgcatttcaa ttaagcggtt ggagtaggaa 18721 tcaaacgctg gtggatagta tagcagataa ccattagcca gcggacagaa gcaagtatct 18781 aggtggtaga aacgctcatc aatcagtcgc aacgacagca cctcaatatc cagccatttt 18841 gctagataag ggtgagaatc taattctgta cggaaaccgt atccagccca aagccagcgt 18901 ccttcccgat ccaagagtgc gtcccctgca ccttcaaagg gtaagtcttt gggtaattcg 18961 tggacagtgt agccgttatc ctcaaaccat tgcttgaaat aaggttcctc gccctggcgc 19021 tctttatgta aaaaacgact caatacaact gttttcccta gtaccaaacc agcatttgcc 19081 gtaaacacca tatcgggcca gcctttctcg ggtggtacta tgtcaacaat tgcgttttct 19141 ttaaggatgt gatgcagttt gtcccactgt tctacagcac ggtcttgcga tgacttgtga 19201 atgttccctt ccatccaggg gttaatcaca tagtctacat cgtagtagtg gggaggacag 19261 attaaaaagc gaatctggga agtcatatgg ttagatgcta ttagcctttc tcttatacaa 19321 atttttacgg tctcacaacg tatagtttaa tactgtttac cgtgctaaag tcaaatttct 19381 gtatttttgt agacagtaga atgcccatgc tctattttat cctgcaagga 
gtacaacgcg 19441 gactacttta gtactccatc ttccatatgc gctactcggt cagccaaatc taaaatccgt 19501 ggatcatggg tcacaattaa cactgtgcag ccttgttctt ttgctagtcc acgcagtaac 19561 tccatgacca aatgtccgct gtgagagtct aaagcagccg ttggctcatc cgccatgata 19621 acctgtggac gaccagttaa ggcacgggcg atcgccactc tttgtttttg tcctcccgat 19681 aaatcacgag gaagcaactt tgctttatca cccagtccca cttgttccaa caaagcttgt 19741 gcttccttgc gtgcgaattt tccacgaatt ccttttacat tcaacgctgt ttctacattt 19801 tcaatcgctg tgagtgctgg aaacaagtta aaatcctgaa aaataaagcc aatgttctgt 19861 cttcggaact tagctagttc agttcgagac attctcgtaa tttcttgccc aagtaaataa 19921 acattgccag ccgtcggagt caaaagtccc gctaaaatag acagcagagt cgttttccct 19981 gagccagaag gtcccattaa aatttctata tcaccctttt tgatctccca gtaaactttt 20041 ttcaagattt gaaggcgctg ctgcttggaa ttgataacca tttccacccc cttggcaata 20101 attgcccctc ttggggtgta attcatttga tatttgctct gagcaaattt atcaaatttc 20161 tttttataat cagccatctc ctcaaattct ttaggcatat atagttttta attattagtt 20221 gatagttgag tgctttcgac tactaaatat ttactgttcg gtttttgtca taattagcac 20281 tgaatttcag aaaagttaat aagtaccaag aaaactatcc tttaaaaaca atcgctggat 20341 ccacgcgcgt cactttttga atggcaaaaa cagcagaacc gacgcacatc aacactgtaa 20401 ttccgaaaac aatcagtgct gatactggtg taattaaaat gacaattcct tgtgttgttg 20461 atgtccaagc tgctactcct acgcaaagag cgattcctgg tagatagcct aaaattgcca 20521 tccaaattgc ttgctcaata attacattgt aaataaacca gtcagatgct cccattgctt 20581 tgagggtgcc aaattctttt atatgatctg taacagaagc atagagaatt tgactcacta 20641 caactgctcc aacaacaacg ccaaccaccg caccaagacc gagaataaat ccaattccag 20701 agcgaacctg ccaaaaatct tgtgtgattt tagacatttc ttgacgagtg taagcacggc 20761 tatctggcaa agcctgttcc aaatcccgct tgagtttcgc gatgttttga cctcgttttg 20821 cccgaaccaa aacgaagcta atctggtctg ttgatactgg ctttttagca gaggtaccca 20881 cattatttgg agaagctgtt gtctgagttc cgtaatttct ataagtatta gcagtctcta 20941 aagacgtgaa cattaaagta ccgaagacaa ttgattgagt tccttgagtg aaacccacta 21001 actttgctgg aatgttgtta atttctccta cctcacccaa ccttttgaga tcaatagaat 21061 tcaggttagt tttgtcaatc atgaaacgga 
acgattgctt taagtcattg aaacgacctt 21121 caacaatatt tgagcggtca aataacatcc cctgtggatc tgcaccaacc agcgtaatag 21181 aattaatctt gtctgttgca agttcacgcc aaagcccacc atcaataatg acagcttcag 21241 ctttgtctac acctttgact ttgcgcgcct tggtaactcg ctcgtaggga attggtatag 21301 ttagccccaa atgctgcata ttttttgaag acacccaaat atcagcacgg gattgatcaa 21361 ttaactgaga agaagaacga gcaaacccat attgaagacc tgtttgaatg gtaactaggc 21421 taacagcaaa taaaattccc gcctgcgcca ccaaaaagcg aggaatgtct tccagtaaat 21481 ttttacgagc aattgaagcc ataaaatgaa tttaatcatt ggtcattggt cattggtcat 21541 tagttgtgct actaatgact aactgggcta actagcacta ttcgttttag aatatactct 21601 tcctttccaa gttcctcctc gcttttgcca atgacgcaca gctgagtcta tagtcataag 21661 cgtatatagg agcgcaattc ctggaagaca cagtgccaac caaggtgaac acctataaaa 21721 tcggatcgta ggcaagtaag cgcaactcat aagtagccag cctaatgagc cgacgcacgc 21781 tacaagccaa gttcctgtga gcaaaccgaa gattgcacta acaggtggaa ctatgtaaac 21841 aagtatcatc cctaagactg ctaagattaa cagccctaaa gaataattta gctgggtaaa 21901 ggcagttcgg gctaccatgt cccaaactgt tgacagcgag ggataagatc gcaagctgtg 21961 agttgagtcg tctagtccta gccaaatttt gtgagatgag gagaaggagg gagggagtga 22021 gggaatattt ttttcttctg cttctcgtgc tttcccaatc gcctcatggg gaagttgggt 22081 aaaacagcta gactttactg cttgccctaa ggcacaatca tcaatgagcg cattacgaac 22141 aacttgaata ccaccaatcc gggttaaagc ttcactcgca attaagatac aaccaccagc 22201 agcagcagct gttggttttg tagaatcgtt cnnnnnnnnn ngttcaccca agggaagggg 22261 taaagttttt gaaagaaaaa gacaaacgct ggaatgagaa atttttccca aaaactttca 22321 cacctcagca gcaccatgag agaaacaagt tctaaattct cttgttgcgc ctttgctacc 22381 aaggagcgga gattgtcagc atcatgttca atatcagcat ctgtcaaaat aacatactct 22441 ggtgaatgtg tcaaggtttc tacatactca ataccttggt gtaacgccca taatttgccc 22501 gtccagtcag gaggtaaagg ttgtccagaa aggacgtgta attgctgagt tttgtttaat 22561 tcttgggcaa caccagaggc aatatttgct gttccatcag tactgttatc gtctaccaaa 22621 acgacggtga aatgaccagg atagtcttgg gtgagtagcg atcgcaaagt cacaggaagt 22681 aattcagctt catttctggc aggaactacc gcacaaatgg cagggaactt ttgtaaatct 22741 gtttcctgtc 
ttattaatcg ttggtcacag cgccaaaatt gaccccaaaa ccccagtaat 22801 atcacccaaa ttgtcaagga tagaacagaa agccatagtc caatttcacc gctcattgtg 22861 ttttcttttt ttcatataga aaacacatta caatgtcacg attagacagg caaagaattt 22921 tcttgttaca gcttataaga agcagatgtg gttgaggagt aacggaaacg atgcaaattc 22981 aagataaaca gacagtttcc cgtgtcaagg atgcaatcgc caaaaatcag aactatcttc 23041 tttccattca atatcctgat ggatactggt gggcggagtt agaatccaat gtcaccataa 23101 cagcggaggt tgtcctcctt cataaaattt ggggaacaga ccgagaaaga ccattacaca 23161 aagttgaagc atacttacgt tctcaacaac gggatcacgg tggatgggaa cttttctacg 23221 gtgatggagg agaactgagc acttcggttg aagcatacat ggcgctgaag ttgcttggtg 23281 taccagaaac agaccccgca atggtgaagg cgcggaagtt tattttagaa cgcggcggta 23341 tcagcaaaac tcgcattttt accaagttac acctagccct gattggatgc tacagctggc 23401 aaggcattcc ttctttgcca ccttgggtga tgttgttgcc cgataatttt gtgtttaaca 23461 tctacgaaat gtcgagctgg gcaaggtcaa gcactgtccc tctactgatt gtcattgacc 23521 gcaaacctgt ttttgtgact gatccaggaa tcaccttgga tgaattgtat gctgaaggtg 23581 tcgagcaagc caggtatgag ttgcccagca atggtgattg gacagattta tttattactc 23641 tcgataatgc ctttaagtta gcagaaaccc tgaatctggt tcctttccga gaagaaggta 23701 ttcaagctgc agagcgctgg attttagaac ggcaagaagc aacgggcgat tggggtggta 23761 tcattcctgc gatgctgaat tcactgctag ctttgcgcgc tttggattat gacgcagcag 23821 accccattgt agaacgagga ctacgagcag ttgacaactt tgccatagaa actgctgata 23881 cctacacagt gcagccttgt atttcccctg tatgggatac tgcttgggta atgcgagctt 23941 tgatagagtc aggtttggca tcagatcatc cggctgtggt tcgggctgga gaatggttgt 24001 tgagcaagca gattctggac tacggtgatt gggcgattaa aaataaaaaa ggaaaaccgg 24061 gggcttgggc gtttgagttt gacaaccgct tttatccgga tgtagatgat actgcagttg 24121 ttgtcatggc tcttaatgag gtgaaactgc cgaatgaaaa acttaaagct gctgcgatcg 24181 cccgtgccgt aaactggatt gcatctatgc agtgtcagcc aggaggctgg gcagcatttg 24241 atatggacaa taatcaggag tggctcaata tgatccccta tggtgatctc aaagccatga 24301 ttgatccaaa cacagctgat gtcaccgcta gggttttaga aatgttgggc tgtggtaatt 24361 tgtcaataga tacacgtaac ctagaacgag caataagcta tctcatccgc gagcaagaaa 
24421 ctgagggttg ctggtttggt cgctggggag taaattacat ttatggaacc agtggagtac 24481 tttcggcttt gtcgttaatt gcgccagaaa agacgcaagt cagtatagaa cggggtgctg 24541 cttggttagt cgggtgtcaa aactcagatg gtggctgggg cgaaacttgt cgcagctaca 24601 atgatccagc cctcaaagga caaggaccca gtactgcttc tcaaacagct tgggcaatac 24661 taggcttaat agcagcaggt caagcaacta gcaagtttgc gaaacttgct attgagaagg 24721 gaattaacta cctgttagaa actcagcagt ctgatgggac ttggtacgag gcagatttca 24781 cagggactgg ctttccctgt catttttatc tgaagtatca cctctatcaa caatattttc 24841 ctctattagc tctgggtcgc tatcaaacaa tatcagagtt atggtgagag gctgggtcac 24901 gaaacatcaa agttaagtac cgtaaaactt cgcttaattc ataaacaaat atagtcagat 24961 aagctgctta ataagcataa agctatctga ctattgcaca acaaattggg aatcatatta 25021 ccttcaagca gcaatctgaa taacaatgct gacttaacag cttacaatct cagggcgagt 25081 caacctttga gtcgttatgc tttgaagtgg gtgcaccaca attgctctgt cctaaagtgt 25141 caattcatct tagaggagaa aagcagcgaa acagtcaagt ctgtctggag ttggatcaca 25201 aataacagcc gtctgctcat actgaaaata cggcatatca aacagttaaa gcatgaaaaa 25261 ctctccacag gatttagttg cagttgaaga ccttcttatt gccgtccaaa ttacagacac 25321 tgatgacatt gttatagttc aaggtgaagc gatagttgac ttagccgaat tcaatctcac 25381 acctcaagaa gttgataaat ttgaaaaaat actgaagaca ataaatggta ggctggcgga 25441 atcatttcaa aaacagtttc ctgctacctc ggttatctct gaaattcgga aaaaatcaca 25501 gtgcgcagac gcatctacct aaagattgaa tccacttggt agaagcaaga agataacaaa 25561 agtgagtatt ttttatctta attaaccgat aactcactaa aaactaagct tctagctagt 25621 ctgtcgctag aagctgttat ttatgagtca ttacaatgaa tgataggaag taataagttt 25681 tgctatctct atgaccaaat catccggctc aatgggcttt gccaagtatg cttggaaccc 25741 agatatcaaa gcaaggtcac gcccctcttg tgtatccata gccgttacag caatagcagg 25801 aattcctccc aaaggaggac taagagttct gactttacgc attaacgaat agccgtcaac 25861 ttctggcata gcaatatcag aaatcaaaag atttggttca aattgtccga tgacctctaa 25921 agcatctaag gctgatgctg cagtcatcac ttgcacgcca tagctttcaa gaataaaggt 25981 gattaagata caggtatcag cgtcgtcatc tactacaagt atccgtaacc cccctaaaga 26041 ctgaaaatct tctatatcat cgttaggatg acgattctca 
gagtcattcg acacgataaa 26101 agtgcctcct agttgcttac tcaatgaggg gggaagtcct gctgggagct aggtgggaac 26161 tagctaccta gtgggcgttt gaattatcag tcatgaaatc aactttcaat catcgttagt 26221 cttattccct atgagatacc tatatatgcc gtcttcacac aacccctaaa aaatttgata 26281 attttagcag ttgatattaa ggtagctgga aacgcctaat tcaagccacc agcccccaaa 26341 ctttcggtta ctcacagata gccttaattc taatagaact gtgcagctaa ataccgtgag 26401 caaagcattt ctaggcgtgc tttcaccagt cagccttcca ctatttctta ctactctatc 26461 tgtgcattat gacgagaggg tatgagtagg aaacgactat aaatgtacgg atagcctgac 26521 tttgctaaaa cggaacttaa tgattagcta aaagacatcc agtaagactt gccaactaaa 26581 atatgactga tacgatttca ccaaaatcct aatagaattt acttgccttg cctcaagagc 26641 ctcccccaat tcatgcagat ccgactacct taccccaagt tatgttgatt tataaattta 26701 caagacaaaa accaagcgcg aatgggaaaa gcgtgcgaat ataaagaaaa ccttaagacg 26761 ttaatgatcc atgatcaccc caaaaatgat gcaactcctc tgggctgtca tcgaatctac 26821 ccacgttagc acccttctgg ggtttgacga tgctgcctta gtacaattgc tcctcaagca 26881 gtttaaaacc cagcaggtgc ttgatgctca ggcaacaagc cgcctcaata cttacatcaa 26941 atctaaactg cccctcattc gcgacacagc cgcaggcaga ctatccaccg ggcaaggcag 27001 ttattagcac acaagagcga acacacaaag agcgtgcaaa acatttcctt ggattgaaat 27061 cagcacaatg ttatgaatca ctggctaggt attgccctgg tagcttatct catgggagcc 27121 gcacttgagg gggtaagtac agccagtcag ctatctcaaa aggtgccgga gctagctaaa 27181 ggcacacaag acaaccctga cgaatctgac gctccaccaa cagggcgttg gcaaattttc 27241 gcagccattg catcggtaag tatttgtgga gcttgcgttt ggccctgccg cctgctccac 27301 cgttcaataa aagggaagca ggattgccaa aaggactgaa agggcatgta aagttcaata 27361 tctatacatt tacactccca cagcttgtca ccgtggaagt gtcaaagagg tttatattca 27421 aggtctttat aatggctgac ctcgtttgaa agaagaagtt ttatctttta tttagattct 27481 catctaatta gtcattaatt gtaatactta atattttttt tagatttggg ttctgaatat 27541 ttttttgttt ttttcccatc caagattacc aataatgctc cagatgttga gaattgaggc 27601 ttttgtgtgg gtaagagtga ttttccttat ccctgtttcc atagcgtgca ttggcagtca 27661 actcttattt ttgttaattt ttctttataa tcagttacaa tttcattaga tagcggtcta 27721 attaagattg caaggaaggc 
aacaggtatg cggatgcaag gtgtttcact ggcagaggca 27781 gcaaggcgct tgggcgtgag tcaaagcact ttgtatgtgg ctgtgcaaaa aggtcagatc 27841 cctactttta gaagagatgg cagaacagta gttgctactg gagcattaac tgagtatcaa 27901 atcagacaac gcccaatttc tgattataga ctttaaacgt ttcaagatta aatcacgtat 27961 tcacgcagtt caagagtatg agataagcgc tagtgatatt gcttgatcgc tcctgagcac 28021 tttcttattg gtgcccgcaa cttcatatct gaagggtgcg cgatgcctta tgcctcagcg 28081 tgttctttat ggtttccacg ctacctggat ttcgtctgaa caaatggcta ctcaagggag 28141 ttaatgctca acccacattc tatagttgac gttttgtagt catatttctg tgagaaaaac 28201 caaagtcagt agatcaaatc gatctactga tttttcgttt atctgcgtct gtcgtgtatt 28261 attgaccagt agcattgatt ctccagctta tgagtatccc attcaatcaa aaatttggac 28321 ttacagttag ttttgctacc ttattattta tcggtttagc caacactcac ctacgtgctc 28381 agtcaaaaac tcaacagcta tctcaacaac atacggcgac acaaaattta tctcatactc 28441 aacagcaaaa ttctgcccta cttcaacagc agattccctc agcttcaagt caagagaatg 28501 cttttattgt tcagcaaacc tcagataaca atctagacaa acaaaatact cgtctttctg 28561 agccaaagtc agcaccaaca tcaacatatt atcaagttat tgacgagtta caaaaatata 28621 aatttagtgt tcaagggaat caggttttca cttccggaaa attgccgacg acgaaagtta 28681 atttcaatca atcagactta ctcacagttc tcgttaatac tagaaagtat tatcaagatt 28741 acgcttcaga agaccaaaag gttttacgga caggagtact tgctactcaa ggagtaagtg 28801 tagaagacat tctcaaaacc ttagacttca tgattactgt tttgcaagaa gatattgcta 28861 ataatcgagc gactcgttta caagatccta attttatcaa cactaatttc cgtgttatta 28921 agtggtctgc ctataactct ccaagttcta ctagtcaaaa acaattgcga attacaaaat 28981 atgcagtttt tactcatcca gcttctcata agaaaacatc taaatataac ataccaattt 29041 atagcttaaa agataactca atagctgaaa aattctacac taaatataca aagcaggatg 29101 tgttatcagg tatttatgaa tcaggtggta aagaatttgg aaaagtagaa cccctcgctt 29161 acttaacccg agaaggtttt gaagaagcgc ttatggaggg aacaatactt cttaatttta 29221 cggatggatc caaagcactt tttaatgttg atagaagtaa tgaaatgcct tatcttcgag 29281 gagttgcagc cacatcacaa aagcgctatt ggtattttag acaagtggat gatatcaagg 29341 gttatggata taagatagat gcaaaaattt ctatcaaacc aggagtgact tttgctggag 29401 
atgtcttaaa tattggttta ggtagagtga ttgttcttga gtatcctaaa gatggacgca 29461 aacaactaca attaggagtc cttgcagata caggtggagc atttttacct aatcttcatc 29521 aactcgattt tttggcaggt atttttcaag gtagaaaaga ttttgggcag catactaggc 29581 aactacctga atatgctaca gcgtatattt tagtgaaaaa gtgaacaaag agttttttca 29641 accacaatct aatactcttg tataagtgga tgaagcatca gttcatcaat gactacatgt 29701 gatggtgcac tgagcgtata aataatggca tcggctacat cgtgtgcgct cagccatcct 29761 tgatgtgcgg gatctccctg aggttgtgta tggaatgatg tgtcaaccag acctggaaaa 29821 atagtgctta ccttaatgcc aaatcgtcga acctccttgc gtaaagccat cgagaaggca 29881 tcttgagcga attttgtcgc acagtaagca gttcctccgt cacaaactcg ttttgataca 29941 tcggatgcaa ccatgacaat gtgtccgtgt ccttggcgct tcatatgcgg tacaacctca 30001 cgcgaaaaca aaaaggtacc tgtaacgttg gtatcaaaaa gctcctgcca ttcaccaaga 30061 gagtactgtt caacttcgcc aaattttcct atgcctgcat tattcaccag aatatcgatt 30121 gaaccgaaac gttcaacgca gtgttgaacg gtggagacaa caaaaggttc tgctgcaacg 30181 tccccggcaa atacttcaca ctgatgtccc tggttttgta ggtgcgcttt gagttcacct 30241 aagagttcac ttgagcgagc aactaacgca aagcaaatcg tattgtgtgt ttcaagcctt 30301 gacaccagtg ctctcccgat gccttttgtg gctcctgtga tgaggattgt tttctctttt 30361 agcatgaata cattactgct atggcttctg gtctggcatt gttatccttc ttttctacaa 30421 caaacatctt aatatgcaat atttctacat tgaactaaac aatttaagca tgaataaaca 30481 gtaatatgga acttcaatat tgattaacga ctgtgcaagg tgcggcgctt ggcgacaaca 30541 tacccggaga gcgctatcgt tactttgtcc ggcgatgtcc gagcaaatgt cgcgtaaatg 30601 tatcagtgat tctacatttg tttatataat tgaatttttc tcgccgactt acttatagcg 30661 accaagatgc tcgcactcag ttcacacgca ttgagatgtt cccactgaga agctcatata 30721 acttggaaaa tcttataacc acacgattaa ctccagtatt tcacatcaat atactgtgag 30781 gttattacgc attaagtgtg ttaaaaatgt gcgtgacacc attcatgacg cccatctata 30841 aatgatgtgg agttttaagt atgtttcgtg atacaacgac tcgtcctgaa ttattggcaa 30901 ggattgccga taatcagcac aatccccgcc agattatcag tgtgcaagaa ttaaaggttt 30961 tgaacgagcg ctctaactgg aaagggctgg ttcaactggc ttttcatcta accgtcactg 31021 gatgcagtgg ctacctgtgg gcgacaaact ttggcaattg gtggttggca 
ataccagcat 31081 tggtgatcta cggttttagt attgcttgca tgtttgcgcc tatgcacgaa tgtggtcaca 31141 gaaccgtttt cgttaacaat cgtttaaatg aaactgttgg ttggtgcgca ggtttactat 31201 cattttacaa cagtacattt gaccgttact accacaagtg gcaccatctc tacactcgca 31261 ttcctggcaa agatccggaa ctaactgagc ctaagccgag caatctgggt aaatacttgt 31321 taataatcag tggtttacct tggtgggaag gaaagatacg cgggcatttt cgcgcctgca 31381 tcggtcaact tgacgattgc ccatttgtgc cacggacagc acgaggtgaa gtgattcgat 31441 ccactcgatt acaattagct gtttatgcag gggcgatcgc tctttcattt gcagtcagac 31501 agccttggtt tgtgctttat tggctgctac cgctcgttgt tggtcagccg attctgcgtt 31561 ttcttatgct ggcggaacat acaggttgca ctcttgacgc taatctgctc acgaatacgc 31621 gtacaacgct aacactttgg cctgtgcgat ttctcacgtg gaatatgccg tttcatgcag 31681 agcaccattt gtacccatca attccgttcc acgcgctgcc aaaagcccat aagcaattga 31741 gttcgcactt tgcccacatt gattcgggct atataaaagt caactgggat attgtgtcta 31801 aacagggaaa gtcggcggta tgacaaaact tctttgttga taccccaact gttcgagggt 31861 taccaaccct caggaaatca tcttttccaa cgccatttgc aaatcttgcg ttatctcatt 31921 aacagcgcct cttgctgctt gacgacttgt ctgataaact gaccaacgtt gtgtcactga 31981 aattggctca cctactgtta accgtgattt tcgccaacct aaccgaggtc gtgcaggatt 32041 tttctcccct tttatccgtg cgatgacgtc aaataagatt aaagaggttt cggcaaatct 32101 ctcaaaagta ggcttttgtt gaacataagt tcctgtgaca gcgacaaaac tctcggctag 32161 gcgcatatgt agtatccgca aagatgcctc ttcagcaatc cagtctgcca agccgcgctc 32221 cataggtgat aatgcatgca agtttggcaa atcctttctg tggatatcat cccagcttgc 32281 tgactctaag cggcgacaac gctcaatgat agttccttgg ctttccagtc caaaaaactg 32341 ctcagctgct tgtagggaat tatctaaaac cgtctgcagt cgagcctcaa gtacgtggtt 32401 acgacttgca gatggattta tttgcgttgt gactggtgta ctttgatggt aaaagcgagc 32461 gtaaaattgt tccatttggg aaaggatatg ttcaccaagg cacaacagtc gctcatagaa 32521 aactttttct cgctcaacca ggtttgattc accaactttt tgcacttgtt tgccgcaatc 32581 tgcttccaat ttgcccaaaa gccaatctag ttttgcccaa gatggattga tgtagcgata 32641 ctgattggcg attggcacaa taaaaacttc ctcagttcgg ttcgctttaa gcaaatcttc 32701 tacacaccag aagcccatat gagcaacacc 
aggttctaaa gggctgacaa tttcgctatg 32761 tccattagtt gcgccttcag gagctacact cattggtagt tgaccattgg cgaataaatc 32821 ccgcgctgtc cgcataccaa ccttgtctag ccgcttacct cgatgaatcg gaaagccccc 32881 taacccagaa aagaaccaac ctagccaatc cccagcccac agagtcatcc cccggtcata 32941 gagaaaatga gaatgaatcg gatattgcaa tgaaattccc ctttgatgag caactttcgg 33001 tacagcgtgg gagagtaaat acatcatgca aaggggatca tcaacctctg ggtggcgaaa 33061 tgccattaaa aagcgaattt tgccagcttg aaattgttgg tagagatcaa ctagcctctc 33121 gacattcaca gtttcaacct ccgcgatacc agctggtaac cacggtcgag tccgaaaccg 33181 tagcaaaaca ggcagtgccc atttggtgat atggtacact actgggttaa aatgctgggg 33241 aataaactct aggggcggtt gagcttgttg aatcgaatga ggcaaggtgg ttctccaagg 33301 tgtgtttgtg acgctcctac accgactgca agcagtacgg tgtgggcttc tggcagaaat 33361 tatctttcgg tgaacttcgt tcgtggtact ccaactgttt tccactaccg cagtttatag 33421 tcaggttcgc gtcctgcccg acttgaataa ccacagatac ggctctatgt caagccacat 33481 cttcggtaat tttggaatca gccattattt atgagccaaa aagaccacca aagttataac 33541 caactcctat tgataaacct atatcagtat cactgaaaaa tgcggcattc aaagcagctg 33601 ttgctgtaaa tcgaggagtt atgggcacat caactccacc agttacaaga aaggcaacct 33661 cagaaccatt accagtgttg atagctgcac ctgccccgat gtatggggca ataggtaacg 33721 gagcagcaaa aggatctgat aactgcttga atgaaaggtc gtaggttacg ggaattaaga 33781 tacttgtgtt gtctccgatc actgctgcgg gtcgcactga tatggcattt gttagcccga 33841 ttttgctgat aaccataaag ttaccatcac ccaaagcaga gttaccaccg agaccaatgt 33901 tgccaccaat tccaatataa cttgaaccac cacgggttgg tctgcctggg tcaacactaa 33961 tatctgcttg tgccacttta gatttagacg attgagaagt tgattgttga acttctgcat 34021 tctgggaatc tatgagtgct gcagatgaag ttgatgttgt cccaggaaca ggtgtaacta 34081 cattattcgc ggcttgtgtg ctcgttggta actgctgcgg gtgatttaca tttgatactg 34141 gcgataaatt atcttgtaca gcagtggact ctaccggagt agggctaaaa gctggtgggt 34201 agaacctttc tacttcaaga tttacctgat taacaaacgc atcagtgctg ggagacttga 34261 tgaactgctg agtgcttgtg gaatctaccg tttgggcacc agcagataaa ccgcttccca 34321 aaaccgccat agccgataaa ctcagcaaag aaaatacatt tttacgggaa ataataatgt 34381 tcacgttcac 
tcctgaatat tgcctaaatg gataacagtt atgtggtttt tagtcaagtt 34441 ttggcaatat ccactattaa acatcaaaat accaatttcc acatcttcta tttgactaga 34501 ttacctaaag tttgaaataa gaattttgac gcatagattc tcagtttgac agatgacaaa 34561 aaatatttat atcatgggaa atataggaat ccagtttggt ttatgagcga actcgttgag 34621 ctagggaact cttaacgctt aacagggaac ataaactgta cctagctgaa caaaaatcaa 34681 atacgagtcc tatagatcta ttggtaataa gaatctaaag caataaaaga aagctctaag 34741 ctacatcagc tcgttttctt ataaaagaat gagatttgac tggctctgac gctccaagta 34801 ctgactcttg taaaagcagc atctgcgaaa gtcttttggc gcatagcaga tacaacatac 34861 acaatcatta atttggaaac aaattttttc caaattaatg ttcttcttaa tacactattt 34921 atcttattat agctttatac agttttcata catggctcaa aagtgtgcaa attttggtca 34981 aaacccttaa aaggactata ttttagactt ttgctaactg tagatacact gaaattatgt 35041 cgtttttttt tatgaaaaat atgtgaagtg acgattttgc tttacatttt catgataaat 35101 tgtcacatgt aatttacatt tacagccgta aaagtgaaac attcaacaaa atcaaaccgt 35161 aaatacgcct aaaaaatcga gaactaaagc acattgctga tcttatatcc aagtactcct 35221 ctcgtcctta tgttctacag acacgctgct ttgaacatga agtgtgcact ttgggtttat 35281 gggacactga ctcgcgctca ttcaaaattc aaaaagcttt gattctgagc tttctagctg 35341 ttttgaatca tatgctccgc gcccaacgta ctagtaacga aatattgtat cggctataga 35401 aaacccaatt cagttaagtg ttctcagttg gatcatcaaa acagaactga tggcgtggag 35461 tgtacgcaat actctcaaag gcgatgtttc ctgtgtgggg tgaatcgtca aatcctctag 35521 aaattcataa ttctcacata actctcattg aaatctgttc attggctgaa agataggtgt 35581 tatacacaac taggatgaaa atgtaaaatc agcgcattac cgaactctca atatgctctc 35641 tcccttctca ttatctgctg agtaaattaa ccgaaagtaa atctaagccc aactgtgcaa 35701 gctatgaaca aagtcaaaat tttgaacttg gagattgaca atttatcgaa gttagagttg 35761 ttagagaaat tgcaatcggg agtagtattt acacccaatg ttgatcatct gattaaacta 35821 caagaagacc cggagttctt acaagcatat agcattagtg actataaggt ttgtgatagt 35881 caaattctcc tttacgcatc taagtttttg gggacgccaa taaaagagaa gatctcaggt 35941 tcagatttgt tcccggcttt ctataattat cacaagaaca atccagacat caaaattttt 36001 cttttgggag caggtatagg gatagccagc aaagctcaaa atgaaatcaa ccggaaaaca 
36061 agcaggaata ttattgttgc ttcctattct ccaccttttg gatttgaaaa agacgaacaa 36121 gagtgtcaga acataattaa catgataaat agctccggtg ctactgtctt agttatcgga 36181 gttggtgccc caaagcaaga aaagtgggta tgcaagtaca aaaatatgct acctcatata 36241 aagatattta tggcgctggg tgccactatt gattttgaag caggaaactt gaaaagatct 36301 ccaaagtgga tgagtgaagt tggtttagag tggttgttca gaattttctg tgatccaaaa 36361 agattatgga aaagatactt aatagacgat cttccttttc tcagattaat tttgaaacaa 36421 aaactcaatt tatacataaa taaagaacac aagaagaaaa acccgatttg gcaaattgct 36481 aacaaatttt aggtctggct ttaacactca cttttcactt actcagctaa gtttgctaag 36541 taagtttttc gagataacat ttctggtgta gtattgagtc ctactttctc tataatttgc 36601 tcccaacggt acagccagtc atgacgcagt aaggaattca tgagattatc tttacgaatt 36661 ctatttagat gatcaggtcg tgcatctaat tctgctatga tatcggctac attagcagca 36721 tcgtaaggaa tttcaattac tgcattagac cagttaaagt aagttgtgta agcttcacaa 36781 actggaggag tcccaatcat aactgttttc cctgcagttc cctcgaaaaa gcgagaacct 36841 aactcatctt gaccacctgt ttgactgttt attattctta atgctactac ttgattagtt 36901 aacgcagcac ctgattcaaa agtttctgct tatcagtatt gatacttttc cagtcatcta 36961 gtaagatacc tccggtatct ttactattgg gattccagca ccagtaggta aagcttagct 37021 ttttcttgtc aacgtaatca acgaactgtc gctgccaaat tccctcttta gatttagtat 37081 ctacttgatt tccaccaaat tcaccaatcc aaatcggggc gataccttga gtcgcaatat 37141 agtgaaatcc aatctccaaa cgtttataca aattaccagg aagtatcggt tctttaaccc 37201 acgggagatg agaagctgcg tactcatgag gagaatatac tagctttttt ggcaccttca 37261 aacgcactgg gtatttcagc acaccttcta aattcgcacc ccaagcatag attggctgtt 37321 tctgacctgg cacatttccc acaactcctt ctaccaaaat cagccatttt gggtttacac 37381 taagaatttc gtttccagcc cgttctgctg ctaaccgcca gtctgtggcg cgatcgcctg 37441 ttccccaact agcagaacca tgaggttcgt ttttcagatc cgccccaata acattgcttt 37501 ggtttttata gcgctttgcc aaaaaagtcc aagtgtcaat ccagtccttt tcagtaaaac 37561 cgtcaccata ccataactct ggaatcttgt catctttaag agtgtgactg tctaaaagaa 37621 taaacaatcc ctgacgctgc gcctcctgaa taatcatgtc catgacctct attggagtct 37681 tagcttgaaa atctttattt gcaccaatat aaaaatcaac 
accactgata tcacgagaac 37741 gcagtgcttg tacagaatat ggcaagcgaa tgacgttgta acccagactt ttcatctgag 37801 caagcatatc tttgtaactt ctcacccaaa gtccatgagg aacatgatgt atttgttcta 37861 tcccaaacca gttgacgcca cgcagtaaaa caacacgatt tttagcatca ataatcgcag 37921 agccacgagt agacaaagga gggacaatag tcgtagcagc ttcgcttctt gccatgtcaa 37981 attgctttac aaacaacggc gaacaactac tgaaaattgt gtaaagtaac actaaagtca 38041 agccaactgc aatgtttttt gtgatttgtc cccacttttt gtaatagtaa gcaatggaaa 38101 aaccaatatt gcgtactatg gtaaaaccct gtttcacaaa aattagtcta ctcacaagac 38161 atcgttagtt atttgataga actacatacc tgctgcgtct gcaactttat ctaggattct 38221 gcccaaagct gctcctgcag catcacaaga aaaatgttct tcagcatagc gtttagctgc 38281 ttgaccaaga cggttgcgta gtgctgggtc gttccataac atttctatcg cctgcattaa 38341 gtcatcaaca gatttcggtt caactaacaa acctgtctcg ccatgctgaa tataatcttc 38401 tgctccatga caacgggtgg cgatgattgc acgacccatg agcatagctt ctacaatagt 38461 gacttgtcct gctgctgtca cttggggcaa tggacgcaga ggtacgacgc tgatgcgcgc 38521 ctcttgagct aaacgcaggc aatctgccct ctcaatgcca aacggtgttt ttacattcgg 38581 cggaatggtt aatccttcta atgcacgttg gcttgatgcg acaaccgttg gtaggtttag 38641 cttttctact gcttggaaga aggtggaaaa gtcgcgatga gcagaaccga gtgctgtgat 38701 aaaaggatgg gtcgtattct cttcatatgt tatgggaatc tctggttgat gataggggac 38761 aaattcaaaa cgctcctttg ggattcccaa ccagtggtgg tagatatcta cttcatggcg 38821 tgtgtggaca acaaagtgat caatgtgttt taggctaaat tgggcgatcg cctgccgaat 38881 cccactagaa caggtgccaa cattaaataa ccaagctact acaggaatgc gcttcttgcc 38941 agtcatccgt tgttgcattc ctatgactgc tggtagctga ggaaaaaggg tgatcacgcc 39001 cccctcagac gccttcatcg cttccatacc atgttgccaa tatattagcc aacttctgaa 39061 tccagtcaca gaagatttct gattgtgcca gtttcctaac ggttttgaac gtggaacgat 39121 atggaactgg tggcgatcgc cgggaacatt tcgagttaac caaggctcgt ttacgaggtt 39181 ctgttggtta ataaaagggg ctgcaattgt ccaatgcata taaattgaga atacctatgt 39241 agaaattctt tatagtataa atgactcacc atgtacaata tgctttcatg ttctttgaat 39301 ctgtttttgt cagtttacag taaagacttc tttgttgata tccgacacaa caccttcgac 39361 gacttgagga ccgaacagtg 
actcaagcaa ttcttgctca acattaacct ttaggggagt 39421 tgataacaat ctcactattt cttcggcgaa agattgagca ttaacagcta ttttaaaaag 39481 ttttctcacc tcttctggca aaccagctat tccctgaggt gtagaaacaa taggtctacc 39541 agagacaagc atttctatag actttatact caccccgcta cctgttaaaa ccggattaat 39601 cagaacacgt cccgactggt aaactgctag agaggatgct ggattgatgc ttaaagagac 39661 accttggtgt tcctcacata gttgcttgac tttctttaca ggatttgaac cagcgattag 39721 cactttaaca gtgggtaatg cagagcgtat aatcggaaaa acctgtgtta tgaaccagac 39781 tataccagct acgttgttgt caacttttag gttccctaga aacacaacgt catacacaag 39841 attggcgctg atttgatcat gggcttgttc tgaataatta tctttaggaa actcaacaat 39901 aggcggtaag tagcgcccat tcctaaagcc ttgactttgc caaaatttga agtcatctac 39961 agagatatcg tagaataaag tactcttttt cagtaagtct ttttcaaaac tttccaagtg 40021 agatactgac aaatatcttt tgagcttact tgtaaagtct gttgtagatg caagtaaccg 40081 acggtaatat aggtactcta tattgtgaga gcgagtgatc aggggtacat ttaacttttt 40141 gcttaactta tctgcgattt ctccaccgtg aattccatcc aaccagataa catcaggatg 40201 aaaaagacgg acaccagaaa gcagagtcgt taactctttt cccctgacaa tccgagaagt 40261 cacttctaaa ggataatgta ataagtcaat tgccctacga gcaagagagc ctagtgtacg 40321 tttaaatggt atgagagaaa tttcttttac atatttctta atttcagcta tttcctctgg 40381 ttgtagcggt tcattgatcc aagatatgag ttgtaactcc acgccgatat tagagaaggc 40441 ttttattctt cgccacatct ctactcgcgc accacgattc ggagggtagg gaatgtcatg 40501 acatattaaa gtaatcttca taattttaat acgttctcaa tttattcttc cttcttcgtc 40561 tgccatcaca ctgagataat agttgtattt ttttgcttgt tcatactcca cagccgttga 40621 gaaaattatt gatatgtata atatccaaaa gatactgtta gttgcgacaa gagtgctttc 40681 tgtgacgttg gataaaaata aaaacgtcag atacattaaa ggccataaac cttctaccac 40741 tttagtcgct cgtaaccaca tgactcctct tatacaggcg gttacatagc ttatgcaaaa 40801 aacgattaac cctactatcc ccaattctgc caataaatcc ataaagccat tgtgaccata 40861 aggacattgc cattccaagg tgcgccaaag gtaggctgtc acctcattat cccagctttg 40921 ccaaaccgca ttaaaaccgt agcctaataa agggcgttcc caaattagct caaacatgac 40981 tccccaaata tctgtgcgtc ctgtcagtgt caaatcccta cctaatgcag tcgcaatgac 41041 
tggtaaatta tctaataaca agatagctat gcttcctact aaaagaatca cagttaggac 41101 tgaaggaatg acttggttat aattgcctcg caaagcttta taaagttgca aaatagttgt 41161 caaactcaga aaaacaacga aggctgtttt tgatgttgaa agaatgatta aaatcgctga 41221 aagaacaaaa ccaacccaca gaatccattt atatttttgt tgagatttct gttgataaag 41281 ggagttatca acacagagta agagaaagac tatagtgctg acattcatga gacgacccaa 41341 gtggtttttg tgcgctatga cacctcgcca agctcccgcg tgagcacctt cttcctgaaa 41401 ggtcattaaa ccataagatg gcagtgctat agcaaacaca aagcttagta atattattaa 41461 cccgaacatc caggctaata attctaattg ctccctcatg gtgtaacgtg tagctaagta 41521 tactccaaat cccgttgtcc ccaaaagaag tacaccacga cgtagggtaa gatctggtgc 41581 tactgtccac aacacggatg ctaatgcaat tcctaccagt agccagatcc aaatgtcttt 41641 ctgggcaacg tacaaaaaac ttttcctgtt tttaataacc aagaagagag tcacgacata 41701 gactcccata aacaggattg gagtgtatgg atcttggctc gcatccacag aatcttcttt 41761 ttctataaga ataggtatca aggcacttgt cgaaaaaaac agagtcaata caacaaatac 41821 cttttcggct agccttgata atctgtgcat ggcaactcta tttgcaataa ctcattagcc 41881 gtctcaagaa aaatttttga acagaaaaac taaataaatt taccgtctaa agactatcct 41941 ctttaactct cgtataatta tgtccacata agtgattttt gattcatcct aagttttgac 42001 tttactttat ctttattagc aaactcattt ttatgaattt tatgtaaaga agagcagaac 42061 acatacgtag gagggtatgt tagaggatac tgaaactagc aaaaatctgg tatatttatc 42121 cctatcatca gggagtactc tgcgcaaaaa aaaactaatg ggtgcagagt ctgtaaccgc 42181 atcaccttgt tggatgttag ccgtaccgta ctcacgctac catttttagt tgcaaatgca 42241 acaccattgc aaatagttat cccgtttgtg tttctggcaa gttggctaga attcagtatc 42301 taaaaatata gattgacaaa atcgtgtatt caatatgtgt gaatgggaaa accaatgcca 42361 actaaggaac aaagcttgca gtacttgaga aattccaata ctatcttgaa agagccgtat 42421 ttgttctgga gaagaccaga aggtgaagaa gaaaatacag gttttaatca ggttttatct 42481 gtattgcgtc gtcgcttagg tttgattgct agtgcgacag ttattgtgac gacagtggct 42541 attgcgtgga cagcaactcg aactccaaaa tatgaaggga agtttcagat attagttgaa 42601 ccattaaaaa gtgctgatag tgagttactg cttttgctat cggaaacttt aaagcaaaat 42661 atcaacgaaa tcaccaaaca gaataaaacc gacatggatt atcaggcttt 
gatggaggtt 42721 ttaaaaagtc caaagctgat agagccagtt gttaaaaatc tgaaaaaccg ctatccaaac 42781 attagctacg atcaacttgt tggcagtgat gtgtctggta aagtgtctcc tgagcgagtt 42841 ggtacgctac acattagtcg gcttggtaaa ggcaaagacc tatcaagagt tgtggaggtt 42901 cgctatcgag aatccaaccc ccaaaaagtc cagtatgttt tagagcaggt atcacaagca 42961 tatcaaaact acagtaaaga acaacagcaa acaaatttac gtcaaggaat aaaattcatt 43021 gagcagcaga ttcccaaaat acaattccgg gtgaatacac ttcaaggaca aatgcaagtg 43081 ttccaaaaga agcacaatat gtttaacccg gaacttcagg gtaagcaatt gctcaacaga 43141 gttgatgaac tcaaaacaca gcgcatagaa attgaaagag gcttagctga aactcgctca 43201 ctttctgctt ctttgcaaaa gcaattggat atgtcagaga atacagcgat cgccgcatca 43261 gccttgagtg aatcacccca atatcagcaa gtcctcactc gtctgcaaga ggtagaaaca 43321 aaaattgcca cagagtcaac gcgcctgact gacaacaacc cagtcatgcg gaatttgcgc 43381 gaacagcgac ggaaactgct acctttagta cagcaagaag caaaactagc attaggcagg 43441 aatggcggga gggataactc ccaagtcgga gtttatcaaa actcggttcg ccgtgacctg 43501 atcaaacaac tggctgatac cgctaaccaa aatcaggcct tagaaagtag cctccaagct 43561 aatcaaaaag caacggctga gctaaatcaa caaattcagg aatatcctgc cctttcacgt 43621 cagtatgcca atttgcaaag ggaattgcaa gtttctagcg acactctcaa ccagatttta 43681 actaagcaag aagcactgcg ggtagatgct gctcaacaag atcttccttg ggaagtgatt 43741 acgcctccca cactccctcg tgacaaaaaa ggtcatctcg taccagtagg actcaatggt 43801 gagcgtaaca ttgcattagg agttgttgga ggtttactct taggaactct ggctgctttt 43861 gttctggaaa attcgcaaaa tgtctttcgc gattctgagg aaatcaaacg tacaaccaaa 43921 ctaccagttt ttgcggtcgt tcctttccat aaagaattgc gacacccaac ttctgtcact 43981 gataaacaac tcccattagc agatcaaaaa ggaaaaaatc aatttgcacc acaggtaaaa 44041 gcaaagactc aggagtatca aacaacagct tttaccgaag ctttttgctc tctttataac 44101 agaataaact ctctcaaatc acaagcttct atccactcaa tagtcgtcac atcagcgaca 44161 agtggcgatg gaaaatctac agtagcagtg aatttagcaa agatagcagc tcaagcaggt 44221 caaaaagtgc tactagtaga tgctaattta cgtcatcctc aagttcatca tgcattaggt 44281 ttagttaaca tcaaaggact tagcgagata ctttttctag gtatagattt taatgatgtt 44341 gttggacaag cacccagaga agaaaatctt 
tttgtgatca cggctggtga tgcaccacaa 44401 aatcccacaa agctgttctc atctcaaaga atggagaatt tcgtggagga agcccatgca 44461 aactatgatt taattgtgta tgacgcacca cacattatgg gacttttaga cacaagcatc 44521 ttagcaaatc gtgtcgatgg agttttgatg gttgcaggac ttggtaaaac agttcgtcct 44581 tctttgcacc aagccttgga agaattaaaa actggtcagg ttccagtctt gggcatagta 44641 gctgatacca ttgaacggta gctagtagca gacagtgata ttttcgtcac tcgttagttg 44701 tctgaaagat aataataaga tcaattcatc ctgtgataaa cattatcaag aaaaaattct 44761 ccagtcaatt catccgaaat gttggctggc tgagtatctc agagattatc tacagagttt 44821 tgcgcttggg attagttgtg atcattgcgc ggtttttgag ccgccacgac tacggtttgg 44881 gtgcaattgt aatgacagta cgtgagtttt caatcacgtt tgctgatgta ggtatagcag 44941 caaagattat tcaggctgaa gaagaagaat tagaggattt gtgtaactct gcatactggt 45001 taagttgggt cgtttttttg agtctttttc tgattcagtc tattgcggct tttcccatag 45061 cttggtttta taaaagtcca aatctgattt tgccaattat tgtttcaggt ttagcttact 45121 taatttggcc tctatctacc atacaaaaaa ctctcatcca aagagaaaat cgtctgaaag 45181 taatagctat tacgaatagt cttcaaaact tgactggcag tatattgagt gcgatttttg 45241 ctgtatcagg tatgggtgta tggtcgttgg tgttgcctcc aatattagct gcacctatgg 45301 aacccttaat atactacaga gcacatcctt ggcgtttaaa cacaggattt actacaaaaa 45361 attggggaca gatattaaaa tttgggaaaa atcttttagg cgtctcctta ttaagaacat 45421 taagaaataa cttagattat ttattggttg gtcgctttgt ggtcaaccca ggagatcctg 45481 aatatggaat tcaagaatta gggttatact tctttggttt taacgcagga ttaggaatta 45541 gtttaagttt tatcaatgct attaactcag caattttgcc tcatttatgt gcagtacgct 45601 cagaattgtc tgaattgaaa aaatcctatt ttaatagtct gaaaactatt cgcgctacca 45661 ttgttccttt tgtcatactc caatccagtt tagctcatat ttatgtacca attgtttttg 45721 gtaagaaatg gcagcctgct atccccatcg ttgttctcat ttgcttatcg gcaattccac 45781 gaccctttgc agacgctgca tcccaattgc ttatagctgt tgggaaacct catttagatt 45841 tacattggaa tgtcttattt actatcctat ttagcatagc aattctgata ggagtgaact 45901 tacaagttct tggtgtagat acatctgtct tatcagttca tctcggagaa cattggcaaa 45961 taattgttgt agctatatct gttttattag ttcatcttgt gtttttacct ctgtttacat 46021 tgtgggcaac 
acgttatgtt tttccaaaaa gtaagaaaaa tgaggcgctt ctatgaaaaa 46081 agtttctgtg atcattccag tttacaaagt tgagaaatat gtagcagcaa caatacaatc 46141 agttcttgat caaacctata aaaattttga gttgatcatt attgatgatg gttctccgga 46201 taaaagcata gaaatttgcc agcaatttac agacaataga atcaaaataa tccgtcagga 46261 aaatcgggga gttgcagcgg ctcgaaacgt tggtattcgt catgctcaag gagaatattt 46321 ggctttttta gatgcagatg acttatgggt aagcgaaaag ttagaacaac atgttgaaca 46381 tttaaagaat tcaccagcag tgggggtgag tttttgtcgc tcttctttaa tcgatgaagc 46441 ggggaagcct ttgggtattt atcaaataac caagcttaag gagattaccc cacttgatat 46501 actttgtcgc actcccatag gaaatggatc ggttcccgtg attcggcgtg aggtttttga 46561 agaaattgca ttccaagata atctttatgg agtggtagaa aacttttatt ttgatgatga 46621 tcgcaaactg cacccttcag aagatgtgga actttggctg cggatagcga taaaaacgaa 46681 atggcttata gagggaattc ctgaagcttt gacgctttat cgaataaatt cccagggctt 46741 ttcggctcaa ctggtgaaaa aattacattc ttgggaaaca atgctggaaa aagcacgcgc 46801 ctacgtacct ccagcatcaa tggcccagtt agaaaaaatc gctatagctt atcaactacg 46861 gcatttagcc cgaagggcag tgactttaga agatggttca acagctgtgg agtttgcttg 46921 gcgatcgctc tctacccact ggcgaattat actagaagaa ccacaccgta caattatcac 46981 cctcgctgca gcttattttc tctggctcat gccacgccag ctgtaccatc aagtacagtc 47041 tgttgcttta aaaattgccg gaactattca aaaacgacgc attcaacaag aagaattagc 47101 aaaaaaatct tatagcgttg ttctaaagga tgtctgatat caagtcccgt aaattaccca 47161 taattcccac gcaccccacc ccgccaaagc tgcgctttgt ctcccctccc cttaataagg 47221 ggaggggatt aaggggtggg gttcagagca gagcgaaaat tataactaat taaccgaacc 47281 tgatagaaac caccaaaatt gaaattatgg tgctcttaac tgctgagtca ttcacactca 47341 gcagtcgtta cttataaatc agtactgaat tatgctcgcc ctaacagagg agctgcagca 47401 agtaatgttt ttgtgtaagg atgttgtgga ttggcaaaaa tctgtttagt gcgaccaatt 47461 tctacaattt tgccactatt catgacagca atcctatcac acaaaaaccg cgctaaccac 47521 aagtcatgag tgataaataa atacgttaac tcaaattctt ctttcaactc caacatcaaa 47581 tctaaaactt gcgactgcac gctagcatct aacatactca ctggttcatc gcagatgagt 47641 agcttaggac gagtaattaa agcacgggcg atcgccactc tttgctgctg tcccccagat 
47701 aaatctgacg gataacgttc ataataaact tgtggtggtt ttaaccccac tttttccagc 47761 atcgacagca cctgttgttt tgcattcgcg ggaaccgcta gcttgtgaat cagcaaaggg 47821 tcagctatac tttgtcctac ggtcatcgct ggatttaagc aagcatgggg atcttgaaag 47881 accatttgca tttgccgcct agacgcacga acttgttgac gcgataagtt cgttaaatcc 47941 tgtcccaaaa actcgacttt acctgctgtg ggacgaatca gttgcaatat tgtccgtgat 48001 agtgtacttt tgccgcaacc cgactcccca actaatccta gaatttctcc tggataaagt 48061 tctaggttga ttccatccac cgctttaatg gtttgatttt gccgttttaa caatcgctct 48121 ataaagttgg gttctaaggt gtaatgctgt tgtaattcta agacgcgaag tatgggcgtt 48181 gactgaacct cttgtcctct gtgaacgggt attgaggaga agtcagagaa tgaggtgctt 48241 tcatcaactg tttgaatatg caaagctgct ttgaggagag actgtgtata ttcgtgttgg 48301 ggttgctgga acacagtctg ggaagaaccc gtttcaacca ttttgccgtt gtacatgacg 48361 ccaatgcgat cgcaatactc ggcaacaaga gccaaatcgt gagaaattaa tagcagtgcc 48421 atgttttctt ccccgcacag ccgtgtgagt tcctgcaaaa tctgtgcgga aactgtgaca 48481 tccaaacttg tggtgggttc atcagcaaca attaacttgg gggacagaag taaggctagg 48541 gcgatcgcca ccctctgacg cattccaccg ctaaactcgt ggggatactg actccaacga 48601 ctggcaggaa ttttaacttt ttctaaagtc gcaattgctt tttctttggc ttgtttggtg 48661 gacaattgtg gtgagtgagc ttttaatgtt tccaaacagt gattgccaat cgtcattaat 48721 ggatcaaggc gcgtcatggg atcttgaaat atcagcgcga ctgcttctcc tcggaattta 48781 cgcaactgtt ctggcgtcat ctcaaacact gaacgccctt gaaacgttac tcgtccctca 48841 atttggcttg agggtggtag taaccgcatt gctgcgcgtc ctagagttga cttaccgcag 48901 cctgattctc caaccaatcc cattctttcc ccaggttgca ggatgaaaga cacaccattt 48961 actgcccact gggcttcttc gtcgctgcga tgaggataag cgactcgcaa attttcaata 49021 caaaataagg cttcactcat aattaggcta tcagatatca gctatcagct tttttctctg 49081 acatacataa tatatgtcag gattattacc agagtatctt ttaaaaacaa caatataact 49141 gagaacgcac caataattcc aatgagtata cggataaatc accaggttta tcactttggt 49201 tttctactta taaagaatta ttttatactt atttttgctt cagcgagttt gaagtttttt 49261 atcagtcaaa agtttaaatt ttaaggattc ttgacttttg acgctattta caacgaattg 49321 tgcccaggaa atatgtaaag atgactacaa aacctcaaca 
aatttccatg aaaagagttt 49381 aaaaaaagtt caagattttt aagtttcaat ttttaatttt aatttgtcat tttttcatgg 49441 gttacggctt gcccttctgg tactatttat tgcaatatgg cattacagta attgatgaga 49501 gtagtcaggc gatcgcctat ggcatcaagc ctactcttgg cagtggaagc ttgccgcgct 49561 ccttggaaaa acatgacatc tatttccaac agccgcagct gcttactcat ttccgtctga 49621 tttgactgta accgactctc atcattgctg tctgtgtcca gattaactaa agggacaatc 49681 tgttgctgaa aaaattgttg caaggatctc acacgctgcg cgacttcgga tgcatctact 49741 ttggtagtca tgacatccga atgcaattgt ttcagcaata ccttcaatac gtgatactta 49801 tcgcgagtta gagacatcta agttcttctt gagatactat taaagtcaag gtaatgtcaa 49861 gaaacttttc aaatagacca aactttgtta atttttattt ccgaatctgt tgtctcatat 49921 gaaaattggg gctggatttg gcatttcatc gattttaccg actttatgat ccgagacact 49981 cgtactggat aatttgcact ctttgccttg gggaatctac tccacacaga aggacgaaag 50041 tgcaagtgtc agtaagagtt tgttttttca atttctcact ttactatcaa ttacctagcc 50101 acccccgatt ctttctatga gcagcactgt tgtagtcact accagaattg atgttactct 50161 cccagaatgg ctacagactt gcctaaatgg gtcatataca gtagtaggca cagaaactga 50221 agatggccga aggcaaagcg acatggcttt aattcgtcaa gcatttgaat ttgcttacca 50281 gttgcatgag ggtcagtacc gtaagtcagg agaaccatac atttgtcatc ctgtggctgt 50341 tgctggaatc ttgagagact tagggggcag tgctgacatg atagcagctg gcttcctcca 50401 tgatatagtg gaagacacag atgtcacaat cgaagagata gagcaacgct tcggcaaaga 50461 agtggggctt ttggttgaag gtgtgacaaa gctttctaaa attaatttca aaagtaagac 50521 cgaaagccaa gcagagaact ttcgcagaat gtttctggct atggcgcaag atatccgagt 50581 gattgtggtg aagttagcag atcgtctcca caatatgcgg actttggaat tccttcgtga 50641 tgaaaaacgc cgcgcaattg ctttggaaac acgagaaatt tttgctcccc tggctaatcg 50701 ctcagggatt tggcgcataa aatgggaact ggaagattta gcttttaaat atttagaacc 50761 agaggcttat cggcaaattc aggagtatgt cgctgaaaaa cgaacggcgc gggaagagag 50821 attgacaaaa attgccgaaa ctttacggac tcgggtagag gaagcgggga tcaagtgtct 50881 ggacatgagt ggacgtccga agcacctcta cagcatttac ctgaagatgc agaggcaaaa 50941 caaagagttt cacgaaattt acgatttagc agcactaagg attattgtca atagcaatga 51001 ggaatgttac cgtgctttag 
cgattgttca tgatgcttgc cgtccaattc ctggtagatt 51061 caaagattac atcggcttac ctaaacctaa ccgttaccaa tcgttacata ctggagttat 51121 aggtccgtgg ggtcgtcctc tggaggtgca aattagaaca ttggaaatgc atcacgtcgc 51181 cgagtatgga attgcagctc actggaagta taaagaaaca ggagactcca acattatcca 51241 ctggaggcca tcagatgaaa agtttacctg gttgcggcag ctgctggaat ggcagaatga 51301 cctcaaggat gctcaagaat atttggaaag catcaaggat aacttatttg aagatgatgt 51361 ttatgttttc acacctaagg gagatttggt tgctttaaat cctggttcca caaccgtaga 51421 ttttgcctat cgcattcaca cagaagttgg gaatcactgt gcaggagcaa aggtaaatgg 51481 gcggatggta ccactttcaa cgcgactgca aaatggtgat attgtagaga ttctgacaca 51541 aaagaacggt catcccagtt tggattggtt aaattttgtc agaacttcgg cggcgaaaaa 51601 tcggattagg cagtggtaca agcgatcgcg ccgggaagaa aatattgccc gtggacggga 51661 attgttggaa aaagaattgg ggagatcagg tgttgaaaac ctgattaaat cacaacccat 51721 gcagatagta gcagagcgat gtaattatca ttctatggaa gatttactcg cggctttggg 51781 ttatggtgaa gtgacgctga atttagtact caaccgttgg cgagaagtcg taaaggcgca 51841 acaacctgtt gcagatgcac ctctagttcc gaccaaagaa ctcacctcaa caaccaaagc 51901 tttacgagac cttactccag caacctcacg caccactgac tcaccgataa ttggggtaga 51961 aggattagtc cattatctgg ctaagtgttg tactccgatt cctggagaac cgattattgg 52021 tgttgtcaca cgaggtagag gcatttcgat tcatcgccaa ggatgtcaga atctagagaa 52081 tgtagagtgc gatcgcctcg taccagttca ttggaactca ccaggcgaaa tctactctcg 52141 tcctgcgact tatcctgtga atattcagat tgaggctctc gaccgcgtag gaattctaaa 52201 agatattctc tcacgcttaa gtgaccacgg catcaacgtc cgtcatgctc aggtaaaaac 52261 agcgactagt caaccagcat tgattgactt ggggattgag atacgcgatc gaccacaatt 52321 ggaacagatc tttactcaaa tcaaaaaact gagcgacatt atcaacattc gtcgtgtcgg 52381 tcaaatcgag gaatagtaaa attaatccgt aattcgtaat ctataatcat tacgaattac 52441 gagtcaatcg acactcccgt cataacacaa tccggttgct tacctagccc tccttttgaa 52501 ggctacggtg tacacacatc tccctgaaaa cctcaccctc gctttttgct acacaaaaac 52561 ttttccctct ccttaataag gagagggatg cccgataggg cagggtgagg ttaataagga 52621 gagggatacc cgatagggca gggtgaggtt aataaacaga cgcatgcccg atagggcagg 52681 
gtgaggttcc gaggtttggg taattcgatg acgtgtgtgt acacgtagcc ctaaaaagga 52741 gggaactaag cccctcaaat cacagtactt cgcgtcccat ataaggttgc agcgcttccg 52801 gtatccgtac cgttctatcc tcttgttgat agttttctaa aatcgcagcc attgtacgtc 52861 ccacagccag tcctgaacca ttgagagtat gcaccagttg ggttcctttc tttccacttt 52921 ctttgaagcg aatttgcccg cgtcgcgctt gatagtccaa aaaattggaa caactggaaa 52981 tttctcggta ctttccagaa gaaggaagcc aaacctctat atcataggtt ttcgccgagt 53041 gaaatcctat atccccagta cataattcta tgactcggta aggcagctgc aacgcctgta 53101 aaattgcttc tgcattgttt agcaactttt ccagttcctc actagaagtg ctgggatgga 53161 caaatttcac gagttcaact ttgttgaatt gatgcagtcg aatcagtccc cgcatatcgc 53221 gtccataact cccagcctcc cggcgaaaac acggagtata agcacagtgg taaataggca 53281 aatcttcaaa attgagagtt tcaccacgat agaggttggt tacaggaacc tccgccgtag 53341 gcgtaagcca caaatcatcc gccgcacact taaagctttc ttcagcgaac ttgggcaatt 53401 gacccgtagc tgtcatagac tcggtattaa tcaaaaatgg cggaataact tccacatacc 53461 ctgcggctat ctggcgatca agcataaact ggattaatgc tctctccagt ttggcaccag 53521 cacctataag ggtgataaag cgactttgtg cgacttttac agctcgctca acattgagaa 53581 tacccaactt ctcgccaatt tcccagtggg gaagaatgtt tgggttttgg ggaagatact 53641 catcacccca acgacgcact tctgggttat cttcctcact cctaccaaca ggtgtagagt 53701 cgcttggtaa gttaggaagt gtgagtaaaa gttcttcgat tttggctgat agctcttttt 53761 cttgaggttc cagtgtccca atctcagctt tgacagaact accattctca cgcaacgctt 53821 gaatttctgg tccagtgggg ttgcaaccag cttttatctt ttccggaatt aatttcgcaa 53881 tttcgttact gcgtgcttgc agttgatttc gcttcgcttc caattcgcgt tgctgtttat 53941 ccaactccaa aagcggctga atatcgtagt caccaccacg agttttcaac cgttcctgaa 54001 ccaattgtgg attttccctt attaacttaa tgtccagcac agatttatcc taatttttag 54061 cctagtattt tctttacagt taacacttca aaatatagta cttccgactg gtgacaactg 54121 ccaaagctaa ataagtaggt cggtattaat aaagttcacg atgttttggc agtcatgagt 54181 cattagtcat tagtcatgag gaggcagtga tcgactgggg tctggcggca taaaacacct 54241 gccgttcatt ggtaagggtt ttaggcgttt ttacaaatct ttgcatagct tgattttttt 54301 ccgcttgtgt acttagcagg gaaaagtaag tgatctgtga ctagttagtc 
attcaagcag 54361 tccaaagcga acaacaaatg atccactagg aactacgctc agactggggt ttggagatgc 54421 ggtagagtat ccaaattgac gcgacaagcc ccgcaaaatg ggctgtaacc gtatttgtgt 54481 ttgcctgcac gacaaacatg tctaagggtt ggatactccg gctagggtcg atctgggtga 54541 ttactccagc acccaattgc ggtaaagtca atgactttgt gactagagtc ccaacaatta 54601 tttgtgctcc taacagagtc aatagcattc ctactaaatg cacaataatc gcaaagcgta 54661 caacttgaac agtttcaact ttgcgcgggc ggttgttagg atttgaggat tctagtcgta 54721 gcccaagtct ggtgtaacgg aaagctatat aaatgcccac tgctaaagca acaagcccag 54781 caacggcaaa aatagcgcca aacccagttc cagggttact actggtattg gcacctctct 54841 gactaaagac tgcgaacagc aaaatgaccc cagaaacaac acctagcact agttgaatcc 54901 agaagctaac ccatcctgta aggcgaaata tttgggcaat tgcccggagt gtagacgaat 54961 ctgacatact gatcaccatt tgtgctgagg gcttgataaa tttactatca tacttagtaa 55021 gtgtgataag agtatttcta gtaaataact tgcgtacttc aaactttttt gagtgtgccc 55081 actattcaca aagcttgaaa gaaaacatga tattttgcct aatctttaag caaaaactct 55141 aggaaaatta ctgtctagca aagttttgat taagttttac ataaataccg cattttccct 55201 ttaaaattgc ttagacggta ttttatgctt aagcagtcct cacgcctttg gcattactta 55261 acactaaaga ttgcagcata gccgttgtta aatcgctcct caatacaatg tcagaaactt 55321 gggtagggat acaaaaactt gtattcctat agttggtgca tcggttccac gataaataag 55381 cgtgaaatct gtctcatgtt atttttaccc tttggtgatc cagcctaagc agtgctatgt 55441 actcacttta taagggtagt attttacact actcttacca gattgtaagc ttaaccctta 55501 ggcatttttc gtcattcatt aaccatgaaa cttgaggata tatatcattt ctttgaaaat 55561 cctccgccaa cttacctttg tcaggaactc gctgtttgtt atataatgta tattttgata 55621 ccaagcgaat cctatggaac ggagttgatc caacgactgg aaactgaata tccaacctat 55681 cggctttcag atactgtgct ttacagtgcg attaaatttc tcgaagacca gaaggcaatt 55741 actgggtact ggaaaaagct ggaaggacgg ggacgtccta ggcgaatgta ccaagtctct 55801 ccagaatggc aatttaaagc gcaggattta gctcgtttgt ggcaacagta tataagcggg 55861 agaacaagtt aacgaatcat tgataatgaa taaatcacct cctgtgaaac cctcataatc 55921 tagtttatgg atactgctac tctaccatcc acgttgctgc tcactttttt attatcggtc 55981 gggctatttt tctttattcg tgcctctacc 
aaagatcgta cacaaaccac acaactggtt 56041 tgcgagcaag acgaagctac tttaatgcct caattaaggg agtattttca aacacggtct 56101 taccgagtag cagcggtcga cccaaaacaa aaccaggtga cttttgaagg cattgttaga 56161 ccaagctggt ttttagctgt gtttttaaca cttttagcag ctgttggtct tctgtgtcta 56221 tcactggttt tgtcgctact ttttcccaac ctgagtacag tatttcttgg gttggtgctg 56281 gtctcacctt tgagtggtgt tttttattgg aagaaagctg gcaaacatga gttggtgtcg 56341 ctgaaggtgg aagcagccga aagcgaccaa cactctccaa ctcaaataac actaacagcc 56401 catcgagatg aaatcacgga gttacggagg acgttgggat taaaaagttg tgaatagctg 56461 ttattctttc tattcttagt cttacactct tatgctgtac cgtaagctat acctcttata 56521 gctttcattc gtatccacga tacggtactt ccttgctttt aagctttcgc atttttatat 56581 aagtatcttt cagtgactag tctgagtcca gactggcata tcattttata acttttcaga 56641 ataacaaatg gaatagagag aggaggtaaa acgactccag atctgtagct cttctccaca 56701 aattgtagaa aaaagtgatc tcgcttgaaa ctgcgtgcca aagctttgat cagaacgggt 56761 tcgctgtatc ctcactccca cgtacgtttt cacacctaat actcaactca tacaaacaca 56821 aagtttgatt agagttgata gtcttaaaga caaaactgct agcttaacac tggcaagtga 56881 gctacattct taaaaaaaaa tgaaacttaa aaaaaaacta gcaattttgg ttgttaatgg 56941 ttgtcatgag ccaaattcaa actctatgat cgccatatgt tagttcaacg actagcaaca 57001 gccagtgatt cctgacctgg tggggcttcg agtaccctca ggctcggaaa taaaaagtga 57061 ttctcttcaa catattgagc tccaaacagc cccttttc // LOCUS NODE_386_length_52904_cov_4.95106852904 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 52904) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 52904) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..52904 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1128) /locus_tag="DP116_01740" CDS complement(<1..1128) /locus_tag="DP116_01740" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by 
automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01740" /translation="MDIQPLASKVNRLGLGESLCQPNRLTLRQESNHRLSNSAYPIAK YTPLTTSKFLLPSHQNNKFLNTDSLANLNYWNDSDGNIDLFPELESYPQNEDISVNST DALSVRNHSSVNNNELAETLSSNNLGDLDTPQRKKANNKPKSQRKTKSKPTSDSKPQP KKTTKSSKTNKKGKHEVSPLQQASTQTFNPTTPNIETTTPVFEPTASPETSEVSPLQQ ASNQTFNPTTPNIETTVPVLGQPASPQTSEVSPLQQASNQKTNPTVPNLETTLPVSEQ TIPLQTSEVSPLQQASNYDNNATIPNIEATANVLGQPASPQTSEVSPLQQASNQTFNP TTPNIETTTPVFEPTASPQTSEVVPLQQASNYDNNATTPNIE" gene complement(1146..1664) /locus_tag="DP116_01745" CDS complement(1146..1664) /locus_tag="DP116_01745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phage tail protein" /protein_id="PRJNA477356:DP116_01745" /translation="MAGSNNSNITHELNYVTTNRFYIEIESAIAASFTECSGLGAQIQ KKVIHEGGVNDQQRVYLGQVQFNDVTLKRGVTDHPGFWNWISEVFDEQGKTFRRNVNI LVFNQAGETMMSWTLIGAVPIGWKAPALQADGNAAAIEELTLAYEGLQFGKEKGGGNK STRMKSGFFESK" gene complement(1664..1903) /locus_tag="DP116_01750" CDS complement(1664..1903) /locus_tag="DP116_01750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494716.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01750" /translation="MDLEKREVFYTPLGDVGGTGGYVGISAVSNPSRVSSYAERCLKD SLLLNTLTRRVYELLLEDIRNQRERVSNYGQKRWF" gene complement(2163..2759) /locus_tag="DP116_01755" CDS complement(2163..2759) /locus_tag="DP116_01755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407854.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FHA domain-containing protein" /protein_id="PRJNA477356:DP116_01755" /translation="MPANRCPNPNCEYFNRALPNNAKVCPWCSTPLGNVIPSTPSNPT NSQPVQPQAPIQEQPSYQPPPYATQYQQREYSPPTPPVYNATPPRLPVLKLVHSTGRE FQLRSEGGSIGRRSQNMPTPPEIDLTGIPHEGIVSRRHARIFWDWSQNAYMIVDTSTN GVYLNGNLLNSGVSYRILNGDSLQLGQDGLVTFMIAVM" gene complement(2737..4869) /locus_tag="DP116_01760" CDS complement(2737..4869) /locus_tag="DP116_01760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_01760" /translation="MSSTELFYKEYPCSHNAHLDCEKVIETLQEVKGAKFCFECGFPT ILPEEAEIKGYRGSYRVTKFLGVRGFGRLYSGIQLKDKQPVIIKEYLLPSRTFNQDEI NKRKEAFRRVGGIDLADNRVQNFRLIQTWEAISPEKGESCYLITKDVQPSQTLRQYLK QNGAMTPEQVRELLSEVLQTLEFLHSQKLRFSSNQIQRGLEHGNINLDSVLIKVENKQ RFVVYLCDLAIWENLFIPPSIPQPKAKTHVQDLESLGLVAFQLWVGQTQLDPKDHQVW PDNDNYLKQFIYRLLSLNTPYGSAEIARQELLKLPQPDESDNLRPSSDSQEQKKRFFK KYWLWLGVLAFLLLGGAIWYYFWQRFKLDEDQYLKWQGLAQNFSKVDNVPSGTFTYTG EKDDTWSYILRQNAENAETKLNDIFTNPKPDAKVTFTYQQVRSSDITKVSKPIEEVQK GGKDFAITTLFDNITPDMNSQQVAYDGLLVFIAFSQNDFSLHKKLDGKISLEDLRGIY TGKIIDWHQINKNAPRLPIERYVPTEPEAIQQFKKLVLKNNSQDIALFEEITKTRIRA TDFTQRQIRTANKKGQTGIISFGIFSKTWNQCSGYPLAIVNDNNKTIQPLLNPINKRP IEPSYNFCDRADFDTRSFQANGTANYPLGYPLYVVYPKDNTRQLGGFTFAELLKTRQG QCLLNQVGLVPLQPMPNHIKNDACKSVP" gene complement(4893..5567) /locus_tag="DP116_01765" CDS complement(4893..5567) /locus_tag="DP116_01765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_01765" /translation="MNNKQISILLVDDEERFRQGLRSLLGFYSTSASLPVNVVGEADC VEQVVKLAIQKSPDLILLDLELVNSDGITAMERLKDISYSGRVLVLSAHQEDKWIFQA MQKGAAGYVFKSRVANQLCDAINTVLKSEIYLPPEAASGFFRCFQENASSVYKASSQL HLTEREQEVLQLLTQGASNEEIAKNLYVTVATVKAHLTNIFEKLKVSSRTQAIVAAIK LGLVQA" gene complement(5607..6731) /locus_tag="DP116_01770" CDS complement(5607..6731) /locus_tag="DP116_01770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407857.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_01770" /translation="MNISFSLEKQKSNCHDLHAFDTETFIGLQAEQLTFHESICFVRI LYYNPSLKAFKERIEYADNQPSFSQQEIAYLRSEAWLTDFPHVWNVHEFQLAQFPKYF SYICPIRYTNQKPEYVQIITHTPLGEKSRRYVTNAAQMLNQYIDICLDNLQQKSKTEI LEQVLHKVGHQLRNNIALVKLYAHNLFLGLKDNSWREQATIICESVNDLDTNLTDIIS CGQESNLKIIPQDLRSIVDESIKYLQPLINQKQLKVNIPDTSATLPLDKLQMKQVFDN LLSNAIHFSPNSGVITFSWQVFYDEVLIKISDQGLGISLLDMQKIFTPFYSQRPGGTG LGLTIAKKIVLDHCGNLWADNLSAGGAQFCLILPTKKKYY" gene 6994..7896 /locus_tag="DP116_01775" CDS 6994..7896 /locus_tag="DP116_01775" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_927045.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01775" /translation="MIIYILQTLAGIIAGGTSLSSTEQIEFSHPSNRKEEGGGPILNL YVYDIRESKQIQHSGRQVERKLTEAKQVESQRKQVTQSASLNWSPTWFDVSMLLTAWD RTALGEHYLLSQALGVLLRHRALKEEFLVPELRGYGNLNLTVAVEPQIEAGSLWSALT IALRPALYLMVTVPVVPQVSPVYLVWQRTIGVDNNFQPELELENASVHRNGSIESFTK RVVVAGIIKNAVTNLPMKEAKVEVIGTKESTTTNKEGLFYFEELRNGNYVLQINSPGY LPLQVNALVDGSSCTFKEISLNPA" gene 7978..9639 /locus_tag="DP116_01780" CDS 7978..9639 /locus_tag="DP116_01780" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_927046.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phage tail sheath family protein" /protein_id="PRJNA477356:DP116_01780" /translation="MARLDYFAPGVYIEEIDRGSRPIEGVSTAIAGFVGFTEDVRGGA ELFKPMLVTNWTQYLNYFSRPNSDGFTDFNAYLPFAVYGYFMNGGGRCWITSIGTQLP NAPKPPDPQPASVRITGRGNRPSLQFTMRPEQIAAGTMTIVISDSGPRPLPEGTEGSV PPNTGEYFKVQIRRGDETLEEYDNLTMNREGNGQFATYAVTALRNSMFITIEDISQTG QPLGRRPGNGQYELTAPIPATPPDRFSSDIEGVRDDRTGVRGLFEIDEITMVACPDLM RAYQSELMNLDQVHAIMELMLSMCEGANTGDIPNPPNRMVVLDSPPDCPKPQQVVEWL NRFNRRSMFGALYYPWIKVANPRDRGNPISVPPCGHVMGVWGRTDETRGVYKAPANEV PRGVIGLDYDTNFREQELLNPLGINCIRRFPNRGIRIWGARTLVEPDKTEWRYISVRR LISYIEKSLELGTQWVVFEPNDEDLWARVRRTVSNFLERIWREGALFGASPEQAFYVK CDEELNPPDTRILGRLYIEVGVCPVRPAEFVILRISQWNGIEDEE" gene 9741..10196 /locus_tag="DP116_01785" CDS 9741..10196 /locus_tag="DP116_01785" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_927047.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phage tail protein" /protein_id="PRJNA477356:DP116_01785" /translation="MPGEFLTACKFYFEASVITDKFIKEISGLGVENTPAQEVHGSSK KGVISRQATPTVVKFTNITIKVIATDDKDLYAWYKKCNEDMGDPRQWMSNRYDGSVVA YDQQGTEKARWNIKRCYPCKYTGPTLTASGGDMANETIELVHEGIERIL" gene 10223..10711 /locus_tag="DP116_01790" CDS 10223..10711 /locus_tag="DP116_01790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494707.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phage tail protein" /protein_id="PRJNA477356:DP116_01790" /translation="MVQAGQNPEILTAHRFYLGLALDGQKDSDAYFLECQGFTRTQDV IEICEVTSQSWGTGQSKGLPVRTKIPGNVKSGNITLRRGMTNSIDFWNWFDKVQTGGW AKQRKMVSLSIYNQASVEVARFEMEGAWPTRYKIADVNARSTEIEIEEVEVAFEEFKR VK" gene 10732..11106 /locus_tag="DP116_01795" CDS 10732..11106 /locus_tag="DP116_01795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01795" /translation="MFATEFEITLPKGYIDSDGNLHRKGIMRLATAIDEISPLRDPRV KANPAYATIIILARVITSLGALSEVTPAIVENFFSQDLNYLQDFYRKINGLEPATPPI SDSQPPVSDSQLLEEVGNSKIP" gene 11150..11527 /locus_tag="DP116_01800" CDS 11150..11527 /locus_tag="DP116_01800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phage tail assembly protein" /protein_id="PRJNA477356:DP116_01800" /translation="MRRKELCTEFNFTLPRGLIDSQDRVHRHGVMRLATAKDELWVQQ ERKVQENPAYGALVMLSRVISRLGSLNSVTPEQLEELVLRDIYYLREFYNRINQQSNA YIPAHCPHCDTQFNVRLELAGES" gene 11683..13230 /locus_tag="DP116_01805" CDS 11683..13230 /locus_tag="DP116_01805" /inference="COORDINATES: protein motif:HMM:PF16697.3" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01805" /translation="MKVKISISPTQSEFEEVDLTLTTRRGAECLIGRSPDSDLVLDST DVSRSHGKFFSQGGNYYFCDLGSRNGSIVNGKLAEKNHSYILKDGDIIRIGDFVLVIE DEMPDSQQAETVVRIINPSEFSNWRQNQNQSATPQELQPVNNESVSSVPAAEISQKHE EAEIVNTSEEVEQSKEIFIQPDDIVTPDNAIPSAHGTSASDVALQATAESDASESDRI VQAHDLASPAPETENVSIIEHDIKVSEYSIVQAHDLASPEPETENVSIIEHDIEVSEY SIVQAHDIASPEPETENIPNVEHEEEIAPHTQEQEREIIKAINSERENFDLDTPSQQT EKELVLEDTSILTPHESELVATSQQTEEIPSENQESEELEEQEKSSIQLAKILEEKQI LLVAHQSKKSELTELVSEYKEFLSYCLTRTWQTFSDDLYKQTGLSVTQEIPPANSGGY QAINSLVNSGEILAVIYLRDLMMAPQQGQASEEALLRICNINNVLLATNLPTAKAILY YLKNLKD" gene 13316..15574 /locus_tag="DP116_01810" CDS 13316..15574 /locus_tag="DP116_01810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494702.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01810" /translation="MNSNSDNLISASLSTQSLTYRPNQGEVSFEVTVNNDSDRFANFQ LEIIAAGEIRNPNYRWYRLEPEVAAAQPHGSSTKFQVFIFNTPIPGFVGTVNLSVKIF SPQLARERRLILRLEIERDKKPILISVELPIRTFQVYPGNAIDIPVRVINQGQVASNV LLHLTGIDASWVTNSAERRFTLNANSQTEVSFRCEPPSVVQAPSLNYLFIAEATSNNS YPANAEGNLEVLPVGYINFSTPQNKQRIPSRRLWLPDWKSNSASYELLFKNVSNLHQE INVQIQGRDWRKCTFKKLPEIANLNLGETVKVILDVKTKRPWIGIGKTLFLEAKAELS DQRLGSTDPATQSLELEVLPIIPLWLQLAILALLAALLVWWKQFNQEAIAHTGFVNSV RIIGGGTGSSIVSASNDCTLRYWSIREYSIEPNKDATLESKPYEQNFRCAKPQKPKGV LAFANNPITVVRFVPVENNRVAIGLENGRIELREVPTGKRIPIPDPEEQGDKVFDLAF TKDSLNLFSGSGKGKVRVWSRESTTTNFQEKPVVIELEKQQEQKLTRFGIRALALSPD DEMLVVSGEYNRFLILHRNKNQPNNPFKKILVERLEKIDGRGKQKDSVLSLAFIPDSQ EKILATSDSFGFIAIWDLKQCKPTTNNNQQQINDANCKLLDRWQDESKNPIRTLAFSE DGKFLVSGGDDGRIVVWYLTSGYQLDKTNLPEGKTIFPGSTKIRSIDVKSSQGIVVSG SEDFKVRLHHIN" gene 15591..17906 /locus_tag="DP116_01815" CDS 15591..17906 /locus_tag="DP116_01815" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01815" /translation="MQARTNPLSIIINPPGIQYGMPGDIVSLYAVVINEGDQSAVIDL FFTFDETFQKISGWSASPRASLAVAPKESSDEVTFDFEIPVDALAGTYDYTFVVDSPE HYPQDTPINFPGQLKVLLKEQTVIRAFDPTFSIIPATNPNKLITYRIDQPLQVVVKVE NRSNRVDRFRLTCPDLDESWFKISYPATGLEGAGLFDVSALELNPHSEGEISLEFRPP IDTLAGSYSPTIRLLSENNPDLILLDLIYINLPTNYQLGIELNTILGKVSQKDGVFEL ALNNQGNLIRELFFNAKTRDEEEICTYKFDPSEFKLLPNTTEKASLLVKPGPWWRRPF FGQPLPINFEVNITDKQSYPLANASPQATLLWKARPWWQFLLLILLGLGLFAGISYMI WRLLNPEPLTIESFSTDKRKITEGDEVRLNWKINNYKPNRIEKLVVVIPQLPNNGHVY FDKKNNINELTKPVKSNQYPPCNFKPQEELECNRVTTGIKNQGKYVFELQVSYLQGSP LFNRRSQTVTKTAETEITKKPIAEVVDFKVDKSQYKTGESVSVSLTIKNPQLLNKLII NKKTNNLLVGNPVTLEFQNGKFKDPKLQKCKEENNLLKCTFSLPASNPGTFTYDISAI SGDQVSNKQAQNPVEILPKPFKIVFFKINNIDNSQKQSIVLNEGDQITVSWKVEGENV SVKLSLNNGDVPVPQTGNTSQIPVDVNFPRQIILTVTDKFGKLPPQPKGFLIIVKPKP TPASDLITPSPSPTQNIFKPLPASPRSRPLF" gene complement(17898..18236) /locus_tag="DP116_01820" CDS complement(17898..18236) /locus_tag="DP116_01820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01820" /translation="MKKHAPATANKMLCALRRVLQEAFKLDLISATDYQKAVDLRSIK ASKKQRGRRLKPDEITALIRVCQADSSPLGVRDSALIALLRCGLRRAEVVALHLKDFD ATNVMCIVLK" gene complement(18245..19174) /locus_tag="DP116_01825" /pseudo CDS complement(18245..19174) /locus_tag="DP116_01825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314910.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="glucosidase" gene complement(19320..19646) /locus_tag="DP116_01830" CDS complement(19320..19646) /locus_tag="DP116_01830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877148.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01830" /translation="MQDTSHSEYGDMYSRLIAISQEALESAYYETAYHALCAAMHYAY IQSDEHRLEMVGQAAKAQLDWIDANAKEHKMSTQSVMKRSGVNLYNSLLTQVHADLVI LRQKKR" gene complement(19734..20936) /locus_tag="DP116_01835" CDS complement(19734..20936) /locus_tag="DP116_01835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198315.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_01835" /translation="MLVFEAKLEGTNEQYGKLDEAIRTARFVRNSCLRHWMDNKDIDK YDLSAYCAVLAKEFEFAKNLNSQARQASAERAWSAISRFYDNCKKSKPGKKGYPRFKK EQTHGSVEYKTSGWKLSNDRRYITFSDGFEAGTFKMWGTRDLHFYQLKQIKRVRVVRR ADGYYAQFCIDQERIEKREPTAHNVGIDVGLNYFYTDSDGNTVANPRHLRKSEKSLKR LSRQMSKTKKGSKNRAKFRNKLARKHLKVSRQRKDFAVKTARCVVQSNDLVAYEDLKV RNMVRNRHLAKSISDAAWTQFRQWIEYFGKVFGVVTVAVPPHYTSQNCSNCGETVKKA LSTRTHTCQHCGHIQDRDYNAARNILELGLRTVGHTGTNASGDIDLYFGEETPQSKSS RGKRKPKQ" gene complement(21220..21960) /locus_tag="DP116_01840" CDS complement(21220..21960) /locus_tag="DP116_01840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865903.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-ketoacyl-ACP reductase" /protein_id="PRJNA477356:DP116_01840" /translation="MAPLAGKIAIVTGSSRGIGRAIALKLAGNGASVVVNYAGNANKA QEVVTEIENLGVQAIALQADVSKVADIQRLFEQTIERFGKVDILVNNAGVNFYKPLIE VTEEDFDKIFAINVKGTYFACQQAAHHMADGGRIINFSSSTTAMILPTYSAYVGTKGA VEQITRVVAKELGGQGITVNVISPGPTDTELFREGKTEEQINRFSQMAALGRLGQVQD IADVVAFLASDEARWITGQNVRVNGGIA" gene complement(22119..22835) /locus_tag="DP116_01845" CDS complement(22119..22835) /locus_tag="DP116_01845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314914.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pirin family protein" /protein_id="PRJNA477356:DP116_01845" /translation="MASNTKTHLIHDRNSRGHTKIGWLDSYHTFSFGNFYDPDRMGFR SLRVINDDRVVPGAGFGTHGHRDMEILTYVLEGAVEHKDSLGTGSVIRPGEAQIMSAG TGIMHSEYNPSETEPVHFLQIWILPDKQGLQPRYDQKAFSLEERRGKLRLIGAKDGRD GAIIIHQDVDLYTSVLEPGDVINYHLKPNRYAWLQIAQGIITLNGEELRAGDGVQMSG EEQLEISTNIGAEILLFDLA" gene complement(23340..23924) /locus_tag="DP116_01850" CDS complement(23340..23924) /locus_tag="DP116_01850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314915.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA starvation/stationary phase protection protein Dps" /protein_id="PRJNA477356:DP116_01850" /translation="MSDNNLSTRLYPTRIDIPAEARKQIAGILNQTLAATSDLKSQAK QAHWNVKGTDFYQLHELFDEIAGELEEYIDMFAERITALGGYACGTVRMAAANSFVPE YPTDILMGMEHVTALAERFAPYAKQLREAIDKTTELGDADTADLYTEVSRTIDKRLWF LEAHLQAAVTQNGNGNAGNIKTEEQAVVRQAAVR" gene complement(24215..25762) /locus_tag="DP116_01855" CDS complement(24215..25762) /locus_tag="DP116_01855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316495.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01855" /translation="MFKIASSHLTLGLTSLVGGLLLTNTILVTEAMAQSARPTEAMTQ INSVSQLSDVTPTDWAFQAVQSLVERYGCIAGYLDKTYRGNRALSRYEFAAGLNACIN RINELIAESTIDLVKKEDLETLKKLQEEFAAELATLRGRVDTLEAQTATLQAQEFSTT AKLVGEVVFAITDEFNQSVNNNTVFQQRVRLDIQNSFTGKDILHIRLAAGNTNIFNLK GNGVEGIQTFNFGNTNNSIYVDWMSYFFPIGENIEGYVAAVGGVHYDFVPTVSSYLEG YDSGVGSLSIFGQHNPIYLIGGGSGAGFTYSLSKKLSLSAGYLADNSDLFNDNYAALA QLTFSPSDQFSIGLTYINAYRKSAIFDTGSNLASVGTNLANGGGFDFGTVPSKVNAYG AEFTYKVSSEFAINGWFSYINADFVNLGTGDIWTYALTFAFPDLGKKNNLGGIVVGVE PYLGNASKFASGAKNDIPIHIEGFYKYQLNSNISITPGLIWVLSPGQNSENRDAIIGT LRTTFVF" gene complement(26426..26758) /locus_tag="DP116_01860" CDS complement(26426..26758) /locus_tag="DP116_01860" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01860" /translation="MEGDDPRLALLFLAMPRLPLARQTGQQWFDYLPSSMLVSGLRLV LYQFCSSFYPRLASARYVGQWQSNGTLIGTISIPKVAHKIHTSTSFNRDNQLVRVSVG DLSLVRQQ" gene complement(26748..27854) /locus_tag="DP116_01865" CDS complement(26748..27854) /locus_tag="DP116_01865" /inference="COORDINATES: protein motif:HMM:PF03253.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01865" /translation="MWNVLYQSALGRLKLTTLNNQKQTTQGWTTMQWLRTLRHLVRAV CAATAEILFLRGVGVGALLLSAMLLQPSVMVMGLVGVLAAVAFARVTQVDVMYLERGP LLFNPLLAGLSVGYLFQPSVASLFLAATAGILAFVLTWTLSHVLYTFFLLPVLSLPFV VVSWSVHLAAFRFAGLQHAVVPAYAYSIGLPVPLEGFLRTLGLIFFLPNVWVGMVVAL LLLLNSRIQFLLAVFSYAFGSFIQGILTGTFAYVYYDPAALNFILVALALGGYYLLPS PQSYMLAALGVALAALLGQAVSVFWAAVALPVHALPYNLVTLLLLYLLGLVGHQLLAR YPQSSPEKTLDYELTARRRFQGSSGRLLALPFGR" gene complement(27884..29266) /locus_tag="DP116_01870" CDS complement(27884..29266) /locus_tag="DP116_01870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013537691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="diaminopimelate decarboxylase" /protein_id="PRJNA477356:DP116_01870" /translation="MQTQYEKPVIQKLHSGLMNKFAGSTLLHRKVKTEIAGVAIDELV AQYGSPLFVYSEQMLRRKFRSIRNAFTTRYPNVTFGWSYKTNYLKAICAILHQEGAMA ELVSKMEYDKAKALGIPGNQIILNGPHKPFATLEAAVRDGVTINIDHLDEIEDLEAIA TRLQKIIPVGIRLNLDAGIQPCWSRFGFNLESGQAMDAIRRIASGGKLQVNGLHSHIG TFIMEPAAYARQVEKMVAFGYEIEQECGWRMDYIDIGGGFPSRSRLKGSYHAVDVMLP SIEEYADAVCDALWAALKPEHTPQLIIESGRAVVDEAGTLITSVCGTKRLPDGTRAYV IDAGVNLLFTSFWYRFDIALAQPVSGPYEPSVIYGPLCMNIDVVDDKISLPPLSRGTQ LVISTVGAYNNTQWLQFIEYRPNVVLITETGDVELIRAAEDLSDLERREHLPPRLATN AMFHDLTTKL" gene complement(29268..30308) /locus_tag="DP116_01875" CDS complement(29268..30308) /locus_tag="DP116_01875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013537689.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carboxylate--amine ligase" /protein_id="PRJNA477356:DP116_01875" /translation="MKPFTIAITGINAVDNPGPGTGIARSLREDNQLPTQIIGFAYDV MEPGLYMDWLFDRRYLMPYPSSEPEVLIERLQQIQQQVGLDCVIPNLDVELPLYIRCA KELESLGIRTYLPTREQFALRNKTQLAKVATAFGVRTPQTFTVTSIQELRDAIASLGL PLMMKGPYHGAYRVLTEDEALQRFHHLAAQWGYLIIMQQIISGTELNLVGIGDGKGSH LGLVATKKMSVTELGKIWSGVTIHHPGLLQAAEAFLQHTKWKGPFELECIVDADGMIY LIEINPRFPAWTYFATGVGVNLPARLVRAALGLPLPALPSYEAGKLFMRYTYEMVTDA TPLQTLVTLGER" gene complement(30305..30538) /locus_tag="DP116_01880" CDS complement(30305..30538) /locus_tag="DP116_01880" /inference="COORDINATES: protein motif:HMM:PF05402.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PqqD family protein" /protein_id="PRJNA477356:DP116_01880" /translation="MTSGINEVKTKDIDATGESFMLNHCGQLVLQRLRHGETQQQIVQ VLCDRFDIAHTIAERDVADFCQQLKTLGLTEDK" gene 30612..31346 /locus_tag="DP116_01885" CDS 30612..31346 /locus_tag="DP116_01885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655024.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01885" /translation="MFGLVLVFVLLLINALLSERQSREKLAIAHQQLHRYALRVEDQA TLQERNRIAREIHDALGHSLTAQSIQLENALLFLSSNLDKAKTFLEEAKQLGSSALRE VRQSVATLRCDALQGKSLESAIALLLSDFQRRTGIIPDYKLCLPQPLSGEVGTTIYRI VQEALTNISKHSAATVVTIDLQTTADSLYLQLCDNGRGFNPYQNTTGFGIQGMRERTL ALGGHFRIFSESGCGCRIIAVFPYHG" gene 31374..32086 /locus_tag="DP116_01890" /pseudo CDS 31374..32086 /locus_tag="DP116_01890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878972.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" gene complement(32117..33022) /locus_tag="DP116_01895" CDS complement(32117..33022) /locus_tag="DP116_01895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876317.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_01895" /translation="MRTEKIISVDFAQEDAYSQILPRSPLITSYHANWNDFRLDYHQQ PPFETPEHTPQQHVISISLTKQPINVERVLDGHVQHECIKYGDIVVIPASSYHKLSWE IEAEFLVLSLEKALFARAGYDLIDMQYTDIIPHFADSDPLIQQIGLALQSELESDGMG SRVYIESLATTLCIHLLRHYSVSSSKITKHPEGLSRLKLRQAIEYINQNLEKDLGLAE IATAVGMSMYHFSRLFKQSTCFSPHQYVMNCRIEEAKRLLTKTEQAIDQIYPQVGFQN QSHFTNVFRKLMGTTPNAYRKQVKI" gene 33177..33599 /locus_tag="DP116_01900" CDS 33177..33599 /locus_tag="DP116_01900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115696.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thiol-disulfide oxidoreductase" /protein_id="PRJNA477356:DP116_01900" /translation="MSYLVIYDGNCNLCVTGVQMLETLDQGQLFRYVAMQDESTLQQW GITPEDCELGMILIDADAPERRWQGSAAAEEIGRLLPNGSVFVDAYRALPGVKWAGDR FYEQIRDHRYTIFGKRSSTYESTYCIDGGCKVAKNDAS" gene complement(33629..34060) /locus_tag="DP116_01905" CDS complement(33629..34060) /locus_tag="DP116_01905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210796.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01905" /translation="MNAAEMATNLEFASKIATVVNLFKSEFPDARSDLKPWKNDPQTR ELVDPDSIDIGFHFPGISKSWQSRSILIQIRFYHDHLNGSRRAIGVEVAGFSHIGEQW RLSTVENWSFVGTSVPSVQVGEKLKDFCRQILEIFNSAPPI" gene 34305..34772 /gene="psbQ" /locus_tag="DP116_01910" CDS 34305..34772 /gene="psbQ" /locus_tag="DP116_01910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314936.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II protein PsbQ" /protein_id="PRJNA477356:DP116_01910" /translation="MLRQRSILSLLLGFLAIFLISCGGPGVATPPPTYTPDQLVKIQE YVSDIQGVKERSRELERLIQTKQWVKVGNFTHGPMTEARLSMNYVTSNLLPKDQSAGR ELVRDLLDKLIKIDQAAEVGNTNGALNSSVAAFADIDKFLQLVPQTSSPSEES" gene 34914..36059 /locus_tag="DP116_01915" CDS 34914..36059 /locus_tag="DP116_01915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314937.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_01915" /translation="MSHVVIIGCGVVGAAIAYELSKVNGLRITVVDQQPPALASTGAA LGVLMGIISQKTKGKAWQMRQTSILGYETLIPELETLTDRKIGYNRQGILMLLPEPSM SSLEKEDGGISEWEKLQEIRQSQGFSLEIWDTDKLKQVCPHVNNDQIVGAVYSPCDRQ VDPTSLTLALIDAAKHNGVDFKFGVPVLGIEPQPLSPSQERDSEKYCNQLQTPEGKMT ADWFVVAAGLGSSLLSAQLKQMVDIRPVLGQALCVRLGHCLGNPDFQPVITGDDVHIV PVGGGDYWIGATVEFPTNKKDEVIADKELLELVRKQAIAFCPDLATATTIRTWSGLRP RPEGRPAPVIGKLPGYSNILLATGHYRNGVLLAPATAYAIREMIIAN" gene 36383..37234 /locus_tag="DP116_01920" CDS 36383..37234 /locus_tag="DP116_01920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458171.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="PRJNA477356:DP116_01920" /translation="MSITENKIQVGSLEWFYREAVPTGRSELLPVLLLHGLPSQSYSW RNIMPALAKQGTRAIAPDWIGFGFSSKPDKRDFAYTPDAFLTALEGFLKSIELERFSL VVQGFLGSVGLQYALRHPEQIANITILNTPISTQAKLPWKIQQMGLPFAGDMITQDPL LVDRTLEGGCRYVITEEHLNIYRKPFLKSSAAGRSLLATIRNLQLRSAMTEIDNGFKE LHQEILILWGMIDPWLPINIAQYFVNSLEKGSLIKLNNVGHYPQEHYHETILEDLLPF VRVKDSN" gene 37438..38016 /gene="hemJ" /locus_tag="DP116_01925" CDS 37438..38016 /gene="hemJ" /locus_tag="DP116_01925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141621.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="protoporphyrinogen oxidase HemJ" /protein_id="PRJNA477356:DP116_01925" /translation="MAYSWFKAFHIVGIVVWFAGLFYLVRLFIYHVEANQEPEPARTI LKNQYQIMEKRLYNIITTPGMLVTVAMAIGLLSREPDVLKEGWLHVKLGFVVLLLGYH HYCKRLMKQLAQDTCKWNSQQLRALNEAPTVMLVVIVLLAVFKNNLPTDITAWGIVGM IIGMAATIQLYARKRRKDKEKLTTEMVQQQSS" gene 38347..38628 /locus_tag="DP116_01930" CDS 38347..38628 /locus_tag="DP116_01930" /inference="COORDINATES: protein motif:HMM:PF07282.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01930" /translation="MKIIYFGKILTIWLFVEFLLSLSLSAHAQEKLEARSTTLPQQLI SQQNFIPPRQGKPKDTSGAGSRSRLCENCGTSINRDVNAAINLSRLATA" gene complement(38750..41518) /locus_tag="DP116_01935" CDS complement(38750..41518) /locus_tag="DP116_01935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742548.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molecular chaperone TorD" /protein_id="PRJNA477356:DP116_01935" /translation="MNRYTYQVGGSLTINAPSYVERQADKQLYEALKEGEFCYVLNSR QMGKSSLLVRTRHRLQQDGFRCTTVDMTNIGCENITPEQWYKGVVAELWLGFKLLGKF NLKAWWREQEDIPVLQRLSRFISEVLLVQFPEERLFIFIDEVDSIKSLDFSVDDFFAF IRFCYNRRAIDPEYNRITFVIFGVTTPSDLIQDPKRTPFNIGKAIELHGFTMGEVEPL VKGLAVEQGNAQTIIKEILSWTGGQPFLTQKLCLLVQSSSQDSVSQTLIVLPGTEAFW VESIVKSRIIHKWESQDEPEHLRTISDRLLANEQIAGRSLGIYQQILAGEDVPTDDSP EQIELLLSGLVVNEQGYLRVKNPIYQTVFNSEWVALKLENLRPYAQRFKAWIASNQQD ESQLLVDLALQQALAWSKNKRLSDLDYRFLAASQELAKRQVETDLAAEKQARQIEREK AQFAVLAAQQANRILANARKAAKRKAQKLRLSKFWMGCIAGGVASFVILLRFTGLLQG MEWSMLDSFFQARPPAAVDPRITIITIDEPDIKQIGQYPLTDRVLTQAIRKIKSYKPR AIGLDLYRDLPIEPGHQELVELYKTTENLIGIEKAVGSQVAAPPTLAQLGQVGLADQV LDGDGKLRRALLSVKLENSSLHLNLGLQLALRYLEAEGITPQPQTKHREQIHLGKTVL VPFQPNDGGYVQADAGGYQVLLNFHGTEQQFQRFSLIDLLANKIPLELMRDRVVLIGS TAESVNDMFQTPLSSQNVGSAKQMAGVTIHANITSMILSAALQGRPLLTVRSKPMEWL WILLWCSVGTALAWQRKSPQSIITAVAIAEGGLIAIAYLAFLQGCWIPVVPAMIGLVI AAVTLPIVTNRQSEKAQLYQTVELLVAISREEPAVGQIALEYLKQAESSDNQALIEQI LRQEQRLH" gene 43123..44286 /locus_tag="DP116_01940" CDS 43123..44286 /locus_tag="DP116_01940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454750.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl hydrolase family 10" /protein_id="PRJNA477356:DP116_01940" /translation="MFKNSSQSRRRFLYIGLSSLAGATALAVVKNSNPMYLSHALDNP KRDFQVAGLASLSKRAAAKGLIYGVACGRDVLASDKNLQASIVRQCGVLTPENELKWQ FVRPRPDVFDFSRADWIAGFARTHNMLFRGHALVWHEQLPEWFKEVVNRQNAEKFLVE HITTVTKHYAGKIHSWDVVNEAIKPDDGLKSGLRQTPWLKFLGPDYIELAYRVAAQAD PKAMLVYNENGLEHNAPEYEVKRTAVLKLLERLKSRGTPIHALGIQSHLLGDASLNPK KLRNFLSNVASLGLKIIISELDVTDQKLPSDTVVRDRIVAAKYEDYLSVVLDERAVIA VLTWGLSDKYSWLSKFGSRSDGAPVRPLPLDLNFKSKLAWNGIARAFDKAPKR" gene 44497..45285 /locus_tag="DP116_01945" CDS 44497..45285 /locus_tag="DP116_01945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488610.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_01945" /translation="MQKFDEISLLGTRFHKIKVNELIDYAVQAAKLKKKTIIGHVNTR AMNFAYELPWYREFLNKSDLVFCDGFGVLLGAKLCGGCVNSSHRMTCPDYIEDFAKAC ERENVSLFLLAGEPGTVDQAIAKLKVIAPNLRVNGHHGYFDKSGEENEFVIQQINTFK PDILYIGFGMPLQERWILNNSEKIDTKVFLPLGACLDFYTGTVSRGPRWMTSSGLEWL TRLVTEPKRLWKRYVLGNPLFFYRVLQQQLTKLFRRRSRTSSQY" gene 45465..46892 /locus_tag="DP116_01950" CDS 45465..46892 /locus_tag="DP116_01950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861158.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="PRJNA477356:DP116_01950" /translation="MASKSLSVTGFKPDLRSANGTRIQKGLTSKFLRVGTLSVSDVIS LALAWKLAVIHGTPLDSPWTQKVSFLLLILAVEIGVIATRGLYKSGINRRNYLGLIKA VTLSDLLLLLIAFLYEPDSYISRSTFLLFWLLSVVFVCTGRFICDVGTKLLRSKGAIR HSVFLITDPEDKDSHIRLIEKENCYTVQKIADSSSLDLMNRETTFEYLRTQGIEEAFV SWNAIKNRLYVCWRFQTAGITLRILPTEGEVRHPKSIFWMIGEVPCMTIPAPIIAGSD FWVKRSFDLCCSTILLLLLSPVYVVIATLIKLDSPGPVFFQQNRVGLHSKSFKIWKFR TMVANAEKLQAKLEAKNEIKDGVLFKMKDDPRITRLGKFLRRYSLDELPQLFNVLLGE MSLVGPRPLPMRDVEKFKTKHFIRQEVLPGITGLWQVSGRSDIDNFEDGVKLDISYIE NWSVWLDLQILLKTVKVVFSKAGAY" gene 47489..49711 /locus_tag="DP116_01955" CDS 47489..49711 /locus_tag="DP116_01955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865611.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipopolysaccharide biosynthesis protein" /protein_id="PRJNA477356:DP116_01955" /translation="METRGYQEEVEIQKYWLVLKRRWPIVVGVLLASIGFSSFLIFLQ KPEYQASGMLLFKSDRTSSLTKVGEKIGDLESLMREGNPLETQAVILKSEPILKEVID TLGLKDKKGNALDPESLRIKVEPIVGTDVLKVSYTSENPALTASVVNQVMKSYVAKNI QFNRTQVVAAGEFIKKQLPEAQRELNQAAEGLREFKTRNKIIELPEEASAAVQNVAQI DEEINRARAALADTSAQEEKIRSQLNLAGSQAVEITSISQIPGVQEVLTELQKVQTKL ANEKARYTSKHPAITELENKEVTLNALLQQRVEQVLGPSGVKQVLGTQQNVAPAKLQI GRIKENLTTQYALIQAQRQGLENKLQALSNIRGTYKQKLSALPNLEKKQGDLERRLSI AQKNYENLVTRLQDIEVAEKQTVGNAKEIQLAQVPKKPSVSKITFLLGGGSVFVGLLL GTAAAFFVDLIDRTLKTVKEAETFFGYTLLGLIPKFESKKTSAPVNLMSDKASARIIV ATSPRSVVHEAYQMLQANLKFISHRKVRTIVVTSSVPGEGKSEVSANLAAVLAQAGRR VLLVDADMRKPSQHHLWGLVNSVGLSNVIVGQDQLPQTVQTVTKELSILTAGVQPPNP LGLIDSDRMATLIETFCDRYDYIVFDTPPLAGTADAAVLGKMADGVLLVARPGVVDSA SATAAKSLLERSEARILGMIANAVNLKQESANHFYYSNVRGGQDVVETAKGNEQWVYK " gene 49696..50277 /locus_tag="DP116_01960" CDS 49696..50277 /locus_tag="DP116_01960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006635773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine acetyltransferase" /protein_id="PRJNA477356:DP116_01960" /translation="MGVQINLTVNTDQAESPIGLWQLIQEDWIAHGRDWTKPGFRAVV VHRFGVWRMKIKPLLLRAPLSILYRMLFRKVRNHYGIELPYSVELGRRVIVEHQGAIV IHGDCSIGDECIIRQGVTLGNRYLDRPLDAPKLGKHVNVGAGAKIFGNVTIGDNASIG ANAVVLCDVPAGATAVGIPARIIHSEKVGNSHL" gene 50290..51516 /locus_tag="DP116_01965" CDS 50290..51516 /locus_tag="DP116_01965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865614.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_01965" /translation="MSMSQFMASFTDIVLLVSASGLFIVCVFFLIECTAALLPITSCI NKDNCPDLKVAVLVPAHNEEIVIGSTIEKLLPTLNRQDSLVVVADNCSDTTAEIARAK GATVIERQDLDRKGKGYALDYGLQFLESAPPDVVVIVDADCTVHPDAISLLSQYAIAM NAPVQATYLMSRPKNSQSSKDFVSQFSNIVRNLVRPLGLARLRQSCPLLGTGMAFPWT VIRSVNVANSHLLEDLKLGLDLTLAGYRPVFCQSAKVTGYLPQQSQAAKSQRTRWDHG HLQIMQTYVPILLKQAVFQKRFDLLVSVLDLCVPPLSLLVVIWLVLMALSLVFGVLGA SLMPAVIIATAGLCFLIAILTAWTKFARQDLPLRELLTVPFYILWKIPVYFKFLVKPQ SVWVRTERDSVNASDS" gene 51554..52810 /locus_tag="DP116_01970" CDS 51554..52810 /locus_tag="DP116_01970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017285978.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_01970" /translation="MNKIQQNSLLLVNSVPLREVDGQLGLDDQTCAGLVRWAENFKRL VIAGPALPEHIAVQQQNSSTAGTKWQAVKDLPCADQLELVPLPYAYKLRDFTKVYAKT RELLNEKIQQNQYLCFVIARPIGDWGGIAALEAIKLRRPYTAWLDRVEYEVISRMLQS LPLKSRIKESLHLPLMKRYQRYLIRQSSLGLFQGQDCYQAYSPFCKQPHCIYDVHTKK SDQIDEPSIDLKIKSLLSNEPLRICYVGRAADMKGPLDWLRVVDRICKAGVDVKATWV GDGPLLLKMKSLTQELGIADRVNLVGFVDDQSKILQTMRENHIFLFCHKTPESPRCLV ESIVSGCPIVGYESSYSQDLVSQYGGGAFVPMNDWQKLADLVVELNCDRPKLSKLIRE AALSGQFFDEETVYQERSDLIKKYLT" BASE COUNT 15500 a 10912 c 11026 g 15466 t ORIGIN 1 ttcgatattt ggtgtggttg cgttattgtc gtagtttgaa gcttgctgta atggtacaac 61 ttctgaagtt tgaggtgatg ctgtaggctc aaaaacaggt gtcgttgttt cgatatttgg 121 tgttgtcggg ttaaatgtct ggtttgaagc ttgctgtaat ggtgaaactt ctgaagtttg 181 aggtgatgct ggttgaccca aaacatttgc cgttgcttcg atatttggta tggtcgcgtt 241 attgtcgtag tttgaagctt gctgtaatgg tgaaacttct gaagtttgaa gtggtattgt 301 ttgctcagaa actggtagcg ttgtttctaa attcggtaca gtcggattag tcttctggtt 361 tgaagcttgc tgtaatggtg aaacttctga agtttgaggt gatgcgggtt gacccaaaac 421 aggcacagtt gtttcgatat ttggtgtggt cgggttaaat gtctggtttg aagcttgttg 481 taatggtgaa acttcagaag tttctggtga tgctgtaggc tcaaaaacag gtgtcgttgt 541 ttcgatattt ggtgttgtcg ggttaaatgt ttgagttgaa gcttgctgta atggtgaaac 601 ttcatgttta cccttcttat tagtcttgga ggatttggtt gtttttttag gttgaggttt 661 agaatctgat gtcggtttgg attttgtttt tctctgagat ttcggtttat tatttgcttt 721 tttacgctga ggtgtatcta aatcacctaa attatttgaa cttaatgtct cagctagttc 781 attgttatta acactgctgt gattgcgaac tgataatgca tcggtactat taacactgat 841 gtcctcattt tgaggataac tttctagttc gggaaataaa tcaatattcc catcactatc 901 attccaataa ttcaaatttg ctaaagaatc ggtgtttaaa aatttattgt tttgatgact 961 tggcaataaa aattttgatg tagttagcgg tgtatactta gcaataggat aagcggaatt 1021 actgagacga tgattagatt cttgtctcaa agttaatcta ttgggttggc ataaagattc 1081 acctaagccg agtctattga ctttgcttgc caatggttga atatccatat agttttattt 1141 tggtattact tagactcaaa gaatcctgat ttcatccgtg ttgacttatt 
acctccacct 1201 ttttctttcc cgaattgtaa accttcataa gctaaagtta attcctcaat tgctgctgca 1261 tttccatctg cttgcaatgc aggcgccttc catccaatag gaacagcacc aataagtgtc 1321 caactcatca tcgtttcacc tgcttgatta aaaaccaaaa tatttacatt gcgtcgaaat 1381 gtttttccct gttcatcaaa aacttcactt atccaattcc aaaaacctgg atggtcagta 1441 acaccccgct tcaatgtcac gtcattaaac tggacttgtc ccaaatacac cctctgttgg 1501 tcgttaacac ctccttcgtg gataaccttt ttctgaattt gggcgcctag tcctgaacat 1561 tcagtgaaag acgcagctat agcgctttcg atctcgatat aaaaccgatt ggttgtaaca 1621 taatttagtt catgagtaat gttgctatta ttactacctg ccatcagaac cacctttttt 1681 gaccgtagtt acttactctc tctcgctgat tacgaatatc ttcaagtaac agttcataca 1741 ctctgcgagt gagtgtattt aacaaaagcg agtctttaag acatcgttct gcatatgaag 1801 atactctgct tggattagat actgctgaaa taccaacata tcctccggtt ccaccaacgt 1861 ccccaagagg tgtgtaaaag acttctcttt tctctaagtc cataattgtt atgtttattt 1921 ttaacgcggt gttcagattg ttcagattcc cgacttcttt aagaagtcgg gaatctaatg 1981 taccgcatct gctattaaat ctaggctaaa ggtacagctt cagaatcaaa attagccaaa 2041 ggtctaactt gtatactcgg attttaattc tcaattatca acaaaaccag atatttcggt 2101 tgcgatcgcg tagcgtgtat tgtagaaacc taattaatta agtctctaca tgtggggtat 2161 aattacatca ctgcgatcat aaaagtaact agaccatctt gaccaagctg taaagaatca 2221 ccgttgagta tgcggtaaga tacaccagaa ttaagaagat taccgtttaa ataaacacca 2281 tttgtactgg tatcaacaat catataagca ttttgcgacc agtcccaaaa tatacgggca 2341 tggcgtcgag aaacaattcc ttcgtgagga atgccagtta ggtcaatttc tggtggagtt 2401 ggcatatttt gacttctacg cccgatcgaa ccaccttcac tcctcaattg aaactctcta 2461 ccggtagagt ggacaagttt caaaactggc agtcttggag gagttgcgtt gtaaacaggg 2521 ggtgttgggg gtgagtattc gcgctgctgg tattgtgttg cgtaaggagg aggttgatag 2581 cttggttgtt cctgtattgg tgcttgtggt tgaactggtt gtgagttggt tgggttactt 2641 ggtgtagaag gaattacatt acctaaagga gttgaacacc aaggacatac cttggcattg 2701 tttggcaagg cacgattaaa atattcgcag ttagggttag ggcaccgatt tgcaggcatc 2761 attttttata tggttaggca taggttgtaa gggaacaaga cctacctgat taagtaaaca 2821 ctgaccttga cgggttttta gcagttcagc aaaagtaaaa cctcctaact gacgagtatt 
2881 gtctttagga tacactacat acaaaggata tcccaaagga taatttgctg ttccatttgc 2941 ctgaaagctt ctagtatcaa aatctgctcg atcgcagaag ttatatgaag gttcaattgg 3001 tcgtttatta atgggattta acagcggttg aatcgttttg ttgttatcat tgactattgc 3061 taagggataa cccgaacact ggttccaagt tttactaaaa ataccgaaac ttataatacc 3121 agtctgtcct ttcttgtttg cagtacgaat ctgtctttgg gtaaaatctg tggctctgat 3181 tcgagttttg gtaatttcct caaataaagc tatgtcctgt gaattgtttt taagaactag 3241 ctttttgaat tgctgtatcg cttctggttc agttgggaca taacgttcaa ttggaagacg 3301 aggagcattt ttgttaattt gatgccaatc tataattttg cctgtatata tgccacgcag 3361 gtcttcaaga gagattttac catccaattt tttatggaga ctaaagtcgt tttgactaaa 3421 tgcaataaac accagtaaac catcataagc aacttgttgt gaattcatat caggtgtgat 3481 attgtcgaat aaagtcgtta tggcaaagtc ttttccacct ttttgcactt cttctatagg 3541 tttactaact tttgtgatat cgctagaccg aacttgttga taggtaaatg ttacttttgc 3601 atctggtttt ggattggtga agatatcgtt taatttcgtt tcggcatttt cagcgttctg 3661 tctcaaaata taactccaag tatcatcctt ttctcctgta taagtaaatg ttccagaagg 3721 tacgttatct actttagaga aattttgtgc aagtccttgc cattttaagt attgatcttc 3781 gtcgagtttg aaacgctgcc aaaagtaata ccagatggcg cctcccagca ataaaaaagc 3841 cagcactccc agccataacc aatatttttt aaaaaagcgc tttttttgtt cttgagagtc 3901 agaagatgga cgtaaattat cactttcgtc gggttgagga agttttagca gttcttgacg 3961 cgcaatttct gcacttccgt aaggagtatt gagagaaagc agacgataga taaactgttt 4021 taaataatta tcattatcgg gccatacttg atggtctttg gggtcaagct gggtttgccc 4081 aacccataac tggaacgcta ctaaccctag agattctaaa tcctgcacgt gagttttcgc 4141 tttgggttga ggaatgctag ggggaataaa taggttttcc cagatagcga gatcacatag 4201 ataaactaca aatctttgtt tattttcgac tttaattaat acgctatcta aattaatatt 4261 gccatgctct aaaccccgct gtatttgatt cgaagaaaaa cgcagctttt gggagtgaag 4321 aaactctaga gtttgtagca cctcactcaa caactcacgg acttgttccg gcgtcattgc 4381 accattttgc tttaagtact gtctcagcgt ttgggatggt tgtacatctt tagtaattaa 4441 ataacagctt tctccttttt cggggctaat agcttcccag gtttgaatca agcgaaagtt 4501 ttgaactcga ttatctgcta aatctatccc tcctacacgt ctaaaagctt ccttacgttt 4561 
gtttatttca tcttggttaa aagtacgact aggtaacaag tactctttaa taattactgg 4621 ctgtttgtct tttagctgta tacccgaata taaacgtcca aaaccccgta cgccgagaaa 4681 ctttgttacc cgatagctac ccctataccc tttaatctca gcttcctccg gtaaaattgt 4741 cggaaaacca cactcaaaac aaaacttggc gcccttgact tcctgcaaag tttctattac 4801 tttttcgcaa tccaagtgag cattgtgaga acaggggtac tctttgtaaa ataattcggt 4861 agaagacata atataaattc actcaattaa tactaagctt gaaccagtcc taatttgata 4921 gcagcaacaa ttgcttgagt cctactactg actttcaatt tctcaaaaat attcgtcaga 4981 tgagctttga ccgtggcgac agtcacatat agattttttg ctatttcttc gtttgaggcg 5041 ccttgagtca ataattgtaa aacttcttgt tctctttcgg ttagatgtaa ctgcgaagac 5101 gctttataaa cggaactagc attttcttga aaacagcgga aaaaaccact tgccgcttcc 5161 ggtggcaaat aaatttctga cttaagtacc gtattaattg catcacaaag ctgatttgcc 5221 acacgacttt taaaaacata gccagcggcg cctttttgca tagcttgaaa aatccactta 5281 tcttcttggt gggcagataa aaccaaaacc cgaccagagt aagaaatatc tttcaagcgt 5341 tccatagctg ttattccatc actattcacc aattctaaat cgagcaaaat tagatcggga 5401 ctcttttgaa tcgctaattt cacaacttgc tcaacacaat ctgcttcacc tacaacattc 5461 acaggtaatg aagcactagt gctataaaaa cccaaaagac ttcgcaagcc ttgacgaaaa 5521 cgctcttcat catctactag taaaatcgaa atttgtttat tgttcataat cttttccata 5581 ttgtgagatg cccgtcgaag cccgtcttag taatatttct ttttagtagg taatatcaaa 5641 caaaactgtg cccctcctgc ggataaatta tctgcccaaa gatttccaca atggtctaat 5701 acgatttttt tggctatagt tagacccaat cctgtaccac caggacgctg cgaataaaac 5761 ggagtgaaaa ttttttgcat gtccaagagt gaaattccta gcccttggtc tgaaatttta 5821 attaatactt cgtcgtaaaa aacctgccaa ctaaaggtga tgacgccaga attagggcta 5881 aagtgaatcg cattactaag taaattatca aacacttgct tcatctgtaa cttatctaaa 5941 ggtaaagtag cagatgtatc agggatgtta acttttaatt gtttttgatt aattaacggc 6001 tgtaaatatt ttatactttc gtctactata cttctgaggt cttgaggaat tatttttaaa 6061 ttagactctt gaccacagga aattatatcg gtgagattgg tatctaaatc attcacgctc 6121 tcgcaaatga ttgttgcttg ctcgcgccaa gaattatctt ttaagcccaa aaataaattg 6181 tgtgcgtaaa gtttgactag agcaatgtta ttccgcaatt gatgaccaac tttatgaagc 6241 acctgttcta 
gtatctcagt cttagacttt tgctgcaaat tatctaagca tatatctata 6301 tattgattta gcatttgggc tgcatttgtg acgtaacgcc gcgatttctc cccaagaggt 6361 gtgtgagtga ttatttgaac atattctggt ttttgatttg tataacggat aggacaaatg 6421 taagaaaaat attttggaaa ctgggcaagt tgaaattcat gtacattcca aacatgagga 6481 aaatctgtta accaagcttc cgagcgtaaa tatgctattt cttgttgaga aaagctaggt 6541 tgattatctg cgtattcgat tctttcttta aatgctttta acgaaggatt gtagtaaagt 6601 atacggacaa agcaaattga ctcatgaaag gtaagttgtt ctgcttgtaa cccaataaaa 6661 gtttctgtat caaatgcgtg taaatcatgg cagttacttt tttgtttttc aagagagaag 6721 ctgatattca tagctttact cactttgata gtagattctg gtttcccttt acttctactt 6781 agtagatgtt tagctttcct tatgcagcaa aaagcagcct tcttagtata tcttcatatt 6841 ttttaatatt acgtaattat tacttataag aaataaatga tttttgagtt tggacgaaat 6901 ctcgacaaaa gtctaattac cagaagagtt tgcattcata taacctagac atatctgaga 6961 agcgttagta gcaagtatgg acatcatgct tacatgatta tttatattct tcaaacccta 7021 gcaggaatta tcgctggagg aacatcgctt tctagtacgg agcaaattga gtttagtcat 7081 ccgagtaatc gtaaagaaga aggtggagga ccgattctca atttgtacgt ttatgacatt 7141 cgtgagagta agcaaatcca gcattcggga aggcaagttg aacgcaagct gacagaagct 7201 aagcaagttg aaagccagcg caaacaagtt acgcaatctg cttctttgaa ttggtctccg 7261 acctggtttg atgtttctat gctactaacg gcttgggatc ggacggcttt aggtgagcat 7321 tacctgcttt cgcaagcttt aggtgtacta ctacgccacc gtgccctcaa ggaagagttt 7381 ctggttcccg aattgcgcgg ttatggaaat ttgaatctga cggttgctgt agaaccgcaa 7441 attgaagctg gatctttgtg gagtgcgcta actattgctt tacgtccagc tttgtatctg 7501 atggtgacag ttcctgtagt gccgcaagta tcgccagtat atttagtttg gcaacggact 7561 atcggagtag acaataattt tcaacctgaa cttgaactgg aaaatgcaag tgttcataga 7621 aatggaagta ttgaaagttt caccaagcgg gtagtagtcg caggaataat caaaaacgct 7681 gttaccaatt taccgatgaa ggaagcaaaa gttgaggtga tcggaactaa agaatcaact 7741 actaccaata aagaggggtt gttctacttt gaagaacttc gtaatggtaa ctatgtatta 7801 caaatcaaca gtccgggata tttgcccctt caagtcaatg ctttggtaga tggttcaagc 7861 tgtaccttca aagaaatctc attaaatccg gcgtagagac gcgaaattat gcatcaattc 7921 gcgatatatc gcgtttatgc 
agatggttga ttaaattttt acattggaga tatttttatg 7981 gcaagactgg attattttgc tccgggtgtc tatatcgaag aaatagaccg aggtagtcgt 8041 ccaatcgaag gtgttagtac ggcaattgct ggatttgtag gctttactga agatgtgcgc 8101 ggtggggctg aattgttcaa gccgatgctt gttactaact ggacacagta tttaaactat 8161 ttttcccgtc ccaattcgga tggttttact gatttcaatg cttatttgcc ttttgcagtt 8221 tatggctact ttatgaatgg tggtggtcgc tgctggatta ccagtattgg tacgcagtta 8281 cccaatgccc ccaaaccacc agacccacaa ccagcttcag ttcggattac tggtcgaggc 8341 aatcgtcctt cgctacagtt taccatgcgt ccggaacaaa ttgccgcagg cacgatgaca 8401 atcgtgatta gcgatagtgg acctcgcccg ttaccagaag gaacagaagg atctgtccca 8461 ccgaatacag gtgaatactt caaggtgcaa attcgtcgag gagatgaaac tctcgaagaa 8521 tatgacaact tgacaatgaa ccgcgaaggc aatggccaat ttgcaactta tgcagttacg 8581 gcattgcgaa attcgatgtt cataaccatc gaagacattt ctcaaactgg acaaccttta 8641 gggcgtcgtc ccggaaatgg tcaatacgaa cttacagcac caatacctgc cacgccaccc 8701 gatagatttt ccagcgatat cgagggtgta cgcgacgatc gcacaggggt acgcggtcta 8761 tttgaaatcg atgaaatcac aatggttgct tgtcctgact tgatgcgtgc gtatcaatca 8821 gaattgatga acttagacca agttcacgcc atcatggaat tgatgctaag catgtgcgaa 8881 ggcgccaaca ctggggatat tcctaacccg cccaaccgca tggtagttct tgattctcct 8941 cctgactgtc ccaaaccgca gcaggttgta gaatggttga accgatttaa tcgccgttcc 9001 atgttcggtg ccctttatta tccttggatt aaagtcgcta acccacgcga tcgcggcaac 9061 ccaatcagcg ttcctccttg cggtcatgtg atgggtgttt ggggtcgtac cgatgaaact 9121 cgcggagttt acaaggcacc cgccaacgaa gtacccagag gtgttatcgg tttagactac 9181 gataccaatt tccgcgagca agaactttta aacccattgg gtataaattg tatccggaga 9241 tttcccaatc gagggattcg tatttggggc gcccgtacac tagttgaacc tgataaaacc 9301 gaatggcgtt atatcagcgt tcgtagattg attagctaca tcgaaaaatc tttagaactc 9361 ggtactcaat gggtagtatt cgagccaaac gacgaagatt tgtgggcacg cgtgcgccga 9421 actgtcagta atttcctaga aagaatctgg cgtgaaggag cattatttgg agcctcaccc 9481 gaacaagcat tttacgttaa atgcgacgaa gaattaaacc caccagatac cagaattcta 9541 ggtcgtttgt acatcgaagt cggtgtatgt cccgttagac cagcagaatt cgtcatcctc 9601 cgtatcagtc agtggaatgg aattgaagac 
gaagagtaac aaaattctcg ttttgtctct 9661 ctttctttgt gtcctttgta ccctttgtgg ttcgtttctt aatctacatc ttcacaaatt 9721 aaacaggagt ataaaacatc atgccaggag aatttcttac tgcttgtaaa ttttactttg 9781 aagcaagtgt aatcactgat aaatttatca aagaaatcag tggcttgggt gtagaaaata 9841 cccctgcaca ggaagttcac ggctcatcaa aaaaaggagt aatctcgcgc caagcaacac 9901 caactgtcgt taaatttact aacataacaa taaaagttat agcgactgat gataaagatc 9961 tatatgcttg gtataaaaaa tgtaacgaag acatggggga tccacgtcag tggatgtcta 10021 atcgttatga tggttcggta gtagcttacg accaacaagg tactgaaaaa gcacgttgga 10081 acataaaaag atgttatccc tgtaaatata caggacccac cttaacagca tcgggtggcg 10141 atatggcaaa tgaaactatt gaattagtac acgaaggaat tgaacgtatt ctgtaagaag 10201 gggaaagggg gaagagaaaa cagtggtaca agctgggcaa aatcctgaga ttcttacggc 10261 acatcgtttc tatttagggc tagcgctaga cgggcaaaaa gattctgatg cttacttttt 10321 agagtgtcaa ggttttacaa gaacacagga tgtcattgaa atttgtgagg taacatccca 10381 aagttgggga actggacaat ctaaaggttt accagtcaga accaaaattc ccggcaatgt 10441 taagagtggc aatattactc tccgtagagg gatgaccaac tcaatagatt tctggaactg 10501 gtttgataaa gttcaaacag gtgggtgggc aaagcagcga aaaatggttt ctttgtcgat 10561 ttacaatcaa gcaagtgttg aagttgccag gtttgaaatg gaaggtgctt ggcccactcg 10621 ttacaaaatt gccgatgtca acgcccgcag cacagaaata gaaattgagg aagtcgaagt 10681 ggcttttgag gaattcaaac gagtgaaata attggaggtt gaatacagac tatgtttgca 10741 acagagtttg agattacatt gccaaaaggg tatatagact ccgacggcaa cctccatcgt 10801 aaaggtatca tgcgtttagc aacagcgata gacgaaattt ctcctttacg tgacccgcgt 10861 gttaaagcta atccagctta tgccacgatt attattttgg cgcgtgtaat tactagttta 10921 ggtgccttat ctgaagttac tcctgcaata gtagaaaact tctttagcca ggatttaaat 10981 tacctccaag atttttaccg taaaatcaac ggcttggaac ctgcaactcc cccaatttca 11041 gattctcaac ccccggtttc agattcccaa cttcttgaag aagtcgggaa ttcaaaaatc 11101 ccataaaatt aagaatttag atgcgatata tcgcgtctct acattattta tgcgtcgtaa 11161 ggaactttgt actgaattca attttacgct tcctagagga ttaatcgatt cccaagaccg 11221 ggtacatcgt catggtgtta tgcgtttagc tactgccaaa gatgaacttt gggtgcagca 11281 agaacgtaaa gttcaggaaa 
atccagctta cggcgcttta gtgatgcttt cgcgggtcat 11341 ctcgcgctta ggcagtttga actctgtaac tcccgaacaa ctcgaagagt tagtattgcg 11401 tgatatttat tacctcagag aattttacaa ccgaattaat cagcaatcga atgcatatat 11461 cccagcccat tgcccacact gtgatacgca atttaacgtg aggttagagt tagcggggga 11521 gtcataagct acccctcaga tactttatat gaggaggtag cttttattgc ttaccatttt 11581 cactggtcgc aagacgatat tttaaaccta gaacatatta ctcgtcagcg atgggtaacg 11641 gaaataaata agattaacca gaaaattaac tagaacatca atatgaaagt caaaatttcc 11701 atatctccaa cacaaagcga attcgaggaa gttgatctga ctctaacaac cagacgaggc 11761 gctgaatgtt taattggtcg atctcctgat tctgatttgg ttctcgatag tactgatgtt 11821 agtcgatcgc atggtaaatt tttttctcaa ggaggaaatt attatttctg cgacctcggt 11881 agtcggaatg gatcgatagt caacggcaaa ttggctgaga aaaaccactc atacatactc 11941 aaagatggcg atattattcg catcggtgat tttgtgttgg tgatagaaga tgaaatgcca 12001 gatagccagc aagcggaaac agtagttaga attattaatc cttcggaatt ttctaactgg 12061 cgccaaaatc aaaatcaaag tgctacacca caggaattac aacctgttaa taacgaatca 12121 gttagttctg ttcctgctgc cgaaattagt caaaaacatg aagaagcaga aattgttaat 12181 acatctgaag aagtagaaca atctaaagaa atttttattc agcctgacga tattgtcaca 12241 cctgataacg ctatcccctc tgcgcatgga acaagcgcgt cagatgttgc cttacaagcc 12301 acagcagaat ctgacgccag cgaatccgat cgcattgttc aagcccacga tctcgccagt 12361 ccagcaccgg aaaccgagaa tgtttcaatc atagaacacg atattaaggt gtctgagtat 12421 agcattgttc aagcccacga tctcgccagt ccagaaccgg aaaccgagaa tgtttcaatc 12481 atagaacacg atattgaggt gtctgagtat agcattgttc aagcccatga tatcgccagt 12541 ccagaaccgg aaaccgagaa tattcctaat gtagaacatg aggaagaaat tgctccacat 12601 acacaagagc aagaaagaga aattatcaaa gcaataaaca gcgagcgcga aaattttgat 12661 ttagatacac catctcaaca aacagagaaa gaattggtgc ttgaggacac ttctatttta 12721 acaccacatg aaagcgaatt agtagcgaca agtcaacaga ctgaagaaat tccatcagaa 12781 aatcaagaat cagaagaact agaagaacaa gaaaaatcat ctatacagct tgctaaaatt 12841 ctcgaagaaa aacagatcct gcttgtcgct caccagagca aaaaatctga acttacagaa 12901 ttagtatctg aatacaaaga atttttatcg tactgcttaa cgagaacatg gcaaactttc 12961 
agcgatgatt tgtacaaaca aacaggtcta agcgtcaccc aagaaattcc tccagcgaat 13021 tcgggagggt atcaagcaat taactcacta gttaactctg gagaaattct cgcagtaatt 13081 tatctcagag acttaatgat ggcgcctcaa caaggtcaag caagtgaaga agcactattg 13141 aggatatgta atatcaacaa tgttttactt gccacgaatt taccaacagc aaaagcaatt 13201 ctttattatc tcaaaaattt gaaagattag cagttcgtag taaacacttc agcgcttata 13261 ttttgaagaa ctgaagtcct tactacgagc tacgaacgct acgaattgta ataatatgaa 13321 tagtaatagt gataatctta tcagcgctag tctttcaacg caaagcttaa cttatcgacc 13381 aaatcaaggc gaggtttctt ttgaagtcac tgttaataat gacagcgatc gctttgccaa 13441 ttttcaatta gaaattatcg ctgctggtga aattcgcaat cctaattacc gttggtatcg 13501 tcttgaacca gaagttgctg cagctcaacc tcacggtagt agcactaaat tccaagtatt 13561 tatttttaat acgccaattc ctggattcgt tggtacagtt aacctgtctg taaaaatatt 13621 ttctcctcag ctagcacgag aacgcagact catactgcgt ttggaaattg agcgagataa 13681 gaaaccaatt ctaattagtg ttgaactacc aatacgcacg tttcaagtat atccaggcaa 13741 cgctatagat atacctgtac gagttataaa tcagggacaa gtagctagta atgtactgtt 13801 gcatttaaca ggtatcgatg cttcttgggt gactaacagt gctgaacgga ggtttacctt 13861 aaatgctaat agtcaaacgg aagtgagttt tagatgtgag ccaccttctg tagttcaagc 13921 acctagttta aattacttgt ttattgctga agctacaagc aataatagtt acccagctaa 13981 tgcagaaggt aatttggaag tattgcctgt aggttatatt aattttagca cgccacaaaa 14041 taaacaaaga attccttctc ggcgtttgtg gttgccggac tggaagtcta atagtgcttc 14101 ctatgaattg ctatttaaaa atgtcagcaa tctacatcaa gaaattaatg tgcaaattca 14161 gggtagagat tggcgaaaat gtacatttaa gaagttacct gaaattgcga atttaaattt 14221 aggagagaca gtaaaagtaa ttttagatgt taaaacaaaa cgtccttgga tcggaatcgg 14281 aaaaacttta tttttagaag caaaagcaga actatctgac caaaggttag gcagtacaga 14341 ccctgctact caaagtttag aattagaggt tttaccaatt atacctttgt ggttacaatt 14401 agcgattctg gcactgctcg cagcactact tgtttggtgg aagcagttta accaagaagc 14461 aattgcacat acaggttttg taaattcagt tcgcattatc ggtggcggta ctggctcatc 14521 tattgttagt gcttctaatg attgtacatt gagatattgg agtattagag aatattctat 14581 agaaccaaat aaagatgcaa cactagaatc aaaaccttat gaacaaaatt 
tcagatgtgc 14641 taaacctcaa aaaccaaaag gagtattagc atttgccaac aatccaatta cagttgtaag 14701 gtttgtacca gttgaaaata accgtgttgc tatcggtcta gagaatggaa gaattgagtt 14761 gagagaagta cctacaggta aaagaattcc aattccagac cctgaagaac aaggcgacaa 14821 agtatttgat ttagcattta ccaaagactc gcttaattta ttcagtggtt caggtaaagg 14881 taaagtaaga gtatggtcga gagaatcgac tactacaaat ttccaggaaa aaccagtagt 14941 cattgagcta gaaaaacagc aagaacaaaa actaacacgt tttggaatcc gtgcattagc 15001 tttgagtccc gacgatgaaa tgcttgtggt ttcgggcgag tataaccgtt ttttaatatt 15061 acataggaat aaaaatcaac ctaacaaccc attcaaaaaa atattagtcg aaagactgga 15121 aaaaatagat ggaagaggaa aacaaaaaga ctctgtttta tcgttagctt ttattcctga 15181 ctcgcaagaa aaaattttgg caacatccga ctcgtttgga tttattgcaa tttgggactt 15241 aaaacaatgt aaacctacaa ccaataataa tcaacagcaa atcaatgacg ctaattgtaa 15301 actgttagac cgttggcaag acgagtcaaa aaatcctata cgaactttag catttagtga 15361 agatggtaaa tttctagtta gtggtggaga tgacggacga atagttgtat ggtatttaac 15421 ttcgggttat caactcgaca aaacaaattt gccagaaggt aaaactattt tcccaggttc 15481 tacaaaaatt agaagcattg acgtaaaaag cagtcaagga atcgtggtta gtggcagtga 15541 agattttaaa gtcagactcc accatattaa ctaaaataat ctaaacaatt atgcaagcaa 15601 gaactaatcc actatccatt attattaatc cgcctggaat tcaatatgga atgccaggag 15661 atattgtatc actttatgct gtcgtgatta atgagggcga ccaaagcgct gttattgatt 15721 tgttttttac ttttgacgag acatttcaaa aaattagtgg ttggtctgct tctccaagag 15781 ctagtttagc agtggcgccg aaggaaagta gcgacgaagt tacgtttgat tttgaaattc 15841 cagttgatgc tttggcgggt acctatgatt atacttttgt cgttgattct ccagaacatt 15901 atccgcaaga tactccgata aactttcccg gtcaactcaa agttttactc aaagaacaaa 15961 ctgttatccg cgcttttgac ccaacttttt caattatacc tgccacaaat ccaaataaac 16021 ttataactta cagaatcgac caacctttac aagtggtggt aaaagtcgaa aatcgctcga 16081 accgtgtaga cagattccgc ttgacttgtc ctgatttaga tgaaagttgg tttaaaatat 16141 cttaccctgc tactgggttg gaaggagcag gcttatttga tgtcagcgcg ctagaactca 16201 accctcattc tgaaggtgaa atctctttag aatttcgtcc tcctattgat actctcgctg 16261 gtagctattc tccaacgatt cgcttgcttt 
cagaaaataa tccggatttg atactgctag 16321 atttaatata cattaatcta ccgaccaatt atcagttagg tatagaactt aatacgattt 16381 taggaaaagt tagccagaaa gatggcgtct ttgaattagc acttaataat caaggcaact 16441 taatacggga actatttttt aatgctaaaa ctagggatga agaagaaatt tgcacttaca 16501 aatttgatcc tagtgaattt aaattattac ccaatacaac cgaaaaagca agtctcttag 16561 ttaaacctgg cccttggtgg cgccgaccat tcttcggtca acctttacca attaattttg 16621 aagttaatat tacagataag caaagctatc ctttagctaa tgcatctcct caagccactt 16681 tattatggaa agcacgtcct tggtggcaat ttttactatt aattttacta gggttggggt 16741 tatttgcagg catctcatat atgatttgga gacttttaaa ccccgaaccg ttaacaattg 16801 aaagttttag tactgataag cgaaaaataa ctgaaggtga tgaagttcgc ttaaactgga 16861 aaattaataa ctataagcct aaccgaatag aaaaattggt agtggtcatt ccacaactac 16921 caaataatgg ccatgtatat tttgataaaa aaaataatat taatgaacta actaaaccag 16981 ttaaatctaa tcagtatcct ccctgtaact tcaaacccca ggaagaattg gaatgtaatc 17041 gtgtgacaac cggaattaaa aatcaaggta aatacgtttt tgaactgcaa gtatcttatc 17101 tacagggttc accgcttttt aaccgcagaa gtcaaaccgt taccaaaact gctgaaactg 17161 aaattaccaa aaagccaata gctgaagtag tagactttaa agtagataaa tcacagtata 17221 aaactggtga aagcgtatct gtgtccttga caattaagaa tccgcaatta cttaacaagc 17281 tgataattaa caagaaaaca aacaatttac tagtaggtaa tcctgtcaca ctagaatttc 17341 aaaatgggaa atttaaagac cccaaactcc aaaaatgcaa agaagaaaat aacttgctca 17401 aatgtacatt ttctttacct gcatctaacc caggtacatt tacttatgac atcagcgcta 17461 tttctggcga tcaagtaagt aataagcaag ctcaaaatcc agtcgaaata ttacccaaac 17521 catttaaaat tgtttttttt aaaattaata atattgacaa cagccaaaaa caaagtattg 17581 ttttgaatga aggagatcaa atcaccgtaa gttggaaagt agaaggagaa aacgtttcag 17641 tcaaactatc tctcaacaac ggagacgttc cagttccaca aactggtaac acatcccaaa 17701 ttcctgttga cgtgaacttt ccacgccaaa ttatacttac agttacagac aaatttggca 17761 agctccctcc tcaaccaaaa ggatttttaa tcatcgttaa accgaaaccc actccggcat 17821 ctgacttaat tacaccttct ccctcaccta cacaaaatat ctttaaacct cttcctgctt 17881 ctccacgttc aagaccacta ttttaataca atacacatga cgtttgtcgc atcaaaatct 17941 ttcaaatgca 
gcgccaccac ctccgccctt cttaacccac accgcagtag tgcaattaac 18001 gcgctatctc gcacacccag cggtgaggag tcagcttgac aaacccgaat gagtgcagta 18061 atttcatcgg gtttgagtcg cctacctcgt tgttttttac tcgccttaat actacgtaaa 18121 tctacagctt tttggtagtc agtggcactt atgaggtcga gtttgaaagc ttcttgcaac 18181 accctcctta acgcacacaa cattttattc gctgttgctg gggcgtgttt tttcatcagt 18241 gaggtgctgc ttttgtgcct accatttccg gattgactgc atccttacgt gcattcacaa 18301 tgtactcgtg gaatgcatct ttcacataag gactaggatt ttctgcacca aacaactttg 18361 ctgcatttgt ttcattttca gtaaacagaa aagttggtgc atcaatattt ggcagaggtt 18421 caacgacaaa gcggaattta ccgagtgttt cctgagaaag taacagttct ccatcttttt 18481 tatattcaat gcacggttta gaccaataac tttcgcctgt acgaccccaa gaccaagtat 18541 ttttgaacca aattgttggc aaaaggtgta gtgttgcagc ttgtgaactg cgattggcaa 18601 cggtaatttt aattaagatg tcgttagaag cagctttggc atactcggca aagatgtcaa 18661 agtagcggtt ttcctcaaaa atcccagtat cgataagttc aaactcacct aggtcttgac 18721 cccggtgacg attttcctca actaagcgcg tgtagggaaa ttcactttga ggatatttgt 18781 agagcgcttt catgtaagag tgagttggtg ttgagtcgag gtagaagtaa cattctttta 18841 catcctcacc gtgatttcct tcgttactac tcaacccaaa taaacgttct ttgagaatgg 18901 gatcaacccc gttccagagg gcgagggcaa aacacaaacg tccttggcga tcgcacatcc 18961 ccaaaagtcc atcctcaccc caccgataag cccgactgcg tgcttgatcg tgagtcaaat 19021 aatcccagga tgaaccgtct tttgaataat cttcacgtac agttccccat tgcctttctg 19081 caagatacgg tccccagcgc ttccagtttt tctctcttcg cacctcttct ctaaggcgct 19141 gggactcggc atcaagattg tcgttctggt tcattaaaaa ttgctacaaa tcgtgagttt 19201 aaataaccat tactcaagcg tgtcgggaaa caactactaa taaatccttc ttaggtattg 19261 aaattcctcg cttaggtcaa aaaatttcca aaacgccctt ccccaagctt ggtacgttct 19321 tagcgttttt tctgtcgtaa gattactagg tctgcatgca cttgtgtcaa aagcgagtta 19381 tataggttta caccgctgcg tttcatgacg gactgagttg acattttgtg ttcttttgcg 19441 ttggcgtcaa tccaatcaag ttgtgctttt gctgcttgtc ccaccatttc taaacgatgc 19501 tcgtcgctct gaatataggc atagtgcata gcggcacaca gcgcgtggta agccgtttcg 19561 taatatgcgc tctccaatgc ctcctggctt atcgctataa ggcgtgagta catatcgcca 
19621 tattcgctgt gtgaagtgtc ttgcattatg ttcttcttgc ttgtatgact gtttctagtt 19681 tgacatcctc ccaccgctga acggagtacc gttacagcgg gggattccaa gcatcactgc 19741 ttaggcttcc tctttccacg actcgacttg ctttgaggag tttcctcacc aaagtagagg 19801 tcgatgtctc cagaggcgtt agttcccgtg tgccccacgg tacgaagtcc taattcaagg 19861 atattgcgag ctgcgttata atctctgtcc tgaatatgcc cacaatgttg acaagtatgt 19921 gttctggttg agagagcttt cttgacagtt tcgccacagt tagaacagtt ctgagaagta 19981 tagtgaggtg gcacagcaac tgtgactacg ccaaacactt tcccaaaata ctcaatccac 20041 tggcggaact gcgtccacgc tgcatcggag atagatttgg caaggtgtcg gttccttacc 20101 atatttcgca ccttcaagtc ctcataggct accaaatcgt tagactggac tacgcacctt 20161 gctgtcttca cggcaaagtc tttacgctgc ctacttacct tgagatgctt gcgagccaac 20221 ttgtttctaa acttggctct gtttttagac ccctttttgg tcttggacat ctgacgagaa 20281 agacgtttta gcgacttttc gcttttacga agatgacgtg gatttgcaac tgtatttccg 20341 tcagaatcgg tgtagaagta gttcaatccc acatcaatac cgacattgtg agccgttggt 20401 tctcgtttct caattcgttc ttggtcgata cagaattgag catagtaccc gtcagcacga 20461 cggacaacgc gaacacgttt aatctgcttc aactgataaa aatgcaaatc acgggttccc 20521 cacattttaa aggttcccgc ttcaaacccg tcagagaaag tgatataccg tctatcgttg 20581 gaaagcttcc agccactggt cttgtattcc acagacccat gtgtttgttc tttcttgaag 20641 cgtggatatc ctttctttcc tggcttgctt ttcttgcagt tgtcgtagaa acgagatatt 20701 gccgaccaag ctcgttcagc actagcttgg cgagcttggg agttgagatt cttggcaaac 20761 tcaaattctt ttgcaaggac agcacaatat gcgctcaggt catatttgtc aatgtccttg 20821 ttatccatcc agtgcctgag acaactgttc cgaacaaatc gggcagtacg gatagcttca 20881 tccagctttc catattgctc gttcgttcct tccagttttg cctcaaacac caacatctta 20941 tttacgctca acgttatagc gtaaattata gtctattggc tcgtgaggtg tccacagaga 21001 tagacaaaaa ctcaactgtc ccccttcctt gtccttgctg tgcaagaaat cagttggagg 21061 acgtttcgtt tttgtcccgt gattcatgag ccagtactgc aggagggtct ccctccgtag 21121 gtatctggcg tcctcaccac caaatcaaag attatggtgg gagccttctc acggagcgag 21181 gtaaagaact cttcatttgg gactataaaa atgaccatct caagctattc cgccgttaac 21241 gcgaacattt tgcccagtta tccaccgcgc ttcatcgcta 
gcaagaaacg ccacaacatc 21301 agctatgtct tgtacttgtc ccagtcgtcc taaagctgcc atttgagaga aacgatttat 21361 ttgttcttct gttttgcctt ctctgaacag ttctgtatcg gttggaccag gagagataac 21421 attaactgta ataccttgtc ctcccaactc ttttgccaca actcttgtga tttgttcaac 21481 agcacctttc gttcccacat acgcactata agtgggtagt atcatagcag ttgttgatga 21541 agaaaaattg ataatgcgtc caccatctgc catgtggtgt gcagcttgtt gacaagcaaa 21601 ataagtgcct ttgacattaa tggcaaaaat cttgtcaaaa tcttcttctg ttacctcaat 21661 aagtggctta tagaaattaa cgccagcgtt gttgactaaa atatcgactt tgccaaaacg 21721 ttcaattgtt tgctcaaaaa gtcgctggat atcagcaact ttgctcacat cggcttggag 21781 tgcaattgct tgtactccaa gattttcaat ttctgtaaca acttcttgtg ctttgtttgc 21841 atttcctgca tagttaacca caactgatgc accgttacca gccaatttga gtgcgatcgc 21901 acgtccaatt ccccgtgatg atcctgtaac gattgcaatt tttcctgcaa gaggtgccat 21961 aaattatttc ctttacttaa ccttgagtta actttccact aacatgaatc ttgcccctag 22021 gtgcaacctg ggagcaagat tgggagcttc tatgtcctta gagaattata gagaatgagc 22081 tactgctgaa caagaatttt tatttcttca ggtaggtttc acgccaaatc gaaaagcaaa 22141 atttctgcac caatgttggt actaatttcg agttgttctt cgccactcat ttgcacgcca 22201 tcacctgctc taagttcttc accattcaag gtgattatgc cttgagctat ttgcaaccag 22261 gcataacgat tgggtttgag gtgatagttg ataacgtcac caggttccaa aacagatgtg 22321 tataagtcaa catcttgatg aattatgata gcaccatcac gtccatcttt ggcaccaatt 22381 aaacgcagtt tcccgcgtct ttcttcaagg gaaaaagctt tttgatcata ccttggttgc 22441 aatccttgtt tgtcaggaag aatccaaatt tgtagaaagt gaactggttc agtttcagaa 22501 ggattatatt cgctgtgcat aataccagtg ccagcgctca taatttgcgc ttcaccggga 22561 cgtatgactg aaccagttcc caagctgtct ttatgttcta ctgctccttc tagaacatag 22621 gtgaggattt ccatatcgcg gtgtccatgt gtcccaaaac cagcaccggg aacaacgcgg 22681 tcatcgttaa taactcgtag agaacgaaat cccatgcggt ctggatcgta aaaattacca 22741 aaggaaaatg tgtggtaact atctagccag cctatcttgg tatgaccgcg tgaatttcga 22801 tcatgaatga ggtgagtttt ggtattggaa gccatagatt ttccctccat tttgatgatt 22861 gattgcagca aaaacattaa gcacctttat tggctaccag tcatctattc tggttccaaa 22921 agcgggagca tggacaaact 
ttctctttgg tggcaatact actgaaatga ttcttcttga 22981 ttgctcgaaa aactagggat ttcgtgaatc aatattccac ttaggtatgg ttgaatacta 23041 ggttacaacg gcgatgttta caaaatcacg aaaaaaaatt tttctttaga tttgccttgt 23101 tgcacctgtt ttttaactat ctcaatctta aaatatgtac atataaaatg aaaagtacgc 23161 actgaaaagt gctgtactta ccaaaaagat actgtcaaat taatttcttt gcacttaaca 23221 atcacgatta ctataggttg gattgagcta agtgtttcat aaaagataag ctgggttgac 23281 caacgttacc cagcctacaa tatatgacta atagctatta gctattagca acaacagact 23341 tatctgacag cagcttgtct aacaacagct tgctcttcag ttttaatatt gcctgcatta 23401 ccatttccat tttgggtgac agcagcttgc aaatgcgctt ctaggaacca caggcgcttg 23461 tcaatggtac gagacacttc ggtgtaaagg tcagcggtgt cagcatcgcc tagctcggtc 23521 gttttgtcaa tagcttcccg caactgctta gcatagggtg caaagcgctc tgccaaagct 23581 gtgacatgct ccatacccat taaaatatca gtaggatatt ctggcacgaa tgaatttgca 23641 gcagccatac gaactgtgcc acaagcataa ccacccaagg cggtgatgcg ctcagcaaac 23701 atatcaatgt actcttctaa ctcacctgca atttcatcaa acaattcgtg caactggtag 23761 aagtcggtgc ctttaacgtt ccagtgcgct tgcttggctt ggcttttcag atccgatgtt 23821 gcggctagag tctggttgag aatgccagcg atctgcttgc gcgcctcagc aggaatgtca 23881 atgcgggttg ggtaaagacg tgtggatagg ttattgtcgc tcatagttct gcttctctga 23941 tgggggtgac aaaacaagtt tacagatact gctatgctct cggaaactct ctaccgacag 24001 atggaatttt tgctgaaaga attataatta ttgttaatat tttactcttc aattcataat 24061 caattgcgat ggttgtctat gccactgaat gtacgaccgt aaaggcgaat cgcgaatggt 24121 gtcattttgc aaaaagtcat ggggtcaatt gagggggttc aatacccgca cggctgtgaa 24181 aatttcgcta cgtttttttg caacacaacc agtgttagaa cacaaaagtt gtcctgagtg 24241 taccaataat tgcgtcacga ttttctgagt tttgacctgg tgaaagaacc caaatcaaac 24301 ctggggtgat tgagatgttg ctatttaact ggtatttgta gaagccttct atatgaattg 24361 gtatgtcatt cttggcaccg cttgcaaact tacttgcatt acctaaataa ggctccacgc 24421 ctactacaat gccacctaag ttattttttt tgccaagatc aggaaaggca aaggtgagtg 24481 cataagtcca aatgtctcca gttccaagat taacgaagtc agcattgatg taagaaaacc 24541 agccgttgat tgcaaactca gaactaactt tgtacgtaaa ctctgcacca taggcattga 24601 
cttttgatgg aactgtacca aaatcaaatc ctccaccatt ggctaggttt gtgcccactg 24661 aagcaagatt gctcccagta tcaaaaatgg cagatttacg ataagcatta atatatgtaa 24721 gaccgattga aaactggtca cttggtgaaa aggttagctg agctaaagct gcatagttgt 24781 cattaaacaa atcagaattg tctgctaaat aaccagcaga taatgataat tttttactca 24841 atgagtaagt aaatcccgct cccgagccac ccccaatcaa ataaatagga ttgtgctgac 24901 caaagattga aagcgaaccg acaccactgt cgtatccttc tagataagaa cttactgtag 24961 gcacgaagtc gtagtggact cctccaactg ctgcgacata accttcaatg ttttcaccaa 25021 ttggaaaaaa gtaagacatc cagtcaacat agattgaatt gttggtattt ccgaaattaa 25081 aagtctgaat accttcaact ccattacctt tgagattgaa gatatttgtg ttacctgcag 25141 ccagtctaat atgtaggata tcttttcccg taaagctgtt ctgaatatcc aagcgaaccc 25201 tttgttgaaa aacggtgtta ttattaaccg attgattaaa ttcgtcggtt attgcaaaaa 25261 caacttcgcc aaccagttta gcagttgtag aaaactcttg tgcctgtaaa gtagctgtct 25321 gagcctcaag tgtatcaact cgacctcgta aagtggcaag ttctgcggca aattcctctt 25381 gcaatttttt taaagtttct aaatcttctt tcttaactaa atcaatagta ctttcagcaa 25441 tgagttcatt gatgcgattt atgcaggcat ttaacccagc cgcaaactcg taacggctca 25501 aagcacgatt tcctctgtag gttttgtcaa gatagcctgc aatgcaacca tatcgttcaa 25561 ccaaagactg cactgcttga aatgcccaat ctgttggtgt tacatcagat agttgagaaa 25621 ctgaattgat ttgagtcatc gcctctgtgg ggcgggcgct ctgcgccatt gcttctgtta 25681 ctaaaattgt atttgtaagt aataatccac caactagact tgtcaaacca agagttaagt 25741 gacttgaagc tattttgaac ataatggttt cttaaaccgg acaacaaaat gtacttaatt 25801 ccctgattta taccaattct ctgtaaactt gcacttaata cttactcttt cattcttggc 25861 gttctagcct acggcacgcc cttcgggtat gtcctgagcc ctatggctaa cgccacggcc 25921 ccgtgccgaa cgggcacggc cccgtgccga acggacacgc tacgcgttag ccctcgcttg 25981 cgtgcgcttt gcgcatacgc agtcgcctct gtcgggaaag ccctccgggt atctccttcg 26041 gagacgctac gcgaacgggg gttcgcaatc gacgggaacc ttaagagctg cgaccccctc 26101 accgtcattc gcgctgtctc accgctgcgc gtctaaggcg tcttgttgct aatcgtttaa 26161 taaacattaa gtgcatcctc atggagaatt ggtattagta acactgctgt tttttgtagg 26221 acgccaaaat ctacgaacat tagcgaaaaa aaatcaagaa tcaattatca 
atgaacaatt 26281 cccccacaac gcctcggacg ggggcttgaa ataacgaata tttttgttgc tagctacaac 26341 tgaataatca gttatcaatt atcaattatc aaatcgataa ttgataactg atagcggttt 26401 aaaatccaaa attgctagct atttcttact gttggcgtac cagagataga tcccctacac 26461 taactctcac caattggttg tctcgattaa aagaggtaga tgtgtgaatc ttgtgtgcga 26521 cttttggtat tgagattgtg ccaattaatg tgccattaga ttgccattgc cccacatacc 26581 gtgccgatgc taaccgtgga tagaacgaac tgcaaaactg gtaaagtacc aagcgcaaac 26641 cacttacaag catactgctt ggcaagtagt caaaccattg ctgtccagtt tgacgtgcta 26701 ggggtagtct gggcattgcc agaaacaaca acgcaagcct tggatcgtca ccttccaaac 26761 ggtaaggcaa ggaggcgacc actactgccc tgaaatcgcc gccgtgccgt caactcataa 26821 tccaaggtct tctctggtga cgattggggg tatcgtgcca gcaactgatg ccccacaaga 26881 cccaatagat acaatagtaa caaggtgact aagttgtagg gtaacgcatg cactggcaac 26941 gctaccgccg cccagaagac actcacagct tgacctagca aggctgctaa tgcaacacca 27001 agagctgcta gcatatagct ttgaggcgat ggcagcaaat aatacccgcc aagcgccaga 27061 gccaccaaaa tgaaatttag tgccgccggg tcgtaatata cataagcgaa cgttcctgtc 27121 aagatgcctt gaataaatga cccaaaagca tagctaaaca ccgccaacaa aaattgaatg 27181 cgcgaattca acaacagcaa caacgcgacc accataccaa cccacacatt aggcaaaaag 27241 aagattaacc cgagtgtgcg taggaagccc tcaaggggga ctggcaaacc aatcgagtag 27301 gcatatgcag gaacgacagc gtgttgcaaa ccagcaaagc gaaacgctgc taaatgaacg 27361 ctccaactca caacaacgaa cggcaggctc aagactggta gtaaaaagaa cgtgtacaaa 27421 acatgggata acgtccaagt cagaacaaat gccagtatgc cagcagtcgc cgccagaaac 27481 agcgatgcca cactcggttg gaacaaataa ccaacactca gccccgccaa gagcgggtta 27541 aatagtaatg gtccccgctc taaatacatc acatcaactt gcgtcactct ggcaaaggca 27601 actgccgcca acaccccaac aagccccatc accatgactg agggttgcag cagcatagcg 27661 gacagcagca gcgccccaac cccgacacct cgcagaaaca gaatctcagc agtggcagca 27721 cacacagcac gcaccagatg tcgaagcgtt cgtaaccatt gcatggtagt ccacccttgc 27781 gtcgtttgct tctggttgtt taaggtcgtc agcttaagac gccccagggc ggattgataa 27841 agcacattcc acatgtcctt agctccgtcc tgtggtttaa aagttacaac tttgtcgtga 27901 gatcatggaa catagcgttc gttgccagtc 
ggggcgggag gtgttcccgt cgttccagat 27961 cactgaggtc ttccgccgca cgaatcagtt caacgtcgcc cgtctctgtg atcaacacca 28021 cattgggacg atattcaatg aactgtaacc attgcgtgtt gttgtaggca cccacggtgg 28081 aaatgaccaa ttgggtgcca cgcgatagcg gggggaggga aatcttgtca tcaacaacat 28141 caatattcat gcacagtggt ccgtagatga ctgaaggttc gtagggacca gaaactggtt 28201 gagcgagggc gatatcaaag cgataccaaa agcttgtgaa cagtaaattg acaccagcat 28261 ctatgacata cgctcgggtg ccatcgggca gtcgcttggt accgcacaca ctggtaatga 28321 gtgtccccgc ttcatcaact actgcccgac cggattcgat aatgagttga ggcgtatgct 28381 caggtttcag cgccgcccat aaagcgtcac aaacagcatc cgcgtattcc tcaattgacg 28441 gcagcatgac atctacggca tggtagctac cttttagacg gctgcgggat gggaagccac 28501 cccctatgtc gatgtagtcc atccgccaac cgcactcttg ctcgatttca tagccaaatg 28561 ccaccatttt ctcgacttgg cgagcataag ctgcgggttc catgatgaac gttccaatat 28621 ggctgtgtaa tccgttcacc tgcagtttac ccccactggc gatgcgccgg atggcatcca 28681 ttgcttgacc agattctaga ttgaagccaa agcgcgacca gcagggttga attccagcat 28741 ctaggttgag tcggataccc acgggaatga ttttctggag gcgggtggcg atcgcctcca 28801 aatcctcaat ttcatctaag tgatcaatat tgattgtgac tccatcgcgc acagcagctt 28861 ctagggtggc aaaaggtttg tgcggtccgt tgagaatgat ttgattccct ggaataccta 28921 gtgcttttgc tttgtcgtat tccatttttg aaaccaattc agccattgcc ccttcttgat 28981 gcaggatggc gcaaattgcc ttcaaataat tggttttgta ggaccagcca aaagtcacat 29041 ttgggtaacg agtcgtgaat gcattgcgga tgctgcggaa cttccgacgc agcatctgtt 29101 ctgaatagac aaacagcggt gagccgtact gcgccaccag ctcgtcgata gcaacacctg 29161 caatttctgt tttcaccttg cggtgcaaca gagtgctccc cgcaaacttg ttcattaatc 29221 cgctgtgtag tttttgaatt acgggttttt cgtactgggt ctgcatatca tcgttctccc 29281 aaggtcacta gagtttgtag aggggttgca tccgttacca tctcgtaggt gtagcgcatg 29341 aataacttgc ccgcttcgta ggagggcaag gcaggtaagg gtaatccgag tgctgccctg 29401 actagacgtg ctggcagatt aacacctacc ccggttgcaa agtaagtcca tgctggaaag 29461 cgcggattga tctcgatcag ataaatcatg ccgtcggcat cgacaatgca ttccaactca 29521 aagggtccct tccatttggt atgctgtagg aatgcttccg ccgcctgtag taagcccgga 29581 tgatggatcg 
tcacaccgct ccagattttg ccgagttcag tcaccgacat ctttttggtc 29641 gccactaaac cgagatggct tcctttgcca tccccgatcc caaccaggtt tagctcggta 29701 ccactgataa tttgctgcat gatgatcagg tagccccatt gggctgctaa gtgatgaaag 29761 cgttgcagtg cttcatcctc ggtcaacact cgatacgctc cgtgatacgg acctttcatc 29821 atcaacggta accctagtga tgcgatcgcg tcacgtagtt cttggatcga ggtaacagtg 29881 aatgtttgcg gtgtgcggac accaaacgcc gtcgccactt tcgcaagttg tgtcttgttg 29941 cgtaaggcga actgctctcg cgtaggcaga taagttcgga tgcctaacga ctccaattcc 30001 ttggcacacc ggatgtaaag cggcaattcc acatctaaat tgggaatgac gcaatccagt 30061 ccaacctgct gttggatctg ctgcaatcgc tcaatcagaa cctccggttc tgaagaggga 30121 taaggcataa ggtatcggcg atcaaacagc caatccatat acagccccgg ctccataaca 30181 tcgtaagcaa atccgatgat ttgggttggc aattggttgt cctcgcgtag gctgcgggca 30241 atacccgtac cgggaccagg attatcgact gcattgatac ctgtaatggc aattgtgaaa 30301 ggtttcattt atcctctgtt agtcccaagg ttttgagttg ctggcaaaag tctgcaacat 30361 cgcgttctgc tatagtgtga gcaatgtcga agcgatcgca tagaacctgg acaatttgtt 30421 gctgagtctc gccgtgtcgc agtcgttgca agactaactg accacaatgg tttaacataa 30481 agctttcacc agtggcatct atatcctttg ttttcacctc gttgattccg ctagtcatat 30541 cacaaccacc aaaaaacttc ttgcaaagac catttgtaga tatctggaat ttggtgttta 30601 atgtaggact gatgtttggc ttggtgctgg tgtttgtgtt gttgttaatc aatgctttgc 30661 tttcagaacg gcagagtagg gagaaattag ctattgctca ccaacaactc catcgatatg 30721 ccctccgcgt tgaagaccaa gcaacgctac aagaacgcaa ccgcattgcc cgtgaaatcc 30781 acgatgcttt gggacactcg ctcacggctc aaagtatcca gctagaaaac gctttactgt 30841 ttctgtcatc gaatcttgat aaagcgaaaa ccttcttaga ggaagccaag caacttggta 30901 gcagtgcttt gcgagaagtg cgacaatctg tagcgacgct gcgttgtgac gcgctgcaag 30961 gaaaatcttt agagtctgcg atcgcacttt tgctttcgga ctttcagcgc cgcactggca 31021 ttatcccaga ctacaagctc tgcctacctc aaccattatc cggtgaagtt ggtacgacaa 31081 tttaccgcat tgttcaagaa gccctaacaa atatttccaa gcatagcgcc gcaactgtgg 31141 tcacaattga cttacaaaca actgctgatt ccctgtattt gcagctttgt gacaacggtc 31201 gaggcttcaa cccataccag aacacgacag gatttggtat tcaaggaatg cgagaacgaa 
31261 cgctggcgct aggaggtcat tttcgcattt tcagtgaatc aggatgcggt tgccgaatta 31321 tcgcagtatt tccatatcac ggttaacgca ataattatga tttctctgtt gctatgattc 31381 gcctgttatt ggtagacgac cagatgatta ttcgtcaggg cttaaaaagc ttgttggagt 31441 ccaaacccga ctttgtgatt gttggcgatg cagataatgg tgaaagtgcg atgctccagc 31501 agggagccgc ttcgcgatcg cccaagtaca agtcctgcaa ccagatttgg tattaatgga 31561 tgtccgtatg cccgtcatgg atggtgtagc agcaactgag gtgatttctc agcactttcc 31621 cgcagtgaaa gtgttggtgt tgacgacttt tgatgacgat gaatatgttt cacaagcaat 31681 gcgatttgga gcgaggggtt atttgttaaa agatacacct ttagaattat tggcaaatgc 31741 tatccggtcg gtttatctag gatatactca gctaggacca ggcttattcg agaaattctt 31801 gtctccttta ggacactcta caacgccaaa acccctaaat cccccacctg agtttgctga 31861 actcacccct agagaaagag aagttttgcg tttaattgct actggtgcta gcaatcaaga 31921 aattgcagag tctctctaca ttgctaaaag aacggtgaaa aatcacgtaa cgaatattct 31981 gggtcggtta aatctacgcg atcgcacaca agctgctatc tttgcccatt cctttgtgtc 32041 cctgttggat aattcttcag gtgaaagcaa aaaagttgaa ttctgagttc tgacttatgt 32101 gtcctgagtt ttttatttag attttcacct gttttctata ggcgttaggt gttgtaccca 32161 taagtttacg aaagacgttt gtgaaatggc tttgattttg aaaacctact tgcgggtaga 32221 tttggtctat cgcttgttct gttttagtta ataacctttt cgcttcctct attctgcaat 32281 tcataacgta ttggtgcggc gaaaaacaag tcgattgttt aaacagacgt gagaaatgat 32341 acatactcat ccccactgct gtcgcaattt cagctaagcc taaatctttt tctaagtttt 32401 gattaatata ttcaatagct tgccgcaact ttaaacgcga aagtccttct ggatgtttag 32461 taatctttga agaagataca gaatagtgtc ttagcagatg aatgcataga gtcgttgcta 32521 gagattctat atacacacga ctacccatac catctgattc tagttctgat tgtagtgcta 32581 gtccaatttg ttgaatcaac ggatctgaat cagcaaaatg tggtataata tccgtgtatt 32641 gcatatctat caaatcataa ccagcacgag caaacaaagc tttttccaaa ctaagcacta 32701 aaaattcagc ttctatctcc cagcttaatt tgtgataact gcttgctgga ataacgacaa 32761 tgtcgccata tttaatacac tcatgctgga cgtgtccgtc caacacccgt tctacgttga 32821 taggctgttt tgtaagactt atactgatga cgtgttgttg aggagtatgt tctggtgtct 32881 caaaaggagg ctgttgatga tagtctaggc gaaaatcatt 
ccagttagca tggtaactgg 32941 tgatcagggg cgatcgcgga agaatttgtg aatatgcatc ttcctgagca aaatcaacac 33001 ttatgatttt ttctgttcgc atcgctttca ccccagaaac gattgctttt aaggtacacc 33061 cgtatacttg ctttctaata cagcttggat gctgtcttta aaaacaaaag tttaatgcta 33121 taaatgttaa cacactaatt aaaatattaa gtgttcaaac agaatctgcg acttctatga 33181 gttacttagt tatctacgat gggaattgca atctctgtgt cacaggagtg caaatgctgg 33241 agactcttga tcagggacaa ctttttcgct acgttgctat gcaagacgag tcaactctcc 33301 aacagtgggg aattacaccc gaagattgtg agttgggtat gattctcata gatgcagatg 33361 caccagaaag acgctggcaa ggcagtgcag ctgctgagga aattgggcga ttgttgccta 33421 atggtagtgt atttgtagat gcatatcggg ctttaccagg ggtaaagtgg gcgggcgatc 33481 gcttttacga acaaatccga gatcaccgtt acactatctt tggcaaacgt tctagtactt 33541 atgaatcgac ctattgcatt gatggtggtt gcaaggttgc taaaaatgat gcttcctgaa 33601 aaccagcaaa ctcccgcctt cataattatc agatgggagg agcagaatta aaaatctcca 33661 gtatctgcct acaaaaatct ttaagctttt cccccacctg cacagaaggt acagatgttc 33721 ccacgaaact ccaattttct actgtagaaa gtcgccactg ttcaccaata tggctgaaac 33781 cagcgacttc tacgccaata gcacgacgtg aaccatttaa atggtcatga taaaaacgga 33841 tttgtattaa tatactgcgg ctttgccatg atttactgat gccaggaaag tgaaagccaa 33901 tgtctataga atctggatcg actaactctc tggtttgcgg atcgtttttc caaggtttga 33961 gatccgatct ggcatctgga aactcagatt tgaataaatt aactacggta gcgatcttgc 34021 tggcaaattc aaggtttgtt gccatctcag ctgcgttcac aaaaacactc ctacgttatt 34081 tcaccaggcg tatgagggtg cttgaatatt gaatgactta gtctggctat tttaagcaat 34141 tttactcaaa atttcaagaa aaaaagatca aaaatatgcc tttgaccgaa aatattaagt 34201 aaagatagaa aattttccgc aaaattcgtc attagtagga acgatgggat acgccagcga 34261 tctaaaatga cactgatcat tggtaagtaa aataagttta gcttatgctg cgtcaacgct 34321 caattttgtc gctccttcta ggattcttag caatttttct cattagttgt ggcggtcctg 34381 gggtcgcaac accacctcca acatacacac cagatcaatt ggtgaaaatt caggaatacg 34441 tttctgacat tcagggtgta aaagaacgct cacgagaact cgaaaggctg atccaaacca 34501 aacagtgggt gaaagttgga aacttcacac atggtccgat gacagaagca aggctgtcga 34561 tgaactatgt tacatccaat 
ctgctaccca aagaccaatc agcaggacgt gagttagtgc 34621 gtgatttgtt ggataagctc ataaaaatcg accaagctgc tgaggttggt aacacgaatg 34681 gtgccttgaa tagctcagta gctgcttttg cagacattga caaattctta caactggttc 34741 ctcaaacaag cagtccgtca gaagaaagct aagcgtgata gggtgaagag gaagatgggg 34801 gaacaaaagc accaggagca ggaaagcaag gaaaagttat cttccatatt ccggctttcc 34861 cacaactcca cttcctcacc ttcccatctt ctttgtcttc acttgcagaa acaatgagtc 34921 atgtcgttat catcggttgc ggtgtggttg gggcagcaat tgcctatgaa cttagcaaag 34981 tcaacgggct aagaatcaca gttgttgacc aacaaccacc agcacttgcc tctacaggag 35041 cagcactggg tgttttaatg ggtatcatca gccaaaaaac caaggggaag gcttggcaga 35101 tgcgacaaac aagcatactc ggttatgaaa ctctcattcc tgaattagaa acccttacag 35161 atcgtaaaat tggctacaac cgtcagggaa tccttatgct gctacctgaa ccttctatgt 35221 cttctttaga aaaagaagat gggggaatct ctgaatggga aaagttgcaa gaaattcgcc 35281 aatctcaagg tttttcccta gaaatttggg atactgataa actcaagcaa gtttgtcccc 35341 atgtgaataa cgatcagatt gtgggggctg tttactctcc gtgcgatcgc caagttgatc 35401 caacttccct caccttagct ctcattgacg ctgcaaagca taacggtgtt gatttcaaat 35461 ttggtgttcc ggttttgggt atagaacccc agccgttgtc tccttctcaa gaaagagatt 35521 ctgaaaaata ttgcaaccaa cttcagacgc cagagggaaa aatgactgct gattggtttg 35581 ttgtagcggc tgggttgggt tcttctttat tgagtgcaca attaaagcaa atggttgata 35641 ttcgccctgt gctagggcaa gctttgtgtg ttcgcttagg acattgtctt ggaaatcctg 35701 acttccagcc ggtgattaca ggtgatgatg tccatattgt ccccgtaggt ggtggggatt 35761 actggatagg tgcaacggtt gaatttccga ccaataaaaa agatgaggtc atagctgata 35821 aagaactgct ggagttggtt agaaagcagg cgatcgcctt ttgcccagac ttagcaacag 35881 cgacaactat ccgtacttgg tcagggttac gtcctcgtcc tgaaggacgt ccagcaccag 35941 taattggtaa attgcctgga tatagtaaca tcttgcttgc tactggacat tatcgcaatg 36001 gggttttact tgcaccagcg acagcttacg caattcgtga gatgattatt gctaactaat 36061 cataattaat aacccaatct taatctgacg ataaaaagtc gtagcgggag aaacaaaacc 36121 tacctacgag gcttccaata aatccactta tatcaggttc gcttaattac ttaataataa 36181 aacgatctca ccaggtgccc ctagtcccca ctatgtcctc cggacaagct acgcatgaac 36241 
gtggggatca gtgagttccc ctctccttaa taaggagagg ggtgcccgaa gggcggggtg 36301 aggtgaaacg tatcacaatt tccaatatca ataacaaact tctttttctt tttttttgtt 36361 cttatttcat tcgtacattc aaatgtcaat aacagaaaac aaaattcaag tcggctcgtt 36421 agaatggttt taccgggaag cagtacctac tggtagaagt gagttacttc ctgtcttact 36481 gctacatggt ttaccatcac aaagttatag ttggcgcaat attatgccag ctttggcaaa 36541 gcaaggaaca agagcaatag cgcctgattg gatcggcttt ggtttttctt ccaaaccaga 36601 caaaagagat ttcgcttaca cacctgatgc ttttctgaca gctttagaag gatttctaaa 36661 atctatagaa cttgaacgtt tttccttagt tgtgcaaggc tttttaggtt ctgttggtct 36721 acaatatgcc ttgcgtcatc cagaacaaat tgccaatata accatactta acacaccaat 36781 ttctactcag gcgaagttac cgtggaaaat tcaacaaatg ggattacctt ttgcaggtga 36841 catgattact caagatccac tgttagttga ccggactcta gaaggtggct gtcgttacgt 36901 cattacagaa gaacatttaa atatttatcg taaacccttt ttgaaaagtt ctgctgctgg 36961 aagaagtttg ttagcaacta tccgtaattt gcagttgcgc tcagcaatga cggaaataga 37021 taatggtttt aaggaattgc accaagaaat tttaattttg tggggaatga tagatccttg 37081 gctacctata aatatagctc aatattttgt gaattccctg gaaaaaggga gtttgattaa 37141 acttaacaat gttggtcatt atcctcaaga acattatcac gaaacaattc ttgaagacct 37201 tttgcctttt gttcgtgtta aagatagcaa ttaataatga tagcaattgg aattcaagag 37261 tgacatcact tttatatctt cttaagattt caaattttgt cgagtgagca ttttatgctc 37321 atctgttttg caattcgtta aataaccaga gttaagtttc ataaatcaga taggatttct 37381 atagcacctt tacattattt atgtgtgcca aaaatccatt aatggagaaa aagattcatg 37441 gcttattcct ggtttaaggc ttttcacatt gtcggtatag tcgtttggtt tgctggtttg 37501 ttctacctcg tgcgtctttt catctaccat gttgaagcca accaagaacc ggaaccagcg 37561 cgcacaatac tgaaaaatca gtatcagata atggaaaagc gtctgtacaa tatcatcaca 37621 actccaggga tgctggtaac agtcgcaatg gcgattggtt tattgagtag ggaaccagat 37681 gttctcaaag aaggatggtt acatgtaaaa ctgggatttg ttgtcctttt acttggctat 37741 catcattact gtaagcgtct tatgaagcag ttagcccaag acacgtgcaa atggaacagt 37801 cagcagttga gggcgttaaa tgaagcacct acagttatgt tagtggtgat tgtattgcta 37861 gctgtgttca aaaacaatct acctacagat atcacagctt ggggtattgt 
cggcatgatt 37921 attgggatgg cggcgactat tcagctttac gcgagaaaac gccgcaaaga taaggaaaag 37981 ctgacaacag aaatggtgca gcagcaaagc agctaaataa gctttggtct atatcttaaa 38041 ccctcagtta tacatacata cgcagcctgc caaagcaggc tttgtttgta tgttaggaaa 38101 ctacagtatt atgcgacata cttaaaaata cttatctctt ctactttatt gaaattttat 38161 ataaataaca taaaatttca gttaagcttt ttttgaaaaa aattattttt gttagagtca 38221 tgattaggca tctatctgca gctagataaa actctaaggc tctggttatt attttttata 38281 attgccatat tctatactga ttactgtatt tacttacgga atatccccta ggacgatatc 38341 tactaaatga aaattattta ttttggtaaa attttaacaa tctggttatt cgtagaattc 38401 ttactgagcc taagcttatc tgctcacgct caggaaaaat tagaagctag aagcactacc 38461 ttaccacaac agcttatatc gcagcagaat tttataccac caagacaagg gaaaccaaaa 38521 gatacctctg gtgcaggttc ccgcagtaga ctttgtgaaa attgtggaac atctataaat 38581 agagatgtaa acgcagcaat caatttatcg cgtttggcta cagcgtgaaa gcttacagag 38641 ggataaccgc tcccatgctc ccgatgaagt aagaagtaaa tgtctagttt gtctagattt 38701 tacatagcag aaattgtaca actagatctg catcccaaga gttgaagtac taatgtaagc 38761 gttgctcttg gcgaagtatc tgctcaatta acgcctgatt atcagaactt tctgcttgtt 38821 tgagatattc aagggcaatc tgtccgacag caggttcctc ccgtgaaata gcaactaaca 38881 actctacagt ttgataaagt tgagcttttt ctgattgccg attcgtgaca attggcagag 38941 tcacagcagc aatcacaagt ccaatcatcg caggaacaac aggaatccag caaccttgta 39001 aaaaggcaag gtaggcaatt gctatgagtc ctccctccgc aatagcaacg gcagtaatta 39061 tagattgagg tgactttctc tgccaggcta acgcagttcc cacactacac caaagcaaaa 39121 tccaaagcca ttccattggt ttagatcgaa ctgttagcag gggtcgcccc tgtagcgcag 39181 cactcaaaat catgctagta atattggcgt gaatcgtcac ccctgccatt tgttttgcag 39241 aaccaacgtt ctgactgctt agtggtgttt gaaacatatc gttgacgctt tcagcggtgg 39301 agccgatgag gacaacgcga tcgcgcatca actctagtgg tatcttatta gctaacaaat 39361 ctatgagcga gaacctctga aactgttgct cagtaccatg aaagttgagc aaaacttgat 39421 agcctccagc atcagcctgg acatagccac cgtcattggg ttgaaaggga acaagcactg 39481 ttttgccgag gtgaatttgc tcgcgatgct tggtctgagg ttgaggggtg ataccttcag 39541 cttctagata gcgaagtgcg agctgcaaac 
caagattaag gtgtaatgaa gagttttcta 39601 gtttaactga aagcaacgcc cgacgcaact tgccatctcc atctaaaacc tgatcagcaa 39661 gaccaacctg acctaattgg gctaaagttg gtggtgcagc cacctgacta cccactgcct 39721 tttcaatacc aatgagattc tcagttgttt tataaagttc tactaattcc tgatgaccag 39781 gttctatcgg caaatctcga tacaaatcca gcccgatcgc tctaggttta tagcttttta 39841 ttttgcggat cgcttgagtt aacacccggt ctgtcagagg atattgacca atttgcttaa 39901 tatctggctc atcaattgta attatagtaa tacgggggtc aactgctgca ggaggacgcg 39961 cctgaaaaaa agaatctagc attgaccact ccatcccctg tagtagtcca gtaaaacgta 40021 gaagaataac aaaactggca actcccccag caatgcatcc catccagaat ttactcagac 40081 gtagcttttg ggcttttcgt tttgcggctt tgcgtgcgtt tgccaatatt cgattcgctt 40141 gttgggcagc aagtacggca aattgagctt tttctcgttc aatttgtcgt gcctgttttt 40201 ctgcggctaa atcggtttca acttgtcgct ttgccagttc ctgactggca gcgagaaagc 40261 gataatctaa gtcactgagc cttttatttt ttgaccaagc cagtgcttgc tgcaatgcaa 40321 gatccaccaa aagctgggac tcatcttgct gatttgaagc aatccaagct ttaaagcgtt 40381 gcgcgtaggg acgcaaattt tccaatttta gagcaaccca ttctgaatta aagactgttt 40441 gatagattgg gtttttcacc ctcaaatatc cctgctcatt gacaaccaaa cctgacaaaa 40501 gaagttcaat ctgttctgga ctatcatcgg ttggtacgtc ttccccagcc agaatttgtt 40561 gatatattcc caagctacgc cctgcaattt gctcattagc aagcaggcga tcgctaattg 40621 tccgcaaatg ttctggctca tcctgagatt cccacttgtg gattatacgt gatttcacaa 40681 tactttctac ccaaaatgcc tcagtcccag ggagaacgat cagcgtctga ctcactgaat 40741 cttgactcga actttgtacc aacagacaca gtttttgcgt cagaaagggt tgtcctcctg 40801 tccaagataa aatttctttg ataattgtct gagcattacc ttgctcaact gctaatcctt 40861 ttaccaatgg ctccacttca cccatcgtaa agccgtgcag ttctatggct ttaccaatat 40921 taaacggagt ccgctttgga tcttgaatta aatctgatgg tgttgttacc ccaaaaatga 40981 caaatgtgat gcgattgtat tctgggtcga ttgctctgcg attataacaa aaccgaatga 41041 aggcaaaaaa gtcatcaaca gaaaaatcaa ggctcttgat gctatcaact tcatcaatga 41101 agataaacag tctttcttcg ggaaattgaa ctagcaggac ttcagaaata aatcggctca 41161 gcctttggag tacagggata tcctcctgct cccgccacca agcttttaag ttaaatttcc 41221 ctaacagctt 
gaaacctagc cataactcag caacgactcc cttgtaccac tgctctgggg 41281 taatattttc acaaccaatg ttggtcatat ccacagtcgt gcatctaaac ccatcttgtt 41341 gtaggcgatg ccgagttctg actagaagcg aggacttgcc catttgccga gagttgagaa 41401 catagcaaaa ttcaccttct ttgagagctt catagagttg cttgtcagcc tgtcgttcta 41461 catagctagg agcattaatg gttagactac caccgacttg atacgtgtaa cggttcattt 41521 gagaatctga aggtgtcgta gaccgaaaat ctcagcagtc aattatacta aagcgcacgc 41581 ttgacttaat tgaattatta taatatgccc agaaaaaatg attattatta ttggcgtcaa 41641 acttgaatta gatttcagtt cgggttttcc ttttctatct tctgtgtact ttgtttgtca 41701 agttaatcct tgctggagtg tatctgttca gttgtgtgca taagtctccc aggtattgtt 41761 gtaaaaaaag gtaatcccaa agagctatag cagcgttttc gtttctaaaa atatgtgatc 41821 acagatactc tctacaagtc aattcttact tgcccgtaga atattttctc ttttcaaagc 41881 aattgacacc atcagattta gaacaaaaga taaggaggaa taccaatcct aagtgtttta 41941 ggtatgttga cggaattcta atcagaaaca taaaccagca aatcaacaga gtcaatatgt 42001 atttcggggt atattcagag tcaataggtg aatgtcagta aaaaggtatg ctgcttgttg 42061 gtactgcttt tgtcagctag agttctacat acttttctta cccttgcgac tatttatcaa 42121 cgacgatggg aaaggccttt ctttatgggg ataataccaa atctctatga cgttgcactt 42181 aataaagcaa acagaatatc atcggcttgt gatgaaatgg aactaaaaac tttatataac 42241 ttctgcctca actgagaagc gcgttggtgc aaagttcccc atatgcctag aacctcaaaa 42301 gtgaagataa caagtcaaaa gcttatcatc taggtctttc gctgatttag aacggtagct 42361 ttattttcgc gctatgtgta ctagagcaaa gtctttagtt ctccaattta atgcgtcttt 42421 gagcattgaa ttgtatcatt tcacacattt acttacggca atttattggt attaatgact 42481 tatgactcaa attccttagc gaaaagctag cagataagtc attttttatc gataaaaaat 42541 tgacagcttg atttgtgagt acgttaccta cttttcttac ttttcccact aatttgattt 42601 tcaaagtagt catacaaggt agcaattacc taaaaaataa tcatctatta tcaacgaaaa 42661 actaaaacaa gttgattgct aagaaatata caatgctgta tgataaataa ccaagattag 42721 tacttgccaa taaattgaca tatgctgtat gatgagcaga aacgcaacaa actttaataa 42781 aattcttggg aatatatcgt gcgaatggtt ttccagtgat gttattccgt cctaagaatc 42841 taatcgaaaa actaaatttt agctcttgta gagctacttt gaaaatggct taactgctac 
42901 tttcaaagtg acaaaaaaat aagtggtgaa ttctccagac attcagttcg caagaattgc 42961 tagcttattc tctactaaga ctgacaagag gttatgttaa gcatattctc ctctcacgga 43021 tttttggaga ggttctgatc acatactcac ttgttggtta ctaactagca atttgatttc 43081 aagattcaga tatccttggt ttcggttgca tttacatata atatgttcaa aaactcgtca 43141 caaagcaggc gtcgtttttt gtacattggt ctatcaagtt tggcaggagc tactgctctg 43201 gcagtggtaa aaaacagtaa cccaatgtac ttaagtcatg cattagacaa cccaaaaaga 43261 gattttcaag tagcgggact tgcttcctta agcaagcgtg ctgcagccaa aggattaatt 43321 tatggggtag cctgtgggcg agatgttctt gcgtcagaca aaaatttaca ggctagcatt 43381 gttcggcagt gcggtgttct gacaccagaa aacgagctca agtggcaatt tgtgcgacct 43441 cgtccagacg tgtttgactt tagtagagca gattggatag ctggatttgc ccggactcac 43501 aatatgcttt ttcgaggaca tgctttagtc tggcatgagc aactacctga gtggttcaaa 43561 gaagtcgtta accgccaaaa tgcagaaaag tttttggttg aacatattac aactgtaacc 43621 aaacactacg ctggtaaaat acattcttgg gatgtggtca atgaggctat aaaaccagat 43681 gatggactaa aatcaggctt acgacaaact ccgtggctca agtttctagg tccagattac 43741 atagaacttg cgtatcgggt tgcggcgcaa gctgatccca aagccatgtt ggtctataac 43801 gagaatggat tggagcataa tgcccctgaa tatgaagtta aaagaactgc tgttctgaaa 43861 ctactagaac gtctgaaatc tagaggaaca ccgattcatg ctctgggtat tcaatctcac 43921 ttgttaggtg acgcttcctt gaatcccaaa aagctacgaa attttctgtc taacgtggcg 43981 agtcttggtc taaagattat cattagcgaa ctagacgtca cagaccagaa attaccttca 44041 gataccgttg tgcgcgatcg cattgtagcc gctaaatacg aagactatct ttccgtagtc 44101 ctagatgagc gagcagtgat tgcggtctta acttggggat taagcgacaa atactcttgg 44161 ctttcaaagt ttggttcccg ttcagatggt gcgccagtac gtcctttgcc actagactta 44221 aacttcaaat ccaagttggc atggaatggg atagcaagag cgtttgataa agccccgaaa 44281 cgttgatcaa gcagctttgt cgtgcatgga aaaatgtggg tgaggatgaa gacttgtgga 44341 gttttcctga ccagaatcaa aatccttttt gaacttacct tatcagggta aggcatttga 44401 gatatttaaa tgtctgtttg gatttacacg tacttacggt ctagagatat gagtatcaat 44461 aaaaccactt caacaatgtc ggaggaaata tctaaaatgc aaaaatttga tgagatttcg 44521 ttactcggaa ccagatttca taaaattaaa gttaatgaac 
tgatcgacta cgctgtccaa 44581 gcagcaaaac ttaagaagaa aacaatcata ggtcatgtca atactcgggc aatgaatttt 44641 gcctatgaac tgccttggta tagagagttt ctcaacaaat ccgatttagt tttctgcgat 44701 ggatttggtg ttctcttagg agcaaaactc tgtggtggtt gtgttaattc atctcatcgc 44761 atgacctgtc cagattatat agaagacttt gccaaagctt gcgagcgcga aaatgtttcc 44821 ttattcttac ttgctggtga gccaggaacc gtagatcaag cgatcgccaa gctaaaagtg 44881 attgcaccaa acttaagagt gaacggacat cacggttatt ttgacaagtc tggtgaggaa 44941 aatgaatttg ttatccaaca aatcaataca ttcaagccag atatcttgta tattggcttt 45001 ggaatgccgt tacaagaacg ttggattcta aataattcag agaaaataga cacaaaagtt 45061 tttctccctc ttggggcatg tcttgatttt tatactggta ctgtctccag aggtccgcgt 45121 tggatgacta gtagcggtct agagtggtta acaagattag tcacagaacc aaaacgattg 45181 tggaaacgtt atgtgcttgg caatccgttg tttttctatc gagtcctaca gcaacagttg 45241 acgaaattgt tcagaaggcg ttctcgaacc tcatcgcaat attaaactga tgcacgtaca 45301 tattacactt tttttaaggt tctcaagtat caagagtcga tagtactcca aattttggat 45361 atttgatgct ctcaaaaaga aatgtgcaaa atgtaacagt ttaaaagttc tacctagcca 45421 ggaaatttat tttgtaaagt gcaagtacag gggtcataac aatcgtggca tctaaatctt 45481 tatcagttac agggttcaag ccagatttaa ggtcagcaaa tggcaccagg atacagaaag 45541 gattaaccag caaatttctg cgcgtcggaa cacttagtgt ttcagatgtc atttctttgg 45601 cactagcatg gaagttagca gtcattcatg gcactccatt agattcgcct tggacacaaa 45661 aagtatcttt tttactactg attctggcag ttgaaatagg cgtgattgcg accagaggac 45721 tatacaaatc aggtatcaat cgtcgtaact atcttggtct gatcaaagca gtcacactgt 45781 cagatctttt actgttgctg attgcctttc tctacgagcc agatagttat atctcgcgct 45841 caacttttct acttttttgg ttgttatctg ttgttttcgt ctgtactggt cgctttatct 45901 gtgatgtagg tactaaacta cttcgtagca aaggagcaat tcgccattcc gtttttctga 45961 ttacagatcc ggaagacaaa gacagccata ttcgactaat agagaaagaa aattgctaca 46021 ccgtacaaaa aattgctgat tctagcagtt tggatctgat gaaccgagag acaacctttg 46081 aatatctacg cacacagggc atagaagagg cttttgtttc ttggaacgcc atcaaaaacc 46141 gtctatatgt ttgctggcgt ttccaaacag ctggcattac cctacgaata ctgccaactg 46201 agggtgaagt tcgtcacccc 
aaatccatat tttggatgat tggtgaagtt ccttgcatga 46261 caattccggc accaatcatt gcaggcagcg atttttgggt aaagcggagt tttgatcttt 46321 gttgttcaac tatactgtta cttctgttat ctcctgttta tgtggtgata gccacactga 46381 ttaagttaga ttctcctgga ccagtgttct tccagcaaaa tcgagttggt ctgcatagta 46441 agagtttcaa gatttggaag ttccgcacaa tggttgctaa tgcagaaaag ttgcaggcaa 46501 agctagaagc aaagaatgaa atcaaggatg gggttctttt caagatgaaa gacgaccctc 46561 gtatcacacg cctaggcaaa tttctgcgcc gttacagtct agacgaattg ccccaactgt 46621 ttaatgttct ccttggagag atgagtctag ttggtcctcg tcctctccct atgcgggatg 46681 tagaaaaatt caaaacaaag cactttattc gacaagaagt tttacccggt attacaggat 46741 tgtggcaggt gtctggtcga tcagatatcg ataattttga agatggggtg aaactagata 46801 tctcttacat agaaaattgg tctgtatggc tggatctgca aattttgctg aaaactgtca 46861 aggttgtctt tagtaaggca ggtgcctact aggtaactca gttatctacc gccaccaaag 46921 catcagcttg gtggcggctt taagtctgat caaaccggaa tgttgagaaa ccagtaggat 46981 atggctaaga atattgatac atcaccgaca tcagggaaaa aagtgaaatt tgaatgtttg 47041 agtgattgcc tagagtatag tgcagggcaa tagtggttga tacttaaaga tttatagtca 47101 ttaatttgaa tgttgtttct gaaagtattg cctaaaacat acttttctga cttttcccac 47161 ttgatgagct acacagtaaa ctttacaata tctttaaaga agtttctcta attctacttg 47221 aaaatacgga aaaataaagc tatgatttag catgaatatt gtgccaactt aacacttccg 47281 cgtcaaaatc gatagcaata ctcaaattat tcacgaaaca attgctaata tttaaggaca 47341 aaaacgatca gttttgtctt tgtatgagaa accgatagaa atcattttcg atgagtggca 47401 aagatagcct atactataaa gctaatgtat ttatatcaaa ttcattattg gctccaactt 47461 cgtactaaac tgtgattagg gaacttctat ggagactaga ggctatcaag aagaagtaga 47521 gattcaaaaa tactggctgg ttttgaaacg ccgttggcca attgtggtag gagtccttct 47581 cgcttcgata ggattctcgt cgtttcttat attccttcaa aaacctgaat atcaggcgag 47641 tgggatgctt ctttttaaat cagacagaac ctcgtcgctg acaaaagtgg gagaaaagat 47701 cggtgatctt gaatccttaa tgcgtgaggg taatcccctt gaaacacaag ctgtaatttt 47761 gaaatcagag ccaattttaa aggaggttat tgatactcta ggattaaaag ataaaaaagg 47821 aaacgccctt gatccggagt cgctgagaat caaggttgaa ccaattgtag gtacagatgt 47881 
actgaaggtt tcctacactt cagaaaatcc ggcattgaca gcatcagtgg tcaatcaggt 47941 gatgaaatca tacgtagcaa agaatattca gttcaataga actcaagtgg ttgctgcggg 48001 tgaatttatc aaaaagcaac tgccagaggc tcaaagagaa ttaaaccagg cagcagaggg 48061 attacgtgag tttaaaactc ggaataagat tatcgagctt ccagaagagg caagtgccgc 48121 tgttcaaaat gtggctcaga tagatgagga aatcaaccga gctcgggcgg cgctagcaga 48181 tacgagtgct caggaagaaa aaatccgcag tcaactgaat ttggcaggct ctcaggctgt 48241 agaaattacc tccataagtc agatacctgg tgtacaagaa gtcttaactg agttacagaa 48301 ggtacaaact aagctagcaa atgaaaaagc acgctacacc agtaagcatc cagctatcac 48361 tgagctagaa aataaagaag taactttaaa tgctttatta caacagcggg ttgaacaggt 48421 tttaggacca tctggagtta agcaagtttt aggaactcaa caaaacgttg ctcctgctaa 48481 gttgcagata ggaagaataa aggaaaatct gacgactcaa tatgcactca tacaagcgca 48541 acgccagggt ctggaaaaca agctacaagc attgtctaat atccgaggaa cttataagca 48601 gaaactttct gccctaccaa atttagagaa aaaacaagga gatttagaac gaaggctgtc 48661 tatcgcgcaa aagaactacg aaaatcttgt gaccagacta caggacatcg aagtagcaga 48721 aaagcaaact gttggcaatg caaaagaaat tcaacttgct caggttccta aaaaaccatc 48781 tgtctcgaaa ataacattcc ttttaggagg aggcagtgta tttgtcgggt tattactggg 48841 tacagcagca gcattctttg tagacttaat agatagaacc ttaaaaacgg ttaaggaagc 48901 agagacattt tttggttaca ctttgctggg actaattcct aagtttgaat cgaagaagac 48961 atccgctcct gttaacttga tgtcagacaa agcctctgct cgaattattg tcgcaacctc 49021 tcctcgctcg gtggttcatg aagcttatca aatgctccag gcaaacctaa agttcattag 49081 ccataggaaa gttcgcacaa ttgtagtaac gagttctgta cctggtgaag ggaagtcaga 49141 agtttctgcc aatttagctg cagtattggc tcaggcggga cggcgagttc tcctagtcga 49201 tgcagatatg cgtaaaccat cgcaacatca cctgtggggt ctggtaaact cagtcggttt 49261 aagcaacgtg attgttggtc aagatcaact tccgcaaact gtgcaaacag tgacaaaaga 49321 gttatcgatc ctcacagctg gagtgcaacc tcctaatccc cttgggttga ttgactcaga 49381 taggatggcg actctgattg aaacgttctg cgatcgctac gattacatcg tatttgatac 49441 tcctccttta gctgggactg ctgatgcagc agttttagga aaaatggcag atggagtttt 49501 gctagtcgct cgaccaggtg ttgtggattc agcaagcgcc acagccgcaa 
aatccttgct 49561 ggaacgttct gaagctagga ttctgggaat gattgccaac gctgtaaact tgaagcagga 49621 atctgccaat cacttttact attctaatgt tagaggaggt caggatgttg tagaaactgc 49681 aaaggggaat gaacaatggg tgtacaaata aacctgacgg tcaatacaga ccaagcagaa 49741 tctcctatag gtctttggca actgattcag gaagactgga ttgcccacgg acgtgattgg 49801 actaagcctg gatttcgagc agtggtagtt caccgttttg gagtttggcg gatgaagatt 49861 aaaccactac ttttgcgagc gcctttgagc attctctatc gaatgctgtt tcgcaaggtt 49921 cggaatcatt acggaataga actaccctac tctgtggaac ttggtcgtcg cgtcattgtt 49981 gaacatcaag gagcaatagt cattcacgga gactgctcta ttggagatga gtgtattatt 50041 cgccaaggag ttactttagg caaccgttac ctggatcgcc cgcttgatgc accgaaattg 50101 ggtaaacatg tcaatgtcgg tgctggtgca aaaatctttg gcaacgtcac tattggagat 50161 aacgcaagta tcggtgctaa tgccgtggtt ctgtgcgatg ttcccgccgg agcaacagca 50221 gttgggatac ctgcaagaat catccattct gaaaaagttg gcaattcaca tctttgaggt 50281 aacaatgcca tgtcaatgag ccaattcatg gcttcgttta ctgatattgt tctgttagta 50341 agcgctagtg gtctatttat tgtctgcgta ttttttttga ttgaatgtac tgcggctcta 50401 ttgccaatca cttcttgtat taataaagac aattgcccag atttgaaagt ggctgtcttg 50461 gttcctgcac acaatgaaga gattgttatc ggttcaacaa tagaaaaact gctcccaacc 50521 ttaaacagac aggacagttt ggttgttgtt gcggacaatt gtagcgacac aacagcagaa 50581 attgctcgtg ccaaaggtgc tacagtgatt gagcgtcaag accttgatcg caaaggcaag 50641 ggatatgcct tagattacgg tctacagttt cttgagtcgg ctcctcctga tgtagttgtg 50701 attgttgatg ctgactgtac agtccatcca gatgcgattt cgctgttgag tcagtatgcg 50761 atcgccatga acgcacctgt ccaagcgacc tacttgatga gcagaccaaa aaactcgcag 50821 tcatcaaaag actttgtttc acagttttct aacatcgtca ggaatttagt tcgtcccctt 50881 ggattagcca gacttagaca gtcctgccca ttgcttggca caggtatggc tttcccttgg 50941 acagtgattc gttcagttaa tgtagccaac tctcacctcc ttgaagactt aaaactaggc 51001 ttggatctga ctttagctgg atacagacca gtattttgcc aatcggcaaa agtcactgga 51061 tatttgccac agcagtcgca ggctgccaaa agtcagagaa ctcgttggga tcatggtcat 51121 ttacaaatta tgcagaccta tgtccccatc ttgcttaaac aagcagtctt tcaaaaaagg 51181 ttcgacttgt tggtgagcgt tttagattta 
tgtgtgccac ctttatccct gcttgttgtt 51241 atttggttag tgctcatggc actttctttg gtttttgggg ttttaggagc atcattgatg 51301 ccagcagtta tcattgccac agcgggactt tgtttcctca ttgcaattct aacagcttgg 51361 actaagttcg ctcgacaaga tcttcctctg cgcgaacttt taactgttcc tttttacatt 51421 ctctggaaaa ttccagttta tttcaagttt ttagtcaagc ctcaaagcgt gtgggttcgt 51481 accgaaaggg actcagttaa cgcaagtgat tcttagttga gatattatac aaaacgttga 51541 gattaaaaac gttatgaata aaattcaaca gaatagtctt ttgttagtaa actcagttcc 51601 gcttagggaa gtagacggtc aattggggtt ggatgatcaa acctgcgcgg gtttagttcg 51661 ctgggcagaa aactttaagc gacttgtgat cgctggtcct gccttaccag aacacatcgc 51721 tgtacaacaa caaaattctt caactgctgg cacaaaatgg caagcagtga aagatttgcc 51781 gtgtgcagac caactggaat tagtgccact accgtacgca tataaattgc gagactttac 51841 aaaggtatat gcaaagacac gcgaattatt gaacgaaaag atacagcaaa accagtatct 51901 ttgtttcgtc atcgctcgac ctattggaga ttggggagga atcgctgctt tagaagccat 51961 aaagcttaga cgaccttaca ctgcgtggtt ggatcgcgtt gagtatgagg tcatcagtcg 52021 aatgttgcaa agtttgcctc tcaagagccg cattaaagag tctttgcacc taccattgat 52081 gaaacgctat caacgatact tgattcgtca aagcagccta ggactttttc aagggcaaga 52141 ttgctatcaa gcatattcac ctttttgcaa acagccccat tgtatttatg atgttcatac 52201 aaaaaagtct gatcaaattg acgaaccaag catagacctc aaaatcaagt cactcttgtc 52261 taatgaacct ttacggattt gctatgtagg acgagcagca gacatgaaag gaccactcga 52321 ttggctgcgg gtcgtggatc ggatttgtaa agctggtgtt gatgtcaaag caacttgggt 52381 tggagatgga ccacttcttt tgaagatgaa gtcgttaact caagagttgg gcattgctga 52441 ccgtgttaat ttagttgggt ttgttgatga ccaaagcaaa attttgcaga cgatgagaga 52501 aaatcacatc tttcttttct gccataaaac accagaatca cctcggtgtc tggttgaatc 52561 aatagtttct ggttgtccta ttgttggtta tgaaagctcc tattcccaag atctggtgtc 52621 tcaatatgga ggtggtgcat ttgtcccaat gaacgactgg caaaaactgg ctgacttagt 52681 tgttgaactg aattgcgatc gaccaaaatt aagtaagttg attcgagaag ctgcattatc 52741 tggtcagttt tttgatgaag aaaccgttta tcaggagcga agtgatctaa ttaagaaata 52801 tttaacatga ttcacttttt aaaagtttaa gtagctgaca tgttttaggg tgacgtgtac 52861 ttagtatgaa 
aaaaatcaat aaaactataa gtatactaat atac // LOCUS NODE_394_length_52622_cov_5.393308 52622 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 52622) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 52622) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..52622 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1636) /locus_tag="DP116_01975" CDS complement(<1..1636) /locus_tag="DP116_01975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_01975" /translation="MADASIYTVGGTVQTNRSGVYISRQADEELLALCRDGKFAYVLT PRQMGKSSLMVRTARQLRTEGIESVMIDLTGIGTQVTNPEQWYLGLLIQIEAQLMLDT DVYSWWQAHAHLGFTQRLTEFFQRVLLVEITASVVIFIDEIDTTLRLDFTDDFYAAIR YLYVSRASLSSEFERLSFVLFGVATPGDLIRDPKRTPFNVGQRVDLTDFTLTEAEPLA KGLGLPTPKAQQVLHWVLDWTGGHPYLSQRICSILSQQDKKNWSKADIDRIVSRNFFG ARSEEDNNLQFVRDMLTKRAPDKAAVLSIYREIRRGKRAVADEEQSITKSHLKLSGVV KRKNGVLKVRNPIYQRVFDLKWVNQHLPLNLRDFWQRYKPALPYVAILLIFSIFMGGM AFYANTQRLDAEAQRLHTEKALKNSQVLVQSLTTENLFTSGYELEALLEGLKAGKNLK QYGDAVDSDNRTKAILALREVVYGLKERNRLEGHSSYVTSVVFSPDGKTLASGSRDNT VKLWNLNGQLQHTLKGHSSYVTSVVFSPDGKTLASGS" gene complement(1910..2329) /locus_tag="DP116_01980" CDS complement(1910..2329) /locus_tag="DP116_01980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002776593.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_01980" /translation="MSYLYLLDTNIISELIKNPRGVIFSKIQDVGEDKICTSIIVACE SRFGAKKKNSQKLIEKLEIILNSIEILPLTHPVEQYYAEIRTDLEQQGKPIGGNDLLI AAHSLSLGLTLVTANVREFSRVSNLKVENWLIPDEKS" gene complement(2332..2562) /locus_tag="DP116_01985" CDS complement(2332..2562) /locus_tag="DP116_01985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015201779.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system Phd/YefM family antitoxin" /protein_id="PRJNA477356:DP116_01985" /translation="MKQIPLAELPETVQNLINQTQKTGEPLTIIQNGVPFAIISPIKK KSLLQTLSTLEPLEEEFPDVDEGLLPLDDIEL" gene complement(2613..2754) /locus_tag="DP116_01990" /pseudo CDS complement(2613..2754) /locus_tag="DP116_01990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878813.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="restriction endonuclease subunit S" gene complement(2831..4936) /locus_tag="DP116_01995" CDS complement(2831..4936) /locus_tag="DP116_01995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745748.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_01995" /translation="MKKILILSANPNNTSKLRLDEEVREIQAGLQRAKKREQFEIISR WAVRVDDLRRALLDYEPQIVHFSGHGSGSDGLALENNSGELQLVSTQSLARLFKLFKD KVECVLLNACYSEVQAEAIHQNIDYVIGMNRAVGDRAAIKFAIGFYDAIGAGRNYEDA FEFGCTSIDLEGIPESETPVLKSRNSNQDTIASTTSKQRIFISYKRNVEPDESVATQV YQELSQQHNVFIDQIILIGKRWAECIETELRNADFLIVFLSEQSVNSEMVQWEISLAV ELAQRQGGKPVILPVRLAYQEAFPHPLSIYLNHINWAFWESPEDTSRLIEELRQAIHG GSLRVDQQAKTNLLKICVPEQLPRPTPFAQPVVLEMPSGTMKPESSFYVERNADTIAL QTIVQQGVTIPIKGPRQVGKSSLLMRIIQAVRNAGKRVAYLDFQQLSTAVLNDEELFF RHFCFWISDVLELEDKVEEYFARNLTTIQRCDRYLQRHILKGLGHPLVLAMDEVDRVF DTKFRNDFFGMLRNWHNSRAYYPILDTLDLVLVTSTEPYQLIDDLNQSPFNVGQIIEL EDFTPEQVAQLNGRHNSPFDNNTLQQLLRLLGGHPFLVRKALYLVASGQITPTELFTN ATAASGPFGDHLRRLLSLLYDKQELIQGLLLVINQNICPDRQIFWRLQGSGLVRASGQ SVLPRCQLYAEYFRENLHG" gene 5675..9646 /locus_tag="DP116_02000" CDS 5675..9646 /locus_tag="DP116_02000" /inference="COORDINATES: protein motif:HMM:PF14252.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321388.1" /note="Derived 
by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02000" /translation="MKITGIIFIDPSVNNYEILLNLVVANIKAIVLDPEQDGVTQITQ VLSQHRGVENVHIVSHGSSGTLTLGNSELSLNTLELYADQLKNWFSPDSHLRFSPVPS LLLYGCSVAAGDAGVEFITKLHQLTKANVAASANLTGNAILGGDWNLEVNIGNVSNQP LALSAEVMAAYPSVLNNAPILNDTDVILNAVNENAGAPIGAVGTPIFSLVTLGGNVID EDRDALTGIAITNTDTTNGNWFYTINNGSNWAALGSVSDNNARLLAADAITRVYLQPN PNISSPSPIVNALTFRAWDQTTGTNGSTADTSANGDTTAFSTATDTAAITVNAVNDAP TLNNTNITLTAINEDAGTPTGAVGTLISSLVTGDNVIDPDSNALRGIAITNADTTNGN FFYSADDGNNWAPLELVSNTNARLLAADGITRIYFQPKANFNGTINNALTFRAWDQTT GTNTGIANTTISGGTTAFSSTTGAAGITINSVNDAPTLSEANITLTAINQNVGAPIGA VGTLVSSLVRLGDNVIDPDSNALTGIAITNANTTNGNWFYSTNNGSSWIPVGILSDTN ARLLAANASTRLYFQPNGNFNGSIDNVLTFRAWDQTSGTNGSTADTSVNGAATAFSTG VNTSTITINAVNNAPILLDTNLFLSTVEDAGAPTGPVGTLVSSLVSLGVNVTDSDNNI TTGNASTGNALTGIAITNANTTNGSWFYSTDDGSSWAALGSVSDTNARLLAANGTTRL YFQPTANFNGSINNALTFRAWDQTSGTNGSTANTTLNGGTTAFSIATDAVAITVDAVN DAPILRDTNVTLTAIDKNAGAPIGAVGTLVSSLVSVGVNGNVTDPDSNPLTGIAIANA DTTNGTFFYSTDNGNNWAPLGSVSNTNARLLTADANTRIYFQPNANFNGIISDALTFR AWDQTTGTNGSTADTTVNGGATAFSSATDTAAITIGTVTNPVTNPVTNPVTNPVTNPV TNPVNGAATLKLASNNIFQLEGVSDGNKPKLEVTLNKASAKQVNELGVFVVDDDQGRI NGIAPGTENYAQAALSRANTIFSVITDNPKGFNTDLTRLLEFNSGARLQFFLVNNSSI DAVQAGSTSTKDLIFSNPSTQKVTDLGNGEFTLALNDASNSNTSDFRNLVVNIKATNQ SLPLGAGLQGNPFGELIDLRYGSTQVKADFVLNRKATFDNFVGFYQVADTNGGIDIDG NGTVDLRPGDAGYTQAAVQRRVPGIDLTVGNQSTATFTSNLSPGSILAPFIIANGRPD AILDGNPNNDPAVYFSFLGANSDKVDHIRLLGNNTFGFEDLSGGGDKDYNDMTVRVNL GIA" gene complement(9940..11031) /locus_tag="DP116_02005" CDS complement(9940..11031) /locus_tag="DP116_02005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140633.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="iron ABC transporter permease" /protein_id="PRJNA477356:DP116_02005" /translation="MSSLKHKLKPILRNFQHSSFQKLFFPALLITLILTFLLDLALGS VHIPLTQVITILLNGEPEKATWTNIILKFRLPKALTATLAGAALGVSGLQMQTLFRNP LAGPFVLGISSGASLGVALVVLTTSVTGAGTLFKDLGVIGDFGLVVAASLGAAAVLGL VLLVSQRVEDTMTLLILGLLFGYATSAIVSILLHLSESQQIQSYLLWTFGSFGGVTWQ QMLVLAPVVLIALLISLMLSKPLNALLLGEAYARSLGLGVEEVRFWILITSSILAGAV TAFCGPIAFLGVAVPHLCRSLFNTCEHRVLVPAVTFMGAILALVADLISQLPGSQMVL PLNSVTALIGTPVVSWVILRRHSRTSFPK" gene complement(11182..12399) /locus_tag="DP116_02010" CDS complement(11182..12399) /locus_tag="DP116_02010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457220.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_02010" /translation="MRITILHKAKLFAFFCQFLLVAVLAIACHSTTTPPTLPTHIKEC AQTYNPNTDYFPEKVTVNYATGFEVEYYKNYKVVTVKNPWRDAQVSFQYILVQCGTPV PTGFNQSQVITVPINTVVSLSTTHLPHLAKLGVVDKLIGVSDSKQVNTPEVVEKMKQG KITKVGNNASLNVEQLLEINPNLVMTYGTGDKQTDNYPKIQEAGLKVAINAEYMETSP LGRSEWLKFTALFFNQEKVAQKIFDETVKKYQAIAAKAQSVKNRPTVFVGFNFKGIWY MPGGNSYVAKFLADAGSNYLWNNKKSSGTLPLSFETILERAANADYWLNSSQYWKTLK DLQAEDNRYADFQAVQKGNVYNNNARINQTGGNDYWEGGISNPDVILSDLIKIFHPEI LPNHQLIYYQKLS" gene complement(12473..13354) /locus_tag="DP116_02015" CDS complement(12473..13354) /locus_tag="DP116_02015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457219.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="(2Fe-2S) ferredoxin domain-containing protein" /protein_id="PRJNA477356:DP116_02015" /translation="MNQFHPWVTPLNHVIKEPSLTGGYIESEYFDCSCDVLSSLLYTL FQQNWHQVGVGHIVQGGVLELEFPAAPKICILYDGYLTVVTESWHFHLCIEANLGGPH CKTPLELRKQRQVNRAAFYRRFNAEGIPRSWGIDFWNGASENLMTIFLPNPYVEGENL LPEGKPNLAKLELYHELRDIYVLGKKPIPFDKNPLKHAYISVCTSTRCLPSQNWEPTL DALKVAVEKAGLDVEVRTSGCLEVCKLGPVVFYSEDSTWYTRVKPEVVETIVNEHLVE GKKVTAHCYPPESVEKK" gene complement(13560..15638) /locus_tag="DP116_02020" CDS complement(13560..15638) /locus_tag="DP116_02020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457218.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TonB-dependent receptor" /protein_id="PRJNA477356:DP116_02020" /translation="MLNILFNCRWAVKVVVSSVSAMTTLILWNYAVCAGEIPQSEKIA QSTNTPTDEQEPPIEITVTGKVLNQPVFSPFRREGTVKDSTRPVYVITGEEMEAQGVR TVREALRFLPGILPDGTVGTEVNALSGQFIRGSNSEQVLILLDGRPINNLGSGGFDLS EITTNNIERVEVLPGGGSTLYGSDAIGGVINIITRRPTEKVTTQAGVTLGAYGLNQQT ITNSGKIENISWVVGYNRTEADNNYPFSIPEANFEGTRKNNDTLYNNFNVKLEANLGK RNTLTVSSLYLSKNQGVPGGVPIPEPQFGQGYFNSLTDNNRKYTDQVLTDLTWNSKLG GADDSLLTARVYSDFLNTRFENRSGTLSSQRRFDNEQTSYGIQTQHSWKFAKNQTLVY GFDYRNTSVRNSTFNYSTEQQTLSYDDSISQGALFARYEINFTPSLSVNLGLRQDFSS LTDGSFTSPSVGARWAISDSTNFRANYIRNFRAPTLFNLYARGSTFVGNPNLKPENGD SYDIGIDQKLWDFGLLRLTFFSNTISDLIAYNFAVPVATYENIGLVRTRGIEAALNVQ LAKNVYAFANYTLNDPRIKESVNSGEKDKELRFAGADSLNLGISYETPQGFYAGILMH SLGSYPTNNTNTESLSGYTTFDFKTRIPLSDNLVLTGSVDNILNQRYQLFPGYPDAGR VFQVGLNATF" regulatory complement(15657..15792) /regulatory_class="riboswitch" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00174" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="cobalamin riboswitch; Derived by automated computational analysis using gene prediction method: cmsearch." 
/bound_moiety="adenosylcobalamin" /db_xref="RFAM:RF00174" gene complement(15782..16279) /locus_tag="DP116_02025" CDS complement(15782..16279) /locus_tag="DP116_02025" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02025" /translation="MERNGVILNGTNGNDTVQASGQKSSVIGVKVNTAEVGSRLINAE TESLGVGEIDILTGSPGRNVFYLGDNAKENPKDFYLGNGDQDYALIRYFDPTSEDAVY LAGNPKDYTLETINGSVNISKNGDRSICNITTKLNVIILICIEKLSMVNYSKSWVLMG FSHQR" gene 16566..18161 /locus_tag="DP116_02030" CDS 16566..18161 /locus_tag="DP116_02030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318595.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S15 family X-Pro dipeptidyl-peptidase" /protein_id="PRJNA477356:DP116_02030" /translation="MLTRDGVRLDADIYRPDADGEFPVLLMRQPYGRALASTVVYAHP TWYAAHGYIVVIQDVRGRGTSQGNFKLFTHEIEDGEYTVKWAANLPGSTGKVGMYGFS YQGMTQLYAAAAKPSALKTICPAMIGYDLYTDWAYEGGAFCLQSNLAWAIQLATETAR IKKDEKAYQALLAASRHLPLHDPIPTHPEILKTFAPDSFYHEWLAHSQPDKYWEKLSP KTYLKDVDLPMFHIGGWFDTYLRGTLNLYKDISARSAYRQELLIGPWAHLPWSRKVGA VDFGPDAASPVDRMQLCWFDQFLKGVDTGLLDELPVWLFYMGSNVWQGFPSLSISKGR SYFLSTTGLASIREGEGILATTCPETSTDDVLVHDPWRPVPSNGGHAAIPAGCFERTH IDYRSDVLTYTTESLETDLYLAGDVVVEVWCMSDKKSYDLCAVLSEIFPNGRVYNLTQ GYLRCQDGNHRVRRTIHLQATCAKIFRNHALRLSLSASCFPAYTMNPGNNSASSIDAE IITLMVSCGGESLSQIILPVVTP" gene 18243..19070 /locus_tag="DP116_02035" CDS 18243..19070 /locus_tag="DP116_02035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009783867.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_02035" /translation="MSETTLAAPYVTLPPTQAELPCDDGIPMETQRHKYQMDLLIETL EFWLAQREDGFVSGNMFVYYSMAQVRNKDFKGPDFFVVLGVPKGERRSWVVWEEGKAP DLVIELLSESTAEADKNEKKLIYQNQMRVPEYFWFDPFNPDDWAGFSIQQGVYQPLVP NERNQLVSQSLGLGLQRWQGKYRGVDTVWLRWATMEGEVLPTGMEIAQQEHQLAEQER QRAEQEYQRAEQERQRTEQVRSQLQQTVRNLLQAGMTVEQVAKLTGLDVSQVQELGN" gene complement(19067..20068) /locus_tag="DP116_02040" CDS complement(19067..20068) /locus_tag="DP116_02040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317623.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="A/G-specific adenine glycosylase" /protein_id="PRJNA477356:DP116_02040" /translation="MSQQTTLAVVVPKFSEFVKHLPTVDDLATCDEEILRQLWSGLGY YARARNLKKGAKFIVEQLEGHFPQSYYEWLKIPGCGPYTASVIASICFNEKVACVDGN VVRVVSRLLALSEDVWSSSGKSAIQTHVDQMIPEERPGDFNQAIMELGATVCRKSKPL CLLCPIREQCLAFAHNCVEICPPKKPRRDTVDVELFALVFWRKATDTVAIAKRCCFQQ NAHRTKGFLSNTVGFPLVTATEASEVKKVLQSLEHLQFLELSSKFSHSITHHRISGRV FIAQELENPSITTESVIWEKLGLPKSFDWIPRSVLASKLSTALDNKVFKLFENYILG" gene 20931..21656 /locus_tag="DP116_02045" CDS 20931..21656 /locus_tag="DP116_02045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoadenosine phosphosulfate reductase" /protein_id="PRJNA477356:DP116_02045" /translation="MTVSPATTSQTAAFDLDQLNQKFETAHPRDILAWSIENISTGLV QTSAFNVDDIIITHILYVDLKHPVPVIFLDTLYHFRETLELVAKVKDTYNLDLKVYKT PDVDTREAFEAKYGEALWDTDIAKFHEVTKIEPLQRGIAELNTVAWITGRRRDQAVTR ANMPIFELDSNNRLKINPLANWTRKQSWEYVAEHGVIYNPLHDQGYPSIGDEPITTRV GEGEDERAGRWRGTGKTECGIHI" gene complement(21952..23277) /locus_tag="DP116_02050" CDS complement(21952..23277) /locus_tag="DP116_02050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408812.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_02050" /translation="MLENIRRIKNPILRSATAASIAFSVVTIVSGVAIGLTNSKDKSA FGVGVYSSLLGTGCGAIFGLFMTSQASQETSVSTEVKPSSKIWKDWRNFVVVRKVKES EEITSFYLQPEDKGEIPNFQPGQFLTIKLDIPGQNKPVIRTYSLSDYSDLSEYYRLSI KREPTPKGLDVPPGVASNFMHDRIHSGSIIPAKPPNGKFVLDVKKSIPAVLISNGVGI TPMMSMAKAATRLNPNRPIWFVHGARDGRFHAFREEVTGLAQQNPNLHVHFRYSRPTP EDEGHYHSVGYVDAALIKQLVGQQAEYFLCGSPSFMQSIMQGLKESGVPDSRVFFESF GKPMKVASETQPPAATIGEEVQEAEIVFAKSGKTLNWKQGDGSILEFAEANDINPPFS CRAGICGTCMCKINAGEVAYQEEPTAAIDQGSVLICISQPGSLVVVLDI" gene complement(23728..24564) /locus_tag="DP116_02055" CDS complement(23728..24564) /locus_tag="DP116_02055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878747.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02055" /translation="MSVNNNYFENLRNDPRSTKELVKLALTEEDNETYWDLVWILRAR GSDEEFEAASRLCESQNPKERSLGVDILGYLGIPERSYPKECGEILLNLCKSEENPNV LSSIGYAFGHLGDSRGVVPLVKLKSHRDADVRMGVVFGLLCQEEELAIQALIDLSCDE DEDIRNWATFSLGYQIETNTQAIRDALFQRVILEIGEEDTIAEIRGEALLGLAIRKDE RVINPLIAELSCGCVGRLSVEAAKEIEDTRLYPVLIELQQWWDVDSELLQEAITSCQP KH" gene 24803..26056 /locus_tag="DP116_02060" CDS 24803..26056 /locus_tag="DP116_02060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126663.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="exonuclease sbcCD subunit D" /protein_id="PRJNA477356:DP116_02060" /translation="MIKILHLSDIHMGSGFSHGRINPETGINTRLSDFVNSLSRCIDR ALAEPVDLVVFGGDAFPDATPAPYVQEAFAGQFRRLVDANIPTVLLVGNHDQHTQGQG GASLCIYRALGVPGVVVGDTLKTHRIQTRNGSVQVITLPWLTRSTLMTRQETEGASVA EVNQLLTERLRVVLEGETRRLDPNVPTVLLGHLMMDNANLGAERFLAVGKGFTLPLSL LTRPCFDYVALGHVHRHQNLNKSNNPPVIYPGSIERVDFSEEKEDKGYVMVELEKGRV QWEFCPLPVRAFHTIEVDLSKAEDPQAAIMKALAKRDIQDAVVRLIYKLRSEQLDLID SASLHTALSSAHTYTIQPELASQLARPRVPELSASNSIDPIEALRTYLNNRDDLKDIA ASMLEAAHNLLADDVEICLESATQE" gene 26230..28065 /locus_tag="DP116_02065" CDS 26230..28065 /locus_tag="DP116_02065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318482.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amino acid permease" /protein_id="PRJNA477356:DP116_02065" /translation="MSFYPQIKQFLLGKTLPTSAHAEERLSNAAALAVLSSDALSSVA YATEEILLVLVVAGSSALGLSLFIAIAIIILLAIVVLSYRQTIRAYPQGGGSYIVARE NLGLYPGLVAGGSLMIDYILTVTVSISAGTAALTSAFPVLQPFTVSLCLIFIVLLTLA NLRGVKESGNIFMIPTYAFIASIFVLIVLGLFKQATGQVPTEYPNIPVKEGLSLFFIL RAFSAGCTALTGVEAISDGVLAFKQPEWKNARLTLLYLGIILGFMFVGITYLSNVYHI VPEEGQTVVSQLGRLIVGTGPFYFFVQVVTLLILLLAANTSFADFPRLCYFLARDGFL PRQLSLLGDRLVYSNGIILLSVCAAILVIIFKGSVNAVIPLYAVGVFTSFTLSQAGMV RRWFHERTPGWQASALMNGLGAIATVVVLGVIISTKFLGGAWLVVVAIPVVVSIFLAI HRHYQYVAQRLSIEGLPPRSYTPRVKPEVITHPAVVVVGQLNRGTVEALDYARTIADE IVAIHVDLSSTEGEKLQEQWRQLESDIPLVIIESPYRSVISPIVEFVGEFEDRYHDTY TTVIIPAFVTRNWWEGLLHNQTTLFLKTALRAQKSRVITTVRYYL" gene 28342..28545 /locus_tag="DP116_02070" /pseudo CDS 28342..28545 /locus_tag="DP116_02070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879663.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="chlorophyll a/b binding light-harvesting protein" gene 28641..29372 /locus_tag="DP116_02075" CDS 28641..29372 /locus_tag="DP116_02075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875027.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_02075" /translation="MSKQPDAIWLNTSPSLLCFAQPLLRELSRSVTIAQWEYTQTQDE ASSLDVATLLVDDYLQSIKQPVHLIGHSTGGLLALLYTRRYPEKVKSLTLLAVGADAA LDWQAHYYNHRVSLTREKILNAMVYNLFGYQNEHAVKRLENLLERDLDCSLSPHSLFK RLSMVPSPVPAPLMVCGSTNDIIVEPDALEGWRPFLMEGDRYWECQKGRHFFHFFQPT LVAEQILDFWQSLYQISKKPLQIRK" gene 29661..30470 /locus_tag="DP116_02080" CDS 29661..30470 /locus_tag="DP116_02080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862946.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="short-chain dehydrogenase" /protein_id="PRJNA477356:DP116_02080" /translation="MRVKYQDQRKQTALITGAASGIGYELAHLFARDTYNLVLVDKNG QKLSEMAETFPQKFGIYVTKIVKDLSISTAPEEIFTDVQQASIKVDVLVNNAGFGNYG LFHETNLTAELDMLQVNLVSLTHLTKLFLKDMVNQGNGTILNVASTAAFQPGPLMAVY SATKSYILFFSEALANELKDTGVTVTVLCPGPTESAFHKITGMADSELLKNKKMMSAE TVAKIGYSGLLAKATVVIPGVKNKILAELVRFAPRKLVTKVVRSMHEGKIK" gene complement(30726..31880) /gene="nspC" /locus_tag="DP116_02085" CDS complement(30726..31880) /gene="nspC" /locus_tag="DP116_02085" /EC_number="4.1.1.96" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161363.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carboxynorspermidine decarboxylase" /protein_id="PRJNA477356:DP116_02085" /translation="MTDTRNSVLATIPSPCFVLEEARLEQNLAVFERVQQETPVRVLL ALKAFSLFHCFPLIRKTLHGASASSLWEAQLAAEEFGGELHVYSPAYRDEDMSAIMTY ASHITFNSLSQWERFRPLIERSPSPPSIGLRINPQYSPVKTALYNPCQPFTRLGVSPE HLGTLLPQGIDGFLSHNLCESDSHELEKTLIHIEQFFGHFLPQLKWLNLGGGHLMTRQ GYDIAHAIKVLKAFHNRYPHIELILEPGSAVAWETGFLLSTVRDIIPTTDITNVILDI SFTAHMPDCLEMPYKPVIRGAHEPQKGEKAYRMGGSSCLAGDFLGDYAFCYDLSVSDR IIFEDMMHYTMVKTTMFNGVIHPSIGILKKNGTFEILRQFSYEDYRVRLG" gene complement(31883..33082) /locus_tag="DP116_02090" CDS complement(31883..33082) /locus_tag="DP116_02090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010873491.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="saccharopine dehydrogenase family protein" /protein_id="PRJNA477356:DP116_02090" /translation="MARVMIVGAGGVGNVVVHKCAAREEFSHILLASRTIDKCHRIAE SVSSPKVKIAELDADNVNDTVKLLKEFRPDILINVALPYQDLALMDACLEVGVHYLDT ANYEPPDEAKFEYSWQWKYHERYKNAGITAILGCGFDPGVTGIFTAYALKHYFDEIHY LDIVDCNAGNHGQVFATNFNPEINIREITQPGKYYENGTWIEVPSLSIHRPISYPEIG IKESYLLYHEELESLVKHIPTIKRARFWMTFSQSYINHLQVLQNLGLTRIDPIDYEGT KIVPLKFLKALLPEPSSLAENYTGQTSIGCHIRGVKNGKERSYYIYNNCEHTQAYQEV NAQAISYTTGVPAVIGSLMVVNGLWKQPGVFNVEQYDPNPFMELLGPLGLPWHEVIDQ PSPFEDN" gene complement(33641..33859) /locus_tag="DP116_02095" CDS complement(33641..33859) /locus_tag="DP116_02095" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02095" /translation="MNEVFLRTQLDGHQTGITLLKQVNQTVSEANNNQQLPLVEFNKI LKGRPTASIAMVPSIIRHTRPCSSESLN" gene complement(33878..33988) /locus_tag="DP116_02100" CDS complement(33878..33988) /locus_tag="DP116_02100" /inference="COORDINATES: protein motif:HMM:PF00668.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02100" /translation="MFTIILTAFKILLFKWSGQSYILVLTTVANRNTPTI" gene complement(34185..35263) /locus_tag="DP116_02105" /pseudo CDS complement(34185..35263) /locus_tag="DP116_02105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152095.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene complement(35435..35671) /locus_tag="DP116_02110" /pseudo CDS complement(35435..35671) /locus_tag="DP116_02110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206370.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 36131..38212 /locus_tag="DP116_02115" CDS 36131..38212 /locus_tag="DP116_02115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876833.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="single-stranded-DNA-specific exonuclease RecJ" /protein_id="PRJNA477356:DP116_02115" /translation="MAQTTSLSPLINQLLINRGIETPEQAQVFLNSGNLILPSPLEDF PDLGMSLELLHNAIAPWAGPLGTIAPSAAPEAIASQEKIAICGDYDADGMTSTALLLR SLRWLGAQVDYAIPSRMHEGYGINKRIVEEFHSEGVGLILTVDNGISAFEPIERAREL GLKVIITDHHDIPQQLPPAHAILNPKLIRESSPYRGIAGVGVAYILAVSLAQKLGQAN DDILTSMLELFTLGTIADLAPLTGVNRHWVKSGLQHLPKSKLPGVRALIQMSGVHLSG GQGDESIQNSKSRSVDREATQSQNSKSLKPEDIGFRLGPRINAIGRIADPQIVIELLT TDDMGIALEKAMQCEAINRQRQEMCEQIENEAREIVEIEYLPSLQEDRVLVIVQPDWH HGVIGIVASRLVERYGVPVFIGTYEDEGQIRGSARGIPEFHVFDALEYSQDLLGKFGG HKAAGGFSLPSENLEALRSRLSEFANQCLEPQHLKPLLKIDTSALLNQIHHQFYQQLN VLEPCGIDNPDPVFWTSNVRVIEQQIVGKGHIKLTITQTIDDSEYKIKAIAWRWREYF PLPPRLDIAYKLRENHFNGNTTIELELLGVRLPTQSHIFFAAHSAPLRTTFEYNQCHY TCGIYKNGSVPELRIGNPDGKILAVPLGHSIGLLGSSRQQAIQVDVSQPQYNQILQTA LQALSMLSNQY" gene complement(38264..38386) /locus_tag="DP116_02120" CDS complement(38264..38386) /locus_tag="DP116_02120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873729.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein Ycf12" /protein_id="PRJNA477356:DP116_02120" /translation="MFSGLGNIHWEVIFQLLFVALIMLAGPAVIFVLAFRNGDL" gene 38553..38903 /locus_tag="DP116_02125" CDS 38553..38903 /locus_tag="DP116_02125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114954.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YkgJ family cysteine cluster protein" /protein_id="PRJNA477356:DP116_02125" /translation="MATWQCVKQCGACCHLDPADRPDLHEYLLPEELELYLSMVGEGG WCINFDKDSRECRIYPDRPRFCRVESEIFQDMYGIEPEELDDFAIDCCRQQIEGVYGD RSLEMLRFDQAVGI" gene 39185..39613 /locus_tag="DP116_02130" CDS 39185..39613 /locus_tag="DP116_02130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197527.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UPF0016 domain-containing protein" /protein_id="PRJNA477356:DP116_02130" /translation="MKLDSKPLNLATSGTQSQLEQKDELDCNCATTLVEVSSPLVCDK PTKPQTPVVIFATTFLTIFLAEIGDKTQLSTLLMSAQSQSPWIVFLGSGTALVMTSLL GVVLGSWMASRLSPKTIEKAAGITLLLISLMLFWDLGFGN" gene 39664..39942 /locus_tag="DP116_02135" CDS 39664..39942 /locus_tag="DP116_02135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319163.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02135" /translation="MDWHLLGLSFITVFLSELGDKSQLAAIALSGRSQSPRAVFFGTA GALVLTSFLGALAGGAVSELLPTRLLKAIAAVGFAILAIRLLFFNNEE" gene complement(39981..40466) /locus_tag="DP116_02140" CDS complement(39981..40466) /locus_tag="DP116_02140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312325.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02140" /translation="MNAVSNLELKRPIWQSLAILTLGFWLSASMFLDWVIMPSLYVSG MMTQANFASAGYVLFWNFNRIELLSAGLVLTSVLALCNSQSQWRRGAIILSVVLLAIA LADTYLLTPQMSAIGIELNLFETTAEIPASMNILHGGYWILEAVKLLVGGTLLSWCWR R" gene 40876..41433 /locus_tag="DP116_02145" CDS 40876..41433 /locus_tag="DP116_02145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316558.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02145" /translation="MRIAFLFSTAIACTAVIGGYTADLSQAVQLRDGKVYFVQPPSLL SAVTTYKDTYVWGATYYFTISLPENAGEPLQRITINQREGVDSVRFDLRDTSAFEGTP SKERQKLALKDVTRDNKTKTLSLTFDPPVPPGRTITLALKPVQNPTVAGVYLFGVTAF PPGEQPHGQFLGYGRLQFYNFGFRF" gene complement(41687..42769) /locus_tag="DP116_02150" CDS complement(41687..42769) /locus_tag="DP116_02150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879808.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cobalt-precorrin-5B (C(1))-methyltransferase" /protein_id="PRJNA477356:DP116_02150" /translation="MARTGYTLPVFAVAAAKAALMHLREKIDSQLSVEIDLLSETAEI SISQVAALDSQSALAVTLSDPGDNLDLTKNTPIWAWVKLSQRQSQALILEAGEGLGKT TSGEPAIYSYARRLFDTNLLPLIPPDQTATVSIILPEGRQLAQRTSNEAFGILEGLAL LGTSGISQPLSAADYLEEFRLSLQAKVKVCPGLVFCIGSNGMQVAQRLGIPESAVVQT GNWIGAMLVEAGLYQATSVLLLGYHGKLIKLAGGIFNTSSHLADAKLEIISAAVVAVG GDIQAVRAILDAKTADAAHKKLIELGLAESVFGILAEKISQRATAYVQKYANVALKVG TVLFDRKGEIISQDLHAKELLSLDQN" gene complement(42982..43370) /gene="ssrA" /locus_tag="DP116_02155" tmRNA complement(42982..43370) /gene="ssrA" /locus_tag="DP116_02155" /product="transfer-messenger RNA" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00023" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="Derived by automated computational analysis using gene prediction method: cmsearch." /db_xref="RFAM:RF00023" gene complement(43545..44492) /locus_tag="DP116_02160" CDS complement(43545..44492) /locus_tag="DP116_02160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317216.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit gamma" /protein_id="PRJNA477356:DP116_02160" /translation="MPNLKAIRDRIQSVKNTKKITEAMRLVAAARVRRAQEQVLSTRP FADRLAQVLYGLQTRLRFEEANLPLLKKREVKSVGLLVISGDRGLCGGYNTNVIKRAE NRAKELKAEGVDYKFVIVGRKATQYFQRREQPIDATYTGLEQIPTAAEANKIADQLLS LFLSESVDRVELVYTKFVSLVSSRPVVQTLLPLDAQGLEAQDDEIFRLTTRGGEFEVT REKVTAQTRTFSRDMIFEQDPVQILDSLLPLYLSNQLLRALQESAASELAARMTAMSN ASDNAKALMNALTLSYNKARQAAITNQILEVVSGAEALG" gene complement(44635..46155) /gene="atpA" /locus_tag="DP116_02165" CDS complement(44635..46155) /gene="atpA" /locus_tag="DP116_02165" /EC_number="3.6.3.14" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874659.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit alpha" /protein_id="PRJNA477356:DP116_02165" /translation="MAISIRPDEISNIIQQQIEQYDQQVKVANVGTVLQVGDGIARIY GLDKAMAGELLEFEDGTIGIAQNLEEDNVGAVLMGEGREIQEGSSVTATGRIAQVPVG EALVGRVVDALGRPIDGKGEIKTTETRLIESPAPGIVARRSVHEPMQTGITAIDAMIP VGRGQRELIIGDRQTGKTAIAVDTIINQKEEDVICVYVAIGQKASTVANVVQTLQEKG ALDYTVVVAANASDPATLQFLAPYTGASIAEYFMYKGKATLVIYDDLSKQATAYRQMS LLLRRPPGREAYPGDVFYIHSRLLERAAKLSDELGKGSMTALPIIETQAGDVSAYIPT NVISITDGQIFLSSDLFNSGIRPAINPGISVSRVGSAAQTKAMKKVAGKLKLELAQFD ELQAFSQFASDLDKATQDQLARGARLRELLKQPQNAPLSVYEQVALLYAGINGYLDDI AVNKITDFTKGLRDYLKTSKPQYVQGIQSKKALGDEEEALLKEAINEYKKTFLATA" gene complement(46273..46827) /locus_tag="DP116_02170" CDS complement(46273..46827) /locus_tag="DP116_02170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863345.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit delta" /protein_id="PRJNA477356:DP116_02170" /translation="MKSNIAMAEISQPYAEALMSVAQSKNLTDQFGEDVRSLLSLLSE SEQLRNFLENPFVGLDDKKAVINRILGDGANAYFRNFLLLLVDRRRISLLEPVLQQYL TLLRQLKQIALAEVISAVPLTEEQQQAVRDKVIALTKAREVELDTKIDPELIGGVIIK VGSQVIDASLRGQLRRISLRLSGS" gene complement(46830..47378) /locus_tag="DP116_02175" CDS complement(46830..47378) /locus_tag="DP116_02175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311144.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit B" /protein_id="PRJNA477356:DP116_02175" /translation="MGIIGSSFLLATEAVANGHSEGGFGLNIDIFETNLINLAILVGI LFYFGRKVLSNILSERRSNIETVIKEAEAQAKDAAVALSKAQEQLTQAQAEAQRIRKA AEENAQATREAILTRAAEDVERLKETAARDLDTQRERAIGELQQYLVSKALQKVESEL QTGIADDAQQQLIDRSIALLGG" gene complement(47531..47962) /locus_tag="DP116_02180" CDS complement(47531..47962) /locus_tag="DP116_02180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457000.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit B'" /protein_id="PRJNA477356:DP116_02180" /translation="MFDFDATLPLMAVQFLLLAALLNVIFYKPLTKALDDRDNYIRTN NLEARERLAKAERLTKEYEQQLAEARRQSQATVAAAQAEAQKTTAQKIAEAQKEAQAQ REQAALEIEQQKQEALRSLDQQVDALSRQILEKLLGPVLAK" gene complement(48230..48475) /gene="atpE" /locus_tag="DP116_02185" CDS complement(48230..48475) /gene="atpE" /locus_tag="DP116_02185" /EC_number="3.6.3.14" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307105.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP synthase subunit C" /protein_id="PRJNA477356:DP116_02185" /translation="MDPLVSAASVLAAALAIGLAAIGPGIGQGNAAGQAVEGIARQPE AEGKIRGTLLLTLAFMESLTIYGLVIALVLLFANPFA" gene complement(48688..49446) /locus_tag="DP116_02190" CDS complement(48688..49446) /locus_tag="DP116_02190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit A" /protein_id="PRJNA477356:DP116_02190" /translation="MVNFLNALTSVPLGELEVGHHFYWQLGNLKLHGQVFLTSWFVIG ILVVASLAATKNIQKIPSGIQNLMEYALEFIRDLTKNQIGEKEYRPWVPFIGTLFLFI FISNWSGALIPWKLIKLPSGELAAPTNDINTTVALALLTSLAYFYAGFSKRGLGYFKK YIEPTPVLLPIAILEDFTKPLSLSFRLFGNILADELVVGVLVLLVPLFVPLPVMALGL FTSAIQALVFATLAAAYIHEAMEGHGGEEHEEAH" gene complement(49567..50019) /locus_tag="DP116_02195" CDS complement(49567..50019) /locus_tag="DP116_02195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP synthase subunit I" /protein_id="PRJNA477356:DP116_02195" /translation="MSLSNESTDPTPTTRQDSRPGFEDTEPDNSMREFYELYQELLLI TLVLTGIIFFSVWIFYSLNIALNYLLGACTGVVYLRMLAKDVERLSGEKKQLSKTRFA LLVGVILLASRWNQLEILPIFLGFLTYKATLLIYVVRGAFASDLSKFR" gene complement(50665..51846) /locus_tag="DP116_02200" CDS complement(50665..51846) /locus_tag="DP116_02200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874652.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_02200" /translation="MSDSQTISAAVAKLYNTYPFPPEPLLDEPPPGYNWRWNWLAAYN FCTGQKPQKQDIRILDAGCGTGVGTEYLVHLNPTASVVGIDLSAGALEVAQERCKRSG ANRVEFHHLSLYDVEQLSGEFDFINCVGVLHHLPDPIRGIQALAKKLAPGGLMHIFVY GELGRWEIQLMQRAIALLQGDKRGDYRDGVQVGRQIFASLPENNRIVKRERERWSLEN QRDECFADMYVHPQEIDYNIETLFELISASGLDFVGFSNPGFWQIDRLLGKAPELMER VAKLGEIERYRLIELLDPEVTHYEFFLSRPPLPKADWSADNDLLTAIAERNPCMDGFP SQCVFNYDYQIVNLSPEELKFLESCDGLLTVGEILAGSSLDLDGVRTLLKQQLILLTP S" gene complement(51985..52260) /locus_tag="DP116_02205" CDS complement(51985..52260) /locus_tag="DP116_02205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214872.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_02205" /translation="MSERYDLRIAKTAEKDLIDLPAKQFKQVVSKIFSLQSNPRPQDY KALKGYEGGYRVDQGEYRILYTIDDENLLVDIFRVGKRNDNQVYKNL" gene complement(52253..52504) /locus_tag="DP116_02210" CDS complement(52253..52504) /locus_tag="DP116_02210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214873.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system prevent-host-death family antitoxin" /protein_id="PRJNA477356:DP116_02210" /translation="MTAVSATEARANFQELINRAEYKGERILIQRHGKAVGAIIGLED LRLLEAIEDAIDSAELRRAVEQNDGFTTLEAIIASRGDE" BASE COUNT 14873 a 11677 c 10933 g 15139 t ORIGIN 1 aactaccaga agctagagtc ttaccatccg ggctgaacac gacacttgtg acataactgc 61 tatgcccctt cagggtatgc tgtaactgcc cgttgagatt ccacaatttg actgtgttgt 121 ccctactacc agaagctaga gtcttaccat ccgggctgaa cacgacactt gtgacataac 181 tgctatgccc ttctaatcgg ttgcgttctt tcaatccgta aacaacttct cgcagagcaa 241 gaatcgcttt tgtgcggtta tctgaatcta cagcatcacc atattgtttc aaatttttcc 301 ctgctttcaa gccttctaat aaagcttcta gttcataacc tgaggtaaaa agattttctg 361 ttgtcagact ttgaactaaa acctgtgaat ttttgagggc tttttcagta tgtaaacgtt 421 gtgcttcagc atctaaacgt tgtgtgttag cataaaacgc catccctccc ataaaaattg 481 aaaatatcaa taatatcgca acataaggaa gtgctggctt ataacgctgc cagaaatccc 541 gtaaattcaa cggcaaatgt tgattcaccc actttaaatc aaacactcgt tgataaatgg 601 gatttcgcac cttcaacacc ccattcttcc ttttaaccac acccgataat ttcaggtgag 661 atttagtaat tgactgttct tcatcagcaa cagcgcgttt tccgcgccga atttcacgat 721 aaatgcttaa aacagcagct ttgtctggtg cgcgtttggt cagcatatca cggacaaact 781 gcaagttgtt atcctcctca ctccttgcac cgaaaaagtt tctactcacg atacggtcaa 841 tatcagcttt tgaccaattt ttcttgtcct gctggctaag aatactacaa atacgctgcg 901 ataaatatgg atgtcctcct gtccaatcaa gtacccagtg caacacttgc tgtgctttag 961 gagttggcaa ccccaatccc tttgccagcg gttctgcttc agttaaggta aaatcagtca 1021 ggtcaactcg ctgaccaaca ttaaacggtg tccgttttgg atctcgtatc aagtcaccgg 1081 gagttgcgac accaaataag acaaacgaaa gccgttcaaa ctctgatgat agagaagcac 1141 gggaaacgta gagatagcga atagctgcat aaaaatcgtc tgtaaaatct aggcgaagag 1201 tggtatcaat ttcatcaata aagatgacca ctgatgccgt gatttcaacc aacaacaccc 1261 gctgaaaaaa ttctgttaac cgttgtgtga aaccgagatg cgcgtgtgct tgccaccacg 1321 aatatacatc agtatccagc atcagttgtg cttctatctg aatcagtaaa cccagatacc 1381 actgttctgg gtttgtgact tgcgtaccta tccccgtcaa gtcaatcatc actgactcaa 1441 tcccttcggt 
tctcaattga cgggcagtcc gtaccattaa actggattta cccatttgac 1501 gcggcgtcag aacataagca aactttccgt ctcgacacag tgctaacaat tcctcatctg 1561 cttgtcggga aatataaacc ccactccggt tcgtctgtac tgttccacca acagtgtaaa 1621 tgcttgcatc agccatgtag attctcccgg aaatattcag catagctgcc ttcgtacttc 1681 tgccttctca tactgccaaa agccaggata gctaagggga aaatcacaga cgcttttttc 1741 atcttatacc gtttctccat gaaggtgcgc taaataattt cttagccccc ctttttaagg 1801 ggggttgggg ggatctgatt tgttgcatct tcatatagaa ctcgtattag aagaagttga 1861 taggtttttt gtttcatggt gttttctgga acgaagaaat cagaacttgt tagctttttt 1921 catcaggtat gagccagttt tctactttta aattagaaac acgggaaaat tcacgaacat 1981 tggcggtaac gagagttaga ccaaggctta aagaatgggc agcaattaat aagtcattac 2041 ctccaatcgg tttaccttgc tgttctaaat ctgtgcggat ttctgcgtaa tattgctcta 2101 cgggatgagt taggggtagt atttcaatac tattcaagat gatttcaagt ttttctatga 2161 gcttctgaga gttctttttt tttgctccaa atcttgattc acaagctacg ataatacttg 2221 tacaaatttt gtcttctccg acatcttgaa ttttggaaaa tatcacgcct ctgggatttt 2281 taatgagttc ggaaatgatg tttgtatcta acaagtacaa ataactcatt tttaaagctc 2341 aatatcatct aagggtaata atccttcgtc cacgtcggga aactcttctt ccagtggttc 2401 aagagtggaa agggtttgca ggagggattt tttcttaatg ggggaaatga tggcgaaggg 2461 aacaccattt tggatgatgg tgaggggttc gcctgttttt tgagtttggt taattaggtt 2521 ttggacagtt tcgggaagtt ctgcaagggg aatttgtttc atggtggttt ctggggttag 2581 ggaatgggga cttggcgcac atttatttta cctgtgacgg cgtgactgat gagggcggtg 2641 cgtttttcgc ccttgttttt caatgagggc ttctttcttg gcaatgaggt cgtctatttg 2701 tttaatttta tggtcgagga agtgagcgat cgccttctgt tcatctagtg ggggctgttc 2761 tatgcttatt ggtgtcaact taaagtgaaa actttgttat actgtgccac caacagtgta 2821 aatgcttgca tcagccatgc aaattctccc ggaaatattc ggcgtataac tgacaacggg 2881 gcagaacaga ctgtccggaa gcccgcacca atcccgaacc ctgcaaccgc cagaaaattt 2941 gtctgtcggg acagatgttt tgattaatga caagcagcaa accttgtatc aattcttgct 3001 tatcatataa aagtgacaac agccgccgca gatgatcacc aaaaggtcca gaagcagcag 3061 tagcattggt aaataactct gttggtgtga tttgtccgct ggcaactaag tacagcgcct 3121 ttcgcaccag aaaagggtgt 
ccgccaagta agcgaagcaa ttgttgtaag gtgttgttgt 3181 caaaaggtga attatgacga ccattgagtt gggcaacttg ttctggtgta aaatcttcta 3241 gttcaatgat ctgcccgaca ttaaacggag actgattcaa atcatcaatt aactgatatg 3301 gttcagtaga agtgacaagc actaaatcta atgtgtctaa aataggataa tacgcacggc 3361 tgttatgcca gttacgcaac atcccaaaaa aatcattgcg gaatttggta tcaaaaactc 3421 tatcgacttc atccattgcc aataccaagg gatgtcccaa tccttttaag atatggcgtt 3481 gtaaatagcg atcgcaacgc tgaatcgtcg ttaagtttcg agcaaaatac tcctcgactt 3541 tatcttccag ttccagcaca tcgcttatcc aaaaacaaaa gtgacggaag aacagttcct 3601 catcgttaag tactgcagta cttaactgtt gaaaatcaag ataagcaact cgcttaccag 3661 cattgcgaac agcttgaatt atccgcatca ataatgaact tttcccaact tgcctaggtc 3721 ccttaatcgg aatcgttacg ccttgctgta caatagtttg caatgcaatt gtgtccgcgt 3781 tgcgttccac ataaaacgaa gattcaggtt tcatcgtccc ggatggcatt tctaacacca 3841 ctggctgtgc aaatggtgtt ggtcgcggta actgctcagg tacacaaatc ttgagtaaat 3901 tagtcttagc ttgttggtca actcttaagg aaccaccatg aattgcttgc cttaactctt 3961 ctatgagacg cgatgtatcc tctggacttt cccaaaatgc ccagttgatg tgattcaagt 4021 aaatgctgag gggatgggga aacgcttctt gatacgctaa acgcactggg agaataacag 4081 gttttcctcc ttgcctttgt gctaattcta ccgcaaggga gatttcccac tgcaccatct 4141 cgctattaac agactgttcg gaaaggaaaa cgatgagaaa atcagcattt cttaattctg 4201 tttcaatgca ttctgcccag cgtttgccta tcaatattat ttggtcaata aagacattgt 4261 gctgttgtga caattcctgg taaacttgag tggcgacaga ttcatccggt tcaacattgc 4321 gcttgtaact gataaagatg cgctgttttg aagtcgtgga agcaatcgtg tcttgattgc 4381 tgtttctaga tttcaagaca ggtgtttctg attcgggaat tccttctaaa tcaattgaag 4441 tgcaaccaaa ttcaaaagca tcctcgtagt ttctgcctgc accgatcgca tcataaaatc 4501 caatggcgaa cttaatggct gccctatccc caacagctcg attcatccca atcacataat 4561 ctatgttttg gtgaatcgcc tcagcttgta cctcactata gcaggcgttg agcaagacac 4621 actctacttt gtctttaaat aacttaaata accttgctaa ggattgagta ctaacaagtt 4681 gtaattctcc agagttattc tccaacgcca agccatcact cccagaacca tgtcccgaaa 4741 aatgcacaat ttgcggttca tagtctaata aagcacgtcg taaatcatca acccgtacag 4801 cccacctgct aatgatttca aactgttcgc 
gctttttggc acgttgtagc cccgcctgaa 4861 tttctctaac ttcctcatcc agacgtaact tggaggtatt gtttgggtta gctgataaaa 4921 tcagaatttt tttcactttg gtatcaacta ctgttgaatg ggctgttatt tatcagttat 4981 cagttatcag ttaccagtta ccagttacca gttaccagct accagttacc aagacaatta 5041 cggattagtt ttcttcactg ttcactgcgt aggtagctgt tcactgcggc tggtactgcg 5101 taggtagctg ttcactgttt gaacaaactt aggcgttgat gaaaaatccg attatgtaaa 5161 tcagacttat ttactaaata agtgtaagca aaatcaactt atgcacaagt gagatagaaa 5221 tcggaaattt ttgcgatgct cctttaggag ccttcatgtg gcgatcgccc ttctttgtcc 5281 agagggggta caagcccata tttttgagca cattttatca actatacact aaaagaaaaa 5341 tacttttact acagcttcat tagaactaaa tcttagtata ttttttgaaa aaaactagta 5401 atttatacgg aaatacgaaa gtcagcaagt agtgttgaac aggtattaac aatttttggg 5461 ataaatgctt ctacttactt tgtatcagaa tacaaagcaa taaaaatacc taaacacaga 5521 gtattaacct tgtgttacca accaaaactc tcatggctag tgagtagtct agaaaaattg 5581 acaaatacct agaactctct ccatcctttg gttccctata aatgaacggt tagtcatcag 5641 attgaaaacg taatattgtt ttataaagca ccagatgaag attacaggca ttattttcat 5701 tgatccatct gtcaacaact acgaaatttt acttaattta gttgttgcaa atatcaaggc 5761 tattgttctt gacccagagc aagatggagt cacacaaatt acccaggtac tatctcaaca 5821 ccgtggagtg gaaaatgtcc acatagtttc ccacggctct tcgggaaccc taacccttgg 5881 caatagcgaa ctcagcctca acaccctaga actttacgca gaccaactca agaattggtt 5941 ctcaccagat tctcaccttc ggttctctcc agttccctcc cttctcctct acggttgcag 6001 tgtcgctgca ggtgatgctg gggtagaatt catcaccaag ctacaccaac tcacgaaagc 6061 aaatgttgcc gcctccgcca acctcactgg caatgctatt ttggggggtg attggaacct 6121 agaagtcaac attggcaacg tttcaaatca accactggca ttaagcgcgg aggttatggc 6181 agcgtatcca tctgtactca ataatgcgcc tatcctcaat gacaccgacg ttatcctgaa 6241 tgccgttaac gaaaatgcag gcgcacccat aggtgcagtt ggtaccccaa ttttctcctt 6301 agtgaccctt ggtggtaacg ttattgacga agaccgcgac gcactcacag gtattgcgat 6361 cacaaatact gacactacca acggcaattg gttctacacc atcaacaatg gcagcaactg 6421 ggcagctttg gggtctgtat ccgataacaa cgctcgtcta ttggcagctg atgcaatcac 6481 ccgcgtttac ttgcagccca accccaacat tagtagtcct 
agtcctattg tcaatgctct 6541 aaccttccgt gcgtgggacc aaaccactgg taccaacggc agcactgcag acacctccgc 6601 taacggtgat actactgcct ttagcactgc gactgacact gcagcaatta ccgtcaatgc 6661 cgttaatgat gctcctaccc tcaataacac caacatcacc ctaactgcca ttaacgaaga 6721 cgcaggtaca cccacaggag cagttggtac ccttatttcc tccttagtca cgggtgataa 6781 cgtcatcgac ccagacagca atgcactcag gggtattgcg atcacaaatg ctgacactac 6841 caacggcaac ttcttttaca gcgctgacga tggcaacaac tgggcacctt tagagttggt 6901 atctaatact aacgctcgct tattggcggc tgatggaatt acccgcattt acttccagcc 6961 caaagccaac tttaatggca ccatcaacaa tgctctaacc ttccgtgcgt gggaccaaac 7021 cactggtacg aacacaggca ttgcgaacac caccattagt ggtggcacta ctgcttttag 7081 cagtacaact ggcgctgcag gaattacgat caattccgtc aatgatgctc ctaccctcag 7141 tgaagctaat atcaccctaa ctgcaattaa tcaaaatgta ggcgcaccca taggcgcagt 7201 tggtacgctt gtttcctccc tagtccgtct tggtgataac gtcatcgacc ccgacagcaa 7261 tgcactcaca ggtattgcga tcacaaatgc taacactacc aacggcaatt ggttctacag 7321 caccaacaac ggcagcagct ggatacccgt aggaatttta tccgatacca acgctcgcct 7381 attggcggct aatgcaagca cccgtcttta cttccagccc aacggcaact ttaatggcag 7441 catcgataat gttctaacct ttcgtgcatg ggaccaaacc agtggtacca acggcagtac 7501 tgcggacacc agcgtcaacg gtgctgctac tgcttttagc actggggtta acactagcac 7561 aattaccatc aacgccgtca ataatgctcc tatcctgctt gacaccaacc tcttcctaag 7621 cacagttgaa gatgcaggcg cacccacagg accagttggt acccttgttt cctccttagt 7681 cagtcttggt gttaacgtca ccgactcaga caacaatata actacaggta atgcatctac 7741 aggtaatgca cttacaggca ttgccatcac aaatgcgaac actaccaacg gcagttggtt 7801 ctacagcacc gatgatggca gtagctgggc agctttagga tctgtgtccg acaccaacgc 7861 ccgcctattg gcggctaatg gaaccactcg cctttacttc caacccaccg ctaacttcaa 7921 tggctcgatc aacaatgccc tcaccttccg tgcttgggac caaaccagtg gtaccaacgg 7981 cagcactgcg aacaccaccc ttaacggtgg cactactgcc tttagcattg cgactgacgc 8041 tgttgcaatt accgttgatg ccgtcaatga tgctcctatc ctccgtgaca ccaacgtcac 8101 tctaactgcc attgacaaaa atgcaggcgc acccataggg gcagttggta cgcttgtttc 8161 ctcgttggtg agtgtgggtg tcaatggtaa cgtcaccgac ccagacagca 
atccactgac 8221 aggtattgcg atcgcaaatg ctgacactac caacggcacc ttcttctaca gcaccgacaa 8281 tggcaataac tgggcaccct tagggtctgt atccaacacc aacgcccgcc tattgacagc 8341 tgatgccaac acccgcattt acttccaacc caacgccaac ttcaatggca tcatcagcga 8401 tgctttgacc ttccgtgcat gggaccaaac cactggtact aacggcagca ctgcggacac 8461 taccgtcaac ggtggcgcta ctgcctttag cagtgcgact gatactgctg caattactat 8521 tggtactgtc accaaccctg tcaccaatcc cgtcaccaat cccgtcacca atcccgtcac 8581 caatcccgtc accaatcctg tcaacggtgc tgctaccctg aaattagcgt ctaacaatat 8641 tttccagttg gaaggtgttt ctgatggcaa caaaccaaaa cttgaagtca ctctcaacaa 8701 agccagtgct aagcaggtga atgaactggg agtgtttgtg gttgacgatg accaaggaag 8761 aattaatggt attgctcctg gtacagaaaa ttatgcccaa gctgcacttt caagagctaa 8821 caccattttc tcagtgatta ccgataatcc caaggggttt aatactgact taacacgtct 8881 acttgaattt aattcagggg cacgcctgca attcttctta gtcaacaata gtagtattga 8941 cgctgtgcaa gctggttcaa cctctaccaa agacctcatc ttttccaacc cctcaacgca 9001 aaaagtgaca gacttaggga atggcgagtt tacattggct ttgaatgatg catcaaatag 9061 caacacttct gacttccgaa atctggtagt gaacattaaa gcaacaaatc aatcgttacc 9121 tctgggtgct ggtcttcagg gaaatccttt tggagaacta attgatttgc gctacggttc 9181 aactcaggtc aaagcagact ttgtcttaaa cagaaaagca acctttgaca atttcgttgg 9241 cttttatcaa gtggctgata ccaatggtgg tattgatatt gatggcaacg ggacggtaga 9301 cttgcgtcct ggtgatgcag gttatactca agctgcggtg cagcgacgtg ttccaggaat 9361 tgacttaaca gtggggaacc aaagtactgc tacattcaca agcaatctta gccctggttc 9421 aatcttggca ccattcatta ttgctaatgg cagaccagat gcaattttag atggtaatcc 9481 aaataacgac cccgcagtgt atttttcctt cttaggagct aactcagata aggttgacca 9541 tattcgcttg ttggggaata acacctttgg ctttgaagat ttatcaggtg gaggtgataa 9601 ggattacaac gatatgactg tcagagtgaa tttgggtatt gcttaaccca ctcacaaaga 9661 gtgtgtttct tatagcaggg aacagggaac agggaacagg gaacagggaa cagggaactc 9721 ttaacgctta actcttaaca gccgtgtgaa agtctggtgg tgtccgtatt ttctcattag 9781 ttcatgtctt aatctacctg gcaactgcta taactctcaa gaaatacact ctttgtgagg 9841 gagtcagaca gctaaaagcc gtgggacgga cagataccta cggcatttta cgggaatgag 
9901 cgcaaagcgc gcgaagcaag tgctgtggga ggagtgaatt catttcggaa aagatgtgcg 9961 agagtgacga cgcaaaatca cccaactcac aacaggagtc cctatcaaag cggtaacaga 10021 attcaaaggt aacaccatct gacttcccgg tagctgcgat atcaaatctg caactaacgc 10081 cagaattgca cccatgaaag tcacagcagg gactaacacc cgatgctcac aagtattgaa 10141 cagactgcga caaaggtgag gaactgctac acctaaaaag gcgatcggtc cgcaaaaggc 10201 ggtaaccgca cctgctaaga tggaagaggt gataagaatc caaaatctga cttcttctac 10261 acctaaaccg agactgcgag cgtaagcttc acccagaagg agtgcattca aaggtttgga 10321 tagcatgagc gaaattaaca acgctatcag taccacaggc gctaaaacta gcatttgttg 10381 ccaggtgaca ccaccaaaac tgccaaaagt ccacagcaag tatgattgga tttgctgact 10441 ttcactcaaa tgcagcaaga tacttacaat agcactcgtt gcatagccaa acaacaatcc 10501 taaaatcaat agcgtcatcg tgtcttcgac gcgctgagaa acaagtaaca ctaaccccaa 10561 cactgctgca gcacccagac tcgccgccac aactaaacca aaatcgccaa tcactcctaa 10621 atctttgaac aaggtacccg caccggtgac acttgttgtt aacaccacca acgcgactcc 10681 taaagaagca cctgaactaa ttcctaatac aaaaggtccg gctaaagggt ttctaaaaag 10741 tgtttgcatt tgcaacccac tcacgcctaa agcagcaccc gcaagggttg cggttaaggc 10801 tttgggtaag cgaaatttca gaataatatt agtccaagtt gctttttctg gttctccgtt 10861 tagtaggatt gtaatgactt gggttagggg tatgtggacg gaaccaaggg ctaagtcgag 10921 taagaaggtg agaattaatg ttatgagtag tgcggggaag aagagttttt gaaaacttga 10981 atgttgaaaa tttcgtaaaa ttggtttgag tttgtgtttg agggatgaca ttgaaaaaac 11041 gaaccgcaaa gacgagccag tgcgttgcgg gggttccccc cgttgtagca cctggcgtgc 11101 caaggacgca aagaaaggaa agaagaggaa agaagaggaa agaagaaaaa agaaaagaaa 11161 agaaaaaaga gatttgggag attagctgag tttttgatag taaattaatt ggtggtttgg 11221 taagatttct ggatggaaaa tcttaattaa gtcggataag atgacatcag ggttactaat 11281 tccaccttcc cagtaatcgt tgccaccagt ctgattgata cgggcattat tattataaac 11341 gtttcctttt tgaactgctt gaaaatcagc gtagcgattg tcttcagcct gtaaatcttt 11401 taaagttttc caatattgac tggaatttag ccagtagtca gcgttggcag cacgttctaa 11461 aatagtttca aatgataggg gaagagtgcc agatgatttc ttgttattcc agagatagtt 11521 acttccggcg tcagccaaaa attttgcgac ataactgtta 
cctcctggca tataccagat 11581 acctttaaag ttgaatccca caaacacagt cggacgattt ttcacagatt gtgctttggc 11641 ggctatagct tgatattttt tgacagtttc atcaaagatt ttttgggcaa ctttctcttg 11701 attaaaaaac aaagccgtaa acttgagcca ttcacttctt cctaaggggg aagtttccat 11761 atattcggca tttatcgcca cttttaaacc tgcttcttga attttaggat aattatctgt 11821 ttgtttgtct cctgtaccgt aagtcatcac taagttagga ttaatttcta gtaattgctc 11881 aacattgaga ctagcattat ttcccacttt tgtgattttg ccttgtttca ttttttctac 11941 gacttctggt gtattgactt gtttgctatc actcacacca ataagtttat ctacaactcc 12001 taactttgct aaatgtggta aatgagtcgt agaaagagag acaaccgtat ttataggtac 12061 tgtaatgact tggctttgat taaaccctgt tggtacaggc gtcccgcatt gcactaaaat 12121 atattgaaaa ctgacttgtg catcacgcca gggatttttt accgtcacaa ccttgtagtt 12181 tttataatat tccacctcaa aacccgtggc gtaattgacg gttacttttt ccgggaagta 12241 gtcggtgttt ggattgtaag tttgagcgca ttctttgatg tgtgttggta aagtgggagg 12301 agttgtggta ctgtgacaag cgatcgccaa aacagcaacc aacagaaact gacagaaaaa 12361 agcgaacaat ttagctttgt gaagtattgt aattctcatc ggaaagtttt ccagaaataa 12421 ccacaaagac acactaataa cttatccgcg tccaaagctt gtgcatctgc ggttactttt 12481 tttcaaccga ctctggggga tagcaatgcg cagtcacttt cttaccttca accagatgtt 12541 cattgacaat agtttcaacc acctctggtt tgacgcgagt ataccaagtg ctatcctctg 12601 aataaaacac aactggacca agtttgcaaa cttctaaaca gccagaagtt ctgacttcca 12661 catccaaacc agctttttca actgctactt ttagtgcatc caaagtaggt tcccaatttt 12721 gggaagggag acaacgagta gacgtgcaaa cagatatata agcatgtttg aggggatttt 12781 tatcaaacgg tatcggcttt ttacccaaca cataaatatc ccgtagttca tggtaaagct 12841 ctaattttgc taagttcggc ttaccttccg gtaataagtt ttctccctcg acatagggat 12901 taggcaagaa gatagtcatc aaattttcgc ttgcaccatt ccagaaatca attccccaac 12961 ttctggggat tccttctgca ttaaatcgtc gataaaaagc agcacgattt acttgacgtt 13021 gtttccttaa ttctaatggt gttttacaat ggggaccacc taaatttgct tcaatacata 13081 agtgaaaatg ccaactttct gtgactacgg taaggtaacc atcataaaga atacaaattt 13141 tgggtgctgc aggaaattct agttccaaca caccaccttg tacaatgtgt cctactccta 13201 cttgatgcca attctgctga 
aataaggtgt aaagtaatga tgataaaacg tcacaactgc 13261 aatcgaaata ttcagattcg atgtatcctc ctgttaggga aggttctttg atgacatggt 13321 ttaatggtgt tacccaagga tgaaattgat tcatagtact tttggattgt gagttgttga 13381 ttggtgattg ttgatggtta tagcaatcct atctgatttg tgaaaaagag ttttattgaa 13441 ccgcagaggc gcagaggaca cagaggagcc agcgcgaatg acggctttcc cgacagaggc 13501 gactggcgtg aggagaaaag agaggaagaa ttaattccta aatagtattt tagtagtatt 13561 taaaatgtgg catttaaacc tacttgaaaa actctccctg catcaggata accaggaaat 13621 aactgatatc tctgattcaa gatattatct acactaccag tcagtactaa gttatcgctc 13681 agaggaattc gcgttttgaa atcaaaagtc gtatacccgg acaaagattc tgtgttggta 13741 ttgttggtag gatatgaacc aagggagtgc atgagtattc ccgcatagaa gccttgaggt 13801 gtttcataag aaattcccaa atttaaacta tctgcacctg caaatcttaa ttctttgtct 13861 ttttcgccag agttgacgct ctctttaatg cgtgggtcat tgagtgtata attggcaaag 13921 gcatagacat ttttcgccag ttgtacattc aatgcagctt caattcctct ggtgcgcact 13981 aagccaatat tttcatatgt ggcgacagga acagcaaagt tataggcaat taaatctgat 14041 atggtattgc tgaaaaaggt taagcgcaac aaaccaaaat cccacagctt ttggtcaatt 14101 cctatatcat aactatcacc attttctggt ttcaaattcg ggttaccaac gaaggtagaa 14161 cctctagcat ataaattgaa aagagtcggt gcgcggaaat tcctgatata gttagctctg 14221 aagttagtgg agtcagaaat tgcccagcgt gcgcctacac taggtgatgt aaaagagcca 14281 tctgttaaag agctaaaatc ttggcgtaag cctaaattga cacttaagct aggagtaaag 14341 ttgatttcat atctggcaaa aagcgctcct tgagagatgc tatcgtcgta gctgagagtt 14401 tgttgctcgg tggaatagtt gaaggtgctg ttacgtactg aagtgttgcg gtaatcaaaa 14461 ccgtagacta gggtttgatt tttagcaaat ttccagctat gttgtgtttg aattccgtaa 14521 gaggtttgct cgttatcaaa tcgtcgctga gatgaaagtg tcccactacg attctcaaaa 14581 cgagtgttga gaaaatcgct ataaactctc gctgttaaaa gagagtcatc tgcacctcct 14641 agttttgagt tccaggttaa gtcggtgaga acttgatctg tgtatttgcg attgttgtca 14701 gtcaaggagt taaaatagcc ttgtccaaac tggggttcag gaatagggac tcctcctgga 14761 actccttggt ttttgcttaa gtataggctt gagacagtaa gagtattgcg ttttcctaaa 14821 ttcgcctcta acttcacatt aaagttgttg tagagagtgt cattgttttt tctagttcct 14881 
tcaaagttgg cttctgggat ggaaaaagga taattgttat ctgcttcggt gcggttatat 14941 cccacaaccc aagaaatatt ttctattttc ccgctattgg tgatagtctg ttgattgagt 15001 ccatatgcac ctagagtgac tcccgcttgt gttgtcactt tctctgtcgg acgacgagtg 15061 ataatgttaa tcactccacc aatcgcatcg gaaccgtata gggtggaacc tcctcctggt 15121 aaaacttcta ctcgttcaat gttgttggtg gtgatttctg aaaggtcaaa accaccactt 15181 ccaagattat taattggtct tccatcaagc aaaatcaata cttgttcgga gttggaaccc 15241 cggataaatt gaccgcttaa tgcgttcact tctgttccaa ctgttccatc aggtaagatt 15301 ccggggagaa atcgtagcgc ttctctgacg gttcgcactc cctgcgcttc catttcttcg 15361 cctgtaatga cgtagactgg acgggtagaa tctttcactg tcccctctcg acgaaatgga 15421 gagaatacag gctgattcaa cactttacct gtgactgtaa tttcaatagg gggttcttgt 15481 tcatctgttg gcgtatttgt gctttgggcg attttttctg attgtggaat ttctccagca 15541 cacacagcat agttccacaa tatcaaagtc gtcattgcac tcacactaga tacaacgact 15601 ttcaccgccc aacggcaatt aaaaagaata tttagcatct gtacctcgca gacgatttta 15661 tgttttttac ctcaggcggg cattctgact tagagacaaa attttgagtc tcatcacagc 15721 tgcgggacag tgtcggattt tcaccgttcg cctacggcgt ggcgcaggca tactttcccc 15781 gttacctctg atggctgaac cccatcagaa cccaggattt gctgtaattt accatgctca 15841 atttctcaat gcaaataaga attattacat ttagctttgt tgtgatgttg caaatagagc 15901 gatcgccatt tttggaaatg ttaacgctcc cgttgattgt ttccaacgta tagtctttag 15961 gatttccagc cagataaacc gcgtcttcac tggtggggtc aaaatagcga atcagcgcgt 16021 aatcttgatc gccattaccc aagtagaaat cttttggatt ttctttggcg ttgtcgccaa 16081 ggtaaaaaac attccgtcca ggtgaaccag ttaaaatgtc gatttcgcca actcccaagg 16141 attctgtttc tgcatttatc agtcgcgaac ccacttcggc ggtgttcacc ttcaccccaa 16201 tcacagaaga tttttgtccg gaagcttgaa ccgtgtcgtt cccattcgta ccattcaaga 16261 tgacaccgtt gcgttccaca aattgtccaa cttggacaaa ttcatcgtat cctgaagcgt 16321 actttccttc tgcaacagct tttgcaactt ccggattaag acctcacccc gccctaacgg 16381 gcacccctct ggtgaattcg cggagagggg acaggggtga cttgtgactt ccctcagaaa 16441 tttttcttaa tacatagcag cattgctgcc tttttttggt aagtctatca atatcatctg 16501 tggttcaatt tttaatttct gcatcataat tcatgttcaa agtccgtcct 
aaacaaactg 16561 tatccatgct gacacgtgat ggtgtgcgct tggatgcaga catctaccgc cccgatgctg 16621 atggggaatt tcctgtgtta ttaatgcgac aaccttatgg tagggcgctc gcctccactg 16681 ttgtctacgc ccatcctacc tggtacgccg cccacggcta cattgtagtt attcaagatg 16741 tccggggacg cggtacatca caaggaaact tcaaactttt tacccatgaa attgaagatg 16801 gtgaatatac agttaaatgg gcagcaaatt tacctggtag tacaggaaaa gtggggatgt 16861 atggcttttc ctaccaagga atgactcaac tttacgctgc tgctgccaaa ccaagcgccc 16921 taaaaacaat ttgccctgcg atgattggtt atgatttata tacagattgg gcttatgaag 16981 gaggcgcatt ctgtttacaa agcaatctcg cttgggcaat tcaattagcc acagaaactg 17041 cccggatcaa aaaagatgaa aaggcttatc aagcactgtt ggcagcttcg cgtcatttac 17101 ctttacacga tccaataccc actcatccag aaattttgaa aacttttgct ccagactcat 17161 tttatcatga gtggttggca cattctcaac ccgataaata ttgggaaaaa ctctccccca 17221 aaacatatct caaagatgtt gatttgccga tgttccacat tggaggatgg tttgatactt 17281 acctacgcgg cactctaaac ttatataagg atatctctgc tagaagtgca tatcgacaag 17341 aactgttaat tggaccttgg gcacatttac cttggagtcg taaagttggt gcggttgact 17401 ttggtccaga tgcagcaagt cctgttgaca ggatgcagct atgctggttt gaccagtttc 17461 tcaaaggtgt tgatacagga ttacttgatg agttgcctgt ttggctattt tatatgggaa 17521 gcaatgtatg gcaaggtttt cccagcttat ctatatcaaa agggcgatcg tactttttgt 17581 caaccaccgg acttgccagc atcagggaag gcgaaggtat tctcgccaca acttgtcccg 17641 aaacttccac tgatgatgta cttgttcacg acccttggcg acctgttcca tctaatggtg 17701 gtcatgctgc aattccggct ggttgttttg agagaacgca tatcgactac cgttcagatg 17761 tcttaactta cacaactgag agtttagaga cagatttata tttagccggt gatgttgtag 17821 ttgaagtctg gtgtatgtct gataagaaga gttatgattt gtgtgcggtg ctgtcagaga 17881 ttttccctaa cgggagagtg tacaatttga ctcaaggtta tttacgttgc caagatggaa 17941 accatcgcgt gagaagaaca attcatctac aagcaacttg tgcgaaaatt tttagaaatc 18001 atgctttacg tttgagtttg agtgcttcgt gtttcccagc ctatacaatg aatcctggta 18061 ataattcagc ctcaagtata gatgctgaaa ttatcacatt gatggtgagt tgcgggggtg 18121 agagtttatc gcaaattata ttgcctgtcg tgacaccata aatatagcgg cacatcgcac 18181 tccatccctg caatcgatta acataaaata 
gtccgcaatg gctacgcacc aggtgaaaag 18241 ctatgtcaga aactacccta gctgcaccat acgtcacact accgccaacc caggcagaac 18301 ttccctgcga tgatggtatc ccaatggaaa ctcaacgcca caaataccag atggacttgc 18361 tgattgaaac cctagagttc tggttagcac aacgcgagga tgggtttgtc agtgggaata 18421 tgtttgtcta ctacagtatg gcgcaggtgc ggaacaaaga ttttaaagga ccagattttt 18481 ttgtcgtgtt gggtgttcca aagggagaac gccgcagttg ggttgtttgg gaagagggta 18541 aagcgccaga cttagttatt gagttacttt ctgagagtac agccgaagca gacaagaatg 18601 agaaaaagct gatttatcaa aatcaaatgc gtgtgccaga atatttctgg tttgacccat 18661 ttaatcctga cgattgggca ggtttttcta tccaacaagg agtttaccag cccttggttc 18721 cgaatgaacg aaatcagttg gtgagtcagt cattagggtt aggattacag agatggcaag 18781 gtaagtatag aggagtggat actgtttggt tacgctgggc gacaatggaa ggagaagtgc 18841 taccgactgg tatggaaatt gcacagcaag aacatcaact ggcggaacaa gaacgccaac 18901 gcgctgaaca ggaataccaa cgcgcggaac aagaacgtca acgtactgaa caagtgcgat 18961 cgcagttaca gcaaacagta cgaaatttac ttcaagcggg gatgacggtg gaacaggttg 19021 ctaagttaac gggtttagat gtttctcaag tacaggaatt agggaattaa cctaatatat 19081 agttctcaaa aagcttaaaa actttgttgt ccaatgcagt ggagagttta gaagcaagaa 19141 cacttcttgg aatccagtca aaagattttg gcaaaccaag tttttcccaa ataacactct 19201 ctgtagtgat gcttgggttt tccaactctt gcgcgatgaa aactcttcca gaaatacgat 19261 gatgggtgat actgtgtgaa aacttgctag acagttccaa aaattgaaga tgttctagag 19321 attgaagaac cttcttgact tctgaggctt cagttgcggt aactaatgga aagccgacag 19381 tgtttgagag aaatcctttt gttctgtggg cgttctgctg aaagcagcag cgcttcgcta 19441 tcgccaccgt atctgtcgcc ttcctccaaa acaccaaagc aaataactcc acatccaccg 19501 tatcacgtcg tggttttttg ggaggacaaa tttcaacgca attgtgtgca aaagccaaac 19561 attgttcccg tattgggcac aaaaggcaca gaggtttcga tttccgacaa acagtcgctc 19621 ccaactccat tattgcctgg ttaaagtcac caggacgttc ctcaggaatc atctgatcaa 19681 catgagtttg aatcgctgat tttccagaag aactccacac atcctcagat aaagcaagca 19741 aacgactcac tactcgtaca acattaccat caacacaggc aactttttcg ttgaagcaaa 19801 tacttgcaat cacagacgcc gtgtaaggtc cacaaccagg aattttcaac cactcataat 19861 atgattgcgg 
aaaatgacct tcaagctgct caacaataaa ctttgcacct ttctttaagt 19921 tacgggcacg agcataatag cctaatcctg accaaagttg acgcaatatc tcttcatcac 19981 aagtagcaag gtcatcaaca gtgggaaggt gcttaacaaa ttctgagaat tttggcacaa 20041 ccacagcgag agttgtttgt tgactcatca cctcacacac ccaggtgtga tagatgttga 20101 ttttctgacg ccaaggcaat ggtctgtagt gatgatcgta ccacgcaagg agtttctcag 20161 cacattgatt cactatccaa cctttgttca ctccaagata aatcctagca atgtcaaaac 20221 ccatctcaga attgaaatgg gttttgtaaa gttattctgc taaaaatgag agttgtcgct 20281 ttagaaaaaa ttcatgatag tgatgacacc agcgctggtc agcatgacca aagctcccat 20341 aactaaagac tggctcagtg ggtttggagt tttggaatcg ttcatctgta gaatccttta 20401 atctaatctt aaagattcta ttaacaatat cagtattgcc acatatataa atggaaacat 20461 agtaaaactt aaattagctt aatttatctt agtaaacacg gtagttgtca ctcaactcaa 20521 gatgcatact tgtgcaaaac tttctgaaac atgaattttt gtaaaagtct tgagtactga 20581 gtgcttaaga agaacgcaga acacagaata tctccacaaa taagatgata ctgttggtca 20641 attcaccagg ggtgttacgc cgcagtcact ctcgttccca ggctccagcc tgggaatgca 20701 attcgctgtg ggctgctctg ccttcagtgg ctcaagaggc ggagcctaag cgtgctgcat 20761 tccttgccag agacaaggaa cgagacatat tttgagttac ttagacccac tgcctcttgg 20821 caaaggttac cgaatatgat atgtacttaa ttccgaccga tatatgtgcc tttgtagtgt 20881 aaaattaaat ttaactaagc aaaaaaaatc attggactgg agaagaatta atgactgttt 20941 cgccagcgac tacatcccaa accgccgctt ttgacctaga ccaactcaat cagaaatttg 21001 aaaccgctca tccaagagat atactggcgt ggtctataga aaatatttca actggactcg 21061 tgcaaaccag cgcctttaac gtagatgaca tcatcatcac gcatattctc tacgttgacc 21121 tcaagcaccc agtcccagtg atcttcctcg acacgctgta ccacttccgc gaaactctag 21181 aactggttgc gaaagtcaaa gacacttaca atttagactt gaaagtttac aaaaccccag 21241 atgtagacac tcgcgaagcc tttgaagcaa aatatggcga agcactttgg gatacagata 21301 ttgctaaatt ccacgaagtc accaaaatag aaccattgca acgaggtata gccgaactca 21361 ataccgtcgc ttggattacc ggacgtcgtc gcgatcaagc tgtcacccgt gcgaatatgc 21421 ccatatttga acttgacagc aacaaccgcc taaaaatcaa tcctctcgca aactggacac 21481 gtaaacaaag ctgggagtat gtcgctgaac atggcgtcat ctacaacccc ctacacgacc 
21541 aaggttatcc cagcattggc gatgaaccca tcacaacccg agtcggtgaa ggcgaagatg 21601 aacgcgctgg acgttggcgg ggaactggta agactgagtg tggaattcat atctaacaag 21661 atgggaagat gaggtgatgg ggaaaagaag aatttcccct aactcttcct cctcgccctc 21721 tgttttgcta cgagaaccct atcttgttat ttcgacataa ccgcaaggtt ttaagccacc 21781 gagaatatgc acaatggttc ctgtttgtgt atcgttatcg tacaacaaac ctttatcgct 21841 cagagtttct acggctgttt ttggctcgaa atattttaat atccgtattt atctttacat 21901 agtttgtttt tttcaccctg acttacttaa cttaatccta gtgttccatc gctaaatatc 21961 aagcaccacc actaaactcc ccggttgaga aatacaaatc aatactgaac cttgatcaat 22021 tgcagcggtt ggttcttcct gataagcaac ctcacctgcg ttaattttac acatacaagt 22081 cccacaaata cccgcacgac aactgaaggg aggattaata tcattcgctt ccgcaaattc 22141 gagaatgctg ccatctccct gcttccaatt cagggttttg ccagattttg caaagacaat 22201 ctctgcctct tgcacttcct cacctatagt tgcagcaggg ggttgggttt cggacgcaac 22261 tttcatcggt ttaccgaaag attcaaagaa gactctactg tcaggtactc ccgattcctt 22321 gagtccttgc ataatagact gcataaagga tggagaacca cacaggaaat actctgcctg 22381 ttgcccaact aactgtttaa tgagggcagc atcgacataa ccaacactgt ggtagtgtcc 22441 ctcatcttca ggagtgggac ggctgtagcg aaaatgcaca tgcaagttag gattttgttg 22501 cgctaaaccc gtaacctctt ctcggaaggc gtgaaatcta ccatctctgg caccatgtac 22561 aaaccaaatc ggacggttag ggttgagacg agttgcagct ttcgccatac tcatcatggg 22621 agtaatacca acgccattgc taatcagcac tgcggggatg gactttttaa catctaaaac 22681 aaatttgccg tttggtggtt tcgccggaat gatcgaacca gagtgaatgc gatcgtgcat 22741 aaagttagac gctacaccag gcggtacatc taaacctttg ggagtaggtt cacgcttaat 22801 agatagacga taatattcac ttaagtcaga gtaatcagaa agcgagtagg tgcgaatcac 22861 aggtttattc tgtccgggaa tgtccagctt gattgttaag aattgccccg gttgaaagtt 22921 aggaatttcg cccttatcct ctggttgcaa gtagaaagaa gtaatttcct cgctctcttt 22981 cactttacga acaactacaa aatttcgcca atctttccaa atcttgctgc tgggttttac 23041 ttctgtagat actgatgttt cctgacttgc ctgactcgtc atgaataagc caaagatagc 23101 accgcaacct gttcccaaga gtgaggaata aactccaaca ccaaaagcag atttatcttt 23161 ggaattagtt agcccaatag ccacaccaga aacgatggta 
actactgaaa atgcaatact 23221 tgcagccgtc gcacttctga gaatgggatt tttaattcgt cgaatatttt ctaacattcc 23281 ctgtctcctg ttcatctgta gacattaact aatacgtaac agttgaaata aagcaatcat 23341 gtaaaataga caaaaagctg gctggtcaga ttttgctaac tgaacgatac aaaatgagtt 23401 tcagcagaca ggacttctct tgcacagcta attattacca gagatgtttg acttggtagt 23461 aaaaatattt aaatttcatg atgataaatt tctcgatgca acatcaatta gcggtgatac 23521 ttttgacttg aactttaaat tgattgttgc ttgaagaatt agtctacaga ttatgattat 23581 ctaaggattt ccgtgtataa accttagatt gtatctagaa agctcttatg actttctcag 23641 ttaattctaa cttttcattt gctatgtata aaataatgct tttcccgatg caacaccgcc 23701 gccgtattta caagaagcct ttgctggtca atgtttgggt tgacagctag ttatcgcctc 23761 ttgcaataac tcactgtcca catcccacca ttgttgtaac tcaatcaaaa ctggataaag 23821 tcttgtatct tctatttctt ttgctgcttc aacagataat ctgcccacac agccacagga 23881 gagttctgct attaatggat taatgactct ctcatctttt ctaatcgcta aacccaacaa 23941 tgcttctcca cggatttcag cgatggtgtc ttcttcccct atctctagaa tcacacgttg 24001 aaataaagca tctcgaatgg cttgcgtatt agtttctatt tgataaccca aactaaaagt 24061 tgcccaattc ctgatatctt catcctcatc acaagataaa tcaatcagtg cctgaattgc 24121 caattcctct tcttgacaca gcaagccaaa aacaacgccc atacggacat ctgcatctct 24181 gtggcttttt aacttgacca atggtacaac acctctcgaa tcccctaaat gtccgaaagc 24241 atatccaata gaagaaagaa catttgggtt ttcttcactc ttacataaat ttaaaagaat 24301 ttccccacat tcttttggat aacttctttc tggtattcct aaatatccta atatatcaac 24361 tcctaaactt ctttcttttg gattttgact ttcacacaat cgtgatgcag cttcaaattc 24421 ctcatcactc ccacgcgctc gcaaaatcca cactaaatcc caataagtct cattatcttc 24481 ttcagtgagt gctaatttga ctaactcttt tgtacttcgt ggatcatttc ttaagttttc 24541 aaaataatta ttatttacgc tcatatcaga ttttattccc ctagatatca tcagtactta 24601 tcggcttaaa ggtaaactgt tcggtaagcg ttctaatttg ataggcagat tggttatcat 24661 taggacttac gcacaagtta cgaaagaaca agactgtagt tacgacttac gtcgcaatga 24721 cgcaaccacg ttatttttac gtaagtccag atcatttttc aataataaag tattagacaa 24781 atcaccatag caaaggacaa ctatgattaa aatcctccac ctctccgaca tccacatggg 24841 aagcggcttt tcccacggac 
gaattaatcc tgaaacagga attaacacac gtttgagtga 24901 ttttgtcaat agcctatctc gatgtattga ccgagcgtta gcagaaccag tggatttggt 24961 tgtatttggc ggtgatgctt tccccgatgc gacaccagcg ccctatgtac aagaagcctt 25021 tgctggtcag tttcgccgcc ttgtcgatgc aaacatccct actgtactgt tggtgggaaa 25081 ccacgaccaa cacacccaag gacaaggagg agcaagtcta tgtatttacc gtgctttagg 25141 agtcccagga gttgtcgtag gtgatacctt aaaaactcat cgcatccaaa cccgcaatgg 25201 aagtgtgcag gttattaccc ttccttggct aacccgttcc acgctgatga ctcgccagga 25261 aaccgaaggt gcatctgtgg cggaagtcaa ccaactgctg actgaacgtc tgcgggttgt 25321 tctagaagga gaaactcgtc gccttgaccc caatgtgcca actgttcttt tgggtcactt 25381 gatgatggat aatgcaaatt tgggagctga gcgttttcta gcagttggta aaggctttac 25441 acttccctta tctttgctga cgcgaccttg ttttgattac gtagcgctag gacacgtcca 25501 tcgccatcaa aacctgaata aatctaataa tcctcccgtg atttatccag ggagtattga 25561 gcgggtagat tttagcgaag aaaaggagga caagggctac gttatggtag aactggagaa 25621 aggtcgtgtt caatgggaat tttgtccttt gccagttcgg gctttccaca caattgaggt 25681 agatctctct aaagctgaag atccgcaggc ggcgatcatg aaagccttag ccaagcgtga 25741 tatccaagat gctgttgtgc ggctcattta caaacttcgc tcagagcagt tggatttaat 25801 tgatagcgcc tcgctgcata ctgctttaag ttcagcgcat acttatacca ttcaaccaga 25861 attggctagt cagttagctc gaccccgtgt tccagaatta agtgcgagta atagcatcga 25921 cccgatagaa gcgctaagaa cttatttgaa taaccgggac gacctcaaag acatagcagc 25981 atcaatgctg gaagcggcgc acaatttact agcagatgat gtggaaatat gtctagaatc 26041 agcaactcag gaataagaat gtcaaaatga tctgtcggta gaaggtagag atgctcagct 26101 acgtctcctg taaaatttaa ggttaattgc taattaatct ggcaaaattg caatattctc 26161 aactttgtgg tgaatttttc ttaccacaga gtggtgacac aaaaaaaata gttgccaggt 26221 ggaccctata tgtccttcta ccctcagata aaacagtttc tactcggtaa aacattacct 26281 acaagcgctc acgctgaaga acgattgagt aatgcagcgg ctttagcagt gctttcatcg 26341 gatgcgcttt cctcggttgc ttacgccaca gaagaaattc tgctggtttt agtggtagcg 26401 ggaagtagcg ctcttggttt gtctttgttt attgctatag caattatcat cctactagca 26461 atagtcgtgc tttcttatcg acaaactatt cgagcttatc cccaaggcgg tggttcctat 26521 
attgttgcta gggaaaactt gggtctatac ccaggactgg tggcaggagg ttctctgatg 26581 attgactata ttctaacagt caccgtcagt atatctgcgg gtacagccgc ccttacctca 26641 gcatttccag tcctacaacc cttcacagtc agcctttgct tgatttttat tgtcttgttg 26701 acgctggcaa atcttcgagg tgtgaaggaa tcaggtaata tattcatgat tcctacttat 26761 gcctttattg ctagcatttt tgtactcatt gtccttggtt tgtttaaaca agcgacagga 26821 caagtaccga cagaatatcc caacatacct gtcaaagaag gactgagttt attcttcatt 26881 ttgagagctt tttctgctgg ttgtactgca ctcacaggag tggaagcaat atctgatggt 26941 gttctggctt ttaagcaacc agagtggaaa aatgcccgcc tcacgttgct ctacttgggc 27001 attattctag gttttatgtt tgtaggaatt acctacttat ccaatgtata tcatattgtt 27061 cctgaagagg ggcaaacagt ggtttctcag ttaggtaggc tcattgttgg aacaggacca 27121 ttttatttct ttgtccaagt tgtcacgctg ttaattttac ttttagcagc aaataccagc 27181 tttgctgatt ttcccagact ttgttacttt ttggcacggg acggattttt gccgagacaa 27241 ttgtcgctgt tgggcgatcg cctagtctac tccaatggca ttattctcct cagcgtctgt 27301 gcggctatct tagtgattat cttcaaagga agtgtaaatg ctgttattcc cctgtacgcg 27361 gtgggggtat tcacttcatt taccctatca caagctggaa tggtacgccg ctggttccat 27421 gaacgcacac caggttggca agctagtgct ttaatgaatg gtttgggagc tattgcgaca 27481 gttgtagttc taggggtgat catatcaacg aagttcctgg gtggagcgtg gttagtagtg 27541 gtagcaattc ctgtggttgt cagtatattt ctagccattc accgccacta tcaatacgtc 27601 gcccaacgtc tcagtattga aggattacca cctcgcagtt atactcctag agtcaagccg 27661 gaagttatca ctcatcctgc tgtggttgta gtaggacaac tcaatcgggg aactgtagag 27721 gctttggact acgcacgcac gattgctgat gaaattgttg caattcacgt agatcttagt 27781 tctaccgagg gagaaaagct gcaagaacaa tggcgacaat tagagtcaga tattcctttg 27841 gtgattatcg agtcacccta ccgttctgtg atatctccta ttgtcgaatt tgttggtgag 27901 tttgaagacc gttatcacga cacttacaca actgttatta ttcctgcttt tgtgactcgt 27961 aattggtggg aaggtctttt gcacaaccaa acaactttgt ttttaaaaac ggctttacgt 28021 gctcaaaaga gtcgagttat cacaactgtt aggtattact tgtaatagtt ccgttttacg 28081 gtaaatttac cccgaagccc ggtttttcca aaagccgggt tttgacattt ttaggtttac 28141 gcaagagttt cactcccctg aggtaagccc agagggtatg cctaccgcac 
gctgcatcgg 28201 cgcaaccctc gccatcttcc ttaaacttat ccttaaattt aaagttttaa tgagacttta 28261 tttcattaat tttgatttta gtgtaagctc ttttctacag ttgttaaata aaattctcaa 28321 caaacttcag acaggcgtaa aatgacaact tctacacatc cgttgttggt gcagcaagac 28381 ttgaccaact caccttggtg ggctggtaac tttcgtctca ccaatctatc aggtaaatta 28441 ctgggcgctc atgttgccca tgctggattg attttgttat gggctaagga aatgacgctg 28501 tttgaggtgt ctcacttcaa tcctcatcag ccaatgtatg agcaatagcg aacagttaat 28561 tgctaacttc ttttattact gttagctatt cacaagcttc acgggaaatt gagaactcta 28621 attacttaca ggataaatga atgtctaaac aaccagatgc catttggttg aacacaagcc 28681 ctagtttgtt atgctttgct caaccattgt tgcgtgaatt gtcacgttct gtaactattg 28741 cccagtggga atatactcaa actcaggatg aggcgagttc tttggatgtt gcaacactgc 28801 tggtagatga ttatcttcaa tcaatcaagc aaccagtgca tttgattggt catagcacgg 28861 gaggattact agcactgcta tatacacgtc gatatccaga aaaagttaaa tctttaaccc 28921 tgttagctgt gggtgctgat gcagcccttg actggcaagc tcattactat aaccatcgcg 28981 tatccttgac tcgggaaaaa attctcaatg caatggtgta taacctattt ggctatcaga 29041 atgaacacgc tgtcaagcgt ctggagaatc ttttggaacg agatttagat tgctcgcttt 29101 cacctcactc cctgtttaaa cgactaagca tggttcccag tccagtccct gcgcctttaa 29161 tggtttgtgg tagcactaac gatatcattg tagaacctga cgctctggaa ggatggcgac 29221 cgtttttgat ggagggcgat cgctattggg aatgtcaaaa aggacggcat ttctttcact 29281 tcttccagcc gactctcgta gcagaacaga ttctcgactt ctggcaatct ctataccaaa 29341 tctctaaaaa accactacag atcagaaaat aactctagag gtgtccataa agacgctcac 29401 gttgcaccaa agttatcctg tagccaagcg atccgcagtt tgcgcgattg ggcaaaaccc 29461 taagttcagc cgcaaggcaa cttagggtag tttattaaga attgtctctg tctatgatat 29521 tggtaagtta cactcttttc agatgttggc atagagcttg atcaatattt catatctatt 29581 acaatgagca gtttctgtaa agcttgtcat ttaaggtagg aaagattgat agttcagaga 29641 atttcaggtg gacatcgtgt atgagagtca aataccaaga tcaacgaaaa caaacagctc 29701 taattacagg cgcagcgagt ggtattggtt acgaattagc acaccttttt gcccgtgata 29761 cttacaatct tgtcttagta gataaaaatg gacaaaaact ctccgaaatg gcggagacat 29821 ttccacaaaa gttcggcatt tatgtgacaa 
aaattgttaa ggatttatct atttcaaccg 29881 ctcctgaaga aatttttaca gacgtgcaac aagcatctat aaaggttgat gtgctggtga 29941 ataatgctgg atttggtaac tatggcttat ttcacgaaac caacctcact gcagaactgg 30001 atatgctaca ggtcaatttg gtatctctga cccatttaac gaaactattc ctaaaggata 30061 tggtcaatca aggcaacgga acaatcttaa atgttgcctc aacagctgca tttcaaccag 30121 gacctttaat ggcagtttat tccgctacta aaagttatat tttattcttt tcagaagcgc 30181 tagctaatga gttaaaggat acaggtgtca ctgtgactgt tctttgtcca ggaccgacag 30241 aatctgcttt tcataaaata accgggatgg ctgactctga actgctcaaa aacaagaaga 30301 tgatgagtgc agaaactgta gccaaaattg gctatagtgg tttattggcg aaggcaactg 30361 ttgtgattcc gggtgtgaaa aataagatac tggctgaatt agtaagattt gctcccagaa 30421 agctagtgac aaaagtagta agaagtatgc atgagggaaa aattaagtaa tctaatttaa 30481 atcttatttt tcttcaaaaa gatatttgtg gtactatagc agtcctaaat ttgtgaacaa 30541 caagattccc gatttctaaa acaagtcggg aatctgagca tgcgcgactt atgcgctacg 30601 cgcaggctat gccaacacaa atcaaatagg attgctattt aaaacgtggg gataggatgt 30661 cactgtagtt acttgctaac tggtacttta ttttcacgca agtccctgac tcgacaaatg 30721 tgctgttaac caagccttac tcgatagtct tcataagaaa actgccgcaa gatctcgaat 30781 gtgccgtttt tcttaagtat accgattgat ggatgaatca ctccgttgaa cattgtcgtt 30841 tttaccatcg tatagtgcat catatcctca aaaatgatgc gatcgctaac tgacaagtcg 30901 taacagaagg cataatcacc aagaaaatct cctgctaaac aacttgagcc acccatgcgg 30961 tacgctttct ctcccttctg tggttcatgg gctccccgaa taacaggctt ataaggcatt 31021 tctaaacaat caggcatatg agcggtaaac gatatatcca aaatgacatt cgtgatatca 31081 gtggtgggaa ttatatctcg cacagtactt agcaaaaacc ccgtttccca agcaactgca 31141 gaacctggtt ccagaatcag ttcaatatga gggtaacgat tgtgaaacgc cttgagaact 31201 ttaatcgcat gtgcaatatc atacccttga cgagtcatca agtgaccacc tccaagatta 31261 agccatttta gctgcggtag aaaatgcccg aaaaattgtt ctatgtgtat gagggttttt 31321 tccaactcat gggaatcgct ctcacataag ttgtgagata aaaaaccgtc tattccctgc 31381 ggtaacaaag taccaagatg ttctggagaa acaccaagac gtgtaaatgg ttgacaagga 31441 ttataaagcg cagtcttaac aggtgagtac tgggggttaa tccgaagccc tattgatggt 31501 ggagaaggcg 
atcgttcaat gagtggtcta aatcgttccc attgactcag agagttaaat 31561 gttatgtgag aagcatatgt cataattgca gacatatcct catcacgata ggcaggagaa 31621 tatacatgaa gctcccctcc aaattcttcc gcagctagtt gcgcttccca aagcgagctt 31681 gcagatgccc cgtgaagtgt ctttcgtatc agcggaaagc aatgaaatag ggagaaagct 31741 ttgagagcga gaaggactcg tacaggtgtt tcttgttgca ctcgttcaaa aactgcgagg 31801 ttttgctcaa gacgtgcttc ttccagtacg aaacaaggtg atgggatcgt tgccaaaact 31861 gagtttctag tgtccgtcat ccttagttat cctcgaatgg agatggctga tcaatcactt 31921 catgccaggg aagaccaaga ggtcctaaca gttccataaa agggttagga tcatactgct 31981 cgacattaaa cactcccggc tgtttccaga gtccattaac caccattaat gaaccgatca 32041 ccgctggaac acctgtggtg taagagattg cctgtgcatt aacctcttga tatgcctgag 32101 tgtgctcaca gttattgtat atatagtagc tgcgctcttt tccattttta accccacgaa 32161 tatgacaacc aatggatgtt tgtcctgtat aattctctgc taaggatgaa ggctccggaa 32221 ggagagcttt gagaaacttc aacggcacaa ttttagttcc ctcataatca attggatcaa 32281 tacgtgtcaa cccaagattt tgaagtactt gaagatggtt tatgtagctt tgtgaaaacg 32341 tcatccaaaa acgtgctctc ttaatagttg gaatatgctt gactagagat tcaagctctt 32401 cgtgataaag aagatatgac tcttttatac caatttctgg ataagagatt ggacgatgaa 32461 tcgaaagaga agggacttct atccatgtgc cattctcata atacttgcca ggttgcgtta 32521 tttcacggat gttgatctct gggttaaagt ttgtagcaaa tacctgtcca tgatttccag 32581 cgttgcaatc aacaatatcc agatagtgaa tttcatcaaa ataatgtttg agcgcatagg 32641 ctgtaaaaat tcccgtcact ccaggatcaa acccacatcc gagaatagcg gttattcccg 32701 cgtttttgta acgctcgtga tatttccact gccaactgta ttcaaatttt gcctcgtcag 32761 gaggttcgta gtttgcggta tcgagatagt gaactcctac ttcaagacac gcatccatta 32821 gtgctaaatc ctgataagga agtgccacat taatgagaat atcaggacga aactctttaa 32881 gaagcttgac tgtatcatta acattatctg catcaagttc agctatcttg acttttggtg 32941 acgagacact ttcagcaata cgatggcact tgtcaattgt ccgacttgct agaagaatgt 33001 gtgaaaactc ttctcgtgca gcacatttgt gtacaacaac attccctacc ccacctgctc 33061 caactatcat cactcgtgcc atactgctgc tccttgtcta actgtgtcta ttgctttata 33121 tcttattaac acttctgact tctaaatttt tcaaatctaa tcggatcacc tattttattt 
33181 tccggtctga tgatatagga ttcctttttg atttttgttc agctaggtac agtttatatt 33241 ccctgttccc tgttaagcgt tccctgttaa gcgttccctg ttaagcgttc cctaccttaa 33301 ctagtacatt cataaacaaa accggattcc tatatctcct cattaatttc gggtaggtaa 33361 tgattttagc tgctctctgg agtaataatc aagttgtaaa aaatcaagat ttgagctttt 33421 tagaggttaa aaatctcgtt ttctgggtag tcaacaaact tcttaatgat ttcctggaag 33481 tagcaaaata gatcctcgat ggtctctgtt ttgaacgaat cggtactaca ataccctgca 33541 atttggatta atttagaatc tgcaccatat gaagaaacat gaaattctag aagatcatcc 33601 tcattccata gttcattttc acacggcact gataaaattt ctagttgagg gactcactag 33661 aacaaggtct tgtatgccta attatggaag gaaccatcgc aatagaggca gtaggtctac 33721 ctttgagtat tttattgaat tccaccaacg gcagttgctg attattgttg gcttcactaa 33781 ctgtttgatt gacttgtttt aaaagggtaa ttcctgtctg atgaccatcc agttgagttc 33841 gtaaaaatac ctcattcata aagcatccaa gtatttttta tatggttggt gtatttctgt 33901 tagcaactgt agtcaaaact agaatgtagg attgtccact ccatttgaat aagagtattt 33961 taaaagcagt caaaatgatc gtgaagatac taactccttg gaaactatcg agagttttta 34021 atttctgttg ttatatctga atcttgatta aaaacattga tctgatggtg tgaatctaag 34081 ggtgctttgt tcatattaac tgctagaaaa tgacggtaat aacgatttaa aaataggtat 34141 aataagtaag tcggcgggaa aatttataag tatgtaacga aaatttaact agaagaatta 34201 gagttaaatt tcatacgttg tatcctgtag tttccttttt ctcctctagc ttgaacacca 34261 tcaattaccg cgtaggcaag gtctaactca tcatcaaaca ttcgtcctgc tagttcatcc 34321 tttttaaggt gttgccactc taattcaatt gggttcattt cagaacaata tttgggcaaa 34381 aagaagatat ataaacccat ctgttcccac tttgaccaca tctgttggac ttctttgcac 34441 cgatgtattg cgccattgtc ttggacgatt accctaatgc gtccgacatc agaggcttca 34501 cgggcttctt tctccatcat ctggatataa gacttgcgat caacaccccc aatcaccaaa 34561 ccgtaaacaa aactgattag aggttgaaaa aacccgataa tgcttagtct tcgaccccga 34621 cgctttgtct gttctaagcg tttttgctca ccccgttggt aatagctgta acttggctca 34681 ctccatgcac aaaaccctga ttcatctaaa tacttgaggt ctatttctcc tgtagcagca 34741 gctaattcca acatctctag gtcagcttgc ttctgttctc gtactacagg gtcttgtttt 34801 cctttatgac tctttctaac tcgcttccat atgaccccct 
tttttttagc acccgtctta 34861 acctgtcggg acttaattta atagagcgtt aagcgtagct ctgccgtagg caatcgcttt 34921 gtaatttttg agctaattga acactgttgt atgtacgtgg ttcctttttc aggcattgtt 34981 ccaaaaaaac tatgtctgac tccttgcact tgggttttcc ccctcgacct ggtaactccc 35041 aaaggctctc tagacctaac ttttgccatt tatgtaaaac ctctcttact gtttgaggag 35101 tccaattaaa atgggcagct attttttcaa cgtaccaacc atgtgcattt agcctaacga 35161 cctctgctcg gtctttaact ttctgtggta catccgcaga tcttagattg aacagggttt 35221 tgtcttgttc acgggttagg aataccctga tacgagcacc catatcttac ttacctcagt 35281 agatacattc tccgtattta tttatcttta catagtttgg ttttttcacg ccgttttact 35341 tacagcactt aataacttaa ccatctatag ctgttctcga ttaagtgaat gccagatgat 35401 aaagtatgaa cagttagtta aagatgtatt gaaatcatac tgaatttttg ctaattctct 35461 gcctgattgc tctcaaaggc tgtaaaaata tattggtcac actattaaat aaatttaatt 35521 attcattttg taaaccaaag gtcagttttt catagttgct tggcacgtat aaaaaaccaa 35581 acatccagtc ccaaaatgcg aaaatataac caaagttttt ctcataatgt ttggggtcta 35641 cactatttaa aacatccttt gaaacactaa tttacccata ataagactct tttttgtagc 35701 aaataatcct gttttctgtg gtcgttagta tagtaatacg accataatta agtttgcaag 35761 gacagttact cccttcccgc cagaagaata cctaatttaa cgattagcaa taagacgagc 35821 cagtgcgttg cgggggtttc ccccgttgta gcacctggcg tgccaaggac gccaagaaaa 35881 ccaaaaagaa gataggtaat cttgcaccgc gaagggagta agacaagttt tggatgcctg 35941 gctgaaaact caccgagcaa taggcacacg gtactccata cctgcctgag tacaggtaca 36001 gtgtaggtag ttcaaattcc tcatctctat agtgttagac caattcacac ctgaattttc 36061 ccgtcctcat cggcgactac ctaaccaaag gtggcaaatt tatcctcaaa aaacagaatt 36121 agcgcaacag atggcgcaaa caacgagttt atcgccctta atcaaccaat tgttgataaa 36181 tcggggtatt gaaacgccag aacaagcaca agttttttta aattcaggaa acctgatatt 36241 accttcgcct ttggaagact ttcctgattt gggaatgagt ctggagttat tgcacaatgc 36301 gatcgcccct tgggcgggtc cccttgggac catcgcgcca agcgcagcgc cagaggcgat 36361 cgcctctcaa gagaaaatcg ccatctgcgg tgactacgat gctgacggaa tgacaagcac 36421 tgctttactt ttgcgtagtc tccgctggct gggtgctcaa gtcgattatg ctatccccag 36481 tcgaatgcat gaaggctacg 
gtatcaataa acgcattgtt gaagaattcc atagcgaagg 36541 tgtaggatta attctgactg tagacaatgg catctcagct tttgaaccaa ttgagcgagc 36601 cagagaactt ggtctaaaag ttattatcac cgaccatcat gatatccctc aacaattacc 36661 tccagcacac gccattctca atcccaaact gatacgagaa tcttctcctt accgaggtat 36721 tgctggtgtt ggtgtcgcct acattttggc tgtgtccctt gctcaaaaat tgggacaagc 36781 taatgatgac atcttaactt cgatgctgga actttttaca ctgggaacga ttgcagattt 36841 agctcccttg actggcgtaa accgccactg ggtaaaaagt ggtttgcagc acttacccaa 36901 atccaagcta cctggagtgc gagcgcttat tcagatgtct ggggtgcact tgagtggagg 36961 acaaggggac gaatcaattc aaaattcaaa atcgcgtagc gttgatcgcg aagcgacgca 37021 atctcaaaat tcaaaatcgt taaagccgga agatattggc tttcgcttgg gaccgcgaat 37081 taatgcaatt ggtcggattg ctgatccgca gattgtgatc gaattgctga cgactgacga 37141 tatggggata gcgctggaaa aggcgatgca gtgcgaagca attaaccgtc agcgtcagga 37201 aatgtgcgag caaattgaga atgaggcaag agaaattgta gaaatagaat atctcccctc 37261 tctacaggaa gaccgcgtgt tggttattgt acaacctgac tggcaccacg gtgtgattgg 37321 tattgttgct tctcgcttgg tggaacgcta cggtgtcccg gtctttattg gcacatatga 37381 ggatgagggg cagattcgcg gttcagcgcg tggaattcca gagttccatg tgtttgacgc 37441 tttagaatat tctcaggatt tgcttggaaa atttggcgga cacaaagctg ctgggggctt 37501 ttctttacca tcagagaatt tagaagcatt gcgatcgcgt ctgagtgagt ttgcaaatca 37561 gtgtctagaa cctcagcacc taaaaccact tttgaagatt gacacatcgg ctttactcaa 37621 tcaaattcat catcagtttt atcaacagct taatgtatta gagccgtgcg gaatcgacaa 37681 cccagatccc gtattttgga catctaatgt tcgagtcatt gagcaacaaa tagtgggcaa 37741 gggtcatatc aagctgacca taacacaaac tattgatgat tcggaataca aaattaaagc 37801 gatcgcctgg cgctggcgtg agtactttcc tctaccgccg cgactggata tcgcttacaa 37861 actacgagaa aaccacttta atggcaacac tactatcgag ttggagttac ttggtgttag 37921 gcttccaacc cagtctcaca tttttttcgc tgcacactca gcgcctttaa gaaccacttt 37981 tgagtacaat caatgccact acacgtgtgg gatttataaa aacggttctg taccagaatt 38041 gagaatcgga aatcctgacg gcaaaattct agccgttccg ctaggacaca gcatcggttt 38101 actgggaagc agtcgtcaac aggctataca agttgatgtt tctcaacccc aatacaacca 38161 
gattcttcaa actgctcttc aagctttatc aatgctgagc aatcagtact aagtactgag 38221 gcatttccac ttcactcctg gtactctgtt ctcaacactc ggattatagg tcgccgttgc 38281 gaaatgccag tacaaaaatg actgcagggc cagctaacat aatgagtgca acaaaaagta 38341 gttgaaaaat gacttcccaa tgaatattgc ctaaaccact gaatatattg aatatagcgt 38401 ccatttgtcc tcctttaaaa tagctatgtc attatcaaac ccgcaataga tcatatccgg 38461 cgatcgccca tcgcctttaa tttacttaac aaaagtataa aaaaaactat aaagactgaa 38521 aacaatagca cgttcacaat aggtttttcg tcatggcaac ttggcaatgt gttaagcaat 38581 gtggagcctg ctgtcatctt gatccagcag atcgtcctga cttacatgag tatttattac 38641 cagaagaact agaactgtat ctgagcatgg ttggtgaagg aggatggtgc ataaatttcg 38701 acaaagactc gcgagaatgt cgcatttacc cagatcgtcc gcgattttgc cgtgtagaat 38761 cagaaatctt tcaggatatg tacggtattg agccggagga attagacgat tttgcgattg 38821 actgttgtcg tcagcagata gagggggtat atggggacag aagcttggag atgttgcgct 38881 ttgatcaagc agtaggtatt tgacgaaatt gggaaaaatc tttctctctc aagaaataca 38941 gggaacaggg aatagggaac agggaacagc tcgtagggtg ggtattgcga acaaaagcct 39001 atgtgataga cgagatatta gttcgcaatg cccaccagat ctcaaaaccg gtgtaaacgc 39061 ctcccgtacc gggtaggtgt aaggtttctt gcggaattag ctaaattctg ggttgctatt 39121 tgtaaaaaat cttgagaaaa tgataagaaa atgaaaacaa ctgacgagca aaacaatact 39181 acctgtgaaa cttgactcca agcccttgaa ccttgctaca tctgggaccc aaagccagct 39241 agagcaaaaa gatgaactag actgcaattg tgcaactacg ttagttgagg tgtcgtcacc 39301 tttggtttgt gataaaccga ctaagccaca aaccccagta gttatcttcg caactacatt 39361 tttgaccatc tttctggcgg aaattggtga caagacacag ctatccactc tgttaatgag 39421 tgcgcaatca caatctccct ggattgtgtt tttgggatct gggacagcgc tagtcatgac 39481 gagtctattg ggtgtcgtgt taggcagttg gatggctagc agactctccc caaaaactat 39541 agaaaaagca gcaggtataa cgctgctgct aatttcccta atgttatttt gggatttggg 39601 atttggtaat taataattct cagtcactaa taagcactct taactcttga ctcttgacta 39661 actatggatt ggcatctttt aggattaagc tttattacgg tttttttgtc agaattgggt 39721 gacaaaagcc agttagcggc aattgcgctt tcaggtcgta gtcaatctcc acgtgcggtg 39781 ttttttggta cagctggcgc actggttttg actagctttt tgggagcgtt 
agcgggggga 39841 gcggtgtcag agttattacc cacaaggttg ttgaaagcga tcgccgccgt gggatttgcc 39901 atactcgcaa tacgcctatt attttttaac aacgaggaat gaagtgtttt atgattgata 39961 attcatcatt cctcatcctg ttatcgtcgc caacaccagc tcaaaagtgt gccacctacc 40021 aatagtttta ctgcttccaa aatccagtaa ccaccatgca gtatattcat acttgctgga 40081 atttcagcag ttgtttcaaa taggttcagt tcgattccga tggcagacat ttgcggagtt 40141 aacaagtagg tgtcagccaa agctatggcg agcaatacca cggacaaaat aattgcaccg 40201 cgacgccact gagattggct gttgcacaaa gctaacacgc tggttaacac taagccagca 40261 gacagtaatt cgatacgatt aaaattccag aaaagcacat agcccgctga agcaaaattg 40321 gcttgagtca tcatgccaga aacgtaaagg ctaggcataa ttacccaatc caaaaacatg 40381 ctggcactga gccaaaagcc taaagttaat atggcaagac tttgccaaat cggtcgctta 40441 agttcgaggt tactaactgc attcataatt gtgaataatt gcgtgtgttt atatctctag 40501 tttctaactt aacaaagcac tatgacaaca gatgtcaatt ttcttttaca aatttgttat 40561 aactttgcaa agttatttag gtatttttta cttcagtaga ccgtcacaat tatttgtagg 40621 atgggtagag cgtaagcgtt acccatcaaa acgaatgtaa atgttgggtt tcgtccctca 40681 atccaaccta tatttaagtt ttacatttaa ttctgtctac ctatttagtt gcggttaatt 40741 attaaaagtt atgaagtagt tttaagataa ataaagaaaa agtttgagaa aaacatagaa 40801 cacagaaaag taaatcgtaa aatcaagtta tggggcaagt ttgtcacccg ttatagatcc 40861 accggatgca aaacaatgcg tatagcattt ttattcagta cagcaattgc ttgtacagca 40921 gtcataggcg gatacactgc ggacttgagt caagcggttc agttacgaga tggtaaagtc 40981 tattttgtcc aaccgccgag cctgctgagt gcagtgacga cctacaagga tacttatgta 41041 tggggcgcaa catactactt taccattagc ctgccagaaa atgctggaga accgctgcaa 41101 agaattacaa ttaaccagcg tgaaggagta gatagtgtgc gctttgactt gcgagacact 41161 tctgcttttg aaggaacgcc ttccaaggaa cgtcaaaagc ttgctttgaa agatgtgaca 41221 agggacaaca aaaccaaaac actatcgctc acatttgacc cgccagtacc tccaggtaga 41281 actattacac tagctctcaa gccagttcag aatccgacgg ttgctggtgt ctacctattt 41341 ggagtcacgg cgtttccgcc gggagagcaa cctcatggtc aatttcttgg ttatggaagg 41401 ttgcaatttt acaattttgg ttttcggttt tgaccaaagc taagggtgta agggtgtaag 41461 gggagccagt gcgttgggcg ggtttcccga 
cttgaagcac ctggcgttgt aggggtgtag 41521 gagtgtaggg aataaataag tggaggcttg tagccccact tattttagac caccaaaatc 41581 gtagatttgg tgagggcttg tacccttatt ggttccgggg cagggtgtac aaaaatatcg 41641 ttttttcttt ccccctacac ccttacaccc ccattcccct acacccttag ttttggtcaa 41701 gactcaagag ttcttttgcg tgcaagtctt ggctgataat ttcgcctttt cggtcaaaca 41761 atacagtacc aactttcagg gcaacattag catatttttg cacgtaagca gtagctctct 41821 gactaatttt ttcagctagt atcccaaata ctgattctgc cagtcctagt tcgattaatt 41881 ttttgtgtgc tgcatctgca gtcttagcat ctaaaattgc ccgtactgct tgtatatcac 41941 caccaacagc aacaactgct gcacttataa tttctaattt ggcatctgct aaatgactag 42001 acgtattgaa aatgccacca gctagcttaa ttagtttgcc atgatatccc agcagcaaaa 42061 cagaagttgc ttgatatagt ccagcttcta caagcattgc accaatccaa ttgccagttt 42121 gcacaactgc tgactctggt atgcctaaac gttgtgcaac ttgcatacca ttactaccaa 42181 tacaaaacac caaaccagga caaactttga ccttagcctg caacgacaaa cgaaattctt 42241 ctaaatagtc agccgctgac aagggttgag aaattccact ggttcccaac aaagctaatc 42301 cttctaaaat cccaaaggct tcattagagg tacgttgcgc taattgacga ccttctggaa 42361 gaataattga tacagttgcg gtttggtctg ggggaattag aggtagtaaa ttggtatcaa 42421 acaagcgacg agcataactg taaatcgctg gttctccaga ggttgttttt cctaaaccct 42481 ctcccgcttc taaaatcaaa gcttgggatt gcctttgtga caactttacc caagcccaaa 42541 taggcgtgtt ttttgttaaa tctaagttat cacctggatc gctgagtgtc acagccaaag 42601 cactctggga gtctaaagct gcaacttgag aaatcgaaat ttctgctgtt tcagaaagca 42661 aatcaatctc aactgagagt tgagaatcta ttttttcacg cagatgcata agagcagcct 42721 tagcagcggc tactgcaaac actggcaggg tgtaacctgt acgagccata tgattgttaa 42781 ataagttttt aaaaacatct tcgcctctac tacagggaga attaagcaaa gacaatcgac 42841 cttcgttact tagttgcaat tatatcactc ggatgctacc ggatttgata tcacgaatgc 42901 tcccaaacta cttttaaagg gcccagaaaa aacgaaatcc gccatcacat aaaattgtca 42961 ggcggattcg cagatatttt ttagtggatc cgagcagagt cgaactgctg tccagactgg 43021 gtattgaccc cccgctcgtt cacaggctta gcctttttta ccctcaaggc ggggaacgtt 43081 cattatcccg aacgctagga tgctctggtt aagtcttagc taacaagcga accagagaca 43141 cttgttagag 
catccgttgg gggttggtct ctaatcctta acggagtcaa actagagacg 43201 ctcgaaccga aaagaggtta ttaggcagct actacagctt gcttacgagc aaaaggtaca 43261 atgttgtttg catttacttt atgttgagcc ttggattttc gagagacgac tcactctcga 43321 cctgtttcac aaagtagctt tcgccaacct gtcgaaaccg ttacggaccc atgtgtttta 43381 cctatactac gattatagta cgccccactt gaaatgtgtt actttataga aaaaaatata 43441 gaaaaaattt gtgaattttt ataggggaga tagcgggagt cagagagtga gagcgaggaa 43501 aaattctcct tctctccctc cctctctcct attgtcccgt tttcctaacc tagcgcctca 43561 gcaccgctga caacttctag gatttgattg gtaattgcag cttgccgcgc tttgttgtaa 43621 gacaaagtga gggcattcat cagtgccttg gcgttgtcac tagcattgct cattgctgtc 43681 atccgcgccg caagttcgct tgcagcggat tcttgcagcg cccgtaagag ctggttactc 43741 aggtacagag gtaataaaga atcgagaatc tgtactggat cctgctcgaa aatcatgtca 43801 cgggagaaag tacgggtttg agcggtcact ttctctcgtg tgacttcaaa ttcaccacca 43861 cgggttgtta agcggaaaat ttcgtcatcc tgtgcttcca aaccttgagc gtcaagagga 43921 agcagggttt gcaccacagg acgagaactc accaaggaga cgaacttggt gtagacaagc 43981 tcaacacgat ctacgctttc tgacaagaac agagaaagca attgatcagc aattttgttg 44041 gcttcagcag cggtagggat ttgctctaaa ccagtatagg tagcgtctat tggctgttcg 44101 cgacgttgga aatactgggt tgctttgcgt cctacgatga caaatttgta gtctacgcct 44161 tctgctttga gttctttggc gcgattttca gcacgtttga tgacgttggt gttgtagccg 44221 ccgcacaaac cgcgatcgcc tgaaatcact aacagcccta ctgacttaac ttcccgtttt 44281 ttcaacagtg gtaagtttgc ttcttcaaaa cgcagacggg tttgtagacc gtacagaact 44341 tgcgccaagc ggtcggcgaa aggacgggtg gatagcacct gttcttgagc gcgacgcacc 44401 cgcgctgcag ccaccagccg catggcttct gtaatttttt tggtattttt gactgattga 44461 atgcgatcgc gtattgcttt gagatttggc ataaattttg tccttaatcg ttagtcctta 44521 gtcgttagtc aagagtcaat agccattaag ctctttacta atgaacggca ggtcgcaccg 44581 tcgggggaac ccccgcaacg cactgcctcc taatgactaa tgactgttga ctaattatgc 44641 tgttgccaag aaggtcttct tgtattcgtt aattgcctct ttcaagagtg cttcttcttc 44701 atcacccagt gctttcttgg attgtattcc ttgaacgtac tgaggtttgc tggtctttaa 44761 gtaatcgcgc agccctttgg tgaagtcagt aattttattg actgcaatgt catctaaata 
44821 cccgttgata ccagcataaa gaagagcgac ttgctcatac accgatagcg gcgcgttttg 44881 tggctgtttc aaaagttccc gcaagcgcgc accgcgtgcg agctgatctt gagttgcttt 44941 atctaagtca gaagcaaatt gtgagaacgc ttgcagttcg tcaaactgtg ccaattccag 45001 cttcaattta ccagcaactt tcttcattgc tttggtttga gcagccgaac ccacccgtga 45061 tacggaaata ccagggttaa tagcgggacg gataccagag ttgaacaagt cggaagacag 45121 gaagatctga ccgtctgtaa tagaaattac gttggtggga atataagcag atacgtcacc 45181 tgcttgagtt tcgatgattg gcagggctgt catgctacct ttacccaatt catcactcag 45241 tttagctgca cgttccaaca gacgtgagtg gatgtagaac acgtctccag gataagcttc 45301 ccgtccggga ggacgacgca gcagcaaaga catttgacga taagctgtgg cttgcttgga 45361 gaggtcatcg taaatcacca gggttgcctt acccttgtac atgaagtact cagcaatcga 45421 agcacctgta taaggagcca aaaattgcag ggtggctgga tcactggcgt tagcggcgac 45481 gacaacggtg tagtccagag cacctttttc ttgtaatgtc tgcacaacgt tagctacggt 45541 ggaagctttt tgaccgatag caacatagac gcagatcacg tcttcttctt tttggttgat 45601 gatggtgtct acggcgatcg cagtttttcc tgtctgacgg tcaccaataa tcaattcccg 45661 ctgtccacga ccgacgggaa tcatcgcgtc gatagctgtg ataccagttt gcatcggttc 45721 gtgtacggaa cgacgcgcca caatacctgg tgctggagat tcaatcaaac gggtttctgt 45781 ggtcttgatt tctcccttac catcaattgg acgacctaaa gcatcgacaa ctcgtccaac 45841 caaggcttct cccacgggta cttgagcaat tctaccagtg gcggttacgg agctaccttc 45901 ttgaatctca cgaccttcgc ccatgagcac cgcgcccacg ttgtcttctt ctaagttctg 45961 ggcgatgcca attgtaccgt cttcaaattc tagtagctcg ccagccatgg ccttgtcaag 46021 accataaata cgagcaatac cgtcacctac ttgcagaacg gttccaacgt tagcaacttt 46081 gacctgttgg tcgtactgtt ctatctgttg ctgaataatg ttgctgattt cgtctggtct 46141 gatgctaatt gccatttgtc tattttctct ttattgcgag acggaacgat tttacactca 46201 gcagttaatc agttatcagt tatcagttat cagttatcac tgttcactgt ttactgttta 46261 ctgttcactg atttaagagc cacttaaacg caaggagata cggcgcagct gaccccgtaa 46321 gctggcgtca atgacttgag agcctacctt aataatcaca ccaccaatta actcggggtc 46381 aatttttgta tctaattcga cctcgcgagc cttagtcagt gcgatgactt tatctctgac 46441 tgcttgttgc tgctcctctg tcaggggaac tgcagaaatc 
acttccgcca gagcaatttg 46501 tttcaattgc cgcaacagcg tcaaatattg ctgtaaaact ggttccagca atgaaatgcg 46561 ccgtctatct accagcagca ataaaaaatt gcggaagtat gcattagcgc cgtcaccaag 46621 gatacggttg atgactgcct ttttgtcatc tagccccaca aatgggttct caagaaaatt 46681 tctcagttgc tcactttcag aaagcaagct tagcaaggaa cgcacatctt ccccgaactg 46741 atctgttaaa tttttggatt gtgcgactga catcaacgcc tctgcgtaag gctgggatat 46801 ttccgccatt gcgatattac ttttcatccc taacctccca atagcgctat actgcggtca 46861 attaattgct gttgagcgtc gtcagcaatt cctgtctgca gttcagactc gactttttgc 46921 aatgccttag acactaaata ttgttgcagt tcgccgatcg cccgctctct ttgtgtgtct 46981 aaatccctag cagctgtttc tttcaaacgt tccacatctt ctgctgctcg tgtcaaaatt 47041 gcttcacgag ttgcctgtgc gttttcttca gctgctttac ggatgcgttg cgcttctgct 47101 tgagcctgcg tcaactgttc ctgtgcctta gacaaagcaa cagctgcgtc ctttgcctgc 47161 gcttctgctt ctttgattac cgtttcaata tttgagcgtc gttcgctgag gatattgcta 47221 agaactttac gcccaaagta aaataatatg ccaaccagaa tcgccaggtt aatcagattg 47281 gtttcaaaaa tgtctatgtt tagaccgaaa ccaccttctg aatgaccatt cgctaccgct 47341 tctgtggcaa gtaaaaaact acttccaatg atacccatct acaactgcgc tgctcccttg 47401 cttagtaggt tgaaaattcc ctaagggaaa aaagtgccac ttttgacact tatttaaccg 47461 ttgtgggacg acatgcgccc cgctacgtat atacttatcg aattgttgtg agatcgcgca 47521 ttgagccgct ctatttagcc aaaacaggtc ccaatagttt ttctagtatc tgcctgctca 47581 gagcatctac ctgttgatct aaggaacgta aagcttcctg cttttgctgt tctatttcaa 47641 gagccgcttg ttctcgttga gcttgagctt ccttttgcgc ctcagctatt ttctgagcag 47701 tcgttttttg agcttctgct tgagctgcag cgacagtcgc ttgtgactgt ctacgagctt 47761 ctgctaattg ttgctcatat tctttagtta atcgctcagc tttagccaag cgttcacgag 47821 cttcaaggtt attcgttcgg atatagttat cgcgatcgtc taatgccttg gtcagtggct 47881 tatagaaaat aacattcaat agagctgcca atagcaggaa ctgcactgcc atcaagggca 47941 aggttgcatc gaaatcaaac atttctctcc tctttggcta caacagcagt tttcacgggc 48001 gaaaaatcat cttacttttc atcatcgtcg agaccccata gccatcaagg gtcaactttt 48061 tttcacagtt atcaagtacc agccgcagtt atcagttatc agttaccagt tatcaatctt 48121 cactgtttac tgtttactgt 
tcactgttca ctgttcactg ttcactgatt aactcagcta 48181 tagagacgtg aacattcacg tctccaaaac taaggaaaag ttttaggttt taggcaaatg 48241 ggttagcaaa cagtagtact agggcaataa ccagaccata aatggtaagg gattccatga 48301 atgccaaggt taacagcaga gtaccgcgaa ttttaccttc tgcttctggt tgacgagcaa 48361 taccttctac tgcttgtcct gcagcattac cttgaccaat accaggacca attgcagcca 48421 aaccaattgc taacgcagca gcaagaacgg aagcagcaga aactaatgga tccatgttga 48481 ttttccttac cttgagtaca aaacgaacga tttttgtgtt tagactttcg attttagatt 48541 ttgcaacgaa agcgaaaatt cgggcatcca aaatctaact tgacatcaac tgagggttct 48601 aagcagtgat caagtaccag ccgcagttat cagttataac tctttattga tcactgttca 48661 ctgttcactg ttcactgttc actgatttta atgtgcttcc tcatgctctt cgccaccatg 48721 tccctccatc gcctcatgaa tgtatgccgc tgctaaggtg gcaaaaacca aggcttgaat 48781 ggcactggta aacaaaccta aagccatcac tggcagaggt acaaacagag gaaccaaaag 48841 aaccagcact cctacgacta attcatccgc caaaatgtta ccaaatagac ggaagcttag 48901 ggagaggggc ttggtgaaat cttctagaat cgcgatcggt aatagaaccg gcgtcggctc 48961 tatatacttt ttgaagtacc ccaaaccacg cttgctaaaa cctgcgtaaa agtaagctaa 49021 agaagtcaac aacgccagtg ctactgtcgt attaatatca ttggttggag cagccaactc 49081 tcccgatgga agtttgatga gcttccaggg aatcaaagca cccgaccagt tcgatatgaa 49141 gataaacaag aatagtgtgc caataaacgg cacccaagga cggtattctt tctctccaat 49201 ctggtttttt gtgaggtccc gaataaattc cagagcatat tccattaggt tttggatacc 49261 gctgggaatt ttttggatgt ttttagtagc agctagtgaa gctacgacca ggataccaat 49321 cacaaaccaa gacgtgagaa agacctgtcc atgcagcttt aaattgccta actgccagta 49381 gaaatgatga cctacctcta gttctccaag gggaacagaa gttagggcat tcagaaaatt 49441 caccattttc ctaacctaat ctatcatttc tccaatctta cctggagaag tggagcgggt 49501 tcgcggggtg gattttcctc ttggaaaagt tctgtcctcc cgcacagttt gagcgttgct 49561 gtatggttac cggaactttg acaagtcaga ggcaaatgct cctctaacta cgtaaatgag 49621 aagcgttgct ttataagtaa gaaatcccag aaatatgggc aaaatttcca gttgattcca 49681 tcgcgatgct agtagaatca cccctactag taacgcaaat cgagttttac tcaactgttt 49741 tttttctcca ctcaaacgct caacatcttt tgccaacatc ctcaagtaaa ccacacctgt 49801 
acatgctcca agcaaataat ttagggcaat gttcagggaa taaaatatcc aaacagagaa 49861 aaatataatc cctgtcaaga caagagtgat gagcaacaac tcttggtaga gttcatagaa 49921 ctcgcgcatc gagttgtctg gttctgtgtc ctcaaaacca ggtcttgaat cttgtcgtgt 49981 tgttggtgtg ggatcagttg attcgtttga caagctcacg ggacttgaaa ccagtacagc 50041 taattgtgtt gactgcatct tcagccagct gaatcatatc acgattgggt aacaatactt 50101 tagaaaaaaa caattaactt attgtcacca aacaaaagac atcaggtaga tgcaccaagc 50161 acaaaaagga taattgacac ggaaaatatg gatgaacagg acaaatcttt accaatccct 50221 ctatcctgaa attccgtgtt gacagctgtg gcgagacgag gagtcgcaat caacttgaaa 50281 ggtagatctc ctcttaattt ctattctacc ttctccctcg tcgcctgctt tgtttggatt 50341 ttggcagttt gtttggttat ctggcaatac agggaacagg gagcagggag cagggagcag 50401 agagcaggga gcagggagca gggaaaaaag tgtttctttt attcatagcg ggtggtacgc 50461 cgttatagtg cgcctgccca tattttgcac tcctcctgcc aacaaccgcc tggggtttaa 50521 accccagtct gatagtaaaa gtcgtcttta gacgactgca taagcctttg agtttacttt 50581 agtagacttg aactgtgagc ctcccaattg attctgaggc ggacgaggtt gtgttcttga 50641 aagtgttgca aaatgtcagt tttgtcaact tggtgtcaac aaaatcagct gctgcttcag 50701 gagcgttctc actccatcta aatccaatga tgagccagct aaaatctccc ctactgttaa 50761 caaaccatcg cagctttcta aaaacttcaa ctcttctggt gataaattca caatctggta 50821 atcgtagttg aagacacatt gactgggaaa accatccata caaggatttc gctcagcaat 50881 tgccgttaat aaatcattat ccgctgacca atcagccttt ggtaaaggtg gacgactcag 50941 aaaaaactcg tagtgagtca cttctggatc taataactct atcaaacgat aacgctcgat 51001 ctcgcctaac ttagctactc gttccattaa ctctggcgct ttgcccaaaa gtctgtctat 51061 ctgccaaaaa cccggatttg agaaaccaac aaaatctaat ccggaagcag aaattagttc 51121 aaacaacgtc tcaatgttat aatctatctc ctgcggatgc acgtacatat cagcaaagca 51181 ctcatcccgt tgattttcta aagaccaacg ttctcgttct cgcttgacaa tacggttatt 51241 ttctggtaat gaagcgaaga tttgccgacc aacttgaaca ccatcacggt aatcaccgcg 51301 cttgtcacct tggaggagtg cgatcgccct ttgcataagt tgaatttccc accgtcccaa 51361 ctccccatac acaaaaatgt gcatcaagcc acctggggct aacttctttg ccaaagcttg 51421 aataccgcga atcgggtcag gtaaatggtg caacacacca acgcagttga 
taaaatcaaa 51481 ctcacccgac aactgttcca catcatacaa actcaggtga tggaactcaa cacggtttgc 51541 accagaacgt ttacagcgtt cttgagcaac ttccagcgcc ccagcactaa gatcaatgcc 51601 cacaacagaa gccgtaggat taagatggac gaggtattca gtccccactc ccgtaccgca 51661 accagcatct aaaatacgga tatcttgctt ttggggtttt tgaccagtac agaaattata 51721 agcagctaac caattccagc gccagttgta accaggaggc ggttcatcta gtaaaggttc 51781 tggcgggaag gggtaggtgt tgtaaagttt ggcaacagca gcgctaatag tttgagaatc 51841 ggacatatac agcagaattc aggagtcagg agtcaggagt caggagtcag aagtaaaaag 51901 ggttttctgt ctgcctttta gacctcagta ctgtacttca tttacttgca atgtgcagta 51961 gcagttctca ggtgttatca aagtttacag atttttatac acctgattat catttctctt 52021 gccaactctg aagatatcaa caagcaaatt ttcatcatca atcgtataca gaattctgta 52081 ctctccttga tctacgcgat aaccaccttc ataacctttg agtgctttgt agtcttgagg 52141 tcgaggattg ctctggagcg agaaaatctt cgatacgacc tgtttaaact gtttcgcagg 52201 taaatcaatc aagtctttct cagcagtttt ggcaattctc aaatcataac gctcactcat 52261 ccccacgact agcaataatc gcttctagtg tggtaaagcc gtcattttgc tcaacagcac 52321 gacgcagttc agcagagtca atagcatctt ctatagcttc cagcaatctt aagtcttcta 52381 gaccaataat cgccccaaca gccttgccat gacgctgaat gagaatgcgc tcacccttat 52441 attcagcacg attaatgagt tcttgaaaat tagcacgagc ttcagttgca cttacagcag 52501 tcataccgcc tcgcttatgt tgattgaagt caatgagaaa tattgtacat tttttttaaa 52561 ttaggtaaat tgtacaattt gtataattat agcagtttct aattttgcat tagttttggg 52621 gc // LOCUS NODE_428_length_49332_cov_5.05473149332 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 49332) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 49332) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..49332 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 189..692 /locus_tag="DP116_02215" CDS 189..692 /locus_tag="DP116_02215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739898.1" /note="Derived by 
automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02215" /translation="MSFNLTAHSYVSPHPALTGTPLLTKERGEDNNKTFQENLEALDL EPIAQRLMYPEHGLGWSCEEVEQAIAHYKMFLNLLYLYPNSAIVPTREIDIVWHYHIL DTRKYAFDCEWLFGYFVHHNPNFDFGSEPDRLALERAFFDTKTLFAEHFGISLNKLQQ ASACADL" gene complement(792..3155) /locus_tag="DP116_02220" CDS complement(792..3155) /locus_tag="DP116_02220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase subunit sigma-24" /protein_id="PRJNA477356:DP116_02220" /translation="MLPRDFLAQMARNYELSKDQEEVFLLRYGEELNYYDIAKKLNTS EGACLKRMGQVYNKFKVSGSSRGKENRLRIFLTEQLQQGVTQKTSESTVPAKTPKELQ GESTVSPKVIIKPVTNTGTASPPIYENLPKRECTTFIGRRQELARLLELLSSEHSAHL ISVDGVGGVGKTTLVLETAHRCLEASQPGSVVQSFEGMTVPTFEAIIFTSAKQQYLKA LRILPRLKRERSLRDIYRTIAHTLERPEISYRSPDDQLDLIRDSLKRQRTLLIVDNLE TIEDKQELLSFVYDLPRTVKVILTTREQALFVPIRLESLHEEDALSLITHQAEEKGVE LQPIDAKVLCQNTHAIPAAIVYTVGQLAAGYSLQDVLTKLQSATGDVARFCFEASVER MRGKPSHRILMTLSLFPEPVERETIAEIAEEDSMNTGDGLAKLQQLSLVRQQDARYRL HPLTREYARAELTANLEFEEKVRSRWVEWYLDFSQKYGNIDEKKWYPEYSHIDLEWEN LQEVMKWCREREKERYEDILTLWRQIKGYAHVRGYWDDRLMWTDWLQGATERRGDWST LAEILYDKGWTLTLTRQPECLEEASTLLEHAWDLRNHKDITFQVELAARMVVLCIHQQ KFELAHEWIKIKQNMLKQSGVEELERQRQQIQTHYYQAKILFLTKDYVQAKKLYQRAL KQAKDIGWERAEIAIQNWLADVAIEEGNLEVAQQILEQGFPVAECNKDKHSIAFHQRS FAYLLQRQGNLEQARTWAQEAALSFESLKMIPEAREMRTLLQAIEDR" gene 3550..4089 /locus_tag="DP116_02225" CDS 3550..4089 /locus_tag="DP116_02225" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02225" /translation="MKRLLLTVLGLFTFIVCTALTIQLSKAAPATAGWSDYTTATAGA YCLAKHPYLTSYSGTTFTPDGFEIATAPGLVGAVRRNFVYDRSSSTTDQGNGQTCAQA CGEFGKFYSPSYKGKSLTQKVGQTTIASGLGDIASLAVQDKDFYLDKTVVAGIWARPS TYQEADVAQADFCCCQISN" gene 4122..4364 /locus_tag="DP116_02230" CDS 4122..4364 /locus_tag="DP116_02230" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02230" /translation="MIATHVGWVEARNPTPTMPVNVGFRSALTDTFFNGGNPRTEVSS PTYTSVAFCPQIGISLVVLWRSALRAAAFAQRPLRG" gene 4528..5442 /locus_tag="DP116_02235" CDS 4528..5442 /locus_tag="DP116_02235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007353250.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_02235" /translation="MPEEQTLVIDHLQKDSTLPVMPNPPLLTSQKSGWSSIHLGHFRQ PAWELPKFSSLQHIISIPIALHHTVSVEFVLEGHLHRIQYHPSDRIDGCVEIFPAGPC SKISWDKEIDFTHFYLEPTFVSQVAHEAIDPDHVELLFEPKKVDLLVYQICLALKADL DVDGSGNGFYADSMATALSAHLLRHYATRKHTLQEYEDGLPKHKLQQAFDYINEHLGE DLLLTEIAAQLHMSQYYFCRLFKQSTGMTPHRYLIQQRVERAKQLLRQPELTVTDVAM ACGFANQSHLAKHFRQHTGVTPKQFRKM" gene 6017..6910 /locus_tag="DP116_02240" CDS 6017..6910 /locus_tag="DP116_02240" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02240" /translation="MKRIRAVFVFLVVLVAGLRRKRVAVQKLMLRLKLNKGRHFIRGF TVITATLLLIGLLLTTESVLAFKPNEKGHLGITSDAVRPIERVVGGVRLRFSPQALEE IRKANESTDDLSSGDFFRSEKHFDNEEFFAGTTRLLELKGYIISQITSFSPNGKNART ALGGALHTIQDFYAHTNWVELGHTSIDTQLGRTTIPKPPITTPTSPANDQSTLLPGLT VLTSGYFVTPPYCSAPPGKTRHGVSGLCTNGLNKDEPGRPGYPQARGLAVIASQDFIN QILDTPGVAGNDRAIRTLMGI" gene 7176..12695 /locus_tag="DP116_02245" CDS 7176..12695 /locus_tag="DP116_02245" /inference="COORDINATES: protein motif:HMM:PF00353.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02245" /translation="MESLQQLRFADVIHSILEGTSETKLIEASRVAFGDRIELNRLND LVQQWKIGDFSALPEITILPGNNLNGARGAYSAQTNTIYLSQEFITNASNPADVNDVL LEEVGHFIDSVINPVDAPGDEGEIFANLIQGKTLTSQQLENLRNQNDAITIQINGQVL QAEAADTKPEDNLGIIATAIKPYLEKVRSAIDTLVLDNIPLLNTFSLKDYANKIISDE VENKLISAFEKTENKSVESIKQALFDALGPSGLNLLLDLDADGKVQLSDIQAPKDANS LEFKFKLGKNFNPDLPLNSNSLGLNLKGNLTPELTLGVMLGFGVDDTSLLDGKPSADA FFVDVGTTNEIDGKLNVKFTDKENKPLSFTGDLGFVKLNATDNGSNLTSNFSIDIKNP EADGKGRVKSTAIPSISTDKPKLDANASLNLRLDTGLLNGILPSLKSDFSLSGLNYQS GDSTQSTPVAELKNVTIDSGSLVSGLLGKVLEPVQSVTGAFQKPIDIITSPLPLIKKS LVYLAQEFPKGGGFDSVDPFLKGLKTFTDAATAINSLKPSSSTIDLGDYKLDKSGFST IRETAPIINQLKGNALVPQSVVPQFAALAPTLASSPNSDNLLSTLFPVLNNSSQFVNL LLGGQDTNLFEYTTPTLGFKFELAPEPIVPVFGPVVLKFGASAGAGAQLTLGLDSKGF KEYKEEGFKDPSKILNGLYVGKAEPRNLNGKTTENSFEVFGGINARAAVDVGIAEFAA GGGVFLTAGLEVKGSQEDENKSYIKQQSNPLCLFDKTGALSLVVFASLRLNFGFFKVT KRLNLANVDLIDYRGKTCGDEKYKVQNPPLTSEIRAQLAGQGIIEREGTEQNDVITLE GTELFLVNAPKKLKDSNTVVNGKVNLLGLDSKPQEYTDVQLIVINGGKGNDRIEFKDQ EETNLPIVNVQQSKDIVASGQLDGGEGDDTLIGGRGDDYLTGGSGQDTLNGGTGGRNT AVYSNAPEGKGVKVDLVQNFALDDGFGTTDTLINIQNIEGSSGNDILIGRASGDPKDN NFGSLLDGGAGNDTLIGGEGEDVLLGGAGADFIDGKGGLDTTTYLDSTAPVYVNLSSG RVRITSPINLGTSIQLNANAGVGGDAEGDKIFNVENVQGSVYDDILVANGKDSRVDGL LGNDIIIAAPEAQILDGGPGIDWVTYQLSDSGVNVSLKSGLGKALTITQPNILPGFPP 
IEIEIGSGGYGKDDTLEFAKDANNNVIKDQSSFENLEGSNFDDILEGDLQDNILRGLA GNDKISGGDGNDTLIGGAGADVLDGGNGIDWADYSESFAGVIVNLKTNSGLGGDAQGD TFARLSPTISTIENLLGSKFSDTLTGDDGNNEIKPGYGQDTVYGEGGNDRLSINYSNS VDFFEPGSGVTGGYYPATNPSYSYSSSGITYNPGSVGSVENGFISRRTSDGAKVLDSV AFFGIENLTLIGTSFADSIFSASGNDFLNMGDGDDIITSGQGSDRVFAGSGNDIVVSQ NDLFGRIGGIIDNAKAVIELDGGAGIDTLSVNLSGKKDSISLFSFSSTQENPYQSFSI ADGTVSIKNFEIFKDIITGNGDDFLTQLGRVDNIFATGGGNDTVDAGLGFDNVDGGDL NDLLYINYSLSDTGSGMSMTVDPLKLVGSASRTKGGTSSELLDKINFSNFERFNITGT SKADTIMGGDGADILRGSGGDDTLIGNRGNDQLFGGDGNDILQGTNYISYLIGGGGEF KNYDYPKDIDTLTGGTGADTFVLGNSPYDKYYFGNLFQDYAIISDFKSSEGDKITLSG SDDQYTFEVNAGSTSILRTYESYGPDLIAVVQGVTDLEKNANYITYLPPPPPVVK" gene 12801..15218 /locus_tag="DP116_02250" CDS 12801..15218 /locus_tag="DP116_02250" /inference="COORDINATES: protein motif:HMM:PF13448.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02250" /translation="MTFNITSLDSQFLKQLLGDTTGLSNFTTTLTGDRAAFGKFQDDP FNLKSGVVLSTGKASELAGRNISDGGLSPGTNVKLKFEKLDGTTGSPSNAKTAIYRAD LSELGFDLKSLTIDDGGVIGGSDGRFTGFDLDGIKLSNTKITNAADIQTLPSLNVFDF SPANTIFTPGNQRPPSDATNPDLFGTINGTINNSVANLANFDGVSSTGNDANGFVSLG NLGKVGFNFKSTVTSQLPLYLYIGEVGDNGEVAAGKISVSNRPISGLSDLSTDFGTPG EADDSTTLQIDFDVDSTSRKLFFEYVFGSEEFAEFAGQFNDSFSLQLNGLNLAKLSNN NTVTINNLAQNPFDRSNPDFIYNPANTGPASDSTRLDGYTKTIIFSGDLIPNARNSLT INVKDNRDGLLDSAVFLKGGTLGVTPPDGSNNTPSGNNTPSGNNTLSSNNTPSGNNTP GSNNTSSGNNTLGGRLKNEGGTIFIPGSGNSSQVSLEFATDRYTAKFNNELGVFVVDN EKGEINGIAPGDAGYLQAAIKRSQVVFSALANDPSSKAGNNRIINFAPNTYLSSYLVQ NASTDEVLANLAANKPTPNVFFSTVAANTDKFDHVQFKNTSDGGIFIGWEDLIGGGDQ DFNEPLVSVKVTNNAAPLGNKLQVQRELIDLRDISDAVTANFITNSEAAFDNLVGFYT IDDITGRIGNLRPEDAGYAQAALQRSVFDIKRDENFNKQLNGSALLAPFIIADGSLEQ FLNQNPNNNQLGGNPLAYFAFSGANPDKVDHIRLLGDNKFGFEDIYGGGDRDYNDIVL QVNLRSS" gene 15244..16125 /locus_tag="DP116_02255" CDS 15244..16125 /locus_tag="DP116_02255" /inference="COORDINATES: protein motif:HMM:PF04966.10" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02255" /translation="MGNSSSAIDTTREINNFLQINSTLDYSGASPDLNLYRLSYTFPL FEDFRVSVFPQGFASDYVDRNSFANNSAGNFSTYGLVNNQLLLANDRAGAGAAISWNP GKGLFTIRGVYRAQQAGLVNTKPDFTNSDKRGGLFDDPNLGVVELEFSPSKTFAIRLQ YSGGTQGGEEYNVVGANLELALGQKVGLFGRFGYAFNFPGNIQPTSWSAGIVFPDFLA KGANLGFSVGQPLIFQEKDNLLGFFNSTQTNYEAFYRLPINNNISVSPILQVITDPGN SQANTIYTGTLRTVFSF" gene complement(16178..16384) /locus_tag="DP116_02260" CDS complement(16178..16384) /locus_tag="DP116_02260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488230.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02260" /translation="MLGLTLEQTRIYQEAKAEGREEGREERKVEMLRVTVPLLLKTGM TVEQIAQQLNVDVKSVGRAAQQNT" gene complement(16459..17247) /locus_tag="DP116_02265" CDS complement(16459..17247) /locus_tag="DP116_02265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02265" /translation="MKRDSIYYQIFKRFPGLLFELVDYHPSQAQNYRFESVEVKETAF RIDGVFLPPEGAIPRIIFFAEVQFQKDEALYHRFFTESMMYLNRNQSQYDDWYCVVIF SSRSLEPSDTKTHRIFLNSDQVQRIYLDELGEPNQQPVGINLMQLAIASSDAMAEQAK QLIERVQLEETDALPKNEIIDIITTIAVYKFSTLSREEVEVMLGLTLEQTRVYQEAKA EGREEQKTEMLKVTVPLLLKTGMSVEQIAQQLNVDVESVRKAAQ" gene complement(17463..17960) /locus_tag="DP116_02270" CDS complement(17463..17960) /locus_tag="DP116_02270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188820.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein MoaE" /protein_id="PRJNA477356:DP116_02270" /translation="MNLGLVASVTSPVQPKAEDSFAISFAPLSIEEVYGKADDCKNGA VVLMSGMVRNQTDGKPVIALEYQAYEAMAMRVFYQIAADIRQKWSIVNRVVIYHRIGR LQVGEISVLVAVGCPHRSEAFEACRYAIDTLKHNAPIWKKEHWADGSSSWVSIGACEQ SGENC" gene 18702..20219 /locus_tag="DP116_02275" /pseudo CDS 18702..20219 /locus_tag="DP116_02275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314295.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 20364..21314 /locus_tag="DP116_02280" /pseudo CDS 20364..21314 /locus_tag="DP116_02280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314295.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" gene 21537..24839 /locus_tag="DP116_02285" CDS 21537..24839 /locus_tag="DP116_02285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314294.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_02285" /translation="MTNLILVVDDDALTRLQLRTLLQKQGYQVAEAINTEQALELYIK LQPDMVLLDALMPVMNGITCCEQLQTLPGAKKIPVLMITPLDKSATVERALAAGAIDY ITKPIQSQVLRQRVRRLLEARDAMQKLQQQTEQAQSREVQLVMALEAASMITWDWDIL NNKLTWPDNLKPLFGLESATYEAFIERVHPQDRDFVNRSVMQTLQKGTEYDIEFRVVW HNGTICWVASKGVVFRDSSGVAVRMTGIGMDISKRKQAEEALEVYANRQALVAELSQM ALAGVDLTTLMDETVALVAQSLKVEYCKVLELDSDNNTLLLRAGVGWEPGLVGYATVS AEMDSQAGYTLSCQEPVIVNDLGTEKRFNGLQLLHQHQVVSGLSVIIHGKERPFGVLG AHTTTQRTFGKDDIYFLQAVANMLATAIERQKVEDALRESEQRWQLALRGNNDGIWDW NVKTNQVFFSTRWKEMLGYSEYEIPNHFDEWMKRVHPDDMTSVTRVIQNHFARITPFY ISEYRIRCKDGSYKWILDRGQALWDDEGTVVRMAGSHTDITKRKLADEKLQESEKRFQ ILARATNDAVWDWDLLTNKVWWNNNVQTLFGYSTEEVKNEVTWWHEHIHPDDRERIVS DIDAVINSNEQFWSNEYRFRRVDGSYAYIFERGYVVHDNTGKSVRMIGAMIDFSERKR VQEELQRQNLRSQLFADVTVKIRQSLQINEILQTTVKEVQKLLQSERVLIFRLLSNGS GIVVQEAVVPGVPAVLGHNLHDPCFIEDYVHKYRNGRISAVTDIEQGSIEPCYMEFLK KLNVRSNLIVPILLKNQLWGLLIAHQCTQPRQWTSWETELLRHLADQIGIALAQAQLL EQETRQRQELTRSNEELQQFAFIASHDLQEPLRKIKTFGERLKASYGDVLSEQGLDYL ERMQNATRRMQALIEDLLTLSRVTTRGQPFVPVDLTRVTREVLSDLEVRIQQTEAYVE VGELPIIHADPLQMRQLLQNLIGNALKFHRKDEPPIVKIYSQTLNHQDAAQLCHVIVE DNGIGFDEKYLDRIFNVFQRLHGRSEYEGTGIGLAICRKIVERHNGSISAQSRPGQGS KFFIILPMHPPG" gene 25002..25415 /locus_tag="DP116_02290" CDS 25002..25415 /locus_tag="DP116_02290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323240.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_02290" /translation="MADDDEDDCMLAREALAESRVANELHIVNDGEELMDYLYHRGMY THKSSAPRPHLILLDLNMPKKDGREALREIKADPHLRQIPVVILTTSKAEEDVYSSYN LGANSFIIKPVTFASLVEVMKTLGKYWFNIVELPL" gene 25432..27324 /locus_tag="DP116_02295" CDS 25432..27324 /locus_tag="DP116_02295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314292.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_02295" /translation="MKESPFKVLLVDDDEDDYVLTRDWFSEFQVACGELEWKNNYQAA MDAIVKNQYDVCLVDYRLGASNGLDLLREAIHQGCSSPIILLTGKGDREIDIEAMKAG AADYLEKSQLTAPLLERSIRYAVERKRAEQKIREHAALLDVATDAIFVRDLNKRILFW NKAAEQLYGWKATEAIGKNTSELWYEKDIVQFQEALDSLLKNGSWEGELHQITKFDKE IIVESRWTLVHEYDKQGQSILVVNTNITQKKELEAQFFRAQRLESIGTLASGIAHDLN NVLAPILMTAQLLESQLNDQRSKRLLPILISNAKRGANLVKQVLSFTRGIEGDRTLLQ LKHLITEIQQIVKETFPKSIEVSTSQEQTLWTVSGDATQLHQVLINLCVNARDAMPNG GQLTISAENFIVDKNYAKMYIDAQVGSYVVITVTDTGVGIPQEIIDRIFEPFFTTKDL GKGTGLGLSTVLGIVKSHGGFVHVYSEVGKGTQFKVFLPAQEAMETPEEQELELPNGN GELILVVDDEDSIRDVTKTSLESYNYKAITASDGIEAIALYAEHQNEISVVLTDMVMP SMDGITTIRTLKKINPAVKIIAVSGLASSEKVNTVNNMGVKAFLSKPYTAKQLLQTIS AVKSGN" gene 27412..27804 /locus_tag="DP116_02300" CDS 27412..27804 /locus_tag="DP116_02300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4440 domain-containing protein" /protein_id="PRJNA477356:DP116_02300" /translation="MSNQSSEVLAVNAAFYRAFEKKDIEAMSTVWSQGTGSFCVHPGW NVLRGWKEIRSSWVNIFKNTAYIEINTEIVTTEVRDHIAYVVLVENVLQIINGQRRLE AQSIATNMFELLGGKWYLVHHHASPIMR" gene complement(27843..28391) /locus_tag="DP116_02305" CDS complement(27843..28391) /locus_tag="DP116_02305" /EC_number="2.4.2.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314291.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR" /protein_id="PRJNA477356:DP116_02305" /translation="MTASVETKVVEILSPEELRRTVNRLASQIVEKTRDLSQLVLLGI YTRGVPLAQLLTRQIEALEGIPIATGALDITFYRDDLDQIGLRTPAKTDIPFDLTGKT VVLVDDVIYKGRTIRAALNAVNEYGRPEIIRLAVLVDRGHRELPIHPDFVGKQLPTAK EEIVKVYLDNWDRRDAVELIGD" gene complement(28598..29557) /locus_tag="DP116_02310" CDS complement(28598..29557) /locus_tag="DP116_02310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein" /protein_id="PRJNA477356:DP116_02310" /translation="MTSAVVLIIAATLDYFIGDPWGWPHPVRVMGWTISRLTKFSITY CHNSLTQRLAGIALGIILIIGSGFIGYLLIQSARLLHPFLGVALESILLASCFAFKSL RVAAETVLQPLTAGHILDARSALSHYVGRDTQNLTELEILRAVLETVTENATDGVMAP LFYAIIGAFIPVIGPTPLALAYKASSTLDSMVGYREKPYTYLGWFSARLEDCLTWIPC RLTVITLALLSGKPLYVWRICRRDAVCDPSPNSGWSECAYAAILGVQVGGTNWYRGVA KHKPLLGDPIHPITPTCIYQALQLTRYCFLLWLGVAIVLLLSK" gene complement(29625..30068) /locus_tag="DP116_02315" CDS complement(29625..30068) /locus_tag="DP116_02315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_02315" /translation="MKLTTRGHYSVKALLDLSLQPGYGPVSTKAIASRQHIPAPYLEK LLIEMRRAGLVKSMRGSIGGYQLAREPAKISLGEVLEAVGETIEPLPHHQASRTQAED WVTFTLWQRLHQKLKEALYSITLADLYYDARSWQASLGEEANFIV" gene 30752..31165 /locus_tag="DP116_02320" CDS 30752..31165 /locus_tag="DP116_02320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314288.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AbrB family transcriptional regulator" /protein_id="PRJNA477356:DP116_02320" /translation="MAKQKKIEPLVGEDLLKKVKELENLSKEEKAKHCGYYTVTKNGI ERVNMMKFLNALIDAEGIQLDSAPSANGRGGRSASYRISVQSNGNLLIGSAYTKQMNL KPGDEFIITLGKKHIRLRQVDSDEREEAELAEVAV" gene complement(31294..33198) /locus_tag="DP116_02325" CDS complement(31294..33198) /locus_tag="DP116_02325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chloride channel protein" /protein_id="PRJNA477356:DP116_02325" /translation="MTLLPSTELRKVVEQPAFTTFSARLTHLLNRFQPSPETLVLLLA VLIGGGGGMGVVTFHYLIELIHRLTLENFMGVIGAWGAWTLACVPILGGLIIGLMRWR TQDFGPGLSTLIAASQGGEIKGQLRPVTKMLAASVSLGSGASLGPEGPSVEIGASFGM LLSVVLQVSQERQRLLLAAGAAAGLAAGFNAPIAGVFFALEVVLGATSFATSAVSVVL LAAVVAALVAQIGLGAQPAFALPVYQVRSLLELPLYLGLGLGASLVSLAYTQLIALAK ACFHGKIPYLRWLGNIREPIHPIIGGTIVGIVALQFPQILGIGYGTIEAMLQDVKFSL QLLLVLLVVKLLMTAVSAGSGFVGGVFAPAMFLGASFGSAYAKILASLVPPVTQYMAA PPAYAMVGMAAVLAASVRAPLTAILLLFELTRDYRIVLPLMAAVGLSVWLVERIKPTS NSNSNLQQIGLPELKDEQAEILHQILVQDAMYSSPKKLLLTMNVIEAALEMSGDLCQS ALVINEAGQLVGIVSLEDINRTLRLWEKYPISSSEIQANTANQTLMDICTTDILYALQ DEPLAEALDRMALKGLHHLPVVDRDNQERILGLLEREQIGLTCNVAVTRRALRHYLPM TRKTGVSISQ" gene 33740..34039 /locus_tag="DP116_02330" CDS 33740..34039 /locus_tag="DP116_02330" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02330" /translation="MHIHLEGDFLQGGMNYKVNYGVIDPLSFFGHSQKNLAILVLQPF CHAGSTSFCLSKLWILYACFLANVRADQVIYTHFEHRFSTWYLKEQVSGLDGRGR" gene 34036..34401 /locus_tag="DP116_02335" CDS 34036..34401 /locus_tag="DP116_02335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319859.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02335" /translation="MNTHTFDNHYVTCPICQKNTKPKLVKTCMGLYTCPYCQERLVVC QSGHYVRDPFAYKQIMISSVLRRQSRPLARILRDFTILKRPVVALVIGGAILLSVIGM TQQASDQNSPRLPKTEKQR" gene complement(34489..34791) /locus_tag="DP116_02340" CDS complement(34489..34791) /locus_tag="DP116_02340" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02340" /translation="MLEVSSVVGIYVSVKRTPEYKSQRDTLLSGETTQLCYNGTAGAI KVKEGLLKLQEADLGGTPKHCLWNLQGHAAFPCAVLVYHSLLKHSRIHHETFKNFF" gene complement(34873..36483) /locus_tag="DP116_02345" CDS complement(34873..36483) /locus_tag="DP116_02345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phospholipid carrier-dependent glycosyltransferase" /protein_id="PRJNA477356:DP116_02345" /translation="MQEGSFIWGRLEKQYRTVDKWIDWVWVIVLLLAAVLLYTINLGE LPLRDWDEGTVAQVAREIWRAPAGSMHWLYPTLGGEPYHNKPPLMHLLIAWAYSLGGV NEWTSRLPGALLTAMSVPLLYYIGREIFHQRWAAIYSALTYLTMLPVVRHGRLAMLDG AVVSFFMVMILCSLRSRRNLRYCLGVGIGFGLICLTKGILGFLLGAIAFVFLFWDTPR LLTSRYMWTGILLGILPVTFWYGAQFVYYGDKFTKTGMMDQSLSRVWKSVEGHSQPLW YYLLEIFKYGWPWLIFLPSSLRSTWENRNLSWAKLVLVWCGVYLIAISLMETKLPWYV FPIYPGIALACGAKLAEIENMPILSSYPRYLVLSLVLISIVGTGVSIYYGLGGTAQSD LQLVFGAVAITMAMAAILAERGDSQFLKILFWGSYVSLLLLMKSNDWVWELGEAYPVK PVAQMIQQVNPPVTKIYTSFRDHRPSLDFYSDRTIVPASSNELKNYWKSDKQPYFLLD ETALQDLKLESIKVKKASGWSLVTKNTK" gene 36743..37927 /locus_tag="DP116_02350" CDS 36743..37927 /locus_tag="DP116_02350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314284.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02350" /translation="MNLPVVVDIALGLIFIYLILSLLTSEIQELISTFLQWRAKHLKK SIQLLVAGGSETQQSDIDDATVLVHKLYQDPLINTLNQQGQEIVEKELQELNQLRVDP KTLKGKQSAPSYIPSETFAITLLEALRIPELINYVKNPSDTKTNLHMILSSYKELKTA INDKGSDSYQTIQNIYGDISENNEFKKLVQGLPEYVPNNLITSLSLLAQRSRIKIGDI KEEMNQFQREVETWFDRSMDRASGVYKRNAKGVAILIGISVAILTNTDTFFLLKRLSQ DSAVRSAITQSAIQQKDFINDQNARSQFQELIENASVPIGWQNISQQFEPLKTSRGNS AQIFALRIWLVLKILFGWIVSGLAIAMGAPFWFDILNKVINVRNSGPRPVTYTKDQPP EK" gene complement(37941..39239) /locus_tag="DP116_02355" CDS complement(37941..39239) /locus_tag="DP116_02355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873156.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="recombinase family protein" /protein_id="PRJNA477356:DP116_02355" /translation="MKIFAYIYTDPILEPTPDPTTWGWELDFIYQDLGKRSQLQQLFN DCKTEPPDCLLIRRLEELGDSLEEVSSRLAELEAMKITLIAVEQDYNSSQQNPNPRAQ LLKLLYEIQRQQRSRRIRQGHARNRLDAAPPPGRVPYGYRRGKAKYIIDRTTSPVVKD FFEHFLLYGSLRGAVRYLAKKYSKKISVTTGRRWLTNPVYRGDTAYQNGEIISNTHAP IITKEEAAQVDRLLRRNSRLPRRTASAPRSLAGLVVCQECQSHMTVTRVTMRNQKKEY LYLRPISCPKNPKCRALPYQEILEQTIQAVCRDLPLAVAGMNFPQLDAVKNSLTEGIS RQQEILNQLPSLLETGVLDAETAQLRAYKLRTEISTLQAKLATLPPVNLRSVAQAVSI PQFWLDLSEAERRFYFREFIRQIEIFRQDQEWKLQIIFIF" gene complement(39735..40508) /locus_tag="DP116_02360" CDS complement(39735..40508) /locus_tag="DP116_02360" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02360" /translation="MTLTDDIEQFQTHTAKSSETLEGINELANKLFECSTDELASLLA EIESLCEQAKTEIKYTADCLVEINDSWGFEPLQHYSEKKQNTNQTTDSTVKGGNSSTQ STTVSENETKQVEAGDFAPRIEQAKELLDKAVDVCGELFNEAVKTGKNLQASLMIATS LTSGFLTKTVKLVDDVVDGISSSIEISNRVVSQPGMPLSENSSFAVIKEISDKTEEYF GVDGELAAAYEIQKDQEEKERKRKRAAQSRDESRGASSK" gene complement(40513..40731) /locus_tag="DP116_02365" CDS complement(40513..40731) /locus_tag="DP116_02365" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02365" /translation="MTLSSDIEEFRTMISRIVDQLEEINSSSKTLPETTATELSRKFS GLMDNLGEIDRELRSATDALEDVRSRWG" gene complement(40764..43793) /locus_tag="DP116_02370" CDS complement(40764..43793) /locus_tag="DP116_02370" /inference="COORDINATES: protein motif:HMM:PF01580.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02370" /translation="MSSLSSEVMDNLSEIQLGGQGASVSKFTPVQFGKDFFDNLAQCR DDGEKIQLFLETAQRQNVPLRAIEELLNYQDEFILAMGLQSFGYITNPQIKAQLATLD YIQGLDEPLEEISLLADPEDYKRHSVRILEKLCQEAKSGSDLIRLSAAWAIQQIGYSA MITGRFLPKSAEDIQSQIISESLNRLNNRSISKEYIDFWVYTPKRHLLRLLQTIPSSY LDVVERILARLGVVGVEFIARSASTLQQFVVEVALDLANLLLRDRQYRKYQDAGTQRT LSDILIPFLDNSDIDLRRLAAEPINEVGSSWSCLNDTIKAKAAIILRDWNKLEELGEL SVPFLIEAIKGSLLLDTQNNLREQVEAVRCISRIYFYNVEQQVTILSEFLQDYQEEIR DVTVSLLKPHQKLLDMKSNHILTGLYFGFQLEDFDVKTMTIREIDRKIADERLYQEDF DKIFQGEVIKPEIFNLQGYYLPYFFEANKDNNMLAVCDVLHRLKEKFLLEITIQVCQS SEDRETWVNALSQMVAQLQLLTSKSKDAFLDTALKTYKQYQELYVNRNLFKYNIKALA ETRSNIRTVLMTLVQSATKSNSSGEQDYIEIVSRDKDEQKFLETLSATENIDISTAVE RKGWEGDFGEKYIRQPIKPKLSNELGDGSTNLYSNSFNSANFSHPTLPSSGGAIIRTG ASAITQSSNNSRRLPLAQIKDLKPLHRLATLEEISGFFRLVIPGNTPVPGMATTRLRN STAEEIFNNYKHLMTKDEYIVGLDDENNPVTSSWSEIPHRLVAGVSGAGKSNFLKWII FQFLYVNPKRRIYIADFGGVDFQWLNHMGVNVEIETTPENCPNLVEKIHQQEYERRLQ LMQEYGVENLKELQEEGVEIDRTLWIIDEAADIADVSSKLRDTIEKRLKEYARKGRKF GIHIIYCTQRPTTEVITKQVTDQCEEKVVFRVSPDASYRILEDAIAGDIPKDAKGRAY LSGYSGAKFVNTPLIKMPIGSKVKISETLWANLHTKK" gene complement(44336..44695) /locus_tag="DP116_02375" CDS complement(44336..44695) /locus_tag="DP116_02375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312729.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glyoxalase" /protein_id="PRJNA477356:DP116_02375" /translation="MQITQCLHTAILVTDLERAEHFYTKVLGLPKVERSLKYAGAWYQ VGNHQIHLIAAQGVPTEDQNEKWGRNPHVAFSVADLDAAKQQLQNNNCFIQPSASGRP ALFTKDPDGNIVELSQQ" gene complement(45131..46321) /locus_tag="DP116_02380" CDS complement(45131..46321) /locus_tag="DP116_02380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317519.1" /note="transforms a conserved lysine residue of initiation factor 5A into deoxyhypusine; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="deoxyhypusine synthase" /protein_id="PRJNA477356:DP116_02380" /translation="MSKQLGKKIAPTPIPTDIGVVDLIDNYFTAYNSARLREICHLLS RDVFKEDVTVGVSLSGAMTPAGFGVSALAPLIRNGFIDWIISTGANLYHDMHYGLGFE LFAGNPFLNDVKLRQEGTIRIYDIIFGYDVLLETDAFIRKILQAEPFQKRMGTAEFHN LLGKYVREVEKQLGVKHSCLLATAYECGVPIYTSSPGDSSIGMNVAALSLEGSPLILD PSIDVNETAAIAYSARETGGKSAAVIIGGGSPKNFLLQTQPQIHEVLGLEERGHDYFV QFTDARPDTGGLSGATPSEAVSWGKIDPNELPSTIVCYTDSTIALPLVTAYALNQCQP CSLKRLYDKREQMLDKLRTDYLAAKTQSVDEIPVAVAQSTSSQEVATSPIGRLIPNTG GTES" gene complement(46527..47048) /locus_tag="DP116_02385" CDS complement(46527..47048) /locus_tag="DP116_02385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="O-acetyl-ADP-ribose deacetylase" /protein_id="PRJNA477356:DP116_02385" /translation="MENRVSVIQGDITQLQVDAIVNAANNSLLGGSGVDGAIHSVAGP ELLSECRKLQGCDTGQAKITKGYHLPAKWVIHTVGPVWEGGNSGEDELLAQCYRNSLA LAVQNGIKTIAFPAISTGAYCFPLERATKIAVNEVNKFLHSNNSLEQIIFVCFGQNAY NCYLRVVQEITQS" gene 47355..48017 /gene="cobO" /locus_tag="DP116_02390" CDS 47355..48017 /gene="cobO" /locus_tag="DP116_02390" /EC_number="2.5.1.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748286.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cob(I)yrinic acid a,c-diamide adenosyltransferase" /protein_id="PRJNA477356:DP116_02390" /translation="MQTDETWTTSDVSAVGENSSLTSEQYRKKMQRRKEVQEVRMKNA SNEKGLIIVNTGNGKGKTTAALGMVLRALGHGYKVAIIQFIKGAWEPSEKRVFSIWQD DLLEFHALGEGFTWDTQDRDRDIEKAIAAWQKSLEYIRNPQFKLVLLDEINIALKLGY LQVEEVLAGLDQKPPNNHVILTGRGAPTALISRADLVTEMTLVKHPFRDQGIKAQPGI EY" gene complement(48066..48815) /locus_tag="DP116_02395" CDS complement(48066..48815) /locus_tag="DP116_02395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878994.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02395" /translation="MVNNIKKFLSPSLLGIPIIGGLVLVGLVVNRPKPVMSNQLAYID QSAELAEDSQLPTYTWLQQNPNPASDMTRDKQTRLRRVLAKSVTVPTDITKSNDLEAV PTTTVAIAQPQTPTKTASVTPIAAKFPQQDGVYLYGQSPKSGQLGQGYIIFEKHQSKV TGALYMPSSEFSCFNGTLRSSGELAMTVNGYPGDKSPTQVAANNTLPRVMDDESSTYA HSVTLQDYYQLNNISSSDRQVLQMCQANVQQ" BASE COUNT 14507 a 10214 c 10461 g 14150 t ORIGIN 1 acagagcggg aatgcataac tgtgggctgc tgcctcaata taattattgg gaggcagagc 61 ctcctttgat agcattccca tgctgagcat gggaacgaga ggaacgagag gaactacaga 121 aaaagtgaat aacctcaagc cagtagcggc tacatgttca gtagaacaaa aactggagca 181 aatcaaagat gagtttcaat ctgacagctc attcatacgt atcacctcac cccgccctga 241 cgggcacccc tctccttacc aaggagaggg gcgaggataa caacaaaact ttccaagaaa 301 atttagaagc tttagacttg gaaccaattg cccaaagatt gatgtatcct gaacacggtt 361 taggatggag ttgtgaagaa gtagagcagg cgatcgccca ctacaaaatg tttctcaatt 421 tgctctacct ctaccccaat agcgcaattg tccctacccg cgaaattgat attgtctggc 481 attaccacat tctcgacact cgcaagtatg cttttgattg tgaatggtta tttggttact 541 ttgtacatca caatcccaac tttgattttg gcagtgagcc agacagacta gctttagaaa 601 gggcgttttt tgacaccaaa acattgtttg cagaacattt tggcatcagc ttaaataaac 661 tacagcaagc aagtgcgtgt gcagatttgt aaccacgctt acacctcacc ccgcatttga 721 gtatcgcaaa cgctcccctc tccaaaacat cggagagggg cagggggtga ggttcttttg 781 taatgactgc attacctatc ctcaatagct tggagcaagg ttctcatttc tcttgcttcc 841 ggtatcattt tcaagctctc aaaactaagg gcggcttctt gcgcccaggt gcgagcttgt 901 tccaaatttc cttgtcgctg caacaagtaa gcaaaggaac gctggtgaaa ggcgatagaa 961 tgcttatctt tgttacactc cgccacaggg aaaccctgct ccaaaatttg ttgagcgact 1021 tctaaatttc cttcctcaat tgcgacatca gctaaccagt tttggatggc aatctccgcc 1081 cgctcccatc ctatatcctt tgcttgtttc agtgctcgtt ggtaaagttt cttggcttgc 1141 acataatcct tagttaggaa gaggatttta gcttgataat aatgggtttg aatctgctgt 1201 cgctggcgtt ctaactcttc tacaccagac tgtttcagca tattttgctt gattttaatc 1261 cactcatgtg ctaattcaaa tttctgttga tggatgcaca aaacaaccat cctagctgct 1321 aattccactt ggaaggtgat 
gtctttatgg ttccgtaaat cccaagcatg ttccaaaagt 1381 gtgctggctt cttccagaca ttctggttgt cttgtcaaag tgagtgtcca acctttgtca 1441 tacaaaatct ctgccagtgt tgaccaatcg ccgcgccttt cagtagcccc ttgcaaccaa 1501 tccgtccaca taaggcggtc atcccaatac ccacgcacat gagcatatcc tttgatttgt 1561 ctccataaag ttaggatatc ttcatatctt tccttttccc tttccctgca ccatttcatg 1621 acctcctgaa gattctccca ctccaaatct atatgactgt attcgggata ccatttcttt 1681 tcatctatat ttccgtattt ctgcgagaag tcgagatacc actcaaccca gcgcgatcgc 1741 actttctcct caaattctag attggctgtg agttcagcac gcgcatactc acgagtcagc 1801 gggtgtaacc tataccttgc gtcctgctgg cgaactagcg aaagttgttg cagtttggct 1861 aatccatcac cagtattcat cgagtcttcc tcagcaattt cagcaatggt ttctctctca 1921 actggttcgg ggaatagcga tagcgtcatg agtattcggt gggacggttt tcctcgcata 1981 cgttccacag aagcctcgaa acagaagcga gccacatcac ctgtagcact ttgtaatttt 2041 gtcagcacat cttggagcga atatcctgcc gctaactgac caacagtata aactattgct 2101 gctgggatgg cgtgggtatt ctggcacagt acttttgcat caattggctg taactcgact 2161 cctttttctt cggcttggtg tgtgattagg cttagagcat cttcttcgtg tagcgattcc 2221 aaacggattg gtacaaacaa agcttgttcg cgtgtcgtca ggataacttt aactgtacgt 2281 ggtaaatcat acacaaagct taataattcc tgtttatctt cgatagtttc caaattgtcc 2341 acaattagca aagttcgctg gcgcttgagg ctatctcgaa tcaaatcaag ctgatcgtct 2401 ggcgacctgt agctaatttc agggcgttct agggtatggg cgatcgtccg gtaaatgtct 2461 cgtaaactac gttcgcgctt gaggcgaggc aaaatccgca aagctttgag atactgctgt 2521 ttagctgatg taaaaataat tgcctcaaaa gtcggtacgg tcattccctc aaaggactgc 2581 actactgaac ctggctgact agcttcgaga caacgatgtg cagtttctag gactagtgtt 2641 gtctttccta cacccccaac accatcaaca ctgattaaat gagccgaatg ctcagaggaa 2701 agcagctcca acaagcgagc taattcctga cggcgaccga taaaagtggt acattctcgc 2761 ttgggtaagt tttcgtagat aggcggtgat gcagttccgg tgtttgttac tggcttgatt 2821 ataactttgg gagaaactgt actctcaccc tgcagttctt taggagtttt tgcgggaaca 2881 gttgactctg aagtcttttg tgttacacct tgttgtaatt gctcagtgag aaaaattcgc 2941 agtcggttct ccttacctcg actgctacca ctaactttaa atttgttgta cacctgtccc 3001 attcgcttca aacaggctcc ctcagatgta 
tttagcttct tagcgatgtc gtaataattc 3061 agttcctcgc catagcgaag cagaaaaact tcctcctggt ctttggacag ttcatagtta 3121 cgagccatct gcgctaaaaa gtcgcgtggg agcattctgc ctttcctcat atcatccgta 3181 tactcagatt ttgcacctag ccctagcgat tagaaatcgc ggctatacaa accaagtccg 3241 cctgcgcgga ctaacagaaa atcagcggtt tcaaacccgc gtaggcgggt tttgtctgtg 3301 tagccgcgac ttccagtcgc ttggtgcaag atgtaagtat acccagttta cttattccga 3361 agtttccgga gaaatacaga ggctcatttg aaattcttat aaatcaggta gttgtcagaa 3421 caaaatttat agtgtaattt cttagattat agtttcagag gttacatatt ttttaagagt 3481 ttttcctacc ctcagaatta aatttccagt cacagcaacg gtcaccagtt ccaaatttag 3541 gaggacatca tgaaaagatt gttgttaaca gtactaggct tgttcacgtt tattgtatgt 3601 actgcactga cgatacaact atccaaagca gctccagcaa ctgcagggtg gtccgattat 3661 acgacagcaa cagcaggagc ttattgccta gcgaaacacc cttatctaac gagttactca 3721 ggcacgacgt tcacaccaga cggctttgag attgctacag ctcctggact cgtcggcgct 3781 gttcgtcgta attttgtata tgacagaagc tcaagcacga cggatcaagg aaatgggcag 3841 acctgcgctc aggcttgcgg ggaatttgga aaattttact ccccttctta caaaggaaaa 3901 tctttgacac aaaaggttgg tcaaacaaca attgcaagtg gtctaggaga cattgcttca 3961 ttggctgtcc aagacaagga tttttatttg gataaaacag tagttgcggg catatgggcg 4021 agaccgagta cctatcagga agctgatgtt gctcaggcag atttctgttg ctgccaaatc 4081 agtaactgat tagttttatc gtagtaatac caattcaaaa aatgattgct acacatgtag 4141 gttgggttga ggcacgaaac ccaacgccta caatgcctgt aaatgttggg tttcgctccg 4201 ctctaacgga cactttcttc aacgggggga acccccgcac agaagtgtcc tccccaacct 4261 acacatctgt cgcattctgt cctcaaattg gtataagttt ggttgttttg tggcgatctg 4321 cgctccgcgc agcagcgttt gcgcagcgcc cccttagggg ctagcgatcg ctcaaggggt 4381 actacactcc cgcagggaac cgtccaaagg gcattagcgc aaagcttctg taaattcatc 4441 tgcgtccatc ggcgtgcatc tgcggttaaa tagttatttt tttagacttt tgcaacaagt 4501 ctagtcaact cgcaccagga gaaagtgatg ccagaagagc agactttggt tattgaccat 4561 ttgcagaaag attctacttt accagttatg ccaaatccgc ctcttttgac cagtcagaaa 4621 tcaggatgga gcagtattca tctaggacat tttcgccagc cagcttggga attacctaaa 4681 ttctctagcc tccaacatat catcagcatt ccaatcgcgc 
tgcatcacac agtaagcgta 4741 gaatttgttt tggaaggtca tctacataga atccaatatc atccgagcga tcgtatagat 4801 ggctgcgttg aaatattccc tgcaggtcca tgttctaaaa tctcctggga taaagagatt 4861 gactttactc atttctatct tgagcctaca tttgtttctc aagtagctca tgaagcaatc 4921 gatccggatc atgttgagct tctgtttgaa ccgaagaaag tcgatttact agtttatcag 4981 atttgtctag cactcaaagc agatttagat gttgatggaa gtggaaacgg cttctatgca 5041 gattcgatgg caactgcact atcggcacac ttgctacgcc actacgcaac ccgtaaacat 5101 actttgcaag agtatgaaga tggtctaccc aaacacaaat tacaacaagc ttttgactat 5161 atcaatgagc atttaggtga ggatttatta ttgacagaaa ttgctgccca attgcatatg 5221 agccagtact atttctgtcg tctgttcaaa cagtctacgg gaatgactcc ccatcgatac 5281 ttgattcaac agcgggtaga acgggcaaag cagttgttga ggcaaccaga actaacggtt 5341 acggatgtgg cgatggcgtg cggatttgca aatcagagtc atcttgccaa gcactttcgc 5401 caacatacag gggtgacacc caaacaattt cgcaagatgt agcaagattg tgttcaaaat 5461 agcaataaaa tattcgactt caattggcaa tgttagctaa ggttatctca ttatttttcc 5521 tttcccatta ctagagcaag taggcttata gataacctga cacacggtat taacacttaa 5581 aaatgactga tgaattaggg tgcgtcattc cttttccaaa gcactatcag aggaaagaag 5641 atgattgata gctgggcgaa tgcaccaaac ggtagcatca gaagttatgg cgagatgacg 5701 ctctcaattt caggcgataa gccggaggct tgacgctacg cgtatcgcat gagaaaaatc 5761 tcaggcgggg aaattttcgg tggtagcgcg tggagtcgtt aagcaggtgt tttgcagggc 5821 gatcgctctt tctaaattta ggtagtcctg ggcagaaatc ggggtagcaa agcattggct 5881 caggtgaagg aaagtattgc gataggtcgg ttgtgaaata gcagccgtag aaaatcttta 5941 aaaaaagcct aaagaagcct ttagaggctc aggttgcaga tgggaattgt ttacctttca 6001 ataaggtagt taaagtatga aaagaattcg cgcagttttt gtctttttag tggttttagt 6061 tgcaggtttg agaagaaaac gagtagccgt gcagaaactt atgttaaggc taaaactcaa 6121 taagggcagg cacttcatta gaggctttac agttattaca gctacgctgc tgctgatagg 6181 gttactattg acaaccgaat cggttttagc tttcaagcca aatgaaaagg gacacctcgg 6241 tattacgtct gatgcagtaa gacctattga aagagttgtg ggtggtgtga gactcaggtt 6301 ttctccacaa gctttagaag agattcgcaa agccaatgaa tctactgacg acctgagcag 6361 tggcgatttc tttagatccg aaaagcattt tgacaatgag gaatttttcg 
ctggaacaac 6421 gcgcttattg gagcttaaag gttacatcat tagtcaaatt acaagttttt cacctaatgg 6481 caaaaatgcg cgaactgctc tgggcggcgc gttgcataca attcaggatt tttatgctca 6541 taccaactgg gtagagttag gtcacaccag tatcgacact cagcttggtc gtactacgat 6601 tccaaaaccc cctataacca ctccaacatc tcccgcaaat gaccaaagta cccttctccc 6661 tggtcttact gtactaactt ctggctactt tgtcacgccg ccatattgca gtgcacctcc 6721 aggtaaaact cgtcatggtg tatctggctt gtgtactaat ggtctcaata aggatgaacc 6781 gggcagacct gggtatcctc aagcacgcgg actagcagtt atagctagcc aagactttat 6841 taatcaaatt ctcgacaccc caggtgttgc aggtaatgac agggcgatta gaaccctaat 6901 gggaatttag aagcaccttg tagagttctg ctttcaagga tgtccatacc ttataaggtt 6961 cggtacaaga gataacaaag ttttggtttc ttcgggagtc actgtcaccc ttgtaggaga 7021 agactttcac ctctatccca agatcagcta tgaaattttc gcttgttcga ttccttagcg 7081 tttttatcct aagttattta ttggtttaga aacgagttac acatgtggtt tgccctgcac 7141 cacgactaaa aatatacata cttcaggcaa taactatgga atctcttcaa caactccgtt 7201 tcgctgatgt catacattcc attctagaag gaacctcaga gacgaaactt attgaagcta 7261 gcagagttgc attcggcgat cgcattgagt taaaccggct caatgattta gttcaacaat 7321 ggaaaatagg agatttttca gcgttacccg aaattacgat tctccctggt aataatttaa 7381 atggggcacg aggagcatat tctgctcaaa ctaataccat atacctatcg caggaattta 7441 tcacgaatgc gagtaatccg gcagatgtta acgatgtttt gctagaagaa gttgggcact 7501 tcatcgacag tgttatcaat ccggttgatg caccagggga tgagggggaa atttttgcaa 7561 acctgattca agggaaaact ctgactagcc agcaacttga gaatctcaga aatcaaaacg 7621 atgccatcac gattcagata aatggacaag tgctgcaagc agaagcagct gataccaagc 7681 cagaggataa tttagggatt attgccacag ctatcaaacc gtacttggag aaagtcagaa 7741 gtgcaattga tacgttggtt ttggataaca tacctctttt gaatacgttt tcactcaaag 7801 attatgcgaa caaaatcatc tcggatgaag ttgaaaacaa actgattagt gcatttgaga 7861 aaacagagaa taaatctgtt gaaagcatta aacaggcact ttttgatgca cttggtccga 7921 gtggcttgaa tttgctgctt gatttggacg ctgatggaaa ggttcagttg agcgatattc 7981 aagctccaaa agatgctaac agccttgagt tcaaatttaa gctgggcaaa aactttaacc 8041 ctgatttacc gttaaatagc aattcattag ggctgaattt aaaaggtaac ctaactccag 
8101 agctaacttt gggagtcatg ctagggtttg gagtcgatga cacatccctg cttgacggta 8161 aaccaagtgc tgatgccttt tttgttgatg ttggaacaac aaacgaaatt gacggcaagt 8221 tgaatgttaa gttcaccgat aaagaaaata aaccactgag ttttacaggt gatttggggt 8281 ttgtcaaatt aaatgcaaca gataatggta gtaatctcac gagtaacttc tcgattgaca 8341 taaaaaatcc agaagccgat ggaaaaggaa gagtcaaatc aacagcaata ccgagtatat 8401 caacggataa acccaaacta gatgccaatg ctagcctcaa tctgagacta gatacaggat 8461 tgctgaacgg aattttaccc tcgctcaaat cagattttag tctgtcgggt ttgaattatc 8521 aatctggcga ttctacacaa tctactccag tcgcagaact taaaaacgta actatagatt 8581 caggttcact ggtatcaggc ttgttgggta aagttctcga accagttcag tcagtaactg 8641 gagcttttca aaagccgata gatattatta caagcccact accacttatt aaaaagtcgc 8701 ttgtttattt agcacaagaa tttcctaaag gtggtggatt tgattcagtt gatccattcc 8761 tcaaagggtt aaaaactttt actgatgctg ctacagcaat taatagtttg aaacccagtt 8821 cttcaactat cgacttaggt gattacaagc tagataaaag tggtttttct acaattagag 8881 aaacggctcc aattattaac caactcaagg gaaatgctct tgtacctcaa agtgttgtac 8941 cacaatttgc cgcattagct ccgactcttg cctcaagccc aaactctgat aatcttttaa 9001 gtactttatt tcccgtactc aataactcat cacaatttgt taatttacta ctaggaggtc 9061 aagacactaa cttatttgag tacactactc caacccttgg ttttaagttt gaattagctc 9121 ccgaacctat agttccggta tttggtcctg tggtgctgaa atttggagca tcagcgggag 9181 caggagcaca actgacatta ggactagata gcaaaggatt taaagaatat aaagaagagg 9241 gattcaaaga cccctcaaag attttgaatg ggctatatgt cggtaaagcc gagcctcgta 9301 atctcaatgg taaaacaaca gaaaacagtt ttgaggtctt tggtgggata aatgcaaggg 9361 ctgccgttga tgttggaatt gctgaatttg cagcaggtgg aggggttttt ctcacggctg 9421 ggcttgaagt caaaggaagt caagaagatg aaaacaaatc ctacataaaa caacagagta 9481 atcctctgtg tttgtttgac aagacaggag ctttatcact ggttgtattt gcatccctca 9541 gactcaactt tggatttttt aaagttacaa aacgcctgaa tttggcaaac gttgacctaa 9601 ttgactatag aggcaagacg tgtggagacg aaaaatataa agtacaaaat ccccctctaa 9661 cttctgaaat tcgcgctcaa ttggcaggac agggaatcat tgagcgcgaa ggtactgagc 9721 aaaatgacgt aataacttta gaaggaacag aactatttct cgttaacgca cccaaaaaac 9781 
ttaaagatag caatactgtc gttaacggta aggtcaattt actcggatta gattccaaac 9841 ctcaagaata tactgatgtt caacttattg tcatcaacgg gggtaaggga aatgatcgca 9901 tcgaatttaa agaccaagag gaaacaaact taccaattgt gaacgttcaa caaagtaaag 9961 atattgttgc cagcggacaa ttagacggcg gagaagggga cgatacactg attggaggac 10021 gtggcgatga ttaccttaca ggaggttctg gacaagatac cctcaatggt ggaacaggtg 10081 gacggaatac agcagtatac tctaacgcgc cggaagggaa aggggttaag gttgatttag 10141 ttcaaaactt tgcattggat gatggcttcg gtacaaccga tactttaatt aatatccaaa 10201 acattgaagg ttcaagcggc aatgacattt taattggacg tgcttctggt gatccaaaag 10261 acaacaactt tggtagttta cttgatggtg gtgctggtaa tgacacctta attggaggtg 10321 agggtgaaga tgttttgtta gggggtgcgg gagcggattt tatcgatggc aaaggaggct 10381 tggatactac aacttaccta gattccactg ctccggttta tgtcaatctg tctagcggta 10441 gggttcgcat cacatcgcca atcaatctcg gaacatcaat ccaactcaac gcaaatgcag 10501 gtgtaggtgg ggatgctgag ggtgacaaaa ttttcaatgt cgagaatgtt caaggctctg 10561 tttacgacga tattttagtc gccaatggta aagactctcg cgttgatggg ttattgggta 10621 atgacataat cattgctgca ccagaagctc aaatccttga tggtggtcct ggaatcgatt 10681 gggtaacata tcaactgtcg gactctggcg ttaacgttag cctcaaaagt ggattaggca 10741 aagctctcac cataactcag cctaatatat taccaggttt tccaccaatt gaaattgaaa 10801 taggcagtgg tggctacggt aaagacgata cacttgaatt tgctaaagat gccaataata 10861 atgtaataaa agaccaaagc tcgtttgaaa acctcgaagg ctcgaacttt gacgacatct 10921 tagagggcga tctacaagac aatattttgc gaggattggc agggaacgat aaaatttcgg 10981 gtggtgatgg taatgatacc cttattggtg gtgcaggcgc agatgtattg gatggcggaa 11041 atggtattga ctgggctgat tacagcgaat catttgcggg cgtaattgta aatttaaaaa 11101 ctaactctgg tttaggtggc gatgctcaag gagacacatt tgcccgcttg tcaccaacaa 11161 tatcaactat tgagaactta ttaggttcca agtttagcga tacattgaca ggcgacgatg 11221 ggaacaatga gataaaaccg ggctatgggc aggatacggt ttatggtgaa ggaggaaacg 11281 atcgcctgtc catcaactat tccaattcag ttgatttctt tgagccaggt agcggtgtta 11341 caggtggata ttatcctgct accaatccta gctattccta tagctcaagt ggcataacat 11401 ataatcctgg cagtgttggc agtgttgaaa atggtttcat ctctagaaga 
accagcgatg 11461 gtgcaaaagt actagatagc gtcgcatttt tcgggattga aaatctgacg ctgattggta 11521 catcctttgc agatagcatt tttagtgctt ccggtaatga tttcttgaat atgggtgatg 11581 gagacgacat tatcactagt ggacaaggaa gcgatcgcgt ttttgctggt agtggtaatg 11641 atattgtagt gtcgcagaat gacctttttg gacgaattgg tggaattatc gataatgcca 11701 aagcagtaat tgagttagac ggtggcgctg gaattgatac gttatcagtc aacctatcag 11761 gcaaaaagga ttcaatctct ctatttagtt ttagttcaac ccaagaaaat ccttatcaat 11821 cgttttctat agcggatgga acagtttcaa ttaagaactt tgaaattttc aaagacatca 11881 ttacaggcaa tggtgatgat ttcctgacgc aacttggtcg cgttgataat atctttgcaa 11941 ctggtggcgg taacgataca gtcgatgctg gattagggtt tgataatgtt gatggtggcg 12001 atttgaacga cttgttgtat atcaactact ctcttagcga tacaggaagt ggtatgagta 12061 tgactgtcga tccactcaag ctggttggca gtgcttctag aacaaaaggc ggtaccagca 12121 gtgagttatt agacaagatt aacttcagca attttgagcg gtttaatatt acaggcacca 12181 gcaaagccga tactattatg ggtggcgatg gcgcagatat actgcgcggt tcgggtggtg 12241 atgataccct cattggaaat cgaggtaatg accagctgtt tggtggcgat ggcaatgaca 12301 ttttacaagg tactaactat attagctacc tcatcggcgg tgggggagag tttaaaaatt 12361 atgattatcc aaaggatata gataccttaa ctggaggaac gggtgcagat accttcgttc 12421 taggtaacag tccttatgac aaatactatt ttggtaatct tttccaagat tacgcaatta 12481 tctctgactt caaatctagc gaaggtgaca aaattacact atctggtagt gatgatcaat 12541 atacatttga ggtcaatgcg ggtagcactt ctattcttcg cacttatgag tcatacggac 12601 cagatctaat tgcggttgtg caaggagtga ctgaccttga gaagaatgca aattatatta 12661 cttacttacc gccaccacca ccagtagtta aataattttc tagcagtatt acttcgcaag 12721 ttgctttcct taatttgtat caaaagggaa gcaattacaa gacttacatc gcttcattcc 12781 ccaactacaa gactaaaacc atgactttta acattacttc cttagattct caatttctca 12841 aacagttgct tggcgatacc acagggctaa gtaatttcac aactacactc acaggcgatc 12901 gcgctgcatt tggcaaattt caagatgacc cttttaactt aaaatctggg gtagttctca 12961 gcactggtaa agcttcagaa ttagcaggta gaaatatttc tgatggtggc ttgtctccag 13021 gtacaaatgt caaactgaag tttgagaaac tcgatggaac aacaggtagt cctagtaatg 13081 caaaaacagc tatctaccgt gcagatttgt 
ctgagttagg atttgactta aaatctttga 13141 caatcgatga tggtggtgta attggtggta gcgatggtcg ctttactggg tttgatttag 13201 atggaattaa actcagcaat actaagatta ctaacgctgc agatattcaa accttaccaa 13261 gtttgaatgt ctttgatttt agtccagcaa acactatttt tacaccaggt aatcagcgtc 13321 cacctagcga tgctaccaac ccagatttat ttggcactat taacggcact attaataata 13381 gtgttgcaaa tttagctaat tttgatggtg tttctagtac tggcaatgac gcgaatggtt 13441 ttgttagttt aggaaatctc ggtaaagtcg gttttaactt taaatcaaca gttacttcac 13501 aactgccact ttatctatat attggtgaag ttggtgataa tggggaggtt gcagccggaa 13561 aaattagcgt ttctaaccga cctattagcg ggctaagtga cctgagtaca gactttggaa 13621 ctcctggtga agcagatgat tccaccacgc ttcagattga ttttgatgta gatagcacta 13681 gtagaaaact attttttgaa tatgtttttg gttcagaaga atttgcagaa tttgcaggtc 13741 aatttaatga tagttttagc ctacaactta atggcttgaa ccttgctaaa ctcagtaata 13801 ataatacagt tacaattaat aacttagcac aaaatccttt tgaccgcagc aatcccgatt 13861 ttatctacaa tcctgctaat accggacccg ctagcgactc aacccgctta gatggataca 13921 ctaaaacaat tatatttagt ggcgatttaa ttcccaatgc ccgcaatagt ctcactataa 13981 atgttaaaga caaccgcgat ggtttattag attctgccgt tttcctcaaa ggtggaactc 14041 ttggtgtaac tccacctgat ggcagcaaca acacaccaag tggcaacaac acaccaagtg 14101 gcaacaacac actaagcagc aacaacacac caagtggcaa taacacgcca ggtagcaaca 14161 acacatcaag tggcaacaat acactaggtg gtagactcaa aaacgagggt ggcacaatat 14221 ttattcccgg ctctggaaat agttctcaag tcagtctgga atttgcaact gatagataca 14281 ccgctaaatt taacaacgaa ctgggtgtgt ttgttgttga taacgagaaa ggggaaatta 14341 atggtatcgc tcccggtgat gcaggatatc tacaagctgc aatcaagcga tcacaagtag 14401 ttttctccgc acttgccaat gacccctcat ccaaggctgg gaataatcgc atcatcaatt 14461 ttgcacctaa tacttacctc agtagctatc tagttcaaaa tgccagcact gatgaggtat 14521 tggctaactt agctgcaaat aaaccaactc ctaatgtctt cttctccact gttgctgcta 14581 atacagataa atttgaccac gtacaattca aaaacaccag tgatggcggc atttttatcg 14641 gctgggaaga cctgataggt ggtggcgatc aagactttaa tgagccattg gtatcagtaa 14701 aagtaactaa taatgccgct ccgttaggta acaaacttca agtgcaacgc gaactgattg 14761 atttacgaga 
tatctcagac gcagtaacag caaatttcat tactaacagc gaagctgctt 14821 ttgataatct tgttggtttt tatactattg atgatattac aggacgcatt ggaaatctgc 14881 gtccggaaga cgcaggttat gctcaagcag ctttgcaacg cagcgttttt gacatcaaac 14941 gcgatgaaaa tttcaacaag cagttaaatg gcagtgcttt gcttgcacca tttattatcg 15001 ctgatggtag cctagagcaa ttccttaacc aaaatcctaa caacaatcaa ttaggtggaa 15061 acccactcgc ctattttgcc ttttcgggag ctaacccaga caaagtagac cacatacgct 15121 tgttgggaga taacaagttt ggttttgaag atatttatgg tggaggcgat cgcgactata 15181 atgacattgt tttacaagtt aatcttagaa gtagttaaga cgagaagttg aggaaataga 15241 ggaatcggca actcgtccag tgcgattgac accaccagag aaattaataa cttcctgcaa 15301 atcaacagta ccttagacta ctctggtgct agtcctgact taaacctcta ccgactcagt 15361 tacactttcc cgctatttga agacttccgc gtgtccgtct ttccccaagg ctttgcttcc 15421 gactacgtgg atagaaacag cttcgccaat aacagtgcag gaaacttctc aacttatggc 15481 ttagtcaata atcagctact gctagcaaac gatagggcgg gggcgggagc agcaatcagt 15541 tggaatccgg gtaaaggttt gtttacaatt aggggagtct atcgcgcaca gcaagccggt 15601 cttgttaaca ccaaacctga ctttaccaac agtgataaac ggggaggtct atttgatgac 15661 cccaacctgg gtgtggtgga actagagttt tcaccttcta aaacttttgc gattcggctg 15721 caatatagtg ggggaacgca aggtggggaa gaatataatg tcgtgggcgc taatttagag 15781 ctagcccttg gtcagaaagt tggactcttt ggtcgcttcg gttatgcatt taactttcct 15841 ggtaatatcc aacctacctc ttggtcagca ggtattgtat tcccagattt tttagctaaa 15901 ggtgctaatc ttgggttttc tgtgggacag cctctcatct tccaagaaaa agataacctt 15961 ctgggctttt ttaacagtac tcaaaccaac tacgaagcat tctacagatt accaatcaac 16021 aacaatatat cagtttcgcc gatattgcag gtcataactg accctggaaa ctcgcaagca 16081 aacactattt atacaggcac tttgcggaca gttttttcct tttaataccc tcgcaagttg 16141 tgcaaatctc aagcttcgac cgggtacctt gttgattcta cgtgttctgc tgtgcggctc 16201 gcccgacaga ttttacatca acattaagct gttgagcaat ctgctccaca gtcatccctg 16261 tttttagtaa cagaggtaca gttactctca acatttcaac ttttcgttct tctcgacctt 16321 cttctcgacc ctcagctttc gcttcctgat aaatcctggt ttgctctaaa gtcagtccta 16381 acatggcttc cacctcttcc gggcaaaact tacaaaaact gataaacggc tagagcgtgc 
16441 ccgccccttc tacacgctct actgtgcggc tttgcggaca gattctacat caacattaag 16501 ctgttgagca atctgctcca cgctcatccc tgtttttagt aacaggggca cggtaacttt 16561 caacatttca gttttttgtt cttctcgtcc ctcagctttc gcttcctgat aaacccttgt 16621 ttgctctaaa gtcagtccca gcataacttc cacttcctct ctactcaagg tcgaaaactt 16681 gtaaacagca attgtggtga taatatctat tatttcgttt ttcggcagtg cgtccgtttc 16741 ctctaattgt acccgctcaa ttaattgctt tgcttgctcc gccatcgcat cagatgaagc 16801 gatcgccaac tgcattaaat tgatacctac tggctgctga ttgggttcac ctaactcatc 16861 aagatagatt cgctgcactt ggtcgctatt caaaaatatt cgatgagttt ttgtatcact 16921 tggttccaaa ctgcgtgatg aaaaaattac cacacaatac cagtcatcat attgagactg 16981 attccggttc aaatacatca tcgactcagt aaagaaccga tgatacaggg cttcatcttt 17041 ctgaaattgt acctcagcaa aaaaaataat tctaggtatt gcaccctctg gcggtaaaaa 17101 aaccccatcg atacgaaagg cagtttcttt cacttcaacc gattcaaatc ggtagttctg 17161 cgcttgtgag ggatggtaat caacgagttc aaagagtaac ccaggaaacc gcttaaaaat 17221 ttggtaataa atggaatcgc gtttcaccaa aatatccttc agagctaaat atattttttt 17281 tggaacaata cactcaaatt tccgactaga gatccccgac ttcttaaaga agtcggggat 17341 cttgtttctt caggaatcat ttagatgaga accgctaaaa tcatcggctg gtgagtgtta 17401 gggcggaatt ttcagcctgt tctcacctgg aaacacagga atacactaca cctttaggga 17461 tattaacaat tttctcctga ctgttcacaa gcgccaatac taacccagct acttgaacca 17521 tctgcccaat gttctttttt ccaaattggg gcgttgtgtt tgagagtatc tatcgcatag 17581 cgacaagctt caaacgcttc ggaacgatgc ggacaaccca cagcaactaa aacgctgatt 17641 tcaccaactt gcaagcgtcc aatacgatga taaataacga cgcggttcac aatagaccat 17701 ttttgacgaa tatcagcagc aatttgataa aacactcgca ttgccatagc ttcgtaagct 17761 tgatattcta aggcaatcac aggtttacca tctgtttgat tacgaaccat cccactcatc 17821 agcacaactg cgccattttt acaatcatcc gctttgccat atacctcctc aatagacaat 17881 ggcgcaaagc taatcgcaaa gctatcttca gctttcggct gaacgggaga ggtaactgaa 17941 gctacgagac ccaagttcat ttgaatgatt tctatatgta tgggggattt attcttttaa 18001 gttgaattct caaaaaagaa taccaacaat tatataaatt ttgttgctct tacgtaattt 18061 cactgaaaat aaaattcttt gtcatcctcc tttaggcata 
ttactataat gaagtgattc 18121 aaatggagaa tgcgatagta atttagttga actattaagt gcagcaagta ccgatttgac 18181 ctaaaggtaa gttcctactt cgagctacca gttgcaagta gtttaagtag ccgtcatgaa 18241 ctgcgtatac caaaacacat gaaaagattt ctccttcact ctgtgactgc ctccctcctt 18301 ccttctctcc cttcctcact ctctccctgc acggcagttt gcaccgtcag gggaacccca 18361 acgctagata cctacagagg aaaaccctca tcaagtactg gtgcgcaacg cactacctcc 18421 tcactccatc actccctccc ttcttttctc cttattttca actcaggtat atttggtttt 18481 tgaaaattaa aaaccaagtc tacatggcag ttgcaaacca acttacagta gacattgttc 18541 tcaaggcgtc attttaaatc agttatcagt gaacaggctg ataaaactta taactggtaa 18601 ctgttcagtg ttcactgtta aaacttcagt tattgaagaa tctggagaac tatatatcac 18661 atcaaggcaa gcaatcattc aatattatag gaaaagatat aatgaattct ttattgtcta 18721 acaatgaagc aacaaggctt gtggctctgc atcagtatga cattctcgac acatcgccag 18781 aacaagcatt tgacgatttg ggattcttag ccgcccagac ttgcccaaca cccattgcag 18841 tcataaattt gatagacgct aaccgtcagt ggttcaaagc taaagtcgga ttggatgtag 18901 aggaaatgcc tgtggatttt gggttttgtc ctctttgtat ccaacaacgt gatattttag 18961 tcattcctga cactttgagt gatgaacggt ttgccactaa tcccgtagtc acttctgccc 19021 cttatgtgag attttatgct ggagttcctt taataacacc agaaggacac gcgatcggga 19081 ctatatgtgt tgttgatcgc gtaccaagga aaatcagccc tgaacaactg gagtcgctaa 19141 aagcaattag ccgcttagtg atgagacaac tagagttacg ccgaaattta actgaagtcg 19201 ctagcataaa aacagaatat cagcaaacag aagagaaatt gcgttggaaa gaagcacttt 19261 tgcgctcaat gactgacgtc tctctactag cattttacgt tgttgaacaa cgtactagta 19321 agattttgta tgtcaatgaa ctcttttgtg aaatttgggg cattcaacac ctgaaagaac 19381 aaattgaatg cggcaaattt aaacatgagg atgtcatgaa caaatgcgac gaattgacag 19441 aagtttcgcc ttttatttcc tgtcaactaa ttcccagtga aacgtgccta tgtgaagacg 19501 aaatcttctt gaatgatggt cgcattatca ggcagttttc taaatgtatt ctagatgaaa 19561 atgaccaata ttttgcacgg ttatatattt ttgaggatat cacagtacgc aaactagcag 19621 aacagcaagt tcgtgaacaa gcagtattac tcgatatggc tagggacgca attattgtac 19681 gtgatttaag cagtcataga attttgctgt ggaataaaag tgccgagaac ctttatggtt 19741 ggaaagtaga agaagctctt 
ggtaaaaaaa cagacgaaat tttctctaac gaacccttgc 19801 cgcaatactg ggaaatttat aaaaatgtct tggaatctgg ctcatggcaa ggtgagttgc 19861 aaacaatcac gaaatctggc aaaaaaatta ttgttgaaag tcgttggaca ctaatacgcg 19921 atgagcattt tcaaccgaaa tcaatccttg tagttgatac tgatatcaca gagaaaaaac 19981 aattagaaaa gcaattttta cgcaatcagc gcctggagag tataggtact ctcgcaagtg 20041 gtattgctca cgatttaaat aatgtgctgt ctccaatttt gatgtcagta ccaatgctca 20101 aagcgagttg cgatgatgag cgtagtaggc aagtgctaac cattgtagaa aataatgcca 20161 aacgcggcgc aaatttagtc aaacaggtgc tgtcatttgc aaggggaact gaaggcgatt 20221 gggtaaaatg cagcccctct acgggcgatg gcaagggctg tgattatcca aagcgtgact 20281 cccatagggg cggcgaagca ggaaatgctt gcgctgaact tccttcaggg agccaatctt 20341 caattggctg ggcgtatacg agcgatcgca ctgtacttca attacagcac ttgattttag 20401 aaatgaccca agttgtcgaa caaacttttc ccaaatcaat agtcgtccaa actgaaattc 20461 cctcaaattt attaccttta tatggtgata gcacccaaat tcatcaagtg ttaatgaatt 20521 tgtgtctcaa tgcgcgtgat gcgatgacga atggcggaat gttgaaaatt tatgccaaga 20581 atgttttcat tgatgaaaac tttgccaaaa tgcatcaaga tgctcaagta ggttcttaca 20641 ttgtcctcac agtttctgac acaggaattg gtattaagcg cgaattatta gataaagttt 20701 ttgaaccatt ttttacaact aaagagtttg gtaagggaac cggacttgga ttgtcaactg 20761 tcatgggtat tattaaaagt catggtgggt tcattactgt atctagtagc gtcggcgagg 20821 gcacaaaatt tcaagtctac ttgccagcaa tcaaggcagc tgcgacgcag ttaacggaaa 20881 atcaagaaat tataccaata ggatatgggg aatggatttt ggttgtagat gacgaacctg 20941 cgatcagaga aataacaaaa gtttctttag aaaagaataa ttacaaagca atgactgcta 21001 gtgacggcat agaggcagtt gcactttatg cccagcataa agataaaatt agtggagcaa 21061 ttatcgatat catcatgcca gagatggatg gagttaccac catcaatacc ttgcacaaaa 21121 tgaacccact gctaggtatc gtagctgtca gtggactagc aacaagcgaa cagatgctat 21181 taaacaatgt ctctgatctc atagcatttt tacctaagcc ccatacagca caggagttac 21241 taaaaacttt gcattcggtt atttctcgtt atgagatagg gaacagggaa cagggaacag 21301 ggaacaggga atagggaact cttaacagaa cctcgtaaaa tctcactttt gcaagaggtc 21361 tattaatcat tagtccaacc cctgttgcct gttttctctt aataatcact aaaacggctg 21421 
gcgtttacct tgtccacttg caaaggtccc tgactccagc cgttttttgt tgaagtcgga 21481 aaagatgagg gaaaataaca aataggtgat aacaaataac tgataataaa tatctaatga 21541 ctaatttgat tctggttgta gacgacgacg ctttaacgcg gttacaacta cgaactttgc 21601 tacaaaaaca agggtatcaa gtagcagagg caatcaatac tgagcaagca ttagaattgt 21661 atattaaatt acagccagac atggtgctgc tggatgccct gatgccagtg atgaatggca 21721 ttacctgctg tgagcaattg caaacactcc ctggtgctaa gaaaatacca gttttaatga 21781 ttactccttt ggacaagtca gcaacagtag aacgcgcttt ggctgctggt gcgattgact 21841 acattactaa accaattcag tcgcaagttt tgcgtcaaag agtacgtcgt cttttggaag 21901 cccgtgacgc gatgcagaaa ttgcaacagc aaacggagca agcgcagagt cgggaggtgc 21961 aactggtgat ggcattagaa gcagccagca tgattacctg ggactgggac atcctaaata 22021 acaagttgac ttggccggat aacctcaaac cgctgtttgg cttggaaagt gctacgtatg 22081 aagcttttat tgaacgtgtt catccccaag accgagattt tgtcaaccgt tcggtgatgc 22141 agactctcca aaaagggaca gaatacgaca tcgaatttcg tgttgtttgg cacaacggta 22201 cgatttgttg ggtagcaagt aaaggtgtcg tttttcgtga ttcttctgga gtcgcggtgc 22261 ggatgactgg aatagggatg gacatcagta aacgcaagca agcggaggag gcgttagaag 22321 tttatgccaa ccgacaggca cttgtcgcag aactcagtca aatggcactc gctggtgtag 22381 atttaaccac gctcatggac gaaactgttg ccctggttgc tcaaagcttg aaagttgagt 22441 attgcaaagt tttggaactc gatagcgata ataacacctt actgctacgg gcgggagtgg 22501 gttgggagcc agggcttgta ggatatgcaa ctgttagtgc tgaaatggac tctcaagctg 22561 gttataccct gtcttgccaa gagccagtta ttgttaatga tctgggcaca gaaaagagat 22621 tcaacggact tcagctactg catcagcatc aagtcgtcag cggtttatca gtcatcattc 22681 atggcaaaga gcgtcctttt ggcgttttag gggcacacac aaccacacag cgtacctttg 22741 gcaaagatga catttacttt ttgcaagctg tagccaatat gttggctaca gctattgagc 22801 gtcaaaaagt cgaagatgcc ctcagagaaa gtgagcaacg ctggcagtta gctttgcggg 22861 gcaataatga tggtatttgg gactggaatg ttaaaacgaa tcaagtcttc ttctcgactc 22921 gctggaagga aatgctcggt tactctgagt atgagatccc caatcatttt gatgaatgga 22981 tgaagcgcgt gcatccagat gatatgactt cggtaacacg agtcattcaa aatcactttg 23041 caagaattac ccctttttac attagtgagt atcgaatccg gtgtaaggac 
ggtagctaca 23101 aatggatttt ggatcgaggt caagcactgt gggatgatga aggtacagtc gtgcggatgg 23161 ctggctccca tacagatatt accaaacgta agctggcaga cgaaaaacta caagaaagtg 23221 aaaagcgttt tcaaatactc gctcgtgcta cgaatgatgc tgtctgggat tgggatttac 23281 tgacaaataa ggtgtggtgg aataacaacg tgcaaactct gtttggttat tcaacagaag 23341 aagtcaaaaa tgaagtgact tggtggcatg agcatataca cccagacgat agagagagaa 23401 ttgtttctga tattgatgct gtgatcaaca gcaatgaaca attctggtcg aatgaatatc 23461 gctttcgccg tgtagacggt tcttatgcct acatttttga gcgcggttat gttgttcatg 23521 acaacacagg caagtcagtg cgaatgattg gcgcgatgat agacttttct gagcgcaagc 23581 gggttcaaga agagttacag cgtcagaact tgcgatcgca attgtttgcc gatgtcactg 23641 taaaaattcg tcagtcctta caaattaatg aaattctgca aaccaccgta aaagaggtac 23701 aaaagctact gcaatctgag cgtgtcctta tttttcggct actttcaaac ggttctggaa 23761 tagtggtgca agaagctgtc gttcctggtg tacctgctgt tctcggacat aacctgcatg 23821 atccttgctt tatcgaagat tatgtccaca aataccgcaa tggacggatt agtgctgtta 23881 ccgatattga gcaaggtagt atcgagcctt gctatatgga atttctgaaa aaacttaatg 23941 ttagatctaa cctgattgtc cctattctcc taaaaaacca actttggggg ctgctgattg 24001 cccatcagtg tactcaacct cgccaatgga cgagctggga aactgaactt ttgcgacatc 24061 tggcggatca aataggcatt gctcttgcgc aggctcaact cttggaacaa gaaactcgtc 24121 aacgccaaga actcacccgt tccaatgaag aactgcaaca atttgccttt atcgcctctc 24181 acgatttgca agagccatta cgtaagatta agacatttgg cgagcgatta aaagcttctt 24241 atggtgacgt tttaagcgaa cagggacttg attacttaga acggatgcaa aatgcaaccc 24301 gtaggatgca agctttgatt gaagatttgt taacactttc gcgagtgact actagagggc 24361 agccttttgt cccagtggat ttgacacggg ttacgcgaga ggttttatct gatttggagg 24421 tgcggatcca acaaactgag gcgtatgtgg aggtgggcga attacctatc attcacgctg 24481 atcccctgca gatgcgtcaa ctactacaaa acctcatcgg caatgctttg aaattccacc 24541 gtaaggacga accacctatt gttaaaatct atagtcagac attaaaccat caagatgctg 24601 cccaattgtg tcacgttatt gtagaagata atggtattgg ttttgatgaa aaatatcttg 24661 accgcatctt caacgttttt caacgcctgc atggtcgtag cgaatatgaa gggactggta 24721 taggtttggc tatctgccgg aaaattgttg 
aacgtcacaa tgggagcatc tcggcacaaa 24781 gtagacctgg gcaaggatcg aaatttttta tcatacttcc aatgcatcct cctggataat 24841 atgctcttgc ttaaaagggt ttatcatacc cccagaggtt gtcgaggatt tggcaaccta 24901 tcttcattca tttgttgcaa tttaataatg aattaatcct aaaaccaacg caaaggagaa 24961 aatgcagagt gaagggtcgg caaataaccg tcaccatttt gatggctgat gatgatgagg 25021 atgattgtat gttagctcgc gaggcgttgg cagaaagtcg agtggcaaac gagttacaca 25081 tcgttaacga tggtgaagaa ttaatggatt atctttatca tcgtggtatg tatacccaca 25141 aaagcagtgc accgcgacca catctaattt tgttagattt aaatatgccc aaaaaagatg 25201 gtcgtgaggc gcttagagag attaaagctg atccacattt aaggcaaatt cccgttgtta 25261 ttctgacgac ttccaaggca gaagaagatg tttatagcag ttacaatttg ggtgccaact 25321 catttatcat caaacctgtc accttcgcgt ctttggtaga agttatgaaa acactaggaa 25381 agtactggtt taacatcgtg gaactgccac tataagcaaa aggaggcaga aatgaaagaa 25441 agtccattca aagttcttct tgttgatgac gatgaggatg actacgtttt aactcgtgat 25501 tggttcagtg aatttcaagt cgcttgtgga gaattggaat ggaaaaataa ttatcaagca 25561 gcaatggatg ccattgttaa gaatcaatac gatgtctgtc ttgtagatta tcgtttgggt 25621 gccagtaacg gactagattt gttgcgtgaa gccattcatc aaggctgctc ttccccaatc 25681 attttactca caggaaaagg agatagggaa atagacattg aagcgatgaa agcaggtgca 25741 gcagattatt tagaaaaaag tcaattgact gctcctttgc ttgaacgttc tattcgctac 25801 gcagttgagc gcaagcgagc agaacaaaag attcgcgaac acgccgcact ccttgatgtt 25861 gcgaccgatg ccatttttgt acgcgattta aacaagagaa ttttattttg gaataaagca 25921 gccgaacaac tatatggttg gaaggcaact gaagcaattg gcaagaatac atcagaactt 25981 tggtatgaaa aagatatagt acaatttcaa gaagctcttg atagtttatt gaaaaatggt 26041 tcctgggagg gcgaactaca tcaaataaca aaatttgata aagaaattat cgttgaaagt 26101 cgctggacac tcgtgcacga gtatgataaa caaggacaat caattcttgt tgttaatact 26161 aatattacac aaaaaaaaga attagaagca caattttttc gtgctcagcg tttagaaagt 26221 attggaactt tagcgagcgg tattgctcac gatctcaata acgttcttgc acccatttta 26281 atgaccgctc aacttttaga atcgcaactg aatgatcagc ggagtaaacg actgctgcca 26341 atattgatat ctaatgctaa acgaggagca aatttggtta agcaagtgct atcatttact 26401 cgcgggattg 
agggcgatcg cactctcttg caattaaagc acttaattac agaaattcag 26461 caaattgtta aagaaacgtt tcccaaatct attgaagttt ccacttcaca agagcaaacc 26521 ctttggacag tttcaggtga tgcaactcaa ttgcatcaag tcctgataaa tttgtgcgtt 26581 aatgctcgtg acgcaatgcc taatggcggt caattgacaa tttcggcgga aaatttcatt 26641 gttgataaaa attacgccaa gatgtatatt gatgctcaag tcggttcgta tgttgtcatt 26701 actgttactg atactggagt tggtatccca caggaaatta tagaccgcat ttttgaacca 26761 ttttttacaa cgaaagacct aggcaaagga acaggacttg gtctttctac tgtacttggg 26821 attgttaaaa gccacggtgg ttttgttcat gtgtatagtg aggtaggaaa aggcactcaa 26881 tttaaggtgt ttttaccagc acaagaagca atggaaactc cagaagaaca agagctggaa 26941 ttaccaaacg gtaatggaga actcattttg gttgtggatg atgaagattc aattcgggat 27001 gttacgaaaa catcgctaga aagctataat tacaaagcaa taactgctag tgatggcatt 27061 gaggcgatcg ccctttatgc agaacatcaa aatgaaatct ctgtggtctt aacagatatg 27121 gttatgcctt ctatggatgg aataaccaca atccggacct taaaaaaaat aaatcctgca 27181 gtcaagatta ttgctgttag tggacttgct tctagtgaaa aggtgaatac agtcaataac 27241 atgggtgtta aagccttttt atcaaaaccg tataccgcca agcagttatt acagactatt 27301 agtgctgtca aaagtgggaa ttaaaataag ggaaatcgtt aattgttagt agtgactaac 27361 tactaacaat taataaacaa catttgctat tcccaatctc cacgaagaat aatgagcaac 27421 caatcttctg aagttttagc cgtcaacgca gctttttatc gagcttttga aaaaaaagat 27481 atcgaggcaa tgagtactgt ctggtctcaa ggaactggca gtttttgtgt tcatcccgga 27541 tggaatgtac tgcgcggttg gaaggagatt cgctcctcct gggttaacat ttttaaaaac 27601 actgcttaca tcgagataaa tacggagata gtgacgacag aagtgcgtga tcacatcgcc 27661 tatgttgtac ttgtagaaaa tgttcttcaa atcattaacg gtcagagaag actagaagca 27721 caatcaattg ctacgaatat gtttgagctt ctaggcggta agtggtatct tgtgcatcat 27781 cacgccagtc caattatgcg ctgaaggagt gatagggaag atgaggaaga aataattatc 27841 tcctagtctc caatcaactc cactgcatca cgtctatccc aattgtctaa gtaaactttg 27901 acaatctctt ctttagcagt gggtagttgc ttaccaacaa aatctggatg aattggtaac 27961 tcccgatgac ctctatctac cagcacagct aaacgaatga tctctggtct gccatactcg 28021 tttaccgcat ttaaagcagc acgaattgtc cgtcctttat aaatgacatc atcgacgagg 
28081 acaactgttt tccctgttaa atcaaaaggg atatcagttt ttgcaggagt ccgcaaccca 28141 atttggtcca ggtcatcacg ataaaatgta atatccaagg ctcccgtcgc tatcggtata 28201 ccttcaagtg cctcaatttg acgcgtcagt aactgggcta gaggtacacc tcgggtataa 28261 atacctagca gtacgagttg agacaaatca cgcgtttttt ctacgatttg agaggcgaga 28321 cgattcacgg tacgacggag ttcttcaggt gagagaatct caaccacttt ggtttcgaca 28381 gacgcagtca tagaccaata cccctgaaca atgagtgaga cgactataac aaatgggtga 28441 ttgtgactgg cagctttggt ggttagtcaa attttttact aaccactaac tactcacaac 28501 tagccattcc caattcccta tttccatcat gacgctcgct tcgctcaaat tcaaagtttg 28561 cttgttcaaa attcaaaaaa aaatagccct cactccctca cttactcagc agtagaacta 28621 ttgcaacccc cagccacagt aaaaaacaat atcgagtgag ttgcaaagct tgataaatgc 28681 aagttggggt gatgggatgg atggggtctc ccagcagagg tttatgtttt gcaaccccac 28741 gataccaatt tgtacctccc acctgcacgc ctaaaatagc agcataggcg cactcactcc 28801 agccagagtt aggactagga tcacaaacag catcccgacg gcaaattcgc caaacatata 28861 agggtttccc cgacagaagt gccagagtta tcacagttaa ccgacaagga atccaagtca 28921 aacaatcttc taaccgcgca ctgaaccatc ctaaatatgt ataaggtttt tctcgataac 28981 ccaccattga atcaagagtg ctgctggctt tatatgccaa agccagggga gttggaccga 29041 taactgggat aaatgcacca ataatggcat aaaaaagagg agccataact ccatcagtcg 29101 cattttctgt tactgtttct agaacggctc gtaaaatttc tagttctgtt aagttttgtg 29161 tatctcgtcc tacgtaatga cttaaagcag aacgggcatc tagaatatgt cctgctgtta 29221 atggttgtaa aactgtttcg gctgctactc tcaaactttt aaaagcaaaa cagctagcta 29281 agagaatact ttctaacgcg actcccaaaa acggatgcag taacctagca ctttgaataa 29341 gtaggtagcc tataaacccg ctaccaatta ttaggataat gcctaacgca attcccgcaa 29401 ggcgctgtgt cagggaatta tgacaatatg tgatactaaa tttggtcaag cgagaaattg 29461 tccaccccat gactcgtact ggatgaggcc aaccccacgg atcgccaatg aagtaatcta 29521 aagtagcagc aataattaaa acaacagctg atgtcattag tcaaaagtca aaagtcaaga 29581 gttaacggtc aagagtcaag agtctaatga ctaatgacta atgactaaac aataaaattt 29641 gcttcttccc caagagaagc ttgccaacta cgagcgtcgt aataaagatc tgccaaggta 29701 atactgtaca aagcttcttt gagtttttgg tgcagtcttt 
gccaaagagt aaatgtgacc 29761 caatcttctg cttgtgttcg agaagcttgg tgatggggta aaggttcaat agtttctcca 29821 actgcttcta aaacttctcc taaagagatt tttgcaggtt ctcgagctag ctggtatcca 29881 ccgatgctac cacgcattga ttttactaat ccagcacgac gcatttctat aagcagtttt 29941 tctaaataag gcgctgggat atgttgacga gaggctattg ccttagtgga aacaggacca 30001 tacccaggct gtaaactcaa atcgagcaac gctttcacac tgtagtgtcc tctagttgtt 30061 agtttcattt tgataagctt tgtcctttgt cttttgtaat gactaatgac aagtgctaat 30121 gactaagaat cagttttgct tatcattctg tgtgccagtg atcaaatttt gtaagtagac 30181 aactttttta gctatgcttt ataaaactta aacaaataca gggtagtcgt caccaccaag 30241 ctagaactgt aaagtatgaa ttatcctgta cgatccatta gggggcttaa aacagtgaac 30301 agtgaacagt gaacagtgaa cagttatcag gatttggtga cggggattta gaccgcgact 30361 gaaacatacc acttaaataa gtgggagact ttagctctct ctcaaaagtg ggccgcaatt 30421 gctgataact gataagtgat aactggttga aataaggata ctccactgga atattttctc 30481 tacggttcaa tcgaggactt gtttcctctt actttatcct ttatttggtc attgacaaac 30541 atacaaacta tagttatcac cattgcctat cttctttaga atatattgta ttggtaatgt 30601 tggtcattca gtcaaaattt tgctctctga gtgtaatata cttggcgata gcaccaccag 30661 aaaaaataaa gaatagcgct cgcattagca taacctcagc taaaataaaa tcgattcact 30721 gtatttgacc attcgttttt gaattaagtt gatggctaaa cagaaaaaaa ttgaacccct 30781 agttggtgaa gatctgctca agaaagtcaa agagctagaa aacctaagca aagaagaaaa 30841 agctaaacac tgtggctact acaccgtaac caaaaacggt atagagcgcg tcaacatgat 30901 gaaattctta aatgctctta ttgatgctga gggcattcag ttagacagcg ctcccagtgc 30961 aaatggtcga ggtggacgca gtgcaagcta tagaattagc gtgcagtcga acggtaactt 31021 gttgataggt tctgcctata ccaaacaaat gaatctcaaa ccaggagatg agtttatcat 31081 tactttaggc aaaaagcaca ttcgtttgag acaagtagac tcagacgaga gagaagaggc 31141 tgagctagca gaagttgctg tgtaaatagt tattagtcgt tagtcaaaag tcattagact 31201 gcaaagctgt tgactaatga cccttacggc gaaccccagt tccctgcggc gggagacccg 31261 cctacaggac tggactcact tttgactgtt ggcttattga cttatgctca cacctgtttt 31321 gcgagtcatt ggtaagtagt ggcgaagtgc tctgcgcgtt acagcgacgt tgcaggttaa 31381 tccaatttgc tctctttcta 
gtaaacctaa aatacgctct tggttatctc gatcaaccac 31441 tggtaagtga tgcaaacctt taagagccat gcggtctaaa gcttcggcta gaggttcatc 31501 ttgtaatgca taaagaatat cagtggtaca gatatccatg agtgtttgat tggctgtgtt 31561 agcctggatt tcacttgagg aaattggata tttttcccaa aggcgaagag ttcggttgat 31621 atcttctaaa gagacgatac cgactaattg ccctgcttca ttaataacta aagcactctg 31681 gcagagatca ccactcattt ccaaagctgc ttctatcaca ttcatcgtta gtagcagctt 31741 tttgggagaa gaatacatgg catcttgtac caagatttgg tgcaaaattt ctgcttgctc 31801 gtctttcaac tcaggaagac caatttgttg taggttggaa ttagaattag aagttggttt 31861 gattcgttct acaagccaaa cactcaaacc cactgctgcc attaacggta aaacaatgcg 31921 atagtctcgc gttaattcaa aaagcaacaa aattgctgtt aatggagctc taacacttgc 31981 agcgagaact gctgccatcc ccaccattgc gtaagctggg ggagctgcca tatactgtgt 32041 gactggaggt actaggcttg ctaaaatttt tgcataagct gatccaaaag aagcacctaa 32101 aaacatcgct ggtgcaaata caccgccaac aaaaccacta ccagcactaa cagccgtcat 32161 caacagcttc accaccagca gcaccagcaa caactgtaaa gaaaacttca catcctgaag 32221 catagcttct atagtgccgt aaccgatgcc caaaatttga ggaaactgta aagcaacaat 32281 gccaacaatg gttccaccaa taattggatg aatgggttcg cgtatgttac ctagccatcg 32341 caaataaggt attttcccat gaaagcaggc ttttgccaaa gcaatgagtt gggtataagc 32401 tagagaaact aaactggctc ctaaacctaa gccgagataa agcggtaatt ccagaagact 32461 gcggacttgg taaacgggta gggcaaaggc aggctgtgca cccaaaccaa tttgagcaac 32521 taatgccgca acgaccgctg caagcagcac gacactcacg gcagaagtgg caaatgatgt 32581 tgcgcctaac accacctcta aagcaaagaa aactccggcg atgggcgcat taaatcctgc 32641 agctaaacca gcagcagcac cagcagccaa aagcaaccgc tgtcgttctt gagatacttg 32701 taaaacaaca gacagcagca taccaaaact tgcgccaatt tctacacttg gtccctctgg 32761 tcctagagaa gcaccacttc ctaaagacac tgatgctgcc agcatctttg ttactggtcg 32821 taattgtccc ttgatctctc ctccttggga tgcggcgatc agagttgaca gtccaggacc 32881 aaagtcttgg gtgcgccagc gcatcaatcc gataatgagt ccaccaagga tgggaacaca 32941 cgctaaagtc caagcgcccc aagcaccaat gacacccata aaattttcca gtgtcaagcg 33001 atgaatcagc tcgattaaat agtgaaaggt caccacaccc attccaccac caccgcctat 33061 
aaggactgct aaaagcagca caagggtttc tggggatggt tgaaaacggt taagcagatg 33121 agtcagacgg gcagaaaaag tggtaaaagc aggttgttct accaccttcc ttagttcagt 33181 tgaaggcaga agagtcattg agtggagaag taaatctaat aaaaaattta aatttttctg 33241 tcttacattt cattgtgagt gatcatttag caacaagaca agtccattgc atatgttagt 33301 tttacaaagc tttagtaaat taattcctgt tgctggcaat ttgtgtaagg acaatgctta 33361 aatgggaaag aattgaaggt gaatggttat ttgcagcgct catctagagt agtgaggagt 33421 aaattgaagg ttaaagagtg tagacgccct atgacgatga gacagcgctc tatggagggg 33481 agccactgtt cgactagggt ttcctggcag tcggtgcaag tggcgttttc tggtgccttg 33541 gcgagtgtgt atctctttga gagacgcttg gcgttggcgt aagcacaaaa cgtacgaatc 33601 gggcattgca agaacctgtc cttttgggac ttacaaagca acgtgcccgc aagacgatag 33661 agatacccaa tagccgtaag gcgtggcgtc agccataggg cttgccacag actacgataa 33721 attatgaact aaagcaaaag tgcatataca ccttgagggt gacttcctac agggtggtat 33781 gaactataaa gtgaattatg gagtcattga tcctttatcc ttttttggcc attctcagaa 33841 aaatctagct atactcgtgt tgcaaccatt ttgccatgct ggtagcacaa gcttttgctt 33901 gtctaaatta tggatattgt atgcgtgctt tctggctaat gtcagagcag accaagtaat 33961 ctacacgcat tttgagcata gattctcgac atggtattta aaagagcaag taagtggact 34021 agacggacga ggtagatgaa tacacacacc tttgataatc attatgtgac ttgcccaatt 34081 tgccaaaaaa atactaagcc caagctagta aagacgtgta tgggattgta tacctgtccg 34141 tattgccagg aaaggctagt agtctgtcaa agcggtcatt atgtccgcga cccgtttgcc 34201 tacaaacaga taatgatatc gtcggtgctc cgtcgccaaa gccgaccttt agctaggatt 34261 ctcagggatt ttaccatcct caagcgccct gtagttgctc tggttatagg aggtgctatt 34321 ctgttgagtg tcattggtat gacacagcaa gcctcagacc agaattctcc aaggttgcca 34381 aaaacagaga agcaacgtta gaaattggcc ttcgcttaat tcaaagtatg ccctatgttc 34441 acagaacacg cactcgcgtt tgggtaggaa ctttacaaaa ttcccaattc aaaaaaaatt 34501 tttgaatgtc tcatggtgaa tgcgtgaatg tttcagcagc gagtggtaaa ccagtactgc 34561 acaagggaag gcagcatgcc cttggagatt ccagaggcag tgcttggggg ttccccctaa 34621 gtctgcctct tgaagcttca acagaccttc ttttaccttt atagcacctg ccgttccgtt 34681 gtagcacagc tgcgtggtct ccccggaaag cagcgtgtcc ctttgggact 
tatactctgg 34741 cgtgcgcttt acgcttacgt agatccctac aaccgaggaa acctccaaca ctgattcaca 34801 tgcaaggtct ctacatgttg ctgtgtatgt gattcaaacc agaagcttta tattatcttg 34861 tcttttccaa tattatttag tattttttgt aactagtgac caaccggaag cttttttcac 34921 ctttatcgac tccagtttga gatcttggag tgcagtttca tcaagcagaa agtaaggttg 34981 tttgtcagat ttccaataat tctttagttc attagaggaa gcgggaacaa tagtgcgatc 35041 gctataaaaa tccagtgatg gacgatgatc acgaaaagat gtgtaaatct tcgttactgg 35101 tgggttgacc tgctggatca tttgtgctac tggtttgact ggataagctt cccccagttc 35161 ccaaacccag tcattggatt tcattaacag cagtagtgaa acataactgc cccaaaatag 35221 aatcttgagg aattgactgt cacctcgctc tgccaaaata gcagccattg ccattgtgat 35281 cgcgactgct ccaaaaacca gctgcaagtc tgattgtgct gtacccccta aaccatagta 35341 aatactgact ccagttccca ctatagatat taatactaaa gataaaacca aataccgagg 35401 ataagatgat agtattggca tattttcaat ttccgcaagc ttagcaccac aagctaaggc 35461 tatgcctggg tagattggaa acacgtacca aggaagttta gtttccatca aggaaatagc 35521 gatgagataa acaccacacc acactagtac aagtttcgcc cagctcaagt tgcgattttc 35581 ccaagtggag cgcaaactag aaggtaaaaa aatgagccag ggccatccgt acttgaagat 35641 ttcgagtaaa taataccata gcggttgaga atggccctca acagatttcc acacacggct 35701 cagggattga tccatcatgc cggttttggt aaatttgtcg ccatagtata cgaactgggc 35761 accataccaa aaagtgacag gtagaatgcc aagtagaatt cctgtccaca tgtatcgact 35821 ggtgagtagt cgcggtgtgt cccaaaagag aaaaacaaac gcgatcgccc ccaacaaaaa 35881 acctaatata cctttagtta gacaaattaa cccaaaaccg attccaacac caaggcagta 35941 gcgtaagttg cggcgcgatc gcagcgaaca cagtatcata accatgaaaa aactcaccac 36001 tgccccatcc aacattgcta accgcccatg acgtaccact ggtagcattg tcaggtaagt 36061 caaggcgcta taaatagctg cccaacgttg gtgaaatatt tctcgtccaa tgtaatacag 36121 taaaggcact gacatcgctg tcagcagtgc tcccggtagg cgtgatgtcc actcattaac 36181 tcctcctaag gagtaagccc aagcaattag caagtgcatc agcggtggct tattatgata 36241 tggctcacct ccaagcgttg ggtaaagcca atgcattgaa ccagctgggg cacgccatat 36301 ttcgcgtgca acttgtgcga cagtaccttc atcccaatct cgcagcggca attctcccaa 36361 atttatggtg taaagtaaaa ctgctgccaa 
aagcagcact atcacccata cccaatcaat 36421 ccatttgtct accgtgcgat actgcttctc cagacgaccc caaataaagc ttccttcttg 36481 catgttcggt ggatatttat tgtgcttgct ggttcgatta aactccagac gctttggagt 36541 cttgaatgtt gccacttgtt accacaagtg ttctgaaaca atcccttctt ctttaaatta 36601 acttatattt tctcgtaaac acataacatt tcttacaaat ttttcctctt tttttcattt 36661 tatttcatac tttttatttg gagaaaagtg ctacaactat tactaatcaa aatacccaac 36721 tcgttttttc gggagaagat ttatgaatct acctgttgtt gtagatattg ctcttggctt 36781 aatttttatt tacttgattt tgagcttact gacttcggaa attcaggaat taatttccac 36841 ctttttacaa tggagagcaa aacatttaaa aaaatcaatt caattgcttg tagcaggtgg 36901 tagcgaaact caacagtcag atattgacga tgccacagtt ttagtacata aactttatca 36961 agatccctta attaacacac tgaatcaaca aggtcaagaa atagttgaaa aggaattgca 37021 ggaattgaat caacttaggg ttgatccaaa aacattaaaa ggaaaacaaa gcgctccttc 37081 ctacattcct tctgaaacat ttgcaattac attattagaa gctttaagaa ttccagagtt 37141 aattaattat gttaaaaatc cctctgatac gaaaacaaac ttgcacatga tattatcctc 37201 ttacaaagaa ctgaaaacag ctattaacga caagggtagt gatagctatc aaaccattca 37261 aaatatttac ggagacatta gtgaaaataa cgaatttaaa aagcttgttc agggtttacc 37321 tgagtacgta cctaataatc tcatcacgag tcttagtctt ctagctcagc ggagtaggat 37381 aaaaattggt gacattaaag aggagatgaa tcagtttcaa agagaggttg aaacatggtt 37441 tgaccgttct atggatcggg caagcggtgt ttataagcgt aatgccaaag gagtggctat 37501 tttaattggc atatcagttg ctatattaac caacactgat acatttttcc tcttgaaaag 37561 actgtcgcaa gattcggctg tacgctctgc cattactcaa agcgcaattc aacaaaagga 37621 ttttattaat gaccaaaatg ccagaagtca atttcaggaa cttatagaaa atgcttctgt 37681 acctattgga tggcaaaaca tcagtcaaca atttgagcca ttaaagacga gtcgggggaa 37741 tagtgctcaa atttttgcac tcagaatttg gctagttttg aaaattcttt ttggttggat 37801 tgtaagtggt ttagctattg ctatgggtgc gcctttttgg tttgatattc tcaataaagt 37861 tatcaacgta cgtaattccg gaccaagacc tgttacatat actaaggatc aaccgcctga 37921 gaaatagcat ttccgttcaa ttaaaaaata aaaattattt gcaatttcca ctcttggtct 37981 tgacgaaaaa tctcaatttg ccgaataaac tctcggaagt aaaaccgtcg ctctgcttca 38041 gacaagtcta 
accaaaattg tggaattgaa acagcttggg cgacagaacg caaattaaca 38101 ggtggtaaag ttgctagttt tgcttgaagt gtagatattt ctgtgcggag tttatacgcc 38161 cttaactgag cagtttccgc atccaaaact ccagtttcta acaaagaggg tagctgattg 38221 agtatttctt gctgacgaga aatgccttca gttaaactat ttttgactgc atctaactga 38281 ggaaaattca tccccgccac agcaaggggt aaatcacgac aaactgcttg aattgtctgt 38341 tctaaaattt cttggtaggg aagggcgcga cacttgggat ttttgggaca gctaatagga 38401 cgtaaataaa gatattcttt cttttggttc cgcatagtca cacgagtgac tgtcatatga 38461 gactgacact cttgacagac aaccaaccca gccagagaac gcggtgcgct agctgtgcga 38521 cgaggtaaac gactgttgcg gcgtaacagt ctatcaacct gggcggcttc ttctttggta 38581 ataataggag catgagtatt ggagataatt tctccatttt gataagctgt atcacctcga 38641 taaactggat ttgtcaacca acggcgtccg gtggtgacag aaattttctt gctgtatttt 38701 ttcgcgagat aacgtactgc tccccgcaga gaaccataaa gtaagaaatg ttcaaaaaaa 38761 tctttgacaa ctggtgaagt ggtacggtca atgatgtatt ttgctttacc tctgcggtaa 38821 ccgtagggta ctctaccggg tggtggtgca gcatccagac ggttgcgggc gtgtccttgg 38881 cgaatgcggc gactacgttg ttgacgctga atttcgtaga gtaatttcag caattgggcg 38941 cgggggttag gattttgttg agaagaatta tagtcttgct caacagcaat gagtgttatt 39001 ttcattgctt caagttccgc aaggcgcgaa ctgacttctt ctagagaatc tcccaattct 39061 tctaagcggc ggataaggag acaatcaggc ggttctgttt tgcagtcgtt gaataattgt 39121 tgtagttgag agcgttttcc taagtcttgg taaataaaat ctaactccca tccccaggtt 39181 gtgggatcag gagttggttc tagtataggg tcagtgtaga tgtaggcgaa gattttcatg 39241 aggcgatgtg ctactttccc taacttagat cttgcacctt gatctgagta cgccttgaac 39301 tcaagttcaa ggcttatagg caaagtccgt taaaacggac tgggtaagtg tttgagtccg 39361 ttttaacgga cttgagcttt gagccaagaa atttatttct tggcggacaa aaactatggt 39421 gcaagatcct atctaatgaa aatcgtaacc gttcaggagg gatagtatag caatcctata 39481 tgagttgtga gaattatcga accgccttag cgaagcgtgt ccgaaggaca tagacgccaa 39541 gaacgccaag aaagagaaga gaaaatctta tgaataattt aggactgcta tagaaccttt 39601 agtgtagaat gctttacgtt ttgtagggga attgtgaact gacaaaaacg tagatacatt 39661 tgcgcagcgc ccccttaggg gctagcggct tgtcgccaga tatagttatt tactcaacac 
39721 aaaatacggc aaaatcattt actacttgct ccacgagact catctcttga ttgtgcagca 39781 cgctttcttt ttctttcctt ttcttcttgg tctttttgta tctcgtatgc agcagctagc 39841 tcaccatcta ctccaaaata ctcctcagtt ttatctgata tttctttgat gactgcaaac 39901 gaggaatttt ctgataaggg catacccggc tgagatacga cacgattact tatttctatt 39961 gatgatgaaa ttccatcgac aacatcatcg accaacttaa ctgtttttgt aagaaatcca 40021 cttgtaagcg atgttgctat catcagactt gcttgcaaat ttttcccagt cttaacagct 40081 tcattgaaaa gttctccaca tacatctaca gccttatcta aaagttcttt agcttgctca 40141 atccgaggtg caaaatcacc agcttctact tgtttagtct cattctccga taccgttgtt 40201 gattgtgtgg atgaatttcc acctttaact gttgaatcag tcgtctgatt agtattctgt 40261 tttttctctg aatagtgctg gagaggttca aaaccccaac tgtcattaat ttcaacgaga 40321 caatcagcag tatatttaat ttcagttttg gcttgctcgc ataggctttc tatttcagct 40381 aacagtgagg ctaattcatc tgtggaacat tcaaaaagct tatttgctaa ctcattgata 40441 ccttctaatg tttcgcttga cttagcagta tgcgtttgga attgctcgat atcatcagtg 40501 agagtcatat ttttagcccc aacgtgagcg gacatcttca agtgcatctg tagcagagcg 40561 tagttccctg tctatttctc ctaaattatc cataagccct gaaaattttc tacttaactc 40621 agttgcagtt gtttcaggaa gcgttttaga tgatgaatta atttcctcaa gctggtctac 40681 aattctacta atcattgtgc ggaattcttc aatatcggat gaaagggtca tgtaattatc 40741 tccagttttt ttttcaaaaa attttatttt ttggtgtgaa gattagccca aagagtctca 40801 gatattttga cttttgagcc aataggcatt tttatgagag gagtattcac aaactttgca 40861 ccactataac cactcaagta agctctacct ttagcatctt tgggtatatc gcctgcaatt 40921 gcatcttcta atatgcgata gctagcatct ggactcaccc gaaaaacgac tttttcttca 40981 cattgatctg tcacttgttt agttataact tcggtagttg gtctttgggt acagtaaata 41041 atatgaattc caaatttacg acctttacga gcatactctt tcagtcgctt ttcaatagtg 41101 tctcgcagtt tactggaaac atctgcaatg tcagcggctt cgtcaataat ccagagtgta 41161 cggtcaattt caactccttc ctcttgcaat tctttaaggt tttcgactcc atactcctgc 41221 attaactgca agcgtctttc gtactcttgc tgatgaattt tctctactaa atttggacaa 41281 ttctctggag ttgtctcaat ctcaacatta actcccatat gatttaacca ttgaaagtca 41341 actccaccga aatcagcaat gtaaatccta cgttttgggt 
taacgtataa aaattgaaag 41401 ataatccatt tcagaaaatt gctttttcca gcgccagaca ccccagctac taggcgatgg 41461 ggaatttcac tccagctaga tgttacagga ttattttcat catctaaacc aactatatat 41521 tcgtcttttg tcatcaggtg tttatagttg ttgaagattt cctcagcagt gctgttacgc 41581 aagcgagttg tcgccatacc cggaactgga gtattcccag gaatgacaag gcggaagaat 41641 ccactaattt cttccagtgt tgcaagacgg tgtaatggtt ttaagtcttt tatttgtgca 41701 agaggtaacc tccgtgaatt atttgaagat tgagttattg ctgaagcgcc agtccggata 41761 attgcgcctc cagatgaagg taaagttgga tgactaaaat ttgcggaatt aaaagagttt 41821 gagtaaagat ttgtcgaacc atcacccaac tcattgctta gcttgggctt aataggttgt 41881 ctgatgtact tctctccaaa atcaccttcc caacctttgc gttcaacagc agtagaaata 41941 tctatatttt cagttgcgct caaggtttct aaaaactttt gctcgtcttt atcccgactg 42001 acaatttcaa tataatcttg ttctccagaa ctattagatt tggttgcact ctgtactaat 42061 gtcattaaaa cagtacgaat attgctgcgg gtttctgcca gtgctttgat attgtatttg 42121 aataaatttc tatttacata taattcttga tattgtttgt aagtttttaa ggctgtatct 42181 agaaaagcat ccttgctctt acttgtgagc agttgcagtt gagctaccat ttgactaaga 42241 gcattaaccc acgtttccct atcctcggaa ctttgacaga cttgaatagt aatttctagt 42301 aagaattttt cttttaatct atggagaaca tcacagacag ctaacatgtt gttatctttg 42361 ttagcttcaa agaagtaagg tagatagtag ccttgcaaat tgaatatttc tggcttgatg 42421 acttctcctt gaaatatctt atcaaaatct tcctgataca gcctttcgtc cgcaatcttt 42481 ctatcaattt ctctaatcgt catagttttt acatcaaaat cttccaattg aaatccaaaa 42541 tatagtcctg tcaggatatg atttgatttc atgtcaagta atttttgatg cggtttcaga 42601 agtgatacag ttacatctct tatctcttct tgataatctt ggagaaattc tgaaagaatt 42661 gtgacttgtt gctcaacgtt ataaaaataa attctgctaa tacaccgaac tgcttctact 42721 tgttctcgta agttattttg tgtatcaagt agaagtgaac ctttaatagc ctctatcaaa 42781 aatggaactg agagttcacc taattcttct aatttattcc agtctcttaa gataatcgct 42841 gctttagcct tgatagtatc attcaagcaa gaccacgatg agcctacttc attaattggt 42901 tcagcagcaa gtcgccgtaa gtcaatatca ctattatcta aaaaaggaat caaaatatct 42961 gataatgttc tttgtgtgcc agcatcctga tattttcgat attgtctatc tctgagaagt 43021 aagttggcta aatccaaagc 
aacttctaca acaaactgct gtaatgtgga agctgaacga 43081 gcgataaact caacacctac aacaccgagt ctagctagaa ttctctcaac aacatccaga 43141 taagatgaag gaattgtctg caacagtcgt aacagatgtc ttttaggtgt ataaacccaa 43201 aaatcaatgt attctttaga gattgaccta ttattaagac gatttagact ttcactaata 43261 atttgacttt gaatatcctc ggctgacttc ggaagaaatc ttcctgtaat catcgcagag 43321 taacctattt gctgaattgc ccaagccgct gataaacgaa tcaaatcgct acccgattta 43381 gcttcctgac acaacttttc aagtatcctg acactatggc gcttgtagtc ttctggatca 43441 gctagcagag agatttcctc aagaggttcg tcaagtccct ggatatagtc gagagttgca 43501 agttgggctt tgatttgcgg gttagtaata taaccaaaac tttgaagccc catagccaag 43561 atgaattcat cttggtagtt caacaactct tcgatcgccc ttagcggtac attttgtctt 43621 tgggctgttt ccaaaaatag ctggattttc tccccatcgt cacgacattg ggcgaggttg 43681 tcaaaaaagt ctttgccaaa ttggacgggg gtgaacttac ttacactagc gccttgtccc 43741 cctagttgga tttctgagag attgtccata acttcggatg ataaagatga catccgtgaa 43801 attgtttatt tttttacttt tctattgtaa gcggtttaat atttaaagta taaatagcaa 43861 cttaagtatt ctttattttt cttcaggtta cagctactac ctcgttccca gtcagagact 43921 gggaatgacc cattgggagg ctctgcctcc cttacttgcg gcagagccgc cctaagcgca 43981 tttccaggta gaacctggaa acgagatttt agaacgtgga ataaaggctt gagctttaag 44041 ttgacacgta tgggctagcc tttgccccta atctgtagtt tatttacttg aaaagcgctg 44101 taatataacc tctgctcaat cacctactgg tttttactat gccgcgtaga agtgctgagg 44161 cttgtcgcta gatattgccc aatacttgag aacaaaatag gcgatgcgga agttgcttta 44221 aaggcgatcg cccttcaata cgcttcggat aaagatcaca acgcctgcgg cgatcgcctt 44281 aaaaaggggg taaatagata aaatagcccc cctcgtactt ctccactttt ccacctcact 44341 gctgactgag ttcaacaata ttcccatcag gatctttggt gaaaagggct gggcgtccgg 44401 aggcgctggg ttgaatgaag cagttgttat tttggagttg ttgtttggct gcgtctaagt 44461 cagcaactga gaatgcaacg tggggattgc gcccccattt ttcgttttga tcttcagtgg 44521 ggactccctg ggcagctatc aggtgaattt gatgattccc aacttgatac caagcgccag 44581 catatttcag agagcgctct actttgggca atcctagcac ctttgtatag aaatgttcgg 44641 ctcgttctag atccgtgaca agaatggctg tgtgaagaca ctgagtaatc tgcattggtc 44701 
ttggtaggtt gagagtttta agtacgtcaa cattgaaaaa cgtaaaatag agtcttggcg 44761 attacttcgc tacccttcgg gaagccgctt cgcgtctaca cttcgtttcg ctcgtaatga 44821 cattttacgt ttaattaggt tgagagtttt attatggagc gagctgagtt gagagaagta 44881 agggacaggg ggacaaggag gataagctca gatcttgcac gataactttc gtccgccaag 44941 aaataaattt cttggctcaa agttcaagtc cgttaaaacg gactgggtaa gtttttgagt 45001 ccgttttaac ggacttgggt tattagcctt gaacttgagt tcaaggcata ctatcggtga 45061 ggtgcaagat ctcagtaagg ggaattttac tctctctgac ttgctcactc cctcatcttc 45121 ccatcttctc ctatgactct gtgcctcctg tattaggaat aagcctgcct ataggagacg 45181 ttgcgacttc ttgagaagat gtgctttgag ccacagcaac aggtatttca tctactgatt 45241 gagttttcgc tgctaaataa tctgtccgca gcttgtctag catttgttct cgtttgtcat 45301 acaaccgctt gagagaacaa ggctgacatt ggttgagggc gtaagctgtg actaacggta 45361 atgcaattgt gctgtcggta taacaaacta ttgtgctagg taattcattt ggatcaattt 45421 taccccaact gacagcttcc gagggtgtgg ctccggataa cccaccagta tctggacggg 45481 catcggtaaa ttggacaaag taatcgtgtc ctcgttcttc tagccctaag acttcgtgaa 45541 tctgcggttg tgtttgtagc aaaaagtttt taggactgcc accaccaata atcacagctg 45601 cacttttgcc tcctgtttct cttgcactat aggcaatggc tgctgtctcg tttacatcaa 45661 tagaaggatc taagatcagc ggagatcctt ccaatgacaa tgcggcaacg ttcataccaa 45721 tagaactatc acctggggag gatgtgtaaa tcggtactcc acactcataa gctgtcgcta 45781 gtagacagga atgcttaacc cccaattgct tttctacctc tcggacatac ttgcctagta 45841 aattatgaaa ttcagcagtt cccatccgtt tttggaatgg ttctgcttgc aaaattttac 45901 ggatgaaagc gtctgtttcc aacaggacat cgtagccaaa gataatgtca taaatgcgga 45961 tagtaccttc ctggcgcagc ttcacatcat ttaaaaatgg attaccagca aagagttcaa 46021 aacctagccc atagtgcata tcatgataaa gattagcacc agtactaatg atccaatcaa 46081 taaatccgtt gcggataagc ggtgctagcg cagaaacccc gaatcctgct ggtgtcatcg 46141 caccagaaag gctaactcct accgttacat cttctttgaa cacatctcga ctcaggagat 46201 gacatatctc ccgcaatcgt gctgaattgt atgctgtgaa gtaattatca atcaaatcga 46261 cgaccccgat atcagttggt atcggtgtag gtgcgatttt ttttcccagc tgttttgaca 46321 tttgtggtag tccccgaaca aagaattaat aattgtacct attcatggtg 
agtaagcgcg 46381 ctgcgggagg atctccaacg ccagaacaag cgtgagggaa accctcctat gtcctgcgga 46441 cacgctacgc tatcagtact ggctcctccg caggcgacgg gcgttcccag agggtcaaaa 46501 tctgaatcaa caaacagcat acaatcttag ctttgagtta tttcttgcac cacccgcaag 46561 tagcaattgt aggcattttg cccaaaacac acgaatataa tttgttcgag agagttattg 46621 ctgtgtaaaa atttgttaac ttcattgaca gcaattttag tagcacgttc tagtggaaaa 46681 caatatgcgc ctgtgctaat ggctggaaaa gcaatagttt ttattccgtt ttgtactgct 46741 aatgctagac tattgcgata acattgtgcc aacaactcat cttctccaga atttccacct 46801 tcccacaccg gaccaacggt gtgaatcacc cactttgctg gaagatgata acctttagtt 46861 atctttgctt gaccagtatc acacccctgt aacttgcgac actctgacaa aagttctggt 46921 ccggctacac tatgtatcgc gccatcaaca ccactacctc ctagtaagga attattggca 46981 gcattcacaa tagcatctac ctgtagctgg gtaatatccc cttggataac ggaaactcta 47041 ttttccataa ttgataccaa gcaagttgcg tttaaaaacc tcacctccag cccctctcct 47101 tagtaaggaa aggggtgccc gtgagggcgg ggtgaggtgt aagtgatgaa tacaaggcgc 47161 gtatgaggca cagttgctgc aaatgaagtg ttctggttca tcttattctt ctagagcaga 47221 tctggtaagg tttctagagg aatatgctgc attttttacc aatttcttat attttttatc 47281 atttatttaa catttaaata aacgaatttt tgataaagcc ttgtagtaat gatactaaac 47341 tgaacgccag cgctatgcaa accgacgaaa cttggacaac ctccgatgta agtgccgttg 47401 gggaaaattc ctcccttacg tcagaacagt accgtaaaaa gatgcagcga cgcaaagaag 47461 ttcaggaggt gcgaatgaag aatgcgtcta acgaaaaagg gttaattatt gttaatactg 47521 gtaatggcaa ggggaaaacc actgcagcat taggtatggt gttgcgtgcg cttggtcatg 47581 ggtataaggt tgctatcatc caatttatta aaggagcgtg ggaaccttct gaaaaaagag 47641 tttttagtat ttggcaagat gatttgttag aatttcacgc tttgggcgaa ggctttactt 47701 gggatactca agatcgcgat cgcgacatcg aaaaagcgat cgccgcttgg caaaaatcat 47761 tagaatatat ccgcaatcca cagttcaaac tggtgctgtt ggatgaaatt aatattgctc 47821 ttaaacttgg ttacttacaa gttgaggaag ttttagctgg gctagatcaa aagccaccta 47881 acaatcatgt tattctcaca ggcagaggtg cacctacagc attgatttca cgcgctgact 47941 tagtaacgga aatgacttta gtcaaacatc ccttccgaga tcagggtatt aaggcacaac 48001 caggaattga gtattaagaa gtgctaagta 
aggagttagg agttgtattc ttacctcctg 48061 actcctcact gttgaacatt cgcctgacac atttgtaaaa cctggcgatc gctactactg 48121 atgttattta gctgataata atcttgcaat gttactgagt gagcataagt gctggattca 48181 tcatccatca cccgaggtaa tgtattgttt gctgctactt gagttgggct tttgtcacct 48241 gggtaaccat tcactgtcat tgccaactct ccagaggaac ggagtgtgcc attaaaacaa 48301 ctaaactcag agcttggcat atataacgca cctgtcactt tgctctggtg cttttcaaat 48361 atgatatacc cttgccccag ttgaccagac ttgggtgact gaccgtagag gtaaacgcca 48421 tcttgttgag ggaacttggc tgctatgggc gtaacactgg cagttttggt gggtgtttgg 48481 ggttgtgcaa ttgcgactgt ggttgtagga actgcttcca aatcatttga tttcgttata 48541 tcagttggaa cggtcaccga ttttgccagt actctacgca gtcgggtttg tttgtcccta 48601 gtcatgtctg aagctggatt ggggttctgc tgcaaccaag tgtaagtggg taactgagaa 48661 tcttcagcca attccgcaga ttgatcaata tatgcaagct gattactcat tactggttta 48721 gggcgattaa caaccaagcc aaccaacaca agtccgccaa taatggggat tcctaataaa 48781 ctgggggaaa gaaacttttt gatattgttt accaccggat tcctcctgtg aaaaaaatct 48841 cttgattttc tggaacttac ctacaaacgt accgaagtat tgatgtgtaa aggctccgtc 48901 aatggtttga tttatcttgt ctcgaaaaaa tctttgccac aaaataaatt cttgcttgta 48961 tttttcggga gttattttga gccataatct taacttacca ttgtttgcgc tattgttgcc 49021 tcattcatta gatttactca attgcatgat ttttgaaatg aaaaacttaa aatgaaatgc 49081 taatttttcg gacttttctt ttttttatta tttaatttca aaaacaattg acacgttctt 49141 ggtataatca tctgtctaat gtatgaacta tctgaacaag gaaaactaca caagtaattc 49201 tcttgattcc agatatcatt caaaaatttt ggcaaaaatg acaacgagag tagacgaaag 49261 cgggagtcaa acaactgggg gtgtaagggt gtaaggggga ctcaccagtt ccctctgggg 49321 atagggggag ag // LOCUS NODE_479_length_46839_cov_4.759170 46839 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 46839) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 46839) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..46839 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 302..517 /locus_tag="DP116_02400" CDS 302..517 /locus_tag="DP116_02400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749456.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02400" /translation="MRSAIAPKIGDKVLILRPAYVAGRVGKVFAKEVLSGDYPSERWL IRVDSEDIVVSLNSKEFEVVNDDFEIS" gene complement(514..777) /locus_tag="DP116_02405" CDS complement(514..777) /locus_tag="DP116_02405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02405" /translation="MEHSHQRDKEKLLYPRARYYGQIKPENLVFNANLQEFSQKVSYI TCLETSGKLAPHEAYENIKALWKQLKRSRKELGIGNESSSDIT" gene 1307..1561 /locus_tag="DP116_02410" /pseudo CDS 1307..1561 /locus_tag="DP116_02410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315978.1" /note="catalyzes the isomerization of isopentenyl pyrophosphate to dimethylallyl diphosphate; internal stop; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type 2 isopentenyl-diphosphate Delta-isomerase" gene 1854..3689 /gene="sppA" /locus_tag="DP116_02415" CDS 1854..3689 /gene="sppA" /locus_tag="DP116_02415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216601.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="signal peptide peptidase SppA" /protein_id="PRJNA477356:DP116_02415" /translation="MRNFLKQTFASLLGSLLGLFIFCGVGTTGLLFLLFAAASSKDVG PVVKDKSVLVFDLSTNITDGQPSSSELLQNALSGDDNKRISLHTVLEALDKARRDKRI VGIYLDATDATEGRVTGFATLKEVRQALEKCRAAGKKIIAYGMDWGEKEYYLSSVADS IVVNPLGGMEINGLSTQPMFYAGALEKYGVGVQIVRVGKFKGAVEPFILTKLSPENRA QTQELLNDVWGEWRATVGASRKIKPQQLQAIADNQALLLADQAKANRLVDKVGYFDQV VADLKQLTNSDKEDKTFAQISLKDYAGVAGKSLAVERNSKNKIAVVYAEGEIVDGQGD DGDVGGDRFAKIFRRLRQDEDVKAVILRINSPGGSVSGSEVIQREVRLTGEKKPVIVS MGNVAASGGYWIATDSKRIFAEPNTITGSIGVFGQILNFQKLANDNGVTWDAVKTARY ADTQTVARPKSPEELAIYQRSVNRIYDTFLNKVAQGRKLPKQKVAEIAQGRVWSGAAA KKIGLVDDIGGLDAAVKYAATAANLGDNWQLQEYPKGGNLRERFFGQVSEQARTVLGV DKVQLKAPDPLMSEFQKLQQELAILRKMNDPQGIYARLPFNLKIE" gene complement(3653..3859) /locus_tag="DP116_02420" CDS complement(3653..3859) /locus_tag="DP116_02420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008199529.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02420" /translation="MRKSGSEGEELPVISYQLPVTSYQLPVVSYQLSVFTVYCSLFTV HCSLFRVPYSLLYHSIFKLKGKRA" gene 3974..4225 /locus_tag="DP116_02425" CDS 3974..4225 /locus_tag="DP116_02425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315976.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02425" /translation="MQLPKDIKTKRLLLLVVSCSLAGVIVGGTANWAESNSCLQASTV TNECLTQDQTTKTIEGMSTGLIVGAGAAFGAAWQHRHED" gene 4424..5392 /locus_tag="DP116_02430" CDS 4424..5392 /locus_tag="DP116_02430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RsmB/NOP family class I SAM-dependent RNA methyltransferase" /protein_id="PRJNA477356:DP116_02430" /translation="MEKASNLLVKLSRRLFHNLDEQEIFVETLIHPKPFHPCILWCQE KPKNSPFSVETPTVWQPQFIDRLSLGERPGQHPLHEQGYFYCLDFSSVFAASILLTIP SPVKLVFDMCAAPGGKSIFAWKSLQPELLISNEVIGKRLGMLISNLKRCQINSSVVVS KDSSLFAERIPFSSHLVIVDAPCTGQSLLAKGEKAPGCFHPTAINKSANRQKRILANS AQIVAPQGYLAYMTCTYSPEENEEVCEWFLKRFPQFQAVEINHLQGYQSHLTSVPSYR MFPQDRLGAGAFTVLFKNTEEGEIKEIYVETLSAVWMNIAIRSRTS" gene complement(5478..5753) /locus_tag="DP116_02435" CDS complement(5478..5753) /locus_tag="DP116_02435" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02435" /translation="MISVRELALINDLKLAENAAICSENVVSEPFRQCLMHGFHLTLT SLIGKQLIYLVKFSQYLVLHKTKRKFIADITTKDSVSTNEEVNQQYN" gene 5870..6280 /gene="msrB" /locus_tag="DP116_02440" CDS 5870..6280 /gene="msrB" /locus_tag="DP116_02440" /EC_number="1.8.4.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017322152.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptide-methionine (R)-S-oxide reductase" /protein_id="PRJNA477356:DP116_02440" /translation="MATSNSNSQINKTEQEWREELTQEQFCVLRQHATERPHTSPLNK QYAEGTYVCAACGQPLFTSGTKFDSGTGWPSFFNPIETAIGTSVDKSLFMTRVEVHCN NCGGHLGHVFDDGPAPTGKRYCINGVALKFIPQE" gene 6538..7359 /gene="modA" /locus_tag="DP116_02445" CDS 6538..7359 /gene="modA" /locus_tag="DP116_02445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317490.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_02445" /translation="MKRRQFFPFISIVIASLLLALSLRLFDPSAVVAQSKVDLLVSAA ASLKDAMEEIKTTYQQSKPNINLSYNFGASGALQQQIEQGAPADVFISAGKKQMDALE QKGLLVQGTRTNLANNSLVLVVPSNSTAVTSFNTLTDAKVKKIAIGEPRSVPAGQYGE QVLEKLNLLQQVKPKLVYANNVRQVLASVESGNADAGLVYATDAKISNKVKVAVVADD KSHSPIVYPMAVLKSSKNVDAAKEFVQFLTSEPAQTVLKKYGFIVQPAKVPAMSR" gene 7955..10177 /locus_tag="DP116_02450" CDS 7955..10177 /locus_tag="DP116_02450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878909.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="formate dehydrogenase" /protein_id="PRJNA477356:DP116_02450" /translation="MLPKPKKHWTPSHWASWKPFGIGEQYPNNYWEVFRAIWLSRHKL PYAWNILNKGVCDGCALGTTGMKDWTVDGIHVCNVRLRLLQMNTMPAFDPAILADVSA LQTKKSAELRELGRLPYPMIRQSAEKGFRRVNWDEALEAIASRIRATTPDRLSFYITS RGTVNETYYATQKAVRAMGTNNIDNAARICHSPSTAGLKSAIGAAATTCSYKDWIGTD LLVFIGSNVANNQPVTVKYLHNAKKAGTRIVVINTYREPGMERYWVPSIPESALFGTK FAEDFFLVNMGGDMAFLNGTIKHMIANGWVDDSFINRYTAGFDELKAFLETQSWEELE RLSGAKRDEMYAFAKMVGEANKAVFVWSMGITQHECGEDNVRAIINLALTKGFVGREG CGLMPIRGHSGVQGGAEMGCYATVFPGGKPIIPENAAQLSKLWGFDVPVTKGLIAPEM IHAASEGQLDVLFSVGGNFLEVLPEPDYVEDALKRVPMRVHMDIVLSSQMLVEPTDTV VLLPATTRYEIPGGVTETSTERRVIFSPEILGPRIGEARPEWEVFMELARRVHPELAD KLTFVDTAAMRQEIAQVVLQYAGIQHLKEAGDQFQYSGSHLCFGWNFPTADGKAHFAV VSRGERELPEGCFLVATRRGKQFNSMVQERKDAITGAVREAVLINEVDAKHFGLNDGD VVILTNELGNLKGKVYTAPIKPGNLQVHWPEGNVLLDKSKRSREGVPDYNAVVRLEKI " gene 10691..12445 /locus_tag="DP116_02455" CDS 10691..12445 /locus_tag="DP116_02455" /inference="COORDINATES: protein motif:HMM:PF00188.24" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457302.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02455" /translation="MESKDTTFEQQVFELTNQERTKAGLQPLQTNAELNYTADKYAQT MSENRFFSHTGQDGSQPWDRAKAVGYEAQTMGENIAAGQKTPGEVVQAWMNSPGHRAN ILRSQYKDLGVGFEKNYWVQNFGSGDTNPASYIPGSESNTQIPSNPTPPSEPVSTPTP PGATSNPTTPSEPVSTPTLPSEPVSTPTPPESTSNQGGSPNSNQTIPKPPISSDSFEP TAYEQYMLELINRARANPQAEEQRQNIPLTQGLSPQSISYEAKQPLAWNTTLSKAAQD HNKWQEQTGTISHYGDGGSPWERAYKAGYDMTAPQSSQANENLAMGGGSTPKSATQYA EERHNSLYGSGGHRANFFNSDWKEAGIDFLGQQASDGQNLTKSSVVEFFGKPASDNTF LTGVAYNDLVKDDDFYTPGEGLGGIKVEAVRQSDNKLFTTQTSSSGGYQMALEPGDYK VTFSEGKLNEAITNTAKIDSKNVKLDLVSDKLVNGIYHSSDSLTGGDPSDILTGQQSD GMGYDRLEADPGYPTFVASQGENVDLIKNLVTNVGNTEPLPGLGMDKCLPIENGMFNH TKSLIAAPGNTIPTPSLV" gene complement(12624..14240) /locus_tag="DP116_02460" CDS complement(12624..14240) /locus_tag="DP116_02460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017302207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_02460" /translation="MARTQTRHKASDKAEPGWLKWAIAITASLGAILEVIDTSIVNVA LTDMQSSLGATITEIGWVVTGYALANVILIPLTAWLGDYFGRKTYFIFSLIGFTISSI LCGFAVNLPMLIISRILQGLCGGGLLAKAQAILFETFPPSQQGLAQAIFGVGVIAGPA IGPTLGGFLTDNLGWRWIFFINLPFGILAVVMALTFLPGDDKNHKPISNKVDWLGIGL LAMLQQGAAQRAIALGCLQAFLEEGEKEDWFESGFITTLAIVSVIGLVLFIWHELTTD SPAVNLRVLRHRSLAAGSLYSAVLGMGLYGALFAVPIFAQSVLHYTATQTGMLLFPGA LASAVTMLMLGQITSKIDPRAIIAGGGILTSLVMFQLAGINPDTSSDDLFYPLLWRGV GTVMMFLPLSLAALGPLPKQDISAGSGFYNLTRQLGGSIGIAVLTTLLAQRQAFHRKI LVEHVTPYDDATNQRLSMLESALQNRGEDAATAHQQALQIINQTIDTQAQILSFEDIF WIVGVAFLVTLPLLLLLGKRKPTADLPSAH" gene complement(14290..14898) /locus_tag="DP116_02465" CDS complement(14290..14898) /locus_tag="DP116_02465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113600.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_02465" /translation="MNKTARSVALTRTRIIQAAMQVFAQQGLHGATTREIARVAGVNE VTLFRHFASKEQLLGAVIENALALQTETLSQPEEWTNDLWTDLRHFAQLYNTMLEATE DLIRTFIGEAKRHPEAARRVMQEATKPVKEKLIAYLQNGQEKGTVRTDVNPAVAIDMF TGMLLAGMLRRNAPSTSLSYTKQDYLESCVDIFVRGISTAVI" gene 15214..15492 /locus_tag="DP116_02470" CDS 15214..15492 /locus_tag="DP116_02470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996558.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_02470" /translation="MEVQPREIRNYLTDDGRNTFSEWFDSLRDRRAKAKIRARLDRVE QGNLGDYKSVGDGVFELRIDYGSGYRIYFGQEGLTIIILLCRGDKSTQ" gene 15545..15841 /locus_tag="DP116_02475" CDS 15545..15841 /locus_tag="DP116_02475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878910.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02475" /translation="MPRSTSYHEKLIWDLKDPLEAAAYIEVVLEEGEPKMLGKALKNV IEAQGGVDKLYPEVKQFYDKLDQMLSEKGEIEFSCLSALLDALGLQLAVTVKSR" gene complement(15828..16043) /locus_tag="DP116_02480" CDS complement(15828..16043) /locus_tag="DP116_02480" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02480" /translation="MIKTNLLEGYRSHRELSLEIEQQLPLFIAARCVLGALWLAGRSA TNPAVRQVASEWIQVNAKKVQNILTLT" gene complement(16073..16816) /locus_tag="DP116_02485" CDS complement(16073..16816) /locus_tag="DP116_02485" /inference="COORDINATES: protein motif:HMM:PF01636.21" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02485" /translation="MTPDSTEKQITLSSEIAHLAIHQYEFDLVKLELLRHWENTTFKL STNKGNFLLRVHRGVYCTIQDIECEAKIIEYLRSYNDYTYQKPIRNRSGNFVSIGTAS GSSKPVSILSWIDYPPIGSHNSDLNVFVKLGQLIAHIHNKLSEWQKPRDFKRPALDAN GLTSANGALGYAPFGYSYLDSETANDFQAIHNRLLDIEATIGQNPNVFGLIHGDLHLG NALYDGHSIIPIDLMTWAGDIMFMILPYL" gene 16872..17711 /locus_tag="DP116_02490" CDS 16872..17711 /locus_tag="DP116_02490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="formate dehydrogenase accessory sulfurtransferase FdhD" /protein_id="PRJNA477356:DP116_02490" /translation="MIKHPGSKTKTTTWVVEKAQVRPRQDQLTTEEPLEIRLVSPQKT VAVTMRTPGADFELAAGFLYCEGVVSYKEDILRMSYCVDDVDGEQRQNIVNVTLREGL NPDLQPLERHFYTTSACGVCGKASLEALRLRGCPVILPQPIVTAEIIYNLPDKLRAAQ GIFNATGGLHAAALFDDQGQLLNLHEDVGRHNALDKLIGSALLSEQLPLSHHIVMVSG RSSFEILQKSTAAGVPIVCSISAPSSLAVSVAKEFGITLIGFLRGERFNVYTGLERIS VFR" gene 17789..18160 /locus_tag="DP116_02495" CDS 17789..18160 /locus_tag="DP116_02495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glyoxalase" /protein_id="PRJNA477356:DP116_02495" /translation="MTFQYTDAFVTLATFQIENLVGFYTQFLGIEPTTYIPNIYAEFR LPSFKLGIFQPKQIHFSEFENSAKSKVSLCLEVSDLESVIAYLSVLGCSPPKEVMTAS HGREIYAYDPDGNRIIIHQSK" gene 18781..19662 /locus_tag="DP116_02500" CDS 18781..19662 /locus_tag="DP116_02500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866586.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribosome biogenesis GTPase YlqF" /protein_id="PRJNA477356:DP116_02500" /translation="MSITQNYKLNLIQWYPGHIAKAEKKLKEHLKLVDVVLEVRDARI PLATHHPRIGEWVAGKTRVLVLNRVDMILPQVQQVWTKWFKSQGEVPYFTNAQHGQGV AAVLKAAQAAGGAINERRNSRGMLPRPVRAVVIGFPNVGKSALINRLLKRRVVESAAR PGVTRQLRWVRISEELELLDAPGVIPLKLENQEAALKLAICDDIGDASYDNQIVAATL VDILKNLRVNTADFIPEEPLESRYKLDPTSLTGEEYMFALAEYRYKGDVEKTARTLLT DFRKGFLGEIPLELPPG" gene 19844..21448 /locus_tag="DP116_02505" CDS 19844..21448 /locus_tag="DP116_02505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878918.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="adenylate cyclase" /protein_id="PRJNA477356:DP116_02505" /translation="MTKLKLRLKEDNSEKTVTVHQDVFTIGRLPQCDLYLVSGGVSRY HARIMKTPCDTWTIEDLGSKNGTQLNEHLINSPQQLQDGDIIWLGDVCLTIVLSSVDS SIFSQGVVSPGITILRDVEQLQQQWILADNVCGDVGIKDKTIARLKDLVNIAKNLSAA ASIEEIFSQVQEVVFRYLNSIDRLGLLIDVSGSGKLELLNAATRNISYQEDLPADGSW ISRSICQKVFEEKVAIQTADAQNDERFAGENSLLVKGIRSAMAVPLWDENKVVGVLYA DAHLSSHHWAEEGEEELSFFSALANLVASSVQRWLLAEKLKSEEVIRRRLERYHSPAV VQQLIAVGALPNGRLPPQESEISILFADLVGFTAISERLTPTDIADLLNNLFEEMLQE VFAGGGTLDKYIGDCIMAFFGAPEPQPDHADRAVTAAMGMLTRLENLNAKNFWTEPLQ LRIAINSGKAVVGDVGSSQRVDYTALGATINLAARMEAICPPGECVVSEDTYTMLSQP SSFLEMGDYRFKGINRLVKIYRTKMH" gene 21812..23179 /locus_tag="DP116_02510" CDS 21812..23179 /locus_tag="DP116_02510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320250.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADP-dependent succinic semialdehyde dehydrogenase" /protein_id="PRJNA477356:DP116_02510" /translation="MAIATINPATGELLKTFEPLNDAEIAQKLDLAQQAFEKYQKISF QERSVWMQKAADILEQEKADFAKIMTLEMGKPLKAAIAEVEKCAQVCRYYAEHAAEFL ADVTVKTDASHSFVKYQPLGIILAVMPWNFPFWQVFRFVAPALMAGNVGLLKHASNVP QCALAIEEIIHKAGFPTGVFQTLLIGAAKVADLMSDDRVKAATLTGSEPAGASLAAAS GKQIKKTVLELGGSDPFIVLESADLEAAVATATTARMLNNGQSCIAAKRFIVVETIAD KFEKLLLEKFQALKIGDPMQAETDLGPLATPDIIKDLDQQVQAGVKNGAKVLTGGHAL SDRPGNYYPPTILTDISPDNPVAQEEFFGPVAMLFRVPDIDAAIRIANATPFGLGASA WTKNSEERDRLIEEIEAGSVFINGMVKSDPRLPFGGIKRSGYGRELSIQGIHEFVNVK TVWVK" gene 23227..24864 /locus_tag="DP116_02515" /pseudo CDS 23227..24864 /locus_tag="DP116_02515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455855.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="acetolactate synthase large subunit" assembly_gap 24391..24400 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 25151..26251 /locus_tag="DP116_02520" CDS 25151..26251 /locus_tag="DP116_02520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010469338.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_02520" /translation="MARSLKVAQEYIQKVKSSLQRNSYPSQKALADDLGISLSTVKNF LSSKPVDYQYFVEICQKLGLDWQEIAFKEPDTQPNSSKTFEETSPFITGSPITHPRHF FGRQKQLKRLFDLLKRRPLQNAAIIGKRRIGKTSLLHYLKNITTTPPEQLRSGQKYDW LPHPETYKWIFVDFQDPRMASRERLLSYILECLSLKVPTPCSLDYFMDVVSDNLHNPT VILLDEIGVGLQRCPELDDEFWESLRSLATNHTRGNLAFVLATHESPIELARNTGHSS PFFNIFGYTATLGALTEPEARELIASSPITFAEEDVEWILQQSQCLALLLQILCRERL FSLEDGETDDWCEEGLRQIEPFAHLLERNRQR" gene 26971..28413 /locus_tag="DP116_02525" /pseudo CDS 26971..28413 /locus_tag="DP116_02525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496028.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ATP-binding protein" gene 28663..30522 /locus_tag="DP116_02530" CDS 28663..30522 /locus_tag="DP116_02530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkaline phosphatase family protein" /protein_id="PRJNA477356:DP116_02530" /translation="MRFFTHGYRWVFLTLFVIAVIVACTASPGKRSGQTTQQQATQHN MIIFVADGLRPTSINATDTPSMNEIRERGVKFTNSHSLFPTFTTANASAIATGHYLGD TGDFSNTIQVNAPVKSAKNSLVPFLENNAVLQEVNKQFGANYLNEQTLLEAARKANFS TAAVGKIGPVLIQDVTLQKGEPTIIFDDATGTPTGISLSSEVSEQLAKNSLPTAAPSR GDNGKPGDSKTPGTKVANTTQQQYFADVTTKVILPLFKQRQKPFVLVYWSRDPDGTQH NHGDSLNQLVPGINGPTVQAARQNVDKNLAQIRTALKDLGLEESTNIFLTADHGFSTI SKETKTSYSATLSYPDVPKGFIPAGFVAIDIAQELKLPLFDPDNKNATVDPSKGQFSK NGIIGKDPKNPDVIVAGNGGSDLLYLPNAANKKATAKKIVDLLLKQDYVSGIFVDNSL GEIPGTLSMEAIALRGAARTPKPSILVNFRSFDTGCGNPTACGAELADTGLQQGQGMH GSFSRADTYNTMAAIGPDFKRDYEDLAPASNADVAVTIAKVLKLKLSTQGKLVGRVLN EALISGPDSVEYKSFTLKSPSAANGLKTILKYQTVGKTRYFDVAGFPGRTLGL" gene complement(30619..30867) /locus_tag="DP116_02535" CDS complement(30619..30867) /locus_tag="DP116_02535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740165.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02535" /translation="MEPQIDKYDFLYQRRSYHGDFTPEALVFNANLQEFATRVGYISN LQTLGKLSPQDAYQQITELWEHLERSYSGLGIDSNTET" gene complement(31149..32924) /locus_tag="DP116_02540" CDS complement(31149..32924) /locus_tag="DP116_02540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198728.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="subtilisin-like serine protease" /protein_id="PRJNA477356:DP116_02540" /translation="MSDKRSLHPLADRNQLSDSNNFSAKFEGFFLQMVMPGLEHRVRE IVTKTLGSNWKVKSIGDNHTEFEVSKKGDTLSVKEAWNKTYYLRSQPGVVDAQPLFAV PLPYRSDWNQESKQAVARFEHQALDESSDVEWSLKQLRVLEAWSRFFPDPNLPPGHGI VIGLPDTGYTKHPEIFANLLLRESYDFLKNDKDPTDELETPLGEVINNPGHGTSTASV MISPRRAQSNYPTGKSVTGVAPGAKVVPLRVSYSVVLLSVTNLADAIEYAAGHGVHVL SISLGTGLFNQRLRSAIIYAQKRGVIVVAASGTFIPYVVWPAAYDEVIAVTGSNVRRE IWLGSSRGSQVDVTAPAESVWYAKTDKNNGEFKYNVLQGSGTSFSAPLVAGVAALWLS YHGREQLIQRYGAEKIPFIFNQILRDSCEKFPTWKPNLFGEGIVNAEKVLAAPLPDNV SNSVIPPAFALLQHPSIDNGRLDTFVHLFEQQLSDSQLDGNFVGVGRDNAKLRSCLAK LLQTTETELPKRLKEVGQELAFYFAVNPELYKQFAAALSSKQPSGTQLQTKTLTESPK PRNLEQIREMLLSQGVSQVLQTKLS" gene complement(33430..33669) /locus_tag="DP116_02545" CDS complement(33430..33669) /locus_tag="DP116_02545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02545" /translation="MRTDFDFPKNDLVGPVVFRPEFNNSQAITVNQAWSLFFTAGQED KALGTNPELGRFFTYLFVGVGVAGTLWATIFNSVA" gene complement(33732..33944) /locus_tag="DP116_02550" CDS complement(33732..33944) /locus_tag="DP116_02550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009546417.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_02550" /translation="MTGRTYVMEEGNRLNNYAIEPKMYVDETQNFGFTEYAEQLNGRL AMIGFVSLIALEVLTGHGLIDWITSL" gene 34348..34644 /locus_tag="DP116_02555" CDS 34348..34644 /locus_tag="DP116_02555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456319.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_02555" /translation="MSLYVGNLSYEVTEESLNSVFAEYGTVRRVQIPTDRDTGRVRGF AFVEMGSEAEEAAAIDALDGAEWMGRDLKVNKAKPREDRDSFGGNRNNSFRKRY" gene 34807..35115 /locus_tag="DP116_02560" CDS 34807..35115 /locus_tag="DP116_02560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114590.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02560" /translation="MSNSATPPIQPAVLITIIGETVLKDRIVKLLKSHGVSGYTISQV QGEGGHGRRLSDLAGYNTNIEIKTIVSLEVSDAILSALKEEQGKHALIAFRHNVEAFY " gene 35408..35596 /locus_tag="DP116_02565" CDS 35408..35596 /locus_tag="DP116_02565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017803988.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S21" /protein_id="PRJNA477356:DP116_02565" /translation="MTQIFVGENEPIESALRRFKREVSKAGIFPDIKKNRHFETPLQK RKRKAVARHKQKKRGFRH" gene 35652..36149 /locus_tag="DP116_02570" CDS 35652..36149 /locus_tag="DP116_02570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407558.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02570" /translation="MTPEAKHVVSELRREFYSLFAAAMKMPVIITGTSYSSNSPIAFW MDDRQRLNYVNIYAYIAPDTFMPLRPFILRLAINKSALQFMMVKKGQEHRNPVWDFEL TVLPTEILDFLPWIVSLVEADDKVSPLLLHSLPHPFKLKVPSVGLCHNAWTLEAWLLT NSTMF" gene 36562..37410 /locus_tag="DP116_02575" CDS 36562..37410 /locus_tag="DP116_02575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317784.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))- dimethyltransferase" /protein_id="PRJNA477356:DP116_02575" /translation="MIETEFQKVRPRKVFAQHWLKSEKALNEIIQASQLQSTDRVLEI GPGTGILTHRLLPLVQSMLAVEIDRDLCDRLTKQFGRQENFLLLQGDFLELDLPSLLS PFPAFEKQNKVVANIPYNITGPILEKLLGTISNPNPQPYELIVLLVQKEVAERLVAKP SSKAFGALSVRVQYLAKCELMYTVPAGAFQPPPKVDSAIVRLVPRLVEPPAADTQQLE TLVKLGFGAKRKMLRNNLQSVVERDRLSQLLEKLEINPQARAEDLSVSQWVALANELL VLSDES" gene 37544..38494 /locus_tag="DP116_02580" CDS 37544..38494 /locus_tag="DP116_02580" /EC_number="2.7.1.148" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase" /protein_id="PRJNA477356:DP116_02580" /translation="MRSYTLIAPAKINLHLEILGVRPDAYHELVMILQSINLSDEISV QASDTQAIRVRCNHPEVPADNSNIAYKAAELMATEFPSTFAKYGGVNITINKRIPVAA GLAGGSTNAAAVLVGIDLLWKLGLTQSELEELGAQLGSDVPFCVAGGTAIATGRGEQL SPLPSLDNIYIVLAKFRSLAVSTPWAYKTYRQQFGDSYLKDTDSLTTRAAAIHSGPIV KAILNQDATEIAQKLHNDLERVVLPEYPQVLQLRETFASAGVLGTMMSGSGPSVFAIC ESQQQAEEVKLRVRETIPDEDLELFVTRMTSHGIQIASSV" gene 38510..38830 /locus_tag="DP116_02585" CDS 38510..38830 /locus_tag="DP116_02585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3082 domain-containing protein" /protein_id="PRJNA477356:DP116_02585" /translation="MSDENLTQQTEAQTQVQSSPLRCVIGAMISGALGYGLYSLMIAT ATSFATKPIHSDNVIVLKISSAVRTLVVGVMALGTAVFGIVAIGLLALGVQLLVQQLT KQKS" gene 38863..39501 /locus_tag="DP116_02590" CDS 38863..39501 /locus_tag="DP116_02590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320020.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_02590" /translation="MVTLQLKQIRVPPGQRVLLEDISWVEFEAILNELGEHRNSRVAY QQGTLEIMVPLPEHERAKIIIGDLVKILLDELDLNWESFGSTTFKREDMTAGVEPDDC FYIQNYKLMIGRDRINLTVDPPPDLAIEIDVTSKTKMSAYQALRVPEIWRYENGNLEI NLLQGEQYIKSQKSLTFTNFSVIEEIYQFVEMSRTIGTTPALRKFRKWVRES" gene 39815..41329 /locus_tag="DP116_02595" CDS 39815..41329 /locus_tag="DP116_02595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017716093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cryptochrome/photolyase family protein" /protein_id="PRJNA477356:DP116_02595" /translation="MTIGVWILGDQLWIEQAALQSCQDKVPVIMIESLHHVQERPYHR QKLVLVWSAMRHFAEELRELGYPVTYKLAEDFETPLQEWIQENQITELRVMTPNDKPF TQMIQNFASLHCKITLVPNNHFLWSVEEFKTWAKRRKRLIMEDFYREGRRRFQILMEE DKPVGGEWNFDKQNRQPPKGKLNTPSAKWFEPDEITQDVIAHVKSLSFPLYGEVEPFR WGVTRSQALEILDWFIKNRLPEFGPYQDAMVTGEETMWHSMISPYLNIGLLQPLEVIQ AAEKAYQQNQLSLYSIEGFIRQVMGWREYMHGIYHFVSADYPEKNYFEHTQPLPEFFW TGETKMNCLHQIITQLLRTGYAHHIQRLMVLSNFALIAGLSPQAVENWFHAMFIDAYD WVMQTNVIGMGLFADGGMLASKPYAASGNYVNKMSDYCKGCAYNPKERVGNNACPFNF FYWDFLDRHRSQLQFQGRMSFILGHLERMSLQELETIRQQARDWHVQQLSGEEV" gene complement(41732..41932) /locus_tag="DP116_02600" CDS complement(41732..41932) /locus_tag="DP116_02600" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02600" /translation="MRSRWRWANVILSTGTIAHSGTFGGVVGEGNACLGDDVAKTWTG SFGGVVGEWNASLEDNVAECSV" gene complement(41922..44608) /locus_tag="DP116_02605" /pseudo CDS complement(41922..44608) /locus_tag="DP116_02605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015175874.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 43770..43779 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(44678..45244) /locus_tag="DP116_02610" CDS complement(44678..45244) /locus_tag="DP116_02610" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02610" /translation="MKVFFKTLIFASVLAVAGTVENVQAVIGTIQIAQTQKAEVNAPI PLALPSTADVVLKNGSSMTGQVTAFDPNKQIIQISGSGVSRSLQIAQIQRVTFKRDGL VYTSDGRRVIRGEDNSQAQQSTWKNIPLNAFRFLNPRQASVDLATLMNSRDIRGIQGV AVKSLYVADEIQFQTAGKMTIKVTPTDP" gene 45679..46527 /locus_tag="DP116_02615" CDS 45679..46527 /locus_tag="DP116_02615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02615" /translation="MSPASQNSDRANLLGSFASLLGVLGIFLYFIGWIYRWAYFGFFE VEITTLNLPLESFLLVPIQVILGDFWIFIRTIIVVSITVVLIQFTLWLIRSPKVSSPA STSKSPISRFTQKLHGFWLLKPLRSFAQLFPQPFRHEIVIIAWILVALFWLARWQGTA DAYQDAVNNTSTRPIVTLVSPSDKMALGRNPDDLLTNLPLKNSRIIGDVNQFRQIFGR ETNDTTNPEQPIVWRLLIENNNWVYLFPAMQPGAKANQRPPVLAINTGDGQVQLLIRS RPKRLP" BASE COUNT 13417 a 9752 c 10564 g 13086 t 20 others ORIGIN 1 atacgccgca aaaatacaag ggttgtccat ccttgtcgag ggtttgccca accctctcct 61 taataaggag aggggatggg ggtgaggttt ttcgtttttt ataagtctat gacgcaccta 121 aggcatagac taacacaacg cggctgttgc acactcgggg gaaccctttg tagtcgccac 181 aacactgttg atccccattg atgtatatct taagggggat tttcatcgtg tttatggtta 241 atataaggtt atgaatacat cggtttttac cgtatattca gtccattttt gaaagttgta 301 gttgagaagt gccattgccc ctaaaatcgg agataaagtg ctgattctac gtccagcata 361 tgtggctgga cgagttggaa aagtctttgc caaagaagtt ttatcaggcg attatcctag 421 tgaacgctgg ctcattcggg tggattcaga agacattgtc gtgtctttga actcaaaaga 481 gtttgaggtc gtcaacgatg attttgagat atcttaggtg atgtctgatg atgattcgtt 541 gccaattccc aattcttttc tgctgcgttt gagctgtttc caaagtgcct tgatattttc 601 ataagcttcg tgtggtgcta gtttccctga ggtttctaaa caagtgatgt agctaacctt 661 ctgagaaaat tcttggagat tggcgttaaa cactaagttt tctggcttga tttgaccgta 721 gtagcgagcg cgagggtaga gaagtttttc tttgtctctt tgatgagagt gttccatgag 781 tgtacttcct atttataatt tactttaata ccatttaatc taaatttaat taaagtaact 841 tttataaaaa tttagcttac aaatgcttac gtatattttt cctaaacctc cacaatacca 901 actaagtgag taaaaagcaa aaatgagagc ttgttgtttt gattcatcta tgcgctagac 961 tgatctactc ttcataagtt ttttcaacaa ttccatcaaa gattcgtttt ttaacctaaa 1021 cttaagatta aaatacgaaa attcgagaga ataaccgtac ttaactaagg ttcttaaaac 1081 tataacatgg cagcatcaaa cacaatctag cgatttggga gagcttgttt ccactcaaga 1141 cgctggtcga tgcttgggta tctgttctag cttcttggtg ggtgtgcaat gagtgatatg 1201 tttttcagtc caaacgattg gcagagtgta gttttccggc taaccatagc tctactcatt 1261 ggtgttgtga ttggtatgaa ccgtgagcga gctggtagac ctgctgattc 
gtctggtagt 1321 tcctgatgtg cttttgattg cctcaggtgg atgatatgat gtattacata tcgtaaaagc 1381 gctaacactg cgagcagaca ttgctgagtt agtgatgcct tttttgcaag cggcttccca 1441 aagagaagcc gctgtttacg ctttagctaa agtgttattt gccgaaatca tcacggtgct 1501 attttgtact ggtaacgcta cactagaaca actgaagtgt tcaaaaagtt tacgacaaat 1561 aaaaggtaag aattcagcac tcagaagtca gaagccagaa tttataagga gtctattatt 1621 ggtatattca tttcctcaaa tgttctcctg attctttaac tattctgact cctgaattct 1681 ggctcctgaa ttcttacaat aaaagaaaaa taaaaaccct tagtttttag gttattgggt 1741 agtaagagtg aggaattaag tggaaatatt aagaagatac gaagtgatta gtcctcagtc 1801 taatgactaa tggctaatga ctaacaatta atgactaatg actgataaaa aatatgcgta 1861 attttctcaa acaaactttt gccagcctac tcggaagcct attgggactg ttcattttct 1921 gcggtgtggg aacgactgga ctgttgtttc tactgtttgc agcagcctca tccaaagatg 1981 tcggtcctgt tgtgaaagat aaatcagtgc tggtttttga tttgtctaca aacatcactg 2041 atggtcagcc aagttctagt gaattgcttc aaaatgcatt atcaggtgac gataacaaaa 2101 gaatctcact ccacacagtt ctggaggctt tggacaaagc acggcgtgac aaacgaattg 2161 ttggtatata cttggatgca actgacgcca cagaaggtag agtcacaggc tttgccaccc 2221 tcaaggaagt ccgtcaggcg ttggaaaagt gccgtgctgc tgggaagaag attatagcat 2281 atgggatgga ttggggggaa aaagaatatt acttaagttc tgtggctgat agcattgtgg 2341 ttaacccctt aggaggaatg gaaatcaatg gtttgagcac tcagccaatg ttctatgctg 2401 gggctttgga gaagtatggt gttggtgttc agattgtacg ggtaggtaag ttcaaaggag 2461 cagttgaacc atttatactc acaaagctga gtccagaaaa ccgcgcacaa actcaggaat 2521 tgttgaatga tgtttgggga gagtggcgtg ctacggtagg agcaagtcgc aaaataaaac 2581 cgcaacagtt gcaggcgatc gcagataatc aagctttgtt gttagcagat caagccaaag 2641 cgaatcgctt agttgataaa gtaggatatt ttgatcaggt ggttgcggat cttaagcagt 2701 tgaccaacag cgataaggaa gataagacat tcgctcaaat cagcctcaaa gattatgccg 2761 gagttgctgg aaagtcttta gctgttgaac gcaactcgaa aaataaaatt gcggtggttt 2821 atgctgaggg agagattgtc gatggacagg gagatgatgg agatgtgggg ggcgatcgct 2881 ttgccaaaat tttccgtcgg ttacgccagg atgaagacgt caaagctgtg atcctgcgga 2941 taaacagtcc aggtggtagc gtttctggtt ctgaagtaat acaaagggaa gtacggctga 
3001 ctggtgaaaa aaaacccgtt atcgtgtcga tgggtaatgt cgccgcttct ggtggctact 3061 ggattgctac agattctaaa cgcatttttg ctgaacctaa cacaattaca ggttcgatag 3121 gcgtgtttgg acaaattctt aatttccaaa aactggcaaa tgacaatggt gtgacctggg 3181 atgcagtgaa aactgcccgc tatgctgata ctcaaactgt tgctcgccct aaatctcccg 3241 aggaattggc aatataccag cgtagtgtta atcgaattta cgatacgttt cttaacaaag 3301 tcgcccaagg tcgcaaactt cccaagcaaa aagtcgctga aattgctcaa ggacgagttt 3361 ggtctggtgc agcagcaaag aaaataggtt tggtggatga cattggtggt ttggatgctg 3421 ctgttaaata cgcagctact gccgcaaatc tgggagataa ttggcaatta caagaatatc 3481 ccaaaggagg caatctcaga gaacgctttt ttggacaggt gagtgaacaa gcacgaactg 3541 ttttaggcgt ggataaagtg caactcaaag caccagatcc gttgatgagc gaattccaaa 3601 aactgcaaca ggaattagca attctgcgga agatgaacga tccacagggt atctatgctc 3661 gcttaccctt taacttgaag attgagtgat atagcaggga atagggaact ctgaacagtg 3721 aacagtgaac agtaaacagt gaacagtaaa cagtgaaaac tgataactga taactgacga 3781 ctggtaactg gtaactggta actggtaact ggtaactgat aactggcaac tcctctccct 3841 cactccctga ctttctcact ttcccatctc ccgacctcct catctgtgca aaaagagtta 3901 acatctgagc tggtgacaag aacagtatgg acgtgtaaag tagttatgga ctacttacca 3961 ttcatgagat tttatgcaac tgcctaagga cataaaaacc aaacgtcttt tgcttcttgt 4021 cgtttcctgt agtttggctg gtgtgattgt cggaggtacc gcaaattggg cagagagcaa 4081 ttcctgtttg caagccagca cagttacgaa tgaatgtcta acccaggatc aaacgaccaa 4141 aactatcgag ggaatgagca ctggtttgat agttggggct ggggcagctt ttggtgcagc 4201 atggcaacat cggcatgaag attgaaagag tttgctgatt tatggtctcg tagctagctg 4261 cctttaagcg cagttgcaaa cccattactt atacttaact gaatcaggaa acagatttaa 4321 acttaaaata atagcgatga ataaacccgg ttttttggag aagccgggtt tttgggcgtt 4381 gaacatattt acattgacat acttattcac ttcacaaaaa tttatggaaa aggcttctaa 4441 tttattagtt aaactttcac gtcgtttatt tcacaactta gatgaacaag agatatttgt 4501 tgaaacttta attcatccca aaccttttca tccctgtatt ctttggtgtc aagaaaagcc 4561 taaaaattct cccttttctg tagaaacacc gacagtttgg caaccacaat tcatagaccg 4621 tttatcactt ggagaaagac caggtcaaca tcccctacat gaacaaggat atttttattg 4681 
tctagatttt tcttctgtgt tcgcagcttc cattttgtta acaattccta gccctgttaa 4741 attagttttt gatatgtgtg ctgcgccagg aggcaagagt atctttgctt ggaaaagttt 4801 acaacctgaa ttgttaatca gtaacgaagt cattggcaaa cgcttgggaa tgttaatttc 4861 taacttaaaa cgttgtcaga ttaactcctc tgtcgtagtt agtaaagatt ctagtttgtt 4921 tgctgaaaga ataccttttt ctagtcattt agttatagtg gacgcccctt gtaccggaca 4981 atctttactt gctaaagggg aaaaagcacc tggatgtttt catccaactg ctattaataa 5041 aagtgccaat cggcaaaaaa gaattctagc aaattctgct caaatagttg caccgcaagg 5101 gtatctcgct tacatgactt gcacttattc tccagaggag aatgaggagg tttgtgaatg 5161 gtttttaaaa agattccctc agtttcaggc agttgagatt aatcatttac aaggttatca 5221 gtcgcattta acttctgtgc cttcttatcg gatgtttcct caagataggt tgggtgcagg 5281 tgcgtttaca gttctgttta aaaatactga agagggtgag ataaaggaga tatatgtcga 5341 gactttatca gcagtttgga tgaatatagc aataagatcc cgaacttcgt aaaagttgtc 5401 ggggatctgt ggctttcaat tctcacaaat caaataggaa aggcaatagc atatttatgc 5461 tagcctgtct aggctcgtca gttgtactgt tgattaacct cctcattagt ggataccgaa 5521 tcttttgtcg tgatatctgc aataaatttt cttttggttt tatgtaaaac taaatattgt 5581 gagaacttta ccaggtagat aagttgttta ccaatcaagc tggtgagagt taagtgaaat 5641 ccgtgcatca ggcactgacg aaatggctca gatacaacat tttcagaaca tatagccgca 5701 ttctcagcta attttaagtc atttatcagc gctagttcac gaacagaaat cattctagta 5761 aacagggtat gtcatcattt cataactcga tgacagagtt ctttagcaac tcaacaacag 5821 caacactgga acacattcat cacacgatac aacacaaatt ttagcaatta tggcaacttc 5881 aaacagcaac tcacaaatca acaaaactga gcaagaatgg cgtgaagagt tgacacagga 5941 acagttttgt gtattgcgtc aacatgctac agaacgtcct cacaccagcc cactcaataa 6001 gcagtatgct gaaggcactt atgtgtgtgc tgcgtgtggt cagccactgt ttacatctgg 6061 aactaaattt gacagcggta ctggctggcc tagctttttt aacccaattg agactgcaat 6121 tggcacatct gtagataagt cgttgtttat gaccagagtt gaagtgcatt gcaataactg 6181 tggtggacat ttgggtcatg tgtttgacga tggtcccgca cccactggta aacgctactg 6241 cattaacggt gttgctctaa agtttatccc acaagagtaa tatacctcac ccgcctccgg 6301 caccctctcc ttattaagga gagggttagc gtcagataaa gcaggaatca taactttacc 6361 ttgcacatca 
tgctcccttt gggttacatc cattgggagc atcgccattg atgacatttc 6421 ctagttcttt tcctggcatt attttaccaa tcttcttggt aattgtgggg taatcttggt 6481 acgattctac caaccgcaat ttcattaaaa gaagatctct gtaagtccag ctgtgttatg 6541 aaaagaagac aattttttcc ctttatcagc atagtcattg ctagcttgct tttggcactt 6601 agtttgcgat tgtttgatcc atctgctgtg gtagcacaat ccaaggttga cctcctcgta 6661 tctgcagccg ccagtcttaa agatgcgatg gaggaaatca agactacata ccaacaaagt 6721 aaaccaaata tcaacctgag ttacaacttt ggtgcttccg gtgctttaca gcaacagata 6781 gagcaaggtg cacccgcaga tgtctttatc tcagctggga aaaagcaaat ggatgccttg 6841 gaacaaaagg ggttattagt tcaaggaacc cgtactaact tggcaaacaa tagtctagtt 6901 ttggttgtgc ctagtaactc tacagctgtc accagcttta acactctcac agatgccaaa 6961 gtgaagaaaa ttgcgatcgg cgaacccaga agtgttcctg caggtcaata cggtgagcaa 7021 gttctggaga aattgaacct tttgcagcaa gtcaaaccca aacttgttta cgccaacaac 7081 gtgcgtcaag tgctggcatc tgtggaaagc ggtaacgctg atgctggtct tgtttatgca 7141 acggatgcca aaatctctaa taaggtaaaa gtggcagttg tcgctgatga caagtcccac 7201 tccccgattg tttatccaat ggcagtgctc aaaagcagta agaatgttga tgctgccaag 7261 gaattcgtcc agtttttaac cagcgagcca gctcagactg tactcaagaa atatggattt 7321 atcgtccaac ctgcaaaagt acctgcaatg agccgttaaa ccctggagtt tagttatttg 7381 gttttaaaaa cgtaagaatt cagcacttgt tgagtcagaa gtgagccagc gcggacgcca 7441 catgcctagg gagggaaacc ctcctcatgc gcagtggctc gtcttgggga ggacagtgcc 7501 gggggtagtc tccgaggtct tgggggtttc caacgccaga taccaagtga gggagaccct 7561 catcaagtac tggctcccca tgttagcgga gcgacgagcc gtaggctgcg actgccgaaa 7621 gggtctcccg acttgagaca tctggcgtgg gcttgaacct tcaaaggctg gataataggg 7681 gtgcaccacc aggaagatcg cacgttatac cctttcacaa aaatcctgat acatttgact 7741 ccccctactc cccctacttc ccctacttcc ccacttgccc accatgttct aactttaaag 7801 tgaaacggta ttacccctac acccttacac ccctagaccc ctacaccctt agttttagtc 7861 aagcttctga taatcgatct tgacaatggc gacatataat agtaggtgcg tcaacgagta 7921 catactcagc taatagtttc tgcgctcctt cattatgttg cctaaaccca aaaagcattg 7981 gacaccctca cattgggcaa gttggaagcc ctttggtatt ggcgagcaat atccaaataa 8041 ctactgggaa gtttttcgtg 
ccatttggct atctcgtcac aagttgcctt atgcgtggaa 8101 catcctgaat aagggtgttt gtgatggttg cgctttgggg acaaccggga tgaaagattg 8161 gacagtagat ggaattcatg tctgcaatgt ccgattgcgg cttttgcaga tgaacacaat 8221 gccagctttt gaccctgcaa ttttggcgga tgtctctgcg ttgcagacaa aaaagagtgc 8281 tgaactacgc gagttgggac ggcttcctta cccaatgatc cgccagagtg cagaaaaagg 8341 ctttcgccga gtcaactggg atgaggcatt ggaagcgatc gccagtcgca ttcgcgccac 8401 gacgccagac cgcttaagct tttatattac cagcagaggc accgtcaacg aaacatacta 8461 cgctactcaa aaagctgtgc gggcgatggg aaccaacaac attgacaacg ccgcccgtat 8521 ctgtcattct cctagtaccg ctggtctcaa aagtgcaatc ggtgctgctg caaccacctg 8581 ctcttacaaa gactggattg gcacagattt attagttttc attggctcaa acgtagcaaa 8641 caatcaacct gtcaccgtta agtatctgca caatgccaag aaggcaggta caagaatagt 8701 cgtgattaat acctatcgcg agccagggat ggagcgatac tgggttccct caattccaga 8761 aagtgctttg tttggtacta agtttgctga agatttcttt ttggtaaaca tgggcgggga 8821 catggcattt ttgaatggga caatcaagca tatgattgcc aacggttggg tagacgattc 8881 gtttatcaac cgctacactg ctggatttga tgagcttaaa gctttccttg aaactcaatc 8941 ttgggaagag ttagagcggc tttcaggagc gaaacgcgat gaaatgtatg cttttgccaa 9001 aatggttgga gaagcaaata aagctgtctt tgtctggagt atgggtatta cccagcatga 9061 gtgtggcgaa gacaacgtgc gagctattat taacttggct cttaccaaag gttttgttgg 9121 tcgggaaggt tgcggtttga tgcccattcg cggtcactct ggggtacaag gtggtgcaga 9181 aatgggatgt tacgcgacag tctttccagg tggtaagcct attatcccag aaaatgctgc 9241 tcagttgagc aaactctggg gctttgatgt gccagtcaca aaaggtttga ttgctccaga 9301 aatgattcat gcagcatctg agggacagtt agacgtgttg ttctctgtag gtgggaattt 9361 cttggaggtt ttgccggaac ctgattatgt agaagatgcc ctcaaacggg ttccaatgcg 9421 agtacacatg gatattgttc tttctagcca aatgcttgtg gaaccaacag atactgtggt 9481 gctgttacct gcaacaactc gctacgaaat accaggggga gtcacagaaa ccagcactga 9541 acgccgagtc attttcagcc cagaaattct tggtccccgt attggagagg cgcgtccaga 9601 gtgggaagtg tttatggaat tggcaaggcg ggtgcatcct gagttggcag acaagttgac 9661 ttttgttgat actgctgcta tgcgtcaaga aattgcccaa gttgtcctgc aatacgctgg 9721 aattcaacac ctcaaggaag caggcgacca 
gtttcagtat agtggttcac atctgtgctt 9781 tggctggaat tttccaactg cggatggtaa agctcatttt gcggtggtgt ctcgtggaga 9841 gcgagaatta ccagaaggtt gcttcctagt agctacgcgt cgaggtaagc agtttaattc 9901 tatggtacaa gagcgtaagg atgcgattac aggggcagta cgcgaggcgg ttttgatcaa 9961 tgaggtagat gccaaacact ttggattgaa tgatggtgat gtagttattc tgacaaacga 10021 gcttggtaat ttaaaaggaa aagtgtacac agcacccata aaaccaggaa atttgcaggt 10081 acattggccc gaggggaatg tgttgctgga taaaagtaag cgatcgcgcg aaggagtgcc 10141 agactataat gctgtggtgc ggttggaaaa gatttaaggt tatttggaac cacagatgca 10201 ccagcgttgg acgcagacac tcggcgggtt tcctgatgaa gtttgtagtt gaataactcg 10261 gcgaagcgcc gagtgacccc ctccacaacg taatttcata cttaactaat taataagtcg 10321 cagttcaatt ttggacagtt tgtttccaag ggtaatagat aggttgtagc acgtggcgtg 10381 cagataaaaa tcgtcacatc aacaattttt atcaccatca gcgtcggaga tgataatttg 10441 ctaaatgtga gcggcaaaat cgacctctat ccgagtcatt gcaatagttg gagccacaat 10501 caacgtcgtg gaaaggtggg aagaatgaat tagattttgc taaaaaattt atgttagact 10561 atgcccacag aggttagaca ttactttcac tcgtgactta accctcttca ggtttgcttc 10621 taccacaaaa tgcagcagaa ttattattct tgctgatttc atgcgaacaa atctttaaga 10681 ggacagattc atggaatcaa aagacacgac attcgagcag caggtatttg aactaacgaa 10741 ccaagaacga accaaggctg gtcttcaacc attgcaaaca aatgccgagt tgaactatac 10801 tgccgataaa tatgcacaaa cgatgtcaga aaatcgtttc tttagtcata ctggacagga 10861 tggctctcaa ccttgggatc gagctaaagc agttggctat gaagctcaaa caatggggga 10921 gaatatagct gcgggtcaaa aaacacctgg agaggttgtt caagcttgga tgaacagccc 10981 tggacaccga gctaatatcc ttagatccca atataaagat cttggcgttg gttttgagaa 11041 gaactattgg gttcaaaact ttggtagcgg tgatacgaac cccgctagct atataccagg 11101 ctctgaatct aacacacaga taccatccaa ccccacccct ccctcggaac cagtctccac 11161 gccaactcca ccgggagcta cttccaatcc taccactccc tcggaaccag tctccacgcc 11221 tacccttccc tcggaacctg tttccacgcc aactccacct gaatctactt ccaatcaagg 11281 tggtagtcct aactcgaatc agactatccc taagcctccg atatcttctg actcttttga 11341 gcctacagcg tacgagcagt atatgctcga attgattaac cgagctcgcg ctaatcctca 11401 ggctgaggaa 
caacgtcaaa atattcctct gacacaggga ctcagtccac aaagtatttc 11461 ttacgaagcg aagcagcctc tagcttggaa taccacgctg tctaaagcgg cacaagacca 11521 taacaagtgg caagagcaaa ctggtacaat ctctcactat ggtgacggtg gctcgccttg 11581 ggaaagggct tataaagctg gctatgatat gacggctcct cagagttcgc aggcgaacga 11641 aaacctcgct atgggaggtg ggtctacacc caaaagtgca actcagtacg ctgaagaaag 11701 acataatagc ttgtacggaa gcggtggaca tcgtgccaac ttcttcaact cagactggaa 11761 agaggctggt atcgatttct taggtcagca agctagtgat ggccaaaact tgacaaaatc 11821 atcagtcgtt gagttttttg gaaaacctgc atcagacaat accttcttaa ctggggtagc 11881 ttataacgac ctagttaagg atgatgactt ctacactcct ggagaggggt taggaggcat 11941 taaggtcgag gcagttcgtc agtcagacaa caaattgttc acaactcaaa cctccagttc 12001 aggtggttat caaatggcac ttgaacctgg agattataaa gttacctttt cagagggtaa 12061 actcaacgag gcaatcacaa atacagcaaa gattgactcg aaaaatgtca agcttgactt 12121 ggtcagtgac aaattggtga acggcattta tcattcaagc gactccctta ccggaggaga 12181 tcctagtgac atacttacag gtcaacaaag tgatggcatg ggctacgaca ggcttgaagc 12241 cgatcctgga tatcctactt ttgtagctag tcagggtgaa aacgttgact tgatcaaaaa 12301 tttagtaacc aatgttggaa acactgagcc attgccaggg cttgggatgg acaagtgttt 12361 gccaattgaa aatgggatgt ttaatcatac taagagtttg attgcagcac caggaaacac 12421 aataccaact ccaagcttag tgtaagcagc atcatgggtt agtatttgtc ttaagttagt 12481 tcaactcgtg gttgcgctac gcgcaaccac acattttttc atgagcgatc acgggcgatt 12541 gcccaaaggg cagacgcgac gagcgcgtat cgcaccactg ggtaagtctg aatcacccag 12601 acgtttataa cccgcaactt ggattagtga gctgatggca aatctgctgt aggttttcgt 12661 ttacccaaca gcaagaggag gggtaatgtt accaaaaacg cgactccaac aatccaaaag 12721 atatcttcaa atgacaaaat ctgagcttgg gtatcaatcg tttgatttat gatttggagt 12781 gcttgttgat gtgctgttgc tgcatcctct ccacgatttt gtaaagcgct ttccagcatt 12841 gatagacgtt gattggtcgc atcatcgtag ggggtgacat gctcaaccag aatttttcga 12901 tggaatgctt gccgttgcgc tagcaatgtt gtcagcaccg caattccaat acttccaccc 12961 aactggcggg tgaggttata aaaccctgaa ccggcagaga tgtcttgctt gggtaaagga 13021 ccaagggctg ctaaactcaa aggtagaaac atcattactg tgccgacacc ccgccaaagt 
13081 aggggataaa acaagtcatc gctacttgta tctgggttaa tgcctgcaag ctgaaacatg 13141 actaagctgg tgagaatgcc tccaccagca ataattgcac ggggatcgat tttacttgtg 13201 atttgtccaa gcatcagcat agttacggca gatgctaacg caccaggaaa cagcagcatc 13261 ccggtttgag ttgccgtgta atgcagtacg ctttgagcaa agatgggcac tgcaaataaa 13321 gcaccataaa gtcccattcc tagcacagca gaatagagac ttccagcagc aagcgaacgg 13381 tgtcgcagga ctcgcagatt tactgcagga ctatcagtag tcaactcgtg ccaaatgaac 13441 agcactagcc caatcacact aacaattgcc agtgtggtaa taaaaccaga ttcaaaccaa 13501 tcttcctttt ctccctcctc taggaaggct tgcagacaac cgagtgcgat cgccctttgg 13561 gcggctccct gctggagcat cgccaacaat ccaatcccca accaatccac tttattgctg 13621 attggtttgt ggtttttgtc atcgcctggc aagaacgtta aagccatcac cacagcgaga 13681 ataccaaacg gcaggttaat gaagaaaatc caccgccatc ctaaattatc ggtcaagaaa 13741 cctcccagtg taggaccaat cgctggacct gcaatcacac caaccccaaa gattgcttgt 13801 gctaacccct gttgactggg tggaaatgtc tcaaacaaaa ttgcttgtgc ctttgccaat 13861 aatccacccc cacaaagtcc ctgcaaaatc cgcgaaataa tcagcatggg gagattcaca 13921 gcaaatccac atagaataga ggagatagta aagccaatta gagaaaagat aaagtaagtc 13981 ttgcgcccaa aatagtcgcc tagccaagca gtcaacggaa tcaaaatcac gttggcgagt 14041 gcatagcctg tcacaaccca accaatttcg gtgatagttg ctcccaagct gctttgcatg 14101 tctgtcaaag caacattgac gatgcttgta tcaatgactt ccaatatagc accgagggac 14161 gctgtgatgg cgatcgccca cttcagccac cccggttcag ccttgtcaga tgctttatgt 14221 cgagtttgag tgcgagccat agataatgat gaaacctgaa aaattaaatt taaaaatgaa 14281 attcaaaaat taaataacag ctgtactgat acctcgcaca aaaatatcaa cgcaagattc 14341 gaggtagtct tgtttggtgt agcttaatga agtagatggc gcattgcgcc gtaacattcc 14401 tgcaagcagc atccctgtaa acatatcaat agccactgct ggattcacat ctgttcgcac 14461 tgttcctttt tcctgaccat tttgcagata agctataagc ttttccttga ccggcttggt 14521 agcttcttgc atcacgcgtc gcgctgcttc tggatgtcgt ttggcttccc caatgaaagt 14581 gcggattaag tcttctgttg cttccagcat ggtgttgtaa agctgggcga aatgtcttaa 14641 atccgtccac aagtcattag tccattcctc tggttgggaa agggtttctg tctgaagtgc 14701 cagcgcattc tcaataactg cacccaacag ttgctcttta 
ctggcaaagt gacgaaataa 14761 agtgacttcg ttaacgcctg caacacgagc aatttcacgg gttgtcgcgc catgtaatcc 14821 ttgctgagca aaaacttgca tggcagcttg aattatccgg gtacgcgtaa gggcgacaga 14881 acgagcagtt ttgttcatca ttttgtatgc aagtgcttac ttacattata gtgatacttg 14941 gtttaaagtg ggttggcgaa gagagaggct tgtgaggctt gtcgccagat atggcgtttt 15001 tcaggctata tttaaatgac ggcaagttgc agttttatca gttaccagtt accagttacc 15061 agttatcagt gagccagcgc ttgatgagtg tttcccgaca gaggcgactg gcgaacccga 15121 agggttacca agttaagggg ggttaagaaa tcgctgactc tgttcactgt tcactgtttg 15181 aaaggtagag gaaataaact tgtcctgaaa atcatggaag tacaaccaag ggagattagg 15241 aattacctca cagatgacgg gagaaatact ttttcggagt ggtttgattc tctgcgagat 15301 agaagagcaa aagctaaaat cagagcaagg cttgaccgag tggaacaggg taatttaggc 15361 gattataagt cagttggaga tggagttttt gaactgagaa tagattatgg ttctggctac 15421 cggatatact ttgggcaaga agggttaaca attattattc ttttgtgtcg tggtgataaa 15481 agcactcaat aaaaagacat tcccagagcg aaggaatatt tggaagacta taggagtaga 15541 gatgatgcct agaagtacaa gctatcacga aaaactgatt tgggacctca aagatccatt 15601 agaagcagca gcttatattg aagttgtttt agaagaaggc gagccgaaaa tgttaggtaa 15661 ggcgctgaaa aatgtgattg aggcacaagg tggagttgat aaactttatc cagaagttaa 15721 gcaattttac gacaaacttg accagatgtt atccgagaaa ggagaaattg aattttcttg 15781 tctaagtgcc ttgctggatg cattgggatt gcagttggca gtaacagtta agtcaaggtg 15841 agaatatttt ggactttctt cgcattcact tgtatccact ccgatgcaac ttgtctgact 15901 gctgggtttg ttgcggatct tccagccaac cataaagccc caagaacgca tcgcgctgct 15961 atgaataacg ggagttgctg ttcaatttcg agactcagtt ctctgtgcga tcgatatcct 16021 tcaagtaagt ttgttttgat cacaagataa tcagacgttg cccagtatat tgctaaaggt 16081 acggcaagat cataaacata atatccccag cccaagtcat caaatcgatg gggataatgg 16141 agtgtccatc atagagagca ttaccaagat gtaagtctcc atgaatcaga ccgaagacgt 16201 tgggattttg tccaatcgtt gcttcaatgt ccaacagccg attatggatt gcttgaaagt 16261 cattggctgt ttcgctgtct aaataactat agccgaatgg tgcgtatcct aaggctccat 16321 ttgcactggt aagcccattt gcatcaagtg ctggtcgttt gaaatccctt ggtttttgcc 16381 attctgagag tttattgtga 
atgtgggcta taagctgccc aagctttacg aatacattta 16441 agtcactatt atgagaacct attggtgggt aatcaatcca tgaaagtatt gaaactggct 16501 tcgagcttcc ggaagcagtg ccaatcgata caaagttacc cgaacggttt cggatgggtt 16561 tttgatatgt ataatcattg taactgcgaa ggtattcgat gatttttgct tcgcactcaa 16621 tgtcttgtat ggtacagtag actcccctat gcacacgaag caagaaattc cccttattgg 16681 tagataattt gaacgtggtg ttttcccagt gacgtagcag ttcgagtttg actaaatcga 16741 actcatattg atggattgct aaatgagcaa tttcagaact cagggtaatt tgcttttctg 16801 ttgaatcagg cgtcatagtg aaattttaac gctgctaggt ttaaatacac ttaactcatc 16861 actaatgacc aatgattaaa catcctggaa gcaagaccaa aaccacaacc tgggtagtag 16921 aaaaagctca agtacgccct cgacaagacc aactaaccac agaagaaccc ttggaaattc 16981 gccttgtctc tccccaaaag acagtagctg tgaccatgcg aacaccaggg gcagattttg 17041 aactggctgc tggttttctc tactgtgagg gggttgttag ttataaagaa gacattttac 17101 gtatgagtta ctgcgtagat gatgtagatg gcgagcaacg tcaaaacatt gtaaatgtta 17161 cccttcgaga gggtttgaat ccagatttac agcctttgga aagacacttt tatacaacaa 17221 gtgcctgtgg agtttgtggt aaggcaagtc tagaggcttt gcgtttacgg ggttgtccgg 17281 ttattcttcc acaacccatt gttactgctg aaattattta caatttacca gataagctgc 17341 gagcggctca aggtatattt aacgctacag gaggtctgca tgctgcggct ttgtttgacg 17401 accaaggaca gctgctgaac ttgcatgagg atgtagggcg tcacaatgct ttagataaat 17461 tgataggttc agctttgctg agtgagcagt tacccttaag tcatcatatc gtaatggtaa 17521 gcggacgctc tagctttgag attttgcaaa agtctacagc tgctggtgtt ccaattgttt 17581 gttctatttc agcccctagt agtttagcag tatctgttgc aaaggaattt gggattactc 17641 tcattggatt tttgcgaggg gaacgtttca acgtctacac tggtctggag agaataagcg 17701 tttttaggtg acccaccaga gggatattca acacaaacca gctacagcag ttacaagagc 17761 agaataagaa cttgacttta tccttgagat gacttttcaa tacactgatg cattcgtcac 17821 tctggcaact tttcagatag aaaacttagt tggtttctat acccaatttc tcggtataga 17881 accaactacc tacatcccaa atatttatgc tgagttccga cttcctagtt tcaaattagg 17941 tatttttcaa cccaaacaaa tccatttttc tgaatttgaa aactcagcta aaagtaaggt 18001 gagtttgtgt ttagaggtga gtgatttaga aagtgttatt gcttacctaa gcgtattggg 18061 
gtgttcacca ccaaaagagg tgatgactgc ttctcatggt agggaaattt acgcctatga 18121 cccagatggt aacaggataa ttatacatca gtcaaagtag atgttttatt cactactatt 18181 atggctaaag ttttacgcta attcgctcgg gagcgtgaaa gctccgaagt gtgagtctca 18241 atggggtttt ggcagtcccc agagtctcgc actctcccct aatactgttg tgaaacagtt 18301 ctagaaatag taaggggtaa cttcgtgaat acagtttcat gggagactgg gctagaccga 18361 agaatctgcg tgtcaaaaga gcgcagagtg tcaaagtaag actcttgcaa atgttaatgt 18421 catgaaattg aaaccataca ctaacacaaa tgcacaggga tgatttatct gtgttaatct 18481 gtctacatat gtagttttta acagttatca gttatcagtc agcagttatg aatttcatcg 18541 cttgggtaaa gacccatccg ccagacaaat gatggaaggc gttgattgcg gctcgggcat 18601 cctccactat acgctgctta actgtgtagg tagctattta ctattttaat atttttctat 18661 tcagtgatag gttacaattt gctaacacaa cctatagatt tcatttcatc agtttgccag 18721 tcaataatca acagtccttc gtcatttgac aaaggacgaa ggacgaagga taaagaataa 18781 atgtcaataa ctcaaaacta taaattaaac cttattcaat ggtatccagg acacatcgct 18841 aaagctgaaa aaaaattaaa agaacatctc aagcttgttg atgtcgtact ggaagtacgc 18901 gacgcccgca ttcctttggc aacacaccat ccccgaatag gagaatgggt ggcaggtaaa 18961 acacgggtat tagtgctaaa ccgagttgat atgattttgc cacaagtcca gcaagtgtgg 19021 acaaagtggt tcaagagtca gggtgaagta ccttatttta ccaatgccca acatggtcaa 19081 ggggtggctg cggtgttaaa agcggcgcaa gcggctggag gcgcgattaa tgaacgtcgg 19141 aacagtcgcg ggatgttacc tcgtccagtc cgtgctgtgg tgattggctt tcccaatgtt 19201 ggtaaatcag cccttatcaa ccgtctattg aaacggcgag tggtggaaag tgcagcgcgt 19261 cctggggtaa ctcgccaatt gcgttgggtg cgaatttctg aggaattgga attgctggat 19321 gcccctggtg ttattcctct taagttggaa aatcaagaag cagctttgaa gttagccatt 19381 tgtgacgata tcggtgacgc atcttacgat aatcaaatag tcgcagcaac tttagtagat 19441 atattgaaaa atctccgagt taatactgct gattttatac cagaggagcc gttagaatca 19501 cgttacaaac ttgacccgac ttccctaacc ggagaagaat atatgtttgc tttagcagag 19561 tatcgttata aaggtgacgt cgaaaaaact gcacgtacgc tactaacaga ttttcgtaag 19621 ggtttcttag gtgaaattcc tttagagtta ccacctggat aagcagtagg gcaagatgtt 19681 gaccatactc gccaggagtt gatgggttgg gtaatctatt tgtggtcgtt 
ggaatgctat 19741 tgagagtgca atctttgaaa aagtgaaaag tgaaccctga atatatcctt tgctttatca 19801 atgctggcat tctctgcgtc cagaatggat ttttataaag gtcatgacta aactgaaact 19861 gcgcctaaaa gaggataatt ctgaaaaaac ggttacggtg catcaagatg tttttactat 19921 cggtcgttta ccacaatgtg atttatattt agtctccgga ggagtttcac gttaccacgc 19981 ccggattatg aaaactcctt gcgatacgtg gactattgag gatttgggca gcaagaacgg 20041 gactcaatta aacgaacatc tgattaattc tcctcaacag ttgcaagacg gcgatattat 20101 ttggctaggg gatgtgtgtc tgacaatcgt gttgagttct gttgactcat caatattcag 20161 tcagggagtt gtttcgccag gaataacaat tcttcgcgac gttgaacaat tgcaacagca 20221 atggattctc gctgataatg tctgtggtga tgttggcatc aaagacaaaa cgatcgcccg 20281 cctgaaagac ttagtcaata tagccaaaaa cctatctgct gcagcttcaa tagaggaaat 20341 tttctctcaa gttcaagaag tcgtgtttcg ttacctcaat agtattgacc gtttgggatt 20401 attaattgat gttagtgggt ctggtaaact agaactatta aacgctgcga cgagaaatat 20461 ctcttaccaa gaagatctgc cagctgatgg cagttggatt agtcgtagta tatgtcaaaa 20521 agtcttcgag gaaaaagttg ctattcaaac tgctgatgct caaaacgatg aaaggtttgc 20581 aggagaaaat agtcttctag tcaaaggtat tcgtagcgcg atggcggtgc ctttatggga 20641 tgagaataag gttgttggtg ttctttacgc tgatgctcat ttgtcttctc atcattgggc 20701 agaagaagga gaagaagaac tgagcttttt ttctgcctta gcaaaccttg tggcttctag 20761 tgtacaacgt tggctgttgg ccgagaaact caaaagcgaa gaagtcattc gccgaagact 20821 cgaacgctat cattcaccag cagttgtaca gcagttgatc gctgttggtg cattgccaaa 20881 tggacgttta cctccacaag aaagtgaaat tagtatttta tttgctgatt tggtcggttt 20941 tacggcaatt tcagaaagat tgacgccaac tgacattgcc gacttgctca ataatttgtt 21001 tgaggagatg ctgcaagagg tgtttgctgg cggtggcact ttggataagt atattggcga 21061 ttgtattatg gcattttttg gtgctcctga accacaaccg gatcatgccg atcgcgctgt 21121 cactgctgct atgggtatgt taactcgtct ggaaaatctt aatgccaaaa atttttggac 21181 cgaaccacta caattgcgaa ttgctattaa cagtggtaag gctgtggttg gagatgttgg 21241 tagttctcaa agggtcgatt atacagcatt aggagccaca attaatctgg ctgcacggat 21301 ggaagcaatt tgcccaccag gtgaatgtgt tgtcagtgaa gatacttaca caatgctgtc 21361 acaaccctcg tccttcctag aaatgggaga 
ttatcgtttc aaaggtatca atcgattagt 21421 caagatttat cgaactaaga tgcactaatc aaagattttt tctataaaac tgccaaaata 21481 gggctataaa ggttatgatt ttaggtttaa gagttattct gacctaattt cataatccaa 21541 acctaaaaaa gatattaaat ttgaaattta tttttgatca cagtgtcacc tcctgttaca 21601 atcaaccagg gtcagtgaaa tgcatactcg cttagtggag attgtaagca taaatacata 21661 gaaaatatgt ataggttgtt tcaatttgaa ttgcaattga aagtagataa gcattttttg 21721 aaagactaca atttgaaatg tagcttccat aaagggtgtc tgcaacttat ggatgctacc 21781 ctaggtaaat ttttaacggg gaggtcaggg tatggctatc gccacaatta atcctgcaac 21841 tggagagttg ctgaaaactt ttgagccatt aaacgatgca gaaatcgctc agaaactaga 21901 tttggcacaa caggcttttg aaaagtatca gaaaatttct tttcaagaac gctctgtttg 21961 gatgcaaaaa gctgctgaca ttttagagca agaaaaagca gattttgcca agattatgac 22021 gttggagatg ggcaagcctt tgaaagcggc gatcgccgaa gtagaaaaat gtgcccaggt 22081 ctgtcgctac tacgctgaac acgctgctga atttctggct gatgtcaccg taaaaaccga 22141 tgcaagccat agttttgtca aataccagcc attaggcatt attctcgcag tcatgccgtg 22201 gaattttccc ttttggcaag tgttccgatt tgttgcacct gcactaatgg cagggaatgt 22261 cggattactc aaacatgctt ccaacgtccc acagtgtgct ttagcaatag aagaaattat 22321 acacaaagca ggtttcccaa caggcgtatt tcaaactcta ttgataggtg ctgccaaagt 22381 tgctgacttg atgagtgatg atcgcgtcaa agcagcaacg ttgacaggaa gcgaaccagc 22441 aggggcaagt ttagccgcag cttcaggaaa acagattaaa aaaaccgtcc tggaattagg 22501 aggaagtgac ccatttattg tattggaaag cgcagaccta gaagcagcag tagccaccgc 22561 aacaacagcg cgaatgttga ataacgggca atcatgtatt gcagcaaaac gcttcattgt 22621 cgttgagaca atagcagaca agtttgaaaa gctactacta gagaaatttc aagcgctgaa 22681 aattggcgat ccgatgcaag cagaaactga cttgggtcca ttggcaactc ctgatattat 22741 caaagactta gaccagcagg tgcaagcagg tgttaagaat ggggcaaaag ttttaacagg 22801 tggacatgct ttatcagatc gtcctggtaa ttactatcca ccaacaattc tcacggatat 22861 ttccccagat aatccagtgg cgcaggaaga attttttggt ccggtggcga tgttatttcg 22921 tgtaccggat attgacgcag cgatcagaat cgcaaacgcc acaccatttg gcttaggcgc 22981 gagtgcttgg acaaagaact ccgaagaacg cgatcgcctc atcgaagaaa ttgaagcagg 23041 ttcggtattt 
atcaacggta tggtaaagtc cgatccccgc ttaccctttg gtggaatcaa 23101 gcgttctgga tatggtaggg aactgagtat ccaaggtata catgagtttg tcaatgttaa 23161 aactgtgtgg gtgaaataac aaaccacatc aggtgcaatc aacccagagg aacaataaga 23221 ggagatatga acacagctga attactcgta cagtgtttgg aaaatgaagg agtgcaatac 23281 gttttcggac tccctgggga agaaaatcta cacgttttag aggcgctgaa acattcttcc 23341 attcagttta ttaccacccg tcacgaacaa ggtgcagcat tcatggcgga tgtctacgga 23401 cgtctgacag gaaaagcagg agtttgtctt tcgactcttg gtcctggggc aacaaacttg 23461 atgactgggg tagcagatgc taaccttgat ggtgccccct tggtagcaat taccggacaa 23521 gtcggcacag atagaatgca catcgaatcc catcaatatt tagatttggt ggcgatgttt 23581 gccccagtga caaagtggaa taaacagatt gtccgcccca gtattacacc agaacttgtg 23641 cgcaaagcat ttaagcgggc acaaagtgaa aaacccggcg cagttcacat tgatttgcca 23701 gaaaatattg ctgccatgcc tgcagaaggc tatcctttgc gtaaagacaa catcgaaaaa 23761 accttcgctt cttttgcaag tattagggca gcagccgcag caatttccca agcagttaac 23821 ccaattattt tagttggtaa tggggcgatt cgcgaccaag caagcgatgc cgtcacacaa 23881 tttgctagtc agatgaatat tcctgttgtt aacactttca tgggcaaagg tgtgattccc 23941 tacactcatt ccttagcttt gtggtcagta ggattgcaac agcgagattt cattacctgt 24001 ggctttgata gcacagattt agtgattgct attggctatg atttgattga gttttccccc 24061 aaaaaatgga atcccgatgg gagaattccc attattcatg ttggcgtaac ccctgcagaa 24121 attgatagta gttatatccc taacgttgaa gtcgtaggaa atatttctga ttccctcttt 24181 gaaattttaa agtttgcaga ccgggaaggt aagcctaatc ctcatgctat cagtttacgg 24241 acaaatatcc gtgctgacta cgaagagtat gctcatgatg aggggtttcc aattaagcct 24301 caaaaattaa tttatgactt gcgacaagtg atgggaccag aagatatcgt catctcagac 24361 gttggcgcac acaaaatgtg gatctgtctc nnnnnnnnnn gccctaatac ttgcattata 24421 tccaatggct ttgctgcaat gggtattgct attccggggg ctgtagccgc aaaactcgtg 24481 catcccaacc gcaaagttat tgctgcaaca ggcgatggtg gctttatgat gaattgccag 24541 gaattagaaa ctgctttacg cgttggtaca ccctttgtca ccctcatttt caatgatggt 24601 ggttatggct tagttgagtg gaagcaagaa aattactttg gtaaaggcag atcatctttt 24661 gtgcattttg gcaaccccga ttttgtaaaa tttgctgaaa gtatgggtct aaaaggctac 
24721 cggattgaat ctgttacgga tttggttcct gtgttgaaag aggctctggc acaggatgtt 24781 ccagcagtga tagattgtcc tgttgactac cgcgagaatg cccgtttctc gcgaaaagcc 24841 gttgagttga attgtacggt gtagggagat tttcctgacg ccaagacgag ccagtactgc 24901 aggagggttt ccctccgtag gtatctggcg tcgctaagat ttcttggcgt cctggcggtt 24961 ttcgttgcgg ttcgtttttc atgactccgc ttctagcgcc cttgcagctt catcaagcct 25021 atcgttggca tctttggccg gagtagtcgt aactgtacgt tgagaaatag ttgattcgta 25081 gctgacatac tgctgacttt taggtgctaa aaaaatagct agaatattgg taggttttta 25141 ataaggagta atggcgcgat cgcttaaagt tgcccaagag tacattcaga aagttaaatc 25201 gtctcttcag cgtaacagct atcccagcca gaaagcttta gctgatgatc tggggatatc 25261 tctatccacc gtcaaaaact ttcttagtag taaacccgtt gattaccagt attttgtaga 25321 aatttgccag aaattagggc tagactggca agaaattgca tttaaagaac cagacactca 25381 acccaactca agcaaaacct ttgaggaaac ctcacctttc atcactggtt cacccatcac 25441 ccatcctcgc cacttttttg gacgacaaaa acaactcaag cgtcttttcg acttactcaa 25501 acgccgtccc ttgcaaaatg ctgcaattat cggtaagcgg cgcattggca aaacctctct 25561 gctgcactac ctgaaaaaca tcaccaccac tccaccagaa cagttgcgtt ctggtcaaaa 25621 gtacgactgg ctaccgcatc cagaaactta taagtggata ttcgtagact ttcaagaccc 25681 gcgtatggcg agtagagaaa ggttattgag ctatatccta gaatgcctga gtttaaaagt 25741 accaacacct tgtagtttgg attacttcat ggatgtagtt agcgacaatc tacataatcc 25801 cacagtcatc ttactggatg aaattggcgt tgggttgcaa cgctgtccag aattagatga 25861 tgagttttgg gagagtttgc gctcattggc gactaaccac acgaggggaa atctggcgtt 25921 cgtcttagct acccacgaat caccgattga acttgctcgt aatacggggc atagttcacc 25981 cttcttcaat attttcggct ataccgcaac tttgggagcg ctgactgagc cagaagcacg 26041 ggaattgatt gctagttcac ctattacttt cgctgaggaa gatgtagagt ggattttaca 26101 acaaagtcaa tgcttagcac tcttgctgca aattctttgt cgggaaagat tgtttagtct 26161 agaagatggg gagactgacg actggtgtga ggaggggttg cgacagatag aaccatttgc 26221 tcatttgttg gaaagaaaca ggcagaggtg aaaaaatgcc tttgagcctg ctatacctct 26281 agagagattg gcaagtgctg caatgataag actggataca catatctgga tttgctagta 26341 agccaccaca gccttttgat aagtaatcca taagtcaacc 
attaataaca tattaatcaa 26401 ccaataaatc aactaaacta cagccatatt tagaagggta tataaattag ttagttctag 26461 catagtgatg actaaaacaa aaaactagga gagtcgttac ctcacctagt tgatagtcat 26521 aatcaaacta aatcaactga tagcctagac tagctacggt tagtgttcat catcgctttt 26581 actagatcga tccatactat taaccagttc gatgagaaca ggaattaaaa ctggatcgtt 26641 gagatggatg tcaccggaaa tgacaccata aaccagcaag gcgacgatta acagtcggca 26701 aaaaagagcc attgaaatct ccttattgtt caaaagtttg ttagacttct gaactcaggc 26761 cttaaggaga gggttcagag gcagcgaagt gcccagccat aagtacattg ggcactttta 26821 gctgtttgcc tgaaataaaa ttagcacact tattcaaata gatttattaa ttttctgtta 26881 agcatttatt ttccttatat agttgtattg atttgtgtat ccgtaattat tagttatctg 26941 cttaccaaac atcggcgcgg gactgctgag gtagcaaaga ttgaacagga tctacaaagg 27001 ttagaacgtt gtgcaaatgt aaaagatatc cgtaacgctc accaaagtct gaaaattgca 27061 gaactggaaa gtccagctag ccctctgcta cgtatcttca gtcgcattag tgaagatgtg 27121 aacgctgccc tcaaccaaaa aagcgcctac aaccagcgtc tagtactcag gattgttgtt 27181 gatcgtttag atgcacagtt gcaaaacttg actcgcagca cagaaaaata cgcccctcgc 27241 ttccgcccca tcgcccaaag ttggcgtgaa ataatagtta actctgtaga ggaactcgcc 27301 aaagaaacag aacttctcca agagattgac aatccctaca taattggtgt tcccctaaat 27361 cagcaattag aaatctttat cggacgcagc agcattggtt tacgcattga ggaattaatt 27421 ttagaccgcc gtcgcccccc tctgctactc tacggtcaac ggcgtatggg caaaacttct 27481 ctgctgaaca atttgggaaa gttgctaccg aatagcatta tccccttatt tattgacttg 27541 caaggcgcac cctcttcagc aagcaacaat gcgggttttc tctacaatct cgctagagga 27601 atgataacct cagctaaaaa acaaagcgca ttaactttac catccttgac acgggaagat 27661 ttagaaaaag accccttcac ctgctttgat gaatggttag ataaagtaga agaggcgtta 27721 caagataata cagcactgct ttccctagac gaattcgagg tgctagacag cgcgatcgcc 27781 aaaggacgct ttgatgaaca agacgttctg ggaatgctgc gtcacctcat acaacatcgt 27841 ccccgcttca aggtgctgct tgctggctct catactatcg aagaatatca gcgctgggct 27901 agttacctta ttaacgtgca agtcgtgcat atcagctacc taaaagaaga cgaagcccgt 27961 caactcatta aaagccctgt caaagatttt accttacgct atgaaccgaa cgccgtggag 28021 cgagtgctgc aactcacgcg 
ctgtcacccc tttctagtac aactgctgtg tggtgaaatt 28081 gtcgccctca aaaatgagca agatccttcc gtccgccgac tggcaacttt agcggacgtg 28141 gaagcagcag ttccagaagc tttgagtagt ggtggtttct tttttgcaga tattcaaaat 28201 aaccaagtag acgctgcggg gttagctatc ttgaaatact tggcagcaca agaagaaggg 28261 gcaatactca ataaaagaac tatattaaac aaacttcctg atgtatcgga aaacgctttt 28321 aaactcttgt tacaacgcga gttgattgaa gaagttgagg acggctttcg cttccaagtc 28381 gagttgattc gacggtggtt tgctcaaatg tagagatgac tcataactag ctcaaccaga 28441 ttaaacgtaa aatgtcattg cgagcgaaac gcagtgaagc gaagcaatcg cagggtttga 28501 gattgcttca ctccgctatc gctacgtatg ccctccgggc acgctacgct aacgcaatga 28561 caatatccgc acatgatatt attcagccat ttgacataca cctaaagatt gaaatttaac 28621 cttctagtaa tacaatcttg gtgtccttat atccggaagt ttatgcggtt ttttacacac 28681 ggttaccgct gggtatttct aacgcttttt gtgattgcag ttatcgtagc gtgtacagca 28741 tcgcctggaa aacgttctgg gcaaacaaca caacagcaag caacacaaca taatatgatc 28801 attttcgttg ctgatgggct gcgtccaact tcgataaatg ctaccgatac cccaagtatg 28861 aatgagatac gggaacgggg tgtcaagttt accaatagtc actctttgtt tccgactttt 28921 acgactgcta atgcttcggc gatcgccacc ggtcactatt taggcgatac tggtgacttt 28981 agcaatacta tccaagtcaa tgccccggtc aaaagtgcca aaaacagcct tgtccccttc 29041 ttagaaaaca atgctgttct ccaagaagtg aacaaacagt ttggtgctaa ctacctcaac 29101 gaacaaactc tgctagaagc tgcaagaaaa gcaaatttca gcactgctgc agtcggaaaa 29161 attggaccag ttttgattca agatgtgact ttgcaaaaag gtgaaccaac catcattttt 29221 gatgacgcca ctgggacacc tacgggaatt tccttgagtt ctgaagtgag tgaacaactt 29281 gccaaaaact cgttaccaac agcagcacct tcacgcggtg acaacggtaa gcctggcgac 29341 agcaagactc ctggtactaa agttgctaat acaactcagc agcaatattt tgctgatgtg 29401 acgactaagg tgattctacc actgttcaag caacggcaaa aaccctttgt gttagtttat 29461 tggtctcgtg atccagacgg tacgcagcat aatcacggcg atagtctcaa ccagcttgtt 29521 cctggtatta atggtccgac agtgcaagca gcgcgtcaga atgttgacaa aaacctggct 29581 caaattcgca ctgcattaaa ggatttgggt ttagaagaaa gcacaaatat attcttgact 29641 gctgaccacg gattttcgac tattagtaag gagactaaaa ccagttattc agcaacgctt 29701 
tcttatccgg atgtaccaaa aggttttatc ccagctggtt ttgtggcgat tgatattgct 29761 caggagttga agttacctct gtttgaccca gataataaga atgcaactgt agatccaagt 29821 aaagggcagt tttctaaaaa cggtatcatt gggaaagacc ccaaaaaccc agatgtgatc 29881 gttgctggta atggtggttc tgatttactg tatctgccaa atgcagcaaa caaaaaagca 29941 actgccaaaa aaattgtaga tttgctgctg aagcaggact atgttagcgg gatatttgtg 30001 gataattctt tgggagaaat tcctgggact ttatcaatgg aggcgatcgc tcttcgaggt 30061 gcagcacgca ctcccaagcc atccatcttg gtcaactttc gttccttcga tacaggttgc 30121 ggtaatccta cggcgtgtgg tgcggaatta gccgatacag gtttgcagca aggacaagga 30181 atgcacggta gctttagtcg tgctgatact tacaatacaa tggcagcaat tggacctgac 30241 tttaaacggg actatgaaga tttagctccc gcaagtaacg cagatgtggc agtcaccata 30301 gcgaaagtac tgaaattgaa gttgtccact cagggtaaac tagttggtcg agtgttaaat 30361 gaagccctaa tcagtggacc tgatagcgtt gagtataaat cttttacgct gaaatctcca 30421 tctgctgcta acggtttgaa gaccatcctc aagtatcaaa cagtaggaaa aactcgctac 30481 tttgatgttg ctggatttcc aggtcgtacc cttggactat gattaaacca ctttgacaca 30541 gcatggactg tctaccagcg atttgtgact catagtttgg gaatgagtat tgaaaattaa 30601 agtcaattgc acagaagcct aagtttctgt gtttgaatca atccccaatc ctgaatagct 30661 tcgttctaaa tgttcccaga gttcagttat ctgctggtaa gcatcttgtg gagagagctt 30721 tccaagagtc tgtaagttgc taatgtagcc aactcgtgtt gcaaattctt gtaaattagc 30781 gttgaagact agagcttctg gcgtgaagtc accgtggtaa cttcgtcttt gataaagaaa 30841 gtcatattta tctatttgcg gttccataat ccaccttttg cttacaacat ctctctaatt 30901 tcttttatat gtctcccaaa cgataaaagc aaatgttttt atttgtcgtt ttaatcaata 30961 aaagtgatta aaacagtgaa ctgttaagag ttccctcttg caacagtaaa gaaaaaccct 31021 gcactggcaa attgcaaggc agggggtttc acaatcgttg ttgcagcata tactatagct 31081 atctgttcag acaaagaaga ttacgcaaga tttgctattt ctttacaaaa cttaataaca 31141 agcttgcttc aactcagttt tgtctgcaag acttgagata caccctgtga taacaacatc 31201 tcacgtatct gctccaaatt ccttggcttt ggtgactcag tcaaagtttt tgtttgcaat 31261 tgagtcccag aaggttgttt actagataga gcagctgcaa attgcttgta aagctctggg 31321 ttgacagcaa agtaaaaagc tagctcttgt ccgacctctt tcaatcgctt 
tggtaattct 31381 gtttctgttg tctgtaacag cttagcaaga caagaccgca gcttagcgtt atctcgaccc 31441 actccgacaa aattaccatc aagttgactg tctgataatt gctgctcgaa gagatgaaca 31501 aacgtatcga gtctgccgtt atctatagat ggatgttgta gtagggcaaa cgctggtgga 31561 atgactgagt tactaacatt atcaggtagt ggtgcagcaa gtactttctc tgcattcaca 31621 ataccttcac caaatagatt gggcttccat gttggaaatt tctcacaact gtcgcgcaga 31681 atctgattaa agatgaaagg aatcttttcc gctccataac gttggatgag ttgctcgcgt 31741 ccatgatagg atagccacaa agcagccaca cccgcaacta gtggtgcaga aaatgaagtg 31801 ccggagcctt gcagaacgtt atatttgaac tcaccattgt tcttatcggt ttttgcatac 31861 caaactgact ccgcaggggc agtgacatca acttgacttc ctcgcgatga acccaaccaa 31921 atctcacgtc gcacattact gccagttact gcaatcactt cgtcataagc agcaggccaa 31981 accacgtagg gaatgaaagt accagatgct gccaccacaa tgacaccgcg tttttgggca 32041 tatatgatcg cactccgtag acgctggttg aacaatccag ttcctaggct aattgaaagg 32101 acatgcacac catgaccagc cgcgtactca atcgcgtcag caagattagt tacacttaat 32161 aggacaactg agtatgagac tcgaaggggt acaactttgg caccaggtgc cactccggtt 32221 acagattttc cggttgggta attgctttgt gcccttcttg ggctgatcat gacactggct 32281 gttgatgtgc cgtgacctgg attgtttata acttcgccta gaggtgtttc gagttcgtct 32341 gtcggatctt tgtcgttttt gagaaaatcg taactttctc taaggagcag attcgcgaaa 32401 atttctggat gtttcgtgta acctgtatca ggaagcccaa tgacaatccc atgtcctggt 32461 ggtagattcg ggtcaggaaa aaagcgtgac catgcctcca agactcgaag ttgcttgaga 32521 ctccactcta catcactact ctcgtctaaa gcttgatgct caaaacgtgc aactgcctgt 32581 tttgactctt gattccagtc agaacggtac ggtaggggaa cagcaaataa tggctgtgca 32641 tccacaactc ccggctgaga acgcagataa tatgttttat tccaagcttc tttaactgat 32701 agtgtatctc cctttttgct tacttcaaat tcggtgtgat tatctccaat cgatttaacc 32761 ttccaatttg aacccagtgt tttggtaacg atttctcgaa ctctgtgttc taatcccggc 32821 ataaccatct ggagaaaaaa accttcaaac ttagcactga agttatttga gtcgctgagt 32881 tgattgcgat ctgccaatgg atgcaagctt cgcttatcgc tcatacaatt ttcctctggt 32941 gaaattagtt cttagcgtta aatctagttc agtactttac tacactgcaa caattgatga 33001 ttacagtctg agtgttccct gagaaggttt 
ctccagatga gtctgaaata aaacgcaata 33061 cagttatcac gattgcaatg tagagacgtt gcatgtgagg caggtgctaa cagcgggttc 33121 cccgacttgt tcgcgcagcg tctccgcagg agttagcaac tgccgttcgc gttcgcgcag 33181 cgtgtccctt tgggactcag cgtctccgtt cggcacgggg ccgtgcccga agggctcagg 33241 agatacccga agggcaacgt ctctacacac gtggaatggt taccaataat ttttaactga 33301 accgtattga aataaaacga gtagagcatt tgtatttggg taaattctac tccaaaaaaa 33361 gaagtcccgg ttaacgtaac caggacttct ttgaagattt ggcgttaatt acaagcttga 33421 gagagctatt catgcaacgc tgttgaaaat cgtagcccag agagttcccg ctactccaac 33481 gccaacgaat aggtaagtga agaaacgacc caattcaggg tttgttccca gtgctttatc 33541 ttcttgacct gcagtgaaaa ataatgacca agcttggtta acagtgattg cttgggaatt 33601 gttgaattca ggtctaaaca caacaggtcc aactaggtcg tttttgggaa agtcaaagtc 33661 agttctcata gtcatttgtc cacttttgat ttatacagat tgatgatttt ctaaaaattc 33721 tttccttgtg attacaggct tgtgatccaa tcaatcaaac catgtccagt caaaacttcg 33781 agcgctatga gtgagacaaa accaatcatt gctaaacgtc cattgagttg ctcagcatac 33841 tcagtgaagc caaagttttg ggtttcgtcc acatacatct tgggttcgat tgcgtaattg 33901 tttaatctgt tgccttcttc catcacgtaa gtgcgtcctg tcatcgtttt ttcctcaatt 33961 tctttttctt gttaactaat gtaaagcatt gtaaagaagt attgcaatac ctttacaaaa 34021 taaattatga gtcaaaaccg tactttataa gtagggaaaa ccgtacttag ggagacccaa 34081 caaaaaaccc aacaaaaaaa tattcaacaa gaatgcctgt ggcttacgcc acggtgatgt 34141 gtactcaagc ttaaattgct cacatgactt tttctaacgg tttttccaac tttgccgtaa 34201 taaatgcatg ctatgttatg gtggaaaacg agagttaatt cggcgactgc gtttgttttt 34261 agtattgaca ggctttctcc ttttggtatt cacagacttt tctccgaaac ctaaatctct 34321 acacacttat gatttcggag acaattaatg tcactttatg taggcaatct ttcttacgaa 34381 gttacagaag agagtctgaa tagtgttttc gcagaatatg gtactgtaag gcgtgttcag 34441 atacctactg atcgtgacac aggtcgtgtg cgcgggtttg ccttcgtgga aatgggttca 34501 gaggctgaag aagcagctgc cattgatgct cttgatggtg ctgagtggat gggacgtgac 34561 ctcaaagtta ataaggctaa gcctagagaa gacagagatt cgtttggtgg taatcgaaac 34621 aatagcttcc gtaagcgcta ctaaaattca tagatttagc tctttaagct aaccaattcg 34681 tccagcaaat 
tcattggatt tagtagcggc tcaggcagga gtgcttgagc cttttttata 34741 caaaacttgg actgcgccaa gaaaacttaa gctttagaag ccaaaaccca ttctctggag 34801 ttttccatgt caaactcagc tactcctccc atccaaccag ccgtcctaat caccatcatt 34861 ggtgaaacag tgctgaaaga ccgaattgtc aagctgctga aaagtcatgg tgtaagtggt 34921 tacaccatta gtcaagtaca gggtgagggc ggacacggaa gacgcttgtc agacttagca 34981 ggctacaaca ctaatattga aatcaagacg attgtttcat tagaagtatc tgatgctatc 35041 ctttcggcac tgaaagagga gcagggcaag cacgccttga tcgctttccg acataatgta 35101 gaagctttct attgattaat cttcctgact agagcagtcc tcacagattc gctgcaacag 35161 caactgtcat aaagtgccaa gtagttgaaa atgcttgatc tgactaactt tacagatggt 35221 gggttgtttg gtggtaaacg aggaaacgat agtgatggat actctcgaca ttactaagct 35281 ttgaaaccaa gaatttcact cgaaagtttt tatggtagac aaggagtctg ccttttttta 35341 ggttctgatt gactggattg cttaccacta aaaagctagt aattaaatca acttagtagg 35401 aaagagaatg acccaaatat tcgtaggcga aaatgaacca attgagtcag ctttacgtcg 35461 atttaagcga gaagtttcca aggcaggaat tttcccagat ataaaaaaga atcgtcactt 35521 tgaaacgccc ctgcaaaaac gtaagcgcaa agcagttgct agacacaaac agaagaagag 35581 aggtttccga cattgaagtc gagtgtctgt tatgaaggca cgcaattttt ttattaacag 35641 gtgtagcaac catgacccca gaagccaaac acgttgttag cgaactaagg cgtgagttct 35701 attctctgtt cgctgctgct atgaagatgc cagtgatcat aactggtaca tcctatagta 35761 gtaattcccc aattgcgttt tggatggatg ataggcagcg actgaattat gtaaatatat 35821 acgcttacat agcacctgac acatttatgc cattacgccc gttcattctc aggctggcta 35881 ttaacaagag cgctcttcaa ttcatgatgg taaaaaaggg gcaagaacac cgaaatcccg 35941 tgtgggactt tgagttaact gtattgccaa cagaaatctt agactttctt ccctggatag 36001 ttagtttagt tgaagctgat gataaagttt cacccttgct gctacactca ttaccccatc 36061 catttaaatt aaaagtgcca tctgtcgggt tatgtcataa cgcatggact cttgaagctt 36121 ggctcctgac aaactcgaca atgttctaga acttggatgc ctcagttacc ttcagcatca 36181 cagcacaccc tcgccagtct gcgctctagt gggaacgacc acgccttacg gtgagtaagc 36241 gcgctgcggt ccagcagtct cccgacagcc aggcgactgg cgtatgcgca aagcgcacgc 36301 ccacagggct aaagcccaca gggctatcgc agcatgtcag caggactcag cgctgagact 
36361 ggggactcat gcgatactat ttattggaca atagaataaa caagacgacc tcgtgctttg 36421 aaagctacaa gcaccgttgg gttgggaaga gtaaagcagt aacgctctcc tcgctcctac 36481 cgtaattctc aacacgagga tgctagaagc cgacacttcc atgacggata attgtcggtc 36541 gttcacacaa aagctagaag catgatagaa acagagtttc aaaaagtaag accgcgcaaa 36601 gtctttgcac agcattggct caaaagtgaa aaagctctca acgaaattat tcaagcttcc 36661 cagttacaat ctactgacag ggttctggaa attggtccgg gtacaggtat tctgactcat 36721 cgtctattac ctttggtaca atcaatgctt gcagtggaaa tagaccgcga cttatgcgat 36781 cgcctgacaa aacaattcgg acgccaagaa aacttcctac tactgcaagg agattttcta 36841 gaactagatt taccttccct gctttcacca tttccggcgt ttgagaagca gaataaagtt 36901 gttgccaata ttccctacaa tattacagga ccaattctag aaaagctttt ggggacaatt 36961 agtaacccaa atccccaacc ttatgagttg atagtactcc tggtacaaaa agaagtcgcc 37021 gaaagactgg tggcaaaacc aagctcaaaa gcatttgggg cgctatcggt acgagtgcag 37081 tatttagcaa agtgcgagtt aatgtacact gtcccagcag gagcatttca gccaccaccc 37141 aaagtagact cagcaatcgt gcgcctagtt ccacgtcttg tggaaccacc agccgcagac 37201 acgcaacaat tagaaacttt ggtgaagttg ggctttgggg caaaacgtaa aatgttacga 37261 aataatttgc aatcggtggt agaacgcgat cgcctgtcac aattgttgga aaaattagag 37321 ataaaccctc aagctcgcgc tgaagacctc agcgttagtc aatgggtagc tctagcaaat 37381 gagttgttag tactgagtga tgagtcctga gttttatttt tctactttct cagcattcag 37441 cggttaaaac agttatcagt tatcagttat cagttgttca ctgtttactg tttactgttc 37501 actgttcact gttcactgtt tactgtttac tgatttgttg aatatgcgtt cttatactct 37561 tatcgcccct gccaaaatta acttacatct ggaaattctt ggtgttcgtc ccgatgcgta 37621 tcatgagtta gtcatgatac ttcaaagtat aaatttatca gacgaaattt ctgtacaagc 37681 gagcgataca caagctatcc gcgttcgttg caaccatccg gaagttcccg cagataatag 37741 taatattgca tacaaagcag cggaacttat ggcgacagag tttccatcta cctttgccaa 37801 atatggcggc gttaacatta ccatcaataa gcgcatacct gtagctgcgg ggttagctgg 37861 aggttcgacg aatgcagcag cagtgctagt gggaatagat ttgctgtgga aattggggct 37921 aactcagtca gagttagagg aattgggagc acaactaggt tcagatgtac cgttttgtgt 37981 ggcgggtggg acggcgatcg ccacaggaag aggtgagcaa 
ctttctccgc ttcccagttt 38041 agataacata tatatagttt tagctaaatt ccgtagtctt gcagtctcta ccccttgggc 38101 atacaaaacc tatcgacaac agtttggtga ttcctatctc aaagataccg acagcttaac 38161 aactcgcgca gcagccatac actctggacc gatagtcaaa gcaatcttga atcaggacgc 38221 aacggagatt gcccaaaagc tgcataatga tttagagcgt gtggtgttac cagagtatcc 38281 gcaagtcttg caattgcggg agacatttgc tagtgctggt gttttgggga caatgatgtc 38341 tggttctggt ccgagtgtat ttgctatttg cgagtcgcaa cagcaggcgg aagaagtgaa 38401 gcttcgtgtg agggagacta ttcctgatga ggacttagaa ttgtttgtga ctcgaatgac 38461 ttcacatggg attcagatag catcatcggt gtaaggacgc gaaaatttta tgagtgacga 38521 aaatctaaca caacaaacag aagctcaaac acaggttcaa tcaagtccct tacgttgtgt 38581 tattggggca atgatttcgg gagcacttgg gtatggacta tactctctaa tgattgccac 38641 agcgacaagt tttgcaacca aacccattca ttcagataac gttatagtac tgaagatttc 38701 ttctgcagtg cgtaccttag ttgtgggtgt tatggcttta ggaactgcgg tatttgggat 38761 agtggcgatc gggttgctgg ctttgggggt gcagttgttg gtgcagcagt tgacaaagca 38821 gaagagttga tgcgtgaatg cgtgaatttg agcaaagcga atatggtgac actgcaactc 38881 aaacaaattc gagttccacc aggacagcgg gtgctgctgg aggatattag ttgggtggaa 38941 tttgaggcaa ttcttaatga attgggggaa caccgtaaca gtcgagttgc atatcagcaa 39001 ggcacattag agattatggt tccactccca gaacatgaaa gagctaaaat cattattgga 39061 gatttagtaa aaatcttgct agatgaacta gacctcaatt gggagtcctt tggttcaacc 39121 acctttaagc gcgaggatat gacagcaggt gttgaaccag atgattgctt ctatattcaa 39181 aactataagc tcatgattgg cagagatagg ataaacttaa ctgttgatcc tcctcctgac 39241 ttagcaattg aaattgatgt cacctcaaaa actaaaatga gtgcttatca agcattaaga 39301 gtacctgaaa tctggcgcta cgaaaacggg aatttagaaa ttaatctgct gcaaggtgaa 39361 caatatatta agtctcaaaa gagtcttact tttaccaatt tttcagttat tgaagagatt 39421 tatcaatttg tagagatgag tcgaacaata ggaacaactc cagcactcag gaaattccga 39481 aagtgggtta gagaatccta acaatcattt tttatgaagt atggaagagt ttaacagtta 39541 tcagttatca gttatcagtt gttcactacg taggtagctg ttcactgttt taaaacttgg 39601 gcaaaacgaa gaataaatca acctcactac ctctagaatg aaagcgtagg ttgccattaa 39661 actgcaattg gaaaaccaga 
tattgagtgg atacaggtaa gattagcaaa tttttaaaaa 39721 agaagtcaga agtcagaatt caggagtcag aatcaagatg tctgtaaaat aagcaaattg 39781 cccttaataa atttttctcg aaataaagga gaacatgacc atcggcgttt ggatattagg 39841 cgaccaactt tggatagaac aagcagcact gcaaagttgt caagataaag tgcctgttat 39901 catgattgag tcgctgcatc atgtccaaga acgcccctac catcgacaaa agctggtgtt 39961 ggtttggtca gcaatgcgac attttgctga agaattacga gaacttggtt atccagtcac 40021 ctacaaatta gctgaggatt ttgaaacacc actccaagaa tggattcagg aaaaccaaat 40081 tactgaattg cgggtgatga cgccaaatga taaaccattc acccaaatga ttcaaaattt 40141 tgcttcttta cattgcaaaa ttaccctcgt tcctaacaat cattttttat ggagtgtgga 40201 agagttcaaa acttgggcaa aacgtcgtaa acgtctgata atggaagatt tttatcgaga 40261 aggacgacga cgttttcaaa ttttgatgga ggaagataaa ccagtggggg gagagtggaa 40321 ttttgataaa cagaaccgtc aaccgccaaa aggtaaattg aatacgccat cagccaagtg 40381 gtttgaaccg gatgaaatta ctcaagatgt tattgcacac gtcaaatctc tttcttttcc 40441 actctacgga gaagtagaac cgtttcgctg gggagtcact cgttctcaag cactcgaaat 40501 attagactgg tttatcaaaa atcgtctacc tgagtttggt ccttaccagg atgcaatggt 40561 gacaggggaa gagacaatgt ggcattctat gatatccccg tatttgaata ttgggttact 40621 ccaacctttg gaagtcattc aagcagcaga gaaagcttac cagcaaaacc aactgtctct 40681 atatagcata gaaggtttca tccgtcaagt gatgggttgg cgagaatata tgcatggcat 40741 ttaccatttt gtgagtgcag attatccaga aaaaaattac tttgaacaca cgcaaccttt 40801 acctgagttt ttctggacgg gtgaaacgaa gatgaattgt ctgcaccaga taatcactca 40861 gttgctacgt acaggttatg ctcatcatat ccagcggctg atggttttga gtaattttgc 40921 tttgattgca ggtctttcgc cccaagctgt agaaaattgg tttcatgcca tgtttattga 40981 tgcttatgac tgggtgatgc aaacaaatgt aattggtatg gggctatttg ctgatggtgg 41041 aatgttggca tcgaagcctt atgcagcatc tggtaactat gtcaataaga tgagcgatta 41101 ttgtaaagga tgtgcttata atcctaaaga acgtgttggg aataatgctt gtcctttcaa 41161 tttcttctac tgggattttc ttgatcgtca ccgtagtcaa cttcagtttc aaggacggat 41221 gagctttatt ttgggacatc ttgaacgaat gtctctacaa gagttagaaa ctatccgtca 41281 acaagcgcga gattggcatg tacagcaatt gtctggggaa gaagtttaac agccgctgat 41341 
taacgagaaa tttcccgatg ggaaaactaa atctgaggct taaaaacctt tttttaattg 41401 tgataattgc tcaataaatc cctagctggt tccccacatc tgtcaagacc gtatgagtcg 41461 ggttgtgggg gaattttttg ttaaggtgtc tgtggtggat taattttggg gttatttatt 41521 tgatgtcgtc gcagagggtg ctgtatatac agtactagta aaataaaaaa tgcttgatgt 41581 atttttcctg tctcaaaagt tagaaaaata tggggaaatc gtgcaaaaac atttttctta 41641 accaactctc aaaactacgc tttgttgtga aagtaaagat tgtgacattg accattgcgg 41701 gtgtgttagt agccacttga ataaccaagt atcaaacaga acactctgcc acattatctt 41761 ctaagctagc gttccattca cctacgactc ctccaaaact tcctgtccag gtttttgcca 41821 catcatctcc caagcaagcg ttcccttcac ctacgactcc tccaaaagtt cctgagtggg 41881 ctattgtgcc tgtactgaga atcacatttg cccaacgcca cctactgcgc atcaccgatg 41941 aggacaaagg atgaccagaa gaaaggatga ctattaattt ggcttaactt agcttggcgc 42001 aacgcttcac tcttgctcat cccttgtttc aggtttcggt aaaactgtac catgatttcc 42061 tgggtcgttt tatcctcaat gctccaaaga ttggcaataa cggctctagc tcctgcacgc 42121 tcaaacaggt aggcaagtcc ggtaattggt tctgcgccct caccaatcaa tagtttccct 42181 ttagaatcag tctttagagc agtttggcaa gcactgaggg tgattaaatc agtgttgctc 42241 aatcccaaca gggctgcatc agcaatgttg agtttttggt cagcaaacag gaggatattc 42301 tctgacaaac ctaatttagg gcagcctcct cgctggaaac agccatgtgt tgccaggtgc 42361 agcagaggaa agcgaggaga ttgagttttg aatgtggcga gtgtagcttt gttgccaatt 42421 atagcttcac ttccgggcag aatttgagtg atgctgttga cctcaatttc agatcctggt 42481 agtgccaagg ggtttttagg aatcgggttg ccaagagcga gaaccttctt gggagatgtt 42541 gagttttttg tccgagttgc ttggagagaa tgggtagaaa ggcgagtgag gtagctaacg 42601 gggtattgct gaatcaggta tttttcggtt ttgctgttgt agagggcttc aaaggggatg 42661 tagcggaaag ctccagtagc gatgatgcta agttgtttgg ggttggttgc ctgaatttca 42721 gtttctactg gacgaatcag caggttataa agcttttcca agttgtcgag gtatttatca 42781 ttaaagtgat tggttagttg agcataagtt tgggtgagta aattatcgag ttcgccagga 42841 ttaacaggta caagctttac tgtcagcttg tctttggtca aaataaagaa agcgattgta 42901 tttgggacat ttttaacgtt ggtcagtaag acaggatgaa ttactgttgt acctgccggg 42961 atactggctc tgagttgagt aatgtcggtg ggattggttt caaacagttc 
tgccacttcc 43021 ggaaagcgct tcgagatggc ttcagcttgc tggttaactt tggcttctag ctcccgcatt 43081 tgccctgaga gtgtatcgga aaatttctgc tgtaagccct gacgcagaaa ttgcagttgc 43141 tgataaagct ggttccactc atcaatagct cgttgagcat cggggttagc gactttggca 43201 ttaagcaagc gggcatagtc aatcaagtca gcagttgtag tgaggttagc ccattcatag 43261 gctgaatctg ctttattttg cctaatgagg agatccacga gggcgatgga agttcctcgt 43321 tcttgctgca aaaatcctgc ccgaaagtct ttggttagtc ctttacgaat ttccagagtt 43381 atcctgactg cttgctccaa atttgtagca gctttctgcg gctgcccaat ctgactgtag 43441 accgcaccga tattacttag agtagcagct tccccagcgt gatcgcccac cgcttgtcta 43501 atcggcaggg cttggtttaa aaactccaat gctttctggg gctgcccaat ctgactgtag 43561 accgcagcga tattattcag agtagcagct tccccagcgt gatcgcccac cgctcgtgta 43621 atcggcaggg cttggttgta aaactccaac gctttctgag gctgcccaat tttgctgtag 43681 accacgccga tattatccag aattgtggct tccccagcgc gatcgcccac cgcttgtcta 43741 atcggcaggg cttggttgta aaactccaan nnnnnnnnnc agcgcgatcg cccaccgctt 43801 gtctaatcgg cagggcttgg ttgtaaaact ccaacgcttt ctggggctgc ccaatgtcgc 43861 cgtagacccc accgatatta ttcagaattg tggcttcccc agcgcgatcg cccaccgctc 43921 ggataatcgg cagggcttgg ttgtaaaact ccaatgcttt ctggggctgc ccaatctgac 43981 tgtagaccgc accgatatta cttagagtat tggcttcccc agcgcgatcg cccaccgctt 44041 gtctaatcgg cagggcttgg ttgtaaaact ccaatgcttt ctggggctgc ccaatatcgc 44101 tgtagacccc accgatatta cttagagtat tggcttcccc agcacgatcg cccaccgctt 44161 gtctaatcgg cagggcttgg tttaaaaact ccaaagcttt ctggggctgc ccaatatcgc 44221 tgtagactga accgatatta tttagggtaa cagcttcccc agcgcgatcg cccaccgctc 44281 gttcaatcgg cagggcttgg ttgtaaaact ccaatgcttt ctggtgctgc ccaatgttgc 44341 caaagtttac gcctattcca agtagagcag ttgcctccag ctttcgatct ttgagttgtt 44401 gtgcaattgc taaaacttgc ttaaacgttt ctattgcctg ttgtgattgt ccttgctgtg 44461 tttgtttcag cgcttggttc ataagtttta gcacctcttg acttggggga ttctgagttt 44521 gtccccacac tggttcaaca gttgactgca taacaactgg agccaaacca attcccactg 44581 ccaacaaaac tttgagaatg cgttgcatta aaatttcatc tttgatcaga ttacaatccc 44641 agtaatatgt cggtagatag caccagcacc 
atcgaaccta agggtcagta ggcgttactt 44701 tgatagtcat ctttccggca gtttgaaact gaatttcatc cgctacatat aaactcttaa 44761 cagccactcc ctgaatacca cgaatatccc tcgaattcat gagtgttgcc aggtctacgc 44821 ttgcttgcct agggttgaga aagcgaaacg cattcagtgg aatattcttc caagtgcttt 44881 gttgagcctg tgagttatcc tctccccgga tgacccgcct accatcgctg gtataaacta 44941 gaccatctcg cttaaaggtt actcgctgaa tctgggcaat ctgaagtgag cgagaaacac 45001 cactgccgga tatctgaatt atttgcttgt tcgggtcaaa cgctgtcacc tgtcccgtca 45061 tcgaacttcc attcttcagt accacatccg cagtgctggg tagcgccaag ggtatcggcg 45121 cgttaacttc tgccttctgc gtttgggcta tttgaatagt gccaatcact gcttgtacat 45181 tttctaccgt cccggcgaca gcaagcacgc tagcaaaaat cagtgttttg aaaaatactt 45241 tcatgcattt aaataatact aaggagatag agagtaacgc aataaagtgg ctcacccttg 45301 gatataccta gagctattag tatagaatca caaaaagttc ctttcgttcc ggtaacttta 45361 gaaaaaagtt tgctcaaaag tgcgattgcc caaagggcag acgccaaagg cgtatcgcaa 45421 ccgcttaaat agcgttccaa tacgggcaat tacagaaaca ctaccacata actgaggatg 45481 aaaacttaca cctagccaat gcttaccatg tccattactg gtgagtaatc gccatcttga 45541 gggtaagctt taccgttgta gcataggctg gatggggacg cacagagtgt gtgtaacaat 45601 aaagcgattg agcgttgccc ccttgggtaa accagatact agtttgagcc acacatttag 45661 taacttttgt cactagtcat gtccccagca tctcaaaatt ctgatcgtgc caatcttctt 45721 ggcagcttcg ctagtctgct cggcgtgtta ggtatttttc tttactttat aggttggata 45781 tatcgctggg cttactttgg cttttttgaa gttgaaatta ctaccctaaa cttaccacta 45841 gagtcatttc tactagtacc aattcaagtt attctgggtg atttctggat atttattcgg 45901 acaattatag ttgtaagtat aacagtagtt ttgattcaat tcactctctg gttgattcgc 45961 tcaccaaagg taagcagtcc agcaagcacg tcaaaatcac ccatcagcag atttacgcaa 46021 aaactgcacg gtttttggct cctcaagccg ctacgttcct ttgcccagct ttttccccag 46081 cccttccgtc atgaaatcgt tattatagcc tggattctgg ttgctctatt ttggttagcg 46141 cgttggcagg gtactgctga tgcttaccag gatgcagtca ataacacctc tacaagacct 46201 attgttaccc tagttagccc cagcgataag atggctttgg gacgcaatcc cgatgacctg 46261 ttaaccaatc tccccttgaa gaattcccgc attatcggag atgtgaatca gttcaggcag 46321 attttcggac 
gggaaacaaa cgatacaact aatccagagc aacctattgt ttggcggttg 46381 cttattgaaa acaacaactg ggtatatctg ttcccagcaa tgcaaccagg agcaaaggct 46441 aatcagcgtc ctccagtgtt agcaattaac acaggtgatg gtcaggtgca gttgttgatt 46501 cgcagccgtc ccaaaaggct tccgtagaaa cagcttaaac aaaaaggtca agctttgggt 46561 tgcaagccag gtaaaacttc tgcaagtcga gcttcataaa ctactatgca acagggctga 46621 tctttattat ctaccttagc tccgtgaagc attccgaaaa aatatttaag taattttttg 46681 atttgcaaaa tcgtagagta gaatgtttca gtcaaaacct ttaaaaaaaa ttttcataga 46741 atagagatat tttgctaaaa atgtctaaaa tcaaaaaaac aaaacaacag cagacaatat 46801 tacaaaaact cacatggcgc acaggaatat tatttgctg // LOCUS NODE_495_length_45681_cov_5.150988 45681 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 45681) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 45681) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..45681 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(161..694) /locus_tag="DP116_02620" CDS complement(161..694) /locus_tag="DP116_02620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_02620" /translation="MTNISVIDENPESKNLLVKCLETAGFEVIGTENDLVRVQLAQEK LSTTESSKKSNTPQFIFPSIPRLRDVFEFIELNYHQSISLKEVAQAVGYSSAYLTNLV RNITGKTVNDWIIERRIAQACALLLSTNDSVNQIALQVGYQNLNHFYSQFRYHKNTPH AWRKAQRCKVSQNKIHK" gene 1386..2690 /locus_tag="DP116_02625" CDS 1386..2690 /locus_tag="DP116_02625" /EC_number="4.2.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191784.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="threonine synthase" /protein_id="PRJNA477356:DP116_02625" /translation="MTQATTNNTQATTATFKSLKCKECGAEYELKALHVCEFCFGPLE VTYDYSALRSTVTRETIQAGPNSIWRYRKFLPVASENPIDVGTGMTPLVRSHRLARRL GLNKLYIKNDAVNMPTLSFKDRVVSVALTRARELGFTTVSCASTGNLANSTAAIAAHA GLDCCVFIPADLEAGKILGSLIYSPTLMAVKGNYDQVNRLCSEVANTHGWGFVNINLR PYYSEGSKTLGFEVAEQLGWELPDHIVAPLASGSLYTKIYKGFQEFVELGLVEGKDVR FSGAQAEGCSPIAQAYKEERDFIKPVKPNTIAKSLAIGNPADGVYAVELAKKTGGHIE SVTDTEIIEGIKLLAETEGIFTETAGGTTIAVLKKLAEAGKINPDETTVVYITGNGLK TQEAVQGYIGEPLTIEAKLDSFERALERSRTLDRLEWQQVLV" gene 2790..3065 /locus_tag="DP116_02630" CDS 2790..3065 /locus_tag="DP116_02630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206809.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molybdopterin synthase sulfur carrier subunit" /protein_id="PRJNA477356:DP116_02630" /translation="MSVKVLVPTALQKFTNNQATLECKGETIAQLFDSLEENCPGIKS RLCDEAGQPRRFLNLYVNSEDIRFLDGKDTELKDGDEVSIVPAVAGG" gene 3344..4036 /locus_tag="DP116_02635" CDS 3344..4036 /locus_tag="DP116_02635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879161.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2996 domain-containing protein" /protein_id="PRJNA477356:DP116_02635" /translation="MAEETNHNEAGEVAPSTVDKQAPSVAEEHAPSTDSPEATDIPTA NAPDPKAAKSEADPDDAAKTKTAAPKREKPAGAAAKAAAGDQPDAKAAAAEEKPAKAK KEKAPAVEDKPFGEFIQQDYLPAVQKAIAKEGVQDLALNFAKQKISIVGFDKSEECWQ IIGTWQNGLRQFNLYFTQEDIQGKKAFSCNEGKKPSTLESFLIDERKVTLDLLVYGLL QRLNGQKWLGRN" gene complement(4033..4539) /locus_tag="DP116_02640" CDS complement(4033..4539) /locus_tag="DP116_02640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02640" /translation="MKAENNNSKDTQERLLVSYILSAAWFVGLGGLHRLYNGKIGTGL LWLLTGGVLGIGQFVDLFIIPNMVDEQQMRLRAKAGLSPLGVPLNQPAVAAQVYRSPQ EKLTMELLRAAEKRGGQLTVTQAVMETGANFAEVEAVFKELLKSGYVKIDNDPETGAV TYHFHELN" gene 5245..6321 /gene="acsF" /locus_tag="DP116_02645" CDS 5245..6321 /gene="acsF" /locus_tag="DP116_02645" /EC_number="1.14.13.81" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015202686.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium-protoporphyrin IX monomethyl ester (oxidative) cyclase" /protein_id="PRJNA477356:DP116_02645" /translation="MVDSLKKPGFEEMRSGIKVPAKETLLTPRFYTTDFDEMARMDIS PNEDELKAILEEFRADYNRHHFVRDAEFEQSWDHIDGETRRLFVEFLERSCTAEFSGF LLYKELGRRLKDKSPVLAECFNLMSRDEARHAGFLNKALSDFNLALDLGFLTKSRNYT FFKPKFIFYATYLSEKIGYWRYITIYRHLEAHPEDRVYPIFRWFENWCQDENRHGDFF DAIMRSQPQMLNDWKARLWSRFFLLSVFATMYLNDVQRKDFYASIGLDAREYDIYVIE KTNETAGRVFPVILDVEHPEFYQRLEICVKNNEKLTAIANSKTPKFLQFFQKLPYYIS NAWQISRLYFIKPIDATKIQATVH" gene 6402..7091 /locus_tag="DP116_02650" CDS 6402..7091 /locus_tag="DP116_02650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874592.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02650" /translation="MRRLLSALSFGVGVNLVVMATPVLCDTPSKNAQDLDISPEIIKN SPVLQRWQHQVPNVLEDIKNDPSFPTKIRLGSSYFSSDEAFGVNIGVEDVFIGRTSLT VSGEYQAAFNGQRQVYGADLHYYLRPLGSYINITPVVGYRHLEINSYSTDGVNLGAKL LLVLSRGGAGDISLTQSWIAPGTGEEVGLTTISVGYALTQNIRISTDIEQQNSKQNKE TRLGIVFEWMP" gene complement(7125..7676) /locus_tag="DP116_02655" CDS complement(7125..7676) /locus_tag="DP116_02655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875735.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_02655" /translation="MDTPQLLDGLRVLVVDDDTDNLDLIKVIFEEYNVQVIAVTSAGE ALEAITQFKPNILISDIAMPGEDGYSLIQKVRNLALTVSQIPAIALTAHASEEAGALA LDAGFSIRLVKPFDPDDLIAVVSKLVLIAQYDICPVCNIKQLSFIEWESLNKIRFHCR SCKWNESYGLHKAKVAGFMNRYH" gene 8068..8382 /locus_tag="DP116_02660" CDS 8068..8382 /locus_tag="DP116_02660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317507.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbon dioxide-concentrating protein CcmK" /protein_id="PRJNA477356:DP116_02660" /translation="MPMAVGVIETQGFPAVLAAADAMVKAAAVTIVYYGLAESARMLV AVRGHTAEVERAVEAGIEAGNNQSNGGTVITHYIVPNPPENVESILPIHFTQKSEPFR IM" gene complement(8396..9439) /locus_tag="DP116_02665" CDS complement(8396..9439) /locus_tag="DP116_02665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739807.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LD-carboxypeptidase" /protein_id="PRJNA477356:DP116_02665" /translation="MINRRNFLLSLAATPSVFAFPFQLAKAAGKPLLKPKRLQPGSIV GIVGPASAVFVREELNIVIDAVKGLGLVPRLAPHLLERYGYLAGKDKDRAADINQFFS DSSIAAILPVRGGWGCSRILPYLDYQRIRKNPKILVGFSDITALILGLNAQTNLVTFH GPNGLTSWKTTQTDYFRRVLFRGEAVTFQNQKDGDDSNRLMQVKYRKQTITSGKAKGR LIGGNLSVLSAIVGSPYVPDFSGAILFLEDTHENIYRIDRMMTHLKIAGLFNKLAGFV FGQCSDCSPDADYGSLTLEEVVWDHIQPLGIPAWYGAMIGHIENVLTLPIGLEVEIDA DAGTIRMLEPAVE" gene complement(9524..9766) /locus_tag="DP116_02670" CDS complement(9524..9766) /locus_tag="DP116_02670" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02670" /translation="MRDEDRNTYKKVITAVLAAAATTAIVFAVIALVLRKTSFLPSST IQPTTASTQSPATRHDDNNDDKNDNDDKNDDNDDDD" gene 10191..10661 /locus_tag="DP116_02675" CDS 10191..10661 /locus_tag="DP116_02675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456400.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02675" /translation="MRKAVFLILPITLILFPSQRAQSQRVESSFDRPYLALVKGDTPN SCSFISGDFLERAADIVQDLLFVRPGDSPQQVSYEVRFVPDQPYARDALQWTALAEGN YTKAIVNFRDNRAQERIFTMAINYNQPNEKLCQWAVREPQQGTQSQSPASPPGQ" gene complement(10848..11273) /locus_tag="DP116_02680" CDS complement(10848..11273) /locus_tag="DP116_02680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455666.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02680" /translation="MSSLGKKRLEAVLAVATYAVGSGTATAAPTPGVEVPKQLILTAS DIIMYTRVWKIYFEEDLSSQGLLEMLVELGLVTVAATGTAYIVSKASTAILKEITSWT GPLGWGVLAVISGSITGLFGTVWALYCDYLYSQKEVQSA" gene complement(11609..11830) /locus_tag="DP116_02685" CDS complement(11609..11830) /locus_tag="DP116_02685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316935.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02685" /translation="MPSGHANHKGTSDKPNVNAEGQINVSAADKSVEPEDALLEGAMT NTTRTQEFVDYPPATQRPGEEAQTGNQEE" gene 12325..12720 /locus_tag="DP116_02690" CDS 12325..12720 /locus_tag="DP116_02690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313737.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02690" /translation="MKKIITAGLVVLATTGGFLLNNQSVQAHSRGYHHFRPYGYYPFY GRPINSCYPVTHWLVDEDPAYHPQAYADGYRQGQESAKRGNKYKSRTAGGEFGRGFDD GYYRRKFAGQKQVVPNEYKPYTTSECGWY" gene 12734..13039 /locus_tag="DP116_02695" CDS 12734..13039 /locus_tag="DP116_02695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004745696.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="antibiotic biosynthesis monooxygenase" /protein_id="PRJNA477356:DP116_02695" /translation="MILEVAMLNVRSGMENEFEVAFAKASPIIASMKGYIWHELHRCI EAPNRYLLLVRWQTLEDHTTGFRGSPQYDQWKQLLHHFYDPFPTVEHFEGVLQYHCS" gene complement(13042..13206) /locus_tag="DP116_02700" CDS complement(13042..13206) /locus_tag="DP116_02700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008618729.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rubredoxin" /protein_id="PRJNA477356:DP116_02700" /translation="MKKYVCNVCAYIYDPEEGDPDGGIEPGTPFEDIPEDWVCPVCGA SKEDFEPYDE" gene complement(13322..15106) /locus_tag="DP116_02705" CDS complement(13322..15106) /locus_tag="DP116_02705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M61" /protein_id="PRJNA477356:DP116_02705" /translation="MTEATAPRTNIYTKETAPAIYYQVAMPKPETHLFEVTLRLVNYS SPILNLKLPVWTPGSYLVREYAKHLQDFAAFAFDLPLSWQKISKNHWQIETCEVSEIT IKYRIFANELSVRTNHLDSSHGYFNGAALFFRILGWEAEPIQVTIVPPHPEWQVTTAL PPVRQKANTFCASDFDTLVDSPFEIGLHQLHHFEVLGKPHELAIWGKGNFQLQQMIAD IAKIIEVEAAMFGGLPYQRYVFLLHLFAQAYGGLEHKNSCALIYQRNGFRDRDKYERF MQLVAHEFFHLWNVKRIRPKELEVFDYDQENYTSSLWFCEGTTSYYDLLIPLRAGIYD AKSFLNNLSKEIARHETTPGRKVQSLAESSFDAWIKLYRPDANSGNSQISYYLKGEMV SLLLDLLIRSRSRNQRSLDDVMLKMWHQFGRDEIGYTPEQLQSVIEFIAGIDLTDFFN SYIDKTEDLPFNEYLEPFGLQLVEEKDEEPYLGVKVNSDNGRELIKFVDANSPAQFAG IDPGDELLAIDGLRVTANGLGDRLKDYQPRDTIQVTIFHQDELRTLPVTLASPCTRKY QIKAVENPSPTQKENFAGWLGAPLTTLR" gene 15198..15392 /locus_tag="DP116_02710" CDS 15198..15392 /locus_tag="DP116_02710" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02710" /translation="MLTAVASALNASLYFTTSVEQLPEIDEINNHDGIVYFDLTFVTA EESSDFIFFASTHYLRVLTS" gene complement(15840..17015) /locus_tag="DP116_02715" CDS complement(15840..17015) /locus_tag="DP116_02715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317503.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02715" /translation="MQTLKQGFEERNLTMVLEAEKLAQDWRKRLETECPEQSVANRES IVKWLVGSDLDRFEILNSKELDIAKQAMEYRYKILRQRYLGIARERAYRNLITRLGSL VTLRHKIQTWIALSRDRQRTVLDVLQEVIQELLQSDNYIQQQMISISECTTDTRLKNA LLFASVEEYCLRPVRNQPLLAYRFVNYLRRTQRGGLTQVPTQDLVRLVSEEILTDDSE NRVNLVDTQAVAEYQEAQEIEEQQALRKTVQQEFEDYLQENLGLEAVEWLRLYLQGKP QDEIAKKLNKPIKEVYRLREKISYHAVRVFALKGKSELVDSWLAISLKEHNLGLTPKQ WQQLYEQLTPVQRQVLELRKAGHSIEDAASQLKLKMHQAMGEWTKVYLVAQGLRSQD" gene 17547..17780 /locus_tag="DP116_02720" CDS 17547..17780 /locus_tag="DP116_02720" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02720" /translation="MFGKLTSFGKNEQSFLGKEGRVKNCRQNELSKPLVIVSQIHHHV ENIKENHIGKNKKNRRFYHRKIRDTGNSDHARA" gene complement(18301..19032) /locus_tag="DP116_02725" CDS complement(18301..19032) /locus_tag="DP116_02725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006633655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_02725" /translation="MSGVSIIIPTLNEAECLERTLRHLSILVPPVQEVLIVDGGSSDE TVTIAQKAGVSVIAAKKRGRAAQMNQGAEVATGEILCFLHADTLVPDDLVAVIEEILL DKSVAAGGFISLMTGDRTTRWGVSLHNFVKSYYAPLLFRPHLFFQGLRLLFGDQVIFA RRADFWKCGGFDSNLPIMEEADLCLKLVQQGKIRLVNRVVQSSDRRVAHWGFLKATAI YLSIGFLWGIGVSPQYLKQFYEDVR" gene complement(19294..19632) /locus_tag="DP116_02730" CDS complement(19294..19632) /locus_tag="DP116_02730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317505.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbon dioxide-concentrating protein CcmK" /protein_id="PRJNA477356:DP116_02730" /translation="MPLQAVGSIETKGFPAVLAAADAMVKAGRVTLVGYIRVGSARFT VNIRGDVSEVKTSMQAGIEAIEQVHGGTLESWVIIARPHENVEAVLPIGYTEQVEEYR QAVENPIVRR" gene 20198..21985 /locus_tag="DP116_02735" CDS 20198..21985 /locus_tag="DP116_02735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015172224.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_02735" /translation="MQEVKAIKTLLPLLRLYPWVIPLIIILGIFSSLFEGLGISLFIP FLQTLDTTNSQKVSSNLLVNFLNQIFINIPQDRHLFIIPLCILGLVILKNCLAYISTL LGHWLYWHIGQRLLCRIFQQLLSVSYSFLDSHPSGKLLNTLDVETWRTCDAIFLLVNI AISLCTVFVFVILLMLTSWQLSLLVTVALLLISLSIQYVTRRAKNLGRQSVQANDNLA NRVMEGFYGMREIRAFGHESYELKRFEEATIHSRIIALKLAKLYSITGPLSEVLAVAL LVFILIIALQQKSNVPTLLTFIFMLYRLQPVMRRLDTDRVNLIGLLGSVEDVMSFLDI TERNEICSGDLQFKSLQRGITFESVNFSYNTSERPALQDISFFIPRGKTTAIVGPSGA GKSTVIGLICRFYETEYGEISVDNYPLRKFNLSSWRSRIAIVSQNIYIFNTTVGENIA YGRLDATKSEIIAAAKKASAHEFISQLPEGYDTIVGDRGIRLSGGQKQRIALARAIVR EPEILILDEATNALDSLSENFIQEALNSFSQNRTVIVIAHRLSTIKQAEQIIVLQEGK IVEQGNLQYLLKLNGLFAKLYNLQSGSAL" gene 22132..23784 /locus_tag="DP116_02740" CDS 22132..23784 /locus_tag="DP116_02740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012627187.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase family 2" /protein_id="PRJNA477356:DP116_02740" /translation="MKSQPLVSVITPFLNTEKFIQEAIESVVTQSYKNWELFLIDDGS SDKSTAIAQEYADLYPEKVYYLDHDGHQNRGKSTSRNLGISKAKGKYIAFLDADDVFL PQKLEQQVAILESQPETGMIYGPTQHWHSWTGQQEDYQRDNMRPLGVQPNTLFHPPSL ITQYLKIAGIVPCTCGLLVRREVIKAVGGFDDTIQHMFEDQVLLAKICLHTPVYVDSS CWDRYRQHSESSCSQAIQTGKYHPLKPNPAHQIYLNWLQKYITEQKIQDAELWNALEK ALYPYHHPMLHFLLRVKKKLAQIARKKFPGFLHRFVGTQFLGDKYIPPTGAVDFGSLR RVTPMSQAFGYDRGQPVDRYYIENFLAHYQEDIRGRVLEVGDDNYTRQFGGYVASIDS IQRITQSDVLHVTKGNPKATIVGDLACGDNIPSNSFDCFILTHTIQIIYDVRAAIKTV HRILKPGGVALVTVPGISHIGDYQWADYWCWSFTALSVKRLFEEFFPAENLQIETHGN VLVANAFLYGIATEELRQEELDYCDRNYQVTITIKAVKPHTV" gene 23781..24815 /locus_tag="DP116_02745" CDS 23781..24815 /locus_tag="DP116_02745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203733.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polysaccharide deacetylase family protein" /protein_id="PRJNA477356:DP116_02745" /translation="MKIPGVVRLRRMAKRLRGRLAPGGLILMYHRITEVESDPWSICV SPRHFAQQMEVLHKFGEVVSLQQLNQTLQQGKTPHWQIAVTFDDGYTDNLYNAKPVLE RYDIPATMFLTSGYMEQKRDLWWDELNRLLLEPGSLPEVLCLEINGTTHRWELGTAAN YSEEEYQRDRCWRALGEDNPTPRHTLYRTLYWLLSPLLPEARQKVMDQLLAWGDCVAT LRSNHRILNLEEVSTLGNELISIGAHSVTHPFLSFLPHTRQRQEIQDSKTHLEEIIGQ KVVSFAYPHGNYSEETAGLVREAGFMSACTTYPRTVWKQCDRFKLPRVVVEDWDSEEF AKQLSEWREN" gene complement(24821..26131) /locus_tag="DP116_02750" CDS complement(24821..26131) /locus_tag="DP116_02750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006199031.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Xaa-Pro aminopeptidase" /protein_id="PRJNA477356:DP116_02750" /translation="MQAEYQQRREQLMSKIGNGTAIFRSAPTAVMHNDVEYAFRQDSD FYYLTGFNEPQAVAVLAPSHPEHRFVLFVQPKDREKEVWSGYRCGVEAAKQVYGASVA YPIAELDEKLPQYLEKADRIYYHLGRDRTFNDTILNHWQRLMRTYPKRGTGPIAIEDT GPILHSMRLMKSQAELELMQKAADIAVEAHNHAMKFTQPGRYEYEVQAEIEHIFRLRG ALGPAYPSIVASGVNSCILHYIENDRKMQDKELLLIDAGCAYGYYNSDITRTFPVGGK FTPEQKRLYEIVLEAQKQAIAQVQPGNPYKQIHDTAVRVLTEGLVELGILKGEIDKLI EEEKYKPYYMHRTGHWLGLDVHDVGVYQHGDDNPQILQPGQVLTVEPGLYIVPDTKLA EDQPQTDPRWVGIGIRIEDDVLVTSTGHEVLTAGVPKEVAEVER" gene 26450..26992 /locus_tag="DP116_02755" CDS 26450..26992 /locus_tag="DP116_02755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carboxymuconolactone decarboxylase" /protein_id="PRJNA477356:DP116_02755" /translation="MAKFPIIEYEQLSDSNVKAIYEEIQVELGFGIVPNLFKSMAINP RILEANWKKFRSTILKGDVPRTLKEMLGIAISQANSSPYALNVHLHGLSSLGMSEEVL RTLVSDFAACPLPEREKAVIGFGLKAATEPHALTSKDYQHLYDLGLDDSEIFEIVATA DLFTSVNKYTDSISLEIDTL" gene 27010..28362 /locus_tag="DP116_02760" CDS 27010..28362 /locus_tag="DP116_02760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358522.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="adenylate cyclase" /protein_id="PRJNA477356:DP116_02760" /translation="MGSRLEQQDYQQLLFLSRRLLAEAQALSSRIAAVNEIAIAINRS LKLDEILRVVSKQAKWLLDFEHCSVCLRDTTNGSWRIVTLFGSTVEVKFSDTTQMGVV GMSLKTGQARLIHENSTSGFLSQYQSQIIIPLECDHRVLGTINFATSAPKTYTQEDLR IGYLLGVQLSAAIRNAERFEELNRLYLELEKEKRKTEELLLNILPIDIASELKQTGAV KPVYYESASVLFTDFKNFTKLSEQLTPQELVDELDYCFSCFDQFIEAHNLEKLKTIGD SYMCVGGIPNHNKTHAIDAVLAAINIRTFMEWRKKEKAFLNQPYWDIRIGIHSGPLLA GVIGRKKFAYDVWGDTVNTASRMESSGLTGSINISLSTFELIKDFFIVESRGKVNAKN KGEIDMYIVNGIKDSLAVDPTGLLPNKEFNQLYFAIQPETDTLIEEDFIDQKTCCPNH " gene complement(28397..30667) /locus_tag="DP116_02765" CDS complement(28397..30667) /locus_tag="DP116_02765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019487643.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_02765" /translation="MGTWDANLQTGRAIWNEQHFRLLGYEPVLSGEAMIELWHSRLHP EELDRTMQAIEQAQQNQTLYRCEHRIIRADNGQVVWLAEFGQFFYNDAGQAIRLVGVS FDISERKRAELALRESEERLRRAIAIETVGVIFFKTDGFITDANDAFLCMSGYSREDL EQGRVRWNKMTPPEWMPHSVRAVEEFISNGYTTPYEKEYIRKDGSRWWALFAATRLNA EEGVEFIIDITDRKQAQEASRRSEERYRTLFESIDEGFCVIEMLYDENDTPLDYRFLE TNPAFEKQTGLEQAEGKTARQLVPNLEDHWVEIYGKVALTGESLRFENGSEAMNRWFD VYAFRVGQPKSQKVAILFKDVSDRKRIEAEREQILQREQTAREAAENANRIKDEFLAV LSHELRSPLNPILGWTTLLRNGRLDAAKTAYALETIERNAKLQVQLIEDLLDISRILR GKLSLNVMPVDLGAVIWAALETVRLAAVAKSLEIQTTLSPAIGTISADAGRLQQVVWN LLSNAVKFTPQGGQITVTLTQTETHAQIQVSDTGIGINPNFLPYVFEHFRQEDAATTR KFGGLGLGLAIARQIVELHGGTIQAESPGEGKGATFTVNLPLLRNEHRGTKDEQTAVS LTPPALPLSNVRVLVVDDEVDTRELIAFVLEHAGAIVTSVPSALAALDVLARSKPDVL VSDIGMPEMDGYMLMQQMQAILQGKQIVAIALTAYAGEIDRQQALAAGFQQHLSKPIE PKMLVQTIATLVEKDE" gene complement(30713..31330) /locus_tag="DP116_02770" /pseudo CDS complement(30713..31330) /locus_tag="DP116_02770" /inference="COORDINATES: protein motif:HMM:TIGR00229" /note="incomplete; partial in the middle of a contig; 
missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(31502..32722) /locus_tag="DP116_02775" /pseudo CDS complement(31502..32722) /locus_tag="DP116_02775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007356041.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="histidine kinase" gene complement(32774..33337) /locus_tag="DP116_02780" /pseudo CDS complement(32774..33337) /locus_tag="DP116_02780" /inference="COORDINATES: protein motif:HMM:PF13185.4" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="diguanylate cyclase" gene complement(33763..33996) /locus_tag="DP116_02785" CDS complement(33763..33996) /locus_tag="DP116_02785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873824.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_02785" /translation="MVVVEQLEIAALIRETRQLLHLSQAELAAKLGVSFHSVNRWENR RTRPLPLARKQISTLLHQLGDSGQALLRKYGWE" gene complement(34234..35355) /locus_tag="DP116_02790" CDS complement(34234..35355) /locus_tag="DP116_02790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M28" /protein_id="PRJNA477356:DP116_02790" /translation="MKKRFWLLLMLLVTIVVIVATNVFIKQHRESVIYSIRVNNPLSA NKDTNTPRGEDTQTIKREDTQTENFSASLHPPISDSLQVSSDKLFAHILRLNFTRNTP PERSRTRAYITTELKKMGWKPKLEKFPDGVNIFAEKPGTEKDAGAILVGAHYDTVYIS PGADDNATGVAVTLELARLFASRPTPRTLQLAFFDKEEAGLLGSKAFVTNKKHLENLQ GVIIMDMVGYACHTPGCQKYPTTLPVTPPSDKGDFLAVVGDTEHLRLLNAFQNSEKLP AISFNKATTGGSILPSVLTLPIPLKGLLTPDTLRSDHAPFWYQGVGAVLVTDTANLRT AHYHQPSDVPATIDREFFAGSAQIVVNATSKLLENGKIE" gene 35492..35959 /locus_tag="DP116_02795" CDS 35492..35959 /locus_tag="DP116_02795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318258.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02795" /translation="MLKKIFLASTALICLTVLGNSLTIARGDTPNSNATNPHKTMEIS PGQPVPSVDLVVHQDTKKGWNLEVKVTNFRFAPENVNTTPKPGEGHAHLSVNGQKLTR LYSNWYYLEKLPPGKNRITVSLNTNTHEALIFNGKLIQDTEIIDIPAKSIQKY" gene 36168..37391 /locus_tag="DP116_02800" CDS 36168..37391 /locus_tag="DP116_02800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195684.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" /protein_id="PRJNA477356:DP116_02800" /translation="MINLLLILALGFLGSFGHCVGMCGPLAVAFSLSHKQETPRWRQQ LQFHTLLNLGRMLSYVLVGAGIGALGSVLLASGQMAGIGSQVRHWIAIITGILLIWFG IGQIKPDFLPRIPLLHPLVQGRLQNYLSQGMIKLSLQTKWWTPALLGMTWGLMPCGFL YTAQIKAAETGNWWMGAATMLAFGIGTLPTMLGVGVSTSLVSKDRRSQLFRMGGWVTL TIGVLTLLRTGDTMVDYTGHGALVCLILALVARPISNLWAAPLRYRRALGVGAFVLSL AHTVHMMEHSLQWNFAAFLFLLPDYQVGMALGAVALALMTPPALTSFDRLQKSLGKRW RAIHLLSVPALLLSTIHAVIIGSHYLGSSQVTWENKLAAVLLGIVTLVVLLVRWRFFW SMLSLQKFYVPSKKS" gene 37426..38052 /locus_tag="DP116_02805" CDS 37426..38052 /locus_tag="DP116_02805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311133.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02805" /translation="MSAAKRSLKKQVLLLFVFVLFLTIASPAIAHKVKTEGVVGATLH VEPNDNPRAGEPAKTWFALTRKGGKVIPLAECNCQLAVYAEPHSASEPPLIEPPLQAV SVERFQGIPGTEITFPRPGAYQLQLSGKPKDGKSFQPFELKFPVTVAVGSATNNNVQE SQTVQNVNQSVTEERTQGVPFWAIALSVLLAVGVFFGVLRMVKKREGG" gene complement(38149..39147) /gene="galE" /locus_tag="DP116_02810" CDS complement(38149..39147) /gene="galE" /locus_tag="DP116_02810" /EC_number="5.1.3.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866301.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-glucose 4-epimerase GalE" /protein_id="PRJNA477356:DP116_02810" /translation="MSPGKPTILVTGGAGYIGSHTVLALKQAGFDVIILDNLVYGHRD LVEKVLQVELVVGDTGDRPLLDNLFNTRNIAAVMHFSAYAYVGESVSDPAKYYRNNVV GTLTLLEAMLAASVKKFVFSSTCATYGVPEVVPIPEDHPQNPINPYGASKLMVERILS DFHAAYDLKSVRFRYFNAAGADPNGLLGEDHNPETHLIPLVLQTALGKRESISVFGTD YPTPDGTCIRDYIHVTDLASAHVLGLEYLLKGGDSEVFNLGNGNGFSVKEVIETAREV TQRDIKVVECDRRPGDPPALIGSGDKARKILGWHPQYSSVEEIITHAWQWHQKRHQ" gene 39459..40148 /locus_tag="DP116_02815" CDS 39459..40148 /locus_tag="DP116_02815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875181.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02815" /translation="MFVLLSRVLLWLLVGTIIYSLFQRFYPSGTFVGRLILVILLLVV LLSFINPNEPAVASLWGVVSFPLKPLGASILLMIFAAQRMKSGGTLDKPGGYLIGWAL TILLLASTPAVAYFLVRSPIAMVGQPYLANQEIIQNVRLASSMVPATPTSETLVALGQ DTSVASDGIIYASRMSDIKTTPYLLQTPQAIRTRGLRLEDFVPNAETLQITTRVWESY LNQIYTFLRGR" gene 40354..42105 /locus_tag="DP116_02820" CDS 40354..42105 /locus_tag="DP116_02820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756884.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_02820" /translation="MANSRRLSQLLAYLRPHWREATFGIIALLIVNGLGAYIPLLIRS AIDRLSVEFSFNQIKYLVIQIVLFTSAMWLIRMASRIWLFGVGRQVESDLKQRIFEHL LKLEPSYFATNTAGDLISRATSDVDNIRRLFGFALLSLANTLFAYIFTLPVMLALSVD LTLTSLAVYPFMFLLVHLFSERLRTEQSAVQERLSDISGLIQEDVSGIALIKIYSQEE NERRAFANQNQELLQANLKLAKSRNTVFPLIGGLATVSSFIIIWLGSTRIANGSLAVS DFIVLFLYIERLVFPTALLGFTITAYQRGEVSIDRVEAILTVTPKIKDTRDTIHLPPE QVKGELTARNLSFTYPGSTTPALCDVNFTIAPSETVAIVGAIGSGKSTLANAIPRLLD IEAGQLFLDGVDITKIVLADLRSAIAYVPQDSFLFSTTIKNNIRYGDPVSQQQEVEYA AKMAQIHPEISYFPQEYETIVGERGITLSGGQRQRTALARGILVNAPVLILDDALSSV DNQTATAILKNLSAGTQRKTVIFITHQLSAAATADRIFVMDKGKIVQKGTHVELLQQP GLYRSLWNQHQIEELLH" gene complement(41993..42265) /locus_tag="DP116_02825" CDS complement(41993..42265) /locus_tag="DP116_02825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017304007.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02825" /translation="MGILVESQSIYLVTGIKNTPLTKVRFYEVLFPVPYSLFPVPCSL FPVPDSCKKSNAIILLSDADSINFYTVQAAAKVLRGCLFERFSPCP" gene 42645..43322 /locus_tag="DP116_02830" CDS 42645..43322 /locus_tag="DP116_02830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="PRJNA477356:DP116_02830" /translation="MYQISCPTVVDGATADTWELQLTAHPSVNSKLKRCLDIVGSIVG LLILSILFVPIAIAITIDSPGPIFFTQERYGLYGRSFRIRKFRSMVSNAEKLKSLVQN EADGLIFKNKNDFRVTKVGRFLRSTSLDELPQFWNVLVGEMSLVGTRPPTKDEVSQYN QRHWQRLNVKPGLTGEWQVNGRSHVKDFEQVVDLDLQYQKKWYPMYDLLLIVKTFFMI VGRVGAF" gene complement(43586..44266) /locus_tag="DP116_02835" CDS complement(43586..44266) /locus_tag="DP116_02835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860406.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="5'/3'-nucleotidase SurE" /protein_id="PRJNA477356:DP116_02835" /translation="MTLILTNDDGIDAPGIQALLKAVNGKEVIIAAPRDHLSGCGHQV TTTRPIHVHRRSENEYAIAGTPADCTRIAITHICKNVQFVLSGINAGGNMGVDAYISG TVAAVREAAMHGIPGIAVSHYLKKKLNVDWDLVARWTSSVLADLLNRPLEPGTFWNVN LPHLVPGEPDPEVVFCQPCTKPLPINYRIEGNDFYYEGEYAKRDRTPGSDVDVCFSGK IAVTKLRV" gene 44830..45491 /locus_tag="DP116_02840" /pseudo CDS 44830..45491 /locus_tag="DP116_02840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744771.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 12818 a 9852 c 9655 g 13356 t ORIGIN 1 attttaatac acaaggcagc agtgacgctg aagcgtcact gctgctttct attcaccact 61 tgataaatct acaaataagc gtgcttggta attgaggcaa tttaacttgc tcgcgagtac 121 caatggaaaa agaatgtaag ctaaacctat tcgcttttat ttacttatgt attttgttct 181 gactgacttt gcaacgctgt gcttttctcc aagcatgggg agtattttta tggtagcgaa 241 actgagagta aaaatgatta agattttgat agcctacttg taaagcaatc tgattaacgg 301 agtcgttagt tgataaaagt aaggcacatg cttgggctat ccgacgctca ataatccaat 361 cattgacggt tttaccagta atatttcgta ctaagttggt taagtaagct gacgaataac 421 caactgcttg agcgacttct tttaggctaa tactttgatg ataattcaat tcgataaact 481 cgaagacgtc acgcagtcgg ggaatagacg gaaatataaa ctgaggagta tttgattttt 541 tacttgactc agttgtagac aatttctcct gtgctagctg aacacgaaca agatcatttt 601 ctgtgccaat cacttcaaac cctgctgtct ccaagcattt tactaagaga tttttacttt 661 caggattctc atctatcact gaaatattgg tcatttttta tcctagtatt taataaagtt 721 ttcacgtgat aagattgcat cacctgtaac tccttgtatc tgaagttata aggttaagaa 781 gaagttatta aatgagcagt ctaatttttg tcaccagtta cgtatcagca aagtacttaa 841 cgctttgctt taactgtcgc aacacgctag atataagtgt gttattacat acttatacgg 901 cgtgttttta atccttatgg atttgaaaaa atgcggtgtc cttttcagga attgaaagtg 961 ctacccttac acccattggt aagtagccat ctagcttgag gacaaccaac tttgtgattg 1021 gtagtaccaa aaaagattat ttcatcccca ctttctatat agatatgagt atcaacgact 1081 tatgcactta gcttcatccc catcagccct taggcgtcac caatacccta ctgattcttt 1141 tgctgacttt tgaatcagga cttgctgaat tcttcttcaa taactatctt tcacaggact 1201 tacgcataaa taacgtagtt gcacccgttg cgacgtaagg aagcaatttt acaatcttgt 1261 tctttagtaa ctcttgcgta agtcttagtt ccatgaaacc cgacaaaaaa agtcgggtat 1321 gtttgacgag caaaaacgct cgtggtaaca tcaatccaca gttgacggac aaccagggaa 1381 gactcatgac acaggcgaca accaacaaca cccaagctac cactgccact tttaaatctt 1441 tgaagtgtaa ggaatgtggt gcggagtatg aactcaaggc tcttcatgta tgcgagttct 1501 gctttggtcc attggaagta acctacgact atagcgccct acgctccaca gtcacccgtg 1561 aaacaattca ggctggacca aactccatct ggcgctatcg 
taagtttttg cctgtggcta 1621 gtgagaaccc cattgatgtg ggaactggta tgactccgtt ggttcgttcc catcgcttgg 1681 cacgtcgcct tggtttaaat aaactgtata tcaagaatga tgccgtcaac atgcccaccc 1741 tcagcttcaa agatagggtg gtatcagttg cgctcacccg cgcacgggag ttaggtttca 1801 cgacagtttc ttgcgctagc actggtaatc tagcaaattc tactgctgct attgcggcac 1861 acgctggttt agactgctgc gtgttcattc ctgctgattt agaagctgga aaaattttag 1921 gtagcctgat ctacagtcca accctcatgg ctgttaaagg taactacgac caagtcaacc 1981 gcctctgttc tgaagtagcc aatacacatg gttggggttt tgtcaatatt aacttgcgtc 2041 cctactactc cgaaggttct aaaacactgg gctttgaagt agcagaacaa ctgggctggg 2101 agctacccga ccacattgtc gcacctttag cttctggttc actgtacaca aaaatttaca 2161 aaggtttcca agaatttgta gaacttggtt tggtggaagg aaaggatgtt aggtttagcg 2221 gcgcacaagc tgagggatgt tcacccatcg cccaagctta taaagaagaa cgcgacttta 2281 tcaagccagt caaacccaat accattgcca aatctcttgc aattggtaac ccagcagacg 2341 gggtctatgc tgtagaacta gccaagaaaa ctggtggtca tatagaatca gtcacggata 2401 ccgaaattat cgaaggcatt aagctcttgg cagaaacaga aggtatcttt acagaaaccg 2461 ctggtggaac aactattgct gtgctgaaga agttggcaga agctggtaaa attaacccag 2521 atgaaaccac agtggtatac attactggca acggcttaaa aacccaagaa gctgtacaag 2581 gttacatagg agaacctctg acaattgaag cgaaactcga tagctttgaa cgagcgttag 2641 agcgttcccg gacattggat cgcctagaat ggcagcaagt cttggtttag tcattaatta 2701 gtagtcattt gtcaattgtc atttttcttt tgttctaaac ttatgacaat tgataagttt 2761 taactcccaa ttcacgattc tccgaattta tgagtgtaaa agttttagtt cccactgctc 2821 tgcaaaaatt caccaacaac caagccaccc tagaatgtaa aggtgaaacg atcgcacaac 2881 tgtttgactc tttagaagaa aactgtcctg gtatcaagtc gcgcttatgc gatgaagctg 2941 gacaaccacg gcggtttttg aatctgtacg tcaatagtga agatattcgg tttttggatg 3001 ggaaagatac agaactcaaa gatggtgatg aagtaagcat tgtaccagct gtggctggag 3061 gctagcaccg ctacgcggaa gtcaaaagtc aaaagtcaaa agtcaaaagt gagccagtgc 3121 ggtggacggg ttccccggca taaagcaact ggcgaacccc gaaggggtca aaataattat 3181 tttgaatcaa tgactttgga atttgaattg atttgtgctt taggagcaag gcttgggtag 3241 cttgataaag cgatcgctgt cgagaggcgt ggctgtgtca aaatgaaagt 
gacttaacca 3301 cgctgtgatc tctcaaacgc aagacaaagg atagagctga gcaatggcag aagaaacaaa 3361 tcacaatgaa gctggggaag ttgctcccag cactgtagat aagcaggctc ccagtgtcgc 3421 tgaagaacac gctcccagta cagactcacc cgaagcgaca gacatcccta cagcaaacgc 3481 gcctgatccc aaagcggcga aatcagaagc ggatcccgac gatgcagcaa aaacaaagac 3541 tgctgcaccg aaacgcgaga agccagctgg tgcagcagct aaagccgccg caggagatca 3601 gccagacgct aaagcagccg ccgcagaaga gaaaccagct aaggcaaaaa aagagaaagc 3661 gccagctgtt gaggataagc catttggaga gtttatccag caagattatt taccagcggt 3721 acaaaaggcg atcgccaaag aaggagtgca agatttagcc ctgaattttg ccaagcaaaa 3781 aatttccatc gttggttttg acaagtctga ggaatgctgg caaatcatcg gtacttggca 3841 gaacggttta cgtcaattta acctgtattt tacacaagaa gatattcaag ggaagaaggc 3901 tttttcctgc aatgaaggta aaaagcccag tactcttgag tcattcttaa ttgatgaacg 3961 taaagtcacc cttgacttac tcgtttatgg actcctacag cgcttgaacg gtcaaaagtg 4021 gctgggaaga aattaattta gttcgtggaa atggtaggta acggctccag tctcggggtc 4081 gttgtctatc ttcacatatc ctgatttcag caactcctta aaaacagctt ccacttcagc 4141 aaagtttgca cccgtttcca taactgcttg agttacagtg agttgacctc cccttttttc 4201 tgcagctctg agtaattcca tcgttaattt ttcctgagga gagcggtaaa cttgcgccgc 4261 aacagctggc tgattcaaag gaacacccaa aggtgataaa cctgcttttg ctctgagtct 4321 catttgttgt tcatctacca tgttggggat aatgaacaag tctacaaact gcccgatacc 4381 aagcacacca ccagttaaca gccacaataa acccgtcccg atcttgccgt tgtataaacg 4441 gtgcagtccg cccaagccaa caaaccaggc tgcactcaag atgtaggaga caagaagacg 4501 ttcttgagta tctttactat tattgttttc tgctttcatc tcgtttctat cccctctaat 4561 ttattctttc ctaattacaa ttttcctact ttattttttt catatcagga tgtgctgccc 4621 gagttgtctg atttgtcata tcaccaccac gactgagcaa agcaagcgcg ttacaagcga 4681 cgaacgtgat tggatagtcc tcatttctaa aataactaaa ctataaaaaa atttactggt 4741 ttataatgtc acattaaccg cagaaacgat tatgctgcta tatatttaca gtatggttgg 4801 tttatgctta cctgacgtca cccagtgaat tcattttgct ggatgatttg agttcgcatg 4861 agcttctatt acaggctaca catcctcaat tgacgatttc caaaaaaatt gaagtgttgt 4921 tgcaactctt aacgtatgaa gtgtctgttg tcgccctaca ccttggtaca cttaacaata 
4981 cgaatataat actgtcctca tactgtgcct tgtaagtcca gttctacagg ctgtgaggta 5041 gcattaaggg aggattttca acgccagatg ctccacttgg ggaaacacgc caggtcgcac 5101 aacggggggt acccccgcaa gcgactggct ccccaagacc gcactggctc ctcgtgctgg 5161 ctctggcaaa cccgtaaggg taaaagtatt actgttgact tttgcaggca ctcaagataa 5221 taagacactc gtggcaatca aaacatggta gattccctca aaaaaccagg ctttgaagaa 5281 atgcggtccg ggattaaagt cccagccaaa gaaaccctac tgacaccccg attctacacg 5341 accgactttg atgaaatggc acggatggac atctcgccaa acgaggacga gttgaaagcc 5401 attctcgaag agtttcgtgc agactacaac cgccatcact ttgttcgaga tgccgagttt 5461 gaacaatcct gggatcatat tgatggggaa actcgccgat tgttcgttga atttctagag 5521 cgttcgtgta cggcagagtt ttctggcttt ttgctgtaca aagaactcgg tcgccgctta 5581 aaagacaaga gtcctgtttt ggcggaatgt ttcaacctca tgtctcggga tgaagcacgt 5641 catgctggct tcttgaacaa agcgctgtcg gacttcaatc tggcgctgga tttagggttt 5701 ttgaccaaga gccgtaatta caccttcttt aagccaaaat ttatcttcta cgcgacttat 5761 ctctcggaaa aaattggtta ttggcggtat atcacaattt atcgccactt agaagcacat 5821 ccagaagacc gggtttatcc gattttccgc tggtttgaga actggtgtca ggatgaaaac 5881 cgccacgggg atttctttga tgcgatcatg agatcccaac cgcaaatgtt aaatgattgg 5941 aaggcgcggt tgtggagtcg gttcttcctg ctgtctgtct ttgcgacgat gtaccttaac 6001 gacgtccagc gcaaagactt ctatgcatca attggtttag atgcacgaga atatgacatc 6061 tacgtgattg aaaaaaccaa tgaaactgca ggaagagtgt tcccagtcat actggatgtt 6121 gagcatccgg agttttatca gcggttggaa atttgtgtga agaataatga gaagttgacg 6181 gcgatcgcta actcaaagac tcccaaattc ctgcaattct tccaaaagct accctattac 6241 atctccaacg cttggcagat ctcccggttg tacttcatca aaccaattga tgctactaaa 6301 atccaagcaa ctgttcacta agttgtaaag ttttgctaaa acttaagtgg ttctgtatta 6361 tacagagcca ctttttttta tcaacgcaga ggcgcaaaac aatgagaagg ctgttatcag 6421 ctttaagctt tggagtgggt gtgaatttgg ttgtgatggc tactcctgtt ttgtgtgata 6481 caccttcaaa aaatgcccag gatttagata taagcccaga aattattaaa aatagtccag 6541 ttctacaacg ttggcaacat caagtgccta acgtgttgga agatattaag aatgacccta 6601 gttttcctac caagatacgt ctcggatctt cttacttttc ttctgacgaa gcatttggcg 6661 
tgaatattgg tgtggaggac gtgtttattg gtcgtactag tctaacagtc agtggtgagt 6721 atcaagcggc gttcaatggt caacgtcagg tttatggtgc agacttgcat tattatctgc 6781 gtcctttggg tagctacatc aatattacac ctgtggtagg ctatcggcat ctggaaatta 6841 acagctactc cacagatgga gtgaatcttg gtgcgaaact gttgttggta ttgtctcgcg 6901 gtggtgcagg agatatttct ttaacccaga gttggattgc tccgggtact ggagaagaag 6961 ttggtttaac aacaatatca gtcggttatg ctctcactca gaatatacgt atatccacag 7021 atatcgaaca gcaaaatagt aaacaaaata aagagactcg cttgggtata gtttttgagt 7081 ggatgcctta aaatcagctt gcttgtagga gctattttac ttttctagtg gtatcgattc 7141 ataaatccag caactttggc tttgtgtaat ccgtaactct cattccattt acagctgcga 7201 caatgaaacc taattttgtt gagtgattcc cactcgataa aggatagttg tttaatgttg 7261 cagacaggac aaatatcgta ttgagcaatt aaaacaagct tagaaaccac tgcaattaaa 7321 tcgtcaggat caaatggttt aactaatctt atagaaaatc ctgcatcaag agcaagagcg 7381 cctgcttctt cagaggcgtg tgctgttaaa gcgattgcag gaatctggct aaccgtcaga 7441 gcaaggtttc taactttttg aatcagcgaa taaccatcct caccaggcat ggcaatatcg 7501 ctaatcagaa tattcggctt gaactgcgta attgcttcaa gggcttcacc tgctgatgta 7561 accgcaatca cttgaacgtt gtactcttca aaaataacct ttatcaaatc aagattatcg 7621 gtatcatcat ctaccaccag cacccgcaac ccatccaaca attgtggagt gtccatacgc 7681 ctaaactcta ctcttaatgt gcttgcttat gttgaaaacc gtaattttgc ttatgatttc 7741 ttatcatttc ataagctttt attaatttta gtgagttttc ttattatttt gtatgagatt 7801 gactagattt atctatcaac aggatgatga atatagaaag aggaagaaaa gttaagattt 7861 tcttaatatt ttagaggtca agagaatttg aagttagggt ataaaattct aaattcccaa 7921 ttgtgagcct ttcacgcttc gatagcagtt atcagttatc agttatcagt atttgtttcc 7981 cataaccttg gtcagtattc actgttaaga gttccctgtt ccctgtacaa ttatgatcac 8041 caatctttca ttaaggggtg taaagccatg ccaatggcgg ttggagtcat agaaactcaa 8101 ggttttccgg ctgtgctagc agctgcggac gcaatggtaa aagctgctgc agtcaccatt 8161 gtctattatg ggctagcgga aagcgctcgc atgttagttg ctgttcgggg acacacagct 8221 gaggtagaaa gagcagttga agctggtatt gaagctggta ataaccaatc taacggtggt 8281 acggtcatta cccactacat tgttcctaat cccccagaaa atgtagaaag tattctgcca 8341 attcatttca 
ctcaaaaatc agaaccattc cgaataatgt gaattgacag gcaaatcatt 8401 ccactgcagg ttctaacatc cgaatcgtcc cagcgtcagc atcgatttcg acttctaaac 8461 caataggtaa tgtgagtaca ttttcgatgt gaccaatcat cgcaccatac caagcaggaa 8521 tgccgagtgg ttgaatgtga tcccacacaa cttcttctaa ggtgagagaa ccgtagtcag 8581 cgtctgggga acaatcggaa cattgtccaa acacaaaacc tgcaagtttg ttaaataaac 8641 ccgcaatttt caagtgagtc atcatgcgat caatgcggta aatattttcg tgagtatctt 8701 ccaaaaatag aatcgcacca ctgaaatcgg gtacgtaagg agaacctaca atagcggata 8761 aaactgaaag atttccgcca ataagccgac cttttgcttt accagaagta atggtttgtt 8821 tacgatactt cacctgcatg agacggttag aatcgtcgcc atctttttga ttttgaaaag 8881 tcacagcttc acccctaaat aaaacgcggc gaaaataatc cgtttgggtt gttttccaag 8941 aagttaatcc attgggaccg tgaaatgtca ccaagttagt ttgagcattt aatccgagaa 9001 ttaaagctgt gatgtcactg aaaccaacta gaatcttggg atttttgcgg atacgctggt 9061 aatcaagata aggcaatatc cggctacatc cccatccacc gcgaactggt aaaatggcgg 9121 cgatagagga atcactaaaa aactgattaa tatcagcagc acggtcttta tctttacctg 9181 ccaaataacc ataccgttca agtagatggg gtgcgagtcg cgggacaagt cccaatccct 9241 tgactgcatc tatgacaata ttaagttctt cacggacaaa taccgcacta gcaggaccga 9301 caatacccac aatagatcct ggctgcaagc gcttaggttt taataatggt ttacctgctg 9361 cttttgccaa ctgaaaggga aaggcaaata ctgaaggggt agcagctaaa cttaaaagaa 9421 agttacggcg gttaatcata aaaagttttt tacactcaga gtaagtgcta ttaaaccacc 9481 atgcaacagc agctgcaaat actaacttag taatgatatt cgtttaatcg tcatcatcat 9541 tgtcgtcgtt tttgtcatca ttgtcgtttt tgtcatcatt attgtcatca tgtctagtag 9601 ctggcgactg tgttgatgct gtagttggtt gtattgttga tgaaggtaaa aaggaagtct 9661 tgcgtaagac taatgcaatt acagcaaaga ctatggcagt agttgctgcc gcagctaaga 9721 ctgcggttat aactttttta tacgtgtttc tatcttcatc tctcattttt attaacaaaa 9781 ctcattcgca tcaactcttg ctgtactcat aataaaggtt agggtcaatc ctcaatcatc 9841 agacgaatcc tacagacgat tctggatggt gataagcctt tatgcttaac gcgtcccttg 9901 cgcgtgtccc cttgctcctc agcgtatcac cactcattgg tatccatcaa cacgcagagt 9961 cttaacctga actctggctt ttgaaggtaa tttcacaact agagtgtaga tactattgca 10021 gtaagtagtt caaacaggtg 
aatagctgtt cctcatagct cttgtgcaca cccttactga 10081 gtgtatttta ttgcaagcag aggcaccatc agccaagcct tctaggcgaa gcaatccaag 10141 aatatagtcc tgtgaactgc gctacccagc taatagttgg attttgacct atgagaaaag 10201 ccgttttcct gatactgcct attaccttaa tcctcttccc cagccagagg gcgcaatctc 10261 aacgagtaga gtcctccttt gatcgtccct acttggcgtt agttaaagga gacactccta 10321 acagttgtag ctttatctcc ggcgatttcc tggaacgggc ggcggacatt gttcaggatt 10381 tgctgtttgt ccgccctggg gactcgccac agcaggtcag ctatgaagtg cgcttcgttc 10441 ccgaccagcc ttatgcgcgt gatgcattgc aatggacggc acttgcagaa ggaaactata 10501 ctaaggctat cgtaaatttc cgcgacaacc gcgcgcaaga gcgcattttt acaatggcga 10561 ttaattacaa ccaaccgaat gaaaaactgt gtcagtgggc ggtacgagaa ccacaacagg 10621 gaactcaatc ccaatctcca gcttcaccac cagggcaata aggttcagga ggtttgcgcg 10681 aattgccgca aaccttgaaa ttcgccccat cttgttaagg acaagggcga taagccgaag 10741 gcaaaaagac aatacttgaa actgaacatt gatcgttcta ttgtttagaa ccctctactt 10801 tttgccttgc aagagaaagc tttgttagtt ttggctgggt tgttgagtta agcagattga 10861 acctcttttt gtgagtaaag atagtcgcaa tataatgccc acacagtccc aaataaacca 10921 gttatagaac ctgatattac tgccaaaaca ccccatccaa gtggtccagt ccaactggta 10981 atttccttta gtattgcagt acttgctttg ctaacaatat aagctgttcc tgttgctgct 11041 actgtaacta aacccaactc taccaacatt tctaacagtc cctgactgga taaatcctcc 11101 tcaaaataga ttttccagac acgggtgtac attataatat cagacgcagt tagtataagc 11161 tgcttaggaa cttctacacc gggggttggt gcagctgtag cagtaccact accaacagca 11221 taggtagcaa ctgctaaaac agcttctaat ctttttttgc ctagactgct cacagtgtca 11281 actcctgtac cttaaaaaag gtagtgtttg gttgctatga ttgcacaaac aacctttgat 11341 aaacagctac cactaataag attatcccct gattagcttt gaagtgaaaa ttttagtcaa 11401 cttgttgcta ttttacataa caagtcggat ggcaggatga cacagtgact aagaaagaac 11461 tcaccaacca cacattataa actcagtata agttctatac aattagaaat agaaaactca 11521 gaaaaaagga atgattctgc attctgtgtt ctctttttct gttctctttt tctgttctct 11581 ttttctgttc tctttttcta ctcaaatatt attcctcttg atttccagtt tgtgcttcct 11641 cgcccggacg ttgtgtagca ggagggtaat cgacaaattc ttgtgttctg gttgtatttg 11701 
tcatagcgcc ttcaagcaaa gcatcttctg gttctactga tttgtcagca gcagacacat 11761 tgatttgacc ttctgcgttg acgttaggtt tatcagaagt tcctttgtga ttagcatgac 11821 cactaggcat aactattcct ctaaatatgg acaagtgaat atatcatccc tcagtctaaa 11881 ctcctctgtc atggagaata cactatcata aggcttagtt catctacata aagactttta 11941 cttttgacag gtgttggaaa atcaaacggc ataaaagcgt tattttctaa ttacttatct 12001 ggtcaagtac aaatgtttgt ttttattttc gtttgtcttc atcgcagtta tgatttcaag 12061 caaacgaaat aattatagtg ggtttcaatt gcgttgactt ttctccagac tatcggtcaa 12121 ttacaaatga aattttcata ttctacgtcg tatatgaatg tatttgtctt tacacgtaaa 12181 acaataagaa tcgtggtagc agttgtcatc tggaaagctc tttaacaatg ggctatttgg 12241 tcatactatc aggtgatgtt gcataaagat tttgagcgat atcttaaaga tataaatact 12301 taattaacat ttgagactag aactatgaaa aagattatta cggctggtct agtcgtttta 12361 gccactactg gtggattttt acttaataac cagtcagttc aagctcactc tcgcggttac 12421 catcactttc gcccatatgg ttactatccg ttttatggac gtccaataaa ttcttgctat 12481 cctgtcactc actggcttgt tgatgaagat cctgcgtatc atcctcaagc ttatgctgat 12541 ggctaccgtc aaggacaaga gagtgcaaaa agaggaaaca agtacaagtc gcgtaccgct 12601 ggtggagaat ttggtcgtgg ttttgatgac ggttattata gaaggaaatt tgctggtcaa 12661 aagcaagtcg ttcctaacga atataagcca tacactacgt cagaatgcgg ttggtattaa 12721 aaaacaaaat tttatgattt tagaagttgc aatgcttaat gttcgcagcg ggatggagaa 12781 cgagtttgaa gtcgctttcg ccaaagcttc tccgattatt gcttcaatga aaggatacat 12841 ctggcacgaa ctccaccgct gcattgaagc cccaaaccgc tatctgcttc ttgtccggtg 12901 gcaaacccta gaagaccata ctactgggtt tcgtgggtca cctcaatatg atcaatggaa 12961 gcagttactc caccatttct acgacccatt tcccacggtt gagcattttg aaggggtatt 13021 gcagtatcac tgcagttaag cttactcgtc gtatggctca aaatcctctt ttgacgcccc 13081 gcacacagga catacccagt cttctggtat gtcttcaaag ggtgtccccg gttctatccc 13141 accatctgga tcgccttctt ctgggtcata tatgtaggcg cagacgttac atacatactt 13201 tttcatcttc ctctctcctc attaaggatc tgctttgcct caatattaca cacacgttgt 13261 ttgaattcgt aattcgtaat taattgctaa ttacgagtta cgaattatca gttacaagtt 13321 gttaacgaag agttgtcaaa ggcgcaccta accatccagc aaagttttct 
ttctgtgtag 13381 gagaaggatt ttctactgcc ttgatttgat actttctagt acaaggagac gccaaagtga 13441 cgggtaaggt acggagttcg tcttgatgga aaattgtgac ttgaatggtg tcccttggtt 13501 ggtaatcttt caggcgatcg cccaacccat tcgctgtgac acgcaaacca tcaatagcca 13561 gcaactcatc tcctggatca attcctgcaa attgcgcagg tgaatttgcg tcaacaaact 13621 taatgagttc ccgtccattg tcactattga ctttaactcc taagtaaggt tcttcatcct 13681 tttcctccac cagttgcaag ccaaatggtt ccaggtattc gttgaaaggc aaatcttcag 13741 tcttatcaat gtaggagttg aagaaatcag tcaaatctat accagcaata aactcaatca 13801 ccgactgcaa ttgttctgga gtataaccaa tttcatctct gccaaattga tgccacattt 13861 tcagcatgac atcatctagt gatcgctgat ttctagatcg tgaacgaatc agcaaatcta 13921 gcaacaatga taccatttcc ccttttaaat agtaggaaat ttggctatta ccgctattcg 13981 catctggacg gtaaagttta atccatgcat caaaactcga ttctgcaaga gattgtacct 14041 tgcgtcctgg agttgtttca tgccgagcaa tttctttact caagttattt aagaatgact 14101 ttgcatcata aattcctgcg cgtaagggaa tgagcaaatc gtaataactt gttgtccctt 14161 cacaaaacca cagtgacgat gtatagtttt cttggtcgta atcaaagacc tctagttctt 14221 ttgggcgaat gcgcttaaca ttccacaagt gaaagaactc gtgtgcgacc aattgcatga 14281 agcgttcgta tttgtcgcga tcgcggaaac cattacgctg ataaatgagc gcgcaagagt 14341 ttttatgctc caaaccacca taagcttgag caaacaaatg cagcagaaac acatatcgct 14401 gatatggtaa cccaccaaac atggctgctt ctacttcaat gattttcgca atgtcagcaa 14461 tcatctgctg caattggaaa ttgcccttcc cccaaattgc caattcgtgg ggttttccca 14521 atacctcaaa gtgatgtaac tggtggagac caatttcaaa gggactatct acaagagtat 14581 caaaatctga agcacaaaaa gtatttgctt tttgtcgaac tggtggtaat gctgtagtga 14641 cctgccactc tgggtgtggc ggtactatag tcacttgtat tggttctgct tcccaaccca 14701 gtattctgaa aaacagtgct gcaccattga aatagccgtg gcttgagtcc aagtgatttg 14761 ttcgcactga taactcatta gcaaatatac ggtattttat agttatttcc gacacctcac 14821 atgtttctat ctgccaatga tttttactaa ttttttgcca ggacaaaggc aagtcaaagg 14881 caaaagcagc aaaatcttgt aagtgcttgg catattctcg gactaagtaa gacccaggag 14941 tccacaccgg caatttcaaa tttaaaattg gcgacgagta attcaccaga cgcaaagtca 15001 cttcaaacag gtgcgtttct ggtttgggca 
ttgccacctg gtagtaaatt gctggtgcag 15061 tttccttggt ataaatgtta gtgcgaggtg cagttgcttc agtcatcttt gtttatgagt 15121 aagaattctg agttattaac tattagttaa tagttaagag ttaatggtca atagtcaata 15181 gttgtaagtc aaaaaacttg cttactgctg ttgcctcagc acttaatgct tctttatact 15241 ttaccacttc agtagagcaa ctacctgaga ttgatgaaat aaacaaccat gatggtattg 15301 tttattttga tctaacattc gtaactgctg aggaaagttc cgattttatt ttttttgcat 15361 caacccatta tcttagggtt ttgacttcat agtcaattct tcagccgctt ggacaatgct 15421 gaattttact tcgttggtgt tttccatatg aacttacctg atatagacta taccattttc 15481 tgcaataatg aaagtctccc taacgcacag agaagtcaca agacaacgtt taccataact 15541 tccgtaaaac acacaggaga acacctaagg ttttagagaa tcttagagga gacagtagcc 15601 aaaaatgcca ttttgtttag gatcaacact taggactgag agatttagca ctgttgtgtt 15661 aagattttca tcatttggta aagtattttt tggtaaataa aaaataaaaa atgacaaata 15721 atcaaaagat tgtttacgaa ttggaaacag gtaattgtga gtctatagtt cagcagctaa 15781 aatctacgac tatagatttc aaatctggag caattaccaa tttccttttt aggacgacct 15841 cagtcttgac tcctcaaacc ttgggctacc aggtaaactt tagtccattc acccattgct 15901 tgatgcattt tgagttttaa ctgggaagct gcgtcttcta tggaatgacc tgctttacgc 15961 aactctagga cttggcgctg cacaggtgtt aattgttcat acaattgttg ccattgctta 16021 ggtgtcagcc ccaagttgtg ttctttcaag gaaattgcca accaactatc taccaattct 16081 gatttccctt tgagggcaaa tacacgtaca gcatggtaac taattttttc tctgagtcga 16141 tagacttcct tgattggttt atttagtttt ttggcaatct catcttgagg cttaccttgg 16201 agatagagtc gcaaccactc aacagcttca agtcctagat tttcttgtaa ataatcctca 16261 aattcctgct gaactgtttt acgcagtgct tgttgttctt ctatctcttg tgcctcttga 16321 tattcagcta ctgcttgagt atcaaccaag ttgactcgat tctcactgtc atcagtaaga 16381 atttcctcag agacaagtct cactaagtct tgagttggca cttgggttaa gccaccacgt 16441 tgagttcttc tcaaataatt tacaaaacga tacgccagta gaggttgatt gcgcactggt 16501 cgcaaacaat attcttctac gctggcaaac agtagagcgt tcttgagtct agtatcagtt 16561 gtacattctg aaatagaaat catttgttgt tgtatgtagt tatcgctttg cagtaactct 16621 tggataactt cttgtagtac atccagcact gtccgttggc gatcgcgact caaggcgatc 16681 caagtttgga 
tcttatgacg taatgtcacc aaactcccca atcgagtgat taggttgcgg 16741 taagcacgtt ctctggcaat tcctaagtaa cgctgacgca aaatcttgta gcgatactcc 16801 atagcttgtt tggcgatatc cagttccttt gaattcagta tctcaaaccg atctaagtca 16861 cttcccacaa gccatttgac tatactttct ctattggcta cactttgttc tggacattcg 16921 gtctccagac gttttcgcca atcttgcgcc agtttttccg cctccaaaac catagtgaga 16981 ttgcgctcct cgaaaccctg ttttaaagtc tgcatcacaa cccccttttc tcgtccctct 17041 ttacttctta actggcagct gcacctgatt tgatgatgtc catgatgaca tctgatgtgt 17101 ttcctctttt attacgtttt gactgactaa agttatgcag gtagtttgaa atttcgatgc 17161 atcctgacaa tctcgaacta cggtaaatca agctttctga gctttgatcc cgacttttcc 17221 taaactttaa tttttgtaaa cttaggatat gtagtgttgg ctcagaaaaa cgatctaccc 17281 aaaatatata aggtgacaag atttttttga tcagggatca ggaaaaacta acattttgca 17341 cctgataatt ttggcgaatt tcatccacct ctttagacgt tgttttataa tgtatgtttc 17401 accctttcaa gtttcctctc agcagtatac atgattgcta tttttttatc ctattaccta 17461 gaaaaaacgt gtatttataa taactcaata ggatatcatt tgaattctcc aacgtaaatg 17521 aacccagaca ctctcagtct gtcttgatgt ttggaaaact tacatcattt ggaaaaaatg 17581 agcagtcgtt tcttggcaaa gaaggacgag tcaagaattg tcgtcaaaat gagcttagta 17641 aaccacttgt tattgtttca caaatccatc atcatgtaga aaatataaaa gaaaatcata 17701 tcggtaaaaa taagaaaaat aggagatttt atcataggaa aattcgggat acaggaaact 17761 cagaccatgc gcgggcgtag aggaaagtgg cacgggggat tttaatcccc gtaaatcttg 17821 cttgctaccg ccttggaggt acttaatacc gggtactttt gtattgcact ttcgttggga 17881 cagttaccaa ctgatattgt ttgctctgta cgcataacaa gtggcaataa gcccttgtgt 17941 atatatccct tcgctactga ttatcacaga ggttacagac tagggaggta tccggtaacg 18001 tgttttgatg taccactatc agataattca gtttgggctg ttaacccgta agctagacac 18061 tgttttagtc ttgtattaag aaattggtga gttcccctct tacaagccca gtggctttag 18121 ccctgggtca gtgacaggga ggctattgcc tttgccctta ttaagcatca taagatagat 18181 cgctctgctt cacatcagaa aaaacattat actataatgt tcgcaaaaag ctatgaaaaa 18241 tttaaaggta ccctgatgga tacctttaga aactgctggt ttattcaata ggctagagtt 18301 ttagcggaca tcctcataaa actgctttag gtattgtggc gagacgccta taccccaaag 
18361 aaagccaatg gaaagatata tggctgtcgc tttcaagaaa ccccaatgtg ccacacggcg 18421 atcgctactt tggacaacgc ggttcaccag acggattttc ccctgttgca ccagcttcag 18481 gcataagtca gcctcttcca tgattggaag attgctgtca aaacctccac atttccaaaa 18541 atcagcgcga cgtgcaaata tgacttggtc gccaaacaat aagcgtagtc cttgaaaaaa 18601 cagatgtggg cgaaaaagta atggcgcgta gtaacttttg acaaagttat gcagtgatac 18661 tccccatcga gtggttctat cacctgtcat caaagaaata aaaccccctg cagcaacact 18721 tttgtctagt agtatttcct caatgactgc gacaagatcg tctggaacca acgtgtccgc 18781 atgtaaaaaa caaaggattt ctcccgttgc gacctccgcg ccttgattca tttgtgctgc 18841 acgtccacgt tttttggcag caatcacaga aactccagcc ttttgggcta tggtcacggt 18901 ttcgtcagaa ctgccaccat ctacaattag tacctcctga actggaggta ctaatatact 18961 aagatggcgt agagtgcgct ccaaacactc cgcctcattt aaggtaggga taataattga 19021 gacaccagac atagattagt cattagtcat tagtcattag tcattagtca ttagtcatta 19081 gtgagccagc gctatgcagt ttttgtttcg gtgcgcgttg gtgactggca aacccgaagg 19141 atcattagtc taaattctac ccgtttttgc agcttatgta tagctggtac tctggttaac 19201 accgtttgct tgcgattaac ttggcttgtt agctactcat tagtaaacat caaacttttg 19261 actgtgaact accgactatc aactaactac caactacctg cgtacaattg gattttccac 19321 tgcttgtcgg tattcttcga cttgttcagt ataaccaata ggcaagacag cttcaacgtt 19381 ttcatgggga cgagcaatga tcacccagga ttcaagggtt ccgccatgaa cctgttctat 19441 agcctcaata ccagcctgca tcgaggtttt tacctcagat acatccccac gaatattaac 19501 tgtaaagcga gcactaccca ctcttatata tcctacgaga gtgactcggc ctgctttcac 19561 catggcgtcc gccgctgcta gcacagcagg aaaacctttt gtttcaattg atccaactgc 19621 ctgtagtggc attggttgtc tcctatgtca aacgtatgtg gcgctactga atcattgtac 19681 gaatcttcct acagatttca tagatgtgtc catgatcttt acttttaacc cgatcacatt 19741 atccctttct gggttcagcc caaaggtaac tccttgggtg gtaaaactat gacaaacaaa 19801 aaaaatatga aaacttacaa taagtttaat tgcattactt ttaagtgaat atactgctcc 19861 tgataaagca ataatctctt aaaatcaagt attgtaaggc acaatcagac ttgaaaatct 19921 tttagtcaaa ctttaatgtt acacgtcaaa atccaggaac acggaaatta acgcttgtta 19981 atccccaaaa agcactgaga ctgcgcgaca gcaaaggtca 
ccaaaagggg tgtaagggtg 20041 tagggggaac ccctatcaat ggtattaggc gtgggagtgt caaaactaaa atatctgaca 20101 tcttacgctg tcgatttatg taactcttgc ttaaagaaat aacaaataaa gatattcatg 20161 atttcgtgta atttttctga ctccataagg aaacacaatg caagaagtaa aggcaattaa 20221 gactttatta ccactgctaa gattgtatcc ttgggtaatt ccattaatta tcattctggg 20281 aatattctcc tctttgtttg agggattagg aatcagtctg ttcatacctt ttctccaaac 20341 tctagataca accaactcgc aaaaggtatc tagtaattta ctggtcaatt ttctgaatca 20401 aatatttatc aacattccac aagatagaca tctgttcata ataccactgt gtattttggg 20461 attagttata ctgaaaaact gccttgcata tatcagtact cttcttggtc attggctgta 20521 ctggcatatt ggacagcgtt tgctgtgcag aatttttcag cagcttttaa gcgtaagtta 20581 tagttttttg gactctcatc catcaggtaa attgctgaat acactggatg tcgaaacttg 20641 gcgaacttgc gacgctattt ttcttctcgt taatatcgct atcagtctgt gtacagtttt 20701 tgtttttgtc attctgctca tgctaacatc ctggcaattg agtttgctag tgactgttgc 20761 cttgctgctc atttcactga gtatacagta tgtaacccgg agggcaaaaa acctgggtag 20821 gcagtcagta caggcgaatg acaacctggc taatcgggtg atggaagggt tttacggaat 20881 gagggagatt cgcgcttttg gtcacgaatc ttacgaacta aaacgttttg aggaagcaac 20941 aatccactca cgtattatcg ctctgaagtt ggctaaactt tattcaatca cgggaccctt 21001 atccgaagtt ttagcagttg ctttattagt gtttattttg attatcgctc tgcaacaaaa 21061 gtctaatgtt cctacattat taacatttat ttttatgctt tatcgtctcc agccagtcat 21121 gaggcggtta gatactgacc gtgtgaattt gattggttta ttgggttctg tagaggatgt 21181 tatgtcattt ttagatataa cggaaagaaa tgaaatttgt tcaggtgatc ttcaatttaa 21241 aagtttacag cgaggaatca cttttgaatc tgttaatttt tcctacaaca cttcagaaag 21301 acctgcactt caggatattt ccttttttat cccacgaggt aaaaccactg cgatagtggg 21361 accttcaggt gcaggtaaat ctacagttat tgggttaata tgtcgctttt atgaaactga 21421 atatggagaa atatctgttg ataattaccc tctaagaaag tttaatttat cttcttggcg 21481 tagtcgaatt gctattgtaa gtcagaatat ttatatcttt aatacaacag taggagaaaa 21541 tatagcctat ggtcgcttag atgctacaaa aagcgaaatt atcgccgccg ccaaaaaagc 21601 gagtgcccat gaatttatta gtcaattacc tgaaggttac gatactatag ttggcgatcg 21661 cggcattcgg ctttcagggg 
gacaaaagca acgtatagcc ttagcccgcg cgatcgttcg 21721 cgaaccagag attttgattc ttgatgaagc aactaatgct ttggatagtc tttctgagaa 21781 tttcattcag gaagctctta attcctttag tcaaaatcgt actgtaattg tgattgccca 21841 ccgtttatct actattaagc aggcagaaca aatcattgtg ctgcaagaag gaaagattgt 21901 agaacaaggc aatctccagt atttacttaa actcaatgga ttgtttgcca agctttataa 21961 tctgcaatct ggtagtgctc tttaacagtt atcagttatc agttaccagt taccaggcag 22021 gaaatggact cgtctacccc ttgttcactg tttactgttc actgctcact gttttaagct 22081 caagtttaaa tctttaggac tgagagaatt cattcgagga tatagaaaat catgaaaagt 22141 caacctctcg tttctgtcat cactcccttt ttaaatactg aaaaattcat ccaagaagcg 22201 atagaaagtg tagtcaccca atcatataaa aattgggaat tattcttaat agatgacgga 22261 tccagcgata aaagtactgc aattgctcag gagtatgcag atttgtatcc agaaaaggtg 22321 tattatctcg accatgatgg tcaccaaaat cgtggtaaga gtacttctcg caacttgggg 22381 atcagcaagg caaaaggtaa gtatattgct tttctggacg ctgatgatgt cttcttgcca 22441 caaaaactag aacagcaggt ggcaattttg gaatctcaac ctgagactgg tatgatttac 22501 ggaccgactc aacattggca tagttggact ggtcaacaag aagattacca acgtgacaat 22561 atgagaccac ttggggttca gccgaatact ttgtttcatc caccaagctt aataactcag 22621 tatttaaaaa ttgctggaat cgtaccttgc acctgtggat tgttggtgcg acgggaggtt 22681 ataaaagctg taggaggatt cgatgacact attcaacata tgttcgagga tcaagtgttg 22741 ctagcaaaaa tttgcttgca cacccccgtg tatgtagata gtagctgctg ggataggtat 22801 cgccaacact cagagtctag ttgttcccaa gcaatccaaa caggaaagta tcatccttta 22861 aaacctaatc ctgcacatca gatttatttg aactggctac agaagtacat cactgaacag 22921 aaaatccaag acgctgaact ttggaatgca ctcgaaaagg ctctttaccc ctatcatcac 22981 cctatgttgc actttctatt gcgagtcaaa aagaaactag cgcaaattgc acgtaaaaaa 23041 ttccccggtt ttctccaccg ctttgtagga acgcagttcc ttggtgataa atatatccct 23101 cctacagggg ctgtggactt tggtagcttg cgacgagtta cgccgatgag ccaagccttt 23161 ggttatgaca gaggtcaacc cgttgaccgc tattatattg aaaacttcct tgctcattat 23221 caagaggata ttcgtgggcg tgtcttggag gttggtgatg ataactatac aagacagttt 23281 ggtggctatg tcgcaagcat tgactctatt caacgcatta cccaaagcga tgtacttcat 23341 
gtgacaaaag gcaatcccaa ggcgactatt gttggtgatc ttgcttgtgg cgacaatatt 23401 ccatcaaata gctttgactg ctttattctg acgcacacaa tacaaatcat ttacgatgta 23461 cgggcggcta ttaaaacagt tcatcgtatc cttaagcccg gaggagtcgc cctggttaca 23521 gttcccggta ttagtcatat tggtgattat cagtgggctg attactggtg ttggagtttt 23581 actgctttat cagtcaagcg tctgtttgaa gagttctttc cagcagaaaa tctccaaatt 23641 gaaactcacg ggaatgtact cgttgctaat gcttttctct atgggatcgc aacagaagaa 23701 ctgcgtcagg aagaattaga ctactgcgat cgcaattatc aagtcacaat tactatcaaa 23761 gcagtaaaac cacacactgt atgaaaatac ctggagttgt cagactgcgg cgaatggcta 23821 aacgcttgcg ggggcgttta gcaccgggag gtcttatttt gatgtaccac cgtatcacag 23881 aagtggaatc tgatccttgg tcaatttgcg ttagtccccg ccactttgcc caacagatgg 23941 aagtcttaca caagtttggc gaagttgttt cgttacagca attaaaccag acattgcaac 24001 aggggaaaac tcctcattgg cagatagcag tcacttttga tgatggttac acagataatc 24061 tctataacgc caaaccggtg ttagagcgtt acgatattcc tgccacgatg tttttaacca 24121 gtggatatat ggaacaaaaa cgtgatttgt ggtgggatga actgaatagg ttattgctgg 24181 aacctggttc tttaccagaa gttctttgtt tagagattaa cggtacaacc catcgctggg 24241 aattaggtac tgcagccaat tacagcgagg aggaatatca gcgcgatcgc tgttggaggg 24301 cgttagggga agataacccc acccctcgcc atactcttta tcgcacactt tactggctgc 24361 tatcaccctt gcttcccgaa gcacgacaaa aggttatgga tcaactgctg gcgtggggtg 24421 attgtgttgc aacactgcgc tcaaatcacc gcattttgaa tttagaggaa gtgtctactc 24481 tagggaacga gttgatttca attggtgctc attctgtgac acacccattt ttgtctttct 24541 tacctcatac ccgacagagg caagagattc aagacagcaa aactcatctt gaggagatta 24601 taggacaaaa agtggtgagt tttgcttatc ctcacggcaa ctattctgag gaaacagcag 24661 gtttggtgag ggaagcggga tttatgagtg cttgcacgac ttatcctcgt actgtgtgga 24721 aacagtgcga tcgctttaaa ttacctcgcg ttgttgttga ggactgggat tctgaagagt 24781 ttgccaaaca gttgtctgag tggagagaaa actaaacggt ttatctttcc acttcagcga 24841 cttccttagg tacacctgct gtcaagactt catgacctgt tgatgttacc aaaacatcat 24901 cctcaatccg aataccgata ccaacccaac gtggatctgt ctgtggctgg tcttctgcga 24961 gcttggtatc tgggacaata taaagtcctg gttctaccgt cagcacctga 
cctggttgta 25021 aaatctgcgg gttatcgtca ccatgctgat aaacacccac atcatgaaca tctaaaccta 25081 accaatgacc agtacgatgc atatagtacg gcttgtattt ttcttcttcg atcagcttgt 25141 cgatttcacc tttaaggata ccaagttcaa ctaagccttc tgtcaggacg cgtacggctg 25201 tatcgtgaat ttgtttgtag gggttacctg gttggacttg ggcgatcgct tgcttttgtg 25261 cttccaaaac aatctcataa agtctctttt gttctggggt aaatttgcca cctaccggaa 25321 atgtacgcgt gatatcggag ttgtaataac cataagcaca accagcatca atcagcagca 25381 actccttgtc ctgcattttt cggtcatttt caatgtagtg cagaatgcag gaattcaccc 25441 cagaagcaac tattgagggg taagctggtc caagggcacc ccggaggcga aagatatgtt 25501 caatctctgc ttgaacttcg tactcgtagc gccctggttg tgtaaatttc atggcgtgat 25561 tgtgtgcttc aactgcaata tcagcagctt tctgcatcaa ctccaattct gcttgacttt 25621 tcattaatcg catgctgtgc agaattggac cagtatcttc aatcgcaatt ggtcctgtac 25681 cgcgcttggg ataagtccgc atcaaacgct gccaatgatt aaggattgtg tcgttgaaag 25741 tgcgatcgcg tcctagatgg taatatatcc gatcagcttt ttccaaatac tgtggcaatt 25801 tttcatcgag ttcggcaata gggtaagcaa ccgatgcacc atacacttgt tttgccgctt 25861 ctaccccaca gcgataacca ctccagactt ctttctctct atctttaggt tggacaaaca 25921 acacaaaccg atgttctggg tgtgatggtg ctaaaactgc tactgcttgc ggttcgttaa 25981 acccagtcag atagtaaaag tcgctgtctt gccgaaaagc gtattcaaca tcattgtgca 26041 taacggctgt tggcgcgcta cgaaagattg cagtcccatt gccaattttt gacatcaatt 26101 gctcacggcg ctgctggtat tctgcctgca tcattctaat ggtgtttctt tattaattat 26161 tgtttcaaaa gtactcaaaa ctataagttt ctctagttag tttgtactta ctttcatcca 26221 gacctattat atactctttg ataaattctt tgtagtttct atacctcaag tatgattata 26281 tttgcttgat gtattaattc atatagagta ataaataata ccattgactc agagagtggt 26341 cactacagta ttgttactct agaatgagat atgccttgta ttaagttgta caaaactgtt 26401 ttttgttcag tagctcaaaa gaagttcccc actcaggcga tagataagaa tggctaaatt 26461 tcccatcatc gaatacgagc aactcagcga ttccaacgtc aaagcgatat atgaagagat 26521 tcaggttgaa cttggatttg gcatagtgcc aaatttattt aaatcaatgg caatcaaccc 26581 taggatattg gaagcgaatt ggaagaaatt tcgcagcact atcctcaaag gagatgttcc 26641 acgcacactc aaggaaatgc taggaattgc 
tatttcccaa gctaatagta gtccttatgc 26701 actgaatgtc catttgcatg gattatcatc attaggcatg agcgaagaag ttttaagaac 26761 cctagtttca gatttcgcag cctgtccgct tcctgaacgt gaaaaagcag tgatcggctt 26821 tggcttgaaa gccgcgactg aaccgcacgc actcacgagt aaagattacc aacacctcta 26881 tgacttaggt ttagacgatt ctgaaatatt tgagattgtc gccacagcag acttatttac 26941 aagtgtcaat aaatatacgg attcaatttc attagaaatt gatacattat gacttttctt 27001 cccgtctcta ttggtagtag gctggaacaa caagattatc aacaactgtt gttcctttcc 27061 cgtcgcctgc ttgctgaagc ccaagcgctt tccagtagga tcgcagctgt taatgaaatt 27121 gcgatcgcca tcaatcgttc gctgaagctg gatgagattt tgcgggttgt tagtaaacaa 27181 gcaaaatggt tgctagactt cgagcattgt agcgtttgcc tgcgcgatac gactaacggt 27241 tcttggcgta tcgtgacgct gtttggttct actgtcgaag tcaagttttc tgacacaaca 27301 caaatgggtg tcgttggtat gagtctgaaa acaggtcaag cccgactcat tcatgaaaac 27361 tctacaagcg gttttcttag ccagtaccaa tcgcagatca tcattccttt ggaatgtgat 27421 catcgggtgc ttggtacgat caactttgcc acctcagcac caaaaactta cactcaagaa 27481 gatttacgca ttggctattt actaggcgtg caattatcag cagccatccg taatgctgaa 27541 cgttttgaag agttaaatcg gctgtatctt gaactagaaa aagaaaaacg caagacagag 27601 gaattgcttt tgaatatact accaatagat attgcctctg aactcaagca gactggtgct 27661 gttaagcctg tctattatga atctgcctct gttctgttta cagactttaa gaattttact 27721 aagttatcag aacagttgac tccacaagaa ttggttgatg aactagatta ttgtttctct 27781 tgttttgacc aatttattga agcacataac ttggaaaaat tgaaaacaat tggtgacagt 27841 tatatgtgtg ttgggggaat tccaaatcac aacaaaactc atgccattga tgctgtacta 27901 gctgctatca atattcgcac ttttatggaa tggcgtaaaa aagaaaaggc gtttttaaat 27961 caaccatatt gggatattcg tattggtatt cattccggac cgttattagc aggtgtgatt 28021 ggacgtaaga agtttgctta cgatgtttgg ggtgacactg ttaacacagc ttcgagaatg 28081 gaatcatctg gtctgactgg aagtattaat atttctctct caacttttga gttaattaaa 28141 gattttttta tagttgaatc tagaggcaaa gttaatgcca aaaataaggg cgaaattgat 28201 atgtatattg ttaatggtat taaagatagt ctcgctgtag atccaacagg tttattacca 28261 aataaggaat ttaatcaatt gtattttgct atccaaccag aaacagatac attaattgaa 28321 gaggatttta 
ttgatcaaaa aacgtgttgt cctaatcatt gaattagtgt aaagaactaa 28381 acagggctaa acttctttac tcatcttttt caactaaagt tgcaatcgtt tgcacaagca 28441 tttttggctc gattggttta gaaagatgct gttgaaatcc agctgccagt gcctgctggc 28501 ggtcaatttc cccggcatac gcagtgaggg cgattgccac aatctgctta ccttgtaata 28561 ttgcctgcat ttgttgcatc agcatataac catccatctc tggcattccg atgtcactga 28621 ctagtacgtc gggtttagat cgggctagaa catccaatgc agctaaagcc gacggtacag 28681 aggtgacgat cgctcccgcg tgttccagaa caaaagcaat taactcacga gtgtctacct 28741 catcgtcaac aacaagcacc cgaacatttg aaagtggtaa ggcgggaggg gtcagcgata 28801 cagcagtttg ttcatccttt gtcccacgat gttcatttct tagcaatggc aagttgacgg 28861 taaatgttgc cccctttcct tcgcctggac tctcggcttg aatcgtacct ccgtgcagtt 28921 caacaatttg ccgtgcgatc gccagcccca accccaatcc gccaaatttg cgggttgttg 28981 ctgcatcctc ttgtcggaaa tgttcaaaga cataaggcag aaagttagga ttaatcccaa 29041 ttcctgtatc gctcacttga atttgggcat gagtttcagt ttgggttaat gtaactgtaa 29101 tttgacctcc ttgaggcgta aatttgactg cattggataa aagattccac accacctgct 29161 gcaaccgccc tgcatctgcg ctaatcgttc caatcgcggg tgaaagtgtc gtctgaattt 29221 ccagtgactt agcaactgct gctaatcgaa cggtttctaa agcggcccag atgactgcac 29281 ccaaatcgac gggcatcaca ttcaaactga gtttaccgcg tagaatgcga gagatatcaa 29341 gcaaatcctc aatgagttgc acctgtagct tggcgttgcg ctcaatggtt tccagtgcat 29401 aagcggtttt tgccgcatcc aatctgccgt tgcgaagcaa agtagtccaa cctaagatag 29461 ggttgagggg cgatcgcaac tcatgagaca gcactgccaa aaactcatct ttaattcggt 29521 ttgcgttttc ggcggcttct cgtgcagttt gttctcgttg caggatttgt tctcgttcgg 29581 cttcaatgcg cttgcgatcg ctgacatctt tgaaaagtat ggcgactttt tggcttttcg 29641 gctgcccaac gcggaacgca taaacatcga accaacggtt catcgcttcg gagccatttt 29701 caaaccggag ggattcgccc gtcagcgcga ctttgccgta aatctcaacc cagtgatctt 29761 cgagattcgg aactaactga cgcgccgttt taccctctgc ctgttcgagt cccgtttgtt 29821 tctcgaatgc tggattggtt tctaaaaagc ggtaatcgag cggcgtatcg ttttcatcat 29881 acagcatttc gatgacgcaa aagccttcgt caatcgactc aaacagcgtt cgatagcgtt 29941 cctcagatcg gcgcgaggct tcttgtgctt gtttgcgatc agtgatatca atgataaatt 
30001 ccacgccttc ttcggcgttg aggcgcgttg cggcaaacaa tgcccaccag cgggagccgt 30061 ccttgcggat gtattccttt tcgtaaggcg ttgtgtaccc gtttgatata aactcctcga 30121 ccgctcgcac cgaatgcggc atccactctg gcggggtcat tttgttccac cgcacccgtc 30181 cttgctctaa atcttcacgg ctgtaaccgc tcatgcacag aaaagcgtcg ttggcatccg 30241 taataaaacc gtctgttttg aagaaaatga cacccaccgt ttcgatagcg atcgcccggc 30301 gcagtcgttc ctcggattcg cgcaacgcca attccgcgcg cttgcgttcg ctgatgtcaa 30361 acgatacgcc caccaagcgg attgcttgtc ctgcgtcgtt atagaaaaat tgtccaaact 30421 cggcaagcca aaccacctgc ccattatctg ctcgaataat ccgatgttca caacggtaca 30481 acgtctggtt ctgttgggct tgctcgatcg cctgcatggt tcggtctaac tcttctggat 30541 gcaatcgtga atgccatagt tcaatcatcg cttcaccaga aagcactggc tcgtaaccta 30601 gtagccggaa gtgttgctca ttccagatcg cgcgtccggt ttgcaggttc gcatcccaag 30661 tacccatgcc tgcaccactt gtcgctaggc tgaggcgttc ctcactctgg cgtaattcgg 30721 tttctgcttg cttgcgatcg ctaatgtcgc tcgaaacccc caaaatctga cgaggaattc 30781 cctctgctgt gcgattaaag acaacgcttt gagaattgaa ccaacggtag ctaccatttg 30841 catgacgagc tcggtattca attgtgttca ccatcccgtc ggttgcggca cgaaatgtct 30901 caaggtaggc agataaatgc gggatgtcgt ctgggtgaat gatcctggca agcacgtctt 30961 ttcccatcac ctgaagttgc tcagacgtat agcccaataa atctgtaatt tgacggctgg 31021 tataaacgct ttgctgttcg attaaatcat gcacaaacag aattccaggc atggcttcaa 31081 tgatctgctg actgtagtgc agactctctt gtagctgtgc ttctatctgt ttgcgtgccg 31141 taatgttttc aaacgacagt cctaagcatt gattgggcag cgcaaacgct ttcaagctat 31201 aaatgcctgc ggtaacccca tcatcgctgt aggaaacttc gcctaaatcc aacggttgcc 31261 cggtctgcac tacagttatg taatgctcta ccagtggcga ttgcagcagc attgagaagt 31321 tttgagccat cgtcgttcca atcaacgact caaaatcaac acctgtgact tcagtagcag 31381 caggattggc aattaacaac cgaaaagagc caggatcgtt ggggtcttca agttgccaaa 31441 ccacgatgcc aacttgagta ctgcgaacca catcggcata gagttgaatt tgacggtggg 31501 cgctttctcg gcgcgatcgc ccaagttcta aattagccgc aacacgggca agcagttcgc 31561 gagtgctaaa gggcttgatc aaataatcat cagcaccggt ttgcaatcct tcgactgtcg 31621 attcttctcc tgcacgagca gacagcagca aaatcggaat 
ttcgcgagtt ttcggtgaag 31681 cccgcaattg tcgcagcaac tcaaagccat ccattcctgg catcatcaca tcgctgagta 31741 ctaaatcagg tgtgcaggtg tgaatagata ccaatgcagt ttcgccatct gcaaccacgt 31801 ctacctcata gaactggctg aggatgccat gcaggtaacc tcgcatatca gcattgtcat 31861 cgaccagcag aatccgggca gaggggcggg ggggcacaga ggcacacaag agagtttctt 31921 gctccccctc ttctttgggt agccagctta aagcttcctc gacataagag attgcaccag 31981 aagcagtgga ggtttgggtg cggctggcgt tgaggcactc aagcggcaaa tgagccgttc 32041 cgatcggcaa ccgaacggta aaggaactgc cctgacctag cgtactgtta acagcgatcg 32101 ccccaccatg cagctggact aattcctgaa ccaacgatag cccaatgccc gttccctcca 32161 aggtgcgtcc gcgtgcgcct gcgactcgat gaaaccgctc aaataaccgg ggaatctcgt 32221 ctgccggaat gccagtacct gtatcgccca ctaccagttc tacttggtcg ccaatcgaac 32281 gcacagaaac cgcaatttct ccctcaaacg tgaacttgaa cgcattcgag agcagattta 32341 acacaatctt ctcccacatt tcgcgatcaa cataaatggg ttctggcaag ggagggcaat 32401 cgacggttaa ttgcagtccg gcttgctcaa cggctgagcg aaaaatactc gccagttctg 32461 cggtatgcgt agctaaatcg gttggctcat agcaagctat agtgcgtccg gcttcaatgc 32521 gcgagaaatc taacagggtg ttgacaagct tgagtaggcg cgttccattg cgctgcacca 32581 tctctaattg cgatcgcgct tttgcaggta aaattccctc taattccgtt aatgtctgct 32641 ctagtggaga cagcatcagc gtcagtggtg tgcgaaattc gtggctgacg ttgctaaaaa 32701 aggtggtttt tgcccgatca agttctgcca gggcttccac tcgcttgcgt tcttcttcat 32761 acgcacgagc attggctatc gccgtcgcaa cctgccccgc aacaagttct aaaaagccgc 32821 gatactccga gttcagcgcc aggtagggac tgggacccac aactaaaaat cctgtgatgc 32881 cttcttgcgt ttgggcaggg attggtaaaa ttaaggcagt gttgggcagt tctggttgcg 32941 cctcggcatc tagtccaaaa tcatctcgca aattgctcac gaactgcggc tcaagagtct 33001 gcatcacctg ggcaaatgcc cacactccag cgacatcctg acggcaatcg acggtagtta 33061 aattcagggg agagtctgct gctagtccgg tttcactcac ctgctgggca aaagtttgtt 33121 gactgttgag caggtacaac agggcaaacg gaatatcagc gcgattgctg gctaaggtgt 33181 tgatcgccac ttgccctgcc gcttgaacgg ttcgcactct tgtggtattg actgccagtt 33241 cacgtaaagt atggagacgg cgctctccca aaacccgttc ggttgtttct gtcaccgtgc 33301 aaagaatgcc gccgatctca 
ccaccttcat cccgaatggc actgtaggaa aacgagaaat 33361 acgtttcctc gtcatagccg tagcgacgaa taatgaaccg catgtcaaca gacttgaccg 33421 ctttgcccgt gttgtaaaca cgcccaatca ggcgctcaac cgtttcccac gcttctgacc 33481 agcaatcgct cattcgctgc cccaatgctt tcgggtgttt actccctaac atgggtacat 33541 aggcatcgtt gtagaactgc accaactcgg ttccccaaca aatcagcatc ccaaaagcag 33601 attccaggca gatacttacc gccgttcgta agctaggcga ccagccctcg accgcgccca 33661 ccggagtcgt tgtccaatcg aacgaccgaa tgagtgctcc catctcactg tcgttgacga 33721 agctgaattc agagaccgac gccggttctt gagccacagt tcctactccc agccatattt 33781 cctcagtaag gcttgtcccg aatcacccag ttggtgcagc aaggttgaaa tctgcttccg 33841 tgccaagggg aggggtcggg ttcgtcgatt ctcccaacgg ttgacactat gaaatgacac 33901 gcctaacttg gcagcgagct cagcttgaga aaggtgcagc aactgtcgag tttcccgaat 33961 caacgccgca atttctaatt gttccacgac taccattgag ttcagtcctc tcgatttgaa 34021 ctgacaaaac tatagcaggt gcgatctgtg aaaatatcta tcagaagata tagataaatc 34081 gtatcctgcg gcacgctacg ctttgcgcat acgccggatt cgtgcgcgta ctctgtagca 34141 gagccgcacc ccgagggcag ccgaagctga agcggaaacc tcataaataa atttagcgga 34201 cgtgttacta gactacttga agcagatttg atatcactca atcttgccat tttccaacaa 34261 tttactagtc gcattgacaa caatctgcgc tgaacccgca aaaaactcgc gatcgatagt 34321 tgctggtaca tcactgggtt ggtgataatg ggcagtccgt agatttgcgg tatcagtgac 34381 caagactgcc cccactcctt gataccaaaa tggagcatga tcgctgcgta gagtatcagg 34441 tgttagtaag cctttgaggg gtattgggag tgtgagcaca gatggtaaaa ttgatccccc 34501 ggtagttgcc ttattaaagg agattgctgg gagtttctct gaattttgaa aagcattcag 34561 cagtcgtaaa tgttctgtat cgcctaccac tgctaaaaag tcacccttat cactgggtgg 34621 cgtcactggt aaagtagttg ggtatttttg acagccagga gtgtgacaag cataacccac 34681 catatccata ataataacgc cttgtaagtt ttccagatgt ttcttgtttg tcacaaaagc 34741 tttacttcct aaaagtcctg cttcctcctt atcgaaaaag gcgagttgca acgttctggg 34801 ggtgggacgc gaagcaaaaa gtcgggcaag ttccagagtg acagcaacac cagttgcgtt 34861 gtcatctgca ccaggagaga tataaacagt gtcataatga gcgcctacta atattgctcc 34921 tgcatctttc tcagttcctg gtttttccgc gaagatgttg acaccatctg ggaatttttc 34981 
tagcttgggt ttccagccca tttttttgag ttcagttgtg atgtaggcgc gagtgcgcga 35041 tcgctctggt ggtgtatttc gtgtaaaatt taacctgaga atatgggcaa acagcttatc 35101 actagaaact tgtaatgaat cagaaatggg gggatgcaaa gacgcagaaa aattttctgt 35161 ctgtgtatct tcgcgtttta ttgtctgtgt atcttcccct cttggtgtgt ttgtatcttt 35221 atttgcactc agaggattgt taactctaat actataaata actgactccc gatgctgttt 35281 aataaacacg ttagtcgcta ctataacaac tattgtcacg agtagcataa gtaacagcca 35341 aaaccgtttt ttcatcatct catcaaaaag gttaggggag agtactcact cctgacatag 35401 gatagcgcaa acactaagtt tgacatcgtg tcttttgaca gaagaactca cttcatgtaa 35461 agttcctcat gaaacttttc aattttcagt tatgctcaaa aaaatttttc tagcatctac 35521 tgcattaatt tgcctaacgg tactgggaaa cagtttaaca attgccagag gtgacactcc 35581 aaatagtaat gctacaaatc ctcacaagac aatggaaatt tctccaggtc aacctgtacc 35641 ctcagttgac ttagttgtgc atcaagatac caagaaagga tggaatttgg aggtaaaagt 35701 gactaacttt cgcttcgcac cagaaaacgt taatacaaca cctaaacctg gagaaggtca 35761 cgctcatctt tctgtaaatg gacaaaagct taccaggttg tacagtaact ggtactattt 35821 agaaaaattg ccgcctggta aaaaccgcat tactgttagt ttaaatacga acactcatga 35881 agctttgatt tttaacggaa aattgattca agacactgag attatagaca ttcctgcaaa 35941 aagtatccag aaatattagg aagaattaag tgacaaaagt atcaaaaaaa aataaaaaat 36001 ttagtatttt ttcgcgtaag tagataagcg taattaattc gtagatagta gtagtgcgag 36061 cgtctcgctc gcgggcaaga tgcccgcact acatcttata tttaattaaa ctcattactt 36121 atcttcttta atttcatttt ttatttttta taactgctta ataactgatg ataaatttat 36181 tactcatcct agctcttgga ttcctaggaa gttttggaca ttgcgttggc atgtgtggtc 36241 ccttggcagt tgcgttttcc ctatctcata agcaggaaac tccccgttgg cgacagcaat 36301 tgcagtttca cacattacta aacttgggac ggatgttgag ctatgttctc gttggtgctg 36361 ggattggggc acttggttcg gttttgcttg ccagtggaca aatggcagga attggtagcc 36421 aagtacgcca ttggatagcg attattactg gcattttgct gatttggttt ggtattggac 36481 aaataaaacc tgactttctg cctcgaattc cattgctgca ccctcttgta caaggtcgtt 36541 tacaaaacta cctcagtcaa ggaatgatta agctttctct acaaactaaa tggtggacac 36601 cagcgctttt gggtatgact tggggtttaa tgccttgtgg ttttctgtac 
actgctcaaa 36661 ttaaagctgc ggaaactggc aattggtgga tgggtgcagc aacaatgctg gcatttggca 36721 tcggaacttt gcctacaatg ttaggtgtgg gtgtgtctac atcgttggtt agtaaagaca 36781 ggcgcagtca attgtttcgc atgggtggtt gggtaactct cacaattggt gtgctaactc 36841 tgttgcggac tggcgacaca atggtagatt acaccggaca tggagcgtta gtttgtttaa 36901 ttctggcgct tgttgctcgt cccatcagca acttgtgggc agcacctttg cgttatcgtc 36961 gtgcgttggg agtcggtgct tttgtgttgt ctttggctca cactgtccac atgatggaac 37021 attcactgca atggaatttt gcagcctttt tgttcttgct accagattat caggtgggca 37081 tggctttggg tgctgtagca ctagcgctga tgactccccc tgccctcacc agttttgacc 37141 gtttgcagaa gtctttaggc aagcgctggc gagcaattca tttgttaagc gtgccagctt 37201 tactattaag cacgattcat gctgtgataa ttggctccca ctatttgggt tcttctcaag 37261 ttacctggga gaataaatta gcggcagtgc ttctgggaat tgtcaccttg gttgtgttgc 37321 tggtgcgttg gcgctttttc tggtcaatgt tgtctttaca gaaattttat gttccctcaa 37381 aaaagtccta ggaaaagatg tgatccctcc aacccccgac aggggttgag cgcagcaaag 37441 aggagtctaa aaaaacaagt attactactc tttgtatttg ttctattttt aactatagct 37501 tccccagcca ttgctcacaa ggtcaagact gaaggagttg ttggcgcgac cttacacgta 37561 gaaccaaatg ataacccccg cgccggagag ccagcaaaaa cctggtttgc tctgactcgt 37621 aaaggtggaa aagttatccc tttagcggag tgtaattgtc aattagctgt ttacgctgag 37681 cctcactcag ccagtgaacc accactcatc gaaccgccct tacaagctgt ttcggtagaa 37741 cggtttcaag gcatacctgg tacggaaata acttttccta gaccgggagc gtatcagcta 37801 caactgagtg gtaaacctaa agatggaaag agtttccagc catttgagct aaagtttcca 37861 gttaccgtgg cagttggttc agcaacaaat aataatgtac aagaatcaca aactgtgcag 37921 aatgtcaatc aaagtgtcac cgaagagcga actcagggag tgccattttg ggcgatcgcc 37981 ctctcagttc ttttagcagt tggtgttttt ttcggggtac tgcgaatggt gaagaagagg 38041 gaaggcggat gaggacgatg agaaagtgag agagtgaggg agtgtgagaa aatatgcctc 38101 cttcgctttt tatctcctcc acttctcatc ctcctgatct tcctcactct actgatgacg 38161 tttctggtgc cactgccaag cgtgagtgat aatttcctca acagaagaat attgtggatg 38221 ccagcctagg atttttcggg ctttgtcgcc actaccaatg agtgctggtg gatcaccggg 38281 acggcgatcg cactctacaa ctttgatatc 
tctttgtgtc acttccctgg cagtttcaat 38341 aacttctttg actgagaagc cgttaccatt tcctaagtta aacacttcgc tgtcgccacc 38401 cttcaacaaa tattccaatc ccaaaacgtg ggcggatgct aagtcggtaa catgaatata 38461 atcccgaata cacgttccat caggcgtggg gtaatcagtg ccgaaaactg agatggattc 38521 gcgcttacct aacgctgtct gcaacacaag gggaattaaa tgggtttctg ggttgtggtc 38581 ttcccccaat aagccattgg gatcggcacc agcagcgtta aaatagcgga accgcaccga 38641 tttcaaatcg taagcagcat gaaaatcaga gagaattcgc tctaccatga gcttgctagc 38701 accatagggg ttaattgggt tctggggatg gtcttcggga attggcacca cttccggtac 38761 cccgtatgtt gcacaagtag aagaaaatac aaatttctta acagacgctg cgagcatcgc 38821 ttctaacaaa gtcaaagtgc cgacaacgtt gttgcggtaa tactttgcag gatcgctaac 38881 agattcccca acatacgcat aagccgaaaa gtgcatcacg gcggcgatgt tgcgtgtatt 38941 gaatagatta tccaataggg ggcgatcgcc agtatctccc actacaagtt ctacctgtaa 39001 aactttttca accaaatccc ggtgtccgta aacgagatta tcgagtatta tcacgtcaaa 39061 acctgcttgt ttaagggcaa gcactgtgtg ggagccgata tatccagcgc cccctgtcac 39121 caaaatagtg ggttttccag gcgacatagt ttatcctttt gacttgattg atttgatgac 39181 tactagaatt aatttaattt acaggaggta ggttaataat ctggaatatt actgatagta 39241 gacgcagttg taaaaaattg ctctaaaatc actacgatgg ttctcgtggt gtgaggggac 39301 ccacccccgt gcggaggttc ccctccgccc ttggggagtg gcgttgtgag gtatttgaag 39361 tcagataaca gtacgattac aatttgacgg tacaataacc ccactgccca aaacctataa 39421 aaccgaccgc aaaaaaggat ttgagagtta atcgatttat gtttgtactg ttaagtcggg 39481 tcttactgtg gttacttgtt ggcactatca tatacagctt gttccagaga ttttatccct 39541 cagggacttt tgtcgggaga ctaatcttag tcattctgtt acttgtggta ctgctgtcgt 39601 tcattaaccc gaatgaacca gctgttgctt cattgtgggg ggtagtgtcg tttccactta 39661 aacctctggg agcatctatc ttgctgatga tttttgcagc acagaggatg aaaagcggag 39721 gaacgctgga taaaccagga ggatacttaa tcggttgggc gctgacaatt ttgttgttag 39781 cgagtacacc ggctgtcgct tacttcctag ttcgctcacc aatcgcaatg gtgggtcaac 39841 catacttagc caatcaggaa attatccaaa atgtgcggct agcttcgtct atggtaccag 39901 caacaccaac atctgagaca cttgttgcat taggacaaga tactagcgta gccagcgatg 39961 gtatcatcta 
tgcctcaaga atgtctgaca tcaaaacaac accatacctg ttgcaaactc 40021 cacaagctat tagaacaagg gggttgaggt tagaagattt tgtacccaat gcggaaacac 40081 tccaaataac aacccgagtc tgggaaagtt atcttaatca gatatacact ttcctacgtg 40141 gtcgttaggt gtagatttcc aaaatgagta ctcgtaaggt cagtctcgtg gctgacctta 40201 ttcttttcca gaacggcaaa gtttacgctc attggcacaa aaaagataca acattaatga 40261 ctcgcaaaag ttgccgaaaa acagtctcaa gatgaatgtg ccaaaaagta ctgggacttt 40321 aaagtaaaat gaaaaagatg cgtaattcct tgaatggcaa attcacgacg gctgtctcag 40381 cttcttgcat atctacgccc ccattggcgg gaagcaacat ttggcattat cgccttgttg 40441 attgtgaatg gtttgggtgc ttatatccct ctgttgattc gttctgctat tgaccgactc 40501 tcagtagaat tcagctttaa tcaaatcaaa tatcttgtta tacaaattgt attattcact 40561 tcagccatgt ggctgatccg aatggcttct cggatttggc tgtttggtgt ggggcgtcag 40621 gtggaatctg acctgaaaca acggattttt gaacacttac tcaaactgga accgtcgtat 40681 tttgcgacta ataccgcagg cgatttgatt agtcgggcga caagcgatgt ggacaatatc 40741 cgccggctat tcggttttgc gctcttgagt ttggcaaata ctttatttgc ctacattttc 40801 actttaccag tcatgctggc actaagtgtg gatctgacat taacctcact agcagtctat 40861 cctttcatgt ttctcttagt acatttgttt agtgagcgtt tacgcacaga acagtccgct 40921 gtacaggaga gactatctga catcagtgga ctgattcaag aggatgtcag tggtattgcc 40981 ttaataaaaa tttactctca ggaagaaaac gaacgtcgtg cctttgccaa tcaaaatcaa 41041 gagctattgc aggctaattt gaaactggca aaaagtcgaa atacggtgtt tccactgatt 41101 ggtgggctag ccacggtgag ttcttttatc attatctggt taggctccac aaggattgcc 41161 aatggaagcc tggctgttag tgattttatc gtactattcc tgtacataga gcgtctcgtc 41221 ttccccacag ccctcttagg attcactatc actgcttacc aacggggtga agtgagtatt 41281 gaccgcgttg aggcaattct gactgtcaca cccaaaataa aagacacacg agatacgatt 41341 cacttaccac ccgagcaagt gaaaggagaa ctcacagcaa gaaacctcag cttcacctac 41401 cctggttcca ccactcctgc actctgtgac gttaacttta ccatagctcc tagcgagact 41461 gtagcaattg taggcgctat tggttcagga aaatcaactt tggcaaatgc tatcccaaga 41521 ttgttggata ttgaagcggg acagttgttt ttggatggcg tggacattac taaaattgtc 41581 ttggctgatt taaggagtgc gatcgcctat gtccctcaag atagctttct cttcagtaca 
41641 acaattaaaa ataatattcg ctacggcgat cctgtgagcc aacaacaaga agtggaatac 41701 gctgccaaga tggcgcaaat tcacccagaa atcagctact ttccccaaga atatgaaacc 41761 atagttgggg aacgtggcat tactctttct ggtggtcaac gacaacgtac tgctttggct 41821 aggggaatat tagttaatgc cccagtgtta attttagatg atgctctttc cagtgtagat 41881 aatcaaacag ccacagccat cttgaaaaat ctttctgctg gtacgcaacg aaaaacagtc 41941 attttcatca ctcatcaact ctctgctgct gctactgctg accgcatttt tgtcatggac 42001 aaggggaaaa tcgttcaaaa aggcacccac gtagaacttt tgcagcagcc tggactgtat 42061 agaagtttat ggaatcagca tcagatagaa gaattattgc attagacttt ttgcaggagt 42121 cgggaacagg gaacagggaa cagggaacag ggaacagcga atagggaaca gggaacagaa 42181 cttcgtaaaa tctcactttt gtaagaggtg tattttttat accagttacg agataaattg 42241 attggctctc aacaagaatt cccaccgtac ataaaagaat ctcaccaaat gtttcacgaa 42301 gcgaagcatc acctatttag tatttttgca agaggtaaca taaaatcaac tctttaggag 42361 gatgtatttt tcctcagctt tgtatacctc agtgaacgca cttgctgagt agcaaacttt 42421 aacatgctta ccagaaatgt tcactttttt aacgactttt catcttacat aaaatatgtt 42481 tattttgctt taacttaaat ttataaaaaa tcatgatccc aagaatgttg atgaagattc 42541 agtgaatcta cggttaaact tcgtttgctt gtggtgagct aggtttatag tctaattatc 42601 gctaaataac catattttct tagctaaata taattggcat aacaatgtat cagatatcct 42661 gcccaacagt tgttgatggt gcaactgccg acacttggga attgcaatta actgctcacc 42721 cgtcagtaaa ttctaagttg aaacgatgtt tagacattgt ggggagcata gtaggactac 42781 taattttatc gattttgttt gtgccaatag caatagcaat tacaatcgat agtcctggtc 42841 caattttctt cacacaagaa cgctatggac tctacgggcg ctcattccgt atccgtaaat 42901 tccgctctat ggtttcaaac gccgagaaat taaaatcttt ggtgcagaat gaagctgatg 42961 gattaatttt taaaaataaa aatgacttcc gagtgacaaa agtaggtcgc tttttaagaa 43021 gtacaagtct agatgaactg ccacagtttt ggaacgttct agtaggcgaa atgagtttag 43081 tagggacacg tccaccaacg aaagatgaag tgtctcagta taatcaacgt cactggcaac 43141 gcctaaatgt taaaccaggt ttaacaggtg aatggcaagt taacggtcgt tctcatgtga 43201 aagattttga gcaagttgta gacttagatt tgcagtatca gaagaaatgg tatccaatgt 43261 acgacttatt actgatcgta aagacattct tcatgatcgt 
tggtcgagtg ggagctttct 43321 aaggaagcta ttaacactat ttggttatta gttattaatt attctacaac atccgtttca 43381 ttaaggctct caccttctag catgggtaag atgagaattc ttggtgagag cgaattaata 43441 aatctggggc acagtagccc caacgactcc ttttaaatga gttcttgagc aaggggagat 43501 cataggaaat cttacactag caagagagta aacgattctt tataaagtgt gagtacttaa 43561 actctggact ttatcatttg ctttattaaa ctctcagttt ggttacagct atttttccag 43621 aaaaacacac atcaacatca ctgccaggag tgcgatcgcg cttggcatat tcaccctcat 43681 aataaaaatc attgccctca atccggtagt tgatgggcaa tggtttggta caaggttgac 43741 aaaacacaac ttctggatct ggttctcctg gaaccagatg aggcaaattt acgttccaaa 43801 aagttcccgg ttctagagga cgattgagta agtcagctaa aacagaactc gtccaacgtg 43861 ctaccagatc ccaatcaaca ttcagcttct ttttaagata gtgagaaact gcaataccag 43921 gaatgccatg catagctgct tccctcacag cggcgacagt cccggaaatg taggcatcca 43981 cgcccatatt gcctccagca ttgattccgg acaacacaaa ttggacattt ttgcaaatat 44041 gcgttattgc aattctggtg caatcggcag gagtacctgc aatagcatac tcgttctcag 44101 agcgacgatg gacatgaatt ggtcgagtcg tggtgacttg atgtccacaa ccagagagat 44161 gatctctggg agcggcaata atgacttctt taccatttac agctttcagg agcgcttgaa 44221 taccaggagc gtcgatcccg tcatcgttgg tgagaatcaa ggtcatagat tgtcttgtta 44281 ggacttgtac aaacataagc aacaagactg attttgaagc aaatttgctt tgaaagcaat 44341 cgctcaaccg gcaacagtgt gcgttcttct caaaacaaac ttcaatacat aaggttaagt 44401 tgcgagttac tgtttaaagt tgattttaag tataaaaata ttatatagta tttaaaaata 44461 cggtaaatat tccaaattta tacttaaaaa aaattgataa ctagcaactt gagttacata 44521 aataaaggta cctgtacaca agtagtaaat gcaaggaata taacataatg acatttgcga 44581 tctagtaatt attacttaat tcaagattgt aaaatccacg ccaattgtgt aaatattgac 44641 taattttata tgcaatcaaa agagatgaaa taaagattta aaaaaaaatc atgattcttt 44701 tttagacctt tataattaat tctcaaagat tttcttagtg ttgtaaatgt gtgctttgtc 44761 aagcaactgc ttatcagatt ggcaattgga ttgtttaaga aagattaaat tgagaaattg 44821 gtattaatca tgcaagtctt aaaatctggg aaatctgccg aacaaagatt gcgtcttcat 44881 tttttagacg gcttacgtgg attagcatct ctgtatgtgg tgattgtcca cataaatcga 44941 tatatgggag aacaagtgcc 
tgtatttttg caatttatag gcaaaacttt aagatatgga 45001 aattttgctg tggcgatttt cattgtgctt tcgggttatg ttctgatgct accggtagcc 45061 cgttcgcaaa acggttatct tcctggaggt ttgtgggatt atatccaacg gcgagcgcgt 45121 agaatcttac ccccttacta tgctgctctg ttttttagtt tgctaacagc agtcatcata 45181 ttaggattca ttcacttttt caatttcaag tggcatgaat ctccggaata tggagaattt 45241 catcctttct tttcccctat tgacgtaatc acccacttgc tactcgttca caatctcacg 45301 gagtcgtagt gacagtggta ggtaatttcc ttcttaatct tcacctaccg ccaattgagt 45361 ttatcgcgat attttatgta gtggctatat cactgtcttt actcattgct tacttgtttt 45421 acttagtttt tgagagacca tttatgtcta attttttgaa aaagcgcaag gtaaaagatg 45481 cagtgaatta attgcagtaa aaagtcaaaa gtattttact aataattaat gactattttc 45541 aaatgtacgt ttaccgtctt cgtaaagtgt aaattgacca gtctctggta agatacgttg 45601 ccggggttgg atgtaaacgt gtaaaaattc ggcaaccctt cgggttcgca gtcgctacaa 45661 cggggggaac cccccgaagc t // LOCUS NODE_499_length_45507_cov_4.93989345507 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 45507) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 45507) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..45507 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..198 /locus_tag="DP116_02845" CDS <1..198 /locus_tag="DP116_02845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858624.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02845" /translation="ADYNQALRINPNNALAYGNRGNARAELGDKQGGIADLQKAADLF RQQGNTANYQKVLELIRKLQQ" gene complement(383..700) /locus_tag="DP116_02850" CDS complement(383..700) /locus_tag="DP116_02850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653932.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_02850" /translation="MTKRIIITPKASLDIDEHFAYIAQQNPDAALHFFDAVRETFAQL ARMSGMGSLYQVQNPRLQELRKWAVKDFKKYLIFYFEQDENIQIVRILYAGRDIERIL EQE" gene complement(700..1023) /locus_tag="DP116_02855" CDS complement(700..1023) /locus_tag="DP116_02855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02855" /translation="MKAYEFPAKVTTEGKIELPDTVLQQLPHNQQVKVIILVNEPSED KEDDEAWRRLPSEQLSKGYSEKAQEHIEELLIEGLESGETIEVTDEWWEQKRTYLMNK LRQGQ" gene complement(1079..1312) /locus_tag="DP116_02860" CDS complement(1079..1312) /locus_tag="DP116_02860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02860" /translation="MPTTDQSIQEVYRYVVDTLSLSERLRLAALILNDLTQQNITVID SSDTWSESDQLELTTFSLQYAASLFSESEETTQ" gene complement(1811..3184) /locus_tag="DP116_02865" CDS complement(1811..3184) /locus_tag="DP116_02865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112890.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_02865" /translation="MLGQTICGRYRIIRQIGKGGFGVTFLAEDTQRPGNPQCVVKQLK PQSDDSYTLHHAKRFFDQEAEILEILGNHDQIPRLLAHCTENQEFYIVQEYIKGQDLN QEVFSKRQLSEAEVIKLLKQILEVLAFVHQKRVIHRDLKPSNIMRRESDGKIVLIDFG AVKQVTTQIANPQGETQFTVAIGTPGYVASEQANGKPTLSSDIYALGVICIQALTGIH PDPRRRGFPTDSKTGEIIWRNQAQVSPKLANIIDKMVRYDYRQRYQSADEALEALKTL LPKKFLIGSGIAAVLAISLPAISIFKPPTPESFLQYENSNFGIKIKYPQSWQRQDINN PVTKEVVAFVSPQQSDVDKFKEKVIISVEEFSGTLDEFSKSSVQEIKKNTPDANVSTS ETSFANKLGKELVFPGKTGENSLQNLQVFTLKGDKAYVITYTAEKDNYDEFLKTVEKM IKSFEIQ" gene 3884..4129 /locus_tag="DP116_02870" CDS 3884..4129 /locus_tag="DP116_02870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861005.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I iron-sulfur center protein PsaC" /protein_id="PRJNA477356:DP116_02870" /translation="MSHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAAQIASSPR TEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLAY" gene 4549..6450 /gene="glmS" /locus_tag="DP116_02875" CDS 4549..6450 /gene="glmS" /locus_tag="DP116_02875" /EC_number="2.6.1.16" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutamine--fructose-6-phosphate transaminase (isomerizing)" /protein_id="PRJNA477356:DP116_02875" /translation="MCGIVGYIGTQAATEILLSGLEKLEYRGYDSAGIATIWEGEVNC VRAKGKLYNLRSKLEQVETPSQIGIGHTRWATHGKPEEYNAHPHMDAAMRIAVVVNGI IENYSELREELKQKGYQFLSETDVEVIPHLIAEFFKHPPSSVSPFCPSLLLEAVRDTV DKLRGAFAIALISADYPDELIVVRQQAPMVIGFGQGEFFCASDTPAIISHTRAVLPLD NRELARLTPLGVEVYNFAGERLKKQPRLLNLNPTMVEKQGFKHFMLKEIYEQPAVVRA CLDAYFSADWSVGDSTNSPIKLGLPAEIYANLEQIQIVACGTSWHAALVGKYLIEQLA GIPTQVHYASEFRYAPSPLTPNTLTIGVTQSGETADTLAALAMEKERRQDKEPKYQAR LLGITNRPESTLGHMVTQIINTLAGIEIGVAATKTFIAQLMAFYALALDLAYLRQTIS STKLQEIIDGLRQIPGEIEATLEKQEESIGQLAHEFAETKDFIFLGRGINFPIALEGA LKLKEISYIHAEGYPAGEMKHGPIALLDAKVPVVAIATPGSVYEKVISNAQEAKARDS RLIGVTPFNHGEAAEIFNDFIPVSKVDELLSPILTVIPLQLLAYHIAALRGLDVDQPR NLAKSVTVE" gene 6754..6986 /locus_tag="DP116_02880" /pseudo CDS 6754..6986 /locus_tag="DP116_02880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197940.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 7004..7273 /locus_tag="DP116_02885" CDS 7004..7273 /locus_tag="DP116_02885" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02885" /translation="MNHLGKTVPESARKLDKEQTPQFDMTGGDFVIAKLRGGKGLPDK GWEEVKPQATQKINKIAETINQHAKQINNINEVKDAHFGDVIHKN" gene 7309..9423 /locus_tag="DP116_02890" CDS 7309..9423 /locus_tag="DP116_02890" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02890" /translation="MPTPDELDELKVVFENLVGRTPSDEEIEALKACFKGNQSINFQV AKYINNVTEAQNSQFGDTHKLKPPAYRFSQNADTEAIQPAKLNQNQLSVDDVVQKVRL RFHDDIQRLHGTMPLLGVDHWVDLGELFVNVNILQEVSSSRKSELGDLWQDFIASVPE YSSDRSLDRIGLGKHQQRVSGLTVLAKNTNLMVLGKPGSGKTTYLQHIVTECNEGRLQ PHRIPVLIKLREFVDDGRKFEYSLERYLTQQWRLSEAETELVLSQGKALILLDGLDEV TGVDGRVISKQIKQFTRTYPQNQLIVTCRTQSQESRFERFDYVEVADFDETQVNAFAK HWFKAVCSDAEEGQAKARQFLDKLYIEENKPIRELAITPILLSLTCAVFQVQGKFYSK RSKLYEEGLELLLEKWDKSREIERDEIYRDLSVERKQELLSYLAVKKFEQPQYVLFEQ EEIEGYIAEFLEISRRDSRVVLRAIEAQHGLLIERAQKVWSFSHLTFQEYFVAKWFCE RSNWNDLVNHIIEMNWREIFLLVVMMYEPVDVLLQLMKQKADHLVVDESKIQKFLFWI FQTSQIVDFPYKLEVIRAFYFELYGGLNSTYKFRIASHLSSTSDGIICGVFYNAHNII KIKTMLRETQPKQKKKENYIDYIWIFSSEHENLLKQYYLANNILVDCLKNGFLKSDEV TKEIETNIFLPIAEIEKRKREK" gene 9536..9844 /locus_tag="DP116_02895" CDS 9536..9844 /locus_tag="DP116_02895" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02895" /translation="MNAKISIQIEKTQLGYSAYSPEIEGSQVQGDSVESVVDALKTIL STYLKKQEHQIDSGTDKPIWEIAQQITQDMTEEEIRQLPSDGAEQHDHYIYGIPKRKS " gene 9841..10251 /locus_tag="DP116_02900" CDS 9841..10251 /locus_tag="DP116_02900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016516942.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nucleic acid-binding protein" /protein_id="PRJNA477356:DP116_02900" /translation="MKTLFADTFYWVALINPGDDWYNRVLNTSNSLGQIQIVTTDEVL TEVLTFYSEAGTRMRQRTVEFVDNILNNTKIQVIEQTHASFLAGLELYRSRFDKGYSL TDCISMNTMRQLGITEVLTHDQHFAQEGFVILFR" gene complement(10301..10489) /locus_tag="DP116_02905" CDS complement(10301..10489) /locus_tag="DP116_02905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02905" /translation="MPFQKKNKLGATSLNEEPFDKSPVCFNVRIGVKEKLKAVPDWKE RLRKLVDELIEETGNTSQ" gene complement(10516..10875) /locus_tag="DP116_02910" CDS complement(10516..10875) /locus_tag="DP116_02910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208171.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_02910" /translation="MVSKATRRFVRQRARFLCEYCHSPEYLSPDRFTLDHILPQSLGG SDDDENLALACHRCNERHYNFTTAVDPKTQESVPLFNPRQQRWAEHFIWTADGLRILG VTTVGRAMRHLKNCVIC" gene complement(10794..11114) /locus_tag="DP116_02915" CDS complement(10794..11114) /locus_tag="DP116_02915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006105913.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02915" /translation="MITQLMVQPSSFIRSGVRMSQLGEVYIFKFTHELQARFEELLNK NKQDALSQAERAELDGISELSRIFTLINAQLAAQAKWCPRQLEDLSDNELDSSASTAI PPNT" gene 11552..11872 /locus_tag="DP116_02920" CDS 11552..11872 /locus_tag="DP116_02920" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02920" /translation="MNAVSKVDAILNAWFDYIALDDYSNAKIEANSDAIKQRGVSLVR DHVLIEPDTFSELRQKVTQGQKGQQEAVWALSFPQVLDVEKGKSYLCPLFSLDITPLK RVCC" gene complement(12009..12200) /locus_tag="DP116_02925" CDS complement(12009..12200) /locus_tag="DP116_02925" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02925" /translation="MLYNIVEGKPVVVESVAFLSKKVLLNELDNSQSQKFTQIFRLIQ TIIALPSKLERRLPEAQLA" gene 12221..12931 /locus_tag="DP116_02930" CDS 12221..12931 /locus_tag="DP116_02930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribose 5-phosphate isomerase A" /protein_id="PRJNA477356:DP116_02930" /translation="MTAATDPVKLSKQEVGKAAAALVQSGSIVGLGTGSTTAYAIEFI GDRLKSGELKDIVGVPTSFQAEVLAKQYGIPLTTLDAIDHIDIAIDGADEVDPQKNLI KGGGAAHTREKVVDYLASRFIVVVDGGKLVDRLGSVFPVPVEVIPMAVTPVTLALKKL GGKPELRMGVKKAGPVITDQGNMVIDVKFDTIDDPENIEKILNNIPGVLENGIFVNCA DVVLVGEVKDGQPVVREL" gene complement(13188..13427) /locus_tag="DP116_02935" CDS complement(13188..13427) /locus_tag="DP116_02935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316141.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02935" /translation="MPHYFSTQPTTDPSNTHSKGSTDAHTNNLIGLSIISFPLFVLLG IITYKKSRVAVYRRRIAFLEKIWLMDVKNNTYRQD" gene complement(14171..14761) /locus_tag="DP116_02940" CDS complement(14171..14761) /locus_tag="DP116_02940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317462.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02940" /translation="MIADEVNNFVTHCKRIQPQSQEEIKKFFEGVVSFPYDNELLLQA YLFLNIERIFPFCSELLLFEKSPIADYTDLGKCDFVYLSSFGNIFLIETKFIDTEATG ATERKRRNKHRNKVFEQVITLKSRFSEYWNIKLDQLECAVFTTDSEVAWRGTGVNVVT KSISMDKLKYWRRNYKTEVTSHKSEFRIQNELGGCK" gene complement(15099..16454) /locus_tag="DP116_02945" CDS complement(15099..16454) /locus_tag="DP116_02945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317463.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="trans-splicing intein-formed DNA polymerase III subunit alpha C-terminal partner DnaE-C" /protein_id="PRJNA477356:DP116_02945" /translation="MVKIVTRKTLSTQNVYDIGVERDHNFIIRNGLVASNCFNKSHST AYGYVTYQTAYLKANYPLEYMAALLTANSDDTDKVQKYISSCASMNIHIEPPDINRSG EDFTPSGDNILFGLSAVRNVGENAIKCILETREEGGEFKSLADFCDRIDLRTVNRRSL EALISCGAFDTIDSNRNQLLRDLELVYDWAQSRARDRASGQGNLFDFLGGPFSTAATT NQNQNSFDTAPKAQKVPDFPPQKKLQMEKELLGFYVSDHPLKAIRNSARVLAPINLSQ LGEQKEESAICAVVMLNGVKKVMTKKGDPMAILQIEDLTTQLEAVVFPKIYEQIHSLL QIDSRLIVWGKVDRREDQNQLIVEDVELVETVQLVIVQLNLQQADTIEEQHRLRTILQ EYSGEREKAKVPVIGIVQAGTSRQLVRFGRQFWVQDSRTTVQALQNARFSAYPQSLAD V" gene 16651..18105 /gene="gatA" /locus_tag="DP116_02950" CDS 16651..18105 /gene="gatA" /locus_tag="DP116_02950" /EC_number="6.3.5.-" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878833.1" /note="allows the formation of correctly charged Asn-tRNA(Asn) or Gln-tRNA(Gln) through the transamidation of misacylated Asp-tRNA(Asn) or Glu-tRNA(Gln) in organisms which lack either or both of asparaginyl-tRNA or glutaminyl-tRNA synthetases; reaction takes place in the presence of glutamine and ATP through an activated phospho-Asp-tRNA(Asn) or phospho-Glu-tRNA; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase GatCAB subunit A" /protein_id="PRJNA477356:DP116_02950" /translation="MASIRELHTQLIKKERSAVEITQEALDHIQALEPKLHSFLCVTA QQALEQAQQVDAKIAAGEEIGILTGIPIGIKDNLCTKGIPTTCASRILENFVPPYEST VTQKLADAGAVMVGKTNLDEFAMGGSTENSAFKLTANPWDLSRVPGGSSGGSAAAVAA GECVVSLGSDTGGSIRQPASFCGVVGMKPTYGLVSRYGLVAFASSLDQIGPFARTVED AAILLKAIAGYDPKDSTSLKVEIPDYTANLKPEFPKGKLKIGVIKETFGEGLDPQVEE AVNKAIKQLKELGAEIQEISCPRFRYGLPSYYIIAPSEASANLARYDGVKYGYRAPDA DNLISMYARTRATGFGAEVKRRIMIGTYTLSAGYYDAYYLKAQKIRTLIKQDFEKAFE KVDVLVCPTAPTTAFKAGEKTSDPLSMYLIDLMTIPVNLAGLPGISVPCGFDNNGLPI GLQLISNVLREDQLFQVAYAYEQSTTWKERSPQL" gene complement(18151..19032) /locus_tag="DP116_02955" CDS complement(18151..19032) /locus_tag="DP116_02955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317465.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="helix-turn-helix domain-containing protein" /protein_id="PRJNA477356:DP116_02955" /translation="MKLLKKDQQEQLVEIVAHLRQVREERSVGLKELAAYTRIQPAIL QAMEEGRFEELPEPIYVQGFIRHYANAIGLDGAALAKTVANICLTPEESNNDHQVVDE KPTIYIPLFVPYVLLLAVASFGLFYLLNPQRSVQSSSQKDLSPLAAEQNTESATQASS LTSSQPKTRPPVTSSTTKALSPALTTITPTPSSALTTITPTPSPQEATTTPVEVTLEL QDKSWVRVKVDGKTEFEGELKKGDKKTWTAKKEVMVRSGNAGAVLISTNKKQPTPLGS VGSIKQVTFTPETANSQ" gene 19205..20104 /locus_tag="DP116_02960" CDS 19205..20104 /locus_tag="DP116_02960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phospholipase" /protein_id="PRJNA477356:DP116_02960" /translation="MKINKLQDSVQLIFPFLIYVGVVVAISYFAVCIFLFLKQTRFIF FPSAVIDTTPEFYNLPYEDVWLPVSAKSGKVEQIHGWWIEANQPNGKVLLYLHGNSVN IGANVTRAHWFHQLGFSMLLIDYRGYGRSEGRFPNESQVYQDAATAWDYLVYQQQIPP SKIFIYGHSLGGAVAIDLALKQPNAAGLVVESSFTSIRKVLAYRNNFKMFPVDIILRQ HFDSIRKVPNLKIPVLFIHGTDDVIVPALMSQELYAAAPEPKKLILVPGAAHNNVAQV APSVYLEAVRSFIILAESRILSS" gene 20700..21797 /locus_tag="DP116_02965" CDS 20700..21797 /locus_tag="DP116_02965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457551.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02965" /translation="MATTSYLLKILDNPTIEISQDELRTLLGEIEAELHQSKVYRRAV VILQKLLGSSTEQANVLFKAVGREAIGLAFRQFVQQYQKVKEKPQEDTTIETSTIEKT NSTDESCNDSSQNFRSVVDQKTIIKTDEKADLPSESQVESHKANDSAKTKQNRPKTKT KIGWRGFGKKRKQAELALQMAVEQRVETRRQISQQLRQARESQGLSLSQLHAYTHVPL HHMEALEKGDWELLPEDVYVRGFVRVMANALGLNGTDLAASLPAPEPVKAILPAKYEH KSNFGLGIALHPVHFYLGYAALVAGSVGGLSIMASQQASADKMINKDAATPPSSSFTK SSQETKPISKPGLSNRVSITVGPDIAPPEAL" gene 22150..23508 /locus_tag="DP116_02970" CDS 22150..23508 /locus_tag="DP116_02970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317468.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_02970" /translation="MICCLNPECSYPLNSQGTKFCENCGAELIELLRGRYRIIKPLGG GGFARTYLAEDADKLDEKCVVKQLAPQVQGSWSRQKAMELFQQEAKRLQHLGEHRQIP TLYAYFKERNYLYLVQQFIEGDDLLQELKHKGVFDEAKIQEFLQDLLPVLIAVHQQQV IHRDIKPENILRRESDGKLVLLDFGVSKRKTGTVNPKPGTSIGSFGYAPYEQMYSGEA YPASDLYSLGATTFHLLTGVSPWEIWMKQGYSWTSTWRQYLTEPITEELGLIIDKLLQ EDYNQRYQTAEAVLQDSFFVLSPHSPLQHTILSPVQPETLTQEIQQQPPQYQGQFSVE KLLPWAIMTGSGSSFLLIALLSSVGTMWISSSLWLFVFVGFIFVQPYSVFEKTYLFIV TGITTLFVVFIYKNFYIVNLLKAGIDGFLVLILLAILAGLLTFTLLTVSQILNSFISK YF" gene 23550..24974 /locus_tag="DP116_02975" CDS 23550..24974 /locus_tag="DP116_02975" /inference="COORDINATES: protein motif:HMM:PF13458.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_02975" /translation="MKNQENIRLLISLGLAGLLIAGILWLMGRVIRPDKEYSGQFQPN SASISPSYINSPLKNRMSLGEKVFVREERPPEKDAGSKAFEAGDFSTAVSKFQLSLQA KRNDPETLIYLNNAKIGTSKSLKVAVIVPIGISLNEAEETLRGVAQAQDEVNSSGGIN GLPLQLEIISIDNFDVMKELSTELVKDTSIVAVVGFSRDPSIYNKGGLVMVSTVNPKK PSQPTKYVFYATPKFDVFSDAIASYIIQKTRLTNIAICSDSTFIVNQEKVSQEIVEQY TDSIKKYGGKVTSTACDLSAPDFQPSAFLSQAISDGAEGLILIPRPDKLNLAIDVARE NKGRLPLFSFQGMYTERTLKYGQADVKGMVLGVSWHNDALGNKSFAQKAVGLWGGEVS PRTATAYDALQTIITGLKEGNTRQELQKALSNPKFSAPGATGKIQFSQTGDRKGGVFL VKVEPCNPSQSCNSSTRYHFRLLE" gene complement(25047..25643) /locus_tag="DP116_02980" CDS complement(25047..25643) /locus_tag="DP116_02980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CPBP family intramembrane metalloprotease" /protein_id="PRJNA477356:DP116_02980" /translation="MAQQQKQEPEIPYLTRTQVLVAMGVTAVLLWIVAKVWLQFGNFA LMRWRWDQTELLWGVLLGLGITALSTITYRLWLPYRKSADFYLEMVLKPLALPDLIWL GLLPGLSEELLFRGVMLPAFGYSYAAVIISSICFGVLHLSGSQQWPYVIWASIVGILL GSSALLSGNLLVPIVAHILTNLISSYSWKIRQSQIVKN" gene complement(25689..26756) /locus_tag="DP116_02985" CDS complement(25689..26756) /locus_tag="DP116_02985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408583.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_02985" /translation="MRQYTAILIIPTGIGAAIGGYAGDALPVAKAASQVCDRLITHPN VLNGASLYWNLPNTFYVEGYGLDKFAAGCWGLRPVHQNRIGLLLDQAIEPELRLRHLQ AADAARATLGLSMTDYVITDAPLNVELRESASGISWGTIGNPDSLLRAAEVLIKKAGA EAIAVVARFPDTLDEQADQNYRLGKGVDPIAGAEAVISHLIVRTFQVPCAHSPAFFPS PVDPNLSPRSAAEELGYTFLPCVLVGLSRAPQFITETSYELSEFGDIWANQVDAAIVP ATACGNSALLSLSQSRRCQIITVEENQTQVEVRPQPLGIKSIQVNTYLEAVGVLAALK AGINPSALSRNISPLQSLINS" gene complement(27580..27894) /locus_tag="DP116_02990" CDS complement(27580..27894) /locus_tag="DP116_02990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317442.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_02990" /translation="MSKTYTVEIIHQGKTHTLQVPENETILSVADAAKLDLPSSCHAG VCTTCAAQILSGSVDQSDGMGVSPELQKKGYVLLCVAYPRSDLKIETEKEEIVYQLQF GK" gene 28183..30849 /gene="acnB" /locus_tag="DP116_02995" CDS 28183..30849 /gene="acnB" /locus_tag="DP116_02995" /EC_number="4.2.1.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase" /protein_id="PRJNA477356:DP116_02995" /translation="MLESYRRHVTERAAQGIPPLPLDANQTSELCELLKNPPAGEEET LLLLLRDCIPPGVDPAAYVKAGFLTAIAKEEITSPLISPIEAVQLLGTMIGGYNVQSL IELLQTPSVSLSSSSETPLVMGGQGKEPIAAYAATALSKTLLVYDAFHDILELSKTNP FAKRVIDSWAQTEWFTVRPVIPEFINVIVFKVPGETNTDDLSPAPQAMTRPDIPLHAL SMLESRMPGALQTIAQLKTKGYPVAYVGDVVGTGSSRKSAINSVLWHIGDDIPFVPNK RAGGYILGGSIAPIFFNTAEDAGAFPIQCDVSKMETGMVITIYPYKGSITNEAGEVIS TFTLKPDTILDEVRAGGRIPLLIGRTLTDKTRLALGLEPSTVFTRPQQPADTGKGYSL AQKMVGKACGLSGVRPGTYCEPMMTTVGSQDTTGPMTRDELKELACLGFSAELVMQSF CHTAAYPKPVDIKTHQELPEFFFSRGGVALRPGDGIIHSWLNRMLLPDTVGTGGDSHT RFPLGISFPAGSGLVAFAGALGVMPLDMPESVLVRFKGELQPGVTLRDVVNAIPYVAI QKGLLTVEKKNKKNVFAGRILEIEGLPDLKVEQAFELTDASAERSCAGCTIKLSTETV SEYLRSNITLLKNMVARGYHDERTIMRRVAKMEEWLANPVLLEADADAEYAEVLEIDL NEIQEPIVAAPNDPDNVKLLSEVANDPVQEVFVGSCMTNIGHYRATGKVLEGAGSVKT RLWICPPTRMDEHQLKEEGVYDIFNAAGARTEMPGCSLCMGNQARVADGVTVFSTSTR NFNNRMGQDARVYLGSAELAAVCALLGRIPTVQEYLEIVANKIHPFADNLYRYLNFDQ IAGFEDEGRVIPLEKMPKIEDILGMPTGAGSK" gene 31981..32202 /locus_tag="DP116_03000" CDS 31981..32202 /locus_tag="DP116_03000" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03000" /translation="MNHTDLQSQLLDQLPSGQTVNFEYGILEAQTRLVVQQCTNEIKT LMRRNSQDIIDIGQKLIEVKQHLGHGSFR" gene 32186..32413 /locus_tag="DP116_03005" CDS 32186..32413 /locus_tag="DP116_03005" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03005" /translation="MEASDSIENQVAISHETSDAAIASMAISIKNLTPKQLARMIIEA ANNGLSESELSAIVMASQQVLNTQQQDEYSD" gene 32862..36203 /locus_tag="DP116_03010" CDS 32862..36203 /locus_tag="DP116_03010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865301.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03010" /translation="MNSQNWRRKRGVVLTPKGLEKFQETKRKSETEENFGNRYTFEEI SARCGLYTGTISKVLNREGGVDKRSIEELFKAFQIKLDKSDYLSSNTRIDWGEAISTS VFYGRAEELALLEQWILNERCRLVTLLGMGGIGKTALSVKLAQQIQENFEYVIWRSLR EAPSIKAILANLIQFLSEQQETPVNLPESLSERVSRLLDYLRSHRCLLILDNLESILR SGIRVGQYLEGYEEYGEFIRLVGEATHQSCLVLTSREKPKEVASMEGQALPVRSLQLS GLQVVDGWEIVKIKGLSAAQDEWAPMIQRYAGNPLALKIVATTIQDVFGGNITEFLQQ ETTVFGEIRDILDQQFERLSNLEKNIMYWLAINREPIALSQLQEDMVSSVPQVRLLES LESLIRRTLVEKSATLVTLQPVVMEYVTQRLIEHVCEEIVTQNLDFFRSHALMKATAK DYIREVQIRLILQPVTHELLMIFRSKKSLENQLQKILAMLREKYLLEQSYTAGNILNL LCHLQIDLSNYDFSDLTVWQTDLRNVKLHDVNFQNANLAKSLFAETFGGILSVAFSPE GKVLAMGDTNGDIRLYQVADGLPLLTCKGHANWVLSLAFSPDGTILASGSSDNTVKLW SVGTGQCLQTLQGHNHEVWSVAFSPDGEVFASGSDDQTIKLWSVRTGECLKTFQGHAN WVLSIAFSPDGQTLLSGSEDQTVKLWDINTEECLKTFQGHHDGVRSIAVSPDGQMLVS GSDDQTIKLWSIRTGKCLRTFQGHTNPVYAVAFSPQGDTLASGSHDQTVRLWDVTTGE CLRVFQGHSNWVFSVTFDTEGEMLASGSWDQTVRLWNVSNGECLRTFQGHANQVLSVS FDSDGQRLVSGSNDQTVRLWDVTTGDILKTLYGHTNWVYSVAFSPQGNTLVSGSADKT VKLWNVSTGQVMKTLQGHGAAVRSVAFSPGGQMVVSGSEDYTMKLWELSTGQAMRTCL GHEAAIWSVAFSPRGTMIATASWDHTIKLWDPNTGECLRTLVGHKSWVWSVAFSSDGQ ILASVSPDQTLRLWSVSTGECLRILQLHSSWLQSIAFSPDNRTIATSTHEHTVKLWDI DTNQALRSLQGHTASSSCGVQRLRW" gene 37227..37973 /locus_tag="DP116_03015" CDS 37227..37973 /locus_tag="DP116_03015" /EC_number="2.7.7.60" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015178277.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase" /protein_id="PRJNA477356:DP116_03015" /translation="MYLLIPAAGSGRRMGSSRNKLLLTLLDQPLIAWTLLAADASESI DWIGIILQPQDRADLQTIVSNLSLSKPVHFIQGGATRQDSVYNGLQSLPPMAKHVLIH DGARCLATPNLFDRCAEAILHCQGLIAAVPVKDTIKVVDQKTHLITSTPDRSQLWAAQ TPQGFEVERLQQCHAEGRRQGWEVTDDAALFEQCQLPVQIVEGEETNLKVTTPVDLAI AEFILRQRLAEQSRQPLDKKFYERLPLADS" gene complement(38000..38488) /locus_tag="DP116_03020" CDS complement(38000..38488) /locus_tag="DP116_03020" /EC_number="4.6.1.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140993.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase" /protein_id="PRJNA477356:DP116_03020" /translation="MNIRIGNGYDIHRLVVNRPLILGGVEIHHELGLLGHSDADVLTH AIMDAMLGALSLGDIGHYFPPTDSQWAGADSVVLLTNVHQLVQDRGWQISNIDSVVVA ERPKVKPHIQTMRAKLSSVLELEPDQVSIKATTNEKLGPVGREEGIAAYATVLLLSHT PN" gene complement(38601..39788) /locus_tag="DP116_03025" CDS complement(38601..39788) /locus_tag="DP116_03025" /EC_number="1.1.1.267" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017719772.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose-5-phosphate reductoisomerase" /protein_id="PRJNA477356:DP116_03025" /translation="MKTISLLGSTGSIGTQTLDIVSQYPNEFRIVGLAALRNVELLAQ QIREFQPAIVAICDAQKLPELKEAIADLDPQPILLAGEAGVIEVARYCEAEVVVTGIV GCAGLLPTIAAIKAGKDIALANKETLIAGGPVVLPLVQQHGVKLLPADSEHSAIFQCL QGVPNGGLRRLMLTASGGAFRDWSVDKLSTVTVADALKHPNWSMGRKITIDSATLMNK GLEVIEAHWLFGLDYNHIDIVIHPQSIIHSLIELQDTSMLAQLGWPDMRLPLLYALSW PERIYTAWRQLDLVTVGELTFRAPNHQKYPCMQLAYAAGRTGGCMPAVLNAANEQAVA LFLSEEIQFLDIPRLIERACDRYQVHNYSSPTLDDIIAADQWARQEVVAVSKTLGRYT ITV" gene complement(39957..40292) /locus_tag="DP116_03030" /pseudo CDS complement(39957..40292) /locus_tag="DP116_03030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634538.1" /note="catalyzes the phosphorylation of 4-diphosphocytidyl-2-C-methyl-D-erythritol in the nonmevalonate pathway of isoprenoid biosynthesis; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase" gene complement(40434..41654) /locus_tag="DP116_03035" CDS complement(40434..41654) /locus_tag="DP116_03035" /EC_number="1.17.7.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015173447.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="4-hydroxy-3-methylbut-2-enyl diphosphate reductase" /protein_id="PRJNA477356:DP116_03035" /translation="MNIKAFKRSLHSSERYYRKGFGHEAEVTKMLYSEYQSSLIQQIR ENNYTLQKGDVTIKLAEAFGFCWGVERAIALAYETRQQFPTERIWITNEIIHNPSVNQ NLREMQIEFIPVNEQGKKDFSVVATEDVVILPAFGASIQEMQLLNQKGCKIMDTTCPW VSKVWNSVEKHKKRNYTSIIHGKYNHEETIATSSYADKYLVVLNLQQAEYVCNYILHG GDREEFLRLFAKAYSPGFDPDIDLEWVGIANQTTMLKGETEQIGKLFERTLMRKYGPA QINDHFLSFNTICDATQERQDAMFKVVEDKLDLMVVVGGFNSSNTTHLYEISSDRGIP SYHIDSAERLGPGNRIQHKKLHRDDIEVLEDWLPNGPIVVGITSGASTPDKVVSDVIE KIFAIKAAMNCVLN" gene complement(41779..42837) /locus_tag="DP116_03040" CDS complement(41779..42837) /locus_tag="DP116_03040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aromatic ring-hydroxylating dioxygenase subunit alpha" /protein_id="PRJNA477356:DP116_03040" /translation="MSTKLETVKNLTLTQPEREVNKRAMNLPASWYVAMPSKALGKKP KEIELFGQPLVAWRDQNGHPAIMQRYCSHKGTSLAIGKVVDGCIQCPFHHWRFDSSGQ CVFIPEVDKIPPKARQANYVTAERYGYIWVWYGSETPMFPLPEFPAAEDDRHNYMPFR FADLTKTTVRHVIENGFDQYHIITVHDLKISEPIKFTLLTDQYTAEVSEPPIPKEARF AAKVEFPIHDLDPVARTLGFNAENFTVLLDSWPAGQRVTAFVDGKEVYKLVVGMTPIA EKKSIQHILVMVKKTGKFWLDIFHYVVFSLQNKLGVKEDMPIYDNTNQNFGVADVKHD LGVLKFREFYQRWIDKVE" gene complement(43373..44137) /locus_tag="DP116_03045" CDS complement(43373..44137) /locus_tag="DP116_03045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_03045" /translation="MTTKIVTAAEMPNCYETSKAFLKQGLQRIFTYDIDTNENWLVLY QALYQVYPEYQYALVESATQQMIALGNCIPIAWKGTFEQLPDEGATWAATQMLTDNHS QGHQPNILCAVNIGVLPEYRGRQISSFMLQQMKKIAQVNKLSSLIVPARPTLKHLYPL TPMERYITWQNENGLPFDPWLRTHVKHGANLVGICSKSATIIDTISNWEARVNMRFPE TGDYVIPEALSPLTIDFTNNQGTYIEPNVWMHYNLA" gene complement(44375..45400) /locus_tag="DP116_03050" CDS complement(44375..45400) /locus_tag="DP116_03050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308628.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polyprenyl synthetase family protein" /protein_id="PRJNA477356:DP116_03050" /translation="MNTKVFERSFQNSKHNQSQVFHREIEVANTKKKSLQLVEAGRTP EEATFNLSTYLSQRQGLVEEALERSITVVYPEKIYEAMRYSLLAGGKRLRPILCLASC ELSGGTVDMAMPTACALEMVHTMSLIHDDLPAMDNDDYRRGKLTNHKVYGEDIAILAG DGLLAYAFEFIVVQTKGVPAEHLLQVVAKLSHAVAATGLVGGQVVDLESEGLKNISLE TLNFIHAHKTGALLETSVVTGGLLTGADEEILQRLSRYACNIGLAFQIIDDVLDITAT AEQLGKSAGKDLQAQKATYPSLLGIQESKNQAKKLIEEAKAELAPFGEKAIPLMAIAD YITDRTH" BASE COUNT 13099 a 9376 c 9797 g 13235 t ORIGIN 1 gctgattaca accaagcact gcggattaat cccaacaatg ctttagctta tggcaaccgg 61 ggtaatgccc gcgccgagtt gggagacaag cagggaggaa ttgctgattt gcagaaagca 121 gctgacttgt ttcgacaaca aggaaataca gctaattacc aaaaagtgct ggagttgatt 181 agaaagcttc agcagtaggc aggcggagtg atgggtagat gattcttagt caggacgtga 241 cttgctattg gattttgaca gcataattga aacatcagca gttttgctat tgggtaagca 301 aacaacatga gtctgcaact atcgattcct gattctgtac tggaagctat ccggctacct 361 gaaaagcgtc gcctacccat tatcattcct gttctaaaat cctctcaata tctctgcctg 421 catagagaat ccgtacaatt tggatatttt cgtcttgttc aaagtaaaaa atgagatact 481 ttttaaaatc cttgacagcc cacttacgca actcttgtaa acgaggattc tgcacttgat 541 acaaactacc cattccagac attcttgcta actgtgcaaa agtttctcta acagcgtcaa 601 aaaaatggag tgcagcatca gggttttgtt gggcaatgta agcaaaatgc tcatctatat 661 ccaaacttgc tttaggtgtg ataattatcc gctttgtcat 
tattgtccct gacgtagctt 721 gttcatcagg tatgtgcgct tttgctccca ccattcatca gtaacttcaa ttgtctctcc 781 actctccagt ccctcaatta aaagttcctc tatatgttct tgagcctttt cgctatagcc 841 tttggataat tgctcggaag gaagacgacg ccaagcttcg tcatcttctt tgtcttcgct 901 cggctcatta actagaataa taactttaac ttgctgatta tgcggtaact gctgcaaaac 961 tgtatcaggt aattcaattt tgccttcggt tgtgacttta gctggaaact cgtatgcttt 1021 catataacac cagtgacacc aggaaaatcg actgtgacga catcaccagg attgaatgtc 1081 actgggttgt ctcctcgctt tcagagaaaa gactagctgc atactgtaag gagaaggttg 1141 ttaattcaag ctgatctgac tcactccaag tatcgctact atcaatgaca gtaatatttt 1201 gctgggttaa atcattcaga ataagagctg ctagacgaag cctctcgctc agtgacaggg 1261 tatcgacaac atatcgataa acttcttgga tactttggtc agtagttggc atagctattc 1321 ctcctaacta gtattctatt gtaacgggaa gtaaagtccc gaaaatcctt gcgatagcac 1381 cagtcgcaaa gggacacgct gctttgtatg cctagtggga acggccacgc cttacggcta 1441 tcgcagcgtc tttgcaggaa atacggcacg cttagcgctt acgtttcact cgcggcggac 1501 atatagcaat cctaaatcat tcatgagaaa tttggaaaca attgggcgaa tagaattcgc 1561 tactacacaa acaaagtccc ttcgggtatg ccttcggcac accttcggtg aacgcagtcg 1621 cctacggagg gaaaccctcc tgcagcgctg tctcaccacc tacgtggact taaaaaaata 1681 acccgcgcag gcgggtttgg tctgtgtagc cgcgattttt aatcgccata aatcaattaa 1741 attttacatt tatttttgat gttaaaccta cgtattttct cacgattcca ttcggactgc 1801 tatatgctcg ttactgaatt tcaaatgatt taatcatttt ctctactgtt ttgagaaact 1861 catcataatt atctttttct gctgtgtaag taatgacata tgctttatca ccttttaaag 1921 taaaaacttg caaattttgt aaactatttt ctcctgtttt cccaggaaac actaattcct 1981 ttcctagttt attggcaaaa gaagtttcac ttgtactgac attggcatct ggtgtattct 2041 ttttaatttc ttgaactgaa gacttgctaa attcatctaa cgttcctgag aactcttcaa 2101 cgcttataat taccttttct ttaaatttat ctacatcgct ttgttggggt gatacaaatg 2161 caaccacttc tttggtaaca gggttgttta tatcttgcct ttgccagctt tgaggatatt 2221 ttatttttat tccaaaatta gaattttcat attgtagaaa actttctgga gtaggaggtt 2281 taaaaataga aattgcagga agacttatag caagtactgc tgcaattccc gatccaatta 2341 aaaatttctt aggtagtaat gtttttaaag cttccaaagc ttcatctgct 
gactggtaac 2401 gctgacggta gtcataacga accattttgt ctataatatt ggctagcttt ggactaacct 2461 gtgcttgatt tcgccaaata atttctcctg tcttggagtc tgttgggaat cctcggcgtc 2521 ttggatcggg atgaatccct gtcaaagctt gaatacaaat aactccaagt gcatagatgt 2581 cactgcttaa tgttggtttg ccgtttgctt gttcgcttgc cacataacca ggagtaccaa 2641 tagcaacagt aaactgagtt tctccttgag gattagcgat ttgagtggtg acttgcttga 2701 ctgcaccaaa gtcaatcaat acaatcttgc catctgactc acgtcgcata atattcgagg 2761 gttttaaatc gcggtgaata actcgttttt gatgaacaaa ggctaagacc tctaaaatct 2821 gttttaaaag ctttatcacc tcagcttcac tgagttgtct tttggaaaag acttcttggt 2881 tgaggtcttg acctttaatg tattcctgaa cgatataaaa ttcttggttt tctgtacaat 2941 gtgctaaaag tcttgggatt tggtcatgat ttcctaagat ttccagtatt tctgcttcct 3001 ggtcaaaaaa acgtttcgcg tggtgtagag tgtaggagtc atcagactga ggtttcaatt 3061 gcttgacaac gcactgagga tttcccggtc gttgcgtatc ttcagctaag aaagtgacac 3121 caaacccacc ttttcctatc tgtcggataa tgcggtagcg tccacatatt gtttgaccca 3181 gcatttaagt caccgtgtgt tcaagaactt gtcactgtat cggaaccatc ctaaatgttc 3241 gcttttgaag ccaaatgcag ttcacacgcc ttgggatggt tccactgtat atagcttatt 3301 ctgtcagtag actcaagccc tatctattag gattacaaat gacacaatac tatttttgcg 3361 tatttggtat gagagtgcga taagcctccg gcttgacgca ctctcatatg atgaatccaa 3421 atctttacta agattattta cattaattaa tattttgata aaaatctcag tcatttgaac 3481 acgcagaact gccatttctt gaaaaagtta ttaacaaaca agttatatct tcaacgcccc 3541 caagcactcc tttttgttga cttttgattc acatgtctat ataatacgac ccgtgctcaa 3601 aaagctggtg cttgaaagga gattgataca ggactaaata ctcaacttag gcaatccaaa 3661 actcaaaatt catcatcacc aaatccatgc cagacagtca ttagagctac gcaatacatc 3721 cttacttaaa tgcacaatag cttgactgca atttaaaggt gcttattaca ttctttaaag 3781 ttatgtagcc ttttaattac ggaagatcag ccactgcggt aaactaagcc atgattatga 3841 ttctttcaca ccagaaaaac tctctcaaag gagctttttt tcaatgtctc ataccgtaaa 3901 aatctacgat acctgcatcg gctgtacaca atgcgttcgt gcttgcccca ctgacgtact 3961 agagatggtt ccctgggatg gctgcaaagc tgctcaaatc gcttcttctc cccgcacaga 4021 agattgtgtg ggctgcaagc gatgtgaaac ggcttgtcct accgactttt taagcatccg 
4081 cgtttattta ggagctgaaa cgactcgtag tatgggtttg gcttactaaa ttttcccttc 4141 ggaaggcaat ttattctctc gatagcaagc acttattagt agtgggggct gggcattacg 4201 taccatttac aacaatggtg agtcatgtcc agcctgcaat tttatgaact taatgtaaaa 4261 ttttgaaaga aatcgcaata ctgatatagc agggaacagg gaacgcttaa cagggaacgc 4321 ttaacaggga acaggcttga aagtctcgtg gtgtccgtat tttctcatta gttcatgtcc 4381 taacctacct ggcaactgct atagattgca tctggttgca cgcactgaca ggctttcctc 4441 atatagctat aactgtatac aacaaagtca tttttttgcc aaaatgctac ctttgtgctg 4501 ttgactatac tgtgtttatt agtcctataa caagagtggt gtgatcaaat gtgcggaatc 4561 gttgggtata ttggcactca agcagcaaca gaaattctgc tatctgggtt ggaaaaacta 4621 gagtatcggg gatacgattc tgcgggaatc gccacgatat gggaaggtga agtgaattgt 4681 gtccgggcaa agggtaaact ttataatctg cgttctaagc tagaacaagt cgaaacacct 4741 tctcaaattg gtattggtca cactcgctgg gcaactcatg gtaaacctga ggagtataat 4801 gcccatcccc atatggatgc ggctatgcga atagcagtag tggttaatgg gattattgaa 4861 aattacagcg agttgcgcga ggaactaaaa cagaaaggat accagtttct ctcagagact 4921 gatgttgagg ttatccccca cctaatcgct gagtttttca aacatccccc ttcttctgtt 4981 tctcccttct gtccttcttt gttgcttgaa gctgttcggg atactgttga taaacttagg 5041 ggggcgtttg cgatcgcact catcagtgct gactaccccg acgaactcat cgtcgtccga 5101 caacaagcac ctatggtgat tggttttggg caaggagaat tcttctgcgc ttccgacaca 5161 ccagcgatta tttctcatac ccgtgcggtt ttacctctag acaataggga attggcacgt 5221 ctgacacctc tgggtgttga ggtttacaac tttgcaggtg aaaggttgaa aaaacagccc 5281 cgcctgctta acttgaatcc cacaatggtg gaaaagcagg gattcaagca cttcatgctt 5341 aaggaaattt acgaacaacc agcagttgta cgggcttgtt tggatgctta cttcagtgct 5401 gattggagtg ttggtgattc tacaaattct cccattaaac ttggtttacc tgcagagatt 5461 tacgctaatt tagaacaaat tcaaattgtt gcttgtggta caagttggca cgcagcatta 5521 gtcggtaaat atttaattga acaactggca ggaattccaa cgcaagtaca ttacgcttct 5581 gaatttcgtt atgcaccatc acccctgaca cccaatactc tgacgattgg tgttacgcag 5641 tctggtgaaa ccgctgatac gctagcagcc ttggcgatgg aaaaagaacg tcgtcaggat 5701 aaagaaccaa aatatcaagc acgattatta ggaattacca atcgcccaga aagtactctg 5761 
ggtcatatgg taactcaaat tattaatact ctggcgggaa ttgaaattgg ggtggccgca 5821 acgaaaactt ttattgccca actgatggca ttttatgcat tagcattgga tttggcatat 5881 cttcgtcaaa cgatttcttc tacgaaattg caagagatta ttgacgggtt gcggcaaatt 5941 cctggtgaaa ttgaagcaac tttggaaaag caagaagagt ctattggaca attagcgcat 6001 gagtttgcag aaaccaaaga tttcatcttt ttgggaagag ggattaactt tcctattgct 6061 ttagaagggg cgttgaaatt aaaggaaatt agctatattc atgcagaagg atatccggct 6121 ggagaaatga agcatggacc gatcgctttg ttggatgcga aagttccagt tgtggcgatc 6181 gccacacccg gaagtgttta cgaaaaagtc atttctaacg cccaagaagc caaagcgcgt 6241 gattctcgct taattggagt gactccattt aatcatgggg aagcagcaga aatctttaac 6301 gactttatcc cagtttccaa agttgatgaa ttgctttccc caattctcac agttattccc 6361 ctgcaattgt tggcttatca tattgcggcg cttcgcggtt tggatgttga ccagccgagg 6421 aatttagcga agtctgtgac ggtggaataa tataatttca gctaatgtgg aggatgtgga 6481 ggaataattc ggttgaacaa taaaaaatgc gattaaacaa tggataaaac agtttaacaa 6541 ttgctggagc cacgttcacc acgatttttc taccatctac atggtgggat gcgaaaattt 6601 aaggtctttc ggttctaaaa ttgttcaaca ggtttatatg tgagcaaatg gctgtgccta 6661 atcagtcaga cagttattgt tcaatttttt ttcatagcat ttacactgta gaacgaggaa 6721 tcaggagttt gagccactga aattgttcaa ctgtttcaaa gtggactcaa acttttaaca 6781 acaacccaac gtgcgatacg cgtagcgtca agcctccggc ttatcgcacc ttttacagaa 6841 atcactgtgg gagttgtaaa aatgcttgct caacgaaatg aaaatgtcgc tgtacaaaaa 6901 tgttacctgg gattagattt tgaaaatgcg gcaatggggt gtcgtctagc cgaggcaatt 6961 atattgcggt acaggttgat gatgaatacg actttctaat ttgatgaatc atttgggtaa 7021 aaccgtacct gaaagtgcga gaaagttaga taaagagcaa actcctcaat ttgatatgac 7081 aggaggagat tttgttattg ccaagctgcg aggtgggaaa ggtttacctg ataagggatg 7141 ggaagaagtt aagccacaag caacacaaaa aattaataaa attgctgaga ctataaacca 7201 gcacgctaaa caaatcaata acattaacga agtcaaggat gctcattttg gtgatgtcat 7261 tcacaaaaac taatttctct agttagaaga agaatgagca aatgagcaat gcctacccct 7321 gatgaactag atgaactaaa agttgttttt gaaaaccttg tgggacgcac tccaagtgat 7381 gaggaaatcg aagctttgaa ggcttgcttt aagggcaacc agtcgataaa ttttcaagtc 7441 gccaagtaca 
tcaataatgt cactgaagcg caaaatagtc agtttggcga tacgcacaag 7501 ctaaagcctc cggcttatcg cttttcccaa aatgcagata cagaagcaat tcaacccgca 7561 aaattaaacc aaaaccagct ttcagttgat gacgttgtgc aaaaggtgcg cttacgtttt 7621 cacgatgaca tccaacgctt acatggtacg atgccacttt tgggcgttga tcattgggta 7681 gatttgggtg agttgtttgt gaatgttaat atcctgcagg aagtcagcag tagtcgcaaa 7741 tcagaactag gtgatctgtg gcaagatttt atcgctagcg ttccagaata ttccagtgat 7801 cgcagtttgg atcgcattgg cttaggaaag catcaacaac gagtatctgg gttaacagtg 7861 ctggcgaaaa ataccaattt gatggtactg gggaagccgg gttccggtaa aacaacttat 7921 ctacagcata ttgtcaccga atgcaatgaa ggaagattgc agccacatcg gattcctgtg 7981 ttgattaagt tgcgggaatt tgtggatgat ggtcgtaaat ttgaatatag tctagaacgt 8041 tatctcactc aacaatggcg attgtctgaa gctgagactg aattagttct gagtcaaggc 8101 aaagcattga ttttgctgga tggtttagat gaggtgacgg gggtagatgg acgggtgata 8161 tccaaacaaa tcaaacaatt tacccgtacc tatccacaga atcagttgat tgtgacttgt 8221 cgaacgcaaa gccaagaatc gcgatttgag cgtttcgact atgtggaagt agcagacttt 8281 gatgaaacgc aggtaaacgc atttgcaaaa cactggttta aggcagtttg ttctgatgca 8341 gaggaaggac aggcaaaggc aaggcaattt ttagacaagc tttatataga agaaaataag 8401 cccattcggg agttagcgat tacgccaatt ctgctaagtt taacctgtgc tgtgtttcag 8461 gtgcagggga agttttactc aaagcgatcg aaactgtatg aggaaggatt agagttacta 8521 ctggagaagt gggacaagtc gcgggaaatt gagcgggatg aaatttatcg agatttatcc 8581 gtagagcgaa agcaggaact tttgagctat ttggcggtga agaagtttga acaaccacaa 8641 tatgtgctgt ttgagcagga ggagattgaa gggtatattg cagagttttt ggagatttcg 8701 cggcgggata gtcgagtggt gttgcgggcg attgaggcgc agcatgggtt attgattgaa 8761 agggcgcaga aggtttggtc gttttcacat ctaacgtttc aagagtattt tgtggcgaag 8821 tggttttgcg agcggagtaa ctggaatgat ttagtgaatc atattataga gatgaactgg 8881 cgagagattt ttttattagt agttatgatg tatgagcctg tagatgtttt attacagtta 8941 atgaaacaaa aagccgatca ccttgtagtt gatgaatcga aaatacagaa atttctcttt 9001 tggatatttc aaacatctca aatagttgat tttccctata aactagaagt tattcgagcg 9061 ttttattttg aactatatgg tggattgaat tccacttata aatttaggat agctagtcac 9121 ctatcttcta cttcagacgg 
tattatctgc ggcgttttct ataacgctca caacattatt 9181 aaaattaaaa caatgcttag agaaacccaa ccaaaacaga agaagaagga aaattatatt 9241 gattatatct ggatatttag ttcagagcat gaaaatctat taaaacagta ttatttggct 9301 aataatatac tagtagattg tctgaaaaat ggttttctta aaagtgatga agttacaaag 9361 gaaattgaga caaatatatt cttgccgatc gccgaaatcg aaaagcgtaa gcgtgaaaaa 9421 tgagtatgcg taatttctac ttaagctatg ctgccatgat tttgttgacg gtaaaacgat 9481 acgtgcttca cacaggctaa cgccaatgcc aatatagaga aaaggagaca aaacaatgaa 9541 cgcaaaaatt agtatccaaa ttgaaaaaac tcaattaggt tattctgcct acagccctga 9601 aattgaaggg agtcaagtac aaggtgattc tgttgaaagt gttgttgatg ctctcaagac 9661 aattctcagc acttacctga aaaaacaaga acatcagata gattctggaa cggataaacc 9721 tatctgggaa atcgctcaac aaatcactca agatatgacc gaagaagaaa ttcgtcaact 9781 gccatcagat ggtgccgagc aacatgatca ttacatctat ggcattccta aacggaaatc 9841 atgaagacgc tatttgcaga tactttctac tgggttgctt taatcaatcc aggagatgat 9901 tggtacaatc gagttcttaa taccagcaac tcactggggc aaatccaaat tgtgactaca 9961 gatgaagtgt tgacagaagt gctcactttc tactcagaag cgggtactcg aatgcggcaa 10021 cgtactgtgg agtttgtgga caatattctt aataatacca aaattcaagt aattgaacag 10081 actcatgctt ctttcctggc tggtttggag ttatatcgta gccgttttga taaaggttat 10141 agcctaactg actgtatctc catgaatact atgcgacaac tgggcatcac agaagtttta 10201 acccatgacc agcactttgc tcaagaaggc tttgtcatct tgttcaggta ggtgattgag 10261 cttgtcctaa gtagccaata acctagacaa gcctcgtcat tcattgagaa gtattcccag 10321 tttcttcgat caactcatcc accaattttc ttaaccgttc cttccagtca gggacggctt 10381 ttagcttttc ttttacacct atccgcacat tgaaacacac gggggacttg tcaaaaggtt 10441 cttcatttaa agaagttgct ccaagtttat tttttttctg gaaaggcatc atttcgagtt 10501 gtatatgtta ctatatcaac atataacaca atttttgagg tgacgcattg cacgtccaac 10561 agttgtcaca cccaagattc tcaaaccatc ggctgtccaa ataaaatgct ctgcccagcg 10621 ttgctgacgt ggattgaata gtggtacgct ttcctgagtt tttggatcaa cagccgttgt 10681 aaaattatag tgacgctcgt tgcagcgatg acaagccaac gctaaattct cgtcatcatc 10741 tgaaccgcct aaagattggg gcaaaatgtg gtcaagtgtg aaacgatctg gacttaggta 10801 ttcgggggaa 
tggcagtact cgcagaggaa tctagctcgt tgtcggacaa atcttcgagt 10861 tgccttggac accattttgc ttgcgctgcc aattgagcat taattaaagt aaaaatccgc 10921 gatagttcag agataccatc tagttctgcc cgttccgctt gagataacgc atcctgctta 10981 tttttgttga gtaattcttc aaaccgagct tgcaactcat gagtaaattt gaagatgtag 11041 acttcaccca actggctcat cctaactcca gaacggataa acgatgatgg ttgtaccatg 11101 agttgagtta tcattgcttt accctacaca aatggttctt tgattgtaac cgaaggaggg 11161 tttgattgct gctcatggta tgtagccttc agatttgagc taagcgatgt ctgacgacaa 11221 gcccttcggg tatgccttcg gcacaccttc ggtgaacgca gtcgcctgcg gagggaaacc 11281 ctcccgcagc gctgtctcac cgctacgcgt ctacgctaca gaagaacctt tgctttgtaa 11341 gttgattcgt gtggtacgtc ctgctaattg aagaaggttt cgccgaaatc gaaaagcgta 11401 accatgaaag atgattaggt gtgatttatg ctacatagca cataatctgt gcaaataaaa 11461 aaaatataat ttagcgctaa ttttccggat tatcactgaa acctgttgta tataaggaag 11521 ctgaaactag tacaacaacc aggagatagt tttgaacgca gttagcaagg tagacgcaat 11581 tctcaacgca tggtttgatt acatagctct agatgattac agtaacgcca agattgaagc 11641 taatagcgat gcaataaaac agcgcggtgt cagcctagta cgcgaccatg tattaattga 11701 gccagatacc ttctcagagc tacgacaaaa agttacccaa gggcaaaaag gtcaacagga 11761 agcagtgtgg gctttgtcct ttccgcaagt tttagatgta gaaaagggaa aatcttatct 11821 ttgtcctttg tttagcttgg atattacacc attgaagcgg gtgtgttgtt aaatgatgca 11881 accacagccc aagtattacg ttcgcagttt gatactttgg tttgtcgaaa aattttgcgt 11941 ccggtatcga tttgtcaaaa aacaggaagc tcgccgcatc gaaacttcga cacggcagac 12001 ttcgcgcgtt aagctagctg cgcctccggc aatcgcctct ccagcttaga cggcaaagca 12061 atgatagtct gtatgagtcg gaaaatttgt gtaaacttct gactttgact gttatctagt 12121 tcatttagca gtaccttttt tgacaaaaag gcaacggatt ctacgacaac aggtttgccc 12181 tctactatgt tgtatagcaa gtttttcagg agctttggga atgacagcag caacagaccc 12241 cgtaaagttg agcaagcagg aagttggcaa agccgctgcc gccctggtac aatcaggttc 12301 tattgttggg ttgggtacgg ggtcaacgac agcatatgca attgagttta taggcgatcg 12361 cctcaaatcc ggcgaactca aagatatcgt tggtgtcccc acttcgtttc aagcagaagt 12421 gctggcgaag cagtacggta ttccactcac caccctagac gcgattgacc acattgatat 
12481 tgcaattgat ggtgcagacg aagttgatcc acaaaaaaat ttgattaaag gtggtggtgc 12541 agcacatacc cgcgaaaagg tagtagatta tttagccagc cgattcattg tcgttgtcga 12601 tggtggtaaa ttagttgata gactgggttc tgtttttcca gttcctgtag aagtcatacc 12661 aatggctgtg actcccgtca ccctggcact taaaaaactt ggtggcaaac cagaactgcg 12721 tatgggtgtg aaaaaagctg gtccagtgat taccgaccaa ggcaacatgg ttatagatgt 12781 caaatttgac actattgatg acccagaaaa tatcgaaaaa atactgaata atattcccgg 12841 cgtgttggaa aatggtatct tcgtcaactg tgctgatgtg gtgttggtag gcgaagtcaa 12901 agatggtcag cctgtcgtca gggaattgta gcagtttaaa gtgagtatag gggtgcacag 12961 gcgcaacgcc ttgcgcccac atagaaaaaa ttaaaaatat acctacagac accctaggca 13021 aagcgtgcaa ctgctgtcca cgagaaaagt tagggatgtg agggaataag ggaataagaa 13081 aataaatgca ttttccttat acccctacat tcctatgaaa cctttattaa actttgttct 13141 aagtgtgtgg ctgtacaaaa gcacaagtca ctttattatt tttgtattta atcttgtcta 13201 taagtattat ttttaacatc cattaaccaa attttttcta gaaaagcaat tcttctacgg 13261 taaacagcaa cacgagactt tttataagta attattccta gtaaaacaaa cagaggaaag 13321 gaaatgatag ataaccctat gagattatta gtatgggcat cagtagaacc cttagagtga 13381 gtatttgagg ggtcagttgt tggctgagtg ctaaagtaat gcggcattaa tacctccaga 13441 gattatagtt ctactaataa agcttcaagg ggaacgtttt atttttcgca caagttgagg 13501 attttccgga tgtatccgga ctcttgttac tcaaacaatg agtatttcta gttctcatag 13561 tcaatatctt cgccaaaatg ccctcacttt ttaagagtgg ggcactacaa acaatagccg 13621 taaggcgtga ccgttccgct tcatttttca tagacaagga tggacaaccc tccacaatgt 13681 ctactcgtaa aacccgagat ctggatgtat ttttgcggtg aatccagcgc cgtgcggggg 13741 ttcccccccg ttgaggcgac tggtgaaccc gaagggcact accaaaaaat ttgaatcaaa 13801 ctagttgatt ggaagccgca aaaatacaag ggttgtccat ccttgtcgag ggtttgccca 13861 acccactagg gctacgtctc tacagtgtat tacaggcaac cgagaagcgc tatatactta 13921 cacggtgaca aaatgctttg accgtatgcg tttacagaca tgctgctttg tatgcctgca 13981 ggtatacaaa gcagcatctt gtggaggaga gatccaaagg gttatcccgt tcagttctct 14041 gggctgttat aagtaagtcg gcacaataaa accaatgtat gttgagtttt gtaaaaactg 14101 tgaaataacc tatttctaag ctgtttgcta attttacatt 
tcgttacata ggtgattttt 14161 taacgccgac ttacttacac ccacctaatt cattttgaat tctgaattct gacttatgac 14221 ttgtgacttc tgttttataa tttcttctcc aatatttgag tttatccata gaaattgatt 14281 ttgtgacaac attcactcca gttcctcgcc aagcaacttc tgaatctgtt gtaaaaacag 14341 cacattctaa ttgatctagc ttaatattcc aatactcact aaatctgctt ttcaaagtta 14401 tgacttgctc aaaaacttta tttctatgct tatttcttct tttcctttct gttgctcctg 14461 ttgcttctgt atcaataaat ttagtttcta ttaagaagat attaccgaat gaactcagat 14521 aaacgaaatc acacttcccc aaatctgtat agtcagcgat tggggacttt tcaaataaca 14581 gtagttcaga gcagaaagga aatatccgtt cgatattcaa gaacaaataa gcttgcaata 14641 gcagttcgtt atcgtaagga aaactaacaa caccctcaaa aaatttttta atctcttcct 14701 gagattgggg ttgaattctt ttacaatgtg taacaaaatt attcacttca tctgcgatca 14761 tgcagttttt gtttcctcct ccttattaac acacaagcct gatgagtaaa aatataccct 14821 gtatgtgaga aaccaaacat ccgcagaagt aactaaaaaa cggttctcct ttcatttact 14881 tacggtaaaa tattgagagt atttttgtaa taattatatc aatttaatca gtgttaatgc 14941 ttagtaagta gttgcatgct gactttttgg tagtcaggtg ttgcgcccat aagttgtaaa 15001 ttgatttgtg aggacagtca acaattaggg gtgtaggtct acaaactaaa ccctacaccc 15061 cctcaaaatt aatgacacag actacaatcg ctctttgatt aaacatcagc aagcgattgt 15121 ggataagcag agaatctggc attttgaaga gcctgaacag ttgttctaga atcttgcacc 15181 caaaattgtc gtccaaaacg gacaagttga cgggaagttc cggcttgtac aatccctatg 15241 acaggaactt ttgctttttc tctctctcct gagtattctt gcaaaattgt tctgaggcga 15301 tgttgttctt caatggtgtc tgcttgttga agattcaatt gcactatcac caactgtact 15361 gtttcgacta gttctacgtc ttcaacaatc aattgatttt ggtcttcgcg tcgatctact 15421 ttcccccaaa caattaacct agagtcaatt tggagcaaag aatgaatctg ctcgtaaatt 15481 ttaggaaaga cgactgcttc taattgtgtc gttaaatctt ctatttgtaa aatcgccatt 15541 ggatcgcctt ttttggtcat cactttctta acaccattga gcatgacaac cgcacaaata 15601 gcactttctt ctttctgttc cccaagctgc gaaaggttaa ttggggctag aacacgtgct 15661 gaattgcgaa tagctttgag tggatgatct gatacataaa atcccaaaag ctccttttcc 15721 atttgcaact ttttctgtgg gggaaaatca ggaacttttt gagctttggg agctgtatca 15781 aagctatttt ggttttgatt 
ggtagtagca gcagtagaaa atggaccgcc taaaaaatca 15841 aacagatttc cctgtccact tgctctgtct ctggcacgag attgtgccca gtcgtatact 15901 agttctaaat cgcgaagtaa ctggttgcga ttggagtcaa tggtgtcaaa cgctccacaa 15961 gaaatcagtg cttctaaact acggcggttg acagtacgca aatcgatgcg atcgcaaaaa 16021 tcagctagag acttaaactc gcctccttcc tctctcgttt ccaaaataca ctttattgcg 16081 ttctctccca cgttacgaac agcggacaat ccaaacaaaa tgttgtcacc tgacggtgta 16141 aaatcttcac cagagcgatt gatgtctggc ggctctatat gaatattcat acttgcgcag 16201 ctggaaatat atttctgcac cttgtctgtg tcatcgctgt tagccgtcaa cagcgccgcc 16261 atgtattcca acggatagtt cgcttttaag tacgcagttt gatatgtgac ataaccatat 16321 gccgttgaat gggatttatt gaaacaatta gaagcaacca agccatttct aataataaag 16381 ttatggtcgc gctctacccc tatgtcataa acattttgcg tgctgagagt ttttcgagtt 16441 acaattttca ccatttttcc agcccctaaa acctataaaa tcaattaatt tctaaaattg 16501 tcaaaagcct gtaacgccca tttttacaga aataaatatt aatttagtcc aattggcatt 16561 taaggagaat acctggacag cccaatcaag tgggcagaag tcgtcaaagt gtactaaaat 16621 caaggacttg agtcgtaata taaagtaatc atggcgtcca tccgcgagtt gcacacacag 16681 ctaattaaga aagaacgctc tgccgttgaa attactcaag aagctctaga ccacattcaa 16741 gcattagagc caaaattgca cagtttttta tgcgtgactg cacaacaggc attagagcaa 16801 gcgcagcaag tggatgctaa aatcgctgct ggggaagaaa tcggcatact cacagggatt 16861 ccgattggga tcaaagataa tctgtgtacc aagggaattc ccaccacctg cgcctctcga 16921 attttagaaa attttgtacc cccatatgaa tcaactgtca cccaaaaact ggcagatgct 16981 ggtgcggtga tggtaggtaa aaccaatttg gatgagtttg ctatgggagg ttccacagaa 17041 aactctgctt tcaaactcac agctaatcct tgggatttgt cgcgggttcc aggcggttct 17101 tctggtggtt cagcagcagc agtcgcagca ggagaatgtg ttgtctctct gggttctgat 17161 actggtggtt cgattagaca accagcctct ttttgcggtg ttgtgggaat gaagccaact 17221 tatggactcg tttctcgcta tggcttggtg gctttcgctt cctctttgga tcaaattgga 17281 ccatttgcac gcacagtgga agatgcagcg atattattga aggcgatcgc cggttacgat 17341 cccaaagact ccaccagcct aaaagtcgaa attcctgact acactgctaa cttaaaacca 17401 gagttcccta aaggaaagct caaaattggt gtcatcaaag aaacttttgg tgaaggtttg 17461 
gatccacaag tcgaagaagc tgtcaacaag gcgattaaac aattaaaaga gttgggagcg 17521 gaaattcaag aaatttcttg tcctcggttt cgctatggct tacccagcta ttacattatt 17581 gctccatctg aagcatcggc aaacctagct cgctacgatg gcgttaagta tggctaccgc 17641 gctcctgatg cagataactt gatttcaatg tatgctcgca cccgtgcgac tggtttcggc 17701 gcagaagtta aacgccggat tatgattgga acttacaccc tttcggctgg ttattacgat 17761 gcatattatc ttaaggctca aaaaattcgc accctgatta agcaagactt tgaaaaagct 17821 tttgaaaaag ttgatgtttt agtttgtccg acagctccca ccactgcatt taaagcaggg 17881 gaaaaaactt cagatccatt aagtatgtac ttaattgact tgatgactat tcccgtcaat 17941 cttgctggtt taccgggtat aagcgtccca tgcggttttg ataacaatgg gttaccaatt 18001 ggtttacagc tgatcagtaa tgtgctgcga gaagaccaac tgtttcaagt cgcttatgct 18061 tatgagcaat caacgacttg gaaagagcga tcgccgcaac tatagtcatt tatatctttt 18121 atattagtag tgatgagtcg tgactcatta ctattgacta ttagctgttt ctggagtgaa 18181 tgtcacttgc ttgatactac ccacacttcc taatggtgtt ggttgcttct tgttcgtaga 18241 aatcaatacc gcaccagcat taccagaacg aaccatcacc tctttttttg ctgtccaggt 18301 tttcttatct cccttcttta attccccttc aaactcagtt ttgccatcta ctttcactcg 18361 tacccatgat ttgtcctgca attcaagagt aacttcgact ggagttgttg tcgcctcttg 18421 cggtgatgga gttggggtta tagtagttag tgcagatgat ggagttgggg ttatagtagt 18481 tagtgcaggt gataaagcct tggttgttga tgaagtgaca ggtggacgag tttttggttg 18541 ggaggatgtt aatgatgaag cttgagttgc agattccgtg ttttgttcag ccgcaagcgg 18601 tgacaagtct ttttgactag aagattgaac tgaacgttgt ggattgagaa gataaaacaa 18661 tccaaaggaa gcgacagcta ataacaggac gtaaggtaca aaaagaggta tatatatggt 18721 tggtttttca tctaccactt gatgatcatt atttgattct tcaggagtca aacaaatatt 18781 tgcaacagtt tttgctaaag cagcaccgtc aagtcctata gcatttgcat agtgacggat 18841 aaacccttgg acataaattg gctcgggcaa ttcttcaaat cgcccttctt ccatagcttg 18901 taagatggcg ggttgaatac gtgtgtaagc tgcgagttct tttagaccta ccgatctttc 18961 ctctcgtact tgccgtagat gtgcaactat ttctaccagt tgctcctgtt gatctttctt 19021 taacagcttc actctcttct ccttgttctt tgttaatgtt ttctcattca atcacaagct 19081 acttaatttt atgtacattt tttgactcat ctgtatttta aatcattttt 
gataatggat 19141 tcatcacttt ctcttgatag agttctttgg taccgctaaa aatattgtat ttataaatac 19201 ttacatgaaa ataaataagc tgcaagattc agtgcaacta atatttccat tcttaattta 19261 tgtaggagtc gttgtagcaa ttagctattt tgctgtatgt atatttcttt ttctcaagca 19321 aacgcgattt attttttttc cctctgctgt tattgacaca acaccagagt tttataactt 19381 accttatgaa gatgtttggt tgcctgtgtc agctaaatca ggtaaggtag aacagattca 19441 tggttggtgg atagaagcaa accaaccaaa tggcaaagtt ttgctctacc tgcatggtaa 19501 tagtgtcaac attggtgcaa atgttactcg tgcacattgg tttcatcaac tagggttttc 19561 aatgttacta attgattacc gtggttatgg tcgtagtgaa ggtcgctttc ccaatgaatc 19621 acaggtttat caagatgcag ctacagcttg ggactaccta gtctaccagc aacagattcc 19681 acccagcaaa atttttattt acggacactc cctgggaggt gcagtagcta ttgatttggc 19741 tctcaaacaa ccaaacgctg ctggcttagt tgtagaaagc tcttttacct cgatccggaa 19801 agttcttgct tacaggaaca attttaaaat gtttccagtc gatataatcc tgagacagca 19861 ttttgattcg attaggaagg tgccaaactt aaaaatccca gttttattta ttcatggtac 19921 tgatgatgtc atcgtacctg ctttgatgag ccaagaattg tatgctgctg ctcctgaacc 19981 aaaaaaattg attctcgttc ctggtgcagc acataataac gtagcacaag tcgctccttc 20041 tgtatattta gaagctgttc gctcttttat tattcttgct gagtccagaa ttcttagctc 20101 ttgagtacaa actttagcaa aaagaagaac tcagaattta tgagtgggga attaagatcc 20161 acactcatag aagaccacgc ctattttaaa atttggtgtt cttgagtttt accactctac 20221 ctcgcacaca gttgtacaga attacttttt gattcttcct tgttcaacca gtgaacagct 20281 acctacgcag tgaacacttc gacaagcccc ttcggctacg ctcagggcga gtcagtgcat 20341 cgcagtgaac aaggggtgga cgagtccgtt tcctgcctgg taactggtaa ctgataactg 20401 gtaactgata actgttaaaa aacaattcct gcaaatttta gcatgacctc tacaatttta 20461 aaaatgaggt aaaatatcat gccaaaattc aactgtaaaa atttatttga ttattggtat 20521 aaaacccata aataaacacg acattataga gttcgcataa ttgttttttt tgattttttt 20581 tgatttttct tatattgaga aaaaaaactc agtaacaatc cggaaaaatc aggacactgt 20641 agaagagatt atacagatag aacttccctt tgtttaattg ccgcaagagg ctccctacaa 20701 tggctactac atcatatttg ttaaaaattc tcgacaatcc aactattgaa atttctcaag 20761 atgaattgcg gacgctttta ggtgaaatag 
aagctgaact gcatcagagt aaagtctatc 20821 gccgtgctgt agtcatattg caaaaattac taggttcttc aactgagcaa gccaatgttt 20881 tgtttaaagc agtaggtaga gaagcgattg gtttagcatt tcggcaattt gtgcagcagt 20941 atcaaaaagt taaagagaag ccgcaagaag acaccacgat tgaaacatca actattgaaa 21001 agacaaattc caccgatgaa tcatgtaatg attcatcaca aaatttcaga agtgtggttg 21061 accaaaaaac gattatcaag actgatgaga aagctgattt accatcagaa tctcaagtag 21121 aaagtcacaa ggcaaatgat tccgcaaaaa ccaaacaaaa ccgtcccaaa acgaaaacga 21181 aaattgggtg gcgaggtttt ggtaaaaaac ggaaacaagc tgaactagcc ttgcaaatgg 21241 cagttgagca gcgagtagaa actcggcgtc aaatcagtca acaactacgg caggctcgtg 21301 aatcccaagg actttctttg agccaacttc acgcttacac tcacgtaccc ctgcatcata 21361 tggaggcact ggaaaagggt gattgggagt tattaccaga ggatgtgtat gttcgtggtt 21421 tcgtccgtgt catggctaac gctcttggat taaatggcac agatttagct gcttccttgc 21481 ctgcgccaga gccagtaaaa gcaattttac ctgccaagta tgagcataaa agtaatttcg 21541 gattaggaat agcactgcat ccggtacatt tttacctagg ctatgcagcc cttgtcgctg 21601 gatccgtcgg aggactatca atcatggcgt cccaacaagc aagtgcagac aaaatgatca 21661 ataaagatgc agcaactcca ccctcttcat cattcactaa gtcatcgcag gaaacaaaac 21721 caatttctaa gccagggctg tctaatcgtg ttagtatcac cgttggacct gatattgccc 21781 cacccgaagc tctttaaaag tcatgagtga tgagtgaaca atactgtaga gactgtacag 21841 tacagtctct actatctgat tcaaacaaga actactacat ataagattcc tatctgagtt 21901 tatattaagc gttaagagtt tcctattccc tgttccctgt tccctgttcc ctgttcccta 21961 ttcccttttt cggtaaggga acctagattc tctgaaatca gacttgtaat tataagagtc 22021 atgaggcgca agatttgaga tttgttacat tcgcagatca gaaatccgga aagttcagaa 22081 aacttgatgt atattgatta cgattaataa aagacagtaa taacacttat caaagacagc 22141 aacaatatta tgatttgctg cctaaatccc gaatgctctt atcccctaaa ttcacaagga 22201 acaaagttct gtgagaattg cggggcagag ttgattgagt tactcagggg tcgttaccgc 22261 attatcaaac cattaggagg cggcggattt gctcgcacct accttgctga ggatgcagat 22321 aagcttgatg agaaatgtgt tgtcaagcaa ctagcaccac aagttcaagg aagctggtca 22381 cgccaaaagg cgatggagtt gtttcagcaa gaagcaaaac gtttacaaca tttaggggaa 22441 catcgccaaa 
ttcctaccct ctatgcctat tttaaggaac gtaactacct ttatttggtg 22501 cagcagttta tcgaagggga cgatctgttg caggaattaa aacacaaggg tgtgtttgac 22561 gaagcaaaga ttcaagaatt cttgcaagat ttattacctg ttctgatagc ggtacatcag 22621 cagcaggtga ttcaccgaga tattaagcca gaaaacatcc ttcgccgtga aagtgatggt 22681 aagttggtgc tgcttgattt tggagtgtcg aagcgaaaga ctggaacagt gaacccgaaa 22741 ccaggaacga gtattggttc ttttggttat gcaccctatg agcaaatgta ctcgggtgaa 22801 gcttatcctg cgagtgatct ctatagcttg ggagcaacga cttttcattt gttgactggt 22861 gtttcaccgt gggagatatg gatgaaacaa ggctatagct ggacttctac ctggcggcag 22921 tatttaacag aaccaatcac tgaggaatta gggctgatta ttgataaatt attgcaagag 22981 gattataatc aacgttatca gacagcagag gctgttttac aagattcgtt ttttgtgttg 23041 tccccacact ctcccctaca gcatacaata ctctcgcctg ttcaaccaga aacactgacg 23101 caagaaattc aacagcaacc accacaatat caagggcagt tctctgtaga gaaattattg 23161 ccttgggcaa ttatgacagg ttcagggagt tcgtttcttt tgatcgccct tctcagttct 23221 gtcggaacta tgtggattag ttctagttta tggctgttcg ttttcgtagg attcatcttt 23281 gtccagcctt attcagtttt tgaaaaaact tatttattta tcgttaccgg aatcacaaca 23341 ttatttgttg tatttatcta caagaatttt tacattgtta atcttctcaa agcaggaata 23401 gatggattct tagttttaat cttgctagcc attcttgctg gattactaac ttttactctc 23461 ttaactgttt ctcaaatctt gaacagcttt atctcgaaat acttctaatt ttagagataa 23521 tttttggttg gctgaggtcg tcatcttata tgaaaaatca agaaaatatt cgcctcctta 23581 tttccttagg actggctgga ttactaatag caggtatttt atggttaatg ggtagagtca 23641 ttcgtccaga taaagagtat tctgggcaat ttcaaccaaa tagtgcttca atttctccat 23701 cttatatcaa ctcaccttta aaaaaccgga tgagcttggg tgagaaagtt tttgtcaggg 23761 aagaaagacc ccctgagaaa gatgctggaa gcaaggcatt tgaagctggt gactttagca 23821 ctgcggttag taaattccaa ttatccttgc aagctaaacg taatgatcct gagacattaa 23881 tttatttaaa taatgccaaa atagggacat cgaaatccct aaaggttgct gtcattgtac 23941 caattggtat ttctttaaat gaagcagaag agactttacg cggagtggct caagctcaag 24001 atgaagtgaa tagcagtgga ggaattaatg gtctaccgtt gcaacttgag attattagta 24061 ttgataactt tgatgtgatg aaagaactca gcacggaatt ggttaaggat accagtatcg 
24121 tcgctgttgt agggtttagc cgcgacccat caatttataa taaaggtggt ttggtaatgg 24181 tttcaacagt caatccaaaa aagccatctc aaccaacaaa atacgttttt tatgcgactc 24241 caaagtttga tgtttttagt gatgcgatcg ccagttatat cattcaaaaa actcgcctca 24301 ccaacatcgc tatttgtagt gactctacat ttatagttaa tcaggaaaaa gtcagtcagg 24361 aaattgtaga acaatatact gattccatca aaaaatatgg aggtaaagtt accagtacag 24421 cttgcgattt gagcgctcca gattttcaac ctagtgcttt tctcagtcaa gccatcagtg 24481 atggagcaga aggtttgata ctcattccta gaccagataa actgaactta gctattgatg 24541 tggcacgaga aaataaaggt agactaccac ttttcagttt ccaaggaatg tacactgaaa 24601 gaactttaaa gtatgggcaa gcagatgtca aaggaatggt gcttggagtc tcttggcata 24661 atgatgctct tggaaacaaa tcttttgctc aaaaagccgt tgggctgtgg ggtggagaag 24721 tgagtccacg aactgcgaca gcatatgatg cacttcaaac aattatcact ggcttgaaag 24781 aaggtaacac ccgccaggag ttgcaaaaag ctttatctaa cccaaaattt tcagcgccag 24841 gagcaacggg aaagattcag ttttcgcaaa caggcgatcg caagggcgga gtttttctag 24901 tcaaagtgga accttgcaat ccaagtcagt cttgcaattc tagcactcgc taccatttta 24961 gacttcttga ataagcgtcg tttcgctgag agcgcttttc atcagcgaag cggctaagag 25021 ttagattccc acttcccaat tcctcttcaa tttttaacaa tttgagattg cctaatcttc 25081 catgagtagc tagaaatcaa attggtcaga atatgagcaa caattggtac caacaagtta 25141 ccactaagga gagcactaga acctaataat attcccacaa tgcttgccca aataacataa 25201 ggccattgtt gggaaccact caggtgtaag acaccaaagc aaatacttga gataatcaca 25261 gccgcatagc tatatccaaa ggctggtaac atcacacctc taaataacaa ttcttcactt 25321 aatccaggta gcaaccccag ccatatcaag tctggcaaag ccaaaggctt tagcaccatt 25381 tctaagtaaa aatcagcgct tttacgatag ggaagccata ggcgataagt aatggtactt 25441 aaggcggtga tgcctaaacc taacaacaca ccccaaagca actctgtttg atcccaacgc 25501 caacgcatga gggcaaagtt accaaattgt agccacactt tggcgactat ccataaaaga 25561 actgcagtca ctcccatcgc caccagcact tgagtgcgtg tcaaataggg gatttctggc 25621 tcttgctttt gctgttgtgc cacgataatc tatgagggga caagaggaca aggagaaaaa 25681 ataacttttt aactatttat tagagattgt aaaggcgata tattgcggct cagagcagat 25741 gggttaatgc ctgctttgag ggcggctaat acacctactg 
cctctaaata tgtgttgacc 25801 tgtattgatt ttatacctaa aggttggggt cgaacctcta cctgagtttg attttcctcc 25861 acagtaataa tttggcatcg cctgctctga cttaaactca gtaaagcgct gttgccacaa 25921 gcagttgctg gtacaattgc tgcatcaact tggtttgccc aaatatcacc aaattcggac 25981 aattcgtatg acgtctcagt tataaactgg ggtgcgcggc tcaagccaac aagcacgcat 26041 ggtaaaaaag tataacctaa ttcttcagca gcagaacggg gagataaatt gggatctaca 26101 ggtgaaggga aaaaagcagg tgaatgggca caaggaactt gaaaagttcg gacaatcaaa 26161 tggctgatga cagcttctgc gccagcaatg ggatcgactc ctttacctaa gcgatagttt 26221 tggtctgctt gttcatccag agtatcagga aaacgggcaa caactgcgat cgcctccgcc 26281 cccgcttttt taattaaaac ttccgccgcc cgtaataaac tatctgggtt gcctatcgtt 26341 ccccaactta tccctgatgc tgattcacgt aattctacgt ttaatggtgc atcagttata 26401 acgtaatctg tcatggataa tcccagagtt gctctggctg catctgctgc ctgtaagtgt 26461 cgtagccgta attctggttc tatcgcttgg tcaagaagca aacctatgcg attttgatga 26521 actggacgta aaccccagca cccagcggca aatttgtcaa gtccgtaacc ttcaacgtag 26581 aaagtgttgg gcaggttcca gtacagactg gcaccattga ggacattggg gtgggtgatg 26641 aggcgatcgc aaacctgtga agcagcttta gcaactggta atgcatctcc tgcataacct 26701 cctattgcag ccccaatgcc tgtgggaata attaagatag cggtgtattg acgcacgaat 26761 aaagtcaaaa ctgaaaagaa attgctattt acttgtgata cagcaatctt agataagtcg 26821 tgaataaaaa aacgtagacg cggagcggct tcccgcaggg taccgcaaag gcgcaaagga 26881 cgcaaagaaa agaaagagaa gaagaggagg atcttcatat cctatttagg aaggctgtag 26941 ttatttagtt atttctcagt aggcaggtct catttaatct taatatcaga ttttatcttc 27001 aatccctctc cttaataagg agagggaagc cggaggcagg gtgaggttta ccaccttact 27061 tgaaataacg agtgagggag tgttgctatt agttattatc cattagcctt ctgtagtaac 27121 aacagcttca acagtgcatg attgttgttt cacgtccacg gatgtcactg cccaacgcaa 27181 aggttctccc tgttttttca actctacttc aattgctttt tggagttctg tgggagtttc 27241 ctgtagatct atttctgcgg taataaaatg ggttgtcatt ttttcactcc ttatacgatt 27301 tctttgtaag gcaaccaaat tttcttgatt tcttctttct tgatttcttc tttcttgatt 27361 tcttctttct tgatttcttt gtgtcctacc ctgcgggaag ccactgagtc ccaaggggac 27421 acgccgcgcg ttcgcccttg 
gcgtgcgctt tgcgcttacg tgtctttgcg ccctttgtgg 27481 tttgttcctc atagacttgg gcgcattttc agacaaaatt ggtattactt aattatataa 27541 caccaatact gatgtacaga caaggtgtct caaacatctc tacttaccaa attgcaactg 27601 ataaacaatt tcttcctttt cggtttcaat ttttaaatca gaacgagggt aagcaacgca 27661 aagcaataca taaccttttt tttgcagttc tgggcttaca cccataccat cactttgatc 27721 cacgcttccg gatagaattt gggcagcaca agtcgtacaa acaccagcat gacaggaact 27781 tggtaaatcc aatttagccg catcagccac tgatagaatg gtttcatttt cgggaacttg 27841 caaagtatga gttttgcctt ggtggataat ttcaacggtg tatgttttgg acatatacaa 27901 tagcagtgct gtgcaacgga caagttaaat atcttatctt agaaaaagat ggacaactca 27961 ataaagaatc gccagcaata aacttaactg gaaagagctt gactcgctag ggcgttatcc 28021 agtgataatt ataagtcctt gctagacgaa aaacttcacc tcgggaaatt tctcacctgt 28081 tcatctagct tgttattgac aattagcaaa aatcacccat aacaaatggg aagtcactgt 28141 aaacactgaa tcgaattttg gatcgaatat ttggagatca gaatgctgga atcataccgc 28201 agacacgtta ccgaaagagc cgcacaagga attccccccc tacctttaga tgcaaatcaa 28261 acctccgaac tgtgcgaact actaaaaaat ccccctgcgg gtgaagaaga gacattactg 28321 ctgttgttgc gcgattgcat tcctcctgga gtcgatccag ccgcatatgt caaagctggt 28381 ttcctcaccg ccatagccaa agaagaaatc actagccccc tcatttcacc catcgaagct 28441 gtacaattac tagggacgat gataggtggc tacaatgtcc aatcattaat cgaactttta 28501 caaactccta gcgtatcctt atcatcatct tctgaaactc ctttggtgat gggtgggcaa 28561 ggaaaagaac cgatagcagc atacgccgcc accgccttaa gcaaaaccct gttggtgtat 28621 gatgcctttc acgatatttt ggaattatcc aaaaccaatc ctttcgctaa acgagtgata 28681 gactcttggg cccaaaccga gtggtttacc gttcgtccag ttatcccaga attcatcaac 28741 gtcatcgttt tcaaagttcc gggtgagaca aacaccgacg acttatcccc tgcgccccaa 28801 gctatgactc gcccagatat tcctttacac gcattatcaa tgctggaaag tagaatgcct 28861 ggggcgttgc aaactattgc ccagttaaag acaaaagggt accctgtcgc ctacgtggga 28921 gatgtcgttg gtacaggttc ctcccgcaag tcagcaatca actctgtatt atggcacatt 28981 ggggatgata ttccttttgt accgaataaa cgagcaggtg gatatatttt aggcggaagt 29041 atagccccca tctttttcaa cactgctgaa gatgctggtg ctttccccat ccagtgcgat 29101 
gtttccaaga tggaaaccgg gatggtgata actatatatc cctataaagg aagtatcacc 29161 aatgaagcag gcgaagtgat ttctaccttc accctcaaac ctgacactat tcttgatgaa 29221 gttcgcgcag gtggacgtat tcccttactg attggacgta ctctcaccga caaaacaaga 29281 cttgcacttg gtttagaacc cagcacagta tttacccgtc cccagcaacc tgctgacaca 29341 ggtaaaggct acagcttagc acagaaaatg gtgggcaaag cttgcggctt atcaggtgtt 29401 cgtcccggta cctactgtga accgatgatg acgactgttg gttcccagga taccacagga 29461 ccaatgaccc gcgatgagtt aaaagaactc gcttgtctgg gtttctctgc tgagttggtg 29521 atgcaaagtt tttgccatac agcagcatat cccaaacctg ttgatattaa aactcatcaa 29581 gaactacctg agtttttctt ttctcgtggc ggtgtcgcat tgcgccccgg tgatggaatc 29641 atccactcct ggttaaaccg gatgctactt cccgacactg tgggaactgg tggcgactcc 29701 cacacccgtt ttcccttggg tatttctttt cccgccggtt ctgggttagt ggcgtttgct 29761 ggggcgttgg gtgtgatgcc cttggatatg ccagaatctg ttttagtacg tttcaaaggt 29821 gagttgcaac ctggcgtcac tttgcgggat gttgtgaatg ctattcccta tgtagcaatt 29881 caaaaaggtt tactgacagt agagaagaag aacaagaaaa atgtctttgc tgggcggatt 29941 ttggaaatag aaggcttacc agatttgaaa gttgagcaag cttttgaact caccgacgct 30001 agcgccgaac gttcttgtgc aggttgtact atcaagctca gtactgagac agtttctgaa 30061 tatttgcgtt ccaacataac gctgttgaag aacatggtag cacggggcta tcacgatgag 30121 cgtactatta tgcgtcgcgt tgccaagatg gaagaatggt tagcaaatcc agtgcttttg 30181 gaagcggatg cagacgcaga atacgcggaa gttcttgaaa ttgatttaaa cgaaattcaa 30241 gaacctattg tcgctgctcc caatgacccc gataatgtca aattattatc ggaagttgcc 30301 aatgatccag tgcaagaagt ttttgttggt tcttgcatga caaatatcgg tcattatcgc 30361 gcaactggta aagtcttaga aggtgcaggt tctgtgaaga ctcggttatg gatttgtccg 30421 ccgactcgca tggatgaaca ccaacttaaa gaagaaggtg tgtatgacat tttcaatgct 30481 gcgggtgcgc ggacagaaat gccaggatgt agcttgtgta tgggtaatca ggcgcgggtt 30541 gctgatggtg tgacggtgtt ttctacttcc acccgcaact tcaataaccg catggggcaa 30601 gatgcgcgag tttatctcgg ttcagcggaa ttagcagctg tttgtgcact gttaggaaga 30661 attcccacgg tgcaagaata tcttgagatt gtggcgaata agattcatcc ttttgctgat 30721 aatttgtatc ggtatttgaa ctttgatcaa attgctggtt ttgaggatga 
aggtagagtg 30781 attccgttgg aaaaaatgcc caagattgaa gatattttgg gtatgccgac aggggcgggt 30841 agcaagtaga aaaatgtcca tatatgccca taagctcaag cttggggcaa catgaacaaa 30901 gcccgcgtag acgggctttg tttatgtagg gtgggctttg cttaaactgt caggtgggct 30961 tcggaagaca agcgattaag ctgttgtgta ttcaaaatgc acaacagcaa ggcagatgac 31021 agatggcaga tggcaagagg gtttcaatga ttgactagcg ggacttaaac attttttagg 31081 tcagaaacaa gcctcaaacc tttgtagggc aatgaaaata caccaaatgt aaaagtggct 31141 aaaatccaga tctattaaca aaccaagcaa aacgatgtca aagtctttta ggatatacgt 31201 ttgaggctat tttatctgtg ttttttgtcg tctaagtccc agtatgagcg cttgctcaac 31261 ctcaaacgga gatacaatcc agatgatgtc ttttgttcga cgattggaca tcttgcatta 31321 tagcacaaca attaatcaat ctgaagatgg aacagttcaa gcgattcacc ggaatgaccc 31381 caaagcaagt tagctaacac tgtaagaatc tgacaaatcg ttataagaat ctgaaagcag 31441 tcaagcgttg gtactaacta cacttagaac aacgtgaatg tgatcgggct gctttaatcg 31501 aatttcgcta tctcagatat tgcacccgtc tcaattacgg aattagcaaa gaaacggttc 31561 atgacacaac ataattcgct cttggagttt agtgacacaa aagaataaaa ttcagacttg 31621 ggtagccgcc gcgcgatcca tcgtacagtg ctggatgtgt tgcaggaagt actctttgaa 31681 ttactgcacg atagctacat gcattactca aatggcttgt atctataaat gtttcttaga 31741 gaattggtat gacaatgtta atttattcac tttacatttc ttgagaaaca ggctctaggt 31801 gcataagtta atttcactaa cccctagtaa ggatgaggag acacatgaag gattttggtt 31861 ttagccttgg cgtgagcgaa gcctgctgta atcttctaac ttcatcgtgt atcacgtggc 31921 ttgagtctaa acttcagttt ctaaagtgcc attaccatta aaaaacggag ttagtatcac 31981 atgaatcata ctgatttaca atcacaactg ttagaccaat tgccttctgg gcaaactgta 32041 aactttgaat atggtatttt agaagcccaa acgcgactag ttgttcaaca gtgcaccaac 32101 gaaattaaga cgctaatgcg ccgtaattct caagatatta tcgatattgg acagaagcta 32161 attgaggtaa agcagcatct gggacatgga agcttcagat agtatagaaa atcaagttgc 32221 tatttcccac gagacatcgg atgcagcgat cgcttccatg gcaattagta tcaaaaactt 32281 aacaccaaag caactagcta ggatgattat agaggctgcc aataacggat tgagcgagtc 32341 tgagctgtca gccatagtca tggcatctca acaggtactc aacactcagc aacaagacga 32401 atactcagac tgatgggtat tgtgccgcac 
tttgactact taacgttgtc aagctgggta 32461 caaaacatat aaccaagaat ttttggcgag tggaaatgtt tgtattgggc agtagattcc 32521 aaagtgcaaa ttaaaagcgc aacagcttag tactcatgag tctaaaatcc aacagtcaca 32581 atcaattttg attatcaaat ccagacaaat attacttgct tctttattat ttttcgttgg 32641 tgcttattga gaaacgaatc agcttgtttg aaaacctgca tgaatagcaa tgtaatcttt 32701 gtttcactgt aattagtatt atttgaatca ctgtaaaaaa taatttcagt aataacgcta 32761 ggtgacattt tcgggtgtcg aatctaagtt aaataattct gtaatttcta acccgcaact 32821 tgggttattc taaaaaagaa gttctttata atacttcata tatgaattcc cagaattgga 32881 gacgtaagcg tggcgttgta ctaactccta agggtttaga aaaatttcag gagacaaaac 32941 gcaagtcgga aacagaggaa aattttggca acaggtatac ctttgaagaa ataagcgccc 33001 ggtgtgggtt gtataccggt acaatctcca aggtactaaa tcgcgaagga ggagttgata 33061 aaagaagtat tgaggagctt tttaaagctt ttcagataaa actagataag agtgactatt 33121 taagctctaa tacgcgcata gattgggggg aagctatttc tacatcggtt ttttatggac 33181 gagcagaaga acttgccctg ttagagcaat ggattctcaa tgagcgctgc cgattggtga 33241 cattattggg aatgggaggc attggtaaga cggctctgtc tgtaaagctt gctcaacaga 33301 ttcaggagaa ctttgagtac gttatctggc gttcactacg ggaagccccg tcgataaaag 33361 ctatcctggc taatctaatc cagtttttat ccgagcagca ggaaacacca gttaatttac 33421 cagaaagctt aagcgagagg gtatcccggc tacttgatta tctacgaagt catcggtgtc 33481 tgctgatact cgataatctg gagtcgattt tacgcagtgg tatccgagtg ggacaatacc 33541 tagagggata tgaggagtat ggtgagttca taagactcgt aggagaagca actcaccaaa 33601 gttgcttagt gctaactagt cgggaaaaac ctaaagaagt ggcatcaatg gaaggacaag 33661 cattacctgt tcgctcatta caactgagtg gtttgcaggt agtggacggg tgggaaatcg 33721 ttaaaatcaa ggggctatct gcagcacaag atgaatgggc accaatgata cagcgctatg 33781 caggcaatcc attagccttg aagatagttg ctaccacaat tcaagatgtc tttggtggta 33841 atattactga atttttgcaa caagaaacaa ctgtttttgg agagatccgt gatattttag 33901 accagcaatt tgagcgcttg tctaatttag aaaaaaacat aatgtactgg ctagctatta 33961 accgtgagcc gattgcgctt tcacaattgc aagaagacat ggtctcatca gtaccacagg 34021 taaggttact ggaaagtttg gaatctctga tacggcgaac gctagttgag aaaagtgcaa 34081 cactcgtcac 
tctacaacct gtagtcatgg agtatgtcac tcagcgattg atagagcacg 34141 tttgtgagga gattgtcact caaaatcttg atttttttag gagtcatgcc ttaatgaagg 34201 caacggcgaa agattacatt agagaagttc aaattcgcct catcctccaa cctgttacac 34261 atgagttgct gatgattttt agaagtaaaa aaagcttgga aaatcagttg cagaaaattt 34321 tagcaatgct gcgagaaaaa tatctgctag aacaaagtta caccgctgga aatattctta 34381 atctactttg tcatctacag attgatctga gcaactatga tttttctgat ctgactgttt 34441 ggcaaacaga cttgcgaaat gtgaaattac atgatgtcaa ttttcaaaat gcaaatttag 34501 ctaagtcttt gtttgctgaa acctttggtg gtattttgtc ggtagccttt agccctgaag 34561 gcaaagtttt ggctatgggt gatactaatg gtgatattcg cttgtaccaa gttgcggatg 34621 gtctaccact cctcacctgt aagggacatg ctaactgggt tttatcactt gcctttagtc 34681 ctgatgggac aattcttgcc agcggaagta gtgacaatac tgtcaagcta tggagtgttg 34741 gtacaggtca atgtcttcaa actttgcagg gacacaatca tgaggtttgg tcagttgctt 34801 ttagtccgga tggtgaggta ttcgccagtg gtagtgatga ccaaacgata aagctatgga 34861 gtgttcgcac tggtgaatgc ctcaaaacgt ttcaaggaca tgccaattgg gtactctcta 34921 ttgcctttag tccggatggt cagacactgc tgagtggtag tgaagaccaa acagtcaaat 34981 tgtgggatat aaatactgag gaatgcctca aaacgttcca gggacatcat gatggagtac 35041 ggtcaatagc tgtcagtcct gacggtcaga tgttggtcag tggcagtgat gaccagacga 35101 taaagctatg gagtatccgc accggtaaat gtctcagaac attccaagga cataccaatc 35161 ctgtatatgc agtcgccttt agcccacaag gtgatacctt agctagtggc agtcacgacc 35221 agacggtgag gctatgggat gtcaccactg gtgaatgtct gagagttttt cagggacatt 35281 ctaactgggt attttcagtc accttcgaca ctgaaggtga gatgttggct agtggcagct 35341 gggatcagac ggtgaggtta tggaacgtta gtaacggtga atgcctcaga actttccaag 35401 ggcatgccaa tcaggtactc tcagtctcct ttgattcgga cggtcagagg ctggtgagtg 35461 gcagtaacga ccagacggta aggttgtggg atgtcaccac tggtgatatt ttgaagactc 35521 tctacggaca caccaattgg gtatactcag tcgcttttag tccacaaggc aataccctgg 35581 ttagtggtag tgcagacaaa acggtgaagc tgtggaatgt tagcacaggt caagtcatga 35641 aaactctcca gggacatggt gctgcagttc ggtcagttgc cttcagtcct ggtggtcaga 35701 tggtggtcag tggtagtgag gactacacaa tgaagttgtg ggaactcagt acaggtcaag 
35761 ccatgagaac ttgcttggga catgaagctg cgatctggtc agttgccttc agtcctcgag 35821 ggacaatgat cgcaactgcc tcttgggatc acacaatcaa gttgtgggat cccaacacag 35881 gtgagtgcct caggactttg gtcgggcata agagttgggt ttggtcagtt gcctttagtt 35941 cggatggtca gatactagcg agtgttagtc cggatcaaac gttaaggtta tggagtgtca 36001 gcaccggtga atgcctgaga attttgcaat tacattcgag ttggctacaa tctattgcct 36061 ttagtccgga taaccggacg atcgctacta gcactcatga acacacggtc aaattgtggg 36121 atatcgacac taatcaagct ttgagaagtt tgcaaggaca tacagcttct agcagttgtg 36181 gagtgcagcg gttaagatgg taagaaccag caagtcgcca tacccaacgc ctcgggctta 36241 atgtaggtgg ttcgttactt ggcgaaacag cgcgatgagc agatgcgaca gccccctcta 36301 agagcctgcg ggttgtctct acactacctg cgctgaccat ctgttgggca aaaggatcgt 36361 cgctccttgc tggatcaagt tgatgtaacc aagtgcgatt ggcagtcgcc actactacct 36421 tgtcaggtgt aatacgcgcc cattgaatgt cagttaaaga gttatgactc atagctactt 36481 ggggttgcta tagaatttta gagagttacg cgctttgcaa ctatgtctcg actcaactta 36541 tttggattgg atggtcatta gggatacaaa gataggctgt aaagccattc gctcatacta 36601 tgccaacggc gagccttgta gaagccgttg ggatctgatc attgcactct agggtagtag 36661 aatagaagaa cagtaagcgt caggtaaaaa cgctccttct ggtcatcttg tatgaactag 36721 tttcctgagt aggcaattgt gtgcgatgtg ataccaaaaa cgaaggcgac aattaccttg 36781 cttgacgcag gcaaaagctg cttttcgctt agcgaaaaaa tctagaaact tgatagtgac 36841 ttgaaaaacc gtgagaagct tcttcctaga aaattggaag tagcttatgt tgtcatggaa 36901 aattttgttt tttcacaacc gtcacttcaa ttgtatcaca ctctaaattt ttatcaaggt 36961 catctcgata aaaaatatca attccaaaca aaacttatat tttgttttca aatctaacta 37021 caaagtaaca aaacatcaaa cttgagacaa gcaaagatcc gggagttaac caggaagtta 37081 agggaacaga gtcaataccg ttcggttaag acatctctcc gtttcggcag caagcgtaga 37141 aaaaagtgct cacgccctgg aagcgttaat tttgctgcct ttttaatggt agtgttttat 37201 tgaatttgca tatattttaa atagttgtgt acttactaat tccagccgca ggaagcggtc 37261 gccgaatggg gagtagtaga aataaactac tgctaacctt gttagaccaa cctttaattg 37321 cttggacact tttggccgct gatgcttctg agtccattga ttggataggc ataattcttc 37381 agccacaaga ccgggctgat ttacaaacaa tcgtgtcgaa 
cctatccctg agtaagccag 37441 tgcatttcat tcaaggagga gcaacccgtc aagattcggt atacaacggg ttgcagtctc 37501 taccccccat ggcaaaacac gtgttgattc atgatggagc gagatgtttg gcaacaccaa 37561 atttgtttga ccgatgtgct gaggcaattc tccattgtca aggtctgatt gccgctgttc 37621 ctgtcaaaga caccataaaa gttgtagatc aaaagacaca tctgattacc agtacgccag 37681 atcgaagtca actctgggcg gctcaaaccc ctcaaggatt tgaagtagag cgactacagc 37741 agtgtcatgc tgaaggtcgt cgccaaggtt gggaagtcac agatgatgct gccttgtttg 37801 aacaatgcca gctgcctgtg cagattgtgg aaggggagga gacaaatttg aaagtgacga 37861 cgcctgtgga tttagcgatc gctgaattca tcctccggca acgattggca gaacagtcaa 37921 ggcaacctct tgacaaaaag ttctacgaga ggttacctct agccgatagc taaagttacg 37981 gtgcccaaat ccccaacttt tagttggggg tatggctcaa tagcaagact gttgcatagg 38041 cagcaattcc ctcttcccgt ccgacaggac cgagtttttc atttgtggtc gccttaatgc 38101 tgacttgatc cggttccaat tctagaacac ttgagagttt ggcacgcatt gtctgaatat 38161 ggggcttgac tttaggtcgc tctgccacca caacagagtc aatattacta atttgccacc 38221 ctcgatcctg tactagttga tgcacattag tcagcaaaac tacactatca gcacctgccc 38281 actgagagtc cgtgggtgga aagtaatgac caatatctcc caaacttaaa gctcccaaca 38341 tcgcatccat aatcgcgtga gtcagcacat cagcatcact gtgaccgagc aaccccaatt 38401 catggtgaat ctccacccct cctaaaatca aggggcgatt tacaactaaa cggtggatat 38461 cgtaaccatt gccgatccta atgttcatgg gtagcgacct tttttattga ttcgggttta 38521 cctcctaatt gtccggctta tatcccacac ctaaacgctt cgcgttataa gtggggactt 38581 agccgactga gttaaaaatt tcacactgtg atcgtgtaac gccccaaggt tttactcacg 38641 gctacaactt cctgtcttgc ccattggtct gctgcaatga tatcatcaag agttggcgag 38701 gagtagttgt ggacttgata gcgatcgcac gcccgttcaa tcaaacgtgg aatgtccaga 38761 aactgaattt cttctgataa aaacaaagct acagcttgct cattcgcagc gttcagtact 38821 gctggcatgc aacccccagt tcgacccgca gcataagcta gttgcataca cgggtacttt 38881 tggtggtttg gggcacgaaa agtcagctcc cctaccgtga ccaaatctaa ctgccgccaa 38941 gcagtgtaaa ttcgctctgg ccaagacaaa gcatacagca ggggcaaacg catatcgggc 39001 caccccagtt gcgctagcat tgaagtgtct tgcagttcaa tcaaagaatg gataatgctt 39061 tggggatgta tgacaatatc 
aatatggttg taatccaaac caaaaagcca gtgagcctca 39121 atgacctcca atcctttatt catcagagtg gcggaatcaa ttgtaatttt gcgacccatt 39181 gaccagttgg gatgcttcaa ggcatcagca acagtcactg ttgataactt atcaacagac 39241 caatcccgga acgctccccc agaagcagtc agcattagac gccgcagtcc cccgtttggc 39301 actccctgca aacactgaaa aatcgctgaa tgctctgaat ctgcaggcaa aagcttcaca 39361 ccatgctgct gcaccaaagg caaaaccact ggtcctcctg caatcagagt ttccttgttt 39421 gccaaagcta tgtctttgcc tgccttaatt gctgcaattg taggtagtag acctgcacaa 39481 ccaacaatac cagtcaccac aacttcagct tcgcagtaac gagccacttc aatgactccg 39541 gcttctcctg ctaaaagaat tggttgcgga tctaagtcgg cgatcgcctc ttttaactcc 39601 ggcaactttt gtgcgtcaca gatagccaca attgcaggtt ggaactcacg aatctgttga 39661 gccagcaact ctacattccg tagagctgcc aaccccacaa tccgaaactc gttggggtac 39721 tggctaacaa tatcgagagt ttgggtacca atagaacctg ttgagccaag gagagatatt 39781 gttttcatga tagtaaaatt ccaatttttg aattttgtgt ctcctgcaca ggagcttttg 39841 aattgcgtgt agtgtgcccg tagggcatat ttttaatttt gtaatgtgcc tgccctaagg 39901 gcatatgttt gagccgcgac ggtcaggtca atgttttttt gcggttgcag ttcaatagca 39961 gcattggttg aaccaccagc cagtcctgct ccaacgggaa tccgcttgtg aattgtaata 40021 tcgacacctc catacttggc aaaggcgtca gggaatttct ctgccatgat tgccaccgct 40081 ttataagcaa ggttacgatg gtcttgtggt acttctggat ggtcgcaata caggcggatg 40141 gcatcggttc cgagtgggcg caaatcaatt tgatctgcta ggtcgatgct ttggagtacc 40201 attaacaaat catggtagcc atctgggcga tcgcgaataa tctccaaata caaattaatt 40261 tttgctgaag caatcaagga gtaagagcgc atatgttcgc ctcaaaggtt gtagttgttg 40321 ttgtgttatg agtgagtgga gaggttgtgc cgcccatagg tagcccatag accgcacaaa 40381 cctttttgga aaatgggatt atgtgatgct ttcttccagt tagggacgtt cgactagttt 40441 aaaacacagt tcatagccgc tttgatggca aaaatcttct caatgacatc agaaaccact 40501 ttatctggtg tcgaagctcc agaagtaatt cctacgacaa ttggaccatt aggtaaccaa 40561 tcttcaagca cttctatgtc gtctcgatgt aattttttgt gttggatgcg gttaccaggt 40621 cccaagcgct ctgcactgtc aatatggtaa gagggaatcc cgcgatcgct tgagatttca 40681 tacaagtgag ttgtgttaga agagttaaaa ccgccaacga ctaccatcaa atccagcttg 40741 
tcttctacaa ccttaaacat tgcatcttga cgttcttggg tagcatcaca gattgtgttg 40801 aagctcaaga aatggtcatt gatttgggca ggaccgtact tacgcatgag cgtacgctca 40861 aataacttac caatctgctc ggtttctccc ttgagcatag tggtttggtt ggcaataccc 40921 acccattcta aatcgatgtc tggatcgaat ccaggagaat aagctttggc aaataacctc 40981 aagaattctt cacggtctcc accatgaagg atatagttgc agacatactc tgcttgttgc 41041 aggtttagga caacgagata tttatctgcg taggaactgg ttgcaattgt ttcttcgtgg 41101 ttgtacttgc catgaataat tgaagtgtag ttacgttttt tatgcttctc aacgctattc 41161 cacactttgg atacccaagg gcaagtcgta tccataattt tgcagccttt ttggtttaac 41221 agctgcattt cctgaatact agcaccaaaa gcaggtaaaa tcacaacatc ttccgtagca 41281 acaacagaaa aatccttctt tccctgttcg ttgactggaa taaattcaat ctgcatttcc 41341 ctcaagtttt gattgaccga ggggttgtga atgatttcgt tggtgatcca aatacgttcg 41401 gtgggaaatt gctggcgtgt ctcgtaagcc agagcaatcg ctcgttctac tccccagcaa 41461 aatccaaacg cctctgccag cttaattgtg acatcaccct tttgcaatgt gtagttgttc 41521 tcccgtatct gctgaattaa gctgctttga tactcggagt acagcatctt ggtgacttct 41581 gcttcatgcc caaatccctt gcgatagtaa cgttcggaac tatggagcga tcgcttaaat 41641 gcttttatgt tcatttgccg ttcttcattt cacaactaca atagccaagg ctatttcact 41701 ttgactccag ctacaggaat taaaatttcg tgggctagct ggataattca tagtgccagc 41761 cctggttttt ctgtcacatt actcgacttt atcaatccat ctttggtaga attctctgaa 41821 cttcaagact cccaggtcgt gcttgacgtc agccactcca aagttttgat ttgtgttgtc 41881 atagattggc atatcttcct tcacaccaag tttgttttgc aaactaaaaa ccacgtaatg 41941 aaatatgtcc agccagaatt tgccagtttt ctttaccatt accaatatgt gctggattga 42001 ttttttttca gcaatcggag tcattcctac taccaactta tacacctcct taccgtctac 42061 aaatgccgta accctctgtc ctgcaggcca actatctagc agtaccgtaa agttttcagc 42121 attaaaccca agtgtccggg ctactggatc taaatcatgg attgggaatt caacttttgc 42181 tgcaaacctg gcttcttttg gaatcggtgg ttcgctcact tctgctgtgt actggtcagt 42241 aagcagagta aattttatag gttcagaaat cttcagatcg tgtactgtaa taatatgata 42301 ttggtcaaaa ccgttttcaa tcacatgtcg tactgttgtt ttggttaggt ctgcaaagcg 42361 gaaaggcata tagttatgtc tgtcgtcttc agcagctgga aactcaggca 
aagggaacat 42421 aggagtctca gagccgtacc atacccaaat gtaaccatat ctttcggctg tgacatagtt 42481 agcctgacga gcctttgggg gaattttgtc tacttcaggt atgaaaacac actgcccgga 42541 gctatcaaag cgccagtgat ggaaaggaca ttgaatacaa ccgtctacta ccttaccaat 42601 cgccagacta gtgcccttgt gtgaacagta acgctgcatg attgctgggt gaccattttg 42661 atcccgccaa gcaactagtg gctgaccaaa caactctatt tcttttggct ttttacccaa 42721 agctttggag ggcatggcaa cgtaccagct tgcaggaaga ttcatcgccc ttttattaac 42781 ttccctttcc ggttgagtta aggtcaaatt tttgactgtc tcaagtttgg tgctcattta 42841 ttcctccttt gatatttaat cttatgcaca actttgttga aactaatgag ttttagctgt 42901 gacgtaatca gtgatcgcca tttctaggga tagccttctc tcaaatcgca atttcttcca 42961 gggtagtcgt gttaaaagag ttgaagccgc caatcacaac tatcaaatcc agcttcagtt 43021 cacattcaac ttagtttctt gaatttactg agtgaggcca agtatacttt tgaagttgag 43081 gaagtcttaa ttcagattca tgaggctaca ctttctcttc taactacgac ccataagcag 43141 gataatgaca cgcagccttt catctcgacc ctttggattc caaaaaggag aggaggaggg 43201 ggagcaatgc tctgctttgt cctgcctctg ctacattgat tattattatg attgagcttg 43261 gaaaaagaat ttggtttagg gcttcaagcg agcactgctg gtacaaagag cgatacgcgt 43321 agcgtcaagc cagaggctta tcgcacccac taactacttt tgaccacacc cgttaagcaa 43381 gattatagtg catccacaca ttaggctcta tgtaagttcc ttgattgttt gtaaagtcaa 43441 ttgttagagg ggataatgct tcgggaataa catagtcacc tgtttctgga aaacgcatat 43501 taactctggc ttcccaatta ctaatagtat cgataatggt cgcggattta gagcaaatac 43561 caactagatt agctccatgc ttgacatgag tacgtaacca aggatcgaag ggaagaccgt 43621 tttcattctg ccaagtgatg tagcgctcca taggcgtgag aggataaaga tgttttaggg 43681 ttggacgcgc aggaacaata agcgaactaa gcttattaac ctgagcgatt ttcttcatct 43741 gttggagcat aaaactgctt atttgcctac cgcgatattc gggtagaacg ccaatgttta 43801 cagcgcataa aatattaggt tgatgacctt gtgagtggtt gtcggtaagc atctgagttg 43861 ccgcccaagt agcaccttcg tcgggcaact gttcaaaggt tcctttccat gctatgggaa 43921 tacaatttcc aagagcgatc atctgctgcg tagctgattc tactagcgca tactggtact 43981 ctggataaac ctgatataaa gcctgatata aaacaagcca attttcgttg gtatcaatgt 44041 catacgtgaa tattcgctgc aatccttgtt 
ttaaaaaagc tttacttgtt tcataacagt 44101 tcggcatttc tgctgcagtt actattttgg tggtcattga ttcggtgatc attgattcct 44161 ccttcgatac taaatcctag acaaaacttt tttgaaacta gtggttcgcc acattaattc 44221 tgatgggtaa aacgatgttc agactcatct tcttttcctc ttaattttgt gacggacact 44281 ctccaagggc gggaggaacc tccgaacgga ggtgtcctcc tctttgcgtt tttgcggttt 44341 tttaatcacc cctcaaaatc aatgtggtag accactagtg agttcgatct gtaatgtagt 44401 cagcgatcgc catcagaggg atagctttct ctccaaaggg agctagttca gccttagctt 44461 cctcaattaa ttttttcgct tggtttttag attcctgaat gcccaacaaa ctggggtaag 44521 ttgctttttg cgcctgtaaa tctttgcctg cgcttttacc tagttgctct gcagttgcgg 44581 taatgtcaag cacatcatca ataatctgaa aggcaagacc aatattgcaa gcataacgag 44641 aaagccgttg caaaatctct tcatcagctc ccgtcaaaag ccctcctgtg acaacagagg 44701 tttccaacaa agccccggtt ttgtgggcat gaataaagtt gagggtttct aaactaatat 44761 ttttcaatcc ttccgattcg agatcaacaa cttgaccacc tactaaaccc gttgcagcaa 44821 ccgcatgact taacttagca acgacctgca acaaatgctc tgctggaact ccttttgtct 44881 gcaccacaat aaactcaaag gcgtaggcta aaagtccatc accagctaga atggcaatat 44941 cttcaccata aaccttatgg tttgtcagtt tgcctcgccg atagtcatcg ttatccatag 45001 caggtaagtc atcatggatc agagacatgg tgtgaaccat ttccagcgca caagcagtag 45061 gcatcgccat atcaactgtg cctccagata attcacagct ggcaagacac agaattgggc 45121 gcaggcgttt tcccccagcc agcagggaat agcgcatagc ctcatagatt ttttctggat 45181 agaccacagt aattgagcgc tcaagcgcct cttcaactaa gccttgtcgt tgggataaat 45241 aggttgacag gttaaaagta gcttcttctg gtgttctgcc agcttcaacc agttgtaatg 45301 atttcttttt agtattagct acctcgattt cacgatgaaa tacttggctt tggttgtgtt 45361 tagaattctg aaatgaccgt tcaaaaactt tagtattcat ttgatctccg caagattatg 45421 acactacatc tcattgaatt agccgtttcg ttgggagcat cccaattttg caagaagacg 45481 aaataaagga aaaatgattc gcgagtt // LOCUS NODE_509_length_45009_cov_5.10960145009 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 45009)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 45009)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage       :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..45009
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(54..634) /locus_tag="DP116_03055" /pseudo CDS complement(54..634) /locus_tag="DP116_03055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877043.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="magnesium transporter" gene 1252..2379 /locus_tag="DP116_03060" CDS 1252..2379 /locus_tag="DP116_03060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746826.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_03060" /translation="MHQRLTDLYQNLSTVPVIPDVLPQALVELGSASEIVHVATEELH QQNEELLRTRNLLERERQRYLDLFNFAPDAYLVTDTLGIIQEVNHTAATLLNVSQQSM QGKPIVNLIAFGERQIFRSYLTQLWASNKVKELIVHIQKRNGELFDAAITVGVIRTSA HEPIGLRWLVRDITECNREELTLLKNDSDLTQNRLWHKCSKGDFIPLKPGIIWYVCQG SVKLSTLCETGEELIVGLAGIGMVFGSNLTVLQAYQATAQSDVELVSIDYAEIAASPT LSHTLLPKINQRLRQTESFLLIASRRRVQDRFHQLLLLLKQEMGQSVPQGTRLSIRLT HEELASACCTTRVTITRLIGKLQQERKIGFDSRHHIILKDI" gene complement(2681..4225) /locus_tag="DP116_03065" CDS complement(2681..4225) /locus_tag="DP116_03065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318831.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FAD-containing oxidoreductase" /protein_id="PRJNA477356:DP116_03065" /translation="MSELQRVTIRPVDEYNQTLVSYVHPLDWINPQPADCYDLVVIGA GTAGLVAAGGAALVGGGLKVALIERHLMGGDCLNFGCVPSKCLIRSSRVVGEIWNAKA YGIRTSKYVDVDFSSVMERMRRVRSGISHNDSAKRFQNMGVDVFLGNAEFSSSDTVKV DDQTLRFKKAVIATGTRPIEPSIPGLEEAGYLTNETVFSLIQKPQSLVVIGGGPIGCE LAQAFRRLGCEVILFHKGSHILNKEDIDAAEIVQKVFVQEGIRLVLNCQIQNVEKTQD GKTIYFTCNGKQDVVTVDEILVGTGRAPNVEGLNLDVVGVEYDQRQGVKVNDYLQTTN PKIYAAGDICMRWKFTHAADAAARIVIRNALFSPFGLGRQKLSSLIMPWVTYTDPEIA HVGMYEHEAQQMGVDVATIKIPFTDVDRAVADGEEEGFVKIHHRKGSDKILGATIVAR HAGEMISQITTAIVGNIGLNKLSQVIHPYPTQAEAIRKAADAYYLSTAFTPNTKRLLA WVKKFS" gene 4555..5073 /locus_tag="DP116_03070" CDS 4555..5073 /locus_tag="DP116_03070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310860.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_03070" /translation="MKRKDKARLQSIFSVLLLFFFTGCWLLWNPTAVLAQDYTVNYTF ADLQHQDFSNKDLHGTSFAGGNMQAANFRGANLSGTILTKGSFLKADLSGANLAETFA DRVIFSEANLTNAILTDAIFSSSHFFDAVITGADFSNSIVDPYEVKLMCKRADGVNPV TGVSTRDSLGCR" gene 5123..5245 /locus_tag="DP116_03075" /pseudo CDS 5123..5245 /locus_tag="DP116_03075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318835.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="acireductone synthase" gene 5345..6580 /locus_tag="DP116_03080" CDS 5345..6580 /locus_tag="DP116_03080" /EC_number="2.6.1.83" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458476.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LL-diaminopimelate aminotransferase" /protein_id="PRJNA477356:DP116_03080" /translation="MATINDNYLKLKAGYLFPEIGRRVNAFAQANPDAKIIKLGIGDV TEPLPEACRTAMIKAVEEMGDRASFKGYGPEQGYTWLLEKIAAQDFQARGCEVDASEI FTSDGSKCDTGNILDIFGNNNIIAVTDPVYPVYVDTNVMAGNTGSANDKGEFEGLVYL PITAQNNFTAQIPSQKVDLIYLCFPNNPTGAVATKEHLKAWVDYARANSSIIFFDAAY EAYITDAELPHSIYEIEGARECAIEFRSFSKNAGFTGTRCAFTVVPKNLTAKAADGSD VELWKLWNRRQSTKFNGVSYIVQRGAEAVYSQEGQAQIQELISFYLENAKIIREKLTA AGLAVYGGVNAPYVWVQTPNGLSSWDFFDKLLHTCNVVGTPGSGFGAAGEGYFRISAF NSRENVEEAMKRITDRFTS" gene 6624..6854 /locus_tag="DP116_03085" CDS 6624..6854 /locus_tag="DP116_03085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015195985.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03085" /translation="MIVGYARVSTNAQANGEALDQQIARLKAAGAEDIFVDVESGRSV KRIISYQLSVVSCSLFPVHCVGSCFKKLLALH" gene complement(7112..7546) /locus_tag="DP116_03090" /pseudo CDS complement(7112..7546) /locus_tag="DP116_03090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314084.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS5/IS1182 family transposase" gene complement(8108..9112) /locus_tag="DP116_03095" CDS complement(8108..9112) /locus_tag="DP116_03095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017364727.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03095" /translation="MVARGKENANAPYEIGLWGDLPYSEAQEVGVTRLIDDMNSQKLS FTVHDGDLKAGSNSVCDNALYAKARGYFNSLEAPAAFTPGDNDWTDCDRPSNGGFNSL ERLDYERQLFFSTNFSLGQRRLAQEVQTAPLCLGVNGPVPCVENRRWTVGKVTYATLN IQGSCNNLCDTAPDPEEYAARNAANIAWLKETFRVAKEGNSAAVMLISQADPGWDQSD PTRAPLRDPKTLVQTDGQPDGFKDFLLALRDEVIAFGKPVAYVHGDSHYYRIDKPFLD AQGRRLENFTRVETFGDNQENGTNDVQWLKVTVDSRSREVFSYQPQIVPGNRVAVPAQ " gene 9831..10070 /locus_tag="DP116_03100" CDS 9831..10070 /locus_tag="DP116_03100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740165.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03100" /translation="MEKQPEIHNFLCQKYKYRGKFTPQNLVFNANLQEFATHVSYICN LQTLGKLSTEDAYKQINELWQRLERSYLELEIDAD" gene 10790..12169 /locus_tag="DP116_03105" CDS 10790..12169 /locus_tag="DP116_03105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016430515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkaline phosphatase" /protein_id="PRJNA477356:DP116_03105" /translation="MFLLKKRRTVTLTATVAATVTAVGFAFASRADYLSLEQFIDNGR AKNVILFIGDGMGDSEITIARNYSVGAAGRLALDTLPLTGEYTTYALQESNPKLPDYV TDSAASGTAWATGSKTSNGRISTTASTDSDLKTILELAQERGFVTGNVSTAELTDATP AVLVSHVSNRNCQGPKDMTSCPQDKKSAGGPGSIAEQSVDHGVDVLLGGGKQRYDQII DGGRFAGKTVIESAQAQGYQVVTDASGLQSAQPGTKLLGLFNSGNMSLEWAGKPAVVY PGNEPQRCREGLRPSNEPSLADMTSKAIELLESKQGGQRRFSQKTGFFLQVEGASIDK QDHAGNPCEQIGENVAFDQAIKVALDYAKTHRDTLVVVTADHGHTSQIIPNPTQTAHS PGKYSTLITADGAQMTVNYATNLSDQSQEHTGTQVRIAAQGPQASKVVGVIDQTDLFH ILARAIGAE" gene complement(12486..14231) /locus_tag="DP116_03110" CDS complement(12486..14231) /locus_tag="DP116_03110" /inference="COORDINATES: protein motif:HMM:PF01663.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03110" /translation="MNNKLQHSQNKSTKPTQKHHWNRRRFFTGVLIGTTVLAMSVFGH SLTRPGMTHADSTLVQKVTTTLPGQARLILVFILDGLRPDLINPQDTPNLYRLRNEGV DYVNGHAVFPTVTRVNATAIGTGYYPGTNGIVSNSMYVPEVNPTRAFSTGEYSDILKL DEVSGGRVVFVKTLGERLQENGMKLAAVSSGSSGSALLLNPRAVNGIGSVINGYFNPG KVVAFPGDVNDAILSRFGAAPPKEGDVDNQYNEAVDWTEQVLREYVLPEQKPDVVLNW LTEPDNTQHNTGAGSPESTNTIRNDDRNIGLVIEQLKALGLEDRTDIFVVSDHGFSLE TFGVNVTQELIRAGLKAGPDSDDVVIASSGQAVLLHVKNRSPQRIKEIVEFLHKQDWI GVVFTAGKNSSSNLSNRKPKSVDGSIPGTFSLELIHEFNQERGPDILFTFPWTSDKNA FGVQGTDFTDTSGTTGPRTGNASGHGSMSPWNVRNTFFAWGVDFKRGVKVQVPASNVD LTPTILALKGINTSEAFDGRVLLEGLKGGPDSEQVRVKTRVLTTKTNQGRYKAAIQIS EVGNQRYIDKSWRLP" gene 15286..15747 /locus_tag="DP116_03115" CDS 15286..15747 /locus_tag="DP116_03115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03115" /translation="MTNTVIITAASEGIGKATALEFARHEYNVVLAARQSDRLEAAAS QVRAIGRDALAICVDVTDPKQVDALVEKAIAHFGTIDVLINLVDKAIHNPILEKPEDV AVAIWKAVKYQRSDMLVGSARLSKVAYQVFPGLMQSLYQRVLGMRARHYGQ" gene complement(15995..16192) /locus_tag="DP116_03120" CDS complement(15995..16192) /locus_tag="DP116_03120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196892.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03120" /translation="MNDSSFVIFIVFGTIWILMATAGVIAILKMDGQEIRFGKTGLMI AMPIIISIIIALTYAAVKSTF" assembly_gap 16681..16690 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 16980..17183 /locus_tag="DP116_03125" CDS 16980..17183 /locus_tag="DP116_03125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015176414.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03125" /translation="MSQIDYAAMSNQQLKQYMLEHRDDEAALKAYLDRRHQRSTVIIT TVNDPDFDAKVQAAIRQQMSDSG" gene 17374..18117 /locus_tag="DP116_03130" CDS 17374..18117 /locus_tag="DP116_03130" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03130" /translation="MVSEVTKLKRDIVKSVKSATELGQDVAQSAIERGKAAQQETRQF FGDKLEQNQESLRKKLDENQTAKEAAARAHIEQIDLDAVEKFVTLLLTKFPNATPEQM TQRLLRRQLFRVSRTSVVMSVVPSKMAESVGVDYVEIALIQAEIIFQIAVAYGFELQV PECKNEAFAILDRVLRANRLTRIGLSATQMIPVAGGFISTGTDTYLVYQIGNTAQQFY KSLTEEEVPGEILENFIEETQRRYKQRLW" gene complement(18239..21358) /locus_tag="DP116_03135" CDS complement(18239..21358) /locus_tag="DP116_03135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130846.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA primase" /protein_id="PRJNA477356:DP116_03135" /translation="MHLYYLESQHLEELVQGSGIDLNLARLNFMSLKGRTTYDYLLIS EYLPRTNTGMVKSGWLQRYAHIAEGGWWCSGLDSLNHWQEMEWGCFKPNQPRLNENAK LIKYEHPPSTPTRVFCLRVSLDIWQQVAQRYNVPMPDSINITTNGEAEGFWQWVMERN IPIIICEGVKKAAALLTQGYAALAIPGITSGYRVIKDDFGKVISRQLIPDLAAFANTG RIFYICFDFETEPKKIAAVTNAISQLGFLFQEKNCPVRVIKLPGIEKGIDDFLVAKGA SAFETVYRQSVDLEVYLAQTKPHTELTIPPALTLNRRYLGEIPFPSSGLVAVNSAKGT GKTTTLQTVVNQAKSRNKPVLLITHRIQLGRFLCEKIGIQWGMGKKDKETRRQGDKGE KLLPCYPVSLSPYLPIAPSPHSFGLCVDSIWKLNPEDWRGAIIILDEVEQSLWHLLNS NTCKDKRVKILKIFQQLISTVLISGGLVIAQDADLSDVSLEYLQELAGIQIIPWVVVN QWKPQQGWDVTFYDSPNPTPLIHQLELDLITGNKCYVTTDSRSGRYSCETIERYLQER LERLQKQFPKTLVVSSHTTNTPGHEAVDFVAAINQKVTEYNTVFVTPSLGTGISIDVQ HFDRVYGIFQGVIPDSEARQALARVRDDVPRIVWCAKRGIGLIGSGSTNYRSLSYWYQ ENQKENIALLSPLHKVDVDLPLVYDPIHLRTWAKLSARVNASITLYRKSMKDGLISDG HQIHVRGNDVQKNIIRDLRLAFIATDQSDITTRKRLILEIFKVQKDWAKSRKKSKDID GKIREIKQRNQLASATTVANAKDIDYVEYEQLLVKHSLTEEERNQTKKYLLRQKYGIE VSPLLKLRDDKGYYHQLLIHYYLTHESEYLCLRDKQEWHQQLTWGEGKVFLPDLKTYT LKVEALRALGMLQYLDPKRKFTENDADLLLLRNIAFQCSKHIQRAIGINLLQEKEYIS PIKILKQLLSLLGLKLKRVNNAVYQIDPETLYDGRQQIFAVWHQRDELMLANFKSIGF EIVDIAVLV" gene complement(21503..22882) /locus_tag="DP116_03140" CDS complement(21503..22882) /locus_tag="DP116_03140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318794.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03140" /translation="MPKWRKYQTFLHQFPKLNFAVKVNHLTVRVITLGVLVAVLITKY VSELVKPSTQQSLHVPTATSPLPVIQVAAKSSSKQNSKSLTPTVSTATSPLPVIQVVE SSSKQNSKSLTPAVPVPDWAISRRLDATRIAKSLSIPSRQPEKQNSEVVYNLKKPPKF KDSQELQAIVNDVVDLAADENLPKEALSVTLINAKTGETAGYQQDIPRYPASVVKMFW MVVLYAQIERGFWQNEKDFAPYLAKMIQESDNEAASFIIDQVTGTRSESELNSEKFQL WKKKRQQLNRFFHQAGYKNLNIIQKTFPIDYLNLQEPEGSESQLLHQPVGNWNKITTK HAARLLYEMCYAEHAVSLQASRKMCGWLKRDLNPKVWQPPDSYDFNPVRTFFGESLPD TRVRLYSKAGGTSASRSEAAMVVTEDKKATYILAVFAPDSAYADDVEIFPKMSALVYK RMSSRSSRE" gene complement(23092..24132) /locus_tag="DP116_03145" CDS complement(23092..24132) /locus_tag="DP116_03145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315637.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribose-phosphate pyrophosphokinase" /protein_id="PRJNA477356:DP116_03145" /translation="MNAHRGSAVLSSATLKVQPTITGLAENTRLRLLSGSANVPLSQE VARYLGMDLGPMIRKRFADGELYIQIQESIRGCDVYLIQPGCNPVNDNLMELLIIIDA CRRASARQVTAVIPYYGYARADRKTAGRESITAKLVANLITEAGANRILAMDLHSAQI QGYFDIPLDHVYGSPVLLDYLASKELPDLVVVSPDVGGVARARAFAKKLNDAPLAIID KRRQAHNVAEVLNVIGDVKGKTAVLVDDMIDTAGTIAAGAKLLREEGARQVYACATHA VFSPPAIERLSSGLFEEVIVTNTIPIPESDASGKSLRERFPQLVVLSVANLLGETIWR IHEDSSVSSMFR" gene 24869..26644 /locus_tag="DP116_03150" CDS 24869..26644 /locus_tag="DP116_03150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128217.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_03150" /translation="MQAPITTDTILQNRYRITQILGQGGFGRTYLAQDQRRFDELCAI KELIPIVTEGYAWEKAKELFQREAATLYQIQHPQVPQFRERFEEDQRLFLVQDYVAGK TYQTLLDERKLTASAFTEQEVLQLMRSLLPVLEYIHNQGIIHRDISPDNIILRESDRL PVLIDFGVVKELATKLNSPDNTTPATTVGKHGYAPTEQMQTGRTYPSSDLYALAVTAI VLLTGKEPQDLFDEHQLTWNWQHLVTVDPHFASVLNRMLSQKLGDRYSNAPEVLQALQ TPQQPNIYTSNVSKVQTVAVGRRPEQAQASAPNTSNPAIPTQDSPSLLDNPLAVFAIT LAVIVVTGFGSWTLVRSIRTQPQATAEKTNPQTFPSPVIPNRTTITPTATNNEPVVYN KPLNFGKSNTANVDSVIKANEIDQYIFLGEKGQQLTVLLTLRRSVLLSVLDANQQPID NTAKEFSFYQGTLPFTGKYTIQVRPVPGKAQSNYSFSLGLENPLQPRSTPTPTATPTP IATPTPTATPTPIATPTPTATPTPTATPTPTDTPTPTATPTPTATPTPTEQPSTSGSV EPQDTPPASESPNSVPSETAFPSRL" gene 26761..27456 /locus_tag="DP116_03155" CDS 26761..27456 /locus_tag="DP116_03155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315635.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent dethiobiotin synthetase BioD" /protein_id="PRJNA477356:DP116_03155" /translation="MTDKLLKTLLITGTDTEVGKTVLTTALAAYWQKYHSSRSMGIMK LMQSGEGDREWYQKVFSLNQSPEELTPLYFQTPVAPPIAAARENKTIDLAKVWQSFTA LRSRRDFLFLEALGGLGSPVTDELTVADLAGEWRLPTVLVVPIRLGAIGQAVANVALA RQSRVNLKGIVLNCVQPRTNTEIADWTPIELMKSLTQIPICGCLPYLDNLTDLSKMAQ VASDLDLERLLYI" gene 27555..28772 /locus_tag="DP116_03160" CDS 27555..28772 /locus_tag="DP116_03160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315634.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amidohydrolase" /protein_id="PRJNA477356:DP116_03160" /translation="MVSTFPNSSSVDLSRVRLSIRSLQAQLVEWRRQLHQKPELGFKE KLTSQFVSEKLQKWGIEHQTGIAQTGIVATIRGNNTSTQKVLAIRADMDALPIQELNE VPYRSQHDGVMHACGHDGHTAIALGTAYYLHQNRETFAGTVKIIFQPAEEGPGGAKPM IEAGVLKNPDVDAIIGLHLWNNLPLGTLGVRAGALMAAVETFKCTIMGKGGHGAMPHQ TVDSVVVAAQIVNALQTIVARNVNPIDSAVVTVGELHAGTKNNVIADTARMSGTVRYF NPAYEGFFKQRLEQIIAGICQSHGASYDYNCWSLYPPVINDATIADLVRSVAEEVVET PLGVVPECQTMGGEDMSYFLQQVPGCYFFLGSANPEKGLAYPHHHPRFDFDETILAMG VEMFVRCVEKFCN" gene complement(29208..30722) /locus_tag="DP116_03165" CDS complement(29208..30722) /locus_tag="DP116_03165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743552.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_03165" /translation="MEPISLETATPLAPKIPEIAISEPTVVPTSTPRPPGLEKNAIRI SLRASTVDSVFAGIYTSIVTGILLSNFLVELNASPVIFGMLSSIPMLVNLIQPLGAYL SERTTSRFRYSLLTNVTARLVWLFLVIGIAALSFGYINSQQLIVLTLVIVLFSNMSQG LGNASWLSWMAVLVPRQLRGRYFGLRNSVTSLTNLICIPIAGIAVSKWPGGTLQAYGV LVLFAILSGLISIGCQYFKVDVNPQNQQTSILGSFVKNAISKDTENSSHPTVTQAETT KNDTQFAPDSSAIKSILNNFNFLMFLLYEGFQMFACNLSAPYFILYMLDTMHLDVSLV TLYGSVQAGANLVMLILWGKLSDKIGNRPILILVGVLLALIPLFWLGIDINVFSLWLW LPLLHMLIGGVWAAINLCNNNMQLGIAPVKQRSIYFAMAAAVSGASGALGTTIGGLIA QNPSLGGLPAVFVVSFVFRLGAIIPLFFVQEPQRSSLTQVIQTLWIFRKKVVEN" gene 30881..32218 /locus_tag="DP116_03170" CDS 30881..32218 /locus_tag="DP116_03170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321611.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_03170" /translation="MQTNVSKFLRLLVAFALWAAVFGSIALLVIGCQNLPLASVSSAP TALTIKLGGWTASPAEQKLLKELLQDFEAKHPNIKVRHEVINDQYMDVIKTRLVGDAA PDVFYLEALEAPFFMSQNVLEPLDAYITPDFDLADFEETLLKSFKYAKHIYGFPKDYS TLALFYNKKAFAAAGLSTPPTTWNELRTYSKKLTVDNNRDGRIDQYGFGEIPELARQA YKIKAFGGQLVDQNGYAAFASDASLLGLQLAIDQYQKDRSSAQKSDVGTNSGSEMFGQ GKVAMVIEGNWAIPYLTETFPNLEFATAELPTINDNKKTMVFTVAYVMNKQTQHKAEA WELISYLTGKQGMTKWTKTGFALPSRKSVAQKLGYDQDPLRTALVAGVNYAIPWQVGE YPAAIVNNFDNQFVSALLGQQPLQQAMQQAQDEANQLIKAISQRLDAPRIAIK" gene complement(32320..33621) /locus_tag="DP116_03175" CDS complement(32320..33621) /locus_tag="DP116_03175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743548.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ScyD/ScyE family protein" /protein_id="PRJNA477356:DP116_03175" /translation="MRLSLVSKSAISFAALTFCFAAVFGTKPATAASFSVVAEGLDNV RGLNFGPDGSLYITESGVGGNGRCIPGPSLDGTPSCAGTTGAVTRVKDGKQERVLTGL PSTALRPIGSTGEGPQDIQFDAAGNPYLLIGYGGNPTIRDFPENSPSWGQLYRVDFQT ASLTSIADFAKYELANNPDGVETLDFTGEIASNAYAFTIKGNTAYVVDAAANDILTVG LDGSNLKTFTVLPKQTITNPIFPTPAPGQESPPDAPPPGQTPQEVEIQSVPTGAAFGP DGALYVSEYTGFPFPVGKARIFRVGSNGEVTVYADGFTQLSDLEFDAQGNLYTIQYSN APQWLGVGDASLIQISPDGTRTTLLSGNGLESATALTVGPDGAVYVSSKGDRPGVGQV LRVDPKAKVPEPSVVVGAVAFGVLGLSTLHKRKPKRVVLKK" gene complement(33890..34557) /locus_tag="DP116_03180" /pseudo CDS complement(33890..34557) /locus_tag="DP116_03180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013322487.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 35024..35932 /locus_tag="DP116_03185" CDS 35024..35932 /locus_tag="DP116_03185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310547.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar ABC transporter permease" /protein_id="PRJNA477356:DP116_03185" /translation="MFEVRSRQRNTRSNITEDLAGYLFMIPTILVLGTFVVLPILWAV FLSLHKVQLLGNIEYQFVGFRNFTRLIEDERVWIALRNTAQYVAIVVPSQTVLALILA VTLNSGIRAKNWWRILYFLPTVTSSAVLTLIFMWIYNTNGLLNDFLAFVGLPTYNWLG DPAVALKGIMIMNIWSTAPFYMVIYLAALQDIPRSLYEAASLDGANGWQQFIYITIPI LKPVTFFVVTIGVIGTFQLFDQSYIFSNGTGGPNNATLTVVLLIYQAVFRNLQMGYAA AIAFLLAVVIIVVTFIQRRFLGGEKV" gene 35996..36637 /locus_tag="DP116_03190" CDS 35996..36637 /locus_tag="DP116_03190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315094.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_03190" /translation="MLLKLKQLEVPPGQRVLLKNVSWQEFETILEELGEHRAARIAYE NGMLEIMTPLPEHEVTKVFLSNFVEIILEELDIEFLPLGSTTFKNKLMDKGIEPDNCF YIQNEPVVRGKDRLDLTVDPPPDLALEIDVTSRTHSSIYEALAVPELWRFEKGKLQIN VLQNGKYIESTSSPIFPNFPLQQVIPEYLQQCKTVGRNKTMRAFRAWVQELIS" gene 36782..37615 /locus_tag="DP116_03195" CDS 36782..37615 /locus_tag="DP116_03195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874129.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbohydrate ABC transporter permease" /protein_id="PRJNA477356:DP116_03195" /translation="MKIRRFLTWKTLLYILLTLYAIITLIPFLWALSASFKPLSEIVS SEPNFVPKNFTLDNYKQIFFQEPLFLRWLFNSVVIAISVTVLNLLFNSMAGYALARLD FRGKGFWFFLILAVLAVPAQITLIPTFLILKALGWLNSYQGMIVPSMVNATFIFMMRQ FFVNFPRELEEAAQLDGLNTWGIFRHIVLPLAKPALAAQAVFVFMGSWNNFLLPIVIL FDPEMFTLPLGLNSFKGQFISYWNYIMAASMVFTLPALAIYAFFNRYFIEGVTFTGGK G" gene 37681..38880 /locus_tag="DP116_03200" CDS 37681..38880 /locus_tag="DP116_03200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198843.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="PRJNA477356:DP116_03200" /translation="MAQIVIVGAGPTGATLALQLVKRGIAVTLIEAAKDFHRVFRGEG LMPSGLDALEQMGLSTILEDIPHRQLDAWEFILGEKQLFRVEEPMGADRPCTLVSQPP LLEAMITEAKAYDGFEFIQGVSVKDLLWINNRVAGVKLGNGREISAELVIGTDGRNSV IRQRAGLQLVRQPKDVDILWFKLAASSRFTADNVFYFILNGERVFTIFHGAEEGKLHL AWVISADERTDRKQPDWAEIFASLSPSWMAEHFRSYADTIETPIKLSVVVGRCPSWYA PGVLLLGDAAHPMSPIRAQGINVALRDVIVAANHLVPVFHARAGHQEIDAALSRIQAE REPEIIRAQQLQIKEAAQGELLRKNALMRWLLIQLAPLLRRTIRHSWLKRQYKMRQGI TQVHLNV" gene complement(38920..39333) /locus_tag="DP116_03205" CDS complement(38920..39333) /locus_tag="DP116_03205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874577.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_03205" /translation="MSTRDTFAYNVTHSLSQKETSEYNQISSPVESDNVQPLSHQVLN TEKAQRMAEFFGFLGDANRLRILSLLAQQELCVSDLAAVLNMSESAVSHQLRNLRAMR LVSYRKQGRNVFYRLHDSHVLHLYQAVAEHLDEQE" assembly_gap 39359..39368 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 39467..40312 /locus_tag="DP116_03210" CDS 39467..40312 /locus_tag="DP116_03210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874576.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metal ABC transporter permease" /protein_id="PRJNA477356:DP116_03210" /translation="MAEFAVTNVNDLVSLLQFPFMQRAIAGAVLMGILGGLLGSFVTL RQLSFFSHAVGHAALVGVALGVILQLNPTWMLLPFTLIFGLVVLYFMDKTDLASDSVL SIVLSGALAVGVILTSLIQGYRGNLMSVLFGDILAIDSTDLILSLLVLVGASVFLLST LRQQILLTLNPAVAKVQGIPVQWYRYGFVVLLSLAVAVAIKAVGVLLVNAFLVIPASC AKLMSHHFNRFLLLSVIVGCISSIAGIIVSGLFNFASGPSIVLVQFVVFLTVFGCVKL RTKAA" gene 40389..40461 /locus_tag="DP116_03215" tRNA 40389..40461 /locus_tag="DP116_03215" /product="tRNA-Phe" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:40422..40424,aa:Phe,seq:gaa) gene complement(40855..41934) /locus_tag="DP116_03220" CDS complement(40855..41934) /locus_tag="DP116_03220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195101.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type 2 isopentenyl-diphosphate Delta-isomerase" /protein_id="PRJNA477356:DP116_03220" /translation="MTNTIPPVDAATEIENRKADHLRVCLEEDVQFRDVTSGFEYYRF THCCLPELDRSDINIQTTFLGKNLGAPLLISSMTGGTELARLVNTRLATVAQHYRLAM GIGSQRIALEMPHLASTFAVRHLAPDILLFANLGAVQLNYGCGLVECLQLVDMLAADA LILHLNPLQECVQSHGDTNFRGLLPKIAELCEKLPVPVVVKEVGNGISAAMAQKLIDV GVTAIDVAGAGGTSWAKVESERAKDNKQRRLGQTFADWGLPTAECITEIRAVAPTIPL IASGGLRNGLDVAKAIALGADLAGLARPFLEAAVNSETAVDELVEVLIAELETVLFCT DNVTLQELKNSGVLQRHVTVPAAAS" gene 42310..42816 /locus_tag="DP116_03225" CDS 42310..42816 /locus_tag="DP116_03225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF177 domain-containing protein" /protein_id="PRJNA477356:DP116_03225" /translation="MDAIYIPQLTKAPQATEEIQVKEFLPGLETLTPVRGRIRVQHHG NYLEVFSQAETIITCTCSRCLQNYNHRLTIDTKEIIWLDEAANEDDLPLEREIAFDDL VETLSPQGYFDPAAWLYEQICLELPQRQLCDANCLGIQPSAPSKSDKRVDRRWASLEG LKKQFPGA" gene 43087..>45009 /locus_tag="DP116_03230" CDS 43087..>45009 /locus_tag="DP116_03230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867965.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_03230" /translation="MLCCLNPDCPQPQNPEGIIHCQSCGALLIPLLRGHYRIIKVLSD EGGFGRTYLAEDVDKLNERCVVKQLAPKVQGTWALKKAIESFQQEAQRLQELGKNSQI PTLLAYFEEDNYLYLVQEFIDGQTLLKELRQRQTYHEAEIRQVLLDLLPVLKFIHERG VIHRDIKPQNIMRRQTDGKLVLIDFGASKQLSATVRTKPGTSIGTRGYSPLEQLQDGE AHPASDLFSLGATCFHLMSGVSPGALWAENGYGWVASWQQYLKSSISADLAKIVDKLL KKDIHERYQLADEVLKDLEPQPPSPQPPRLKSRLLAGAGIVFLGLGGFLYINNFQILN LIAENNFLMKTLNGHSNLVTSVAVSSIPPDSPLYKGGLGGIVASGSFDTTLKLWNLST RKEIFTLEGNAGSVYSVAISPDGRTVASANGDKTIKLWNVFTGREIYTLYGHSSSVES VAISPDSKTLASGSFDGSIKLWDLPSGREIATLKEHSGAVKSVAFGPDGQILASGSED NTIKLWNLKNKQVIKTFKGHSQPIRSVAISPIHSDSRSLGGRLGGILASSSADDTIKL WDLATGQEIYTFKGHSYSVNSVAFTSDGKTLATGSSDHTIKLWDVATKTEIRTLRGHS KEVTSVAFSPDGNTLVS" BASE COUNT 13083 a 9597 c 9499 g 12810 t 20 others ORIGIN 1 gacaacaata aattacaccg ttttatggag ttttactctg ttttacgagt ctcccttgct 61 gcattcgtaa tgtagagata ataaatcatc tcggtcgcat tagcgaaatt gcgaatccgc 121 tccaaagttt gagccactgt aaaatcttct ttgagcgaga ttaactctgg agtcatgatc 181 cgtccagcag ttccagcttc ataacccaac aatagagatg ttgcttggcg ttcgtttggg 241 ctgagttgtt ctagcaatcg attaacgatt gttgctggta attcgtcaaa taaccgagct 301 cggtcatctg gcgacatttt atcaacaata tcaagaactt cctggctttt caattcctca 361 atgagtcgtt cttggacgct gtagtcgaga tattcataaa cttcgattgc ttcttgttta 421 gaaagcaagc gaaatgccaa agcatgcatt gcttctggta aaccttcaat cgcctccgcg 481 atatctggtg cttgtacagg tatcaaaata gcttttgctc cctgtaaatc ccccgcttcc 541 agcagcattt gcaattgaat tctaactatg tctcgcaatt agaagtgcgc gttacgtctt 601 ggagggtaga tgtcacaccg tttggagttt gcataagata caaccagggt aatgcgagat 661 agtatattgt acctgattta tacgtgacaa taggttttgt ataaatctcc aaaaagaggg 721 attgaaaatt aggaatttca tgaaaaagtt tttttcagaa attttcaaaa caaatggaca 781 aatcaagcgt taaatctgtt atttttgtaa gagagcgcat ctttgttatg ctttctagct 841 ggagatagaa gcccaatttc acttctgtgg attactgtta ttgttaataa gttttcaaaa 901 atgcctgacg catttaagaa gaatgtattt acgaagtgcc acaccttaag actgttgctc 961 aaaatattgt agatgattag aaactgccaa 
aaaaattttt aaccctgact gaaaaagtaa 1021 actttcgtaa caattgtcta atagaaaaat ctgattttat cacttagtag cttagcgcca 1081 aagagcaatt tttgcaacaa atcgttctgt atatattctt aacttaaatt gtgtccctcc 1141 tacactttac ttcgttagat tttttgttca acctttaaaa ctgtggagaa gaaatttctc 1201 gtgaatataa acaaattaga gatagacaaa tttaccaaac gcgtacaggg aatgcatcag 1261 cgtttaacag atttatacca aaatctaagt acagttcctg ttataccaga tgtattaccg 1321 caagctcttg tcgaacttgg tagcgcctca gaaatagtac atgtagccac agaggaactt 1381 catcagcaga atgaagaact cttgcgaaca cgaaacttgt tagaaagaga acgccaacgc 1441 tacctggatt tattcaattt tgcaccagac gcctatttag tgacagatac cttaggcata 1501 attcaagaag ttaaccatac tgcggcgaca ctactgaacg tctcacagca gtctatgcag 1561 ggcaaaccaa tcgttaactt aatagcgttt ggtgagcgtc aaatctttcg cagctacctg 1621 acgcaactat gggcatccaa taaagtcaaa gagttaatag tacatattca aaaacgcaat 1681 ggcgagttat ttgacgccgc cataacagtg ggagttatcc gcacttctgc gcatgagcca 1741 atcggacttc gctggctagt acgtgacatc accgaatgca accgagaaga attaacactg 1801 ttaaaaaatg actccgactt aactcaaaac cgcctttggc acaaatgtag caaaggagac 1861 tttatccctc tcaaaccagg cataatttgg tatgtatgtc aaggtagtgt caagcttagt 1921 acactgtgtg aaacgggaga ggaactcatc gttgggttgg caggaattgg aatggttttt 1981 ggttcaaatc taactgtgct acaggcttac caagcaacgg ctcaatcgga tgtggaatta 2041 gtgtcaattg attatgcaga aatagcagct tcaccgacgt tgagtcacac cctgttacca 2101 aaaattaatc aacggttacg gcaaacggaa tcttttttgc tcattgcaag tagacgacgg 2161 gtacaagatc gctttcacca attgttactg cttttaaaac aggaaatggg tcaaagcgtg 2221 ccacaaggaa ctcgcttgag tattcgcctc actcatgaag aacttgccag cgcttgctgc 2281 acaaccaggg taacaattac acgactcata ggtaagttac aacaagaacg caaaattggt 2341 tttgactcca gacaccacat tattttgaag gatatatagg aatcagaaga gggaacaggg 2401 aacagggaac agggaacagg gaacagggaa cagtgaacag tgaacagtga gtccagcgcg 2461 catgaggcgt tttcccgccg taggcgactg gcgttcgcga agcgtctccg aaggagatac 2521 ccgaagggtg aacaattatc aagcaattga tgactgatga ctgatagctg ataactgata 2581 actgataact gataactgat aactggttaa cataaactgt acgaagcttc gcaaaaatca 2641 aataggaagc ctatatagtt tgaaaccaaa ggtgaaaaag 
tcaagaaaac tttttcaccc 2701 acgctaataa tcttttggta tttggtgtga atgcagtcga gagataatag gcatctgcag 2761 ccttcctaat tgcttcagcc tgagttggat aaggatgaat cacttgactc agtttattta 2821 agccaatatt accgacaatt gcggttgtga tctgtgaaat catttcacct gcatggcggg 2881 caacaatagt agcgcccaaa attttatcag acccctttcg atggtgaatt ttgacaaatc 2941 cttcctcttc tccatccgca actgcgcgat ctacatcagt aaaaggaatt ttaatcgtcg 3001 cgacatcaac acccatctgt tgcgcctcat gttcatacat tcccacatgg gcaatttcgg 3061 gatcagtata agtcacccaa ggcatgatta aactactgag tttttgccgt cccaaaccaa 3121 aaggagaaaa cagggcattt ctaatcacga tccgggctgc agcatcagca gcatgggtga 3181 acttccacct catgcaaata tcacctgcgg cataaatttt tggatttgtg gtttgaaggt 3241 aatcattcac cttaacaccc tggcgttggt cgtactctac cccgactaca tctaaattca 3301 gaccttccac atttggggcg cgtcccgtac cgactaaaat ttcatctact gtgacaacat 3361 cttgcttacc attgcaggta aagtaaattg ttttaccgtc ctgggtcttt tccacatttt 3421 gtatctgaca atttagcact aagcgaatgc cttcttggac aaagactttt tggacaatct 3481 cggcggcgtc tatatcctct ttattcagga tatgagatcc tttatggaaa agtatcacct 3541 cacaacccaa gcgtcgaaaa gcctgtgcta actcgcagcc aattggacca ccaccaatca 3601 cgactaaact ttgtggtttt tgaatcagag aaaaaactgt ttcattggtc agataacccg 3661 cctcttcaag tcctgggatg gaaggttcta tgggtctggt tcccgttgca ataaccgctt 3721 ttttaaatcg gagagtctgg tcatcaactt ttacagtatc actgctagaa aactcagcgt 3781 tccctaagaa gacatcaact cccatatttt ggaagcgttt tgctgagtca ttatggctga 3841 ttccagatct gactcgtcgc atccgttcca tgactgagga aaaatcaaca tcaacatatt 3901 tcgacgtacg aattccatat gctttggcat tccagatttc accaaccacg cgagaagacc 3961 ggatcaagca cttggatggt acgcaaccaa aattcaaaca atccccaccc ataaggtgcc 4021 tttcaatcaa ggcaactttt aaacccccac ccacaagcgc tgcacctcca gccgccacta 4081 atccggctgt acctgcacca atgaccacta agtcataaca atcagcaggt tgaggattga 4141 tccaatccag tggatgtacg taagaaacca atgtctggtt atactcatct actgggcgaa 4201 ttgtcactct ttggagttct gacattggtg tatctccttc aagaattgat catgcaaaag 4261 tcttatgggt gtagcttttg ccagcgaaaa attgagcagg ctaacttcac aaggatgaca 4321 acctttgcaa ttgcttctca gttataaaat tgacaagcag ttacaaagaa 
tttcccgatt 4381 agacaaggga ggatgcaatt ttaatactct gattccaatt taagaatacc ttttaaaggc 4441 actatctcca atcaccacat caaactaaaa tatgacatga tcaggcggaa attcaatcaa 4501 gctctatcga agctaatatc tgaattaaag atgtattttg taaggtatct attaatgaag 4561 cgaaaagata aagctcggtt acagtctatt tttagtgtcc tgctcctttt tttcttcaca 4621 ggctgctggt tgttgtggaa tcccaccgct gttctagctc aggattatac tgttaactac 4681 acgtttgctg acttacaaca tcaagatttt tccaataaag atttacatgg aacgtccttt 4741 gcaggcggaa atatgcaggc ggcaaacttc cgaggggcaa atctcagtgg aactattctc 4801 accaaggggt cgtttctgaa ggcagactta agcggagcta atcttgcaga aacatttgcc 4861 gatcgcgtga tctttagtga agcgaattta accaacgcca ttttgactga tgccattttc 4921 agcagcagtc acttttttga tgcggtgatt acaggagcag atttttctaa ttcaattgtt 4981 gatccctacg aagtaaagtt gatgtgcaaa cgtgctgatg gagtcaatcc cgtcacaggt 5041 gtctcaaccc gtgacagctt gggatgtcgc tgaatgatca gttatcagtt atcagttatc 5101 agttatcagg tttcactgct tagtgttagc tgaactgaaa gctgctcaag ttgcccgaat 5161 gcaaacactc ttttccatac gactgggaaa tcattcgttg gagtcggaag ggtttgcagc 5221 gattcaaaat tttgataacg tttgatgcct tttaccagat ggatgtcacc ccacctgctt 5281 taccagcata tatcaaatgg cagtaagctg tttatttatg acatttgtaa gaaactaaac 5341 aagtatggca acaattaacg acaactacct caagctcaaa gcaggttatc tgttccccga 5401 aatcgggcga cgggttaatg ccttcgcaca agccaatccg gatgctaaga ttatcaagtt 5461 aggtattggc gatgttacag aaccactgcc ggaagcttgc cgaacagcga tgattaaagc 5521 ggttgaagaa atgggcgatc gcgcttcgtt caaaggttat ggtccagaac aaggctatac 5581 ttggttactg gaaaaaatag ctgcccaaga ctttcaagcg cggggatgtg aggttgatgc 5641 atcggaaatc ttcacttccg atggttctaa atgcgacacg ggcaacattc tcgacatctt 5701 tggtaacaac aacattatcg ccgtgactga ccctgtttat ccggtgtatg tggataccaa 5761 cgtcatggcg ggtaataccg gatcagcaaa cgataaaggc gaattcgagg gcttagttta 5821 tctgccaatt accgcccaga acaacttcac tgctcagatt ccttctcaaa aagtcgattt 5881 aatttacttg tgtttcccca ataaccctac aggagccgtt gctacgaaag aacatctaaa 5941 agcatgggtg gactatgcca gagcaaatag ttccatcatt ttcttcgacg ccgcttacga 6001 agcctacatc accgatgcgg aacttcctca ctccatctac gaaattgaag gagcaaggga 
6061 atgtgcgatt gaatttcgtt ctttctccaa aaatgcaggc tttaccggta ctcggtgcgc 6121 gttcaccgtt gtcccgaaaa acctgacagc aaaagcagcc gatggttccg atgtggaact 6181 gtggaagctg tggaaccgcc gccagtctac caagttcaat ggcgtgtctt acatcgtgca 6241 acgtggagcc gaggcagttt actcccagga aggacaagct caaattcagg aactgattag 6301 cttctatttg gagaatgcca aaattattcg cgagaaactc acagccgcag gattagcagt 6361 ttatggtggt gtgaatgcac cgtacgtttg ggtacaaact ccaaatggtt tgtccagttg 6421 ggatttcttc gataagttgc tgcacacttg caatgttgta ggtacacctg gttctgggtt 6481 tggtgcagcg ggtgaaggtt acttccgcat ttccgctttc aatagtcggg agaatgtgga 6541 ggaggcgatg aagcggatta cagacagatt cacgtcctag tgtaaaattc cctacaatct 6601 tacgttgttt atataaactg ctcatgattg tagggtatgc ccgtgttagc accaacgccc 6661 aagctaatgg cgaggcgttg gatcagcaaa tagcaagact caaggctgct ggtgcagaag 6721 atatttttgt tgatgtcgag tcgggacgta gcgttaaacg tattatcagt tatcaattat 6781 cagttgtttc ctgttcactg tttcctgttc actgcgtagg tagctgtttt aaaaagttgt 6841 tggcgctaca ttaatcctct gcaaacataa aagttcaccc ttgaggtaga aaagaacaac 6901 aaataatcta atagacatct ccgaaaaatg ctaataccgc tccctaagtg gctttcaaca 6961 ggcgattcct acggagagct acgcttaacg ccaatttcaa acatcagcca gtctccacgc 7021 acaaataaga ctttgagaca gttactgttg ataataaggc aaaaataagt atcttgatac 7081 gtcaattttt acctggttca atcgtgccat tttatatttt taaaagcaaa gctcctattc 7141 ggaatcttac aagtccacaa atagtcatga ttacttgttc gtatttctgc ggatttaatc 7201 taaacctttc agaggctacc ctaaatatct taactaaacg aattaaatgt tcaacaaata 7261 ttcgtgatga tgctagttct ttatttcgtt ctttttgttc tgaggttaat tctccttttt 7321 taggcttttt ctttggggtc ttgattgata attctccttc ataaccttta tcgccgttga 7381 aactttgtct agggtcaaat cctttttggc gttctcggaa taaatttatg tcactttttg 7441 gtctaggttt acccgcgact acatcaacaa tatcatctcc atttggtaga acgataagct 7501 ggtttttcat cgtatgattt ttcttcttcc cagagtaata ctttctgtcg ggaaaccctc 7561 atcaaggact ggctccctcc gggtatgcct gccttcgcct gacggctaac accagtcgct 7621 tacggcggga aaacgcctca tgcgcgccct tattcaccgc aaggcgctgc tctccgcaca 7681 gaagtgacac atgagggaaa ccctcccgca gcgctggact cacatgcaac gtctctacaa 7741 
ggttacagga aaatgaacat cgtgctacaa ccataattca ttcccaaatt cagcaacgcc 7801 gaacacttaa cattccttga tctggttgtt gctctcctta cgagctttct cgtaagcctc 7861 catcgacgaa ccatgctttt gtgcgagcag ggcgtgccat tctactgcat gaatactgaa 7921 ttgtgaaacc ttatgtatag ctttctactt gtaacgccac ctattcaagt atttttaaca 7981 tgatttactc aatttcaaat taaaaatatt tggcgacttt tgtattttat gcagtaagcg 8041 ccaagaatac aattagaaac tgttgttgta tcacaatggc cgacaaccca ggcggtggtt 8101 tttgatttta ctgcgccgga accgctactc gattgccggg aacaatctgt ggctgatatg 8161 aaaacacctc tcggctacgg gaatcgacag ttaccttgag ccactgtacg tcgttggtgc 8221 cgttctcctg attgtcgcca aatgtctcga cacgagtgaa gttctcaagt cgtctgcctt 8281 gagcatctag aaatggcttg tcaatgcggt agtagtgcga atcgccatgc acgtaggcaa 8341 ctggtttacc gaaagcgatc acctcatcgc gcaacgctaa taaaaagtcc ttgaagccat 8401 caggctgccc atcggtttgt acgagcgtct tcggatcgcg caaaggtgct ctggttgggt 8461 cgctctgatc ccaccctgga tcagcttggg agatcaacat gaccgccgcc gagttgcctt 8521 cctttgccac tctaaaggtc tccttcagcc atgcaatatt ggctgcgttt cgcgccgcgt 8581 actcctctgg gtcaggcgca gtgtcgcata ggttattgca cgaaccctgg atgttgagtg 8641 tcgcataagt aactttgcca actgtccagc gacgattctc gacacacggc accggaccgt 8701 tgacaccgag gcacagcggt gccgtctgca cttcctgggc taagcgacgc tgccccagag 8761 aaaaatttgt gctgaagaat agctgacgct cgtagtcaag gcgttcgagt gagttgaatc 8821 caccatttga agggcgatcg cagtctgtcc aatcgttgtc acccggtgtg aaggctgctg 8881 gtgcctctag gctgttgaag tagcctctag ctttggcgta cagcgcgttg tcgcaaacgc 8941 tgttagaacc agctttgaga tcgccatcat ggacggtgaa agaaagtttc tgcgaattca 9001 tatcgtcgat tagtcttgtt acgccaacct cctgcgcctc tgagtagggt aaatcgcccc 9061 aaagtccgat ctcgtatggt gcgttcgcat tctccttccc tctggcaacc accagactgg 9121 cagttgcaaa gaggaggaac ggtaaagtca cggtcaggaa aaacatttta ggtagcaaat 9181 gtgcagtagc gacacgtaga aacgtcgagt acccaagctt tttcatcata tcctcgatta 9241 tttttcaaag cgtgagtgca ttttggacaa gtagcttttg ctcacgctat tccagtctgt 9301 gatttagact acagcatgtc aattaacgat taattatgaa taggttatga gtggatagtc 9361 ctatcagggt gttttaataa gagattgtag atgaggctca tacttaagcc tgtctcaaca 9421 ataaaacctt 
gataattttc tgttgaattt actaaatact gtacgttcgt ttattttaga 9481 atggcaagaa gaattaattc cttgtgggaa aaggagtcga acaattttcc attctaatcg 9541 catatcgcag gagcatttct cggtagtccg gagatgttca tcaaaaaatg gaataaggaa 9601 aagcactaca ctacaagcac agtaaaggat tgagaaagtg aaataattaa aaagcactag 9661 cgaatatcaa cagctagtgc aatcagttca tttttgaata gttatctcta atataatact 9721 tttcaactta agtcattagg tagaaggaga taatcaccaa ccaggctatg tatagttgta 9781 ataaattgtc aaaaaacact ttaatatttc tactcagaaa taggtggttt atggaaaaac 9841 aacctgagat acacaatttt ctctgtcaaa aatataaata cagaggtaaa ttcactccac 9901 agaatttagt attcaacgcc aacttacaag aatttgcaac acacgtgagc tacatttgca 9961 atttacaaac tcttggtaaa ctttcaacag aagatgcata caaacaaatt aacgagcttt 10021 ggcaacggct agaacgtagt tatttagaat tagagattga tgcagattga tatagatgtt 10081 gaaactcagg gattaacctc tccaattctg tatgatagct tgcccgtaaa gtgccataga 10141 tgcgcttaat gatacttaag cttcaattat tcagaataca aactctttag gagcgcaaac 10201 tcctacccaa ggaggagaat gcacatacta tggggggggt gggtcacaac gattgaaaaa 10261 tctgcctaag ctgaggaaaa atcgccctat aaaaaatctt gattttgtat accagttgac 10321 agtccggggc gatttttccg tcaaagttgt ggacttctta cccataaact agtaattcgg 10381 ctggcagatt tttccggttt taatgaagcg gaatgcacag cggcaatgaa acgtatcaag 10441 caaaagctta caatattatg cactattcct atcatttgaa ccaaatgctt tgttcttcag 10501 attttcttta caaacagatt cttactagac atctgtgaaa actaatctca aactcttgtg 10561 gtgtctgggt taatttttca tttctggaga cgtctactgt tctttaacta aatctttccc 10621 aatccataac tcatactgtg ctagcgataa actaatattc taatcggtct ttgaaatcgc 10681 aacacatgca gaaccctacc ccgataaacg cagaactctg cctcagtaaa cggtttgcca 10741 tatctgcatg ggttgtactg actttaatat tatgtatgag gagaaataag tgttcttgtt 10801 aaaaaaacga cgcacagtca ctctgactgc gactgttgct gcaacggtta ctgctgttgg 10861 cttcgctttc gctagcagag ctgattatct atcactcgaa cagtttatcg acaatggtcg 10921 cgcaaagaat gtcattcttt tcattggcga cggtatgggc gattcagaga ttacaatcgc 10981 ccgcaactac tccgtaggcg ctgctggtcg tctggcactc gatactctcc cattgactgg 11041 agagtatacg acttatgctc ttcaggagag caaccccaag ttacccgatt atgtcaccga 11101 
ctccgcagct tcgggaacgg cttgggctac aggaagcaaa acctcgaacg gtcggatttc 11161 gacaacggct agcaccgact ctgacctgaa gacaattctc gaactggctc aggaaagagg 11221 ctttgtcact ggtaatgtca gtacagctga gttaactgat gcgacgcctg cggtactcgt 11281 gtctcatgtg tctaatcgca attgccaagg tccaaaagat atgacaagct gtccacagga 11341 caaaaagtca gcgggtggtc ctggttcaat cgctgagcaa tcggttgacc acggtgttga 11401 cgtactgctt ggcggaggta agcagaggta cgaccagatc attgatggcg gtcggtttgc 11461 tggcaagact gtgattgaat cagcgcaggc acagggttat caagttgtga cagatgcttc 11521 cggattgcag tcagcacagc caggaacaaa gctattaggt ttgtttaact ctggcaacat 11581 gagtctggaa tgggctggca agccagcggt tgtgtaccct ggtaatgagc cacaacgctg 11641 tcgagaaggt ctcagaccat caaatgaacc tagcttggct gacatgacga gcaaagcaat 11701 cgaattgctt gaatcaaagc agggtggtca gcggcgtttt tcacagaaaa caggattctt 11761 cctacaggtg gaaggtgctt cgattgacaa gcaagatcat gctgggaacc cgtgtgagca 11821 aattggcgag aacgtcgcat ttgatcaagc gatcaaagta gcgcttgact atgctaagac 11881 tcatcgggat acattagtcg tcgtgactgc tgaccacggt cacacgagtc aaatcattcc 11941 taatccaaca cagactgctc atagcccagg aaaatacagt acactgatca ctgctgatgg 12001 agcgcagatg acagtgaatt acgcaaccaa tctttctgat cagtcgcaag aacacaccgg 12061 aactcaagtg cgtattgctg cacagggacc gcaggcgtca aaagttgttg gtgtaattga 12121 tcagaccgat ctctttcaca ttttagctcg tgctatcggt gcagagtaac tgattattac 12181 gaaggtagtt ttttaacgac tcaattctcc ttctttgggt taaaagtata gttgagttag 12241 ttcacgaacg aattctgcga attcatttgc tctgttatct ttttgcaaat ttggaatgct 12301 cggacgagtg cacgattcag acttcctttt tccaaggaag tctgaaccgt gctggataaa 12361 actttgctgc gtatttaaga gaacgatatc gcctagcaat ctcaaaaaaa aactacagtc 12421 taagataaaa cctaaaactg taggaatgac aattatgtta ttatcaccag aagtttttgt 12481 atagactagg gaagacgcca gcttttgtca atataacgct ggtttcccac ttctgatatt 12541 tgaatagctg ctttgtagcg accttggttt gtcttggttg tgagaacgcg agtcttaact 12601 cgcacctgct cggagtctgg tcctcccttt aatccttcaa gtaacacccg accgtcaaaa 12661 gcttcactag tgttgatgcc cttaagcgct aagattgtgg gggtaagatc tacattacta 12721 gcaggaactt gaactttaac gccgcgcttg aaatccactc cccaagcaaa 
gaaggtgttg 12781 cgaacattcc agggactcat actaccgtga ccgctggcgt tacctgtgcg cggacctgtt 12841 gttccactgg tatcggtgaa gtcagtccct tgtacaccaa aagcattctt atctgatgtc 12901 caaggaaagg taaagaggat atctggaccg cgctcttggt tgaactcgtg gatcaactcc 12961 agtgaaaagg ttccaggtat tgagccatca acgctcttgg gttttcggtt actgagattg 13021 ctgcttgagt ttttaccagc agtgaaaacg acaccaatcc aatcttgttt gtgcaagaac 13081 tctacaattt ctttaattcg ttgtggtgag cggtttttta cgtgcaaaag gacagcttgc 13141 ccactgctag ctatgacgac atcgtccgaa tcgggtccgg ctttaagtcc agcacggatc 13201 aattcctgtg tgacgttaac gccaaaagtt tccaggctaa aaccgtgatc tgaaacaaca 13261 aagatatcag ttcgatcttc taaacctaga gctttaagct gttctattac gagtccgatg 13321 ttgcgatcat cattgcgaat tgtgttagtg gactctggag aacccgcacc agtattgtgc 13381 tgcgtattgt ctggttctgt caaccaatta aggactacat caggtttttg ctctggaagt 13441 acatactctc gcagaacctg ctcagtccaa tccactgctt cgttgtattg attgtcaacg 13501 tctccttctt tgggtggagc agccccaaag cgagagagaa ttgcatcgtt aacatcaccg 13561 gggaatgcta cgaccttacc agggttgaag tatccattaa tgacactacc aatcccgttg 13621 actgctcggg gattcagcag aagcgcacta cctgacgagc cagaacttac agcagcaagc 13681 ttcatcccgt tttcctgaag tctttcacct aaagttttaa caaagactac ccgcccacca 13741 ctgacttcgt caagttttag gatatctgaa tactctcctg tactgaacgc acgggtcggg 13801 ttgacctcag gcacgtacat agagttactc acaataccgt ttgtccctgg atagtaacca 13861 gttccaattg ccgtagcatt gacgcgagtg actgtaggaa agacagcgtg tccgttgacg 13921 tagtccacac cttcattacg caaccgataa aggtttgggg tatcttgtgg attgattaag 13981 tcaggacgca gcccatctaa aataaaaacc aaaatcaaac gagcttgacc tggcagtgtt 14041 gttgttactt tctgaaccag tgtggaatca gcatgagtca ttcccggtct ggtcaaagaa 14101 tgaccaaaca cgctcattgc taaaactgtt gttccgatga gtaccccagt aaagaatcga 14161 cgccgattcc aatggtgttt ctgagttggt tttgtggatt tattttgact gtgctgtaac 14221 ttattgttca tgcgattgct tcctacacct taagcccctg tttttgaaaa ctccgaccca 14281 atgggttaac tttccagcca gttgcgattg gaattgttct ggatttgcac ctcattagac 14341 acaagacagg tgacatcaaa aaatgatgtt gctggctagc gactatgttt ttaggctttt 14401 ttatttgcag ccacagtcta atggatgctc 
atcaagctag gaacaaggtt tgattaatct 14461 ccagttaaga cttaattgtg taattgccta gaaatgtaag gtcgacgaac tttaatccct 14521 ggaattattg cctacaggta gtttggaaaa ttagctgttc atcaaattac tgaacttata 14581 ttgacctaat gctacttttg tgaaatgcgg tttatacagt gcgacgctga cgtaatgctt 14641 gaatttacaa atatgtagta cattcgccaa cagtatccgt atctcggcga aataccgtgt 14701 tgcggttaag tttttcagtt ttaccaactt cgggtcataa gttggtattg agttcaaaaa 14761 tagacaaatt gaaaatcaag aagtttgatt tctaaattag tacttcttaa ctggagttta 14821 gactaaacca cgcgatccca gagcgcagac gccaagggcg tattgctatg cggtcgtggt 14881 acgtaagaat aagtataaaa aattaaggag atgcctggag ttaaaacaag catctcccaa 14941 cggaagatga gagaagcact acccttaccg tggcaaaagt caatcgagaa gttgaccctt 15001 ggaaggattg gtcagtctat gggaggagtt ttagcgaaga ttgacacgcc gtggcttacc 15061 tccaaaaccc cgtgcaaaat ctcctggctg ggaaactgga cagccacgct aacgtagaat 15121 cacagttaat ctagtcagat aaaatactta tatacaccgt aagcaggtga ctaaatctgc 15181 ttagtttatc cgtttccata ttttctaatt taattttctt taacccagca ttgctgggtc 15241 attttccatg ccttagtcta aactcgagta tattaagcaa gttgtatgac caacaccgtt 15301 attattacag ctgcctctga gggtatcggc aaagcaactg ctctcgaatt tgcgcgtcac 15361 gaatataatg ttgtgctagc agcacgtcaa agcgatcgcc tagaagcagc tgcctcacaa 15421 gtccgggcaa ttggtcgaga tgccctagct atttgtgttg atgtcacaga tcccaaacag 15481 gtggacgctt tagttgagaa ggcgatcgct cactttggca ctattgatgt gctgatcaac 15541 ttagtggaca aggcaatcca taatcccatc ttagaaaaac cggaggatgt ggcagtggca 15601 atctggaaag ccgttaagta tcagcgatct gatatgttag ttggttcagc acgtttgtcc 15661 aaggtggctt atcaagtatt tcctggtttg atgcaaagcc tataccaacg agtgcttgga 15721 atgcgagcac gccactatgg acagtaaata cttcaatttg tattcagaca attgtataga 15781 aggaaacctc agaaaagttt gtttctgtac agacagagta tcgtcatagt gtaaaattcc 15841 ctacaatcgt aggttgttta tataaacggc ttatgattgt ggggtatgcc cgtgttagca 15901 ccaacgccca agccgacggc gaggcgttgg atcaggctag caatgtggcg cggttctcac 15961 tgagtagcgg cgagggatgg ttagaaggta tgggtcaaaa tgtgctttta actgcggcgt 16021 atgtaagagc aatgatgatt gagatgataa ttggcatagc aatcataagt cctgttttgc 16081 caaaccgaat 
ttcttgtcca tccattttta agatagctat cacgcctgcg gttgccatta 16141 aaatccatat agttccaaag acaataaaga taacgaatga tgagtcattc atgttttttt 16201 cctttaattg acatcctctc ccgcctgaag tcgggagatt cccgaaacag atggataggt 16261 tcctaattca tcgacccagc ttactaccaa ttgtcctttc ggcaatatct aaaccgaatg 16321 accgcacaaa agtaccaatc atctagccta gttcaattga taccagaggc tctggtctcc 16381 aaaggcgttt tcccatgcat tgggaggttc atcggtgccc ccgattacca agcgagtttt 16441 aaagtttttt gcttttctgg tttgtctaac gctaggcagc gcacagctat gttttatgag 16501 gttttcgcta ccctactttt tgccttgagc aggctttccg gttcaaggag ccaatgcaag 16561 ccaccaccgt gggaattcca accatcaata caatacagca ctttgctcta acaaatcaat 16621 taagaaaagc cgccctcaaa gggcggggct ttaaacccaa tttttcggta agtttggggt 16681 nnnnnnnnnn catcaactac ttgtttttta caatctccct tctctatcct ctctcaccca 16741 tcgtctcctc accctcctcc ccaaacgaac ataagtttta tttttaaact aagagataat 16801 taaatgccta ccgacgtgca gttaaaatgt ctgtaccgaa tcggttatca ataaacttat 16861 gttatgtttc agccgattca cctcatttgt attgactctc gcacacagaa tttattcatc 16921 ctggcaggtg ataatgaaga catcgaattt gaagtgactt ctaatgggga ggtgttttaa 16981 tgagtcaaat tgactatgcg gcaatgtcta accagcagtt gaagcaatat atgctagagc 17041 atcgggacga cgaggctgca ctgaaagctt acttggatcg ccgccatcag cgctcaactg 17101 tcattatcac aaccgttaat gatcccgatt ttgatgctaa agtccaagct gcaattcgtc 17161 agcagatgag tgatagtggt taatttacag tgattttcaa aaaaacttat tatatacata 17221 ggcttaaact tttaaagttg cgcaatgagt aacttccacc gtaggcaatc acgtcgttat 17281 acccccaaaa ataaaaacat accctccctt ctaaagacac tgcaagaact tggcacaact 17341 ctaggtgaag tcagttctca aatcagtgag gatatggtta gcgaggtcac aaagttaaag 17401 cgagacattg tcaaatcagt taaatcggct acagaacttg gacaagacgt agcgcaatct 17461 gctatcgaac gagggaaagc agcacaacag gaaacacgtc agttttttgg agataagcta 17521 gaacagaatc aagaatctct gaggaaaaaa ctagacgaga accaaacagc aaaggaagct 17581 gctgcaaggg cacatattga acagattgat ttagatgcag ttgagaaatt tgtcactctt 17641 cttttaacaa aatttccaaa cgcaacgcct gaacaaatga ctcagcgtct gcttcgccga 17701 cagcttttcc gcgtcagcag aacttctgta gtcatgagtg ttgttccgtc caaaatggca 
17761 gagtctgtcg gggttgatta tgttgaaatt gctctcatcc aagctgaaat tatttttcag 17821 attgcagtcg cttacggttt tgaactacaa gttccagagt gtaaaaatga agcgtttgca 17881 attcttgatc gcgtactgag agccaataga ttaaccagaa tcggtttaag tgcaacacaa 17941 atgattccag ttgcaggagg atttattagt acaggaactg acacctattt agtttatcaa 18001 attggaaata cggctcagca gttttataaa tctttaactg aagaagaggt tccgggagaa 18061 attttagaga attttataga agagacccaa aggcgataca agcaacgcct atggtaggag 18121 gaagttaatt ggattgtaac ggattgtata agatccccga cttcgcaaaa gttgtcgggg 18181 atctagctta agttgacaca aataactcta agcccagccc acaagatttg tattgcactc 18241 aaacgagaac cgctatatct actatctcaa aaccaatact cttaaaattt gccaacatca 18301 actcatcacg ctggtgccaa acagcaaata tttgttgtct accatcatag agtgtttcag 18361 ggtctatttg ataaaccgca ttatttaccc gtttcagttt taaccctaat aaactcaaca 18421 gttgcttaag tatttttatc ggtgatatat actctttttc ttgcaataaa ttaataccaa 18481 tggctctttg gatgtgttta ctacattgaa aagcaatatt tcttaacaat aacaaatctg 18541 catcattttc ggtaaacttt cgttttggat caagatactg aagcattcct aatgctctta 18601 aagcttctac tttaagtgta taagttttta aatctggcag aaaaactttg ccttcacccc 18661 aagtcaattg ctgatgccat tcttgcttgt ctctgaggca gagatactcg ctttcatgag 18721 ttagataata atgtattaat agttggtgat aatatccttt gtcgtcacgt aattttagca 18781 ggggacttac ttctatacca tacttttgtc tcagaagata ttttttagtt tggttacgtt 18841 cttcttcagt cagcgagtgt ttaaccaata gctgctcgta ttctacataa tcaatatctt 18901 tagcattggc tacagtagtt gcagatgcta attgatttcg ttgcttaatt tctcttattt 18961 tgccgtcaat atcttttgat tttttacggc ttttcgccca atctttttga actttgaata 19021 tttctaagat gagtcttttc cgggttgtga tgtcactctg gtcagttgct ataaaggcaa 19081 ggcgtaagtc tcggataata ttcttttgaa catcattacc tcgcacatga atttgatgtc 19141 catcagatat caaaccgtct ttcatagatt tacggtaaag cgtaatagaa gcatttaccc 19201 tagcagacaa ttttgcccaa gttcgtaaat gaataggatc atacaccaac ggtaaatcga 19261 catcaacttt atgtagtgga ctgagtaaag ctatgttttc cttttgattt tcttgatacc 19321 aataagacaa tgaacgataa tttgtactac cactaccaat taacccaatt cctcgcttcg 19381 cacaccaaac aatacgcggt acatcgtcgc gtactcttgc 
taatgcttgt cgtgcttccg 19441 agtcaggaat aacgccttga aaaataccat aaacgcggtc aaaatgttga acatcaatac 19501 taatgcctgt accaaggcta ggagtcacaa aaacagtgtt gtattcagtg actttttgat 19561 tgatagctgc aacaaaatca acagcttcat gacctggtgt atttgttgtg tgactactaa 19621 caaccaaagt ttttggaaac tgcttttgca atctttctaa acgctcttga agataacgtt 19681 caatcgtttc acaactgtaa cgtccagagc gactatcagt cgtaacgtaa catttgtttc 19741 ctgtgattaa atctaattct agctgatgaa ttaagggagt tgggttagga gaatcataga 19801 aagtcacatc ccaaccttgc tgaggtttcc actgattcac aacaacccaa ggaatgattt 19861 gaatccctgc caattcctgc aaatattcta aggaaacatc tgataaatca gcatcttggg 19921 caattactaa tcctccactt atcagcacgg tagaaataag ctgctggaat atttttaaaa 19981 ttttgacgcg tttatcttta caagtgttgc tgttaagtaa atgccataaa gactgttcta 20041 cttcgtccaa aattatgatt gcccctcgcc aatcctctgg attcaacttc caaatggaat 20101 caacgcataa tccaaaagag tgaggagatg gagcaatagg aagataggga gataaggaaa 20161 cggggtaaca agggagtaat ttttctccct tgtctccttg tctccttgtc tccttgtcct 20221 tttttcccat tccccactga ataccgattt tttcgcataa aaaacgtccc agttggattc 20281 tgtgagtaat gagtaataca ggtttgtttc tgcttttggc ttgattcaca acagtttgta 20341 gtgttgtggt tttccctgtt ccttttgcag agttgactgc aactaatcca gaggagggga 20401 acggtatttc tcctaaatag cggcggttga gagtgagtgc aggtggaatg gttaactcgg 20461 tgtgaggctt ggtttgagca aggtaaactt ctaaatcaac actttggcgg tagactgtct 20521 cgaaagcact tgcacctttt gcgacgagaa agtcatcaat ccctttttct attcctggaa 20581 gtttgatgac tcggacggga caattttttt cttgaaatag aaatcctagt tgggagatcg 20641 cattagtgac tgcagcaatt ttcttaggtt cagtttcaaa atcaaagcaa atgtaaaaga 20701 tacgcccagt gttcgcaaat gctgctaaat ctggtatcag ctggcgactg ataactttgc 20761 caaaatcatc cttgatgacg cggtaaccac tggtaattcc agggatagca agagctgcat 20821 atccttgtgt caacagtgct gcagctttct taacaccctc gcaaataatg atggggatgt 20881 tccgttccat cacccattgc caaaagcctt cagcttcacc attggtggta atgttgatac 20941 tgtcaggcat gggtacattg taacgttgag ccacttgctg ccagatatcc agtgataccc 21001 gcagacaaaa tactcgcgtt ggcgtgctag ggggatgttc gtactttatc aacttggcat 21061 tttcatttaa tcgaggttgg 
ttgggcttaa agcaccccca ctccatctct tgccaatgat 21121 ttagggagtc aagtccagaa caccaccaac caccttccgc gatgtgggcg taacgctgta 21181 accacccact tttgaccatt ccagtatttg tacgcggaag atattcagaa attagcaggt 21241 aatcataagt ggttctgcct tttagagaca taaaattgag tctcgcaagg tttaaatcta 21301 taccacttcc ttgaaccaat tcctcaaggt gttgagactc tagatagtac aaatgcatta 21361 gttaaggcgt taggaaaaaa cagtgcataa caccttatca tgcatatgca attttgagat 21421 gagggagtat gaaagtactg tgaggcttat tacaccattt gaaataatac acctaaaggc 21481 atagattaat gtcaagcgat agctattctc ttgaactgcg agaactcatg cgtttataga 21541 caagagcaga catcttcgga aaaatctcta catcatcagc gtatgccgaa tctggagcaa 21601 aaactgccaa aatataagta gcttttttat cttctgtaac aaccattgct gcttctgacc 21661 gagaagcaga agtcccgccc gctttggagt acaatcgaac acgagtgtcg ggtaaagatt 21721 caccaaaaaa agtgcgtact gggttgaaat cataagaatc tggtggttgc caaactttag 21781 gatttaaatc tctttttagc catccacaca tttttctgct ggcttgtaaa gagacagcgt 21841 gttcagcata acacatttca tacaatagtc tggctgcatg tttcgtcgtt atcttattcc 21901 aatttccaac tggctgatga agcaactgtg attcacttcc ctcaggctct tgaagattaa 21961 gatagtcaat gggaaatgtt ttttgaataa tattaagatt tttgtagcca gcttgatgga 22021 aaaatcgatt aagttgttgg cgcttctttt tccacagttg aaatttttcg ctgttgagtt 22081 ctgattccga acgtgttcct gtgacttgat caataataaa gctcgcagcc tcattatcag 22141 actcctgaat catttttgct aaataaggag caaaatcctt ttcattttgc caaaaacccc 22201 tctctatttg ggcatataaa accaccatcc aaaacatctt aactacgctc gccggatatc 22261 tcggtatatc ttgttgatat ccggctgttt caccagtttt tgcatttatt aaggtaacag 22321 ataaagcctc tttaggcaga ttttcgtcag cagcaagatc gacaacatcg ttaacaattg 22381 cttgcaactc ttgagagtct ttaaattttg gaggtttttt gagattgtag acgacttccg 22441 aattttgttt ctcaggttga cgagagggaa tagacagtga tttggcgata cgcgtagcgt 22501 caagcctccg gcttatcgcc caatcaggaa ctggtacagc tggagttaat gatttggaat 22561 tttgcttgct ggatgattct acaacttgga taacaggaag tggcgaagtg gctgttgata 22621 cagttggagt tagtgatttg gaattttgct tgctggatga tttggcagca acttggatca 22681 caggaagtgg cgaagtggct gttggtacat gtaacgattg ttgggtagat ggtttgacaa 22741 
gttctgatac atatttagtt atcaaaacag ctaccaatac acctaaagtg atcaccctca 22801 ctgtcagatg gttgactttg acagcaaagt tcagcttggg gaattggtgc aaaaatgttt 22861 gatatttacg ccatttcggc attatgtctg atttttattt ggtttttaca aggaaaaaaa 22921 agtgaatctg ctaaatttta gcagaaggac gtaatgtctc acaaatatga taagaataga 22981 cttcagtcga ttctgaacaa aaataaacaa taaaggatga agtttgaaaa acctcatcct 23041 ttattctgag ttgatgcaca actgtagtca aagataactg ttttgtatgt gctaacggaa 23101 catactactt acagaactat cttcgtgaat ccgccagata gtttccccaa gtaagttagc 23161 cactgacagc acaactaact gtggaaagcg ttcgcgcagc gatttccctg aagcatcgct 23221 ttctggaatt ggaatcgtat ttgtaacaat cacttcctca aacaagccgc ttgacaaccg 23281 ctcaattgct ggtggtgaga acactgcgtg agtggcacac gcataaacct gacgcgcccc 23341 ttcctcacgc agaagttttg ctcctgcagc aatagtacca gcagtgtcta tcatgtcatc 23401 caccagcact gctgttttgc ccttaacatc accgatgaca tttaacactt cggcaacgtt 23461 gtgagcctga cgacgtttat ctatgatcgc cagtggcgca tcgttgagct ttttggcaaa 23521 tgcccttgct cgtgcaacac caccgacatc tggggaaact acgacaagat caggcagttc 23581 tttgcttgct agatagtcca gcaaaactgg agaaccataa acgtgatcta gtggtatgtc 23641 gaaatagcct tgaatctgag ctgagtgcaa atccattgca agaatacggt tagcacctgc 23701 ttctgtaatg agatttgcca ccaatttggc ggtgattgac tctcttcctg cggtttttcg 23761 gtcagcgcgg gcataaccat aatagggaat gactgcggtg acttgccgtg cagaagcacg 23821 gcgacaggca tcgatgataa tcagcaattc catcaaattg tcattcacag gattacaacc 23881 gggctggatg agataaacat cacaaccccg gattgattct tggatttgaa tgtatagttc 23941 tccatccgca aatcgtttgc gaatcattgg tcccaaatcc atgcccaggt aacgagcgac 24001 ttcttgggac aaaggtacat tggcagagcc agataacagc cgcaggcgag tattttcagc 24061 tagtcctgtt atcgtcggct gcacttttaa agttgcagaa ctgagcacag cagatcctcg 24121 atgtgcattc atggcaatct tatcaccaga gattagacag ctttaaccga aaatagtcat 24181 aaaaaatata cataagcaaa tggtcgtgag tttttttgaa gcttttatca aaactttaaa 24241 caaacctcta tcaaactaca ttttgcaaat tctccagccc ttgcactgag aaacaattga 24301 tagtttaact gaatattctg cctgtaaacc atacctatca tttagatttt ttggcatgaa 24361 ttggaaaaaa ccaactcaat tggatgtgtg tttagtagac tttccccttt 
aaccaccatt 24421 acagttgctt tgcacaagca ttcacacttg gactcttgaa tctgagaaat gctaagcata 24481 aatacttaac aacgattaaa ataagggaac ctaaatacct aatgatacta gatgatacta 24541 gttttgattg caaattaaaa aactcttagg ctttttttac tttgtcatca aacagtatca 24601 ttcaggggct tgcgccctct gatttccggt catacttgac ctgataactt tagagcacac 24661 aggtaagttt aaagagttgg gaaatctgat agaagttaaa acggacatac tacctacagc 24721 agatgcatgt gattgcgcaa tacgcacatg cgataagccg gaggcttatc gcctcctaaa 24781 agaaagttca accgactgtg tccgtctgga aactatccgt atcatagtga agtcctcctc 24841 tgttgctatt acactcaatt gatcaacgat gcaagcaccg attacaactg acactatctt 24901 gcaaaaccgt taccggatca ctcaaatttt gggtcagggt ggatttggta gaacctatct 24961 ggcacaagac cagaggcgct ttgatgaact ctgtgcaatt aaggaattaa ttccaatagt 25021 cacagaaggt tatgcttggg aaaaagcaaa agagcttttt cagcgagagg cagcaacctt 25081 atatcaaata caacaccccc aggtgcctca attccgcgag agatttgaag aagatcagcg 25141 cttgtttttg gtgcaagact acgttgccgg aaaaacgtat cagactttac tggatgagcg 25201 taaactcact gcgagtgcat ttacagaaca agaagtgtta caacttatgc gctccttgtt 25261 accagtctta gagtatattc ataatcaagg aattattcat cgagatattt caccggataa 25321 tattattctg cgagaaagcg atcgcctacc agttttaatc gactttgggg tcgtaaaaga 25381 actagcaacg aaattgaatt ctccagataa cacaacgcca gcaaccactg ttggaaaaca 25441 tggttatgcc ccaactgagc aaatgcaaac aggtcgaaca tatcctagca gtgatttgta 25501 tgcactagcc gttacagcaa ttgtattgct gacagggaaa gaaccccaag atttatttga 25561 tgaacatcaa ctgacttgga attggcaaca tttggtcaca gtcgatccac acttcgcttc 25621 tgtgttgaat cgaatgttaa gtcaaaaact gggcgatcgc tactcaaatg ccccagaagt 25681 actacaagca ttacaaactc cacaacaacc aaatatttac acttctaatg tatccaaagt 25741 acaaacagta gccgttggtc gccgccctga acaagcacaa gcatctgcgc ccaatacatc 25801 taacccagcc attccaacac aggatagtcc ttcactctta gataatccct tagcagtttt 25861 tgcaattacc cttgctgtga ttgttgtaac aggatttgga tcttggacgc tcgtcagatc 25921 catccgcact cagccacaag caacagcaga aaaaacaaat ccacaaactt tcccttcacc 25981 tgtaattccc aatagaacta ccattacacc gacagctacc aacaatgaac ctgttgtcta 26041 taacaaaccc ctaaattttg gtaaatctaa 
cactgctaat gttgatagtg tcattaaagc 26101 taatgaaata gaccagtaca tatttcttgg tgaaaaagga cagcagttaa ctgtattgct 26161 gacactacga cgcagcgtgt tgctatcagt tttagatgca aatcaacaac caattgacaa 26221 tactgctaaa gaattttcat tctaccaagg aacattgcct tttactggta aatacacgat 26281 tcaagtacgt cctgtaccag gaaaagccca aagcaattat agttttagcc tcggattaga 26341 aaatccactg caacccagat ctactcccac tcctacagct actcccactc ctatagcgac 26401 tcccactcct acagcgactc ccactcctat agcgactccc actcctacag cgactcccac 26461 tcctacagct acacccactc ctacagatac acccactcct acagctacac ccactcctac 26521 agctactccc actcctacag aacaaccatc cacatctggt tctgtagagc cacaagatac 26581 tccgcctgct agtgaatcac ccaactctgt tcctagtgaa acagctttcc cgtcaaggtt 26641 gtagaatttt atgcagaatt tgcatgaaaa ttatgcgaaa tctttataga atagaattgt 26701 cagcaattta tgatagtaac tgtgagaaat tagttgttgg caactcataa ctaatgacta 26761 atgactgaca agctgttaaa aacactattg attacaggaa cagatacaga ggttggcaaa 26821 actgttttaa cgacagcgct ggcagcctat tggcaaaaat atcactcttc ccgcagtatg 26881 gggattatga aactcatgca atcgggagag ggcgaccgtg agtggtatca aaaagtgttt 26941 tccttaaatc aatctcccga agaactgaca cctttatatt ttcaaacacc cgttgctcct 27001 cccattgctg cagcaaggga aaataaaact atagatttag caaaagtgtg gcaaagtttt 27061 acagctttac gttcgcgccg tgattttctt tttttagaag ctttaggagg attaggttcg 27121 ccagtaaccg atgaactgac ggttgctgac ttggcaggag aatggcgctt accaacagtg 27181 ttagttgttc caattagatt gggtgcaata ggtcaagcag tcgctaatgt agctttagca 27241 cgacaatcac gagtgaattt aaaaggtatt gttctgaatt gtgttcaacc tcgaacaaat 27301 acagaaattg ctgattggac accaatagaa ttgatgaaat cactcactca aattccaatt 27361 tgtggctgct taccttatct agataactta actgatttat ctaaaatggc acaagtagca 27421 tcagatttag atttagaacg attgctatat atctaaaaaa gtgtgaagag tgaggcaaga 27481 aatacctttt gcaaaagttt aaagcgtaaa aataaaaata ctgaactgtc atttcaaatt 27541 aaaaaatgtg aaacatggtt tctacattcc cgaactcctc ttctgttgat ttatctcgtg 27601 ttcgactctc cattcgttca ctgcaagcgc aacttgtgga atggcggcga cagttgcatc 27661 aaaagccaga actgggtttt aaagaaaaat taacctccca gtttgtttct gaaaagttac 27721 aaaaatgggg 
aattgaacat caaactggca ttgctcagac tggtattgta gctaccatta 27781 ggggtaacaa taccagtact caaaaagtgt tggcaattcg cgcggatatg gacgctttgc 27841 ccattcaaga actcaacgaa gtgccatacc gatcgcaaca tgatggagtg atgcacgcct 27901 gtggacatga cggacataca gcaattgccc taggaacagc atactatctt caccagaatc 27961 gagagacttt tgctggtact gtcaaaatta tcttccagcc agcagaagaa ggaccaggag 28021 gcgcaaagcc gatgatagaa gctggggtat tgaaaaaccc tgatgttgac gcgattatcg 28081 gtttgcacct ttggaataat ctgcctttgg ggacactagg tgtccgcgct ggtgcattga 28141 tggcagctgt agaaacattc aaatgtacaa ttatgggcaa aggcggacac ggtgctatgc 28201 cccatcaaac tgttgattca gtggtggttg ctgcccaaat cgtcaatgcc ttacaaacaa 28261 ttgtcgctcg caacgtcaac cctattgact cagcagtagt gacagtgggt gaacttcatg 28321 ctggaaccaa aaacaatgtc attgctgata cagccaggat gagtggtacc gtaaggtatt 28381 ttaatccagc atatgaaggc ttttttaagc agcgtcttga gcaaattatt gcgggaattt 28441 gccagagtca tggtgctagt tatgactata attgttggtc gctttaccca ccagtcatca 28501 atgatgctac tatagcggac ttggtacgct cagtcgcaga agaagtggta gaaacgcctc 28561 tgggtgttgt tcctgaatgc caaacaatgg gtggcgagga tatgtcttac ttcttgcaac 28621 aggtacctgg ttgttatttc ttcctgggtt ccgcgaaccc tgagaaagga ttggcgtatc 28681 cccaccatca tccccgattt gattttgatg aaaccatctt agcgatgggt gtggagatgt 28741 tcgttcgctg cgtggaaaaa ttctgtaatt gataaaagta gatgggtatt aataaatatt 28801 aggtacgaga gttcgtggct tgtagttaat cactcactta acaagtcgca atgagtaacc 28861 tctgccagtt acttttatgt attacccatc tacttaataa atgaattccc caacgttaga 28921 tacaaaagct tataaccttt acagataaac tgttttaggc aatgcccacc ctactgatat 28981 gtttcattta tttatgtcta tctacttaga atacaaaaat agaagtgcga tacaccctcg 29041 aattaagtag ggtgggcatt gcaatgtcca acgccagatg cctacggagg gaaagccgtc 29101 attcgcactg gcttccctac acaatatgtg tatttgagtt agttgaaaag cgctatatct 29161 tttcgatcat tgatgatacg aaagtgagtg gttgcacgat accatgtcta attctctact 29221 acctttttgc gaaatatcca gagagtttgt atgacctgag tcaaagagct tctttgcggt 29281 tcttgcacaa agaaaagggg aataattgct ccgagtcgga acacaaaaga tacaacaaag 29341 actgctggta aaccccctaa tgagggattt tgtgcaatca aaccacctat agttgtgcct 
29401 aaagcgccac tagctccact aacagcagca gccattgcaa aataaataga tcgttgtttg 29461 actggtgcaa ttcctagctg catattgttg ttgcataaat taattgctgc ccaaacccct 29521 ccaattaaca tgtgtaaaag cggtaaccac agccaaagac taaagacatt aatatcaatg 29581 cccagccaaa atagaggtat gagtgcaagt aacaccccaa ccaatatcaa gattggacga 29641 ttacctattt tgtctgataa cttaccccac aaaatcagca tcaccagatt tgcccctgcc 29701 tgaacactgc cgtagagtgt caccaaactg acatctagat gcatagtatc tagcatatag 29761 aggatgaaat aaggagcact caagttacaa gcaaacatct ggaaaccctc atacaacaga 29821 aacatcaaaa aattaaagtt atttaagata ctttttatgg cactcgaatc aggtgcaaat 29881 tgagtgtcat tttttgtagt ttctgcctgt gtaacagtag gatgactaga attctcagta 29941 tctttggaaa tggcattttt cacaaaagaa ccgaggatag aagtctgctg gttctgtgga 30001 ttcacatcca ccttgaagta ctgacatcca atacttatta atcctgacaa aattgcgaac 30061 agcacaagca ctccgtaagc ttgtaaagtt ccaccaggcc attttgagac agcaatacca 30121 gcgattggta tacaaattaa attagttaga ctggtaacgc tgttacgtaa tccaaaatac 30181 ctaccccgga gctgtcgagg gactaaaact gccatccaac tcaaccacga tgcatttcct 30241 aatccttgtg acatattact aaacaagaca atcacaagtg tcagaactat caattgctga 30301 gaatttatat atccgaagct gagggcggcg ataccaataa ccagaaataa ccagactagc 30361 cgagcagtca catttgtaag cagcgaatat cgaaagcgac tggttgtgcg ttctgataga 30421 taggctccca agggctgaat gagattcacc agcattggaa tggaagacag cattccaaat 30481 atcaccggac tggcatttaa ttccaccaag aaattactga gcaaaatacc agtcacaatg 30541 ctcgtgtata ttcctgcaaa aacactatct acagtagagg ctctgaggct gatgcgaata 30601 gcgttctttt ctaaacctgg ggggcggggt gtcgaggtag gaacaacagt cggttctgaa 30661 attgcaatct cgggtatttt aggagctaga ggcgtagcgg tttctagcga aatgggttcc 30721 atacttgtct tcacagataa gtccaccggg cactgtaaac tttgttcgta attgttcgtt 30781 attctatctt gcatgtctca acaaaaattt agagaaacaa aaagtcaatc ttgagctaca 30841 gaggcaatat gtaaagtggc agaattttta tttggctgtg atgcaaacca acgtgtcgaa 30901 atttttaaga ttgttagtgg cgttcgccct ttgggcagct gtttttggga gcatcgccct 30961 cttggtgatt ggttgtcaaa acctacccct tgcgagcgtt tcctcagcac caaccgcact 31021 caccatcaaa ctcggtggtt ggacggcaag tccagcagag 
caaaaactct tgaaagaact 31081 actacaagac tttgaagcaa agcatcctaa tattaaagtc aggcatgaag tcatcaacga 31141 ccaatatatg gatgtcataa aaacccgctt agtcggtgat gctgctcctg atgtcttcta 31201 tttagaagct cttgaagctc ctttctttat gagtcagaat gttctcgaac cacttgatgc 31261 ctatataact cctgattttg acctagctga ttttgaggaa acgttactca agagcttcaa 31321 atacgctaag catatttacg gttttcccaa ggactattcg acactggctt tattctacaa 31381 caaaaaagct tttgcagccg cagggttgag cactccacca acaacttgga atgaactacg 31441 cacctactca aagaagttga ccgtagataa taatcgagat ggcagaattg atcaatacgg 31501 ttttggagaa attccagagt tagcgcgtca ggcttacaaa atcaaagctt ttggtggaca 31561 acttgtagac caaaatggtt atgcagcatt tgccagtgat gccagcttgc taggattaca 31621 attggcgata gaccagtatc aaaaagaccg ctcctcagct caaaaatctg atgttgggac 31681 aaattctggc agtgaaatgt ttggtcaggg taaagtcgca atggtgattg aggggaactg 31741 ggcaattccc tacctcacag aaacttttcc taatttagag tttgcaactg cagaactacc 31801 tacaattaat gataacaaaa agacgatggt attcactgtt gcctacgtga tgaacaagca 31861 aacccaacac aaagccgaag cctgggaact tatttcttat ctcactggta aacaaggtat 31921 gacaaagtgg acgaaaacag gctttgctct gccatctcgg aaatctgttg ctcagaaatt 31981 gggttatgat caagaccccc tacgaactgc acttgtggca ggagtaaatt atgctatacc 32041 ttggcaagtc ggtgagtatc cagcagcgat tgtgaataat tttgacaacc agtttgtcag 32101 cgctttgtta ggacagcaac cgctacaaca ggcaatgcag caggcgcagg atgaggcgaa 32161 tcaattgata aaggcgataa gccagaggct tgacgctccg cgtatcgcta taaagtaaat 32221 atgagtgagt ggtatatagc ggtttgcaat tgggtaaggt acacaaagaa ataatgaacc 32281 actcatggaa cactcatgga acacagataa atctgtattt catttcttga gtactacgcg 32341 tttgggctta cgcttgtgca aagtactcag tcctaaaaca ccaaacgcga ctgcgcccac 32401 tacaacactt ggttcaggga cttttgcttt tggatcaacc cgcaggactt gcccaactcc 32461 agggcgatcg cctttactag aaacatatac agcaccatca ggaccaactg tcaaggcagt 32521 ggcagattct agtccattgc cactcaagag agttgtacga gtcccatcag gagagatttg 32581 gataagagat gcatctccaa cgcccaacca ctgaggtgca ttggaatatt gtatggtgta 32641 caaattgcct tgggcatcga attctaagtc agagagttgg gtaaagccat cagcgtaaac 32701 tgtaacttca ccattagatc 
caactcgaaa gatacgcgct ttacccacag ggaaaggaaa 32761 acccgtatat tcgctgacat acaaagcacc atcaggacca aaagcggcac ctgtaggtac 32821 tgattgaatt tcaacttcct gtggagtttg acccggtggc ggcgcgtcag gaggtgactc 32881 ttgtcccggt gcaggagttg ggaagatagg attggtgatt gtttgcttgg gaagcacagt 32941 aaaagtcttc aaattactgc catcaagtcc cacagttaaa atgtcattgg cagctgcatc 33001 cactacataa gcagtattac ccttaattgt aaaggcataa gcgttactag ctatttcacc 33061 agtaaagtcc agcgtctcca ccccatcggg attattagca agttcatatt ttgcaaaatc 33121 ggcaatactt gtcaaagaag ctgtttgaaa gtcaactcga taaagctgtc cccaactggg 33181 agaattttct ggaaaatcac ggatggttgg attaccacca tacccgatga gaagataagg 33241 atttcctgct gcatcaaact ggatgtcctg aggaccttca cctgtactac caatcggtct 33301 taatgctgta gatggtagcc ctgtaagtac acgctcttgc ttaccgtcct taactcttgt 33361 taccgcgcca gttgtaccag cacatgaagg ggtaccatcc aaactaggtc ctggaatgca 33421 tcgcccattt cctcctacac cggactctgt gatgtagaga ctaccatcag gaccaaagtt 33481 gagacctctg acattatcaa gaccctcagc aacgacacta aacgatgcag ctgttgctgg 33541 ttttgttcca aaaacagcag caaaacaaaa cgtcaaagct gcaaaactta tagctgattt 33601 agaaactaaa gataaacgca tacggcaatc aggcgtcaga attataggtc gctttttaac 33661 tgactgaaaa aaacatctat cgtaagggga tggcaatacg taccaacctc atacaaaggc 33721 ttattgtttg aatctgaaaa tggtatttaa aaactctgta tttttagtaa cagcctatag 33781 aatgctgaat gccgattgca atgaataatc ttttgatagg cttgacattt ctttcagttt 33841 tgttttagtc aagcctggtt gaactacaac tgaattgata cacaagctgt taaccgcagc 33901 ttaccaactt ctgtctacgt ttgcgcgtca gatacgaacc tactcctaaa gcaccaaacg 33961 ctagtacgcc caatgcagaa tttgattcag gaacttttgc ttttggatca atcctcagaa 34021 cttgtcccac tccggggcga tcgccaaaac tcgaaacata aactgcacca tcaggaccta 34081 tagtgagggc agtagctgat tccagtccat taccactaag gagagttcta cgagtcccat 34141 caggggctat ttggatcagt gaaccatcca aattcccttg aataaatggc tgattggcaa 34201 attgtaaggc atacagccaa ttccctgacc gccagtgggt gatattgccc aagacggcaa 34261 atttgtaata acacgttctt gcttaccgtc cctaattctc gttacagcac cactcgaacc 34321 agtacaagct tgctggcctg ggaaagtaaa tgatgcacta catagctggt ctccccccac 34381 
acctgcttct gtgacgtaga gactaccgtc aggaccaaag ctaagacctc tgacgttatc 34441 aagaccgtca gcaacgacac taaccgtcgt agcagcagct ggttttgttt caacaacaac 34501 agcaaaacaa aaagtaagag ctgcaaaact caaagctgat ttctttagaa aagacataaa 34561 taattagaaa tcaagaatac gttgattctt gattcgataa attgtgcatt tgtgcaaaag 34621 gctataaaat atcactaatc aattacaatg aaaaatcttt tgattaggtt gacatttatt 34681 ttgctattat ttgtatttag attgatttca aaacatgaaa tattaagcat ttttatgcat 34741 attaatttag ctgctttttg tcaaaatatg aaaattagtc aacaataaaa attagatata 34801 aaacctaatt atctattttt gacaagctag aaaaattcaa aaaaacttaa taataaaaac 34861 aaataaataa ttgttcaaaa tgagacaatt tcgcatatgg tgaaagttca aactgctcat 34921 gaaacttctc agaatcgcat tcctctgttg gagttacgat aaataatcat gcattaccta 34981 tcgctacagt agggatggtg gcggatgagt tggaggtaca tttgtgtttg aagtcagaag 35041 ccgacaaaga aacaccaggt cgaatataac agaagacttg gctggatatc tgtttatgat 35101 cccaaccatt ctcgtgttag gcacatttgt cgtgctaccc attctctggg ctgtttttct 35161 ttccctgcac aaagttcaac ttcttggcaa tattgagtat caatttgtag gctttcgcaa 35221 cttcacgcga ctcattgaag atgaacgcgt ttggattgct ttaagaaata cagcccaata 35281 cgttgcaatt gttgtcccaa gccaaactgt cttggcttta attctagcag tgacgctgaa 35341 ttctggtatt cgcgccaaaa attggtggcg tatcctctac tttttgccga cagtcacctc 35401 atcagccgtg ttgacgctga tttttatgtg gatttacaac acgaatgggt tactcaatga 35461 ttttctcgct tttgtagggc tacctacata caactggtta ggagatccag cagttgcgct 35521 caaaggtatt atgatcatga atatttggtc aacagcaccg ttttacatgg tgatttatct 35581 cgcagcgttg caggacattc ctcgttcact ttatgaggct gcatcgctag atggagcaaa 35641 tggctggcag cagtttattt atatcacgat tcctatactc aaaccagtga ccttttttgt 35701 agtcacaata ggagtgattg gcacgtttca gttgtttgac caatcctaca ttttttctaa 35761 cggcactggt ggaccgaata atgctacttt gactgtagtt ctattgatat atcaggctgt 35821 ctttcgtaac ttgcagatgg ggtatgcagc ggcgatcgcc tttttactcg cagtagtcat 35881 tattgttgtt actttcattc aacggcggtt tttgggaggc gaaaaagttt aagataaact 35941 tcctgaccga aagccattta tttcaaagaa tttacacttc atcatggagg caaaaatgct 36001 attaaaactg aagcaattag aagtcccacc aggacagcga gtgctgctaa 
aaaatgttag 36061 ctggcaagaa tttgaaacga ttttagaaga attaggtgaa catcgtgccg ccaggatagc 36121 ttatgaaaat ggaatgctcg aaattatgac accactacct gaacatgaag tgactaaagt 36181 ctttcttagt aactttgtag aaattatcct tgaggaacta gacattgaat ttctgccttt 36241 aggttcaaca acttttaaaa ataaattaat ggacaaaggc attgaacctg acaactgttt 36301 ttatatccaa aatgaaccag tcgttcgagg caaagacaga ttagacttaa cagtagatcc 36361 tcctccagat ttagcgttag aaattgatgt gacttcccgc actcattcta gtatctatga 36421 agcattagca gtacccgagt tgtggcgctt tgaaaaaggg aaattacaaa tcaatgttct 36481 gcaaaacggt aaatatattg agtctacgtc tagcccaatc ttccctaatt ttcctctcca 36541 gcaagtcatt cctgagtatc ttcaacagtg caaaactgtg ggtagaaaca agactatgag 36601 agcttttcga gcttgggtgc aagaactcat tagctaaaat atttagactc tgtgaatgta 36661 gtttagaata ctggttcttt ttttgaagtt acagaagctg attggattct caagttgaag 36721 agactattga agctgggtga tggtgaggag aaaccaaacc cttgacaaac atattaccaa 36781 aatgaaaata cgtagatttt tgacttggaa gacactccta tacattttac taacactgta 36841 tgcaattatc accctcattc cctttttatg ggcattatca gcatcattta agccgctatc 36901 tgaaattgtc agcagtgaac ctaattttgt acctaaaaac tttactctcg ataactacaa 36961 acaaattttt ttccaagaac cgctgtttct ccggtggctt ttcaatagtg tggttatagc 37021 tatcagtgtg actgtgttaa acttgttgtt caactcaatg gcaggttacg ctttagcaag 37081 actggacttt aggggtaaag gcttctggtt cttcctgatt ttggcagtct tagcagtccc 37141 tgcacagata actctcattc ccacattttt aattttaaaa gccttaggtt ggctgaattc 37201 ctatcaagga atgattgttc ctagcatggt caatgctacg ttcattttta tgatgcggca 37261 gtttttcgtg aattttccta gggaattaga agaagcggct caattggatg gattgaatac 37321 ttggggtatt tttcgacata ttgttttacc tttagcaaaa ccagcattag cagcacaagc 37381 agtttttgtg tttatgggga gttggaataa tttcttgcta cctatagtta ttttatttga 37441 tccagaaatg tttactcttc ctttgggatt gaatagtttt aagggtcaat tcattagtta 37501 ttggaactat attatggcgg cttctatggt ttttacttta ccggctttag cgatttacgc 37561 attttttaac aggtatttta ttgaaggtgt gacgtttacg gggggaaagg gttaatatca 37621 gccatttaaa atttcagatg gacaatttac gaacaattca ataaacctag gggaaatgtg 37681 atggcgcaga ttgtgattgt gggtgcaggt 
ccgactggtg ctacgcttgc tttacagctt 37741 gtaaaacgcg gtattgcagt aacattaatt gaagcagcga aggattttca tcgagtgttt 37801 cgcggtgaag ggttaatgcc gagtgggttg gatgcacttg agcaaatggg attatcgacg 37861 atactggaag atattcctca tcgacaactc gacgcatggg agttcattct aggcgaaaag 37921 cagttgtttc gagttgagga gccaatgggg gcggatcgac cttgtacact ggtgtcgcaa 37981 ccgccgttgc ttgaagctat gattacggaa gccaaagctt atgatggatt tgaatttata 38041 cagggtgttt ctgtaaagga tttactgtgg atcaataatc gcgttgcagg tgtgaaactt 38101 ggcaatggac gtgaaatttc tgctgaactt gtgattggga cagatggtcg aaactcggtt 38161 atacgacaac gggctgggtt gcaattggtg cgtcagccaa aggatgtgga tattctctgg 38221 ttcaaacttg ctgccagttc caggtttaca gcagataatg ttttttattt tatccttaat 38281 ggcgagcgcg tattcaccat ctttcatgga gcagaggaag gaaaactgca tctggcttgg 38341 gtgatatccg cagacgagag aaccgatagg aaacaaccag actgggcaga aatttttgca 38401 tcgctatcac cctcatggat ggcagagcat ttccgtagct atgcagatac cattgaaacc 38461 cctattaagc tatcagttgt ggttggtcgt tgtccatcct ggtacgcgcc aggagtgttg 38521 cttctgggtg atgctgcaca cccaatgtct cccatccgcg ctcaaggaat caatgtcgca 38581 ttgcgtgatg tcattgtcgc tgctaatcac cttgtaccgg tatttcacgc acgcgccgga 38641 caccaggaga ttgacgcagc actatcgcgc attcaagccg aacgtgagcc agaaattatc 38701 cgcgcccaac aactccagat aaaggaagca gcacaaggcg aactactacg aaaaaatgca 38761 ctgatgcgtt ggctattgat tcaacttgct cctttacttc gtagaacaat tcgccactct 38821 tggctgaagc gacagtacaa aatgcgtcag ggtataacgc aagtgcactt gaatgtctga 38881 aacttgatga attggttaaa actttagtgc gattgtcttt cactcctgtt catctaaatg 38941 ttccgcaaca gcttgataaa gatgcaaaac atgactgtca tgtaggcggt agaagacatt 39001 acgaccttgc ttgcggtaac taaccaaacg catcgcccgt aagtttctca gttgatggga 39061 aacagcagat tcactcatat ttaatactgc agccaaatca ctcacacaaa gttcttgttg 39121 agctaacagt gagagaatac gcaaacgatt agcatcgcct aaaaagccaa aaaattctgc 39181 catgcgctgg gctttctcag tgttgagaac ctgatgtgac agaggttgga cgttatcact 39241 ctctactgga ctggatattt gattgtactc actagtttct ttttgggaga gagaatgagt 39301 gacattgtaa gcaaaagtgt ctctggttga cataatatta atgtagaaaa tttctagtnn 39361 nnnnnnnnat 
ttctagttat ttcttaagaa aaaagaaatc tgacattctc aaaatacttt 39421 cattgatttg tataaacaac atatgctttt ctggaacaat tgccaaatag ctgagtttgc 39481 tgtcaccaat gttaatgatc tggtgagctt gctacagttt cccttcatgc aacgtgcgat 39541 cgcaggtgct gttttgatgg gaatacttgg cggcttacta ggcagttttg tcaccttgcg 39601 tcagttgtct tttttcagtc atgctgttgg tcatgctgct ttggtaggtg tagcactcgg 39661 tgtcatacta cagttaaatc ccacctggat gctgctacca ttcacactta tttttgggtt 39721 ggttgtcctc tactttatgg ataaaaccga tttagcgagt gacagcgttt tgagtattgt 39781 tctgtcaggc gcgttggcag taggcgtgat tctcactagc ttaattcaag gatatcgcgg 39841 taacctcatg agcgtgctat ttggcgatat tctcgctatt gactcgacag atttgatttt 39901 gagtttgctt gtgcttgttg gagccagcgt attcttatta tcaaccctta ggcagcaaat 39961 tttgttaacg cttaaccctg ctgtagcaaa ggtgcaaggt attccagttc agtggtatcg 40021 ctatgggttt gttgtattgc tgtcattagc tgtggctgtg gcaattaaag ctgttggcgt 40081 tttactagtc aacgcttttt tggttattcc cgcctcttgt gcaaagctga tgagtcacca 40141 cttcaatcgt ttcctgttgc tgtctgtgat tgtgggttgc ataagcagta ttgctggcat 40201 cattgtatca ggtcttttca actttgcttc cggtcctagt atcgttcttg ttcagtttgt 40261 ggtattcttg actgtctttg gctgcgtcaa gttgagaaca aaagccgcgt aaattttttt 40321 tctgctagga cttgcacaat tactgtgact ttgctatatt aagtaatcgt gggaatcaaa 40381 aattccacgc cgggatagct cagttggtag agcagaggac tgaaaatcct cgtgtcacca 40441 gttcaagtct ggttcctggc atatcttgat gagatccgaa ttgacgcgag cctccaatcg 40501 agccattggt tgtgagatat tagcctcact cataaaataa ttcaccaaat ctatattttc 40561 attatgtacg gcaaaatgcc gtatataaac tcgttctaaa ttccaaaagc atcgcgaacc 40621 gaagagtggt tgcaaagtca gtaattattg gggatatggg gtaagatgtt tgcgtcaagg 40681 gtgaaaatac tgaaaaaaag tattgaatga ggacgacgca ttaatcgaca caaaacgcgc 40741 aaagagctcc cccaacggga gagccagtgc aagagtgagg acatggttgg ctcattgatt 40801 gaggacgcac tcatctatac aaaacgcgtc acagccaaga agtactgtag caatttagct 40861 tgcagcggct ggaactgtaa cgtgtcgttg cagcacacca gaatttttga gttcttgcag 40921 ggtgacattg tcagtgcaaa ataaaactgt ttccagttca gcaatcagaa cttcaaccaa 40981 ctcatcaacg gctgtttctg agtttacagc cgcttctaga aaaggtctgg ctaaacctgc 
41041 gaggtcggct ccgagggcga tcgccttagc cacatccaat ccattacgca gtcctccaga 41101 agcaatcaag ggaattgttg gcgcgactgc acggatttcg gttatgcact cagctgttgg 41161 aagtccccag tcggcaaaag tttgtcccag acggcgctgt ttgttatctt ttgcccgttc 41221 gctttccact tttgcccatg aggttccacc cgcaccagct acgtcaattg ctgtcactcc 41281 cacatcaatt aacttttgtg ccattgctgc tgagattcca tttcccactt ccttgacaac 41341 cacaggaacg ggaagttttt cgcaaagttc ggcaatttta ggaagcaacc cccggaaatt 41401 tgtgtctcca tgcgattgca cgcactcttg caaggggtta agatgcagaa tcagcgcatc 41461 cgctgccaac atatccacaa gttgcaagca ctcgacaaga ccacatccat aatttaactg 41521 gacagctccc aagtttgcaa acagcagaat gtcaggagca aggtggcgta ctgcaaaggt 41581 cgaggctagg tggggcattt ctagagcaat tcgttgcgaa ccaattccca ttgctaatcg 41641 gtaatgttga gcaacagttg caagccgagt gttgactagt cgtgctagtt cagttccccc 41701 tgtcattgaa gaaatcagaa gtggagcacc gagattctta cccaagaagg ttgtttgaat 41761 attgatatca ctgcggtcta attccggaag acagcaatga gtaaagcgat agtattcaaa 41821 cccactcgtt acgtcccgga actgtacatc ttcctctaag cagacacgaa ggtgatcggc 41881 tttgcggttc tcaatttctg ttgcagcatc gactgggggt atagtattgg tcataatttt 41941 tctctcaaca gttgattcac cttttccctg caataaagta gtgacaaaat cattccactt 42001 ttgttgggac gtcttagcaa gaatcctact catgacagca aggaaagatt gctgcgccta 42061 agtttaacaa aaagctcaca cagttccttt tgttttactc agtgtatgtg cgaccagaga 42121 agaaacagca cacggaaatt ttaatgactg ccaaaatggt ataatgaaag tgcctataca 42181 agacgtaaaa accaaatcca ggatctgaag aaagtagaca cccttctcat gaatggacat 42241 ggttaaggaa atataactcc gcgcttgtcg taagataggg ctatgcagaa aggatgtctg 42301 acaagtgcta tggacgcaat ttatattccg cagcttacaa aagcgccgca agcaacagag 42361 gaaattcaag tcaaagagtt tctacctggt ttagaaactc tgacaccagt tcgaggtcgc 42421 attcgcgtcc agcatcatgg taattactta gaagtatttt ctcaagcaga aacaatcatc 42481 acttgtacct gcagtcgatg cttgcaaaac tacaatcatc gtttaaccat tgatacaaag 42541 gaaatcatct ggttagatga agcagcgaac gaagatgatc tgccactaga acgcgaaatc 42601 gcttttgatg atttagtaga aactttatca ccccaaggat attttgaccc cgctgcatgg 42661 ctgtatgagc aaatatgctt ggaattgcct caacgacagt 
tgtgtgatgc caattgtttg 42721 ggtattcaac caagcgcccc gagcaaatct gataagcgtg ttgacaggcg ttgggcttct 42781 ttggaaggtt tgaaaaagca atttccggga gcgtagtttg ataaatttgt agtaggcgct 42841 ttagcctgaa agcgtctact gcgaattagg aaaatttcat cgctcatttt cccaaactat 42901 taagtttcat gtaagtgtaa taattttttt cactaatttc ttttcaagtt tcgcactacg 42961 aactagtcta aactccagaa tccagttatt gcaggaattg ttcaatggaa ccactttgag 43021 aattttcgct cagaaaaatt taggatgtaa tatgatagat ggcagcacag tcacgcttgc 43081 cagacgatgc tctgctgcct caaccccgat tgcccgcaac ctcaaaatcc tgaagggata 43141 atccattgcc agagttgtgg cgcgctactg atacctcttt taagaggtca ctatcgtatt 43201 atcaaggtgc tttcagatga aggtgggttt ggtagaacct acttggcaga ggacgtagat 43261 aagctcaatg aacgctgtgt cgtcaagcaa ctggcaccga aagtccaggg gacgtgggct 43321 ttgaagaagg cgattgagtc attccaacaa gaggcgcagc gattgcaaga actaggaaaa 43381 aattcgcaga ttccgacctt gctggcttac tttgaagaag ataactatct gtatttagta 43441 caagagttta ttgatgggca aactttgctc aaagagttgc gacaacggca aacatatcac 43501 gaagcagaaa ttcggcaagt tttgcttgat ttgctgcccg ttctcaagtt tattcacgag 43561 cgcggggtaa ttcaccggga tattaaacca caaaatatta tgcgccgtca aactgatggc 43621 aagttagtac tgattgattt tggtgcctcg aaacaactaa gcgcaacagt acggactaaa 43681 ccagggacga gtattggtac acgtggttat agtcctctcg aacaattaca ggatggtgaa 43741 gctcatccag caagcgattt gtttagcttg ggagctacgt gctttcactt aatgtctggt 43801 gtttcaccag gtgctctttg ggcagaaaat ggctatggct gggttgcttc ttggcagcaa 43861 tatttgaaaa gttcgatcag tgcagattta gcgaaaattg tcgataagct gctgaaaaag 43921 gacattcatg agcgttatca gttagcggat gaagttctca aagatttgga acctcagcca 43981 ccatctccgc agccacctag actgaaaagc agacttttag caggtgctgg cattgtgttc 44041 ttgggattag gaggattttt gtacataaac aatttccaga tcctaaacct gatagcggaa 44101 aacaatttct tgatgaaaac cctcaatggg cattccaatt tggtgacttc tgtcgctgtg 44161 agttcgatcc cccctgactc cccgctttat aaggggggac tggggggaat tgttgctagt 44221 ggtagttttg acacaacact caagctgtgg aatttgtcaa ctagaaagga aattttcaca 44281 cttgagggga atgctggttc agtttattcc gtcgccatca gtccggatgg tcgtactgta 44341 gccagtgcca atggtgacaa 
aacaattaaa ctatggaatg tattcactgg acgagaaatt 44401 tataccctct atggacattc gagttcggtt gaatctgttg ctattagtcc agatagcaaa 44461 acgcttgcaa gtggcagttt tgatggcagc atcaaactat gggatctgcc atcgggaagg 44521 gaaattgcta ccctaaagga acattctggt gcagtgaaat ctgttgcctt tggtccggat 44581 ggacaaattc ttgctagtgg cagtgaagac aatactatca aactatggaa tttaaaaaat 44641 aaacaggtca tcaaaacttt caagggacat tcccaaccga ttagatctgt ggctattagt 44701 cctatccact ctgactcccg gagtttggga ggacgactcg gagggattct tgccagtagc 44761 agtgctgacg acactatcaa actgtgggat ttagcaaccg gacaggaaat ctacaccttt 44821 aagggacatt cttactcagt taattctgtt gcttttacct cagatggcaa aacccttgca 44881 actggtagta gtgatcacac catcaaattg tgggatgtgg caacgaaaac ggaaattcgt 44941 accctcaggg gacattctaa agaagttact tccgttgcct ttagtcccga tggtaacacc 45001 cttgttagt
//
LOCUS NODE_511_length_44863_cov_5.184989 44863 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 44863) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 44863) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..44863 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(104..1774) /locus_tag="DP116_03235" CDS complement(104..1774) /locus_tag="DP116_03235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317909.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3352 domain-containing protein" /protein_id="PRJNA477356:DP116_03235" /translation="MPESRSKFLIPAVSTALVVAGGIAAYMYFKGGPSGDISSVLGAA KVVPDEAILATYISTDPKVWAKLEQFGTPQAQQIVAKGLENLNKKLSTDSNISYEQDL KPWVGGVMVAMLPSSPAKPVAQNTPQATQESKILIVVGIKDKLGALNFANKLKAQKDV KVEESDYKGEKIIETTGKNEHSYTAVLNTTYIVIAPEKQSVEHAIDTYKGEPSFASKE GAQNMLTSGVNVDNTLAQIYVPDYRNMVQQLITANPNATPIPPETMKQLKQMKSMVAG VGIDNVGVRMKAIAKLDPELVKYQYPNTSADVVSQFPADTIAILSGKGISNWWSALVE QSKDTPELKLTLEQARAQLKLVNIDLDKEIFGWMDGEFGFAAIPSNQGVLAQIGFGGA LVFHTSDRKMASATLNKLDDFVKAQSVNVAKRNIAGKEVTEWQIPQQGALLAHGWLDN NSVFVTIGGPIAETLADRKGESLQGSNNFKAVTSTLQKPNGGYFYLDMDKTVPLLNRL ATAQQQPMTPETSAIISSIRGLGLTATSPDKTTTQVEMLLALKPKNGN" gene 2216..4840 /locus_tag="DP116_03240" CDS 2216..4840 /locus_tag="DP116_03240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744235.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aminopeptidase" /protein_id="PRJNA477356:DP116_03240" /translation="MSQYFFDTEKKRYKSFELPGAKPHYNPDHPGQVEHIFLDLSLDI VNKKYQGRCSITLKPIRNGIDRLNLDAVNLGIESVQVDGKAQNFDYDGQQLSIQLEPP TQVDKRLVIAIDYSADKPQRGLYFIQSDKHYPHKPSQVWTQGEDEDSRFWFPCFDYPG QLSTSEIRIRVPKHLIAISNGELIAAEEQGEDKIYHWSQQQVHPTYLMTLAVGDFAEI RDQWNGIPVTYYVEKGREEDAKRSMGKTPQMIEFLSEKYGYPYPYPKYAQVCVDDFIF GGMENTSATLLTDKCLLDEKAALDNRNTESLVVHELAHQWFGDLLVIKHWSHAWIKEG MASYSEVMWTEHEYGTTEAIYYRLLEARSYLAEDSTRYRRPMVTHVYREAIELYDRHI YEKGSCVYHMIRTQLGEELFWKAIQTFVRDNAHRTVETVDLLRAIEKATGRNLLFLFD QYVYRGGHPDFKVAYSWDSDSKLAKITVTQTQASVGNNSDLFDLKIPIGFGYVQQGDF ETSSSTPSQTSYCDLKTFTVRVNEQEQSFYFPLDQKPQFISFDVGNHFLKTVSLEYPV PELKAQLEFDPNPISRILAAEALAKGGGNEALKALSAALKNDPFWGVRVEVAKQLAEI KLDQVFDELVVGLKDKSPYVRRAVVEALGKIKTHESYKVLKELLEVGDPSYYVEATAT RAVGAIAAATTEEKPKEEKVIKLLKSVLEERAGWNEVVRNGAIAGLAELKTSEAALNL LMEYTHLGVPQPLRLAAIRALGKISVGQNSANVQRILERLTEISKETFFLTEIAVVTA LGQMEIAKAIGILQAKAYQTPDGRVRRYAEEEISKVQTNIGSENALRQLRSEVDQLRQ QNQELRSRLENLEARSKS" gene 5524..6741 /locus_tag="DP116_03245" CDS 
5524..6741 /locus_tag="DP116_03245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317907.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_03245" /translation="MRKPVITIFYQFNPWNATIGGIQTLINTFIKYAPSEFDVKLVGT NTDPKKPVGKWQQAEFAGREIAFLPLFTLEDDNVRSLIPTTLKYTAALLGHRFASDFM HFHRIEPTIAALNWQGEKTLFIHNDIQTQMQARGDKKAILWRRFPAAYFALEGLLVRQ FNQILSCNTDAAQLYKQRYPNLQDRIAYIKNSFDNGIFYPLREDEREIKRRELASRLG VDEQTRFILFAGRLHPQKDPILLVRAFATLNEPHIHLLIAGDGELGVEIRAEIARLGL VDKITMLGAVTQSQLAQLHRVCNVFVLSSAYEGLPLVVLEALGSGTPVVTTQCGETPK LLTADSGVVCSERTPACIADALRKVLLHPGDYPTESCVRTAKPYAASTVVQQVYSEML NRWEQRNSIALQV" gene 6967..8157 /locus_tag="DP116_03250" CDS 6967..8157 /locus_tag="DP116_03250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008234364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycoside hydrolase" /protein_id="PRJNA477356:DP116_03250" /translation="MKIAVIGAKGLPPKQGGIEHYCAEVYPRMVKQGHSVDLFARSSY TDSSWQEPYDYQGVQVISLPGFGLRGVDALVTSALGAIAASTTKYDIIHFHALGPSLF TCLPKLINSAKIVVSCQGLDWQRAKWGSFSTRVIQMGEKAAVRFADGLIVVSNVLQTY FSQTYGRNTVYIPNAPARYGESDPNFGYGTQLGLEQKRYIVFLGRLVPEKCPDLLVDA FTALNPPGWKLVLAGGVSDTKSFTSQLLQKVANHPNIVFAGELRGQRLWEIVRGAGLF VLPSNLEGLPLAMLEAMEEEIPVVASDIPPHKQLISGGRGKLFEAGNLTSLIRTLDWA IHHPQELRAMAVHAKKHVQLNYSWDHITSETLKLYTTLQTSCEPVHIYKQNQTGLAEV LGKK" gene 8408..10591 /locus_tag="DP116_03255" CDS 8408..10591 /locus_tag="DP116_03255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873168.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chain-length determining protein" /protein_id="PRJNA477356:DP116_03255" /translation="MEKGISSVLAVLTRRSLPAIAAFVAVMGGAIAYLAVTPNKYEAL ARLMLDDKRVSVSELGRDLTQVSANAPGISPLADQAELIKSQRVLERAIAIAFPKTYG NLSLSPVSTAELSHNLKVKIVPATNILELSYQSRDPQLAAKVLNAVSQAMVEDNVKSI GLEATKVKQFLERKQVPEAEKKLLQAEDLENKYRKSSGIVSLEEQTKSLVQSIATLED QERTLSAQLQEIKARDASLQQVTKNTNLNNAYSSVRSGQDDEIKKLRAKLSELENKII ETRLRLTDEHPTVRNLVGERDALGKVLSEQLARVSSKDQSVSTKNFAGDQLSQELNSK FILNRIEESAVNDRLKVLQAKKAELQKRLAQLPITQQTLTVLTRKREEAAVSLKFLQG KLEEARLTEAQKVSNIRAIENAVAPSSPSEPKQKVVLALASVFGTMLAVGVVLLLEVM DNTLRDAAEAEELLQLPLLGILPRLSAKTLVLEPANQFLDNMELIEPYRTLFKTLEFR SKEQLRAIVVSSTISGEGKSVVASHLAAVVGMLSWRTLIIDADLRRPSQHTLFNLAPG PGITDVLEGNVSLLDAVQPTDIENLDVLTCGNQHARPSQLLESIAMKSLMAEAAENYD LVIIDTPPLTACADALTLAQEGNGLMLVARPGFTDKEVLSRCVWDLTQNRISILGVVV NGMTHLTQNYRYPTYRYRPRLPKSQKQLIGAGDSSRNSANGMRQR" gene 10594..12060 /locus_tag="DP116_03260" CDS 10594..12060 /locus_tag="DP116_03260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456380.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polymerase" /protein_id="PRJNA477356:DP116_03260" /translation="MLTKQHFSPSSRLGLLIGLAGVVVGLVTGFLIGSTKPLYLGLAL GAIPFLFYFFTKFEQVVLGLLVLRTSLDPFSGQQIPAMFALGLDVLTLLYVTVMLLRR QTVQTDRFWWFFAGWVILQGLWVVLLCLGGLGLAAGYLSDSIREWIRLFSWLMVYLLV MQLKDRVPPEKMISVLFLALVAPLVVGLMQMFIPSVLPAFLSAQNYDAGSISSEGFRI KGTIGHPNGFVTLLLLFIGLTWWKLRQSRQSFVWLLLLGLLAFFYVSTKALFGLMMIA TFVVVLVAPRLSPVNLIGGVLFVVLVLGLFASTEFGRERLSSLANTPLLNPDIDVSRA ILLSQSDNNSFNWRISQWYTLLNAWRQHPFLGYGLGLSVNVATNKLLPHNDYVRALVE GGVLGFVTFLVFLVGQGVRLIQLMRSAPPRSAQRELCSIMFAISLAIPVGMITENIWS HTTLFFYWFTLMAVAGWNWNEQTVDSSTALIRSPKHFY" gene 12095..13156 /locus_tag="DP116_03265" CDS 12095..13156 /locus_tag="DP116_03265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868794.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_03265" /translation="MSAKRLNLIQVFRGLAAVLVVFAHTDLIYNQNVNQDFLFKMFLF GGSGVDFFFVLSGFIMFYIHYKDIGHPDKLGTFFSKRFTRIYPLYWLILTSKILASFL FSYEPNTNARGIGEFIKAFLLFPQDRTILSSSFLGVSWTLSFEMFFYLMFGVLICLKP KFSFPIIVGWLSGVFLHFLGVIQFPQDNLLIQFLFSDYNLEFVLGSLAAYVVLNKKVS NGIPLLYGGLFLYTLSVINSWYTIIQLSSVVLFGIPCTLIVIGSASLELRKNINVPVF LVFLGNASYSVYLVHGFFMNQMTKILSKLPFPLFENLVVSNIVGFIISIIAIMCGCVI YSYIEKPLLTYFKPKAVTT" gene 13385..14575 /locus_tag="DP116_03270" CDS 13385..14575 /locus_tag="DP116_03270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868795.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_03270" /translation="MTTKTISTNNTTQLSARGERPYRIVLVHPSTGVNWSGGSEIVAI ELTRRLSSYFEVELLSGAACGSFSHPIPCIPRSYAYDAVRHPLIAPLVGKFSTPEIVV EHLTSFFPCLFHLLRHPADLIFPHNDYGGLAMAACVRALTGTPILFTEHNSSNADGKC LQRNLRFRPDHLVVLDEATAAFAHNLKPTQAVSVIPNGVNLEQFTPEGTAINLGLPRR IALCVASLSRKNHKRVELAIQAVARLPHVSLCICGDGPDRAYFQALGDELLGPQRFAI RTFPHDQMPEVYRSVDVFTLPSINEPFGLVYLEAMASGLPVVTTDDQMRRYLVGDSGI LCDVTNLDSYTTAIKDALCGDWSERARQNAARFSWDAIALRYRDVILETILQSKKKVS LPTH" gene 14591..15574 /locus_tag="DP116_03275" CDS 14591..15574 /locus_tag="DP116_03275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863019.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_03275" /translation="MPKVSVIIPAYNAMAYLRETVESVLKQTFTDFEVLIVDDGSSDG TVEWVSQIKDLRVRLISQQNQGSSGARNTGISAAGGEYIALLDADDIWEPTKLEKQVR YLEKNPSVGLVDTWTVLIDQQGKSTGRVVVSYAEGDDVWKQLVQFKTVCACDSTPLIR RSCFETVGLFNRELPFLEDLDMWIRLASRYRFAVIKEPLVRYRQHPGSKSTNCQGTLE AFRTIVEKAFESVHADLLPLRERGYGRIYLYLAWRAINNKDYEQALHFNHQAVAHYPQ LIFYWEFIRQTIALTLLKTLGHQTYDKMKTLLQSLRRQTSTNNGQWRIGNS" gene 15704..16663 /locus_tag="DP116_03280" CDS 15704..16663 /locus_tag="DP116_03280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997221.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_03280" /translation="MSKVTVIIPAYNAMKYLPQTVESILNQTLTDFEVLIINDGSSDG IVEWASQITDSRIRLISQVNQGTAAARNKGIFESKGEYIAFLDADDIWEPTKLEKQAQ CLDNNHLVGLVDCWTAFIDENSKPTGLVMRNDTEGDVYKKVVESCDSPVCCGSSPMVR RSCFDTLGLFDRESYIEDVDMWIRIATRYHYGVIKEPLVRYRQHPNNKSKDCESMLRG FRQLIEKTYRSLPTDILHLRPRSYGRLYVFLAWRAIDTKDYKQAFHYSQQAFAMYPQL ILKPWFSRLNIAIAVMRFLGPDGYEKVRSFNRSLRRGLFARAT" gene 16730..18196 /locus_tag="DP116_03285" CDS 16730..18196 /locus_tag="DP116_03285" /inference="COORDINATES: protein motif:HMM:PF13440.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="polysaccharide biosynthesis protein" /protein_id="PRJNA477356:DP116_03285" /translation="MNSSFRDSRNFEFFAKEKSNLFIFGVVVGLRSKVIKGGALMVIR QALGILLSLIGVLFITKVIGPREYGLYGMSYGIVSFLGGLGIWGLDVYLLRKTSNPDK QDYDQVFTLLLCISSVFTLTLVLGQHIIAEMLKVPESAPLLATLGLTLPISLLNLPLT IKLDRDLNFQRVAYIELISQVSYYVAALPLANRGAGAWAPVGGLWLQQITMVLLTVFS TDLRPRLCWKPSLIREMLVYGLSYSSSTWIWQLRSLVNPVIVGRFAGVEAVGFIALAI RLAEMLAFAKSVTWRLAMAALAKLENDPFRLRKSIEEGMRLQALAVGLPMAAFAIVAP VVLPVVFGKDWTPLLQVFPLISIGYISNSIFNLHSSVLYLLGKNLSVTWFHTAHIALF AGSAFLLVPYLGMVGYGWAEIAAIASYIVIHIYIAKEIGSPNYTVAFGWLIISIAVLI LSTVNETVRYLSFVLLLLPLISTKERNSLIGYFQILRS" gene 18232..18510 /locus_tag="DP116_03290" CDS 18232..18510 /locus_tag="DP116_03290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130604.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03290" /translation="MNMGIIEPFKDGFLEIIPEGEGSDYWHIAAIHINGEVFCPSPRI YPSINVAIAKARRIFDWIYNHEIETQGLGCYCEELKITLWRQPKLHPS" gene complement(18977..20557) /locus_tag="DP116_03295" CDS complement(18977..20557) /locus_tag="DP116_03295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_03295" /translation="MYIVGGIKLPKSADVSDLYIQCNEAASINYQEDDKKVVLRQGGI ISSNSYFNSFYEKFYTKYTTLSSIYYLLKLEGDFKVSIYREVNGENNKEIISQENFEK CQFSEPVKVLPINLLQNENAGRIYFEITCSSEQGAFKEAWIATDENKTRDVSLGIIIC TFKKEDYIKNTLAAIFQDKLLETKDLKLFVVDNGRTLDKADFTKPKLKLVPNINAGGS GGFTRGLVEALEENTYSHFLLMDDDIELESECIYRLFSLHEYAKTDFAVAGGLLNLQK KHMLYEAGATYNEDSKTRGFAPGSLTAANHNIDLRSSSSLNRLLVEEDIDYGGFWFFS FSKDVVEKIKLPLPLFLKIDDIEFCLRIKELGNKIVAFPSLAVWHQPASAKNLNWETY YYARNDLITYAIHYPIGYMDTVKHFTKAIIQSLSKCDYDYVTMLIKSFEDYIKGPDFI KKSEPEKLHFNILKLSQSYNNQKEIDKLAGIKLLTRWFKVAAKSSIEWSSVSREWKSA SKEMTSTIFWQQYLGLKN" gene complement(20852..22294) /gene="xylB" /locus_tag="DP116_03300" CDS complement(20852..22294) /gene="xylB" /locus_tag="DP116_03300" /EC_number="2.7.1.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634118.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="xylulokinase" /protein_id="PRJNA477356:DP116_03300" /translation="MLLGIDLGTGSAKALLLATDGTAIGEASSSYPVHAPHPGWAESE PGDWWLAVAMAVRKAVGNHADRVQAIALSGQMHGVVLASESGQPLRPAILWADTRSSA TLNAYHSLDAAILERLGNPITAGMAGPTLLWLREHEATVYAQARWALQPKDWLRLRLT GEVATEPSDASGTLLYDVVSDNWASEAITALNLRYDWLAKIIPSSAIAGYLTAVASEH LGLRVGLRVIAGAADTAAAALGNGLLEPGLVQLTIGTGAQIITPRSQPIIDPHGRTHL YRSAVPKGWYTLAAMQNAGLALEWVRGILGLSWQQVYTKAFSVPPGCEGLTFLPYLTG ERTPHLDPHVRGAWVGLGLHHTQAHLMRAALEGVAFALRQGFEALEATGFKATELRLA GGGTQEMPWKQLLTDVLRIPLYATTVAAASARGAALLAGIGIGVYADTNDTLKLAATP TLAATPQSVDSALEEAWMRYQSLYPRLKKI" gene complement(22407..22940) /locus_tag="DP116_03305" CDS complement(22407..22940) /locus_tag="DP116_03305" /inference="COORDINATES: protein motif:HMM:NF033564.0" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ISAs1 family transposase" /protein_id="PRJNA477356:DP116_03305" /translation="MIDSGNEYVIAVKANQKNLHRQIRHNTENTKPTSRYIATERTRN RVTTRIIQVFNDLTGISREWAGLKSLIKVERTGTRGGKPYHQVAYYISSFLRSAVDFA RGIRGHWGIENRLHWVKDVVFGEDRSMIRKGNAPANRSIILAIALNVLRRNGYSSITS AQRLIANDIDKLLLLVE" gene complement(22924..23184) /locus_tag="DP116_03310" CDS complement(22924..23184) /locus_tag="DP116_03310" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03310" /translation="MLDPRKSIKGTVQDYKSEYQNFVSIVSVFAGRRGLVFSMDKLEN KGSSEISTVQNLIAALDIQGVVFSEYQLYIAKKNLQANDRQW" gene complement(23203..23493) /locus_tag="DP116_03315" CDS complement(23203..23493) /locus_tag="DP116_03315" /inference="COORDINATES: protein motif:HMM:PF13808.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03315" /translation="MYVGYRAWGDFVKRHRCVLISTFGIQKHGVPSYSTIRRILMGVD FDTLAATFNQWAQNYVHLETWEWCGIDGDALSPNFWSALFLLICVPRRVFVV" gene 23906..24442 /locus_tag="DP116_03320" CDS 23906..24442 /locus_tag="DP116_03320" /inference="COORDINATES: protein motif:HMM:PF12802.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_03320" /translation="MVLRTVEALLADNTIDARIAGNKNVSFYHKKKTPASNALTDLIR AVLRMNATVQKSGTRLMRGTGITNARWQMLSELFALEKRVTVSELARHMGLTRQAVQR LADDMASDGLVEFAENPGDARAMHLLLTEAGRTTYHDALEREWQWTNAIAEDFDAEQI TRAVALLEAITQKMQTDD" gene 24519..25754 /locus_tag="DP116_03325" CDS 24519..25754 /locus_tag="DP116_03325" /inference="COORDINATES: protein motif:HMM:PF00067.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03325" /translation="MPHPTAAEQFDPHNPRFTADRFTLLAQMRSEAPVTFLPALQVYA VTRWQEVHDVLGDAVTFASSEAFSAGVHLAPEASAIYSLTSPLFAYNLINVDKPLHTR LRDPLMAAFTPKRIQSLAPTVIADVEDLLDAIAASGGETDLLLTLCKPLALRTICRLL GVPLADAEKLSGWSDALVAFQTPGLPIEVQVGAAHGLRALENYIREMVALKAAMPDDG LISALVASRAAALNDLSEDELVADIAIVFFAGHETTINTIANAFHSLLNRREYWEAIA SGTVDAENLTDELLRHDTSVMGLYRRTTVDTVIGGVTVPQGATLWVSYAAANRDPGLF DAPETLQCPRGNARQHLTFGYGAHYCVGPLLARLQIREAVTRVAKRFPEMRLVPGAFV PEIPHHGLRAPITLPVLLK" gene 25979..26839 /locus_tag="DP116_03330" CDS 25979..26839 /locus_tag="DP116_03330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318392.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SDR family NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_03330" /translation="MILVTGATGQLGTAVVKNLLEKTSANRIAAFVRDKSKASALKEK GVDIRVGSYDDTASLDKAMHGIEKVLLIAGTDEDNRIKQHQNVVDAAKKAGVQCVAYT SRTLKDRNTLANKLMEGHFQTEDYIKASGLNYALFRNVLYMDTIGQFVGERVFDTGIN LPTGQGRVPFALRSEMGEAIANALLESGCGNRIYKLTASESYSFDDVAATLSDLTGKE VDYTPTEKSAFEAQMKERGVPEAMVQRVVGFLTDIKNGQEEEVSPDMENLLGRKPASL KEGLKVLFNL" gene 27236..28003 /locus_tag="DP116_03335" CDS 27236..28003 /locus_tag="DP116_03335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009629976.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-oxoacyl-ACP reductase" /protein_id="PRJNA477356:DP116_03335" /translation="MTNHSPKIALITGSSRGLGKSTALNLAKKGVDVIVTYHSNAEEA TKVVVQIESIGAKAVALQLDTGNTKTFDSFVEQVKQSLQDKWHTDRFDFLVNNAGTGI NASIAETTEEEFDHLMNIHLKGVFFLTQKLLLLINDGGRIVNVSTGLTRIIFPGYAAY ASMKGAIETLTLYMAKELGSRRIAVNVVAPGAIETDFRGGAVRDNPEMNKYVASQTAL GRVGLPDDIGGAIASLLSKDNQWVNAQRIEVSGGQSL" gene 28033..28521 /locus_tag="DP116_03340" CDS 28033..28521 /locus_tag="DP116_03340" /inference="COORDINATES: protein motif:HMM:PF02627.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carboxymuconolactone decarboxylase family protein" /protein_id="PRJNA477356:DP116_03340" /translation="MSRQFLTLIIAEIVFMCSFQAQEAVSQPLETQIRQERGERVLNS LTGGNGLPPHFQQLQKDFPELADLTLKYSLGDIWGREVLDNKTRQLVSLAGFAAQGTM PQFKVHAQYALNYGVTPQELMEVIYITTVTSGYPRALIAAGTLKELFQENKIKFPITS QK" gene 28531..29271 /locus_tag="DP116_03345" CDS 28531..29271 /locus_tag="DP116_03345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015094816.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_03345" /translation="MLNIKDKVVLITGASSGIGEAAARFLAAKGAKVVLGARRTENLK SIAGEIQAAGGEVRFTSLDVTQKEQLESFIQFSQSQFGRVDVLVSNAGLMPLSLIEQL KVEEWDRMIDVNLKGVLYGIAAALPIFQAQNSGHFVNITSIADRWVGPTATIYCATKH AVRVISEGLRQEVGSNIRVTVIAPGATESELLNTISDPEIKKAAIEQFRIDLLPTEAI ARAIAYAVEQPADVDVNEIVVRPSAQKY" gene 29367..30299 /locus_tag="DP116_03350" CDS 29367..30299 /locus_tag="DP116_03350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316881.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADP-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_03350" /translation="MKAIVINAYGNEDVLNYVDVERPAPKADEVLVKVHAAGVNPAEW KVRDGMGEAFGLKLPLILGGDIAGIVEEVGEAVESFKKGDAVYGLTASGGFSGGYAEY AVAKTDTIVPKPDSLSFEEAAAIPIAALTAWQAMFDLAHLSSGQRILITGASGGVGSM AVQLAKAKGAIVIGTASGRNEQFVRDLGADEFVDYTQQPFEEVVKDVDVVFDTVGGDT QERAFQTLKKGGFLVTSAQTPSEEKAKEFGTEAAFVFCKPNAGQLTEINRLIEEGKLK IHIETVLPLTEVKKAHQLSQSGRTRGKIVLQVGA" gene 30395..31096 /locus_tag="DP116_03355" CDS 30395..31096 /locus_tag="DP116_03355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009630191.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_03355" /translation="MNNQEQPVFFGKEQAAGYDQRWTKLAPIRDALHLLLRIRLSELP DDAQVLCVGVGTGAELLYLAQAFPQWSFTLVEPSKPMLDICRQRAEEDGITSRCTFHE GYLDSLPLSLPFNAATCLLVSQFIMQPEERCAFFGQISNQLRPGGYLVSADLASGTSA SAYENLFEVWLQMQRFNGIPEEAIEKMRLVYGRNVAVSTPREIEELIASSGFDAPVLF FQAFLIHAWYARRTT" assembly_gap 31228..31245 /estimated_length=18 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 31329..32240 /locus_tag="DP116_03360" CDS 31329..32240 /locus_tag="DP116_03360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013667631.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_03360" /translation="MQSIQERESYYKQGAPWLPDDLKNEFGLFNVFNFNPGKNGNPPQ LPYSKKDYYKITIIKGSGSGIFLYADREIEIENYSIIFSNPQIPYGWSQRENFSDGFA CVFDQAFFHQYGNITNYSVFQPGNNIYQLDEEQFVQLEDIFRRMFEEIECDFIHKYDL LRTLVYELVLYTMKMKPASKLSKQPINASVRISTLFTELLECQFPIDDIHRPLTLRSA SDYAKNLNIHVNHLNRALKETSNKTTSQLIIDRILQEAKVLLRQTSWTVSEIAYALGY AEVTHFNNLFKKYLNITPTNYRKADIV" gene 32339..33088 /locus_tag="DP116_03365" /pseudo CDS 32339..33088 /locus_tag="DP116_03365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009629879.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="short-chain dehydrogenase" assembly_gap 32981..32990 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 33453..34379 /locus_tag="DP116_03370" CDS 33453..34379 /locus_tag="DP116_03370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008590585.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD-dependent dehydratase" /protein_id="PRJNA477356:DP116_03370" /translation="MKIVVTGSLGNISKPLTQQLVQQGHSVTVISSKAERQKDIEDIG AKAAIGTMEDPDFLSATFKDADVVYVMETLGADSFFNQNLDIIAAITKIGNNYKQAIE QSGVKRVVHLSSIGAHTDKGNGILVFHYNVENILKQLPNDVSIKFMRPVGFYTNMFRF IETIKTQGVIVSNYGGDNKEPWVSPLDIAAAITEEIEKPFEGRTIRYIASEEVSPNEI AKILGEAIGKPELKWVVIPDEQLLNGMLSIGMNLQVANGFVEMQASQRSGLLYADYYR NKPTLGKVKLTDFAKEFSTVYNHKTHSSIERI" gene 34315..34740 /locus_tag="DP116_03375" CDS 34315..34740 /locus_tag="DP116_03375" /inference="COORDINATES: protein motif:HMM:PF13532.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha-ketoglutarate-dependent dioxygenase AlkB" /protein_id="PRJNA477356:DP116_03375" /translation="MLQKNFLQFITIKRIQALSGYKFNFVVGNRYRTGKDSIGWHSDN FSQIGKRPALAQGFGAAIASLSLGSTRKFKLRHKDSGETVDYHLESGSLLIMLPGCQE DWVHAVPKTARPVGGRINWTFRPHVEAILQGRGSAKGFC" gene 34725..34898 /locus_tag="DP116_03380" /pseudo CDS 34725..34898 /locus_tag="DP116_03380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318392.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="SDR family NAD(P)-dependent oxidoreductase" gene complement(35194..36684) /locus_tag="DP116_03385" CDS complement(35194..36684) /locus_tag="DP116_03385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017287270.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="mannitol dehydrogenase family protein" /protein_id="PRJNA477356:DP116_03385" /translation="MNSNPNTGSTIKLNEASLSRLPGNVRVPKYDRRQITNGIVHIGV GGFHRSHQALYLDDYFHQNPGSEWGICGVGLLDNEYDRRMRDALKSQDCLYTLVERSQ EGDSARIIGSITRYLFAPDNRQAVIEAIAAPECRIVTLTITESGYYYIEGSGNFDVNH PTMQHDLQHPDQPIGTYGFLTAALEKRRKQGLAPFTVLSCDNVQGNGNMVRKMLTTFA QMRDPALGRWIAEHVAFPNCMVDRITPLTTPQDIKMVAQQFGIDDTFPCVAEPFIQWV IEDTFCAGRPDWESVGVQMTSDVHPYEMMKIRLLNASHMLIGYLGSLAGYTYVYEVMA DPLFEQAVANLMDEVTPTLQPVPGIDLDDYKKTLIERFSNPKIRDQLPRLCLNGSAKI PKFVLGSLRDKLQLGGAIDYLSLTIAAWFQYLNGYDDQNRPIVIDDPLADIITRRACS GKTDPRPLLSMFEIFGDLVQSPRFVETVADKLSSLHEFGAKGTLMP" gene complement(36721..36834) /locus_tag="DP116_03390" /pseudo CDS complement(36721..36834) /locus_tag="DP116_03390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015190566.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ABC transporter" gene complement(36849..37592) /locus_tag="DP116_03395" CDS complement(36849..37592) /locus_tag="DP116_03395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HAD family phosphatase" /protein_id="PRJNA477356:DP116_03395" /translation="MKPLQHDKVVLFDHDGTVVDSETIALKSAWGLTNEVAREFAGAQ HYELEDFVKSFAGKPYREILKKIYADSLTTLNERDIERLVAEEEKRAIERLSVQAKAT EGTPEVLSYLRDDGFEYALVSNSSLQRLSACLTSAALTDYFPSEQVFSAHDSLPVARP KPLPDIYLHAVKCLEAEVSDCVAVEDSISGVRSAVAAGIGHIIGYVGGTHISEDERTS RADALQSAGAQQIIERMHDLIGILSPTLV" gene complement(37589..39076) /locus_tag="DP116_03400" CDS complement(37589..39076) /locus_tag="DP116_03400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634114.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="mannitol dehydrogenase family protein" /protein_id="PRJNA477356:DP116_03400" /translation="MVNLLRPISVPNSRLGTITGLWRERSPYDRRALNTGIVHFGPGR FFRGHLAYIIHNYLAQKGSQEQRWGICGVSLKSQGTITRLKPQRFLYTLTKHSSCAKN REEAKVIGSIREIVNGREKCDYVLEKMTSPSVHLVTLTITQGGYHLDKSFNLDTANED IAHDLRNPSTPTTAIGFIVEALRRRRDSGMTPFTTLSCDNLPRNGEILRKAVLAYADL IEPFLAEYIRDNAVFPNTVVDRIVPQEQESDYNYPSRLLQVRDRAPIVTEPFWQFVVE DNFTSDRPNWEEAGVIMTKDITPFLYMKSRFLNAVHSFIACLAVRAGIEYMHEAIRQP EFHLFTRLLMSDIAAATPVPREMCEQYMEQVLLRLSNEDLPDTTERISSETARKVGKY IFPILQDAYSRKVSMKRIILPVAAWLLAVREGASESGQPYHAKDTQSAVTAIQEGAVI SGILGLENCEHTEVVDNECHQALRDLQTHGLLTTLKNYSERGQLR" gene 40101..40712 /locus_tag="DP116_03405" CDS 40101..40712 /locus_tag="DP116_03405" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03405" /translation="MDLWQKLYYKYQKDYVRKRLLAIKYLYEGKNRTEVSAIIGCNYK KLLIEKQVIDRQDALKLNERDAQRRKRVLERYHGNALKTGLDFYHAAMIFQHGDDPGD YLLAHDLAIAALTFKDKGAEEAKWLIAATQDRFLMHLGRPQRFGTQQITTKPNANNFR CASIYNLDNSPASVTDEHRQILNVPTLKQAQEKLEEWNKKCKD" gene 41025..42005 /locus_tag="DP116_03410" CDS 41025..42005 /locus_tag="DP116_03410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194106.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_03410" /translation="MQICQELSLTRMGCGTWAWGNRLLWGYDESMDDELQAVFSLCVS NGVTLFDTGDSYGTGRLNGRSESLLGRFSREYLGSGKENICIATKLAAYPWRWTRQSM VSACKSSAKRLGKNVDLVQMHWSTANYAPWQEGGLLDGLADLYEQGLVKGVGLSNYGP KRLKRVHQRFAERGIPIATLQVQYSLLSTYPVTELGVKDVCDQLGIKLIAYSPLALGL LTGKYSEKGPFPKGIRGLLFRQMLPGIRPLLASLREVAQSRNKTMSQVAINWCICKGA IPIPGAKSVEQAKENIGALGWELNSGEIAELDQAAASADKKMVQNIFQTR" gene 42156..43415 /locus_tag="DP116_03415" CDS 42156..43415 /locus_tag="DP116_03415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874075.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03415" /translation="MIDVVREGEVAQTKKNAHQTQGQEIYSGDFSWSFWFALPLYPFG KRRTIRKEVLKDTIWTFDQLQGIFYVVVPIRMTVVKLNEGGLLVYAPVAPTTECIRLV NELVAEHGDVKYIILPTISGLEHKVFVGPFARRFPNAEVFVAPNQWSFPLNLPLSWLG LPSKRTQILPEDSSQTPFADQFDYAILDTIDLGPGQFAEVAFLHRRSHTLLVTDSVVS VPEEPPAIVQLDPYPLLFHAKDKASDIVEDNPANRRKGWQRISLFALYFQPSMVEIIA WGEVFRNAFQAPERSNKAYFGLFPFKWNSNWKSSFDALRGNGRLFVAPILQTLILNRA PKETIDWANKVASWDFGWIIPCHFDSPIQAQPHQFRQAFSFLEQSSSTGLSSSSYPLP EEDFKLLRELDKGLNKFGIVPPAKEFK" gene complement(43449..43757) /locus_tag="DP116_03420" CDS complement(43449..43757) /locus_tag="DP116_03420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03420" /translation="MKLNYQNNNIFTFVKVLSTVLITSAIGLELWNIYAVLTHTKVPS SLNPVFWIERFAVTIHFLEAVVAAFFAPSRKKTPLKYGTYTFFVGTIGLLELFNKEDD " gene 44050..44277 /locus_tag="DP116_03425" CDS 44050..44277 /locus_tag="DP116_03425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03425" /translation="MQTTAQEIYTQVVRNLSPNERLRLATLILNELVGQQQLSSVDQS DTWTQEDQIDLVNFSLQYAATTFSDMEDVEQ" gene 44379..44462 /locus_tag="DP116_03430" /pseudo CDS 44379..44462 /locus_tag="DP116_03430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495918.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="HNH endonuclease" gene 44553..>44863 /locus_tag="DP116_03435" CDS 44553..>44863 /locus_tag="DP116_03435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120705.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GTPase" /protein_id="PRJNA477356:DP116_03435" /translation="MTQKELLRVIEEAASEGATELDLSGKELTVLPPEIGKLTQLKKL ILGKYKYNDAGDIVDTIGNKLSALPAEIGQLHHLEELQVVDNRLSSLPQEFGQLTNLQ TL" BASE COUNT 12745 a 9582 c 9920 g 12588 t 28 others ORIGIN 1 gcaggggagc aggggagcag ggaactctta actcttaaca gaacctcgta aaatctcact 61 tttgcaagag gtctattgtt tgtgagtccc tattccaaga gtcctaattc ccatttttcg 121 gcttcagtgc gagcaacatc tcaacttgag tagttgtttt gtcggggcta gtagcagtca 181 aacccaaacc gcgaatggaa cttatgatag cacttgtctc gggagtcatc ggctgctgtt 241 gtgcagtggc aagacgattt aacagaggta cggttttgtc catatctaag taaaaatatc 301 caccgttagg cttttggaga gtgctagtaa cggctttgaa gttgttgcta ccttgtaaag 361 actcaccttt acgatctgca agcgtttcag ctatcggacc accaatggtg acaaatacgg 421 agttattatc tagccagccg tgtgctaata aagctccttg ttgaggaatt tgccattctg 481 tgacttcttt accagcgata tttctttttg cgacattgac tgattgtgct ttgacaaagt 541 catcgagttt gttgagggtg gcggaagcca ttttgcgatc gctcgtgtgg aaaaccaacg 601 ctcccccaaa accaatctgc gccaacacac cttgatttga tggaatagca gcaaaaccga 661 attctccatc catccacccg aaaatttctt tatctaggtc aatattgact aatttcagtt 721 gtgcgcgagc ttgttcaagc gtaagcttca actcaggagt atcttttgat tgttccacaa 781 gcgctgacca ccagttacta attccctttc cgctgagtat ggcaattgta tcagcaggaa 841 attgagatac cacatcagca gaagtgtttg gatactggta tttgaccaat tctgggtcta 901 acttggcaat ggctttcatt ctcacaccca cattatcaat tcctacaccc gccaccatag 961 atttcatctg ctttaattgc ttcatggttt ctggaggaat cggcgttgcg ttgggatttg 1021 cggttattaa ttgctgtacc atgttgcgat agtcaggtac ataaatttgt gccaaggtgt 1081 tgtctacatt gacaccgctt gttaacatat tttgtgcacc ttctttactt gcaaaagaag 1141 gttcaccttt gtaggtatca attgcatgct ctacagattg cttttccgga gcaataacaa 1201 tgtaagtcgt gtttaaaacc gctgtataac tgtgttcatt tttaccagtc gtttcgataa 1261 ttttttcgcc tttgtagtca ctttcctcta ccttgacatc tttttgtgcc ttcaatttat 1321 tagcaaaatt caaggcaccc aatttatcct tgattcccac tacaatcaag attttagatt 1381 cctgtgtcgc ttgcggagta ttctgtgcaa caggttttgc tggacttgag ggtagcatgg 1441 caaccatgac accacctacc caaggtttta 
aatcttgttc ataagaaatg ttactatcag 1501 ttgacagttt tttatttaag ttttccaaac cttttgcgac tatctgttgc gcttggggag 1561 tgccaaattg ctccaatttt gcccaaactt ttgggtcagt ggaaatgtaa gttgccaata 1621 ttgcctcatc tggcacgact ttagcagcac ccaaaacact agagatatct ccagagggac 1681 cacctttaaa atacatatat gctgctattc ctcctgctac aactagggca gtactaacag 1741 caggaattaa aaattttgat cgactttcgg gcattgtaaa gattcccttt tcttcaacaa 1801 attatcttag atctcagtaa aaacgccatt gtggattttg cggattcaca atctatctac 1861 actcatttat cgggatttag tgtaagaatt gtcaaccaaa gaaataatgg gtataatagc 1921 gctctaagta gaagttcaaa agcgtcaaga acaaatccac aaatgatatt gttctttatc 1981 tgaaaaatat attccttttg gtggtaacaa gctattctaa gctatataaa accatcagga 2041 tattattaca gaatttcaat tttttgggcg caaaacactg tatgctcaat gacaacagtg 2101 tacaaaatgt tcagggtgta tcctaagttg atatgggata acacgcgaaa aagctgtttt 2161 ttccttacaa catccataag gaattttcag ggttaccaag gttcagtaaa catcaatgtc 2221 gcagtacttt tttgatacag aaaaaaaacg gtataaatct tttgaattac ctggggcaaa 2281 accgcactac aatcctgatc atcctggaca agtagagcat atttttttag acctcagtct 2341 ggatatcgtt aacaaaaaat accaaggtcg ttgtagtata acactcaaac caatccgcaa 2401 cggtattgat cgtttaaatt tagatgctgt caacttaggt atcgaatcag tacaagtgga 2461 tggaaaagcg caaaactttg actacgatgg acaacagtta tctatccaac tcgaaccacc 2521 tacacaggtt gacaagcgcc ttgtcattgc tattgattac tcagcagaca aaccacaacg 2581 cggtttgtac tttatccaaa gcgacaaaca ctatcctcat aaaccctccc aagtttggac 2641 tcaaggagaa gacgaagact ctcgcttttg gttcccttgc tttgactacc ctggacaact 2701 ttctacctct gaaattcgta tccgcgtccc caaacacctg atagcaattt ccaacggcga 2761 actcattgct gctgaagaac aaggtgagga caaaatctac cactggtcac agcagcaagt 2821 tcatccgacc tacttaatga ctctagcagt aggtgacttt gctgaaattc gtgatcagtg 2881 gaacggtata cctgtcacat actacgtcga aaaaggtcgc gaagaagatg ctaaacgtag 2941 tatgggcaaa actccccaga tgatagagtt tttgagcgaa aaatatggct atccataccc 3001 ttaccccaaa tacgctcaag tttgtgttga tgacttcatc tttgggggaa tggaaaatac 3061 atctgcaaca ctgttaactg ataaatgttt gctagatgaa aaagctgcat tagataaccg 3121 caacacagaa agcttggttg tccatgaact ggcgcaccaa 
tggtttggtg atttgctggt 3181 gattaagcac tggtctcatg cttggatcaa ggaggggatg gcttcttact ccgaggttat 3241 gtggacagag catgagtacg gtactactga agccatatac tatcgtttac tggaagctcg 3301 tagttatttg gctgaagata gtactcgcta tcgccgtcca atggtcaccc atgtttaccg 3361 ggaagctatt gaactttatg accgccacat ctacgagaaa gggtcatgtg tttatcacat 3421 gattcggacc caattgggag aagaattgtt ttggaaagca attcaaacat ttgtacggga 3481 taatgctcac agaaccgtgg aaacagtcga tttactacgg gcgattgaaa aagcaactgg 3541 gcgcaatctc ttgtttcttt ttgaccagta tgtctatcga ggtggtcatc ctgattttaa 3601 agtcgcttac tcctgggatt ctgatagcaa gctggcgaag ataaccgtga ctcaaactca 3661 agcaagtgtc ggaaataata gcgatttatt tgacttaaaa atccctatcg gttttgggta 3721 cgtccaacaa ggagacttcg agacttcctc ctccactcct tcacagacta gctattgtga 3781 ccttaagact ttcacagtac gagtgaatga acaagaacaa agcttttact tccctcttga 3841 ccaaaaacct caatttatca gctttgatgt ggggaatcat ttcttaaaaa cggtgtctct 3901 ggaataccca gtaccagagt tgaaagcaca gttggaattt gatcccaacc ctatctcccg 3961 catcctagca gctgaagctt tggcgaaagg cggcggtaat gaagccctaa aagcgctgtc 4021 tgctgcactc aaaaatgacc ccttctgggg tgtacgcgtt gaagtggcta aacaacttgc 4081 agaaatcaag ctggatcaag tctttgatga gttagttgtt ggtttaaagg ataagagtcc 4141 atatgtccgg cgtgctgtgg tggaagcact tggaaaaatc aaaactcacg aaagctacaa 4201 ggtgttgaaa gaactcttgg aggtgggcga tcctagctac tatgtagaag ctacagcaac 4261 tcgtgctgta ggagcaatag cggctgcgac aacagaagaa aaacctaagg aagaaaaggt 4321 catcaagctg ctaaaatccg ttttggaaga aagggcgggt tggaatgagg tcgtacgcaa 4381 tggtgcgatc gcaggtctag ccgaactcaa aacctcagaa gccgctttaa atctcctcat 4441 ggaatacacc caccttggtg taccacaacc cttgcgttta gccgcaattc gtgctcttgg 4501 gaagatttct gttggtcaaa atagtgccaa tgtgcaacgg attttagagc gattgacgga 4561 aatttccaag gaaacattct ttttaactga aatagcagta gtcacagccc ttggacaaat 4621 ggaaatcgcc aaagcaattg gaattttgca agccaaggct taccaaacac cagatgggcg 4681 cgtacgtcgc tacgctgagg aagagatttc caaggtgcaa accaacattg gttctgaaaa 4741 tgcgctgcgt cagttacgtt ctgaagttga ccaacttaga caacaaaacc aagaactcag 4801 aagccgctta gaaaacttgg aagcaagatc taagtcgtaa cagatacaac 
agcgcttccc 4861 atttgcgtat tccatgcggt agcgaatatt gctgtgctct tgaccactag atctgcgatt 4921 tagtggtcaa gactttgatt gggggtttga attccccact cattgtctga ccttcagcat 4981 agatgctttg aatacttctg ctttcttcaa gcatcttttc agatttgggc gtagacagag 5041 gtcaaaacat cctttttttt tacaattggc tttattaagt tgttgtaaag tttatttttt 5101 ttatttataa gtgtatcaga tgcacataat atcataatgt tgttctcatt aatgtaaaat 5161 ttatcaaaaa aatataaata ttctgtaaag aaaagattga tgaagtaaaa attggcttca 5221 gaaatattac tatttgataa atgtaatcag acctaagtgt aaaaattaca gagaaattat 5281 gtgccttttt tgaaggtaaa aaacctttta tgtgtttgta cgacagtggt tataaatctt 5341 acaatatgcc aatatgatac tgatacagtt cccagttgat tatgatgatt atgactatcg 5401 gaggaatgca aaaacgctca aatttctata atgtgtagga ggatatattt aaagtttttg 5461 agtgcttttt gacaattaaa aaacatttga tcttttgtac aaaagttgtg cagggtagtc 5521 cccatgcgta aacctgtgat cacgatattt taccagttca atccttggaa cgccacaata 5581 ggaggtatac aaacactcat aaatacattc atcaaatacg ctccaagcga gtttgatgta 5641 aaactcgtag gaactaatac tgatccaaaa aagcctgttg gtaaatggca acaagcagaa 5701 ttcgcaggta gagaaattgc ttttttgccc ttatttacac tggaagatga taacgtccga 5761 agcttaattc caacaaccct aaaatataca gcagctcttt tagggcatcg ttttgcttcc 5821 gactttatgc attttcacag gatagagcca acaatagcag ctcttaattg gcagggggag 5881 aaaactctgt ttatccacaa tgatatacag acgcaaatgc aagcgcgtgg tgataaaaaa 5941 gcaattcttt ggcgaagatt tcccgctgca tattttgcat tagaaggttt gttagttcgc 6001 cagtttaacc aaattctctc gtgtaatact gatgcagcgc aactatataa gcagcgttat 6061 cctaacttgc aagaccgaat tgcgtacatc aaaaattctt ttgacaatgg aattttttac 6121 ccattaagag aagatgaacg agaaatcaaa agacgagaac tggcgtctcg gctaggtgtg 6181 gatgaacaga ctcgttttat tttatttgct gggaggcttc atcctcaaaa agacccgatt 6241 ctgctagtac gtgcgtttgc tactttaaac gagcctcata ttcacctact catagcaggg 6301 gatggcgagt taggagtaga aatccgtgcg gaaattgcaa gactaggact tgtagataag 6361 attacaatgc tgggagcagt gactcaaagt caacttgcac agcttcatcg tgtctgcaat 6421 gtttttgtcc tcagcagtgc atacgaaggt ctacctttgg tggttttaga agcgcttggg 6481 agtgggactc cagtcgtcac gactcaatgt ggtgaaaccc caaaattact gactgctgac 
6541 agtggggttg tttgttctga acgaacgcct gcgtgcattg cagatgcttt acggaaggtg 6601 cttctacatc caggagatta tccgacagaa tcttgcgtgc ggactgcaaa gccttatgca 6661 gctagtacgg ttgttcagca agtttacagc gaaatgttaa atcgctggga acagcgcaat 6721 agcattgctc ttcaggtctg aacacaattc gctgtcaaaa caaaaactaa gaaataaaaa 6781 agacgaaaaa ttatagttat cagggagagg caatgctttt ttagagttca aaaatttgaa 6841 acaaagaaac agatcttttt tgtgttccca cgtcctgtgt tcttgtttcc atctctgtta 6901 cgaaatgtaa cttgatttcg atttgtacat gcttttccat cgcaaaacta gcacggctta 6961 actagtatga aaattgctgt cattggtgca aaaggtctac ctcccaaaca gggcggcatt 7021 gagcattact gcgcagaagt gtatcctcgt atggtaaaac aaggacactc tgttgattta 7081 tttgcccgct cctcttatac ggacagttcc tggcaagaac cttatgacta tcagggcgtc 7141 caagttatct ctttacctgg ttttggttta agaggagtgg atgctctggt gacgtcggca 7201 ttaggagcga tcgcagcctc caccacaaaa tacgatatca ttcatttcca cgctctgggt 7261 ccatctctat tcacttgttt accaaaactt atcaatagtg caaagattgt tgtaagctgt 7321 caaggtttag attggcaacg tgctaaatgg ggcagctttt caactcgcgt tattcaaatg 7381 ggagagaagg ctgcagttcg tttcgctgac ggattgattg tcgtatctaa tgtgctgcaa 7441 acctactttt cgcaaactta tggtaggaac acagtctaca ttcccaatgc cccagccaga 7501 tatggtgagt cagaccccaa ttttggttac ggtactcaat taggtcttga gcaaaagcgc 7561 tatattgtct ttttgggtag actggtacca gaaaaatgtc ctgacttgct ggttgacgct 7621 tttactgctt tgaatcctcc tggatggaaa ctggttctag ctggcggtgt gagtgatacg 7681 aaatcattca cctcacaact gttacaaaag gttgcaaatc atccaaatat tgtctttgca 7741 ggcgaactga ggggacaacg tctttgggaa attgtccggg gagcagggtt atttgttctt 7801 ccctctaatt tagaaggact acctctagct atgttagaag cgatggagga agagatacca 7861 gtcgtagcaa gcgacatccc acctcacaag caattaatta gtggaggtcg cggaaaatta 7921 tttgaagctg gaaacctcac ctctttgatt cgcaccttag actgggcaat acatcaccca 7981 caagaactaa gagcgatggc tgtgcatgca aaaaaacatg tacaactgaa ttatagctgg 8041 gatcacatca cctcggaaac cctaaaactc tacacaacac ttcaaacctc ctgtgaacca 8101 gtacatattt acaaacaaaa ccaaactgga cttgcagagg ttttggggaa aaaataagca 8161 gtcaaggtag tcgtttgtca tttgtcatcc cgcagtgcca agtaagtttc caactggaca 8221 
gcgactgctt agggcagtcc aggaggaaac tcttgtgaag gcgactggcg cttttcgcaa 8281 agtgtactgc atccgcagaa cttacagtgc aaccgtgccg taggcatagg gcaaactcaa 8341 ccaagttatt tgttcaaatt ttcatgagtc atgaatcatg actcatgaaa caagagggaa 8401 atatgtaatg gaaaaaggaa tctcaagtgt actagcagtg ctgacgcggc gaagtcttcc 8461 cgcaatagct gcatttgttg ctgtgatggg tggggcgatc gcatatcttg cagtcacccc 8521 aaataaatat gaagctttag cgcgactgat gctggacgac aaaagagtaa gtgtctcaga 8581 attaggtcgt gatctcaccc aagtgtctgc aaacgcacca ggaattagtc ccttagctga 8641 ccaagcagaa ctgatcaagt cgcaacgtgt tctcgaacga gcgatcgcaa ttgcttttcc 8701 caagacttac ggtaatttat ctctaagtcc tgtctcaact gcagaactca gtcataattt 8761 aaaagtcaaa attgttccag caaccaatat tttggaactg agttatcaat ctcgcgatcc 8821 tcagcttgct gcaaaggtac tcaacgccgt ttctcaagcg atggttgagg acaacgtaaa 8881 aagcataggt cttgaggcta caaaagtcaa gcaatttttg gaacgcaaac aagtacccga 8941 ggctgaaaaa aagctgctac aagctgaaga tcttgaaaac aaatacagaa aatcaagcgg 9001 tattgtctcc cttgaagaac aaacaaaaag cttagtgcag agtatagcga ctctggaaga 9061 ccaagaacgt actctgtcgg ctcaacttca agagataaaa gcgcgagatg catccttaca 9121 gcaagttacc aaaaatacaa acctgaacaa tgcctattca tctgttcgta gcggacaaga 9181 cgacgaaatc aagaagttgc gagctaagtt atcagaattg gagaataaga tcatcgaaac 9241 tcgtttgcgc ttaacagacg agcatccaac tgtgaggaat ttagttggag aacgggatgc 9301 cttgggtaaa gtattatcag aacaactcgc tcgcgtgtcc tcaaaagatc aaagcgtctc 9361 cacaaaaaat ttcgctggtg atcaactctc tcaagaactg aactccaaat ttatcctcaa 9421 tagaatcgaa gaatctgcag ttaatgatag gctgaaagtc ttgcaggcta agaaagctga 9481 actccagaag cgtcttgccc aactacccat tacacagcaa accctaactg tactaacccg 9541 aaaacgggaa gaagccgcag tatctttgaa gtttctacaa ggcaaacttg aagaagcacg 9601 gctgacagaa gcacaaaagg tgagcaacat tcgtgcgatc gaaaatgctg tagcaccatc 9661 atcaccatct gaacccaagc agaaagtagt attagcactt gcttctgtgt ttggaacaat 9721 gctagccgtt ggcgttgtgt tacttctaga ggtgatggat aatacactac gcgatgctgc 9781 cgaagccgaa gaactactac agctaccatt gctaggaatt ttaccacgtc taagcgctaa 9841 aacgcttgtt cttgagccag ccaatcaatt cttagataat atggagttga ttgaacctta 9901 ccgcacactc 
ttcaaaactt tggagtttcg cagtaaggag caattgcggg caatcgttgt 9961 cagtagtacc atatctggag aaggtaaatc tgtcgtcgca tcacaccttg ctgccgttgt 10021 aggtatgcta tcttggcgaa cattgattat tgatgcagat ttgcgaagac cttcccagca 10081 cacactgttc aatctagctc ctggtccagg gattaccgat gtgctagaag gaaatgtatc 10141 cttgctggat gccgtgcagc caacagatat tgagaatttg gatgtattga cttgtggcaa 10201 tcaacacgca cgtccttcgc aattactaga gtcaatcgct atgaagtctt tgatggcaga 10261 agcggctgaa aattatgatt tagtgataat agatacgcca cccttaactg cttgtgcgga 10321 tgctttaaca ttagctcaag aaggtaatgg acttatgctg gttgcacgtc caggcttcac 10381 ggataaagaa gttctctcaa gatgtgtatg ggatttaaca caaaatcgga tatctatcct 10441 gggagtcgtg gtgaatggaa tgacacatct gacacaaaat taccgttatc cgacttatcg 10501 ttaccgacct cgactaccca aatctcagaa gcaattgatt ggtgcaggag atagtagtag 10561 aaattctgcg aacggtatga ggcagaggta agaatgttga ccaaacaaca cttcagtcct 10621 tcttctcgtc tgggactctt gattgggtta gcaggtgtgg tggttggttt ggtaacagga 10681 ttcctgatag gaagtaccaa acctctctac ttgggcttag ctttgggtgc aataccattt 10741 cttttttact tttttaccaa gtttgaacaa gtcgttctcg gacttttagt tttacgtacc 10801 tctctagatc ctttctctgg gcaacaaata ccagctatgt ttgctctggg gctagatgtt 10861 ctcactttgc tttacgtaac agtgatgtta ctgagaaggc aaactgtaca gactgaccgc 10921 ttttggtggt tctttgctgg ctgggtgatc ttacaaggct tatgggtggt actcctgtgt 10981 ttgggagggt tgggattagc tgctgggtat ttgtcagaca gcatccgtga atggattcgt 11041 cttttttcat ggttgatggt ctatctgttg gtgatgcagc ttaaagaccg agttcctcct 11101 gagaaaatga tctctgtgct gttcttagct ctggtagcac cactcgtggt agggttgatg 11161 cagatgttca taccttctgt cttacccgct tttctctcag cgcagaatta tgatgctggt 11221 tcaatatcat ctgaaggttt tcggatcaaa gggacaattg gtcaccctaa tgggtttgtc 11281 accttgctgt tactatttat tggtttaacc tggtggaaac tcaggcaatc aagacaaagc 11341 tttgtgtggt tgttgttgct aggtttacta gcattttttt atgtcagtac taaggcctta 11401 tttggcttga tgatgattgc taccttcgtt gtggttttgg ttgctcccag attaagccca 11461 gtaaacctca taggtggagt tttattcgta gtccttgtcc tgggactgtt tgctagcacc 11521 gaatttggac gagaacgcct gagttcactc gccaacacac ccctactcaa tccagatatt 
11581 gatgtatcgc gggctatttt actttcccaa agtgataata atagttttaa ttggcgaatt 11641 tcccaatggt atactttgtt aaatgcttgg cgtcagcatc ccttcttagg ttacggtttg 11701 ggattgagtg ttaatgtagc cactaataag ttgcttcctc acaatgatta cgtccgagca 11761 ttagtagaag ggggggtact tggttttgta acttttttag tgtttcttgt gggtcagggt 11821 gtgcgtctta tccagctgat gcgatcggca cctcctagga gtgctcaacg tgagttatgt 11881 tcaattatgt tcgctatttc tctagcgata cctgtcggga tgattacaga aaatatctgg 11941 agtcatacaa ctctgttttt ttattggttc actttaatgg cagtcgcagg gtggaactgg 12001 aatgaacaga ctgtagatag cagtactgcg ttgatccgtt ctccaaagca tttttactga 12061 tgaaaaaaat tatgggttgg aaactttaaa atttatgtca gcaaaaagac taaatctcat 12121 tcaggttttt cgagggttag cagcagtact cgtagtattt gcccacacag acctcatata 12181 taaccaaaat gtcaatcaag attttctgtt taaaatgttc ttgtttgggg ggtcaggtgt 12241 agactttttc tttgttttaa gcggttttat tatgttctat attcattaca aggatatagg 12301 tcatccagat aaattaggaa catttttctc gaaacgcttt acacgcatct acccacttta 12361 ctggctcatc ttaacaagta aaatattggc atctttttta ttctcttacg agcctaatac 12421 taatgctcgt gggattggag agtttattaa agcttttcta ctctttcctc aagatagaac 12481 aattctctca tcaagctttc ttggagtaag ttggacactc agttttgaga tgttctttta 12541 cctgatgttt ggggtgttga tttgcttgaa acctaaattt tcttttccga tcattgttgg 12601 ttggttatca ggcgtctttc ttcatttcct tggtgtcatt caattccctc aagataatct 12661 acttattcaa ttcctttttt ctgattataa tttggaattt gttttaggta gcttagccgc 12721 atacgttgtg ttaaataaga aagttagcaa tggaatacca ttactatatg gagggctttt 12781 tctctatacg ttatcagtta tcaattcttg gtacaccata atccaattat catctgttgt 12841 tctattcggc attccttgta cactaattgt tataggtagt gcttctttgg aacttagaaa 12901 aaatattaat gtacctgttt ttcttgtctt tttaggaaac gcttcatact ctgtctattt 12961 agtacatggt ttctttatga atcagatgac aaagattttg agtaaattac catttcctct 13021 ttttgaaaac ttggtagtct caaatattgt aggattcatt atttccatca tagctatcat 13081 gtgtgggtgc gttatatatt catatatcga aaaaccgttg ctcacatatt tcaagccaaa 13141 agcagtgaca acctagtact ttaccaaggc aactttgagg ggtagaaaaa aaccacgaac 13201 gcgagtgcgt gtcctctgga cataggcgca aagaacgcgt 
tagcgcagcg tgcgcccttg 13261 atgcataaaa agagaaaaag aagatgaatt tgaaagttga ttaaccaagc attgttggtt 13321 tggctgacta ctagcctaaa taagtttccc ggatttcttt tgttgtctat aaagtcgttt 13381 aacgatgacc acaaaaacta ttagcaccaa taacaccacc caattgagtg cgagaggtga 13441 acgaccttac cgaatagttt tagtccatcc cagtaccgga gttaactgga gtggtggatc 13501 agaaattgtg gcgattgagc tgactcgccg cctcagttcc tactttgaag tcgaactgct 13561 tagtggcgct gcttgtggct cgtttagcca ccccattcca tgtattcccc gttcctatgc 13621 ttatgatgct gtgcgtcacc ctctgattgc accgctagtg ggtaaattct caactcctga 13681 gattgtcgtg gaacatctaa ccagtttctt cccttgcttg ttccacttgc tcaggcatcc 13741 tgctgacctt atttttcctc acaatgatta tggtggattg gcaatggcag cttgtgtcag 13801 agcgctgacg gggacgccga tactctttac tgaacacaat agctcaaatg cagatgggaa 13861 atgtttgcag cggaatcttc gttttcgtcc agatcatctc gtcgtgttgg atgaagcgac 13921 agcagcattc gcccataatt tgaaaccgac gcaagctgtc agtgtgattc ctaatggtgt 13981 gaatcttgag caatttacac cagagggaac agcaatcaat cttgggttac caagacgaat 14041 cgcgctgtgc gtggctagtt taagccggaa gaatcataag cgagttgaac tagccattca 14101 agcagtggct cgcttacctc acgttagtct ttgtatatgt ggagatggac cggatcgcgc 14161 ctactttcaa gcgctgggag acgaattgct cggaccacaa cggtttgcga ttcggacttt 14221 tccccatgac cagatgccag aagtttaccg cagtgtggat gtatttacac tcccttccat 14281 taacgagcca tttgggcttg tctacttaga agctatggct agcgggttac ctgttgtcac 14341 tactgacgat cagatgcggc gatatctggt cggtgatagt ggtattttgt gtgatgttac 14401 taatctggat agctacacaa ctgccatcaa agacgcgctg tgtggtgact ggagtgaacg 14461 tgcaagacag aatgctgctc gctttagttg ggatgcgatc gcattacgtt atcgcgatgt 14521 gattttagaa acgattttgc agtccaagaa aaaagtgtcg ttaccaactc attgaataca 14581 agagggtcat atgccgaaag tttctgtgat tattccagct tacaatgcta tggcttacct 14641 cagagaaact gtggagagtg tgctcaagca gacgttcaca gattttgaag tcttaattgt 14701 tgatgatgga agttctgatg gaactgtgga gtgggtttct caaataaaag atctacgagt 14761 tcgactgatt tcacagcaaa accaaggttc atctggagca cgcaacacag gaatttctgc 14821 tgctggtgga gagtatatcg cgcttttgga tgctgatgat atttgggagc caacgaaact 14881 agaaaagcaa gtacgatatc 
tagaaaaaaa tccatcagtc ggtttggtag atacgtggac 14941 agttttaata gatcagcaag gtaagtccac aggcagagtt gttgtttcct acgcagaggg 15001 agacgacgta tggaagcagc ttgttcagtt taaaacagta tgcgcttgtg atagtacacc 15061 tttgattcgt cgcagttgtt ttgaaacggt tgggttattt aaccgagaat taccatttct 15121 tgaagattta gatatgtgga ttcgccttgc ttcgcgatat cgttttgcag ttataaaaga 15181 acccttagtt cgctatcgcc aacatccagg tagtaagtct acaaattgtc aaggaacttt 15241 ggaggctttt cgtacaattg ttgagaaagc ttttgagtca gttcacgcag atttactgcc 15301 tttaagagaa agaggatatg gtcgtattta tctctactta gcttggagag ctattaataa 15361 taaagattat gagcaagcgc tgcattttaa tcatcaagcc gttgctcatt atccacaact 15421 tattttttat tgggaattca tccgccaaac catagctttg acactattga aaacactcgg 15481 gcatcaaacc tatgacaaaa tgaaaacact tcttcaatca ttgcggcgac agacatcaac 15541 gaataatgga caatggagaa taggaaatag ttagtaggga gtgaaggagt gagagacaat 15601 tgacgagtgt ttttctcttc ctcatttacc taagaattcc ttgttttctc ttatctattt 15661 tcttgtaatc aagttagttt tttcaatgaa agagttcaaa aatatgtcca aagttactgt 15721 tattattccg gcatataatg ccatgaaata tctcccgcaa acagtggaga gtattctgaa 15781 tcagacactc acagattttg aagtcttgat tattaatgat ggcagttccg atggaattgt 15841 tgaatgggct tctcaaatca cagactcgcg aatcagattg atttcccaag taaatcaagg 15901 gacagctgcg gcaagaaata aggggatttt tgagtctaaa ggcgaatata ttgccttttt 15961 agatgctgat gatatttggg aaccaactaa attagaaaaa caagcccaat gcttagataa 16021 taatcacttg gtgggtttgg tagactgttg gacagctttt atagatgaga atagcaagcc 16081 cacgggttta gttatgagga acgatacaga gggtgatgtc tacaagaaag ttgtagaatc 16141 atgtgacagt cctgtttgct gtggaagttc accgatggta cgtcgttctt gtttcgacac 16201 cttgggcttg tttgaccggg agagttatat tgaagatgta gatatgtgga tacgtatcgc 16261 cactcgctat cactatggag tgatcaaaga acctttagtg aggtatcgcc agcacccaaa 16321 caacaaatct aaagattgcg aatcaatgtt aagaggtttt cgccagttaa ttgagaaaac 16381 ctatcgttct ttacccacag atattttaca cctcagacct cggagttatg gtcgattgta 16441 tgttttttta gcatggagag ccattgatac gaaagattac aaacaggctt ttcattatag 16501 tcagcaagca tttgcaatgt atcctcaact gattttgaaa ccgtggtttt ctcgcttaaa 16561 
tattgcgatc gcagtcatga gatttcttgg acctgatgga tatgagaaag tgcggtcttt 16621 taatcgcagc ctacgccgag gcttgtttgc tcgtgcaaca tgaagaacaa agaaattcgg 16681 aattcggaag aaaattgtcc gattattttt ctagttaacg aaataaccct tgaattcatc 16741 ttttcgggat agccggaatt ttgaattctt tgcaaaagaa aagtccaact tatttatttt 16801 tggagttgtc gtgggacttc gtagtaaagt catcaagggt ggagccttga tggtcatacg 16861 ccaagctttg ggtatcttac tcagtttaat tggcgtatta ttcatcacta aagttattgg 16921 accgagggag tatggacttt atggaatgag ttatggaatt gtaagttttc ttggtggttt 16981 gggtatatgg ggcttggatg tttatttact acgtaaaacc tctaacccag ataaacaaga 17041 ttatgaccaa gtctttactc tgttgctatg tattagcagt gtttttacac tcactcttgt 17101 attagggcaa cacattattg ccgagatgct gaaagttcca gaatcagcac cactcttggc 17161 aactttaggt ttaacattac ctatttcgct gttaaatctg cctttgacaa tcaaactgga 17221 tcgagactta aattttcagc gtgttgctta tatcgagtta attagccaag ttagctatta 17281 tgtagcagct ttacccttgg caaatcgagg ggctggtgct tgggcacctg ttggtggttt 17341 gtggctgcag caaatcacta tggttttact aactgtcttt agtactgatc tgcgcccacg 17401 cttatgctgg aagcccagtt taatacggga gatgttggtg tatggtttga gttactcaag 17461 ttcaacttgg atttggcagt tgcgatcgct cgtcaatcca gttattgtag gacgttttgc 17521 tggggttgaa gctgttggtt ttatcgcttt agcgattcgc ttagcagaaa tgcttgcttt 17581 tgcgaagtcc gtcacctggc gtttggctat ggcggcgctt gctaaattgg agaatgatcc 17641 atttcgccta cgcaagagta ttgaagaggg aatgcgtcta caagcactag ctgttggttt 17701 gccaatggct gcgtttgcaa tagtagcgcc tgttgtatta ccagtcgttt ttggcaaaga 17761 ttggactcca ctgctgcaag tctttccgtt gatttcaatt ggctatattt ctaattctat 17821 cttcaactta cattcgtcag tcctctatct cttgggtaaa aatctttcgg tcacttggtt 17881 tcatacagca cacatagcac tttttgccgg aagtgccttt ttacttgtac cgtacttagg 17941 gatggttggt tatggctggg cagaaatagc tgcaatagcc agttatatag tcattcatat 18001 atatatagca aaagaaatag gtagtccaaa ttacacagta gcttttggct ggttgattat 18061 atcaattgct gttctgattt tgagtacagt gaatgaaaca gtccgttacc tatcttttgt 18121 gttgctgctg cttccgctta tctcaactaa agagagaaac agcttgattg gctattttca 18181 aatactgaga tcttaaaggc tgtttgtaaa gtgaattgat gtgaacgttg 
aatgaatatg 18241 ggaattattg aacctttcaa agatgggttt ttagaaatta tcccagaagg tgagggcagt 18301 gattactggc atattgctgc aatccatatt aatggagaag ttttttgtcc tagtccccgt 18361 atttatcctt caataaatgt tgctattgcc aaagcacggc gaatttttga ttggatatac 18421 aatcatgaaa tagaaactca aggcttggga tgctattgcg aggagttaaa gataaccttg 18481 tggcgtcaac ccaagttgca tcctagttga ctattgtcca ccacatttta aaggttatgt 18541 ctgatttaga agaaacgccg ccatacacca gaaatccatt aagtaaattt tgccatcgcg 18601 cgccagataa tgagtcttca cttaaacaaa tagttttcca tgacttttcc caatctttca 18661 acccacctat gtttggacac tgttggcgaa tctactacat atttgtaaat aaaagtatta 18721 cgtaggtcaa aacctccaat ctgtcgttgc ggctttttat ttttaacact tttatctatg 18781 cttttgtaac actttcgcca acagtgtctt cactagtatt atctcagagg atgtttggaa 18841 agtgtcgtct tgtaccaaaa attatcaatg atccccctaa atccttttta aggggggcta 18901 gggatcccct ctggggttgg ggggcaaagg ggggatcaag taaaatattt gatacttctc 18961 aaacattctc tcaattctag ttctttaacc caaggtattg ctgccaaaat atagtagatg 19021 tcatctcctt tgaagcactt ttccattctc tacttacgga tgaccattca atactacttt 19081 tagcagcaac tttaaaccac ctagtcaaaa gcttaatacc agctaattta tctatttcct 19141 tttggttatt atagctttga ctaagcttta aaatgttaaa atgcaacttt tctggctcag 19201 actttttgat aaaatctgga ccttttatat agtcttcaaa agattttata agcattgtta 19261 cataatcata atcacatttt gataaagact gaattattgc ttttgtaaaa tgctttactg 19321 tatccatata ccctatcgga taatggatgg cataagttat taaatcattg cgagcataat 19381 aataagtttc ccaatttaaa ttctttgcag aagcaggttg atgccacaca gctagagagg 19441 gaaaagctac tattttgtta cccaactctt taattcttaa acaaaactct atatcatcta 19501 tctttagaaa taaaggcagc ggtaatttaa ttttttcaac aacgtcttta gaaaaagaga 19561 aaaaccaaaa tcccccataa tctatatctt cttccaccag taacctgttg agagaactgg 19621 aacttcgtaa atcaatatta tgatttgcgg cggttaatga cccaggtgca aatcctctag 19681 ttttagaatc ttcgttgtac gttgctcctg cttcatacaa catatgcttt ttctgtaaat 19741 ttagcagtcc gccagctact gcaaaatcag tcttggcgta ctcatgcaaa gaaaacaacc 19801 tataaataca ttcactttct aactctatat catcatccat aagcaaaaaa tgagagtagg 19861 tgttctcctc taaagcttca actaaccctc 
tagtaaaacc accacttcca cccgcattta 19921 tattgggaac cagtttcagc tttggctttg tgaaatcagc tttgtctaaa gttctaccat 19981 tatcaacaac aaataatttt aagtccttag tttctagcaa tttatcttga aaaattgcag 20041 ctaaagtatt ttttatataa tcctcttttt tgaaggtgca aatgatgatt cccaatgata 20101 cgtctctagt tttattttca tcagttgcta tccatgcctc tttaaatgca ccttgttcgc 20161 tagaacaggt tatttcaaaa tatattctgc cagcgttttc attttgaagg agatttattg 20221 gtaaaacttt gacgggttct gaaaattgac atttctcaaa gttttcttga gaaataatct 20281 ctttgttatt ttccccatta acttctcggt aaatagaaac tttaaaatcg ccttcaagtt 20341 tcagcaaata gtagatagag ctaagagttg tatatttagt ataaaatttt tcgtaaaatg 20401 aattaaagta agaatttgag gatataatac ccccctgacg taacacaact tttttgtcat 20461 cttcctggta gtttatagat gcagcttcat tacactgtat gtataaatca gaaacgtcag 20521 ctgatttggg gagctttatt ccacctacta tatacatata atgtctacct attctcttag 20581 taaactaact tctttgattg actgatgtta aacagcctga ccgcttagag ctgatttcta 20641 aagtgaatct taaatccttc gggtgcgtta acgagagtac ggcaccattt tccggtggtg 20701 cgttacggcg caaaagttca gtaattggta aaaactccca caaagcaagc gctatcacgc 20761 accctacaga aactgatatt aaacagtctt accttaagaa tgtattctaa ataccttcta 20821 gtgctacctc cttgaggcat aactgctatc atcagatttt tttaagtcgt ggataaaggg 20881 attgatatcg catccaagcc tcttctaggg ctgaatcaac tgattgggga gttgcagcaa 20941 gcgttggtgt ggctgccagt ttgagcgtgt catttgtatc tgcgtataca ccaattccga 21001 tacccgccaa tagagcagca ccccgcgcgg aagcagcagc aactgtagtt gcatagaggg 21061 gtattctcaa tacatcagtc agtaattgtt tccaaggcat ttcttgtgtt ccaccgcctg 21121 ctaaacgcag ttctgttgct ttaaaacctg tcgcctcaag tgcctcaaaa ccttgtcgca 21181 aggcaaaagc aactccttct aaagccgccc gcatcaaatg cgcctgtgtg tgatgaagtc 21241 caagccccac ccatgcgccg cgtacatgag ggtcaaggtg tggagttcgc tcacctgtga 21301 ggtatggcaa aaatgtcaac ccttcacatc ctgggggaac agaaaacgct ttagtataga 21361 cttgctgcca actcaagccg aggatacctc gcacccactc aagcgctagc cccgcatttt 21421 gcattgctgc cagggtgtac caccccttag gtacggcgga tcgatagaga tgtgtacgac 21481 catgaggatc gataattggt tgggagcgag gtgtaatgat ttgagcgcct gtgcctatgg 21541 ttagttgaac 
taatccaggc tctagtagtc cgttaccaag tgccgccgct gccgtatccg 21601 cagcaccagc gataactctt aagccaactc ttaagccaag atgctctgaa gcaacagccg 21661 ttaagtaacc tgcgatcgca ctagagggga taatttttgc taaccaatca taacgtagat 21721 tcagcgctgt tattgcttcg cttgcccagt tgtccgacac aacatcgtaa agcaaagtac 21781 cactagcgtc agatggttct gttgcgactt ctccagtcag ccgtaaccgt agccaatctt 21841 ttggctggag tgcccaacgt gcttgggcgt agacagtagc ctcgtgttct cgtagccaca 21901 acaaagttgg acccgccatt ccagccgtaa tcgggttgcc caagcgctct agaatagcgg 21961 catcgagcga atgataagcg ttaagtgtgg cactagagcg agtatctgcc caaaggatag 22021 caggacgcag gggctgaccc gactccgaag ctaggacaac accatgcatc tgtcctgaaa 22081 gtgcgatcgc ttgtacccga tcagcgtgat ttcccactgc cttccttact gccatagcaa 22141 cagctaacca ccaatcccct ggctcagact cagcccatcc ggggtgcggt gcatgaacag 22201 gataggagct tgatgcctca cctatagcgg ttccgtctgt cgctaggagc aatgccttag 22261 cagagcctgt tcctaaatct atgccaagca gcattaggct agtcctcaat acttgaaagt 22321 taagaataga aacttaaaga actgtcattt ctcagattct acatattcag atctccctaa 22381 atcccccttt ttaaggcagg gctgtttcat tccacaagga ggaggagttt gtcaatatca 22441 tttgcaatca atcgttgggc agaagtgatg gaagaataac catttcgacg taatacattt 22501 aaggcgatcg ctaaaataat tgaacggttt gcaggagcat tgcctttacg tatcatcgaa 22561 cggtcttcac caaaaaccac atctttaacc caatgaaggc gattctcaat tccccaatgt 22621 ccgcgaatac cacgagcaaa atcaactgct gaacgtaaaa aactactgat gtagtaagca 22681 acctgatggt atggtttccc cccacgagtg ccagttcttt caaccttaat taacgacttc 22741 aacccagccc attcacgact aatgccagtt aagtcattaa acacttgtat aatgcgtgtc 22801 gtaactcggt ttcgagttcg ttcagtggca atgtagcgac tcgttggttt ggtattttcc 22861 gtattgtgtc gaatttgacg atgtaaattt ttttgatttg ctttgactgc aatcacgtat 22921 tcattaccac tgtcgatcat tagcttgcaa gttttttttg gcaatgtaaa gctgatattc 22981 gctgaaaaca acaccttgga tatccaatgc cgcaatcaag ttttgaaccg tactaatctc 23041 gctactaccc ttattctcta atttgtccat gctgaaaacg agtcctcttc ttccagcaaa 23101 cacagacaca atacttacaa aattttgata ctcgctttta tagtcttgca ctgttccttt 23161 gatactttta cgagggtcta acacgaaatt tttggtcagc cctcatacta cgaatactcg 
23221 tcttggaaca cagatcaaca aaaacagggc tgaccaaaaa ttaggggaca atgcgtcacc 23281 atcaattcca caccactccc atgtttctaa atgaacgtag ttttgtgccc actgattaaa 23341 ggttgcagca agtgtgtcaa aatcgactcc catcaatatt cgtcgtattg ttgagtatga 23401 tggaacacca tgtttctgga ttccaaatgt tgaaatcaat acacaacgat gccgtttgac 23461 aaagtctccc catgcacgat aaccaacata caaagcgagt tgttcccata attacaaaca 23521 gcaacaccag ccataaagga tgtctctgcc cgtcttttgt gcgaaagtcc ttaacctgcc 23581 gcaattgttc aattagattt gcacccatta ctttcacccc ctcgacactt gtaattttac 23641 gtcttttagg gaatgaaaca gccctgcttt ttaaggggga ctttggctga attctccccc 23701 cttcttaaga ctggatctca tacgaggagc tgggggggat ctccagggcg cgaaaacacg 23761 ccctagcccc ctaaatcccc caattctggg agactttcgc ttgaggcatt gagcctttga 23821 actctgcata caacgcaaat agaggctctt aaagcttatc ctatatagat ttcagccctg 23881 cgcccgatga aaaggcgcag cgtccgtggt attaagaacg gtagaagccc tgcttgccga 23941 taatacgata gatgcccgaa ttgcgggaaa caagaacgtg tctttttacc ataagaaaaa 24001 aacgcccgcg tcgaacgctt taaccgatct gattcgcgcc gtacttcgta tgaacgcaac 24061 ggtgcaaaaa tcaggaacac gcctaatgcg gggtacggga ataaccaatg ctcggtggca 24121 aatgctgagt gagttattcg cacttgaaaa gcgcgtcacg gtaagcgaat tggcgcggca 24181 tatgggctta acacggcaag ccgtacaacg actcgctgac gacatggcaa gcgacggtct 24241 ggttgagttc gccgaaaatc ctggcgacgc ccgagcgatg cacttgcttc tcacggaagc 24301 cggcaggaca acgtatcacg atgcgttgga gcgcgaatgg cagtggacaa atgcgatcgc 24361 cgaagacttt gacgcggaac aaatcactcg tgccgtggca cttctggaag ccattacgca 24421 gaagatgcaa accgatgatt gacaatatgt tgtcaattag gtaacattca tattgacaac 24481 gtgttgtcaa tagctctcgt gagaaagcca ccaacctgat gccacaccca accgccgccg 24541 aacagttcga cccgcataac ccgcgattta ccgccgatcg atttacgctc ctcgcgcaga 24601 tgcgctccga ggctcccgtg acattcctgc ctgcacttca ggtttacgct gtgacgcgct 24661 ggcaggaagt gcacgacgtt ttgggtgatg ccgtgacatt cgcgtcatcc gaggcgttta 24721 gcgcgggagt tcatctcgcg cccgaagcca gtgccatcta ttcactcacg tcaccgctgt 24781 tcgcctacaa cctgatcaac gtggataagc cgctccacac ccgcttacgc gacccgctca 24841 tggcggcgtt cacccccaag cggattcagt cgctcgcccc 
gacggtgata gcggatgtcg 24901 aagatctgct cgatgcgatc gcggcatcgg gcggtgaaac tgacctgctc cttaccttgt 24961 gcaagccgct tgcgctacga acgatctgcc gactactggg tgttccgctt gcggacgccg 25021 agaagttgag cggctggtcg gatgcccttg tcgcgttcca gacacccggc ttacccatcg 25081 aggtgcaagt cggcgcagca cacggtttgc gggcgttgga aaactacatc cgcgaaatgg 25141 tcgctcttaa agccgcgatg ccggacgacg gtttgatttc cgcgttggtc gccagtcgcg 25201 ccgccgcttt aaacgatttg tcagaggatg agttagtcgc ggatattgcg atcgttttct 25261 ttgccggaca cgaaaccacc atcaatacga tcgccaatgc gttccactcg cttttgaatc 25321 ggcgtgagta ctgggaagcc atcgcatcgg gaaccgtgga cgccgaaaat ctaaccgatg 25381 aactcttgcg ccacgacaca tcggtgatgg gattataccg acgtaccacc gtggataccg 25441 ttatcggcgg cgtgaccgtg ccgcagggcg cgacgctttg ggtgtcgtat gctgcggcga 25501 accgcgatcc tggcttgttc gacgcgccgg aaacgctcca gtgtccgcgt gggaacgccc 25561 gccagcattt gacgttcggc tacggtgctc actattgcgt cgggccgtta ctcgcccgtc 25621 tgcaaatccg ggaagcggtc acccgcgttg caaagcgttt tccagagatg cgtcttgtac 25681 cgggtgcttt tgtgccggaa atccctcatc atggtcttcg cgctccgatt acgcttccgg 25741 tactactgaa ataaaaagaa atcatgcgct ttgaaacact cggcgcaagg gtgagcgagc 25801 cgtcgagtgg tcgcacaagg gcgtagacgt gcgtcagcga gacttcaacg acaccgccac 25861 gatcacaccc gaccatgtcc tcacgtaggc gagaaaagca ctctttcggt atctgagaag 25921 ataacgacgc cgacgattct aacgaactac aagcaaaatt aaggaaaaga gagcgaccat 25981 gattttggta acaggagcta ccggacagtt agggacagca gtagtcaaaa atctactgga 26041 aaagacatcc gctaaccgaa ttgccgcatt tgtgcgtgac aaaagcaaag catctgcctt 26101 aaaggaaaaa ggtgtagaca tacgggtagg gagttacgat gatactgcct cgctcgacaa 26161 ggcaatgcat ggaatcgaaa aggtcttact tatagccgga acggatgagg acaaccgcat 26221 aaagcaacac caaaatgtag tggatgccgc aaaaaaggca ggggttcaat gtgtcgctta 26281 caccagtcga acgttgaaag acagaaacac tttggcaaac aagttgatgg agggtcattt 26341 tcagacagaa gattacatca aagcgagtgg gttgaactat gctctatttc gcaatgtctt 26401 gtatatggac accattgggc agttcgtagg ggaaagagtt tttgatacgg gtatcaactt 26461 accgaccggt cagggaagag tcccctttgc cctgagaagc gaaatgggag aagccattgc 26521 gaatgcactg ttagagagcg 
gttgcgggaa ccgaatctac aaactcacag caagtgagtc 26581 ctattccttt gatgatgtcg cagctaccct ttccgactta acaggcaaag aggtagatta 26641 tacacctact gagaaatcag catttgaggc gcaaatgaaa gaacgcggcg taccggaagc 26701 gatggttcag agggtcgtgg gttttctaac ggacattaaa aacgggcagg aagaagaagt 26761 aagccccgac atggaaaacc tgcttgggcg aaagcctgca tcgcttaaag aggggttgaa 26821 agttcttttc aatttatagg gagactgacg ttgcaccaga agacccgaca acttttgcca 26881 aggaatttgt tgctactttt taaagcagat ccccgcgcaa aatctagccg accgatcgct 26941 gttcgatgaa acccgtctcc cttagcacgg gtatggcgag ttatcagtgc cgtcagttct 27001 ctacactgat tggtaatctg ttcgttggcg gagtctttgc tgggaacgag aattgtcatc 27061 ctaatttctc gactaattta atcatcttct gacagctatt gtagagcatc tttatccagt 27121 tgcagaacct taatttggag gattaggcaa taaatagcga ggtttgtgta tttaaaactc 27181 ttgctcttat tcataagatg catacatcaa cccaaaacac tcaaaggatc gggtaatgac 27241 taatcacagc ccaaaaattg cactaataac tggatcgagc cgaggactgg gcaaaagtac 27301 tgccttgaac cttgctaaaa aaggggtcga tgttattgtg acctaccata gcaatgctga 27361 ggaagctaca aaagtcgtgg tccaaatcga atcgatcggg gcaaaagccg tagcattgca 27421 actagatact ggaaatacca aaaccttcga tagttttgta gaacaagtta aacagtcact 27481 ccaagacaaa tggcacactg atcgttttga tttcctggtt aataatgcag ggactggcat 27541 aaatgcctcc attgctgaaa cgacagagga agaatttgac cacttgatga atattcatct 27601 caaaggtgtt ttcttcctca cccagaaact gctcctgctg ataaatgatg gcggacgaat 27661 tgtaaatgtt tctactggtt tgacacggat tatcttccct ggctatgccg cttatgcaag 27721 tatgaaaggc gcaatcgaga ccctaactct ctacatggca aaggagttag gatcgagacg 27781 gattgcggta aatgtggtgg ctccaggggc gattgaaacc gattttcgtg gcggtgcagt 27841 acgcgataat ccagaaatga ataaatatgt tgcatcacaa actgccctgg gtcgcgttgg 27901 tcttcccgat gatattggcg gcgcgatcgc atcactacta tctaaagaca atcaatgggt 27961 aaatgcccag cgtattgaag tctccggcgg tcagtcgctc taattcaaat caatccaaat 28021 agaaagaaaa ttatgagtag gcaattttta actctaatca ttgcagaaat agtttttatg 28081 tgttcttttc aagcacaaga agcagtttct caacctttgg aaactcaaat acgccaagaa 28141 cgaggtgaaa gagttttaaa tagtttgaca ggcggaaatg gtctgccacc gcattttcaa 28201 
caactgcaaa aggattttcc agagcttgct gatttaactc ttaaatactc ccttggcgat 28261 atctggggtc gggaggtact ggacaacaaa acccgtcaac tggtctctct tgctggattt 28321 gctgcccaag gcacgatgcc gcaattcaaa gttcatgctc agtatgctct caattatggc 28381 gtgactccac aagaactgat ggaagtcatt tatatcacca cggtgacatc tggatatccc 28441 cgcgctctaa ttgcggcagg aactctcaaa gagctattcc aagaaaataa gatcaaattt 28501 ccaataacat cacaaaaata ggaggtttca atgttaaaca ttaaagataa ggtagttctt 28561 attactggag ccagtagcgg catcggtgaa gctgccgctc ggtttttagc cgccaaagga 28621 gccaaggtcg tcttgggcgc tcgacgcacc gaaaacttaa agagtattgc aggtgagatt 28681 caagctgctg gaggagaggt tcgctttact tctttagacg tgacgcaaaa ggaacaacta 28741 gagagcttta tccagttttc acaatcgcaa ttcgggcgcg tagacgtgct ggtgagcaat 28801 gcgggtttga tgccgctttc cctcattgaa cagttgaagg tcgaagaatg ggacagaatg 28861 atcgatgtga acctcaaagg agtgctatac gggattgcgg ctgcgttacc aatttttcaa 28921 gcccaaaact ctggtcattt tgtcaatatt acatcgatcg ccgatcgatg ggtcggacct 28981 actgccacaa tttattgtgc tactaagcac gctgtgcggg tgatctcgga aggactcagg 29041 caggaagttg gtagcaacat ccgagtgacc gtcattgccc caggcgcgac tgaatcagaa 29101 ctgctcaata caatttccga cccagaaata aagaaggccg caatcgaaca attccgcatc 29161 gatctactcc ccaccgaagc gattgcccgt gcgattgcct acgctgtaga gcagcccgct 29221 gacgtagacg tgaacgaaat tgtggtgcga ccatccgcac agaaatactg acttttgcgc 29281 ccaaagattt tcagatcggg gtgaattgtt gcatcgctgt atgtctcgat ctaaaacatt 29341 ttcaaatgaa taaaacagag gaaaaaatga aagcgatcgt aattaacgca tacggcaatg 29401 aggacgtttt gaattacgtc gatgttgaac gtccagcacc gaaagcagac gaagttctgg 29461 tgaaagttca cgccgcaggg gtcaatccgg cggagtggaa agtccgcgat gggatgggcg 29521 aagcgttcgg cttaaaactt ccgctgattc tgggcggcga catcgccgga attgtcgaag 29581 aagtaggcga ggcggttgaa agttttaaga aaggcgatgc ggtttatggg ttgactgcgt 29641 ccggcggctt ctccggtggc tatgccgaat acgcggtcgc caaaacggat acaatcgtgc 29701 ccaaaccaga cagcctcagt tttgaagagg cggcggcgat tccaattgcc gcgttgactg 29761 cgtggcaagc gatgttcgat ttggctcact tgagcagcgg gcaaagaatc ttgataaccg 29821 gcgcgtcggg cggagtcggc tcgatggctg ttcagcttgc caaagcgaaa 
ggcgcgatcg 29881 ttatcggcac ggcttcgggc agaaacgaac agttcgtccg cgatttgggc gcagatgaat 29941 tcgtcgatta cacacagcaa ccgtttgaag aagtcgtcaa agacgtggat gtagttttcg 30001 atacggtcgg cggcgacacc caggagcgag cctttcaaac tctgaaaaag ggcggctttc 30061 tggtaacatc ggcgcagact ccgtccgaag aaaaagcaaa agaattcggc acagaagccg 30121 cgtttgtctt ttgtaagccg aacgcggggc aattaaccga aatcaaccgg ctgattgaag 30181 aaggcaaatt aaaaatacac atcgaaacgg ttctgccgct cacagaagtg aaaaaggcac 30241 atcaactttc ccaaagcggg cgcacacgcg gcaagattgt tttgcaagtt ggagcataat 30301 caggactgac gcaggacaca ctgcaatgcg ccttctgcgt cagtcctgtt gtagaaaaat 30361 tttccctcaa gcgagaaact cgacgacttt taccatgaac aaccaagaac agccagtatt 30421 tttcggcaaa gagcaagctg cgggttatga ccagcgatgg acaaaattgg ctccgattcg 30481 tgacgcgctt catctgctcc ttcgcatcag gctttcagaa cttcccgacg acgctcaagt 30541 tctctgtgtg ggcgtgggaa ccggcgcgga actgctttat ctggctcaag cgtttcccca 30601 gtggtcgttt accttagtgg aaccttccaa accaatgctt gacatttgcc gccagagagc 30661 cgaagaggat ggtattacat cgcgttgcac ttttcacgaa ggttatctcg attctctgcc 30721 cctttcattg cctttcaacg cggccacctg ccttttggta tcgcaattca ttatgcaacc 30781 agaagaaagg tgcgcctttt ttggtcaaat atccaaccag cttcgccccg gtggatattt 30841 ggtgagcgca gatttggcct ctggcacatc tgcttcagct tacgagaatc tttttgaagt 30901 ttggctccag atgcagcgat ttaatggaat accagaagag gcaatcgaaa agatgcgcct 30961 tgtttacgga cgaaatgtcg ccgtttcaac accacgagaa atcgaagaac ttatcgcatc 31021 aagcggcttc gatgcacctg tgctattttt tcaagccttc ctcattcatg cttggtatgc 31081 cagacgaaca acctaaatta agagttgcca accacagcct aacaaggcgc tgcacccaac 31141 tgcctacagt ttcgctacga cgttgatttc gttattttcg ataaaaaaaa gcatctccga 31201 tttatcgaaa aatacagcga atggattnnn nnnnnnnnnn nnnnngcaca tcaaagaaat 31261 tttcggcgcg gacgaatctt tttggagagt attattaaaa tagaaaaata gaggtaaaga 31321 attgaacgat gcaatctata caagaaagag agagttatta taagcaaggg gctccctggc 31381 tacctgacga tcttaaaaat gaatttgggt tattcaatgt ttttaatttt aatcctggta 31441 agaacgggaa ccctccacaa ttgccatata gcaaaaaaga ttactacaaa ataacaatta 31501 ttaaaggcag tggcagtggt atttttctct 
atgcggatag agagattgag atagaaaact 31561 attcaattat cttttctaat ccgcaaatac cttatggatg gtcacaaaga gaaaactttt 31621 cagatggttt cgcttgcgtc tttgatcagg cattttttca tcaatacgga aatataacta 31681 attattcagt ttttcagccc ggcaataata tctaccaatt agatgaggag caatttgtcc 31741 aactagaaga catttttaga cgaatgtttg aagaaatcga gtgcgacttt attcataaat 31801 atgacttgct cagaacactt gtttatgaat tagtgcttta tacgatgaaa atgaagcctg 31861 cttctaaatt gagtaagcaa cccattaatg cttcggttag aatttctaca cttttcacag 31921 aacttttaga atgtcagttt ccaattgatg atatacatag accgctcact cttcgatctg 31981 catctgacta cgccaaaaat ttgaacattc atgtcaatca tttaaacagg gcattaaaag 32041 aaacttctaa taaaactaca tctcagttaa ttattgatcg aatattacag gaagccaaag 32101 ttcttttaag acaaacttcc tggacggttt cagaaattgc gtatgccttg ggatatgcgg 32161 aagtgaccca ttttaataat ctcttcaaaa aatatctgaa catcacaccg acaaattata 32221 gaaaagccga tattgtttga tttgcataag caattatttg tttagcgtaa gcccatctgc 32281 ttgtctttcg actaatcttg tatcagtaaa tcaattgata attaaaaatc aaaaaatcat 32341 gaaaaaagta ttaatcaccg gagccaataa aagtattggc ttagaaactg cccgccaact 32401 gctgcaaaag ggatattaca tttatttagg cagccgtaat ttaaagaatg gactggaagc 32461 agtagaaaag cttaaagccg aagggttgaa cgaagtggaa gccatccaaa tagatgtcag 32521 cgatgatgaa tcggtaaagg cagcccgtgc cgaaataggc aaaaaaaccg aagtgttgga 32581 tgtgctgatc aataatgccg ggattaacgg aggtttacca caaacagcaa ctggtgccag 32641 tatagacgca tttaaaaagg tatttgatac caatgtgttt ggcgttgtga gagtcaccca 32701 gtcttttatg gatttattga aaaagtcgcc tgaacctcgc atcgttaatg tcagttcggg 32761 tatgggttcg cttaccctgc acaacgaccc gacgtggaag tattacaaca ataaaggcgc 32821 tatttaccac ccgtcaaaag cggcgctcaa tatgtacact attgttttgg cttacgagct 32881 gcgcgacacg ccgttcaaag ttaatgccgt ttgccccggc tttgtcgcaa ccgattttaa 32941 taatcaccgc ggaaccggaa cggtcgctga ggctggaacg nnnnnnnnnn cgcgcatcgc 33001 taaatacgct ctaattgaca gcgacggacc gaccggaaag tttttcagtg aagaaaacaa 33061 cccggaaacc ggagaaattc cttggtagtt tcaggtagtt aacaacgctc attgcaagga 33121 actaagcaat agcaaaacct attgatgaag aaaaatttat caaaacagaa tacatgcgtc 33181 agtcgccaga 
attcagaatt gaattctgta cgactggtga ggcagtccgg tggacgggtt 33241 ccccggcata aagtaacgga cgtacccgca gggcagatga ataaacgggt ttaaaacccc 33301 ctccaaattg aaaatttgga ggtcttcaat tagtggcggg tctgaatctc ccactaattg 33361 attgtggatt ctgaattctg acttctggat tcttcttcaa gcagtagata aaattaaaaa 33421 tacatattta aggttttaaa aaggaaatat ttatgaaaat tgtagtaaca ggttctttag 33481 gaaatatcag caaaccactg acacaacagt tagtgcagca aggacactcg gttacagtta 33541 tcagcagcaa agccgaaaga caaaaagaca ttgaagacat tggtgcaaaa gctgctatcg 33601 gcacaatgga agaccctgat tttttatcag ctactttcaa agatgcagat gtagtttatg 33661 ttatggaaac gcttggtgct gacagtttct ttaaccaaaa tcttgacatc atagctgcta 33721 ttaccaaaat tggcaataat tacaaacaag ccatcgagca gtcgggcgta aagcgtgtgg 33781 tgcatctcag cagcattggt gcccataccg ataaaggtaa tggtattctt gtttttcatt 33841 acaatgtaga aaatattttg aaacaattac caaacgatgt ttccattaaa tttatgcgtc 33901 ctgtcggttt ttacaccaat atgtttaggt ttattgagac cataaaaaca caaggtgtaa 33961 tcgtttctaa ttatggtggc gataacaaag aaccttgggt ttcgcctttg gacattgcgg 34021 ctgccattac cgaagaaata gaaaaaccct ttgaaggcag aacaattcgt tacatagcaa 34081 gcgaagaagt ttcgccaaat gaaattgcaa agattttagg cgaagcaatt ggcaaacctg 34141 aattgaaatg ggtggtaatt cctgatgaac aactactgaa tggtatgttg tccatcggaa 34201 tgaatctaca ggtagccaat ggttttgtgg aaatgcaggc aagccaacgt agcggactgt 34261 tatacgcaga ttattaccgc aacaaaccaa cgttgggcaa agtaaaactg acagattttg 34321 caaaagaatt ttctacagtt tataaccata aaacgcattc aagcattgag cggatataag 34381 ttcaattttg tagtcggtaa tcgctaccga acaggcaaag actcgattgg ctggcactcc 34441 gataatttct ctcagatagg taaaagacct gcgttagccc aaggctttgg cgcagccatc 34501 gcctctttaa gcttaggtag cactcgcaaa tttaagctgc ggcacaagga cagcggtgaa 34561 actgttgact accatctaga gagcgggtct ttgttgatca tgctccctgg ctgtcaagaa 34621 gactgggttc acgcggttcc caaaaccgct cgtccagttg gcggacgaat caactggacg 34681 tttagaccac atgtggaggc gattctccaa gggagaggct ccgccaaagg gttttgctga 34741 tagccggaac ggatgaagat aaccgcataa agcaacacca aaatgttgtg gatgccgcaa 34801 aaaaggcagg ggttcaatgt gtcgcttaca ccagtcgaac gttgaaagac agaaacactt 
34861 tggcaaacaa gttgatggag ggtcattttc agacagagca gatatgagga tgtgcgattg 34921 cgcggagcgc agtgccggag gcaatcgcgg gaatatctct tcccctgttt ggttaaaata 34981 gagaagtaaa aagctgaaat ctatatagga taagctttaa gagcctctat ttccgttgta 35041 tgcagggttc tcggctcaat gcccttgaga cacatgaaaa acaggagcta ggagcatctc 35101 ccctagcttt caaccacact ctgctttttg atgacaacca ttttttacgt aggttgacgg 35161 cggtgaagta cccaatcaac cacccacagc acatcatggc attagtgttc ctttagcacc 35221 aaactcatgc aaactgctca acttgtcagc aacagtctcg acgaaacgtg gcgactgtac 35281 taaatctcca aaaatctcaa acatgctcag caagggtctg ggatctgttt tacctgagca 35341 agctcgtctc gttatgatat cagctaaggg gtcatcaatg acgataggac ggttctggtc 35401 gtcataacca ttgaggtatt gaaaccaagc agcaatcgtt aaacttaggt aatcgatcgc 35461 acctccaagt tgtaatttat cgcggagcga tcccaaaaca aacttaggaa ttttggctga 35521 tccattcagg cacaggcgcg gaagttgatc gcgaattttg ggattggaaa accgttcaat 35581 taaagtcttt ttataatcgt ctaaatcaat cccaggaacg ggttggagcg ttggtgtcac 35641 ctcgtccatc aagttagcga ctgcttgctc aaacaaggga tcagccatga cttcataaac 35701 gtaggtataa cctgccagag agccgagata gccaatcagc atatgactgg cattaagcag 35761 ccgaattttc atcatctcgt agggatgaac atcgctcgtc atctgtacac caacggattc 35821 ccaatcgggt ctgcctgcac aaaaggtatc ttcgattacc cactggatga aaggctccgc 35881 gacacatgga aacgtatcat caataccaaa ctgttgggcc accattttga tatcttgtgg 35941 ggttgtcagt ggagtaatgc gatcgaccat gcagttggga aaggcaacgt gttcagcaat 36001 ccaacgtccc aaagccggat cgcgcatttg ggcaaatgtc gttagcattt tccgcaccat 36061 gttgccgttg ccctgcacgt tgtcgcagga cagtacggta aagggtgcca acccctgttt 36121 acgtcgcttt tcgagtgcgg ctgtcaaaaa accgtatgtc ccgattggtt ggtcaggatg 36181 ttgcaaatcg tgctgcatcg tcgggtggtt tacatcgaaa ttgccacttc cttctatgta 36241 gtagtagccg ctttcagtga tcgttagggt aacaattcgg cattcgggag ctgctattgc 36301 ttcgatgact gcttgacgat tgtctggtgc aaacaggtat cgagtgattg aaccgatgat 36361 tcgagcgcta tcgccttctt gcgatcgctc aactaaggta tacaaacaat cttgggactt 36421 aagcgcatcc cgcattcgcc tgtcatattc attgtcaagc aatccaacac cacagattcc 36481 ccactcacta ccaggattct gatggaaata atcatcgaga 
tagagcgctt ggtgcgatcg 36541 atgaaatcca ccaaccccaa tatgcacaat tccattagtg atttgacggc gatcgtactt 36601 gggtacgcgc acgtttccag gtaaacgaga cagggatgct tcattcagct tgattgttga 36661 gcctgtgttg ggattgctgt tcatcttaat ttatgctaaa tggtgacgac gagtttttga 36721 aaccgacagc tttccaggct tcactggaat cgtcacactg gcatcaccag gaagtttgac 36781 aattatacca gagtcattga cacctgttac cgtgaccgag agaaagttca tcttaatgcc 36841 ctgtttcttt agaccagggt gggtgacaat atcccgataa gatcgtgcat ccgctcaata 36901 atttgctggg caccagccga ctgaagagca tcggcacggc ttgttcgttc atcctcgctt 36961 atatgagttc cccctacata accgataata tgcccaattc cagcggcaac ggcagacctt 37021 accccactga tagaatcttc aactgccacg caatccgaaa cttcagcttc cagacacttc 37081 actgcatgta ggtagatgtc aggaagcggc ttgggtcgcg caacggggag agagtcgtga 37141 gcgctaaata cctgctcgct tgggaaatag tcggtcaggg cagcagaagt aaggcaagca 37201 ctcagccgct gtaagctgct gttacttact agtgcatatt caaatccgtc gtcgcgcagg 37261 taagacagga cttcaggcgt tccttcagtc gcttttgcct gtacactaag ccgttcaatc 37321 gctctctttt cttcttcagc cactagcctc tcaatatcgc gctcattaag agtggttaat 37381 gagtctgcgt agatcttctt aagtatttcc ctgtaaggtt tccctgcgaa gcttttaaca 37441 aagtcctcaa gctcatagtg ttgagcgcca gcgaactcac gagccacttc atttgtcagc 37501 ccccacgcgc tcttaagtgc tatggtttca ctatcaacca cagttccgtc atgatcgaaa 37561 agcaccactt tatcatgctg aagcggtttc atctcaattg acctctctca ctataattct 37621 taagtgtagt caatagtccg tgggtctgaa gatctcgcaa agcttggtgg cattcattat 37681 ctacgacctc tgtatgttcg caattctcca atcccagaat cccgctaatc accgcgcctt 37741 cttgaatcgc tgtaactgcg ctttgggtat cctttgcatg ataaggctga ccagactcgc 37801 tcgccccctc cctgactgca agaagccacg ccgcaacggg aagtataatc cgcttcatgc 37861 tgacctttcg gctgtaagca tcctgaagta tcggaaatat atacttgcct actttcctgg 37921 ctgtctctga agagattctc tcagtagtat cgggaagatc ctcgttactc agccttagta 37981 acacctgctc catatattgt tcgcacatct cgcgaggcac tggtgttgct gctgcgatat 38041 cgctcatgag aagccgagta aacaagtgga attctggttg cctgatcgcc tcatgcatat 38101 actctatacc tgctcttaca gctaaacagg ctatgaagga gtgtactgca ttcaagaatc 38161 tcgacttcat atacaaaaag 
ggagtgatgt ctttcgtcat aatgaccccc gcctcttccc 38221 agttgggtcg atcgcttgtg aagttgtcct ccacgacaaa ctgccaaaat ggctctgtta 38281 caatgggtgc gcgatcgcga acctgtaaca gtcggctggg gtagttatag tcggattctt 38341 gctcttgagg cacaatcctg tctacaaccg tattaggaaa caccgcgtta tccctgatgt 38401 attctgcaag aaacggctca atcaggtcag cgtaagctag cactgctttt cgcaggatct 38461 ccccatttct tggaaggttg tcgcaagaaa gggtggtgaa cggggtcatt ccgctatcgc 38521 gtcgtcggcg tagcgcctct actataaagc caattgcagt tgttggagtg gaaggatttc 38581 gcagatcgtg cgcgatgtct tcattcgctg tgtcgagatt aaaactcttg tctagatggt 38641 atcctccttg agtaattgta agagtcacga gatgcacaga gggagacgtc atcttttcaa 38701 gcacatagtc gcacttctct ctaccattaa cgatttccct gatagaacct ataactttcg 38761 cttcctcacg atttttagcg caactagagt gtttggtcaa agtgtaaagg aatctttgcg 38821 gtttcaacct agtaatggtg ccttggctct tgaggctaac cccacaaatc ccccatctct 38881 gctcttgtga ccccttctgt gcaagatagt tatgtattat gtaggcgagg tgacccctaa 38941 aaaaacgacc aggtccgaaa tggacgatac cagtattaag cgccctcctg tcgtaaggag 39001 accgttccct ccacaaacca gttatagtac cgaggcgaga atttggaaca ctgattggtc 39061 tgagtaagtt aaccatgaga cttcttaaat aagtggaaat aagtggataa tcaggtaaaa 39121 agagtaggtc agtctgggta tagcccatga cctacaagca gcacaaatat ctcaatgtgc 39181 gaaaatttca ggcatatgtg gcatttacaa ggaaacgttt tgccacaggg taaatagggt 39241 gtgacaggta gaacagaaaa agaagaaatc aggtcaacct aatcaattaa gtatcaaagt 39301 tcaagttgtc attcatctgg attaaggtac gataccgcac atacttctac cttgttcaac 39361 cttggaaagt gcgtgattaa aatgtgtgtt ttataattgg aaaattagag acgatggatt 39421 ctcttaggag tctttaatct accggacaaa aagcaattac atgaccgcac ctaggctggg 39481 aaagtaattg taattgatgc aaatgaaacc ccaattagaa cgacccaaaa aagccagaag 39541 cgctttatag tgctaaaaag ggctacagac actcaaatat cagtctgtct gtatttttga 39601 gtgatatgtt gaaggatgtc ccctttcacc ctagccttaa ttacgggcta actgtctgtt 39661 taactacaat taaagactgt tttttatagg ctgcgatcgc atctttgaaa tcctgtattt 39721 acattcctca ataactctgt acacattcat agtatttacc ttggtagctg ctatactctg 39781 tcttcaggag aatctacaag aaaatcttta gagggactta taacaagaat caagtgtatc 39841 
actcactact tgaaaaacag ccatttgagt ttcagtaatt gactaatagt cattcgctta 39901 ctagggatac cgaattgggt taagaccaac ctaactgttc taaaaaagta attactcccg 39961 aaccgcgctt aagattacct ataatgtagt aaaaagtatt ataagaggtg acgagggtag 40021 catcgtgggt gaatatgtaa gaaaccgaac attccaacta tgcgaatgcc tacggcacgc 40081 tccgctaacg ccaactcaac ttggatttgt ggcaaaaact gtactacaag tatcaaaaag 40141 actatgtgcg aaaaagatta ttagcaatta aatatctgta cgaaggcaaa aatagaacag 40201 aggtttcagc aattataggt tgtaattata aaaaactact aatagaaaaa caggtaatag 40261 acagacagga tgctctaaag ctgaacgaac gagatgcaca aagacgaaag cgcgtgttgg 40321 agcgatatca tggtaatgct ctcaaaacag gcttagactt ttaccatgca gccatgatat 40381 ttcaacatgg agatgaccca ggagactatc tactggcgca cgatttagcg atcgcagctc 40441 taacttttaa agataagggt gcagaagaag ctaaatggtt aatagcagct acccaagacc 40501 gttttctgat gcatctggga cgtccacagc gctttggtac gcagcaaatt accaccaaac 40561 caaacgcaaa caatttcaga tgcgccagca tttataactt agataactca cctgcaagtg 40621 taaccgatga acaccgtcaa atactaaatg ttcctactct caagcaagcg caggaaaaac 40681 tagaggaatg gaataaaaaa tgtaaggatt gaggaaggtt attgacccca actacagtgc 40741 ttgctccgct cgtgccttca tcatactgaa attctttcgt tgttgataca aagttaaaaa 40801 attcagatca agaggcgatc gcatgattgc gtctgttgct ttgcgcgatc acacttcact 40861 acctgataca ctaaactcag agtagcttaa ggagtcgtaa acttgccgac atacaagcag 40921 tccaacgaga gcgatattcc tttgtcagct atccacttaa tctcacccaa ccaggtttag 40981 cctcagagtt cctctgtgta atgtaacgga caggtatagt aaaaatgcaa atttgtcaag 41041 aactctcact cactcgcatg ggctgcggaa cttgggcatg gggtaaccga ctgctttggg 41101 gatatgacga aagcatggat gacgagttgc aagccgtctt cagcctttgt gtgagcaacg 41161 gtgtaacttt atttgatacg ggtgattctt acggaactgg gagattgaat ggacgaagcg 41221 agtcactcct gggacgattc tctagagaat atctcggttc aggcaaagaa aatatttgca 41281 ttgcaaccaa gcttgctgct tatccttgga gatggacacg ccaatcaatg gtgtcggctt 41341 gcaagtcatc tgctaagcgg ctgggaaaaa acgtggattt ggtacaaatg cactggtcta 41401 cggcgaatta tgctccctgg caggagggag gacttttgga tggtcttgct gacctttatg 41461 agcaaggact tgtcaaggga gtgggcttgt ccaattatgg accaaaacgg 
ctcaaacgcg 41521 tacatcaaag atttgcagaa cgaggaattc caatcgcaac tctgcaagtt cagtactcgc 41581 tgctgtccac gtatcccgtt accgaattgg gagtcaaaga tgtttgtgat cagcttggaa 41641 tcaaactcat tgcctacagt cctcttgcat tggggctgtt gacaggaaag tactctgaga 41701 aaggtccttt tccaaaaggc attcgaggtt tgctgtttag gcaaatgtta ccaggaatcc 41761 gtccactttt ggcaagctta cgagaggtgg ctcaatccag aaacaaaact atgtcacagg 41821 tagccataaa ttggtgtatt tgtaaaggag ctattcctat tcctggcgcg aagagtgtgg 41881 aacaggcaaa agagaatatt ggtgctttgg gttgggaact aaattctggt gaaatagcag 41941 agttggatca agcggctgcg agtgcagaca aaaaaatggt gcaaaatatt tttcaaactc 42001 gataaaatgt tcaagaaatc tcgcaagata gtttatatgc tcaacaatga acagtgaaca 42061 gtaaacagtg aacacttcga caagctcagt gcatcgcagt gaaaactggt aactggtaac 42121 tggtactgcg ggtacttgat acaaacagtt tgagcatgat agacgtggtt cgggaaggtg 42181 aagtggctca aactaagaag aatgcgcacc aaacgcaagg acaagagatt tattcagggg 42241 acttttcatg gtctttttgg tttgctttgc cactttaccc ctttggtaaa cggcggacaa 42301 ttcgcaaaga agtcctgaag gatacaattt ggacttttga ccagcttcag ggcattttct 42361 atgttgtcgt gccgattcgc atgaccgttg ttaagttaaa tgaaggtggt ttgcttgtct 42421 atgcacctgt cgcaccaacg accgagtgta ttcgccttgt caatgagttg gttgcagaac 42481 acggcgacgt caagtatatc attctgccaa ctatctctgg tttagaacac aaagtcttcg 42541 ttggtccatt tgcgagacgc ttcccgaatg cagaggtttt tgttgctcct aaccagtgga 42601 gttttccgct caatcttccc ctcagctggc tgggtttgcc ttctaagcgt actcagattc 42661 tcccagaaga cagtagccaa actccttttg ccgaccagtt tgattacgca atactagata 42721 ctatagacct tggacccggt caatttgcgg aagttgcgtt cttgcacagg cgatcgcaca 42781 cgttactcgt cacagattca gtggtttccg taccagaaga accaccggcg atcgttcaat 42841 tagatccata ccccttgctg tttcacgcca aagacaaagc gtctgacata gttgaggaca 42901 atccggcaaa tcgtcgtaaa ggatggcagc gtatttcgct gtttgctttg tacttccaac 42961 caagtatggt agagattatt gcatggggtg aggtatttcg taacgccttt caagcaccgg 43021 aacgttccaa taaagcttac tttgggttgt ttcccttcaa atggaactca aattggaagt 43081 cctcatttga tgcactgcga gggaatggtc gtttatttgt cgcaccaatt ttacaaactc 43141 tcattctcaa ccgcgcacca aaggaaacta 
tcgactgggc taataaagta gcaagttggg 43201 actttgggtg gattattcct tgtcattttg actcaccaat tcaagcacag ccgcatcaat 43261 ttcgacaagc attctccttt ttggagcagt cttctagtac ggggttgagc agtagtagtt 43321 atcctttacc tgaggaggat tttaaattac ttagggaact cgataaaggt ttaaataagt 43381 ttggtattgt gccgcctgca aaggagttta aataagccct gtgtaggcat tgttttcaga 43441 attgactatt agtcatcttc tttgttaaac aattctaaca aaccaattgt gcctacaaaa 43501 aaagtatacg tgccatattt tagtggcgtc ttttttcttg aaggtgcaaa aaaagctgct 43561 acaactgctt caaggaaatg gattgtgact gcaaaacgct caatccaaaa aactggattc 43621 aggctgcttg gtactttagt gtgagttaaa actgcataaa tattccataa ttccaatcca 43681 attgcacttg ttatgagaac tgtagataga actttgacaa aagtaaaaat attgttgttt 43741 tgatagttta atttcatggt atttactagc aaaccccgtt gttgctgata ttgcctgtga 43801 tgggagagtt agagcatatg cccctaaagg agaacctggc gcattgatct ttaaatatcg 43861 taatgcagaa ttttgcagac actatgtcaa tagcggtgag tggctcgctg ttacacgggc 43921 agggggagaa attccaattt cagccaccta attcagaagc gactttccgt gtagaggttc 43981 tgatctaaag gtgagtcgtt agtttgcctt tgtaaattga actaaaatca agggcaggag 44041 gatctgatta tgcaaactac tgcccaagaa atttataccc aggtggttcg caatttatca 44101 ccaaatgagc gactgcgatt ggcaacactt attctgaatg agcttgttgg gcaacaacag 44161 ctatcatcgg tcgatcagag tgatacctgg acccaagagg atcagatcga tcttgtgaat 44221 ttttcactac aatacgcagc tacaactttc tctgatatgg aggacgttga acagtgacgt 44281 tctgtcctgg agatttagtt acagttgatt ttcctggtat aatatcaagt ccggatgatc 44341 acttgtcatt gcgagcggag cggtagcgta gcgtggcaat ctcaaatgat aatttacaga 44401 tgaatagtca gccacaaata accgcacgac aattgtggat tcgtttaggc ttatttccat 44461 aattgtgatt tgaacaataa attaccgcaa gttgctatca gtaaaatttg ggctaccttt 44521 aatatattga ataataactc agtagattgg tgatgacaca gaaagaattg ctgcgggtaa 44581 ttgaggaggc ggcgagtgag ggagcgacag aactcgacct ctctggcaaa gagttgacgg 44641 ttttaccgcc tgagattggc aagttgactc aactcaaaaa actgattctc ggcaaatata 44701 aatataatga cgccggcgat attgttgata ctattgggaa taagctgagt gctttacctg 44761 cagaaatcgg acagcttcat catcttgaag aacttcaggt tgttgataat cgcttaagca 44821 gtttgccaca 
ggaatttgga caactcacca acctgcaaac gct // LOCUS NODE_527_length_43848_cov_4.894024 43848 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 43848) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 43848) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..43848 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 7..651 /locus_tag="DP116_03440" CDS 7..651 /locus_tag="DP116_03440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208505.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03440" /translation="MSQSPSQNESREMLKWLNHNRQMLLDLYRNQYVAYNANGLIAHS ENLREVLELAEASKQLFAIYLVPRRTASIQILPIRFRTVSRHDWQPNYHVRLKHRDIN ISTTILVDSGAELSLISLKVGQDLGYALADAESALLAETIGGRVEYVLRNVEMTIDGH SFLAPVAWLQTNTGGEQLLLGREVVFDKFNIEFRQADEQIIFKWREDLFLKSGF" gene 739..1914 /locus_tag="DP116_03445" CDS 739..1914 /locus_tag="DP116_03445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863222.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03445" /translation="MRHEQQQDLESTSTQEATTSTDIEQNLQEVEEKVSFKQKTERIT AWSDFIKSMTPFIWAVVIIIVLIPLLGKALITGSSPGKLADSGQNPSQEVSIVVPQIP KDIDQALVTALKNAHSQAESFASEQLDNWVDELMTRVDESFFDWYFNYFNQKKMELSA PFTWLYSAVTHWTNKNKPSPGQAVAEKLTEDFQVEFAKRVLRPKVAQLEFEKITTDSI NLYVTELSKNISNIQSSYKIPQGDWERYLGDIAVTINDMEGNISHLSLKVLSGGSTYL LAKAMIPTVTKVGSKIAVSFAGKAGAKMAAKTGGVVAGKIGAQLLDPIVGIGIIIWDV WDYNHTVAVEKPRLREAIYDYLKEVKFSLLENPENGIMVAINQVENGILKSIKSSPQ" gene complement(1940..2965) /locus_tag="DP116_03450" CDS complement(1940..2965) /locus_tag="DP116_03450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015190120.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_03450" /translation="MSDYCTTKAGIFQQVKSVEETLPIRNDACRSKIFLHPHPTSKVC LFFHGFTAGPYQFEPIGKKLFDAGYNVLVPLQPGHGVAGDWNGDNPPPLPTEREIYQG FALYWLKVAQNLGEQVIVGGISTGGNLAAWLALEHPQSIEKALLFAPYISGNNAIINF VVEVLPIYYEWLNKDNTGNFGYEGFRIPALRIFLEMGQEIIDRVKTNLAVPMFIIYSE SDPAINHCELQDFSKTTIEQQAKSWYYSFDKIFEIPHTMMTKAEGNQYQDLLMTMAKA YLESEVTWSEVMELGYQILQGKTFDLATQQLNLTERVSPDLSVLLAVMDKKIIIDSFK ERDSACF" gene 3252..3809 /locus_tag="DP116_03455" CDS 3252..3809 /locus_tag="DP116_03455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03455" /translation="MAADLQQIAYYLDNLGWDYRIDPEEDRIITGVESDNVEDFLIVV QLDEEGKFFRLFAPQVLSGVSDHPHKVAILQTMLYISWETKMLQWEYDPSDGEIRAII EFPLEDSILTEKQFHRCLHGLVQLVDSVAIPRLMTVMETGHDPGNVELGERILLSIQE QSPGLLDLLEKAMEARKKRGSFPSE" gene 3885..4529 /locus_tag="DP116_03460" CDS 3885..4529 /locus_tag="DP116_03460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191436.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03460" /translation="MTSYVTSSAKAEMSELRRLKGLLPPELQSWVTVEGTTEVNPPMI RCEEIGKDQVEVQIDLAKWDSLAMDQRNLLFWHEVARIQNDTIPKDGWEMAALAIGLG GAVGELWVQDGLLLVLALGLCGVSGWRLYQKNSGEKQFKELVDADEKAIALATRFGYT LPNAYKSLGSALKTLIDTTPSKRLRSRYEARLSALKRSANKAKSKSRNIDEGEI" gene complement(4631..6172) /locus_tag="DP116_03465" CDS complement(4631..6172) /locus_tag="DP116_03465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009460182.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PEP-CTERM sorting domain-containing protein" /protein_id="PRJNA477356:DP116_03465" /translation="MIQAKHLFSLLSCSLILSAVSLTGYIKSAYGISLNNTLSIGGES TDLYPSSGNSANVNRLGFFSDLYYDRSNNVYYGLGDRGPGGGRISYNTRVQKFSLDVD PNTGAIANFQVLETILFTKDGQKFNGLNPTLLNGNAGTLGLSFDPEGFAVAPNGHFYV SDEYGPSVYELKPDGSFLRAFTIPENLIPKEANGKLNYVDGRPTITNGRQDNRGFEGL TLLPDGSKLYAVLQGPLVNEGSNDGSPDGRRSGNLRLVEFDTATGKSTAQYIYQLESL ADINSRIPGTKDDFAATSQGRNIGLSSITALNEKEFLVIERDNRGFGVDAVNGLVADS SATPSPVATKRVYKIDLTNATNVSTISLANTNTLPTGVNPVSKSLFLDIAATLSPDNP GDWTKIAEKMEGLAIGPRLNDGSYALLIGTDNDFSVTQDDNSTTQFDICTNANGTSYS KRPIDSGCPNGQKLIPGFLYSFKVSSAELGKFVPPQKVPEATTTTGLILLGLSGLWLR RRRHP" gene complement(6379..7176) /locus_tag="DP116_03470" CDS complement(6379..7176) /locus_tag="DP116_03470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311258.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="radical SAM protein" /protein_id="PRJNA477356:DP116_03470" /translation="MTIKTTVTPTARLIEVFSAIQGEGLNVGTRQIFIRFALCDLRCH FCDSAHTWNAPATCRIEQTPGLRDFEIYSNPVSLPMLLEWVERQNLPCLHDTITLTGG EPLLHTPFLVEFLPQVRSVTSLPVYLETGGHRPELLNTILPHIDSVGMDFKLPSASGE NRWREHAEFLQLCWTSQVEVFVKIIVSQTTNPTELERSAELVASVNPSIPVFLQSVTP LDDPGAFKQLPVAAPSPSQVLNWQALMKRFVKCVRVVPQTHKMLNQL" gene 7510..8502 /locus_tag="DP116_03475" CDS 7510..8502 /locus_tag="DP116_03475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873490.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="anti-anti-sigma factor" /protein_id="PRJNA477356:DP116_03475" /translation="MTSQPKEVNFPVTFSNNTAIVQVPARLSVLEAVAFKQTCQEITR AYPDTKQIIIDFHQTTFMDSSGLGALVSNLKIAHEKNIDFILRNVTPQVMSVLNLTGL DKVFSIESQGETPSKSQLEELPTTHPSVRSWMKRFIDIVGSLVGLMITGVLFIPIAVA IRLDSSGPIFFSQTRCGWMGKRFLIWKFRSMYIDAEARKAELEKQNQVQGAFFKIDND PRITKVGRFLRRTSLDELPQFWNVFKGEMSLVGTRPPTPDEVERYEVPEWQRLDVKPG MTGEWQVNGRSSIRKFEDVIRLDLQYQKNWNLIYDLMLIFKTVAILFNRNSGAV" gene 9269..10585 /locus_tag="DP116_03480" CDS 9269..10585 /locus_tag="DP116_03480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191433.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="colanic acid biosynthesis glycosyltransferase WcaI" /protein_id="PRJNA477356:DP116_03480" /translation="MRILIYSYNYYPEPIGIAPLMTELAEGLVKRGHQVRVVTAMPNY PERRIYENYRGKWFVSEYKNGVQIQRSYVWIRPQPNLIDRLLLDASFVLTSFFPAIAG WRPDVILSTSPSLPVCLPTALLGWLYRCPVVLNLQDILPDAALHVGLLKNKWLIKVLT ALEKFAYHSATKISVIADGFVENLQKKGVAPGKIVQIPNWVDVNFIRPLPKENNAFRA AHNLDGKFVVLYSGNIALTQGLETVVKAASLLRHIPNITFVIVGEAKGLQRLQMACTE CGADNVLLLPFQPRESLPELLAAADVGLVVQKKNVISFNMPSKIQILLGSGRAVVGSV PSNGTAARAIKQSGGGVVVPPEKPKALAEAILDLYNNPEKVKTLGCNSRQFAVEQYAF EQALTRYESLFYSLTAQRSTIEPLINSFAEKHVLQQSEGIQIVKLR" gene 10655..11251 /locus_tag="DP116_03485" CDS 10655..11251 /locus_tag="DP116_03485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphohydrolase" /protein_id="PRJNA477356:DP116_03485" /translation="MREKVLTWLTQNVPAPRVNHILRVEQMAMDLAVHYKVTPEKAAQ AGLMHDLAKCFKPQKLLQMAQKEGLEVDEVMGANPHLLHADVSAIVARETFGVDDEEV LQAIANHTLGRPGMSPLCSIVFLADSIEPGRGDTPQLQSLRQISRQNIHQAVALTCDY TLKLLLESSCLVHPRVIFTRNWFLQKWKAKQPIVQKTV" gene 11366..11827 /gene="rsfS" /locus_tag="DP116_03490" CDS 11366..11827 /gene="rsfS" /locus_tag="DP116_03490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873487.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribosome silencing factor" /protein_id="PRJNA477356:DP116_03490" /translation="MSDYFPTNFPRQSIAVTNSLVGNSQVDTDDVSGKIALKIAEAAS DRKAGDILLLKVADVCYLADYFVVVTGYSRVQVRAIADAIEEKVKQEWQRLPLRVEGK SEASWVAQDYGEVIVHIMLPHERGFYNLEAFWGHAERIEYSTSVEGEGKPT" gene 11824..12324 /locus_tag="DP116_03495" CDS 11824..12324 /locus_tag="DP116_03495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129750.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1230 domain-containing protein" /protein_id="PRJNA477356:DP116_03495" /translation="MIKSSVSNCPVPTEQQPLNEYEELKSSWLFRDCTLNLREYIAKI AWIWGIWWLVAFPVAAASFSPYKQTAQFILGSLAGASVGVVLVLVRLYLGWSYIRDRL MSPIIFYEESGWYDGQTWMKPEEVVTRDRLVVSYSIKPIISRLQMTFAGLAVLFVAGT IVWHLV" gene 12408..13361 /locus_tag="DP116_03500" CDS 12408..13361 /locus_tag="DP116_03500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191429.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="asparaginase" /protein_id="PRJNA477356:DP116_03500" /translation="MTMGKRTQAAALEVRLLREGIIESKHIVQAVVCDDRGRVLSVAG NAETATFVRSALKPFQAIAVTSTGTIERYNLSDKDLAIMTSSHKGTIEQVRQAFNILW RADVDPSALQCPIPEGKYSRLEHNCSGKHAGMLAVCQQCNWSLNNYLQRKHPVQQLIL TKVSELLRMPAEEFISAHDDCGAPTYLMQLAQMASLYAQLASRSTLDMERIVRAMTHH SVMVAGEGELDTELMRLALGDLVSKAGAEGVQCIARLGEGMGLAIKVMDGAKRAKYAV AIHLLKQLGWITPSVAEALSEKFMNLGKYKRLEVIGELSIL" gene 13411..13487 /locus_tag="DP116_03505" tRNA 13411..13487 /locus_tag="DP116_03505" /product="tRNA-Met" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:13445..13447,aa:Met,seq:cat) gene 13625..14227 /locus_tag="DP116_03510" /pseudo CDS 13625..14227 /locus_tag="DP116_03510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860055.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(14482..15897) /gene="glnA" /locus_tag="DP116_03515" CDS complement(14482..15897) /gene="glnA" /locus_tag="DP116_03515" /EC_number="6.3.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747008.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I glutamate--ammonia ligase" /protein_id="PRJNA477356:DP116_03515" /translation="MTTPQEVLKQIRDNNIQMIDLKFIDMPGIWQHLTLFHNQIDESS FTDGVPFDGSSIRGWKAINESDMSMVLDPNTAWIDPFMQEPTLSIICSIKEPRTDEWY SRCPRVIAQKAIDYLVSTGLGETAFFGPEAEFFIFDDVRFDQTAHQGYYYVDSVEGRW NSGREEGPNLGYKPPYKQGYFPVPPTDTFQDIRTEMLLTMAKCGVPIEKQHHEVATGG QCELGFRFGKLIEAADWLMTYKYVIKNVAKKYGKTVTFMPKPIFGDNGSGMHTHQSIW KDGQPLFAGDKYAGLSEMALHYIGGILKHAPALLAITNPTTNSYKRLVPGYEAPVNLA YSQGNRSASIRIPLSGTNPKAKRLEFRCPDATSNPYLAFAAMLCAGIDGIKNKIDPGE PLDKNIYELSPEELAKVPSTPGSLELALEALEKDHAFLTEPGVFTKDLIETWISYKLD NEVNPMRLRPHPYEFALYYDV" gene 16256..16765 /gene="apcB" /locus_tag="DP116_03520" CDS 16256..16765 /gene="apcB" /locus_tag="DP116_03520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747007.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="allophycocyanin subunit beta" /protein_id="PRJNA477356:DP116_03520" /translation="MRDAVTSLIKNYDVTGRYFDRNAIDSLKSYFESGTARVQAAAAI NSNAASIVKQAGSRLFDEQPELIRPGGNAYTTRRYAACLRDMDYYLRYATYALVAGNT DVLDERVLQGLRETYNSLGVPIGPTVRGIQIMKDIVKEQVAAAGVTNTSFVDEPFDYM TREFSETDV" gene 16982..17407 /locus_tag="DP116_03525" CDS 16982..17407 /locus_tag="DP116_03525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950521.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03525" /translation="MEELLELKDLLLKGDVPGALVIVEELEEMSRNDIIKTIRSYAII LLLHLIKQQVENRTTRSWDVSIRNSVREIQRENKRRKAGGYYLCSEELFETLEEAYLN AIDEASLKVEQGRYEPEELEQLVNREEIINRALTLILPQ" gene 17418..18254 /locus_tag="DP116_03530" CDS 17418..18254 /locus_tag="DP116_03530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TlyA family rRNA (cytidine-2'-O)-methyltransferase" /protein_id="PRJNA477356:DP116_03530" /translation="MVKKRLDTLLVELNLSSSRALAQRFIQAGEVTVDGQIIDKPGTE VDIAAQIQIKERSRFVSRGGEKLAKAIDVFAIPVEGRICLDGGISTGGFTDCLLQAGA KLVYGVDVGYGQVDWRLRNDSRVVLKERTNLRYLTPDELYGVPTLSPPSKGGEYADLA VVDVSFISLTKILPALWQLLQPPREAVLLVKPQFEVGKSRVGKKGVVRDSKDQAEAIF QVLQAAGELGWKYKGLTWSPVTGPAGNIEYLLWLGMESETPPPDIETVHQITILAASE LR" gene 18361..18624 /locus_tag="DP116_03535" CDS 18361..18624 /locus_tag="DP116_03535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318171.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03535" /translation="MTTQDKARELMVQQRLQDEHLHESMLNRAEAAHPSGTEGMTQEE ARELMAQQRHHEKHLHESMLNRAEAEVGLPSDNTNSSQECSLY" gene complement(18621..20009) /locus_tag="DP116_03540" CDS complement(18621..20009) /locus_tag="DP116_03540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453757.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GTPase" /protein_id="PRJNA477356:DP116_03540" /translation="MTQPLPVSKNLQETHLNRARASLSLALSWYGNLRKPGQSSSNSQ LAGLVKPELELLTSTLNKLDYNLIRIAVFGLVSRGKSAVLNALLGQKILQTGPLNGVT QYPRSVRWNPGGKVQVELIDTPGLDEIEGESRAQMAREVVRQADLILFVVSGDITRTE YQMLCELRQAQKPLILVFNKIDLYPDTDKTTIYNNLQQLGAGNPQGKPLKPDEIVMVA AEPAPMEVRVEWSDGSVAHEWETPPPQIDELKETILKILNREGRSLLALNALIQARDA QEAIAQKTIDLRQQQAEDLIWQFTKYKALAVGLNPIPFLDILGGTVADLALVRSLARL YGLPMTGYEAGNFLKTILLSSGGLLLGELGSSLLLGLGKSTAAIASGENPINITAYAG TAITQAGIAGYGAYAVGKAAQVYLERGCSWGQLGASTVIQEILAEVEPNTILYRLRQE LEQFSFNDTSLR" gene 20577..21584 /locus_tag="DP116_03545" CDS 20577..21584 /locus_tag="DP116_03545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318169.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03545" /translation="MKRSNSNTLERCKKFFSKAKNSFTGKYQLYTGEKLNWFQVIFRL EKSVLPAILPWVIVSGLYGFLVSLLYRYGVPIGFPHKSGVLTNAILTFNVGFTLLLVF RTNTANERFWEGRKLWGSLVNTVRNLAQGIYIVVKEQSPKDKVQKEATLRLLVAFAVA MKLHLRAECLDEQVESLMSEIQYLKLQETHHPPLQIAFWIRDYLQHQHDRNCLNIYQL TALHKLVDDLVDILGGCERILKTPLPIIYSIKLRQLLLIFCLLLPLEIVTNLIWWTGV ITAFVSFTLLSIEEIGSQMEEPFGHDPSDLPLDAICNTMQHNVEELITLAPSSEFDWR V" gene complement(21850..25002) /locus_tag="DP116_03550" CDS complement(21850..25002) /locus_tag="DP116_03550" /inference="COORDINATES: protein motif:HMM:PF13476.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03550" /translation="MIPVRVYVENFMSYREGQELLFDGAPLWVLSGENGAGKSTIFDA IRFALYGSHRGGGAKTQKDLINHKADALTVEFDFLVDGEAYRIRRTVRKRGKATREVF RIDSSSKGKLRIEPEANTDSDAGFKDWVKRKISLDDEAFTSCILLIQGKSDKLIEAVP SERYKILKQLIDLTAYDKLHQLTDARRKECEAQFKNFKLQLEASETVSDAEFNAAQEV LNLTNDNLNKSRDKVDNLVTLVELSKQFQQLNSQFIEQQKDVQKWQALIARTDEIEQN FTRYKELDTVLPLLSSIIEQRQRIIDNDNEIEKIIQNYQQVKDDLKLISAEKDELSQQ IEATNQKIENKQKDSNNIDVRLLELAPLVEKLSQLESIKEKIQQLKQTLADFPSDLEE LLQKAESEEKRLDEIVKTLPWLKQLAQERSKLLDTVHRKEEASKNLESLQTQLEEYQT QQAKLNTDFANASEAENKLSHNITRFQTEYESVCNKRDNFEKASHQPICELCGQEITP EHAQAEKSRLYSQITDAETNLTRLKNEHKKAQDNLNSFSAELENLNKEITVNSNNHNE NKIQLNQAQRDAKQYTKQINTAFSNLPEGYQVNVSPYAIDDDLGWLDTSYPTEEDLEE LNQLLDNRQTHTQNLNKLRHQFSEWQSFNTQHQTYSHQFAEIEKNLPLSEAQEARNEK STLQQSQKELQLTLKTLKEEQTLTLNQVKEIEKDVNTLSNKIQKYEIDLSDKRGSQTE IQRRLKADVESLPSQWQESLIKIDEEKIEEMDSERKVLAEYETLSNQLNIASEQVAFL QQQISNFRNQIEQLPNEAKRASQEVEQELVTAKSERDTFDRNRQNAQNSFNELISKRS LYQQLEKQKLDAERNQSLYKILCDLLGNGKQGLQLHLLRRAEQAIVQLANEILDGLSR GKMRLELRGEAEESTGESSKALDLVAYNEEIGPYPTPIAQISGSQRFRVAVSLALAIG QYAGQGARHIESVIIDEGFGSLDKNGRDDMIQELNELQHRLARIILVSHLEDFSSAFS NGYSIELVNQASKVRPLEHV" gene complement(24999..26237) /locus_tag="DP116_03555" CDS complement(24999..26237) /locus_tag="DP116_03555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876034.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03555" /translation="MRILHTSDWHLNDKLGRIPRQGDIVKRLEEIANYLDEHKVDVMV VAGDLFSQYNRLDELKSAVGEMRDVFQPFLLGGGTIVTISGNHDNEAFFNLMRFALDL ADPIDPKKPGAKPSGRLYLAAQPTYLLLKDKAGQPVQFVLMPYPTSSRYLKDEKTRYS SMDEKNSLLHQAMLQKIDSVKNKFIKPELPSVIVGHAHIRGSQLHNLYRISEREDVVF DAGDIPTNWAYAAYGHIHKAQALAGTTHVRYSGSIECLDYGEKDDEKSVVLVEIGAKG RTKEPECLPLNATPIYRVEINNPEEIASLKDKYSELDRALVSYKLTYKPGEHNRDAIC RELDKIFPRWYDREIMTDGSSISLKSSTLAADTQDVATTVRGYLQQQLAGHQDKDAVL ALAEKLLADEKFVNNSEVEI" gene complement(26250..28118) /locus_tag="DP116_03560" CDS complement(26250..28118) /locus_tag="DP116_03560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_03560" /translation="MSDEVLIPEFDVNGNGSHAGTPNADEADSTEDTNASASTPGVRI IGKVASPPQKESTSEEFHFWVRRDELVEKTQIIRTESRVGGDTINFYAIVEEVYRQSR KKDIGEEFDAFDGDVNYQPEFRGEGVTFATATILRTEPPVLAPPLEQSSVFLGDENDA RRAYRADEIDNPLAIGLIKNGGSAIAGPGMIDLDYLLGINGGHMNVNGVAGRGTKSSF LLFVIYQLLREARQRAEEYPSDPNPLMVVPIILNVKGYDLFHINRWSNRYRPEEHLAD WQRLGVEEPRPFEKVTFFAPQIPGGITAVSTGCRSTVQAYSWSLGNIIEKGLFTYLFA DDDSVNDNFRALVLDLEAYLTDERVANDGTVTRSLRNNVPRTFQLLLDWISDLDNRSQ LSPDHHRATWSKLRRKLLLLVHEGNGVLRRYDQNGNPLNLGIRQTTDPIVVDLNALTR VPSLQRFVVATILQQLVNERTGTNRAEGLVYLIALDELNRFAPRGSKDPITQLIETVA AEMRSQGIILLGAQQQASKVSEKVIENASIRVIGRSGSLELSQSIWRFLSKSNQRKAA ELTVSEKMVIQDNEPMHVRVPLPPWAMNPREATGNPVIDVNATTVEEDDDDDIATY" gene complement(28141..29199) /locus_tag="DP116_03565" CDS complement(28141..29199) /locus_tag="DP116_03565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493328.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03565" /translation="MGLDFIKNYGGCRVEQYDAQFPFLEEETDEDLNQTPEKVDLDYE VKHRAIDWEPINLSTSPNWRPSNWHERPTHFIDGKDVGETVASVRSPSGQLVPIRLSQ IGSITMRVENGECRRSFEVVERVVSMAVDLFPWTEVESFAAALQNNGFRLLPVRPPGG ISSYDFETMRKRTQNRSNTEMEVLEESAISHCGGEPTVIDGRLQPRMGGFDIDESPVF GVIKTQRQNYLHIKGIQVLYGLEAGQRTPVFTISRGWLPVVSWFVRLSGGGGGTPSTG IVRVEASKSWFEKYYKRNWDFVDKLSRTIYEYRCRERSYGRAAISLHPIVRAEESLGS LFQPLSILSNRFYRLTQL" gene complement(29255..30030) /locus_tag="DP116_03570" /pseudo CDS complement(29255..30030) /locus_tag="DP116_03570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879858.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 29761..29770 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(30031..30687) /locus_tag="DP116_03575" CDS complement(30031..30687) /locus_tag="DP116_03575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03575" /translation="MQKQNTNQQPYSREKAIQLYVQALDEGDMEVVAQILDIACDDPE LEKIITEINLAYQEEEQITPIATDAEIIRNLLHKHLHSGFEIIQEQEKPLIFEDIDEE EKSVTVGDVVKRLQDINRVPSADKEIINKLLDSSVPLPVKLSIQAVRQLAAELRINTS ERFLNMFRDTAITLSMGRSHNRAQLAAAREQKSRYQSTLNNRQPIENKVKKNTDIDKI " gene complement(30689..31360) /locus_tag="DP116_03580" CDS complement(30689..31360) /locus_tag="DP116_03580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876038.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sigma-70 family RNA polymerase sigma factor" /protein_id="PRJNA477356:DP116_03580" /translation="MKRQVGQPVDPSYDPELYQKQQLVSQVVEEEWQVLQRTIQMYVI KAIRQFGDKFGHSSDRNFTETVAKEIFHETVEEAFKIAGKFDPNRSARPWLLGIAAKK IQRWQRQQTQQNKRITPIAELPLARKLKQQNSEIFSEEEILGILYESSNSSDPEMLEY LLSLVNDGYREVLKLAFVDGLDGESLAAALGTTAGAAYTKKSRAIVQLRQAYAQSNKS SKEGR" gene complement(31570..32055) /locus_tag="DP116_03585" /pseudo CDS complement(31570..32055) /locus_tag="DP116_03585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493324.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(32384..33769) /gene="psbC" /locus_tag="DP116_03590" CDS complement(32384..33769) /gene="psbC" /locus_tag="DP116_03590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493911.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II 44 kDa subunit reaction center protein" /protein_id="PRJNA477356:DP116_03590" /translation="MVTLSNRSIALGGRDQDSTGFAWWAGNARLINLSGKLLGAHVAH SGLIVFWAGAMTLFEVAHFIPEKPIYEQGSILIPHLATLGWGVGPGGEIINTYPYFVI GVLHLISSAVLGFGGIYHAIRGPETLEEYSSFFGYDWKDKNKMTTIIGFHLIILGCGA LLLVLKAMFFGGVYDTWAPGGGDVRVISNPTLNPAVIFGYLLSSPFGGEGWIVGVDNM EDVIGGHIWVALICISGGIFHILTKPFGWARRALVWSGEAYLSYSLAAVSLMAFIASC FVWFNNTAYPSEFYGPTNAEASQAQSFIFLVRDQKLGANVASAQGPTGLGKYLMRSPS GEIIFGGETMRFWDFRGPWLEPLRGPNGLDLDKVKNDVQPWQIRRASEYMTHAPNGSI NSVGGVITEPNSFNYVNPRAWLATSHFVLAFFFLIGHWWHAGRARAAAGGFEKGINRE TEPVLFMNELD" gene complement(33753..34811) /gene="psbD" /locus_tag="DP116_03595" CDS complement(33753..34811) /gene="psbD" /locus_tag="DP116_03595" /EC_number="1.10.3.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II D2 protein (photosystem q(a) protein)" /protein_id="PRJNA477356:DP116_03595" /translation="MTIAVGRAPARRGWFDVLDDWLKRDRFVFVGWSGILLFPCAFLA LGGWLTGTTFVTSWYTHGIASSYLEGCNFLTVAVSTPADALGHSLLLLWGPEAQGDLT RWFQLGGLWPFVALHGAFGLIGFMLRQFEIARLVGIRPYNAIAFSAPIAVFVSVFLMY PLGQSSWFFAPSFGVAAIFRFLLFLQGFHNWTLNPFHMMGVAGVLGGALLCAIHGATV ENTLFEDGDAANTFRAFNPTQAEETYSMVTANRFWSQIFGIAFSNKRWLHFFMLFVPV TGLWMSAVGIVGLALNLRAYDFVSQELRAAEDPEFETFYTKNILLNEGIRAWMAPADQ PHEKFVFPEEVLPRGNAL" gene 35365..35961 /locus_tag="DP116_03600" CDS 35365..35961 /locus_tag="DP116_03600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318084.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I assembly protein Ycf4" /protein_id="PRJNA477356:DP116_03600" /translation="MTASTTINRGESPNGDKQTSNVLHQKVLGSRRFSNYWWATVVSL GAAGFLLAGISSYLKINLLIVSDPTQLVFVPQGLVMGLYGTAGLLLALYLWLTILWDV GGGYNEFNRENGTITIFRWGYPGKNRRIEIKSRTEDVQSVRVEVKEGLNPRRELYLRV KGRRDIPLTRVGQPLSLQELEIQGAELARFLGVPLEGL" gene 35997..36815 /locus_tag="DP116_03605" CDS 35997..36815 /locus_tag="DP116_03605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314238.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_03605" /translation="MRLKIPQFLVALIIVGALILGSCSTQQGVSNSSPTSAATEASNN STAETTAKEATTEAISVSESTNESIPGMKDLPRLEGKATVVMTVKGSPITIEVDGTNA PITAGNFVDLVQRGVYDGLVFHRVVRQPQPFVVQGGDPQSKNPKVPASQLGTGSFTDP KTGKIRYIPLEIKPKGSDEPIYSKTLKTAGTDKPPVLQHKQGAVAMARSQMPDSASSQ FYFALADLGFLDGDYAVFGYVTKGMDVVNKIQQGDRIDSAKITSGGENLKNAGQ" gene 37130..38395 /locus_tag="DP116_03610" CDS 37130..38395 /locus_tag="DP116_03610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459899.1" /note="FabF, beta-Ketoacyl-ACP synthase II, KASII; catalyzes a condensation reaction in fatty acid biosynthesis: addition of an acyl acceptor of two carbons from malonyl-ACP; required for the elongation of short-chain unsaturated acyl-ACP; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="beta-ketoacyl-ACP synthase" /protein_id="PRJNA477356:DP116_03610" /translation="MVAVVVTGIGLVSALGKNLEDSWQKLIAGESGIRYHQPFPELEP RPLGMIWEQPAQMRMLTQLVVTSALKDAGLCSPLPDCGVVIGSSRSYQGLWEQMARQM HTRVGELLSRGAGRKGEEELISIGSSSPQHPAKSPVSSSYLTPSLSPSSKMGWLDTLP HMNAIAAARQIGACGVVLAPMAACATGIWAIAQAAYLIQTGQCQRVIAGAVETPITPL TIAGFQQMGALAKTGANPFDIRREGLVLGEGGAVFVLESAELAQQRQAKVYGQILGFG LTADAYHANVPEPEARSAIAAVKQCLNRSHLSANDIDYIHAHGTATQLNDSLESFLIQ KLFFQGVAVSSTKGATGHTLGASGALGVAFSLMALEQQFLPPCVGLRHPEFDLDLVRE SRQSKIQRVLCFSFGFGGQNAVIALSQYS" gene 38603..39712 /locus_tag="DP116_03615" CDS 38603..39712 /locus_tag="DP116_03615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019504738.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-grasp domain-containing protein" /protein_id="PRJNA477356:DP116_03615" /translation="MTRHKIFNHDIMVCTHESVEGNYLYCSRVLGLTEPEDIIQLHPD LKSQWNVITEHYERIGLSYSKNVIWDVALRVLEDYPNYDVSLFFFGNATSKAGCDEDW FHQVDSDWLNVVKFINSKNNFIQLAQELGVSVPVTSYFENKTGIKDLSKFLYPCYFKA AISVNGVGIHRCENQQQLSEILKRFPDEIPLQIQEEIAASSFLNVQYYVEASKLQRLA ITSQILDGCVHIGNCYPSKHQPWETVEPIAEWMVQRGMKEIFAFDVAVVEDATHGETR YLAIECNPRFNGASYPTGIAKKLNIPSWNCENFRTQYRSLEKLDLSDIEFNPQTNTGV VIVNWGTILVGKISILIAGGVQEQNELRTILKERL" gene 39982..41415 /locus_tag="DP116_03620" CDS 39982..41415 /locus_tag="DP116_03620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="folate/biopterin family MFS transporter" /protein_id="PRJNA477356:DP116_03620" /translation="MLVSSSALSQVKNSVREKVFLGYEPTPELLGILSVYFVQGILGL ARLAVSFFLKDELLLTPAQVAALFGVVFLPWTLKPLFGFLSDGLPIFGYRRRPYLVIS GILGAISWISLATIVHTPIGAGIAITLNSLSVAVSDVIVDSLVVQRVRAESQAKAGSL QSLCWGTSALGGLITAYLSGILLEYFTTRTIFWITASFPLLVSAAAWLIAESPVSKDA SGDDSNVVSIRHQLQLLRAAVSQKVIWLPTAFIFLLQATPTAESAFFFFTTNELHFEP EFLGRVHLVTSIALLVGVWIFQRFLKTVPFRVIFAWSTVLSAVLGMTTLVLVTHTNRA LGIDDHWFSLGDSFILSVMGRIAFLPVMVLAARLCPPGIEATLFALLVSVHNLGGLVS QQFGAVLMYWLGITETNFNALWVLVIIANLSRLLPLLFINWLPAADSQTETSTLESAS TNSGEEPFLPEFMSELIVQEPESEPVE" gene 41529..43049 /locus_tag="DP116_03625" CDS 41529..43049 /locus_tag="DP116_03625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459909.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Apocarotenoid-15,15'-oxygenase" /protein_id="PRJNA477356:DP116_03625" /translation="MQTFQVYEQGSTVPEKSYNRGDWQRGYESLEKEFDYWIDDVEGE IPQELQGTLFRNGPGLLDIKGQRIHHPFDGDGMISRITFSNGRAHFRNRFVRTQEYLE EQKAGKILYRGVFGTQKPGGWLANALNFKLKNIANTNVIYWGGKLLALWEAAEPYRLE PYTLETLGKEYFNGVLSEGEAFSAHPRVDFSCAQDNGAPSLVNFSIKPGLSTTITIFE LNTEGKIVRQHAHSVPGFAFIHDFAITPNYCIFFQNPVTFNPIPFVLGMRGAGECIKF QPEQPTRLIIISRNPKQKGVKIIETRAGFVFHHVNAIEREDEIVIDSLCYESLPEVQP ESDFRQVDFDALKPGQLWRFHLNLKDETVHRELLVSRCCEFPTLHPQKVGRPYRYLYM GGAHAESGNAPLQAFMKVDLESGKQQLWSAAPHGFASEPIFVPRTPHIPASQGGTKGG EDDGWVLALVYDSEYHRSDVVILDAKDFHKEPIARLHLKHHVPYGLHGSFTSECFV" gene complement(43168..43605) /locus_tag="DP116_03630" CDS complement(43168..43605) /locus_tag="DP116_03630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746343.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03630" /translation="MGLSDGLSDPIKKDTVVADCTKLLDVQVASMQGVSGLAMKAGYT AVKGLAPTYCASAIATLLPESFAALDPIWSEGVHTGDPVEHLIANRDRSADAILSITD ARIEKSNNKTVRGVYSKLRNSAKKHVEEAIPGLAKIIDNYTKN" BASE COUNT 12389 a 9046 c 9380 g 13023 t 10 others ORIGIN 1 atatatatga gtcaatcacc atctcaaaat gaaagccgcg aaatgctgaa gtggctgaac 61 cacaaccgtc agatgttatt agatttatat cgaaatcaat atgttgctta caatgctaat 121 ggcttaattg ctcacagtga aaacttgcgt gaagtgttag agttagcaga ggcttcaaag 181 caactctttg caatttactt agttccccgc cgcactgctt ctatccaaat tttgccaatt 241 cgctttcgta cagttagtcg ccatgattgg cagccaaatt atcatgttag gctcaagcat 301 agagatataa atatttctac aacaatatta gtagactctg gtgctgaact gagtttaatt 361 tcattaaaag ttggtcaaga tttgggatat gctttagccg atgcggaatc agctttactg 421 gcagaaacta taggcggaag agttgagtat gttttacgca atgttgaaat gacaattgat 481 ggacacagtt ttcttgctcc tgtggcatgg ttacaaacta acacaggtgg agaacaattg 541 cttctggggc gggaagttgt atttgataag tttaatatcg agtttaggca ggctgatgag 601 caaatcattt ttaaatggcg tgaagattta ttcttaaaat 
caggattttg attatataaa 661 ttttgtacct agtgaagttt gcatacgata aaagctagtt taagttacat aataagataa 721 tatattatcc caaataccat gagacatgaa caacagcagg atttagagtc tacttctact 781 caagaagcaa caacatcaac tgatattgaa caaaatctcc aggaagtaga agaaaaagta 841 tcttttaaac aaaaaacgga gcgtattaca gcttggagcg actttattaa gtcaatgact 901 ccctttatat gggctgttgt cattatcatt gttctgattc ccttactagg caaagcttta 961 ataacaggtt catctcctgg aaaactagca gattcaggtc agaatcctag tcaagaagtt 1021 agtatagtcg tcccacaaat acccaaagat atcgaccaag cactcgtgac tgctttgaag 1081 aatgcacatt ctcaagcaga gagttttgct tcagaacaat tagataattg ggttgatgaa 1141 ttaatgacgc gtgttgatga gagctttttc gattggtatt ttaattactt caatcagaag 1201 aagatggagt tgagcgctcc ttttacgtgg ttatattcag cagttactca ttggacgaat 1261 aaaaataaac catctccagg acaagcagta gccgaaaaat tgactgagga ttttcaagta 1321 gaatttgcta agcgtgtcct cagaccgaaa gttgcacagc ttgaatttga gaaaatcaca 1381 acagatagta taaatttgta tgttacagaa ttgagtaaaa atatttctaa tatccagagc 1441 agttacaaga ttcctcaggg agattgggag cgttatcttg gtgatattgc agtgactatc 1501 aatgatatgg aaggaaatat ttctcattta tcattgaagg ttttgagtgg tggaagtact 1561 tacctacttg caaaagcaat gattccaaca gtcacaaagg taggaagtaa aattgccgta 1621 tcctttgccg gaaaagctgg tgcaaaaatg gcagcaaaga ctggtggtgt cgtggctgga 1681 aaaattggag cacagttatt agacccaatt gtaggtatag gtatcatcat ttgggatgtt 1741 tgggattata atcatacagt tgccgttgaa aaaccaagat tacgagaggc tatttacgat 1801 tatttgaaag aagtcaaatt ttctctgcta gaaaacccag aaaatggcat catggtagcg 1861 attaatcaag tggaaaatgg cattctgaag tcaattaaaa gttctcctca ataaattgtc 1921 aattttatga ttcttgtttt caaaaacagg ctgagtctct ctctttgaaa gaatcaataa 1981 ttattttttt atccatcact gccaacaaaa ccgataaatc tggggaaact ctttctgtta 2041 agtttagctg ttgagtagca agatcaaaag tctttccttg cagtatttga taaccaagtt 2101 ccatgacttc agaccaagtc acctcacttt caagataagc tttagccata gtcatgagta 2161 gatcttgata ttgatttccc tccgcttttg tcatcatagt gtggggaatt tcaaagattt 2221 tgtcaaagct gtaataccaa gattttgcct gttgttcaat tgtagtttta gaaaaatctt 2281 gcaattcaca gtgattgata gctggatcac tttcactata aataataaac 
atcggcactg 2341 ctaaattcgt tttgactcta tcaataattt cttgacccat ctctaaaaaa atccgcagtg 2401 caggaatgcg gaagccctca tagccaaaat tacctgtgtt atctttattc agccattcat 2461 aataaattgg gagaacttcc acaacaaaat tgataatcgc attattacca cttatgtaag 2521 gagcaaatag caaagctttt tcaattgatt ggggatgttc caaggctaac caagctgcta 2581 aatttccacc tgtggatatt ccaccgacta tcacttgctc tcctaaattt tgtgccacct 2641 tcaaccaata aagtgcaaac ccttgataaa tttcgcgttc cgttggtaga ggagggggat 2701 tgtctccatt ccaatctcca gcgactccat gacctggttg taagggtact aaaacattgt 2761 aaccagcgtc aaatagtttt tttcctattg gttcaaactg gtagggacca gcagtaaatc 2821 cgtggaaaaa aaggcaaact tttgaagttg gatgtggatg gagaaaaatt tttgagcggc 2881 aagcatcatt tcttattggt agagtttcct ctactgattt gacttgttga aagatgcctg 2941 cttttgttgt gcaatagtca gatatatttg acatgtttta tacagattga agttgttatt 3001 tttttaggaa aaactccact tcatttgaat atgctaccag acctataact tttcaaacag 3061 aaactaaatc gtctatggta acgctggaaa atcatccatt tgtgtctcaa aagtcacttt 3121 tattactttt ttattatacg tggagaacac aaggcgagaa aaaagaaggt agcaatggta 3181 ttctatgctc ttacctctgg caggttaaac taggtctgat aacagccact ctgtaaacat 3241 atgaggcttt tatggcagct gatctgcaac agattgctta ttatttagac aaccttggtt 3301 gggactaccg tatcgatcca gaagaagacc gtattatcac aggtgtagaa agtgataatg 3361 ttgaagattt cctgattgtt gtccagctgg acgaagaggg aaaatttttc cggctatttg 3421 cgccgcaagt gttatcaggg gtgagcgatc atcctcacaa agtagctatt cttcaaacaa 3481 tgctgtatat ttcctgggaa accaaaatgc tgcaatggga gtatgatcca tcagatgggg 3541 aaatccgtgc gataattgag tttcccctgg aagactcgat tctgacagaa aaacaatttc 3601 accgttgttt acatgggtta gttcagcttg ttgattctgt ggcgataccc cgtctgatga 3661 cagtgatgga aacgggtcat gacccgggta atgtagaact tggtgagaga attctactca 3721 gcattcagga acagtctcct ggattgctag atctcttaga aaaggcgatg gaagctcgta 3781 aaaaacgagg aagttttcct agcgaatgag tcggatcgcc agtgaaatcg ctatactcca 3841 aagtgatacc ctcaatatac tgaagttttc tagggctgga tattatgaca tcctatgtaa 3901 cctcctctgc caaagcagaa atgagtgaac tacggcggtt aaagggctta ctaccaccgg 3961 aattgcaaag ttgggtcacg gttgaaggta cgactgaggt gaatccaccg atgatccgct 
4021 gcgaggaaat tggcaaagac caagtagaag ttcaaatcga cttggcaaaa tgggattcac 4081 tggcgatgga tcagcgcaat ttactattct ggcacgaagt agctcgcatt caaaatgata 4141 ctattcccaa agatggttgg gaaatggcgg cgttggcgat tggtttaggc ggcgctgtcg 4201 gtgagttgtg ggtacaggat ggattgttgc tagtattagc tttgggttta tgcggagttt 4261 ccggctggcg actttatcaa aagaatagtg gcgaaaagca attcaaagaa ttagttgatg 4321 cggacgagaa agcgatcgcc ttagcaactc gcttcggtta cactctcccc aatgcttaca 4381 aaagtctggg tagcgccctg aaaactctta ttgacaccac acctagcaag cgcctgcgtt 4441 ctagatacga agcacggctg tctgctctta agcgtagtgc caataaggca aaatcaaaat 4501 ctaggaatat cgatgagggc gaaatttaac agtcatcagt tatcagttat cagttatcag 4561 ttatcagtta tcaaagagaa atacggattc gtttcttttg ttcactgtta actgttcact 4621 gttcactgat tcaaggatga cgtctccgtc tgagccataa accgcttaaa cccagcaata 4681 tgagtccagt agttgttgtc gcttctggca ctttttgtgg tgggacaaat ttgccgagtt 4741 cagcagaact cactttaaac gagtaaagga acccaggaat taacttttga ccatttgggc 4801 agccactgtc aatcggtctc ttgctatagc tggtaccatt ggcattagta caaatgtcaa 4861 actgggtggt ggaattgtcg tcttgggtga cactaaaatc attgtccgtc ccaattagca 4921 gcgcataaga accatcattg agtcgcggac ctattgccag accttccatc ttttctgcta 4981 tctttgtcca atcacctgga ttgtcaggac tcaaagttgc tgcaatatcc aaaaataaag 5041 atttactaac tgggttgaca cctgtgggta atgtattggt atttgccaaa ctgattgtgc 5101 tgacattggt tgcatttgtt agatcaattt tgtacactcg cttggtagct acaggtgatg 5161 gagttgcgct tgagtccgcc accaatccat tcactgcatc caccccaaac ccgcgattat 5221 ctctttctat gactaggaat tccttttcat tgagcgcagt aatagaactc agaccaatat 5281 tacgcccttg ggaagttgcc gcaaaatcat cttttgttcc aggaatgcgg gagttaatgt 5341 ctgctaaact ctctagttga tagatgtact gagccgtact ttttccagtt gctgtgtcaa 5401 attctactaa tcgtaaattg ccactacgtc gtccgtctgg tgaaccatcg ttagaacctt 5461 cgtttaccaa gggaccttgt agcacagcat agagtttgct accatcaggt aggagagtta 5521 agccttcaaa tcctcgatta tcttggcgtc cattggtaat ggttggacga ccatcgacat 5581 aattgagctt gccattcgct tctttgggga ttaaattttc tgggatggta aatgcccgta 5641 agaacgaacc gtcaggtttt aattcataca cagacggacc atactcatca gacacataaa 5701 
aatgaccatt aggagcaaca gcaaatcctt ctgggtcaaa actcaaacca agagtccctg 5761 cattgccatt cagcaatgtg gggtttaacc cgttaaattt ttgaccatct tttgtgaaca 5821 aaattgtttc tagtacttga aaatttgcaa ttgcccctgt attgggatca acatctaaag 5881 aaaacttttg tactcgtgtg ttgtatgaga ttctgccgcc accgggaccg cgatcgccca 5941 acccataata tacattgttc gagcggtcat agtagagatc cgagaaaaat cccaaacgat 6001 tgacattagc gctgttgcca ctagatgggt ataaatctgt actttcacca ccaatggaga 6061 gggtattgtt taaggaaata ccatatgccg attttatgta tcctgtaaga ctgactgcgg 6121 acagaattaa cgagcagctt aacagggaaa acaaatgctt ggcttgaatc atgggtaaga 6181 gaaaaagaat catgaataac ttttttcagc atctgactcc cgtattaaga acaagttaac 6241 tctctaacaa tgtcatgttt cttacatcaa aaatttatac aattggattt ttgcttgata 6301 atatcttcac aaaggttagt agtgagtagg acaattaaag actaactact aacacgttta 6361 gatttttact ctcaaaagtt aaagttgatt cagcatttta tgggtttgag gaacgacacg 6421 tacacacttg acaaaacgct tcattaaagc ttgccaattt aagacttgac tgggagaggg 6481 agcagccaca ggtagttgtt tgaacgcacc aggatcatct aatggcgtga cagattgtaa 6541 aaatactggg atagatggat tgacagatgc taccaattca gctgaacgtt ctaactcagt 6601 gggatttgtt gtttgagaaa caattatctt aacaaaaact tctacttgtg atgtccaaca 6661 cagctgaaga aattcagcat gttctcgcca acgattttcg ccacttgcac taggcagttt 6721 gaaatccatt cccacagagt ctatgtgggg tagtatagta ttcagtagtt ctgggcgatg 6781 tccacctgtt tccaagtata ctggtaaact tgtcactgag cgtacttggg gtaaaaattc 6841 taccaaaaat ggggtgtgta aaagtggttc gccacccgtt aacgtaatgg tatcgtgtag 6901 acaaggcaag ttttgtcgct caacccattc tagtaacatg ggtaatgaaa caggattcga 6961 gtaaatttca aagtcacgta atccaggggt ttgctctatc ctacaagtcg cgggtgcatt 7021 ccaagtatgt gcgctatcgc aaaagtgaca acgcaagtca caaagagcaa aacgaatgaa 7081 aatttgacgt gttcccacat tcagtccttc cccttgtatg gcggaaaaga cctctatcag 7141 gcgtgcggta ggtgtaacag tagtcttaat ggtcatgtga tgatagcaat aggtatgtgc 7201 tacgttggca caaaagcaag aatgtttttc actaaagaaa tatgaagcat gaagtcaaaa 7261 atatgaagat acggcaaaaa tgagaaaccg aaagtaggaa atgaagaata aaatatgaaa 7321 tactttattt cttacccgtt attctttctt ggcatacttc attcttgaca cttcaaatta 7381 gttagcttct 
tgttccattg tgaatcgtcc ctgatgatat ccaatctcaa ctgaagtaac 7441 tatcaattcg cttggactaa atagtgtttg gcagattata ttgacatggc aacgaaagtg 7501 cagggtttca tgactagcca acccaaagag gtaaattttc cggtgacgtt ctcaaataac 7561 acggcaattg tgcaggtacc agcgcggttg agcgtgcttg aggctgtagc ctttaagcaa 7621 acctgccaag agataactcg agcttatccc gatactaaac aaattatcat tgactttcac 7681 caaacgactt ttatggatag tagtggttta ggtgccttgg tcagtaattt gaaaattgct 7741 catgaaaaaa atatcgattt tatactacgg aatgtgactc ctcaggttat gtcagtactt 7801 aacctgacag gactggataa agttttttct attgagtctc aaggggaaac accatcaaag 7861 agccaattag aagaactacc aacgacacat ccttctgttc gttcttggat gaaacggttt 7921 atagacatag ttggatcact agttggtttg atgattacag gagttttgtt tatccctatt 7981 gctgtggcta ttcgactcga tagttcaggt ccaattttct ttagtcaaac tcgttgtggt 8041 tggatgggca aacgctttct gatttggaaa ttccgctcaa tgtatattga cgcagaagct 8101 aggaaagctg aactagaaaa gcagaaccag gttcaaggag catttttcaa aatagacaat 8161 gatccacgaa taaccaaggt cggacggttt ttgcgtcgaa ccagcttaga tgaactaccc 8221 caattttgga atgttttcaa aggagagatg agtttagtag gtaccagacc acctacacca 8281 gatgaagtag agcgctatga agttccagag tggcaacgtt tggatgtcaa accaggaatg 8341 accggagaat ggcaggtaaa tggacggtct agtatccgta aatttgaaga tgtcattcgt 8401 ttagatttac agtatcaaaa gaactggaac ttaatctacg atttgatgtt aatttttaaa 8461 acagtagcta tcttatttaa cagaaacagt ggcgctgttt aaatctattt ttagaggtgt 8521 aaaggaacta agagtgtatt aagagggaac agggaacgct taacagggaa atcctcataa 8581 ttgaaacctt tattcgttga ggcaagcagt tcttgagcaa ggtaaaacag agcaccaggg 8641 gaagaaataa ttgtcaatta gctctttcct cccctgccta aagatgctta cccgtattac 8701 ctaaccgtat tgggagatac ctcagtcacc gtgagagcac cgaaaccacg cggcgacgct 8761 ggactgtcgg gagacccgat gttagcgcct agcgacccaa gaacgccctc cctcaccagt 8821 cctacacaat acacacgcgc tcaattctga cccctgagga ctgatgactt tcctggtcaa 8881 tatacttcac tctccactac agaatcctta cactcaagtt tcatgtcctg gcacttatgc 8941 caggtttttt cttgtcagaa atttcttgac attatacgta taccatattt atctctggtt 9001 gaagcataaa aaaagttcat gatacagcaa tttaagtgaa tttaatgata tagataattg 9061 atatagcaat tgctcagcaa 
gaaaaatact aatgaaagcc ttacaaagtg gcatcaaatt 9121 ttactcaatc ttcaataaat catggctaac ataggtagcg tctgaagcga aaatgcatat 9181 ataaactcac ataagtgcgg aataaggtta acaacaaatg ttgtcataga aaagacaccg 9241 gggttcttca gactcaaact aatttgccat gcgtattttg atttactctt acaactatta 9301 tccagaacca attggtattg cccccctgat gactgaatta gcagagggat tggttaagcg 9361 cggacatcaa gtacgcgtag tgactgctat gccgaattac cccgagcgtc ggatatacga 9421 gaattatcgg ggcaagtggt ttgtgagtga gtacaagaat ggggttcaaa tccaacggag 9481 ttatgtttgg attcgtccgc agcctaactt aatagatcga ctattgctag atgcaagctt 9541 cgtgcttaca agttttttcc cagccattgc aggttggcgt ccagatgtca ttctctcaac 9601 ttcaccctcc ctacctgttt gcttaccaac tgccctttta ggatggcttt acagatgccc 9661 tgtagtctta aatcttcaag atatactgcc agacgctgca ttacacgttg gattgttgaa 9721 gaacaaatgg cttattaagg tgttaacagc attagaaaaa ttcgcttacc acagtgctac 9781 taaaattagt gtgattgctg acgggtttgt agaaaattta caaaaaaagg gcgtagcacc 9841 cggcaaaatt gtacaaattc ccaactgggt cgatgtcaat ttcatccgac ctctaccaaa 9901 agaaaataac gcttttagag cagctcataa cttagatggc aaatttgtgg tgctatactc 9961 tggcaacatt gccctcactc aaggtttaga aactgttgtc aaagcggctt cgttattacg 10021 ccatattcca aatataacct ttgtgattgt aggagaagca aaaggtttgc agcggttaca 10081 aatggcttgc accgaatgtg gtgctgataa tgttttgctg ctaccttttc aacctcgcga 10141 aagtttacca gaattgttgg cagctgctga tgtgggtttg gtggtgcaaa agaaaaatgt 10201 tatatctttt aacatgccat cgaaaatcca aatattgctc ggcagtggtc gggcagttgt 10261 gggatctgta cctagtaatg gcacagcagc aagagcgatc aaacaaagtg gtggtggagt 10321 ggttgttcct ccagaaaaac ccaaagcttt agcagaggcg attttagatt tgtacaacaa 10381 cccagaaaaa gtgaaaacat tggggtgtaa tagtcgccaa tttgcagttg agcagtatgc 10441 ttttgaacaa gcattaactc gctacgagtc tttgttctat tcattgacag cccaacgttc 10501 aacaattgag ccactgatca attcctttgc tgaaaaacac gttctccagc agagcgaagg 10561 aatccagata gtaaaacttc gttagaataa agttatatat tcatcactca actctattac 10621 tgcaataagc ctccattgtc agaggacagc cattatgcgc gaaaaagtct taacctggtt 10681 aacacagaat gttccagctc ccagagtgaa ccacattctc agagttgagc aaatggcaat 10741 ggatctcgca 
gttcattaca aagtgactcc agaaaaagct gcacaggccg ggttaatgca 10801 cgatttagca aaatgtttta aaccacagaa acttttgcaa atggcacaaa aagagggatt 10861 ggaagtagac gaagtgatgg gagcaaatcc ccatttgttg catgcagacg taagcgctat 10921 cgtcgcaaga gaaacatttg gcgtagacga tgaagaagtc ttacaagcta ttgccaatca 10981 cactttaggt agaccaggca tgagtccgct gtgctctatc gtgtttttag cggatagcat 11041 agagccggga cgaggcgata ccccacaatt gcaatcatta agacaaatta gccgtcaaaa 11101 tattcatcag gctgtcgctt tgacctgtga ctatactctc aaattattgc ttgagagttc 11161 ttgtttggtt catccgcgag ttatcttcac gcgtaactgg ttcctacaaa agtggaaagc 11221 caaacagccg attgtacaaa aaaccgtata gatacctgtt ttttttgtta gtattcaaaa 11281 aaaccaactc tcacagttgc aatttaattg ttgttcgcga gtcattgtaa aaaattgtta 11341 caaaaaaaag agcagctgag gtttaatgtc tgattatttc ccaaccaatt tcccaagaca 11401 atcgatcgca gtgacaaaca gtctggtagg aaactcacag gttgataccg acgatgtgag 11461 tggaaagatc gcattgaaga tcgccgaagc cgcatcagac cgcaaagcag gtgatatttt 11521 actactaaag gtagcagatg tttgttacct ggcagattac tttgtggtgg tgacgggcta 11581 ttcgagagta caggtaagag cgatcgccga cgcaattgaa gaaaaagtca aacaagaatg 11641 gcaaaggctt cccctgcgag tagagggaaa atctgaggct tcttgggtgg cgcaagacta 11701 tggcgaagtc attgttcaca tcatgttgcc tcatgaacgg gggttctata atctagaagc 11761 gttttgggga catgcagaac gtatcgagta ttcaacttcc gtagaaggtg agggtaaacc 11821 aacatgataa aatcttcggt ctcaaattgc ccagttccca ctgaacagca accactaaat 11881 gagtacgaag agttaaaatc ctcctggcta tttcgtgact gtaccttaaa tttgcgcgag 11941 tatatcgcca aaatcgcgtg gatttgggga atatggtggt tagtggcatt cccagtcgca 12001 gcagcaagct tttccccata caagcagact gcacagttta ttctcggtag tctagcagga 12061 gcaagtgtgg gggttgtgct ggtactagta cggttatatt tgggttggtc ttatatccgc 12121 gatcgcctga tgagtccaat catcttttac gaagagtcag gatggtatga cggacaaact 12181 tggatgaaac cagaggaagt ggtgacacgc gatcgcctgg tagtttccta ctcaatcaaa 12241 ccaattataa gtcggttaca aatgaccttt gctggcttgg ctgtattgtt cgttgctggt 12301 acgatagttt ggcacctagt gtaatcgtca tcagtcattc gtcatttgtt atttgcaaaa 12361 gactaaggac taaggacaaa agacaaaatt atggtaaatt tctacccatg acaatgggaa 
12421 aacgaacaca agccgccgca ttagaagtca ggttactgcg tgaaggcatt atagaatcaa 12481 agcatatcgt ccaagccgtc gtctgtgatg atcgaggacg agtactatcg gttgctggaa 12541 acgctgaaac tgcaacattt gtccgttcgg cgctcaaacc atttcaggcg atcgcagtaa 12601 ccagcacagg gacaatagag cgctataacc taagcgataa agacctggcg attatgacaa 12661 gttcccataa aggaacaata gagcaagtta gacaggcatt taatatactt tggcgagctg 12721 atgtagatcc ttcagcactc caatgtccaa ttccagaagg taagtacagt cgtttggaac 12781 acaattgctc tggcaaacac gcaggcatgc tagccgtttg tcaacaatgc aattggtctt 12841 taaataacta cttgcagcga aaacacccag tgcagcagtt gattctcacc aaagtatcag 12901 aattgctgcg aatgccagca gaggaattca tcagtgctca tgatgactgt ggcgcgccta 12961 cgtatttgat gcaactcgct caaatggcat cattgtatgc tcagcttgct tctcgtagca 13021 ctttggacat ggaacgtatt gtgcgtgcca tgacgcatca ctctgtgatg gtggcgggag 13081 agggagaatt agatacggaa ctgatgcgtt tagcccttgg agatctcgtc agtaaagcgg 13141 gtgctgaagg agtacagtgc attgccagat tgggtgaggg catgggattg gcaattaaag 13201 ttatggatgg agcaaagcga gcaaagtatg ccgttgccat tcacttactt aagcaattgg 13261 gctggattac tcccagcgtt gccgaagccc tgtcagaaaa attcatgaac ctcggaaaat 13321 acaagcgttt agaagttatt ggagaattat cgattttata gttgccaaat ttctaacttt 13381 gttgttatac tgaggaagac gaagagacga cgcgggatag agcagcctgg tagctcgtcg 13441 ggctcataac ccgaaggtca gtggttcaaa tccacttccc gccaccaaat aaaagatcaa 13501 tcaaatccct acaattgtaa aaagttgtag ggatttgctt ttgataaact agatgcaacc 13561 tcaactcacc gttccgtaat gtactcgttc gtatcacaca gcgatccgcc gaaatccagt 13621 cagtatgacc aacgattgta ttgacccaat ttcagagata gttagctcac gccaacgagc 13681 cttatcctta aatgacgttt gtgcctccgt gtgctctctt gcaactgtag cagttgatgg 13741 atctccaaga acgcgagtca tgaacctgaa tggaatctct gggcgttgat ttgaacttca 13801 ttgcagtgag caaagcagta aatggcaaga actcctacac aatcccgctt atgaactttt 13861 actttggtgg cccagtgtag cacgacagta tagaattcgg ggagagttag agatgatgcc 13921 gaaatctctt gtagaacaaa gctggttgcg tgcgccctac ttattcaaac tccttgaccg 13981 tttttaccaa gagtatcagc cacttggttc aaagctccaa aaccagcaaa tgctactctc 14041 atctataaat aatctcaaaa cgaaatatcc aaacgagcaa 
gaagttccac ctccacagca 14101 cttgagaggc ttttactgca aagctagctg gattgaggct ctctacattg ataatcttcg 14161 tattcacgaa agacattgct attcgtacac cgagaattct tggcatcacg agttacttgt 14221 cccatagcat tactcccttc ccggtctaag attacctata ataatttatt cctctgtctc 14281 tgtgtcctct gcgcctctgc ggttcaaaaa taggtattct tctggcggtt cgggagtaag 14341 attgctaggg gagtgtgtga agtaaaaaag ccaccctgac aggatgggtg gctcaactta 14401 tgcgatttat gcgctcaaat agcgctaata gaccattaac acccttgctt taagccaaag 14461 agtggattta caagtcagga attaaacatc gtaataaaga gcaaattcgt agggatgagg 14521 acgtaaccgc atggggttaa cttcattatc caacttgtag gaaatccaag tttcaatcaa 14581 gtctttggtg aatacgcctg gttctgttaa gaaagcgtgg tctttctcca gtgcttccaa 14641 cgccagttcc agagaacctg gagttgaagg aaccttcgcc agttcttctg gagaaagttc 14701 gtagatattt ttatctagag gttcacctgg gtcaattttg ttcttaatgc catctatccc 14761 agcgcacagc atcgcagcaa atgccaagta ggggttagaa gtcgcatcag gacaacggaa 14821 ctccaaccgc ttggctttgg ggttagtacc agaaagagga atccggatag aagcagaacg 14881 gttaccttgg gagtaagcca aatttacggg cgcttcataa ccaggtacca aacgcttgta 14941 agagttggta gtggggttag taattgccag cagtgctggt gcatgcttga ggataccacc 15001 aatatagtgc aacgccattt cgctcaagcc agcgtattta tctccagcaa acagcggttg 15061 accatctttc caaatggact ggtgggtgtg cattccagaa ccattatcgc caaaaattgg 15121 cttaggcatg aaggtgacag tcttgccata cttcttagct acgttcttga tgacatattt 15181 gtaagtcatc agccagtctg ctgcttcgat caacttacca aagcggaagc ccaattcgca 15241 ctgtccacca gttgcaactt cgtggtgctg cttttcaatt ggtactccgc actttgccat 15301 tgtcagcagc atttctgtgc gaatatcctg gaaagtatcg gtagggggaa ctgggaagta 15361 accctgtttg tagggaggtt tgtaacccag gttaggacct tcttctctgc cagaattcca 15421 acgaccttca actgagtcta cgtagtagta gccttggtgc gctgtttggt caaagcggac 15481 atcatcaaag atgaaaaact cggcttccgg accaaaaaac gctgtttcac caagaccagt 15541 ggaaactagg taatctatag ctttctgggc aataacacgg ggacaacggc tgtaccactc 15601 atctgtccgt ggttctttga tgctacaaat gatactcaga gttggctctt gcatgaaagg 15661 gtcgatccag gcagtgtttg gatcgagtac catcgacatg tctgattcat tgatggcttt 15721 ccaaccccgg atactagaac 
cgtcgaaagg tacgccatct gtaaaggaac tttcatcgat 15781 ttggttgtgg aaaagcgtca aatgctgcca gattcctggc atatcgatga atttcagatc 15841 aatcatctga atgttgttgt cccgaatctg cttcagaact tcttgtgggg ttgtcatggt 15901 tactccttaa atctgccgat tttttaatat taagactaga gtgttcaatg catgttgacc 15961 ccgtttgtaa atccaaatcc atgagcctcc tgcaatatcg taaagatttg aatttggata 16021 attttgtatc ttttgtaaca gaatatcaag ttaaaagatt atctttaacc tttccttaac 16081 tagtttcaca aaagtagaca gttgatctcg caggggtcaa ctttggcata gaatccataa 16141 ttagcgctat agattgtttt tgtgctaggg ctttttttac aagaggcata aaagcttgct 16201 aaaagcagtc agattaatat caaaccatag aaaggattga ttaggagaaa aaagaatgcg 16261 ggatgcagta accagcttaa ttaagaatta tgacgttaca ggtcggtatt tcgaccggaa 16321 tgccattgac agccttaagt cttactttga aagcggtacg gcacgtgtgc aagcagcagc 16381 ggctatcaat tcaaatgcgg cgtcgattgt caagcaggct ggttctcgac tatttgatga 16441 acagccagag ttgattcgcc caggtggaaa tgcctacacc actcgtcgtt atgcagcttg 16501 tctgcgcgat atggactact acctacgcta tgctacctat gcccttgttg ctggcaacac 16561 ggatgtactg gatgagcgtg tgctgcaagg gctacgagaa acttacaatt ctctgggagt 16621 gccaattggt cccacagttc gcggtatcca aattatgaag gatattgtga aggagcaagt 16681 ggctgcagca ggtgtaacaa atacctcctt tgtcgatgag ccatttgact acatgactag 16741 agagtttagc gaaacagatg tttaacagtt atcagtgatc agttatcagt tactcgataa 16801 ctgttcactg ttcactgttc actggttaaa attgctctgt cggttttaaa atcaggcgag 16861 tttccctgag gaaactcgct tttttccacc ttcgcgacgc ataccgcacc aagtgcacca 16921 ggaagcacac aaagctgaaa attcgccaca atagaaggtg ccaacgattg aggcgctggt 16981 tatggaagag ttactagagt taaaagattt gttgctaaaa ggtgatgtac caggggcgtt 17041 ggtcatcgtt gaagagttag aagagatgag ccgtaacgat ataatcaaga caattcgtag 17101 ctatgccata attctgctgt tacatctaat taaacagcaa gtcgaaaatc gcacaactcg 17161 ctcttgggat gtctctattc gcaattcagt tcgggaaatt caaagggaaa ataagcgccg 17221 taaagcaggg ggttattact tatgttcaga agaattattt gaaaccttag aagaagcgta 17281 cttaaatgcc atcgatgaag cttccctaaa agtagaacaa gggcgttatg aaccagaaga 17341 gttagaacaa ctggttaatc gagaagaaat tataaatcgt gcattaacgt taatattacc 17401 
acaatagatc aagataattg gttaaaaaac gactcgacac tctattagta gaactcaatt 17461 taagttcgtc tcgcgcctta gcacagcgct ttatccaagc gggagaagtc actgttgatg 17521 gtcagataat tgacaagcca ggtacggaag ttgatattgc ggctcaaatt caaattaaag 17581 agcgatcgcg cttcgtttcc cgaggaggag aaaagttagc caaagcaatc gatgtgtttg 17641 caattcctgt agaaggacga atttgtttgg atggtggtat ctctacaggt ggcttcactg 17701 attgcttgct gcaagctggg gcaaaactcg tttatggggt tgacgttggt tatggacaag 17761 ttgactggcg tctgcgaaat gattctcgcg tggttttgaa ggaacgcacc aatttacggt 17821 atttaacgcc agatgagttg tatggtgtac ccaccttaag ccccccttct aaaggggggg 17881 aatatgctga tttggcagtg gtggatgtct catttatttc cttaaccaag attttgcctg 17941 ctttatggca gctgctgcaa cctccccgtg aagctgtatt gttagtaaaa ccccagtttg 18001 aagtgggaaa aagccgcgtt ggtaaaaaag gcgttgtgcg tgattctaaa gaccaagccg 18061 aggcaatttt tcaggtgttg caagcggctg gtgaattagg atggaagtac aaaggtttga 18121 cttggtcgcc tgtgactggt cctgctggaa acatcgagta tttattgtgg ctgggtatgg 18181 agagtgaaac tccgccgcct gatatagaga ctgttcacca gataacaata ttagctgcat 18241 ctgagttgcg ataaattgtt ttttgtcaat ttctgtttta tttattggat taacgttttg 18301 ctttgttttt ttaagttgca aaatagaaga acagtttaaa tattgatgag ataaaattgc 18361 atgactactc aagataaagc acgggaatta atggttcagc agcgtctgca agacgaacac 18421 ttgcacgagt caatgctaaa tcgtgctgaa gcagcacatc ctagtgggac agaaggtatg 18481 actcaagaag aagcacggga gttaatggcg cagcaacgtc atcatgaaaa acacttacat 18541 gagtcaatgc tgaaccgtgc ggaggctgag gtaggattgc ctagtgataa cactaactct 18601 tctcaagagt gtagtcttta ttaacgtagt gatgtatcgt tgaaagaaaa ctgctctaac 18661 tcttgccgca aacgatacaa aatcgtattt ggctccactt cagctagaat ttcttgaata 18721 actgtacttg ctcccaattg tccccaagag cagcctcttt ctagatagac ttgcgccgct 18781 ttacccacag catatgctcc gtagcctgca ataccagctt gtgtgattgc agttcctgca 18841 taagcagtga tatttatagg gttttctcca gaagcaattg cagcggtact cttacccaaa 18901 cctaacagca aactgctacc tagttctccc agcagtaagc caccagaact taataaaatc 18961 gtttttaaaa aattccctgc ttcgtaccca gtcattggta agccgtacag gcgtgctagt 19021 gagcgaacta aagctaaatc ggcaactgtt ccgcctagaa tatctaagaa 
ggggatggga 19081 tttagcccta ctgctaaagc tttgtattta gtaaattgcc aaatcaggtc ttctgcttgt 19141 tgttggcgta agtcgatggt tttttgggcg atcgcctctt gtgcatcccg cgcttgaatg 19201 agtgcgttta aagcgaggag cgatcgccct tcacgattga gaattttaag aattgtttct 19261 ttgagttcgt ctatttgtgg tggtggtgtc tcccattcat gtgcgacact accatcagac 19321 cattcaaccc gtacttccat cggtgcaggt tccgccgcca ccatgacaat ttcatccggt 19381 tttaagggtt tgccttgggg atttcctgca cccaactgtt gtaaattatt gtaaattgtt 19441 gttttgtctg tatctgggta aaggtctatc ttgttgaaca ccaaaattaa aggtttctgt 19501 gcctgacgca attcacacag catctgatat tcagtgcggg tgatatcacc agacacgaca 19561 aagagaatta aatctgcttg acgcacgact tctcgcgcca tttgtgcccg agactcaccc 19621 tcaatttcat ctaaccctgg tgtatcaatc aactccacct gcaccttacc accaggattc 19681 cagcgtacag aacgggggta ctgagtcaca ccattaagag gaccagtttg taaaatcttt 19741 tgtcccagta aagcattcaa taccgctgac ttcccacgac tcaccaaacc aaaaacagcg 19801 atacgaatga ggttatagtc tagtttgttg agtgttgagg tcaaaagttc caattctggt 19861 tttaccaaac ctgctaattg tgagtttgat gacgactgtc ccggtttgcg aagatttcca 19921 taccaagaca gcgcaaggct tagactagca cgcgcacggt ttaagtgagt ttcttgcagg 19981 ttctttgaca ctggtaatgg ttgagtcaag actatatcaa ttttatattt tagattttgg 20041 attggaaaaa tcaaactaac cgctgactaa tatattttta tctaatgaca aaaatacgag 20101 tattgtttgt tgaatacatt acacttgtgg taagtatagg taggagtgta attcatgcca 20161 gttaaatcga gaaaatgtat tgtcagtacc taggcgtggt acgagtcaag ccaagaatta 20221 aagataccag aattgcaatt ggtctcgtag tgtggtcttt agcggctgag ttcacgacat 20281 atgcaaatat tcaagagtta cagacttgtc aaaagaggag atagctgctt gttttgatga 20341 tgctcttaat ttgtgagagt ttcagactgt tgcatcgtga gtttgaggtt tttagcaaat 20401 cattgtgtct ttaatttcat tatctaaata gtacacgaag ctggctatga agttttgatt 20461 ttgaaggata acattcctgc cgattcaatc acaatcttgt ctgaatcatg aaatatttca 20521 taactgcttt tcatttcatt tcacttgatg tgttcaacta tttggagttc ttgtaaatga 20581 aacgctcaaa ttccaataca ttggaacggt gcaaaaagtt tttcagcaag gcaaaaaatt 20641 cgtttacagg aaagtatcag ctttacacgg gcgaaaaact gaattggttt caggtcattt 20701 tccgactaga aaagtcagtt cttccagcca 
ttcttccttg ggtaatagtt tctggtttgt 20761 atggtttttt agtatcatta ctataccgtt atggagttcc catcggattt ccacataaga 20821 gcggtgtcct taccaatgct atcttgactt ttaatgttgg ctttactttg ttattggtat 20881 tccggacaaa tacagcgaat gagcggtttt gggaaggtcg taaactctgg ggttctttag 20941 tcaatacggt tcgtaacttg gctcagggaa tttacattgt agttaaagag caatcaccga 21001 aggataaagt acaaaaagaa gcaacactcc ggctattggt cgcttttgca gtcgctatga 21061 agttacatct aagggcagag tgtttggatg agcaagtaga gtcattgatg tctgaaatcc 21121 aatatttgaa gctgcaagag actcatcatc ctcctctaca aattgccttc tggatcaggg 21181 attatttaca gcatcagcat gaccgtaatt gtctaaatat ctatcaattg actgctctac 21241 ataagttggt ggatgatttg gtagatattt tgggtggctg cgaacgcatt ttgaaaacac 21301 cactgcccat aatatatagt attaagctta gacaattgtt gttgattttc tgtttgctat 21361 tgcctcttga aatagtcact aatttaattt ggtggactgg tgtcatcacg gcttttgtga 21421 gtttcacctt attaagtatt gaagaaattg gctcgcaaat ggaagagcct tttggacatg 21481 atccaagtga tttaccattg gatgcgattt gcaacacaat gcaacacaat gttgaagagt 21541 taataacgct cgctcctagt agtgagtttg actggcgagt ttaagactat ttggtttggg 21601 tttctatgcc caatttatgt taagcaattc tggaacagat ctactttttg gagacgtgac 21661 ttcgacggag gcgatcgcgg agtgctcgca attgtggaag ataataagga gagtttttat 21721 cccacgcgcg aggtttgatt ccttttagac tgatgccagc aacaatttgc attaacaggc 21781 gacctaagca acttcattta gctttgcata gaagctctat ttcatgacta gcaatatagt 21841 atttctatat cagacatgtt ccaaaggtct gacttttgag gcttgattga ctaattcaat 21901 ggaataacca ttggaaaaag cagatgaaaa atcttctagg tgggaaacta aaatgatgcg 21961 tgctagtcga tgttgtaatt cgttaagttc ttgaatcata tcatcacgcc cgtttttgtc 22021 caaactacca aagccttcat caatgatgac tgattctatg tgacgtgcgc cttgtcctgc 22081 atattgaccg atcgccaatg ctaaactaac ggcaactcta aatcgctgac taccactaat 22141 ttgagcgatt ggtgtgggat acggtccaat ttcttcgtta tatgccacta aatcaagggc 22201 tttgctagac tctccggtgg attcttctgc ttctccacgt aattccaatc gcatttttcc 22261 gcgtgataaa ccatctagaa tttcattggc aagctgcaca atagcttgtt ctgcacgacg 22321 taataaatgc aattgcagac cttgtttccc attacctagc aaatcgcaaa gtattttgta 22381 taagctttgg 
ttgcgttctg cgtcaagctt ttgtttttca agttgctggt acagactgcg 22441 tttactaatc agttcattaa aactattttg agcattctgt cggtttctat caaaagtgtc 22501 tcgctcggat ttagcagtta caagttcttg ctcaacttcc tgagaagcgc gttttgcctc 22561 gttaggcaac tgttcaatct ggttacggaa attgctaatt tgttgctgga gaaaggctac 22621 ctgttcacta gcaatattta attgatttga taaagtctca tactccgcta gcacttttct 22681 ttcactatcc atctcttcta ttttttcttc atcaattttt attaatgatt cttgccactg 22741 tgatggtaag ctttctacat ccgcttttaa gcgacgttga atttcagttt gagaacctct 22801 tttatcactt aaatcgattt catacttttg tattttattt gacaatgtat tcacatcttt 22861 ctcgatttct ttaacttgat taagtgtcag tgtttgttct tctttgagtg ttttaagtgt 22921 taattgcagc tccttttgtg attgttgtaa tgtacttttt tcgttacgtg cttcttgagc 22981 ctcagatagt ggcaaatttt tctcaatttc cgcaaactgg tgactgtaag tttgatgttg 23041 ggtattgaag ctttgccact cagagaattg atggcgtagc ttgtttaagt tttgagtatg 23101 agtttgtcga ttatctagca gttgatttag ctcttccaaa tcttcttctg ttgggtaact 23161 tgtatcaagc cagcctaaat catcgtctat ggcatatgga gaaacattta cctgataacc 23221 ttctggcaag ttgctaaatg cagtattaat ttgcttagta tactgtttag catctcgttg 23281 tgcttgattt agttgaattt tattttcgtt atgattattg ctgttcactg taatttcctt 23341 gtttaagttc tctaattctg ctgagaaaga attaaggtta tcctgtgctt ttttgtgttc 23401 attcttaagt ctagttaaat tagtttcagc atctgttatc tgagaatata aacgtgattt 23461 ttccgcttga gcatgttcgg gtgtaatctc ttgtccacat aattcacata tgggttgatg 23521 agatgctttt tcaaaattat cgcgtttatt acaaacagat tcatactcgg tttgaaaacg 23581 agttatatta tgagataatt tgttttcggc ttcactagca ttggcaaaat cagtatttaa 23641 ttttgcttgt tgagtttgat attcttctaa ttgagtttgc aaagactcta aatttttact 23701 agcttcttct tttcggtgta cagtgtctaa taatttacta cgttcctgag ctagttgttt 23761 taaccaaggt aatgttttta caatctcatc caaccgtttt tcttctgatt ctgctttttg 23821 taataattct tctaaatcag aaggaaaatc tgcaagagtc tgctttaact gttgtatttt 23881 ttctttaatc gactctaatt gtgatagttt ttcaactaaa ggtgctaatt caagtagccg 23941 aacgtcgata ttattactat ctttttgttt attttctatt ttctggttag ttgcttctat 24001 ttgctgactt aattcatctt tttcggcaga tattagtttt aaatcatcct ttacttgttg 
24061 gtagttctgg ataatttttt caatctcgtt atcgttgtca ataatacgtt gacgttgctc 24121 aataatagaa ctgagtaaag gtaacacagt atctagctct ttgtagcgag taaaattttg 24181 ttctatctcg tcagtgcgag ctatcaatgc ttgccatttt tgtacatcct tttgctgctc 24241 tataaattga gaatttaatt gttggaattg tttagatagt tcgactaaag ttactaaatt 24301 atctacttta tctcgcgatt tattcaaatt atcattggtc agatttagca cttcttgagc 24361 cgcattaaat tctgcgtcac taactgtttc tgatgcttct aattgtagtt taaaattttt 24421 aaactgcgct tcacattctt tacggcgagc atctgtaagt tgatgtaact tgtcatatgc 24481 agtcaaatct atcaattgtt tcagaatttt gtaacgttcc gatggtacag cttctattaa 24541 tttgtcactt ttaccttgaa tgagaagtat gcatgatgtg aaagcttcat catccaaact 24601 aatcttacgt tttacccagt ctttaaaacc agcatcgcta tctgtatttg cttctggctc 24661 gatcctcagt tttccttttg atgaagaatc aatacgaaaa acttctcttg tcgcttttcc 24721 acgcttgcgt actgttctac gaatacgata ggcttctcca tctacaagaa aatcaaattc 24781 aacagttaat gcatcagctt tatgattaat taagtctttt tgagtttttg cacctcctcc 24841 gcgatggctt ccgtataaag caaagcgaat cgcatcaaaa atagtcgatt taccagcacc 24901 attttctcca gataaaaccc acaaaggtgc gccatcaaag agtaactctt gtccttcacg 24961 atagctcata aaattttcaa catacacccg tacaggaatc atatttctac ctctgaatta 25021 tttacaaatt tctcgtcagc taataatttt tctgccaatg ccaacacagc atccttgtct 25081 tgatgtcctg ctaattgttg ttgcaggtat ccgcgtacgg ttgttgctac atcttgcgta 25141 tcggcagcta aagtagagct tttaagggat attgatgacc cgtcagtcat tatttcccgg 25201 tcgtaccaac gggggaagat tttatccagt tcgcggcaaa tagcatcgcg gttatgttct 25261 cctggtttgt aagttagttt gtaggaaact aaggctctat ctaattccga atatttatct 25321 ttcaggctgg cgatttcttc aggattgttt atttctacgc gatatatagg tgtggcattg 25381 agtggtaaac attctggttc tttagtccga cctttagcac caatttcaac taaaacaaca 25441 cttttttcat cgtctttttc accgtagtcg agacactcaa tgctaccaga ataacgcaca 25501 tgggttgtac ccgctaaagc ttgtgctttg tgaatatgac cgtaggcagc ataagcccaa 25561 ttagtaggga tatctcctgc atcgaaaacg acatcttcgc gttcgctaat gcgataaaga 25621 ttatgtagtt gactaccccg aatgtgtgca tgaccaacaa ttacgcttgg taactctggc 25681 ttgataaatt tattttttac actatcaatt ttctgaagca 
tggcttgatg aagtaaacta 25741 tttttctcgt ccatgctgct gtatcgcgtt ttttcatctt ttagataacg ggacgaagtt 25801 ggataaggca tcaaaacaaa ttggactggc tgaccagctt tatcttttag taataaataa 25861 gttggttgtg ctgccaaata aagccttcca cttggttttg cacctggttt ctttgggtca 25921 attgggtcgg caaggtcaag agcaaagcgc attaaattaa agaaggcttc gttatcgtga 25981 ttgccactaa tggtgacaat tgtgccaccc cctaagagaa aaggctgaaa aacatccctc 26041 atttccccaa cggctgattt tagttcgtca agtcggttgt actggctaaa taaatcacca 26101 gcaacaacca ttacatctac cttatgttcg tctaagtaat tggcaatttc ctccaagcgt 26161 tttacaatgt cgccttgccg gggtattcgc cccaatttgt cgtttaaatg ccagtcagat 26221 gtgtgtaaaa ttcgcatctt atctccttct tagtaagtag caatatcatc gtcgtcgtct 26281 tcttctacgg ttgttgcgtt aacatcaata acgggattac ctgttgcttc tctaggattc 26341 atcgcccaag gaggaagcgg tactcgaacg tgcattggtt cgttatcctg aatcaccatt 26401 ttttcgctga ctgttaattc tgcggctttg cgttggttgc ttttactcaa gaaacgccag 26461 atagattggg aaagttcgag acttccacta cgaccaatga cgcggatact tgcgttctca 26521 attacttttt cggaaacttt tgaggcttgt tgttgagcgc cgagtaaaat gattccttgc 26581 gatcgcattt cagcagcgac ggtttctatt agttgcgtga tgggatcttt ggaaccacgg 26641 ggagcaaaac ggttaagttc atccaaagca attaggtaaa ctaagccttc agcgcgattg 26701 gttccggtac gttcgttgac taattgttgg agaattgtgg cgacaacaaa ccgttgcaaa 26761 gatgggactc gcgttaaagc atttaaatca acaactatcg ggtctgttgt ttgtcttatt 26821 cctaaattaa gaggattacc gttttggtca taacgtcgaa gtaccccatt accttcatgt 26881 accaacaata gaagcttacg acgtaatttg ctccaagtgg ctctgtgatg atcaggggat 26941 aattgactac gattgtctag atctgagatc caatctaaaa gtagttggaa agtacgaggt 27001 acattatttc gtaaactgcg ggtgacagta ccatcgtttg cgactctttc atctgtcagg 27061 taagcttcta aatccaggac taaagctcta aagttgtcgt taacggaatc atcatcagca 27121 aataaataag taaataagcc tttttcaata atattgccaa gactccaact ataagcctgc 27181 actgtactcc gacatcctgt agaaactgct gtaattccac caggaatttg tggtgcgaaa 27241 aaggtaacct tttcaaaagg tcttggttcc tcaacaccca atctttgcca gtccgccaaa 27301 tgttcttcag gtctatacct attactccag cgattaatgt ggaataaatc gtagccttta 27361 acgttgagaa ttatagggac 
aaccatcagg ggatttgggt ctgatggata ttcctcagca 27421 cgctggcgtg cttcccgcaa taattggtaa ataacaaata gcaaaaaact agattttgtt 27481 cctcgaccag cgacaccgtt aacgttcatg tgtccgccat taatacctag taaataatct 27541 aggtctatca tccctggtcc ggcgatcgcg cttccaccat ttttaatcaa accaatcgct 27601 aagggattat caatttcatc tgctctgtaa gcacgtcgag catcattttc atctcctaag 27661 aaaacggaac tttgttctaa aggtggagct aaaacagggg gttctgtacg taaaattgta 27721 gcagtagcaa aagtaacgcc ttctccacga aattccggtt ggtaattgac atcgccatca 27781 aaagcatcaa attcttcgcc aatatctttt ttgcggcttt gtcggtagac ctcttctaca 27841 attgcgtaga aattaattgt gtcgccacca acacgacttt cagtacgaat aatttgggtt 27901 ttttcaacca gttcgtctcg tcgcacccaa aaatgaaact cttctgaggt agattctttc 27961 tgcggtggcg atgcgacttt accgataatt ctgacaccag gggtactagc cgatgcattt 28021 gtatcttctg tgctatctgc ttcatcggcg ttcggtgtac ccgcatggga gccattcccg 28081 ttaacatcga attccggaat taatacttca tcactcatat ttttagttta attttatacc 28141 ttataactgg gttaaacggt aaaaacgatt ggataggata cttaatggtt ggaaaagcga 28201 acctaaactt tcttcagctc gaacaatcgg gtgtagagaa atggcagcgc gtccataact 28261 gcgctctcga caacggtact catatatagt tcgggataac ttatccacaa agtcccaatt 28321 gcgcttgtaa tacttctcaa accagctttt tgatgcttcc acgcgcacta ttcccgtact 28381 gggagttcca ccaccccctc cagaaagacg cacaaaccaa gaaacaacgg gtagccatcc 28441 cctagagatg gtaaacactg gtgtacgttg accagcttcc agtccatata gaacttgtat 28501 tcctttgata tgtaaatagt tttggcgctg ggttttaatc acaccaaaaa ctggtgattc 28561 gtcgatatcg aaacctccca tccgtggttg caatcgtcca tctattaccg ttggttcgcc 28621 accacagtgg gaaattgctg attcctccaa tacttccatt tccgtattag aacggttttg 28681 agttcgtttt cgcatggttt caaaatcgta ggaggatatc ccaccaggag ggcgaactgg 28741 tagcaaacga aatccattgt tctgtaaagc agcagcaaaa ctttctactt ctgtccaagg 28801 gaaaaggtct accgccatcg ataccacacg ctcaacaacc tcaaaggagc gccgacattc 28861 tccgttttct acccgcatgg taatactgcc aatttgggaa aggcggatag gaactaactg 28921 accgctagga gaacgcactg aagcgactgt ttcccctaca tctttaccat ctataaagtg 28981 agttggtcgt tcatgccagt tactaggtcg ccagttaggt gaggtgctta aattgatagg 29041 
ttcccaatca atggcacggt gttttacttc ataatctaaa tcaacttttt ctggggtttg 29101 gtttaagtct tcgtcggttt cttcttctaa gaaaggaaac tgtgcatcat attgttcgac 29161 gcgacaacca ccgtagtttt taataaaatc taatcccatt tttatagata tactcctcct 29221 atgtattagt tgttttttta attttttatt tatttcagag tttttctggt aattgcaaat 29281 cttttaaccg tcgtttcatc gcttcctgac tgactaaaaa ctccgttgct aatcgttttg 29341 ctaaaacatc tctttttgta gcaaatcgac cgcgataacg cttggataaa gcgaaacaag 29401 ccgatgctgg cattaaaagt tcagccgcaa attgatcggc ttctgcctcc atttgcttct 29461 cgtctatgcc tagagaatca atgactgctt gtatgccatt tgtaaaggta aattgcgcca 29521 tagatttatc gttatctttt ttactttcat catcatcttc acttctcttt tttacgtata 29581 aaccttctgt taaaatcaaa ggttctttca aactattttc ttgatgagtt tgaagtaaag 29641 gaagaaaatg taacaagtaa tgtcctaatt cgtgggcggc tgaaaatcgt cggcgtgata 29701 ctggggctaa atggggcttt tgttccacta aaatacagcc gtaaaaaaaa cttttatatt 29761 nnnnnnnnnn aaatacaagt atccagctaa ctttaaatct tggttttcgc tatctgtcaa 29821 aggtagtttt tgtcctgttt ccacttctaa aaagtcggca atactttgaa tggtcatatc 29881 tttgatttct gctaccctaa ttgggtaagc tctaatcaat tcatacaggg gtacaacacc 29941 aggcttaaga ttttctaact ccagttcagc ttcttgatat gccagttcta cagcctggga 30001 aattgccata cggcgatgtt ctagtatcat ctatatttta tcgatatcag tatttttttt 30061 gactttattt tctattggtt gacgattatt caatgtagat tgatatcgac ttttttgttc 30121 acgggctgcg gctaattgcg ccctattatg gctgcgtccc atcgataaag taatggcagt 30181 atctcgaaac atattcaaga atcgctcact cgtattaatt cttaattcag ccgctaattg 30241 cctaacagct tggatactaa gtttgactgg cagtggtaca gaactgtcaa gcaatttatt 30301 aataatctct ttatctgctg atggtacgcg gttgatatct tgcaaacgct taacaacatc 30361 gccaactgtt actgattttt cttcttcatc aatatcttca aaaattaaag gtttttcctg 30421 ttcttgaata atttcaaatc cactatgcaa gtgtttgtgg agtaaatttc taataatttc 30481 tgcgtcggta gcaattggtg ttatctgctc ctcttcctgg taagctaaat taatttctgt 30541 aataattttt tctaattcag ggtcatcgca ggcaatatca agtatttggg ctactacttc 30601 catatcacct tcgtctaaag cctgtacata taattgaatg gctttttcac gggagtaagg 30661 ctgctgattt gtattttgtt tttgcatctt atctgccctc ctttgatgac 
ttgttacttt 30721 gggcataagc ttgacgcaac tgcacgattg cacgagactt tttggtgtaa gctgcccctg 30781 ctgttgtccc taatgcagca gctaaggatt ctccgtctaa accatcaaca aaagctaatt 30841 tcagtacttc tcgataaccg tcatttacca atgacagcaa atattccaac atttcggggt 30901 cggatgaatt tgatgactcg tataatattc ctaaaatttc ctcttctgaa aaaatttctg 30961 aattttgctg cttaagtttg cgagctaagg ggagttcggc aatgggtgta attcgcttat 31021 tttgctgggt ttgctgacgt tgccaccgtt gtattttttt agctgcaata cccaataacc 31081 agggacgagc cgaacgatta gggtcaaatt taccagctat tttaaaagcc tcctcaacag 31141 tttcgtggaa aatttccttt gctactgttt cggtaaaatt gcgatcgctg ctgtgaccga 31201 acttatcgcc gaattgtctg attgccttaa tcacatacat ttgaattgta cgttgaagaa 31261 cttgccactc ctcctcaact acctgactta ccaactgctg tttctggtac aattcagggt 31321 cataggacgg gtctactggt tgccctactt gtcgtttcaa tgtgacctcc taaacatcgc 31381 ccctgatatt taattgccac gagcaaggca aattcttaca gtggtcatca aaaaatatcc 31441 gcacttcgcg cattcaatat cctcctaata tcgctcctaa tattgaattg ccataagatc 31501 cacaaaatct tacagtcaaa gcgcaaaatt tttttgtgtc tctaattctc aatttcttta 31561 agtgttaaat tatactctaa aaacttcaag agctaattgc accaacaacc gtagtgcttc 31621 cttcgtggaa aagttgctta cagggagccc ggtaatttgg ctaactagcc acttgggtgc 31681 tacacgctca tttcctacaa caacatacca ggattggaaa ttcgcaaatt ctcgatgacc 31741 tcttcggggt tcgtcagtcg cctaggtcgg gaaaacccca ggtggcactc tcggaggaac 31801 cccaacgctc ttatcctgca acgggaaacc cgttttcagc atatatgtac ttatattgtt 31861 tgaaacttga ggctgttgct tcagaaatcg acgcagtacc tggaaggtgg gaccatcata 31921 tctggcgatc gctcttaaca atcctgagtc ggaacgtttg cgttggaaac aacccagaat 31981 gagtagattt gggttgtgtg tagagacttc gttcgtaagg ttatcaagac tatctttctc 32041 acaaaattta accattttag attccattcc acacataaag cgccatacct ttcaaatagt 32101 tccactttca ttgtggctgc aattgagtga tgtgcttctt catccgatcc actaaatgcc 32161 ccatcaccgt caattgcttt tctggagtca gtgctcaatc tcgcttaaaa atttttgaag 32221 tgatggactc atcgacttaa caattatcaa ctactgtaga ttttagcgtc ttctctcacg 32281 ctgctctgct gtgaagatgg atagcggagt gtgttaaaaa aaagcccccg ctttgataca 32341 ggagcttttc aagtaagtga ttgtcagtga 
caaaacaaat ctgttagtcg agttcgttca 32401 tgaacaatac tggctcagtt tcgcggttaa ttcccttctc aaaaccgcca gcggctgcac 32461 gagcacgacc agcgtgccac cagtgaccga tgaggaagaa gaaggctaac acgaagtgag 32521 aagttgccaa ccaagcacgg ggattcacgt agttgaagga gttaggctcc gtgatcacgc 32581 cacccacaga gttgattgaa ccgttgggag catgggtcat gtactcggat gcacgacgaa 32641 tttgccaagg ctgaacgtcg ttctttacct tatctaagtc aagaccattg ggaccgcgca 32701 gaggctcaag ccaaggacca cggaaatccc agaagcgcat ggtttcacca ccgaagatga 32761 tttcaccact aggagagcgc atgaggtatt tacctagacc agtaggacct tgagcagaag 32821 caacgttagc tcccagtttc tggtcacgca ctaagaagat aaaagactga gcttgagatg 32881 cttcagcgtt tgttggtccg taaaattcac tggggtaggc agtgttgttg aaccagacga 32941 agcaagaagc aatgaaggcc atcagcgata cagcagccaa gctgtaggag aggtaagcct 33001 cacctgacca gactaaagcg cgacgtgccc agccaaaagg cttagtgaga atgtggaaaa 33061 taccaccaga aatacagatg agagcaaccc aaatgtgacc gccgatgacg tcttccatgt 33121 tatcgacgcc aacgatccag ccttctccgc cgaatggaga actcagcaaa taaccgaaga 33181 taaccgctgg gttcagtgtt ggattgctaa tgacgcgaac atcaccaccg ccaggtgccc 33241 aagtatcgta gacaccgccg aagaacattg ccttcagcac caacagtagg gcaccacatc 33301 ccaaaatgat caggtggaaa ccgatgatgg tggtcatctt gttcttatct ttccagtcgt 33361 aaccaaagaa agaggaatac tcttctaagg tttctggacc acggatggcg tgataaatac 33421 cgccaaaacc gagtacagct gaggagataa ggtgcagtac accgataaca aagtagggat 33481 aggtgttaat aatttcacca ccagggccta caccccatcc taaggtagca aggtgaggta 33541 tcaggatgct gccttgctca tatataggct tttcgggaat gaagtgagcg acttcaaaca 33601 atgtcatcgc tccggcccaa aatacaatca agccagaatg ggcaacgtga gcacccagta 33661 gtttaccgga gaggttaatg agacgggcgt taccagccca ccaagcaaac ccggtggaat 33721 cttggtcacg accgcctaaa gcgattgatc tattagagag cgttaccacg tggtagtacc 33781 tcttcaggga atacaaattt ttcgtgaggt tgatcagcag gagccatcca agcgcggata 33841 ccctcgttga gcaaaatatt cttggtatag aaagtttcaa actctgggtc ttctgctgcc 33901 cgcaattctt gcgacacgaa gtcgtaagcc cgcaggttga gtgctaaacc cacaattcca 33961 actgcgctca tccacaagcc tgtgactggc acaaacaaca tgaagaagtg caaccaacgc 34021 ttgttggaga 
aagcaattcc aaatatctgt gaccagaatc tgtttgctgt caccattgag 34081 taggtttctt ctgcttgggt ggggttgaag gcgcggaatg tgttagctgc atcaccgtct 34141 tcaaacaggg tgttttctac tgttgcaccg tgaatagcac acagcagcgc tccacctagc 34201 actcctgcta ctcccatcat gtggaagggg ttgagtgtcc agttgtggaa tccttgcaag 34261 aacaacagga agcggaagat tgctgccact ccgaagctgg gtgcaaagaa ccagcttgat 34321 tgtcccaatg ggtacatcaa gaagaccgaa acgaacactg cgattggagc tgagaaggcg 34381 atcgcattgt aaggacgaat ccctactaag cgagcgattt caaattgccg cagcatgaag 34441 cctatcagtc caaatgctcc gtgcagcgct acaaatggcc acaagcctcc caattggaac 34501 caacgggtca ggtcgccttg tgcttctggt ccccacaaca gcaataacga gtgtcccaat 34561 gcgtctgctg gggttgatac tgctactgtc aggaagttac atccttctag gtacgatgag 34621 gctatgccgt gggtgtacca agaggtgacg aaggttgtac cggttaacca gccgcctagt 34681 gctaggaagg cgcaggggaa cagtagtatc cctgaccaac ctacgaatac gaaacggtct 34741 cgctttaacc agtcgtcgag aacgtcaaac cacccccgtc ttgccggggc gcgtcctact 34801 gcgatggtca tcggactaaa atcctctttt tactaaaatt gcaacttttt tgaggaatta 34861 acttttttgt gacactgaca gcagccatag agctactggc attatgagag tctgtcttat 34921 taagtcagca tttctcactt tgatatgtgg cacaagagtt tccactagta ggttgtccag 34981 cttgcggaaa cttaatgatt cttaatttat catatcagtc cggctttgtg cccgatttgc 35041 attaagaaat ctggtataaa gcacgaacct actaattaat atcaatatca gaaattttac 35101 aatctgacac aacttttctc agaaaactta aaaaatttca atttttgggc aatctatcta 35161 aatgatcaga ccagtactgt tccaaggagc aggcgcacta taacggcata ccacccgtta 35221 cgaatgaaag aaacactctt tccctattcc ctgttccctg ttccctgttc cctgttccct 35281 gttccctatt agctaaactt gcaatacatc tgctctaatg gcagactttg taaagggaag 35341 atatcacgca gcaaggtagt tctaatgacg gcatcaacaa ccattaacag aggtgagtcg 35401 cccaacggcg ataaacaaac ttcaaatgtc ttacatcaga aggttctcgg ttctcgtcgg 35461 tttagtaact actggtgggc aactgtcgtt tctttagggg ctgcgggctt cttgctcgct 35521 ggtatttcca gctatctcaa aatcaattta ctcatagttt ccgaccccac tcaactggta 35581 ttcgtcccgc aagggttggt tatgggtttg tatggtacag caggcttact cctagcctta 35641 tacctatggc taacgatttt atgggatgtg ggaggcggat ataacgagtt taaccgagaa 
35701 aatggcacca ttacaatttt ccggtgggga tatcctggga aaaaccgccg tattgaaatt 35761 aaatctcgta cagaagatgt gcaatctgtg cgagttgagg tgaaagaagg tcttaaccct 35821 cgtcgcgaac tttacctgcg cgttaaaggt cggcgagaca ttcccctaac acgggtaggt 35881 caacctttat ctttacaaga gttggaaatt caaggtgctg agttagctcg ttttttgggt 35941 gtgcccttag aaggacttta gaggctaaga tacagtggtt gtgtctgtaa ctggcaatgc 36001 ggttaaaaat tccccaattt ttagtggctc tcatcattgt cggtgcgttg atattgggga 36061 gctgttccac acagcaaggt gtttctaatt cttccccaac atccgcagcg accgaagcaa 36121 gcaacaactc aaccgccgag acaacggcta aagaggcaac cacagaggcg atttctgtat 36181 ctgaaagtac aaacgagagt attcctggaa tgaaagattt accacgactc gaaggaaaag 36241 caacggtggt tatgaccgtt aaaggttcac ctattaccat cgaagtagac ggtactaatg 36301 ccccaatcac cgctggtaac tttgttgatt tagtccagag gggcgtgtac gatggcttag 36361 ttttccatag agttgtacgc caaccccaac cctttgtggt tcaaggaggc gatccccaaa 36421 gtaaaaaccc gaaagttcca gcaagtcagt tgggaacagg aagttttact gacccaaaaa 36481 ctggaaaaat tcgctacata cccttagaaa ttaagccaaa aggctcagat gagccaattt 36541 acagcaaaac actcaagacg gctggtacag ataagccgcc tgtattacaa cacaagcagg 36601 gtgcagtcgc aatggcgcga tcgcaaatgc cagactctgc ttcctcacag ttttacttcg 36661 ctttagcaga tttgggtttc ctagatggtg actatgctgt ttttggctat gtgaccaagg 36721 gtatggatgt tgtcaacaag attcagcagg gcgatcgcat tgactctgcc aaaatcacat 36781 ctggtggaga aaacctgaaa aacgctggac agtagggaat agaaatgagg gagtgaggga 36841 gtgagggagt gagctctaaa tctcgaaaac tatgcgcgga aggctaagcc gtgcacggca 36901 gtcgctacaa cggggggaac ccccgcaacg cgctgcctcc tgaacactga cggataaccg 36961 ctcccatgct ccggttgaag ttcaattaaa cccgattgac atatgttgtc taaggtttga 37021 atagcaggta agaaactctg ttattcggag gaagaagaac caatgaacat cagctcctgt 37081 tctccctttc tccctactcc ctttctccct ttcttccttt ctcccaaaaa tggtcgcagt 37141 tgttgtcact ggtattggtt tagtttccgc tttaggcaaa aatttagagg acagttggca 37201 aaagttaata gcaggtgaat ctggaattag ataccatcaa ccatttccag aactcgaacc 37261 acgtccctta ggaatgattt gggaacaacc agcacagatg agaatgctga ctcagttggt 37321 tgtgacatct gctttgaaag atgcaggatt atgctcacct 
ttaccagatt gtggagttgt 37381 aattggttcg agtcgcagct atcagggatt atgggagcag atggcgcgac agatgcacac 37441 aagagtaggg gagctcttga gcaggggagc agggagaaaa ggagaggaag agctaatctc 37501 cattggttct tcttcccccc agcaccccgc taagagcccc gtttcctctt cttacctcac 37561 tccctcactc tccccctctt ctaagatggg ttggttagat actttacctc atatgaatgc 37621 tattgcagca gcacggcaaa tcggtgcttg tggagttgtt ttggcaccaa tggcggcttg 37681 tgcgactgga atttgggcga tcgcacaagc tgcttatctt atccaaactg ggcaatgtca 37741 gcgagtgatt gctggtgcag tggaaacacc gattacaccc cttaccatag ctggatttca 37801 gcaaatggga gccttagcga aaacaggagc aaatccattc gatattcgtc gggaaggctt 37861 ggttttggga gaaggaggcg cagtctttgt tctagaatca gcagagttgg cacagcagcg 37921 tcaggcaaaa gtatatggac aaattcttgg ttttggcttg acagcagacg cctatcatgc 37981 aaacgtgcca gaaccagaag ctaggagtgc gatcgccgca gtgaaacaat gtctaaaccg 38041 cagtcacctg agtgcaaatg atattgatta cattcatgct catggtaccg ctacgcaact 38101 caatgactct ctagaaagct ttttgataca aaagttattt tttcaaggtg tggcagtcag 38161 ttctacaaaa ggagccacag gtcatacatt aggagcctca ggagctttgg gtgttgcttt 38221 ttcactgatg gcgctagagc aacagttttt accaccttgt gttggtttga gacacccgga 38281 atttgattta gatttggtga gggagtcgcg tcaaagtaaa attcagcgcg tattgtgttt 38341 tagctttggc tttggaggac agaatgcagt gatagctttg agtcagtatt cttagaacgc 38401 acatgcccag atgcgttatg cacttgggtt tgtgtcaata agttatatgt caattgtttt 38461 ctgtatttat agatacttaa ttaatgattt cacagtatct atctggagac agattttttt 38521 tattttgctc tgatgagagc ataatgagag aaggtactat ttcttatcag ttttcagtta 38581 atctaataag acttttaata aagtgactcg ccataaaatt ttcaaccatg acattatggt 38641 ctgcacccat gagtctgtag aaggtaatta cctctattgt agtagggtgc taggtttgac 38701 tgaaccagag gatattattc agttgcaccc agatttgaag tcgcaatgga atgttattac 38761 cgaacactat gaacggatag gtttaagcta tagcaaaaat gttatctggg atgtcgcgct 38821 gagagttttg gaagactatc ctaactatga cgtttccctt ttcttcttcg gcaatgcgac 38881 aagcaaagct ggttgtgatg aagattggtt tcaccaggta gattccgact ggttgaacgt 38941 agttaaattt atcaactcca aaaacaactt tatacagctt gcacaagaac ttggagtcag 39001 tgttcctgta acgtcttatt 
ttgaaaacaa gactggtatc aaagacttat ctaagtttct 39061 ttacccgtgc tattttaaag ctgcgatttc tgttaatggt gttggaattc accgatgtga 39121 aaatcaacaa caactttccg aaattctaaa aagatttcca gatgaaatac ctttgcaaat 39181 tcaagaggaa atcgcagctt cctctttttt aaatgtgcaa tactatgttg aagcttcaaa 39241 gctacaacgt ctcgccataa cttctcaaat actcgatggt tgcgtccata ttggcaattg 39301 ctatcccagt aagcaccaac catgggaaac tgttgaacca atagctgagt ggatggtgca 39361 acgtgggatg aaagagatat ttgcctttga tgttgcagtc gtggaagatg caacacacgg 39421 cgaaacgcgt tatctagcaa tcgagtgcaa tccacgcttt aatggcgcat catatcccac 39481 aggtattgct aagaagctta atatccctag ttggaactgc gaaaacttta ggacgcaata 39541 tcgctcactt gaaaagcttg atctcagtga tattgagttc aaccctcaaa ccaatacagg 39601 tgttgtgata gtcaactggg gcactattct agttggaaaa atcagtatct tgattgctgg 39661 aggtgtccaa gagcagaatg aactcagaac catattaaaa gagcgattgt gaaaggctga 39721 gatactctgc tgattatccc tacacatttg aatgcaaatg ctcttgaaag taattcctag 39781 tagaatctgc ttatacatta agccaacaaa cttcaacttc aaacctccta cgcaggtaca 39841 gaaaacttag tggtgatagt tttagcttcc ctcatattcg ttttagacgt tcggttggaa 39901 ggattcacac tcgccacgct cccattacga aaaaatcgat atgatatgta aaagtagata 39961 aatcttaaaa atagtttatc aatgctggtt tcctcctctg ccttgtcaca ggtcaaaaac 40021 tcagtcaggg aaaaggtttt tttgggttat gaacctactc ctgagttact gggaattctt 40081 agtgtttatt ttgttcaagg cattttaggg ttggcgcgtc ttgctgtcag ctttttcttg 40141 aaagatgaac tcttgctcac tccagctcaa gttgcagctt tattcggcgt agtttttcta 40201 ccttggacac ttaaaccact gtttggtttt ctctccgatg gcttgcctat attcgggtac 40261 cgacggcgtc catacctagt tatttctggg atactgggag ctatttcttg gataagtctg 40321 gcaacaatag ttcacacacc tataggtgct gggatagcga tcacacttaa ttctctctct 40381 gttgccgtca gtgatgtgat tgtagactca ttggttgtcc aaagagtaag agctgagtca 40441 caagcaaaag caggttctct ccaatcgctg tgttggggta cttcagcgtt aggagggtta 40501 ataacagctt acctcagcgg tatactctta gagtatttca ccactcgcac tatcttttgg 40561 attactgctt cattcccgct tctggtgtct gctgcggctt ggttaattgc tgagtcacct 40621 gtgagcaaag acgcaagcgg ggatgattca aatgtcgtat caattaggca tcaactgcaa 40681 
ctactacgtg cagccgttag tcaaaaagtc atttggttac caacagcatt tatcttcctt 40741 ttgcaggcta caccaacagc tgaatcagct tttttcttct tcacaaccaa cgaactgcac 40801 tttgaaccag aatttctggg acgagtacac ttggtgacaa gcattgcttt gcttgttggt 40861 gtttggattt ttcaacgctt tcttaaaact gttccttttc gcgtcatttt tgcttggagt 40921 actgtcctct cagcagtatt aggaatgaca acgctggtat tggtgactca tactaaccgt 40981 gctttaggca tagatgatca ctggttcagt cttggtgata gctttatttt gtctgtgatg 41041 gggcgaatag cttttctgcc agttatggta ttagcggcga ggctttgtcc ccctggaata 41101 gaagctactt tatttgccct gttagtgtcg gtgcacaact tgggaggact agtttcacag 41161 caattcgggg cagtgctgat gtattggctg ggaattaccg aaactaattt taatgctttg 41221 tgggtgttgg ttattattgc taacctcagc agactcctac cattgctatt catcaactgg 41281 ctccctgctg ctgactctca aactgaaaca tcaactttgg aatcagcttc aacaaacagt 41341 ggagaagaac catttttacc tgaattcatg tctgagttga tcgtgcaaga accagaatct 41401 gaaccagttg aatagcaaca gactaattag tcgtgactca tttttttgaa aaagtcacaa 41461 ttaaccattc tccattccta atctaaaatc caaaatttac aattcaaaat ttcaaatcca 41521 gaattgacat gcaaactttt caagtttacg aacaaggatc tacagttcca gaaaaatcct 41581 ataatcgcgg ggattggcag agaggatatg aatccctaga gaaagaattt gattattgga 41641 ttgatgatgt agaaggagaa attccgcagg aactgcaagg tacgctgttt agaaatggtc 41701 ctggtttgtt agatatcaaa gggcaacgga ttcatcaccc atttgatgga gatggcatga 41761 ttagccgtat caccttctcg aatggtcgtg ctcacttccg caaccgcttt gtccgcactc 41821 aagagtattt ggaagagcaa aaagctggca aaattctcta tcgtggtgtc tttggtacac 41881 aaaaacccgg tggttggttg gctaatgctt taaatttcaa actcaaaaat attgccaata 41941 ctaacgtgat ttattggggc ggaaaactgc ttgcactgtg ggaagctgct gaaccttatc 42001 gtcttgaacc atatactttg gaaactttgg gcaaagaata ctttaacggt gtgttgtcag 42061 aaggagaagc attttctgcc catccccgcg ttgactttag ttgtgctcaa gataatggtg 42121 cgccaagcct tgtcaacttt tccatcaagc cgggattatc caccacgatt actattttcg 42181 agctaaacac tgagggtaag attgtacgac agcacgccca tagtgttcca ggatttgcct 42241 ttatccacga ttttgccatt acccccaatt actgtatatt ctttcaaaat cccgtcacct 42301 ttaatcccat accttttgta ttgggaatgc gtggtgcagg ggagtgtatt 
aagtttcagc 42361 cagaacaacc cactcgcctg atcatcattt cgcgtaaccc gaaacaaaag ggagtaaaaa 42421 ttatagaaac gcgagccggc tttgtcttcc accatgtcaa tgcaattgag cgggaagatg 42481 aaattgttat tgactcactt tgctacgaat ctctcccaga agtgcaaccg gaaagcgatt 42541 ttcgacaagt agattttgat gctctcaagc caggacaatt atggcgtttt catcttaatc 42601 tcaaggatga gacagtgcac cgggagttgc tggtcagtcg gtgttgtgaa tttcccacct 42661 tacatccgca aaaggttggt cgtccatatc gttatttata catgggtggc gctcatgcag 42721 agtctggtaa tgctccattg caagcattta tgaaagtgga tttggagtca ggaaaacaac 42781 aactttggag cgctgcaccg catggatttg ctagtgaacc tatttttgtt ccacgcaccc 42841 cccatatccc tgcttctcaa ggggggacaa aggggggtga agatgatggc tgggtgctgg 42901 cgttggttta tgattccgag taccatcgct cagatgtggt gattttagat gccaaagatt 42961 tccacaaaga acctatcgcc agattgcatc ttaagcacca tgttccttat gggttgcatg 43021 gaagttttac ttctgagtgc tttgtctgat ttctacgtat agctggtaaa tcattgggat 43081 tgggggaaaa ccccaatctt agtcaccatc tgctagagaa cccgtctaca cagtcagctt 43141 ctttgttgat aactaccgat tacgcaatca gtttttggta tagttatcaa ttattttagc 43201 taagcccggt atcgcctcct ccacatgctt cttagctgag ttgcggagtt tgctatatac 43261 acctcgcacg gtcttattgt tgcttttctc aatccttgcg tcagtaatgc tgagtattgc 43321 atcggctgag cgatcgcggt tcgcaatcag gtgttccacc ggatcacctg tgtgtacgcc 43381 ctcactccaa ataggatcga gtgctgcgaa ggattctgga agtaacgtcg caatggcgct 43441 agcgcaataa gtgggcgcaa gcccctttac agcagtgtag ccggctttca tcgccaaacc 43501 agaaacccct tgcatcgaag caacttgcac gtctagcaac ttcgtgcagt cagccacgac 43561 tgtatctttc ttaatcggat ctgacagacc gtcgcttagt cccatttcaa ttctccttaa 43621 agtcatcgca cctaatttct attcaattaa ttatcctaac aagcaggaaa ttacggaaaa 43681 tggtatgata atttagagaa ttttcataaa attttcaatt aagttcataa attctttaag 43741 aaaaatttga cacgaaaagg tgagtcactc gcttcgctca aattcaaaat tcaaaattca 43801 aaattcaaaa ttaattgtgt aatcgcttgt actcaccagc agttagaa // LOCUS NODE_530_length_43685_cov_5.11971143685 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 43685)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 43685)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method                   :: SPAdes v. 3.11.1
            Genome Representation             :: Full
            Expected Final Version            :: No
            Genome Coverage                   :: 17x
            Sequencing Technology             :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
source 1..43685 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 190..393 /locus_tag="DP116_03635" CDS 190..393 /locus_tag="DP116_03635" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03635" /translation="MDLAQFPLVWVFIRKPQSHTQIKVKQSCRFFQEIQEKVRCQNRF CFLTSRYYTTFNGYPDSFFVTDF" gene complement(482..883) /locus_tag="DP116_03640" CDS complement(482..883) /locus_tag="DP116_03640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03640" /translation="MNKILKRAFMPSLLAATLTGVSFIPARPASADDKILEDTAIGAG VGAVSGLIRGRSVVKGAINGAGTGAAINGANGLRGTHREREKRSIIQDAAVGAAAGAA TNGITNGGRGTFGSAATGAATGVIINKIRSK" gene complement(1016..1285) /locus_tag="DP116_03645" CDS complement(1016..1285) /locus_tag="DP116_03645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03645" /translation="MPKFVMWGSYCQDVLEKRAPYRQAHLDGLAKQKESGVLVTIGPT TDVTKVFGIYEAEDEAIVRQLVEADPYWQNGIWTEYSVQEWIQAF" gene complement(1385..3817) /locus_tag="DP116_03650" CDS complement(1385..3817) /locus_tag="DP116_03650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656129.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phenylalanine--tRNA ligase subunit beta" /protein_id="PRJNA477356:DP116_03650" /translation="MRISLKWLQELVELKLSPEELAETLTLAGFEVEDIEDRRTWAAG VVVGKVLERQPHPNADKLSVCQVDIGTDETLNIVCGASNVRADIYVPVATVGTYLPNI DLKIKPAKLRGVPSQGMICSLKELGLPNDVDGIHIFPQENLALGSDVRPLLGLDDVIL DVTATANRADALCMVGIAREVTALTGAKLTLPQVGEVSISQGGFNLNLKIADTQACPA YIGTVIDQVKIAPSPEWLQQRLRAAGVRPINNIVDITNYVMLEWGQPLHAFDAKRLQS VAGGENLAIGVRFANAGETLKTLDGQTRTLSTQNLLITANDKPVAIAGVMGGEETEVH DGTQNLVLEAALFDSVAIRRSSRSVGLRSESSGRYERGVNRAELEVATRRALSLFREL SGGVIIHQEINDSRPDRSTWSRSIELRLDRVNQVLGPIDLGEETGEIQSQDVERILTA LGCQLTSIPDNRWTVSVPPYRYRDLEREIDLIEEIARLYGYDRFYDTLPDKAEAGYLP VDQELLRKLRALLRAEGLTELIHYSLVKPGEDRQIVLANPLFVEYSALRTDLISGLID AFQYNLEQGNGSLNGFEIGRIFWQEEGLQETDAIAGIMGGDISSSKWTRSDRQQPMTW FEAKGILENVFKQFEVQVEYQPDCRDSRLHPGRTASLWLGGNRLGVFGQLHPQLRQEK GLPDSVYVFQWDVDVLLDSLDDDKILVPQFRPYSTYPAADRDIAFFAPVKVSVAEIEK VITKAGKELLESVELFDEYRGEHVPQGQRSLAFRLIYRTGDRTLTDSEVEPVHNKVRE ALAEKFGVTLRS" gene complement(4265..5683) /locus_tag="DP116_03655" CDS complement(4265..5683) /locus_tag="DP116_03655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316844.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_03655" /translation="MNNFGTFTKLRPLSLLRHLSNCSESTCLQVLSSSISWSIYLEQG KITYATHSVEPFDRLERHLRRLSHHVPSIVGEVRVQLRLMFEPDTNTQPVVHHSDNPE ELIPPPIPPEYQAICWLVSERYLHSTQAAVLIQELVKEVLESYFLTKQATFTLKDSDY RVPTICKLDVEKIAERCQQKLQNWHSLGPHISSPYQRPYLLMKTTDHQKNLSGIEPQL TEWMKGFSLRHLSVIMNQDEVQLAKTLYPHILNGTILLHEPDPPFDKLPKTFTDFIQA SRSVTGSTHTKLVDSEVGSTRTTYSDNSAISASHVATQQRQTPVSPEKKEIQKSTISN NNNLKHEAGTSDTVTPRKVYKIISVDDSPTILKEISRFLEDENFSVVTINDPLKAVMS IIRYKPDLILLDLNMDGMDGYELCRIVRNNSMFKKTPIIMVTGNKGLVDRVKARLVGA SGYLTKPFTRADLLKIVFTYLA" gene 6665..7339 /locus_tag="DP116_03660" CDS 6665..7339 /locus_tag="DP116_03660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316843.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03660" /translation="MRDTLKLDEVVEFAENPEPRCPCVLLLDTSGSMQGVALDSLNQG LQSFKEELIKNSLAARRVEVAVVTFDSHVNVVQDFVTADQFSPPMLTAQGLTSMGAGI HKTLDMIQERKAQYRANGIAYYRPWVFMITDGEPQGEFEDVVEQATRRLQEDEVRKRV AFFTVGVDNANMARLTQIAVRTPLKLQGLNFVEMFVWLSASMSAVSHSKVDEQVALPP IGWGSV" gene 7391..8173 /locus_tag="DP116_03665" CDS 7391..8173 /locus_tag="DP116_03665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996974.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="protein phosphatase 2C domain-containing protein" /protein_id="PRJNA477356:DP116_03665" /translation="MNTSKQRPHWRVVAASVCGTSHLKNKQLCQDAHHWQILPDNILV VAAADGAGSASMGKVGAMIAVETAIENISIKKFSRRTLVDDSAVRSLLTEAIIAAKKA VEEEAVACQKQSLDLASTLIIALATPEFVAVAQIGDGVAVVRDFQDNLIALTIPDSGE YINETVFLTSPTALDAVQLRLWRQAVANVGVLTDGLQMLAMNMAVGVPHKPFFLPLFD FAANADDKTVAKEQLVRFLRSERITQRTDDDLTLIIAALSDL" gene 8238..10427 /locus_tag="DP116_03670" CDS 8238..10427 /locus_tag="DP116_03670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316841.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03670" /translation="MQVLRCLPNQQISNVSLTVSLGRGGEACIYTVPTDAGLVAKVYH KPTPAQARKLEVMLAHPPDNPTASLGHISIAWPIELLRSSNGAREVIGFLMPRIQGMR PIIDFYNPRTRRQHCPLFNYQYLIRTARNLASAFAALHERGYCVGDVNESNILVSDMA LVTIVDTDSFQVKDPENGVIYRCHVGKAEFTPPELHNKTFAQCDRNSTHDLFGLGVVI FQLLMEGTHPFSGIYQGAGEPPPYEARIVSGHFTYSQKRRVPYVPTPIAPPWDILPPG LQELFIRCFEDGHNNPQIRPNAQTWLTALADAENSLISCTVNPQHRYSNHLDKCPWCE RTLRLGGRDPFPSLKAIANKEHLQPRVQKKKRQTQVPRTPQPTIPFNTHYRSSGFSQK IPYYKPQKKQNFYPVVFCLLGLVGALGYLDLMVKFTNRPFIAQNSYAQQNLISLQQNQ SHKNQNFADYYKQGHASYKVKDYQQAIESFTQAIQKDPKYAKAYVNRGNAHYNLKEYE AALADYNQAIGINSTEVKAYVNRGNSRYMLAEYSTDPDKEYNLAIADYNNALRLNPNE VEAYIRRGVVRSQMAKYSGDSQQEYKKAIADFNQALSLNSSKAEAFFQRGLVRYQVAQ YSSDFEQEYKHAIADFNQALSINPKLSKVYLKRGVVRYELAQYGGGKSSQYHQQAVDD LQKAAKISLEQEDMENYQQALSSICVVVENKCDTFLQTTTNTSPKSN" gene 10873..11646 /gene="rfbF" /locus_tag="DP116_03675" CDS 10873..11646 /gene="rfbF" /locus_tag="DP116_03675" /EC_number="2.7.7.33" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136801.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glucose-1-phosphate cytidylyltransferase" /protein_id="PRJNA477356:DP116_03675" /translation="MKAVILAGGLGTRISEETSIKPKPMVEIGGKPILWHIMKIYSTY GINEFIICCGYKGYVIKEYFANYFLHMSDVTFDMRFNQMNVHAGKAEPWRVTLVDTGD NTMTGGRLKRVREHIGNETFCFTYGDGVSNVNVEELINFHKKQNNLATMTAVQPPGRF GAIVLGQEQTKITSFREKPEGDGAWINGGYFVLEPEVINFINDDSTVWEQTPLEKLAE MEQLSAYKHNGFWQPMDTLRDKNYLEDLWKNNKAPWKVW" gene 11746..12963 /locus_tag="DP116_03680" CDS 11746..12963 /locus_tag="DP116_03680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="L-2-hydroxyglutarate oxidase" /protein_id="PRJNA477356:DP116_03680" /translation="MYDFAIIGGGIVGLSTAMALGKRYPNARILVLEKESNWAFHQTG NNSGVIHSGIYYKPGSFKAKFCRDGCRSMVEFCQEHGIEHEVCGKVIVATDETELPRL ENLYTRGLENGIDVKRMTPEEVKEVEPHVSSVGGVLVSSTGIANYKQVCHKYAEIIKQ QGGELRLNTKVEKIVISGKHQVLETNRGPFETRFVINCAGLHSDRVAKMGKTDPKAKI VPFRGEYYELTPEKRYLVKGLIYPVPNPDFPFLGVHFTRMIDKSVHAGPNAVLSLKRE GYNKTDFDLRDFAEVMTYPGFWKLAAKHADEGIQEIIRSFSKAAFTRSLQKLIPEVQQ EDLVPTHAGVRAQALMNDGKLVDDFLIVQGQNSVHVCNAHSPAATSSIEIGKAIVDKI PQQPHLKAVVTQL" gene 13088..14029 /locus_tag="DP116_03685" CDS 13088..14029 /locus_tag="DP116_03685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316838.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 8 protein" /protein_id="PRJNA477356:DP116_03685" /translation="MFLNSDVQPIVLVCAADNNYAMPLTVTVRSAVANLKKNHQIALY VLDGGITEANKRRISKSFNKEQVSISWIQPDNAVFENLVLTRHLTLTCYYRLLITEYL PKEFHKAIYLDTDMVVTGDLAELWAIDMGDNYALAVQDDVELYVGMSEGLKNYREVGI SPDEKYFNSGLLVINLDKWRSEDIGKKVLEYIKQNREYVRNDQDGLNAVLAGKWGELH PKWNQMPKIHEYSSWKDSPFTEDIYNELQHNPCIIHFTNSPKPWYAGLREECKHPKKH LFFQYLDMTDWSGWRDTIWRRFWRKFMKVTSLTTSKL" gene 14209..15237 /locus_tag="DP116_03690" CDS 14209..15237 /locus_tag="DP116_03690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127770.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD-dependent dehydratase" /protein_id="PRJNA477356:DP116_03690" /translation="MRILVTGTEGYLGSLLPPLLIERGHEVIGVDTGYYKVGWLYNGT EITAKTLNKDIRHITPEDVQGVDAIVHMADLSNDPTGQLAPHITYEINHKGSVRLAKL AKEAGVRRFVYMSSCSVYGVATEGDVTEESPVNPQTAYAECKTMVERDVKPLADDDFS PTFMRNATAFGASPRMRFDIVLNNLAGLAWTTKEIKMTSDGTPWRPLVHALDICKAIV CAVEAPRDIVHNQVFNVGDTANNYRVKEVAEIVAQVFPDCKLSFGTQGADNRSYRVSF EKINTVLPGFKCDWNAQRGAQQLYDLFSQIDMTEEVFLSRGFTRLKQLEYLIRTQQID KDFFWSQK" gene 15618..16607 /locus_tag="DP116_03695" CDS 15618..16607 /locus_tag="DP116_03695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407823.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_03695" /translation="MNKLLTIAIPTYNRAQLLDKQLAWVAQAIIGLESECEIFVSDNC STDNTQEVIKKWQTNLSHITFKSSRNAENIGVMRNIIHCLKSATTKYVWTIGDDDPIQ DRAVAYVINKLKKYEDLSLLFLNFSGRNQKTGEPVHPPTIVGNRWFDVDSEDGSGDPK AIFEHCFSKSVGAVIFLTASIYRTDLVQRALQIWSEAENNWISLAYLAGYCAANGRII VTKDIYMECIVGVSYWQKDPQSALLMQYKHLPEVVTKLEENGYSRQFYRRMMFQSFKE ANLKVFLGALRRWPMFTIKTIVPFLTLVGLSVFDAVPTKEFKMAQTTEPFTQN" gene 16711..17562 /locus_tag="DP116_03700" CDS 16711..17562 /locus_tag="DP116_03700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phytanoyl-CoA dioxygenase" /protein_id="PRJNA477356:DP116_03700" /translation="MFNSTINKISQLTSELGYRAALLNHARKLPALEESDRVIVDTLK RDGVYVTTLEKLGLGSTSALLKASYNQLSRMTDASNSHLTKRLPQIYTVTDLPEFSQW GREQKLLNIIENYIGLPVAFQGVHLRKDFPNEDQFGTLLWHKDSEDRRMLKMIIYLSD VEQKHGPFEYVPVSLTSLYSLNYYRIYYKLWQSGYLGITDEQLKEVIPEDKWKSCPGP AGTVIFTDPKVALHHGTLRTEERPALFFTYTANPPKRPELCTQYWDDTFAKPESYQEA DSVTALR" gene 17780..18331 /gene="rfbC" /locus_tag="DP116_03705" CDS 17780..18331 /gene="rfbC" /locus_tag="DP116_03705" /EC_number="5.1.3.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dTDP-4-dehydrorhamnose 3,5-epimerase" /protein_id="PRJNA477356:DP116_03705" /translation="MIFVETELKDAYIIELEQKQDHRGFFARTFCAQEFEAHGLKPTV AQCNLSFNYKKGTLRGMHYQTLPAAETKLVRCTQGAIYDVIIDMRPESPTYLQYIGVE LTAENHRALYVPEMFAHGYQTLTDSAEVAYQVGEFYTAGYERGLRYDDPFFNIQWPLE VTDISEKDKNWPLMKMMSVGGNV" gene 18414..19724 /locus_tag="DP116_03710" CDS 18414..19724 /locus_tag="DP116_03710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872577.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_03710" /translation="MIIVDRALQARAEAGNPIKVGMIGAGFMGRGIANQIKNSVPGME LVAISNRSVDAAKRAYSEAGIEDSKVVSTVTELEEAIARNQYAVTEDPMLLCRAEGID ALIEVTGAVEFGAHVVMEAIAHHKHIIMMNAELDGTIGSILKVYADKAGVILTACDGD QPGVQMNLYRFVKSIGLTPLLCGNIKGLQDPYRNPTTQEAFAKRWGQKAHMVTSFADG TKISFEQAIVANATGMTVAKRGMLGYDFNGHVDEMTKMYDVEQLKELGGIVDYVVGAK PGPGVFVFGTHDDPKQRHYLNLYKLGEGPLYSFYTPYHLCHFEVPLSVARAVLFQDYV LSPLGGPLVDVITTAKIDLKAGETLDGIGYYMTYGQCENSNIVQEQNLLPIGLAEGCR LKRDIPKDQVLTYDDVELPEGRLCDKLRAEQNAYFTKSKTLAAV" gene 19838..19948 /locus_tag="DP116_03715" /pseudo CDS 19838..19948 /locus_tag="DP116_03715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872578.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" gene 20318..21403 /locus_tag="DP116_03720" CDS 20318..21403 /locus_tag="DP116_03720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209997.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_03720" /translation="MKIALVHDYLTQKGGAERVFELLCKRYPQADVFTSLYDPEKTID IGERIVNTTFLQNIPGAAKYFRLMAPLYFPAFRALDLQDYDLIISSSTSFAKAVRKNQ KSRHICFCHNITRFLWDTETYLREYGDYRYFAPLIDQVFEMMRKVDLAYAQEPDLYIA NSSIVARRIKSTYGKEAIVINYPIDTSKFVFSDTKEEFYLASARMISYKRLDIIVEAF NWLGWRLLISGNGPERERLKSKALSNIEFLGHVTDIQRTQLFSKAKSVIVAALEDYGL VPVEANASGTPVIAFGAGGVLDTQIPGKTGVFFQKQTPESLNRALLEAREIYWDYNNI RNHAVANFSEEAFFNKVEQVIEQACTV" gene 21445..23688 /locus_tag="DP116_03725" CDS 21445..23688 /locus_tag="DP116_03725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872581.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="capsular biosynthesis protein" /protein_id="PRJNA477356:DP116_03725" /translation="MVQSSLSPYGNPVTESEPGYGQLFAVLIRRFPWFLAAFVASVAV AGVMTKKTAPTYRSSMQLLVEPNYQGKKEGGGVDSQFTEPTVQVDAATQLSLMQSSPL LKKAVTELKSKYPDMTVGQLKSSLVLSQIKNKDDNVATKIFQVQYTEKDPVKTHKALQ AIQKVYLAYNIQQQNERLKKGLKVIRQQLDEARNDVKKADGELRDFRTGKNITGKNLI DPETQAKATQDELTRIVQERGTARSLYKEAEATYKNIQKQLQSTPQNALVEARLSQST RYQGLLNEIQKTELALAQERLRFTEEAPSVQKLAGQLQSQKALLQEEQRRTLGAESAQ AISQAPSLLQQGQKSAIDLNLAAKLVDTQTTMLSLSAKDEVLAKKEQELREQLTKFPK LLAEYGRLQPQVQLSRERLQELLKAEQRLRQEIAKGGFNWEIVEEPQEGIRQGPNEQQ NLMLGAVVGLMLGGIAAFVREAADDSVHTTAELERQVALPILGTTPKLPPAKTRESVI KLPFGKPDVPAPWTIQVLQSSPRWESLDLIYKNIELLNSVASFKSLMITSALSDDGKS GLALGLAMSAARLHKRVLLIDANLREPSLHKQLNLPNEQGLSTLLASEVTIPNQISIQ SSGSSYIDILTAGPTPADPANLLSSPRMQQLMATFENNYDLVLVDSTPVLGLVDAMLT ASSCRGVVMVASIGRVTRTQLTQATAMLSRLNLLGVVANGVSNSSSTFVPYTHQQRFA LQQAVEK" gene complement(23691..23969) /locus_tag="DP116_03730" CDS complement(23691..23969) /locus_tag="DP116_03730" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03730" /translation="MATEAQESPRLKTPEILKKIEAESPQLSKKPLHTIGVTRSRRVK LLPLLRKNTAYAFLGSMCRFIYIFGCAFVLVSQARHIQAVCALHQAVA" gene 25415..26251 /locus_tag="DP116_03735" CDS 25415..26251 /locus_tag="DP116_03735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="PRJNA477356:DP116_03735" /translation="MTTSIIPTLQSNYTVKQQQQDNPSQYCALRWCLGQLLVIPPGGI TQPYMPSLDRKELLVECLKRSPANLVRIDPRLGETRLRFWADACAQANKPIFLRIPSI DKQPKLLDSTLWWLKRFTDWLTALIFLLAISPIMLGLVLLMRISSPEPRLLFSCQWHV GERGKVFKVFKFRTTTASKKAMGDKGITYPEDLCDGEDSQNLTKLGRWMRKYGLDHLP QLLNVLRGEMSLTGPRCWTLEDAVRLSPEAQRQLNRLPGMMRSWEIEAESNLLHLDSS TL" gene 26486..28336 /locus_tag="DP116_03740" CDS 26486..28336 /locus_tag="DP116_03740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407815.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_03740" /translation="MHVKLSKPVQNLLKNSVKTTKFWQDNYLILREFKHFRRITVFAL VFSFVAAVFEGVNVGLLSAFLQTLTTPDIPFKTQIDWFNTSVLGVNASAPERLYRVSA LILFSTWIRSGLNYLAQLYTEITQLNLVDRLRKQIFEKLQSVKISYFSKTKSGELINT ITTEIERLKQAFGSAAFLFARTLTAVTYLVSMFVLTWQLSIISLMLFSLLAVALSTLN GRVREASFEVTKASNRFTSIAVEFINGIRTVQAFSTENFERQRYYNASLNLFTASKNI ALVWMVVKPLAEALATTVLVAIIVLAVTGVLTNGAPQVGSLLTFFFVLFRLVPIIQDI NGVAAHISTMYGSSEVVKKLLAIDEEQYFRNGHIEFPGLKRSIEFVSVDFGYDSESLV LSNIRLMIERGRMTALVGASGAGKTTLADLIPRFYDPTRGHVFIDGVDLRDIDITSLR RKIAVVSQDTFIFNTSVRNNIGYGSEGATDEEIYKAARLANAMEFVQEMPEGFNTQLG DRGVRLSGGQRQRIAIARALLRNPEILILDEATSALDTVSEKLIQESIEKLSVGRTVI VIAHRLSTIVKADKVVVLEQGQIVEQGGYQELLELKGKLWKYHQMQHQLN" gene 28728..29903 /locus_tag="DP116_03745" CDS 28728..29903 /locus_tag="DP116_03745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407813.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="colanic acid biosynthesis glycosyltransferase WcaL" /protein_id="PRJNA477356:DP116_03745" /translation="MPNLITATQERAEKLNQPEYDENLIYHCSSFRVNGNGGAETYLT SLIQSRQSGVSDFVIKSLKELDQSRFKLLHIHSPDLLEQVKGECPTVFTVHNHSLYCA SGTKYLAAQDVICDRNFSYLGCLWGKIIDGCGSRKPARVIQELKSTHHLNHFIRNLKV TFLANSDYVREQLIKNGLPPQQTVTLRCGITVPQIATAPLSLETYKIGRILFVGRIVP DKGLEWLLKTLVHTDSQIHLDIAGEGWERPRLEKLAQKLGLNNRITWHGWCDSNKLNQ LYEQCFAVIFPSVWPEPAGLVTLEAYAHYRPVIASAVGGIPEYLRDGETGILVPANNI KMLAQAITHLSSDYQKCRQMGEQGHALLMQEFTMDVHVKHLQKIYENTISEFASQKI" gene 30189..31178 /locus_tag="DP116_03750" CDS 30189..31178 /locus_tag="DP116_03750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006199128.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_03750" /translation="MRPKSPQEVAESFGDSRIRPKSPQKESLQKESPQKESPQKVVES FGDSRIRFFRQAKNVGMFANQMNAFKMAQGKYVASLHDDDMWNEDFLEKLVPALEENP DIILAFCDQYIIDKDGNIDNVGTEGNSKAYKRTSLKKGVHQPFIEIAVIDKSVPIAAA CVIRKEFVDWDKIPQEVGGMWDLYLSYLCSRSGHGAYYYPEKLTRYRAHEQTDTNRSG SLDAQAKIRKGTAEIFCYEQFMGDEILKKYNLYFKQKWLEAHTTLAIGLLRSEKTTQA RPYLWRALSEEKFNLRTIAALTLSFIPRPIANQFKRLNATKNRIFKSEMLSQK" gene 31199..32392 /locus_tag="DP116_03755" CDS 31199..32392 /locus_tag="DP116_03755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210006.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_03755" /translation="MKLCIVTHMVKKGDGQGRVNYEVAKEAIRQGHHLTLLASEIAPE LQQSSQVNWISIPVNGFPTEFFRNFIFSQKSADWLRKHRSQVDLVKVNGAITSAASDV NAVHFVHSSWLRSPVHISRLRKDLYGLYQKLYTAFNARWEKQAFQKAKIVVSVSEKVA QELESIGVPRSRIRVIVNGVDLEEFSPGTVSRQTLGLPENVTLALFAGDIRTPRKNLD TVLRALVKVPNLHLAVVGSPEGSPFPQMAESLGLNERVHFLGYRRDIPQIMRAVDLFV FPSRYEACTLVLLEALACGLPVITATATGGAELVTPECGVVLSDSDDVEALAEVLLSL VNDRNKMQQMGQAARTVAEQHSWATMAKTYVDLFEELSKNEEHRSHTDLSPSSRPITL PSGAT" gene 32319..33242 /locus_tag="DP116_03760" CDS 32319..33242 /locus_tag="DP116_03760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015180205.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_03760" /translation="MRNTVLIPTYRRPQDLLRCLQALLEQTKPPFQVIVVVRDIDTDT WQFLEEFQETTLPLDTVKVTVGGVVAALNAGLEAVKGDTVSITDDDAAPRPDWLERIS AHFTSDSRIGGVGGRDWIHQGDKLLDDSCEVVGQLQWFGRVIGNHHLGMGEPREVDVL KGVNMSFRTSAIAGLRFDERMRGTGAQVHFEMAFTLALKRAGWKMIYDPAVAVDHYPA QRFDEDQRNSFNEIAWINLVHNETLVLLEHLPPIRRVFFLLWAILVGTRDSLGFVQWL RLLPREGKLAGQKWLASIRGRWQGWQAHVNP" gene 33555..34973 /locus_tag="DP116_03765" /pseudo CDS 33555..34973 /locus_tag="DP116_03765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949038.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="O-antigen ligase domain-containing protein" gene 35269..36390 /locus_tag="DP116_03770" CDS 35269..36390 /locus_tag="DP116_03770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407808.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase family 1" /protein_id="PRJNA477356:DP116_03770" /translation="MKVVVIMPLAELRGGGEMMLWDLMQQGRNAGVEWLVIFLEHGPM VEQVRALGIDTRVVESGRLREVHRFIAAVFRIASIARRERADMIVNWMWITHICGGLA AMLAGLPSVWYQLEVPYDQPWLVRLATLVPARAIVTLSKDGKEAQARIWPHRPTPLVY PGVSLDRFNGSTVPSPAEARRKLGLPLHGPIIGIVGRLQRWKGIHVLVEAMPKVLQKY PDAHCLVVGGKHNLEPDYEDFVKEKITALELQDKVILAGLQSNVPEWMQAMDVFVHAS DNEPFGIVIIEAMALGKPVIAGNGGGPTEIITDGKNGQLTPYGDANALADAILRYLND QEFAHNAAIAAQQRALDFSTERYVQNFINTLRSAVPSVS" gene complement(36446..36661) /locus_tag="DP116_03775" CDS complement(36446..36661) /locus_tag="DP116_03775" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03775" /translation="MRKAHALWAKAVRRAVGISDRRSRAAGIGLSALTASLAFKRSAT QHNKMFILGFGKLFTKTQQRSKVNSSY" gene complement(36987..37934) /locus_tag="DP116_03780" CDS complement(36987..37934) /locus_tag="DP116_03780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872579.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03780" /translation="MKNLKKIIKELHFYVVNSFPVPIANKLPLGALTKKERWSIGIYT GKSPVDFQAPKEIKNPVLTRRNVSDVRAGFVADPFMIKADSTWFMFFEVLNQQTRRGE IGLATSKDTKNWKYEQIVLAEPFHLSYPYVFEWMNEYYMIPETHQANSIRLYKASKFP TEWSFVGNLSSGASFLDASIFRHADKWWLFTETNPQHKFDTLRLYYADELLGSWIEHP KSPIITGNAHIARPGGRVVVINDKIIRYTQDCQPDYGTQLRAFEITELTTTSYQEREI EQNLVLKPTGVGWNGAGMHHIDPHFIHEGQWIACVDGRG" gene complement(37967..38905) /locus_tag="DP116_03785" CDS complement(37967..38905) /locus_tag="DP116_03785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203731.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-glucose 4-epimerase" /protein_id="PRJNA477356:DP116_03785" /translation="MHFIVTGGAGFIGSHLTEQLLSEGHYVTVIDNLTTGNLQNLPEH SRLKFLHKNILDCHPEDFSTQIDGIAHLAATPSVTESWLKPLEAHDNNLSATLAVIEL CQALKIPKLVFASSAAVYGNKTPLSISEDQPPAPISPYGLQKLVSEQYAVLFAKQYNF SFIGLRLFNVYGPRQVPGSQYSGVISIFVDAMLKGLPINIYGDGSQTRDFVFVKDVAT AFAKALTTPLTPGLALSCNIGTSKTTSILQLVNIMRNYFPKWELEPGFAPSRHGDIQH SLADISKALSVLNFVPQWSIESGLKNLIEYSQQQNS" gene complement(39019..40128) /locus_tag="DP116_03790" CDS complement(39019..40128) /locus_tag="DP116_03790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015180208.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_03790" /translation="MRILHLTNHVQEIGNGIVNVAIDLACMQAKDGHNVAVASSGGEY ETLLKSYGIRHFHLDQSRTPLNIIKAVWRYRNIVDEFKPDIVHVHMMTGVILAATLRF GYEYGLIATVHNEFQRSATLMGLGDRVIVVSQAVGDSMARRGIPKEKMRVISNGTLGS PRQRSIKEYEPQPLHHPAITTVSGMYRRKGVGELIDAFVEIAADFPDAHLYLVGNGPD RESLEEQARNTPLTSRIHFEGFQTEPKRYLLATDIFVLVSHKDSSPLVIPEAREAGCA IIGSNVDGIPEALDGGKAGILVPVKDSHTLAENLRQLLSDRDRLQWWRTQASQNLERL SAARVHKETLAVYTELRPNNTPNKYEYAEAKLIHK" gene complement(40683..42866) /locus_tag="DP116_03795" CDS complement(40683..42866) /locus_tag="DP116_03795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198709.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03795" /translation="MKNKFISHLGGLYRSQRNSSYLRFRQFFKFFLCLMTAMSITLYF TARVVPATKAYTTSWIGNTYGGKKWVQNYVEAMYVGSDGTVYTDSTWDEAGREAGIYK DGDVIGPDFELHGWSRVGGKAITVTDKYIYLAMVQGSMGKIPEDYPPEKTAWYCVRRY TLSGKPAGFAGGRGYDKSMLIVSTKSPVTGLAIVGKELYVGIAEANRISVYNTDTMTE VRNFSVPNPKQITVDKQGNLWIIQSKNGSSPAQILHYSSQGKRLPEVIADVVDASAIA LDNQGRLLVAENGPRQQVLIYNITAKPTRAGTFGVEGGVYAGVPGEVGDLKFYGITGV GTDAAGNIYVNSNGFNNSGTDLRKFSSSGKLQWRLLGLHFVDNADTDPQSDGTEVYTK HEYFRMDYKKPSGKQWSYKAYTLNPFKYPQDPRIHTSPTSVFFRRIQGKPFMFLTDMY ASFMEVYRFNPVTDGYIAIPSAMFSATSMNHGKPVQSWLSNQPQEEKDWIWRDRNGNG AFDQGEYDTRTLDYPYLGGWWVDSKGDLWKTLRTKVGILHFVVQGLDQKGNPIYSSSS MKKETTPNQITDLRRIEYFPDTDTMYLSGFTKDHPPIGDDSGVVGSEIIRYDNWSKGN RTPRWRTVIPYDTTGKREVMTAAMSVAGDYVFAVTGKTSEVYVYKAATGKQVQKLKPG PEVAGESGWIDIPYGIRAFRRSNGEYIVFVEEDLKGKVIMYQLPR" BASE COUNT 12909 a 9020 c 9201 g 12555 t ORIGIN 1 tccgggttcg ccagtcgcct gcggagggaa accctcccgc agcgctggtc tcaccgctgt 61 gaactatcgt ccctaactca ttgtccagta atttaccact cctgcgacaa gatgtagagg 121 aggcagtggg ttgcggtgag gcagtgtcgt gcagggggtt cccttcgtgc atgcaaaccg 181 ccgtgatgtg tggatttagc ccagtttccc ctcgtctggg ttttcatcag aaaaccccaa 241 tctcacactc aaataaaagt caaacagtct 
tgtcgctttt ttcaagagat acaagaaaaa 301 gtgcgttgtc aaaatcggtt ttgctttttg acttcccgtt attatacaac ttttaatggt 361 tatcctgata gtttttttgt aaccgatttt taatttttta actatataat aagttcttct 421 aacatcaaga ccgcctcagt tcaattctga actgaggcgg tctaatttcg agacgatatt 481 actatttgct tctgatttta ttgataatta cacctgttgc tgcacctgtt gctgcactgc 541 caaaggtacc tctaccaccg ttagtaatac cattagtggc tgcaccggcg gctgcaccta 601 ctgcagcatc ttgaatgatg ctgcgtttct ctctttctct atgagtgcct ctaagaccat 661 tagcaccatt gatagcagca ccagtaccag caccattaat ggcgcccttt accacgctac 721 gtcctctaat gaggcctgag actgcgccaa caccggctcc aattgctgta tcctcaagga 781 ttttatcatc agcagacgcg ggtcgagctg gaataaaact aacaccagtt aaagttgcag 841 ccaataggct gggcataaat gcgcgtttca aaattttgtt catagaatca ccttttcaac 901 tcatcaaccg actagataaa agttgaagtt gttccagttt ttacctttcc tctagtctag 961 acaacaaaaa actttttctc catgaaaatg ctttcatgat taccgtacct tttattcaaa 1021 aagcctgaat ccactcttgg acagaatatt cagtccagat gccattttgc caataaggat 1081 cagcttcaac caactggcgt acgatcgctt cgtcttccgc ttcataaatg ccaaaaactt 1141 tagtgacatc tgtcgtagga ccaatagtaa ctaacacacc ggattctttc tgttttgcta 1201 gtccatctag atgcgcttga cggtaagggg cgcgtttttc cagaacgtct tggcagtaac 1261 ttccccacat gacaaatttc ggcataattt tacgcaattc caaatctaaa aaatcaaaat 1321 tcaacaaaac aggaacaaac aagagcaaaa aatttcttta gttttcactc ttgtttcttg 1381 tggcttaact cctcaaagtt acgccaaatt tctccgctag ggcttcgcgg actttattat 1441 gtactggttc aacttcgcta tcggtgaggg tgcgatcgcc tgtccgataa atcaagcgaa 1501 atgccagact ccgctgtcct tggggcacgt gttctccgcg atattcatca aacaattcta 1561 cagactccag taattcttta cctgctttgg tgatgacttt ttcaatttcg gcaacactca 1621 ccttaacagg cgcaaagaaa gcaatatctc ggtctgcggc tggataagtg gagtaagggc 1681 ggaattgtgg tactaggatt ttatcatcgt ccagagaatc caaaagcaca tctacatccc 1741 actggaacac gtagactgaa tctggtaaac ctttttcttg gcgtagttgg ggatgcaatt 1801 gcccaaacac accaagccta ttacctccta gccataggga agctgtgcgt cctgggtgca 1861 agcgggaatc tctgcaatca ggttgatatt ccacctgcac ttcaaactgt ttaaacacat 1921 tttctaaaat acctttggct tcaaaccagg tcataggttg ctggcgatcg 
cttcttgtcc 1981 atttgctgga ggaaatatcg cctcccataa tcccagcaat ggcgtctgtt tcttgcaaac 2041 cttcttcttg ccagaaaatt cgtccaattt caaagccgtt cagcgaacca ttaccctgtt 2101 ccaggttgta ttgaaatgca tcaattaatc ccgatatcaa gtcagttcgc aacgccgaat 2161 attcaacaaa taacggattt gccaggacaa tttgtctatc ttccccaggt tttaccaagg 2221 aatagtggat taattctgtc aatccttctg ctcgcaatag tgctcgtaac ttgcgaagca 2281 attcttgatc aacaggcaaa taacctgctt ctgccttgtc tggcaaggtg tcataaaagc 2341 gatcatagcc gtagagacga gcaatttctt caattaagtc tatttcccgt tctaagtcgc 2401 ggtaacgata gggaggtaca gaaaccgtcc acctgttatc gggtatggaa gtgagttgac 2461 atcccaatgc tgtgaggatg cgttcaacat cttggctttg gatttctcct gtttcctcac 2521 ctaaatcgat cggtccgaga acctggttga ctcgatccaa gcgtaattca atcgaacgcg 2581 accaagttga acggtcagga cgagaatcat taatttcttg atgtatgatc actcctccag 2641 acagttcgcg aaagagcgat aacgcacgac gagttgcgac ttccaactca gcgcggttga 2701 ctcctcgttc gtatcttcca gaggactcgc ttcttaaacc gacactacgg gaagaacggc 2761 gaattgcgac tgaatcaaat agcgctgctt ctaaaacgag gttttgagtt ccatcatgga 2821 cttcggtttc ttcaccaccc ataactccgg cgatcgcaac aggtttatca ttagcggtga 2881 tcagcaaatt ttgtgtagat agagtgcgag tttgtccatc cagagttttg agagtttccc 2941 ctgcgttggc gaagcggaca ccaatcgcga gattttcacc tcctgcaaca gattgtaaac 3001 gcttagcgtc aaaagcatgc agcggctgtc cccattccaa catcacatag ttagtaatat 3061 ccacgatatt attgatgggt cgcactccag cagcacgcaa gcgctgttgc agccactcag 3121 gtgatggtgc aatttttacc tggtcaatca ccgtgccaat gtatgctggg caagcttgtg 3181 tatcagcgat ttttaaatta agattaaacc caccttgaga aatcgaaact tcacctactt 3241 ggggaagcgt tagctttgcc cctgtcaagg ctgtgacttc ccgcgcgata ccaaccatac 3301 ataaagcatc agcacggtta gcagtcgcgg tgacatctaa aattacatca tctaaaccta 3361 acagtggacg tacatcgcta ccaagtgcca agttctcttg aggaaaaata tgaattccgt 3421 ctacatcatt gggcaaaccc agttccttga gggaacaaat catcccctgt gatggaactc 3481 ctcgtagttt cgcgggtttg atttttaaat cgatgtttgg taagtaggtg cctacagttg 3541 ctacaggcac gtaaatatct gcccggacat tggaagcacc acaaacgata tttaaagttt 3601 catccgtacc gatatcgact tggcaaacac ttaacttgtc ggcgttggga tgcggttgtc 
3661 gttccagcac tttccccaca acaacgcctg cagcccaagt gcggcgatcc tcaatatctt 3721 ctacctcaaa cccagcaagt gtcagcgttt ctgctagttc ttctgggcta agtttcaact 3781 ccactagttc ttgcaaccat ttgagtgaga tacgcatagt cttgtgttgc cttgtctgaa 3841 atctaagcta aatcataatg cacagactat tctagaacta ttccacgaag ttcctcacgt 3901 cgagttgcag cgagcaagat ttcatgaaaa ttttattatc agtaattatc cggatatgct 3961 attataggtt agtatccaag cttgatacta acagtaaata ttccaaaacc tatagaaaaa 4021 tcataaaggg ttgactatct ttttagaaag ataagccaaa gacataattc aagaaacttt 4081 agttaagcta aggcgcgatt attgagtcta taaaagcagg ccatttcaag tttcattcct 4141 gatatgcagc tgtatcactt gttggacggt aactagactg tcgttttttg actatacaaa 4201 tatggtgggc ttcggtgtct atttcttgca acttggatac aatgcccgct taatcactaa 4261 atcgctaagc caaatacgta aatacaattt tgagtaaatc tgcacgagta aaaggcttag 4321 ttaaatatcc agaggctcct accaatcttg ctttgactcg gtctacaagt cccttgtttc 4381 ctgttaccat aatgatggga gtttttttga acattgaatt atttcggaca attcggcata 4441 attcataacc atccatccca tccatattta aatccagtaa aatcaggtct ggtttgtacc 4501 taataattga catcaccgct ttcagtgggt cattgattgt cacgacagaa aaattttcat 4561 cttctaaaaa gcggctaatt tctttaagga tggtgggact atcatctaca gaaataattt 4621 tgtagacttt tcgtggagtg acagtatcag aagttcctgc ttcatgtttc agattgttgt 4681 tattggagat tgttgatttc tgaatctctt tcttctcagg agaaaccgga gtttgtcgct 4741 gttgagtcgc gacatgactc gcagaaatag cagagttatc cgaatatgtg gtgcgagttg 4801 accctacttc tgaatcaacc agtttagtat gagttgatcc cgtaacagaa cgagaagctt 4861 gtataaaatc tgtaaatgtc tttggtaatt tatcaaaggg tggatctggt tcatgcaata 4921 gtatagtgcc atttaaaata tgagggtata atgttttagc tagttgaact tcgtcttgat 4981 tcataatcac tgacaaatgt cgcaaactaa aacccttcat ccactcagtt aactgaggct 5041 caatccctga taaattcttt tgatgatctg tggttttcat taacaaatat ggacgttgat 5101 agggagaaga aatgtggggt cccaatgaat gccaattctg caacttttgt tggcaacgtt 5161 ccgcaatttt ttcgacatca agcttgcaaa tagttggaac tcgataatca gaatctttta 5221 atgtaaaagt cgcttgtttg gttaaaaaat atgattccag cacttctttt accaattctt 5281 ggatcaagac tgctgcttgt gtagaatgta aataccgttc gctcactaac cagcagatag 5341 
cttgatattc aggaggtatt gggggaggaa tcaattcctc tggattatca ctgtgatgaa 5401 cgactggttg agtgttggtg tctggttcaa acattaaacg taattgaaca cgaacctcac 5461 caacgatgct aggaacatga tggcttaggc gacgcaaatg acgttccaag cgatcaaaag 5521 gttcaactga atgggtagca tatgttattt ttccctgctc tagatatatt gaccaggaaa 5581 tagaactgct aagcacttgt aaacaagtac tctcagaaca attagataaa tgtctcaata 5641 agctgagagg acgtagtttg gtaaatgtgc caaaattatt catttctcta aagattttct 5701 gtaaaaaaag accgattgtt taaattttcg agtgatcggt aataatggta tctctctaac 5761 tttgagattt ctttatacag aaattttctc tgaatttggg ttttatacat atattacaag 5821 cttgaaaggg atattatagc tagtctgtta cattttttcg taacataact aacaaaggag 5881 atttttcaac ttttttaaac aaaaaagaac ttgcactcct caccaactca tgcgagtggg 5941 tggggagtga ggtgtcagct tcccgaccaa atctttgaaa aagtggttcc cagtcagtgg 6001 tagctcacag gggtcttttt agagtcaagt tagtcaacga gcaatcgtgg ggcttgacaa 6061 ataaccgaat caaggctcac ctgacttcag acaagataca tcccaatctc ccacttattg 6121 cattctgagt tgtagttgct gaattttgca ttctcgttca atgttaatta gttgagcatt 6181 ggtactagag gtgttgtgat atgagcctta aactcgtgcg ctcttaaccg ctagcaacat 6241 aattaagaaa caccatcccc ctgttcagag ttaaggcaag agagttaagc gtgtttcgta 6301 aggaatgatt tgataaactt taatgagtta ttttcatcta atatcctacc tttctacatc 6361 gaatctttaa atttctatgt cagcaaaggt actgacatat tgtatcgtat acgtattacg 6421 tataaaactc gttttgattg caaagaaggt catgctgact atttgttggc tcagctttac 6481 tccaatctgt ctcaaataaa attccacaaa aaaattgtca tgaatttagc agcggaattt 6541 ggttaaataa atagtaaact gtttgaactc aaataaattg gatatggttt tattggtgct 6601 ataaactata tactttgagg aaaaggaaag ctaagctttg gactttaaca taacacaaaa 6661 agttatgcgg gatacattaa aacttgatga agtagtggag tttgctgaaa atccagaacc 6721 tcgttgtcct tgtgtgcttc ttctagatac atctggctca atgcaaggtg tagcattaga 6781 ttcattaaat cagggtttac agagtttcaa agaagaatta ataaaaaatt ccctagcggc 6841 aaggagggtt gaagtagcag tagttacttt tgatagtcat gtcaacgtag tacaggactt 6901 tgtgactgct gaccaattca gtccgccgat gctgacagca cagggactga cgagtatggg 6961 tgctggaatt cataaaacct tggacatgat tcaagaacga aaagctcaat atcgtgccaa 7021 tggtattgct 
tactatcgtc cttgggtatt catgattact gatggtgagc cacaaggcga 7081 gtttgaagat gttgtagagc aagccacaag gcgactacaa gaagatgagg tgaggaagcg 7141 tgtggcattt ttcactgttg gagtggacaa tgcgaatatg gcgcgcttaa ctcaaatagc 7201 tgtacgtaca cccttaaaac tccaaggact caattttgtt gaaatgtttg tttggctatc 7261 agccagtatg tcagctgttt ctcattccaa agtagacgaa caagtggcac taccacctat 7321 tggttggggt tctgtttagt acaagatgtg acagaatgct caaacaagag cagtaactaa 7381 aagatcatct atgaacactt ctaaacaaag acctcattgg cgggtagtag ctgcatcggt 7441 atgtggaaca agtcatttga agaacaaaca attgtgtcag gatgctcacc attggcaaat 7501 tttgccagat aatatcttag ttgttgcggc agcagatgga gcaggtagtg caagcatggg 7561 taaggtggga gcaatgattg ctgtggaaac agcgatagaa aacatatcta tcaaaaaatt 7621 ctctcgacga acactagtcg atgatagtgc tgtgcgctca cttttgaccg aggcaataat 7681 tgctgccaag aaagcggtcg aggaagaggc tgtcgcttgc cagaagcagt cattggattt 7741 agcaagtacg ctgattattg cgttagcaac accagaattt gtggcagtgg cacaaatcgg 7801 ggatggtgtg gcagtggtaa gagattttca ggataacctc attgccctga ccatacctga 7861 cagtggtgaa tatatcaacg agacagtttt tttgacctca ccgacagccc tagatgctgt 7921 gcaactgaga ttatggcgtc aggcagtagc caatgttggt gtcctcacag atggactaca 7981 aatgcttgcc atgaatatgg ctgtgggagt tcctcataaa ccattctttc ttccactatt 8041 tgacttcgca gctaatgctg atgacaaaac agtggcaaaa gagcagttag tgaggttttt 8101 gcgttccgag cgaattacgc aacgtacaga tgatgacttg acactaataa ttgctgcttt 8161 aagtgattta taaactatgt cgcgaattcg tttgtctgga agtgtataaa ctcatagagt 8221 atctataaaa atatatcatg caggtactac gttgtcttcc caaccaacaa atcagtaatg 8281 tcagcctgac cgtaagttta ggacgaggcg gtgaggcatg catatacaca gtaccaacag 8341 atgctggttt agtggccaag gtttaccaca aaccaacacc tgcccaagct cgcaaactgg 8401 aggtcatgct tgcccacccg ccggacaacc caaccgccag cttggggcat atttccatag 8461 cttggccgat tgaattgttg cgatcgtcga acggagcccg tgaggtgata gggtttttga 8521 tgccgcgcat tcaaggaatg cgtccaatca tcgactttta caatcccaga actcgccgcc 8581 agcactgtcc tttattcaac tatcagtact taattcgcac tgctcgcaat ctcgcttcag 8641 cttttgctgc gttacacgag cgtggatact gtgtcggcga tgtgaacgag tccaatatcc 8701 tcgtcagtga catggcattg 
gtgacaatag tagacaccga ttcattccaa gtaaaagacc 8761 cagaaaatgg agttatttac cgctgtcatg ttgggaaagc agaatttacc ccaccagaac 8821 ttcacaacaa aacctttgct cagtgcgatc gcaattctac tcatgacttg tttgggttag 8881 gggtagtgat atttcaattg ttaatggaag ggacacaccc attttctggg atttatcaag 8941 gagcaggtga acctccacca tatgaagcac gtattgttag tggacatttt acctacagtc 9001 aaaagcgtcg cgtaccctac gttccaacac ccatcgcacc tccttgggat attcttcctc 9061 cgggtttaca ggaactgttt atacgttgtt ttgaggatgg tcacaacaac ccgcagatac 9121 gccccaatgc tcaaacttgg ctcacagcac tcgccgatgc tgaaaactcg ctcatcagct 9181 gcactgttaa cccccagcat cgctatagca accacttgga caaatgtcct tggtgcgaac 9241 gtacattgcg attgggtgga cgtgatccat tcccatcttt aaaggcgatc gcaaacaaag 9301 aacatttaca accgcgagta caaaagaaaa aacgccaaac tcaagttcct cgtacacccc 9361 aaccaaccat tccattcaac actcattatc gtagttcagg cttttcccaa aaaattcctt 9421 actataaacc tcagaaaaag cagaatttct accctgttgt tttctgcttg cttggtctcg 9481 ttggtgcttt ggggtatttg gatttaatgg ttaaattcac aaatcgcccc tttattgctc 9541 aaaattctta cgctcagcaa aatttgattt ctttgcaaca gaatcaaagt cacaaaaatc 9601 aaaattttgc tgattactac aagcaaggtc atgcttcata caaagtgaaa gattatcaac 9661 aagccattga aagtttcact caagcaatac aaaaagatcc aaaatacgca aaagcttatg 9721 taaatcgtgg aaatgcacac tataacctca aagagtatga agcagcactt gcagactaca 9781 atcaagcaat tggcatcaat tccacggaag taaaagcgta tgtcaatcga ggaaattccc 9841 gctatatgct tgccgaatat agcactgatc cagacaaaga atataatcta gcaattgcag 9901 actataacaa tgccctgcgt cttaatccga atgaagtaga agcatacatc agacgaggtg 9961 ttgttcgctc acaaatggct aaatatagtg gcgattctca acaggaatat aaaaaagcaa 10021 ttgctgactt caaccaagca ctaagcctga attcatcaaa agccgaagcc ttttttcagc 10081 gaggtttggt acgttatcaa gtagcccaat atagcagtga ttttgagcaa gaatataaac 10141 acgcaatcgc ggactttaat caggcattga gtatcaatcc caaactgtca aaagtctatt 10201 taaaacgagg tgttgtccgc tacgaacttg cacaatatgg aggtggcaaa tctagtcagt 10261 atcatcagca agcagttgat gatttgcaga aggctgccaa aatatctttg gaacaagaag 10321 atatggagaa ttaccaacaa gcattaagca gcatctgtgt tgtcgtcgaa aacaaatgcg 10381 atactttttt gcaaaccacg 
acgaatacat ccccaaaaag caactaagta cggcgctacc 10441 tcaccaaagt tctgatctga ggaaacctcg gttcagactt gtttcaaagt atgctctatg 10501 cctatggcta ttcccctgag ttttgcggag ggttgcgcca acaaagctgt ttatactatt 10561 gattattttt tgtgttctga atgatgtaac aagttattta ttactcttat gaaagagtag 10621 gactcggtta ttttatctgg tgttatcttg cgttttttaa aaacttctta agtgctctta 10681 tctgtagatt tcagatattt ataaagcttt ttcatttttt catacttaag ataggtttat 10741 ttatattgtc ttcatactta agtgtaaatt tacggctaaa ttatatactg agaaaattaa 10801 aaaacaactt tttctctact tagcccatga aaagttgatg actcatctac ctaaaaacct 10861 agagataagc ttatgaaagc agtgattttg gctggaggtc ttggtacacg tataagtgaa 10921 gaaacgagca tcaaacctaa gcctatggtg gaaattggtg gtaagcctat attatggcat 10981 attatgaaaa tttattccac ctacggcatt aatgaattta ttatctgctg tggttacaaa 11041 ggctacgtca ttaaggagta ttttgcgaac tacttcctac atatgtcaga cgtaaccttt 11101 gatatgcgat ttaaccagat gaacgtgcat gcaggtaagg cagaaccctg gcgtgtcacc 11161 ttagtagata caggtgacaa cacaatgacg ggtggacggt tgaagcgagt cagagagcat 11221 attggtaatg aaactttttg cttcacctac ggtgatggtg tcagtaatgt caatgttgag 11281 gagttaatta attttcacaa aaaacaaaat aatttagcaa caatgacagc agttcaacca 11341 ccaggacgct ttggtgctat tgttttagga caagaacaaa caaaaatcac tagttttcgg 11401 gaaaagccag aaggagatgg ggcctggatt aatggtggtt attttgtgtt agagccagaa 11461 gtcatcaatt tcattaatga tgattccact gtttgggagc agacaccatt agaaaagctg 11521 gcagaaatgg aacagctatc tgcttacaag cataatggct tttggcaacc aatggatact 11581 ttacgagata aaaattatct cgaagacttg tggaagaata ataaggctcc ttggaaggta 11641 tggtgagata aaagaaagag gtacattcat tcttcactca tcgttattga ttttttactt 11701 ttcattcttg attcatattt ctagattccg ggagctagag tactcatgta tgacttcgcg 11761 attataggtg gtggaattgt tggtctttcc accgctatgg ctttaggtaa acgctatccc 11821 aatgctcgta ttctagtttt agaaaaagag agcaattggg cttttcacca aacaggtaat 11881 aatagtggcg tcattcattc tggtatttac tacaagccgg gtagtttcaa agctaaattt 11941 tgccgtgacg gttgtcgttc aatggtggaa ttttgccaag aacatggaat tgagcatgaa 12001 gtttgcggta aggtcatagt tgctactgat gaaacagagt tacctcgcct agaaaatctt 12061 
tacacgcgag gtttggaaaa tggcatagac gttaagagaa tgactccaga ggaagtcaaa 12121 gaagttgaac ctcatgtgag tagcgttgga ggagttctag tttcttcaac tggtattgca 12181 aattacaagc aagtttgcca taaatatgct gaaatcatta aacagcaggg aggagaactg 12241 cgtctcaata ccaaggttga aaaaatagtt atcagtggaa aacatcaggt attagaaaca 12301 aaccgtggtc cttttgaaac tcgctttgtg atcaattgtg ctggattaca tagcgatcgc 12361 gttgccaaaa tgggcaaaac tgatccgaaa gcaaaaattg ttcctttccg gggagagtat 12421 tacgaactca caccagaaaa acgctatttg gtcaaaggct taatttaccc agttcccaat 12481 ccagactttc ctttcttggg tgtccatttt acgcgcatga ttgacaagag cgtacatgct 12541 ggaccaaatg cagtcttgag tcttaagcgc gaaggctata acaaaacaga ctttgacttg 12601 cgtgattttg cagaagtcat gacttatcct ggtttttgga aactcgcagc caaacacgct 12661 gacgaaggaa tccaagaaat tattcgctcc tttagtaaag cagctttcac tagaagtttg 12721 caaaaactca ttcccgaagt tcaacaagaa gatttagttc ccactcacgc gggtgttcgc 12781 gcccaagcac tgatgaatga tggcaaactg gtggatgact ttttgattgt tcaaggtcaa 12841 aactccgtcc atgtttgtaa tgcccattca ccagcagcaa catcttccat agaaattggc 12901 aaagcaattg ttgacaaaat tccccaacag ccacatctca aagctgtagt cactcaactc 12961 taaaatattt tgtttgcctg tgcttgctta agttaatagt gttaagaaaa agcacaggca 13021 ttgtggcatt tgtcaatttg cgagaacatc taatttaagg ttgtaattag ttgtcagggg 13081 aaaactaatg ttccttaact cagatgttca gccaatcgtt cttgtgtgcg ctgctgataa 13141 taactatgcc atgccactca ctgtcacagt tcgttcagca gttgccaatc tgaaaaagaa 13201 tcatcagata gcgttatatg ttcttgatgg aggtattacc gaagcgaaca aacgcagaat 13261 tagtaaatct tttaacaaag aacaggtcag tatttcatgg atacagccag ataatgcagt 13321 ctttgaaaac ttagtcttga ccagacattt aacactgact tgctattatc ggcttctgat 13381 taccgaatat ttaccaaaag agtttcacaa agccatttac ttagacaccg acatggtagt 13441 gacgggagat ttggcagagt tatgggcgat tgacatggga gataactatg cattagcagt 13501 tcaagacgat gttgaattgt acgtaggaat gtctgaaggt ttaaaaaact accgtgaggt 13561 ggggattagt ccggatgaga aatatttcaa ctccggactt ttagtgataa atcttgacaa 13621 gtggcgatca gaggatattg gcaaaaaagt ccttgaatac atcaaacaaa atagagaata 13681 cgtacgcaat gatcaggatg gattaaacgc agttcttgct gggaaatggg 
gagaacttca 13741 cccaaaatgg aatcaaatgc ccaaaataca tgaatattca tcctggaaag acagcccctt 13801 cacggaggat atttataacg aactgcagca caacccttgc atcattcact ttacgaactc 13861 tcccaaaccc tggtatgcag gattgcgaga agaatgtaaa catcctaaaa aacatttgtt 13921 cttccaatat cttgacatga cagactggtc aggatggcga gacaccattt ggagacgctt 13981 ttggagaaaa tttatgaaag taacatcgtt gactacttca aaactttaaa catgctgtgt 14041 tacggtgcat aacatataca caaaaggtga gtaaaggcaa gaatttctgt agtcatgaat 14101 ccatgatgat gaaagtgcct aaaccatcct acagagaatg atgcaccgta cttcaccaag 14161 gtaatttgtt taccacatga aattaaactt gaataggaat caaaaacaat gagaattcta 14221 gtcactggta ctgaaggtta tcttggtagt ttattacctc cgttgttaat tgaacgggga 14281 catgaagtta tcggagtaga tactggttat tacaaagtag gttggttgta caacggtact 14341 gagataacag ccaaaactct caataaagat attcgccaca tcaccccgga ggatgtgcaa 14401 ggtgttgatg cgatcgttca catggctgat ctatcaaatg atcccactgg acaacttgca 14461 ccgcatatca cttacgaaat taatcataaa ggctcagttc gtcttgccaa attggcaaaa 14521 gaagctggtg tgcgtcgctt tgtgtatatg tcctcatgta gtgtctatgg tgttgctact 14581 gaaggtgatg tcacagaaga atctcctgtc aatccccaaa ccgcctatgc agaatgtaag 14641 acaatggtag agcgagatgt caagccacta gcagatgatg acttctctcc tacctttatg 14701 cggaacgcca ctgcttttgg tgcttccccc agaatgcgct ttgatattgt tttgaacaat 14761 ttggcaggtt tggcatggac aaccaaagaa attaagatga ctagtgatgg tacgccttgg 14821 cgtccattag ttcacgcact ggatatttgc aaagcaattg tttgcgcagt ggaagcacct 14881 cgtgatattg tacataacca agtttttaac gtgggagata cagcaaataa ttatcgagtc 14941 aaagaagttg ccgaaattgt tgctcaagtt ttcccagatt gtaaattgag ctttggcact 15001 caaggtgcag acaatcgcag ttatcgcgtc tcttttgaaa aaattaatac cgttctaccc 15061 ggctttaagt gtgattggaa tgctcaacga ggagcacaac agttgtatga tttgttctcg 15121 caaattgata tgactgaaga agttttctta tctagaggct tcactcgctt aaagcaatta 15181 gaatatctca tccgtacaca acaaattgac aaagatttct tctggagtca gaagtaatta 15241 tcttgaattg ataagcaaaa aaagtgattt caataattac gaattaaaaa aaaatgattc 15301 ggaattgttg aaatcatcaa tgtatcatag tatggtaatg aaataaggca aaaaaataag 15361 atattcaaat gaattaaata gaaaataaat 
atgtatgttt taaaacttta actgtttttg 15421 tgaggagttt tgagaagcta agattagctg ttatagcagt agtatgatta tcatacctga 15481 tttctgtata ttcctgaagc cttaagtctt cttctcattt ttattccttc gcttattcaa 15541 atcaaaagtc ataataacat atgcaaaatc cacaacaact caatccacat gaaacacaac 15601 ctgatgtaca gtccaccatg aataaactac tcactatcgc tattccgact tacaaccgcg 15661 cccaactact tgataagcaa ctagcatggg ttgctcaagc gatcattggt ttggaatctg 15721 aatgtgaaat ttttgtttca gataattgtt caactgataa tactcaagag gtgattaaaa 15781 agtggcaaac aaatctcagc cacatcacat ttaaatctag ccgaaatgca gaaaatattg 15841 gtgtcatgcg gaatatcatt cattgcttaa agtccgcaac gaccaaatat gtttggacaa 15901 ttggtgatga tgatcccatt caggatagag cagttgctta tgtcatcaac aaactcaaaa 15961 aatatgaaga tttatcatta ttgttcctta acttttctgg tcgtaatcaa aaaactgggg 16021 aacctgtaca cccacccaca atagtaggta accgttggtt tgatgttgat agtgaagatg 16081 gaagcggtga cccaaaagct atctttgaac attgtttttc aaaaagtgtt ggcgcagtca 16141 tctttctcac ggcttcgatc tatcgtactg acttagtaca acgcgctctt caaatttggt 16201 cagaagctga gaataactgg atatcattag catacttggc tgggtattgc gctgctaatg 16261 gcaggataat tgttactaaa gatatctata tggaatgtat tgttggtgtg agttattggc 16321 aaaaagatcc acaatcggca ctattaatgc aatacaagca cttaccggaa gttgtcacga 16381 aattggagga aaacggatac tctagacaat tttaccgtag gatgatgttc cagagtttca 16441 aagaagctaa tttgaaagtt ttcttaggtg ctttgagaag atggcctatg ttcactatta 16501 aaacaatagt tccctttttg actttagtcg gtctgtctgt ttttgacgca gttcctacta 16561 aagaattcaa aatggcacaa acaacggaac cattcactca aaattgacga aaataagact 16621 tgtgagccag cgcggtcttg ggggtttccc ccatgagcga ctggcgaacc ccgaaggggt 16681 aatcaatcaa ttgaacaagg tagttattct atgtttaaca gtactatcaa caaaatatct 16741 cagttgacat ctgagttagg ttacagagca gcactcctaa accatgcaag gaaattgcct 16801 gctttggaag aaagcgatcg cgtgattgtc gataccctca aacgcgatgg agtttatgtg 16861 acaacactcg aaaagttggg acttggctct acatctgctc ttctcaaagc atcttacaat 16921 caactatcta gaatgacaga tgcaagcaat agccacctca caaaaagact gccgcaaatt 16981 tacaccgtta cagatttacc agaattttct cagtggggac gggaacaaaa gctacttaat 17041 atcatcgaga 
attacatcgg tcttcctgtt gcctttcagg gcgtacattt acgtaaagat 17101 tttccgaacg aagatcagtt tggcacactg ttatggcata aagactcaga agaccgtcgg 17161 atgctcaaaa tgatcattta tttgtccgat gtagaacaaa aacacggtcc ttttgaatat 17221 gttccggtct ctttgacttc tttatacagt ctcaattatt accggattta ctacaagctt 17281 tggcagtcag gttacttagg aatcactgat gaacagctta aggaagtcat cccagaagac 17341 aagtggaaat catgtccagg tccagcaggt actgtgattt ttacagatcc aaaagtcgct 17401 ttacaccacg gaacactacg gacagaagaa agaccagcac tattttttac ttacactgca 17461 aacccaccga agagaccaga actttgcacc cagtactggg atgatacttt tgcaaaacca 17521 gagtcctatc aagaagctga ttctgtgaca gctctaagat agtgcgattc ttcagttgaa 17581 gaatttcttt gtgttttagt ttcccttgaa gcccttttgg tgagcagttg tctgtcttcg 17641 ggttaccctc ggtgcataac gatcgctgct tgacccctac acctctacag ggcttcaaaa 17701 aaaagttgca aaaaacacga agaaaaatcc gaaatgaccc agtgccactg tgtagttaag 17761 atttcaagcg aggttaccca tgatttttgt agaaactgaa ctgaaagacg cgtacatcat 17821 tgagctagaa caaaagcagg atcatcgtgg tttctttgcc cgtactttct gcgctcaaga 17881 atttgaggca catggtttaa agccaacagt tgcccaatgc aatctatctt ttaactacaa 17941 aaaagggacg ctgcggggca tgcattatca aactctacca gcagcagaaa caaaattagt 18001 ccggtgtacc caaggcgcta tctatgacgt cattattgat atgcgtcctg agtctccaac 18061 ttaccttcaa tatatcggtg tggaattaac tgcggaaaat catcgcgcct tatatgttcc 18121 agaaatgttc gctcacggct atcaaacact aacagactct gctgaggtcg cgtatcaggt 18181 aggagagttt tacacagctg gatatgagcg ggggttgcgc tacgatgatc catttttcaa 18241 tattcaatgg ccattggaag taactgacat ttctgaaaaa gataaaaatt ggcctttgat 18301 gaaaatgatg agcgttggag ggaatgttta gttgttagtg gtgatcgaaa aaacgactaa 18361 ttaacaatca acaactaagc aattcgcaaa tattattcat aggagattaa ccaatgatta 18421 tcgtagatcg cgccttacaa gctcgtgctg aagcaggtaa ccccatcaag gttgggatga 18481 ttggtgctgg gtttatggga cggggaattg ccaatcagat taaaaattcc gttcctggta 18541 tggagttggt tgctatctcc aaccgaagtg ttgatgcagc taagcgagca tattcggaag 18601 caggtattga agatagcaaa gtcgtttcta ccgtcaccga attagaagag gcgatcgccc 18661 gcaatcaata cgcagtcact gaagatccaa tgttactgtg tcgcgccgag gggatcgatg 
18721 ctctgattga agtgacaggc gcagtagaat ttggcgctca tgttgtgatg gaggcgatcg 18781 cccatcacaa gcacatcatt atgatgaacg ccgaattaga tggtaccatc ggttctattc 18841 tcaaagttta cgctgacaaa gcaggagtta tcctcaccgc ttgtgatggg gatcagccag 18901 gagtgcaaat gaatctctac cgcttcgtca agagtatcgg cttaactcca ctgttgtgtg 18961 gtaacatcaa aggtttacaa gatccctatc gcaatccaac aacacaggaa gcatttgcta 19021 agcgttgggg tcaaaaagct catatggtca ccagctttgc cgatggaaca aagatatcct 19081 ttgagcaagc aatagttgct aacgccacag gtatgacagt tgccaagcgg ggaatgctgg 19141 gatacgactt taacggtcat gtcgatgaaa tgaccaaaat gtacgatgtc gaacaactca 19201 aagaactggg cggtatcgtt gattacgtcg ttggtgcaaa accaggtcca ggggtctttg 19261 tctttggaac ccacgacgat cccaaacaac gtcactacct caacttatac aagttaggtg 19321 aaggtccgct ttacagcttt tacactcctt atcacctctg tcactttgag gtaccattgt 19381 ctgttgctcg cgctgtcctt ttccaagatt acgtcttaag tcctttaggt ggtcctctgg 19441 tagatgtgat caccactgcc aaaatcgacc tcaaggctgg agaaaccttg gatggcattg 19501 gctactacat gacatatggg caatgtgaaa attccaacat cgttcaagag caaaacctct 19561 taccaatcgg tcttgcagaa ggatgtcgcc tgaagcgaga tattccaaaa gaccaagtcc 19621 tgacctacga cgatgtagaa ctccctgaag gtagactgtg cgacaaactg cgagctgagc 19681 aaaacgctta tttcaccaaa tccaaaactt tagcagcagt ttaggtaatt tcgttaaccg 19741 aagttaactc atacttaagt tatggtgttg acttcggtag caaaaatagc atacgttggg 19801 tagtaaagaa tctctagtct taagcaaaac ttatgcatta gtattagaag cttttgccca 19861 aatgtctgat ttccatttaa ctatttgtgg atcgattagt caagaagaag attttgtgaa 19921 agctttttat aaggaacttt atcagagtta ttgcaatacc ctatttggat cttctggtat 19981 aacctgttgc catttaagtt ctttttgcaa tacttagctc acgatcaccg actgtgtatc 20041 agttctgttc atcaacaaaa aaagctatta gccaataaaa atctcaagtt ctcaatttac 20101 ccaatgggtt actcatttct ttctttcttc tctccttacg aaatatttat taaaaattaa 20161 cacttcttaa tcacgtttta gtatcttgag aacaagtagg ggcttgaatc agatttgcat 20221 ttagagtctg tgaactgtta actgtcggtg cagtattcgg agcaaaccta tcaccacttg 20281 tagatagttg tatggagtca atacattaag gattcatatg aaaattgcct tggtgcatga 20341 ttatttaacc cagaaaggag gggcagagcg cgttttcgag 
ttactctgta agcgctaccc 20401 ccaagcagat gtttttactt ccttatacga ccccgaaaaa accattgaca ttggtgagcg 20461 cattgtcaac acaaccttcc tgcaaaatat tccaggagca gcgaagtatt ttaggttgat 20521 ggctccctta tattttccag cctttcgcgc cttagatttg caagactacg accttattat 20581 cagcagtagc acaagctttg ccaaagcagt gcgaaagaac caaaagagcc gccatatttg 20641 cttttgtcat aacattaccc gttttttgtg ggacacagaa acttatttac gtgagtacgg 20701 agattatcga tattttgctc cattaatcga ccaagttttt gaaatgatga gaaaggtaga 20761 ccttgcctat gcacaggaac ctgaccttta cattgctaac tccagtattg tagctcgtcg 20821 gattaaaagt acttacggca aagaagccat tgttattaac tatccgattg acaccagtaa 20881 atttgttttt tcagatacaa aagaagaatt ttatcttgcc tcggctcgga tgatcagcta 20941 caagcgtctt gatataatag tcgaagcttt taactggctg ggatggcggt tgctgatatc 21001 aggaaatggt ccagaaaggg agaggttaaa gtctaaagca ttatccaata ttgagttttt 21061 aggacacgta actgatatac aacgcactca gttgttttct aaagcaaagt ctgttatagt 21121 tgcagcccta gaagattatg gtttagttcc agtagaggcc aatgctagtg gaacacctgt 21181 catcgctttt ggagcgggtg gggtattaga tactcaaata cctggtaaaa caggagtctt 21241 ttttcagaaa caaacacccg aatcactcaa tcgtgcacta ctagaagcca gggaaatcta 21301 ttgggattac aacaacatcc gtaatcatgc agtggcaaac ttttccgagg aagctttctt 21361 taacaaagtt gagcaagtta ttgagcaagc ttgtactgtg taaatacatt attcatttaa 21421 tttgttagct gagggatagt aaaagtggtt cagagcagtc ttagtccata cggaaatcca 21481 gttactgagt cagaaccggg ttacggacaa ctatttgcgg ttttaatacg gagatttcct 21541 tggtttttgg cagcatttgt ggcttctgtt gccgttgcag gtgtgatgac aaaaaaaaca 21601 gctcctacct atagaagctc aatgcagctg ttggtagaac ctaactatca aggaaagaaa 21661 gaaggtggtg gtgtagacag tcagtttacc gaacctactg ttcaggtaga cgctgcaacc 21721 caactgagct tgatgcaaag ttctcctctg cttaaaaaag cagttactga actcaagtcc 21781 aaatatccag acatgaccgt aggtcagtta aaaagttctt tggttttaag tcaaattaaa 21841 aacaaagacg ataatgttgc tactaaaata tttcaggtgc aatatactga gaaagaccca 21901 gtaaaaacac ataaggctct gcaagcaatt cagaaagttt atttagctta caacattcag 21961 cagcagaatg agcgtttaaa aaaagggctt aaagttatta gacaacagtt agacgaagca 22021 agaaatgatg tcaagaaggc 
tgatggtgag ttacgagact ttcgcacagg gaagaatatt 22081 acagggaaga atcttataga tccagagaca caggcaaaag ctacacaaga cgaattgacc 22141 aggattgtgc aagagcgtgg cacagctcgt tctctgtata aagaagctga agctacgtac 22201 aaaaatatac aaaagcaact tcaaagtaca ccacagaatg cccttgtcga agctcgtctg 22261 agtcagtcta ctcgttatca aggactactg aacgaaattc aaaaaacaga actggctcta 22321 gcacaagaac gcttgcgctt tacagaggaa gctccaagtg tgcagaaatt ggcaggtcaa 22381 cttcaaagtc aaaaggcact attgcaggaa gagcaaagac gaaccttagg ggcagagtct 22441 gctcaagcaa ttagccaagc accgtctctc ctacaacaag gacagaagag cgcaattgac 22501 ctcaaccttg ctgcgaagtt agtggacaca caaacaacaa tgctgtcttt aagcgcaaaa 22561 gacgaagttc tggcgaagaa agagcaagaa ctacgtgagc aactcacaaa atttcccaag 22621 ttgttagctg agtacggtcg cctacagccg caagtacaac tgagccggga aagattgcag 22681 gaacttttga aagcagaaca gcgattacgg caagaaattg ccaagggagg atttaactgg 22741 gaaatcgtgg aagaacccca agaaggtata cgccagggtc ccaacgaaca gcagaatcta 22801 atgttaggtg cagtggttgg gttgatgtta ggaggtattg ctgcgtttgt ccgtgaagcg 22861 gctgatgatt cagttcacac cacagctgag ttggagaggc aggttgcttt accgatattg 22921 ggaacaactc ccaagttacc acctgctaaa accagagaat cagttatcaa gttaccattt 22981 ggtaagccag atgtccctgc cccttggaca attcaggtat tgcaatcttc accgcgttgg 23041 gaatcgctag atctgattta taaaaacatt gaacttttga actctgttgc ttcgttcaaa 23101 tctttgatga ttacctcagc tttatcagat gacggtaagt caggtctggc attgggctta 23161 gcaatgagcg cagctcgttt acataaacgg gtactgctga ttgatgccaa cctacgtgaa 23221 cccagcctgc acaaacagct aaatcttcct aacgaacagg ggctttcaac tctattagca 23281 agtgaggtca caatacctaa tcagattagt atccaatcct caggctcatc ctatatcgac 23341 attttgaccg caggaccaac acctgctgat ccagctaatc tgctaagttc ccctcggatg 23401 cagcaattaa tggcaacatt tgagaataac tatgatttgg tacttgtaga ttcaacccca 23461 gttcttggct tagttgatgc gatgcttacg gcctcatctt gtcgtggcgt ggttatggta 23521 gctagcatag gtagagtgac tcgaactcaa ctcacacaag ctacagccat gttgagccgg 23581 ttaaacttac ttggagttgt ggcaaatggg gtctcaaact ctagtagtac atttgtgcca 23641 tatacacatc aacaacgatt tgcactgcaa caagctgtag agaaataaaa ctacgccaca 23701 
gcttggtgca aggcgcacac tgcctgaata tgcctggctt gactaacgag aacgaacgca 23761 caaccaaaaa tataaataaa gcggcacatg ctacctagaa aagcatatgc cgtatttttt 23821 cttagtagcg gcaggagttt aaccctccta ctcctcgtca caccgatggt atgaagaggt 23881 ttcttagaaa gctgtgggct ttctgcttca atttttttga gaatttccgg cgtctttaga 23941 cgcggagatt cttgggcttc agttgccatt aagtagaccg actggaaaaa aaaccgagcc 24001 agcacgccac ttacgtctgt cgagaaaggt caatgcagtg gtgggtctaa tatttaacta 24061 gttttgtccc aaaatctctc gtacaggtag tcaagtccca gagagtacag gacttgcgct 24121 ggcttggctt ttttctgtct atgtgtttag tcacttttgg tataagtagc tcaacttaat 24181 tatctgtttt tgttcgcgca gcgtctccgt atctcctctg gaggcgccct cgttgcaaag 24241 cgacacgctg aatgcagcgg aaggtttttc cgtaagcgca aagcgtgcca ctaggcatac 24301 gcgagtgcgt gtccaaagga cataggttgc tggatcttta cacaggtttt acccaacccg 24361 acagtaattg tggttaaaat agtcaaattc tcattttctc ttcccacaac aacagcacgc 24421 ttggagtgca ttgtggcagc gcaagttacc aactgtctcg tgccttcttt gacgtaagta 24481 attaagcaga tttggtatta attaaaagac aaaatgttaa ttgtttattg tgtattatga 24541 tatttaaagg cagagtatat aactatcttt tcccaaagag cacaatcaaa gtctatcgta 24601 catcaaacta cttacggaca agagaaataa ttttgtctta atacgtcttg ttctattgat 24661 aaaataaaag ttaataaatc atatatgctc cttttactca taataagttt attagagaag 24721 atatcaaaag atttgggcga atataattcc ctactagaca aatgaactcc gtccgcgcgg 24781 aaagcaagaa aaatcatgac tttcaatgcc gtacatgcgg cttttgagtg tatcgggacg 24841 aatagaattc gctccgtaaa gtaaaaaatc aggcttgaag agatatttta ttagaagtat 24901 tggaactcaa atttatatca gcgtataatt tgagtgcttt gcactcaaat ttttgagttt 24961 gtgtcttttg attagttgtg atagctatgc ttttttttaa caagatttca aattacgtac 25021 ggtgacaagg aagggtctta tttcgtaatt acacaagcta tcttattcat aagaacttaa 25081 taattatcgg aaaaacgtcc ttgctcttta aagactaaat aaagatctct atatgtagcg 25141 gttcgtctta tatgctagat attaaatccc cctaaaaaac acaagagtga agtttctgag 25201 attatctgaa aactactatt gttgtatgat atacatttct ggtaaagaat aattaaccaa 25261 gatgtgcaat tgttaatagg gataattgtg ttatcttttt gcactcgatt actccatgtt 25321 ttggagtaaa gagcaagatc catctaggac tctttggcaa tagccgcctt 
aaatattagg 25381 tcagcaatct actatagtat gagcgaagat acatatgaca acctcaataa taccaacatt 25441 acagagtaac tatacagtga agcagcagca acaagataat ccctctcaat actgcgcact 25501 tcgatggtgt ctgggtcagt tgctggtcat ccctcctgga gggataacac aaccctatat 25561 gccttcactg gatagaaaag aattgttagt ggagtgtctg aagcgttctc ctgcaaattt 25621 ggtacgcata gatccaagac ttggtgaaac aaggttaaga ttttgggcag atgcgtgcgc 25681 tcaagccaac aagcccatat tcctgcgcat accctcgatt gataaacagc cgaaactact 25741 agactcaaca ttgtggtggt taaagcgttt tactgactgg ctcactgcct tgatttttct 25801 gcttgctata agtccaatca tgttagggtt ggttttgtta atgcgcattt cctccccaga 25861 accgagacta cttttttctt gccaatggca tgttggagaa cgaggaaaag tgttcaaggt 25921 tttcaagttt cgcacaacta cagccagcaa aaaagcaatg ggagataagg gtatcacata 25981 tccagaggat ttgtgcgatg gagaagatag tcagaatctg acaaaactag gacgatggat 26041 gcgtaagtac ggactggatc atctgccgca gttattgaat gtactacgtg gtgaaatgag 26101 tttgactgga cctcgttgct ggactttgga agatgcagta cgactcagcc cagaagcaca 26161 acgacagctc aacagattac caggaatgat gaggtcatgg gagattgaag cagagtcgaa 26221 ccttttacat ctggatagtt caactctgtg atttgttgtt cgttacctca tggcttgaac 26281 acccttgagg tctttcacca actataagta ttggttccac ccttggaaag tgactttcgc 26341 tctgtttatt acgacaaaac aactactgga acttacagta gtaagtaaaa ttgccgcacg 26401 tcagtcaacg cgttgcggct ttttgtaaga ccataatatt tagagcgctc tatcttccct 26461 gcaaaactac ggttgaatct cactcatgca tgtcaaacta tccaaaccag tccaaaattt 26521 acttaagaat tcagtcaaga ctactaagtt ctggcaggac aactacctta tattgcgaga 26581 atttaaacac tttcgccgaa tcacagtttt tgctttggtg ttttcatttg tggcggcagt 26641 ttttgagggc gttaatgtcg gtcttttgtc tgcctttctg caaaccttaa ccactcccga 26701 tatcccgttt aagactcaaa ttgattggtt caatacttct gtattgggag tgaacgcatc 26761 tgctcctgag cggctatatc gcgtctcggc attgatttta tttagcactt ggatacgttc 26821 ggggttaaat tatttagcac aactttacac cgaaataact caacttaatt tagttgatag 26881 actccgtaaa caaatttttg agaaactaca gtctgtaaaa attagttatt ttagcaagac 26941 taagtctggc gaactgataa atactataac aacggaaata gaaaggctta aacaagcttt 27001 cggatcggca gctttcttgt ttgcaagaac 
actcactgct gtcacgtact tagtatcaat 27061 gttcgtgtta acttggcagt tatctatcat ttcattgatg ctgttcagtt tgttagcagt 27121 agccttatct acattgaatg gtcgcgtgcg ggaggcaagt tttgaagtaa cgaaagccag 27181 caataggttt acttcaatag ccgtagaatt catcaatggg attcgtacag ttcaggcgtt 27241 ttccactgaa aattttgaac gtcaacgtta ttataatgcc agcttgaatc ttttcacggc 27301 ttcaaagaac attgccttgg tatggatggt tgtcaagcca cttgcagagg cgttagcgac 27361 cacagtactc gtggcaataa ttgttctggc agttacaggt gttttaacta acggagcacc 27421 acaagtaggt tctctattaa cgtttttctt cgttctcttt cgcctcgtgc caattattca 27481 agatattaat ggtgtagcgg cgcatatcag tacgatgtat ggctcgtcgg aagtcgttaa 27541 aaaacttttg gcaattgatg aggaacagta tttccgcaat ggacatatcg aatttcctgg 27601 tttaaaacgt tcaatcgagt tcgtgtctgt tgattttggc tacgactccg aaagtttggt 27661 gcttagtaac attaggctta tgattgaacg aggcagaatg acagcattgg tgggcgcttc 27721 cggagctggt aaaacaacat tggcggattt gattccgcga ttttacgatc caacccgggg 27781 tcatgttttt attgatgggg ttgatttgcg ggatattgac attacctcac tgcgccgaaa 27841 aattgctgtg gtcagccaag atacttttat tttcaacact tctgtccgta acaatattgg 27901 ctacggctca gaaggagcaa cagatgagga aatttacaaa gcagctcgac tggcaaatgc 27961 tatggaattt gtccaagaaa tgcctgaagg tttcaacact cagcttggag acagaggcgt 28021 aagactttct ggtggtcagc gtcagcgaat tgcaattgct cgtgctttgc tgcggaatcc 28081 ggaaattctg attttggatg aagcaacaag tgctcttgat actgtgtcgg agaagttgat 28141 tcaagagtct atagaaaagc tttctgtagg acgaacggtg attgtgattg ctcaccgctt 28201 atcgacaata gttaaagcag acaaagtcgt cgtactcgaa cagggacaga ttgtagaaca 28261 gggcggatat caggagttgc ttgagctaaa aggtaagctt tggaaatatc accagatgca 28321 gcatcaacta aattaaggat caatagcaat atctaataaa gcctaaacgt aaacgaaact 28381 atgaaaaata tggcttgaga aaattacctt agatagtgag tcctatgaaa gttactctca 28441 catgtaatac ttgatgaggt attggttgaa ggtgacgtca attgatttct agtaccacct 28501 acagatgaaa cagctattcc tgaaaaatta ggttttttag ctgctaacga atcgatgcac 28561 caagcaagaa atgggtaaag cagctagaaa gactccagga aacgaatagc caggattttt 28621 ctagcaacta gaacaaatat ggataagtca aaaattagca aaaaattaaa aatattttgt 28681 gagtaaaaaa 
tagaattgtg ggaacttgaa actaaaatgg aggagcaatg cccaacctaa 28741 tcacagctac acaggagaga gcagaaaaat tgaaccaacc cgagtatgat gagaatttga 28801 tatatcactg ctctagtttc agagttaatg gcaatggagg agctgaaact tatttaactt 28861 ctctcattca atctcggcaa tctggtgtga gtgacttcgt tattaaatct ctcaaagaac 28921 ttgaccagag tcggttcaaa ttgctgcaca tccacagtcc agatttatta gagcaagtaa 28981 aaggagaatg tcccactgtc ttcactgttc acaatcactc attgtactgt gctagtggca 29041 caaaatattt agcagcacag gacgtcattt gcgatcgcaa tttctcttat ttaggatgcc 29101 tctggggcaa aataatagat ggctgtggaa gccgcaaacc cgcaagagtg attcaagaat 29161 taaaaagtac tcatcatctc aaccatttta tcagaaatct gaaagttact tttttggcta 29221 acagtgacta tgttcgggaa cagttgatta aaaatggctt gccacctcaa caaactgtaa 29281 cactacgctg tggtattacc gtaccgcaaa tagcgactgc acctctaagt ttagaaacat 29341 acaaaatagg gcgaatatta tttgtagggc gaatagtacc tgataaagga ctagaatggc 29401 tgctcaaaac cttagtacac acagattcgc aaattcatct tgatatcgcg ggtgaaggtt 29461 gggaacgacc acggttggaa aaattagctc aaaaactcgg gttaaacaac cgtatcactt 29521 ggcatggttg gtgtgatagt aacaaattaa atcaacttta cgaacagtgt tttgctgtta 29581 tcttccccag tgtttggccc gaaccagctg gtcttgtcac tctcgaagct tatgctcatt 29641 accgacctgt cattgccagc gcagtcggag gtattccaga atatttacga gatggagaaa 29701 caggtattct tgtaccagct aataatatta agatgctggc acaagcgata actcatttgt 29761 cttctgatta tcaaaaatgc cgacagatgg gcgaacaagg tcatgctttg ctcatgcaag 29821 aattcacaat ggatgtccac gtcaaacatt tacaaaaaat ttatgaaaac acaatatcag 29881 aatttgccag tcaaaaaatt tagttagata atatatgaaa gctaaatagg cttccttaat 29941 gccaaaccaa gatagagagt agtaatttta tagtaacatg gaaggatttt tataaaatgt 30001 cacacgttac tagctcccaa gagcctctag tcagcgttat tatcccaact tataatagac 30061 cagactatct caagcaagca attactagtg ctgttaaaca aacttatcga aatattgaaa 30121 ttattgtttc agataattgc actccagaaa gttcccaaga ggtagttaaa tcttttggtg 30181 attcacgcat cagaccaaaa agcccccaag aggtagctga atcttttggt gattcacgca 30241 tcagaccaaa aagtccccaa aaggaaagcc tccaaaagga aagtccccaa aaggaaagtc 30301 ctcaaaaggt agttgaatct tttggtgatt cacgcatccg atttttcaga caggcaaaaa 
30361 atgtcgggat gtttgccaat cagatgaatg ccttcaagat ggcgcaaggc aaatacgttg 30421 ccagtctcca tgatgacgat atgtggaatg aggactttct ggaaaaactg gttccagcct 30481 tagaagaaaa cccagatatc attctcgctt tctgcgacca atatattata gataaagatg 30541 gcaacattga taatgttggt actgaaggaa attccaaagc atacaaacga actagcttaa 30601 aaaaaggagt tcatcaacct tttattgaaa ttgcagtcat agataagtct gtaccaatag 30661 ccgcagcttg tgtgattcgc aaagaatttg ttgactggga taagattcca caggaagtag 30721 gcggtatgtg ggatttatat ttatcatatc tatgttcccg ttccggtcat ggagcttact 30781 actatccaga aaaattgaca cggtatcgcg cccatgaaca aacagacaca aatcgcagtg 30841 gtagtctaga tgctcaggca aaaattcgca aaggaacagc agaaatcttt tgctacgagc 30901 aatttatggg agatgaaatt ctcaagaaat acaacctgta ttttaagcaa aaatggttag 30961 aagctcatac aactttagct ataggtttgc tacgctctga aaagacaacc caagcacgtc 31021 cttatctttg gcgggcgctt tctgaagaaa agtttaattt aagaactata gcagcattga 31081 ctttgagttt cattccacga ccaatagcaa atcaatttaa aaggctcaac gcaactaaaa 31141 atagaatttt taagtcagaa atgttaagtc agaaatgaac tgaaaaggaa ggtgatgcgt 31201 gaaactctgt attgtgactc acatggtaaa aaagggtgat ggacaaggac gagtgaatta 31261 tgaggttgca aaagaagcca ttcgtcaggg tcatcacttg actttattgg caagtgaaat 31321 agcaccagaa ttgcaacaaa gtagccaagt caactggatt tctattcccg tcaacgggtt 31381 tccaaccgaa ttttttcgta atttcatatt ttctcaaaaa agtgcagatt ggttgcggaa 31441 acatcgctcc caagtcgatt tggtgaaagt gaacggagca attacatcag cagcatctga 31501 tgtgaatgct gtacattttg tccacagttc ttggctgcga tcgcccgtac atatttcccg 31561 tctccgcaaa gatttgtatg gtttgtacca aaagctgtac acagcattca atgcgcgttg 31621 ggaaaaacag gctttccaaa aagcaaagat tgtggtgtcg gtatccgaaa aagttgccca 31681 agaattggaa agtattggtg taccccgttc tcgaattcga gtgattgtca atggagttga 31741 tttagaagag ttttctcctg gtacggtatc tcgtcaaaca ctcggtttgc cggaaaatgt 31801 gactttggcg ctgtttgcag gggacattcg cacacccagg aaaaacttag atactgtact 31861 gcgtgccttg gtgaaagtac caaatttaca tcttgcagtt gtgggaagtc ctgaaggtag 31921 ccccttccca caaatggcag aatccctagg gttgaatgaa cgagtacatt ttctgggata 31981 tcggcgtgat ataccccaga ttatgcgtgc tgtagattta 
ttcgtgtttc cctctcgtta 32041 cgaagcatgt accctagtct tattagaagc ccttgcttgt ggacttcccg tcattaccgc 32101 aacggcaaca ggaggtgcag aattagtcac accagaatgc ggggttgtct tatctgactc 32161 cgacgacgtt gaggctttgg cagaggtact gttgtctttg gtgaatgatc gcaacaaaat 32221 gcaacagatg ggtcaagctg cacgcactgt cgcagaacag catagctggg ctactatggc 32281 aaaaacttat gtggatttgt ttgaggagtt aagcaagaat gaggaacacc gttctcatac 32341 cgacttatcg ccgtcctcaa gacctattac gctgccttca ggcgctactt gagcaaacta 32401 aaccaccttt tcaggtgata gtggttgtac gggatataga tactgatact tggcagtttt 32461 tggaagaatt ccaagaaact acactgccac tggataccgt aaaagttaca gttgggggag 32521 tagttgcagc tcttaatgct ggactggaag cggttaaagg agatacggtt tcaattactg 32581 atgatgatgc agcacctcgc cctgattggt tagaacgcat cagtgctcat ttcacctcag 32641 acagtcgcat cggtggagtc ggcggacgcg attggataca tcaaggtgac aagctgttag 32701 atgactcgtg cgaggtggtc ggtcaactac agtggtttgg gcgagtcatt ggtaatcatc 32761 acttgggaat gggagaaccc cgcgaagtcg atgtcctcaa gggtgttaac atgagttttc 32821 gtaccagcgc gatcgccgga ttgcgctttg atgagcggat gcgaggtacg ggagcacagg 32881 tacactttga aatggcattt accctagctt taaagcgggc aggttggaag atgatttacg 32941 atcccgctgt agccgttgac cactatccag cacaacgttt tgatgaagac cagcgcaata 33001 gttttaatga aatagcttgg ataaatctag ttcataatga aacgcttgtg ttacttgagc 33061 atctgccacc tataaggcgt gtattttttt tactttgggc tatattagtt ggcacacggg 33121 atagtttggg gttcgtacaa tggcttaggc ttttgccccg cgaaggcaag cttgcggggc 33181 aaaagtggct agcttccatc cgtgggcgtt ggcagggatg gcaggcacat gttaatcctt 33241 gaaagggatt gatacagtag tttgctaaaa agctaattgt gacgagtggg cagggtttga 33301 caattcaaaa ttcaaaattc aaaattcaaa aaaattgttc tgaattgata attttgattt 33361 ttaaattttc tacagggtgt agtgtttccc tacaccctac actctctaga aacaccctac 33421 cacgctcctc ggggaaaacc cccgctctgg ttacactcac cccacttgcg ggactcgggg 33481 gaaatcccca agggcgcagt ggctccccta gaccctagtc cttacccgtt agtaaactaa 33541 aaaattgaat tagcatgagt tccagacaga ttctttataa tcctattgtc aaagaaaagt 33601 actcaccaga tgagcgatcg ctcctaggtt ggatagtcat cttcgccttt gtattgctga 33661 atctagcttg ctattttgca 
agtgctagta tggtgcgcct catcttcccc gtgacatctt 33721 ttatagtagc tatattttta tacttaagac atccggttct atacttgggc tttacctggt 33781 ggatatggtt tctatcagcg tttcttacgc gcttagttga ccttcgtgct ggttgggata 33841 taacgcgtca gatgctgatc gcaccatact tagttacatt tgtaactttt ggaacgacct 33901 tcagatacct tcccagcgcc tctaggcaag gaggcttacc ttttgttttg gctcttgtag 33961 gggtgtttta tggcttgctt gtgggactgg tatacaacgg acctgtcccc gtagcacgca 34021 gtcttctgga ctggctatcc cctgtccttt ttggtttcca tttatttatg tgctggcggg 34081 attatcctgc ctatcgtgac aaccttcagc gcacattctt gtggtgtgca ttaattacag 34141 gagcttacgg cgtatttcaa ttcgtggtag cacctgaatg ggatagattt tggctcatag 34201 aatcaaaaat gaataccagt tctggagaac cggtaccctt tggaatgcgc gtatggagca 34261 caatgcactc agtcggtcca tttggagctg tgatgcaagc tgctttgctg ttattgttca 34321 cagcccgcca aggaccttta aattttcccg cctcagcagt gggatatttg tcattcctac 34381 tcacccaggc acgtactaac tggggcggct ggttacttgg ggtggttatg atttttggtt 34441 cggtcaaagc acagattcaa atgcgcttga ttacgatcat attggcaatg gcattgtgtg 34501 tcgtgccatt agcaaccata gaaccaattt ccagtgttgt tgccggtcga ttggaaagtt 34561 tttccaatct tgaacaagat ggcagcttca aagatagatc tggaagctat gacagaaacc 34621 tgagtgttgc cctgtctaat gttttcggca atgggtttgg aaacacctgg aaagtcgatg 34681 aaaaaactgg tcaaatagta gtatttgtca tagacagcgg cattttagat atattcttca 34741 cgctgggctg ggttggaggt cttccttact tgggtggact cattctcatg attttcactg 34801 tatgcaaata taatgaggct cgcttcgata gttttgtcag tgctgcccgc gccattggta 34861 tatcttcctg ttctcagctc attatcggta gcggaatgct gagtatagca ggcatggttc 34921 tttggggatt ttttggtatg actatggcag cacataagta ctatcagcat caagaaatga 34981 gtgcagcgca atatgagatt ccacctcaat atcagcgtcc tgatctgagc gccccttcag 35041 gaaaggcatg aggtatactg cggggtgcat tatgagtggg atgaaaggtc gcagtaactt 35101 ttttcatgga ctttttttgg gtgtggctag cgagagcgat gcctaaaaac tcgaatttga 35161 cgttggtaaa caatgcacac ccgaaggtga ttgcaaggag tgcaagatat cagttgagct 35221 aacctttttg caaatacatt cgtatacaaa caaataagga ggtaacacgt gaaagttgtt 35281 gtaattatgc cgctagctga actacgaggc ggcggtgaaa tgatgctttg ggatttgatg 35341 
cagcagggac gtaacgcagg tgtcgagtgg ttggtaatat ttttggaaca tggtccaatg 35401 gtagaacagg taagagcgct cggtattgac acgcgagttg ttgaaagcgg tcgcctgcgc 35461 gaagttcacc gtttcatcgc tgctgttttt aggatagctt ccatcgcccg acgcgaacgt 35521 gcagatatga ttgtcaactg gatgtggatc acccatatct gtggaggact cgcagcaatg 35581 cttgcgggtt tgccttctgt gtggtaccaa ctggaagtgc cttatgacca gccttggtta 35641 gtgcggcttg ctactctcgt gccagcccgt gctattgtga cgctctcgaa agatggcaag 35701 gaggcacagg cgcggatttg gccccataga ccaacaccct tggtctatcc tggagtatca 35761 ctagaccgct tcaatggctc tactgtgcct tctcctgctg aagctcggcg caagctaggt 35821 ttgccattgc atggaccaat aataggaatt gtgggacgat tgcagcgctg gaagggaatc 35881 cacgttcttg tagaagcaat gcccaaagtt ttacaaaagt atcctgatgc ccattgtttg 35941 gtagtcgggg gcaagcataa tttagaaccg gattatgaag attttgtaaa agagaaaata 36001 acagctttgg aactccaaga taaagtcatc ttagctggac tacagagtaa cgtcccagag 36061 tggatgcaag cgatggatgt ctttgttcat gcctcggata acgaaccgtt tgggattgtg 36121 attatcgagg cgatggcatt aggcaagccc gtaattgctg gcaatggtgg gggaccaacg 36181 gagatcatca ccgacggcaa aaacggacag ctcacgcctt atggcgatgc aaacgcactg 36241 gcagatgcaa ttctgcgcta cctcaacgat caagaatttg cccacaatgc tgcaatcgcc 36301 gcacagcaac gcgcccttga tttctccaca gagcgttacg ttcaaaactt tatcaacaca 36361 cttcgctcag cagtgccaag tgtttcgtag gttcataagc gaatgtgcgt ttatattgta 36421 attttttcct caaggcagtc aaaagtcaat aagaactatt gacttttgac ctttgttgcg 36481 ttttagtgaa aagtttacca aacccaagaa taaacatttt gttgtgttgg gttgcgctac 36541 gcttaaacgc cagacttgcc gtgagagcgg aaagccctat gcctgcggca cggctgcgcc 36601 tatcggatat gcctacggca cgcctgacgg ctttagccca gagggcgtgc gctttgcgca 36661 tacgccagat gcctggctgt cgggaaaacg ccaggtgctc tacttgggga gaccccaaga 36721 ccgcactggc tcccctcccg cacttctgtt gtcgtcaccg tcattcgcgc tggctcaacg 36781 gacaggtgcc tcaagtcggg aaaccctttc ggcagtcgct catgggggga acccccaaga 36841 ccgcgctgcc tcaccccggc actgtcctcc ccaacctact ttatccttta ctgaaccgta 36901 ttgctttatg agtccccgtc cttcagaacg gggtttttat cccacggatg gaatccgggg 36961 ggattttttg cgttggtgcg ttcttcttac cctctaccat caacgcaagc 
aatccattgc 37021 ccttcatgga taaagtgagg gtcaatatgg tgcataccag caccgttcca accaacacca 37081 gtaggtttca gcactaagtt ttgctcgatt tctcgttctt gataacttgt cgttgtcaac 37141 tcagtaattt caaatgctct gagctgtgtg ccatagtcgg gctgacagtc ttgagtgtag 37201 cgaataatct tatcatttat cacgacaact ctaccgcctg gtctggcaat gtgtgcatta 37261 ccagtaataa ttggactttt tgggtgttct atccaagaac caagcaactc atcagcataa 37321 tagaggcgga gagtgtcaaa tttatgctgg ggatttgttt ccgtaaatag ccaccattta 37381 tcagcgtgac ggaaaataga agcgtctaaa aacgacgcac cactactgag attaccaaca 37441 aaactccatt ctgtaggaaa cttacttgct ttgtataatc ggatcgagtt tgcctgatgg 37501 gtttccggaa tcatgtagta ctcattcatc cactcaaaaa cgtagggata agataagtga 37561 aatggttcag caagtacaat ttgttcgtac ttccaattct ttgtatcctt gctggttgct 37621 aacccaattt ctcctctacg tgtctgttga tttaacacct caaaaaacat gaaccaggtg 37681 ctatcagctt taatcataaa agggtctgca acaaacccag ccctgacatc tgatacattc 37741 cggcgtgtca agacaggatt ttttatctct ttaggagcct ggaaatcaac aggggatttg 37801 ccagtataga ttccaataga ccatctttcc ttttttgtca gcgcacctaa tggcagtttg 37861 ttcgcgatag gaactgggaa actatttact acataaaagt gtagttcttt tataatcttt 37921 ttgaggttct tcatcagaaa aagttgtaca agatatattc accattctat gaattctgtt 37981 gttgagaata ttctatcaaa tttttcaaac cagattctat agaccactga ggtacaaaat 38041 ttaaaactga caacgcttta gatatatcag cgagtgagtg ttgaatatct ccatgacgag 38101 aaggtgcaaa gccaggttcc aattcccatt ttggaaaata atttctcata atattcacaa 38161 gctgaagaat tgaagttgtt ttacttgtgc caatattaca acttaatgcc aaaccaggtg 38221 ttagtggagt tgttaacgct ttggcaaaag cagttgctac gtctttcaca aagacaaaat 38281 ctcttgtctg actaccatcg ccataaatat tgatgggtaa gcctttaagc attgcatcaa 38341 caaatatgga aatcactcca gagtattgag aaccaggtac ttgtcgaggt ccatatacat 38401 tgaataagcg taagccaata aatgaaaagt tgtattgttt agcaaaaagg acagcgtatt 38461 gctcgcttac taatttttgc aaaccataag gagatatggg agcgggaggc tgatcttctg 38521 aaattgatag aggtgttttg ttaccatata cagcagcaga actagcaaag actagcttag 38581 gaatctttaa agcctgacaa agttcaatca cagctagagt tgcagataaa ttattatcat 38641 gtgcttccag tggctttaac caagattcgg 
taaccgatgg agtagctgct aaatgagcaa 38701 tgccatcaat ctgtgtggaa aaatcttctg ggtgacagtc taatatattt ttgtgcagaa 38761 attttaaccg ggaatgttca ggcaggtttt gtaaattacc tgtcgtcaag ttgtcaatca 38821 cagtgacata gtgaccctct gataacaact gttcagtaag gtgagaacct ataaagcctg 38881 ctccgccagt gacaataaaa tgcataattt catctatgaa aaacagtgta attttcattc 38941 gtaattcgtc atttaaaatt atgacttaaa tgacgaatta cgagtcatcg ataacagatt 39001 aatctgtcgt gtaatgtgtt atttatgaat tagcttagct tcagcatatt catacttatt 39061 gggcgtgtta tttggtctta actctgtata gacagctagt gtttctttat gaacgcgagc 39121 agcactgagc ctttctaaat tttgagatgc ttgagtcctc caccattgca ggcgatcgcg 39181 atcgctcaac aactgcctta aattttctgc caaggtatgg ctgtctttta ctggtacgag 39241 aatacctgct ttcccaccgt ccaaagcttc aggtatacca tctacattag aaccaatgat 39301 cgcgcaacca gcttctcgtg cttctggaat cacaagcggt gaagaatctt tgtgagaaac 39361 aagtacgaag atatcagttg cgagtaaata tctctttggt tctgtttgga aaccctcaaa 39421 atgaatgcgg gaagtcaaag gagtatttcg cgcttgttct tccaatgact cgcgatcagg 39481 accattacct accaagtata aatgggcatc gggaaaatcg gcagcaattt caacaaaagc 39541 atctattaat tccccgactc ccttacgccg atacatccct gatactgttg taatcgcagg 39601 atggtgtagg ggttgcggtt cgtattcttt gatgctgcgc tggcggggac tgcctaatgt 39661 gccattggat ataacccgca ttttttcttt tggtatacca cgccgtgcca tcgagtctcc 39721 cactgcttga ctgacgacaa tcactcggtc acccaagccc atcaatgtgg cactgcgctg 39781 gaattcgttg tgtactgtgg caattaaacc atattcataa ccgaacctga gagtcgccgc 39841 cagtataact cccgtcatca tatgtacgtg aacaatatct ggtttaaact catccacaat 39901 gttccgatat cgccataccg ctttgataat atttaacggt gttcttgact ggtctagatg 39961 aaagtgtctg atgccataag atttgagtaa tgtctcatat tctcctccag aggaggctac 40021 agcaacatta tgaccatctt ttgcttgcat acaagcaagg tctattgcca cattaacgat 40081 accgttacca atctcttgta cgtggtttgt aagatgtagt atacgcattt actatgagct 40141 ccacagtggt cgcttgttac aaatagctta tcttcattca gtatcaatag aagcaaccgc 40201 tgataaacgc gggatcttaa aaaacaggca gcagtttaaa attctgcttt gtcaaaaatt 40261 tgacaatatc ccaaaatctc ttaacaaagc tacgaggtag acattcaact tgagaacagt 40321 aaagggcgat 
agcctatccc tacggcatag gctatcgcga acactgaacg cgtgcgtcgc 40381 ggctcttttt tcataaaggt gcaaaaaaat acttaagaca ctgccattat ctcttatttt 40441 ttatactttc ttattcaaga tattttcatt ttttgtaaaa acattataaa tttactgctt 40501 aaatgttatc taagttacaa atcttgagca taccgatttc gccaaaagaa tgcgacagat 40561 gcgtgccaag agttcacgac tagccccaat atttgaaaaa aaacaggcaa agcctgtcat 40621 aattgcttcg ttattcattc ccaaacacag ttagggaatg agtgccaaaa actatcatac 40681 acttagcgcg gtaactgata cataatcacc tttcctttca aatcttcctc aacaaatact 40741 atgtactctc catttgatcg gcgaaaagca cggataccgt aaggtatatc aatccagcca 40801 ctttcaccag ccacttctgg accaggtttt aacttttgta cttgctttcc tgtcgctgct 40861 ttgtaaacat atacctctga tgtttttcct gtaacggcaa atacataatc tccggcgaca 40921 ctcatcgctg ctgtcatcac ttcgcgcttt ccagttgtgt cataagggat cacggttcgc 40981 caccggggag tgcggtttcc tttactccaa ttgtcataac gtataatttc agaaccaacg 41041 actccagaat catctccaat cggaggatgg tctttagtaa agcctgataa atacatggtg 41101 tctgtgtcag ggaaatactc aatccttcgc aaatctgtaa tttggttagg agttgtctcc 41161 tttttcatag aactgctgga gtaaatcggg ttaccctttt gatctagccc ctgtacaaca 41221 aaatgcagga taccaacttt cgttcgcaga gttttccata agtcgccttt gctatctacc 41281 caccatcctc caaggtaagg ataatctagg gttctagtat catattcacc ctgatcaaaa 41341 gcaccgttac cgttgcgatc gcgccaaatc caatcctttt cttcctgagg ttgattagac 41401 agccaacttt gaacgggctt accatgattc atggatgtag cagaaaacat agcagatggt 41461 attgcaatat aaccatctgt aactggatta aaacgataaa cttccataaa gctagcatac 41521 atatctgtga gaaacataaa aggtttcccc tggatgcgcc ggaagaaaac tgaagttggc 41581 gatgtgtgga tccgtggatc ttgggggtat ttaaaaggat ttaaggtgta agccttgtaa 41641 ctccattgct tacctgatgg ttttttatag tccatccgga agtattcatg cttggtgtat 41701 acttctgtgc catcactttg aggatctgta tctgcattat ccacaaagtg caaacccagt 41761 aatcgccatt gtaactttcc tgatgacgaa aattttctta agtcggttcc tgaattattg 41821 aaaccattac tatttacata gatattacct gctgcatctg tacccactcc agtaataccg 41881 taaaatttta aatctccaac ttcaccagga acaccagcat aaacgccccc ctcgactccg 41941 aaagttcctg ctcgtgttgg ctttgctgta atgttatata tcaagacttg ctgacgaggt 
42001 ccattttctg cgaccaaaag tctgccttga ttgtcgagtg cgatcgcact tgcatcaaca 42061 acatcagcaa tgacctcagg caaacgcttt ccttgacttg aataatgtaa aatctgagca 42121 gggctactgc cattcttact ttgtataatc cacagatttc cctgcttatc aactgttatc 42181 tgttttggat tgggaacaga aaaattccgg acttccgtca ttgtgtcagt gttgtagaca 42241 ctaatgcggt ttgcctcagc aataccgaca tataactctt tccctacaat tgctaatcct 42301 gtgactgggc tttttgtact gacaatcagc atacttttgt catagccccg ccctcctgca 42361 aaacccgcag gctttccaga caaagtatag cgtctgacac agtaccaagc tgttttttcc 42421 ggtgggtaat cctctggtat cttgcccatt gatccctgaa ccatagcaag gtaaatatat 42481 ttatcagtga ctgttatggc tttcccgcct acacgactcc aaccatgaag ctcaaagtca 42541 ggaccaataa catcaccatc tttatatatt cctgcttctc ttccagcttc atcccaagtg 42601 ctgtcggtgt atactgtgcc atcagaaccg acatacattg cctctacata attttgcacc 42661 catttcttac caccataagt attacctatc caagatgttg tgtaggcttt tgtggcaggt 42721 acgactctgg ctgtaaagta caatgtaata ctcattgcag tcatcagaca gagaaaaaac 42781 ttgaaaaatt gtctgaatcg taaataacta ctatttcgtt ggctacgata tagtccaccc 42841 agatgtgaaa tgaatttatt tttcataaat taacaaaatg cttgatgact tctcaaatat 42901 catttgatat cgtttgacaa tgaggctgcg tagaatagga aatcatttgc atcgtaatta 42961 ccctgacagc gcttttctgt cattatctac atttaagagt gttccacgtc tggttttaca 43021 taaactttat cagaatcaca ctattttttt aatttcccac atttgtttgg ggcaacacat 43081 attttcaatt agactggcaa tatagcaatc tttgaacaga gaactcttaa cagggaactc 43141 ttaacaggga actcttaaga agcaataaag gtgtactaag ttttttttaa aaatcaaata 43201 taggaatccg atttgatttg agcaacaaat ctaagtatct gtagtcttat tgcgtagcat 43261 aacgcactat agcctgatgc tcatggtgag ccagaggcat aacgcaccct actacgtaca 43321 tttcaaaaat caaatatgag tcctatatct acacaaaata tgactgggaa atacgtataa 43381 tatttaactt ttgatttgct gaaaaaagta caggttattg ctcatttata atactagata 43441 tgcaccaaat ttaccgagca gcagttaacc ccaacctcat ttagactgac taacgcaacc 43501 aggttgtgac acaagcatca acccataata cactgtgtaa ctttctatta cacaaaaatc 43561 aattaaaaat tttgcgtaaa aactccccac ctcattcgta gaagggtttg aagtcatacg 43621 gggacgaaat catcatggat agacaaaaat ttgtcctttc 
ctagtctgag atcttgcacc 43681 attat // LOCUS NODE_537_length_43363_cov_5.22353843363 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 43363) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 43363) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..43363 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..775 /locus_tag="DP116_03800" CDS <1..775 /locus_tag="DP116_03800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320484.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" /protein_id="PRJNA477356:DP116_03800" /translation="ALTGLGGGVVIVPLLTSIFGVDIRYAIGASLVSVIATSLGAAST YIKKGYTNLRLGMFLEVATTIGALIGALIATLISVKALTIVLAVVLLYSAYLSQQPKL NNLENESTDSLGAYLQLNSNYPTPEGLMSYQVHSVRSGFSVMLVAGVISGLLGIGSGA FKVLAMDQIMRVPFKVSTTTSNFMISVTAAASAGVYLARGYIDPGLSMPVMLGVLPGA FLGARVLIGANTQTLRIIFSVVLVVMAFKMVYSSLLGEL" gene 777..1331 /locus_tag="DP116_03805" CDS 777..1331 /locus_tag="DP116_03805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318629.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1634 domain-containing protein" /protein_id="PRJNA477356:DP116_03805" /translation="MNKISFSFWWNSSTPSESEVIKFQVQQKKSDSDIQQLAQHTSVI AEFPNDYEVDANKENTKTLSELQLEQLLSNVLKYGVIFASTVVLIGGILYLIRHGAER ADFQFFQGEPSHFRSPTGVAVAVLSGSRRGIIQLGLLLLVATPILRVVISLFIFLKQR ELTYVVITLIVLTALMYSLIGAYY" gene complement(1409..1669) /locus_tag="DP116_03810" CDS complement(1409..1669) /locus_tag="DP116_03810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746511.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03810" /translation="MTNSPLTNLLLIAGIVFLAIAVVGQARLGFAEINPGFFGRLLAL IIGLFSLTGAVVLVLFPVEIVDLIRNYLAQQIQQNLGLIIQL" gene 1996..2457 /locus_tag="DP116_03815" CDS 1996..2457 /locus_tag="DP116_03815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410969.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03815" /translation="MKNLPVITQESSLAGQTTPKQLLLAVLGDWKQNTFKKLRGGFFL VLGYLLSPLCWWNDLLFNLPIAYGFGYVCSLFNPKLLLPCSILGYWLSNIVGILLMQF GAMDILPNQSKERNLRKELLTGLVSSTLYTLVILALVQLKVLGAPIALFGS" gene complement(2643..2846) /locus_tag="DP116_03820" CDS complement(2643..2846) /locus_tag="DP116_03820" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03820" /translation="MVLKYFEKIVITELPDSCVETNNNVPIDSKYCMQLIVENGVAVP FMFADSLEEFLPYLKAMEKSTYV" gene complement(2928..3605) /locus_tag="DP116_03825" CDS complement(2928..3605) /locus_tag="DP116_03825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="PRJNA477356:DP116_03825" /translation="MNLNDLAANVLQQQRSTPDLIADALREAIVRGIFQEGQSLRQDE IATQFGVSRIPVREALRQLEAEGLVTLHLNRGATVSALSPTEAQEIFEIRSALEVKAI QLAIPKLTASDLEKATVILEKTDQTIDAGMLAKLNWEFHETLYKSAERPRLLTIIKTL HLNVDRYVRLQMSQMDYLERSQKEHYQLLEACQQHDTKAAVRLLKKHIDIAGEQLVTY LQQNIHS" gene 3704..5335 /locus_tag="DP116_03830" CDS 3704..5335 /locus_tag="DP116_03830" /EC_number="2.3.3.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314323.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2-isopropylmalate synthase" /protein_id="PRJNA477356:DP116_03830" /translation="MSIQPQPDRVIIFDTTLRDGEQSPGATLLVEEKLAIAHQLSLLG VDVIEAGFAVASPGDFEAVKTIAQQVGVLGGPIICSLARAIRQDIQAAAEALKPAAHP RIHTFISTSDIHLKYQLKKSRSEVLAIAQEMVTYAKSFVEDVEFSPMDASRTDPEFLY QILERVIAAGATTINIPDTVGYCTPKDMAVLIQGVQENVPNIDQVILSVHTQNDLGLA TANALTAIENGVRQVECTVNGIGERAGNAALEEIVMALQVRKPHFNPYFGRPVDSDAP LTKIQTQEIYKTSCLVSQFTGIVVQPNKAIVGANAFAHESGIHQDGILKHRQTYEIME AADIGLAENSIILGKHSGRNAFRTRLQELGFELSEADLNKAFMRFKEIADKKKEMSDW DLEAIVRDETQIQVDKGFQLEHVQVTCGDCTCPTATITVVTPNGEISTDAAVGTGPVD AVYKAINRLVQIPNQLIEFSVQSVTGGIDALGTVTIRLRYQERIFSGQASDTDIVVAA AYAHMNALNRLYRFLQRETQKSHASGSASVVNFLI" gene 5738..6265 /locus_tag="DP116_03835" CDS 5738..6265 /locus_tag="DP116_03835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200208.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4440 domain-containing protein" /protein_id="PRJNA477356:DP116_03835" /translation="MFKFNKQLSALALLTMTSVGETMLTVAKASAQTQSYRIAASTTP QQLAVGPKERDIAALFDKWNAALQTGNPDEVVKLYAKDGVLLPTVSNKVRSTHSQMRD YFEHFLKYKPKGTILQQNVRIIDKLFAINSGVYSFNIIKNGKPGKVVARYSFVYRHDG NDWLIVDHHSSAMPE" gene complement(6635..6829) /locus_tag="DP116_03840" CDS complement(6635..6829) /locus_tag="DP116_03840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03840" /translation="MKMTTNSCPCCGSSLLRHVRHTGVYWFCQSCWQEVPPLHIYQTD SLTHRRLQKRPVSSQFILTR" gene complement(6868..7956) /locus_tag="DP116_03845" CDS complement(6868..7956) /locus_tag="DP116_03845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747002.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_03845" /translation="MKIAQIAPLWERVPPSTYGGIELVVSHLTDELVRRGHDVTLFAS GDSQTLARLEAIHPRALRLDPSVKESIMYEMLETSQVYQQAEEFDIIHSHVGVSTLQL ASISSTPTVHTLHGAFTNESSRVYTLHATQRYISISDAQRQPDLNYLSTVYNGIKIED YPFVAQPKEPPYLAFLGRFSPEKGPQHAIAIAKQAGWRLKMAGKVDVVDKEFFEKEIA PFIDGEQIQFLGEVDHAAKVELLSNASITLFPITWREPFGLVMIESMATGTPVIGINM GSVPEVIAHGKTGFVCASYEQMAEMIPAALELNRQTCREDVENRFTVSQMVDGYEAVY EQILKERTPKSGLLHALRNSLESIAFVK" gene 9004..9789 /locus_tag="DP116_03850" CDS 9004..9789 /locus_tag="DP116_03850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197088.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HAD family hydrolase" /protein_id="PRJNA477356:DP116_03850" /translation="MTPSTPTILALDFDGVICDGLIEYFEVAWRTYCQIWLPEQETPE NNLASKFYQLRPVIETGWEMPVLVKALVEEIPEETILQEWTKIAQELLLKDNLKATDI GHQLDKIRDEWIATDLDGWLSLHRFYPGIVEKIKATVDSTTKLYVITTKEGRFAQQLL KQGGVDLPREIILGKEVKRPKHEILRELIQTTNTLPERVWFVEDRLKTLQLVQQQPGL EGVKLFLADWGYNTPTEKVTAQNDPGIQLLSLPQFPKDFSQWL" gene complement(9875..10225) /locus_tag="DP116_03855" CDS complement(9875..10225) /locus_tag="DP116_03855" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03855" /translation="MEQKNYGGKNYQIQAESIGHVGDIYQSPQVTAEELLHKGIQLLN QRAYRQAIDVLSDATKTNPSISDTHYYLAIALLSGEKPRKIDVWTIQSIESELNTAVG SAKKFISWLKAQVS" gene complement(10392..10838) /locus_tag="DP116_03860" CDS complement(10392..10838) /locus_tag="DP116_03860" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03860" /translation="MTNPIVALSSAEILKLAFNEFIKSSAGEAAKKLTTEALSKANEL RHKIVSRFKDRQNVKAEKAITAVQEQHSVEALHKLTTYLDDEMDEEPSFADDLRQLAQ QIINIQKISQEQLQFGEMKQLNRDNAKGFQVQANRIDRIGDDYTKK" gene 11233..12423 /locus_tag="DP116_03865" CDS 11233..12423 /locus_tag="DP116_03865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998005.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nuclease" /protein_id="PRJNA477356:DP116_03865" /translation="MLDLTKLTRQMQGLSQHLTQEAAASRQRLELAQKHFQNALGCQD DLVQRQEKWRDRILFANATPLEPLNTCIDIPVPPKIHTVIATDGSQISPSHHEIAYCY LLNIGRVILHYGQNRYPLLDSLPEIFYRPEDLYFSRKWGIRTEEWMSFCRTASEATVL AELACSVKEDRGTEEVPTLAMVDGSLIYWFLEQLPLEARDRILPPILEAWQKLRDAQI PLMGYVSASRNVEGMNFLRLLACPHTVPDCASFCPNQLEKVPCKVFEPLRDTTLLSTL LKPGQRGCLWRSNVRILDLYQDQQIYFCYVHVGTEIARIEVPSWVAQNTTMFDQALGL MLAQVQKGYGYPVAIAEAHNQAVVRGGDKARFFAILERQMIKAGIKNVGTSYKEARKR GSIA" gene complement(12595..13602) /locus_tag="DP116_03870" CDS complement(12595..13602) /locus_tag="DP116_03870" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03870" /translation="MDNKPLELQAEKLIEHKLIKYGLLVTKPSFDKEGTDLLIIKDIS TKITPFIKVQCKGRTLKTNSSVIIKKKYVNENFVLFLYVEEENSKDDFLYVFFESDIK NWRIEKESFDLAIPRNFQTKEEFKKRVFNKEAIMKIENILLRQAVNQLIKTSNFIIID GIFLEKVAKETQRFYQKLYPEKTFQKLSIDAIVDQLLTYTHIERKNEVNCYLIYSEHF NLEYLVDIGEITKHGFFMGDMPESVGCNYNLYKQKTADIVSFKVEELLNRIINVDNIL LVADDFAYVPYLQDLKERGVEIVIFQCSENSGSRMYHQFKWANVIYPLGLAMGLKEYE L" gene complement(13688..14113) /locus_tag="DP116_03875" CDS complement(13688..14113) /locus_tag="DP116_03875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877789.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" /protein_id="PRJNA477356:DP116_03875" /translation="MTYHPLILADSGFLFAYYSARDKHHQQVRRFFERCTSNLVTTPI CIAEVMWLLSSDWRTQNEFLLDVAKGLYECEQLLPQDFSRIAELNATYRDLPGDFADL SLIVISERLDIAAIATLDSDFDIYRRYRKKPFERVFLPE" gene complement(14110..14376) /locus_tag="DP116_03880" CDS complement(14110..14376) /locus_tag="DP116_03880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03880" /translation="MTKAMLTVRLDEETERQLADILAHEKTDKSELIRRLIAERWLNL QAGRTLVDRRGGHPEHLLQNAPPDLSERANRKKAIAEYLKERHS" gene complement(14420..14935) /locus_tag="DP116_03885" CDS complement(14420..14935) /locus_tag="DP116_03885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161936.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_03885" /translation="MSLVFETQRLRLKPILESELNTLHSIFINPYVKKYLCDDKVLSL QQVEEMLIESQKLFDEKRFGLWFIETKDEKEIIGFVGLWYFFGEGQPQLVYALLPEAT KKGYATEAATRILKYCFNELGYQYLLASCDQPNLESRKVAERIGMREVEEKIVNGNPI LFFRVEKQLLL" gene 15266..16903 /locus_tag="DP116_03890" CDS 15266..16903 /locus_tag="DP116_03890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317798.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NCS2 family permease" /protein_id="PRJNA477356:DP116_03890" /translation="MEISTVKAPRWFVRRDIDGLFGLFLDNLIQILLIVNLCQGVLGF PPSLVYGRILPGIALSLIVGNFYYGWLAYQKGQREQRDDITALPYGINTVSLFAYVFL VMLPVRLTAIAQGTSSEQAAELAWQAGLVACLGSGLIELVGAWVGNPLRRLAPRAAML STLGGIAITFIAIGFLFRTFANPVVGLVPLGVILITYFGQVRFAIPGGLLAVLLGIGL AWGTGLVSWDNAKFATALQPIGVYIPRLWLGDLWNSRAVLLDYFSIILPMGLFNLVGS LQNLESAEAAGDVYPATPSLAANGIGTLVAAICGSCFPTTIYIGHPGWKALGARVGYS ILNGIFMGLLCLTGTVAMLAYFVPIEAGMAIVLWIGIVIVAQSFTATPSHHAPAVVVG LLPGIAGWGALIAKNALRAAGLGTPEKPLTPALIEQFKLSDTYIDGAFALEQGFIFSA MILAGITVYIIERDFRKAGYWSIAAAFLSWFGLMHSYRWTVADTVVNLGFGTGTPWAV GYILLAILFFYTEWQARHQKGNQKVVQSPSQDTFDRN" gene 17411..18166 /locus_tag="DP116_03895" CDS 17411..18166 /locus_tag="DP116_03895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318568.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03895" /translation="MKLSHLSKNLWASAFALGLATLSFTLPVSAQSGSSGSGGTSGGG TSTGTTGTTGTGTGTTGTGTTGTTGTGTTGTTGTGTGTGTTGTGTGTGTTGTGTGTGT GTTGTGTGTTGTTGTGTDTGTGTTGTGTGTTGTGTDTTGGTTGSGTTNGTGTTGTGTD TTGGTTGSGTTNGTGTTGTGTDTTGGTTGTGTTTDTTNNQGIREVRSERHSDWGWLGL LGLIGLTGLIPKRSQPRVIRDPNEVTRPGSTKL" gene 18394..21019 /locus_tag="DP116_03900" /pseudo CDS 18394..21019 /locus_tag="DP116_03900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317238.1" /note="main replicative polymerase; these cyanobacterial enzymes contain C-terminal split inteins; they are joined with the C-terminal fragments that contain N-terminal inteins that are encoded elsewhere in the genome; frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="trans-splicing intein-formed DNA polymerase III subunit alpha N-terminal partner DnaE-N" assembly_gap 19554..19563 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 21388..21765 /locus_tag="DP116_03905" CDS 21388..21765 /locus_tag="DP116_03905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317218.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03905" /translation="MCAFSNWLGKRLQWLCCLAALCLVCLVMGWSLPTNAANHISQQL EQQNVLLTHEKQLGELMYSDIAKNFLAVEKFQRAHTLRSSLWEHLIRADSAIKKDIRL AQRLRPSGIPFFVMNGKNLSGGL" gene 22061..22804 /locus_tag="DP116_03910" CDS 22061..22804 /locus_tag="DP116_03910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317219.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CAP domain-containing protein" /protein_id="PRJNA477356:DP116_03910" /translation="MRCSGFSVFFPIALATSYSSQLLMSKEAVAEDAQNVLPTEVQEA YSTGAIQANDLSPLEQQVINEMNKIRTNPKAYIPILENYKQRFQGKRVKIYNQRFMLT HEGLSAVDEAIRFLQSASPVAPLTISRGMSLGAKDLVKDNGSRGSTGHLGSDGSNPST RMERYGNWQSSAGENISYGPSTAEDIVIQLIVDDGVPNRGHRTNIFNPTFRVAGVAYG IHARYKTMCVINYAGEYQEAISVSSRGRR" gene 23264..23956 /locus_tag="DP116_03915" CDS 23264..23956 /locus_tag="DP116_03915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312839.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CAP domain-containing protein" /protein_id="PRJNA477356:DP116_03915" /translation="MRQMVFWIFTLVTLSTTSCDVLFEPDSSVPNIPEGNSDVLVTGK TLSRLEQQVIVEMNKARTNPTAYAAVLKNYRQRFEGNRVKISRHVYLQTQEGVKAVDE AIAFLKSVTPVGSLTASKGMSRAARDHVKDQGSKGILGHKGSDKSDPFTRLNRYGTWK RTAGENISYGSHTAQDIVMQLIIDDGVPDRGHRINMFNPAFKVAGVAFGIHNTYRQMC VINYAGRYIEKS" gene 24064..24918 /locus_tag="DP116_03920" CDS 24064..24918 /locus_tag="DP116_03920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206108.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_03920" /translation="MNSRHSNTASSALHLNVDVQGQGFPILCLHGHPGSGSSLSVFTN HLSKRFQTLAPDLRGYGKSRCDGNFDMRDHLIDLEALLDRFRIKKCLILGWSLGGILA MELALQLPQRVTGLILVATAARPRGNHPPITWQDNVYTGVAALLSYIKPDWQWNIETF GKRSLFRYLIQQHTPTAYRYIAKDAVSAYLQTTAPATRALYSAIGAGYNRLTDLEQIQ CPCLVLAGACDRHITADASLETFRYLKDSQSHCYPNTAHLFPWEIPNQVLSDIDRWLE KNQKVVSV" gene complement(25186..25488) /locus_tag="DP116_03925" CDS complement(25186..25488) /locus_tag="DP116_03925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875328.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03925" /translation="MKSIWKNLKESLINIFDNFGLAWWVEIITQNPRCTYYFGPFISS ADAKAAIKGYVEDLELEGAQGIIVDVKRCKPSALTIADDLGERTDRKVQPVFSGQM" gene complement(25569..26546) /locus_tag="DP116_03930" CDS complement(25569..26546) /locus_tag="DP116_03930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875329.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="23S rRNA (guanosine(2251)-2'-O)-methyltransferase RlmB" /protein_id="PRJNA477356:DP116_03930" /translation="MIDKPRKIKTSGEHNSAQPIRMKGKRVISHLSPNPRVKGANGLR DSPRPHRNFSDSSISSEQSEDTDLIYGRHPVLSALENERDLNRIWITSRLRYDPRFHS LILRAKENGAIIDEVEPKRLDQITEQANHQGVAAQIAPYAYIDLDELIAKAKSETDPV IVVADGITDPHNLGAIIRTAEALGAQGMVIPQRRASGITSTVMKVAAGALENFLVVRA VNLQRALEELKAAGFWIYGTATEASEPMHTVKFNGPIVLVVGSEGEGLSILTQRHCDF LVSIPLQGKTPSLNASVAAGMALYEIYRHRWINTLYLDKLQKISLKKQE" gene complement(26640..27122) /locus_tag="DP116_03935" CDS complement(26640..27122) /locus_tag="DP116_03935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454167.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease III" /protein_id="PRJNA477356:DP116_03935" /translation="MTSKEEDLLDGSNEIPSQDSIWCQLLRATKEQLFQEISQAQLQQ ISPTALAYLGDAVCELYVRMFYLLPPKRPEAYHRLVVAQVRAETQAKMLQSLYPHLNN TELEIVRRGRNATTGRPRRVDLATYQQATSLETLIGYLYLTDFHRLSELLQKLDLEKP " gene complement(27566..27955) /locus_tag="DP116_03940" CDS complement(27566..27955) /locus_tag="DP116_03940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015227559.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="anti-anti-sigma factor" /protein_id="PRJNA477356:DP116_03940" /translation="MTYEEGIIAEPLTLTVSLRGTREVRDNCQLFRLTGLLDAFSEPT FTKVLGSKIEEGPKHIILDLSQIDFVDSSGLGALVQLAKRAQNSAGTLQIVSNARVTQ TVKLVRLEKFLALQPSVDVALENVKSS" gene complement(28185..29327) /locus_tag="DP116_03945" CDS complement(28185..29327) /locus_tag="DP116_03945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbamoyl phosphate synthase small subunit" /protein_id="PRJNA477356:DP116_03945" /translation="MSLFDAIPAIFVLADGTTYRGWSFGATRTTIGEVVFNTGMTGYQ EVLTDPSYCGQIVVFTYPELGNTGVNIEDEESSKPQVRGAIARNICTRPSNWRSTQSL PEYLEQHHIPGIYGIDTRALTRKIRMFGAMNGGISTEILDEAELLEQVQAAPSMKGLN LVHEVTTRDIYEWSEPTQPVWEFNPTCQENSKESFTVVAIDFGVKRNILRRLASYGCR VIIVPANTLPEDILKYNPDGIFLSNGPGDPASVTEGIKTTKALLETHKPMFGICMGHQ ILGHALGAETFKLKFGHRGLNQPAGLQQKIEITSQNHSFAIDPDTLPNSVVEISHLNL NDSTVAGLRHKSLPVFSVQYHPEASPGPHDADYLFEQFVQAMRAVR" gene 29654..30808 /locus_tag="DP116_03950" CDS 29654..30808 /locus_tag="DP116_03950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03950" /translation="MHQFILSRAALILVSTTLAVLTVACNGKQQVTVRSGEQQPVPKD QSVASPVTTMAPLSASPQPQATMQPTREISVSYFEQGLDKAVGALSISQSAKSTEDWN LVAIQLADAIALMRKVPVDSPDFTNAQAKILDYQPHLKDAIQKATRPVNPPRQAQPER IGVAIPQVPVTPTVTPAITKPSFTETPAAPAEKLQPPLPKLTPLAPPKQQEVLAPPTI KQQDEQQVYTVGIKRRMGGTPIIEVTFNGTQPFEMILDTGASGTVITQKMANALGVVQ VGKAKANTANSKAVEFPIGYVDSMEVAGAKVNHVAVAIAGADLETGLLGHDFFGDYDI TIKRDVVEFRPQLRSPIGRLPPKASQTNPTQETEFTAPNAPKELPSVTDP" gene 31166..>33011 /locus_tag="DP116_03955" CDS 31166..>33011 /locus_tag="DP116_03955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747868.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyclase" /protein_id="PRJNA477356:DP116_03955" /translation="MEAILEQTLKDNHFLLDTAKSYLLKEIVPQANEIDGDPNRLFQA LRGLGELDLLALKVPRHWGGKEVNEQTYGSFQELVARYSGALAFLQTQHQSAAGMLVA SSNSSLQERYLPRMGNGQVLLGVGFSQLRREGDPLSVATPVAGGYQLNGVVPWVTGWE FFSEFIIAATLPDGRAVFGIVPLLETHQDSGGTLTLSPPAQLAAMTSANTVTATLKNW FLPTESVVFIKPAGWIHENDKKNVLGATFLATGCALAGLDIVESVVSRKPLPFIQKAF DSLQQELNNCRNAVREAQNNSSLELVERLQLRAWAIDLAGRIAHAAVTVSSGAAIYSH HNAQRVYREALVFTVTGQTRAVMEATLGRLTRHGFENEPQRRRGRREGREGREGREEE KNITYSRVIHLSHIIDSDIPQWEGDPPVEFEAVAQLYKDGYYLRGFSMGEHSATHMNA PNSFRLDGMGIDEYSAESLVVPAVVIDIREQALVNPDYTLCVEDILTWEKRYGKISSG CVVLLYTGWQEKWLDKNAFFNQDEQGGMHFPGFGSDATRFLLEERQIAGVGIDTHGVD SGQDTTFATNRLVLEKPRIVLENLTNLDQLPPTGATLAIGVLRLRGGSG" assembly_gap 33012..33021 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(33204..33614) /locus_tag="DP116_03960" CDS complement(33204..33614) /locus_tag="DP116_03960" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03960" /translation="MVTGTLKLSDWNECSEEISQLIQSIRKRRQNLLNRNKISLDDFL KIGNLETDLATTKSLIGLKLIDEIVTDLKQPKERIEKVTKQLQSAIQDLEDSKKVLTI LTSFVNLVDAILNPVSGLVKIAGIVTQLDNLTIG" gene complement(33680..34765) /locus_tag="DP116_03965" CDS complement(33680..34765) /locus_tag="DP116_03965" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03965" /translation="MIPHIKPRRAIFALVPLALLLNSCGRDFPSVQNVGKMATTLELS SSNIADDIYNSCITRTKYIPFLAAKGSSDSFHSFLQRQDEQKRCDDLYKQKVTDVKKA NSVLVEYVKALGKLASDDTASFDKNFTALDESLKNLKFSQSNGQVFSFKGPDVDAGIN IAKFLTNAFTREFRREKLKKAILCTDKDIQTYTGATSSVKDSASNQPATGGLIALTQQ AYINGILTAEEEQIRTYFTDYIGGLTQVSETHTLDFITLEENYNKAMDSIRSRKDAAE NYVELLQKIATLHSNLKTEFQGKDQIDDAQLSNYCQDLYTTKADKAATKEKAVTYDQE ELKRVRKIVSDSERTLEPLIQKMDKGL" gene 34994..36490 /locus_tag="DP116_03970" CDS 34994..36490 /locus_tag="DP116_03970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129488.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase" /protein_id="PRJNA477356:DP116_03970" /translation="MTTPLICRNYIANQWVNAASLDTLESRNPADNREVVATFPRSAT VDVDTAVAAARKAYRSWRLVPAPARAEYIHKVGELLQKYKEELAQLMSREMGKPLTEA RGDVQEGIDCAFYNAGEGRRLFGQTTPSEMPNKFAMTVRMPVGVCALITPWNFPVAIP CWKAMPALVCGNTVILKPAEDTPACATKLIEIFAEAGFPEGVVNLVHGVGQEAGKALV EHPDVDLVSFTGSSETGAFVASTCGRTHKRVCLEMGGKNAQIVMEDADLELALEGAVW GAFGTTGQRCTATSRLILHRDIKEKFTAMLKERTSKLRLGAGIDPNIDIGPIINQKQL QRVNKYLDIAREEGAKVLIGGEIATEGELQHGYFFQPTILDQVTPNMRVACEEIFGPV VAVIEVSSFEEAIAILNNTPYGLSSSIYTRDVNRAFAAMRDIEAGITYINGPTIGAEV HLPFGGVKQTGNGHREAGSAVLDVFTEWKTVYVDFSGSLQRAQIDNRS" gene 36503..36712 /locus_tag="DP116_03975" CDS 36503..36712 /locus_tag="DP116_03975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317819.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-binding S4 domain-containing protein" /protein_id="PRJNA477356:DP116_03975" /translation="METSASTIKLDQFLKFVGIAPTGGQAKLLIQGGDVKVNNTVETR RGRKLVSGDKVTLGGQTFEVDLENL" gene 36997..40155 /locus_tag="DP116_03980" CDS 36997..40155 /locus_tag="DP116_03980" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03980" /translation="MSADFQKYLQSVCNAYQHWWRLYTITDVRGSKPAQSKSLPSLFD FGLMVETVQPQQQEKRETPEKTERLPVQEGLRKYSENHVLLVGRPGSGKSTALARLLL EEAGNSVIASEAKRNVDAIAASRRVAISTTTRIPILVELRQYNTSVLELIQNFLLRHD PNLTFNSETLKTWLRQGQFLLLLDGLNELPNDDARRDLKTFRQNYPNTPMIFTTRNLG VGGDLGIEKKLEMQPLTEEQMQQFVRAYLSEQGEEMLRRLGGRLREFGQTPLLLWMLC ELFRTTGNVPPNLGLVFRYFAQSYDGKIKDDVPVSGESRRWWQELLQHLAFAMMQGET RTELRVTIDRREAEAIFTKFLEGKVDYPPSRAKEWLEDLLKHHLIQVVSNNQIEFRHQ LLQEYYAAECLLQQLPNISDEKLKRDYLNYLKWTEPLALMLALVEKEKKEQAVQVVKL ALDVDLQLGARLAGEVKREFQEKTVGLILGLDVPELLKIKCLGITRSECAIAFLSKSL QHENFIVRGSAAYVLGEIGNQAAVSALIKALQDKDSFVRANTAYLLGKISNTAAIPAL IAILQDKDSFVRWRGVDALREIGNEAAVSALIKALQHEDPYVRKSTAEALGKIGNEAA ISALIKALQDKDSSVRENAAEALGKIGNELAVSALIKALQDKFSGVCWRAAYALEKIC NATAVSALIKALHDEYSLTRVRSAKALRETRNEAAVSALISALQDKSSFVRGSAAEAL AKKDNEAAVSALISALQDEDSYVRANAAYALGEISNEAAVSALTKALQDEVYSVRESA VYGLEKIGNEAAVSALIAALQDENYFVRWRGVHALGEIGNEAAASALIAALQDEYSYV RGSAAYALGKIGNEAAASALIAALHDEDLEVRVDAADALGKIGNEAAVSALISTLQHK DSDVRGSAAYRLGIIGNEAAISALIAALQDEGSHVRSQAADALGKIAGSKVLCQIWEL QLKTPSWDKSDAISKIQERCKFYNHEIFHSPPVEEETKTKSETSKSSTYIIQRVGNLN TGDVNIHGDQIGTQHNQLNNKD" gene 40160..40549 /locus_tag="DP116_03985" CDS 40160..40549 /locus_tag="DP116_03985" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_03985" /translation="MSETSKYQNSFHFNQAPGNVNTGDVTIQRDQVGIQHNYAPEQKQ NLAETAKEIQQLLNQLSVSYPTTTESQKQAIANQAIARIKHDNPTTWQRLRSATEAAL IEAFKEVLDNPFVNVTVAAVEGYRKAE" gene complement(40606..41655) /locus_tag="DP116_03990" CDS complement(40606..41655) /locus_tag="DP116_03990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869415.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA 3'-phosphate cyclase" /protein_id="PRJNA477356:DP116_03990" /translation="MIHIDGSYGEGGGQILRTSLSLAAITGNSIQIENIRAGRSKPGL AAQHLTSVRAAAAICNAKVRGDALGSMTLEFVPGNSVLAGSYIFDVSEAREGGSAGAV TLVLQTILLPLVLASGDSQVTLKGGTHVSYSPSVNYIQQVYLPTLQRMGVQVEVKLNA WGWYPQGGGEVELLVSGGSKLGGINLVERGDLQQVRGLAVVTELPSHIPQRIASRAEN VLREAQLKPSVQALRAKGVAPGAGLFLTAEYENTLAGFGALGRLGLAAEKVADMVCEE LLKFHQTGAPVDEHLGDQLLLPAALASTKSEYRVAEVSQHLTTNAWVIEKFGLARVMV DEVEKRVIVEPLGKN" gene complement(41778..43136) /locus_tag="DP116_03995" CDS complement(41778..43136) /locus_tag="DP116_03995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877770.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FAD-linked oxidase" /protein_id="PRJNA477356:DP116_03995" /translation="MTTTPVKSFDLDALTASLEGIEIITEPSQVAKLSQDYHTFSPVL VPKLEGKVGDIVVRPANEAEVLKVAATCVKHRVPVTVRGAGTGNYGQCVPLHGGVILD MTRMHEIRWVKPGIARVEAGVKLAALDKKAREIGWEMRMTPSTYRTATMGGFIAGGSG GIGSIQYGFLADRGNLLALQIVTMEDEPRVIELRGDDVQKVKHAWGINGIITEIEIPL GPAYPWAEVIVTFPEFMAAARFGQALGDADGMIKKEISIFASPIPDYFAAFREYIPDE THAALLILAEPSLELLPGLVEQYGGKITYQKTATDAGKGTHLGEFTWNHTTLHARSED TSITYLQSTFPADKNLQMVEHMYHHFGDEVMMHLEFTRVNGTVVPVGLQLVRYTSEER LNEIIHYHEEQGVFIANPHTYIIEDSGRKVIDPEQLKFKEMVDPYGLMNPGKSKVLEF KI" BASE COUNT 12302 a 8945 c 9237 g 12859 t 20 others ORIGIN 1 agcgctcact ggtttagggg gtggagttgt cattgtcccc ttattaactt caatctttgg 61 agttgatatc cgctacgcga ttggtgcttc cttggtgtct gtgattgcga cttcgttagg 121 tgccgcatct acatatataa aaaaaggcta taccaatctg cgattgggaa tgtttttaga 181 agtagcaacg acaatcggag ctttaatagg agctttgatt gccactttga tttctgtcaa 241 agcactaacc atcgtccttg ctgttgtttt actttattca gcttacctgt cacaacaacc 301 caaactcaac aaccttgaaa atgagtcaac tgattcctta ggagcttatt tacaacttaa 361 tagtaattac ccgactccag aaggactgat gtcatatcaa gttcattctg tccggagtgg 421 gtttagtgtg atgttagtcg caggagtcat ttctggacta cttggaattg gttcgggagc 481 gtttaaagtc ttggcgatgg atcaaatcat gcgtgtcccg 
ttcaaagttt ctacaaccac 541 cagcaatttt atgattagtg taactgcagc cgcatcagca ggagtttact tagcacgtgg 601 ttacatcgat ccaggactat caatgccagt catgttgggg gtattgcctg gtgctttttt 661 gggggcgcga gtcttaatag gagccaacac tcagacttta agaattattt tctctgttgt 721 gctggtagtg atggcattca agatggtcta tagcagtttg cttggggagc tttaaaatga 781 ataaaataag ttttagtttc tggtggaact catcgacacc atcagaaagc gaagttatca 841 aattccaagt gcagcagaaa aaatcagaca gtgatattca acagttagcc cagcacacca 901 gtgtgatagc agagtttcct aacgactatg aggttgatgc taacaaggaa aatacgaaaa 961 cactaagcga actgcaactt gagcagctgc ttagtaatgt gctaaagtac ggcgttatat 1021 ttgctagcac tgtcgtttta ataggcggca tattatattt gattcgccac ggtgctgaaa 1081 gagctgactt tcaatttttt cagggagaac catcccactt ccgttctcct acaggggttg 1141 cggttgcagt tttgtcaggt agtcgtcgtg gtattattca gcttggactg ctgctacttg 1201 ttgctactcc aattcttcgc gtggtcattt ctctgtttat ttttctaaaa cagcgagaat 1261 tgacatacgt tgttattact ttgatagttc tgacagcgct aatgtatagc cttataggag 1321 catattatta atttctaatt aattgttaga gtgcgttacc tcaaaagtaa cgcactcttt 1381 agctacagct tatctgttat gtgaaagctt aaagttggat aatcaaacca agattttgtt 1441 gaatttgctg tgccaaataa tttcttatca aatcaacgat ttctactgga aataatacta 1501 agacaacagc accagtcaag ctaaacaaac caataatgag agctaacaat cttccaaaaa 1561 aaccaggatt aatttcagca aaacccaatc tggcttgtcc gacaacagca atcgcaagga 1621 aaacaattcc tgcgattaac aacaaattag ttaaaggtga attagtcata tttttgataa 1681 aaagcattat cttaattata gctatcagaa ttctcagtca attgtgaaat ctctctcttt 1741 gtgcttcttc tcttgttctc tgcatttgat acagttaatt tactgattgt ttcacaaatt 1801 aaataaaaat gttacacgag caggtttgat tttttgtcta tacataaagt ataaattgat 1861 atgacttcaa cctccaagcc aagcgtggaa acgagaaaaa atagatcaat ctatgcttaa 1921 tgtttgattc cccactagac tttttcttag tttttcgttt aaagtacggt taggacatag 1981 aatttatgaa ttattatgaa aaatcttcca gtcatcactc aggagtcaag cttagcagga 2041 caaacaaccc caaagcagtt actgttggct gttcttgggg actggaaaca aaatacattt 2101 aagaaactta gaggcggatt ttttctagtt cttggatatt tgctgtcacc cctgtgttgg 2161 tggaatgatc tgttattcaa tttgccaatt gcttatggtt ttggttatgt 
ctgtagttta 2221 tttaatccaa agttactctt gccttgctca attctagggt actggctttc taacattgtt 2281 ggtatcctct tgatgcaatt cggagctatg gatattctgc caaatcaatc caaggagcgt 2341 aatttgagaa aagaattgct tacagggtta gtttcctcaa cgctttatac tcttgtgatt 2401 ttggcattag ttcagttgaa agttctcggc gctcctattg cgttatttgg tagttaatat 2461 catgtccgct taattactta taattcctgc ttgacctcac cctaaccctc tccttataaa 2521 ggagagggta ccggaggcgg gtgaggttcc tcgtttttat aagtgtttat ccggacatga 2581 tataatttct tttggtgcgt tagcgctcac gctaacgcac caaaagaaat taagaatatt 2641 ttttatacat atgttgattt ctccattgct ttaagatagg gcagaaattc ttctaaacta 2701 tcagcaaaca taaaaggtac agcaactcca ttttcaacaa tcagttgcat acagtacttg 2761 gagtcaatag gaacattatt atttgtttca acacatgaat cgggtaactc tgtgatgaca 2821 attttctcaa aatatttaag taccacaaaa agcctattca aagagaataa tcgttcataa 2881 atgtattatt tcccataagt atgaataagt tgtggcaatt atatatttta gctgtgaata 2941 ttctgttgca agtatgtaac tagctgctct cctgctatgt caatatgttt tttgagtaac 3001 ctaacagcag ctttggtatc gtgctgttga caagcttcta ataattgata atgttccttt 3061 tgggagcgct ctaagtaatc catctgactc atctgaagac ggacataacg gtctacattc 3121 aggtgtaaag ttttaattat tgttagtaac cgaggacgtt cagcactttt ataaagtgtt 3181 tcatggaatt cccaattcag tttcgcaagc attccggcgt caattgtctg gtctgtcttt 3241 tctaaaatga cagtggcttt ttctaaatct gatgcggtta gcttgggaat tgccagttgg 3301 atagctttta cttccaacgc actcctaatc tcgaatattt cttgtgcttc tgttggagaa 3361 agtgctgaca ctgttgctcc ccgattgaga tggagtgtca ccaatccttc tgcctcaagc 3421 tgacgtaaag cttcccgcac aggaatgcga ctcactccaa actgagtggc aatttcatct 3481 tgtctgagag attgtccttc ctgaaaaatt ccccgcacga tcgcttctcg caaagcgtct 3541 gcaattaaat ctggggtact gcgttgttgt tgcagcacat tggctgctag gtcatttaag 3601 ttcataattt attaagttaa taatttatat tgtatactat tttttgtata caatattcaa 3661 aagatatcta acaatccctc tattgatacc acgggagaca gtgatgagca ttcaacctca 3721 accagaccgg gtcattatct ttgacacaac gctacgagat ggtgaacagt caccaggtgc 3781 aacgctgctg gtggaagaga aactggcgat cgctcatcaa ctgtctctcc tgggggttga 3841 tgtgattgaa gccggatttg ccgtagcgag tcccggagat tttgaagctg tcaaaacaat 
3901 agctcaacaa gtcggcgtat tgggtggacc aatcatttgc tctttagcaa gagcgattcg 3961 gcaagatatt caagctgctg ctgaagcatt gaaaccggct gctcatccga gaattcatac 4021 tttcatttcg acttcagata ttcacctgaa gtaccagttg aaaaagtctc ggagtgaggt 4081 tttggcgatc gcccaagaaa tggttaccta cgccaagtct tttgtagagg atgtggaatt 4141 ctcaccgatg gatgcaagcc gcactgatcc agaattcctc taccaaattt tggaacgagt 4201 gattgctgca ggtgcaacca ccatcaacat tcctgacacc gtcggctact gcacacccaa 4261 agacatggca gttcttattc agggtgtgca agaaaatgtt cctaatattg atcaagtcat 4321 tctttcagtt cacactcaaa acgatttggg tttggcgaca gcgaacgcct tgacagcaat 4381 tgaaaatggc gtgcgtcagg tggagtgtac cgtgaatgga attggtgaac gagctggaaa 4441 tgcagcgctg gaagagattg tcatggcatt gcaagttcgt aaacctcatt tcaatcccta 4501 ctttggtcgt ccggttgatt ccgatgcacc tttgacaaag attcaaacgc aagagattta 4561 caaaacttct tgcttggttt cgcaattcac aggaattgtg gttcaaccta ataaagcgat 4621 tgttggagca aatgcttttg ctcatgaatc tggaattcac caagatggaa ttctcaaaca 4681 tcgccagact tatgagatta tggaagctgc tgatatcggt cttgctgaaa acagcattat 4741 tttgggcaag cactctggaa gaaatgcttt ccgaaccagg ctgcaggaat tggggtttga 4801 gttgagtgag gcagatttga acaaagcctt tatgcggttc aaggaaattg ctgataagaa 4861 aaaggaaatg tctgattggg atttagaggc gatcgttcgg gacgaaacac aaattcaagt 4921 agacaaaggc tttcaactcg aacacgtcca agtcacctgt ggagattgca cctgtccgac 4981 tgcaacgatt acagttgtaa ctcccaacgg agaaatttca acggatgcag cagtaggaac 5041 tggtccagta gatgctgtct ataaagccat taatcgactt gtgcagattc ccaaccaact 5101 gatagagttt tccgttcagt ccgtgacagg tggaattgat gctctgggaa cagttacaat 5161 tcgcttacga tatcaggagc gcattttctc tggacaagct tcagatactg atattgtagt 5221 cgcagcagct tatgctcaca tgaacgcact gaatcggctt tatcgatttt tgcaaaggga 5281 gacacagaag tcccatgctt caggttctgc ctcagttgtt aattttctaa tttaaatttc 5341 tttaacgagc gcattgttgg gtcattttcc atcccttagt ctaaactcca gtaactaagt 5401 tttgacactc cccggcttaa aagtcgggga ttcttggtct gggcttgctc tgtgcccgca 5461 gggcataggg cttcgtgcag ggtacttgag ttcaaggcgt actcaccggt aaggtgcaag 5521 atttgagtaa tgacagatgt aatgctgtga agatgatatt tatcagatgt agagcagtgc 5581 
tgtttggcaa tcaattgaga actaacaaga cataaaaggt gctcggaata ggtttaagag 5641 ataattcatt tccctgtctt agcataattt tgatttccga ctcctattaa tgacaagttg 5701 tactaagcaa ataagctaaa actttaagga gtaaattatg tttaaattca acaaacagct 5761 gagtgcctta gcgcttttaa cgatgacttc tgttggtgaa actatgctaa ccgttgcaaa 5821 ggcatctgct caaactcaat catatagaat agcagcaagc acaaccccac agcaacttgc 5881 tgtggggcct aaagagcggg atatcgccgc gctctttgac aaatggaatg ctgctttgca 5941 aacaggaaat cctgatgaag tagtgaagct ctacgcaaaa gatggtgtgc tcctgccaac 6001 ggtatctaat aaggtgcgga gcactcactc acaaatgaga gactacttcg agcatttcct 6061 caaatacaag cctaaaggta caatccttca acaaaacgtc cgcatcattg acaagctgtt 6121 cgccatcaac tcgggtgtct acagcttcaa tataatcaag aatggtaaac ctggaaaggt 6181 tgtagcacgt tactcgttcg tctatcgcca tgatggcaac gactggctaa tcgttgacca 6241 ccattcctct gctatgcctg agtagtgtct gccttgtcct atcgaatata ttgaatcatc 6301 aagcccttac gatcatgtaa atatgacgtt agacatgacg aatgcttgcc ctgctatttt 6361 caactgaggt ttattgtatc gcctttacca ctactgaggc gatgtcagat atgactatca 6421 aacaagtaaa agctggtacc acaggaattg ggtgctaggt agaaaaaaac ataaataaca 6481 cccaaaaaga ggaaccagct tagagaaatt aacgttttgt cggacaggaa aaagctgtac 6541 taagtactgg aggagctaaa aattattcga cttggttgga ggtgtattgg aggtaggatg 6601 gagatgagag ttagttattg ctgcactgaa actgtcatcg tgtcaggatg aattgagatg 6661 agactggacg cttttgaaga cgtctgtgag ttaaggagtc tgtctgatag atatgtaaag 6721 gcggcacttc ttgccagcag gattgacaaa accagtatac tcctgtgtga cgcacatgac 6781 gcagcaagct actgccgcaa catggacaac tatttgttgt cattttcata agtctctcct 6841 taacaccgaa agctgacgag aacagcttta cttcacaaaa gctattgatt ccaacgagtt 6901 tctcagggca tgaagcagcc cacttttggg agtacgttct ttgagaattt gttcgtacac 6961 tgcttcatat ccatccacca tttggctaac ggtgaatctg ttttccacgt cttctcggca 7021 ggtttgacga tttaattcca atgcagccgg aatcatttct gccatctgtt catagctagc 7081 gcaaacaaaa cctgtttttc cgtgagcaat gacttctggt accgaaccca tgttaatccc 7141 aatgactggt gtcccagttg ccattgattc aatcatgact aagccgaagg gttctcgcca 7201 agtgatggga aacagagtta tcgatgcgtt actcaacaat tcaactttgg cagcatgatc 7261 tacttcgcct 
aagaattgta tttgctcacc atctatgaat ggggcaattt ctttttcaaa 7321 aaactctttg tctacaacat ctactttccc tgccattttc aagcgccaac ccgcttgttt 7381 tgctatggcg atcgcatgtt gtggtccttt ttccggcgag aatctgccca aaaatgccag 7441 atatggtggc tcttttggtt gcgctacaaa tgggtaatct tcaattttga tgccattgta 7501 gaccgtgcta aggtagttca agtctggttg acgctgggca tcactaatac tgatgtatcg 7561 ttgcgttgca tgaagggtat atactctgct actttcattc gtaaatgcgc catgcaaggt 7621 atgtactgtg ggtgttgacg agatacttgc taattgcaac gtcgataccc ctacatggga 7681 atgaatgatg tcgaattcct ccgcttgttg gtaaacctgg ctagtttcta gcatttcgta 7741 cattatactc tctttaacgc ttgggtctaa gcgcaatgca cgaggatgaa ttgcttcaag 7801 tctagctaat gtttgcgaat ctccggaggc gaacaaagtg acatcgtgac cgcgacgcac 7861 taattcatca gtcaggtgac tcactacgag ttcaattcct ccgtaagttg aaggtggaac 7921 ccgttcccac aagggggcaa tttgagcgat tttcatacgt tttcttggtt cagctggtta 7981 aagtggtcgt tgaacaaggg taatagacgc ttttgcttca ggtgttgata cctcccttgg 8041 ctgaagttgg aattaaccaa acttcctatc tacaaggtat tttcatcttg ttttgcaccc 8101 ttataatcgc cacctccaat atgtgtgctg accagtcata gtgcagctat tttttacttt 8161 taccccagct ttttacatcc cacatctatc ttgaggacta tataaaaata tttaattata 8221 atagtcattg agtcatattt tgtggtaaac gatatttaat aaccatataa cgaaaaattg 8281 gtgttaatgt tagaaaaacc aaaacatagc gtcctctaaa atcacaattg cttttcaaaa 8341 ctccttaatt cgttgagcgt tgaaatggtg tatatttccc caatgctttt acaatagtcc 8401 gcctgttgac ctttatcaaa atgatttagg tatcaccttc atttccagac acaccaagcg 8461 aatcacaata caattaactt ggtggtaact tgacttttgc ttcttgagca aaagtgttaa 8521 agttagcttg ggttaatcgt cgtttcagct ttaattgtag gaacaccagt ttttttaaaa 8581 agaaagaatc cagagcacag ttgtcagaat atgtcctgcg gacacgctct tgcaacagca 8641 agataatatg gggtttgtaa gaagaatcta gaaggcatta gaaacgactg tgcgcaaagt 8701 agaatgggag ttttgataga atctaatact ttacaaacat cctcttaggc atagtagtta 8761 gcttgacttt tcacgggaaa ttctcaaaac acgaaacagc gttgtctttt gtcaaaacat 8821 aattcagaat tcggaattcc gaattccgcc ttgcggtact acgcgtatcg cgctagtcag 8881 aagttgtttc ttcaaccgcg ttcccgtgag ttcatgctca aattcagcaa cgcgtctcta 8941 ggctactgca tgatttgcaa 
ttgtcaggta agctgacagc agcagttttc atcctcaaaa 9001 aaaatgacgc caagtactcc cacaattctc gccttagact ttgatggtgt tatttgcgac 9061 ggattgattg aatattttga agtcgcgtgg cgtacctatt gtcaaatttg gttacctgaa 9121 caagaaacac cagagaataa tttagcctca aaattctatc agctcagacc tgtgatcgaa 9181 acaggttggg aaatgcccgt tttagtcaaa gccttggtag aggaaattcc tgaagaaacg 9241 attcttcaag aatggacaaa gattgctcaa gaacttttgt taaaagacaa tctcaaggcg 9301 acagacattg gtcatcaact agataaaata cgggatgagt ggattgctac agatttagat 9361 ggttggttga gtttgcatag attttatcca ggtatagtgg aaaaaattaa agcaacagtc 9421 gatagcacaa ccaagttata cgtcatcaca acaaaagaag ggcgttttgc acagcagcta 9481 ttgaaacaag gaggagtcga tttaccgcga gaaattattc ttggcaaaga agtcaagcgt 9541 cccaaacacg aaattctgcg agaattaatt cagacaacaa atactttgcc agaaagagtg 9601 tggtttgtag aagaccgact caaaacatta caacttgttc aacagcaacc gggtcttgag 9661 ggagtaaaat tatttcttgc agattggggt tataatactc caactgagaa agtaaccgcc 9721 caaaatgatc cgggaattca gctattatca ctgcctcagt ttccgaaaga tttctcgcaa 9781 tggctttaag aggatgttag tacgccttga actaaagttc aaggctcata gccaaagtcc 9841 gttaaaacgg actcgaatgt atattcagta agctttagct tacttgagct ttcagccaag 9901 aaataaattt cttggctgat ccaactgctg tattcagttc tgattcaata ctttgaatcg 9961 tccatacgtc aatttttcta ggtttttcac cactcaatag tgcaattgcc aaatagtagt 10021 gagtatcgga gattgacgga ttagtttttg ttgcatcact caacacatca atagcttggc 10081 gataagctcg ttgattgaga agctgaattc ctttgtgcag caactcctct gcagtgactt 10141 gtggactttg atatatatca ccaacatgcc caatactctc tgcttgaatc tgataatttt 10201 tcccgccgta attcttttgc tccatatagc agtcctaaat tattcataag attttctttt 10261 ctctttcttg gcgtccttgg cgacgccagt cacctacgga gggaaaccct cctgcagcgc 10321 tggctcgtct tggcggttcg ataattctca caactcatat aggattgcta tatttgtgtt 10381 ccaacaacaa actacttttt tgtgtaatca tcaccaattc gatcaattcg atttgcttga 10441 acttgaaaac ctttagcatt gtcacggttt aattgcttca tttcgccaaa ctgcaattgt 10501 tcctggctta tcttctgaat gttgataatt tgctgtgcga gttgtcgcaa atcatcagca 10561 aaacttggtt cttcatccat ctcgtcatct aggtaagtcg tcaatttgtg caatgcttcc 10621 acagaatgct gttcttgaac 
tgcggtaatt gctttttctg ccttaacatt ctgtctatcc 10681 ttgaaccgtg acacaatctt gtgacgtaat tcattagctt tgcttaaagc ctcagttgtc 10741 agctttttag cagcttctcc agcactagat ttaataaact cattgaaagc aagcttaaga 10801 atttcagcag aactcaaagc aacaattggg ttagtcatgt aatcctccag aagactagta 10861 aatttactta actttataga ctctcaacaa tcaagtgcga caaactttat gtaacacctg 10921 caattgaata gcaaaaactg taccaacacc gcccaattga gcgaatatgg cgctggcaac 10981 tcacacctat ttggttctac aactcggctt atattatact tttttatact tatatttacg 11041 tatatctact tgatttgtca ggatagcaat aatttttatg gatatcattt gatatcaaaa 11101 catttcgctc ctactggcta gtgagtatca ccactgatga aacatcctct aagatagtgc 11161 tgagtactga gttctgagtc tctttgtttc aacactcaga actcatactt cacaactggt 11221 aaacttacga ttatgcttga cctcactaaa ctgacaagac aaatgcaggg tttgagtcaa 11281 catctgacgc aagaggcggc tgcgagtcgt cagcgtttgg aattggctca aaaacatttt 11341 caaaatgctt tggggtgtca ggatgacttg gtacagcgac aggaaaaatg gcgcgatcgc 11401 atcctctttg ccaatgcgac tcccctagaa ccactcaaca cgtgtattga tattccagtt 11461 cctcctaaaa tacacaccgt cattgcaacc gatggttcgc aaatatctcc cagtcaccac 11521 gaaattgctt actgctatct tcttaatatt ggtagagtta tcttacatta tggacaaaac 11581 cgttacccgc ttctagatag tttaccagag atattttatc gccccgaaga tttatatttt 11641 tctcgcaaat ggggtattcg tactgaagaa tggatgagtt tttgccgcac agcaagtgaa 11701 gcaacggtgc ttgcagaatt ggcttgcagt gtaaaagagg acaggggaac ggaagaagtc 11761 cctacgttag cgatggtgga tggttcgttg atttactggt ttttagaaca attaccttta 11821 gaagcacgcg atcgcatttt accccccatc ttagaagctt ggcaaaagtt gcgtgatgct 11881 caaattcccc tgatgggtta tgtgagtgct tctcgtaatg ttgaaggaat gaactttttg 11941 cgtcttttag cttgtccgca cacagtacca gattgtgcaa gcttttgccc aaatcaactc 12001 gaaaaagtcc cttgcaaagt ctttgaacct ttacgagata caactctttt atccacctta 12061 ctcaaaccag gacaacgcgg ttgcttatgg cgcagcaatg ttcgtattct tgacttatac 12121 caagaccagc aaatttactt ttgttacgtg catgtgggta ccgaaattgc ccggatagaa 12181 gtgccatcat gggtagccca gaacacaaca atgtttgacc aagcactagg actgatgtta 12241 gcacaagtgc agaagggata tgggtatcca gtggcgatcg ccgaagcgca taatcaggcg 12301 
gtggtacgtg gcggggataa ggcgcgattc tttgccatcc ttgaaagaca aatgatcaaa 12361 gctggtataa aaaatgtagg aacttcttac aaagaagcca gaaagcgtgg cagtatagct 12421 taagtcaaga actaggggta taagggtgta agggtgtaag ggtgtaaggg atgtatgggt 12481 ttgtctcccc cgttcctccg cttcttctgt gacctcatct ttgatacttc ttcttatagt 12541 atttgggctg acgggtagtt agggtattta agtgtctgct tgataatcgc ttttctataa 12601 ttcgtactct tttaaaccca tcgctaagcc caatggatat attacattag cccacttaaa 12661 ctgatgatac attcttgaac cagaattttc ggaacattga aagattacta tttctactcc 12721 tctttccttt aaatcttgaa ggtatggaac ataagcaaaa tcatcagcta ctaaaagtat 12781 attatcaaca ttaataatgc gattaagcag ttcttctacc ttaaacgata caatatctgc 12841 tgttttttgc ttatataaat tgtaattaca accgaccgac tctggcatat ctcccatgaa 12901 aaagccatgt ttggtaattt cccctatatc tactaaatac tctaaattga aatgctccga 12961 gtaaatcaaa taacagttca cctcattctt acgttcaata tgagtgtatg ttaagagttg 13021 atcgactata gcatcaatgc tcagcttctg gaatgttttt tccggataca gtttctgata 13081 aaatctttgc gtctccttag ctaccttttc aagaaaaatt ccatctatga tgatgaaatt 13141 gcttgttttt atgagttgat tgacggcttg tcttaaaagg atattttcaa ttttcatgat 13201 agcttcctta ttaaagactc gttttttaaa ctcttctttt gtctgaaaat ttcttggtat 13261 agctagatca aaactttctt tttcaatccg ccagttttta atatcacttt caaaaaatac 13321 ataaagaaaa tcatctttgc tgttctcttc ttctacatat agaaaaagaa caaaattttc 13381 atttacatac tttttcttta ttatgacact agaatttgtt ttaagagttc tacctttgca 13441 ttgaacctta atgaagggag taatttttgt ggaaatatct ttgatgatca gtaaatcagt 13501 tccttctttg tcaaatgatg gttttgtaac gagtaaacca tacttaatta gtttatgttc 13561 aattagtttt tctgcctgaa gttctaatgg cttattgtcc atccttatat ttctgtttat 13621 ctataatctt tatctaagct taactgataa ctcaacccat ttaacagtgt agctcaacgt 13681 ccgcttgcta ttctggtaaa aacactcgct cgaaaggctt ttttcgataa cgccgataaa 13741 tatcaaaatc actatccaaa gtcgctattg cagcgatatc taagcgctct gaaatgacaa 13801 tgagggataa atctgcaaaa tctcctggta aatctctgta ggtggcattt agctcagcaa 13861 tgcgcgaaaa atcttgaggt aacaactgct cacactcgta aagtcctttt gcaacgtcta 13921 gaagaaactc gttttgtgtg cgccaatcgg aactgagtaa ccacatcact 
tctgcgatgc 13981 agataggtgt agttaccaga ttggaagtgc agcgttcaaa gaaacgacgt acttgttgat 14041 gatgcttatc tctggcgctg tagtacgcaa acaaaaaccc actatctgcg agtatgagcg 14101 gatggtaagt catgaatggc gttctttgag atactcagcg atcgccttct tacgatttgc 14161 tcgttcagat aaatcgggcg gtgcgttttg taagaggtgt tcagggtgtc cacctcgcct 14221 atcaaccaaa gttctgcctg cttgcaaatt aagccagcgc tcagcaatca agcgtcgaat 14281 caactcactt ttgtctgttt tctcgtgagc caagatatcg gctagctggc gttctgtttc 14341 ttcgtctagt ctgacggtga gcatggcttt tgtcatactt gtattgcgct tatgacaagt 14401 ataacagaaa gctttttggc taaagcaaaa gttgcttttc taccctgaaa aataatatgg 14461 gattaccatt cacaatcttt tcctctactt ctctcattcc aattctttct gctactttgc 14521 gtgactcaag atttggctga tcacagcttg ctaaaagata ctgatagcca agttcgttga 14581 aacaatactt tagtattctg gttgcagctt cagtagcata gccttttttg gttgcttcag 14641 gcagcaaagc ataaactaat tggggttgcc cttcaccaaa gaaataccac aaaccaacaa 14701 acccgataat ttctttctca tctttagtct caataaacca aagaccaaac cttttttcat 14761 caaagagctt ttggctttct atcaacattt cttcaacctg ttgcaaagac aaaaccttgt 14821 catcgcacaa atatttttta acgtatggat taataaaaat actgtgtagt gtgttaagct 14881 cactttctaa gatcggcttt aatcgcaacc tctgagtttc aaataccaag ctcatgacat 14941 cttgctacat tcaatttttt aatcttctgt ctactgagat cttgcaccat cattctcatc 15001 cgccaagaaa taaatttctt ggctcaaagt ctaagtaagc taaagctcac tggatatgta 15061 tttccagtcc gttttaacgg actttgctta taagccttga aatttatttc aaggcgtacg 15121 tcgaggtgaa agatctgatt caactatacc gcctgtccat tttttccaaa gcatactcga 15181 agatcgacaa ctccaaacag tacgcttgct gatatcctaa aaccaacaag cgaacgcaaa 15241 aaaattgtcg aggtaaaagt agacgatgga aatctcaaca gttaaggctc cccgctggtt 15301 tgtccgcaga gatatagatg gtttgtttgg gcttttcctc gacaacttga ttcaaatttt 15361 gttgattgtg aatttgtgtc aaggggtgct tggctttcct ccctctctgg tatacgggcg 15421 tattctccca ggaattgctt taagtttaat tgttggtaac ttttattatg ggtggttagc 15481 ttaccaaaaa ggtcagcggg aacaacggga tgatatcact gctttgcctt atggcattaa 15541 tactgtgagc ctttttgctt atgtattttt ggtgatgtta cctgtgcgct tgacggcgat 15601 cgcccaaggt acatcatcag aacaagctgc 
tgaactcgca tggcaagcag gtttagtagc 15661 ttgcttgggt tcaggtttga tagaattagt cggtgcatgg gttggtaatc ctctgcgtcg 15721 ccttgctccc cgtgcagcga tgctttccac tttaggtggt attgcgatca ctttcatagc 15781 gattggcttt ctatttcgga cttttgccaa tcccgtagtt ggtttagttc ctctaggcgt 15841 tatcttaatt acctactttg gacaagtgcg atttgccatt ccaggtggat tattagcagt 15901 ccttttggga attggtttgg cttggggtac aggtttagtc agttgggata acgccaagtt 15961 cgccactgca ctgcaaccga taggagttta cataccaagg ttatggctag gggatttatg 16021 gaatagtcgc gcagtcttac tcgactattt cagcatcatt ttaccaatgg gactatttaa 16081 ccttgtcggt agtctgcaaa atttagaaag tgctgaagct gcaggagatg tttatcctgc 16141 aaccccaagt ctggcggcta atggtattgg taccttggtt gcagctattt gtggttcttg 16201 tttcccaaca accatttata ttggacatcc tggttggaaa gctttaggtg caagagtagg 16261 ctactccatt ctcaacggta tttttatggg tttactgtgc ctgactggaa ccgttgccat 16321 gcttgcctat tttgtcccta ttgaagctgg aatggcaatt gtgttatgga ttggcattgt 16381 cattgtagct caaagcttca cagcaactcc ttctcaccat gcccctgctg ttgttgtcgg 16441 tttgttacca ggtatagcag gttggggcgc tttaattgcc aaaaatgcac tacgcgcagc 16501 aggtttggga acaccagaaa aacccttgac acctgcctta attgaacagt ttaaattgag 16561 cgatacatat atagatggtg cttttgcttt ggagcagggc tttatttttt cagctatgat 16621 tttagctggt ataacagttt atattatcga gcgagacttt cgcaaagctg gttattggtc 16681 tatagccgca gctttcctct cttggtttgg gttgatgcat agctaccgtt ggactgtagc 16741 agatactgtt gtcaatttgg gttttggaac tggaacgcct tgggctgtcg gctacatttt 16801 actagctatt ctgtttttct acaccgaatg gcaagcgcgt catcaaaaag gaaatcaaaa 16861 ggttgttcaa tctccatcac aagacacttt tgatagaaat tgagagaaat gaggttatgt 16921 tcatagttac cgtccaacct ctcatatcta cttccatcag ccagatttaa ataaaattgg 16981 atattttttc ggatgtttgt gcctgatcgc aatcgtattg cgatatctcc tactcataat 17041 agccacatct ttgatattga aggctgttca ctgtattgat gctcctaagc taactgaaaa 17101 tccagcttac tccttgagga ttagtcggtt tattcctgtg gtagaaggtt gagtaaaaca 17161 atagtaggta tcgtacttac tagtgctgtc aattttacgg ttaccacgtt atgttgttcg 17221 attcaacaaa aaattttgca gagttattaa agattgcgat tgttttcctc gtaggaaaaa 17281 aggtttttaa 
aaatttcgat aaagaagtta taaatttcca aaaaactagc aatctatagc 17341 agtcctcctt ttttaaaaag cacaaaattt ccgaaaaaaa cttgcgacac aaacaaaagc 17401 aacaaaaact atgaagcttt ctcatttatc taaaaatctt tgggctagcg cctttgccct 17461 cggtttagct actttgtctt ttactctacc tgtttctgcc caatctggtt cttctggtag 17521 tggcggaaca tctggtggtg gtactagcac aggtactact ggcactactg gcactggcac 17581 tggcactaca ggtactggca ctactggcac tacaggtact ggcactacag gtactacagg 17641 tactggcaca ggtactggca ctactggcac tggcactggc actggcacta caggtactgg 17701 cactggcact ggcactggca ctacaggtac tggcactggc actactggca ctacaggtac 17761 tggcactgac actggcactg gcactacagg tactggcaca ggaacaacag gtactggcac 17821 tgataccact ggtggcacta ctggtagtgg tacgacaaat ggcacaggaa caacaggtac 17881 tggcactgat accactggtg gcactactgg tagtggtacg acaaatggca caggaacaac 17941 aggtactggc actgacacca ctggtggcac tactggcaca ggcaccacca cagacacgac 18001 aaataatcaa ggcattagag aggtgaggag cgaacgccat tccgactggg gttggcttgg 18061 tttgcttggt cttataggtc ttacaggtct tattcctaaa cgctctcaac cccgagtaat 18121 ccgcgatccg aatgaagtaa cgcgtcctgg atctactaag ttataaggct ttgctgaagg 18181 tagggggtaa aggggaataa taattcactt tccacacaaa ttgagtattg tgaactcttt 18241 accgccgcct taagccagtt cttagagcat gactttagat attaaatatt aaatttaatt 18301 attagaaatt ctaaataaga taaccgcgct caggtacaaa ctgcgttacc ctggtatacg 18361 cttcgtgttt ttcactgcta gctggaaacg catatgtcct ttgtcccttt gcacattcat 18421 agtgattaca gtctgcttga tggagcaagt caactgccgg atttagcgga tcgcgccatc 18481 gaactgggga ttaaagcaat agccctgaca gatcatggtg tgatgtatgg tgcgatcgaa 18541 ctgattaaaa tttgccgcaa taaaaatatc aagccaatta tcggcaatga aatgtatatc 18601 atcaacggcg atattacaaa acaagaacgc cgtccccgct atcatcaagt tgttttagcc 18661 aaaaatacaa aaggatataa aaatttagtc aagctcacaa cagtttctca tcttcaaggt 18721 gtccagggca aagggatttt tgcccgccct tgcgtaaaca aagaattatt aaaacagtat 18781 catgaaggct tgattgtaac gagtgcttgt ttgggtggag aagtccccca agcaattctc 18841 agtggaaaat tagaggctgc acgtaaagtt gctcggtggt ataaagaagt ctttggtgaa 18901 gattattatt tagaaattca agaccacggc tcccaagaag accggattgt gaatgtagaa 
18961 attgtcaaaa ttgcgcggga acttggtatt aaaattgtcg ccaccaatga ctcacatttt 19021 atttcttgtt acgatgttga agcacacgat gctttgctat gcattcaaac tggcaaattg 19081 attgctgaag ataatcgaat gcgttatagc ggtacagaat atctcaaatc tgctgaggaa 19141 atgaaagcgc tatttcgcga tcatttgcca gatgatgtca ttgaggaagc tgtgacaacc 19201 acagttgaag ttgcagataa agtcgaacct taccagattt taggtgaacc tcgcatcccc 19261 aattatccca ttccatctgg tcatactgct gatacatatg ttgaagaagt tgcttggcaa 19321 ggacttttac aaaaattaaa tcgtaaattg agaaatgaag ttgaccaagt ttacaaagac 19381 cggctggaat atgaactaaa aatgattcag aagatgggtt tttccagcta ctttttagtg 19441 gtgtgggact acatcaaatt tgcgagagac aaaaatattc ctgtaggtcc aggtcgcggt 19501 tctgctgcag ggtcattagt tgcttatact atgggaatta cgaacattga cccnnnnnnn 19561 nnnacttttt gaaagatttc tcaatccaga acggaaatct atgccggata ttgatacaga 19621 tttctgtatc gaaaaacggg atcaagttat tgattatgtt acagaaaaat atggcaaaga 19681 gagagttgcc caaatcatta cctttaaccg cttgacttct aaagcagttt taaaagatgt 19741 tgccagagtt ttaaatattc cctatgggga agcagacaaa atggcaaagc ttattcctgt 19801 ttcccgagga aaacccacca aactcaaggt gatgatttcc gacgccacac cacaaccaga 19861 gtttaaagaa aaatatgata atgacccaag ggtacagcat tggcttgata tggcaatccg 19921 tattgaggga acaaacaaaa catttggtgt gcacgctgca ggtgtggtaa tttctgctga 19981 accgttggat gaaattgtcc cactacaaaa aaataacgac ggttctgtga tcacccagta 20041 tttcatggaa gatttggaat cactgggctt gctaaaaatg gattttttgg gtttaaaaaa 20101 cctaacaatg attcagaaaa caatcgattt gatttcgcaa aacaaaggat ttagaattga 20161 gccatacgac atcacgactc aagaaagaaa agctcaaaga attttagcaa aaggtgaata 20221 taacacctta ccaaaagacg ttcaaaagac ttacgaacta ttagaagcag gtgaactaga 20281 aggtatattt caattagaat cttctgggat gcggcaaata gtacgtgatt taaagccttc 20341 caacatagaa gatatttcct cgattttagc actctatcga cctggtccat tagatgcagg 20401 actgattccc aagtttataa accgcaagca tggtcgagaa aatattgact atgaaaccca 20461 aattctagaa ccgatactag atgagaccta tggaattatg gtctatcaag agcaaattat 20521 gaaaattgct caagatatgg ctggatattc tttaggacag gcagatttgc tccgccgtgc 20581 tatggggaaa aagaaagttg aggaaatgat gaagcagcga 
gaaaagtttg tggatggcgc 20641 tgcgaaaaat ggagtgaaaa aacaaattgc tgaagaatta ttcgagcaaa tgttgaagtt 20701 tgctgaatat tgtctcagct atgacacaga agtgttaacg gtagaatatg ggtttttacc 20761 tattgggcaa attgtagaaa aaagaattga atgtagcgtg tacacagttg ataataacgg 20821 aaatatttac acacaaccta ttgctcaatg gcatgatcgt ggtcagcaag aagtttttga 20881 atattctcta gaggatggtt ctgttattca ggcaacaaaa gaccataaat ttatgacgac 20941 tgatgggcaa atgttgccaa ttgatgaaat ctttgaacga ggtttggagt tgatgcaggt 21001 taagggttta ccgcaatagc tacgaataac agtttttgat gccatttacc tatgaatctt 21061 tttaatcaat agtttttttt atattaaaaa taactatcga tcatgcgagc taactttact 21121 gattcttaac acagcagagt gaaatttact tagccactaa ctgacaacct cgaaatcacg 21181 ttcatctgta acgtattctg caggcatatt gactaattgt aaaaaataga ctttgcttac 21241 tcttggattt atgagataat cttgtgatcg ggactaaact cagtaaagac acttaattaa 21301 tttgtttgca aagtattaac tattctggtt catatcaggg aataatacta gataagtgta 21361 gaaatataaa ctacaacgat tgtttctatg tgtgcattct cgaattggct cggaaaacga 21421 cttcagtggc tgtgttgctt agctgcatta tgcttggttt gtctggttat gggttggtca 21481 cttcccacaa atgcagcaaa tcacatcagt cagcaattag aacagcaaaa cgtactattg 21541 actcatgaaa aacaacttgg tgagttgatg tattcagata tcgccaaaaa ttttttggcg 21601 gttgaaaagt ttcaacgcgc tcacaccttg cgcagttccc tctgggagca tctaattcgt 21661 gctgacagtg caatcaagaa agatataagg cttgctcaga gattgagacc ttctggcatt 21721 cccttctttg tgatgaatgg taaaaatttg tccggtggat tgtaactgtc taatatggag 21781 aatgtattga atcatcatca ttagttttca gtattcatcg ggttttgctt agtctacaaa 21841 gtctatccat acttaagcat ctgcatcatc agtaatcact gttaatccca attaagccag 21901 gattactctt tattgtgttt agttcgcaag gttacaaagg atgcaattgt atagcttaaa 21961 atgtgtatga ttataccaaa aaatgggaat tatgagcaaa acgcgatgca ttccaaataa 22021 ccctattaac ttcggtggaa cagaaatcat gaagtcatta atgcgttgca gtggattttc 22081 tgtgtttttt cctatcgctc tggctacaag ttacagttcc caactgttga tgtctaagga 22141 agcggtggca gaagatgcac agaatgtact ccccactgaa gtccaagaag cttattctac 22201 tggagcgata caggctaatg acctgtctcc attagaacaa caagtcatta atgaaatgaa 22261 caagatacgg acaaatccca 
aggcatacat cccgatcttg gaaaactata aacagcgctt 22321 tcaaggtaag cgagttaaaa tctacaatca gcgctttatg ctaacacacg aagggctatc 22381 agccgttgat gaagcaatta ggtttctgca gtcagcgagt cctgttgctc ctttaacgat 22441 atctagggga atgtctttag gtgctaaaga tcttgtcaaa gataatggct caagaggttc 22501 gacaggtcac ctaggtagcg atggtagtaa tccatctact cgtatggaac gctacggaaa 22561 ttggcaatct agtgctgggg aaaatatcag ctatggtccg agtacggctg aagatatcgt 22621 catacaatta attgttgatg atggagtgcc caatcgcggt catagaacaa acatttttaa 22681 ccccactttt agagtcgctg gagttgctta tggaattcac gctagataca aaacgatgtg 22741 cgtaattaat tatgcgggag aatatcaaga agcaatttca gtttcgtcta gaggtcgtcg 22801 ttaaacatgg ctttttgaac ctataaactc tcgaccccac tgctttctct aaacgcgagt 22861 gcgtgccata ctcataaacc agcagtcttc gtgttttcag ttcgcagtgg tatgagttcc 22921 ttagttatca tcgaaaccag tcatcaattc atgcatctaa agacatatag cgattctcat 22981 ttgcgtaaaa tacaggtatg cggtgggagc acgccataga acattggtgt caatttcagc 23041 aaaactcaag tctaacagga ctttcaggct ctgtggttcc catgcttcgc ctaaaatgcc 23101 tgaaaacttg gctatagcct cttaacgagg aacaggctat ggacttgggt tttcttgtta 23161 aattgacatc agtggctata gaacagctct tataaacaat ctgtatgaca cccaattggg 23221 aactgctata ttgactggtt gctccaaagt tatgaagttg ttgatgcgtc agatggtatt 23281 ttggatattt acactcgtca ccctatcgac tacatcttgt gatgtattgt ttgagcctga 23341 ctcctctgta cctaacattc ctgaaggaaa ttctgatgtc ttggtaaccg ggaaaactct 23401 ttctcgcctt gaacaacaag tcatagtaga aatgaacaag gcgagaacaa atcccactgc 23461 ttatgctgct gtattaaaaa actatagaca acgttttgaa ggaaaccgag ttaaaatttc 23521 ccgtcatgtc tatttgcaaa cacaggaagg agtgaaagct gttgatgaag cgatcgcctt 23581 tctcaagtca gtcactccag tcggatcttt aacagcatcc aaaggaatgt cacgagcagc 23641 tagggatcat gtcaaagacc aaggatcgaa aggaattctt ggtcacaaag gaagcgacaa 23701 aagtgatccc tttacacgcc tcaaccgcta cggaacttgg aaacgcaccg ccggagaaaa 23761 tattagctat ggctcccaca cagcacagga tattgtcatg caattaatta ttgacgatgg 23821 agttcctgat cgtggtcata gaataaatat gtttaatcct gcttttaaag tagcgggagt 23881 tgcttttgga attcacaaca catacaggca gatgtgtgtg attaattatg ccggaaggta 23941 
tatagaaaag tcataattta tcactcaatg acctttgaca gatgactgct tcatatggtg 24001 tgtggtgacg atgcgcccct aatttgcctc tgtgacagaa ttatataaag caacataaaa 24061 ataatgaact ctcgccatag caacacagct tcatctgctc ttcacctcaa cgttgacgtt 24121 caaggtcaag gcttcccaat tctgtgctta cacggtcatc ctggttccgg ttctagtctt 24181 tctgtcttta ccaatcacct ctcaaaacgc tttcaaactt tggctccaga cttgcgtgga 24241 tatggcaaaa gtcggtgtga tggcaatttt gatatgcggg atcatttgat tgacctggaa 24301 gcgcttctag accgctttag aattaaaaaa tgtctgatat tgggttggtc gctcggtggc 24361 attttagcaa tggaattagc actacagcta cctcagcgag tcactggact gattttagtg 24421 gcgacagctg cccgacccag aggcaatcat ccgcctataa cctggcaaga taatgtttac 24481 actggtgttg ccgcgctatt gagttatatt aaaccagatt ggcagtggaa tattgaaact 24541 tttggcaagc gatcgctctt tcgctacctg attcaacaac acactcctac agcctaccgc 24601 tacatagcaa aagatgcagt gtcagcttat ttacaaacta ctgctcctgc gactcgcgct 24661 ctttattccg cgattggggc tgggtacaat cgactcacag acctagaaca aattcaatgt 24721 ccttgtttgg tactagctgg tgcatgcgat cgccacatta cagctgacgc cagcttagaa 24781 acttttcggt acctcaaaga ttctcagtca cactgctatc ctaataccgc ccatcttttc 24841 ccatgggaaa tccctaacca ggtattgagc gacattgacc gctggctaga gaaaaaccaa 24901 aaagttgtca gtgtgtagtt acgcttaggc gtaactacga aatagcctta taaccagaaa 24961 agaactcagc acagtgaaca gtgagtaagc gcgcgcattc ggcgggtttc ccacgcccag 25021 gcgactggcg tatctcctcc accagacgct tcgctttagc cctctgggcg tgcgcgcagc 25081 gcacacccag agggcgcagc cgtggcgtca gccatagggt tatcactaaa cagtgaattg 25141 ataactgatc actgttcact gataactgaa ttctcctcaa tttaattaca tttgaccgct 25201 aaagacaggc tgtactttgc ggtcagtccg ttcccccagg tcatcagcaa tcgtcaaagc 25261 acttggttta cagcgcttaa catcgacaat aattccttgc gctccttcca gttccaaatc 25321 ttctacgtag ccttttatcg ccgcttttgc atcagcagaa ctgataaacg gtccaaagta 25381 gtaagtgcaa cgggggtttt gtgttataat ttctacccac caagccaagc caaaattgtc 25441 gaatatgtta atcaacgatt ccttaaggtt tttccaaata cttttcatgg ttctcgtcaa 25501 ttgctaaatg tgatagttgg ttctaaatcc tgctggtttt gactgcttta catttcttta 25561 tactctgttt actcttgttt tttcaaagaa attttttgta atttatccag 
gtatagggta 25621 tttatccagc gatgacgata aatctcataa agcgccatac ccgctgctac tgaggcattc 25681 aggcttggag tcttaccttg taaggggatc gacaccaaaa aatcacaatg tcgttgtgtc 25741 aaaatactga gaccttcgcc ttcagaacca actaccaaaa caatcggacc gttaaatttc 25801 acggtatgca taggttcgct tgcttctgta gcagtaccgt aaatccaaaa gccagcagct 25861 ttgagttctt ccaaagcacg ttgcaagttc accgctctaa ccacgagaaa gttttctaaa 25921 gctccagcag ccactttcat gactgtagag gtaataccag atgctctgcg ttgtgggatc 25981 accattcctt gagcacccaa tgcttctgct gtgcgaataa ttgctcccaa attgtggggg 26041 tcagtgattc catcagcgac aacaatcaca ggatcagttt cagattttgc ctttgcaatc 26101 aactcatcca agtcaatgta agcgtaggga gcaatttgtg ctgctacacc ttggtggtta 26161 gcttgctctg taatttggtc taagcgctta ggctcaacct cgtcgataat tgcgccattt 26221 tccttcgctc gtaaaatcag cgaatgaaag cggggatcgt aacggaggcg ggaagtaatc 26281 cagatacggt tgagatcccg ctcattttcc aaagcactta atacaggatg acgaccgtaa 26341 atcaaatctg tatcttctga ttgttcagaa gaaatagatg aatcagaaaa gttacgatgc 26401 ggacgtgggc tgtctcttaa accgttagca cccttaactc tgggattggg actaagatga 26461 gaaatgacac gcttgccctt cattcttatc ggttgcgcac tattgtgttc gccagaagtc 26521 ttaatttttc ttggtttatc tatcatgtga gttattggta gagcttaata ggtgagtaac 26581 ttgtgacctc aattcaccga accaactctg caagttttac gactcgtgaa ttgaatttct 26641 tacggctttt ctaagtcaag tttttgcaaa agttcgctta agcgatggaa atcggtgaga 26701 tacaggtagc caattaatgt ttctaaacta gttgcttgtt gataggttgc cagatcaacc 26761 cgcctagggc gtcctgtggt agcgttgcga ccccgtcgga caatttctaa ctcagtgtta 26821 tttagatgag gatacagcga ctgcaacatt ttcgcttgcg tctctgctct gacttgtgcc 26881 acaactagac gatggtatgc ttctggtcgc tttggtggta gtagataaaa cattctaacg 26941 tataattcac aaaccgcatc tcccaagtaa gctaaagcag taggagaaat ttgttgcagt 27001 tgggcttgtg aaatctcttg gaataattgc tctttagttg cccttagtag ttgacaccaa 27061 attgaatcct gactgggaat ttcatttgag ccatctagta ggtcttcctc ctttgatgtc 27121 acaagttgtt ctttcctcac acgctacatt aggcaggtct acctaagcat agtcaatgct 27181 aaaaaattta cttgctaaat tttatctttt tttcaatttt ggttgccatt tcatatatta 27241 agggcactga ggcgtgaagt ctgaaatcca 
gatgagtcca tgattcttta tctggtctcc 27301 aaaagtacaa gtgcttttaa aagcttaaag tataaggtat gaattatgaa atggatttta 27361 tatcgtccca ttataacata cagttagccg tccttaagtc ttttacacta tttatctttc 27421 tcaacccgag atctttacac tttacttttt tgctccgtac tttatgagtt aacgttatgc 27481 aaaaaaaaga aattctaatt taataataat taagctatcc agattggatg atcggatagc 27541 tatatgaagt tctcaggtca agaaatcaag acgatttgac gttttctaga gccacatcaa 27601 cggatggttg taaggcgaga aacttctcca agcgaacaag cttgaccgtc tgagtgacac 27661 gggcattact gacaatttgc aaggtgccag cggagttttg agcacgcttg gctaactgta 27721 ccaaagcacc caagccagag ctgtcaacaa aatcgatttg cgagagatcc agaattatgt 27781 gctttggtcc ctcctcaatc ttgctcccaa gaaccttggt aaatgtcggt tcagaaaagg 27841 cgtctaacaa acctgtgagg cggaatagct ggcagttatc ccggacttca cgagtgcctc 27901 tcaggcttac agttagagta agtggctcag caataattcc ctcctcataa gttatggtga 27961 acgctcaagt atagatggtt ttagtgtagg ttgtctacaa cctgctggca gggaagaaga 28021 cgggaagatg ggatatgggg gaagtgagga atttgagagc cagagagtgc gttgtaagac 28081 tggtgctgct cgctgtctga cttcctcact ccctcatctt tcttttcctc ttgccctctt 28141 ttctccgact ttcttcattt ttatcctact ctggcttttg tggatcaacg aactgctcgc 28201 attgcttgga caaactgctc aaacagataa tcagcatcgt gaggaccagg acttgcctca 28261 gggtgatatt gtacagaaaa tacaggcaaa gatttgtgac gtaacccagc aacagtggag 28321 tcgtttaaat taagatgact aatttctaca acactgttgg gtaaagtgtc tgggtcaatc 28381 gcaaagctat ggttttggct ggtaatttct attttttgtt gtagaccagc tggctgattt 28441 aaaccacgat gaccaaattt tagtttaaaa gtttccgctc ctagtgcatg acctaatatc 28501 tggtgtccca tacaaatgcc aaacattggt ttgtgagttt ccagtaaggc tttagtggtt 28561 ttaattcctt cggtgacaga tgctggatcg cccggaccat tagaaagaaa gatcccatcg 28621 ggattgtatt tgagaatatc ttctggtaaa gtattggcgg gaacaataat aacccgacag 28681 ccataacttg ctaaacgacg caatatattg cgtttcaccc caaagtcaat ggcgacgaca 28741 gtaaaagatt ccttggaatt ttcttgacaa gtagggttaa attcccatac tggctgggta 28801 ggttctgacc attcgtatat gtctcgggtg gtaacctcgt ggacaaggtt caatcctttc 28861 atgctaggag ctgcctgcac ttgctctagc aattcagcct catccagaat ttctgtggaa 28921 ataccaccat 
tcattgctcc gaacatccga attttacgag ttaaggcgcg ggtatcaatt 28981 ccatagatac caggtatgtg gtgctgttct aaatattcag gtaaagattg tgtggagcgc 29041 caattactcg gtcgagtaca aatattgcga gcaatagcac cccgcacctg tggtttactt 29101 gattcttcgt cctctatatt aacgccagta ttccccaatt ccggataagt aaaaacgact 29161 atctgaccgc agtaactcgg atcagtcagg acttcttggt atccagtcat accagtatta 29221 aagacaactt ccccaattgt ggttctagtg gcaccaaaag accaaccacg atatgtggtt 29281 ccatctgcca aaacaaaaat agcaggtatt gcgtcaaaaa gagacatagg ctataaaatc 29341 ccaagatttg tgatgtgtgt gattaggcga caggtcaaca attcaaaatc cccatacagg 29401 agaaattcaa aatttcaaat gaaaaaaaac agcactggta accatcctga tcgcttgctg 29461 tatgcaaatt caatgaagga caggtaaaac acgagcgaca aaggaactaa gcttgaactg 29521 aatcatcact tcacctgtcc gttgctaagc ctacaaacta aaatttaaga gcatgacata 29581 taaaatcttc caacaataat agacagtcgc aatagatttc gacgccgaag cgcaagtaaa 29641 attataatta atcatgcatc agtttatttt atcccgagca gcgctgattc ttgtgtcaac 29701 cacactagcg gttttgactg tagcctgtaa tggaaagcaa caggttactg tcagaagtgg 29761 tgagcagcaa cccgtgccta aagatcagag tgttgcctcg cctgtaacca caatggcacc 29821 tctaagtgct agtccacaac cacaggcaac gatgcagccc actagagaga tttcagtgag 29881 ctattttgaa cagggattgg acaaagctgt tggtgctctg agcatcagcc aatcagctaa 29941 atccacagaa gattggaatt tggtggcaat tcagcttgca gatgcgatcg ccctaatgag 30001 aaaagtgcca gttgatagtc ctgatttcac aaacgctcaa gcaaaaatat tagattacca 30061 gcctcatctt aaagatgcca tacaaaaagc cactcgccct gtcaatccgc cacgacaagc 30121 acaaccagaa aggatagggg ttgccattcc ccaggttcct gtgacaccaa ctgtgacacc 30181 agctattacc aaaccaagtt tcaccgaaac acctgccgca ccagcagaaa aactgcaacc 30241 accattgcca aagctgactc cattagcacc gcctaaacaa caagaggtgt tggcaccgcc 30301 aacaatcaaa cagcaagacg aacaacaggt gtatactgtc ggcattaaac ggcgaatggg 30361 tggtacacca attatcgaag tcacctttaa tggcactcaa ccgtttgaga tgattttgga 30421 tacgggagcc agtgggacag tgattaccca gaagatggca aatgctttgg gagtggtgca 30481 agtcggaaag gctaaggcaa acacagcaaa ttctaaagca gtggaatttc ctatcggtta 30541 tgtggattcg atggaagttg ctggggcgaa ggtgaatcac gttgctgtgg cgatcgcagg 
30601 cgcggattta gaaactgggc ttctcggaca tgactttttt ggtgactacg atatcaccat 30661 taaacgtgac gttgtggaat ttcgtcctca attgcgatcg cccattgggc gtctccctcc 30721 taaagcatcg caaaccaatc ccacacaaga aactgaattt actgctccaa atgcgcccaa 30781 ggaacttccc tccgtaacag atccttagcg ttgatgagaa aactttctac ttaagtcact 30841 gttgtcagta acactataag ggttaacggc tagggatact ttgctcaagt cttgcttgac 30901 acctctcaaa tctaaaaaga actcatttca tttgaaaaat ctttgatatt tgatactttt 30961 tatcaaataa aatcctttga ataagtctct cacctgaaaa ttttactttt tcttaaccaa 31021 cttttaatct aaggttaata ttcacattgt ttacttatta ctaattcact ttttgctgca 31081 acatttaagc cctccaaaat ctgataattc cagaaattgg agggttgttt atgctactaa 31141 gtagcgagta aacctttgca atcctatcga agcgatttta gaacagactt tgaaggataa 31201 tcactttctc ttagatacag ccaagtccta cttgctcaaa gaaattgtac ctcaagctaa 31261 cgagatagac ggtgatccaa atcgcttatt tcaagcactt cgaggtcttg gcgaattaga 31321 ccttctagca ctcaaggttc cacgtcactg gggtgggaaa gaagttaatg agcagacata 31381 tggcagcttt caagaactag tagcacggta ttctggtgct ttagcctttt tgcaaactca 31441 acaccagagt gcggctggta tgcttgttgc tagtagcaac tcttcgctac aggaaagata 31501 cctccctcgt atgggcaacg gtcaagtttt actaggtgtt ggcttttccc agttgcgacg 31561 tgaaggcgat cccctgagtg tagctacacc agttgctgga gggtatcaac tcaatggcgt 31621 tgttccttgg gtgacaggat gggaattttt tagcgaattt atcattgctg ccacattacc 31681 agatggtcgt gctgtttttg gtattgtacc cttacttgaa acacatcaag actcaggagg 31741 tacgctgaca ctttcacctc cagcacagct agcggctatg acatcagcta ataccgtaac 31801 cgctaccctg aaaaactggt tcttgcccac agaaagtgtt gttttcatta aacctgctgg 31861 ttggattcac gaaaacgata aaaagaatgt tcttggggct acttttctgg cgacgggatg 31921 cgcccttgct ggtttagata ttgtggagtc cgtcgttagc agaaaacctc taccttttat 31981 ccaaaaagct tttgattctc tgcaacagga attgaataac tgtcgcaacg ctgttcgcga 32041 agcgcaaaat aattctagtt tggaattagt tgaacgtttg caattgcggg cttgggcgat 32101 tgatctggca ggacgaatag ctcatgcagc ggtgactgtt tctagtggtg ctgctatata 32161 tagtcatcat aatgcgcagc gagtgtatcg agaagcgctg gtatttactg tgactggtca 32221 aactcgtgct gttatggaag cgacgttggg gaggttgacg 
cgtcatggtt ttgagaatga 32281 accgcagagg cgcagaggac gcagagaggg aagagaggga agagagggaa gagaagagga 32341 gaaaaatatc acttattcgc gggtgataca tttgagtcat atcattgatt cagatattcc 32401 tcaatgggaa ggtgatccgc cagtggaatt tgaggctgtg gcccaactgt acaaggatgg 32461 ttattatctg cggggttttt ctatggggga acacagcgcg actcatatga atgctcccaa 32521 cagttttcgt cttgatggta tggggattga tgagtattct gctgagtcgt tggttgtccc 32581 tgcggtagtt attgatattc gggaacaggc gttagtgaat ccagattata ctctttgtgt 32641 tgaggatatt ctgacttggg agaaacggta cggtaagatt tcctctggtt gtgtggtgtt 32701 actatacaca ggttggcaag agaaatggtt ggataaaaat gcttttttta accaggacga 32761 gcaagggggt atgcattttc ctggttttgg tagtgatgcg acacggttct tacttgagga 32821 acgtcaaatt gcaggggttg gaattgatac tcatggggta gattctgggc aagataccac 32881 ttttgcgact aatcgtttgg tgttggaaaa accgcgtatt gtgttggaga atttgaccaa 32941 tttggatcag ttaccgccaa caggggcgac tttggcaatt ggtgtgctgc ggttgcgggg 33001 tggttctggt tnnnnnnnnn nggttctccg gctggggtgt tggcgttgat tcgataaaaa 33061 tctttgttgg cttttgataa agataaaact tacgcatttc gaggaagggg acaaacaaag 33121 agattatctc ctgttgtctc ctcctggatt gctttttaag tccattcaac tgcgcgaaag 33181 ttttgtacga atctgctcaa agtttatcct attgtaaggt tatcaagctg agttactatt 33241 ccagcaattt ttaccagtcc tgaaactgga tttaaaattg cgtcaactaa gttcacaaaa 33301 ctagttaata ttgttaaaac tttcttagaa tcttccagat cctggatggc agattgtaac 33361 tgttttgtta ctttttctat tctttctttt ggttgtttca aatcagtgac tatttcatct 33421 atgagtttaa gacctatcag acttttcgta gtagccaggt cagtttctaa gtttcctatt 33481 tttaagaaat catccaaact tatcttattt ctattaagta aattttgcct acgcttgcgg 33541 attgactgaa taagttggct gatctcttca ctacactcat tccaatcact caatttcaaa 33601 gtacctgtta ccattttaca tctcctttta cgtttaaatt gtcagcaacc ataagaccta 33661 taaagtaagt agatttttct tacaaacctt tatccatttt ctgaatcaaa ggttctaatg 33721 ttctttcaga atcggatact attttcctaa ctctttttag ttcttcctga tcgtatgtaa 33781 ctgctttctc tttagtcgct gctttatctg ccttggttgt gtaaagatct tgacaatagt 33841 ttgataactg ggcatcatcg atctgatctt taccttgaaa ctctgttttc agattgctat 33901 gcagtgtggc tattttctga 
agtagttcta cataattttc tgctgcatct tttctacttc 33961 tgatggaatc catagccttg ttatagtttt cttctagcgt gatgaaatct agagtatgag 34021 tttctgaaac ttgtgtcagt ccacctatat aatcagtaaa gtaggttcgg atttgttcct 34081 cttctgctgt caaaattcca ttaatgtaag cctgttgtgt cagtgctatt aagcctcctg 34141 tcgcaggttg gttagaggct gaatctttta cactagatgt tgctccagta taagtctgta 34201 tatctttatc agtacacaaa attgcttttt ttaacttttc gcggcgaaat tcccttgtga 34261 atgcatttgt aagaaacttg gcaatattta tgcctgcgtc aacatcaggg cctttgaaac 34321 taaagacttg tccgttagat tgcgaaaatt ttaaattctt gagtgattcg tcaagagcag 34381 taaagttttt atcgaaagat gcagtatcat cgctagccag cttgcccaag gcttttacat 34441 attctacaag tacagaattt gcctttttaa cgtctgtaac cttctgttta tataaatcgt 34501 cacacctttt ttgttcatct tgtctttgta aaaatgagtg aaacgagtca gaagaacctt 34561 ttgctgccag aaaaggtata tattttgttc ttgttatgca agagttataa atgtcatcag 34621 ctatgttcga tgacgatagc tcaagtgtag tagccatttt tcctacattt tggactgagg 34681 gaaaatcacg accacaactg ttaagtaaaa gagcaagcgg aactagagca aatattgctc 34741 tgcgaggttt aatatgaggt atcatgtaat ttgtttaggt atgattgaga aagaaagtta 34801 agctgatttg tcttccttct acattcttag tagttgcaat ggaaagtgac ttatgcagtg 34861 ttgagagaaa aatagataag atgtcgtttt gtctctagtt catgatcgct gaggtgtgac 34921 ttgactacaa ctgataaatc aagcgaaact aatgcaaatg ttcgagtgca tctcaaatct 34981 ggaggtgtat atcatgacca ctcccctaat ttgccgcaat tacattgcca atcaatgggt 35041 gaatgctgct tctcttgaca ccctagaaag tcgtaaccct gctgacaatc gcgaagttgt 35101 cgctactttc ccgcgttccg ctacagttga tgttgataca gcagtagcag ctgcgcgcaa 35161 agcataccgc agttggcgac ttgtccccgc gccagcgaga gcagaatata tccataaagt 35221 gggagaactt ttacaaaaat ataaagaaga actcgcccaa ctcatgagtc gggaaatggg 35281 taaacctctg acggaagcac ggggagatgt tcaagaaggt attgattgcg ctttttacaa 35341 tgctggtgag ggacggcggc tgtttgggca aacgacacct tcagaaatgc ccaacaagtt 35401 cgcgatgacg gtaagaatgc ccgttggagt ttgcgccctt atcactcctt ggaatttccc 35461 agttgcgatt ccctgctgga aagctatgcc agctttggtg tgtggtaata ctgttatcct 35521 caaacccgct gaagatactc cggcttgtgc aacgaaactg attgaaattt ttgcagaagc 35581 
tggtttccct gaaggtgttg ttaacttggt gcacggggtg ggacaagaag cgggaaaagc 35641 tttagttgaa catcctgatg ttgacttggt atcttttacg ggttcttctg aaacgggtgc 35701 ttttgtcgcc tcgacttgtg gacgcactca caagcgagtt tgtttggaga tgggcggtaa 35761 aaatgctcaa attgtgatgg aagatgcaga tttagaactt gctcttgagg gtgcagtttg 35821 gggcgctttt ggaacaacgg ggcaaaggtg tactgctaca agtcgcctga ttttgcaccg 35881 cgacatcaaa gagaaattta ccgctatgct caaagagcgt actagtaagt tacgcttggg 35941 tgcaggaatt gatcccaaca ttgatatcgg tccaattatt aaccaaaagc aacttcaacg 36001 ggtgaacaag tatctggata ttgctcgtga agaaggtgca aaagtgttga ttggtggaga 36061 aatcgcaact gagggagaat tacaacacgg ttactttttt caaccgacga ttcttgatca 36121 agtcactccc aacatgcgag tcgcttgtga agagatattt ggacctgtgg tggcggtgat 36181 agaggtgagt tcatttgaag aggcgatcgc catcctcaac aacactccct acggtctttc 36241 ttcttcaatc tacacccgcg atgtcaaccg cgctttcgcc gctatgcgcg atatcgaagc 36301 cgggatcact tatatcaatg gtcctacgat tggtgcagaa gttcatctgc cctttggtgg 36361 tgtcaaacaa actggaaacg gacaccggga agctggttct gctgttttgg atgtgtttac 36421 agaatggaag acagtgtacg tggatttctc tggaagtcta cagcgtgctc aaatagataa 36481 tcgtagttag gttattagat ttatggagac aagcgccagc acaattaaac tcgaccagtt 36541 tttaaagttc gtgggtatag caccgactgg tgggcaagct aagctactca ttcaaggagg 36601 cgatgtcaaa gttaataata cagttgaaac ccgacgagga cggaaattag tatcaggcga 36661 caaggtgacg ctgggaggac aaacctttga ggttgatttg gagaatttat aattgacagc 36721 atgccaaata ttcgctcttt atgccagatg cagtgcgagc atcttgctcg cgtactatga 36781 caagcgagca agatgctcgc actactttca caccattgga atgctttctt tataattttt 36841 ctaataccgt ttcacgaaac acttaataca aataatttgt ttctaattct ttccccctgc 36901 tccctgctcc ctactacgca tttgtatcaa ccttaaagta aaacgctata acagatatag 36961 gaaataacat cagtcattaa tcagtcaact tcctgcatga gcgcagattt tcaaaaatac 37021 ctgcaatctg tgtgtaatgc ctaccaacat tggtggaggc tgtacaccat cacggatgtg 37081 cgagggtcta aaccagcaca gtcaaaaagt ttaccgtcgc tgtttgattt tggcttgatg 37141 gtagaaacgg tacagcctca gcagcaggag aaacgcgaaa caccagaaaa aacagaacgc 37201 ctcccagtgc aggaaggatt acgtaaatat tctgaaaatc atgtgctact 
tgtaggtaga 37261 ccaggttcag gcaagtccac agctttagcg cggctattgc tggaagaagc aggaaattct 37321 gtcattgcga gtgaagcgaa gcggaacgta gacgcgatag cggcttcccg cagggtagca 37381 atctccacaa caactcgaat acccatacta gtagaactcc gccaatacaa cacctctgtc 37441 cttgagctta ttcaaaactt cctcttgcgt cacgatccca acttaacctt taacagcgaa 37501 actctaaaaa cttggttgcg acagggacaa tttttgctgc tgctagatgg actcaacgag 37561 ttacccaatg acgatgcacg gcgagactta aaaacctttc ggcagaatta cccgaacaca 37621 ccaatgattt tcaccacgcg gaatttgggc gtgggtggcg atttagggat tgagaaaaag 37681 ctggaaatgc agcccctgac agaagaacag atgcagcaat ttgtccgcgc ctatttgtca 37741 gaacaaggtg aggaaatgct gcggcggtta ggtggaagat tgcgggagtt cgggcaaacg 37801 ccgttgctgc tatggatgct gtgtgagtta tttagaacaa cgggcaatgt tccaccgaat 37861 ttgggtttag tctttcgcta ctttgcccag agttatgacg gcaaaatcaa agatgatgtc 37921 ccagtttctg gtgagtcgcg ccggtggtgg caggagttat tacaacattt ggcgtttgcc 37981 atgatgcagg gagaaacgcg tacagagttg cgagtcacga ttgacaggcg agaagcggaa 38041 gcaattttca cgaagttcct tgaaggtaaa gttgattatc ctcccagtcg ggcgaaagaa 38101 tggttagagg atttgctcaa gcatcattta atccaagttg tgagcaataa ccaaatagag 38161 tttcgtcacc agctgcttca ggaatactac gcggcagaat gtttactcca gcagcttccg 38221 aatattagtg atgagaagtt gaaacgagat tatcttaatt atttgaagtg gacagaacct 38281 ttagcgctga tgttggcgtt ggtggagaag gagaaaaaag agcaggcggt gcaggtggtg 38341 aagttagccc tagatgtgga tttgcagtta ggggcaagat tggcagggga ggtgaaacga 38401 gagtttcagg aaaaaacagt tggtttaatt ttagggttag atgttcctga attactcaag 38461 attaagtgtt tgggtatcac gcgatctgag tgtgcgatcg cttttttaag taaatcattg 38521 cagcatgaaa attttattgt tcgtggaagt gccgcgtatg tattaggaga gattggcaat 38581 caagcagcag tatctgcctt aattaaggct ttgcaggata aagattcctt tgttcgtgca 38641 aataccgcat atttgttagg aaaaatcagt aatacagcag cgatacctgc cttaattgcc 38701 attttgcagg ataaagattc ttttgtccgt tggagaggcg tagatgcgtt aagagaaatc 38761 ggcaatgaag cagcggtatc tgccttaatt aaggctttgc agcatgaaga tccttatgtt 38821 cgtaagagta ccgcagaagc attaggaaaa atcggcaatg aggcggcgat atctgcttta 38881 attaaggctt tgcaggataa agattcctct 
gttcgtgaga atgccgcaga ggcgttagga 38941 aaaatcggca atgaattggc ggtatctgct ttaattaagg ctttgcagga taaattttct 39001 ggcgtttgtt ggagagctgc atatgcgtta gaaaaaattt gcaatgcaac ggcggtatct 39061 gccttaatta aggctttgca tgatgaatat tctcttactc gtgtgagatc tgcaaaagcg 39121 ttaagagaaa ccaggaatga agcggctgta tctgccttaa ttagtgcttt gcaggataaa 39181 tcttcttttg ttcgtggaag tgctgcagaa gcattagcaa aaaaagataa tgaagcggcg 39241 gtatctgcct taattagtgc tttgcaggat gaagattctt atgttcgtgc gaatgccgcg 39301 tatgccttag gagaaatcag taatgaagcg gcggtatctg ccttaactaa ggctttgcag 39361 gatgaagttt atagcgttcg tgaaagtgcc gtgtatgggt tagaaaaaat cggcaatgaa 39421 gcggcggtat ctgccttaat tgctgctttg caggatgaaa attattttgt tcgttggaga 39481 ggcgtacatg cgttaggaga aatcggcaat gaagcggcgg catctgcctt aattgctgct 39541 ttgcaggatg aatattctta tgttcgtggg agtgccgcgt atgccttagg aaaaatcggc 39601 aatgaagcgg cggcatctgc cttaattgct gctttacatg atgaagattt agaagttcgt 39661 gtggatgccg cagatgcgtt aggaaaaatc ggcaatgaag cggcggtatc tgccttaatt 39721 agtactttgc agcataaaga ttctgatgtt cgtggcagtg ctgcgtatag attaggaata 39781 atcggcaatg aagcagcgat atctgcctta attgctgctt tgcaggatga aggttctcat 39841 gttcgtagtc aagccgcaga tgcgttaggt aaaattgctg gctctaaagt tttgtgtcag 39901 atatgggagt tacaattaaa aacaccatca tgggataaaa gcgatgcaat ttcaaaaatt 39961 caagagcgtt gcaagtttta taaccatgaa atttttcatt ctccgccagt tgaggaggaa 40021 actaaaacca aatcagaaac ttctaaatcg tcaacttaca tcattcaacg tgtaggtaac 40081 ctcaacacag gagatgtcaa tatccacggc gatcaaattg gaactcagca caaccaacta 40141 aataacaaag attaaaacta tgtcagaaac ttctaaatat caaaatagtt ttcatttcaa 40201 tcaagcccct ggcaacgtca acacaggaga tgttaccatc caacgcgatc aagttggcat 40261 tcaacacaac tacgcgccag aacaaaaaca aaatctggcg gaaacagcca aagaaattca 40321 gcaattacta aatcaactgt ctgtgagtta tccaacaacc actgaatctc aaaagcaggc 40381 gatcgccaat caagcaatag cccgtatcaa acacgataac ccaaccacct ggcaacgttt 40441 acgcagtgct accgaggcag cattaattga ggcgtttaag gaagtgcttg ataatccgtt 40501 tgttaatgtc acggttgctg ctgttgaagg ttataggaaa gctgagtaag gtttgattga 40561 gtgtttgaga 
ctcgcggacg agtgtttgag actcgtgagt gagtattaat tcttgcctaa 40621 aggttccacg atcaccctct tctccacttc atccaccatc acccgcgcca acccaaattt 40681 ctcaatcacc caagcattcg tcgtcaaatg ctgactcacc tcagccactc ggtactcact 40741 ttttgttgac gccaaagctg caggcaataa taactgatcc cccaaatgtt catccacggg 40801 tgcgccagtt tgatgaaact tcaacagttc ctcacaaacc atatctgcta ctttttcggc 40861 agctaaaccg agacgcccta acgcaccaaa tccagctaaa gtattttcgt actcagctgt 40921 aagaaacaaa cccgctcctg gtgcgacacc ttttgctcgc aaagcttgta cactgggttt 40981 caattgggct tcacgcaaca cattttcagc acgactcgcg attctttggg gaatgtgaga 41041 aggtagttcc gtgacaactg ctaacccccg cacctgttgc aagtcgccgc gttccaccaa 41101 gttgatcccg ccaagtttgc taccaccact cacaagtagt tccacttctc ccccgccttg 41161 gggataccac ccccaagcgt tcagtttcac ttcaacttgc acacccatac gttgcaatgt 41221 tggtaggtaa acttgttgaa tgtaattcac ggaaggacta taactgacat gagttccccc 41281 cttgagtgtc acctgagagt caccacttgc aagcactaaa ggtaagagaa tcgtttgtaa 41341 caccagagtg actgcaccag cagaaccgcc ttcccgcgct tcactcacat caaaaatata 41401 acttcccgct agcacggaat taccaggaac aaactctagc gtcatcgaac ccaaagcatc 41461 accccgcact ttggcgttgc aaattgctgc agccgcgcga actgatgtga ggtgttgtgc 41521 tgcaagtccc ggttttgaac gtccggcgcg gatgttctct atctgtattg agttaccagt 41581 aatagcagcg agactcaggg aggttcggag aatttgcccg ccgccttcac cgtaggaacc 41641 gtcaatgtga atcatgaggg atttttaacc gccaagacgc caaggacgcc aagataagag 41701 agaagagaga attttacgaa tcatttagga ttgctatagt tttatttcat ctgcgtttat 41761 ccgcgtccat ctgcggttta aattttaaac tccaacactt tgcttttacc aggattcatc 41821 aacccatagg ggtcaaccat ttccttaaac ttcaattgtt cagggtcaat cacctttcta 41881 ccgctatctt caataatata cgtgtgggga ttggcaataa acacaccttg ctcttcgtgg 41941 tagtgaataa tttcgttgag acgttcctca ctcgtgtagc ggacgagttg caaacccaca 42001 ggaactactg taccgttaac ccgagtaaat tccaaatgca tcatcacttc atcgccgaag 42061 tggtgataca tatgctctac catttgtaga tttttatccg ccgggaacgt actttgcagg 42121 tatgtaatag aagtatcttc actccgggcg tgtaaggtgg tatgattcca ggtaaattct 42181 cctaaatgtg tgccttttcc ggcatcagta gctgtttttt gatatgttat cttgccacca 
42241 tattgttcta ctaacccagg caataattcc aaacttggtt cagcaagaat caataaagca 42301 gcgtgagttt catctggaat gtactcccga aaagctgcaa aataatctgg aattggggat 42361 gcaaaaatgc taatctcctt tttaatcatt ccatcagcat cgcctaaagc ttgaccaaac 42421 cttgccgcag ccataaattc tgggaaggtc acaatcactt ctgcccaagg ataagctggt 42481 cctaatggaa tttctatttc agtgataatg ccattaatgc cccaagcatg cttcaccttt 42541 tgcacatcgt cgccgcgcag ttcaatgaca cgtggttcat cttccattgt tacgatttgt 42601 aaagccagaa gattacctct gtcagccaaa aagccatact gtattgaccc tattccgccg 42661 ctaccacccg caataaatcc acccatagtc gcggtacggt aggtcgatgg tgtcatccgc 42721 atttcccagc cgatttctcg tgctttttta tccaaagccg ctaacttgac gccagcttca 42781 actcgcgcga ttcctggttt cacccagcgt atctcgtgca tccgcgtcat atccagaatg 42841 acgcctccat gtaacggtac gcattgcccg taatttcctg ttcctgcacc gcgtactgtt 42901 acggggactc tgtgtttcac gcaagttgct gctactttta agacttctgc ttcgttggca 42961 ggacgcacga ctatatcccc aacttttcct tccagtttgg gtacgagtac tgggctaaag 43021 gtatggtaat cttgagataa ttttgctact tgactcggtt cagtgataat ttcaatacct 43081 tctaaagaag cggtgagggc atctaaatca aatgatttta cgggtgtggt tgtcataatt 43141 ataacttttt tgaaatatgg aacgcataat aattcaatta aactacctca cccaatattg 43201 aatatctccc ctatgttaga gtaagcgatt agacattggg gatgtttgaa agagatttag 43261 cgcaccaccc gctatgaatg aaagaaacac catctccctt cttaagagtt aagcgttaag 43321 agttaagcgt tccctgttcc ctgttccctg ttccctgttc cct // LOCUS NODE_541_length_43285_cov_5.45244043285 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 43285) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 43285) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..43285
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(110..1093)
                     /locus_tag="DP116_04000"
     CDS             complement(110..1093)
                     /locus_tag="DP116_04000"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007354063.1"
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_04000" /translation="MKTRQLGRELVVSELGLGCMGMSEFYAGRDEQESIATIHRALEL GVTLLDTADIYGPFTNEQLVGKVIRDRRDRVVLATKFGNVRGTDGQFLGVSGKPEYVH QACDASLQRLGVDVIDLYYQHRVDPTVPIEETIGAMAELVQQGKVRYLGMSEAAPATI RRAHAVHPITALQTEYSLWSRDPEDEILPTVRELGIGYVAYSPLGRGFLSGQFTSPED FAEDDYRRNSPRFQGENFYKNLQLVEQVKAIAKEKGVTPSQLALAWLLAQGDDIVPIP GTKRRSYLEENIGATDITLTSQELSRIEAVAPKGIAAGSRYAQQQMKALNH" gene 1284..2201 /locus_tag="DP116_04005" CDS 1284..2201 /locus_tag="DP116_04005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015178282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator CmrA" /protein_id="PRJNA477356:DP116_04005" /translation="MTIETLSHQTAITKCEELAALVARHTDSKGNGFHTTAIDPLAFT RECDTSKAIHSVSEPILGIVVQGKKEVLLNDESYWYGVAQYLVVSVDLPLSGCAIKAT PDQPYLGFKLKLDSAQLCDIIAQTNPDIGQKESVRGWFISDADPSLIDCAIRLTKLLD TPQDIPFLAPIIIREIYYRLLMGEQSEAVRQIATSGSNMQRIALVIKRLKSDFAKPLR VEDLAEQANMSPSSFHRHFKAVTSMSPLQYQKQLRLLTARQIMLAENADATQAAYQVG YESTSQFSREYARMFGAPPIRNIQRLRTA" gene complement(2355..3551) /locus_tag="DP116_04010" CDS complement(2355..3551) /locus_tag="DP116_04010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_04010" /translation="MDSSHAVSSAVAKLYNTYPFPPDPLLDEPPPGYNWRWNWNTAYN FCTGHKPNRENIRILDAGCGTGAGTEYIVHLNPHAQVVAIDLSAGAIAVARERCQRSG ADGVEFHNLSLYDVEQVSGEFDYINCVGVLHHLPDPIKGIQSLAGKLAPGGLMHIFVY AELGRWEISLMQKAIALLQGDRRGDYEDGVKVGREIFANLPENNRLLKREKERWSLEN HRDGYFADMYVHPQEIDYRIDTLFELIDASGLEFIGFSNPLYWQLERLIGQQPDLMER AAHLSERQRYRLTELLDPEISHYEFFLGRPPISRMDWSDDATVLNSIPERSPCMEGWP SQVILDYNYQVAKLSEAEYAFLEACDHAQAPETVETLIQNTGMDLEGVRSLQKRQLIL LTPNPQ" gene complement(3598..4179) /locus_tag="DP116_04015" CDS complement(3598..4179) /locus_tag="DP116_04015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017716832.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycobilisome protein" /protein_id="PRJNA477356:DP116_04015" /translation="MKPELSEAVKELIQKARIVSFSSWETTHPRAIIPLFQAADDQGR YLTDEDLQQIQTLSPQTSGFIPVARLLRDRVTEIVDEARVQVLITFPDITQPGGGLYP PGRAEACWRDFWHFLRCITYGIAGQSTDYTSPAGLNYMNMLYQELQVPLDAMVVGLEN IKIASLKRIDSEQQAALAPYFDHLITQLKSFKP" gene complement(4273..4365) /locus_tag="DP116_04020" CDS complement(4273..4365) /locus_tag="DP116_04020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878424.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04020" /translation="MKSQWDCVLQKLGEWHGSFTRVSPQGKLME" gene complement(4387..5307) /locus_tag="DP116_04025" CDS complement(4387..5307) /locus_tag="DP116_04025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015143150.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_04025" /translation="MSDLLNSETKSGGNLIPPSQEQTDALLEAVKEQMDLDTFNPNDH ELLKNLIESLGDSRGLVRMRVAETLGQIGELATPFLLEALAHHPNVVVRRASAKTLTI IADPKAVPILVNSFLNDEDTVVQGSSVGALARTGEAAVPALLEILADSNSSENTKGHA AWALAFIGAQAKEYIYKEIDSSSPDVRAAVVGAIGKILQENPEEEAFQILVNALGDPD TSVRCEAAAVLGNLTYRPAVANLISLLHHPDWESRKAAALALMKIGDRTALEPLQAAL GQESETGVQPVIKLAISQIDKQSEQDDDWE" assembly_gap 5360..5369 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(5439..5654) /locus_tag="DP116_04030" CDS complement(5439..5654) /locus_tag="DP116_04030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015191532.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04030" /translation="MTETSEHTPTEQELSEVIAEFEEYRERLISQTLATAQKAKVMKA TALAQLEPELAKIDAVLKELRDRETTS" gene 5773..6435 /locus_tag="DP116_04035" CDS 5773..6435 /locus_tag="DP116_04035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407511.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_04035" /translation="MDIHEIKTALNNSDFQYRLKAIAALKDYTPEIAVPLLTSKLRDS EFLVRSFVAMGLGKQQTAESFAALLELMKFDNTPNVRAEAASSLSLFGRVAASHLVLT FFRDDHWLVRRSILAVLLDLECPEELFEVCIQALAGEDSSVQADAIDALGTFALTSLE EPALSQLLALVGSESKQIRIHVVHALKRFDHPQAKEALRQLRQDPDHQVVAAALENLL QQ" gene 6704..6904 /locus_tag="DP116_04040" CDS 6704..6904 /locus_tag="DP116_04040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494183.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04040" /translation="MSDTLPGQMTWKLPGTTDCVLHLRHSSSEPWRYYKEFPEYVLPD PPHFSEGYATFVALLKKKWQTV" gene complement(6999..8564) /locus_tag="DP116_04045" CDS complement(6999..8564) /locus_tag="DP116_04045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_04045" /translation="MNFSFAQDVSVYQQLASGMQALPQLLPHSPATLLSQLKSQIDLL IEQQIAATLWVKLPPGKIWHSEIQRYQQQLTMSGVIRTLHIQESKTGVQEQQTPPSSR QGRAAGTQQGNSPVLTQTPRQSSTPVQETRDNSWHGNASPTSSSSESFLVNSMPNSQL QREYFVMVLSSQYCCLILAHRPFRTRKNNLAKANRKKTAPLLAVTIFEGEIIQKVLNV MKSAITPQSSLLMPADFICPSAPDPRLLSQLFAKQLQQQDEIYRQRTIKRLANMQQKN QKLQENLQLKDEYLSNVCQELRAPLTHMKTALSLLNSPNLKMIQRQRYFQMLKQECDR QNALITGVFDMVQLERNLPRMPLEVVRLSEVVPGVVSIYQPLAREKGIMLGYTVPTEL PAVWCVSGGLRQIVINLLFNSIKFTPKGGQVWVQGRVQGDYVQLEFRDTGIGIADSEI PKIFERFYRLRPTTTEDSGGVGLGLPIVQLLLSRCSGCISVKSKLNEGSIFTVQLAIA YGRAALNAPGDAS" gene complement(8739..9881) /locus_tag="DP116_04050" CDS complement(8739..9881) /locus_tag="DP116_04050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869338.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="BMP family ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_04050" /translation="MQRRKFLKYTTLAGTSFTFAGCTQGSKSQPQAEVSVQASPMAAN EPLKVGFVYVEPVNDFGWTYAHDLGRREMEANLQEKVKTTFVENVSEGADAQRVIRQL ALEGNKLIFATFLGYMNATLKVAKEFPNVHFEQCSGDKRATNVGTYLGRFEEARYLTG MIAGKITKSNVIGFVGSYPIPEVIRGIGAFTQGLRQTNPKAKVRVVWVQSWYDPAKER EAAQALVNLGADVLAQHTNSPAPIQLAEEKGIYAFAYNTDMSRFGAKACLTSALNKWG KFYTDKALAVINGTWKPEEVWYGIAQEMVDISPLNPVIPQDVQQLVQALRDEFIRGVA HPFDGPVKDQKGVVRVPKGQVLGDKEQRAMDWYVEGIEGLIPKVQS" gene 10091..11605 /locus_tag="DP116_04055" CDS 10091..11605 /locus_tag="DP116_04055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AMP-dependent synthetase" /protein_id="PRJNA477356:DP116_04055" /translation="MNIFELLAGEDNHKALVTPEGSALTYKQLRENIVELVSQLNSFG LQRGERIAIAMSNGVPMVLTFLAAALCGTAAPLNPKYKQEEFAFYYEDTQAKALITLP DVPEAAIAAATPDMIHIHAKVTENGTLSFELVKTASGEGESLGNQEFPDSDDVAMILH TSGTTSRPKRVPIRHRNLIASANNIVDAYSLSAVDKTLCFMPLFHIHGLVGCMLATLA SGGTLVIPDGFNALSFWKLVETYKPTWYSAAPTMHQTILARASRNEAIIKANRFRFIR SSSAPLPPVIIEQMEATLNAPVLESYSMTEAAHLMATNPLPPKVRKPGMVGYGFGVEV GIMDEDGNLLSQGSLGEVVVKGPNVIDGYENNPQANATTFVNGWFRTGDQGTLDADGY LRLTGRIKELINRGGEKISPLEVDDILLRHPAVAEALAFAVPHKSLGEDIHAAVVLKG ETSEKELLAYCATILADFKLPNQIHILDELPRGATGKLQRLNMAKLLKIETEHR" gene 11847..12824 /locus_tag="DP116_04060" CDS 11847..12824 /locus_tag="DP116_04060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412536.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_04060" /translation="MKICIVGAGAIGGYLGAKLALSGHEVTLIARGAHLEAIQKNGLK LIMADGSTHTATPMATSDMSQAEPQDVVILAVKTTSVAAIASHLPCLYKPETMVVTAQ NGIPWWYFRKHGGEYEGTAIQSVDPDGTIEAHINVERVIGCVVYPATEIVEPGVIKHI EGDRFSLGEIDASKTERIQLLAQALKSAGLKAPIRTQIRTDMWVKLWGNLAFNPISAL TRATLDHICQYPLTRELARQMMSEAQAIAQKLGIDFGITLEQRIEGAQKVGAHKTSML QDVEACRPTEVDAIVGAVAELGRLTQTPTPHIDAIYASVKLLEKTYMGN" gene complement(13194..14471) /locus_tag="DP116_04065" CDS complement(13194..14471) /locus_tag="DP116_04065" /inference="COORDINATES: protein motif:HMM:PF12770.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04065" /translation="MENFDLKIIPVNDHVAILAESNFGEATIQVAKKFFDEMADTSND LRLLYSRLLRRSISRKSIESEVEKFGISLFNEIFKGEILSLLNRTIGGATDSQINLRL MLGQPMLNTIHWEVMRFRNEYIGFRHNLIRHPFVPKPVNIPGERREKLHILIVSVDPF SQRGKDVLDKEHETLVNMLKGFGEQIRISELRDERATVENIKDILFEGVDIFHFGGHG FLDSRNPMESSLIVWRSESSNNWNIPGREFGNLSIRLLTTLAANQSLGFCFLNACDTA RSVETDTVEGMSSNNVDDILRGGANDFVNMAHNLIQAGVPIVLATNHAITFEAGYQLS KRFYTSVLKYGRRVDQAVKDARAELYIDANIQDAGSGFDTVAFGDWSCPVLYARSRQM EFGTKNLRWEPTLDIYTIRNVEKPPLALVGHNF" gene complement(14495..15751) /locus_tag="DP116_04070" CDS complement(14495..15751) /locus_tag="DP116_04070" /inference="COORDINATES: protein motif:HMM:PF12770.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04070" /translation="MNLINSELSLQFFYIDKQILVVCTGSSGEAYSFIKESDVKALIK RSILVHQHIDKFFPKDSILFESNHNSIEQGEAIKDASSPVNLDTTGLGNRVLNSKQER EKLSSNLKQLGTLLFKNVFNHQILDLLNVAIGQAIRLKNDVLIRLMIASDFLNLVPWE LFYNEDRYLCHVYDIVRHPFTLQPVRKPISSSENIRLLFIGANPSHDIYVQGQIEAVQ AALEESSIQFEKLPDSTYKSIANSIYDGITILHFLAHGECEYDGNHLKYYFRIDSEDE NKPYDKLPIEMLESFCRANPMQVAVLNACRSDQAIMYSKSKSGKKLTNRLTESGYYSM AHALIKTGIPCVIGMSHPISKIGAEILTRRLYRTLTSKGGSIYKAVRQIRLELFAHTD FLPPSDWLTPVLYLRNSNYNGLIQTQ" gene complement(15763..16131) /locus_tag="DP116_04075" CDS complement(15763..16131) /locus_tag="DP116_04075" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04075" /translation="MQLRLRSFSTREEQNIDRLLNLDTNLLYVEFGQQIGSLGPANDI TSQVKQWIVDRREDIYQKICIEGNYCLFIKQNKNASRIAIIIAIGDLLATVFSLIPVN TLAVLLTREFLDDFCNCIDN" gene complement(16526..17491) /locus_tag="DP116_04080" CDS complement(16526..17491) /locus_tag="DP116_04080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878507.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acetamidase" /protein_id="PRJNA477356:DP116_04080" /translation="MTHHILKATKETVHLGGFSHLLEPALIVDSGDTVEIETYTGYYV HDKAPPEFLTPAFLDICQNLPSERKVAAGPHLLTGPIYVQGAEPGDVLEVKLEAITPS LPVGFNAIRTGWGALPHQFHQPALRFIPLDLEHNITEFPIDSGIKIPLKPFFGILGVA TTEASRNSIPPGNYGGNIDNQELQAGTKIFLPIFVLGALFSIGDGHSVQGDGEVNVTA IETSMNGTIQLKVRKDLQLTMPIAETPTDIITMGFGQTLDEALELALKNMIDLLERFI NLSAEDAYVLCSLAVNFRITQVVNSPQKGVHGMLSKSILPKAIEL" gene 18046..18543 /locus_tag="DP116_04085" CDS 18046..18543 /locus_tag="DP116_04085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869341.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04085" /translation="MRSTGLAYLLWFTCIFGLAGTHRFYSGKYVSGIVWLFTFGLFGL GQLVDLALIPGMVEDQNLKYRMLHGSPNSNNISNTQQVVINVADYIAPNANTNKPLST KSDLQLILELAKNNGGNISVTDCVIATGKPIVEVKQTIESLCAEGLLEAANHQETGAI IYKLI" gene complement(18751..19428) /locus_tag="DP116_04090" CDS complement(18751..19428) /locus_tag="DP116_04090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AIM24 family protein" /protein_id="PRJNA477356:DP116_04090" /translation="MATFEVIAQEGLRLVKVILQDETVRTESGALYYMRGKITMQSKA PTAGSFLKSLATGENIFRPTYTGTGELYLEPSLSGFHIMELNGSEWILDRGAYWASDG SVEVSVERNKLFSGLIGGEGLFQTKVKGRGKVVMVAQGPVEEVHLQNDRLVVDGNFAI ARTNTLNYRVEKATKSIFGSMTSGEFLVNTFEGTGTVLLAPVPYWGVMLLRQINSARP TNTSGSE" gene complement(19735..20556) /locus_tag="DP116_04095" CDS complement(19735..20556) /locus_tag="DP116_04095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019498981.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_04095" /translation="MTAKNSNISLSSNIAALKAGEALNHEDLYQSDLRGLDLQRANLR QANLIGANLSGTMLCEADLSGADLRKADLQGADLSNANLQGAFLHRAYLQKANFSGAN LEGAKLQAARYDQHTIWPEGFTYKTCGAIGPGANLSGAHLNTANLRDADLRNANLLGA YLCGADLTGANLQNARLSGADLRLAYLTGADLRNARLNNVDLQGADLRASNFSGVEIE YLQSIAGADFTLVQGLSEAIRAILLRRPASELDAWNSFTRRTTRQSLEVGDSANK" gene 20862..21452 /locus_tag="DP116_04100" CDS 20862..21452 /locus_tag="DP116_04100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015196721.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycobiliprotein lyase" /protein_id="PRJNA477356:DP116_04100" /translation="MRLIPPMNMMDFFRKSQGTWFTQRTVHHFDLAADQSGESNLIVQ VVEREDPRVKAICEQQGIDPAKGMGGGSFMWQENQDNREPNPDQAAVLVDVPDDESLL SGKLIRDRGYVEKMPVISRYWFGKDGILTIDTEYDINQGQERCWFITDDFRVRVSTVR TMNGVYMMTYGSERRCVSEATLEQLIQKNLARASGN" gene 21583..22314 /locus_tag="DP116_04105" CDS 21583..22314 /locus_tag="DP116_04105" /EC_number="1.3.7.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015184551.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="15,16-dihydrobiliverdin:ferredoxin oxidoreductase" /protein_id="PRJNA477356:DP116_04105" /translation="MYKPFVKHLENELFNRFNLQNRAIPTGLELKVSDRGRNPATIQS WCYQCHQLRKIRYTYIDAGESAQILNSVIYPSYDYDLPLLGVDFLSFGKIKNLVVLDF QPLFQDEAYQRKYIEPLKFLHAKYPDLAQNLEMKFYDANQYFSKYLLFAKTDPETVAT RLLAAFKDYLNLYWQMLDEAEPQKDPEDIARIVKAQKDYDQYSADRDPASGLFSSYFG HEWAERFLYGFLFEDAVPLAATAKK" gene 22405..23157 /locus_tag="DP116_04110" CDS 22405..23157 /locus_tag="DP116_04110" /EC_number="1.3.7.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161139.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phycoerythrobilin:ferredoxin oxidoreductase" /protein_id="PRJNA477356:DP116_04110" /translation="MTLYQPFLDYAIAYLQSRLELMPYPIPSGFEYKSAITGKGKNQE EVVTTSHGFCAPKLRQIRAAHVQGGQSLQVLNFVIFPHLNYDLPFFGADLVTLPGGHL IALDMQPLFRDDPDYQAKYTAPILPIFHTHQQHLPWGGDFPEEASPFFSPAFLWTRPK ETTVVETHVFAAFKDYLKAYLDFVEQAEPVMDAQALARIKQAQLRYLRYRAEKDPARG MFRRFYGEEWTEEYIHGFLFDLERKLTESVVS" gene complement(23297..23452) /locus_tag="DP116_04115" CDS complement(23297..23452) /locus_tag="DP116_04115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186600.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribbon-helix-helix protein, CopG family" /protein_id="PRJNA477356:DP116_04115" /translation="MATKKSRLNVTLNDIEREKLEKLAEESGLSLSRTIAQLIRKAKL KPNDKES" gene 23876..24394 /locus_tag="DP116_04120" CDS 23876..24394 /locus_tag="DP116_04120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017295398.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycocyanin subunit beta" /protein_id="PRJNA477356:DP116_04120" /translation="MLDAFAKVVSQADARGEYLSAGQLDALSAMVADGNKRMDTVNRI TSNSSAIVADAARSLFAEQPQLIAPGGNAYTNRRMAACLRDMEIILRYVTYAIFSGDA SVLDDRCLNGLKETYLALGTPGASVAVGVQKMKEAALKIANDTNGITRGDCSALMSEV AGYFDRAASAVA" gene 24526..25014 /gene="cpcA" /locus_tag="DP116_04125" CDS 24526..25014 /gene="cpcA" /locus_tag="DP116_04125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018398923.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phycocyanin subunit alpha" /protein_id="PRJNA477356:DP116_04125" /translation="MKTPLTEAVSAADSQGRFLSSTEVQVAFGRFRQASASLEAAKAL TSKAQSLAEGAANAVYQKYPYTTQMQGPQYAADSRGKAKCVRDIGYYLRMVTYCLVVG GTGPMDDYLIAGLAEINKTFDLSPSWYVEALKYIKANHGLSGDPAVEANSYLDYAINC LS" gene 25320..26135 /locus_tag="DP116_04130" CDS 25320..26135 /locus_tag="DP116_04130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006631255.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_04130" /translation="MAVFAGSERLGITPSEASAWIELHSNDSAEDVEVVIRAVYRQVL GNSYVMESERLVVPESQLKRGEISVREFVRLVAKSDLYRERFFDNCYRYRTIELNFKH LLGRAPNDYSEMVCHSQILDERGFEADIDSYIDSDEYQENFGENIVPYHRGFTTQVGQ KNVGFSRMFQLFRGYSSSDRAQKQNQGRLTREVAQNTASPIYAASSGSLTGISTGSRG GSTYRLRIMQPPSSKSAVLRRATSEVVVPFEQLSSKLQQLNSKGFKVMSITLS" gene 26224..27075 /locus_tag="DP116_04135" CDS 26224..27075 /locus_tag="DP116_04135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006631254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_04135" /translation="MAITTEASRLGTSAFSDFSPVELRSPGDVQNVIAAVYRQLLGND YLMASQRLTSMESLLTNGKITVQEFVRQVAKSELYKSKFFYNSFQTRTIELNYKHLLG RAPYDEAEIIHHLDLYQNKGYDADIDSYIDSPEYQGNFGEYIVPYYRGFATQTGQKTV GFSRMFQLYRGYANSDRAQFAGNSPHLATELGRNTASAVVAPGSPGFGYRPSAKGVTP NTAFGGSTIYGDRRLYRVEVSALLTPKYPRVRRSNKAVVIPYDQLSDYMQRVQREGGK IASITPL" gene 27204..28031 /locus_tag="DP116_04140" CDS 27204..28031 /locus_tag="DP116_04140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006103241.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_04140" /translation="MGLDTFDPNESSSSESLTVEQAIANLQGEDLGLRMYAAWWLGRF RVQEPAAISTLIQALEDEADRTEEGGYPLRRNAARALGKLGDRQAVLPLIQSLNCSDF YVREAAAQSLEMLGDPVCIPNLIELLKAGLQGRQLVSNQPDFSQPYDAILEALGTLGA TIAVPLIQPFLEHPIERIQYAAARAMYQLTQDMIYGERLVQALAGKDLQLRRSALADL GAIGYLPAAEAISQTLAENSLKLIALKGLLEHQIYMSPSNELSEGAVRVMTLMDGLL" gene complement(28060..28401) /locus_tag="DP116_04145" CDS complement(28060..28401) /locus_tag="DP116_04145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002752537.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" /protein_id="PRJNA477356:DP116_04145" /translation="MKNPDRGEVWLVDLGYAAKVRPCLVISIPALEQDRALVTLVPHT TSPRGSRFEVEVKVNFLRSGVFDVQNIITIPHAKLLRKLGSLTPKQMAQVEEVLLLWL GFNEASTENED" gene complement(28388..28585) /locus_tag="DP116_04150" CDS complement(28388..28585) /locus_tag="DP116_04150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874064.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04150" /translation="MTPVVKEFLSTFDRLPDSERLEIALEILKRVIHVDFPPLSDEDL VLNAEAIFLELDKQEAAHEKS" gene 28743..29555 /locus_tag="DP116_04155" CDS 28743..29555 /locus_tag="DP116_04155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015160736.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_04155" /translation="MTANLAQTLIRAVEEADSSARLLEAVEQLSAARLEAAIPTLIAA LGYNNPGAAVAAVEGLIQIGKPAVTPLMELLDGYNYGARAWAIRALAGIGDPRGLETL LDAAKNDFSFSVRRAAARGLGTIAWEDLPPEQLKSAQTQVLETLLQVSQDHEWVVRYA AVTALQKLAIAVAISHTDWAMEIITHFDRQVESEDNLTVIARIWLAQREIQEYAVEVL DKATAATVSTLDIDWQATLEKLYARKRQEQPLPEGDPRKFREVAAAIARANA" gene complement(29735..30025) /locus_tag="DP116_04160" CDS complement(29735..30025) /locus_tag="DP116_04160" /inference="COORDINATES: protein motif:HMM:PF00400.30" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04160" /translation="MVTIGQELPSIIHKVFLSNFSGKSTVLSHDTSKVRTHINFNADG SLIATASYANVVLWDLQGRQLMEFKAYEDEIKSISFSANGSKLAVSCHWQGR" gene complement(30330..32151) /locus_tag="DP116_04165" /pseudo CDS complement(30330..32151) /locus_tag="DP116_04165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137448.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" assembly_gap 31713..31722 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(32217..32567) /locus_tag="DP116_04170" CDS complement(32217..32567) /locus_tag="DP116_04170" /inference="COORDINATES: protein motif:HMM:PF13471.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lasso peptide biosynthesis B2 protein" /protein_id="PRJNA477356:DP116_04170" /translation="MSYLYALLIGIDCYLPNRLPDGASYKSLEGCVRDINHVEAFLKR QFNLPSKQIYKLTASNVDGSNVLFALTKGLSSTWCVGVATEPIKAHAWVEIGGKPFRE VNNFQHHFRKLLAA" gene complement(32902..33840) /locus_tag="DP116_04175" CDS complement(32902..33840) /locus_tag="DP116_04175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310603.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RluA family pseudouridine synthase" /protein_id="PRJNA477356:DP116_04175" /translation="MNEFYLEVKQNSDRLDRYLSQELPNLSRSRIQQLIEQNNVQLND NVCTSKKTAVKTGDRISIKIPEPEPLELQPENIPLDILYEDDSLLIINKSAGLVVHPA PGHQDGTLVNAILAHCPNLPGIGGVQRPGIVHRLDKDTTGAIAIAKTEHAHQHLQGQL KAKTARREYLGIIYGVPKTESGTIDLPIGRHPVDRKKMAVVPVEQGGRTAVTHWKVKE RFGNYTLMHFQLETGRTHQIRVHSTHIGHPIVGDPVYSSGRSVGVNLSGQALHAWRLR LEHPVSGEWIEVTAPPPQEFTTLLEVLRRRFSMSQF" gene complement(33843..34553) /locus_tag="DP116_04180" CDS complement(33843..34553) /locus_tag="DP116_04180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200129.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_04180" /translation="MLTQLSSYNQRDIIYPESDGQPMADNTKQFELIVLIKKNLDLLF DNHPNVFVAGDLLWYPIEGNNIIRRAPDVMVVFGRPKGNRGSYLQWREDNIPPQVVFE ILSPSNSAKEMISLYKFYERYGVEEYYLYDPDTGELTGWLRSGDELAEIEQMIGWVSP RLSIRFEMSDGELQIYRPDGQRFLTYLELAQKQEQAEARAEQAEARAEKAEARAEKAE AELQALRALLQERGVNPN" gene 34680..35876 /locus_tag="DP116_04185" CDS 34680..35876 /locus_tag="DP116_04185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119980.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sigma-70 family RNA polymerase sigma factor" /protein_id="PRJNA477356:DP116_04185" /translation="MRPRQEITDMFSTFMQLEGDRFSKWLTDTKLYRNIQNHLGRSSE ALKSENFWALYWHKHWRSRSNNLARMHLSAYLQEPCYWAASKTVAKFTNSQYSLADYF QMAIAEVEIILKDFNPEKCSSLKAYAIMAIPSRLRDILRQRKEANLCTNWALLRKVSK KLLSEALSEAGLSQSAIAQYRLAWTCFKELYVQNQPGGSSKLPEPNRQLWEAITNLYN HQRQSQLTQPTEQRNAQTIEQWLNQTALYVRAYLFPPVKSLNAFKQDDDTTVTLDLPD PSSDSPMADMIAAEDVQNRQNQISQMFSVLLKALQNLDLQTQEVLKLYYQQGLTQQQI MQQLQMSQPTVSRRLVKGRESILAALIKWSQDLNISINSNQIKDMSLALEEWLRNQYG EYNINP" gene 35896..37080 /locus_tag="DP116_04190" CDS 35896..37080 /locus_tag="DP116_04190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741640.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04190" /translation="MTCAFADPREWLLEISPTIQAQSWQQSQIFATPSSRWCAYINQI CLHAFLDWISTEDFPQASVWYTSPGTPAFWEFVNGTAILLEGRRVVLIPSEAIDDSEL EVPQEWVDIPSWAADYYLAVQVQPDAEWVRIWGYTTHTELKSLAHYDSVDRTYCIDAR HLTKDLNAFSLSYQFCGEEQMKAAIAPLPQLSTQQAENLVLRLGNSYVTFSRLGVPFA TWGALLENEQWRQRLYQQRQQSQSSQVQVNLSRWLEGIYDNTWEAIETFFQLNSSSLA FNFRSSSGLNVSSIKRAKLIDLGMEIESQKVVLLVALIPEDSQQVSIRVQLHPTGAES YLPVNIKLALLLESGEIIQEVQARVQDNYIQLKRFDGEVGECFSIQVAFDSYQITENF VI" gene 37302..39734 /locus_tag="DP116_04195" CDS 37302..39734 /locus_tag="DP116_04195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Chase2 sensor protein" /protein_id="PRJNA477356:DP116_04195" /translation="MSKLVVFSLLGGDLNQGFPVVTAQLWHNNQFFKITGSLPAIPEL SELYKRWQLLYEAVHQRLGSNQRIKVHSQDITNISVNDFEQVCQQLQANINAWLKSES FQNIERQLRTLLSRDDEIRVIIETNIALLHRLPWHLWDFFEDYPKAELALSNHEYASP QVVRKSPTDQVNILAILGNSFGIDIEKDRSLLQGLTDTQTTFLIEPTRKELDEQLWNQ DWDILFFAGHSASIADGEIGEIYINQSESLTISQFKNALKAAITRGLKLAIFNSCDGI SLARNLADLNIPQIIVMREGVPDLVAQEFLKNFLVAFAGGKPFYLAVREARERLQGLE NDFPCASWLPVICQNPTTVPVTWQELRSGWGDTETGSEVSTKSRISTRRCKLWTVLLS TVIVTLSTIGLRYFGLFEKLELQIFDQMLLLRPKEELDPRLLVVEITEKDIQSPQEMI TGVKSISDSTLAKLLNKLQKHQPRVIGLDIYRDFADPLRLRSGQAPNKSKPIQLPTEL SKENVVVVCKGRDRKHDPQGVKPPLGVPEERQGFTDAIQDPDGIVRRQILMMAQEPSS PCTTPHSLSLQLAARYLSYENIKPDFNEDYVQFGSKVFKRLKPGRSGGYQQTVVGGIQ ILVNYRDVDYERVSLEDVLSDKVNSDWLKDKVVLIGVTANTVSDTWSTPYSAAQQNYQ KIPGVFIQAQMVSQILSAVLDKRPILGVLPFWCDILWIWGWSSVSGLIVWRFRSHSDK GCAIFVIVIILYGVCAIALCAAPFGSIAIFKQAVWLPFIPSAFAVVTSGVVVLFIQRR REFSSTPQFLVE" gene 39925..>43285 /locus_tag="DP116_04200" CDS 39925..>43285 /locus_tag="DP116_04200" /inference="COORDINATES: protein motif:HMM:PF05860.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04200" /translation="MSGVSIRLYWWQSLGIVIGSVIALYTNSARAQITPDGTLPNNSN VRLEGNTRVIEGGTTRGANLFHSFSEFSVRTGETALFNNARDIQNIISRVTGKSISEI NGLIKANDAANLFLINPNGIIFGPNGSLDIRGSFVATTANALGFGNLGSFSATNPEAP SPLLTINPSAFLYNQITTGAIQNNSVAPAGLHLANFETFGLRVPDGKSLLLIGGNVTM DGGQLNAYGGRIELAGLAGPGNINLIIDGNTFSTQVPELPRSDVFLNNGAIANVAYAS GGSIAVNARNIDLLKNSSINAGIETTLGSATAKAGNITLNATEAIRIEEQSEIRNQVR PNGIGNAGDINITTGLLSVKGGARIYTSSFGQGNAGNININARTQVSFLDGSFGWTTL EPTGVGKSGDLQIQADSVSIGNDSQLKVSTLGTGNSGNLTIDARNTVDFYNNSLALSQ VEKGEGNGGIIKINTGSLALNNNSFLSASTIAKGNAGSVQIQARDAVSFANSRAFTTV QEGGKGNGNDLSIQARSVSVTNGSFLTAGTFGEGNAGNVIIDAADSVIFDDKSYAASS IGLEGYGVGLGNGGIIKISTGSLQLANQAKLTAESYSREGKAGNIIINARDTVSLSDG SSIKTLLGGGIKGQAGDISITTGSLSIASASSLQADTFGQGNGGNINIQARDNVSFIT EGGAYSRVQLNAIGTGGDIDTKADSLTLSDGSFVATSTLGQGNSGNVTINAANAVNLS GVNQKGFPGGIYSEVATDKPNSNGGNVNITARSLKVTDGALLATRTNGNGNAGKVNIN ATDFVLFDGVNIKNVNGDIKINRSGAYSLVFPLGTGRANDININTDSLFLRNGANLRA GTDGVGNAGKINIIAQDMVSIDGASSNGLPSRIETQVIENAVGNAGNVDLRTRSLLIT NSAELSSSSQGNGAAGNLTVNANLISLDNRGKISANTIAREGNINLNPRDLLIMRRNS EITTNASGTNITGGNIKIDGKNAFVVAVQNENSNIRADSENFRGGNVTINVANIFGFQ SQTTSFPQTSSITAKGATPDLSGNVQINTPEVDPTNGLIELPTNLVDVSQQISTACTP GSRQSQSSFVSTGRGGLPMSPTEPLQDTSTLS" BASE COUNT 12295 a 9458 c 9389 g 12123 t 20 others ORIGIN 1 tgttctctgc gcccaccgcg aacgagtgcg ggaggaacat ccacaccgca aattcactct 61 ctacggttta ttttttgcgc tttttcaaga gattagaggt gaaaaggctt taatgattga 121 gcgctttcat ttgctgttgt gcgtagcgag acccagcggc gattcctttg ggagcaaccg 181 cttcaattcg gctcaactct tgagaagtca gcgtaatatc agttgcgcca atgttttctt 241 ccaggtagct ccgtcgcttt gtacccggaa tcggcacaat gtcatcgccc tgagccagta 301 gccacgccaa ggcaagttga ctgggtgtca cgcctttttc ttttgcgatc gctttcacct 361 gttctaccaa ttgcagattc ttgtagaaat tctcgccctg aaatctgggt gaatttctgc 421 gataatcatc ttcagcaaaa tcttctgggc tggtgaactg ccctgagaga aatccacgac 481 cgagcgggct gtaagccacg tagccaatgc ctaactcgcg cacagtcggc aaaatctcat 541 cttctggatc acggctccac agggagtatt ccgtttgcag 
ggcagtaatc gggtgaactg 601 catgtgcccg tcgaatcgtg gcgggagccg cttcggacat accaagatag cggactttgc 661 cctgctgcac cagttctgcc attgccccga tggtctcttc aatcggcacc gttggatcaa 721 cccgatgctg atagtaaagg tcaatgacat caactcccaa gcgttgcaat gaagcatcac 781 acgcttgatg cacatattct ggtttgccgc tcacgcccaa aaattgacca tcggtgccac 841 ggacgttacc gaacttagtg gctaacacaa cccgatctcg acgatcgcgg attactttgc 901 ccaccaattg ttcatttgtg aagggaccgt atatatcagc cgtatccagc agcgtgacac 961 cgagttctaa agcgcggtga attgtggcga tcgattcttg ctcatcacgc cctgcataga 1021 actcagacat gcccatgcag cctagcccta gttctgaaac caccagttct ctgccgagtt 1081 gacgtgtttt cactcttcgt tctcctgatt taacttgcga ttttcagtct tatcttgcgg 1141 catggtttca tcataggctc gatcggcttt agcatgaata cacaaacctt gcagatgatt 1201 gcctaatcct ctcaagcaag gcattggcaa cgaagaaata tttcctataa taagcacatg 1261 gttaaatcag tcaaaaattc aacatgacga ttgagacact gagccaccag acagcgatca 1321 ccaaatgtga agaactggcg gcattggttg cgaggcacac tgacagcaag ggaaacggtt 1381 tccatacaac tgcgatcgat cccttagcgt tcacgcgaga atgtgatact tccaaagcaa 1441 ttcacagtgt tagcgaacca attcttggca ttgtggtgca gggcaaaaaa gaagtgttgc 1501 tgaatgacga aagctattgg tacggtgtcg ctcagtatct agttgtttcg gtcgatttgc 1561 cactgagtgg atgtgcgata aaggcgacac ccgatcagcc ctatctggga tttaagctga 1621 agctagactc agcccaactc tgtgacatca ttgctcaaac caacccagac ataggtcaga 1681 aagaatcagt cagaggctgg ttcatcagcg atgccgatcc atcgttgatt gactgcgcca 1741 tccggctgac aaagcttttg gatacaccgc aggatatccc gtttctagca ccgataatca 1801 tccgcgaaat ctattaccgt ttactcatgg gtgaacaaag cgaagcggtt cgtcagattg 1861 ctacctctgg cagcaacatg cagcggattg ctcttgtgat caaacggctc aaatctgatt 1921 ttgcaaagcc gttgcgggtt gaggatttgg cagagcaagc gaatatgtct ccttcatcgt 1981 tccatcgcca cttcaaggca gtcacctcaa tgagtccgct gcaatatcaa aaacaattga 2041 gactattgac agcgcgtcag attatgcttg ccgagaatgc tgatgcgacc caggctgctt 2101 atcaggttgg gtatgagagc acttcgcagt tcagccgcga atatgcccgc atgttcggtg 2161 cgccaccgat caggaatatt caacgtttac gcacagcctg agcgtagact gctgccttac 2221 ccactcgtaa cccgtgacga tagaggcatt agcacttttt gattgcctcc 
tggcatagta 2281 ctgcagtact ataatacgct caaacgtaat agagattgaa ccaacaaatc actcactcct 2341 caggagtcac attgttactg ggggttcggt gtaagcagaa tcaactggcg tttttggagc 2401 gatcgcactc cttccaaatc catccccgtg ttctgtatta atgtctcaac cgtttccggt 2461 gcctgagcat gatcgcacgc ctctaagaaa gcatactctg cttcactgag tttcgcgact 2521 tgatagttat aatctaagat gacttgactt ggccatcctt ccatgcatgg actgcgttcg 2581 ggaatggaat tgagaacagt cgcgtcatct gaccaatcca tgcgggatat tggaggacga 2641 ccgaggaaga attcgtagtg gctgatttct ggatcaagaa gttcagtcag acggtagcgt 2701 tgacgttcgc tcaaatgtgc tgcccgttcc atcaaatctg gttgttgacc aatcaagcgt 2761 tccaattgcc agtacaaagg atttgagaag ccgatgaact ctaacccaga ggcatcaatc 2821 aactcaaaca aagtatcgat gcgatagtct atttcctggg gatggacgta catatcggca 2881 aagtaaccat cgcggtgatt ttccagcgac caacgttcct tttcccgctt cagcagtcgg 2941 ttattttccg gcaggttggc aaagatttcc cgaccgactt tcaccccatc ctcgtagtct 3001 ccacggcgat cgccttggag gagtgcgatc gctttttgca tcaggctaat ttcccaacgt 3061 cccaattcgg catagacaaa aatatgcatc aacccccccg gagcgagctt acccgctaga 3121 ctttgtatgc ccttaatggg gtcggggaga tggtgtaaaa cgccaacgca attgatatag 3181 tcgaattcac cggatacctg ttccacatca tacagactca ggttgtgaaa ttcaacacca 3241 tcagctccgg aacgctgaca acgttcgcgg gcaactgcga tcgcccctgc actcaaatca 3301 atcgcgacta cctgagcatg gggatttagg tgtacgatat actctgttcc tgctcccgtt 3361 ccgcatcccg catctaagat acgaatattt tcccggttag gtttgtgccc cgtgcagaag 3421 ttataagctg tgttccaatt ccaccgccag ttatatcctg gtgggggttc atctagaagc 3481 ggatctggag ggaagggata ggtgttgtaa agtttggcaa ctgccgagct gactgcgtga 3541 gaggaatcca ttgttcagtt atcaggtatc agttatcagt tatcagtgtc ttgctattca 3601 cggtttgaaa gatttcagtt gagtgatcag atggtcaaaa tagggagcga gggctgcttg 3661 ctgctcagaa tcaatccgtt tcaaactagc aattttaata ttttccaagc caaccaccat 3721 tgcatctaaa ggaacttgta attcttgata cagcatgttc atgtagttta gtcctgcggg 3781 gcttgtgtaa tcagtactct gaccagcaat tccatacgtg atgcagcgca aaaagtgcca 3841 aaaatcacgc cagcaggctt cggcacgccc tggtgggtaa agaccacctc cgggttgagt 3901 aatgtcaggg aaggttatca gcacttgaac tctcgcttcg tctacgattt cagtcacgcg 
3961 atcgcgcaac agccgtgcga ctggaatgaa tcctgaggtt tggggggaga gggtttggat 4021 ctgctgtaaa tcttcatcgg tgagatatcg cccctgatca tcggctgctt gaaacagggg 4081 aatgatcgct cttgggtgcg tcgtctccca gctggaaaaa ctgacaattc tcgctttttg 4141 aatgagttct ttgactgctt cactgagttc aggcttcatg ttttaccgat actttttcgc 4201 taacgtatgc taacaaatac aatcttgaag ttcttgtact aacgtattcg gtgttcagtc 4261 actagggtta agctactcca taagttttcc ctggggcgag acacgggtga atgatccgtg 4321 ccactctccc agtttttgca atacacagtc ccattgagat ttcatatgac ttccagtctg 4381 gatttcttat tcccagtcgt catcttgctc tgactgtttg tctatttgag aaatcgccaa 4441 cttgatcact ggttgaaccc ctgtttcact ttcttgccct aacgccgctt gcaagggttc 4501 taatgccgtg cgatcgccaa tcttcattaa tgccagtgct gctgctttgc ggctttccca 4561 atcaggatga tgcagtaacg aaatcaggtt ggcaactgct ggtcgatagg tcaagttgcc 4621 caaaactgct gctgcttcgc accgtacact tgtgtctgga tcgcccaggg cgttaacgag 4681 aatttgaaat gcttcttctt cgggattttc ttggagaatt ttgccgatcg ccccaaccac 4741 agcagcccga acatcaggcg atgaggagtc aatttcctta taaatatact cctttgcctg 4801 tgccccgata aatgccagcg cccatgcggc atgacccttg gtattttcag aggaatttga 4861 gtccgctaaa atttccagca aggctggtac agctgcttca cctgtcctcg caagcgcgcc 4921 gactgatgaa ccctggacaa ctgtgtcctc atcattcagg aaagagttta ccagaatcgg 4981 aactgctttt ggatcggcga taattgtcaa agttttggcg ctagcgcgac gcacaaccac 5041 attcgggtga tgtgccaatg cttccaataa gaatggggtt gccaactcac caatttgacc 5101 gagtgtctcc gcaacacgca tcctaaccaa gccccgtgaa tctcccaaag actcaatcaa 5161 atttttgagg agttcgtgat cattgggatt gaaggtgtcc agatccattt gctccttaac 5221 cgcttccagc aaagcatcgg tttgttcctg ggacggaggt attaaattac cgccggattt 5281 cgtttcagaa ttgagcaaat cactcattat caaataccta tgaaattcgt actaactgta 5341 ggggtgtagg ggtgtagggn nnnnnnnnna agaaacacca tccgtcctgt tccctgttcc 5401 ctgttccctg ttccctgttc ccttttcctg actgaagttt acgaagtcgt ttcgcgatca 5461 cggagttctt tgagcacagc atcaattttg gcaagttccg gctcaagttg agcgagtgcg 5521 gttgccttca tgactttagc cttttgcgcc gttgctagag tttggctaat caacctttcc 5581 cgatattctt caaattctgc aatgacttcg ctcagttctt gttctgttgg cgtgtgttct 5641 
gaagtttcag tcattttgag tttttcctaa atttgtattt atataaagtc tacgctcgat 5701 tttctcagat cacgtcacca ccaagcaaac ctttttgatt acagtgatgg tgaagttgtg 5761 ccatatgttg ctatggatat tcacgaaatc aaaactgctt taaataattc tgattttcaa 5821 tatcgattga aagcgatcgc cgccctcaaa gactatacac cagagatagc cgttccccta 5881 ctcaccagca aacttcgtga ttcggaattt ttggtgcgat cgttcgttgc tatgggactg 5941 ggcaagcagc agactgctga atccttcgct gccttgctgg agttgatgaa atttgacaac 6001 acccccaatg tccgggcaga ggctgcgagt tctctttctt tgtttggcag agtcgctgcg 6061 tctcatctgg ttttgacttt ttttcgcgat gatcactggt tagtgcgacg tagtattctg 6121 gcagttttgc ttgatctgga gtgtcctgag gaattgtttg aagtctgcat ccaagcttta 6181 gcaggagaag attctagtgt acaggcagat gctatagatg cgcttggtac atttgctctc 6241 acaagtcttg aggaacccgc tttgtctcaa ttgttggcgc tggttggttc agagtctaaa 6301 cagattcgca tacacgttgt tcacgcgctg aagcggtttg atcatccgca ggcaaaagaa 6361 gccctcaggc aactccgcca agaccctgat catcaagtcg tcgccgctgc actagaaaac 6421 ttgcttcagc agtaatgtca gccaattgaa gcaaaaagaa ttcctctcct tggcagacag 6481 tatccaagga gggaacgcta tcatataaat acataattct gttgaatttg caaaaaaatt 6541 catggagttc caaaagtcca tcaaaaagac ttctacgtag ctattgtcaa aatcggacta 6601 aaaaagtata aatattgaca tattttcttg acttgtctgt aaaaaaccat aaacacaaaa 6661 acactaaaat gcagttgttt cgctcatatt taagaggtta tttatgtctg acactctccc 6721 aggtcaaatg acgtggaaac tcccagggac gacagattgt gtgttgcacc tgcgacacag 6781 ttcatcagaa ccttggcggt actataagga atttccagag tatgttctgc cggatccacc 6841 ccacttttca gaggggtatg ccacgtttgt tgcactcttg aaaaagaagt ggcaaacagt 6901 ttgatgttga caaattttca tgacagcggt tatcacctta attgcagatt aaggtgataa 6961 acgtaagctt tcatttaaaa ctcatctcaa agagagtctt atgaggcatc gccgggagcg 7021 ttaagcgcag ctctgccgta ggcaatcgcc aactgcactg taaaaatgga accctcattt 7081 agcttacttt tgacagaaat acaaccacta cagcgagata gcaataattg tacaattggt 7141 aagcctaagc caacaccacc agaatcttct gttgtggttg gacgtaagcg atagaaacgc 7201 tcaaagattt taggaatctc gctatcagca ataccaatac cagtatcgcg aaattccaat 7261 tggacataat caccctgaac acgtccttgt acccaaacct gtccaccttt gggggtgaac 7321 ttaatgctgt 
taaaaagcaa attaatcaca atttgcctca gtccaccact cacacaccaa 7381 acagcgggaa gttcagttgg tacggtataa cctagcatga tacctttttc tcgggctagg 7441 ggctggtaaa tactaacgac tccaggcaca acttccgaaa gacgtacaac ctccaaaggc 7501 atccgtggta aattgcgctc taactgtacc atatcgaaga ctccagtaat cagagcattt 7561 tggcgatcgc actcttgttt aagcatttga aaataacgtt gtcgctgaat cattttcaaa 7621 ttaggggaat ttaaaagact caaagctgtc ttcatgtgtg ttagaggtgc acgtaattcc 7681 tgacagacat tgctcaaata ttcatctttg agttgcagat tctcttggag tttttgattt 7741 ttctgctgca tgttagcaag acgctttatc gttctttgac gataaatttc atcctgttgc 7801 tgaagttgtt ttgcaaacag ttgactgaga agtcttggat ctggtgctga gggacaaata 7861 aaatcagcag gcataagtag ggatgactgt ggcgtgatgg cggatttcat tacatttagc 7921 actttttgaa taatttctcc ctcaaaaatc gtcacagcaa gtaacggtgc agtttttttt 7981 ctgtttgctt tagctaaatt gtttttacgt gttctaaatg gtctatgagc tagaatcaga 8041 cagcaatact gcgacgacaa caccatcaca aagtattctc gttgcaactg gctatttggc 8101 atgctattta ccagaaaaga ttcggaagat gaggaggtag gggatgcatt cccgtgccaa 8161 gagttgtcgc gtgtctcctg cacaggggtg ctgctttggc gtggcgtctg tgtcaacaca 8221 ggagagtttc cttgttgagt gccagcggcg cgcccctgcc gtgaggatgg gggagtttgt 8281 tgttcctgta ctcctgtttt gctttcctga atgtgcaaag tacgaatcac accagacatc 8341 gtcagttgtt gctgataacg ctgaatttct gagtgccaaa tttttcctgg tggtagctta 8401 acccataaag tagctgcaat ctgttgctca atgagtaaat caatttgtga cttcaattgt 8461 gacagcagag tcgcaggact atgaggcaat agttggggaa gcgcttgcat ccctgaagcc 8521 aactgctgat aaactgacac atcctgagca aaagaaaagt tcatgggtta gcacaacaag 8581 gaaaagactg cactaacttc tcatcttaat taataaagtc gcaaaaaaac cttgtcttat 8641 gattatttac ataaaaaaga cacagagcat taagtattaa tgtctatata taacagatca 8701 ggtaatcagg aaggacggag gcaagagttt cttacccttc acgattgtac ctttggaatt 8761 aacccttcaa ttccttctac ataccaatcc atcgctcgtt gttccttatc acccaacacc 8821 tgacccttcg ggactcgcac cacacccttt tggtctttca ctgggccatc aaaaggatgt 8881 gcaacacctc tgataaattc atcacgcaat gcttgcacca gttgctgaac atcttgagga 8941 atgacaggat tcaagggcga aatatccacc atttcttgag caattccata ccaaacctcc 9001 tcaggcttcc aagtaccatt 
gataacagcc aaggctttat ctgtataaaa cttaccccat 9061 ttattcaggg ctgatgttaa acaagctttt gcaccaaatc gactcatatc ggtgttgtag 9121 gcaaaagcat aaataccttt ttcctctgct agttggatgg gggcaggaga gttggtatgc 9181 tgtgccagaa catcagcacc taaattaacc aaagcttggg ctgcttctct ttctttagct 9241 ggatcatacc aactctgcac ccagactact ctcacttttg ctttaggatt tgtttgccgt 9301 agtccttgcg taaacgcacc tatccctcga atgacttcag gaattgggta tgaaccaaca 9361 aaaccaatca cattcgattt tgttattttg ccagcaatca tgcccgttaa gtatcgcgct 9421 tcttcaaaac gtcccagata agtaccgaca tttgtagcgc gtttgtctcc actacactgc 9481 tcaaaatgaa cattgggaaa ttcttttgct actttaaggg ttgcattcat gtagcccaaa 9541 aaagtcgcaa aaattaattt attaccttct aacgccagtt gacgaatgac tctctgggcg 9601 tccgcacctt cactgacatt ttccacaaaa gttgttttca ccttctcctg aagatttgcc 9661 tccatttctc tacgacccaa gtcgtgggca taagtccatc caaaatcatt cacaggttca 9721 acatagacaa atcctacctt aagaggttca ttcgccgcca taggcgaagc ttgaacagat 9781 acctcagctt gtggttgaga cttactgccc tgagtacagc ctgcaaatgt aaagctagtt 9841 cctgctaaag tggtatactt tagaaactta cgacgctgca taaataggat attaagaaag 9901 atattaaaat attgtattgc ggctccaagc gtgttttctc gtaccagaaa ttgctgtaat 9961 aacaattacc cataaagtcg tgtgaataac caataaaatt ggttatcaaa gtagcaaaaa 10021 ttgcattgat accacagaaa aatcacgtat tgataaatca acacaaatat tttctataca 10081 aatattttct atgaacatat ttgagctttt agcaggggaa gacaatcata aggcgctagt 10141 tacacctgag ggatcagcac ttacctacaa gcaactgcgt gaaaacattg tcgaactggt 10201 atcccaactt aacagttttg gtttgcaacg gggcgaacgc atcgccattg ctatgtcaaa 10261 cggggtacca atggtactca cctttcttgc agctgcttta tgtggtaccg cagcaccact 10321 caatccaaaa tacaagcaag aagaatttgc cttctactac gaagatactc aggcaaaagc 10381 actgattacg ctgcctgacg tgccagaagc cgcaatcgca gctgccacac ccgacatgat 10441 acacattcac gctaaagtta ccgagaacgg cacattaagc tttgagttag tgaaaacagc 10501 ttccggggaa ggggaatcct tgggcaatca agagtttccc gactccgacg atgtggcaat 10561 gattctgcac accagcggta ccacgagtcg tcccaagcga gtcccgattc ggcatcgtaa 10621 cttgattgct tctgctaaca atattgttga tgcatactcg ctctcggcag ttgacaaaac 10681 actctgtttt atgcctctgt 
tccatattca cggactggta gggtgtatgc tggcgactct 10741 ggcatctggc ggaacgcttg tcatacccga tggtttcaat gccctaagtt tttggaaact 10801 ggtggaaact tataaaccca cttggtactc ggctgcacct accatgcacc aaacaatcct 10861 agcgcgagct agccgcaacg aagccattat caaagcgaac cgcttccgct tcattcgctc 10921 tagcagcgct ccccttcccc ctgtcattat cgaacagatg gaagcaacgc tcaatgctcc 10981 tgtcttagaa tcttacagca tgaccgaagc tgctcacttg atggcgacta atccgctacc 11041 gcctaaagta cgtaaaccgg gcatggttgg ctatggcttt ggtgtggaag tcggtattat 11101 ggacgaagat ggcaatttac tttctcaagg aagcttgggt gaggttgtgg ttaaaggacc 11161 aaatgtgatt gatggttacg aaaataaccc gcaagccaat gcgactactt ttgtcaatgg 11221 ttggttccgt actggcgatc aggggacact tgatgctgac ggatacctcc gcctgactgg 11281 acggattaag gaattgatta accgaggcgg tgaaaaaatt tctccactag aagtagatga 11341 tatactgctg cgtcatcctg cggttgctga agcccttgct tttgctgtac cccataaatc 11401 gttgggagaa gatattcatg cggctgtcgt tcttaagggt gaaaccagcg aaaaggaact 11461 tttggcttac tgtgcaacca tattggcaga cttcaaactc cctaaccaaa ttcacatttt 11521 ggatgaacta ccccgtgggg cgacagggaa gctgcaacgg ttgaatatgg cgaagttgct 11581 caagatagag actgaacaca gataaacata tagcaatcct aatttattcg tgaaatcttc 11641 ttctctcttt tctacccttc gggaagccgc cctccgggcg tctacgtgta cttcgtgcct 11701 atgtccttcg gacacgctgc gctaacgtgg ttcgttaaca aaagaaaatt tcacaaatca 11761 gatgggattt ctatagattc acacagataa ttcaacagtg tgaatctgtg gttccatata 11821 aaaaaaatac tttggcaaga gttctcatga aaatctgtat tgttggcgcg ggtgctattg 11881 gtggttactt aggagcaaag ttagccctct ctggtcatga ggtgacgctg attgctcgtg 11941 gtgctcattt agaagcaatt caaaaaaatg ggctgaaact gattatggca gatggctcaa 12001 cccacacagc taccccaatg gcaactagtg atatgagtca agccgaacct caggatgtgg 12061 tgattttggc tgtcaagact acgagtgtgg cggcgatcgc ctcccatctc ccctgtctct 12121 ataaaccaga aacaatggta gtgacggctc aaaatggcat tccctggtgg tactttcgca 12181 agcacggcgg cgagtatgaa ggaactgcga ttcaatctgt agatccggat ggtaccattg 12241 aagctcatat taatgttgag cgcgtgattg gttgtgttgt ttacccagca actgagatag 12301 ttgaaccagg tgtgattaaa catattgagg gcgatcgctt cagtttgggt gaaatcgatg 12361 
cttctaaaac cgaacgcatt cagttgttag cgcaagcttt gaagtcagca ggtttgaaag 12421 cacccatacg cactcaaatc cggactgata tgtgggtgaa gttgtggggg aacctggcgt 12481 tcaatcccat tagtgccctg acgcgtgcaa cactagatca tatttgccaa taccctttga 12541 ctcgcgaact cgcaagacag atgatgagcg aagcacaggc gatcgctcaa aagctaggaa 12601 ttgattttgg catcacctta gagcaacgaa ttgaaggtgc ccaaaaagtt ggtgctcaca 12661 aaacttcgat gctgcaggat gttgaagcgt gtcgtccaac agaagtggat gcgatcgtcg 12721 gcgctgtcgc agaattggga agactgacgc aaactcccac accccatatt gatgctattt 12781 atgccagtgt caagttgctg gagaaaacct acatggggaa ctagaaaaat atgatatcga 12841 taatatagca ggaagtgtca attcactttc ttcctaatgg caaaataatt tgtaacgcaa 12901 tgttagagcc aatgtaggcg caatccgcaa gcccagcatt ggctgttagc acatgtgtac 12961 attttcagaa caatttctgt cattccacga aaatcccctc tcgtgtaatg tccaaaattc 13021 gtctcaaaat gtcaacccat gctgacagcc aattcagaaa aactagaatc ctagagagaa 13081 gttcttgata cttcaactag caactagtag tccttaatcg ttaaaatccg tgtttttgtt 13141 ttctgtaaaa cgcttattca acgcaagttc catgaggaat gaaaatacgg tttttagaaa 13201 ttatgtccta ctagagctaa aggaggtttc tcaacattac ggatagtata aatatctaga 13261 gttggctccc accttagatt tttggttccg aactccattt gacgtgagcg tgcatataaa 13321 acaggacaag accaatcgcc gaatgcaaca gtatcgaatc cactacctgc atcttgtata 13381 ttggcatcta tgtacagttc tgcacgtgca tctttgacag cttgatctac tcttctaccg 13441 tatttcaaaa cacttgtata aaacctttta ctcaattggt atccagcctc aaatgtaata 13501 gcatgattag ttgctaacac aattggaact cctgcttgaa ttaggttgtg agccatgtta 13561 acaaaatcat tagctcctcc tctaaggatg tcatccacgt tgttactaga cattccttct 13621 acagtgtcag tttctacaga gcgagcagtg tcacatgcat taagaaaaca aaagcccagg 13681 ctttgatttg ctgctaatgt cgttaacaaa cggatagata ggttcccaaa ttctctacct 13741 ggaatattcc aattattact actttcgctc ctccagacaa ttaacgagct ttccattgga 13801 tttctactgt ccagaaatcc atgaccaccg aagtgaaata tatctacacc ttcaaacaga 13861 atatccttga tgttttctac agttgctctc tcatccctta gttctgagat cctgatctgc 13921 tctccaaagc ccttgagcat attgactaga gtttcgtgct ctttatcaag aacatcttta 13981 cctctctgag aaaaaggatc tacgctgaca atcaggatat gtaacttttc 
tctacgttca 14041 ccagggatat taacaggttt tggcacaaat gggtgtctaa ttaggttatg tctaaagcca 14101 atatattcgt tacgaaaacg cataacttcc caatgaatag tatttagcat tggttgacca 14161 agcattaaac gaagattgat ttgtgaatct gttgctcctc ctatagttct atttaggagg 14221 cttagaattt cacctttgaa gatttcatta aatagactga ttccaaattt ttcaacttct 14281 gactcaatgc tctttcttga aatactcctt cttagaaggc gactatagag taatcgaagg 14341 tcattcgatg tgtcagccat ttcatcaaaa aactttttag cgacttgaat agttgcttct 14401 ccgaagttag attcagcgag aatagctaca tgatcattta ctggaataat ttttaggtca 14461 aaattctcca ttaccggact tctataatga taatttattg agtctgaatc aaaccgttat 14521 agttcgagtt tcgtaaatat aaaactggag tcaaccaatc tgacggtgga agaaaatctg 14581 tatgagcaaa taattcaagc cttatttgtc taacagcctt gtaaatacta ccacctttag 14641 aagtaagtgt tctgtaaagt cttctggtta atatttctgc tcctattttt gatattggat 14701 gagacatgcc aataacacat ggaatacctg ttttaattag tgcatgtgcc atactatagt 14761 aaccagattc agttaatcta ttggttaatt ttttaccaga cttagattta ctgtacataa 14821 tagcttgatc tgacctacaa gcatttaaaa ctgcaacttg cattggattg gcacgacaaa 14881 aactttctaa catttctatt ggtagtttat catacggttt attttcatct tcgctatcta 14941 tacgaaagta atacttgaga tggttaccat cgtactcgca ctcaccatga gctagaaaat 15001 gaaggatagt tataccgtca tagatgctgt ttgcaatact tttatacgta ctatctggga 15061 gtttttcaaa ctgaatgcta ctttcttcta aggcagcttg aactgcctct atttgtccct 15121 gtacataaat atcatgtgaa ggattagcac ctatgaataa caatcgaatg ttttcgctgc 15181 tagatattgg ttttctgaca ggttgaagtg tgaaagggtg tctaactata tcatagacat 15241 gacacaaata cctatcctca ttatagaata gttcccacgg aacaagatta agaaagtcgc 15301 tagctatcat taacctgatt agaacgtcat tcttcaaccg tatcgcttga ccaattgcta 15361 catttaataa atccaaaatc tggtgattga atacattttt aaataaaagt gttcctagct 15421 gtttcaaatt actagacaac ttttctcgtt cctgcttgga atttaataca cgattaccta 15481 atcctgttgt atctaagttt acgggagagc ttgcatcttt gatcgcctca ccctgttcaa 15541 ttgaattatg atttgactca aatagtattg agtcttttgg aaaaaattta tcaatatgct 15601 gatgtactaa tattgacctt tttatcaaag cttttacatc agattctttg ataaaggaat 15661 atgcttcacc acttgaccct gtacaaacaa 
ctaaaatttg tttgtcaata taaaaaaact 15721 gtagagacaa ctcagagttg atgagattca tgaaactaaa cttcagttat ctatacaatt 15781 gcagaaatca tctagaaatt ctctagttag tagcactgct agagtattta caggaattaa 15841 tgaaaaaaca gttgcaagca aatcacctat agcaattatt attgctatac gactggcatt 15901 tttattttgt tttataaata aacaataatt tccctcaata caaatttttt gatatatatc 15961 ctctcttcta tcaactatcc attgttttac ttgagatgta atatcatttg ctggtcctaa 16021 agaaccaatt tgctgaccaa attcaacata aagtaagttc gtatcaagat ttaataatct 16081 atcaatgttt tgttcctcac gagttgagaa ggaacgcaat cttaattgca taatgagcct 16141 cagaaattaa tagataagcc aagctcaggc ttgcagattc agtaatccta agccttaagt 16201 tgtaggttca atgtagatta tacttaattt ataacaaaat gagttaattt agatacaaaa 16261 tatcaatcct cacaactgtg aagtataccg tattgctgtc gaatttcact ttgttcgtta 16321 gttaaataag cacgtcggct cataagtaat acagcccttg ttggcgctct cggcgtctac 16381 cctgcgggta tgcctagtgg gtacggccac ggctatcgcc gtatgcgaag tgtctccgga 16441 ggagccagta ctgcggtgct gtctcaccgc tacgcgtcta tggggttagt gaaaagaaat 16501 aattttcaca aataaaataa gattgctata gttcaattgc tttaggcaaa atggactttg 16561 ataacattcc atgaacacct ttttgcggac tgtttacaac ttgagtaatg cgaaaattca 16621 cggctaaact acacaaaaca taagcatctt ccgccgacaa attgatgaag cgttctaaca 16681 agtcaatcat gttttttaaa gccagttcca aagcttcatc taatgtttga ccaaatccca 16741 ttgtgataat atctgtagga gtttcggcaa ttggcattgt caattgtaaa tccttgcgaa 16801 ctttcaactg aatcgtgccg ttcattgaag tctcaattgc ggtaacatta acttccccgt 16861 ctccctgtac tgaatgtccg tcaccaatag aaaataaagc accaagaaca aagattggca 16921 gaaaaatctt agtcccagct tggagttcct ggttatcaat attaccacca taattaccag 16981 gaggaattga gtttcgggat gcttccgtag tcgcgacacc cagaatacca aaaaagggtt 17041 tgaggggaat tttgatacca ctatcaattg gaaattctgt aatgttatgt tctaaatcaa 17101 gaggaataaa tctcaaagct ggttgatgaa attgatgtgg taatgctccc caacctgtcc 17161 gaatcgcatt gaagccaaca ggtaaactcg gtgttattgc ttctagtttg acttccaaaa 17221 catctccagg ttccgcacct tgcacataaa ttggtcctgt cagtaaatgt ggtcctgctg 17281 caactttgcg ttctgaggga agattttgac aaatatccaa aaatgcaggt gtgagaaact 17341 ctggtggtgc 
tttgtcgtga acgtagtaac cagtgtatgt ttctatttca actgtatcgc 17401 cagagtcaac aatcagtgct ggttctagaa gatgagaaaa cccacctaga tgcacagttt 17461 ccttggtggc tttgagaatg tggtgagtca tcgcgtagtg gtactttatc agaattattg 17521 cttagtacag gattgcacaa gttcgtagca atatgtatgt cattctgtaa gcgttgctcc 17581 acttggttaa agcgcttgga aatcaatatc agtaatttaa tgtattgaaa tttacagatt 17641 taattctaaa aatcctgatg tcagtcaggt gtttgtacta attataattt gtatagaagt 17701 tttcaattgg gtgcaataca tcaaggagtt agggaaccgt agatgtagac gccggaggcg 17761 gtgagggggt cgcagtcttg gcggtttccg tcgattgcgt tcgcgcagcg tctccgaagg 17821 agttaccccc gaacccggag ggcttcccgg agggtacgca gatgtaaatg ccccaggcta 17881 gaggaactgg cgtacgccca cagggggtac gctccgtgca tacccgtaag gcttatggca 17941 aagtacaccg ataaatatgt acttcatcta aatgagaacg ctatatatct ttgatttaag 18001 taaaaaataa cacagaaaac atacatagtt acaaaggtaa ttcttatgag aagtacagga 18061 ctcgcttatc ttctttggtt cacttgtatt tttggacttg ctggaaccca ccgtttttac 18121 agtggcaagt atgttagtgg aatagtgtgg ctgtttacat ttgggttatt tggtcttggt 18181 cagcttgtag atttagcgct aattccagga atggttgaag atcaaaatct taaatatagg 18241 atgcttcatg gtagcccgaa tagtaacaat atttcaaata ctcaacaggt tgtcattaat 18301 gtagctgact acatagctcc aaatgccaat actaataagc cactatctac taagtctgat 18361 ctccaactaa ttctagaatt agcgaaaaat aacggtggga acatatccgt tacagattgt 18421 gtgattgcca ctggtaaacc tattgtagaa gtcaagcaaa ctattgagag cttatgtgct 18481 gaaggtttat tagaagctgc taatcatcaa gaaactggtg caataattta caaacttatt 18541 taaaggtttt ttcaattgag acaaaagtcc agtctgacaa gacctttagg ctctctcgtt 18601 cctatgctct gcatgggaat gcactacagg aggctctgcc tcccaacaat tatattgata 18661 tattgaggca gcagcccaca gttatgcatt ccttgccaga gacaaggaac gagaaatttt 18721 ctaattcctt tcttttctcc actttcctaa ttattctgaa ccagaagtat ttgtcggacg 18781 cgctgagtta atctgacgca gcaacatcac tccccaataa ggaacaggag caagtagcac 18841 agtaccagtt ccctcaaaag tattgactaa aaactcaccc gaagtcatcg aaccgaaaat 18901 agacttagta gctttttcaa ctcgataatt tagggtgttc gtgcgagcaa tcgcaaaatt 18961 accatccaca accaagcggt cattttgcaa atgcacttct tcaacaggtc cttgcgctac 
19021 catgaccact ttacctctac ctttgacttt agtttgaaac aaaccttcgc caccgattaa 19081 accagaaaat aatttatttc tctcaacact gacttctaca gaaccatcac ttgcccagta 19141 tgctcctcta tctaaaatcc attcactgcc atttaattcc atgatatgaa atccagataa 19201 agaaggctcc aaatataatt cacctgttcc tgtataagtg ggtctaaaaa tattttcacc 19261 tgttgctaga gattttaaaa aactaccggc tgttggtgct ttagattgca tggtaatctt 19321 gccacgcata tagtacaaag caccagactc cgttctcacg gtttcgtctt gcaaaataac 19381 cttgactaag cgcaatcctt cttgtgctat aacttcaaaa gttgccattg ctttcctctg 19441 ttattacaga aatacaaatt cattgtttcc aagactaagt ttgcttgttc catgtcattc 19501 gcgctctcct cacctcccaa caattatata aaggcagcaa cccacagtta tgcattcctt 19561 gccagagaca aggaacgaga aaatagtttt attgtaagca aagaaaataa aattatcccg 19621 caaaaatgta acgcctatag gactcacgca cgagttatga aagaacaaga ctgtgagatt 19681 gggcactcct cgtcgcaacg ggtgcaagta cgttattttt gcgtaagtcc tggcttattt 19741 gtttgcgcta tcaccaactt ctaaactctg acgcgttgtt ctgcgagtaa aggaattcca 19801 agcatccaat tctgaagcag gacgtctgag caggattgct cttatcgcct cacttaatcc 19861 ttgaacaaga gtaaaatctg ccccagcaat actctgaagg tactcgattt caacaccact 19921 gaagttgctt gctctcaaat cagccccttg taaatctaca ttatttagtc gcgcattgcg 19981 taagtcagct cctgtcaaat acgctaaccg taaatccgca ccactcaggc gtgcattctg 20041 taaatttgct ccagtgagat ctgcgccaca cagatatgcc cctagcagat ttgcattcct 20101 caaatcggca tctctcaaat tggcagtgtt gagatgggca ccgctgagat tggcaccagg 20161 accaattgcc ccacaggttt tgtaagtaaa tccttcgggc caaatcgtgt gttggtcata 20221 tcgagctgct tgtagtttgg ctccttccaa attcgccccg ctgaaatttg ccttctgaag 20281 atatgcccgg tgcaaaaagg ctccttgcaa attggcattg ctcaaatcag cgccctgcaa 20341 gtcagcctta cgcagatccg caccactcag atctgcttca cacagcattg tcccactcag 20401 attagctcca atcagatttg cctgacgcag gtttgctcgt tgcaaatcga gtcccctaag 20461 atcggactga tacaaatctt catgattgag agcctctcct gctttgaggg ctgcgatgtt 20521 actagataag gaaatattgg agttttttgc agtcattttc tttgaggagc gcttagagta 20581 ggggaaaaca gtgaacagtc aacagtaaac agtaaacagt gagtccagcg cgcatgaggc 20641 gttttcccgc cgtaggcgac tggcgtatgc gcaaagcgca 
cgcccagagg gctaaagccg 20701 taaggcgtgc gctttgcgca tacccgaagg gtgaacagag tcagcgattt cttgacctcc 20761 ctgatcactg gtaactgata actgataaaa gtgtacaatg tgggacatca gataaattct 20821 tgttacattc ctttatattt tttctgctgt tgaggttaga aatgcgcctg attccaccca 20881 tgaatatgat ggacttcttt cgtaagagtc aaggcacgtg gtttacccaa cgcactgtcc 20941 accattttga cctggcggcg gatcagtctg gggagtcaaa tttgattgtt caagtcgttg 21001 aacgggaaga tccaagagtc aaagcgattt gtgaacaaca aggaattgat cctgccaagg 21061 gaatgggggg aggcagtttt atgtggcaag aaaaccagga caatcgcgaa cctaatccag 21121 accaagcggc ggtgctagtt gatgtgcccg atgatgaaag tctgttgtct ggaaaactca 21181 tccgcgatcg cggctacgtg gaaaaaatgc ccgttattag tcggtactgg tttggtaaag 21241 acggcatttt gacaatcgac acagaatatg acattaacca gggacaagaa cggtgctggt 21301 tcattaccga tgactttcgc gtgcgtgtta gcaccgtgcg gacgatgaat ggtgtatata 21361 tgatgaccta tggctcagag cgtcgttgcg tatcagaggc aacgttagag caactgattc 21421 agaagaattt ggcaagggca tcaggaaatt aagttaaagg gtaagcgcta ggttcagcga 21481 ctgcagtatg tcacaaatta ttgcgtcctt gttaacaaat gggacagacg ttctcataat 21541 cctttattgt cgagaaagtc ttataaatat ggattaatat caatgtacaa gccctttgta 21601 aaacatctag aaaatgaact gtttaacaga tttaatttac aaaatcgggc aatccctact 21661 ggtttagaat tgaaggtcag cgatcgcggc agaaacccag caaccattca aagctggtgc 21721 tatcaatgcc atcaattgcg gaaaattcgc tatacctaca ttgatgcggg cgaaagcgcc 21781 caaattctca atagcgttat ttatcctagc tatgactacg acttaccctt gttgggagta 21841 gactttctgt cgtttggcaa aatcaaaaac ctggttgtgc ttgatttcca gcctctgttc 21901 caagatgagg cgtatcagcg caagtacatt gagccattaa aattcctaca tgctaaatat 21961 ccagatttgg cacaaaattt agagatgaag ttttatgacg ccaaccaata tttctctaaa 22021 tacctgctat ttgccaagac agatcctgaa accgtggcga cacggttact tgcggcgttc 22081 aaggactact tgaatttgta ctggcaaatg ctagatgagg cggaacctca aaaagatcca 22141 gaagacattg cgcgaatcgt aaaagcccaa aaagactacg accaatacag tgccgatcgc 22201 gatccagcat caggtctatt cagcagttac tttggtcatg agtgggcaga acgcttcctc 22261 tacgggtttt tatttgaaga tgcagtccca ttagccgcta ccgccaaaaa gtagtttttt 22321 gccgcttgtg atctgtcaaa 
agtcctttgt taaaaaaaat taatgactga tgactaatga 22381 ctaatgacta ataactaata accaatgact ctttatcaac cgtttctcga ttatgcgatc 22441 gcctatttgc aatcgcgatt agaactcatg ccctatccca tcccttcagg gtttgaatac 22501 aaaagtgcta tcacgggcaa gggcaagaat caggaggaag ttgtcacgac cagtcatggc 22561 ttttgtgcac cgaaacttcg acagattcgg gcagcgcacg tacaaggtgg tcaatctctc 22621 caggttctca atttcgttat ttttcctcat ctgaactacg atctgccttt ttttggggca 22681 gacttagtaa ccttgccagg aggacacctg attgccctgg atatgcaacc gttgttccgg 22741 gatgatccgg actatcaggc aaagtatacc gcacccattc tgcccatctt tcacactcat 22801 cagcaacacc taccgtgggg tggagacttc cccgaagaag ccagcccctt cttttcgcca 22861 gcattccttt ggactcgccc caaagagact actgttgttg aaactcatgt gtttgctgcg 22921 tttaaagact atctcaaagc atacttggat tttgtagaac aggcagaacc tgtgatggat 22981 gctcaagccc tagctaggat caagcaggca caattgcgct atttgcggta tcgtgccgaa 23041 aaagaccccg cacgaggaat gttcaggcgt ttttacggcg aagagtggac agaggaatac 23101 attcatggct tcctgtttga cctagagaga aagctgaccg aatcagttgt cagttaacac 23161 tcagcagtta tcctgcacag aatagacctc ttgcaaaagt gagattttac gaggttctgt 23221 taagagttaa gcgttcagaa ttaagagtta agcgttaaga gttccctgtt aagcgttccc 23281 tgttccctgt taagcgttaa ctttccttat catttggctt aagctttgct ttgcgaatta 23341 attgggcgat cgtacgagat agggagagtc cagactcttc ggcgagcttt tctaacttct 23401 cccgttcaat gtcatttaat gtaacgttta gtctactttt cttcgttgcc acggcgcaaa 23461 agtggtgtaa ctatggtacg tattgtcaag cttaaaagct aagctgcatc tagattgtta 23521 ccgtttgtaa aaaaatagta agttctaaca acttttaaca cacaaagttc ccccttacat 23581 tgtataaaga tttaggaagg caaaaatgag aagcaaatta tggctcagtt gttgttttct 23641 cagactcatt ttcctataac atagggagga gagaagttag tctcagattc gttaaagaac 23701 ttaacactgt tctcaacact ctgaaacagg taaatctgag aattgctgag aactctgatg 23761 cccaatatca ctgtatagcc tttctaagct gcccacacag gcaaaattca ctctagaacg 23821 atgtcgtttg acaacgaaat cgtaactaaa acaaaggaat taggagacca aatttatgct 23881 agacgcattt gcaaaagttg tttctcaagc cgatgcacga ggtgaatacc tgagcgctgg 23941 tcaattggat gctctgtcag caatggttgc agatggcaac aagcggatgg atactgtcaa 24001 
ccgcattacc agcaattcct ccgcgatcgt tgctgatgca gctcgcagtc tgtttgcaga 24061 acaaccccag ctgatcgctc ctggtggcaa tgcttatacc aaccgtcgga tggcagcttg 24121 cttacgcgac atggaaatta tcttgcgtta tgtcacctac gctatttttt ctggagatgc 24181 tagcgtactc gacgatcgct gtctgaacgg tctgaaggaa acatacttgg cgctaggaac 24241 ccccggtgct tctgtggcag tgggtgttca aaaaatgaaa gaagcagcgt taaaaattgc 24301 gaatgacacc aacggtatca ccagaggaga ttgcagcgcg ttaatgtcag aagtcgctgg 24361 ctattttgat cgtgcagctt ctgcagttgc ttaaatctat tagtcagctc aatagtcatg 24421 agtcattgat cattaacttg agacaaatga ctgtagacaa atgactttag acaaactcgc 24481 tttcaagcaa aactttgtaa aacaatacaa gagatagaaa aagcaatgaa aactccactc 24541 actgaagccg ttagcgctgc tgattcccaa ggacgtttcc taagcagcac agaagtccaa 24601 gttgcctttg gtcgcttccg tcaagctagc gctagcttgg aagctgctaa agctttgacc 24661 tctaaagctc aaagtttggc tgaaggtgct gcaaatgcgg tataccaaaa atatccctac 24721 accacccaaa tgcagggacc tcagtatgca gctgattccc gtggaaaggc aaagtgcgtc 24781 cgtgacatcg gttactacct gcggatggtc acctactgct tggttgttgg tggtacaggt 24841 ccaatggatg attacttgat tgctggatta gctgaaatta acaagacctt cgacctatct 24901 cctagctggt acgttgaagc actcaagtac attaaagcta accacggtct cagcggcgat 24961 ccagctgtag aagctaactc ttacctcgac tacgctatta actgtctaag ctagtaaaca 25021 cacacaggcc cggtaaggta aacagctttg ttagcaatca tgatgattga tgatgtagtt 25081 gttcacttta ccgggctgct ttattccgat aaggcacatt aaaacaggga acagggaaca 25141 gggaacaggg aacagggaac agggaacagg gaacagggaa caaaagaaac gaatccgtat 25201 ttctctttga taactgataa ctgataactg ataactgata actgataact gataactagt 25261 aactgtaaga aatgcctgga aaaaattgtc aagttggcag ttaaaagagg gaggcaaata 25321 tggcggtttt cgctggatcg gaacgactag ggattacacc aagcgaggct tcagcatgga 25381 tagaactgca ctctaatgac agcgcagaag atgtcgaggt ggtcatccgg gcagtctatc 25441 gacaagtatt gggcaattcc tatgtcatgg aaagtgagcg actcgttgtc ccggaatctc 25501 aactcaagcg cggtgagata tcggttcgcg agtttgtgcg gctggttgcc aagtcggatt 25561 tgtaccgcga gcggttcttt gacaattgct accgttaccg caccattgag ttgaacttca 25621 agcatttgct tggtcgtgct ccaaatgact acagtgaaat ggtctgtcac 
agtcagatat 25681 tggatgagcg cggctttgag gcagacattg attcttacat tgacagcgac gaatatcaag 25741 aaaattttgg cgaaaatata gtaccctacc atcgaggctt tacgactcaa gttggtcaga 25801 aaaatgtcgg gtttagccgg atgttccaac tgttccgggg ctattccagt agcgatcgcg 25861 cccaaaaaca aaatcaagga cggctgactc gggaagtagc tcagaatacc gcctcaccca 25921 tctacgcagc ctccagtggg tcgttgactg gtatctcaac aggtagccgt ggtggaagta 25981 cctaccgact gcggattatg caaccaccat cttccaaatc agccgtgctt cggcgtgcta 26041 cgagtgaagt tgttgtgcct tttgagcagc tttccagcaa gttgcaacaa ctgaacagca 26101 agggcttcaa ggttatgagc attacacttt cttagtcagt cattagtcat tagtcattag 26161 tcattggtat taggcaaagg acaatgacaa atgatgaact aggttcataa ggagattttt 26221 taaatggcta ttacaacaga agcatcccgc cttggaacct ctgcctttag tgatttttcc 26281 cctgtggaac tgcgtagtcc gggagatgtc cagaatgtta ttgccgctgt ctatcgtcag 26341 ctgttgggca acgattattt aatggcatcg caacgcctca ccagtatgga atcgctcctg 26401 acaaatggca agattacagt gcaagaattt gtccggcaag tggctaaatc agaactgtat 26461 aaatccaagt ttttctataa cagtttccaa acccgaacca ttgaactgaa ttataagcac 26521 ttgttgggtc gggcaccata tgacgaagct gaaatcatcc atcacttgga tttgtatcaa 26581 aacaaaggtt acgacgccga cattgattcc tacattgatt cacctgagta tcaaggaaac 26641 tttggcgaat acatagtccc ctattatcgc ggctttgcga ctcaaactgg acagaaaact 26701 gttggattta gccggatgtt ccaattgtat cggggttatg ccaacagcga tcgcgctcag 26761 tttgctggca attctccaca cctagccaca gagttgggac gcaataccgc atcggcagtt 26821 gtcgcaccag gtagccctgg ctttggctat cgtccctcag ccaaaggagt cactcccaat 26881 actgccttcg gtggctcaac catttatgga gatcgtcgct tgtaccgtgt ggaagtatct 26941 gctcttttaa cccccaaata cccacgtgta cgccgcagca acaaagctgt tgtcattcct 27001 tatgatcagc tatcagatta catgcaacgg gtgcaacgtg aagggggcaa gatagccagc 27061 atcacaccat tgtagtcccc tttggtgagt cgtcctttgg tggaaaaaac tatcaccggc 27121 aatcagctat ggaagtttcc agctgctgat tgccgatttt ccataggaga atcatcaata 27181 agctgtgaga aatgactgat attatgggtt tagacacgtt tgatccaaat gaatcgagta 27241 gcagtgaatc tttaactgtc gagcaggcga tcgccaacct ccagggagaa gacttggggt 27301 tgcggatgta tgcagcttgg tggctaggca 
gatttcgggt acaggaaccc gcagcgattt 27361 caaccctcat ccaagcatta gaggacgaag ccgatcgcac agaggaaggc ggatacccgt 27421 tgcgacggaa cgctgctagg gcattgggaa agttgggcga tcgccaagca gtgctaccgc 27481 tgattcagtc tttaaactgc tcagattttt atgtccggga agctgcggct caatcactag 27541 agatgctggg cgatccagtt tgtattccta atctcattga gctactcaaa gcaggtttac 27601 aagggagaca actcgtcagc aatcagccag acttttctca accctacgat gccatcttag 27661 aagctttagg aaccttagga gcaaccatag ccgttcccct gattcagccg tttttggagc 27721 atccaatcga gcggattcag tatgctgctg cacgagccat gtaccagtta actcaggata 27781 tgatttatgg agaacgcctg gtgcaagcgt tggcaggtaa agatttgcaa ctacgtcgat 27841 ctgcgcttgc agacttgggt gcgatcggct acttgccagc agcagaggca atttcccaaa 27901 ctcttgctga aaacagtcta aaattgatcg cgcttaaagg actgctggaa caccagattt 27961 atatgtcacc ttccaatgaa ttatctgagg gtgctgttcg ggtgatgacg ttaatggatg 28021 ggctacttta gtatgatgcc gcagcaagcg acacacagat taatcctcgt tttcagtgct 28081 tgcttcgtta aatcctaacc acaagagtaa tacttcttcc acttgtgcca tttgctttgg 28141 cgtcaaactt cctaactttc gcagtaactt tgcatgagga attgtaatga tgttctgcac 28201 atcaaacact ccagatcgaa ggaagttgac tttgacctct acttcaaatc tcgaaccacg 28261 tgggctagtt gtgtgaggaa ctaaggttac caaggcacga tcttgttcca aagctggaat 28321 actaataacc aaacatggtc taacttttgc tgcatagcct agatcgacga gccatacctc 28381 tccacgatca ggatttttca tgggcagctt cttgcttatc taactctaaa aaaattgctt 28441 cagcatttaa gactaaatct tcatccgata aaggcggaaa atctacatga atcacccgct 28501 tcaaaatttc caatgcaatc tctagccgct ctgagtctgg caagcgatca aaggtgctta 28561 ggaattcttt tactacagga gtcataattc ttttgcttgg atacacttca caatagcaca 28621 aacctcattt atgagttcat tcattattga ccagcgacta gtctaaaatt gaaacaatac 28681 aaagaactca gactatataa aaaattcaac gtccagccat ccctatatta aaaattaagt 28741 ttatgactgc taatcttgcc caaaccctaa tccgcgctgt tgaagaagca gactcctctg 28801 cccgtctact ggaagctgtc gagcaattgt cagctgcccg tctggaagcg gcgattccaa 28861 ccctgattgc agcgctaggc tacaacaacc caggggcggc ggttgcagcc gttgagggat 28921 taattcaaat cggcaagcca gcagtaacgc ccctgatgga attgctggat ggctacaact 28981 atggtgctag 
ggcttgggca attcgcgccc ttgctggaat tggcgatcca cgaggattgg 29041 agacgttgct tgatgcggca aaaaatgact tttctttcag tgttcggcga gctgctgcga 29101 gaggattagg tacgatcgcc tgggaagatt taccaccaga gcaacttaaa agcgcacaaa 29161 ctcaggtgct agagactcta ctacaagtgt cccaagacca cgaatgggtg gtgcgttatg 29221 cagcggtgac tgcgttgcag aaattggcga tcgccgtcgc tatctctcat accgattggg 29281 cgatggaaat tataacacat tttgaccgcc aggtggaatc tgaagacaat ctcacagtca 29341 ttgctcgcat ttggctggcg cagagagaaa ttcaggagta tgccgttgag gtactggata 29401 aggcaactgc tgcaactgtc agcactttgg acatcgactg gcaagcaact ctcgaaaaac 29461 tttatgcgcg taagcgtcag gaacaaccct tgccagaagg cgatccgcgc aagtttcgtg 29521 aggttgcggc tgcgattgcg agggcgaatg cttagtttga agaagaatga cgagtagtta 29581 aacctacttg tcgggattta tgggggctga tggtatattg tcacagagat ggcgatcgct 29641 atttttctca tcaagggtag ctagataacc acgcactcgt tcacaacccc gttctagcag 29701 ttcatctaat cctcctagct gccataactt aactttaccg tccttgccaa tgacagctga 29761 cagccaactt actgccatta gcactaaagc tgatgctttt aatttcatct tcatatgctt 29821 taaattccat caactgcctg ccttgtaagt cccaaaggac gacatttgcg tatgacgcgg 29881 tggcaattaa gctgccatca gcattgaaat tgatatgagt cctaactttt gatgtgtcat 29941 gactcaggac agtcgatttt cctgaaaaat tagacaaaaa aactttatgt atgatggaag 30001 ggagttcttg tcctatagtg actattctta actctccatc gggctgaaag ctgacgcttt 30061 ggacgctctg aaaagcaggg ctgtttcatt ccacaaggag gaggagtttg tcaatatcat 30121 ttgcgatcaa tcgttgggca caagtgatgt atgacacttg cgtaagtcct atggttcaaa 30181 gatgggctgg ttgttcatct ctatcttgct ccaaaggact tgggttatgt agctgcagct 30241 taaacaacca gccaaggagc gctccaaaat ggcacccaaa aatcgacacc tttgaatacg 30301 tttagaaagg cagcagaaaa aacaacagct taaaggtcgt actgctttgc ctgcatattc 30361 caaagagtag catactcacc tttgcgctgt aaaagttctt ggtgggtgcc cgactcgatt 30421 aaatgaccag ccttcatgac taagatacgg tcagccatac gcactgaagc tagtcgatgg 30481 gtaatcagga tggtagtttt gccctgagct agctcaacaa agcggcggta aacttcgtac 30541 tcgctacggg ggtcaagtgc agcagtaggc tcatcaagga tcagtagctg ggcttgttcc 30601 cgcacgaagg cgcgggcgag ggataacttt tgccactctc ctccggaaag ctctgtcccg 
30661 ccaaactgtt tgcccaaggg tgtgtcctct ccttgaggaa atttctcaac cagttctgct 30721 agaccagctt tttgggctgc acgcctaagg cgttttgggt catccagagc ttgaatgtct 30781 cccaaagcaa tattctcccc tatggtgagc gcgtagcgac caaagtcttg aaacactccc 30841 gcaatctgcc gccgccactc atctaaattt aattccctta aatctacacc atcgaccaaa 30901 acaaccccat ctgtgggatc atatagtctg gtgagtagct tcactagggt agttttgcct 30961 gctccatttt ccccaactat tgcgactgtt tcaccagggt aaagggtaaa agaaacatcc 31021 accaaagcag agcgattgtc tgggtagcaa aagtgaactt tctcaaaagt aatgcccgac 31081 cgaatcggat cgggcacggg ctttccaggg acgcacagtg ccattggtgg tttactgtct 31141 aggaagctga aaaactgctg catatagagc aaattctcgt agagttctag ggagtttctg 31201 accaattctt caagatttcg ctggagataa gccagtgatt ctaggaaaag gagtacgctt 31261 cctgggctga gccgtcctct aaaagcctgc tgcactaccc agtagaaggc aaaaccattt 31321 cctatagtac tcccgaacgc caagctagaa gaccaaaggg cttgcttacc cctcagatga 31381 cgcatagatt gatgtagaga ccgaaatgct tgcaggtata aatttgtaaa gaacgaactt 31441 agcttaaaca accgtacttc cttggcatag gtatcagtga gcatgactga agtgcaatac 31501 tgcatccgcc gggcttgtgg actgttctca aacagagtta gccaaatctt tgtctcatac 31561 tggaaagaaa caactatctg aggtatgact gcgatgagaa tcgcaagggg aatccaaaac 31621 cctagaggag ctagcagccc taccatcgct actaaagtga cgaaagaccg acctccctca 31681 gccaaagtct gaagcaagct cgccggctga tannnnnnnn nncttctgga gaagctgtag 31741 ttcgtcgtaa aactcagggt cttcaaaacg gctcaggtct ggaaaagtat ccgctttacg 31801 cattaaaagt aagttgatgt gagccgtgag tttatcgctg agatttcccc gaactgctgc 31861 tatccaagga ctcaggagca cctctagtag caaagcccct atccatccag ccactaaact 31921 cccaatgagt acattgccta attctctgcc ttgagtcaaa gctgttgcta tcgtatctac 31981 cactcgcttg ctaatccaga tactcagtgc tggaacaagc ccttgtaaca gaatagcgac 32041 ggtgagaaag actgtttccc ttggtgctgc tgtccataaa aatggtagag aacggaagaa 32101 agcttttgta tagctacgta caagctcaag ccagcttctg aaatcagtca tcaattcaac 32161 ccttgtttgt gtggcttcta ataaagatga atcttttatt attcttaatt taaagcctaa 32221 gctgcaagca attttctgaa gtggtgttga aagttattca cttcacggaa gggtttacca 32281 ccaatttcta cccatgcatg cgctttaatc ggttcagtag 
ctacacccac acaccacgtt 32341 gacgatagac ctttagtcag agcaaaaagc acattagaac cgtcaacatt agaagctgtt 32401 agcttgtaaa tttgtttaga tggtagatta aactggcgct ttaggaaagc ttccacatgg 32461 ttaatatcac gcacgcatcc ctcaaggctt ttataagaag caccatcggg taatctatta 32521 gggagataac aatcaatacc tattagcaaa gcgtacaagt agctcattcg ttaataacct 32581 gtagacgttt tcaagggtag gtgtgaatga atttagttgg ttgaaacaac cgtaaatcct 32641 tacaatttat atagatgtaa acaaaaataa cttatgcagt tttgaaaaac cccagagaac 32701 atttatggca atagccataa aggttagtct aggtttctca atggcgcatt tgtaaggctt 32761 taaaataaag cgccttttgt caataagtta aaatactcga tttataagtg agtacgctta 32821 tatttgatgt acatcaattg ccataaaggt ttggagtttc ggaccgcgaa ctagtcaaaa 32881 ctccagtgag tattatctta attaaaattg agacatgcta aatcgtcgcc tgagaacttc 32941 caacaaagtt gtaaactcct gtggaggagg tgcagtgact tcgatccatt ctccagaaac 33001 tggatgctct aacctcagtc gccaagcgtg aagtgcttga cctgataaat tcacaccaac 33061 ggaacgaccg gaactataca caggatcgcc aacaatagga tgaccgatat gggtgctgtg 33121 gacacgaatt tgatgggtgc gtccggtttc cagttggaag tgcatcaatg tatagttgcc 33181 aaaacgctct ttcactttcc agtgagtaac tgcagttctt cccccttgtt ctactggaac 33241 aaccgccatt tttttgcggt caactggatg acgaccgatg ggtaagtcaa tggtaccgct 33301 ttctgttttg ggtacgccgt agataatgcc gaggtattct cgccgtgcag tttttgcttt 33361 gagttgacct tgtagatgtt gatgggcgtg ttctgttttg gctattgcga tcgcccccgt 33421 tgtatcttta tccaatcgat ggacaattcc cggacgttgg actcccccaa ttcctggtaa 33481 attagggcag tgagctaaaa tagcgttgac taaagtaccg tcttgatgac ctggtgcggg 33541 atggacgact aacccagcgc ttttgttgat gataagtagg gagtcgtctt cataaagaat 33601 gtcgagagga atattttctg gttggagttc tagaggttca ggttctggta tttttatgga 33661 aatgcgatcg cctgttttga cagcagtttt cttggatgtg caaacattgt catttagttg 33721 gacattattt tgttctatga gttgttgaat gcgcgatcgc gataaatttg gtaattcttg 33781 ggagaggtag cgatctaggc gatcgctatt ttgtttgact tctagataaa attcgttcac 33841 acttaatttg gattcactcc tcgttcttga agtaaagcac ggagtgcttg tagttcggcc 33901 tcagcttttt ctgctctagc ttcagctttt tctgctctag cttcagcttg ttctgcccta 33961 gcttcagctt gttcttgttt 
ctgcgctagt tccaaatagg taagaaaccg ttgtccatca 34021 ggacgataaa tctgaagttc gccgtctgac atttcaaagc gtatgcttag acgaggactc 34081 acccaaccga tcatttgctc aatttctgct aattcatcac cagagcgtaa ccaaccagtc 34141 aactcacctg tgtctggatc atacaagtag tattcttcca caccatagcg ctcataaaat 34201 ttgtataaac ttatcatctc tttggcgcta ttgctagggg aaagaatttc aaacacgact 34261 tgtgggggaa tgttgtcttc tcgccactgt aaataggaac ctcggtttcc ttttggtctg 34321 ccgaagacaa ccataacatc aggtgctctg cgaatgatgt tgttgccctc aataggatac 34381 cacagtaagt cccctgcaac gaacacattc gggtgattgt caaacagcag atccaagttt 34441 tttttaatga gtacaatcaa ctcaaattgc tttgtattgt ctgccattgg ctgtccgtcg 34501 ctttccgggt aaatgatatc tctttggttg taagatgata gctgagtaag cattccagta 34561 aatcctattc ttgcacgtgg catagttcat tttaacgaac gcaagtgcgt gtgcctgaac 34621 gccaacacca agaaggcatg ctatttgcag tcccgctcaa tttatatttt taattgctta 34681 tgcgtcctcg ccaagaaatt acggatatgt tctcaacatt catgcagttg gaaggagaca 34741 gattcagcaa atggttaact gatactaaac tgtatagaaa tatccaaaat cacttggggc 34801 gttcgtcaga agcactgaaa tcagaaaact tttgggcgct ttactggcac aagcactggc 34861 gctctagatc caataatctt gccagaatgc atttgtcggc ttatttacaa gaaccatgtt 34921 attgggcggc ttcaaaaaca gttgccaaat ttacgaacag ccaatatagt ctagctgatt 34981 acttccaaat ggcgatcgca gaagtcgaaa taattcttaa agacttcaat ccggaaaaat 35041 gctctagttt aaaagcctac gccattatgg caattcccag tcggctgaga gatattttac 35101 gccaacgcaa agaagctaat ctttgcacca attgggcatt gttacgcaaa gtgagcaaaa 35161 agctgcttag tgaagctctt agtgaagctg gattatcaca aagcgcaatt gctcaatatc 35221 gcttagcttg gacttgtttc aaagaactgt acgtccaaaa ccaaccagga ggtagcagta 35281 agttaccaga accgaatcgt caactttggg aagctattac taatctctac aaccaccaac 35341 gccaaagcca actcactcaa cctactgagc aacgcaacgc acagacaatt gaacagtggt 35401 taaatcagac agctctctac gtgcgagctt atctgtttcc acctgtgaaa tctctaaatg 35461 cttttaagca agatgatgac acaacagtaa cgcttgattt gccagaccca tcctctgact 35521 caccaatggc tgatatgata gccgcagagg atgtgcaaaa tcgacaaaac caaatatctc 35581 agatgttttc tgtattgtta aaagctttac agaacctgga tttacaaacc caagaagtac 35641 
ttaagcttta ctatcaacag ggactcactc aacaacagat tatgcaacag ttgcagatga 35701 gtcagcctac ggtgtctcgc agactggtta aaggtagaga atcaatactt gcagcactga 35761 ttaagtggag ccaggatttg aatatttcta ttaattccaa ccaaattaaa gatatgagtc 35821 ttgctttgga agaatggctg agaaaccagt atggtgagta caacataaat ccgtagtttt 35881 cataaggtag taaaaatgac ttgcgcgttt gcagatccaa gagaatggtt gttagaaata 35941 tcaccaacaa ttcaggcgca atcctggcag caaagccaga tttttgccac gcctagcagt 36001 cgctggtgcg cttacataaa tcaaatttgt ctccatgcct tcttggattg gatctcaact 36061 gaagattttc cccaagcgag tgtttggtat acttcccctg gtacgccagc tttttgggaa 36121 tttgtcaatg gcacagcaat tttgttagag gggaggcgag tcgtgttgat ccccagtgag 36181 gcaattgatg atagcgagtt agaagttcct caggaatggg tggacattcc cagttgggca 36241 gcagactact atttagcagt gcaagtccaa ccagatgccg agtgggtgcg aatttggggt 36301 tacacaactc atactgaact gaaatctctg gctcactatg actcagtgga taggacttat 36361 tgtatagatg cacggcattt gacaaaagac ctcaatgctt tctctttgag ctatcaattt 36421 tgtggagaag aacagatgaa agctgcaata gctccgttac cacaattatc cacccaacaa 36481 gcagaaaatc ttgtgctacg gttaggcaat tcctatgtga ctttctctag gcttggagtg 36541 ccatttgcaa cttggggagc actactagag aatgagcagt ggcggcagcg tttgtaccaa 36601 caacgacagc agtcacaatc ttcccaagtg caagtgaatc tcagtcgttg gctggaaggt 36661 atatatgata atacttggga agcaatagaa acatttttcc agttaaattc cagtagtcta 36721 gctttcaatt ttaggagtag ctctgggcta aatgttagta gtatcaaacg agccaagctg 36781 attgacttgg gaatggaaat agagtctcaa aaagtggtgc tattagttgc cttgatacca 36841 gaggatagcc aacaagttag tattcgcgtg caacttcatc caacgggtgc agaatcttat 36901 ttacctgtta atatcaagtt agctttgctt ttagagtcag gggaaattat tcaggaagtg 36961 caagcaagag ttcaagataa ctatatacaa ttgaaacgct ttgatggaga agtaggagaa 37021 tgttttagca tccaagttgc ttttgatagt taccagataa cagaaaattt tgttatttag 37081 tcattaatca ttatattatg ttcagatgaa cacttataaa aaacgaatca cctcaccccg 37141 ccctccgggc acccctctcc ttattaagga gaggggatgg gggtgaggtc tttttattgt 37201 aattaatcaa acgaacttga tattagtcat aagtcagaac gttaatgaca aatggctctt 37261 ggacaaatga caactaacaa ctcacaacta ccaaacaaca aatgagtaag 
ctagtcgtct 37321 tcagcttgtt gggaggagac ttaaatcaag gatttcctgt tgtcacggct caactttggc 37381 ataataacca atttttcaaa ataacaggga gtctgccagc cataccggaa ctcagcgaac 37441 tctataaaag gtggcagtta ctctacgaag ctgtccatca acgtttgggc agcaatcaac 37501 gtataaaagt gcattcacaa gatattacta atatttctgt gaatgacttc gagcaagtgt 37561 gtcaacagct acaagcaaat atcaatgctt ggttgaaatc tgaatcattc caaaatattg 37621 agcgacagct acggactcta ctcagtcgag atgatgaaat tagagttatt attgaaacaa 37681 atatagcttt actgcatcgc ttaccttggc atctgtggga tttttttgag gattacccga 37741 aagctgaatt agctttgagc aatcatgagt atgcatctcc tcaggtagtg cgaaaatctc 37801 ctacagatca agtcaatatt ttagcaattc taggtaacag ttttggtatt gatattgaga 37861 aagaccgctc tttgttgcaa ggtttaacag atactcaaac tacgtttctt atcgaaccaa 37921 cacggaaaga acttgacgag caactgtgga atcaggattg ggatattctg ttttttgcag 37981 gacatagcgc tagtattgca gatggagaga taggggaaat ttatattaat caaagcgaga 38041 gtttaacaat ttcccagttt aagaatgcac tgaaggcggc aattacacgc ggtttaaaac 38101 tggcaatttt caactcctgt gatggaatta gtttggcacg aaatttggct gatttgaata 38161 ttccccaaat aattgtgatg cgagaagggg taccagattt agtcgcacag gagttcttga 38221 aaaactttct tgtcgctttt gcgggtggga agccatttta tctggctgtg cgagaagcgc 38281 gggaaagact tcagggattg gagaatgact ttccttgtgc tagttggtta ccagtcatct 38341 gccaaaatcc tacaactgtt ccggtaactt ggcaagaact gcgcagtggt tggggtgata 38401 ctgaaactgg cagcgaagtt tcgacgaaaa gcagaatttc gactcgaagg tgcaaacttt 38461 ggactgtact gctgagtact gttattgtga ctctttcaac aataggtctg agatacttcg 38521 ggctttttga aaaacttgaa ctgcaaatct ttgaccaaat gctactcctg cgaccaaagg 38581 aagagttaga tccaagattg ctcgttgttg aaatcaccga aaaagatatt caatccccac 38641 aagagatgat cacaggagtg aaatccattt ctgatagtac gttagctaaa ttactcaaca 38701 aattacaaaa gcatcaaccg cgagttattg gattagatat ttatagagat tttgctgacc 38761 cccttcggct acgctcaggg caagccccaa ataagtcaaa gccaatacaa ctccccactg 38821 aactgagcaa agaaaatgtg gtagttgtct gtaaaggcag agatcgtaaa cacgaccccc 38881 aaggtgttaa acctccatta ggagtacctg aagaacgtca gggttttacc gatgcaattc 38941 aagatccaga tggtatcgtt cgtcgtcaaa 
ttttgatgat ggcacaagag ccttcgtctc 39001 cctgtacaac gcctcactca ctctcgttac aattggctgc tcgctactta tcttacgaaa 39061 atattaaacc tgatttcaat gaagattatg tacagtttgg ttctaaagtc tttaaacgct 39121 taaagcctgg tcgcagtggt ggctatcaac aaacagttgt aggaggaatt caaatacttg 39181 tcaactaccg tgatgtagat tacgaaagag tttctctgga agatgtattg agtgataaag 39241 ttaactctga ttggctcaaa gataaagtag ttctcatcgg ggtgacagct aacaccgtca 39301 gcgatacttg gtcaactccc tactctgctg cccagcaaaa ttaccagaaa ataccaggcg 39361 tatttataca agcgcaaatg gtcagccaaa ttttgagtgc agttttagac aaacgtccta 39421 ttcttggggt tttgcctttc tggtgcgata ttctctggat atggggatgg tcgagtgtgt 39481 ctggtttgat tgtttggcgc tttcgttcgc actcagataa gggatgtgcc atatttgtga 39541 tagttatcat tttatatgga gtttgtgcga tcgctctctg cgcggctccc tttggaagca 39601 tcgccatttt taagcaagca gtatggttac catttattcc ctcagccttt gctgtagtta 39661 caagtggtgt tgttgtgtta tttattcaac gtcgtcggga attttccagt actccacaat 39721 ttttagttga ataaaacgac aagagtaaac catccttgac aggttaagat cataaacaat 39781 ttgtcaataa gtaatagtgc ttaaaatttt tatgcaattc tttctagtta tgaattaata 39841 aaaaagtgca taagtcgatc gccctgacat ctatacctaa ttgaggattc actcgctaaa 39901 tttccttctt aataggagag tgcaatgtct ggggtatcta tacgtttgta ctggtggcaa 39961 agtttaggaa ttgtgattgg cagcgtaata gccttatata caaatagcgc tcgcgcccaa 40021 atcacgccag atgggactct gccaaataat tccaacgtca gattagaggg taacactaga 40081 gttattgaag gtggaacaac aagaggcgct aatctgttcc acagcttctc tgagttttct 40141 gttcgcactg gcgagacagc tttgttcaat aatgcacggg atattcaaaa cattattagt 40201 cgagtaacgg gtaaatctat ctctgagatt aatggtttaa tcaaagctaa cgacgcagcg 40261 aacctgtttc taatcaatcc caatggaatt atatttggac caaatggcag cttagatatt 40321 cgcggttctt ttgttgctac aacagccaac gctttaggat tcggtaatct aggcagcttc 40381 agcgctacaa atccagaagc accttcaccg ttactgacga ttaacccttc agctttttta 40441 tacaaccaaa tcactacagg agcaattcaa aataactcag ttgcacctgc tggtcttcat 40501 ttagcaaatt ttgaaacatt tggtttgcga gtacccgatg gtaaaagctt attactaatc 40561 ggtggtaatg taacgatgga tggtggacaa ttaaatgctt atggtggacg aatcgaatta 40621 gcaggattag 
ctggtccggg aaatataaac cttatcattg atggcaacac ttttagtact 40681 caagtacctg aattgccacg aagtgatgta tttttgaata atggagcgat cgctaatgtt 40741 gcttatgcca gtggtggtag tattgcagtc aacgcacgca acatagattt attaaaaaac 40801 agttcaatca atgctggtat cgagacaact ttgggttcag caactgcaaa agcaggaaat 40861 attactctta acgccacgga agcaattagg atagaagagc aaagcgaaat tagaaatcag 40921 gttcgcccga atggcatcgg gaatgctggt gatattaata ttactactgg cttattgtcc 40981 gtaaagggcg gagctagaat ttacactagt tccttcggtc aaggtaatgc cgggaatatc 41041 aatataaatg ctcgcacaca agtctctttt ctagatggaa gttttggctg gactacttta 41101 gaacctacag gtgtaggaaa aagtggagat ttgcagattc aagctgattc tgtatcaatt 41161 ggtaacgact ctcaattaaa ggttagtact ttaggtacgg gaaattctgg caatttaaca 41221 attgatgctc gtaataccgt tgatttttat aataattcct tggccttaag ccaagttgaa 41281 aaaggagaag gtaatggcgg tatcataaaa atcaacacgg gttcacttgc actgaataat 41341 aattcgtttc ttagtgctag taccattgca aaaggaaatg cgggtagtgt gcaaatacaa 41401 gctcgcgatg ctgtctcctt tgcaaatagt agagcgttta ccacagtcca agaaggcggc 41461 aaaggaaatg gaaatgacct ttcaatccaa gcaaggtcag tctcggttac taatggatca 41521 tttttaacag caggtacttt tggtgaaggc aacgcgggta atgtaataat tgatgctgca 41581 gattcagtta tatttgatga taaaagttat gctgctagca gcataggttt ggaaggatat 41641 ggggtaggat taggcaacgg aggaataata aaaatctcta caggttcgct tcagctcgcc 41701 aatcaagcca aattaacagc agaaagctac agcagagaag gaaaagcagg taatatcatc 41761 attaatgccc gcgataccgt ctctttatca gatggaagtt ctataaaaac cctgttagga 41821 ggaggaatca agggtcaagc aggtgatata agcattacca caggttcact ctcaattgcc 41881 agtgcttcct cgctccaggc tgatactttt ggacagggaa acggcggtaa tattaatatt 41941 caagcacgag ataatgtgtc ttttataact gagggaggag cttatagtcg agtacaacta 42001 aatgccattg gtacgggagg tgatatagat actaaagcag attctttgac gctgagtgat 42061 ggttcttttg tggcaacaag cactctaggg caaggaaatt ctggcaatgt aactattaac 42121 gccgctaatg ccgttaactt aagtggagtc aaccaaaaag gttttccagg aggtatttac 42181 agcgaagttg cgaccgataa accaaatagt aatggaggca atgtaaatat cactgctaga 42241 tcattgaaag tcactgatgg tgcgctcttg gctactcgta caaatggtaa tggaaatgct 
42301 ggtaaggtca atatcaacgc gactgatttt gttttatttg atggtgtaaa tataaaaaac 42361 gttaatggtg acattaaaat aaacagaagt ggtgcctata gtcttgtgtt ccctctagga 42421 acaggtaggg caaatgatat taatattaat acagactcac tttttttgcg aaatggtgcg 42481 aacttgcgcg caggcacgga cggagtaggg aatgcaggaa aaataaatat tattgcccaa 42541 gatatggtat ctatcgatgg agcaagcagc aatggattgc caagtaggat tgagactcag 42601 gtaatagaaa atgctgtggg caatgcaggt aatgttgatc ttagaactag atcacttctt 42661 attaccaata gcgccgaact atctagcagc agtcagggaa atggagcagc aggcaattta 42721 acggtaaatg ctaatttgat tagtcttgac aaccgaggta aaatttcagc caatacgatt 42781 gcacgagaag gcaatataaa tctaaatccc agagatttat taataatgcg tcgtaatagt 42841 gaaatcacta ctaatgcttc tggaacaaat attactggtg gcaatattaa aattgatggc 42901 aaaaatgcct ttgttgttgc agttcaaaat gagaatagta atattcgtgc tgattctgaa 42961 aactttcgtg gtggaaatgt cacgattaat gtagctaaca tctttggttt tcaatcccaa 43021 acaacatcat tcccgcaaac gagtagcatt actgcaaaag gagcaacgcc agatttaagc 43081 ggcaatgtgc aaattaacac acctgaagta gaccccacta acggtttaat cgaactccct 43141 actaatcttg ttgatgtctc gcagcaaatt tccaccgcct gcactcctgg aagtcggcaa 43201 agccaaagtt cttttgtgtc aacagggcgt ggcggactac ccatgagtcc cactgaacca 43261 ttacaagata caagcactct atcag // LOCUS NODE_573_length_42012_cov_5.14329042012 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 42012) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 42012) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..42012 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..469) /locus_tag="DP116_04205" CDS complement(<1..469) /locus_tag="DP116_04205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876930.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptide-binding protein" /protein_id="PRJNA477356:DP116_04205" /translation="MIVCPNCNHPNPDGAVQCEACYTPLPATSNCPSCGATVQADAAF CGQCGYNLRAATPTAVAAATVAPDIAPDVPPLVNPEPLVQPQPMSVNGADYTFPHSAA IPPTAVAFGDPFVETTQSSPPPPTITTPPPAPAVSEQPSIPTPPPIPTPPPPPV" gene complement(630..1361) /gene="pgl" /locus_tag="DP116_04210" CDS complement(630..1361) /gene="pgl" /locus_tag="DP116_04210" /EC_number="3.1.1.31" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876931.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="6-phosphogluconolactonase" /protein_id="PRJNA477356:DP116_04210" /translation="MSKKVEVLCDQKALIARALELILSKIEIAIKQRGQFTIALSGGS TPKPLYEAISTQKLPWDKIHVFWGDERYVSPDHPDSNQLMTRRAWLDRVEIPPGNIHP IPTDEADPAVAAAKYERHLQEFFHASPGEFPALDVNLLGMGDDAHTASLFPHTEALKV TDKLITVGNKDGNPRITFTYPFINSAHCVIFVVAGANKRPALAQVFAPVADEFTYPSR LIQPQGELWWLLDAAAGSDLKKDAG" gene complement(1807..2547) /gene="rph" /locus_tag="DP116_04215" CDS complement(1807..2547) /gene="rph" /locus_tag="DP116_04215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876932.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribonuclease PH" /protein_id="PRJNA477356:DP116_04215" /translation="MAWQRPDGRQSYQLRPISFQQNFTRFAPAAVLTKCGDTQVLCSV SVTQGVPKFLEGTGKGWLTAEYRMLPSATPQRQEREFLKLSGRTQEIQRLIGRSLRAA LDLEAIGERTLTVDADVLQADAGTRTTAITGGFVALAHAISTLLQQGVLERSPIIGQV AAISVGLLQGEPFLDLNYIEDVRAEVDFNVVMNQQLGIIEVQGTAESGSFSRTQLNSL LDFAQKGIQQLLIAQREAIANWNDLYQG" gene complement(2691..3092) /locus_tag="DP116_04220" CDS complement(2691..3092) /locus_tag="DP116_04220" /EC_number="2.7.4.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136843.1" /note="essential enzyme that recycles AMP in active cells; converts ATP and AMP to two molecules of ADP; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenylate kinase" /protein_id="PRJNA477356:DP116_04220" /translation="METGELVTNEMIIESIQIELKKPDFKDGWVLEGYPRTAEQAEEL DFLLDHLGHNLDWAIYLQVPQAVMVNRSMGRFLPDDLPEIVQRRVETFYDRTVPILEY YDRRRRLLTINGDQSPEAVQQDIITLLITSQ" gene complement(3264..3389) /locus_tag="DP116_04225" /pseudo CDS complement(3264..3389) /locus_tag="DP116_04225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308870.1" /note="essential enzyme that recycles AMP in active cells; converts ATP and AMP to two molecules of ADP; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="adenylate kinase" gene 3648..4184 /locus_tag="DP116_04230" CDS 3648..4184 /locus_tag="DP116_04230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205711.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="P-loop NTPase family protein" /protein_id="PRJNA477356:DP116_04230" /translation="MVAQLETPSLNSTLDFPYSVEGIVQVFTSSHRSFFTSVMSQALR IAGQGTAVLVVQFLKGGIRQGQENPIRLGQKLDWIRCDLPRCIDTPHLDESETKSLQK LWQHTQKVVLEGQYSLVVLDELSLAIHFGLIAETEVLAFLDKRPNHVDIIFTGSQMPK SILEIADQITEIRRSHQP" gene 4906..4986 /gene="dcd" /locus_tag="DP116_04235" /pseudo CDS 4906..4986 /gene="dcd" /locus_tag="DP116_04235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876936.1" /note="Catalyzes the formation of dUTP from dCTP in thymidylate biosynthesis; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="dCTP deaminase" gene 5167..5766 /locus_tag="DP116_04240" CDS 5167..5766 /locus_tag="DP116_04240" /EC_number="3.5.4.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dCTP deaminase" /protein_id="PRJNA477356:DP116_04240" /translation="MLKNDRWIIEQAQLGMIEPFEPSLVRAVSIDSTGLLRPVLSYGL SSYGYDIRLSPVEFKVFRHIPGTVVNPKRFNPDNLESVPLHSDEDGDYFIIPAHSYGL GVALERLVIPANITCLFIGKSTYARCGLIWNLTPGESGWTGWLTLEVSNSSSADCRVF ANEGVVQALFMEGEPCGTTYADRAGKYQDQPHKVTLAKV" gene complement(5996..9364) /locus_tag="DP116_04245" CDS complement(5996..9364) /locus_tag="DP116_04245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877188.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="helicase" /protein_id="PRJNA477356:DP116_04245" /translation="MPTHDIIDNRNQKLVDQIKRILDSSEAAHFAVGYFFISGFTAIA ERLPNIKELRLLIGNTSNRETIEQIAQGYRRLELIEDKIEAQKYPKRSEEKRMANVTA ANIRSSIELMDQTDEAEILVKNLVQMIEEKRLHVRLYTKGTLHAKAYIFDYGTVYDHK GRALERAEKGIAIVGSSNLTLSGITHNTELNVVIHGNDNHTELTHWFNELWNEAEDFN EALMREMKQSWAGSTVRPYDVYMKTLYSLVKERLEDTAPKDLILEDAITKQLADFQKV AVNNAVQNIRDYGGAFVSDVVGLGKSFIGAAIVKRFEQTERARPLIICPAPLIEMWER YNEVYQLNARVLSMGMLKEDEESGFKVLLDDFRFQDRDFVLIDESHNLRNHTTQRYKV VEAFLGAGKRCCLLTATPRNKSAWDIYHQLKLFHQDDKTDLPIDPPDLKEYFQLVEKG EKKLQELLSHILIRRTRNHILRFYGFDSQTQQAVDPANFREYLDGTRRAYVIVGGKHR FFPKRELETIEYSIEDTYQGLYEELRQYLGKSRKRQLAKPPTNELSYARYGLWNYVLK PKQKQEPYNTLQRAGANLRGLMRVLLFKRFESSVYAFQESIRKLLIVHERFLKALSQG FVAAGEEAQTLLSEDYNQAEEQDLMDGLQKVSDKYDIADFYTEKLQQHIEHDIKLLKN ILALVEPITADKDTKLQTLIKWLSKPSLKNKKRLIFTQYADTAKYLYENLNPGGQRDD FDVIYSGSNKNKARLVGRFAPNANKEYKFKPGESEINTLIATDVLAEGLNLQDGDLII NYDLHWNPVKLIQRFGRIDRIGSEKDVIYGYNFLPELGIERNLGLKQKLKNRIQEIHD TIGEDAAILDRTEQLNEEAMYAIYEQQGKQLSLFDFEAEEDFLDLNEAEEILRKLQKD NPSEYERIAHLPHGIRTAKFSMQQKGTYVFCEASDPSKPDLKGYQQLFLLDDRGEIIS RDIPRILGAIKADTTTPTSTLPQEHNSTVMRLKRQFAEEVKHRQAEREFKGRLTQGQR YILRELRIFFKAIADEELKSQINILEQKFRACVNQAVNRELNKLRRDSFTGQELFTQL AQIYLQHNMHQWQDNSLPTFSQPIPMIVCSEALV" gene complement(9651..9890) /locus_tag="DP116_04250" CDS complement(9651..9890) /locus_tag="DP116_04250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877191.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04250" /translation="MQAALRITTKVLPGNKIEIELPEAEIGDTVDVFVILPEKPKVKQ RSVIEILEEVHAKRPSKTAEEIDRQLQEERDSWDS" gene complement(9904..10503) /locus_tag="DP116_04255" CDS complement(9904..10503) /locus_tag="DP116_04255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877192.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_04255" /translation="MQSPVNFFTVEEYLKAEDKSEIRHEYLGGQVFAMSGGSKEHNII TLNIASRLRSHLRGGSCSVFMADMKVRIELASQNKSIFYYPDVVVTCDRPLGGSLLEH RQDQDRFCLNYPCLIIEVLSPSTEVTDRREKLVNYRTLESLREYVLVSQDEIKVEVYR KDNQGNWSVQTLFKGDDLKLDSVGLTLTMTDVYEDVFTL" gene complement(10563..11120) /locus_tag="DP116_04260" CDS complement(10563..11120) /locus_tag="DP116_04260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877193.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04260" /translation="MFTRRGIDINAYPAIKALLAQWQIQLEPKPRNWSASKEWLGRKS GSYKWYEIQDEVAYYTAFDKPKIVYPVIAKESRFAFDTTGAFTNDKGFIIPVPDLYLL GVLNSSSIWEYLKNFCSVLGDADKGGRLELRAIYISKIPIPNASTTEREAISKLVQKC LDAKGVDCQAWEKEIDERVAALYGL" gene complement(11230..11439) /locus_tag="DP116_04265" CDS complement(11230..11439) /locus_tag="DP116_04265" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04265" /translation="MGFHKGSKQLRVGNFVTTTPMRSDFGIKVTRLLPTLEPCIDGIA TDIEEFTRSRALHSIEFNRLDHFSS" gene 11563..12012 /locus_tag="DP116_04270" CDS 11563..12012 /locus_tag="DP116_04270" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04270" /translation="MKNRRQRVSRSTIRKYSAELEQRLREQSKLERLEKLSEFRDWLI HGLLVLLLILGGVFFSLVDRVFDLLYCKILQLFPFFSCSGDTIVRATRKDPIYVLLMY FLIYFIFIVVFKLYVEDKKKRSQNTHTNSVCNEYDDYDTDDDYDHDR" gene complement(12309..13817) /locus_tag="DP116_04275" CDS complement(12309..13817) /locus_tag="DP116_04275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318807.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thymidylate synthase" /protein_id="PRJNA477356:DP116_04275" /translation="MTATGKAIKYNYNALHKPKQLIYGQGQTAVITGWSVKGAIAKHL HPQEYAVIGQLYSPTRGINLLIRNLLYNPHVRYLVVLNATREDKNAGAGICLLDFFRN GFEEGLSDTGRRCWVIRSAYAGYIDIEVEASALEKLRQSIEFQEAKSITEACTLVKSF AQTEAVEPWGFPLEFPMSTLVPTVLPGQRYGHRVEGKTIAETWVKIIHLIKTTGTIRP TVYEGQWQELIDLIAVVTSEPEKFYFPEPNYLPIDRSFLQEYISQILDDAPNQEGVKY TYGQRLRSWFGRDQIQQVIDKLVIDIDSARAVMSLWDVTNDANDSPPCLNHIWVRIVD NELSITATLRSNDMFSAWPANAMGLRALQQHIRDEICKFSTHFLKMGPLITVSQSAHI YDDCWENAEKVIQSQYTKICQQRDYADPVGSFVIAVQDGKILVEQMTPGSGEVVNCFL GKSAKQLYQQIAASCPGLQVEHAMYLGTELQKAEFVLSMEQELSYEQDKSIK" gene 14116..14331 /locus_tag="DP116_04280" CDS 14116..14331 /locus_tag="DP116_04280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_04280" /translation="MQYKGAIIDEAGKMNNFALEPKVYVDEQGDRTGLTPYAELLNGR LAMIGFVSLIALEVFTGHGVFGLFRSL" gene 14495..14941 /locus_tag="DP116_04285" CDS 14495..14941 /locus_tag="DP116_04285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197304.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04285" /translation="MSNVPKIDRQREHQANERTFLAWLRTSIALIGFGFAIARFGLFI RQLNFTLTQQQHEPSPYPFFTSENLGISLVIFGILTLLLAVWRYNQVFSQIEEGNYQP SKFTVWVMTGVVIIFGILSIPLLLLRSHVPRSTYPLPNQPQSRNFR" gene 15080..15661 /locus_tag="DP116_04290" CDS 15080..15661 /locus_tag="DP116_04290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thioredoxin family protein" /protein_id="PRJNA477356:DP116_04290" /translation="MALTASTMLPLGTKAPDFHLSDVVHEETISLATFADKKALLVMF ICQHCPYVKLVKAELAQLGKDYIHDGLGIVAISTNDVHNYPDDDPEFLKAMAIELDFK FPFCYDESQETAKAYTAACTPDFFLFNAKRQLVYRGQLDDSRPGNGKPVTGTDLRAAI DAVLSNQPVTADQKPSIGCNIKWKPGNEPSYFG" gene complement(15965..16480) /locus_tag="DP116_04295" CDS complement(15965..16480) /locus_tag="DP116_04295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457727.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04295" /translation="MLTHHRKPVCLSLISTDLPVWSVVETAATLYQKDRDRFHLLLSA PPVKQSEKVDDSLLTENTLCQVQGSTPVPSGPRVLWLEISPYRVTMTMQGNGQLSYRH FWEQGVYGITRYWLPVESLQPSEPIRLRNFTRSLKLDGHPLPKHLRVEYELWAGEIQL GCYILSLEIKH" gene 16607..17959 /gene="aroA" /locus_tag="DP116_04300" CDS 16607..17959 /gene="aroA" /locus_tag="DP116_04300" /EC_number="2.5.1.19" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136852.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-phosphoshikimate 1-carboxyvinyltransferase" /protein_id="PRJNA477356:DP116_04300" /translation="MSASVITVEKRENASQNLIIEPPGSLSLQGRIRVPGDKSISHRA LMLGAIAEGETQISGLLLGEDPCSTASCFQAMGAEISELNTELVRVKGIGLGNLQEPV DVLNAGNSGTTIRLMLGLLASHAGRFFTVTGDSSLRSRPMSRVIKPLQQMGADIWGRK GNTLAPLAVQGKSLKPIHYNSPIASAQVKSCILLAGLTTEGKTTVTEPSLSRDHSERM LRAFGADLVTDPETNSVIITGPTQLYGQTVIVPGDISSAAFWLVAGAIVPDSELVIEN VGVNPTRTGILEALAMMGANIQQENQREVAGEPVADLRVRSSRLQSCTISGDIIPRLI DEIPILAVAAVFAEGTTIIRDAAELRVKESDRIAVTAQQLNKMGAKVTELPDGMEITG GTFLSGAEVDSHTDHRIAMSLAIASLKASGQTIIQRAEAAAVSYPEFFTTLQQVCGDR " gene 18354..20147 /locus_tag="DP116_04305" CDS 18354..20147 /locus_tag="DP116_04305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412069.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_04305" /translation="MKFRIFKQLFQRSWIVALLGLLVAVTLNGCNPSQFKSQAAQVPR MITATLGAPSTFNSALNETAYGVFGFIYDSLINENPLTNKQEPALAESWEVFDNGKRI IITLREGLKWSDGQPMTADDVVFSYNEIYLNPKIPTPVKDSLKIGESGATPKVKKLDE RRVEFTIPEPFAPFLRWVGGITILPAHVLQESIRTTGSDGNPKFISIWGTDTDPKKIV GNGPYVMESYVPSQRVIFKRNPYYWRKDTQGKSQPYIERIVYQIIESTDNQLISFRSG QLDDLEVTPEGFSLLKREEKRARFNIYNGGPDTSTTFIAFNLSKGKNSKGQPFVDPIK SGWFNKKEFRQAIAYAINREAMKINVFRGLGEPQNSFVYVKSPFYLPPEKGLKVYNYD PEKAKKLLLQAGFKYNSQNQLLDADGTRVRFTLLTNVERKTRADMAAQMRQDLANIGI HLDLQVLTFNAYIDKLKVSQNWDCYLGGFAGGGVEPHGASNIWRIKGASHAFNQGSQP GKPPIIGWEASDWEKEIDRLYVKGAQELDENKRKEIYYEYQRIASEQLPFIHLVERLN LQAVRDRFQGIKYSALGGPFWNLYEIKVTQN" gene complement(20224..20328) /locus_tag="DP116_04310" /pseudo CDS complement(20224..20328) /locus_tag="DP116_04310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995207.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" gene 20412..20621 /locus_tag="DP116_04315" /pseudo CDS 20412..20621 /locus_tag="DP116_04315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316640.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" gene complement(20693..20890) /locus_tag="DP116_04320" CDS complement(20693..20890) /locus_tag="DP116_04320" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04320" /translation="MLERDTSPITNLAAISQLDLLRQLYTHVIIPTAVYDEMVAVDKP VPGAVEVQTLPWIQTQVITDF" gene complement(20891..21136) /locus_tag="DP116_04325" CDS complement(20891..21136) /locus_tag="DP116_04325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136697.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UPF0175 family protein" /protein_id="PRJNA477356:DP116_04325" /translation="MSVLIPDDILRATKMTEDELKLEIAIMLYKQEKISSGKARAWTG LTVIEFQHELAKRGLCINYDVQDFQSDVRTLQSMGLL" gene 21363..22136 /gene="larB" /locus_tag="DP116_04330" CDS 21363..22136 /gene="larB" /locus_tag="DP116_04330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nickel pincer cofactor biosynthesis protein LarB" /protein_id="PRJNA477356:DP116_04330" /translation="MTDDKSLRSLLESVANGNVTPDGALEKLKHLAYEPVGDFAKVDH HRALRTGFPEVIWGPGKTPDQIAQIMEAMRQRNSVVMATRIEPDVFATLETKVQGLRY YNLARICAITPPTIEPQYPGTIGILSAGTADLAVAEEAAVTAELSGFHVQRLWDVGVA GIHRLLNNRHVIESASVLIVVAGMEGALPSVVAGLADCPVVAVPTSIGYGASFGGLAP LLTMLNSCAAGVGVVNIDNGFGAAVLAGQIVRTAYKLRK" gene 22213..22500 /locus_tag="DP116_04335" CDS 22213..22500 /locus_tag="DP116_04335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002789208.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1778 domain-containing protein" /protein_id="PRJNA477356:DP116_04335" /translation="MSELSSAKDCRIDLRVTQEQKELLERAASLKGISLSAYTLFHVL PAAKQDIDTHERLVLSNRDRDLFMSVMENPPELKGKLKSAIHKYRKKYDKS" gene 22487..23008 /locus_tag="DP116_04340" CDS 22487..23008 /locus_tag="DP116_04340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749005.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_04340" /translation="MTSHREARWTFVPIDKKHQRDSFDCGYPILNDYLKKYARQNHNK GVAKTFVAIPASGSLKIDGYYTVSASVIEYESLPESYQRGMPAYPIPAILIGRLAVDH PVKGQGLGGELLADALYRAVRASQEIGVYAVRVDAIDFQAREFYLKYEFIPFQDQELS LFLPMATIIGEFS" gene 23030..24418 /gene="argH" /locus_tag="DP116_04345" CDS 23030..24418 /gene="argH" /locus_tag="DP116_04345" /EC_number="4.3.2.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316645.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="argininosuccinate lyase" /protein_id="PRJNA477356:DP116_04345" /translation="MTKQQTWSQRFESALHPAIARFNASINFDIELIEYDLTGSQAHA KMLAHTGIISQEEGEQLVAALEQIRQEYRQDKFHPGIEAEDVHFAVERRLVEIVGDLG KKLHTARSRNDQVGTDTRLYLRDQIQQIREQLREFQQVLLDIAEKHVETLIPGYTHLQ RAQPLSLAHHLLAYFHMAQRDWERLGDVSHRVNISPLGCGALAGTTFPIDRHYTAKLL DFEGVYANSLDGVSDRDFAIEFLCAASLIMVHLSRLSEEVILWASEEFSFVTLTDSCA TGSSIMPQKKNPDVPELVRGKTGRVFGHLQAMLVMMKGLPLAYNKDLQEDKEGLFDSV VTVKACLEAMTILLREGLEFRTQRLAEAVAEDFSNATDVADYLAARGVPFREAYNLVG KVVKTSIAAGKLLKDLSLEEWQQLHPAFAEDIYQAILPQQVVAARNSYGGTGFAQVKG ALLTARAQITAK" gene 25175..25669 /locus_tag="DP116_04350" CDS 25175..25669 /locus_tag="DP116_04350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412075.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NUDIX hydrolase" /protein_id="PRJNA477356:DP116_04350" /translation="MNILAFFAVAVQSTRNLWRIGQTVLGIIFRHPIPGTSIIPILPD GRIVLIRRRDNGLWALPGGMVDWGEDVNTAIRRELIEETGLELVSIRRLVGVYSAPDR DPRIHSICIVAEAIVQGKMEIQDTLEVMEIQAFPLDSLPPGQMSHDHNRQLQDYLNGL TTLA" gene 26100..26924 /locus_tag="DP116_04355" CDS 26100..26924 /locus_tag="DP116_04355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_04355" /translation="MHSIFRNSRIKTSQGMLFWREIGEGIPIIFLHSAWKDGSQWISV MEMLAQDFHCFAPDVLGFGESDFPNVHYSIDLQVECLAQLLHTLKRALKQKRVYLIGD SLGGWIAASYALKYPEQINGLVLLAPEGVEIEGQKKRWETMQRILNRSTLLFQLLRLF RPITKIFGLDQKIEQDWEFHQRMLQYPTACELLFERQHPEIQAELLHDELSFITVPVL ILQGEKDNQEAIAMSRAYAQYIPHAQLKIIAHAEDNLPESCASVVAGDIRNFIKGK" gene 27742..28389 /locus_tag="DP116_04360" CDS 27742..28389 /locus_tag="DP116_04360" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04360" /translation="MAIAGAVIMALGSGAISPAHAELLDFSFTTVSGATGSFTLDTDT PASGESSLGGGAAFPGTPGILYPNAVSNLFLSSTQLNLSGVTADYEVVPGLTSAGLGL PPGLGVLSGPVYPAGCSTGTNFTCAVTIGVLYSGSPSELSDDPASYLSLGIEFFDPET AEQINLTPDLYTNFQVVRRQAVPESNSSLSLLAFGIGGVGLLLKRKKNSNKPLTI" gene complement(28539..29366) /locus_tag="DP116_04365" CDS complement(28539..29366) /locus_tag="DP116_04365" /EC_number="2.7.1.33" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875487.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pantothenate kinase" /protein_id="PRJNA477356:DP116_04365" /translation="MWLALMIGNSRLHWAHFTGETLIHAWDTDYLPADVIQQLSQSQT LKDLLLKIFPSNEATGKARHGNMEDFLKTLSSSPPSPPPLLVASVVPSQTALWQTYPN VRILTLDQVPIKGIYPTLGIDRALALWGAGNNWGFPVLVIDAGTALTFTAADANQHLV GGAILPGLGLQLATLAERTGQLPTVELPQQLPQRYALNTQEAMQSGVIYTLVAGIKDF IEAWWRDFPQGNVVMTGGDRTLLVNYLQSQFPEIVDSLNVEANLIFWGMRSAALLQR" gene 29535..31259 /locus_tag="DP116_04370" CDS 29535..31259 /locus_tag="DP116_04370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210416.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diflavin flavoprotein A" /protein_id="PRJNA477356:DP116_04370" /translation="MVALTEKTKKRLTIEIGEIAPETTAIRSLDWDRDRFDIEFGLQN GTTYNSFLIRGEKIALVDTSHEKFRQLYLDTLKDLVNPADIQYLIISHTEPDHSGLVK DVLQLAPDVTVVGSKVAIQFLENLVHRPFKRQIVKNGDRLDLGNGHELEFVIAPNLHW PDTIFSFDHKTQTLFTCDAFGMHYCSDSTFDEDLKTIEADFQYYYECLMAPNARSVLS ALKRMDELEKISMIATGHGPLLSHNVDELTGRYRNWSKTQAKAETSVAVFYVSDYGYS DRLAQAIASGITKTGVATEMVDLRPGVDLQELRELVSRCAGIVVGMPPASGAANIQAA LSTILGSVHEKQAVGIFESGGGDDEPIDPLLSKFRNLGLITAFPGIRIKQTPTENTYK QCEEAGTDIGQWVTRDRSIKQMKSLGADVDKALGRISGGLYIITAKKGDVSSAMLASW VSQASFKPIGISIAVAKDRAIESLMQVGDKFVLNVLEEGNYQKLMKHFLKRFAPGADR FEGVRTQPAENGAPILTDALAYMECEVISRMDGGDHWLVYSTVYAGRVSKPETLTAVH HRKVGNHY" gene 31356..32111 /locus_tag="DP116_04375" CDS 31356..32111 /locus_tag="DP116_04375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869884.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_04375" /translation="MSLLCLVHGAYLGAWCWDLLTPEIEARGHQTVAVDLPIEDPTAG VAQYAEVVSKALQGFEDDVVLVGHSMAGLIIPLVASQRPVRQLVFIAGVIPHIGVSLL DQSHDEPDPNLLKAIGYELPEADKFEQFSDEPNMFNPAALGKNLLQDEAVAREFLFHD CASDVASWAFPKLRNQQFLYISEVSPLQAWPDVKCTYIVCGEDRSLSPAWCRYAARKR LGVDAIELPQSSHCPMLSHPAQLADILAKVAST" gene 32152..33864 /locus_tag="DP116_04380" CDS 32152..33864 /locus_tag="DP116_04380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316652.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flavin oxidoreductase" /protein_id="PRJNA477356:DP116_04380" /translation="MPENKPRDVQVYPIATDTTVFRSRSWTRLRFEIEYALAKGTTAN SYIIKGDKIALVDPPGETFTEIFLKALQQRFDLKKIDYVILGHINPNRAATLKALLEL APQITFVCSNPGAKNLRGALENPDLPVMVMRGEETLDLGKGHHLQFIPTPNPRYPDLL CTYDPQTEILYTDKLFGAHICGDQVFDEGWEIINEDKRYYFDCLMAPHARQVETALDK LGDLPVRMYATGHGPLVRYSLIELTEFYRQWCQEQASQDTSVALIYASAYGNTATLAQ AIARGMTKAGVAVESINCEFADPEEIRAAVEKASGFIIGSPTLGGHAPTPVQVALGIV LSTATNNKLAGVFGSFGWSGEAVDIIEGKLKDAGYRFGFETIRVKFKPNDATLQMCEE AGTDFAQALKKAKKVRTPSQPATTVEQAVGRVVGSLCILTAKQGDISSAMLASWVSQA SFNPPGLTVAVAKDRAVESLTHSGNKFVINVLKEGNHIGLMKHFLKPFGPGQDRFESV ATQEVENGCPVLNDALAYLECSVKTRMEVGDHWLVYATVDNGKVLDGEGVTAVHHRKS GNHY" gene 34106..34333 /locus_tag="DP116_04385" CDS 34106..34333 /locus_tag="DP116_04385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197658.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04385" /translation="MYFLMEAAAQAVKEPYQFPFAFTAVYVIGFIAAVTIGSIAWYNS KRPVGWEAKERPDIVPKVDKDPTPGLGEPKS" gene complement(34512..34934) /locus_tag="DP116_04390" CDS complement(34512..34934) /locus_tag="DP116_04390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TerB family tellurite resistance protein" /protein_id="PRJNA477356:DP116_04390" /translation="MVNNSSMKNLVKILIGAAWIDGRIQPEEREYLRKIAQEKGMANE PEIQPLLHELVAVQPEEFYEWVKEYLGDRPTPEQCQNLIEAISGLIYSDGEVAIEEAR LLTKLQQLSDTGDSTQPRHNAVLKQIQKLYRRWVEVQN" gene complement(35025..36428) /locus_tag="DP116_04395" CDS complement(35025..36428) /locus_tag="DP116_04395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321103.1" /note="catalyzes the methylation of cytosine at position 967 (m5C967) of 16S rRNA; SAM-dependent methyltransferase; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="16S rRNA (cytosine(967)-C(5))-methyltransferase" /protein_id="PRJNA477356:DP116_04395" /translation="MLNFELLTSPRQLAFIALRDVHKGAYVDIALDKVLQKASLPDTD RRLVTELVYGCTRRQRTLDAFIDQLGKKKSHQQPKDLRTILHLGLYQLQYQQRIPESA AVNTTVQLARENGFSGLTGFVNGLLRQYIRLENKEDATTKDKVSVQSSNPIERLGILH SFPDWMIQVWVEQFGLAQTEQLCLWMNKTPAIDLRVNVLRTSTEEVEAALTSAGVASQ PVPHLPQALRLMSHAGPIQNLPGYNEGWWTVQDSSAQLVSHLLDPQPGEVVIDACAAP GGKTTHIAELMRDEGKIFGCDRLKEAGAKRHRTPSRLKKLQENAQRLKLKSIEVCTGD SRNMPQFQNTAHRVLLDAPCSGLGTLHRHADARWRQTPETVEELSLLQKQLISHTSTF VKQGGVLVYSTCTLHPKENEEVVSSFLAENPNWEIETPSADSPASTYSTPQGWIKVLP HQQDMDGFFMVRLRKNS" gene complement(36605..38053) /locus_tag="DP116_04400" CDS complement(36605..38053) /locus_tag="DP116_04400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318725.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkaline invertase" /protein_id="PRJNA477356:DP116_04400" /translation="MQLNDLVSTDSVEKEAWAAIENSILYYRGRPVGTVAAYDVTIEA LNYDQCFVRDFVSSALIFLIKGRTDIVRNFLEETLRLQPKERQLNAYMPGRGLMPASF KVVCEGEEEYLEPDFGEHAIARVTPVDSCLWWVILLRAYVVATRDFSLAYQPDFQHGL RLIMELCLATRFDMYPTLLVPDGACMIDRRMGIFGHPLEIQALFFAALRAARELLVCQ GNEDIVTAIDNRLPLLQAHIRQHYWMDLHRLNKIYRFKSEEYGKAAANPFNVYADSLP YYELDKWLPKKGGYLVGNVGPSQLDTRFFCLANMISIVSDLASEEQSQAIMNLIEERW EDLVGDMPMKICFPALEHEEYKIVTGCDPKNIPWSYHNGGSWPVLLWLLSAAAVKTNR MDLAHKAIEIAEARLHLDEWPEYYDGKKGRLTGKQARKYQTWTIAGFLLAKELLRNPR FLPLVSFGSFSVEPASRACEFEMVEANTLYFG" gene complement(39852..40799) /locus_tag="DP116_04405" CDS complement(39852..40799) /locus_tag="DP116_04405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129080.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04405" /translation="MSQDNQNSQPPFSPEPEANPPQPTVESASQPPQEQYQTVQPIWK AITIRILRGTIGVLETTVDKLETQPSAGDKETPGFFQKLQFVWSAVLGKIRSRLPRNL SRKLSDTAITGIVAGITVILVWTSSTVFASKPTEVAIPPEVETPPSATITTAPEVETP LTPPIAEETPPPTEETPPPPVAEETPPPVEETPPPEPEPEPTPTPTIILTPEQTLIAA IENQVAEVSDRFAPGLIQSIQANFRSSSLAITIQDEWYNLKQSEQDKLLAQMLERSKE LDFIHLDIIDSRGKLVARNPVVGTEMIVFQRRISLIPQQ" gene 40927..41841 /locus_tag="DP116_04410" CDS 40927..41841 /locus_tag="DP116_04410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319719.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="metallophosphoesterase" /protein_id="PRJNA477356:DP116_04410" /translation="MKLKRRQFLFFSSLSAIGLGFIGCVTRQAGKNPGIAVSKAATPP NPDKNNSLLRFVSVADTGTGDQGQYAVAKAMTNYHSKNPYDLVVLAGDNIYTNGEMEK IGAVFERPYAPLLKQGVKFQAALGNHDIRTANGDLQLKYLNFNMKGRYYTFQRDHVQF FVLDTNTNADWQKQLKWLEQELSASKTPWKIVYGHHPVYASGVYGSNPTFIKSFTPLF QKYDVQLYINGHEHHYERTRSINGTTYLICGGGAGTRPVGRSEWTEYSAEKLSFAAYE VYADRIEISGIGTDNSVFDKGIVQLKSV" BASE COUNT 11926 a 8752 c 8875 g 12459 t ORIGIN 1 gtactggggg tggtggtggt gttggtattg ggggtggagt tggtattgat ggctgttcgc 61 tgaccgcagg tgctggaggt ggcgtagtta tagttggtgg cggggggctg ctttgtgttg 121 tctctacgaa tggatcacca aacgcaactg ctgttggtgg tatggctgct gaatgaggga 181 atgtgtaatc agcaccatta accgacatag gttgtggttg tacaagtggc tcagggttaa 241 ccagaggcgg tacatctggt gcaatatcag gtgcgacagt tgcggcggca actgcagtgg 301 gagtagctgc acgcaggtta taaccacact gaccgcaaaa agctgcatct gcttgcacgg 361 ttgccccaca actgggacag ttactagtag ctggtaatgg ggtatagcaa gcttcacact 421 ggacagcgcc gtctgggttt gggtgattgc aattagggca gacgatcatg aggttaagtt 481 agcctttgtt gttgttcaat ggtataatag tgccttgaag cagaggggca actctagctt 541 tcgattctaa caagtctgtt tgaaggcaga agtatgaagt cttatgaagt gtaatagaat 601 tccatgcctc atacttcatc cctcttgttt tatcctgcgt cctttttcag gtcacttcca 661 gctgctgcat ctagcaacca ccaaagttct ccttgaggtt gaattaagcg ggatgggtaa 721 gtgaattcat ccgccactgg tgcaaagact tgagcaagag ctggtctttt attcgcacca 781 gctacaacaa agataacgca gtgggcagag ttaatgaatg gatatgtgaa ggtgatgcga 841 gggtttccat ctttgttccc tacggtgatc agcttatctg taacctttaa agcttccgtg 901 tggggaaaca aagatgcggt gtgagcatca tctcccattc ctaataaatt gacatctaaa 961 gcaggaaatt ctcccggtga agcgtgaaaa aattcttgca gatgtctttc gtacttagcc 1021 gctgctactg cggggtcagc ttcatctgta ggtatggggt ggatgttacc tggtgggatt 1081 tcaacgcgat ctaaccacgc acgacgcgtc atcagctgat tgctatctgg atggtcaggt 1141 gaaacgtaac gctcatctcc ccaaaacaca tgaattttgt cccaaggcag tttctgggta 1201 gagattgctt catacaacgg tttaggtgta ctaccaccag acaaggcaat ggtaaactgc 1261 ccacgctgtt tgatagctat ttctatttta gacaagatca 
attcaagcgc tcgcgcaatt 1321 agcgcttttt gatcacaaag aacttcaact tttttgctca tcgcgtcatt atcacactag 1381 cgacttccag caatacaata gcgaaataat gacaacttcc gtttttactt tttgaaactt 1441 ttaacatcat tctctctgaa tggatctata ccctaggtga gccgttgtac aaaattcttg 1501 ccaaatcgta acaaaaattt ttataaacaa cgacaggaac taaaccgcct ctttttcaag 1561 ggcggatgta gcgacggaag cgatttaaat ccctgtacta caatcatatc agtcagcaca 1621 aaacttgaaa acctaccccc ttcccaggga aagctgagcg ctttacctcg aatcaaggaa 1681 attttccgga aatatcactc ggttcttctg ggctaagggg gtgaaccgct taactcgact 1741 acttgttttg gatataagaa gcggaatatt gtttgaattt gcgactatta agagcacaga 1801 gctgttttag ccttggtaga gatcattcca gtttgcgatc gcttcacgtt gagcaattaa 1861 caattgttga atgccttttt gggcaaaatc cagtagagaa ttgagttggg tacggctaaa 1921 actaccggac tcagcagttc cctgaacttc tattatcccc aactgctgat tcatcaccac 1981 attaaaatct acctcagccc tgacatcttc aatgtagttg agatccaaaa atggctcacc 2041 ttgcaacaat cctacagaaa tcgctgctac ctgtccaatg atgggcgatc gctctaacac 2101 tccctgctgc aacaatgtag aaattgcatg agctaacgcc acaaacccgc ctgtaatcgc 2161 agtcgtcctc gttcctgcat ctgcttgcaa cacatccgca tcgacagtca gcgtgcgttc 2221 tcctatcgcc tctaaatcta atgctgctct caaactacgt ccaatcaaac gttggatttc 2281 ttgagttcgt ccagacaact tcagaaattc tcgttcttgc cgttgcggtg tggctgatgg 2341 taacattcga tattccgcag ttagccaacc tttaccagtt ccttctagaa acttagggac 2401 tccctgtgtc acgctgacgc tacaaagtac ttgagtgtcg ccacattttg tgagaacagc 2461 agcaggcgca aagcgcgtaa aattttgctg aaagctaata ggacgtagtt ggtaggattg 2521 ccgaccgtct ggacgctgcc aagccattgg gtttgcctca aaatttgtat ccacaataca 2581 gtcggttatg ctttgagtat tagggcttaa gtattataaa gcttcactca ttttttagcc 2641 aagagtttgg tactctaaaa aagataagct caaaactttt gtctcaaaac ctattgggaa 2701 gttatcagca aagtaattat atcttgctgc accgcttctg gtgactggtc tccgttgata 2761 gttaataagc ggcggcgacg gtcgtagtac tctaaaatgg gaacagtacg atcataaaaa 2821 gtttcgacac gacgctgcac aatttccggt agatcatctg gaagaaaacg ccccattgaa 2881 cgattcacca taaccgcctg tgggacttgt aaataaattg cccagtccaa attgtgccct 2941 agatggtcta acaaaaagtc aagttcttca gcttgctctg ctgtacgggg 
ataaccctcc 3001 aatacccaac catccttaaa atcaggcttt tttagctcaa tctgaatgga ttcaatgatc 3061 atttcgttcg tgacgagttc ccctgtctcc atgtatgact gtggaagatg actgagatac 3121 ggaaaagatg ggattgcaaa gcggcgctgg tgacccaaca ggtttgctgc atcttgttct 3181 gcgcttcctt cagggagaga attttctgga gaaaaacctt ctgggcaaaa aacggcgatg 3241 ctcctgcagg aagccgctct ttgggcgatc gcatctcgta atatttcatc actggaaata 3301 agaggaatat taaagtatgt gcaaagccct tgtgcttgag tgcttttccc tgctcctgaa 3361 cctcctatga tcactaattt cacaacaatt tactcctcat tttttgatca gaactcacta 3421 tgctgctcag ttatatgttt tcatttttac aactactttc cgaaaaaagg caatcatctc 3481 ggtttcgata actcttgact tcaggataaa tgtagtgttt gactaagaat gcaacgcacg 3541 caaatacatt acgtatcgtc aaaagcaact aggttataca gtctacgttc gccagaaaaa 3601 atctcagcaa aagcacttta ttttgttgta ttattttaga tattattatg gttgctcagt 3661 tagaaacccc gagcttaaat tcaaccctcg attttccata ctccgttgaa ggtatagtac 3721 aagttttcac tagctctcat cgtagctttt ttacgagcgt catgtctcaa gcactgagaa 3781 tcgcaggtca aggcacagca gtgttagtag tacagtttct caagggtggg atacgtcaag 3841 gacaagaaaa ccccatccga ctgggacaaa agttagattg gattcgctgt gatttgcctc 3901 gttgcattga tacaccacac ttggacgaaa gcgaaacgaa atctctacaa aagttgtggc 3961 aacatacaca aaaggtagtc ttagaaggtc agtattctct ggtagtatta gacgagctca 4021 gcttagccat tcactttggc ttaattgctg aaacagaagt tctagcattt ctggacaaac 4081 gcccgaacca tgtcgatatc atcttcacag gatcacaaat gcccaaatct attttagaaa 4141 tagcagatca aatcaccgaa atccgtcgta gtcatcaacc gtaattcagt gtatcgatac 4201 cgagtgctaa gtcctaagtg atgagtttca ttgttgatct gacttatcac tcggattcag 4261 tgactcttct ctaacaagcg tgagttcgcg ctattctagt tattcggtgt gaagtaccca 4321 ccaccaaaac ggagtaccgt ttatggtggg ggcttcgcgt ctcactcgcc gttgctctgt 4381 taggtacatc cattgctgaa tgttcgtcac tcaagacgat ccaagtctta cacagcgtct 4441 ctcggtttgc accgtttatc tcctcaaagg gagaacccgc gcagccgccc tgcttcccaa 4501 ggcagtaatt gtttttccgt acattcctgt gtggtttatc ttttaccgca gagcgtaatc 4561 atacgaacca gtgaccggaa tcttttctcc tttctctcgc tcttagactt ggtaagcgtc 4621 gaaccgctac tgtgaccgtt tgttatgagc cgagtgagtg ggtcgcaacc aatattatta 
4681 taccagacat aatcggaaat ttacggttaa attgacaatt ccgcgctaac gcgccgctcc 4741 gctaacatgc atctttgtct tttcttccct gcgggaagat gagcgccttc agatgcaatc 4801 gtcattgtcc gccttatatc gcaccactaa tcggagtacc gatatagtgg gggacttacg 4861 gcgacaaagt taaataaata ggaaagtagt gattagaaac gacatgataa tattacagtg 4921 atttgcatga gtaaaagcac ttctacaaga tgcggtataa tcaaaaactt aactattgct 4981 gaagcttgaa agaggggcaa agcctctgcg tattgaatgt atttaaagaa caacaaaaaa 5041 aatttttacc ttgaaggtta tgaattgaga gcaaaataac taaagcagaa gcttcaattc 5101 gttcatcaga gattcagaat tggaggtaaa tgatgaagtc taaaacaccc taatatggag 5161 aagtaaatgt taaaaaatga caggtggata attgagcaag ctcaattggg tatgattgaa 5221 ccttttgaac ctagtttggt tcgtgcagtt tcgatagata gcacaggttt acttcgtcca 5281 gttctgagtt atggactttc ttcttatgga tatgatataa ggttgtcacc tgtagaattt 5341 aaggtattcc gtcatattcc tggcacagtc gttaatccta aaagattcaa ccctgataat 5401 cttgaatctg taccattaca ctctgatgaa gatggtgatt attttattat tcctgcacac 5461 tcttatggtt taggggttgc actagaacgg cttgtcatcc cagctaatat tacctgttta 5521 tttattggca aatctacata tgctcgttgt gggttaatct ggaacctgac tcccggagaa 5581 agtggttgga caggctggct gacactggag gtgtcgaatt cctctagtgc tgattgtcga 5641 gtattcgcca atgaaggtgt agttcaagct ttgttcatgg aaggcgaacc atgtggaacc 5701 acttatgcag atcgtgctgg aaaatatcaa gatcagcctc ataaagtgac gctagctaaa 5761 gtttaataga tatagcaatc ctaaatgagt tgtgaaatta ttcgttgagg tggggaacag 5821 ggaacaggga acagggaaca gggaagaaga atgtgcaaat gtactttgtt ttttcagaaa 5881 tgaaatagga ctgctatatg attgaaagta ggagttatac cataccagtc aaacagccga 5941 atactgaaag attaaagaaa atggtactct cgaaacgaga tatgattcga tgaaatcata 6001 ccaaagcttc actacaaaca atcatgggta ttggttgaga gaatgtaggt aaactgttat 6061 cttgccattg atgcatatta tgttgaaggt aaatttgcgc cagttgagtg aacaattctt 6121 gacctgtgaa actatcacgc cgcaatttat ttaactctcg gttaactgct tgattcacac 6181 aagcgcgaaa cttttgttcc aaaatattga tctgactttt gagttcttca tctgcgatcg 6241 ctttaaaaaa gatcctcaat tcccgcagaa tatatctttg accttgagtc aaacgcccct 6301 taaactctct ttccgcttgg cgatgcttca cttcttctgc aaattgacgt ttgagacgca 6361 
taactgtaga attatgttct tggggaagtg ttgaagtagg agtggtggta tcagctttaa 6421 ttgcacccaa gatgcggggg atatccctag aaattatttc ccctctgtcg tccaataaaa 6481 ataactgctg gtagcctttt agatcaggtt tactggggtc tgaagcttca cagaaaacat 6541 atgttccctt ttgctgcata gagaatttag ctgtacgaat accatgcggt aaatgagcaa 6601 ttcgttcata ttctgagggg ttatcttttt gtaattttct caggatttct tcagcctcat 6661 ttaaatccaa gaaatcctct tctgcttcaa aatcaaaaag actcaactgt ttgccttgtt 6721 gttcgtaaat ggcatacata gcttcttcgt tgagttgctc agttcggtca agaatggcag 6781 catcttcacc gatagtatca tgaatttcct gaattctatt tttaagcttt tgcttcaaac 6841 ctaaattgcg ttcaatgcct aattcaggta aaaagttgta tccataaata acatcttttt 6901 cgctaccaat tctgtctata cgaccaaaac gctgaatcaa ttttactgga ttccagtgca 6961 aatcataatt tattatcagg tcaccgtctt ggagatttaa accttctgcc aagacatcgg 7021 tagcaatcag tgtattgatt tctgattcac ctggtttaaa tttatattcc ttgttagcgt 7081 taggcgcaaa tcttcccact aaacgcgctt tatttttgtt gctaccacta taaatgacat 7141 caaaatcatc tcgttgtccg cctgggttta agttctcata tagatacttt gcagtatctg 7201 catactgagt gaatattaac cgttttttat ttttcagact aggtttagat aaccacttta 7261 taagagtttg taattttgta tctttatcag cggtgatagg ttcaactaat gccaaaatat 7321 tttttagcag tttaatatca tgctcgatat gctgttgtaa tttttcggta taaaaatctg 7381 ctatgtcgta tttatcagag actttttgca atccatccat caaatcttgt tcttctgctt 7441 ggttataatc ttccgatagc agagtttggg cttcttctcc tgctgcgaca aatccttggg 7501 acagtgcttt taaaaacctt tcatgcacta tcaagagttt tctgatagat tcttgaaaag 7561 cgtaaacact ggattcaaag cgcttaaaca acagcactcg catgagtcct cgcaagttag 7621 cgcctgcgcg ttgcaatgtg ttataaggtt cttgcttttg tttcggtttt aaaacataat 7681 tccacagtcc ataacgcgca taactgagtt cgttagttgg tggtttggct aattggcgct 7741 tgcgagattt acccaagtat tgacgtaact cttcgtatag tccttggtat gtgtcctcaa 7801 tgctgtactc aatagtttct agttctcgct tggggaaaaa gcgatgcttt ccaccaacga 7861 tgacataagc gcgtcgggtt ccgtctaggt actcacgaaa gttagctgga tctactgctt 7921 gttgagtttg tgagtcgaaa ccgtaaaaac gtagaatatg gttgcgggtt cgacgaatca 7981 ggatatgaga gagtaattct tgtaactttt tctcgccttt ttcaactaac tggaagtatt 8041 cttttaaatc 
aggagggtct attgggaggt cagttttgtc gtcttgatga aatagtttta 8101 gctggtggta gatatcccaa gcactcttat ttcgcggagt tgctgttaat aagcaacaac 8161 gtttcccagc acctaaaaaa gcttcgacta ctttatatct ttgagtcgtg tgatttcgca 8221 ggttatggct ttcgtctata agtacaaaat ctctatcttg aaatctgaaa tcatctaata 8281 aaactttgaa gccgctttcc tcgtcttctt tgagcattcc cattgaaaga acacgagcat 8341 ttagctggta gacttcgttg tagcgttccc acatttctat gagtggtgca ggacaaataa 8401 ttaaaggacg tgcgcgttct gtttgttcaa atcttttaac aattgcagcg ccaataaagc 8461 tttttcccag tccgacgaca tcggagacga aagcgccgcc atagtctctg atattttgca 8521 cagcattatt aacagcgact ttttgaaaat ctgctaattg tttagtaatt gcgtcttcta 8581 aaattaggtc ttttggtgcg gtatcttcta atctttcttt tactaaacta tagagagttt 8641 tcatataaac atcataagga cggactgtag aacccgccca agattgtttc atttctctca 8701 tcaaggcttc gttaaaatct tccgcttcgt tccacagttc attaaaccag tgagttaatt 8761 ctgtatggtt atcgtttccg tgaataacta cattgagttc ggtattatgg gtgataccag 8821 aaagggtaag attggatgaa ccaactatag caatgccttt ctcagcacgt tctaaggctc 8881 tacctttatg atcgtaaaca gtaccgtagt caaaaatata agccttggcg tgtaaagttc 8941 ctttggtgta aagccggaca tgaagccgtt tttcttcaat catctgcacg agatttttta 9001 ccagtatctc agcttcatct gtctgatcca tgagttcgat actagaacgg atattggctg 9061 ctgtaacgtt cgccatccgt ttttcttcac tgcgcttggg atatttttgc gcttcgattt 9121 tatcttcaat taattctagt cttcggtatc cttgggctat ttgttcaatg gtttcgcgat 9181 tgctagtatt gccaattagc agacgtaatt cttttatgtt ggggaggcgt tctgcaatgg 9241 cggtgaagcc agaaataaag aagtaaccta ccgcaaaatg agccgcttct gaggagtcga 9301 gaatgcgctt gatttggtct actaattttt gattgcgatt atctattata tcgtgggtag 9361 gcatggtttt ttatttaacc cagaggcgca gagaaaggcg atcgtcaagc ttgaaattgt 9421 tggctttgtg gataaatcac tactgatgta tctatataaa tcaatacggt tcagttagtg 9481 gtatttaaga tcccccaacc acgccaggtg ctacaacggg gggaaccccc gcaacgcact 9541 ggctcccctt aaaaaggggg cttagttccc tcctttttaa ggagggctag ggaggatcaa 9601 cccttaactg aactgtattg ctatataaat aacgccacaa gagggaagtc ttagctgtcc 9661 catgaatccc gttcctcttg tagctgtcta tctatttctt cagcagtttt tgatggacgt 9721 ttagcatgaa cttcttccag 
aatttctatt acagaacgct gttttacttt aggcttttcc 9781 ggcaaaataa caaatacatc aacggtgtca ccaatctcag cttctggaag ttctatttct 9841 atcttgtttc ctggtaaaac tttggttgta atgcgtaaag ctgcttgcat aatttataaa 9901 accttaaaga gtaaagacat cttcgtaaac atctgtcatt gttagggtta accctaccga 9961 gtctaatttt aggtcatcac ccttgaacaa agtttgcact gaccaatttc cttggttatc 10021 tttacgataa acttctactt tgatttcatc ttgggaaact aatacatatt cccgtaagct 10081 ttctaaagtt cggtaattaa caagtttttc tcgtctgtct gttacttctg tgctaggtga 10141 gagaacttct atgattaaac aaggataatt taagcaaaag cggtcttggt cttggcgatg 10201 ctccagcagg gagccgccca aagggcgatc gcaggtgact acgacatcag gatagtaaaa 10261 tatactttta ttttggctgg ctaattctat cctgactttc atgtcagcca taaaaacact 10321 acaggaacca ccccgtaaat gggaacgtag tctgctggcg atattgaggg taattatatt 10381 gtgttccttg ctaccaccag acatggcgaa tacttgaccg ccaagatatt catggcggat 10441 ttcgctctta tcttcggctt tgaggtattc ttcaacggtg aaaaagttta caggggattg 10501 cataattttt ccctcagcaa ttcttaatac tacgagcgag atttgttctc ttctctattg 10561 tcctacaagc cataaagggc ggcgactcgc tcatctattt ctttctccca tgcttgacag 10621 tcaacgcctt ttgcgtctag gcatttttgg actagtttag atatggcttc tcgttctgta 10681 gttgaagcat tggggatagg aattttacta atatagattg ctcttaactc taaccgtcca 10741 cctttatcag catcgccaag tacagaacaa aaatttttta ggtattccca tatagatgaa 10801 gagttaagca ctccaagaag gtaaagatca ggaacaggaa taataaaacc tttatcgtta 10861 gtgaatgcac ctgttgtatc aaaggcaaaa cgcgactctt ttgcaataac cggatagaca 10921 attttaggct tatcaaatgc agtgtagtaa gctacttcat cttgaatctc ataccattta 10981 taagagccgg attttcgacc taaccattct ttagatgcag accaattacg aggttttggt 11041 tctaactgta tttgccactg tgctagaagt gctttaatag ctggataagc atttatatca 11101 attcctcttc ttgtgaaaat gagccatttg tcctgatagt tatagcagtc gccacctcag 11161 tcaggacgta ggatagatga agaagtgatt agtttccgga ttcctatggc aaacccttac 11221 agttatgacc tacgacgaaa agtgatcgag gcgattgaac tcaatggaat gaagcgctct 11281 tgagcgagtg aattcttcaa tatcagtcgc aataccatca atacatggtt ccaacgtcgg 11341 aagcaaacgg gtgactttaa tgccaaagtc agagcgcatg ggggtagtgg tcacaaaatt 11401 
accgacacgg agttgtttcg agcctttgtg gaaacccacg gggacaaaac tcaggcggaa 11461 atagctcaac tgctaatcgc ctcgattttt gatgccgaac gtgcttgcta aaactctaca 11521 ttggggatta ttttgtagtt gaacgaaatc taggaatctc ttatgaaaaa ccgtcgtcag 11581 cgcgtgagtc gttcaactat tcggaaatat agtgccgaac ttgaacagcg tttgagagag 11641 cagtccaagc tcgaacgctt agaaaaatta tcagagttta gggattggct aatacatggg 11701 ctattggttc tactcttgat cctaggcgga gtatttttta gcttagtgga tagagttttt 11761 gacttattgt attgcaaaat tttacaactc ttcccatttt tctcgtgttc cggcgatacc 11821 attgtaagag caacacgtaa ggatccaatt tacgtactgt taatgtattt tttgatatat 11881 tttattttta ttgttgtttt caagttatac gtggaggata agaaaaaacg tagccaaaac 11941 acccatacta atagtgtttg caatgagtat gacgattacg ataccgatga tgactatgac 12001 cacgatcgtt agtcaaaata actatagcga cagtgactac aatgactact aaatgtaacg 12061 aactgctaac aagaagtgat aagcgcactc gatggtttag atcgtctagt aaagcgatca 12121 attccaaaaa agcaatcttt atgatatgta gtataacgct ctacctctga tgaaattgat 12181 attcaaagct atatgaacaa tatgtatatt cttctacaaa cttcaaagta caatttaatt 12241 tacttgttac ggtatagatc ccagtgtttt cactgaaagc aaaacattgg tttactatcc 12301 tttatccttc acttgattga tttatcttgc tcataactca actcttgttc cattgacaag 12361 acgaattctg ctttttgtaa ctctgtccct aagtacatag catgctcaac ttgcaagccc 12421 ggacaactag ctgctatttg ttgatacagt tgttttgcag acttacctaa aaagcagttc 12481 acaacttctc cagaaccagg agtcatctgt tcgacaagaa ttttgccgtc ttgtacagca 12541 atcacaaaac tgcctacggg gtctgcataa tctcgctgct gacaaatttt ggtatactga 12601 gactgaataa ctttttctgc attttcccag cagtcatcat aaatatgggc gctttgactg 12661 acagtaatta atggtcccat ttttaagaaa tgagttgaga acttgcatat ttcatctcga 12721 atatgttgtt gcaaagcacg caatcccatt gcatttgcgg gccatgctga aaacatatca 12781 ttactgcgca aagtcgcggt tattgaaagt tcattatcaa ctattctgac ccaaatatga 12841 ttcaggcaag gcggactatc attagcatca ttggtaacat cccacaaaga catgaccgcc 12901 ctagctgaat caatatcaat gactagttta tcaatgactt gctgaatttg gtcgcgtcca 12961 aaccaagaac gtaaacgttg accgtatgta tacttcactc cttcttggtt gggtgcatcg 13021 tctaaaatct gcgaaatata ttcttgcaaa aaactacggt caattggcaa 
ataattgggt 13081 tctggaaaat aaaatttttc tggttctgaa gtaacaactg ctattaaatc aatcaattct 13141 tgccattgtc cctcatatac agtcggtcga attgttccgg ttgttttaat taaatggatg 13201 atttttaccc aagtttcagc tatcgtctta ccctcgactc tgtgcccata gcgttgtcct 13261 ggtaaaacag ttggtactaa agtagacatg ggaaattcta atggaaaacc ccaaggttcc 13321 accgcttctg tttgagcaaa agatttcacc agagtacaag cttcagtgat tgatttcgct 13381 tcttgaaact ctatagactg acgcagtttt tctaacgcac tggcttcaac ttcaatatca 13441 atatatcctg cataagctga gcgaattacc cagcaacgtc gtccagtatc gctcaatcct 13501 tcttcaaagc cattgcggaa aaagtctaat aagcaaatcc cagcaccggc gtttttgtct 13561 tctctggtgg cgttgagaac aaccaagtag cggacatggg gattataaag caaattgcga 13621 atcagtaagt ttatccccct tgtgggcgag tagagctgtc caatgacagc atattcttgg 13681 gggtgtagat gtttggcgat cgcccccttt acactccacc ccgtaatcac tgctgtctga 13741 ccttgaccat aaatcaactg cttaggcttg tgcagtgcgt tgtagttata tttgattgcc 13801 ttgcccgttg ctgtcatggt tgattgccag aaagtcaaaa caatctttta tcttaacata 13861 tcaccaccgc ttgctctggc aaattgttca caataccaac taggacttac gcaccatctg 13921 ccagaaaccc ggtttcttcg ttcgtatttg gttgctaaac agacgatttt tggtagaaac 13981 cgggtttctt tgcgtaagtc ctgccaactc agctgacacc aacacggcgc gcctgattgc 14041 tcatctactt tacatttttt aatattaacg ttacacttct ttacataaca gcaattattt 14101 cgttaggaca tttccatgca gtacaaaggt gccattattg acgaggccgg caaaatgaac 14161 aattttgcgc tagaaccaaa ggtgtatgtg gatgagcaag gcgatcgcac cggtttaact 14221 ccctacgctg aacttctcaa cggtcgtttg gcaatgattg gttttgtatc tttgatagcg 14281 ttggaagtgt ttacaggaca tggcgttttt ggtttgttca gaagtttgta aaaacacatc 14341 ctgttaaaaa caaattttta tcatgctcgt agatacacaa gacgcaagat tttgcgtctt 14401 ttgtttgtaa aagctacccg tcagagagag tatctatagg gacataatgc ggtagtgttc 14461 tcttgtggct aaaagagaac taaagctggt aattatgagc aacgtgccca aaatcgaccg 14521 tcaaagggag catcaagcaa acgagcggac atttctcgct tggctacgca cttcaatagc 14581 attaattggc tttggttttg ctattgctag atttggtcta tttatacgcc agctgaattt 14641 tactctgact caacagcaac atgaaccatc accatatccc tttttcacct cagaaaattt 14701 gggtatcagt ttggtaatct ttggtattct 
gactttactt ttagcagtct ggcgatataa 14761 ccaagttttt tcccaaattg aagaaggtaa ctatcaaccc agtaagttta ctgtttgggt 14821 aatgactgga gtagtgatta tttttggaat tctcagtatt cctttgctgc ttttacgaag 14881 tcatgtgccc cgttccacgt atccgctccc aaatcagccc cagtcacgta attttcgtta 14941 aattgaaaaa atcttgattg aataaaaata tactaacaag acagagcaag tgtcatctgt 15001 aaaaaggttt tattaatcac aaactattgg tatattgaaa acgatatgat tcagtcttca 15061 gagaaaatat atcaggatta tggctttaac tgcttcaaca atgttaccgc taggtactaa 15121 agcgccagat tttcatctat cagatgtggt acatgaagaa acgatttcgc ttgccacttt 15181 tgctgataaa aaagctttac ttgtgatgtt tatttgtcag cattgcccgt atgtaaagct 15241 tgtgaaggca gaattggcgc agttgggaaa agattatatt catgatggtt taggaattgt 15301 tgctattagc accaatgatg tccataatta tccagatgat gatcctgaat ttctcaaggc 15361 aatggcgata gaacttgatt ttaagtttcc tttttgttac gacgaaagcc aagaaactgc 15421 aaaagcttac acagcagctt gcacaccaga tttctttctc tttaatgcca agcgccaact 15481 cgtttatcga ggacaattgg atgatagtcg ccccggtaat ggcaaacctg taacaggtac 15541 agatttacgc gcagctattg atgcagtatt gtcgaatcaa ccagttacag ccgaccaaaa 15601 gcccagtatt ggttgcaata tcaagtggaa accgggcaac gaaccgagtt attttggtta 15661 agagattccg agttttgagt cgtacattga aactcaaaac tcagtaccag caccagagta 15721 ttgtaaagaa tcttccaacc ttgggatggt cagagaaagt gctttaatct ttcttggtat 15781 ggtgaagcgg atcatttgtg gcagatgtta atgtgtccag tatagtggag tctttgtctt 15841 ctttgtcatc cacaatgtgt atattggctc agatgttaat cgacaccatc taccgcataa 15901 caaaacacaa atcctccttt gcaaaggaga aatagtcgca aaggaggatt ctaaggggct 15961 aggttcaatg tttgatttct aaactaagga tgtaacatcc tagttggatt tctcccgccc 16021 acaattcgta ctctactcgt agatgctttg gtaaggggtg tccgtcgagt ttgaggctgc 16081 gggtgaaatt tcgtaagcga atcggttcgc taggttgtaa tgactcaaca ggtaaccaat 16141 agcgagtaat gccgtaaact ccttgttccc aaaagtgacg ataactcaat tgaccatttc 16201 cttgcatggt catagtcact ctgtaagggg aaatctctag ccataatact cgcggaccac 16261 tgggaactgg agtgctacct tggacttggc aaagggtgtt ctctgtcagc agagagtcgt 16321 ctactttttc agactgtttt acaggcggtg cagacaaaag taggtggaat ctgtcgcgat 16381 ctttttggta 
gagtgttgca gcagtttcga caacagacca aactggcaga tctgtggaaa 16441 taagtgatag gcacacgggc ttgcgatggt gagtaagcat gagagcggta agggcaaaaa 16501 gccaatcaat gagatgagtt acaagaagaa attgataaag attagtagag ctggaggagc 16561 aaggaaagcg atacgatact agcgcagtca acgccagttt atcctaatgt cagcttctgt 16621 tataaccgta gaaaagaggg agaacgcttc tcaaaactta atcattgagc cacctggttc 16681 tctatctcta caaggtcgca tccgtgttcc tggtgataaa tctatctcac acagagcttt 16741 gatgctgggt gctattgctg aaggtgaaac ccagatttca ggtttacttt tgggagaaga 16801 cccctgtagc acagcaagct gtttccaagc gatgggggca gaaatttcgg aactgaatac 16861 ggaattagtg cgggttaaag gcatcggttt aggaaattta caagaaccag ttgatgtctt 16921 aaacgctggt aactctggaa caacgatacg cctgatgctg ggacttttag catctcatgc 16981 agggcgcttt tttactgtga cgggtgatag ttctttgcga tcgcgtccca tgtcccgtgt 17041 tatcaaacca ttacaacaga tgggggcaga catttgggga cgaaaaggca atacactcgc 17101 acctttagca gttcaaggaa agtccctcaa acccattcat tacaattctc ccatcgcctc 17161 tgctcaagtg aaatcttgca ttctccttgc aggtttgacg acagagggaa aaacgactgt 17221 cacagaacca tcactttcac gcgatcacag tgaacggatg ctacgggcat ttggggcaga 17281 tttagttacg gaccctgaaa ctaacagcgt cataatcact ggacctactc aactttacgg 17341 gcaaacagtg attgttccag gggatataag ttcagccgct ttttggttag tcgctggggc 17401 aattgtacca gattctgaac tcgtgattga aaatgttggg gtaaatccca cccgcacggg 17461 cattttagaa gctctggcga tgatgggagc gaacattcaa caagaaaatc agcgggaggt 17521 tgctggagaa ccagtcgctg atttacgagt gcgttccagt cgtttgcaaa gctgcacaat 17581 ctctggtgat attatcccca gattgattga tgaaattcca attttggcag tggcggcggt 17641 ttttgctgaa gggacaacaa ttattcggga tgctgctgag ttaagggtga aagagagcga 17701 tcgcattgct gtcaccgcgc agcaactcaa caaaatgggg gcaaaggtga cagaattacc 17761 tgatggtatg gaaattactg gcggtacctt tctatcgggt gctgaggttg atagccatac 17821 tgaccatcgg attgcgatga gtttggcgat cgcatctctc aaagcttctg gtcaaacgat 17881 tatccagcgt gcagaagctg cagcggtatc ttacccagag ttttttacaa ccctacaaca 17941 agtttgtggc gatcgctaaa caaagtctgt tatagcctag aattctacgt aaaagcctgc 18001 tcttcgtttt acttgagggt aaactccggc aggctttatt ggcgtaaata aatgcaacag 
18061 ccgcattaaa atattcctca attagataca atatcagttt tatgcctcaa tttgaaatat 18121 aattttatgt tttttttgat ctggcgtaaa tggttggtgg tactattgcc tgcattgttt 18181 agcgctatgg cactttgcgc acgttccgcg ctccgtattt tgctatccag caaactttag 18241 aactcaagca gcagtatctg aatatcgatc ttagacggct cgctgtggaa tcttcacgaa 18301 ttgaaagtcg taaaaagtac aactaagtta agttacagta ctcattaaga attatgaagt 18361 ttcgtatttt taaacaactt ttccaacgta gttggattgt cgctttactg ggtctattgg 18421 tagcagtaac cctcaatggt tgtaatccca gtcagtttaa aagccaagcc gctcaagtgc 18481 cgcgaatgat cactgctact ctgggtgcac cttcaacttt taactcagca ttgaatgaga 18541 cggcatatgg cgtttttggc tttatctacg actcattgat aaatgaaaac cctcttacca 18601 acaagcaaga gcctgcttta gccgagtcgt gggaagtttt tgacaatggt aagcggatta 18661 ttattaccct cagagaagga ctaaagtggt cagatggtca gccaatgact gctgatgatg 18721 ttgtgttttc ttacaacgaa atttacttaa atccaaaaat tcccactcct gttaaggact 18781 ctctgaaaat tggcgaaagt ggggcaacac caaaggtaaa aaaacttgat gaacgacggg 18841 tagaattcac tatacccgaa ccttttgctc cttttttaag atgggtaggc ggtatcacaa 18901 ttctgcctgc tcatgttcta caggaatcta ttcgtacaac tggttctgat ggtaatccca 18961 agtttatttc aatctgggga acagacaccg atccaaaaaa aattgtgggg aacggtcctt 19021 atgtgatgga aagttatgtt cccagtcagc gagtgatatt caagcgtaac ccatactact 19081 ggcgcaaaga tactcaaggc aaatctcaac cttacattga gcggattgtt taccaaatta 19141 ttgaatctac tgacaatcag ttaatcagtt ttcgctctgg gcagctagat gatttggaag 19201 tgaccccgga agggtttagt ttgcttaaac gagaggaaaa gcgggcacgt ttcaacattt 19261 ataacggagg acctgataca agcacaactt ttattgcttt caatctcagt aaaggcaaaa 19321 attctaaagg acagcctttt gtagatccaa tcaagtctgg ctggtttaat aaaaaggaat 19381 tcaggcaagc tatagcctat gcaattaacc gtgaagcgat gaaaataaat gtttttcgcg 19441 gactaggaga accgcaaaat tcctttgttt atgtcaaaag tcccttctac cttccaccag 19501 aaaaaggatt aaaagtttac aattacgatc cagagaaagc aaagaaatta ctattacaag 19561 caggtttcaa atataattct cagaatcaac tattggatgc tgatggtacc cgagtcagat 19621 ttacgctgtt aaccaatgtg gaaaggaaaa cgagggcaga tatggcagcg caaatgagac 19681 aggatctggc taacattgga attcatcttg atttgcaagt 
ccttactttt aacgcctata 19741 tagataaact taaagtgtcc cagaattggg attgttacct tggtgggttt gctggcggtg 19801 gtgttgaacc tcacggtgct agcaatatct ggagaattaa aggagcatct cacgcattta 19861 atcaaggttc acaacctggt aaacctccaa tcattggctg ggaagcttcc gattgggaaa 19921 aggaaattga ccgactttat gttaagggtg cacaagaatt ggatgagaac aagcgcaagg 19981 aaatttatta cgaatatcag cgaattgcct cggaacagtt gccgtttatt cacttggtgg 20041 aacggttaaa tttgcaggca gtgcgcgatc gcttccaagg tatcaaatat agtgctctcg 20101 gtggtccatt ctggaacttg tatgaaatca aagttacaca aaactagttg tatatgagtt 20161 gagctaaatt cagcctgcaa aatatcaaag caaccaattt ttgtttaccg ctgaaggact 20221 agacaaccct atcttaccag gggtaaaaag ttcgtctgca tcctctaaac cgacgtcagc 20281 gaatacgttt gggctgcttt cttcaaaaac aggttcttct tctgtcatgc ttgttcctct 20341 ctagctagcg cgtccttata gcgttgttta atcaggtgta tgagtgcgaa aatatgccaa 20401 gaatctgttc catgatgttt gcccacttcc catttcgtcg tcgctggctt tcgattgccc 20461 tgatcttaac tctatggcta agttgtcagc ttattttcac cagctgcaac ccagccaatt 20521 ttaaaactca ggcggcgcaa gtctcccaat gggtcaccac aactttgggt gatccaaaaa 20581 cttttaacta cgccttcaat caagagtatc ctcacgtttt ttatttacta acgattgcct 20641 cagcctcacc caaatcaata ttttcctggc tttcctgctt cactttcact tgttaaaaat 20701 ccgtaattac ctgagtttga atccagggta aagtttgtac ttccaccgca ccgggaacgg 20761 gtttatcgac cgctaccatc tcgtcgtaaa cggctgttgg gataatcaca tgagtgtaaa 20821 gttgccggag tagatctagt tggctgattg ctgccagatt cgtaataggc gatgtatcgc 20881 gctccaacaa tcacagcaac cccatcgatt gcagcgttct gacatctgac tggaaatctt 20941 gcacatcata gttgatgcac agtccgcgct tggcaagttc atgttgaaat tcaatcactg 21001 tcagtccagt ccaagcacgg gctttgccac tactgatttt ttcctgcttg tagagcatga 21061 tagcaatctc caactttaat tcatcctcgg tcatttttgt tgcccggaga atgtcatcag 21121 gtattaagac gctcatagga atttggtttg gggttgtatt ttctataatt ttagggtatg 21181 taaaacagcg aaacagaaca gttgcggttt aacgcaaccg ttaagctgtt tcttgataga 21241 acagaaattc ttgttgactt gaaactactg tatatattga ataccaagat ttggatttta 21301 caaagttcca caaccatcaa caagcaaata acaactaaca aatgacaaac ggtaaaagac 21361 aaatgaccga tgacaaatct 
ttgcgatcgc tcttagaaag cgttgccaat ggtaacgtga 21421 cacctgatgg agcattagaa aaactgaaac acttggctta tgaacctgta ggcgattttg 21481 ccaaagttga ccatcatcgc gccttgagaa caggatttcc agaagtaatt tggggaccag 21541 gcaaaactcc tgatcaaatt gctcaaatta tggaagctat gcgccagcgt aactcagtgg 21601 tgatggcgac gcgcatagaa ccagatgttt tcgccacact ggaaacaaaa gttcaaggtt 21661 tgcgctacta caacttggcg cgaatttgtg caattactcc tcctaccatt gaaccacagt 21721 atcctggcac aattgggatt ctttctgctg gcactgctga tttagctgta gctgaggaag 21781 ctgctgtcac cgcagaactt tctggtttcc atgtgcagcg tctttgggat gtgggagttg 21841 caggaattca tcgtttacta aataaccgcc atgttattga atcagcatca gtcttgattg 21901 ttgtagcggg gatggaaggc gctttaccta gtgtagttgc aggtttagca gattgtcctg 21961 ttgtcgctgt tcccactagc attggttacg gcgcaagttt tggaggttta gcgccactct 22021 tgacaatgct caactcttgt gcagctggtg tgggtgtagt gaatattgat aatggttttg 22081 gtgcagcagt cttggcgggg caaattgtgc gtacagctta taaattaagg aagtaggacg 22141 ttttgtgtgt acattgtccg ccactattac ctatactgta aataaagact aagctgaagc 22201 taggaattag tcatgtcaga actatcatct gcaaaagatt gtcgcatcga tttacgagtc 22261 acccaagaac aaaaagaact attagaacgt gctgcaagtc ttaaaggaat ttctttgagt 22321 gcttatacac tgtttcatgt tttgcccgcc gctaaacaag atatagatac tcatgaaagg 22381 ctagtgctgt ctaatcgtga cagagatttg tttatgtcag taatggaaaa tccgccagaa 22441 ctcaaaggaa aactcaaatc tgctatccac aaatatagaa agaagtatga caagtcatag 22501 ggaggcgaga tggacttttg taccaattga taaaaaacat cagagagatt cttttgattg 22561 cggttatccg attttgaatg attatctcaa aaaatatgcg cggcaaaatc ataataaggg 22621 agttgccaaa acatttgtgg caattccggc atcgggaagt ttgaaaattg atggatatta 22681 tactgtcagc gccagtgtca ttgagtacga atctttaccg gaatcttatc aacgtggaat 22741 gcctgcctat ccaattccag cgatactaat tgggagatta gctgtagatc atccagtgaa 22801 aggacaaggt ttgggagggg aattgttagc cgatgctctc taccgtgctg ttcgtgcttc 22861 tcaagaaatt ggggtatatg ctgtcagagt cgatgctatt gattttcagg caagggaatt 22921 ttatctcaag tatgagttta ttccttttca agatcaagaa ctctcactat ttctaccgat 22981 ggcaaccata attggagagt ttagttaact tcacaatttt cccacactca tgacaaaaca 23041 
acaaacttgg agtcaacggt ttgaatcggc actacaccct gcgatcgctc gctttaatgc 23101 aagtattaat tttgacattg aattaatcga gtatgacctc acaggctctc aagctcatgc 23161 caaaatgctg gctcacactg gtatcatttc ccaagaagaa ggagaacaac tcgtcgcagc 23221 tttagagcaa atccgccagg aatatcgcca agacaaattt cacccaggta tcgaagcgga 23281 agacgtacac tttgccgttg agcgccgtct tgtggaaatt gttggtgatt tgggtaaaaa 23341 gttacacact gctcgctccc gcaatgacca agtaggtact gatactagac tttacctccg 23401 ggatcaaatt caacaaattc gtgagcaatt acgagaattt caacaagtct tactagatat 23461 cgccgaaaaa cacgttgaaa cgctgattcc tggatatacc cacctacaac gcgcccaacc 23521 tctgagtttg gctcaccact tgctggcata ctttcatatg gcacaacgcg actgggaacg 23581 tttaggagat gtttctcacc gagtcaatat ctcaccattg ggttgcggtg ctttagcagg 23641 aacgactttc cctatagatc gccactacac tgcaaaactc ttggattttg agggagttta 23701 tgctaatagc cttgatggag tgagcgatcg cgattttgcc atcgagtttc tctgtgctgc 23761 gagtctgatt atggttcacc tcagtcgtct gtcagaagaa gtcatccttt gggcatcaga 23821 agaatttagc tttgtgactc tcaccgatag ctgcgcgact ggttccagca ttatgcccca 23881 aaaaaagaac cccgacgtac cagaattggt gcgcgggaaa actggacgcg tatttggtca 23941 tcttcaggca atgttggtca tgatgaaggg gctacccttg gcttacaaca aagacctgca 24001 agaggacaaa gaaggtttat ttgacagtgt cgtcacagtc aaagcttgtt tagaagcaat 24061 gacaatttta cttcgagaag gtttagaatt ccgtactcag cgcttggcag aagcagtcgc 24121 agaagacttt tctaatgcta ccgatgtagc agactatctt gctgcacggg gtgttccttt 24181 ccgagaagca tacaatcttg tgggaaaagt ggtaaaaact agtattgccg ccggaaaact 24241 cctcaaagat ttgagcttgg aggagtggca acaacttcat ccagcatttg cagaagatat 24301 ttaccaggca atattgcccc agcaagtggt tgcggctcgt aatagctacg gtggcacagg 24361 ctttgcccaa gtcaaaggcg cactccttac cgcccgcgcc caaattactg ctaaataacc 24421 aacaaaggcg tacatagtgt acgccctgac acttcaaaga gcgatagggc aaaagaaatt 24481 tgaacagaga acttattcag catcaattgt tgctgcgccc tcatcctctt cactcttcgg 24541 cggtctttgt tggaaccttc tttggaaacg tcccgttgta gtccgtttac gaaactttcg 24601 ggcatacgcc tggttccgta gcgccttttc tttcttcgga ttgcggcgct tagccatgtt 24661 cacctcatta ataaacaaca aaaaagctac ttctaggctg attgtaggca 
ataacaagta 24721 ttggtattac ttatataact ttagcctagt tttgccacga aatcatttca gacggtacgc 24781 tagtatagca tattctaagt ttaaatttca agcaatttgt agaattcaga atgcctacaa 24841 gttgtagagt atactgtaaa atccttgtat tgcaaaagtc tatgtgttcg agtacgggaa 24901 tagaagaaat caaaataata agaattggct tctacttaga gcaagtataa tgtttttgct 24961 ataagctgaa aatttgcttg attgataagt ctatgtttct acacagacac gactttgagt 25021 ttgagcaaat gctagaatct acactgattc tacgttaatc gttagaatca aaccccaaac 25081 cacaaacatt gttgtatact aagcaactca acatatcagt tgaaaaaatg tgaatggcgg 25141 ggaaactcta cactacaagc caaagatttg aaacttgaac attcttgctt tttttgcagt 25201 ggctgttcag tccacacgta acttatggcg cattggacaa accgtactgg gcattatctt 25261 tcgtcacccg attccaggca cgagcattat tccaatttta ccagatggtc gaattgtgct 25321 gatccgacgg cgcgacaacg gtctgtgggc attacctgga ggtatggtgg actggggaga 25381 ggatgtcaat actgctatcc gacgagagtt gatcgaagaa accggactag aactggtatc 25441 tattcgacgt ttagttggtg tgtactccgc accagatcgc gatcccagaa ttcattcaat 25501 ttgtattgta gctgaagcaa tagtgcaggg aaaaatggaa attcaagata ctttggaagt 25561 gatggaaatc caggctttcc cactcgattc cttacctccg ggacagatgt ctcatgatca 25621 taatcgccag ttgcaagact atttaaatgg cttgacaaca ttagcgtaat gatacttcca 25681 tcatgaattc atacggttca gttaacgcag ccccacgggc agcaattgca acccgtgatc 25741 aataacacaa acatctctct cactttctcg ctccctcaag agtgtttttt gagactaaga 25801 gtagctaggt agaacgtaag cgaaactcaa tctacattgg ttcgatgttt tttgctttaa 25861 ctgaaccgta ttgagatata tttaataatt agtagttaga agaaaattca atgaactact 25921 agtaacaatc tagtacagcc ttcgcaattc aaaattcaaa attcaaaaat atggctgatc 25981 tcgcaagctt ttagctaagt ttgaattttt gcttgatttc cggagttttg tattagcagt 26041 aaatctcaac tcccaatcca aaacatattt aatagaacgc ttatattaaa accttaagta 26101 tgcattcgat attccgtaac tcccggataa agacctctca agggatgctt ttctggcgtg 26161 aaattggaga aggaattccc ataattttct tacatagtgc ttggaaagat ggcagccagt 26221 ggatatcggt tatggagatg ctagcacaag atttccattg ctttgcacca gatgtattag 26281 gattcggcga gtcagacttt cctaatgttc actattcaat tgatttgcaa gtggagtgtt 26341 tagcacaatt gctacatact ttaaagcgtg 
ctttaaagca aaaaagggta tatttaattg 26401 gagattccct tggaggttgg attgcggcta gttacgcttt gaagtatcca gaacaaatta 26461 atggtttggt attactagcg ccagagggtg tagaaataga agggcaaaaa aaacgttggg 26521 agacgatgca gcggatatta aaccgttcaa cactactgtt tcaattgttg cgactatttc 26581 gtcccataac gaaaattttt ggtttagatc aaaaaattga acaggattgg gagtttcatc 26641 agcgaatgct gcaatacccc acagcttgtg agcttctctt tgaacgccag catccagaaa 26701 ttcaagcaga attgttacac gacgaattgt cctttataac agtcccggtc ttaattttac 26761 aaggtgaaaa agacaaccaa gaagctatag ccatgagtcg ggcttacgct caatatatcc 26821 ctcatgcaca gttgaaaata atcgctcacg cagaggataa cttaccagaa tcttgtgcct 26881 cggttgtggc tggggatatt cgcaacttta ttaagggtaa gtaatgtact gctatcctag 26941 ctttttgaga ttgcagatag tttaccagta ggtggcgatt ggtgatacgc cgccagaagt 27001 ctcttactac gactgcaatc gttacgcgat acgcggagcg tgacgccaag ggcgtatcgc 27061 accacaaaag tacactggta aacattgctg ctctcaactg tagtgtttat ctgttgcata 27121 caccacgacg ccataagggt acagcaatgc tgtgctccta caattgactt gtattgcacg 27181 acggttgaaa acggctatct ctcgttccta tgctcagcat gggaatgtct aaaacgaggc 27241 tgctggctcc caatgattac attgaaccag cagagtagcc gctatattcc ataagttgtt 27301 catgtatttc cagaaaaatt actgagctat agttttcttt ttgggataca taaagtattt 27361 tttaatactt ggttgcataa ttgacttttt tttataaaga cacctcaaaa atatatattt 27421 cttttataga gtattatact atacacgcat aaagtgattt ttagctgcca aggacattac 27481 acacctaaat aatatcccta gtaaaacttg tcgttttttc ttaacaattt tgacaattct 27541 atttattaca cgtaatcagc acattatatc tgtgttgaga aagatacaga cgtgtataaa 27601 cgagagtcat gcaaacccaa gccagcttgg agcagctttt aaagtcattt aacccggtgc 27661 atcttcgtga aaccttatct aaccagagaa ttaggctctg aacttttttt aggagatatc 27721 aaggatatgc gaaggttatt tatggctatt gcgggagcgg tgatcatggc tttaggaagt 27781 ggagccatat ccccagctca cgctgaactc ttagacttca gctttactac tgtaagcggt 27841 gcaacaggtt cgtttacctt agacacggac actcccgcct ctggtgagtc atcccttgga 27901 ggtggagctg cgtttccagg gactccagga attttatacc ctaatgctgt ttctaactta 27961 tttttgtcat ccacacaact aaatttgagc ggcgtcaccg ctgactatga ggttgtccca 28021 ggtttgactt 
ctgcaggtct tggtcttcct ccaggtctgg gagttcttag cggtcctgtt 28081 tatccagccg gatgttcaac aggaactaac ttcacatgtg cagtgactat tggtgtactc 28141 tactcgggta gcccctcaga actttccgat gacccagctt cttacttaag tcttggaatt 28201 gaatttttcg atcccgagac cgcagaacag attaacctta ccccggatct gtataccaac 28261 tttcaagttg tgcgtaggca agctgtccct gagtcgaact ctagcttaag tttattagct 28321 tttggtattg ggggcgtagg tttactactg aaacgtaaga agaacagcaa caagccgcta 28381 acgatctaat aggacttacg caccatctgc cagaaacccg gtttcttcgt tcgtatttgg 28441 ttgcgaaaca gacgattttt ggtagaaacc gggtttcttt gcgtaagtcc tgtctaaaaa 28501 aatagcgata gcgttcgcgt agcgtgtccc tttgggactc agcgctgcag caaagcagca 28561 gatcgcattc cccaaaatat caaattcgct tcaacattca aactatctac aatctcagga 28621 aactgggatt gcagatagtt tactaacaaa gtgcgatcgc ccccagtcat cacaacattc 28681 ccttgcggaa aatcccgcca ccacgcctca ataaaatctt ttattccagc aactagagtg 28741 taaataaccc cactttgcat tgcttcttga gtattcaaag cataccgctg gggtaattgc 28801 tggggtaact caactgttgg taattgtcct gttctttcag caagagtcgc aagctgcaac 28861 cccagtcctg gaagaattgc gcctccaacc aaatgttggt tagcatccgc agcagtgaag 28921 gtaagtgctg tccccgcatc aatcaccagc acaggaaaac cccaattgtt cccagcaccc 28981 cataaagcta atgcacggtc aattcccagt gtgggataaa ttccctttat gggaacttgg 29041 tctaatgtaa ggatgcgaac attaggataa gtttgccaca gtgctgtttg actgggaaca 29101 acggaggcta ccaggagtgg aggaggggag gggggagatg aggagagtgt ttttaaaaaa 29161 tcttccatgt tcccgtgtct cgctttccca gttgcttcat tgcttgggaa aatctttagt 29221 aataaatcct tcagtgtttg agattgagac agttgctgga taacatctgc tggtagataa 29281 tctgtatccc aagcatgaat aagcgtttcg cctgtaaagt gcgcccaatg gagtcgggaa 29341 tttccaatca tcaaagctag ccaaattgtc tggtgagtgt ctctcatctg ctgtttaagc 29401 tttacactct ttatcaatag tgaacctttt catttctctt atcaaatata tagaaaattt 29461 aataaaaatt tacaaaaaaa acatgtcaca atagggacag agtaataaat actcaattgc 29521 taaggagact agttatggta gcgctcaccg aaaaaaccaa aaaaaggcta accattgaga 29581 ttggcgagat tgctcccgag acgacagcaa ttcgctcttt ggattgggat cgcgatcgct 29641 ttgatatcga gttcggtcta caaaacggta cgacctacaa ttcattcctg atacgcggtg 
29701 agaaaatagc gctggttgac acttcccacg aaaagtttcg ccagctgtac ttagataccc 29761 ttaaagatct cgttaatcca gcagatattc aatatcttat tatcagccac actgagccag 29821 accacagcgg tttagtgaaa gatgttttgc aacttgcgcc ggatgtaaca gttgtcggtt 29881 ctaaagttgc tatccagttt ttggagaatt tagtgcatcg tccattcaaa cggcaaattg 29941 ttaaaaatgg cgatcgcctc gatttaggca acggacacga actagaattt gtcatcgcac 30001 caaatctaca ctggcccgac accatcttca gctttgacca taaaacccaa actctgttta 30061 cctgtgatgc gtttgggatg cactattgct cagatagcac ctttgacgaa gacttaaaaa 30121 caatagaagc agattttcaa tattactacg aatgcctgat ggctccaaac gcccgttcag 30181 ttctgtctgc cctcaagcgg atggatgaac tggaaaaaat cagcatgatt gctacaggtc 30241 atggacctct attatcccat aacgttgacg aactcactgg gcgttatcgc aattggagca 30301 aaacacaagc gaaggcagaa accagtgtag cagtctttta tgtttcagat tacggttata 30361 gcgatcgcct cgctcaagca attgccagtg gtatcaccaa aactggcgtc gccacagaaa 30421 tggtagactt gcgtccaggc gttgacttac aagaattgcg ggaacttgtc agccgttgtg 30481 ctggtattgt tgttggaatg cctccagctt ctggtgcagc caatatccaa gcagccttga 30541 gtaccatttt aggctcagtt catgaaaagc aagcggtggg catttttgaa tcaggcggtg 30601 gagatgacga accaattgat ccattgctga gtaaattccg gaatttgggt ttaataacag 30661 catttccagg aattcggatt aaacaaacac ctacagaaaa cacctacaag cagtgtgaag 30721 aagcggggac agatatcgga cagtgggtga cacgcgatcg cagcatcaag cagatgaaat 30781 ccctaggtgc tgacgttgat aaagcattag gtagaatcag tggtggactg tacatcatca 30841 ccgccaaaaa aggcgatgta tccagcgcca tgttagcttc ttgggtttct caagccagct 30901 tcaaacccat aggaatatct atcgctgttg ctaaagatag ggcgatcgaa tcacttatgc 30961 aggtgggcga taaattcgtg ctaaacgtct tggaagaagg taactatcaa aaattgatga 31021 agcacttcct caaacgtttt gccccaggtg ctgaccgttt tgaaggagtc agaacccagc 31081 cagccgaaaa tggtgcaccc attcttacag acgctttagc atatatggag tgcgaagtca 31141 tcagtaggat ggatggtggc gaccactggc tagtatacag caccgtatac gccggacgag 31201 tttccaaacc agaaaccctc accgcagtcc accatcgtaa agtgggtaat cattattaac 31261 agttatcagt taacagttaa cagttatcag ttaattgtta actgtgtact gatttaacaa 31321 caccagtcca ctctactgat aactaataac tgcttatgag 
tcttttgtgt ttagttcacg 31381 gtgcttatct gggcgcgtgg tgttgggatc tactcacccc agagatagaa gcacgtggtc 31441 atcagacggt agcagtggat ctcccaattg aagaccccac tgctggtgtt gctcaatatg 31501 ccgaagtcgt gagcaaggcg ctgcaaggat ttgaagacga tgtggtgctg gtaggtcatt 31561 cgatggcagg cttaattatc ccccttgttg ccagccagcg tccagtgcgt cagcttgttt 31621 tcattgcggg agttatcccg cacatcggtg taagtcttct cgaccagtct catgacgagc 31681 cagatcccaa cttactcaaa gcaataggct acgaactccc agaagctgat aaattcgagc 31741 agttcagcga tgagccaaat atgttcaatc cggcggcgct ggggaaaaat cttttacaag 31801 acgaggctgt agcgagagaa tttctctttc acgactgcgc ctcagatgtg gcgagttggg 31861 cttttccgaa gttgcgcaat cagcagtttt tatacataag tgaggtcagt cctctacaag 31921 cttggccaga tgtaaagtgt acgtatatcg tctgtggtga ggatcgttcg ctttctcctg 31981 catggtgtcg atatgctgca cgcaaacgtc ttggagttga tgcaattgag ttacctcaaa 32041 gcagccattg cccaatgtta tctcaccctg ctcagctcgc cgatatacta gcgaaagtgg 32101 cctctacatg aggtacacag ttcaatataa aatcttatta aaataagact tatgccagaa 32161 aataaacccc gtgacgttca ggtttacccc atagctacag acacaactgt gtttcgttcc 32221 cgcagttgga cacgcctgag atttgaaata gaatacgctc tcgcaaaagg gacaacagca 32281 aattcttata tcatcaaagg agataaaatt gctctggttg accctccagg agaaacattt 32341 acagaaattt tcctgaaagc gttgcagcaa aggttcgatt taaaaaaaat tgattacgtc 32401 attctcggtc acatcaatcc caaccgcgcc gcaactctca aagctttgct agaactcgca 32461 ccgcaaatca catttgtgtg ttctaaccca ggagcaaaaa atttgcgtgg ggcattggaa 32521 aatccagatt taccagtgat ggtgatgcgt ggggaagaaa ctttagattt gggcaaagga 32581 catcatctac agtttatccc cactcccaat cctcgctatc cagacttact ttgcacctac 32641 gatccgcaaa cagaaattct ctacacagat aagctttttg gcgcgcatat ctgtggcgac 32701 caagtttttg atgaaggctg ggaaatcatt aacgaagata aacgctatta ctttgattgt 32761 ctgatggctc cccatgctcg tcaagtcgaa acagcactgg ataaacttgg cgacttaccc 32821 gtgagaatgt atgcgactgg gcatggacct ctggtacgct acagcttaat tgaactaacg 32881 gaattttacc gccaatggtg tcaggagcaa gcatcgcaag atacatctgt cgcgttgatt 32941 tatgcttcgg cgtatggaaa caccgcgact cttgctcagg cgatcgctcg tggtatgaca 33001 aaagcaggcg ttgctgtaga 
atcaattaac tgcgaatttg ccgacccaga agaaatccgc 33061 gctgctgtgg agaaagcatc tggctttatc attggttctc ctacccttgg tggtcatgca 33121 ccgactcccg tgcaagtcgc tttgggtatt gtactttcca ccgcgactaa caacaagctt 33181 gctggtgtct ttggttcttt cggctggagt ggggaagcag tcgatattat cgagggtaaa 33241 ctgaaagacg ctggctatcg atttggtttt gaaactatca gggtgaagtt taaacccaat 33301 gatgctaccc tgcaaatgtg tgaagaagca ggaaccgact ttgcccaagc actgaagaaa 33361 gccaaaaaag tacgcactcc aagtcaaccc gcgacaactg ttgaacaagc agtcgggcgc 33421 gttgttggtt cattgtgtat cctcacagca aagcaaggcg atatttccag tgcaatgtta 33481 gcctcttggg tatctcaagc aagctttaat ccacctggtt tgacagttgc tgtggctaaa 33541 gatcgtgccg tggaatccct gacacactct ggtaataaat ttgtcataaa tgtgttaaag 33601 gaaggaaatc acatcggctt aatgaagcac ttcctcaaac cctttggtcc aggacaagac 33661 cgatttgaga gtgtcgcaac tcaagaagtt gagaacggtt gtcctgttct taatgatgct 33721 ttagcttact tggaatgctc cgtaaaaacc cggatggaag taggcgatca ttggcttgtt 33781 tatgcaactg tggataatgg caaagtttta gatggtgagg gggttaccgc tgtgcatcat 33841 cgtaaatctg gtaatcatta ttaaggacct cggtaggagg gaacacttcg acaagctcag 33901 tgcatcgcag ggaataggga acagggaaca gggaattaaa actaaatatt aggacttacg 33961 cactcgtgac aaaaattctt cgctgacgac aaaaacgccg atctgcctga cttccctact 34021 actcttcttt cctgaggtgg atgttgatga tacgctcttt ctatacggtc taatgtaatg 34081 ttgcctagtg tagggggtta tgtttatgta ttttttgatg gaagcagcag cgcaagcagt 34141 caaagaacca taccaatttc cttttgcttt caccgctgtg tatgtgattg gctttatcgc 34201 tgctgtcacc attggctcaa ttgcttggta caattctaaa cgtcccgtgg gttgggaagc 34261 caaagagcgt cctgatattg tacctaaagt tgacaaagac ccaactccag gactgggtga 34321 gccgaagtcc tagatagttc gtaagttgtg agtgctgata ccaaatccgc ttcttgtgtc 34381 tcctttgaac cgtagcacag gtggggctac acaaaacttg tccataagcg tgcggactat 34441 tgtccaagga cagttgttga ctcggagttt ggttttactc aggacttatt actcagaact 34501 ggtggactct ttcaattctg aacttcaacc caacgacggt aaagcttttg aatttgtttc 34561 agaacagcgt tatgccgagg ttgggttgaa tcacctgtgt ctgataattg ttgtagtttg 34621 gtgaggagtc ttgcttcttc tatcgccact tcaccatcac tgtaaattaa gccactgata 34681 
gcttcaatca gattttggca ttgttctgga gtggggcgat cgcctaaata ttccttcacc 34741 cactcgtaaa actcttctgg ctgtacagca acgagttcat gtagcaaagg ctgaatttct 34801 ggctcgttag ccataccctt ttcttgagcg attttgcgaa gatattctct ctcttctggc 34861 tgaattctgc catcaatcca agcagctcct atcaggattt ttaccaagtt tttcatactg 34921 gaattattaa ccatgatttt tcttaaggta ttaagatttg ggaatatgtc ttcttactta 34981 ttttccccat tctctatacc tcttaactga aaaatctcaa aactttaact gttttttctt 35041 aagcgcacca taaaaaaccc atccatgtct tgctggtggg gtaatacttt gatccagcct 35101 tgtggagtgg agtatgtact cgcgggagaa tcagcactag gagtctctat ttcccaattt 35161 ggattctcag ctagaaaaga cgaaacgact tcttcatttt ctttgggatg cagtgtacag 35221 gtggaataaa cgaggacacc accttgcttg acaaaagtcg atgtatgtga aattagttgt 35281 ttttgaagca gcgagagttc ttccactgtt tctggtgtct gtcgccaacg agcatcagca 35341 tggcggtgca aggttcctaa cccagaacat ggtgcatcta gtaggacacg gtgagcagtg 35401 ttttgaaact ggggcatatt acgactgtca ccagtacaaa cttcaataga ttttaacttt 35461 agccgttgag cattttcttg gagttttttt agacgagagg gagtgcgatg gcgcttcgcg 35521 cccgcctctt tgaggcgatc gcaaccaaaa atcttacctt catccctcat caactctgca 35581 atgtgagttg ttttaccccc tggtgcagca caagcatcaa tcaccacctc acctggttga 35641 gggtcgagta aatgactaac caattgggcg ctactatctt gtacagtcca ccaaccctcg 35701 ttataaccag gtaaattttg aatcggaccc gcatgactca tcaatcgtaa agcttggggt 35761 aggtggggaa ccggttgaga cgcaacacca gcagatgtca acgccgcttc cacttcttct 35821 gttgaagtac gaaggacgtt gactcgcaaa tcaatagctg gagttttatt catccacaag 35881 caaagttgtt ctgtttgcgc taaaccgaat tgttccaccc agacttgaat catccagtca 35941 ggaaagctgt gtaaaatacc taagcgttct atggggttag aagattgaac agagacttta 36001 tccttagttg ttgcatcttc tttgttttct aaacggatat actgtcgcaa tagaccgttc 36061 acaaaacccg ttagtccaga aaagccattc tctctggcga gttggacagt cgtattcaca 36121 gcagcacttt caggaattcg ttgttgatac tgtaattggt ataagcccaa atgtagaata 36181 gtgcggaggt cttttggctg ttggtgggat ttctttttcc ctaattggtc gataaaagcg 36241 tcaagagtac gctgtcttct ggtacagcca taaactaatt ctgtgacgag gcgtcggtca 36301 gtatctggta aacttgcttt ttgtagcact ttgtctaggg caatatcaac 
ataagcccct 36361 ttgtgaacgt ctcgcaaggc aataaaagct agttgacggg gggatgtgag caattcaaaa 36421 ttcaaaattc aaaatatgcc cttcgggcac gcactttaca aaattcaaga ttggacaagc 36481 taatggtagc aagtatcttg gcaattttgg cgatattagt gaactgcgtt gaagcaataa 36541 ggtgcgcttg catgagcata acgcaccctg gtgtgaaaat gaactttaag ttatattaca 36601 aggcttaacc aaaatataga gtattcgcct caaccatttc aaactcacaa gcccgagaag 36661 caggttctac actaaatgat ccaaaactga ccaaaggtaa aaatctaggg ttgcgcaata 36721 gttcttttgc taataagaat ccagcaattg tccaagtttg atattttctg gcttgtttgc 36781 cagtgagtcg tcctttctta ccatcgtaat actctggcca ttcatctaga tgtaagcgcg 36841 cttctgcaat ttcaatagcc ttgtgtgcaa gatccattct atttgttttc acagcagctg 36901 ctgataacaa ccacagtaaa acaggccaac ttccaccatt atgatatgac cagggtatgt 36961 ttttcgggtc acatccagtg acaatcttat attcttcgtg ttccaaagct gggaaacaaa 37021 ttttcatggg catatctccc accaaatcct cccatcgctc ttcgatgaga ttcataattg 37081 cttgtgattg ctcttcactg gcaaggtctg agacaataga gatcatgttt gctaaacaaa 37141 agaagcgagt atctagttgc gaaggtccaa cattaccaac tagataaccg ccttttttcg 37201 gcagccactt gtctaattca taataaggta gtgaatcggc atatacgttg aaggggttcg 37261 ctgcggcttt gccatactct tcacttttaa agcgataaat cttatttaaa cgatgaagat 37321 ccatccaata atgctggcga atatgagcct gtaaaagcgg taagcgatta tctatagccg 37381 taacaatatc ttcattaccc tgacaaacca gtaattctct agctgcacgc aatgcagcaa 37441 aaaatagagc ttgaatttcc aatggatgcc caaagatacc catacgacga tcaatcatac 37501 aagcaccatc tggaaccaat agcgtcgggt acatatcaaa ccgagttgct aaacacagtt 37561 ccataattaa tcttagaccg tgttgaaaat ccggttgata tgctagagaa aaatctctcg 37621 tagcaaccac ataagcacgc aataaaatca cccaccacaa gcaagaatca acaggtgtga 37681 ctcgtgcaat agcatgttca ccaaaatcgg gttctaaata ttcttcttca ccttcgcata 37741 caactttgaa gctagctggc attaaacccc gacctggcat ataagcattc agctgtcttt 37801 ctttcggctg taaccttaaa gtttcttcta aaaagttacg gacaatatct gttctacctt 37861 tgatcagaaa tattaaagca gaagagacaa aatcccgaac aaaacattgg tcataattca 37921 gcgcttctat tgtcacatca taagctgcta ccgtccctac aggacgacct cgatagtaaa 37981 gaattgagtt ctctattgct gcccacgcct 
ctttttctac actgtcggtg gataccaagt 38041 catttaattg cattttctga gcaactccat ttatttcgta aactgtgatt tggttgtaaa 38101 tgcaccaaga tttaagtttt agctgttgtt ttcatcataa cagtcatatt ttttgaaata 38161 gagaatatac actgttcgac gatactaaca gcaaaaatct tgcattttat gcttttgcgg 38221 ctttcttgag agttcttatc tttgtgttaa ctctctcttt ttaaaaaatt tcttgtcact 38281 tcaaacagtc aatgaaaatt tattctctga ggatttatga gcataatatt tgtttgtttg 38341 caacccagaa atatttgttt tcccaaacaa tgacatactt gagtgacttt gtaacatttg 38401 gctcaattca aagtagcaac ttaagtcatt ttactgaaaa gacctagata ttaattggac 38461 gatcaacgaa atgattgaac catgcctata tgatcaaaaa ttataacaat ttttcacaat 38521 ttgcaatctg attttaattt ttttctttta ctgattagca aaactatcca gattacctca 38581 attccctttg aataacaagc ttctcaaacg tagaatcaat tgaaaatctc cacaaaagta 38641 aaaacacaat acaagtcttc ccgatgtagt ggcgaaaaat tcataagcga cagaagtcaa 38701 aacaaagtaa tttttgaaac aacaaaagtg ttgtattttt ttagacgagt tacgaatctg 38761 tttaatgacg gaacactttt gcgacagaca aatccacttt ttgtgttatg ccattcgata 38821 cttatccgtg aataccccat gatttactat ttactaagaa agaaatttct ctccacgatt 38881 tgggtaactg cttcgtgact agacaaaaaa aatagcagtt aggtgtccgc aaaactataa 38941 caggttgcag tgagtgaaca tatcgatcaa ctatggatgc ccgtcaaagt acgccatgtc 39001 ttagtaacat atgtatactc catagttagt aggatggcag tacaaatcca gctactccct 39061 ctacacgaaa attctattct attatgctga ctcaatagaa actgtgattt aaaagtccac 39121 ttctaaatct cctagtgtgt cattccaact aatcctctaa ctgtgggcaa gactcatttg 39181 atatacaatg agattgtata catatcaata catttgtatt tgtgtcatat attagacaga 39241 caaaaatgtt acatgtacta atactctcaa aactgcaatc taacaaaatt ttataatctt 39301 tggcattctt tttttactag aacgattttg aagaagaatt ctgtgtctgt atatacatat 39361 tacattgaaa aagcgatttt tgttcagcaa aaatccgtaa actatttgaa ttttcttgtg 39421 aattttatgt aaagagtcaa actggttttt tgtaatattt atacatatat ttgctgtcgt 39481 atgttttgtt tttttctccg ttgataaata agtataaccg tgaaaataag gataactttg 39541 taaccaaaca tgaagcgatt aaaagccttc tgattggttt gttatcaatt ttaaaaatgc 39601 ttttagctaa cgcaccaacg actgtgcaag gtgcgctgcg tgaaaacgca cccggagagt 39661 gatttaacat 
tttttacaaa gcttgaattt ttctcgccaa cttacttact ggagttaaca 39721 catcctacaa tttaaggttt acttaataaa tcagttctca caaggctgtt tcaaaaaata 39781 gttgtaaata gttgtaaact actttaaata ggtagcgatt tagttaatcg atttgcatga 39841 tttgtgccat ttcattgctg aggaatgagt gaaatccgtc gttgaaaaac aatcatctca 39901 gtaccaacaa caggattccg tgcgaccagc ttacctcgtg agtcaataat atctaaatgg 39961 atgaaatcca attctttgga acgttccagc atttgagcca agagtttgtc ttgctctgat 40021 tgtttgagat tataccattc atcctgaatt gtgatagcaa gactactgct acgaaagtta 40081 gcttgaattg actgtattaa accgggggcg aagcgatcgc taacctctgc tacttgattt 40141 tcgatagctg caatcaaagt ttgttctggt gtcaaaataa tggttggcgt cggagttggt 40201 tccggttctg gttctggtgg tggagtttcc tccactggcg gcggagtttc ctcagctact 40261 ggtggcggtg gagtttcttc agttggcggc ggggtttcct cagctattgg tggcgtgaga 40321 ggtgtttcta cctctggcgc agtcgttatc gttgcggatg gaggtgtttc tacttctggt 40381 ggaattgcga cttcagtagg cttactagca aacaccgtag aacttgtcca aacaagaatc 40441 actgtaattc cagcaactat accggttatt gctgtatccg acaactttct agataaattt 40501 cgtggtaatc ttgagcgaat tttacctaaa acagcactcc atacaaactg gagtttctgg 40561 aaaaaaccag gggtttcttt atccccagca gagggttgtg tctctaattt gtctaccgtt 40621 gtttctagaa ccccaattgt ccctctgaga attcggatag ttattgcctt ccaaattggc 40681 tggactgtct ggtattgttc ttggggaggt tgagatgctg actcaaccgt cggttgcggt 40741 gggttcgctt ccggttcagg ggaaaatggt ggttgtgaat tctgattgtc ttgagacatg 40801 gaactttaac cgacagctgc gaatcaaaga caagagactt tcttcttgta gtgcagcctc 40861 ctcagcctta atttatcact tcattgcatc agctatactt tactatttat tgaatttggt 40921 aattttatga agttgaaacg tcgtcaattt ttatttttca gtagcctcag cgccattggc 40981 ttaggattta taggttgtgt cactcgtcaa gctggcaaaa atcctggaat tgctgtatca 41041 aaagcggcta caccccctaa tccagacaaa aacaactcgt tactacgttt tgtttctgtt 41101 gcagatacag gtactggaga tcaaggacaa tacgctgtag ctaaagcaat gactaactat 41161 cacagtaaaa atccctatga tttagtcgtt ctagctggtg ataacattta caccaatggt 41221 gaaatggaaa aaattggtgc ggtatttgag cgtccttatg caccattact aaaacaaggt 41281 gtgaaatttc aggctgctct cggaaatcac gatattcgta ccgcaaacgg cgaccttcaa 
41341 ctaaaatatc tcaattttaa tatgaagggg cgatattaca catttcaacg cgatcatgtg 41401 caattttttg ttttagatac gaacaccaat gctgattggc aaaagcagct aaaatggttg 41461 gagcaagaat taagcgcttc taaaacacct tggaaaattg tgtatggtca tcatccggtt 41521 tatgcatccg gtgtgtatgg tagcaatcca acttttatta aaagctttac tccgttattt 41581 caaaaatacg atgttcaact atatataaat ggtcatgaac atcattatga acgcactcgt 41641 tcaattaatg gaactactta tttaatatgt ggcggtggtg cgggaactcg tcctgtcggt 41701 cgttccgaat ggacagaata ttcggcagaa aagttaagtt ttgctgccta cgaagtttat 41761 gcagatagaa tagaaattag tggtattggt actgataata gcgtttttga taaagggatt 41821 gttcaactta aatcagttta agttttgaac ctctgataaa actcatcttt tggggaaatc 41881 aaaccacaga tatacacaaa tcacttatct gcgtatatct gtttgtatct gcggttcaaa 41941 atagctctta ccttaattta cagcaatttt cagataaata gaccacagtg attgaaccaa 42001 ccaaaggcat ca // LOCUS NODE_588_length_41329_cov_5.28594341329 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 41329) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 41329) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..41329 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(86..1438) /gene="accC" /locus_tag="DP116_04415" CDS complement(86..1438) /gene="accC" /locus_tag="DP116_04415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acetyl-CoA carboxylase biotin carboxylase subunit" /protein_id="PRJNA477356:DP116_04415" /translation="MKFDKILIANRGEIALRILRACEEMGIATVAVHSTVDRNALHVQ LADEAVCIGEAASSKSYLNIPNIIAAALIHNASAIHPGYGFLAENARFAEICADHHIA FIGPSPEAIRLMGDKSTAKETMQKAGVPTVPGSDGLIESEQEGLAIANKIGYPVMIKA TAGGGGRGMRLVHYESEFVKSYQAAQGEAGAAFGNSGVYLEKFIERPRHIEFQILADN YGNVIHLGERDCSIQRRNQKLLEEAPSPALDPDLREKMGQAAVKAAEFINYSGAGTIE FLLDKSGKFYFMEMNTRIQVEHPVTEMITGIDLVVEQIRIAQGERLQLTQDQVVLRGH AIECRINAEDPDHDFRPSAGRISGYLPPGGPGVRIDSHVYTDYQIPPYYDSLIGKLIV WGVDRPTAINRMKRALRECAITGLPTTIGFHQKIMEHPQFLQGQVYTSFVQEMKTLVQ " gene complement(1509..1733) /locus_tag="DP116_04420" CDS complement(1509..1733) /locus_tag="DP116_04420" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04420" /translation="MAAQVILKVSRPQLLENWFYGEAHGTRLLCQSRENELILGNIAA ASVVNTRRGKQRNLSLYLEGILGISSQNLK" gene 1864..1944 /locus_tag="DP116_04425" tRNA 1864..1944 /locus_tag="DP116_04425" /product="tRNA-Leu" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:1898..1900,aa:Leu,seq:gag) gene 2094..2723 /locus_tag="DP116_04430" CDS 2094..2723 /locus_tag="DP116_04430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316038.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF98 domain-containing protein" /protein_id="PRJNA477356:DP116_04430" /translation="MTATFRPTNNLTLPVGWHRLSATWQGGEEVIQQSLPHTQLAPAW QLLLLGDGSPTRHLQLLTGEPTEVDVIDMSLIGMDLDGAPELIKAVPGPRLRRQVWLR TASGQRLAYATSWWEASHVDEYLQNRSLPIWASLARLRTELYRDVQGIYYGDSDALQS GFDETGPFWGRHYLFWHHGQPLTLIYEVFSPYLTKYLGPTQLSSINAEV" gene complement(2824..3792) /gene="argC" /locus_tag="DP116_04435" CDS complement(2824..3792) /gene="argC" /locus_tag="DP116_04435" /EC_number="1.2.1.38" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N-acetyl-gamma-glutamyl-phosphate reductase" /protein_id="PRJNA477356:DP116_04435" /translation="MTKPKIFIDGESGTTGLQIYSRLNQRDDIELVNIEPSRRRDSAE RAKLINAVDVAILCLPDDAAREAVSFVRSTKVKILDASTAHRTAEGWVYGFPELSSGQ REKIASAQFVSVPGCYPTGFLACIRPLIAKGLLPSHFPITINAVSGYSGGGKNLIKDY DAFHDQQDGATSLYPYGIYSTQFGHKHVKEMHKYSGLASPPLFVPAVGDFEQGMLVQI PLPLWTLENPPSGEVIHQAIADYYQGEKFVQIAPFQDSTLLRDGKFLDVMAMNGTNIV QIFVFANDTTHEALLVARLDNLGKGASGAAVQNLNIMLGFPEELGL" gene complement(4022..4395) /locus_tag="DP116_04440" /pseudo CDS complement(4022..4395) /locus_tag="DP116_04440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014145811.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Crp/Fnr family transcriptional regulator" gene complement(5236..5424) /locus_tag="DP116_04445" CDS complement(5236..5424) /locus_tag="DP116_04445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008178652.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04445" /translation="MALFPAGSGGSEEVAYGNKGKGILIHTLTEGNGMPLSNRTTPAN GSEREQVIPLLYQVKLKT" gene complement(5390..5740) /locus_tag="DP116_04450" /pseudo CDS complement(5390..5740) /locus_tag="DP116_04450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006510351.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 6178..6471 /locus_tag="DP116_04455" CDS 6178..6471 /locus_tag="DP116_04455" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04455" /translation="MKKNMIQSVLVFALCSFMLLVSGFTAVQDSLASGPLQEQCFWME NISGQFYWVPAPQGKISKQQCYQQNSCGAGGGQSGGGCYKWAISAQAPALPWN" gene 7392..9092 /locus_tag="DP116_04460" CDS 7392..9092 /locus_tag="DP116_04460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876354.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04460" /translation="MKKKKKPSLVLTLSFAGFLICAGSAAYWLLTQGNSSSKDLLPGA NIIPQDALFAVSLSTDPGQWQKLREFGTKETQSLLDKNLVQLRDRFLTNNGYDFQKDI SPWVGDQVTIAVLAPNVSKPVSKPIATNAEATTNEQSMVMILPVKNLEKASSILAQPK AAKGGKWIDRTYENIVIKETEGQVGEKLSAALLDKRFLVITDNSKTTERAIDAYKGQS SLATSPGFAENVPKIYNYQPFGQFYVNVPYSARIAAKSPNRPLPAQVLSQLQNNQGIA GTMTLESQGIRLKSVSWLNPTSPRVLTVENKAGNMQNRLPTETLMMLSGGNLQQFWAD YVSTSQGNPSAPVMPEQLRNGVKSLTNLDLERDLLSWMGREFSFSVIPNIPKQGMADD FRAALVFMVQASDRTRAETALKQLDDVMKNQYQFQIKYTTVDGKPVVNWLGPFGTLTA SHGWLDDDVAFLAVGAPVTDKIVPRPNNTLGSSGLFQDTVPRQPNPTNGQFFLDVEHT AKNFPLPIFLPDQQTLLQATRSIGVTGAVSDSRSTRYDIFLSLQKAGKPDPLPNPTNQ " gene 9610..10653 /gene="ccsB" /locus_tag="DP116_04465" CDS 9610..10653 /gene="ccsB" /locus_tag="DP116_04465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="c-type cytochrome biogenesis protein CcsB" /protein_id="PRJNA477356:DP116_04465" /translation="MNLVGLQNWLDNASFAILFVTMLVYWGGAAFPNVPATVVGTAGM AIANLCIATLLGARWLEAGYFPLSNLYESLFFLTWGITTVHLIAENTSRSRLVGVVTA PVAMCITAFATLTLPSQMQVAEPLVPALKSNWLMMHVSVMMLSYAALMVGSLVAIAFL IVTRAQEIQLQGSSVGTGGYRSNGYRLHKVTDLSAQPSTLAVENNGVTRIESNNNGKT AVLDLVTVTQSQAVAAEPLSPQRLSLAETLDNISYRIIGLGFPLLTIGIIAGAVWANE AWGSYWSWDPKETWALVTWLVFAAYLHARITRGWQGRRPAILAATGFVVVWICYLGVN LLGKGLHSYGWFL" gene 10958..11989 /locus_tag="DP116_04470" CDS 10958..11989 /locus_tag="DP116_04470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009786162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CHAT domain-containing protein" /protein_id="PRJNA477356:DP116_04470" /translation="MKKILVLSANPINTNKLRLDEEVREIQSALERSRHREEFELISR LAVRIDDLRRALLDHEPQVVHFSGHGDGTDGIALEDNFGYVQLVSTESLSNLFKLFKD TVECVLLNACYSETQAEAIYQHINCVIGMKRAITDKAAIHFSKGFYDTLGAGRSYKDA FDLGCNNIDLNSIPEFLTPKIQIRDNFKTLFFKKQSTTKLIKQKPSQSFTISGGQLSN VQIGGQAGRDMDVTQNQLLAQGNSEKPLIQTDVVELIAQLEELFRNSELPEAQTSKAI KHLEAAKEEVQEKEPDKDFAAKNLQKATKVLKEANEAVTAGTNIWEKVQPIITKLLPW LGVAASFFS" gene complement(12122..14689) /locus_tag="DP116_04475" CDS complement(12122..14689) /locus_tag="DP116_04475" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04475" /translation="MNESSPFNPQQHLNVSDGSSIKDSQLGGIAGQDLNVTQIQGSVV NLSIYDRIDGSKAVIKPKFLTRYEYRQRQALLRNIKKIWITDYLKKSLHSKALIELQL EARPDVVPGRITNTDEFPEEHSQPLSEKTSAIHLFKETEEGETLLILGEPGSGKTTIL LRIAQDLITQAEKNVGQFIPVVLNLSSWASERQSIGDWLVQELDSKYLVPKALSKKWI ENQDLILLLDGLDEVKANRREACVQAVNEFRKSHGLTQIVISSRIRDYEALSSRLQVQ SAVCLKSLTQEQIKQYLDRAGGQLEGVKTLLQEDEALHELAKSPLTLSVIALAYRGIP AEELPHYGSIEERRKRLFDAYIERMFRRKEVEKKYPNYGSIEERRKHLLDAYIERMFR RKGVEKKYPKAKVKHYLSWLAQELNRTSQSIFLIEHMQPSWLPSNQRRKYQWGNVITF IIATFFIFLFTNYIPQNISIISTGEQIHHLVLERIIKSIFIGLWIFGFNQKEIKTFET IKSPFQLIRILTVSKALMTTKNILCQSFLYSLIYGAYCATLSGLYVWTQPSLIKPSNP EFAWVIGIIIGLIVGPIIGFITGFLGHYSGISGLIFSLIYSVYFYFIEDPQKFIQNDK PETLFVRAALGYLIGIILGLLARFGKNKSIIFGIIVSLSYGLLYLDENLYADFSKNES NSVPTERRILEGLNDGLFPGMFTGLFVHWTEGMQGPEIETKIEPNQGIWKSASSAMFI GIIVTPVFGVIHGPIYLMAKKFNLFLLVLNALSFGAFIGLVSGGGQACIRHFTLRFLL WRKGNIPWNYVRFLDYATDLIFLQKVGGGYIFIHRMLMEHFAQMSGQEATQLTTEEG" gene 15466..15720 /locus_tag="DP116_04480" CDS 15466..15720 /locus_tag="DP116_04480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04480" /translation="MEREAVTIRFPSDLLAKARSLKEGNESLNDLVVEAVEQEVRRRR GWAAHQRIIARSEAVKAKTGIQSASTELIRSLRESEDRCD" gene 15713..16138 /locus_tag="DP116_04485" CDS 15713..16138 /locus_tag="DP116_04485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138441.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PIN domain nuclease" /protein_id="PRJNA477356:DP116_04485" /translation="MTRVLCLDTSVWIPYLVPETYQLQARTLVTEALSLNLRLVAPAF TWAEVGSVLRKKTRMKVITTEEAQGFFQDFCELPIDYIEEEVIRVKAWEIAEQYVLPT LYDAAFLACAESVSAEFWTADVTLIRQLTPQPTYLRELC" gene complement(16204..18270) /locus_tag="DP116_04490" CDS complement(16204..18270) /locus_tag="DP116_04490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S9 family peptidase" /protein_id="PRJNA477356:DP116_04490" /translation="MSEKKHLSYPVSPKNNQVDDYHGTSVADPYRPLENPDSVETKAW VEAQNKVTFGYLQEIPAREKIKQRLTKLWDYEKYSLPFKEGKRYFYFKNNGLQNQSVL YTLTSLDAESKVLLDPNTLSEDGTVALSGTVISEDGNFLAYGLSTAGSDWQEWKVRDI ETGIDLKDHLKWIKFSGASWTNDSKGFFYSRYDEPNEKTKLEDVNYYQKLYYHKLGTP QSEDLLIYNRLDQKEWGFNGDVTEDGRYLIISVWQGTDPRNLVFYKDLTNPKAKVVEL IQEFESSFGFIDNDDHIFYFRTDFNAPRGRVIAIDTKNPARQNWKEIIPQATETLESV STINNQFVADYLQDAHTQIKIFDLKGKFIREVKLPGLGSAGGFHGKRHDTETFYSFTS FTTPGTVYRYDMVSGKSTVFRQPKVAFNPEDYETKQVFYKSNDGTKVPMFLTYKKGIK LDGNNPTYLYGYGGFNISLSPNFSVSNLVWMEMGGVYALANLRGGGEYGEEWHQAGMK SKKQNVFDDFIAAAEWLIANNYTKPAKLAIGGGSNGGLLVGACITQRPDLFGAALPAV GVMDMLRFHKFTIGWAWTSEYGSPDNQEDFKTLYAYSPLHKLKSGTAYPATMITTADH DDRVVPAHSFKFAAALQATHNGDAPVLIRIETKAGHGAGKPTAKIIEEIADKWAFLVR TLDMKV" gene complement(18450..23033) /locus_tag="DP116_04495" CDS complement(18450..23033) /locus_tag="DP116_04495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454482.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04495" /translation="MLPEQQQRILGYFIEEAKDHLDTIEKGLLNLQSTLDNPEMINEV FRAAHSIKGGAAMLGLSSIQHTAHRLEDYFKVLKEHPIQVDQKLESLFLGVSDTLKAL LESCTEPYGLTDEMAQTLMSETEPVFKWLQEHLNLLIEQNRNNITKNTSILAKASPAT SPQTVAQTSESWQNLQTVQSEVIQILREMLQLFKQTATSQTRQSLVKCCDKLAELGEE LNWSNWCGLCRTAASAIAFSQNTYLTLAKTIITDIKQALELVIALREAEITISQQLQA LVRVEETIELLEIPLILADESDSPTEGLELPPVTSSSTINLGTGKKIPQTISDATSTT VLDIDRQDRITSLSELCEQFNSNEHAALTHTTDTNNQEVRIANLNTLADFFEGESPDL EGMWEQEEILDINPEATLKRDVSNANTEETDNKNEILIDEDTISTQQKAAETEEFTLA LSDDLLEDKLESALHKTSGLTRELSIDTLASVEQLFDAGEKRQDQVTSLLELLLDENQ ALPTGERNQIQTETIVNVELSKNQEISYHDFSLSTKAKLMEEIDYPKSIKSKKISHKD EILTLEELFIETEEDKTIALSANNSTFGDLLNAQFETEDLNKIWELELKGEKDKFSPA HQQDVATLEEILLTTAAEDFFDNVAQSGNSSLDSLSFEDLELNFPIDEQEFNLLFSSE TDDDWFQDLTSNLINTSSLDTTIEPQSTSYYRVASKTEKSVFSQQPEAVEFSPEFTIL PLETDNKQDLLNNSLFTQQPQDINLMVNECSQEQSGMRHTEAETISPTDLDGEHSLEL YSAFDFKENLFLTEAVSSEEELTQEIDIFLNDELTELNELLNQEVVSEVNAELRPEER TAQEKLLGVAANQVTSDRGKKDLPPRPFVYRTTKFEQLIKVPVKHLDDLSNLVGELVV NRNTLEQDQQRLRQFLDNLLHQMQQLSNVGARMEELYERSLLDASVNPNRRNHREDHN KDIDRGLTELEMDQFTPFHTLSQEMIELIVRVREAASDIDFVTEDTERVARQFRQVTN QLQSGIMRSRMEPFAEVTTPLERGVRESAIKCGKQAQLVIEGRETLIDKVILEHLKTP LTHLLNNAIAHGIETPDIRQAIGKPPVGVITVRAFHQGNQTVISISDDGAGIDIEAVK AKAIKIGIITPEQAKSLSQHHVYDLLFQPGFSTKEKEDELAGRGIGLDVVLARISEMR GKININSTLGGGTTFTIRLPLTLSICKALCCISDKARIAFPMDGVEDTLDVPVKNIQQ GSDGQKYIPWRNTLLPFQPLKEILTINRQLSRGSIYGGHRDDDMICVVVVRSANVFLA LQVDQVLNEQEIVIKQFEGPFPKPIGVAGATILGDGRIMPIADVIEIIDIFQGRASKY RSSSLWEQQQTPTVQETTAGKINPTVLIVDDSITVRELLSLTFSKAGYRVEQARDGQE AWDKLRSDLPCNIVFCDIEMPRCDGLELLSRIQKDPNLKHLPIAMLTSRGAQKHRQMA MKLGASGYFTKPYLEEVLLEAASRMLKGEKLVD" gene complement(23137..25611) /locus_tag="DP116_04500" CDS complement(23137..25611) /locus_tag="DP116_04500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454481.1" /note="Derived by automated computational 
analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methyl-accepting chemotaxis protein" /protein_id="PRJNA477356:DP116_04500" /translation="MKWRQKKVVSIKEYEHIYHQAHKAYVQGNYKQAATLIDQLVQHL PYDPNISLLRGHIYYVLQQCDVAKIEYQKVLKFTDDQEIISLARNGIDNIKQYQRKLD AQDIEKSGQKYKNFSDSSESQTVIKQQLENLADDQDFNSNSSSFHLTSLDEDQKAVDY MEELPVSSPFDLSIDDSTVSDTTFNSKENFSKAPSTFIQQQVWQLSNVNDEEAKAENT NTCKGNKKKCLDDFDKFDDSGYIPRFDLTEDSRFEEPQTLKISVENNTSRRSKVETTP TQQNHDFTSDTTPRDDSGFLPPQTDDWASRSTLTNHRDSELFITTGSQQAVPVLIGAD LCSRKPQVSVKQGFLAPLENASLQTKQWIVAGTVGVVSALVVAGVSFVYAKLLPLEQR ELVQNTGRVMTLASGIAGFATAGIMGSLSHRQICRTAKNLQSQFDAVREGNFHVQATV YSKDEFGQLAASFNQMTRVILTTTNEATRKAQEQEEARESLQRQVIHLLDDVEGAARG DLTVQAEVTADVLGAIADAFNFTIQNIRDLVQQVKIAAQEVTRGATNSETFARALSRD ALRQAQKLAVTLNSVQVMTDSIQRVAQAAREAEAVTHDASNIALLGGEAVDNTLAGIL EIRETVAHTTRKVKRLAESSQEISKIVALISQIASRTNLLALNASIEAARAGEAGRGF AIVADEVRQLADKSAKSLKEIEQIVMQIQSETGSVMIAMEEGIQQVINGTKLAEEAKR SLENIIQVAKRIDILVRGITSDTVEQTKTFRGVAEVMQSVELTAQDTSREAQRVSGAL HSLVSVSGELIASVERFQVEISENTR" gene complement(25755..26291) /locus_tag="DP116_04505" CDS complement(25755..26291) /locus_tag="DP116_04505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995102.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chemotaxis protein CheW" /protein_id="PRJNA477356:DP116_04505" /translation="MITQPDFLGGGGQDHSGSELQVKSPEAELCLRFYIPLHQEFALL ATDIREVIELSPDRITPIPNTSGLLLGALNLRGRVIWVVDLGQFLGQGTTLNTNRSEI PVIAIEEQDTIVGLAVEEIGGMDWLDKKHLTVLKSVSDTMAPFLQGEWILENKKNQCL RLLDHKAILRSARLLGKN" gene complement(26305..26670) /locus_tag="DP116_04510" CDS complement(26305..26670) /locus_tag="DP116_04510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017290869.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_04510" /translation="MSTVLIVEDSVTQREMITDLLKASGLTVTHASDGIEALEAIQTA CPDLVVLDIVMPRMNGYEVCRRLKSNPKTQNLPVVICSSKGEEVDLYWGIKQGADAYI AKPFQPTELVGTVKQLLRG" gene 27216..27977 /locus_tag="DP116_04515" /pseudo CDS 27216..27977 /locus_tag="DP116_04515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454477.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(27992..29035) /gene="tilS" /locus_tag="DP116_04520" CDS complement(27992..29035) /gene="tilS" /locus_tag="DP116_04520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA lysidine(34) synthetase TilS" /protein_id="PRJNA477356:DP116_04520" /translation="MSDKRSLLPKADRTPWTPLHAKLHRTIRSRHLFERNQRLLIAVS GGQDSLCLIKILLDLQPKWGWYLSIAHCDHCWRSDSQANAKHVENLAKNWSVSFYLRT ANEPLKSEAAARNWRYQALSAIAQENQFHCIVTGHTASDRAETLLYNLIRGTGADGLQ ALTWQRLLDNSILLVRPLLEITRMQTGQFCQDFLVPVWEDSTNQDLKFARNRIRQNVL PYLQENFNPQVESVLAQTAEILQAEVEYLEQAAYQLRKEAMVTSTGEEGNLYNPYSIK LNRRVLQKAPLALQRRVMRQVLQEIIQSGCTFEQIEKLIALISAPNRSQSDPFPGGAI AQVQGDWIYLKQL" gene 29308..32136 /locus_tag="DP116_04525" CDS 29308..32136 /locus_tag="DP116_04525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745599.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_04525" /translation="MGFFSKLRPFILIVFFISFLLSGVLSGGYFSNAATPSAVTPVSS FALTQGVRKTVLKNGLTVLTKEVHTAPVVSVQVWYRVGSRNEKAGENGISHQLEHLMF KGTTDRPVQFGRLFSALGSQFNAFTSYDETAYFGTVQRDKLEALVTLEADRMESALVG AEQLTSEKRVVISELQGYENSPGYRLNRAVMRAAFPKRAYGLPVGGTKADVQQFTLEQ VRNYYNTYYSPDNATLVITGDFATEPVLKTVQKTFGKLPKRAKQDNQTKGNSSKTGSP SSNNTSTPKTTPSPSSNNTSTAKKSPIVLKEPGSAALLQVVYPLPNLTHPDVPAIDLM DVILTGGRSSQLYQALVESGLASSVGASPSELIEPGWYEINVTAAPGQQLSKIALVLE QSLTKLQQKQVTAEELNRAKTQLQASFVLGNQDITSQATQLGYNQTVAGDYRYVEGYL KTIAKVTAADVQRVAKTYLNPAKQTIGFFEPTLPGGKPGSSSGGSNRTVENFSPGKPV DPAELAKYLPPATSATASTQQPLPEQFILKNGLRVFLLPDHSVPTVNLSGQIDAGAEF DTNQKAGLASFVASNLINGTQTKNALTLAKTLEDKGVSLGFSASREGVGISGNGLSAN LPILIQTLADVVQDATFPDKQLELTRQRALTSLKVQLDDPRGLGRRVFQQAIYPENHP FHSFPTEESLKSVTRADVLRFYEEHYRPDTTTIALVGDFDPNQVKALLNKAFGKWQTQ GKPPTLNLPQVSLPQTMKQLSSVIPGKTEAVTYIGYNGISRKDSRFYSALVLNQILGG DTLSSRLGTEVRDRQGLTYGIYSGFATGVNPGPFLIQMQTAPGDAQKAITSTLALLKQ LREQGITEAELNTAKRSITNNYPVELANPGNVAGMILENAVYGLSQAEIREFPKRIEA VTPVQVQQTIQELIHPDKLVIVTAGPGA" gene complement(32219..32291) /locus_tag="DP116_04530" tRNA complement(32219..32291) /locus_tag="DP116_04530" /product="tRNA-Arg" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(32256..32258),aa:Arg,seq:tct) gene 32628..33014 /locus_tag="DP116_04535" CDS 32628..33014 /locus_tag="DP116_04535" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04535" /translation="MFFTQGIVPGLLSWSLLALLHSLVFSGRISNSLLLLKRAEWSVV GTHIFEAIAYLAAGVLCLRNWRSPQIPSGYSVWLVISIAMLLYFQRRDICQLYRNSLA RRTYRFPCRSVFCDHLFFSGDGHDAL" gene 33180..34325 /locus_tag="DP116_04540" CDS 33180..34325 /locus_tag="DP116_04540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195672.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_04540" /translation="MSTVRKLTEPDKHEILKLYRETAETTSTLAERYDVSNSTISRLL KSTLPEDEYEYLVSLKRAARTPEGRAQVNYDNLPAFTNQPQEDQTQKQEVLSPVVEQN VVETQPQRVSPAQRQIPKPKDSPAPQVSPLRLGAVAPGGNPQDLGDYPQEEPVVAQNL HFVELDEPTHPSRRLKRRSSAPTKPILPIQQARSEQPVAEQLELLEQKPPEITSIPSP LLEDTHPNANVIAEMFGEDLLDESDDLEDLDDDDDDDYDEEDFEPAAPLVTRPRSGDA LVKVLPLSAAALPKTCYLVIDRSSELITRPLREFGDLGQIPSLETQQRTLPVFDNHRI AKRFSTKRDRVIKIPDTKMLHKARYHLQAKGITRLLIDGQVYSLSSV" gene 34662..35129 /locus_tag="DP116_04545" CDS 34662..35129 /locus_tag="DP116_04545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318768.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_04545" /translation="MVHKPNKSQDLESWQQVRAPYGLGYRIKLLSQLLSRKLTERLEP FGLTPFHWVVLCCLWEEDGLPTSSIGEKLQQVGGTLTGVLDRMEERGLIRRERDCRDR RIWRIWLTDAGKELETVLPAIAVEIREQAMHGISYAEREQFSQLLNQAIDNLS" gene 35277..36797 /locus_tag="DP116_04550" CDS 35277..36797 /locus_tag="DP116_04550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862687.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_04550" /translation="MERTNNTNSNGRNGHKTAVLEKELVKDIETLEALTSENLTNKMP AETLVKADLHTQPETADKEAKPQPPETPKKKKPIALILTALGVGAVAAGGFGYHWWQY ASTHQETDNAQVAGHLHQVSARIPGTISQVLVNDNQEVQPGQLLVTLDPRDYQSKVQQ TEAALQNARRQAQAAQANINLASKTTSAKTVQAQGDVSGAVAAISTAQAAVQEAEAGI PAAQAEVKQAEAGIPAAQAQVAQANANLQKAQADYNRYNTLSQQGAIPRQQLDTSKSA YDVAVAQKDAAIQGVNQAQARLAAARVGVAKAQSQLAQAQEGVVSAQAKLAASKGGLQ QATAGGEQTTVNRNQYEAAKAAIAQADASLKDAQLQLSYTNITAPAAGRVGRKTVEVG NRVQAGTPLMAIVNDDYWVVANFKETQLEDMKPGEEVEIKLDAFPHHTFKGRVDSISP ASGAQFALLPPDNATGNFTKIVQRVPVKIVFDKESIKGYESRITPGMSAEISVKVK" gene 36945..38222 /locus_tag="DP116_04555" CDS 36945..38222 /locus_tag="DP116_04555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315532.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_04555" /translation="MNAQATRKSPLLPAFRSRNYRLFFAGQGISLIGSWMTQLATIWL VYHLTNNPFMLGVVGFTSQIPSFFLTPLGGVFVDRFSRHRILIGTQILAMIQSLALAV LTFTGMIQIWHIIALSLLQGFINAFDAPARQAFVTELVERRDDVANVIAINSTMFNGA RLIGPAIAGLLIARVGAAYCFLIDGLSYIAVIIALLAMKFKPWKTTVTGGNPLQNFKE GFVYAFGFPPIRAILLLTAFFSFFGMQYTVIVPIFAEEILKGSAETLGFLMAASGVGA LASGIHLATRKTVMGLGKVIVLGLAIAGIALIAFSLSRLLPLSLLAMLFVGLGVILVI AGSNTVLQTIIEEEKRGRVLSLYTMSFLGMIPFGNLAAGALAHQIGAPYTLIIDGIAC IFGSIYFAKQLPELRKMVLAIYEQKGILATAKP" gene 38412..39911 /locus_tag="DP116_04560" CDS 38412..39911 /locus_tag="DP116_04560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490922.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="EmrB/QacA family drug resistance transporter" /protein_id="PRJNA477356:DP116_04560" /translation="MLGAFMAVLDIQITNSSLQDIQASLGATLEEGSWISTAYLVAEI VVIPLTGWLSRVFSLRRYLLVNTALFIFFSVCCAWSWDLNSMIVFRALQGFTGGVLIP TAMTVVLTTLPPSKQAIGLAAFAITAVFAPSIGPTFGGWLTENFSWHYSFYINVVPGV LMLAGVWYGIKQERPQLQLLKQGDWWGIISMAIGLASLQVVLEEGSRKDWFGSALIVR LSILAVIFLTIFFWIELTRKQPFINLRLVRYRNFGLASIINVSLGVGLYGSIYILPLY LAQIQGYNALQIGQVLIWAGIPQLFIIPFIPKAMQRIDVRLMVAVGVALFAVSAFMNS KMTYQTGYDQLIWSQLVRAMGQPLIMVPLTSIATSGLSPKEAGSASGLFNMMRNMGGS MGIAALATLLTNREQFHSNRLGESVSLYNPATQERINQMTQYFVSRGSDLSTAQDQAI KAIDNIVRREAFVNAFNDCFYFIAIALLLSGLAVLFIKKVKVTGGAVAH" gene 40373..40876 /locus_tag="DP116_04565" CDS 40373..40876 /locus_tag="DP116_04565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315372.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04565" /translation="MNTTTNLERKKASLPVAVFLSALLTLPVLSGCGGGSRTAAPLPP VDDTAGRNVGYPQSPNQPQQTKRGLTTGQKVAITLVGAAALYYLYNQRKNARGNGAQG KYYLSKNGRVYYRDDQGRPHWVTPPSEGIRVPESQAQQYRDFKGYNGRTTGRDLTDIA PQEAPTY" gene 40972..41280 /locus_tag="DP116_04570" CDS 40972..41280 /locus_tag="DP116_04570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="translation initiation factor" /protein_id="PRJNA477356:DP116_04570" /translation="MVDDFIRKAKDFLGGGSNEDDREVRPASEDPYGDPADQDYYSNA IPASQDPYGDPADQNYYGNAIPASQDPYGDPANDYGQFGNVRPASEDPYGDPADEDYR " BASE COUNT 11669 a 8847 c 8710 g 12103 t ORIGIN 1 acggagggag accctcctac agtactggct ccccaacgca ctggctcccc ttacccccct 61 acacccatag ctttagtcac aaagactact gcactaacgt tttcatttcc tgtacgaaac 121 tcgtgtagac ttgcccctgc aaaaactgtg gatgttccat gattttttga tgaaacccaa 181 tcgttgtggg taatcccgtg atggcacatt ctcgcagtgc tcgtttcata cggttaatag 241 cagtgggacg gtctactccc caaactatta acttgccaat gagggagtcg tagtaaggag 301 gaatttggta atcagtgtaa acatgggagt caatccgaac tccgggacct ccaggaggga 361 gataaccact gatacgtccg gcggagggac gaaagtcatg atcgggatct tcggcattaa 421 tacgacattc gatcgcatga ccccgtaaaa cgacttggtc ttgtgtcagc tggagtcttt 481 ccccttgagc aatacggatt tgttcaacaa ccaaatctat cccagtaatc atctccgtga 541 ctggatgctc aacttgaatc cgggtgttca tttccataaa gtagaattta ccggatttat 601 ccaagagaaa ctcaatcgta cctgccccac tgtagttaat aaactcagca gctttgacag 661 cagcttgtcc cattttctct cgcagatccg gatcgagagc gggactggga gcttcttcta 721 aaagcttctg gttacggcgt tgaattgagc aatcccgttc acccaagtgg atgacattac 781 catagttatc agccaagatt tgaaattcaa tatgacgggg acgctcaata aatttttcca 841 gataaactcc ggaattgcca aatgctgctc ccgcttcgcc ttgggctgct tggtaggatt 901 tgacaaattc gctttcataa tgtaccaagc gcataccccg tccgccaccg cctgctgtgg 961 ctttaatcat cactgggtag ccaattttgt tggcgatcgc caagccctct tgctccgact 1021 ctatcaaccc atcactacct ggtactgtgg gtactccagc tttttgcatc gtttctttag 1081 ccgtggattt atcccccatc agccggatag cttccggaga tggaccaatg aaagcaatat 1141 gatggtctgc acagatttct gcaaaccgag cattttcagc taaaaaccca taaccagggt 1201 gaatagcgct ggcattgtgg atcaatgccg ctgcaataat attgggaata ttcaaataac 1261 ttttactgct agcagcttcg ccaatgcaaa ccgcttcatc agcaagttgg acgtgcagag 1321 cattacggtc aacagtggag tgtaccgcaa ccgtggcaat ccccatttct tcacaggcgc 1381 ggagaatgcg aagcgcgatt tctccccgat tagcaattaa tattttgtca aacttcatct 1441 tttagctttt gatatcagtc 
cagtagattg acattgttct gccgtcaaaa ttgaaattac 1501 gcagagcctc atttcaaatt ctgactggag attcccaaaa ttccttccaa atataaggaa 1561 agatttcgct gcttgcctct acgggtatta actacactag cagcagcgat gttgccgagt 1621 atgagttcat tttcccgact ttggcataat aagcgcgtac cgtgcgcttc accgtagaac 1681 caattttcca gcaactgcgg tcgcgatacc tttaatatca cttgtgctgc catgacattt 1741 ctgtccgtcg agtatatcca tctgttgata gttgtgaatt cttgcactta aattttctgt 1801 aacttgggta tcactgatca taaatatttc tgttatgcta tactataaat ctagggaaac 1861 cttgcggatg tggcggaatt ggtatacgcg cacgcttgag gtgcgtgtgg ctttgccttg 1921 cgagttcgag tctcgccatc cgcatatttt agtatgtagt gccaagttat gggtgctgaa 1981 gtgtcaactt tcagtcaatg gaagaatgac tcagcactca taacttaagg ttcgttcgct 2041 attaagtctt cacttgtgct ttgatataaa taagtgaagt tttgtaaaga accttgactg 2101 cgacttttag gccgacaaac aacttaacat tacctgtggg atggcaccgc ctcagtgcga 2161 cttggcaagg tggcgaagaa gtcattcaac aaagtttgcc ccacactcag ctagcaccag 2221 cttggcaatt gctccttttg ggtgatggct ctccaacgag acacttacaa ttgctaacgg 2281 gcgaaccgac agaggtggat gtcatcgata tgtcattaat tggcatggac ttggatggtg 2341 caccagaact tatcaaagct gtcccaggac caaggctacg gcgacaggtg tggctgcgta 2401 ccgcctctgg tcaacgattg gcatatgcca cctcatggtg ggaagcttct cacgtagatg 2461 agtatttgca aaaccgttca ttaccgattt gggcgagttt agctcgtctg cgtacggaat 2521 tgtatcgcga tgtccaaggg atttactacg gtgactcaga tgctttgcag tcaggttttg 2581 atgaaaccgg acctttttgg ggtcgccact acttgttttg gcatcacgga cagcctttaa 2641 cgttaattta cgaggtgttt tcgccttact tgacgaaata tttaggaccg acgcagctaa 2701 gttctattaa tgctgaggtt tagtggcaaa ttttgctgtg tagcaaaaag tttttctttt 2761 ttgctacaca acgaagattt tccatccatc tgtgctagct tgcctatgca gctttcttca 2821 gcatcacaat cccaattcct ctggaaagcc cagcatgatg tttaagtttt gtacggcggc 2881 tcctgatgcg cctttgccta aattatcgag gcgagcaact aagagggctt cgtgggttgt 2941 atcattggcg aagacgaaaa tttgaacaat attggtgcca ttcattgcca tcacgtctag 3001 gaatttaccg tctcgcagaa gagtggagtc ttggaagggg gcgatttgta caaatttttc 3061 gccttggtag tagtcggcga tcgcctgatg aataacctca ccagatggtg gattctccaa 3121 agtccacagt ggtaaaggta tctgtaccag 
catcccctgc tcaaaatccc ccactgctgg 3181 tacaaacagt ggcggcgatg ctaatcctga atacttatgc atttccttaa cgtgcttgtg 3241 cccaaactgt gtactgtaga tgccataggg atagagcgat gttgctccgt cttgctgatc 3301 gtggaaagcg tcatagtctt tgatcagatt ctttccgcca ccagagtatc ctgataccgc 3361 attaatggta atgggaaagt gactagggag gagtcccttg gctatcaacg gacggataca 3421 agctagaaat cctgtgggat aacagcccgg aacactaaca aactgggcgc tggcaatttt 3481 ctctctttgt cccgaactca gttcaggaaa gccataaacc caaccctcag ccgttcgatg 3541 ggcagtacta gcatcgagga ttttaacctt agtactgcgg acaaagctaa cagcttcgcg 3601 ggctgcatca tcaggtagac aaagaatggc aacatcgaca gcattgatca gtttcgctcg 3661 ctcagctgaa tccctgcgtc gagatggttc aatgttaact aactcgatgt catctcgttg 3721 gttgaggcgc gaataaattt gtaagcctgt ggttcccgat tccccatcaa tgaaaatctt 3781 aggtttagta atcatgtgct cagcatcctg atccgtaaca ggttttttgt cccaccaata 3841 cttaacttag tctgttgtgg atcaagtcat caataaagga aaggaaacct tttggttaat 3901 attaccaaca agcaattatc atgtcacaag aatgagttgc gcttacgact ctgagccagt 3961 catgatgcgc cgactggccc agagtcatct gccttcgtgt ctaagtgccc gatccgttcc 4021 ctcagcgccc gatttcgaca ccttcaagaa tgccgagtgc gtcggggatg agtacggcag 4081 cagagtagta agcacttact aagtaggaaa tgatcgcctt ttcgttgatc cccataaagc 4141 gcacagacaa actgggttgg tattcatcag ggatacccgt tttgtgtaga cctaccaccc 4201 cttggttttc ttctccaaca cgagtaccaa aatagagcta gtaccctcct tggtgatagg 4261 aattttgtta caaggaaaaa tcggatagtt gcgccaagca ggtacaatct ttccgttgac 4321 ctccctgctt ggggaaccga ctccaagacg gttacactct cggccaaaag cagcgatcgt 4381 gcgagggtga tgtacagtga cagattattt gtcattggtg acagattagt gccgtcgtaa 4441 aaaatggttg tcattaaaaa gcagagtgtg gttgaaagcc aggggagatt ctcttatctc 4501 ctgtttctca tctgtctcaa gggcattgag ccgagaactc tgcatacaac ggaaatagag 4561 gctcttaaag cttattctat atagatttca gctttttact tctctgtttt aaccaaacat 4621 caggaaggct acaagcagac atattttagt atatagttac taatcaaata cagtcagaac 4681 tcaaatgcca acttatgacc ctaagttggc aaaaccctca aagttgttga cgctgcttac 4741 agccacgaga gattaatact ttgggctacc tatttcacga aatacctcaa acgtagaatt 4801 gctaccctta actcggtttt ctgcgcgaat gctactcata 
ctccatcgta ggttaaaaat 4861 atcgcgagtc gttgcgacct ctaccacttg gagcgactac cgaaaaaatc aagaacctat 4921 cccactaata aaattctttg aacccaaata tggatggtag caaggtcaat gaatgcatca 4981 aaatatactt ttcgacgttc acgggcgtac tactaggcga cggtatttac gttgaaacca 5041 agcgaaacac cgctcttgct ggaatctagg aacagagatt ttgatgggtc tgcctttatt 5101 tttcctggtt ttccagactc gttttggtat ttcaggtcta atgcctcgtt tacgtaggct 5161 tgcgcgttgc tgtttggaat catacccttt atcagcagcg agtaccttaa gtcgtttacg 5221 tggtctaccc cgcttttaag tttttaattt tacttgatat agtaaaggta ttacttgctc 5281 cctttcgctt ccattggcag gggtagttct gttggacaag ggcattccat taccttcggt 5341 aagcgtgtgg ataagaatcc ctttcccttt atttccatac gcaacttctt cacttcctcc 5401 cgatcctgcg ggaaaaagag ccatcaacgg ctccaaactc ccaatttatc aatcctttct 5461 cgtctgcgat tgctagtaaa cgagcctgca aatgttcaaa tgttccatca gagcgccatc 5521 gctttaacca tcggtgagat gagcttttcg atgcccagat tcccccctgt ggaatatcac 5581 accattgaca cccagtaaca aggatatata gcagactatt tcatacataa cgatacggtg 5641 aatgaggcat tcctttatga cgcttagatg gttcaggagg gaatatatcc tcaaatagtc 5701 tccattccag gtcactcaat ccttcaaaac gtcctgccat aaagtgatac gcccctttta 5761 tttaagaagt agattatcac gtttgggtat tagtgggata atttaaaaaa attgtgcata 5821 agttcatttt gacagcacct atatatatag tttagttaaa aaataattca atattacctc 5881 tatctttaga tatatttttg atttttatcc acaaaaaagt caccagttat atagccagtt 5941 atgcaaaaat acttcttagt aaattacaaa gttcgcaagc aagtctaaaa atgggttttg 6001 ctgaattaat gtatggattt tgtggaaatt acaatttttt acaggcgtga catgtcatgt 6061 ctgtacaagt gtttggaaaa attacaaacc cgtaagtcat tattaattca gtaacacctt 6121 ctcatggatg cttgcactct aagtaataga actatactca tcaatgaggt ataaaccatg 6181 aagaaaaaca tgattcagtc tgtgttagtt tttgctttat gcagctttat gttgcttgtt 6241 tcaggattta ccgcagtaca agatagtttg gctagtggtc ctttacaaga gcaatgtttt 6301 tggatggaaa atatttcagg gcaattttat tgggttcctg ctcctcaagg aaaaatatcg 6361 aagcaacagt gttatcaaca aaatagttgt ggtgcaggag gaggtcaatc tggtggcggt 6421 tgttataaat gggcaatatc tgctcaggca cccgcactgc cttggaatta atattctccc 6481 aactctgaca aacccttcgg gtaaggttat ttctgattaa attcttaatc 
agaacaacaa 6541 cagcgtagct gatattaagt tttatcacta cgctgttttt tgcatttgat gtcaaatttc 6601 tctattatgt agcgacagct attttgtcat ttatctggta aatgacattt attttgtcat 6661 taggaagaat attgggtcga gtatcgaatg gtttttaaaa gctttgctac acagggattt 6721 ttcgacggca acaatttgtc atgatgatag ataattagtc actgtacact aattcttctt 6781 aaatcagaca tactgatttt aatactgagt gaccaaatat atcatttttt acctggtgcg 6841 agatgtgagt taagcagaca gttaagacag ttaagacagt taagatagtt aagatagtta 6901 taaaacttta cataattggt tacacttaga acttaagcat tgacaaagta gacaaacaat 6961 gcatatttgt aaaaatgaaa gtggtaggcg attgcgcaga gcgcagacgc caagggcgta 7021 tagctaatca ctagaagcgt tggaagacgg ataacacgta tgttgcttta tctggggtag 7081 gaacactatg agagctacta ccataggact tacgcatgaa ttactaatga acttgactgt 7141 gagattgctt ccttacgtcg caacgggtgc aactacgtta ttgttgccta agtcttgtac 7201 caaaccatta accgattcgt gtgacttgaa cacttacctc tattcctact ggcagaatca 7261 tgcggatgct ggatgtgtac ttttgccgga gattatgtca ggggtttgtc gcatgatggt 7321 gtctaataga tttttgcgac acttgcagta taaacccaag aatttatttg atgaaatcaa 7381 acagcatatc aatgaagaaa aaaaagaaac cgtctctggt actaacgctc tcgtttgcag 7441 gatttttaat ttgtgcaggg agtgcagcgt actggctgtt aactcaggga aactcatcat 7501 ctaaagattt actaccaggt gcgaatatta ttccccaaga tgccctattt gcagtatctc 7561 taagcacaga tccaggacaa tggcagaaat tacgagagtt tggcacaaaa gaaactcaaa 7621 gcttattgga taaaaattta gtacagttgc gcgatcgctt cttaaccaat aatggttacg 7681 acttccaaaa agatatcagt ccttgggttg gcgatcaagt cacaatcgca gttctcgccc 7741 caaacgtcag taaaccagta tcaaagccga ttgcaactaa tgcagaagca acgaccaacg 7801 aacagtcaat ggtgatgatt ctgcctgtca aaaatctaga aaaagcaagc agcattttag 7861 cacaacccaa agccgctaaa ggtggtaaat ggattgaccg cacttacgaa aatattgtca 7921 ttaaagaaac tgaagggcaa gtgggagaaa aattgtcagc agccctatta gacaaacgtt 7981 ttctcgtcat caccgataat tccaaaacca cagaacgggc aattgacgca tacaaaggtc 8041 aatcatccct agcgacaagt ccgggttttg cagagaatgt gccaaaaatt tataattacc 8101 aaccctttgg tcaattctat gtgaatgtgc cttactctgc tagaatagcc gctaaatctc 8161 caaaccgtcc tttgcctgct caagttctga gtcaacttca gaataaccaa ggtatagcag 
8221 gaacgatgac tttggagtca caaggaattc ggttaaaaag tgtttcttgg ctaaatccga 8281 ccagcccgcg cgtgctaaca gtggaaaata aagcagggaa catgcaaaac cgcttgccaa 8341 cagaaacttt aatgatgctg tctggcggaa acttacagca gttttgggct gattatgtct 8401 caacttccca aggaaatccc tctgcaccag ttatgccaga acaactgcga aatggtgtca 8461 aatccctcac aaatcttgat ttagagcgag atttgctctc atggatggga cgagagtttt 8521 ccttctcagt gattccaaat attcccaaac aaggtatggc agacgatttt cgtgctgctt 8581 tggttttcat ggtacaggca agcgatcgca cccgtgctga aaccgcccta aaacagcttg 8641 atgacgtcat gaaaaatcaa taccagttcc agataaaata tacaacagtt gatggtaaac 8701 ctgttgtcaa ctggctcgga ccttttggaa ctttgacagc tagtcatggt tggttagatg 8761 atgatgtcgc tttcttggcg gttggtgcgc ctgtcactga taaaatagtt cctagaccaa 8821 ataacacttt aggcagtagt gggctattcc aggacacagt tccacgacaa cccaatccta 8881 caaatggtca gtttttcttg gatgtagaac atacggcaaa aaattttccc ctaccgatct 8941 tcttgcctga tcaacaaact ttgttacaag caacacgctc aataggagtc acaggtgctg 9001 tgagtgatag ccgcagtact cgctacgaca tttttctatc acttcaaaaa gctggaaaac 9061 cggatccttt accaaatccc acaaatcaat gatgaatcat gaaattttta gcattaatga 9121 ttcatgattc atcatttata actcctacct gccttaaaac agataggatc gtaacaatcg 9181 gaatcaatac tatattgact ggcgataaag ttaattgtca tcaacctctg ccttcaggct 9241 aggggcttgt ccgaaacaag tcggacaacg taagtgacga ctagctcatt gagacacagt 9301 ttggtacaca cttccgaata cttcacgcca catcctaact cctgcggaga cgctgcgcga 9361 acaagtcggc acagccgcaa gggcggagtg gccacccagt tcggattatc tgtaagactt 9421 cttgttaagt cgtggggtaa agcccagcca tcttaattgt gttgagcgag gggactaaaa 9481 caggtttact ggttcctggg attatatcca ttcaaaaacc agtccctttc acggggacgc 9541 gcattccccg cgcccttgtt ttctcccatg ccatcatggc ttgggtttgc acacatcccg 9601 cgaggtttta tgaatttggt tggactccag aactggctag acaatgcctc atttgcaatt 9661 ttatttgtga cgatgctagt ttattggggt ggggcggctt ttccaaatgt gcctgcaact 9721 gttgtaggta cggctggaat ggcgatcgct aatttgtgca tagcaacctt actaggtgca 9781 agatggctgg aggctgggta ttttcccctc agcaatttat atgaatcttt attcttctta 9841 acttggggaa taacgactgt ccatctgatt gccgaaaata caagtcgcag ccgcttagtg 9901 
ggagttgtga ctgcacctgt cgctatgtgt atcaccgctt ttgccaccct caccctacca 9961 tcgcaaatgc aagtcgcaga acctttagta cctgcgctga aatctaattg gctgatgatg 10021 catgtcagcg tgatgatgtt gagttatgca gctttgatgg tgggttcgct agtggcgatc 10081 gccttcctga ttgtcacacg tgctcaagaa atccaattac aagggagttc tgtcggtact 10141 ggtggatatc gcagtaacgg ctatcgcttg cataaagtca ctgacctaag tgctcaacct 10201 tcgacacttg ctgttgagaa taacggagtc acgcgtatag aaagcaataa caacggcaaa 10261 actgctgtgt tagatttagt cactgtcact cagtctcaag ctgtcgcagc agaacccctt 10321 tcacctcagc gtctgagcct agccgaaact ctagacaaca tcagctatcg tattattgga 10381 ttgggatttc ccctactgac tattggtatc attgctggtg ctgtttgggc gaacgaagct 10441 tggggatctt actggagttg ggatccaaag gaaacctggg cgctggttac ctggttagtt 10501 tttgctgctt atcttcacgc tcgtatcact cgcggttggc aagggcgtcg tcccgcaatt 10561 ttagccgcta ctggctttgt tgttgtttgg atttgctatt taggggttaa tttgttaggt 10621 aagggtttac actcttacgg ttggtttttg taattacttt aatctgctac tccctcttta 10681 cttgcgaaga gggggtagtt tagcaaattt aaaatctgcc atgaaatgct tttgataaaa 10741 actcttaaat gtagatttac tagtgaaaat actcgtttac tgtaaatact tcaatttcag 10801 aaatcctcta agataaatac gatcattgtg cgtaatctat gaaaaggatt cttacacttt 10861 agatgcgttt gccctatata tttgtataag tactatttct atacagccgc cattttctgt 10921 aaaagctaat gatgtttgct tttctatggt tagaagtata aaaaagattt tggttttatc 10981 agccaatcct atcaatacaa ataaactgcg tctagatgag gaagtgaggg aaatccaatc 11041 tgcattggag cgttccagac acagggaaga gtttgaactc atttctagat tagctgtgag 11101 gattgacgat ttacgccgag cattgttaga ccacgagcca caagttgtac atttttctgg 11161 tcatggggat ggaacggacg gcatagcatt ggaggataac ttcggttacg tacaattggt 11221 tagtacagaa tcgttgagca atttattcaa gttatttaaa gacacagttg aatgtgtgtt 11281 gctcaatgct tgttattcag aaactcaagc agaagcaatt taccagcaca tcaattgtgt 11341 tattgggatg aaacgtgcca tcacagataa agctgctatt catttttcta aaggatttta 11401 tgataccctt ggtgctggga gaagctataa agatgctttt gatttaggtt gtaacaatat 11461 tgaccttaat agcataccag aatttttaac tccaaaaatt caaatccgag ataactttaa 11521 aactctattt tttaaaaaac aatcaactac taaattgatc aaacaaaaac 
cttctcaatc 11581 cttcaccatc agtgggggtc agctttctaa tgtccaaatt ggaggtcaag cgggtcgcga 11641 tatggatgtg acccaaaatc agcttcttgc tcaaggtaat tctgaaaagc cgctgatcca 11701 aacagatgta gtcgaattaa ttgcccaact tgaagaactg tttcgcaact cagaactacc 11761 agaggcgcaa acttcaaaag caattaagca cttagaagct gctaaagaag aggttcagga 11821 aaaggaacct gacaaggatt ttgcagctaa aaatttgcaa aaagctacaa aagttctcaa 11881 agaagcgaat gaagcagtga cagcaggaac aaatatctgg gaaaaagttc aacccattat 11941 tactaagctt ttgccttggc tgggtgtagc ggcaagcttt tttagctgaa agggtcttgt 12001 ttatgagcga acacactccg aatgaccaca tcaaaatctt aatattttag atggcgcttc 12061 ccttcagaat ggtcaaattg caggaatagc tggtcgtgac ctgaaggtaa atcaaatatt 12121 gttaaccttc ctcagtagtc aattgcgttg cttcttgacc actcatctgg gcaaaatgct 12181 ccatcagcat tcgatgaatg aagatgtagc cgccccccac tttttgtagg aagatgagat 12241 cagtagcata gtcaagaaag cgaacataat tccaaggaat gttgcccttg cgccaaagga 12301 gaaaacgtaa agtaaagtga cggatacaag cctgtccgcc tccactgact agtccaataa 12361 atgctccaaa actgagtgca ttaagtacta aaaggaacaa attaaatttt tttgccatca 12421 aataaattgg tccgtgaatt actccgaaca ccggagtaac aattattcca ataaacatgg 12481 cactgctagc cgatttccaa atcccctgat ttggctctat tttcgtttca atttctggac 12541 cttgcatacc ctctgtccaa tgaacaaaca accctgtaaa cattcctggg aataatccat 12601 cgttcaaacc ttcaagtatt cgcctttcgg ttgggactga attcgattca tttttagaaa 12661 aatctgcata caaattttcg tcaaggtata gcaaaccata actcaaactt acaataatgc 12721 caaatataat acttttattt ttaccaaaac gggcaagtag cccaagtatt atacctatca 12781 aataacccaa agctgcccga acaaataaag tttcaggctt atcattttga atgaacttct 12841 gcggatcttc aatgaagtaa aagtaaacgc tatatatcag tgaaaaaatc agaccactta 12901 ttccagaata gtgtcctaaa aatccggtaa taaagccaat aataggtcct actattagac 12961 caataattat accaataacc caagcaaatt ccgggttaga tggtttaatt aatgaaggtt 13021 gagtccaaac atacaagcca gataaggttg cacaataagc tccataaatt agactatata 13081 aaaacgattg acacaaaata tttttcgtcg tcattaaagc ttttgaaact gttaatattc 13141 ttatcaattg aaaaggtgac ttgatagttt caaaggtctt tatttccttt tgattaaatc 13201 caaaaatcca taatccaata aagatagatt 
tgattattct ctctaaaaca agatgatgta 13261 tttgctcccc agtagaaata atactaatgt tttgaggtat gtagttcgtg aacagaaata 13321 taaaaaatgt tgctattata aaagttatta cattacccca ttggtatttt cttcgttgat 13381 tagaaggcaa ccaactaggt tgcatatgct caattagaaa tattgattga gaagttcgat 13441 taagttcttg agctaaccac gaaagatagt gttttacttt ggcttttgga tacttctttt 13501 ccactccctt tcgtctaaac atcctttcga tataggcatc aagtagatgt ttgcggcgtt 13561 cttctatcga gccgtaattt ggatacttct tttccacttc ctttcgtcta aacatccgtt 13621 cgatataggc atcaaataga cgtttgcggc gttcttctat cgagccgtaa tggggtaact 13681 cttcagcagg tatacctcta tatgccaacg ctataacact aagcgttaga ggtgacttgg 13741 ctaactcatg taaggcttca tcttcttgaa gtaacgtctt cactccttct aattgaccac 13801 ctgccctatc taagtattgc ttaatttgtt cttgggtcag cgatttaagg catacagcag 13861 attgaacttg caatcgagaa gataaagctt catagtctcg gatgcgacta ctgatgacta 13921 tctgtgttag accatgagat tttctaaatt cattgacagc ttgaacacag gcttctcgac 13981 gatttgcttt gacctcatct agcccgtcca acagtaaaat caaatcttgg ttctcaatcc 14041 attttttact aagtgctttg ggaactaagt atttgctatc tagttcctga actagccaat 14101 ctccaatgct ttgccgttca cttgcccaag acgataagtt aagaactact ggaataaatt 14161 gacctacatt tttttcagct tgagtaatca aatcctgggc gatccttaag agtattgtcg 14221 tttttccaga accgggttct cccaaaatca gtaaagtttc tccctcttct gtttctttaa 14281 agagatggat tgcacttgtt ttctcagata aaggctgcga atgttcttca ggaaactcat 14341 cagtattagt aatacgacct ggcactacgt cgggtctggc ttctagttga agttcaatta 14401 gcgctttgga atgaagtgac ttttttaaat aatctgtaat ccagattttc ttaatgtttc 14461 ttagcagagc ttgacgttgc cgatattcat acctcgtgag aaattttggc tttataactg 14521 ctttagaccc atcgatccgg tcataaatac ttaaattaac cacgcttcct tgaatctgag 14581 tgacattcag atcttgtcct gcaatgcctc caagttgaga gtccttaatc gaactaccat 14641 cagagacatt gagatgctgt tgaggattaa agggagatga ttcattcata tattatcctg 14701 tgtgaacaaa aattttactc ttacccctaa cctatttact tagtaccaca cagtctctgt 14761 tctaaattct gtaatcgaag taaaaagatt atattataac ttacgcattg acaaaatttt 14821 gattatgtgg ttataccata gtaggcataa atcacctcaa taaagtcggg gttattacct 14881 tgtgaggaca 
taagccctcc ttctacttct tatttaggtg taagtgaaat caacttatgc 14941 acgctgcgga agatgaacgt gaattgtaga taagtagtcg gacaaaatta aattcattcg 15001 ttcaggcagg agacaggaga caggagacag gagacagttc agttaaggta atggtaggtt 15061 ggggagcgta agctccagga gggtttcccg acctaggcaa gtctggcgtt taagcgtagc 15121 gcaacccaac aataaacgtt ttgtgttggg tttcctgacg tcaaacgcca cgcgcctcaa 15181 gccggggaac ccgtccacgg cagtcgctca tggggggaac ccccaagacc gcgctgcctc 15241 accccggtac tgtcctcccc aacctacgca gtttaagggt tttggctcta actgaaccgt 15301 attggattaa aggatagcga tcgctcaata ctgttcggta ggagaaatcc tgcttttagc 15361 tttgatcgcg gaccagaaag tacggggttt tcagcatttt tcttataaac tcacttcaat 15421 tgcggttaca ataagtggta tcataatttg gttacaaaag ctgctatgga acgcgaagct 15481 gtaacaatcc gttttccatc tgatctactt gctaaagcca gaagtctcaa ggaaggcaat 15541 gaatctctaa acgatttagt cgttgaagct gtagagcagg aggtgcgacg tcggaggggt 15601 tgggcagccc atcaacggat tattgctcgc agcgaggcag tgaaagcgaa aactggtata 15661 caatcagcct ctacagagtt gatccgtagc ctcagagaga gcgaggatcg atgtgactag 15721 agtcttgtgc cttgatacca gcgtttggat accttacctt gtaccagaga cttatcagct 15781 tcaagctaga accttagtca cagaggcatt aagtcttaac ttgcgcttgg tggccccagc 15841 atttacttgg gcagaggtag gatctgtgct gcgaaagaaa actcggatga aagtcatcac 15901 aacagaggaa gcacagggtt tttttcaaga cttctgcgaa cttcccattg attacataga 15961 agaggaagtc attcgggtga aagcttggga aattgcagag caatatgtct tacctacttt 16021 atatgatgca gcatttctcg cttgcgctga gagtgtttct gccgagtttt ggacggctga 16081 cgtcacactt atcagacaac tcacacccca acccacctac ctcagagaac tttgttgaaa 16141 tttagtgcaa agacgcaaag aaataattct ttgcgttttt gcgtgtttgc atttttgtaa 16201 agactaaact ttcatatcca atgtacgcac caaaaaagcc cacttatctg caatttcctc 16261 aatgatcttt gctgtaggct taccagcacc atgtcccgct tttgtctcaa ttctaatcag 16321 tactggcgca tcaccattgt gagtagcttg caaagcagca gcaaatttga aactatgggc 16381 gggaacaacg cgatcgtcat gatcggctgt ggtaatcatt gttgctgggt atgctgtacc 16441 tgatttaagc ttgtgcaatg gcgaataagc atacagtgtt ttgaaatctt cttgattgtc 16501 tggcgagcca tattcagaag tccatgccca accaatagta aatttatgga agcgcaacat 
16561 atccataaca cccacagcag gtaaagcagc accgaataaa tcgggacgct gagttatgca 16621 agcgccaact aataatcccc cgttacttcc accgccaatt gctaacttag caggctttgt 16681 ataattattg gcaatcagcc actcagcagc agcgataaag tcatcaaaca cattttgctt 16741 tttcgacttc atacccgcct gatgccattc ttccccatac tcaccgccac cgcgtaagtt 16801 agccagagcg taaacaccac ccatttccat ccatactaaa ttactgacag aaaagttggg 16861 acttagagag atattgaaac caccataacc gtagagatat gttggattat tgccatctaa 16921 tttaatccct tttttataag taagaaacat tggcactttt gtaccatcat tgcttttata 16981 aaaaacttgc tttgtctcgt aatcttcagg gttaaaagct acttttggct gacggaaaac 17041 tgtgctcttt ccgcttacca tgtcgtagcg atagacggtt cctggtgtgg taaagctggt 17101 gaaactatag aaagtttcgg tatcatgtcg cttgccatga aagccaccgg ctgaaccgag 17161 tcctggtaat ttaacctcac gtatgaattt gccttttagg tcaaaaattt taatttgggt 17221 atgggcatct tgaagataat cagcaacaaa ctggttatta attgtactaa cgctttctaa 17281 agtttctgtc gcctggggga tgatttcttt ccagttttgt cgcgccggat ttttagtatc 17341 aatagcaatg actcgtcctc ttggcgcgtt gaaatcagtg cggaaataga agatgtgatc 17401 gtcattgtcg ataaaaccga aactagactc aaattcctga atcagttcta caactttcgc 17461 tttcggattc gtcaaatctt tgtagaaaac tagatttctg gggtcagttc cttgccatac 17521 agaaataata aggtaacgtc catcttctgt gacatcaccg ttaaatcccc attctttctg 17581 gtcgagacgg ttgtaaatca gtaagtcttc tgattgcggt gttcccagtt tatggtagta 17641 aagcttttgg taataattga catcttctaa tttggttttt tcgtttggtt catcatagcg 17701 actgtagaaa aatcctttac tatcatttgt ccaggatgcg ccagagaatt taatccactt 17761 caagtggtct ttgaggtcaa tacctgtttc aatatcccta actttccatt cttgccagtc 17821 agaaccagca gtggatagac cgtatgctaa aaaattacca tcttcactga taactgttcc 17881 tgaaagagca acagtaccat cttctgaaag tgtatttggg tcaagtaaaa cttttgattc 17941 agcgtcgagg gaagttaagg tgtataaaac agactggttt tgcagtccgt tatttttaaa 18001 gtaaaagtaa cgcttaccct ctttgaaagg aagactatat ttttcgtaat cccataattt 18061 ggtaagacgc tgcttaattt tttctcttgc aggaatttcc tgaagatagc caaaagtcac 18121 tttattttgt gcttctaccc aagcctttgt ttctactgag tcaggatttt ctaagggacg 18181 gtaagggtct gcgactgaag taccgtggta atcatcaact 
tgattgtttt tggggctaac 18241 tggatagctc aagtgttttt tttcagacat atttcggtgt gaagagggta cttttacata 18301 gtgtaggggt tgctccttag ttatgccatt ttgtattgat gaatgccagt cagtataagt 18361 tttggcaatt gcaatatcag gagctaacaa ccacctacat ataataatag ttgtgagtaa 18421 gatgaataaa aatcgctgta gagttttcat cagtcaacaa gtttttcgcc tttgagcatc 18481 cgggatgcgg cttctagtaa gacttcctca agatatggtt tggtgaagta accacttgca 18541 cctagtttca ttgccatttg tctgtgtttc tgtgcacccc gtgaggtgag catagcaata 18601 ggcaaatgct tgaggttagg gtctttctgg atgcgagaga gcaactccaa gccatcgcag 18661 cgaggcattt caatgtcaca aaatacaata ttacaaggta gatcggaacg cagtttatcc 18721 caggcttctt gaccatcacg cgcctgttct acgcgataac ctgctttact aaatgtgagt 18781 gatagcaatt cccgtactgt aattgagtcg tccacaatca gcactgtggg gtttattttc 18841 cctgcagtcg tttcttgtac ggtgggagtt tgctgctgtt cccaaagact cgaactacga 18901 tatttggagg ctcgtccttg gaagatgtca atgatttcga tcacatccgc aatgggcata 18961 atacgaccat cacccaggat agttgcacca gcaacaccaa taggtttggg aaatggccct 19021 tcaaattgct taattacgat ttcttgctcg ttaagcactt gatcgacctg tagggcaaga 19081 aagacatttg ccgatcgcac gacaacaaca caaatcatgt cgtcatcccg atgaccacca 19141 taaatactac cccgactaag ttgacgatta atagttaaaa tttctttgag cggttggaat 19201 ggtaggagtg tgttacgcca aggaatgtat ttctgcccat cagaaccttg ttggatattt 19261 tttacaggaa catctagggt atcttccaca ccatccattg ggaaagcaat gcgcgcttta 19321 tcagatatac agcaaagggc tttgcaaata ctcagagtca ggggtagacg aatggtgaag 19381 gttgttcccc cgccgagagt ggaattaata ttaattttcc ctcgcatttc gcttattcta 19441 gccagaacaa catccaaacc gataccacga cctgctaatt catcttcttt ttctttggta 19501 ctaaaacctg gttggaacag taaatcgtaa acatgatgtt gagataagct tttagcttgt 19561 tctggcgtta ttatgccaat cttaattgct tttgccttca ccgcttctat gtcaatacct 19621 gcgccatcat cactgataga aatgacggtt tgattacctt gatggaatgc gcggacagta 19681 atgactccca caggtggctt accaatcgct tgtcgtatgt ctggtgtttc gataccgtgg 19741 gcgatcgcat tgttcagtaa atgagtcagc ggggttttaa gatgttctaa aatcactttg 19801 tctatcaagg tttctcgacc ttcgataacg agttgcgctt gtttaccaca cttgatagcg 19861 ctttcgcgca ctcctcgttc 
taaaggagtg gttacttccg caaatggttc cattcgcgat 19921 cgcattatcc cggattgaag ctggttagtc acctgtcgga actgtcgcgc gactcgttct 19981 gtatcttctg ttacaaaatc aatatcactg gctgcctcac gcacccgcac aatcagttca 20041 atcatttcct gcgacagggt atgaaaagga gtaaactgat ccatttccaa ttcagtcaaa 20101 cccctgtcta tatctttatt gtgatcttcg cggtgatttc tgcggttcgg gttaacagat 20161 gcatctaata gcgatcgttc gtataattcc tccattctcg ccccgacatt actcaactgc 20221 tgcatctgat gcagcaagtt atctaaaaac tgtcgcagac gttgttggtc ttgttccaat 20281 gtgttgcgat taacaactaa ttctcctact aaattactca gatcatccag atgcttaact 20341 ggaactttta ttagttgttc aaattttgtc gtgcgataga caaagggacg aggaggcaaa 20401 tctttcttgc ctctatctga tgttacctgg tttgctgcta cacccaataa tttttcttgg 20461 gcggttcttt cttctggtct gagttcagca ttgacttctg atacaacttc ctggttgagt 20521 aactcattta actcagtcaa ttcgtcattc aagaagatat caatttcttg agtcaattcc 20581 tcctctgaag aaacggcttc tgttaggaac aaattttcct tgaaatcaaa tgctgaatat 20641 agctctaaag aatgctcgcc atccaaatct gttggtgaga tagtttctgc ttctgtgtgt 20701 ctcatccctg actgttcttg tgaacactca ttgaccatca aattaatatc ttgaggctgt 20761 tgtgtaaaca acgaattgtt aagtaagtct tgcttattgt cagtttctaa aggcagaatt 20821 gtaaattctg gactgaattc tacagcttct ggctgttgcg aaaacacact cttttctgtc 20881 ttagacgcta cgcgatagta agatgtactt tggggttcta tagtcgtgtc aagagaagaa 20941 gtgtttatga gatttgatgt taaatcttga aaccaatcat catccgtctc tgatgagaat 21001 aatagattaa attcttgttc atcaattgga aaatttagtt ccaaatcttc aaaactcaaa 21061 ctgtctaaac tagagttccc cgactgagca acattatcaa aaaagtcctc agcagcggtt 21121 gtcaacaaaa tttcttctaa agttgctaca tcctgttgat gggctggaga aaatttgtcc 21181 ttttctccct ttaattccag ttcccaaatt ttgtttaaat cttctgtttc aaactgtgca 21241 ttcaacaaat caccaaatgt tgagttgttt gcagacaagg ctatagtctt gtcttcctct 21301 gtctcaataa atagctcttc taaagtcaat atttcgtctt tatgtgatat tttttttgat 21361 tttatggact ttggataatc aatttcctcc attagtttgg ctttcgtcga taaggaaaaa 21421 tcatgatagc taatttcttg gtttttggaa agctcgacat tgactatagt ctctgtttgt 21481 atttgatttc tttcgcctgt gggtaaggct tgattttcat ctagtaacaa ttctaataaa 21541 
ctagtgactt gatcttgtct tttttcgcca gcatcaaaca actgctctac tgatgccaat 21601 gtatcgatac ttaattctct cgttaagcca ctcgttttat gaagggctga ttctaactta 21661 tcttctaata aatcatcaga taacgccaga gtaaattctt cggtttctgc tgctttctgc 21721 tgagtactaa tcgtgtcttc atcaattaaa atttcattct tgttatcagt ctcttccgta 21781 ttggcattgc tcacatctct ttttaatgtt gcctcagggt taatatctag aatttcttct 21841 tgctcccaca ttccctctaa gtcgggactt tcaccttcaa aaaaatcagc cagggtattt 21901 aagttagcta ttctgacttc ttggttattc gtatctgtcg tatgggtgag agcagcgtgt 21961 tcattgctat taaattgttc gcaaagttca gataaactag taattctgtc ttgtcggtca 22021 atgtccaaaa ctgttgtaga agttgcatca cttattgttt gtggaatttt tttacctgtt 22081 cccaaattaa tggtgctaga agatgtaact ggtggcaatt ctaatccttc tgttgggcta 22141 tctgactcat ctgccagaat caggggaatt tctaacaatt caattgtttc ttctactctg 22201 acgagtgctt gtagttgctg actaattgtt atttccgctt ctctaagagc aataactaat 22261 tctaaagctt gtttaatatc tgtaataatg gttttagcta gagtgagata ggtattttgt 22321 gaaaatgcga tcgcacttgc tgctgttcga cacaaaccac accagttaga ccaattcaat 22381 tcttcaccaa gttcagctaa cttgtcacaa cacttcacaa ggctttgtcg cgtttgcgat 22441 gttgcagttt gtttaaacag ttgcaacatt tcccgtaata tttggataac ttcactttgt 22501 acagtttgta ggttctgcca actctcactt gtttgtgcta cggtttgtgg cgaagttgca 22561 ggagaagctt ttgctaagat acttgtattt tttgtgatat tattgcgatt ttgttctatt 22621 agtaagttta aatgttcttg caaccatttg aagacaggtt cagtttctga cattaaggtc 22681 tgtgccattt catcagtaag accataaggt tcagtgcaac tttctaacag cgcttttaag 22741 gtatcagata caccaagaaa caaagattct aatttttggt caacttgaat tggatgctct 22801 ttgagaacct taaaatagtc ttccaaacgg tgagctgtgt gttgaatgct actcagacca 22861 agcattgctg ctcctccttt tatagagtga gccgcccgaa aaacttcgtt aatcatttct 22921 gggttatcaa gggtactctg tagattcagc aacccctttt ctatcgtatc taggtggtct 22981 tttgcttcct caatgaagta acccaaaatg cgctgttgtt gttctggcag catagttttt 23041 aatcattaag tcattggtca agagtgaaaa ctcaatagtc tataatctat agtcaaaagt 23101 caatacttat actttgaact atagactatt gactaattac cttgtatttt cagagatttc 23161 gacttggaaa cgttcaacgg aggcgatcag ctcaccagat acactgacca 
aactatgtaa 23221 agcaccagag actcgttgtg cttctcgtga agtgtcttgc gctgttaatt ccacagattg 23281 cataacttca gccacaccac ggaaggtttt ggtctgttca acagtgtccg aggtaatccc 23341 gcgcactaaa atatcgatgc gcttggcgac ttgaataatg ttttctagcg atcgcttggc 23401 ttcctctgcc aacttcgtac cattaatgac ctgttgtata ccttcctcca tcgcaatcat 23461 tacagagcct gtttcgctct gaatctgcat gactatttgt tctatttctt ttaatgactt 23521 agctgattta tctgctaact ggcgcacttc atccgctaca attgcaaaac ctcgtccggc 23581 ttctcccgcg cgtgccgctt caatactagc attgagtgct aacaaattcg tccgagaagc 23641 aatttgcgaa atcaacgcca caatcttgga aatttcttga gaagactctg ccagtcgctt 23701 cactttccgg gttgtgtgcg ccacggtttc ccgaatttct aaaatccctg ccaaagtatt 23761 atctactgct tctccaccta ggagagcaat attgctagca tcatgggtaa cagcttctgc 23821 ttcgcgtgct gcttgtgcga cacgttgaat ggagtcagtc ataacttgta cagaattcag 23881 cgtcaccgcc aatttttgcg cttgtcgtaa agcatcacga gataaggctc tcgcaaaggt 23941 ttcggaatta gttgcccctc tggttacttc ctgcgctgcg attttcacct gttgtacaag 24001 atcccgtatg ttttgaattg tgaagttaaa agcatcagcg atcgctccaa gtacgtcggc 24061 ggttacctcg gcttgcactg tcaaatctcc tctggcagct ccttctacat cgtctaacag 24121 gtgaatcact tggcgttgca ggctttctct agcttcctct tgttcttgag ctttacgtgt 24181 cgcttcatta gtagtcgtca atataacacg agtcatttgg ttaaagctag ctgccagctg 24241 cccaaattca tcttttgaat acacggtggc ttggacatga aagtttcctt cgcgcacagc 24301 atcaaattga ctttgtaaat ttttagcagt gcggcaaatt tgcctgtggc tgagactccc 24361 cataatacct gcggtagcaa acccagcaat tccagatgcc aaagtcatta ccctacctgt 24421 gttttgcacc aactcccgtt gttcgagtgg taacaactta gcataaacaa agctgactcc 24481 agcaacaacc agggctgaaa caacacccac tgtaccagca acaatccatt gcttagtttg 24541 tagagaggca ttttctaatg gagcaagaaa accttgcttg acagagactt gaggttttct 24601 actgcacaga tcagccccga tcaaaactgg gactgcttgt tgggaaccag tggttataaa 24661 tagctccgaa tcgcgatgat tcgtaagggt actacgcgaa gcccaatcgt ccgtttgggg 24721 cgggagaaaa ccgctatcgt ctctgggggt agtatctgaa gtgaaatcgt gattttgctg 24781 agtgggtgta gtctctactt tactgcgtcg tgaagtgttg ttttccacag atatcttcaa 24841 cgtctgtggc tcctcaaagc gagaatcttc 
tgtcaagtca aatctcggaa tataccctga 24901 gtcgtcaaat ttatcaaagt cgtctaaaca cttctttttg ttacccttgc aagtatttgt 24961 gttctctgct ttcgcttctt catcattgac attagatagt tgccatacct gttgttgaat 25021 aaatgtagaa ggggctttgc taaaattttc ttttgaattg aacgttgtat cagatacagt 25081 actatcgtct attgatagat caaatggact gctaacaggt agctcttcca tatagtctac 25141 agctttttga tcctcatcca aggaagtcaa atgaaagcta ctactgttac tgttaaagtc 25201 ttgatcatct gccagatttt ctaattgttg ttttatcact gtttgagatt cagaagaatc 25261 agaaaaattc ttatattttt gaccagattt ttcaatatct tgagcgtcta gtttgcgttg 25321 atattgtttt atgttgtcaa tgccattgcg ggcaaggctg ataatttctt gatcgtcagt 25381 aaattttaaa accttttggt attctatttt tgccacatca cactgctgca aaacataata 25441 gatatgaccc cgcagcaagg atatattagg gtcatatggt aaatgttgca ccagttgatc 25501 aatcagagtg gctgcttgtt tgtagtttcc ttgcacgtaa gctttatgag cctgatgata 25561 gatatgctcg tactctttta tactcactac ctttttctgc ctccatttca tcctgcccct 25621 ataagtccaa agcgtgctcg ttcacccttg gcataccgaa attgcgaggg tgagcagcgc 25681 gttcccagag ttgtctgttg ctcgaatttt ctccgatgag aacttaggag gatgtctcct 25741 caacggcgat ttttctagtt tttacccaac aaccgcgcac ttcgtaaaat tgctttgtga 25801 tcaagcagcc ttagacactg atttttttta ttttctaata tccattctcc ttgcaaaaag 25861 ggagccattg tatctgatac gctctttaac actgtcagat gctttttgtc aagccagtcc 25921 ataccgccga tttcttcaac tgctaaaccg actattgtgt cttgctcttc tatagcaatc 25981 acaggaatct ccgaacgatt cgtatttaaa gtggttcctt gtcctagaaa ttgacccaaa 26041 tcaaccaccc aaatgactcg acctcgtaga tttagagcac ccaaaagcaa accagaagta 26101 ttaggaattg gggtaattct atcaggactt agttcaatga cctctcgaat atcggttgct 26161 agtagtgcaa actcctgatg caagggaata taaaatctta gacataactc tgcttctgga 26221 ctttttactt gtaattcaga accggagtgg tcttgtccac ctcctcctaa aaagtccggt 26281 tgagtgatca ttttgtgttt atccttatcc tcgcagtagc tgtttgactg ttcccaccaa 26341 ctcagttggt tgaaagggtt tagctatata ggcatcggct ccttgtttga taccccagta 26401 gagatcaact tcttcacctt tagaagaaca tatcaccacc ggaagatttt gggttttagg 26461 attagatttt aagcgacgac aaacttcata gccattcatc cgaggcatga ctatatccaa 26521 aacgactaaa 
tcgggacaag cagtttgaat ggcttccaat gcttctattc catcactggc 26581 atgggtcact gttaagccac tcgctttcaa taggtctgta atcatctctc tctgtgtaac 26641 actgtcttcc actatcagaa ctgtactcat atgacgccca tatacctcct gatgtagacg 26701 tttgtggtta gaacatttgc tgattttcct aactgtcttt actgtataaa agaattctat 26761 cccaagctgc cgagagtacc ccatccgaat cgaaatgcga cttaagtaat aacaagggaa 26821 aacaaaatgt tagcaattat acggctgaca gacaagtctg tcagtataaa atagaaatag 26881 gtgacaaatt ataacgagta tctgttgtgt gtcatacagt atagacctaa catgtaacat 26941 ccatatttgt atgtgattca aacgataacc gctatttttt atattagtca ttaatgatac 27001 ataacttatg gacgataaaa ttgaagtatt tcagccgtgg gataaaaaca agaaaaatcg 27061 gacaacctac ccctacaccc ctataccctt acagagttac atcccttttg gtgactgata 27121 ttaccatgcg ccacgtcact gcgtaggtag ctgttcacta aatttaaggt tttgctcaca 27181 ataggagtct taagtacaca ataaggaata acagtgtgct gtatttagca gaagtacaaa 27241 agcagaaagg cggtttactt ggtggtggca aaactgaact aaaactactg gcttgtcaac 27301 gaagtgacca aagttggagt acagtttctg aagaagtggt ttttgctgag gaagctagca 27361 aattcagtga tggagtcctc gtattggttg aaatgaatcc gaaccgtaaa gtacaacgga 27421 ttcaggaggc ggggcgtcct ttggtgaata ttttgcagaa tttttcccgt caggtggaaa 27481 aatttaagct tagagaagaa gaaattaacc agtggaagca gtctctgacg tttcaagcgc 27541 aagagttaaa tcggcgtgaa atggatatgg aagagcgttt ggaacaactg caacaaatgg 27601 aggatgaatt tcagcgcttg gaggcgcaac aacaacaagt agaagcttcc cgccaggaaa 27661 ttgaaaagtt gcaaacggag atttcgcgca accgtaaaga actccaagga gcatgggagc 27721 atttgcggag tgagcagcgt cgcttggagg agtacaaagc acagtgttcc cactctgctg 27781 tgttggatca tcagcaaaat ccagtactaa acgagttact tgctcaagtg tctactagcg 27841 ttactctcac acaaacagta cgcgaatatc tcaatcatgc cctgaaagtc ctcgaaacgc 27901 agcaggaagt tttaaaccca cactggcaac agttacaaca gtatacaact acagttcatc 27961 aacagcagga gattgaataa agaagttata ttcacaactg tttgaggtaa atccagtcac 28021 cttgaacttg ggcgatcgca ccaccaggaa atggatcact ctgcgaacgg tttggtgcac 28081 taattaaagc gattagtttt tctatttgct caaaggtgca accactctga atgatttctt 28141 gtaatacctg acgcatgacg cgacgctgta gcgccagtgg tgctttctgt aatacgcgac 
28201 ggtttagctt tattgaatat ggattataaa gatttccttc ctcacctgtt gatgttacca 28261 tcgcctcttt tcgcaactga taagcagctt gttctaaata ttccacttct gcttgcaaaa 28321 tttcagcagt ctgggcaaga actgattcaa cttgagggtt gaaattttct tgcaaatatg 28381 gtaatacatt ttgacggatg cggttacgtg caaatttcaa atcttgattt gttgaatctt 28441 cccaaacagg aaccagaaaa tcttggcaaa attgcccagt ttgcatacga gttatttcta 28501 gcagtgggcg tacaagcagg atgctgttat ccagcagcct ttgccatgtt aatgcttgca 28561 agccatcagc acctgtaccg cgaattaagt tgtaaagtag agtttcagcg cgatcgctgg 28621 ctgtgtgacc tgtaacaatg caatgaaatt gattttcctg ggcgatcgca ctcaaagctt 28681 gataccgcca gttgcgtgca gcagcttcgc tttttaaagg ttcgttagct gttcgtaaat 28741 aaaatgagac actccaattt tttgccaagt tttctacatg cttggcattt gcttgggagt 28801 cagaacgcca gcaatggtcg cagtgggcaa tactgagata ccatccccat tttggttgta 28861 aatctaacag tattttgatc agacacaaag aatcttgtcc acctgaaact gcaatgagta 28921 gtcgttggtt gcgctcaaat aggtgacggg agcggatcgt gcggtgtaat tttgcatgga 28981 gaggagtcca aggagtgcga tctgcctttg gcagcaagct tcgcttatcg ctcatttagg 29041 aatttaaagt tcgggggcta tggtgagaag aggagggagg gagaggggga agtgaactga 29101 aatgataact ggtaactgat tcagtcatgg aagtgacaaa gaattgtcat tgacccgtca 29161 tattagcttg ctagagttat ctcaaggaaa aattaagttt gctcctatca ccacagttta 29221 tgctctcttg tatggtagtt agcaagggag catttttata gcatatattt attcacacca 29281 cgccgtttat tttttatgtt acctgccatg ggttttttct ccaaactgcg tccattcatt 29341 ttgatagtct tttttataag cttcctctta agtggagtgc tatcaggagg ctattttagc 29401 aatgctgcaa caccaagtgc agtcacacct gtgtctagtt ttgccctcac tcagggggta 29461 cgtaaaacag tattaaaaaa tggtttaaca gtcttaacta aagaagtcca cactgcacct 29521 gtggtgagtg tgcaggtttg gtatcgagtt ggttcgcgca acgagaaagc aggagagaat 29581 ggtatctctc accagctaga gcatttgatg ttcaaaggaa ccacagatcg tccagtacaa 29641 tttggtcggc tgtttagtgc tttgggaagt cagttcaatg cttttacgag ttatgatgag 29701 acagcttatt ttggcacagt gcaacgcgac aaattagaag cactagtcac cttagaagcc 29761 gatcgcatgg aaagcgcgtt agttggagct gaacaactga caagtgagaa gcgtgtcgtc 29821 atctcagaat tacaaggata cgaaaattct ccaggctatc 
gcttaaatcg ggctgtgatg 29881 cgagcagctt tcccgaaacg agcttatggt ttacctgtag gaggtacaaa agctgatgta 29941 cagcaattca cactggagca ggtacgcaat tactacaata cctactacag ccctgacaat 30001 gcaaccctag ttattacggg ggattttgcc acagaacccg tgctgaaaac tgtccaaaaa 30061 acttttggga agctaccaaa acgggcgaaa caagacaacc agactaaggg gaattcttca 30121 aaaacaggtt ctccctcctc gaataatact tccactccta aaacaacgcc ttctccctcc 30181 tcgaataata cttccactgc taaaaaatca cctattgtct taaaagagcc aggaagcgct 30241 gcattattgc aagtcgttta tcctctacca aatcttactc atccagacgt acctgcaatt 30301 gatctgatgg atgtgattct cacgggcgga cgtagctcac agctttacca agctttggta 30361 gaatcaggct tagcaagctc ggtgggtgca agtccttcag aactcattga accaggttgg 30421 tacgaaatta atgtcacagc tgctcccggt caacagctat caaaaattgc cctggtactg 30481 gaacagtctt taactaaact acaacaaaaa caagtcacgg cagaagaatt aaaccgagcg 30541 aagacgcaac tccaagcttc ctttgtactc gggaaccaag acatcacctc tcaggcgacg 30601 caactaggat ataaccaaac tgtcgctggg gattatcgtt atgttgaggg gtatcttaaa 30661 acaattgcca aagtcacagc agcagatgta cagcgagtgg cgaaaactta cctcaatcct 30721 gctaaacaaa ctatcggctt ctttgaaccg actttaccag gtggtaagcc agggagttcc 30781 agtggtggtt ctaatcgcac tgtagaaaac ttcagtcctg gtaagcctgt cgatccagca 30841 gaacttgcca aatatctccc tcccgcaaca tcagccactg cttcgactca acaaccgttg 30901 ccagagcaat ttatactcaa aaatgggtta cgcgttttcc tactgcctga tcacagtgtc 30961 ccgaccgtta atctcagtgg acaaattgat gctggtgccg aatttgatac taaccagaaa 31021 gcaggtttag caagtttcgt tgctagcaat ttaatcaatg gaactcaaac aaaaaatgct 31081 ctaactctag caaaaacgtt ggaagataag ggagtaagct tgggattcag tgctagtcgc 31141 gaaggagttg gtattagtgg aaatgggttg tctgctaatt tgccgatatt aattcaaact 31201 ctggcagatg tggtacaaga tgctaccttc ccagacaagc agctagaact tactcgtcag 31261 agagctttga caagtctgaa agtacagctc gatgatcctc ggggattggg acgacgggtc 31321 tttcagcaag caatttaccc ggaaaatcac ccgtttcata gctttcctac agaagaaagc 31381 ttaaaaagcg tgactcgtgc tgatgtgctt cgcttctacg aggaacacta ccgcccggac 31441 acgacaacga tcgcccttgt tggtgatttt gacccaaatc aagtcaaagc tttactgaat 31501 aaagctttcg ggaaatggca 
aactcagggt aagccaccga ctcttaactt acctcaggtg 31561 tctttaccgc aaacaatgaa acagttgagt tcagtgattc ccggtaagac ggaagctgtg 31621 acttacattg gttataatgg catctcgcgc aaagactctc gtttctattc tgctttggta 31681 ctcaatcaaa ttttgggcgg cgataccttg tctagtcgct tgggtactga ggtgcgcgat 31741 cgccaaggtc taacctacgg tatctacagt ggctttgcca caggtgtcaa tcctggtcca 31801 ttcttaattc agatgcaaac tgccccagga gatgcccaaa aggctattac cagcactcta 31861 gctttactca agcagttgcg agagcaagga atcactgagg ctgaattgaa tacagcgaaa 31921 cgctcaataa ccaacaacta ccctgtagaa ttagctaatc ctggtaacgt agcgggcatg 31981 attttagaaa atgccgttta tggtctttct caagcagaaa tccgagaatt ccccaagcga 32041 attgaagcag tgactccggt tcaggtgcag cagacaattc aagaacttat tcatccagat 32101 aagctagtga ttgtcacggc tggacctgga gcttaggtca agtatgtaag gataaatatt 32161 cagcaaccca gtgtctgaag acactgggtt gctgactaac gactggtcaa atttaaaacg 32221 agcgcggcag ggttcgaacc tgcgaccaac ggattagaaa tccgtggctc tatccactga 32281 gctacgcgcc caatcaaaat aactggcttg ttgcacagtc agtttctata ttatatctgt 32341 gaacctaagg aaaaggcaat caaaaattat tgcagatttc accttggcga agctaagatc 32401 caagtcgtcg caagagtgag tcaatgaaca aacagcaaaa acaaagcaaa tggcatattg 32461 gttagtcatt ggcgtgagtg tggtagcagt tttaccctga aattgactgg ttcagtttca 32521 taaaaactgt ctcctgaact attattcagc caaaaaactg ttagagtgca tttatactgt 32581 aaagttctat gagtagcgct gacaccaagt tacactgtct caaaactatg ttcttcactc 32641 aaggcattgt cccaggtctc cttagttgga gtctgctagc acttctacac tctttggtgt 32701 ttagtggtag aatctcaaat tcactcttgc tcctaaagcg tgctgagtgg tctgttgttg 32761 gaacacatat ttttgaagca atagcttatc tggctgctgg tgtcttatgc ttaagaaatt 32821 ggcgcagtcc acaaattcct agtggttata gtgtgtggtt agtcatcagt atcgctatgc 32881 ttttatattt ccaaaggaga gatatttgtc agttgtacag aaatagtctt gcaagaagaa 32941 cttatcgttt cccttgccga tctgtttttt gtgatcactt atttttctct ggagatgggc 33001 atgacgcgct ttaaatggac cacaataggt gttactaact atcaaaatag aaagatattt 33061 agaggtattt tgggcgtgga attgagtttt atttggcatg ggcgctattg gtggaatacg 33121 acgcatcatt catccgttcg caacgtgtca gtggaagaaa aacagtttag aaatttgtgg 33181 
tgtctaccgt gagaaaacta acagaaccag ataaacacga aattctcaaa ttataccgcg 33241 aaactgccga aacgacctca actttggcag aacgctatga cgtgagtaac tcgacgatta 33301 gccgcttact caaaagtact ctgccagaag atgagtatga atatcttgtg tctttaaagc 33361 gtgctgcaag aacgcctgaa ggaagggcac aggttaatta cgacaatttg cctgcgttta 33421 ccaaccagcc acaagaagat cagacacaaa agcaagaagt attatctcct gttgttgagc 33481 aaaatgtcgt tgagactcag ccccagaggg tatctcctgc tcaacgccag atacccaagc 33541 caaaagactc tcctgcacca caggtttcac cccttcggct tggtgcagtc gctcctgggg 33601 gaaaccccca agacctcggc gactacccgc aagaagaacc agtcgtggca caaaacttgc 33661 actttgtgga acttgatgag ccaactcatc ctagcagacg attgaagcgg cgatcttcag 33721 ctccaactaa accaatttta ccaatccaac aagcgcgatc tgagcaacca gttgccgagc 33781 aattggaact tctggaacaa aaacctccag aaatcaccag cattcctagt ccgcttttgg 33841 aagatacaca tccaaacgct aacgtcattg cggaaatgtt tggcgaagac ttactagatg 33901 agtcagatga tttggaggat ttagatgatg atgatgatga tgactatgac gaagaagact 33961 ttgaaccagc tgcacctttg gtcacaagac caaggtcggg tgacgcatta gtaaaagtct 34021 taccactctc agcagcggct ttacccaaaa cttgttattt agtcatagat cgttcctcag 34081 aattaatcac tcgtccattg cgggaatttg gcgacttggg acaaattcct agcttggaaa 34141 ctcaacaaag aaccttacct gtgttcgata accatcgaat tgcaaagcgc ttctcaacta 34201 agcgcgatcg cgtgattaaa attcctgata caaaaatgct acacaaggct cgttatcatc 34261 tacaggcaaa aggaatcact cgactgttaa ttgatggtca agtttactcc ttgtctagtg 34321 tttagttatt agtttgattc caaaacaagc ctcccactct actcaaaaat tttgtagagt 34381 agtccggaca gatgcacaca gaacagataa tttatttgcc tttatctgca cgccacatgc 34441 ttcaacgggg ggaacccccg caacgcagtg gctcgttttt ctgcgtgcat ctgcgtgcat 34501 ctgcggtttc aagtagcccc aaacctcgtt ttttcccaaa tctaatgtgc cgttctataa 34561 aaggcagcta actccgttca tcaaaaagtg gcaataagtc cttgccatat agttgaatca 34621 tgctatgttg atattacgta cactaagttt tttgttcaaa gatggttcat aaacctaaca 34681 agtctcaaga tttggagtcg tggcaacaag tccgtgcacc atacggttta ggttaccgga 34741 ttaaactcct ctcacaactg ttaagtcgca agttgactga gcggttggag ccgtttggac 34801 taaccccttt tcactgggta gtgctttgct gtttatggga ggaagacggt 
ttaccgactt 34861 ctagtatcgg ggaaaaactg caacaggtgg gaggtacctt aactggcgta ttggacagaa 34921 tggaagaaag aggtttgatt cgtcgagaac gtgactgtcg cgatcgccga atctggcgta 34981 tttggctaac cgatgcaggc aaagaactgg aaacagtctt gccagctatt gcagtagaaa 35041 ttcgcgagca agctatgcac ggtatttcct atgctgaacg agaacagttt tcccaacttc 35101 tgaatcaggc aatcgataat ttatcgtaga gtcattcatc tggatgaggc gctttattag 35161 gtcttttctg taaaatatta cgtatactaa ctaaataact ataaaataac cttatttaaa 35221 tatttagcac actaaattta taacgggcaa aaacgatact gaggggctta actatcatgg 35281 aacgcacaaa caacaccaac tccaacggac gtaacggaca caaaacagca gttctggaga 35341 aagaactcgt taaagacata gagactttgg aagctttaac atcagaaaat ttgaccaata 35401 aaatgccagc agaaacactc gtcaaagctg atctacatac ccaacctgaa acggcggaca 35461 aagaagccaa accccaaccg ccggaaacac ctaagaagaa aaaaccgatt gccttgattt 35521 tgacagcatt gggtgtgggt gctgtagctg caggaggttt tggttatcat tggtggcagt 35581 atgcttctac tcaccaagag acagacaacg ctcaagttgc aggacacctg caccaagtca 35641 gcgcccgcat tcctggaact ataagtcaag ttttggtaaa tgataaccag gaagtccaac 35701 caggacaatt gctggtaaca ctcgatccac gcgattatca aagcaaagtg caacaaactg 35761 aagccgcgct acaaaatgct cgtcgtcagg cgcaagccgc acaagcaaat attaacttag 35821 cctcaaaaac aaccagtgct aagacagttc aggcacaagg agatgttagc ggtgctgtcg 35881 cagcaatttc cacagcgcaa gcagcggtac aggaagcaga agctgggatt cccgccgcac 35941 aagctgaggt gaaacaggca gaagctggga ttcccgccgc acaagcgcaa gttgcacaag 36001 caaacgccaa tttgcaaaaa gcgcaagctg attacaaccg ttacaacacc ttgtctcaac 36061 aaggagcaat tccccgtcag caattagaca cttctaagtc agcgtatgat gtggctgtag 36121 cgcaaaagga tgctgctatt caaggagtta accaagcgca agcacgatta gccgccgcca 36181 gagttggggt agcaaaagca cagtcgcaac tagcgcaagc acaagaagga gtggtgagtg 36241 cgcaagcaaa actggcagca tctaaaggtg gattgcaaca agcaactgca ggcggggaac 36301 aaacaacagt caaccgcaac cagtatgaag cagcaaaagc agctatcgct caagcggacg 36361 catccttgaa agacgcgcag ttgcagttat cctacaccaa cattaccgct cctgctgctg 36421 gacgcgtggg gcgcaagaca gtggaagttg gcaaccgggt acaagcagga acacctttaa 36481 tggcgattgt caatgatgat tattgggtcg 
ttgccaactt taaagaaact cagttggaag 36541 acatgaagcc aggagaagaa gttgagatta aactcgatgc ttttcctcat cataccttta 36601 agggtcgcgt tgatagtatc tcgccagctt ccggcgcaca gtttgccctc ttaccaccgg 36661 ataacgccac aggtaacttt acgaaaattg tgcaacgtgt tcctgtgaaa attgttttcg 36721 acaaagaaag catcaaaggt tatgaatcgc ggattacacc gggaatgtca gcagaaatca 36781 gtgtaaaagt caagtaggac gaggtgatag ggtacaagga aaaatttgtt cctaaaacct 36841 cttccaaaag cagtggtatt cctttccacc tgtagagagg ggttaggtgt agtgtcaaat 36901 gatatctatg aaaggatatt taatttgtat aaagtaaccc ttgaatgaac gcacaagcaa 36961 ctagaaagag tccactgtta ccggcattca ggtcaagaaa ctaccgtttg ttttttgctg 37021 gacaaggcat ttcccttatt ggctcgtgga tgacgcaact tgccacgatt tggctagttt 37081 atcacttaac caacaaccca tttatgttgg gggttgttgg atttaccagt caaattccta 37141 gcttttttct aactccctta ggtggggtat ttgtggatcg tttttcccgt catcgtattc 37201 ttattggcac gcaaatacta gcgatgattc agtcgctggc gctagcagtg ctgactttta 37261 caggcatgat tcaaatttgg cacattatcg ccttgagttt actgcaagga tttattaatg 37321 cctttgatgc accagcaaga caagcatttg taacagagtt agtggaacgc agagacgatg 37381 tagcaaatgt catagccatc aactcaacaa tgtttaatgg cgcgcgcttg attggacctg 37441 caattgctgg tttactcatt gccagagtcg gcgcagctta ctgtttttta attgatggtt 37501 taagctacat tgctgtgatt atcgctttgt tagcgatgaa atttaagcct tggaaaacta 37561 cagttactgg cggcaatcct ttgcaaaatt ttaaagaagg gtttgtgtat gcctttggtt 37621 ttccaccaat tcgagcaata ttattattaa cagctttctt cagctttttc ggaatgcaat 37681 acaccgttat cgttccgatt tttgcagagg aaatcctcaa aggtagcgca gaaactctag 37741 gttttttgat ggctgcgtcg ggagtcggag cattagcaag tggtattcat ttagctacgc 37801 gaaaaacagt tatgggactt ggtaaagtga ttgttttggg tctagcaatt gcaggaattg 37861 ctttgattgc cttttccttg tcgcgcttgc ttccactttc tttactagca atgttgtttg 37921 ttggtttagg agtaattctc gtaattgccg gtagtaacac agttttacaa accattattg 37981 aagaggaaaa gcgtggacgg gtcctgagct tatacacaat gtcatttttg ggaatgatac 38041 cctttggtaa tttagctgca ggtgcattag cccatcaaat tggtgctcct tacactttaa 38101 ttattgatgg tattgcttgt attttcgggt ctatctattt tgccaaacag ttgcctgagt 38161 taagaaaaat 
ggtactcgca atttatgagc aaaagggtat attagcaact gcgaaacctt 38221 aaaaaactca cttaagaatt actaacaagg ttcagttagg gtagaaaata gaaatatatc 38281 tctcattcaa tacctggagt actgctgtgg ctgaaacaaa tgcagttgac aaccaaggaa 38341 atcaaggttc tatacaatct tcctcgaatc aacaaatacc gctcagaact tggattggcg 38401 ttcttgcgag tatgctaggc gcatttatgg cggtactgga tattcaaatt accaactctt 38461 cgctacaaga tattcaggca agtttggggg caactttaga ggaaggttct tggatttcta 38521 ctgcttattt ggtggcggaa attgtggtca ttcctttaac cggatggttg tcacgggtgt 38581 tttccctaag acggtatttg ttagttaaca ccgccttatt tatctttttc tctgtatgct 38641 gcgcttggtc atgggatctc aattctatga ttgttttccg cgccttacaa ggcttcaccg 38701 gaggggtttt aatccctacc gccatgacgg ttgtactgac cactttacca ccatccaagc 38761 aagcaattgg gttagctgcg tttgcgatta cagccgtttt tgcaccttca attggtccga 38821 cattcggagg ttggttaaca gaaaacttca gctggcacta cagcttttac ataaatgtag 38881 ttccaggagt gttgatgctt gctggggttt ggtatggaat taagcaagaa agaccccaat 38941 tacaattgtt aaaacaaggc gactggtggg gaattatttc aatggcaatt gggttagctt 39001 ccctgcaagt tgttttagaa gaaggtagcc gcaaagattg gtttggttca gcgttaattg 39061 tacgcttgag tatattggcg gtgattttcc taacaatctt tttctggatt gaattaactc 39121 gcaagcagcc atttattaat ttgcgattgg tgcgatatcg taactttggt ttagcaagta 39181 ttatcaatgt ttctttggga gtgggattgt atggttcaat ttatattttg ccgctgtatc 39241 tagctcaaat tcaaggatat aatgctctgc aaattggtca agtgctgatc tgggcgggaa 39301 ttccgcaact gtttattatt ccctttattc caaaagcgat gcaacgtatt gatgtgcggt 39361 tgatggttgc tgtgggtgtg gctttatttg cagtaagtgc attcatgaat tccaagatga 39421 cttatcaaac gggatatgac caattaattt ggtcgcaact tgttcgcgca atgggacaac 39481 cattgattat ggtaccccta acctctatcg ccacttctgg tttgagtccg aaagaagcag 39541 gttcagcaag cggtttattt aatatgatgc gtaatatggg cggttctatg ggaattgcag 39601 cattagcgac tctactaact aatagagagc aatttcattc caatagatta ggtgaatcag 39661 tatctttata taacccagct actcaagaga gaattaacca aatgactcaa tattttgtca 39721 gtcgtggttc tgatttgagt acagcacaag accaagccat aaaagctatt gacaacatag 39781 tgcgtcgtga agctttcgtt aacgctttta acgattgttt ctactttatc gcaattgctt 
39841 tgttacttag cggtcttgct gttttattca tcaaaaaagt taaggtaacg ggtggtgctg 39901 ttgctcatta aatctactgt acttcgctgg atataacctt caaaaagggt gagagcaaga 39961 taaaaaagcc tacgaattat ttcacaaaat ttataccatt tctgtatgat agcttgcccg 40021 taaagtgcca tagttgcggc acgaacttat caggaacgaa ccgctaaaaa tctttatttt 40081 ataactctaa tttgataaaa agtttctcgg ttccagagaa tttaaatatt acggttatta 40141 tttgttatct acttacctcc atacttcttg aggatgaatg caacttctct caaaggcaag 40201 atgcataatc tgaaaaattg catcatgatt tgagtcaaag aattaacttg tgctcctgtg 40261 ctgttgtcaa aaaacgctta gcaggattct ttctggctga cgagaaatca cagaaaacct 40321 gtcacggtta ttgcagcagt acgtagcaaa agcagggtga tgaggaaaca caatgaacac 40381 gacgacaaac ctagagcgta aaaaagcatc gcttccggtg gcagtatttc tctcagcgtt 40441 gctcacacta ccagtgctta gtggttgcgg tggaggttct cgaactgctg cgccgctacc 40501 gcctgttgac gatactgctg gtagaaatgt gggttatcca caatctccaa accaacccca 40561 acagactaag aggggattga caacagggca gaaagtggcg ataactttgg tgggagcagc 40621 tgcgctttac tacttgtaca accagcgaaa gaacgctaga ggaaacggag cgcagggtaa 40681 gtactacctt tctaagaatg ggcgtgtata ctaccgcgat gatcaaggtc gtcctcattg 40741 ggtaacacct ccatccgaag gaattcgagt tccagagtca caagcgcagc agtatcggga 40801 tttcaaaggc tacaatgggc gtacgacagg tcgcgatttg actgacatcg ctccacagga 40861 agcgccgaca tactaaaaat cacatctagt gttgcactaa agaccccaaa aggatagaag 40921 ccaaaaggat agaagccata attaaccaca aatatcaaaa ggagataagt catggtagac 40981 gactttattc gcaaggcgaa ggattttttg ggtggtgggt caaacgaaga tgaccgcgaa 41041 gtacgtccag caagtgaaga cccatacggc gaccccgctg accaggatta ctacagtaac 41101 gccatccccg ccagccaaga cccctacggc gaccctgcag accaaaatta ctacggtaac 41161 gccatccccg ccagccaaga cccctacggc gaccctgcca atgactacgg gcagtttggc 41221 aatgttcgcc cagcaagtga agacccctac ggcgatccag cagatgagga ttaccgttaa 41281 cgcacaaaaa acaaatttct gcttatgaca gtaaagtcca atactgact // LOCUS NODE_593_length_41065_cov_5.63018841065 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 41065) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 41065) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..41065 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 83..1153 /locus_tag="DP116_04575" CDS 83..1153 /locus_tag="DP116_04575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205930.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aliphatic sulfonate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_04575" /translation="MLIQLLKTRILGIRQLLYSNFRQRKSRSLSVLFAVGLGLSLVVS ACSSNATKPEAVNSPTSKQNVTGSITVNIGFQKAATILNALKSKSSLDQAIAASGGTV KWTEFPAGLPMLEAMNAGSVDFGYTGESPPIFAQAAGNPLVYVAYDPWGPKAEAIIVQ KNSPIKSVAELKGKKVAFAKGSNTNYLVVKALESAGLKYSDIKPAFLTPADARAAFEG GNVDAWAIWDPYLAAAQEATGARTITDATNLAPNRGYYLARKSFVEEHPDVLKTILDE VSKVDKWAASNPAEVAKFLEPQLGIKAAALEVAEKRRKYGVLPLTEEVIAKQQEVADT FQKIKLLPKQIQVKEIVWKGNK" gene 1237..2925 /locus_tag="DP116_04580" CDS 1237..2925 /locus_tag="DP116_04580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009630489.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fumarate reductase/succinate dehydrogenase flavoprotein subunit" /protein_id="PRJNA477356:DP116_04580" /translation="MKTDVLVIGGGTAGTMAAIKAKQANPDAEVLILEKANIRRSGAI AMGMDGVNTAVIPGHSTPEKYVREVTLANDGILNQKAVYQTGKLGYEVIQELESWSVK FQKDAQGNYDLKQVHRVGKYVLPMPEGKDLKPILTRQVKRHKVKVTNRVMATRVLVGE KRAIGAVGFDVRNGDFIVIQAKAVILCTGACGRLGLPASGYLYGTYENPTNAGDGYSM AYHAGAELSNIECFQINPLIKDYNGPACAYVAGPFGAHTANAEGNRFISCDYWSGQMM LEIWKELNSGKGPVQLKMTHLDEDTISEIESILWSNERPSRERFHQGRGEDYRTHGVE MNISEIGLCSGHSASGVWVNEKAETTVPGLYAAGDMASVPHNYMIGAFVFGRLAGTHA IEYIQDLDHVEPEKDFLEQEKLRIYTPLTRPNGIPHTQVEYKLRRLVNDYLQPPKAGN KIEIGLRNFVYYQETLDLMGARDPHELMRCMEVHFIRDCAEMAARASLYRQESRWGLY HYRLDYPEKNDDEWFCHVNLKKDESGEMVLFKRPVEPYIVEVDSAKDVYDVAVR" gene 3109..3891 /locus_tag="DP116_04585" CDS 3109..3891 /locus_tag="DP116_04585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408946.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_04585" /translation="MPIYNSIGQQYSKTRVPDPRIINTLIDLLNLPKGSIIADIGAGT GSYSLALANQGFSVNAIEPSAVMQKQAVEHPQVNWFTGYAEDLPLADNSVDAVISILT IHHFSNLEKSFQEMHRVVRNGAIVLLTFDIRLAQKIWLYDYFPFLWQDALRFLPLNEL ANLIQASTGRHIETIPFLLPPDLSDLFAASAWKRPKLYLQQEVRAGISSFALADPNLV KQGVKLLAADLSSGEWDAKYADIENLTEIDVGYRFIRATLDN" gene 3923..4150 /locus_tag="DP116_04590" CDS 3923..4150 /locus_tag="DP116_04590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006453436.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferredoxin family protein" /protein_id="PRJNA477356:DP116_04590" /translation="MALINQRIDVPVIVDESKCLEKCNACIEVCPLDVLAKNPETGKA YMKYDECWFCLPCEKECPTNAITVQIPFLLR" gene 4205..5179 /locus_tag="DP116_04595" CDS 4205..5179 /locus_tag="DP116_04595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009343910.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_04595" /translation="MYNNTHNNSNDWIEMLRSPEVDDRLVAVKALQHLGEEEAIEPLI YALQDENLNVQKIAISALWEIANPVAVPALLKYLGSSNAEIRTEALSALNDLVSPTDL SLLLDSLSHNNIYLQLNILILLRKIHDIQSLPYIIQFFNSENADLREAAITTLRYLNQ VEKCPQALALISDYNVTVRRATALTLGYLQDAEVISILTHALTSDSDWQVRRNAAKSL AIHENDQAISALEIALGDEHWQVRKSAAQTLQKIPHIKVLPVLIQALTDEYADVRKEV AIALGNLGHPDAINPLQQSLDDLDKEVAIQSLRAIKKIQESIKSSTHD" gene 5172..6209 /locus_tag="DP116_04600" CDS 5172..6209 /locus_tag="DP116_04600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MRP family ATP-binding protein" /protein_id="PRJNA477356:DP116_04600" /translation="MTDEHLNSLTASRQQEVIRYLKTVIEPILKNNVVSLGMVRNLCI VDDYVYLRLYIGAHQQDFFKEIQYVLSNLSWCKKTYIQICTIPGVKVTLAISSGKGGV GKSTTAVNIAAALKLQGAKVGLLDADVYGPNIPQMLGLGQADIQVIHTPTGEKFLPLE IQGIKLMSVGLLAEPNRPLAWRGPVLHKIITQFLQDVEWGELDYLLIDLPPGTGDAQI TIVQESPICGVILVTTPQQVAISDVRRNIHMFRQVGVPVLGIVENMSYLICSDCGLRT PIFGSGGGEQLAAELQAPLLGQIPIDPRICSGGDTGHPIAVTNSTSAAGEVFVQIATA LLGTFCLMQSI" gene 6234..7427 /locus_tag="DP116_04605" CDS 6234..7427 /locus_tag="DP116_04605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013321679.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aminotransferase" /protein_id="PRJNA477356:DP116_04605" /translation="MTPVRKQKISDKANQFTESVIREMTRVALQYSAVNLAQGFPDFP CPPELKRAACEAIEEDVNQYAITWGDKAFRQAIAQKVHWYLGLNIDPERQITVTCGST EAMAAVMLATLNPGDEVIVFEPYYENYGPDAILASATPRYVSLHPPEWTFDEAQLRDC FNERTKAIIINTPHNPTGKVFTREELTLIAELCQKWDVLAFTDEIYEHILYDRTQHIA MATLPGMSERTVTINGLSKTYSVTGWRVGYILANPELTAAIRKVHDFLTVGAPAPLQR AGVAAMQLRVSYYEELAKLYHQKRDDILRILDAVGIPYFIPKGAYYVFADISSFGYKN DVEFTRFLIQEIGVAVVPGSSFFSQSEAGKNFIRFCFSKKPETLAKAGERLLKLQSTL QAAPT" gene complement(7420..7572) /locus_tag="DP116_04610" CDS complement(7420..7572) /locus_tag="DP116_04610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020710935.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04610" /translation="MQNYYNLVYREQEREIVLLSRAEAIGIIPLSPLARGFLAGNRYQ ANLASK" gene 8063..8944 /locus_tag="DP116_04615" CDS 8063..8944 /locus_tag="DP116_04615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HIT family protein" /protein_id="PRJNA477356:DP116_04615" /translation="MKKQKNLFSHLTAIERIHLSVPAQFLLDKNLLQDKVNNKNNIVL DFGCGLGNDVKLLRKKGFDVTGYDPYYFPQYPNEKFDTIICFYVLNVLFPEEQGDVLM RISNLLKPGGKAYYAVRRDLKREGFREHYIHKKPTYQCIVKLPFQSIYLDENCEIYEY MHYNLRTNASNNCNCIFCQPHKNLTILTESATAYAMYDGYPVNKGHVLVVPKRHVSNY FDLPFKEQSACWFMVNRVQEILGKEFQPDGFNVGMNINRDAGQNMMHASIHIIPRYKG DTVGAKGGIRYVIPKRK" gene 10260..10535 /locus_tag="DP116_04620" CDS 10260..10535 /locus_tag="DP116_04620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195182.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04620" /translation="MNKVHLAFLINYLLMTGYFFINWLRFCRSHPSYSPEEKFLSNVI LFITTVLWPIIVPMSFLKILTTRKVEFNTVIPLIVAVFAFSVALYMG" gene complement(10986..12251) /locus_tag="DP116_04625" CDS complement(10986..12251) /locus_tag="DP116_04625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138029.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_04625" /translation="MVSRKSYRNKMYKKPSDDNHESLNHNTAPWRKTAFHLSLVLLGS GMTFAGGYLASHSQQVTQNASNLAVSPVNAGVPIPSGGEPTSFVTKVVQRVGPAVVRI DASRTVKTQLPEQFRDPFFRQFFGSALPDSQQRVERGTGSGFIMSADGRILTNAHVVN GADTVKVTLKDGRTYQGKVMGRDELTDVAVVKIQADNLPTVTLGNSDQLQPGDWAIAI GNPLGLGNTVTTGIISATGRTSNQIGAPDKRVEYLQTDAAINPGNSGGPLINAQGQVI GMNTAIIQGAQGLGFAIPIKTAQRISNQLIATGKVQHPYLGIQMVGLTPELKQNINSD PNSGLSVDQDYGVLVVKVMPNSPAAKAGIRAGDVIQKLNGKAVTDANSVQNAVENSRV GGDVRMELHRNGQNLNLAVRPGAFPTQTQ" gene 12782..14905 /locus_tag="DP116_04630" CDS 12782..14905 /locus_tag="DP116_04630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115363.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfatase-modifying factor protein" /protein_id="PRJNA477356:DP116_04630" /translation="MGKLALLIGVSEYQPGLNPLPCAVKDVEAMRRVLTHPEIGNFAD DDIKVLKNPQGQEIQYAIENLFADRQKEDLLLFYFSGHGIKDESGRLYLSTSTTRKQN NRLFKASAVAASVLHENMNESRSQRQVIILDCCFSGAIAQGLTVKDDGTVNLQEQLGG KGRAILTSSTSTQYSFEQEGSDLSIYTRYLVEGMEKGAADRDGDGWISIDELHEYASD KVKEAAPAMTPKFYPIEEGHKILLAKSPKDDPKVKYRKEVETRAKQGHEFSVFARRIL DGKRDEWGLTPQEAAAIEEEVLQPYREYERKRHEYEQALIQAIDQEYPFSKTTQKDLK EYQQYLGLRDEDIASIEQRIITPKQAEYQRNLQQAQRLQQEQERTQQQKQQAELRELP ETQSSPVSQPKSPSVIQTNIQTDIQTQRFEFEYATIIVKSKFLGIEKTWEINRHRGRA EFFIQNLGNDALLEMVAIPGGQFLMGSPENEPERLANESPQHTVTIQPFHMGKFPVTQ AQWQAVAALPKVKIDLNPDPSSFKGANRPVEKVSWDNAIEFCARLANKTGKPYRLPSE AEWEYACRAGTTTPFHFGETITTDLANYNGNYIYASGPKGEYRGQTTEVGKFPSNAFG LYDMHGNIWEWCQDAWYESYEGAPADGTAWMSENDKNSRRLLRGSSWYGDPWRCRSGS RSRNARVNRVDDVGFRVVVARFRTS" gene 15104..15292 /locus_tag="DP116_04635" CDS 15104..15292 /locus_tag="DP116_04635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017718323.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04635" /translation="MLRGGSWNNNPRNCRSANRNRNARDNTNNNAGFRVVVARLSALL CQNWWMGIHRAYQRRVQT" gene complement(15402..15989) /locus_tag="DP116_04640" CDS complement(15402..15989) /locus_tag="DP116_04640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_04640" /translation="MILQTQKRHYTPEEYLELEEQAEYRSEYRDGEIIPMPGGTTNHN KIALNFCRKFPLTVQGQAYEIYIIDVKVWIPRYRLYTYPDVMVVKGEPIYEGTNTTTI TNPMLIVEVLSNSTKNYDKTDKFKYYRSIPQFQEYIMIDQYSFSVEQYVKKAEGEWTF KEHEGEDAVLALHSIDFQISFRDIYERVNFELSEE" gene complement(16341..17180) /locus_tag="DP116_04645" CDS complement(16341..17180) /locus_tag="DP116_04645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875155.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prephenate/arogenate dehydrogenase" /protein_id="PRJNA477356:DP116_04645" /translation="MNIGILGLGLIGGCLGLDLRSQGHHVLGVSRRESTCQRAIALGS TDEASVDMRLLTKADVVFICTPIGLIVPTLEQLIPHLASHTIVTDVGSVKTPIVQAIA PLWDNFIGGHPMAGRTDSGIEAAIPNLFEKKPYVLTPTQTTPPHAITIVEKIVHELGA TLYHCSPEEHDRAVSWISHLPVMVSSSLIAACMSETDTNVLELAQKFASSGFRDTSRV GGGNPELGLMMARYNRACLLNSLQQYRHNLDELIHFVEQEDWVALEQHIQSTQKARPK FVE" gene 17367..17960 /locus_tag="DP116_04650" CDS 17367..17960 /locus_tag="DP116_04650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875153.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04650" /translation="MRDTFNKMIGKTRYVVSRIMLHLSGSEVAPILGVLNRAAREAID TDGDIEILGEGLVEICQTLLQYDEYWLSAANEGDVFWSEGEAGDYVNELFTDSAQRYL SEPDFGSDSGYDQPLSIPVTRNVVVMITIACEGEVPDLETDLANITALKEGFKALINL HYKHKLRAIQVHFSPARLGDELTNDQLLQYYPELIPL" gene 18030..18917 /locus_tag="DP116_04655" CDS 18030..18917 /locus_tag="DP116_04655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995317.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04655" /translation="MFMTIVRKFTAVFLALSLCVTTVACGGGGSDKTTPQASNTTQTT TATTKLNDGQYPVQQASYNDADGEYTLLLLNATPPSFKSQNLQMARLTDDEIKQGKKS YLKVENGQSALYLTEDFKIDYVHNVTETKTNPQTGQQETVVVRQQNSFWAPFAGSVAG SLAGQAIGSMLFRPQHYVPPVYQPGGVLTGYGGYGSSYGDAVSSYRSRYNAAPAVERN RTAFRSTGTIRRSYPGDSSTLRRTTPNTGSRATGSGFGGSTLRPSGNSNVRRSPSGSG FGSGGRSRVPRSSGFGRRR" gene complement(19095..20153) /gene="aroF" /locus_tag="DP116_04660" CDS complement(19095..20153) /gene="aroF" /locus_tag="DP116_04660" /EC_number="2.5.1.54" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321134.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-deoxy-7-phosphoheptulonate synthase" /protein_id="PRJNA477356:DP116_04660" /translation="MIIVMKIGSPEAEIDRLSEELGTWGLTAEKIVGKHKVVIGLVGE TAVLDPLQIQEVSPWIEQVLRVEQPFKRASRQFRHGESSEVVVNTPNGPVPFGEHHPV VIVAGPCSVENEQMIVETARRVKAAGAHFLRGGAYKPRTSPYAFQGHGESALELLVKA REETGLGIITEVMDSGDLEKVAEVADVVQVGARNMQNFSLLKKVGAQSKPVLLKRGMA ATIEDWLMAAEYILAAGNPNVILCERGIRTFDRQYTRNTLDLSAVPVLRKLTHLPIMV DPSHGTGWATYVPSMCLGAIASGCDSLMIEVHPNPAKALSDGPQSLTPELFDRLMQEL AVIGKAVGRWQQPVVALA" gene complement(20410..20877) /locus_tag="DP116_04665" CDS complement(20410..20877) /locus_tag="DP116_04665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3464 domain-containing protein" /protein_id="PRJNA477356:DP116_04665" /translation="MSAESERSQLPFEPNKKRQKPAKGKGKQPETKQESGKKDEKKPP FTKEEMAIPQVVSQRMIRRVALFCGIPTALGISTLIASYFLLTYAGIKLAPIAVLLVN MGFFGLGVLGITYGVLSASWDEHRVGGWLGWNEFTTNGERMIAAWRETRQKNV" gene complement(20890..21159) /locus_tag="DP116_04670" CDS complement(20890..21159) /locus_tag="DP116_04670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S15" /protein_id="PRJNA477356:DP116_04670" /translation="MALTQQRKQEILTQYQVHETDTGSADVQVAMLTARINRLSEHLQ ANKKDHSSRRGLLKLIGQRKRLLSYILEESRERYQALIGRLGIRG" gene complement(21359..23029) /locus_tag="DP116_04675" CDS complement(21359..23029) /locus_tag="DP116_04675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182231.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-amylase" /protein_id="PRJNA477356:DP116_04675" /translation="MATLTVFNLFAPRNQKAALIGSFSQWQEIPMEKGEDGYFRTQIE LEDDIYQYKFRIQTKSPQFEPEQWIDVIDPYATEVDEKLKVGVVRIKNGKKIVDTYVW QHDDAPLPENCELVIYEMHVADFTGGEVDPEKRGKYIDAIEKLDYLHELGINAIELMP VNEYPGDYNWGYKVRHYFATESSYGSTEDLKRFVDECHARGIRVFLDGIYNHTDEECP LMLIDRNYWYYEYMHYPEDPGNYWGPEFNYDNYDEKLDVKPAWKYVGDVVRFWVQEYH IDGIRFDAVRQLANYEFLDWVAKQAKKNTAPKPFYNIAENIPDTSKVTTPEGPLDACW HESFRYFVIPHICGEMFEPQKLKEVFDPRKQGYKTSINVVNYLATHDREHIFRELGDR GIFDEDAFRRAKLAAALLLTAMGIPMLWMGEEFGEHKRKSETVTQPKKIAWPLLERDE NRDLFEYYKKLIALRKKNSALQSDHIEFFHENLDEKVLAYVRGQKEDSRVVVIANFSD KNLSRYHVPRFPCAGTWHEWTGNYDVEAGEDGIRIDIESFQAKILVRQ" gene complement(23301..24143) /locus_tag="DP116_04680" CDS complement(23301..24143) /locus_tag="DP116_04680" /EC_number="3.4.15.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318530.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyanophycinase" /protein_id="PRJNA477356:DP116_04680" /translation="MVASEIKRQLVIIGGAEDKEGDCQILREFVRYAGGTKARIVIMT AATELPREVGQNYIRVFERLGAEDVRILDTESREDASSSTALEAVSKATGIFFTGGDQ GRITSVLKDTELDAAIHQRFSEGIVVGGTSAGAAVMPDVMIVEGDSETNPRMEIVDLG PGLAFLPGVVIDQHFSQRGRLGRLVAALAQQPVVLGFGIDENTAMVVTDDQFEVVGQG CVTVVDDSEVVHSNVDEILKDEPLAVCGAKLHILPHGYKFDLKTRKPILDSRTVTTVP APAG" gene complement(24641..24985) /locus_tag="DP116_04685" CDS complement(24641..24985) /locus_tag="DP116_04685" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04685" /translation="MVFIPRLFLNASACPPGIFYALWLTPRLEAERARCANGDAESQS DTLRERERQMLYLGSHPAASFFMSLGWTTRQQCLLVKPEIWMYFCGESSAVRGFPPVE ATGEPEGHYQKI" gene complement(24998..25150) /locus_tag="DP116_04690" CDS complement(24998..25150) /locus_tag="DP116_04690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_04690" /translation="MGKPTNFITFRVNDGEKEILRNYCEKLGRTQTDVLRELIRNLQK ENITLD" gene complement(25512..25739) /locus_tag="DP116_04695" CDS complement(25512..25739) /locus_tag="DP116_04695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318532.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04695" /translation="MSNAKAVLRKEVQHLAEEAFHHKLISGYGDGPDTNEYQIVFQGK PRHFALEEARSFLCELLFETQVDQHSEELSI" gene 26123..28033 /locus_tag="DP116_04700" CDS 26123..28033 /locus_tag="DP116_04700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875098.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cyanophycin synthetase" /protein_id="PRJNA477356:DP116_04700" /translation="MVQDKITDTVRVNARRTDAFDIFKFKHYIGPNPYLDAGALVFDF AVTEFTRPLPIEDYVSIISDRYPHLRDQTFDSYAHLFARTVSEVGKLDMGLHLDRWSI KPYENYATISIQSLHERTTRGVLYFVWDWFEAITQHKDITFEEQLLSLQSKFRQSVYG GPTVYALLRTADTKGIPTFYLWDEGLTQYGYGKKQVRGIATTFDSDSHLDSDFTTRKD DCKAFLKTLGFPVPKGDIVNSEKQALAVARDINYPVAVKPVAGHKGIGVTAEVRDEYE LKSAFRRALQAIPENEPTRIIVEKSIVGSDFRLLCVNGRFVAAIERRPAWVVGDGKST IEELIRDENRKPGRLDTPTSPMTKIQCDEAMEQYLEQQGLSLDSVIEKDRNVSLRKVA NLSAGGVSIDATRTVHPDNIILAQDIAQHFRLVCLGIDVIAPNLSESWKSNDFSILEI NAAPGILMHLNPSIGESVDVPSHILETFYKLGTDARIPIITFNHISVHEIQETIDHIL LQHPDWKIGAVCRDGVFINRSAKILSEDYNTNILSLLRNPQLDILIAEYKEDILEKEG MFYYGSNMVVLNNPTETEMTLARDVFDDSTVVIKNGDNISIRRKGLIEEYTLGVDEPF TRVYLKEIGTVL" gene 28224..28916 /locus_tag="DP116_04705" CDS 28224..28916 /locus_tag="DP116_04705" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04705" /translation="MSDISWLRLSLHYGYIDAQGEVYKRVLEELGKAIEKSTSLANEL ETNQKLLPQEKDSIIDDGCLLIEYLLGTAFVLCQAYIVDVVSTVEKIHVRAKQSLGRD LKTIPVFNEKSKISRKDILRFGQTLPFSYEGDNFSPIQLINAFANYFKHRDEWDVNWE KLEGTQKDTAKVIQSVGAKFGCSGNLRQGSSALGNPEFTNTLIFFEKLQQWHIDLASA YDKELSSYDTVF" gene complement(28966..29691) /locus_tag="DP116_04710" CDS complement(28966..29691) /locus_tag="DP116_04710" /inference="COORDINATES: protein motif:HMM:PF01476.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04710" /translation="MFYTVKDGDTLPKIAEKFYGVRSRWQQIYDANPEVIILIPGVKL FIPLPQTLYDKELEKNEKIKTYAQLWQSKNMTKSKHSIKINFVRLGAICGSLLIGLSS IPIPIFLAQSAQAQQTPKTSNDSVIPPMPEKLQTPSAKVVPVNSKVSVKLTNQTNAVL TYEVIGHTKQRTLSGKSTVTLKDLPVAVAISFRRQDKGLLTVHLQGETAPGLLEVKLD EGNNFGEDKIAMKIEKTGEVFLK" gene complement(30370..33102) /locus_tag="DP116_04715" CDS complement(30370..33102) /locus_tag="DP116_04715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cation-transporting P-type ATPase" /protein_id="PRJNA477356:DP116_04715" /translation="MQDKLPQSWHSLDIEETIDLLETDQAQGLNSIQVKERIARFGFN ELTGKKGKPWWLKFLLQYNQPLLYILLIAGATKAIIGEFVNAFVIWGVTTTNAIIGYV QESKAEGAIAALAKSITTEATVIREGQKLRIPSRELVPGDVVLLTSGDKVPADLRLIK VRNLQVDESALTGESVAVEKEAGVETPPLPPDIPLAERKNMAYAGGFVTFGQATGIVV GTGNTTETGRISQLLENKVDLSTPLTRKFDKFSKNWLYMVLGVATLTLMAGLQTKPWR DAIEATVALIVGSIPEGLPAVVTVTLAIGVSRMARRHAIVRKLPAVETLGSATVICSD KTGTLTENQMTVQAIYAGGHQYAVTGVGYSPEGEIVQDGKPIDLSGDIGLQECLSAGL LCNDSHLETKNGKWVVFGDPTEGALITSAHKLGLIKPLLEQQMPRLDGIPFESEFQYM ATLHGTPTGKTIYVKGSVEAIAKRCTFMLNSKGQLKRIDCQETLNTSSIEREVNIMAR QGLRVLALAKKLVPNEQTTVDHVDIDTGLIFLGLQGMIDPPRESAMKAVKACQEAGVQ VKMITGDHAVTAQAIATRMGINKNGSVLAFSGAELAQMDYQELAQVAEDGVVFARVAP EQKLRLVEALQSKGEIVAMTGDGVNDAPALKQADIGIAMGMAGTEVAKEAADMLLTDD NFASIKAAVEEGRAVYKNLVKAMCFILPVNGGESMTILFSTLVGRELPILSLQILWLN VINSITMTVPLAFEPKPQNVMQQPPRRPNESLLSGSRIKRILAISLFNWIVIFGVFEY IRQTTGDINLARTMAINSLIAGRIFYLLSLSQLVPNLIAKMDGTIQENVDIPAIGFGI IGAIILQLCFSYVPLINEFFFTVPLRFDQWLFCLAVGLPMIPWAAFVNRFDPPN" gene complement(33182..33520) /locus_tag="DP116_04720" CDS complement(33182..33520) /locus_tag="DP116_04720" /inference="COORDINATES: protein motif:HMM:PF03413.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04720" /translation="MGIKQTIIAACGVVALILVGSWFFRIRPVQASPDNTAPAVTMVE AIETVLAANPGTAAIDVNLEHENNNLVWEIELNNDLEVYIDANTKEIVKTDQGWNLTD VPLLANWIPN" gene complement(34480..35892) /locus_tag="DP116_04725" CDS complement(34480..35892) /locus_tag="DP116_04725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749100.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_04725" /translation="MISLKWLNHIPVRIKLTAWYLLLLGLTLGGFTGYLYFRLERKII NQADTALQIAGSQSLVYLSDKNNALAFVDKPSRQNTVQRLIEAGFAVRLMTPQGKIVD GFGKYQEVPLWIPSASGYTRVARNKADWRLISQPVIRQGQIIGWLQIAKSLEALEEIS DKLSAELLFLLPFILIIAGCGGLFLSSRALQPISQITQTAQAISAIDLAQRLNYKGAK DEVGQLATTFDQMLERLQAGFEREQRFTADAAHELRTPLTVIKGRIDVTRSRERTPDE YEQTLQDLEQEVDRLIRLSNGLLLLARIDRGQLPFEPLPVDLSNLLEVIVEQVQHAAE SQQIKLLNNLTPDLWVQGDPDQLTSLFLNLVDNAVKYTPQGGVVWVRSNLHSNVVQVM IINTGYGISKEHLPHLFERFYRADSARFQGKSGAGLGLAIAHEIARLHGGTITADSIP NKETTFTVTLPIAQHSGLFR" gene complement(35889..36569) /locus_tag="DP116_04730" CDS complement(35889..36569) /locus_tag="DP116_04730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749101.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_04730" /translation="MRVLLVEDEPGIAQFISQGLKEAGYATDIATDGQEGLDYAASAE YDIIVLDIMLPQMDGLQVLRKLRSQGLKTPVLLLTARDAVEDRVRGLDAGADDYLFKP FALSELLARLRALLRRPPMQQDTILRVGDLEMDVATREVRRAGKSINLSPREFTLLQY LMRHPRQVLSRNQITEHTWNFDFYENSNVIDVYIGYLRRKIDHGFDKPLLHTVRGVGY CLTADAES" gene complement(37330..38988) /locus_tag="DP116_04735" CDS complement(37330..38988) /locus_tag="DP116_04735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995049.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha-amylase" /protein_id="PRJNA477356:DP116_04735" /translation="MAKPIEFNLFAPYNKAASLIGSFSDWEPIPMEKGDDGYFRTTVE LEDGAYQYKFRVQSKSWFFEPDQWVDVTDPYATDIDELSGKDDGVVYIKDGERIVDTY VWQHDDKPLPADHELVIYELHVGDFSGGEDDPYARGKYKHVVEKLDYLSELGINAVEL MPVKEYPGDYSWGYNPRHFFAPESSYGPTSGLKNLVDECHARGIRVIMDGIYNHSESS SPLTQIDHDYWYHHEPRDPDNNWGPEFNYEHYDKNLDTYPARKFIGDTVRYWVGEYHL DGIRYDAARQIANYDFMHWIVQEAKDTASMKPFYNIAEHIPETTSITNVDGPMDGCWH DSFYHCIKDHICGNTFDLERLKDVIDAKRQGFMGATNVVNYLTNHDHDRLMVELGNNN IFDEEAFKRLKLGAAILLTAMGIPMIWMGQEFGEYKPKTQESSKIEWGLLGNDLNRSL FEYYKGLINLRKNNHALYTENVDFIHENPETKVLAYSRWNDEGSRVVVVANFSENFLA GYQIPNFPSPGTWHEWTGNYDVEAGDDGIMIDIGPYEAKVFVWQ" gene 39114..39347 /locus_tag="DP116_04740" CDS 39114..39347 /locus_tag="DP116_04740" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04740" /translation="MNSASDEYFLISKQTSQSSSRDNQILFTSISRSNFYKKYILGDL LINYKILMIHLKYLEKISQKLVNQKNIKIVSEE" gene complement(39396..40325) /gene="murQ" /locus_tag="DP116_04745" CDS complement(39396..40325) /gene="murQ" /locus_tag="DP116_04745" /EC_number="4.2.1.126" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412247.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N-acetylmuramic acid 6-phosphate etherase" /protein_id="PRJNA477356:DP116_04745" /translation="MTTNLQGRGHLLTEQVNPDSLDLDQLSSLELVELFNREDAKAVA AVAAAKLQLAEAIERTAECLRHGGRLFYIGAGTSGRLGVLDAAECPPTFCTPPEMVQG IIAGGAGALVRSSEDLEDSAEDGEAAIAQRQITQLDVVVGITAGGTTPFVHGAMSAAR QRGATTIFIACVPVEQVECEADIDIRLLTGPEILAGSTRLKAGTVTKLALNILSTGVM VKLGKVYGNRMVDVAVTNQKLRDRAMRILQDLTGLSREAAGFLLERSGKWVKLALLMH WTGLEKEEGLQLLSQYQGNLRAAVASYKNNEQA" gene complement(40437..40859) /locus_tag="DP116_04750" CDS complement(40437..40859) /locus_tag="DP116_04750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3110 domain-containing protein" /protein_id="PRJNA477356:DP116_04750" /translation="MITPMRVFVLLFNARTENEGIHTIQVGERNKVLMFESEDDATRF ALMLEAQDFLSPTVEAIDSEEVKEFCQAADYDWELVQNDSALAIPPEINVEETDWKLE SQEDVAEETLPRNQESQEKPELSDSELDSIRRKLEGLL" BASE COUNT 12013 a 8746 c 8393 g 11913 t ORIGIN 1 actattaatt aagtgtatgt ttggaaaaat atagtgttat cagcagtaag acttttccat 61 aaatccttta ttctatgcat ctatgttgat tcaactctta aaaactagga ttttgggtat 121 tagacaactt ttatactcaa atttccgaca acggaaaagc cgttccttgt ctgtgctatt 181 tgccgtgggg ttgggcttaa gcctagttgt gtctgcctgt tcttccaatg ccactaaacc 241 agaagcagtc aattcaccta ctagcaagca aaatgtgact ggcagtatta cagttaatat 301 tggctttcaa aaggcggcaa ctatcctcaa tgccctaaag agcaaaagca gcctagacca 361 agcaatagca gcttctggtg gaactgtaaa atggactgaa ttccctgcag gtctacccat 421 gctggaagcc atgaacgcgg gtagtgttga ctttggttac acaggtgaat caccgcccat 481 ctttgcccaa gccgcaggta atcctttagt ttacgtcgcc tacgatcctt ggggaccgaa 541 ggcagaagcg attatagtac agaaaaattc accaataaaa agtgtggctg aacttaaagg 601 caaaaaagtt gcctttgcca aaggttcaaa taccaactat ctcgttgtca aagccttaga 661 atctgccgga ttaaagtaca gtgacatcaa gccagccttt ctcacaccag cagatgctcg 721 tgctgccttt gaaggtggta acgttgatgc ctgggcaatt tgggacccct atctagcagc 781 agcccaagaa gcaacagggg ctagaaccat aacagatgca acaaatttag caccaaatcg 841 cggttattac cttgcccgca aatcatttgt ggaagaacat cctgatgttt tgaaaacaat 901 tttggatgaa gtcagtaaag tagacaaatg ggcagcaagt aatccagcag aagttgcgaa 961 gtttttagaa ccacaattag gcatcaaagc agcggcgctg gaagttgcag aaaaacggcg 1021 aaagtatggt gttttaccct tgacagaaga agtcattgct aaacaacaag aagttgctga 1081 tactttccaa aaaatcaagc tgcttcccaa gcaaatccag gtaaaggaaa tcgtttggaa 1141 aggcaataag taaaggatga aaaatcgttt ttaatttcat actgttttgt attgggactt 1201 tcaaacgcta tggtgacgaa tgcgactacc cagtggataa aaactgatgt gctggttatt 1261 ggtggcggga cagcagggac aatggctgct attaaagcca agcaagcaaa tcctgatgct 1321 gaggtgctga ttttagaaaa ggctaacatt cgtcggagtg gggcgatcgc aatgggtatg 1381 gatggcgtta atactgctgt cattcccggt cattccaccc cagaaaaata 
cgttcgtgaa 1441 gtcactcttg ccaacgatgg tattctcaat caaaaagccg tttatcaaac aggcaaatta 1501 ggctacgaag tcatccaaga actagaaagc tggagtgtga aatttcagaa agatgctcaa 1561 ggtaactacg acttaaaaca agtgcatcgt gttgggaaat atgttctgcc catgccagaa 1621 ggcaaagacc tcaagccaat tctgacccga caagtcaaac gccacaaagt caaagtcacc 1681 aatcgcgtca tggcaacccg cgttttagta ggagaaaaac gtgctattgg tgctgtagga 1741 tttgatgtta gaaacggtga ttttattgtc attcaagcca aagcagtcat cctatgtaca 1801 ggagcatgtg gacgtttagg attacctgct tctggttatc tctacggcac ttatgaaaat 1861 cctactaatg ccggagatgg ctactcaatg gcttatcatg ctggtgcaga actcagcaac 1921 attgagtgct ttcaaatcaa tcctttaatt aaagattata acggtcctgc ttgcgcctat 1981 gtagctggac ctttcggtgc tcatactgct aacgccgaag gaaatcgctt catcagttgc 2041 gactattgga gtggacaaat gatgctggaa atctggaaag aattaaactc cggcaaggga 2101 cctgttcaac tcaaaatgac ccaccttgat gaagatacga tttctgaaat tgaatccata 2161 ctttggtcaa atgaacgacc aagccgcgaa cgctttcatc aaggtagagg cgaagactat 2221 cgcacccacg gcgttgaaat gaacatatca gaaattggct tgtgtagcgg acatagtgct 2281 tctggcgttt gggtgaatga aaaagctgaa acaactgtcc caggtttgta tgccgctgga 2341 gacatggcaa gcgttcctca taattatatg attggggcat ttgtcttcgg tcgtctagca 2401 ggaactcatg ccattgagta tatccaggat ttagatcatg tcgagccaga gaaagatttt 2461 ttagaacaag aaaaattgcg aatttataca cctttaactc gaccaaatgg tattcctcac 2521 acccaggtgg aatataaatt acggcgctta gttaatgatt atctgcaacc accgaaagcg 2581 ggaaacaaaa tagagattgg cttgagaaat tttgtctact atcaagaaac attagattta 2641 atgggtgctc gcgatcccca tgaattgatg cgctgtatgg aagttcattt tattcgcgac 2701 tgtgcagaaa tggcagccag agcatcatta tatcgtcaag aaagtcgctg gggtctttat 2761 cattaccgat tagattatcc agaaaagaac gatgatgagt ggttttgtca tgtcaattta 2821 aagaaagatg aatcagggga aatggtgctg ttcaagcgtc cagttgaacc ttatattgtg 2881 gaggttgatt cagcaaaaga tgtttatgat gtcgctgtga ggtaggatat tcaaaatgaa 2941 tccgaagcaa gtcttacaga aagtgagaca attctcacac acattctaaa tcaagactct 3001 ttatgccata gctttcatat cagaccaaga aggcttgctg ctggtagaga atcatttatt 3061 aagagattgc ttgaagagtt aagtcatcta aatttcaaca ttctaccaat gcctatctac 
3121 aattcaattg gtcaacaata ctccaaaacc cgtgtccctg atcctcgcat tattaataca 3181 ttaattgacc tactcaattt acccaaaggt agcattatcg ctgatattgg agcggggact 3241 ggtagttaca gtctagcgct tgctaaccaa ggattttctg tgaatgctat tgaaccttct 3301 gcggtgatgc aaaaacaagc agtagaacat ccacaagtta actggttcac tggctatgca 3361 gaagatttac ctttagcaga taattctgtt gatgcagtca tcagtatcct cacaattcat 3421 catttttcta acttagaaaa atcatttcaa gaaatgcatc gagtagtcag gaacggagca 3481 atagttttgc tgacatttga tatcagatta gctcagaaga tatggcttta tgattacttt 3541 ccatttttgt ggcaagatgc tctacgattt ttaccactta acgaacttgc taacctaatt 3601 caagctagta ccggaagaca tattgaaact ataccttttt tgttgcctcc tgatttgtct 3661 gatttatttg cagcttcggc ttggaagcgt cctaagttat atcttcaaca agaagtacgt 3721 gctggaatat catctttcgc tttagcagat ccgaatttag tcaagcaagg agtgaaatta 3781 ctcgcagcag atttaagtag tggggaatgg gatgcaaagt atgctgatat tgagaattta 3841 acagaaattg atgtgggtta tcgttttatc cgtgctacgc ttgataacta aaagacgaga 3901 tagtaaaaga gagttatata ctatggcttt aataaatcaa agaattgatg tccctgttat 3961 cgtcgatgaa tcaaaatgtt tagaaaaatg caacgcttgt attgaagttt gtccactcga 4021 tgtactcgca aaaaacccag aaacaggaaa agcctacatg aaatatgacg agtgctggtt 4081 ctgtctacct tgcgaaaaag aatgtccaac caatgccatt actgtacaga ttccattctt 4141 attgcgctaa tttatttgta aaaattatta ctctgtgctc tgtgcgtctc tgcggtttta 4201 aaaaatgtat aataacactc ataataactc caatgattgg atcgaaatgc tgcgatcgcc 4261 agaagttgat gaccgcttag ttgctgtcaa agctttacaa catcttggtg aagaagaagc 4321 aatagaaccc ctaatttatg ctctacaaga tgaaaacttg aatgttcaaa aaatagccat 4381 ctctgctctt tgggaaattg cgaaccctgt tgctgtacca gctttactaa aataccttgg 4441 ttcatcaaat gcagaaattc gtacggaagc attatcagct ttaaatgact tagtttcacc 4501 tacagattta tcattgttgc tagatagtct cagtcataac aatatctacc tacaattgaa 4561 tatcctaatt cttctccgca aaattcatga tattcagtct ttaccatata ttatacaatt 4621 ttttaattca gaaaatgcag atttaagaga agccgcaatc acaacactcc gctatctcaa 4681 tcaagttgaa aaatgtcccc aagctttagc gttaatatct gattataatg tgactgtgcg 4741 ccgcgctact gctttaactt tagggtattt acaagacgca gaagttattt caatacttac 4801 
tcacgcactc acaagtgaca gcgattggca agttcgtcgc aatgcagcaa aatctcttgc 4861 tattcatgaa aatgaccaag caatttcagc attagaaatc gctttaggtg atgagcattg 4921 gcaagtacgg aaatcagcag cacagacttt acaaaaaata ccacatataa aagttctgcc 4981 agtgttaatt caagcattaa cagatgaata tgcagatgta cgaaaagaag ttgcgatcgc 5041 tcttggtaat ttaggtcatc ctgatgcgat aaatccactc cagcaatcat tggatgacct 5101 agataaagaa gttgctatcc aatcactcag agcaattaaa aaaattcaag aatcgataaa 5161 atcatcgact catgactgat gaacacctca attccttgac agcatctcgt caacaggaag 5221 tgattcgcta cctgaaaact gtcattgaac caatccttaa aaataatgtt gttagcttgg 5281 ggatggtgcg gaatttatgc atagttgatg actatgttta cctgcgcctt tatattggag 5341 cacatcagca ggatttcttc aaagaaattc aatatgttct atcaaattta agttggtgca 5401 agaagactta tattcaaatt tgtacaattc ccggagtcaa agtaaccctt gctatttcta 5461 gtggcaaagg tggtgtaggt aaatcgacaa cagccgtgaa tatagccgca gctttaaaat 5521 tacaaggtgc aaaagttggt ttgctagatg ctgatgtcta cggtcctaat attccacaaa 5581 tgttgggttt aggacaagct gatattcaag ttattcatac tcccaccggt gagaaatttt 5641 tacccctaga aatccaggga attaaactca tgtcagtggg tttactcgca gaaccaaatc 5701 gtcctttagc atggcgagga ccagtattac acaagattat cactcaattt ctgcaagatg 5761 tagaatgggg cgaattggat tatttattga ttgatttgcc gccaggaaca ggggatgctc 5821 aaattacaat tgtacaagaa agcccaattt gtggagttat tttggtgacg actccccaac 5881 aggttgctat ttccgatgtg cgccgtaaca tacatatgtt tcgccaagtg ggtgttcctg 5941 ttcttggcat tgtggaaaat atgagttatt taatctgtag tgattgtggc ttacgcacac 6001 caatttttgg tagtggtggt ggtgaacaac ttgcagcaga attgcaagca cctttactgg 6061 gacagattcc catagatccc cgtatctgta gcggtggtga tacaggacat cccattgcag 6121 taaccaactc cacctctgct gcaggtgagg tttttgtgca aattgcgact gcactgttag 6181 ggactttttg cctgatgcag tcgatttaat aacactatat agaggatata aatgtgactc 6241 cggtaagaaa gcagaaaatt tcagataaag caaaccagtt cacagagtca gtcattcgag 6301 aaatgactag agttgcctta caatatagtg cggtgaatct ggcgcaagga tttcctgatt 6361 ttccttgtcc gcctgagtta aaacgggcag cttgtgaggc aattgaggaa gacgtgaatc 6421 agtatgctat cacatgggga gataaagctt tccgtcaggc gatcgcccaa aaagtccatt 6481 ggtatttagg 
cttgaatatt gatcctgaac gacaaatcac cgtcacctgt ggttcaacag 6541 aagcaatggc tgcagtgatg ctagcaactc tgaatccggg tgatgaagtg atagtgtttg 6601 agccatatta cgaaaactat ggtcccgatg caattttagc cagtgcaact cctcgctatg 6661 tgtcgctgca tccacctgag tggacatttg atgaagcaca actgcgcgac tgctttaatg 6721 aacgtacaaa agccattatc attaacacgc cccataaccc aactggcaaa gtctttactc 6781 gtgaagaact aaccctcatt gctgaacttt gtcagaagtg ggatgtatta gcattcacag 6841 atgaaatcta cgaacacatt ctttatgaca gaacacagca tattgcaatg gcgactttgc 6901 caggaatgtc tgagagaact gtcactatca atggtctttc caaaacctac agtgtgactg 6961 gttggcgagt cggctatatt ctagcaaatc cagaattaac agcagcaatt cgcaaagttc 7021 atgattttct taccgttggc gcaccagcac ccttgcaaag ggctggagtc gcggcgatgc 7081 aacttagagt tagctactac gaagaattgg caaagcttta ccaccaaaag cgagacgata 7141 ttctgcggat tttggatgca gtgggaattc cctattttat tccaaaagga gcttactacg 7201 tgtttgctga tatctcgtct tttggttaca aaaatgatgt ggaatttact agattcttaa 7261 ttcaagagat aggtgttgcc gtagttcccg gttccagctt tttctcgcaa tcagaggcgg 7321 gtaaaaattt tatcagattc tgctttagta agaaaccaga gactttggca aaggcagggg 7381 aacgtttact gaaattgcaa tcaactctac aagctgcacc tacttagaag ctaggttggc 7441 ttgataacga ttgcccgcta aaaagcctcg cgctaaagga ctcaagggaa taatcccaat 7501 cgcctcagcg cgagataata acactatctc tcgctcttgt tcccgataaa ctaggttgta 7561 gtaattttgc atggaaacaa agcgactcta aatagattca cgcagatttc agcctctggc 7621 aaagtgcttt tatcaaacat gtagattgga taactggctt agtaaataag tccaactggc 7681 aatgagacaa gcagcatcgt cacttgccac tgctttttag tatagatagc ataagtgtta 7741 gtgctgatgc aattatcatt actttagtat taaatagcac tataataacc aaagtcgtat 7801 aacaatagct aatactctta ccaatcaaca gcaggttgca cacttcgcga aacccgacgc 7861 caatcgcctc gacggaacgc gcttgttgca tgcacattcg agggctttgc ggtgagacag 7921 cgagttaact tgtgcaaact agcttgaaaa aggcagaggg caagctatcc agaacgaaag 7981 aaaaaccatt attctttttt ctttccttct ggtgacgaat tcgtcatgtc ggctacttaa 8041 tcgaaagaaa atttatgttt ccatgaaaaa gcaaaaaaat ctattcagcc accttacagc 8101 aatagaaaga attcacctat cagtacctgc acagtttttg ttagataaga atctacttca 8161 agataaagtc aacaacaaaa 
acaacattgt cctagatttt ggctgcggct tgggcaatga 8221 tgttaaatta ttgcgaaaaa aaggatttga tgtcacaggt tacgatcctt attatttccc 8281 gcaataccct aatgaaaaat ttgacacaat aatttgcttt tatgttttaa acgttttatt 8341 tcctgaagaa caaggggatg ttctcatgag aatatcgaac ctattgaagc ctggaggaaa 8401 agcgtattat gcagttagaa gggatctaaa aagagaaggc tttcgagaac attatatcca 8461 caaaaaacct acatatcagt gcattgtcaa acttcctttt caatctatct acttagatga 8521 gaattgcgaa atttacgagt atatgcatta taacttacga acaaacgcat ctaataattg 8581 taattgtata ttttgtcaac ctcataaaaa tttaactatc ctaaccgaat cagctactgc 8641 ctatgctatg tatgatggtt atcctgtgaa taaaggtcat gttttggttg ttccaaaacg 8701 tcacgtgagt aattattttg acctaccgtt taaagagcaa tctgcttgtt ggtttatggt 8761 taacagagtc caagaaattc tcggcaaaga atttcaaccc gatggtttca atgtgggaat 8821 gaacatcaat cgagatgcag gtcaaaatat gatgcacgcc agtatccata ttatccctcg 8881 ttacaaaggt gatactgttg gtgctaaagg tgggataaga tatgtgattc ctaaaagaaa 8941 atagtttgta atttatcggt taatataagc agaaccgcaa gaatattatc aatagataac 9001 ttgaaaaaat aagtggtcac cagaaggtag cgccagggtt tgccagcgct gccttcgtgt 9061 tacccaaagg ctggtaactg actctcaaag ttgggctagc agatttttta gtcctttaat 9121 aaagcggcat ttcaagcact gcaatcttta gtcgagcatt aagttaagac tttagttgca 9181 ctaaatatac aagctattga cacaatgggg tcatacctgt gtcatatcca cttggtactt 9241 taaaagtagc caacactaag gtttttgtca agaatactga gttctagctt cagaatttct 9301 ctgagaaact ttagtcaata atactgttac ttagttgcac attatgttcc cttttttctt 9361 tgctgtcttt ttggtaagct tactaacttt atttggtgga gctgcaatag cttacctgtt 9421 cccagagaat aattctcaaa attaagggat actccacaaa tcattcaaaa tcgcgtagcg 9481 ttgcgtcagc gttgcgacgc aatctcaaaa ttcaaaaatt gaaaaattta gaacagaact 9541 actctactta tgttatcaag caaataaaca tttaaaaact gtctttttgg gtactttagc 9601 ttaggcacga ttgacccata gtccaagtcc ttgtgtagca gaaatttctc tttccctgcg 9661 ttgtattgat aacgcagggt ttctttttta ttgggatatg tagtactatt atcaagaatt 9721 attcaacttt ttcatcattc agtcttagta taaagaaaat gcataactgc tgatgtccat 9781 aaaaaaataa ctgaaattat acttgaattg attctatctc actgtaaata tcttgagatt 9841 tcaaactgtt actctacttc tgaagtggta 
gaaattagct tagtataagc ttcgttaaag 9901 ttaacttatg atgtatgaat tttatttttt aaaattttaa tatatacact atataaaagg 9961 tggaatagtt tatgattaat cagtaaattt tctaacctct ggagacacaa gggtaagata 10021 gttaaaacat ttccgtaaaa tttctattat cttcaatttt ttattaatta ggtgtatgta 10081 ctgattggga aaattttacg gttcgattaa catttagcta aattatagaa gaattttatg 10141 tatgtgcttc accaacagac agccgtttta caaacctcta gtttgtaccg tttaagttgt 10201 gtcataaatg aatctaagtt atgttttctg aatctttgat aactaaagga gttgctacag 10261 tgaataaggt gcacttggct tttctgatta actacctact gatgactggc tattttttca 10321 ttaattggtt aagattttgt cgaagtcatc ctagttattc gccagaagaa aaatttttgt 10381 ctaatgtgat acttttcatc actactgttt tatggcccat cattgttcct atgtcttttc 10441 taaaaatatt gacgacacgg aaagtagagt ttaatactgt aattcctctt atcgtcgcag 10501 tctttgcgtt cagtgttgca ctttatatgg gttagctact tttatggcat tgaactgtgc 10561 ttttgcgtgt cataattgtt atcaataaag caaactgcca ttttgcatct accactacat 10621 ctcaaaaaaa tgcagtttgc atataatgct tgctattcac ttaataatga taagaactat 10681 cgaggtattt tctagtaaac tatggtttga ttgctagctt aaactgataa caactatact 10741 gaaatttatc agatatctag tacgggttgg cataagtgag cctaacatct ggagtttaga 10801 ctatgagcag gcgatcgccc cccagtcaca ctggatgtag tgagtgtaaa aaaatcaaag 10861 ctagcgctta ccacaacatc tctgttggac ttttaaggct gcggtagtaa ctgaagttac 10921 agagataaaa aatagggatg agcatcttgc tcatcccgca cataagttat tttaaagtaa 10981 aaggcttatt gcgtttgagt aggaaaggca cccggacgca cagctaagtt gagattttgc 11041 ccgttgcgat gcaattccat acgtacatct cctccaactc ggctattttc tactgcattt 11101 tgaacagagt ttgcgtctgt cactgcttta ccgttaagtt tttggatgac atcaccagca 11161 cgtatccctg ctttagctgc tggtgaattg ggcataactt taacaactaa aacgccataa 11221 tcttgatcaa cactcaaacc gctattgggg tcggagttga tgttttgttt caactcaggc 11281 gttaatccta ccatctgaat tcctaaataa ggatgttgta ctttacctgt tgctatcagt 11341 tgattggaaa ttcgttgcgc tgttttgata ggaattgcaa agcccaatcc ttgtgctcct 11401 tggattatgg ctgtgttcat ccctatgact tgtccttggg cattaattag aggtccgcca 11461 gagttaccgg ggttaattgc tgcgtctgtt tgaagatatt ctactcgctt gtcgggagca 11521 ccaatctgat 
tgctagtgcg accggttgca ctgatgattc cagtagtaac ggtgttaccc 11581 aaaccaaggg gattaccaat ggcgatcgcc cagtctccag gttgtagctg atctgagtta 11641 cccaatgtca ctgttggtag gttatctgct tgaattttaa caacagccac atctgtcaat 11701 tcatctctcc ccattacttt accttgataa gtgcgcccat ccttgagcgt cactttcacg 11761 gtatccgcac cattcacgac gtgagcattc gtaagaatgc gaccatcagc actcatgata 11821 aaacctgaac cagtaccccg ttctacccgc tgttgtgaat ctggtagtgc agatccaaag 11881 aattgacgga aaaacggatc cctaaattgc tctggtaact gggttttgac tgttcgggaa 11941 gcatcaatcc gtacaaccgc aggtcctacc ctttgcacaa ccttcgtaac aaagctagta 12001 ggctctccac cagatggtat tggcacacca gcatttaccg gactcacagc aagattagat 12061 gcgttttgag ttacctgttg ggagtgagaa gccaagtagc cacccgcaaa cgtcatccca 12121 gagccgagta acactagcga taaatgaaaa gctgtttttc tccaaggagc agtattatgg 12181 tttaaagact cgtgattgtc atcactcggt tttttataca ttttgtttct gtaagatttt 12241 ctagatacca ttcccaatct aaaaagactt tgtgacgatg ctgttacaga gatatgacgg 12301 acttgtgaaa agtcttactg cataagttct atgtctattg aagttgtgtt tgtgttcctc 12361 agcgagagcc tttggtggaa aaacccttga ataaaaagta gaagcaacag taaaacattc 12421 aaaggaaaaa tcagcaaaca ggcacaatag aattggtgcc aacacttgag acaacactga 12481 aattatatct attgcagtta ggtttgtgtt gctcggcaag agcctttcct cagatcttgc 12541 atctccccct gagtacgcct tgaactaaag ttcaaggctg catagtcgaa gtccgttaaa 12601 tcggagaggg gttgggggtg aggtcaaacg agtgatttgt atgttattga atcgataatc 12661 ataaataaaa agtcgaacca atagtaagac actcaaagga aaactcagcc aacagtcata 12721 atagaattgg tggcaacagt cgagacaacg cagaaatttg tggaaacgta ggtaggcaaa 12781 gatggggaag ctagcattgc tgataggggt gagtgagtat caacccggat taaatccgtt 12841 accgtgtgcg gtgaaagatg ttgaagcaat gcgacgagta ctgacacacc cagaaatagg 12901 aaattttgct gatgatgata tcaaagtcct gaaaaatccc caaggtcagg aaatccaata 12961 tgctattgag aatctgtttg ccgatcgcca gaaagaagac ttgttattat tttacttttc 13021 tggtcacggg attaaagatg agtccggcag gctttacctc tcaacttcta ccactcgtaa 13081 acagaacaac aggttgttca aggcttcagc agtagcagcg agtgttctgc acgaaaatat 13141 gaatgagagc cgttcccaaa gacaggtgat tatcttagac tgttgcttta gcggggcgat 
13201 cgcccaagga ctaacagtta aagatgatgg tactgtaaat ttacaggaac agctaggcgg 13261 taaagggcgg gcaatcctca cctcttccac atctacccaa tactcatttg aacaagaagg 13321 ttcagacctt tccatataca cccgctactt ggtcgaaggt atggaaaaag gtgcagcaga 13381 tcgcgatggc gatggctgga tttcgattga tgaactgcac gagtatgcta gcgacaaggt 13441 aaaagaagca gcaccagcca tgactcccaa gttttaccca attgaggaag gtcataaaat 13501 tttgctggca aaatcaccaa aggatgatcc taaggtgaaa tatcgtaagg aagtagaaac 13561 tcgtgctaag caaggtcacg agttttctgt ttttgctcgt agaattttag acggaaagcg 13621 ggatgagtgg ggattaactc cgcaagaagc agctgcaatt gaagaagaag ttttgcaacc 13681 ttatcgggag tatgaacgta agcgccatga atatgagcaa gcattgattc aggcaattga 13741 tcaagaatat ccctttagca aaacaaccca aaaagattta aaagaatatc agcagtatct 13801 gggactgcgg gatgaagata tcgcctcaat tgaacaacgg atcattactc cgaaacaagc 13861 agaatatcag cgcaacctac aacaagcaca gaggttacag caagagcaag aaagaactca 13921 gcaacaaaag cagcaagcag aattacgaga actacctgaa actcaatcat caccagtttc 13981 tcagccaaaa tctccctctg ttattcaaac taatattcaa actgatattc aaactcagcg 14041 ctttgagttc gagtatgcca caatcatcgt caaatcaaaa tttttaggta tagaaaaaac 14101 ctgggaaatc aaccgtcatc ggggtcgagc agaatttttc atacaaaact tgggcaatga 14161 tgcgttgctg gaaatggtgg cgattcctgg cggtcagttt ttaatgggtt ctccagagaa 14221 tgagccagaa cgccttgcta atgaaagtcc acagcacacc gtcaccattc aacccttcca 14281 tatgggtaag tttcctgtaa cacaagctca atggcaagcg gttgccgctc ttcctaaggt 14341 caaaatagat ttaaatccag atccatcctc ctttaaagga gctaatcgac ctgttgagaa 14401 ggtatcatgg gacaatgcaa ttgaattttg cgctcgctta gctaacaaaa ctggaaagcc 14461 ctatcgtttg cccagtgaag cagaatggga atatgcttgt cgcgcgggaa caactactcc 14521 ctttcacttt ggcgaaacta ttaccacaga tttagcaaac tacaacggca actatattta 14581 tgcttctgga ccaaagggtg aatatcgtgg gcaaacaaca gaggtaggaa aatttccatc 14641 aaatgctttt ggtttgtatg atatgcatgg taatatatgg gagtggtgcc aagatgcttg 14701 gtatgagagc tacgaaggag cgcctgcaga tggtaccgct tggatgagtg aaaatgataa 14761 gaattctcga cgcctgctgc gtgggagttc gtggtacggc gatccgtggc gctgtcgttc 14821 aggtagtcgc agcaggaacg cgcgtgtcaa tagggttgac 
gacgtgggtt ttcgggttgt 14881 ggttgcgcgg ttcaggactt cttagccctt tacactctta gcctttttgc actttacgat 14941 ttattctttt ctcttcactt ttgcgcgcga agcgcgagca atttttttta gatttttgga 15001 tgagtttaaa tctatcactt taatcataca aaaaacttac cgtttaatca agaggtctgt 15061 cttaaatata tgttaacttt aacaatgggg taggcaattc cgactgctgc ggggcggttc 15121 gtggaacaac aatccgagga actgccgctc ggcgaaccgc aacaggaacg cgcgtgacaa 15181 cacgaacaac aacgcgggtt ttcgggttgt ggttgcgcgg ctcagcgctc ttctgtgtca 15241 gaactggtgg atgggaattc atcgggcgta ccaaagaaga gtccagacct agtccagtga 15301 ggttgataac aacttccgaa aatcaaactg agccggatag cttggtagat tgcaagatcg 15361 aacagttgtc cggcttatca tattaaaagc aaaagcgctt cctattcctc actcaactca 15421 aaattcaccc gctcataaat atcacgaaag ctaatttgaa aatctataga atgtagggct 15481 aaaacggcat cttccccttc atgctcttta aaagtccatt ctccttctgc ttttttcaca 15541 tattgttcta cagagaaact atactggtca atcataatat attcttgaaa ctgaggaatt 15601 gaacggtaat atttaaactt gtctgttttg tcataatttt tagtggaatt tgataaaact 15661 tcaacaatta acattggatt agtaatagtc gtcgtattcg ttccttcgta tatcggttca 15721 cctttaacga ccatcacatc aggataagtg tagagacggt aacgaggtat ccataccttt 15781 acatcaataa tatatatttc ataagcttgt ccttgtactg ttaaaggaaa ttttctacaa 15841 aaatttagcg cgattttatt atggttggtt gttccccctg gcattgggat aatttctcca 15901 tctctatatt cacttctata ttcggcttgc tcttctaatt ccagatattc ttctggggtg 15961 taatggcgct tttgtgtttg taatatcata tccagtctct caatgcataa tagctatagt 16021 gttttaacag aaagaacaga ggcgtcagtc gccagaattc agtatgaatt ctatgcgatc 16081 tcggatgact tagatcttgc accataattt tagtccgcca agaaataaat ttcttggctt 16141 aaagcccaag tccgttaaaa cggactcttg ttagtctttt agtccgtttt aacggactta 16201 agctattagc ctggaactta agttccaggc gtactccggt taggtgcaag atgtgagatg 16261 agtaagtttt aatcccgcta tcgaaagtga ttctgactcc tgactcctga ctcaggaatt 16321 gctgaattct tttttaaatc ttactcaaca aacttcggtc ttgccttttg agtagactgt 16381 atgtgttgtt ctaaagctac ccaatcttcc tgctcaacga aatgaatcaa ttcatctaga 16441 ttgtgacgat attgctggag tgaattaagc aagcaggcgc gattgtaccg cgccatcatc 16501 agccctaact ctggattacc 
gccaccaaca cgacttgtat ctctaaagcc agaactagca 16561 aacttctggg ctaattccag cacattcgtg tcggtttcac tcatacaagc agcaatcaac 16621 gaggaactaa ccatcacggg caaatgcgaa atccaactca ctgctcggtc gtgctcttct 16681 ggagaacaat ggtagagagt cgctccaagt tcatgtacaa ttttttctac gattgtgatt 16741 gcatgtggtg gtgttgtttg tgttggtgtc aggacataag gttttttttc aaataaattg 16801 gggatagctg cttctatacc actatcggtt cttcccgcca ttgggtgtcc accgataaaa 16861 ttgtcccaga ggggagctat tgcttgaact atcggtgttt taactgaacc gacatcagtg 16921 acaattgtat gagaagccag atgagggatg agctgctcta atgtgggaac aatcaaccct 16981 atgggtgtac agataaagac tacgtctgct tttgtgagca gcctcatgtc aactgacgcc 17041 tcatctgtgc tacctagggc tattgctcgc tgacaggttg attctcgacg gctaacgcct 17101 aaaacatgat gtccttgcga tcgcaaatct aaacctaagc agccacctat cagtccaagt 17161 cctaaaatac caatattcat ctatctgtca ttttgagatt ctatatcatg tatattttgc 17221 accttgctgc tcaaagacta ggaatctgga actatgaagc ggattttccc atatgggtgc 17281 tttttaccag caaggattac attatataag tcattactca gcggttagtc atcaacagct 17341 aacaatattc taaaggagga agaaacatgc gtgatacctt taataaaatg atcggtaaga 17401 cccgctatgt ggtctctcgt atcatgctac atttaagtgg atctgaggta gcacctattt 17461 tgggagtttt aaatcgtgct gccagagaag ccatagatac cgatggtgac atagaaattt 17521 tgggagaagg attggtagaa atctgccaaa ccttactgca atacgatgaa tattggcttt 17581 ctgcggctaa tgaaggcgac gtattttgga gcgaagggga ggcgggagac tatgtgaatg 17641 aattatttac agactccgct caacgttacc tcagtgaacc agattttggt tctgactcgg 17701 gatacgatca acctttatct attcctgtaa cgcgcaatgt cgttgtcatg attacaatcg 17761 cttgtgaagg agaagtccca gatttagaga ctgacttggc taatattaca gcactcaagg 17821 aaggctttaa agctttaata aacttacact acaaacataa attacgggca attcaggtgc 17881 atttttcacc agctcgattg ggtgacgaac tcaccaacga ccaactcttg caatattacc 17941 cggaattaat tcctttgtaa ttggttgggt tgtccgtacc aaccatcata gtaaaaaatt 18001 gttaagctcg gagataggat gatagtcaaa tgttcatgac aatcgtgcgt aaattcacag 18061 ctgttttttt agccttgagt ttgtgtgtaa caactgtcgc ctgcggtggc ggggggtcag 18121 ataaaaccac acctcaagca agcaatacca ctcaaacgac gacagcaacc accaaactga 18181 
atgatggtca gtatcctgta caacaagcta gctacaacga tgctgacggg gagtacacgt 18241 tgcttttact caacgctacg cccccaagtt ttaaaagcca aaatttacaa atggcgcggc 18301 tgacggatga tgaaattaag caaggaaaga aaagttacct aaaggtagag aatgggcagt 18361 cagctttgta tctaacagaa gacttcaaaa ttgattacgt ccacaacgtt accgagacga 18421 aaacgaatcc ccaaaccgga caacaagaaa cagtagtcgt tcgtcaacaa aatagcttct 18481 gggcaccttt tgccggatct gttgctggtt ctttagccgg acaggctatt ggtagtatgt 18541 tgtttagacc ccaacattat gtcccgcctg tataccagcc tggtggagtg ctgactggct 18601 acggtggtta tggtagtagc tatggcgatg cggttagtag ttaccgcagc cgctataatg 18661 cagccccagc agtagagaga aaccgcactg ctttccgtag cacaggaacg attagaaggt 18721 catatcctgg tgattcttca actctacgac ggactacccc aaatacgggc agtcgcgcga 18781 ctggttccgg ctttggtgga agtaccctaa gaccttctgg taactctaat gtcagacgca 18841 gtcctagcgg tagtggtttt ggtagtggtg gtcgttcgcg agtcccccgt tcaagtggtt 18901 ttggtagaag gcggtagaga agtccgcact tgtatcatgt ctggataaac acttataaaa 18961 aacgaggaac ctcaccccgc ccttcgggca cccctctcct taataaggag aggggatggg 19021 ggtgaggtct ttttattgtt gtaagtaatt aggcgaactt gatattaccc acaatcattg 19081 actttttgca tcttttatgc taaggcaaca actggctgct gccaacgccc aactgcctta 19141 ccaatcaccg caagttcctg catcaagcgg tcgaaaagtt ctggtgtgag agattgcggt 19201 ccatcagata aggcttttgc tgggttggga tgaacttcaa tcattaaaga gtcacatcca 19261 gaggcgatcg cccctaaaca cattgaaggg acataggtag cccagcccgt accatgacta 19321 gggtcaacca taattggtag gtgagttagc ttgcgtaaaa ctggcacagc tgacaaatcc 19381 aaggtattgc gggtgtactg gcggtcaaaa gtgcgaatac cgcgctcaca caaaatcaca 19441 tttggatttc cggctgccaa aatatattcc gccgccatca accaatcttc aatagttgca 19501 gccattcctc gcttgagaag aactggtttt gactgtgcgc ctactttttt caacagcgag 19561 aagttctgca tattccttgc cccgacctgg acaacatctg cgacttctgc gactttctcc 19621 aagtctccac tgtccatgac ttctgtaata atgcccagtc cagtttcttc ccgcgccttt 19681 accaacaatt ccaaagcact ctcgccgtgt ccttggaagg cgtaaggcga ggttcggggt 19741 ttgtatgcgc caccacgtaa aaagtgtgcc ccggctgctt taactcgccg cgccgtctct 19801 acaatcattt gttcattttc tactgaacac ggacccgcaa caataaccac 
agggtgatgt 19861 tcgccaaaag gaactggtcc attaggagtg ttaacgacga cttcagaaga ttctccatgt 19921 cgaaattgac ggctagcccg tttgaaaggc tgttctactc gcagcacctg ctcaatccaa 19981 gggctaactt cttgaatttg caggggatct aagacagcgg tctcaccaac cagaccaatc 20041 actaccttat gcttaccaac aattttttct gctgtcagtc cccaagtgcc tagttcctcg 20101 ctcagacggt ctatttccgc ctcaggggaa ccaattttca tgactataat catgctaaag 20161 ttcctgctta gttgagtcct agctgagaat attttcctag atgaatcagt ggtttaaaat 20221 tataaagcgt gagcaaccaa aatccacttt tcaatatagc gctcaacagt cgcgtcattg 20281 aagatattgt gtctcaaaaa attcataaat ttggaatttg taatgttgaa gttacaaaat 20341 ttaaacttca acattatttt tttgacattt acccaacatc acagtgattt gctcactgag 20401 tactgcttat caaacgtttt tttggcgagt ttcacgccaa gctgctatca ttcgttcccc 20461 attggtcgtg aactcattcc agcctagcca acccccaact ctgtgttcat cccaagaggc 20521 agagagaaca ccataagtta ttcccaatac ccccaaacca aaaaatccca tgttcaccag 20581 taatacggcg atgggagcga gtttaattcc agcataagtc aacagaaagt aactggcaat 20641 cagggtgctg attcccaaag ctgttggtat tccacaaaaa agagccactc ggcgaatcat 20701 ccgttggcta acaacttggg gtatcgccat ctcttctttg gtaaaaggtg gcttcttctc 20761 gtcctttttc ccagattcct gcttagtttc tggttgtttg ccttttcctt ttgctggttt 20821 ttggcgcttt ttgtttggtt caaaaggcaa ctgactgcgt tcagattcag cagacataag 20881 cactagtctc tatccacgaa tgccgagacg accaatcaaa gcttgataac gttctcggct 20941 ttcttccaga atataagata aaagacgctt acgttgacca ataagcttca acaatcctcg 21001 acgtgaagaa tggtctttct tgtttgcctg gagatgttcg ctaagacggt taatgcgcgc 21061 tgtgagcata gcgacttgaa catcagcaga accagtgtcg gtttcgtgaa cttggtactg 21121 tgtcagtatt tcttgtttgc gctgttgcgt taaagccatg agaccaatca acttctaaat 21181 gtatgcagca gtttcccata atatcatagc catccgggag catcccaact ttcccaaaag 21241 acttggcgat gagaaatcgc gaacaggcga agtctactta ggtggactaa atctagcctg 21301 cctcaggcat cctcttttga gtgtgcagtt tagcaagatg aggattttta cgtttttcct 21361 actgccggac aagaattttc gcttggaatg attctatatc aattctaatg ccgtcttccc 21421 cagcttccac atcataatta ccagtccact catgccatgt ccccgcacag gggaagcgag 21481 gaacatgata tctactaagg tttttgtctg 
aaaaatttgc tatgaccaca acgcgagaat 21541 cttctttctg accgcgaaca tatgctagta ctttttcgtc tagattctca tggaaaaatt 21601 caatgtgatc actttgcaaa gcagaatttt ttttgcgtag ggcaataagt tttttataat 21661 attcaaataa atcgcgattc tcgtctctct ctaataaagg ccatgcaatc tttttcggct 21721 gagtgacagt ttcacttttg cgtttgtgtt ctccaaattc ttcccccatc cacagcatag 21781 gtatacccat tgctgttagc aatagcgctg ctgctaactt ggctcgtcta aatgcatctt 21841 cgtcaaagat accgcgatcg cccaactctc taaaaatatg ttcacggtca tgagtcgcta 21901 agtaattgac tacgtttata ctagttttat aaccctgttt tcttggatcg aaaacctcct 21961 tgagcttttg gggttcaaac atttcaccgc aaatatgcgg aatcacaaaa tagcgaaaac 22021 tttcatgcca acaagcatct agcggtccct ctggagtcgt cactttactg gtatcaggaa 22081 tattctcagc aatgttgtaa aatggcttag gggcagtgtt ttttttcgct tgtttagcta 22141 cccaatctag aaattcgtag ttcgctaatt gacgcaccgc atcgaagcga attccatcaa 22201 tgtgatactc ctgaacccaa aaccgtacga catcaccaac atatttccat gcaggcttaa 22261 cgtctaactt ctcgtcgtaa ttgtcatagt taaactccgg accccaataa ttacctggat 22321 cttccggata atgcatatat tcgtaatacc aataattcct gtctatcagc attaacggac 22381 attcttcatc agtgtggttg tagattccat caagaaaaac gcgaataccc cttgcatgac 22441 actcgtctac aaaccgcttt aaatcttctg ttgagccgta gctagattct gtagcgaagt 22501 agtgacgtac tttgtatccc caattataat caccagggta ctcattaact ggcattaatt 22561 caatcgcgtt aattcctaat tcatgaaggt aatctaactt ttcaatggca tctatatatt 22621 ttccacgttt ctcagggtca acttcaccac ccgtaaaatc ggcaacgtgc atttcgtata 22681 taactaattc acaattttct ggtaacggtg catcgtcgtg ttgccaaaca taagtatcaa 22741 caatcttttt tccattttta attctgacga caccgacttt tagcttttca tcgacctctg 22801 tcgcataagg atcaataaca tctatccatt gctctggttc aaattgtgga ctcttcgttt 22861 gtatacggaa tttatattga tagatatcat cctctaattc tatttgagta cgaaaatagc 22921 catcttctcc tttttccatt ggtatttctt gccattgcga aaaagagcca attaaggcgg 22981 ctttttgatt tcgaggagca aataaattaa atactgttaa tgttgccatt cctactagtt 23041 aataatcaga atcaagtcaa atcaataaga cagaaatata ttgtcatttc cagaaaaatt 23101 aatcttctct cttgagtcac cttttttaaa gagtcaataa tttttctcta atatacgtca 23161 ggaatttgtt 
gtactttatg caaccactaa taaacgttag ttttttggtc atttgatggg 23221 gacaaaggag aaaaaagtgg gaacaaacat tttctaattc ccacttcttt tcaatacatc 23281 tgtgaaaata taagtatttt ttaaccagca ggtgcaggga cagttgttac agtacgacta 23341 tctaggatag gcttgcgggt tttcaggtca aatttgtatc catgtggtaa gatatgaagc 23401 tttgctccgc aaactgctaa aggttcatcc ttcaaaatct cgtcaacgtt gctgtgcaca 23461 acctctgaat catcaacaac tgtcacacaa ccttgcccaa cgacttcaaa ctggtcatcg 23521 gttactacca tagcggtgtt ttcatcaata ccaaatccca agacaacagg ttgctgtgct 23581 aaagctgcaa ctaagcgtcc caagcgtccg cgctgtgaga aatgttggtc tatcaccact 23641 cctgggagaa aagctaaacc aggacctaag tctacaattt ccattcgagg gtttgtctcg 23701 gagtcgcctt ccacaatcat cacatctggc atgacagcgg ctcctgcgct tgtacctcct 23761 acaactatac cttcagagaa gcgctgatga atagctgcgt ccagttcagt atccttgaga 23821 acactcgtga tccggccttg gtctcctcca gtaaaaaata tacctgtagc tttacttacc 23881 gcttctaacg ctgtagatga agacgcatcc tcacggcttt cagtgtcaag tatacgaaca 23941 tcctcagcac ccagccgttc aaaaactcta atataattct gtcctacttc tcttggcagt 24001 tctgtcgcgg ctgtcatgat cacgattctg gcttttgtac ccccagcgta gcgaacaaac 24061 tcccgaagaa tttggcaatc tccctcctta tcttccgccc ccccaataat gactaattgt 24121 cgcttaatct cactagctac catagagcct cctgatagaa gttatttatt tctataaaac 24181 atgatgattt gagtggtaaa aactaccatt tggtagagaa agatacaaaa atttaatgca 24241 gactataata aagaaagtaa acaagcaaat gttaatatgt aaaaaagagt agaattagag 24301 taaaaatgac gactgatttt tttatcagct ttaagtaaat aaataaagtc aaaagtcaga 24361 aacatttggc tgttctgact tttgaaccgt aaaaagcttg tattaacttt tgtataaatt 24421 ttaaatgcaa cttttctaac tcatcccacg gcgcgtgaaa acattaagac cgttcattta 24481 tgcctaacgg tgagtccagc gctgcgggag gggagccagt gcggtcttgg ggtctcacgc 24541 cactcccttc aagtcgggaa acccgcaagg gttgggcaaa ccctcgacaa ggatggacaa 24601 cccttgtatt tttgcggcgt atcctttcaa ctagtttgat tcaaattttt tggtagtgcc 24661 cttcgggttc accagtcgcc tcaacggggg ggaacccccg cacggcgctg gattcaccgc 24721 aaaaatacat ccagatctcg ggttttacga gtagacattg ttgtcgggtt gtccatccaa 24781 ggctcatgaa aaatgaagcg gcggggtggc tccccaagta gagcatctgg cgttcgcgtt 
24841 cgcgcagcgt gtcgctttgc gactcagcgt ctccgttcgc gcagcgtgcc cgttcggcct 24901 caagccgtgg cgttagccat agggcataga agatacccgg agggcacgca ctcgcgttca 24961 gaaaaagccg tggaatgaag accattttta gttgacttta gtctaacgtg atgttttctt 25021 tttgcaagtt acgtataagt tctctcagta catcagtttg cgttctgcct agtttttcgc 25081 agtaattacg caagatttct ttctctccat cattcactcg aaatgtaata aaatttgttg 25141 gttttcccat agcactgaca tcaatatgtg gcaccctaga aatatagcac aggtaggtag 25201 cataactgcg cttcacttga caacacaaat ttgtggagtc cagatatgac tggctagggt 25261 gcgactcttt gagagtgaat ttatacctct aggtgagaac gccaaggaac accttggaaa 25321 gttccataac tggctgtttg aatgttggtg caaacccccg ctataccgct tgcggttagc 25381 ggcaggagga tgtcacaatc aagatggctc tataaatcaa aacacctccc gatttccata 25441 agttccggag gtttatctga ctgagtgccg cgcactcttg atggtaaatt gtggtgttct 25501 gtagtttaac gttaaatact caattcttca gaatgttgat caacctgtgt ttcaaacagt 25561 aactcgcata aaaaagaacg tgcctcttct aaagcgaaat gcctgggttt accctgaaat 25621 acaatttggt actcattagt gtctggacca tctccatacc cagaaatgag cttgtggtga 25681 aaagcttcct cagcaagatg ctgaacctct tttctgagaa cagcttttgc gttactcata 25741 cttgccgtct ccttagcgct tcttagtggt tactatttat aaaattaact agtgtctaat 25801 tgcacctttc aaaggttata tatttagcta ctagtcaata gcaatggttt aatgttgatg 25861 cacctgttat tttaactatt gagtattaat ttagtataac agcttaacta tatctttaga 25921 tagctgaatg ttcttggtaa cactgagtac aatgtttcaa taaacgtagc tattgatgcc 25981 agtgttaacc agaaagcaca gatgtagcgt aggcattagc cctcgcttcc caaagggtac 26041 taaacttata gacagtaaat acgtctgggg aaacgctagt atttgttcag attccagggt 26101 gtttagtcag aggtaagagt caatggttca agataaaatc accgatacgg ttcgcgttaa 26161 tgccagaaga actgatgcat tcgacatctt caagttcaag cattacatag gaccaaatcc 26221 ttacttggac gcaggggcat tagtatttga ttttgctgtc actgagttta caagacctct 26281 gccgattgag gattatgtgt caattattag cgatcgctac ccacacctgc gcgaccaaac 26341 atttgattcc tatgctcatc tgtttgcccg caccgtgtca gaggttggaa agctggatat 26401 gggtttgcac ctcgaccgtt ggagtatcaa gccatatgaa aattatgcga caattagcat 26461 acaatcgctt cacgagcgta caactagagg tgtactttac 
tttgtatggg attggtttga 26521 agccataact caacacaaag acattacttt tgaagagcaa cttttatccc ttcaaagtaa 26581 gttccggcaa tcagtctacg gtggtcctac agtttacgct ttactgcgta cagctgacac 26641 aaaaggtatt cccaccttct atctatggga tgaaggactg acacagtacg gatacggcaa 26701 aaaacaggtt cgcggaatag caacgacatt tgacagcgat agccatctag attccgactt 26761 caccacgcgc aaagatgact gcaaagcgtt tctaaaaacc ttaggttttc cagtgccaaa 26821 gggtgatatt gtgaattccg aaaagcaggc tcttgcagta gctagagaca ttaactaccc 26881 agtagcagtc aaacctgtgg ctggtcacaa agggattgga gtcaccgcag aagtacgaga 26941 cgagtatgaa ctgaaatctg cttttagaag ggcacttcaa gcaattccag aaaacgagcc 27001 gacccgaatt attgttgaaa aaagcattgt ggggtcagac ttccgcttgt tgtgtgtgaa 27061 tggcaggttt gttgctgcta ttgaacgtcg tcctgcatgg gttgtgggtg atggaaaatc 27121 aaccattgaa gagttaatcc gagacgaaaa ccgaaaacct gggcgtttgg acacacccac 27181 ctcgccgatg acgaaaattc agtgcgacga agcgatggaa cagtatcttg aacaacaggg 27241 cttgtcattg gacagcgtca ttgagaaaga tcgcaacgtt tctcttcgca aagttgctaa 27301 cctctcggct ggaggtgtga gcattgatgc aacccgcact gttcatcctg acaatatcat 27361 tttggcacaa gatattgctc aacacttccg tctcgtttgc ctggggattg atgttattgc 27421 tccaaatctt agtgaatctt ggaagtctaa cgacttttct atcctagaaa tcaatgctgc 27481 accaggaatt ttgatgcatc ttaacccctc tataggtgaa agcgttgatg taccctcaca 27541 cattctcgaa accttttata aattgggtac agatgccaga ataccaatca tcacgtttaa 27601 ccatatctcg gttcacgaaa ttcaagaaac aattgaccat attcttttgc aacacccaga 27661 ctggaaaata ggcgctgtct gtcgtgatgg tgttttcata aatcgttcag caaaaatttt 27721 gagtgaagat tacaacacca atatcctctc tttgttgcgt aatcctcagc ttgatatcct 27781 gattgctgaa tacaaagaag acattttgga aaaagaagga atgttttact acgggagtaa 27841 catggtagtc ttaaacaatc ctaccgaaac cgagatgacg ctagcgcggg atgtttttga 27901 tgattcaact gttgtgatta aaaatggaga taacatttct attagacgca agggtttgat 27961 tgaagaatat actcttggtg tggatgaacc atttacacgg gtttatttga aggagattgg 28021 aacggttttg taaggtaatt agcgttcaga gacgttgcat tgcaacgtct ctacattatt 28081 cagtgtataa actacaaatt tgcagggtta gaacaagata atgcccgtct aaaagcgaaa 28141 ctgcaaacaa tctttttcaa 
accacactca atgtcaagag gttgtaagct tcagctagat 28201 taagtttaag atcttttgca aaaatgtcag atatttcctg gttaaggttg agtttacatt 28261 acggttatat agacgcacaa ggtgaagtgt acaaacgtgt actagaagaa cttggaaaag 28321 ctattgaaaa atctacatct ctagctaatg agttagaaac aaatcaaaaa cttctaccac 28381 aagaaaaaga ctctattatt gatgatgggt gtttacttat tgaatatctc cttggcactg 28441 cttttgtgct ctgtcaagca tatatagtag atgttgtttc tactgttgag aaaattcatg 28501 ttagagctaa acaatctttg ggtagagatt taaaaacaat tccagttttt aacgaaaaat 28561 caaaaataag cagaaaagat attcttcgat ttggtcaaac tcttccattt tcttatgaag 28621 gggataattt ttcaccaatt caactcatta atgctttcgc aaattatttc aagcatcgcg 28681 atgaatggga tgtcaattgg gaaaagctag aaggtacaca aaaagatact gccaaggtga 28741 ttcaatcggt aggagctaag tttggttgta gtggtaattt acgacaaggt tcatccgctt 28801 taggaaatcc agaatttacc aatacactaa tcttctttga aaaactacaa caatggcata 28861 tcgatctagc aagtgcgtat gataaagagt taagtagtta tgatactgta ttctgatgtt 28921 tttgtgtttc atttatgaga gaaaaaataa cttggtaata tgcaattact tcaagaagac 28981 ctcgcctgtt ttttcaattt tcatcgcaat tttatcctca ccaaaattgt tcccttcatc 29041 caacttgacc tctaacaatc cgggtgctgt ttctccttga agatgaacag taagcagtcc 29101 tttatcctgt cgccggaagc taatagctac ggcgactggt aagtctttta aggtgacagt 29161 tgattttccg gaaagggtgc gttgtttggt atgtccaatg acttcgtatg tcagcacagc 29221 gttagtttgg ttagtcagct tcacgcttac cttactattt acaggtacaa ccttagcact 29281 tggagtttgt agtttttctg gcataggtgg gatcacagaa tcattagaag tcttgggcgt 29341 ttgttgagct tgggcgcttt gtgctagaaa aattgggatt ggaattgatg ataaaccaat 29401 tagtaagctt ccacaaattg ctcctagcct cacaaaatta attttgatag aatgtttaga 29461 cttagtcata tttttactct gccatagctg cgcataagtt tttatttttt catttttttc 29521 taactcttta tcataaagag tttgaggaag aggaataaaa agttttaccc caggaatgag 29581 tataatgact tctggattag catcataaat ttgttgccat cgactgcgaa cgccataaaa 29641 tttttcggct atttttggta gtgtatctcc atcttttacg gtgtaaaaca taagtgcctt 29701 ctcgtgaatc aaactattga cgtatttatg atttgagcta ctgcatctga acaggaattg 29761 cacctgttcg cacagatagg tcgatatttc ttccctgacg acgcaattgc aagcgccaat 29821 
tctcgccaac ggagctattt tctacagttt gttatttagt taattgtgac agtaggggct 29881 gtgtcatggc gagaagcttg aacaggtctg gttttgagcc ataaacttcc tccgagaatc 29941 agagctacta ctccaacagc gccaacaatt ttttgctgaa tttccatagt tcaacctccc 30001 atccattgag tttattgtta tttgcttatg ttttcatgtt aaataggctt tctaagaaaa 30061 cgatgagaga aacagaaaat tttggttgag taaaagcgca aatttatcaa tattaacaag 30121 ataaattaag aaaatgatga gaggataaat tacagcagtt tgtgtttaga taaactatcg 30181 caatcctaaa taatagctaa agccagtcag aaagacaaag cgatgcagcc ccattgtgtc 30241 catgattatt tcactgataa acacgtagag gctaccgaaa atgagcagga tgatgacaat 30301 atagtctaag tctgttcgtc tcatactcac ccatctcctg ctcttgctga ttctttctat 30361 attttatttt taattcggag ggtcaaagcg attaacaaag gcagcccaag gaatcattgg 30421 taaacccacg gccaaacaga acaaccattg gtcaaagcgt aggggtactg tgaagaaaaa 30481 ctcattaatt aaaggtacat aagaaaagca aagctgcaaa attattgcgc caataatacc 30541 aaaaccaatg gctggaatat caacattttc ttgaattgta ccgtccattt tggcaatcag 30601 attaggcact aattgactaa ggctcaacag ataaaagatt ctgcctgcaa ttaaagaatt 30661 gattgccata gtacgggcta gattgatatc ccccgtagtt tgtcgaatgt attcaaaaac 30721 tccaaatatc acaatccagt taaacagaga aatcgccaag atgcgtttga ttctactacc 30781 cgagagcaat gattcattgg gacgacgggg cggttgctgc atgacatttt gaggtttcgg 30841 ctcaaaggct aagggaactg tcatggtaat ggaattaatt acattcagcc aaagaatttg 30901 caatgataat atcggtaatt ctctgcctac cagcgtgctg aataaaattg tcatcgactc 30961 tccaccattg actgggagaa taaagcacat agctttaacg agatttttgt aaacagcacg 31021 cccttcctca accgcagctt taattgaagc aaagttgtca tcggtgagta gcatatcagc 31081 cgcttccttt gctacctcgg ttcccgccat acccatcgca atgccaatat cggcttgttt 31141 caaggcgggt gcatcgttaa ctccatcccc cgtcatggcg acaatttcgc cttttgattg 31201 taacgcttcc actagacgca gcttttgttc tggtgcgacg cgggcgaata caaccccatc 31261 ttctgcaact tgggcaagtt cttggtagtc catttgagca agttcagccc cggaaaatgc 31321 cagcaccgaa ccatttttat tgatgcccat acgagtcgcg atcgcctgcg ctgtgacagc 31381 atggtcgcca gtaatcattt ttacctgaac acccgcttct tgacaggctt tcaccgcttt 31441 catcgcactt tcgcgtggtg ggtcaatcat cccttgcaag cccaagaaaa 
tcaacccagt 31501 gtcaatatct acatggtcta cggtagtttg ttcgtttggt acaagctttt ttgccaacgc 31561 caacacccgt aagccttgac gtgccataat attgacttcc cgttcgatag aggaggtatt 31621 taatgtctct tggcagtcga ttcgtttgag ttgtcccttg ctatttaaca tgaaagtgca 31681 gcgcttggcg atcgcctcta ctgaaccctt aacataaatc gtcttacccg ttggagttcc 31741 gtgcagagtc gccatgtact gaaactcaga ctcaaaagga atcccatcca gtctcggcat 31801 ttgctgttct agaagcggct tgattaaccc caatttatga gcagaggtga ttaacgcccc 31861 ctcagtcgga tcgccaaaaa ctacccattt accgtttttg gtttccaagt gagagtcatt 31921 gcataacagt ccagcgctga gacattcctg caaaccgata tcgccactca agtcgattgg 31981 tttgccatcc tgtacaattt ccccttccgg agaataaccc acaccagtga ctgcatactg 32041 atgtcccccc gcatagatgg cttgcactgt catttgattt tccgtcaaag tgccagtttt 32101 atcggaacaa atgacagtag cactgccgag agtttctact gctggcaact tgcggacaat 32161 tgcgtggcga cgagccatgc gggatacacc gatagctagt gtcacagtca ctaccgctgg 32221 taatccttcc ggaatcgagc cgacaattaa tgccacagta gcttctatag catctctcca 32281 aggtttggtt tgcaatcccg ccatcaaagt cagagttgca acccccaaaa ccatgtacag 32341 ccaatttttg ctgaacttgt cgaattttcg ggttaagggg gtagaaaggt caactttatt 32401 ttctaatagc tgagaaattc gccctgtttc ggtagtatta cccgtgccta ctactatacc 32461 tgtcgcctgc ccaaaagtaa cgaaaccgcc tgcataagcc atatttttgc gttctgctaa 32521 cgggatatct ggcggtaggg gcggcgtctc tacacccgcc tctttttcaa cagctacaga 32581 ctcaccagtc agggcggatt catctacttg cagattccgc accttaatca gtcgtaagtc 32641 agcaggtact ttgtcaccag aagttaacag caccacatcg ccaggaacta actctcgtga 32701 gggaatgcgt aacttttggc cctcgcggat aacagttgct tctgtagtga tggatttagc 32761 tagggctgcg atcgcccctt ctgctttcga ttcctgaaca tatccaataa tcgcgttagt 32821 tgtagttaca ccccaaatca caaaggcgtt gacaaactca cctataattg ccttggttgc 32881 acccgcaatt agcaagatgt acagtagcgg ctggttatat tgcaaaagaa attttaacca 32941 ccagggtttt cctttctttc cggtcaattc attgaaacca aaacgcgcta ttcgctcttt 33001 gacttgaata ctattcaagc cttgtgcttg gtcagtttct aacaagtcga tggtttcttc 33061 tatatcgagg gaatgccagc tttgaggcaa cttgtcttgc attgggcgaa ctggtttcac 33121 tttaacttta tgggtagata cagacttagt 
catgagtaaa ttccgattac agggatataa 33181 attaattagg aatccagttt gctaagagag gaacatctgt caaattccag ccttggtcag 33241 ttttaacaat ttcttttgtg ttggcatcga tgtacacttc taaatcattg tttagttcaa 33301 tttcccacac caaattatta ttttcatgct ctaaattgac atcaattgct gctgtacctg 33361 ggttagcagc aagaaccgtt tctatggctt caaccattgt gaccgcaggt gctgtgttgt 33421 caggggaagc ttgaacaggt ctaattctaa aaaaccaact tcctacaaga atcaaagcta 33481 ctactccaca agcagctata atggtttgct taattcccat aattaaacct cagaattgaa 33541 gttttgatgt tattagctca taccttcatg ttagagagac tttttaagaa tacgatgaga 33601 ggaaaaagaa agttttggta tgagaaaagc ttcaatcaat agacattacc aaattagatt 33661 aagaaataga tgagagtcta gaaaataaat ttttatatag caatccaata tgaatgctga 33721 gaattgaaag tgtgagaaac ccggtaactt ctgcgaaacc gggtttccca tttttcatga 33781 ttgatgcgcg gattgctata tgtaagtaaa gagaaatata tataataatt gtaggtttgc 33841 gtgaacgttg cgactgtcta cgctaaccta caatttataa ctattaaaac actagcatta 33901 atacgccaat cgcacttacc caaaacacaa tagagaaaaa aagaccccac aaaaatcctc 33961 tcacagcata cggatagcaa tcaatcatat attgactagc ttgagtttca catatcgtgt 34021 gttcaggtat ctgctgcgat tgggatattt gctgtaatga agttgcattc agtctcattt 34081 cacttactgt catggtctaa ccctatttca actcgctata atttcaagtt acaaaacgaa 34141 attaagaaaa taatgagagt caacaaccat tagtccatat caggacttac gtaaaatcat 34201 ggaaaaacgt agacgcccag cggcttcccg cagggtaccg caaaggacgc aaaggacacg 34261 ttagcgcgca gcgtgcgcct gcggcgcata ggaataggag tttgggagag tttttgcgta 34321 agtcctacat atcattaatt ttgatacgtt acataattta atgaattacc cttttttcct 34381 cttctcttgg cgcacttggc gacgccagtc gcctcaacgg ggggaacccc cgcacggcgc 34441 tggctcgtct tggcggttcg tttatttaac ccctcaaaat taacgaaaca gtccactatg 34501 ttgggcaatc ggtagagtca cagtaaaagt agtttcttta ttaggaatgc tgtcagcagt 34561 aatagtaccg ccatgcaagc gagcaatttc gtgagcgatc gccaacccca acccagctcc 34621 actttttcct tgaaaacggg ctgagtctgc ccgataaaac cgctcaaata ggtgaggtag 34681 atgctctttt gaaataccat accctgtgtt aattatcatc acctgcacta catttgagtg 34741 caaatttgac cgcacccaaa caacacctcc ttgtggtgtg tatttaacgg cgttgtctac 34801 caggttcaaa 
aacaaactgg ttaactggtc gggatcgccc tgcacccaca aatcgggggt 34861 caagttattt agcagcttaa tctgttgcga ttctgcagca tgttgcactt gttccacaat 34921 cacttccaac aagttgctta agtccacagg taatggctca aaaggcaact gtccccggtc 34981 aatccttgct agtaacagta aaccattgct gaggcgaatg agtcggtcta cctcttgttc 35041 taaatcctgc aaggtttgct cgtattcatc tggagtgcgt tcccgactgc gggttacatc 35101 tatacgtccc ttgataaccg tcaggggagt gcgtaattca tgggctgcat cagccgtaaa 35161 ccgttgttcc cgctcaaaac ctgcttgtaa acgttctaac atctggtcaa aggtggtcgc 35221 caactgtccc acttcatctt ttgccccttt gtagtttaga cgttgcgcga ggtcaatggc 35281 gctaatagct tgagcagtct gggtgatttg actgatgggt tgcaatgctc ttgatgagag 35341 aaataaacca ccacaaccag caataataag tataaatggc aacaaaaata aaagctctgc 35401 agataattta tcggatattt cttctaatgc ctcaagagac tttgctattt gcaaccagcc 35461 gataatctga ccttgacgga ttactggctg gctaattagt cgccagtctg ccttatttct 35521 tgccactctt gtgtatccac tggcacttgg tatccagaga ggcacttctt ggtatttgcc 35581 aaagccgtcc actattttgc cttggggagt catcaggcga actgcaaatc cagcctctat 35641 caaacgttgt acggtatttt gtcggcttgg cttgtccaca aaggcaaggg cgttattctt 35701 atcgcttaaa taaaccaagg actgagaacc tgctatttgc agagcggtat ccgcctggtt 35761 aattatttta cgctcaagtc ggaaatacaa gtagcccgtg aacccgccca gagtcagccc 35821 caacaacagg agataccagg cagttaattt tatacgtaca gggatgtgat tgagccactt 35881 caaactaatc atgattcagc atctgcggtg aggcaatatc ctacaccacg cactgtgtgc 35941 aggaggggtt tgtcgaaacc gtggtctatc ttgcgtcgca agtagccaat ataaacatca 36001 ataacgttgg aattttcgta aaagtcaaaa ttccaggtat gttcggtgat ttgattgcgg 36061 ctaagtactt ggcgagggtg gcgcataaga tactgtagca gtgtaaattc acgagggcta 36121 aggttgatac tttttccagc gcggcgtact tccctagtag caacgtccat ctccaagtct 36181 ccgactcgca gtattgtgtc ctgttgcatc ggaggacgac gtagtaaggc acgcagacgc 36241 gccaatagtt cgctcaaagc aaagggctta aacaaatagt catcagcacc agcatctaga 36301 cctcgaaccc ggtcttctac agcatcacgg gcggttagaa gcagtacagg ggttttaaga 36361 ccttgcgatc gcagtttccg caaaacctgt agaccgtcca tttgaggcaa cattatatcc 36421 aatacgataa tgtcatactc tgctgaagca gcataatcta gtccctcctg accatcagta 
36481 gctatatctg ttgcgtaacc tgcttctttc aatccctgac taataaattg agcaatgcca 36541 ggttcatcct ctactagtaa gacgcgcata agtcaagcaa tgcatagatt ttgaagccac 36601 tccgttgcaa acgcagcaga gtgcggtgaa tcatccctat atacttgtat tttccaataa 36661 ttcctaatta atggctacac tcgctacaat catgccaagc tgtgtaaatc ttcgtgcgcg 36721 attctgctac cacaagaatg aatattgctc aaattgcctt tacttggtca aaaaaataca 36781 gactctttcc ggtaaattat ctttgcaatc aagtatcagt caggcgtgag ctatgaccca 36841 cgctcacgct caaattcaaa ggagggaata gaccgctgcg ctaacaaaat tcgctcgctc 36901 gctgcctacg gcacgctact aacacattca aaaaagaaaa aaaggataca agcccagacc 36961 gccacgcacc gaagagcggc ttgcacttta tttatttatg gaaaatataa agaaattttt 37021 tccatcgagt aggcgcaagg aaaaaggcgt ccttatttca gagccaacaa cttatggtat 37081 ggggttctta atttctcacg ttcgcgaatg agcgaatgaa cgaattgttt tgacgactct 37141 acttcatgca tccgtaaaaa agcgggaagt atgaagcaac tgggacaagc gaagcgtcaa 37201 aaatctaaat ctactggtct ggttacaaaa acccagacct ctacgttagc gcagcggcgc 37261 gaagcgcagt aggtctgggt agttcatttc gtcgaactca cgttatatca cctttagcta 37321 ataactcatt tactgccaaa caaacacttt agcttcgtat ggtccaatat caatcatgat 37381 accatcgtcg ccagcttcca catcataatt gcctgtccac tcatgccatg taccaggact 37441 ggggaagtta ggaatctgat atccagctag gaagttttct gagaaattag ccacaaccac 37501 aacacgagaa ccttcatcgt tccaacggct gtaagctagc acttttgtct ctggattttc 37561 gtggatgaag tcaacatttt ctgtgtagag agcatgatta tttttacgca ggtttattaa 37621 ccctttgtag tattcaaaca agctacgatt caaatcatta cctagcaatc cccattcaat 37681 tttagatgat tcttgtgttt tcggcttgta ctcaccaaat tcctgaccca tccaaatcat 37741 gggtataccc atagctgtca gaaggatagc agcgcccaac ttaagccgct taaaagcttc 37801 ttcatcaaag atattgttat tacccagttc taccattaaa cgatcatggt catggttagt 37861 gaggtaattc acaacattcg tcgcacccat aaagccttgg cgctttgcgt caatcacatc 37921 tttgaggcgt tctaaatcaa aggtgttacc acagatatga tccttaatac aatgatagaa 37981 actatcatgc cagcagccat ccattggtcc atcaacgttg gtaatactag tcgtttccgg 38041 aatatgttcg gcaatattat aaaaaggctt catgctggca gtatctttgg cttcttgcac 38101 aatccaatgc atgaaatcat agttagcaat ttgccgtgct 
gcatcgtagc gaattccatc 38161 aaggtgatac tcaccaaccc aataacgaac tgtatcacca ataaatttgc gagcaggata 38221 agtgtctaaa tttttgtcat agtgttcgta attaaattct ggtccccagt tgttatcggg 38281 gtcgcgaggc tcgtgatgat accaataatc gtggtcgatt tgtgttaatg gacttgatga 38341 ttctgaatgg ttataaatac catccataat cacacgaatt cctcttgcat ggcactcatc 38401 aacaagattt tttaatcccg atgtcggacc gtaacttgat tctggggcga agaagtgacg 38461 tggattgtaa ccccaactat aatctcctgg atattcttta actggcatta actcaaccgc 38521 gttaattccc agttcactca ggtaatctaa cttctcaacg acgtgcttgt actttcctcg 38581 cgcataagga tcatcttcac caccagaaaa gtcaccaacg tgcaattcat aaatgactaa 38641 ttcgtggtct gcgggtaggg gtttatcgtc gtgttgccaa acgtaagtat caacaatacg 38701 ttccccatct tttatgtata caacaccgtc atccttccca ctcaattcat caatatcagt 38761 tgcataggga tctgtcacat ctacccattg atctggttca aaaaaccaag attttgactg 38821 cacgcgaaat ttatattgat aagcaccatc ttctaattca acagttgtgc gaaaataacc 38881 atcatcacct ttttccattg ggattggttc ccaatcagaa aaagaaccaa ttaaagacgc 38941 agctttgttg tagggtgcaa ataaattaaa ttcaattggc tttgccatag taactatatc 39001 aagatatcac aaggaaatgt gttgacaaaa tacagaaaaa tcattattta gctaatgact 39061 ataatgatta atggctcaaa gcgattaact tatagtgatt agccaagtgt tgaatgaatt 39121 ctgccagtga tgagtatttt ctaatttcta aacaaactag ccaatcatcc tctagagata 39181 atcaaatact ttttacgtct atctcaagaa gtaattttta taaaaagtat attttggggg 39241 atttattaat taactataaa attttaatga ttcatctaaa atacttagaa aaaataagtc 39301 aaaagcttgt taatcaaaaa aacataaaaa ttgtaagtga ggaataagtc actcctcatt 39361 tctcactttt tgagtcataa ctcgaagtaa aattttcagg cttgctcgtt gtttttataa 39421 ctagcaacag ccgccctgag attaccctgg tattgtgaca agagttgtaa accctcctct 39481 ttttccaaac cagtccaatg cattaatagc gccaatttta cccatttacc actacgttcc 39541 aataaaaaac cagcggcttc tcgacttaaa cctgtgaggt cttgtaaaat acgcatggcg 39601 cgatcgcgta acttttgatt cgttaccgcc acatccacca tacgattgcc ataaacttta 39661 cccaacttga ccatcacacc cgtagacagg atatttaagg caagtttagt aactgtacca 39721 gctttgagac gagttgaacc agcaagtatc tctggtccag ttaacaggcg aatatcaata 39781 tcagcctcac actccacttg 
ttcaacaggg acgcaagcta taaatatagt agtcgcccct 39841 cgctgacgag cagcactcat cgcgccgtgg acgaaaggtg ttgttccacc cgctgtaata 39901 ccaacgacta catctagttg cgtgatttgt cgttgggcga tcgccgcctc accatcttca 39961 gcgctgtcct ccaaatcttc ggaactgcgt acgagtgcac ccgcaccacc cgcaataatt 40021 ccctgtacca tctctggagg tgtacaaaaa gtaggtggac actctgcggc gtctaacact 40081 cctaaccgac cacttgtccc tgctccgatg taaaaaagac gtcctccatg acgcaaacac 40141 tcagcagtac gttcaatcgc ttcagccaac tgaagtttag ccgcagctac tgcagccact 40201 gcctttgcat cttcgcgatt aaataactct accagttcaa gagaactcag ctgatctagg 40261 tcaagactat caggattaac ctgctctgtc aaaagatgac cgcgcccctg caaattcgtc 40321 gtcatttgtt ttttgtccaa agtggaattg ggagacaata ctgtgagcgg gttttccgga 40381 agtactgccg ttcattcgtt ttttgtcttt agtcgtgtct catgagacat gactcattac 40441 aataatcctt ccagtttgcg gcgaatgctg tctagttccg aatcagaaag ttccggtttt 40501 tcctgcgact cctggttgcg agggagagtt tcctcagcaa catcctcctg ggattctagt 40561 ttccagtctg tttcttcaac gttgatttct gggggaattg ctaaagcact gtcattttga 40621 accaattccc aatcatagtc cgcagcttga caaaactctt ttacttcctc tgaatcaatc 40681 gcctctacag ttggtgatag gaaatcttga gcttccagca tgagggcaaa gcgtgtcgcg 40741 tcgtcttctg actcgaacat cagaacttta ttgcgctcgc ccacctgaat tgtgtgaatt 40801 ccctcatttt ctgtccgagc attaaacagt agtacaaaaa cacgcatggg tgtaatcact 40861 aatcttgcat cttggatcta tttttaatta aagttctaga aggtgtccaa agaaaagtcg 40921 aagcaaaaat tttcattttc ttataaactc ttaatattct gctgacaagg taaagacgac 40981 gatatcaata attttcttca agaaagtgag tagtctgtga agaaacacag gggtgtaagg 41041 gtgtaggggt gtaggggtgt agggg // LOCUS NODE_595_length_40905_cov_5.15836040905 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 40905) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 40905) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..40905 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 6..344 
/locus_tag="DP116_04755" CDS 6..344 /locus_tag="DP116_04755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741211.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04755" /translation="MKLSFKDIQFIIEAIDYLKKLYESRLNNENLDDDEISDLGNDCM FLEALRVDLEKNLKQVVPQNENSLQSDLLNLSAQNLKQSVQQLPISQRLVLVDAITES IRQELSLIQR" gene 707..916 /locus_tag="DP116_04760" CDS 707..916 /locus_tag="DP116_04760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995666.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04760" /translation="MTNREELIQELEQAPDDIVQSVLDFFRRIKATRKTHPLAKFAGI LSDVEAEELKRAIATECRQVDVNEW" gene 906..1295 /locus_tag="DP116_04765" CDS 906..1295 /locus_tag="DP116_04765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112283.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_04765" /translation="MSGEIALDTSVAVRFLNGDTAIVSRVLAFPEVILPTVVVGELLF GAENSTRPLQNLPRYLEFIEACVVLPLGRETAVVYAQTRLALKRKGRPIPMNDVWIAA QCLEQGWILVTDDTDFDYVDGLMLERW" gene 1737..2177 /locus_tag="DP116_04770" CDS 1737..2177 /locus_tag="DP116_04770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739988.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04770" /translation="MEPFTAGAIAVGTIVATKALEKTTEKGTEILLDKAGKFLVTLKK HSPHTVIALEKAPQKPLDYGKAVLEVESAAKANREVAQAVQELATAAQANRPSNLVEI LREIKASVEKSQQSYPSTFIQNIEKAINAAQNQTIDQRYSTFNV" gene 2544..3332 /locus_tag="DP116_04775" CDS 2544..3332 /locus_tag="DP116_04775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952026.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="formylglycine-generating enzyme family protein" /protein_id="PRJNA477356:DP116_04775" /translation="MWIEGLGERIKLEMVYIPAGTFIMGSPGDEKGRQRNEEPRREVT IQPFLMGRYTVTQAQWRYVSTLPRVQIDLASDPSYFKGDLRPVERISWYEAREFCARL SKATNREYRLPSESEWEYACRAGTTTPFYFGETITTELANYDGSLVYGRGSKGIYREK TTEVETFPANAYGLFDMHGNIWEWCADHFHENYKRALRNETAWLSSDENARRVIRGGS WYSSPKFCRSANRTPLPPHERGDPSGIEGTGFRIVCVPESNLFQ" gene complement(3477..3662) /locus_tag="DP116_04780" CDS complement(3477..3662) /locus_tag="DP116_04780" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04780" /translation="MSMNNRIQDVAKNAASAPSEGGGWLPAALKVGAVLVTVALSAVG AGEAADAAESLSEIGRK" gene complement(3786..6490) /locus_tag="DP116_04785" /pseudo CDS complement(3786..6490) /locus_tag="DP116_04785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875484.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="DNA mismatch repair protein MutS" assembly_gap 4471..4480 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(6983..7618) /locus_tag="DP116_04790" CDS complement(6983..7618) /locus_tag="DP116_04790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196998.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="restriction endonuclease subunit R" /protein_id="PRJNA477356:DP116_04790" /translation="MTILKASNLSLEDVQRLLGFQKQYTDSFTPLLSLEPITQDEQQE LEQIRNDFDRYLTTGKVSEGQVKFLVLAPLMRLAGFYRYPIEILLEEDIADIEIEDED TKIKGRFDILAITKAKRTKVNAFFWVLLIESKNSQIDISTGLPQLLTYAYKSLEHQES VWGLTTNGRSYQFVNIQQGHLPTYHLMPELNLMERQRSILLLQVLKAICQL" gene complement(7897..8229) /locus_tag="DP116_04795" CDS complement(7897..8229) /locus_tag="DP116_04795" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04795" /translation="MNSYQTAGETSPTEACCGQKSFALEDFNLPFAALNSKYAHMAKR VSPPLDTRESRLIAGLCAYIIAITNGKAPCWSIAPKGRALRAIAFSAKRFGSKLKKEV RMDCVPLR" gene 8319..8840 /locus_tag="DP116_04800" CDS 8319..8840 /locus_tag="DP116_04800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492592.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_04800" /translation="MTMTFTNEAASAQLNKPVLKEGSKGDAVKELQKLLLKWGAFVSL DNNGACVFPGEEVIDGVFGPKTKNAVIFFQGKVFLVQDGIVADKTWRALFKNAPVDMP ILKKDSKGELVKKVQERLEIGDYYNGKIDGDFGNSTEKAVKALQKHTGLPVDGVIGDR TWFEVSQINTIFC" gene complement(8909..10201) /locus_tag="DP116_04805" CDS complement(8909..10201) /locus_tag="DP116_04805" /EC_number="3.5.2.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316285.1" /note="Catalyzes the reversible hydrolysis of the amide bond within dihydroorotate. This metabolic intermediate is required for the biosynthesis of pyrimidine nucleotides; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydroorotase" /protein_id="PRJNA477356:DP116_04805" /translation="MTSELLQQVRVIDPVSGTDQIADVLIEDGYIRSVMNHISDMPND TDVRDCHGLVLGPGLVDLYSHSGEPGFEERETLSSLLQAAAAGGFTRISILPDTSPVI DHPAVAVQLQKNTGAQASFHSSPLLKSSTAPLLNVWGAISLDIAGKQMTELVDLAAVG VVGFTDSQPWENFGLVRRVLEYIQPLRKPVAFWCCDRQLMGNGVIREGPDAIRFGLPF IPSSAETSAIAALLELVTATSTPVHIMRVSTARSVELIASAKARGLPITVSTTWMHLL LDMQALKSYNTSLRLEPPLGTANDVAALRQAVRTGVIDAIAIDHRPYTYEEKTVAFAE APPGAIGYELALPLLWQHLVETGEFTALELWKALSTRPTECLQQKISVIAPDQKAELT LFDPQQNWKVEKQNLHTLSSNTFWLGQQLTGRVVQTWC" gene complement(10328..11680) /locus_tag="DP116_04810" CDS complement(10328..11680) /locus_tag="DP116_04810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874842.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine phosphatase family protein" /protein_id="PRJNA477356:DP116_04810" /translation="MTRVIIVRHGQSTYNTEKRIQGRSDASKLTEKGRSDSSLVGKTL SNILFNAIYSSPLQRAKDTAEIIHHELTIHAQQSAVPQTSEQLLEIDLPLWQGMLSAE VKEKFREDYRIWQVSPHLLRMSVKDGEQTREHFPVLALYEQARQFWQEVLLRHRGETI LIVGHNGINRALLGTALGISPERYHSIQQSNCCISVLNFSGGLGEPVQLESLNQTQHL GEILPSLRPNHNGVRLLLVRHGETEWNRQTRFQGQIDVPLNDNGRQQAQKAAQFLKDI AIDFAVSSTMVRPKETAEIILQYHTDVNLELQDGLREISHGLWEGKLEKEIEQEFPGE LHRWRTVPGQVQMPEGENLQQVWERSVAAWESIVQTALAKQLKTGLVVAHDATNKTLL CHVLGLSSEQFWNFRQGNGAVSVIDYPLEAGGLPVLQAMNITTHLGGGVLDKTAAGAL " gene complement(11881..12282) /locus_tag="DP116_04815" CDS complement(11881..12282) /locus_tag="DP116_04815" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04815" /translation="MNSKIFAALAIVAAIAGLQSKVHAQSSVAPNSKTGNYTIQGDSL TGIGRTAKDDFARFFAERNSQNNVPRRNYQESTSEVEVLELGDQIELRRREPITTPNN VIFPQGDESFYSNDGVQVQFDLNRDSNRQKR" gene 12846..13541 /locus_tag="DP116_04820" CDS 12846..13541 /locus_tag="DP116_04820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875513.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="exonuclease" /protein_id="PRJNA477356:DP116_04820" /translation="MPFNTQLLPRINAKSMRENGKQYYVDTQGNRFPSVTTILNATKP QEDRERLLNWKARVGSEEATRITTGASRRGTQTHKQIERYLLGENPACSEASRPYWES IKPVLEEIDTVKLVEGSVFHYNMSYSGKVDCVASYKGIPCICEWKTADKPKGSVERLY EHPLQLTAYLGAVNQCYQEYDIEVNHALLVVAIPDTEAEVFWFEPEVMKDYWMQWEQR VAEFWKRRNTWGW" gene complement(13631..14629) /locus_tag="DP116_04825" CDS complement(13631..14629) /locus_tag="DP116_04825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748126.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="formylglycine-generating enzyme family protein" /protein_id="PRJNA477356:DP116_04825" /translation="MQERTTVHQEVSSGKSDRQIAIKRPGKAPDKDMVWIPGGAFVIG SDHHYPEESPAHLVRVDGFWMDRYAVTNKQFQRFVKATGYVTVAERPPKPEDYPGAIP ELLVPGSAVFQQPKHPVHLQTCSWWVYVPGANWRHPTGPGSSIKGRENYPVVHIAYED AEAYAAWAGKLLPTEAQWEFAARGGLEGAVYSWGNEFAPKGRRMANTWEGEFPWQNLK SRSPGAESVGSYPANGYGLYDMIGNVWEWTTDWYRDSHPENKTKSCCIPVNPRGGTQQ ESLDPNTPSQIPRKVLKGGSFLCAPNYCQRYRPAARHPETVDTSTSHIGFRCVVNA" gene complement(14643..16013) /locus_tag="DP116_04830" CDS complement(14643..16013) /locus_tag="DP116_04830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748125.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="twin-arginine translocation pathway signal protein" /protein_id="PRJNA477356:DP116_04830" /translation="MNKRKLPAISRRQFVGTALASTILTLGGSSVFEAVAAKRKRPNI LFILTDDLGWGDLSIYGKTYQTPNLDQLAKEGTRFTNAYAAQTVCTPTRIGFFTGRYP ARLPVGLQEPLVESDTVGLPPQQPTIASLLKANGYQTALVGKWHAGFLPDYSPLKSGF DEFFGNYSGAIDYFTHKGLDGELDFYEGEVRVDKPGYATDLYTERAVEFLTRPRNQPF YLSLHYNAPHWPWEGPEDEELSRTFYNSNAFTAGGSPETYAAILKSLDDGVGRVLQAL KDSGQADNTIVIFVSDNGGERYSDIGPFQGKKGSLYEGGLRVPTFIRWPGVIQPNQVN SQVIITFDLTATILAATGTSPDPNYPLDGQNLLPVLLGRKPVSPRTLFWRYKSNVGGN SGQLQAAVRSGDWKYLRQGEKEYLFDLANDEGEQTDLKDNNPKVLQRLRDRFEEWNGQ VLPYPA" gene 16464..18413 /locus_tag="DP116_04835" CDS 16464..18413 /locus_tag="DP116_04835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320131.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04835" /translation="MGRYERRPDNPREKGDPYGTAENALRAVTEELQILQRNLLKSLQ EDTKRLQAEKDRLTEDIRQLQEEKEHLQQAYQINEQQTLIRQLSQVLANHISLQLQSS LETLATQAMERVSQPVGSSEPREATSHSTSEVNEYAEKLLGSLDDTLTITFNSLQQEL KNYQSGLSQQLSRMIVQQREGEAILIELVNCLRRELKETTQNSPSAIVPVPSGEIEQI LQQQQLTSEEVPTKLQTDIPPETTVLPRKLPQEETPAWNSLSQSIASPSETTQSPPVL EEPTPIASPRRAVLEPETSPPPEPQLTPRQRGISSLQITGIMLLAFSTVASALYNVAI KVIFLPGSQIFGVFDAQPLISPNLGNSLFILTLRMLVVLPLMLLLAPMLHSRVWQDLQ YLGDSVRGNSSNPTTKRVLILSIVSGCFLFLSQVLIYLAIGQIPTGMAIALFFVYPII NGLLSWFLFRDRLTLFSSFAIAVIGMGELFVLGSSNSSVIGNIRIGSIAAIASGGAFA LYLLVSRMCAAKLHPVSLTLINFATMLLLSVFGLILPLPTSWNLQLNRAYLLELVLCA FLLGVLTLFSYLFNNFGIRKIGASRSAIIGATIPALTVIFAGLILQETLQLEQVLGVL LVSFGAGALCFEKIRRTKPFDQSSQ" gene 20259..21158 /gene="hetR" /locus_tag="DP116_04840" CDS 20259..21158 /gene="hetR" /locus_tag="DP116_04840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873360.1" /note="controls heterocyst differentiation; has protease DNA-binding activity; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heterocyst differentiation control protein" /protein_id="PRJNA477356:DP116_04840" /translation="MSNDVDLIKRLGPSAMDQIMLYLAFSAMRTSGHRHGAFLDAAAT AAKCAIYMTYLEQGQNLRMTGHLHHLEPKRVKIIVEEVRQALTEGKLLKMLGSQEPGY LIQLPYVWMEKHPWRPGRSRVPGTNLTSEEKRQIEQKLPSNLPDAQLITSFEFLELIE FLHKRSQEDLPPEHRMELSEALAEHIKRRLLYSGTVTRIDSPWGMPFYALTRPFYAPV DEQERTYIMVEDTARYFRMMRDWAERRPNTMRVLEELDIPPERFEKAMEELDEIIRAW ADKYHQDGGVPMILQMVFGKKED" gene complement(21251..21949) /locus_tag="DP116_04845" CDS complement(21251..21949) /locus_tag="DP116_04845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318188.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04845" /translation="MLGKRKSKNTHTLGQIAGHTAQRLERSADALKAVIFRATAVLAT PVFAVTGWFAMTSSGMAVTPTYGNDFRVCAGRLLSVGVAAQAASIACAEALRPGDLSA CVTGIGRQTQIAASEALASCRQARRPEQLGSCVVGISRYSREAVGPEVLNYCGRSLLP VRFAQCVVGLRAEIDFAPTQAMEACIDASDKVSGFLPSFIPSNRQPTDFRPTFESNPI PSQPSQTPANPSRK" gene 22217..22825 /gene="rpsD" /locus_tag="DP116_04850" CDS 22217..22825 /gene="rpsD" /locus_tag="DP116_04850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009342824.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S4" /protein_id="PRJNA477356:DP116_04850" /translation="MSRYRGPRLRVIRRLGDLPGLTRKSARRAYPPGQHGQNRKKRSE YAIRLEEKQKLRFNYGVTETQLLRYVRKARRVTGSTGQVLLQLLEMRLDNTVFRMGMA PTIPAARQLVNHGHVTVNGRVVNIASYQCRPGEEIAVRNREASRKLVEANLQYPGLAN LPNHLEFDKNKLLGKVNSVIEREWVALQINELLVVEYYSRQA" gene complement(22905..23894) /gene="moaA" /locus_tag="DP116_04855" CDS complement(22905..23894) /gene="moaA" /locus_tag="DP116_04855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952727.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GTP 3',8-cyclase MoaA" /protein_id="PRJNA477356:DP116_04855" /translation="MNQVDYLRISLIDRCNFRCQYCMPEGAELDYILKQQLLTDKELL TLIEEVFIPVGFTRFRLTGGEPLLRPRVVELVKAIASLPQTQDLSMTTNGFLLAPMAQ NLYDAGLRRVNISLDSLDPDTFDQIINNRGRPRWEQVWQGIHAAYRVGFDPLKLNVVV IPDVNDDEVLDLAALTIDKQWHVRFIEFMPIGNSQLFGDRSWIPSEELRQRIRQRWGL TESQVRGNGPADVFQIPGAKGTLGFISQMSECFCDRCNRMRLSADGWLRPCLLNETNQ IDLKTALRTGISTAKLREQVRDLLAIKPDINFKQRYSGTETGVYTRTMSQIGG" gene 24265..25524 /locus_tag="DP116_04860" CDS 24265..25524 /locus_tag="DP116_04860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009783093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_04860" /translation="MLIMNYRYRIYPDATQQTVLFEWMEISRNAYNYALREIKDWCNS RKCMIDRCSLEQEYILPADLKFPSEVQQLNALPRAKKEFPRLGEVPSQVLQQAIKQLH RAWECFTERGFGFPRFKKYGQLKSLLFPQFKENPVTGKHLKLPKIGLIFINLHRPIPD GFDVKQVRIFKKADRWYASVCIQCNVSVPDSKPHGHPVGVDVGLEKFLATSDGDLVKP PRFFQTMQSKLKLLQRRLSRKQKRSKNYEKQRLKVARMHHTIDNTRKDFHFKQAHALC DTGDMIFMEDLDYSKMAKGMLGKHMLDAGFGQFRTITKYICWKRGKFFAQVDSRGTSQ ECPECGAVTKKDLKTRVHHCLVCGYTTDRDVASGQVIRNRGIASISTPGLGGTKTACA VDLPGTKISSSRQVTKSRKRKTRNAKL" gene complement(25897..27162) /locus_tag="DP116_04865" CDS complement(25897..27162) /locus_tag="DP116_04865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873000.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04865" /translation="MNDEFYNKGLERAKQKDYAEAIQEFTRALQLTPYFAEAYYQRGL AYYDLGEMLNAVSDFTEALKLNPQSVEAYYCRALGRMALKNLPGALADVDQAIRLNVN YAAAYNLRGTVHRKQGYIQDAIANFKKAAELYLQQKDAENCRLCLEKIKQLQPKEKPA FVQPSPIIAPLKSEKEYFTQLLEKAEKGDTREAMEDLNWALRVDPQDAQAYCCRGVVR CKLGNYREAISDFNQALRLNFDDAIVYRNRGKARSYLGDHQGAIADFNSALQMQPQDA MLYIARGNAYRVMGNYLGAINDYTQALQINPDDATAYYNRGIAYTCLEEMERAVEDYQ RAASIFCEKEDWGNYQQVLDSLKKIHSPSSHSEKAKYNLLRQRLLRLVGGYWEMAQRL IDQAKYDYPGMSEDWYLQKVIGDWERDRG" gene 27281..28069 /locus_tag="DP116_04870" CDS 27281..28069 /locus_tag="DP116_04870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317447.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA/RNA nuclease SfsA" /protein_id="PRJNA477356:DP116_04870" /translation="MTNDYLYRYPPLYPGILLKRYKRFFADVELASGEIVTAHCPNTG PMTGVSTPGSAVQLSYSDNPSRKLPYTWEMIQVHDNEPTWVGINTNLPNRIIKLALET YLFPELGNYSQIRPEVSYGENKSSRVDFLLSPHTNLNVPSGDLFLKSDHLLLTKNACP IYLEIKNTTLAQGKLALFPDTETTRGQKHLRELTALLPQNRAVMLYFINRSDCTEFAP GDSTDPVYGKLFRLGIELGLEILPCRFEVNPEGVRYLGLAECQF" gene complement(28120..30366) /locus_tag="DP116_04875" CDS complement(28120..30366) /locus_tag="DP116_04875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316186.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_04875" /translation="MKILLVEDNPGDVVLLEEFLQDVSSVKFHLKQAEQLDEALRFLE QESFDVILLDLLLPDSQGLETFIKIHHQVPAIPIIVLTGFDDETLAIKAMQEGAQDYL VKGQVNGDLLVRSMRYAIERQRTEEARRRSEERFRVALKNSPIVVFNQDQDLRYTWVY NPNFASTPEEMLGKQDSDLLRAEDAEHMTIIKRGVLTTGIGTKEEVSITTAQGTKYYD LTVEPLHNESQEVVGITCASIDISERKLAEEQIRQQAALLDVTTDAIFVRDLDNCIIF WNQGAENLYGWQAQEVFGKNASQVLYKEQSSEVEAAFSIVISKGQWQGEVTKVTKSGK EILIGTRWTLVCDQNGKPKSILTVDTNITEKKLLEAQLFRAQRLESIGTLASGIAHDL NNILTPILAVAQLLPLKFPHVYEQDNHLLEILENSAKRGAELVKQVLSFARGVEGKRI TLQPKHLIREVTKIIRETFPKFIEAYADVPQDLWLVSGDGTQLHQVLMNLCVNARDAM PDGGTLSICAENFFIDESYAQMHLEAKTGPYILITVSDTGVGIDHEIIDRIFEPFFTT KEQGKGTGLGLSTVIGIVKSHGGFVNVYSEVGSGTSFKVYLPAVQGIETPPLVMVEVF AGHGEVILIVDDEPSIQEITKASLEAYNYKILTASDGIEAIALYAQRKNEISAVLIDM MLPALDGFTVMRTLQKINPQVKILATSGLMFTTKLATVGNGVKSFLPKPYTVKELLQA LQQVLHQK" gene complement(30608..31033) /locus_tag="DP116_04880" CDS complement(30608..31033) /locus_tag="DP116_04880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859993.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_04880" /translation="MPIEILLVEDNPGDVQLTQIALEDSKISVNLSVAADGVEAIAFL RKQENYTQVPTPDLILLDLNLPRKDGREVLAEIKADQILKRIPVVVLTTSGAEEDVLR AYNLCANCYIKKPVDFDQFVKIVHSIESFWFTVVKLPPE" gene complement(31048..33573) /locus_tag="DP116_04885" CDS complement(31048..33573) /locus_tag="DP116_04885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316188.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PAS domain S-box protein" /protein_id="PRJNA477356:DP116_04885" /translation="MVMMTHNEAARLEALRQYQILDTEPEETYDNIAQLAAFICDTPI VLVNFIDENRQWFKAKIGLDVPEMPRSVGLSYLCQERRDVVVVYDTWTDEKLARNSVV TSYPYVRFYAGVPLITPKGDMVGTLCLIDQVPREISHKQVEALVALGRQVISELELRR NLAEVSHFAEELKRTKEKLAHSENLLRTIIESEPECVKLLAKDGTLLEMNPAGLAMIE ADCINQVQGSCIYPLIAPEYRQAFVTLTEQVFQGESGIQEFEIIGLKGTRRWLESHAV PLRNSDKNIIALLAVTRDITERKRTEASLRDREQHLKLALQTAKLGSWELDLKTGDLS CSKQCKANFGLPPEIELSYDTLHERVHPEDRAHRQEAVRQALEERKDYEAEYRNIWSD NSIHWVLARGAGIYDADGSPTRMIGVTLDITARRQAEEELKRQTKRSQLFAEITLKIR QSLQIEAILQTTVIEVQKLLHTDRVMILRLWGDGSATVMQEAVLPGFPVLLGKNILEH CFEQDSQELFRQGRVSAIADIETANIQLCYKEFLSQFGVKANLVVPILQRENLWGLLI AHQCECPREWNNVEIELLQQLADQIGIALSQAQLLEQETHQRQELARSNAELEQFAYV ASHDLQEPLRMVASYLQLLERRYKDNLDARANEFIGYAVDGALRMQTLINDLLSYSRV STRSQPFEPVDCRFVVNCVLANLKVALEESNAVLTYDTLPEVMADATQLSQLFQNLIS NAIKFRSQQPPQIHIGVERIDQKWQFAVRDNGIGIEPQYTERIFVIFQRLHTRNKYPG TGIGLAICKKIVERHGGNIWVESQPQHGTTFFFTIPDTAGNKL" gene complement(33903..34412) /locus_tag="DP116_04890" CDS complement(33903..34412) /locus_tag="DP116_04890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015175888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR00725 family protein" /protein_id="PRJNA477356:DP116_04890" /translation="MRKIIIGVMGPGEKATAVDMQNAYELGKLIATEGWVLLTGGRNV GVMDAASRGAKSVDGLTIGILPCHDSQGVSEAIDIAIFTDMGNARNNINVLSSDVVIA CGMGAGTASEISLALKGNKKVILLSVDEESKNFFQKLASKNVYFVNDAENAIATTKEI LSPNKTSSI" gene complement(34506..35387) /gene="ubiA" /locus_tag="DP116_04895" CDS complement(34506..35387) /gene="ubiA" /locus_tag="DP116_04895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311837.1" /note="UbiA prenyltransferase family catalyzes the transfer of a prenyl group to various acceptors with hydrophobic ring structures in the biosynthesis of respiratory quinones, hemes, chlorophylls, vitamin E, and shikonin; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4-hydroxybenzoate octaprenyltransferase" /protein_id="PRJNA477356:DP116_04895" /translation="MLTTPEQNSEPIWLVIIRLLRWHKPEGRLILMIPALWAVFLAAA GKPPLPLVGVIILGTLATSAAGCVVNDLWDRDIDPQVERTRDRPLASRALTVKVGVIV AIVAMACAAVLAFYLNVLSFWLCVAAVPVILLYPGAKRVFPVPQLVLSIAWGFGVLIS WSAVTHNLSLSTWLLWGATVMWTLGFDTVYAMSDREDDRRIGVNSSALFFGDFAPIAI GIFFASTVFLLSWLGLVMYLRPTFWISLAIATVGWVWQYTRLGQQDLPNSAYGEMFRQ NVWIGFIVLAGMIVGSL" gene 35662..37311 /locus_tag="DP116_04900" CDS 35662..37311 /locus_tag="DP116_04900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315701.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Ppx/GppA family phosphatase" /protein_id="PRJNA477356:DP116_04900" /translation="MVNLVSANWESVPTQTDQQDRIIAAIDLGTNSLHMVVVQIEPTL PSFSIIAREKETVRLGDRDLETGNLKPEVIQRAIATLGRFQKIAKTLNVETIIAVATS AVREAPNGKDFLQRIEDELGLSVDLISGQEEARRIYLGVLSGMEFNNHPHIIIDIGGG STELILGDSHEARTLTSTKVGAVRLTTELITTDPISNIELKYLHAYARGMLERAVEEV QANLKDEESPRLVGTSGTIETLVIIHAREKVDSVPSTLNGYEMSLKDLQEWVNRLRKM SNSERAAIPGMPEKRSEVILAGAVILQEAMNLFGLESLTVCERSLREGVIVDWMLTHG YIEDRLRYQTSVRQRSVLTIANKFHVNLEHSDRVAVFALSIFDQTKGTLHYWGAEERQ LLWAAAILHNCGHYISHSSHHKHSYYLIRNSELLGYNETEIEIIANLARYHRKSSPKK KHESYRNIASKNHRQIVNKLSAILRLAVALDRRQIGAIVKVQCEYLTEQQEFHLKIYP SRADDDCALELWSLDYNKGVFESEFEVKLVANLELNIAAFS" gene complement(37422..38765) /locus_tag="DP116_04905" CDS complement(37422..38765) /locus_tag="DP116_04905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743653.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04905" /translation="MKKFFIPCLFFIGIAWTIFTQVSLPIHGESSTVASATRPRNPGY EVWAIDQADSLDGTVLGGNLFVFSGNDRDFLRGNAKVEQFNLAASAKKNNLAPGQKPH WITFNQGGTHAIVGHATTAHVYAIDANKREVVDSILPPGLPNSNSHAVYLSKDNKFVY VADTPGQRIHKIATNYEAPGGKIFGDVQTLDFNTPQTKTALGVPTSGATARPVVAVVD DTGKFVYVTFADGGVAIVNAETLTIAHIYSKDEATFNGLIAYEIGDNFVTNAGNADPQ IADFLYLYNNKSLLENPSKRPDFFKVPQSGNDVHGVTLVGGKYLWQVNRASNSITIHE INPKPFDPNVEGSNKARAVNLVDLVSDALGPDPTPDLIETSASEQVAFFTQRGPNPIS ANDPVFFNSVGIFPGLGVVEVENGGKSAKPAHLYRFDNIVGGKNIADFHALAVRK" gene 39765..40847 /locus_tag="DP116_04910" CDS 39765..40847 /locus_tag="DP116_04910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873785.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="EamA family transporter" /protein_id="PRJNA477356:DP116_04910" /translation="MQLKISASRLPIPPLLLLIAPFFLWGTAMVAMKGVIPHTTPLFM AGVRLIPAGVLILIAAALMGRPLPKGWVAWLWIALFALVDGTLFQGFLAEGLLRTSAG LGSVMIDSQPLAVALLSLWLFQEHIGFWGWLGLGIGVTGISLIGLPDQWILHFLNSGT IVETLSTASIQQLFASGEWLMLLAALSMAVGTVLIRFVCRHADPVMATGWHMILGGLP LWGISSTVESGQLQNIVPSDWFALLYATVFGSAIAYALFFYFASSGNLTSLSSLTFLT PIFALLFGNLLLQEVLSPLQWVGVSLTLVSIYLINQRETLSGLSKKVATSEKTMTQQQ QVLEASAKTINKVTLPVRKSESEMLP" BASE COUNT 11609 a 9119 c 8665 g 11502 t 10 others ORIGIN 1 taattatgaa actatctttt aaagatattc agtttattat tgaagcaatt gattacttaa 61 aaaaactata tgaatcccgc ctaaacaatg aaaatctaga tgatgatgag atatcggatc 121 ttggcaatga ttgtatgttt ttggaagcgc tgcgtgtaga tttagaaaag aatttaaagc 181 aagttgtacc acaaaatgaa aatagtttgc aatcagattt actaaattta tcagcacaga 241 atttaaaaca atctgttcag caattaccaa ttagtcagcg tttagttttg gtagatgcga 301 ttactgaatc aattaggcaa gaattatcgc taattcagag gtaattgatg tgatattgaa 361 attgcaaatc atacaggcga tcgcccgagg tgggggcttg cgcatcgcat tccatcccat 421 caactttaat aaaagcgtat cccaccaaaa tgcgatctac agcaaagcag cagcgctagc 481 cccgataggc gtagccgtgc cgtaggcata gggggcgcac tctcggacgc tatcgcacta 541 atccatcaat ccaataaatg cgtagacgca cagggcttat cgtcagacat cacactttct 601 cccatcaatc caaatgcgat ctgccgctat gctgcagcga agctatcgct ttgggacttt 661 ggatatactg aaaatagaaa gaactagcca agaacaatta gaatgcatga caaatcgtga 721 agaattgatc caggaactcg aacaagctcc tgatgatata gtccagtcag tactggattt 781 ttttcgtagg ataaaagcaa cacgcaaaac tcatcccctc gcaaaatttg caggtatttt 841 aagtgatgtc gaagcggaag aattaaaacg ggcgatcgca acagaatgtc ggcaggtgga 901 tgtgaatgag tggtgagatt gccctagata cctctgtggc agtgcgtttt ttgaatggag 961 acacagcgat cgtttcacgg gtactagcat tcccagaggt gattctgcct accgtcgtag 1021 tgggggagtt actttttggt gcggaaaact ctactcgtcc attacaaaat ttacctcgtt 1081 atctggagtt tattgaggct tgtgtagttc tgccattagg tcgagaaaca gcagtagtct 1141 atgctcaaac ccgactggct ttgaaacgga agggacgacc aattccaatg aatgatgttt 1201 ggattgcggc acaatgttta gagcagggtt ggatattggt 
gacagatgat acagactttg 1261 actatgttga tggcttgatg ttagagcgtt ggtagttcaa tctaaggatt tatggctcaa 1321 tcgtattcca acccaaacta aatgcgatgg tcgaagaccg ctcctccgga gcatcgcact 1381 ttcctcaaac taaccaaaat gcgtagacgc gcagcggtga agcagcgtga atgaccgctt 1441 tcccgacaga ggcgactgtg ttcgcgtagc gtctccgcag gagatacccg aagggcttgt 1501 cgtcagacat cgcactctcc tcaaaaccac cgaaacgcga tcaccgcagg tggcagcttc 1561 cgcatcgcac aaatctgaac atcacaaatg cgatcacact atcctcaaca tcacacttgt 1621 gatcgcactt taaaacttaa gctatggttt ttttacttcc tttgcggaaa atactgaatt 1681 tagaatataa ctatgtaata ataagcaaat aaaatgccac cttctagagg tattacatgg 1741 aaccatttac tgctggtgcg atcgctgttg gcactatagt tgccactaaa gcattagaaa 1801 aaacaaccga aaagggaaca gaaattttat tggataaagc cggtaagttc ttagttaccc 1861 taaaaaaaca ctctcctcat actgtaattg ctcttgagaa agcgccacaa aaaccactcg 1921 attatggcaa agctgtactt gaagtggaat ctgcggctaa ggctaatcgt gaagtggctc 1981 aagctgtaca ggaattagca acagcagcac aagctaatcg accatcaaac ttggttgaaa 2041 ttttgagaga aatcaaagca tctgtggaaa aatctcagca atcgtaccct tcaacattta 2101 tacagaacat tgaaaaagct attaatgccg cccaaaacca aacaattgat caaagatata 2161 gtacttttaa tgtctgacta cagtcgcccg aatgggaatg ggtaaagtta aatcccccaa 2221 aatatgtttc ttaccaaggt gtcactaact ttaaacgtag gaaattattg atatctgtat 2281 tttttacttt gattggtgcg ggaggggcta caatcttcaa cagtaaatta ggagacggtt 2341 ggacatcgag taaaaaaaaa gaagtcaatt cctctcctcc tactgtgcca actcaaaaac 2401 ctcctgaagt aaccccaaaa cctgaatctc ctcctcctat aaaaactaat attccagctt 2461 tgacagagga tagcttctcc tttgagacag tcaaagtcaa tgaacgaggt gaagttatca 2521 agcgagagtc gaaggctaca tcaatgtgga ttgaaggttt aggtgagcga ataaaactag 2581 agatggtcta tattccagct ggaactttta ttatggggtc tccaggggat gaaaaagggc 2641 gtcaaaggaa tgaggaacca cggcgtgaag tgacaattca gcctttttta atggggcgat 2701 atacagttac ccaagctcag tggaggtatg tttctacact accaagagtc caaattgatt 2761 tggcatcaga tccatcttat ttcaaaggtg atctacgacc agttgaacga atctcctggt 2821 atgaagcaag ggaattttgt gctcgtctta gcaaggcgac taatagggaa tatcggttac 2881 caagcgagtc cgaatgggaa tatgcttgtc gggcagggac aacaacgcct 
ttttactttg 2941 gtgagaccat tacaacagag ttagctaact atgatgggag cctcgtatat ggtcgaggat 3001 caaaaggcat ataccgagaa aaaacaacgg aagtggagac gtttccagcc aatgcctatg 3061 gtttattcga tatgcatgga aatatttggg aatggtgtgc ggatcacttt catgaaaatt 3121 acaagagagc gttaagaaat gaaactgcct ggttatctag tgatgagaat gctaggagag 3181 ttatacgtgg cggctcttgg tatagctcac caaagttttg tcgctctgct aatcgaactc 3241 ccctcccccc acacgaacgt ggtgatccat cggggataga agggactgga tttcgcatcg 3301 tctgtgtacc tgagagcaac ctttttcaat aacacaaggt atctaacaga aaagttaaga 3361 cacataacaa ttgaagaact tataaagaag tatgccctag aaactagagc atacctttct 3421 cacttttatt ttctgaaaaa tgcggaaatt tgacctcttc gcaccccaga aaatctttac 3481 tttctaccaa tttcgctcag tgattccgca gcatccgcag cctcacctgc acctacagca 3541 gatagtgcca cagtaacaag tactgcacca acttttaatg ctgcgggtag ccaacctcca 3601 ccttccgaag gagcactagc agcattctta gccacatcct gtatccgatt attcatagac 3661 atagatttac tccattgctc aacagtttta cggaacaatc atagttatta atacaaggaa 3721 ttatgtatgg gttccggaaa atcgcgtatc ccagttttct gttccaatga tacttctacc 3781 cctaactatt ccaaattttg cagtcccatt gcaatcttgc tgtgcttttc tatttgtccc 3841 atcacttgtt ttgcccgtgc aatcacaacc tttggtaaac cagccaacct tcccgcttca 3901 atcccataag acttatccgc acctcctggt tggacttggt gcaaaaagat aatttggtca 3961 gctaactcct tcacagtcac ctggtaatta gcaatatttg gcaacagcga agccaactca 4021 tttaactcat ggtagtgcgt tgcaaaaatc gtccgcgcca gaatctccgt tgcaatgtac 4081 tccgccaccg cccaagcaat agaaagacca tcaaacgtcg cagttcccct gccaatctca 4141 tccaacaaca ccagtgatct cgacgttgca tggttgagaa tattcgccgt ctcattcatc 4201 tccaccataa aagtagattg accagtcgcc aaatcatcta ctgcacctac acgagtgaaa 4261 atgcgatcgc acacccccaa cctagcagat ccagcaggca caaaactacc aatctgcgcc 4321 atcaactgaa ttaaccccac ctgacgcaaa taacaactct taccactcgc attcggacca 4381 gtaaggatga taaggtcagg atggagggag tgagagagtg agggagtgag agaagtttcc 4441 ttccctcttt ctctccctcc tgtttccctt nnnnnnnnnn cctttgtttc cttgttccct 4501 tctcctcctc tttctcctct gcgttctctg cgcctctgcg gttcgtttct cttcccagct 4561 gcgtcgaatt cggtacaaaa aaccccgcag gtaaagactg ttccaccacc ggatgacgtc 
4621 catccacaat cataatttct cgtccttcca ccatttgggg acgacaatac ccttgataaa 4681 ccgccaactc agccaaacca cacaacacat ccgccgccgc taccgcacga gagatattgc 4741 gaataatttc tgcttgtgcg cccacttctt cgcgcaacgc cacaaaaatc tcatattcca 4801 actgattcaa atcatctcgc gccgtgagaa tccgcgcttc ccgttccttc aactctggcg 4861 tgatgtaacg ctcctcattc accagcgttt gcttgcgtat ataattagac ggcacttggt 4921 cagcttttgc acgagaaatg ctgatgtaat aaccaaaagt tttgttaaat cccactttca 4981 gcgtcggaat tcccgtcttt gctctctcgt caacttccaa attggcaatc catttttggt 5041 catcttccac agtcgcacgt ctctcgtcca attgagcatt caccccaggg cgaatcaacc 5101 cgccttcttt gatatgtata ggtggtgact ccaccagatg agcgcgaatt ttctgtgcca 5161 attcttctaa aacaggtggt accttctgca atgctttcag aaatggagaa tgagcactac 5221 ccaccaagcg agataattcc ggtaaacgag agagtgagtc tgccaaagca actaaatccc 5281 ttgcatgagc agtaccagaa ccagcgcgtc ccgtcaaccg ttctaagtca taaatttgac 5341 gtaacaactg ccgtagttct tgacggagag ttgtattttc taccaattct tggatagtgt 5401 cttgccgaga acgaataccc ttaacatcaa gtaacggttg caataaccac cgtcgtaacg 5461 cacgactgcc catcgctgta ctagttctat ccaaagccca cagcaaggag ccgacaaaac 5521 taccatcgcg gactgtttgc gtaatttcta ggttacgtcg agtttgataa tcaacaatta 5581 aaaagtcagt gagagtatag gtgcgtaaca actgtagggg aatcgcgttt tctttttgtg 5641 tatcttctaa atattccagc aacccacctg cagcgcggac ggctaggggg agttggtcac 5701 agccaagtcc ttctagggag cgtaccttaa atttttgtag caatcttgct ctggcttctg 5761 cttgagagaa gggaatttgc gatcgcaaac aataacaaaa agacggtggc aaacattcag 5821 gtaaatgttc cgatttttct cctgcacgca gcagtaaact tctcaagtct ggtgcgttcg 5881 tcggaacaag cacttccgca ggttgcaacc gcatcagttc ctgtgtcaaa tgttctaaat 5941 tgctaccttg cgtagtgagg aattcaccag tagaaatatc tgcataagcc aaaccccaat 6001 gctcaagagc catgaccaca gccgcaaggt aattattgcg actggatttg agcattcctt 6061 cctctagcaa agttccaggg gtgaggatac gggtaacctc ccggcgcacc aatcgaccag 6121 ccgcttcagc agcatcttcc acttggtcgc aaatcacaac cgcataccct ttttccacca 6181 actgcgtcgc gtggcgttcc caagcgtgat gcggtacacc actcatcgcc actcgtccta 6241 cgtcgcctac aagcttgctg gtgagggaga gttctaattc ctgcgccaaa gtcacagcat 6301 
cttcaaaata acattcaaag aaatctccca ctcgatacag tagcatcgcg tgaggatact 6361 tatccttgat atccacataa tgctggaaca tttgagtcag cttactgcga tccactagcc 6421 gatgatcggc gtgaggcgca gttgggttac tgggttgggt tgtttgggat gcagagtaag 6481 aagcgctcat cgatgtctac caatacacat gacaacctgc caatatccga tgataccttg 6541 atatcgagga tactctcaaa ttgttcaagg tttttaactc ttcgacgact ttgcctagga 6601 atctggatag aaactgttaa ttcttaacac tccccacggc tatagcaatc ctagatgatt 6661 tgtaaaaaaa gaatttgatg aaattaaccg cagaggcgca gaggagccag cgccgtgcgg 6721 gggttcccca acgccagacg ccataggagg gtttccctcc tatggcacaa agtgccacgc 6781 tgcgcgaaca gcgctggctc ccgttgaggc gtctggcgtg gcgcagagaa agagaagaga 6841 gaattttatg aatgatttag gattgctata tcaaaacaag gcgttagctc gttttgactt 6901 aaagctgggg gattcttggc gggggcgtcg gaacttgcga gggactggct tgcggctatt 6961 atagcgatac tcaatttcct tcctacaact gacaaattgc tttcagaact tgtaacaata 7021 agattgaacg ctgtctttcc atcaaattta attctggcat caagtgataa gttggaagat 7081 gtccctgttg gatattgaca aactgataac ttcttccatt ggttgtcaaa ccccaaactg 7141 attcttgatg ctctaaactt ttataagcgt atgtcagcag ttgaggcaat cctgttgata 7201 tatcaatctg actattttta gattcaatta ataataccca aaaaaatgca ttaacttttg 7261 tacgtttcgc tttggtaata gctaagatat cgaacctacc tttaattttt gtatcttcat 7321 cttcaatttc tatatcagct atatcctctt ccagaagaat ctctatagga tagcggtaaa 7381 aaccagccaa cctcatcaaa ggagcaagta caagaaactt tacctgtcct tcggaaactt 7441 tgccagtggt gagatatcta tcaaagtcat tccgaatttg ctcaagttcc tgttgttcgt 7501 cttgtgttat gggttctagc gataacagtg gagtgaatga gtcagtgtac tgtttttgga 7561 agcctaaaag gcgctgaaca tcttccaatg ataaattgct agctttgagg attgtcatag 7621 ttatgttgtg cgtgttgtga agacttattt tcctctcaat tttatagcaa ccagactttt 7681 aatagttatc aagcaagcgc agtatcagtt ccatcccttg gtgaatgatc ggttcactac 7741 tcgaaacatt cacgcattat aaacttaaaa gaaaaatatt tttttaacaa aagaactcag 7801 gaatcagtcg ccagaaatca gaatggattt ctgtgcgact ggcggtgatt cgggggttta 7861 aataccaata catttgattc tgtctcctga attctgttag cgtagcggga cgcagtccat 7921 tctgacttct tttttaagct tagatccaaa gcgctttgct gagaacgcga tcgcacgcag 7981 tgcccgcccc 
tttggggcga tgctccagca gggggcctta ccattggtga tcgcgattat 8041 gtaagcgcaa agcccagcaa tcaggcgtga ttcgcgcgtg tctagtggag gagatacgcg 8101 cttcgccatg tgggcatact ttgaattaag cgcagcgaat ggcaagttga aatcttccaa 8161 cgcgaaagat ttctgtccgc agcacgcttc tgtaggactg gtttcaccgg ctgtctggta 8221 actgttcact gattttagag aaatttctaa atttcagatt tttctttgcg gaatgcacta 8281 atcaagcagt gattctaagg atacaaacac attgaggaat gacaatgacc tttacaaatg 8341 aagctgcttc tgcccaactt aacaagcctg ttctcaaaga aggttccaag ggtgacgctg 8401 ttaaagaatt acaaaaactg ttattaaaat ggggtgcttt tgtatctctc gataacaacg 8461 gtgcttgtgt atttcctggt gaagaagtta ttgacggtgt atttggtccc aagacaaaaa 8521 atgcggttat attcttccaa ggtaaggtgt ttcttgtaca agatggaatt gttgcagata 8581 aaacttggcg agcacttttt aaaaatgcac cagttgatat gcctatcctc aagaaagata 8641 gcaagggaga acttgtaaag aaggttcaag aaagactgga gatcggcgac tactacaacg 8701 gtaaaattga cggcgatttt ggtaacagta cagaaaaagc agtcaaagct ttgcaaaagc 8761 acacagggtt acctgtggat ggagttattg gcgatcgcac ctggtttgag gtaagccaga 8821 ttaatacaat tttctgttaa ttcattcgcg ctcaaagacc ctgtttcttt gataaacagg 8881 gtctttttga ttgtttaact tctaggaact aacaccaagt ttgtacaacc cgacctgtca 8941 attgctgtcc cagccaaaat gtattactag aaagtgtatg taaattttgc ttttcgactt 9001 tccagttttg ctgcggatca aataaagtga gttccgcttt ttgatctggc gcaatcacac 9061 ttattttctg ttgtaaacac tccgtcggac gagtactaag agctttccac aattctaatg 9121 ctgtaaattc tcctgtctct actagatgct gccataaaag gggtaatgct aactcataac 9181 caattgcccc tggtggtgct tcggcaaagg caacggtttt ttcttcgtag gtgtacggtc 9241 tgtggtctat ggctattgca tctataacac ccgtccgtac tgcctgccgt aacgctgcta 9301 cgtcattagc agtccccaaa ggtggttcta aacggaggct tgtattatag cttttcagtg 9361 cttgcatgtc aagtaaaaga tgcatccaag tggtactgac ggtaatcggt agacctctcg 9421 ctttggctga tgcaatcagt tcgacactgc gagcagtgga aacacgcatg atatgtactg 9481 gcgtacttgt ggcggtgact aattccaaca aagcggcgat cgcggaggtt tccgcgctag 9541 aaggtataaa aggcagccca aaacggatag catctggacc ttcccggatg acgccatttc 9601 ccatcagttg gcgatcgcaa caccaaaatg cgactggttt cctcaaaggt tggatatatt 9661 ccaacacccg acgcaccaac 
ccaaaatttt cccaaggctg actatcagta aagccaacca 9721 ctccaactgc tgctaagtct accaattcag tcatctgctt accagcaata tccaaactga 9781 ttgcacccca aacattgagc aggggagctg ttgagctctt gagcagggga gaagaatgaa 9841 aactcgcttg tgcccctgtg tttttttgca actgtaccgc cacagcggga tgatcaatga 9901 ctggggatgt atcgggtaag atactaattc gtgtaaagcc gccagcagca gcagcttgca 9961 agagagacga gagggtttca cgttcttcaa acccaggttc tccagagtga ctatacaaat 10021 ctaccaaccc tggtcctaaa accaatccat gacaatcccg gacatcagtg tcatttggca 10081 tatcagaaat gtgattcatc acagacctga tataaccatc ctcaatgagc acatcagcaa 10141 tttggtcagt tccagaaaca gggtcaataa cccttacttg ttgtagcagt tcacttgtca 10201 ttttcgctat caaccatcag ctattagcta tcagcttttt aacttttata tattgttaag 10261 tgggagtttg tgttacctaa ccctagaatc aactagcgac taaaggctga cagctaatca 10321 aaagacgcta caatgcccca gccgctgttt tatcgagcac accaccaccc aagtgagtcg 10381 taatattcat agcttgcagt actggtaaac cacccgcttc caagggatag tcgataacgc 10441 tgactgcacc attaccctgg cggaaattcc aaaactgttc tgatgataaa cctagaacgt 10501 gacaaagcaa agttttatta gtcgcatcat gagcaacgac taagccagtt ttgagttgct 10561 ttgccaatgc agtttggaca atggactccc acgctgcgac actacgttcc cacacttgct 10621 gcaaattttc tccttctggc atttggactt gtcctgggac tgttcgccag cgatgcaact 10681 ctcctgggaa ttcctgttct atttcttttt ctaattttcc ttcccaaagt ccgtgactaa 10741 tttcccttaa accatcctgc aattccaaat ttacatcagt atggtactgc aaaataattt 10801 ctgcagtttc ttttggacgc accattgtgc tactgactgc aaaatcaatt gctatatctt 10861 tgagaaattg agcagctttt tgtgcttgct gtctaccatt gtcgttgaga gggacatcaa 10921 tttgtccctg aaatctggtt tggcggttcc actcggtttc tccgtgacgc accagtaaca 10981 accttacacc attgtgattt ggacgtaagg aggggagaat ttcgcctaga tgttgtgtct 11041 gattcagaga ttctagttga actggttctc ccaatcctcc agaaaaatta agtacactga 11101 tgcaacagtt agattgttgt atggaatggt agcgttctgg agagattcct agcgcagtgc 11161 ctaaaagagc acgattaatg ccattgtgcc ccacgataag tattgtttcg cctcgatggc 11221 gcagtaaaac ttcttgccaa aattggcgcg cctgttcata caaagccaga acaggaaaat 11281 gttctcttgt ttgctcacca tcttttacag acatccgcag tagatgagga ctcacctgcc 11341 aaatgcggta 
gtcttcgcga aacttctcct taacttcagc agagagcatt ccttgccata 11401 aagggagatc aatttccagc aactgttcag aagtttgggg aacagcagac tgctgagcat 11461 gaattgttag ttcatgatgg ataatctctg ctgtgtcttt tgcacgttgc aaaggactgc 11521 tgtatattgc gttaaatagt atattactga gggttttacc tacaagactt gaatcactac 11581 gacctttttc agttaacttg gaagcatccg agcgaccttg tatgcgcttt tcggtattat 11641 aagtactttg accgtggcgc acaataatta cacgagtcac gttttaccct cctgtatctc 11701 aaggactcat tctactgcaa agtgttagca gattgaccct tgcgactacc ccaatacctc 11761 agttctaaaa cgaacaaccc tccagagagc taggcatttt cgcccaaaat ctggagggta 11821 tttgagtttt tggcattttt ttggtttttg gagatttgcg cgaatctagc aaaaattagc 11881 ttatcgtttc tgcctgttgc tatctctatt caagtcaaat tggacttgca ctccatcatt 11941 gctgtagaac gactcatctc cttggggaaa aatcacatta tttggagttg tgattggttc 12001 tctgcggcgt aattcgattt gatcaccaag ttccaaaact tcaacctcag atgtgctttc 12061 ttgataattt cttcgtggaa cattattttg agaatttcgt tctgcaaaaa accttgcgaa 12121 atcgtccttg gctgttctac ctatgccagt taaagaatcc ccttgaattg tgtaatttcc 12181 agtttttgag tttggtgcaa ctgatgactg tgcgtgaact ttgctttgta atccagcaat 12241 ggctgctact atagccaagg cggcaaatat cttagagttc attgtttttc cccaaatttt 12301 atattatttt taggacaaga ttgtataaca aagctttttt actctgatta tagagtaaac 12361 ttgacaaaca aaaggtactg aaagatacac ttgttaaagt tgatcaaatg cagttatgac 12421 cagaaagctg aacgaaagta cgacattttt catggtttct ggttgagaac gcattacggt 12481 tcttgacagt tttctcttaa agtttcctgc tcaattgagt tagatcgcta gttttcatca 12541 ttgctgttat ctccaatgac aaatcaataa ctacttttat aggttattaa atatttgtaa 12601 atttttcatc tcatttgaga gatatgtcta caagcatagt tgagatttgc tgggatttac 12661 tggtttccaa agcgaatcaa taacgccaac actatagcgg ttctcgtttg agtcacatac 12721 accatgctat gtagagaccg tacagtaagg tctctactgt tttggttgac gtaaaaatga 12781 gtcgatacaa gcattctcgc cttaattttg cgcttgaatg atatagtacg aaaaaaaaac 12841 cagtaatgcc ttttaatacc caactgctac cccgcatcaa tgctaaatca atgcgggaaa 12901 atggtaaaca atattacgtg gatactcagg gaaatcgctt ccctagtgtg acgacaatac 12961 ttaacgccac caaaccgcaa gaagaccgtg agcggctgtt aaattggaaa gcacgtgttg 
13021 gaagtgaaga agctacccga attacaacag gagctagtcg tcgaggaacg caaacacaca 13081 aacaaattga acgttatcta ctcggagaaa atcctgcttg ttctgaagcc agtcgtcctt 13141 attgggaaag tatcaaacct gttttagaag aaattgatac agtcaaactt gtagaaggct 13201 cagtctttca ctacaacatg agctattccg gcaaagtaga ttgtgtcgca agttacaaag 13261 gtataccctg catttgcgag tggaaaacag cagacaaacc taaaggttca gttgagcgtt 13321 tatatgaaca tcccttgcaa ttaacagctt atttaggagc agttaaccag tgttatcaag 13381 aatatgatat tgaagtgaat cacgccctgt tggttgtagc aataccagac acagaagctg 13441 aggtattttg gtttgagcca gaagtcatga aagattactg gatgcagtgg gaacaacgag 13501 ttgctgagtt ttggaaacgt agaaacactt ggggttggta atggggcgaa aacctgaaac 13561 gcgaaaattg gacaataatt acgaagcaat ccagacattg tccaatttta tctaagtgtt 13621 gtagagcagt ttatgcatta actacacaac gaaacccaat atgagacgtt gaagtatcta 13681 ctgtttctgg atgacgcgcg gctggtcgat aacgctgaca gtagttaggt gcacacagaa 13741 aggaaccacc tttgagtacc tttctaggta tttgggatgg tgtatttgga tcaaggctct 13801 cttgttgggt tccgcctctg gggttgactg gaatacagca agattttgtc ttgttttccg 13861 ggtgagaatc tcgataccag tcagttgtcc actcccaaac gttgcctatc atatcgtaca 13921 aaccataacc atttgcagga tacgagccga cagattctgc cccaggagag cgcgacttta 13981 gattttgcca aggaaactca ccttcccaag tgttagccat ccgcctacct ttgggagcaa 14041 attcatttcc ccaagagtag acagcaccct ccagcccacc gcgagccgca aattcccact 14101 gtgcttccgt tggcaacaat ttgcctgccc aagctgcgta agcttcagca tcttcatagg 14161 caatatgtac cacaggataa ttttctcgcc ctttgataga acttccaggt cctgttggat 14221 gtcgccagtt tgcacccggt acataaaccc accagctaca agtttgtaaa tgaactggat 14281 gtttcggctg ctgaaataca gctgagcctg gtactaagag ttccggtata gctccaggat 14341 aatcttctgg ctttggtgga cgttctgcca ctgtcacgta accagttgct ttgacaaaac 14401 gttgaaactg tttgttagta acagcatagc gatccatcca aaacccatct acacggacaa 14461 gatgggctgg actttcctct gggtaatggt ggtcggaccc gatcacaaaa gcaccaccag 14521 gtatccagac catgtcttta tcgggtgctt taccaggacg cttgatcgcg atttgcctgt 14581 cagatttacc agatgaaacc tcttgatgta ctgttgtacg ttcctgcatg aatgactttt 14641 ccttatgctg ggtaaggtaa gacttgcccg ttccattcct 
caaagcgatc gcgcaaccgc 14701 tgcaatactt taggattgtt atccttgagg tcagtttgtt cgccttcgtc attggcaaga 14761 tcaaacagat attccttctc tccctgacgc aagtacttcc aatccccact tcggactgct 14821 gcttgaagtt gccctgaatt cccaccaaca ttagatttat agcgccagaa cagcgttcga 14881 ggagaaacgg gttttctgcc cagaagaact ggaagcaaat tctgaccatc aagtggatag 14941 tttgggtcgg gtgaagtacc tgttgctgcc agaattgttg ctgtcaagtc gaaagtaatg 15001 atgacttggc tgttcacttg gtttggttga atcactccgg gccagcgaat aaaagtgggg 15061 actcgcaaac caccttcata caaactgccc tttttccctt gaaaaggtcc aatatcagag 15121 tacctctcac ccccgttatc actgacaaaa atgactatgg tgttatcagc ctgcccagaa 15181 tcttttaaag cttgcaagac tctaccaact ccgtcgtcca aactttttag gatcgcagcg 15241 taagtttctg gtgaacctcc ggctgtgaaa gcatttgaat tgtagaaagt tcgactcaac 15301 tcttcatctt ctggtccttc ccagggccaa tggggcgcat tgtaatgaag actcaggtaa 15361 aacggttgat tgcggggtct agtgaggaat tcaacagcac gttcagtgta caagtctgta 15421 gcgtagcctg gtttatctac acgcacctca ccttcataga agtcaagttc tccatctaga 15481 cctttgtggg taaagtagtc gatcgcccca ctataatttc caaaaaactc atcaaaacca 15541 ctcttaagtg gactgtaatc tggaagaaaa ccagcgtgcc atttaccaac caatgctgtt 15601 tgataaccat tagctttcaa aagggaagcg attgttggct gttgaggcgg taatcctaca 15661 gtgtcacttt caactagggg ttcttgaaga ccaactggca atcgcgctgg ataacgacct 15721 gtaaaaaagc caatccgtgt tggtgtgcaa acagtttgtg ccgcataggc attggtgaag 15781 cgtgtacctt cctttgccaa ctggtctaaa tttggcgttt gataggtttt tccataaata 15841 ctaaggtcgc cccagcccaa gtcatccgtc aaaatgaaca gtatgtttgg acgctttcgt 15901 ttggcggcga ctgcttcaaa aacgcttgag cctcctaatg tcaaaattgt gctagcaagt 15961 gcagtcccca caaactgacg acgactaatt gctggtagtt tgcgtttatt cattgatttc 16021 tcctgatgac tttttgatta cctacgttaa gcatggtgat tcgagcctga gtgatgaaac 16081 ttcagacttg ttggacacac atttgtgaca agcggctgaa aattggttgg tgctatatcc 16141 cggactatac tacggtaaac ttatcttgta acagtatctt tactacagac ataattgcac 16201 atataggata cgtgaagcaa gtatttttct aacaaattgt gtgggaaatg gcagaaaaat 16261 aaataagtag ccctcgctcc tgctttcaag ttaggctgga gtggattctg ttttaggaat 16321 aattgccttt caagcccttg 
ccgcaatttt tctctgaagg gagagatata tccattgaca 16381 gaaaacagaa ataataggaa tataagcgcc gatcaaatca atagtaagct gcttggtcat 16441 caaaggtcag aggttaaaga cgaatggggc gatacgaaag gcgaccagat aacccacgag 16501 aaaaaggcga tccatatggg acagcagaaa atgccttacg ggctgtcact gaggaactgc 16561 aaattctcca acggaatttg cttaagtcgt tacaggaaga taccaagcgg ttgcaagcag 16621 aaaaagaccg cttaactgaa gacattagac agctgcaaga ggaaaaggaa catcttcaac 16681 aggcgtatca aattaatgag cagcaaacgt tgattcgtca gttgtcacaa gtattagcta 16741 atcatatatc tttgcagctg caatcttctt tggaaacttt agctacccaa gctatggaac 16801 gtgtttccca accagtggga agttctgaac caagagaagc cacaagtcac tcaaccagtg 16861 aagtaaatga gtatgcggaa aaactcctcg gttctttgga tgacaccctc accattacct 16921 ttaactcact gcaacaggag ttgaaaaact atcaaagcgg tctttctcaa cagttatccc 16981 gaatgatagt ccaacaaagg gaaggcgaag caattttaat agagttagtg aattgtctcc 17041 gtagagaact caaagaaaca acacaaaatt caccatctgc aattgtacca gttccctctg 17101 gtgagattga acaaattcta caacagcaac agctgacttc tgaggaagta ccaacaaagt 17161 tgcaaacaga tattcctcca gaaactacag ttttacctag gaaattaccc caggaagaaa 17221 cccctgcttg gaattctctt tcccaaagta tagcctcgcc aagcgaaaca actcagtctc 17281 ctcctgtgtt agaagaacca actcccatcg catctccaag acgcgccgtt ttagagccag 17341 aaacatctcc tccaccagag ccacaactca caccaaggca gaggggaatt tcttctttac 17401 aaatcacagg aattatgctg cttgcattct caacagtcgc gtcagcgctg tacaacgtag 17461 ctattaaggt gattttcctc cctggctccc aaattttcgg tgtgtttgat gcacagccct 17521 tgatctcacc taatttgggt aattcacttt tcattttgac tctcaggatg ttggtggttc 17581 ttccgctaat gttgcttttg gctcccatgc tacattcacg agtatggcaa gatctgcaat 17641 acctgggtga ctcagttcga gggaactcca gtaatccgac tacaaaacga gtgttgatac 17701 tgtcaattgt tagtggatgc tttttgtttc tctctcaggt actcatctac cttgccatcg 17761 gtcaaattcc gactgggatg gcgatcgcac ttttctttgt ctatccaatt atcaacggac 17821 tgctgtcatg gtttttgttc cgcgatcgcc taactttatt cagttctttt gctatagctg 17881 ttattggcat gggtgaattg ttcgttttag gaagttctaa cagtagtgtc attggcaata 17941 ttcgtatcgg cagcattgca gcaattgctt caggaggtgc ttttgctttg tacctcctag 18001 
tctcccgcat gtgtgcagcc aaactgcatc cagtgtctct taccttaatc aacttcgcta 18061 cgatgttgct gttgtctgtc ttcggtttga tactgccttt acccacaagc tggaacttgc 18121 aactaaaccg tgcttacttg ctcgaacttg ttttatgtgc tttcctactg ggtgtactga 18181 cgcttttcag ctatttgttc aataattttg gaattcgtaa aattggtgcc tcacgctcag 18241 caattatcgg tgccactatc cccgctctaa cggtcatttt tgctgggttg attcttcagg 18301 aaaccttgca acttgagcaa gttttgggag ttctgctggt tagctttgga gctggagctc 18361 tctgttttga aaaaatccgc aggactaaac cttttgatca gtcttcgcag tgaactaacc 18421 gcttctcgcc cccgccccta atccccaacg cccttcgggt atgtcctgag ccctatggct 18481 aacgccacgg ctccgtgccg ttcgcgaagc gtctccgaag gagatacgga cacgctacgc 18541 gttcgcctct ggcgtgcgct tgcgcgaacg cagtcgccta cggagggaga ccctcccgca 18601 gccggttgtc gtcaccgtca ttcgcgctgt ctcaccagtc gccacaacgg gggcgtgcgc 18661 tttgcgctta cgctccttgg cggtacgaag gaaaccctca aagcaacgcc cttattcacc 18721 gcacggggct agctcctccg tagggcgtgc tgcaggcata ggacttctcg tgcttagtgg 18781 agcttgcgcg cttccgagga aaccggagca acagacaact tctggcacat aagtgtcctc 18841 cgcaaccgaa gcatctgtcg cattcttttt gaaatcggta ttacccataa tggcagcagg 18901 ggaagcttta ccgagtagtt gttgagcagg ggagaaaaaa taataaattt tacaccggaa 18961 agccagtaaa catttagtta ataaacagtt agacctgaag attgactatc caacagctat 19021 gagttctaaa ctctgccgct aaatccggaa aatttacaaa aaagtttttc ttttacacat 19081 ttcatttgta tactgtcagc gattcaatac ttaccataag gaaggtttca ttcagaattt 19141 tgtcttcata agattgaggg gtgaaacccc acaccctctt attccactta ttaagatata 19201 aatttgtgtt ttttattgtc acaatgccta caagtgccta aatatcaagc ttttattaga 19261 caaataagtt gttattagtg actctttgtt gattaatttt cataattttg agtaatattt 19321 ataccataaa aaagacattt ttgtttaacg aaaaataatt atttacagtt aatagtataa 19381 attaatttgt aaatataact ttttttaaca agactcgttg aatattttgt atcttgtcta 19441 acattactat ctcagccata ttgataaaat taacaaaaat tgtcgaaata ggaaggtagt 19501 ttgaaccttg gattatcttt tcaacgtaag gtcagcgatt ttcccgaaaa aaaattgctt 19561 ttactgccaa aacataacta tcagttttga gcagtacagc aatttgcaaa gagccgcgcc 19621 accacctgtt gtgtcaatgt tattaccagg gcaggtgaaa cacccaaacc 
tgcattctct 19681 tggcatctat attgaggcag gctaaaaccg tagggaagat attgaataaa tgcggttcgg 19741 gagtaactcg ctcacaatta tcttgctcta cccagtgcta cccccaattc gcaagaaccc 19801 cacgaaaatg acctttcttg tgaaagagac atcatccatt agccgtacat actagctagt 19861 cgttgaactg aaacaatccc ttcaaagcag gcattctcct tgaaaccatg ccttatttgt 19921 tggatctgac tttccattgt gtcggttcaa acctctagag gaaaggtgcg cttgcgaaat 19981 ggttggtagt ttctcctgcg tatatgctct atatgatacc tgcgattaga tgataataac 20041 tttcacggtt ttttcttaag ggaactctac atatagagtg caagctcgga ttttttgctt 20101 aaaaacacgg atgaattgcc aactatgact accatcaaca ttcatactta acgcataata 20161 gataatctga ggtaaacaca ttaccctgat gtgggtgtac actactaact ttccattgac 20221 cgacaattgt gtaaacgtgc tattcaactg tttgtaatat gagtaacgac gtagatctga 20281 tcaaacgtct cggccccagt gcaatggatc agatcatgct atatctagct tttagcgcta 20341 tgcgcacaag tgggcatagg catggggcat ttttagatgc agcagcaacg gctgcaaagt 20401 gtgcaattta catgacctat ctggaacagg gacaaaacct gcgaatgaca ggacatttgc 20461 accaccttga gccgaaacga gtcaaaatca ttgttgagga agttagacaa gcgctaactg 20521 aggggaaatt gctgaagatg ctgggttctc aggaaccggg ttatctgatt cagttacctt 20581 atgtatggat ggaaaaacat ccttggcgac cagggcgatc gcgcgttcca ggaacaaatc 20641 tgacatcaga agagaaaaga caaattgagc aaaaactgcc atctaatcta ccagatgctc 20701 agttaatcac ctcttttgaa tttttggagt taatcgaatt cctgcacaag cgctctcagg 20761 aagatttgcc tccagaacat cgtatggaac tgagtgaagc attggcggaa cacattaagc 20821 gccgtctgct ctactctgga acggtgacac gcattgattc accttgggga atgcctttct 20881 atgctctgac tcgtcctttt tatgctccag ttgatgagca agagcggact tacatcatgg 20941 tcgaagatac tgctcggtat tttcggatga tgagagattg ggcagaacgc agaccaaaca 21001 caatgcgtgt tttggaagaa ctggatatcc caccagagcg atttgagaaa gcgatggaag 21061 aattagatga aatcatccgt gcttgggcag ataaatatca ccaagacgga ggagttccca 21121 tgattctaca gatggtgttt ggcaagaaag aagactaaaa gcttgttaac agttaacagt 21181 tagcagtatt aaagttgatt gatagctgat aactgactga tcactgataa ctgataactg 21241 ttagtagagc ttatttgcga ctgggatttg ccggggtttg agatggttga gatgggattg 21301 gattagactc aaaagttggt ctaaaatccg 
taggttgccg atttgacggg ataaaagaag 21361 gcaaaaagcc gctaactttg tcgctggcgt cgatacaagc ctccattgct tgagtcgggg 21421 caaaatcgat ttcagcacgc aaacccacta cacattgagc aaagcgtaca ggtaacaagc 21481 tacgaccaca ataatttaaa acttctggac caacagcttc acgactgtac cgactgatac 21541 caacaacaca actccctagc tgttcaggac gccgtgcttg gcgacaactt gctagcgcct 21601 cagaagctgc aatttgtgtt tgtcgtccaa ttccggtcac acaggcagac aagtccccag 21661 gacgtagtgc ttcagcacaa gcaattgacg ctgcttgtgc agctacaccc acactcaaaa 21721 gtcgtccagc acatacacga aaatcatttc cgtatgtggg ggtgacagcc atccctgatg 21781 aagtcattgc aaaccatcca gtcacagcaa agactggtgt ggcgagtacc gcagtggctc 21841 taaaaataac tgcttttaag gcgtctgcgc ttctttcaag acgctgtgca gtgtgtccag 21901 ctatttgccc aagcgtgtgg gtatttttgc tttttctctt gcctaacatt attttccgtt 21961 ctccaaacac ccaaaggtat gctacctcat gatagaagcc cactttgatg cttgcaatac 22021 agagcagcga agatgaggaa gtgagggagt gagaaagtgg ggagtgagaa agtggggagt 22081 gaggaagtca aagacaattt ctgacctttc ctccttgggt cctcatcatt caatgccttg 22141 atttcacaac ttaaatagga ctgcgatata attatttgtc tgggtaaagt taagcaagat 22201 taaatttagg aaactcatgt cccgatacag aggaccacgt cttagggtta tacgtcgctt 22261 aggcgattta ccaggattaa ctcgtaaaag cgctagacgc gcctatccgc caggtcagca 22321 tggtcagaac cgcaaaaaac gttctgaata tgccatccgg ttggaagaaa agcaaaaact 22381 ccgctttaac tacggtgtga cagaaacgca attgctgcgc tacgtacgca aagcaagacg 22441 cgttaccggt tctaccggac aagtgctgct gcaattgcta gaaatgcgct tggataatac 22501 tgttttccgc atgggtatgg ctcccactat tccggcggct cgccaactgg tgaatcacgg 22561 tcatgttaca gttaacggtc gcgttgtcaa tattgccagt taccaatgcc gtcccggaga 22621 agaaattgcc gttagaaacc gggaagcatc acggaagttg gtggaagcta acttacaata 22681 ccctggtttg gcaaacttac ctaaccatct ggagtttgac aaaaacaagt tgcttggcaa 22741 agtcaacagc gttattgaac gcgagtgggt ggcgctacaa attaacgaac tgcttgtggt 22801 ggaatactac tcacgtcaag cttaagtcaa cactcaacag tccaaagtca acagtccaaa 22861 gtcaagagtt aaaactctat gagtttggac tcgcgactgg tggactaacc accaatttgt 22921 gacatggtgc gagtataaac accagtctct gtacctgaat aacgttgctt aaagttaata 22981 tctggcttaa 
ttgccaacaa gtccctaacc tgctcccgca atttggcagt gctgatacct 23041 gtacgcagag cagtttttaa gtcaatttga ttagtttcat ttaataaaca gggacgtaac 23101 caaccatcgg cagaaaggcg catccggttg cagcgatcgc aaaaacactc cgacatctga 23161 ctgataaatc ccagtgtccc tttcgctccc ggaatctgaa acacatcagc gggtccatta 23221 ccacgaactt gggattctgt caatccccaa cgctggcgga tacgttggcg taactcttct 23281 gaaggtatcc aactgcgatc gccaaacaac tgcgaattac caattggcat aaactcaata 23341 aatcgcacgt gccattgttt atcaatcgtc aacgccgcga gatccagaac ttcgtcgtcg 23401 ttgacatcgg gaatgacgac cacatttaac ttcagcgggt caaatcctac gcgataagcc 23461 gcgtgaatcc cctgccaaac ttgttcccag cgtggacgac cgcgattatt aataatttgg 23521 tcaaaggtgt cgggatcaag ggaatctagg ctaatattga ctcgtcgcaa accagcatcg 23581 tacaggtttt gcgccattgg agcgagtaaa aagccgtttg ttgtcatcga gaggtcttgg 23641 gtttggggga gagatgctat tgctttcacc aactccacca cacggggacg cagtaaaggt 23701 tctcccccag tcaaacgaaa tcgggtaaat ccaacgggga taaagacttc ttctatgagg 23761 gttagcagtt ccttatcagt caacaactgc tgcttgagga tatagtcaag ttccgctccc 23821 tctggcatac agtattgaca acggaagtta cagcggtcta ttaaacttat gcggaggtaa 23881 tctacctggt tcatggtttt atagaattat ctgaacggct atatacacag aataatatct 23941 ttgtcaagca acacaagctt gctttacgaa ttttatttat cttatggggg ttcgggggct 24001 ataagaccgc gcgcgtaatc tttgattacg cgcgcggata catggacctc cgcgcttgtc 24061 gattcccgga gcctgatcct gaaaaaatgg gcaggcttaa taagaatgag tattcgacta 24121 cgacgacaga tcaacagacg ttcgcgtagc gtctcctccg gagacagcct gcccattttt 24181 ttgcggcaaa ttcgacccct cgatggggat gaggcaagca tttactgggt gcgattcttt 24241 tagtaaaagc gctataatca ttgtatgtta atcatgaact atcgttatcg aatttatccc 24301 gacgccactc aacaaaccgt tttgtttgag tggatggaga tttctcgtaa tgcttataac 24361 tatgcgttac gagaaatcaa ggattggtgc aatagccgga aatgcatgat tgaccgatgc 24421 tctttagaac aagaatatat cttaccagca gacttgaaat tcccaagtga agtgcaacag 24481 ttgaatgcgt tgccacgtgc aaagaaagaa ttcccaaggc tgggcgaggt accttctcag 24541 gttttacagc aagctatcaa acaactacac cgcgcttggg aatgttttac cgagcgcggt 24601 tttggctttc cgcgatttaa aaaatatggg cagcttaaat ctttgctgtt tccgcaattt 
24661 aaagaaaatc ctgttactgg aaagcattta aaattaccga aaatcggttt gatttttatt 24721 aacctgcacc gtccgattcc tgatggattt gatgtaaagc aggtccgtat ttttaaaaag 24781 gccgataggt ggtatgcatc tgtctgcatt caatgtaatg ttagtgttcc tgactcaaaa 24841 ccacatggtc accctgttgg tgtagacgta gggttagaaa agtttttagc aactagcgat 24901 ggtgatctcg taaagccgcc tagatttttt caaacaatgc aaagcaagct gaaattgctg 24961 caacgcagat tatctagaaa acaaaagcgg tcgaaaaact acgagaaaca acgtctaaaa 25021 gttgcaagaa tgcatcacac aatagacaat actcgaaaag atttccattt caaacaagct 25081 catgctcttt gcgacaccgg agatatgatc ttcatggaag atctagatta ctctaagatg 25141 gcaaaaggaa tgctgggcaa gcatatgctt gatgcaggat ttggtcaatt ccgaactatt 25201 accaagtata tatgttggaa acgaggaaaa ttttttgcac aggttgattc taggggaact 25261 tctcaagagt gtccagagtg tggtgcagtt acaaaaaaag acttgaaaac tagagtacat 25321 cactgtttgg tttgtggata taccacagac agggatgtcg ctagtggtca agtcatcaga 25381 aatcgaggca tagcctcaat tagtacgcca gggcttggcg gaacgaaaac tgcctgcgca 25441 gtcgatctac cggggacgaa gatatcttcg tctaggcaag tgacgaaatc tcgcaagaga 25501 aaaaccagga atgctaagtt gtgagactta gaagccccca ccataatctt tgatttggtg 25561 aacccagtac tgaaggaggg tttccctcac gcttgttctg gtgttagccg tcaggcgtgc 25621 cgtaggcata cccgaagggt gggggaggat gtcacattct tttaaggtaa cgtgtaccaa 25681 atatatttgc gaaatagcaa tcctaataac ctgagcagca tcttcactgg tgcgccaggg 25741 cgagtccgcc gcaatatgtc ctttggcaat tcctgtcatg gtgattgggc ggctggcggt 25801 ggatcgaact ggcaagggtg agtgccgaca ccgccagcaa tgttgcagga agtattagcg 25861 ttgcctaaag gtttcaagtc atgacactat ctttagctac ccccgatcgc gttcccaatc 25921 accaatgact ttttgcaaat accaatcctc cgacattcca ggatagtcat acttcgcttg 25981 atcaatcaat ctttgggcca tttcccaata tcccccaact agtctcaaca agcgttgtcg 26041 tagcaagtta tactttgctt tttcggagtg agaactaggg gagtgaattt tctttagact 26101 atctaagact tgttggtaat ttccccagtc ttctttttca caaaatatac ttgctgcacg 26161 ctgataatct tcaacagcac gttccatttc ttctaagcaa gtgtaggcaa tgccacgatt 26221 gtagtaagct gttgcatcat caggatttat ttgcagtgct tgggtatagt cgttaattgc 26281 gccaaggtaa ttccccatga ctcgataggc attacctcta 
gcaatataca gcattgcatc 26341 ttggggttgc atttggagtg cagagttgaa atcggcgatc gccccttgat gatctcctaa 26401 ataagaacgc gcttttcctc ggttgcggta aacaattgca tcatcaaaat tcagtcgcag 26461 cgcttggtta aagtctgaga tagcctcccg atagttaccc aacttgcaac gcaccacccc 26521 tcgacagcag taggcttgtg catcctgtgg atcaacccgc aaagcccagt ttaaatcctc 26581 cattgcttcg cgggtgtctc ctttttctgc cttttctagt aactgcgtga aatattcttt 26641 ttcggatttg agtggtgcaa ttatgggact tggttgtaca aaagcaggtt tttcctttgg 26701 ttgaagctgt ttaatctttt ctagacacag gcgacaattc tctgcgtcct tttgttgcaa 26761 atataattct gctgcttttt taaagttggc gatcgcatct tgaatatacc cttgtttacg 26821 gtgcacggtt ccccgtaaat tataagcagc cgcataattt acatttaaac gaatagcttg 26881 atctacatcc gcaagcgccc ctggcagatt ttttagcgcc attcgtccca aggcgcgaca 26941 gtaataagcc tctacacttt ggggatttaa tttcagcgct tctgtgaagt cagaaacagc 27001 attcaacatt tctcctaaat catagtatgc tagaccccgt tgatagtaag cttcagcaaa 27061 gtaaggtgtt agctgcaaag cacgggtaaa ttcttgaatg gcttcagcat agtctttttg 27121 cttggctctt tccaatcctt tattgtaaaa ttcgtcattc atggcgtttc tttgaacaat 27181 gaagtcagga tacaggttga aaagtgaaat actttaatct ttacacttca tacttaaatc 27241 ttccgttgac tcatttcaaa ttgttaatga aaatcacaaa atgactaacg actaccttta 27301 ccgctaccca ccgttgtatc ctggtatctt gctgaaacgc tacaagcgat tttttgccga 27361 tgttgaactt gcttctgggg aaatagtaac agcccactgc cccaatacag gaccaatgac 27421 aggagtttct actcctggta gtgcagtgca gctgtcttac agtgataacc ctagccgcaa 27481 acttccctat acctgggaaa tgatccaagt acatgacaat gaaccaactt gggtaggtat 27541 aaatacgaat cttcccaata ggattatcaa attagcttta gaaacatacc tttttccaga 27601 attgggaaac tatagccaaa ttcgtcctga agtttcttat ggagaaaata aatcaagtcg 27661 ggtagatttc ttgttatctc cacatacaaa cctcaacgtt ccatctggcg atttgttttt 27721 aaaaagtgat catttacttt taaccaaaaa tgcttgtcct atatatcttg aaatcaaaaa 27781 tacaactttg gcgcagggga aattagcact ttttccagac acagaaacaa cacgaggaca 27841 aaagcatttg cgagaactga cagcactctt acctcaaaat cgtgccgtca tgctttactt 27901 tataaatcga agtgattgca ctgaatttgc tcctggcgat agtactgacc ctgtgtatgg 27961 taaattattc aggcttggaa 
tcgagcttgg tttagaaatt ttgccttgcc gttttgaagt 28021 taacccagaa ggtgtgcgtt atttaggttt agcagaatgt cagttttgag ccttcagcag 28081 caggggaagc ttggcaacta aatgatgctg ctgcttattc tacttctggt gaagaacctg 28141 ctgtaaagcc tgcaataatt ccttaactgt gtaaggcttc ggtaaaaatg atttgacacc 28201 attgcccact gttgctagtt tagtggtgaa catcagtccg ctagttgcaa gaattttgac 28261 ttgtggattg attttttgca aagtacgcat gacagtgaaa ccatctaaag cgggtagcat 28321 catatcaatc aacacagcac taatttcgtt cttacgttga gcatataggg cgatcgcctc 28381 aataccatcg ctcgcagtca gaattttata gttgtatgct tctaacgaag ctttggtaat 28441 ttcttgaatc gacggttcat catccacaat cagaataact tctccatgtc ctgcaaaaac 28501 ttccaccatc accagtggcg gtgtttctat tccctgcact gctggtaagt aaaccttaaa 28561 agacgtgcca gatcccactt cactgtacac attcacaaat ccaccgtgac ttttgacaat 28621 gccaataaca gtagaaagac ctagtcccgt acctttacct tgttctttgg ttgtgaaaaa 28681 tggctcaaaa attctatcga tgatttcatg atcaatgcca actccagtat cggaaacagt 28741 tatcaagata taggggccag ttttggcttc caaatgcatt tgagcatagc tttcatcaat 28801 aaaaaaattt tcagcacaga tactcaaagt accgccatca ggcatagcat cacgggcgtt 28861 gacgcaaagg ttcatcagca cctgatgaag ttgtgtccca tctccagaaa ctagccacag 28921 atcttgcggt acatccgcat aagcttcgat aaattttggg aacgtttctc taataatttt 28981 tgtgacttcc ctgatgaggt gtttgggttg taaagttatg cgcttccctt ctacacctcg 29041 cgcgaatgac aaaacctgct taacgagttc agctcctcgt ttggcactgt tttctagtat 29101 ttctagcagg tgattatctt gctcgtagac atgggggaat tttagaggta ataattgagc 29161 cactgccaaa attggcgtca ggatgttgtt taagtcgtgg gcgataccac tagccaaagt 29221 accaatactt tccaagcgtt gggcgcggaa caattgcgct tccaaaagtt ttttctcagt 29281 aatatttgtg tcaacagtca gaatagactt gggctttcca ttttgatcac atactaatgt 29341 ccaccgggtt ccaatgagaa tttccttgcc acttttggta actttagtga cctcgccttg 29401 ccactgacct ttactaataa caattgaaaa agcagcttcg acttctgagg actgttcctt 29461 gtacagaacc tgactggcat ttttaccaaa tacctcttga gcctgccatc cgtaaagatt 29521 ttctgcacct tgattccaga atataatgca gttatctaaa tctcgcacaa aaatcgcatc 29581 agtagtcaca tcaagcaaag ccgcttgttg acgaatttgc tcttcagcaa gtttacgctc 29641 
actaatgtcg atgctagcgc aggtgatgcc tacaacttct tgcgactcat tatgtaatgg 29701 ttctacagtt aagtcataat acttcgttcc ttgggcggtt gtgatagata cttcctcttt 29761 ggttcctata cccgtggtga gtacaccacg tttaataatc gtcatatgtt cagcatcttc 29821 agcacggagc aaatctgaat cttgtttgcc taacatctcc tcaggtgtgg aggcaaaatt 29881 ggggttgtaa acccaggtgt agcgtaaatc ctgatcctgg ttaaaaacaa caattggcga 29941 attttttaaa gctacccgaa atcgctcctc actgcgtcgc cgtgcttcct ccgttcgctg 30001 acgctcaata gcataccgca ttgagcgcac cagtaagtca ccattaactt gccccttgac 30061 cagataatct tgtgctccct cttgcatagc tttaattgcc aaagtttcat catcaaaacc 30121 tgtcagcaca ataatgggaa tggctggaac ttgatgatgg attttaataa atgtttctaa 30181 tccttgacta tccggaagta aaaggtctag caaaataaca tcaaagcttt cttgctccaa 30241 aaagcgcagc gcttcgtcca attgctccgc ttgcttcagg tggaacttta cagaagagac 30301 atcttgtaaa aactcttcta aaagaacgac atcaccagga ttgtcttcta ctaataaaat 30361 tttcattttt tttagttatt agtcatttgt tattagttat aacctgacac ttttctctcg 30421 ttcctatgct cagcatgaga acaagagagc ctgaaggtca agtctcgact taagttgaac 30481 tgatgcaccc gttgcattca tgacttacac cctccggagg gagccagcgc tatcaaaaaa 30541 agaattctta attttgaatt ttgaattttg aattttgaat ttgagcgaag cgagtggggt 30601 gaggtcatca ctccggaggt agcttgacaa cagtgaacca aaaactttct attgaatgta 30661 ctattttgac aaactggtcg aagtcaacag gttttttgat gtagcagtta gcacagagat 30721 tataggctct taaaacatct tcttcagccc cagatgtcgt taaaacaact acaggaatac 30781 gtttaaggat ttgatctgct ttgatttctg ccagcacctc ccttccatct tttctaggca 30841 agtttaaatc aagcaggatg agatcaggag tgggtacctg agtatagttt tcctgtttgc 30901 gtaagaatgc tatggcttct acgccatccg ccgcaacact caagttcacg gagattttgc 30961 tgtcttcgag agcaatctgg gttaattgca catcgccggg gttgtcttcc actagcaaaa 31021 tctcaatagg catgattgat tcaaagctca caatttgttt cctgctgtat caggaattgt 31081 aaagaagaaa gttgtaccat gttgcggttg tgactcaacc cagatatttc caccgtggcg 31141 ttctacaatt ttcttacata tcgctagccc aatgcctgta ccaggatact tatttctagt 31201 gtgtaaacgc tgaaaaatca caaaaatgcg ttcagtatat tgaggttcta tgccaattcc 31261 attatcacgc accgcaaact gccacttttg gtctatgcgt tcaaccccaa 
tatggatttg 31321 aggtggctgt tggctgcgga acttaattgc attgctaatt aagttttgaa acaactgcga 31381 cagttgcgta gcgtccgcca tgacttcggg taaagtatcg taggtgagga ctgcgttact 31441 ttcttcaaga gcaactttca gatttgcgag aacgcaattc acaacaaaac gacaatcaac 31501 tggctcaaaa ggctgactac gtgtgctcac acgggaataa ctcaacaagt cattgatcag 31561 tgtctgcatc cgaagcgcac cgtctacagc gtaaccaatg aactcattcg ccctagcatc 31621 aagattatct ttgtatcgtc gctctaaaag ttgcaagtaa cttgctacca ttcgcaacgg 31681 ttcttgcaaa tcatgagaag caacgtaggc aaattgttct aattcagcgt tggaacgagc 31741 cagttcctga cgttggtggg tttcttgttc caaaagttgt gcttgagaga gtgcaatgcc 31801 tatttggtct gctagctgtt gcaacaactc tatttccaca ttattccatt ctcggggaca 31861 ctcacattga tgagcaatca gtaaacccca gaggttttcc ctttgaagaa ttgggacaac 31921 gagattagct ttgacaccaa actgcgagag aaattctttg tagcaaagtt ggatgtttgc 31981 tgtttcaatg tcggctatcg cactcactct cccctgacga aacaattctt gagagtcttg 32041 ttcaaagcag tgctcaagga tattttttcc cagcaaaacg ggaaaaccag gcagcacagc 32101 ctcttgcatc acagttgctg aaccatctcc ccacaaccgc aagatcataa ctctgtctgt 32161 gtgtagcagt ttttgaactt caatgaccgt ggtttgtaga attgcctcaa tttgtaaaga 32221 ttgacgaatt ttgagggtga tttcggcaaa tagttgagaa cgcttggttt gacgtttcaa 32281 ttcctcttct gcttgtctgc gagcagtgat gtcaagtgtg acaccaatca tgcgagtggg 32341 gctaccatca gcatcataaa ttccagcacc tcgtgctagt acccaatgga tgctattatc 32401 agaccaaata ttgcggtact ctgcctcata atctttgcgt tcctctaaag cttgtctcac 32461 agcctcctgc ctatgggcgc ggtcttcagg atgaacgcgt tcgtgtagcg tatcatagga 32521 aagctcaatt tcaggcggca aaccaaaatt tgccttgcat tgtttggagc aagataagtc 32581 gcccgttttc aaatctagct cccaagaacc gagtttagca gtctgcaatg ccagcttcag 32641 atgttgttcg cgatcgcgta aggaagcctc agttcgctta cgttcagtga tgtcccgtgt 32701 cactgctaac aaggcaatta tgtttttatc agaattacgc aggggaacag cgtggctttc 32761 cagccagcga cgagtacctt tgagtccaat aatttcaaac tcctgtatcc ctgattcgcc 32821 ttgaaaaacc tgttctgtca gtgttacaaa cgcttgacga tactcagggg caatcagggg 32881 ataaatgcag cttccctgca cttgatttat acaatcagct tcaatcatcg ccaatccagc 32941 cggattcatt tccaataacg tgccatcttt 
ggctaacaat ttgacacact ctggttctga 33001 ctcaatgatt gtacggagaa gattttcact atgcgcgagt ttttccttcg tgcgtttaag 33061 ttcctcagca aaatgggata cctcagctaa attacgcctt aactccaatt cgctaatcac 33121 ctggcgacct agtgccacaa gtgcttctac ttgtttgtga ctgatttccc gtggtacttg 33181 gtcaatcaag caaagagtcc ccaccatatc tccctttggg gttattaagg ggacaccagc 33241 ataaaaccgc acgtaaggat aagatgtgac gactgaattt cttgctaact tttcatccgt 33301 ccaagtatca taaacgacca ccacatctcg tcgctcttgg caaaggtaag ataatccgac 33361 actacgaggc atttctggca catccaaacc gatttttgcc ttaaaccatt ggcggttttc 33421 atcaataaaa ttgaccagga ctataggggt gtcacaaata aaggcagcta actgggcaat 33481 attgtcgtag gtttcttcag gttcagtgtc aagaatttga tattggcgga gggcttcaag 33541 tcttgctgct tcgttatgag tcatcataac cataaatatg tattaatttt ttgaaacact 33601 taatactttg agttttctat cactcaaaaa agttgcttat ttcgataagt gtgttgttct 33661 ttcttctatt ttatactact tttgatagat gttatgtgtc aaatacacgg gttatattga 33721 aaaattgcag caattactag taggttgatc tcgtctgtat ctcctccacc aaacgctcgt 33781 accccgcttc gggggcgctt tgcccataca gtgtagccgt gccgcaggct caggaggatc 33841 gctttgattt gtgcatggga ttacctccgc tttctcaagt gcaaatcttg aaattaatag 33901 tattatatgc tactagtttt gttcggactt aaaatttcct tagttgttgc tattgcattt 33961 tccgcatcat tcacaaaata aacatttttt gacgctagtt tctggaagaa gtttttactt 34021 tcttcatcta cactcaacaa aatcactttc ttattgcctt ttaaagctag agaaatttct 34081 gaagcagtac ccgcacccat tccacaagca attaccacat cactagaaag aacattgata 34141 ttgttgcgag catttcccat gtcagtaaaa atggcaatat caattgcttc agaaacgcct 34201 tggctatcat gacaaggaag aataccaata gttaaaccat ccacagattt tgctccccga 34261 cttgctgcat ccatgactcc aacatttcta ccgccagtca gcaaaaccca tccttctgtt 34321 gcaatgagtt ttccaagttc ataggcgttt tgcatatcaa ctgctgtagc tttttctcca 34381 ggtcccatca ctccaataat aatttttctc atcgttgttg attgacaaga acacagtacg 34441 caaaaatcag aataaaattt ccatgtcaac tcggcttctt ctagaaaccg agttaatgaa 34501 gaatactaca aactgccaac aatcatccca gcaagtacga taaaaccaat ccacacgttt 34561 tgccggaaca tttcaccata agcagagtta ggtaagtctt gctgtcccag acgtgtgtac 34621 tgccaaaccc 
aacctacagt agcaatagca agactgatcc aaaaagtagg gcgcagatac 34681 ataactaaac ctagccaact cagtagaaag acagtgctag caaagaaaat gccaattgca 34741 ataggcgcga aatcaccaaa aaacaaagcg ctagaattca caccaattcg acggtcatct 34801 tccctatcgc tcatagcgta aacagtatca aatcctaatg tccacataac agtagcaccc 34861 cagagaagcc aagtggaaag tgaaaggtta tgcgtcaccg cactccagct aatcaaaaca 34921 ccaaaacccc aagcgataga aagaaccagt tgcggaacag gaaacacccg ctttgcgccg 34981 ggataaagca aaatgactgg aactgctgct acacacaacc aaaatgagag tacgttaaga 35041 tagaaggcga gaactgccgc acatgccatc gcgactatgg caactatgac gccaactttc 35101 acagtcagcg cacgggaagc aagggggcga tcgcgagttc tttccacttg tggatctata 35161 tctctatccc ataaatcatt gacgacacat ccagcagcac ttgtggcgag agtacccagg 35221 atgataactc caacaagtgg taaaggtggt tttccagcag ctgccaaaaa gacagcccat 35281 aaagcgggaa tcattaaaat taagcgtcct tctggtttgt gccatcgcaa aagccggata 35341 atcacaagcc atatgggttc gctgttctgt tctggcgtag ttagcatagg tagtaaaaga 35401 tgactaaaga tagattggtt cacacatatt tacatctata gaatatcttt atctgatatt 35461 tataaaatca caaccacaag tggcaacttt ggatacaatc aataagtcac ttcagtggtg 35521 agtaacaggg tacatttcct gattcaagca gggtcacaaa atgttatacc tgaaagcaat 35581 tttgtttgac aaaatacatc gtctcgccga tacgtcgaga ttgccagctt ttaccatcct 35641 taatatagag agaaacgata gatggtgaat ttagtttcag ctaactggga gagtgttccg 35701 actcaaacag atcagcaaga tcgaattatt gctgctattg acttagggac caattcacta 35761 cacatggtag ttgtgcaaat tgaacctaca ctgccctctt ttagcattat cgcgagagaa 35821 aaagaaactg tcagactggg cgatcgcgat cttgagacgg ggaatttgaa accagaagtc 35881 attcaaaggg cgatcgccac cttgggacgc ttccaaaaga tagcaaaaac tctcaacgtc 35941 gaaacaatca ttgctgtagc aaccagcgcc gtacgagaag cacccaatgg caaagatttt 36001 ttgcaaagaa tcgaagacga gttgggttta agcgttgacc taatttccgg tcaagaagaa 36061 gcgcgacgga tctacttagg tgtgctgtca gggatggaat ttaacaacca cccccacatc 36121 atcatagata tcggcggcgg ttccacagaa ttaattttag gcgactctca cgaagcacga 36181 accctcacca gtacaaaagt cggcgcagtt agactcacaa ctgagttaat caccactgac 36241 cccattagca acattgaatt aaagtactta cacgcttatg cgcgcgggat gttagaacgt 
36301 gctgtagaag aagtgcaggc gaatctaaag gatgaagaat ccccccgttt agtaggaact 36361 tctggcacaa ttgaaactct ggtgattatt catgcacggg aaaaggtaga ctctgttcct 36421 tctacgctca atggttacga aatgagctta aaagatttgc aggagtgggt caatcgccta 36481 cgaaaaatga gtaactcaga acgagctgcg attcctggta tgccagaaaa acggtctgaa 36541 gttatacttg caggcgcagt cattttacag gaagcaatga accttttcgg gctagagtca 36601 ctcacagtat gcgaacgttc tctcagggag ggtgtcattg tagactggat gcttactcac 36661 ggttatattg aagaccgcct acgctaccaa acttcagtac gccagcggag tgtgctcaca 36721 attgctaaca agttccacgt taatttagaa catagtgata gagttgcagt ctttgccttg 36781 agcatatttg accaaacaaa aggaacactc cactactggg gagcagaaga acgccaactt 36841 ctttgggctg cggcgatttt gcataattgc ggtcattata tcagccattc ttctcaccac 36901 aagcactctt actatctcat tcgcaatagt gaattactag gctataacga aacagagata 36961 gaaatcattg ctaatttagc gcgttatcat cgcaaatctt ctcctaagaa aaagcacgaa 37021 agctaccgca atattgcaag caaaaatcat cgacaaattg tcaataaatt gagtgctatt 37081 ttaagattgg cagttgcttt ggataggcgg caaattggag caattgtgaa agtgcaatgt 37141 gaatatctaa ctgaacaaca agaattccat ctcaaaattt atccgtcccg tgctgatgat 37201 gactgtgctt tagaactatg gagtttggat tataacaagg gtgtatttga atcggagttt 37261 gaagtgaaat tagtagcaaa tttagagctt aatatcgctg catttagtta gataaagaaa 37321 gtgttggata aattaggtgc attacattag tgtaacgcac cttcaaatag gtttgattta 37381 ctactcaaaa attttcggtt tgtcgcttgt gtccttaagt tctatttacg aactgcaaga 37441 gcgtggaagt cagcaatatt cttaccaccg acaatgttat cgaatctata cagatgagca 37501 ggtttagcac tcttacctcc gttttcaact tccacgacac ctagccctgg gaaaataccc 37561 acactgttga aaaatactgg gtcgtttgct gagatcggat taggtccacg ctgtgtgaaa 37621 aaggcaactt gttcggaggc tgatgtttct attaagtctg gtgtaggatc aggacctaag 37681 gcatcactta ctagatcgac aagattaact gcgcgagctt tgtttgaacc ttcaacgttt 37741 ggatcaaagg gtttaggatt gatctcgtgt attgtgatgg aattggatgc acgattaact 37801 tgccaaaggt attttccacc aactagtgtc acaccatgaa catcattgcc ggattgagga 37861 actttgaaga aatctggacg cttcgaggga ttttctagga gagacttgtt gttatacaga 37921 tagagaaagt cagctatttg gggatcagca ttgcctgcgt 
tggtcacaaa gttgtcgcca 37981 atctcatagg caatcaatcc attgaaggtg gcttcatctt tactgtaaat gtgtgcgatt 38041 gtcagtgttt ctgcatttac gatcgccacg ccaccatcag caaaagtcac ataaacaaac 38101 ttacctgtat catcaacgac agctacaact ggtcgggcag tggcaccact ggtaggtact 38161 cctaaggctg tctttgtctg tggagtattg aagtccaatg tttgaacatc accaaaaatt 38221 ttacctccgg gtgcctcata atttgtggca atcttatgaa tccgctgacc tggagtatcg 38281 gcaacatata caaatttgtt gtcttttgag aggtaaacag cgtgagagtt ggagtttggt 38341 aatcctggtg gaagaattga atccacaact tcacgcttgt tggcgtcaat tgcgtagacg 38401 tgagccgtag tcgcatgtcc tacaatcgcg tgtgtaccgc cttggttgaa agtaatccag 38461 tggggttttt gacctggcgc aagattgttc ttcttagcgc tggctgccaa gttaaattgc 38521 tcaacctttg cattacctct taaaaagtct cggtcgttac ctgaaaaaac aaagagattt 38581 cctcccaaaa cagtaccatc aagggaatct gcttggtcaa tagcccaaac ctcataacct 38641 ggatttcttg gtcgtgttgc tgaagctaca gtggacgatt ccccgtgtat tgggagggaa 38701 acttgggtga atatggtcca agcaattcca atgaaaaaca agcatggaat gaagaatttt 38761 ttcactttta accttcctta tttgagtaca tgaaagagtc tcgtgtaact cctgatcact 38821 gatttgatgt ggttagatag gattgggtaa caagcgatcg cttgtctttt gttacgttaa 38881 aaaccctcag ttttgggagt gataacaaag atttgactag aacaacactg ccagatcatt 38941 aatcacaaac cgtatagaaa atctctttgt ctggtgaata aaagaacgtt gattaagatt 39001 gaacacaatc agtaaagcta tgttctatat gctacgcgtt ttacgtcata acacatatgg 39061 tcaacattcc gttattgata acatcacgca atcttttgta ttaagaatga cttaatatta 39121 tgaatacaaa agttactcca ttgagaaatg gaggtagaga tttgtatcaa aagcgcaaca 39181 tcaaccgcca atacggactt ggttaacacc ctttcaaaac caagaagcaa gctggggtct 39241 tcttaacgaa aacccttata actgtataaa agttgcaatt atgacagact gtcaaagtct 39301 acaataaata cggttaaata atcaagtaat agtattataa agattgcatt aattttacat 39361 gtatttcgct acgtaaactt aaaaactaaa gctaacaaat cttgaccacc tagtaacgca 39421 aataaattaa cctgcatctg ttagtcagga gtcaggagaa gaaacaggac tagttttcag 39481 gacttacgca aaaataacgt agttgcaccc gttgcgacgt aagtcgtaac tacagtcaag 39541 ttcattcgta actcgtgcgt aagtcctagt tttattcaga attcttcagg gttggtgaat 39601 gcgccgctac attatggttt 
cccctaaccc tgcatggcaa gcaccactca gcctgctgtg 39661 aattcagcta caactaagta tttgttcctt cacaaatcga aagaattcgg taaaattagt 39721 taaagtgatt tttgcatttg taaaaccaag gcatcagcaa tctcatgcaa ttgaaaatca 39781 gtgcatcccg actacccatc ccaccactac tgttgttgat cgcccccttt ttcttatggg 39841 gtacagcaat ggtggcgatg aaaggagtga taccccacac aacaccgtta tttatggcgg 39901 gagtgcgttt gataccggct ggggtactta ttctgatagc agcagcattg atgggtagac 39961 ctttgcccaa gggttgggta gcgtggttgt ggattgcttt atttgccttg gtggatggaa 40021 cgctgtttca aggcttttta gcagagggat tgctcagaac aagtgctgga ttgggatctg 40081 tgatgattga ctctcagccc ttggctgtgg cattgctatc tttgtggtta ttccaagaac 40141 atatcggttt ttggggatgg ttgggactag gaataggagt cacaggcatt agtttaattg 40201 gcttgccgga tcagtggatt ctccattttc tcaactcagg cacgattgta gagacattgt 40261 ctacagcatc aatacaacaa ttgtttgcca gtggcgagtg gttgatgctt ctagcagcgc 40321 tgagtatggc agtgggaaca gtattgatcc ggttcgtttg tcgccatgct gatcctgtga 40381 tggctacagg atggcatatg attttgggtg gcttgccatt gtggggaatt tcatcaactg 40441 tagaatctgg gcagttgcag aacattgtcc cgtctgactg gtttgcactc ttgtacgcga 40501 cggtgtttgg tagtgcgatc gcctacgcat tatttttcta ctttgcctct agcggcaatc 40561 tcaccagtct cagttccctc accttcctca cacccatatt cgcattgtta tttggtaatt 40621 tgttacttca agaagttctc agtccgttgc agtgggtagg agtcagcctg actttagtta 40681 gcatctatct catcaaccag cgcgaaactc tatcagggtt aagcaagaaa gttgctacaa 40741 gtgaaaaaac tatgacacag caacaacaag ttttagaagc gtctgcaaag acgataaaca 40801 aagttacctt acctgtgaga aaatctgaat cagagatgtt gccttaaata aggtgagacc 40861 agcgctgcgg gagggtttcc ctccgcaggc gactggcgaa cccgg // LOCUS NODE_604_length_40647_cov_5.29705440647 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 40647) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 40647) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..40647 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 239..826 
/locus_tag="DP116_04915" CDS 239..826 /locus_tag="DP116_04915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_04915" /translation="MQTKEAPFPLRLWTVKEYHKMAELGIFHPEERVELIAGQIVKMS AKGTIHTTTVRRIANVLREKLQGQVDIHTQDPVQLNDFSEPEPDIAVVKVDPLDYVDH HPTASEVYLIIEVADSSFKYDCETKGKAYAKSGILDYWVLDVNNRKLHVFREPTQEKY HSEVIFSQEAIISPLNFPNLMITVSDILPPVINKP" gene 1033..1896 /locus_tag="DP116_04920" CDS 1033..1896 /locus_tag="DP116_04920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999885.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_04920" /translation="MTTITTMTIDNFTVRPMKRSELDLVIDWAAIEGWNPGIYDAECF YQADQCGFFVGELNNELVASISAVAYSKHFAVIGFYIVKPQFRGRGFGMKMWRAAMAY LGTERNISLDGVIAQQKNYQKSGFQITYRHIRYETVGGGLAPDGIVELKTVPFDKLVA YDQELFPAERKQFLQLWINQPNSSALGVVRDGHVVGYGVIRQSYTGFRIGPLNADDEQ IAEQLLLALLAFASDAPVFLDVPDANPEAIKLAQRYGMKPVFEVARMYNKEIPNLPIN RVFAVTSLEVG" gene 2353..2973 /locus_tag="DP116_04925" CDS 2353..2973 /locus_tag="DP116_04925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867021.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UPF0016 domain-containing protein" /protein_id="PRJNA477356:DP116_04925" /translation="MLTAFTAGLLLITVSELGDKTFFIAVILAMHHPRRLVFAGVVAA LAAMTILSVVVGQAVSVLPKVYIHFAEIALFIGFGFKLLYDASRMPVSSCDTEVVEEA KAAVKEADMQLPKKKNALAILTEAFVLTFMAEWGDRTQIATIALAAGNNAIGVTTGAI LGHAICAAIAVIGGRMLAGRISERKLTFIGGFLFLVFGVVAAIEGA" gene complement(3038..3898) /locus_tag="DP116_04930" CDS complement(3038..3898) /locus_tag="DP116_04930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129225.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Tfp pilus assembly protein PilF" /protein_id="PRJNA477356:DP116_04930" /translation="MSQTRKHWIINVVLVLILFAFVGVSVVPLIGAFDETQSSTEKST NARSVSPSSDLKSKLENAAQGYAQVLQREPENQTALRGLLETRLQLLSLGAGDVKSVI EPLEKLTQLNPEETRYAVLLAQAKQQSGDKEGAATAYRSALEKKPGNLEALQGLVTLH LSEKRPEAAISLLQNTLNTASKANKSQPGSVDTVAVQVLLGKVHASQKNYTQALSAYD QAISNDKQDFRPVLAKAMLLKEQGKVEEAKTLFTQAAALAPSQYKDEINKQASESSSP ASTPTSTPKQ" gene 4299..5303 /locus_tag="DP116_04935" CDS 4299..5303 /locus_tag="DP116_04935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316777.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04935" /translation="MNSDLMPSPESSNSTASKEALESKDTAAMNINAATPAFQEKNSP VISSETANDQNLENRQQPIPDPSEPMQYRAIGLVRGRYVAANSEQFTQGSIVTTDGTQ LEAVLLGRIMSLVKNHLDLEKEHLWVVYPRTRQENDKLHIQIVGVWEPENLAKHQIAN DEQDSASQKDLPIPGTNGSSSALIPSSEVPDGDFSVRGEVVYQSYEAEHLVIKIRQAA RKQDDKPKYFKLKIKGKLEARAVGKFWDLQVKRETDSLVVESAEAIADLPKKRRPPQT RHGGGGRRPSGKKPFPPRPRNGEIPRPVKKTGDTGAASIPISSKPIPKPVKRLKPPEQ " gene complement(5463..6632) /locus_tag="DP116_04940" CDS complement(5463..6632) /locus_tag="DP116_04940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138620.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04940" /translation="MRTIAEINDKISQKLAVVLTIEELKARVAEVGVTKVAKEVDVIT TGTFEPMESSGAIVNLGHTDPPIKIRRCWFDGVPAYSGFGAVDLYLGASCAVEVMDGE EVRERGGGHVIEDLIAGKAVHIRAQGQVTDCYPRGSFETTITRETINQFYLFNPRNLY QNFIVGVNGGDRPLHTYLGPLQPRLGNAVYSNPGAISPLLNDPDLQLVGIGTRIFLGG GIGYVAWEGTQHFPLQKRLPNHTPIGPAATLALIGDAKQMDARWVRGCYFKSYGPSLM LGVGVPLPVLNEEVVARCAVEDKDLVAPIVDFSIPRRVRPTFGLVSYAQLKSGRITIE GRTVRVAPLASLFLSRQVAVELKQSILRGDFTLTEPVAPIEMQRSFLPQDRWTEF" gene 7170..9698 /locus_tag="DP116_04945" CDS 7170..9698 /locus_tag="DP116_04945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454836.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="mannose-1-phosphate guanylyltransferase" /protein_id="PRJNA477356:DP116_04945" /translation="MRAVLMAGGSGTRLRPLTCDLPKPMVPILNRPIAEHIINLLKRH QITEVVATLHYLPDVMRDYFQDGSDFGVQMTYAVEEDQPLGTAGCVKNIAELLDETFL VISGDSITDFDLTAAIEFHKQKKSKATLVLTRVPNPIEFGVVITDEESRIRRFLEKPS TSEIFSDTVNTGTYILEPEVLEYLPANQESDFSKDLFPLLLAKNEPMYGYIAQGYWFD VGHLDAYREAQYDGLYHKVKLDFAYNEVSPGLWVGQNTYIDPAAQIETPVVIGDNCRI GARVQIEAGTVIGDNVTIGSDANLKRPIIWNGAILGDEAELSACVICRGARVDRRAHV LEGAVVGSLSTVGEEAQISPFVRVWPSKKIESGAILNINLIWGNTAQRNLFGQRGVQG LANIDITPEFAVKLGAAYGSTLKPGSRVTVSRDQRNVSRMVTRSLIAGLMSVGIDIQN LDATAIPITRTVIPTMPVSGGIHVRVHPDRPDYILIEFIDAKGINITKSLEKKIEGAY FKEDMRRAQIHEIGDVAYPSQVMDRYCTAFEKLLHIDSIRNSRAKVVIDYVYSVSGAV LPQMLDKFGADAVVLNASLNKTAMSAAEREALLTQLGHVVEALKANFGVQVSANGEQL ILVDESGISIRGEMLTALMIDMMLTAHPRGTVVVPVHASSAVEQIARRHDGKVIRTKA NPTALMEACQKNSNVVLGGNGDTGFIVPQLHPGFDAMFPTAKLIEMLTIQERSLASVR AELPRVVHKAYTVRCPWTVKGALMRHLVETHPAQNLELIDGVKICQPYDDSWVLVLPD ASEPVVHLYANSNDRDWVDESLRQYRARVQAFVEREEEQAVAEV" gene 10010..10570 /locus_tag="DP116_04950" CDS 10010..10570 /locus_tag="DP116_04950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320411.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_04950" /translation="MTITVPATTTLKEFLELPYVDESPAWEFMNGEAIQKPMGGGKHS LLQKSLIAVIDAAGSQYESFPELRCTFGNRSVVPDVVVVAIHQLPLDDSGEIISSGID FPPPWVIEILSPAQSQTKVTGNILHCLKNGTQLGWLIDPSERSILVYHLDRLPDLLTG ADVLPVLENINLTLSVDEVFSWLKRR" gene 10712..11582 /locus_tag="DP116_04955" CDS join(10712..11017,11019..11582) /locus_tag="DP116_04955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_076611779.1" /ribosomal_slippage /note="programmed frameshift; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_04955" /translation="MPASYSYDLRTKVINAIDGEMRKTQASRIFEISRNTIETWLNKR KETGDYQPKVGYQQGYNPKIADLEEFQRFAQINGSKTQAEMAEAWSEKISDRTIGKAL KKLVIPEKKTYGYRERDEEKRREFRAKISQKKSSQLVYVDESGIDNREDYGYGWNPKG KRFHDLKSGKRNIRVSIISALCQGKLVAPLTFEGSCNRLVFEKWLEEKLLRELKSGQT IILDNATFHKSQKNRELIESVGCEIEYLPPYSPDLNDIEHYWFPIKNRVRKSQGAIED FRERVDTAIRLTS" gene complement(11656..14079) /locus_tag="DP116_04960" CDS complement(11656..14079) /locus_tag="DP116_04960" /inference="COORDINATES: protein motif:HMM:PF08881.8" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04960" /translation="MDAGTQKNNPDWLNPNVLTDFIDDLRDAGYKIGISQYIAAQDLI LALTAQGETLNRPERLKTLLGPIFCSSPAEQEDFQERFDRWIEFVSHTPRATERADVK AQALSKELGKVRRGFRQLIQTLIAIVLTGIFLPILFEQSIKDDSAQVTQSSGTQSSGT QSSATQSSGTQSSATKSSATQPSATTFSATKPNSDFPLDWQISLFCFLLTLCIAFLVW RLWWLWRANLFLQRHGTTEQPELHKISIRDFEQNLFPTIMFIHAAQSLRQRIRIPSHE LDVDTTIDATLRQGGWLTPVYGTRQVLPEYLFLINRVSYRDHQAKFMEEMVDRLQHNG VFITSYFFDDDPRICLSYDGTSSPQKLHEIIAKYTQHRLVIVSDTEKLFSTQTGEPEP WVSQMTTWGSRAILTPKPVENWGYQELELARQFIILPATPKGVRVLSQVLHQGSATYV LSEKDQISLPEPLRVRPHYWLERNPPKPEQINAMLNSLQEYLGKDGFYWLSACAVFPE LHWNITIYLGNVLKTAEGHCLLEVCSPTNLARLPWFRYGNMPDWLRSVLIATFTHQQK HAIRTALQDLLITAVQGSVGRLQLEVAKKHHSFLPKLADAVLHLLSIRVIKGSPLRDY LFLSVMTGQPKLAIEVSDTFSRLLNVSKHKSLLKIICDHKRNFFGLTAIQVFGIIVTG VLISFFTSHMNSKPTKFSETCSNFNILIPDDKSYVKLQATCRKFGSEPNSTSIHLKQI ENDNGQLRINAISEESTFQKTCYNMKVEGSRLFADCEKRNREKVLSSLELKGIFNENG MLKYLQDPN" gene complement(14040..15053) /locus_tag="DP116_04965" CDS complement(14040..15053) /locus_tag="DP116_04965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006277218.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MoxR family ATPase" /protein_id="PRJNA477356:DP116_04965" /translation="MNFPFYIGDGTRKRYESPAKLSVSARSQLLKPEYYTADQGLKDA CNVALLLGQPLLLTGEPGTGKTQFAYSVAWELGFDPPLKFETKSTSAARDLFYTYDAL KRFQDAQSGTVSASFSDYITYQALGLAILRTRNPAEVEQILPSNFPHSGKARSVILID EIDKAPRDFPNDLLNELEHMYFRIPEFGNKIIEADPALQPILIITSNTEKDLPDAFLR RCIYYDIPFPKHERLAEIIANRLGLHTGSSNPFLQDALNLFYRLRAPQSGLRKKPATA ELLGWLLALQTLAGNTANPLTKLDVILPTLSSLVKTAEDQEKAKKLVEQWMLEHKKII QIG" gene complement(15142..15549) /locus_tag="DP116_04970" CDS complement(15142..15549) /locus_tag="DP116_04970" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04970" /translation="MFIKYQTKRQKHTERFSFGRIFWYLISFCKRRRYQSINNKIYTE IKVLNTSKFKHFNRLSFTVLSELTNINRTHVENWVRSEETKQFVGEVMIQKLVDEVKD MFYRWEKETSSDKIPMDDLADELTRLLKSLSRH" gene complement(15622..16245) /locus_tag="DP116_04975" CDS complement(15622..16245) /locus_tag="DP116_04975" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04975" /translation="MQAVQGNNNTQVQQDVERNQGQAIGQMYGSQAFCVQEVRKGGFF VAGNVIFQNSGRDNVQPTLYQSERELPSLLPYLANRSDQEFELAKALQLLLKQVLPLP LVCIIHGNEFQSHDKFLERLQKFSVPRLLGLDPNQTVIKKYCLDWPAGLKNLDELRDR LSKNLADSVLGYSFASLEEINATFCKYPNPIIIHTHLLTEDWRQQRF" gene complement(16253..16609) /locus_tag="DP116_04980" CDS complement(16253..16609) /locus_tag="DP116_04980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312478.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_04980" /translation="MFSANPKNTNPLRLGEEARKIQEALKLAKRRDQFEIATEWAVRV EDLRRAMLDHQPHIVHFSGHGAGKEGLAFEENSGITQLVSGEAIASLFELFRGTVECV LLNACYSEVQTQKFYN" gene complement(16713..16823) /locus_tag="DP116_04985" CDS complement(16713..16823) /locus_tag="DP116_04985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007309807.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="integrase" /protein_id="PRJNA477356:DP116_04985" /translation="MRHIQEISAHNDLGTLQRYLEVIPKQRKKAVSVIGF" gene complement(17293..>18213) /locus_tag="DP116_04990" CDS complement(17293..>18213) /locus_tag="DP116_04990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872554.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_04990" /translation="NPKGEAGEAGGAEGTRISNWAENLLVGMTKTLPKGLRYISDRER AVHEQVRTVAQLLQIEPLLNRLPKQLSGGQRQRVALGRAIARNPQVFLMDEPLSNLDA KLRAETRAQIVKLQRQLGVTTIYVTHDQTEAMTMGDRIAILNHGQLQQVASPLELYNR PANRFVAEFIGSPPMNFIPVKFHAPQLITNGDFRFTLPAVCGEALQKYDNQNLILGIR PEHLILSVPANKNLQVQVELVENLGNDIYLSTKLLQPGFQKSAFGIKDVQVRVPPERF VSCGEELWLSLTPEKLHFFDPETELAIFPN" assembly_gap 18214..18223 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(18318..18713) /locus_tag="DP116_04995" /pseudo CDS complement(18318..18713) /locus_tag="DP116_04995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194804.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="sugar ABC transporter ATP-binding protein" gene complement(19205..19384) /locus_tag="DP116_05000" CDS complement(19205..19384) /locus_tag="DP116_05000" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05000" /translation="MSERVDEGIRINAFEDRSIMKISGFLTNLFEKAQFSVISSIKAK QLSQTTSNTLFTTSA" gene 19459..20118 /locus_tag="DP116_05005" CDS 19459..20118 /locus_tag="DP116_05005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008233850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="single-stranded DNA-binding protein" /protein_id="PRJNA477356:DP116_05005" /translation="MNSCILLAEIIQEPQLRFTADNLEVTEMLVQFPSVREGEPPATL KVVGWGNLAKEIQQNYHQGDRVLLEGRLAMNTIERPEGFKEKRAELTVQKIHSLGTAI DTSSSPAAVNTQLTPVASPPKTTPTHESPLPTPTPVTSNVGVLPQDDRPQQRRSQSTN LERNIERNTYGVTPTEEPDPDDIPFVRSVYSRTSWAHELCDSYELEANAYSKTVEQIK P" gene 20302..21879 /locus_tag="DP116_05010" CDS 20302..21879 /locus_tag="DP116_05010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878155.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CPBP family intramembrane metalloprotease domain-containing protein" /protein_id="PRJNA477356:DP116_05010" /translation="MTLKRLGLIILTLVATLFAGSALLGSWQEPQFQSRLELYQTNIV LQAASWEPPNDNTDLKPLQDALLGVKPVESATEQYQQTRQSAQTNLEKAKKQLAQVQS QPATTPTTPKSQSEYPPVSNTSSELQQQQLQQSLKGLQKLIAELDLRLGILQANQGKI DAALKSWSQLQQRSDINPEFDKAAAVLAGLWSDPPRILPNAERLIQNNLEGWFRYTAV DQLYRLQQRQDGLADLETIQQKTAEQAILKLAVIGTIPALTAFFGVGLVIFLIGQRLI KGKTSLLAENSDVRWSTPWDGETILQVFVIGFFLMGQIIVPLTIQLLPLQRPTQNVRI QALYVLISYLLVAFGALSVLYLSIKRFFPLPENWFRFRLQGSWFLWGLGGYCVALPIV VVVSLINQQLWQGQGGSNPLLQLALESQDTVALAIFFSTAAIAAPIFEEVLFRGFLLP SLTRYLSVWRSILASGFLFAVAHLSLSEILPLFALGIVLGVVYTRSRNLLAPMLLHSL WNSGTLLSLFVLGSSGQ" gene 22583..22858 /locus_tag="DP116_05015" CDS 22583..22858 /locus_tag="DP116_05015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316110.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05015" /translation="MANLTPSHATKQLLAGYCGIIFGGFGVHKFILGYAPEGLIMLVI SLVGGYFTYGLTLLIMQLVGLVEGMIYLNKSHNEFVDTYFLKKQGWF" gene 22965..23159 /locus_tag="DP116_05020" CDS 22965..23159 /locus_tag="DP116_05020" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05020" /translation="MQNLWLFAADSYYSLFLAYVLQTGKFNLFLIHSPLSKYVSVLIT IVLGKTQKPECFQKILTLTL" gene 23755..26211 /locus_tag="DP116_05025" CDS 23755..26211 /locus_tag="DP116_05025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871169.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="heavy metal translocating P-type ATPase" /protein_id="PRJNA477356:DP116_05025" /translation="MQIVSKTNLSPEAASTTEKITLDVTGMKCAGCVRAVERHLTQYP GVKSACVNLATEVAVVELEAGAVDADALAKKLTAAGFPTQPRKLGGKVAGETQLIQDP VQRQRQEMQAVKRQFLIAVVLLVLSFVGHVANISGHVIPVLHNIWFHCGLATVALLVP GRPILVDGWVGLRHNAPNMNTLVGLGTLTAYTASLVALLFPQLGWECFFDEPVMMLGF ILLGRTLEKQARIRAAAAFKELLALQPQLARLIAKPKTTEATPSVSSSAGVVEIPAEL VRVGEWLQVLPGEKIPVDGEVREGQTTVDESMLTGEAVPVTKQPGDLVTAGTLNQSGA IAVQATRVGSDTTLAQIVALVEAAQTRKAPVQKLADIVAGYFTYGVLTASLLTFVFWY FFGTHIWHHAVMAYAMQISHHSLFSTSHTPHLTIYTPLLVSLKLAIAVMVVACPCALG LATPTAILVGTAIGAERGLLIKGGDILEKVHQLNTVVFDKTGTLTTGRAVVTDCLPCQ AEEENAFRTPVALLQLAAAVESGTIHPLATAIQQEAQQQELSIPDAADFHTEPGLGVS AVVESTLVLLGNCDWLQWHGISISETAQKQSQELAADGKTIVFVAVGDTVAGLIAVQD TLRPDAKATIEKLRQMGLRVMLLSGDTQDAASATAKQLGLNTGDVMAGVPPVKKAAAI QELQARLTKGRTQHSIVAMVGDGINDAPALSQADVGIALHSGTDVAMETAEIVLMRDN LSDVVASIQLSRATLSKIRQNLFWAFAYNTLGIPLAAGVLLPSFGFVLNPSGAAALMA FSSVSVVTNSLLLRRFAYRS" gene 26431..26973 /locus_tag="DP116_05030" CDS 26431..26973 /locus_tag="DP116_05030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309074.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FHA domain-containing protein" /protein_id="PRJNA477356:DP116_05030" /translation="MAANNTKEILINNSPTHAFSNDVSMAAQTNESHVLIIEHDQERR ELILDRPVYSIGRDSCCDICFINSLFVSRRHATLIRVPRDDKKHSYYYRIVDGDAKGK PSSNGLMINGRKILDGLMINGQRIPAHDLKNEDEIVFAPQVRAIYYLRRNSMPAGEET DSSYDDITLIDPRMTNDIED" gene complement(27502..27849) /locus_tag="DP116_05035" CDS complement(27502..27849) /locus_tag="DP116_05035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860057.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05035" /translation="MARYTCSFILSVPTDQLQASLVELLQDCHLDVQYYTSDYILARE ALGSVSISKLVTVEILIDKTRPNDTETRMSIVIKNEELPLQRDNHCRQIFEYVKQAIE HCRYWHLIESIAG" gene 28048..28968 /locus_tag="DP116_05040" CDS 28048..28968 /locus_tag="DP116_05040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872378.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR01777 family protein" /protein_id="PRJNA477356:DP116_05040" /translation="MKIAIAGATGFVGSRLVERLHGEGMEVVVLTRNTTYAQKVFPST AFPNVEIIAYTPNTSGSWQSVISSCDGVVNLAGEPIGEARWTPERKQEILNSRKFVTQ NIVDTVINANPKPSVLVNASAIGYYGTSETATYDETSLPGNDFLAQVCQAWEAEASKV KDVGVRLVILRFGIVLGLGGALGKMITPFKLFAGGPIGSGRQWFSWIHVDDVVNLILQ ALMKPEIQGVYNATAPHPVRMAQLSQVMGKVMNRPSWLPVPAFAIEALLGDGAIVVLE GQQVLPKRTLESGFEYKYPNLEPALAEILK" gene complement(29291..29542) /locus_tag="DP116_05045" CDS complement(29291..29542) /locus_tag="DP116_05045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316105.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05045" /translation="MKEKFLNWLNLVLMADVFLVLFGFGWFAVAVIGKTVGVPLGLDL WHSLWQPVFNPAIGILMAGAIISGIINWVSQKFIKTDNG" gene complement(29611..30036) /locus_tag="DP116_05050" CDS complement(29611..30036) /locus_tag="DP116_05050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316104.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05050" /translation="MTNTVESLFDTGLERYKAGEPVADLIPVFKEICDRTPKSSAAWT CLAWLYLLDNNPNLAHKAAQKAVKLNPQDPQARVNLAVAMLETGQKGLRQHVDFTQQL IFVNPDWREEIKKSIEDGLGRKPDWQSLAKVKVWLFPEE" gene complement(30200..30556) /locus_tag="DP116_05055" CDS complement(30200..30556) /locus_tag="DP116_05055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194394.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="iron-sulfur cluster assembly accessory protein" /protein_id="PRJNA477356:DP116_05055" /translation="MTQATQSLQRGIQLSEAALQQVKFLRNQQGQDLCLRVGVRQGGC SGMSYMMDFEDASKITPQDEVFDYDGFKIVCDRKSLLYLYGLMLDYSNSMIGGGFQFT NPNATQTCGCGKSFGV" gene 30908..32557 /locus_tag="DP116_05060" CDS 30908..32557 /locus_tag="DP116_05060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05060" /translation="MTSFNRSTSRRLKKLTQIPSVWEGDRRPLSSSQTQNSDPDVKGE CILWVDGSQGIVRGMDVVAPDTGPEAIVRTLMRAMEHPHNPAKPARPQRIVVKDREIQ FYLRGVLQDLDIAIDYTPDLPLIDELFRGFTEILDSQVPDLPPQYAQALREKAFAIWQ AAPWEFLEEQQILSIEINKWDVGTLYASVMGMLGMEYGILLYRSEDSLKRFRTSVLKD DESQGHLEEAFLKQDCLFLTFENANDTEEEEDEFDDLADLPISEIEPTFGNIHPLEGL RSVLYDEEALVVYVAIESLSRFIRDYRNQLGGDTFPALNRRYRISLPESSNEPTKSVS IAVSSMPQLATELEEIAGFDTSEDESESPASAFESLRDDLIPEDSFLSLGVVSWEMLD YLRQSGTYHTTGEITQAGDGLPVILVQTSRPKAKTVISNIEQAGGLRAICFNPGADPF DGTRYDLGLLQTENAELFLFGEFEDDDPIHVEARKKWNQRCKNTQGYCGLIIAKGLTG ASRGNPQLRDMMALFEAQFLSPKDLGLGTLQLMPQLQFEET" gene 32678..32929 /locus_tag="DP116_05065" /pseudo CDS 32678..32929 /locus_tag="DP116_05065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867758.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="FMN-dependent NADH-azoreductase" gene 32945..33448 /locus_tag="DP116_05070" CDS 32945..33448 /locus_tag="DP116_05070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015175912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05070" /translation="MAMTNHSLFALKLFSALGCGLIAGVFFAFSTFVMSALARLKPTQ GIIAMQSINITVINPLFFTALFGTAVACIFLAVFSVLRWHQPGAFYLLVGSLLYLVGT IGVTIVFNVPLNEALAIVDPGSTEGANLWSRYLINWTIWNHIRAAAALAAAASFTIAL CYRTSQS" gene 33692..34561 /locus_tag="DP116_05075" CDS 33692..34561 /locus_tag="DP116_05075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318941.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05075" /translation="MSQAAPAEVDSSLLLVTQNYREWYRQGKRLEKQENYEEALTYYD KAIECCPDEYWLWYDRGSVLRELGQYQEALASFDRALKLRSNDYWTWYSRGYILLEEL DQFEEAIANFDKALAIRRDDYWAWFRRGDAFRHLERYEDAISDYDEALSIRRNDYWAW FRRGDALRHLRRYEDALKSYETALSVRSDNFWIHYKIGDTLRHLERYEEALASYQKAT ELKPDDEYAWYNIACCAARVGKESLALESLETALKINLNFQIFVKTDPDLDVIQDREQ LDELLCKIAEWNP" gene 34732..34824 /locus_tag="DP116_05080" /pseudo CDS 34732..34824 /locus_tag="DP116_05080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455171.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="rhodanese-related sulfurtransferase" gene complement(35191..36717) /gene="crtD" /locus_tag="DP116_05085" CDS complement(35191..36717) /gene="crtD" /locus_tag="DP116_05085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868688.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="C-3',4' desaturase CrtD" /protein_id="PRJNA477356:DP116_05085" /translation="MPGHRVGNSKPRVIVIGAGIGGLTAGALLAHRGYSVLILDQALV PGGCASTFKRKGFTFDVGATQVAGLEPGGIHHRIFKELEIDLPQATPCDPACAVYLPG EETPINVWRDPDKWKEERQRQFPGSEPFWLLMAALFKASWEFQGRDPVLPPRNVWDFL QLTKAVRPSTFITLPYTLFTVGDALRFYKLGNDRRLKTFLDLQLKLYSQVDADNTALL YAATALSVSQLPQGLYHLQGSMQVLSDRLVEALEKKGGKLLMRHTVEHIKVEKGKVSA VVIRNQKTGEVWTEPADDVVANVTVQNLVQLLGNKAPNGYKRRVEKIPTASGAFVMYL GVDESAIPLGCPPHLQFLYDAKGSIGENNSLFVSVSHSGDGRAPEGKATIIASSFVDT KQWWLTEDYEALKQQYTQDAISHLEKFFYLKPETIVHVEAATPRSFARYTARDQGIVG GIGQRISTFGPFGFANRTPINNLWLVGDSTHPGEGTAGVSYSALTAVKQIEAQKVTRY " gene complement(36984..38162) /locus_tag="DP116_05090" CDS complement(36984..38162) /locus_tag="DP116_05090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318947.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_05090" /translation="MNMVNKMKGELPLVSVILPAYNAEAFIIRTLQSIISQTYKNIEI LVVDDGSQDKTAEIVESFAQKDRRISLFKQSNSGVASARNLAIENSKGEYIAPIDADD IWYPEKLEKQVQCMLMADQSVGLIYAWSVFIDEEDAIVGQYIPHHHLNVLSIEGEVYP AMLYTNFITNASVPLIRRVCFEKVGGYSSKFREQNAQGCEDWDIYLRIAEYYQFRVVP EFLIGYRQVKASMSNGCQTMEKSYNLVLADFRKKHPEIPAHIYHWAASSYYVHLAWKS RASGDYSSTLTWLYKSIKLDYSPLLLRPVYQCLIECLFKIAVEPITSLIWQDHHSWLQ CREKFSYPKNRVAVSQLTTVSDIQNQMYQPYKLPLKPYARIMWQRWLKVLQLCRALSN " gene 38380..39663 /locus_tag="DP116_05095" CDS 38380..39663 /locus_tag="DP116_05095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318949.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_05095" /translation="MTITLLNFPRLNAPKLYQLPDGLTIVVEQMPVEAVNLSLWVNVG SAAESDTINGMAHFLEHMIFKGTERLASGEFERRIEERGAVTNAATSQDYTHYYITTA PKDFVELAPLQIDVVLNASIPNQAFERERLVVLEEIRRSEDNPRRRTFQRVIETAFDK LPYRRPVLGPETVISQLQPQQMRDFHATWYQPRSITAVAVGYLPVEELVEIVAKGFER TLSTQHPTPNTQHAPANPEPLFTNVVRKEFIDESLQQARLIMVWRVPGLNQLEKTYAL DVLAAILGHGRTSRLVRDLREEQGLVSHISVSNMTQQLQGIFSITAYCTTENLSATEA RIVQHIQNLQTEIVKEAEIARVRTKVANRFIFANETPSDRANLYGYYQSMVGDLEPAF NYPARIQAQNTTTLMQAAQEYLSPDAYGVVVIKPF" gene 39679..40575 /locus_tag="DP116_05100" CDS 39679..40575 /locus_tag="DP116_05100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fructosamine kinase family protein" /protein_id="PRJNA477356:DP116_05100" /translation="MWTEIDANISQVTGEKFQSSQRRSVGGGCINQGYSVCDGERTYF VKFNQASQIAMFEAEALGLQQMYQTATIRVPKPICWGTAGDSSYVVLEWLEMGSGNTK SWEEMGRKLAAMHRWNPPRLGKRGSQESFGWDINNTIGSTPQVNIWTADWAEFYAKYR LGYQFQLARRKGGHFPQEKELLEAIPEILADHKPQPSLVHGDLWGGNAGCTVSGEPVI FDPAAYFGDREVDIAMTELFGGFSAAFYRGYNEVWRLDQGYEQRKTLYNLYHILNHFN LFGGSYLSQANRMISQILAISR" BASE COUNT 11717 a 8457 c 8801 g 11662 t 10 others ORIGIN 1 agtatggtgc aagatctgag aaaggctaaa gtgtcaaggc tcaaggatga aaaattttca 61 tccttgcttg tttctatttg tatttctttt ggcgaaatga agtaactaag cttttcgctt 121 aacttgataa atcgagaaaa taaaaactta cagtagggaa ttatctaata tttcttgcaa 181 gcaacgctcg ctaagatagt ttcaatattc attgagaaat gcgttaaaga tataaactat 241 gcagacaaaa gaagcccctt tccctcttcg gctttggaca gtgaaggaat atcataaaat 301 ggcagagttg ggaatttttc acccagaaga acgagtggaa ttaatcgcag gacagattgt 361 taagatgagt gcaaaaggaa ctatccatac tactactgtc aggcgtattg ctaatgttct 421 gcgtgagaaa ttacaaggtc aggtagatat tcatactcaa gatccagtcc agttaaatga 481 tttttcagaa ccagaacctg atattgctgt ggtaaaagtc gatccactcg attacgttga 541 ccaccatcct acagcctcag aagtatattt gattattgag gtagcagaca 
gtagctttaa 601 atatgattgc gagactaaag gtaaagccta cgcgaaatca ggaattctag attattgggt 661 actagatgtt aataaccgta aactccatgt ttttcgagaa ccaactcaag agaaatatca 721 cagtgaagtg attttttcac aagaggcaat tatttcacct ttgaattttc ctaacttgat 781 gattacagtt agcgatatat tgccaccagt gataaataag ccgtgacaac aatagtaagt 841 gggtcaaatt aaatataaaa cctcacgctg cctccggctt ccatctggtg cgagttccca 901 gagcgattga gggtgaggtc tgatcttaca gcagattcct agactactca tgacaagata 961 tagcttaaca gcccatctgc tgacgtgagt aaatccgatc actgcacaat aatatggaag 1021 tacaaagatg ttatgaccac aataacaaca atgacaatag acaattttac cgttcgacca 1081 atgaagaggt cagaattaga tttggtaatt gattgggcag caattgaagg ctggaatcca 1141 ggaatttatg atgctgaatg tttttaccaa gctgatcaat gcggtttttt cgtgggggag 1201 ttgaacaatg aacttgtggc aagtatttcc gctgttgctt actctaagca ctttgccgtt 1261 attggctttt atattgtcaa accgcagttt cgtgggcgag gttttggcat gaaaatgtgg 1321 agggcagcaa tggcatatct tggtactgag cgtaacatta gcttagatgg agtgattgct 1381 caacagaaga actaccagaa atcaggtttt caaatcacct accgtcatat tcgttatgag 1441 actgtaggcg gcggtctagc cccggacggc attgtagaac tcaaaactgt gccttttgat 1501 aaattagttg cttatgatca agagcttttt ccggctgagc gcaagcagtt cctacaactt 1561 tggataaacc aaccaaacag ttctgcctta ggggttgtta gagatgggca tgttgttggt 1621 tatggagtca tccgtcaaag ctacacaggt ttccggattg gaccgttgaa tgctgatgat 1681 gaacaaattg ccgagcaact cttactcgcc ttgcttgctt ttgcttctga tgctccagtg 1741 tttcttgatg taccagatgc taatccagaa gcgattaagc tggctcaacg ttacgggatg 1801 aagccagtct ttgaagtcgc tcggatgtac aataaagaaa ttcctaactt gcctataaat 1861 cgtgtatttg ctgtgacaag cttagaagtt ggttaggtaa ctctcattcg ccaatcagtc 1921 atcccttcta acggttggta acagattgta ataactctgt gatgcgccac gaaaaccctg 1981 acacatatgc gatagcaatt ttgtcaggct aactttccca aaccttaacc ttgtccataa 2041 gctctacttc ttgtgacact tgttatggta ctcccaaacc atatttattg aataactggt 2101 ctattacttt actataactt cggatagatg tataattact caaaagaagg ggagtagctg 2161 ctggcaatca tattagccag tacacctgag tcaacatact ggcaatctgc ctggttcagg 2221 tgacgagtaa gtcattagct tcttggaagt gagaccttca ctaagttcac atgcaaaagt 2281 
gtgagcttgg tggagtctca ctactcgcat caagctcaca caagaaaaaa attcttgaag 2341 gagcttgaga aagtgttaac agcttttact gcaggtttgt tgctaattac ggtttcagaa 2401 ctaggagata aaacattttt tatcgctgta attttagcga tgcatcaccc acggcgattg 2461 gtttttgcag gtgtggtagc ggctttagct gcaatgacaa tactttctgt tgtcgtagga 2521 caagcagttt ctgtacttcc aaaagtttac attcattttg cggagatagc tttatttatt 2581 ggctttggct ttaaactact ttacgatgca agtcgaatgc ctgtctccag ttgtgataca 2641 gaagttgtag aagaagcaaa agctgcagtg aaggaggcag atatgcaact gccgaaaaag 2701 aaaaatgctt tagcaatttt aacagaagcc tttgtactga catttatggc agagtggggc 2761 gatcgcaccc aaattgccac aattgccctc gccgcaggaa ataacgccat tggggtgaca 2821 acaggtgcta ttttgggaca cgccatttgt gctgctattg cagttattgg tggcagaatg 2881 ttggcagggc gaatttctga gcggaagctc acctttatcg gtggattttt gtttcttgta 2941 tttggtgtcg tcgccgccat tgaaggagcg tgagataagt agaggagtgc ggaagtgtgg 3001 taagatttcc ctccctccgt tctttcctcc gtcatcctca ctgcttaggc gttgatgtcg 3061 gcgtactagc gggactagaa gattcgctag cctgcttatt aatttcgtct ttatactggc 3121 ttggtgctaa agccgcagct tgggtaaaca aagtttttgc ctcctcaact ttgccctgtt 3181 ctttcaggag catcgccttt gctaaaacgg gacgaaaatc ctgtttatca ttgcttattg 3241 cttggtcata agcacttaga gcttgagtat agtttttttg agacgcatgt actttaccta 3301 agagtacttg tacggcgact gtatccacac ttccaggctg acttttattt gctttgcttg 3361 cagtattgag agtattttgt agcaaactaa tagctgcttc cgggcgtttt tcagacagat 3421 ggagggttac cagaccttgt aaagcctcaa gattaccagg tttcttttct aaagcagaac 3481 gataagctgt agccgctcct tccttgtctc cactttgctg cttggcttga gccagtagca 3541 ctgcatatct tgtttcttct ggattgagct gagttaattt ttccaaaggt tcaattacgc 3601 ttttgacatc acccgcgccc aagctgagta attgtagtcg agtttcaagc aagcctcgca 3661 gcgccgtttg attttccggt tctcgctgca aaacttgtgc ataaccctgt gcagcatttt 3721 ccagtttcga ttttagatca gaggaaggtg aaacacttct ggcgtttgtg gatttctcag 3781 tggaactttg cgtttcatca aatgccccaa tgagtggcac caccgaaact ccgacaaaag 3841 caaaaagtat cagcaccaac actacattaa ttatccagtg tttgcgagtt tgagacacaa 3901 aaccatcctt aaattctaat gagattatcg ttgacccttc ggctgcgctc agggtcaaca 3961 gaaaaaatga 
aagttaaaaa ttttaaaatt tttacggaaa cccgcgaatt gtttgtgaat 4021 tttataacaa aaagcaatat acgcatttca tgcaactgtt gattttagga ggaaccgtct 4081 gaaattgtag ctatgatacg cttccgaaac taacataaaa acgctaaatt acagatttag 4141 ggtgtgtatg taaagttttg cgtctttagt attgcacaga gcgttaagac gtggctgtct 4201 tgaaaggaaa tatacttaca agcgccgcac aaacgctctt ttcagctgcc tttctaaaat 4261 cccacaatgg gatgtgtctc cgccgcaagg agttttttat gaattccgac ctgatgccgt 4321 cacctgaatc gtccaattca actgcttcta aagaagcttt agagagcaaa gacacagctg 4381 ctatgaatat caatgcagca actcctgctt ttcaagaaaa aaattcacct gttatctcta 4441 gtgagactgc taatgatcag aacttagaaa accggcaaca gccgattcca gatcccagcg 4501 aaccaatgca gtaccgggct attggcttag tgcgaggtcg ctacgttgcc gccaatagcg 4561 aacaatttac tcaaggttca atagtcacta cagatggtac acaacttgaa gccgtccttc 4621 tgggtcggat tatgagttta gtcaaaaatc atttggattt ggaaaaagaa catctgtggg 4681 ttgtttatcc gcgtactcgg caagaaaacg acaaattaca tatacaaatt gtcggagttt 4741 gggagccgga aaacttggct aaacatcaaa tagccaatga cgagcaagat tcggcttcac 4801 aaaaagattt gccaataccg ggcactaatg gttcatcaag cgccctcatc ccctcatcag 4861 aagttccaga tggtgatttt tctgttcgtg gcgaggttgt ttaccagtct tatgaagctg 4921 aacacttagt catcaaaatt aggcaagctg cgcgcaaaca agatgacaaa ccaaagtatt 4981 tcaagttgaa aattaaaggt aagcttgaag ccagagcagt cggtaagttt tgggatttac 5041 aagtcaagcg cgaaactgac tcattagtag tagaaagcgc tgaggcgatc gctgatttgc 5101 ccaaaaagcg cagaccacca caaaccagac atggcggcgg aggtcgtcgt cctagtggaa 5161 aaaaaccatt tcctccaaga ccacgcaacg gcgaaattcc acgcccagta aagaaaacag 5221 gcgacacagg tgcggcgtca ataccaatat cctcaaaacc catacctaag cccgttaagc 5281 gtctgaagcc accggaacag taagaggaca agggggaagg gagtgaggca gtgatatcat 5341 gtccggatga acacttataa aacgaagaac ctcaccccgc cctccaccgc acccctctcc 5401 ttaatcaccc ctctccttaa taaggagagg ggatgggggt gaggttttgt cctcttgtcc 5461 tcttaaaatt ccgtccacct gtcttgcgga agaaaagacc gctgcatctc aattggtgca 5521 acaggctctg ttaaagtaaa atcaccacgt aaaatcgatt gtttcaactc cacggcaact 5581 tgcctagaga gaaataaact agcaaggggt gcaacgcgca ctgttctgcc ttctatggtg 5641 attcgtccag atttgagttg 
ggcgtaactg actaaaccaa aggtaggacg aacacgccgg 5701 ggaatggaaa agtctactat tggggctacc aagtctttat cttcaactgc acatcttgca 5761 acgacttctt cgttcaacac agggagtggt acacccactc ccaacatcaa agaaggaccg 5821 taacttttaa agtaacaccc acgcacccaa cgagcatcca tttgcttggc atcaccaatt 5881 aaagctaaag ttgcagccgg tccaatgggt gtatgattgg gcaatcgttt ttgtaaagga 5941 aagtgctgag tgccttccca agcaacataa ccaataccgc ctcctagaaa aattcgtgtg 6001 ccaatcccta cgagttgtag atctgggtcg ttgagtaggg gggagatggc accagggtta 6061 gaatagacgg cattgcctaa acgcggttgt agaggaccga ggtatgtgtg gagtgggcga 6121 tcgcccccat tcacgcctac tataaaattt tgataaagat tgcgtggatt gaataaataa 6181 aactgattaa ttgtttcacg ggtaattgtg gtttcaaaac tgcctctagg atagcaatca 6241 gtcacttgtc cttgtgctcg aatgtgtaca gctttaccag ctatcaaatc ttcaatgaca 6301 tgaccgccac cccgttcccg aacttcttcg ccgtccatga cttccacggc gcaactcgca 6361 cccaagtata aatctacggc tccaaaacca gagtatgccg gaacaccatc aaaccagcaa 6421 cggcgaattt ttatcggtgg atcagtatgt cctaggttaa caatggcacc actagactcc 6481 attggctcaa aagtgccagt agtaatgaca tcaacttcct tagcaacttt ggtaacacct 6541 acttctgcaa ctcttgcctt taattcctca atagttaaca ccaccgccag tttctggcta 6601 attttgtcat taatctccgc aattgttcgc atattgatag tagctaatta acagagttat 6661 aacaggattg ctgctttatg agtatagtac gagcttcttg ggatgagaaa acaagggaag 6721 tcaaaaataa aaagtaaaaa actctttcga ctcttagagg tatacctatg gctttagcgt 6781 atcccgaggg cgggacgctt tgcgacgagc gcgtctccaa aggagataca ccagttgatg 6841 ccagatacga cactgtcggg cattgctaaa gttaaggggc gcacggaaac ccgactgctc 6901 ctttgtgagg actcaacgca ctggctgctg tagcctgacc gaacccgagt cgagcagtgg 6961 tttacttttg ttgtttgact ttccatccgt tttccttctt gattttgtac agataaacag 7021 gctagtgtgt cgagtagata ttcctagtcg tgatcagatg catcttgcga aagttcccta 7081 aacactaaaa tccataggag actaaagaaa acaaagttag tcttatgggt attaaggcag 7141 caagtggtgc aaatagataa ggaggattta tgcgtgcagt actgatggca ggtggttcag 7201 gaacgcggct gcgtccgcta acttgtgatc tgcccaaacc aatggtccct atcctgaatc 7261 gacccattgc cgaacatatc atcaatcttc tcaaacggca tcaaataaca gaagtggttg 7321 ccacactcca ttatttacct gatgtcatgc 
gagattactt ccaagatggc agtgactttg 7381 gcgtacaaat gacctatgcc gtagaagaag atcagccttt aggtactgcc ggctgcgtga 7441 aaaatattgc cgaacttctt gacgagacgt ttttagtcat tagcggcgat agtataacag 7501 atttcgatct gacagcagcg attgaatttc ataaacaaaa gaagtcaaaa gcgactttag 7561 ttttaacccg tgtccccaac ccgatagagt ttggggtggt gattactgat gaagaaagtc 7621 ggattcgtag atttttagaa aaaccgtcta cgagtgaaat tttctccgac actgttaaca 7681 ctggtacata cattctggaa ccagaagttt tggaatatct gccagcaaac caagaatctg 7741 acttttctaa agatttgttc ccgttgctat tggcaaagaa tgagccaatg tatggttaca 7801 ttgcccaagg ttattggttt gatgtcggtc acttagatgc ttatcgtgaa gcgcagtacg 7861 atgggttata tcacaaagtc aaacttgact ttgcctataa cgaagtttcc cctggcttgt 7921 gggtaggtca aaacacttat attgacccag cagcgcagat tgaaactcca gtagtcattg 7981 gtgataattg ccgcattgga gcaagagtac agattgaagc cggaaccgtc attggggata 8041 atgtcaccat tggctctgat gccaatctca agcgtccgat tatttggaat ggagcgatcc 8101 ttggcgacga ggcagaactc agtgcttgtg ttatttgccg tggtgcgcgt gtagaccgcc 8161 gtgcccatgt cttagaaggt gctgtggttg gttcgctttc gactgtggga gaagaagccc 8221 aaattagccc atttgtgcgg gtgtggccca gtaaaaagat tgagtctggg gcaattttaa 8281 atattaattt gatttggggt aacactgccc aacgtaattt atttgggcaa cgaggcgtcc 8341 aaggactagc caatattgac atcaccccag aatttgcggt gaaattggga gcagcatacg 8401 gttcaacttt gaaaccaggt tctcgggtaa cagtttctcg tgaccaacgc aatgtctctc 8461 gcatggttac tcgctcatta attgctggtt tgatgtcagt aggtatcgat attcagaacc 8521 tagatgccac agctattcca ataactcgca cagtgatacc cactatgcca gtcagtggtg 8581 gaattcatgt gcgggtgcat ccagaccgcc ccgactatat cctaattgag tttatagatg 8641 ccaagggaat caatatcacc aaatccttgg aaaagaaaat cgaaggggct tacttcaagg 8701 aagatatgcg ccgggcgcaa attcatgaaa ttggtgatgt tgcttacccc agtcaggtga 8761 tggatcgata ctgcaccgct tttgagaagc tgttacatat tgatagtatt cgcaacagtc 8821 gggcaaaagt cgttattgat tacgtttatt cagtatctgg agctgtgtta ccccaaatgc 8881 tagataagtt tggggctgat gcagtggtgt tgaatgcgag tctcaataaa actgcgatgt 8941 cagcagctga gcgcgaagca cttttgactc agcttggtca tgtggtggag gcgttgaaag 9001 cgaactttgg tgttcaagtt tccgcgaatg gagaacaact 
cattttggtt gatgaatctg 9061 gcatttctat tcgtggagaa atgttaacag cactgatgat agacatgatg ttaactgctc 9121 accccagagg aacagtggtg gtaccagttc atgcttctag tgcagtggaa caaattgctc 9181 gtcgccatga tggtaaggtc attcgcacaa aagcaaatcc cacagcatta atggaggctt 9241 gtcagaaaaa ttccaatgtg gtgttgggtg gtaatggcga cactggcttt attgtgccgc 9301 aattgcatcc aggatttgat gctatgttcc ccactgctaa actgattgag atgctgacga 9361 tacaggagcg ctctctcgcc tctgtacggg cagaattgcc gcgtgtggtt cacaaagcat 9421 atacagtacg ttgcccttgg actgtcaaag gagcgctgat gcgccactta gtggaaactc 9481 acccagctca aaatctggag ttaattgatg gagtgaaaat ttgtcaaccg tatgatgaca 9541 gttgggtatt agtattgccg gatgctagtg aaccagtagt gcatttgtac gctaacagca 9601 atgatcgcga ttgggtagac gaatcattaa gacaataccg tgctcgcgtg caggcgtttg 9661 ttgaaagaga agaagaacaa gctgttgctg aggtgtaatt agcgactaca aatgactaat 9721 ggctcctagc cattagtcat ttctcttgct ctacttacct tgtagagttt tatcttctga 9781 ttcttttact catattacgc ggaaagtgac agaagatggg tatttctgcc gttctgtgct 9841 aaacagtaca actacgaact ataacgctac gaacttgtga atgttgtact taatgaacta 9901 tttctggcgt atctgtcatt agcctttttg tatcccaatt aatggtaaaa tcagatgaca 9961 gctgactgaa aaaatgtaac taaagtcgaa taaaaaatgg gtgacggaaa tgacaattac 10021 agttccagca accactacac tgaaggagtt tttggaactt ccatacgttg atgagtcacc 10081 agcttgggag tttatgaatg gagaagcgat tcagaaaccg atgggaggag gtaaacacag 10141 cctgttgcag aaaagtttaa ttgcagtgat tgatgcagca ggaagtcagt acgaaagctt 10201 tccagaatta cgctgtactt tcggtaatcg ttcagtagtt cctgatgtcg tggttgtcgc 10261 cattcatcaa ctaccgctgg atgatagcgg tgagatcatt agcagtggta ttgactttcc 10321 tccaccttgg gtgattgaaa tcctttcccc tgctcaaagt cagaccaagg taacaggcaa 10381 catcctgcac tgtctgaaaa atggtactca gctaggatgg ttgatagatc cgagcgaacg 10441 ctctatccta gtttaccatc ttgatcggct gccggatttg ttgactggag cagatgtttt 10501 acccgtgtta gaaaatataa acctgacact atccgttgac gaagtcttta gttggttgaa 10561 gcgcagataa agttttcagg cgatcgccat tggtgtctca ggggtaatga aaaaattaat 10621 aattaggtct attgagaaaa agcgatatat tttaagtata gcagtcgaag catagtttag 10681 gacataatag aaataaatga cttgaaaaaa aatgccagct 
tcttatagtt acgatttaag 10741 aactaaagtg attaatgcga ttgatggcga aatgagaaaa actcaagcaa gtcgcatctt 10801 tgaaattagt cgcaatacaa tagagacctg gttaaataag agaaaagaaa ccggagatta 10861 tcagcccaaa gtaggatatc aacaagggta taacccgaaa attgctgatt tagaagaatt 10921 tcagcgattc gctcagataa atggcagtaa aactcaagca gaaatggcag aagcttggtc 10981 agaaaagatt agcgatcgca ctataggaaa agccttaaaa aaaattggtt atacccgaaa 11041 aaaaaactta cggttatcga gaaagggatg aagagaaaag gcgggaattt cgagcaaaaa 11101 ttagtcaaaa aaaatcatcg cagctagttt acgttgatga atcgggaata gataatagag 11161 aagactatgg ttatggatgg aatcccaaag gaaaaagatt tcacgattta aaatcaggaa 11221 aaagaaacat cagagttagc attatcagtg ctttgtgtca aggtaaatta gtggctccac 11281 taacttttga aggttcttgt aatcggttag tttttgaaaa atggctcgaa gaaaaacttt 11341 taagagaact caaatcagga cagacaatta ttttagataa tgccactttc cacaaatctc 11401 agaaaaatcg tgaattaatt gaatcagttg ggtgtgaaat tgagtatttg ccaccctatt 11461 ctcctgattt gaatgatatc gaacattatt ggtttcccat taaaaatcga gtcagaaagt 11521 cacaaggggc tatcgaagat tttcgggaga gagttgatac agctattcgt ttaacgtcct 11581 aacctatact tcgattgcta taagcttaat cagcgatatg agaatgaaat tgaaaacatt 11641 atgtcatccc tgtgattaat tcgggtcttg aagatatttt aacatcccat tttcattaaa 11701 aattcctttt aattccaggg aagaaagaac cttttctcta ttccttttct cgcaatcagc 11761 aaatagtcta ctgccttcaa ctttcatgtt atagcaagtt ttctgaaaag tactctcctc 11821 tgatatcgca tttattctca gttgtccatt atcgttttcg atttgcttca aatggattga 11881 ggttgagttc ggttcactac caaattttcg acaagtggct tgcagcttaa catagctttt 11941 gtcatcggga attagaatat taaagttact gcatgtttcc gaaaatttag taggctttga 12001 attcatatgg gatgtaaaaa aacttattaa gactcctgtg actattattc caaatacttg 12061 aattgcagtc aatccaaaaa aatttctttt atggtcacaa attattttca acagtgattt 12121 atgtttggat acatttaata gacgactgaa agtatctgac acctcaatag cgagtttggg 12181 ctgtcctgtc ataacgctta aaaacaagta gtctcgtaat ggactacctt tgatcactct 12241 tatagacaag aggtgaagta ccgcatcagc aagtttaggc agaaaactgt gatgcttttt 12301 tgcgacttct aattgcagtc taccaacaga tccttgaaca gcagtgatca gtaaatcttg 12361 gagtgcagtg cggattgcat 
gtttctgttg gtgtgtaaaa gttgcaatca aaacagagcg 12421 caaccaatct ggcatgttac catagcgaaa ccagggtaag cgagccagat tagtcggtga 12481 acacacctct aaaagacaat gcccttctgc tgttttgaga acattaccca gatagattgt 12541 aatgttccaa tgcaactctg gaaaaacagc gcaagcactc agccagtaaa aaccgtcttt 12601 gccgagatat tcttgcaatg agtttaacat tgcattgatc tgctcaggtt tgggtggatt 12661 gcgctcaagc cagtaatgag gtcgaacacg caacggttct ggtagagaaa tttggtcttt 12721 ttcagagaga acgtaagttg cagatccctg atgtagaact tggctaagca cccgcacccc 12781 tttaggtgtt gcaggtaaaa taataaactg ccgtgcgagt tctaactcct gatatcccca 12841 attttctaca ggttttggcg tgagaatagc ccgactaccc caggttgtca tttgacttac 12901 ccaaggttct ggctcaccag tttgagtact aaatagcttc tctgtatctg agacgataac 12961 aaggcgatgt tgagtatatt ttgctattat ttcgtggagt ttttgtggag agcttgtacc 13021 atcataggac aagcaaattc gcggatcatc atcgaagaaa tagcttgtga taaaaactcc 13081 attgtgttga aggcgatcaa ccatctcttc cataaattta gcctgatgat ctcgataact 13141 gacgcgatta ataagaaaga gatactccgg aagaacttgg cgagtaccgt atactggtgt 13201 cagccatcct ccttggcgaa gagtggcgtc aatagttgta tcaacatcaa gttcatgaga 13261 tgggatacgg attcgttgcc ttaaactttg agcagcgtgt ataaacataa ttgtagggaa 13321 taagttttgc tcaaagtcac gaatggaaat cttatgtagc tcaggctgtt ccgtagttcc 13381 atgacgttgt aggaacaggt tcgcacgcca taaccaccat aaccgccaga ctagaaaagc 13441 aatacaaaga gtaagcaaaa aacaaaacag tgatatctgc caatccagtg gaaagtccga 13501 attaggtttt gttgcagaaa atgtcgttgc agaaggttgt gttgcagaag attttgttgc 13561 agaagattgt gttccagaag attgtgttgc agaagattgt gttccagaag attgtgttcc 13621 agaagattgt gttacctgag cgctatcatc ttttatagac tgctcgaaga gaatgggtag 13681 aaaaatgcct gttagaacaa ttgcaataag tgtctgaatc aactgacgga agccccgtcg 13741 gacttttcct aattcttttg acagtgcttg ggcttttaca tctgctctct cggttgcacg 13801 aggagtatgg ctaacaaact caatccaccg gtcaaagcgt tcctgaaaat cttcctgctc 13861 agcaggtgaa ctacaaaaga ttggcccaag cagagttttg agacgctctg gtctattcaa 13921 agtttcaccc tgagcagtta gtgctaagat taaatcctga gctgcaatat actgagaaat 13981 accaattttg taccctgcat ctctgagatc atcaataaag tcggttagaa cattgggatt 14041 
caaccaatct ggattatttt tttgtgttcc agcatccatt gttctacaag ttttttggct 14101 ttttcttgat cttcagcagt tttgactaaa ctactgaggg tggggagaat cacatctaat 14161 ttagttaagg gattagctgt attaccagca agggtttgta gagccaagag ccaacctaat 14221 aactcagctg ttgcaggttt tttccttaaa ccactttgag gagcacgtaa tcgataaaac 14281 aagttaaggg catcttgtaa aaatggattg ctgctaccag tatgaagtcc taaacggttg 14341 gcaataattt ctgcaagccg ctcatgcttt gggaagggaa tatcgtaata aatgcaacgc 14401 cgtagaaagg catctggcaa atctttttca gtattgctgg taataattaa aatgggttgc 14461 agtgctgggt cagcttcaat tattttattt ccaaactctg gaatacgaaa atacatatgt 14521 tccagttcat tgagcagatc atttgggaag tcacgaggag ctttatcaat ttcatcaatt 14581 aaaataacag aacgtgcttt gccagaatgt ggaaaatttg atggtaaaat ttgctcaact 14641 tcagccggat ttcgggttcg taaaatcgcc agccccagag cctggtaggt gatataatcc 14701 gaaaagctgg ctgagacagt accactttgg gcatcttgaa aacgtttcaa agcatcatag 14761 gtatagaaca gatcccgagc ggcactagta gacttagttt caaacttcaa tggtgggtcg 14821 aaaccaagtt cccaggcaac actgtaagca aattgagtct ttccagttcc cggttctccc 14881 gtaaggagaa gcggctgccc taataggagg gctacattac aagcatcttt aagtccttga 14941 tcggcagtgt aatattctgg cttgagcagt tgagaacggg cagagacaga cagttttgct 15001 ggcgattcat accgctttcg agttccatca cctatataaa atggaaagtt cataaaaaga 15061 gtttataatc tattgcagta caaattttta ctaaggtgtg tcaacatttt gatttttgac 15121 aatcattaat catgagtgta attagtgacg gctaagtgac tttagtaggc gagttaactc 15181 atcagctaaa tcatccatag gaattttatc ggatgacgtt tctttttccc aacgataaaa 15241 catatcttta acttcatcaa ccagtttttg aatcattacc tcacccacaa actgttttgt 15301 ttcttcactg cgtacccagt tttcaacatg cgttctgtta atatttgtta attctgacaa 15361 gacagtaaac gataaacgat tgaaatgctt aaattttgat gtattaagta cttttatttc 15421 tgtataaatc ttattattta tactttgata gcgacgccgc ttgcaaaagc ttatcaaata 15481 ccagaaaatc cgtccaaatg aaaatctttc agtgtgtttt tgtcgtttgg tttggtattt 15541 gataaacaca cagattatga gtttttgtcc tggaataaga gcaggccaat tctgccaaaa 15601 ttccagcagc ttatttaaga tttaaaatct ttgctgccgc caatcctcag tgagtaaatg 15661 agtatgaatg ataattggat ttgggtattt gcagaaggtt gcattaattt 
cttctaaaga 15721 ggcaaaacta tagcccagaa cactatctgc cagatttttg gacaaccgat ctctcagttc 15781 gtctaaattt tttaatccag caggccagtc caggcagtac ttcttaatta ccgtctggtt 15841 tggatctaat cctaacaatc tggggacaga gaacttttgc aaacgctcta aaaatttatc 15901 atggctttgg aattcattgc catgaataat gcaaaccaga ggtaatggta gaacttgctt 15961 tagcaacagt tgaagtgctt tagctagctc aaattcctga tcactacggt ttgctaaata 16021 tggcaggaga gagggaagct cccgttcaga ctgatacagt gttggttgta cattgtcccg 16081 cccactattt tgaaagatga catttccagc aacaaagaaa cctcccttac gaacttcctg 16141 aacacagaag gcttggctgc catacatttg accaatagcc tgtccttgat ttcgctcaac 16201 atcttgctga acttgggtat tgttgttccc ctgaacagct tgcattccgc cactaattgt 16261 agaatttttg tgtttgaact tcgctatagc aggcgttaag taggacacac tcaactgttc 16321 cacgaaacaa ctcaaacaaa cttgcgatcg cttcgccgct taccagttgc gttatcccag 16381 aattctcttc aaatgctaaa ccctcttttc ccgctccatg tccggaaaaa tgaacgatat 16441 gaggctggtg gtctaacatt gcacggcgta aatcttctac tcgtactgcc cattcagtag 16501 caatttcaaa ctgatctcga cgtttagcta gtttcagcgc ttcctgaatt ttccgcgcct 16561 cttcccctaa gcgcaaaggg tttgtgttct tgggatttgc tgagaacatt aagatctttt 16621 tcacggtttt tatgaatctg ggaaatttgc taacgttcaa gccttaatac aaactgagca 16681 tactattttt cggaatttcg gtctgtttct cctcaaaaac caattactga aacagctttc 16741 ttccgttgtt ttgggatcac ttccaaatac cgctgcaaag tccccaagtc gttgtgagcg 16801 ctgatttctt ggatgtgccg caagggaacg ccagcataga acttacaact gctgcaactc 16861 ctgactcaga agaaaataga acctaatcct tgacaaaatt ttgattatgt gatatgtata 16921 gtaggcataa atcacctcga ataaagtcgg ggttattacc ttgttcggac ataagccttc 16981 cttctacttc ttatttaggt gtaagtgaaa tcaacttatg cacactgcgg aagataaacg 17041 tggattgtag atagaatcaa gcattttgct gtcgcagtac aattgttttg cagatacaga 17101 caatgtttcc acacgagcgc gcacgttatg cacgggtcta gtctttttgt acagtgcgta 17161 agttctagaa aaaatcaatt ttcatgtgtt ctggtgaggc agtgcgttgg ggaggacagt 17221 aaggcgtgcg ctctgcgcac acccgaaggg cactggctcc ccttaccccc ttacacccct 17281 aattcttgac ggctaatttg gaaatattgc caattccgtt tccgggtcaa aaaagtgtag 17341 tttttctggc gtcaaagata accacagttc 
ttccccacaa gacacaaaac gctctggagg 17401 aactcgcact tgcacgtcct ttatcccaaa ggcagatttt tgaaaacctg gttggagaag 17461 ttttgtggaa agatatatat catttcccag attttctact aattccactt gcacctgcag 17521 atttttattt gcaggtacgc tcaaaattaa atgttctgga cgaattccta aaataaggtt 17581 ttgattgtcg tatttttgta gagcttcacc gcaaactgct ggtaaggtga agcgaaaatc 17641 accattggta attaattggg gagcatggaa cttcacaggg ataaaattca tcggtggtga 17701 gccaataaat tctgcaacaa agcggttggc tggacgatta taaagttcca agggggatgc 17761 aacttgttgg agttgaccat gattaagaat agcaatgcga tcgcccatcg tcatagcttc 17821 agtttgatca tgagtgacat aaattgtcgt gacgcccaac tgacgctgta gtttgacaat 17881 ttgggcgcgg gtttcggcgc ggagttttgc atccaagtta gaaagcggtt catccatcaa 17941 aaacacttgg ggattacgtg cgatcgctct tcccaaagca accctttgcc tttgtccacc 18001 agacagttgc ttgggtaaac gatttaacag gggttcaatt tgcaacagtt gagcaacagt 18061 acgcacctgc tcatgcacag cccgttctct atcagaaata tatctcagtc ccttgggaag 18121 cgtctttgtc attcctacca acaaattttc tgcccagttg gaaattcttg ttccctctgc 18181 tccccctgct tcccctgctt cccccttcgg gttnnnnnnn nnnctgcagg agggtctccc 18241 tccgtaggta tctggcgttg gaaaccctcc cgcagcgctg tctcacctgc tcccaaagct 18301 tccccacctc gcagaggacg gcgtcgtagc ccaaaagcaa tgttgtcata cacgctcata 18361 tggggataga gggcgtaatt ttgaaacacc attgcgatgt cgcgttcttt ggggggtaaa 18421 tcattgatga ggcgatcgcc cacccaaata ttaccgcccg tcatcgtttc caagccagca 18481 attaaccgca gaagggtgct ttttccacaa ccggaaggtc ccaccagcac cataaattca 18541 ccgtctgcga tcgttaggtt aatccgccgc aggacgttga ggctttctcc acggtgaaca 18601 actgtatcgc caacatttcc agatgtgaga gagggttgtg tttgggatac aacaccttcc 18661 cctcttcgcg ggggaaaact tttataaacg ttttctaaaa caacttgcgc cacgagaatt 18721 taaggtaact gtaatagtca ctagtcatta gtcagtagtt ataaattact aaagacaaag 18781 gactaacgac agataacgaa ggtttagcgc ttatcataca cgctagaaga gaatacgaaa 18841 agtgaactac ctacaccgac ctgacggtac ggttaggagc atctgccgtc ggctgtgccg 18901 acagtcacgc atgtgccgtc gggtttcccg acagtcccgc acctggcgtg gtttccccat 18961 tcgcgactgt cgtctcaaca cgctccctgg gtacaggtac agtgtgagcc ttcttgctga 19021 attaggtaaa 
aacgtgacct aagcgcgaag ttaatcgctt ttggtatttt cccagctcat 19081 gtcaagataa atttctaagt ttttattttt gttgtgttca ttattacaaa tgaccaatga 19141 ttcattaaga gttttttatg gtacaattat actaattaag tgtcaaaatc aatgcttttt 19201 tcagtcaggc ggaggttgta aacagggtat tgctcgttgt ttgcgacaat tgcttggctt 19261 ttatgctaga aatgactgaa aactgagctt tttcaaataa atttgtcaaa aagccagata 19321 ttttcatgat tgatctatct tcaaaagcgt ttatgcgaat cccttcatcc actctttccg 19381 acattctatc aatcagcaag ggcgctacat ctttttttat caattagcac tctattttga 19441 caaggaggat ttcatttcat gaatagttgc atattgttag cagaaattat tcaagaacca 19501 caactgcgct ttacagcaga taacctggaa gtgactgaaa tgctggtgca gtttccttca 19561 gtgcgagagg gagaacctcc ggcaacctta aaagtggtag gctggggaaa cttagcaaaa 19621 gagattcagc aaaactatca ccaaggcgat cgcgttcttc tggaaggacg tttagcaatg 19681 aataccatcg aacgtccaga agggtttaaa gaaaaacgtg ctgaattgac agtgcaaaag 19741 attcactctc tgggaacagc gattgatact agctcatcac cagcagctgt caacacacaa 19801 ttaacgcctg tagcatctcc tccaaaaacg actcctactc acgagtcgcc cctccccaca 19861 ccaactccag tcacaagcaa cgttggtgtt cttccccaag acgacagacc acaacaacgc 19921 agatcccaaa gcactaatct tgagcgcaat attgagcgca atacctatgg agtcacacca 19981 accgaagaac cagatccgga cgatattccg tttgtgcgat ctgtttactc ccgaaccagt 20041 tgggcacatg agttatgtga ctcttatgag ttagaagcta atgcttactc aaaaacagta 20101 gagcaaatta aaccctaaaa aagttgaact tgcttaacag ggtgatggta gtgtgaaatt 20161 tcacccctgt gggattccca gtcacagaac ataccacacc ctgagattga ttatcaaaaa 20221 atgcatagaa atctatactg tgagagtatt ttcttgcgat aattcccaag gggtaaacgg 20281 agattttatg gagcaaaata aatgactctc aaacggttgg gtttaatcat actgacactg 20341 gttgcaacac tgtttgcagg ctcggctttg ttgggtagtt ggcaggaacc tcagttccaa 20401 agtcgtttgg aactctacca aacgaatatt gtgctacaag ctgcttcttg ggaaccgcca 20461 aatgacaaca cagatttgaa gccattgcaa gatgcactcc ttggcgtcaa acctgttgaa 20521 agcgctacag aacaatatca acagacgcgc caatctgctc aaactaattt ggaaaaagcg 20581 aaaaagcaac tagcacaggt tcagtctcaa cccgctacga ctcctacaac cccaaaatcg 20641 caatcggaat atcctcctgt tagtaacacc tcaagtgaat tacagcaaca acaattgcag 
20701 caatcgctca aaggactgca aaaattgatt gctgaattag acttgcgact gggaatttta 20761 caagcgaatc aaggaaagat agatgcagct ttgaaaagtt ggagtcaatt gcagcaacga 20821 tcagacatta atcctgaatt tgataaagct gctgctgttt tggctggtct ttggagtgat 20881 cccccgcgta ttttgccaaa tgctgaacga ctgattcaaa ataatttaga aggttggttt 20941 cgctatactg ctgttgatca gctataccga cttcaacaac gtcaagatgg tttagcagat 21001 cttgaaacca ttcaacaaaa aaccgctgaa caagctattt tgaaattagc agttattggt 21061 actattcctg ccctaacagc attttttggt gtcgggctgg tgatatttct gataggtcag 21121 cgcctaatca aagggaaaac atctctattg gctgaaaatt cagatgtgcg ttggtcaaca 21181 ccttgggatg gtgaaacaat tttacaagtt tttgtcatag gctttttctt gatgggacaa 21241 atcatagtgc ccctgacaat ccaactgctt cccctacagc gtcctaccca aaatgtcaga 21301 atacaggcat tgtatgtctt gataagttac ctgttagtgg catttggtgc cctatcagtg 21361 ctgtatttat ctatcaaacg cttttttcct ctaccagaga actggtttcg ttttcggtta 21421 cagggtagct ggtttttatg ggggctgggt ggctattgcg tagctttgcc catagtagtg 21481 gttgtgtctt taatcaatca acagctatgg caaggacaag gtggtagcaa tcctttgtta 21541 caactggcat tagaaagcca agatactgtg gcgcttgcta tattcttctc taccgctgct 21601 attgctgccc caatttttga agaagttttg tttcgcggct ttttgttacc ctcccttacc 21661 cgttacttat ctgtatggag atcaattctt gcgagtggtt ttctgtttgc agttgctcac 21721 ctcagcttat ctgaaatttt accgctcttc gctctaggta ttgtcttggg tgttgtttac 21781 acgcgatcgc gcaacctcct tgctcccatg cttcttcaca gcctttggaa tagcggcact 21841 ttgttaagct tgtttgtctt aggtagtagt ggtcaataat actttgtgtt tgccgaaaac 21901 aatacttaga aaattttcgt gatttttaca aaactttaca aaataattct tggtatttag 21961 tgcacattag tgctattttt aatgcacaga tattgtcaag ggtaaaactt actacaattt 22021 tttgtaagta taaaagtcta aacccttcat aaggtgaaat aaaaaactag caactccact 22081 tttgatgaat cgtcaagatc cttataacgt ggagtgcaag aacttatctg attgtagagg 22141 gcagttaagc gcttaaaaac atattagtat gtcaacgaaa attaaaaaat tccggaaata 22201 acaatgctta aattgtaaaa agtaaatggc atcaccttag ctttgcgtaa atgcccgaaa 22261 attcttttta gagttctttt ctggcatgaa atttcgtcgc aaaactgctg tactggaatt 22321 tctcgttaaa gtaaagaaac aggggttatt gaggagtgca 
gtttcagatg ctgtctgaaa 22381 acttgctcaa ttctaaactt tagtgtcctt tgtaaaggag tggcatgcca cttttgtatg 22441 ttggtttttg tatttcggtc gtatttccac tgaaagacaa aaaagtatga agatcaagta 22501 agcagacaaa cagaacttat ttgagaaaaa tatacagtct taattttata catcacatag 22561 aaagtgcagg agcttgatac atatggcaaa tcttactccc agtcacgcca ccaaacaact 22621 tttggctgga tactgcggca ttatttttgg aggttttggt gttcataaat ttattcttgg 22681 gtatgcacca gaaggattga taatgctggt catttcttta gtgggtggtt attttaccta 22741 cggtttgaca ttgttgatta tgcagcttgt gggtttagtt gaagggatga tttatttgaa 22801 caaatcccat aatgaatttg ttgatactta cttccttaaa aagcaggggt ggttctagaa 22861 atctttatcc cagtattgaa tctgtttttt agatttctat tctcggtaca tcaggtcata 22921 attttacagc agtaggatgg gcactgccta tcctatcaat gaatatgcaa aacttatggc 22981 tatttgcagc agatagctac tattccttat tccttgctta tgtgttacaa accggaaaat 23041 tcaatctctt tttgattcat tcacctttga gtaaatatgt ttctgtattg atcacaatag 23101 tcttgggcaa aacgcagaag ccggaatgtt tccaaaagat tttgacgctt actttgtaag 23161 tatggcaagt catccagtca tagagaatga ctgctggagg gcgcagcgtg tccgcaggac 23221 atattgtgac gagtgattct gagttctttc tgattcagta tcgattcatt attaatacca 23281 ataaatcaga agtctactac tgactattaa cagacttttt gttgactcgc aaatatttta 23341 tttcttctac gaatatgcta acgcttaaac gtaaaaatgt aacttggact atactcagct 23401 tattaggatt cgtgtatttc tgtacgatgt ctaatatggc aataaatcct ttctggagaa 23461 gcgagatgac tttgatttca ctacaagctg ttgcagtgat atctgtcact tattttcgtt 23521 tgactcgtcg gtgataatat tctttttagt gaaaatgtta cggaaaatca attagaaatt 23581 caaaacgcaa aatgtttttc cataacagat acacaactac accccaatgc actcataaat 23641 taataaaact taacaattat gacaatcagt caaagctttg gatttcgcct acgctcaaat 23701 taaagacgct tttagagagg cgttaaaatt tcttagcaaa cccaaaaact ttccatgcaa 23761 attgtctcaa aaactaacct ttccccagaa gcagcctcga cgacagagaa aatcactttg 23821 gatgtcacag gcatgaagtg tgctggttgt gtaagggcag tagaacgtca tctgactcag 23881 tatcctggag tcaaaagtgc ctgtgtgaac ctggcgacag aagtagcagt tgtagaattg 23941 gaagctggtg cggtagatgc ggatgcacta gctaaaaaat tgacagcagc tgggttccca 24001 actcaacctc ggaaacttgg 
tggaaaagta gcaggagaga cacaattaat acaagatcca 24061 gtccaacgac agcggcaaga aatgcaagct gtcaaacggc aattcctcat tgctgtcgtt 24121 ctgctagttt tatcgtttgt tggtcatgtt gctaacatta gcggtcacgt gataccagtg 24181 ttgcataaca tctggttcca ctgtggattg gcaacggtag cacttctggt tcctggtcgt 24241 ccaattttag tggatggctg ggtgggttta cggcacaatg caccaaacat gaacaccttg 24301 gtgggattgg gaacactcac agcttacacc gctagtttgg ttgccctgct atttccccaa 24361 cttggttggg agtgcttctt tgatgaacca gtgatgatgt tgggctttat cttactggga 24421 cggacactag aaaaacaagc tagaattcgc gcggcagcag catttaagga attgcttgct 24481 cttcaaccac agctagcgcg gttaattgct aagcctaaga cgactgaagc aactccctca 24541 gtctcttcca gtgcaggggt cgtagaaatt ccagctgagt tggtgcgtgt tggtgaatgg 24601 ttacaggttt tgccaggaga gaaaatcccg gttgatggtg aggtgcggga aggacaaaca 24661 acagtcgatg agtccatgct gactggggaa gcagtaccag tgacaaagca accaggagat 24721 ttggtgacag cagggacact taatcaatca ggggcgatcg cagttcaagc aactcgcgtt 24781 ggaagtgata caaccctagc tcaaatcgtc gccttggtag aagccgcaca aacccgtaaa 24841 gcgcctgtac aaaaattagc agatatagta gctggatact tcacctacgg tgtgttgaca 24901 gcatctttgt tgacatttgt cttctggtac ttttttggaa cccacatctg gcatcacgcc 24961 gttatggcgt atgccatgca aatctctcac cactccttat tcagcacatc tcatacccca 25021 cacctgacaa tctacacacc actcctagtg agtttaaagc tagcaattgc cgtcatggtc 25081 gtcgcttgtc cctgtgcttt aggacttgcc acaccaacag cgattctagt gggaacagct 25141 attggtgcgg aacggggtct attaatcaaa ggcggtgaca ttttagaaaa agttcaccag 25201 ttaaacacag tggtgtttga taaaactggt acccttacca caggtcgtgc cgtagtaaca 25261 gattgcctac cttgtcaagc agaggaagaa aatgcttttc gtacccctgt tgctcttctt 25321 caactggctg cagcggtaga aagcggtaca attcatcctt tagcaacagc aattcaacaa 25381 gaagcgcaac agcaagagtt atctattcca gatgctgcag actttcacac agaaccagga 25441 cttggcgttt ctgctgtagt agaaagtact ttggtgcttt tagggaactg cgactggttg 25501 caatggcacg gtatttctat cagtgaaact gcccaaaaac agtctcaaga gttagctgca 25561 gatggaaaaa caatagtttt tgtggcagtt ggagatacag tagccggact cattgctgtt 25621 caagatactc taagaccaga tgcgaaagct acaatagaga aattacgcca aatgggttta 25681 
cgggttatgc tgctcagtgg agatacgcaa gacgccgcaa gtgcgacagc aaaacaacta 25741 ggactcaata ccggtgatgt tatggcaggt gtacctcctg ttaaaaaagc tgctgccata 25801 caagaattac aagcacgctt aacaaaggga aggactcagc actctattgt cgcgatggta 25861 ggagatggta ttaatgatgc cccagctcta tctcaagccg atgtgggcat tgctttacac 25921 tctggcactg atgtcgcaat ggaaacagct gaaattgtct tgatgcggga taacttaagt 25981 gatgtcgtcg catccataca acttagtcgt gcaactttaa gcaaaatccg tcaaaattta 26041 ttttgggctt ttgcgtataa cacccttggc attcctttag ctgctggtgt tttattgcca 26101 agtttcggtt ttgtcttaaa tccatcgggt gcagctgcac tcatggcttt tagctctgtg 26161 agtgtcgtca ctaattctct tttattacga cgttttgctt atcgctctta atttcctctt 26221 ttcgaggtat tatttgtgat cataaatatt tgattttgcc actctcaagt tcaactaggt 26281 tgatagattg ttgtttagag cacaaaacct gttaaaactg ctacaacatc tatagtagtt 26341 aaaagagtag aacatattgt tctactcttg tgtacaacta aggaatttaa atgcatataa 26401 gataattacc atagactttt actaaaaaaa atggcagcaa acaatactaa agaaattttg 26461 attaataaca gtccaactca cgcatttagc aacgatgtgt caatggcagc acaaacaaat 26521 gaaagccatg tactgataat agaacatgat caagaacgca gagagttaat tcttgatcgg 26581 cctgtttact ccattggtag agattcctgc tgcgatatct gtttcataaa ctcgctgttt 26641 gtctcgcgcc gtcatgccac actcattagg gtaccgcgcg acgataagaa gcatagctat 26701 tattatcgaa ttgtcgatgg tgatgctaaa ggcaaaccga gttccaatgg tttgatgatt 26761 aatgggcgaa aaatactaga tggtctgatg attaacgggc aaagaatacc agctcatgac 26821 ctgaaaaatg aggacgaaat cgtttttgct cctcaagtac gtgccattta ttacctacgg 26881 cgaaatagta tgcccgcagg agaagaaaca gattccagtt acgatgacat tacactaata 26941 gaccccagaa tgaccaatga tattgaggac taaaggcata ttcattctcg aatgagctgt 27001 tagccataaa gaatgtactt ttattcatca ggggagctaa cactaaacag tcaaaagtcc 27061 agaagaactc aagagtgttg atctttgact gttgagcggg ttaaggtcaa aaaagtatca 27121 gaattttcgt aaaagagtaa ttacgccagt tgacggtagt ttggacacac caggaaaacc 27181 acccacttgt gctacaataa aagactttga ggtatgtttt gggcataaga ggcagggagg 27241 gaagaccctc ctgcaagtgt atgttatgtc ctatacctac agcacgcggc ttgggcgaac 27301 aggcacgcta ctttgaacgg acatgccctt gtcgcacagc aacacacttt 
gcaatcagcg 27361 ccagtcgcca aggcgcaccg aaatgtcggt ggatagcacg cttacttccc aagggcccac 27421 gaccgtgcat gcaccacagt cgcttacctt ctggtaccct cggtgcatag cgctgactat 27481 ttttgacgaa caccaagcgt actagcctgc tatgctctca ataagatgcc aatagcggca 27541 atgttcaata gcctgcttaa catattcaaa tatttgtcgg cagtgattat ctctttgaag 27601 tggtagttcc tcatttttaa tcacaatact catccgggtt tctgtgtcat taggtctggt 27661 tttatcaatc aatatttcca cggttactag cttggaaatg gatacagaac caagagcttc 27721 ccgtgccagg atgtaatcag atgtatagta ttgaacatcc aaatgacaat cttggagaag 27781 ttctacaagt gaagcttgaa gttggtcagt aggaacagaa agaataaatg aacacgtata 27841 gcgagccata atagccccac gccttgcaac aatccgtcca tcctataata tcgaaccatc 27901 taagtcatta tagattcatt aaagaatcaa aattgagttt gcacaaaagc aacacactgc 27961 acacaaacaa atttttgact taaaactgca gtagcgaaac caaaagtttg atagaaactt 28021 taaaataacg gtttggggag aatgattatg aaaatagcaa ttgctggagc aacaggattt 28081 gttggtagtc gtttggtaga acgattacac ggagaaggta tggaagtggt cgtgttaact 28141 cgtaacacaa cctatgccca aaaggttttt ccatctacgg cttttccaaa tgtagaaatt 28201 attgcctaca cacccaatac atccggttct tggcaaagtg tcatctctag ctgtgatggt 28261 gttgttaatt tagcaggtga acccattggt gaagcacgtt ggacacctga acgcaaacag 28321 gaaattttaa acagccgtaa gttcgttacg caaaacattg tggacacagt aatcaatgca 28381 aaccctaagc ctagtgtctt agttaacgct tcggcgattg gctactacgg cacaagtgaa 28441 acggctactt atgatgaaac cagcttacca ggtaacgatt ttcttgccca agtctgccaa 28501 gcttgggaag cagaagcgag taaggtaaaa gatgttggtg tacgactcgt cattctgcga 28561 ttcggtattg ttctcggtct gggtggtgcc ttgggtaaaa tgattacacc attcaaactt 28621 tttgctggtg gtcccatagg gagtggtcgg caatggtttt cctggattca cgttgatgat 28681 gttgttaatt tgattttgca agctctaatg aaaccagaaa tacaaggagt ctataatgct 28741 actgccccgc atcctgtacg aatggctcaa ttgagccaag tgatgggaaa agtcatgaat 28801 cgtccttcct ggttacctgt tccagctttt gctattgagg ctcttttagg ggacggggca 28861 atagtggttc tggaaggtca acaagttctt cccaagcgca ctttagaaag tggctttgag 28921 tacaagtatc ctaatttgga accagcactg gcagagattt tgaaatgaaa aattaagact 28981 gcgggcggtg cgtagagaat cttcaaattc 
tcgtaggatg ggtagtgcct tataaaccct 29041 tatgagacaa gattaggagt tgctagacag tgcccaacgc cagatgctac cctacgggaa 29101 gccgcctccg gcgtctacaa gccgggaaac ccggacgcca gatacctacg gagggagacc 29161 ctcatcaagt actggctctc caacgcactg gctcccctac aaggaaattg ttgtgatttt 29221 tctgtattag taattttcgc tcgtcgtctg cgtaacaaag aaccgattta ctcacccgac 29281 tgaaaaaaaa ttacccattg tcagttttga tgaatttttg ggaaacccaa ttaatgatgc 29341 cactgatgat ggcaccagcc ataaggatgc caatagcagg gttaaataca ggttgccaaa 29401 ggctgtgcca caaatctaag cccagcggaa ctccaactgt tttaccaata actgcaactg 29461 caaaccaccc aaaaccaaac aagactagga acacatctgc cattaaaacc aagtttagcc 29521 agtttaaaaa tttctctttc atatctattg gttatgaata accagtcaat actttaaaac 29581 tattaactgg tgatttggac tgttgagctg ttattcctct ggaaaaagcc agactttgac 29641 ttttgccaaa ctttgccagt ctggttttct gcccaaaccg tcttcaatac tttttttgat 29701 ttcctctcgc cagtctgggt tgacaaaaat caactgttgt gtaaagtcta cgtgttgtcg 29761 caaacctttt tgaccggttt ccagcattgc aacggctagg ttcacccgtg cttgtggatc 29821 ttgtggatta agctttacag ctttttgtgc agctttgtga gcaagattag gattgttatc 29881 caatagatac aaccatgcca aacaagtcca ggcagcacta cttttgggag tgcgatcgca 29941 aatttcttta aaaacaggga ttaaatccgc tactggttct cctgctttat aacgttctaa 30001 acctgtatca aacagagatt caactgtgtt agtcattagt cattagtcac tactcattaa 30061 ggggcagtgc gcccttgtgg aggacagtgc cggggtgagg cagtctctca tgggggggga 30121 accccctctc tcgcactgct tcaccagtcg ccgccctgtt ggcttccaac tgagctgcgc 30181 tggctcacaa atgagtatat tacactccaa atgatttacc gcaaccacaa gtttgagtgg 30241 cgttggggtt agtgaattga aaaccgcccc caatcataga attgctgtaa tctagcatca 30301 aaccatagag atataataaa cttttgcgat cgcagacaat tttgaagcca tcataatcaa 30361 aaacttcatc ttgaggtgta atcttgctgg cgtcttcaaa atccatcatg taagacattc 30421 cagagcagcc accttgacgc actccaaccc gtaagcatag gtcttgacct tgctgatttc 30481 gcaagaattt cacctgctgc aatgctgctt cgctcaactg aattccccgt tgtaaagact 30541 gagttgcttg tgtcatctgc tgtttatact ccttagctgg tgtgagcttg attttgtagc 30601 ggcaagctct ttggttgaaa ctccaagctc accgcatagc aggctgcaaa agtttttgta 30661 ttcattctag 
cgatattcat gtgctaggaa tgcaaattgt aaatactctg tcaaaagcat 30721 gcaaagattt ttattctaga attgagagat tcaatgtgtt ttgagattcg atattttatt 30781 cgtccaaaca ttgtggtagc taccagctgc aactacttgt ccacaagcag ccaaagagtt 30841 tttatccgag gctagcacca aaccaaaagt ttaattccat attgcttgtt ttgattaatt 30901 tacctctatg acaagtttta atcgctctac cagtcgtcgg ttgaagaaat taacccaaat 30961 tccttctgtg tgggagggcg atcgccgtcc attgtcgtca tcacaaaccc agaactcaga 31021 tccagatgtc aaaggcgaat gtattctttg ggtggatggc tcacaaggta ttgtccgggg 31081 tatggacgta gtagcaccag acactggtcc agaagcaatt gttcgtacct tgatgcgagc 31141 gatggagcat cctcacaatc ccgcaaaacc tgctcgtcct caaagaattg tggtgaaaga 31201 ccgcgaaatc caattttacc tgcgcggagt gctacaggat ttggatattg cgattgacta 31261 tacgccagat ctccctttaa ttgatgaact tttccgtgga tttactgaaa tattagatag 31321 ccaagttcct gatttacctc cacaatatgc acaagcgctg cgagaaaaag catttgcaat 31381 ttggcaagca gcaccttggg aattcttgga agaacagcaa atcttgtcca tagaaatcaa 31441 taaatgggat gtcggtacac tctacgccag tgttatgggg atgctgggga tggaatatgg 31501 gattttgttg tatcgttcag aagactcttt aaagcgtttc cgcacaagtg ttttaaaaga 31561 tgacgaatcg caagggcatt tagaagaagc ttttttaaaa caagattgtc tatttctgac 31621 ctttgaaaat gctaacgata ccgaagaaga ggaggacgaa ttcgatgact tggcagattt 31681 gccaatatcc gaaattgaac ccacttttgg taatatccat cctttagaag gactgcgttc 31741 tgttttgtat gacgaagaag cacttgtcgt ttatgtggca atagaaagtc tttcccgctt 31801 tatccgtgat taccgtaatc agcttggtgg tgataccttc cccgccctga atcgtcgcta 31861 tcggatctcg ttacctgagt cgtcaaatga accaacaaaa tcagtgtcta tcgctgtctc 31921 ttccatgcca cagttagcaa ccgagttaga ggaaatagca ggttttgaca cttcagaaga 31981 cgaaagcgag tcaccagcat cagcctttga gtcattacga gatgatttga tcccagaaga 32041 ttcatttctc agtttaggag ttgtgtcatg ggaaatgttg gattacttgc gtcagagtgg 32101 aacatatcac acaacagggg aaatcacaca agcaggggat ggtttgccag tgattttggt 32161 tcaaacttcc cgtcccaagg caaagactgt gatttcaaat attgaacaag ctggaggact 32221 cagggcaatt tgttttaatc caggtgctga tccctttgat gggacacgct acgatttagg 32281 tttgttacaa accgaaaacg ccgaattatt cctgtttggt gagtttgaag atgacgatcc 
32341 aatacacgta gaagctcgca agaaatggaa tcaacgatgt aaaaataccc aagggtactg 32401 tggcttgatt attgctaaag gattgacagg ggcttctcgt ggtaatcccc agttacgaga 32461 tatgatggct ttgtttgaag cacagttcct ctcacccaaa gatttgggtc ttggaactct 32521 tcaacttatg cctcaacttc aatttgaaga aacctgatct gcacggattt tagtattcag 32581 aaccatccca cggatgaatc ctggggatgc gcgtaaagat gctttgtttt tcttttttgc 32641 acggataaat ctggtctgtt gataccctga gttattctca acttataaac ccctcctatt 32701 gaagaagaaa atttttatca tcacggcgcg aggtggttct ggctttggat ctggtgagcg 32761 caacgagaag ctaaatcatc aagatttata ttgaagaaag atctttgaat ttattggcat 32821 caccgacatc acctttattc atgttgagaa tgacgagtta ggtggcacaa gtttggcgaa 32881 cttgatcgcg gctgcccgtg cccaagtgac tcaattgctg ggaaggtaat tctcagaagg 32941 ctctatggca atgactaacc actcactttt cgcattgaag cttttctcgg cactaggctg 33001 cgggctgata gccggagtct tcttcgcctt ctcgactttc gtgatgagcg cccttgctcg 33061 actgaagccg acgcagggga ttattgccat gcaatccatc aacatcacgg tgatcaatcc 33121 gttgtttttt acggcattat tcggaaccgc tgtggcttgt atctttctgg ctgttttctc 33181 agtgttaagg tggcatcaac ccggtgcttt ctacttgctc gttggcagct tgctttatct 33241 tgtcggcact ataggcgtga caatcgtgtt caatgtgccg ctgaacgaag ccttggcgat 33301 cgtcgatccg ggcagcactg agggcgcgaa cctatggtct cgctacctta tcaactggac 33361 aatctggaac cacattcgag cagcagcggc gcttgcagca gcggcatcgt ttactatcgc 33421 tctctgttat cgaacgtcac aatcttgaca tactcccacg gcttttagcc gttcgcccaa 33481 gccgtgtgcc gtaggtatag ggattccaag cattgatact gtaatttttg acactaagcg 33541 taacagtaat aagatttgtt aaataaaact agcaattaaa aacatttttg cgtattccat 33601 acctctaagt agattttgtt tatcctgtaa ccttaagtaa tataaatagt gtgcttgtct 33661 gtaacgataa taactttttg gggaatcaac aatgagtcaa gcagcaccag cagaggtaga 33721 tagttcatta ctactcgtca cacagaacta tagagagtgg tatcgacagg gtaaacggct 33781 cgaaaagcaa gagaattacg aagaagcgct cacttattac gacaaagcaa tagagtgttg 33841 tcctgatgaa tactggctct ggtatgaccg gggtagcgtg ctacgggaat taggtcagta 33901 tcaggaagca cttgctagtt ttgaccgagc gttaaaactt cgctccaatg attactggac 33961 gtggtacagt cggggctaca ttttactaga agaactagac 
cagtttgagg aggcgatcgc 34021 gaattttgac aaagcccttg caattcgccg cgatgattac tgggcatggt ttcgtcgggg 34081 agatgctttt agacatttag agcgctatga agatgcaatt tctgactatg atgaagcttt 34141 atcaattcgc cgcaatgatt actgggcgtg gtttcgtcgg ggagatgcat taaggcattt 34201 acgtcgttat gaagatgcgc tcaaaagtta tgaaacagct ctttctgttc gctcagataa 34261 tttttggatt cattacaaga taggcgatac gttgagacat ttagagcgtt acgaagaggc 34321 gcttgcaagc tatcaaaaag caactgaact gaagccagac gatgagtatg cttggtataa 34381 catcgcttgc tgtgcagccc gagttggaaa agaatcgcta gcgcttgaaa gcctggagac 34441 agcgctgaaa atcaatctaa attttcaaat atttgtcaag actgaccccg acttggatgt 34501 catccaagat cgtgaacaac ttgatgagtt gctgtgcaag atagctgagt ggaatccata 34561 aagttggatc agaaaccttg ttgggttttc atatcgtaaa cccaacaagc tatgaaaatg 34621 cacaataaga ctaagctaca acgaagacta tcactaaccc gaatctttcc aaaacagagt 34681 taagcacctc aaaagtaacc aagtcttttt gtggtgttgc atatttgcgg gatgatttta 34741 ttttctttgc ttactaaaaa acttctttct tggcgctctt ggcgacttgg cggttcgtac 34801 ttcatcccct ttttgtgcaa tgcccttttt gttaattatt atgtgtgcag aacgaaaacg 34861 tcagtatcac gaattttgtc cttgtcgcga ccagtctcat ctatagccaa ggcatctcct 34921 tttacctgcg ccttgctaat cgtcacattt ggttgctcac ggacaatagt ataagcagtg 34981 gcattggtgt aagtgatatc tcctagagct acagcatcta ctatagcaat cgcaccacct 35041 ggaaaagcat atgcattacc aagaacttca gccgttgctc caccaataat tttgttggca 35101 aattctactg cttcacaaaa gttcaagtca gaaataatca ttgttttatc ctgggaatta 35161 cgcaaaatgt ttggctgtca tcatgttatc tcaatacctt gtaacttttt gagcctcaat 35221 ttgcttcact gctgtcagcg ccgagtaact cacaccagca gtcccttcgc ctggatgggt 35281 ggaatcacca acaagccata gattgttgat gggtgtacga ttggcaaaac caaagggacc 35341 gaaagttgat attctttgac caataccacc aacaataccc tggtctcgtg ctgtgtaccg 35401 tgcaaaactg cggggagtag ctgcttctac atgaacaata gtttctggtt tcaggtaaaa 35461 gaacttttcc agatgggaga tagcatcttg tgtatactgt tgttttaacg cctcataatc 35521 ctcagtcagc caccactgct tcgtatctac aaaagaagaa gcaatgattg ttgcttttcc 35581 ttctggggcg cgtccatctc ctgaatgact cacagaaaca aacagggaat tattctcacc 35641 aattgaacct ttggcatcat 
acaaaaattg aagatggggc ggacatccta aaggaattgc 35701 gctttcgtct acacccaaat acataacgaa agcacctgaa gcggttggta ttttttcgac 35761 tcgtcgtttg taaccattag gagccttatt acctaataac tgtactaaat tttgcacagt 35821 gacattagca acgacatcgt cagctggttc tgtccagact tcgccagttt tctggttacg 35881 aataaccaca gcacttactt tgcccttttc gactttaata tgttcgactg tgtggcgcat 35941 gagcaacttg ccaccttttt tttctagggc ttctactagg cgatcgctga ggacttgcat 36001 actaccttga agatgataca gtccctgcgg caactgggat acacttaatg ctgtagctgc 36061 atacagcaaa gccgtattat cagcatctac ctgagagtag agcttgagtt gcaaatccaa 36121 aaatgttttt aaacggcgat cgtttcctag cttgtagaaa cgcaacgcat ccccaactgt 36181 aaacaaagtg tagggtaggg ttataaacgt actaggacgc actgctttgg taagttgcag 36241 gaaatcccac acattacgtg gcggtagcac tggatcgcgt ccttgaaatt cccaactcgc 36301 tttaaacaaa gccgccatca acagccaaaa aggttcgcta ccgggaaatt gtcgctgtcg 36361 ttcctctttc catttatctg gatctcgcca aacgttgatt ggtgtctcct cacctggcaa 36421 ataaactgca caagctggat cacaaggcgt tgcttgtggt agatcaattt ctaattcttt 36481 gaaaatacga tggtggattc ctcctggttc caacccagct acttgagtcg caccgacatc 36541 aaaggtaaat cctttacgtt taaaggtaga agcacagcct cctggtacaa gggcttgatc 36601 cagaattaag acactgtaac ccctatgtgc tagtaatgca ccagcagtca gtccgccaat 36661 tccggcaccg ataactataa cgcggggttt gctgttgcca acacgatgac ctggcattga 36721 tatttacgac atatttctta atatttataa ttataatgtt agaatatgta gaaatactaa 36781 aaaaaataaa gttaaacagt tgggacaatc aagaagttgc gtagttgctt ggtaatctct 36841 tctgaggaaa agcactgctt agctttacag cggtcacgtt taccgagtta tagggtaagt 36901 catgaaccct tccaacgtac aatgactctt aactgaaccg tattaggttt taaaaaatca 36961 ttaaaaacct ttaaagactt aatttaattg gataatgcac gacaaagctg caaaactttt 37021 aaccatcttt gccacataat ccttgcataa ggtttcagcg gaagtttgta aggttgatac 37081 atttgatttt gaatatcaga aactgtagtc aattgactaa ccgctaccct atttttagga 37141 taagaaaatt tttctctaca ctgcaaccaa gaatgatgat cttgccaaat caaagatgta 37201 ataggttcaa ccgcaatttt gaaaaggcat tcaattaaac attgataaac tggtcgaagt 37261 aacagtggac tgtagtctaa tttgatgctt ttgtataacc aagttagagt actcgaataa 37321 
tctccacttg ccctgctttt ccatgcaaga tggacgtagt aagaactagc agcccaatga 37381 tatatgtgag caggtatttc tggatgtttt ttgcgaaaat ctgccagaac cagattgtaa 37441 gacttctcca ttgtctgaca tccgttagac atacttgctt tgacctgacg atagccaatc 37501 aaaaactcag gtacaactcg aaattgataa tattcagcaa tccgcaagta aatgtcccaa 37561 tcttcacaac cttgtgcatt ctgctctctg aatttgctgc tgtaaccgcc aactttctca 37621 aaacaaactc gacgaatcag agggacacta gcattggtga taaagtttgt atatagcata 37681 gctgggtaaa cctctccctc tatgctgaga acatttaaat ggtggtgggg aatgtactgt 37741 cctacaattg catcttcttc gtcaataaac actgaccaag catagattaa tcctaccgat 37801 tggtctgcca ttaacataca ttggacttgt ttttctagtt tctcgggata ccaaatatca 37861 tcagcatcaa tcggtgcaat gtattcacct ttagaatttt caattgccaa attacgagcc 37921 gaggctactc cggaatttga ttgtttgaaa aggctgatgc gacggtcttt ttgagcaaaa 37981 gattctacaa tctcagccgt tttatcctgg gaaccgtcgt caactactaa aatttctata 38041 tttttatatg tttgagagat tattgattgc aaggtgcgaa tgatgaaagc ttctgcatta 38101 tacgcaggaa ggatcactga cactaacggc aattcaccct tcattttatt gaccatattc 38161 atgcaacttt tgtttgtgac aatcaagtca acaaattacg attacttatt cttcattatg 38221 tatgaaaagc atcaatggta ttatcagtaa taatatgcaa aacttgaaaa aaaattattt 38281 actatttctt aacaaaaccc cactcacaca ctgataatcc cggtacagat ttcagtccta 38341 ggtaagataa agaggtgtaa gcttacccca caagttcaaa tgaccatcac actgctaaac 38401 tttcctcgtc ttaatgctcc aaagctttac caattgccgg atggtttgac tatagttgta 38461 gaacagatgc ctgttgaagc agttaacctc agcttgtggg ttaatgttgg ttcagctgcc 38521 gagtcagata ccattaacgg tatggctcac tttttagagc atatgatttt taagggaact 38581 gaacgactgg caagcggcga gtttgaacgt cgaattgaag aacggggtgc tgtaactaat 38641 gccgccacaa gtcaagacta tactcattac tacattacca ctgctcccaa ggattttgta 38701 gagcttgccc cattgcaaat tgatgtcgta ctcaatgcga gtatacctaa ccaagctttt 38761 gaacgcgaac gattggtcgt tctggaagaa attcgacgtt cggaagataa tccccgtcgt 38821 cgcacttttc aacgagtgat cgagacagct tttgataagt taccttatcg gcgtccagta 38881 ttgggaccag aaactgtcat ttctcaattg caacctcagc aaatgcggga ttttcatgca 38941 acttggtatc aaccccgttc aataactgct gtggctgtag gttatttacc 
cgtggaagag 39001 ttagttgaaa ttgttgcaaa aggttttgag agaactctaa gcactcagca tccaacccct 39061 aacactcagc acgcaccagc taaccctgaa cctctgttta caaacgttgt ccgtaaagaa 39121 tttatagatg aaagcctcca acaagcaagg ctaatcatgg tttggcgagt acctgggttg 39181 aaccaactag aaaaaactta tgctttggat gttttggcag caattttggg acatgggcgg 39241 acatcaaggc tggtaaggga tttgcgagaa gaacagggac tcgtttccca tatttctgtc 39301 agcaatatga cgcagcagtt gcaaggtatc ttttctatta cggcttattg tacaactgaa 39361 aatctgtcgg caacagaggc tcggattgtg cagcatatcc aaaatctgca aacagaaatc 39421 gtaaaagagg cggaaatcgc tcgtgtgcgg acaaaagtgg ctaacagatt tatttttgcc 39481 aatgaaacac caagcgatcg cgccaattta tacggttact accaatcaat ggtaggagat 39541 ttggaaccag catttaacta cccagctcgc attcaagccc aaaatacaac taccttgatg 39601 caagcagcac aagaatacct ttccccagat gcctacggtg tcgttgttat taaaccattt 39661 tagtttccgg ttaccatgat gtggactgaa attgatgcca atattagcca ggtgactggc 39721 gaaaaatttc aatcgtcaca acgacgctca gttggtggtg gatgtataaa tcaaggttac 39781 agtgtctgtg acggtgagcg cacctatttt gtcaagttta atcaagcgtc gcaaattgca 39841 atgttcgagg ctgaggcgct aggcttacaa caaatgtacc aaacagcgac tatccgcgtt 39901 cccaaaccca tttgctgggg tactgctggt gattctagct atgtggtgtt ggaatggcta 39961 gagatgggtt caggaaatac caaatcttgg gaagaaatgg ggcgcaagtt agcagcgatg 40021 caccgatgga acccccctcg ccttggtaag cggggaagtc aggagagttt tggttgggac 40081 ataaacaaca ctattggttc cacgcctcaa gtgaatattt ggacagcaga ttgggctgag 40141 ttttatgcta aatatcggct aggttatcaa ttccagttgg caaggcgcaa ggggggtcat 40201 tttcctcaag aaaaggagtt actagaggct attcctgaaa tattggcaga tcataaacca 40261 cagccttctc ttgtacatgg tgatttatgg ggaggaaatg ctgggtgtac tgtttcagga 40321 gaaccagtga tttttgatcc ggcggcttat tttggcgatc gcgaagtcga tattgccatg 40381 acagaattat tcggtggttt ctctgctgcg ttttatcggg gttataacga agtttggcgg 40441 ttagatcaag ggtatgagca aagaaaaact ctctacaacc tgtatcacat cttgaaccat 40501 ttcaatttat ttggtggtag ttatttgtcg caagcaaacc ggatgatttc ccagattttg 40561 gcgatttcgc gttaactcag atcttgcacc tcggcgttag tacgccttga actgaagttc 40621 aaggctcata gccaaagtcc gttaaaa // LOCUS 
NODE_621_length_39870_cov_5.306844 39870 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 39870) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 39870) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..39870 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 152..1570 /locus_tag="DP116_05105" CDS 152..1570 /locus_tag="DP116_05105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_05105" /translation="MSRVSDNSSVREQFFDNDKPQIWWLIAVTLPVIVATGVLSIAKM EQVKKFNTPVSSAPITSINALGRLEPQGEVFKLSAPVGVQGTSRVEQVFVKQGEQVKK NQIIAILDNFSSSQAAMEEAKAKVQEARANLANVKAGSPREIEAQRAVIARLEAQLRG ELDAQQATITRLQAELGGEKTVLQATINRIKAEVQGQRDAFQATVSRIRAEQRNAEVD AQRYQMLYAEGAISQQERDRKQLGAETSTQQLVEAQANQRKTVATLRQQLAEAKANQV KTIATLQQQLVEARVNRNKILATLQRQIDEERAKFNRLKEVRPIDLLVAQAQVSNAIA ALKRAQAQLNLSYIKAPISGEILKIHTKAGESINTNGIAEIGRTDQMIVIAEVPEDSI SKVRLGQQAVITSDNGAFSGGLQGTVAEIGRKIGKKDVLNTDPAADVDARVVEVKIVL TPQDSKRVTGLTYAKVAVEINL" gene 1617..2771 /locus_tag="DP116_05110" CDS 1617..2771 /locus_tag="DP116_05110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216453.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_05110" /translation="MMTKIPLAWLQLKREKTRLAVALAGIAFADILMFMQLGFRDSLY FSNVRFHTSLQTDIVLINNQSNALLSMKGFSQRRLYKALDVQAVQSVHPIYLDYTAWK NPLTGRSRNLLVIGVNPAVNVLDLPGVKENLDKLKLPDVVLYDRSSRQEYGPIATEFE QGKTVAAEVNSRRMKVGGLFTLGASFGADGNLITSDLNFLRIFPLHKQGIIDIGLIRL KPGADANTVAQSLRNYLPRDIKVLTKQEFIDYERYYWESGTAIGFIFTLGTIMGFIVG TVIVYQILYTEVADHLGEYATLKAIGYTQNYLLIVILQEALILAILGYIPGFAITMFL YARARDATLLPVLMSVGRAVMVLILTFLMCFISGTIAMRKLRSADPADIF" gene 3043..6288 /locus_tag="DP116_05115" CDS 3043..6288 /locus_tag="DP116_05115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013567850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AcrB/AcrD/AcrF family protein" /protein_id="PRJNA477356:DP116_05115" /translation="MWIVQLALRRPYTFVIAAVLILVLGVVTITRMAIDIFPEINIPV VSVIWSYNGVAPEEMEQRIVTVSERAFTTTVNDIEHIESQSLNGVSVIKVFFQPGAKV EAAVAQLTSVTQTILRVLPPGITPPLIIRYNASSVPILQLSVSSKSLSEQELYDNGNN FLRTQLATVQGASVPLPYGGKPRQIMVDIDPQALYAKGLSATDITTAISAQNLILPAG SAKLGEREYSVRLNSSPDAVDALNNLPIREVNGTVLYIRDVAQVHDGFAVQTNIVRQN GRRSSLITVLKNGSASTLDVVACVKEAIPRIAATLPKELHLELLFDQSLFVKASIQGV LTEGLIAACLTGTMILLFLGSWRSTLIVTISIPLSILCSIITLRLLGQTLNIMTLGGL SLAVGILVDDATVEIENIHRNLGQGKPLHQAILDGAQQIAVPAFVSTLAICIVFVPVV FLTGVAQSLFMPLGMAVVFAMLASYVLSRTVVPMLSQFLLKHEVHLYTDHEHSNGNGH HTDTHSSSAGKDIFWRVHEQFNRQFEKFRNRYRRFLAWALSHRRQVFVMFGAFWVSGL VLLPFVGQDFFPQVDAGQFRLHVRTPAGTRVEETERIFTQVENVIRQAIPAQELEIIL DNIGLPVGGINLAFSDSATISAADGEILVALKEGEHHPTWQYVKQLRQKLEAQFPQLT FFFQPADIVTQILNFGLPAPIDVQVIGPSRNRKANYAIAKQIEAQIAKIPGTADVHLH QIVDAPELRINVDRTQAQQIGLTQRDVANNLLTSLSSSGQTSPNYWLDPVKGVSYLVA VQVPQYKINSLEALQSTPITNNSTSPQLLSNLATVQRRTTMAVVNHYNVQPVYDVYAN VQGRDLGAVSRDIDKVLAQFRPKLARGSSIVVRGQVETMNASFLGLGVGLLFAIALVY CLMVVNFQSWIDPLIIMMALPNALAGIIWILFITNTTFSVPSLMGAIMCIGVATANSI LLVTFANEQRLEGKKALSSALAAGYTRLRPVLMTASAMIIGMLPMSLGFGEGGEQNAP LGRAVIGGLFAATFATLIFVPVVYSILRRKQPHNLDDTMLPSLMEADGVMR" gene 6460..7809 /locus_tag="DP116_05120" CDS 6460..7809 /locus_tag="DP116_05120" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_923821.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux RND transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_05120" /translation="MTLIAGIGALVTGLLAIGILPRLQQRAELKALAKSAQTDVPTVN FVKPKRAANFTVLSLPASIQANQETSVYARTNGYLRRRTVDIGDKVQAGQILAEIDTP ETDQEVAQSRAELARAQANLAQARANLAQKQSNFSEAKSNLAARQAELMQARTNLELA RQTWQRWQELQQQGAVTQQAADERKTSFSANLANVDAVKARVNSDQNSVNAALASINS DQANVNAYLASVAASRASVEKSVVLQSFKRVTAPYNGVITGRNVETGGLISAGSNSNS SNAWLFKIAQTNSLRIRVNVPQTLIQSIRQGQTAQIHVRELPSKPFTGKVVRTSDSLD PKTNTLLTEIQVPNPNDTVRPGMYAQVTFTTTRMNPPMLIPANTLVVNSEGTQVASVT RDQTVHYHKVELGRDYGTEVEVISGLNPNESLITNPSDDLGEGARVQAVAVKPKGKS" gene 8256..9410 /gene="corA" /locus_tag="DP116_05125" CDS 8256..9410 /gene="corA" /locus_tag="DP116_05125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196359.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="magnesium and cobalt transport protein CorA" /protein_id="PRJNA477356:DP116_05125" /translation="MARKLRPSPKILTKPDNEDDFYHEPGTIPGTIIIPTDASPPEMT LIDYSTGDVIRKEIKTPEECADYLDTASVSWVEVQGLGDEDILRRLGKVFDLHPLVLE DIVNLVERPKIEEYEDYLLIICRMVVPKEKRYGFYSEQVSLVLGKNYLLTVQEEPEHD CLEGVRARICNNKGIIRKRKADYLAYSLLDAIIDGFFPVLELYGERIEELEEEVMVNP SRQTLQKIYQVRRELLQLRRYIWPQRDAINSLIRDGNELISEDVRIYLRDCYDHAVQV MDMLETYRELATGLMDVYLSAVSNRMNEIMKFLTVMSSIFIPLTFVAGIYGMNFNTDK SPYNMPELNWYWGYPMCLAFMAAVAFVLVFIFWRRGWFQNFSQINSDYKI" gene 9468..10178 /locus_tag="DP116_05130" CDS 9468..10178 /locus_tag="DP116_05130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196360.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05130" /translation="MGNADNNLLILTLYIIGITYTINQMVESIAEQVKIEFNKALVDE QLKDQSLQDKIGISFGGLAGTFDITKPQSLSINIENKSQDLAIYVDWDNCAFEEFDGT SKRVIRMSPDITRDLGVFQSPSLIVPKKTLKESVSSESVFQFDKLSATYTATKNSIAN VLKWKTSPIRSQRIEFNRFMSRKRNFDFSLDLVFRLAETNFGVAQGVNAPPLCIVKCP FTVRKLPWTYALPWNKRR" gene 10289..10752 /gene="rnpB" /locus_tag="DP116_05135" ncRNA 10289..10752 /ncRNA_class="RNase_P_RNA" /gene="rnpB" /locus_tag="DP116_05135" /product="RNase P RNA component class A" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00010" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="Derived by automated computational analysis using gene prediction method: cmsearch." /db_xref="RFAM:RF00010" gene 10882..11601 /gene="rnc" /locus_tag="DP116_05140" CDS 10882..11601 /gene="rnc" /locus_tag="DP116_05140" /EC_number="3.1.26.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316385.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease III" /protein_id="PRJNA477356:DP116_05140" /translation="MSIAYPRRQRQLESLVRKLGLQTDAPIKWQLLDLALTHPTVSES ANYEHLEFVGDAVVRLVAAIVLWENYPNCPVGDFAAIRSVLVSDRILAQLAREYGLEL HLLVAGSATADKVGQESRLADAFEAILGALYLSTHSLELIRSWLDPHFKKLAAEIRLD PARFNYKAALQEWTQAKYKVLPEYRVIEMNQPQHNQERFLAEVWLHGEKLGQGKGRSI KAAEQAAAKVAFLSLDHQEKP" gene 11791..12945 /locus_tag="DP116_05145" CDS 11791..12945 /locus_tag="DP116_05145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316386.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gfo/Idh/MocA family oxidoreductase" /protein_id="PRJNA477356:DP116_05145" /translation="MIKKIKIAVVGVGRWGVHLLRNFLEHPQASVVAVVDPNPECLAA VKRQHNLGNEILLTTEWQAIQQVEGLEAVVIATPASTHYSLITDALHWGYHVLAEKPL TLNPAECRELCQLAQKQQRQLIVDHTYLFHPAVTRGKAVVQARQLGDLRYGYATRTHL GPVRQDVDALWDLAIHDIAIFNSWLNQIPVKVQATGTVWLQPSVGGVGGAEEQRSRGA GGAGEAGEDKEITSPSLPHSLPPSLQNQGLSDLVWVTLTYPNNFQAYIHLCWLNPDKQ RRLGVVGNLGSLIFDEMSRTSPLTILHGEFEQQENRFVPVNQKQQVLEIETGEPLGRV CNHFITCVLENSPSKVSSGLVGTQLVQILAALTESLNNGGKPVFLNANED" gene complement(12950..14980) /locus_tag="DP116_05150" CDS complement(12950..14980) /locus_tag="DP116_05150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316387.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_05150" /translation="MAKPRQSSFRRILVSKILLLLVPVLLIGEIVAYKKARSSLLETA RQNLTESAIIKGEKIVETSAALKANLLSASQTTVIQSGSPTEVQRFLNQVAQRLPRQV ECIQLTEPESGNIVASTCGEQPIGELKFPIPNDGINVEAILPPKLGTTGRKDLPNQLR LLLSSPIYDSIGYLRYALIFRATILQENGRTKPGSLTGSTMVIADDGTVMAHPIANRV GTNIKQHADASRLKQILRAALAGRKYFLHFFFENEGEELLAGYTAIPSPVTEGQEGKW VIIAVTELDNALYGLEGIKIILIVLTLGLIGASVLASLYLARYLARPVEQLRDYAINL HLNQSTEPIPHNFKIREFNQLAQALEQMVERLKAWAEELEIAWKEAKTANQVKSQFLA TTSHELRNPLHTIINCVRLVRDGMCDDREEELEFLSRVDDAAIHLLGIINDLLDISKI EAGKLSVVLQPIDLRQILKEVINIQSVNVQQKGLQLNIPQLNETIPVNADTAKLKQVL INVIGNATKFTEQGSITISTEIQRRDDKSEVIISVTDTGIGIDPVQQQKLFRPFVMVG SNSGKFGGTGLGLAISRNLMELMGGTITLESAGLDQGTTMKITLPLIDGSQLPDAEGK KNLENLSVSSGDQAVRGEKQAAEEVTLQKNRFPSTLNKELKNSTLSMQNAKL" gene 15307..15835 /locus_tag="DP116_05155" /pseudo CDS 15307..15835 /locus_tag="DP116_05155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873099.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="riboflavin deaminase" gene 15907..16089 /locus_tag="DP116_05160" /pseudo CDS 15907..16089 /locus_tag="DP116_05160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407982.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="riboflavin deaminase" gene 16139..17320 /locus_tag="DP116_05165" CDS 16139..17320 /locus_tag="DP116_05165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869809.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_05165" /translation="MVEQLQPRYSVAWINKIAEVPQEGWDALALSLKTPFLEWEWLKN LETSQSATANTGWLPNHLTVWRDRTLIAAAPLYLKGHSSGEFVFDHQWADLAQRIGVK YYPKMLGMTPFTPAEGYRFLIAPGEDEDEMTALMVHEIDAFCTKHRISGCHFLYVDPE WRPVLERQGFTPWLHHSYIWENLGFNNFDDYLAVFNANQRRNIKRERKAVSKAGLKLQ PITGDEIPKSLFPLMYQFYADTCDKFGWWGSKYLTKRFFEQLHAHYRHRVVFFAAYTE QDHRQPVGMSFCLFKGDRMYGRYWGSFQEIDCLHFDACYYSPVEWAIANGIQVFDPGA GGRHKKRRGFPAAANYSLHRFYNGRLAQILGHYISEVNEIEQQEISAINAELPFARSN P" gene 17457..17840 /locus_tag="DP116_05170" CDS 17457..17840 /locus_tag="DP116_05170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457998.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05170" /translation="MDLTVENLAAIDNKLSQRHIDLDPGGYFIIYLNREEGLIYAKHF TNVIDERGLAVDPETGKVIPARGKVERTHTTVFSGRTAKELCVKIFEETQPCPVTQLS HAAYLGREFVRSEISLVTGQDYVQD" gene complement(17960..18703) /locus_tag="DP116_05175" CDS complement(17960..18703) /locus_tag="DP116_05175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873095.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YgcG family protein" /protein_id="PRJNA477356:DP116_05175" /translation="MKQLLNRVLSLKKRIQRLLLPSVMVILASLLFAASASATGVYDM PTITSGEPTWVVDQADVISRLNEGKLSSTLENLAKQTGNEVRIVTIRRLDYGETPVNF TKALFEKWFPTKEAQANQTILMIDTAKNGSAIITGDKVKSVMSDDIALSVASETLTVP LRDGDKYNQAFLDASDRLVAVLSGEPDPGPPQITNNVPVKSNFATAEDTNTNKGNATA WVIGLLIAATVIPMATYYIYQVFQPSSNG" gene complement(18854..19393) /locus_tag="DP116_05180" CDS complement(18854..19393) /locus_tag="DP116_05180" /inference="COORDINATES: protein motif:HMM:PF01757.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_05180" /translation="MDVSRPWLVGSFTLGMLAAEIGFSQKPHWIRLRKSVPWAVLAVI FAGIAFVTEWKRLGLDAWIFESFAALAAACLIIYCTNFILETNTLPKGLRVLESPVAI TLGAFSYSLYLTHGVIVTLVHHFLHNLQLPAITYTVLLYLVSVGISLVFAYLFYLAFE RRFTTSGRLKHKSIDLKSH" gene complement(19428..20615) /locus_tag="DP116_05185" CDS complement(19428..20615) /locus_tag="DP116_05185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015222875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_05185" /translation="MKARYQFRFYPTDQQQRSLAQLFGCVRVVWNDALALCKQSEKLP SNNDLQKLVITQAKKTESRVWLGEVSNIPLQQSVADLGVAYKNFFDSLKGKRKGKKVG TPRFKKKTSQQSARFRIGGFSIKGRRVYLAKIGEVSPIWSRELPSAPSSVTVIKDCAN RYFLSFVVEVEPIQIDAKNQSIGIDLGIKTFAVMSNGEKAESPSYSVLDRKIRKLQKK LARQPKDSKRRVKTRIQIAKLHNQITDTRKDFLHKLSTKIVSENQTIVLEDLNVSGLV KNRRLARSISLQGWREFRTQCEAKSAKLGRNFRVISRWEPTSQICSECGYKWGKLDLS IRSVLCLSCGAEHDRDENAAKNINKVGTGHCHDSKWTQRRDKTISVASVNEASRITDP LGR" gene 20776..21174 /gene="tnpA" /locus_tag="DP116_05190" CDS 20776..21174 /gene="tnpA" /locus_tag="DP116_05190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011056255.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS200/IS605 family transposase" /protein_id="PRJNA477356:DP116_05190" /translation="MTSQFRKERHSVTDLKMHLVCVTKYRRSVFTSESLGLIEKSFRE VAQKMDFVVLEFNGESNHVHALIEYPPKLSVSQIVNALKGVSSRRYGQAGYKKLHKEA LWSPSYFAVSVGGAPIEVLKRYIRNQEKPS" gene complement(21273..21743) /locus_tag="DP116_05195" CDS complement(21273..21743) /locus_tag="DP116_05195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Swarming motility protein ybiA" /protein_id="PRJNA477356:DP116_05195" /translation="MTIYFYKVWQPYGCFSNFSPHGIVMQDIFWSTVEHYYQAQKFVG TSDAMIIPLIHSAETPELAAALGRDCTRQVRLDWEEVKTQVMREAVLLKFLTHSDIRE ILLTTGDNLIVENSPTDYFWGCGVDKTGHNHLGRILMSVREEIHNLPSLSVVSD" gene 21904..22392 /locus_tag="DP116_05200" CDS 21904..22392 /locus_tag="DP116_05200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2243 domain-containing protein" /protein_id="PRJNA477356:DP116_05200" /translation="MEAKTEKTNQRVPLIVAGIFLGVGLSGFFDGIVLHQILQWHHML SNVRPLTTMSNIDVNTVWDGLFHAFDWIMTVIGVVLLWRAGGREDVPWSSNIYFGSIL IGAGLFDVVEGVIDHQILGIHHVKPGPNQLAWDLGFLAFGALLVIVGLVLVQKNGKNY EL" gene 22430..23035 /locus_tag="DP116_05205" CDS 22430..23035 /locus_tag="DP116_05205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05205" /translation="MSADAFEKTSWVWQSSQFQQQVGEWFEYQLSRFDWALPKFSPQW SISPWMLKLLNFIFWLLLGLFVVWVGWRLWRELRPYFNSWLAGHNWTNSQAKTAESEL SVDQLLTRSQEFSRQGNYRQACRYLYFAMLQHLHGQGILPHKSSRTDGEYLQLLRMFA ISIQPYETLMTTHEQLCFGNAEISADNYQHCQQAYREISNT" gene 23032..24123 /locus_tag="DP116_05210" CDS 23032..24123 /locus_tag="DP116_05210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4350 domain-containing protein" /protein_id="PRJNA477356:DP116_05210" /translation="MKRSNRLTWIGAIALGVMLLLSFFTAPTSSNINSGSTYNRAPDG YGAWYAYMENQKTGIKRWQKPFSDLNTEKRPITLLQINSRLGDGLYDHEKQWVEKGNN LVILGVGAASTAADFTTMQKSLRGDVKIETRRRRRLKTQEKVSLGDRFGAVVWEENYG QGKAIFSTTPYLAANAYQDYQSNFQYLSDLVTQKGHLLFVDEYIHGYKEPSVRKREGK GDFLSYFTKTPVFPAFLQAGILLLVLIWSQNRRFGKPVALDVPVVDNSEAYIKALAGV LQKAKSSDFVVEMIGKAEQLQLQKALGLGQELLDQQTIINAWVQQTGIPPTELEEVLK RQSQKRHMSESELLSWLGKWQTLRRIKNS" gene 24395..25345 /locus_tag="DP116_05215" CDS 24395..25345 /locus_tag="DP116_05215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412568.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium chelatase" /protein_id="PRJNA477356:DP116_05215" /translation="MSEINPVLNRLSQELNRVVVGQSTLIHQLLIALLAGGHVILEGV PGTGKTLLVKVLAQLIQADFRRIQLTPDVLPSDITGTNIFDLNTRSFSLKKGPVFTEV LLADEINRTPPKTQAALLEAMEETQVTLDGESLALPELFWVIATQNPLEFEGTYPLPE AQLDRFVFKLVVDYPDQAAEKQMLLNRQAGFAARRLDIARLQPVATIPQILSARQVVR EVKVSETIIDYLLLLVRTSRQYPDLSLGASPRAAGLWLQTSQAAAYLAGRNFVTPDDV KAVASPLLRHRLILKPEAMLDGVQIDGAIASILNKVPVPR" gene 25389..26393 /locus_tag="DP116_05220" CDS 25389..26393 /locus_tag="DP116_05220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05220" /translation="MAQSDAYERIAKAALNQYGVVQKELRFLGHSGNVTFYVEAPEEK FLLRIHQPFSGLQDDIWLRPDVIESELLWLVALRHQTNIIVQEPVQNLEGRWVTQVLA DDTQDVFYCSLLRWIDGYVSDTHRTPQQAYQLGSLTAQLHRHSSQWKLPQNFVRPIFD ENRLRAALSAFYPAVSYGLISPEHYRMLTQATQKIESMMKTLGQAQDVWGLIHADLHD SNYLFHNEEIRPIDFARCGFGYYLYDIAESIQYLLPQVRFSFFEGYQTIRQLPERYLE IVEGFFIMAIIYNYSFHLNNPKEHEWISNDVQHIAKRHLHKYLEGESFLLESQLVPTE " gene 26479..27834 /locus_tag="DP116_05225" CDS 26479..27834 /locus_tag="DP116_05225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877978.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF58 domain-containing protein" /protein_id="PRJNA477356:DP116_05225" /translation="MLPSKRVYFLLILGIAIASTLAFFISIQVSLFITLLFDMTVLGL MVVDGLQVRQHRVQMTREIPSRLSIGRENPVLLKVTSANANAIIQIRDDYPTSFGVSV PALRATVPSQSTQELTYSVHPTRRGEFSWGDIQVRQLGAWGLAWDDRKIFHSLKVKVY PDLVGLRSLSIRLVLQSSGVNRQSRQFNIGTEFAELRNYRAGDDLRFIDWKATARRVG AYGTPPLVKVLQSEQEQTLVILLDRGRLMTAKVRGLQRFDWGLNATLSLALAGLHRGD RVGVGVFDRQMHTWIPPERGQHRLSHLIDRLTPIQPVLLESDYVGAVTSVVQQQTRRS LVVVITDLVDMTASGELLAALTRLTPRYLSFCVTLRDPLVDHLAHTTTPPQSPPFKGR EDGVIAAYTRAVALDLLAQRQVAFAQLKHKGVLVLDAPANQVTDQLVDKYLQIKARNL L" gene complement(28051..29022) /locus_tag="DP116_05230" CDS complement(28051..29022) /locus_tag="DP116_05230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136948.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="stage II sporulation protein M" /protein_id="PRJNA477356:DP116_05230" /translation="MNIQRWIARREPNWQRLDALLRQVEKKGLKSLRATEIRELASLY RSVAADLARARTQQLGNILIQNLHLLTNRGYSQIYQGSRRQEWQAVVQFYKWGLPAVV QQTFPYTATAIALFLVGAIIAWWYAWQDPTFLSIVVPEKLIKLVRDEQKLWMGSIVGI EPLASTGIMINNLSVSFGAVAGGITAGVFTAYLMIFNGLSIGAIATLVGQNHLAYPFW AFVFPHGSLELPAIFFAGGAGFLLGRAILFPGKYRRVDALKFYGSQAVQLIFGIVPML ILAGIIEGFFSPHPSVPEVFKYLVGMGLLMLLVAYCSRKFKAVKPDN" gene 29106..29978 /locus_tag="DP116_05235" CDS 29106..29978 /locus_tag="DP116_05235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456237.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF975 domain-containing protein" /protein_id="PRJNA477356:DP116_05235" /translation="MSYNIGSPSPIQPLSLGNVVSAGLRLYRSHLKDYFLLALKAYVW LLVPFYGWAKFYALSALISRLAFGELVNQPESISSGQRFVNSRLWQFFITILLMFLLI VGIYIGAVILAVILAVILGLIFYLLGVPFGGIAQQGDTGAVLITVVSLIIIIAFLIAF LWLLMRFFLVDVPLAIEDNIDARSTIARSRELTQGYVWRILFISLVAFLITLPFQIVV QIITTIIPLIFAPLVEQNSLVFSTIVFLLNLAVSFASGAVVLPFWQAIKAVIYYDLRS RREGLGLRLRDHEI" gene 29999..30787 /locus_tag="DP116_05240" CDS 29999..30787 /locus_tag="DP116_05240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873609.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RDD family protein" /protein_id="PRJNA477356:DP116_05240" /translation="MHIFNRVKFRTPESVELEFTLAGIGNRAWALLIDYLVLSVILIL FLIAWITVFIQLADLWKFIFRDQAGFWLVAIAFLIGFAIYVGYFVFFETLWQGQTPGK RFAKIRVVRDDGRPIGLQQATLRALLRPFDEVLFIGAILIMFSNQEKRLGDLAAGTIV IQTQTPTTSTTLTISEQAKSVSEQLLQIADLSAMLPDDFAVIREYLHRRPAMAPKART SVALQLAKDVKAIIHLENIPEAVTPDVFLEAIYLAYQKFSNFSH" gene complement(30800..31555) /locus_tag="DP116_05245" CDS complement(30800..31555) /locus_tag="DP116_05245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873608.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1868 domain-containing protein" /protein_id="PRJNA477356:DP116_05245" /translation="MTLSEAYKTQIQHIQTSSKFQPHSDQGRQAVPFPGYTVITPPWE EETDNSTFYAHLQDYQQELLQLRVNSDWIVPVPPASFHLTLADLIWDSAYYDAREKNP KFEEQLSSCFVDIFRQYQQSTQAQTYPIRWQMQGLVVMPRAIGVCLVPQNEASYEQIV NLRRAIYQNSHLMALGIEQHYHFSAHVTLGYFGEIAPNLDHENLATMFSQLNQQWAEN SLELSINRAELRKFDDMTRYYRQPDWPILDFSH" gene 31801..32787 /gene="holA" /locus_tag="DP116_05250" CDS 31801..32787 /gene="holA" /locus_tag="DP116_05250" /EC_number="2.7.7.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209308.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit delta" /protein_id="PRJNA477356:DP116_05250" /translation="MPVYFYWGEDDFAIEKAIALVRDRVLDPLWTSFNYTVLPNDLAD APIQGLNQVMTPPFGTGGRLVWLANTTVCQQFSENVLSELERTLPVIPEDSYLLLTSS HKPDERLKSTKILKKFAEFKEFSLIPPWKTELLMQSVSQAAQSVGVKLTQSSVEMLAD AVGNNTRLLYNELEKLRLYAEGSNQPLDTDAVAQLVRNTTQNSLQLAATIRTGDTARA LAILIDLINACEPPLRIVATLIGQFRTWLWVKLMIESGERNSQVIAKAAEIGNPNRIY YLQKEIQSVSVQQLLATLPILLDLEVSLKQGASDMSTLQTKVIELCQVYQRT" gene 32894..33373 /locus_tag="DP116_05255" CDS 32894..33373 /locus_tag="DP116_05255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05255" /translation="MKRYYHRFFRLSLIRILAQSLFVGTVATASLVSSTLVLSSKADA QAAQAVNPGELRNYARAMLKMEPERQQAFDDIKKIMGTGEVPKIVCNDNNSFSSLPGK AREIAVNYCQRYQKAVEDNGLSIDRYNTITTQVQGNDDLKRKMYNELLRQQKMPKSP" gene complement(34217..34834) /locus_tag="DP116_05260" CDS complement(34217..34834) /locus_tag="DP116_05260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015201045.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="helix-turn-helix transcriptional regulator" /protein_id="PRJNA477356:DP116_05260" /translation="MITTRKISPLSREKTVQRQQLDNIELHDSLRAYFFQEVIEGLQD GILLLNETGELIHANTSACNIVSQINQDSSNYIPPAIWSLCETLLENRSNLAKKTMIL SHEIVVDKSKVFCVRARWLNLEKCNRSYLLVSMENKYESLKNIAIVEVNKYKLTRREA EIWCLYRGKFSYKEIADQLYISMNTVKKHMRNIHAKRQAFLNCEN" gene complement(35216..35476) /locus_tag="DP116_05265" /pseudo CDS complement(35216..35476) /locus_tag="DP116_05265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411584.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 36202..36457 /locus_tag="DP116_05270" /pseudo CDS 36202..36457 /locus_tag="DP116_05270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493310.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 36461..37753 /locus_tag="DP116_05275" CDS 36461..37753 /locus_tag="DP116_05275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454197.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="murein transglycosylase" /protein_id="PRJNA477356:DP116_05275" /translation="MRKTLTLFFLSLGVALFIPLCSVFAQVPDLPPIPIPVPSQPEIL PLPVPVPVPVPKPTQFRHLMSPLMRVDVATICSSSYSCLGWDEQIWGENGKAGDSKAL LTAIDNSLYYLTTDKAAAVYRNYPIREITLDRVRRSLLRFRQLVVSCESPAQLQAAIR QEFVFYQSSGNDRNGNVRFTAYYEPVYTASRVRTPVYQYPIYRRPPDFERWAKPHPKR VDLEGKDALLGDRSRLRGLELFWFANRFDAYMVQIQGSAQLNLTDGTTTSVGYGGATD YPWTSIGKELAKDGKLPLSGLTLPVMTQYFQQNPNQMDNYLPRWKRFVFFQETGSTGA KGSILVPVTAERSIATDKSIMPPGALALINTSLPFPTEYGQMVPRTVSRYVLDQDAGS AIKGPGRVDYFMGTGKLAGDRAGITGGNGKLYYLLLKQ" gene 37989..39041 /gene="lpxD" /locus_tag="DP116_05280" CDS 37989..39041 /gene="lpxD" /locus_tag="DP116_05280" /EC_number="2.3.1.191" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319970.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-3-O-(3-hydroxymyristoyl)glucosamine N-acyltransferase" /protein_id="PRJNA477356:DP116_05280" /translation="MKFSTILEKFGDAASSNSFCSNKENDPEITGVTAVDEATTGTLS YIEGAKFASMVSKTNASALILPADETLQAQAQERSIVWIATPEPRLLFAKAIALFYKP WRPTPEIHPTAVIHPTAKIGKQVYVGAHVVIQQGVEIGDDVCIHPNVVIYPDVKIGDR TTLHANCTIHERTRIGADCMIHSGAVIGSEGFGFVPSSTGWVKMEQSGYTVLEDNVEV GCNSAIDRPAVGETRIGRQTIIDNLVQIGHGCQIGAGCAIAGQSGMAGGVKLGNGVIL AGQSGISNQVKIGDRAIASAKAGVHSDIAPGEIVSGSPSLPHKQYLKVSAILARLPEM YQTLRQLQRQINNGNG" gene complement(39436..>39870) /locus_tag="DP116_05285" CDS complement(39436..>39870) /locus_tag="DP116_05285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012594639.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S1" /protein_id="PRJNA477356:DP116_05285" /translation="ALSAQGKLDEAIASYQRALQIEPNLALAHHNLGLALKNQGKLDQ AIACYQRALQIDPNYATAHNNLGIALKNQGKLDQAIASYCKALQIDPNYTDAHYALGI ALYDQGKLEEAIAELEIAVRLDPSSTQYRKNLENYKNEKKGF" BASE COUNT 11536 a 8135 c 8504 g 11695 t ORIGIN 1 gtgtaggggt gtagggggaa gaaggggatt ttctttttgt gcaacgccac atcctatctc 61 ctatgtcctt cctagttgtt gacttaatct cattgcgccg aactcgcttt tttactttgt 121 atatgcaaaa ttccaactaa taggaaatat catgtcaagg gtaagtgata attcaagcgt 181 aagggagcag ttttttgata atgataaacc acaaatttgg tggttaattg ctgtaacctt 241 gccagtcata gttgctactg gagtcctaag catagccaaa atggagcaag tgaaaaagtt 301 caatacgccc gtatccagtg cgccaatcac aagtataaat gctttaggac gtttggaacc 361 ccaaggggaa gtatttaaac tgtctgcacc agtaggagtc caaggaacgt cgcgagtcga 421 acaagttttt gtcaaacaag gagagcaagt caaaaaaaat caaatcattg ctattttgga 481 taacttttcc agtagccaag ccgcaatgga agaagctaaa gcaaaagtcc aagaagcccg 541 tgccaattta gcaaatgtca aagcaggttc accaagagag attgaggctc aaagagctgt 601 aattgctcgc ttggaagcac aattacgtgg ggaactagat gctcaacaag caacaattac 661 tcgcctgcaa gcagagttag gcggggaaaa aactgtttta caagcaacaa ttaatcgtat 721 caaagctgaa gtccaaggac aaagggatgc tttccaagct acggtttcgc gtatacgagc 781 tgaacaacgt aatgccgaag tcgatgctca acgctatcaa atgttatacg cagaaggtgc 841 gatttcgcaa caagaacgtg atagaaagca attaggcgca gaaacttcga ctcagcaact 901 tgtagaagct caagccaacc aaagaaaaac ggttgcaact ttgcgacagc aacttgctga 961 agctaaagcc aatcaagtca aaactatagc aactttgcaa cagcaactgg ttgaagctag 1021 agttaaccgg aacaaaattc tcgcaacttt gcaaagacaa attgatgagg aaagagccaa 1081 atttaacaga cttaaagaag ttcgtcccat tgatttacta gttgcacagg cacaagttag 1141 caatgcaatt gcggctctga aaagagcaca agctcaattg aatttaagct atatcaaagc 1201 accaatctct ggcgaaattc ttaaaattca tacgaaagct ggagaaagca tcaatacaaa 1261 tggcattgct gagattggac gaacggatca gatgatagtc atagctgaag ttcccgaaga 1321 cagtattagt aaggtgcgcc ttggtcaaca agccgtaata accagtgata atggagcttt 1381 tagcggagga ttacaaggaa ctgttgctga aattggtagg aaaattggca aaaaagatgt 1441 
cttaaataca gacccagcag cagatgttga tgccagagtc gtagaagtga aaattgtcct 1501 gactccacaa gatagtaaaa gagtgacagg cttaacttac gccaaggtgg ctgttgaaat 1561 taatctttga aacatcaatt ttggctcagc taataactaa caactgagga caaaagataa 1621 tgaccaaaat tcctctggct tggttacaac ttaagcgaga aaaaactcgt ttagctgtag 1681 ctctggcagg aattgccttt gctgatattt taatgtttat gcagcttggt ttccgggatt 1741 ctctatattt tagtaacgtc cgttttcata cgagtttaca aactgacatt gttctcatca 1801 ataatcaatc taatgcttta ctttccatga aaggtttttc tcagaggcgt ttatacaaag 1861 ctttagatgt acaggcagtg caaagtgtgc atccaattta tctagattat actgcctgga 1921 aaaatccttt aacgggtcgt tctcgtaatc tcctagttat tggagtaaat ccagccgtca 1981 atgtcttaga tttacctggt gttaaagaaa atttagataa actgaaactg cctgatgttg 2041 ttttatatga ccgttcttct agacaggaat atgggcctat cgctactgag tttgaacaag 2101 gaaaaactgt agcagcagaa gtcaattctc ggagaatgaa agtaggggga ctatttacat 2161 taggtgcttc cttcggagca gatggaaatt taattaccag tgatttaaac tttttgcgaa 2221 tatttcccct tcataagcaa ggaataattg atatcggtct gattagatta aaaccaggag 2281 cagatgcaaa cactgttgct caaagtttga gaaattattt acctagagat atcaaagttt 2341 taactaagca agaatttatt gattatgaaa gatattactg ggaaagtggt acagcaatag 2401 gttttatttt cactttaggc acaattatgg gttttatagt aggaactgtc attgtctatc 2461 aaatacttta cactgaagtg gcagaccatt taggtgagta tgcaacccta aaagcaatag 2521 ggtatacaca aaattatttg ttaattgtta tcctacaaga agctttgatt ttagcaattt 2581 taggatatat tccaggattt gctataacta tgtttttgta tgctcgcgca agggatgcca 2641 cacttctacc agttttgatg agtgtaggac gagccgtaat ggtactcatt ttgacttttc 2701 ttatgtgctt tatatctggc actattgcta tgcggaagtt acgttctgct gacccagcag 2761 atatcttcta aaaaagtgtt tgggttttca aaaaaataac attttgtcag atataaaatt 2821 cttattttat tttttaacag aactccgtac agtttctgtt aatcattaag agttaagcgt 2881 tcccctgtta agagttccct attccctagc gagacgagtt cgtatggctc cgccacgcaa 2941 gctaacataa accaaaccgg attcctatat attctaggag tttttgttat caaaaaattt 3001 tagcaacaca agtacagaac ccatctacca ctctactgaa ttatgtggat tgttcaactt 3061 gctcttcgtc gtccttacac cttcgtcatc gccgctgttc tcattctcgt tttgggcgtg 3121 gtgactatca 
ctcgtatggc gatagatatt ttccccgaaa ttaacattcc cgtcgtcagt 3181 gttatttggt cttacaacgg tgttgcaccg gaagaaatgg aacagcgcat cgtcacagtc 3241 agtgaacgcg cttttacaac cacagtcaat gacattgaac acattgaatc acaatcactc 3301 aatggtgtca gcgtcatcaa agtgtttttc cagcccggtg cgaaggttga agccgctgta 3361 gcgcaattaa cctctgtcac gcaaacaatt ttgcgcgtcc tacctccagg tatcacgcca 3421 ccgctgatta ttcgctataa cgcttcgagt gtgccgattt tacagttaag tgtcagcagc 3481 aaatcgcttt cagaacagga actttacgac aacggtaata actttctcag aacacagtta 3541 gccacagtac agggtgcttc tgtacctctg ccctacggag gtaaacctcg gcaaattatg 3601 gtagatatcg acccgcaggc gctgtatgct aaaggacttt ctgcgactga tatcaccact 3661 gccattagcg cccaaaattt gattttgcct gccggaagtg caaagttggg agaacgtgaa 3721 tattctgtgc gtcttaatag cagtccggat gctgtagatg ctctcaacaa cttgccgatt 3781 cgagaagtta acggtactgt tctctacatc cgcgatgttg ctcaagtcca tgatggtttc 3841 gctgttcaaa ccaatatcgt gcgtcaaaat ggtcggcgtt ccagcctcat cactgtcctg 3901 aaaaatggta gtgcttccac attggatgtc gtcgcctgcg tgaaagaagc gataccacgc 3961 atcgctgcta ctctacccaa ggaactgcac ttagagttat tgtttgacca atctttattt 4021 gttaaagctt ctatccaagg ggtattgaca gaaggactca tagctgcatg cttaactggc 4081 acgatgattt tattattctt aggcagttgg cgcagtacct tgattgtcac tatctccatt 4141 cccctgtcaa ttctctgttc catcatcacg ctgcggttgc tgggacaaac gctcaatatt 4201 atgaccttgg gtggtctttc cctggcagtt ggtattctgg tggatgacgc caccgttgaa 4261 attgaaaata ttcaccgtaa cttaggacaa ggcaaaccgt tacaccaagc catcttggat 4321 ggtgcgcagc aaatcgctgt tccggcgttt gtttcaactc tggcaatctg tattgtgttt 4381 gtgccagtcg tctttttaac aggagttgca cagtcacttt tcatgcctct ggggatggca 4441 gttgtgtttg caatgcttgc gtcttatgta ctttcccgga ctgtggtgcc tatgctgtca 4501 cagtttctcc tcaagcatga ggtacatctt tatactgatc atgaacattc caatggtaat 4561 ggtcaccata cagataccca ttcctcaagc gctggtaaag acatcttctg gcgcgttcac 4621 gaacagttca atcgccaatt tgaaaaattt cgcaaccgtt atcgtcggtt ccttgcttgg 4681 gctttgagtc atcgccgcca agtgtttgtt atgtttggcg cgttttgggt gagtggatta 4741 gttttgttac cattcgtcgg tcaagacttt ttccctcaag ttgatgctgg tcagtttcgc 4801 ttacacgtcc gcactcctgc 
tgggacgcgc gtagaagaaa ccgagcggat ttttactcag 4861 gtagagaatg tgattcgcca agcaatacca gcccaagagt tagaaattat tctggataat 4921 atcggcttac cagttggtgg gatcaacctt gcttttagtg atagtgccac aattagtgct 4981 gcggatggtg aaattctcgt agccctcaaa gaaggagagc atcaccccac ttggcagtat 5041 gttaaacaac tgcgtcaaaa attagaagcg cagttcccgc agttgacgtt ctttttccaa 5101 cctgctgata ttgttactca aattcttaac tttggtttac cagctcctat tgatgttcaa 5161 gtcataggtc caagtcgcaa ccgtaaggca aattatgcga tcgccaagca aattgaagcg 5221 caaatcgcta aaattcctgg tacagcggat gttcacttac atcaaattgt tgatgcacca 5281 gagttacgta ttaatgtaga ccgcactcag gcacaacaaa taggcttaac tcaacgggat 5341 gtggcaaata acttgctgac gtccttaagt tctagtggtc aaacttcccc caactattgg 5401 ctagatcctg ttaagggtgt gagttatctg gtggcggtgc aagtaccgca atataaaatc 5461 aattctttag aagcactcca gagtactccg atcacgaaca actcgacttc accgcaactg 5521 ttaagcaatc ttgccacagt gcaacggcgg acaacaatgg ctgtggtgaa tcattataat 5581 gtgcagcctg tgtatgatgt ttacgctaat gtccaaggtc gggatttagg cgctgtgtcg 5641 cgagatatag ataaggtttt ggcacagttt cgccccaagt tagcccgtgg tagttcgatt 5701 gtggtaaggg gtcaggtgga aactatgaat gcttcgtttt tgggattggg agtggggttg 5761 ttatttgcga tcgccctagt ctactgcttg atggtagtga atttccaatc ttggatcgac 5821 ccactgatca ttatgatggc actgccaaac gctttggctg ggattatctg gattttgttt 5881 attacgaata caactttcag cgtaccttct ttaatgggtg caattatgtg cattggagtt 5941 gcgacagcta atagcatctt gcttgtgacg tttgcaaatg agcaacgtct ggaggggaaa 6001 aaagccctct cctcagcttt agcagcaggt tacactcgtc tgcgaccagt attgatgact 6061 gcttctgcaa tgattatcgg gatgctacca atgtctcttg gttttggaga aggtggtgaa 6121 caaaatgctc ctttgggtcg tgctgttatt ggtggattgt tcgcagcgac ctttgccaca 6181 ttgatttttg tccccgtagt ctacagcatt ttgcgacgga aacaacccca caaccttgat 6241 gatacgatgt tgccttcatt gatggaagca gatggagtta tgaggtaatg ggcaaaagaa 6301 aagtaaaaag taaaaaattg tgactctttc gtacttctcc aaatccttac tatctttcaa 6361 ttgcaactcg atatgaatac tcatcaatct cccaacccta actcttttga cgagttaact 6421 tctagagata aaagacgcaa aagaggttat ctcggctcga tgacactcat tgctggtatt 6481 ggtgcattag taactggact tctagcaatt 
ggcattttgc ctcgtctcca acagcgtgca 6541 gaattaaaag cacttgccaa gtctgcacaa acagatgttc ccactgtcaa ttttgtcaag 6601 cccaaacgtg ctgctaattt tacggtttta tccttacctg ctagtatcca agcgaaccaa 6661 gaaacatcag tctatgcccg aaccaatggg tatttgcggc gacgaactgt agatattggt 6721 gacaaagtac aagcagggca aatacttgca gaaattgaca caccggaaac cgaccaagaa 6781 gtggcacaat cacgtgctga gttagcaaga gcacaagcaa atctcgctca agcccgcgct 6841 aacttagccc aaaagcagag caatttttct gaggctaaat caaacttagc agccagacag 6901 gcagaactga tgcaagcacg cacaaattta gaattagccc gtcaaacctg gcagcgctgg 6961 caagagcttc agcagcaagg agcagtgact cagcaagctg ctgacgaacg caaaacgtca 7021 ttcagtgcga atcttgctaa tgtcgatgca gtgaaagctc gtgtcaattc tgatcaaaac 7081 agtgttaatg cggcgcttgc tagtatcaac tctgaccaag caaacgtcaa cgcatatctt 7141 gccagtgttg ctgctagtcg cgcaagtgtg gaaaaatcgg tggttttgca atcttttaaa 7201 cgtgtgacag ctccctataa tggggtgatt acaggacgca atgtagagac gggggggtta 7261 atttcggctg gtagcaatag taactcaagc aatgcatggc tatttaaaat agctcaaacc 7321 aacagcttgc gtatccgcgt taacgtacca caaactttga ttcaatccat tcggcaaggt 7381 caaacggctc agattcacgt ccgcgagtta cccagcaaac cgtttacagg caaagttgtt 7441 cgcacttctg actccctcga ccccaaaacg aatacgctgt tgacagaaat ccaagtgcca 7501 aatcctaacg atacagtgcg acctgggatg tatgctcaag tcacatttac aactacgcgt 7561 atgaacccac cgatgctgat accagctaat actttggttg ttaattcgga ggggactcaa 7621 gtagctagtg tgacaagaga ccaaactgta cactatcaca aagtagagct tggtcgagat 7681 tatggtactg aagtagaagt gatttctggt ttgaacccaa atgaatctct gattactaat 7741 cctagcgatg atttaggcga gggtgcacgg gttcaagctg ttgctgttaa gcctaagggg 7801 aaaagttagg tatttgcaac cgcagatgca cgcagataaa tatagataat gtatctgtgt 7861 gcatctgaag gacactgctg tgcagtacag caaagcgaaa tattatgtat gaatggtaac 7921 atttgtgtgt tcaacaactg ttgtagagat aggttaatat ttttgaagca gctggtcttg 7981 caaagacaag ccattttttt atagtattgc tagcttatgg tgtcttgtgg ctgtaattgg 8041 aagctcttgt gatctgactt taacatagga cttgcagaca ctcctgtgat cagaaagtga 8101 gaaccaaaat gcattcatgg taaattactg acaaataatc agatcaactc caccatgtag 8161 tcctattgca aagccaaaag tcaagagtcc gaagtcatga 
gctatttgct aatgcctgct 8221 gaataagaac taatgactag actaggtaaa aattcatggc aagaaaactt cgtccctcgc 8281 ctaaaatcct cactaaacct gataacgaag acgacttcta tcacgaacca ggaaccatac 8341 caggaaccat cattattccc acagatgctt caccaccaga gatgacattg attgactata 8401 gtactggcga tgttatccgt aaagaaatca aaaccccgga agagtgcgct gactatctag 8461 atacagcatc tgtttcttgg gtagaagtcc aaggtttagg ggatgaagac attttgcgac 8521 gactggggaa ggtgtttgat ttacatcctt tggttttaga agatatagtc aatttggtag 8581 agcgtccaaa aatagaggaa tatgaagact atttactaat tatttgccga atggttgtgc 8641 caaaggaaaa aagatatgga ttttatagtg agcaagtgag tctggtgtta ggaaaaaatt 8701 acttgcttac agtacaagag gaaccagaac atgattgctt ggaaggagtg cgagcgcgaa 8761 tttgtaacaa caaaggtatt atccgcaagc gcaaagcaga ttatttagct tatagtttat 8821 tggatgcaat tattgatggt ttttttccgg ttctagaact ttatggtgag cgaattgaag 8881 agttagagga agaagttatg gttaaccctt cgcgacaaac actacaaaaa atttatcaag 8941 taaggcgaga attgttacaa ctacgtcgct acatctggcc tcaacgggat gcaataaatt 9001 ctttgattcg tgatggcaat gaactcatca gtgaggatgt gagaatttac ttgcgagatt 9061 gttatgacca tgcagtgcag gtaatggata tgctagaaac ttaccgggaa cttgccacag 9121 gattgatgga tgtttatctt tcggcagtaa gtaacagaat gaatgaaatt atgaagtttt 9181 tgactgttat gtcgtcaatt tttatccccc taacctttgt tgccggaatt tatggtatga 9241 acttcaatac tgacaaatca ccatataata tgcctgaact caattggtat tggggttatc 9301 ctatgtgttt agcattcatg gcagcagtcg cctttgtctt ggtcttcatc ttctggcgaa 9361 gaggctggtt tcagaatttc tcacaaatta attctgatta taagatttga aattgccact 9421 tagttaagtt tgaaaagtga ataaaaggga gttcatatac taaaaatatg ggtaatgctg 9481 acaacaattt actgattctg acgctttata taattggtat tacttacacc attaaccaaa 9541 tggtcgaatc aattgcagaa caagtaaaga tagaattcaa taaagcactt gtagatgaac 9601 aactgaaaga ccaaagtctt caagataaga ttgggattag tttcggagga ctggcaggta 9661 cgtttgacat taccaaacca caatctttgt cgattaacat tgaaaataaa tctcaggatt 9721 tagcaatata cgttgattgg gacaattgcg cttttgagga atttgacgga acctcgaagc 9781 gcgtgattcg gatgtcacca gatataaccc gtgatttagg tgtattccaa agtcccagtc 9841 ttatcgtccc taaaaaaaca ctgaaagaat cagtttcatc tgaaagtgtt 
ttccaattcg 9901 ataagttatc tgcaacatac acagcaacaa aaaattcaat agccaatgtc ctcaaatgga 9961 aaacaagccc cattaggtca caaagaattg aatttaacag atttatgagc agaaaacgca 10021 actttgattt ctcactagat ttagtcttcc gactagccga aacaaatttt ggtgttgctc 10081 aaggtgtgaa tgcacctccc ctttgcattg tcaagtgtcc cttcactgtt agaaaactcc 10141 cttggacgta tgccttgcct tggaataaaa ggcgttgaca tcctcccact actgagctga 10201 gtaccgtttc agtaggagcc ttctggcgga actaggtaaa cttaaacgac tggttcgctc 10261 attcaaagac tacactggat attaggagag agtaggcgca ggtggttaca gactagtttc 10321 gttgagactg gtttgaggaa agtccgggct cccgaaagac cagacttgct ggataactcc 10381 cagtgcgagc gatcgcgagg atagtgccac agaaacatac cgcccttttt cagtgaacag 10441 tgaacagtca acagtgaaca gtcagtaact gataactgat agctgataac tggtaactga 10501 agagggtaag ggtgcaaagg tgcggtaaga gcgcaccagc atgatcgaga gatcatggct 10561 cggtaaaccc cggttgggag caaggcgaag gaactatggt tggtctttta ccagttccgc 10621 tctcataaga gccgctagag gcgtttggta acaaacgtcc cagatagata accgccctct 10681 taaatgtaga aacgtaatat gttacgtttt tacaaaagag aacagaaccc ggcttacgtt 10741 ccgactctct ccctcttttt tgtcctttat aaagttaaac cccgtcccta ggctcggtat 10801 atccttgtat tattggtctt agttttttgt taacaagcaa tgactaatga ctcatggctc 10861 atgattcatg actaataact aatgagcatt gcttatccac gccgtcaacg acaactcgaa 10921 agcttagtca gaaaattagg tctccagaca gatgcgccga taaaatggca actgctggac 10981 ttagcgctga ctcatcccac tgtctccgag tcagcaaatt atgaacattt ggagtttgtc 11041 ggcgatgcag tcgtgcgatt agtggctgct attgtgctat gggaaaatta tcctaattgt 11101 ccagtgggag attttgcagc gattcgttcg gtgttggtga gcgatcgcat tctcgcccaa 11161 ctagccagag aatacggctt ggagttacat ttactcgttg caggcagtgc gactgctgat 11221 aaagttggtc aagagtcaag actggctgat gcctttgaag caattttagg tgctctttat 11281 ttgagtacac acagtctcga acttattcgc tcttggctag atcctcattt caaaaaactt 11341 gcagcagaaa ttcgcctcga tccggctaga tttaactaca aagccgccct gcaagaatgg 11401 actcaagcaa agtataaagt cttacctgaa tatcgagtca tagaaatgaa tcaaccgcag 11461 cacaatcaag agcgtttcct ggctgaggta tggttacacg gagaaaagct cggtcaagga 11521 aagggacgct ctatcaaagc tgccgaacaa 
gctgctgcaa aagttgcttt tttatcactg 11581 gatcatcagg aaaagccctg aaatcaaatg gtagatgata agctaataac taaaactagg 11641 ggtgtaaggg tgtaagggtg tgggggtgta aggggtaaga gagttttact tcttctcctc 11701 tcccttttca agattgccgg aaccgacgag tcatcttatt cccttacacc catacacccc 11761 tactccccta tacccctagt ttgttgacta atgattaaaa aaattaaaat tgctgtagtt 11821 ggtgttggac gttggggcgt tcatttgcta agaaattttt tagaacatcc gcaagcgagt 11881 gtagtggcgg tagtcgatcc gaacccagag tgtttggcgg cggtgaaacg gcagcataat 11941 ttaggtaacg aaattctact gacaacagag tggcaagcaa tacagcaagt ggaaggttta 12001 gaagccgttg ttattgccac tcccgccagc acgcattact ccttaattac cgatgccttg 12061 cattggggtt accacgtttt agcagaaaag cctttaactc tcaacccagc agaatgtcga 12121 gaactgtgcc aacttgccca aaaacagcaa cgacaactga tagttgacca cacttactta 12181 tttcacccag cagtcacgcg agggaaagcg gttgtacagg cgcgtcaatt aggtgattta 12241 cgttatggtt acgctactcg cactcattta ggaccagttc gccaagatgt tgatgcactc 12301 tgggacttag ccattcacga tattgccatc tttaattctt ggctaaacca aattccagtg 12361 aaagtccaag caacgggtac agtgtggctg caaccaagtg tagggggagt agggggagca 12421 gaggagcaga ggagcagagg agcaggggga gcaggggaag caggggaaga caaagaaata 12481 acatctcctt cactccctca ctccctccct ccctcacttc aaaaccaagg attatccgac 12541 ctagtctggg ttacactcac atatccgaat aattttcaag cgtatattca cttatgttgg 12601 ctcaatcctg ataaacaacg acgactagga gttgttggaa atcttggcag tttaattttt 12661 gatgaaatgt ctcgtacctc acctttgacg atactgcacg gcgaatttga acagcaggaa 12721 aatcgatttg ttcctgtgaa tcagaagcaa caagtgctgg aaatagaaac aggtgaacca 12781 ttagggcgag tgtgcaatca ctttatcact tgtgtgcttg aaaatagtcc ctccaaagtg 12841 tcctctggtt tagtaggcac gcagttagtg caaattcttg ctgctttgac agaatctctt 12901 aataatggtg gcaagcctgt ttttcttaat gcaaatgaag actgaaattc tacaactttg 12961 cattttgcat actcagtgtt gaattcttta attccttgtt gagagtagac ggaaaacggt 13021 ttttttgtaa agtaacttct tctgctgctt gcttttcccc tcttactgct tgatccccag 13081 aggaaacact gaggttttct aaattttttt ttccttctgc gtctggtaac tgtgacccat 13141 caattaaagg taaagttatt ttcattgtcg tgccttgatc cagacctgca ctttcaagag 13201 taatagtacc 
tcccataagt tccattaaat ttcgtgaaat ggcaagtcct aatcctgtac 13261 cgccaaattt acctgaattt gaacccacca tcacaaaagg gcgaaatagt ttttgctgtt 13321 gaacgggatc aattcctata cctgtatctg taacagagat aatgacttca gatttatcat 13381 ctctacgttg aatttctgtg gagattgtaa tgcttccttg ttcagtaaac ttcgttgcat 13441 taccgatgac attaatcagc acttgtttaa gcttggcggt atcagcatta actggaattg 13501 tctcattcaa ttgaggaata tttaattgca agcctttttg ttgaacatta actgattgta 13561 tgtttataac ttctttaaga atctgccgga gatcaatagg ttgcagaact accgaaagct 13621 ttcctgcttc tattttggaa atgtcaagta aatcattaat aatgcctaat aagtgaattg 13681 ctgcgtcatc cacacgactg agaaactcca attcttcttc tcggtcatcg cacataccat 13741 ctcgaaccag acgaacgcaa ttaatgatgg tgtgtagtgg gtttctcaac tcatgggaag 13801 tcgtagctaa aaactgactt tttacctggt tggcagtttt tgcttctttc catgctattt 13861 ccaattcttc tgcccaagct ttcaaccgtt caaccatttg ctcaagtgct tgtgctaatt 13921 ggttaaactc ccgaattttg aagttatgag ggattggttc tgttgattgg tttaaatgaa 13981 gatttatggc gtagtctcgc agttgttcta ctggacgtgc caaataacga gctaaataca 14041 gcgacgccaa aacacttgcg ccaattaaac ctaatgttaa aacaatgagg ataattttaa 14101 ttccctcaag accatataaa gcattatcta attctgtaac agctataatc acccattttc 14161 cctcttgccc ttctgttacg ggactgggaa tagcagtgta gccagcaagt aattcttctc 14221 cttcattttc aaaaaagaaa tgtagaaaat atttcctccc tgccaaagca gctcttagaa 14281 tttgtttgag tcgcgaagca tctgcatgtt gctttatatt agtcccaacg cgattagcaa 14341 ttgggtgtgc cataactgtt ccatcatcag cgataaccat cgttgagcct gttagcgaac 14401 ccggcttagt tcggccattt tcttgcagta tggtcgcacg aaaaatcaag gcataacgta 14461 aatatcctat actatcgtag atgggagatg ataacagtaa tcgcagttga tttggcaagt 14521 cttttctacc agttgttccg agctttggtg gtaatattgc ctcaacatta atcccatcat 14581 ttggtatggg aaattttaat tcaccaattg gttgttcgcc gcaagtactt gcaacaatat 14641 taccgctttc tggctcagtc agttgaatac actctacttg ccttggtagt cgttgggcaa 14701 cttggttcag aaatcgttgt acttcagtag gcgaacctga ctgaataacc gttgtttgac 14761 ttgcactgag taagtttgct tttaaggcag cgctcgtctc tacaattttt tcacctttga 14821 taatggcact ttctgttaag ttttgacggg cagtttcaag aaggctagaa cgtgcctttt 
14881 tataggctac aatctcccct atcaataaaa caggaactaa cagaagcaag atttttgata 14941 ctaaaatccg acgaaaggat gattgacggg gtttagccat cggcagtctc tcttagtatg 15001 tgccaatcac aacagtaatc ttaagtaatt agagcagcta gtgttgttaa atgataaaaa 15061 tcctatttta taatcatatt aaaaatataa agtaatttaa tatcaaaaag aagcttatga 15121 tacaaacact cacaaaaact acgatccaca gggactgttg ttgcatataa atattacaga 15181 gaaaataggg agagaggagg tttcctacat cactattttt aaagatcagc attttgaaca 15241 gaaaacaaaa actatcagac caacccaatg gtacaacatc gtccacatac cactgtgatt 15301 ttggcgatga gtgcagacgg caaaatagcg gatgtcaggc gatcgcgtgc gtcgcggttt 15361 gcagcatcgc ctgctcgatt tggttccaag acggacattg cacacctgga ggaacaaatt 15421 gcttgcttca gatgctgttt tattcggtgc tggtactcta cgcgcttacg gtacaaccct 15481 aactgtatcg cacccacaac tgctgcaaca acgaactcat gcgggaaagc cttcccaacc 15541 tgtctatata cttattacac attctggtaa cctcaatccg gaaatacgat tctttcagca 15601 gcaagtcttg cgctggttac tgacgacaac agcaggagcg cttttttgga aagaacgtca 15661 agaatttgag ctcattttgg catgtggcgc accaacggca aaagttgaca ttcatacagc 15721 tttacgacat ttaactactt tgggtataaa acgcttagct gttttgggcg gcggtacctg 15781 tgtcgcctcc atgctggaac tgcatttgat atcaaatccg gtagcatctg aatgattaaa 15841 aaaccgcaga ggcgcagagg acgcagagag agaggaagga taaataattc cgatacaaac 15901 ggatttgata tgattgacga attttggcta accatctgtc cgttgattct aggcggttct 15961 actgcaccga ctccagtgga aggtagcgga tttttctttg agttggctcc ccgtctacag 16021 ttgctggaag ttcacacagt agagtctgaa gtttttgtgc attatcggct gcaacgaccg 16081 acagattagt ttacgctaaa taaagtttgt gctaaatttt gtatcttaaa aacctctaat 16141 ggtggaacaa ctgcaacctc gctattctgt cgcttggata aataaaattg ctgaagtccc 16201 ccaagagggg tgggatgctt tagcattgtc actcaaaact ccgtttttgg agtgggagtg 16261 gctgaaaaat ctcgaaacct cccaaagcgc tacagctaac actggttggt taccgaatca 16321 cttaactgta tggcgagaca gaacgctcat cgctgctgct ccactttacc tgaaaggaca 16381 cagttctggc gaatttgttt ttgaccatca gtgggcagac ttagcacaac gtattggtgt 16441 aaagtattac ccaaaaatgc tgggaatgac tccatttacc ccggctgaag gctatcgatt 16501 tttgatagcg ccaggggaag atgaagatga gatgacagca 
ttgatggtgc acgagattga 16561 tgctttctgc accaaacacc gtatttctgg gtgtcatttt ctttatgtag accccgaatg 16621 gcgtcctgtt ctggaacgcc aaggcttcac accttggctg caccacagct acatttggga 16681 aaatcttggt ttcaacaatt ttgatgacta cttggcagtg ttcaacgcca accagcgtcg 16741 taacattaag cgtgaacgca aagcagtgtc caaggctggc ttgaaactac aaccgatcac 16801 aggcgatgaa attcccaagt cactatttcc tttgatgtat cagttctacg ctgatacctg 16861 tgataaattc ggctggtggg gtagtaaata cttaacaaag agattttttg aacagctaca 16921 cgcacattat cgccatcggg ttgtgttttt cgcggcatac acagaacagg atcatcgtca 16981 accagtaggg atgtcttttt gtctgtttaa gggtgatagg atgtatggac gctactgggg 17041 ttcttttcaa gagatagatt gcttgcattt tgatgcttgc tattattcac cagtagagtg 17101 ggcgatcgcc aacggtatcc aagtttttga ccctggtgct ggtggacgcc acaaaaaacg 17161 tcgcggtttc cctgcagctg ccaattacag tttgcaccgc ttttacaatg gtcgcttagc 17221 acaaattctc ggacattaca tcagcgaggt gaatgaaatc gaacagcagg agatttctgc 17281 aatcaatgcc gagttgcctt ttgctcgttc caatccctaa tacgaatttt tgttatgcat 17341 tttcagcaac actgttgcgc tatgagcgcc aaaatgttat gaacaccaaa gaagataaga 17401 attacacttg agaaggggta aaaaacccct aagacaagtt aatttctatg atgattatgg 17461 atttgacggt tgaaaacttg gcagcaattg acaataaact ttctcagcgt catattgacc 17521 ttgaccccgg tgggtatttc attatttatt taaatcggga ggaaggatta atttatgcca 17581 aacatttcac gaatgtgatt gatgagcgtg gtttagctgt tgatccagaa actggcaagg 17641 tcattccagc acggggaaag gtggaacgca ctcacacgac tgtgtttagt ggaagaacgg 17701 caaaagaatt gtgtgtgaaa atttttgagg aaactcagcc ttgtccagtg actcagttaa 17761 gccatgcagc ttatttgggt cgtgagtttg tccgatctga gatttcttta gtgacaggac 17821 aagattatgt tcaggattaa aacagtgaac agttatcagt gaacagtgaa cagtgaacag 17881 tgaacagtaa acagttatca gtcaacagtg aagttgataa ctgataactg ataactgata 17941 actggttaac aagcgaatat catccgttag aagatggctg aaacacttga taaatgtagt 18001 aagtcgccat tgggataacg gtggcggcga ttaaaagtcc tatcacccaa gcagttgcat 18061 tacctttgtt ggtgttggtg tcttctgcag tggcaaagtt actttttacc gggacgttat 18121 tcgtaatttg gggaggtccg gggtcgggtt cgccagataa aacggcgact aggcgatcgc 18181 tagcatctag aaatgcctga 
ttatatttgt caccgtcccg taatggtacc gtcaaagttt 18241 cggaagctac actcaaggca atatcatcgc tcatcacaga tttcacttta tcgccagtta 18301 taattgcgct accgtttttt gcggtatcaa tcattaatat cgtctgattt gcctgtgctt 18361 cctttgtagg aaaccatttt tcaaacagtg ctttggtgaa gttcaccgga gtttcaccgt 18421 agtcaagacg acgaatcgtg acaattctta cttcattacc tgtttgcttt gccaaatttt 18481 ccagggtgct gcttagctta ccttcattta aacggctgat gacgtcagct tgatccacaa 18541 cccaagtggg ttcgccactt gtaatggttg gcatatcata cacaccagtc gccgaagcag 18601 atgctgcaaa cagtaaagaa gccagaatca ccatcactga cggcagaagt agccgttgga 18661 tgcgtttttt caagcttaag actcgattga ggagctgttt catgggattt accaaaacac 18721 acacttgcac aaaaaaaata ctacataacc agtggcaaag tctcctattt gcctgcgtct 18781 tatgtgcaaa gcgcaatagt ggttatgcga acactgctaa taacttctag cttaaatctc 18841 aagtagcata aaatcagtgg ctttttaagt caattgattt gtgttttagt cttccactgg 18901 ttgtaaatcg ccgttcaaaa gctaaataga acaaataggc gaatactaac gatattccta 18961 cactaactag atatagtaac actgtatacg taatggcagg tagttgcaag ttgtgtagaa 19021 agtgatgcac tagcgtcact atcactccgt gagtgagata aaggctgtag gaaaatgctc 19081 ccagtgtgat tgcaacaggt gattctaata cgcgcagccc ttttggcaac gtatttgtct 19141 caagaataaa gttagtgcag taaattatca gacaagccgc agctagcgca gcaaaactct 19201 caaaaatcca tgcatccaat cctaatcttt tccattctgt tacaaaagca atgccagcaa 19261 aaataactgc caacacagcc catggtacag actttcttaa cctgatccaa tgaggctttt 19321 gtgaaaatcc aatttcagcg gctagcattc ctaaagtaaa cgaacctacc aaccaaggac 19381 gacttacatc cataaaacca ttgaataaat agtggggttt gacatactca ccgccctaaa 19441 gggtcggtga ttcttgacgc ttcgttgaca gatgctaccg aaatagtctt atcccgtctc 19501 tgcgtccatt tagagtcgtg gcaatgccct gtcccgactt tgtttatatt ttttgccgcg 19561 ttctcgtctc ggtcgtgttc cgcgccgcaa ctaagacaga gaaccgaccg aatcgataaa 19621 tcaagcttgc cccatttgta accgcactct gagcagattt ggcttgtagg ttcccatctg 19681 ctaatgacgc gaaaattcct gcccagtttg gctgatttcg cctcgcactg tgtccgaaat 19741 tctctccatc cttgaagact aattgacctg gcaagcctac ggtttttgac caatcccgac 19801 acattcaagt cttccaaaac aatagtttgg ttttcactaa caattttggt agacagtttg 19861 
tgcaaaaaat cttttcgcgt atctgtaatt tggttgtgca gttttgcgat ttgaatgcga 19921 gtcttgactc tacgttttga gtctttgggt tgtcgtgcta atttcttttg cagtttgcgt 19981 atctttctgt ctagaaccga gtaactagga ctttcggctt tctcgccatt gctcatcacg 20041 gcaaaagttt ttattcctaa atcaattcca atgctttggt tcttagcatc gatttgaata 20101 ggttcaactt ctaccacgaa gcttaaaaaa tagcgattcg cgcagtcctt aatcacagtt 20161 accgagctag gcgcagaagg caactctctc gaccaaattg ggctgacttc gccaatcttg 20221 gcaagataaa cgcgtcgtcc cttaatcgaa aatccaccaa ttctaaacct tgcagattgc 20281 tgactcgtct tttttttgaa tcgtggcgtc ccaaccttct tgcctttgcg cttgcctttt 20341 agcgaatcaa agaagttttt ataagcaact cctaaatccg ccacggattg ttgcaagggg 20401 atgtttgaaa cctctccaag ccatacgcga gactcagttt ttttggcttg cgtaataacg 20461 agcttttgca agtcgttatt gctgggaagc ttttcagact gcttacaaag tgctaacgca 20521 tcgttccaaa ctacccgaac gcagccgaac aactgagcta aactccgttg ttgttggtct 20581 gtcgggtaga atcggaactg atacctggct ttcatggctt gtgttaagat tggtctgtta 20641 ctattatgta ttggtctaca cgtgatgacg cattgtcccc tggaaatgaa gccagccctc 20701 gttttgttga tctgtattgg cgttacgagt actcgaagta atagggctgg cttcatttcc 20761 tacgtgtaag accccgtgac aagccaattc cggaaagaga ggcacagtgt tacagattta 20821 aaaatgcatt tggtctgcgt taccaaatat cgccgttcag tattcacatc tgaaagtttg 20881 ggattaattg aaaagtcttt ccgcgaagtt gctcaaaaaa tggactttgt agtgcttgaa 20941 ttcaatggcg agagtaacca tgttcacgcg ctgattgagt atccccccaa gctgtctgtt 21001 tctcagatag taaacgcatt gaaaggcgta tcaagtcgca ggtatggaca agctgggtac 21061 aagaaactcc acaaagaagc tttatggagt cctagctact tcgccgtctc tgtaggaggt 21121 gcaccaatag aagtactaaa gcgatacatt agaaatcaag aaaagccgtc ctagaaggac 21181 ggggcttgta tcccaaaatt tttggtcagt ccaattaaaa aagcaatgaa aataactgct 21241 aaccaacccc atagatgaca aataagtaac acttaatcag aaacgactga caaagatggt 21301 aaattgtgga tttcctcacg cacactcata aggattctac cgagatgatt gtgacccgtt 21361 ttatctacac cacaacccca gaaataatcg gttggtgagt tttctacaat caagttatca 21421 ccagtcgtta aaaggatttc tctaatatca ctatgtgtca gaaacttcag tagtacagct 21481 tctcgcatca cttgagtttt gacttcctcc cagtctaaac gaacttggcg 
cgtgcagtcg 21541 cgtcctaatg ccgcagcaag ttctggtgtt tcagcactat ggataagagg tatgatcata 21601 gcatctgagg ttccaacaaa cttttgagct tgataataat gctcaactgt tgaccaaaag 21661 atgtcttgca tgacgattcc atgaggagaa aaattggaga aacagccgta gggctgccaa 21721 accttgtaaa agtaaatggt catttgaaaa ttgctgtttt atacattaat atccactatc 21781 acggtagata ctagcaagaa acagtctgct ttgaattcca ccacagtact tgtctcaagg 21841 tagaagattg ctttcaaaat aaagttctat ggtgaagtca aaagtcgtag cgagaggtta 21901 ataatggaag caaaaactga aaagaccaac caacgtgtcc cgttgattgt tgctggaatt 21961 ttcctaggtg taggtcttag tggcttcttt gatggcattg tactgcatca gattcttcag 22021 tggcatcata tgttgagtaa cgttcgacct ctgaccacaa tgtctaatat agatgtgaat 22081 acagtctggg acggcttatt tcacgctttt gattggatca tgactgtaat tggagtggta 22141 ttgctatggc gggcaggagg gcgtgaagat gttccttggt catccaacat ctactttggg 22201 tcaatactta tcggtgctgg attgttcgat gtcgttgaag gagtcattga ccatcagatt 22261 cttggcattc atcatgtcaa accaggacca aatcagttag cctgggattt gggatttctt 22321 gctttcggtg cgcttcttgt tatcgttggg ttggtattgg tacaaaaaaa cggtaaaaat 22381 tatgaattat gagtgctcaa tgaaaatttc ataattcata acttgtatca tgtccgcaga 22441 tgcttttgaa aaaacaagct gggtttggca gagttcccag tttcaacaac aagttggaga 22501 atggtttgag taccagttat ccagatttga ttgggctttg ccaaagttct cacctcagtg 22561 gtctattagt ccgtggatgc tgaagctgct gaactttatt ttttggctct tactggggtt 22621 atttgtagtt tgggttggtt ggcgtttgtg gcgagaactc cgcccttatt ttaattcttg 22681 gctggctggt cacaattgga ctaattctca agcaaaaact gcagaaagtg aattatctgt 22741 agatcagttg ttgacacgat cgcaagaatt ttcccgtcaa ggtaattacc gtcaggcatg 22801 ccgttatctt tacttcgcta tgttgcagca cttgcatggg caaggaatcc tccctcacaa 22861 atctagccgt acagatggag aatatttgca attgctacga atgtttgcga tttcaatcca 22921 gccctatgaa actttgatga ctactcacga acaattgtgt tttggcaatg ctgaaatttc 22981 ggcagacaat tatcagcatt gtcagcaagc gtatcgggag atttccaata catgaaacgt 23041 tcaaaccgtc tgacttggat tggggcgatc gccttgggag tgatgctact actcagtttc 23101 ttcaccgccc caaccagcag caacatcaat agtggttcca cctacaaccg cgctcccgac 23161 ggttacggcg cttggtatgc ttatatggaa 
aaccaaaaaa ctggtatcaa gcgttggcaa 23221 aagcctttta gtgatttaaa tacagaaaaa cgtcccatta ctttattaca aatcaacagc 23281 cgcttgggcg atgggctgta tgatcacgaa aaacaatggg tagaaaaagg caataatttg 23341 gttattttag gtgtgggggc ggcgagtaca gcagcagact tcaccaccat gcaaaaatct 23401 ctaagaggag atgtcaaaat tgagacgcga agacggcgac gactcaaaac tcaagaaaag 23461 gtttctttgg gcgatcgctt tggtgcagtt gtttgggaag aaaattacgg tcaaggaaaa 23521 gctatttttt ctacgactcc ctacttagct gccaatgcct accaagacta tcaaagtaat 23581 tttcagtatt tatcagattt agtcactcaa aaaggtcatt tattatttgt agatgaatac 23641 atccacggct acaaggaacc tagtgtcaga aaaagagaag gcaaagggga tttcttaagt 23701 tattttacca aaactcctgt atttccagct ttcttacaag caggcatact gctactggtg 23761 cttatttggt cgcaaaatag gcgctttggc aagccagttg cattagatgt gccagtcgtt 23821 gacaacagtg aagcctacat caaagctttg gcaggagttt tgcaaaaagc taaatccagc 23881 gactttgttg tagaaatgat cggaaaagca gaacaattgc aactacaaaa agctttagga 23941 ttggggcaag aattactgga tcagcaaact ataatcaatg cttgggtaca gcaaacaggg 24001 attcctccta cagaactaga agaggtgtta aaacgacaat cccaaaaacg ccacatgagt 24061 gaatcagaac tgttaagctg gttggggaaa tggcaaactc ttcggagaat taaaaattca 24121 taattaacaa ttaccttcta tataaaaatg ctatttgatt tttgaaacac acgtagggtg 24181 ggcactgcta gtcagatcct gcccacttat atgaaggttt tagtggcagt gcccacccta 24241 cactactcta cactactcaa gatttttcat aaatgattta ggattgctat agtattttaa 24301 ataattaggg aaattttcta tagcttttaa ctctgcgtcc tctgtagttt ttaaaaatgc 24361 ttttgtgcct gacgttaatt gtttataaat aataatgagc gaaatcaatc ctgtattaaa 24421 tcgcctcagt caagaactta atcgagttgt agtcggtcaa tccaccctaa tacaccagct 24481 tttgatagct ttgcttgcag gtggacatgt gattttggaa ggagtaccag gaactgggaa 24541 aacactgctt gtaaaagttt tggcacagct tattcaagct gattttcgcc gaattcaact 24601 gacaccagat gttctaccat cagatattac tggaacaaat attttcgatc taaacacccg 24661 cagtttcagc ctgaaaaagg gaccagtttt taccgaagtg ctgcttgcag atgaaatcaa 24721 ccgaactccc cccaaaaccc aagcggcgct gctagaagca atggaagaga cgcaggtgac 24781 actggatggt gaaagtttgg cgttaccaga gttattttgg gtaattgcaa cgcaaaatcc 24841 tttggaattt 
gaaggtactt atcctctacc agaagcacaa ttggacagat ttgtcttcaa 24901 actagtggta gattatcctg accaagctgc tgaaaagcaa atgttactca atcgtcaggc 24961 gggatttgca gcacgacgtc tagatattgc tcgtttacag ccagtcgcaa caatacctca 25021 aattttgtca gcacgacaag tcgtcagaga agttaaagtt tctgaaacaa tcattgatta 25081 tctgcttttg ttagtcagaa cctcgcgaca atatcctgat ttaagtttag gtgcatctcc 25141 tcgggctgct ggtttgtggt tgcagacatc acaagcagca gcgtacttag ctggacgaaa 25201 ttttgtcact ccggatgatg tgaaagcagt tgcatcacca ctactacgtc atcgtttgat 25261 tttgaaacca gaagcgatgt tagatggtgt acaaattgat ggggcgatcg catcaatatt 25321 aaataaagtg ccagtcccaa ggtaaaatat tttttttgca acacaacctt tatggaacat 25381 tgtaaataat ggcacagagt gacgcttatg aacgcattgc caaggctgca ctcaatcaat 25441 atggtgtagt tcagaaagaa cttcggtttc tgggtcatag tggaaacgtg accttttatg 25501 tggaagcacc agaggaaaag tttctcctcc gtatccacca accattttca gggttacagg 25561 atgatatatg gcttagacca gatgttattg aatcagaact tttgtggctt gtggccttgc 25621 gtcaccagac caatatcata gtgcaagagc cagtgcaaaa tctggagggt agatgggtga 25681 cacaagtgtt agcagatgat acccaagatg ttttttattg ctcattgcta cgttggattg 25741 atggctacgt ctcggatact catcgaacac cacaacaagc ttaccagctt ggctcattaa 25801 cggctcaact gcatcgccat agcagtcaat ggaaattacc acaaaatttt gtacgcccaa 25861 tatttgatga aaatcgtctg cgtgcagcac tatcagcatt ttacccagca gtgtcctatg 25921 gcttgatttc accagagcat tataggatgc ttacacaagc aacccaaaag attgaaagca 25981 tgatgaaaac actgggtcaa gcacaagatg tttggggatt aattcatgca gacttacatg 26041 acagcaatta tctatttcac aacgaagaaa ttcgacccat tgatttcgct cgttgtggct 26101 ttggatacta cctttatgat attgctgagt caattcaata tttactgcct caagtacgat 26161 tctccttttt tgaagggtat caaactattc gtcagttacc tgagcgttac ttagaaatag 26221 tagagggatt ctttatcatg gcaataatat ataattactc gtttcatctg aataatccaa 26281 aagaacatga atggatttca aatgatgttc aacatattgc taagaggcac cttcataaat 26341 atcttgaagg tgagtcgttt ttgcttgaat cacaacttgt acctaccgaa taaacccgga 26401 aaaaacaatt agagagtgca taagtgtggc agtagacaac gggtaagggt tcagtaagat 26461 gtagttgtaa attaacaaat gcttccttct aaacgagttt attttttatt gatactggga 
26521 attgcgatcg cctcaacttt agccttcttt atcagcattc aggtaagtct atttataact 26581 ttgctgtttg atatgaccgt gctgggattg atggttgtgg atggtttgca agttcgacaa 26641 catcgcgtgc agatgacacg cgaaataccc tcgcgattgt ccattggacg agagaatcca 26701 gttttactca aggtaacatc agcaaatgcc aacgcgatca ttcaaatccg cgatgactac 26761 ccaactagtt ttggagtgtc tgtccccgca ctacgcgcca ctgttcctag tcagagtact 26821 caagaattaa cttatagcgt ccatcccaca aggcgcggag agttttcttg gggggatatt 26881 caggtgagac aacttggtgc ttggggatta gcttgggatg atagaaagat ttttcacagt 26941 ctaaaagtga aagtttatcc agatttagtg gggttgcgat cgctctcaat tcgcttagtg 27001 ctgcaatcat caggagtcaa ccgtcaatct cgccaattta atattggcac agaatttgct 27061 gaactccgga actatcgcgc tggtgatgat ttgcggttta ttgattggaa agcaactgcc 27121 cgtcgtgtgg gcgcttatgg tacaccacca ttggtgaagg tacttcaaag cgaacaagag 27181 caaaccttgg ttattttgtt ggatcgagga cggttaatga cagcaaaagt acggggtttg 27241 cagcgatttg actggggttt gaatgcgact ttgtccttag ctttggctgg attacatcgg 27301 ggcgatcgcg tgggcgtagg cgtttttgac cgccaaatgc atacttggat tcctccagaa 27361 cgcggtcaac atcgcctgag tcacctgatt gaccgcttaa ctcccattca accagtgttg 27421 ttagaatctg attacgtagg ggcagttaca agtgttgtgc agcagcaaac taggcgaagt 27481 cttgtggtag ttataactga tttagttgat atgaccgcct ctggtgaact tctagccgca 27541 ctcaccaggc ttacacctcg ctacttgtcg ttttgcgtca ccctacgcga tcccttagtt 27601 gaccatttgg cacatactac caccccccct caatcccccc ctttcaaagg gagggaagac 27661 ggagtgatag ctgcctatac tcgtgctgtc gccctagatt tattagcaca acgacaagtc 27721 gcgtttgccc aactcaaaca caaaggtgtg ttggtgttag atgcgccagc caatcaagtg 27781 acagatcaat tagtagataa atatttgcaa atcaaagcac ggaatctgct ttaactcata 27841 tctcgctttt cacctcaccc tcgctttttg ctacgcaaaa atctttccct ctcctttata 27901 aggagagcca gtgcgttggg gagccagtac tgcaggaggg tctccctccg taggtatctg 27961 gcgtccggct ttgccgactt gaagcacctg gcgtgaggga tgcccgacag ggcagggtga 28021 ggttccgtaa gtcctgcttt tgaatcagta tcagttatca ggtttcactg ctttaaactt 28081 gcgactacaa taagccacca aaagcatcaa taatcccatc ccaacaagat atttaaaaac 28141 ttctggaaca ctaggatgag gagagaaaaa gccttcaata 
ataccagcaa gaatcaacat 28201 tggtacaata ccaaatatca actgtaccgc ttgagaaccg tagaatttca gtgcatctac 28261 acgacgatat ttaccaggaa ataaaatagc tcttccaagt aaaaaacctg caccacctgc 28321 aaaaaagatg gctggtaatt ctagggaacc atgcggaaaa acaaatgccc aaaaaggata 28381 agcaagatga ttttgaccca ccaaggtagc aatagcacca atcgacaagc cattaaaaat 28441 catcaaatag gctgtaaaaa ctcctgcggt gattccacca gcaacagcac caaaggatac 28501 tgacaggttg ttaatcataa taccggtaga cgctaggggt tcaatgccga caatagaccc 28561 catccacaat ttctgctcgt ctcgtaccaa tttgatcagt ttttctggaa ctacaattga 28621 cagaaaagtt ggatcttgcc aagcatacca ccaagcaatg attgctccta ctaaaaatag 28681 ggcaatggca gttgctgtat atggaaatgt ctgttgcact acggctggta atccccattt 28741 gtaaaattgc actacagctt gccactcttg tcgtcgtgaa ccttggtaaa tttggctata 28801 acctctattt gttaataaat gtaaattttg tattaaaata ttacccaatt gttgagtacg 28861 agcgcgagcc aaatctgcgg ctactgaacg atataaactt gctaattctc taatttctgt 28921 agcccgtaga gactttagtc cttttttctc tacctgcctt agtaaggcgt ccaatcgctg 28981 ccaatttggt tctcgccgcg cgatccatcg ttgaatattc atgaattttt agaatactga 29041 gtattaactt atcctaaaat agcctcacat ttaccgccct actttacaag agagacttag 29101 acatcatgtc atataacatt ggttctccga gtccgataca acctctcagt cttggaaatg 29161 ttgtcagcgc aggattacgg ttatatcgct ctcatcttaa ggattatttt ttactcgctt 29221 tgaaagctta tgtatggcta cttgttccat tttatgggtg ggcaaagttt tatgccttat 29281 cagcgctgat ttcccgatta gcttttggtg aattggtgaa tcaacctgaa agtatttcgt 29341 ctggtcagcg ttttgtcaat tctcgattat ggcagttttt tattacgata ctgttgatgt 29401 ttcttttgat agtggggatt tatatcggtg ctgtaatttt agctgtaatt ttagctgtaa 29461 ttttaggttt gattttttat ttattgggag tgccatttgg cggaatagct cagcaaggag 29521 atacaggtgc cgttcttata acagtggtca gtcttattat aataatagca tttttaatag 29581 catttttatg gttgttaatg cggttttttt tagtggatgt gccgcttgct attgaggata 29641 atattgatgc taggtcaacg attgctcgaa gtagggagtt aactcaaggt tacgtctggc 29701 ggattctttt tatttcgttg gttgcctttt taattacact accttttcaa attgttgttc 29761 agattattac cactattatt ccgttgattt ttgcaccttt agtagaacaa aattcattag 29821 tttttagtac tattgttttt 
ctgttaaatt tggctgtaag ttttgccagt ggggcggtcg 29881 ttttaccatt ctggcaagca attaaagcgg tgatttacta tgacctgcgg agtcgtcgtg 29941 agggattagg cttgagatta cgcgaccatg agatttaaac gtcagcaaaa aaattattat 30001 gcatatattt aatcgtgtta aattcaggac accagaaagt gtagaactgg agtttaccct 30061 tgcgggaatt ggtaatcgcg cttgggcact actgattgac tatctcgttt tgagtgtgat 30121 attgatactg ttccttatcg cttggataac tgttttcatt caactggctg atttgtggaa 30181 atttattttt cgtgatcaag caggtttttg gctggtggcg atcgcattcc ttattggctt 30241 cgctatttac gtaggatatt ttgttttttt tgaaactcta tggcaaggtc aaacccctgg 30301 taaacgcttt gcaaaaattc gtgtggtgcg agatgatggt agaccgattg ggctgcaaca 30361 agcaacacta cgtgccttac tgcgaccatt tgacgaagtt ttgtttattg gggcaattct 30421 tatcatgttt agtaaccaag aaaagcgttt gggtgattta gccgctggca caattgtgat 30481 tcaaactcag acacccacca catcaacgac tttgactatt tcagaacagg caaaatctgt 30541 gagcgagcaa ttgctacaaa ttgccgattt gtcagctatg ttacctgatg attttgccgt 30601 tattcgagaa tatttgcaca gacgccctgc aatggcacca aaggcaagaa cctcagtagc 30661 tctgcaatta gctaaggatg tcaaagctat tattcactta gaaaatatac cagaagctgt 30721 caccccagat gtgtttttag aagctattta ccttgcatat caaaaatttt caaattttag 30781 tcattagtta attgatgaac taatgactaa aatctaaaat tggccagtct ggttggcgat 30841 aatagcgcgt catgtcgtca aacttccgta gttcggcacg gtttatgctc aattccaagg 30901 aattctccgc ccattgctga ttcaactgag aaaacatggt agcaaggttc tcgtggtcta 30961 gattgggcgc aatttctcca aaatagccca gtgtcacatg agcgctaaag tgataatgtt 31021 gctcaatccc caatgccatc aagtgggaat tttgataaat tgctcgacgc aagttaacaa 31081 tttgttcgta gctagcttca ttttgaggta ccaaacaaac gcctatagcc cttggcatca 31141 ctaccaatcc ctgcatttgc cagcgaatgg ggtaagtttg cgcctgagtc gattgttgat 31201 actgcctaaa gatgtccaca aagcaagaac ttagctgttc ttcaaatttt gggttttttt 31261 cgcgagcatc atagtaagca ctatcccaaa tcaagtccgc caaagtcaaa tgaaagcttg 31321 caggaggtac aggcacgatc caatcagagt ttactcttaa ctgtaaaagt tcctgctggt 31381 aatcctgtaa gtgtgcgtag aaagtagagt tgtctgtttc ttcttcccaa ggtggggtga 31441 tcactgtata gccaggaaaa ggtactgctt gccttccttg gtctgaatgc ggctgaaatt 31501 
tagaagatgt ctggatatgc tggatctgag ttttgtacgc ttcgcttagc gtcatccgcg 31561 ctaaacgatt taagtaaatt tgatatttgt cgtccaacgt tcatatcctc gaactccacg 31621 cttattgatt gtaggcaaaa tcacctgcat taaatcaggg atttagaaag tatttagctt 31681 ttttgaggtg gtagggtgat gaggagaata aggagcacca aagaaatatt ataaataatg 31741 actactttcc cctgattgta cctactttct ctaatttctc taacaacagt gaggttccct 31801 atgccagtct acttttactg gggtgaggat gattttgcga tagaaaaggc gatcgctctt 31861 gtgcgcgatc gcgttctcga tcccctgtgg acaagtttta actacacagt actcccaaac 31921 gatttagctg atgcgccgat tcaaggatta aatcaagtga tgacaccacc ttttggcact 31981 ggcggacgtt tagtatggct tgccaataca accgtttgcc agcaattctc agaaaatgtg 32041 ttatcagaac tggagcgaac tttgccagtc atccccgaag attcatattt gttactcacc 32101 agtagtcaca aaccagatga gcgtctcaaa tcaaccaaaa ttttgaaaaa atttgctgaa 32161 ttcaaagaat tttctttaat tccaccttgg aagacggaac tactaatgca gtctgtcagt 32221 caagctgctc aatcagtagg agtgaaacta actcaatcga gtgtagagat gttggcagat 32281 gctgtaggga ataatacacg ccttctctac aatgagttgg agaaattacg gctttacgct 32341 gaaggcagca atcaaccttt ggacacagac gccgttgctc agttagtccg aaatacaact 32401 caaaatagcc tacaattagc tgcaacaatt agaacagggg acacagctag agctttagcg 32461 atattaatcg atcttattaa tgcttgtgaa cctcctttgc gaatagttgc caccctcatt 32521 ggtcaatttc gcacttggtt atgggtcaag ctcatgatag aaagtggtga gcgcaattca 32581 caagttattg ctaaagctgc ggagataggc aatcccaacc gcatttacta tctacaaaaa 32641 gaaattcaat ctgtttctgt acagcaactt cttgccactt tacctatatt gttagattta 32701 gaagttagtc tcaaacaagg agcttcagat atgtcaacac ttcagactaa agtcatagag 32761 ctttgtcaag tctatcaaag aacttaacaa gaaaaacaaa cattttcttg tttgagtgtt 32821 ttctggaact tctgactctt aaaataattc acagtgatta agatatcccg actgtagttg 32881 tttcatcaaa caaatgaaaa gatattacca tcgctttttc cgacttagtc taattcggat 32941 acttgcccaa tctttgtttg tgggcaccgt tgcaactgca agtttggttt ctagtacttt 33001 agttttgagt tcaaaagccg atgctcaagc tgcacaagca gtgaatcctg gtgaacttag 33061 aaactatgct agagctatgt taaaaatgga acctgagcgt caacaagcct ttgacgacat 33121 taaaaaaatt atgggtaccg gggaggttcc aaagattgtt tgtaacgata 
acaacagctt 33181 cagtagtctt cctggcaaag caagagaaat tgctgtaaat tactgtcaac gctatcaaaa 33241 agctgttgaa gataatggtc tttctattga taggtataac accattacga cacaggttca 33301 aggtaatgat gatttaaaac gaaagatgta caatgaatta cttcgccaac aaaaaatgcc 33361 taaatccccg taaatcaaac tgaaacaatc aaaaaaatca agaaaaattc tttctcctgg 33421 ttaaaactgc tataagcatt ttagagcttg tcactaatca aaaaagggca aacttcatag 33481 tccatagtct tatgtcgaac cttttggtaa aagaagatat agcaactgta tgtcatactc 33541 aattgcaatc tgtgatgtca ctttgagaca taagatttcc acactacact acgttcggat 33601 cgctgcggat agattacgct ttaacaatgc aacttgatat cagtttctgt agtattgaac 33661 tatctagata tctcaaatcc aaaatttgag ttaatggttc tgaatgggtg tataatcatc 33721 tatctgtact gttgataatt aattttagca atacgaagat tctggaaaga aaaagattcg 33781 cattgcggca cattagtcta taagatttgc aagttttttt gtcagtaacg atttaacttc 33841 tagatatata aaaaaataga aaaatgtcat attgagcgtt atgcaggaaa gcagagtctc 33901 tcggaaattc caccttttgg aaacccgcga taactaagca tctttgcttg gctcaaaatg 33961 acattattgt atcaaatatc ccgtttttgt agtgagtaca taataatacg attctaatat 34021 agtcagaaag tacccttagg ggtactcttt ttatgaagag gtatcaaacc acaagtaaac 34081 agaagtatgt aaaaaaaatt aatgtgctaa aactccttcc cttgagcatc gctgtcactg 34141 tcgcgttcgt tcctgcgaaa cacgctacgc tcctaatttt ctttctagat ataaatatta 34201 taatagctga aaaagcttaa ttttcacaat ttaaaaatgc ttgtcgcttt gcatgaatat 34261 ttctcatgtg cttttttact gtattcatag aaatgtaaag ttgatctgca atctctttat 34321 agctaaactt tcctctataa agacaccaaa tctctgcttc tcgtcgtgtt aatttatatt 34381 tgttaacctc cacaatggca atatttttca aagactcata tttattttcc atactcacca 34441 atagataaga tctattacat ttttctaaat taagccatct agcacgaaca cagaaaactt 34501 ttgacttatc tacgacaatc tcatgagaga gaatcatagt ttttttggct aaattactac 34561 ggttttctaa taatgtctcg caaagactcc agatagcagg tggtatataa ttggaactgt 34621 cttgattaat ttgagaaacg atattacaag cagatgtatt tgcatgaatc agttcacctg 34681 tctcgttcaa aagtaaaata ccgtcttgta agccttcaat cacttcttgg aaaaaataag 34741 ctcgcagaga atcatgtaac tcaatattat ctaactgttg tcgttgaact gttttttctc 34801 tagatagtgg tgagattttt cttgtggtta 
tcatgggaca tactttattt tgcataacac 34861 tattctttat ttagtagttt tttttctgtt gaaaatatgt accttaattg tctgcatgat 34921 attgcagaac aatgacacat aggaatgcaa aggagtgtgt catagacaac atggcacaca 34981 tgagagtgaa ctttcagaag tttttatagc cctcttaaca acttagtttt gaatacagct 35041 gttaaatact ctatttcgga tatagctttt gtaagtgtaa attaatttat gtatatgcca 35101 atcgttataa aagcgttgac tcactttgga tttcaggggg agcattgtca ataggagcat 35161 ctgaatatca ataagctggc aatcgcatta agacttgctt atgcttaaca gcttatcaag 35221 caaataattt aaatcttgca agctggattt caatcgtttc aggcaagatt ttactgagtt 35281 ctttagtgtc tcgaaattgc aaaatttgtg atgatggtaa gtcaaaagga aactgccctg 35341 ctatatctga gcgttccatt tgggctagta gaatttgttc agaacactta ctttgaatag 35401 cataaccaat ttcaatacag acaaatgggc taggaagcag tagatccgtt tctttaccct 35461 caattttagt aataggtctt tctattcaac tttgtacaag cttacacgca gatgcattcc 35521 taaatgacaa caacacccaa acctctacta gataagttat ttcctcttat tgttcattct 35581 agccagggat gtactgagtt tatcacttgg acaagtgtaa actgctcagt gacagtgccc 35641 tctatgccgc tacatgtgag gaaatatcac cattcgagtt gggtgttgct gtgttgttgc 35701 gagacagatg tggattcaac agcgttcttt acctcttcac ttgtttttct ctagctgttt 35761 tatctgccag ggaagctttt tagcttcaat tattaattta gcattggttg aatatgagtg 35821 gttgttgttg aaactaaagt ttaccttctt aagcaaacta atatgtatga aaattttgtt 35881 aatgtctgcg atttctatca gaaacattgt tgtcagaaat gagcaagtac tcagttattg 35941 ttaaatatga gcaacatttg agttcaaagc aaacctttgg tagtcagtaa gtcagtatcg 36001 aaattctcta cttcccactt tctcgggaac tgtttctggg ataacaaccg tcaaaagcta 36061 caaggacaag gagatgaggt tctaggagag tggaacaatt tcctactctt cttcttttct 36121 tgtttctctc cctctttctc ttcccatttg agggttgcgt ggttaacaat gagttgaatg 36181 tgttccaaat gatttaacgt gatgcccaaa acagcaaatg agtacaccgt ataccttcta 36241 ctagaaaatg gtcaccgtga acaagcccgt tttccaacaa ttcaaaaatt ttagaagtgg 36301 tatggtggcg aacctgtgcc gaagtctgct tctaatgaca atatccgggt accaatcaaa 36361 aatattcagt cctgagtaca tggtcgtgcg tccgtctcgg attgtagcta ttcgggtaag 36421 acccattttt tgctctagtg tggaaagatt tacctaaatc atgagaaaaa cgcttacttt 36481 atttttcttg 
agtctaggag ttgctctttt tattccactt tgctcagttt ttgctcaggt 36541 tcctgactta ccaccgatac caataccagt cccaagccaa cctgagattt tgccactgcc 36601 agtgccagtg ccagtgccag tgccaaaacc cactcaattt agacacctta tgtcaccatt 36661 gatgcgtgta gatgtggcga caatctgtag ctcttcatac agttgtttgg gttgggatga 36721 gcaaatttgg ggtgaaaacg ggaaagcagg cgatagcaag gcactgttaa cagcaataga 36781 caacagcttg tattacctaa caactgataa ggctgcagca gtttatcgaa attatcctat 36841 tagagagatt acccttgacc gcgtccgacg tagtttgctg cgtttccgcc aactggttgt 36901 tagctgtgag tcaccagcac aactgcaagc tgctattcgt caggagtttg tcttttacca 36961 gtcttctgga aatgatcgca atggtaacgt gaggtttact gcttattacg agcctgtata 37021 taccgcgagt cgcgtacgga caccagtcta tcagtatccc atttatcgac gaccacccga 37081 ctttgagaga tgggcgaagc cacatccaaa acgggttgat ttagaaggaa aagatgcttt 37141 actcggcgat agaagccggt tacgtggttt ggagttattt tggtttgcca atcggtttga 37201 tgcgtacatg gttcaaattc aaggttctgc ccaactcaac ttaactgatg gtacgacaac 37261 atctgtgggt tatggaggtg caacagatta cccgtggaca agcataggca aagaactggc 37321 gaaagatggc aaacttccct tgagtggttt aacattgcca gttatgactc agtattttca 37381 gcagaatcca aaccagatgg ataattatct gccgcgctgg aaacgctttg tcttcttcca 37441 ggaaactggt agtacagggg ctaaaggaag tatccttgtg ccagtcacag cagaacgttc 37501 tattgcgact gataaatcta ttatgcctcc aggagcatta gcgctgatta acaccagttt 37561 accctttccc actgagtatg gacagatggt gcctcgtact gtgagtcgct acgtgcttga 37621 ccaagatgca ggaagcgcta ttaaaggacc aggacgagtt gattatttta tgggaactgg 37681 caaattagct ggcgatcgcg ctggtataac gggtggcaat ggtaagctgt attatttatt 37741 attgaagcaa taactctggc aactcttcaa tcttttcttt tagaggtgat aagtcacgcc 37801 aatactgcct gtaaatatat agttgtagga tgttttgaac cgcagatgca cacagatgta 37861 tacagataat ttatctgtgt gcatcttacg aaaaagttaa atattattca tctaagtgca 37921 aagaacataa ttctaataaa aagcgacaat tattttgcct atatttaagg actcctgaaa 37981 aaataaccat gaaattcagt acaattttag aaaaatttgg tgacgccgcc tcctccaata 38041 gcttctgttc caataaagaa aacgacccag aaattacagg ggttacagca gttgatgaag 38101 cgacaacggg tactctcagc tacattgaag gagcaaaatt cgcttctatg gttagtaaga 
38161 caaatgctag tgctttaatt ttacctgcag atgaaacatt acaggcacaa gcacaagaac 38221 ggagtattgt ttggattgca acgccagaac cacgactgtt atttgccaaa gcgatcgcat 38281 tattttataa accctggcgt ccgactccag aaattcatcc gactgctgtt attcacccga 38341 cagcaaaaat tggtaaacaa gtgtatgtag gtgctcatgt tgtgattcag cagggagtag 38401 aaattggtga tgatgtgtgc attcatccta atgtggtaat ttatccagat gttaagatag 38461 gcgatcgcac aaccttgcac gccaattgca ccatccacga acgaactcgt attggtgcag 38521 attgtatgat tcacagtggt gctgtcattg gttcagaagg ctttggcttt gttcccagtt 38581 ccactggttg ggtaaaaatg gaacaatccg gttacactgt tttagaagat aatgtagaag 38641 tgggctgtaa tagcgctatt gaccgcccag ccgtaggaga aacacggata ggtcgccaaa 38701 ctataattga caatctggtg caaataggtc acggttgcca aattggtgct ggatgtgcta 38761 tagcaggtca gtctggtatg gcgggaggcg tgaaacttgg taatggtgtt atcctggctg 38821 gacagtcggg aattagcaat caggtgaaga ttggcgatag ggcgatcgca tctgcaaaag 38881 ctggtgttca tagtgatatc gcaccaggtg aaattgtctc tggaagtcca tctttgccgc 38941 acaaacagta cttgaaggtg tcagctatcc tcgctcgctt accagaaatg tatcaaacat 39001 tgagacagtt acaacgtcag attaataatg gtaatggtta gtgcctgctg aaatacacat 39061 cattttgaat tttgaattgc ttaactcaga tcttgcacca taatttttgt ccgccaagaa 39121 ataaatttct tggctcaaag ctcaagtaag ctaaagctta ctggatatgc gtttcagtcc 39181 gttttaacgg actttggcta taagccttgt agacgcctct ggcggcttcc cgcagggtac 39241 ttcagttcaa ggcgtactcc ggtcgaggtg caagatctga gttaaggggt ggggttatgc 39301 aacgcaatta ttcaccaaaa aattccagtt tgcttcactt gaagagcgaa aaaagaaaag 39361 agcaaagagg aaaaggacaa aactacaaaa taatactaaa taactgagaa tattctaccc 39421 accaaataaa cgcctctaaa aacccttctt ttcattctta taattctcta agtttttacg 39481 atactgcgta ctactaggat caaggcgaac agctatttcc agttctgcga tcgcttcttc 39541 gagtttgcct tgatcgtaca gcgcaatccc cagggcatag tgagcatctg tataatttgg 39601 gtcgatttgc aacgctttac agtaagaggc gatcgcctga tcgagtttgc cttgattttt 39661 cagcgcaatc cccaggttat tgtgagcagt cgcataattt gggtcgattt gcaacgctct 39721 ttggtaacag gcgatcgcct gatcgagttt gccttgattt ttcagcgcta accccaggtt 39781 atggtgagca agtgccaaat ttggctcgat ttgcaacgct 
ctttggtaag aggcgatcgc 39841 ttcatcgagt ttgccttgag cggacagcgc // LOCUS NODE_623_length_39823_cov_5.083032 39823 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 39823) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 39823) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..39823 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 82..744 /locus_tag="DP116_05290" /pseudo CDS 82..744 /locus_tag="DP116_05290" /inference="COORDINATES: protein motif:HMM:PF01609.19" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS701 family transposase" gene complement(1469..2551) /gene="cax" /locus_tag="DP116_05295" CDS complement(1469..2551) /gene="cax" /locus_tag="DP116_05295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131605.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="calcium/proton exchanger" /protein_id="PRJNA477356:DP116_05295" /translation="MSGKNILFLVLLLFVPVSIAAHYLEWGELTVFITAGLAILPLAG WMGTATEEVAVVVGPTVGGLLNATFGNATELIIALVALNAGLVNVVKASITGSIIGNL LLVMGLSMFLGGLRYKEQTFQPVVARVNASSMNLAVIAILLPTAMKFSSQAIEEKTLQ NLSLAVAVVLIIVYALTLLFSMKTHTYLYDVGVAEEETHLHKPNIWLWTGILLICTLL VALESEMLVDSLEVATSQLGLTALFTGVILVPIIGNAAEHATAVTVAMKDKMDLSVSV AVGSSMQIALFVAPFLVLAGWVLGQPMDLNFNPFELVAVVVAVLIANSISSDGKSNWL EGVLLLAAYAVLGFAFYFLPVVDGMV" gene complement(2865..3098) /locus_tag="DP116_05300" CDS complement(2865..3098) /locus_tag="DP116_05300" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05300" /translation="MLAVAFFDNADKPSYSGISTTVSFNSNYEIKPNSGVVVFVLGVI CIIKQRFARCSQSLTQAMLAGFVVGTAKILIVF" gene 3122..5827 /locus_tag="DP116_05305" CDS 3122..5827 /locus_tag="DP116_05305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycoside hydrolase" /protein_id="PRJNA477356:DP116_05305" /translation="MTSAAELPASTGSNLTPHENIKNTKMLDPLRQATGVYVTVHGHF YQPPRENPYLDAIERQSGAAPFHDWNERIHHECYRPNAFARVLNERGEVVGIVNNYEY LSFNIGPTLMSWLERHDVEVYQRILEADAKSCDRLNGHGNAIAQVYNHIIMPLANEHD KYTQIRWGKEDFRSRFGRDPEGMWLAETAVDYATLKALVDEGIRFIVLAPSQAQRCRV LPTEDNPNPEWHEVGGNQIDPTRPYRCFLKESHQKVEAGSDNSSVQSGPYIDIFFYDG PISRDMGFSDVLFNSNHLAGRIGSAVRGDHRPAQLISVATDGETFGHHKSGTEKTLAY AFTQEFPHWGWTVSNFAHYLSLNTPTWEVKLKPVTAWSCAHGVDRWQDDCGCGGGGTW HQKWRRPLRDALNWLRDQLISVYEEHGKQFFVDPWHARDEYIQVIRDRSPENVSRFLS RHQTRKLIASEQVDALRLLEMQRHALLMFTSCGWFFEELSRPEGTQILRYAARALELA GDVAGVQLEKGFLKRLTQAPSNVDSFKHGGEVYRQLVLTAQVSFKQVAAQYAITSLFA NHKPVETSQTASSQNGHNGSIKHSHPHQKRVYCYTANELDYQLQRMGSLTLAVGNLNL VSEITWESEHLVFAVLHLGGWDFHCCIQPSEGRLAYTELKEKLFGALQEASAAHTILA MTQLFGEESFSLRNLFAEERHRLMHLLSQETLSRLDQLYTQAYRDNYGVLMAYHRDEV PAPRELQVAAEIALGSRCMINLRLLEQDIAEPLSSWNHIVELEAIATEAKHLHCHLNI PEGKQMLEQLILRSLWQFLHDPNRTFDADLQRLDRLIDVANQLHLGISLERSQELYFS CLHSLIVPLFMAKTAQSQDTAQCRQLLKLGQKLGVEVSTWLSQLG" gene 6157..6705 /locus_tag="DP116_05310" CDS 6157..6705 /locus_tag="DP116_05310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199076.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thermonuclease family protein" /protein_id="PRJNA477356:DP116_05310" /translation="MKQTLIWLCAATLIFGLIGCDRIFPSGDSVQKVSDGDTIAVKDA KGDKINVRFACVDAPEIPHSKKEKESKSSVFRNQFDWGAKAQERVQELVKQGGDRVKL NITDSDRYGRKVAEIRLRDGTFIQEVLVREGLALVYRPYLNKCPSRNIIEQAETQAKN SRRGVWSDAKFVKPWEYRSLYK" gene 6758..7573 /locus_tag="DP116_05315" CDS 6758..7573 /locus_tag="DP116_05315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317794.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MOSC domain-containing protein" /protein_id="PRJNA477356:DP116_05315" /translation="MSPYLAAISIYPIKSLDRIDVNQATILESGALQHDREFALFDEQ GHFVNAKRNAKIHLLRSTFDPNLKTISLHIQGTDQKAIFHLHDEQPSLKAWLSDYFGF AVKLMQNAITGFPDDLNAKGPTVVSTATLEEVASWFPVGSVDEMRLRMRANIEISGVP PFWEDQLFTEVGKYVHFQVGEVLFEGVNPCQRCVVPSRDSQTGKVTTHFQKVFVAKRK ESLPSWTTPTRFNHFYRLSVNTNIPASEAGKILHQGDEVKILGVSESNLRIAD" gene 7810..8454 /locus_tag="DP116_05320" CDS 7810..8454 /locus_tag="DP116_05320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317795.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05320" /translation="MSIFLSSVAVLLSTLALLCSGYTAYQVFTLQQTLNVASAGNKNA TTSTQKTSSSETSTVPPDNKPNPSEASSSSTTTAQSPATTGSAIKPGQFVQPSFGQKA EVELLSVKRIKDPETANRDVVNVQMRIRRVATDGINPAESVGVYNTTARNPDTSETYK GVNLKRSTGSVELFSLRPQASADAYVWLRIPEGVNSIDVFVPDTAAFKNVPVSN" gene complement(8623..8820) /locus_tag="DP116_05325" CDS complement(8623..8820) /locus_tag="DP116_05325" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05325" /translation="MKESKTSYNLENQTQNIELTAEELQAVVGGTNSGGGILLPEDAV PGVKVSVTGNILGIPIGVGAD" gene complement(8879..9085) /locus_tag="DP116_05330" CDS complement(8879..9085) /locus_tag="DP116_05330" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05330" /translation="MHRNINQQQTAINTNVADKNHSRQENTNIANNHRSWEDACADEL TDLEMQAISGGAVSVDLGNIGLEI" gene complement(9439..11340) /gene="ilvB" /locus_tag="DP116_05335" CDS complement(9439..11340) /gene="ilvB" /locus_tag="DP116_05335" /EC_number="2.2.1.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="biosynthetic-type acetolactate synthase large subunit" /protein_id="PRJNA477356:DP116_05335" /translation="MTVRLPSQISLPQIENESISNVSVSPAVTPKRASGGFALLDSLK RHGVEYIFGYPGGAILPIYDDLYKVEAAGNGIKHILVRHEQGAAHAADGYARATGKVG VCFGTSGPGATNLVTGIATAYMDSIPMVIVTGQVSRAVIGTDAFQETDIYGITLPIVK HSYVVRDAKDMARIVAEAFHIASTGRPGPVLIDVPKDVALEEFDYVPVEPGKVKLPGY RPTVKGNPRQINAAIQLIRESRRPLLYVGGGAIASGAHEEIKQLAELFNIPVSTTFMG IGAFDEHHPLSLGMLGMHGTAYANFAVTDCDLLICVGARFDDRVTGKLDEFATRAKVI HIDIDPAEVGKNRVPDVPIVGDVRKVLLDLLRRVQQAGTKDTPNQTQEWLNLINRWKE EYPLEVPHHADSMSPQEVIVEIARQAPDAYYTTDVGQHQMWSAQFLKNGPRRWISSGG LGTMGFGLPAAIGAKVAFPDEQVICISGDASFQMNLQELGTAAQYGLNVKTVIINNGW QGMVRQWQEAFYGERYSCSNMEVGMPDIEFLAKAYGIKGMVIKNREELKEAIAEMLAH DGPVIVNAYVTKDENCYPMVAPGKSNAQMVGLRRQPKKASLEPVYCNNCGAKNAPNNN FCPECGTKL" gene complement(11475..12716) /locus_tag="DP116_05340" CDS complement(11475..12716) /locus_tag="DP116_05340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_05340" /translation="MFPTEPAAVNKGFIAILKNRGFMLLWIGQLVSQLADKVFFVLMV ALLENYPAPAGLAQNSMYSILMVAFTVPAILFGSAGGVFVDRFPKKLIMIGSDIVRGL LTLCIPFLPRQFLILLIITFAISTVTQFFAPAEQAAIPLLVKKENLMAANALFSSTMM GALIVGFAVGEPILSWAKDSIGAKYGQELVVAGLYLLSAGVMQPINFRDKVSFDNNQP VINPWADFKDGLRYLKKNRLVLNAMLQLTTLYCVFAALTVLAIRLAEKFGLQGKQFGF FLAAAGVGMVLGAAILGNWGEKLHQKPLPLIGFLIMALVLGVFSFTQNLLLALALCAL LGIGAAFIGVPMQTLIQQQTPPTMHGKVFGFQNHAVNIALSAPLAITGPLTDALGLRI VLVGTSVVVAIVGVWAWQNTR" gene complement(13123..13695) /locus_tag="DP116_05345" CDS complement(13123..13695) /locus_tag="DP116_05345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871730.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05345" /translation="MSIIICPGIHESALTQCFVSGWIEQVIDEAKNKKAVDLLIFPGE SGLALSAFHILQFLHKHLQDRIESPVVFISFSAGVVGSILAAHKWQQIGGSVKAFIAI DGWGVPMWGNFPIHRMSHDYFTHWSSSMLGSGENNFYADPPVEHLEMWRSPQTVQGYV LDSCFGQSPPKQRLSAAEFLHLLLKHYEDN" gene 14168..15286 /locus_tag="DP116_05350" CDS 14168..15286 /locus_tag="DP116_05350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741362.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferrochelatase" /protein_id="PRJNA477356:DP116_05350" /translation="MVATPEKLHSTHEQLPSQRRVAVLLMGYGEVESYEDFANYNEQA LNLLTAKFAPVPTWVYPPLAKLLALFDRHEWGHQHHDFISPHNAIFEKQRAGIEKDLQ AKWGEGVQVFKAFNFCAPFLPKQVLAEIKSQGFDKILIYPLLVVDSIFTSGIAIEQVN NALSEMAQGDEHWVKGLRYIPSFYNEPAYIDLMAHLVEDKITADLASAYLPSQIGIVL MNHGCPHKAKGFTSGITESQILYDLVRDKLIYRYPLISVGWLNHDTPLIEWTLPNAEQ AAKNLIQLGAKAIVFMPIGFATENHETLLDVHHIIHALEKKHSDVNYVQMPCVNDHPE FLNMAAQWANAHINELLSEETVAVNPELAVAHHHHHHH" gene complement(15573..16118) /locus_tag="DP116_05355" CDS complement(15573..16118) /locus_tag="DP116_05355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADPH-dependent FMN reductase" /protein_id="PRJNA477356:DP116_05355" /translation="MVRIVGIAGSLRSESYSQLALELAAQKTQELGAEVEVLDLRKMN LPFCDGGDDYPDYPDVKRLQDAFSRADGLIMVTPEYHGSVSGVLKNALDLMSFDHLAG KVAGFISILGGQSNNNALNDLRVILRWVHAWSIPEQVAIGQAWKAFSPEGKLVDEKLS QRLDQFAKSLVENTRKLRGVE" gene 16563..18467 /locus_tag="DP116_05360" CDS 16563..18467 /locus_tag="DP116_05360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459600.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division protein FtsH" /protein_id="PRJNA477356:DP116_05360" /translation="MKNFEKKTSRKQHPAKNAALTGALAAGLIMLPGMLGATPALAQK GEREPLSYGKLIEKIENKEVKRVELDETDQLARVYLNGQKQGEQPIQVRLLEQNSELI NKLKEKDVEFGEAPSANSKAALGLLINLMWILPLVALMLLFLRRSTNASSQAMNFGKT RARFQMEAKTGIKFDDVAGIEEAKEELQEVVTFLKQPEKFTAVGARIPKGVLLVGPPG TGKTLLAKAIAGEAGVPFFSISGSEFVEMFVGVGASRVRDLFKKAKENAPCLIFIDEI DAVGRQRGAGIGGGNDEREQTLNQLLTEMDGFEGNTGIIIIAATNRPDVLDTALLRPG RFDRQVIVDAPDRKGRLEILKVHARNKKVDPSVSLEIIARRTPGFTGADLANLLNEAA ILTARRRKEGITPLEIDDAIDRLTIGLTLNPLMDSKKKRLIAYHEVGHALLSTLLENA DPLNKVTIIPRSGGVGGFSQQILNEEMIDSGLYTKAWLHDNIIMTLGGKAAEIEVFGE AEVTGGASNDLKVVTNLARKMVTMYGMSELGLVALENQSSDVFLGRDWMNRSEYSEEM ATKIDRQVREMAVVCYRKARQIIRENRALLDRLVDLLVEQETIEGEQFRKIVSEYTQL PKKQQLVVSG" gene complement(18706..19386) /locus_tag="DP116_05365" CDS complement(18706..19386) /locus_tag="DP116_05365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="M23 family peptidase" /protein_id="PRJNA477356:DP116_05365" /translation="MSSKQIATVKNPLPASIPVENFQTSKPKLPVEFNTKQIARANNW LAASFPVEHFQSYTSAFGYRRSATGGSSLEFHSGLDIAAPQGSYIRNWWIGTVMKVGD RDACGTHIAIQSGEWEHTYCHMQGQVETVDGRRYLIDRTGGIQIAEGQQLGAGMRIGR VGMTGRTTGSHLHWGLKYAKQYVDPAMVLREMFSQQQIARSSSQGWTSQQSQVIIEES KAVGDSGY" gene 20040..20429 /locus_tag="DP116_05370" CDS 20040..20429 /locus_tag="DP116_05370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119225.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05370" /translation="MLTEILPFSFELDTIAIAGASLWSLALYLGFSPVREWVILQLNR WFNFAERSLYTSQSEFEKTRKARESQNAFYASVFSILPFLVIGGLLNWGVEISLGSSW AISMGILACMGCGVYELGRRDGESSGK" gene complement(20445..22958) /gene="cadA" /locus_tag="DP116_05375" CDS complement(20445..22958) /gene="cadA" /locus_tag="DP116_05375" /EC_number="3.6.3.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015364312.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cadmium-translocating P-type ATPase" /protein_id="PRJNA477356:DP116_05375" /translation="MTQTPSLKTQVLQVDGMDCGSCAKSIEASLLSLRGVMEATVSFA TTKVKVSYDTKLLSEAEIINRITALGYTAKLSGAVKPYNHTHCCDSDHNHDHKQVEVA SAKSLQVKVNGMDCGSCAKTIEVGLQQMAGVLDVSVNFANERLQLSYDPSVVSETAIT DRIRGLGYTVGKNFEASSHRHTHDDDCQNHNHDHNHTHPVPKPSQNPAPTNWFFWISN RRGQSVILAGMGLVLGLLAQHLALPIWIARGFYGVGIVVAGYPIARAGLFELRLRRAD MNLLMTISVIGAMILGDWFEGTLVLFLFSLGTTLQVFTFGRTRNAIRALMGLTPPTAT VKRGNKEVTVTVESVQVGEILMIRPGQRVALDGVVVSGTSAIDQSPITGESIPEDKAA GDTVYAGTLNQSGFLEVKVTHTSNDTSVAKIINLVEQAQGSRAASQQWVDRFAEVYTP IVILIAIAITLIPPLAFAQPFNVWFYRALVMLVIACPCALVISTPVSIVSAIGAATRK GVLFKGGNALERAGHLTTFAFDKTGTITQGLPVVLHVYDFGKVSADMVFQIAASLEQH SEHPLAKAIVQAAKSKSIELQTPSKFTALPGKGIEAKIGDSLYFVGNRRLFADRGIPS SSDAESLLVEIETFGQTPVLVGNETGLLGAVALADGLRLEASEAVRCLKQVGLKRLVM LTGDRAAVAKQIAQQVNINEYQAELLPEDKLQAIQKLRRDGVVGMVGDGINDAPALAA ADVSFAVGGIDIAIETADVVLVGSDLRRLAYAVDLSRRTVSVIQQNVVFSLVTKALFL LLATFGFVGLAVAVLADTGTSLLVTANGMRLFRTKAFED" gene 23093..23497 /locus_tag="DP116_05380" CDS 23093..23497 /locus_tag="DP116_05380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015364311.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="PRJNA477356:DP116_05380" /translation="MSKHKGKQNLDQIQNSDAPNCNTHLVHLDNVRSTQAQILATPKA QQIAEIFGVLADPNRLRLLSALADQELCVCDLAAVTKMTESAVCHQLRLLKAMRLVNY RRSGRNVYYSLVNSHIVNLYRSVEEHLDESSV" gene 23698..24420 /locus_tag="DP116_05385" CDS 23698..24420 /locus_tag="DP116_05385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128136.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dienelactone hydrolase family protein" /protein_id="PRJNA477356:DP116_05385" /translation="MQIEQTEVMIPTPEGQMPAFLCTPAEHGHKPAVILLMEAFGLTS HIRDVAARIANEGYVVLAPDLYYRELPNNKFGYDEVEQARAMMFRLDFGKPVEEDIRA ALTYVKSRPDVNPGKVGVTGFCLGGGLTFLTACKLSDEIAAAAAFYGVVLDEWIDAVT NITVPVYFFFGGVDPFIPNERVKQIESRFEEQGIEYTLKVYSNADHGFFCDERSSYNS SAAEDSWRELTRFFHKHLQEAV" gene complement(24380..25786) /locus_tag="DP116_05390" CDS complement(24380..25786) /locus_tag="DP116_05390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317747.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipid-A-disaccharide synthase" /protein_id="PRJNA477356:DP116_05390" /translation="MTADILILSNAPGEVTTWVPPVVRQLRQQLGDDRNQIRISVVLS PCPNSSGKEADIAKSYPEVDRVQAAEHFWQFLLWGKTFENWDWRDRGVVVFLGGDQFF SVVIGKRLRYRTVVYAEWDARWHSMIDRFGVMKPAVAERVPKKFAHKFSVVGDLMQEA QDLEAGEAEGAEGAGGESNSKSRSVASEATQSQNSKLGVSSSSSPPHTELIGVLPGSK PSKLMQGVPLTLAIAEYVHAKRPQTKFVIPVAPTLDLQALAKFADPEKNSFVQTFGFS GASLIVPKLGTHSERPELKTGTGLCVQLWTQTPAYDLLSQCCLCLTTVGANTAELGAL AVPMIVLLPTQQLDAMRSWDGLPGLLANLPIVGSGFAKVINWLVLRRLGLLAWPNIWA QEMVVPELVGKLQSQEVGEMVLDFLAHPEKLAEIRAKLRSIRGEPGAAQKLATLVEEE LGKQEVRRLPVDVYGRTV" gene complement(26515..26598) /locus_tag="DP116_05395" /pseudo CDS complement(26515..26598) /locus_tag="DP116_05395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198315.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 26732..27679 /locus_tag="DP116_05400" CDS 26732..27679 /locus_tag="DP116_05400" /inference="COORDINATES: protein motif:HMM:PF00487.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty acid desaturase" /protein_id="PRJNA477356:DP116_05400" /translation="MVVMKSKRFEQRILEHIRPMQKPDNWRNFVYLLRDYCLIALSIA LYKIYPSIGTYLLTVLLIGSRMRAFDNLTHESSHKMLFTNPRLNYWIATLFCAFPVGT STSTYWQSHMDHHKWLGNPERDPDLIRYQSLNVDRFPVPYREMVFHLLKVFCLTHVPK YLYGTLQSFVLSSDTPRSERIARTLFWITVFTVLTAFNLWHDFLLFWVIPFLTSFQIL RYLSEISEHGGLYSAEHTIELARNNFCHPVLRFILYPHGDFYHLVHHLFPAIPHYNLG PAHQILLEDSEYQQAHHCYGYFYSSDPNQKSTLGEMILK" gene 28168..28872 /locus_tag="DP116_05405" CDS 28168..28872 /locus_tag="DP116_05405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014807015.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05405" /translation="MKITVAKSVLNRYPESIIAYLLAEVKVEEKHQYVETLKAELWER LTRLGITQTNLTEHPNINGWRQIYHDEFGVKPSKFRSSVEALVRRVIGGQGLWQVSSV VDLYNCVSVLTLLSIGAYDLRKIRGDIHLRYGHNGEVFLPLSSQEVIPVSEKQIVYAD EEKVLCYLWNHRDSRLSAIDADTRHALFFIDTAFHPQTCSMQEALQTLSRHLSQIGGV ELGSGLLNVNYPSVEV" gene 28923..29531 /locus_tag="DP116_05410" CDS 28923..29531 /locus_tag="DP116_05410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007083123.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid transporter" /protein_id="PRJNA477356:DP116_05410" /translation="MISAFIHGIILAFGLILPLGPQNVFVFTQGATQPRLFYALPIAL VASLSDTLLILLAVLGVSVVVLSLPWVKTVLVVAGVLFLCYIGWITWKSDEETNGKSN NASNWSLKQKIMFTLSVSLLNPHAILDTIGVIGTSSLSYTGLDKVVFTLTCISVSWLW FFTLTVVGRLVGTFKHTRKMFNRVSAVIMWLSALYLVYNFTN" gene 30056..34045 /locus_tag="DP116_05415" CDS 30056..34045 /locus_tag="DP116_05415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111305.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium chelatase subunit H" /protein_id="PRJNA477356:DP116_05415" /translation="MFTHVKSTIRHIAPDNLRGRHLIKVVYVVLESQYQSALSQAVRE INQNHPNLAIEISGYLIEELRDQENYEEFKKEMASANIFIASLIFIEDLAQKLVAAVA PYRDRLDVAVVFPSMPEVMRLNKMGSFSLAQLGQSKSVIAQFMRKRKEKSGAGFQDGM LKLLRTLPQVLKYLPVDKAQDARNFMLSFQYWLGGSPDNLENFLLMLADKYVLKNNVE TKNFASVQYKAPVVYPDMGIWHPLAPTMFEDVKEYLNWHNSRKDIPYDLKDPLAPCVG LVLQRTHLVTGDDAHYVAIVQELEAMGARVLPVFAGGLDFSKPVEAYLYEPTTKTPLV DAVVSLTGFALVGGPARQDHPKAIDALKRLNRPYMVALPLVFQTTEEWQDSDLGLHPI QVALQIAIPELDGAIEPIILSGRDGTTGKAIALQDRVEAVAQRALKWANLRRKPKLNK KVAITVFSFPPDKGNVGTAAYLDVFGSIYEVMKALKNNGYDVQDLPESAKELMQEVIH DAQAQYNSPELNIAYRMSVPEYEAFTPYSQRLEENWGPPPGQLNSDGQNLLVYGKQFG NVFIGVQPTFGYEGDPMRLLFSRSASPHHGFAAYYTYLEQIWQADAVLHFGTHGSLEF MPGKQMGMSGECYPDNLIGAIPNLYYYAANNPSEATIAKRRSYAETISYLTPPAENAG LYKGLKELSELIASYQTLKDTGRGIPIVNTIMDKCRMVNLDKDIALPEQDAKDITAEE RDNIVGSVYRKLMEIESRLLPCGLHVIGKPPSAEEAIATLVNIASLDRSEEEIQSLPR IIANSIGRNIDEVYQNSDKGILDDVQLLQDITMATRAAVSALVKEQTDAEGRVSLVSK LNFFNMGKKEPWVEALHKAGYTKVDVTALKPVFEYLEFCLQQVCADNELGALLKGLAG EYILPGPGGDPIRNPDVLPTGKNIHALDPQSIPTTAAVQSAKIVVDRLLARQMAENGG QYPETIACVLWGTDNIKTYGESLAQIMWMVGVRPVPDALGRVNKLELIPLEELERPRI DVVVNCSGVFRDLFINQMNLLDQAVKMAAEADEPSSMNYVRKHALEQAEEMGINLRQA ATRIFSNASGSYSSNINLAVENSTWESEAELQEMYLNRKSFAFSADNPGTMAESRKIF EKTLKTAEVTFQNLDSSEISLTDVSHYFDSDPTKVVASLRGDGKTPASYIADTTTANA QVRSLSETVRLDARTKLLNPKWYEGMLSHGYEGVRELSKRLVNTMGWSATAGAVDNWV YEDTNETFIKDEAMRNRLLNLNPHSFRKVVSTLLEVNGRGYWETSESNLELLRELYQE VEDRIEGIE" gene 34146..34721 /locus_tag="DP116_05420" CDS 34146..34721 /locus_tag="DP116_05420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130873.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_05420" /translation="MDALTVNLNPVIELTDEQFFQLCQANRDLRFERTATGELIIMPP TGGETSNSNAGLTAQLWLWNEQDKLGKVFDSSGGFKLPNGADRSPDAAWVKLERWNTL TQEQQTRFLPLCPDFVVELLSPSDSLKVTQQKMKEYQENGARLGWLINRKSRQVEIYR IGQEVEVLESPVNLSGEDVLPGFVLNLEAIW" gene 34743..36110 /locus_tag="DP116_05425" CDS 34743..36110 /locus_tag="DP116_05425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015227176.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_05425" /translation="METDVYAQQLELFQLFQPRNTRSCHYKFTFVDLFAGIGGFRIPL EELGGQCLGYSEIDKEAIKVYQNNFLNSNSDEAYLGDITKLNILPFQIDILVGGVPCQ PWSIAGKLQGLDDPRGKLWIDVFRVVKANKPKAFIFENVKGLTEPRNRSSLQYIINNL TAYGYVCSWKVLNSYDFGLPQDRDRIFIVGIRNDIENCWGLTFPKPLDKQPKLYDVIP GLQQANFLKKKFPPEVLFDGKVPASRGRFQKIDELNDFFLFSDIRDGHTTIHSWDLID TTLREKLICQTILKNRRKKMYGLKDGNPLSREVLETLIPNLQQQEVDSLVFKEILRFV EGQGYEFVNSKISSGINGISKIFLPHADAIATLTATGTRDFVATISIQCQKPEAYKQT FIKEIYTKKKFKHLTAQDYARLQGFPEGFQIANNESTAKHQFGNAVSVPVVYHLAKAL LKIIL" gene 36125..36964 /locus_tag="DP116_05430" CDS 36125..36964 /locus_tag="DP116_05430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TdeIII family type II restriction endonuclease" /protein_id="PRJNA477356:DP116_05430" /translation="MAAISSKTGAEIKGYLEGFIQGLVDEYKGREIIKPDNPAEYLSR FSPNGELKPFQAALIPPELIRINQFERGLSTRLGNSLEECARLIALEHHQQARRGYDI RAEVSLAAFAEAERQKENYETVAHRGQAKPSLEQMITAVLNARRSDDLETKSVRTDLY ILAKDGTEFFFEIKAPKPNKGQCLEVTQRLLRFHLLCGINRPQVKAYYAMPYNPYGFT KSDYKWSYGLNYMPFEEAVVIGNEFWNIVGGATAYEELLDIYLEIGREKSKYMLDALA FGF" gene complement(36961..37149) /locus_tag="DP116_05435" CDS complement(36961..37149) /locus_tag="DP116_05435" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05435" /translation="MSFVQIRFSHHHFWIRHNQLFKVFLHWFGWLLASMNATLTDVDA HWFVVKHRTGEDEGDART" gene complement(37407..37628) /locus_tag="DP116_05440" CDS complement(37407..37628) /locus_tag="DP116_05440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408264.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05440" /translation="MLNIGDYARHEITGQIGQVIGYGHQILHGVYLTTLKVRASNLQG VDNQKRFLEDVYSAWILAESTEGSVALQA" gene complement(37686..37865) /locus_tag="DP116_05445" CDS complement(37686..37865) /locus_tag="DP116_05445" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05445" /translation="MLRNFPLAKNLCVVDKAPAFRNGGDWCYQVQDGKYSGLSTENAA LIKRAILHHAAPNGT" gene complement(37883..39160) /locus_tag="DP116_05450" CDS complement(37883..39160) /locus_tag="DP116_05450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_05450" /translation="MLPTKKHRIALISVDGDPAAEIGQEEAGGQNVYVRQVGYALAQQ GWQVDMFTRQSSPEQATIVQHGPLCRIIRLKSGPAEFIGRDDLFEHLPEFIAEFQAFQ QKQGFQYPLIHTNYWLSSWVGMELKKQQPLIQLHTYHSLGTVKYSAISDIPAIASKRI AVEKACLESVDRVVATSPQEQEHMRRLVSTKGKIEVIPCGTDVDRFGLIQRSAARQEL GIAPDVKMILYVGRFDRRKGIETLVRAVAKSSLRGHANLQLVIGGGYRPGHSDGIERD RIASIVAELGLENCTTFPGRLNEAVLPFYYAAADVCVVPSHYEPFGLVAIEAMASQTP VVASDVGGLQFTVVPEVTGLLVPSKDEVAFAAAIDRILLNPDWRDQLGEAGRQRVEIA FSWHSVASRLTQLYTHLLAQTMPSIESKAQVAA" BASE COUNT 11529 a 8825 c 8464 g 11005 t ORIGIN 1 tattattcag gaaggcatca ccgtgccgtt aaaggtattc agttaattac cttgtattac 61 acggactgat caggtaaatc tgtgcctgtc aattatcgca tttataataa acaggaagaa 121 aaaaccaaaa atgactattt acgagagatg attaatgaag ttttggactg gggtttaaag 181 ccgaagacag tgaccactga tgcttggtat tccagccaag aaaacctgaa gttactaaaa 241 aacaagcaat tagggttttt aactggtgtg gctaaaaatc gctcgtgttc ggttgatggt 301 aaaaatttta cccaagtcca aaatttagaa attcccgaag atggtttaat agtgtatctg 361 aaaaattttg gtcaagttag ggtatttcgg agaagtttca aaaacgaaac ttccagatat 421 tacatcatgt atactcctga aaaagataca ctaaagtcaa tttcaagaac agaatttaag 481 gaactacatt ctattcattg gggaattgag tgttaccaca gagctattaa acaactatgt 541 ggtatcggtc aatttatggt cagaacgact gaagcaataa aaactcattt ttttagcgca 601 attcgagctt ttacacactt agaattaatg cgtgcagaag atttaattga aaattggtat 661 gaactccaaa gaaatttatc tcttcaagta gctcgtgact ttattttaga acacctaaca 721 cagaagttta acttggctgc atagtatcaa ttttctgtca atgcgtaagt tctaacgtcg 781 tgtcagtatg 
cgatctagta gtgcaccctc tacgcaagct gtgccgtctg cacgtgttaa 841 ataactaatg acgctcggcg ggcgaggctt tatcgcattt ctacaagaaa gcagatcacc 901 cgtcctcttc gggtaattag gctacacata ttaatatgaa aaattactga agctacactc 961 aatataaaaa gttactaaag ttaggtgaga aagtttgaag aattaataaa gttagtttac 1021 ttacatcttg cacctgagcc ataacatgta aaataaatga catttgattt cacttcgttc 1081 tcgtccgcct aggactgtaa gtcccaggct aataagcgaa gtccattaaa atggactaaa 1141 actggaacta atttttgagt atttttaaac attaaacaat aaaagtatta gtggtttact 1201 cgtaatggtg caagatctca gtttatccac tttaattatc aataatctta catatcgctt 1261 ttcaagcatc tactgtctag taagcaattc aaccgtgctt catatgatag tggtattaca 1321 tatcacattt ttgttccaac catcaatttt tttgtcaagt atatcgggtt ttgaagtact 1381 atttttgaca aaaaaataaa gcaaagcccc tggactacct tactaacggt gagtcagggg 1441 atcattcata attgcagcta agctattctc aaaccatacc atcaacaacc ggaaggaagt 1501 agaaagcaaa ccccaacact gcatatgctg ctaagagtag tacgccttct aaccaatttg 1561 acttaccgtc ggaactgatg ctatttgcaa tcaacaccgc tacaaccaca gctactaatt 1621 caaagggatt aaaatttaaa tccatcggtt gaccaagtac ccaccccgct aaaactaaaa 1681 agggagcaac aaatagagca atctgcatac ttgatcccac cgccacagaa acagaaagat 1741 ccattttatc tttcatagcc acagtaacag ctgttgcgtg ttcagcagcg ttgccaataa 1801 tgggtacgag gatcacccct gtaaatagtg ctgtcaaacc taactgcgat gtggctactt 1861 ctaaagaatc aactagcatt tctgattcta atgcaactaa aagggtacat attagtagaa 1921 tcccagtcca caaccaaata tttggtttgt ggagatgtgt ttcctcttct gcgacaccca 1981 catcgtaaag ataagtgtgg gttttcatgg aaaataacaa agttagggcg tagacgataa 2041 ttagtaccac agcaaccgca agggacagat tttgcagggt tttttcttca attgcttgag 2101 agctaaattt catcgctgtg ggcagtagaa tcgcaatcac cgccaaattc atagaagaag 2161 cgttgactcg cgctacaact ggctgaaatg tctgttcttt gtagcgcagt ccacctaaaa 2221 acatggaaag acccatcacc agtagtaagt tgccgataat tgatccggtg atactggctt 2281 tgacgacatt cacaagtccg gcgtttaggg caactaaggc gataatcagt tctgtggcgt 2341 tgccaaaggt ggcgtttagc aaccccccaa ctgttggacc aaccacgaca gcaacttctt 2401 ctgtggctgt ccccatccaa cctgctaagg gcaaaatagc taaaccagct gtgatgaaaa 2461 ctgttaactc tccccactct 
aagtaatgag ccgctatgga aaccggaaca aacagtaaca 2521 aaaccaaaaa tagaatgttt ttacctgaca tttgtcattt ttgaggtgaa agtattagaa 2581 acagttgtct gaggaggaag tcctcgcaaa tcctgagcac aatgagtttt ctgtggaaca 2641 gagtcctgag ttttgaatat attacactac agcactcagc actcgtgtag accaaactct 2701 aggatactcc cagccctctg attcaggagt tctcaaattt gattatttgc tttctcaact 2761 gagatttgta aaataaaatt gcaatatgag tacaatccga aatgtttcgt tgcaaaagtt 2821 tccaataaac ttacgatttt acaccaagga tggggtttaa tatattaaaa gacaattaaa 2881 attttagcag taccaacgac aaagcctgct aacatggctt gtgtcaatga ttgactgcat 2941 cgggcaaagc gctgcttgat aatgcagatt acccccaaca cgaacaccac taccccggaa 3001 tttggtttga tctcatagtt gctgttgaaa ctaactgttg ttgaaatgcc gctgtaggaa 3061 ggtttatcag cattatcaaa aaacgcaact gcaagcactg tttatttttt ggaataactt 3121 catgacttct gctgctgaac taccagctag tactggttca aacttgacgc cgcatgaaaa 3181 tatcaaaaac accaaaatgc tcgatcccct gagacaggct actggtgttt acgtcactgt 3241 tcatgggcat ttttatcaac caccacgaga aaacccttat ctagacgcga ttgagcgcca 3301 atctggtgca gcgcctttcc atgattggaa cgaacgaatt catcatgaat gctatcgccc 3361 caatgccttt gccagggtgt taaatgaacg aggcgaagtg gtggggatcg tgaataatta 3421 tgagtatttg agctttaata taggaccaac gctcatgtcg tggctagaac gccacgatgt 3481 agaggtttat cagcgaattt tggaagcaga tgctaaaagt tgcgatcgcc taaacggaca 3541 tggtaatgcg atcgcgcaag tatataacca catcatcatg cccctggcaa acgagcatga 3601 taaatacacc caaatccgct ggggcaaaga agatttccgc tcccgctttg gacgcgatcc 3661 cgaaggaatg tggctggcgg agactgctgt agactacgcc acgttgaaag ctttagtcga 3721 tgaaggaatt cgctttattg tgcttgcacc atcacaagca caacgttgtc gtgttttgcc 3781 caccgaggat aatcccaacc ctgaatggca cgaagttggt gggaatcaga ttgatcccac 3841 acgcccatat cgttgctttt tgaaagagag tcaccaaaaa gtagaagcag gatcagataa 3901 ctcatctgtt caatcgggcc cttatatcga catcttcttc tacgatggtc ccatctcccg 3961 agatatgggt tttagtgatg tcctgtttaa ttccaatcat ttagccggac gcattggttc 4021 agcagtgcgt ggggatcatc gccctgcaca gttaatttct gtcgccacag atggagaaac 4081 ctttggacat cacaaaagcg gaaccgagaa aacattagcc tacgccttta cacaagagtt 4141 tccccattgg ggttggacag tcagcaactt 
tgctcactac ctcagcttaa atactcctac 4201 atgggaagta aaactcaagc cagtcactgc atggagttgc gcccacggcg tcgatagatg 4261 gcaagatgat tgcggttgcg gtggtggtgg gacatggcac caaaaatggc gtcgcccctt 4321 acgggatgcg ttgaattggc tccgggatca gctcatttcc gtgtacgaag aacatggtaa 4381 acagtttttc gttgatccct ggcatgcaag agatgaatat atccaagtga tacgcgatcg 4441 ctctccagaa aacgtgagtc gcttcctgtc gcgtcatcaa actcgtaaac tcatagccag 4501 tgaacaagta gacgccctgc gcttattgga aatgcaacgt catgctttgc tcatgttcac 4561 cagttgtggc tggttttttg aagaactttc gcgtccagaa ggaacgcaaa ttctgcgtta 4621 cgccgcccgt gctttagaac tagcaggaga tgtcgctggt gtacagttag aaaaaggctt 4681 cctcaaacgt ctaacccaag cacccagcaa tgttgattcg ttcaaacatg gtggcgaagt 4741 ttatcgccag ttggtgctta cagctcaagt cagctttaaa caagttgctg cccaatacgc 4801 cattacttca ctatttgcca accacaaacc tgtagagacg tcccagacag cgtcttcaca 4861 aaatgggcac aatggttcca tcaaacattc tcatccacat caaaagcggg tttattgcta 4921 caccgcgaac gaactcgatt accaattgca acggatggga tcattgactt tggcggtggg 4981 aaatttgaac ctcgtctcag agattacctg ggaaagcgaa catttagtgt ttgcagttct 5041 gcatcttggt ggttgggatt tccattgttg cattcaaccc tccgagggac gactggctta 5101 cactgagttg aaagaaaagc tgtttggggc actgcaagag gctagtgcag ctcacacgat 5161 cctagctatg acgcagttgt ttggtgaaga atctttcagc ttgcggaatc tatttgcaga 5221 agaacgccac cgactcatgc acctcttgag tcaagaaacc ctgtcacgat tagatcaact 5281 ttatactcaa gcataccgtg ataattacgg tgttttgatg gcgtatcacc gtgatgaagt 5341 ccctgcaccg cgagagttgc aagttgcagc agaaattgct ttgggttctc ggtgtatgat 5401 aaatttacgc ttacttgagc aagacatcgc tgaaccgcta tcaagttgga atcacatcgt 5461 cgagttagag gcgatcgcca cagaagccaa gcacctgcat tgtcatctaa atatacccga 5521 aggcaagcag atgttagagc aattgatatt gcgttctctg tggcaattct tgcacgatcc 5581 caacaggact tttgatgcag atttgcaacg cctagatcga ttaattgatg tggcaaatca 5641 gttacatctt ggtatctctc ttgagcgttc tcaagaactc tacttcagtt gtctgcacag 5701 tctgattgtg ccactcttta tggctaaaac tgctcaaagt caggataccg ctcagtgccg 5761 tcagttgttg aaactggggc aaaaattagg ggtagaagtg agtacttggt taagtcagtt 5821 gggctaagac aaagagcgaa aataacgcgc aaccagggaa 
gcctctaagt caaagagtca 5881 ggcgtgcgct ttgcgcatac gtgaacaaca agattcccga cttcttaaag aagtcgggaa 5941 tctgagcacg cgcaggcact ttcacacaaa tcaaataaga ttgctatggc agttgccagg 6001 tagattagga catgaactaa tgagaaaata cgggcaccac caaaccatca agcttattaa 6061 gcgttaagag ttaagcgttc cctgttccct gttccctgtt ccctgttccc tgttccctgc 6121 tatatatgat ttgatagaag atatattgtc agacaaatga aacagacgtt gatttggctt 6181 tgtgcagcta ccctcatttt tggtttgata ggctgtgatc gaatttttcc ctctggtgac 6241 tcagtgcaaa aagttagtga tggtgatact atcgcggtga aagacgccaa gggggacaaa 6301 attaatgtgc ggtttgcatg tgtggatgca ccggaaatcc cgcactctaa gaaggaaaaa 6361 gaaagtaaaa gttctgtttt tcgcaaccaa tttgattggg gtgcaaaggc gcaagaacgg 6421 gtgcaggaac tggtgaaaca aggaggcgat cgcgtcaaac taaatatcac cgatagcgat 6481 aggtacggac gcaaagtggc tgaaatccgt ttgcgcgatg gaacttttat ccaagaagtt 6541 ttagtacgag agggattagc gcttgtttac cgtccttatt taaacaagtg tcccagcaga 6601 aatatcattg aacaagctga aactcaggcg aaaaatagtc ggcgaggagt ctggagtgat 6661 gctaagtttg tgaagccttg ggagtacagg agtctatata agtaacattt aaaatctgag 6721 tagcgagttt tgcaaggttc gtgaaaagat tttgattatg tcaccctatc tagcagcaat 6781 ttctatttac ccgataaaat ctcttgacag aattgatgtc aatcaagcaa caattcttga 6841 gagtggcgca cttcagcatg accgagagtt tgccttattt gatgaacaag gtcattttgt 6901 caatgcaaag cgcaatgcca agattcactt gttgcgatca acgtttgatc ctaacttaaa 6961 aacaatctcc ctacacattc aggggactga ccaaaaagct atttttcacc ttcatgatga 7021 gcaaccctct ctaaaagcct ggttaagtga ttactttggc tttgcagtta agttaatgca 7081 aaacgcgata acgggctttc cagatgatct taatgccaaa ggacccaccg tggttagtac 7141 agccacgctt gaagaagtcg cttcctggtt tcctgtagga agtgttgatg aaatgcgact 7201 ccgtatgcga gcgaatatag aaattagtgg agttccacca ttttgggaag atcaattatt 7261 tactgaagta ggaaagtatg ttcactttca agtgggagaa gtcttgtttg agggagttaa 7321 tccttgtcag cgttgcgtcg ttccctcacg agattctcaa accggaaagg tgacaactca 7381 ttttcagaag gtgtttgtgg ctaagcgcaa agagtctcta ccatcttgga cgacgccaac 7441 ccgtttcaat catttctaca ggttaagcgt gaatacaaat atacctgcat cagaagcagg 7501 aaaaatttta catcaaggag atgaggtaaa aattctgggt gtcagcgaaa 
gcaacttaag 7561 gatagcggat tgaataacat ccaaatcact cctttaacct cactcccgcc tttaactatc 7621 gttaaagtct cccctctcct tgataaggag aggggttggg ggtgaggttt tgtacttaat 7681 taaattcgcg ttcctaagct aaatttttaa agattagtag gcaactcatt aaccaaatgt 7741 actcacagca tgacaaaatc gagagattga caaacttgta tatctagaga tagaaataga 7801 ggttacggtg tgagtatatt cctatctagt gttgctgtct tactctcgac tttggcttta 7861 ttatgcagtg gctatacagc ttaccaagtt ttcaccttac aacagacgtt gaatgttgca 7921 agtgctggta ataagaacgc caccacttct actcaaaaaa cttccagttc agaaaccagt 7981 actgttcctc ctgacaataa gcctaatcca tcagaagcgt cttcatcttc aaccacaaca 8041 gcacaatcac cagccaccac tggtagtgct attaaaccgg gtcaatttgt gcaaccttcc 8101 tttggtcaga aagcagaagt tgagttactc tcggtaaagc ggattaaaga tccggaaaca 8161 gcaaaccgtg atgtcgtgaa tgtgcagatg cgtatccgtc gcgtggctac agatggaatt 8221 aatcctgctg aatctgtagg agtctacaac acaacagcac gtaatcctga cacaagcgaa 8281 acctataagg gagttaacct caagcgctca actggtagtg ttgagttatt ttctttgcgt 8341 cctcaagctt cagctgatgc ttatgtctgg ttaaggattc ctgagggtgt taacagcata 8401 gatgtctttg taccagatac agcggcattt aaaaatgtgc cggtttctaa ttagtccgga 8461 gtctttgtat gaacaacaag acccccgact tatcaaataa gtcggggatc tcaagccagc 8521 tgattttcac aagtctaccc aggattgcga tagtgtgccg agattttatt gagacgaata 8581 cctctactgt tgcttttgca cactctggca atccaactga gtttaatcag cacctacgcc 8641 aattggtatg ccaagtatat ttcccgttac tgacacttta actcctggca cggcatcttc 8701 tggtaacaat ataccaccac cagagtttgt acctccgaca accgcctgta actcctctgc 8761 tgtcaattct atgttctgtg tttggttttc taggttatag ctggttttgg attctttcat 8821 gattttcgag gtgtatcgag tcaataggtt actgttcaaa aacctttatt taacagaatt 8881 aaatctcaag accgatattt ccaaggtcta cgcttacagc accaccactg atagcctgca 8941 tctcaaggtc agtcaactca tctgcacatg catcttccca tgagcgatga ttgtttgcaa 9001 tgttggtatt ttcttgccgt gagtgatttt tgtcagcaac gttggtattg attgcggttt 9061 gttgttggtt gatgttgcga tgcatgtgaa ttcctcgata aatagtatgt gtttgattga 9121 tgattcgaga ataccaagta gatactgatg attctatcta tcgacagggt gaaaacatgt 9181 cttttttttt acaaaaatgt ctatacgtag acgggtgtcg cccgactcta cccacaagct 
9241 agggtttttc tgtttacctg tgcgttttgt atgaacaaca agatccctga cttatcaaat 9301 cccacttttc cacactctcg caatccaact gagtttcatc aacactcgtg ctaaacggta 9361 tgcaaagtat atttcccttt gcacataaaa gggtgagcat tgcccaccct acataattat 9421 tcttgaacca caatcgctct acaacttagt cccgcactca gggcagaagt tgttatttgg 9481 cgcgtttttc gcaccgcagt tgttgcaata aactggctct agtgaagctt tctttggctg 9541 gcgtcgtaac ccaaccatct gagcgttgct ctttccagga gcaaccattg ggtaacagtt 9601 ttcgtcttta gtcacgtagg cattgacgat aactggacca tcgtgtgcca gcatttcggc 9661 gatcgcctcc ttcaactcct ctcggttttt aatcaccatg cctttaatgc catacgcctt 9721 tgccaaaaat tcaatgtccg gcattcctac ttccatattc gagcaggagt aacgctcacc 9781 gtagaacgct tcttgccact ggcgcaccat tccctgccag ccgttattga ttatcacagt 9841 cttgacattt aagccatact gtgcagcagt tcctagttcc tgtaaattca tttggaaact 9901 ggcgtcaccg ctaatacaga tgacttgttc atccgggaac gctaccttag cgccgattgc 9961 tgctggtaag ccaaaaccca tcgttcccaa accaccactg gaaatccagc gccggggacc 10021 attcttgagg aattgcgccg accacatttg atgttgaccc acatctgtgg tgtagtaggc 10081 gtcgggtgct tggcgtgcga tttctacaat cacctcttgc ggtgacatac tatcagcatg 10141 gtgtggcacc tccagaggat actcttcctt ccagcggttg atcaggttca gccactcttg 10201 ggtttggttg ggtgtgtcct ttgtacctgc ttgctgaacg cgacgcaaca aatcaagcaa 10261 gactttccgc acatcgccga cgatgggtac atcaggaacg cggtttttgc caacttccgc 10321 tgggtcgatg tcgatgtgaa tgactttagc gcgggtggcg aattcatcca acttgcctgt 10381 cacacggtca tcaaatcttg caccaacaca aatcagcaag tcacaatctg tgacagcaaa 10441 gttagcgtag gcggtgccgt gcattcccaa cattcccaaa gacaggggat gatgttcgtc 10501 aaaggcaccg atacccatga atgttgtgct gacagggata ttgaataatt ccgccagttg 10561 tttaatttcc tcgtgtgctc ctgaggcgat cgcaccacca cccacataca acaacggacg 10621 acgactttct cgaatcaact gtattgctgc gttaatctgg cgtggatttc ccttcaccgt 10681 gggacgatac ccaggtaact tcacttttcc cggttccaca ggcacatagt caaattcttc 10741 caacgccaca tctttgggaa catcaatcaa aactggtccc ggacgtccag tgctggcgat 10801 gtggaaggct tcagcaacaa ttcgcgccat atctttagcg tcacgcacca cataggagtg 10861 cttgacaatt ggtagcgtaa ttccgtaaat atcagtttcc tgaaacgcgt 
ccgtaccaat 10921 taccgcccgt gatacctgtc ccgtaacaat aaccatcggg attgagtcca tataggcagt 10981 ggcgatgccc gtcaccaaat tcgtcgcccc aggacccgaa gttccaaagc acactcctac 11041 cttccccgtc gcacgggcgt aaccgtcggc ggcgtgtgcg gcaccttgtt cgtgtctcac 11101 caagatgtgc ttaatgccat taccagcagc ttctactttg tataggtcgt cgtaaatcgg 11161 taggattgca ccaccaggat aaccaaaaat atactcaacg ccgtgtcgct tgagactatc 11221 aagcaaggca aaaccaccgg atgcacgttt tggcgtcacg gctggcgaga cagagacatt 11281 ggagatgctt tcgttttcga tttgtgggag actaatttgg gaaggcaaac gcacagtcaa 11341 acctcacgct aagcttaagt taatgctgaa tctttgtagt taaaacttca ttttaattga 11401 aaagcttagt acaatagcga cgacatttcc tgtaaaaaat agaaaattca ttttagaaaa 11461 atgtctcgca atactcaacg agtgttttgc caagcccaaa cgcctacaat cgccaccaca 11521 acactcgttc ccacaagcac aattcgcaac ccaagagcat cagtaagtgg gccagtgatt 11581 gctagtggtg cgctaagggc tatgttcacg gcatgatttt gaaacccaaa cactttacca 11641 tgcattgtgg gtggtgtttg ctgttgaatc agagtttgca ttggtacgcc aataaaagca 11701 gcgcctatac ccaataatgc acaaagtgct aaagctaaga gcaagttttg agtaaaactg 11761 aacactccta aaaccagcgc catgattaaa aatccaatca ggggtaaggg tttctggtga 11821 agtttctcac cccagttacc taaaatcgct gctcccaaaa ccatacctac cccagctgct 11881 gctaagaaaa agccaaattg tttcccttgt aagccaaact tttctgctaa tctgatagcc 11941 aacactgtta atgctgcaaa cacacaatac aaagttgtca gttgcagcat ggcattcaac 12001 acgagacgat ttttcttaag ataacgcaaa ccatctttaa aatcagccca aggattgatg 12061 actggctgat tgttatcaaa ggataccttg tctctaaagt taatcggttg catgacgccc 12121 gcagacaaca ggtataatcc tgctaccaca agctcttgac catattttgc tcctattgag 12181 tctttcgccc aacttaaaat tggctctccc accgcaaagc caacaatcaa agctcccatc 12241 attgtgctgc taaacaacgc attggctgcc atcaaatttt cttttttcac caacagcgga 12301 atagcagctt gctcagcagg tgcaaaaaac tgcgtcacag tggagatggc aaaggttatt 12361 ataagcagaa tcagaaattg tcgtggcaaa aaaggaatac acagcgttaa tagcccgcgc 12421 acgatatcag atccaatcat aatcagcttt ttggggaagc ggtcaacaaa gacaccgcct 12481 gcggaaccaa acaatatcgc tggtactgta aacgccacca tcaatatcga atacatcgag 12541 ttttgcgcca atccagcagg ggctgggtag 
ttttcaagca gagcaaccat taaaacaaaa 12601 aagactttat ctgctaattg agataccaat tgcccaatcc acagcagcat gaagccacgg 12661 tttttgagaa tggcaataaa ccctttattt acagcagcag gttcagttgg aaacatcaga 12721 atcttttgaa atagggatta gggaacgcag taagggaggc agggagggga ggcaggggaa 12781 gcaggggaca gatcaattcg ctcattcgct cattaggaga aatttgctca ttcgctcatt 12841 agaagtatct ccctcatctc cacggcaggt aggtgcactc aacgacgggg aaccctctcc 12901 tagtctgtcc tatgccctat gcctacggca tgaatcaggc gttcgctttg cgcttacgct 12961 ttttgcctca acgaaggaaa acgccacatc cgaagaaagc cggaagccct caaaacagtg 13021 cacgctgcct caccaagggc gcactgcctc atagtcttcc tcatcatctt tattccctga 13081 tcttctttta ctcactcctg ctttgcaaat ctgttttccg gactagttat cttcataatg 13141 ctttaaaagc aagtgcaaaa actcagccgc gcttagacgc tgtttcggcg gtgattggcc 13201 aaaacaagag tctaaaacgt agccttgtac agtttggggc gatcgccaca tttctaaatg 13261 ctcaacaggt ggatccgcat aaaagttatt ttccccgctt cctagcatag aagaactcca 13321 gtgggtgaaa taatcatgac tcatccgatg gataggaaaa tttccccaca taggcactcc 13381 ccatccatct atagcaataa aagctttgac agaaccgcca atttgttgcc atttatgagc 13441 agcgagtatt gacccaacca caccagcact aaagctaatg aatacaactg gcgattctat 13501 cctatcctgc aagtgtttgt gtaaaaattg caatatgtga aatgctgata aagctaaacc 13561 actctcacca ggaaatatca gtaaatctac tgctttttta ttttttgctt catctataac 13621 ttgctctatc caccctgaga caaaacattg agttaaggct gactcatgaa tcccagggca 13681 aataattata ctcatctctt cttgtgttag cagctttttc ttcaaactat atcagtataa 13741 atgttataaa gttaacattt ttttccatgc ttttccggac acattccggc ttgcaatatt 13801 caaaaaaagt tcaacctgtt tgaattcagc aaagcggtga cttgtcctta taattaacaa 13861 ctcaacatat caattccact ctatcaagcg ctacacaatt tgcgaaacaa gtatatgtca 13921 agcctctaaa gtcattttta gtcattgttg tgtttctatc ttcggtagta aatagagtat 13981 gtgatgacag atacgactca tctttacttt ttcatcctat gcccctggat aggatagaat 14041 tttacgagta tgtaaaacaa caaaacccaa acagtgctga agcatgagta ccgagattga 14101 agtggaaatc atcaaaggac tcattgctta gcgctcagaa cttttgaagt tgtattgagg 14161 aactattgtg gttgccacac cagaaaaact acactcaaca catgagcaat tgcccagtca 14221 acgccgagta 
gcagtattac tcatgggtta tggcgaagtc gaaagctacg aagatttcgc 14281 taactataat gaacaagctt taaacctgct gacagcaaaa tttgcacccg tgcctacttg 14341 ggtttatcct cccttggcaa agcttttggc attatttgat cgtcatgagt ggggacatca 14401 gcaccatgat tttatttccc cacacaacgc catttttgaa aaacaacgcg ctggcattga 14461 aaaagaccta caagccaagt ggggtgaggg tgttcaagtt tttaaagctt ttaacttttg 14521 cgcccccttc ctacccaaac aagtcctggc agaaatcaaa agccaaggtt ttgacaaaat 14581 actcatctac ccactgcttg ttgtagattc catcttcacc agtggtattg ccatagaaca 14641 agttaacaat gctttatccg agatggccca gggcgatgaa cactgggtta aaggactgcg 14701 ctacattccg tcgttctaca atgaaccagc ttacatcgac ttgatggcgc atctagtcga 14761 agacaaaatt acagctgact tagccagtgc ttacctacct tctcaaattg gcattgtgct 14821 gatgaaccac ggttgtcccc ataaagccaa aggatttacc tctggaatta ccgaaagtca 14881 aatactctac gacttagtcc gcgacaaatt aatctaccgc taccctctga tttctgtcgg 14941 atggcttaac catgacacac ccttgattga atggacgctg ccaaatgctg aacaagcagc 15001 gaaaaacctg attcaattag gtgccaaagc aattgtattt atgccgattg gttttgctac 15061 agaaaatcat gaaactctcc tagatgtaca ccacatcatt catgccttag agaaaaagca 15121 ttctgatgta aactacgtgc agatgccctg cgttaacgac catcctgagt tcttaaacat 15181 ggcagcgcag tgggcaaatg ctcatattaa cgagttgttg tcagaagaaa ctgtggcagt 15241 taatcccgag ttagctgtag cgcatcatca tcaccatcat cattaaggtg taaggtgggc 15301 aaaacattgc ccaacgccag atgcctacgg agggaaaacg ccacatgctt caagtcggca 15361 aagccgccca acgcagtggc tcccctcctg cagcactggc tcccctacta ctgctgaatt 15421 ctaattcacg cctctcaact tgcgagtgtt gtcaattaaa ctcggagtca gattaattta 15481 gctatacacc caatatttaa atgctatcac tagcaaaata ttgggtgtaa cagtgaattt 15541 attctgacaa aagttggggg caaaacttac gcttattcaa ctcctcttaa tttgcgtgta 15601 ttttcaacta aactttttgc aaattggtct aatctttggg aaagtttttc gtctacaagt 15661 ttaccttcag gactaaaagc tttccaagct tgcccaattg ctacctgttc tggaatagac 15721 catgcatgca cccatcggag aattactctc aaatcattga gtgcattgtt attagattga 15781 ccgcctaaaa tactgatgaa tcctgcgact ttacctgcta aatggtcaaa gctcatcaaa 15841 tcaagagcat ttttcaggac accactaacg ctaccgtgat attcaggtgt aaccataatt 
15901 aaaccatcag cacggctaaa agcgtcttgc agccgtttaa cgtctgggta atctggataa 15961 tcgtcccctc catcacaaaa tggtaggttc attttgcgca aatcaagaac ttctacttct 16021 gcacctagct cttgtgtttt ttgcgctgct aattctaaag ccaattggct gtaagactct 16081 gatcgtaagc taccagcaat gcctactatc cttaccataa ctgaatcctc atgttgtttt 16141 ttttacaata tttgcgacag tatgtcttta cattcaaaca ctgatattta tttggcaaag 16201 tgtaatcatt acgagtatga tttaattata gcgatttgtg acatagtatg ccaaatgcca 16261 attaacagag aatttcatgc tagggagtca gaaatatagc aatgtagaag tagtacttca 16321 aaagattttt gagcgtcata gtctaacctg tgaggaatcc taagcttcca attaggcgtc 16381 taacatagag aaatattcat aaacagacgc acttatggat attgcaagga aaaacttagg 16441 aatgcttttt ttttggaagc tacactagaa gattagacac ccctgatatc tactcttggt 16501 agaggaaaag gcataaaaat gagtgctcaa gctgatagag cagcaaacaa aggataatta 16561 ttatgaaaaa ttttgagaaa aagacatcga ggaaacagca ccctgcaaag aatgcagcct 16621 tgactggagc gctagcagcc ggtttaatca tgttgccagg aatgttaggg gcgactcctg 16681 ctttagcaca aaaaggagag cgcgagcctc tatcttatgg taaactaatc gaaaaaatag 16741 agaataaaga agtcaaaaga gtagagcttg acgaaaccga ccagctagca agggtttatc 16801 tcaacggaca aaagcaaggt gaacaaccga tacaggtgcg gcttttagaa caaaacagtg 16861 agttaattaa taagctaaaa gaaaaggacg ttgagtttgg ggaagctccc tctgccaata 16921 gtaaagcggc tctcggactt ttgatcaact tgatgtggat cttaccacta gttgctctga 16981 tgttattgtt ccttcgccgc tctactaatg cctctagcca ggcaatgaac tttggtaaaa 17041 ccagagctcg tttccagatg gaggcaaaga ctggaatcaa atttgacgat gttgcaggta 17101 ttgaagaagc caaggaagaa ctgcaagaag tcgtgacttt cctcaaacaa ccagaaaaat 17161 tcactgctgt gggcgcacgc attcccaaag gagtgctgtt agtgggacct ccgggaacag 17221 gtaaaacact gcttgcaaaa gcgatcgccg gtgaagcagg cgtaccattc ttcagcattt 17281 ccggctcaga gtttgtggaa atgttcgtgg gtgtgggtgc atcccgtgtg cgtgatttat 17341 tcaaaaaagc aaaagaaaat gctccctgtc tgatatttat cgatgaaatc gacgctgtag 17401 gacgccaaag aggtgcaggt atcggtggtg gaaatgatga acgggaacaa accctcaacc 17461 aattgctcac cgaaatggat ggttttgaag gtaacactgg tattattata atcgccgcta 17521 ccaaccgtcc agatgtccta gataccgcgt tgcttcgacc 
cggacggttt gaccgacaag 17581 tgattgtaga tgcaccagac cggaaaggtc gtctggaaat tttaaaagtc catgcccgta 17641 ataagaaagt tgacccatcc gtttccttgg aaattattgc ccgccgcacc cctggtttta 17701 caggagcaga cttagcaaac ttactcaatg aagcagcaat tctgacagca cgtagacgta 17761 aagagggtat cacaccgtta gaaattgacg atgctattga cagattgaca attgggttga 17821 ccctcaaccc actcatggat agcaagaaaa agcgcttgat tgcctatcat gaagttggac 17881 acgctctttt gtctacactt ctggaaaatg ctgacccttt aaataaggtg acaattattc 17941 cccgttctgg tggagtcggt ggtttttctc agcaaattct caacgaagaa atgattgaca 18001 gtgggcttta tactaaagct tggctgcacg ataacatcat catgacttta ggaggaaaag 18061 cagcagaaat agaggtattt ggcgaagctg aggtcacagg tggggctagc aacgacttaa 18121 aagttgtaac aaaccttgct cgtaagatgg tgactatgta tggtatgtct gagttagggt 18181 tagtagctct ggaaaatcag agtagtgatg tttttcttgg ccgagattgg atgaatcgct 18241 ctgaatattc agaagaaatg gcgactaaga ttgaccgaca agtgcgagaa atggcagttg 18301 tttgctatag aaaagctcgt caaattattc gtgaaaatag agctttgcta gatcgccttg 18361 tggatttgct tgtcgaacag gaaacaatag agggcgagca atttcgtaag atagtctctg 18421 aatatactca actgccaaag aagcaacaat tagttgtatc tggttagatg aagtcttgag 18481 tgatatcatg tccggataaa cacttataaa aacgatgaac ctcacccgcc tccggcaccc 18541 tctccttaat aaggagaggg ttggggtgag gtcaagcagg aattataagt aattaagcgg 18601 acttgatatg agaaatactc agcacccagt actcattcag tacttaatta acagttatca 18661 gttatcagta gcctgttccc tgatcactga taactgttta ctgatttaat atcctgaatc 18721 tcctacagct tttgactcct caatgatgac ttgtgactgt tgagaagtcc aaccctgcga 18781 tgaactccta gcaatttgct gctgggagaa catttctcga agaaccattg ctggatcaac 18841 atattgttta gcatacttga gtccccaatg aaggtgagaa cctgtggttc gtccggtcat 18901 tcctacccga ccaattctca taccagcgcc cagttgctga ccttccgcaa tttgaattcc 18961 acctgttcta tctatcagat aacggcgacc atctacggtt tcaacttgtc cttgcatatg 19021 acagtaggtg tgttcccatt cgccagattg aatggctata tgagttccgc aagcatcgcg 19081 atcgccaacc ttcatgaccg tacctatcca ccaattgcga atataactcc cttgtggcgc 19141 ggcgatatct aagccactgt gaaattctaa actagaaccg ccagttgcag aacgacggta 19201 gccaaatgcg gatgtgtagg 
actgaaaatg ttctactgga aaagacgctg ctaaccaatt 19261 attagctctt gctatctgct tggtgttaaa ttctacaggt agttttggct tgctggtctg 19321 aaaattttcg accggaattg aggctggtaa cggatttttg acagttgcta tctgcttgga 19381 actcaattct acagcttcaa ctgtttttaa tttggtcaat agagtgataa aactgacaac 19441 accaatacct agtatagata ggctgactac ggaaagtgct actgtttgtc gccacttact 19501 cattttttag tcctctagct cttactacat atgcgtaaat tgatctaaat ttcagccgga 19561 tcaaagattt aaattttgat taggaattga catttgcttt ggagtgcttt caccctgtga 19621 ccactagaaa aagcgtatat tttttactga ttcatcgcaa tacatttacc ttgaaaatgt 19681 tgctatgtca agtcattttc ccaaaatagt attttttcag gtatcaaaat ggccggattt 19741 atttgataat caaaaatata aaatattttc taatttcatt agaattttag atacctttga 19801 gcagtgatga tttgttgata gttcagtgct gtactagcta gtgatcatca gttttcaaag 19861 aaaagtattg ataatgacct ttcttattac gtatatactg agtgaaaaaa atgtaaactt 19921 gtgcacaata aaatgtaata gtggaagagt tgcaaataag taacaatgct gttaagttta 19981 tcgttattat tattggttta acttcccaaa cgcaacttcc acaagacttc caccaagcca 20041 tgctgactga aatattgcct ttcagctttg agttagacac aatcgcgatc gccggagcaa 20101 gtttgtggtc tttggcgctg tatcttggtt tttccccagt cagggaatgg gtgatactgc 20161 aattgaaccg ttggtttaac tttgctgagc gatcgctcta cacgagtcag tcagaatttg 20221 aaaaaacaag aaaagcgaga gaatcacaaa atgcgtttta cgcctcagtt ttcagtatcc 20281 tgccctttct ggtcattggt ggcttactga attggggagt agaaattagt ttaggttcga 20341 gctgggcaat tagcatgggg atacttgctt gtatgggttg tggcgtgtat gaactaggac 20401 gaagagatgg ggaatcttct gggaagtagg ttatgtgcca ttccttaatc ctcaaaagcc 20461 ttagttctaa atagccgcat tccatttgcc gtaaccagca aggaagttcc tgtatctgcc 20521 aagacagcaa cagctaaccc aacaaaccca aacgtcgcca gcaatagaaa caatgccttt 20581 gttaccaaag aaaagactac gttttgttga ataacagaca ccgtgcggcg gcttaaatcc 20641 actgcatagg caagtcgtcg caggtcactc cctaccagca ccacatctgc tgtctctatg 20701 gcaatatcaa ttccacccac agcaaagcta acatctgccg cagccagggc tggtgcatcg 20761 ttaatgccat ctccaaccat gcccacaacc ccgtcacgac gcagcttttg aatcgcttgc 20821 agcttatctt cgggcaataa ttctgcctga tattcgttga tgttaacctg ttgagcaatc 20881 
tgttttgcga cagcagcgcg atcgcccgtt agcatcacta atcttttcaa cccaacttgt 20941 ttgaggcatc gcactgcctc ggaagcctcc aaccgtaatc catctgctaa agcaacagca 21001 cccaacaatc ccgtttcatt tcctaccagc acaggggttt gaccgaaggt ttcaatctca 21061 accaataaag attcagcatc tgaagatgaa ggaatacccc gatcagcaaa caaacgccga 21121 ttaccgacaa agtaaagtga atcacctatt tttgcctcaa ttcctttacc aggtagcgcc 21181 gtaaacttgg atggagtctg taactctatt gatttcgatt tagcagcctg gacaattgct 21241 ttcgctaaag gatgttctga gtgttgttcc aaactagcag caatttgaaa caccatatcc 21301 gcactcacct tgccaaaatc gtaaacatga agcaccacgg gtagcccttg cgtaatcgta 21361 ccagtcttat caaaggcaaa agttgtaaga tgtccggctc tctccaatgc attgccccct 21421 ttgaacaaaa cgcctttgcg agttgctgca ccaatagcgc tgacaatcga aacaggggta 21481 gaaatcacta aagcgcaagg gcaggctatt accaacatca ccagcgcccg atagaaccaa 21541 acgttgaaag gttgagcaaa tgccaaaggt ggaatcaggg tgatagcgat cgctattaaa 21601 atcacaattg gggtgtaaac ttctgcaaac cgatctaccc actgctgaga agcagcgcgg 21661 cttccttgag cttgttccac caaattgata attttggcaa cgctggtatc gttcgacgta 21721 tgggtaacct taacttctaa aaaacctgat tgattcaacg tcccagcata gacagtatcg 21781 ccagcagctt tatcttctgg gattgattcc cccgtaattg gagattggtc tatcgcgctt 21841 gtaccagaaa cgactacacc atccaacgcc acgcgctgtc ccggtcgaat catcaaaatt 21901 tccccaactt gaacactttc aacggttaca gtaacttctt tatttccccg cttgacggta 21961 gcagtgggcg gagtcaaacc cattagggcg cggatagcat tgcgggtgcg accaaaggtg 22021 aaaacttgca gtgttgtgcc taaagagaac aaaaacaaaa caagcgttcc ttcaaaccag 22081 tctcctaaaa tcattgcccc aataactgaa atggtcatca gcagattcat atcggcgcgg 22141 cgcaagcgca actcaaacaa acctgcccgt gctatgggat agccagcaac aacgatgcca 22201 acaccataaa aaccccgtgc tatccaaatg ggtaaggcta aatgttgagc aagtaagccc 22261 aggactaacc ccattcccgc taaaattacg ctttgtcccc ggcggttgct aatccagaaa 22321 aaccagtttg ttggagcagg attttgagag ggcttgggga ctggatgagt gtggttgtgg 22381 tcatgattat gattctggca atcgtcgtca tgggtgtgac gatgggaact tgcctcaaaa 22441 ttcttcccaa ccgtatagcc cagaccccta atccggtcag tgattgcagt ctcactcaca 22501 acagaaggat cataggacag ttgcaatcgt tcatttgcaa agttaaccga 
cacatccaaa 22561 acacctgcca tttgctgcaa accaacctca attgtcttgg cgcaactgcc gcaatccatt 22621 ccgttaactt tcacttgtag gcttttagca gaagcgactt cgacctgttt gtggtcatgg 22681 ttgtggtcgc tgtcacagca atgggtgtga ttatacggtt ttactgcacc actcagttta 22741 gccgtataac ccaaagcagt tatgcgattt ataatttcag cttcgctcaa gagcttagtg 22801 tcataggata ctttaacttt tgtagttgca aaacttacag tcgcttccat tacaccgcgt 22861 aatgaaagca agctggcttc aatgctcttg gcacagctac cgcaatccat gccatcgact 22921 tgtaaaacct gagttttgag ggaaggggtt tgagtcatgg cagctttgaa gaaaagatta 22981 tgaagttaac tgccaatatt ctaatacaac agttgaataa ctattcaact gtagaactgt 23041 tcttatttta ggtagtattg aaattgataa ccattgccaa aacgggaacg ccatgagcaa 23101 gcacaaggga aagcaaaact tagaccagat ccaaaattct gatgcgccaa actgtaatac 23161 ccatctggtg catttagata atgtgcgctc aactcaagca caaatcctgg caactcccaa 23221 agcacagcag atagcagaaa tctttggggt gctggcagat ccgaaccgct tacgcctcct 23281 atcagctttg gctgaccaag agttgtgtgt ttgcgattta gccgcagtga caaaaatgac 23341 ggaatcagcc gtttgccatc aactgagatt attaaaagcg atgcgtttgg tcaactatcg 23401 ccgatcaggt cgcaacgtgt attacagctt ggttaacagt cacatcgtta acctgtatcg 23461 ctctgtagag gaacacttag atgaatcaag tgtttaggtg ctggtttatc ttgttatcaa 23521 gatatgcccc ctccgcatac ctgttaaagt gtaactattg ggggtagatt ttgacatcct 23581 tgccaaaagg ttcaagtggg cacaaaacta cgttcatgcg gatttttggt ttgcatattt 23641 ttgacgcgaa ctttgtgatt tttgtccaaa atccaatcaa caaacaggag tgaaaaaatg 23701 caaatcgaac agacagaggt gatgattccc acgcccgagg gacagatgcc cgccttcttg 23761 tgtacacctg ctgagcatgg ccacaagcca gctgttatcc ttctgatgga ggcatttggc 23821 ttaacatcgc acatccgaga cgttgcagcc cggattgcta acgaaggtta cgtggttctc 23881 gcaccagact tgtattaccg cgagttgccg aacaacaagt ttggatacga tgaggttgag 23941 caagccaggg ccatgatgtt ccgccttgat ttcggtaagc ccgtggagga ggacattcgg 24001 gcggcattaa cttatgtgaa gtcgcgacca gatgttaacc caggtaaagt tggcgtcact 24061 gggttctgct tgggcggcgg tttgaccttt ctcaccgcct gcaagttgtc ggacgaaatc 24121 gcggcggcag ctgccttcta cggtgtggtt ctagatgagt ggatcgacgc ggtgacaaat 24181 atcaccgtgc ccgtatactt tttcttcggt 
ggcgtcgatc cattcattcc taacgaacgc 24241 gttaaacaaa tcgagtcccg gttcgaggaa caaggcattg agtacacatt gaaagtttac 24301 agcaatgccg accacggttt tttctgcgac gagcgttctt cctacaacag ttcagcagct 24361 gaagactctt ggcgcgaact tacacggttc ttccataaac atctacagga agccgtctaa 24421 cttcctgctt gcctaattct tcctctacta aagttgctag cttttgtgct gcaccaggtt 24481 cacctcggat actccgcagc ttggctcgaa tttctgctaa cttttctgga tgagcaagaa 24541 aatctaaaac catttctcct acttcttggg attgaagctt tccgactaat tctggtacga 24601 ccatttcttg cgcccaaata ttgggccatg ctagcaaacc caatcgtctc agtactagcc 24661 aattaatcac cttagcaaaa ccagaaccaa ctattggcaa atttgctaat aaaccaggta 24721 aaccatccca agatctcatc gcatctagct gttgtgttgg tagcaaaaca atcattggca 24781 cagctaatgc acccagttca gcagtgtttg ctcctactgt cgttaggcag aggcaacact 24841 gggataataa gtcgtatgct ggcgtctgag tccacagttg gacacataaa cctgttcctg 24901 ttttcagttc gggacgctct gaatgagtgc cgagttttgg gacaattaaa gaagcaccac 24961 taaaaccaaa ggtttgaaca aaagagtttt tctctggatc ggcaaactta gctaaagctt 25021 gtaaatccaa agtcggggcg acaggaatga caaacttggt ttgtggtctt tttgcatgga 25081 catattccgc aattgccaaa gttaagggca ccccttgcat caactttgac ggctttgaac 25141 caggaagtac gccaatgagt tcagtatgag gtggggagga tgaggaagag actcctaatt 25201 ttgaattttg agattgcgtc gcttcgctcg caacgctacg cgattttgaa ttgctttctc 25261 cccctgctcc ctctgctccc tctgcttccc ctgcttccaa atcctgggct tcttgcatca 25321 aatctcccac aacgctaaac ttgtgggcaa attttttagg aacacgctca gctactgcag 25381 gtttcatgac tccaaagcgg tcaatcatac tgtgccaacg tgcatcccat tcagcgtaga 25441 caacagtgcg ataccttagc cttttgccga tgacaacaga aaaaaactga tccccgccca 25501 ggaaaacaac gacaccgcga tcgcgccaat cccaattctc aaatgtcttt ccccacagca 25561 agaactgcca aaaatgttct gctgcttgca ctcggtcaac ttcgggataa gatttcgcaa 25621 tatctgcttc tttgccactg gagtttggac aaggtgacaa aaccacagaa attcttattt 25681 gatttcggtc atctcctagt tgttggcgta attgcctaac gactggaggt acccaagttg 25741 ttacctctcc aggagcgttt gaaagaatca gaatatcagc agtcatgagt catgagtcat 25801 gagtcgttag tcattagtca atttttagca gttattagtt cccactgacc tacagatttc 25861 tcagatgttg 
taaaatcacc cgcttagcta caactcaagc accttttctg tgttcttgta 25921 cctgttccca atacggttca gttaaggatt tttagtgaga tttagtgatg tgtcaagact 25981 tgccatgctg tgtagagact tgccatgcta cgtctctaca ttgccgtgct gataagaatt 26041 ttagtcttat ttgaaccgta ttgtacctgt ttccacttgc ataaagcagg gctgtttcat 26101 ttcctcaaag gcttacattg tcaagtgttg aggagatgaa aatcatgggt gcaaaattga 26161 taaatgcgat cgccctttgg gcggctcccc aagtgagcat ctgagtgttt gaaatgatag 26221 ctttatttcc gccgacgtgt actaacccaa gttgctagtt atcaattggt gtgtgcaagt 26281 tactcctcga ttgatgcgat cgcctacaaa aaagtgctgg tgacgctacg cgccaccgcg 26341 ccctagactg aattaagcat tcagtctgcg ccatagcatt cccaatgaat gtgcagttga 26401 ggctgccctg aaaattgcac gcaacgcgat ggggcggaaa aacgttatct cattctcaaa 26461 tggcttccat gatcccttgt ttttttagga cttacgcact ttacaaatag acagttccct 26521 atcttggata tgtccacaat tagggcaggt gtgcgtccta gtactcaaag attttctaac 26581 aacttttccg caatttgcaa atcaagtaag gctttactga cacaatcaca tttgtgttaa 26641 tgcgtaagtc ctatttttgt ttgtttgcta ctcgctgccc taattctgtt actaattctt 26701 gacattaact tcatcaaact ctcgtatttt tatggttgtg atgaaatcca aacgatttga 26761 acaacgaatt cttgagcata tcagaccgat gcagaagccg gataactggc gaaacttcgt 26821 ctatcttttg agagattact gtctgatagc tttgtctatt gctctctaca aaatttatcc 26881 atcgattggc acatacttac taacagtttt gttaattggg tcaagaatgc gggctttcga 26941 taatctcacc cacgagagtt cacacaaaat gttgtttacc aaccctcgac ttaactactg 27001 gattgcaaca ttgttttgtg cgtttccagt tggcacatct acctctactt actggcaatc 27061 acacatggat catcataagt ggttgggaaa cccagaacga gaccctgacc ttatccgtta 27121 tcagtcacta aacgtagatc gtttccctgt accataccgt gaaatggtct ttcacttgct 27181 caaagttttt tgtctgaccc acgttccaaa atatttatat ggcactttgc agtcgttcgt 27241 tttatcaagt gacactcctc gtagcgaaag aatcgcacgc acactatttt ggataacagt 27301 cttcacagtg ttaacagcat tcaatctgtg gcatgatttt ttactattct gggtgatacc 27361 attcctgact tcctttcaga ttttgcgcta tctttcagaa atttctgaac acgggggact 27421 ctacagtgca gagcacacaa ttgaattagc tagaaataat ttttgtcatc cagtcttacg 27481 gtttatcctg tatccacatg gcgattttta tcaccttgta catcatttgt ttccagctat 
27541 tccacattac aatcttggtc ctgctcatca aattctctta gaggatagtg aatatcaaca 27601 agcacaccac tgctacggat acttttactc aagtgaccct aatcaaaaaa gcactttggg 27661 tgaaatgatt cttaaatagg acttaggcaa ctggcacgtt ggggatttcc aagaaataaa 27721 ttatccaatc ttgtaaggtg ggcatcttgt ctgccttaat tcagatgcag acgagacgtc 27781 tgtctccaca agaaataatt ggatattttt ttatttggaa gtcccttgac tctgactttt 27841 cacggcatgt ggattcatac cattttgagg gtttctcaag cgcttatttt ttcgtaactc 27901 attctcatta atctatataa attgagaata aggctgtgga aaaagccacg ttgtttcgtg 27961 gtttgaggac ttcccgtgaa aagtcagccc ttgacttagg ttcgtagtaa gcgctcttgc 28021 gcttaaaaag gcgtacttgg tgcctactac aaacgaacaa tgtgtgacac ttgcctaagt 28081 cgtgttaaag aaagcaatag ttgaaaaacc acagcaagtc tatgctgctt gtccaaacca 28141 ttacgttgga acaaaggtag ttacctgatg aaaatcaccg ttgcaaaatc ggtattgaac 28201 cgttatcctg agagcatcat cgcttacctc cttgctgaag ttaaagtcga agagaaacat 28261 caatacgttg aaactttaaa agcagagtta tgggaacgct taactagatt agggattacc 28321 caaacaaacc tcacggaaca cccaaacatc aacggatggc gacagattta tcatgacgag 28381 ttcggcgtca agcccagtaa gtttcgttca tctgtagagg cgctggtgcg tcgagtcatt 28441 ggtggacaag gactatggca agtatcaagt gttgttgact tgtacaattg tgtatctgtt 28501 ctcactctac tttctatagg agcgtatgat ttgagaaaaa ttaggggaga catacatcta 28561 cgatatgggc acaatggtga ggtgtttcta cctcttagct cacaagaagt tatccctgtt 28621 agtgagaaac agattgtata cgcagatgaa gaaaaagttt tatgttatct atggaaccat 28681 agagattcgc gattgtctgc aattgatgct gacacgcgac acgccctgtt ctttatagat 28741 acagcctttc acccacaaac ctgctctatg caagaagcat tacaaacttt atcacggcat 28801 ctaagtcaaa ttgggggtgt tgaacttggt tcgggactac ttaatgtcaa ctatcctagt 28861 gttgaagttt aaggcaaata ctcactcttt aagctctacc tgcttgtcta taaaattgaa 28921 ctatgattag tgcatttatc cacggaataa tcttagcctt tggtctgata ctaccacttg 28981 gtccgcaaaa cgtgtttgtt tttacgcaag gtgcaaccca gcctcgcctg ttttacgcac 29041 tgccaatagc tttagttgct tcactctcag atacgttgct tattctgtta gctgtgcttg 29101 gtgtatctgt tgtcgtattg tctttgcctt gggttaaaac ggtgttagta gtggctggag 29161 tgttatttct ttgctatatt gggtggatca cctggaaaag 
tgatgaagag acaaatggga 29221 aatcgaacaa tgcgtcaaac tggtctctga agcaaaaaat tatgtttact ttgtccgtat 29281 ctctacttaa tcctcacgca atattagata ctatcggcgt tattggtacc agttcactat 29341 catatacagg tttggacaag gtcgttttta ctctaacgtg tatttcagta tcctggctat 29401 ggtttttcac gttaacagtt gtagggcgac tggtgggaac attcaaacac acccgcaaaa 29461 tgtttaaccg cgtgtctgcc gtcattatgt ggctgagtgc gctttacttg gtgtacaact 29521 ttactaactg aatccaccta gggcgtgttt tcaaactcgt tgtatcctaa aaaaacagtg 29581 attccagggt gtattaatga accgaggaga cttaagtaat gcacatgagt tctttgcttg 29641 atatttcaag agggtctgcc tggaaacatg taaaatacaa caagttttca aattccgtta 29701 agatacttgc aatcatttag gtttttcata catatgtact aacttagatt taactgcttt 29761 aaaactacag tggcttctgt tcaactatcg gctttttgac taaagtatct catgacagat 29821 ttgtaaaatt tacgtaactt atacctgagt aaatacctca cttattagtt aacaattttt 29881 aaaaaatatt tcgcttttta tatcgttttg tgtcagatga gcgataaaat gcccactggg 29941 caagcatttt ctgataaaaa tctcatattg cccaacaaac aaagtcatca ttaggcaaat 30001 tttgcctgac ataactttac aaaacgagaa cacgcatcaa aggagccgag gaagcatgtt 30061 cactcacgtc aagtccacca ttagacatat tgcgcctgat aatctgcggg ggcgtcattt 30121 aattaaggtg gtctatgtcg tgttagagtc ccagtaccag agtgcattgt cacaagcggt 30181 tcgggaaatt aaccaaaatc atcccaatct ggcgattgaa atcagtggtt acttgattga 30241 agaactccga gaccaagaaa actacgagga gttcaaaaag gaaatggcaa gtgccaatat 30301 ctttattgcc tcgctgattt ttatagaaga cttagcacaa aagttagtcg cagcagtagc 30361 accatatcgc gatcgcctgg acgttgccgt tgttttcccc tcaatgcccg aagttatgcg 30421 cctaaacaaa atgggcagct tctctttggc acaattgggg caatctaaaa gtgtcatagc 30481 acagttcatg cgcaagcgca aggaaaaatc cggcgctggc ttccaagatg gaatgctcaa 30541 gttgctgcgg acactgccgc aagtcctcaa gtatctgcca gtagataaag cacaagacgc 30601 tcgcaatttc atgcttagct ttcagtattg gctggggggt tctccagaca acttagaaaa 30661 cttcttgctg atgctagcgg ataagtacgt cttgaaaaat aatgtagaaa cgaaaaattt 30721 cgcatccgta caatacaaag caccagttgt ctatcctgat atggggatat ggcatcctct 30781 agcgccaaca atgtttgaag atgtcaagga atacctcaac tggcacaaca gccgtaagga 30841 tatcccctat gatttaaaag 
acccgctagc accttgtgtt gggttggtgt tgcagcggac 30901 acaccttgtg actggtgatg atgctcatta tgtcgcaata gtgcaggaac tcgaagctat 30961 gggcgcacga gttcttccag tgtttgcagg aggtttagac ttctccaaac ctgtagaagc 31021 gtacttatac gaaccgacta ccaaaacacc cctagtggat gccgttgtat ccttgacggg 31081 ttttgcccta gtaggtggac cagcacgcca agaccatcct aaggcaattg atgccctgaa 31141 gcggttgaac cgtccttaca tggtagcgct acctctggtc ttccaaacaa cagaagaatg 31201 gcaagatagc gatttggggt tacatccaat tcaggtggcg ttgcaaattg cgattcctga 31261 attagatggc gcaattgagc cgataatatt gtcaggacgg gatggaacaa cagggaaggc 31321 gatcgcccta caagaccgcg tcgaagcagt cgcacaacgc gccctaaaat gggctaacct 31381 gcgccgtaag ccgaagttaa acaagaaagt tgccatcacc gttttcagct tcccaccaga 31441 taaaggtaac gtgggaaccg ccgcatactt ggatgtgttc ggctcaattt acgaggtgat 31501 gaaagccctc aagaacaacg ggtatgatgt gcaagacttg cctgagtccg ccaaggagtt 31561 gatgcaagaa gtcatccacg acgcgcaggc gcagtacaac agccccgaac tcaacattgc 31621 ttatcggatg tcggttcctg agtatgaagc attcacacct tactcacaac ggctggagga 31681 aaactgggga ccacctcccg gacaactcaa cagcgatgga caaaacttgc tcgtttatgg 31741 taagcaattt ggtaacgtct tcattggtgt tcagcccacc tttggttacg aaggtgaccc 31801 catgcggctg ttgttctcgc gttctgcaag tccacaccac ggttttgctg cttactacac 31861 ttatctggaa caaatttggc aagctgacgc tgtgctgcat tttggaactc acggttcctt 31921 ggaatttatg ccaggtaaac agatggggat gtctggagaa tgttatccag ataacttgat 31981 tggcgcaatt cccaatcttt actactacgc tgcaaataac cccagtgaag cgacaattgc 32041 caagcgtcgc agctatgcgg aaacaatttc ttacctgact cctccagcag aaaatgctgg 32101 attgtacaaa ggtttgaagg aactcagtga gttgattgct tcttaccaaa ccttgaaaga 32161 tactggacgc ggtattccca tcgtcaacac cattatggat aaatgccgga tggtgaatct 32221 ggataaggac attgccttgc cagagcaaga cgccaaagat ataaccgccg aagaacgcga 32281 taatattgtt ggttcggttt accgtaagtt gatggagatt gaatcgcgat tgttgccttg 32341 tggattgcat gtgattggga aaccaccttc ggcggaggag gcgatcgcaa ctctcgtcaa 32401 cattgccagc ttagaccgtt ctgaggaaga aattcaaagt ctaccccgca ttatcgccaa 32461 cagcataggg cgtaacattg atgaagtcta ccaaaacagc gacaaaggca ttttagatga 32521 
tgtccagctg ctgcaagaca tcacaatggc aacgcgcgcg gcagtttccg cccttgttaa 32581 agagcaaacc gacgcagaag gacgagtttc cctcgtttcc aagctcaact tcttcaacat 32641 gggcaaaaaa gaaccttggg tagaagcact gcataaagca ggttacacca aggtagatgt 32701 tacagcactc aaaccagtgt ttgagtatct agaattctgc ttgcagcaag tctgcgctga 32761 taatgaattg ggcgcattac tcaaaggctt agcaggcgag tacatcctcc ctggtcctgg 32821 tggcgatccc attcgtaacc cggatgtctt gcccacgggt aaaaatatcc acgccctcga 32881 cccccaatct atccccacaa cggctgctgt acaatcggcg aaaatcgttg tagatcggct 32941 cttggcgcgt cagatggcag aaaacggcgg tcagtatccc gaaaccattg cttgtgtgct 33001 atggggaacc gataacatca aaacctacgg ggaatcactg gcacaaatca tgtggatggt 33061 tggagtgcgt ccggttcccg atgcattggg acgagtcaac aagttagaat tgataccttt 33121 agaagagttg gaacgccccc gcattgatgt cgttgtcaac tgttctggtg ttttccgcga 33181 cttgttcatc aaccaaatga atctgctgga tcaagcggtg aaaatggcgg cggaagcaga 33241 tgaaccatca tcaatgaact acgtccgcaa acatgccctt gaacaagcag aggaaatggg 33301 gatcaacctg cgccaagcag caactcgcat cttctccaat gcttctggtt cctactcgtc 33361 caatatcaac ttggcggtag aaaacagcac ttgggaaagc gaagccgagt tgcaagaaat 33421 gtacctcaac cgcaaatcct tcgctttcag tgccgataac cctggtacga tggcagaatc 33481 aagaaagatt tttgagaaaa ccctgaaaac tgctgaggtc actttccaaa atctcgattc 33541 atccgaaatt agtttgacgg atgtttccca ttacttcgat tctgatccta ccaaagtagt 33601 cgccagcttg cggggtgatg gcaaaacacc agcatcctac attgcagaca ccaccacagc 33661 taatgctcaa gtccgtagct tatcagaaac cgtgcgttta gatgcccgta ccaaattgtt 33721 aaatcccaaa tggtacgaag gaatgctgtc tcacggttat gaaggtgtgc gcgaactttc 33781 caagcgcttg gtaaatacga tgggttggag tgcaaccgct ggtgctgtgg ataactgggt 33841 ttatgaggac accaatgaaa ccttcatcaa agatgaagca atgcggaacc gcttgctgaa 33901 ccttaatcct cattctttcc gcaaagttgt ttccaccttg ttggaagtta atggtcgcgg 33961 ctattgggag acaagcgaga gcaatttgga attgctgcgc gagttgtatc aggaggttga 34021 agatcggatt gagggaattg agtaggttat aaccgtgtaa agcgaaatgt agagacgttg 34081 catgcaacgt ctctacaata gaatcggtaa aaatagatat ataatcctca atcttaaatc 34141 cccctatgga tgccctaaca gttaatctca atcccgtgat tgaactgaca 
gatgagcaat 34201 ttttccagct atgtcaggca aaccgcgatt tgaggtttga acgcactgct actggagaat 34261 taattatcat gccgcccaca gggggagaaa cgagcaacag caacgcagga ttaactgctc 34321 aactttggct atggaatgag caagataaac taggtaaagt ttttgattcc tctggtggtt 34381 ttaaactccc taacggtgcg gatcgttccc ctgacgctgc ttgggtgaaa ctggaacgct 34441 ggaatacact gactcaagaa caacaaacca gatttctgcc actttgtcct gattttgtag 34501 ttgaattact ttcgccaagt gatagtttga aagtgactca acagaagatg aaagaatacc 34561 aagagaatgg tgctcgtttg ggttggttaa ttaatcgtaa gtctcggcaa gtagagattt 34621 atcgaattgg tcaagaggtt gaagttttgg aatctcctgt taatttgtca ggggaagatg 34681 tgctacctgg gtttgtgtta aatcttgagg cgatttggta aattggggcg tttgtaaagc 34741 aaatggaaac agatgtttat gctcaacaac tagaattatt tcaattattt caaccaagaa 34801 atactagaag ttgtcattat aaatttacat ttgtagactt attcgctggt ataggtggat 34861 ttagaattcc tctggaagag ttaggaggac aatgtttagg ctactcagaa attgacaaag 34921 aagctattaa agtttatcaa aataattttc tgaattccaa ctcagacgaa gcctatttag 34981 gagatattac caagctcaat atactccctt ttcaaataga tatattggtt ggaggggttc 35041 cctgtcaacc ttggtctatt gctggaaaat tacaaggttt agatgaccca agaggtaagt 35101 tatggattga tgtttttaga gtagtaaagg ctaataaacc aaaagccttt atttttgaaa 35161 atgtcaaagg tttaacggaa ccgagaaata gatcaagcct gcaatatata attaataatc 35221 tgacagcata tggctatgta tgtagttgga aggttctgaa ttcctatgat tttgggctgc 35281 cacaagatag agatagaata tttatcgtcg ggattagaaa tgatatagaa aattgctggg 35341 gtttaacctt tcctaaaccg ttagacaaac agcctaagct gtatgatgtg attcctgggt 35401 tacaacaggc taattttctt aagaaaaagt ttccaccaga agttttattt gatggcaaag 35461 tccccgcttc aagaggccga tttcaaaaaa tagatgaatt aaatgatttc ttccttttct 35521 ctgacatccg agatgggcac actactattc attcttggga cttaattgat acaactttaa 35581 gggaaaagtt aatttgtcaa actatcctga aaaatagaag aaagaagatg tacggactga 35641 aggatggtaa tccgttaagc agggaagtat tagaaacgtt aattcccaat ttacaacaac 35701 aagaagttga tagtttggtt tttaaagaaa ttctgcgttt tgtagaagga caaggatacg 35761 agtttgttaa ttctaaaata tcttcgggaa ttaatggcat atctaaaatt ttcttgcccc 35821 atgctgatgc tattgcaact ttaactgcaa 
ctggaactag agattttgtg gcaacaatat 35881 ccatacaatg tcaaaagcca gaagcatata agcaaacatt tattaaagaa atttatacaa 35941 agaaaaaatt caagcatcta acagcacaag attacgccag attgcaaggt tttcccgaag 36001 ggtttcagat tgctaataat gagtcaactg caaagcatca atttggtaat gctgtttcag 36061 ttcctgttgt gtaccatttg gcgaaagctt tactgaaaat aattttatag taataaattt 36121 acacatggca gctattagtt ctaaaactgg tgcagaaatt aaagggtatc tagaggggtt 36181 tattcaggga cttgtagatg aatataaagg acgtgagatt ataaaacccg ataatccagc 36241 agagtaccta tctcgatttt cacctaatgg agaattgaag ccgtttcaag cagcacttat 36301 tccaccagaa ttaatacgta tcaaccagtt tgaacgggga ttgagtacga gattaggaaa 36361 ttctcttgaa gaatgcgccc gcttaattgc ccttgaacat catcaacaag cacgccgagg 36421 ttatgatatc agggcagaag taagtctggc agcatttgcc gaagccgaac gccaaaaaga 36481 aaattacgaa actgtggctc atagaggaca ggctaaacct tctttagaac agatgataac 36541 agcagtactt aacgctcgac gctcagacga tttagaaaca aagagtgttc ggactgacct 36601 ttatattctt gcgaaagatg gaacagaatt cttctttgaa ataaaagctc caaagccaaa 36661 taaggggcaa tgcttagagg tgacacagcg tcttcttaga tttcacctac tctgtggtat 36721 aaaccgtcct caagtcaaag cctactatgc tatgccctat aatccttacg gtttcacaaa 36781 atctgattat aagtggtcat acgggttaaa ttatatgcct tttgaggaag ctgtcgttat 36841 tggaaatgaa ttctggaata ttgtaggcgg agcaactgct tacgaggaat tgttggacat 36901 ttatctagaa ataggacggg aaaaaagtaa atacatgctt gatgctttag cttttggatt 36961 ttaagtgcgg gcatcgccct catcttctcc agtgcgatgt ttgacgacaa accagtgtgc 37021 gtctacatca gtcagagtag cattcataga agcaagcaac cagccaaacc aatgcaaaaa 37081 aactttaaac aactgattgt gtctgatcca aaagtgatga tgggaaaatc tgatttgcac 37141 aaacgacatg gtagaggcac agcattgcta tgcccccaca ccatttcacg aaaagtctac 37201 aaataaagcc gggggaatct gattaggctt cccctcaggg caaacgcacg tgatgtctga 37261 aatgcttact gggtaaggat tttgctcaaa aataggttgg gagagaaaac gcccaaaccc 37321 gtgcaatcaa aggattctta cactttagat gcgtttgccc tgggcttccc ctagctttat 37381 tcataatcct gatgtctcca aacttattaa gcttgcagtg ccacgctgcc ttctgttgac 37441 tctgctaaga tccaggcgga gtagacatcc tcaagaaatc ttttctgatt atcaactccc 37501 tgaaggttgc 
ttgctcgtac ttttaaggtg gttaaataaa ctccgtgcag tatttgatgt 37561 ccataaccaa taacttgacc tatctgccca gtgatttcat ggcgtgcgta atctccaatg 37621 ttcagcatat tgataactcc ctgacgtaaa aggtttagtg acctgttttt tactgtttgt 37681 ggacgttagg tgccatttgg tgcagcatga tgcagaatcg ctctttttat taatgctgcg 37741 ttttcagtgc taagtcctga gtattttcca tcttgcactt ggtaacacca gtcgccaccg 37801 ttacgaaaag ccggtgcttt gtctacaacg cacagattct ttgcaagcgg aaagttccgc 37861 aacactatca aagtcgtaga aattaagctg caacttgagc tttgctctct attgagggca 37921 tggtttgagc tagtagatgt gtgtacagct gagtcaagcg ggatgcgaca ctatgccaac 37981 taaaggcaat ctctactcgt tgtctaccag cttcacctaa ttgatctcgc caatctgggt 38041 tgagcaaaat gcggtcaata gcagcagcaa aagcgacttc atctttagag ggaacgagta 38101 acccagtcac ttctggtacg actgtaaatt gcagcccccc aacatcacta gccacgactg 38161 gagtctgact agccattgct tcaattgcaa ctaaaccaaa aggttcgtag tgactaggta 38221 caacacaaac atctgcggct gcatagtaga aggggagaac agcttcattt aggcgacctg 38281 gaaaggttgt acagttttcc aatccgagtt cagcgacaat gctggcgatg cgatcgcgct 38341 cgatcccatc gctatgacct ggacgatagc cacctccaat cacaagttgt aggttagcat 38401 gaccccgcaa actagacttt gcaacggctc gtactaatgt ttcaattccc ttgcgtcgat 38461 caaagcgacc aacatagaga atcatcttaa catcaggtgc aattcccaac tcctgccgtg 38521 ctgcagacct ttgaatcaac ccaaacctat ccacatcagt cccacaagga atgacttcga 38581 ttttcccttt agtggaaacg agtcttcgca tatgttcctg ttcttgagga ctagttgcaa 38641 caactcgatc cacgctttct agacaggctt tttcaacggc tattcgcttg ctagcaattg 38701 ctggaatatc agaaatagcg ctgtacttga ctgttcctaa agagtggtaa gtatgcaact 38761 gaatcaacgg ttgttgtttt ttcagttcca tacccaccca cgaggacaac caatagttag 38821 tatgaatcaa agggtattga aatccttgct tttgctggaa tgcctgaaac tctgcaatga 38881 attctggcag atgctcgaat aagtcatctc gccctataaa ttcagcaggt ccagacttca 38941 atcggataat gcgacataat ggtccatgct gcacaatcgt agcctgctca ggactacttt 39001 ggcgggtaaa catatcaacc tgccaacctt gctgagcaag tgcataaccg acttgacgca 39061 cataaacgtt ttgtccccca gcctcttcct gtccgatttc tgcagcagga tctccatcaa 39121 cagaaatcag agctatgcga tgttttttgg tcggtaacat agttttactt caccatatgg 
39181 caatgactcc tgcctcacca gaccggaaga aaaagttata tagtcctttc tcatggtcaa 39241 gggcaggtac aacgaagcgc acagatacca gccaaggaca ctgacgagaa caatgcatta 39301 cttgttaaga atgcgactcg ttcagtctcc tctcaatgcc gacgaagtta gctgacgggc 39361 tagaactgag agatgtccct ctcacagaac agaagaaaga aaaaaaaaga caggagaaaa 39421 aaccggagta attttgtctt ttttcattct tcatttttca ttcttcgctc ttctactgag 39481 attagcccca aacttggttc ctccgctcca ggcttacccc ggtgtaacaa aatgttttgg 39541 gtaaacccta gattaggcag actaatgtag atttgttaag aattttacct gtaagcgtga 39601 cttaagtcaa cctcttttta gagtgattta tgctatctac aagcaataat gcgctctaat 39661 gatgttccat tgcaagttat acacacacat aacaggatta tctggcagta tcgccatgtg 39721 aaaagattag gtagatattg cgtttttagc gtaatattgc gttcttggtg taaacgttat 39781 gttgattata tcttttacag agaataaaaa gaaatattaa acc // LOCUS NODE_687_length_37223_cov_4.858104 37223 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 37223) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 37223) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..37223 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(184..1305) /locus_tag="DP116_05455" CDS complement(184..1305) /locus_tag="DP116_05455" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05455" /translation="MSTQFFLEETDIDTLINLLLRSQQSRTREALCVSIGIDPKRLSF IRDSSESDFFLLLIRYLNEIGEQEALCKLCCKELLPVFHHGKYAPILSEIAAKLNCNQ KLSQNFTNNQQPTVLSSSPAPSVSVNPFIQLAKNKFIRYSVIVIIGLAGFVSFIQSSK LSSNADTTEQPVSLSPTAKSNKSVNQQIIDIYQEVLGRKPTDNELNSNVNLLENRQAE LSNIRAWITKTSSLLQDGNVISLECLGNQYTGLRFLDSRIKNNNVSLGSSNIDASTKW KVHIINNRVVALENQGVINGSKWLDGVTADDSVKLAPNTKGGYTGTKWTINILGDGVV ALDNQGKSKWLDGVTADGSVRLAPNTEGNTGIRWRISKQ" gene complement(1323..2234) /locus_tag="DP116_05460" CDS complement(1323..2234) /locus_tag="DP116_05460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140180.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05460" /translation="MQQKILILAANPKSTTPLRLNEEVREIDAGLQRAKHRDQFVLEQ KWAVRPRDIQRAMLDINPSIVHFSGHGTGDEGLVFEDETGLAKLVDGEALAGLFDLFA DQVECVVLNGCYSQVQADAISQHINYVIGMSKAIGDRAAIEFAVGFYDALGAGKPVEF AYKFGCAAIRLAGIPEQLTPTLKKKPLNGSKDQPPVPNERVSELDQELNDSDRELLTE LLIRSGRAEYSARKPLCIKIGIEPNQLGFLRQSTDADFALELISYLHSIGDKQALCKI SKELEVVFKRSKYSADLENVKSKLNCN" gene complement(2309..2482) /locus_tag="DP116_05465" CDS complement(2309..2482) /locus_tag="DP116_05465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307411.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05465" /translation="MHINISDDLKKQFHATCVMQGKKMNQVVIELIQQWLRANEISQT DSEIAKKLPPKSC" gene complement(2869..5982) /locus_tag="DP116_05470" CDS complement(2869..5982) /locus_tag="DP116_05470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179014.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acriflavine resistance protein B" /protein_id="PRJNA477356:DP116_05470" /translation="MNLSELFIRRPVMTTLVMIGIVIFGLMSYALLPISALPHVEYPF ISVSASLPGATPETMASSVAAPLERQFSSIAGLNSFNSTSSTGSTNISLQFDFSRSVN DVAKDVQAAISAAAGQLPPGMPKPPTYRKVNPSVAPILYLYMYSETLPISTVDEYAEI TVGQPISMIDGVAQVQVYGQQQYAVRVQVDPQKLTTRGIGLNQVRNAIAQSNVNLPTG SLSGHDKTYTIQANGQLTNAAAYRSLIISYKNGAPVRLQDVGQVIDSVQNDKVLNLYN GIHSIVLAVQPQPDGNTVEIVDTIKQLLPTLREQVPKSLEMGIMYDRSESIRASVDDV KFTLFLSVCLVVVVIFLFLRELSATLIPSLALPVALIGTFAVMYLSGYSLDNISLMAL TLSVGFVVDDAVVVLENIVRHREMGESPLEAAFNGSREISFTIVSMTLSLVAVFIPLI FMGGLIGRLFHEFAVTIAVAILVSGFVSLSLTPMLCSRFIRPPNHQHQSRLYRVSERV FDLLLRGYEWSLKPFFKYRLITLIGSVILLALTVYLFVLVPKGFIPTEDTGQIMGNTR AAQDISFDAMLSHQQKVVDIIRRDPNVQAVDSIVGASGPNAAVNSGRITILLKPRSQR RLNSDQIIQEMRPKLTRIPGIQVFLRSPPAIPIGGQQTNSSYQFTLQSLDLQALRQYV PILKDKIKALPGFRDVDSDLELSTPQLQVEIDHKKAATLGITAEQVEQTLGAAYGSSQ ISKIYTPDDQFYVILELEPQYQHDPNSLSLLYIQSSNGQQVPLTAIARITQGVSPLTV KHVGQLPSATISFDVTSGMSLSQATDTIKQLASQILPQSITTNFQGSAQVFQQSFNDL GWLLLVSIVVIYLILGILYEDFIHPITILSGLPSAGCGALLTLLIFDVELNLYSFIGI ILLVGIVKKNGIMLVDFAIEAQRREGKNSFDAIYEACLVRFRPIMMTTMAALMGTIPI ALGTGSGSEARRPLGIAIVGGLVFSQILTLYLTPVFYTYMEEWRKKLGQPKFGRIFFW KKAKHLG" gene complement(6227..6556) /locus_tag="DP116_05475" CDS complement(6227..6556) /locus_tag="DP116_05475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008311714.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05475" /translation="MSLILEQESPPLSQDTTGAIRVGNTRVLLELVIHAFQDGASPES IVQRYSSLSLSDVYLTIGYYLRHRDAVEAYLDQREQLAESVRQRLSSVQPDLSLVHSR LLAQQQS" gene complement(6584..6847) /locus_tag="DP116_05480" CDS complement(6584..6847) /locus_tag="DP116_05480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012266339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05480" /translation="MAQETLKQILNQLETLEIQELQQLHQTIQRYLAEKETTNKQAAF HQALIDAGLVKQIKHPSYDLISERRLIQVEGKPVSETIIAESR" gene complement(6902..7318) /locus_tag="DP116_05485" CDS complement(6902..7318) /locus_tag="DP116_05485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188217.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PIN domain-containing protein" /protein_id="PRJNA477356:DP116_05485" /translation="MKVLIDTNIVLDYLLEREPFLQHAEALFNAIDSGKVVGYVTATT LTDIFYIARRQTGSIEQAQQAILTTLAVMVICSVDRAILEAAISSGLADFEDAVQIYC AVFQSLDAIVTRDTKGFSSSVIPVMSVRQLLESLES" gene complement(7318..7575) /locus_tag="DP116_05490" CDS complement(7318..7575) /locus_tag="DP116_05490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016925310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05490" /translation="MSSLQELKKQARQLSVNDRLELVHAIIESLQDVPNQQSERLLEG QTPERSRIIKQMKGLLKTDKPAPTDEQVAAMLEERRMEKYY" gene complement(7678..10878) /locus_tag="DP116_05495" /pseudo CDS complement(7678..10878) /locus_tag="DP116_05495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179014.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="acriflavine resistance protein B" assembly_gap 8740..8749 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(10875..12500) /locus_tag="DP116_05500" CDS complement(10875..12500) /locus_tag="DP116_05500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408450.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="efflux RND transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_05500" /translation="MSIYKIDIVNAEISPSTINLNQLEALVLDNIPKESLAQKNSAQN HLEFKQNPPSKKRTGLVLLGLVLLATLGFLGYRTFFAKPNKTETSQRSERSGRKGRSM ITPVTVAKVGQKTVPVQLQAIGNVQAQSTVSVTPQIGGRITGVFFKKGQEVKKGQLLF TLDDQTQRAAIQQAQGTVAKDLALVEQARATLAKDQGLVEQARATLAKDQGLVRQAEA TLAKDQAQAQYAQAQSNRYTNLYKQGAVSQDQAQQYSTSSQVNVATLQSDREAIANAQ EVVKGDQVAIQNAQQVVKSDQVAIKNAQEVVKGDQAAIQNAQAVVASDQGALKNAQVQ LSYAKIYAPISGQAGDILVTQGNVVAANSTSPLLTISQIRPIQVSFSVPETQLPEIQK YASNNKLAVDVTIPNTNRQIRGVLTFINNTVDNSTGTIKLIGQFDNAQGQLWPGQYVN TTLTLRTQPNATVVPSQAVQNGPNGQFVFVVKPDNTVENVPVTVGSMINGLNVIEKGL QPGQTVVTDGQANLISGSKVRVKPASKSAGGAS" gene 12866..14068 /locus_tag="DP116_05505" CDS 12866..14068 /locus_tag="DP116_05505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179017.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_05505" /translation="MVRPSYQSFRLLLYLEWLLLATAVLMEILLPFELSWHLLERIFI IAAFGLMGLRLPTKKLAEKLLYTSVEFGLIMLAVTPQGLTIRSLFLLCLVLVMRSCLL FERKGQLIVLSLTLVSYVMLLVSRPIVPAKLKVAVWDWRLSSLLLYSLTLVFALLLIN ALLAEWQSRKQLEIAHQKLEMTHEQLRQYALLIEDQATLQERNRIAREIHDGLGHTLA AQTIQMNNALLFWQSNNDKALTFLKQAKQLGAEALLEIRRSVSVLRSNPLQGQSLESV IEKLLKDFQHNTGIELSSKINLPLSLPTEVNTTVYRIVQESLTNIYKHAQATAVTVQL QHQAGILDLSIEDNGKGFNPTQNTTGFGLQGMRERALALGGQFHLHSQPAKGCCVCVS LPLSNLLL" gene 14149..14787 /locus_tag="DP116_05510" CDS 14149..14787 /locus_tag="DP116_05510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408456.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_05510" /translation="MIRILLVDDQLLIRQGLKSLLESNCDMQVVGEAENGQRALEQIS TLQPDIVLMDIRMPVMDGVAATGAIAQQYPDTKVLVLTTFDDDGYVSQAMRVGAKGYL LKDTEPDELALAIRAVYKGHTQLGPGLFEKALMPVPESAPSIAQPPELAQLTRRELDV LRLMASGANNREIAQSLFLSENTVKNYVTNILSRLNLRDRTQAALLAHSLFN" gene complement(14800..16059) /locus_tag="DP116_05515" CDS complement(14800..16059) /locus_tag="DP116_05515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008177657.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_05515" /translation="MFNLTYEFKLKPTQQQVALFEEWLETHRRVYNHALAERKDWYKS RSCQINACSLRSCYIIRADAPRPTFASQCKSLTAARNATCFMPGNPSTAVAQESEYLK RVNAQSLQQTLRRLEKAFVSMWEQNHGFPRFKKAGRMRSFSFPQLGQNPLSNRYIKLP VIGAVKIRQSRSIPEGGVIKQARVVKRASGWYVMLTVQWDVNPPQRLPHGEAVGIDVG LTSFVATSNGLLVKRPRFFVDAERKLKLLQQRVSRKRIGSNNWKKAQKKVASLHEYVA NCRKDWHRKLSHQICNDAGMVFVEDLNLIGLSRGMLGKHCLDAGFGQFFNILEQTCFK RDVYFQKVDSRKTSQICPNCGTETGKKELSERTHACSNCGYTTDRDVAAAQVVAIRGL AAVGHTVKMLAEGKFIGIPVKQESSYQ" gene complement(16190..16588) /locus_tag="DP116_05520" /pseudo CDS complement(16190..16588) /locus_tag="DP116_05520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018397029.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" gene complement(16498..16899) /locus_tag="DP116_05525" CDS complement(16498..16899) /locus_tag="DP116_05525" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05525" /translation="MIGAKLFDTSMNLKNIIINIETVSRMNEVELTQMLHSFNHYGFA ILQCIQLNHFALDFLLLSKIFGKPTRHNRADDRGIVPIRPLPGYQAYLGASNEQLVRL QRRKPYLSRFSPGIFFQLASSTLGCLASLAV" gene complement(17197..17820) /locus_tag="DP116_05530" CDS complement(17197..17820) /locus_tag="DP116_05530" /inference="COORDINATES: protein motif:HMM:PF01202.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05530" /translation="MFNICANCGEYHPDKVIVSSGPYAVCPNCGFKHKFIQLPLFILS GASGTGKSTISLALADKMKEVVVMDSDILWRRELQQPGTDLREYRETWLRVCKNISQS GKSVVLCGAAIPEHFEECVERRYFSEIYYLALICDDEILTSRLRSRPAWRGFTDEWIK EHISFNRWFKDNAHKTKPPITLLDTSKMTVNQSVEEVVRWINVKYLH" gene 18061..19299 /locus_tag="DP116_05535" CDS 18061..19299 /locus_tag="DP116_05535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863106.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aminoglycoside phosphotransferase family protein" /protein_id="PRJNA477356:DP116_05535" /translation="MVLSLSSQNVIQYLYQAGLCSSEEGKNSYSELPQTSQKNFNLVV TLPGNQKLLVKQERCIDNNENPHDFFNEWLFHQLLKQFPVLGNISATASLVVHFEPEK SILVRHYLKEYFELASFYQKNLYFPKAIASAIGTSLGALHRATFNRREYRDFMATAPE GQFRYQFYNPAQGLGSIGSEIFGNVPTDALKFYALYQRYESLEAAIAELAYEWNPCCL THNDLKLENILVHSRWEKLDNCLIRLIDWEGCSWGDPAFDLGTLVASYLTLWLESLVV DDTIELEESLLLAAIPLEVIQPSLLNLILAYLDTFPVILEYRCEFVQRVIQFAGLVLL HRIKEKINSHKYFDNSSICMFKFAKSLLSRPQESVLTVFGITESEILKLFAKFVQLSH PKKENNLLRLYYDKTRLRGC" gene 19457..20545 /locus_tag="DP116_05540" CDS 19457..20545 /locus_tag="DP116_05540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05540" /translation="MLESSTKPLLSSLLDIASNIQIKSNFCIRHPKYQPFALPSKIAE RFRQNSPALQHKYLALLLRNFLHGIYYNGSLQTTLSLSSDVNHDLPQKNLESHSILEM DWQFYELLHISNHGIGYFDPGWQLLRREPDGSIAVSKGGLTLYVEYNHNLEPSTQTAK VGDLIDIWMPKNRLQNGFYVAVSNVGQDLQTNPDTDLGAGRIYFNVTPMGALALMNSL TQQLNAAAIPFSFQVLHNRAAYGRYDSGVLSFEREDYPAVRKVLRSVYAEHQSHFHTE IPLFTKFLAPGLSLAEELSQKFAVQESFGMNRCQIVANALLEVWQQDNDSTDERMRAI QLHFTRLGIDLQRPYLNPCSEDIYYPLN" gene 20638..21084 /locus_tag="DP116_05545" CDS 20638..21084 /locus_tag="DP116_05545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459014.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="1,4-dihydroxy-2-naphthoyl-CoA hydrolase" /protein_id="PRJNA477356:DP116_05545" /translation="MAFTYNRTIRFQDTDAAGVVYFANILSICHEAYEESLVMSGINL KDFFTNPSVAFPIVHANVDFFRPLYCGDNLVIRLMPQQLSVDRFEVASEVIVGEVMAA KVVTRHVCIETNSRTKTELPENMKQWLEINRRGAESAERRKSREAI" gene complement(21086..22195) /locus_tag="DP116_05550" CDS complement(21086..22195) /locus_tag="DP116_05550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009545596.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_05550" /translation="MAYNAFFQGLRKPPKFKSIRKFSGWTYPATSGWKVNTSGRHGSV NLNDLGVKIKMRGQARQWGKPSTLTVVYKPGLKQWFASFTVEVPDVTVRFGSQSELAY NEIVAFDLGCETAITTYDGKKVEQVANPRFTQKTEAKIKKVSKELRRKQAPNRTKKIK ASRRWKKANRRVAQLQRKAGSQRRDWQHKVTSEIASCYDIGVTEQLNTKGMTRKAKNG SKRKKQKAGLNKSILSVGFGTLNKMLTYKIEAKGGLMIMLPTKDVKPSQRCPKCGTVH KQWAELSNRHHVCLSCGFDVPRDAGSAMVMYNVVTNQQPGLGTSLDSLGCLSSTSKTS KRKNTGSMKQLGQARRQKSSYVAEPGETPSAYAAG" gene 22544..22888 /gene="tnpA" /locus_tag="DP116_05555" /pseudo CDS 22544..22888 /gene="tnpA" /locus_tag="DP116_05555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749332.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS200/IS605 family transposase" gene 23379..25793 /gene="glf" /locus_tag="DP116_05560" CDS 23379..25793 /gene="glf" /locus_tag="DP116_05560" /EC_number="5.4.99.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-galactopyranose mutase" /protein_id="PRJNA477356:DP116_05560" /translation="MSGEHTQIKNNGISNGKSRTKLSTLTEPQSSGAAKLQKLSLSNK SFKEASDCDTPDIVCFSHLRWNFVYQRPQHLLVRCAQGRRVFFIEEPIFSTEPLGRLE VSQDKNGVVVVVPHLSEGLSEDGINADLKVLIDGLFAQHNICKYMFWYYTPMAIAFTS HLEPEAVLYDCMDELSAFKGASTGLKNYEAELFRRADLVFTGGQSLYESKVNQHPNVY AFPSSVDVPHFAQARNLKEEPADQANIPHPRLGFFGVIDERMDIELVAGIADARPDWH LVIIGPVVKIDLALLPQRENIHYLGGKDYKELPYYLAGWDLAMLTFARNESTRFISPT KTPEYLAAGKPVVSTSIRDVVRPYGDLKLVRIADTADHFVTAAEQAMQEDTAASGWLS RVDAFLEQISWDRTWGSMMQLIDSAIAARQDSAVTNIPQAPSIITRDFVFDYLIVGAG FSGSVIAERLATQSGKKVLVVDKRNHIGGNAYDHYDEHGVLVHRYGPHIFHTNSREVF EYLSQFTQWRSYEHRVLASIDGQLLPIPINLDTINKLYGMNLNSFQVEDFYKSLAQPR EYIRTSEDVVVSKVGQELYEKFFRGYTRKQWGLDPSELDKSVIARIPTRTNRDDRYFT DTYQAMPLHGFTRMFEKMLNHPNIKVMLNTDYQEIEKAIPCREMVYTGPVDEFFDYRF GKLPYRSLDFKHETHNKEVFQSAPVINYPNEQLYTRVTEFKYLTGQEHSKTSIVYEFP KALGDPYYPVPRPENQEVYKQYKALADATPGVYFVGRLATYKYYNMDQCVAQALSVYK QIPVRA" assembly_gap 25884..25894 /estimated_length=11 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 25895..26016 /locus_tag="DP116_05565" /pseudo CDS 25895..26016 /locus_tag="DP116_05565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010872578.1" /note="frameshifted; incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=2 /transl_table=11 /product="UDP-glucose 4-epimerase GalE" gene 26295..28019 /gene="hflX" /locus_tag="DP116_05570" CDS 26295..28019 /gene="hflX" /locus_tag="DP116_05570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407039.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GTPase HflX" /protein_id="PRJNA477356:DP116_05570" /translation="MQNIYGNLQGIKSNQIKLLQRLYEERQPKDRFITLEFAQALGEI STEIHQPICCYINRRGQVIRIAVGTPSQTQIPPQELPRHSAERLSGIHCVATQFKSEP PDEAALIAMVRQRLDALVVLSLADGEGGRHKKAATSNVKEAFIANLVPNAEKPWEISP PLSLDDLTEQDFDDLVGEWEKEISDSGDSMSFQDIVSDQDKVLLVGLKTDDISQQRFE DGLQELVRLVETAGGIVSDTVEQKRSRPHPQTVVGKGKVEEIAFQAQKVGANLIIFDR DISASQARNLENEIGMRVIDRTEVILDIFAQRAQSQAGKLQVELAQLEYMLPRLRGQG REMSRLGAGIGTRGPGETKLETERRVIQRRIAQLQQEVNQLQAHRDRIRQQRQRQEIP VVALVGYTNAGKSTLLNVLTNAEVYTADQLFATLDPTTRKLVITNPETQERRTILLTD TVGFIHELPPALIDAFRATLEEVIEANVLLHVVDLSHPAWESHIASVEEILAEMPAIP GKSLIVFNKIDSVDSETLAKAQQEYPEAVFISATKRLGLETLKGRALQLIDETVATSE LQNAVTQS" gene 28049..28852 /locus_tag="DP116_05575" CDS 28049..28852 /locus_tag="DP116_05575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phytanoyl-CoA dioxygenase" /protein_id="PRJNA477356:DP116_05575" /translation="MMTRRYNPQQIESFTQSVLNDGFCVLPNHFSPATLKAWHSAFIP KLTEHIASEGHLRNRGTARYYVTLPFAAPFADTSIYEDEDLLDIVERLVGADFVMCQL ATDTPLLHSEYQDIHRDTLPLFPETGMETPPYQLAVNFPLVDVTLENGPMEIARGTHM MSKEEGLRRIESGEIKLEPVTMQLGDVMIRDVRGLHRGTPNYTETPRPMVVIGYSRRW LFRPEVSIQIPRAAMTTLSERGRHLLRFNPIVESLDEFTGTEVYQSFAY" gene 29095..29994 /locus_tag="DP116_05580" CDS 29095..29994 /locus_tag="DP116_05580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318377.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase T" /protein_id="PRJNA477356:DP116_05580" /translation="MTLAIIVHGGAKTISEDKVAANNAGCTAAAEAGWAVLTSGGTAA EAVEAAIRVLEADQTFNASLGATLNTEGEVELDAAIMDGSSLGWGAVAAVQGVRHPIS VARKIMDEKPRMLVARGAERFAADNKAEMCKKEDLIADEQWEQWKEDQEVLDRPNTVG CVALDANGILAAGTSTGGTTNQQAGRVGDTALVGCGLYADNQLGACSTTGDGESIIPV VLAKTAIDFLDGDRHPDEAAQKAIDTLKSKVTGEAGCILLDRQGRVGWAYNSSHMACA YMTTAQDEVAVFTKKEAALSHQM" gene 30172..31452 /locus_tag="DP116_05585" CDS 30172..31452 /locus_tag="DP116_05585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357348.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbamoylphosphate synthase large subunit" /protein_id="PRJNA477356:DP116_05585" /translation="MNSKYDFQYFQGSYLSNLFAQDITDARYAFILNYPGTASWAAYP NRKKYFIQDGSSEATKTSFDKICQKEPWKNLAVLGDTLPGIVIISPPKLLIDYWQEHF GFSHSNMNMEMMDCSTYLNDLNQSERTDKLITLFPFDNLQPEKHAVNPDTHYRLLSKV TLAELGVQCPKYSSYNLHTQSLEDIELPQFPYLIKTSHGLSGEGTYIIKSASDLNYCL EEIRKYLDIKLLDTIIVSEFVKNQVQNYCVQFYVNKAGDITLIGTTSQLVTPEGNYLG GLIHYRETDMSKFYEMIAAIGQYAHKLGYFGVIGFDVLEDQDGEFYVIDVNFRVNGST GLCLQRHTLLSLGKEVAKYSSEYRMDGTLDSILVTLKPQLDRKDFIILSALEKVKYGK IYTEIYGIVTGQTIEEMQHIEQKLQNKGLQMSFV" gene complement(31690..33111) /locus_tag="DP116_05590" CDS complement(31690..33111) /locus_tag="DP116_05590" /EC_number="6.2.1.26" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2-succinylbenzoate--CoA ligase" /protein_id="PRJNA477356:DP116_05590" /translation="MEKTLENFVQNASCQSLSNDWLICHDSHLFAQLTEQLYLELTQF SYYRGTPIKILLAERDPVRFLAGFLAACAARCPVFLCNPDWTKQEWQQVFDLVEPDLI WGLGTKNWGQNNSKFQIPIIPSPHSGLIMIPTGGSTGKIKFAIHTWETLMASVRGFTE HFLINHVNSFCVLPLYHVSGLMQFMRSFTTGGKLVILSFKELEYRQVDNIEPSKTFIS LVPTQLQRLLQNPELTQWLSQFKTVLLGGGPAWNELLEKARYYNIRLSPTYGMTETAS QIATLKPDEFLNGKDNCTQILPHASIKICNEQGEELNSNQIGNITIYSQALALGYYPN IWENPAYLQVDDLGFLDNKGYLHIVGRNSDKIITGGENVYPIEVESSIRATKMVADVC VIGMPDKLWGQAVTAIYIPKDSNTSDIEIRNLLKDKLSKFKIPKNWIPVQTLPRNSQG KINRQQLQQIATKFLQTRLTDEI" gene complement(33096..34298) /locus_tag="DP116_05595" CDS complement(33096..34298) /locus_tag="DP116_05595" /EC_number="4.2.1.113" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459021.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="o-succinylbenzoate synthase" /protein_id="PRJNA477356:DP116_05595" /translation="MRYRFEFRPYQQKFVTSLMTSHGIWDIREGIILRLTDETGKIGW GEIAPISWFGSETLEQALEFCRQLPEEITQETIFSIPDELPSCQFGFESAWETISTLT RPEINFYSYSPSPLKWTTNLGQSSLDDLGYEPGVLNPRRLLGLVQDLIKIETLSYSAL LPAGEVALEAWQKLWKEGYRTFKWKIGVYPIVQELEIFELLSQTLPASAKLRLDANGG LRYEEAQLWLVNCDNIKANEEICLEIEFIEQPLSVDQFAAMLELSHCYQTAIALDESV ATLNQLATCFQQGWREIFVIKPGIVGSPARLRQFCQQHKIDAVFSSVFETAIGRQAAL QIAAELSRTLSSTGASAAGGFPSVGVWRWGTPARKCPPQRNRAVGFGVNHWFAQQETT PEELWKKL" gene 34416..35831 /locus_tag="DP116_05600" CDS 34416..35831 /locus_tag="DP116_05600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311242.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="isochorismate synthase" /protein_id="PRJNA477356:DP116_05600" /translation="MTVSSCNADFFVYNKELYCFLLAVQQNCIKNNDTQIASFALDID WVDPLVVLDKLAQPNKVSFYWENKRKKEALAAVDAVAKIEIAGKDRFAKSEEFIKECV KNITSFSRTNQAFFGTRFLCSFSFFDQNSQADYPFSAATIFLPRWQVAVKNERCVLVF NTIINADTNIQRILQSLSRKLEIINSLESNSQTLDYSLPKFSHKSVANPQHFKCSVLS ALEKIQSNDLTKVVLADILDVRSNAHLNVIKSLNNLRQLHPNCYVFSHSNGKGQNFIG ASPERLINIQEQQLMTDALAGSAPRGKTPGEDANNASRLLNSEKERHEHSLVIDFITQ RLSQLGLLPQVLAPRLRQLSNIQHLWTPISAGVPADVHPLKIVAQLHPTPAVAGTSQE VACREIRRYESFERGLYAAPLGWVDLEGNCEFIVGIRSALIDGDRARLYAGVGIVAGS DPDKELAEVQLKLQALLKALV" gene complement(35823..>37223) /locus_tag="DP116_05605" CDS complement(35823..>37223) /locus_tag="DP116_05605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874812.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_05605" /translation="NRSAERVYGWKAKDALGKNANELLYKKIYSQLQEALSQVNLTGQ WYGELSQIRKDGKEVIVETRWTLVRDEHENPKSILTVNTDITEKKKLQTQFLRAQRLE SLGTLASGIAHDLNNTLAPMLMSAQLLRMRISDERNQQLLETLETNAQRGAAMVRQVL SFARGVEGKRTILQIKHLISEIEQFAKQTFSKSIEFCTDIAPNLWTVSGDATQLHQVL MNLVINARDAMPKGGVLSLCAENFLIDENYARMNLDATVGPHIVITVKDTGIGMPPEV LDRIFEPFFTTKEVGKGSGLGLSTVLGIMKSHNGFISVSSKVGKGTQFKVFLKAVLEN QTPLEESLELPTGNGELILVVDDEAEIREITKITLQNYNYKVLTACDGIEAIASYAQN REEIKLVFMDMMMPVMDGLTTIRTLLKMNPYVKIMAASGLADNKQLTQPLGVETFLLK PYTVKQLLQTLDKILN" BASE COUNT 10506 a 7825 c 8084 g 10787 t 21 others ORIGIN 1 aattcaaaca ggtatacagc gtaagcgttt catggatttg caatgcgtag tttatttatg 61 ccgtggtgta ctagttcaaa atctcccaag tgggcaaagt gtttttctat catatttaca 121 agatttacag gcggtttttg attttctatt ttcaagataa tcagatgcaa aaatctggag 181 ttttcactgc ttactaattc tccatcgtat acctgtattg ccttctgtgt taggagctag 241 cctaacagag ccgtccgctg ttacaccatc taaccatttt gattttccct gattatctaa 301 ggcaactact ccatcaccaa gtatattgat tgtccatttt gtgcctgtat agccaccttt 361 tgtgttagga 
gctaacttaa cagagtcgtc cgctgttaca ccatccaacc attttgaacc 421 attaattact ccctgatttt ctaaggcaac tactctatta ttaataatgt gtaccttcca 481 tttagtagaa gcatcaatat tagaagaacc taagcttaca ttattatttt ttatgcgaga 541 atctaaaaat ctcaaaccag tatactggtt tcctagacat tctaatgaaa taacattacc 601 atcttgtaga agtgaagaag ttttagtaat ccaagcacga atattactta attctgcttg 661 tctgttttct aacaaattaa cattggaatt cagctcatta tctgtaggtt ttcgtccaag 721 cacctcttga taaatatcga tgatttgctg attcacagat ttattgcttt tagcggttgg 781 actaaggctt acaggttgct cagttgtatc tgcattacta gacaatttac tgctctggat 841 aaacgagaca aacccggcta acccaattat aacaatcaca ctatatctaa tgaatttgtt 901 tttagcaagt tgaataaagg gattaacgct gacactaggg gcaggactag acgatagaac 961 agttggctgc tgattattgg tgaaattctg acttaacttc tgattacaat tgagtttggc 1021 tgcaatttca cttaaaatag gggcatattt accatgatga aaaaccggaa gtagttcctt 1081 acaacaaagc ttgcaaagag cttcttgttc acctatttca ttcaaatacc taataagtag 1141 tagaaaaaag tcagattcgg aagaatctct gataaatgaa agtcgcttgg ggtcgattcc 1201 aatgctaaca catagtgctt ctcgtgttct tgattgttgg ctacgcagta aaaggtttat 1261 taatgtgtct atatcagttt cttctaaaaa aaattgagtt gacatacttt aaattcctca 1321 cactagttac aatttagttt agatttaaca ttttccaaat cagccgagta cttacttctt 1381 ttaaacacaa cttcaagttc tttgctgatt ttacaaagtg cttgtttgtc acctatacta 1441 tgcaaatagc taattagctc taaggcaaag tcagcatcgg tagattgcct taaaaatcca 1501 agttgatttg gttcaattcc aatttttata caaagtggtt ttcgcgccga atattctgca 1561 cgtccactac gtattaagag ttctgtaagt aattctctgt cagagtcgtt tagttcctga 1621 tctaactcag acactcgttc attggggact ggaggctggt ctttagatcc attcaaaggc 1681 ttcttcttta aagttggggt tagttgctct ggaatacctg ctaacctaat tgcagcacaa 1741 ccaaatttgt aggcaaattc aacaggcttt cctgctccta gagcatcgta aaaacccaca 1801 gcaaattcaa ttgctgctct atctccgatt gccttgctca taccaatcac ataattaatg 1861 tgttgagaga tagcatctgc ttgcacttgg gagtaacagc cattaagaac aacacactca 1921 acctgatcag caaataaatc aaacagtccc gctagagctt ctccatcgac taactttgcc 1981 aaacctgttt cgtcttcaaa tactagaccc tcatctcctg tcccatgccc tgagaaatgg 2041 acaatagaag gattaatatc cagcattgct 
cgctggatat ccctcggtcg cactgcccac 2101 ttctgctcta agacaaactg gtcgcggtgt ttggctcgct gtagtccagc atcgatttca 2161 cgcacctctt cattcaaacg cagcggtgtt gtgcttttag gatttgctgc taaaatcaag 2221 attttttgtt gcataaatta attagctcca gttatattaa attgaaaagt ttttattaca 2281 taactaactt tgttactagc aaggttattc aacaggattt gggtggtaac ttcttagcta 2341 tttcactatc agtttgtgaa atttcgtttg ccctaagcca ttgctggatt aactcaatga 2401 ctacctgatt catctttttc ccctgcataa cgcaggtagc atgaaactgc tttttcaaat 2461 catctgaaat gttgatatgc atttctttag tagttttggc attcatgtaa ttcactcaga 2521 catggataaa gctacaaaag taaaaacggc atttctgaaa aaatgccgtt aactcctacc 2581 ttgcatctat accctattcg tatgaagaga tacgaataag cccttgatta gcacgctaaa 2641 aaacctgatt tttcgtatta ttgacaataa tatacgtttc tttagaacgt caaattccag 2701 tattaatacg ttagtgaaag atataaaatt aatgtaagcg ctctgacatt caaaacccaa 2761 taacagaaac agccttcttc cagtgctctt ataccagttg accctaattt ctcaattaac 2821 agctcccctt cttttcatat tctcataatt acgtactagg tttcagaatc agcccaagtg 2881 cttagctttc ttccaaaaga agattctccc aaacttaggt tgaccaagct ttttccgcca 2941 ttcttccatg taagtgtaga acactggagt caaatagaga gtcaatattt gagagaacac 3001 taatccgccg acaattgcaa tccccaaagg acgacgcgct tcagaacccg aacccgtccc 3061 caacgcaatt ggtattgtcc ccatcaaagc tgccattgtt gtcatcataa ttggacggaa 3121 gcgaaccaag caggcttcgt agatggcatc aaaggaattt tttccttccc gtcgctgcgc 3181 ttcaatggcg aagtctacca acataatacc gtttttcttt acaataccta ctagcaaaat 3241 gattccgatg aaggagtaga gatttaactc aacatcgaaa atcaacagtg ttaacaaagc 3301 accacagcca gccgaaggca aaccagaaag aatggttatg gggtgaatga aatcctcata 3361 gaggatacct agaatcagat aaatcaccac aatcgacaca agcaacagcc agcccaagtc 3421 gttgaaggac tgttggaaca cctgtgctga accttgaaag ttggtagtga tgctttgagg 3481 cagaatctga ctagcgagtt gcttgatggt gtctgtcgct tgactcagag acattcctga 3541 tgttacgtcg aaagagatag tggcagaagg aagctgacca acgtgtttga cggtgagggg 3601 actcactcct tgggtgatac gggcgatcgc tgttaaagga acctgttgtc cgttactcga 3661 ttggatgtag agtagcgata gagagttggg atcgtgctga tactgcggct ctagttccag 3721 aatcacataa aactgatcgt ctggggtata gatttttgaa 
atttggctag aaccataagc 3781 agccccaagt gtttgttcga cttgttccgc cgtaatccca agagttgctg cttttttgtg 3841 gtcaatctcg acttgtaact ggggagtact tagttctaag tcgctgtcaa catcgcggaa 3901 tcctggtagc gctttgattt tatctttcag tataggaacg tattgacgga gtgcttgtag 3961 gtcaaggctt tgcaatgtaa actggtatga tgaattagtc tgttgaccac caatagggat 4021 cgcaggtgga gagcgtaaga atacttgaat gcctggtatg cgcgtcaact tagggcgcat 4081 ttcttgaatg atttggtcgg aattgagacg gcgttgggaa cgcggcttca acagaattgt 4141 aattcgaccg gagttgacag cagcattagg tccactcgca cctacaatag agtcaactgc 4201 ttggacattg gggtctcgac gaatgatgtc aacgaccttc tgctggtgac ttagcatagc 4261 atcaaaggat atgtcttgtg ctgctcgcgt gtttcccatg atttgtccag tatcctctgt 4321 ggggataaat cctttgggaa caaggacaaa caagtagact gtcaacgcta agagaatgac 4381 tgaaccgata agtgttatga ggcggtattt gaaaaatggc ttgagactcc actcatagcc 4441 ccgcaacagt aaatcaaaga cacgctctga gactcggtac agacggcttt gatgctgatg 4501 atttggtgga cggatgaatc gactacacag cattggcgtg aggctgaggg aaacaaagcc 4561 cgacaccaaa atcgcaacgg caattgtcac ggcaaattca tgaaacaacc gaccaattaa 4621 tccacccata aatatgaggg ggatgaacac cgccaccagc gataacgtca tcgacacaat 4681 tgtgaagctg atttctcgcg agccattaaa cgctgcttcc agtggagatt cacccatttc 4741 gcggtgacga acgatatttt ccagtacaac gaccgcatca tcaacaacaa agccgactga 4801 gagcgttaat gccatcagtg agatgttatc cagagaatag cccgacagat acattacggc 4861 aaatgtgccg atgagtgcca cgggaagcgc cagactggga atgagtgtgg cagagagttc 4921 gcgtaaaaat aggaagataa ctacgacaac caaacaaact gatagaaata acgtgaattt 4981 cacatcatca acggaagcgc ggatcgattc agagcgatcg tacatgatac ccatctcaag 5041 tgatttggga acttgctcgc gcagagtggg tagaagttgc ttgatggtat cgacaatttc 5101 caccgtatta ccatctggtt gaggctgcac agccaaaaca atcgaatgaa tgccgttgta 5161 aagatttaag actttatcat tttgcacgct gtcaatcacc tgacccacat cttggagtct 5221 cacaggtgcg ccatttttgt agctgataat cagtgaacga taagcagctg cattcgtgag 5281 ttgaccgttt gcttggatgg tgtaagtttt atcatgacca gaaagactgc ctgttggcaa 5341 gttcacattt gactgggcga tcgcatttct aacctgattt agcccaattc cccgcgtggt 5401 caatttttgt ggatcaactt ggacgcgaac agcatattgc tgctgaccgt 
atacttgtac 5461 ttgggcaacg ccgtcaatca tagagattgg ctgaccgacc gtaatctctg cgtattcgtc 5521 cactgtagag atagggagcg tttcagaata catgtagagg tagagaattg gtgcaaccga 5581 tgggttgacc ttgcggtagg taggtggttt aggcatacct gggggtaatt gtccagctgc 5641 tgctgagatt gctgcttgga catctttggc aacatcgttc acgctgcggc tgaagtcgaa 5701 ttgcagggag atgttggtac tgcctgttga actggtggag ttgaaggaat tgagtccggc 5761 tatgctggaa aattgtcttt ctaagggcgc agctacggaa gatgccattg tttcaggcgt 5821 cgcaccgggt aaactggcgg agacggaaat gaagggatat tctacatgag gtaaggcgct 5881 aattggtaag agtgcataac tcatgagacc gaagatgacg atacctatca tgactagggt 5941 tgtcatgacg ggacggcgga taaagagttc tgaaaggttc atgagttgtg ctgttccaga 6001 ttttttcttt caaaatttgg ggtggatgat gttggcaaat atgaagttat ttgattgctg 6061 tcaaaatcat ctctacatct ttgatgagat tagcgatttg tgaaaatgat tttttaaata 6121 aaccgcagag gcgcagagta cacagagaaa agagtaaaga gaggttttta cgtatgattt 6181 gggactgcta tagatcgtcg ctggatcatt gacttggcac aaagttttat gattgttgct 6241 gggctaacaa gcgagaatga accaggctca agtcaggttg aacactggac aaacgttgac 6301 gaactgattc agctaactgc tcgcgttgat cgagatatgc ttccacagca tcgcgatgtc 6361 gaagatagta accaatggtc agataaacat cggataaaga taaggatgag taacgctgga 6421 ctatcgattc aggggatgca ccatcctgaa aggcgtgaat gaccagttcc aacaaaactc 6481 ttgtattacc gactcgaatt gcgcctgtgg tatcttgact taggggagga ctttcttgtt 6541 ctagtattaa actcatggat caatctccta ggtaaactac tatctaacgg gattccgcga 6601 tgattgtttc agaaacgggc tttccttcaa cttgaatgag tcttcgttca gagattagat 6661 cgtaggaagg atgcttaatc tgcttgacta agccagcatc aatcaaagct tggtgaaatg 6721 ctgcttgctt gttagttgtt tctttttcgg caagatacct ttgaattgtc tgatgaagct 6781 gctgaagttc ttgtatttca agtgtctcaa gttgattcag tatttgctta agggtttctt 6841 gtgccataat tttttccttt ttatagatta ttgtgtttac cgctatttgc tgaaagcttc 6901 ttcatgactc cagtgactct aatagctgac gcacggacat tacaggtatc actgaactag 6961 aaaatccctt ggtgtcgcgg gtcacaattg cgtctaaact ttggaacact gcacagtaaa 7021 tctgtaccgc gtcttcaaaa tcagctaaac cagatgaaat tgccgcttct aaaatagccc 7081 gatctacaga gcaaatgacc ataacagcca acgtggttaa aatcgcttgt tgagcttgtt 
7141 caatacttcc agtttgtctg cgggcgatgt agaaaatgtc agtgagtgtt gtagcagtga 7201 catacccaac tactttccca gaatcaattg cgttgaatag agcttccgca tgttgcagaa 7261 atggctctcg ctctagcaag taatctaaaa caatattggt atcaatgaga actttcacta 7321 ataatacttc tccatccgac gttcttctag cattgctgct acctgctcat cggttggtgc 7381 gggtttgtca gttttcaaca accctttcat ttgtttgata atacgggatc tttcaggcgt 7441 ctgcccttca agcaatcgct ctgattgttg attgggaaca tcttgtaaag attcgataat 7501 cgcatgaaca agttcaaggc gatcgtttac agataattga cgggcttgct ttttcagttc 7561 ttgcagggac gacatatgaa tgactccaac aaacactagc atctaaattt tagtctgagc 7621 aatcaaggac tcaggttata tgtgaaactc atcgcatcag ctttatttca aagtttttta 7681 ccgccctcct gaaaacgaga caacccgacg aaacttaacc tgtctaagtt ttttccgcca 7741 agcttccata tatgtaaaaa atacaggtgt taaatacagc gttaatatct gagaaaacac 7801 caaacctcct acaaccgcaa ttcccagagg acgccgagat tcagaacctg caccataacc 7861 tagggcgatt ggtaatgttc ccattaaagc tgccattgtc gtcatcatga tcggacggaa 7921 gcgaatcaaa caagcttcat aaatggcatc aaacggactt ttgccttcgt ttcgctgtgc 7981 ttctatggca aagtccacca tcataatgcc gtttttcttg acgataccta ccaagaggat 8041 gatgccgatg aatgaataaa cgttcaattc cacatgaaaa attaataacg ttaacaacgc 8101 accaaatcca gcggaaggta gaccggaaag aatcgttaaa ggatgtataa aatcttcgta 8161 gagaataccg agaatcagat aaatcaccaa aatagcgaca gctagtaaca atcccaaact 8221 cggcaatgaa gattgaaata cttgacttgc tccttggaag cttgttgtga tgctagcggg 8281 aatcacttta tgcaccaagt cttctatggt ttgagtggct gctcccagcg atgcatctgg 8341 tgctaagttg aaggagatcg tcgcagcatt catccgacca tagtgattga ccatcagcgg 8401 tccaacacct tgggatattg tgacaaatgt actcagagga actgcttgac cagtagtgga 8461 atttgtgctt gcgctagtac tgctatttgt ggtggaattt gtactggtgc tagtactgct 8521 attcgtggta gagttcgtac tcgtactagt gttaacgtag agcttcatta aagcatttgg 8581 atcttgctgg tactgcggtt ccagttctaa tatgactttg tactggtcac tagcggcata 8641 aattgttgac acttgataag cgctataggc gtttttcaac gtgttttcga tttgttgggc 8701 ggtaatgcca agggtggaag ctttatcacg gtcgatatcn nnnnnnnnna tttgtaagtc 8761 gctgttgaca tccagaagtt ctggaagggt ttgcatctgg gcgacaagtt gcggaacgta 8821 
tttttcgagg gattggacat cggtactctg aagcgcaagc tgatacagtc ctgttgtctg 8881 ttgtgttccg atgggaattg caggaggatt ttgtaagaag actttaatcc ctgggacagt 8941 tgctagcttg cttcgtagtt cttggacgat ttcatcagcg ctgttgtggc gatgtgaacg 9001 ttccttcagg cgaatcaaca agttcccaga gttccctgga actgctgccc cgctaccgct 9061 cgcactggaa ccaggaccaa tgttagaatt aactgcatcg acattcgggt tttgccgaat 9121 caaatttacc acttcttgct ggtgacgcac caagttatcg aaagaagcat cctgtgatgc 9181 ctgtgttgtt gcaataattt gtccagtatc ttcactagga ataaatcctt tgggaacagc 9241 aataaacaga taaatagtga caataaaaag cacaccagac aaaatcattg tggtcaagtg 9301 aaactttaaa gctattttca aactccaatc gtaaacagcc aagaagcggt caaacacata 9361 ctctgaagct tggtacagac ggctttgatt ttcgtgattt gttgcgccta caaagcgact 9421 acatagcatc ggggtgaggg agagggaaac aaacccagac accaaaattg caacggcgat 9481 cgttaccgcg aattcgtgga acagtcgtcc caacactcca cccatgaaca gcattgggat 9541 aaaaaccgcc accagtgaaa ttgtcatcga caagatggta aaactgattt ctctagaacc 9601 gtttaaggca gcttctagac gagattcacc catttccatg tggcgaacga tgttttccag 9661 catgacgatc gcatcatcaa ccacaaaacc caccgatagc gtcaacgcca tcattgataa 9721 gttatccagc gagtaaccca gcatgtacat cgctgcaaaa gtcgcaatca gcgatacagg 9781 taatgccaaa ctgggaatca ccgttgctga gagattccgc agaaacagga atattactaa 9841 aataaccaga ccaattgtca aaatgagcgt aaatctgaca tcatctactg actctcgaat 9901 tgactgggat gcatcataga aaatccctat ttcaactgat tttggaattt gctcacttaa 9961 ctttggtaat gatttcttaa ttgtgtctac aacttgtact gtatttgtac ctggttgccg 10021 ttgaatggtc aaaatgatag cgcgagtgtc attataccag cttgctacct tatcattttg 10081 aacactgtca acaatcttac cgagttgttc aaggtaaatt ggggcaccat tgcgataggc 10141 gacaatcagg gagcgataag cagccgcatc ttgtaattga ccgtttgcct ggactgtaaa 10201 attcttgttc ttgcccgaaa tactacctgt aggtaaattt acgtttcctt gttgtattgc 10261 agtttgcacc tgatccagtc caatctgctg acttgctaac ttctgaggat ccagttgaat 10321 acgagctgca tacttttggg aaccgtaaac ctgcacctga gcgacaccat tgattgtaga 10381 cagcttttgt gctatgtagg tttgggcgta gcggtctact tgggaaagcg gcagagtcgg 10441 tgagtttata tataagtaaa gaatcggctg atcggctggg ttgaccttgc tgtaagatgg 10501 
ggggttaggt aaatcgttag gaatttgtcc agacgctgct gatatcgccg cttgcacatc 10561 ctgagccgca tcgtcaatat ttcggctgag gttaaactga agcgtaattt gggtactacc 10621 taatgtgctg gtcgagttga gggaatcaag tccggcaata ctagaaaact gcttttccaa 10681 aggacgggca acagaagatg ccattgtttc cgggctagct cctggtcgcg ccgctgacac 10741 ctgaatggta ggataatcaa cgctgggtaa gtcgctgatt ggcagtaagc ggtaactcat 10801 gagaccgaag atgaggatac ccgccatcac caaggtggtc atgatcgggc gacggatgaa 10861 tagttgcgag aggttcatga agcgcctcct gctgacttcg acgccggttt gactcgcacc 10921 ttactaccag aaatcagatt cgcctgaccg tcagtcacaa ctgtttgacc tggttgtagt 10981 cctttctcaa tgacatttaa accattaatc atactgccaa cagtcacagg gacattttct 11041 actgtgttat ctggcttgac aacgaagaca aattgaccat tgggtccatt ctgaacagct 11101 tgcgagggaa cgacagttgc attcggttgt gtcctcagcg ttaatgttgt gttcacatat 11161 tgaccaggcc atagttgccc ttgagcatta tcaaactgac ctataagttt aatggttccg 11221 gtgctgttat cgactgtgtt attgatgaat gtcaaaacac cccgtatctg acgattggtg 11281 ttggggatag taacatccac tgccagctta ttattgctcg cgtacttttg aatttccggt 11341 agttgcgtct ctggaaccga aaaggaaacc tgaattgggc gaatctggga gattgtcaaa 11401 agcggactcg tactattggc tgcaactaca ttgccctgag tcaccaaaat atctccagct 11461 tgaccggaga ttggggcgta tattttggcg taagaaagtt gcacttgggc gttttttaat 11521 gctccttgat cagaagcgac aactgcctgt gcattctgga ttgcagcttg atcgcccttg 11581 acgacttctt gagcattctt gattgcaact tgatcgctct tgacgacttg ttgagcattc 11641 tggattgcga cttgatctcc cttgacaact tcttgagcat tggcgatcgc ctctctatct 11701 gactgaagcg ttgccacatt cacctgacta ctagtagagt attgctgagc ttgatcctgg 11761 ctaacagcac cctgcttgta caagttagta taacgattac tttgtgcctg tgcgtattgt 11821 gcttgcgctt gatcctttgc caaagtcgcc tcagcttgtc gcaccagtcc ttggtctttg 11881 gctaacgtgg ctctggcttg ctctaccagt ccttggtctt tggctaacgt tgctcttgct 11941 tgctctacca gcgccaagtc cttggcgaca gttccttgag cttgttggat ggcagccctt 12001 tgtgtttggt catccagcgt gaacaggagt tgcccttttt tcacctcctg accttttttg 12061 aaaaacactc cggtaattct tccaccaatc tggggtgtca ccgatacggt agactgggct 12121 tgaacattac caattgcctg caattgcacc ggaactgttt tttggccgac 
tttggcgacg 12181 gttacaggag ttatcataga ccgtccttta cgtcccgacc tttcggacct ttgggatgtt 12241 tcagttttgt tgggcttggc aaaaaacgta cggtaaccaa gaaagcccag cgtagcgagc 12301 aacaccagac cgagtaaaac taaaccagtc cgcttcttgg atggaggatt ttgcttgaat 12361 tctaagtgat tttgggctga gttcttctgc gctaaactct ctttgggaat gttatcgagc 12421 accagagcct ccaattggtt taagtttatg gttgatggtg aaatctccgc attcactatg 12481 tcaattttat aaattgacat gaatgtttac atgaagcagg attcctaagc tgatcagtca 12541 gaaattacta aaactgatgt gacttttttc ctagtccaga aagagaatca gtagaaaaaa 12601 ctctatcgat actggtaaca attttgttag agaagaagag aatagacttc ttgcaaaagt 12661 atgatcaatg gatatttaca accaaggatg gataaagata aataactcgt gttaatcagt 12721 gtgcatctgt ggttttacgt tcactcctgt tgacctttgc aaaaagttta aggtttccaa 12781 cgttctttgt tggcgctgta ctttttttaa aatcaggata atgctagtgg tactgttcct 12841 ttggtcgttc attttagaca gtcctatggt tcgcccgagt taccaatcgt ttcgtctgtt 12901 gctttatctg gagtggctgt tgttagccac agcagtgtta atggaaattt tgttaccgtt 12961 cgagttgtct tggcacttgc tagagcgaat ttttataatt gctgccttcg gtttaatggg 13021 cttaagactg ccaactaaaa agttggcaga aaaattgctt tacacgagtg tggagtttgg 13081 cttgataatg cttgcggtca ctccacaagg cttaaccatt cgctctcttt tcctactgtg 13141 cctggtattg gtgatgcgga gttgcttgct gtttgagcga aagggacagt tgattgtatt 13201 gagcttaact cttgtgtcct atgtcatgct gcttgtgtca agacctatcg tgcctgcgaa 13261 gcttaaagtc gctgtatggg attggcgact gagttccttg ttattatata gcttgacatt 13321 agtatttgct ttattgttga ttaacgctct gcttgcagag tggcaaagtc gaaagcagtt 13381 ggaaattgcc catcagaaac tagaaatgac gcatgagcaa ctccgacagt atgcgttgct 13441 cattgaggat caggcgactt tgcaggaacg taaccgcatt gcccgtgaaa ttcacgatgg 13501 acttggacac accctagcag ctcaaaccat tcaaatgaat aatgccctgc tgttctggca 13561 atcgaataat gataaggcat tgacatttct caaacaagca aagcaactag gggctgaggc 13621 gttgctagaa atccggcgat cagtttcagt tttgcgttca aacccgttgc aaggacaatc 13681 acttgaatca gtaattgaaa aactgctgaa ggattttcag cacaacacag ggattgaact 13741 gtctagtaaa attaatttac cattatcctt acccacagaa gtgaacacaa cggtctaccg 13801 cattgtgcaa gaatcgctaa caaatattta 
caaacatgca caggcaacag ctgtgaccgt 13861 tcagctacag catcaggctg ggatacttga tctttccatt gaggacaatg gtaaagggtt 13921 taacccaact caaaatacaa ccgggtttgg actgcagggg atgcgagaac gagcactggc 13981 gttgggcggt cagtttcatc tccacagtca accagcaaaa ggttgctgcg tctgtgtttc 14041 tcttccacta tcaaacttgt tactatagac ttcgtggcag ttgtagggaa cgcttaactc 14101 ttaacaggga acagaacctc gtaaaatctc atttttgcaa gaggtcttat gattcggatc 14161 ttgttagtcg atgatcaact tcttatccgt cagggactca agagtcttct agagtccaat 14221 tgcgatatgc aagttgtcgg tgaggcagaa aatgggcaac gagcgcttga acaaatctct 14281 actctacaac cggatattgt gctaatggat attcggatgc ccgtgatgga tggagttgct 14341 gctactgggg cgatcgctca acaatatccc gacacgaaag tactggtgtt gacaactttc 14401 gatgatgatg ggtacgtctc gcaagcgatg cgagtggggg ctaagggcta tttgctcaag 14461 gacactgagc cagatgaact ggcgctagct attcgcgctg tctacaaagg acatacccaa 14521 cttggaccag gattgtttga aaaagcactc atgcccgtcc cagaatcagc cccctcgatt 14581 gcacaacccc cagaattggc gcaacttaca cgcagagagt tggatgtgtt gcgtttaatg 14641 gcttctggag ctaataaccg tgaaattgcg cagtcgcttt ttctctcaga gaacactgtt 14701 aagaactatg tgactaatat tcttagtcgg ttaaacttgc gcgatcgcac ccaagcagcc 14761 ctacttgccc attctctgtt taattgacac tcctcgcgcc tattggtacg aggattcttg 14821 cttcacgggg attccaatga atttaccctc ggcaagcatc ttgaccgtat gccctacggc 14881 tgcaagcccc ctgatggcga ctacttgagc ggctgcaaca tctctatcag ttgtatagcc 14941 gcaatttgaa caagcatggg tacgttctga tagctctttc ttacctgtct cggttccaca 15001 gttggggcaa atctggcttg ttttgcgact atctactttt tgaaaataga catcacgctt 15061 gaaacaagtt tgctcaagga tattgaaaaa ctgaccaaat cccgcatcta gacaatgctt 15121 acctagcatc cctctagaca aaccaattaa gtttaaatcc tcaacaaaca ccattccggc 15181 gtcgttgcaa atttgatgag acagttttct atgccaatct ttacgacaat tggcaacgta 15241 ctcatgcaat gaagcaactt tcttttgggc tttcttccaa ttgttcgagc caatacgttt 15301 tctggagaca cgctgttgca gcaatttaag cttgcgttca gcgtctacaa aaaacctcgg 15361 acgcttgact aaaagaccat tagacgtcgc aacaaaactt gttagaccta catctattcc 15421 tactgcttct ccatgtggca acctttgggg cgggttaaca tcccattgaa cggttaacat 15481 tacgtaccac 
ccagaagcac gttttaccac acgggcttgc ttgatcactc ctccctctgg 15541 gatgcttcgt gattgacgaa tttttaccgc accaatcaca ggcaacttaa tgtacctatt 15601 gcttaatggg ttttgtccta gctggggaaa agagaaagac cgcatccgtc ctgctttttt 15661 aaaacgtggg aatccatgat tttgctccca catactcaca aaagctttct caagccgtct 15721 tagagtctgc tgcaaagatt gagcattcac acgtttcaaa tactcgcttt cttgagccac 15781 tgcggtggac gggtttcccg gcataaagca agtggcgtta cgcgcagccg ttaaagattt 15841 acactgactt gcaaaggtcg ggcgcggcgc gtcagctctg atgatatagc aactgcggag 15901 gctgcaagcg ttaatctgac agctacgaga cttataccag tccttgcgct cagccaaagc 15961 atggttgtaa acccgcctgt gggtttctaa ccattcttca aacaatgcaa cttgttgttg 16021 ggttggcttc agcttaaatt cgtaagtcag gttaaacact ttcattacct cctgactaca 16081 ttatacaacg attctaataa tggtcaggac gttttttaaa gtcgcccgag aagggcgagg 16141 ctctaacccc agaattttcg gtagccatca tgaattacct acagtaccct taaagtccca 16201 gaaactcacg ttgttgccac agcatcaata ctgcactcat taccacaaag cccgtaacga 16261 tctgtcggaa gcggtgttca ctagttcttt ccaaagctag ctgtcccagc cagttaccag 16321 gaaatgctgc cacaccaatt aagactccgt agcatagact tttaaaagtc aaaaccccga 16381 agacagcata cgcagcaacc tttacaaggt gaatagctgc cctagcagcc gccgcagtgg 16441 caagtagttc ctcttttgaa agaccgtagt tgagataaaa cggcgtcagt actggtccta 16501 tactgccaag gatgccagac aaccaagcgt agaagaagcc agctggaaga aaataccagg 16561 actgaaccgg gaaagatacg gtttccttct ttgtaaacgt actaactgct catttgaggc 16621 acctaaataa gcctgataac ctggtaaagg tcttattggc acaattcctc tatcatctgc 16681 tcgattatgc cttgttggct tgccaaatat ttttgaaagc aacagaaagt ccagggcaaa 16741 atgattcaac tgaatgcatt gtaggattgc aaagccatag tgattgaaag aatgcagcat 16801 ttgtgtaagt tctacttcat tcattctact tactgtctca atatttataa tgatattttt 16861 aagattcata gaagtatcaa ataatttggc accaatcatc tctcacctta caaaaattga 16921 cgatttaaaa tatttgcttg acaaaatgat tagcttcaag cagcaaaaac actgcctaat 16981 tagcactcaa cggagttatc tgagttgttc agcaataaat ataagtttag cttttttacg 17041 tcagtttttt gtatattata atatacgttt ttgcaaaccc tcttttgtta ctggacttta 17101 gactatgaca tgggaaatga cctagcgacg ctgggttaaa gaagttcaca ttataaaatt 
17161 aagaaagtac ttattaagta gttgctgtag caaagctcaa tgtaggtact ttacatttat 17221 ccaccgtact acttcttcga cactttgatt tactgtcatt ttggaagtat ctagaagagt 17281 tattggtggc tttgtcttgt gtgcattatc tttaaaccaa cgattaaatg atatatgttc 17341 ctttatccat tcatccgtaa aaccacgcca tgcgggacga cttcgtaatc gtgaggtaag 17401 aatttcatcg tcgcaaatca atgccaaata ataaatttct gaaaaatagc gacgctctac 17461 gcattcttca aaatgttcag gaatcgcagc gccacataga actacggact tcccagattg 17521 cgaaatattt ttacataccc gcagccaagt ctctcgatat tcacgtaaat cagttcctgg 17581 ttgctgtaat tccctacgcc aaagaatatc actatccatt accacaactt ctttcatttt 17641 gtcagcaagt gcaagactta ttgttgattt gccagtacca ctcgcgcctg aaagaataaa 17701 caacggcagt tgtataaact tgtgcttgaa tccgcagttt ggacatacag catatggtcc 17761 agaagaaaca attactttat cagggtgata ttccccacag ttggcgcaga tgttaaacat 17821 tgatatttct gaacgctatt gtttcaaaga attgactgag cattgagcaa aatttaaagc 17881 aatagtcata atagtactgc cattttggca ggtttctttt agataaatcg gatattagta 17941 tattgtgcta atcgttattg atatttgcac gattcatgaa tatctactga ataccgctga 18001 agtaaagcta atactgaaat agtggtttca ggagcaattt tacgctttag gaaataagac 18061 atggtacttt cactgtcttc tcagaatgtt attcagtatc tgtatcaagc aggtctgtgt 18121 agctcagaag aaggtaaaaa ttcctattct gaactcccgc aaacgagtca aaaaaatttt 18181 aatttagttg ttactttgcc tgggaatcag aagctactgg ttaagcaaga acgatgtatt 18241 gataataacg aaaaccctca tgactttttc aatgagtggt tgtttcatca gttacttaaa 18301 cagtttccag tgttaggaaa tatttctgcg acggcatctt tggtagtgca ttttgagcca 18361 gaaaaatcta ttctcgttcg ccactacctt aaggagtatt ttgaactagc aagtttttat 18421 caaaaaaatt tgtactttcc aaaagcgatc gcaagcgcta ttggtactag tttgggagca 18481 ctgcatcgtg ccactttcaa ccgccgggag tatcgtgatt ttatggcaac tgctcctgaa 18541 gggcagtttc gctatcaatt ttataatcca gcccaagggt tagggtcaat tggatcggaa 18601 atttttggca atgttcctac agatgcgctg aaattctacg ctctttatca acgttatgag 18661 agtttagaag cagcaattgc agaattggcg tatgaatgga atccttgctg tttaactcat 18721 aatgacctga aattggaaaa tattttagtg cattcaaggt gggagaagtt agacaactgt 18781 cttatacgac tgattgattg ggaaggttgt tcctggggag 
atccagcttt tgatttggga 18841 actttagtgg caagctactt aacactttgg ttagaaagct tagtggtaga tgatactatc 18901 gagttagaag aatcattact tcttgcagcg attccattgg aagttatcca accttcacta 18961 ctaaatctca ttttagctta tcttgatact ttcccagtga ttcttgagta tcgttgtgaa 19021 tttgttcaac gagttatcca atttgcgggt ttagttctac ttcatcgaat caaggaaaaa 19081 ataaactctc acaaatactt tgataattcc agtatttgta tgttcaaatt tgctaaaagc 19141 ttactcagta gaccacaaga gtctgtactg actgttttcg gtatcacaga gtcagaaatc 19201 ctaaagctgt ttgcaaagtt tgttcaactc tctcatccaa aaaaagagaa taatttgctt 19261 cgtctttatt acgacaaaac tcgtctgcgc ggttgttaaa acagtgaaca gcagagtcag 19321 tgaacagtga acagtaaaca gttaacaaaa gaaacagctc cgtattttct gttgtaactg 19381 ataactggta actggtaact gataactgat aactgatacc tgataactgt taattcttct 19441 ttgagaagtt ccactcatgc tagagtcttc taccaaacca ctgctaagtt ctctattgga 19501 tattgctagc aatatccaaa ttaaatccaa cttttgcatt cgccatccaa aatatcaacc 19561 ctttgcacta ccatctaaaa tagcagagcg atttcgacaa aactcgccag cgttacaaca 19621 caaatatctt gctctactat tgcggaattt ccttcacggt atttattaca atggttctct 19681 gcaaactaca ttgtcactca gtagtgatgt caatcatgac ttgccacaga aaaacttgga 19741 aagccattct atcttagaga tggattggca attttacgag ttactacata tcagtaatca 19801 tggaataggc tactttgatc ctggttggca gctattgcga cgggaaccag atggtagtat 19861 tgcagtgagc aaaggcggtt tgacgttgta tgttgagtac aatcacaatc tagaaccgtc 19921 aacgcaaact gccaaggtag gagatttgat cgatatatgg atgcctaaaa atcgacttca 19981 aaacggcttt tacgtagcgg ttagcaatgt cggacaggat ttgcagacta acccggatac 20041 tgatttgggg gcagggcgaa tttactttaa tgtgactcca atgggtgcct tagcccttat 20101 gaatagcctc acacaacaac tgaacgctgc tgcaattccc tttagttttc aggtgttaca 20161 caatcgggct gcttatggac gctacgattc aggagtgcta tcctttgaac gcgaagacta 20221 tccagcagtg cgaaaagtcc taagaagtgt ctatgcagaa catcaatctc atttccacac 20281 agaaatcccc ttgtttacca agtttttagc acctgggttg agtttagctg aagaactgag 20341 ccaaaaattt gcggtacagg aaagttttgg catgaaccgc tgtcaaatcg tggcgaatgc 20401 tttgttggaa gtttggcaac aagataacga ttcaactgat gagcggatga gggcaattca 20461 gttacatttt actcggctcg 
gtattgattt acagcgtcct tacctcaatc cttgctctga 20521 ggatatttat tacccattaa actgataaaa cagataattg cggtaaaatt gtttactgta 20581 gcaggctgtt tgtttgctat atttttttga actttggctt gaactttggc ttgaagtatg 20641 gcctttacat ataaccgcac cattcgcttt caagatactg atgctgctgg ggtagtttac 20701 tttgccaata tcttgagtat ttgtcatgag gcttatgaag agtctctagt tatgtctggc 20761 attaatctca aagatttttt tacgaatcct tctgtagctt ttccgattgt tcatgctaat 20821 gtggactttt ttcgtccgct atattgtggg gacaatttgg ttattaggtt aatgcctcaa 20881 cagcttagtg ttgataggtt tgaagtggct tctgaggtga tagttggtga ggtgatggct 20941 gctaaagtag tgactaggca cgtttgtatt gagacaaata gcagaacgaa aacagagttg 21001 ccagaaaaca tgaagcaatg gttggagata aaccgcagag gcgcagagag tgcagagaga 21061 agaaagtcaa gagaggctat atgaactacc ccgccgcata ggcggacggg gtttcgcccg 21121 gttccgccac gtagctggat ttttgtctcc tcgcttgccc aagttgcttc atagatccgg 21181 tattctttcg tttactggtc ttagaagtag agcttagaca tccgagacta tcgaggctgg 21241 ttcccaaccc cggttgttga ttagtaacaa cgttgtacat caccatcgcg ctacccgcgt 21301 cgcgtggcac atcaaagcca caacttaggc aaacatgatg acgatttgac aattctgccc 21361 attgtttatg aaccgttcca cacttaggac aacgctgaga tggcttgaca tccttggtag 21421 gtagcataat cattaatccc ccctttgcct caatcttata agtcagcatc ttgttgagag 21481 tgccaaaccc gactgaaaga atagatttat tgagtcctgc tttttgcttt ttacgtttgc 21541 taccgttctt cgctttcctc gtcatcccct ttgtgttaag ctgctcggtt acgccgatgt 21601 cataacaact tgcaatctct gacgtaactt tatgctgcca atcacgacgt tgacttccag 21661 ctttgcgctg caattgagca acacgcctgt tcgctttctt ccaacgtcgt gacgccttga 21721 tcttcttcgt acgatttggg gcttgtttgc gtcgtagttc tttggacacc ttttttatct 21781 tcgcttcagt tttctgagta aaacgcggat tagcaacttg ctcaaccttt ttcccgtcat 21841 aagttgtaat tgctgtttcg catcccaagt caaaagcgac gatttcgttg tacgctagct 21901 cggattgaga cccaaacctc acggtaacat cgggaacttc gacagtgaaa gaagcaaacc 21961 actgtttcag cccaggcttg taaacaacgg tgagagtact aggcttgccc cactgcctag 22021 cttgacctcg catctttatt ttgaccccaa ggtcgttgag gtttacactg ccatgcctcc 22081 ctgatgtgtt gactttccat cccgaagtgg cagggtaagt ccaacctgaa aacttgcgaa 22141 
ttgacttgaa tttaggcggc ttacgcagtc cctgaaagaa agcgttgtac gccaagtcaa 22201 ctcgctttac cgtagcttgc aatgcttgag agtgaagata agcaaactcg acccactctt 22261 tcttgtaagc aggcaagcag ttttgctgct ctaaatatga aactgatttg cggttagctc 22321 tccactcata tttacggtga gcaatgcagg cattgtacaa gtagcaatgg tcacgtcttg 22381 cttgaagtaa tttagcttct tgtactttgt ttggatacag tcgaaatgtt tgtctcctca 22441 cagccacaac ttgctcaccc ccttggctta cttcactcga tactatgaac gacaattagt 22501 gtaacacggt ttccggaagt gactagaatt actgcaaaat gttatgcgta aaggcgctca 22561 tgtcgttttt gatattcact tgcatattgt ttttgtgaca aagtatcgac gcaaagtcct 22621 tactcagtcg atgattgaag atataaaaga gatatttgga cgagtactgg agaacagtaa 22681 ctcattgctc gaagaatgta atggtgaagc tgaccacgtt catctgttga ttagtttgca 22741 tccagataac aatatttctg atttagttgt ttccctcaag tcagccagta gtcgaatcct 22801 tagagaaaaa tataggtctg aaatagataa gttttactgg ggtaaagcga aattgtggca 22861 cgactcaaag tgtattgtct cttgtggggt actttactta cgcgcccgct gtactagtag 22921 caatagagtg gttagcgtga ggcaattcag agcttacttt tatagaattt ttttgtaaat 22981 tagcaagatt ccaaatgtaa acactaatga cagcaaacat cctaaatgct gaatctatgc 23041 tttggggaca tataagcgct gattttgctc ttatctattt attgtttttg gagaattttg 23101 tattcaggat tactaagaag cttaatatta actatattcc ttaacggtag gcaaatatat 23161 cacttgatag aagattggat aaataattag ccatattctt aaaaaaaagt gatgtaaatg 23221 ttgcccaggt aaatagcctg actagtttct aggcaggtaa aaaaaaacaa aagcttcgtt 23281 atttagataa ttattactgc actttggctg gaaaattttc cgagtcaagg ttattttggg 23341 caattaaatt tcctacaaac acaaaattga gataacctat gtcaggcgaa catactcaaa 23401 taaaaaataa cggtatcagt aatggtaagt cgcgaaccaa gctatcgaca ctaactgaac 23461 cgcaatcatc aggtgcagcg aaattgcaga agttatcttt atccaacaaa agctttaaag 23521 aagcttcaga ttgtgatacg cctgatatag tttgtttttc tcatttgcgt tggaatttcg 23581 tttatcaaag accgcagcat cttctagttc gttgcgctca aggacggcgg gttttcttta 23641 ttgaggagcc gatttttagc accgaaccgt tgggacggtt ggaagtaagc caagataaga 23701 atggggtagt ggttgttgtt ccacacctat cagaaggttt gagtgaagac ggtatcaacg 23761 cggatttaaa agtgttgatt gatggcttgt ttgcacagca taacatctgc 
aagtacatgt 23821 tttggtacta cacgccgatg gcgatcgctt ttacaagcca cttggagcca gaagcagtac 23881 tctacgattg catggacgag ttatctgcat tcaaaggtgc gtcaaccggt ttaaagaact 23941 acgaagccga actattccgc cgtgcagact tggtgtttac gggtggacaa agcctttatg 24001 aaagcaaggt gaaccagcac cccaacgtct atgcgtttcc aagtagtgtg gatgtaccac 24061 attttgccca agcgagaaat ctgaaagaag aaccagcaga tcaagctaat attcctcacc 24121 cgcgtcttgg gttctttggc gtgattgacg aacggatgga tattgagctg gtggcgggaa 24181 ttgccgatgc gcgtcctgac tggcatttgg tgataattgg tccagttgtg aaaatcgacc 24241 tagcacttct gccccaacgt gagaatatcc attatctcgg tggtaaagat tataaagaac 24301 taccatacta tttggcgggg tgggatttgg cgatgctgac gtttgcgcgg aacgaatcaa 24361 cgcgctttat tagtccaact aaaaccccag agtatcttgc cgcaggtaag cctgtagtgt 24421 ctacctccat tcgagatgtg gtgcgcccct acggtgactt gaagttggtg cgaattgcag 24481 acacggctga tcacttcgtc accgcagcag aacaggctat gcaagaagac accgcagcat 24541 cagggtggct gagtcgcgta gatgcatttt tagagcagat ttcttgggat agaacttggg 24601 gatcgatgat gcaacttata gattcggcga ttgctgcccg tcaagatagc gcagttacaa 24661 atatccccca agcaccaagc atcattacca gagattttgt cttcgattac ttgattgtcg 24721 gtgcgggttt ttctggaagc gtcatcgctg aacgcttggc aactcagtct ggcaaaaaag 24781 tgctggttgt ggacaagcgc aaccacatcg gcggcaacgc ttacgaccat tatgatgagc 24841 atggtgtcct cgtacacaga tatggtcccc acatttttca caccaactcc cgcgaagtct 24901 ttgaatacct ttcgcagttc acacagtggc gtagttacga acatcgcgtt cttgccagca 24961 tagacggaca gcttcttccc atccccatca acctcgacac catcaacaaa ttgtatggaa 25021 tgaacctcaa ttcatttcag gtggaggatt tttacaagtc gcttgcccaa ccaagagaat 25081 acattcgcac tagtgaagat gtggtggtga gcaaagttgg tcaggaacta tatgaaaagt 25141 tctttcgggg ctacactcgc aaacaatggg gactcgaccc ttcagaactg gataaatcag 25201 taatcgcccg gattccaacc cgtactaatc gcgacgacag atatttcact gatacttatc 25261 aagcaatgcc gctgcacggc tttacccgga tgtttgagaa gatgttaaat cacccgaaca 25321 ttaaggtaat gctgaatacc gattaccagg aaatcgaaaa agcgatacct tgccgggaaa 25381 tggtttacac aggtcctgtt gatgagttct ttgattatcg ctttggcaag ttgccctatc 25441 gatcgcttga ttttaagcac gagacgcaca 
acaaagaggt gtttcagtca gcaccagtca 25501 tcaactaccc gaatgaacag ctgtatactc gcgttacaga gtttaaatat ttgactggac 25561 aggaacactc taaaactagc atcgtttacg agtttcccaa ggctcttgga gatccgtatt 25621 accctgttcc acgtcctgaa aatcaggaag tttacaagca atacaaggcg ctggctgatg 25681 caactcctgg tgtgtatttt gtcggacggt tggcaactta taagtattac aacatggatc 25741 aatgtgttgc tcaggctctt tctgtttaca aacaaatccc agttagggct tgagaattct 25801 ctgtgttctt tgtggttcaa taatttaaga aaccacgaag gcaccaagaa cgtgtatctc 25861 ctgcgcgaat gccgtcttcg catnnnnnnn nnnnccctat tttagttggc agaagcgaca 25921 aagcaactaa aattttaggt tggcatccgc agtatccaaa caagcgagga aattatcgac 25981 cacgcttggc agtggcatca aaaacggcat ggataaaaaa gaattcagga gccagaatta 26041 atgagaaccg agaagcccgg tttcgcaaga gttaccgggc ttctgagcaa aaattgctat 26101 aaaactgata tcacgtttgg ttaatgaacc tgaagaaaaa aacttctaaa aacttcagtt 26161 ttttccttga ggactttagt cctcactaca aactttagat taatctactt atttcctttg 26221 gtagattaaa tgaaaattga aaaacagata attagaatga aacaggtgaa atcacctttt 26281 gaattgttag ttttatgcaa aatatttacg gaaatcttca ggggataaaa tcgaatcaaa 26341 tcaagttatt acagcggctt tacgaggaac gacaaccgaa agatagattc attacgctgg 26401 aatttgccca agccttaggt gaaatcagta cagaaatcca tcagccgatt tgttgttata 26461 ttaatcggcg cggacaagtt atcagaattg ctgtgggaac gccaagtcaa acccaaattc 26521 cacctcaaga attacctcgc catagtgcgg aacggttgag cggaattcat tgtgttgcga 26581 ctcaattcaa atcagagcca cctgatgaag cagcattgat tgcaatggta cgccagcgct 26641 tagatgcttt ggtggtgcta tccttggctg acggtgaagg tggaagacac aaaaaagcag 26701 ccactagcaa tgttaaagaa gctttcatcg ctaaccttgt ccccaatgct gaaaagcctt 26761 gggagatttc tcctccccta agtttagatg acttaacaga gcaagacttt gacgacttag 26821 ttggtgaatg ggaaaaggag atttctgact ctggcgatag catgtcattc caggacattg 26881 tatctgacca ggataaagtg ctgctggtgg gattgaagac agatgatatt tctcaacaac 26941 ggtttgaaga tggactgcaa gaactagttc gtttggtaga aactgctggc ggaattgtat 27001 cagacacagt ggaacaaaag cgaagtcgtc cccatccgca aactgttgtc ggtaagggaa 27061 aagttgaaga aatcgctttt caagctcaaa aggtgggagc taatttaatt atctttgacc 27121 gagatatttc 
ggcatcacaa gctcgcaact tagaaaacga gattggtatg cgagttatag 27181 atagaacgga ggtaatttta gatattttcg ctcaacgcgc tcaatcccaa gcgggtaaat 27241 tgcaggttga attagcccaa cttgaatata tgctgccacg gttgcgcggt caaggtcggg 27301 aaatgtccag attaggtgct ggaattggta ctagaggacc tggtgaaact aaattagaaa 27361 cagaaagacg agttattcaa cggcgaattg ctcagttaca gcaagaagta aaccaactac 27421 aagcacatcg cgatcgcatt agacaacagc gacaaaggca agaaattcct gttgtcgcat 27481 tagttggtta caccaatgct ggtaaatcca ctttgttgaa tgtgttaact aatgcggagg 27541 tttatacagc agaccaacta tttgccaccc ttgacccgac aacgcgcaag ctggtaatta 27601 caaatccaga aactcaagaa cgcagaacta ttctgctgac agatactgtc ggttttattc 27661 acgaacttcc accagcgcta atagatgctt ttcgcgccac tctggaagaa gtcattgagg 27721 caaatgtgtt gcttcatgtg gtggatttgt cccatccagc ttgggaaagt catatcgcaa 27781 gtgtagagga aatacttgca gaaatgcctg ctatcccagg aaaaagtttg attgtcttta 27841 acaaaattga ctctgtagat agcgaaactt tagcaaaagc gcaacaagaa tatcctgaag 27901 cggtatttat ttccgcaact aagcgcttgg gtttagagac tttaaaaggg cgagcgctgc 27961 aactaattga cgaaaccgtt gccacgtcag aactacagaa cgcagttaca caaagctaga 28021 agttgtggat tctgtctaaa atcaagatat gatgacaaga cgctacaacc ctcaacaaat 28081 agaatcgttc acccaatcgg tactgaacga cggcttctgt gtgttgccca atcacttttc 28141 tccagcaaca ctcaaggctt ggcattctgc tttcatcccc aagctaactg aacatatcgc 28201 cagcgaagga cacctgcgta atcgaggtac agcccgctat tatgtaactt tgcctttcgc 28261 cgctccattt gctgatacta gtatctatga agatgaagat ctcctggata ttgtggaacg 28321 cttggtaggc gctgatttcg tgatgtgcca gttggcgaca gatacgcctt tgctacattc 28381 agagtatcaa gacatacacc gcgatacttt gccactcttc cccgaaactg gtatggaaac 28441 accgccatat cagctagcag tcaacttccc actggtagac gtcaccttgg aaaacggacc 28501 gatggagatt gcgcggggta cgcacatgat gtccaaagaa gaaggattac gccgtataga 28561 gtcgggtgaa attaagttgg aacccgtaac tatgcaactg ggagacgtaa tgattcgcga 28621 tgtgcgtggt cttcaccgtg gcacacctaa ctacacagaa acgccgcgtc caatggtcgt 28681 aattggctat agccgtcgct ggctgtttcg tccggaggtg tctatccaga taccgcgtgc 28741 tgctatgacc acactatccg aacgaggtcg tcacttatta cgctttaatc cgattgtcga 
28801 atcacttgac gaatttaccg gaactgaggt ttatcagtcg ttcgcttact agtaagcaaa 28861 cgattgcatt tactcccttc ccactacaaa attacctatc ttcttttttc ttacttcgtg 28921 tccttcgcgt ctagccttcg gcaacgccaa gggcgaacgc ggttcgttaa ataggtattc 28981 ttctgacggg aagggagtaa gaacgcaata tcaaattata acttttgtgg gatgggcatc 29041 ctgcccgtcc taaatatgca gtcagacaaa attcataatc aaaaaaggaa atttatgaca 29101 ttagctatta ttgttcacgg cggagcaaaa accatctcag aagacaaagt tgcagccaat 29161 aatgcaggct gcacagcagc agcagaggct ggttgggcag tcctgacaag tgggggtact 29221 gccgcagaag ccgttgaggc agccatccgc gttctcgaag ctgaccagac ctttaacgcc 29281 agtcttggcg cgactctcaa caccgaggga gaggtggagc tagacgcggc gataatggat 29341 ggatcttctt taggttgggg agcagttgca gcagttcagg gtgtgcgtca tccaatctcg 29401 gtggcacgga aaattatgga tgaaaaaccc cggatgttag tggcgcgggg tgcagaacgc 29461 tttgccgccg acaacaaagc ggaaatgtgt aaaaaagaag acttaatcgc tgatgagcag 29521 tgggagcagt ggaaggagga tcaagaagtt ctggatcgcc ccaacaccgt tggttgtgtg 29581 gctttggatg ctaacggtat cttagctgct ggcacctcaa ctggcggcac tacgaatcag 29641 caagccggtc gcgtcggcga cactgctctt gtcggctgtg gcttgtacgc tgacaatcaa 29701 ctcggcgctt gctcaaccac aggtgatggt gagtcgatta tcccggtggt tctggctaaa 29761 actgcgattg actttctaga tggagataga cacccagacg aggcagcgca gaaggcgatc 29821 gacactttga aatctaaggt tacaggagaa gctgggtgta ttctcttaga ccgtcaggga 29881 cgagttggtt gggcgtataa ttcatcacat atggcttgtg cttacatgac cacggcacaa 29941 gacgaggtgg ctgtgtttac caaaaaagaa gctgctttgt ctcatcaaat gtgatgggtt 30001 tagggtattg tagagaggcg aaatatcaag tctctacaga acgtctaggc gttactcaaa 30061 ttagtgcgga atttttgacc aatcgtttct gcttgcagat agaggaatca aaatttcgca 30121 actaatacct taggtgattg ggtcttgaaa acagtttaat taaaactaga tatgaactca 30181 aaatacgact ttcagtactt tcaagggagt tatttatcta atctgtttgc tcaagacatt 30241 acagatgcac gttatgcgtt catcctaaat taccctggga ctgccagttg ggcagcttac 30301 cccaatagaa aaaaatactt tattcaagac ggtagtagtg aagcgaccaa aacctccttc 30361 gacaagattt gccagaagga accttggaaa aatctggctg tgttaggcga cacccttcca 30421 ggaattgtga ttatttcacc gccaaagtta ctgattgact 
actggcagga gcattttggg 30481 ttcagccact ccaatatgaa tatggagatg atggattgct caacttatct gaatgacctc 30541 aatcagagcg aacgcacgga caaactcatc actttatttc cctttgataa tctgcaacca 30601 gaaaaacacg ccgttaatcc agatacccac taccgcttgc tgagtaaagt aaccctagcc 30661 gagttaggag tgcaatgtcc gaaatactca agctacaatt tgcacaccca gagtctcgaa 30721 gacattgagt taccacaatt cccatactta atcaaaacgt cccacggact ttcaggagag 30781 ggcacttata ttatcaaaag cgccagcgat cttaactact gccttgaaga aatcaggaaa 30841 tatcttgata tcaagttgct cgatacgatt attgtctcag agttcgtcaa aaatcaggtg 30901 cagaactatt gcgtgcaatt ctatgtcaac aaagcgggag acataacact catcggcacc 30961 acgagtcaac tcgtcacccc agagggcaac tatttagggg gactgattca ctaccgcgaa 31021 actgacatga gcaagttcta cgagatgatt gctgctattg gtcagtatgc tcataagctt 31081 gggtatttcg gtgttattgg cttcgacgtg ttggaagacc aagacggaga attttatgta 31141 attgatgtta atttccgagt caatggttca actgggctgt gcttgcagcg ccatacccta 31201 ctgtcccttg gaaaggaggt ggctaaatat tcgagtgagt accgcatgga tgggacgttg 31261 gactcgattt tagtaaccct gaaaccacaa ctggatcgca aagactttat cattttatca 31321 gctttagaga aagtcaaata cggaaaaatc tacaccgaaa tttacgggat tgtgactgga 31381 cagacaatcg aggaaatgca gcacatcgag caaaagttac aaaataaggg attgcaaatg 31441 agcttcgttt agcgtgaaat aacctagcca tacatgctca tagcatttca tagtcgagag 31501 caagaagctg gtagattgca caaaaaaagt atttttgctg aggaatctac ctgaagctct 31561 atggtagagc ctagaccgtc ggacatagga cataggacat agagcatact ttgaattata 31621 aatcattttt aatttgagcg tgctcgtggg tcaagagtga aaaattaaag attttttatt 31681 ttccacgttt tatatttcat cagtgagcct agtttgtaga aattttgtgg ctatttgctg 31741 tagctgttgg cgattgattt taccttgaga gttgcgaggt aaggtttgta caggaatcca 31801 atttttagga attttaaatt tgctgagttt atctttcagc agatttcgga tttctatatc 31861 agaggtattc gagtctttgg ggatgtaaat tgctgtgact gcttgtcccc agagtttatc 31921 tggcataccg atgacacaaa catctgctac catttttgtt gctctaatag atgattcaac 31981 ttctattgga tagacatttt caccgcctgt tataattttg tcgctgttac gtccgacgat 32041 atgtaaataa cctttgttat ctaaaaaacc caaatcatct acttgtaaat aagccggatt 32101 ctcccaaatg ttaggatagt 
atcctagggc gagagcttga gagtagattg tgatgtttcc 32161 gatttgattt gaattcaatt cctcgccttg ctcattgcaa atttttatag aggcatgggg 32221 gagaatttga gtacagttat ctttaccatt aagaaattca tctggtttga gggtggcaat 32281 ttgggaggcg gtttccgtca tgccataagt gggagataag cgaatattgt aatatcttgc 32341 tttttcaaga agttcattcc acgctggtcc acctcctaat agtacggttt taaattggga 32401 tagccactga gttaattctg gattttgtag cagacgctgt aactgtgttg gtactaaaga 32461 aataaaggtt tttgatggtt ctatgttgtc tacttgacga tattctaatt ctttgaatga 32521 cagaataact agttttccac cagtcgtaaa ggaacgcata aattgcatta acccgctgac 32581 gtgatagagc ggtaagacac aaaaagaatt aacatgatta atcaaaaaat gttctgtaaa 32641 tcctcgcacg gatgccatta aagtttccca ggtgtgaatg gcaaacttaa tctttcctgt 32701 tgaaccaccc gtgggaatca tgatgagtcc tgagtgaggt gatggaatga taggaatttg 32761 gaatttggaa ttgttttgtc cccaattttt agttccaagt ccccaaatta aatctggctc 32821 gactaaatca aaaacttgtt gccattcttg ttttgtccag tcggggttgc agagaaaaac 32881 tggacaacgg gcggcacaag ccgccagaaa acctgctaaa aaacgcactg gatcgcgttc 32941 agctaggaga attttgattg gtgtgcctcg ataatatgaa aactgggtta attctaaata 33001 gagttgttcg gttaattgag caaataaatg gctatcatga caaatgagcc agtcattgga 33061 gagagactgg caagatgcat tctgcacaaa attttctaaa gttttttcca taattcttcg 33121 ggcgttgtct cttgttgagc aaaccaatgg ttgacaccaa aacccactgc tctgttacgc 33181 tgaggaggac acttccgtgc gggggttccc caacgccaga cgcctacgga gggaaaccct 33241 cctgcagcgc tggctcccgt tgaggaaagt gtccgtgata attcggctgc gatctggagt 33301 gcggcttgtc taccaattgc ggtttcaaac acggatgaaa atacagcatc aattttatgc 33361 tgttgacaaa actgacgcag acgtgctggt gaaccgacta taccaggctt aatcacaaaa 33421 atttctcgcc aaccttgttg gaagcatgtg gcgagttggt tgagtgtggc gacagattca 33481 tctaaggcga tcgccgtctg gtaacaatga ctcaactcca acatcgccgc aaattgatcc 33541 acagacaaag gctgttcaat aaattcaatt tctaagcaaa tttcttcatt tgccttgata 33601 ttatcgcaat ttacgagcca taactgagct tcttcatacc tcagtccacc gttcgcatct 33661 aatcgtagtt ttgcagaggc tggtaaagtt tgagagagta attcaaaaat ttctagttct 33721 tgaacaattg gataaacgcc tatcttccac ttaaaggtac gatatccctc cttccacagc 33781 
ttttgccacg cctctagcgc cacttcccca gctggtaata aagcgctata gcttaatgtc 33841 tcaattttta tcagatcttg taccagtccc aacaaccgcc tagggtttaa aaccccaggc 33901 tcatagccca agtcgtctaa agacgactgt ccaaggtttg tagtccattt taatggactt 33961 gggctatagc tgtagaaatt aatttccggg cgggtgaggg tgctaatcgt ttcccaagca 34021 gactcgaatc caaattgaca cgagggtaac tcgtcaggaa tagagaaaat cgtctcttgt 34081 gtgatttcct ctggaagctg acggcaaaat tctaaagctt gttctagagt ttctgaacca 34141 aaccaactga tgggtgcaat ttccccccag ccgatttttc ctgtttcatc ggtgagacga 34201 agaataattc cttcgcggat atcccaaatg ccatgacttg tcatgagcga agtcacaaat 34261 ttttgctgat aaggacgaaa ctcaaatcga tatcgcatta taagcttgtc agtgcaggta 34321 aaaaggtaac tgttttgtgt tacctgtaga aatacgacca cgaaatatca ttcttttgga 34381 agttaactta aagaaaattt actctttcta gatccatgac agtttcttca tgtaatgctg 34441 acttttttgt ttataataag gaactatact gttttctttt agctgttcaa caaaactgca 34501 ttaagaataa tgacacgcaa attgcaagct tcgcgctaga cattgactgg gttgacccct 34561 tggttgtact tgataaatta gcacagccaa ataaggttag tttttattgg gaaaataaaa 34621 gaaaaaagga agccctggct gctgttgatg ctgtcgcgaa aatagaaatt gcagggaaag 34681 accgttttgc aaaatcagaa gagttcatca aagaatgtgt taaaaatata actagctttt 34741 ctagaacgaa tcaagctttt tttggaactc gatttttatg tagttttagt ttttttgatc 34801 aaaatagcca agcagattat ccattttctg cagctaccat atttcttcca cgttggcaag 34861 tggctgttaa aaatgagcgt tgcgtgttgg tttttaatac aattatcaat gctgacacaa 34921 atattcaaag aatattgcaa agtttatcga gaaaactaga aattatcaat tctttagaat 34981 ccaactcgca aactcttgat tactctttgc cgaaatttag tcataaatct gttgcaaatc 35041 ctcaacattt caagtgttcc gtcttgtcag ctttggaaaa aatccagtct aatgatttaa 35101 ctaaagttgt cttggcggat atactagatg ttaggtcaaa tgctcatttg aatgtcataa 35161 aatctttaaa taacctgaga cagttgcatc ctaattgtta tgtattttct catagtaatg 35221 gtaaaggaca gaattttatc ggggcaagtc cggaacgttt aattaatatt caagaacaac 35281 agttaatgac tgatgctttg gctggttctg cgccacgggg taaaacacct ggtgaggatg 35341 cgaacaatgc gagtcgtttg ctcaatagtg agaaagaaag acacgaacat tcgttagtga 35401 ttgattttat cactcaacgt ctctctcaac taggtttact acctcaagtc 
ttagcacccc 35461 gcttgcgaca attatctaat attcagcact tgtggacacc tatcagcgca ggagttccgg 35521 ctgatgttca tcctttaaag atagttgctc aactgcatcc tacaccagcg gttgcaggta 35581 cttcccagga agttgcttgt cgggaaattc gtcgttatga aagctttgaa aggggtttat 35641 atgctgcgcc gcttggatgg gtagacttgg agggaaactg tgagtttatt gttggaattc 35701 gttcagcact tatagatggc gatcgcgcca gactctacgc cggtgttggt atcgttgctg 35761 gttccgatcc cgataaagaa ctcgcagaag ttcaactcaa actacaagcg ttgctaaaag 35821 cattagttta gtattttgtc caatgtttgt aacaattgct taacagtgta aggctttaat 35881 aaaaatgttt caacaccaag aggttgcgtc aattgcttat tgtctgcaag tccacttgca 35941 gccataatct tgacatatgg attcattttt agcaaagtac ggatggtggt taaaccatcc 36001 ataactggca tcatcatatc cataaacacg agtttaattt cttctcgatt ttgagcatat 36061 gatgcaattg cctcaatgcc atcacaagct gttaacactt tatagttgta gttttggagc 36121 gtaattttgg taatctctcg aatttcagct tcatcatcta ccaccaaaat caattctcca 36181 tttcctgtag gcaattccaa agattcttct agcggcgttt ggttttctaa aactgctttt 36241 aaaaagactt taaattgtgt tcctttacca accttgctag aaacactgat aaaaccgtta 36301 tggcttttca taataccaag cacagttgaa agacctagtc ctgaaccttt tccaacctct 36361 ttggttgtaa agaaaggctc aaaaattcta tccaatactt ccgggggcat accaattcca 36421 gtatccttga cggtgatgac aatgtggggt ccaacagttg catcaagatt catgcgggca 36481 tagttttcgt caatcaagaa attttcggca caaagactca aaacaccgcc ttttggcatc 36541 gcatcgcgag cgttaataac tagattcatc agcacttgat gcagttgggt cgcatctcca 36601 gaaacagtcc aaagatttgg tgcaatatca gtgcaaaatt cgatagattt ggaaaatgtt 36661 tgtttggcaa actgctcaat ttctgatatc aaatgcttga tttgtaaaat cgtgcgctta 36721 ccttcgacac cgcgtgcaaa tgatagcact tgcctaacca tagctgcacc acgttgagcg 36781 tttgtttcta atgtctcaag aagctgctgg ttgcgctcat cagaaattct catccgcaaa 36841 agctgagccg acatcagcat gggagccagt gtattattga gatcgtgggc aataccgctc 36901 gcaagcgtac cgaggctttc cagtcgttga gcacgaagaa actgtgtctg aagttttttc 36961 ttttctgtga tgtcggtgtt tacagtcaag attgattttg ggttctcgtg ttcatcgcgc 37021 acgagtgtcc agcgagtttc aacaatcact tccttgccat cttttcggat ctggctcaat 37081 tcaccatacc actgacctgt caagttaact 
tgagacaaag cctcttggag ttgcgaataa 37141 atttttttat acaaaagctc attagcattt ttacccaaag cgtcttttgc tttccaaccg 37201 taaactcgtt cagcgcttct gtt // LOCUS NODE_701_length_36763_cov_5.65024036763 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 36763) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 36763) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..36763 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..941) /locus_tag="DP116_05610" CDS complement(<1..941) /locus_tag="DP116_05610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861129.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-hydroxy-acid oxidizing protein" /protein_id="PRJNA477356:DP116_05610" /translation="MEEVKSLKPLNLFEYEQLATKCLSQMALDYYASGAWDEVTLRDN RTAFERFKLRPRVLVDVSQRNLATEVLGQPLQIPLLIAPMAFQCLANPQGEVATATAA ASAGVGMVLSTLSTKSIEEVAAVCHDTNLQTPQWFQLYIHKDRGLTRALVERAYTAGY KALCITVDTPILGRRERDKRNEFALPTGMELANFTNLSGLNIPHQEGESGLFTYVAQQ LNSAVTWDDLEWFQSLCPLPLVLKGILRGDDAVRAVECGVRAIIVSNHGGRQLDGAIA SLDALAEVVDAVDGRAEVLLDMSGNLQSTPHLALHSI" gene complement(1599..1835) /locus_tag="DP116_05615" CDS complement(1599..1835) /locus_tag="DP116_05615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013190688.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05615" /translation="MDILLLTLINVAVCFAFPKVIFMILATVTGQNQLLQTTSTSQKK VIELTSFPYCTSYTLTKRPFCKFAPSFCDRCSPG" gene 2108..4102 /locus_tag="DP116_05620" CDS 2108..4102 /locus_tag="DP116_05620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309714.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="long-chain fatty acid--CoA ligase" /protein_id="PRJNA477356:DP116_05620" /translation="MSKTYDIYSIISNLYELERRNLERLADYSNLNSLPEIWSLVAKR CGDTVALRNPHAKPEVVITYTQLFQKIQYFAAGLQALGVKAGDRVSLISDNSPRWFIA DQGIMTAGAVDAVRSSQAEREELVFILGNSGSTALVVEDLKTFNKLKQRLGDLPIQLV ILLSDEVPPTDETIKVINFNQLMEIGANHNLTPVKQNHDTLATLIYTSGTTGKPKGVM LSHGNLMHQVVICGTVLQPEPGAVVLSILPSWHSYERTCEYFLLSQGCTQIYTNLRSV KGDLKEFKPNYIVCVPRLLESIYEGVQKQFREQPANKQRLINYLLGVSQKYIKARRIA QGLSLENLNPSMVERLAASIQASALFPLHALGERLVYAKVREATGGEIKQMISGGGAL PKHIDEFFEIINVEILQGYGLTETSPIVHVRRPWHNVRGSSGEPVPGTETKIVDPETR KTLPLGERGLVMLRGPQIMQGYYQNPEATAKAIDKEGWFDSGDLGWVTPQNDLVLTGR AKDTIVLTNGENIEPQPIEDACLRSPYIDQIMLVGQDQRSLGALIVPNLSALQKWVEA KNLHLRLSDEAAKQISTEDAPGKTEVTLESKMIQDLFRQELIREVQNRPGYRPDDRIG PFKLILEPFSPENGMMTQTLKIRRQVVMERYHDIINPMFA" gene 4145..4594 /locus_tag="DP116_05625" CDS 4145..4594 /locus_tag="DP116_05625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210070.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05625" /translation="MDASKSQLLLKRVVNVKAIVTPLWKEEVQQQLQAQINQIDQQLQ QIDMEGQRAISAVQKQSLQPPGPQTLQQIENIQGQINQKKSELLEQKNQSLQNLQQVQ FLELDQEVNQFQMEGFFHVEPGDNLISKLQVEVVLRDGIVEEVRGDI" gene complement(4648..5172) /locus_tag="DP116_05630" CDS complement(4648..5172) /locus_tag="DP116_05630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319194.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05630" /translation="MSAKLKLFLIVILSLFATAATGVYIIERQPSEAVYSMNNGQPPQ FSDVNITQEIVNSERANYNTVPLESINSPLQGSDPADLALNAFDNMDSTLETRKVEVF YPYPNQALVTITQIEPTKNFLKAMKYRVELTTFGRSLFVSSPRVWQIVWAGSQVQCIP GSSLHLPQSTQTCQ" gene 5544..6869 /locus_tag="DP116_05635" CDS 5544..6869 /locus_tag="DP116_05635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-oxo acid dehydrogenase subunit E2" /protein_id="PRJNA477356:DP116_05635" /translation="MSIYEVFMPALSSTMTEGKIVSWVKSPGDKVEKGETVVVVESDK ADMDVESFYEGFLAHIIVQAGETAAVGSAIALLAETEAEIETAAQANSGSSAAKHEAT AAIKSEKTAETTTVATPAASQNGTSSRTNGRLVVSPRARKLAKELKVDLSGISGSGPH GRIVAQDVEAAAGKSSKQPATATPVAPPQPAPTITPVAPTPTKVAPAPAPAPAIAALP GQVVPLTTLQNAVVRNMVASLSVPVIHIGYTITTDALDKLYKQIKSKGVTMTTLLAKA VAVTLQKHPLLNARYSEQGIVYHSSINVAVAVAMDDGGLITPVLQNADMVDIYSLSRD WKSLVERARAKKLQPEEYNTGTFTISNLGMFGVDRFDAILPPGQGSILAIGASRPQIV ATAEGLFGVKQQMQVNITCDHRIIYGAHAAAFLQDLAKLISTNPESLML" gene complement(7012..7272) /locus_tag="DP116_05640" CDS complement(7012..7272) /locus_tag="DP116_05640" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05640" /translation="MAALGALFRRFFDEFNDTRLSFVELFNSKYFTSFAGMPQRLGIL YCPIRLMLLSLVPTLSQTLGLARVLLLFLAITLLTLVLDYGL" gene complement(7758..8735) /locus_tag="DP116_05645" CDS complement(7758..8735) /locus_tag="DP116_05645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194860.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PatU" /protein_id="PRJNA477356:DP116_05645" /translation="MNSDSESLQNHLIAWLLAKNAQTNDPKLVNCEENEGVEHINQTA AALNGGAFELRCLPRTIQLGEIPTVQERFQAVLKRRLQTQIQNHPPLFPWEAQLIEYP ECLDKPALELVPVWGWAVQQSKLNLPIQLPERIFQQLLEKCQAMIASSIPLGAKLVAA VESLFPEEYQTLNDLAGIVLRSPSRSDALETMPNLESDYSDLQPQQQMALSLLAAKQL LENLTLPVSATNPVVERQWLTSAGVMTLKVEYQTQDQLTKLRVETELPSKAIVNLQGD AAQATAESSSPGCLSVELHNTQLNQTYTLEVELKEIDQQPLVFVIVPTL" gene complement(8728..9921) /locus_tag="DP116_05650" CDS complement(8728..9921) /locus_tag="DP116_05650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05650" /translation="MNSAATITTIQGVSSIGVEAIFQLLFKELQQSTKASEQNCRDVA TRIAAEVYRICSESKRIQAAGTVENSAMTLARHRLQQCLRYYELGSNRGRVELHSTLS AIIYRYINPPQRQLSYQGRLVVIEDFLQSFYLEALNAFRRENQLGTSYRPQTLLELAE YMAFTERYGKRRIPLPGRQQQLIILRAQTFSQQQPPETCVDIEQAAEGSGNEADGSWE DPAVQQLRSAMATQPEPEPQEDTLRSVVITELMNYLEERQQSDCADYFTLRLQDLSAQ EIESILGLTARQRDYLQQRFKYHLIRFALLHRWELVHEWLEADLQTNLGLTPQQWEVY TAQLDEKQQSLLELKQQGHPDEKIAKTLGLSMAQMQKRWFKILEQAWEIRNSLISGSG ASTHE" gene 10127..10600 /locus_tag="DP116_05655" CDS 10127..10600 /locus_tag="DP116_05655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458041.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05655" /translation="MDLKAQIQLLIDNAPQDGVTPQLVTAIAPVLSARAQKLRHSQYY ILQNMEEGWVLTTLSARANPQLEKRVIYAFPTIQDVPLGSSAGLDPQLIAAPIPVTHI LFQLVAMEPVDSIVFFETPGITTDSVEVRRADLQHLIQQRLQQNRVKKQIPPDIA" gene complement(10674..11645) /gene="sds" /locus_tag="DP116_05660" CDS complement(10674..11645) /gene="sds" /locus_tag="DP116_05660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949902.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="solanesyl diphosphate synthase" /protein_id="PRJNA477356:DP116_05660" /translation="MTPATSLFSPVEADLQILADNLKQLVGNGHPILCAAAEHLFGAG GKRIRPAIVLLISRATMLEQDITPRHRRLAEITEMIHTASLVHDDVVDESEMRRSVPT VHSLFGNRIAILAGDFLFAQSSWYLANLDNLEVVKLLSEVIMDLATGEIQQGLNRFET NISIDTYLKKTYYKTASLIANSSKAAGLLSNVSQETADHLYSYGRHIGLAFQIVDDIL DFTSSTDTLGKPAGSDLKSGNLTAPVLFALEEKPYLEVLIDREFAQKGDLEQAISLIH DSQGIQRSRDLAEHHAKLAVEHLADLPSSESWQVLMKIADYVLSRLY" gene 12739..12933 /locus_tag="DP116_05665" CDS 12739..12933 /locus_tag="DP116_05665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05665" /translation="MLYFLYTSVGKKMVYGKMPMIIFILASGYLLAVYLLLALAKRTG KKTTATSVSLGMHSKGGKSA" gene complement(13051..13911) /locus_tag="DP116_05670" CDS complement(13051..13911) /locus_tag="DP116_05670" /EC_number="5.1.1.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319260.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutamate racemase" /protein_id="PRJNA477356:DP116_05670" /translation="MFPFFTFEGNLYDFSEAPQRAPIGIFDSGVGGLTVLHQLYRQLP NESIIYFGDTARLPYGIRSQTEILQFTREILTWMQQQRVKMVVMACNTSSALALEAVR EEFSVPILGLILPGARAAVNSGKRIGVIATAATAKSNAYRHAILEINPEAQVWQVSCP EFVPLIEQNRIYEPYTLQVARSYLEPLLLHEIDTLVYGCTHYPLLAPVLRSLLPSYVK LVDPAEYVVAACAQDLDILGLRNTYPPLPTRFAVSGCPQQFAQSSVQWLGYTPVTEAV QLTNVTLYSN" gene complement(14303..16333) /locus_tag="DP116_05675" CDS complement(14303..16333) /locus_tag="DP116_05675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865076.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N-acetylmuramoyl-L-alanine amidase" /protein_id="PRJNA477356:DP116_05675" /translation="MKMNWLLPVTVATTTVFMLQSQAQAAKLQFWRFDASQNRLEINT EGAVQPQAQLLFNPTRLVIDLPGTDFGRSQLIQPVNSSAIRTVRVGQFDGQTTRIVIE LSPGYTLDPKQVKFEGKSSSRWTVQLPTPQVEQAASSSGNTSLNVYSVVRPDNSTATK DVIVNTDSAASKDIVVNPVDRPTQKQTLISNTQGLTQIESLRVTGDGLFVRTNGPNPQ IQTFRSSDKSAINIDILGAALSPRFFQQNTPVNKYGIKRIEFTQLKTTPGVRMTLWVE RNSPDWRASMSSFGGLVILPNGDASKLSRDTTSTSNSSDGNGSLLPNSEIPRNVTPRS SNLMPPASDSISTIESVELTATGTQLLIRGDQPLSSVNTGWDRASGLYRIIIPNAKLA ASVRGPNFDASSPVLRVRLQQLDPRSVAVYIQPAGGVRIGQLNQLSSQLLSLELQRSF SPLTPRFSLPPLPRSNPQPLSSGSMTNNPLPMPQMMPQPMPRPRVPNGRVVIVVDPGH GGHDSGAPGLGGLLEKDVVLPIGRRVATILQQNGLQVALTRDTDYFVTLQGRVDIAAQ ANADLFVSVHANSVDRRPDVNGLETYYYDSGLDLARVIHSNILRSIPTLKDRGVRKAR FYVLRKSSMPSILVETGYMTGQEDNPRLGSPEYQNRMAEAIANGILLYLRQR" gene complement(17126..19081) /locus_tag="DP116_05680" CDS complement(17126..19081) /locus_tag="DP116_05680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetylmuramoyl-L-alanine amidase" /protein_id="PRJNA477356:DP116_05680" /translation="MKTHWLLPGTFVTTSLLTLLSPAQAAQSAKLQSWRFDANQNRLE INTQGPVQPQAQLVFNPTRLVIDLPGTDFGRPQLTESVGGAIRSIRVGQFDPQTTRIV VELSPGYTLDPKQVKFEGKSASRWTVQLPTPQTEGVASSPASPTPKAPSPKSEEVVIS PPRNIYNVVTVDSDTDKKPEFSKAVVVAERTIQVESLQVTGDGFFLRTSGGNPQTQVI RSRDRKTIFVDIPSATLSPNFGARDRSINKHGVNRVEMIQLQKTPPAVRMTLRVDKDT PDWRISTSSSDGSDGLVVLPKNRYASDSPRNYYSSTPSDNPPTPAVETPKSDAISTIE AVELTGTGTQLLIRADQRLSSASTSWDQSSNQFRITIPNAKLASAVKGPNFDASSPVL RVRLQQQDPRTVVVYVQPALGVQIGQVNKLNGQVLSLGLERIPSIKPPIALPPIQRQN PQPLALQNQPSKPVVQKPRAPKGRVVIVVDPGHGGKDSGALGIGAIQEKNIILPIGKR LAEILQQNGLQAILTRDSDYFVTLQGRVDIAERTNADVFVSVHANSAGDDRPDVSGLE TYYFDSGLSLAQIVHKSILRSVNVKDRGVRKARFYVLRKNSMPAILVETGYLTGREDA AKLSNRLYQNKMAEGIADGVLQYLKQK" gene complement(20684..22108) /locus_tag="DP116_05685" CDS complement(20684..22108) /locus_tag="DP116_05685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319257.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cation:proton antiporter" /protein_id="PRJNA477356:DP116_05685" /translation="MQLDFTTFSLPLLATATETADSSLVLAAVLLSMVVIYLASKVGG ELSNRFGLPPVLGELVGGVVVGISVLHLLVFPEGGADSSNSVIMTFLQTTGGLNPDAA DAVFKAQSEVISILAELGVIILLFEIGLESNIKDLIAVGIQASVVAVVGVTVPFAAGT VGLMTLFGIPAVPAIFAGAALTATSIGITSKVLSELGRLNSKEGQIILGAAVIDDVLG IIVLAVVASLAKEGSVDVGKVIYLIISASSFLVGAIVLGNLFSNTFVAIASKLKTRGG LVIPALVFAFVMAYFAAAIQLEAILGSFAAGLVLDETDERVELQKQVIPIADILVPIF FVTVGAKTDLGVLNPAIPSNREGLVMAIFLIIVAILGKVVTGLSVFGQPQINRLAIGV GMIPRGEVGLVFLGIGSSIGILSKPLEAAIIMMVILTTFLAPPLLRFVFPEPTTATAI EEVLLDNSSGKSLVIESPDSRDDK" gene complement(22190..22659) /locus_tag="DP116_05690" /pseudo CDS complement(22190..22659) /locus_tag="DP116_05690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319256.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="EVE domain-containing protein" gene 22955..23710 /locus_tag="DP116_05695" CDS 22955..23710 /locus_tag="DP116_05695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458017.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05695" /translation="MNTNAVFGSQFNAGNRWKFLSLALLVYISSTQPALAQQKERMLR TLSVNGRGMETIPTTLSQVSLGVEVQGKTAQQVQQEAARRSSAVVALLKSRNVQKLQT TGITLNPVYNYDNKVQRITGYAASNVVSFRIPTERAGTLLDEAVIAGATQISGISFVA TDEAITLAQQQALKKATQDAQQQAQAVLSTLGFQPKEVVSIQVNGASAPPPPRPLLDA AELGRLTTKQNAATTIVGGEQQVEATVTLQISY" gene complement(23810..24172) /locus_tag="DP116_05700" CDS complement(23810..24172) /locus_tag="DP116_05700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="single-stranded DNA-binding protein" /protein_id="PRJNA477356:DP116_05700" /translation="MSINIVTLIGRVGTDPDMKYFESGSVKCKLTLAVNRRTRDSEHT DWFNLELWGKTAQVAGDYVRKGKQIAVKGSLKFDNWSDRATGANRSTPVIIVDQLQLL GSKRDAEDGDIDMNPDNF" gene 24544..25551 /locus_tag="DP116_05705" CDS 24544..25551 /locus_tag="DP116_05705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746676.1" /note="functions in MreBCD complex in some organisms; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="rod shape-determining protein" /protein_id="PRJNA477356:DP116_05705" /translation="MGIDLGTANTLVYVSGKGIVLQEPSVVAIDLNEKVPIAVGEEAK KMLGRTPANVIAVRPLRDGVIADFDTAEVMLKSFIQRVNEGKSLVLPRIVIGIPSGVT GVERRAVMDAASQAGAREVYLIDEPVAAAIGAGLPVTEATGNMIIDIGGGTTEVAVLS LQGTVLSESVRIAGDELTEAIMQYMKKVHNLVIGERTAEDIKIRIGSAYPTHDDDDAM MEVRGLHLLSGLPRTVTIKGPEVRESMSEPLLVIIEAVKRTLERIPPELASDIIDRGI MLAGGGALLKGVDTLISHETGIVTHIASDPLECVVRGTGRVLENFKQMERIFSGRSRN M" gene 25635..26444 /locus_tag="DP116_05710" CDS 25635..26444 /locus_tag="DP116_05710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rod shape-determining protein MreC" /protein_id="PRJNA477356:DP116_05710" /translation="MFTARRWWERRGLQIGLLGIVVGGAWVLRQTQGALMFEIYQEIT RPIQMLQTPPAQEERLKDARFLELQTQIAELKNQNKKLKQLLGYVEKEPSTSRPVIAR VMTRGADNWWQQVTLNRGSLAGIQEGYIVKGDGGLVGLVESVTPNTSRVLLISDLKSQ VGVTVSRTGAKGVLRGDSSADGILEFYEKVPNVKPGDVVTTSTYSQKFPSGLAVGRVK SLDLKKLPASVGKVELFPSIRSLDWVTVYPKPQTPPPENVGSVNQKSEKSK" gene 26449..27066 /gene="mreD" /locus_tag="DP116_05715" CDS 26449..27066 /gene="mreD" /locus_tag="DP116_05715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015117713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="rod shape-determining protein MreD" /protein_id="PRJNA477356:DP116_05715" /translation="MRTPQLQTSRKKKSKSPRRKSKIQIVPLSRWHPRLLQFTDWAII TGSVMLCLLMLLIRVPGMELLGIGPNWPVIWVVAWSVKRTAFSGALAGVVLGLLQDAM TSPDPTHALSLAVVGSLTGLLQKQRFIQEDFISIALIVFGMALWSETIFASQLILMGD RNPADVWAHFQKVALASAILSSLWAPVIYFPLNRWWQQVKLAQQS" gene 27297..28472 /gene="ribD" /locus_tag="DP116_05720" CDS 27297..28472 /gene="ribD" /locus_tag="DP116_05720" /EC_number="1.1.1.193" /EC_number="3.5.4.26" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase RibD" /protein_id="PRJNA477356:DP116_05720" /translation="MDNLPMVEPDAFIAESEFQDDSHPKQTLLSDFDHAMMRRCIELA RRALGRTSPNPMVGAVIVKDGEIIGEGFHPRAGEPHAEVFALKAAGENARGATIYVSL EPCNHYGRTPPCSEALVAAGIAKVVVGMVDPNPLVAGGGIARLRAAGIEVLVGVEEQA CKKLNEGFIHRILHHRPFGILKYAMTLDGKIATTTGHSAWVTNKDARGEVHQLRAACD AVIVGGNTIRLDNPYLTSHREGAHNPLRVVMSRSLDLPQNARIWQTAEAPTLVFTEKG ANPDFQELLRNLGVEVVELTPLTPDQVMAYLYKRGFCSVLWECGGVLAASAIAQKAVQ KVFAFIAPKIVGGVHAPTPVGDLGLTSMTEALSLERVEIRVVGSDCLVEGYLPSHQL" gene complement(28504..28722) /locus_tag="DP116_05725" CDS complement(28504..28722) /locus_tag="DP116_05725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05725" /translation="MTPKVMRQLWSVVETAQTKTLLQLDDASLVQWLVKQIKTQALLD CYESDFLSDYVKSRLALIRDLAQERQYS" gene 29257..30651 /locus_tag="DP116_05730" CDS 29257..30651 /locus_tag="DP116_05730" /inference="COORDINATES: protein motif:HMM:PF13245.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05730" /translation="MSLPQPIGRQKEVLYLPARGHFAVLGTAGSGKTTLAILRSAFLA DPRTEHCGKTLLVTFNKALVTYLNHLQDRKLANVIIENYHLFARGYLAFRNKMSRHAI LTPDDREALVKQAVKNISQHHSLHPLFDYPVELFSEEIRWMAHYGITTYEEYQNFDVL SDVEMRFKGKERELVFEIYQTYLNLRQQSGKKYDWDDIATTVCEEFGADTSKRLYKHI VIDEGQDFSPQMIRSLAWAIPADGSLTFFGDVAQQIYGHRISWRDAGLDIEQVWEFKQ NYRNTKQIAKLGLAISKMPYFKGVPDLVEPVSPPADGLLPTIVEFSSPDQEILFVVHQ AITLARTQNVAILFRDRQDEKLIGQYLPKGTIRLHREMTTWQAGAGIRYGTYHSAKGL EFDAVILPFCNNKKLPDPEAVEAFGEADASAQDGRLLYVGVTRAKTRLIMTYCGEVTS LLPSDTSLYERVKR" gene 30648..31355 /locus_tag="DP116_05735" CDS 30648..31355 /locus_tag="DP116_05735" /inference="COORDINATES: protein motif:HMM:PF14487.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4433 domain-containing protein" /protein_id="PRJNA477356:DP116_05735" /translation="MIDSIKREAERRSITRLCHFTPSRNLVHILTGETGILATKHLQK NERSVFTPTDLKRLDGHQGYICCSIQYPNVWYFNTAKSKDILFRDWVVLFINPKYLWL AGTRFCPRNAASAFGSTIAEGEAAFLSMFAQSVSGAGGRTFSRWTNHLACCPTDNQAE VLIPDQIGISDILAIAVPSETQAKNEAVRLDILGISEEKYKFVVAPDLFDKYNLSNLI RSGKRPDETPWKVGEEL" gene 31352..33163 /locus_tag="DP116_05740" CDS 31352..33163 /locus_tag="DP116_05740" /inference="COORDINATES: protein motif:HMM:PF03747.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05740" /translation="MISNEQHLHTISLMTKGQGAFLGAAVGDALGWSQEPEAKRIDKK TSSPAEVLDNGFQQWVRKSGGQYYPHHEVILAGEYSDDTQLILCTARSLLYGARWWHH LTKRELPIWTSYERGGGGATKRSAQQWLAGIEPWSSPDKEKKRYFGAGGNGVAMRILP HCLLGATETNFENIAKNIVANGVTTHGHPRALVGALAYGFAIWVALRETNTLQYGAIL EKVLSELNSWSVLPDLNDICPNWKNSALQTTDGQYDDSWQHTIKEMLQLLELCQEGMK QGALSIEREVLTKLGCFNKSVKGAGTVSAAAAIFLASRFAANPFYGLLEAGFAHGADT DTIASMTGGILGAIAGIEWLGNYAVKVQDANYIRDIAEHLARNQGDSQTQQADNFTIK KTHLDAFVKQIEVSKPADSILIPDGRKAQISASVNHQSISKSTVATSWKLTTAEGQSL YVKKLSRPKNNTEINSELKSISVSEHNFEFQQVNILHVGIEIPVSNLDKSRFFYEKVL GIQVEQESKLLVRCGSIVLINIEDYKKRHGFYSGTTKAPTTHTIDLEVESLDEAYNNV SKVEAKIVKDISKNHERRYFYCLDPDDNQIRIFEVKF" gene 33358..33711 /locus_tag="DP116_05745" CDS 33358..33711 /locus_tag="DP116_05745" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05745" /translation="MPIIESKDVVNGEEVVIYIEVDNIPPSRSPYENVRGVDTARVVA AARDVFGEAMQLTRSCAKRVVESVKQMEKETRPNELEVKLAIKLDSEVGAVIAKVNTG AQIEVTMKWKSTGES" gene 33708..34661 /locus_tag="DP116_05750" CDS 33708..34661 /locus_tag="DP116_05750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320646.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium chelatase" /protein_id="PRJNA477356:DP116_05750" /translation="MNETTTIQILPYTLIVGQQQIKLALELAYIAPRIGGVLLSGQRG TGKSTAVRAFAQMMYERLPVTLPINATEDRVVGGWRIDELMQSKAVPQKGLLEEANGG LLYIDEVNLLDDHIVNIILDVTSTGVLVVQREGQSFQKPVSYTLVGTMNPEEGGLRPQ LLDRFGLMVSVEAEQNEAERTMILQTVLEFDEALSQLKVGETSAYINEALQKDKKRKA LLEKARQNFYNVKVPVSVARNCVKLAIKFNAEGNRGDYIIALAARAYAALVDAKQVTN DHVAEVARLALQHRRPEVLQSNQMPWSDEDDEQVMKMMNGE" gene 34654..36561 /locus_tag="DP116_05755" CDS 34654..36561 /locus_tag="DP116_05755" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05755" /translation="MSDSAKLSQLSLNDLVMRSLACAAISPGLRSILMFDTTPNALRL AAQTTAQMLEVVTGHGVTLVTLGTYEAEDDLWGSWGLAGDNEQQLLQWKPGLLSQHSC DIPYESINKFQQRIVVIPDLTKLSLAAARACVVLMGSDVAHLERHGQQVHWQPNICWM AGCASGEVGMVSPHLLDRFALRLSGQVAKTVDRAESILELLDERRLGKEKPKPLSTEI REQLLKALQVHPKMTAKALARIFDYTLEVEVYSPRREIALARLSLANARLVGAKQMTT EQVDTAAGMIGLTIGIKQKAKRSTQSSEPQPELTKLTESTSVSTPSSEQLELSPEQEP VYESDEPEELPATPLTFNAIPVNPYPEDEAPIEREAASLRLPPRRFRASAAARGVIIG VEKATNMRDLALVRTLLEAAKFQPIRQQGKTHSQRRLVLSLTDLHSYRRAPVAEQMLM LVLDHTCLLDCNWQEELLPYLNWAYVERASVCLIQVGAAQASHELRAYQVMAQNILVP SISAGIEAESGKATPLAHGLDLALQTLRHALQHGRSRVQRAVLVVISDGRGNVPLEAS RFGRITPPVGCKGVDDALQVAERIRGLDGVKAVLLNPQPKQYADLPLELAKTLGATVV AIGRLEQVEVE" BASE COUNT 10714 a 7664 c 8171 g 10214 t ORIGIN 1 ctaatgctgt gcaaagccag atgtggagtg ctttgcaaat tcccggacat gtctaataat 61 acttcagcgc gtccatccac agcatcgacc acttcagcta aagcgtcaag agatgcgatc 121 gctccatcta attgacgccc cccgtgatta gatacaataa ttgcccttac tccacactcc 181 acagcacgaa ctgcatcatc tccccgtaaa attcctttca gtaccaacgg aagcggacat 241 aaagactgaa accattccaa atcatcccac gttactgcag agtttagttg ctgtgcaaca 301 taagtaaaca atccagattc tccctcttga tggggaatgt tcaaccctga taaatttgta 361 aaattagcta attccatacc tgttgggaga gcaaactcat tgcgcttgtc tcgttctctc 421 cgtcccagga 
taggagtatc tacagtgata caaagtgctt tatagcctgc agtataagct 481 ctttctacta aagcacgagt tagccctctg tctttgtgga tgtaaagctg gaaccattgg 541 ggagtttgaa gatttgtgtc gtgacacact gctgctactt cttctatgct tttggtagaa 601 agagtactca acaccatgcc aacaccagca gatgcagcgg ctgttgctgt tgcaacttct 661 ccttgtggat tggcaagaca ttgaaacgcc atcggagcaa tcaacagagg tatttgaagc 721 ggttgaccca aaacttcagt tgctaggttg cgctgactga catctactag cactcgcgga 781 cgaagtttaa atctttcaaa agcagttcgg ttatcccgca atgtgacttc atcccacgca 841 ccgctagcgt aatagtccaa agccatttga gacagacatt tggtcgctag ctgttcgtat 901 tcaaataagt taagcggttt caacgatttt acttcctcca ccaccacctc accaagtcat 961 tcaaaattgc aatttaaatg cgcgacatct aaattgtctc gcacactagg cgcaaagaag 1021 aacgcacata gtctttaaat aaatgaaaaa ccgctgtaat aaagaatgac agataaacac 1081 aaatgcacac aaaaccgttt tgtgtgcatt tacagttttt tgtttcttga cgctcccacg 1141 gttttttgct tcaaattctg taaggagaac tgaccacctt cgcatcggtt gaggacaaag 1201 taacctgtga tggttcaaag cgtttattct tgagattcaa aatgacagat agtaacttgg 1261 gaaacgcaaa acagactaca acgtttatca aagttaggaa cagaccatcc atcaaactca 1321 tagataatcc ccttttttat aaatggtgct ctttatatat ttctattttt aaacaaaatt 1381 cgttgaaata acgaaaagca gaatttgaat ttattgtgat gtctataagt aacttttact 1441 ataaaagttt ttgattctaa ctatagagtg aataattatt ttagaaataa gcaaattgag 1501 ttatcagtta accgtgaata tttattaagt ttctatacaa gatagataca aatctgacgt 1561 cttgatttcc gcagaagtta actgataacc ttaccctgtt aaccaggaga acacctgtca 1621 caaaaactgg gagcgaattt acaaaacggt ctttttgtca aggtgtaact tgtgcaatac 1681 ggaaaactgg taagttcaat tacctttttt tggctcgttg aagtagtctg caataactga 1741 ttctgcccag ttacagtagc tagaatcata aatataactt tgggaaatgc aaaacatacg 1801 gcaacattta ttagcgtcaa cagcaaaatg tctatcaaag gcataggtca tcactttact 1861 tttacaaatg aggtgcttgc tgcttcaatt caaaagcaat atttctgtaa caagttaaaa 1921 gcaaaatttg gccttgcttc tctctcagat aagtttacta taacataaac ttttgttaat 1981 caatagattc ctcaagtttt gagcattgct atagacagaa taaaatgtgt catttcaaat 2041 ttgctatttg aaccacagat cggaatatgc tggagagata agaaattttc tatttgggag 2101 atgtgttatg tcgaaaacat 
acgatattta ttctataatc tctaaccttt atgagctaga 2161 gcgccggaat ttagagcgtt tggcagatta ctccaacttg aattcactgc cagaaatatg 2221 gtcgttggta gcaaagcgat gtggtgatac cgtcgctctt cgcaatcctc atgccaaacc 2281 agaagttgtt atcacatata cgcaactctt tcaaaaaatt caatattttg ctgcaggatt 2341 gcaagcgttg ggggtgaaag caggagatcg tgtctctttg atttcggata atagtccccg 2401 ctggttcata gcagatcagg gcattatgac tgctggggcg gttgatgcag tgcgcagttc 2461 tcaagcagaa cgggaagaac ttgtgtttat cttgggaaat agtggtagca cagcgttggt 2521 ggtagaagat ttaaaaacct ttaacaagct taaacagcgt cttggtgatt tgcccattca 2581 actggtcatc ttgcttagtg atgaagtacc accaacagac gaaaccataa aagtcataaa 2641 ctttaaccag ttgatggaaa ttggggcaaa ccataattta acaccagtta agcaaaatca 2701 cgatacttta gcaaccttaa tatatacttc tggcactaca gggaaaccca aaggtgtgat 2761 gctttctcac ggtaatttga tgcaccaagt ggtaatctgt gggaccgtat tgcaaccaga 2821 accaggagca gttgttctca gcattctccc cagttggcac agctatgaac gcacttgtga 2881 atatttctta ctatctcaag gttgtaccca aatttacact aacttgcgat ctgtgaaagg 2941 agatttgaaa gaatttaaac ccaattacat agtgtgtgtg ccccggttat tggagtcaat 3001 ttatgaagga gtacaaaagc agttccgtga acaaccagca aacaagcaac gcctgattaa 3061 ctaccttttg ggtgtgagtc agaagtatat caaagcgcgg cgaatcgccc aaggattgag 3121 tttagaaaat ctgaacccct caatggtcga gcgattagca gctagtatac aagcatcagc 3181 tttgtttcct ctccacgctt tgggagaacg acttgtttat gccaaggtac gggaagcaac 3241 ggggggagaa attaaacaga tgattagcgg tggtggtgca cttcccaagc acatagatga 3301 gttttttgaa attattaatg tagagatttt gcagggttat ggcttgacgg aaacttctcc 3361 cattgtccat gtgcgtcgtc cttggcataa tgtacgcggt tcatctgggg aaccagtacc 3421 aggtacagaa acgaaaatag tagatccgga aactcgcaaa actttgccac ttggggagcg 3481 aggtttggtg atgctgcgag gaccacaaat tatgcaaggc tattaccaaa acccagaagc 3541 tacggcgaaa gcaattgaca aggagggttg gtttgatagc ggtgacttgg gttgggtgac 3601 tccacagaat gacttggtat tgaccggacg agcaaaggat acgattgtat taaccaatgg 3661 ggaaaatatc gagccgcagc cgattgaaga tgcgtgtttg cgatcgccct acatagatca 3721 gataatgctt gttggacaag accaacgcag tctcggtgct ttgatagtcc ccaatttgtc 3781 agccctgcaa aaatgggtgg aagctaaaaa 
tctgcatcta cgtctaagcg atgaagccgc 3841 caaacaaata agtactgaag atgccccagg caagacggaa gtcaccttgg agagtaaaat 3901 gattcaggat ttatttcggc aagaattgat tcgggaagtg caaaaccgtc caggctatcg 3961 accggatgac cgcattggtc cgttcaagct cattctggag ccgttttcac cagaaaatgg 4021 catgatgaca caaacactga aaattaggcg acaggtcgtg atggagcgct atcacgatat 4081 tattaaccca atgtttgcct gattatattc cacctgcaaa ctatagagtg aacgtgaaac 4141 atttatggat gcctccaaat ctcaactgct gctaaaacgg gttgttaacg tcaaagcgat 4201 tgtgactccc ctttggaaag aggaagtgca gcagcaactg caagcacaaa ttaatcaaat 4261 tgaccagcaa ctgcaacaaa tcgacatgga aggacagcgg gcgatttcag cagttcaaaa 4321 gcagagtctc cagccaccag gtccccaaac tcttcaacaa attgagaata ttcaaggtca 4381 aattaatcaa aagaaaagtg aactcttgga gcaaaaaaat caaagtctgc aaaatctcca 4441 gcaagtccag tttttagagt tggatcaaga agttaaccaa ttccaaatgg aaggcttttt 4501 ccacgtggaa ccaggtgata acttaattag caaattgcag gtagaagttg tgctacgtga 4561 tggtattgta gaagaagttc gcggtgatat ttaaatttgt cctttgttct tggttgttat 4621 atagtgagtg agactcagaa caaataacta ttgacaggtt tgggttgact gtggcaagtg 4681 aaggctactt ccaggtatac attgtacctg ggagccagcc caaacaattt gccatacacg 4741 aggggaactc acaaaaagcg atcgcccaaa agttgtgagt tcaactcgat atttcattgc 4801 tttgagaaaa ttctttgttg gctctatttg ggtaattgtt actaacgctt gattaggata 4861 gggataaaaa acctcaacct tgcgagtttc tagagttgaa tccatgttat caaaagcatt 4921 gagcgcaagg tctgctggat cactaccttg gagaggagag ttaatacttt ctagaggtac 4981 agtgttataa tttgcccgct ctgagttgac aatttcctgt gttatgttca catctgaaaa 5041 ctgaggtggt tgcccattgt tcatactata aaccgcttca gatggctggc gctcaatgat 5101 ataaacacca gtagcagcgg tggcaaagag acttagtatc acaatcaaaa acaatttcaa 5161 ctttgccgac ataaaattta acagccaatt aaaattaggt taaatattaa ttttgaatct 5221 gagattaagg tattgtcctt cctcttatta cttatgagtg gaatttagat tttgacagtc 5281 gtgatgagaa atacaaataa atagaccttg tggtcttaat atagttcctc accgctcttg 5341 agcaactaga ggtatcagaa tgtaagggta tcaagaataa taaccatctt atttatgacc 5401 tggtcaaatt ttaataatca ctgtcctaca cccgagatct atgtttccca aaactccgcc 5461 ccaagggtag cgaggtcaag atacaattcg atctaacaca 
aactgtaaaa gtccattaat 5521 tatcgattca ccaaaccacc tatatgagca tttacgaagt atttatgccg gcgctaagtt 5581 ccaccatgac cgaaggtaaa atcgtctctt gggtaaaatc tcctggagat aaagtggaaa 5641 aaggcgaaac agtggtggtt gtcgagtcag ataaggcaga tatggatgta gaatccttct 5701 atgaaggatt tcttgctcac atcatcgtgc aagctggtga aacagccgcc gttggaagtg 5761 cgatcgcctt gttggcggaa accgaagctg aaatcgaaac tgcagctcaa gcgaattctg 5821 ggagtagtgc tgccaaacat gaagcaactg cagccattaa gagtgaaaaa acagcagaaa 5881 caaccacagt cgcaacaccc gctgcttctc aaaacggaac ctctagccgt acaaatggtc 5941 ggctggtagt ttcaccccgt gcgcgtaagt tagcgaagga actgaaagtt gatttaagtg 6001 gtatctctgg tagtggtcct catggtcgca ttgtcgccca ggatgtagaa gcagcagctg 6061 gaaaatctag caaacaacct gctacagcga ctcctgtcgc acctccacaa ccagcgccaa 6121 ccatcacccc ggttgcacct acacccacga aagttgctcc tgcacctgca cccgcgccgg 6181 cgatcgctgc acttccgggt caagtagtgc ctttaactac cctgcaaaat gctgtagtac 6241 gcaacatggt agcgagccta tccgtaccag tcatccatat aggttacact attaccactg 6301 atgcacttga caaactttat aaacaaataa agtctaaagg cgtgactatg acaacccttc 6361 tggcaaaagc cgtagcggtg acattgcaaa aacacccact gcttaatgcc agatactcag 6421 aacaaggaat tgtctaccac tctagtataa atgttgctgt agcggtggca atggatgatg 6481 ggggattgat tacaccagta ttacaaaatg cagacatggt ggatatatac tctctatccc 6541 gcgattggaa gtccctagta gaacgtgcta gagctaaaaa gcttcaacca gaagagtata 6601 acaccggcac ttttacaata tcgaacctag ggatgtttgg tgtagataga tttgacgcaa 6661 ttctaccacc cggacaaggt tctattttgg caattggcgc atctcgtccg caaatcgtag 6721 caacagctga aggtctattt ggcgtcaagc aacagatgca ggtaaatatt acctgtgatc 6781 accgcattat ctatggtgct catgctgcgg cgttcttgca agatttggcg aaattgattt 6841 cgacaaatcc tgaatctttg atgttgtagt ttgagtgata ccagaatcaa accgccttgg 6901 ctcacaattg aaccaaggcg gtttatagta ggtaaaattt ttgaagatca tttttatttt 6961 atctcgttcc caggcacagc ctgagaacga ttttaccagt tgcaagccga gtcataaacc 7021 gtaatctagc actagggtta gtagggtaat tgcaagaaat aacaacagca ctcgggctaa 7081 tcctaaagtt tgactcagcg tagggacaag actaagcaac atcaagcgta ttggacaata 7141 gagtatccca agtctttgcg gcattcctgc aaaacttgtg aaatacttgg 
agttgaataa 7201 ctccacaaaa ctcagccgcg tgtcattgaa ttcatcaaaa aatcgccgaa acaaagcccc 7261 cagtgctgcc aatatttcta tggcgataca gaacctaacc gagacattac gattgtggct 7321 atagtagtca ctccaaattg cgcccaccgc ctaaagcggt acccaatcag tcgccacaac 7381 tcttacatct acaattacca acgctagcac aagcacccgc aaggtcatcg aaccttccac 7441 ttgcatcaac ggcgactcct gcgcattttg ccgccagaaa agacttgcag gtagacacca 7501 atttcatcct ggcttctaga tctgttagac atcttaacga gaaagcagtc tatgggtata 7561 taaatcaagc aatgatatgg acgccacaga tacaaaacga gtcggtaaaa cgaaactcgt 7621 aagtgtcttc ctcacaacag ccttctgtag acgccctaat gatcgcttgt gtaaaatggg 7681 gtatttttca aaaccaggac attcatttta tatcaatcgc gtttatcaac gtggattgct 7741 agcgatggca agcttattta taaggtggga acaatcacaa atacaagcgg ttgttggtct 7801 atttctttta actcaacttc caaggtataa gtctggttta gttgcgtgtt gtgtaattct 7861 acacttaggc atcccggact tgatgactca gctgttgctt gggcagcatc tccttggagg 7921 ttcacaatcg ctttagaagg taattcagtt tcaactctta gcttggtcag ttgatcttga 7981 gtttggtatt ctacctttag ggtcataaca cctgcacttg taagccattg tctttctact 8041 actgggttgg tagctgaaac tggcagagtt agattttcta gtagctgttt tgccgccagc 8101 aacgaaagcg ccatctgttg ctgtggctgt aagtctgagt aatcactctc tagattcggc 8161 atagtttcta gagcatctga acgagaggga cttcttagca ctattcctgc taagtcattc 8221 aatgtttgat attcttcagg aaagaggctc tccacagcag caacgagttt tgcccctaac 8281 ggtatagaag aggcaatcat cgcctgacac ttctccagca attgctggaa aattctttca 8341 ggcagctgaa tcggcaaatt cagtttcgat tgctgtactg cccagcccca aaccggaacc 8401 aactcaagcg ctggcttgtc taggcattcg ggatactcta taagctgagc ttcccaaggg 8461 aataacggtg ggtgattttg aatttgagtt tgtaaccgac gtttaaggac ggcttggaaa 8521 cgttcttgca cagtaggaat ttctcccagt tgaatagttc ggggtagaca ccttaactca 8581 aaagcgccac cgtttaaggc ggcggctgtt tgattaatat gttctacccc ttcattttcc 8641 tcacaattga ccaattttgg atcgttggtt tgggcatttt ttgccaacaa ccaagcgatt 8701 aagtggtttt gtaaggattc tgagtcacta ttcatgggta gatgcacctg atccggaaat 8761 tagagagtta cgaatttccc aagcttgctc aagaatttta aaccaccgtt tttgcatttg 8821 tgccatcgat aatcctaaag ttttggcaat tttttcatct ggatgacctt gttgtttcaa 
8881 ctctagtaaa gactgttgtt tctcgtccag ttgagccgta tacacttccc attgctgagg 8941 agttaagccc aaatttgtct gtaaatctgc ttccagccac tcgtgaacca attcccagcg 9001 atgcaataag gcaaacctga ttaaatggta tttaaagcgc tgctgcaaat aatcccgctg 9061 gcgagcagtt aagcccaata ttgactcaat ttcctgcgct gataaatcct ggagacgcag 9121 agtaaagtaa tcagcacaat ctgattgctg ccgttcttcc agataattca tcaattcagt 9181 aatcacgaca gagcgtaacg tatcttcttg aggttctggc tctggttgtg tcgccattgc 9241 acttcgcaat tgctgtacag ctggatcttc ccaagagcca tcagcttcgt taccactccc 9301 ttctgctgct tgttctatat ctacgcaagt ttccggcggt tgttgttgag agaatgtttg 9361 cgctcggaga ataatcagct gttgttgacg ccctggcaaa ggaatccttc gcttgccata 9421 gcgttcggta aatgccatgt attccgccaa ttccaaaagt gtttggggac gataactagt 9481 ccccagttgg ttctcccgtc ggaaggcatt taatgcctct aggtaaaaac tctgcaagaa 9541 atcttcaatg acaacgagcc gcccttgata actcaattgt ctttggggag gattgatgta 9601 gcgataaata atcgcactca aagtactgtg taactctact ctgccccgat tggaacccaa 9661 ctcgtagtac ctcaaacact gttgtaaccg atgtcgggca agggtcattg cagaattttc 9721 tacagtccca gcagcttgga tgcgcttgct ttcactgcaa attctataaa cttcggcagc 9781 aattcgtgtt gccacatcgc ggcaattttg ctccgaagct ttggttgatt gctgaagctc 9841 cttgaaaagg agttgaaata tcgcctccac gccgatagaa cttactccct gaattgttgt 9901 tatggttgcg gctgaattca tagtctggtg tttagaaaga ccctaacata cttatgtacc 9961 acctgtaaga ctgagggtat tgggttgagg attttgcgta caagtcaaac gtgcctcaat 10021 tatgtgttat aggattaggc tgaatttcat attggcaaat ctcgtttgat tgttcgcttt 10081 tcatagatgt tactgccact acccattcat cagtttacag gtcattatgg acttaaaagc 10141 acaaattcaa ttgctaattg acaatgcccc ccaagatggt gttactccgc aacttgtcac 10201 agcaatcgct cctgtcctca gcgcgcgcgc tcagaaatta cgccattccc agtactatat 10261 tctccagaat atggaagagg gctgggtttt gactacattg agcgctcgtg caaatccaca 10321 gttggaaaag cgcgtcattt atgctttccc tacaatacag gatgtcccac tgggatcatc 10381 tgctgggctt gaccctcaat tgatagctgc accaatccct gttactcaca ttttgttcca 10441 attggtagca atggaacccg tagatagtat agtttttttt gaaacccctg gcatcactac 10501 cgactcagtc gaagtccgac gagccgacct gcaacactta attcaacaac ggttacagca 
10561 aaaccgagta aaaaaacaaa ttcctccaga tattgcctaa cagcaggagt gcggggcgag 10621 gaacacaagg aagaaaccca tagaattctt cctacctcac ttcctcgttt cccctaatag 10681 agcctactca gtacgtaatc agctatcttc attaggactt gccaagattc agaggatggc 10741 agatctgcaa gatgctcaac agctagcttc gcatgatgtt cggctaagtc tcttgagcgc 10801 tgaatacctt gactatcgtg tatcaaggat atggcttgct ctaagtcgcc tttctgggca 10861 aactctctgt ctatcaggac ttccaaatat ggcttttctt ccaatgcaaa taaaacaggc 10921 gcagttaaat taccactttt caaatccgat cctgctggct tacccaaggt atctgtcgaa 10981 cttgtgaaat ctaaaatgtc atctactatc tggaatgcta aaccaatatg acgcccgtag 11041 ctatacaaat ggtcggctgt ttcttgggaa acgttgctca gtaatccagc tgcttttgaa 11101 ctattagcaa ttaatgacgc tgttttgtaa taagtctttt tgaggtaagt gtcaatcgat 11161 atgtttgtct cgaagcgatt caagccctgc tgaatctcgc cagtagccag atccataatc 11221 acctctgaca gcagtttcac cacctccaaa ttgtccagat tggctagata ccaggaagat 11281 tgagcaaaca gaaaatctcc tgctaatatc gctattctgt tgccaaataa actgtgaaca 11341 gtgggaacac tgcgtcgcat ctcggattca tcaaccacat cgtcatgtac taagcttgct 11401 gtgtgaatca tttccgtaat ctcagctaag cgtcggtgac gaggcgtaat gtcttgttcg 11461 agcatggtcg cccgcgatat cagcaggacg atcgctggtc ttatgcgctt tcccccagct 11521 ccaaagagat gttcggctgc tgcacaaagg atcggatgac cgtttccaac tagctgtttc 11581 aagttgtctg ctagtatttg caggtcggct tccactgggg aaaaaaggga ggtggctggg 11641 gtcatggatg ggcagactct ggctttagtt acgaaagttt acatatccta cacttattct 11701 aagataaccc tgtcccagcg caaagttttc atgagtcatt aatcattact tttgtgcatg 11761 gcactcgcac gccagttgta cagacgctct tgcaacaagg agggcattgc cgacggacgg 11821 tggttttctt ctggaacttg atacgctact tgcactcaat gagtttgtat ctcactcatg 11881 gtacggctgc gcctctatcg tgcacaaacc ataggaggta aattttgtca acaaagtttt 11941 aatgtttttc tatagacaga gaaaatatac cgaaatgagt gttagtatat tcttttattt 12001 atatgatatt tttcatactt gtattattaa accatagacc tgatacagta gacaaaatct 12061 ttgtatctta accaaaactt tatcttcaac aacatatttt tcaagtaaag atgaaaacca 12121 ggctggaaaa cttccatatc ctttagtcgt cacctatgac acatgactta tttttaccga 12181 aaaaaattaa gcgaaatctt gtataaccga cactaaaaaa 
tttcaaattt agcagaaaaa 12241 ttagccaaaa acttagggtc gttgtgttac gctaacatca agggatttaa atttttcgag 12301 atattcacat taccagagaa aaaccaccgt ttggtgtttt tacctatcag gtgctgtgac 12361 tttgaatata gtattgtgga ctaccatcaa tgaatatatc tcattggtta tggtctcaat 12421 acctaagcct atagagttag aactatgcgc ttccctgtaa aggaaaaatg ctgtgcttgt 12481 ttagtgtagc aagagtgcat accttggaca acaaaataaa gtgtttgcgg aaaaatgcag 12541 actaggtaga ctgctgaggt cgtagttcta agttgctctt tggtaatttg actactgcct 12601 taggtatagt ttgcaacaaa aattgaatag aaattaagtt tacgaaagag agatagatct 12661 ctaactgtga ttttgcagtg tcaaaaacag ctattaaaag taagtttgtt tagctgtgac 12721 tctgctaaat attattggat tttatatttc ttatacacaa gtgttggtaa aaaaatggtt 12781 tacggtaaaa tgcctatgat aatttttata ttagcttctg gttatttgtt agcagtttat 12841 ctactattag ctctagcaaa gagaacggga aaaaaaacta ctgccacaag tgtttcttta 12901 gggatgcact ccaaaggggg caaaagcgca taatcttaag tctggataat cactgacgat 12961 aaatacaatg tagtgtcaat tagccttcat cagtgattaa gtatttagtc aagctttttc 13021 cccttgacta aatacttaat caacaaaaca ttaattacta taaagtgtga catttgtcaa 13081 ctgtactgct tcagtgactg gggtataacc tagccactgt acagaggact gagcaaattg 13141 ttgtggacaa ccactgaccg caaagcgagt tggcaaaggt gggtaggtat ttctcaaacc 13201 caaaatatct aaatcttggg cacaagcagc aacgacatat tctgctggat caactagctt 13261 gacgtaggaa gggaggagcg atcgcaaaac tggtgccaac agaggatagt gagtacatcc 13321 atagactaat gtgtcgatct catgcagcaa tagtggttcc agatatgagc gtgccacttg 13381 tagtgtgtag ggttcataaa tgcggttttg ctcaatcagc ggcacaaact ctgggcaact 13441 cacttgccag acttgggctt cgggatttat ttctaggata gcatggcgat aggcattgct 13501 tttcgctgtt gcagcagtgg caatcacacc aatgcgcttt ccactgttaa ctgctgctct 13561 tgccccaggt aagatcagac cgagaattgg cacagagaat tcctcacgta ccgcctccag 13621 cgctagggct gaactcgtgt tacaagccat aacgaccatt ttgacacgct gctgttgcat 13681 ccaggtgaga atttcacgcg taaactgtaa aatttctgtt tgcgaacgaa tcccatatgg 13741 aagtcgagct gtatccccaa agtaaataat cgattcatta gggagttgac ggtagagttg 13801 gtgtagaact gttaaaccac ctacaccact gtcaaaaata ccaattgggg cacgttgcgg 13861 tgcttcagaa aaatcataaa 
gattaccttc aaaggtaaaa aatggaaaca caggcggata 13921 tcgttaattt aataagttgt tagtggttag ttgttagcta ctagtgatga atcgttaatg 13981 gttagttggt agttctcagc gcttggttgt tagtggttaa ttgttccact cacgactacc 14041 cactacccac tgacttgaaa atagaactca gcaactcctg actcagaact cagaaaacaa 14101 gactcagaaa aaaagaaaaa ttttcatgtg tccggtgagg gagtgctcgg ctagagtttc 14161 acaggcttag acgcccgcca gaaggcgaac cgcagggtag ggtagcaact gccgctcttg 14221 ggtcattcgc gttggcaggc atacccgttc gcccaagcca cgtgccgtag acatagggtg 14281 gtttgcagtt catgacggct acttatcttt gtcgcaagta tagtaatata ccgttagcga 14341 ttgcctctgc cattcgattt tggtattcag ggcttcccag tctaggatta tcttcctgac 14401 cagtcatgta gcctgtttct actagaatcg agggcataga actttttctg agaacataga 14461 atcttgcttt gcgtactcct cggtctttga gggtggggat acttcggaga atgttactgt 14521 ggataacacg agccaaatct agcccactgt cgtaataata agtttccaat ccattcacgt 14581 cagggcgacg atcaacagaa ttagcgtgaa cgctgacaaa taaatcagca tttgcttggg 14641 ctgctatatc cacccgtccc tgaagggtaa caaaatagtc agtatcacgc gtgagtgcaa 14701 cctgtaagcc attttgctgt aaaatggtcg ccacccttct tccaataggt aaaacgacat 14761 ctttttccaa cagtccaccc aaaccaggag cgccagaatc gtgaccgcca tgtcctggat 14821 caacaacgat gacaactcgt ccattaggca cgcggggacg cggcattggc tgcggcatca 14881 tctgcggcat tggcagcgga ttgttcgtca ttgatcctga ggataatggt tgtgggtttg 14941 atcgtggtaa gggaggtaga ctaaaacggg gagttaatgg agaaaacgag cgttgtaatt 15001 ctaaagacaa aagctgacta ctcagttgat tgagttgtcc aattcgcact cctcctgctg 15061 gttggatgta gacggcaaca ctacgcggat ccagttgttg caggcgtacg cgcagaactg 15121 gactactggc gtcaaaattt ggacctctga cgctagcagc taacttagca ttgggaatga 15181 ttatgcgata taaaccagaa gctctatccc aaccagtatt tacagatgaa aggggttggt 15241 ctcctctaat aagcagttgc gtaccagtag cagtcagttc tacagactca attgtagata 15301 ttgagtcact agctggcggc attaggttgg agcttcgagg cgtgacattc ctaggtatct 15361 ctgagttagg aagtaaacta ccattaccat cactactatt acttgtactt gtagtatctc 15421 tggacaattt gcttgcatca ccattaggca gaattaccaa accgccaaag ctactcatgc 15481 ttgctcgcca gtcggggcta tttctctcca cccacagcgt catacgaaca ccaggtgttg 15541 
tttttagttg agtgaattca atacgtttta taccatactt attgactggt gtattctgtt 15601 ggaagaaacg tggtgacaga gctgcaccaa gtatatcaat gttgatagca ctcttgtcac 15661 tgctacgaaa tgtctggatt tgaggatttg ggccatttgt gcggacaaac aaaccatcac 15721 ctgtaacacg taagctctca atttgagtca atccttgagt attgctgata agtgtttgtt 15781 tttgagttgg cctatcaaca ggattgacaa ctatatcttt acttgctgcc gaatcagtat 15841 tgacaattac gtctttagtg gctgtggagt tgtcaggcct taccacacta tagacattca 15901 aagaagtgtt gccagaagat gaagcagctt gttcaacttg tggagttggt aactgtactg 15961 tccagcgact agaactttta ccttcaaatt ttacctgctt tggatcgagg gtataacctg 16021 gactaagttc aatgactata cgagttgttt gtccatcaaa ctgaccgaca cgaacagtac 16081 gaattgcact actattcacc ggctgaatca actgcgaacg tccaaagtca gttccaggca 16141 aatcaataac cagtcgcgtg gggttgaaca ggagttgcgc ttggggttga acagcacctt 16201 cagtgttgat ttctaatcgg ttttgactgg catcaaaacg ccaaaattgt aatttcgcag 16261 cttgagcctg cgattgtagc atgaagacag tagtagttgc aacagtaaca ggtaataacc 16321 agttcatttt cacagtcttt tctcctgatt cgtacttctg agatctgtca agatgtcaaa 16381 gtttagggaa atcctcacac tagcaccgct caaatcttag ccgcaaatac gttattaact 16441 cgtagaaaga tcttaatagc gctcaactgt ggcatgccct ctcagttttt gtatgctttt 16501 ctggcacaga aaatgcagtt tgcaaagcat aatcatcaca ccaatctaaa aataatagag 16561 tagtcaatct tggtcatttt tgcttaactt tgtgagaact ttatattaat ttgtctgatt 16621 cctagatatc ttagtccgtg ttaactgatg tggtaaacac agcacactct ctagagtaaa 16681 atctagaacc tgaaagcgaa tcgccctctg aactagacca gacgctacgc cccacataag 16741 agttctctgc agaacttacg ctagtgggta gacttagact ttggcattaa cacaagactc 16801 attttgtcaa aaattatatt taatattgaa acaaaatagt agtttcaggt aaagttgtat 16861 tctggtgtca gcgatgacca gactccagac acaatccctg agatctgaac aggggaaagg 16921 atcctactag acaacttcaa taagtattcc cggacacaaa gtatcaaaaa ttttacaaat 16981 aaaaaccgga acttaagttc cggtaaaaga gtttgctcaa ccaattatca gctacctagg 17041 cagttatcaa ggttttaaac actgataact gcctaaatag ctgcttgaac acagtgctaa 17101 ttaagacact gatattttaa atgttttact tttgctttaa gtactgaaga acaccatcag 17161 caatcccctc tgccatttta ttttgataca gtcggttgga caacttagca 
gcgtcctcac 17221 gaccagttaa ataccctgtt tccaccaaaa ttgctggcat agaatttttt cttaagacat 17281 agaatctggc tttgcgcaca ccacggtctt tcacattgac acttcggaga atgctcttat 17341 gaacgatctg agctaggctt aaaccactgt caaaataata tgtttctaga ccactcacgt 17401 ctggacgatc atcacctgct gagttcgcat ggacactgac aaagacatca gcattggttc 17461 gctctgctat gtctacccgt ccttgaagag tcacaaaata gtctgagtca cgcgttagta 17521 ttgcctgtaa accattttgc tgtaagattt ctgccagcct tttgcctatt ggcaaaataa 17581 tatttttctc ttgaatagca ccaataccaa gtgctccaga gtctttaccc ccatgtcctg 17641 gatcaacgac aatgacgact cgtcccttag gagcacgagg cttctggact acaggcttgc 17701 ttggctggtt ctgtaatgcc aaaggctgtg ggttttgtcg ttgtatgggt ggtaaagcaa 17761 tgggtggttt gatagaaggg atacgttcta atcccaaaga caaaacctga ccattcagtt 17821 tgttaacttg cccaatttgc actcccaatg ctggttgcac atagacaaca acagtacggg 17881 ggtcttgttg ttgcaagcgc acgcggagaa cagggctact agcatcgaaa tttggacctt 17941 tgaccgcaga agccaacttc gcattgggaa tggtgatgcg aaattgatta gatgactgat 18001 cccaactagt actagcagat gagagacgtt gatctgctct aatcagcagt tgcgtaccag 18061 taccagtcag ttctacagcc tcaattgtgg agattgcatc gctttttggt gtctccacag 18121 caggagtggg aggattatct gatggagtgc tggaataata atttctgggc gagtcactcg 18181 cataacgatt cttcggtaga acgactaaac catcagaacc atcagaactg cttgtgctta 18241 tgcgccaatc aggagtatct ttgtctaccc gcagcgtcat tcggacagca ggtggtgttt 18301 tttgcagttg aatcatttca actcggttca caccatgctt atttatagat ctatcgcgag 18361 caccaaaatt cggtgacaaa gtagcactgg gaatgtcaac gaagatggtt ttgcgatcgc 18421 ggctgcgaat gacctgagtc tgaggatttc caccactggt acgcagaaaa aaaccgtccc 18481 ctgtaacttg cagactctca acttgaattg ttctttctgc aacgacaaca gccttagaaa 18541 actcaggttt tttatctgtg tcagaatcta ctgtcaccac attgtaaata tttcttggcg 18601 gtgagatgac gacttcttca ctttttgggc ttggagcttt tggagttggg gatgctggtg 18661 aggaggcgac tccttcagtt tgtggcgttg gtaattgtac cgtccagcga ctggcacttt 18721 tcccttcaaa ttttacctgc ttggggtcga gggtataacc aggactcagt tcaacaacta 18781 tacgagttgt ttgtggatca aattgaccaa cacgaataga gcgaattgca ccgcctactg 18841 actccgtcaa ctgcggacgc ccaaaatcag 
ttcctggcaa atcaataacc aaccgtgtag 18901 ggttgaaaac gagttgcgcc tggggttgaa ctggaccttg agtattgatt tctaatcggt 18961 tttgattggc atcaaagcgc caagattgca atttcgcgct ttgtgctgct tgagctggcg 19021 acaatagcgt caaaagacta gtcgttacaa aggtaccagg tagtaaccag tgggttttca 19081 cagtgttttc tcctgattcg tatttatgag agtctatgaa gtggagaagc acccagaaca 19141 caaaacgcag aattcagtac agtgaacagt gaacagttat cagtaaacag tgaattgata 19201 actgataact gataactgaa taactgactt ctgagttttt tgttggtcaa agtttcggta 19261 aatcctcaca ctatcaccgc ctacaccatt actttggtgg cacaagcatc attttctgat 19321 ctcataaaga cacagttata gtgattattt tttgaatata atccaaattt ggtgatagca 19381 aagcgtgtca tttacctaat atctattaag gtttaaccat taggagtaga tgttgttcta 19441 tattaattct tctcaggaac taaattgaat gattggcgtc aaattctaac acgctgccaa 19501 tagtttttca tgccgttatg gcattttttc gttgttagat gtttaattga cgccctctag 19561 gcggcttcac gcaatttgcc gcaaatcaca ttaaagtatc cgtcaatagc taaacacttc 19621 atccgtcaaa aagagatttt ttatattatc tgatttggta gtcgtcaatg caccaactga 19681 taatacttct actaaatcct tacgggagca tcccaatgca tgagaaaata cgcttgtcat 19741 tctttctgta aacggttctg ccgtgagcta gtccggtgaa cgggttcggt caggctacag 19801 gagccagtgc tcaggcgcga gtttcacagg cttagacgcc caccagaagg cttcccgcaa 19861 tctgccgcac ctagcgtgaa ctggcgctct tcgtagcgtg tccggaggac atacccgata 19921 gccgtaaggc gtagcgtcag ccatagggct tcccgcaggg tgtgaaacga agtgcagcgt 19981 atctcctgta taaacgctct tgcgaacagg atgacaatga atttagagat ttgggatgtt 20041 actctttcct gatgtgaact tgacttttca gtgcacctgt ttaactgcta aagaaatacc 20101 attacttgga tgagtgagca agaaggaaga taaaacaaaa acaacattcc ctaatgagtg 20161 ctggcgttta gaacaaactc cttaattttt tagtttaagg agacgaagat tttgggagta 20221 aagtttccat tagtaataac aaagttacaa aatattcata ttctgaagtt tagtaggtca 20281 tgaacttgtc gtacagaagc acggatgcaa atttcttttc agagtagatt aaaaattgac 20341 ttgataagac ttaagggaaa tgcaacagag tggagcgttg attatgatag ctagtcaaag 20401 cactcctgag ctttttgcct cctcaagaaa aaagacaatc aaatcgggga gaacgctttg 20461 agctatgact aacccgcttg gagtgcgatg ctcctttgaa gcgttgtaca cctatgagaa 20521 ataatacgaa 
cgttggatga atgtgcccgg aaatatcggg aattttgttg ccagtataaa 20581 ctccaactta aacaaaagga aaaacacttc acctggacaa attcacgact cagtcatcat 20641 ctaagtacag acgctgtaaa taatgcgtct gcactctaga attttactta tcgtcccgag 20701 aatctggtga ttctataacc aacgattttc cggaagaatt atctaaaagc acttcctcta 20761 ttgctgtagc tgttgttggt tctggaaata cgaaccgcaa aaggggaggc gcgagaaatg 20821 ttgtcaggat gaccatcata ataattgccg cctccaaagg tttcgagaga atgccaatgg 20881 aggagccaat accaagaaag actaatccta cctcgcctct aggaatcatg cccacaccaa 20941 ttgccaaacg gttgatttga ggctgaccaa atacacttaa gccagtgaca actttaccta 21001 ggatggcaac aattatcaga aaaatcgcca tcactaaacc ttcccggttg ctaggaattg 21061 ctgggtttaa aactcctaaa tcagtttttg caccaacagt cacaaagaaa attggcacca 21121 gtatatcagc aattgggata acttgctttt gcagttccac acgctcatct gtttcgtcta 21181 agactaaacc agcagcaaaa gaacctaaaa ttgcctctag ttgaatggca gcagcaaagt 21241 atgccatcac aaaggcaaag acaagtgctg gtatgactaa tccaccacga gtttttaact 21301 tgctcgcaat tgctacaaag gtattactga aaagatttcc tagaacaatt gcgcccacca 21361 ggaaactact ggcactaatg atcagataaa taactttacc gacatccacc gaaccttctt 21421 ttgcaagact agccaccact gctaaaacaa taattcccaa cacatcatca ataacagcag 21481 cgcctaaaat aatttgccct tctttagagt tgagacgccc aagttctgac aataccttgg 21541 aagtaatacc aatacttgta gcagtcaaag ctgctcctgc aaaaattgct ggtactgctg 21601 gaatgccaaa taaagtcatt aatcctaccg taccagcagc aaagggtact gtcaccccta 21661 ctactgctac tacactcgct tggataccaa cagctattaa gtcttttatg tttgattcca 21721 aaccaatttc aaataggaga atgatgacac ctagttctgc caatatggaa atgacctcag 21781 actgggcttt aaagacagca tcagcagcgt ctggattgag accaccagtg gtttgaagga 21841 aggtcatgat aacagagttg gaactatcag cccccccttc tggaaacact aatagatgga 21901 ggacagaaat acctaccacg acaccaccta ccagttctcc caacactggc ggtaacccaa 21961 agcggtttga tagttccccg cctactttac tggcaaggta aatgacgacc atactcagca 22021 gcactgctgc cagtacaagc gaactatctg ctgtttctgt tgcggttgcc aacaagggta 22081 atgagaaggt tgtgaaatct aactgcatcg gttattttgg aaaatacttc cttttaattt 22141 acagtttgtt gaaagttacc atagaagcgt tacacaaagc attgattaat caattcattc 
22201 cggctgttag ttggacaagg cgctgccagt gagactctga tactgggact acggatagtc 22261 tactcaggcg tagtaagtca aaaccttcaa aattttcgtc ctgcttaatt tgggttaggg 22321 tgattggctg agccactctt cgtatcgctc gtacctggac aacaacccgt ttagcatcat 22381 ctaattttgg atctggataa ggttgagtga cgacctccgc tacacctatc actcgccgct 22441 ctttccctgt gtgataaatg aataccctta cgggttcacc agtcgcctac ggcgggaaac 22501 ccgcctgcag cgctggttca ccaaatcagc aatttccatc gtacgcagat gcttaagagc 22561 caaaggattg ctcactccat cccaaactgt agtgctatcc cgttctaaat cggagtagga 22621 atactcttct ggttctgttt tcagcagcca gtatcgcaca aatattctcc ttctattgtt 22681 tgcagttaac agttaacagt aaaccaatag ctatttttta gtttctaaaa atgtaactgc 22741 ttagacttta gacagtaatc gtgtcaaagt aaaaatacct ctcatataga cacactactg 22801 gctgtgtctt ctccgaatgg atagggagtg ttagttgtgt gttgtttatt gttagcgatg 22861 agtggttagt tgttggttca agctaaccac tcaggatttt tcacaactta ccagatcttc 22921 atagatgaag aatttttaga ggagtgaacg ttttgtgaat acaaatgctg tatttggttc 22981 tcagttcaat gctgggaacc ggtggaaatt tctatcttta gcattgcttg tatatataag 23041 tagtacccaa cctgctttag cacagcagaa agaaaggatg ttgcgtactc tgagtgttaa 23101 tggtcgtgga atggagacaa ttcctacaac tttgtctcaa gtcagtttgg gagtagaggt 23161 tcagggaaaa acagcacagc aggtacaaca agaagctgct cgcaggtcat ctgctgtggt 23221 tgctttacta aaaagccgta atgtgcaaaa attacaaacc acaggcatta ctcttaaccc 23281 agtttacaat tacgacaata aagtgcagcg tattacaggg tatgctgctt ctaacgttgt 23341 cagttttcgc attcccactg aacgcgcggg tactctgtta gatgaagcgg tcattgctgg 23401 tgcgactcaa attagcggca ttagttttgt tgcaactgat gaggcgataa cccttgctca 23461 acaacaagca ttaaaaaaag ccacccagga tgctcaacag caagctcaag ctgttttgag 23521 tactttgggt ttccagccga aagaagtcgt cagcattcaa gtgaatggag ccagcgcacc 23581 tccaccacca agacccctct tggatgctgc tgagcttggt aggctaacaa caaaacaaaa 23641 tgctgccacc actatagttg gtggtgaaca gcaggtggaa gctacggtga cattacaaat 23701 tagttattag tcattagtta ttagctttac ctattttgac cccttcgggg ttcgccagtt 23761 gctttatgcc ggggaacccg tccaccgcac tggctcactt ttgactaaat tagaaattat 23821 cagggttcat atctatgtca ccatcttcgg cgtcccgttt 
agaacctagt agttgcagct 23881 gatccacgat gataacgggt gtagatcgat ttgctcctgt ggcgcgatcg ctccaattat 23941 caaattttaa ggaaccctta acagcaattt gtttgccttt acgcacgtaa tcacctgcca 24001 cctgcgctgt ttttccccat aattccagat taaaccaatc agtatgttcg ctgtctcttg 24061 tacgccgatt gacggcaagt gttaatttac acttaacgct acctgactcg aaatatttca 24121 tatccgggtc agttcctaca cgaccaatga gggtaacaat atttatgctc atgtgcagtt 24181 tatccagtac taatgtactt atattgtgaa tatcatagta gcgaattgct atgttatcag 24241 caaagcgttc tgtaaaaaat cattacagtt accgccgaga caaatgatat gaatcactct 24301 gtaaaaactc taaaacaata cggatacact tatgccgcct gggaatatag aacaatagaa 24361 gcttgagact ggtcaaagat tacgaaaaaa tatataaaaa tataatctag tttactggac 24421 tcaggttaaa aaacataata cgatcctgca caattctcat ttgtaacagt actcaaaaaa 24481 tatcaaaatt agggggcata gagacgcgtg ggtcttttta gtaaattttc cttatcgcgg 24541 gatatgggta ttgaccttgg taccgcaaac accctcgttt atgtatcagg taaaggcatt 24601 gttctgcaag aaccttctgt agtggcgatt gacctaaacg aaaaggttcc aatagcagtt 24661 ggagaagaag caaaaaaaat gctcggtcgc acacccgcga atgtaattgc ggtgcgcccc 24721 ttgcgtgatg gtgtcatcgc tgacttcgat acagcagaag tgatgctaaa aagctttatc 24781 caaagagtca acgagggcaa atctctagta ttaccaagaa ttgtcattgg tattcctagt 24841 ggagtaacag gggtagaaag aagagctgtc atggatgcgg cttctcaagc tggggcaaga 24901 gaagtttact tgattgatga accggtggct gcagcaattg gggcaggact acctgtgact 24961 gaagcaacag gcaatatgat tattgatatt ggtggcggga caacagaagt tgctgtgttg 25021 agcctgcaag ggacagtgct ttctgaatca gtacgcattg caggagatga gctgaccgaa 25081 gccatcatgc agtatatgaa gaaagttcat aacctagtca taggggaacg cactgctgag 25141 gacattaaaa ttcgcatagg atcagcatat ccgactcatg atgatgatga tgccatgatg 25201 gaagtccgag gcttgcacct gctttctggt ttgccgcgaa ctgtcacaat taaaggacca 25261 gaagtacgtg aaagtatgtc ggaacctttg ttggtgatta tcgaagcggt gaagcggact 25321 ttggaacgca tcccgccaga gttggcatca gacatcattg accgaggaat tatgctagct 25381 ggtggaggcg cgctgcttaa aggagtggat accttaatca gccatgagac tggaatagta 25441 acacatatcg ctagtgatcc attggaatgt gttgtacggg gaacaggtcg tgtattggaa 25501 aatttcaagc agatggaaag 
aattttcagc gggcgttctc gcaatatgta acacaaacgc 25561 tatcagatat tgggttcgat cgcgttgcgg gaatccaata tcttaattta tattacagct 25621 aaaaaaggtt tcttatgttt acggcacgtc ggtggtggga gcgcagagga ttacaaatag 25681 ggttgctagg tatagttgtt ggtggtgctt gggtgcttcg acaaactcaa ggtgctctta 25741 tgtttgagat atatcaagag ataactcgtc caatccagat gttgcagaca ccaccagctc 25801 aagaagaacg tctcaaggat gctcggtttt tagaactgca aacccagatc gcagaactga 25861 aaaaccaaaa taaaaagtta aaacagttat taggctatgt agaaaaagaa ccatccacat 25921 cccgccctgt gatcgcgcga gtgatgacac gtggtgctga caactggtgg caacaagtaa 25981 ctctcaatcg tggaagcctt gcaggtattc aggaaggtta tatcgttaag ggtgatggtg 26041 gattggtggg tttagtagaa agcgtaactc ccaataccag ccgggtgctg ttaattagtg 26101 acctcaagag tcaagttggt gtaacagtca gccgcacagg ggcaaagggt gttttgcgag 26161 gagattcctc tgctgatgga atcctggaat tttatgaaaa agtcccaaat gtaaaacctg 26221 gagacgtagt tactacatct acctatagtc agaagtttcc ttccggattg gctgtcggac 26281 gagtgaagtc tttagattta aagaaacttc cagcatcagt ggggaaagtg gaacttttcc 26341 cgtcgatacg ctctttggat tgggtgacgg tatatcctaa accacaaacc ccgccaccag 26401 aaaatgtcgg ttctgtcaat caaaagtcag aaaaatctaa atgaaacaat gaggactcct 26461 caattacaaa ccagtaggaa aaaaaagtca aaatcgccaa gacgaaaatc caaaattcaa 26521 atcgtccccc tttctcgttg gcacccgcgc ttacttcagt tcactgattg ggcgataata 26581 actggatcag taatgttatg cttactgatg ctgttaatcc gagtccctgg tatggaatta 26641 ttgggaatag gaccaaactg gcctgtgatt tgggtagttg cttggagtgt caagcgcaca 26701 gcttttagtg gcgcattggc aggtgttgtt ttgggtctac tacaagatgc tatgacatca 26761 cctgatccaa ctcatgctct aagtttggca gtggttggaa gtttgactgg tctgttgcag 26821 aagcagcgtt ttatacaaga agattttatt tctattgcct taattgtgtt tggtatggca 26881 ctgtggtcag agacaatttt tgcatcgcag ttgattttaa tgggcgatcg caatccggct 26941 gatgtctggg cacatttcca gaaagtcgcc cttgcctctg ccattctcag tagcctctgg 27001 gcaccagtga tttattttcc cctaaatcgt tggtggcagc aagtaaaatt agcacaacag 27061 tcataaatca gtgaagaaga attcaaaatt cagaaatctt cttacaaaga accgagtgcg 27121 gagtgtggat aacctgggtt taaactcaga aaggagtgag gagtgaggag tcaattaatg 27181 
tgactcttca ctcctaaatt tttctgaact atacgcatcc tttaacaaaa taaattcaag 27241 ctaaagttaa ttcatcttta tgtagctaca aaattattac tctgttgcta agaattatgg 27301 ataatttacc aatggtcgaa ccagatgctt tcatcgcaga atcagaattt caggatgatt 27361 cacatccaaa gcaaacactc ttaagtgact ttgaccacgc tatgatgcgg cggtgtatag 27421 aactcgcccg tcgcgcttta ggacgcacct ccccaaaccc aatggtgggg gcggtgattg 27481 tcaaagatgg cgagattatc ggggaagggt ttcatccgcg tgctggtgag ccgcatgcag 27541 aagtttttgc cctcaaagca gcaggtgaaa atgctcgtgg agcaacgatt tacgtgagct 27601 tagagccttg caatcactat ggacgtactc ccccttgttc agaagcgttg gtagccgctg 27661 ggatcgctaa ggtggtcgtg gggatggttg acccaaatcc acttgtcgct ggtggtggta 27721 tagcccgtct acgggctgca gggatagaag ttttggtggg agtggaagaa caagcctgta 27781 agaagctcaa tgaaggtttt atccatcgta ttcttcatca tcgacctttt ggcattttga 27841 aatatgccat gactttagat ggcaaaattg cgacgacgac tggtcatagt gcatgggtga 27901 caaataagga tgctcgcggc gaagttcatc aactgcgagc agcttgtgat gcggtgattg 27961 tgggtggaaa cacaatcaga ctggataatc cttatttgac tagccatcgg gaaggagcac 28021 ataatcccct acgggtggtc atgagccgta gtctggattt acctcaaaac gctcgtatct 28081 ggcaaaccgc agaagctccc actttggttt tcacagaaaa aggagctaac cccgattttc 28141 aagaactgtt gcggaatctg ggggtggaag tggtggagtt aacaccactc acaccagatc 28201 aggtgatggc gtacttatac aagcggggat tttgcagcgt gttatgggag tgcggtggtg 28261 ttttagctgc tagtgcgatc gcccaaaaag cagtgcaaaa agtttttgct tttattgctc 28321 ctaaaatcgt tggtggtgtt catgctccca cacctgtggg tgacttaggt ttgacctcca 28381 tgactgaggc gctatccttg gaacgtgtag agataagggt agttggttct gactgtttgg 28441 tagagggtta tttgccaagt catcagttat aagtcataag tcatgagtta tgacttatga 28501 cttttaggaa tactgacgtt cttgagccaa atcacggatc agtgctagcc gagatttaac 28561 gtagtcactg aggaaatcgc tttcataaca atctagcaaa gcttgggttt tgatttgttt 28621 gaccaaccat tgtaccaagc tagcatcatc gagctgtaat agcgtcttag tttgtgcagt 28681 ttccactaca gaccaaagct gacgcataac tttgggagtc attagacctc cattgaagat 28741 tagcttagct gttttcagga tacacaattg cggtaatagg tttcggtctt aatgtagagg 28801 ctgacataag tgttattaaa aacttaatat attgagttat aagttgatga 
tgattaagga 28861 ttaataattt taaacaatac atgaacaaaa agtagtattg ttatgccctg catgagtctt 28921 gactttttgc ttgtgttgtt ctgtgtctca acgtctacct gtgctgtacg ttcgcttatt 28981 tttcatattt taaacacttt tgtaaatact tgttaagcgc acaatcaatt ttattttgat 29041 tgagtagtaa aaccaaatat gctgattgct aagtttgaga aacttggttt tttcaagcta 29101 tacttttgtt tacaaataac caatgaagga gatccaaaaa aatgcttacc cttcaataat 29161 tgattgggca ctttttatgg cgatcgcaat ttttacccaa tacaggcagc tactgaaaag 29221 ctatagtata aattcaaact aaccttccac gagataatga gtctacctca gccaattggt 29281 cgccaaaaag aggtgctata tctaccagca cgggggcact tcgctgtttt aggaactgcg 29341 ggaagtggta agacaacact tgctattttg cgatccgctt ttcttgctga tccaaggact 29401 gaacattgcg gcaagacgtt gttagtcaca tttaataaag cgcttgtaac ttatttaaac 29461 caccttcaag atagaaaact agccaacgtg attatcgaga actaccatct cttcgccagg 29521 ggatatctcg cttttcgcaa taaaatgtct cgtcatgcga ttctcactcc agatgatcgc 29581 gaagctttag tgaagcaagc ggtaaaaaat atttcacagc atcatagttt gcatcctctt 29641 tttgactatc ctgtggagct tttttcagaa gaaattcgct ggatggcaca ctatggtatt 29701 actacttatg aagagtatca aaactttgat gtacttagtg atgtagaaat gcgttttaag 29761 gggaaagaac gggagttagt ctttgaaatc taccagactt acctaaacct gcgtcagcaa 29821 agcggtaaaa agtatgactg ggacgatata gcaactactg tttgtgaaga attcggtgct 29881 gatacatcaa aacgtttata taagcacatc gtaattgatg agggtcagga cttttcgcca 29941 caaatgattc gctcccttgc ctgggctatt cctgcggatg gttcattgac attctttgga 30001 gatgttgcac agcaaatata tggacatcgt atatcttggc gagatgcagg acttgatatc 30061 gaacaagttt gggaatttaa acaaaattac cggaacacca aacaaattgc caaacttgga 30121 cttgctatct caaagatgcc ttattttaaa ggtgtccccg atttagtgga accagtttca 30181 ccacctgctg atggtctgtt accaacaatc gttgaatttt cttctcctga ccaggaaata 30241 ttatttgtcg tccaccaagc tataacgcta gctaggacgc agaacgtggc aattcttttt 30301 cgcgatcgcc aagatgaaaa acttatcgga caatatttac caaaaggaac tatccgtctg 30361 catcgagaga tgacaacttg gcaagccggg gctggtatta gatatggaac ttatcattca 30421 gcaaagggtc tcgaattcga cgcagttatc ttgccatttt gtaataataa gaaattacca 30481 gaccccgaag cagtagaagc tttcggtgaa 
gcagatgcaa gcgcccaaga tggaagatta 30541 ttgtatgttg gtgtcacacg tgctaaaaca cggctgatta tgacgtactg cggtgaagtt 30601 actagtcttt taccaagtga taccagcctt tacgagaggg tgaaaagatg attgattcga 30661 ttaagcgtga agctgaacgt cgcagcataa cacgtctttg tcattttacc ccatcgcgca 30721 acctcgttca tattttgaca ggtgaaacgg gaatactggc aacaaaacat ttacaaaaaa 30781 atgagcgcag tgtttttaca ccaactgacc taaagcggct tgatggacat caaggataca 30841 tctgttgttc aattcaatac cccaatgtct ggtatttcaa tacagcaaaa tccaaagata 30901 ttttatttag ggattgggtt gtgctattca tcaatccgaa gtatctctgg ttggctggta 30961 ctcgtttctg tccaagaaat gctgcatctg cttttggtag tactattgcc gaaggtgagg 31021 cagctttttt gtcaatgttt gctcagtctg tatctggtgc tggaggacga acttttagcc 31081 gttggactaa tcatttagcc tgctgtccca cagacaacca agcagaagtt ctgattccag 31141 accaaattgg aatatctgac attttagcga ttgcagtacc aagtgaaacg caggcaaaaa 31201 acgaagcagt acgccttgac attctgggta tttctgagga gaaatacaaa ttcgtagtag 31261 caccagattt atttgataag tacaatttaa gcaacttgat tcgctcaggt aaaagaccag 31321 atgaaacacc ttggaaagtc ggggaggaac tatgattagt aatgaacagc acctacatac 31381 aatctccctt atgacaaaag gacaaggtgc atttcttggc gcagctgttg gtgatgcgtt 31441 gggatggtct caagaaccag aagctaagag aattgataaa aagacttctt ctccagcaga 31501 agttttagat aatggattcc aacaatgggt gcggaagtca ggggggcaat actaccctca 31561 tcacgaagtt attcttgctg gtgaatatag tgacgatacg cagctaattc tctgtactgc 31621 cagaagctta ctttatggcg cacgctggtg gcaccattta actaagcggg aattaccaat 31681 ttggacttct tacgaacgtg gtggaggagg agcaacaaaa cgatcagcgc agcaatggct 31741 tgcaggaata gagccttggt cttctcctga taaggagaaa aagcgatatt ttggtgctgg 31801 tggaaatggt gtggcgatgc gaatattgcc tcattgtttg ttaggtgcga cagaaactaa 31861 ttttgaaaac attgcgaaaa atattgttgc taatggtgtt acaactcacg gacaccccag 31921 agctttggtg ggtgcgcttg cttacggttt tgccatttgg gtagcattac gagagacgaa 31981 tactttgcaa tacggtgcaa ttcttgaaaa agttttatcg gaacttaatt cttggtctgt 32041 tttaccagat ttaaatgata tttgcccgaa ttggaaaaat tcagctttgc aaacaactga 32101 tggacaatat gacgattctt ggcaacacac tattaaggag atgttgcaac ttcttgagtt 32161 atgccaagag 
ggaatgaagc aaggagcgct ctcgattgag cgagaagttc ttactaaact 32221 tggatgcttt aataaaagtg tcaaaggagc aggaacggta agtgctgcgg ctgccatatt 32281 tctggcatcg cgttttgccg caaatccgtt ctatggtctt ttagaagcag gattcgctca 32341 cggtgctgac actgatacga ttgcgtcaat gactggtgga attcttggtg ctatagcggg 32401 aattgaatgg ctaggaaatt acgcagtaaa agttcaagat gcaaactata tccgagatat 32461 agcagaacat ttagccagaa accaaggcga ttcacaaact caacaagcag ataatttcac 32521 aattaagaaa acgcatctag atgctttcgt aaaacaaatc gaagtgtcaa aacctgctga 32581 tagtatattg attccagatg gaagaaaagc ccaaatatca gcttctgtaa atcatcagtc 32641 tatatctaaa tcaactgtgg cgacatcgtg gaagctgact actgctgaag gtcagtcttt 32701 atatgtcaaa aagctttcac gccctaaaaa caacacagaa attaactctg agttaaaatc 32761 tattagtgtc tctgaacata attttgaatt ccaacaagtt aatattctcc atgtaggaat 32821 tgaaattcca gttagcaatc ttgataaatc tcgttttttc tatgagaaag ttttaggaat 32881 tcaagtagaa caagaatcaa agttgttggt aagatgtggg agtattgtat taattaatat 32941 agaagattat aaaaaaaggc acggctttta ttcagggact actaaagcac ctactactca 33001 cacgattgac cttgaagttg aatcactcga tgaagcttat aataatgtga gcaaagtaga 33061 agcaaaaatt gtaaaagata tttcaaaaaa ccatgaaaga cgttactttt attgtcttga 33121 tcctgatgat aatcagatta gaatttttga agtaaagttt taatcaaaaa ttacgaacct 33181 gtatggcgct tcatttcggc ttcgccccct agcagaaatc taaagatata gcgttcgcgg 33241 agcggctcct ccggagcttc gcctaataaa tttttagaac ttcaggaaaa gcatattgtc 33301 attttgtata aaaaagtaaa cagtaaagat gtttcagtaa acaagtgagt gctatccatg 33361 ccgattattg aaagtaaaga tgtcgtcaac ggggaagaag tcgttatcta tattgaggta 33421 gataatattc caccgagtag aagcccatac gaaaatgtac ggggtgttga cacagctaga 33481 gttgtggctg cggctcgtga tgtttttggg gaagcaatgc agctaacccg tagttgtgcg 33541 aagcgggttg tagaaagcgt caagcaaatg gaaaaggaga cacgacccaa tgaattagag 33601 gtcaagcttg ctatcaaact ggattcagag gttggtgcag ttatagccaa agttaatacg 33661 ggggcgcaaa tagaggtgac aatgaagtgg aagtcaacag gggagtcgtg aacgaaacaa 33721 cgacgattca gatattgcct tacacgctga ttgtcggtca gcagcagatc aagctggctt 33781 tggaactagc gtatattgcg ccaaggattg gaggagtgtt gctcagtgga caacggggaa 
33841 ccgggaaatc tacagctgtg cgtgcttttg ctcagatgat gtatgagcgc ctcccggtga 33901 ctctacctat taatgctaca gaagaccgag tggttggggg ctggcggatt gatgaattga 33961 tgcagagcaa ggctgttccc cagaaaggat tgctggaaga agcaaatggt ggcttacttt 34021 atatcgatga agtcaatctg ctggatgacc acatcgtgaa tattattctt gatgtcacct 34081 cgacaggagt gctggtggtg cagcgggaag ggcaaagttt tcaaaaacct gtttcttata 34141 ccctggttgg tactatgaac ccggaagaag gaggactgcg accacagtta ctggatcggt 34201 ttggcttgat ggtgagtgtg gaagcagaac aaaatgaagc tgaacgcacg atgattttac 34261 aaactgtttt agagtttgat gaagcgctct cccagttaaa agtgggagaa acgtcagctt 34321 atatcaatga agcgctgcaa aaagataaga aacgcaaagc gttacttgag aaagcacgcc 34381 agaattttta taacgttaaa gtgcctgtga gcgttgctag aaactgcgtt aagttagcaa 34441 taaagttcaa tgctgaaggt aaccgaggtg actatatcat tgctttagca gcgcgtgcct 34501 atgctgcgct tgttgatgcc aaacaagtca caaacgacca tgtcgcagag gtcgctcgat 34561 tggcacttca gcatcgtcgc cctgaagtcc tccagagcaa ccaaatgccg tggagtgatg 34621 aagatgatga gcaagtgatg aagatgatga acggtgagtg attcggcaaa attatcgcaa 34681 ctgagtttga atgatctggt gatgcgatcg ctcgcctgtg ctgccatcag cccaggtctt 34741 cgtagtattt tgatgttcga tacgacacct aatgctttgc ggttggcagc tcaaaccaca 34801 gcccagatgc tggaagtggt gactggacat ggagtcactc tggtaacatt gggaacctat 34861 gaagcagaag acgatttgtg gggtagttgg gggttagcgg gtgataacga gcaacaattg 34921 ttgcaatgga aaccaggttt gctgtctcaa cattcctgtg acattccata cgaatctata 34981 aataaattcc aacagcggat agtcgtcatt ccagacttaa ctaagctgag tttggcagca 35041 gcacgagctt gtgtggtact gatggggtct gatgtagcac atttggagcg tcacgggcag 35101 caggtgcatt ggcaacccaa tatttgctgg atggcaggct gtgctagtgg tgaagtggga 35161 atggtttctc ctcacctgct cgatcgcttt gccctgcgct tgagtggaca ggtggcaaaa 35221 acagtagaca gagcagaatc tattctggag ttgcttgacg agcggaggtt aggcaaggaa 35281 aagccgaaac ctttgtcaac tgaaattcgc gagcaacttc tcaaagctct gcaagtccat 35341 cccaaaatga cagccaaagc tttagccaga atttttgatt atacattaga agtggaagtt 35401 tatagtcccc gtcgagagat tgctttagcg cgactgtctc tagcaaatgc acggctagtg 35461 ggtgcaaagc agatgacaac ggagcaggtt gatactgcgg 
ctgggatgat tggcttgaca 35521 atcggtatca agcagaaagc aaaacgctca actcagtcct cagaaccaca gcctgaactc 35581 actaaactta cagagtcaac atcggtctct acaccctcat cagaacagct agagttatcc 35641 ccagagcaag aacctgtcta tgagtcggat gaaccagaag aactacccgc tacaccatta 35701 acatttaatg cgattcctgt caatccttac ccagaggatg aagcacctat tgagcgggaa 35761 gcggcttcgc tgcgattacc acctcgtcga tttcgagcat cggctgctgc tcgtggggtg 35821 attattggtg tggagaaagc cacaaatatg cgggatttag ctttggtgag gacgttacta 35881 gaagcagcaa agttccagcc aattcgtcaa caaggaaaaa ctcatagtca acggcgcttg 35941 gtgttgtcgc tgacggactt gcatagttat cgcagagcac ctgtagccga gcaaatgctg 36001 atgctggtac ttgatcacac ttgtttattg gattgcaact ggcaagagga gttattacct 36061 tacctgaact gggcgtatgt ggaacgagca agtgtttgct taattcaggt aggagcagca 36121 caagcaagcc acgagttgcg agcttatcag gtgatggcgc aaaacattct agtacctagc 36181 atcagtgcag gaattgaggc agaatcaggc aaagcgactc ctttggctca tggtttggat 36241 ttggcattgc agactttacg ccatgctttg caacacggac gcagtagagt tcagcgggcg 36301 gtactggttg ttattagtga tgggcggggc aatgtgcctt tagaagccag tcgcttcggt 36361 aggataacac cacctgtggg atgtaaggga gttgacgatg ctttgcaagt agcagaacgt 36421 atccgtggat tggatggagt gaaggcggtt ttgttgaatc cgcagccaaa gcaatatgct 36481 gatttaccat tggaactggc taagacattg ggtgcaactg tggttgctat tgggcgtcta 36541 gagcaagtgg aggtggaatg atggctgcta aatcaaaact ttcactgact caacgtgcaa 36601 taacctcgcc gacaggtttg agtcgaacca aacagctacc ctttgaggca aaagaaacta 36661 caacagcaac acttgctgta tcggatttac cagaaaatat agaagctgcg tgcgattcgt 36721 tgtgagtgaa tcccggtaat cagtccgcaa ataacctcac caa // LOCUS NODE_711_length_36394_cov_4.83301736394 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 36394) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 36394) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..36394 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..711) 
/locus_tag="DP116_05760" CDS complement(<1..711) /locus_tag="DP116_05760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137985.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="restriction endonuclease subunit R" /protein_id="PRJNA477356:DP116_05760" /translation="MPQSTPEYIHVEKPTIDQLISMGWQHIEGDKFNSQITERENFKQ VLLIQRLKTLIKRINLDDNGNSWLDDIQVNAVVSQLERLAASRLMEANKAATELLLSG TTVLGKDGKQHIVHYIDFEHPENNDFLAINQYRVDPLWITADKGFIVPDVVLFVNGIP LVVVECKSPNLDNPITAAINDLLQYSNQRNSTQPEGAEKLFHYNLLMIAASRGRAVAG AVGANHEYYVEWKTDLTPL" gene 951..1319 /locus_tag="DP116_05765" CDS 951..1319 /locus_tag="DP116_05765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740720.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05765" /translation="MSKLFEILQKIKAKPGMYIGRASVSDLFHFLVGFKTALRELGVE ATEEEINFYREFQPWVQKKYHVSTSNSWAKIIMLHCTNEQEGFSVFYKLLDEFQNREK NLGDDSFGESKTKQGAKMQQ" gene complement(1344..2684) /locus_tag="DP116_05770" CDS complement(1344..2684) /locus_tag="DP116_05770" /inference="COORDINATES: protein motif:HMM:PF01420.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05770" /translation="MREVLVKVEEFKDSSLGKIPKDWETVTLKDEINLLHGYAFEGKY FSDRPPGEVLLVPGNFHREGGLYFDENNTKYYQGTIPNNTVLNNGDLLIVMTDLSPRT LILGRVVQLELPFKVLHNQRIGKIIPKLPDTWDKRFLMLVMNSHRVRRNIISNATGTT VRHTSPDRITTNVVPKPSRQEQSKIAEILDTVDDAIAHTSSLITKLKQIKAGLLHDLL TRGLDENGQLRDPEAHPEEFKDSALGRIPKEWEVHPLQNFTLSSAFGPRFSANAYDEK GNIATLRTTDMDDEGNLNLSTMPLAKLSLDNYSLHFLEVGDLLVSRSGTCGITSVFLG FDIPVLPGAFLIRFRLKNGLLAEFVRRYFNWDIGRERVLREAEGGVQKNLRGSTLLKL LIPVPPEIEQREILRVLDVKELCILKEEAYLKKLKLQKQGLMHDLLTGKVRVNC" gene complement(2709..3308) /locus_tag="DP116_05775" CDS complement(2709..3308) /locus_tag="DP116_05775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015183061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05775" /translation="MAYSDFTLSKFKKSFNIRIDEEIDLFANVEPVQVSDKLITNLEE TAELALAINTEKARSEMIITPILLEVRRQANYQISLFSGTDFNVDAEKGLNGYCDFII SRSKEQLTINAPVVIIVEAKNENIKGGLGQCAAAMLAAQLFNQEEGNDIKTIYGAVTT GDIWKFLKLQGSDVFIDLNNYYIKEINNILGVLSQGVQI" gene complement(3540..4610) /locus_tag="DP116_05780" CDS complement(3540..4610) /locus_tag="DP116_05780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015183060.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05780" /translation="MDYLDFIPTITERGQIKTFIESDAGIQQQEGKLMSVFAAWWQMH SPSLGELPKTKKVMELRAEFLSSFVDSLAPVGLLDRFKVAGVVASWWDEVKYELRTLS ESDFSGLVDSWVDTIKDALEQDDEEKKSKPLFDPLNHKLVVRLMPDYLAEIATVEASI AELEQQKETFERPEEAEAEAGEEGEEEAEAVNLVKELDTKLKHLKNLIKEPKKELKIL KKSPLLNADKIAEIEKLIKQTEVEIAEIETQLEPLKEITKQLKEAKAKLKTLKKELVK RLDVARAVLINEECQGLVLGIFKDGLIVELERYVTAHRQQVIAAVENWWDKYRVTLQD IETERDAAAKRLNEFLQGLGYA" gene complement(4610..6040) /locus_tag="DP116_05785" CDS complement(4610..6040) /locus_tag="DP116_05785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211636.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I restriction endonuclease subunit M" /protein_id="PRJNA477356:DP116_05785" /translation="MLFLKRASDVFEQQYEQILQQNLQRGRSFEEAKMRAENPMSYRE TFFVSEKARWIYIRDELHKNVADGLNKALAALEENNQALAGVLGHIDFNRQVGKSRIP DAKLRELIQHFNKYRLRSEDFVFPDLLGAAYEYLIKEFADSAGKKGGEFYTPREVVQL MVRLLKPQAGMSVYDPCCGSGGMLIQAKQYVEECGDNANNLHLCGQDNNGGVWAICKI NMLLHGIRDADIQNEDTLLNPLHINDGELMRFDRIISNPPFSQNYSRKDIKFSQRFTY GFCPETGKKADLMFAQHMLSVLKNNGVMATVMPHGVLFRGGEEKKIRESFIKNDNLEA VIGLPPNLFYGTGIPACILVMRPKNAKPAERQGKVLFINADGEYYTGRAQNYLRPEHI EKMTWVSENFVSLPGYSAVVDEEELAANDYNCNIRRYADNAPPPEPHDVKAHLLGGIP KAEVEAKRKLLAAHGFDSSKILVERN" gene complement(6157..6318) /locus_tag="DP116_05790" CDS complement(6157..6318) /locus_tag="DP116_05790" /inference="COORDINATES: protein motif:HMM:PF12161.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05790" /translation="MGKLTLPQLERHLFSAADILRGKMDASEFKKRIQKSEFRSQNQS DGDLDPNRL" gene complement(6486..6557) /locus_tag="DP116_05795" tRNA complement(6486..6557) /locus_tag="DP116_05795" /product="tRNA-Thr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(6523..6525),aa:Thr,seq:ggt) gene complement(6651..6733) /locus_tag="DP116_05800" tRNA complement(6651..6733) /locus_tag="DP116_05800" /product="tRNA-Tyr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(6697..6699),aa:Tyr,seq:gta) gene complement(6854..8674) /locus_tag="DP116_05805" CDS complement(6854..8674) /locus_tag="DP116_05805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459122.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_05805" /translation="MANFRDILNYFRPYWSLSIFSIAASSVYEIIDLVVPYGIGQILN VLSNQPLDKPLQGAIATISNLTNNPVNKPLELGVLLSFIFIVTVLKAPTQPWLTTWFH WDITLKARRDQNQKAIEKILTLPLEFYDENNPGRIAGRVARGVTNHTWSYPEISGQLI PKLFRVLGIFVFIWFVDWRVAVLYLISFVIILSFTLRKLQQLIWRENRLDKYMEDTES RTSEIISNIKTVKAFAAEAKELQRQKRRLIRELRVTETRIHKGYVKLNTWQKTMIQFC VFTVLGLTLLETVKGQISLGHFVMTLTLSSMAYAELEPISVLAEVFARRYPSMVRFHE FLKVPIRVDGVGLLEERNIANNPYQFTGKVEFSHVSFGYQSERPVLEDINLLIEPYQT VALVGRSGSGKSTLVKLLLQYFEPQQGQILIDGQDIRSLDVGNYRRRLAIVHQEVDVF NGTLLDNLKYGKPDATFEQVQEACRISRLDEVIQQLPQGYYTVVGERGVRLSGGQRQR LGIARALVVEPDVLIFDEATSSLDYESERSIQLAMRSILGTRTTIIIAHRLSTVREAD KIVVLDKGKIVEIGSHDELLRHKGIYRRLHSLQQTGELVS" gene complement(8809..8988) /locus_tag="DP116_05810" /pseudo CDS complement(8809..8988) /locus_tag="DP116_05810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008179364.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction 
method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(9008..9769) /locus_tag="DP116_05815" CDS complement(9008..9769) /locus_tag="DP116_05815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871801.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05815" /translation="MNRFWSFCLIFVFLLFHAEAQAGELSERLAHFPEWEKLTSVRPA VGDLVYPNWMMGSWQVKSTLIDLVAPLAPDIVTPGFEGNRQYLNQPVSFDVRFVRETT PHSGLKIIPRTSSKSAVVADRAFNGLNLARSYLGDTVLSVKVDPNSPNRQITFLRGER QLVSIVTARATETTPDGKFLTTEVFQQLFKGGGRPYFNTVESTTAYRKLDQSNPAIEA DQITAVYLSPQDPNYFAALSRPIALYRYRLEFFAQ" gene 10136..11008 /locus_tag="DP116_05820" CDS 10136..11008 /locus_tag="DP116_05820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868520.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05820" /translation="MSISSSFTDFSLAELFQLIDQGRKSGCLTVCTLPDLHTPGSKSH YYYIWFRLGCVVAAANRLNGQSLTHNMTQRGWVNQQTIEQVWTQTPAALPLGLLLKTQ GVLSTEQLNLLFASQLHQVRELFEIQKGVFKLDTKADLPLQEMTGLSLRTLEVALMAV RGLKNWNMLAEVLPDVSSGIRSKTNDKPQIHLNTLEWQVWELADGSVSLSAITYKLNQ SITIVQQAAFRLMLVGLVEEVSLAESTLSLENYPMNSNSDNSFTSGFKKSKPLQTRNV SASFLQNLVGFLKS" gene complement(11276..11968) /gene="plsY" /locus_tag="DP116_05825" CDS complement(11276..11968) /gene="plsY" /locus_tag="DP116_05825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318989.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-phosphate glycerol 3-phosphate acyltransferase" /protein_id="PRJNA477356:DP116_05825" /translation="MAIWLSLCGAVLVLAYLVGSTPTGYTVAKRLKGIDLREVGSGST GATNVLRTLGKGPGAFVLVIDCLKGVLAIALVYWLFNFAPSQNLIPPEVNPQLWEPWM VILSGLAAILGHSKSIFLGFAGGKSVATGLGILLAMNWQVGLATFGVFAIVVAISRIV SLSSIAGAVGVSVFMILLHQPLAYILFGAVAGLYVIWRHSSNIGRILAGTEPKLGQNL QLEAAESVNSAS" gene complement(12047..13336) /locus_tag="DP116_05830" CDS complement(12047..13336) /locus_tag="DP116_05830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3086 domain-containing protein" /protein_id="PRJNA477356:DP116_05830" /translation="MNPEEFQTPQPTDESLNHKEQLKVKQPENSKVESVVEIAAQNSK VDTQNEQSSSAVSDSDAEELNLIDELTPESTVEVVTLLEDEITALGSDVESKSQNYRQ QEELAQQVTLLQSQKEALKEEIANLQASYKTLYSQLGETQMTMTQLVQEALSGLEQRK AALQITVEQLERRQERIRNEMRTTFAGTSQDLAIRVQGFKDYLTGSLQDLAAAAEQLQ LVPKPKPEPEKPEVKEVKEAQAQSATPQFAQQQFQDTTKQVRRLIDQYRTKPDYYGPP WQLRRTFEPVHAERVSNWFFSQGGRGALRTTGSRLQNILIASAVISILHQLYGDRLRT LVLANTPERLGEWRRGLQDCLGIGRPDFGPDRGMVLFETAEALAQKADRLVKADQLPL ILIDDSEEQISLALLQFPLWLAFAPDPKMMRNYDDDY" gene complement(13475..13864) /locus_tag="DP116_05835" CDS complement(13475..13864) /locus_tag="DP116_05835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131798.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3119 domain-containing protein" /protein_id="PRJNA477356:DP116_05835" /translation="MTTSYTSNPASTTVELKPSYTIPLVLVVAAVPMLIVQPSVGGLM GLFGLFLMFQAVTLRLLFTPTDLDIYRGEKLIRRFPYREWQNWRIFWNPVPILFYFKE IKSIHFLPILFDPKTLKTCLEQRCPRI" gene complement(14034..14831) /locus_tag="DP116_05840" CDS complement(14034..14831) /locus_tag="DP116_05840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131799.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_05840" /translation="MSETTSKSSLGAWSQRLLAAVFLGGQVLVHLLRGRIHRRNTLEQ MAAVGPDSLFIALVTAVFVGAVFTIQVAREFINFGAGSTVGGVLAVALTRELSPVLTA VILAGRVGSAFAAEIGTMRVTEQIDALLMLKTDPIDYLVIPRVLACCLMLPILTLLSL VTGMFGGLIIAINIYNLSENVFLDSARNFLGIWDIASAMIKACCFGILIAIIGCSWGL TTTGGAKGVGQSTTTAVVTALLIIFISNFFLSWVMFQGTGSSSLQGL" gene 14942..15154 /locus_tag="DP116_05845" CDS 14942..15154 /locus_tag="DP116_05845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873820.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05845" /translation="MGSQCVAERSLSGGFLRSKLRRVPPTVPSGVVSPMSDWQTRKGH WSLVKNYSPLLGCSLTPRIPQSSTFG" gene 15446..15733 /locus_tag="DP116_05850" CDS 15446..15733 /locus_tag="DP116_05850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05850" /translation="MTVMMNFLRSLLLSIIFSFVAPMFLFGGVLFALSVAGYIPGLQE VTEVVSSLITEFLSVFGSGTPVGGLFVICSTCSFVGALFDTYAYYRYQILR" assembly_gap 16261..16270 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 16286..17107 /gene="sppA" /locus_tag="DP116_05855" CDS 16286..17107 /gene="sppA" /locus_tag="DP116_05855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318984.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="signal peptide peptidase SppA" /protein_id="PRJNA477356:DP116_05855" /translation="MVWPFKPNFRKQIARIEITGAIAGATRKRVLEALKTLEERKFPA LLLRIDSPGGTVGDSQEIYSALKRLREKVKIVASFGNISASGGVYVGMGAQHVMANPG TITGSIGVILRGNNLERLLQKVGVSFKVIKSGPYKDILAFDRELTEPEQSILQELIDT SYQQFVQTVADARSLAVETVRSFADGRIFTGQQALELGIVDRLGTEEDARRWAAELAG LDPEKTPVYTFEEPKPLLSRILPGSRQASSGLGAGLGWVEFEVSTSGLPLWLYRP" gene 17174..17554 /gene="aroH" /locus_tag="DP116_05860" CDS 17174..17554 /gene="aroH" /locus_tag="DP116_05860" /EC_number="5.4.99.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874730.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chorismate mutase" /protein_id="PRJNA477356:DP116_05860" /translation="MEWRMRAIRGATTVPENTVEAMREAVMELLDELEKRNQLHPTDI ISVTFSVTHDLNATFPAAIARSRPYWDSVPMLDVQEMEVDGSLKRCIRFLVHAYLPVS TPIYHVYLRQAAQLRPDWNVPQLL" gene complement(17557..17739) /locus_tag="DP116_05865" CDS complement(17557..17739) /locus_tag="DP116_05865" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05865" /translation="MVSKLFVIFKNKGREQQPNEAHTRLLLSDQGQKRIEQAVLILIR YTCLTIANMHNAINCQ" gene 18286..19530 /locus_tag="DP116_05870" CDS 18286..19530 /locus_tag="DP116_05870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318981.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_05870" /translation="MLTKLQGERYQVVQVLGQSLFGQTYLVQDTHLPDHPTCVVKHFL PSSQCPIPVEIRRRLFTREVEALKKLDNYDLVPHLLAHFEDNLEFYLVQQFIEGHPLT AELSPGDSWSQSKVFQLLYEVLSILNFVHSYGLIHRDVKPSNILRRKQDNRLVLIDFG AVKPIWNQLILNQAKTSNFIPLEYTTIAIGTPGYMPHEQQRGKPRPNSDIYALGMIAI QALTGVHPTQLPEDRNTNEILWQELAQVNDELALILNKMVCYHFQDRYNSAKEALEAL APLTHLYTSTQEWAPTLIPQNVTFDNQNSLPEQNVKQVFGNNPSAPLFNNQTSDISEL EIVELLSKLYTPTQESAPTSRLDNDQTISISPGKLTLLIGLMLGVVSSLIFIVFSYWS VQVIGPIPQIQNSSSEPPEGLR" gene complement(19815..20717) /locus_tag="DP116_05875" CDS complement(19815..20717) /locus_tag="DP116_05875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318979.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="beta-carotene hydroxylase" /protein_id="PRJNA477356:DP116_05875" /translation="MLTSEAKKPLTIPPKEFLAPPGDFNPTLLMFLAAVAILVLSNFG YWVWQWPHWLCFAANTLALHIAGTVIHDACHQSAHRNRVMNAMLGHGSALMLAFAFPV FTRVHLQHHGHVNHPEDDPDHYVSTGGPLWLIAVRFLYHEVFFFKRQLWRKYELLEWF ISRLIVISIVYISVQYHFLGYILNFWFIPAFVVGIALGLFFDYFPHRPFVERDRWKNA RVYPHPILNLLIMGQNYHLIHHLWPSIPWYNYQPTYYLMKPLLDQKGCYQSIGLLQKK DFFEFVYDIFLGIRFHHHKSTKND" gene 21169..22938 /gene="pyk" /locus_tag="DP116_05880" CDS 21169..22938 /gene="pyk" /locus_tag="DP116_05880" /EC_number="2.7.1.40" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318505.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyruvate kinase" /protein_id="PRJNA477356:DP116_05880" /translation="MQLKDSVRRTKIVATIGPATSSPETLKALIEAGATTLRLNFSHG SHADHQRNIRLIRQTAFELNQPVAILQDLQGPKIRLGKFDNGSIVVAKGDRFTLTNRP VVGTQDISCVTYDYLAEEVPAGAKILLDDGRVEMVVEEINRDKGDLHCRVTVGGVLSN NKGVNFPGVYLSVKAMTDKDREDLMFGLDQGVDWVALSFVRNPQDMIEIKELISSTGK RVPVIAKIEKHEAIEQMEAVLALCDGVMVARGDLGVELPAEDVPVLQKRLIATANRLG IPIITATQMLDSMVNNPRPTRAEVSDVANAILDGTDAVMLSNETAVGKFPVEAVATMA RIAERMEQEVWLNTNASQVRDTKHSIPNAISQAVGQIAEQLGAAAIMTLTQTGATARN VSKFRPKTPILAVTPHVNVARQLQLVWGVKPLLMLELPSTGQTFQAAINVAQERELLS QGDLVVMTAGTLQGISGSTDLIKVEVVTAVLGQGIGLGQGSVSGRARVVYTGMDASNF NYGDILVASGTSADFVEAIRKAGGIITEEESLNSHAAVIGLRLGVPVIVGVKKATQVI RDGTILTMDMQRGLVYSGAVGTP" gene complement(22952..26081) /locus_tag="DP116_05885" /pseudo CDS complement(22952..26081) /locus_tag="DP116_05885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311746.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 26415..26804 /locus_tag="DP116_05890" CDS 26415..26804 /locus_tag="DP116_05890" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05890" /translation="MQILHSCSHYFESPLREMIKRILATIALIIAVTFFAPPAFASIN SSNPDSTVITGKDDIVNASLPLFIYQTAKSSDNTWGRVVRLSGFGNFFDIGIDEQGNF FINSPEDTKTSHALKISPTGVVTIPGK" gene 27220..27849 /locus_tag="DP116_05895" CDS 27220..27849 /locus_tag="DP116_05895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008049736.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_05895" /translation="MFALVSPEKIQLPAGAVVRLPATWEDYQDLCHWRGDGSIPRVKY RSGEVLLMSPLPKHGRDAHLIANVITVLLDHIGREYDAFTPVTMELPQKSGIEPDYCF YINHWEAVSGKERIDWSIDPPPDLVLEIDVTSYSDVNDYLPYQVPEIWLFRKKQLYVY QLQGTEYLVQTQSQYFPNINIQDIVSRCVEVAYERNTSAAIRELKQRLG" gene 28643..29512 /locus_tag="DP116_05900" CDS 28643..29512 /locus_tag="DP116_05900" /inference="COORDINATES: protein motif:HMM:PF00022.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05900" /translation="MNSNRGENLKRKAIVIDITGGLVKAGFAGENAPSVVFPTLVGRL KSESMGLNEIYFGDEAALKRDVLAMTSPVEKGIVTNWDDFQKLLEYTFSALKVNVQEC NVLITDIFWNSQSNREKLCQMLFEFGVAGLYLAHDAVLSLYASEKSTAIVINITNDFT DVVPICEGCSIPHARRRISIGKENLVSSEPFRPLETLFEPTLTKSIVSAISNCDPKIC QTLYKNIVLAGEGSMFEGLPERLEKEVRSLAPSGVTVKVVASPQRKDFAWIGGSMLAS LTTFETMWFTKEE" gene 29584..29997 /locus_tag="DP116_05905" CDS 29584..29997 /locus_tag="DP116_05905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009787122.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05905" /translation="MQLTIDLPEQTFQRLAQIAELTNQSLEDLIVQSVTGNLPPAVET APSEIQAELLELQTLSVEALRQIAQSQVSPFQQDRHMALLERNQDGLLTPQEQQELRQ LTLAADQLMLKKAHACAILRWRGQPIRTLSQLSPN" gene 30001..30423 /locus_tag="DP116_05910" CDS 30001..30423 /locus_tag="DP116_05910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009787123.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_05910" /translation="MSSISQSIRQQVVSEAKQCCEYCMTQQLLIGMPLVIDHVIPRSA GGSDKRENLAACCYRCNEFKGAKTQATDPMTGEQAPLFNPRQQIWSDHFAWTNAGTHI AGLTAIGRATVETLKLNNDYIVEARKIWVAQNWHPPEL" gene complement(30499..30705) /locus_tag="DP116_05915" CDS complement(30499..30705) /locus_tag="DP116_05915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408781.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05915" /translation="MFATWCFTNSNSLNSSLILKALQSTYVNPIDVYKLDSLGLICFK GDRILPSCELYRAYFEKQLATTRI" gene 30658..31452 /locus_tag="DP116_05920" /pseudo CDS 30658..31452 /locus_tag="DP116_05920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013325800.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(31580..32677) /locus_tag="DP116_05925" CDS complement(31580..32677) /locus_tag="DP116_05925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877772.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_05925" /translation="MSNILFNLSIVFSQPTGISNYAKNLFPYLKTLKPTLLTAQSYPE FNCYSIPDKLTPAQGTKGHFDRLIWTQFQLRKIYKKLKSQLIFSPLPEAPLYANCHSI VMVHDLIPLRFPKLLSPLMHYSRLYVPQVLAQAQHIICNSHATAKDIIDFYHISASKI TPIPLAHDTNHFRPIHLDSDGQDTRLTKFPYFLYIGRHDPYKNLHRLISAFAGLSHNR DYELWLAGPQDKRFTPLLQTQVQELEITGKVKFLNYVPYSELPKIISGATALVFPSLW EGFGFPVLEAMACGTPVITSNLSSLPEVAGDAAIFINPYNVDEMTQAMQIIATDSVLR QHLSTQSRNRASQFSWEKTGLATVEVLSGYL" gene complement(32670..33260) /locus_tag="DP116_05930" CDS complement(32670..33260) /locus_tag="DP116_05930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_05930" /translation="MVVQIPPRLYSIEEYFALEEATNYRTEYRDGEIVPMTGSSINHN QIVVNLIVTLALSLTLKEQNYHIYANDLGLWIPRYRQYVYPDVLIIKGEPVFEEGRTD TILNPCIIFEVFSKSSSSRDRGDKFTYYRSIPQFQEYILINQYQIHIEQFSKTPESNW LFSESDADDGVLTLNSANCQISHRQIYERVKFDINE" gene complement(33412..34248) /locus_tag="DP116_05935" CDS complement(33412..34248) /locus_tag="DP116_05935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318967.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_05935" /translation="MIYFLVVNYYSTNLITKLITSLPSYDSHDYKVVIINNSPDDNSI YHLKSELVLIFNAESNVGFGGGCNLGMKWIYTQDAHGLVWIINPDAYFLEIFLEKVRL FFEGHPKISILGTIVHTTTGEVWFAGGRFISSTGAIITQDLLTNTDTDYVACDWITGC SLIVNLRNFDECPQFDPAYFLYYEDFDFCRRYANQGHLIAVTKQFGVIHQPSSITNKY VFRKIKNSTYGYLLSLDRYTNQWILNIRLLRLISHALILSFVKPQVAFGKFAGVFMYW RR" gene complement(34398..35129) /locus_tag="DP116_05940" CDS complement(34398..35129) /locus_tag="DP116_05940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318966.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05940" /translation="MTYSLRIADIPVTERPRERLLTHGPKILATAELIAILLGTGQGH GKLSAVGLGQYILQQLSKHQRDPLAVLRDVSAAELMQIPGVGPAKATTILAAIELGKR AFQSRPGERTLIDSPAAAAAALSQDLMWQNQERFAVLLLDVKNRLLGTQVITIGTATE TLAPPREIFREVIRQGATRVIVAHNHPSGNVEPSQEDIELTRQLLMGGQFLAIPVLDH LILGDGNHQSLREITTLWEDYPQGD" gene complement(35258..36196) /locus_tag="DP116_05945" CDS complement(35258..36196) /locus_tag="DP116_05945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456404.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribulokinase" /protein_id="PRJNA477356:DP116_05945" /translation="MSRPIILGIVGDSAAGKTTLTKGIAQVLGPENVTVICTDDYHKY DRKQRAEIGITALHPDCNHLDIMQQHLSQLRIGQPILKPVYSHKTGTFEPPVYIKPSK FVIVEGLLGYSTRGACESYDVKVYLAPPESVRAQWKVKRDTQKRGYTEEQVLEELRKR EPDSEQFIRPQRQWSDIVVSFYPPSDDLEQTNGHLNVRLVLRPTIPHPDFLQIINYGD GSFESAIRLGLDRDMSKPVDVLEVDGHATLEQVNKLEHILCADMPYLKNICDREGNPE LGKVAGTTGETIQSYPLALTQLLITYHMLKATQIYQ" BASE COUNT 10488 a 7945 c 7411 g 10540 t 10 others ORIGIN 1 gaggggagtg aggtcagtct tccactcaac gtaatactca tggtttgcac ccactgcacc 61 agctaccgcc cgtcctctag aagcggctat cattaacagg ttgtaatgaa agagtttttc 121 tgcgccttct ggttgagtgc tattacgttg gttggaatat tgcaacaagt cgttaattgc 181 tgcagtgatg ggattatcta agttgggact tttgcactca acgacgacaa gcgggatacc 241 gttgacgaac agcacaacat caggaacgat aaaaccttta tctgctgtta tccaaagcgg 301 atctactcga tattgattga ttgccaaaaa atcgttattt tcaggatgct caaaatcaat 361 atagtggacg atgtgctgtt taccgtcttt tcctaaaaca gtagtaccag aaagcagtaa 421 ttctgtagca gctttatttg cttccatgag tcttgatgct gctaaacgct caagttgact 481 caccactgcg ttgacttgga tatcatcgag ccaagaatta ccattgtcat ctaagttgat 541 gcgtttgatg agcgttttca agcgttggat gagcagaact tgcttgaagt tttcgcgttc 601 ggtgatttgg gagttgaatt tatcgccttc gatatgctgc caacccatgc taataagttg 661 gtcaatggta ggtttttcta catggatata ttctggcgtt gattgtggca tggtggcgat 721 ttttacatgc agcgatatat ctatcatgca ccaggctcaa ctgtttcttg aagaggaagc 781 agccagattg gtagaaatta ctaattcttg ccaatataca tatatatagc aatcttattt 841 gagttgtaaa aattatcgaa ccgccaagac accaaggacg ccaagaaaaa agagaagaga 901 gaatttcaca aatcattcag gattgctata gatgataaga gaatttagag atgagcaagt 961 tgtttgagat tttacaaaaa attaaagcta aaccaggcat gtatatcggg cgtgcttctg 1021 tcagtgacct ttttcatttt ttagttggtt tcaaaactgc tttaagagaa ctaggagttg 1081 aagcaactga agaggaaata aatttttatc gagaatttca gccttgggtg caaaaaaaat 1141 atcacgtttc aacctctaat tcttgggcaa agataattat gcttcattgt acgaatgagc 1201 aagagggttt ttctgttttc tataagttgt tagatgagtt tcaaaatcga gagaaaaatt 1261 taggcgatga tagctttggg 
gaaagcaaaa caaaacaggg tgcaaaaatg cagcaataag 1321 tggtgaaagg tagttagtga agattagcag ttaacgcgta cttttccagt taacaagtcg 1381 tgcatcagtc cttgtttttg gagtttgagc tttttaaggt aggcttcttc tttcaggatg 1441 cataactctt tgacatcaag cactcgtaaa atttctcttt gttctatttc aggtggaaca 1501 ggtattaaaa gttttaggag tgtacttcct cgaagatttt tctgaacacc accttctgcc 1561 tcacgaagaa ctcgctctct accaatatcc caattaaaat accttctaac aaactctgct 1621 aacaatccat ttttcaaacg aaatctaatt aagaatgctc caggtaacac aggaatatca 1681 aagccaagaa aaacagatgt aataccacaa gtaccactgc gagaaactaa aaggtcgccg 1741 acttcaagaa agtgtaagct atagttatcc agggaaagtt tagctagggg catagtggat 1801 aaattaagat ttccttcgtc atccatgtct gttgttcgca gtgtggcgat attccctttc 1861 tcatcatagg catttgctga aaatctcgga ccaaatgcac tagatagagt gaagttttgc 1921 agtggatgta cctcccattc tttcggaatt ctccccagcg ccgaatcttt aaactcttca 1981 ggatgcgcct ctggatctct caactgtcca ttctcatcta acccgcgtgt gagcaagtcg 2041 tgcagcaagc cagctttgat ttgtttaagt tttgtaatga gggaggaggt gtgggcgatc 2101 gcatcatcca ccgtgtccaa aatttcagca attttgcttt gttcttgacg gctgggcttg 2161 ggaacaacat tcgttgtaat tcgatctggt gaagtatgtc gtaccgtagt acccgttgca 2221 ttgctgatga tgtttcgtcg cacccggtgg ctattcatca caagcattag aaagcgttta 2281 tcccatgtat caggtaactt tggaataatt ttcccaatcc gttggttatg taagacttta 2341 aaaggtagtt caagctgaac aactcgcccc aaaatcagcg ttctgggtga aaggtctgtc 2401 ataacgatga gtaaatcacc attatttaat acagtgttat tgggaattgt accttgataa 2461 tacttggtgt tgttctcatc aaagtaaaga ccaccctctc tgtgaaaatt acctggaact 2521 aaaagtactt ctcctggagg tctgtcagaa aaatatttac cctcaaaagc atagccgtga 2581 agcaaattaa tctcgtcttt tagagttaca gtttcccaat ccttcggaat cttccccaga 2641 gacgaatcct taaactcctc aactttcacc aaaacctcac gcatacccca acctctgcaa 2701 aaactacctc aaatctgaac accctgagat aaaacaccca aaatattatt tatttcctta 2761 atataataat tattcaaatc aataaacaca tcagaacctt gtaacttcaa aaacttccaa 2821 atatcccccg ttgtcactgc accataaata gttttaatat catttccttc ttcttggtta 2881 aacaattgtg ctgccaacat cgccgccgca cattgaccta atccgccttt aatattttca 2941 tttttagctt ctacaataat caccacaggt 
gcattaattg ttaattgctc tttagaacga 3001 ctgatgataa aatcacaata accattcaat cctttctcag catcaacatt aaaatctgtt 3061 ccagaaaata aactaatctg ataattagcc tgacgtctta cttctaataa gattggcgta 3121 ataatcattt cggaacgtgc tttttcggta ttaattgcta aagccagttc tgcagtctct 3181 tctaaattcg ttattagctt atcactgact tgtacaggtt ctacgttggc gaataaatca 3241 atttcttcat caatacgaat gttgaaactc tttttaaact tacttaatgt aaagtcgcta 3301 taagccatta ttaacacctc aacttatata gcaatcctaa atcatgagcc gctgcgggca 3361 cactacgcgt tcgcatgatt gcgtgcgctt tgcgcttacg taaaattctc tcttctctgt 3421 tctctctttt cttggcgctc ttggcgtctt ggcggtaccc tgcgggaagc cgctgcgcgt 3481 ctacgtttaa taaatataaa ggcactttac gctataatac agaattagta taagatagtt 3541 taagcatacc ccaacccctg taaaaactca ttcaaccgct tcgccgccgc atccctttcc 3601 gtctcaatat cctgcaacgt cacgcgatac ttatcccacc aattctctac cgccgcaatc 3661 acctgctgac ggtgcgcggt gacatatctc tccaattcaa caattaagcc atccttaaaa 3721 attcctaaca ccaaaccctg acattcttca tttatcaaca ctgcacgcgc aacatccaaa 3781 cgctttacca attccttttt tagcgtcttc aattttgcct tagcttcctt cagttgcttg 3841 gtaatttcct ttaatggttc tagttgtgtt tcaatctcag caatttctac ttcagtctgc 3901 ttaatcaatt tctcaatctc agcaatttta tctgcattca ataaaggcga cttctttaaa 3961 atcttgagtt ccttcttcgg ttccttaatc aaatttttca gatgcttaag ctttgtgtct 4021 aactccttga caagattaac cgcttctgct tcttcctcac cttcttcccc agcttcagcc 4081 tccgcttctt ctgggcgttc aaaggtttct ttctgctgtt ctaattcggc aatactcgcc 4141 tcaactgtgg caatttccgc caaataatcc ggcatcaacc gcacgactaa tttatgattt 4201 aaagggtcaa ataaaggctt acttttcttc tcttcatcat cttgttccaa agcatcttta 4261 atcgtatcca cccaactatc aactaaaccg ctaaaatctg actcagataa agttcgcaat 4321 tcatacttca cctcatccca ccaactcgca acgactcccg ccaccttaaa acgatcaagt 4381 aaacccacag gtgccagact atcaacaaaa gaactcaaaa actcagcgcg gagttccatc 4441 actttttttg ttttcggcaa ctcacccaaa gaaggtgagt gcatctgcca ccaagcagca 4501 aacacactca ttaacttgcc ttcttgctgc tgaatgcctg catctgattc tataaaagtt 4561 tttatctgtc ctctttcggt aattgtggga ataaaatcta aataatccac taattgcgct 4621 ccactaaaat cttacttgaa tcaaacccat gtgctgccaa 
taatttgcgc ttcgcctcta 4681 cctcagcttt gggaattcct cctaataaat gcgctttgac atcatgaggt tctggtggtg 4741 gggcattatc agcatagcgg cggatattgc agttgtaatc atttgctgct aattcttcct 4801 catccacgac tgcggaataa ccaggcaaag acacaaaatt ttcagacacc caagtcatct 4861 tttcaatatg ctccggtcgc aaataattct gagcacgacc agtataatac tcaccatcag 4921 cattaataaa caaaaccttt ccctgacgtt cagcaggttt tgcgtttttg ggacgcatca 4981 ccaaaataca agcagggata cccgttccat aaaacaaatt gggcggtaaa ccaatcactg 5041 cttctaaatt gtcattttta ataaaacttt ctctaatttt cttttcttcg ccaccacgaa 5101 acagcactcc atgcggcata actgtcgcca tgactccatt atttttgaga acagataaca 5161 tatgctgggc aaacatcaaa tcagcttttt tacctgtttc cggacaaaat ccatatgtaa 5221 accgttgaga aaattttata tctttacggc tatagttttg agagaaaggc ggattgctaa 5281 ttatccggtc aaaacgcatc agttcgccgt cgttgatatg taagggatta agtaaagtat 5341 cttcattttg aatatctgca tccctaatgc cgtgcagcag catattaatc ttacaaattg 5401 cccacacgcc gccgttatta tcctgaccac acagatgtaa attattggca ttatcaccac 5461 attcttccac atattgcttc gcctgaataa gcattccccc agaaccacaa caaggatcat 5521 aaacagacat tcccgcttgt ggtttcaaca ggcgcaccat caactgcaca acttcacgcg 5581 gcgtataaaa ttcgcctcct tttttcccag cagaatcagc gaattcctta atcaaatatt 5641 cataagctgc acccaacaaa tcaggaaaga caaaatcttc acttcgcaga cgatacttat 5701 taaagtgttg aatcagttca cgcaatttag catcaggaat acggcttttt cccacctgac 5761 ggttaaaatc aatatgacca agtacacccg ccagcgcttg attgttttcc tccagtgctg 5821 ctaaagcttt attgagtcca tcagcgacat ttttatgtag ttcatcacgg atataaatcc 5881 accgtgcctt ctctgacaca aaaaaagttt ctctgtaact catggggttt tctgctctca 5941 ttttcgcttc ttcaaaagag cgtccccttt gcaaattctg ctgtaaaatc tgttcatact 6001 gctgttcaaa tacatccgaa gcacgcttga ggaaaagcat cccaaagatg tactctttaa 6061 cagaagaact caggagtcag gagccagaag tcagaatgaa ttctgtgcga ctggcggatg 6121 agtatggggt ctaaattccc actgtctaca cggtgtctac aatcggttgg ggtctaaatc 6181 cccatctgat tgattctgac ttctgaattc tgacttctga attctttttt taaactcaga 6241 agcatccatc ttgccccgca aaatgtcagc agcagaaaat aaatggcgtt ctaattgtgg 6301 tagggtgagt ttacccataa ttgatgcttt ttgctcttat tcaatgtttt 
gtcagtatat 6361 ctctagccgc gttctccatg cgtgctgtaa cggtacacca cagaacattg ccctaaatag 6421 ccactcttgc ttaccaaagc ttggcaaaca atacaattgc ttctgaagga gttggttttc 6481 tcgaaagccc gtgacgagga ttgaactcgt gacctcaccc ttaccaaggg tgtgctctac 6541 cactgagcca cacgggcaaa tgaatcttaa aattgtgagt gctaaatgct gagtaaatat 6601 ttcgttttct gatttactca acgctcagca ctcacaactg agatttgaga tgggccgagc 6661 tagattcgaa ctagcgtagg cgttagccag cggatttaca gtccgcctcc tttagccact 6721 cggacatcga cccatgttgt ccacgacttc taatattatc acaatagttt gaaattgcaa 6781 ggtctttttg aaaaaaaatc atagggcaca ggtgacaggt cattacgcca tcccataccc 6841 cacaacctgt acgctagctt acgagttcac ctgtttgttg cagtgagtgc aagcggcggt 6901 aaattccttt gtggcgcaag agttcatcgt ggctaccgat ttctacaatc ttgcccttat 6961 ccagaaccac aattttatct gcttcccgta ctgtactcag acggtgggca atgatgattg 7021 tagtacgggt tcccagaatt gagcgcattg ctagctgaat agaacgctct gactcgtagt 7081 ctaagctgga agtcgcttcg tcaaaaatca gcacgtctgg ttccacaacc aatgccctag 7141 caattcctaa gcgttgtctt tgtccaccag acaaccttac gcccctctca ccgactactg 7201 tgtaatagcc ttgaggtagc tgctgtatga cttcatctag tctggagatt ctacaggctt 7261 cttgaacctg ctcaaaagtg gcatcaggct tcccgtattt aaggttatct agcaaggttc 7321 cgttgaaaac atcaacttct tggtgaacta tagctaatct tcgtctatag tttccaacat 7381 ccagactgcg aatatcttga ccgtcaatta gaatttgacc ttgttgaggt tcaaagtact 7441 gcaaaagcag cttgactaaa gtagacttac cagaaccaga acgacccact aatgctactg 7501 tttgatatgg ctctatcaag agattgatat cttccaaaac tggacgttcg gattgatatc 7561 caaaactgac gtgtgaaaac tcgactttcc ctgtaaattg gtaaggatta tttgcaatgt 7621 ttcgctcttc taaaagtcca actccatcga ctctgattgg gactttgagg aactcgtgaa 7681 accgcaccat tgaaggatag cgacgggcaa aaacctccgc taggacgctg ataggttcca 7741 actcggcata agccatactg gaaagagtta aagtcatgac aaagtgacct aaggaaattt 7801 gaccttttac tgtctccagc aacgtcaaac ctagcactgt aaaaacacaa aactgaatca 7861 tagttttttg ccaggtattg agcttgacat aacctttgtg gatacgagtt tcagtcaccc 7921 tcagttcacg aatcaagcgt cgtttttgcc gttgtagttc ttttgcttca gcagcaaatg 7981 ctttaactgt tttgatattg gagatgattt cggaagtccg actttcggta tcttccatgt 
8041 atttatccag gcggttttcg cgccaaatca attgctgcaa ttttcttaag gtaaagctta 8101 ggataatgac aaaggaaatg agatagagaa ccgcaactcg ccagtcaaca aaccagatga 8161 acacaaaaat tcccaatacg cggaacagtt tgggaatcaa ctgtccagaa atctcaggat 8221 agctccaggt gtggttggtg acacctcttg cgactcgtcc ggcaatgcgt ccggggttgt 8281 tttcatcata aaattccagt ggtagagtga gaattttctc aatcgctttt tggttttggt 8341 cgcgacgtgc ttttaaagtt atatcccagt gaaaccaagt cgttagccaa ggctgagttg 8401 gtgctttcaa cacggtgaca ataaaaatga aactcagcaa tacacccaac tccagaggtt 8461 tattgactgg gttattggtt aggtttgaga ttgtggcgat cgctccttga agtggtttgt 8521 ctaatggttg attagacaaa acgtttaaaa tttgcccaat cccataaggg acaaccaaat 8581 caataatctc gtaaacgctg cttgctgcga tactgaaaat acttagcgac cagtagggac 8641 ggaaatagtt cagaatatct cgaaaatttg ccatgatgca cactgccctt ggagcgctag 8701 cctataactc ctggaaaata atactacatt tatactacac agcagtcaat agttaagaat 8761 ggtaaatcat cctttagaag gagttgagtt gcaaggagtt ataccatttc acgatctggc 8821 ctgatacaaa tgattcatct ctaattcttt cctcctgctc cctgtttcct gttccctgtt 8881 ccctattccc tgttccctat tccctgttcc ctattccctg ttccctattc cctattccct 8941 gttccctgtt ccctgttccc tgttccctat tccctattcc ctattccctc tatttcttca 9001 tcggtgacta ttgggcaaaa aattccaagc gatagcgata gagtgcaatt ggtcgagaaa 9061 gagcggcaaa gtaatttgga tcttgaggcg acaaataaac agctgtgatt tgatctgcct 9121 ctatggcagg atttgattga tcaagttttc gatatgcagt ggtagattct acagtgttga 9181 aataaggacg cccaccgcct ttaaagagtt gttgaaacac ttctgttgtc agaaatttgc 9241 catctggtgt tgtttctgtc gcacgtgctg taacaataga aacaagttga cgctcaccac 9301 gtaaaaatgt aatctgacga ttgggagaat ttggatcgac tttgactgat aacaccgtgt 9361 cgcctaaata cgaccgtgct aaattcaaac cattaaaagc tctatccgcc acgactgctg 9421 atttgctact tgttcgaggg atgattttca atccagaatg aggcgttgtt tcccgaacaa 9481 atcgcacgtc aaaactgaca ggttgattga gatattgacg attaccctca aatcctggtg 9541 tgacgatatc aggagctaag ggtgcaacca aatctatcag cgtacttttc acctgccaag 9601 aacccatcat ccagttagga taaaccaaat ctcctactgc aggtcgcacg gaagttaatt 9661 tttcccattc aggaaaatgt gccaaacgtt cagataactc tcctgcttga gcctcagcat 9721 
gaaatagcag gaatacaaaa atcaaacaaa aactccaaaa tctattcata gcatacattt 9781 tcacaatatt ttgcagatat agaaattttc aattgagtca taactcattc aatcaaaaat 9841 tgtcttgcta ttctttcatt aaatttaact tagatatcga gcaattaaat atttagccca 9901 atttttttgt tattataaaa tttatataac aattctattt gacttgtcag actgttcatt 9961 atggtcgtca tttgtcagta gtcatttttg actgcaatcg actcttgaaa ctccatcctt 10021 ttacaggtag aactatccta agttctttca tcaaagcaag tataaaacat agttgctaaa 10081 aaagatatca gggcgcgagt ttgtatgagt tgactctttt atagttcttg cacggatgag 10141 tatatctagt tcttttacag atttttcctt ggctgaattg tttcaactga ttgatcaagg 10201 gcgaaaatct ggttgtttaa cggtttgtac tttaccagac cttcataccc cgggttctaa 10261 atcccattac tactacattt ggtttcggct aggctgtgtt gttgctgcag ctaatcgctt 10321 aaacggtcag agtttaactc acaatatgac acaacggggc tgggtaaacc agcaaaccat 10381 tgaacaagtt tggacccaaa caccagcagc actaccgctt ggattattat taaaaactca 10441 aggagtatta agtactgaac agctaaattt attattcgcc agccaattac accaggttcg 10501 agagcttttt gaaatccaaa agggagtttt taaactagat actaaggctg atttgccttt 10561 gcaggagatg acaggactca gcttgagaac actagaagtg gctctgatgg ctgtcagagg 10621 gttaaaaaac tggaacatgc tcgctgaggt actcccggat gtgagttctg gaattaggag 10681 taaaactaat gacaaacccc agatccactt gaatacatta gagtggcaag tgtgggaatt 10741 ggctgatggt agtgtttctt taagtgcgat cacatataaa ctcaaccaat caataaccat 10801 agttcagcaa gctgctttcc ggctaatgct tgtgggtttg gtagaagaag tttcgttggc 10861 agagtctacg ctgagtctgg aaaattatcc catgaattct aactcagaca attcctttac 10921 ttctggattc aaaaaatcca aacccttaca aacgcgtaat gtaagtgcct cgtttttaca 10981 aaatctggtt ggttttttaa aaagctaaat ttcacaagct gctttgacac ttgtttgaag 11041 agcgatagct agcccctaag ggggcgctgc gcaaacgctg ctgcgatcct ttgcagatcg 11101 ctctacgcca ttttggattc aatggctgaa aactttattt tgcttgaaag ctatccctaa 11161 ttggcaggaa cagcttagag attttgtaga tgagttaata tctcatcaaa atgataagca 11221 atgatgagag gtgatgagga attaagcgac agcttctagc cccattcatc atcagctaac 11281 ttgctgaatt tacactttct gccgcttcca actgtaaatt ctgccctaat tttggttcag 11341 tacctgcaag tatacgccca atattgctac tgtgtcgcca aatgacatac 
aaaccagcaa 11401 cagcaccaaa gaggatgtaa gctaaaggtt gatgcagaag tatcataaaa acagaaacgc 11461 caacagctcc agcaatcgaa ctcaaagaga cgatccgcga tatcgccacg acaatagcaa 11521 atacgccaaa cgtcgctaaa ccaacctgcc aattcatcgc cagtaaaata cctaaaccag 11581 tagcaacaga tttaccacca gcaaaaccca gaaaaatcga tttactatgt ccaagaatgg 11641 cagccaagcc agataaaatc accatccaag gttcccacaa ttgcggatta acttctggag 11701 gaataagatt ttgactgggg gcaaagttga acaaccagta aactagggcg atcgccaaca 11761 ctcccttcaa gcaatctatc actaaaacaa aagcccctgg tccttttccc agagttctca 11821 gcacattagt cgccccggtt gaaccggaac caacttccct cagatcaata cctttcaatc 11881 gcttagctac tgtgtatcca gtgggtgtag aacccactaa ataagctaac accaaaactg 11941 ctccgcacaa acttaaccaa atagccataa acaattcaaa attcgctcat tcaaaattcg 12001 ctcattcaaa aaagtcaaga gtcattcttc ttgacttttg actaagttaa tagtcatcat 12061 cataatttct catcattttg gggtcaggag caaaggctaa ccacaaagga aactgcaaga 12121 gtgccaaact aatttgctct tcagaatcat caatcaaaat caaaggtaat tgatcggctt 12181 ttacgagtcg gtctgctttt tgtgctaacg cttctgcagt ctcaaataat accatgcctc 12241 tgtctggtcc aaaatctgga cgacctattc ctaagcagtc ctgtaaacct cgtcgccatt 12301 cacccaaacg ttctggtgta tttgctaata ctagagtgcg gaggcgatcg ccatacaatt 12361 ggtgtaatat cgaaatgact gctgaggcaa taagaatatt ctgcaaccga ctgcccgtcg 12421 tccgcaaagc acctcgtcct ccttggctaa aaaaccaatt agaaactcgt tctgcatgca 12481 ctggttcaaa ggtgcggcgc agttgccaag gtggaccata ataatctggt ttcgtgcgat 12541 actggtcaat caaacggcgg acttgttttg ttgtatcttg aaactgctgt tgagcaaatt 12601 gtggtgttgc gctttgcgct tgggcttcct taacctcttt aacctctggc ttttctggtt 12661 ctggtttcgg ttttggcacg agttgcaact gttctgctgc tgctgccaaa tcttgtaaac 12721 tccctgtgag atagtcttta aaaccttgta cccgaatcgc taagtcttga gaagtacctg 12781 caaaagtggt tcgcatctca ttgcgaatgc gttcttgacg gcgttccaac tgttctacag 12841 taatttgcag cgctgcttta cgttgttcta gcccagaaag cgcctcttgc accaattgtg 12901 tcatcgtcat ttgagtttcg cccaactggc tgtaaagagt tttgtaagag gcttgcaaat 12961 ttgctatttc ttctttcagc gcttcttttt ggctttgcaa cagtgtaact tgttgtgcga 13021 gttcctcttg ttgacgataa ttttgtgatt 
ttgattctac atctgatccc agcgcggtaa 13081 tttcgtcttc taaaagcgtc acaacctcga cagttgactc tggtgttaac tcatctataa 13141 gatttagttc ttctgcgtct gaatcagaca cagcagaact cgactgctca ttttgtgtgt 13201 caactttgga gttttgcgct gctatttcca ccactgactc aactttagag ttttctggtt 13261 gttttacttt gagttgttct ttgtggttca acgactcatc agttggttgt ggtgtttgaa 13321 attcctctgg gttcataaac aacagtgtca ttcctatgac gcgaataaat agctttcagc 13381 aatatataat ttacttgcta tcaaagtctt aagtccaaaa tagtcctaag caatgagtcc 13441 tcagttttga gtcatttgca ctcagcactt ggaactaaat gcgcggacag cgttgttcta 13501 agcaagtttt gagagtctta ggatcaaata aaatcggcag aaagtgaata cttttaattt 13561 ctttaaagta aaacagaata ggaaccggat tccagaatat gcgccagttt tgccattccc 13621 ggtagggaaa gcgtcgaatt aatttttcac ctctgtaaat atccaaatcg gtaggagtaa 13681 ataacaaacg cagtgtgaca gcttgaaaca tgagaaacaa gccaaaaaga cccatcaagc 13741 ctcccacact cggttgcaca atcagcattg ggactgctgc aaccaccaac accaagggta 13801 tagtgtaact aggcttgagc tcgacagtgg tagatgccgg gttagacgta tatgaagtcg 13861 tcacagtcaa aactcctgct tggtgtgagt aaacattatt tctattctag gagctagtgt 13921 atagactaag gactaagaac tgtcacttca attcagtcat cagtcatcag tgaacaaaaa 13981 actgatacag ggaacaggga acgcttaaca gggaacaggg aactgttaat cgtctataac 14041 ccttgcaatg atgaacttcc agtcccttgg aacataaccc aagaaagaaa gaaattgctt 14101 ataaatataa tcaataaggc cgtgacaaca gcagttgtgg tagattgtcc gactcctttt 14161 gctcctcccg ttgtcgttaa accccaactg cacccaataa tggctattaa aattccaaag 14221 caacacgcct taatcatggc gctcgcaatg tcccaaatgc caagaaagtt gcgggctgag 14281 tctagaaaga cattctccga cagattatat atgtttattg caataattaa tcccccgaac 14341 atacctgtta ccagagataa gagcgttaaa attggtaaca ttaagcagca cgcgagtacg 14401 cgagggataa ccaggtaatc aattggatcg gtttttaaca tcaaaagggc atcaatttgt 14461 tcagtaactc gcattgtgcc gatttctgct gcaaatgccg aaccaactcg tcctgctaaa 14521 atgacagctg tcaaaacagg cgagagttcg cgtgttagcg ctaccgccag cactccgcca 14581 acggtgcttc ccgcgccaaa gttaataaac tcccgcgcca cctgaattgt aaatactgcg 14641 ccaacaaaaa cagccgttac tagggcaata aacagcgaat ctggaccaac tgctgccatt 14701 tgctctaaag 
tattgcgccg atggattctc ccccttagca ggtgaactaa gacttgtcca 14761 cccaagaaaa ctgctgccag caatcgctgg ctccatgctc ctaaactaga tttggatgtc 14821 gtttcactca atgttgacaa gctaactctc taactgccgt catcatagcg aattccaaag 14881 ctagttatta gtcattagtc attagtcatt ggtcattggt cattagtgag ccagcgcggt 14941 cttggggagc cagtgcgttg cggagagaag tttgagcgga ggtttcctcc gatcaaagct 15001 tcggagggtt cccccgacgg tgccatctgg cgtggtttcc cccatgagcg actggcaaac 15061 ccgtaagggt cattggtcat tagtcaaaaa ttattctccc ctgctcgggt gctccctcac 15121 tccccgaatc ccccaatcct ctactttcgg gtgaaagatt aacttagttt aaagcgaatc 15181 tcagaaattt aagctgaagc cacgcagcgc tgtcactgga acaaaagctg caacgacatg 15241 cttagaggtt gatttttcca ggtttgggtt gtgtcaaact ctgtttgcat ttagcaacat 15301 atcctaagga gattgtgtat atactttaat aaaaacttaa gatttttgaa ctatcgtctg 15361 aggcagggcg atcgcgctta ttgtagaagg tgagaactga ccagacaaaa cttgtcagcc 15421 ttgtaaagtt atggagtttt tccaaatgac tgtgatgatg aactttctgc gatcactgct 15481 gctatcgatt atttttagct tcgtcgctcc catgtttttg tttggtggcg tattattcgc 15541 cttatccgtc gctggctata ttcctggctt acaagaagtt acagaagttg tttctagtct 15601 gattacggag tttctctccg tatttggcag tggcactcct gttggagggt tattcgttat 15661 ctgttcaact tgcagctttg tgggagcgct atttgacact tatgcttatt atcggtacca 15721 aatattacgt taaataaatt ctgttctact cagagcgata tttatatttt gttctctatg 15781 ttgatggatt aaaaattaga cttctcaatc attaaagggc tggtagtgct accttgctca 15841 caattggcat caatcatcca cgctttgtat tgaatgccgc aacactgagg cgtgtacata 15901 gtattttaag tacagcattt catgactatg tgttcaaaat gctggaattt tatcttgggt 15961 ggcactatgg ttcagtcagc gtacaaaaac tctagcgatt taccctattg acagtggact 16021 ctactatgtg acatcgcttg aatatgtagg cttatccttc acattcaagg aaaaaatact 16081 tatgctattt tttttgcttg aagtattttt gacatttttt tctttatgta agcatcatag 16141 caggtcacga actacagaag ttattaaatt tttattaaaa ttttaatacg cgcaacagtt 16201 gctggttttt cttaacatag ggaaagcaag tggtttgcgc gcttaacgct ttatgattaa 16261 nnnnnnnnnn gtttttttac tgctcatggt ttggcctttt aagcccaatt ttcggaaaca 16321 aattgctcgt attgaaataa caggtgccat tgccggtgcg actcgcaagc gagtactaga 
16381 agcgctaaaa actttagaag aaagaaaatt tccggcgtta ctgctacgga tagacagtcc 16441 tggtgggaca gtaggagatt ctcaagaaat ctacagtgct ttgaagcggt tgcgcgagaa 16501 agtaaaaatt gttgctagtt ttggtaacat ttctgcttct ggtggagtct acgtaggcat 16561 gggagcgcaa catgtcatgg ctaacccagg tacgattaca ggtagtattg gtgtcatttt 16621 gcgtggtaat aacttggaac gtctgttgca aaaagtaggt gtttccttta aggttataaa 16681 gtctggtcct tacaaagaca ttttggcgtt tgatcgggaa ctgacagaac cggaacaaag 16741 tattttgcaa gaattaattg acacaagtta ccagcagttt gttcagacag tagctgacgc 16801 acgttccttg gcggtagaaa ctgtgagaag tttcgcagat ggtcgaattt tcactggaca 16861 gcaagcctta gagttaggta tcgtagatcg tctgggaacg gaggaagatg ctcggcgttg 16921 ggctgcggaa ctggctggtc ttgaccccga aaaaactccc gtttacacct ttgaagaacc 16981 caaacctctg ttgagtcgca ttttgccagg aagccgtcaa gcttcttcag gacttggggc 17041 tggtcttggt tgggtcgaat tcgaggtgtc tactagtggt ttacccttat ggttgtatag 17101 accataaatc gtgatttgtt ttttgttatt ggtcgtttga cgaatgacca ttcataataa 17161 ggaggatttt tgagtggagt ggcgaatgcg ggctattcgc ggagcaacga ctgtaccaga 17221 aaatacggtt gaagcaatgc gagaagcagt aatggaatta ctggatgaac tagaaaaacg 17281 gaatcaattg catccaacag atatcattag tgtgactttt tccgttacac acgacttaaa 17341 tgctactttc cctgcagcaa ttgcacgttc acgcccctac tgggacagtg tacctatgtt 17401 agatgtacag gaaatggaag ttgatggcag cttaaaacgt tgcatccggt ttttagttca 17461 cgcctatctc ccggtctcca ccccaatcta ccatgtctat ttgcgtcagg cagcccagtt 17521 gcgtcctgat tggaatgtgc cccagctatt gtagtgttat tgacagttta tagcattatg 17581 catattcgct atagtcaagc aggtgtaacg aatcaaaatc aacacagcct gctcaattct 17641 tttttgcccc tgatccgaca gtaaaagtct ggtatgggct tcatttggtt gctgttctcg 17701 acccttattt ttaaaaatca caaacaattt ggagaccatt aaataacctt cgtgttgatt 17761 tgtgcttttg tcggcactag tttagcagca ttaaatctct agcaactact gcacagtttg 17821 tgaagtttaa atacgggttt cggaattttc taaagttgac ttggtaaaaa gtatatttta 17881 gtagaaactg cctttccaaa ccttatcccc cccagcggag ggtgaggtaa ggtgatacgt 17941 atgaagacaa cacgcgtata attttgctca caaagatttt tgaagtcgtc gtcagtaagt 18001 cagtatagaa tcatacataa attctttttt ctgagttcta 
ttttcaagtc gctattgctg 18061 agaaaatctc acaaaccttg ccataaaaat gtctttcccc tacaccccca ccccctaagc 18121 cctgcgggca cgctgcgcgt tcgccctctg ggcgtgcgct ttgcgcttac gggggaattc 18181 aaaattcaaa attcaaaatt caaaattaat acttttgact tgttttgccc ctacacccat 18241 tttcaagcta aaatgagtta atgcaaccgt caggtatgtc gtaacatgtt aaccaagtta 18301 caaggggagc gttaccaggt tgttcaagtt ttaggtcaga gtctattcgg tcaaacctac 18361 ttggttcaag atacccacct cccggatcat cccacctgtg tcgtcaaaca ttttttacct 18421 agcagtcaat gtcctattcc agtggaaata cgcagacggc tatttacccg agaagtagaa 18481 gccctgaaaa aactggacaa ctatgacctg gttcctcatc ttttagctca ttttgaagac 18541 aatcttgagt tttacttggt gcaacagttc attgaagggc atcctctgac tgcagaattg 18601 tcgccgggtg attcctggtc ccaaagcaag gtttttcaac tactatacga ggtcttgagc 18661 atcctgaatt ttgtccacag ctacggactc atccacagag atgtcaagcc cagtaatatc 18721 cttagacgaa agcaagacaa tcggttagtc ctgattgatt ttggtgctgt caagccaatt 18781 tggaatcaat tgattctaaa tcaagcaaaa acttcaaatt tcattcctct tgaatataca 18841 accattgcga tcggtacgcc gggctatatg cctcatgagc aacaacgagg caaaccacgt 18901 cctaatagtg atatttatgc cctaggtatg attgctattc aagcactaac aggagtccat 18961 ccgacacaat taccagaaga ccggaataca aacgagattc tttggcaaga gttagctcag 19021 gttaatgatg agctagccct aatactcaat aagatggtgt gttaccactt tcaagaccga 19081 tataactcgg caaaagaagc attagaagcg cttgcgccac tcactcatct ttacacatca 19141 acacaagagt gggctccgac tttaatacca caaaacgtta catttgacaa tcaaaactcc 19201 ttacctgagc agaatgttaa acaagttttt gggaacaacc catcagcacc actttttaac 19261 aaccaaacaa gcgatatttc agaactagag atagtggagt tactgagcaa actctacaca 19321 ccaacacagg agtcagcccc tacttcaaga ctagacaatg accagactat ttctatttct 19381 cctggaaaac tgactttact cattggctta atgttgggcg tagtgtctag tctgattttc 19441 atagttttca gctactggtc tgtgcaagtc attggtccta ttcctcaaat tcaaaattct 19501 tcatctgaac caccagaggg tttgcgttaa tcaacgcgag gttatacagc agaactcaga 19561 actcagaatt cagaactcag aattcagaat tcagaactca gaactcagaa ctcagaactc 19621 agaactcaga agtaaaaatg gctttctgcc tgcacgccgg aagctttagt cgtgaggcgt 19681 gcggctgttt cctacgcgca 
gcgtatccct ttgggactca gcaatcgcag aatttctatt 19741 ttaagttttt tttaggttga gctacttcag tttttatccc tgttccctgt tccctgttcc 19801 ctgttccctg ttccctagtc atttttagtt gatttgtggt gatgaaaccg aattcctaaa 19861 aaaatgtcat aaacaaattc aaaaaagtct ttcttttgca gaagacctat agattgatag 19921 cagccttttt gatctaaaag aggcttcatg agataatatg tgggctggta attataccaa 19981 ggaatggaag gccataaatg atgaattaaa tgataattct gacccataat gaggagattc 20041 agaattggat ggggatagac gcgagcattt ttccagcgat cgcgttccac aaaaggacga 20101 tgaggaaaat aatcaaaaaa taaacccagt gctatcccta cgacaaatgc tggaatgaac 20161 caaaaattga gaatgtaacc taaaaagtga tactgaaccg agatataaac gattgaaatc 20221 acaatcaagc gactgataaa ccattccagt agctcatatt tacgccaaag ttgccgctta 20281 aagaaaaata cctcgtggta taaaaagcga accgcaatca gccaaagtgg accacccgtg 20341 gaaacataat gatcagggtc gtcttctgga tggttcacat gtccatgatg ctgcaaatgt 20401 acccgtgtaa atacgggaaa agcaaaagct agcatcaagg cactgccatg tcctaacatg 20461 gcgttcatga ctcggttacg atgggcagat tgatggcagg cgtcgtgaat caccgttcct 20521 gcaatgtgca aagcaagagt atttgcagca aaacataacc agtgtggcca ttgccaaacc 20581 cagtaaccaa aattggataa tacaagaatt gccacagctg ctaaaaacat cagcagcgtc 20641 ggattaaaat cacccggagg cgctaaaaat tcctttggcg ggattgtcag tggcttcttt 20701 gcctccgacg tcagcattat gaactccttg ttgctttaac gagagattgt atttagatga 20761 tgccactttt ctataaggaa agtgaagttt tgcaatgcaa aatgacggat aataaacttg 20821 tgggtcacaa gatttatcta cttctttaca aatcagccgc ataatagcgg tatgatttca 20881 acatcaaacg agcttgtgct gagtggctca atcgaatgaa gatcagggga taaggggaca 20941 aggaaacaaa aagacaaggg gaagacactt gtgtgtaggg gttaagcgtt aagtaaagta 21001 tctgtcgcca aggcaacaag aagaataagt ttcttaactt tctgatgacc tcacctcccc 21061 attacctcac tcatctaccc agctttggtg attgactcag caacaacaga caacaggtgc 21121 tccctgattt aggataattc ccagagtagc tccctcgatt tagtttttat gcaattaaaa 21181 gattctgtac gccgaacaaa aattgtcgct acaattggtc ctgcaaccag cagcccagaa 21241 acgctcaaag ctttaattga ggctggtgca acaacactgc ggctaaactt ctctcacggt 21301 tcccatgccg accatcagcg taatattcgc ctgattcggc aaaccgcctt tgaactaaat 21361 
caaccggtag ctatccttca agacttgcaa ggacccaaaa ttcgtttggg aaagtttgat 21421 aacggatcta tagtcgtggc gaagggcgat cgcttcacct taaccaatcg tcctgttgtt 21481 ggtacacagg acattagctg cgtcacgtac gattatttag cagaagaagt ccccgcagga 21541 gcaaaaattc ttcttgatga tgggcgtgta gaaatggtcg tagaggagat taaccgcgac 21601 aaaggagact tgcactgtcg cgtgactgta ggaggggtac tttcaaacaa taaaggggtg 21661 aactttcccg gagtttacct atctgtcaaa gcaatgaccg ataaagaccg agaagatctc 21721 atgttcggtc tggatcaggg cgtggactgg gtcgcacttt cctttgtccg caacccgcaa 21781 gacatgatag aaattaaaga actcatttct agtacgggta aacgagtacc agttattgcc 21841 aaaattgaga agcacgaagc gatagaacaa atggaagcag ttctagcttt gtgtgatggc 21901 gttatggtgg caagaggtga cttgggcgta gaattaccag cagaggatgt cccagtccta 21961 caaaagcgac tgattgcaac cgccaaccgc ttgggaattc ccatcatcac cgccacccag 22021 atgttagaca gcatggttaa caacccccgt ccgactcgtg cggaagtgtc cgatgtcgca 22081 aatgcgattt tagacggtac agatgcggtg atgctctcga atgaaaccgc tgtcggtaaa 22141 ttcccggtgg aagctgtagc aacgatggca cgaattgccg aacgtatgga acaagaggtg 22201 tggctgaaca caaacgccag ccaggtaaga gacaccaaac attctattcc taacgccatc 22261 agtcaagctg tcggtcaaat tgcagaacag ctaggagcag cagcaattat gaccttaacg 22321 caaacaggag caacagcccg gaatgtctcc aaatttcgtc ccaaaacacc aattctagca 22381 gtaacacctc atgtgaatgt cgcacgacag ttacaacttg tgtggggagt caagccgttg 22441 ttgatgttag aacttccttc tactggtcag acattccaag ccgctattaa cgtggcccaa 22501 gaaagagaac tcttgtctca aggggatttg gttgtgatga ccgctgggac tctccaaggg 22561 atttctggat caacagattt gattaaagtt gaagttgtga cggcggtact cggtcaggga 22621 attggactgg gacaaggttc tgtgagtggt cgcgcacgcg tcgtgtacac tggcatggat 22681 gctagtaact ttaattacgg agatattttg gttgcctcag gtacaagtgc tgattttgtt 22741 gaggcaattc gtaaagctgg cggtattatt actgaagagg aaagtcttaa tagtcacgct 22801 gctgtgattg gcttacgtct tggtgtgcca gtgatcgttg gtgtgaaaaa ggcaactcaa 22861 gtgattcggg atggtacgat tttaacaatg gatatgcaac gcggtttggt ttactcgggt 22921 gcagtgggga caccgtagga aatatcaagt atcatttgca gataaagcga ttcccctcct 22981 ccactgctat attggttttt aaataatcct gcacaagagc gcaaccataa 
gccagtgcat 23041 ctaggttgag aattcgcggc aaattccaca gaatgagtgt gttgtcatca ccccctgagg 23101 caagtattgt cccgtcgcgg ctgatggcaa tatccctaat cgctgcggta tgtcctctga 23161 gggttgttag ttccgtacca aactcctgcg gagacgctgt ctcctgcaca gaggctgagt 23221 ccagaggact cgcttcgcga acgccaacgc gaacaagctt ccaaagtttg acggtagcat 23281 ccacgctgcc agaagcaact atttttccgt cgggagtaaa cgcaactccc caaatcgcag 23341 ctgtatgacc tttgagagtt ttcagcaact taccatcaag tgtccataac ttaacggtat 23401 tgtcaccact tccagtagcc accattttac cgtcttggct aaaagcaact ctccatactg 23461 cagcagtgtg tccgacaagg gttttgaata acttcccatc aagcgtccac agcttagcag 23521 taccatcgcc actggctgaa ccaaccagcc gaccatcggg actaaatacg acatgccaca 23581 cttccgcctg atgtcctttg agaacctgcg gtacagggcg atctctacgc catatctgaa 23641 caattttgtc aacatttgcc attgcgatcg cttgtcctga gggactcaag actcctgcta 23701 agagtttacc gagaactttg taagtagcaa ccaaagtgcc gtctggcttc ttgagtttta 23761 cagtattatt atcagcggca agagcaatta acttgcgatc gtcactgaat gaagcctcga 23821 agactatgct cccagattca gtgaaagttt tgagcaattt accttggcga ctccaaagtt 23881 tagcggtgtt ttcgtgacca gccgtggcaa tggtcgaact gtcggaggtg atagctattg 23941 accaaatccc gccgttatgg gcaatgatag acttttggaa tgggttctct ttctgccaga 24001 gtctgacaac gttttctgca cctgccgaag caataaagct gctgtcaggg ctaaaagtca 24061 ctccccaaac cccggcactg tgtcttctaa gcgttctcag ttccgtacca tcaatgttcc 24121 aaagtttaat cgttttgtcg agactggcgg tggcaatagt ctgaccatcg ggactaaaag 24181 cgactcccca aaccccggca ctgtgacctt taagagtttt atactggcgg tagcttccat 24241 cagtgctgtc tcgctgccaa agtttaacag ttgtgtcttc actcgctgaa gcaatcgtct 24301 gactgtcagg gctgaaagct actcctacaa cccaaccagt gtgaccttgg agagtttgta 24361 ggggcttggt gtatgctgtg cccgaaccgt ctcgtttcca aagttttact gtcttgtcta 24421 cacttgtggc agcaacgatt tgaccatcag gactccatgc gactccccag accgaagcag 24481 tgtttttgca gtctacgttt ggctttgatt gcttctatga gtgcatccaa tgtgcgattt 24541 gaggcaaaca gtccttcaga agaagataca agcgcttgaa tttcactgct tctggcttga 24601 ctttcgcttt tttgggcttg gcggtacaaa atgaaagtcc ctagccccaa actgctagaa 24661 atgaccaacc cgatacacac tgcaactaac 
agaaaccttt gtaatttagc cgttcttttc 24721 tcttgtgcta gtcgtgcttc tacctctttg gctcgtgcgg cttccaatct ttgctgagtt 24781 tctcgacgtt ccagttcctg gctggctgct aaaaatcgat agtccaaatc gctgaggctt 24841 ttgcgcccac tccagtctaa gacttcttgt aatgctcgcc cccgcagcag acgtgattca 24901 tcttgataac ccgatatgac ccatgcatta aatgcctgcg agtaaggacg caaattatct 24961 aactgtctga gtacccattc agaattgaag acactgcggt agataggatt tttaattttg 25021 agatagccgt cgtgtttctc gaccaatccc gatagtaaaa gttctgtctg ttctcgactg 25081 tcatcagtag gtacaccaga tgtttgattc tcttctgctt gcaacacctg ctgataaaga 25141 cctaacaacc gccctgcgcg ctgttcgttg aaaaggaggc gatcgcgaat cgtgcacagg 25201 tgttccggtt catcttttgc ttcccagtgt tggataattt gctgtagcac aaattgttct 25261 acccaatatg atgcggtttc tggaggtaag acaatttttc tgtttgatgt ctctaaggca 25321 gtatggacaa ttaactgaca gagtttttga gtcagaaatg gttgtccccc actccagtaa 25381 atgatttccc gcagcacatc ttcgctttgg ctgacgactt cccctaaccc ttttagcagt 25441 ggtgttgctt catgcagttg aaaaccatat aactcaatcg ctgttccaat attaaacggt 25501 gtccggcgtt tgtcagcaat gagatcagat ggactagcta ctccaaacag tacaaatccc 25561 agacgttgga atttcgggtc atgtgcctgc tgattatagc aatgacgaat ccaggcaaaa 25621 aagtcactga caggaaaact caaactcaac agactgtcaa tttcatcaat aaagatgaaa 25681 atgcgttcgc tttgaacatt tggcagcaaa actgcttcaa caaacagatc tagtttttgc 25741 actggggaaa tacctgcttg catatcccac cattgcttaa acttgacgtg ttctgccaga 25801 tttaagtcgt aaaacaagct gaggatgata cctttatacc attgttctgg tgtggtattc 25861 tcgctaccca atcgggtgac atccaagtaa gcacagctat gcccttcttc tctgagacga 25921 taactcgtgc gttgtaataa ggatgacttg cccatctggc gagaatttaa gacgtaacag 25981 aaatcaccag ctttcaggct ggcataaagt ttttcgtctg cctgacggac aacatatgtg 26041 ggatcgtcac tgtgaaggct gccaccaact tggtatctca tatttaagtg tgattaaaat 26101 tagctttttc aagatgaagc gatcgcgaaa tctctgccat cttgaaagcc aaaattgctg 26161 taagcgattc ctacggagag ctacgcttaa cgcacaagct ctataaagaa agcgcaaaac 26221 ttccattttc gtctggtgcc tgtaactatt atctcactat gcaaggaaat ctgcttttgt 26281 attctcttaa ctgtcttaac tgtcttaact gtcttaactg tctaattgta ggttattaaa 26341 ttttttttaa 
ctgtctgctt aactgtctta actgtccttg cattcttttt agaaagctga 26401 aaatatatca aaggatgcaa atcttgcatt cttgttcaca ttactttgaa tcaccactga 26461 gagaaatgat caaacggatt ttagcaacaa tagcgttgat tattgcagta actttctttg 26521 cccctcctgc attcgcttcc atcaacagtt ctaacccaga ttcaactgtt attacaggaa 26581 aggatgatat agttaacgca agtctgccac ttttcattta ccaaacagct aaatcttcag 26641 ataatacttg gggaagagta gtacgtttaa gtgggtttgg aaattttttt gacattggaa 26701 ttgatgagca aggaaacttt ttcattaact cacctgaaga tacaaaaacc tcacatgctc 26761 taaaaatttc cccaactggc gttgtgacaa ttcccggaaa ataatagtaa atgattgcgg 26821 cgatagctaa atgtttagtc cttaattttg gttagttcta aacttctgcg ccagcatcat 26881 cttaccatca aggggatttg taaccttcaa atctccttga ttcttgaatt ttgttaaaca 26941 attgtcgata gttgttttgc aaacaggagc gcggtaaagt tcgcaacgca gtaggatgcg 27001 atcgcaaggg aaagcagacg cccttggcgt atcgctttac tgatttcggt taatgaaagc 27061 gatcgctctt tttttatctg atttcaactt gtatcacttt tcgaggtcat gaaatttgca 27121 tcttcccatt tttcacttaa ctgtcttaac tgtcttaagg gcagggagta tgtcaaaata 27181 atcctgagct tgtaattttt gacctctaaa atttgaacca tgtttgctct agtctcgcct 27241 gagaaaattc agctacctgc gggagcggtt gtccggttac ctgcaacctg ggaagattat 27301 caggatttgt gtcactggcg gggcgatggt tcaattcccc gcgtgaagta ccgatccgga 27361 gaagtgttac tcatgtctcc tcttcccaag catggacgcg atgcgcattt aatcgcgaat 27421 gtcatcacag tgctgctgga tcatattgga cgtgagtacg atgcttttac tcctgtgact 27481 atggaattac cacaaaagag cggaatcgaa ccagactatt gtttttacat taaccattgg 27541 gaagctgtct ctggtaaaga gcgaattgac tggagtattg atcctcctcc tgatttagta 27601 ctggaaattg atgtgacgag ctattctgac gtgaatgatt acctccctta tcaagtacca 27661 gaaatctggt tgttccgcaa aaaacaactg tatgtttatc agttgcaagg tacagaatac 27721 ctcgttcaaa ctcaaagcca atatttccca aatataaaca tccaagatat agtttccagg 27781 tgcgttgaag ttgcctatga acggaatacc agtgcagcaa ttcgtgaact caagcagcgc 27841 ttaggttgaa cagaggtttt ttttaagtac aaaatggacg ctttattgat tgaggtaaaa 27901 tgcgatgtct acgacgggct acgcctacgc ccttaaagcg aattaatact aaagtgtcga 27961 gtttgcaagc ctcagtggga ttaagagaaa taccttgctt tgtatagtac tgagtttttt 
28021 aactgtctta tctgtccttg tattcaaaag tgtttaaatt tactcaaaaa tcctatcccg 28081 ttccagtcca ttttaatgga cttcgtctat gagcctggga cttacagtcc cgtgacgaga 28141 acaaagtcaa ataatctgta ataactttga cattgatttt cccaggtgca agatataagc 28201 caaaaaataa tagttgacaa tacagacaca atctgagtta agagcttttg tagtttcagt 28261 cacatccgct gacttgcttg tcatccgccc aattctctaa aaattggata cgcataaaca 28321 tcccattggg tcaattacga atacttgacc gacgccctag cataaatagg taatactcag 28381 atcttgcacc attattttcg tccgccaaga aataaattct gaggctaaga gttcaagtcc 28441 gttaaaacgg actgggtaag tctttgagtc cgtacggact taagctatta gccttgtaga 28501 cgcccgccag aaggcttccc tagtggtact tcagttcaag gtgtactcag gtagaggtgc 28561 aagatctgag taataagaat aggaaatact cataaagcaa tgacatctca acctcactaa 28621 agactcaaag ataacttagg caatgaatag caacagaggt gaaaatctaa agagaaaagc 28681 catagttata gacataaccg gaggtttggt caaagcagga tttgctggag aaaatgcacc 28741 tagtgtggtc tttcctactc ttgtcggtcg tcttaagtct gagagtatgg gactcaacga 28801 aatctacttt ggtgatgaag cggcgcttaa gcgagatgtt ttagcaatga catctccagt 28861 ggagaaaggc atagtcacaa actgggacga cttccaaaaa ttgctggaat acacttttag 28921 cgccttgaaa gtcaacgtgc aagaatgtaa cgtcctaatc accgatatat tctggaactc 28981 tcagagtaac cgtgaaaagc tttgtcaaat gttgtttgaa tttggtgttg caggtctata 29041 ccttgctcac gacgcagtcc tatctctgta tgcctcggaa aaaagtaccg ctatagttat 29101 caatattacc aacgacttca ctgatgttgt tcctatttgc gaagggtgtt ctattcctca 29161 tgccaggcga cgtattagca tagggaaaga gaaccttgta agttcagagc cttttcgtcc 29221 gcttgagact ttgtttgagc ctactctaac gaaatcaatt gttagtgcta tctcgaattg 29281 cgatccgaag atttgccaaa ctctgtataa aaacattgta cttgcaggag agggcagtat 29341 gtttgagggg ttgcctgagc gacttgagaa ggaagtacgc tcccttgcgc catcaggggt 29401 cacagttaag gtcgtcgcct ctccccagag gaaagatttt gcttggattg gcggctcaat 29461 gttagcatct ttaacgacgt ttgagaccat gtggttcacc aaggaagaat aatactttgc 29521 tttgtatagc agttgaggtc atcttaagat aaaattgtca tcccctagca tctcaatcaa 29581 accatgcagc tgacgataga tcttcccgaa cagacatttc aaagactggc tcagattgca 29641 gagttaacca accaatcctt ggaggatttg attgtccaga 
gcgttactgg gaatttaccc 29701 cctgctgttg aaactgctcc ctctgagatc caggctgaac tgcttgaact ccagactttg 29761 agtgttgaag cgttacggca aattgctcaa agtcaggtgt cgccgtttca gcaagatcgg 29821 cacatggcac ttctagagcg aaatcaagat ggtttactca cccctcaaga gcagcaggaa 29881 cttcgccaac taaccctggc ggcggatcaa ttgatgctca aaaaagccca tgcttgtgcc 29941 atattacgtt ggcgagggca acccattcgt accctgagcc aactttctcc caattaagga 30001 atgtcttcaa tttcccaatc aattcgccag caagttgttt cagaagcaaa acaatgctgt 30061 gaatattgta tgactcagca attgctaata ggtatgcctc ttgtgattga ccatgtaatt 30121 ccccgttcag ctggcggaag cgacaaacgc gaaaatttag ctgcttgttg ctatcgctgt 30181 aatgaattta aaggtgccaa aactcaagct actgatccaa tgacaggaga acaggctccc 30241 ctttttaacc cacgtcaaca aatttggtct gaccattttg cttggacaaa tgctggcaca 30301 catattgctg gcttaactgc aataggtcga gctactgtgg aaacacttaa acttaacaac 30361 gattacatcg ttgaagcccg aaagatttgg gttgctcaaa actggcatcc accggaactt 30421 tagccggcat ttttgcacgc attgggatgc tccctgataa ctgataattg ataactgtta 30481 attggttaaa ttaattggtt aaattcgagt tgtcgctagt tgtttttcaa agtaagcacg 30541 gtagagttcg caactcggta aaatgcgatc gcccttaaaa caaatcaatc ccaaactgtc 30601 gagcttatat acatcaatgg gattgacata agtactttgc aatgctttca aaattaagga 30661 actattaaga ctgttactgt tagtaaaaca ccaagtggca aatattttgc atctatacta 30721 accgaagtgg aaggggacaa cccatctact tgcgaaggta agatatacgg tattgatttt 30781 gggctgaaac attttgccgt tgtaactgac ggtgagaaag tttcaaagta cgacaatcca 30841 agaaatcttg ccaagcacga aaaaaatcta aagcgtaaac aacaaaagtt agcacgtaaa 30901 caaaaaggta gctcttctcg gttcaaatat aaaaaagttg ttgccaaagt atacgaacgg 30961 gttagcaatt cgcggcaaga ttttttacac aaacttagtt ataagttggt cagcgatagc 31021 caagctgtca tagtagaaaa tcttcatctt ctaggcatgg ttcgtaatca taaattggca 31081 aaagcaatat ctgatctagg ctggggaacg ttcactaatt ttctagctta taagctagaa 31141 cgcaagggcg caaagttagt tgaaatcgac agatggtttc ccagttccaa gctctgctct 31201 aattgtttct atcaaattgg tgaaatgcca ctggatgtga gggactggac ttgtccccac 31261 tgtggcactc atcatgaccg tgatggaaat gcagctatca atattagagc agaaggtatc 31321 agaatgctaa aggcggaagg 
ttcagccgtc tctgctgtag gaggagaagt aagaccaaag 31381 cttggacgaa agtctaatct aaggcattct cccacgagta cagaagcccc gtccgcctat 31441 gcggcggggt agttcactca gttatgtcaa atgtgactaa acggtttttt tgagtgttat 31501 tcataacaca agtctttttg aaaagttatt aagcagctat ttaggactta cgcattgaca 31561 aaaagttgac caatgggttt taaaggtagc ctgataaaac ttctacagta gcgagtcctg 31621 ttttttccca actgaattga ctagccctgt ttctgctttg agtcgagaga tgttgacgca 31681 atactgaatc agttgcaata atttgcatcg cttgagtcat ctcgtcaacg ttgtaagggt 31741 tgatgaaaat agcagcatca ccagccactt ctggtaaaga tgagaggttg gaggtgatca 31801 cgggagtacc acaagccatt gcttcgagaa ctgggaaacc aaaaccttcc caaagactgg 31861 ggaagacgag ggctgttgca ccactaataa ttttgggtaa ttcgctgtaa ggaacatagt 31921 tgaggaattt cactttacca gttatctcca attcctgaac ttgcgtttgc aacaaaggag 31981 taaagcgttt atcttgtggt cctgctaacc acagttcata gtcccgattg tgagataatc 32041 ctgcaaaggc gctgataagt ctgtgcaagt tcttataggg atcgtgtcgt ccaatataaa 32101 ggaagtatgg aaatttcgtg aggcgagtat cttgtccatc actatctaaa tgaattgggc 32161 gaaaatgatt ggtgtcgtgt gcgaggggaa taggagtgat tttgctggcg ctgatgtgat 32221 agaagtcgat gatatctttt gctgtagcgt gagagttgca gataatatgt tgcgcttggg 32281 cgagaacttg aggaacatac aagcgcgagt aatgcatcag tggtgacaaa agtttgggaa 32341 agcgcaacgg gatcaaatcg tgaaccatca cgatagaatg acaattggca taaaggggcg 32401 cttccggaag tggagaaaat atcagttggg atttcagctt tttatagatt tttcgcagtt 32461 gaaattgtgt ccagataaga cgatcaaaat gtcctttagt tccttgagca ggagtcagct 32521 tatctggtat agaataacag ttaaactcgg ggtagctttg agcagttaat agggtgggct 32581 taagagtttt taaataagga aaaagatttt tggcatagtt gcttatgcct gtaggttggg 32641 aaaaaacaat cgataggtta aataatatat tactcattta tatcaaactt aacccgctca 32701 taaatctgac ggtgtgagat ttgacaatta gccgaattta aagttaaaac tccatcatca 32761 gcatcagact cgctaaatag ccagttactc tctggggttt tactaaattg ctcaatatgg 32821 atttgatatt ggttaatgag aatgtattct tgaaattgag gaatagaacg atagtatgtg 32881 aatttatcac ccctatcccg gctactgcta gatttagaga aaacttcaaa gataatacag 32941 ggattaagta tagtgtcagt acgtccttcc tcaaaaacag gttcaccttt aataattaag 33001 
acatctggat agacgtattg gcgataacga ggaatccaca agcctagatc attagcataa 33061 atgtggtaat tttgctcttt gagtgtaagg ctaagggcaa gagtcactat taaattaaca 33121 acgatttgat tatggttaat cgaactacct gtcatgggaa caatttctcc atcacggtat 33181 tcagtgcggt aattggtggc ttcttcaagt gcaaaatact cctctatcga gtaaagacgg 33241 ggtggaattt gtacaaccat aacacgagcc tgtgagtttt tgaatgttgc tctatgacct 33301 tagcctagca ttagagagaa agtttgaagg aagcgattac ctagccccta aaaggggcgc 33361 tgcgcaaacg gtagagctaa gcgcggctag gcgctcagca tgagccttcg cttaacgcct 33421 ccaatacatg aacacgcccg caaatttccc aaatgctact tgaggtttga caaaactcaa 33481 aatcagagca tgagaaatca agcgaagtaa cctgatattg agaatccatt gatttgtata 33541 tctatctaaa gataacaaat aaccataagt actatttttt attttccgaa aaacatattt 33601 atttgtaatt gaagagggct gatgtatgac accaaactgc ttagtcaccg caatcaaatg 33661 tccctgattg gcataacgtc ggcaaaagtc aaaatcttca tagtagagaa aataagcagg 33721 gtcaaactga ggacactcat caaaattccg aagattaacg atcaagctac aacctgtaat 33781 ccaatcacaa gcaacataat ctgtatctgt attcgtcaac aaatcttgag tgataattgc 33841 ccctgttgag gaaataaagc gaccacctgc gaaccaaact tcacctgtcg tagtatgaac 33901 aattgtgccg agaatagaaa ttttgggatg accctcaaaa aataaccgaa ctttttccag 33961 aaaaatttct aaaaaataag catcaggatt aattatccag acaagtccgt gagcatcttg 34021 ggtataaatc catttcatcc ctaaattgca accaccacca aagccgacat tactctccgc 34081 attaaaaatg agtaccaatt cacttttaag atgatagata gaattgtcat caggagaatt 34141 attaatgatg acgactttat aatcatgact gtcataactt ggtagagaag taatcagttt 34201 agtaataaga tttgtggaat aataattaac aactaaaaag taaatcacgt tagcttaatt 34261 gataaatttt ctctaatgat gaaaccacag aaaaacaaaa atacacacag ataaattatc 34321 tgtatttatc tgcgtccatc tacttcggcg tccatctgcg gttccaaata atcctaaacc 34381 caatttttac aacaaatcta atctccctgc ggataatcct cccacaatgt tgtaatctct 34441 cgcaaacttt gatgattacc atcgcctaaa atcagatgat ctaaaacagg aatggctaaa 34501 aattgtcctc ccattaacaa ctgtcgcgtc aattcaatat cttcttgact cggttccaca 34561 tttccggaag gatgattatg tgccacaata acccgcgttg ctccttgacg aatcacttca 34621 cggaaaattt cccgaggagg agccaaggtt tccgttgctg taccaatagt 
aatcacttgc 34681 gttccaagca ggcgattctt cacatccaac aataacaccg caaacctttc ctgattttgc 34741 cacatcaaat cttgactcaa agccgccgcc gcagctgctg ggctatcaat taatgtgcgt 34801 tctcctggac gcgattgaaa ggctcgtttc cctaattcaa tcgccgctaa gatagttgtc 34861 gcttttgctg gaccaacacc aggaatttgc atcaactcgg cggcgctgac atctcgcaga 34921 accgccaagg ggtctcgctg gtgtttgctc agttgctgta aaatatattg tcctagaccc 34981 acggcagaaa gttttccgtg tccttgacca gtacctagca gaattgctat taactcggct 35041 gtggctaaaa ttttaggacc atgtgttaac agccgctcac gcggacgctc agtcacaggt 35101 atatcggcaa ttctgaggct ataagtcata ggcgaataca gaggaaaaat catgatatat 35161 agacttcttc ctagttatcc cctgtaattc tgaaaacatc tagatatttg accaaatatt 35221 taatttaccg gtcttttttg gctttttcta ctggttttta ctgatatatt tgtgttgcct 35281 tgagcatatg atatgtaatt aagagttggg tgagggcaag ggggtaactc tggatggttt 35341 cgcctgttgt accagcgact ttgcctaact ctggatttcc ttcgcgatcg catatattct 35401 tgagataagg catatcagca caaagaatat gttctagctt attgacctgt tccaaagtag 35461 catgaccatc tacttccaga acatcaactg gtttgctcat atccctgtct aatcccaagc 35521 ggattgctga ctcaaaactg ccatcgccgt aattgataat ctggagaaaa tctggatgag 35581 gaattgtcgg acggagtacc aaacgtacat ttaggtgtcc attagtttgc tctaaatcgt 35641 cactgggagg gtaaaaactc accacaatat ctgaccattg ccgttgtgga cggataaatt 35701 gctcagaatc tggttcacgt tttctaagtt cctcaagcac ttgttcttct gtataccctc 35761 gtttttgggt atcccgctta actttccact gagcacgcac agattcgggg ggagcaagat 35821 aaacttttac atcgtaggat tcacaggcac cacgggtaga ataacccagt aatccctcaa 35881 caatcacaaa tttacttggt tttatataaa ctggtggctc aaaagtacca gttttgtggc 35941 tgtatactgg cttgagaatt ggctgtccta tcctaagttg cgacaggtgt tgctgcataa 36001 tatcgagatg gttgcagtca ggatgaagag cagtgatacc aatctcggca cgttgcttac 36061 ggtcgtactt atgataatca tctgtacaaa taaccgtgac attctccggt ccaagaacct 36121 gagcaatccc cttagtcagt gttgttttcc cagcagcact gtcgccaaca ataccaagaa 36181 ttataggacg gctcataact cccccttcca gaagatacag taaactttaa ttttagattg 36241 tatgttattt tactattttt ctaaaaaaaa tagaagacag cttaatcaag aaagtgatga 36301 ttttttgaca taattactaa aaattaagat 
ttttatcttt aacaaatatc tgcatttatt 36361 tttgtaaaaa aatactttta taaatattat atgc // LOCUS NODE_718_length_36040_cov_5.219675 36040 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 36040) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 36040) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..36040 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 153..797 /locus_tag="DP116_05950" CDS 153..797 /locus_tag="DP116_05950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015183371.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05950" /translation="MASTCGSFPDIKNQWARLFVESLANEGVITGFPNRTFRPDTSVT RAEFAVIIAKTFTKLKKKRDYIRFIDVPTHHWAAKAIQTAYAIGFLNEFPNNRFFPDN RISRVEVLVTLVKGLDIASKVKPEELATLQVIYQDTDQIPDYAMTDVAIATSAGLVAS YPNTKLLKPNIPATRADVAVSVYQALLFLGEVQKIPSDYLVVAPQPQTELEHKS" gene 1005..1826 /locus_tag="DP116_05955" CDS 1005..1826 /locus_tag="DP116_05955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307389.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbonic anhydrase" /protein_id="PRJNA477356:DP116_05955" /translation="MKKLIKGLRQFKAKYVSTHQELFEQLSQGQKPRVLFVTCSDSRV DPNLITQAELGELFVIRNAGNIIPPYGATNGGEGATIEYAVQALGIRQIIVCGHSHCG AMKGLLKLYSLRDEMPLVHDWLKYAEATRRLVKDHYSQYEGEELLEIMTAENVLTQIE NLRTYPIVRSKLYQGQLNIYAWIYNIEKGEVFAFDPESHAYVLPQSQLKTDEIDETLL TEDLLNGNAIDNNISEKQTVVEQPQEFIFDADQRFPITRLSKDQMDRIYRGSRTN" gene complement(2259..3758) /locus_tag="DP116_05960" CDS complement(2259..3758) /locus_tag="DP116_05960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316322.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="catalase" /protein_id="PRJNA477356:DP116_05960" /translation="MTEPQNLTTADGIPVSDNQNSLTAGARGPVLMQDFHLLEKLAHF NRERIPERVVHAKGAAAFGTFTVTNDITRYSKAKLFSEIGKKTEVLLRFSTVGGERGS ADAERDPRGFALKFYTEEGNWDITGNNTPIFFIRDPLKFPDFIHTQKRNPQTNTKDHN ARWDFWSLSPESLHQVTILFSDRGIPKTYRHMDGFGSHTFSLINAEGDRVWCKFHFKT LQGHQTLTEEEATKIKGEDPDHATHDLFEAIAQGDYPKWRMCIQVMTDEQASKHPDNP FDVTKVWKHSEYPLIEVGILELNRNPENYFAEVEQAAFSPSAVVPGVSFSPDKMLQAR IISYPDAQRYRLGGNYQQLPVNQPKCPVMHYQRDGFMALGNNGGSGPNYEPNSAEGTP KENPAYAEPAIHLGDVSVDRYNHREGNDDYTQAGDLYRLLTPEQQQRLAENIVGSLSQ ARQDIQMRQLCHFFRADISYGRRVAEGLGISIDPSMLAMLGASAQTVSI" gene complement(3928..7047) /locus_tag="DP116_05965" CDS complement(3928..7047) /locus_tag="DP116_05965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858394.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05965" /translation="MKSTFLSRFTPSLMTPETLETIFVQRHQLADYLVGLIRESALTA NKHFRLLLGMRGIGKTHMITLMYHRVSKMEDLQDKLVIAWLREEEWGVTSFLDLLLRI FRALQKEYPAEYNAKLNQQVEALYQLSQQEAELQAAALLREFVGQRTLLLLMENLDDV FNGLGDIGQKQLRAYIQNYSFLTILATAQSLFDGIIRKDDPFYNFFYYHHLEELTLDE AVDLLRHIAQLEGDKELEDFIKTATGRDRIRAIHHLSGGNPRIYVIFSQFLTRKLLDE LVEPFMRMLDDLTPYYQARMSWLSQQQRKIIEFLADRRRAVTVKEIAQRCFMTHQTAS SQLKDLHQKGYVTPEFIGRESFYELHEPLMRFCLEVKKQRDEPIRLFIDFLRIWYTRT ELQQRLGQDINEIRNNGWFDDLGQQFNEARIDDFADEQNHSRYQQRLEPLPPDAVVER EYVLYALQAMEDDDEDPRVAAYWQEYENCREKKDYVNALRYAEKLVTIRGQAKDWFAQ GRCFGSLKRYEEALASFDKAIELDPNDENSWGGRAAAFYFLQRYEEALASFKKVIELD PNDAQVWCEQGDVLKKLQRYEDALTSYDKAISLDPPNIKRVWGERGDVLDKLQRYEDA LGSYDKAISLDPNYKWAWANRGNVLDKLQRYEEALVSYDKVISLDPNYKWAWADRGWS LNKLQRYEEALVSYDKAISLDPNYEWTWANRGDVLDNLQRYEEALVSYDKAIELNPNY AWAWANRGSSLKKLQRYEEALVSYDKAISLDPNYKWAWVERGLVLANLKRYEEALASF DSAISLDPNYKWAWRERGRVLDNLQRYEEALVSYDKAISLDPNYANAWGGKGWLLDIL GRHEEALASCDRAIALGDQSSSVFFNRAIAILGLNRWEEGIAALDNALERMESTDKAS ADDTELILRNLLNSTNDVALWKTRITTLIELYNKHQVAPALAQGLVREKTIGALMSEM 
VSDKAAQTWLEVWQEVVGNRPEFQIPLRLLNAAIRYRETKGDRRVLLELPIEERKLLQ EVLHISESK" gene complement(7052..8215) /locus_tag="DP116_05970" CDS complement(7052..8215) /locus_tag="DP116_05970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_05970" /translation="MVIPLKLRANPGGQISPNEVIGRDQLIQQFWEILDRQSLMLNAE RRMGKTCIIKKMEAEAPEDKLPIYHDLEKVRSPLEFVETILQDVEEYLSGLRRTARRT RQLLTQISGTEAMGVKLPEFAAPHWKILLTKTIGDLVENQERKVILLWDEVPYMLGNI GNQAAMEVLDTLRSIRQMYPDVRMVFTGSIGLHHVIASLKKEGYTNEPTNDMYSADIP PLSHVDAIGLAQKLLLGENISTTDSWVSAAAIAEAMDDIPFYIHHLIIKLKMRGGTVN EATITKTIEDCLLDPLNPWKMDHYRERIDNYYNDEQRFYALNLLDLLAVSNEPLLFDE LFQNLKQEPETQNKEIARTVLRLLERDYYIIRQSEGYGFRYGLIQRYWNLLRG" gene 8909..10021 /locus_tag="DP116_05975" CDS 8909..10021 /locus_tag="DP116_05975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313387.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="septal ring lytic transglycosylase RlpA family protein" /protein_id="PRJNA477356:DP116_05975" /translation="MNQKHLWTVVAVFSTVFGTQSVGRAQTTKETSPSSPLASGEVVK VGEYKSPAENPASNAVMTEIHTHEVAGRQAATLYIRKIPVLTFVGENVGQKPTASAQT KVGAFSDENDTKLESTNASSPTNVVSIGDSVYVKNQPNSTKDDPVQRASVVAAKINQL IWDKVDANKITVSWIAGGESTANEAQKNDQAGQQSVQGRYIIKANGDEIVEIDDHTWL ADTTKDRAKDALQATNRLRRLVGKASPLNEIANLPAKALVQIPKISVSNLPEQIVKGL KGVASYYGYDGSGNRTATGERFNPEGMTAAHRSLPFGTRVRVTNTRNGRSVVVRINDR GPYIRGRMIDISVGAARILGMMGSGVAPVRIEVLGR" gene 10222..11820 /locus_tag="DP116_05980" CDS 10222..11820 /locus_tag="DP116_05980" /EC_number="2.7.4.14" /EC_number="6.3.2.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870177.1" /note="catalyzes the formation of pantothenate from pantoate and beta-alanine and the formation of cytidine diphosphate from cytidine monophosphate; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytidylate kinase" /protein_id="PRJNA477356:DP116_05980" /translation="MRLLTTVAALRCYLAQHRFKNQLVDQAQLLAFEVTARSKTAVGL VPTMGALHEGHLSLIQRARQENAIVIVSIFVNPLQFSPNEDYQRYPRTREQDQLFCEQ AGVDAIFAPTPEEMGVAQKIVQESKVTQVIPPSAMISGLCGRTRLGHFQGVATIVTKL FNLVQPDRAYFGQKDGQQLAVIKRLVADLNFPIEIVACPTVREASGLAVSSRNQYLTA TQKQQASVLYRGLQSSQAAFRAGVRDAKALIAAVRQEVAMFSSVLVEYVELVEPNTLM LIEEKVEEEGMLAVAARLGSTRLIDNIILHDRQPIIAIDGPAGAGKSTVARQVAAKLG LVYLDTGAMYRAITWLVLQKGIALDDECAIAELANHCVIQLSPGKDLQTPVQVWINDT DVTQAIRTHEVTSKVSAIAAQSAVRQALLKLQQNWGKKGGLVAEGRDIGTQVFPDAEV KIFLTASVSERARRRQQDFQKQGQPEVSIEQLEKDIAERDRKDSTRKVSPLQKAADAV EIRTDGMSVSEVTEQIVDFYTNLR" gene 12103..12972 /locus_tag="DP116_05985" CDS 12103..12972 /locus_tag="DP116_05985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319296.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_05985" /translation="MKLSVKFDFEDEKYNAPASQVIPWCQMINPRYGTNGLQPHGLAI KLDNAQAVGFQSDDNWHELEHEFSSGVETVYLTTTPRIVVVRRGPLSVKDRETGVKLG TLKDNYDAFLADKLKFKTFTRYLIYLVGEDKKFLNESPLQLTLNGAAGASFSKAYSEY QQGKVTSGFVAELERAYAGYRKQPLTPKGALFHAHGIFCPIINCEERGIEPNTVLVAS TVDYKHPTVSTLTEYLIASDSQESEIISKTFEEYKDFGKEAMKAETPRMEMAGVSSSY VYPDEDDYAYPPY" gene 13808..13987 /locus_tag="DP116_05990" /pseudo CDS 13808..13987 /locus_tag="DP116_05990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651099.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" gene complement(14323..15009) /locus_tag="DP116_05995" CDS complement(14323..15009) /locus_tag="DP116_05995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865192.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PHP domain-containing protein" /protein_id="PRJNA477356:DP116_05995" /translation="MTVNIVEVFTSREQLKQVFQSIDAKSCPTYFNFHMHTVHSDGRL QPSALMEQAIAIGLKGLAITDHHGIGGYQAAQSWLENWRWNNPGANAPILWTGVEIHA NLLNIEVHILGYAFNPEHSSMKPYIQRRITTGEEYQAANVISAIQKAGGLAVLAHPAR YRRSHFELIPAAAEAGIDGVETFYAYNNPNPWKPSERESMQVEQLAYEYSLLNTCGTD THGLNLLQRL" gene complement(15348..16373) /locus_tag="DP116_06000" CDS complement(15348..16373) /locus_tag="DP116_06000" /EC_number="6.3.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743972.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine cyclo-ligase" /protein_id="PRJNA477356:DP116_06000" /translation="MDYKDAGVDVEAGREFVNQIRNLVHSTFRPEVIGGLGGFSGCFQ LPSGYKEPVLVSGTDGVGTKLKIAHVLNRHNSVGIDLVAMCVNDVLTSGAEPLFFLDY LATGQLDKEQLTQVVAGIASGCQQAGCALLGGETAEMPGFYQAGEYDLAGFCVGIVER SQMLDGSQVQVGDVAIALASAGVHSNGFSLVRKIVSQTGFSWNDRLEIFGDQTLGEVF LSPTRIYVKPVLSARQAGLEIHGMAHITGGGLPENLPRCLGKDQAIKIICDWSIPPVF QWLAQTGSVSSQAMYNTFNMGIGFVLLVPPHQVQQAITYFESQNVTTFTIGEVITGSG ELIGLPI" gene complement(16468..16674) /locus_tag="DP116_06005" CDS complement(16468..16674) /locus_tag="DP116_06005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873390.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06005" /translation="MDSTELAQYMEATDSISKPWLLVQLRLKKLQERRATLSDNEYFH QLEDIHRDFMNLGEWWRGIEDEVF" gene complement(16930..17532) /locus_tag="DP116_06010" CDS complement(16930..17532) /locus_tag="DP116_06010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743974.1" /note="SodB; iron binding; present under aerobic and anaerobic conditions; destroys free radicals; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="superoxide dismutase [Fe]" /protein_id="PRJNA477356:DP116_06010" /translation="MAFELPPLPYDYDALSPLISSDTLKFHHDKHHAGYVTNLNKLIE GTELANKSLEEIVLATVNDSAKTGIFNNAAQVWNHTFYWHGLKKGAGAPSGELAEKIN ASFGSLDEFKKQFKEAGATQFGSGYAWLVLDNGELKVVKTPNAANPITNGQTPLLTAD VWEHAYYLDYQNRRPDYLDTFLNELINWDFVAEQYANAAK" gene 17813..18301 /locus_tag="DP116_06015" CDS 17813..18301 /locus_tag="DP116_06015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872690.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06015" /translation="MKTAEKLAAGWLLTLGFMFLSLSASAVMQKNAMERPIEPSLRPV VALDESNEDAKYVLDNTAFNGLIFGVPTSVLGVWLALGVYRKTQHERKAIKAQTNDQL QSAFYRMIHENNGRITLMSFAMQLQLPPATAKQYLDEQAKVFNANFKVSEEGGVSYHF DV" gene 18330..18707 /locus_tag="DP116_06020" CDS 18330..18707 /locus_tag="DP116_06020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872689.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF423 domain-containing protein" /protein_id="PRJNA477356:DP116_06020" /translation="MTRIFLSLAALFAGLSVAAGAFGSHALRDKISDRSLEIFEVGAR YQMYHALALLVVGLLLSRIESPPATMIASGWLFIIGIVIFSGSLYALSLSGVKSLGAI APLGGAAFLAGWGALAFAAWSLK" gene 18968..21315 /locus_tag="DP116_06025" /pseudo CDS 18968..21315 /locus_tag="DP116_06025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749376.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" assembly_gap 19986..19995 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 21886..22110 /locus_tag="DP116_06030" CDS 21886..22110 /locus_tag="DP116_06030" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06030" /translation="MEEVVESLNGNDKKEILRNYIQQNKEKIEDAKEDKCISFQEIIA IASGLLTIGDIAANGIGELISTLQNTGLLR" gene 22331..24355 /locus_tag="DP116_06035" CDS 22331..24355 /locus_tag="DP116_06035" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06035" /translation="MNGKTLNTAAQILEAFRACKNAGEEIDLFECLATRPEVRVVVFI EILQKIKREPVLALTLQAFGKITDADVKQQLKQSDDLLIMLSEQARSGSTDLIRWAAA TTIEKLGFDLTIVSQHLSEEPQSIAEKIMQSQMKRFADENLIQSNDYDEFLRFWIYGY WYKLKELTLGYEFWKLKEEWESKREIGWEDSHDPQEIQRLNKFSVCWDVMNALNLRGL SEVNLALQKAEECGDNASEIDENEVFEGIAQVFSANQLTELSSDSDFQVLMETQFHCL ESNNKTTRLVAAKEILTFGNNSLDKIRDEQLQMFHSLEAFLEIETCEVTRQTTYEQLE TLSENIDFLEKQLKRNKVRSALSQMKLVILDEMSKRKSKFAAAKLSCENLRESIKLKI DQYLKTIQSISTEIYDQIFDQVQLDPIPDITVDNEHALSLLEDYKNLLAKKLSIASSV HKQVLTFYSKLEKCQTNLEKIQNNLNLIKSINYRVYGYLALKDELIELKVPNSINYHN YTQVTLLDQYSNYLKNKLKKLRDEVVSRYEKLIPSQLSPLDTEIKKAKKKMERAKNIG DAINNFSRILCIYIISLIMLPWTLVMMIPVGLLIGIVKLLFGDAGENLTRILGLIAAP GMLFYLGIEKTIYTLIQQYEKRLESQRILRYEKLESLKDEERKILNLLSI" gene complement(24425..25033) /locus_tag="DP116_06040" CDS complement(24425..25033) /locus_tag="DP116_06040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06040" /translation="MSKVYTTEQLIQILRSERQACLKGNRLKLAVTVSGNPVIDQFIR TDGLQQFTAYQDFKATIHEYQNEHQVSGIVWREVTVKGKILRYPEVDAQLIALPSDIE ILIAAKNSILEFWNEVTVGMDLYLSFSHGKQHRQIEKNDVDRIGQRTEWTSLSKCENT NFLEVILQLGWGKPEEACYKRGFPTSGSECIHAVNPGNRPIG" gene complement(25229..25813) /locus_tag="DP116_06045" CDS complement(25229..25813) /locus_tag="DP116_06045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873913.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thioredoxin family protein" /protein_id="PRJNA477356:DP116_06045" /translation="MNGIKWRFLLNLIPNWRRLFSLFLLLTCLLFSNGQSASAGIDDD RYDGNIFVVYAGNGSLVPPKLNLAKALAEHKPTLLAFYLDDSSDCKKYAIFVSTIQEY YGRVAEIIPVNVDSILDGKTYNSTEPGYYYSGVVPQVVVFNQSGEVVLNQKGQVPFEK IDDKFRQVFDLLPRTESVPLKRRAFNELNSELTR" gene 26117..26317 /locus_tag="DP116_06050" CDS 26117..26317 /locus_tag="DP116_06050" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06050" /translation="MTLCAILKKNKCEKSAAIGRWAHKGRERGIAVTLESGFKAMTTT NDKSKQQQVRRIIELGHIKILE" gene complement(26372..26746) /locus_tag="DP116_06055" CDS complement(26372..26746) /locus_tag="DP116_06055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878569.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_06055" /translation="MELTQALILLDTNIVLYFLGGRLVKPLPSGKYFVSVITEMELLS YSSLSSDEEVQIRNFLTKITVIGISSHIKEIVIELRRQYKLKLPDAIIAATAQSLNAT LFTNDVKLTNLKEINTQSVQII" gene complement(26737..26952) /locus_tag="DP116_06060" CDS complement(26737..26952) /locus_tag="DP116_06060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878568.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06060" /translation="MLSFNKKIVTDEAMRPVAVLIDYQDWQKIEQILEAYQAQQKEEF DINKYAGVMKLTQDPLEYQQQIRNEWS" gene complement(27031..27504) /locus_tag="DP116_06065" /pseudo CDS complement(27031..27504) /locus_tag="DP116_06065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740417.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(27551..27916) /locus_tag="DP116_06070" CDS complement(27551..27916) /locus_tag="DP116_06070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002746759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" /protein_id="PRJNA477356:DP116_06070" /translation="MTTNAETPKRGDIWLVNFDPTIGAEIKKIRPAVVISSDAVGKLP IKLIAPLTDWKPYFADNLWHVKIEPDMANNLTKASAIDALQLRGVDLQRFIRKLGIVS DITMSAISTAIVTVIEAEV" gene complement(27906..28202) /locus_tag="DP116_06075" CDS complement(27906..28202) /locus_tag="DP116_06075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017720570.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06075" /translation="MKSSLYLTATVSQGNKIEIQNPNLIEGQTVEIVIIIPQPDIPTP DDNQSISLEQRQAFLKLPLAERRRILENQAETMVSHYQQDSDWQELMVGDIIDY" gene 28485..29822 /locus_tag="DP116_06080" CDS 28485..29822 /locus_tag="DP116_06080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409929.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1727 domain-containing protein" /protein_id="PRJNA477356:DP116_06080" /translation="MGNKIQLIDRIRLAFAVSVAKGVTLAVRSLRLGAASVLPGSLAR RIEPRLLQLLTQQVKNGVILIAGTNGKTTTSLLLRTILERKGYRVAHNSTGANLENGL MTALLEDTNLVGGLDVDYAILEVDENILPKILAPIQPKIILCLNLFRDQLDRYGEVDT ISKRWTKVISTLPTETVVIPNADDPTLSYLGQQLPQRVLFFGLNEPENYLDEIPHAVD SIFCPNCGHSLDYKGVYLSHLGDFHCPKCGFSRSKPALDSSEFPQILVGLYNKYNTLA AVTAAIELGVDEATILDTINNFQAAFGRAEELEINGKRVRILLSKNPVGTNETIRVVT QSIDKTTLLVLNDRTPDGTDVSWIWDVDTEKLVERGGTLVVSGDRLYDMALRLRYSEK TPESHCNLIVEEDLRQAIATALEHTPENETLHILPTYSAMLEVREVLTGRKIL" gene 29914..30405 /locus_tag="DP116_06085" CDS 29914..30405 /locus_tag="DP116_06085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="damage-inducible protein DinB" /protein_id="PRJNA477356:DP116_06085" /translation="MLIQHFQMLANYNTITNRKVYEVCSQLSDVERKQIRQAFFKSIH GTLNHIMVGDRIWMGRFEGKQMPSTNLDAILYEDFDELRSVRVLEDERIEAFMSKLNE DFLTKTISYVNNQGKLHTDPPNLLLAHFFNHQTHHRGQIHDMLSQTEIAPPVLDMHRV IRP" gene 31061..31879 /locus_tag="DP116_06090" CDS 31061..31879 /locus_tag="DP116_06090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein CobQ" /protein_id="PRJNA477356:DP116_06090" /translation="MSYQQHELTIGWLYPKLMSTYGDRGNVICIERRCQWRGYSVKVL PLDQSATAADIRSVDVIVGGGAQDRQQEIVMRDLQGAKAQAMREKIENGTPGVFTCGS PQLLGHYYEPAFGQRIEGLGILDLVSVHPGENVRRCIGNLVIEVTATRLARDLEEMMG SKPYLIGFENHGGRTKLGKVEALGRVVYGLGNNGEDGTEGAFYQNAIATYSHGPLLPK NPFVADWLIQTALRLKYQQPITLPQMDNTLALEAREAMFKRLKVSIPSVTAAKV" gene complement(32088..32369) /locus_tag="DP116_06095" CDS complement(32088..32369) /locus_tag="DP116_06095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_06095" /translation="MARLDGLATVLDFINGLQPKIAAQIAKKVLALNVDPIPVDSQAL SGYEGYYRVDSGEYRIVYRFFPDQDLVEVILVGKRNDDDVYKRLKRLLG" gene complement(32356..32625) /locus_tag="DP116_06100" CDS complement(32356..32625) /locus_tag="DP116_06100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013190594.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prevent-host-death protein" /protein_id="PRJNA477356:DP116_06100" /translation="MHIYTLTDARNKHGEVFDKATVEPVLLTKQSRPSHVIMSAESYQ QLINRLTELEDMVLGESAKAALSQSKMVGTETFTSALERLADGET" gene 32855..34366 /locus_tag="DP116_06105" CDS 32855..34366 /locus_tag="DP116_06105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006635813.1" /note="This is a divergent form of trpE. It is not obvious if it is active in Trp biosynthesis. Component I catalyzes the formation of anthranilate using ammonia rather than glutamine, whereas component II provides glutamine amidotransferase activity; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="anthranilate synthase component I" /protein_id="PRJNA477356:DP116_06105" /translation="MKPIQPWHWRKLPLAKLSGAQVFEVLFLKTSDDVERAQSGSRQR IATLLESPVTPNCTHLARYSICAGSPRCIEGKPQLWTPPVGEILPFLRHLLNSQTQAE DASRKAEYIVLGELPFTGGWLGWLGYDLAWEIEELPQLKADPLPFPVAYWYEPESFAV ADHQQQILWLAATDSAQLDVMQSQLEQADKEREIQNSPKAYKTNPITPVFQMSQDDYE AAVRRAKKHIQAGDIFQANLSLRFETHTPCDSWLIYRALQQINPSPFASYWQTPWGAM MSCSPERLVQLSGRQVQTRPIAGTRSRGATPTQDDLLAQELISNTKERAEHIMLVDLE RNDIGRVCEWGTVKVDELLTIERYSHVMHLVSNVIGTLHPNYDAVDLIRAVFPGGTIT GCPKVRCMEIIEELEPVKRNLFYGSCGYLDWRGNLDLNILIRTLLYSNRSDSPPGAIV WGQVGAGIVADSNPEKEWYESLHKAQAQLNALKLVFDTTKSLFSRSLALPGNA" gene 34629..36017 /locus_tag="DP116_06110" CDS 34629..36017 /locus_tag="DP116_06110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111292.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06110" /translation="MLSEEQRLQQFINTKGKQLQELPSKPETEPVKAKLLQEVEALLT QQKELQTQIRTSSPKYAALQYPQPLKLPQIQQQLDKDTLLLEYSLGEERSYLWVVTPN SLNSYELPAREQIEKAAKNFRDDLQQPTAGNLAAKTATELSKLILAPVADKLAQKRLV IVADGALQSIPFAALTEPGKSAASSNYQPLIVNHEIVNLPSASTIAFHRLELKGRKTA PKTLAILADPVFGVDDDRLSGKSKALAAELDLRSQLQQSALKQAARNFNRNGWGRLPG TGEEAKAILKLVPSSNYLQAFAFDANYNWATNKQLSQYRFLHFATHGFADPNSPELSG IVLSLVDKSGKPIEGYLRLGDIFNLDFTADLVVLSACETGLGKDVNGEGLVGLTRGLM YAGAERVAVSLWQVSDEGTSQLMQEFYKEMLQQGKSPTAALRAAQLKLLQDSKWNKPS YWAAFTLQGEWR" BASE COUNT 10629 a 7442 c 7574 g 10385 t 10 others ORIGIN 1 ttataattct taataagatt tctcatttct ttaaattgct gaatactctt aagcctctct 61 cgtcgtcagg ctaaaaattt cggttttctg ctgatgggaa aaataataca tagcaagaac 121 atcaggaaaa tagctagctt ggagtgacga caatggcatc tacttgtgga tcctttccgg 181 atattaaaaa tcaatgggca cgcttatttg tagaatcctt agccaatgaa ggggtaatca 241 ccgggtttcc caataggacg tttcgccctg acacttcagt aacccgtgct gaatttgctg 301 ttattattgc taaaacattt acaaaactta agaaaaaacg agactatatt cgttttatcg 361 atgttccgac tcatcactgg gcagcaaaag 
ctattcaaac agcttatgca ataggatttc 421 taaatgagtt tcctaacaat cgcttctttc ctgacaatag gatttcgcga gtcgaagttt 481 tagtaactct ggtaaaaggc ttggatattg cctcaaaggt aaaacctgag gaacttgcaa 541 cccttcaggt catttatcag gatacagatc agattcctga ttatgcaatg actgatgttg 601 caattgctac aagtgcaggt cttgtagcta gttacccaaa tacaaaatta ctcaagccca 661 atataccagc aactcgcgcc gatgtagcag tttctgtgta tcaagcactt ctatttctgg 721 gtgaggtgca aaaaatcccc tccgactacc ttgtcgtcgc accacaaccg caaactgagt 781 tagaacacaa gtcatgatca agaaattcgt tcgtgacgtt ctctcccgtg agttcttgtg 841 ggtataacgg aagatactcg cctccacata aaagaaatgc tgtatggctc tagaaaaagt 901 atttttaata ctaccaacag atgaaattga gaacactttt atactacact aactatcaga 961 aagaaaattt ctgagattgc ctctatttat ctaaagttag ctgtatgaag aaattaatta 1021 agggtctgcg gcagtttaag gctaagtatg ttagtacgca ccaagaactc tttgaacaac 1081 tatctcaagg tcaaaaaccc cgagtgttgt ttgttacttg ttcagattcg cgtgtagacc 1141 caaacttaat tacacaagct gaattaggtg aattgtttgt tatccgcaat gctggcaata 1201 ttattccgcc atatggagca accaatggcg gtgaaggtgc gaccattgaa tatgcagtgc 1261 aagctttagg tattcgacaa attattgttt gtggtcactc gcattgtggt gcgatgaaag 1321 ggctattgaa gttatatagc ttgcgagacg aaatgccgct tgttcatgat tggttaaagt 1381 atgcagaagc aacccgacgg ttggtaaaag accattacag ccaatacgaa ggggaagaac 1441 tactcgaaat tatgactgct gaaaatgtac taactcaaat tgaaaatctg cggacttatc 1501 caattgttcg ctccaagcta taccaaggac aactaaatat ttacgcatgg atttataaca 1561 ttgagaaagg agaagttttt gccttcgatc cagaaagtca tgcttatgtc ttgcctcaaa 1621 gtcaactgaa aactgatgaa atcgacgaga cacttcttac agaagattta ttaaatggta 1681 atgctataga caataacatt tctgaaaaac aaacggtagt tgaacagcct caagaattta 1741 tctttgacgc ggatcaaagg tttccaataa cacgactttc aaaagaccaa atggatcgaa 1801 tttaccgagg ttcaagaaca aactgatttg agaagactcc tcatgctaaa gatttgttaa 1861 gtagaactac tcccttcccg gttcaagatt acctaatctt cttcctttct tgcttcgtgt 1921 cctatgcgct aaggcgcacg ctacgcgaac gcgtctatgt cctatggcta acgccacgct 1981 tcgctatcgg acacgctagc cccgaagggg gcgctatgcc ttcggcacga cttcgtctat 2041 cgcaaacgct aacgtggttc atttaatcgg tattcttttg gcgggaaggg 
agtaagcgaa 2101 ttttgtaggg tgggcactgc taactgtgcg ctcaaacctt tattttcagc ttgtgtgcag 2161 tgcccacccg atagccgtaa cgcgtggcgt cagccatagc gttttgagaa tggtgcgaga 2221 tctcaatctc agtctaatta ttctactgga ctggctttct agatacttac agtttgagca 2281 ctagcgccca gcatagccag catagatggg tcaattgaaa tgcccaagcc ctcagcgaca 2341 cgacgaccgt aagaaatatc tgcccggaag aagtggcaaa gttgacgcat ttggatatcc 2401 tgtcttgctt gactcagact accaacgatg ttttcagcaa gacgctgttg ctgttcagga 2461 gttagtaacc gatacaaatc acctgcttga gtgtaatcgt cgtttccttc acgatgattg 2521 taacgatcaa cagagacatc acccagatgg atggctggct ctgcataagc tggattttct 2581 ttcggcgtac cctcagcact attgggttca tagttaggac cgctaccacc gttgtttccc 2641 aacgccataa agccatctcg ctggtaatgc atcactggac acttgggctg gttaacgggt 2701 agttgctgat agttaccacc caagcgatac cgttgagcat ctgggtaaga tatgatgcgg 2761 gcttgaagca ttttgtcagg agagaaactc acgccgggaa ccaccgcact agggctaaaa 2821 gcggcttgtt ctacttcggc aaaataattc tcagggttgc gatttagttc tagtattcca 2881 acttcgatta agggatattc tgagtgcttc caaactttcg tcacgtcaaa gggattgtct 2941 ggatgttttg acgcttgctc gtctgtcatc acttgaatgc acatacgcca cttgggataa 3001 tctccttggg cgatcgcctc aaacaaatca tgagtcgcat gatccggatc ttctcctttg 3061 atcttggtcg cttcttcttc cgtcaaagtt tgatgacctt gcagagtctt aaagtgaaat 3121 ttgcaccaga cgcgatcgcc ttcagcatta attaaactga aagtgtggct accgaagccg 3181 tccatgtgtc gataggtttt cggaattcct cgatctgaaa acaagatcgt cacttggtga 3241 agcgattccg gactgagtga ccaaaaatcc catcttgcat tgtgatcttt ggtattggtt 3301 tggggattac gcttttgggt atggataaaa tcaggaaact tgagtggatc gcggatgaag 3361 aaaataggag tgttgttacc tgttatatcc cagtttccct cttcagtata aaacttgagc 3421 gcaaagcctc tggggtcacg ctcggcatct gctgaacccc tttctccgcc gactgtagaa 3481 aagcgtagga ggacttctgt tttcttgcca atctcagaga aaagtttggc tttactgtag 3541 cgtgtgatat cattagttac agtaaaagta ccgaaagctg ccgctccttt agcgtggaca 3601 acccgctcag ggatacgttc tctattaaaa tgagccagct tttcgagcaa atggaaatcc 3661 tgcattaata caggtccacg tgctccagcc gtgagtgaat tctggttatc actaacagga 3721 ataccgtcag cagttgtcaa attttgaggt tcagtcatgg gtacgaaagt tctccattaa 
3781 gttactcaga ttgccctgca tgacattttg gtcagcagac aaacactcac ttataaaatc 3841 aaataactcg aactcatatc gagtacatat aaaaatcata atcgttataa ctatgatttg 3901 caacaaacca acagaaaatg agtattatta cttggattca ctgatatgca gcacctcttg 3961 taacaacttg cgttcttcaa tgggtagttc tagcaagaca cgcctgtcac cttttgtctc 4021 ccgatagcgg atagcagcat tgagaagccg gaggggaatt tggaactcag ggcgatttcc 4081 tactacttcc tgccaaactt ccagccatgt ttgtgctgct ttgtcgctta ccatttctga 4141 catcaacgca ccgatggttt tttcgcggac aagtccctgt gctaatgctg gggcgacttg 4201 gtgtttgttg taaagttcaa ttagggttgt gatgcgggtt ttccataacg ctacatcgtt 4261 tgtgctgtta aggagattac gtaaaattaa ctcagtatcg tcagcagaag ctttatctgt 4321 agactccata cgctcaagtg cgttgtctaa tgctgcgatg ccttcctccc aacgatttag 4381 tcctagaatt gcgatcgctc tgttgaagaa aactgacgaa gattggtcgc cgagtgcgat 4441 cgcgcgatcg caggacgcta aagcctcctc atgacgtccg agtatatcca gtaaccagcc 4501 tttgccgccc caggcatttg cataattggg gtcaagcgaa atcgctttgt cgtaagatac 4561 taaagcttcc tcatagcgtt gtagattgtc tagtaccctg cctcgctcac gccaagccca 4621 cttataatta ggatcaagcg aaatcgcact gtcaaaagat gctaaagctt cctcgtaacg 4681 tttgaggttg gctagcacca agcctcgctc aacccaagcc cacttataat tggggtcaag 4741 cgaaattgct ttgtcgtaag ataccaaagc ttcctcgtag cgttgcaatt ttttcagcga 4801 cgagcctcgg ttagcccaag cccaagcata attggggtta agctcaattg ctttgtcata 4861 ggataccaaa gcttcctcgt agcgttggag gttgtctaac acgtcgcctc ggttagccca 4921 agtccactca tagttcgggt caagcgaaat cgctttgtcg taagatacca aagcttcctc 4981 gtagcgttgc agcttgttca gcgaccaacc tcggtctgcc caagcccact tatagttcgg 5041 gtcaagcgaa atcactttgt cgtaagatac caaagcttcc tcgtagcgtt gcagcttgtc 5101 tagcacattg cctcggtttg cccaagccca cttatagttc gggtcaagcg aaatcgcttt 5161 gtcatatgat cctagtgcat cttcgtagcg ttgaagtttg tctagtacat caccacgttc 5221 tccccaaacc cgcttaatat ttggtggatc aagtgaaatc gctttgtcat atgatgtcaa 5281 agcatcttcg tagcgttgaa gcttcttcaa aacgtcccct tgttcacacc aaacttgtgc 5341 atcattgggg tcaagctcaa ttactttttt aaaggatgcc aatgcttctt cgtagcgttg 5401 gagaaagtaa aatgccgctg ctcgcccacc ccaactgttc tcatcattgg ggtcaagttc 5461 
aatcgctttg tcaaaggatg ctaaagcttc ctcatagcgt ttgaggctac caaagcaacg 5521 tccttgtgcg aaccaatcct ttgcttgacc gcgaattgtt actagtttct ctgcatatcg 5581 caaagcattc acatagtctt tcttttcacg gcaattttca tactcctgcc agtacgccgc 5641 cacacgagga tcttcatcat catcttccat tgcttgaagt gcataaagca catattctcg 5701 ttccactaca gcatcaggtg gaagcggttc tagacgttgc tgatacctgg aatgattttg 5761 ttcgtcagca aaatcatcaa ttcgcgcttc attgaactgt tgtcctaaat catcaaacca 5821 tccattattc ctgatttcat taatatcctg tcccaaccgc tgttgcaact ctgtgcgcgt 5881 ataccaaatt cgcaaaaagt caataaataa tcgtattggt tcatctcgct gcttctttac 5941 ctccaaacaa aatcgcatca gcggttcgtg cagttcataa aatgactcac gcccaataaa 6001 ttcaggagta acgtagccct tctggtgcaa atccttaagt tgacttgatg ctgtctgatg 6061 agtcataaaa cagcgttggg caatctcctt aacagtaaca gcacggcggc ggtctgccag 6121 aaactcaata atttttcttt gctgctgcga aagccaagac atccgcgctt gatagtaagg 6181 tgtcaaatca tctagcatcc gcatgaacgg ttctaccagt tcatctagca acttgcgcgt 6241 gaggaactgg gagaaaataa cgtagatgcg ggggtttcca ccagagaggt gatggatggc 6301 gcggatgcga tcgcgtcccg ttgccgtttt aataaaatct tctagttctt tatccccttc 6361 taattgggca atatgcctca acaaatccac agcttcatct agcgtcaact cctctaaatg 6421 gtgataataa aagaagttat agaaggggtc atctttacgg ataattccat caaacaaact 6481 ttgtgctgtt gccaaaattg ttaaaaatga ataattctgt atataagccc gcagttgctt 6541 ttgcccaata tcacctaagc cgttaaatac atcatctaaa ttttccatca acagcagcag 6601 cgtgcgttga ccaacaaatt ctctgagcaa tgctgcagct tgaagttcag cttcttgttg 6661 cgataactga taaagtgctt caacttgctg atttaactta gcattatatt ctgctggata 6721 ttctttttgt agcgctcgaa atatccgcag tagcaaatct aaaaacgatg taacgcccca 6781 ctcttcctct ctcaaccatg cgatgactaa cttatcctgt aagtcttcca ttttggagac 6841 gcggtgatac atcagtgtta tcatgtgcgt ttttccgata ccgcgcatcc ccagcaacag 6901 gcgaaaatgt ttatttgctg tcagcgcact ttcacgaatc aatccaacta aataatctgc 6961 taactgatga cgctggacaa aaattgtttc caaagtttct ggcgtcatca gactaggggt 7021 gaagcgagag aggaaggtgc ttttcatctt ttcaacctct caataagttc cagtaacgtt 7081 gaatcagccc atagcgaaaa ccatatcctt cactttgtcg aataatatag taatcacgtt 7141 ccaagagtct 
taatacagtc cgtgctattt ctttgttttg tgtttctggc tcttgtttaa 7201 gattctgaaa caactcatca aataacaaag gttcgtttga aactgccaaa agatcaagta 7261 aatttagagc ataaaaacgt tgctcgtcat tatagtaatt atcaatccgt tctcgataat 7321 gatccatctt ccaaggattg agcggatcta ataaacaatc ttcgatagtt ttagttatag 7381 ttgcttcatt gacagtaccg ccccgcattt ttagtttaat aatcaagtgg tggatgtaga 7441 agggaatgtc atccattgct tcagcaattg cagcagcact tacccatgaa tcagtcgtag 7501 aaatattttc tcctaaaagt aatttctggg ctaaaccaat agcatcaaca tgagataatg 7561 gcggaatatc tgctgaatac atatcatttg tcggctcatt ggtgtatcct tcttttttca 7621 aagaagcaat cacatgatgc aagccaatag aacctgtaaa caccatcctg acatcgggat 7681 acatctgacg gatggaacgc aatgtatcca aaacttccat cgctgcttga ttaccgatat 7741 tacccaacat atatggaact tcgtcccaaa gcagaatgac tttgcgttct tgattctcaa 7801 ctaaatcgcc tatagttttt gtgagtaata ttttccagtg gggagcagca aattcaggta 7861 acttaacacc catagcttct gtaccactta tttgtgtcaa tagctggcgg gtgcgtcttg 7921 ctgttcgtcg caagccgctt aagtattctt caacatcttg caaaatagtc tcaacaaatt 7981 ctaggggcga tcgcactttc tccaagtcat ggtaaatagg taacttatcc tctggagctt 8041 ctgcttccat tttcttaatg atgcaagttt tacccatccg ccgttctgca ttcagcatca 8101 ggctttgtct gtccaaaatt tcccaaaact gctgaatgag ctgatctcgt ccaatcacct 8161 catttggaga aatttgccct ccagggttag cccttagttt taacggtatc accatatcta 8221 cacctccaaa tatacgtcaa tattaacgta cgtaattttt aaggtacgaa agaaaacatg 8281 attatgctca aaccattttt tacggagaat gcgacaaagt gagggagtga ggaagaaaac 8341 tattttagtg ttttattata cttattattg ttacctactt agttgtttgc aaatatattc 8401 acaaatagga ctaaaaagtt agttttttac aaaaaatcgt tctcttttgt gataaagaat 8461 actaaagctc tctcttattt cgggaaactt ttgtatataa ataattacta taaataaagt 8521 caaagtatgt tatttttagc cgtttgaata atctcccttt atcatcaatt ttgtttatag 8581 acaagcgtat atgctaccgt aatcaattta attaccaaat tattgtgagg cttttagaaa 8641 aaaattttta attaaatcaa aaactgtcag atagccaaaa tgctcatttt tgtaaggttc 8701 ttgacttgaa ataggagttt aagtttgata aattgaccaa ggattcatct aaaacctgat 8761 ccccaaacat ctgggtttgt gatagtctgt tgcagttacg caaaaagtga aagcagaacg 8821 agttgaagtt ttttttttaa 
agatctatca ctcatcagtg gtgcaaatta gtaacctctg 8881 ttcttcagtc gctttaacat tcggacgcat gaatcaaaaa catttgtgga ctgttgtcgc 8941 tgtcttttct actgtttttg ggacacaatc tgtaggtcgc gcccaaacga ccaaggaaac 9001 gtctccaagt tcgccacttg cttctggtga ggtggtgaaa gtaggagagt ataaatcccc 9061 tgcggaaaac cctgcttcta atgctgtgat gactgaaatt cacactcatg aagtggcagg 9121 acgtcaagcg gcaacactct atattcgtaa aatcccggtt cttacgtttg tgggtgaaaa 9181 tgttggtcag aagccaactg caagtgccca gacaaaagtt ggtgcattta gtgatgaaaa 9241 tgacacaaag cttgagagca caaatgcaag cagcccaaca aacgtagtat caattgggga 9301 cagcgtatat gtcaaaaacc aacctaactc tactaaggat gacccagttc agagagcttc 9361 agtagtagcc gctaagataa accagctcat ctgggacaaa gtggacgcta acaagattac 9421 ggtgagttgg atagccggag gtgagtccac cgcaaatgaa gcccagaaaa atgaccaagc 9481 tggtcagcaa tcagtacaag gtcgctatat tattaaggca aatggcgacg aaattgtaga 9541 aatcgatgat catacatggc tagcagatac aaccaaagat cgagctaaag atgctctgca 9601 agcaaccaat cgtctgcgga gattagtagg taaagcatct cccctgaatg agattgctaa 9661 cttacccgca aaagccctgg tacaaatacc aaaaatctcc gtgtcaaatt tgccagaaca 9721 gatagtcaaa ggactaaaag gagtagcttc ctattatggc tatgatggtt ctggcaaccg 9781 cactgctact ggtgaaaggt ttaatccaga agggatgact gctgcccatc gcagcttacc 9841 ttttgggacg cgagtccgtg tgaccaacac tcgtaatggt cgttcagtag tggttcggat 9901 taatgaccga ggaccatata ttcggggtcg aatgattgac atttctgttg gtgcagcgcg 9961 gattttagga atgatgggca gcggtgttgc acctgtgcgg atagaagtgt taggaaggta 10021 agagtaggta atgaggaggc agagtatgct cttgggagtt cccgcaagca cctgccttgc 10081 aaatgaggag gatgaggaga atgaagaagt catttcttcc cctgctccct aactcctcct 10141 caatgctctt agctggtggt ctactcctca atttcagata taaactactg gggagagata 10201 gacaacacta ggggtatttt tgtgcgcctg ttaacaacag tcgcagcttt acgctgctat 10261 ttagctcaac accgttttaa aaaccagctt gtggatcaag cgcagctgtt ggcatttgag 10321 gtgactgctc gatccaagac agcagttggt ttggttccca cgatgggagc gttgcatgaa 10381 ggtcatttaa gcttaattca acgggcgcgg caagaaaatg ccatagtgat tgttagtatc 10441 ttcgtgaatc cgctacaatt tagtccaaac gaagattacc aacgctatcc tcgcacaaga 10501 gagcaagacc aattattttg 
cgaacaagca ggggtagatg caatttttgc tccgactccg 10561 gaagagatgg gagttgccca gaagattgta caagaatcaa aagttacaca agttatcccg 10621 ccatctgcta tgatatctgg cttgtgtggt cgtactcggc tgggtcactt tcaaggtgta 10681 gctacgattg taaccaagct tttcaacttg gtacagcctg atcgagctta ctttggtcaa 10741 aaggacggtc agcaacttgc tgttattaaa cggctagtgg ctgatttgaa ttttccaata 10801 gagattgttg cttgtcctac cgtgcgggaa gcatcaggtc ttgctgtaag ctctcgtaac 10861 caatatttga ccgcgacgca aaagcaacaa gcatctgtgt tgtatcgtgg tttgcaatca 10921 agccaagcag cttttcgcgc aggagttcgt gatgcaaaag ccctgatagc ggcggtacgg 10981 caagaagtgg caatgttcag ttctgtttta gtggaatatg ttgaattggt tgaaccaaat 11041 acgttgatgc ttatagaaga aaaagttgag gaggaaggaa tgctcgcggt cgccgctcgc 11101 cttggttcta cacgtttgat tgataatatc attctgcacg atcgccaacc catcattgcc 11161 atagacggac ctgctggggc tggaaaatct actgtcgctc gtcaagtcgc agcaaagctg 11221 ggattagtgt atttagatac aggagcaatg tatcgtgcta tcacttggtt ggtgctgcaa 11281 aagggaattg ctttagatga tgagtgtgcg atcgcagaat tagctaatca ctgtgtcatt 11341 caactatctc ccggcaaaga cttacaaaca ccagtgcagg tttggattaa cgatactgat 11401 gtcacccaag caattcgtac tcacgaggtg acatctaaag tatcggcgat cgccgctcaa 11461 agtgctgtac gtcaagcact ccttaaacta cagcaaaatt ggggtaaaaa aggtggttta 11521 gttgcagaag ggagggacat aggtacccaa gttttccctg atgcagaagt aaaaatcttt 11581 ctcactgcct ctgtcagcga gcgtgctcgt agacgccagc aagactttca aaaacaaggt 11641 caaccagaag tcagtataga gcagctggaa aaggacatcg ctgaacgcga caggaaagac 11701 agcactcgca aagtttcccc cctgcaaaaa gcagcagatg ctgttgaaat taggacggat 11761 ggcatgagtg tttctgaagt cacagaacaa attgttgact tttacaccaa tttgcggtaa 11821 ttctaaaaga caatgaggtg caagcaggct aagcctgctt gcacctcatt ttagacgtgt 11881 tgtcgcatat ttcctcattc caaccgaaat actctaaata gctgctttga caggcatttg 11941 ttaaaatctt gggtatatta tgcatgacct gcggaataaa ctcattgagt gcttttactc 12001 ataatattga ttatgtttca agccaccaag tgaattgaaa cctcaagcat tgctaaacca 12061 tgtaaacttg atctgcaaaa cactgacaag cgcctaatga cgatgaaatt atctgtaaaa 12121 ttcgactttg aggatgagaa gtataatgca ccagcttctc aagtcatccc ttggtgtcag 12181 
atgattaatc ctcgctatgg cacaaatggc ttacagcctc acggtttggc aattaaactg 12241 gataatgctc aagctgtagg ttttcaatcc gatgacaatt ggcatgagct agagcatgaa 12301 tttagctctg gagtcgaaac ggtttatctc actaccactc ctcgcatagt cgtcgtgcgt 12361 cggggaccat tatctgttaa agaccgagaa actggtgtga aattgggtac actcaaagac 12421 aattatgatg cttttttagc agacaaactt aaatttaaaa catttactcg ctatttaatt 12481 tatttagtag gagaagataa aaagttttta aatgaatcac ctctacagtt aactctgaat 12541 ggagcagctg gagcaagttt cagcaaggct tactctgagt accaacaagg taaagtcact 12601 agcgggttcg ttgctgaact agaaagggct tatgctggat accgcaagca gcctctgaca 12661 ccaaaaggcg cactattcca cgctcacgga attttttgcc ctattatcaa ctgtgaagaa 12721 agaggaattg aacctaacac agttttggtc gcttcaactg tagactacaa acatcccaca 12781 gtttccacct taacagagta cctaattgct tccgactccc aagaatctga aattatctct 12841 aagacttttg aagagtataa ggactttgga aaggaagcta tgaaagcaga aactcctcga 12901 atggaaatgg caggggtttc tagttcctac gtttatccgg atgaagatga ttatgcttat 12961 ccaccgtatt agtaacaggg aataaagaag cagggagaga gggagagaag gagaaaaggc 13021 tcttcacgtg gatcaatctt ctcctgggta ccctgccccc ttttcccccg cttcttcgcc 13081 cccttgcttg ttttggtcaa tgttcagcta agtttaattg aatttctgta aaagcctctt 13141 taacgggaaa agtcagcagc tgacagtatc gatgcttggc gatggaagct cagaaagcct 13201 cttgaagcta atatctctag gtcttgaagt gttaaagata actgtgtaca tagatgcgat 13261 acgctgcact gaccattaca agccagtaga atctcaccca ttcgagatcc tattggcttg 13321 atctgcaact gatttgtgga agacattgtc aaaaggagat agtagttacc ttgaagccca 13381 gattcaggca cccgacctga attcaaggtt gtgagcagat tatccggcga atgccggaaa 13441 aagacaatgc gatttcctgc tggtagtaaa aattgcgcgg cggctgtgtt tgccagtaaa 13501 tattgaatac ctggttgaga ttctgagtat ttagcactgt gcaggttttg ggcaagagat 13561 ttaccagcat gctaagacga aatgaacgat ggagcatcct gccgatgcca agcccaggcg 13621 tctgagccaa aaatactagg cgctgatgct ctaggggggt ttgaaactgt aacttgatat 13681 ctggacaaag ctaccatttg ctatacattt ttggtagctt ttaaattttt gtacggtatc 13741 tggaataatc tctaaaaaat caaaggtgtc caatttttaa ttttgcatta gaaaagtgag 13801 attcactgat gtacaaaatg ttcaaaattt tgctttttct gtagaaaatt 
ttggtagtcg 13861 taaaatttct cagtctctag ttctacttag gttggcaatt tctcttctgg aagatgcttt 13921 tcgccatact cgtcaaataa gtcaaacaca agttggcgct tggttattat cccaatttca 13981 gcaataattt agcataactc taatgtaagt tttaccatct agaatccaca attcaagatg 14041 agttatcacc catttaattt acatagagtt aggagcaaga cgacttgcta gcagtggttt 14101 tcttgatgcg tgaattttga gattgcttcg cttcccgcca cggctatgtc ctctggacac 14161 gctgcgcgaa cgcctcgcgg ctatgtcctc tggacacgca ctcgcgttcg gctcgcaacg 14221 ctgagtcctt cgggcacgcg caagggacgc gattttgaat tttgaattgt ttaccccctg 14281 gcttgttgaa ccaggagtat tgtgagaaaa atagacatca gcttacaacc gctgaagcaa 14341 gtttaaaccg tgagtatctg taccacaggt gttcaaaaga ctatattcat aagccaattg 14401 ttccacttgc atcgattctc tctcactcgg tttccaaggg ttggggttgt tgtaggcgta 14461 aaaagtttca acaccgtcaa tccccgcttc tgctgcagcg ggtatcaatt caaagtgcga 14521 tcgcctataa cgagctgggt gagcaagtac tgctaatcct cctgcttttt gaatggcgga 14581 aatgacgtta gctgcttgat actcttcacc tgtagtaatt cttctttgaa tgtaaggttt 14641 catgcttgaa tgttctggat taaaagcata acccaaaata tgaacttcaa tatttaaaag 14701 attggcatga atttctacac cagtccaaag aatgggagcg tttgcaccag ggttgttcca 14761 cctccagttc tctaaccaac tttgagccgc ctggtagcca cctataccat gatggtcagt 14821 aatggcaaga ccttttaaac cgatggcgat cgcctgttcc atcaatgcac tcggttgcaa 14881 cctgccatct gagtggacag tatgcatatg aaagttaaaa taggttgggc aactttttgc 14941 atcgatgctc tggaagactt gctttaattg ctccctagag gtaaacactt caacgatatt 15001 gacagtcata accccctctt ttccaaaatt ggtgcttttt tacactaaaa cacatgacac 15061 caacagccca gaatgaacca gcaccggtgt gaagtgttgg ttgctacttt ctgggcatca 15121 ttagcaaaat ttttcaatat gttaagacta cgttagcaaa ttcaaaggct agcgatcatg 15181 cgtattttcg atttctttta taagaaagag taaggaatgt tgcatcaaat ctccgtttaa 15241 tctacatttc gtaacactcg tagatattta gtaaatatta aaaatcaaac aaataagtac 15301 tttcaaagac caaaaataaa gactttatgt tgtataaatc gattcactta tattggtaat 15361 ccaatcaatt cacctgagcc agtgatgacc tcaccaattg taaaggttgt aacgttttgt 15421 gactcaaagt aggttattgc ttgctgtacc tgatgaggag gtacaagtag cacaaatcca 15481 atccccatat tgaaagtgtt atacatagct 
tgggaactca cagatccagt ttgtgctaac 15541 cactgaaaca caggtggaat agaccaatca cagattattt taatagcttg atctttaccc 15601 aaacaacggg gcaagttttc tggtaaacct ccacccgtga tatgagccat accgtgaatt 15661 tctaatcctg cttgacgtgc actcagtacg ggtttaacgt aaatacgcgt gggtgagagg 15721 aaaacttctc ctaatgtttg atcaccaaat atttctaaac gatcattcca tgaaaatccc 15781 gtttggctga caattttcct gaccaaacta aagccattgc tatgtacgcc agcgctagcg 15841 agtgcgatcg ccacatcccc cacttgtacc tgggaaccat ccagcatttg gcttctttcg 15901 acaattccta cacaaaaccc agccaaatca tactcacccg cctgataaaa acctggcatt 15961 tcggcggttt ctcctcccag taaagcgcaa cccgcctgct gacacccaga ggctatacct 16021 gcaacaactt gcgtgagctg ctctttatct agctgacctg ttgccagata atctaaaaag 16081 aatagcggtt ctgcgccaga tgtcagcaca tcattgacgc acattgctac caaatcaatt 16141 ccaacgctgt tgtgacggtt gagaacgtga gcgattttta gtttagtacc tacaccatca 16201 gtcccagaaa ccaaaacagg ttctttataa ccacttggta gttgaaagca gccactgaag 16261 ccacccagtc caccaatgac ttctggtcta aaggtgctat gaaccaaatt acgaatttga 16321 tttacaaact ctcgaccagc ttcaacatca actcctgcat ccttgtaatc catgagtcaa 16381 aacatcgcta tcacttcaag ttactaaagt ttacaatcaa tcgtccaaaa ttttttacta 16441 atgactaatg accaatgact aatgacttca aaatacctca tcttcaatac cgcgccacca 16501 ttcccccaag ttcataaaat ctcggtgaat gtcttctaac tgatgaaaat actcattgtc 16561 tgaaagtgtc gctcgacgtt cttgaagttt cttcagtcgc agttgcacca atagccaagg 16621 tttgctgata ctatcagttg cttccatgta ttgcgcgagt tcggtactat ccatatgctt 16681 tttaaaattg tagattttgg atggagtttt ctaaaagata gccaggaact tggtttctgg 16741 ctcatagctt aagtctactc aagtagactc aaaaccaagt caccgaaata atttagtcgt 16801 cttgagacga cttgagttat tagcctaggg ttttaaaccc taggcggttg ttgggactaa 16861 tgcaaaatct cagttaaaac aaactcaagc tatgcaagag aaacttgttt gcagacgagt 16921 tttgccagct tatttagctg cgttggcgta ttgttcggct acaaagtccc agttaatcag 16981 ctcgttcagg aaagtatcca ggtaatctgg acgacgattt tggtaatcca gatagtaggc 17041 gtgttcccaa acatcagcgg ttaacaaagg agtttgaccg tttgtgatcg ggttagcagc 17101 atttggtgtt ttcacaacct tgagttcgcc gttatccagc actagccaag cgtaaccact 17161 gccaaactga 
gtagcaccag cttctttgaa ttgttttttg aattcatcta agctaccaaa 17221 gctggcattg attttttcgg ctagttcccc agaaggagca ccagcgcctt ttttcaagcc 17281 atgccagtag aaggtatgat tccatacctg agctgcattg ttgaagatac cagttttcgc 17341 tgagtcatta acagtcgcca atacgatttc ttcgagtgac ttgttggcaa gttctgtacc 17401 ctcaatcagt ttgttgaggt tagtgacgta accagcgtgg tgcttgtcgt gatggaactt 17461 tagagtgtca cttgaaatca atggagacag agcgtcgtaa tcgtatggta atgggggaag 17521 ttcaaatgcc attgtgtaaa tcctctcttt agcgtttttc agtttagagc ttatggggta 17581 gctctggaaa aaaatacccg tgctaagcag ttcttacagc ttggcactgc gtgataaaac 17641 ttaaacagtt tgattctact agcaaatgtg gcatcaatag cctcatgcat aggtaggata 17701 tgttcgagat tgcgaatcgc cccttgcagt caacatttct actaatgact ggaagcttct 17761 aagaactgac gttagaatca caaacaaggt ttgaatcaga aagataacaa ctatgaagac 17821 tgctgaaaaa ttggctgcgg gttggctact aacactcgga ttcatgtttt tgtcgctctc 17881 agcctctgct gtaatgcaga aaaatgctat ggagaggcct atcgaaccaa gtctaagacc 17941 ggttgtcgct cttgatgagt ctaatgaaga tgcaaaatat gtacttgata atactgcttt 18001 taatggtcta atttttggcg tacctacctc agtattagga gtatggttag cattgggagt 18061 atatcgtaaa actcagcatg agaggaaggc gattaaggca cagacaaatg accagctgca 18121 atccgccttc tatcgtatga ttcacgaaaa taatgggcgc atcactctta tgagctttgc 18181 aatgcagtta caattgccac cagcaactgc aaagcaatat ttagacgagc aagctaaagt 18241 atttaatgct aattttaaag taagcgaaga agggggagtc tcttaccatt ttgatgttta 18301 aagaacaaat tggctaaata gtgtacccca tgacacgaat ttttttgagt ttagcagccc 18361 ttttcgcggg tttatcagtt gctgcaggcg cgtttggttc ccatgccttg cgggataaaa 18421 tcagcgatcg ctccctagaa atttttgaag tcggcgctcg ttatcagatg tatcatgcct 18481 tagcgctgct agtggtggga ctactcctca gtcgcatcga gtcacctcca gctactatga 18541 tcgcaagtgg atggctgttc atcattggta tcgtcatttt ttcagggagt ttatatgctt 18601 tgagcttaag tggtgttaaa tctttgggag cgatcgctcc attaggagga gcagcatttc 18661 ttgctggttg gggcgcttta gcttttgctg cttggagttt gaaataattc tgattccaca 18721 actagtaccg caatcattac aacgcaaaac tcttgcgatg cgagagtttt gtttgtatca 18781 aaaactgtta tggtacattt gcgctcgccc ccgtggggcg ggtgctctgc cttgtataac 
18841 caggttttac taaactcata aaatgtaact tttcagacat cttctgaaat cagaactaag 18901 tagtacattt ataccaatag ttaagagatc tctttgagat gatatttgga gtggaatcgg 18961 cgacgctatg ctgtatacaa cactacaaag tcttgtgact ttatttggat ttattcctca 19021 tggacattgc tatctctgga agacacaatt agtttcggtt catgtcatat ccgatgcttt 19081 gattgcgctt tcctattatt caattccaat ctcgctagcg tattttatca acaagcgtaa 19141 cgatttcccc ttcatcggca tagtttggct tttcggggca ttcatctttg cttgcggtac 19201 caatcatctg atggaaattt ggacgctttg gtatcccact tattggctat caggatggtt 19261 gaaagttgtc actgctgtga tctcagtata cacagcgctg gcgcttgtgc cattgatccc 19321 aaaagcactt gctctcccca gcccagcaca gctagaggct gtcaaccgtg aactacaaca 19381 acaaattgca gagcgcctcc ttgcagaaga ggctctgcaa aaagctaacg aggagttact 19441 taaaagaagc cagtcgcggc tagagttagc tcaaaaggtt ggaagaattg gaacctttga 19501 atggaacatt caaacaatgg aggcgatgtg gacggaagaa tttgaagctt tgtacggact 19561 cgcccctggt ggttttggca atagatatga caactgggtg caagcagttc atcctggtga 19621 cttggctaga acagaacaag aaattcaacg tgctatcaat gaaagctcag aattagatac 19681 tgagtttcgc attgtttggc aagatgggag tgaacgctgg attgcagcca aagctcaagt 19741 ctttagtgat gacaccggaa agcctttgcg gatgattggt gtcaatatgg atattacgga 19801 acgcaaacga gcagagcaga aaattcgcga acaagctgct ctgttagata taacaaaaga 19861 cgctattgtt gttcattctt taaacgataa caaaattcta tattggaata aaggtggcga 19921 gagtctctat ggatggaaag cagaacaggc attaggtcag gatgccaata ttttattata 19981 tgatgnnnnn nnnnngaaga tgctaagaaa acagttacaa tgacaggtga atggcatgga 20041 gaattgcatc acttcacaca gtccaacaag aaaattattg tctcaagtcg gtggactctc 20101 atgtgtgact ggtcagagga atcaacatca attctgactg taaatactga tatcacagag 20161 aagaaagaac ttgaagcgca atttctgcgc gcacagcgtt tggagagtat tggtacactt 20221 gcgagtggta ttgcccatga tttaaataat gtgctaacac caattattgc ttctgctcaa 20281 cttcttctac atgtagaatt ctctgaacag aagaggcaga gatatttgac aatagtagaa 20341 tcaagcgcaa aacgggcggc tgctttagtc aagcaagttc tacagtttgc acgaggattt 20401 gaaagtcagc gcactattgt tcaacttgag catttacttc aggaatttaa gcaagtgact 20461 cttgagacat ttcctaaatc catcgaaatt tgcatgaaca 
tagcacccgc cctttggaca 20521 gttttaggag atgttactca actgcaacag attttcatga atttttgcgt taatgcccgc 20581 gatgcaatgc caaatggcgg taccctgaat atctctgcag agaatattct cattgatgaa 20641 aattttgtca ggatgaatcc agatgccaaa attggttcct acattgtagt tactgtttgt 20701 gatacgggaa ttggcatttc tccagaaata atagatagaa tttttgagcc atttttcacg 20761 actaaggaag tgagtagagg tacaggatta ggtctttcaa cagtgtttgg tatcaccaaa 20821 aaccacggcg ggtttattaa agtgtatagc gaagttggac aaggaaccga atttaaggtg 20881 tacttaccag cactggaagg aagcgtttat cagccagtag aagatattga actatttaca 20941 ggaaatggag aattaattct gtttgtagat gacgaacttg ctattcaaga gattagcaag 21001 actttacttt caacacataa ctataaggtt atgactgcta gtgatggaat tgaggcgatc 21061 gcactctacg tcgaacatca acaacaaatt caagcagtgg tgacagatat aatgatgcca 21121 tctttggatg gtatgaatac cattcgtgct ttgcagaaaa ttaacccatc tgtaaaaatc 21181 gttgccatta gcggactgtc atcaaacaaa aggatggccc aaatgtctgg gattggtgtg 21241 aaagcgtttt tgtctaagcc ttacacaatg caggaattac tcaaaacctt acactcggta 21301 ttgaacgtta aatagtcata ctactccttt tcctcgttcc ctggctctga acggggatgc 21361 ataactaaag gctctctcta cctccaatta tattgaggca gaacctcctt tcgctgacat 21421 cttgcacctg gtcaaacaaa tgttaaattc attacatatt atttgacgca gtgttctcgt 21481 ccgcctggga ctataagtcg cgccgcttat agccgaagtc cattttaatg gactagtagt 21541 ccttaatcgt taaaacccgt gtttttgttt tctgtaaaac gcttattcaa cgcaagttcc 21601 atgaggaatg aaaatacggt ttttagaaat tatgtcctac tagctaaaga tcatcaggaa 21661 gacttggaaa agctgcaaaa agcacactaa atactcaaaa gcaccgccgg attttatcgc 21721 cagtcagata accaattagc taaacatgta gaagccgcta ttaaaactcg tgaaagtagt 21781 gttgaaactt gcaaagaaaa aattggtgag gttaagtctg atatagagtc taagttagag 21841 aatctaaagc gaagtattaa gaaagcagaa gaacgtgtta ctcagatgga agaggtagta 21901 gaatctctaa acgggaatga taaaaaagaa attcttcgta attatatcca gcaaaataaa 21961 gaaaaaattg aagatgctaa agaagataag tgtatttcat tccaagaaat aatcgcaatt 22021 gcatctgggt tgctaactat tggagatatt gcggcaaatg gtattggtga gttaattagc 22081 actttgcaaa acacaggact attgcgatga caggtgtacg tagttaactg atatagtcgc 22141 aagcaatgta gttcaaacag 
tagaattgct cagcatagac aggagatacg ggtgcaaatc 22201 ccgttgttgc tatcagattg taaagttcta tttcttaaca aaaaataaac cgtatttata 22261 ctgtactcat gttgttatta cgatgataag aatatatgag ttgtgtaaaa tttagcatat 22321 tactgacagt atgaacggca aaactctcaa caccgcagcg caaatcctcg aagctttccg 22381 cgcttgtaag aatgcgggag aagagataga tttatttgag tgtttggcaa ctcgccctga 22441 ggtacgcgta gtagtattta tagagatact gcaaaaaatc aaacgagagc ctgttttggc 22501 tcttacgtta caggcgtttg gcaagattac ggatgcggat gtgaagcagc aactcaagca 22561 aagtgatgat ttgctgatca tgctgagtga acaggcgcgt tctggttcga ctgacttgat 22621 tcgctgggct gcggcgacaa cgattgagaa attaggattt gatctcacca tagtgtctca 22681 acatctttct gaagaacccc aaagtattgc tgagaagata atgcagtctc agatgaaaag 22741 atttgccgac gaaaatctga tacaaagtaa tgattatgat gagtttttac gtttctggat 22801 ttatggatat tggtacaaac tcaaggaatt aactttaggt tatgagtttt ggaaattaaa 22861 ggaggaatgg gagtctaaaa gagaaatagg atgggaagat tcacatgatc ctcaagaaat 22921 tcagcgtctc aacaagtttt cagtttgttg ggatgtgatg aatgctctta atttgagggg 22981 attaagcgag gtgaacttgg cactacaaaa agcagaagaa tgtggagata atgcttcaga 23041 aattgatgaa aatgaagtat ttgaaggtat tgcacaggtt ttctcagcta accaattgac 23101 tgaacttagt agtgacagtg atttccaagt cttgatggag actcaatttc actgtttgga 23161 aagtaataat aaaacaactc ggttagttgc agctaaagaa attctgactt tcggaaataa 23221 ttctttagat aaaatcagag atgagcaact acaaatgttc cattcacttg aagcgtttct 23281 agaaatagaa acatgcgaag tgactcgtca gactacctat gagcaattag aaactttatc 23341 cgaaaatata gattttttag aaaaacaact gaagcgaaac aaagtacgta gtgctctttc 23401 tcagatgaag ttggtcattt tagacgaaat gagcaagaga aaaagtaaat ttgctgctgc 23461 taaactctca tgtgaaaatt tacgagaaag tatcaaactt aagattgatc aatatttgaa 23521 aacaattcag tctatcagta cagaaattta cgaccaaata tttgatcaag tacaacttga 23581 cccaatccct gatataactg ttgataatga gcacgcactt agcttgcttg aagattacaa 23641 aaacctattg gcaaagaagt tgtccatagc ttcttcggtt cataaacaag ttttgacgtt 23701 ttattccaaa ctagaaaaat gtcaaactaa tctggaaaaa atccagaata atttaaattt 23761 aattaaatca atcaattata gagtttatgg atatttagct ttaaaagatg aattgattga 23821 
gctgaaagtg ccaaatagca taaattatca taactacaca caagtgactt tacttgatca 23881 atactctaat tacttaaaaa acaagttaaa gaaacttaga gatgaagtcg tctctcgtta 23941 tgaaaaatta attccaagcc aattatcacc tcttgacaca gaaataaaaa aagcaaagaa 24001 aaaaatggag agagctaaaa atataggaga tgccattaat aatttttctc ggatattatg 24061 catatatatc atttctctta tcatgctacc gtggacgctg gtcatgatga ttcctgttgg 24121 gttgctgata ggaattgtga agttgttatt tggtgatgca ggagagaatc tgactcgtat 24181 tttgggttta attgctgctc ctggtatgtt attttatctg ggaattgaaa aaacaatata 24241 tacactcatt caacagtatg aaaaaagatt agagtctcaa aggatattga ggtatgaaaa 24301 actggaaagt ttgaaagatg aggaaaggaa gatactaaat ttactgtcta tttgaatatc 24361 caagaatggg atgtttttta cgagagtttt ttgtaaggtg ggcaataccc accctacttt 24421 tgactcaacc aattggacga tttcctggat ttacagcatg aatacactca cttcctgagg 24481 ttggaaaccc tcgtttataa caggcttcct ctggtttccc ccatcctaac tgtaaaatta 24541 cttccaaaaa attagtgttt tcacatttgg acaaacttgt ccattctgtt ctttgaccaa 24601 ttctatctac atcatttttt tctatctgtc gatgctgttt gccatgacta aaacttaagt 24661 acaagtccat ccctacagtg acttcattcc aaaattccag tatagaattt tttgctgcta 24721 ttaatatttc tatgtcactg ggtaaagcaa ttaattgagc atcgacttct ggatagcgta 24781 gaattttccc tttgacagtc acctcacgcc agacaatacc tgaaacttga tgttcgtttt 24841 ggtattcgtg aatggtggct ttaaaatctt gatatgcggt aaactgttgt agcccatcgg 24901 ttctgataaa ctggtctatg acaggattgc cagagacggt gactgctaac ttcaggcggt 24961 ttcccttgag gcaagcttgg cgttcagaac gtaaaatttg aattaactgc tcggtggtgt 25021 aaacctttga catcagagca cactctataa gtcattgtca ttggtcaata gtcattattt 25081 cttcagttac ttgttcttga gttcctaact cctcagttct caacaatttt taaagattat 25141 tcgtatacat tatcccttag aataaacagc tttgcagtat ctaataacac gaaaaaacag 25201 ggcagacaaa ttttgtctgc ctcacaactc acctcgttaa ctcgctgttt aactcattaa 25261 atgctcggcg cttcaatggt actgattctg ttctgggtaa taaatcaaac acttgtctaa 25321 atttgtcgtc tattttctca aaggggactt gacccttttg attcaaaacg acctcaccgg 25381 actgattaaa tactacaact tggggaacaa ctccagaata gtaataacct ggttctgtgg 25441 agttataagt ttttccatct aaaatgctat caacattaac tggaataatt 
tctgctactc 25501 gaccgtaata ttcttgtatc gttgaaacaa aaatggcata tttcttacaa tcgctgctgt 25561 catccaaata gaatgccaat agtgttggtt tatgctctgc taaagctttt gcgagattca 25621 gtttaggagg aacgagtgaa ccattccccg cataaaccac aaaaatattc ccatcgtatc 25681 tgtcatcatc aataccagca gaggctgatt gtccattgct gaataacaag caagtcagca 25741 gcaggaataa ggaaaacaac cgccgccagt ttggaatgag attcagtaaa aagcgccact 25801 ttatgccatt cattaaaagg aacctttcaa cgtttatatt ttttcttgtt tgtgctgaat 25861 gtagcaaact ggggtagagg aatagggtat agggtttaga gtacaggatt tagagagaca 25921 gtagggaata aaatacgggg tttataaaaa aattagtctg tcacgtatcc agttgtgacc 25981 agtagaagaa ttaaatgacc aatgattcct tgatggtaca agcccctgga ttgatccgtg 26041 gggtcaattc cattatccat aagttaagat ccaaaattgt ttgacctgct gaatcgaggc 26101 tgtatctgac ttaaagatga ccttgtgtgc catcctgaag aaaaacaagt gcgagaaatc 26161 tgccgcaatc ggtagatggg cacacaaggg tagagaaaga gggatcgcgg ttacgttaga 26221 gagtgggttc aaggcgatga caacaactaa cgataaaagc aaacaacagc aagttcgccg 26281 aatcatagaa ctcggacaca ttaaaatact agagtagctg tctagatctg attgggcaaa 26341 gctcagcaac ctataaggcg atcgccctcc actatataat ttgcactgat tgagtgttaa 26401 tttcttttaa attagttaat tttacatcat tagtaaacaa agttgcattt agagattgtg 26461 cagttgctgc aattattgcg tcaggaagtt ttagtttgta ttgtctacga agttcgataa 26521 ctatttcttt aatatgactt gagattccaa taactgtaat cttagttaaa aaattacgaa 26581 tctgtacttc ttcatctgaa ctgaggctgg aataagatag caattccatc tcagttatca 26641 ctgaaacaaa atattttcct gatggtaatg gttttactaa ccgaccaccc aagaaataca 26701 agactatatt ggtatccaac aaaattaaag cttgtgtcaa ctccactcat tccttatttg 26761 ctgttgatac tccaatgggt cttgagtgag tttcatgaca cctgcatact tattgatatc 26821 aaactcttct ttctgttgtg cttgataagc ttccaaaatt tgctcaattt tctgccagtc 26881 ttgataatca atcaacactg caactggacg catggcttca tcagttacaa tttttttgtt 26941 aaaagatagc ataatagttt tcttccaatg caaagtttaa ctcttacaga gtatctttaa 27001 gaaaataaat caagataggg ttctcatctt ttacgattct ctatcgttga taacctgctg 27061 aaattcttct atagaaccaa tcgtaatagc gcttgtaaaa aacgtttgca gtacagataa 27121 atcctcaatt ccagaaatca cttccacaac 
tggttcaggt actactccaa accgcgtctg 27181 taggatttta attacatttt ctcgatttgt ttcgagaatt ccttcttctt tcgctagtct 27241 ttcaatacta gttacatacc gcatcttctg tacctcctcg taacttctga cttcttgtat 27301 aaaactgtgc gctaattctt ttggtaatac cattacccag tcaagaaatc gaaataaccc 27361 cagaatatct tctcgactat atccccgttc aaacagtctt cttgttaaac tcaatttcca 27421 atgtaaccga ctttcgggat ttcgatgcgt cgccttggtt ttcagatgtg ccatgactac 27481 tacactgaaa ggattggtac tttccctcac catacggttg ctcactgata acttttcact 27541 gttcactgtt ttaaacttca gcctcaatta cagtcacaat tgcggtagaa attgctgaca 27601 ttgtaatatc agagacaata ccaagcttgc gaataaatct ttgcaaatct actcctctca 27661 attgcaatgc atcgatagca gaagctttag tcaaattgtt tgccatatcc ggctcaattt 27721 tcacatgcca gagattgtct gcaaaatagg gtttccaatc tgtaagaggg gctatcagtt 27781 taataggtaa tttacccaca gcatcggagc taattactac tgcggggcgt atctttttaa 27841 tttctgcccc gatagtagga tcgaaattaa ctaaccaaat gtccccacgc ttgggtgttt 27901 cggcattagt agtcaatgat gtcacctacc attaattcct gccaatcaga atcttgttgg 27961 tagtgcgata ccattgtttc agcttgattt tcaagaattc gtctgcgttc tgctagaggt 28021 aatttcagga aagcttgacg ctgttctaaa gaaatagact gattatcatc gggagtagga 28081 atatcaggct gcggaattat aatcacgatt tctacagttt gtccttctat cagatttgga 28141 ttttgaatct cgattttatt tccttgcgaa actgttgctg ttaaatataa gctagatttc 28201 acttgttttt cctacctttt ttatttattg acatcacttt gagtgggtat catctaaacc 28261 aaatataggc gatcgcctct caatactgtt cagttacagg gcgatagggg aaaaagagga 28321 caaacagaca aggtggacaa ggggtttaca gagaaattag tctgcacata tcacctatca 28381 tttccggaac tggctatttt ggcaagaatt agctaaactc acatgattaa atttgagact 28441 cagggctaca atctagcaac actcaccaag attagggtac tgatgtggga aacaaaattc 28501 aactcataga caggatacga ctggcttttg ctgtgtcagt ggcgaaaggc gtgactttag 28561 cagtgcgatc gctgcgctta ggtgcggcga gtgtcttacc aggttctctt gcgcgtcgca 28621 ttgaaccccg actgttgcag ttattaactc aacaagtcaa aaacggagtc attttaattg 28681 ctggtacaaa tggtaaaaca accacatcac tgcttctgcg tacaatttta gaacgcaaag 28741 ggtatcgtgt tgctcacaac tctacaggcg caaatctaga aaatggcttg atgacagcac 28801 tgctggaaga 
cactaactta gtcggcggac tagatgttga ttacgcgatt ttagaagtcg 28861 atgaaaatat cctaccgaag attctcgcac caattcagcc taagataatc ctgtgtttaa 28921 acctattccg cgatcaactt gataggtacg gagaagttga tacaattagc aagcgttgga 28981 caaaagtcat ttctactctg ccaacagaaa cggttgtcat tcctaatgct gatgacccaa 29041 ccttatctta tcttggtcag caattacccc agcgggtgtt attctttggt ttaaatgaac 29101 cagaaaatta tttagatgaa attcctcacg ctgttgattc tatattttgt cctaattgcg 29161 gacactcgct agattacaaa ggagtttact tgtcccattt aggagatttt cactgtccta 29221 agtgtgggtt tagtagaagt aaacccgcgc ttgatagcag cgaatttcca caaatacttg 29281 taggtttata caacaaatat aatactttgg cagccgttac cgctgcgata gaattaggag 29341 ttgatgaagc aactatcctg gatacgatta acaacttcca agctgcattt ggtcgtgctg 29401 aagaattaga aattaatggt aaacgggtac gaattttgtt atcaaaaaac cctgtgggaa 29461 caaatgaaac aattcgcgtc gtgactcaaa gtattgacaa aacgacactg cttgtgctaa 29521 acgataggac acccgatgga actgatgtat cctggatttg ggatgttgat acagagaaat 29581 tggtagaacg gggagggact ttagttgtga gtggcgatcg cctctacgac atggcgttac 29641 gtctacgcta cagcgaaaaa actcctgaaa gtcactgcaa tttaatagtc gaagaagatt 29701 tacggcaagc aattgcaacc gccttagaac acacaccaga aaatgaaacc ttgcacatat 29761 tacctaccta ttctgccatg ctggaagtgc gagaagtgct gacaggcaga aaaattcttt 29821 aaatgggatc ataggcttgt agtgagcatt taagcgctta ctacgaacct attatcattt 29881 tgaattttga attttgaatt ttgaattgtt tctatgctta tccagcactt tcaaatgctt 29941 gcaaattata acaccattac aaaccgaaaa gtttatgaag tttgttctca actttcggat 30001 gtagaacgca agcagattag acaagctttt ttcaaaagta ttcatgggac gctgaatcat 30061 attatggtgg gcgatcgcat ctggatggga cgctttgaag gcaaacaaat gccatctacc 30121 aatcttgatg ctatcctcta tgaagacttt gatgagttgc gttcagttcg tgtcttagaa 30181 gatgagcgca tagaagcttt catgtctaag ttaaatgagg attttttaac taaaacaatc 30241 agctacgtga ataatcaggg gaaacttcat actgatccgc ctaacttatt gctagcacat 30301 ttttttaacc accaaacaca ccatcgcggt caaattcacg atatgctgag tcaaaccgaa 30361 attgctccac cagttttgga tatgcaccgg gttatccgac cgtagttgaa gaaaacgatg 30421 aattctagag tggaataggc aggcgtcatt aaatataaaa taagacctca cccgaaatcc 
30481 ctctggtgcg atatcggaga gggaaggtgc aggtagggtg aggtttttat atttaatttg 30541 acttataaaa tgcttacaaa aacttgcgtg ttgtctgcgt ctttcctgac atttttatta 30601 tcaatgctat atgatgatgc aataagatag cagtgaagca catagacaaa aaaacagagc 30661 cagcgcaact agtgtgccct ccaggcgctt gacaagcttt aggagagacc cgggaagaaa 30721 actagttgaa tattggctgg ctcactttat ttcagtcgat ttactcggtt ttgtgtatca 30781 accttatttt cacctattgg atcggcaaat acaatgaaga ttatatttaa attggctaac 30841 agtgtttaaa gttttgttcg gagggagtga aacgaggaac gcaaggtatt tcccccttac 30901 gcgtacactt tctcaataac caaacacacc attacggtta aattcacgat atgctaagcc 30961 aaaccgaaat tgcttcacca gttttggtac atgcaccgga tgatcgaccg taattaagga 31021 aaactatgaa ttgtagagac gcttcaaaca gttataaatt atgagttatc aacagcatga 31081 gttaacaatt ggttggctgt atccaaagct gatgagtaca tacggtgata gaggtaatgt 31141 catctgtata gaacgtcggt gtcaatggcg gggatacagc gtcaaagttt tacctttaga 31201 tcaaagcgcg acagcagctg atattcgttc agtagatgtg attgttggtg gtggcgcaca 31261 agatcgccag caagaaatcg tgatgcgcga tttgcaaggt gctaaagcac aagcaatgcg 31321 tgaaaaaatt gaaaatggga ctccaggagt gtttacctgt ggttcacccc agttgttagg 31381 acactattat gaacccgctt ttggacagcg cattgaaggc ttaggcatat tagatctggt 31441 gtccgtacat cctggcgaaa atgtccgtcg ctgtattggt aatttggtca tagaagtcac 31501 agcaactcgt ctggcgcggg acttggaaga gatgatgggt agcaaaccgt atttgattgg 31561 ttttgaaaat catggcggac gcaccaagct aggaaaagta gaagcactag gacgcgtagt 31621 gtacgggttg ggtaacaatg gtgaggatgg gacagaagga gcattttatc agaatgctat 31681 agcaacgtat tctcatggtc ccctgttacc aaagaatccc tttgttgctg actggctgat 31741 tcaaacagcg ctgcggttga agtatcaaca gcctatcact ttgccacaga tggataatac 31801 tttagctttg gaagcacggg aagcgatgtt taagcggttg aaggttagta ttccaagcgt 31861 tacggctgct aaagtttgaa tttgaagttc attgcagggt gcgttagcgc tttttgtaac 31921 gcaccgtttt atcatgacgc gatatttgtg atttcaaaaa actgatttca aaaaacgaat 31981 taatacgaag ttgccaaaga taggattgtt tggaatcgag cttcctactg gactagcttt 32041 caagcaaata atacgagcta agacaacttg gtataagtta cagtttctta ccctaataaa 32101 cgctttaatc gtttgtacac atcatcatca ttacgcttgc 
ctactaaaat gacttcaact 32161 aaatcctggt ctggaaaaaa cctgtaaaca atccgatact caccactatc tactcgatag 32221 taaccctcgt agccagataa cgcttgacta tcaacaggta ttggatctac attcaaagct 32281 aagacttttt tagcaatttg agcggcgatt ttaggctgca aaccattgat aaaatcgaga 32341 acagttgcta agccatcaag tctcgccatc agctaaacgc tccagtgcag atgtaaaagt 32401 ctctgtacca accatttttg attgactaag agcagctttg gcactttcac ccaaaaccat 32461 atcctctaat tctgtcagtc gattaatcaa ttgctggtaa ctctctgccg acataatgac 32521 atggctaggt cgtgattgct ttgttagtaa aactggttca acagtcgcct tgtcaaaaac 32581 ttcaccatgt ttgttgcgag catctgtgag agtgtagatg tgcataataa aagttgtttt 32641 tgactatttt agacattttg actatatttt aacagatgcg tagacaactt accacccagt 32701 agaggtgaat gctgcgatct gcctttggca gcaagctagc gccgataggc gtagccgtgc 32761 cgcaggcata gggcgcgcta cgcaaacgct tatcgccgaa ggcggctcct aaaggagcat 32821 cgcgcagcgt gtcctatccc caacactatc atctatgaaa ccgatacaac cttggcattg 32881 gcgaaagtta cccctggcta aactgagtgg agcacaagtt ttcgaggtat tatttctcaa 32941 aaccagcgat gatgtagaaa gggcacaaag tggttctcgt cagcgtattg caaccctctt 33001 agaaagccca gtgacgccaa attgcaccca tctggctcga tattccattt gtgcgggttc 33061 tcctcgctgt attgagggga aacctcagct atggacaccg cctgtaggag aaattcttcc 33121 ttttctgcgt catctgctta actctcaaac gcaggctgaa gacgcgagta ggaaagcaga 33181 atacattgta cttggtgaac ttccttttac aggtggttgg ctgggatggc taggatatga 33241 tttggcatgg gagattgaag aactacctca actcaaagcc gatccgcttc ctttcccagt 33301 cgcctattgg tatgaaccag aatcatttgc agttgcggat catcaacagc aaattctgtg 33361 gttagctgcg actgactccg ctcaacttga cgttatgcaa agtcaattag agcaagcaga 33421 caaagagagg gaaatccaaa actcaccaaa agcttacaaa acgaatccta tcactcctgt 33481 tttccaaatg tctcaggatg attatgaagc ggcggtgcgg cgagcgaaga aacatattca 33541 ggctggtgat atctttcagg ctaatttgtc cttacgattt gaaacacata ccccttgtga 33601 tagttggcta atttatcgag ctttgcaaca gataaatcct tccccttttg ctagttattg 33661 gcaaactcct tggggagcaa tgatgagctg ttcgcctgag agactggtac agttatcagg 33721 aaggcaagtt caaactcgcc cgattgcagg aacgcgatcg cgcggtgcta cccccactca 33781 agatgatctc ttagcgcaag 
aattaatcag caacaccaaa gaaagagccg aacatatcat 33841 gcttgttgac ttagagcgca atgatatagg cagagtgtgc gagtggggaa ctgtcaaagt 33901 cgatgaactc ctgacaattg aacgctacag tcatgtgatg caccttgtca gcaatgttat 33961 cggtacgtta cacccgaatt atgacgctgt agatttgatt cgagctgtgt ttcctggcgg 34021 aacaattaca ggttgtccta aagttcgttg tatggaaatt attgaagaac tcgaacccgt 34081 aaagcgcaac ttattttacg gttcttgtgg ttatctcgat tggcgaggaa acctcgactt 34141 aaatattttg attcgtacac ttttatatag taatagaagt gattctccac caggggcaat 34201 tgtttgggga caagtcggtg ctggtattgt tgcagatagt aatccagaaa aagaatggta 34261 tgaatctttg cataaagcac aagctcaact taatgccttg aagctagtat ttgatacaac 34321 aaagagctta ttctctcgtt ccctggctct gccagggaat gcataactgt ggactgctgc 34381 ctcaatataa tcactgagag gcagtatcag cataggaacg agagaacctc accctgccct 34441 gtcgggcatc cctctgggaa ctccggctac ggtggaacct caccctgccc tgtcgggcat 34501 ccctctcctt actaaggaga gggaaagatt ttagcgtagc taaaagcgag ggtgaggttt 34561 tgagtgagca gttttgttgt cagattcagt gggggcaata tctcatagct ctcgcttact 34621 ggtaactaat cctttcagag gaacagcgct tgcagcagtt cattaatacc aaaggaaaac 34681 aactacaaga actacccagc aaaccagaaa ccgaaccagt caaagcaaag ctactacaag 34741 aagttgaagc cctcctgact caacaaaaag aactacaaac tcaaatccgc accagtagcc 34801 ctaaatatgc agcactacag tatccccaac ctctaaagtt acctcaaatt cagcaacaac 34861 ttgataaaga taccctcttg ttggaatatt ccttaggtga agaacgcagt tatctttggg 34921 ttgtcactcc caattctctc aatagctatg aacttcctgc acgcgaacag atagaaaaag 34981 cagccaagaa tttcagagac gacttgcaac aaccaacagc agggaactta gcagccaaaa 35041 ctgcaactga acttagtaaa ctcattctcg cacctgtggc tgacaagttg gcacaaaagc 35101 ggttagtcat tgtcgctgat ggtgctttgc aatcgattcc ttttgccgca ttaactgaac 35161 caggaaaatc agctgcttct tccaattatc aaccgttaat tgtcaaccat gaaatagtca 35221 atctaccttc agcatcaacc attgcctttc atagacttga actcaaagga cgcaaaaccg 35281 cacccaaaac cctcgccatc ttagcagatc ccgtattcgg tgtagatgac gatcgcctga 35341 gtggtaaatc aaaggcactt gctgctgaac tagatttgag gagtcaacta caacaatctg 35401 cccttaagca agcagcaaga aatttcaacc gcaacgggtg ggggcgactt ccaggtacag 35461 
gtgaggaagc aaaggcaata ttgaaacttg taccatcatc aaattacctg caagcctttg 35521 cttttgatgc taactacaac tgggcaacta acaaacaact ctcgcaatat cggtttctcc 35581 attttgccac ccacggcttt gcagatccta acagcccaga attatcagga attgttctct 35641 cgcttgtaga taaatctggg aaaccgattg aaggttatct gcgcttgggt gacatcttca 35701 acctcgattt taccgctgat ttagtcgtcc tgagtgcttg cgaaaccgga ttgggcaagg 35761 atgtcaatgg tgaaggatta gtcgggttga caagaggact gatgtatgca ggggctgaaa 35821 gagtggctgt gtctctgtgg caggttagtg atgaggggac atcacaattg atgcaggagt 35881 tttataaaga aatgttacag caaggaaagt cacctactgc tgctttacgt gctgcacaac 35941 tgaagctgtt gcaagactcg aaatggaata aaccttcgta ttgggcggcg ttcaccttgc 36001 agggtgagtg gagataacta ggggtgtagg gggatagggg // LOCUS NODE_736_length_35628_cov_4.716470 35628 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 35628) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 35628) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..35628 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1013..1459 /locus_tag="DP116_06115" CDS 1013..1459 /locus_tag="DP116_06115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455330.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Tellurite resistance protein TerB" /protein_id="PRJNA477356:DP116_06115" /translation="MGLFDKMFGRESQVQEALSQAEAIAAIALAATASDGNLSDEQAR GILSVLSSMKLFRYYSNDEINRMFEKLLNILRWEGINALFHSAKESLPYDLRETAFAI ATDLVLADGVSPQEELEFLNDLSQDLGISGYIAIQIVQVMLVKNRG" gene 1950..3305 /locus_tag="DP116_06120" CDS 1950..3305 /locus_tag="DP116_06120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318436.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase" /protein_id="PRJNA477356:DP116_06120" /translation="MPFSATTSQLIEVLCAKAANLSEAALANFSSGIQTDSRILSRGE VFVALRGEKFDGHDFVPMAIEKGAMAAIVDFDYENCEFPVLQVKDTLEAYQKIGRWWR EQFSIPVIGVTGSVGKTTTKELIAATLATKGTVLKTHGNYNNEIGVPKTLLQLSAEHD FAVIEMAMRGKGQIAELTQIARPTIGVITNVGTAHIELLGSEEAIAEAKCELLAQMPK DGVAILNYDNPLLMETAARAWQGKVLTYGFSGGDIHGNLIDSDTLEVTGMQLPLPLPG RHNASNFLAALAVAKVLGIDWSCLKSGVRVDMPGGRSQRFTLLNDVVILDETYNAAPE AMQAALHLLAETPGKRRIAVLGAMKELGERSHQLHQQVGETVRKLNLDALLVLVDGED ALAIAKSAEGISQMCFATHADLVASLKTFVQEGDRILFKAAHSVGLDRVVNQFRTELA K" gene 3390..3962 /locus_tag="DP116_06125" CDS 3390..3962 /locus_tag="DP116_06125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859416.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_06125" /translation="MMNINLPLETQRLILRDLTKSDWQGVHNYASDPEVVRYLPFGPN TEEDTKSFLQKEIKAQRQQLRQHFTLAMTLKDDKQFIGTCRISITNPEKLEGDIGYCI GKEFWGQGYATEAARKLLNFGFQQLNLHRIFATSDPKNTLSMRILVKIGMRQEGYLRE YEWVKGEWRDSLLYAILEREWIQMQIRDVE" gene 4334..5140 /locus_tag="DP116_06130" CDS 4334..5140 /locus_tag="DP116_06130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06130" /translation="MITMLSDFGDRDIYVGVMKGVISQINPELRVIDLTHQIPPQNIA AARFCLMNAYPYFPDGTVHLAVVDPGVGGRRRAIAVEFANGFLVGPDNGIFSGLLSQN PANAVVELTNPDYWRTPQPSRTFHGRDIFAPVAAHLASGVPLTELGNLIHRAALVQLD IVECILTETGIVGSIQYIDHFGNLVTNIPGSYVQGKTWCVKAGGLTMRGCETYGDVEV GDAIAHTGDSLRAIALVESHGWIEIAINSGNAQSQLHLQLGDTIEVAFES" gene 5296..5949 /locus_tag="DP116_06135" CDS 5296..5949 /locus_tag="DP116_06135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312372.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06135" /translation="MIQQTVSAPEVYQGQFGEFTITEGDRTGVIIYRAGLMVAAVSFA IASALVLLNNNPDFKLLTPLYAFFSLALGVSLLTIHIYMAFLHRVLQVFWVIGSVTSL ILALSSSEPLAITIYTQPLTLLGVGFIFAALTGIYFKEAFCFNRFETKILTPMIPSLL LGHMLGILPVQGEKVILGLWAILFLVFALRKAVQAIPPDIGDKSVFEYLKANHSDKV" gene 5949..6635 /locus_tag="DP116_06140" CDS 5949..6635 /locus_tag="DP116_06140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878298.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YdcF family protein" /protein_id="PRJNA477356:DP116_06140" /translation="MPAKVFYKKQKFPKIRLLKRQQMWTLTLQGWVSLFATAALFFVF TITHIHSFLAVTSPIKADALVVEGWVTDEALQQAFTEFCNGSYRQIFTTGIPVERGFY LAEYKNYAEIAAATLKKLGVPKEKLVVVPTPHVIKDRTHASAVAFRQWLLNSNDQVAS VNLFTNDAHARRSWLIYKQVLAPVKVGVIAANTSNYDPKRWWVSSEGVRTVISEMIAF IYALFVNWKA" gene complement(6878..8941) /locus_tag="DP116_06145" CDS complement(6878..8941) /locus_tag="DP116_06145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314328.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AarF/ABC1/UbiB kinase family protein" /protein_id="PRJNA477356:DP116_06145" /translation="MNAKTTLPTSQFIEDSRVTSNPPELLNTEQVRENRALVQVVNLQ ASEGTQETQVLVTRKTHSEAIAYNPQEISAHYKKRPLQVFRRIITVLTSTVTFAVGLW WDSKRGVVVKNDLRRAVALRELLTKLGPAYIKIGQALSTRPDLVPPIYLEELTRLQDK LPAFPNEIAYRFIEEELGLPPEEIYLELSSEPIAAASLGQVYKGKLKSGEQVAVKVQR PDLRESITIDLYLLRKLAAWAKKTFKRVRSDLVGILDELGDRIFEEMDYIHEGENAER FYQLYGHMKDVYVPKIYWEYTNRRVLTMEWIDGVKLTETEELRNLGINARYLIEVGVQ CSLRQLLEHGFFHADPHPGNLLATFDGQLAYLDFGMMSEIMPQQRYGLIEAIVHVVNR DFDGLAKDYVKLDFLSPETDLTPIIPAFANVFADAQGASVAELNIKSITDDLSALMYE YPFRVPPYYALIIRSLVTLEGIAIYIDPNFKVLSEAYPYVAKRLLTDPAPELRASLQD LLFKENRFRWNRLENLLRNARNSQDYDFNLVTNQALDFLSSERGAFIREKLVDEIVKG LDALTKNVLHNFTYLLRERVGITAVNETPAASVEQQQTLEHIKRILNILQQTRGFDAT QLAPQITQLLFNPGVQRLSQQLANQLAQKAVARLIRQLLASPEVENVQESNLTQSRRL SLPAG" gene 9396..11138 /gene="recN" /locus_tag="DP116_06150" CDS 9396..11138 /gene="recN" /locus_tag="DP116_06150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314327.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA repair protein RecN" /protein_id="PRJNA477356:DP116_06150" /translation="MLSLLQIENFALIDQLELEFGTGLNVLTGETGAGKSIILDAIDA VLGGKVSSRVIRSGTTRAMIEATFSSNSALTAWLSEQEIDLIEDNFIIISREIAATSS NIRSRSRVNGVLVNRALMSGLRERLVEITAQGQTVQVGQSAQVREWLDMYGGDSLMQQ RQLVAANFLEYQKARLSLEKRRTSERDRLQQLDLLTYQVQELTAANLSEPDELEKLLQ ERERLNHVVDLQQMSYKVYQALYQNDAETPAAGDLLGDSEITLSDMVEYDTQLQPLLE MVRDAQATLAEVGRQINTYGENLEADPQRLEEVEERIQELKQICRKYGSTLTEAIAYY QRIQGELAELNNSDQSIESLEQQENLCWEKLTQACQKLTLLRRTTAAVLEAELIRELK PLAMEKVQFQVEIVPTSPTAAGADKITFLFSPNPGEPLQPLTQIASGGEMSRFLLALK TCFSQADENATLIFDEIDVGVSGRVAQSIAEKLHQLGESSQVLCVTHQPLVAAMADRH FRVDKQVITSAEGHKTNNGHPEQRTVVRVTSLDNLKKRREELAQLAGGKSAQEAIAFA ESLLTQAAHHRQKS" gene 11289..12878 /locus_tag="DP116_06155" /pseudo CDS 11289..12878 /locus_tag="DP116_06155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015121550.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(12970..13968) /locus_tag="DP116_06160" CDS complement(12970..13968) /locus_tag="DP116_06160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873892.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphorylase" /protein_id="PRJNA477356:DP116_06160" /translation="MPEEKQTIHEEILLKPGTLWKRVKEQTEYALQCGALLTIPTEFE FIEQDGVRFLVRVLSNLVRKEAVKQKQQTQTSSGQEFNPFLPYEEDLFVAEISQTHVC ILNKFNVTDYHLLLITRAFEEQETLLTLQDFAAMWACLAEFDGLVFYNAGKIAGASQR HKHLQLVPLPLIPNGLQIPVEPLFAFAEFQDCVATIPQLPFVHAFTKLDPLSVQSPLK AAEVTLSQYRTLLHAVGLENSGKQSGAYNLLATREWMLIVPRSAESSSASPDRRSRAA GIGGATQTLSHESFESIAVNSLGFAGSLFVRNEQQMQILKDVGPMTLLQKVAVATK" gene complement(14000..14896) /locus_tag="DP116_06165" CDS complement(14000..14896) /locus_tag="DP116_06165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878101.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_06165" /translation="MLQFQPPGFGQKVVHTSVGFMTYYTQTTAPWLIAERENLPPLVF LHNFGGGACAYEWSKVYPAFAVTHEIIAPDLIGWGESAHPVRDYQINDYLTSIAEFIR HTCRPPVTVIASSLTAGLIIRLAITHPFLFQALFLVCPSGFDDFGQGVGRRLPLGLIN TPLLDDFIYTIGAKNELAVRNFLQSFLFAKPERVSQEMVQAFLASAKQPNAKFAALAF LQGNLYFDLSLYIQQLTIPTVILWGEEAQFTSVQLGQRLRKLNTNAIRDFHTIPDAGV LPHLERPEVVIGLLQRFLKINT" gene 15259..16968 /locus_tag="DP116_06170" CDS 15259..16968 /locus_tag="DP116_06170" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06170" /translation="MDLSFPGSTTENQSPLFAPLTPVFSDNVLDVLGVSTFHKSPSAI FTVNSTADVVDDSDGVTTLREAINQANADDGQDLIVFERSLFSNAQTITLSLGELDIT HNLDIIAPRDLLTGGNLVTVSGNNASRVFEIETGASVNLSGLIIADGSVTGDNGAGIK NSGNLTLDNSIVRNNSAFSILVPAYKSYTLSAGLGGGIYNYRGNLEVNNSTIIGNSAR DGGGIYNELGISTVNNSTINGNSAKYNGGGITNYGTGTVNNSTITGNSAGASGGGIRT SNSIVMFSKTGEQLNRTSMVVSNSTITGNSAKASDGGGICNQFGDTTLTVSNSTINGN SAGGKGGGIYNDNDFGSGNSNSNNTVSNSTINGNSAGDKGGGIYNGGPGTLEKDNTSI TVNNSTVSGNTAGNNGGGIYNNHALTLLFSTITLNQAADGGGVFNSLYPTPYTTTIPT GVATVHNTIIAANTPTAKGVNRDVAGPFTSNGYNLIGDSTGSTGFGSTGDIVGTSDNP IDPRLAVLDFNGGSTATHALFPDSPAIDAADPTVLDTDPTTDQRGKPRVSSSTDIGAF EFA" gene 17271..18752 /locus_tag="DP116_06175" CDS 17271..18752 /locus_tag="DP116_06175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878104.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH:ubiquinone oxidoreductase" /protein_id="PRJNA477356:DP116_06175" /translation="MTNNNRSQWDLGRFIQTLTYFEVIPFVNWVQDLIQGRPNSSQNI PDGAKQVGVILVAGATGGVGQRVVKRLLEQGYKVRALVRDIDKARSILSDKVELVVAD ITKPETLTPLVMANIQAVICCTAVRVQPVEGDTPERAKYYQGIKFYQPEIVGDTPENV EYQGVKNLVEAAAKYLPKAGEKLLFDFTKPSTELKNTWGAVDDVVMGGVSQSQIQLVE ETALFAGNVSTANSGGFASVRTKNFAPPFNLSGYEGVELRLRGDGKRYKFLLRTETQW DGVAYSYSFNTEANTWIDVRIPFAQMIAVFRAKSLKDSPQIDQSKICSFQLMLSKFEY DGELNPQFSPGGFALEVESMKAYGGVNLPQFVLVSSAGVTRPGRPGINLDEEPPAVKL NDQLGGILTWKLKGEDSLRESGIPYTIIRPCALTEEAGGKEFILEQGDNIRGKISRED VAKLCVEALQQTKASNVTFEVKQGENTASYIDWQRLFSQLQPN" gene 18817..19167 /locus_tag="DP116_06180" CDS 18817..19167 /locus_tag="DP116_06180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872230.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06180" /translation="MKRIGLMALALFLPIVLWGTVFSNTALSQQIESRFYNLEADFNR LESRVNRIEAQLNQSGRSSPSGSATITPSPRSGRTVSPQEREKMFDRLATLVVELKQQ VNALEGRVAKLEKR" gene 19428..19676 /locus_tag="DP116_06185" CDS 19428..19676 /locus_tag="DP116_06185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012627420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06185" /translation="MSTQTQPILKKGSQGPEVTRLQKLLNQADRKKNFGNPPPLKEDG DFGGNTETAVKNFQKFYGLTIDGVVGSKTWAKLTEVAS" gene 19707..21155 /locus_tag="DP116_06190" CDS 19707..21155 /locus_tag="DP116_06190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015121752.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PLP-dependent aminotransferase family protein" /protein_id="PRJNA477356:DP116_06190" /translation="MTPVCFLPQGDRLPSIRCLAESLQVNKLTIIEAYNVLEADGVVC ARQGSGYFVNSVSVPSANLKSTFAPAQNVIIPKQGGSCFFDMYTAGVHAQSQPGIINF SLGFPHPPKDIDLIARRALKQAPDSLFQYDLPQGQLTLRRQIAQMLIQQGMEISADNL IITNGSEQGLSLALQHHIQPGDWVIVESPTYFGAITFGRASLNAILEKLKAKIIGIPM TAEGMNLELLEQYLKSHRPKLIYTISTLHNPTGITTTQAHRQELLSLAEKYECPILED NAYEGLCFETVPPPIKAFDKQDLVTYVNTFSKTLMPGLRVGYMVVTGKHYQEILEQKL LHDFHTSTISQAIVSEYLASGHLRRHLKQMRAELLQSRNLMLQALECYFPEEARWTVP NGGLFLWVQLPENIPTKTIRIEALSQNVLVACGSAFFPDKKGYPAMRLSYCLTPEEIE KGISILGKLLKKYLYKGCERISNNHQTLVHSI" gene 21311..21508 /locus_tag="DP116_06195" CDS 21311..21508 /locus_tag="DP116_06195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317663.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06195" /translation="MSYPENEKSQEQSTDASGGYQTTIDEETRKTGAATEVGAKESAK PESGTSDQFVEGAPNQGTEKR" gene complement(21627..23201) /locus_tag="DP116_06200" CDS complement(21627..23201) /locus_tag="DP116_06200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316471.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="PRJNA477356:DP116_06200" /translation="MIIDDQHYDLIIVGTGAGGGTLLHKLAPTGKKILVLERGNFLPK ENDNWNAGEVFGKGRYHTREQWYDISGEPFRPQTNYWVGGNTKVYGAALLRMREKDFE KVQHQDGISPEWSLKYQDFEPYYTEAEKLYFVHGKQGDDPTEPHRSEDYPYPPISHEP RMQQICDAISNQGLHPGYLPLGLRLNESDVNESLCVRCKTCDGFPCKIDGKADAEVSG VVPALEFPNVTLKTDAKVVCLHTSPSGREVQAVEAEIGGQSYLFFGDIVVLACGAVNS AALLLRSANEKHPTGLANSSDQVGRNFMKHLLSAVVQLTATPNPAVFQKTICVHDFYW GDSDFEYPMGHIQNTGNILQDMIPAEAPPLLSLLSRLIPGFGLQQLATRTIGWWLQTE DLPDPNNRVRVQGSKLHLDYTPNNFEAHDRLIYHWTEVLKAIDKTTKSAVFPLSIYPY SNTPIRVVAHQSGTCRFGEDPTTSVLDLNCRTHDVDNLYIVDSSFFPSSSGVSPALTI MANALRVGEHLISRLN" gene complement(23285..23887) /locus_tag="DP116_06205" CDS complement(23285..23887) /locus_tag="DP116_06205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="heme-copper oxidase subunit III" /protein_id="PRJNA477356:DP116_06205" /translation="MDSSIIPEQIQEPSHEHTHDEEGNKMFGFIVFLLSESVIFLSFF AGYIVYKTTTPDWLPPGVSGLEVKDPAINTVVLVASSFVIYLAERALVRHNLNQFRLF LLTTMAMGTYFLVGQAIEWNHLAFGFTSGVFGGTFYLLTGFHGLHVLTGIILQMIIFA RSFIPGNYDSSHFGVNATSLFWHFVDVIWIILFVLIYIWQ" gene complement(24165..25854) /gene="ctaD" /locus_tag="DP116_06210" /pseudo CDS complement(24165..25854) /gene="ctaD" /locus_tag="DP116_06210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139982.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="cytochrome c oxidase subunit I" assembly_gap 25491..25500 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(25890..26795) /locus_tag="DP116_06215" CDS complement(25890..26795) /locus_tag="DP116_06215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c oxidase subunit II" /protein_id="PRJNA477356:DP116_06215" /translation="MKIWNVLRLIVSAIALTLISLWVGQQAYSWLPPQAAAESQLIDD LFSFLVTLGAFIFVGVTATIIYSVTFHRAGRYDFKDGPPIEGNITLEVVWTAIPILVV LWIAGYSYQVYEQMAIRGPMEIVHLHTPMGMESAYAAPVDSSTEPVENIEVDAKQWAW VFRYPNQGITSTELHLPSDRRVRLALQSEDVIHGFYIPAFRLKQDIIPKRTIDFEFTP IRVGKYQLTDSQFSGTYFATMQANVVVESPEDYDKWLAEVATQKPSPAPNQAFAEYTQ ETKGLIKTGWATVEPAQPPLVNYSN" gene complement(26920..27525) /locus_tag="DP116_06220" CDS complement(26920..27525) /locus_tag="DP116_06220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319234.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06220" /translation="MNAEIIDQLKTHVGVNGLPYAIPIHPNLVHLTLGLFILGILFDF VGVLFPLQTLVFKFLAIKAVRSNFFDVGWYNILGSAIISFFTVTAGFYEIMLANPPSD MKSAWGLQAMETMLWHGVGGVLLLALIVGMTVWRGLQRYVWCKDESQQVQWSYLFSGF AIMFLMYLHGTLGAQLAAEFGVHNTADMLLRLGQNPNTLLK" gene complement(27522..28022) /locus_tag="DP116_06225" CDS complement(27522..28022) /locus_tag="DP116_06225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009543815.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06225" /translation="MFEYLPALNDHYLPYPDTIHPIVVHFVIAMVLFAFVCDVIGYFT GKYRLFEVSWWNMFFATISIFIAIIFGQFEAGLAEPYDAVESVLNFHTLLGWSLSGIL ASITAWRYVIRIRTPNKIPFSYLTLGLILTLLVGVQVYLGDKLVWVYGLHTVPVVEAL KEGLLQ" gene 28842..29438 /locus_tag="DP116_06230" CDS 28842..29438 /locus_tag="DP116_06230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase" /protein_id="PRJNA477356:DP116_06230" /translation="MIKLYQVELSGNCYKVRLMLSLLEIKHEQVLVDLPSGEHKSLQF LKLNPFGQIPVLVDGDVVVRDSHAILVYLARRYGDENWLPTEAEPISKVMRWLFTAAN EIRQGPEFARRYHLFQIPLDVQLATERAYAILKILDEHLTGRQWLELNRPTIADVACF PYIALAPDGKVSLDAYPNVIAWIKRMKQLPGYVGMPGL" gene 29464..30363 /locus_tag="DP116_06235" CDS 29464..30363 /locus_tag="DP116_06235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874795.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyridoxamine 5-phosphate oxidase" /protein_id="PRJNA477356:DP116_06235" /translation="MTFHSGEIAVQTQAGVRDEAQRLCTVVSNIIKPAAQEFLGSQNL AVAGTVDVNGRVWASLLTGQSGFVQVLNEQTVQIDANLIPDVVLKQNLYSNSQIGLLV IDLANRRRLRLNGKAEIQPEGKIIVQIQQAFFNCPKYIQIRHIEKGVIEALGKPEIFT TEALNETINTLITTADTFFIASSHPDFGADASHRGGYPGFIQVVNSNKLVFPDYTGNN MFQTFGNLVVNPHAGLLFIDFEHGHTLQLTGKAEVIWNANKLSTFAGAQRLVEFDVEQ VLETRNASLLRWRFGEYSPVNPR" gene 30648..30854 /locus_tag="DP116_06240" CDS 30648..30854 /locus_tag="DP116_06240" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06240" /translation="MTEYFAFTLWLTMHIFIEIVVLKAIIRNFLYLKCLDGVRIALKK FCADTMNLDNFFCQIRLIEVVIST" gene 31248..31913 /locus_tag="DP116_06245" /pseudo CDS 31248..31913 /locus_tag="DP116_06245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749171.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(32239..32943) /locus_tag="DP116_06250" CDS complement(32239..32943) /locus_tag="DP116_06250" /EC_number="2.1.1.130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319616.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="precorrin-2 C(20)-methyltransferase" /protein_id="PRJNA477356:DP116_06250" /translation="MMNKGRLYGVGVGPGDPELLTLKALRLLRSCPVVVYQSADDKQS IARGIVAQYLPGNQIEVQYHLPRALEPYAAQPIYDQVVIPIKEHLAAGRDVVVLCEGD PLFYGSFMYVFTRLSDHFETEVVPGVSSPMGCASALAVPLSYRNDVFSVLPAPLPAEV LEAQLLNADAAVIIKLRRHFTKVHDVLHKLGLLSRARYIERGTMANQRIVSLDDVDPA QVPYFSMILVPSKSQF" gene complement(32940..33569) /locus_tag="DP116_06255" CDS complement(32940..33569) /locus_tag="DP116_06255" /EC_number="5.4.99.61" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="precorrin-8X methylmutase" /protein_id="PRJNA477356:DP116_06255" /translation="MTINYIKNGNEIYSKSFAMIRSEANLSVLAPDVANVAVRLIHAC GMTDIVYDLAASPTAVESGRTALAAGAPILCDCRMVAEGITRRRLPADNSVICTLNHP DVPELAQKLETTRSAAALELWLPVLEGAVVAVGNAPTVLFRLLEMLSAGAPKPSLILG FPVGFVGAAESKAALAADSFGVPFMTLHGRRGGSAIAAAAVNALATEEE" gene complement(33680..35233) /gene="cobG" /locus_tag="DP116_06260" CDS complement(33680..35233) /gene="cobG" /locus_tag="DP116_06260" /EC_number="1.14.13.83" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="precorrin-3B synthase" /protein_id="PRJNA477356:DP116_06260" /translation="MSGCPGLFYQTSARDGILSRIRIPGGILTSQQFHIIADLADEFG EGYTNITNRANLQIRAIRTEIPSNVLRVLQKIGVASAIPEVDHLRNIMGSPTAGIDLY ELIDTRPLICELDHYITTHAELAPLSPKFSVAFDGGGCVSIGEAPKGSRARRERPNDI VLSAVMVDNNVYFRLKLNLGVSLFEPFEPFIDTDIRLKPEECVEVVAALAQVYLQHLT LETGNTIPHRKSRQPRFWEILNNLGIERFLQEVERHLPYPLQRFSVKSSHTQIKEEYR DIGCNVYSKYQHIGVHPQRHKGLSYIGVVVPVGRLNTLQIRGLADIAETYASGTLRLT PWQNLLISDIPQQWIPKIQSKIENLGLHWSVTNIRSALVACAGNTGCASSATDTKNHA LALAQYLDDRVILDQPVNIHFTGCEKSCAQHSRSDIALVGFNIEDKGEVVEGYKVCVG DGDSHEKFGRELYGWVSFTELPSLLERMLQIYMAERDTSNPSFGEFVNQYVIADLQQL FFERRVVSG" regulatory complement(35282..35430) /regulatory_class="riboswitch" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00174" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="cobalamin riboswitch; Derived by automated computational analysis using gene prediction method: cmsearch." /bound_moiety="adenosylcobalamin" /db_xref="RFAM:RF00174" BASE COUNT 10588 a 7550 c 7764 g 9716 t 10 others ORIGIN 1 caatactttt atgcccagtc aaagtagaaa tttcttttcc tgtggcgaca ttccacaatt 61 tgatgattgc gagtgatatc acgtccggct gatgacttac gatatgtccg cagcgagtgg 121 aacgtagacg ccgaagaaag ctgcgtatgc ccagagggca cgctacgcgc ttcgctgcgg 181 acaattattc aaacggactt aatgtcatat ctgtggcttc ggcgatcgcc ttctcactaa 241 attatctaaa aatgtcttct tgatatttct gtgatgtttg tagtaacaaa agccaatttg 301 tcaagatttc taattgaggc agttgcatac tctattctta ctagtaagca tgtaactaag 361 tttacttatt tcatgaatga attctgtaga cttattgaaa cttcttttct attagttatt 421 caccttagta attggtttaa caaaacagca ctgaaagttt atctgaactt caagaagaaa 481 aaatctaagt ttcgttatga aagagctaaa gttttgttaa gatgcaaatg ctagtgttct 541 atagacggaa aatgccaatc acggtgtgga gaaataaata ggtactagca ctacatgagt 601 tcgggtagac aaaacaatga ctagttatat ttttttcgat gacgccctcc aaatgcttgt 661 aaacgccttc ttggcttcga tactgctact tatggtcata tcttgcttag gtttggcagt 721 tatcgaaaca atggaaaaaa caggtattcg cctagacacc aaacgacttc actaagctga 781 gtctttgtgt agaggctatc agcacagggg 
ctttggtaag gtaaggttcc gctcatttag 841 agtcaaatag agtcaaaaat ttattctttg tcgaaaacca ttgagttcaa gtgttcgcta 901 gcaagaaatc ttgggaacac tgaaattata gaagtctcac agcgttgcac aatgacgaac 961 cagtacgagt gagtgtatcg cgattgagac gctagcaatg gagtaaagtg taatgggtct 1021 attcgacaaa atgtttggta gagaaagcca agttcaagaa gcactcagtc aagcggaagc 1081 aattgctgca attgctttgg ctgcaacagc ctcagatgga aacctctctg atgagcaggc 1141 acgtggtatt ttgtcagtgc tgtcgagtat gaagcttttc agatattatt ccaacgatga 1201 aataaacagg atgtttgaga aactcctgaa tattctcagg tgggaaggca taaatgcttt 1261 gtttcattca gccaaagaat ccttacccta cgacttgcga gaaacagctt ttgcgatcgc 1321 caccgatttg gttttagccg atggagtctc tcctcaagaa gagcttgagt ttttaaacga 1381 tttgtctcaa gatttgggaa tctctggtta tatcgctata caaattgtgc aagtgatgtt 1441 agttaaaaat cggggatagc tcaccccacg ggaaagccct tacgggttcg gcacttcctt 1501 caagtcgggg aacccggacg ccagaacaag cgtgagggag accctcatca agtactggct 1561 ccccaacgga gtgcctcacc agttgcctgc tttgggaaat cctcaaagca gcacgcttac 1621 ttagcatata ctttttcact gttttaaagt tgttttgatt ctaaatttca taaaaatata 1681 taaaaatatt aaatctctta actttatatt gtcagcaaat ttggatgctg tcgggaagga 1741 ggatccccta caccctcaca cctttacccc cgcacccgtt gcgctttcaa gggctgtctt 1801 gttgaattta aacgagaaat tttagattcc aaaagctcta agtgttatgc tgtaaggcat 1861 ttcacttgca tacaactagg gggttagaca acctcctagt actgattttc ttggcgactt 1921 cactctgaat tttgaatctt aaattgctta tgcctttctc tgccaccact agccaactga 1981 ttgaagttct ttgtgccaaa gcagcaaact tatctgaagc tgccttggca aatttcagca 2041 gtggaatcca aacagattcc cgtatactca gcaggggtga agtattcgtg gctttacgcg 2101 gtgaaaagtt tgatggacac gattttgtgc caatggcaat agaaaaaggt gcaatggctg 2161 cgatagttga ttttgattac gagaattgtg aatttcccgt cttgcaagtc aaagacacgc 2221 tcgaagcata tcaaaaaatt ggtaggtggt ggcgtgagca gttttctatt ccagtgattg 2281 gggttacggg ttctgtaggt aaaaccacaa ccaaggaact tattgctgct actttagcaa 2341 caaaaggaac agtccttaag actcatggaa attacaataa cgaaattgga gtgccaaaaa 2401 cgctcttgca attgagcgca gaacatgatt tcgccgttat tgaaatggcg atgcggggaa 2461 aggggcaaat tgctgaactc acacaaatag cgcgtccgac 
tattggtgta attactaatg 2521 tgggaactgc gcacattgag ttactaggtt cagaggaggc aatagccgag gcaaaatgtg 2581 agttactagc ccaaatgcct aaagatggtg tggcaatcct caattatgat aatccgctat 2641 tgatggaaac agcagcgaga gcctggcaag gaaaagtttt gacttatggc ttttctggtg 2701 gcgacattca tgggaattta attgatagtg acacgttgga agtaacagga atgcaattgc 2761 ctttaccatt accaggacga cacaatgcaa gtaacttttt ggcagcttta gcggtggcaa 2821 aggtgctggg tattgattgg tcatgtttga aatcgggtgt gagggtagat atgccggggg 2881 ggcgatcgca acgttttact ttgctcaatg atgtggtaat cttagatgaa acttataatg 2941 ctgcaccaga agccatgcaa gcggcgttac atttattggc agagacacca ggaaaacgac 3001 ggattgctgt gttgggtgca atgaaggaat tgggagagcg atcgcaccag ttacatcagc 3061 aagtaggaga aacagtacga aaattaaatt tagatgcttt attagtattg gtggatggag 3121 aagatgcttt agccatagcc aagagtgctg aaggtatttc tcagatgtgc tttgcaactc 3181 atgcagactt agttgcctct ttaaagactt ttgtacaaga aggtgatagg attcttttta 3241 aagcagcaca ttcagtggga ctagatagag tggtgaatca gttccgtaca gaacttgcca 3301 aatgaatttt gtagttctta gctcatgact aatcgcgaat tgtgaatcgt gcaattagca 3361 agtagcgatt aacaaataat aattactaag tgatgaacat taatctaccg ctagagacac 3421 aacgtcttat tctcagggat ttgacaaaat cagattggca aggagtgcat aactacgcct 3481 ctgatccaga agttgttcgc tatctacctt ttggtcctaa taccgaagaa gatactaagt 3541 cttttttgca aaaagaaatt aaagcgcaac ggcaacaact ccgtcagcat tttactttag 3601 caatgacttt aaaagatgac aaacaattca ttggtacctg tcgcatttcc ataacaaatc 3661 cagaaaagct agaaggtgat attggatact gtataggtaa ggaattttgg ggtcaaggat 3721 atgcaactga agctgcacga aaactcttaa actttggttt ccagcaactc aatttgcatc 3781 ggatttttgc aacgtctgat ccgaaaaata ctctctcaat gcgaattttg gtaaaaatcg 3841 ggatgcgaca ggaggggtat ttgcgagagt atgaatgggt taagggtgag tggcgagatt 3901 cgttactata tgccatcctt gagcgtgaat ggatacagat gcaaattcgc gatgtcgagt 3961 aaataaaaca caaaaacccg ccgaagcggg ctgacacaca acaaaactat gaattagaac 4021 aaaaaaatct tgacttagct aaaatacttc atttcggctt ccagttggcg aactaaactg 4081 tcgtcgcctt tggctttagc tacttctaag cgatgttgta aacttctgat gatattgagt 4141 tgatgtgttt tccgtgcctt ttttacgact tccactctgt ccttaatcat 
tttaatgatt 4201 tacctattgt gtctctataa atgtctacct ttcctaacat aacatatttt tcgtagctta 4261 tgttacagaa aaaataaagt taatatacat taatcaacca gaaacatatt gcatccagat 4321 gcatcctcat ttgatgatca ctatgctcag cgattttggc gatcgcgata tttatgtcgg 4381 cgtgatgaaa ggagtcatct cccaaatcaa cccagaactg agagtcatag acttgacgca 4441 ccaaattccg ccgcaaaaca ttgcagcggc taggttttgc ttgatgaatg cttaccctta 4501 ctttccggat gggacagtgc atttggcagt cgtcgatccg ggtgtaggag gaaggcgacg 4561 ggcgatcgca gtagaatttg ctaatgggtt tctggtagga ccagataacg gaatctttag 4621 cggattattg agtcaaaatc cggcgaacgc agttgtagaa ttgacaaacc ctgattattg 4681 gagaacccct caacccagca ggacttttca cggcagggat atctttgcac cagtagctgc 4741 acaccttgcg agcggagtcc ccttgacaga actggggaat cttattcatc gcgcagcttt 4801 ggtacaattg gacatagtag aatgtatttt aacagaaact ggtattgtag gttctatcca 4861 atatattgac cactttggga acttagtcac caacattcca gggagttacg tccaaggcaa 4921 aacctggtgt gtgaaagcag gtgggttaac aatgagagga tgtgaaactt atggtgatgt 4981 cgaagttggg gatgcgatcg cccatacggg agactcactc agggcgatcg cacttgttga 5041 aagtcacggc tggatagaaa ttgcgatcaa tagcggtaac gcacagtcac agttacacct 5101 ccagttggga gatacaatag aagttgcttt tgaaagctag cggtttttcg tcagccgagt 5161 ttgagcctca gatgaaaatt tctcaaactt cagccaagat aacacgccag caaaggttct 5221 tattatagaa attgtaggta aaagattttt tggttatcac gaccaactca ttcaaatcct 5281 aaatttaaaa ctgctatgat tcagcaaaca gtatctgcac cagaagttta tcaaggtcag 5341 tttggcgaat ttacgatcac tgagggcgat cgcacaggcg taatcatcta ccgcgctggg 5401 ttaatggtag ctgcagtgag ttttgccata gctagcgctt tggttttgct aaacaataat 5461 ccagatttta aactactcac acctttgtat gcctttttca gtctcgctct tggtgtcagt 5521 ttactaacca ttcacatata catggcattt ctgcaccgag ttttacaagt tttttgggtt 5581 attgggagtg taacctcact gatactggca ttatctagca gtgaaccttt agctataacc 5641 atttacactc aaccactcac cttgctggga gtaggcttta tctttgcagc cttgacaggg 5701 atttacttca aagaagcatt ttgctttaat cgttttgaaa ccaaaatact cacacccatg 5761 attccatcac tattgctggg acatatgctt ggtattttgc cagtacaggg agaaaaggta 5821 atactcggac tgtgggcaat tttgtttcta gtgtttgcct tacgcaaggc agtacaagca 
5881 attcctccgg atattggaga taaatctgtg tttgaatatc tgaaagcaaa tcattcagat 5941 aaggtgtaat gcctgcaaaa gttttttaca aaaagcaaaa attccccaaa attcgcttac 6001 tcaaacggca acaaatgtgg acgcttaccc ttcaaggatg ggtgagtttg ttcgcaactg 6061 ccgctctttt tttcgttttc acgataactc acatacattc atttctggca gtcacttctc 6121 ccatcaaagc ggatgcactg gttgtggaag gatgggtaac agacgaagca cttcaacaag 6181 cgtttaccga attttgcaac ggttcctatc gccaaatttt tacaacagga attccggtgg 6241 aaagagggtt ttatctcgct gaatacaaga attatgcaga aattgctgca gccactctca 6301 aaaaactggg cgtcccaaaa gaaaaactag tcgttgttcc tacacctcac gttattaagg 6361 atcgtactca tgcatctgct gtggcatttc gtcaatggct attaaattca aatgaccagg 6421 tagcatcagt taatctgttt acgaatgacg cccatgcccg tagaagctgg ttgatataca 6481 aacaagtcct tgctcccgtc aaagtgggtg tgattgctgc gaacacatca aattatgatc 6541 caaagagatg gtgggtttca agcgaaggtg tgcgaacagt catttctgaa atgattgctt 6601 ttatttatgc gctgtttgta aattggaaag cttgaagaaa atgtaatttt ataatcactt 6661 tcttttacct acaaaaattg caatactgag aaactggaac cttgaagtca caagagcatc 6721 tgtttttatt tgtgagaatt gaaagctaca gatccccgac aactttggcg aagtcgggga 6781 tcaagtttgc tcacgtaagc gcaagccagg ggcgaacggg caatagccaa aaagcgcttg 6841 ggtgaggttt tgcgcttttc ccctatttga gtaagtttca accagcaggt aaagataacc 6901 ttctagactg agttaagtta ctctcttgaa cgttctccac ttctggtgat gccaacaatt 6961 gccgaatcaa ccttgctaca gctttctgtg ctaactggtt ggcgagttgt tgacttaaac 7021 gctgcactcc tggattgaac aataactgag taatttgagg tgcgagttgt gttgcgtcaa 7081 aaccacgggt ttgttggaga atattcaaaa tacgtttgat atgctccaag gtttgttgtt 7141 gttcgacact cgcagcagga gtttcattga ctgctgttat cccaactctt tcgcgcagta 7201 agtaagtaaa gttatgcaaa acattttttg ttaaagcatc aagtccttta acaatttcat 7261 ccaccagttt ctcgcgaata aaagcgccgc gttcggaaga caaaaagtct agtgcttgat 7321 ttgtgactaa gttaaagtca tagtcttgac tattacgagc attacgtaat aaattttcta 7381 aacgattcca gcgaaatcta ttttctttaa acagcaaatc ctgcaaagat gctcttaatt 7441 ccggcgctgg atcggttaac agacgtttag caacataagg ataagcttcg ctgaggactt 7501 tgaagttagg atcaatatag atagcaattc cttccaaagt taccaatgaa cgaataatca 7561 
aagcgtagta tggtggtacg cggaagggat attcatacat taaagctgag agatcatcgg 7621 tgatgctttt gatgtttaac tcagcaacac tggctccttg agcatcagca aatacgttgg 7681 caaaagctgg aataattggt gttaaatctg tctctggaga taagaaatct aacttgacgt 7741 agtcttttgc caagccgtca aaatcgcggt tgacaacatg gacgatcgcc tcaatcaaac 7801 catagcgctg ctggggcata atctcgctca tcatcccaaa gtcaagataa gccaattgcc 7861 cgtcaaatgt cgctaataaa ttacctgggt gaggatcagc atggaaaaat ccatgttcca 7921 gcagctggcg tagcgaacac tgcacaccga cttctatcaa ataacgagca tttataccta 7981 aattccgaag ttcttctgtc tcagttaatt taacaccatc aatccactcc atcgtcaaaa 8041 cacgacgatt ggtgtattcc caataaattt ttggcacata gacatctttc atgtgaccat 8101 aaagctggta gaaacgctcg gcgttttctc cttcgtgaat gtagtccatc tcttcaaaga 8161 tgcgatcgcc taattcatca agaataccaa caaggtcact ccgcacccgt ttaaatgttt 8221 tcttcgccca tgcagcgagt ttgcgtaaaa gatacaaatc aatggtaata ctttctctca 8281 agtctggacg ttggacttta acagcgactt gttcaccaga tttgagctta cctttataaa 8341 cttgacctaa ggaagcagca gcaattggtt cactagaaag ttcaaggtaa atctcctcgg 8401 gaggtaatcc taactcttct tctataaagc gataagcaat ttcattagga aaagctggta 8461 acttgtcttg caatcgagtc agttcttcca aatagatagg aggaaccaaa tccggtcgag 8521 tggataaagc ttgtccaatt ttgatgtaag ctggtcccag tttggtcaac aattctcgta 8581 gcgcgactgc tcgccttagg tcatttttaa caacaactcc ccgcttgcta tcccaccaca 8641 accccacagc aaaggtcaca gttgatgtca aaactgtgat aatgcgtcgg aaaacttgca 8701 gaggtctttt tttgtagtgc gccgaaatct cctggggatt gtaagctatt gcttcagagt 8761 gagtttttct agtgaccagc acttgggttt cttgggttcc ctctgaagct tgtaagttaa 8821 caacttggac aagcgcacga ttttctctta cttgttcagt attcaacagt tccggtgggt 8881 tggaggttac gcgactgtct tctatgaatt gggaagttgg gagagttgtc ttagcattca 8941 tgtagccata ccacagcatc tgagtcgttt tgttaattat tgtaacaaga gcttgtacaa 9001 aaatgtaatt tttattaaga aacggaagat agtgagaaat tcataaactg caacctctca 9061 cttaggatct atttctcaat tcacagaact ctatcgtcag cacaatttac acttcatgaa 9121 ctaaactaaa tttggtacac ttcaattagt cgtactagtc cgatggtatg agatattgct 9181 aactgcttaa atcagtgaac agtgaacagt gaacagtgac cagtgaccag tgaacagtga 9241 acagtgacca 
ctgaccagtg accagtgaac agtgaacagt gaacagtgaa actgatactg 9301 ggcttcctta ataattgcta actgctaacc gttcatttat agtttcccac aaatgaaaag 9361 aatccggcgt ctgccctcat ggagaatttg attgaatgtt gtctctattg caaatagaaa 9421 attttgccct gattgaccaa ttagaactgg aatttgggac tggactcaat gttttgacag 9481 gggaaaccgg agcgggaaag tcgattattt tagatgcgat tgatgctgta ttgggtggta 9541 aagtctccag ccgtgtgatt cgcagtggta caactcgggc aatgatagaa gcgacgttta 9601 gctccaattc agcattaacc gcttggttga gcgaacaaga aatagattta atcgaagata 9661 atttcataat tattagccga gaaattgcag cgacctcaag taatattcgc agtcgatcgc 9721 gcgttaatgg agtgttggtt aatcgagcgt tgatgtccgg actcagagaa cgcttggtgg 9781 aaatcaccgc tcaagggcaa actgtacaag tgggacaatc tgctcaagtc cgcgagtggt 9841 tggatatgta cggtggtgat tccctgatgc agcagcgaca gcttgtcgcc gccaattttt 9901 tagagtatca aaaagcacga ttatcattag aaaaacgccg gacatcggaa cgcgatcgcc 9961 tacaacaact cgacttactc acatatcaag tacaggaact cacagccgct aatctcagtg 10021 aacctgatga attagaaaaa cttttacaag aacgagaacg tctcaatcat gtcgtcgatt 10081 tacaacagat gagttacaaa gtttaccaag ctttgtacca aaatgatgct gaaactccag 10141 cagcaggaga cttgctcgga gacagtgaga taacattgag cgacatggta gaatatgata 10201 cgcaactgca accgctgttg gaaatggtta gagatgctca agcaaccttg gcagaagtag 10261 ggcgacaaat taatacctat ggggaaaatc tggaagcaga tccgcagcgc ttagaggaag 10321 tagaagagcg tattcaggaa ttaaagcaaa tttgtcgtaa gtacggttcg actctgactg 10381 aagcgatcgc ttattaccaa cgcatccaag gagaattggc tgaacttaat aatagcgacc 10441 aatcaatcga aagtttagaa caacaagaaa atctttgttg ggaaaagctg actcaagctt 10501 gtcagaagtt aactctgctg cggcgtacca ctgcggctgt tctagaagca gaactcatca 10561 gggaactcaa acctttggct atggaaaagg tacagtttca agttgagatt gtaccaactt 10621 ccccaaccgc agcgggagca gataaaataa cctttttgtt tagtccgaac ccaggagaac 10681 ccctgcaacc cttaacacaa attgcttccg ggggtgaaat gagtcgtttt ttattggcgc 10741 tcaaaacttg tttttctcaa gctgatgaga atgcgacact gatatttgat gaaattgacg 10801 ttggggtttc tggacgtgtc gcacagtcta ttgcggaaaa attgcatcag ttaggtgaaa 10861 gctctcaagt attatgtgtg acacaccagc ctttggttgc agcaatggca gatcgacatt 10921 
tccgagtcga taagcaagtc attacttccg ctgaaggtca caaaacaaac aacggacatc 10981 cggaacaacg tactgttgtg cgagtcacaa gcctggataa tttaaaaaag cgtcgagaag 11041 aactagcaca gttagctggt gggaaatcgg cgcaggaggc gatcgcattt gctgagtctt 11101 tgttgacgca agcagcgcac caccgacaaa aaagttaatc tgaggcagtg cagttaattg 11161 ctggacgcca agaaccataa gacatgagta ttattttttg cagacacaaa ctgtttgcct 11221 tgcagcgata tatctcccca atctctaaaa gtcgttcatc tttgagtttt tgacgctcgt 11281 tgaccggtac tataccgttt cggacttttc ctaaaggaac tgtcgttaat acaggtttaa 11341 aagtagaaaa gggaaggaat ttccctatag gattcacatc taaggttgac gttacctgtg 11401 ctctttgtca tgcaaccctt tcagataaga gtgaacgcct tgcgggagta cctaatggcg 11461 aacttgctat tcctctgttg gttgccttgt caccaaatac agcagcaggg tttgcaaggt 11521 tgaacttcaa ccctctagat ccacaataca aaggcaatgg taagactatt ttggacagta 11581 aaaataacct tgtagaacta cccgatccaa ataagtttga acaagctttc gacgatgccg 11641 tattagatgt gccttttggt aactttgaaa gttctccaga tagcatcaac aacaccaccc 11701 aaattcccag tgccttcaca tttaagaatc atccttattt agctgacgga cagtttgccg 11761 taggtccgtt tgctggactg agtgctatca acaacgctgt tcattcttca gaaatcaatc 11821 tgttggcagc atctcaactg agtgcagcaa ctcttaacat tgacccagaa gtttatattg 11881 gtacatttct tcagaatgct gtcgatccaa acctacgttt accggaagga aatccagtaa 11941 aaccctcgga gtggctgcgt caggttgcgc cgaatgtgac acaagcagag ttagaagatc 12001 aagtccctgc ccccggtacg ggaagtcctc caaacttgcg acccagcttg gttacataca 12061 acggtttggt tttcagcccg aatactggaa atccggatga tgttgccagt ggaactttcc 12121 tatttgctaa caatgccatg tctgctttcc aaaatagctt ggtaccacct gccaatcgca 12181 ctcctgaaaa caggcaagcg ttgaatacag gttctgttag gcgtggggca aaagtgtttg 12241 cacaagccaa atgcgcaact tgtcacattg cacccttttt cactgacaac aaaattcatc 12301 cgattgagga aattggtaca aattcagctc gtgctaagtc tcgccttaaa ctaaatgact 12361 tgttagtacc acctcaactc tacaccttta acacagccgt tcctataact agtaacgccg 12421 aagtgctaga tgtgccaacg gatggcatct ccgatagtcc cacgactcta cctaaaggta 12481 tattaccgaa tggtggttac aaaacgactt ccttacttgg tttgtctttc agcgcaccat 12541 atttgcacga tggtggtgta gcagtgcggg caggaggtct gagggttaac 
gaaaatggta 12601 gttttactgt tgttgatccc agtggattag ggctaacagg tactctcagc caaggtttac 12661 ctgccgatcc tgctagcagc ttgcgtgcct tggttgatcg tgaccttcgc gccctagttg 12721 tgacagcaaa caaagcttac ccccctctag tgcgtagtaa ccttgatggc acaggtcacg 12781 atttctacgt agataggcaa gctgggttta atccaaacca gcaaacagat ttgattaatt 12841 tcttactggc actagatgac aagcctggta acttttaata ttgacattgg gtgtaactac 12901 agttgtttgg tgcacccaag gtcttatcgg ctgcaatatt gctttgagcc agtgcagatt 12961 gtttagactc tattttgttg cgacggcaac tttctgtagt aatgtcatag gtccaacgtc 13021 tttgagaatc tgcatttgtt gctcgtttcg tacgaaaagc gaacctgcaa atcctaatga 13081 gttgacagca atggattcaa aagattcatg cgatagcgtt tgcgtagcgc cccctatgcc 13141 tgcggcacgg ctacgcctat cggggctagc gctgctgctt tcagcagatc gcggtacaat 13201 caacatccat tcccttgtcg ccagaagatt ataagcacca gattgttttc cactgttttc 13261 taaaccgaca gcgtgtagca gagtgcggta ctgcgagagt gtgacttcag ccgctttcaa 13321 tggagattgt accgaaagag gatctagctt tgtaaaggcg tgtacaaaag gaagttgtgg 13381 tatagtggca acacagtctt gaaattcagc gaatgcgaac agaggttcta ctggtatttg 13441 caacccattg ggtatgagtg gaagtggaac cagttgcaaa tgcttgtgtc gctgactagc 13501 acctgcaatt ttgcccgcat tgtaaaagac taaaccatca aactcagcta gacacgccca 13561 cattgctgca aaatcttgca gagtgagtag agtttcctgt tcctcaaaag cacgagtaat 13621 aagcagcagg tgatagtcag taacgttaaa tttgtttaag atacatacgt gagtttgaga 13681 aatttctgca acaaacaaat cctcctcgta aggaagaaaa ggattaaatt cttgaccgga 13741 ggaagtttgt gtttgttgtt tttgcttaac agcttctttg cgaaccaggt tagataagac 13801 tcgcactaag aaacgaacac catcttgttc tatgaactca aattctgttg gtattgtcag 13861 caacgcccca cattgtagag cgtactcagt ctgttctttg acacgtttcc acaaagtgcc 13921 tggtttcagt aaaatttctt catgtatagt ttgtttttct tctggcatga tagcgcgaat 13981 agagagtgtg aaaaataact tatgtattaa tttttaaaaa ccgttgcaac aaaccaatga 14041 caacctcagg tctttccaaa tggggtaata ctcctgcatc tgggattgta tgaaaatcac 14101 gaattgcatt ggtatttaac tttctcaaac gctgtcctag ctgaacactg gtaaattgtg 14161 cctcctcacc ccacaatatt actgtaggaa tagtcagttg ttgaatatac aaactcaagt 14221 caaaataaag gttaccttgt aaaaatgcca 
aggctgcaaa tttagcatta ggctgttttg 14281 cagaggctaa gaaagcttgt accatttctt gagaaactcg ttctggtttg gcaaacaaaa 14341 aactttgtaa aaagttgcgt accgccagtt catttttagc gccaatagta taaatgaaat 14401 cgtccaagag tggtgtgttg atgagtccaa gtggaagtct gcgtcctaca ccttgcccaa 14461 aatcatcaaa tccagaggga cacaccagaa acagggcttg aaataaaaaa gggtgagtaa 14521 tagcgaggcg aattatcaaa ccagctgtta atgaagacgc aatcaccgtt actggtggac 14581 gacaagtatg tctaataaat tctgcaatac tcgtcaaata atcgttaatt tggtaatctc 14641 gcacaggatg tgcagattca ccccaaccga ttaagtccgg cgctatgatc tcgtgcgtaa 14701 ctgcaaaagc agggtaaact ttagaccatt cataagcaca agccccgcca ccaaagttat 14761 ggagaaaaac taggggaggt aaattttctc tctcagcaat caaccaaggt gcagtcgttt 14821 gggtgtagta agtcataaac ccaacagagg tatgaacgac tttttgtcca aagccagggg 14881 gttgaaactg gagcatagtt tgctaccctc gtgaatatgt gtttctagtg gcaaagttaa 14941 agcgtggtgg cgtataggtc aaggttgctg cctacaaatc aaataattgt ggcgtgtcac 15001 ttttgtactt gccatttttt tgctgtatac tcccaatgct ttaataatat caatattgat 15061 gtcatgagtt attaattttt caggtgtata tacataaatt gggaacctaa gcaatcgtac 15121 tatactggag tttagactac aaaaaagatt ttagcgttta gctctcgaaa ttagtctcaa 15181 gtccagcatc aaaccaagtg cgccaaaagt gagaaatcag tagatggcgc acgcccccaa 15241 tcactattcg ttgaatttat ggatttatca tttccagggt ctactacaga gaaccaatct 15301 ccgctatttg cacctctgac acctgtattc tcggataatg tacttgacgt gcttggtgtt 15361 agcactttcc ataaaagccc tagtgctata tttacggtga acagtacggc agatgtggtg 15421 gacgatagcg atggcgtgac gactctgcgt gaagctatta atcaagctaa tgctgatgat 15481 ggtcaagatt taattgtgtt tgagcgatcg ctcttttcca atgcgcagac gattaccttg 15541 agcttgggag aactagacat cactcataac ttggatatta tcgccccaag ggatttgtta 15601 acaggtggga acttggtgac agtgagtggc aataatgctt cgcgggtatt tgagattgag 15661 acgggagcat cggtgaatct ctctgggttg attatagccg atggcagtgt gacgggtgat 15721 aacggtgctg ggatcaagaa ctctggtaat ctgactctag ataacagcat tgttcgcaat 15781 aactctgctt ttagcatact cgtacccgcg tacaaaagct ataccctctc tgctggtctt 15841 ggcggcggca tctataatta ccgtggcaac cttgaggtga acaacagtac catcatcggt 15901 aactcggcac 
gtgatggcgg cggcatctat aacgaacttg gtattagtac ggtgaacaac 15961 agtaccatca acggtaactc ggcaaagtat aacggcggtg gcatcacaaa ctacggcact 16021 ggtacagtga acaacagcac aatcaccggc aattcagcag gtgccagcgg cggtggcatc 16081 cgcaccagca acagcatagt gatgttcagc aaaaccggtg agcaattgaa cagaaccagc 16141 atggtcgtta gtaacagcac catcaccggt aactcggcaa aggctagtga cggcggtggc 16201 atctgtaacc aatttggtga cactacctta acggtaagta acagcaccat caacggtaac 16261 tctgcaggtg gcaaaggcgg cggcatctat aacgataacg actttggtag tggtaacagt 16321 aacagtaaca atacggtgag caacagtacc atcaacggta actcggcagg tgataaaggc 16381 ggcggcatct ataacggtgg accaggaact cttgagaaag ataacaccag cattacggtg 16441 aacaacagca ccgtcagcgg caataccgca ggtaataacg gcggcggtat ctataacaat 16501 cacgccctga cgctgctgtt cagtactatc accctcaacc aagctgctga tggcggaggc 16561 gtcttcaata gtctttatcc tactccttat actactacta ttcctacagg agtagcaact 16621 gtgcacaaca cgattattgc tgcaaacacc cccactgcca agggtgttaa ccgcgatgtg 16681 gcaggtccct tcacaagcaa tggttacaac ctgattggcg acagcacggg cagtaccgga 16741 tttggatcta caggagacat agtaggcact agtgacaacc caattgaccc ccgcttagct 16801 gtgttggact tcaatggtgg ttctacagca acccatgctc tcttcccaga tagtccagca 16861 attgatgctg ccgatcctac cgtgctggat actgacccta caaccgacca acgcgggaaa 16921 cctcgtgtaa gtagtagtac cgatattggg gcatttgaat ttgcctgatt atggcaattg 16981 agcgctcttt taattcacaa tcttaatatt taaactattg tacaaaaatt cacagtcctc 17041 aaagtatact gaggactgct tttatttaga tgaaaaacag aaacaccctg atatctagcc 17101 tttgactatt taatgactag caacgaccaa agcggggcgg ctttgatccc atcattatag 17161 gaatgggctt ttgcgccgtt gttttgtcaa aaatcagtct atatggtggt gaactgatga 17221 agaatcatcg tcataatgat tctgcaacag aacaagcgaa gatatgaatc gtgactaaca 17281 acaatcgctc tcaatgggat ttaggcagat tcatccaaac cctcacctat tttgaggtca 17341 tcccttttgt aaactgggta caggatttga tccaaggtcg tcctaatagt agccaaaata 17401 tacctgatgg agcaaaacaa gtgggtgtga tactagtagc aggtgcgacg ggtggagttg 17461 gtcagcgagt ggtcaaacga ctgctggaac aaggttataa agtgcgtgca ctcgtgcgag 17521 atatcgacaa agcacggtca attctcagtg acaaggttga attagtcgtt gcagatatta 
17581 ccaaaccaga aactttaact cccctagtta tggctaatat ccaagcggtg atatgctgca 17641 ccgccgtacg cgtgcaacca gtagaaggag atacgccaga acgcgccaaa tactatcagg 17701 gcatcaaatt ttatcaacca gaaattgttg gcgatactcc tgaaaatgta gaataccaag 17761 gtgtgaaaaa cttggtagaa gctgctgcta aatatctgcc caaagcaggg gaaaaactat 17821 tatttgattt caccaaacca tcaacagaat taaagaatac ctggggtgcg gtggatgatg 17881 ttgtgatggg tggcgtgagt caaagtcaaa tccagttggt ggaagaaaca gctttgtttg 17941 ctggtaatgt ctcaactgcg aactcgggag gctttgcttc tgtaagaacg aaaaatttcg 18001 cgcctccctt caatctctct ggttacgaag gtgtagaatt gcgcttaaga ggtgatggta 18061 agcgttataa atttcttttg cgtacagaaa cacaatggga tggtgttgcc tacagttact 18121 cttttaatac agaagctaat acctggatag atgttcgcat tccctttgcc cagatgattg 18181 cggtatttcg tgccaaaagt ctgaaagatt ctccgcaaat tgatcaaagc aaaatttgct 18241 cttttcaact gatgctgagc aagtttgaat atgatggtga gttaaatccc caattttctc 18301 ctggtggttt tgctttggaa gtggaatcaa tgaaagctta tggtggggta aatttaccac 18361 agtttgtctt agtcagttca gcgggtgtga ctcgtcctgg tcgtcctgga atcaatttag 18421 atgaagaacc gccagcagtc aaattaaatg accagttagg aggaatttta acatggaagt 18481 tgaaaggaga agatagttta agagaaagtg gaattcctta cacgattatt agaccatgtg 18541 cgctgactga ggaagcagga ggtaaggagt ttattttaga acaaggtgac aatatcagag 18601 gaaaaatcag ccgcgaggat gtggctaagc tttgtgtgga agcgctacaa caaacgaaag 18661 cgtctaacgt tacgtttgag gtgaaacagg gagaaaatac tgctagttat attgattggc 18721 aaaggttatt ttctcaactg caaccaaact gagggtagag tgtgtgaggt acttcatgta 18781 agcaactttg gataagtttt acggagtagc aaagtcatga aacgtattgg gctgatggca 18841 ttagccctat ttttgcctat tgtactttgg ggcacagtct tttcaaacac cgccttatct 18901 cagcaaatag aatctcgctt ctacaaccta gaagcagatt ttaatcgttt ggagtcgcgg 18961 gttaatcgca ttgaggcgca gttaaatcaa agtgggcgat cctcgccttc tggtagcgca 19021 acgattacac catctccacg ttctggaaga actgtgtcgc cgcaagaacg agagaagatg 19081 tttgatagac ttgcaacttt ggtggtagaa ctcaagcagc aggttaatgc actagaaggg 19141 cgagttgcta aattagagaa acgctagtgc aaggagtcag aagtcagaag tcaggagaga 19201 aaaatactta cacaatcaga gttataagct cttcctaact 
gtctagttat ttctgccgcg 19261 ctgcgctagg ttgttgtagc gttggtactt ggaatggaaa atcagattga tagaccaaat 19321 tctttatgca tgagaactgc ataagttgat tttgctgcca actattaatt ttgtacaggg 19381 gaaggaaaga accttcacct tcaaaacaac tttaatccag gagaattatg tctactcaaa 19441 ctcaacctat tctgaaaaaa ggtagccaag gtccagaagt cacccgtctg caaaagttgc 19501 tgaatcaggc tgaccgcaaa aaaaacttcg gaaatcctcc tcctctaaaa gaggacggag 19561 attttggagg taatactgag actgctgtta agaacttcca aaaattttat ggattaacta 19621 tcgatggagt tgttggttct aagacctggg cgaaattgac cgaagtagca agttagtgtt 19681 ttagctttgc gtcggattta gaagttatta cccccgtctg tttcttacct caaggcgatc 19741 gcctaccctc aatccgctgt ttggcagaga gtttgcaagt taataaactc acaatcattg 19801 aagcttataa cgttctagaa gccgatggtg ttgtgtgtgc gcgtcaaggt tcaggatatt 19861 ttgtcaatag tgtctctgtt ccttctgcta acctaaaatc aacatttgcc cccgcacaaa 19921 atgtcataat tccaaaacaa ggaggaagtt gcttttttga tatgtacacg gctggggtgc 19981 acgcacaatc tcaaccgggg ataattaact ttagtcttgg ttttcctcat ccaccaaaag 20041 atatagattt aattgctaga cgagcgctca aacaagcacc tgatagctta tttcaatacg 20101 atttgcctca aggacaactg actctgcgca ggcaaattgc tcagatgttg attcaacaag 20161 gaatggagat atcggcagat aatttgatta ttaccaatgg ctctgaacaa gggctatcat 20221 tggcattgca acatcacata caaccaggtg actgggtgat tgttgagagt cctacatatt 20281 ttggggcgat taccttcggt agagcttcgc ttaacgccat cctagaaaaa ttaaaagcca 20341 aaattatagg cattcccatg actgcggagg ggatgaacct tgagttatta gagcaatatc 20401 tcaaaagtca tcgcccaaaa ttaatttata ccatcagtac cttacataac cctacaggga 20461 taacgacaac tcaagcacat cgccaagaat tactctcttt agccgaaaaa tacgagtgtc 20521 cgattttaga agataatgct tatgaaggac tatgttttga aacggtacca ccaccaatca 20581 aagctttcga caaacaagat ttggtgactt atgtaaacac tttttctaaa actttgatgc 20641 ctggtttacg agtcggttat atggtagtga caggcaaaca ttatcaagaa atacttgagc 20701 agaaattgct ccatgatttt catacatcca ccatttccca agcaatagtc agcgagtacc 20761 tagcatcagg acatctccgc cgtcacctca agcaaatgcg tgcagaactt cttcaaagcc 20821 gtaatcttat gcttcaagcc ttggagtgtt actttcccga agaagcacgg tggactgttc 20881 caaatggtgg attatttctt 
tgggtgcaat tacctgagaa tattcctact aagacaattc 20941 gcatcgaagc tttatctcaa aatgtcttag ttgcttgtgg ttcggcgttt ttcccagaca 21001 agaaaggtta tccagcaatg cggttgagtt attgtctcac accagaggag atcgaaaaag 21061 gtatttctat attgggtaaa ttgttgaaaa aatatcttta caaagggtgt gagaggatat 21121 ctaataatca tcaaacactt gttcacagca tttagataag ctaatatttt gtattcctgt 21181 ttaacatatc tgagtgataa ttcatccaga agatagatgg agtgattaag caaaactatc 21241 tcttggagag aagaaatgac attcctatac ttcctaaaat gttggcagtg ttaaatacgg 21301 agttttagtt atgtcttatc cagaaaacga aaaatctcaa gaacaaagca ctgacgcttc 21361 cggcggttat caaaccacta ttgacgaaga aacccgcaag accggggctg ctactgaagt 21421 tggtgctaaa gagtctgcaa aaccagaatc tggtaccagc gaccaatttg ttgagggcgc 21481 tcctaatcag ggaaccgaaa agcgttaagt ttcaattcaa ttgaaacttg tcgtattcac 21541 aaaatcacag gtggaattgt gtgaagcagg ctgcactttt tccacctatt ttttgttata 21601 cctgactatt tgatatcaca ctcgcactaa ttcaaccgtg aaatcaagtg ttcgccaacg 21661 cgcaacgcat tagccataat cgtcagcgca ggactcacac cggaactaga cgggaagaaa 21721 ctgctatcta caatgtagag attatctaca tcatgggtgc gacaattgag gtcgagaaca 21781 gatgttgtcg gatcttctcc aaagcgacaa gtcccactct gatgcgccac aactcgaata 21841 ggcgtattgc tataaggata aatgcttaac gggaataccg cactttttgt agttttgtct 21901 atagctttca acacctctgt ccaatgataa atcagacggt catgagcctc gaaattattg 21961 ggcgtgtaat ccagatgcag cttggaacct tgcacccgca ctctattgtt ggggtcaggc 22021 aaatcttctg tttgcaacca ccaacctatt gtacgtgttg ccaactgctg tagtccaaat 22081 cctggtatta atcttgatag aagagacaac aaaggcggtg cttcagcagg aatcatatct 22141 tgaagaatat tacctgtatt ttggatatga cccattggat attcaaaatc tgagtctccc 22201 cagtaaaaat cgtggacgca aatcgtcttt tggaacacag caggattggg tgtagcagtg 22261 agttggacga ctgctgagag taggtgtttc atgaaattgc gtcccacctg atctgaacta 22321 tttgctagtc cagtggggtg tttttcattt gccgatcgca acaacaaagc cgctgagttc 22381 acagcaccac acgccaagac tacaatatca ccgaaaaata gataagattg accaccaatt 22441 tccgcttcaa cagcttgcac ttctcgacca gacggactag tgtgtaaaca cacaactttc 22501 gcgtcagttt ttagtgtaac gttaggaaac tctaaagcag ggacaactcc tgatacctca 22561 
gcatcagctt taccatcaat cttacaggga aagccatcac aagttttaca ccggacacaa 22621 agactctcat taacatcact ttcattcagc ctcaagccca aaggcagata tcctggatgc 22681 agcccttgat tagagatagc atcacaaatt tgctgcatcc gaggttcgtg gcttattgga 22741 ggataaggat aatcttcact cctatgaggc tctgttggat cgtctccttg tttaccatga 22801 acaaagtaca gcttttctgc ttctgtgtag taaggttcaa agtcttggta ttttagagac 22861 cattctggag agataccgtc ttgatgttga accttctcaa aatccttttc tcgcatccgc 22921 aacaaagccg caccataaac cttggtgttc cctcctaccc aatagttggt ttggggacga 22981 aaaggctctc cagaaatatc gtaccactgt tcacgagtgt gatagcgtcc ttttccaaaa 23041 acctctccgg cattccaatt atcattttct ttgggtaaga aattgcctct ctccaggaca 23101 agaattttct tacctgttgg tgcaagtttg tgtaataatg tacctccacc tgcacctgta 23161 ccgacgataa tcaagtcata gtgttggtca tcaataatca taggatttct cctcttttaa 23221 ggctgtagat aaacaagtaa aacccaaatt tgacaacttc aaaaaatgat tgctcaacct 23281 tccttcactg ccagatataa atgagaacaa acaaaataat ccaaatcaca tcaacaaagt 23341 gccaaaacaa tgatgtcgca ttcacaccaa agtgacttga atcataatta cccggaatga 23401 acgagcgagc aaaaattatc atttgtaaaa tgattccagt cagaacgtgc aaaccgtgaa 23461 aaccagtcaa caagtagaac gtcccaccaa aaacaccact ggtaaagcca aaagcaagat 23521 gattccattc aatcgcctgt ccaacaagga agtaagttcc cattgccatt gttgttagaa 23581 gaaacagacg aaattgattt aaattgtgac gtactaaggc acgttctgcg agataaatca 23641 caaagctact ggcaacaaga actaccgtat taatagcagg gtctttaact tctaatccag 23701 aaacaccagg cggtaaccag tcgggtgttg ttgttttata gacaatatat ccggcgaaaa 23761 aacttaagaa aatgacactt tctgaaagta agaacacaat aaagccgaac attttattgc 23821 cttcttcatc gtgggtatgc tcatgagaag gttcttgtat ctgttctgga attatggaac 23881 tgtccattga ttaaattctc cttgacttga cggctgatta tttgtactta ttcgttggca 23941 taattgtaga attgtaggtt gggttaagcg gagcacaacc caacaaaact ttcggttggt 24001 gttgggtttc gttccaaggg cccaacctac ttacttgtta ttcacgtatg cgctttgcgc 24061 acgcccagag ggctaacgcg cagcgtgccc gcagggctta tgatttagca ttgctatgtg 24121 aacaagcaac agtcatcacc agaaatacgt tcatttctgt caatctacga tttgtgatta 24181 tgaacttgac cagccgcagc gactaatggt tcagatttcc cataaccata 
tggttcagaa 24241 attatcacag gaatttcttc aaaattctct acaggtggtg gtgaagaaac cagccactct 24301 agtccaattg cccgccaagg atttttagga gcctgctttc catgcatcca ggaagctatc 24361 atattcaaaa tgaaaggtaa agtagacatc cctagtaaaa acgctccaat actggcaaca 24421 atattccaaa atgtatactc aggagcgtag gaagaaacgc gacgcaacat tccttgtaat 24481 cctaatggat gcattgggaa aaagttaaga ttggtgccaa taaatgccag ccaaaagtgc 24541 aactgacccc agccctcgtt gtacatacgt ccagtcattt tggggaacca gtgatatatg 24601 gcagcaaaca ttcccattgt gaccgtaccg tagagcacgt agtggaaatg acccaccaca 24661 tagtatgtat tgttgacgtg aacatcaatt ggaacagaag agagtgtaat gccagttatt 24721 cctgcaaaca cgaacaatat taatccacct aacgcgaata gcatcggcgt atttagccgc 24781 aacttacctc cccagatagt tgcaacccaa gcaaatactt taatacctgt aggcacagaa 24841 acgaacattg tcgagagcat gaaaatcatc cgcatccagc ccggagtccc actcacatac 24901 agatggtgta cccaaactaa agcactgact ccagcaatca agagtgatga aattgcaacg 24961 actttgtaac caaacagagg tttacgagca tacacaggaa agatttctga aaaaatcccg 25021 aaaacgggca gaatgatgac gtaaaccgca gggtgagaat agaaccagaa aaagtgttgg 25081 aaaagaactg cgttacctcc ttttgcaggg tcaaaaaagc tcgttccaac tgttaagtca 25141 aacagaagca taactgcgcc tgcagttaag gcagggagtc caaagagttg gataatctgg 25201 gcgctaaaga ctgcccagac aaacagaggc atccggaaga atgtcatgcc aggtgcacgc 25261 atcttgacaa tagttgtcac aaagttgact gctcccataa tcgaggaaac ccccgatatt 25321 gccacagcca aaagccagac aaattgacca tttatcaatt gacctgaggg gttttgcaaa 25381 ctgacgggag ggtaggacca ccagcctgct tgtgctgggc cacctgggac aaagaaactc 25441 gccatcagca aaattccggc tatgggaacc atccagaagg caacggcatt nnnnnnnnnn 25501 gcgcggaaat gccatatctc gcgccccaat cattaacggc acaaggtagt tggcaaaacc 25561 tactaatact ggaaatgtcc aaccgaataa catgactgtg ccatgcatag taaacatggc 25621 attgtagact gtgcggtcaa caaggtctga ttctggggta atcagttctc cccgaatcac 25681 cattgcaaag atgccaccaa caagaaagaa aatgaaagca gtgacgatgt actggatacc 25741 aatgactttg tggtcagtgc tgaagctgaa gtatcttttc cagcctgttg cggcttcgtg 25801 gtgaggtttt ccaccactat ttagattgat gtctttaata gaaatattgg tcatttgtta 25861 tttgtccttg cgaaactcac aaatcacaac 
taattgctgt aattgaccag aggaggctgt 25921 gcaggctcaa ctgtagccca gccagttttg ataagtcctt ttgtttcctg ggtatactca 25981 gcaaaagctt gattgggtgc aggagatggc ttttgagttg ctacttcggc aagccactta 26041 tcatagtctt cgggagactc aacaactaca ttcgcctgca ttgtagcgaa gtaagtaccg 26101 ctaaattggg aatcggtcaa ttggtatttg ccgacgcgga tgggagtgaa ttcaaagtca 26161 attgtgcgtt tgggaatgat gtcttgcttg agtcggaaag cgggaatata aaagccgtgg 26221 atcacgtctt ctgattgcag tgctaaacga acgcggcgat cgctgggcaa atgcaattca 26281 gtactggtga taccttgatt ggggtagcgg aatacccaag cccactgttt agcatcgact 26341 tcaatgtttt ctacaggttc agttgaggag tcaacaggcg cagcataagc tgattccatc 26401 cccattggtg tatgcaggtg cacaatttcc attggaccac gaatcgccat ttgctcgtag 26461 acttgatagc tataccccgc aatccataac actaccaaaa ttgggatagc tgtccagaca 26521 acttccagtg tgatatttcc ttcaatcggg ggaccatctt taaaatcata cctgccagct 26581 cgatggaaag tcacggagta tatgatagtc gcagtcactc caacaaagat gaatgcgccc 26641 agtgtgacga gaaaactgaa caaatcatct atcagttgtg attctgctgc agcttgcggg 26701 ggaagccaag aatatgcctg ctgccctacc cagagactga taagagtgag ggcgatcgca 26761 cttacaatga gtctcaaaac attccagatt ttcatagtca ttagtcatta gtcattagtc 26821 atcagttatc agtcattggt cattagtcat tagtcattgg tcatcagtca ttagtcatta 26881 gccattagca aagaacaaat gactaaggac tagaactatt tacttcagca gcgtattggg 26941 gttttgaccc aatcgcaaca acatgtcagc agtattatgt accccaaact cagcagccag 27001 ttgcgctcct agtgttccat gtaagtacat taaaaacatg attgcgaacc cagaaaacag 27061 ataactccat tgcacttgtt ggctttcgtc cttgcaccaa acatagcgct gtaatcctct 27121 ccagactgtc atgccaacaa tcagtgctaa taacaaaaca ccacccacac catgccaaag 27181 cattgtttcc attgcctgca atccccaagc actcttcata tcagatggtg gatttgccag 27241 catgatttcg taaaagcctg ctgtcacggt aaaaaagcta ataatggctg aacccaggat 27301 attgtaccag ccgacatcaa agaagttgga acgaacagcc ttaattgcca aaaacttgaa 27361 gactagagtt tgtaacggga acagtacacc tacgaaatca aagagaatgc cgagaataaa 27421 caaacctaga gtgagatgaa ctaagtttgg atgaattggt attgcatagg gcaatccgtt 27481 tacgccaaca tgtgttttta attggtcaat aatttcagcg ttcattgcaa caacccctct 27541 tttagtgctt 
caacaactgg tactgtatgc agtccataga cccaaaccag tttgtctccg 27601 agatacactt gtacgccaac cagaagagtt aaaatcagtc ctagtgtaag ataagaaaaa 27661 ggtatcttgt ttggggtgcg gatacgaatc acatagcgcc aagctgtaat agatgcaaga 27721 attcctgaaa gcgaccagcc aagcagtgta tgaaaattca gcactgactc aacagcatcg 27781 taaggttctg ccaaaccagc ttcaaactga ccaaaaatga tagcaatgaa gatggaaatt 27841 gtagcgaaaa acatattcca ccaactcact tcaaaaagac ggtatttccc tgtgaaatag 27901 ccaatgacat cgcaaacaaa ggcaaacaat accatcgcaa tgacgaagtg tacaacgatg 27961 gggtgaattg tatccggata cggtaaatag tggtcgttca aagccggaag atactcaaac 28021 atatttttct ccaagacata tcttctgaat gatttgccac tgcaaattat gtattcagta 28081 aaaacttttg agactaaaaa ttctcatact gaatcaaatt aaggctctac gacgtttcat 28141 gtttgtcaat tcttgacaaa agaaattttt agaactttct gtaagccgtt gtggagagtg 28201 tttaacagtt atcagttatc agttgctgaa taactgttca ctgtttactg tttactgatt 28261 gaatgaaaac ctactattta acaaagtgaa gtttgagtca tccagcgtgc aattaacacg 28321 cacaatgatt cgatacctac tttgaggttc agatactcaa tcattaagtc agaaaatagc 28381 cacaggtttt aaccaaaaac ctggttcaat cgcagcccca tgcccaatca atttgcattt 28441 ttgtggtaat ttattcccct gggctgttta ccactgttag ctatcttaaa cttatgtaaa 28501 caaaatttat gtattctttg atagttttat atttttttta ttttgttata atgggacaat 28561 aaaatcaact gttttttctc tttgagatat tagaagatag cacaaataat cgtaaaatta 28621 agcataccaa caagggtgat gcaaacgcct acaaatagtt tgttccaacc tccttgaatg 28681 gtatatttat ttacgcgccc actgtgctag tcaattcaga attgatcttg gagagttcta 28741 ggtaaaccat caaatattgt aagatacaga agttgtgtgc aacatcttta ccatatacat 28801 aagggtagtg ggaaagaact ggctaccacg cgagaaatgc catgattaag ctttatcagg 28861 tcgaattatc cggaaattgt tataaagtac gactaatgct gtcgctgctt gagattaagc 28921 atgagcaggt gttggttgat cttcccagcg gcgaacacaa atctctccaa ttcctcaaac 28981 taaatccctt tggtcaaatt cctgtgctgg tggatgggga tgttgtggtg cgagattctc 29041 atgcaattct ggtttatcta gcacgacgtt acggtgatga gaattggtta cccacagaag 29101 cagaaccaat aagcaaagtc atgcggtggc tgttcactgc tgcaaacgaa attcgtcagg 29161 gaccggaatt tgccagacga taccacctgt ttcaaattcc gctagatgtg caactggcaa 
29221 cagaaagagc ttatgcaatc ctgaaaatac tggatgagca tttgactggg cgacaatggc 29281 tggaactgaa ccgccccact atcgcagatg ttgcctgttt cccatacatt gcactagccc 29341 cagatggcaa agtctcttta gatgcttacc ccaacgtaat tgcttggatt aagcggatga 29401 aacaattacc aggatacgtt ggaatgccag ggctgtagta tcaatccaaa atctaaaatc 29461 gatatgacgt ttcattctgg agaaatcgca gtccaaactc aggcaggggt gagagatgaa 29521 gcacaacggc tatgtaccgt cgtcagcaac attatcaaac cagctgccca ggaattcctg 29581 ggcagccaaa atttagcagt agctggtaca gtcgatgtaa atggcagagt ttgggcatca 29641 ctgctcacag gacaatccgg ttttgtgcaa gtcttaaacg aacaaacagt acagattgat 29701 gccaacctta taccagacgt tgtgctaaag caaaacctgt acagtaatag ccaaataggt 29761 ctactagtca ttgatttggc aaaccgcagg cgcttgcgtc tcaacggcaa agctgagata 29821 cagccagaag ggaaaataat tgtacaaatt cagcaggcat tttttaactg tcctaaatac 29881 atccagatac gtcatataga aaaaggagtc attgaggcgc taggaaaacc tgaaatcttt 29941 accacagagg ctttgaacga gacaatcaac actctcatta ccacagccga tacctttttc 30001 atcgctagtt ctcatccaga ttttggagca gatgcttccc accggggagg atatccgggg 30061 tttatccaag tggtgaacag caacaagcta gtttttcccg attacactgg taataatatg 30121 tttcaaacct tcggcaacct ggttgtaaac cctcatgcag gtcttttatt tatagatttc 30181 gagcatggtc ataccttgca gcttactggc aaagctgaag tgatttggaa tgcaaataaa 30241 ttaagtactt ttgctggagc gcaacgttta gtagagtttg atgttgagca agtgttggaa 30301 accaggaatg ctagcctcct gcgttggcgg tttggggaat attcgcctgt gaatcctaga 30361 tgagaacaaa ggttgttctt gtgtttgttc tgcctgctgg ggtaagcacg ccttgaagtc 30421 tcgaacacgt aactcgaaca tctgctcaaa ttcttttgga ttgaaatatt caataataat 30481 ttcgttacct ttgtagaaga gcgaaatcag atgaatccaa gttttaacac caatgtctct 30541 gtctattccc caaagcttga tttttttgta gctgttcttg tctggtagct ttgaagtttt 30601 tcgtgtaaag taattatctt gtagataaac ttagctgctc cgttcagatg acggaatatt 30661 ttgcgttcac tctctggctc actatgcaca tatttattga aatagtagtg ctcaaagcaa 30721 ttatcaggaa tttcctctat ctcaaatgtc ttgatggtgt caggatagct ctaaaaaaat 30781 tctgtgcgga caccatgaat ttggataatt ttttttgtca aattcgattg atagaggtgg 30841 tgatatctac atgatatata agggcttatc cggtgaaagg 
gcaatggctc ggtggtgctg 30901 gtgcccaaag tcactccgcg actttgcccc gttccagagg aacgcttaag cctgtttttg 30961 gtgtgaactt tctttggcag aagaagagcg atcgcacaca aatcccaaaa aacttgggat 31021 attgttccaa attttttata atagaatcaa aaattatgaa accgaaaaaa gtagaaattg 31081 tcaccattaa aactccccca gagccatctc tgagtggttc cgaaggagta tcacctgaat 31141 tggaacaaac cactgaatca gaacaaacta cagccgacgt agcgcaagaa cacaaagacg 31201 atccggaaca aaccactgct tttgcagagg aagagaagcc ggaaatagaa gagcctggtg 31261 ccatctttca agctgttgga gttgttactg ggaaggtcgt tttcaccgag gagaacaaaa 31321 gtactatcat cattgggaac aaagagtatc ccctctttta cgcgcccaag aaacagcgag 31381 catttgaggc actaaaaaaa gaagtacagg caacaggcga agccactcaa cggttggttg 31441 tttacccgcg ttttacgcac ttccctagac gcgatcaacc gccccaggtt tcgtttcaag 31501 tcgtcgggtt tgacaaagga cgtgaaacaa agggggcggc atccgaggaa ctttccgaca 31561 tggagttcaa actctgtgga ttgtggcagt tcatccccgt ttgccccact ccgtgcatct 31621 ctatattccg aaattttacc aaagagcggc tggatcatgt gaagaaatcg gaaccagcaa 31681 aaaaagtaaa atttatgaaa gcaagccacc tgccactgtt ttggagggat gcgccagttc 31741 cgccgttcag attcaatccc aaagctgaga aagaacagca aggcaagccg gttttcgtgc 31801 agatcaaagc taaatttctg cctgcgcgtg atgcttttgc ttttgattcg ttgctggggc 31861 tgccgttgga gaagccaccc aagttcctaa aagtcagcaa ggaagataag gcaacagcat 31921 taaaggcgaa gagggacgct gaaagagctg ctggcaagga tgttggggga tacgaacgta 31981 agccaccgta tacagtcagg cccaaaggtc attttagtga aaaaccaatg accgaaaaac 32041 caatgaccga aaaaccaatc atcgaaaaac caaagccgaa accaaagcca caacaacagt 32101 agtaaaaaag aaactccctc gtataccaag ggagtttctg cttagggggg gtacggtcgc 32161 cagggaaagc actggctcct tttgattttt gatttttgac ttgttttggc ccctacaccc 32221 ctacaccctt agttttggtc aaaattgact cttactcggt accagaatca tcgaaaagta 32281 aggcacttgg gctggatcta catcatctag ggaaacgatt cgctggtttg ccattgtgcc 32341 tcgttcaatg taacgcgccc gtgacagcag ccctaactta tgtaacacat catgaacttt 32401 tgtaaaatgc cgtcgcagct tgataataac agcagcatca gcatttaaca attgtgcttc 32461 caaaacttcg gctggtagag gagctggtag tacactaaaa acatcgttac gataactcag 32521 aggaactgct agtgctgaag 
cacatcccat tggtgaagac actcctggca cgacttctgt 32581 ttcaaaatgg tcagacagtc gtgtaaatac gtacataaac gagccgtaaa acaacgggtc 32641 gccctcgcac agtaccacca catcccgtcc agcagcaaga tgttctttta tagggataac 32701 aacctggtca taaataggtt gtgctgcata cggttcaaga gcgcggggaa gatgatactg 32761 tacctcgatt tgattgccag gaagatattg ggctacaata cctctggcaa tactctgctt 32821 atcgtccgca gattgataga cgaccactgg gcaagaacgt aatagccgca gtgcttttag 32881 tgtcagaagt tccggatcgc caggacctac accaactcca taaagacgac ctttgttcat 32941 cattcttcct ccgttgctaa ggcgttaact gcggcggcgg cgatcgcact cccaccccgc 33001 cgtccatgta atgtcataaa tggcacacca aaactatctg ctgccagtgc agctttcgac 33061 tctgccgccc ccacaaaccc aaccggaaaa cccagaatga gcgatggttt cggtgcccca 33121 gcgctcaaca tttctagtag ccgaaacaaa acagtcggtg cattcccaac tgctaccact 33181 gccccttcta acactggtaa ccacaattcc aaagcagcag cagaacgggt ggtttccagt 33241 ttttgagcaa gttctggcac atcgggatga ttgagggtac aaatcactga attgtctgct 33301 ggtagtcgtc gccgcgtaat accttctgct accatccgac aatcacacaa aatcggtgcc 33361 cctgctgcta aagctgtacg cccagattct accgccgttg gtgaagctgc taagtcataa 33421 acaatatctg tcatcccaca agcatgaatg agacgcacgg ctacattggc gacatcggga 33481 gcaagcacag acagatttgc ctctgagcga atcatggcaa aggatttgct atatatttcg 33541 ttaccgtttt ttatatagtt aatagtcatt tgttagtttt taattcttaa acgaaccgcc 33601 aagacgccaa ggacgccaag aataaagata ggtaattttg tagcgggaag ggagtagttg 33661 ttagttgttt gtttatttgc taaccactaa ccactctcct ttcaaaaaac aactgttgta 33721 agtctgcgat aacatattgg ttgacaaatt caccaaatga tgggttggaa gtatcacgtt 33781 ctgccatata tatttgcaac attcgctcta gcagtgaagg taattctgta aaggacaccc 33841 aaccgtaaag ttctcgtcca aatttttcgt gactatcacc atcacctacg cagactttat 33901 aaccttccac cacttcccct ttatcttcaa tattgaaacc aacaagagca atatcgctcc 33961 tgctatgctg agcacaggat ttttcgcaac ctgtgaagtg aatgttgact ggttggtcaa 34021 gaataacgcg atcatccaag tattgtgcta atgctagggc atgatttttg gtatctgttg 34081 ctgaagacgc acatccggtg ttaccagcgc aagcaactag ggcactgcga atattagtta 34141 cagaccaatg caatcctaag ttttcaattt tactttgtat cttgggaatc cactgttgag 34201 
gaatatctga tatcaaaaga ttttgccaag gtgtcagcct aagtgtacca ctagcgtagg 34261 tttcagctat atccgctaac ccacgtattt gtaaagtatt taatctacca actggtacga 34321 caacaccaat gtaggacaat cctttgtgtc gttggggatg gacaccgatg tgttggtatt 34381 tagaatagac attgcatcca atatctctat attcctcctt aatttgtgta tgactgcttt 34441 tcacagaaaa acgctgtagt ggatagggaa gatgacgctc aacttcttga agaaatcttt 34501 ctatacctaa gttgttcaaa atctcccaaa aacgtggctg acgtgacttg cgatgaggaa 34561 ttgtgtttcc agtttctagt gtgagatgct gcaaatagac ttgtgccaat gcagccacaa 34621 cctcaacaca ctcctctggt ttcaggcgaa tgtctgtatc aataaatggt tcaaatggtt 34681 caaacagtga tacacccaaa ttcagcttga ggcggaagta aacgttatta tcaaccataa 34741 ccgcactcaa gacgatatcg ttggggcgtt cgcgccttgc gcggctccct ttgggagctt 34801 cgccaatcga cacacaacca ccgccatcaa aagcaacgct aaatttcggt gaaagtggtg 34861 ctagctcggc gtgagtggta atgtaatgat ctaattcaca aatgagtggt cgagtatcaa 34921 tgagttcgta caaatcaatg ccagccgtgg gactacccat gatattacgc aaatgatcaa 34981 cctctggaat ggctgaagct acaccaatct tctgtagaac tcttaaaaca ttactgggaa 35041 tctctgtacg aattgcacgg atttgaagat tggcacgatt ggtaatattc gtgtaacctt 35101 ctccaaactc gtcagctaag tcagctatga tatgaaactg ctgactcgtc aatattcctc 35161 caggtatgcg gatgcgagat aaaataccgt ctcgtgcgga cgtttgataa aaaagaccag 35221 gacaaccaga cacgtctaca actccttccc cgtgcgacgc ggagagcaga taatcattgt 35281 gtctacaaga cactcttgga gtcggtattc tgactagggt ttacccatta cagctgcggc 35341 acagtcccgg aatctcaccg gactttcccg actctaagtt tgtgtagttt agcatgagat 35401 aatagcctta catctttttg ccaaatcttc ttaattcttc catcagtcgc tagatgcaga 35461 aatggttatc agtagtgggt attggtgagg atgggttatc gggattgagt gcgttcgcct 35521 ctggcggggc tttgcccatc gcccgttctc tccttgaaca tgcacaagtg atagtcgggg 35581 gaaaacgcca tctcgccatg ttaccaccag acgattcacg agaaaaac
//
LOCUS       NODE_755_length_35175_cov_5.410934 35175 bp    DNA     linear   BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 35175)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 35175)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..35175
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(330..734)
                     /locus_tag="DP116_06265"
     CDS             complement(330..734)
                     /locus_tag="DP116_06265"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009458273.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_06265"
                     /translation="MPSIDSQNQKTFTYSRSTLERAERALVCSPFNPCLFETMRSRRV
                     ALSEMTGSSGVQNGYTKHSISELVADNALVWLIQVGVLRREVDGQGITDSFRLTPLGR
                     QIVEQFQRKSWRTPTWSDNLYNAVIRWFRLPF"
     gene            920..1297
                     /locus_tag="DP116_06270"
     CDS             920..1297
                     /locus_tag="DP116_06270"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009456221.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_06270"
                     /translation="MTLQKLMEPDSLPTEVILTHSRQSLGSVKLDWAPQPGNYLDFEG
                     KTYAVLERRHRYQFKAGRYRLQKIALYVQSAIRPNEKSLVGGRWVIGDATCRYNACSE
                     IMRCAVNPDGPCETCRYYQRREC"
     gene            complement(1331..2335)
                     /locus_tag="DP116_06275"
     CDS             complement(1331..2335)
                     /locus_tag="DP116_06275"
                     /EC_number="2.1.2.9"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017310840.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="methionyl-tRNA formyltransferase" /protein_id="PRJNA477356:DP116_06275" /translation="MKILFFGTPSFAVPTLEKLLNNPEFDVLAVVTQPDKRRGRGNQL IPSPVKAVATSANVPVWQPQRVKKDIETLTKLKDSDADVFVVVAYGQILSQEILDMPK LGCVNVHGSILPKYRGAAPIQWCLYNGETETGITTMLMDAGMDTGAMLLKATTPIELL DNAHDLAEKLAVLGADLLVETLSKLEHQEIQPIPQDNSQATYAPLIKKENYQLNWSKS AIQLHNQIRGFYPDCTATFRNQPLKITATAPLDDAEGYELPAELQEIIHKLPNLSTAS GSPGEVVSIVKGIGAIVQTGEGLLLLREVQSAGKRPQSGWDFVNGSRLAVGEVFGSAE " gene complement(2378..2746) /locus_tag="DP116_06280" CDS complement(2378..2746) /locus_tag="DP116_06280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013325094.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06280" /translation="MYKTNKLGRLINPLRKWLVQHLREFPNALGQAIEFKSIRVRYNA VAPNERQVTPSAPTVGTALASCGTRGEPRHLRDCRHPRRQSPQRREPPHGAGSPTQWL PKTSATTPGTVLPNTLALVA" gene 2984..4333 /locus_tag="DP116_06285" CDS 2984..4333 /locus_tag="DP116_06285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320198.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06285" /translation="MLKHLMLSGWFRAQPFVKYLIFVMLIAPLMAASEHTTSAQQVQV AKVASATKEPDSIAVDPRLLTTAAVDKANFVSVPKAAPIKSFTEKSHSIPMSLGAVSQ PQRQIDEQTNNIEALTVKDSSASVQMPYNDQLPPLELPPALKESESASNEMAASQHNL VQRLKASKVLALAANNGSASGEVLAAKTLEAVKIDPFKEGQQKSTAVPVVEQSGQQAQ AAQQDPIGSPHPIPWTWIQATQDAIGSKGLSGVRYYRSMPVISADGRYAMYSRVQLEV KPQMYNSRVTSVLFVEDRQTKKLRVVSSTGSINDPLLKVQVSSPSDAQGTIAVLVPVS WSQKGDRFLARKFEGAMNTSDVTDYALIWDRQQNRSESVTPSQEEYKHEIAVLLGWSK THPDQVVFRAGELGEEEWPLMTVAYDGKTVAATTTEQPVVYGKRVTDVWAEPQVAYR" gene complement(4605..6032) /locus_tag="DP116_06290" CDS complement(4605..6032) /locus_tag="DP116_06290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875517.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="PRJNA477356:DP116_06290" /translation="MASRTLSRVKFKPDLRSAKGTTIQRGLTIRFLRILTLIFLDVIS LILACKLAVFLGTPLESPWTKQTSFLLLVLTVEIGLIAAQGLYKAGIFRRNYPALIKA VSLSGILLLLIAFLYEPESYVSRSTFLLFWFFSVAFICAGRCIFDVTTRLLRKKGAIR YPVFLISDIEDQESHIRLIEQENCYTVQGISDSKCLDRANREQTLEYLRNQGIVEALV SWNSIKNRLYVCWNFQTAGITLRILPTQNTIFHPKSVVWMIGEVPCMTIPAPIIAGSD FWVKRCFDLCCSIILLVILSPVYLLITLLIKLDSPGPIFFKQERIGLHCKKFKIWKFR TMVVNAEKMQKDLEAKNEIKDGVLFKLKNDPRITRVGTFLRRYSLDELPQVFNVLLGQ MSLVGPRPLPLRDVEKFQTGHFIRQEVLPGITGLWQVSGRSNIDNFEDAVKLDLSYIE NWSLWLDLKILLKTVQVVLRKTGAY" gene complement(6166..7332) /locus_tag="DP116_06295" CDS complement(6166..7332) /locus_tag="DP116_06295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl hydrolase family 10" /protein_id="PRJNA477356:DP116_06295" /translation="MRFFIKRRELLLGLGTLASTVPFACANRFKDYNQVKAQYNRKRN FSIVGNTSLRKRAAVKGLIYGAFPSFGYENLSRNKQLQSAFIRECGLLVGGFYWGVTR PSINNFNFNDTDSFAQFASEHGMLFRGHPLVWHQVIPQWLISKFQDPKTTSKEIENLL TNHVSTIVKRYAGRIHSWDVVNEAIEPKHGRPDGLQDTPWLKFLGPDYIDIAFRTARD ADPKALLVYNDALLDHDIPEHEARRIATLKLLKTLKSKGTPVQALGIQAHLFADKPFN PKKLRAFLRDVASLGLKILITELDVSDNQLPRDINVRDRIVASVYEDYLSIVLDEPAV IAVITWGFTDSDTWLSRFPRSDRAPLRPLPFDFNMKPKLAWNAIARAFDKAPKR" gene complement(7520..8329) /locus_tag="DP116_06300" CDS complement(7520..8329) /locus_tag="DP116_06300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651400.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_06300" /translation="MKLTISQRNLLKHRYILGMRVDGTSYEDATQRILGWAKARKSCY ICVANVHMTMEVHDDPVFASVVNNAALVTPDGMPLVWALTALGVKNASRVYGPTLTLY VCEAAAQAGIPIGLYGGTSESLVAFVKFLHQRFPGIKIACQISPPFRPLTREEDDAYT QQIVDSGARILFVGIGCPKQELWMAAHKNRIPAVILGVGAAFDFHSGRVKQAPSWMQK RGMEWMFRLLMEPKRLWKRYFKHNPRFLLFFVMQWLASKFGWRLFETNSLD" gene complement(8378..9553) /locus_tag="DP116_06305" CDS complement(8378..9553) /locus_tag="DP116_06305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651401.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_06305" /translation="MRILSLHNRYQLPGGEDVVVSMERELLEANGHCVDLFEVNNDHI TNSIEKAKSAVNSIYSISSKTQILQRIASFKPDIVHVHNFFPILSPSVYYACREAAVP VIQTLHNYRLLCLNSYFFREGKVCEDCLGKSFAWPGVVHSCYRGSKTGSAVIGAMQSI HRTLQTWNKVVDVYITVTEFARQKYIQGNLPASKLVVKPNFLYPDPKPGEGQGNYALF VGRLSPEKGLETLLKAWEKLAGKIPLKIVGDGPLANTVASTAQRLTGVEWLGRLPKQE VLKLMKDAQALIFPSLWYEGFPLVIVEAYAVGLPVIASNLGSQSSLVEHGRTGLHFRP GDPEDLVAQVEWALTHPAALAQMRQETRGEFEAKYTAERNYQMLMDIYERVVNCKTF" gene complement(10199..11308) /locus_tag="DP116_06310" CDS complement(10199..11308) /locus_tag="DP116_06310" /inference="COORDINATES: protein motif:HMM:PF00534.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06310" /translation="MKLLIISQFGGIGGAERSLIPLAKELQSDGYELTLLLLKLPTDS HIFQDFPGTVLFPESASSWYRNHTLSQLNREIANTDLVIATSELSPTYISWLLSRWHR KPFIADVQVHLSQWINDSAKSLHHYLCRWIYPQISYIRCVSEGVSDDLRLNYGVPSEH LSIIYVPFELDAIIQASQSPLPSEHHHIFNKPTIVSIGRFTSQKRFDIAIEALLYLRQ SYDIEGNLLILGDGELRPQLEQQIRMLELSNSVFMPGFVENPLMYIARSQVFLLSSDY EGFGRVIVEALALGCPVVSTNCPSGPSEVLEQGKCGLLVPTAHPQEMAHAIAQILTNS QLSQTLRSAGLQRAKDFSSQVIAQDYKRLINCALS" gene complement(11483..12259) /locus_tag="DP116_06315" CDS complement(11483..12259) /locus_tag="DP116_06315" /inference="COORDINATES: protein motif:HMM:PF13641.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_06315" /translation="MNLKYPKITVVTPSYNQAPFLKMTLESILNQDYPNLEYIVIDGG STDGSVDILRQYDKQLTYWVSEPDQGQTDALNKGFLRATGDILCWLCSDDLFESWTLK EVAQFFQDNPQARVVYGDSTWIDVEGQPLRIKKELPFNRFIFLYSHNFIPQPSTFWRR DLYEEVGGLNSEFDLAMDADLWMRFSEVTDFYHVRRSWSKMRLYPEQKTQRLSVRSRE EGRMIEQRYCGEEPMWSFKVKKFLATSLRVGWKLAAGNYW" gene complement(12360..13607) /locus_tag="DP116_06320" CDS complement(12360..13607) /locus_tag="DP116_06320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_06320" /translation="MKVLQMSTYDGRGAGRAAYRLHQGLQKIGISSQFLVQAKNSDDE TVIAPHTKLEKGVAKLRPSLSRLPLSFYPQRELTDYYPSWLPDTLRPQVKQLNPDIVN LHWICNGYLEVETIAKLNKPIVWTLHDMWAFTGGCHYSQNCDRYMNSCGACPQLRSQK NMDMSGWQWQRKAKAWKEINLILVSPSAWLAKCAQASSIFKNVRVEVIPNGLDTTIYK PIDRRIARQILNLPQEKQLILFGAMNAMHDKRKGFHLLLSALQSLMKSEWRDRIEIVI FGSSQSKNEVDLGFKSHYLGKLEDNISLSLVYAAVDAFVAPSIEDNLPNTVMEALACG TPCVAFKIGGMPDMIEHQKNGYLSHPYIIEDLAQGITWVLENRERHQKLCKYAREKAE QEFTIQIQASRYLSVYTEIMDQH" gene complement(13752..15038) /locus_tag="DP116_06325" CDS complement(13752..15038) /locus_tag="DP116_06325" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06325" /translation="MKIKIFYIPIFLLVADLLLSLPQFNLRVIGALRVAFIPLLVLVL IEQLRKVDEKRIRFTISMSWPLLLLPILFLFQIVAIPSSAVSTHLSGCTKYISWCLLY LCGLLSLNSQTTRQVRHILLLTLLFVFLATIVQYPILLQRSSESLSSIISSYGHQEDK DIFGLFGAANEDANSLMTLFPLSLFYISQKRGSKRNLWKVFLLLYIPIILFFNGTRTA LFITFPLVIFLFHFSLSIKNLIKLVPFLITFVLIYSLYVADFAGSSFSKESEGEGSWG FRVERVWVPASNYTSENSPLFGFGSRGWEYVCLINGIVRGAGELNEFEVVPSHNVYVW SYVSWGFFGLFIYIAFLLTLLKESFQLSIFPQEKEVALFGQTLFCCVVAYCIWASISN AYIESGWDILFVLGILIASLKMTVLLNQNRTYLSNR" gene complement(15273..16187) /locus_tag="DP116_06330" CDS complement(15273..16187) /locus_tag="DP116_06330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651406.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_06330" /translation="METPIAFIIFKRPHTTEKVFEAIRQAKPPKLFVIADGSRAEHPG EAEKCEATRAIIERVDWNCEVIKNYSDTNLGCAKRVVSGIDWVFSNVEEAIILEDDCV PHPTFFPFCEELLEKYRYDSRVASISGSNYQLGHRRTNYSYYFSIYNHCWGWASWRRA WQDFDIYMKLWPQIQAEGCLSDILEDYKAVRYWSNLFQSVFENPSDQIWDYQWTFACW IQSALSIIPKANLISNIGFGLESTHFTSKKVSPYINMPTEVMEFPLKHPPFIVRNLKA DKFTQQTLFKQTAFQLIKQRIKKLIKPK" gene complement(16230..17162) /locus_tag="DP116_06335" CDS complement(16230..17162) /locus_tag="DP116_06335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 8 protein" /protein_id="PRJNA477356:DP116_06335" /translation="MVTPSSKESIIVVCGADDRYAMPLAVVICSVLENLSSNRHCTFF IIDGGISKKNKNRILKLTDSKQCLINWLQPPDAMLNNVILSGHITVAGYYKILIPVLL PDSYSKAIYLDSDLIVKGDLAKLWDINIEDNYLLAVPDIGIPYVSSLYGLKNYKELGI PSHQKYFNAGVFVLNLEKMRTENISMQVIQYLQDNKEHIRWHDQDALNAVLAGKWTEL ELRWNQLPSIYNNSSCEDSPFSQEEWTNALKDPYIIHFASSSKPWNSTVYHPANDLFF HYVDKTPWAGWRFTILRRIWIKLMLKIVTTMSKA" gene complement(17297..18454) /locus_tag="DP116_06340" CDS complement(17297..18454) /locus_tag="DP116_06340" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06340" /translation="MRSKKLLIVSPFPNDQDLYPHLKYLIDELSVHYTIDYFYMEERG LWIENIIYDVIRKGKIYSSLKRLYILTKDVFRLHKIKLGKTKYDAVVAVDNFLYVLSS FILKQEVVLWSHDLLGYDEPRNYCRTAFIHRVIAKFTRKSLAKNKKLIIQDKERLDFL LESIGYKGTLDNVFFLPVSLPPIQVPKKELKSKAKVPTLMQSGSIASWRGSDDLIQYH QQNFDKFDLFLHGFISDEIQELLGKVEVLPLISSFKVLPDKVPQLIQLCDIGFINYAV EDLNHFYTSNASGQFVEFIRCGKPVIVKGHTNLQQYVEEKKVGVSIISIDELASAIQK IKTHYSEYSYNCIKIFEKSYDIKNYTEKLIVYLEKENSSYESPTYQLTRYF" gene complement(18483..19253) /locus_tag="DP116_06345" CDS complement(18483..19253) /locus_tag="DP116_06345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015143085.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_06345" /translation="MSIFNNYARYYDLLYLDKDYVGETKFIQQLIQTHAPNAQNILEL GCGTGNHAVLLAKEGYQIHGVDLSQEMLRKADSRLSQLDPELASQLKFTHGDIRHLRL NQTFDVILSLFHVISYQTTNEDLLAAFTTVKEHLKPGGIFVFDIWYGPAVLSDRPSVR VKRLEDEEIKVTRVAEPVIYPNENLVDVNYQIFIQDKTSGAVEEIQETHQMRYLFKPE LDLLLKDVGLKIINDMEWMSRRALGFHTWGSVFVVDFL" gene complement(19477..20493) /locus_tag="DP116_06350" CDS complement(19477..20493) /locus_tag="DP116_06350" /inference="COORDINATES: protein motif:HMM:PF13641.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06350" /translation="MISIIIPTLNRAISLKLAINSICQQNFSPDKFEIIVVDNGSTDN TKQVTEAAIAAYPFHKIHYIYEVEPGLLSGRHRGALEAKGDILIFVDDDIEADVNWLQ AIQESFDDSSVQIVGGRNLPKYEVEPPEWLEWFWLEHPYGKLCGYLSLLDFGDQVRNI DANYVWGLNFSIRKSALFELGGFHPDCIPKHLQYLQGDGETGLTQKANSQGYKAIYQP KALVFHSVPKERMTYEYFEQRSFYQGVCDSYSNIRQPNEKLEQVSLINKIKQPLRFLK KIGLKLFRKQTEKDKLNERFLKAYQRGYQFHQNSVLCNRELLDWVLRKDYWNYQLPDI CINT" gene complement(20750..21865) /locus_tag="DP116_06355" CDS complement(20750..21865) /locus_tag="DP116_06355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015143086.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aminotransferase DegT" /protein_id="PRJNA477356:DP116_06355" /translation="MKPIPVNEPLLDGNEKKYLFECIETGWISSEGPFVKQFEEQFAA SVGCKYGIAVCNGSVALDAAVAALGIGSGDEVILPTFTIISCAAAIVRAGAVPVVVDC DPHTWNMDVNQIESKITPRTKAIMVVHIYGLPVDMDQVLGLADKYGLHIIEDAAEMHG QTYKGRRCGSLGTISTFSFYPNKHITTGEGGMLLTNDEKLAERCRSLRNLCFQPQKRF VHEELGWNMRMTNLQAAIGVAQLERLDEFVARKRRMGQIYTELLANISDVQLPLPLTT YATNIYWVYGLVLKDNVAFDAQEAMQYLAKHKIGTRPFFWCMHEQPVFRKMGLFEEES CPVAENIARRGFYVPSGLALTEEQMTQVALAVKDMLK" gene complement(21862..22305) /locus_tag="DP116_06360" CDS complement(21862..22305) /locus_tag="DP116_06360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015143087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06360" /translation="MIEQILYQDQLLAVIISHKFDKPGIHFFTPNELSQQLAYMHHPK GKIIQPHVHNAVPREVLYTQEVLFLKRGKLRVDFYNDQQKYLESRMLEAGDVILLVTG GHGFEVLEEVEMIEVKQGPYLGEQDKTRFVGISAESAKIPELSQL" gene complement(22794..24611) /locus_tag="DP116_06365" CDS complement(22794..24611) /locus_tag="DP116_06365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008189392.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_06365" /translation="MKKYFSKFLYVISAKKRTIFILLCLFLLISVLDALGIGLVGPFM SLATNPDLVFKSSWLNWGYVNSGFQSTSQYIALLGLGIIIIFGIKSLLYFQVQRYIYD FSLTQQGLLKLRLLHGYLTVNYTYHLNKNSALLIQNIIHETFLFCYSVTLPLLSSAAN SVVVSALILLLLKTDFSATASILLMLTLAFAFYNKFKDQMAYWGKEGSESDTEMIRII NHSVGGLKETRVIGCESYFESQMNIQAQRHALTGSLFQVFQSLPRIAIEALLVTFIVC FISVSLVFNQNPQNLISILSIFAVASIRLIPAASQLMSAIGTLRNSSYSLNKLYFDLK ELETNKLESIKSIKSLYKSESNVTIGKSTSIQKLNFRNVLILNKINYSYPNVSENALE NVSLTLKKGQSIALIGKSGAGKTTLVDVILGLLIPNHGDIRVDGVSIYDNLRSWQNLI GYIPQSIFLMDDTIERNIAFGVLDEQIDSQKLQKAIQTAQLEELISQLPDGIKTAVGE RGVRLSGGQRQRIGIARALYHQREILILDEATSALDNETENLISEAIRSLSGTKTLIL IAHRLSTVEHCDRIYVLERGRIVKSGNYQEVVLGQTISS" gene complement(24806..25666) /locus_tag="DP116_06370" CDS complement(24806..25666) /locus_tag="DP116_06370" /inference="COORDINATES: protein motif:HMM:PF13489.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06370" /translation="MDRDSIMTEAISRSKYLIKRLIFVFDPLYVSFYRIFNNYYAPIP PMENRVRVGAHYKIGGFLKSGRNCYEPIKKSIVTYQSNQINHLKILDFGAGCGRTLQF FYPDIPKIFATDVDSSAINYLSKAFPEANASCNQYDPPLKYDDNFFDTVYSVSIWTHL PVSKQKPWLNEISRILKPNGLALITVLGDFSLTIKKTKNMNLDVTPEKLEKEGILYLE YPGVNLSSEMRNKLFPGIDQSYGTTYHSEKYIREEWSDNFEVLDIQKGVIDNLQDLVI LRKNGSSNLA" gene complement(25749..27956) /locus_tag="DP116_06375" CDS complement(25749..27956) /locus_tag="DP116_06375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipopolysaccharide biosynthesis protein" /protein_id="PRJNA477356:DP116_06375" /translation="MSAMEPVPNSEEIDFQKYWLILQRRWLPAVGVCGVVVTLASLLA FSSKPTFKAEGSLLIKTNRTSSLTGLGEAIGRLDTLTMESNPSETQAKIVASVPVIQE TIAELNLRDDKGKAITIEELTNSLKISSLKGTEILQVSYIDKDPKLAARVVNKVMQAY IKNNIEANRQEAVSARKFIQKQLPTTEVSVKQAESALRQFKENNKIITLQEEASAAVQ AIAKLNEQISQAQAQLEDVTARSQKLQAQAKIDSSQTLTNSSLTQASGIQQVLTQLQE AQSQLAVERTRLKPEHPIIVNLEEKVAALNSVLQQRMKQVTGSNQQISLGNLQNGSLR EKLLEDYAGTEAQRVGLVKQIATLSNQRTVYKERANILPKLEQTQRELERKLKAAQTT YETLLTRLQEVQVAENQNIGNARVISPALVPDKSIGSGKILIIGAGGILGVLFGAVAA FALDTIDPSLKTVKQAKELFKYTLLGVIPLTGRNQKKSLRVPEVDQSIPKVIGRDIPH FPVGDAYQMLQANLKFLSSDREPRVIVVTSSVSGEGKSEVAANLAMAMTQVGHRVLLV DADMRHPVQHHIWNLTNAVGLSHVIVNPDTLNVALQEAMPNLDVIPSGVVPPNPVALL DSKCMAALVSSFTEEYDFVIFDTPPLGGVADAAVLGNLVDGILWVVRPGVVDFSSANA AKDFLKQSGHNVLGMVINGVNVKSEPDSYFYYTKEAVDSSRSVSRHSVVVRRN" gene complement(27999..28448) /locus_tag="DP116_06380" CDS complement(27999..28448) /locus_tag="DP116_06380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06380" /translation="MHRRCNRNRNRSKRRAIYCPIHGCYLESVSQKYPLFADRAGQLQ QRGISRQNALILVAAKTAVCLEGEWLEAFWCDQCQQTKWYHVKRRVTKTQNKESCTYE VSIVSPELWQQAIGVIHPEGNPSVGEFTRRHAQMVSYNGSKDFQFGD" assembly_gap 29431..29440 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(30376..32163) /locus_tag="DP116_06385" CDS complement(30376..32163) /locus_tag="DP116_06385" /EC_number="6.1.1.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197362.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aspartate--tRNA ligase" /protein_id="PRJNA477356:DP116_06385" /translation="MRTNYCGELRKEHIGETVTLYGWVDRRRDHGGVIFIDLRDRTAI VQVVSDPQRTPNSYELANTLRNEYVVEITGRVTQRPEESLNPRIPTGEIEIYADTIKL LNSVAKQLPFQVSTADADPVREELRLKYRYLDLRRDRMANNLQLRHQVVKAMRRYLED VEGFIEIETPVLTRSTPEGARDYLVPSRVNPSEWFALPQSPQLFKQLLMVSGFDRYYQ IARCFRDEDLRADRQPEFTQLDMEMSFMSQEEIIELNEKLVCHIFKTVKGIELQRPFP RLVYAEAMERYGSDKPDTRYGLELVDVSDVVKDCGFKVFREAVTNGGIVKILPIPNGN DSISNVRIKPGGDLFKEATDAGAKGLAYIRVRDDGEIDTIGAIKDNLTAEQKQEILRR TDAKPGHLLLFAAADTATVNKTLDRLRQVTAREFGLIDPEKINLLWITDFPMFEWNAQ ENRLEALHHPFTAPHPDDLSDLKSARAQAYDLVFNGFEVGGGSLRIYQREIQEQVFEA IGLSPEEAHNKFGFLLEAFEYGTPPHGGIAYGLDRLVMLLAKEESIRDVIAFPKTQQA RCLLTDAPSVVDAKQLKELHVASTYKPKS" gene complement(32438..33274) /locus_tag="DP116_06390" /pseudo CDS complement(32438..33274) /locus_tag="DP116_06390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868004.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="protease HtpX" gene 33947..34861 /locus_tag="DP116_06395" CDS 33947..34861 /locus_tag="DP116_06395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316543.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06395" /translation="MAQNINRVGMVQKYTKILHDLAIFIISPWLERASVKSLQKRVDY LERRLCQDLPNTEQRLGSLIQQLQRQFNLLQDNTTNRLDKLLEYDLPQQVLDVLKEQT DLIIQNIKPIVEDLIREQEENKPLQENISEALIQVKHHKIEEFSEHLQEFTALPFGFA VAYGGRPSCSAVSLPLPEEQSAASLEAGLWLLAHRDKITQEVADELLGLQASQTEVFR QNLSRYLKLLGSCLENGIQPRLLYQGIITHEQPAVEIYVNGFKLIQNKYIKCWEDSAQ VSTEAAQQLRVYFKYLIDYLLKALAFSP" BASE COUNT 10565 a 7321 c 6784 g 10495 t 10 others ORIGIN 1 cccccctaca cccctacacc ctacacccct acaccccttc ttggtcaaaa accaagattt 61 cactgattta gtcaaatagt taccataaag acttcataaa ctgaatttaa acttccgtat 121 tgattgtttt ttttgattcc ccactcttaa ctcttttata gcttttctcc gtttacaaaa 181 gtataataat taacttgtat agcagacacc ttcacgccgc gacaacgctt ggtaccccat 241 gcggctgtgc gtgactatcg gtaaacccga cgccagttgc tatctcctgc acgccgaagc 301 tcttgcaaca acgcttggta cccttaattc taaaagggta gcctaaacca acgaataaca 361 gcattgtata ggttatcact ccatgtagga gtccgccaag atttcctttg aaattgttct 421 actatttgac gaccaagagg agtgaggcgg aaactatctg taattccttg accatctact 481 tcccgtcgca gaacacccac ttggatcagc cacacaagtg cattgtctgc tactaactcc 541 gagatagaat gtttagtata gccattttga acaccagaac taccagtcat ttcacttaat 601 gctactcgac gagaacgcat tgtttcaaat aagcaaggat tgaagggaga acacactaag 661 gctcgttcgg ctcgttctag ggtgctgcga gaataggtaa aggttttctg gttttgagaa 721 tcaatactag gcatttggtc acttttaata ctttttagct cggaatctca aactgctaaa 781 acatgagtta gtaaaatatt acttcatggt ttgatggcta gaagctgtgg ctgcgtcaaa 841 aaatattagc aaatcgttaa cgataacata aactatgcaa gtcaacacag gcaggtgcgt 901 taccatccta ttaatgtaaa tcacattaca aaagcttatg gaaccagatt ctctaccaac 961 tgaggtgatt ctgacgcatt cgcgtcagtc cctcgggagt gtgaaacttg attgggcacc 1021 acaacctgga aactatcttg attttgaggg caaaacatac gcagttttag agcgccgcca 1081 ccgttatcaa ttcaaggcag ggcgctatcg tttgcaaaaa attgctttat acgtacagtc 1141 cgctatacga ccaaatgaaa aaagtcttgt aggagggcgt tgggtcatcg gtgatgccac 1201 ctgccgctac aatgcttgtt cagaaatcat gcgctgcgcg gttaacccag acggaccatg 1261 tgaaacatgc cgctactacc agaggcgaga 
gtgctgatta aagtaaagga aatttggaaa 1321 caatcactca tcactcagca ctcccaaata cctctcccac tgctaaacgc gaaccattaa 1381 caaaatccca tcctgactgg ggacgtttgc cagcactttg aacttctcgc aagagcaata 1441 agccctctcc agtttggaca attgctccta tccccttgac gatgcttaca acttcccctg 1501 gactacccga tgcagttgac aaattaggta atttatgtat tatttcttgg agttctgctg 1561 gtagttcata accttcagca tcatcaaggg gagcagtggc ggtaattttc agtggttggt 1621 tgcgaaaggt agcggtacag tctggataaa accctctgat ttgattgtgt aattgtatag 1681 cgctctttga ccaatttaat tgataatttt ctttcttaat caaaggcgca taagtcgctt 1741 gagaattatc ttgaggaatt ggctgaattt cctggtgttc cagcttggac aaagtctcta 1801 ccaacaaatc tgcacccaga actgcaagtt tttctgccaa atcatgagca ttatctagta 1861 attctatcgg tgttgtcgct ttgagaagca ttgcaccagt atccattcct gcatccatta 1921 acattgtcgt gatgccggtt tcagtctcgc cgttatacaa acaccactga atcggagcag 1981 cacctcgata cttgggtaaa attgagccat gcacattaac acaacccaat ttaggcatat 2041 ctaaaatttc ttgagacaga atttgcccat aggcaacgac gacaaacaca tctgcgtctg 2101 agtctttgag tttcgttaaa gtttcaatgt cttttttcac ccgctgaggt tgccatacag 2161 gcacattagc agaggtggcg acagctttta ctggtgaggg tatcagttga tttccacgtc 2221 cccgacgttt atcaggctgt gttacaactg ctaagacatc gaattctgga ttattcagta 2281 gtttttccag agtcggtaca gcaaaactag gagtgccaaa gaataggatt ttcatcagtc 2341 attagtcatt aggaggcagt gctcgactag ggttcggtca ggctacaaga gccagtgtgt 2401 tggggaggac agtgccgggc gtagtcgccg aggtcttggg gagccactgc gttggggagc 2461 cagcgccgtg cggaggttcc ctccgttgag gcgactggcg tcggggatgc cgacagtcac 2521 gcaagtggcg tggttccccc cgagtcccgc aactggcaag ggctgtgccg acagtcggtg 2581 cactaggcgt cacctgccgt tcattaggag ccactgcgtt atatcgtact cgaattgact 2641 tgaactctat cgcctgtccc aaagcattag ggaattctct aagatgttgt acaagccatt 2701 tacggagcgg gtttatcagt cgtcctaatt tgttggtttt gtacaaacct cttaagatgc 2761 gccctcctga tggtttccaa ttttttggtt tactaccgtt cttaaaaatt ctgtagccag 2821 aacgcgtact acgaatgact ctattttgta aaagtatgcg aaatttgagt aaactatatc 2881 gttaaaactg ctagttcctt gttatacata taatgataac ctgctctatt gctgcatccc 2941 tttgtgctcc tcacacagca accgccctct agaaagtcaa 
caaatgctta aacaccttat 3001 gctgtctgga tggttccgtg cgcaaccatt cgttaagtac ctgatttttg tgatgctgat 3061 cgcgcccttg atggcagcat cagagcacac gacttcggca caacaagtac aagtagctaa 3121 agtagcttcg gcaacaaaag agccagactc tatagcggtt gatccccggt tattgacaac 3181 ggctgctgtt gataaagcta atttcgtttc agtaccaaaa gcagccccaa tcaagtcctt 3241 tacagaaaaa tctcactcta taccaatgtc tttaggtgcg gtttcccaac cgcaaagaca 3301 aatagatgaa caaacaaata acattgaggc tttgactgta aaagatagtt ctgcgtcagt 3361 tcagatgccg tataatgacc aattgccacc attagagtta ccgccagcac taaaagaatc 3421 tgagtcggca tcaaacgaga tggcagctag tcaacacaat ttagtacaac gattgaaagc 3481 ctctaaagtt ctagccttgg cggcgaacaa tggctctgca tctggggaag tgcttgctgc 3541 taaaacatta gaagcagtta aaattgaccc tttcaaagaa ggacaacaaa agtcaacagc 3601 agtgcctgtt gtcgaacaat caggacaaca agctcaagca gcacagcaag atccaatagg 3661 aagtcctcat cctattcctt ggacttggat acaggcgact caagatgcta ttggttctaa 3721 gggtctttcc ggagtgcgtt actaccgcag tatgcctgtt atttctgcag atggtagata 3781 tgctatgtac agccgcgtac aactggaagt caaaccacaa atgtataata gccgcgtgac 3841 cagcgttcta tttgtagaag ataggcaaac taagaagttg cgagtggtta gctcaactgg 3901 ttctattaac gatcccctgt taaaggttca agtttcttca ccaagtgatg cacaaggaac 3961 tattgcggta ttagttcctg ttagctggtc acaaaaaggc gatcgctttt tagcacgcaa 4021 atttgaaggg gcgatgaaca cttctgatgt cacagattat gcactcatat gggatcggca 4081 acaaaatcgc tctgaaagtg tcactccttc acaagaagaa tataaacacg agatcgctgt 4141 gttgttaggt tggagtaaaa ctcaccctga ccaagtggta tttcgtgctg gcgaattggg 4201 cgaagaagaa tggcccttga tgacagttgc ttacgatggt aagactgtag ctgctacaac 4261 tacagagcag cctgttgtct atggtaaacg ggtcacagac gtctgggctg agccacaagt 4321 cgcttacaga tagatcatat ttgtccagtt gcccataatt aggactaaca cagaaaaaat 4381 caaaccattc atttagcgac tgagggagaa tattggcgtt caagagtaat aattccatca 4441 ctccagtcgc ttaactaatt tttaaaaatt gcaatctgag cctgagaaag agcagaaatc 4501 taaaatagga aatctctact ttcataggct caaacttcaa aaatgattaa aacagtgaag 4561 aagcaggtgg agatggagaa aatttctgca ttttgtttgc caatttaata ggcacctgtt 4621 ttgcgcagga caacttgaac tgttttcagc aaaattttca aatctagcca 
gagtgaccaa 4681 ttttcaatgt aagaaagatc taatttcacc gcgtcttcaa agttatcgat attggaacgt 4741 ccagaaacct gccacaatcc agtaataccg ggcaacactt cttgacggat aaagtgcccg 4801 gtttggaatt tttctacatc tcttagagga agaggacgcg gaccgaccaa actcatttgt 4861 ccgagtaaaa cattaaacac ttggggcaat tcatctagac tataacggcg cagaaatgta 4921 ccaactcttg tgatccgagg gtcattcttc agtttaaaca aaacaccatc tttaatttca 4981 ttctttgctt cgaggtcttt ttgcatcttt tccgcattca caaccattgt gcggaatttc 5041 caaattttaa atttcttaca atgtagacca attcgctcct gcttgaagaa gattggtcca 5101 ggtgaatcga gcttaatcag caaagtaata agcaaataca caggtgagag tatcactaac 5161 aaaataatgg aacaacaaag gtcaaaacac cgctttaccc aaaaatcact gcctgcgata 5221 attggtgctg gaatcgtcat acaagggact tcacctatca tccatactac agatttcgga 5281 tgaaagattg tattttgtgt agggagtatt ctcagggtaa taccagctgt ttggaaattc 5341 cagcaaacat agagacgatt cttaatggaa ttccaagaaa ccaaagcctc tactatccct 5401 tggttgcgaa gatattccaa ggtttgttct ctgttcgctc tgtctagaca cttagagtca 5461 gaaattccct gtacagtgta gcaattctct tgttctatta acctaatatg actctcttga 5521 tcttctatgt cggagatgag aaaaacagga taacgaattg ctcctttttt acgaagaagt 5581 ctagtggtaa catcaaagat acagcgtcca gcacagatga aggctacaga gaaaaaccaa 5641 aatagtagaa aagttgagcg agaaacatag ctctctggtt cgtagagaaa agcaatcagc 5701 aaaaggagaa tgcctgacaa agaaactgct ttaatcagag caggataatt acgacgaaaa 5761 atacctgctt tatacagtcc ttgcgctgca atgagtccta tttctactgt taaaactagc 5821 agtagaaaag atgtttgttt tgtccaagga gattctaatg gagttcccaa aaagactgcc 5881 aacttgcatg ctaggattag ggaaataaca tctaggaaaa taagtgttaa tatccgtaaa 5941 aatcggatag ttaatcctct ctgtatcgtt gtgcctttgg ctgagcgtaa atcgggtttg 6001 aattttactc ttgatagagt tctggaagcc acgatttctg tccctcctta tatttgtacc 6061 gttatggcgg tttttattcg agttgaacac atttggcata tgcaaccata actcttacca 6121 ttaagggcaa gagtggttgt taataaatgt gattctgaat tgaaattaac gctttggagc 6181 tttgtcaaac gctcttgcta tagcattcca tgccaattta ggtttcatat taaaatcgaa 6241 aggtaaaggt cgtaatggtg ctctatctga gcggggaaac ctcgaaagcc aggtgtcgct 6301 atcagtaaat ccccaagtta tgactgcaat gacagctggt tcatctaaaa caatagacag 
6361 atagtcttca tagacacttg caacaatgcg atcgcgcaca tttatatctc taggcaactg 6421 attgtctgat acatctagct cagtaatcag aatttttaga ccaaggctgg caacatcacg 6481 tagaaaagct ctcagcttct ttggattgaa tggtttatct gcaaacaagt gagcttgtat 6541 gcctaaggct tgaacaggcg tacccttaga cttcaaagtt tttagcaatt tcagtgtagc 6601 aattcttctt gcttcatgct cgggtatatc atgatccaat aaagcatcgt tatataccaa 6661 cagcgccttg ggatcagcat ctctagcagt gcgaaaggca atgtcgatat aatctggacc 6721 caaaaacttt agccatggtg tgtcttgtaa accgtcaggt cgtccgtgtt tcggctcaat 6781 tgcctcgttg accacatccc atgagtggat tcgtccagca taccgtttaa caattgttga 6841 cacatggttt gttaaaaggt tctcaatttc tttagaggtg gttttggggt cttggaattt 6901 actaattaac cactgaggaa tgacttgatg ccaaactaaa ggatgtcctc ggaagagcat 6961 cccatgttca gacgcaaatt gggcaaaaga gtcagtatca ttaaagttga aattattgat 7021 actagggcga gtgacacccc aataaaaccc ccccactaga agaccgcact cccgaatgaa 7081 ggcagactgt aactgcttat ttctagataa attctcgtaa ccaaaagacg gaaatgctcc 7141 atagattaat cctttaacag cagcccgttt gcgtaaagaa gtattaccca caatagagaa 7201 atttctcttt cggttatatt gagctttaac ttgattgtaa tctttgaatc tattggcaca 7261 agcaaaggga accgtgcttg ccaaagttcc taatcctagt agaagttcac gccgtttgat 7321 gaagaatcgc attttgatta attgttagta cattaactct gctaacaacg ttttcacagt 7381 tttaccagcc actgaccagt tgagacgaga ctgatattca ttaaatgagg ataatgctag 7441 ttgtttgtac cttgagtact caacaaaaaa accagaaatg tagtcacatc ttcaatggtg 7501 gcttgtctag aatgcgcagt tagtctaggg aatttgtttc aaacagtctc cagccaaact 7561 tgcttgctag ccattgcatc acaaagaaca gcaaaaaccg tggattgtgc ttaaagtaac 7621 gtttccataa acgttttggc tccatgagca gcctaaacat ccactccata ccgcgtttct 7681 gcatccagct aggggcttgt ttgacacgac ccgaatgaaa gtcgaaagcg gcaccgacac 7741 cgagtataac ggcaggaatc cggtttttgt gcgctgccat ccacaattct tgtttaggac 7801 aaccgatgcc cacaaataaa attcgtgcgc ctgagtctac aatttgctgg gtgtaggcat 7861 catcttcttc gcgtgttaac gggcgaaaag gtggcgaaat ctgacaggca atcttaatgc 7921 ctggaaaacg ctggtgcaaa aattttacaa atgcaactaa gctttctgat gttccaccat 7981 agagtccgat tggtatacca gcttgtgctg ctgcttcgca tacgtagagt gtcagggtag 8041 
gcccgtagac tcgggaggca ttttttacac caagtgcagt caatgcccaa actaggggca 8101 taccatcggg tgttactagg gcagcattgt taactacact cgcaaataca ggatcgtcgt 8161 gaacttccat agtcatgtgg acgttggcga cacagatgta gcaacttttt ctcgccttgg 8221 cccagcccaa aatccgctgt gtagcgtctt cgtaactggt accgtctact cgcatgccta 8281 gtatatagcg gtgcttgaga agattccgtt gactaattgt tagtttcatt tcttagtatc 8341 tccatgttaa tacgcagaat acgcatgcta gtttgggtca aaaagtctta cagttcacaa 8401 ctcgctcata aatatccatc aacatctggt aatttctttc tgccgtgtac ttagcttcaa 8461 attcacctct agtttcctgt cgcatctgag caagcgcggc tgggtgagtt aaagcccact 8521 ctacctgtgc aaccagatct tctggatctc cagggcgaaa gtggaggcct gtacgaccat 8581 gttcaacaag cgaagactgg ctaccaaggt ttgacgcaat caccggaagt cccacagcat 8641 atgcttccac tatgactagg gggaaacctt cataccaaag agaaggaaaa attaaagcct 8701 gagcatcttt cattaacttg agcacttcct gcttgggcaa ccttcccaac cattccacac 8761 ctgtcaacct ttgggcagtt gaggctactg tatttgccaa tggtccatct ccgacaattt 8821 tgagtggtat ttttcctgcc agtttctccc atgcctttaa aagcgtctct aatccctttt 8881 cgggtgaaag tctccctaca aacagagcat aattgccttg tccctcaccc ggtttggggt 8941 cggggtaaag gaagttaggc ttgactacca gcttcgaggc tggcaagttc ccctggatgt 9001 acttttgtcg agcaaattcc gttacggtga tgtagacatc caccaccttg ttccaagttt 9061 gtaaggttcg gtggatagat tgcatagcac caattacggc actacccgtt ttgctgccgc 9121 gatagcaact gtgaacaaca cctggccaag caaaagattt tcccaaacag tcctcacaca 9181 ccttgccctc tcgaaaaaaa taagaattga gacaaaggag gcgatagtta tgcaaggttt 9241 ggattactgg tacagcagcc tctctacagg catagtagac tgatggagaa aggattggga 9301 agaaattatg tacatgtaca atatctggtt tgaaactagc aattctttga agaatttgtg 9361 ttttcgatga tattgagtag atagagttaa cagcagattt cgctttttct atggaattcg 9421 taatgtgatc attattcacc tcgaagagat ctacacaatg accatttgct tctaatagtt 9481 ctctctccat gctgacaacg acatcttcac ccccaggaag ctgataacgg ttgtgaaggc 9541 tcaggatacg cataaagaaa cctctttagt atagattgaa tctatttttt agtaagtagc 9601 cctacatgaa aaaacataaa atacgattct tgtgattgtt ttgcttgacc ttccgggtat 9661 tatcgctctg agggcacgct actttgtagc gtctggagaa agtgatacgc gtatgcctgg 9721 gggcacactt 
tgcaatagcc ctccgggttc gccagtcgcc tgcggaggga gaccctcccg 9781 cagcgctgga ctcaccgtaa ggcgtggcgt cagccataga gcgagtaacc tgccttcagg 9841 ttaccttcaa agcagcaagt tgtctgactg cattttgctc gctgcggaca ttttacgttg 9901 aatttggtcg agttgcttat tgctattttc taaattggac ttccgcgata caaaagtact 9961 attttgacga aaatttttgt gttgtaaaac acatagttat tgatcccaaa acctgtagct 10021 atatattcaa cctttgctca aaagaggtag ggcatttctg tcgtaaattt cattaagttg 10081 gctattgcaa gcagacacca aaagcgtaga tggatacaca gtttgagaac aatcgctcaa 10141 atccatgctg taaaaaaaag ctttttgtgg ttgtgtaagt atatttgttg aggtaacttc 10201 aggataaggc gcaattgata agtcgcttgt agtcttgcgc aataacttgg gaactaaaat 10261 cctttgctcg ctgtaatcct gctgaacgaa gtgtttgaga tagttgtgag ttcgttaata 10321 tctgtgctat tgcgtgagcc atttcttgag gatgagctgt aggaactaat aatccacatt 10381 tcccttgttc aagaacttca gaaggaccag atggacaatt tgttgatacg actggacatc 10441 ccaaagccaa agcttcaacg atgacacgac caaagccttc ataatcagat gacaatagaa 10501 acacctgcga tcgcgcaata tacatgagtg ggttttcaac aaatccaggc ataaaaacag 10561 aattggaaag ttctaacatt ctgatttgtt gttctagttg agggcgtaat tcaccatctc 10621 ccaaaatgag taaatttcct tcaatatcat aagactgccg caggtaaaga agagcttcaa 10681 tagcaatgtc aaaccgcttt tgactagtga aacgaccaat ggagacaatt gttggtttat 10741 taaagatgtg atgatgttct gaggggagag gtgattgaga agcctgaata atagcatcaa 10801 gctcaaatgg gacataaatg atggaaagat gttcagatgg aacaccataa ttcaaccgta 10861 aatcatctga aactccctca gaaacacagc ggatgtaaga gatttgagga tatatccaac 10921 ggcacagata atgatgtaat gatttagcac tatcgttaat ccactgactt aggtgaactt 10981 gaacatccgc aataaatggc ttacggtgcc agcgtgataa gagccaactg atgtaggttg 11041 gagaaagttc ggaagtggca attaccaaat ctgtatttgc aatttcacga ttaagttggg 11101 acagtgtatg atttcgatac caagaagacg cactttcagg aaaaagcact gtacctggaa 11161 aatcttgaaa gatatgggaa tctgttggta gtttcaaaag caatagtgtg agttcatatc 11221 catcgctctg aagctcctta gcaagcggaa tcaggctgcg ttccgcgccc ccaatacctc 11281 caaattgact aataatgaga agtttcacaa tattttactg actcttgata actcatctca 11341 aaaattgatg tctttaacaa acaagggtct ttaactctgt ttttttcaca acttccgatg 
11401 cgtagaacaa tgaaaacaac ctgcttcaaa ctcctaacag attctagaga ttaaccccta 11461 aatattgagt tgactttata ccctaccagt agtttccagc tgctaacttc cagccgactc 11521 ggagactcgt tgctaagaat ttcttaactt taaaagacca catgggttct tcaccacagt 11581 atctttgttc aatcattcgt ccttcttctc gcgatctaac cgacaatcgc tgagtctttt 11641 gttcaggata tagccgcatt tttgaccaag atcttcgaac atgatagaaa tcagtcactt 11701 cagagaagcg catccaaaga tcagcatcca tagctaaatc aaattcagaa ttaagtccac 11761 cgacttcttc atataaatca cgacgccaaa aagtagacgg ttgaggaata aagttatgag 11821 agtataagaa aataaaacgg ttaaatggta gttccttttt aattcttaga ggttgtccct 11881 ctacatcaat ccaagtacta tcaccataga caacgcgagc ttgagggttg tcttggaaaa 11941 actgagcgac ttctttgaga gtccaagatt caaataaatc atcagagcag agccaacaca 12001 gaatatcacc tgttgctctt aaaaaaccct tgttgagtgc atcagtttgc ccttgatccg 12061 gctcacttac ccaatatgtc agttgtttgt catattgacg gagaatatca acactgccat 12121 cagttgaacc accatcaatc acaatgtact ccaaatttgg ataatcttgg tttagtatgc 12181 tctctagtgt cattttcaaa aaaggagctt gattgtatga aggagtgaca actgtaattt 12241 tcgggtattt taaattcatg tcgttaattc tccaatttct tagtgatagt tagcatctca 12301 ggtgaaatta tttcttttca tcactcatga atgtaaatga cgttaagaga tacatctccc 12361 taatgttgat ccataatttc agtgtagaca gataaataac gagaagcttg aatctgtata 12421 gtaaactctt gctctgcttt ttcacgagca tatttgcata gcttctgatg tcgttctcta 12481 ttttctagaa cccaggtaat tccttgggct aaatcttcaa ttatgtatgg atgagataaa 12541 tagccatttt tctggtgttc aatcatatca ggcataccac cgattttgaa agcaacacag 12601 ggagtgccac acgcaagagc ttccatcaca gtattaggta aattatcttc aatggatggt 12661 gctacaaaag catctactgc tgcgtatacc agagataatg aaatattgtc ttccaactta 12721 cctaagtaat gagatttaaa tcctaaatcg acttcattct ttgattgtga agaaccaaat 12781 atcacgattt ctattctgtc tcgccattca gatttcatta aagattgcaa tgctgagagc 12841 aataggtgaa atccttttcg tttatcgtgc attgcattca tagctccaaa aagaattaat 12901 tgtttctctt gaggcaaatt aagtatttgt cgtgctattc tgcggtcaat tggtttatat 12961 atcgtagtat ccaaaccatt gggaatgacc tcaactctca catttttaaa aatagaactc 13021 gcttgagcac acttagctag ccaggcgcta ggagaaacta 
atatgagatt gatttctttc 13081 caggctttag ctttacgttg ccattgccaa ccagacatat ccatgttttt ctggctgcga 13141 agttgaggac aagcgccaca ggaattcatg tagcgatcgc agttctggct gtaatgacag 13201 ccccccgtaa atgcccacat atcatgaaga gtccagacta taggcttatt tagcttcgca 13261 atggtttcaa cctccaagta accattacaa atccagtgca gattcacaat atcaggattg 13321 agttgtttaa cttgaggacg gagtgtatct ggaagccacg agggataata atcagtcagt 13381 tctcgctggg ggtaaaaact cagcggcagt ctacttaaag atggtctaag cttggcaact 13441 cctttttcta acttagtgtg tggagcaata actgtttcat catcactatt tttcgcttgc 13501 actaaaaatt gggaactgat gccaatcttc tgtaaaccct gatggagtcg atatgcagcc 13561 cgacctgctc ctctaccatc gtaagtactc atctgtaaaa ctttcataga taaattccct 13621 cttttctcta ataaattgct attttatata acattttcat ccacgccgaa agcatcttgt 13681 tttcgcataa gtttgcctat cattttgagg tacggatatt tatctaaaat aaagcttttt 13741 taaaaacatc tttatctatt ggataaataa gttctgtttt gattgagtaa aactgtcatt 13801 ttaagcgaag ctatgagaat tccgagcaca aacaaaatat cccaccctga ttcaatatat 13861 gcattagaaa tcgatgccca tatacaatat gctacaacac aacaaaataa tgtttgtccg 13921 aacagggcta cctctttttc ttgtggaaat attgacaact gaaaagattc tttcaataga 13981 gtaagtagaa aggctatata tataaaaagt ccaaaaaatc cccaagatac ataactccat 14041 acatatacat tatgtgaagg aacaacttca aattcattta attctcctgc accccttaca 14101 ataccattta taaggcaaac atattcccaa ccacgagaac caaaaccaaa aagaggtgaa 14161 ttttcagatg tgtaattact tgcaggaacc catactctct ctacacggaa tccccaactt 14221 ccttcacctt cagactcttt ggaaaaagag gatccagcaa agtcagccac atacagactg 14281 tatatcaaaa caaaggtaat taaaaatggc acaagtttga ttaaattttt gatagaaaga 14341 ctaaaatgga ataaaaaaat aacgagtgga aatgttataa acagggctgt tcttgtacca 14401 ttaaagaaca aaattatagg gatatatagt aaaagaaaaa ccttccataa gtttcttttg 14461 gaacctcgtt tttggctgat atagaatagg gaaagcggaa ataaagtcat taagctatta 14521 gcatcttcat tggcagctcc aaatagacca aaaatatcct tgtcttcctg atgaccatag 14581 gaggaaataa tactggataa actctcagag gaacgctgaa ggagtattgg atactgaaca 14641 atagttgcta aaaatacaaa tagtaatgtc aacagcaaaa tatgtctgac ttgacgagtt 14701 gtttgactgt ttagactcaa 
tagcccacac agatacaaca gacaccaaga aatatatttt 14761 gtacaaccag aaagatgagt tgatacagca gatgaaggaa ttgctacgat ttggaacaaa 14821 aaaagtattg gaagtagtag caatggccag ctcatactaa ttgtaaatct tattctcttc 14881 tcatcaactt ttcttaattg ctcaatcaag acaaggacta aaagcggaat aaaagctact 14941 ctgagcgctc caataacacg taagttaaat tgagggagcg ataagagaag gtcagccact 15001 aataaaaata taggaatgta aaagatttta attttcattt tttatttgga gcaagctttg 15061 tcagttcgct tgcgaaaatt caaatcgtta tcaatggaag tggctataaa aggttatgtg 15121 caatgacatt aggatgagca caattatagt caatcttaat cgagtatttt tagtattttt 15181 atccacatgt aatatttact ttttccatta attaatgaaa gactgatatt ttcctatagc 15241 agaaaatttt gctcatactt tgattccaga ctttacttgg gtttgatgag ctttttaatt 15301 ctctgtttaa taagttgaaa cgctgtttgc ttaaacaacg tttgttgtgt gaatttatct 15361 gcttttagat tgcgaactat aaagggtgga tgcttgagag gaaattccat tacctctgta 15421 ggcatattga tgtaagggct tactttttta gaagtgaaat gagtcgattc taaaccaaag 15481 ccaatattag aaattaaatt tgccttcggg ataatactta gagcactctg tatccaacac 15541 gcaaatgtcc actgataatc ccaaatttga tctgaagggt tttcaaaaac agactgaaaa 15601 agattactcc aataccttac tgccttatag tcttctaaaa tatcgcttag acaaccttcc 15661 gcttgaatct gtggccaaag cttcatataa atatcaaagt cctgccaagc ccgtctccag 15721 cttgcccaac cccagcagtg gttgtagata gaaaagtagt aactataatt tgttcgtcta 15781 tgacctaact gataatttga cccggaaatt gaagcaactc gggaatcgta tctatatttc 15841 tcaagtaatt cttcacaaaa ggggaaaaat gtaggatgag gtacgcaatc atcctctaaa 15901 ataattgcct cttcaacatt gctaaaaacc caatcaatac cactgacgac acgttttgca 15961 cagcctaagt tggtatcaga gtaatttttg ataacttcgc aattccaatc aactcgctct 16021 ataattgcac gagtagcttc gcacttttct gcctcaccag gatgttcagc acgggatccg 16081 tctgcgatca caaaaagttt tggtggttta gcttggcgga ttgcttcaaa tactttttca 16141 gttgtatgag gtcttttgaa gatgataaat gctataggag tttccatata gattcttttg 16201 ttgaagtcgt caccattttc tttttcttat taagctttac tcatcgtcgt tactattttt 16261 aacatgagtt ttatccaaat acgtcttaga atcgtaaacc gccaccctgc ccacggtgtc 16321 ttatcaacat aatgaaaaaa taagtcgttt gctggatgat agacagtaga attccaaggc 16381 
ttactacttg acgcaaaatg tataatgtag ggatctttta gagcatttgt ccattcttct 16441 tgtgaaaatg ggctatcttc gcaggaagag ttattatata tgctaggaag ctgattccat 16501 ctgagttcaa gttcagtcca tttacctgca agcacagcat ttagtgcatc ttgatcgtgc 16561 caacgaatat gttctttgtt gtcttgaaga tattgaatca cttgcatgct gatgttttct 16621 gttcgcatct tctcaagatt gagaacaaaa actcctgcgt taaagtattt ctggtgtgag 16681 ggaattccca gttctttata gttttttagt ccataaagtg aggacacata aggtattcct 16741 atatctggta cggcaagtag atagttatcc tcgatgttga tatcccataa tttcgccaaa 16801 tcacccttta ctattaaatc actatcaaga taaattgctt tggaataaga atctggcaaa 16861 agaacaggaa taagtatctt ataatatccc gcaactgtga tatgacctga taatatcacg 16921 ttattaagca tggcatcagg tggttgtaac cagttgataa gacattgctt tgaatcagtt 16981 agttttaaaa ttctattttt atttttttta ctgatacctc catcaatgat aaaaaaagta 17041 cagtggcgat tactgctgag attttctaat actgaacaaa tgacaactgc caaaggcatt 17101 gcatatctat catcagcacc acaaacaaca ataatagatt ctttgcttga aggagtgacc 17161 attgtatttt tttgttgtta agtaagttgt accaagaaag aactcagaac acagttgtca 17221 gacaattttg aatcttaact tatggattcg ttccaaagga tttatccggg ggcttgtacc 17281 atcaaggaat cattggtcag aaatatctgg ttaattgata tgtcggtgat tcataacttg 17341 agttttcttt ttctaaatag actataagct tttcagtata atttttgata tcgtaagatt 17401 tttcaaagat ttttatacag ttataggaat attcactata atgagtttta attttttgaa 17461 tagctgaagc taattcatca attgagatta tagagactcc tactttcttt tcttctacat 17521 attgctgcaa attagtgtga cccttaacaa tgactggttt tccacatcga ataaactcta 17581 caaactgacc tgacgcattt gaggtataga aatgatttag atcttcaaca gcatagttga 17641 taaaacctat atcacacagc tgaatcaact gaggtacttt atcaggtaac actttgaaag 17701 aacttataag aggtaaaact tctactttgc ctaatagttc ttggatttca tctgaaatga 17761 agccgtgaag gaacaaatca aatttatcaa agttttgttg atggtattgt attaaatcat 17821 cactaccacg ccaactagca atggaaccag actgcattaa tgtaggaact ttagctttgc 17881 tctttagttc ttttttgggt acttgtatag gtggtaagga tacaggcaaa aagaaaacat 17941 tgtctaaagt acctttatat ccaatgcttt ccaataaaaa atctagacgt tctttgtctt 18001 ggataattaa ttttttattt tttgccaaac tttttcttgt aaacttagct 
attaccctgt 18061 gaataaaagc agtcctacaa tagtttcttg gttcatcata acctagtaag tcatgactcc 18121 aaagaactac ttcttgtttc agaatgaacg aagaaagaac atataaaaag ttatcaacag 18181 ctacaacagc gtcatatttt gtcttcccca atttaatttt gtgtaaacga aacacatctt 18241 ttgttaaaat gtataaacgt ttcaatgatg aatagatttt accttttctg ataacatcat 18301 aaataatgtt ttctatccaa agtcctcgtt cctccatata aaaatagtca attgtataat 18361 gaacagatag ttcatcaatc agatacttca aatgaggata taaatcctga tcgttaggaa 18421 atgggctgac tattagtaat ttttttgatc tcatgtcttt actctatcaa tgtgatgatt 18481 aattacaaga aatcgacaac gaaaacgcta ccccacgtat gaaaccccaa tgctcttctt 18541 gacatccatt ccatatcatt aattattttc aaaccaacat ctttaagcaa cagatctaat 18601 tctggcttaa acagataacg catttggtgg gtttcttgaa tctcttcaac cgcaccgctc 18661 gttttatcct gaataaaaat ctgatagttc acatctacca agttctcatt aggataaatg 18721 acaggttccg caactctagt gaccttaatt tcttcatctt ctagtcgctt gacgcggaca 18781 ctcgggcgat cgctcaacac tgcaggtcca taccatatat caaaaacaaa aattccgcct 18841 ggctttaaat gttccttgac agtagtaaaa gctgcgagta aatcctcatt cgtagtttgg 18901 tagctgatga catgaaaaag agagaggatg acatcaaatg tctgattcag cctgaggtga 18961 cgaatatctc cgtgagtaaa tttcagttga gaagctaatt ctggatcgag ttgggagagg 19021 cggctgtcag cttttcgtaa catctcctgg ctcaaatcca ccccatgaat ttggtaccct 19081 tcctttgcta ggagaactgc atgatttcca gtaccacagc ctaactctag aatattctgt 19141 gcatttggtg catgagtttg aattaactgt tgaataaact tcgtttctcc aacataatcc 19201 ttatctaggt agagaagatc atagtaacgg gcatagttgt taaaaatact cataatcaat 19261 catccaaact taagttttca gtaaaattcg tgagaaaaga gaacggagtt atcttgatga 19321 aaaggaatga tacaagagtc aactctaatc ttgcaaaata ttgattgaca tcatgactac 19381 gtcgcgatcg cttgttttta tatccaactc caccacctac tcgttaaaaa ctcaatcgcc 19441 aaactgctta agaaagaaga tgacatatct gccccattat gtgttgatac atatatctgg 19501 taattggtaa ttccagtaat ctttcctgag aacccagtct agtaattccc gattgcataa 19561 gacactattt tgatgaaatt ggtatcccct ttgataagct ttcaaaaaac gctcattcaa 19621 tttatccttt tctgtttgct tacgaaacaa ctttagacct atttttttca aaaacctcaa 19681 aggttgttta attttattaa ttagactgac 
ctgctctaat ttctcatttg gttgcctaat 19741 atttgagtaa gaatcacaaa ctccttgata aaaagaacgc tgctcaaagt attcatacgt 19801 cattctttct ttaggaacac tatgaaatac taaggcttta ggttgataaa tggctttgta 19861 tccttgggag ttagccttct gagttaaccc tgtttctcca tctccttgca gatactgcaa 19921 atgtttagga atacaatcag gatgaaatcc accaagctcg aaaagagcac ttttacgaat 19981 tgagaagttt aagccccaaa catagttagc atcaatgttt cgaacttggt caccaaaatc 20041 taacaaactc agataaccac agagtttacc gtagggatgc tctaaccaaa accactctag 20101 ccactcaggt ggttcaactt cgtacttagg taaattacgc ccacccacaa tttgcacact 20161 agagtcatca aatgattctt gaattgcttg caaccagttg acatctgctt caatatcgtc 20221 atctacaaaa attaaaatat caccctttgc ttccaatgct cctctatgac gacctgataa 20281 caatcctggt tcaacttcgt aaatatagtg gattttatga aaagggtatg cagcaatagc 20341 agcttctgta acttgtttgg tattgtcagt tgaaccatta tcaaccacaa taatttcaaa 20401 tttatcaggt gagaagtttt gctgacaaat agaattgatt gctaacttca gggagatcgc 20461 acgatttaat gtaggaatga taattgaaat catatatgac ttaagtagat aggcgggaat 20521 aaacaacgtt atgtaaacaa aggtaaatag acacttgcga tcatcgctca aaacaacttc 20581 agtggaatga aatcgaaaac gctcaatgct cgcggtaact cctccactct ctgcttttgc 20641 gccttagtaa gaaaaattct cactgcaatc tctcctttac acatttttac ataattccgt 20701 ttcttcacgc ccatccactt aagcctgttt cattttttga gaattctcct tatttcaaca 20761 tgtccttaac tgccagtgcc acctgtgtca tttgttcctc cgtgagtgcc agtccactag 20821 gaacatagaa acctcgacga gcgatgtttt ctgccaccgg gcaagattct tcttcaaaca 20881 atcccatctt tctgaacaca ggttgttcgt gcatacacca aaagaaagga cgagtaccaa 20941 tcttgtgctt cgcaagatac tgcatcgctt cttgagcgtc aaacgctaca ttgtctttca 21001 gaactagacc gtacacccag tatatatttg ttgcatatgt agtcagcggc aacggtaatt 21061 gtacgtcaga gatatttgca agtaactctg tgtaaatttg tcccatccgt cgcttacgtg 21121 ctacaaactc atccaaacgc tctaactgag caacgccaat cgctgcttgt aagttagtca 21181 ttcgcatatt ccaacccaac tcttcatgca caaagcgttt ttggggttga aagcacaaat 21241 tgcgtaaaga acggcaacgt tctgccaact tctcatcatt agtcagcagc attcccccct 21301 cgcctgtggt gatgtgcttg ttagggtaaa aactgaaagt actgatagtg cctaagctac 21361 cacaacggcg 
tcctttatag gtttgaccgt gcatttcggc agcatcttca ataatgtgca 21421 gcccatactt gtcggctaag cccaacactt gatccatatc cacaggtaac ccatagatat 21481 gaacaaccat aatagctttt gttctgggtg taattttgga ctcaatttgg ttaacatcca 21541 tattccaggt gtggggatca caatctacca ctactggtac agcaccagca cggacaatcg 21601 ccgccgcaca agaaataatt gtgaatgtgg gcagaatgac ttcatcccca gagccaattc 21661 ctaatgctgc aactgctgca tcaagagcaa cagaaccgtt gcagacagct attccatatt 21721 tgcatcctac actggcagca aattgttctt caaattgttt aacaaagggt ccttcagaag 21781 aaatccaacc agtttcgata cactcaaata agtatttctt ctcgtttcca tctagaagtg 21841 gctcgttgac cggaattggc ttcatagctg actcaactct ggtattttag ctgactctgc 21901 agaaatgccc acaaaccggg ttttatcctg ctctcccaaa taaggacctt gtttaacctc 21961 aatcatctcg acttcttcaa gcacctcaaa accatgccca ccagtgacta gaagaattac 22021 atcaccagct tctaacattc ggctttctag atacttctgt tggtcgttgt aaaagtcaac 22081 acgcaacttg cctcttttga gaaaaagaac ctcttgagta taaagcactt cacgaggtac 22141 ggcattatgg acgtgaggtt gaataatttt tccttttgga tggtgcatat acgctaattg 22201 ttgggaaagt tcgttgggtg tgaagaagtg aatccctggt ttgtcaaatt tatgggaaat 22261 aatcacagca agtaactgat cctggtagag aatttgttct atcatgattc attcaagtcg 22321 agtataggta atttataata ccattagcgt catctgaaaa gacagccgac tagctctact 22381 ttatgcattt ttcacaaatc gcaaatgcgt atgctcaaac cctgatttct cagtgaaatt 22441 aagttttttg atgtcaacag aattcctcac aatggaaaaa gttttgtgcg actggctata 22501 acttttgcca aaaaacaagg ctggcgttgt tatttttgga taaagtcgat atcattatgt 22561 atttttatag acagcattct gaacaattaa tactcccgaa aaagttcaga attacaataa 22621 attgcagctt gttatctctc atcttattcc attgaactac ttacatgaat ccaggtaaac 22681 acttattgct tgtaaattta tgttcaaatc tttactcaga gagaatgcaa taataaccaa 22741 tttccagaat ttgatttcat tcaactttca gtgtttctta tttacagaaa acttcaggat 22801 gagatagttt gtcccaagac aacctcctga taatttcctg acttgactat ccttccacgc 22861 tcaagtacat agatgcgatc gcaatgttcg acagtggaaa gtctatgagc aataagaatt 22921 aacgtttttg taccactcaa tgaccggatt gcctcagaaa tcaggttctc tgtttcatta 22981 tctaatgctg atgtcgcttc atctaaaatc aaaatctctc tttggtggta aagcgctcta 
23041 gcaattccta tccgttggcg ctgaccacca gacaaccgta ccccacgctc tccaacagca 23101 gtttttattc catcagggag ctgcgaaatc aattcttcaa gttgagcagt ttgaattgcc 23161 ttttgcaatt tttgtgaatc aatttgctca tccagaacac caaaagcaat attcctctcg 23221 atcgtgtcat ccattaggaa aatagattgc ggaatatagc caatcaaatt ttgccaagag 23281 cgtaagttat catatattga cactccatca actctaatat ctccatgatt aggtatcagc 23341 agacctagaa taacatccac caaagttgtt ttacctgccc cagacttacc aataagagca 23401 atagattgac ccttcttaag agttaaagaa acattttcta acgcattttc tgaaacatta 23461 gggtaagaat aattgatttt gttaagaatt agaacgttac gaaaatttaa tttttgaatg 23521 gaagttgatt taccaatcgt aacattggac tctgatttat ataaactttt tatagatttt 23581 atactttcta atttattggt ctctaattct ttcaaatcga aatacaactt attcagtgag 23641 tagcttgaat ttcttaatgt accaattgca gacattaatt gactagccgc tggaatcaag 23701 cgaattgagg cgacggcaaa aatgcttaaa attgagatta ggttttgtgg attttgattg 23761 aatactaaag atacagaaat gaaacatact ataaatgtaa ctagaagtgc ttcaattgca 23821 atacgtggta agctttgaaa aacttgaaat aaactacctg tcaatgcgtg tctctgagct 23881 tgtatgttca tctgactctc aaagtacgat tcgcaaccta taactcttgt ttcttttaat 23941 cctcctacac tatgattgat aatccgaatc atttcagtat ctgattcact gccttcttta 24001 ccccaataag ccatctggtc tttaaattta ttgtaaaaag caaatgccaa tgtcagcatt 24061 aacaaaatgc tagctgtcgc cgaaaaatct gttttcagta acagcaatat caaagcagat 24121 acaacaacac tatttgctgc agaactcaaa agaggaagtg tgacagagta acagaaaaga 24181 aaagtttcgt gaatgatatt ctgaattaat aaagcactat ttttatttaa atgataagtg 24241 taattaactg ttaaataacc atgcagtaac ctgagtttaa gtaatccctg ctgagtgaga 24301 ctgaaatcat aaatgtatct ttgaacttga aagtacaata atgatttaat gccaaaaata 24361 ataataattc ccaaaccgag tagagcaata tattgactag tagattgaaa acctgaattc 24421 acgtagcccc aatttaacca agagctttta aaaactaaat ctgggtttgt ggctaaactc 24481 ataaacggtc ctactagtcc aatacctaaa gcatccaaga cagaaattaa taaaaataaa 24541 cataagagaa taaaaatcgt tctttttttc gccgatataa cgtataaaaa ctttgaaaaa 24601 tattttttca ttttacttag ttaattagat taacactcca atcttgtacc ttcaaggatt 24661 tgatataaca gctgaatcaa gaagcttcaa tgaaaagctt 
gagcaacaat aatgattcac 24721 caatcataac cattccgaat tttgattgtc ttagtcaacc gttgccttta ccttaacctt 24781 aggtataatt ttgtctgcta acatatcaag caaggttact agaaccattt ttccttagaa 24841 tcactaaatc ttgaagattg tctataactc ctttttgtat gtccaaaact tcaaaattgt 24901 cactccattc ttcacgaata tatttttcgg agtgataggt tgtaccataa gattggtcta 24961 ttcctggaaa gagtttgttt ctcatttcag aagagagatt aactcctggg tattctaaat 25021 ataaaattcc ctctttttcg agtttttcag gtgtgacatc aagattcata ttttttgttt 25081 tttttatagt taaagaaaag tctcctaaaa cagtaattaa ggctaatcca ttaggtttta 25141 aaattctaga gatttcattg agccaaggtt tttgctttga cactggcaaa tgagtccata 25201 ttgaaacaga ataaacagta tcaaaaaaat tatcatcata ctttaaagga gggtcgtatt 25261 gattacaact agcatttgcc tccggaaatg cctttgataa ataattgatc gctgaagagt 25321 caacatcggt agcaaagatt ttaggtatgt ctggataaaa aaattgcaga gttctaccac 25381 acccagcacc aaaatccaag atctttaaat gattgatttg attgctttga taagtcacaa 25441 ttgacttttt aattggttca tagcaatttc ttccactttt caggaaaccg ccaatcttgt 25501 agtgagcacc cactctgacc ctattttcca tcggcggaat cggagcatag tagttattaa 25561 atattctata aaaagagacg tagagaggat caaaaacgaa aatcaatcgc ttaatcagat 25621 atttggatct agaaattgct tctgtcatta ttgaatctct atccattaat gaaattaact 25681 gttctgttta ttgacaaagg cgatcaactc tgaacttatc acctatggtc tttactaaag 25741 cttcctcatt aattccgtct tacaactaca gaatgtcgag acacactacg actcgaatca 25801 actgcttctt ttgtgtagta aaagtagctg tcaggctcac tcttgacatt cacaccatta 25861 atcaccatac ccaacacatt gtgtcctgac tgtttcaaaa agtctttagc cgcattggca 25921 ctcgaaaaat caaccacccc tggacgaacg acccataata taccgtcaac tagattgcct 25981 aaaaccgcag catctgcgac tcctcctaaa ggaggggtat caaagatgac gaaatcatat 26041 tcctctgtaa acgagctaac aagtgcagcc atacacttag aatctaatag tgccactgga 26101 tttggaggta caactccaga gggaataacg tctaggttag gcatggcttc ctgtaaggcg 26161 acattaagtg tatccggatt aactatgaca tgacttaaac ctacagcatt cgtcaaattc 26221 caaatatgat gctggactgg atgacgcata tcggcatcga ccagcagaac ccgatgcccc 26281 acttgagtca ttgccatagc taaatttgct gctacctcag atttaccctc accagacaca 26341 gaacttgtca ccacaataac 
tcttggttcc ctatccgaac ttaaaaactt caagttggct 26401 tgcagcattt ggtaggcgtc accaacaggg aagtggggaa tgtctctacc aatcaccttg 26461 ggaattgatt gatccacctc tggcactctc aaactcttct tctgatttct acctgttaaa 26521 ggtatgaccc cgagtaaggt gtatttgaac aattccttag cttgtttaac tgtcttaagt 26581 gatgggtcaa ttgtatctaa agcaaaggca gcaacggcac caaacaggac acctaaaatc 26641 cctccagccc cgataatcag aatttttcca ctcccaattg atttatcagg taccaaggca 26701 ggggagatga cacgagcatt tccaatattt tgattttctg ctacttgaac ctcttgcagt 26761 cttgtcaaga gtgtttcgta agtcgtttga gccgctttga gtttccgctc aagttcccgc 26821 tgagtctgtt ctagcttggg caagatgttt gctcgttctt tgtagacagt ccgttggtta 26881 gacaatgtag cgatttgttt gactaaacca acgcgttgtg cttccgtgcc agcataatct 26941 tccagtagct tttctcgcaa agagccattc tgtaaattcc ctagagaaat ttgctgattg 27001 ctgccagtga cctgcttcat ccgctgttgt agcacgctgt tgagagcagc aactttttct 27061 tccaagttga caataatagg gtgttctggc ttcaaacgag tacgctcaac tgcaagctgg 27121 ctttgtgctt cctgtagttg agtcagcact tgctgaattc cagatgcctg agtcaatgag 27181 gaattggtta gagtttgcga cgaatcaatt ttcgcttgag cctgtaactt ttgtgaccta 27241 gcagtgacat cttctagttg agcctgggct tgagaaattt gctcatttaa tttggcgatc 27301 gcttgaactg cagcactggc ttcttcttga agggtgatga ttttattgtt ctctttgaat 27361 tgacgcagcg ctgactcagc ttgttttaca gatacctcag ttgtcggtag ttgtttttga 27421 ataaacttgc gggctgagac tgcctcttgt cgattcgctt ctatgttatt tttgatgtaa 27481 gcctgcatga ccttattgac aactctcgca gccagttttg ggtctttgtc aatgtaagaa 27541 acttgcagga tttcagttcc tttgagactg ctgattttaa gtgaattggt taattcctca 27601 attgtgatgg ctttgccctt atcatctctg aggttaagtt ctgcaatagt ttcctgaatc 27661 actggaacgg atgccactat cttcgcttga gtctcggacg gattactttc catagttaaa 27721 gtatcaagcc gtccaatcgc ttcccccaaa cctgttagag aggaggtacg attcgtctta 27781 atcaacagac ttccctctgc cttgaacgtg ggttttgatg agaatgcaag taaggatgca 27841 agggtcacaa caacaccaca aacccctact gcaggtagcc agcgtctttg caggatcagc 27901 cagtattttt gaaaatctat ttcttcagag tttggtacag gctccatagc agacattgat 27961 tttcctatcg catttatgat tggtattgtc tgacgcactc aatctccaaa ctggaaatct 28021 
ttacttccgt tatagctaac catctgtgca tgtctgcgag taaactcgcc tacggaagga 28081 tttccctctg ggtgaatgac ccctatcgct tgttgccaaa gttctggaga tactatcgat 28141 acctcatagg tgcaactctc cttgttttga gttttagtaa cacgcctttt aacatggtac 28201 catttggttt gttggcattg gtcacaccaa aatgcctcca accattcacc ttccagacaa 28261 accgcagtct tggccgctac caaaatcaaa gcattttgtc ggctaattcc ccgttgctgg 28321 agttgtccag cacgatccgc aaatagagga tacttttgac taacgctctc aagatagcac 28381 ccatgaattg gacagtagat cgcccgtctt tttgaacggt tccgatttcg gttacatctt 28441 ctatgcaatt tgacctccag catcggtaaa tacaccacat agcaaaataa gcttacgttg 28501 gagaaactgc ggttaggaag accgtatcac ccaacaccta ctactttaaa atctcctgtg 28561 tcattaaaat caaataaata gttaccgaca aacagcttag taatcagtag tcggattttt 28621 tcttaaaggc aaaagaaagt ttctatcatg ggtcgtcatt tgtgactcac ttaatcaaat 28681 tgaaaaatac actccattgc caagaaatta acctattaag gaggtatgag cctaggttgt 28741 taggtagaat taaaaacttt cataccactg gatagaccgt gagcttgaac gtttttgatt 28801 ttagttacac tatttctctg attcggcctt cttatgaggc tacgttggag ggactgagat 28861 caaaaagacc atcaaagccc aactccggac atcaacaata atcagatatt ttttgacaac 28921 catatatcca gtgaaccgat accagataat ctatagagat tgctactatt tccctaatac 28981 tctacataaa cttagcctca aataaatact gcatatcctg aatacagttg aaaactagat 29041 atgtttaacg tgtaatcaac atgttttaga gtataacaaa tctctgttgt catttgattg 29101 taagcagaag tttgtttgaa attctctcca aaagtaaaaa caaagtacag attttgacca 29161 tcataagcct tactcaattg ataaaaagtt tgccaaacgg attgccagaa ttgttctaga 29221 gcttaagtaa gtagtcattc ttctcagtaa ttgtatcgta ctacatattg ctttgcaact 29281 acctgagttc agtctaatgt ttgaagaatt aataaaacta ttgggggatt gataattatg 29341 actgttgagc attcgccagt tatggcatag agtttaatct tttttgactt tctctatata 29401 tgctagcagg aaattatact tagccatata nnnnnnnnnn agccatataa aaatatgaca 29461 aaaatatgtt tctaattttt ctaatttttt gaatttttgt tatgaaaaaa cattgacggg 29521 aacagtataa ataaggagaa ttccaaatgt gtaaattcac agaaagggta attactgagt 29581 gtctaagcaa taggtaacac tcacgaacgc ggcggacgaa tacttttctg aaattggcat 29641 cttctctgga atcatgtatt aaaaagtact ttttatctcc acttgaaaaa 
cacagtgaag 29701 atactgattg aacaagatac taattatagt gttattaaca tcctatcttg aatgaccgga 29761 aaaggagggg gagaataaga gagtgaagga attagaaaaa cacccccctt tacatggttt 29821 caaccccaac tggtcaaact atttatcaat aaaaatacgg atggatctgt aattatttaa 29881 ttcaacaact aaaatttgcg gcgatgctcc tttaggagcg ctagcattgc tgctcgccat 29941 gtttagtatc tcgctcaaaa ctgccgaacg cttgtgagaa ctgcgtaagt cctgaaatca 30001 ttagaccaca aagcgtaggg tgacctggct cctttatgcc ggggttgggc aaagttttga 30061 taaaaatgga caaccctgta gtctaattgg atcaacctta aagtgaaacg gtattagagg 30121 ttgtttgtaa actcccagaa ggtataattt tgtcgttctg atttgggtca actgcaacgc 30181 aaaattaagg ttttaggcat gcactgagat atttcgctgt gcccactatc agagcaaaag 30241 acacttttaa aacaccctct taaatatatt ccccgccctt aacgtagagt acaggacgag 30301 gaatatcaat agttgtgttt tcttcaactg agaactgtca aaaaatatag ttcatctttg 30361 gctgctttcg tcgcactaag acttaggctt atatgttgaa gcaacatgca actcttttaa 30421 ttgctttgcg tctactactg aaggtgcatc tgttaacaaa caacgtgctt gttgcgtttt 30481 tggaaaagca atgacatcgc gaattgattc ttctttcgct agcaacatca ccaagcgatc 30541 taaaccatag gcgatcccac catgaggcgg ggtaccatat tcaaacgctt ccaacagaaa 30601 gccaaactta ttatgtgctt cctctgggga aagtccaatc gcctcaaaca cctgctcctg 30661 aatttcccgc tgataaatcc gcagactacc accgcctact tcaaaaccgt tgaacaccaa 30721 gtcataagct tgtgctcttg cactttttaa gtcgctcaaa tcatcaggat ggggtgcagt 30781 aaacgggtga tgcagtgctt ctaggcgatt ttcctgtgcg ttccattcaa acatgggaaa 30841 gtctgtaatc cagagcaagt tgattttttc tggatctatt aacccgaatt ctctagcggt 30901 aacttggcgt aacctatcta atgttttatt gacagtagca gtgtcagcag ccgcgaaaag 30961 cagtaaatga ccaggtttcg catctgtccg gcgtaagatt tcctgttttt gctctgcggt 31021 gaggttgtct ttaattgcac caatggtgtc gatttctcca tcatctctga ctcggatgta 31081 agctaaacct ttagcacctg catcagtggc ttccttgaat aaatcaccgc ctggtttgat 31141 gcggacatta gaaattgaat cgttcccatt ggggatggga agaattttga caattcctcc 31201 gttggtaaca gcttcccgaa aaaccttgaa gccacagtct ttgacaacat ctgagacatc 31261 aacgagttct aaaccataac gtgtatctgg tttatcacta ccatagcgtt ccattgcttc 31321 ggcgtagact agacgcggga aaggacgttg 
taattcaatg cctttaactg tcttgaagat 31381 gtgacaaact aatttctcgt ttaattcgat aatttcttct tgggacatga aactcatttc 31441 catgtccaac tgggtaaatt ctggttgtct gtcggcgcgt aagtcttcat cgcgaaagca 31501 acgggcaatt tgatagtatc tgtcaaaacc tgataccatc agcaattgtt tgaatagttg 31561 gggtgattgc ggtaaagcaa accattcact agggttgacg cgactgggta cgagataatc 31621 tcgcgcacct tcgggagtgg aacgggtgag gacaggggtt tcgatttcta taaaaccttc 31681 cacgtcttct aggtaacgac gcatggcttt gacaacttga tggcgtagtt gcaagttatt 31741 tgccatgcga tcgcgccgca aatctaaata acgatacttc agccgcaact cttcccgcac 31801 aggatcggcg tctgcggtgg aaacttggaa tggtaactgt ttagcgactg agttaaggag 31861 tttaattgta tcagcgtaaa tttcaatttc gcctgtaggt atgcgggggt tcagggattc 31921 ttcaggacgt tgcgtcactc tacctgtgat ttcaacaacg tattcatttc gtagagtatt 31981 tgcgagttca taagaatttg gggtacgctg tggatcgctg acgacttgaa caatggcagt 32041 gcgatcgcgt aaatctataa atatcacacc tccatgatcg cggcgacgat ctacccatcc 32101 gtacaaggtg acagtttctc caatgtgttc ttttcggagt tcgccgcaat agttagttcg 32161 cataagtatt agttcttaat ggccgagcta gagaattcaa gtcaatgtta aagctttttc 32221 attatctagc atcattagtc attttaattt tagaaatctc atgcaaaaac tgacaacaga 32281 tgaaacaggt aagcacccgt gtcatctgct caaaaattcg ttgctcgttt tgacgtgaaa 32341 ttaagcaaaa ttaacgttga gggcgaggac tactcaaagc atctcacgcg gctaaagctg 32401 catctagtca gcgatgatcg cctagttcgg cgacttgtta tcgtccaaac gaaaattcag 32461 acgttctgga aagttgttct tgttctatct ttcgcaatgc ctcaattcgt gcttcagtag 32521 agggatggct agagaacaag ttgcccagga attgtcccga gattgcgttt gtaatcaaca 32581 gtggctcaaa cgctgggttt gcttgtaggg gtagttgtct tgttgtcgct tccaaacgtt 32641 gcagtgcacg ggctaaggcg cgagggttac gtgttaatct agcagaacct gcgtctgctg 32701 aaaattctcg cgtacgtgat attcccagtt gaatcactgt cgcagaaatt ggagcaagaa 32761 atattgtcaa caacatcccc agaggattta gaccattccg gtcatcccgt gaatatggag 32821 tataccaaac actgtagcta acgatttggg ctaagaacga gattgcacct gcaatcgtag 32881 cagcaaccgc ctgtgttaag gtgtcacggt tagcaacgtg agtgagttcg tgggcgatga 32941 cgccttcgag ttcatcctct ggtaataaat ttaaaagacc ttcggtgact gcaatcgcag 33001 cgtgttctgg 
atcgcgtcct gtggcgaagg cgttggcagc tgggctaggg acaatgaaaa 33061 ttcttggcat gggtagattg gcccgttggc acaatttttg caccatctga tacagtcccg 33121 gtgcttcctg aggtgatact ggctgggcac gttatgccgc cagcgcaatt ttatcagatt 33181 gataccaaga aaataagttt gtcactgcta tgacaattcc cacaattaca ccactggtac 33241 cgccaatcac ccagtagcta atcgtaatca aaatcgcgct taaggaaaga ccccgtcctt 33301 aaggacgggg tttaatccct ataatgcagc agttttgaat tgatttctca tattttggtc 33361 tccttagtaa tactgaagtc atagatttat tgctcgcaac agcattactt aacccgcact 33421 ttgattgtat cgacctcaac acagattcat caggtagaaa ttgcgctcta gatttagtac 33481 ggttttctca ctcaagtttt taagtcctct ttgtggttta tacttaaagt tcatcccatt 33541 tattgctgcc tcagcgactt gtcgttgagt gattttgatg acaacttagc tcacatcctg 33601 atgataggtt ccaaaaactt atgtgaagac ggcgatcatt tgtatactgc cgttccattg 33661 tagtgagttg catcgggaag gtgaacacaa ttggaactcc tgtctagcca cctgtgctat 33721 gcaggcggat ttcggtgcgc cctctaactt tggctctccc gtcgggtgaa cccgtaaggg 33781 cggttcccaa tcccccactg aaagattact caattctgaa ttctgaattc tgagttggtg 33841 aatttttctt taatgattta actgctctag acaataacgc aaacataagg cagaccaaca 33901 cagggaatcc aacggattaa gacatcacct ctcttggaca aatctcatgg cgcaaaatat 33961 caacagagta ggaatggtgc agaagtatac taagattttg cacgatttag caatatttat 34021 catcagccct tggttagaga gagcatctgt caaatctctg caaaaacgag tcgattatct 34081 agaacgacgt ttatgtcaag atctaccgaa cacagaacaa cgtttggggt cgctcattca 34141 gcaactgcaa cgccagttca accttctgca agacaataca acaaatcgat tagacaagtt 34201 attagagtac gatttgcctc aacaagtgct ggacgtgctg aaagagcaga ctgatttgat 34261 aattcaaaac ataaaaccga ttgtggaaga tttgatacgc gagcaagagg agaataaacc 34321 tttacaagaa aatatcagcg aagcacttat tcaagtcaaa catcacaaaa tcgaagaatt 34381 ttcagaacat cttcaagagt tcacggcgtt acccttcggg ttcgcagtcg cctacggagg 34441 gagaccctcc tgcagcgctg tctcactccc actcccagaa gaacaatcag cagcctctct 34501 agaagcaggc ttgtggctcc tagctcatcg agataaaata acacaagaag tagcggatga 34561 gttattgggt ttgcaagctt cccaaacaga agtctttcgt cagaacctta gtagatattt 34621 gaaactgctt ggaagctgct tggaaaacgg aattcaacct cgtttacttt atcaaggtat 
    34681 catcactcat gagcagccgg cagttgagat atatgtgaat gggttcaagt tgattcaaaa
    34741 taaatacatc aagtgttggg aagattcagc acaagtctca actgaagcag ctcaacagtt
    34801 gagggtgtac ttcaagtacc ttattgatta tttgctcaaa gctttagctt tctcgccgta
    34861 agccccgcac catagatgta atcagcttgg tgtcgggata taaggcggac aaaagctcgg
    34921 tgcacccgac gcattgtccc ctaaaaatca gccagccttt gttttgtgga tctgtgttcc
    34981 aagacgagta ctcgttgtag taaggctggc tgattttttc gtgtaagacc cccgccctga
    35041 cctatgttag ataagattta acgagggaat ggggtgtccg agtttttgtc tatctctgat
    35101 actggtaacg aattaataag atttgacact cccctgccta aaggcgaggg gattctacat
    35161 tcatcgtcag aactt
//
LOCUS       NODE_774_length_34522_cov_5.604984 34522 bp    DNA     linear   BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 34522)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 34522)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..34522 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..264 /locus_tag="DP116_06400" CDS <1..264 /locus_tag="DP116_06400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_06400" /translation="AVNPRMTSQKCSDCGAIVKKSLSTRTHKCACGCELQRDVNAAIN ILNLAKARDGQSRSNATGVGTSTLLGANLVEQVLTMNVESPRL" gene 281..835 /locus_tag="DP116_06405" /pseudo CDS 281..835 /locus_tag="DP116_06405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739726.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="glycosyl transferase family 2" gene 1020..2903 /locus_tag="DP116_06410" CDS 1020..2903 /locus_tag="DP116_06410" /inference="COORDINATES: protein motif:HMM:PF13458.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06410" /translation="MGEWWRVIITRIVLFVVIVLFLGSLVFAASQSEQRKVIKERISF GEKSLIPVELSPHKKAGMKAMADKNLSLAREEFDKSLKKNRNDPEALIFFNNANIGYN RSYTIVVSVPITKNINDIDINNSLEILRGVAQAQRGINDFGGINGVRLEVGIASDDND PEVAKQVATALFNERKVLGVVGHYASGVTIAAKDIYTSKELVAITPISTSVCLTNTPT PTKTPTPISNSICSKYSDRSKNSKPYVFRTVPSDSDTAKALADHMLQDWKKKNVAVFY NSSSDYSMSLKSEFKTVVKQKGGQVLQEFNLNERNFDANSRVKQAKEQGTQVLMLAAD TKTLPKALEVVRSAKGKNLKILAGDDVYTPDTLKEDAAVGMVVAAFWHIDNAPNQDFV KKSKEKELWGGATVNWRTALAYDATQALIDAIEKNPTRSGVQQALLSPKFKTTGASGD IEFLPSGDRKNAKVELVQICTTKNSSTPYKFVPVAAYKGKPSDDCPSSLTNNNSSPTP TSSPLPTATPTLTSTPTPSSSTTSLESNSVCRLQPESPESYKAKVVQREGLNLRKQSN QDSGELGKLPRGQGIIVLKEEKNEKGEIWKNICAEVGGTKKEGWVKAKDSKGKDNTKK VVE" gene complement(3017..3280) /locus_tag="DP116_06415" CDS complement(3017..3280) /locus_tag="DP116_06415" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06415" /translation="MGEKEDYIFQDLCATRYSINTKDQICCEAKEKTKARLKRSPDAE AVVIALENSMQLSAVCSHVVGADGLESRGVGELALLHQLFDGG" gene complement(3330..3689) /locus_tag="DP116_06420" CDS complement(3330..3689) /locus_tag="DP116_06420" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06420" /translation="MQARDRYDANPQYWDDLATSDIWRLGLDVGDGQDNHALAIWRGS VLYDVQIHPTVGDIAVDNTGVGAGTLASLLASGMMNASGCRFGDGADDRSLFFNRKSQ LYWEFREGLRTGKVAIA" gene 4056..5765 /gene="kdpA" /locus_tag="DP116_06425" CDS 4056..5765 /gene="kdpA" /locus_tag="DP116_06425" /EC_number="3.6.3.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="potassium-transporting ATPase subunit A" /protein_id="PRJNA477356:DP116_06425" /translation="MGQGFFQIGVTLCIVIAITPLLGRYMARVFLGEKTLLDSIMNPV EGIIYKVADTARIDDMTGWQYARAVLCSNIIMGIVVFLLICLQQFLPWNPQGLAAPKW DITLHTTISFLTNTDQQHYSGETTLSYFSQTAALTFLMFTSAATGLAVGIAFIRGLTG RRLGNFYIDLTRSITRILLPISIVGAIALLALGVPQTLDGSLKLTTLEEGTQYLARGP VASFEIIKQLGENGGGFFGANSAHPFENPNTISNLIEIIAMICIPAALIHTYGIFANN AKQAKLLFWMVFAIYGILIGVTAIAEYQGNPLIDNALGLEQPNLEGKEVRFGWAQTAL WAITTTGTMTGSVNGMHDSLMPQGVFSTLLNMFLQIVWGGQGTGTAYLFIYSILTVFL TGLMVGRTPEFLGRKIEKQQIVLASVVLLVHPIAVLIPSAISLAFPNTLAFLIPPPGY TKSPEFHNISRVIYEYASASANNGSGLEGLGDSTLWWNLSTCFSLLMGRYIPIIAILL LAQSMAAKQPVPETPGTLRTDSTLFTAITAGVILILGVLTFFPVLALGPIAESIKLAS RIK" gene 6139..8247 /gene="kdpB" /locus_tag="DP116_06430" CDS 6139..8247 /gene="kdpB" /locus_tag="DP116_06430" /EC_number="3.6.3.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407112.1" /note="Derived by automated computational analysis using gene prediction method: 
Protein Homology." /codon_start=1 /transl_table=11 /product="K(+)-transporting ATPase subunit B" /protein_id="PRJNA477356:DP116_06430" /translation="MDSTNPSPKPSYSPRKSDRRQARKQSRVSTRGLYLRAIKDAFVK LNPKYAIKNPVMFLVWVGTIITTIATIDPYLFGPVSGNNLQLFNGLITAILFFTVLFA NFAEAVAEGRGKAQADALRTTKSETIAKKLLSDGSVSEVSSTSLRIGDTVYVAAGDVI PADGEVTMGVASVDESAITGESAPVLKETGSDVASSVTGGTRILSDELIIRITADPGK GFIDRMIALVEGAERTKTPNEIALTVLLAVLTLVFLFVVVTLPAIARYVGSPISVVVL VALLVALIPTTIGGLLSAIGIAGMDRVAQFNVIATSGRAVEACGDVNTLVLDKTGTIT LGNRLAEGFIPISTYSTQQVANVALAASIFDDTPEGKSIVRLAEKLGAKIDFDSHRAE AVEFSAKTRMSGTNLPNGSQARKGAVSAIKGFVRSRNQQDTPTPVLDAAYEGISQLGG TPLAVALDSEIYGVIYLKDIIKPGIRERFEQLRRMGVRTVMLTGDNRITASVIAGEAG VDDFIAEATPEDKISVIQREQAAGKLVAMTGDGTNDAPALAQANVGVAMNTGTQAAKE AANMVDLDSDPTKLIDIVAIGKQLLITRGALTTFSIANDIAKYFAIIPVLFTSANLGS LNIMKLTSTNSAVLSALIYNALIIPALIPLALTGVKFQPLTANQLLQRNIFLYGLGGV IAPFIAIKLIDVLITLAGLA" gene 8289..8558 /locus_tag="DP116_06435" CDS 8289..8558 /locus_tag="DP116_06435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872729.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="K(+)-transporting ATPase subunit F" /protein_id="PRJNA477356:DP116_06435" /translation="MKKNVLSSKFFLTNVPEAISFVWLQWRSQKLPLAIFVALCLNLL IAPIVYAAGDGTLERLSAWAIGVLGFITLAIILYLSVVVFQPERF" gene 8750..9355 /gene="kdpC" /locus_tag="DP116_06440" CDS 8750..9355 /gene="kdpC" /locus_tag="DP116_06440" /EC_number="3.6.3.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861681.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="K(+)-transporting ATPase subunit C" /protein_id="PRJNA477356:DP116_06440" /translation="MSIYREISKAIRITFILWLLTAIIYPLFILFVAQVPFLKYKAQG SIVQNIKGEIIGSALIGQQFKSDQYFHSRPSTIRYSQGKQGNPTGVSGASNLAPSSPQ LLQRIVEEANLLKDEEIQPIADLIYTSGSGLDPHISIKAVRQQLDRVARARKLRPDEI LPFINKYTEGRFLGIFGEPGVNVLKLNYALDLDEFNRQQNR" gene 9430..10596 /locus_tag="DP116_06445" CDS 9430..10596 /locus_tag="DP116_06445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114719.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase KdpD" /protein_id="PRJNA477356:DP116_06445" /translation="MFDNTQTPQAGAMPATFPASISPARRGKHKIYIGMAPGVGKTYR MLEEAHALKDEGIDVVIGLLETHGRKETGQKANGLEIIPRKEIPRGGLTLTEMDTNAI IARSPQLALIDELAHTNVPGSLREKRYQDVEVILASGIDVYSTMNIQHLESLNDLVAR ITGVVVRERVPDRILEEADEVVVIDVTPETLQERLLEGKIYAPQKIQQSLDNFFQRRN LIALRELALREVADNVEEDAVASTPQGQFCNIHERVLVCVSTYPNSIQLLRRGARIAS YMSAPLFTIYISHPERFLTKEESLHIETCEKLCKEFDGTFIRTTGTDVAKAIAQIAEQ YRITQIVIGASQRSRWQILFKGALTHKLLRLLKNVDLHIISAEKNNTPTRSAID" gene 10760..11917 /locus_tag="DP116_06450" CDS 10760..11917 /locus_tag="DP116_06450" /EC_number="2.7.1.170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="anhydro-N-acetylmuramic acid kinase" /protein_id="PRJNA477356:DP116_06450" /translation="MTRVIGLISGTSVDGIDAALVDISGTDLDIKIELVAGATYPYPA DLRERILAVCAGVAISMAELAELDDAIACTFAQAAQNIQIGHHKPTLIGSHGQTVYHK PPSQPITPIGYSLQLGRGESIANQTGITTISNFRVADIAAAGHGAPLVPRIDAALLSH PQEERCIQNIGGIGNVTYIPVRRDNWLEKIRAWDTGPGNSLLDLAVEHLTDGAKTYDE NGSWAASGTPCSPLVEHWLSLDYFHLPPPKSTGRELFGVDYLHQCLQDAEAYQLSPAD LLATLTELTAASIVHSYRTFLPQMPQRVLLCGGGSRNLYLKRRLQVLLEPVPVVTTDE VGLSADFKEAIAFAVLAYWRSMGTPGNLPTATGARQEVLLGQIHSPLVVSA" gene 12145..13599 /locus_tag="DP116_06455" CDS 12145..13599 /locus_tag="DP116_06455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="signal recognition particle protein" /protein_id="PRJNA477356:DP116_06455" /translation="MFEALADRLEGAWKKLRGQDKISQSNIQDALREVRRALLEADVN LQVVKDFVTEVETKALGAEVIAGVRPDQQFIKIVHDELVQVMGEENVPLAQADHSPTV VLMAGLQGTGKTTATAKLALHLRKLERSCLMVATDVYRPAAIDQLITLGKQIDVPVFE LGSDADPVEIARQGVERAKAEGVDTVIIDTAGRLQIDQDMMAELAQIKKTVQPDETLL VVDSMTGQEAANLTRTFHEKIGITGAILTKLDGDSRGGAALSVRHISGAPIKFVGVGE KVEALQPFYPDRMASRILGMGDVLTLVEKAQEEIDLADAEKMQEKILSANFDFTDFLK QMRLLKNMGSLGGIIKLIPGMNKLTDDQLKHGETQLKRCEAMINSMTRQERQNPDLLA SSPSRRRRIANGAGYKETDVSKLVSDFQRMRNMMQQMGQGQFAGMPGMFGGMGGPAAA GNSPAAPGWRGYNSGAGSKKKPKKDKKKKGFGTL" gene 13839..14099 /gene="rpsP" /locus_tag="DP116_06460" CDS 13839..14099 /gene="rpsP" /locus_tag="DP116_06460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S16" /protein_id="PRJNA477356:DP116_06460" /translation="MIKLRLKRYGKKGEPSYRIIAINNLARRDGRPLEELGFYNPRTD EVRLDVPGIVKRLQQGAQPTDTVRHILQKANVFEQVSGKAST" gene 14080..14493 /locus_tag="DP116_06465" CDS 14080..14493 /locus_tag="DP116_06465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="KH domain-containing protein" /protein_id="PRJNA477356:DP116_06465" /translation="MEKPQHKIGTKSPTTSPDYVGLVRFLMQPFLDSPESLSIDCEMS NTLNRAWIRIAFESLDKGKVFGRGGRNIQAIRTVITAAAQTAGHSVYLDIYGSSAGNR DDSSFEEDREERLPPPKPREKRGNEHKPIARIRSH" gene 14887..15063 /locus_tag="DP116_06470" CDS 14887..15063 /locus_tag="DP116_06470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S21" /protein_id="PRJNA477356:DP116_06470" /translation="MAEVRLGEDESIDSALRRFKKKIQKAGILSEVKRRERYEKPSLR RKRKAEAARKGGRF" gene 15313..16266 /locus_tag="DP116_06475" CDS 15313..16266 /locus_tag="DP116_06475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868526.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate starvation-inducible protein PhoH" /protein_id="PRJNA477356:DP116_06475" /translation="MSGALTIELPNLPSAIALAGEGEENLKTLAQQTGANIVLRGQEL YISGTEKQVDLASRLVRSLEDLWGKGNNISSTDILTARQALDTHREGELLDLQRDIIA RTRKSEEVRAKTFRQRQYIEALRKRDLTFCTGPAGTGKTFLAVVVAVQALLANQFERL ILTRPAVEAGERLGFLPGDLQQKVNPYLRPLYDAINEFIDPEKVPSLMERGVIEVAPL AYMRGRTLNNAFVIVDEAQNTTPAQMKMVLTRLGFRSRMVITGDITQTDLPTNQQSGL TVAIQILKHVEGIAFCEFSQKDVVRHPLVQRIVAAYEQHEK" gene 16312..16716 /locus_tag="DP116_06480" CDS 16312..16716 /locus_tag="DP116_06480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876703.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heme utilization protein HuvX" /protein_id="PRJNA477356:DP116_06480" /translation="MTKTLKEFLEACETLGTLRLIVTSSAAVLEARGRVEKLFYAELP KGKYANMHTEGFEFHLNMDKITQVKFETGEAKRGNFTTYAIRFLDDKQEPALSLFLQW GKPGEYEPGQVEAWQTLREQYGELWEPAPAEI" gene complement(16821..17330) /locus_tag="DP116_06485" CDS complement(16821..17330) /locus_tag="DP116_06485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006530244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06485" /translation="MISSEQTFISHKMGHLRPELGDFSSIIFLKAILVGIEETLGDKT AGIAMISAGRNQGKNLARDLNLVGNEAILSLEQIQYKINLVLGKEGTRLCLIDKIELE GDIYKVYAKETFCSAGEAEGSLRNCSYTLGVIQGFLEAFLKKRLHGKQIESVLRGSNH DVMQFSIIA" gene complement(17609..17986) /locus_tag="DP116_06490" CDS complement(17609..17986) /locus_tag="DP116_06490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314952.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diacylglyceryl transferase" /protein_id="PRJNA477356:DP116_06490" /translation="MERVRIHVSMLQDELQNFVSNSTGVQGAILATPDGLALASVLPP GMNEGRTAAISASMLSLSEQIGHELVRGNIDRIFVEGEKGYGVLVSCGDAILLVLAHA SVKQGLLFLEIKRAVAKITSLLG" gene complement(18401..18898) /locus_tag="DP116_06495" CDS complement(18401..18898) /locus_tag="DP116_06495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydrocarbon-binding protein" /protein_id="PRJNA477356:DP116_06495" /translation="MPDEDKALVKSEAKLRPILGDFNSIICFKSAVTGIEKALGEKAA AIALTTAGRHRGKDLAQELCFSGSSCSFEDIGYKLETALGKDGTCLCIINNVIREDDV IHVYTSETFCSAGEPQGSERKCTYTLGVVWGFIEQVLGKRLQGKHTESVLRGGDYDVF KFTFL" gene complement(19409..20434) /locus_tag="DP116_06500" CDS complement(19409..20434) /locus_tag="DP116_06500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314951.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06500" /translation="MIFTGNLAEFSLPKIFQLLEQGDNTGLLTIRTITADVTGFMPVY YIWLYQGRIVAAADRLDEKGLVSMIAQRGWVSQDVTLRMAKICRVNTPIGLCLKFQGL LQAEQLKLLFRTQVVDQVSALFELKDGQFEFDAEADLPSAEMTGMSLPATEVTLIGLR YAKRQASGLSLRDWSLLAEQLPELSLILSNKNMTQPQFQLDSLEWQVWQLANGTISLR EIANQLGLAVEKVQQIAFQLLMSDLAEEVFPTAILRNEVTETTSVAENLPESIVDSSS QTDNTQALTNIVAETTLVAENLPTPVIDSSGQTNVSQSFLQKLVGWLRVIRSFFQGLF SFRRNKT" gene complement(20924..21298) /locus_tag="DP116_06505" CDS complement(20924..21298) /locus_tag="DP116_06505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314950.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diacylglyceryl transferase" /protein_id="PRJNA477356:DP116_06505" /translation="MAINTAKLSSILQNFVTATSDVQGAVLVSPDGLTLSSSLPGGMD EERVSAMAAAMISLGERIGSELTRGNIERIYVEGDKGFGILNGCGEDAVLLVLASDSA KQGLLLLEIKRVLTELRKAMMY" gene complement(21934..22407) /locus_tag="DP116_06510" CDS complement(21934..22407) /locus_tag="DP116_06510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314948.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06510" /translation="MATSCTLRPQLGDFVNVMDYKALLDGIEDNLGSKAAAVIISAAG RTYGKKIATALGASMESQDLPTILKRLNLCLGMEGTRLCMIEEVSQQENFIRVKVTEP VEMSGETGGSSRLCPFTLGILGGFIDQVMQRRHQARQVPVADQINLQTEFEFTPL" gene complement(22937..23482) /locus_tag="DP116_06515" CDS complement(22937..23482) /locus_tag="DP116_06515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459117.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GTPase" /protein_id="PRJNA477356:DP116_06515" /translation="MEILRIVVTGGMGAGKTTLIRTISEIEVVDTDRKATDEVGLLKK TTTVTLDFGRLTIGPNQSLHLYGTPGQSRFDYMWEILIAKAHAYILLVAAHRPHHFRY GRKQLNFMNQRVQIPYLIGLTHTDCPDAWEAEDVAIALGLDDEKTRPPIITVNATEVT SVKQALNALVEEFANYYQHTT" gene complement(23815..25542) /locus_tag="DP116_06520" CDS complement(23815..25542) /locus_tag="DP116_06520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209213.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phospholipid carrier-dependent glycosyltransferase" /protein_id="PRJNA477356:DP116_06520" /translation="MLHRLHLFHQTKLTIGFLSAFPLVLLLIWALPLLLFSSGESSLM AHDEGLYAWRSRLMIDTGDWIHPWTTPHHKTPGPYWLIASCYRLFGISEASARLPSMI AGVLSIQLVYEIGKILLGKKLAWLAAAILSVEFLWLQYCRLGTPDVPVIFLILFAIWS LLKAELHPKYGFVWCFLAGLSFGLGFLVRSFMIFLPIIALLPYLIWEHRRHRHLANPM LYFGFVVGLIPTFIWLWFSWLRYGDDSFGQLINFVLKLGSGERAHNGLGFYFWDIPLK AFPWFFFSLLGLVLLIRRPIPRYHLLLVGYPLILFAELSFFSTRLSHYSLLLYPFIAL FAAVGLNWLSGGMRKQQRARGAGEAESSGHAARTGAGGARREMTFLSSSSSSSPVPPI CFPRNLSYAFGVLGVLLLGVGMVALVWGDAQVDKYAIVALVSGTSWLILPLVWIGRYH LGKKSLTSRYWLASWLIPVWISLAAAGSSGFLGDYNPDVKAFIQQPTIAQVLHSSAVN FVDIRGKSDVLLKFYTPHHGKRVHQVSELPPLSYAWVSVKQMTNLSRRHRVLGTVQDV SLIELLNQF" gene 25699..26346 /locus_tag="DP116_06525" CDS 25699..26346 /locus_tag="DP116_06525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315527.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_06525" /translation="MPLKPDERTKLDSTDDKLFYEYPRFVTHVDEGFIQQLTDLYRER LKPNTRILDMMSSWVSHLPQEIPFAHVEGHGLNAEELARNPQFNHYFVQNLNENPQLP LKDQDFDAVLNCVSVQYLQYPEAVFSEIHRILKPGGVAIFSFSNRMFFQKAIQAWREG TEASRVELVKSYFSAVPGLTPPEVIARQSSLPNFLQWMGVAGGDPFYAVIAYRSP" gene 26455..26985 /locus_tag="DP116_06530" CDS 26455..26985 /locus_tag="DP116_06530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311213.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06530" /translation="MFFPRARRVLAALLLCLLLFTTACAPKTPGRFDQAQQESSRQRS GQAVAKDSTQGSEFNKFFPSADAGYQRVYTQEKKGFAEAKLKKDGKDLAVLSINDTQA VKGAANPAAKFVNSPKTIAGYPAVSQGSTGTAILVANRYQVKVQSRDASFTEAEREAW IQKFNLGGLARLAKAQ" gene 27062..27949 /locus_tag="DP116_06535" CDS 27062..27949 /locus_tag="DP116_06535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408955.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06535" /translation="MSKSIFELVDELPTSNLTVSALRSLDFVAPGEWQNVVGFVNTIK TVTGEDDEDLIQQIGERAVYLYNDRSQGYQRAMWLYQTVDSTDKALGAAALANKVGEK IPLLGFLNRVTPKAEKAQTIDLCLKLVAELVAFCQINGIPGDSIGDFVASLGEYSGES FIRMAALVCFDGLIPLGPDFISSALSRINQTSPQELDQNSTFANIREAIPGNDSSSKL NFIGESFHSVSGWMSGLVASNNLTPQKVANNLQNFVDFADDKLDYLAAFLDVSTNYYE HTGTQTLARRLIERASAEI" gene complement(27950..28336) /locus_tag="DP116_06540" CDS complement(27950..28336) /locus_tag="DP116_06540" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06540" /translation="MLGRSSDIRKLIWRSKKSTALPCFYGKRTSTSPDLLYATYLADK TSPEVDFNVNLWKLKKNVDRTINVDKLYDTCIFESEVRIQRYRINQWGIGTRHLEAPR TRVLSSNIGLYPAAKRVHYGGGLTPN" gene complement(28496..28645) /locus_tag="DP116_06545" /pseudo CDS complement(28496..28645) /locus_tag="DP116_06545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013726751.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 28700..29646 /locus_tag="DP116_06550" /pseudo CDS 28700..29646 /locus_tag="DP116_06550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006631927.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene complement(29618..30640) /locus_tag="DP116_06555" CDS complement(29618..30640) /locus_tag="DP116_06555" /inference="COORDINATES: protein motif:HMM:PF01385.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_06555" /translation="MIQAMYAVKVELKVNNKEQTLLRKHAGFARFVYNYGLALMQGLE RDGIAGGYGKKIKAIKKVLTNYTKKQQEFRWMSKLSSKVYQSSLQALESAYNRWGQGI SDRPRFKRRKDGESFTVYDGNGKVLLRSGKQIKIPTLGIFRLKEALPCSYCTQTFTIS YCAGKWFVSFAVDAEKIPPTYHPEEKVGIDLGVKCFATLSDGTSVESPKPMKKAKIKL AKLQWRNRKKQLGNRRLGVKQSNKAKKYFDSLARQHYAIASQRRDFLHKLTTDISRKY YRIRIEDLNVSGMFANRKLSAAISDLGFYEFRRQLTYKSQVYGTKVELVDTAYCRFMR YSNSSL" gene 30689..30850 /locus_tag="DP116_06560" CDS 30689..30850 /locus_tag="DP116_06560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_06560" /translation="MKQDKRIDLRVTQTELELLDEYCQLTGKNRTDVLREFIRSLKKK MRNAEKYSI" gene 30946..32184 /locus_tag="DP116_06565" CDS 30946..32184 /locus_tag="DP116_06565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491308.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="PRJNA477356:DP116_06565" /translation="MKKYYVKEATITAQLSKDVVIQERHSSSIAEALTHAVEIADYCA ANANAIDNNGAFPESEFKRIAKAGLLAAPLQRELGGWGAGIDANVTYESLMLLKQMGR GNLAVGRVYEGHVNALQLIQSFGTREQIAAYACDARARHKIFGVWNAEASDGVKIIPL DNGKYRLEGSKTFCSGSGYVERPFVNGALPDGSWQMCIVPMDEVTTVSDPNWWQPSGM RATASYKVDFSGVELAESSLIAKPGDYLRQPWLSAGVIRFAAVQLGGAEALFDLTRQY LQNMEYTNDPYQKERLGRMAIAIESGNLWLRGAADMVAAYAPVFGGYPTVDNPQAEQL VAYANMVRTTIEQICIDTMQICERCIGTRGLLPPNPMERIIRDLTLYLRQPAFDAALA NVGQYVLAETHPARSLWNNE" gene 32177..32923 /locus_tag="DP116_06570" CDS 32177..32923 /locus_tag="DP116_06570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491307.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PIG-L family deacetylase" /protein_id="PRJNA477356:DP116_06570" /translation="MNNQVTVASPLTHSNSLPWRSLKDIACGSALVVAPHPDDETLGC GGAIALLRSLNLMVRVLVISNGTLSHPNSQKYPAPALQALRESETLSALSVLGVEANA VTFLRLQDGSVPAQYKGAVTTCVAYLTEIAPRMIFLPWRYDPHPDHQASWKLIHTALC DSHISPQLIEYPIWDWDPDQRGTLPESLEVTSWRLDISAVVELKQQAIAAYRSQTTDL IDDDPEGFRLTPEMLLNFTRSWEVYLEAKI" gene 33078..34376 /locus_tag="DP116_06575" CDS 33078..34376 /locus_tag="DP116_06575" /inference="COORDINATES: protein motif:HMM:PF13419.4" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_925996.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06575" /translation="MKLAIFDIDGTLTQTNDVDNQCFVQAFAKEFQIKEINTNWATYG HTTDSGIALQIFQQNWGRVPETSELCQLQQCFVELLHGHYTETPGSFVEIPGACVMLQ RLAQTKDWASAIATGGWRASAEMKLQAAGLDIRELPAAFADDSISREDIVKTAVSRAK EFYHQPDFERIVCIGDGIWDVLTAIQLQLPFVGVASDTQKPLLENAGVECIIPDFVDF DSFLKALDTASIPNQHKPLQVQNNSLPPSYFETLYGSNPDPWKFETSEYENQKYTATI AALPKQRYHSGFEIGGSIGVLTEKLAQRCDSLLSVDVSKIAQKRAIQRCQHLPQVRFE IMCLPQEYPEEMFDLTVVSEVGYYWCWEDLKKAQQCILKHLEPGGHLLLVHWTQYAPD YPLNGDQVHDSFFDLTPTHLRHLKGKREKEYRLDVFERVF" BASE COUNT 10010 a 7459 c 7449 g 9604 t ORIGIN 1 gcagtcaatc ctagaatgac atctcaaaag tgttctgatt gtggcgcaat tgtgaaaaaa 61 tctctttcaa ctcgcaccca taaatgtgct tgtggttgcg agttacaaag agacgtgaac 121 gcagcaatta atattctaaa tcttgcaaaa gctagggacg ggcagtcccg aagtaacgct 181 acaggagttg gaacctctac gctacttggt gcaaacctgg tagagcaagt tctgacgatg 241 aatgtagaat cccctcgcct ttaggcaggg gagtgtcaag ctgtttcaac gctcatccta 301 cgataaaata ggaggacatc aggcagttgc aggctatgtt gctgaggatg tggctttggc 361 tcgacgcatt aaagagagtg gtttgaagtt gcgacacttc ttaggagcaa atttagcaaa 421 attaaggatg taccgctctt ggacagcact ctgggagggc tggactaaag ttttgtactt 481 gggtgcccaa agaagcttgt ggctaatggt atccctagct ctgctaatgc taaatattta 541 tttgattcct tggctagtac tagttattgt gtttagtaaa agttttttaa ttggctggaa 601 aacagttgac ttgctggcga tgtgtctagc tttgattgct attctcctcc agtacaattt 661 acgcacacta gcagcacagg catttcgtag ctctccaaaa tactggtggc tgcatggttt 721 gggaggtcta ctggttgcag tcattgccat tgcctcagtt attaagactg agacgggttg 781 gggttggact tggcgaggtc gagtgcttaa gcatcctatt tcacctcgtg aatgaatttt 841 gtagatccag aagtagcaaa cgtggcataa ggtgatatgc ataatacaaa gtgaaaatgc 901 acagaaaaag tttcctgaaa aatactactt ctaaactccg ctcactcgga ttccatcatc 961 aatcctgaaa ggcgataaat cgaaatcttt aacaataaat ttttaaagga aaataatata 1021 tgggagaatg gtggcgcgta ataattactc gaatcgttct ttttgttgtt atcgtgctct 1081 tcttgggatc actggtcttt gctgcatcgc agtcggaaca acgtaaggtc attaaagaac 1141 gcattagttt tggtgagaaa agtttaattc cagtagaact ttccccccat 
aaaaaagccg 1201 ggatgaaagc tatggcagac aaaaatttgt cccttgcaag ggaggaattt gacaaatcgc 1261 taaaaaagaa tcgtaacgat ccagaggcac ttattttctt taacaatgcc aatattggtt 1321 ataatcgcag ctacaccatt gttgtatctg taccaattac caagaatatc aatgatatcg 1381 atatcaataa ttctttggaa attttgcgtg gggtagctca agctcaacgt ggaattaatg 1441 actttggggg aattaatgga gtgcgtttgg aggtagggat cgcatcggac gataatgatc 1501 cagaagttgc taaacaagtt gctacagcct tattcaacga gcgtaaggtg ttaggtgtag 1561 tcggtcatta cgctagtggt gtcactatag cagcaaaaga tatttataca tccaaagaac 1621 ttgtagccat tactcccatc agtacttccg tttgtcttac aaacactccc actcccacca 1681 agactcccac tcctatcagt aattctattt gttctaaata ttctgaccgt tcaaagaact 1741 ctaaacctta cgtttttcgc acagttccta gtgatagtga taccgcgaag gcattagcag 1801 accatatgtt gcaagattgg aagaaaaaga atgtagcagt tttttataat tccagtagtg 1861 actacagtat gtcgcttaag tctgagttta agacagttgt gaaacaaaaa ggaggacaag 1921 tattgcaaga gtttaacttg aatgagcgga attttgatgc caattcaagg gtgaaacagg 1981 ctaaggaaca aggtacccaa gtactaatgt tagcagccga tacgaagact ttaccaaaag 2041 cgctggaagt ggttaggtct gctaagggga agaacctcaa gattttggcg ggagatgatg 2101 tttacacccc agacactttg aaagaagacg cagcagttgg gatggtggta gcagcttttt 2161 ggcatattga taacgctccc aaccaagact ttgtcaaaaa gtctaaagag aaagagcttt 2221 ggggtggtgc tacagtaaac tggcgaactg cccttgccta tgatgcaaca caggctttga 2281 ttgatgcaat tgagaagaat ccgactcgaa gtggggtaca acaggctctc ctatcaccaa 2341 aatttaaaac taccggagct tctggtgata ttgagttttt gccgtcgggc gatcgcaaga 2401 atgctaaagt agaacttgtg caaatttgca ccactaagaa ttctagtact ccttacaaat 2461 ttgtccctgt tgcagcttat aaaggtaaac cctccgacga ttgtcccagt tcactgacta 2521 ataacaattc aagtccaacc cctacttctt cgccgctacc tactgcaacg cctacattaa 2581 ctagtacacc cacaccatcc tcttctacaa cttctttgga aagtaattca gtatgtcgtc 2641 tccaacccga atccccggag tcgtacaaag ctaaagttgt tcagcgtgaa ggtctgaatt 2701 tacgaaaaca atcaaaccaa gattccgggg agttagggaa gcttccgcgt ggacaaggaa 2761 ttattgtttt gaaggaggag aagaacgaga agggtgaaat ttggaaaaat atttgcgcag 2821 aagtaggagg tactaagaaa gaaggttggg taaaagcaaa agatagtaaa ggaaaagata 
2881 atacaaagaa agttgtggaa taacgtgatt tttcacagtt aaataaagca tcaccaccaa 2941 actccctgtg caatatttcc cacagaaacg ctgtgcggta tgtcctctgg acacactgtg 3001 cgattcgcac accacctcaa cccccatcaa aaagctgatg caacaaagcc aactcaccaa 3061 caccccgcga ctccaaccca tccgcaccaa caacatgaga acacacagca cttaactgca 3121 tcgaattctc caaagcaatc acaaccgcct ccgcatccgg actccgcttc aaccgcgcct 3181 tcgtcttctc cttcgcctca caacaaatct ggtctttcgt attgatacta taacgagttg 3241 cacacaaatc ctgaaaaata taatcctcct tttcacccaa aggagcgtta agcgaagctc 3301 tgccgtttgc gcagcgcccc ttttaggggc taggcaatcg ccactttccc cgtcctcaac 3361 ccctcacgaa actcccaata caactgagac ttgcggttaa aaaacaaaga acggtcatca 3421 gccccatcac caaaacggca accacttgca ttcatcatcc cacttgcaag caaacttgcc 3481 aacgtccccg cacccacccc agtattatca acagcaatat cacccaccgt cggatgtatc 3541 tgtacatcat acagcacaga tccacgccaa atcgccaaag cgtgattatc ttgtccatcc 3601 cccacatcca aacccagccg ccagatatca ctcgtcgcca aatcatccca atattggggg 3661 ttagcatcat agcgatcgcg agcttgcaaa agccatgatt tcggaataat accatcagta 3721 tcatccgtcg ggaactgcgc ttctacacga gaatgggatg gtcccaggaa tttaaatatt 3781 tggaatgctc tctatataca cttgctttta ccgatgtcat gtatagaata gttatgacaa 3841 atcatgaatt gctgacaacg ccataaatat gagatcagtg aacagtgaac acttcgacaa 3901 gctcagtgca tcgcagtgaa cagtaaaaac tgataactga taactgctaa ctggtaactt 3961 ataaagaata aaattgttca gctatcaata aatagttgaa caattttaaa atgcaaagac 4021 tttaacaaga tccatctaca attttggcta gataaatggg acaaggcttt tttcaaattg 4081 gggtaacgtt gtgtattgta atcgcaatca ctccactatt gggcagatac atggcgcgtg 4141 tctttttggg agaaaaaacg ctgctggact caatcatgaa ccctgtagag ggaataattt 4201 ataaagtggc agatacagcc agaattgatg acatgacggg ttggcagtat gccagagcag 4261 tactttgcag caatataatc atgggtattg tggtgttttt gctgatatgt cttcagcaat 4321 ttttaccttg gaatcctcag ggattagctg ctcctaaatg ggacatcacg ctgcacacga 4381 caatttcatt tttaaccaac actgaccagc aacactactc tggtgaaaca actttaagtt 4441 attttagcca aacagcagct ttaacttttt tgatgtttac ttcagccgca actgggttag 4501 cagtcggcat cgcattcatc agaggtttga caggtagacg actgggaaat ttctacattg 4561 
atctgactcg ttctataacg cgcatattgc tgccaatttc gattgtgggt gcgatcgcac 4621 tactagccct aggcgtacca caaaccttag atggatcttt gaaattgaca acattggagg 4681 aaggaacgca gtatctcgcc agaggtcctg ttgcttcctt tgagatcatc aaacaattgg 4741 gagagaatgg cggtggtttt tttggcgcaa attctgctca tccgtttgaa aatcccaata 4801 ccatttctaa cctcatagaa atcatcgcca tgatttgtat tccagcggcg ttgattcaca 4861 cttatggtat ttttgccaat aacgccaagc aagctaaact acttttttgg atggtctttg 4921 ccatctatgg aattttgatt ggtgtcacag cgattgctga gtatcaagga aatcccctga 4981 ttgataacgc cttgggatta gaacagccta atttagaggg gaaggaagtc aggtttggct 5041 gggcacaaac ggcgttgtgg gcaattacga cgactggaac tatgactggt tctgtaaacg 5101 ggatgcatga ttccttgatg cctcagggag ttttttccac cttattaaat atgtttttgc 5161 agatagtttg gggtggacaa ggaactggaa ccgcttactt attcatttat tcaattctca 5221 ccgtatttct cacaggactg atggtgggac ggactccaga atttttagga cgcaaaatag 5281 aaaagcaaca aattgtcctt gccagtgtcg tcctgctggt tcacccaatt gctgtattga 5341 tccccagtgc cattagctta gcatttccaa ataccttagc atttctgatt ccaccgccgg 5401 gatatacaaa atcccccgaa tttcacaaca tctcacgagt gatttacgaa tacgcctcag 5461 ctagtgctaa caatggttct ggcttggagg gattgggaga tagcactttg tggtggaact 5521 taagtacttg ttttagcctt ctgatgggaa gatacatacc aattattgcc attttgcttt 5581 tagcacaaag catggcagcc aaacaaccag tacccgaaac accaggcacc ctcagaaccg 5641 actctacatt atttactgct attacagctg gagtcatttt gattttgggt gtgttgacgt 5701 tcttccccgt tttagcttta ggaccgatcg ccgaaagtat taaacttgcc agccgcatca 5761 aataatcttg ttcgtaatag gcgtttatca gttatcagtt atcagtagga cttatatcat 5821 gtccggttaa ttgcttataa attccgaaga accccacccc ggttttgtct tgcgccaaaa 5881 ccgcccctcc ccgcttgcgg ggaggggatt aaggggaggg gtgcagatat cgcggtaatc 5941 acaactaacc agccggacat gatattacgc aaaaactctt ttaaaccctc ttaactatgc 6001 gctaaggcgc acgctacgct aacgtgttct ttgcgtcctt tgcggtttat tttttcatta 6061 ttttgcgtaa gtcctgatca ataagcgtgt tcactggacg cccaccataa cataatcccc 6121 aattcacaat tccctctcat ggactccaca aatccttcac ccaaaccttc ttattctcct 6181 cgtaaaagcg atcgccgcca agcacgtaaa caatcgcgag tcagcaccag aggactttac 6241 ctcagagcca 
tcaaagatgc ttttgtcaag ctcaatccta aatacgcgat caaaaacccg 6301 gtcatgttct tggtttgggt tggtacaatt atcaccacta tagcgactat cgacccttat 6361 ttgtttggtc cagtttcagg caacaattta caacttttca acggattaat tacagcaatt 6421 ttgttcttca ccgttttatt tgccaacttt gcggaagcag tcgcagaggg acggggtaaa 6481 gcgcaagctg atgctttaag gacaaccaag tcagaaacaa ttgccaaaaa acttctctct 6541 gatggttcag ttagtgaagt ttcttccacc agcttgcgga taggcgatac agtgtatgtg 6601 gctgctggtg atgtcatccc tgctgatggg gaagtgacta tgggtgtcgc cagtgtggat 6661 gaatctgcaa ttactgggga atctgcacca gttttgaagg aaacaggttc agatgtggct 6721 agttcagtta ctggtggtac gcgaattctc tctgatgaac ttattatccg tatcactgca 6781 gatccaggca aggggtttat cgaccggatg attgctttgg tggaaggagc agaacgcaca 6841 aaaacaccga atgagattgc tttgacggtg ttactagcag ttctaacctt ggtgtttctt 6901 tttgttgtgg tgactttgcc cgcgatcgcc cgctatgttg gaagtccaat tagtgtggta 6961 gtattagttg ccttgttagt tgcgttaatt ccgacaacaa ttgggggatt actgagtgcg 7021 atcggcattg ctggtatgga tcgagttgcc cagtttaacg tcatagccac ctcaggacga 7081 gcagttgaag cgtgcggtga tgtcaatacc ttagtactcg ataaaacagg tacaatcact 7141 ctcggtaatc gtttggcaga agggtttatc cccatcagca cctattcaac acagcaagtt 7201 gccaatgttg ccttagcagc aagtattttt gatgatacac cagaagggaa atcaattgtt 7261 cgactggcag aaaagttagg tgcaaaaatt gattttgaca gccacagagc cgaagctgta 7321 gagttttcag caaaaactcg tatgagtggt acaaacttac ccaatggtag ccaagcgcgt 7381 aagggtgcag tatcggcaat taagggattt gtccgttctc gcaatcaaca agatacgccg 7441 acgccggtac tggatgcagc atatgaagga atttcacaac tgggagggac acctttagca 7501 gtagcccttg atagtgaaat ttacggcgtc atttatctta aggatatcat taaaccaggt 7561 atccgtgaac gttttgagca gttgcggcgg atgggagtgc gtaccgtcat gctgactgga 7621 gacaaccgca ttaccgcgtc tgtgattgcg ggggaagctg gagttgatga ctttattgcc 7681 gaagccaccc cagaagataa aatctctgtg attcagcgcg aacaagcggc gggcaaactg 7741 gtggcgatga cgggtgacgg tactaacgat gcacccgcac tagcccaggc aaatgtcgga 7801 gtagcaatga atacgggtac tcaggcagca aaagaagcag ccaatatggt ggatttggac 7861 tctgatccca caaagctgat tgatattgtt gcgattggaa aacaactgct gattactcgc 7921 ggtgctttga caacgttttc 
tattgccaat gatattgcca agtattttgc aattatccca 7981 gtgttgttta cctcagctaa tttgggaagt ttaaatatca tgaagttaac gagtaccaat 8041 tctgctgtcc tgtcggcgtt gatttacaat gctttgatta ttccagcttt gattcctttg 8101 gcactcactg gtgtaaaatt tcaacctttg acggctaatc agcttttgca acgaaatatt 8161 tttctgtatg gtttgggagg tgtgattgcg ccgtttatcg caattaagtt gattgacgtg 8221 ttgattactc tggcaggatt ggcttagaaa acaaatcgta gcaagcgcag agaaaggaga 8281 ggaaaatgat caaaaaaaat gttctatcaa gtaagttttt cttaacaaac gtacctgagg 8341 caatttcttt tgtctggttg caatggcgct ctcaaaagtt gcctctcgct atttttgtag 8401 cgctgtgcct aaatttgctg attgccccga tagtatatgc tgctggcgac ggtactttgg 8461 aacgcctttc tgcttgggca attggtgttt tgggattcat aacactagca attatccttt 8521 atttgtctgt tgtcgttttt cagccagaac gcttttaatt aggtaatata ggattcttat 8581 ttgatttttg cgaaactagg tacagtttct gttccctgtt aagcgttccc tgttccctat 8641 taagcgttcc ctctttccta gctcaactag taaattcata aaccaaaccc gattccgaca 8701 caattttgtt aattatgaca gtaaatttcc gaggtcgtta aagttttgta tgtctattta 8761 tagagaaatt agtaaagcaa ttcgcatcac cttcatctta tggttattaa cggcaattat 8821 ctatccttta ttcatacttt tcgtcgctca agtccccttt cttaaataca aagctcaagg 8881 cagcatagtg caaaatatca agggagaaat catcggttca gctttgattg gtcaacaatt 8941 caaatctgac cagtatttcc acagtcgtcc tagtacaatt agatatagcc aaggaaagca 9001 aggtaatcct actggggtct ctggagctag caatctcgca cctagcagtc cacaattgct 9061 acagcgaatt gtagaagaag caaatctact gaaagatgaa gaaattcaac ccattgctga 9121 cttaatttac acatctggtt caggtttaga tccgcatatc tctatcaaag cagtacggca 9181 gcagttggat agagtggctc gtgcccgtaa actcagacca gatgagatac ttccttttat 9241 aaataagtat acagaaggaa gatttttagg tatttttggt gagcctggag ttaatgttct 9301 aaaattaaat tatgctctgg atttagatga gtttaaccgt caacaaaata gataagtagt 9361 tattagtcat tgtttcttga cctgtgattt tcgcctttta aatactctca ggatttatca 9421 caccaccaga tgtttgataa tactcaaaca ccacaagccg gggctatgcc tgcaaccttt 9481 cctgctagca taagtccagc aagacgaggt aagcataaaa tttacatagg tatggcaccc 9541 ggagtaggca aaacctaccg gatgctggaa gaagctcatg cactcaaaga cgaaggaatt 9601 gatgttgtta ttgggctttt agaaacccac 
ggacgcaaag aaacagggca aaaggcaaac 9661 gggttagaaa taataccccg taaggaaatt cctcgcggtg gattaactct aacagaaatg 9721 gatacgaatg cgatcattgc tcgctcacct cagttagcat tgattgatga attagcacat 9781 acaaatgtcc caggttccct acgagaaaaa cgctaccaag atgtagaagt cattttggca 9841 tcaggtattg atgtctactc cacaatgaat attcagcatt tggagagtct caatgatttg 9901 gtagcgcgaa ttactggtgt ggttgtgcga gaacgggttc ctgaccgtat tttagaagaa 9961 gcggatgagg tggtagtaat agatgttaca ccagaaacac tgcaagaacg cttgttagaa 10021 ggcaaaatct acgcgccgca aaaaattcaa caatcactag ataacttttt ccaacgccgt 10081 aacctgattg ccttaagaga gttagcactg cgggaggtag cagataacgt tgaggaagat 10141 gccgttgctt ccactccaca aggtcaattc tgtaacattc acgagcgagt tttggtatgt 10201 gtatccactt atcccaactc aatccaatta ttacgtcggg gagcgagaat tgcgagttac 10261 atgagtgctc ctctatttac catatatatt tctcatccag agcgcttcct gactaaggag 10321 gaaagtttgc acatcgaaac ttgtgaaaaa ctttgcaaag aatttgacgg tacattcatt 10381 cgtaccactg gcactgacgt agccaaggcg atcgcacaaa tcgctgaaca ataccgtatc 10441 actcaaattg tgattggggc aagtcagcga tcacgctggc aaattctctt caaaggcgct 10501 ttaacccata aattgctacg attactgaaa aacgttgatt tacatattat ttccgctgag 10561 aaaaacaata ctcccactcg ttcagcaatt gattaaaaat agtaaccaca aatacacaag 10621 ggatatacgc ggataatttt gagataatta tctgcgtacc ctccgggttc gggggttcgc 10681 aatcgacggg aaccgccaag actgcgaccc cctcaccgct cagagggcgt ctacaacaac 10741 atcagcggtt taaaatcaaa tgactcgtgt aattggttta ataagtggca cgtctgtaga 10801 tggtatagac gccgccttgg tagatatttc tggtacagac ttggatatca aaattgagtt 10861 agtggctggt gcaacatatc cttatccagc agatttgaga gaacgcattc tagcagtttg 10921 tgctggtgtt gcgatttcga tggcagagtt ggcagaattg gatgatgcga tcgcttgtac 10981 ttttgctcaa gctgcacaaa atattcaaat tggtcaccac aagcccactc taattggttc 11041 tcatggtcaa acagtatatc ataaaccacc atcacaacct atcacaccaa tcgggtatag 11101 cttgcaactt gggcgtggtg aatcaattgc caatcaaaca ggtataacaa cgatttctaa 11161 ctttcgtgta gcggatattg ctgctgctgg tcatggtgcg ccccttgtac cacgtatcga 11221 tgcggctttg ctcagtcatc ctcaagaaga acgttgtatt caaaatattg gtggcattgg 11281 taatgttact tatataccag 
tccgtcgtga caactggcta gaaaaaattc gcgcctggga 11341 cacaggacca ggaaatagtc ttttggatct ggcggtggag catttaacgg atggtgcgaa 11401 aacttatgat gaaaatggtt cttgggcagc gagtggtact ccctgttctc ccttagtaga 11461 gcactggcta agccttgact actttcatct accaccaccc aaatccacag gtcgagaatt 11521 atttggtgtt gattacctgc atcaatgttt acaagacgcc gaagcctacc aactcagtcc 11581 agccgactta ctggcgacgc ttacggaact caccgcagct tcaattgttc atagttaccg 11641 gactttttta ccgcaaatgc cgcagcgagt gcttttgtgt ggcggaggta gtcgcaatct 11701 ctatctcaag cgtcggttac aggtactatt ggaacctgtg ccagttgtta ccacagatga 11761 agttggtttg agtgctgatt ttaaagaagc aatagccttt gcagttttgg catactggcg 11821 atctatgggt actcctggta acttaccaac agcaaccgga gcgcgtcaag aagtactgct 11881 gggtcaaatt cactcaccgt tggttgtgag cgcctaaggg acgcgattac acgcccttct 11941 taaaacagtt atcagttatc agttatcagt tatcaagtta agaggggtca agaaatcgct 12001 tcctctgttt actgttcact gttcactgtt cactgtattt gtagcccttc tgactgtatt 12061 ccaatacagt cgaagtgaaa aaacactaag ctgtatatat ggcataaaaa ctctgctgac 12121 taactacctg taacttaact gattatgttt gaagcattag ctgaccgttt agaaggtgcc 12181 tggaagaaac tccggggtca ggacaaaatc tcccaatcga atattcaaga cgctttgcgc 12241 gaagtgcgtc gtgcgctgtt agaagcggat gttaatcttc aggtagttaa agattttgtc 12301 accgaagttg aaaccaaggc gctgggagcg gaagtcattg ctggcgtccg acctgaccaa 12361 cagttcatca agattgtcca cgatgaacta gtgcaggtga tgggggaaga gaatgttcct 12421 ttagcacaag ctgatcactc tcccacagtt gttctgatgg cagggttgca gggtactggg 12481 aaaacgactg ctactgccaa gttagccttg catctacgta aattagaacg cagctgctta 12541 atggtggcga cggacgtgta tcgtccagcg gctattgacc aattgatcac actaggtaag 12601 caaattgacg tgccagtgtt tgaactcgga tctgatgctg atccagttga gattgcacgg 12661 cagggcgtgg aacgcgccaa agcagaaggg gttgacacag ttattattga tactgctggt 12721 cggttacaaa ttgaccaaga catgatggcg gagttagccc agatcaaaaa aactgtccaa 12781 cccgatgaaa ctcttttggt ggtagactcc atgacgggtc aagaagccgc caatctcacc 12841 cgtactttcc acgaaaaaat tgggattacg ggtgcaattc ttaccaagtt ggatggtgat 12901 agccggggtg gggcagcact ttctgtgcgg catatctcag gagcacccat taagtttgtc 12961 
ggtgtgggtg aaaaggttga agctttacaa ccgttttatc ctgaccgtat ggcatcgcgc 13021 attttgggca tgggcgatgt tctgaccttg gtagaaaagg ctcaagaaga aattgacctt 13081 gctgatgctg agaaaatgca ggagaaaatc ctgtcagcga acttcgactt taccgatttt 13141 ctcaagcaaa tgcgcctact caagaacatg ggttcactgg gtggtataat taagctcatc 13201 cctgggatga acaagctaac agatgaccaa cttaagcacg gagaaaccca gcttaagcgc 13261 tgtgaagcga tgattaattc catgacgcgt caagagcgcc agaaccccga tttgttggca 13321 agttctccta gtagacggcg acgcattgcg aatggagcgg gttataaaga gacagacgtt 13381 agtaaactgg tcagtgactt ccaaagaatg cgcaatatga tgcagcaaat gggtcaaggg 13441 cagttcgctg gtatgccagg aatgtttggt ggaatgggtg gtccggctgc tgctggtaac 13501 agtccagctg cccctggttg gcggggttac aatagcggcg ctggctcgaa aaagaagccg 13561 aagaaagaca aaaagaagaa aggcttcggt acgctttagc cactcatgag tggttagccg 13621 caattcacaa ataacaaaca gctactaaca cctaagactt ggcaaatgct aaaataggca 13681 ttttagcacc agaagtaaaa ctgcctaccc aaggggacag ctttcaacag gaataacatt 13741 gcctttggaa ctgtcttcag gaaacttcac gctcacctaa cttgccaata aactaactgc 13801 aaacagattc ctaaaaacag gagaatgatt tcttaaccat gattaaactg cgcctgaagc 13861 gatacggtaa gaagggagaa ccaagttacc gcataatcgc aatcaacaac ctcgctcgcc 13921 gcgatggtcg tccgttggaa gaactgggtt tttataaccc cagaactgat gaagtacgac 13981 tggatgttcc cggaatcgtc aagcgactac aacaaggcgc tcaaccgact gataccgtac 14041 gtcacatcct gcaaaaagcc aatgtctttg aacaggtcag tggaaaagcc tcaacataaa 14101 atcggaacaa aatcccccac aaccagtcca gactacgttg gactggtacg gtttctgatg 14161 caaccgtttt tagattctcc tgagtcttta agcatcgatt gtgaaatgtc taacaccctc 14221 aaccgtgctt ggattcgcat tgcctttgaa agcttagata agggaaaagt atttggtcga 14281 ggtggacgta atattcaagc gattcgcact gtgattactg cagcagctca aacagctgga 14341 cactcagtat acctggacat ctatggcagc agtgctggta atcgggatga ctcatctttt 14401 gaagaagaca gggaagaacg attaccgcca ccaaaaccca gagaaaaacg tggaaacgaa 14461 cataaaccta ttgctagaat acgctcccac tagatttcac acaaagtccg taacaaaagg 14521 ttaaaaggtg aataccagca ctcagtaact agtcaaccgc agcttttatc gacttaggaa 14581 attcaagtca aagtttgtac cgaagtgcga tcatagtaga aaagctagca 
tttcaaaaaa 14641 aacccggtta gcaatagctt ggtttggcac agttgcaagc ttcatttgat tcggtctacc 14701 ccaaagcgtc aaaggtgttg tgtaagcgcc gagcatgatt gctaccatct tacgcctact 14761 cggttgagtc tagcactcgt aaactaagta aagcgctcac accataagtc agaaaataaa 14821 aagcagcggt tgactacgtt tttagatgat acttagagaa tctctgtaag gaggtgaaac 14881 cagggagtgg ctgaagtccg tttgggtgaa gatgagtcaa ttgactcagc attaaggcgt 14941 tttaaaaaga aaattcaaaa agccggaatt ttatccgagg tcaagcgccg agaaagatac 15001 gaaaaaccca gtctgcgccg taagcgcaaa gcggaagccg cacgcaaagg tggtcgcttc 15061 taaatcaaag tcatatcttt gatgatggcg gttaaccaat gccagctagc tatttacgtt 15121 aaggtttgaa agttgttgta gagcaaactc agggaaagta gagcagcgat tttacctgcc 15181 tctacttttc ctaaccaagt ccgatttcaa cgccaatgtt cttaacatca actaaacaac 15241 aaaattgttt gaattgttta gttcaattaa agtctgtgaa gtcttttcct aattttgaat 15301 tctgaattta tgatgtcagg tgccttaacg attgaactgc cgaatcttcc aagtgcgatc 15361 gcactagcag gagaggggga agaaaatctc aaaactttgg cacaacaaac aggagccaat 15421 atagttctgc ggggacaaga actgtatatt tctggcacag aaaagcaagt ggatttggcg 15481 agtcgattag tgcgatcgct tgaagacctt tggggcaaag gcaacaatat ttctagtaca 15541 gatatattaa cagctcgcca agccttggac actcatcgtg aaggggaact gctagattta 15601 cagcgggaca tcattgctag aacgcgtaaa agtgaggaag tccgcgccaa aaccttccga 15661 caacgacagt atatcgaagc actacgcaaa cgggatttaa ctttttgcac tggtccagcc 15721 ggtactggta agactttcct cgcagtcgtc gtcgctgtac aagcacttct ggcgaaccaa 15781 tttgaacggc tgattctcac tcgtcctgct gtcgaagctg gcgaaagact tggtttttta 15841 ccaggagatt tgcagcagaa agtgaatccc tatttgcgtc cactctacga tgctatcaac 15901 gaatttattg atccagaaaa agttccatca ctgatggaaa gaggcgtcat tgaagttgct 15961 cctttagcgt atatgcgggg acggactctc aacaacgctt ttgtcattgt tgatgaagct 16021 caaaacacca caccagctca aatgaaaatg gttttgactc gtttgggttt ccgttcccgt 16081 atggtgatta caggtgacat cacacaaacc gacttaccaa ccaaccaaca atctggatta 16141 acggtagcta tacaaatttt aaaacacgta gaaggcatag ctttttgcga attctctcaa 16201 aaagatgttg tgcgccatcc tcttgttcag cgtatagtcg ctgcatacga acaacatgaa 16261 aaatagtccc gagtcatgaa aaaatgacaa 
atgacaaatg actaatgaca catgactaaa 16321 acactaaaag aatttttgga agcttgtgaa actctgggaa cactgcgtct gattgtcaca 16381 agcagtgctg ctgttttgga agcacgcggt agagtggaaa agctgtttta tgcagaactc 16441 ccaaaaggta agtacgcgaa catgcacact gaaggttttg aatttcactt gaatatggat 16501 aaaattacgc aggtgaaatt tgaaacgggt gaagcgaagc gaggtaactt taccacctat 16561 gccattcggt ttttggatga caaacaagag ccagctttga gtttgtttct acaatggggt 16621 aaaccaggag aatacgaacc tggacaagtg gaggcttggc agactttacg ggagcaatat 16681 ggagaactct gggaacctgc accagcagag atttgatgag aattaactca tcccacggtt 16741 atgttaaacc ccgcagggtt tcaaccaaag acttgcgggg tttgccattt acgactcctt 16801 aattgacaag tgtcttatgt ttaagctata atagaaaact gcataacatc atgattacta 16861 ccacgcaaaa cagactcaat ttgttttccg tgcaagcgct tcttcaaaaa agcttctaaa 16921 aagccttgaa taacgcctaa agtatagctg cagtttcgta atgatccctc tgcttcacca 16981 gcagaacaaa aagtttcttt ggcatagact ttgtaaatgt ctccttctag ttctatcttg 17041 tcgatcaggc acaagcgagt tccctctttg cccagaacca aattaatctt gtattggatt 17101 tgttcaagag acaatattgc ttcattgcca actaaattta agtctctagc caaatttttg 17161 ccttgattgc gaccagcaga tatcattgcg ataccagcgg ttttgtcacc taaagtttct 17221 tcgataccaa caagaattgc tttcaaaaag attatactgc taaaatcacc aagttctgga 17281 cgcaaatgcc ccattttatg acttataaaa gtctgttcac ttgatatcat cttttccttt 17341 cctttgtctg ttgagtttat ctagaattcg ttttacggta gaaattagtc ttaagtttcc 17401 ctgtaattat tgtcagagtt ttgtcatatc taactcagtt tgtctggttt tttccctgaa 17461 aatatagcgt ttaccggttt cagctatcga atgatggtaa gtgtaatccc atcactctat 17521 ttttgtcata gtttaagttg catttggtgt ttgaattttg catttccatg caaaatttca 17581 aatctacaat ccatgcattg gttgaccttt aacccaatag ggatgttatt ttagcaacag 17641 cccgtttgat ttctagaaac aataagcctt gcttgacaga ggcatgagcg agaaccaata 17701 aaattgcatc accacagcta accaacacgc catagccttt ctcaccttca acgaagatac 17761 gatcaatatt accgcgaacg agttcgtgtc ctatttgttc gctcaacgag agcatagatg 17821 cagatatggc agcagtgcgc ccttcattca ttcctggtgg taacacagaa gccaaagcta 17881 agccatcagg agttgccaag attgcaccct gaacccctgt gctgtttgag acaaaatttt 17941 gcagctcgtc 
ttgtagcatg gaaacgtgaa tcctaactct ttccattgtg agtgagggtg 18001 ttttagtact ttttacagat gtcgctatta tagtaaatcg ctatacctac aatagtatta 18061 attaagtagg cgtcatgaac tgcgtacacc agcacgcatg aaaaattttc tcatctcccc 18121 ctgctcaaag ctccctgctc ccttgctttt tcgagtcaga tgggtgtaaa taaacagaac 18181 tatattgcaa tcggcaaagg ggcagggaaa aggggaaaac ctttaacctt ttctctttgc 18241 gctgacaaaa cgttatatac ttcgctaaaa ttagtgccaa cctactcgga gatcatttat 18301 aaagtaaatc ttatctgagg agggttttcc tccgggagaa actgtccctc ctattgtagc 18361 acaagcgctt gaaaaaaaat tttgtgagtc ttcttagctc tcacaaaaaa gtaaacttaa 18421 agacatcgta atcaccacca cgaagcactg attccgtgtg tttaccttgc aagcgcttac 18481 cgagaacttg ctctataaat ccccaaacaa caccgagcgt gtaagtacat ttacgctcgg 18541 aaccttgagg ctcgcccgct gaacaaaagg tttctgaagt gtaaacatga ataacatcgt 18601 cttctcttat gacgttatta atgatgcaca agcaagttcc gtctttacct aaagctgttt 18661 cgagtttgta gccaatatcc tcaaaggagc aggatgatcc tgagaaacat aattcttgtg 18721 caagatcttt gccgcgatga cgcccagctg tagtcaaggc gatcgcagct gccttttcac 18781 ctaaagcttt ttctattccg gtaactgccg acttaaaaca gataatacta ttgaagtctc 18841 caagaattgg acgtagcttt gcctctgatt ttactagagc tttgtcctca tctggcagcc 18901 caacacgtct ttccgctaaa gacaaggctt gtggttgtaa agtcatgtga tagttctcct 18961 agtatcaaca aaaaccagat ttcatcaaaa gagcgcgtct agtctaattt gttggctaaa 19021 aattaatcgg ttgatgagtc tgttttcctt cttccttgtg cctctgcact cctgcaccct 19081 tgcaacttca atccaacatt ttagtgtgga cgcagtacta ctcaatgtct ttctatattt 19141 atagttaggt aattactgat aatcccaaaa aatacatata attgatacaa gtatttatta 19201 cttaatatat tcctaataaa tatttacaat aaaattaaca aaaagttgaa atataaaggt 19261 ttattatcta tcctgtagat aaaaaccgac aggctggttt taagcgagtc ataaagtcca 19321 aaaaaattct agttcacttg agcttgcagt ggtctagctt gctatgctca ccacttccga 19381 cctctaccat tactcctcag tggcgctgct aagttttgtt tcgacgaaaa ctaaataatc 19441 cttggaagaa cgaccgaatg acacgcagcc aaccaactaa cttttggaga aacgactgag 19501 agacatttgt ttggcccgaa gagtctatga ctggagttgg taagttttct gctacaagag 19561 tcgtttctgc tactatgtta gttaacgcct gcgtgttatc ggtttgactg gaagagtcta 
19621 caattgactc tggtaaattt tctgctacag aagtcgtttc tgtgacttcg ttccttaaaa 19681 tagcagttgg gaaaacttct tctgctagat cactcatcag gagttgaaag gcaatctgct 19741 gcactttttc tacagcaagt ccaagttgat tggcaatttc cctcaaggaa atggtaccat 19801 tcgccaactg ccaaacttgc cattccagag agtctaactg gaactgaggc tgagtcatat 19861 tcttatttga caggattaaa cttaactcag gcagttgttc tgctagtaga ctccagtctc 19921 gcagcgataa gccggaggct tgacgcttcg cgtatcgcaa accgataagt gttacttcag 19981 ttgctggtag actcatgcct gtcatctctg ctgagggtaa atcagcttcg gcatcaaatt 20041 caaactgacc atccttgagt tcaaataaag cagatacttg atctacaacc tgggtacgaa 20101 acagtagttt cagttgttca gcttgcagca gtccttggaa cttgaggcat aaacctatcg 20161 gtgtgttaac ccgacagatc ttagccattc tcaaagtaac atcctggctg acccagcccc 20221 gttgggcgat cattgaaacc aagccttttt cgtctagacg atcagcagca gccacaatcc 20281 gaccttgata cagccagatg taataaactg gcataaaacc agtcacatca gcagtaatgg 20341 tacgaattgt gagtaatcct gtgttgtctc cttgctctag caattgaaag atttttggca 20401 aggaaaattc tgctaaatta ccagtaaata tcatattagt tttacttagt agtctgacag 20461 actagcaggt tgaaatgtaa attttatatc tagtacctct ctcaaaaatt ttgtcagggt 20521 atgaaataaa gctacgaagt taatagactg ataagctgtt tttaatgaga gttatgctga 20581 taaggcttaa taaataagct aggtcttgtt gataaagtca gacctagctt tttacccgtt 20641 gacaaatggg aatttcccag cggctttggt taaaatttta caggtcacaa agcacgtgtg 20701 accttcatag gcagctaggc gttatgatcg acttgattgt acatccaaat ttctgcaatt 20761 ttctgtaaat gtgcgctgaa gtgccacaat tttagttagc agtttctagc cgcttggtga 20821 actttaacag ttatcagtta tcagttatca gttatcagtt atcactgttc actgtttact 20881 gtttactgtt cactgttcac tgttcactga tttaaaggca gaattagtac atcatagcct 20941 ttctcaattc tgtaagaaca cgcttaattt ctagcaagag caagccctgc ttggcactat 21001 cacttgctaa aacgagtagt acagcatcct cgccgcatcc attgagaatt ccaaaaccct 21061 tgtcaccttc aacataaatc cgctctatat tgcctctggt taactcactc ccaatgcgtt 21121 caccaagaga tatcatagcc gcagccattg ctgatacccg ctcttcatcc atcccgcctg 21181 gtaagcttga ggataaagtt agaccatcgg gagaaacaag cactgcacct tgcacatcac 21241 tggttgcagt gacaaagttt tgcagaatgc tgctaagttt 
tgctgtgttg attgccatct 21301 tacactccta tgcaagtttt gattttgtaa cctgtagata atgtttggtt gccaattcag 21361 ttttctattt gactacctgc attacaggat ttttcaaatg attaaaccac aaatcgtaca 21421 gtgggcattg tccataaaag tcatttgatg gacatgtgaa atatagcgct tctggtgttg 21481 tgtgcaatat acggttgtaa tgaaagtgca tgcactttca ttacattatt cgtggtagtg 21541 tgtgtgattc aaatgagaac cgctatattc gttcgtggca atgctcacca aagtagaaaa 21601 cccctgtaat aagcttgcag gcaaattcta tctgtgtgaa cagagtttgg gtaagaggat 21661 gactgcattt gctgtattaa tgatcatcct ctaaatatat gcgatgaggc agtgggaggc 21721 gagagtttca caggcttaga cgcccgccag aagacttccc gcaatctgcc gcaactgccg 21781 taagcgcagc gtctggtgga ggagatacct gatagccctc cgggttcgcc agagcatagc 21841 cgcgttgtct tcaattggtt tggtgcagca ccgatagctt gcgtggcgga gccattccaa 21901 ttgcttctta cccttacacc ccttcttttt atttcaaagg ggtgtaaact cgaactccgt 21961 ttgaagattt atttgatcgg caacaggaac ttgtcgagct tggtggcgtc tttgcataac 22021 ctgatcgata aagccaccca gtatacctag ggtaaatggg cacaaacggc ttgaacctcc 22081 ggtctcacca gacatttcaa ctggctcggt cacctttacc ctaataaagt tttcttgttg 22141 agaaacttct tcgatcatac acagtcgtgt tccttccata cctaaacaca agtttaggcg 22201 ctttagtata gttggaagat cttgtgattc cattgatgcc ccaagtgctg tggcaatttt 22261 cttaccgtaa gtacgacccg cagcactaat tataactgct gcagctttgg aacctaggtt 22321 atcttctatg ccatccagca aagctttgta gtccattaca ttgacgaaat ctcctagctg 22381 tggtctgaga gtacaggaag ttgccatcat acggtgatca ctcatagaga aagaaatatt 22441 tgtccaaaac ttgtagatag atggtttggt tgatcgacgt gccataaaaa acctgtccct 22501 tgcgggcaat ttcttaaccc gaagggcgct gctctgggca acggactggc tcccctgccg 22561 cacttctgtt gtttcaccag tctctacgat gggaaacccg tctccatgtc tagcgctcaa 22621 caaacactcg ttcacgtcct gcttacgagt ttgtacttaa gtcgctgctt tctggaacgt 22681 cttgcacccg aattccattt agctctaaag gttttacggc tatctggata ggttaaggag 22741 ctactaatca ggtggttatt tgctgcctga tgcctttgtt taagataaca ttaaaaatga 22801 catattcagg ataattactg atttttttgg atgatgggct gaaaacacac cttaaggttg 22861 ataccataca atacaattcg cttagaaaaa gttatttgtt gaggcaagca tttgtaggca 22921 taaaagcgcc cagctatcag 
gtggtgtgct ggtagtagtt agcaaattct tcgacgagtg 22981 cattcaaagc ttgtttgacc gaagtcacct cggtagcatt gactgtaatt attggtggtc 23041 gagttttctc atcgtccagt cctaaggcga tcgccacatc ttccgcttcc catgcatctg 23101 ggcaatcagt atgagtcaat ccaatcagat acggaatttg cacccgttga ttcataaagt 23161 tgagttgttt acgaccatag cgaaaatggt gtggacggtg agctgctact agcaaaatgt 23221 aagcgtgggc ttttgcaatc aatatctccc acatatagtc aaatctactt tgtcctggtg 23281 tgccatacag gtgaagtgat tgatttggtc ctatagtcag ccgcccaaaa tccaaagtga 23341 ctgtggttgt tttcttcagt agtccaactt catctgtcgc tttcctgtca gtatccacaa 23401 cttctatttc actaattgtc cgaatcaagg ttgttttccc cgcacccata cctcctgtca 23461 ccactatacg taggatttcc atctatactc tgctactcca tacaacttat ataatgctga 23521 acattgatgt ttacttgtgc ccaaagtcgc tcaccgttgc cccgaatctc cgattcgcgt 23581 cagcctgttc agcaaacaac aggcgcttgc gtatacggac gatcgctctt gaggaaaaac 23641 taccatcagc gagcttccac acctgtcact cgaaaggctt caagtggatt tgtggttggt 23701 cttgcgtaag attttgtatt cctgaactag tatgcagcac tacatcaccc ggtactttat 23761 tttttatctt acttagcttt tactcaaaaa aagtttacaa aacgcaacta aaattcaaaa 23821 ctgattcaga agttctatta aactgacatc ttgcacagta ccgagaactc gatgacgccg 23881 agataagtta gtcatctgtt tgacagaaac ccaagcatag cttaagggtg gaagttccga 23941 aacctgatgt actcgcttgc cgtgatgggg agtataaaat ttgagtaata cgtcagattt 24001 gcctcttatg tctacaaaat taacagcaga tgagtgtaaa acttgagcaa ttgtaggttg 24061 ttggataaag gctttgacat cagggttgta gtcgccaaga aagccactac taccagcagc 24121 agccaaagat atccaaacag gaattaacca actcgctaac cagtaacgag aagttagaga 24181 ctttttgccc aggtggtaac gaccaatcca tactaaaggt aatattaacc aactggttcc 24241 tgataccaat gccacgatag cgtacttatc aacttgtgca tcaccccaaa ctaaagcaac 24301 cattcccacc cctaacagca aaacacctag tacgccaaag gcataactca gattgcgagg 24361 aaaacagatg ggaggaactg gggatgagga cgatgaagag gataagaaag tcatttctcg 24421 ccttgctccc cctgctcccg ttcgcgcagc gtgtccggag gactcagctt cccctgctcc 24481 ccttgctctc tgttgttttc tcatccctcc actcagccaa tttaacccta cagcagcaaa 24541 caaagcgatg aatggataca gcaaaagact gtagtgagat aaacgagtgg aaaaaaaact 24601 
tagttcggca aataaaatga gcggatagcc aactaatagg agatggtagc gaggaatagg 24661 acgacgaatg agcaaaacta agcctaaaag actgaaaaaa aaccaaggaa atgcttttag 24721 gggtatatcc cagaaataaa accctaaccc attgtgagcg cgttcaccag aacctaattt 24781 gagaacaaag ttaattaatt gtccaaaact atcatcaccg tatcgtaacc agctaaacca 24841 taaccagata aaggtaggaa ttaaccctac cacgaacccg aaatacaaca ttgggtttgc 24901 aagatgacga tgacggcgat gttcccatat gagatagggt agcaaagcta tgattggcaa 24961 aaaaatcata aagcttctga ctaaaaagcc caagccaaaa ctcaaaccag caagaaagca 25021 ccaaacaaag ccatatttgg ggtgtaattc tgctttcagt aaagaccaaa tcgcgaaaag 25081 aatcaggaaa ataacaggca catctggtgt gcctaaacgg cagtattgta gccagagaaa 25141 ttctacactc aaaattgccg cagcaagcca agcaagtttt ttcccaagaa gaattttgcc 25201 tatttcatag acaagttgta tgctcagaac accagcaatc atgcttggta agcgcgcact 25261 cgcttcactg atgccaaata atctgtaaca actggctatt aaccaataag gaccaggagt 25321 tttatgatgg ggagttgtcc aaggatgtat ccaatcacca gtgtctatca tcaggcgcga 25381 gcgccaagcg taaagccctt catcatgcgc catcaggcta ctctccccag aactgaacag 25441 caataagggt agtgcccaaa ttaacagcaa aaccagggga aaagcactta aaaagccgat 25501 ggtaagcttg gtttgatgaa acaagtgtaa tctatgtaac ataaaatcat gggcagccaa 25561 attcttcttt tggctcatga atatttttgt gtctattttt aaaatacgta tggtgatata 25621 tcaattgttt ccaccaaacg gaatcaaaat tgacaaaact ttggaaaaat tggaaaaact 25681 tgttcttaaa agatgtttat gcccctgaaa ccagacgaac gcactaagct agacagcaca 25741 gacgacaagc tattttatga atatccccgc tttgtcactc atgtggacga aggttttatt 25801 caacagctga cggatttata tcgcgagcgc ctcaaaccca atacccgtat cttggacatg 25861 atgagcagtt gggtgtcaca tctaccacaa gagatacctt ttgcccatgt tgagggacac 25921 ggactcaatg cagaggaact agcacggaat cctcaattca atcattactt tgtccaaaat 25981 cttaacgaaa atcctcaact acccctcaaa gaccaagatt ttgatgccgt tcttaactgc 26041 gtttcagtac agtatttgca atatccagaa gcggtctttt cagaaattca ccgcatcctc 26101 aaacctggtg gtgtggcaat tttcagcttt tctaaccgca tgttttttca aaaagcgatt 26161 caagcatggc gagagggtac agaagccagt agagtggaat tggtaaaaag ctatttctct 26221 gcagttccag gacttacacc tccagaggta attgctcgtc agtcaagtct 
tcctaatttc 26281 ttacagtgga tgggtgtcgc aggaggcgat ccgttttatg ctgttattgc ttaccgtagt 26341 ccttagtcat cggtaattgc gaaaaacagt catcaaaaag taaattttcc aggtattagc 26401 caaaacaaac gtcgataatt ttcacagaaa gccaagcaaa aagaggagtc caatatgttt 26461 ttcccccgtg ctcggagagt tttagcagct ttgttgttat gtttgttact gtttacaaca 26521 gcctgcgcgc cgaagactcc tggacgtttt gaccaggcac agcaggaaag cagccgacaa 26581 agaagcggtc aggcggttgc gaaagattca acccaaggta gcgaatttaa caaatttttc 26641 ccaagtgctg acgctggcta ccaacgcgtc tatacccaag agaaaaaagg ctttgctgaa 26701 gcgaaattga aaaaagacgg caaagaccta gctgtacttt ccattaatga tacacaagca 26761 gtcaaaggtg ccgcaaaccc agcggcaaaa ttcgtgaaca gcccaaagac aatagccgga 26821 tatccagctg ttagtcaggg gagtactggt actgctattt tagttgctaa tcgctatcaa 26881 gtgaaagtgc aatctcgcga cgcttcattt acagaggctg aacgcgaagc ttggatacag 26941 aaatttaact taggtggtct ggcgcgactc gctaaagccc aatagctaca aagacaagct 27001 gttaggagtt tgagaactga aatcctgagt caccagaaaa taattaagta ggagtctatt 27061 tgtgagtaaa tcaatttttg agttggttga tgaactacca accagcaact tgacggtttc 27121 ggcattgcga tcgctcgatt ttgttgctcc tggtgagtgg caaaatgtag ttggctttgt 27181 caacacgatc aaaactgtca ctggtgaaga cgacgaagac ctcattcaac aaattggcga 27241 acgagcggtt tatctctaca atgatcgctc tcaaggatac caaagagcaa tgtggcttta 27301 tcaaaccgtt gatagcacag ataaagcact tggtgcagcc gctttagcaa acaaagtggg 27361 tgagaaaatt ccccttttgg gttttttaaa ccgagtcact cccaaagcgg agaaagcaca 27421 aactatagac ttgtgcttga aattagttgc tgagttagtc gccttctgcc aaattaacgg 27481 tattcctgga gacagtattg gggattttgt cgcgtctttg ggagaatata gcggtgaatc 27541 gttcatccgc atggctgcat tagtttgctt tgatggtttg atacccctag gtccagactt 27601 tatcagtagc gcactatcga gaattaatca gacaagtcct caagagttag atcaaaactc 27661 gacctttgcg aatattagag aggcaattcc aggtaatgat tctagcagca aactgaactt 27721 tatcggtgaa agcttccatt cagtcagtgg ttggatgagt ggtctagttg cttcaaataa 27781 cttaacccca caaaaggtag ctaacaactt gcaaaatttt gtcgattttg ctgatgataa 27841 gctagactac ctcgctgcat tcctggatgt atcgacaaat tactacgaac atacaggtac 27901 acaaacttta gcacgtcgat tgattgagcg 
agcttcggcg gaaatttaac tagtttggcg 27961 taagtccccc gccataatgt actcttttgg cggcgggata taagccaatg ttggacgata 28021 aaactctagt cctaggtgct tctaggtggc gggtcccaat cccccactga ttgattctgt 28081 agcgctgaat tctgacttct gattcaaata tgcacgtatc gtataatttg tcaacgttaa 28141 tggtgcgatc tacgttcttc ttaagtttcc aaaggttcac attgaaatca acctcagggc 28201 tggttttatc agccagataa gttgcgtata gtaaatcagg tgatgtgctt gttctttttc 28261 cgtagaagca cggcaatgcc gtgcttttct tgctgcgcca aattaacttt ctgatgtcgc 28321 tacttcttcc aagcaaatat aacgcttaat cctgctccat aaaacttata caaatcgctc 28381 aaaactgagc aaatcgatgt ttacttcccg cttcaatcaa ggcagtcggc tgctacttgt 28441 ccacgggcgt atcttccggt tgagccagcc gtacatactc attcgaggca ttcaacaaat 28501 tcaaagccgc gttcaaatct ctatctaacg tgattgtgtg agctacgcaa ctagagttaa 28561 tgcatttgta aactctatca gatagtttca aatcatcgcg ctttgcaccg cacatgcaac 28621 acgtttttga tgatggatac catctacagc cgattgcagg taaatgaggt acagtcatat 28681 aagtgcatca actcttaaaa tgaaagctta ctcaattgac ttgcgagaaa aaataataaa 28741 ggcttacgaa cagacagaca cctcaattag gaaggtggct gatagatttg gtgtcgctaa 28801 aagctttgta caaaaacttc tctccatgaa gaaaattcaa ggtcacgtag aacccaaaca 28861 acagggcgga gcaatgaagg gagagttgga tggatctgaa gctcaattag ctgcaatggt 28921 tgaacaatat ccagacgcga cattactaga atactgtgaa tattggggta caacttataa 28981 tcattggatc agcacgagta cgatgtgccg tacattacaa aaacaaaaac taacattaaa 29041 aaaaagacgc tacgcagcag ccaagggaaa acagaaagag tccaaaagtt gagaagtgag 29101 tactggcagc aagttaagca agtagatcca gaaaacttag tttttatcga tgaaatgggt 29161 gttttattgg gtttgacacg aactcatgct agaagccctc atggaagtag ggtgtatgat 29221 tttaaaccat tttatcgagg agccaaggtg acagtaattg gggcaattag cttaaaacaa 29281 gttttggctg ttatgactct gaatggttca atggatggaa acgcatttaa agtctttgtc 29341 gagaagtgtc tgcttcctca attatggaaa ggtgctgtag ttgttatgga taaccttcca 29401 gcacataaag tccaagaaat cgagccttta attgaatctg ttggtgccag tgttatctac 29461 cagtctccct attctcctga ctttaatcca atcgaacatt ggtggtcgca attaaaagct 29521 tttttacgac aattttctcc aactaatgca tttagggttg atgttctaat tagaactgcg 29581 cttgatttag 
tcaattcccg acatctaaaa aactggttta caaactgctg ttactgtacc 29641 tcataaacct gcaatacgct gtatcaacca actcaacttt agtcccgtaa acttgggatt 29701 tataggttag ttgacggcga aactcataaa atcctaaatc agagattgct gctgatagtt 29761 ttctattagc aaacatccca cttacgttca gatcttctat gcggattcga tagtatttgc 29821 ggcttatatc tgttgttaac ttgtgcaaga agtctcgacg ctgactagct atcgcgtaat 29881 gttgcctagc tagagagtca aaatacttct tagctttatt tgattgcttg acacccaaac 29941 gacggttacc caattgcttt ttgcggttac gccattgtaa tttagccagc ttgattttcg 30001 cttttttcat tggctttggc gattccacgc tagtcccgtc acttaaagtg gcaaaacact 30061 ttacccctaa atcaattcca actttctcct ctggatggta agtcggtggg attttctctg 30121 catctacagc aaaacttaca aaccatttac ccgcacaata gctaatcgta aaagtctgcg 30181 tgcaatatga acaaggcaaa gcttctttta gccgaaatat ccccagtgta ggaattttga 30241 tttgctttcc tgatcgaagc aataccttac cattgccgtc atagacagta aaagattctc 30301 catctttgcg acgcttaaat cttggtctgt cagatatacc ttgcccccat ctgttgtaag 30361 cactctccaa ggcttgtaat gagctttggt aaaccttgga agagagcttg gacatccatc 30421 gaaattcttg ctgctttttg gtgtagttcg tcaaaacctt tttgatagct ttgattttct 30481 taccgtaccc acctgctatc ccatcccttt ctagcccctg catgagtgct agcccgtagt 30541 tgtaaacaaa acgagcaaag ccagcatgtt tacgcaataa agtttgctct ttgttattca 30601 cttttagctc aactttgaca gcgtacattg cttgaatcat aaagggttat actgtctatt 30661 gtacacacac cttacggata tagcaagagt gaaacaggat aagagaatag acttacgagt 30721 aacacaaaca gagttggaat tgctagatga gtattgccag ttgaccggaa agaaccggac 30781 tgatgtgcta cgagaattca tccgtagctt gaaaaagaaa atgcggaatg ctgaaaaata 30841 ttcgatttag ctagattttt gactatctag tatcacctcc tttcaacgga taccatctga 30901 agatagaggt atttaccgac tgaattttat tatcttcagg aatatatcaa gaaatattat 30961 gtaaaggagg ctactattac cgctcaacta tccaaggatg ttgtcatcca ggaacgccac 31021 agctcaagta ttgctgaagc gttgacacat gctgtagaaa ttgctgatta ttgtgcagct 31081 aacgccaatg ccatagataa caatggcgct tttcccgaga gcgagtttaa gcgaattgca 31141 aaagcaggtt tactggctgc acctttgcag cgagagttag gcgggtgggg tgcaggtatt 31201 gatgccaacg ttacctacga atcactaatg ctattaaagc agatggggcg tgggaatttg 
31261 gcagtgggtc gagtttatga ggggcatgtg aatgcactgc aactgattca gagttttgga 31321 actagagaac agattgcagc ttatgcttgt gatgcccgcg ctcgccacaa aatctttggc 31381 gtttggaatg cagaagcctc tgatggtgtc aagattatcc ctcttgacaa tggtaagtat 31441 cgtctggaag gttctaaaac cttttgttcg ggatctggtt atgttgagcg tccctttgtg 31501 aatggagcct tacccgatgg tagttggcaa atgtgcattg tgccaatgga tgaagtgact 31561 acagtcagtg atcctaattg gtggcaacct tctggaatgc gagcaactgc tagctacaaa 31621 gtagatttta gcggcgtgga gttggcagaa agttcattaa ttgctaagcc aggggactac 31681 cttcgccagc cttggttgtc tgcgggagtg attcgctttg ctgcggtgca attgggtggg 31741 gcggaagcac tgtttgattt gactcgccag tatctccaga acatggaata tacaaacgat 31801 ccataccaaa aagaacgctt gggcaggatg gcgatcgcca ttgaaagcgg taatctctgg 31861 ctgcggggtg ctgcggatat ggtagcagct tatgcgcctg tgtttggagg ctatcccact 31921 gttgacaacc ctcaagcaga gcaacttgtc gcctacgcaa atatggtgcg gacgacaatt 31981 gaacaaattt gcattgacac gatgcaaatt tgtgagcgct gtattggcac tcgcggttta 32041 ttaccaccaa atccgatgga acgcatcatc cgggatttaa ctttgtacct acgtcaacct 32101 gcctttgatg cagcccttgc caatgttgga cagtatgtcc tagctgaaac tcatcctgct 32161 cgctcgcttt ggaataatga ataaccaggt tactgttgca tcaccactga ctcattctaa 32221 tagtttgcct tggcgttcac ttaaggatat tgcttgtggt tcggcgttag tcgttgcccc 32281 tcatcccgat gatgagacac taggttgtgg tggtgcgatc gccctactgc gttctctcaa 32341 tttgatggta cgtgttttgg tcatcagtaa cggtactctt tcacatccta attctcagaa 32401 atatcccgca cctgcgctgc aggcattgcg cgaaagtgaa acactttcag cgctttctgt 32461 attaggagtg gaagcgaatg ctgtcacttt tttacgactg caagatggct cagttccagc 32521 acagtataaa ggtgcagtga cgacttgcgt tgcttatcta acagaaattg cgccgcgaat 32581 gatcttttta ccgtggcgct acgatcctca cccagatcat caagccagtt ggaagttaat 32641 tcacactgcg ctgtgtgact cacacatatc accgcaatta atcgagtatc ctatttggga 32701 ctgggatcca gatcaacgtg gaaccctacc agaatctctt gaagtcacaa gttggcggtt 32761 ggatattagc gcggtagtgg agttgaaaca gcaagcgatc gccgcctatc gttcccaaac 32821 cacagattta attgatgatg atccagaagg ctttcgcctg actccggaaa tgcttttgaa 32881 ctttacccgt tcttgggaag tttatttaga agccaaaatt 
taagaactat aggatttcta 32941 tttgattttt aaacagaact ccgtacagtt tatgttaagc attcgggtgt taagcgttct 33001 ctgttgccta ttccctgttc cctgttccct gttccctgtt ccctctagta acttcataaa 33061 ccaaacccga ttcctatatg aaactggcaa tttttgatat tgatggaacc ctcactcaaa 33121 caaatgatgt agataaccaa tgcttcgtgc aagcatttgc caaggaattt caaatcaaag 33181 aaattaacac taattgggct acctatggac ataccactga ctctgggata gctttgcaaa 33241 tttttcagca gaactgggga cgtgttcccg aaaccagtga gttatgccaa ttgcagcaat 33301 gctttgtaga gttgttgcat ggtcattata cagaaactcc tggatcattt gtcgaaatac 33361 ctggagcttg tgtgatgtta cagcgcctag cccaaacaaa agattgggcg agcgcaattg 33421 ctacaggcgg atggcgtgct tcagccgaga tgaagctgca agcagcagga ttagatatca 33481 gggaactacc agcagcgttt gctgacgata gcatttcccg agaagatatt gtgaaaacgg 33541 ctgtgtcgag agctaaagag ttctaccatc agcctgattt tgaaagaatt gtttgcatcg 33601 gcgatggtat ttgggatgtt ttgactgcta tccagctgca actacctttt gttggtgtcg 33661 ccagcgatac acaaaagccg ctcttagaaa atgctggtgt agagtgtatt ataccagact 33721 ttgtagactt tgattctttt ttaaaagccc tagatactgc cagcatcccc aaccagcaca 33781 aaccgttgca ggtacaaaac aattccctgc caccgagtta ttttgaaacg ctctatggta 33841 gcaaccccga cccgtggaag tttgaaacga gcgaatacga aaatcaaaaa tatactgcta 33901 caatcgcagc tttacccaaa cagcgctatc attctggttt tgaaattggt ggttctattg 33961 gtgttttgac agagaagtta gctcaacgtt gcgactcgct actttcggtt gatgtatcaa 34021 aaatcgctca aaaaagagca attcaacgct gtcaacactt accgcaggtg cgctttgaaa 34081 ttatgtgttt gccacaggag tatcctgagg aaatgtttga tttgactgtg gtttctgaag 34141 ttggttacta ctggtgctgg gaagatctga aaaaagccca acagtgcatt ctcaaacacc 34201 ttgaaccagg aggacatctg cttttagttc actggacaca atatgctcct gattatcccc 34261 tcaatggcga tcaggttcac gactcatttt ttgatttaac gcctactcac ttgcggcatt 34321 taaaaggtaa acgggaaaaa gagtatcggc ttgacgtgtt tgaacgggtg ttttgatcaa 34381 cagagaacgc accaacacat aggtcagaac tcacaagtca ggagaagaaa caaacagttc 34441 cttgtgaata ctgaatactg aatactgata ctaagttgcg ttcagaggta gtatccacca 34501 gatgagggaa aaaagttaaa ga // LOCUS NODE_794_length_33752_cov_5.07858333752 bp DNA linear 
BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 33752) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 33752) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..33752 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(179..733) /locus_tag="DP116_06580" CDS complement(179..733) /locus_tag="DP116_06580" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06580" /translation="MLFLVEKEMYQIENQEHTVMNLENMHETLNELEAQKSQIERSIA SIRADISERKEKLRKVEFDIFGDIDERARKCLELEKSISLLNFNLSKYQAMLVDVEIK IPPLQKRIQDLNQHLQLLQNTNEIRQALEPFIELEQQYLAKRDEIDQMLRSKSHYLGL KALPATQKLIRKGTEYELRGTLHG" gene 1771..5532 /locus_tag="DP116_06585" CDS 1771..5532 /locus_tag="DP116_06585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995940.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06585" /translation="MFHYRQKLKTINKTFVPIIKKKIIRLLRALLVNQKKRRTSVNAG FVLPTVAMVALVVVLMTTAILFRSFERAKNASNVRVNEAVLNAANPAIERAKAKIEQL FGDSRLESSIPSDFSLEEIINKNLNQFTFGDETQLKLVKDKEIQTVWKYPVDTDNNGK FDSYTLYGIYFRTPTSNKATSVLQARTQPMDTGTVGTECQSLATTSANLVSTYGWYKV GEKLKKSIFVYTTTVPITDLTRLNTNKYETFKGNKGFVALEYQQDRERIPLSNNAVVY EDDLEIAPEEGISLNGRILTNGNLLTTRLDQPIKFYLVSSPKSCYFKEGNSKIIVAGN VIDSRGTDNSSHDDVQVDLFEQQDAPNNTMKSDVINNTNKTVPTSVYGNTAAYNDEAY AKRIERLVQATNSAYPNNAELPDEVKQKIERYLAEDFTLDPVKVREKKLKTYFKKRTR RVPYAEVAQGSNPLKYGSHDYETNSPLQGNGNSLRPVDEWVFPFNPADGKTATNYAEI GIKDNGSKLYLPATEPVEQAKADKEQKIGDRLLVGNNLPQLWYDTTKSKFLSSPDEGQ TITGKEWDVDRKGNNSTVTRKRFSQAYHFEDLGATDRDEFWEKSAAQKPQSSLDVVGG LRVVTGAGIYVDGPSSDSTASYPWDADVDPMTPDIQPRSFLSASRPNWDTNFVDPSNA DPNRIKVSSLQFKGKDPIIVWSDSMPMTDSNLVANPKVKGDLLMRTTAVYFYKDPSDK DNSGKDQMPIACVSSFYDPTNAITAQNQNTLLSTAKNPDADLTTISGKSNNGVVYPPY SGNRASAISTYNSQLEQQARLVFPNGRFVNEPLRNALEHYSAGKPLSMADNSAIDTAV CAIQILNDTLRPNSSPPVPHRAIYETTFLDARQIKAIENTANVSTLTLQEYTERQPLE VRVTVLDLDKLRTTKIGTSTPQEYLIPNSGIIYATRDDALEDKSNPGSRDVSATDFKL DPTRRPNAIMLINGSNLSRHTTYNPKKPEENEKGLILVSNLPVYIKGNFNVHTQEEFL DNSLKGEKDWSNKFYARQSPNPNFGCRPSQFPDCNIGETWRPAVVIADAITVLSDNFR FGFREENDYDSMQTATNTETNLIFAQGNTPERPTESNGGLENFVRYLERWEGVSHTVA GSFIQFKHSNYATAPWQTVINGSSGSNQNQNRTPFFTPPNRFWSYDTALLSQSPDLFS LHFTTPATNKPNEFYREVGRDDAWVKTLLCAQEINGSYAISQDQRGTCQ" gene 5641..6348 /locus_tag="DP116_06590" CDS 
5641..6348 /locus_tag="DP116_06590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870966.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II secretion system protein" /protein_id="PRJNA477356:DP116_06590" /translation="MIHQKQQQESLSSQSGFTIIESLVAVVVVGILLAAIAPVIILSV ATRLQAKRIELATNAAQTYIDGIKSSTIPSPSIITKSTEDSTDPPPPDAPSGTLICPT TGTGLCEISPTTASSQLYCVDGDTGGCTSDNFKDMIVQGFGYNPTSDKPEDGYRLGLR VYRADAFNKPSITLKALKDPGIKQAETFTSGTALIAIQAPLVEMTTEISSKVTTFSDF CRRLKPPASDSNPQSNC" gene 6360..7322 /locus_tag="DP116_06595" CDS 6360..7322 /locus_tag="DP116_06595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459789.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prepilin-type cleavage/methylation domain-containing protein" /protein_id="PRJNA477356:DP116_06595" /translation="MNPLKLLFIKQITLFRLKQKCDGFTLVELLVGIVIATLVITPLL GFMINVMTTERQEQAKANTEQEIKAALDYIARDLQQSVYIYDADGINKIRQQLRRYGD KNTFFPVLVFWKRTFLSKESSTILKDNTFFYSLVAYYLITEDNATWSKAARIGRFQIR DGYDSSTETDNKDNPRDKAKPDEGFQMFNLQGLGNLKSKMNQWTKKSEDYTQKIVPLV DYIDQTIINNTTNPAPPTCTIGQQVPKFDEKHDNDDAVATGNVRTRGFYVCVDSENTV AEVYLRGNALARLQNNNIDFNESRQTYFPQASIRVQGNGFLSAK" gene 7300..7914 /locus_tag="DP116_06600" CDS 7300..7914 /locus_tag="DP116_06600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858868.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="prepilin-type cleavage/methylation domain-containing protein" /protein_id="PRJNA477356:DP116_06600" /translation="MDSYLLNKFNKYSNSGFTLLELLVSLLIIGILAAISIPSWLAFV DTQRLNTAQNEVYLAIRQAQSQAIKNKLTWQVSFREQNNIVQWTVHQAEVGVFIPNAI SNNNTLWHNLDQNIHLDKNKYETTLPKQTTKQEWRIMFNSQGCPVYQVADECTQTSFQ TLGQITLFSINNSKAKRCVYISTILGAMRTGKEHDEPDGNKYCY" gene 7947..8588 /locus_tag="DP116_06605" CDS 7947..8588 /locus_tag="DP116_06605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319132.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06605" /translation="MLNLPETAKQHLIIFTRYPEPGKTKTRLIPALGTEGAANLQRQM TEHTLSQVKQLQKTSVISFEVRFAGGNLQLMQEWLGYDLVYQPQGEGDLGLRMTQSFL NAFQSGAEKVLTIGTDCPGVNNQILAKAFAQLQQSEVVLGPAVDGGYYLIGLQRPMPE LFINIDWGTSQVLHQTITIAQMLNLSVTNLPHLADIDRPEDLPIWEEILRLTH" gene 8922..9758 /locus_tag="DP116_06610" CDS 8922..9758 /locus_tag="DP116_06610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869562.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06610" /translation="MNNRFAAAIAARTLKVIGIILILSFLIDFVIISLDFSPTEKLLQ LRWAASLVDRGVVPLVGLGMLFTGYWIDSFDDGTQPQPIDLKMPALIISSILGLLFLI IAPVHAMNIIHQRTQAVDQITKNAQLAENQLNTQLNQVQAQLGNDQVKAAVEKQKAQV KAQYTELVQDEQRYKQALNNPNIPPATKDLLQKFKANPQELDKFIAQQSDPKQLASQR LGQIRAQRDELIKQAEDNWKPGLRIVIGSLLLSIAYIIIGWSGLKGMNALQGGKRKIP AR" gene 9918..10799 /locus_tag="DP116_06615" CDS 9918..10799 /locus_tag="DP116_06615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141332.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_06615" /translation="MFSIYILTYNEEIDIAACIESAMLSDDIIVVDSCSSDRTVEIAS RYPVRVVQHAFESHGRQRTWMLENISPKYEWVYILEADERMTPELFAECQKASRNPDY IGYYAAERVMFMNSWIRRSTQYPRYQLRLFRHGKVWFTDYGHTEREVCDGSTSFLKET YPHYTSGKGFSRWIDKHNRYSTDEARETLHQLQNGTVSWKDLFFGKSEVERRRALKNL SLRLPARPLIRFLYMYFILGGCLDGRAGFTWCTLQAFYEYLILLKVWEMKHIPTPKLD AEVSENQATQLVSTSAD" gene complement(10999..12798) /locus_tag="DP116_06620" CDS complement(10999..12798) /locus_tag="DP116_06620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872680.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3685 domain-containing protein" /protein_id="PRJNA477356:DP116_06620" /translation="MSDRPLKLLLIDQDPIFRLGLKVALEEFSNLQVVSEAETDTAGL QILAKLAQEDPNQVNLVVLELGNSRSRSRQQLGLQLCRHLKTQYPNLPLLLLSSVQEQ GLLLAAKAAGVDGYCPKGTPVSELVTIMHDVVAGGSSWDTEMGKENMTIYRREDVGYA LENPTSSELPFARLRKHLRLSGIEYINVTLSEVTAQLQVPGLPLLDRAVLAGQRRELL AARWLLNRLLASPLERRQEVESHSRNVSTRNHFASAQLQSPTNSLPSTSVSKSDSAQK LLSPRSLQAALFASCITKLQLPLQNVTATPLEIDILREDKKRDLLYLIIQKVADALDE MRSHVVEISQLYELKNTVLVDVWQSALTNFFGKFYRVRVDNHNLEIVAFILQDAAVVR SEILNKIPLVEELFSYLLFQRNLQIDNTSYPAGSSEAKDYAEMILENLLIQVANGVIQ PLLNSLADIEEIKQSFYDRQLISTREIERFRNSLSWRYRLRNYVNEPKAIFESRYELF VFAPRGIATTSIYAPRGQELTQLSGIGLVVTLAIEFRDAIAPRIQSLFSLFGSGVVFV LTQVIGRGIGLIGRGILQGIGSVSLTERKNKKL" gene complement(13019..13468) /locus_tag="DP116_06625" CDS complement(13019..13468) /locus_tag="DP116_06625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314448.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional repressor" /protein_id="PRJNA477356:DP116_06625" /translation="MQKQTISTKPIRSLEDALEQCQVLGMRVSRQRRFILELLWEARE HLSAREIYDRLNHEGKEIGHTSVYQNLEALSSQGIIECIERCDGRLYGNISDSHSHVN CLDTNQILDVHVELPQEFIRQIEEQTGVRITEYSINFYGYRNSQESK" gene 14438..14941 /locus_tag="DP116_06630" CDS 14438..14941 /locus_tag="DP116_06630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872678.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1817 domain-containing protein" /protein_id="PRJNA477356:DP116_06630" /translation="MTITIPLNTDCINSLDLSPVVTEIEKLLQEGAMLQQGAAQSAIA SYEQQLHFDIDYALEPGDPRELSEIPEVRLWFIRLDTRYPWLPFLLDWKTGEFARYAA MLVPHQFSTKEGIQYNPEALEIFLMHKLFALNDWLQQQGIPSKSRLQSMAQLLGYELD DALFEMF" gene complement(15620..16387) /locus_tag="DP116_06635" CDS complement(15620..16387) /locus_tag="DP116_06635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872677.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06635" /translation="MAIRLHGFMSSGKRYIQVENQPHHITGIFRTLTHFSKNMHPCTL KDAENAYFRYEEDGTITFYEAENSEVCDSVGIWTYLVYECPEGEEKVFLDPSIDTNVN SLKQLFAGYKIVQVTVDIRDYLKYQYIQDEYLDVQLPCDWNTSVGRKIANLLLEEFKA FKSSTIFTERAGQEYRKTVLDGFIKAAQEVLENGGTVRDFESAQYDVLRKIRIDDMAN LILEYNDYRIWQAALPSKSKAVEYAFSTALRLICRIK" gene complement(17030..17611) /locus_tag="DP116_06640" CDS complement(17030..17611) /locus_tag="DP116_06640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314470.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="D,D-heptose 1,7-bisphosphate phosphatase" /protein_id="PRJNA477356:DP116_06640" /translation="MGKPAVFLDRDGVLNVEAGYIHSVEDLHLIPGVAKSLRQLNDRG IFCCLVSNQSGPARGYYPDSHVQALHQRLCRLLAQEAGAKLDALYYCPYLSPPEGGLD PAYTRWSTWRKPNTGMLVAAAWEHDLDLKHSFMVGDKATDVDMAHNAGCVGILVETGF GDRVLAGDYQHHTKPDYIAKNLAVAVEWILQQL" gene complement(17604..18563) /locus_tag="DP116_06645" CDS complement(17604..18563) /locus_tag="DP116_06645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipopolysaccharide heptosyltransferase family protein" /protein_id="PRJNA477356:DP116_06645" /translation="MRIVALVPGGIGDQILFFPTLDDLKHYYPNAQVDVVVEPRSKTA YRVSKSVHDVLAFDYKDRNSMADWGNLVGTIRDREYDVAIALGQSALVGVFLWLTGIP TRIGYKSKGSVFLTNSVPLKTEQYAACMYHDLLQGLGINSPCPELAVNVPVADIDWAN KEQQRLGIKETGYILIHGGSSALAKTKGLDKVYPVANWQQIIQDFQQKQPEMPVVVIQ GPEDEEFVRSLKQSVPNIKISSPDDIGKLTAMIAGANLMLCTDSAPMHLSVAVQTYTI ALFGPTDPAKLLPNSDKFLAIKSSTGKMADISPKTVLEKIWGG" gene 18726..19415 /locus_tag="DP116_06650" CDS 18726..19415 /locus_tag="DP116_06650" /EC_number="2.7.7.60" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872675.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase" /protein_id="PRJNA477356:DP116_06650" /translation="MHLLIPAAGSGRRMGSDRNKLLLVVRSKPIIAWTLLAAEAASQI SWIGIISQPLDWQDFQTILTQLKQSSPVELIPGGSTRQESVYNGLQALPDSAKEVLIH DGARCLATPDLFNSCAQAIQHCPGLIAAVPVKDTIKVVDENGIIQSTPDRRQLWAAQT PQGFDVKLLKQCHTEGVRQGWEVTDDAALFEKCGFPVRVVEGEETNLKLTTPQDLAIA EFILKTRLGEQ" gene 19669..20223 /gene="scpB" /locus_tag="DP116_06655" CDS 19669..20223 /gene="scpB" /locus_tag="DP116_06655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859812.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SMC-Scp complex subunit ScpB" /protein_id="PRJNA477356:DP116_06655" /translation="MNAATKIEAILYLKGKPLSISEITEYAACDRATAQEGIIELIDE YARRDSALEVVETPNGYSLQLRSDFHDLVQTLIPVELGVGALRTLAAIALNSPILQSD LINVRGSGAYQHVQELVELGFVRKRRDSESRSYSLQTTSKFHQYFQIDQLPASFSNGQ EQKQLELELTTAESNSSSGTVEQS" gene 20307..20654 /locus_tag="DP116_06660" CDS 20307..20654 /locus_tag="DP116_06660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867028.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06660" /translation="MVFDPNFLNDYPEEHPNQLISDSFEEHPNHLLKYLQHQSPEVLA RVAQSVSPEIKQIISQNVQGLVGMLPAENFNVQITTDRDNLAGLLASAMMTGYFLRQM EQRMQLDHLSNNH" gene complement(20706..21923) /locus_tag="DP116_06665" CDS complement(20706..21923) /locus_tag="DP116_06665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314465.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_06665" /translation="MIMKLSLSLKQLVIYLSLLTIGGGAGLFGSRYFVPQNSWFRELR NVTASSSSENPVPSPVGGQTGTTIGDNMNFIATAVQRTGPAVVRINATRKVANPISDA LKNPLLRRFFGEEEQPFPRERIERGTGSGFILSENGRILTNAHVVADTDTVLVTLKDG RTFDGTVVGVDSVTDVAVVKISASDLPTVKIGNSQNLIPGQWAIAIGNPLGLDNTVTI GIISATDRTSAQVGVPDKRVGFIQTDAAINPGNSGGPLLNAQGDVIGVNTAIRADAQG LGFAIPIETAARIANELFTKGRVQHPFLGIEMSDLSPAKKQQINQDKNLNIKQNVGVA ITGVLEKSPAQRAGLLPEDMIQKVNGKPVKTSAQVQKLVESSTVGKILEIEVNRNGEF QTFQVQLGTYPQK" gene 22276..23160 /locus_tag="DP116_06670" CDS 22276..23160 /locus_tag="DP116_06670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195365.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase" /protein_id="PRJNA477356:DP116_06670" /translation="MTFCYRLLFLCISIVTSTLGLMHSSPYSVLAQTSVTGCQSSALE RFGRHKIAPGETVESIAQRYDLTPATIIAMNPTLRNNKVTIGREIQIPPYNGIVVEVP PGQNWRQIAAKYKIRPDVLFEVNGCQKNSRFVFVPEVKRSPNRPITESAASNSTPTKL AGNPLAEVATVALPYGWQTNPTDGKVFFHSGVDLLAAKGTSVQAIGDGTVAFASEQGT YGNLVIINHSGGLQSRYAHLENIKVSVGQQVNKGDIVGTVGTTGTPTTNQPHLHFEVR SSSSLGWAAQDPRGYLQQ" gene complement(23279..24409) /locus_tag="DP116_06675" CDS complement(23279..24409) /locus_tag="DP116_06675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867031.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="isochorismatase" /protein_id="PRJNA477356:DP116_06675" /translation="MNTPTKTQLPIPPHFNPEEVGYVWRVPYQQRAKEAREWAKKYNI KPSSEDKTRICLLLIDVQNTFCIPEFELFVGGKSGTGAVDDNIRLCEFIYGNLGVITK MIPTMDTHTAMQIFHPIFWINTAGEHPTPSATSITPADIEKGVWKVNPRVARQVLQRG EPQRQVPLSGNPPAGLAPQRTGSPSLGYNYEFLEKHAYHYVKQLTQDGKYPLTVWPYH SMLGGIGHALVSAVEEAVFFHCIARQSQTQFEIKGNHPLTENYSVLRPEVLESFDQRQ IVQKNTRLIQELLEFDAVIIAGQAKSHCVAWTVDDLLTEIQQIDSRLAKKVYLLEDCT SPVVVPGVVDYAEAADAAFERFAAAGMHLVQSTESILSWFKE" gene complement(24524..25966) /locus_tag="DP116_06680" CDS complement(24524..25966) /locus_tag="DP116_06680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112067.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06680" /translation="MKTKETIFVIPTYRLRDVAETIEKYDDNFWVNGHAPKIIIFDDS SVANYEKYYQLLEQTKTVNDVFYVGPREKEQFINFLNQRLRDKKLESLVRNLFRPSYG GNRNFTLMYTLGHLMVSSDDDMRPDALIENSPESLLADEICRGKLFKSKQDGFVHRSY DLLTCFEDVLGQKVNMIPENFEKGELVVDTAMELETNTTKGFFKENSLFLQRSKVSNS AVVKIAQTFRTGTHDIDTLDFIHMYLNDENQISLDELNDIYVLVNFRPVVTNKNWRMD CGVAGYDNQFGLPPFFPTRLRFEDYIYRLWIQQEGIVAAHVDAAQNHIRNNYMRNPIA SEIFNEEICNLLKKKIKNGIYELEDLTIKFDYSGEVTSQDSEEILERVGDIYNQVVKA SISTQNEERRQSLQFFADNLSRVFYGFEPDFFQQNVSRIVDDVISQFQASLEIWPTLV EICYFQKDKKDLPQTRVRNKKLKSSNGNRF" gene 27204..28430 /locus_tag="DP116_06685" CDS 27204..28430 /locus_tag="DP116_06685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214161.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase" /protein_id="PRJNA477356:DP116_06685" /translation="MQTLPTLTTSNTVNLQPTFDTTIKRRKTRPVKVGDVTIGGGYPV VVQSMINEDTLDINGSVAAIRRLHEIGCEIVRVTVPSMAHAKALAEIKQKLIKTYQDV PIVADVHHNGMKIALEVAKHIEKVRINPGLYVFEKPNPNRTEYTKAEFDEIGEKVRET LAPLVISLRDQGKAMRIGVNHGSLGERMLFTYGDTPEGMVQSAIEFLRICESLDYHNL VISMKASRVPVMIAAYRLMAQRMDELGMDYPLHLGVTEAGDGEYGRIKSTAGIATLLA DGIGDTIRVSLTESPEKEIPVCYSILQALGLRKTMVEYVACPSCGRTLFNLEEVLHKV REATKHLTGLDIAVMGCIVNGPGEMADADYGYVGKTPGYISLYRGREEIKKVPEDKGV EELINLIKIDGRWINP" gene complement(28331..28546) /locus_tag="DP116_06690" CDS complement(28331..28546) /locus_tag="DP116_06690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209635.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06690" /translation="MLNLTQGFAVKTEHIFLFHKNALSSGDYLCRRYKNCRKIYGFIQ RPSILIKLINSSTPLSSGTFLISSLPR" gene 28585..29871 /locus_tag="DP116_06695" CDS 28585..29871 /locus_tag="DP116_06695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S41" /protein_id="PRJNA477356:DP116_06695" /translation="MVITRSGLVLGATAVTLTTIAVTSLGIHSQGQALFKESPKELID EVWQVINRQYVDGTFNKLDWQAVRREYLNKPYSDKQQAYKSIREMLKKLGDPYTRFMD PEEFKNMQVDTSGELTGIGIQIGLDEKTKKLTVIAPIEDTPAAKAGVLAKDIITKING KSTEGMDTNQAVSLIRGEAGTTVNLTVLRSGQEKQFNIARAKIEIHPVEYSQKQTPAG NLGYIRLKQFSANAGKEMQQAIRNLESKQVAGYVLDLRNNPGGLLFSSVEIARMWINN GTIVSTKDRLSEVEREVANGRALTNKPLVVIVDKGSASASEILSGALQDNKRAVIVGS QTFGKGLVQSVRPLDDGSGLAVTIAKYYTPNNRDINKHGIDPDVKVDLTTAQRERLWL KERDKVATLQDPQFAKAVEVIGKEIAQKTNNRAEKN" gene complement(29917..30105) /locus_tag="DP116_06700" CDS complement(29917..30105) /locus_tag="DP116_06700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314452.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DDE transposase family protein" /protein_id="PRJNA477356:DP116_06700" /translation="MSDTQTWYIVKHSAGHCEIIPSDEATEESSPEIIEQWGPFSSQE EAIARRVGLIRSGKCQPV" gene 30327..32669 /locus_tag="DP116_06705" CDS 30327..32669 /locus_tag="DP116_06705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015190196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-glucosidase" /protein_id="PRJNA477356:DP116_06705" /translation="MPQYFGQLHTTEPAWSILEGVQAIQQSDRHILFKCGDPCLTISV LAPNLIRVRMTPTSEFLPRRSWAVAQADEEWPTVPFEVREKAEAIEIETEQLRLVVSR NPCRIQCFDKSGQPFAHDADPGMGWRTGAIAGWKQIETDEHFYGFGEPTGLLDQRSKV KTNWTSDAIDYGILTDSMYQAIPFLIALRPGLGYGLFFNTTFWSRFDLGAEQPGVWRM ETQGGELDYYIIYGPEPAKIIETYTQLTGRMPLPPKWSLGYHQCRWSYESQDIVRKLA DEFRQRRIPCDVIHLDIDYMSGYRVFTWSQKRFANPKELIDNLKQDGFKVTTIVDPGV KYEPEADYKVFDEGLKNDYFIRKTNGQLFHGYVWPDKAVFADFLRPEVRDWWGSLLNS LTDVGVAGIWNDMNEPTLDDRPFGDPGKKMAFPLDAAQGPTDERTTHTETHNLYGQMM AQASYQGLEKSRPTERSFFLTRSGYAGIQRWSAVWTGDNQSLWEHLEMSLPMMCNLGL SGVAFVGSDIGGFAGNATAELFARWMQVGMLYPLMRGHSALTTAQHEPWVFGDRVEKI CREYIELRYQLLPYIYTLFWKAATTGSPILRPLLYDFPNDPKTFTLCDQVMLGPSLLA APIYRPGVEHRAVYLPEGCWYDWWSGETFQGPIHILAHAPLERMPLYVRAGSIIPMAP VMQYVDERPLDQMRLRIWMGTGEFTLYEDDGHTFEHKTGAFCTTTYQVCLQGQRTIVE IGGGEGNFSPATREVIVELVGVGEQSFVDDGAARQLTFEI" gene 33155..33673 /locus_tag="DP116_06710" CDS 33155..33673 /locus_tag="DP116_06710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012507873.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06710" /translation="MLGWIYMILAILLEVAGTTCMKFSEGFTKVWPSIFIFVFYALCF SILTLALKTIEISLAYAIWSGLGTVLIVSIGILWFQESVNIVKILSIVLILIGVIGLH ISHEPVSEEEGILSSVATSVDQLETTQPSKTQDILPPVSDPALIMPESVEYPESEKVP IIKVLSKNHAED" BASE COUNT 10046 a 7036 c 6916 g 9754 t ORIGIN 1 gcactgtcta acaaggttca actttgtcta ggtttgtcca agtgttgtag aacggttcgt 61 gaagcaacca cattgacgag agagttttgc acttctcgtc agaagacagt aaaaatccac 121 ctacacccat cgtaggtggg ttttgcttat acgatggttt aaaaaatgct tcttttattt 181 aaccgtgcaa tgtgcctcgt agctcgtatt cagtcccttt cctaatgagt ttctgcgtcg 241 ctgggagggc tttaagtcct aaataatgac ttttgcttct aagcatttgg tcaatttcat 301 cacgctttgc taaatactgc tgttcaagtt cgataaatgg ctctaaagcc tgtctgatct 361 cattggtatt ttgtagtaat tgcagatgct gatttaaatc ttgtatgcgc ttttgtagcg 421 gaggtatctt tatttctaca tctacaagca ttgcttgata tttcgataaa ttgaaattta 481 gtaatgatat agatttctct agttccagac actttcgggc gcgttcatcg atatcaccaa 541 atatatcaaa ttcaactttg cggagttttt cttttctttc actaatatcc gctcgtatac 601 ttgctatact cctttcaatt tggcttttct gggcttctag ttcattaagt gtttcgtgca 661 tgttttctaa attcataacc gtgtgttcct gattttctat ttgatacatc tctttttcga 721 ctagaaataa catagatatt ttctaactcc taagcatgga aaatttcaca ttagatatgt 781 gatgacggta caagagcact ataaaggtag tgattgattc tttattctgt tatttcatgc 841 tcactggtaa tttttacata ttactgtgat acattttttg cttatctata gcgcttgtgg 901 tgttgcgggc aatacacaac tgtagagacg ttacatttac gtgctctaca ttattcatgg 961 tggtgtatgt gattcaaatg agaaccgcta tattccacac ttgaattttg atgatggact 1021 ttacttttta aaaattttca gcttctgctg attatactca gatttagaat tccctgattg 1081 gtaactcaga tagtggcagt gccgggattt taaaccacca cttttaaatt tgagccttag 1141 tagcatattt tcccctgcac ttgtatgggt attctaattc tggattgata gtaaatcgtg 1201 taagtcgtcc ttaagcactg gggtttatcc gtacccagtg cttttttatt tctccaattc 1261 taattatatt ttcctgggta ttctaatttt gagtaagagt tgttgccttc aagcactgga 1321 gaatagggat actccagtgc ttttctaatt ttttaggact taggcacgag ttacgaaaga 1381 acaagactct gagattgctt ccttacgtcg 
caacgggtgc aactacgtta tttttgcgta 1441 agtcctgttt ttgtatctta tctatgcttt tgctcataag ccgtccattg acatagccta 1501 tttattcagc ttcattccat tttgagcagt gtttccaaca tcttctgtta ccaggaactt 1561 gtctcaattg caggcttctt gagacatcgt ataagtaatc atttttgcga gattgatatg 1621 ttttctctaa aagtgggtaa tataagctca ctttcatagc attttcacga agttccctat 1681 aaagctattt ttgatctgta gatgcacaca gataaattat ccgtgtccat ttgcggtttc 1741 atgataacaa aacttggtca ggaagcactc atgtttcact atcgtcaaaa attaaaaaca 1801 atcaataaaa catttgtacc tataattaaa aagaaaatta ttcggctatt acgggcttta 1861 cttgtcaatc agaaaaaacg acgaacaagt gtgaatgctg gttttgtgtt accgacagtg 1921 gcaatggtag cattggtggt tgtgctgatg acaactgcta ttttatttcg gtcttttgaa 1981 agggcaaaaa atgctagtaa tgtccgagtg aatgaggctg tcctgaatgc agctaaccct 2041 gctattgaaa gagcaaaagc aaaaatagaa cagttatttg gagactctag gttagagtcc 2101 tcaattcctt cagacttttc actagaagaa attatcaata agaaccttaa ccaatttact 2161 tttggtgatg agacacagtt aaaacttgtt aaagataaag aaatacaaac tgtttggaaa 2221 tatcctgtag atacggacaa taacggcaag tttgatagtt acactctcta tggtatttac 2281 tttcggactc caactagtaa taaagccaca agtgtcttac aagcaagaac acaaccgatg 2341 gacacaggta ctgttggtac tgaatgtcaa agcctagcta ccactagcgc gaacttagta 2401 agtacttatg gttggtataa agtcggtgaa aagctaaaaa aaagtatttt tgtttatact 2461 acgactgtac ctattacaga tttaacaaga ttaaatacta ataaatatga aacatttaaa 2521 ggtaataaag gttttgtcgc tctcgaatat cagcaagacc gagaaagaat accactcagt 2581 aacaatgccg ttgtctatga agatgattta gaaattgctc cggaagaagg tatcagtttg 2641 aatgggcgca ttttgactaa tggtaatttg ctaacaacaa gactcgatca gcctatcaag 2701 ttttatttag tcagtagtcc gaaatcgtgc tatttcaaag agggaaatag caaaattatt 2761 gtcgctggaa atgttataga tagtagaggt acagataatt ccagtcatga tgatgtgcag 2821 gtagacttgt ttgagcaaca agatgcacct aacaacacaa tgaaaagtga cgttatcaac 2881 aacacaaata aaactgtacc tacatctgtt tacggaaaca cggcagctta taatgatgaa 2941 gcatatgcaa aacgaattga gcgattagta caagcaacaa atagtgcata tccaaataac 3001 gcagaacttc cagatgaagt gaaacagaaa atagaacgat atttagcaga ggactttact 3061 ttagaccctg ttaaagttcg tgaaaaaaag ctaaaaactt 
actttaaaaa aagaacacgt 3121 cgcgttccat atgccgaagt tgcacagggt agtaatccac ttaaatatgg aagccacgac 3181 tatgaaacaa atagtcctct tcaaggaaat gggaactctt taagaccagt ggatgaatgg 3241 gtatttccct tcaatccggc tgatggaaaa actgctacta attatgccga aatagggatt 3301 aaagataacg gtagtaaact ttatctacct gcaacagaac cagtagaaca agcaaaagca 3361 gataaagagc aaaaaatagg cgatcgcctt ttagttggca ataacctacc tcaattatgg 3421 tacgacacaa ccaaaagcaa atttctcagt tcaccagacg aaggacagac tattactggt 3481 aaggaatggg atgttgatag gaaaggaaat aatagcactg tgactcgtaa gcgcttctcc 3541 caagcatatc acttcgagga tttgggagct acagatcgag acgagttttg ggaaaagtca 3601 gcagcacaaa aaccacaaag ttctctggat gtcgttgggg gtttgcgagt tgtcacgggc 3661 gcaggaattt atgttgacgg tcccagttca gattcaactg cttcctatcc gtgggatgct 3721 gatgttgatc ctatgactcc ggatattcag cccaggtctt tcttgagtgc gtcacgtcct 3781 aattgggaca caaactttgt agatccaagt aatgctgatc caaacagaat taaggttagc 3841 agtctacagt tcaaaggtaa agacccaatc attgtctggt ctgattccat gcctatgaca 3901 gatagtaatc tagtagcgaa tcctaaagtc aaaggtgact tgctcatgag aactacggct 3961 gtttacttct ataaagatcc ttctgacaaa gataattctg gaaaagatca gatgcctata 4021 gcttgtgtaa gcagcttcta cgatcctaca aatgccatca cagcgcaaaa ccagaacaca 4081 ttattatcta ctgcgaaaaa tcccgatgcc gatctaacaa caatctctgg taaatccaat 4141 aatggtgttg tctatcctcc ctattctgga aatagagcaa gtgccatatc tacttacaac 4201 agtcaactag aacaacaggc aagattagta tttcctaatg gacgttttgt caatgaacct 4261 ttaagaaacg ctttagaaca ctatagcgca ggtaaacctc tatcaatggc tgataattcg 4321 gcaattgata cagctgtttg tgccattcaa attcttaatg ataccttgag acccaactct 4381 agtccacctg tacctcatag agcaatctac gaaacgacat ttttggatgc tcgacaaatt 4441 aaagcaattg aaaacacagc taacgtatct accctaacat tacaggaata caccgagcgt 4501 cagcctttag aagttcgggt tacggtctta gatttagata aactgcgtac gacaaagatt 4561 ggtactagta ccccacaaga gtatctaata ccaaacagtg gaattattta cgctactcgt 4621 gatgatgcac tcgaagacaa aagtaatcct ggaagtagag atgtcagtgc tactgacttc 4681 aagctagatc caactcggcg acctaatgcc atcatgctta ttaatgggag taacttaagc 4741 cgtcacacta cttacaatcc aaaaaaacca gaagaaaatg agaagggtct 
aattttggta 4801 tccaatctgc cagtttacat caaagggaat ttcaatgttc acacacaaga agagttttta 4861 gacaattcct tgaaagggga gaaggattgg agtaacaaat tctatgcacg tcaatcaccg 4921 aatcctaact tcggctgtcg cccaagtcag tttcctgatt gtaacattgg tgaaacttgg 4981 cgacctgcag tagttattgc tgatgcaatt acagttttat ctgataattt ccgcttcggt 5041 tttcgtgaag agaatgacta tgattcgatg caaaccgcca caaatacaga aacaaatctt 5101 atattcgctc aaggtaacac gccagaacgc ccaacagaaa gcaatggtgg tttggaaaat 5161 tttgtgcgtt acttagaacg ctgggaaggc gtaagccaca cagtagccgg ttcttttatc 5221 caatttaaac acagtaacta tgctactgct ccttggcaaa cagttattaa tggaagcagt 5281 ggttccaatc aaaatcaaaa tcgtacaccc ttctttactc cgcccaaccg tttctggagt 5341 tatgatacag ccttgctgag tcagtcacct gacttgttct ctctacactt tacgacaccc 5401 gcaacaaaca agccgaatga gttttacaga gaagtcggac gagacgatgc ttgggtaaaa 5461 actttgttat gtgctcaaga aattaatggt agctacgcca ttagccaaga tcaacgtggt 5521 acttgtcagt gaacagtgaa cagtgaacag tgaagaaaag aaacgaatcc gtatttctct 5581 ttgataactg gtacctggta cctggtaact gataactgat aactgataac tgttaatatt 5641 atgattcacc aaaaacagca gcaagaatct ttatctagtc aatcaggttt taccatcatt 5701 gagtctttag tagcagtcgt tgttgttggg attttactag cggcgatcgc acctgtcatt 5761 atcctatctg tcgcaacacg gttacaagca aagcgtattg agttagcaac caacgctgct 5821 caaacctata ttgatggcat taaatcaagc acaattccat ccccatccat cataacaaaa 5881 agcacagaag acagtaccga tccacctccg ccagatgcac cctcaggaac cctgatttgt 5941 cctacaactg gaactggctt atgcgaaatc tctcctacaa ccgcatcatc tcagttatat 6001 tgcgtggatg gagatacggg aggttgtaca agtgataatt tcaaagacat gattgttcag 6061 ggatttgggt ataatccaac ttccgataaa cccgaggatg gctatagatt ggggttacga 6121 gtatacagag cagacgcttt caataagcct agcatcacat tgaaggcatt gaaagaccca 6181 ggaattaaac aagcagaaac tttcactagc gggacagctt taatagccat ccaagcaccc 6241 ttagtagaaa tgacaacaga aatatcaagt aaagtaacaa catttagtga tttctgtcgg 6301 cgtctaaaac ctccagcttc tgattctaat cctcagtcta actgctaaac ctcaatatta 6361 tgaatccact caaattgctt tttattaagc aaataacact ctttaggctg aaacaaaaat 6421 gtgatggttt taccctagtt gagttgctcg taggtattgt catcgcgact cttgttatca 
6481 cacctttgtt gggattcatg attaacgtta tgacaactga gcgacaggag caagcaaaag 6541 caaacactga gcaagaaata aaagctgcac ttgattatat tgcgcgagac ttgcagcagt 6601 cagtctatat atatgatgct gatggcatta ataaaattag gcagcaacta cgaaggtatg 6661 gtgataaaaa tacatttttc cctgttcttg ttttctggaa gcgaacattt ctttcaaaag 6721 aaagttctac tatattgaaa gataatacct tcttctattc tttagttgcc tattatttga 6781 ttacagagga taacgcgaca tggtctaaag cagctcgaat cggtagattt caaataagag 6841 atggctatga ttcttctacc gaaactgata ataaagacaa tccgcgagac aaagctaagc 6901 cagacgaagg ttttcagatg tttaatctgc aaggcttagg caatcttaaa agtaagatga 6961 atcaatggac aaaaaagagc gaagattata cacaaaaaat tgtaccttta gttgattaca 7021 ttgatcaaac gataataaat aatactacca atccagcacc tcctacttgc acaataggtc 7081 aacaagttcc aaaatttgat gaaaagcacg ataacgatga tgctgttgcc actggaaatg 7141 ttagaacacg aggcttttac gtttgtgtgg attcagaaaa taccgtagca gaagtttatt 7201 tacgtggtaa tgccctcgct cgtcttcaaa ataataatat agatttcaat gaaagtcgtc 7261 aaacatattt tcctcaggca agtatacgag tccaaggtaa tggattctta tctgctaaat 7321 aagtttaata aatattctaa tagtggcttt acactactag aacttttagt aagtcttctc 7381 ataattggta tcttagctgc catatcaatt ccaagttggc tagcttttgt cgatactcag 7441 cgcctcaaca ctgcccaaaa cgaagtttat cttgctatac gccaagctca aagccaagct 7501 atcaaaaaca aattaacttg gcaagttagt tttcgcgaac agaacaatat tgtgcaatgg 7561 acagttcatc aagcagaagt aggggtgttt attcctaatg ctatcagtaa caacaatact 7621 ctatggcata accttgacca aaatattcat cttgataaaa ataaatatga gacaactttg 7681 ccaaaacaaa ctacaaagca agagtggcga attatgttta actcccaagg ttgtccggtt 7741 taccaagttg cagatgagtg tactcaaaca tcattccaaa cattaggaca gataacttta 7801 tttagtatca ataatagtaa agctaagcgg tgcgtctata tttctacgat tttaggtgca 7861 atgcgaacgg gaaaagaaca cgatgaacct gacggaaata agtattgcta ttaatatatt 7921 tttaattgat ttcaataagg tatctcatgc taaatctacc agaaacagca aaacagcatc 7981 tcatcatttt cactcgctat ccagaaccag ggaagacaaa aactcgactg atacctgctt 8041 tgggaactga aggtgcagca aatcttcagc gtcaaatgac ggaacacaca ctatctcagg 8101 tgaaacagtt gcaaaagact tctgtcatat cttttgaagt gcggtttgca ggtggtaatt 8161 
tgcaacttat gcaagagtgg ctaggatatg acttggttta ccaacctcaa ggagagggag 8221 atttgggttt acggatgaca caatctttct tgaatgcctt tcaatcaggt gcagaaaaag 8281 tcctgactat cggtaccgac tgccctggtg tcaataacca gattttagca aaagcttttg 8341 cacaactcca gcaatccgag gttgtccttg gtcctgcggt agatggtggc tattatttaa 8401 ttggtttgca gcgtcctatg ccagaattat ttatcaatat agactgggga acctctcaag 8461 tgttgcatca gactataact attgctcaga tgcttaattt atcagtcact aacttacctc 8521 acttggctga tatcgatcgc ccagaggatt tgccaatttg ggaagagatt ctcagactta 8581 cgcactaaac tggtttgttg tactttcgtg gtgtagtgat ttatcccctt acacctactc 8641 atcacatcat aaaaatgtat cctacttaac ctctctaatg aaatatttac tacaggtatt 8701 aagcgtgtcg atgtttttta tggtattttt taacacaggg gtttttctga agacaaccta 8761 gaatcgagat gctaccgtgg tgtggctaac atcaaagggt aaaaagtata tcaccaaaaa 8821 aagaatctct caaacagaaa caataggtaa cgtcatagga agaatattcc agtaccttta 8881 tataaaatat tcacttcagt caactagaag acgaaaaact tatgaataac cgttttgctg 8941 ctgcgatcgc agcccgcaca ctcaaagtaa ttgggataat cttaatatta tcctttttga 9001 tagattttgt gattataagc cttgacttca gcccaacaga aaagctgttg caactcagat 9061 gggcagcaag tttggttgat cgaggagtcg tgccattggt aggattgggt atgctgttta 9121 ctggctattg gattgacagt tttgatgatg gcactcaacc ccagcccata gacttgaaaa 9181 tgccagctct tatcatatca agcatcttgg ggttactctt tttgattatt gctcctgtcc 9241 atgctatgaa tatcattcat caaagaactc aagcagtgga ccaaatcacc aagaatgcac 9301 aactagcaga aaaccaattg aatactcagc ttaaccaagt gcaagctcaa ttgggtaacg 9361 atcaagtcaa agcagctgta gaaaagcaga aagcccaagt caaagctcag tacactgaac 9421 tcgttcaaga tgagcagcgt tataagcaag cactgaataa ccccaatatt cccccagcta 9481 caaaagattt gctccagaag ttcaaagcaa atcctcaaga acttgacaaa ttcatagccc 9541 aacaaagtga tccaaagcaa cttgcaagtc aaagactagg ccaaattcgc gctcaacggg 9601 acgaactgat aaaacaagct gaagacaatt ggaagcccgg tttaagaatt gttataggca 9661 gtttgttatt gtctattgct tacattatta tcggctggtc aggattaaaa ggtatgaatg 9721 ctctccaagg tggtaaacgc aaaatacctg cacgctagtc cataattttg agggcagtat 9781 gaagcataag accttcttat tgccttatac tgtcattttt gcgaattatt gatgaaaatt 9841 atcttgcttg 
acttgcataa attcaccaga aaaaagtata attccccacc agagaagcca 9901 tatgattaac attggaaatg ttttcaattt acattttgac ttataacgaa gaaatagata 9961 tcgccgcttg tatcgagtcg gcgatgctat cggatgacat tattgttgta gactcatgca 10021 gtagcgatcg caccgtcgaa atcgccagcc gctatccagt tcgtgtggtt cagcacgctt 10081 ttgaaagcca cggtcgtcaa cgtacttgga tgctagaaaa tatatctccc aagtatgaat 10141 gggtttatat tctagaagct gacgaacgca tgacgccaga actgttcgcg gaatgccaaa 10201 aagcaagtcg taatccagat tacatcggct actatgccgc tgaacgtgtt atgttcatga 10261 attcttggat tcgccgcagt acccaatatc ctcgttacca actgcgcctt ttccgccacg 10321 gtaaagtctg gtttacagac tatggtcata cagaacgaga agtttgtgac ggttcaacaa 10381 gctttttgaa agaaacatac cctcattata cttctggcaa ggggttcagc cgctggattg 10441 ataaacacaa ccgttactcc acagatgaag ccagagaaac ccttcatcag ttacaaaacg 10501 gaacagttag ctggaaagat ttattttttg ggaagtcaga agtcgaacga cgccgcgcct 10561 taaaaaattt gtccttgcgc ttaccagcta gaccacttat acgttttttg tatatgtatt 10621 ttatcttagg tggctgctta gacggacgcg ctgggtttac ttggtgtaca ttgcaagctt 10681 tctacgaata cctaattctg ctaaaagttt gggaaatgaa gcatatccca acacccaagt 10741 tggatgcaga agtgtctgag aatcaagcta cgcagttagt atcaacttcc gctgattaga 10801 catagctcaa gttttgttaa agtaactata caaacaaagt ccgtttgtac agatttactc 10861 ttttgcaaaa gttaacttga tggaaattca accgcagatg aacacagata atttatctgc 10921 acggcaggtg ctacaacggg gggaaccccc gcaacgcact gcctcgtgca tctgcggttt 10981 gaaatactga ttgccaaatc ataacttttt attcttcctt tccgttaaag aaacactccc 11041 aattccttga agaataccac gaccaatcaa acctatacca cgaccaataa cttgcgtgag 11101 aacaaagaca acaccgcttc caaacaagga aaaaagtgat tgtatacgag gagcgatcgc 11161 atcacgaaat tctattgcta acgtgacaac tagcccaata ccagaaagtt gcgttaactc 11221 ttgtccacgc ggcgcataaa tggaagttgt agcaatacca cgaggtgcaa atacaaatag 11281 ctcataacgg ctttcaaaaa ttgccttagg ctcattcaca taattcctga ggcgatatct 11341 ccacgacaaa ctatttctaa atcgttctat ttctcgtgtt gaaattaatt gcctatcata 11401 aaaactttgc tttatttctt ctatatctgc taaagagttg agcaacggtt gtataacacc 11461 atttgctacc tgaatcaata aattctccag aatcatttct gcataatcct ttgcttcaga 
11521 gcttcctgct ggataggaag tattatcaat ttgcaaattt ctttgaaata gaagataaga 11581 aaataactcc tcaaccagag gaattttatt gagaatttca gaacgcacca cagcagcatc 11641 ttgcagaata aaagccacga tttctaaatt atgattgtct acccgcactc gataaaattt 11701 cccaaaaaag ttcgtaagtg ctgattgcca tacatcaact aagacagtat tttttaattc 11761 atataattgg cttatttcta ccacatgaga acgcatttca tctagtgcat cagcaacttt 11821 ttgtataatc aaataaagta aatcacgttt tttatcttca cgtaaaatat caatttctaa 11881 aggggtagct gtcacatttt gtaaaggaag ctgaagtttg gtaatacagg atgcaaataa 11941 tgccgcttgt aatgatctag gactcagcag cttctgtgct gaatcagatt tggaaacaga 12001 agtactcggg agagaatttg tcggagactg caactgtgca gatgcgaaat gattccttgt 12061 agaaacatta cgtgaatgcg actctacctc ctgtcgcctt tccagaggtg aagctaataa 12121 tcgattcaac agccaacgcg ctgctagcaa ttctcgtctt tgcccagcta acactgcccg 12181 atctaataat ggtaatccgg ggacttgtaa ttgtgctgtc acttcactca aagtgacgtt 12241 gatgtactca atccctgata agcgtagatg cttccgcaac cgagcaaaag gaagttcaga 12301 agacgtgggg ttttcaagag cataccccac atcctcccgt ctgtagatcg tcatgttctc 12361 ttttcccatc tcggtatccc aagatgagcc acctgccacc acatcgtgca taatggtgac 12421 tagttcagaa acaggagttc ctttgggaca gtaaccatcc accccagcag cttttgctgc 12481 tagaagtagt ccttgttctt ggacagaact caaaagtaat aatggcaggt ttgggtactg 12541 ggttttcaaa tggcgacaga gttgtaaacc cagctgctgg cgagatcggg agcgagaatt 12601 ccccaattcc aaaacgacta aattgacctg gtttggatct tcttgtgcaa gtttggctaa 12661 aatctgcaat ccagcagtgt ctgtttctgc ttctgatacc acctgtaaat tggagaattc 12721 ttccaaagct acttttagcc ctaatcggaa gatggggtcc tgatctatta acaatagttt 12781 taaagggcga tcgctcataa tccgcttaaa aaacgcaagc tttttttact ttagtcgtaa 12841 ataagcttga gaaattagcc tagtttcttt ggttggaact gtttcacaat gcactccctt 12901 ctggagtcat tgaaagtgca gatggtgaaa cggttgtaag cgatggcgat aactagagtg 12961 aatatggtaa tcagtaacag gtagtggttg cacttgggcg caactcacgc ttcaaccgtc 13021 atttgctctc ttgcgagttg cggtacccat aaaagttaat actatactca gtaatccgca 13081 ctcctgtttg ttcttcaatt tggcgaataa attcctgtgg tagttctaca tgaacatcca 13141 agatttgatt tgtatccaaa cagttgacat gactgtggga 
atcactaata ttaccgtata 13201 aacgtccatc acagcgttca atacattcaa tgatgccttg gctggataac gcttctaaat 13261 tttgataaac agaagtatgt ccaatctctt tgccttcgtg attcaggcga tcataaattt 13321 ctctggcaga aagatgctct cttgcttccc aaagtagttc taggataaag cgacgctgac 13381 gactaacgcg catacccagt acctgacact gctcaagagc atcttccagt gaacgaatag 13441 gttttgttga tatcgtttgc ttttgcatat tttaaagttt attaactgat gggttaattt 13501 tccctgggaa gtgtttattt gttattttat atatccaaac gttttgcacc ctgtcagctc 13561 cgctttgcca gtgtttttgc tattgacggg tacggaaatc acaggtgagg gtgaagagtt 13621 tttttgctgg ttgaccttag gtttctcacg ctctaaatat aaaatgaact cattacaact 13681 ttagcttaaa atggctgcaa acgtctactt tacaacgtta atagcagaaa gccctacgcc 13741 ttcggcgggg atgaatgcgt tgaggcttta gcctcagtca aaaagaactg agaacaaaag 13801 accagaattg tgcaacaata taaatagagc cagtcggaag gacagagata ggcagacaaa 13861 gcgcaaatag agcccaaaga gcttgcccct gaccgggttc tactaacaat acgcgctgca 13921 cggtggcgaa gtcgtgttga aacatagcca gaatttataa cgaaatcaaa ccagaaaaac 13981 aaaacaattc gttgattaaa aggtaccttg gggcgcaggg gaaccgggga aaaagggagg 14041 aacagttttt aaggcactct cgataaagta ccaaaagctt gtggagatga agccctctgg 14101 ggctatctaa gttaggctcg aaatttggtt ctattgaacg gatttttggt ttaagcattg 14161 ctcgcaagag atagatagtt ttaaggcgcg tcgatgaatc aagaatcccg tcggctttta 14221 gccgtgggag acgtcaatgc tagtgtcttg tcttgagtca atagtctatt cttatgccct 14281 tgttcaccca tccctcacag cttccaaccc gagacttgaa ctcttgccac ctttttctga 14341 tactgtttgc gatgactggt taatcgagtt taaaaatgac agtaatggtt gaacaagaac 14401 agagtagtat aaatcatcga ggctattttt tgactcgatg acaatcacta tcccactcaa 14461 caccgactgc attaactcct tggatctgtc ccccgtagtc acagagattg aaaaattgct 14521 gcaggagggg gcgatgctcc agcagggagc cgcgcaaagc gcgatcgcat cctacgagca 14581 gcaactccat tttgatatcg actatgcttt ggaaccaggc gatccacggg aactttcaga 14641 aattccagag gtgcggctct ggtttatccg tctagatact cgctatccct ggttaccatt 14701 tttacttgat tggaaaactg gcgaatttgc gcgttatgca gcaatgcttg tcccgcacca 14761 gtttagtacc aaagaaggta ttcagtataa tcccgaagct ttagaaattt ttttaatgca 14821 caagttgttt gctttaaatg 
attggttaca acagcagggc attcctagta aatctcgcct 14881 ccagtcgatg gcacaacttc taggttatga gttagatgat gccctgtttg agatgtttta 14941 gttattagtg actaagttta gtatagcaat cctaaatcat aagccctacg ggcatgcact 15001 cgcgtatgcc tatggctatc gcctccggcg tgcgcttgcg cttacgtgaa caacaagatt 15061 ctcgacttcg ccaaaagttg tcgggaattt gaatgttcca attttcacaa atcaaatagg 15121 attactaagt tactagtaac tataagtcct tagggagatg caaaaaataa aatgcccagc 15181 cgtttcatat gtattttagt aaaaccgatg attttgctac ggttttttga ggatcggtga 15241 accacgacgg ctgggtattc tattacttgt aagtgcttta gctcaaaact aagtaagtgg 15301 acagaaataa tcgtaaccaa ggcaattgcg tttgtgtacc atgctacgaa catacggaac 15361 gtagtcagcc agtgctgcac caacgtctcc ctcgtgctgg atttggcgct cataggaaag 15421 cgcagatccg aaggggaaac ctgtagtttg ttcgctctgt gggcgtgtgc tcttttttcg 15481 ttgggcagac tcaatcaatc gcaaacctct gcgattggtt cgctatacct atggctcacg 15541 ccatgactag cgcctgtacc gtgcggtctg gtgcaggaga tacactcgca acgcacatga 15601 aacacttttg tccacttagt tacttaatac ggcatataag tcttaatgct gtactaaaag 15661 catattcaac tgcttttgat ttacttggta atgccgcttg ccagatgcga taatcgttat 15721 actcaagaat taaatttgcc atgtcatcaa ttctaatctt ccttagcaca tcatattgag 15781 ccgattcaaa gtctctaacg gttccaccgt tttccaaaac ttcttgtgca gctttgataa 15841 aaccgtctaa aacagttttt ctgtattctt gaccagcacg ttcagtaaag atagtagatg 15901 atttaaaagc cttaaattct tctaaaagaa ggttggcaat ttttctacca actgatgtgt 15961 tccaatcaca cggtagttga acatctagat attcatcttg aatgtattga tacttgagat 16021 aatcccttat gtctacagtt acttgaacaa ttttgtaacc agcaaacaat tgtttcaagg 16081 aattgacatt tgtatcaata gatggatcaa ggaaaacttt ttcttcacct tctggacatt 16141 catagacaag atatgtccaa attcctactg aatcacagac ttcagagttt tcagcttcat 16201 agaaagtaat tgttccatct tcctcatacc taaaataggc attttccgca tcttttagcg 16261 tacagggatg catatttttt gaaaagtgcg tgagcgttct gaaaatccca gtgatgtggt 16321 gaggttggtt ttctacttgg atgtaacgtt ttccgctgct catgaagcca tgtaatcgaa 16381 tcgccatctg tttttccttg tgtatttcgt cacattcttg ttcgatgcac cagcctgaca 16441 gcaagaggga aattaaggaa cacataatat ggcgcgctag atctatcggt aagctattca 16501 
cacctgataa caatggcaac aatatagttg ccatgcagcc gagaatcctg agcttgctgc 16561 actaaggtca ctattctttg ttctaacgta aatttttaat tccagttcac tagtttcgta 16621 acaagtttaa aatcttgtta tttttttcat tttctttgtc attatgaagt tattgataaa 16681 ataactcagt gttaccttca taaattaggc tataagctat acccaaccta aacttccaac 16741 ttatggccaa taatgacaaa acacgatgaa tgtgtagtag attacccaaa aaatttgtta 16801 tatcaaaaat ctaccgaaaa tattttattt ctactagcct tttgcaattt ttttttacaa 16861 aaggataaaa atcaagctat tacgcacaca ataatgctat ttacttgttt ggaaagtcaa 16921 taagctaaaa tccgcctttt aaaggcggat tttagatggt aacaaaagtt tttgagacaa 16981 gctgtaccga agtgaagctc ggtagcgctt gtatttttgt attttttttc tataactgct 17041 gtaaaatcca ttcaacagct acagctaagt ttttggcgat gtaatctggt tttgtgtggt 17101 gctggtaatc acctgccaaa acgcgatcgc caaaacctgt ttccaccaaa atacccacac 17161 agccagcatt gtgtgccata tccacatcag ttgctttgtc ccctaccata aaactgtgct 17221 tgagatccaa atcatgttcc caggctgctg caaccaacat tcctgtgttg ggtttgcgcc 17281 aagtagacca ccgggtgtac gctgggtcta gtcctccttc tggtgggctg aggtagggac 17341 aataatacaa agcatctaat tttgctccag cctcctgtgc caagagtcgg caaagtcgct 17401 gatgtaacgc ttgtacgtga ctgtctggat aataacctct ggcaggtccg gattgatttg 17461 aaaccagaca gcaaaaaata cctcggtcat taagctgacg cagggactta gctaccccag 17521 gaattaaatg taaatcttct acagagtgaa tgtagccagc ctcaacattt aatactccat 17581 cacggtctag gaatacagca ggtttaccca ccccaaattt tctccagcac agttttaggc 17641 gaaatatctg ccattttacc tgtagaggat ttgatggcga ggaatttatc actgttgggc 17701 agtaactttg ctgggtctgt tggaccaaac aaggcgatgg tataagtctg taccgccaca 17761 cttaagtgca tgggggcgct gtcggtacac agcattaaat ttgctccagc aatcatggct 17821 gttaacttgc caatatcatc aggagaactg atttttatat tgggtacgga ctgtttgaga 17881 ctgcggacaa attcttcatc ctctggtcct tggatgacta ccacgggcat ttctggctgc 17941 ttttgctgga agtcttgaat aatttgctgc caatttgcga cagggtaaac tttatccaga 18001 ccttttgtct tggcaagtgc gctagaacca ccgtgaatca aaatgtagcc agtttccttg 18061 atccccaagc gctgttgctc cttattcgcc cagtcaatat ctgcaactgg cacattaact 18121 gctaactcgg gacaagggga gttaatacct aacccttgca gcaagtcatg 
gtacatacaa 18181 gcggcatact gttctgtttt cagaggaact gaattggtga gaaaaacaga tcctttgctc 18241 ttgtagccaa tccgtgtggg gattccagtc agccagagaa aaacacccac taaggcgctt 18301 tgccccagag caatggcaac atcatattcg cgatcgcgaa tcgtgcccac caagttgccc 18361 caatctgcca tactgttacg gtctttgtag tcaaacgcca gtacatcatg gactgactta 18421 ctcacccggt aagcagtctt tgagcggggt tccacaacaa catctacctg agcattgggg 18481 taatagtgct tcaggtcatc taaggtcggg aaaaagagaa tttggtcgcc aattccgcca 18541 gggacaaggg ctactattcg cataatatat attgacgctt actcgctcat tattttaggg 18601 gaagatttgg gttagtaatt aggaattaga aattaaggat tagagttgac cagttgtggg 18661 ttgttagttg tcaaataata ctactaacca ctaacctatc ccctatagtg agattgaggg 18721 attctgtgca tttattaatt ccagccgctg gaagcggaag aagaatgggg agtgaccgca 18781 ataaactcct acttgtggtg cgctctaagc ctattatcgc ctggactctt ctcgctgctg 18841 aagcggcaag tcaaatcagt tggataggca ttatctccca accattagac tggcaagatt 18901 tccagacgat tttgactcaa ttaaagcaat cttcgcctgt ggaactgatt ccagggggtt 18961 ctacccgcca agaatcggtt tacaatggat tacaggcgtt gccagactcc gcaaaagaag 19021 tattgattca tgatggagcc agatgcctcg ccacaccaga tttgttcaac tcttgtgctc 19081 aagctattca acactgtccc ggtttgattg ctgctgtccc ggttaaagac acgattaaag 19141 tcgtagatga aaatggcatt attcaaagta cgcctgaccg acggcaatta tgggcagccc 19201 aaactcccca aggatttgat gttaagttgc ttaagcagtg tcacaccgaa ggtgtccgtc 19261 aggggtggga agtgacagat gatgctgctt tgtttgaaaa gtgcggcttt cctgtgcgag 19321 ttgttgaggg ggaggagacg aatttgaagc taacaactcc ccaagatttg gcgatcgctg 19381 aattcattct caaaactagg ctcggtgagc agtgaagtcg gttataacaa tttttgggca 19441 ttgccgaagg ttcgccaatg gttatctcaa taatagcctt gacttttgtc atccaatatg 19501 ctaattttcc aaacacttgt gattctcctt agtcattcgt cattttttct ttgtacaagt 19561 gactaaatga gtcatgacca atgaacggca agtgctgtag cctggaaaac cctagtcgag 19621 cactgcctcc taatgaccaa tgactaatga ccaatgacta atgccgtgat aaatgcagcg 19681 accaagatag aagcaattct ctatttgaag ggtaagcccc tgtctatcag tgaaattact 19741 gagtatgccg cttgcgatcg cgcgacagca caggaaggca tcatagaact cattgatgag 19801 tatgcccgcc gagatagcgc cctagaagtc 
gttgaaactc caaatggtta tagtttgcaa 19861 ctgcggtcag attttcatga cctagtccaa actctgattc cagtagaatt gggcgtggga 19921 gcattgcgga ctttagcagc tattgcccta aatagtccaa tactccaaag cgacttgatt 19981 aacgtgcgcg gttctggtgc atatcaacac gttcaagaac tcgtcgaact tggttttgtc 20041 agaaaacgcc gagacagtga atctcgctcg tattcattac aaaccacctc taaatttcac 20101 cagtatttcc aaatcgacca acttccagca tcattctcca acggacagga gcaaaaacaa 20161 ctagagctag aactcacaac agcagaatcc aacagcagtt ctggtactgt ggaacaatcc 20221 tagtacctca gaaagaggtg gtgatatgtc tgacccttct actatggttt agagtagaat 20281 aagaaactca gctataaaaa cagccaatgg tgtttgatcc taactttctg aatgactacc 20341 ctgaggaaca tcccaatcag cttatctctg atagctttga ggaacaccct aatcacttac 20401 tcaaatatct acagcaccag tcccctgagg ttctagcccg cgtcgctcag tccgtcagcc 20461 ccgaaattaa acaaatcatt tcgcaaaatg tccaagggct tgtcggaatg ttacccgcag 20521 aaaattttaa cgtgcaaatt acaacagatc gagacaatct cgcgggtctt ttggcgtcgg 20581 ctatgatgac gggttatttt cttcgccaaa tggaacaacg gatgcagtta gatcatttga 20641 gtaacaatca ttagtcaaga gtcaacactc aagagtcaaa aattattaga ctcccagact 20701 atggactact tctggggata ggttcctaac tgtacttgaa atgtttgaaa ttccccatta 20761 cggttgacct caatttccag aatcttgcct acagtactcg actctactag tttctgtact 20821 tgagcagatg tcttaaccgg tttaccgttg actttttgaa tcatgtcctc aggaaggagt 20881 ccggctcgtt gtgctggaga tttttctaga actcctgtaa tagcgacacc aacattctgc 20941 ttaatattga gatttttgtc ttgattaatc tgctgttttt tcgcaggaga aagatctgac 21001 atttcaatcc ccaagaaagg atgttgtaca cgccctttag taaaaagctc gttagcaata 21061 cgggcagctg tttcaattgg gatggcaaaa cctagccctt gagcatcggc gcggatagca 21121 gtgttaacgc caatgacatc accttgagcg tttaacaatg gtccgccaga gttaccaggg 21181 ttaatagctg catctgtctg aataaaaccg actcgcttgt ctggaacacc aacttgagcg 21241 ctagtgcgat ctgtagcgct gataataccg attgtgacag tattgtctag acctaaggga 21301 ttaccaatgg cgatcgccca ttgacctggt attaagtttt gtgaattacc tatcttaact 21361 gttggtagat cagaagccga aattttgaca acagcaacat ctgttacaga atcaactccc 21421 accaccgtac cgtcaaaagt ccgaccatct ttgagggtta ctaatactgt gtcagtatct 21481 gcaaccacat 
gagcgtttgt gagtattcgt ccattttcac tcaaaataaa cccagagcct 21541 gtaccacgct ctattcgttc tcgggggaat ggttgctctt cttcaccaaa aaaccgccgc 21601 aacagcggat tctttaaagc atcagaaata ggattagcaa ctttgcgggt tgcattaatt 21661 cggacgactg ctggtccagt tctttgaaca gcagtcgcaa taaaattcat attatcgcca 21721 atagtagtcc cagtctgacc tcccacaggg cttggaactg ggttttctga agatgaagaa 21781 gctgttacat ttcttaactc tctaaaccaa ctattttgtg gtacaaaata gcgactaccg 21841 aacaagcctg caccgccgcc aatagtcaat aaggatagat aaataaccag ttgctttaaa 21901 gataaggata acttcataat cattagattc aaggacagca atgcagatgc taacttctaa 21961 gtgtagtcaa gtcaggggat acttgactaa caaagaaccc ggaactggtg gagtcagaac 22021 tcacgacagt gaacagtacc agccgcagcg aacaaaaaaa ttgctaactg aatcactgat 22081 aactgataac tggtagctga ttcatccctt aaagttccaa ctgggttttg tattctgcat 22141 tctgattctt ctcaaagatt cattgctcat cagtcagttt tgattattaa caagcagcac 22201 ttaaggactc acgacctttg atcgcctgcc gttggcattt aagatctaga gtaattgagt 22261 gtctccttta gatacatgac tttttgctat cgtttactct ttctctgtat tagcatagtc 22321 accagcaccc ttgggcttat gcacagtagc ccatactctg tattggcgca aacttctgtt 22381 actggttgtc aaagttcagc ccttgaacgc ttcgggcgac acaaaattgc accaggagaa 22441 actgtggaga gtatagcgca gcgttacgat ctcacaccag ctactattat cgccatgaat 22501 ccgaccttaa gaaacaataa agttactatt ggtcgcgaaa ttcaaatccc tccctacaat 22561 gggattgttg ttgaagtacc tcctggtcaa aactggcgac aaatcgcagc aaaatataaa 22621 atccgccctg atgtcttatt tgaggtgaat ggctgtcaga aaaattccag atttgtgttt 22681 gttccagagg taaaacgctc acccaatcgt cccataacag agtctgctgc atcaaattct 22741 actcctacta agttagctgg gaatccgtta gcggaagtcg caactgtggc tttaccttat 22801 ggctggcaga ctaatcctac cgatggtaaa gttttttttc atagtggtgt ggatttgtta 22861 gcagcaaaag gaacgtctgt acaagcgata ggtgatggga cagtcgcttt tgccagtgaa 22921 caagggactt atggtaactt agtcattatc aaccacagtg ggggattgca aagccgctac 22981 gcccatcttg aaaatatcaa agtctccgtt ggtcaacaag tgaataaagg agacatcgtc 23041 ggaactgtgg gtacaaccgg aacacctacc acaaaccaac cccatctcca ttttgaagtg 23101 cgttctagct catccctagg ttgggcggct caagatccta gaggatattt acagcagtga 
23161 aactcttact ctctcgttcc ctcgttgttc tctctcgttc ccaggctctg cctgggaatg 23221 cccaccttca ggcagagcct catattctcc gctaaaatgt agataaagta ctcaataatt 23281 attctttaaa ccaactcaaa atagactcag tcgattgaac taagtgcatc cctgctgctg 23341 caaatctctc aaatgcagca tctgcggctt ctgcataatc cactacacca ggaacaacaa 23401 caggagaagt acaatcttcc agcaaataaa cttttttcgc taaacgagag tctatctgtt 23461 gaatttctgt taataaatca tccactgtcc aagcaacaca gtgacttttg gcttgtcccg 23521 caataatcac agcatcaaat tctaacaatt cttgaatcaa acgcgtattc ttttgtacaa 23581 tctgacgttg atcaaaactt tccaaaactt ctggacgtaa aacagaatag ttttctgtta 23641 aaggatgatt acctttaatt tcaaattgtg tctgactctg acgagctatg cagtggaaaa 23701 atactgcctc ttccacagca gaaactaatg catgaccgat accacccagc atagaatgat 23761 agggccatac agtcaaagga tatttgccat cttgagtcag ttgcttaacg tagtgataag 23821 cgtgtttttc taaaaattca tagttatatc caagacttgg ggagccagtg cgttgcggag 23881 ccagtcctgc aggagggttt cccgacagag ggacctggcg ttggggttcc ccccgttgta 23941 gcacctggcg tgcaactctt gggttcactt tccaaacccc tttttcaata tctgctggtg 24001 taatgctggt agcagaagga gttgggtgtt caccagcagt attaatccaa aaaatgggat 24061 gaaaaatttg cattgctgta tgagtatcca tagtcgggat cattttggta atgactccca 24121 aattgccata gataaactca cacaaccgga tgttatcatc taccgcaccc gttccagatt 24181 tcccacctac gaataattca aattcgggaa tgcaaaaggt gttttgtaca tcaatcaaaa 24241 gtaaacaaat gcgtgtttta tcttcggaag atggtttaat attatatttt tttgcccatt 24301 ctctcgcttc ttttgcacgt tgttggtaag gtacgcgcca gacatagccg acttcttcag 24361 ggttgaaatg cggaggaata ggtagttggg tttttgttgg ggtgttcata agttaaattg 24421 caacctctat ctaaacctta atagaaagct tgccatgtct gggctacttg gataggtttg 24481 tggtgggtgg atgactctat tttagtgacg gattttgtcg ttattaaaat ctattgccat 24541 tactagactt taacttttta ttcctgactc tggtttgcgg caaatctttt ttatccttct 24601 gaaagtagca gatttcgacg agagtgggcc atatctctaa agatgcctga aattgactga 24661 taacgtcatc aacaatccga gagacatttt gctgaaagaa atcgggttca aacccataaa 24721 atacgcgcga taggttgtca gcaaagaact ggagagattg tctacgttct tcattctgag 24781 tggatattga agctttgacc acttgattat agatgtcgcc 
gactctttcc aaaatttcct 24841 cagagtcttg tgaagttact tcgccggaat agtcaaactt gatcgtaaga tcttcaagtt 24901 cataaatacc gttcttgatc ttctttttga gtaaattaca tatctcctca ttgaaaattt 24961 cacttgcaat cggattacgc atatagttat tccgaatatg gttctgtgca gcatctacgt 25021 gtgcagctac tatcccttct tgctgtatcc aaagccgata gatgtaatcc tcgaacctca 25081 gtcgagtcgg gaaaaagggt ggcaaaccaa actgattatc ataccccgcg accccacaat 25141 ccatcctcca atttttgttg gtaacaactg gtctgaaatt tacgagaaca taaatatcat 25201 tgagttcgtc tagactaatt tggttttcat cattcagata catgtgaata aaatcgagcg 25261 tatcaatatc gtgagttcct gtccgaaaag tctgggcaat tttgacaaca gcactattag 25321 agactttgct tcgttgcagg aagagagagt tttctttgaa aaaacctttt gttgtgttgg 25381 tttctaattc catagctgta tctacaacta gttctccttt ctcaaagttc tccggtatca 25441 tattgacttt ctgaccgaga acatcctcaa aacaggtgag taaatcatag gaacgatgga 25501 cgaaaccatc ctgcttggac ttgaatagtt tgcctcgaca gatttcatct gctaataaag 25561 attctgggct attttcgatt aatgcatccg gtctcatgtc atcatctgaa ctcaccatta 25621 aatgtccaag tgtgtacatt agggtaaaat tccgatttcc gccataactg ggtcgaaaca 25681 ggtttctgac gagtgactca agtttcttat cgcgaagtct ctggtttaag aaattgataa 25741 actgttcttt ctcacgagga ccgacataaa atacgtcgtt gaccgtttta gtttgctcta 25801 ggagttgata gtatttttca tagttagcaa cacttgaatc atcaaaaata atgatttttg 25861 gagcatgtcc gtttacccag aaattatcat cgtacttttc aatcgtttct gctacatctc 25921 ttagtcgata agtcgggata acaaagattg tttctttggt tttcataatg tgttgaattg 25981 tctcgaaaag ttcaaaacta gaaagcaaaa tcgggatgct ttgatgtaag atatccgata 26041 aattaggtca tgctacaaaa gagactaaaa catgaaacgt ctgactctca ttttttggaa 26101 atcacaataa ttacagaatt aggaaaacga tggttgtctc gctaatttat tgattaatgt 26161 gctactttcc gaaatgcact tagcatgaat gaaatttagg gcgatcgcgc atcatataac 26221 taacgcaaga aattctctct cggttctgca ttagccagag ttacctgttc ttttcaaaaa 26281 aaaccagggg aaaacaggat tcaatcagta gaatcttggt caaagtaacg ctgcataaac 26341 cgtaaaaatt cagaagtcag gagtcagaac ccttgtcaaa taaataattt caacgataga 26401 tattttgtgg aaatcgtgac cccaatagct ggaagctttg ttcattctga atttttgatt 26461 ctggattctg aatccttaac 
agaaacccta tccacacaat ttgtaattac aactaatttt 26521 gcctttctca aagcctgctt attagacaca acagaattgt ttcatatttt tatacttcat 26581 actaagaatt ataggattct ttctaagagc tacaatatat cgccagtata aattgaggta 26641 taaacaaaaa gtatagggcg tttagaattt tttcaaacta cttgaaagca caaattcatc 26701 tactatattg ctacgtccat acctagggtt gtcttgttgt caagctagtt tctacagtca 26761 atcaggatgt ccctttcttc aaataatcat tgcagcttac tgataattgc atccacagcg 26821 tcatctgctc tcatcattat gtatattttg cagctacgga tattcctgat gaaatagtgg 26881 tatattacgg ggtcaatcaa atccctggca tcaacaatgg aattcataac agatgttgct 26941 agtttacttg gttaagtcat aggcgctgct gataacatct atcgtttaga ggaatttgaa 27001 atttgaaatt ttggatttta gataattcat tagtcatttt tacctgtttc ccaactctcc 27061 ctcctccccc actcatcctc tccctttttt tcctgctctt tttttcaact tcatcgctca 27121 aatgagcctt tcatatggca atctggaaaa cagaatgttt tccaactgga tatctttgct 27181 acaaaaattt tacttaacag gatatgcaaa ccctgcctac gctaacaact tccaacactg 27241 taaaccttca acccaccttt gacaccacga tcaaacggcg caaaacccgt ccagtcaaag 27301 ttggcgatgt caccattggg ggtggctacc ctgtggtggt gcagtcaatg attaacgaag 27361 acaccttaga cataaatggt tcagttgcag caattcgccg tctccatgaa atcggctgcg 27421 aaatcgttcg cgttactgta cctagtatgg ctcatgccaa agctctggca gagattaaac 27481 aaaaattaat caaaacatac caggatgtgc ctattgtggc ggatgtacat cacaatggta 27541 tgaaaatcgc cttggaagtc gccaaacata tagaaaaagt gcggattaat ccagggttgt 27601 acgtgtttga aaagccaaac cctaatagaa ccgaatacac taaggctgaa tttgacgaaa 27661 ttggcgaaaa agttcgtgaa actttggctc cattagtgat ttccttacgc gatcaaggca 27721 aagctatgcg aattggggtc aatcacggtt cccttggtga aagaatgttg ttcacctacg 27781 gcgatacgcc agaaggtatg gtgcaatcag caatagaatt tcttcgcatt tgtgaatcct 27841 tagattacca caacttggtt atttctatga aagcttcacg agtgccagtg atgatagccg 27901 cttatcgcct catggcacag cggatggatg aactgggtat ggattatcct ttgcacttgg 27961 gtgttaccga agcaggtgat ggcgagtacg gacggattaa atccaccgct ggtattgcca 28021 ctttattagc ggatggcatt ggcgatacaa ttcgtgtgtc actaacggaa tcaccagaga 28081 aagaaattcc tgtatgttac agtattttgc aagctttggg attgcggaaa acaatggtgg 28141 
agtacgtcgc ttgtccctct tgtggacgta cattgtttaa cctagaagaa gttctgcaca 28201 aagtccgcga agccactaaa catcttaccg gactagatat agcagtcatg gggtgtattg 28261 tcaatggacc cggagaaatg gctgatgctg attacggtta tgttggtaaa acgcctggat 28321 acatttctct ttaccgtgga agagaagaaa ttaaaaaagt tccagaagat aaaggagtgg 28381 aagaattgat caacttaatt aagatagatg gacgttggat aaatccataa atttttcgac 28441 aatttttgta ccgccgacat aggtagtcgc cggatgacaa agcattttta tgaaataaaa 28501 aaatatgctc agtctttaca gcaaaaccct gtgttagatt gagcatactg attataaaat 28561 tttgcgtctt gcacagctgt cattatggta attacaagaa gtgggcttgt tttgggtgct 28621 acagcggtga cactgacaac aatcgcagtc actagtctgg gtattcactc acaaggacaa 28681 gctttattta aagaaagtcc taaggaatta atagatgaag tttggcaagt tattaaccgc 28741 caatatgtag atggtacttt taataagtta gattggcagg ctgttcgtcg tgagtacctg 28801 aacaagccct acagcgacaa gcagcaggct tacaagtcca tccgcgaaat gctgaagaag 28861 ctgggtgatc cttacactcg gtttatggat ccagaggagt tcaaaaacat gcaagtggat 28921 acctctggag aactgacagg tattggtatc caaattggtt tagatgagaa aaccaagaag 28981 ctgactgtaa ttgcgcccat tgaggataca cctgctgcaa aagctggtgt cctggcaaaa 29041 gatatcatca ccaaaatcaa cggaaaaagt accgagggta tggataccaa tcaggcagta 29101 tccttaatcc gaggtgaagc aggaacaacg gtcaacttga cagttctgcg gagtggtcag 29161 gaaaaacaat ttaacattgc aagagctaag attgaaattc atccagtaga gtactctcaa 29221 aaacaaactc cagcaggcaa tcttggttat attcgcttga agcagtttag cgccaacgct 29281 ggtaaagaaa tgcaacaagc aatcagaaat ttagagagta agcaagtcgc tggttatgtc 29341 ctggatttac gtaataatcc tggtggcttg cttttctcta gcgtcgaaat tgcgcgaatg 29401 tggataaaca atggcacaat agtctccaca aaagaccgct tgtcagaagt agaacgagaa 29461 gtggcgaatg gacgtgcttt gacaaacaaa ccgctcgtgg taatagtgga taaaggttca 29521 gcaagtgcta gtgaaatcct ctctggagct ttgcaggata acaagcgtgc tgttatagtc 29581 ggtagtcaga cctttggcaa aggcttagtg caatcagtgc gtcctctgga cgatggttca 29641 ggactagcgg tgacaattgc gaaatactac actcctaata accgagatat taataagcat 29701 ggaattgacc cagatgtcaa ggtggacttg acgactgctc agcgagaacg gctatggctt 29761 aaagagcgag acaaagttgc caccctacaa gatccccaat ttgctaaagc 
tgtagaagtg 29821 ataggcaaag aaattgcaca aaaaacgaac aatagggccg aaaaaaatta aaaaatcaac 29881 actcaagagt caaaagattt ttttcgactt ttgatttcac actggttggc atttaccaga 29941 cctaatcagt cctacgcggc gggcaatagc ttcttcttgg gaactaaacg gcccccattg 30001 ctcaataatt tctggactac tttcctcggt tgcttcgtca ctggggatta tttcgcaatg 30061 accagccgaa tgcttgacaa tataccaagt ttgtgtatca ctcatgaatt gatgtgaata 30121 tcgccagttt agtgtatata tgcctaatga tacaagcttt ttggcagtag gtcgtgtgaa 30181 tctcaatgag gcgaggtcaa cccctacgtt ccattcaaaa ttcaaaattc aaaattcaaa 30241 aataattaat cagttttaaa ctctttgaat tttggatttc agttgaattt gagtgaatcg 30301 agggctattg tctttcaatc aggattatgc cgcaatactt tggacaactg cacacaactg 30361 aaccggcttg gtcaattctt gagggagtgc aagcaataca acagagcgat cgccatatcc 30421 tcttcaaatg tggcgacccc tgtttaacca ttagcgtgct agccccgaac ttaattcggg 30481 tacgaatgac gccaacgagc gaattcctac ctcggcgatc atgggcagtc gcacaagcag 30541 atgaagaatg gccgactgtg ccgtttgagg tgcgagaaaa agcagaggct atagaaattg 30601 aaactgagca gctgcgcctc gttgtgtccc gcaatccttg tcgtatccag tgcttcgaca 30661 aatccggaca gccctttgct cacgatgccg acccagggat ggggtggcga actggtgcca 30721 ttgctgggtg gaaacagatt gaaactgatg aacattttta tggtttcggt gaacccactg 30781 gcttactcga tcagcgttca aaagtgaaaa ccaactggac atctgatgcg attgactacg 30841 gtatcctgac agacagtatg tatcaggcaa ttcccttttt gatcgcgtta cgtcctggat 30901 tggggtacgg gcttttcttc aatacgactt tttggagtcg ctttgatttg ggggcagaac 30961 aacctggagt ttggcggatg gaaactcaag ggggtgaact ggattactac attatttatg 31021 gaccagaacc cgcaaaaatt atcgagactt acacccagtt aactggacgg atgcccttac 31081 cgcccaaatg gtcactaggt taccaccagt gtcgctggag ttacgagtca caagatatag 31141 tacgcaaact ggcggatgaa tttcgccagc gccgcattcc ctgtgatgtt atccatctcg 31201 atattgacta tatgagtggc taccgggttt ttacctggag tcagaagcga tttgctaacc 31261 ccaaagaatt aatagacaat ctcaagcaag atggtttcaa ggtaacgaca attgttgacc 31321 caggggtcaa gtacgaacca gaagcagatt acaaagtctt tgatgaggga ttaaaaaacg 31381 actattttat ccgaaaaacg aatggtcagc tatttcacgg ctatgtctgg cccgataaag 31441 ccgtctttgc tgatttcctg cgccctgaag 
ttagagattg gtggggaagt ttgctaaaca 31501 gtctcactga cgtaggtgtt gctggaatct ggaatgatat gaatgaaccg acacttgatg 31561 accgtccatt cggtgatcct ggtaaaaaga tggcgtttcc cctcgatgca gcccaaggac 31621 caactgacga gagaactacc catacagaaa ctcacaacct gtatggacaa atgatggcac 31681 aggcatctta tcagggactt gaaaaatctc gtccgacaga acgctccttt tttctgacac 31741 gatctggata cgctgggatt cagcgctggt ctgcagtatg gacgggagat aatcaatccc 31801 tgtgggaaca cctggaaatg tccttaccga tgatgtgtaa cttgggtcta tcgggcgtcg 31861 cgtttgtggg tagtgatatt ggggggtttg cgggtaacgc gacggctgaa ctatttgctc 31921 gttggatgca ggtaggaatg ctttacccct tgatgcgggg acactcagca ttaacgacag 31981 cacagcatga accttgggtg tttggcgatc gcgttgaaaa aatttgccgc gagtacatcg 32041 aactgcgtta ccaactgctg ccctacattt acactctctt ctggaaagcc gcaaccactg 32101 gctcaccaat tctgcgcccc ctgctgtatg attttcccaa tgacccgaaa actttcaccc 32161 tctgtgacca agttatgctt ggtccgtcat tattagcagc accaatttat cgtccaggtg 32221 ttgaacaccg tgctgtgtac ttgcctgaag gttgctggta cgactggtgg agtggcgaga 32281 cttttcaagg accaattcac attctggcac acgcaccgct tgagcgaatg ccattatatg 32341 ttcgtgctgg ctcgattatt ccgatggcac cagtcatgca atacgtagat gaacgtccct 32401 tagaccagat gaggcttcgg atctggatgg ggacaggtga gtttacactt tatgaggatg 32461 acggtcatac cttcgagcac aaaacaggag ccttttgcac aacaacttac caggtttgtt 32521 tacaagggca acgaacgatt gttgagattg gaggaggaga aggtaacttt tcacccgcaa 32581 cgcgtgaagt cattgtggaa ctagtcggtg ttggcgaaca gagttttgtc gatgatggcg 32641 ctgcgcgtca gttgacgttt gaaatttaag ggtggttatg ccatttatgt ctttcgtcag 32701 ccaccctccc atctatcccc tgtaatccgt gtggttgtgg gggattttgt tgttagcgcg 32761 atatgcgcag tgtcaagcct ccggcttatc gcgcaaagcg cagacacaat catgcgtatc 32821 gccccacgac gaaaatgcat aaatcacttc acccatccct aactaaataa tgatggcacg 32881 caccccgcta tccttgggcg gggtatcccg gcacatcttg ttcatagtag ttgtaagtcc 32941 ccataatccg acgtggttgt ggggtctttt gttgcctcag ggaaacacat gtagtattgt 33001 aatgcacagt ctttttaacc ccaagaaaat taaccccaag aaaactagta ctttttaacg 33061 atgatttgta gtataatcaa gctccagtac cctagcaagt tgctgacaat aatttaagtt 33121 taataactag 
actgtttcta tttagaggta aacaatgctc ggttggatat atatgatttt 33181 agcaatcctt ctagaagtag caggcactac ttgtatgaaa ttctcagaag gattcactaa 33241 agtatggccc tcaattttca tattcgtctt ctatgctctt tgttttagca tattgacgct 33301 cgctctgaaa acaattgaga taagtcttgc ttatgcaata tggtctggtt taggaacagt 33361 cttaatagta tccataggaa ttttgtggtt tcaggagtca gtaaatatcg taaaaatatt 33421 atcaatcgta ctgatattaa taggagtgat tggtctgcac ataagtcatg aacctgtgtc 33481 cgaagaagaa gggattttgt caagtgttgc tactagtgtt gaccaactcg aaactacaca 33541 gccgagcaaa acacaagata tacttcctcc ggtttctgac ccggctctaa taatgccgga 33601 gtcggtagaa tacccagaaa gtgaaaaggt gccaattatc aaggtattgt caaaaaacca 33661 tgctgaggat taaacttact gactgacatt atctgttaca gtttataaat tgtaaaagtt 33721 tgttattaat aaatatacta tcagggtaag cg // LOCUS NODE_802_length_33523_cov_4.77949133523 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 33523) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 33523) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..33523 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(150..461) /locus_tag="DP116_06715" CDS complement(150..461) /locus_tag="DP116_06715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997538.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thylakoid-associated protein" /protein_id="PRJNA477356:DP116_06715" /translation="MAKTNTTELLEALAAEIGENIYIDIAKWHLYLSNAKLHTIVAER VYPLITASKSVEEDEVIQVLQSIPVKIGGGRKELPLIDLLPQQSQANLVDILKRFQQD I" gene complement(575..832) /locus_tag="DP116_06720" CDS complement(575..832) /locus_tag="DP116_06720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314637.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06720" /translation="MPPRWPRKPDRKDPAYRKLDDRMNFAIHVAIFALCNSGLWFFHN LNMITWEWLPWLTVGWMVVLLGHLIYISAIANYSETPPTST" gene complement(952..1458) /gene="moaC" /locus_tag="DP116_06725" CDS complement(952..1458) /gene="moaC" /locus_tag="DP116_06725" /EC_number="4.6.1.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877417.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyclic pyranopterin monophosphate synthase MoaC" /protein_id="PRJNA477356:DP116_06725" /translation="MTQENFEIFSNNLSHLDNQGQAQMVDVSGKASTVRQAVAAARVR MLPETFAAIQAGNAPKGDVLGTTRLAGIMAAKHTASLIPLCHPLPLQKIEVQVTPCPD LPGYQIEATVKTKAETGVEMEALTAVSVAALTLYDMAKALEKSIQIESIRLISKTGGK SGDYCQSD" gene 1493..1566 /locus_tag="DP116_06730" tRNA 1493..1566 /locus_tag="DP116_06730" /product="tRNA-Arg" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:1527..1529,aa:Arg,seq:acg) gene complement(1612..2844) /locus_tag="DP116_06735" CDS complement(1612..2844) /locus_tag="DP116_06735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745644.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_06735" /translation="MNMSRTINIQLQSNLLILFAAGLLFWCSMASLLPTLPLYVESVG ATKLQTGIVMGSFAIGLLLFRPLLGRLADWRGRKIVLLIGTLVAVIAPLGYLSVKSIP LLILVRAFHGISMAAFTTGFNALVADIAPLEKRGEIIGYMSLVNPLGVAIGPALGGYL QAGVGNQILFVITAELAFVAFLGLLTIVNPPLITNQQANTKNSQFWQVLFSPRVRIPA IIMLLVGLAFGALHVFIPLFMKSTGVDLNPGLFYTAAAVASFGVRLFTGKASDRFGRG LFITISLVFYTLSLSVLWLANSAAAFLFAGIIEGIASGTLIPMIAVLMVDRAHPYERG RVFAMCLMGLDVGIAIAGPILGYVADYLGYRDMFGLCASLTFLGVVVFLTFSGKDLSS SVRFALGRGRDVYALKDT" gene complement(3309..4244) /locus_tag="DP116_06740" CDS complement(3309..4244) /locus_tag="DP116_06740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-beta hydroxysteroid dehydrogenase" /protein_id="PRJNA477356:DP116_06740" /translation="MRILVMGGTRFIGVYLTKLLVEQGHSVVLFNRGNRPAPVEGVGQ IWGDRTDAAQLKEKLSSVNFDAIFDNNGRELTDTQPLAEIFQDRVEHFVYMSSAGVYL KSDQMPHVEGDTIDPKSRHLGKYDTEAYLTQQGLPFTSIRPTYIYGPLNYNDLEAWFF DRIVRNRPIPIPGNGLHFTQFGHVKDLATAMSKVLGNPVALRQIYNVSGDRFVTFDGL ARACIVAAGKSPDEIKIVHYEPKKFDFGKRKAFPLRVQHFFASVNKAKTELNWHPEYD LISGLKDSFENDYLVSGRDKLEVDFSLDEEILQSL" gene 4621..5763 /locus_tag="DP116_06745" CDS 4621..5763 /locus_tag="DP116_06745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_06745" /translation="MPLKCALVHEWLTPKATGGSELVVREILNHVDADLYALIDFESS HSESYLYQRKIGTTFLQHFPFARNGVQKYLPLLPLAIEQLDLRAYDVILSSSHAVAKG VLTTSEQLHICYCHSPMRYAWDLTFDYLQQSKLGSGIPGWMTRYLLHRLRQWDVLSAN RVDYFIANSKHTASRIWRCYRREATVIYPPVNIESLPFLPQKEDFYLTVSRLVSYKQV SLIIRAFNKLQLPLVVIGTGPEMRKIRRIAQSNIQILGWQPDDVVKKYMANAKGFVYA AHEDFGIALVEAQACGTPVIAYGAGGALETVRDLREHGEQGTGIFFKEQTEEALVDAI EKFEVYQGKLNPEYARVHAAQFSPQMFAQRYLDFLNKCIQTRPSPD" gene 6079..6831 /locus_tag="DP116_06750" CDS 6079..6831 /locus_tag="DP116_06750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128320.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="PRJNA477356:DP116_06750" /translation="MPWKRADVVETNTSTTYRGSRLFFKRGQQKKTPRVQTKGLSFQV LNGEFTKRLFDIFFSLSVLILFSPVYLILALLIALSSEGPIFYIQERVGKNYKPFNCI KFRTMVTNADEILVQMMETSPHMRQEFEANFKLKHDPRITKIGRFLRMTSLDEFPQFW NVLKGDMSVVGPRPLVAEELPKYGCHIEQILTIQPGITGLWQVSGRNDIPYPRRVQID LHYAKFRNFWLDLWIIFKTIGVVIMPKDNGAY" gene 7423..8502 /gene="gmd" /locus_tag="DP116_06755" CDS 7423..8502 /gene="gmd" /locus_tag="DP116_06755" /EC_number="4.2.1.47" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878234.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GDP-mannose 4,6-dehydratase" /protein_id="PRJNA477356:DP116_06755" /translation="MTQKKRALITGITGQDGSYLSEFLLEQGYEVHGIIRRTSTFNTD RIDHIYEDPHKEGVRLFLHYGDLTDGTTLRRILEEVQPVEIYNLGAQSHVRVSFDSPE YTVDSVGMGTLRLLEAIRDYQHRTGIQVRFYQAGSSEMFGLVQEIPQKETTPFYPRSP YACAKVYAHWQTVNYRESYDIFACNGILFNHESPRRGETFVTRKITMAVARIFAGKQK KLYMGNLDAKRDWGYAKDYVKAMWLMLQQEQPDDYVIATGETHTVREFLELAFGYVNL NWEDYVEFDKRYLRPAEVDLLVGDSTKAQQKLGWKLSVTFEQLVAIMVEADLQALGLT SPNGKVAKVLKDNAMIRQELGALHL" gene 8525..9469 /locus_tag="DP116_06760" CDS 8525..9469 /locus_tag="DP116_06760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GDP-L-fucose synthase" /protein_id="PRJNA477356:DP116_06760" /translation="MNALELKDKRILVTGGAGFLGRQVIDQLCKAGATENKITVPRSH DCDLRILENCQRAVDQQDIVIHLAAHVGGIGLNREKPAELFYDNLIMGTQLIHASYQA RVEKFVCVGTICAYPKFTPVPFKEDDLWDGYPEETNAPYGIAKKALLVQLQAYRQQYD FNGVYLLPVNLYGPEDNFDPRSSHVIPALIRKVYEAQIKGEKKLPVWGDGSPTREFLY SEDAGRGIVMGTQFYNDAEPVNLGTGYEISIRDLINLICELMEFDGEIVWETDKPNGQ PRRCLDTERAKKAFGFNAEVDFKQGLKNTIEWWRQNAA" gene 9547..10098 /locus_tag="DP116_06765" CDS 9547..10098 /locus_tag="DP116_06765" /EC_number="6.3.3.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453977.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="5-formyltetrahydrofolate cyclo-ligase" /protein_id="PRJNA477356:DP116_06765" /translation="MGKTELRKSLLKTRQSLSVTDWKHKSQLICQNLLNSPQFNQAKT ILAYFSFRQEPDLSQLFTDSSHRWGFPRCVGQSLDWHIWTHKDTVITGIYGITEPHPD APTIASADVDLIFVPCVACDYQGYRLGYGGGYYDRMLSSPEWINKPTIGIVFEFAYLP EIPIDTWDKPLQGVCTEIALVYQ" gene 10215..10442 /locus_tag="DP116_06770" CDS 10215..10442 /locus_tag="DP116_06770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012594638.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06770" /translation="MPINWREHVVSSSDVLRGKPRIKGTRIPVSLILGYLAAGKTDDQ ILQEFPDLQKEQILACLDYARDLANFETVAS" gene 10457..10807 /locus_tag="DP116_06775" CDS 10457..10807 /locus_tag="DP116_06775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131368.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06775" /translation="MDQCVPASIGQFLRDNGYDVLILKDYIPIESPDEIVIAKAQELD AILVSLNGDFADIVRYLPGNYRGIISIQLRNHPEIIPLLMQRLVDYLSNHGDMEHYQG KLFIVEANRIRIRE" gene complement(10870..12084) /locus_tag="DP116_06780" CDS complement(10870..12084) /locus_tag="DP116_06780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113081.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PBS lyase" /protein_id="PRJNA477356:DP116_06780" /translation="MVNNINQLLVQAQAAYNAANWSSLIQCLQQLTQQQDSEHPEILK NQEHLLELALQVLETGDFQQRWEIAKVLTSLGNIVIPPLIDILKDEDAEEELRAYAAR ILGDLKNPNAIPPLVELLKTNESDELMKIAATALGQMGSLAIASLTELLAQEQTRLLA TQMLSYIRQKETIAPLLSVVEDSQVAIRAVAVEALSSFHDHRVPPVLINALNDVAAPV RREAVVGLGFRPDLREALNLVARLQPRLSDINQDVCCAAVAALSRMGGEAAAQQLFQV LVSPNIPIQLQLEAIRALSWVGTLSGLKYLQQALYQLQFPTVWQEIVTVLGRVSDTTL TDKAAEILLEMLQQNHPGVEIGNIKSAIALSLGQLGKKQAINPLTQMLADEDAQVRLH ASAALKKLSFTT" gene 12477..14072 /locus_tag="DP116_06785" CDS 12477..14072 /locus_tag="DP116_06785" /EC_number="1.7.7.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin--nitrite reductase" /protein_id="PRJNA477356:DP116_06785" /translation="MTDLATTTTASLNKFEKFKAEKDGLAVKAEIEKFASLGWEAMDE TDRDHRLKWVGVFFRPVTPGKFMMRMRIPNGILTSEQMRVLAEVIQRYGDDGNADITT RQNIQLRGIRIEDLPEIFEKLRAVDLTSVQSGMDNVRNITGDPVAGLDADELFDTREL VQQIQDMVTNRGEGNSEFSNLPRKFNIAITGGRDNSVHAEINDLAFVPAFKEAQGARG AGEEITPSSPLFGFNVLVGGFFSAKRCEAAVPLNVWVPPEDVVALCRAVLEVFRDHGL RANRQKARLMWLIDEWGIEKFRTEVEQRFGKSLLGAAARDEIDWEKRDHVGVYKQKQP GLNYVGLHVPVGRLFAEDMFEIARLAEVYGSGEIRLTVEQNVIIPNIGDSLLETFLTE PVLDKFTINPTLLTRSLVSCTGAQFCNFALIETKNRALETIKALEAELELTRPVRIHW TGCPNSCGQPQVADIGLMGTKVRKNGKTLEGVDIYMGGKVGKEAQLGTCITKGIPCED LLPVLQELLISHFGARLREPAAV" gene 14267..15781 /locus_tag="DP116_06790" CDS 14267..15781 /locus_tag="DP116_06790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310278.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_06790" /translation="MLKGLFSLNGRYRILHQTWFAFFLTFVCWFNFAPFATTIGKELH LAPEQVKTLGICNLALTIPARIIIGMLLDRFGPRITYSMLLIFAAVPCFATALSQNFD HLVISRLLMGIVGSGFVVGIRMVSEWFPPKEIGIAQGIYGGWGNFGAFGAEFALPTIA VAASFLSGGGSNWRFAIALTGVIAAIYGVIYYNSVQDTPAGKVYQRPKKNGALEVTSV KSFWAMILSNFGLIFALGLLAWRLEQKNIHFLTQSQMYLAWLLLAGLFAYQTYKAWQV NKELLTGNKTYAPSQRYQFSQVALLEFTYVTNFGSELAAVSMLPAFFEKTFGLEHVVA GMIAATYPFLNLISRPSGGLISDKFGSRKWTMTIISAGIGVAYLMAHYINGSWALPVA IAVTMFAAYFAQAGCGATYGIVPLIKKEATGQIAGNVGAYGNFGGVVYLTIFSLTDAP TLFTTMGVAALVCAFMCAFFLKEPKGSFAAAYEGEAPETQTSVNHHSSGILAEK" gene 16227..18527 /locus_tag="DP116_06795" CDS 16227..18527 /locus_tag="DP116_06795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrate reductase catalytic subunit" /protein_id="PRJNA477356:DP116_06795" /translation="MTESTKTVCPYCGVGCGLEVSPPAQLGKATHRDSHGNPIWRVRG DKAHPSSQGMICVKGATIAESLDKNRLHYPMVRDSLDQEFRRVSWDEAFDIIVNRIQT VRLTTGSEAICMYGSGQFQTEDYYTAQKLLKGCLGSNNFDANSRLCMSSAVAGYIQSF GSDGPPCCYEDLELTDCAFLIGTNTAECHPIIFNRLEKYRKKNRKVKMIVVDPRRTPT AEAADLHLAIRPGTDIDLLNGIAHLLMRWNYIDTGFIDDCTSNFPAYAEVIRHYSPDV VARQCGISVEDLETAARYWGQSNRVLSLWSMGVNQSSEGTAKVRTIINLHLMTGQIGK PGAGPFSLTGQPNAMGGREAGGLSHLLPGYRTVKNAQHRAEVEEFWGLKPGQISATPG MTAWDMITGLENGSVGLLWVAATNPAVSMPDLERTKKALLRSPFTIYQDAYYPTETAA YAHVLLPAAQWSEKTGVMTNSERMVTLGPAFRQPPGEAKADWEIFAEVGRRLGYHKEF AFANSAVVYAEFVKLTGDRPCDMTGISHDQLRESPIQWPHPEQRGTQELESRLSPFGY ACGTASPNAVACGGKPSRSAVSPGSLQGKRLYTDLRFHTPDGRARFGAYHSRGLAEPP DPNYPLVLTTGRLYGHWHTQTRTGRIEKTRQMHPEPFIEIHPRDAAQIGITDNQLVEV RSRRGKARFPAKVTKAIAPGTVFVPMHWGALWADNAEANALTHPESCPDSLQPELKAC AVQLVPINVDIALRNFQKTRAETLIN" gene 19157..19612 /locus_tag="DP116_06800" CDS 19157..19612 /locus_tag="DP116_06800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315406.1" /note="Derived by automated computational analysis 
using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06800" /translation="MRKLFKRIAENTNDQKFMHFIENIQVLVSKLLSLFMVVVIVAAI VDLGFFLFKDLFYTPHGQFNTTLFEIFGLFLNILIALEILENITGYLKKHVLQVELVI VTSLIAVARKIIILDLKKVSGIDIIGLGIAVLALSISYLIIRFSHRQKV" gene 19629..20081 /locus_tag="DP116_06805" CDS 19629..20081 /locus_tag="DP116_06805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrate reductase associated protein" /protein_id="PRJNA477356:DP116_06805" /translation="MVVFFEFEADFVDSLRCIPMQVRQKLDISGIKLKLSDWSHLTKD EREALVELPCSTESEIQTYKEYLQNLILQRTGTPPAELPIEPHPAWLDANTLPTNLQE KAREFGVTLSPQQWAELTSLQRFALIKLSRPGHENQNFPKALKEFHLL" gene 20212..20433 /locus_tag="DP116_06810" CDS 20212..20433 /locus_tag="DP116_06810" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06810" /translation="MLEVDFGLRSAPFNLRQIEFRTPLTVMCIKTSKHKLILTNERQV LMGETTPGASTGGTPATHWLPKTALPPND" gene 20442..21041 /locus_tag="DP116_06815" /pseudo CDS 20442..21041 /locus_tag="DP116_06815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877337.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="protease" gene 21149..21582 /locus_tag="DP116_06820" /pseudo CDS 21149..21582 /locus_tag="DP116_06820" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_925784.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 21848..23248 /locus_tag="DP116_06825" CDS 21848..23248 /locus_tag="DP116_06825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PrsW family intramembrane metalloprotease" /protein_id="PRJNA477356:DP116_06825" /translation="MTGKNARQNAILRQVSGIGTSFGSEPRYSLLPRQEVVIGRDPSC QVVLDAMLYRMVSRRHAAVRPLASSPDGESSWLICDLASANGTYLNGQRLQGCQQLMN GDSITLGHDGPEFVFECEHNHQQATAVAPPAATPLPPANSYPTPTSTYYQSPPKPPDA LSFTQLFPIISTGKDLTRKAYLIPGVLTVVFVVLMFATVGQPQANQLIVAIYIASAAY YFIYQLCGKPKPWWVLCASALTTAVILRTPVLDLFIFVFRGLLPGSLPSEQESITLTE LLVRMFFGAGLMEELLKAIPILLAFLIARAVPSPWRERIGIAEPLDGILLGTASAVGF TLLETLGQYVPTITQNISAQSGLESGQLVGLQLLIPRILGSVAGHMAYSGYFGYFIGL AVLKPSKSWQILLVGYLTAAGLHALWNTTGVFNGLLLVIVGVLSYAFLMAAILKARVL SPTRSQNFATRFLEPK" gene complement(23522..24169) /locus_tag="DP116_06830" CDS complement(23522..24169) /locus_tag="DP116_06830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748006.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06830" /translation="MAKKINGSKTSPYKQSQVNFELLEALLEPEDATYPWNPADEESE NYFAQLEQQFQLDDVLDEELAERSQAFYNSLDTLWYNNLNSQHYKCNTKSTVLANLQK NLQAGFAASVPQDWITEIAQKAAEIFHSGQSKGEQLVSCVKSVLPNWETDDLLVMARP FAYAMRSGEQKNVNSVVDNLGNREWTNLSEIEKAKVSLAVACHALDELKNIEEEV" gene complement(24317..25906) /locus_tag="DP116_06835" CDS complement(24317..25906) /locus_tag="DP116_06835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317191.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06835" /translation="MPSLNLAIARLINTGNDSFAIWVVNAPYPSGYVLRDCVWPPHLT QAWLEWQQMFAGHSRLDISPGATSQEANPLPLDFVAPTSTQPTSYTARLMQYLGISLW GWVFEGAILNSFERSRGIAMGERTRLRMRLEIRDPDLIALPWEIMQREPGQSAISLSQ HILFSRTTSEVEPLPYLRSEQALNILLVLGEDEKHLELKKEAASLEEILSNGSVVGSN YNGYAPCMVNTLLQPTPQELIQQLETKAYNVLFYAGHGLQGPDGGLLFLRPGMTLNGM ELAQVLTSTGVKLAVFNACWGAQPAAANHQAVSHSSLAEVLIRQGLPAVLAMRDEIAS QESQTFIQAFAAALRKRLPIDEAVAEARQQLLIVYRFNQPAWTLPILYLHPDYEGELI KNFDEGITELPETSIPGIASLVSNACLRSLSAGGKTSLLRPGITRIGRTGDNDIVIPE PSVSKRHAEILCRNSLTGATAVRTYYLQDLSTYGTTWVFRSDGWQQIHRQEVPLQSGM QLKFGSTKSQPWEFIIDNSPG" gene complement(26153..28444) /locus_tag="DP116_06840" CDS complement(26153..28444) /locus_tag="DP116_06840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315412.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein phosphatase" /protein_id="PRJNA477356:DP116_06840" /translation="MENDAATLYCPNELCQAPNPLTHKFCLRCSTPLPKRFLWAVAEG ESLANAGELLADRYLVISKALLLDTKPGLLPQTPNGEHVQAIKPYLRLFPYRLHVPQV YGILPMTGNTSDKEILLLEKPPLSVLDSTESLEVQIACELTTAWHNATSIRQLNLLWQ IAHLWQPLASEGVASSLLNPELIRVEGSLVRLLELRQDNQTSPGLPQLGEFWQQLQPQ AKGAIAEFLNEMCRSLIAGEIHSGEQLVAIIDRGLTQLGQTQTPIIKIVTKTDTGPCR QRNEDACYPAAGSIVSKPPSQTGLAIVCDGIGGHEGGNVASLLAIETIQQHLHELTKL PSEDIDSSIVLAQLEGATAAANDIISQRNDDEHRQGRQRMGTTLVMALPIAHQMYIAH VGDSRAYWITRSGCYQVTLDDDVASREVRLGYATYRDAVQQGASGSLVQALGMSASHL LHPTSQRFILDEDSVFLLCSDGLSDFDRVDQYWETEILPIITENFDIAKVTDKLVEIA NIQNGHDNVTIALVHCQVKYSEPKSTLHTSLANLSTLPIASNTAIKTPLLASPVGRNQ KTQVMPTSKPAKSLKVPLQVIVVPLLFFVVSVLLAYLWRQGSLSIPTKLFNSSPSIPL PSTSGSQDSSGNSSKDSVNAPGVILQTNSEIEFITTTSPPFGAIAPNGSVLQILPKQQ KTEKDTWIHLQVCSIGQASPSNSQSTSSTAKKRLLKQGDKVRILSSNLAKAQPKVVSR CQSSKDTSTPPDGESTTESVPQQ" gene 29322..29591 /locus_tag="DP116_06845" CDS 29322..29591 /locus_tag="DP116_06845" /inference="COORDINATES: similar to AA 
sequence:RefSeq:WP_017316697.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06845" /translation="MEDNKEKEIHRAVNPGDVISEEPQTVEEKAQQLAVDSPDITGDH IQVPTYFVVKEPNGEEKALHHVKDAEEISDVIRQARVDEEGNRVW" gene complement(30193..30567) /locus_tag="DP116_06850" CDS complement(30193..30567) /locus_tag="DP116_06850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316698.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase" /protein_id="PRJNA477356:DP116_06850" /translation="MDNPMLLKSTTRHIRIFAGEIDQDGELVPSSQVLTLDVDPDNEF NWNEDALQKVYRKFDELVEASSGADLIDYNLRRIGSDLEHFLRSLLQKGEVSYNLSAR VTNYSMGLPQVASEENTEANGV" gene complement(30621..31166) /locus_tag="DP116_06855" CDS complement(30621..31166) /locus_tag="DP116_06855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007356139.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-isopropylmalate dehydratase" /protein_id="PRJNA477356:DP116_06855" /translation="MTKVIRGKIFVLDDNIDTDQIIPAEYLTLVPSKPDEYEKLGSYA LAGLPDRYGKFVPPGEMKTTYPIIVAGENFGCGSSREHAPIALGASGVKAVIAESYAR IFFRNCAATGELYPWESQERLCDKFETGHEVSIDFESNQLINHTLGQIYNLKPLGEVG PVIDAGGIFAYARQTGMISSR" gene 31640..31918 /locus_tag="DP116_06860" CDS 31640..31918 /locus_tag="DP116_06860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208992.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06860" /translation="MNFEALPRQVNSVDVGVYECEIHLKFRLIEEKSLLSDRDQLLQV LLDALTEGSDDFLETLQANVKAQEVSEFKASPQMRRQLMRLRNSAEAS" gene complement(32750..33445) /locus_tag="DP116_06865" CDS complement(32750..33445) /locus_tag="DP116_06865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015079647.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_06865" /translation="MAAQLLLVDDEPGLREAVKDYLQESGFSVQVASNARQGWEMMQQ NTPDLVISDIMMPQVDGYQFLKQLRDDPRFRSLPVIFLTAKGMTTDRIQGYQAGVDAY LPKPFDPDELVAIVENLLERRTARTTTPTEEGDTPDITELANQIAQIKALLTQRNAIA QSPAPFTIDLTPREQSVLNLVAEGLMNKEIARRLETSVRNVEKYVSRLFSKTGTNSRT ELVRFALEHGLAK" BASE COUNT 9378 a 7084 c 7363 g 9698 t ORIGIN 1 aaatacggag cggtggggtg aacccgcacc gctccgtaac attttagaat ctcctggcct 61 ttaggcagga gagtagtcaa ggtgaattat tattttttat gaagtcgcga aaaaattaag 121 ttgtcagaca agccgagtaa gtagtaagac tatatgtctt gctgaaatct ttttaaaata 181 tctactaggt tggcttggga ttgctgcggt agtaaatcaa tcaagggaag ttcttttctg 241 ccaccaccaa ttttaactgg aatagattgt aaaacttgta taacttcgtc ttcttccaca 301 gatttactag cagtaattaa gggatacact cgctctgcga ctatagtatg taacttggca 361 ttcgataagt aaagatgcca tttggcaata tctatataaa tgttttcgcc tatttctgct 421 gctagggctt ccagtaactc tgtagtgtta gtcttagcca tataaataat cctatataaa 481 gcgaggaatc tcattctata tatattatcg ctcaaatatt gagactgact ctaccaccgc 541 gagttacaat cgccctagtc tgtctggtca gtcttcaggt ggatgtaggt ggtgtctcgg 601 agtaattggc gatcgccgaa atataaatca aatgtcccaa tagcacgacc atccaaccta 661 ctgtcaacca aggaagccac tcccaagtaa tcatattcaa gttatggaaa aaccacaaac 721 cggaattaca cagggcaaat atcgccacat gaatggcaaa attcatccgg tcatctaatt 781 tccggtaagc agggtctttg cgatcaggtt tacgaggcca acgaggaggc ataagttttt 841 gttacagata ttaacaaagt gctttctcac tcattttatg atttttggga gtaagatagt 901 gtccacccca caaatctcat cgttctaaaa ttaaaccatt 
cggacagtac atcaatctga 961 ctggcaataa tctcctgatt tccccccagt tttactaatc aacctaatcg attcaatttg 1021 aatcgacttt tccaaggctt tcgccatatc atataaggta agagctgcaa cagaaacggc 1081 ggttaaggct tccatttcga ctccagtttc tgctttggtc ttgactgttg cttcaatttg 1141 ataaccaggt aagtctggac atggtgttac ctgtacttcg attttttgta atggtaaagg 1201 atgacataag ggaatcaaag aagctgtgtg ttttgccgcc ataatcccag ctagtctcgt 1261 ggttcctaac acatcccctt ttggagcatt ccctgcttga atggcggcga aggtttctgg 1321 tagcattcgg actctggctg cagcgacagc ttggcgaacg gtggacgctt ttccagacac 1381 atccaccatc tgagcctgac cctgattatc tagatggctc aagttgttgg agaaaatttc 1441 aaaattttct tgcgtcattt caaaaaagtg tgttagtatt agatgtcggt aagggcgtgt 1501 agctcagtgg actagagcac gtggctacgg accacggtgt cgggggttcg aatccctcct 1561 cgcccgttat tatagagaca cgaactttcg tgtctctctt tgtttatatg tctacgtgtc 1621 tttcaatgca taaacatctc gaccgcgccc aagagcaaag cgcacggaac tagataaatc 1681 tttgccagaa aaagtcaaaa agacgacaac gcctaagaaa gttaaactag cgcatagacc 1741 aaacatatct ctatagccaa gataatcggc aacataacca agaattggac ctgcgatcgc 1801 aatccctaca tctaagccca tcagacacat agcaaatacc cgcccccgtt cataagggtg 1861 ggctcgatcc accatcaaaa ctgcaatcat gggaatcaga gtgccagaag caatgccctc 1921 aataattcct gcaaataaga aagcagcagc actgtttgct agccatagaa ccgagagtga 1981 cagtgtgtaa aaaactagac tgatagtaat aaataaacca cgaccgaagc gatcgctcgc 2041 ctttccagtg aacagcctga cgccaaaact agcaactgcg gctgctgtat aaaacaaccc 2101 tggatttaaa tccaccccag ttgatttcat gaacaacggg ataaaaacgt gtaaagcacc 2161 aaaagccaaa ccaaccagca acatgatgat cgctggaatt ctcactctgg gactaaacaa 2221 aacttgccaa aattgactat ttttagtatt ggcttgctgg tttgttatca gtggtggatt 2281 gacgattgtc aacagaccca aaaacgccac aaaagctaat tcagcagtga tcacaaataa 2341 aatttggtta ccgactccgg cttgcaaata tccgcctaaa gctggtccaa ttgctactcc 2401 caatggatta accaagctca tgtagccaat gatttcaccc cgcttttcta agggagcgat 2461 gtcagccacc aacgcattaa aaccagtggt aaaagcagcc atgctaatac catgaaaagc 2521 acgcactaat atcagcaggg gaattgattt gactgacaag taaccaagcg gtgcgatgac 2581 agccacaagc gtaccaatga gcaacacaat tttacgaccc cgccaatccg 
ccaagcgtcc 2641 taacaaaggg cgaaataaca acagtccgat ggcaaaacta cccatcacaa tcccagtttg 2701 cagctttgtt gcacccacag actcaacata cagtggtagc gttggcagca atgaagccat 2761 gctgcaccag aataacaaac ctgctgcaaa taaaatcagc aggttgcttt gcaattggat 2821 attgattgtg cgagacatat tcacagcaga tgcttattaa ttgtgttttt gtttaatctt 2881 ttaccagcaa atacaatgaa atacaatcat gactcgttct tcaaaaagta tcaagttcag 2941 tcatttacga attggttccc ctgtgtcaaa aaaagtactt gaatatttgc tgttgggttg 3001 tgtacgatgc aatctttgat tctgaactca agtataagtg taacatttat tattgtcttg 3061 gcgctgacaa acttatttca gatttcaggt atatagacca cgtttgagac agggtgtgag 3121 ggtgtggagg gcaatatgcc cacctcacaa acaccaagaa taacttgacc aaaactaggg 3181 tgtaggggag aaaattagga acattttttg tgatctccca ctcaattctt tgatttctta 3241 ccccccatac ccccacaccc ttataccctt acaccccttc ttgttgacac ctgtgcacct 3301 gacgaaatct aaagagactg cagaatctct tcatccagag aaaaatcaac ttctagttta 3361 tctcgcccag agacaagata atcattctca aaagagtctt tgagtccaga aatcaggtca 3421 tactcagggt gccagtttaa ttctgttttc gctttattca ccgaggcaaa gaagtgctgc 3481 acccgcaaag gaaaagcttt gcgcttaccg aaatcaaatt ttttcggttc gtaatggaca 3541 attttgattt catcgggaga tttgccagcc gcaacgatgc aggcgcgggc taaaccatca 3601 aaagtgacga agcgatcgcc agacacattg taaatttgtc tgagtgcaac aggattgccc 3661 agaactttgg acattgctgt tgctaagtct ttgacatgac cgaattgagt gaagtgcaag 3721 ccgttcccag ggataggaat gggacggtta cgcacaattc tgtcaaaaaa ccaggcttct 3781 aaatcgttat agttcagagg accataaatg taggtgggac gaattgaggt gaagggcaat 3841 ccctgttgtg ttaggtaagc ttccgtatca tatttaccta agtggcgact tttgggatcg 3901 atggtatccc cttctacatg aggcatttgg tctgatttga gataaactcc agcagaactc 3961 atgtacacaa aatgttctac gcggtcttga aaaatttctg ctaatggttg agtatcagta 4021 agttctcgtc cgttattgtc aaaaatggca tcaaaattta cagatgataa tttttctttt 4081 aactgggcag catcagtgcg atcgccccat atttgtccta cgccctcaac tggtgcagga 4141 cgatttccac gattaaatag tacaactgaa tgtccttgtt caaccaataa tttggttaaa 4201 taaacaccaa tgaaccgcgt gccacccata actaaaattc gcatactagt tcccgatctt 4261 aagttgcttg atgaagggac tatgaccgtt ttacggaatt cactcattcc aaattcaaaa 
4321 accttgaaca taagggcttt tgatattttg tgaaatggta tgcttatttc cgtcgcttag 4381 tactagagac tggtaagtag gggtgaggat tgcaaaaaaa atccgcgccc taagactcaa 4441 tacctagtat tttgtcccca tattgaggtt atcagtttgc ggtattctgc aggaacctcc 4501 taaaaaagga ttacgtcttt agctagaagt ggggcgtttc cgattccttt gggaatactg 4561 tttggtgtaa agcaacctct aactcccgcc ttgcgtgttc tctaccaact acagttagct 4621 gtgcccttga aatgtgccct cgttcatgaa tggttaacac cgaaagccac tggtggttca 4681 gaactcgttg tgcgagaaat tcttaaccac gtcgatgctg atttatacgc cctcatcgat 4741 tttgaatcca gtcactcgga aagttactta taccagcgga agattggcac aacgtttctt 4801 caacactttc cttttgcccg taatggtgta caaaaatatt tgcccttgtt gccactagca 4861 atagaacagc tagatttacg agcatatgat gtcattttat cttcatccca cgctgtagcg 4921 aaaggagttc taaccacttc cgaacagtta catatttgct actgtcatag ccccatgcgc 4981 tatgcttggg acttgacttt cgattatctt caacagagca agttgggaag tggtatacct 5041 gggtggatga cgcggtattt actgcatcgt ctgcgtcagt gggatgtgtt gagtgcgaat 5101 cgtgttgatt actttattgc taactcaaaa cacactgcga gtcgtatttg gcgttgctat 5161 cgaagagaag caacagtcat ttacccgcca gtgaatattg agagtcttcc tttcttgcct 5221 caaaaagaag atttttatct aaccgtttcc cggttggtga gttacaaaca agtatctttg 5281 ataatcaggg cttttaataa attgcaacta cctttagtag taattggaac aggaccagag 5341 atgagaaaaa tccggaggat agcacaatca aatatacaaa tactcgggtg gcaacccgat 5401 gatgtggtaa aaaaatatat ggctaatgcc aaaggttttg tctatgcagc tcatgaagat 5461 tttggcattg ctttagtgga agcgcaggct tgcggcactc cggtgattgc ctacggtgct 5521 ggtggtgctt tagaaaccgt gcgagatcta cgcgaacacg gcgagcaagg aactggtatc 5581 ttttttaaag agcaaacaga ggaggcatta gtagacgcaa tagaaaaatt tgaagtttat 5641 caaggaaaat tgaatcctga gtatgcgcga gtgcacgctg ctcaattttc cccgcaaatg 5701 tttgcacagc gctatcttga ctttctaaat aagtgtatac aaacaagacc atcacctgat 5761 tgatggtctt gttttacagg cttttggaaa tttgacccca gaagcacctc cttttagctt 5821 taagattggg actatgtgtg gtgtggattg ttaaggagta tgatgactgc ccagagctca 5881 ctcctctccg ggaagcgatc gcgtagtaag cccgggactg ggcgctgttt ttccagcagc 5941 gtccgctctt ctgaaggacg gacacaacta aatgcacctc ggttaacgtt gagcgggaga 6001 
tcgcccttta gggcactgcg ctctgcacaa ccgcatagag cgtctttacg ctcaactgca 6061 acaaatgtgg ctgccttatt gccctggaaa cgggcagatg ttgttgagac aaacacttca 6121 acaacataca gaggatcacg gcttttcttc aaacgcggtc aacagaaaaa gacacctaga 6181 gttcaaacga aaggtctgtc ttttcaggtt ttaaacggag agtttaccaa gcgactgttc 6241 gatattttct tttccctgtc ggtactaatt ctgttctccc ctgtgtactt aattttggct 6301 ttgctgattg ctttaagttc agaaggtcca attttttata tacaggaacg ggttgggaaa 6361 aattacaaac cttttaattg tattaaattc cgaacaatgg taaccaatgc cgacgaaatt 6421 ctcgtgcaaa tgatggaaac atctcctcat atgcggcaag aatttgaggc aaattttaaa 6481 ctcaagcacg acccgcgaat tacgaaaata ggtcgttttt tgcgaatgac tagcttggat 6541 gaatttcctc agttctggaa tgttttaaaa ggagacatga gtgtcgtcgg tccaagacct 6601 ctagttgcag aagagttacc taagtatggt tgtcacatag agcaaatttt aactatccaa 6661 ccaggaatta ctggattgtg gcaagtctcc ggacgtaatg atattcccta tcctcgacgg 6721 gttcaaatag acttacatta tgccaagttc agaaattttt ggttagattt gtggataatt 6781 tttaaaacca ttggtgttgt cattatgccc aaagataatg gagcatacta aacaattcaa 6841 aaccaaaagt atggcacttt gtctggggca caggctaacg ctctacgtac caagctatct 6901 cctttggaga cgctatgtgt atgcctacga taccttttgt gctgacactg tcaaaattga 6961 caagctaatg ctcggaagca ttgtagtcgt tttgttattg caaattaaat tcgcgacaat 7021 gatgtcgtga tgaaaaattc gtaattcgta attgaagttg tcattatgaa ttacgatttt 7081 ttcctgagac actgttctat aggggcaatt tgactgttga gttgagcctc aatagccact 7141 tcactaaatt ttctcttatt ttaagataat tttaaaataa atcagtgata tataaatttt 7201 gtgtaaagat aacgaaaaat gacagaatca ggtaaaacag tctgtatagg gtaagagaat 7261 ctagtcgtca actaatgttt cggcagattt acgcatgtgt ggatgcatct gcaaattaga 7321 acttgtaagt tcccggttgg aaaattaagc aaatacctgt taggttgtac cagggtcgcc 7381 tgtagtattc atctattcag tcacgacaag ggataagaaa gcatgacgca aaagaagcga 7441 gcgttgatta ctggtattac cggtcaagat ggttcatatc tgagtgagtt tttgctagag 7501 caaggatatg aagttcatgg tataatacgt cggacatcca ccttcaacac tgaccgcata 7561 gatcatatct acgaagaccc tcacaaagag ggagtgcggt tatttctaca ctatggcgac 7621 ttgacagatg gtaccacatt gcgccgcatt ctggaagaag ttcaaccagt agaaatttac 7681 aatctgggtg 
ctcaatcgca tgtacgggta agctttgatt ctcctgagta cacagtggat 7741 tcagtcggga tgggaacgct gcgtttgtta gaagcaattc gtgactacca gcatcgcaca 7801 ggtatccaag ttcggttcta ccaagcaggt tcttcggaaa tgtttggttt ggtacaagaa 7861 atcccacaga aggaaaccac accgttttat ccccgtagtc cttacgcttg tgctaaggtt 7921 tacgctcatt ggcaaacagt aaattaccgt gaatcttacg atatttttgc gtgtaatgga 7981 atacttttta accatgaatc accaagacga ggtgaaacgt ttgtcacccg caagattact 8041 atggcagttg ccagaatttt tgctggcaaa cagaaaaaac tttacatggg taatcttgat 8101 gccaagcgag actggggcta tgccaaggat tacgtcaagg caatgtggtt gatgcttcag 8161 caggagcagc ctgacgatta cgtcattgct acaggtgaaa ctcatacagt acgagagttt 8221 ctcgaactag cgtttggtta tgtcaacctt aactgggaag actatgtgga gtttgacaag 8281 cgctatctcc gtccagcaga ggtagacttg ttagttggtg attctaccaa ggcgcagcaa 8341 aagttgggtt ggaagctatc ggtaacattt gagcaactgg tagctatcat ggtagaagct 8401 gaccttcaag cattgggact gacttcacca aatggtaagg tagcaaaagt cctcaaggat 8461 aatgctatga ttcggcaaga attgggtgcg ctccacttgt gatctacacc gcgaggaaaa 8521 aaatatgaac gccttagaac tcaaggacaa aaggattctc gtgactggtg gagctggttt 8581 tttggggcgt caagtcatag accagctgtg taaggcagga gccactgaga ataaaattac 8641 ggtaccgcga tcgcatgact gcgatttacg catcctagaa aattgccaac gagcagtgga 8701 tcaacaagac attgttatcc acctagcagc tcacgtcggc ggtatcggtc tcaaccgtga 8761 gaaacctgct gaattattct atgacaactt gataatggga actcagttga ttcatgcatc 8821 ctatcaagcc agagtagaaa aatttgtctg cgttggtaca atctgcgctt atcccaaatt 8881 taccccagtc ccattcaaag aggatgacct gtgggatggg tacccagaag aaaccaacgc 8941 tccctacggg atagcaaaaa aagctctttt agtccaactg caagcttacc gccagcaata 9001 cgacttcaat ggtgtttacc tgctacctgt gaatctgtat ggaccagaag ataactttga 9061 tcctagaagt tctcacgtca ttccagcgtt gattcgcaaa gtttatgaag cacagataaa 9121 gggagaaaag aaactcccag tttggggtga cggtagtcct acccgcgagt ttttgtattc 9181 agaagacgcg gggcggggta ttgtgatggg gactcaattt tacaacgacg ctgaacccgt 9241 taacttggga acaggttatg aaatctccat ccgtgactta atcaatctca tctgtgaatt 9301 gatggagttt gacggcgaaa ttgtttggga aaccgacaaa cccaatggtc aaccgcgtcg 9361 ctgtttagat acagaacgag 
ctaaaaaagc ctttggtttt aatgctgaag tagacttcaa 9421 gcaagggttg aagaacacga ttgagtggtg gcgtcaaaac gctgcttaat cgttgtcaaa 9481 atactaatta attgtagggt gggcattgct caccttattt atttgttttt ggaactaggt 9541 tttcagatgg gaaaaacaga gttacgcaaa tcgctgctca aaacacgtca atctttatcc 9601 gtaacagact ggaaacataa gagtcagctt atctgccaaa accttttaaa ctctccccaa 9661 tttaaccaag caaaaacaat actcgcttat ttcagctttc gccaagaacc agatctgagc 9721 caattattta cagattcctc tcatcgttgg ggtttccctc gctgtgttgg tcagtcgctt 9781 gactggcata tttggacaca taaagacacc gtaataaccg gtatttatgg tatcacggaa 9841 cctcatcctg atgcaccaac tatagcttct gcggacgttg atttaatttt tgttccctgt 9901 gttgcttgcg actaccaagg atatcgcttg ggttatggcg ggggatatta tgaccgcatg 9961 ctgagttctc cggaatggat caacaagcca actataggta ttgtgtttga atttgcttat 10021 ttgcctgaga tccccattga tacttgggat aaaccattgc aaggtgtatg tacagaaata 10081 gctcttgttt atcaataaat tttttgcaga aatttgattg caaaaatccg aaccgtcttt 10141 tcacccctac cctctaatca gttcgtttac aatacaagag ttcacttttg actttcagtg 10201 ataggttaac caaaatgccg attaactggc gtgaacatgt tgtgagtagt tctgatgtac 10261 ttcgaggtaa gccaaggatt aagggaacgc gcattccagt cagcctgatt ttgggctatt 10321 tggctgctgg taagaccgat gatcagattc tccaagaatt tccagattta caaaaagagc 10381 agattttggc atgtctcgat tatgcgcgtg atttggcaaa ttttgagacg gttgcatcgt 10441 gagcttgacg tttttcatgg atcaatgcgt tccggcttcc attgggcagt ttttgcggga 10501 taacggttat gatgtgttga ttttgaaaga ctatattcca atcgagtctc ctgatgaaat 10561 tgtgattgcc aaagcacagg aattggatgc aattttagtt tcactgaatg gagattttgc 10621 tgacattgtc aggtatcttc ctggtaatta tcggggtatt atttcgatcc aacttcggaa 10681 tcatcctgaa atcattcctc tactgatgca gaggttggtg gattatttgt ctaaccacgg 10741 tgatatggaa cattatcaag gcaagttatt cattgttgaa gcaaatagaa ttcgcatacg 10801 agagtaacgc gatctgccgc tatgctgcag cgaagctatc gcgatcgcca cctcttgtct 10861 actttcaacc tacgttgtga acgatagctt cttcaatgca gcgcttgcat gtagtctcac 10921 ttgagcatct tcatctgcca acatctgagt taatggattt attgcctgct ttttgcctaa 10981 ctgtcctaaa gagagagcga tcgcactttt aatattgcca atttcaactc ccggatggtt 11041 ttgttgcaac 
atttccagca aaatttctgc tgctttgtca gttaacgttg tatcactaac 11101 tcgccctaaa acggtaacaa tttcctgcca taccgtgggg aactgcagtt gataaagtgc 11161 ttgttgcaaa tacttcaaac ccgataatgt ccctacccaa cttaaagcgc ggatggcttc 11221 cagttgcaac tgaattggta tattagggga caccaaaacc tgaaataatt gctgtgctgc 11281 agcttcacca cccattctag aaagggcagc tactgcggca caacaaacat cttgattaat 11341 gtcagaaagc ctgggttgca atcttgctac caaatttagt gcttcacgta agtcaggacg 11401 aaaacccaaa ccaacgactg cttctcgcct cactggggca gcaacatcat tcaaagcatt 11461 gataagcact ggtgggacgc ggtgatcgtg aaagctactc agtgcttcaa cagccacagc 11521 acggatagca acttgtgaat cttccactac gctcaacaaa ggtgcgattg tttctttttg 11581 ccggatgtaa gaaagcattt gagttgctaa tagccgtgtc tgttcttggg ctaaaagttc 11641 tgtcagggag gcaatagcaa gtgaacccat ttgccccaag gcagtggctg ctatcttcat 11701 aagttcatca ctttcattag tttttagcaa ttccaccaga ggcggaatgg catttggatt 11761 tttcaaatcc cccaaaatgc gtgctgcata agcacgcaac tcttcctctg cgtcttcatc 11821 ttttaatata tctatcaggg gagggatgac aatatttccc agacttgtca gcactttggc 11881 aatttcccag cgttgctgaa aatctccagt ttcgagaacc tgaagtgcta attccaagag 11941 atgttcttga tttttcagta tctctggatg ttctgagtcc tgttgttgag tcaactgttg 12001 taagcattga atcagtgatg accaattagc tgcattgtat gccgcctgcg cttgtaccag 12061 aagctgattg atgttgttca caatctgcaa attcccaaat agtcatgcca aattgatttt 12121 acttggcact taagttcaag ggctgctcaa gattgccgtt gacaagcgct ttacgtccct 12181 tgtgtgctta tttttatgca aaaaatgtat caatttgatg tcttttcctc aatatagaga 12241 aaatttagta accaaatata cgatcttgat cgctttgcaa tattaaggta aagaggaaca 12301 taaagggtgc acaaagagat ttttttaaca agtatgtaaa cactaaaaaa tacactttct 12361 ctactatata cctcacaaat ctagaaaaac aggatgttag aagctgaaaa attgatattg 12421 agcaacgact tttaaaactt ttctggaatt actaatttgc aaccgagttg agcttcatga 12481 cagacttagc aactaccacc acagccagct taaacaagtt tgagaaattc aaggcagaaa 12541 aagatggtct tgccgttaag gcagaaatag aaaagtttgc ctctcttggc tgggaggcga 12601 tggacgaaac tgaccgagat catcggctca agtgggtggg tgtattcttt cgcccagtca 12661 ctccaggcaa gtttatgatg cggatgcgga tacctaatgg tattctcaca agcgagcaaa 
12721 tgcgtgtgtt agctgaagtg attcaacgtt acggtgatga tggtaacgct gacattacta 12781 ccagacaaaa tatccaactg cgaggcatcc ggattgaaga tttaccagag atctttgaga 12841 aacttcgtgc agttgactta accagtgtgc agtcagggat ggataacgtc cgcaatatta 12901 caggcgatcc agtcgctggg ttagatgcgg atgagttgtt tgacacgcga gagttggtac 12961 aacaaattca agacatggtc accaaccgag gtgaaggcaa ttccgagttt agcaacctcc 13021 cacggaaatt taatattgcg attactggtg gacgggacaa ttcagttcac gccgaaatta 13081 acgatttagc tttcgttccc gcttttaagg aagcacaggg agcaagggga gcaggggaag 13141 aaattactcc ctcatcccca ctatttggct tcaatgtcct tgtcggtgga ttcttttctg 13201 ctaagcgctg tgaggcggct gtacccctga atgtttgggt tcctccagag gatgtcgtcg 13261 ctttgtgtag agccgttttg gaagtttttc gtgatcacgg tttacgcgca aatcggcaaa 13321 aagcccgcct gatgtggtta attgacgaat ggggaatcga aaagttccgt acagaagttg 13381 aacaacggtt tggtaagtcg ttgctgggcg cagcagcacg ggatgaaatt gattgggaaa 13441 aacgcgacca cgttggggta tataaacaaa aacaaccagg attgaactac gtagggttgc 13501 acgttccggt cggtcggttg tttgccgagg atatgtttga aatagcacgt ctggcagaag 13561 tttacggcag tggcgaaatc cgtctcacgg ttgaacaaaa cgtcattatc cccaatattg 13621 gtgattcact gctggaaact tttttaaccg aaccggtact tgacaaattt accattaacc 13681 ccactttgct gacgcgatcg ctcgtttctt gcacaggcgc acaattttgc aactttgccc 13741 tcattgaaac caaaaaccgc gctttagaaa ccataaaggc gttggaagca gagttagaac 13801 tcactcgtcc tgtgcgaatt cactggacag gatgcccaaa ctcctgcgga cagccacaag 13861 tcgcagacat tggcttaatg ggaactaaag ttcgcaaaaa tggtaaaact ctggaagggg 13921 ttgatattta catgggtggc aaagttggca aggaggcaca attaggaact tgtatcacta 13981 aaggtattcc ttgcgaagac ttgctaccag tcttgcaaga acttctgatt tcacattttg 14041 gcgcacgctt acgagaacca gccgcagttt aaagctacgc gtccaatttg cacatttatt 14101 ccttatggtc gttatttgtc cttagtcact cagtgttata aatgacctaa ggaataagga 14161 caggcttcac caaaaattca gccacttgaa tgtttttcag ggcaattgca ccattgcccc 14221 caaatttgtc catgtaattt ttctgttttc tatccaataa aacccaatgc tcaaaggatt 14281 gttttcactt aacggtcgtt atcgcattct gcaccaaact tggtttgcct tctttctgac 14341 ttttgtttgt tggtttaact ttgctccttt tgcgacaaca 
atcggcaagg agttacatct 14401 ggcaccagag caagttaaaa ctttgggcat ctgcaacctt gcgctaacaa tccccgcacg 14461 aatcatcatt gggatgcttc tagaccgttt tggtccgaga attacctatt caatgctact 14521 gatttttgct gctgttcctt gttttgcaac ggcgctatcg cagaattttg accatctcgt 14581 catcagtcgc ctactaatgg gaattgttgg cagtgggttt gtcgtgggta tccggatggt 14641 gtcggaatgg ttcccaccaa aagagattgg aattgcccaa gggatttatg gcggttgggg 14701 taactttggt gcttttgggg cagagtttgc cctacctaca attgcagttg ctgctagctt 14761 tctgagtggt ggcggttcta attggcggtt tgcgatcgcc ctaactggtg tcatcgccgc 14821 tatctatggc gtgatttatt acaacagtgt ccaagacaca cctgctggca aagtttatca 14881 aagacctaag aaaaatggtg ctctagaagt gaccagtgtt aaaagcttct gggcaatgat 14941 tctctcaaat ttcggtttga tttttgcctt gggtttattg gcttggcgtc tggaacaaaa 15001 gaacattcac tttttgactc aaagccaaat gtatctggct tggctactgt tagcaggatt 15061 atttgcttac caaacataca aagcttggca ggttaacaaa gaacttctca ctggcaacaa 15121 aacctacgct ccctcccaac gctatcaatt tagtcaagtc gctttactcg aattcactta 15181 cgtgactaac tttggttcgg aacttgctgc tgtttccatg ctacccgcgt tttttgaaaa 15241 aacctttggt ttagagcatg tggtcgctgg gatgattgct gctacttatc cctttttaaa 15301 cttaatttct cgtcccagtg gaggtttaat ttctgataag tttggctccc gtaaatggac 15361 gatgacaatt atctctgctg gcattggtgt tgcctatttg atggcacatt atattaacgg 15421 cagttgggcg cttccagttg caattgcagt cacgatgttt gccgcttatt tcgctcaagc 15481 tggctgcggt gcaacctacg gtattgtacc tctcatcaag aaagaagcta caggacaaat 15541 cgccggaaat gtgggagctt acggtaactt tgggggcgtt gtttatctga caattttcag 15601 cttaactgat gcaccaacgc tgtttaccac aatgggtgta gctgctctgg tctgcgcttt 15661 tatgtgtgcc ttcttcctga aagaaccaaa gggttccttt gctgctgctt atgaaggtga 15721 agcaccagaa acacaaactt cagtgaatca ccacagttct ggtattctag ctgagaaata 15781 agatgaaaga attgatatct tagaaaaatc tggcttaaga gtatttagaa ccgcagagaa 15841 acgcagagaa aataactgtg tttctctgtg gttttgttta ccaaaaatca tgttgttaag 15901 gaatttaata attttccata atttgcataa gaaaactcta tcaaataaat cactataaaa 15961 gaataatctg tatcagtaag aacatttttt ctgtattgat tcatatttaa ttgagtagaa 16021 atacttttcc gagattgtac 
cgaaaaaagg taaaccaaat taagcaaact tgagtcacta 16081 gaatctcaga ttttatcaat cagtattcgg agaattttct aagtgctgtc aactgaatag 16141 aaaaagtttg aaaacatagc attttgctgg aatgtatctt ttacaaacaa aacatttaaa 16201 caatttatca acatggtgca attaccatga ctgaatctac taaaaccgtg tgtccatact 16261 gtggtgttgg ctgtggactg gaagtttctc cgccagcaca attaggcaaa gcgactcatc 16321 gagatagcca cggaaatccg atttggcggg tgcgaggcga caaagcgcac ccttctagtc 16381 agggtatgat ttgtgttaaa ggtgccacaa tcgcggagtc tttagacaaa aacagactgc 16441 attacccaat ggtacgagat tccttagatc aggagtttcg gcgcgttagt tgggatgaag 16501 cttttgatat tatcgttaat cgcattcaaa cagttcgtct caccaccgga tcagaagcta 16561 tatgtatgta tggttctggt cagtttcaaa ctgaagacta ctacacggct cagaaactct 16621 taaaagggtg tctggggagc aataattttg atgctaattc gcgcttatgt atgtccagtg 16681 ctgtggctgg gtacatacaa agctttggtt cggatggtcc gccgtgttgt tatgaagatt 16741 tggagttaac ggactgtgcg tttttaattg ggacaaatac ggctgaatgt cacccaatta 16801 ttttcaacag actagaaaaa taccgcaaaa agaaccgcaa agtcaaaatg attgtggttg 16861 atccccgtcg cacaccaact gcagaagccg ctgatctaca tttagctatt cgtcccggta 16921 cagatatcga cttgttgaat ggtatcgctc acttgttgat gcgctggaac tacatagata 16981 ccgggtttat tgacgactgc accagcaact ttcccgcata tgctgaggtg attcgccact 17041 attctccaga tgtggtggct cgtcaatgtg gaatcagtgt tgaagattta gaaacagcag 17101 cacgctactg gggtcaatct aatcgggtgt tgtccctgtg gtcgatgggt gtgaatcaat 17161 cctcggaagg gacggctaag gtcagaacta tcattaatct gcacctgatg actggacaaa 17221 ttggcaagcc aggagcagga cctttctctc tcacaggtca gccaaacgcg atgggaggac 17281 gagaagcagg aggtttatcc catttgttac ctggttatcg aacggtgaaa aatgctcagc 17341 accgagcaga ggttgaggag ttttggggac tcaagccagg acaaatttca gcgactcctg 17401 gtatgacggc ttgggacatg attactgggc tggaaaatgg cagtgtgggg ttactgtggg 17461 ttgctgctac taatcctgct gtgagtatgc cggatttgga gcgaactaag aaggctttgt 17521 tgcgatcgcc cttcaccatc taccaagacg cttactaccc aacagaaacc gccgcttacg 17581 ctcacgtcct gcttccagct gcacagtgga gtgagaaaac tggcgtgatg accaattccg 17641 aacgcatggt cacgctgggt ccagcattcc gccaaccgcc tggtgaagcg aaagcagatt 17701 
gggaaatttt tgcagaagtt ggtcgtaggt taggttacca caaggagttt gcctttgcta 17761 actcggctgt tgtctacgct gagtttgtca aactcacagg cgatcgcccc tgtgatatga 17821 caggtattag tcacgatcaa ttacgcgaaa gtccaataca atggccccac ccggaacaga 17881 ggggcacaca agaacttgaa tctagactct cccccttcgg gtatgcctgc ggcacggctt 17941 cgccgaacgc agtcgcctgc ggagggaaac cctcccgcag cgctgtctca cctggttcct 18001 tgcaagggaa gcgtctctac accgacctcc gcttccatac tcctgatgga cgcgcgcgct 18061 ttggggcgta tcactcgcga ggcttggcag aaccaccaga cccgaattat cctttagtct 18121 tgacaactgg gcgactttat ggtcattggc acacccaaac ccgcactggt cgaattgaaa 18181 aaactcgcca aatgcatcct gaacccttta ttgagattca tccccgtgat gcagcacaga 18241 taggcattac agataaccag ttggtggaag tgcgatcgcg tcggggaaaa gcgcggttcc 18301 cagcaaaagt cacaaaggcg atcgcccctg gtacagtgtt tgtccccatg cactggggtg 18361 cgctttgggc agataatgct gaagccaacg ccctgactca tccagaatct tgtcccgact 18421 cactgcaacc agaattaaaa gcctgtgcag ttcagctagt gccgattaat gttgatattg 18481 cgttgcgtaa ctttcaaaaa acacgagccg aaacccttat aaattaggga ttggagtgaa 18541 gtctttcttc cattcatagc gggtggcgca ccgtccgtga agtgctttga tgaaacacat 18601 tttcaaaccc tcatatttca aggcttttag aaagttccga aaaatgtgtt tctttcatct 18661 ttcactggcg cttgcgagcg ctagttgggc actcacatct tgcaacggaa aaatgaaatg 18721 ttatttaagc actaagtatt aagcttggag tgcctgattt actacaaata tctgcataat 18781 aaaatgtatc tttatttcgc agctttttac tgaaacgtgg tttacacgta taggtgcaag 18841 atatcaatcg atcaatacta tattatgcat tgaaagaatc attatcaact gtcactctga 18901 aaatggtaag ctgctaaaga tatacgacag tatgtttggc tataagaaca attcgtcaat 18961 agttaatagt gtttaaaatt cttgactatt gtttcttgat ggtctcctag ctttagctat 19021 gcaaaagaaa aaagtgccac ttaaacagaa agaacaaatc gacaaaacat cattttccca 19081 tcctagaaat ttctctgggt ggtttttatt ttttcttttt gctgttgact ttttacttca 19141 aaggggggtg cagcagatgc gaaagctatt caagcgaatt gcagaaaaca ccaatgatca 19201 aaagtttatg cacttcattg aaaacataca agtgctagta tccaagttgc tatctctttt 19261 tatggtggta gtgattgtgg cagcaattgt tgacctgggg ttttttcttt ttaaagattt 19321 attttataca cctcatggtc agtttaacac aacattattt gaaatatttg 
gtttattcct 19381 caatatctta attgctttag aaatattaga aaatatcacg ggttatctga aaaaacacgt 19441 cttgcaagta gaattagtta ttgtaacttc cttaattgct gtcgctagaa aaatcattat 19501 tcttgactta aaaaaagtat caggtattga tattatcggt ctaggaattg cggttcttgc 19561 attatctatc agttatttaa taattcggtt cagtcataga caaaaggttt aaagcaaagt 19621 aggatattat ggtggttttt tttgagtttg aagctgattt tgttgattcc ctgcgttgta 19681 ttcctatgca ggtgcgccaa aaacttgata tttctggcat caagttaaag ttatccgatt 19741 ggagtcattt aacgaaggat gagcgtgaag ctttggttga attaccttgc tctacagaat 19801 ctgaaattca aacatacaag gaatatctcc aaaacctgat tttacaacgc actggcacac 19861 caccggctga gttaccaatc gaaccacatc cagcatggtt agatgctaac actttaccaa 19921 ctaaccttca ggagaaagca agagaatttg gtgttacgct ttcgcctcaa cagtgggcag 19981 aattaacttc attacaacgt tttgccttga ttaaacttag ccgtccagga catgaaaatc 20041 aaaactttcc caaagctttg aaggaatttc atttgctttg atgagtcatt agtcattaat 20101 cactcacctg aggaactttg aaactttgag aatatgttac attttttgaa atgagcgtgt 20161 ttaggtagcg cctgacactg caaaaaattt tacgattaat aggagcctca aatgctggag 20221 gttgattttg ggttgcgctc cgccccattt aacctacgac aaatcgaatt ccgtacccct 20281 ttgaccgtga tgtgtattaa gacaagcaaa cataaattaa tattgactaa tgaacggcag 20341 gtgctcatgg gggaaaccac gccaggtgct tcaacggggg gaacccccgc aacgcactgg 20401 ctccccaaga ccgcactgcc tcctaatgac taaaaaagaa aatggctact aaaaaaattt 20461 tgatgctcgt aggagactat gtagaagact acgaggtgat ggttcccttc caagctttgc 20521 aaatggtagg acataccgtt catgcagttt gtccagacaa aaaagctgga gaaaaagtta 20581 gaactgctgt tcacgatttt gaaggtgacc aaacttacag tgaaaaaccc ggtcacaatt 20641 ttactcttaa tgcttcattt gcagaagtta aagcggaaaa ctatgatgcg ttaattattc 20701 ccggaggacg tgcacctgaa tatatccgcc ttaataaaaa ggtgctataa atcacccgcc 20761 acttcgccca agcaaacaag ccgattgctg ctatttgcca cggcttgcag ttgttggctg 20821 ctgctgatgt cctgcaaggc aaaaactgca ctgcttaccc tgcttgtagc acagaggtgt 20881 ggcgtgctgg tggtacttac gtggatgttc cagctgatga ggttgtcgtt gatggtaatt 20941 tagtcacagc accagcttgg cctgctcacc cccgttggtt ggcagaattc cttaagatac 21001 taggtactaa aattgaacat ctagaaatgg 
ctgctgttta ggggtgtggt caacagggaa 21061 ttgaacacgg cgatagattc gcttgcctta ctttgcaaag cgcacgctac gtgcttcggc 21121 agatcgctca gttgtactgc aatgagctat cgtcacaggt ggtagttccg ggattggtcg 21181 agcgaccgca gttgcctttg ctcaagcggg ggcctcggtt gttgtagcag cgcggcgtgc 21241 gggggagggg gaagaaaccg ttcgtctcgc caaagatgca ggtagtgaag ctctgcctac 21301 gaagccgcta aagcaggagt cattgggtta acgcgctcgg cagcgataga atatgcagga 21361 cagggaattc ggatcaatgc tgttagtcca gggatgattg caacggatat cctatccaat 21421 gttccagaag atatggttca acaagtgagt aacgcagttc cacttaaaag actcggggaa 21481 gcagcagaaa ttgcaaatgc agttatttgg ttatgttctg atgctgcgtc ctatatcaca 21541 gggcacaact tagtcattga tggcggtttt actgtccggt agaaagcagc ataattcttt 21601 taaaaaaggg aagtcagaca aataggcttt tgatggtagc agagaattta gagatagtgc 21661 tggctgttaa gggttaactg tcaacagtca tactgaacaa gacaatgttt attttgattt 21721 ttttaatttc tcattgataa gcagtgactg aagctatagt ggatttagca attttgctgt 21781 tgtggctagg gttaagtcct actgcccaat ttttttcaga aatactttag aaacaccctg 21841 caactcaatg acaggtaaaa atgcaagaca gaatgcaatt ctgcggcaag tgtctggtat 21901 tggaacttct tttggatcgg aacctcgcta ctcgctgctt ccccgtcaag aggtagtcat 21961 tggacgtgat cccagctgcc aagttgtgtt agatgccatg ttataccgca tggtatctcg 22021 tcgccatgca gcggttcgtc ctctagcttc atctccagat ggggaatcta gttggttaat 22081 ctgtgattta gctagtgcca atggcaccta cttgaacgga caacggttgc agggttgtca 22141 gcaattgatg aacggagata gcattactct gggtcacgat ggtccagaat ttgtttttga 22201 gtgtgaacac aaccaccaac aagccactgc ggtcgcccca ccagcagcta caccgcttcc 22261 cccagcaaat agttacccaa caccaacctc gacgtattat caatctcccc caaaaccacc 22321 agatgcgctt agcttcactc agctgtttcc gattatttct actggcaaag atttaactcg 22381 taaagcttac ttgataccgg gagttctcac agtcgtcttc gtggtactta tgtttgccac 22441 agtaggtcaa ccacaagcca atcaactcat agtcgcaatt tatatagcct ctgctgctta 22501 ctattttatt taccagttgt gtggtaaacc aaagccttgg tgggtgctgt gtgcatcggc 22561 attgacaacg gcagtgattt tacggactcc cgttttggat ttgtttatct ttgtgtttcg 22621 cggtttgtta cctggtagtt tgccctcaga acaggagtcc attactttaa cagagttgct 22681 ggtgcggatg 
ttttttggcg ctggcttgat ggaggaatta ctcaaggcaa tacctatact 22741 attggcattt ctaattgcta gggcagtccc gtcaccgtgg cgggaacgca tcggtatcgc 22801 cgaacctctt gatggtattc tgctgggaac agcttctgct gtgggcttta ccctgttgga 22861 aactttggga caatatgtgc caactattac tcaaaatatc tcagcacaat cggggttaga 22921 aagtggtcag cttgtgggat tgcaactgct gattccacga attcttggtt ctgttgccgg 22981 acacatggct tacagtgggt atttcgggta ttttatcggt ttggctgtac tcaaaccctc 23041 taaaagttgg caaattttgc tagttggtta tctgacagca gccggactac atgctttatg 23101 gaatacaacg ggagttttta acggtttgct gttggtgatt gttggggttt tgtcttatgc 23161 ctttctgatg gcagctatcc tcaaagcccg cgtgctttca ccaacgcgat cgcaaaactt 23221 tgctactcgt tttcttgaac ctaagtaatg agtgcaacaa gatgatcaca gagatcgcgg 23281 atgattgatt tatagttaac tcttcagtct acaagcggct actgttgctg cactcaactt 23341 gctttttcta tctttagata gatgtgcttg gcttggaact aactacaaac atattaagtc 23401 agacattttg tgtctcaatg agaaaaaccc tgcattctta gataaacaac aaaattatcg 23461 cttgtaggca aagcaagttc tcctcgctag ttggatcaaa gcagcaaaca gggaacaaat 23521 cttaaacttc ttcttcaata ttcttgagtt catcaagcgc atggcaagca actgctaaac 23581 tcacttttgc cttttcaatt tcagataagt ttgtccactc acgattgcct aagttatcta 23641 caactgagtt tacatttttc tgttcaccgc ttctcattgc gtaagcgaaa ggacgcgcca 23701 taaccaaaag atcatctgtt tcccaattag gtaatacaga tttgacacaa cttaccagtt 23761 gctcgccctt agattgtcct gaatggaaaa tttccgccgc tttctgggca atttccgtga 23821 tccaatcttg aggaacactg gcggcgaagc ctgcttgtaa atttttttga agattcgcca 23881 agacagtcga ttttgtatta catttgtaat gctgggaatt cagattattg taccaaagcg 23941 tatccaagct gttgtagaaa gcttgggaac gttctgccag ttcttcatca aggacatcgt 24001 ctaactggaa ctgttgttct agttgtgcaa aatagttttc tgattcttca tctgcgggat 24061 tccaaggata tgtcgcatct tcgggttcaa gcagtgcttc taaaagctcg aaattaactt 24121 gagactgttt gtaaggagaa gtttttgatc cgttaatttt cttagccatt tgtcgctcgc 24181 tccgggttaa ttatttagtg tacatacaca gctacaactg ccaaaatctc atttccgcaa 24241 aaagaacgcc gttctaccaa aaagcactgg tttatttagc taaatttaga tggtagaaaa 24301 tcagtgttcc caattcttac cctggggaat tatcaatgat gaactcccaa ggttgacttt 
24361 tcgtactacc aaactttaac tgcatgccag attgtaacgg cacttcctga cgatggattt 24421 gttgccaacc atcagatcga aaaacccagg ttgtcccgta ggtggacaaa tcttgcaggt 24481 aataggttcg cactgctgtt gctcctgtca gactatttcg acataatatc tcagcatggc 24541 gttttgaaac cgaaggttct ggaatgacaa tatcattgtc tccggtgcgc ccgatgcggg 24601 tgattcctgg tcgcaataac gaagttttac ctccagcgga gagcgatcgc aaacaagcat 24661 tggaaaccaa agaagctatt cctggaatag aagtttctgg gagttctgtg atcccctcat 24721 caaaattttt tatgagttct ccctcgtaat ctggatggag gtagagaatc ggcaatgtcc 24781 aagcgggttg attgaacctg taaactatta atagctgttg ccttgcttct gccacggctt 24841 catcaattgg cagtcgcttt cgcaaagcag cagcaaaagc ttgaataaaa gtctggcttt 24901 cttgggatgc tatttcatca cgcatcgcca aaactgcagg caaaccttga cgtatcagca 24961 cttctgctag actgctgtgc gacacagcct ggtgattagc agccgcaggt tgtgctcccc 25021 aacaagcatt gaaaactgcc agtttcacac ctgtactcgt caatacttgg gctaattcca 25081 ttccattgag ggtcataccc ggtcgcaaaa ataacaaacc tccgtctggt ccttgcaaac 25141 cgtgaccggc gtagaacaag acgttgtatg ctttcgtttc tagctgttga attaactcct 25201 gcggggttgg ttgcaggagt gtattcacca tacagggtgc ataaccattg taattgctac 25261 ccacaacaga cccattggag agtatttctt ctaggctagc ggcttctttt ttaagttcta 25321 gatgcttttc atcttcaccc aaaaccaaca gtatatttaa cgcttgttct gagcgtaaat 25381 acggtagagg ttcgacttca ctagtggtac gactaaaaag tatgtgttgc gacagagaaa 25441 tggcagattg accgggttcg cgctgcatga tttcccaagg cagagcaatc agatccggat 25501 cacgaatttc cagtcgcatg cgcaaacgcg tacgctcacc catagcaatg ccacgactgc 25561 gttcaaagct attgagaatt gctccctcaa agacccaacc ccacaagctt attcccaaat 25621 attgcatcaa acgagcagtg taactggtgg gttgagtgga agttggggca acaaaatcca 25681 gagggagagg atttgcctct tgggaggttg ctcctgggga aatatctaag cggctgtgac 25741 cggcaaacat ttgctgccac tcaagccatg cttgagttag atgtggaggc catacacaat 25801 cacgcagaac atagccactg ggatagggag cattcaccac ccaaatggcg aaactgtcat 25861 tgccggtatt gatgagacgg gcgatcgcca ggttaaggga tggcatggat tcgttagtct 25921 tggttaatct atagaatcaa ataatttcaa ataacaattt taaaaggcaa aaatttttac 25981 ctttcgagtt tttgatttaa agttatatca tctatactat 
ttcaggcttc taccagttta 26041 tcgagttttt gacgaagtgc tgctcttcgg ctaacaggtg tcacctattt taaccagtta 26101 tctgttatca gtttttgtta acaactctta agagttgacc ctccgcgaga tttcattgct 26161 gtggcacaga ctcagttgtt gattctccgt caggaggtgt ggaagtgtct tttgaggact 26221 ggcatcgact caccactttt ggttgagctt tggcaagatt tgaggataaa attctgactt 26281 tatctccttg cttgagtaaa cgttttttgg ctgtggaaga agttgattgt gagtttgatg 26341 gacttgcttg cccgatagag cagacttgca ggtgaatcca agtgtccttc tctgttttct 26401 gttgcttggg aagaatttgt aaaacactac catttggagc gatcgcacca aatgggggcg 26461 aagtcgttgt tatgaattct atttcgctgt tagtttgaag aatcactcct ggtgcattca 26521 cactatcttt gctcgaatta cccgaactgt cctgagaacc acttgtgctc ggtaacggta 26581 tagatggtga ggagttaaat aatttcgtcg ggatagaaag cgatccttga cgccacaggt 26641 atgccaacaa aacgctaaca acaaaaaata gcagcggaac aacaatcacc tgtagcggta 26701 cttttagact tttggcgggc ttgcttgttg gcatgacctg agttttctga tttctaccca 26761 caggtgatgc aagtagtggc gttttgatcg cagtgttaga agctattggt aatgtagaaa 26821 ggttagcaag ggaagtgtga agggttgatt ttggttctga gtatttaact tggcagtgta 26881 ctagcgctat cgtgacatta tcatgtccat tttgaatatt cgctatttcc actaatttat 26941 ctgtgacttt tgctatgtca aaattttcag taataatggg taaaatttct gtttcccagt 27001 attgatctac gcggtcaaag tcactcaaac cgtcagaaca cagcagaaaa acgctatctt 27061 cgtcgagaat aaatcgttgt gacgtcggat gcaataaatg actggcactc atgcctaatg 27121 cctgcaccaa agatcctgat gctccctgtt ggactgcatc tcggtatgtg gcatagccta 27181 agcgtacctc acgagaagca acatcgtcat ctagggtcac ttgatagcag ccagaacgtg 27241 taatccaata agcacggcta tcaccaacgt gggcaatata catttgatga gcaatgggca 27301 atgccatcac tagggttgtg cccatgcgtt gacgcccttg ccgatgttcg tcgtcgttgc 27361 gctggctgat gatgtcatta gcagcagcgg ttgccccttc taactgcgca agtacaatag 27421 atgagtctat atcctcagaa ggcaatttcg tcagctcatg caggtgctgt tgaatcgttt 27481 caatagctaa aagtgatgca acattgcctc cctcgtgtcc accaatgcca tcacagacaa 27541 tagctaaacc tgtttgtgat ggtggcttgc tgacaatact gccagcagca gggtaacaag 27601 catcttcatt acgttggcga caaggtccgg tgtcagtctt ggtgacaatt ttgataatag 27661 gtgtttgggt ttgccctaat 
tgtgttaacc ctctgtctat aatggcaacg agttgctcac 27721 cagagtgtat ttctccggca atcagagaac gacacatttc attcaaaaat tcggcgatcg 27781 cccctttggc ttgtggctgt aactgctgcc aaaattcccc aagttggggt aaaccagggg 27841 aagtttgatt gtcctgacgc aattccaata agcgtactaa cgacccttct accctgatta 27901 actctgggtt gagtaagcta gaggctacgc cttcacttgc cagaggttgc cacagatgag 27961 ctatctgcca cagcaaattc agttgccgta ttgatgttgc gttatgccaa gctgtggtta 28021 actcgcaagc aatctggact tccaaggatt ctgtactgtc tagtacagaa agcggaggtt 28081 tttctaaaag taatatttct ttgtcagaag tattgccagt catcggcagt attccataca 28141 cttgcggtac gtgcaagcga tagggaaata gccgcaagta aggttttatt gcctgtacat 28201 gttctccatt tggcgtttgg ggtaacaaac cgggcttggt atccaaaaga agtgccttgc 28261 tgataaccaa atagcgatcg gctaataatt ctccggcatt agccaagctt tccccctctg 28321 ccacagccca gaggaatcgt ttaggaaggg gtgtggagca tcttaggcaa aatttgtgtg 28381 taagcgggtt tggagcttga caaagttcat ttgggcagta cagcgttgcc gcgtcatttt 28441 ccatagtctt cgcaccgatc agcagattgg catttttttc aacagcattg gcagcggcac 28501 tgttgaccgc tcatatctga tatatccaat tttggatcac gattttaact tgcgaacgaa 28561 acttaagatg ataaaacctt tcaaaaattt gcttaatgat aaaaatagat atttttcagt 28621 tggtacgcaa tgcaacccac aagacttact cacagacact ttaactgaat caggcataaa 28681 gccatttctc ttgtttattt atcaacatac agcattccca aattggtcgc gagcgccaaa 28741 accaatcata aaggttattc ggagcgagaa ttttctggtt cacaaacaat catataattg 28801 ctgtagcagg ctgaaaacca ctctccgatg agaaaagttg gtcatgtgag ttaggtctga 28861 gtcttagtag gatgattgca cctctgtctt aaaagaacag atgttggcac aagagcgtac 28921 ttcacgaaaa tttcatacct ttacccatat aactttgtga aagcttcatc acctcccctc 28981 ttcaggattt tctttataga aagctgtatt ttcaaccaca agtttttttt ctttgctttt 29041 ttttaatgat gtatctcaag gtaggtgtca agtgttaagt aactttattt tacactttac 29101 acccttctga aatgaaatta ttccatcaga aagaggagat aagtgagctt tttccctaat 29161 ttatatgtag gtggtaatca attaacttta gttaagacta agagcaccag aggctcttgt 29221 tagttcctat tgtgtcttca caatgacttt tcttctgggt tattgatcac atcaccattg 29281 ccaataacaa ttccacacac ttaagaattg gtgaaatccg aatggaagat aacaaagaaa 29341 
aagaaattca tcgtgcagtg aaccctggtg acgtcatttc tgaagaacca caaacagtgg 29401 aagaaaaggc acaacaactt gccgttgatt ctccagatat cacaggtgat cacattcaag 29461 ttccgactta ttttgtcgtt aaagagccaa atggtgaaga aaaggctctc catcacgtga 29521 aagatgcaga agaaatctct gatgtgattc gccaagcacg agtagacgaa gaaggaaata 29581 gagtttggtg atagatatta aacggtcagc agtcagcaat tagctttcag gtagctttct 29641 gctgccgttt tttattaaga aaatatatga atattattca gttgaatggt tataaatcgt 29701 cttttctttt tttctttgtt tctttttttc tgagttggtg agtcaggagt tgctgagttg 29761 gtgagttcta ttttcaagtc agggtggaca ttgcaatgcc caacctacac aatatgtcta 29821 tttccggaaa ccccacagct atgaaaacat cgtcttaggc tcgttttcat agctgtggaa 29881 tgaattacac tcaattgttg agttcacaaa atttaactta cggacactaa gctatttaag 29941 cgagaatgca atgagtcatg agcaatgagc tttcgggtat gcgcgaagcg cacgcccgga 30001 aggcattagt gtaagtctgg tggtggagga gatacgcgcc tacgcgcccg agagcggcaa 30061 accctcctgg ggcttatgct tactcatgac taagacgctg agcgagcgta aattgagaaa 30121 aacggcgtac agcagagtag cttacaattt tggaagtttg ttctcactca gtactcattt 30181 ttcactttta catcacacgc cattagcttc tgtattttcc tcagatgcaa cttggggtag 30241 ccccatgctg taattagtga cgcgtgcgga aagattgtag ctgacttcgc ctttttgcaa 30301 gagcgatcgc agaaaatgct ccaagtctga gccaatgcgg cgtaaattat aatctatcag 30361 atccgcccca ctagatgctt ctaccagttc atcaaacttg cgataaacct tttgcagagc 30421 atcttcattc cagttgaact cgttgtctgg gtcaacatct aaggttaaaa cctgactact 30481 gggaacgagt tctccgtcct ggtctatttc gccggcaaaa atccttatgt gccgggttgt 30541 ggacttaagc agcatcgggt tatccattgg cggttttatt taggggttaa tcgcgctctc 30601 aatcattgta gacggaattc ttagcgagaa gaaatcatgc ctgtctgacg agcataagca 30661 aaaatacctc ctgcgtcaat cacgggtccc acttctccta aaggtttgag gttgtatatc 30721 tgaccaagag tatggttaat caattgattg ctttcaaaat ctattgatac ttcatgacca 30781 gtctcaaact tgtcacataa tctttcttgt gattcccaag gataaagttc ccctgtagca 30841 gcacaattac ggaaaaagat gcgggcataa gactcggcaa tcactgcttt cacgccgctt 30901 gcacccaagg ctatgggggc atgttcgcgt gaggaaccac agccaaagtt ttcccctgcc 30961 acaatgatgg ggtaagttgt tttcatttcg cctggaggga caaatttgcc 
gtaacggtct 31021 ggtaaaccag ctaaagcata acttccaagc ttttcgtatt catctggttt ggaaggaact 31081 aaggtgagat attcggctgg aataatttgg tctgtgtcta tgttgtcgtc cagaacaaaa 31141 atcttacccc taattacttt ggtcatttcg gtctcctacg tttttaaatg aataaattgt 31201 acatttagta attttacgga attatggtac agcaacgtct ggctatcatt atatagtaaa 31261 taatcagcgg attcacttac gtgttaatac ctgttttaca gtaacaactc aaactcttat 31321 tgacttctag ataaagttga tgtaaaaagc tttaaaaagc ttggacaagt tcttacagaa 31381 actttattgc atggaatcct aaaaactgaa taatagttat ggtttacttg aaccatagcg 31441 attgcagaaa actgacaaac cggagagtga aaatcaaacc tgaagataaa ttaggagagt 31501 aaaatgcaat gacattgttt cttgacaaac aagcattgtt aattccgaca ctttgaaagt 31561 caaaaaagta taaagaattc ttgcttgaaa aatagcggat cacccaaatt acctcttagt 31621 acaaggaagt ctaaaagcca tgaactttga agcattacca cggcaagtaa atagtgtcga 31681 tgtaggtgtt tatgagtgcg aaatccatct gaaattcagg ttgatagaag aaaaaagtct 31741 gttgagcgat cgcgaccaac ttttgcaggt tctgctagac gctttaaccg agggttcaga 31801 cgacttttta gagactttac aagcaaatgt caaggctcaa gaagtttctg agttcaaagc 31861 atcaccgcaa atgagacgcc aactgatgcg cttacgcaac tctgctgaag ctagttaata 31921 cttaagtttg tgtcacccta ctaatggcag attgtgcatt aaattttccc atttactgat 31981 tgttctaata atgacttggc ttgcttaacc atcccccact aaccatagcg ctggggctta 32041 tagtagccct cacccttagc actacggaaa caatgggtgc ctactctaat tttgtattca 32101 tgctttgctt ctataacagg gttgtggtta gtaaaaatta gatttcagtt tcattcctct 32161 atccccgtta ttcctccaag aatcaaagga gtttgaagaa aacttaatat ttttcctcta 32221 aggaaaggaa aataagcaaa aacagcgaaa ccctaacaaa tcagtgttgt taggagattt 32281 tgctttttct caatgagtta agtaggatct gttatactta aactcaaacg tgttataatt 32341 aatttgcctt ggtataagtg ggaaaaattc ggcgttaaag ttgtcataga tcaagatcgt 32401 tgagttaagg attgtttttg gagatttcag atgtgtcttg acgttgccct atggctcttg 32461 ccacgcctga cggctaacac cagtcgcctc tgtcgggaaa gccgtcattg gcgcccttat 32521 tcaccaaggg cgtcctgact cacatgcaac gtcaagacaa tagctttgtt tgtgctttat 32581 aaataacggc agaagtgctc accgcttaat gggaagatgt agcgactgaa gcgatttgag 32641 tcactgtgct agaatttcag atataagtcg 
ctttacccca aaaattgggg atatcagaag 32701 caaaatcttc tggtagctct gcgactgttc agagcgcaga gcctcaagat tattttgcaa 32761 gaccgtgttc taaagcaaaa cgaactaact ctgtacggct attagtacca gttttactga 32821 acaaacggct aacatacttt tctacatttc ggacactcgt ttctagacgg cgagcaattt 32881 ctttattcat caacccctca gcgactaagt ttaaaacact ttgttctctg ggggttaaat 32941 caatagtaaa aggtgctgga gattgagcta ttgcatttct ttgagttaac aaagctttta 33001 tttgagcaat ttgattagcc aactcggtaa tatctggcgt atcaccttct tctgttggag 33061 ttgtagttct ggcggtgcgg cgttccagta aattttccac aattgccaca agttcatctg 33121 gatcaaaggg cttgggtaga taggcgtcaa caccagcttg ataaccttgg atgcgatcag 33181 tcgtcatacc cttagcagtg agaaatatca ctggtagcga tcggaagcga ggatcgtccc 33241 gcagttgttt gaggaactgg taaccatcca cctgaggcat cataatatca gaaatgacta 33301 agtctggtgt attttgctgc atcatctccc agccctgtcg ggcgttactg gcaacttgaa 33361 cgctaaaacc gctctcttgc aaatagtctt ttacggcttc acgcaatcct ggttcatcat 33421 ctaccagtaa caattgtgct gccatttacc gtttcctttg cgctactttt ctccaattta 33481 gcgaacttga gtcatactgg gctactggca agatggagag att // LOCUS NODE_805_length_33471_cov_4.85836133471 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 33471) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 33471) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
            Information about the Pipeline can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..33471
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            366..932
                     /locus_tag="DP116_06870"
     CDS             366..932
                     /locus_tag="DP116_06870"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009459404.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_06870" /translation="MNIKIVTATVLFTFFGLTAQAFALNEQDLELLRTTNTCPRCDLS GADLSQARLSGANLREANLKGANLSQANLTNADLTGANLETAVLTSANLSNASLTGAN LKLASLDNTNLTSAGFIGANLEAANLTGAKRQYTNFRGANFRLTTMPTGTVTSDKNYG WSLQRPAQQVECDKFKNQKVPGTTCRGE" gene 1263..2444 /locus_tag="DP116_06875" CDS 1263..2444 /locus_tag="DP116_06875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877036.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="PRJNA477356:DP116_06875" /translation="MLTHIKDLATKLSPRLIEIRRHIHSHPELSGQEQQTSAFVSGVL SSSGLHVQEGVGKTGVIAELQGTGKDERLLAIRTDMDALPITERTCLEFASRSQGVMH ACGHDVHTTVGLGTAMILSQMAEELPGGIRFLFQPAEEIAQGASWMVNDGAMKQVSNI LSLHVFPSIPAGSIGVRYGALTAAADTIEIIIIGESGHGARPHEAVDAIWIASQVVTT LQQAISRTQNPLRPVVLSIGQINGGRAPNIIADKVQLLGTVRSLHPETRANLPNWIEK IVANVCDSYGAKYQVNYSQGVPSVQNDYFLTQLTQAAAEEAWGSDRVQVLPEPSLGAE DFSVYLEHAPGSMFRLGVGFKDRILNYPLHHPLFDVDESAIITGVVTMAYTAYKYWHQ N" gene complement(2563..3612) /locus_tag="DP116_06880" CDS complement(2563..3612) /locus_tag="DP116_06880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318822.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA nickase" /protein_id="PRJNA477356:DP116_06880" /translation="MVATLDDTKRSAIAIKLADLKALQQLLIENEEQLIKQVSDQEIA DRLRNFLEDDKKNLGVLETVIGQYGIQAEPKKTVTELIEKVRKLQQGSELSLYEKVFQ HELLKHQQVMTGLTVHKAAQKVGADVMLALGPLNTINFENRAHQEQLKGVLEILGVRE LTGQEAEQGIWARVQDAMAAVSGVVGSAVTQNTDKKDLNIQDVIRLDHGKVNTLFTEL LASQDPQKIQEYFGQIYKDLSAHAEAEEQVVYPRVRPFYGQNDTQELYDEQAEMKKML EQIKGLNPSNMEQFKGKIKQLMDAVGDHIRQEESSMFAAIRNNLSTEQSEQLATEFKA AKTELQKKLGVVGAK" gene 3872..4612 /locus_tag="DP116_06885" CDS 3872..4612 /locus_tag="DP116_06885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865507.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2993 domain-containing protein" /protein_id="PRJNA477356:DP116_06885" /translation="MPDSPGLGEQALNKAAEIGLSSQLDKVEDLNVDVKTDPLKLVQG EVDSVSIEGKGLVMQKDLRVEELKMQTDSVAINPLSAAFGKIELTKPTQASTLVVLTE NDINHAFNSEYVRSQLQTQKIHINGQLTTIIPQHVDFHLPGENKVALGASILLRETNE THQVAFSAVPKVSANGQTVSLENVEYGDTKEISPELTKTLIDATSEILNLSNFDLEGM TLRVKQLEVETGKLILQAEAYVEQIPSA" gene 4642..5184 /locus_tag="DP116_06890" CDS 4642..5184 /locus_tag="DP116_06890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06890" /translation="METTERTSTPFPNLPPVIESDDREYHDTGVPSTVAIAGHPLHPL SVIFPIAFLAAALGSDFGYWLTHDFFWARASLWLIGLGLLGGVVAALIGISDFLQIER VRKRSAGWVHLTLNVAILVLSAINFILRLGNPESAILPWGLIISLIVGTLTSASGWFG AELSYRHKIGVVGAGSRRYP" gene complement(5774..7009) /locus_tag="DP116_06895" CDS complement(5774..7009) /locus_tag="DP116_06895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_06895" /translation="MRRQAVKIRLYPTDQQVKILAQHFGCARWWWNYGLNKCIETYKA TGKGLSQSGLNSLLPALKKDEETEWLGECYSQILQSVSLNLSRAYKNFFDGRAQYPKF KSRHHRQSIQYPQKVKQVNDCLKFPGTLGVVKANIHRLLDGTIKTVTVSKCPSGKYYA SVLMEYEDDYPTPSTDGKVIGVDLGIKDFAITYDGEKVSKYPNPKHLAKYEKKLAKKQ RIAARKVKGSSRRRKALRKVARVYEQVSNVRQDYLHKLSRKIVDNNQVVVVENLNVKG MVRNDKLAKAISDTGWGTFVNFLSYKLERNGGMLIEINRWFPSSKLCSNCHYQIKELS LNTRTWVCPSCGTHHDRDGNAAMNIRTEGVRMLSSSGTGEANANGEEVRPIRGRKPTM RPSSVKLEAPTIPLAVGGG" gene complement(7303..7929) /locus_tag="DP116_06900" CDS complement(7303..7929) /locus_tag="DP116_06900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318817.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_06900" /translation="MKTLRLYDFLPSGNGYKIRLLLTQIGMPFERIEVDITKGESRTP DFLGKNSNGKIPVLEVEPGRYLAESNAIMTYLSEGTEFLPYDPFLRAQVLQWLFFEQY SHEPYIATLRFWISILDKAQEYHEVIEQKREPGYAALMVMEKHLSNHAYFVGERYTIA DIGLFAYTHVADEGGFDLTRFPAIQDWIERVKAQPAYISITEELTSSS" gene 8131..8799 /locus_tag="DP116_06905" CDS 8131..8799 /locus_tag="DP116_06905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06905" /translation="MCGRFTLIQTAEALYETFHVEKLPELEPQYNIAPTQMVGAVLYN QESEERKFQKLRWGLIPSWSKDLGIGVKLINARAETVAEKPAFRSALKYRRCLVVADG FYEWRLQENQKQPYYFRLQEGKPFGFAGLWEQWRSPEGEEITSCTILTTEANELVQPI HERMPVILQQQDYDVWLNPEIQTPSSLQQLLHPYPSEAMTAYPVSKVVNSPKQNIPDC IKPL" gene 8908..9429 /locus_tag="DP116_06910" CDS 8908..9429 /locus_tag="DP116_06910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006276872.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem I assembly protein Ycf3" /protein_id="PRJNA477356:DP116_06910" /translation="MPRTQKNDNFVDKSFTVMADLILKLLPTNKKAKEAFVYYRDGMS AQSEGEYAEALENYKEALELEEDTNDRSYILYNMGLIYASNGEHDKALELYHQALEFN PRLPQALNNIAVIYHYQGEKAKEAGDEDGGEALFDKAADYWIRAIRQAPNNYIEAQNW LKTTKRSQVDVFF" gene 9493..9783 /locus_tag="DP116_06915" CDS 9493..9783 /locus_tag="DP116_06915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997343.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase GatCAB subunit C" /protein_id="PRJNA477356:DP116_06915" /translation="MIDREQVRKVAHLARLELTPEEEEKFTTQLGSILDYFEQLSELD VNNVPPTLRAIDVKNVTRADDLQPYPNREDILQSAPEQERDFFKVPKILNEE" gene 9932..11395 /locus_tag="DP116_06920" CDS 9932..11395 /locus_tag="DP116_06920" /inference="COORDINATES: protein motif:HMM:PF00144.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine hydrolase" /protein_id="PRJNA477356:DP116_06920" /translation="MLQSVEQRIERVINNLLPATALEGKFGSPKTLDEQLVHYHTPGI SIAVINDFEIEWARGFGVCEARTSREVTPNTLFQAASISKPVFALAVMRLAQEGRLNL DEDVNTYLTSWRVPAIGDWQPRVTLRQLLSHTAGLTVHGFPGYLNSEPLPTTIQVLNG EPPANTDKVEVNIIPGLHYRYSGGGTTVAQQVLVDLLKQPFPEIMRELVLNPLGMTNS TYQQPLPNDWSARAATAHPFSGIPLEGKHHVYPEMAAAGLWTTATDLAKVGVEILRVL RGLPATVWSKETIEEMLRPQQPEQTQGANESFVGLGNGLFVGLGFFAGGGIGDGFYFF HSGTNEGFVALMRVYAHIGKGAVVMLNSNEIELMPEVMRSLALEYDWPDVFPQEKPII TLSQTDSYSGLYLTKSGLQFKVMSQDGSLFLQCEQQLPLQFFPTSELEFFAKAANTSI SFEKDDTGNITAMTLSQAGVMSLRQQADQQIRAQRQG" gene 11562..12542 /locus_tag="DP116_06925" CDS 11562..12542 /locus_tag="DP116_06925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318813.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_06925" /translation="MGLGILKDGKWVSDREQEDSQGKFLRPSTTFRNRITADGSSGFK AEPGRYHLYISWACPWAHRTAIMRQLKGLQDVISMSVVAAEIHDNSWEFADEPGSTPD TVNGTQYLWQVYLKADPNYSGRVTVPVLWDTQNQTIVNNESREIIRMLDTEFDAFAKQ NVNFYPEHLQKVIDETIDAIYQPINNGVYRAGFATTQSAYDEAVTELFDALDHWEKVL GKQRYLCGEQLTEADWCLFTTLFRFDAVYYVHFKCNLRRIVDYSNLWNYLKDLYQQPG VKETCNLDHIKRHYYKSHPKVNPTRIVPKGPIIDFDAPHNRDQLRAAMTV" gene complement(12632..13111) /locus_tag="DP116_06930" CDS complement(12632..13111) /locus_tag="DP116_06930" /EC_number="4.6.1.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455995.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase" /protein_id="PRJNA477356:DP116_06930" /translation="MNIRIGNGYDIHRLVSDRPLILGGVHIPHSLGLLGHSDADVLTH AIMDAMLGALSLGDIGHYFPPSDPQWAGADSLVLLSKVHQLVRQHGWQIGNVDSVVVA ERPKLKPHIEKMRETIAQVLQVQPNQVGVKATTNEKLGPTGREEGICAYAVVLLEKF" gene complement(13322..14026) /gene="trmD" /locus_tag="DP116_06935" CDS complement(13322..14026) /gene="trmD" /locus_tag="DP116_06935" /EC_number="2.1.1.228" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140992.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA (guanosine(37)-N1)-methyltransferase TrmD" /protein_id="PRJNA477356:DP116_06935" /translation="MRFDIVTLFPDCFTSVLTTGLIGKALAKQIAQVHLINPRDFTTD KHRKVDDEPYGGGVGMLMKPEPIFAAVESLPVLDRREVILMSPQGQTMNQPLLRELAT NYDQLVVICGHYEGVDERVLHLVDREVSLGDFILTGGEIPAMALMNGVVRLLPGTVGK VESLTAESFEEELLDYPQYTRPAVFRGWKVPEVLLSGNHAAIYKWRYEQQITRTRLRR PDLFKCWQEKRKQQGE" gene 14695..15558 /locus_tag="DP116_06940" CDS 14695..15558 /locus_tag="DP116_06940" /EC_number="3.4.15.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyanophycinase" /protein_id="PRJNA477356:DP116_06940" /translation="MPQLKVKSLEMRTPQATKTAVLVIGGAEDKVHGREILRTFVSRS GASNGHITIIPSASREPSIIGGRYIRIFEEMGAKKVEILDIRERDQCEDSYIQASLEN CTGVFLTGGDQLRLCGVLSDTPAMDIIRQRVRAGQLTLAGTSAGAAVMGHHMIAGGGS GESPNRSLVDMATGLGLIPEVIVDQHFHNRNRMVRLMSALAAHPDRLGIGIDEDTCAM FERDGWLQVLGKGSVTIVDPTEVTHTNEPHVGATEPLNIHNLRMHLLSHGDRYHVYQR TVLPAVYRVSS" gene 15754..18453 /gene="cphA" /locus_tag="DP116_06945" CDS 15754..18453 /gene="cphA" /locus_tag="DP116_06945" /EC_number="6.3.2.29" /EC_number="6.3.2.30" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318616.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cyanophycin synthetase" /protein_id="PRJNA477356:DP116_06945" /translation="MRILKIQTLRGPNYWSIRRHKLIVMRLDLENLAETPTNEIHGFY EGLVEALPSLEGHFCSPGCRGGFLMRVREGTMMGHVVEHVALELQDLTGMNVGFGRTR ETSTSGVYQVVLEYLNEEAGRYAGRAAVRLCQSIVDRGRYPKAELEQDLQDLRDLWRD AALGPSTETLVKEAEKRGIPWMQLGARFLIQLGYGVNQKRVQATMTDSTSILGVELAC DKEATKRVLANAGVPVPRGTVINFLDDLQQAIEYVGGYPIVIKPLDGNHGRGISINIT STEEAEAAYDSARQVSRAIIVERYYTGRDHRVLVVDGKVVAVAERVPAHVVGNGKSTI FELIEETNKDPNRGEGHDNVLTKIELDRTSYQLMEKQGLTLNSVLPKNQICYLRATAN LSTGGIAIDRTDEIHPENIWLAQRIVKVIGLDIAGIDIVTADISRPLREVDGVIVEVN AAPGFRMHVAPSQGIPRNVGGAVMDMLFPADKISHIPILAVTGTNGKTTTTRLLAHIF KQTHKVIGYTTTDGTYIGDYLVESGDNTGPQSAQLILQDPTVEVAVLETARGGILRSG LAFENANVGVILNVASDHLGIGDIDTIEQLANLKSVVAEAVFPDGYAVLNADDHRVAA MAEKTKANIAYFTMNPDSELVRKHIQKGGVAAVYENGYLSILKGDWTHRIERAENIPL TMGGRAPFMIANALAASLAAFVQNVSIEQIRAGLNTFRASVSQTPGRMNLFNLGNYHA LVDYAHNPASYQALGAFVRNWISGKRIGVVGGPGDRRDEDFVMLGKLAAEIFDSIIVK EDDDTRGRGRGSAADLIVQGIKQINPNYQHQTILSETEAINRALDMAPDNSLVVILPE SISRAIALITARGVVKDEILQQTNPSTIDSQVGVKSTVVNTLL" gene complement(18571..18744) /locus_tag="DP116_06950" CDS complement(18571..18744) /locus_tag="DP116_06950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875501.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="twin-arginine translocase TatA/TatE family subunit" /protein_id="PRJNA477356:DP116_06950" /translation="MFGLGWTEVGVIVLVAIVIFGPKKIPELGSALGKTLRGFKEELK NPSEDTNPEEEKQ" gene complement(19145..19363) /locus_tag="DP116_06955" CDS complement(19145..19363) /locus_tag="DP116_06955" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06955" /translation="MNEQIEKTDKLIFSLFSTMLCLANHLITYNSLLFVIFLVKKDRK LKDWPKICEIDSTFGSGSFTALTYIIVG" gene 19426..20082 /locus_tag="DP116_06960" CDS 19426..20082 /locus_tag="DP116_06960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490164.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NYN domain-containing protein" /protein_id="PRJNA477356:DP116_06960" /translation="MQLLPRPQHNNQRRLKLEPEPLLNMTQLSAFEKPDIASANTLQP CEKNNPDNKLHQRIAIFIDGANLFYSAMHLNLEIDYTKLLRYLTKKRQLLRAYFYTGV DYTNDKQQGFLMWMSRNGYRVVSKELIVLPNGSKKADLDVEIAVDMMTLARYCDTLVL LSGDGDLAYAVDNITYRGVKVEVVGLGCMTNESLIRVADSYTDLELIKQDIQKKVSAQ " gene complement(20740..21252) /locus_tag="DP116_06965" CDS complement(20740..21252) /locus_tag="DP116_06965" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06965" /translation="MSFDGKCHFDWIYQRLNPLVLDSKTYVLFGSDNGQPGGTDPYNG DTNINEYRSLLCIKKTGAPAPEGLPPSSVTPGGATKASWSGGTALIIPNIQGKQLTSQ AVADKMCDQVGQITRGTSGYRMAEFHDGTGPNPGWSFWAEAYGEINGLAPSTRYWVRI NDQPANPWGN" gene complement(21261..23279) /locus_tag="DP116_06970" CDS complement(21261..23279) /locus_tag="DP116_06970" /inference="COORDINATES: protein motif:HMM:PF13646.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06970" /translation="MSLTTKHHSTLNQKFFRPKSILALSGLLLTLAVPTTVVYGKQLS QLGTDLTSTILAKNNTLPVYQFAPGQRFTYKLDYTNSSNSDLRALFGDLKTSNRSEAT SINGFVNTFETSVKGELVVTILEQKGSCFLISYTVNNPAVTLKANGQDVTDQAQLIQQ DLKRQVFATVNSQGKIIAVRFDPTMSEVAQNFARTLLATTQFVTPDSSKTFANKWTSQ EDDPNGHYIASYQTEAGKYQKNKLRYLQPPSSKKANNTQVPTTINSEGKLTGDFNSHG GYLVSLQGTELQKFIIAGKNVGQAKTTLNMAYTNQATLNPTELTTLRDTNTKREKVAP AIALSATPTEAEVEAKIQRQELGDTTLESLLADLEKAEASPDKNQNNTPLYLKFKALI YVYPESSATLGKRLAAANAKSLTMQMLAGALSVVGNTQAQSALVTATQAHQKDWLAMS ILIPSLATVNSPTQESENVLRNLAFNSKEERTASTAQLALGAMARNLAENSPERANKI VEQFVQQLQTAKNDEQTRQYLLVLGNAGSKQALKAIAQLTSAPNPSVRAAAVTALRWI QDNQVDTLLTKALSSDSDDGVRLEATVALGFREMNKATFQVQKQAFLSDNAIKVRLAA LKNVWEAHEGFPEVRQLVKQAAQNDVSKDVQEAAASIVKMYPQGYFDK" gene complement(23498..24946) /locus_tag="DP116_06975" CDS complement(23498..24946) /locus_tag="DP116_06975" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06975" /translation="MSKRIAFISSIGSMLAIYALNLAATLPVMAKTDLQANQAVDSSN FERPNNAKIDNSAFKEAPTPEVNIDNTLVQLKKQEPIKFQPFELKDPETGKEVSEDTI LTLPNGEKIQAAEYYAEINRLEQQFNELGYSLRSPEKEVTLQESNINQSTLEKQAAEI DQAHQPEDSAKAALRESLEPNKVLDFIKQRLNQETELAPSKTNNNELQKSLSKQIKSD AINKGLRFPFPRTKTILKTFNLDTGDPKIIAAYIKGKMELTASPLSSSAYAEANAGGY MFNNHADLLRATASLNAPASGNLTTNMNLFVHGNNVYNFQDARQASLQLGDKYSRSLD KEVANFGFSLGPVSMKGKLGVQGSGGFTYNLVASPKSAYAKVNAFLDTRGYGQAVASI KVVSAGLDVDLTFLKDNLDISALALVDIEPGTNRAYLKADYYAFNDIKALNGKIYGYV KVFGKQLKTKIWDWNGFNKSGYLFNGSEKIYF" gene 25177..26052 /locus_tag="DP116_06980" CDS 25177..26052 /locus_tag="DP116_06980" /inference="COORDINATES: protein motif:HMM:PF14516.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06980" /translation="MSRIMHHAAQQGYRTVVLNLRDAVGEDFNSLDKFLQWFITSIAE TLELGQPVEEHWRKSLGNCKIKCRTYFEKYLLPGDSAVAIALDEVDRLFVHGEIAGEF LGMLRTWHEDAKTKPLWRQLRLLMLHTQVYTQLNINQSPFNAGTEIKLTDFTSEEVES LSRQYKLNWKNTQVEQLMAMVGGHPYLATKAIQVVSRQDMTLESLLQSAPTASGIYRN HLERHWRYLQANIPLATAFKTIVLADSPVEFNSNLNLDDAVKLYDFGLVELQTNSVVS RYQIYSLYFQERLGS" gene 26251..28272 /locus_tag="DP116_06985" CDS 26251..28272 /locus_tag="DP116_06985" /inference="COORDINATES: protein motif:HMM:PF13181.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999745.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06985" /translation="MSYSKQSNGYQVGGSLGESASTYVVRRADSQLYSGLKTGKFCYL LNCRQMGKSSLRVQTMARLKKEGVACVAFEMREFCLHEVTEDEFYGGFVSYLVNEFNL EIDLESWWYGHSLIHPALRLTKFIEEILLEQIPQSIVIFADEIDSVLNLSFKDDFFAL IRGCYNKRADKLKYNRLTFALLGVATPADLIEDKDNTPFNIESEAIELTGFQLDEATP LEKGFVGISSNPRAVLQEILLWTGGQPFLTQWICQLVSSNLSHIAAGVEADCVATIVR SRILSNWLAQDKQQHLQTIRDRILNNEQLACWSLGMYQKVLQAGELAVDDSPEIMKLR LSSLVIKQEGKLRVYNRIYQSVFDNTWVEKELQNMRPYAEAFAAWEASGRDTSHLLRG DDLGFALAWANGRSLSDKDYQFLVASQELELTQVQQRTEIALLQEKQAQQNFVEAQRK AKRSARLTFASMIASLTMGAITIILTPGPLAILLNNIAFQIYADDHLQTALQVYDLAL LIKPAYPEALYTKGRIYEDLQDFDNASNNYESAKDHKFPQAYSELARLHILNKRYSRA VDIISEGLKLRLTDREKYGMFKNLGWAQFELARQGKATYEQAEISLDEAIDLQEESAS PHCLLAQLLEAKALKSKSLTEWRKCFELGDSDHPDERKWIEDARQRLRM" gene 28387..28860 /locus_tag="DP116_06990" CDS 28387..28860 /locus_tag="DP116_06990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06990" /translation="MLKGSHTIAFALTLTSLFLVINTNIAYAKQNPERGTYVINEPEP GDAPGLPRPPRPCSRPGGTFPCTRRPAPPIPIIDLELNDVTNDVAIKKLEALKASGKA TNDDYILLGYFYSLEKKYDLSEANYLKALELATDDGRKAIIQQELKKVRDHNVKQ" gene complement(29056..30264) /locus_tag="DP116_06995" CDS complement(29056..30264) /locus_tag="DP116_06995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875378.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_06995" /translation="MSDTRTLLIIDDCAQDRRIYRRYLLKDPHQSYQILEAESAKDGL ALCQKILCDVILLDFYLPDMTGLEILEQLTRERLDTAASVIMLTGQGDEAVAVQAMKK GVQDYLVKQHLKPDVLQLAVRKVIQHSDLQTVLTKTRERQRFMLKQAELLAQTQAALR KEQKLNAFKSQMMTTVSYEYRTPLASILAATSTLKQHGVKLDESKQEKFLQIIEHKAR YMAKLVDDLLVFNQFESNKAQFKPQPLDLYHFFSGLIEEQQEKLSDSYAAAQSADRHH LVYEITGNYKGFWGDGGLLRQIFVNLISNAIKFSPDGGEIEFHLIGGEEQVIFYVKDK GIGIPINEQENLFQAFSRGSNVDTIPGTGLGLAIAKVCVELHGGDITLESQVGKGTRV IVTLPKRSSV" gene complement(30594..33434) /locus_tag="DP116_07000" CDS complement(30594..33434) /locus_tag="DP116_07000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_07000" /translation="MSVFPKVHRYRFRLLLLTFSLITVLLLGDKFSYSQITTSSSLLD SQKITVSHKKERISLTENVQKTLLDNGLTVLTREIHSSPVVTVQVWYKVGSRNEEPGL NGIAHQLEHMMFKGTKNRPIQFGRLLSALGSDSNAFTSYDQTAYYNTAQADKLTALLT LEADRMQNARIDPTELDLEKRVVVSELQGYENSPDYRLNRALMRAAFPNHAYGLPIGG TEADVQRFDLEQVQKYYRNFYSPDNAVLVIVGDFQSEPTLEAVKEIFGKIPKTQESTL KSQHSKVLQPTPVTPHFLPIVLKESGAAALVQAVYPLPDAKSPDVPALDLMDRILTDG RNSRLEQALVESGLATDVSASVVNLMELGWYELLVTADPDQDLKKIDSVLNSAIAKLI NKEVTTEELKRAKVMLEASVLLSNRDITSLALQLGNDETTAGDYHYTDRYLAAVRQVT AQDVQRVAKKYLKQEARTVGYFKPTQVKGKGNDPKKINNSKRNTENFATSASVTTEEV TKYLPPTDSRSVPTSHTLPEQFTLSNGLRVLLVPDKSTPTVTLSSYVKAGKEFDPQDK AGLASLVAENLMNGTKTKDALTLAKILEDRGASLHFDAYREGVRIEGDSLAGDLSVLI QTLADVVKNAYFPTKELKLTKKQALTALKHELDDPSEVAQRTFVQSIYPKQHPLHVFP TEQSLRRISRKDVIEFKTKHYRPDTMVLALVGDFACDQVQALISSEFGHWKATGSPPT LKYPTVSLPEKVINVNPVLPGKAQAITYMGNTAINRKDSRFYAALVLNQILGGDTLSS RLGAEVRDRQGLTYGIYSSFVAGNNSGTFLISMQTSPEDTREAIASTRELLKEIHQKG VTEPEVETAKHILISNYIVSLANPEELIDQILMNEVYGLNKQELRDFTEKIQAVHFEQ VNQAARELLYPDKIVVVTAGPAVDTEQGSISH" BASE COUNT 9621 a 7077 c 7173 g 9600 t ORIGIN 1 aatgttaatt aatgtctagc tttgtccaac aatttagtta cagtttctag ccccttagtt 61 cctgaagccg tatcgcactt gggaaatgtg ggctaaagtg ggctaaaaat aatttccccc 121 ctgccgttct tatatgtggc tcggggggaa aaaataggtg gtagactata gagtttctgt 181 ataacttgat gtctctattc tattaactgt gttcactttt gtacagtcta atcataagaa 241 tttacggtat tttcacaaca tctttactag ttctttattc tcgctatctc tggagctagt 301 cttatatata agtgtagcat tacatctttg accaaacttt ctaacttgag tctctctgta 361 aaaatatgaa cattaaaatt gtcacggcta cggttctgtt tacttttttt gggttgacag 421 cacaagcctt cgcactcaac gagcaagact tggaactttt gagaacaaca aatacttgtc 481 ctcgttgtga tttaagtggt gctgatttat ctcaagctag attaagtgga gccaatttac 541 gagaagcaaa cttaaagggt gcgaacttat ctcaggcaaa cctcacaaat gcagatctca 601 caggtgcaaa tctagaaact gcagttttaa cttccgcaaa cctctcgaat gcttccctaa 661 caggtgcgaa tctcaaatta gcatctctgg acaataccaa tttgacttct gctgggttta 721 taggtgccaa tttagaagct 
gccaatctca caggagctaa acgacagtat acaaactttc 781 gaggagcaaa tttccgtttg accacaatgc ccacaggcac tgtcacttct gacaaaaatt 841 atggctggtc actacagcgt ccagctcaac aagttgaatg tgataagttc aaaaaccaga 901 aggttcctgg tacaacttgt cgcggggaat aaggaggacc aggggagcac ctttagcagg 961 agagcagggg aagaaattac ttgcttattc cgcgtcttcc ctacttgctc atctccctca 1021 ctactccatc ttccccactt gcaaaaggga gcatcccaaa tgtgtaaatt tattttattt 1081 tgtagtgcgg gccggacagc ccgcttcggt tataaactgg ggagcaagat gctcccacta 1141 ccattttttt tgcaaaattg ggatgctccc cttgcaaaaa ataattgaag ttaaattagc 1201 atttggttaa catttcttta tcataggaaa aatgacaaaa cagtgtgtaa cttaattcac 1261 caatgcttac tcatattaaa gacttggcaa caaaactgtc accccgctta attgagattc 1321 gccgtcacat ccactctcac ccagaactca gtggtcaaga gcagcaaaca tctgcttttg 1381 tttctggtgt tttatcttct agtggtcttc acgtacaaga aggagtcggt aaaactggag 1441 tgattgcaga actgcaaggg actggcaaag atgaacgttt attggcaatt cgcactgaca 1501 tggatgcgct accaattaca gaacgcacct gtttagagtt cgcctctcgc tcacaaggtg 1561 tcatgcacgc ttgtggtcat gatgtccaca cgacagtcgg tttaggaaca gcaatgatac 1621 tgtctcaaat ggcagaggag ttaccgggtg gaatacggtt cttatttcag ccagcagagg 1681 aaatagccca aggtgctagt tggatggtaa acgatggcgc aatgaagcag gtatcaaata 1741 tacttagtct tcatgttttc ccctcgatac ccgcaggttc cattggtgtg cgttacggcg 1801 cgttaacggc agctgctgac actatagaga ttatcattat cggtgaatct ggacatggtg 1861 ctcgtcctca tgaggcggta gatgctatct ggattgcttc gcaagtcgta accacactgc 1921 aacaagcaat cagccgaacc cagaaccctt tacgtcctgt cgtgttaagc atagggcaga 1981 ttaacggtgg cagagcaccg aatataattg ctgataaagt gcagttgttg ggaacagtcc 2041 gttctctcca tcccgaaacc cgtgccaatc tgccaaattg gattgaaaaa atcgtcgcta 2101 atgtttgtga ttcctatggt gcaaaatatc aggtaaacta tagccaaggc gtgcctagcg 2161 tgcaaaacga ttatttccta acgcagttga cgcaagcagc agcagaggaa gcatggggta 2221 gcgatcgcgt tcaagtgtta cccgaacctt ctcttggtgc cgaagatttt tctgtttatt 2281 tagaacacgc tcctggtagc atgtttcgct tgggtgtggg cttcaaagat agaatcctta 2341 attacccatt acaccatccc ctctttgacg tagatgaatc tgccattatt actggagttg 2401 tcacgatggc gtacacagct tacaaatatt 
ggcaccaaaa ttgataaact taaatatcta 2461 aataaaaact cccagcactc ttgtactgga agttgatact ttagcgagac taatcactag 2521 acgcaaaaca aagcaaaaat ttgcgtctag tacttccata gactacttag cgccgactac 2581 gcccaatttc ttttggagct cggtcttagc agctttgaat tctgtagcca gttgctcgct 2641 ttgctccgtg ctcaagttgt tgcgaatagc agcgaacatg gagctttctt cttgacgaat 2701 atggtcgcca acagcgtcca tcagctgttt gattttacct ttgaattgct ccatgttaga 2761 ggggttaaga cccttgattt gctctaacat cttcttcatt tcagcttgct cgtcatacag 2821 ttcttgagta tcgttctgac cgtagaaagg acgtactctg gggtagacga cttgctcttc 2881 agcttcagcg tgagcactca aatccttgta gatttgacca aagtactctt gaatcttttg 2941 gggatcttgg ctagccaaaa gttcagtgaa caaggtgtta accttgccgt gatccaaacg 3001 aataacatct tggatgttca gatccttctt gtcagtgttt tgggtaacag cgctaccgac 3061 aacgccgctg actgcagcca tagcgtcttg cacgcgtgcc caaatacctt gctccgcttc 3121 ttgtccggtc agttcgcgga cacccaaaat ttccagaacg cctttgagtt gctcttggtg 3181 agcgcggttc tcaaagttga tggtgtttag aggtccgaga gccagcatga cgtcagcacc 3241 aactttttgt gctgctttgt gaactgtcaa gccggtcatg acttgttggt gcttcagcaa 3301 ttcatgctga aagactttct cgtacaggct cagctcagag ccttgttgca atttacgaac 3361 cttttcaatt aactctgtaa cggtcttctt tggctcggct tggatgccat actgaccgat 3421 tacagtttcc aaaacgccca gatttttttt gtcatcttcg agaaagttcc ggaggcgatc 3481 agcaatttct tggtcactga cttgtttgat aagctgttct tcattttcaa tcagcaactg 3541 ttgaagtgct ttcagatctg ccagtttaat agcaatagca gaacgctttg tatcatctag 3601 tgttgcaacc attttctggt tcccctcagg atcaagtttt ttgcgttatt tacattctca 3661 aggtgacaca atcatgaata agcttccatc tttcttttga ccgatagctc ataaactctg 3721 gatagaaaaa ggagaaaaaa acttcatctt ttataatttc tacttggaga tagatagtta 3781 ctttcacgat gctctatcgc tagatagaga aaaatttttt gcaaattctc tagattgacg 3841 caaggatagt taaaaaggag ataaaagcat aatgcccgat agtccagggc taggagagca 3901 agcgctgaat aaagccgcag aaatagggtt atctagccag ttagataaag tagaagattt 3961 gaatgtagat gtcaaaacag atcctctgaa actggttcaa ggagaagtcg attcagtctc 4021 aattgaaggc aaaggattgg taatgcaaaa agacctccga gtggaggagt tgaaaatgca 4081 aaccgatagc gttgccatta atcccttgag tgcagctttt 
ggtaagattg aactgacaaa 4141 accaacgcaa gcaagtacac tagttgtttt gactgaaaac gatatcaatc atgcctttaa 4201 ctcggaatat gtccgttcac aattgcaaac ccagaaaatt cacatcaacg ggcagttaac 4261 gacgattatt cctcaacacg tagattttca tttacctggt gagaataaag tcgcattagg 4321 agcttctatt ctattaagag aaaccaatga aactcaccaa gttgcttttt ccgcagtgcc 4381 gaaagtgagt gctaacggac aaacagtttc tctagaaaat gttgagtatg gtgatacaaa 4441 agaaatctca ccagagttga caaaaacttt gatagatgca acgagcgaaa ttttgaattt 4501 aagtaacttc gatttagaag gaatgaccct acgagttaaa caactcgagg tggaaacagg 4561 taaactgata ctgcaagcag aagcttacgt cgagcaaatt ccctcagcat aaaattagga 4621 tactttagga tactaaaaat tatggaaact acagagagaa cttcgacacc gttcccaaat 4681 ctcccaccag ttattgaaag tgacgataga gagtatcatg atactggtgt acctagcaca 4741 gttgcgatcg caggacatcc cttacacccc ctgagtgtca tctttcccat agccttttta 4801 gccgccgcct tgggtagcga cttcggctac tggttaactc atgatttctt ttgggcaagg 4861 gcttcgctgt ggttaatcgg acttggattg cttggaggcg tggtagcagc gctaatcggc 4921 ataagcgact ttttacaaat tgaacgagtc cgcaagcgtt ctgctggctg ggtgcacctg 4981 actcttaacg tggctatcct tgttttgagt gccatcaact tcattctgcg tctcggcaat 5041 cctgagtcag caatattacc ttggggtctc ataatctcac ttattgttgg tacgctgacg 5101 agtgcttctg gctggttcgg tgctgaacta tcctatcgcc acaaaattgg tgtagtgggt 5161 gctggtagta gaagatatcc gtgacaaaat tttagatttc gccaatagca acccaaaaag 5221 ctttcgtgta ataccatttc tttgtgaggc tgcgccaaat tttcttgact tcttctttct 5281 ttctttgtgt actttgcgcc ctaccctgcg ggaagccctc cgggttctta gagtatctga 5341 aatttaacag ttatcagtta tcaagtacca gccgcagtac cagccgcagt accagccgca 5401 gtaccagccg cagttatcag gtaggaaacg gactcgtccg cgtaggtagc tgcgtaggta 5461 gctgtttact gttcactgtt cactgtttta acagcctcac tcgaagacag gcgactgtcg 5521 tacatggttt cccaacttgt tcgcgtctcg tctgcccgta cacgctacgc aaagaagaac 5581 accaagaaaa cgaatgagtg caacaacttt atttctttag tagcgtgaac agattgtccg 5641 ccctgatcta aaagtcatga tttcatgatg agatcaatat cttgcccatc tcaaatattg 5701 aacgggcaag actggtatct caaaaaaaca cacaagaaat gaatcttgtt ataagtatga 5761 ctaaggctgt gaactaccca ccaccaaccg caagcggtat ggtgggggct 
tccaacttca 5821 cggaggaagg cctcatcgtg ggcttgcgcc ctcgaattgg tcttacttcc tctccattgg 5881 cgttagcctc ccctgtccca gaggaggaca gcattctgac accttctgtt ctaatattca 5941 tcgcagcatt accatctcta tcgtgatgag taccgcaact aggacaaacc caggttcttg 6001 tatttagtga caactctttg atttgataat ggcaattaga gcaaagtttg gaactaggaa 6061 accatcggtt tatttctatc aacatcccgc cattacgttc taatttgtaa gacaagaagt 6121 tgacgaaagt cccccatccc gtatcagata ttgcttttgc tagtttgtcg ttacgaacca 6181 tgcccttgac gtttaggttt tcaactacaa cgacttgatt gttatcaact atctttctag 6241 atagcttatg taaatagtct tggcggacat tgctaacttg ttcgtatact ctagctactt 6301 ttcttaaagc ctttctacga cgactactac cttttacttt tcgtgcagca atacgttgtt 6361 tcttggctaa tttcttttca tatttggcta agtgttttgg gtttggatat ttggaaactt 6421 tttcaccgtc gtaggtaatt gcaaaatctt ttattcctag gtcaacacca ataaccttgc 6481 catctgtact aggtgttggg taatcgtctt catattccat caacacagaa gcgtagtact 6541 taccagaagg acatttacta acggtgacag tcttgatagt cccatcaagt agacgatgaa 6601 tattcgcttt tacaacgcct aacgttccag gaaatttcag gcaatcattg acttgtttta 6661 ccttctgtgg atactggatg gactgacgat gatgtcttga cttgaactta ggatattgcg 6721 ctctaccatc aaaaaagttt ttgtacgcac gactaagatt gagacttaca gattgtaaaa 6781 tctgtgagta gcattctcct agccattcag tttcttcatc ttttttgagt gctggcaata 6841 aagaatttag cccagattga gacaaacctt ttccagttgc cttgtaggtt tcaatgcact 6901 tgtttaaacc gtaattccac caccagcgag cacatccaaa atgctgtgca agtatcttga 6961 cttgttggtc tgtgggatac aacctaattt ttacggcttg tcgtctcatc taaacttctc 7021 caaatatcga cctgttacta ttatatacaa ttttcagtaa cagcgttccg gaaaagttgg 7081 caactcctca tacatctttc cctctcttac cgtcggtaag aattggtgac gcactgggac 7141 aaagaaaatg agccacacga gtgtttgttg atctgtcggc gcttacgagt actcgtagtt 7201 ataaggctgg ctcattttcc cagtcggaac cccgcaaaga tgcaatcgtc gttgcccgtc 7261 gcctctcccg gcgctaaacg gagtaccgtt atagcgcggg acttacgacg aagaagttaa 7321 ctcttctgta atactaatat aggcaggctg tgctttcact ctttctatcc aatcttggat 7381 ggcaggaaat cgtgtcaagt caaatccacc ttcatcagcg acatgagtgt aagcaaacaa 7441 gccaatatca gcaattgtgt aacgctctcc tacaaaataa gcgtggttgg ataagtgttt 
7501 ttccatcacc attagagctg cgtaaccagg ttcacgtttt tgctctataa cttcatggta 7561 ttcttgagct ttatctaaaa tagaaatcca aaatctcaat gtagcgatat aaggctcatg 7621 gctgtattgt tcaaagaata accattgcag tacttgtgct cgtaaaaagg ggtcatatgg 7681 taaaaattct gttccttcac tcagataagt cataatggca tttgattcag ctaaatatct 7741 ccctggttca acttccaaaa cgggaatttt tccgttggaa tttttaccta aaaaatctgg 7801 cgttcgagac tcacctttag tgatatcaac ctctatcctc tcaaatggca taccaatttg 7861 cgtcaataaa agacgaatct tgtaaccatt gcctgagggt aaaaaatcat acaaacgcag 7921 ggttttcatg ccaaaaagtt gagatgaaat ggttaattaa tcatgccgtc aaaatacagc 7981 atattatcgg aaatgacaaa taaatatgat aaaatggttt gatatatagc attttagtac 8041 gaaacagaat tctctgatta ataaaaaata aaagtaatat ttgttgcacg atttagacgt 8101 ttgattttaa ttagcggttg aggaaaaagt atgtgtggaa gatttacttt aatccaaaca 8161 gcagaagcat tatacgaaac cttccatgta gaaaaacttc ccgagctaga gccgcaatat 8221 aacattgctc ctacgcaaat ggtcggggca gtcttgtata atcaagaaag cgaggagcgc 8281 aaatttcaga agttgcgttg ggggttaatt ccctcttggt caaaagatct aggaatagga 8341 gtgaagctga tcaacgctag ggcagaaaca gtagcagaaa aaccagcttt ccgctctgca 8401 ttaaagtatc gtcgctgttt ggtagtagca gatggctttt acgaatggcg gcttcaagaa 8461 aatcaaaaac agccttatta ttttcgtctg caagaaggaa aaccctttgg ctttgcaggg 8521 ttgtgggagc aatggcgatc gcccgaaggt gaggaaatca catcctgtac aattttgacc 8581 acggaagcaa acgaattagt gcagccgatc catgagcgta tgccagttat tcttcaacaa 8641 caggattacg atgtgtggtt aaatccagaa atacaaacac cctcatcgct acaacaactg 8701 ctgcacccct acccatccga agcaatgact gcttacccag tcagcaaagt tgtcaacagc 8761 cccaagcaaa atattccaga ttgtattaag cctttgtaac ttttcatgag gtactaatca 8821 gttagattaa cttgtagtga gcgctcaagc actcattaaa cacccagtca tcgtcatgat 8881 tgcggcaaac ctatcacata agccgttatg ccaagaaccc agaaaaacga taactttgtt 8941 gacaaaagct ttacagtcat ggcagatctg atcctcaagc ttctgccaac caataaaaaa 9001 gctaaagaag cctttgttta ttaccgagat ggaatgtccg cacagtcgga aggagaatac 9061 gctgaagcct tagaaaacta caaagaggct ttagaactag aagaagacac caacgaccga 9121 agctatatcc tctataacat ggggcttatc tatgccagca acggggaaca tgacaaagct 9181 
ctagagttgt atcatcaagc actggaattt aatccacgcc taccccaagc tttaaacaac 9241 atcgctgtca tttaccacta tcaaggggaa aaggcgaaag aagcaggaga cgaagacggg 9301 ggagaagcac tgtttgacaa agcggctgat tattggatta gagctatccg ccaagctccc 9361 aataactaca tcgaagctca aaactggtta aaaaccacta aacgctctca agttgacgta 9421 ttcttttaat gatttgtcgt tagtcgtttg tcatgatact aatgactaat gactaatgac 9481 taatgactaa ccatgattga ccgtgaacaa gttcgtaaag tagctcatct tgcgcgttta 9541 gagttgacgc ccgaagaaga agagaaattc acaactcagt tgggaagtat tcttgactat 9601 tttgaacaac tgagtgaatt ggatgtgaat aatgtgccac caacattacg agcaattgat 9661 gtgaagaatg tgacacgagc agacgatttg caaccttatc ccaaccgcga agacattctt 9721 cagagtgcgc cagaacaaga acgcgacttt tttaaggtac ctaaaatcct gaatgaggaa 9781 tagttatgag tcttcaataa gttatgagtg agtagttggg ttgctcaggt tatcggtgtg 9841 ggctgtattt tagattgtca gtggggcaat tcaacattga gtagaattgg acagcagtga 9901 cgaatccaaa acagaaaagc ggagtacacc gatgctgcaa tccgttgagc aacgcattga 9961 gcgtgtcatc aataatcttc tgccagcaac agcgctagaa ggaaaattcg gttcaccaaa 10021 aacattggat gagcaattgg tgcattacca cacgccaggc atcagtattg ccgtcattaa 10081 tgactttgaa atcgaatggg cacgtggatt tggtgtgtgt gaggctcgaa caagccgtga 10141 agtcacacca aacacgttgt ttcaagcagc ctcaatcagc aaacctgttt tcgccttagc 10201 ggttatgcgt cttgcacaag aaggtcgtct caatcttgat gaagatgtca acacctatct 10261 cacctcatgg cgtgtgcctg ctatcggaga ttggcaacct cgcgtcacgc tgcgccagtt 10321 gttgagtcat actgctggct tgactgtgca tggctttcct ggctatctaa actcagagcc 10381 attgccaacc acgattcaag ttctcaatgg cgaacctcca gccaataccg ataaagtgga 10441 ggttaatatc atccctggtt tgcattatcg ctattcgggt ggtggtacga cagttgccca 10501 acaggtattg gttgatttgc tcaagcaacc ttttccagaa atcatgcgcg aattggtgct 10561 caacccgttg ggcatgacaa acagcaccta ccaacagccg cttccaaacg attggtcagc 10621 aagggcagca acagctcacc cgttctctgg tatcccactt gaaggcaagc accacgttta 10681 tcctgagatg gcagcagcag gtttgtggac gacagcaaca gatttagcca aagtcggagt 10741 cgaaattttg cgagtattgc gtgggttacc tgccactgtg tggagtaaag agacgataga 10801 agaaatgttg cgtccccaac aaccagagca aacacaagga gcgaatgaat catttgtggg 
10861 attagggaat ggattatttg tgggattagg attcttcgcg ggtggtggaa ttggcgatgg 10921 tttctacttt tttcatagtg gtacgaatga aggatttgta gcattgatgc gtgtttacgc 10981 gcatattgga aaaggtgctg ttgtcatgct caactccaat gaaattgagc tgatgccaga 11041 agttatgcga tcgctcgcgc ttgagtatga ctggcctgac gtgttcccgc aagaaaaacc 11101 aatcatcact ttgtcgcaaa ctgatagcta ctcagggtta tacttaacaa aatctggatt 11161 gcagttcaaa gtgatgagcc aagatgggag tttgttccta caatgtgagc agcaactgcc 11221 gttacagttt ttcccgacat cagagctaga gttctttgct aaagcagcga atacgagtat 11281 ttcctttgaa aaggatgaca caggcaatat tactgcaatg acattaagcc aagcaggtgt 11341 catgagttta agacaacaag cggatcaaca gattagagca cagagacagg ggtgattaaa 11401 tcagttgtca gttatcagtt atcagttatc actgttcact gtttactgtt cactgttcac 11461 tgttaataat gccgctagta gttctatcta aagacgctta ggcaattgtt gattattaga 11521 tacaactagt agtgattaac gactaactac taattaaact tatgggcttg ggaatcctca 11581 aggatggtaa gtgggtaagc gatcgcgagc aagaagactc acaaggtaaa tttctccgtc 11641 cttcaacaac tttccgcaac cgtattacag cagatggttc tagcggattt aaagcagaac 11701 caggacgcta tcacttgtac atttcctggg cttgtccttg ggcgcatcgc accgccatta 11761 tgcgtcagct caaaggactg caagatgtca tcagtatgtc cgtcgtagcg gcagaaattc 11821 atgataacag ttgggaattt gctgatgaac ctgggagtac tcccgataca gtcaatggaa 11881 ctcagtatct ctggcaagtt tatctcaaag ccgatcccaa ctacagtgga cgggtgacag 11941 ttccggtttt gtgggataca caaaatcaga cgattgtcaa taacgaatcc cgcgaaatca 12001 tacggatgtt ggatacagag tttgatgcct tcgcgaaaca aaatgtaaac ttttatccag 12061 aacatttaca aaaagttatt gacgagacaa ttgatgcaat ttaccagccg ataaataatg 12121 gtgtctaccg cgcaggattt gccacgacac aatcagctta tgatgaagcg gtaaccgaac 12181 tttttgatgc tcttgatcat tgggagaaag tgttaggaaa gcaacgttat ctttgtggcg 12241 aacaactcac tgaagcggac tggtgtctgt ttacaaccct gtttcgcttt gatgccgttt 12301 actatgtgca tttcaagtgt aacttacgcc ggattgtaga ttattccaat ttgtggaact 12361 atctcaaaga cctctaccaa cagccaggag tcaaagaaac ttgtaatctt gaccacatca 12421 aacggcatta ttacaaaagt catcccaagg tcaacccaac tcgaattgta cccaaaggac 12481 cgattattga ttttgatgca ccgcataacc gggatcaact 
gcgtgcagct atgacagttt 12541 agattaacaa tgaacagtta tcagtgaaca accatttgat aacttttgac tgtttactga 12601 taactgtttt gattattcag gaaatggggt cttaaaattt ctcaagcaag acaacagcat 12661 acgcacaaat accttcttca cgtccagtcg gaccaagttt ttcattggta gtcgctttaa 12721 caccaacttg attcggctgt acttgcaaaa cttgtgctat tgtttcccgc atcttctcaa 12781 tatggggttt caatttcggg cgttctgcga caacgaccga gtcaacattg cctatttgcc 12841 aaccatgttg gcgaaccaac tgatggactt tacttaacag taccaaacta tctgcccctg 12901 cccattgggg atctgaggga ggaaaataat gtccaatgtc ccccaaagaa agcgccccca 12961 gcatagcatc catgatcgcg tgtgtcagaa catcagcatc gctgtgtcct agcaaaccca 13021 aggagtgggg tatatggact ccgcctagaa tgagggggcg atcgcttacc aagcggtgga 13081 tatcgtagcc gttaccaata cgaatattca tattttagtc attagtcact agtcactagt 13141 catgaattag tactcatacc gtttcttcat gaagctgcgc taaataattt cttagccccc 13201 cttgggaagg ggggttgggg ggatctgatt tgttgcatct tcatatagag atggtatcaa 13261 gtgtcaacag tcaaaagtca aaaattattc ttctccctca ctccctcact ccctcactcc 13321 cttactctcc ttgctgcttc cgcttctctt gccaacattt gaacaaatca ggacggcgca 13381 ggcgtgttct ggtgatttgt tgttcatagc gccacttata aattgcagca tgatttcctg 13441 aaagcaggac ttcaggaacc ttccagccac gaaacactgc aggacgggta tactggggat 13501 agtccaacaa ttcctcttca aagctttctg ccgttaggga ctcaactttt cccactgttc 13561 ccggtaacag acgtacgaca ccattcatta atgccattgc tggaatttct ccaccagtca 13621 gaataaaatc gcctaaagag acctcgcggt caactaaatg caatacccgt tcatccaccc 13681 cttcgtaatg accacaaatc actaccaatt ggtcataatt cgtcgccaat tctcgtaaca 13741 gtggctgatt catcgtttga ccttgaggac tcatgagaat aacctctcgt cgatctaaaa 13801 ctggcagtga ctctacagca gcaaatatag gttctggctt catcagcatc cccacaccac 13861 cgccgtaagg ttcatcatca actttccggt gcttgtcagt ggtaaagtct cgtggattaa 13921 tcaaatgcac ttgggcaatc tgctttgcta gggctttacc tatcagcccg gtagtgagaa 13981 cagaggtaaa acagtcagga aacagcgtaa ctatatcaaa gcgcacagta tttcgtattg 14041 gaggacacta atctaaaatt ggatgatttg aaaacgatgt tgtgtcacag acaaatcaac 14101 atcatcaatt gtgtctcaaa gtaccggaag tattgggttt tccgtgtcaa cacaagtttt 14161 tctaattact taatttatgt 
ttaaatattg tcacatctac atttaactta ttaaactgaa 14221 ccgagttaac aacttcttcg gatgcatctg gctagcctca gaaaaaagtt tgtcaacacc 14281 tgctaaagtc cctccacttg aagtgttggt ggtatgaggt aggaaaaccc cacagacgtg 14341 aactcagagt atgaaataaa gaatctcgtt gtttttcttc atgaaccaat tcacaagact 14401 aaaaccaaat cttcatggac acaggtgttc aaatcacctt cttaccttac accagtacgg 14461 gggcgaggct ttgcgcccgt acaaaccgag aggcgcaaag cctagatccc ctacaccata 14521 ccctagagtg agcgaatgtg aaaaacaaac cccaaaagaa acggcaaagg acactccaca 14581 aactgaggca ttttgtacat gaagcactgt cctccacgag aggcaaagcg taaaaagaat 14641 caggcgaaag ctacgttgtt taattgacct gaagttgttg acaggagaaa cacaatgccg 14701 caattaaaag tcaaatcgct ggaaatgagg acaccccaag caactaaaac cgctgttctg 14761 gtcatcggag gcgcagaaga caaagtacat ggacgcgaaa ttttgcggac atttgtcagt 14821 cggtctggtg ccagtaatgg ccacattaca attatcccat ctgcctcccg tgaaccaagc 14881 atcataggtg gtagatatat tcgtattttt gaagaaatgg gtgctaagaa agtcgaaatc 14941 ttagacattc gagaacggga tcagtgtgaa gattcttaca tccaagcatc tctagaaaat 15001 tgtacaggag tgtttttgac aggcggagac caattgcgtc tctgtggcgt tctgtcagat 15061 acgccagcaa tggatattat tcggcaacgc gtgagagccg gacaactcac cttagcaggg 15121 acgagtgcag gagcagctgt gatggggcat cacatgatcg cagggggtgg tagcggcgaa 15181 tcgccaaatc gctccctagt ggatatggct acaggtttgg gattgatccc agaggtaata 15241 gtagatcaac acttccacaa tcgcaatcgt atggtgcggc tgatgagcgc gctggcagct 15301 catccagatc gcctggggat cgggattgat gaagatacat gtgccatgtt tgagcgggat 15361 ggttggctac aagtcttagg caaaggtagt gttacgattg tagatccaac tgaggtcact 15421 cacacgaatg aaccacatgt cggagcaact gaacccttaa acatccataa tctacgaatg 15481 catcttctca gtcatggtga tcgctatcac gtgtatcagc gtacagtatt acctgccgta 15541 taccgtgtct ccagctgacg aagcgagtag ctgatgttac attagtaagc gaaacaaaca 15601 ctgaattgac cgctaaattt taggttgctt gcagccaaaa aaacgagaat aatacgatgt 15661 aagaagaaaa aactgtttac tatgaacagt taagcggtca attccagtaa gaaactagaa 15721 tattggttcc gaatctccat ctacctattt cccatgagaa tcctcaagat ccagacctta 15781 cgcggcccca actattggag cattcgacgc cacaagctca ttgtcatgcg cctcgattta 15841 
gaaaaccttg ccgagacgcc cacaaacgaa attcacggct tctatgaagg attagtggag 15901 gctttgccca gtctggaagg tcatttttgc tcaccgggct gtcgtggtgg ttttttgatg 15961 cgagtgcgcg aaggcaccat gatgggtcat gttgtggaac atgtagccct agaattgcaa 16021 gatttgactg gaatgaacgt aggctttggt cggacccgag aaacatcaac atcgggagta 16081 taccaggtag tgctcgagta tctcaatgag gaagcgggac gctacgctgg cagagcagca 16141 gtgcggctat gccaaagtat tgtagaccgg ggtcgctatc caaaggcaga gttagagcaa 16201 gatttgcaag acctcagaga cttatggcgt gacgctgctt taggaccgag tacggaaaca 16261 cttgtcaaag aagcagaaaa aagaggcatt ccctggatgc aactcggcgc acgctttttg 16321 attcagctag gctacggcgt gaatcaaaag cgggtacaag cgacgatgac agacagtacc 16381 agcatcttgg gagtagaact cgcctgcgat aaagaagcca ccaaacgcgt tcttgccaac 16441 gctggagtcc cagtaccaag aggtaccgtc atcaacttct tggatgattt acaacaagcc 16501 atagaatacg ttggcggcta ccctattgtc atcaaacctt tagatggcaa ccacggacgc 16561 ggtatctcta ttaatatcac ttccaccgaa gaagcggaag ctgcttatga ttctgctaga 16621 caggtttccc gagcaattat tgttgagcgg tattacactg ggcgggatca cagagtactg 16681 gtggtggatg gcaaagttgt ggcagttgct gaacgtgtgc cagctcatgt ggtgggcaat 16741 ggcaaatcta ccatctttga actcattgag gaaacgaaca aagatccaaa ccgtggcgaa 16801 ggacatgata atgtcctcac caaaattgaa ctcgaccgca ccagctacca actgatggaa 16861 aagcaaggtc tcactcttaa tagcgtgtta cctaaaaacc aaatttgcta tctgcgggca 16921 acggcaaact tgagtacagg gggcattgcc atagaccgta cagatgaaat tcatccagaa 16981 aatatttggc tggcacaacg gatagtaaaa gttatcggtt tggatattgc aggaattgat 17041 attgtgacag cggatattag ccgtcccttg cgagaagtgg atggtgtgat tgtagaagtt 17101 aatgccgctc ccggattccg gatgcatgtt gctccaagcc aaggcatccc tcgtaatgta 17161 ggaggagcag tcatggatat gctgttccca gcggataaaa tcagtcacat acccattctt 17221 gccgtcacgg gcactaacgg caaaactacc acgactcggc tgttggcaca catttttaag 17281 cagactcata aagtcatagg ttatacaact acagatggga catatatcgg ggattattta 17341 gtagagtcag gagacaatac aggtccccaa agtgcccaac tcatcctaca agatccaaca 17401 gtggaagtgg cggtactgga aactgctcgt gggggtattc tccggtctgg actagctttt 17461 gaaaacgcta acgtgggcgt gatattgaac gttgcctctg accacctggg 
cataggcgat 17521 attgatacta ttgaacagtt ggcgaatctc aagagtgtgg tcgcggaagc tgtgtttcct 17581 gacggctacg cagtgcttaa cgctgatgat cacagagtcg ctgctatggc agaaaaaaca 17641 aaagccaata ttgcctactt caccatgaat ccggattcgg aattggtgcg aaagcacatt 17701 caaaagggag gagtagccgc agtctatgaa aatggctatc tgtcaatctt aaaaggcgat 17761 tggacgcacc ggattgaacg ggcagaaaat atacccctga caatgggtgg acgtgcaccg 17821 tttatgatag ccaatgcttt agcagcttct ttggcagcgt tcgtacaaaa tgtcagcata 17881 gaacaaattc gtgctggctt gaataccttt cgcgcttcgg tgagtcaaac accgggacga 17941 atgaatttgt ttaatttggg gaattaccac gctttggtag attatgccca caacccagct 18001 agttatcaag ctttaggtgc tttcgtccgc aattggattt ctggcaaacg gattggagtt 18061 gttggcggac ctggcgatcg ccgcgacgaa gatttcgtca tgcttggcaa actagcagcg 18121 gaaatttttg actctattat cgtcaaagaa gatgacgaca ccaggggacg ggggcgcggt 18181 tcagcggctg atttaattgt tcaaggtatc aaacaaatca accccaacta ccagcatcaa 18241 acaattctca gcgaaacaga agcgattaac agagcattag acatggcccc agataacagt 18301 ctagtggtca ttttgccaga aagcattagc cgtgctattg ccttaattac agcacgtgga 18361 gtcgtcaaag acgagatact ccaacaaacc aacccgtcta ccatagattc tcaagttggg 18421 gtgaaatcaa ctgtcgtcaa cacgctgtta tagtccgaat caggcagcac ttgattcggg 18481 tttcccgact tgaagtgagt gcccttcggc tccgctcagg gtaaaccgtt catttgtcat 18541 tacgaaaaat aaacgacaaa cgacaaagga ctattgtttt tcttcttccg gattagtatc 18601 ctcactggga tttttcagtt cctccttaaa accccgcaga gttttcccca gtgcgctgcc 18661 taactccgga atttttttgg gcccaaaaat cacaatagcg accaaaacaa tcacacctac 18721 ttccgtccat cccaatccaa acataagcct ctactccata aaactaatac aacgccacac 18781 aagtctagta tttaagtata atttctgctt cactgggctt ttcacaaaaa tatgtaaaac 18841 tgggcgagga tggacaatgc gacagcaaca gagggcatcc atcgcctaat ttttttgtca 18901 acaaaactat agcgctcgcc acaggcaacc taggtgggca cgacgattgc tcgtggcagt 18961 cttgcagttt tacacaagtt caaccccttc actaagctta ctcaaaattg cccaaagagc 19021 aaaattgggc gcacttgtgt gtcaggttta ttgtcagagt tttaaacaag cacccgagcg 19081 tgctcccgaa gcgtggattc aagcaggacg tcaattctag acggtggctt tgttagatcc 19141 ggatttaccc aacgattatg tatgtgagcg 
cagtgaacga cccactacca aacgtactat 19201 caatttcaca gatttttggc caatctttta gttttctatc tttttttaca agaaaaatta 19261 caaaaagtaa agaattgtac gtaattagat gatttgcaag acaaagcata gtagaaaata 19321 aagaaaatat taatttgtca gttttctcta tctgttcatt catgagtcga ataattttca 19381 ccgctatctt taataaaaat cacgcacgct cggagtcata tttccatgca gctacttcca 19441 cgacctcagc ataataacca gcgacgctta aagttggagc ctgagccact gttgaatatg 19501 acacagcttt ctgcatttga gaaaccagat attgcaagcg caaatacctt acaaccgtgc 19561 gagaaaaaca atccagacaa caagctacac cagcgcattg ccatttttat tgatggtgca 19621 aacttgtttt actcagctat gcacctcaat cttgaaattg attacactaa actgctgcgc 19681 tacctaacaa aaaagcgtca actgcttagg gcttactttt atacaggtgt tgattacacg 19741 aatgataaac agcaaggttt tttgatgtgg atgagtcgca atggctatcg tgtagtgagt 19801 aaagaactta ttgtgctccc taatggctct aaaaaagcgg atttggatgt agagattgct 19861 gtcgatatga tgaccttagc aaggtactgc gacactttag ttctgttgag tggagatggc 19921 gaccttgctt atgctgtaga taacattacc taccgaggag ttaaagtgga agttgttggt 19981 ttgggttgta tgactaatga aagtctgatt agggttgctg actcttatac tgaccttgaa 20041 cttatcaaac aagatattca aaaaaaggta tccgctcaat aattagggtg tttcattttt 20101 tcgatcaatt ttgctagtcc atgctgaaat cgatcaactg ccaggttaat aaattcacgc 20161 acggcaggcg atagactttt atgccagggt tggaaactgt aacataattc ctacgcattt 20221 atcttcaaaa ctagtcaatt ttgagtaggt attgcggaaa tatgctatga tttcgatatt 20281 agtaacggtt aacaaccgtg ccgtcagagg ataaaccttc cagcgaacga ctggcgtttg 20341 ataaacgtgc cacaggcgaa gtgcaggctc catagacaga gggtcgttag tagcgcctca 20401 aaagaagttt tttaataact tcgacacata tagcggtcag ttagaccgtt ctcgcgcctg 20461 tggactggtt gatgccgaca tcgccagaat gaagcaggaa gcaagcttca aactagatca 20521 tgagcagatt tgtctactta tgaatagttg tgaataggtc ttgtgtaacg gatacagggc 20581 ggtgatatta acagatcaaa ccctgactca atcaaatcga catacctgtt tgataattca 20641 atcaggcagg ggctatgtaa agaaaagtaa agtttgaaaa ttcgtttgta gtaagcgctt 20701 actacaaacc tttaattttt tgcgccgact tatttaatgt taattacccc aaggattcgc 20761 aggttggtca ttaatacgaa cccaatagcg ggtagacggg gctaaaccgt ttatttcgcc 20821 ataagcttct 
gcccagaaac tccatccagg gttaggacca gtaccatcat ggaattctgc 20881 catccgatat ccagaagtac cacgagtaat ttgtcctacc tggtcgcaca tcttgtctgc 20941 aactgcttga gacgttaatt gcttcccttg gatattggga attatcaacg ctgtaccgcc 21001 agaccaagag gctttagtcg caccgccagg tgttacacta gacggcggca gaccttctgg 21061 tgccggagcg cctgtctttt taatgcagag cagcgatcga tattcgttga tattagtgtc 21121 gccgttataa ggatcggtac caccaggctg accattatcc gaaccaaaca aaacataggt 21181 tttggagtca agcactaaag ggttgagcct ttgataaatc caatcgaagt ggcacttacc 21241 gtcaaaagac ataatttttg ttatttatca aagtagcctt gaggatacat cttcacgatg 21301 cttgcagctg cctcttgaac atcttttgaa acatcgtttt gagcagcttg tttgacaagt 21361 tggcgaactt cggggaaacc ttcatgggct tcccaaacgt ttttcaaagc tgctaggcgc 21421 actttaatgg cattgtctga gagaaatgcc tgcttttgca cttggaaagt agccttgttc 21481 atttcgcgaa atcctaaagc aacagtggct tctagcctta ccccatcatc tgaatcagaa 21541 gacaaagcct tagtaagtaa ggtatcgact tggttatcct gaatccagcg caaagctgtg 21601 actgcggcag cgcgtacgct tggattagga gcagaagtca actgagcgat cgctttcaat 21661 gcttgcttag aaccagcatt gcccagcact aacaaatact gcctggtttg ctcgtcgttc 21721 ttagcagttt gcaactgttg tacaaactgc tctactatct tattggcgcg ttctggcgaa 21781 ttttcagcaa ggtttcgcgc cattgctcct aaagctaatt gtgcggtaga agcggtgcgt 21841 tcttccttcg agttaaaggc taagttacgc agcacatttt ctgattcttg ggtaggtgaa 21901 tttactgttg ccaagctagg aattaaaatt gacatggcaa gccagtcttt ctggtgagcc 21961 tgagtagcgg ttactaaggc agactgtgcc tgagtatttc cgactacact taaggcgcct 22021 gctagcattt gcatggtaag acttttggca tttgcagcag ctaatcgttt gcccaaagta 22081 gcgctagact caggatacac ataaattagt gccttaaatt tgagatacaa cggtgtattg 22141 ttctggtttt tgtcagggga tgcttctgct ttttccaagt cagcgagtaa actttctaac 22201 gtggtatccc ctaattcttg acgctgaatc ttagcttcaa cttctgcttc tgttggggtt 22261 gcagacagtg caatagcagg cgccactttt tctcgcttag tattggtatc acgcagggta 22321 gttagttctg taggatttaa ggtagcttga tttgtataag ccatgttcag cgtcgtcttt 22381 gcctgaccaa cgtttttgcc tgcgatgatg aatttttgta gctctgttcc ttgcaacgaa 22441 actaaatacc ccccgtgact gttaaaatca ccagtaagtt tgccttcaga atttatcgtc 
22501 gtaggaactt gggtattatt tgcttttttg ctagagggtg gttgtagata tcgcagttta 22561 tttttctggt attttccagc ttcagtttga taactagcta tataatgtcc attagggtcg 22621 tcttcctggc ttgtccattt atttgcaaaa gttttgctgc tatcgggggt aacaaactgt 22681 gttgtagcta gcaatgtacg ggcaaagttt tgagcaacct cgctcattgt cgggtcaaat 22741 cgcactgcga taattttacc ttgagaattg acagtagcaa aaacttgccg ttttaggtct 22801 tgttgaataa gttgagcttg gtctgttaca tcttgtccgt tggcttttaa agtaactgcg 22861 ggattattta cagtataaga aattagaaag cacgagcctt tttgctccaa gatagtaaca 22921 accaattcac ccttaacact ggtttcaaat gtatttacaa aaccgtttat actagttgct 22981 tcagagcgat tagaagtttt taaatctcca aacaaagccc gcaagtcgga attagaagaa 23041 tttgtataat ctagtttgta ggtaaagcgt tgaccaggag caaattgata aacgggtaag 23101 gtattgttct tagctaagat agtagaagtt aaatctgtac ccaactgact taattgcttt 23161 ccgtatacaa cggtggttgg aactgcgagt gtaagcaaaa gaccagaaag agctagtata 23221 gattttggtc ggaaaaattt ttggtttaag gttgaatgat gttttgttgt caaagacata 23281 ttgcagttct ctaaagtaaa taataatctg tatcaattaa cactaaagtt tgcagagatg 23341 attgtaagaa agtgtgaggt gaagttttta gtagtttggt agaagcactt gataatgcca 23401 agttgccttc aaataattaa ttatttagat tacttaaatg caacttgata tgattaatta 23461 aggatgcagt tagctaatga aaacctcttt ttagcaatta gaaatatatt ttttcagaac 23521 cattgaatag atatccactt ttgttaaagc cattccagtc ccaaatcttg gtcttcaact 23581 gtttgccaaa tactttgaca tagccataga ttttaccgtt gagagctttt atatcgttga 23641 aagcataata gtccgcttta agatacgctc tgttagtacc gggttcaata tctactaatg 23701 ctaaggcact gatatccaga ttgtctttta ggaaagtgag gtctacatca agtccagcag 23761 atacaacctt aatacttgct actgcctgac cataacctct agtgtctaag aacgcattaa 23821 ccttcgcata agcagattta ggggacgcaa ccaaattgta agtaaatcca ccactacctt 23881 gtactccaag tttacctttc attgaaacag gaccaaggga aaagccaaaa tttgcaactt 23941 ctttatctaa agagcgagag tacttatcac caagttgcaa agaagcttgg cgtgcatctt 24001 ggaagttata aacgttgttt ccatgcacga ataaattcat gttggtagta aggttaccag 24061 atgcaggtgc atttaagctt gctgttgccc gtaacaagtc agcgtgattg ttgaacatgt 24121 aacccccagc attggcttct gcataagcac tggaagaaag 
aggtgaagct gttagttcca 24181 ttttgccttt gatgtaagct gcgatgattt ttgggtctcc tgtatccagg ttgaaggttt 24241 tgaggatagt tttagttcta ggaaacggaa atcgcagacc tttgtttatg gcatcgcttt 24301 tgatttgttt cgataaactt ttttgcaatt cgttgttgtt agttttactt ggagctaatt 24361 cggtttcttg atttagtcgc tgtttgataa agtcgagtac tttatttggt tcgagcgatt 24421 ctcttagtgc tgctttagca ctatcctctg gttgatgcgc ctgatcaatc tctgctgcct 24481 gtttttctaa agtagattgg tttatattag attcctgcaa cgttacttcc ttttcaggag 24541 aacgaagcga atatcctagt tcgttaaact gctgttccaa gcgattaatt tcagcatagt 24601 attctgcagc ttgtattttt tcaccattag gcaatgtgag gattgtatct tcggaaacct 24661 ctttacctgt ttctgggtct ttcaactcaa aaggctgaaa tttgataggt tcttgcttct 24721 taagttgtac tagagtatta tcaatattta cctctggtgt tggagcttcc ttaaatgcag 24781 aattatcaat tttggcatta ttaggacgct caaagttgct agaatccact gcttgatttg 24841 cttgcaagtc agttttagcc ataacaggta atgtcgcagc caagtttaag gcataaattg 24901 ctaacattga accaattgaa ctaataaaag ctatacgttt cgacatatct caaagttaat 24961 ccatgtaatt tgatattagg gtcagtgcaa cattaatcac tgtcccgaag attattttgc 25021 tggtacgaat ttggagtaag atttccgact ttttcaaaag tcggaaactt gggtctgtaa 25081 ttaactgtta agcacttaaa aaaaatccgg taagccaaaa gctagtagaa cattctattt 25141 ttgaattact attactatac gtataattag gagttaatgt cacgaataat gcaccacgct 25201 gcacaacaag gataccgcac ggtagtttta aatttacgag atgcagtagg tgaagatttt 25261 aatagcttag ataagttttt gcagtggttt atcaccagta ttgccgagac attagagtta 25321 gggcaacctg tagaagaaca ctggcgcaaa agtttgggta attgtaagat aaaatgccga 25381 acctactttg aaaagtatct cttaccagga gatagtgccg ttgcgatcgc cttagatgaa 25441 gtagatagac tttttgtgca tggggaaata gctggagagt ttttgggaat gctgcgaact 25501 tggcacgaag acgcaaaaac taaacctctt tggcgacaat tgcggttatt gatgttgcac 25561 acacaagttt atactcaact aaatattaat cagtcaccgt ttaatgccgg gacagaaatt 25621 aaattaacag attttactag cgaagaggtg gaatctctgt cacggcagta taaattaaac 25681 tggaaaaata ctcaagttga acaactaatg gcaatggtag gaggacatcc ctatctagca 25741 acaaaagcca tacaggtagt atcgcgccag gatatgactt tagaaagttt attgcaatca 25801 gcacccacag cttctgggat 
ttatcggaac catttagaaa gacactggcg ttatttacaa 25861 gcaaatatcc cactagcaac agcatttaaa acaatagttt tagcagatag tccggttgaa 25921 tttaattcaa atttaaattt agacgatgca gtaaaattat atgattttgg cttggtagaa 25981 ctgcaaacta atagtgtagt gtcacgttat caaatctata gtttatattt ccaagaacgc 26041 ttgggaagtt agcgcttttt attaatatga aaaaacaaaa atgtgtgtag gttgggttga 26101 acaaagtgaa acccaacaat gcttaatttt gttaacgttg ggtttcgttc ctcaaacgcc 26161 agacgcctct gtcgggaaac cctcctggag cttacgctcc ccaacctacg attctctacg 26221 attctcaaat ttattcatgt atatagaatt atgagttact cgaaacaaag taatgggtat 26281 caagtaggag gaagtcttgg cgaatcagct tcaacttatg ttgtacgccg agcagattct 26341 caactttact cagggttaaa aactggtaaa ttttgctatt tattaaattg ccgacaaatg 26401 gggaaatcga gtttgcgcgt acagacaatg gcacgcctta aaaaagaagg ggttgcttgc 26461 gtggcatttg aaatgcgtga attctgccta catgaagtga cagaagatga attttatggt 26521 ggttttgtta gctatctagt aaatgaattt aatttagaga ttgacttaga aagctggtgg 26581 tatgggcata gtttaattca tccggcttta aggttgacta agtttatcga agaaatacta 26641 ctagaacaaa ttccccaaag tattgttatc tttgccgatg aaattgatag cgttttaaat 26701 ttaagtttca aagacgattt ttttgccttg attcgcggat gttataacaa acgtgctgat 26761 aaactcaaat acaaccgtct cacttttgca ctgttggggg tagctactcc tgctgattta 26821 attgaagata aagacaatac accgtttaat atcgaaagcg aagcaattga attaactggt 26881 tttcaattgg atgaagcaac acctctagaa aagggatttg ttggtatcag cagcaatccc 26941 cgtgcagtac ttcaggaaat tttgctctgg actggggggc aaccgtttct gacgcaatgg 27001 atttgtcagc ttgtatcttc taacttgtca cacattgctg ctggagtaga agcagattgt 27061 gttgccacaa ttgtgcgatc gcgcattctc tctaattggt tagcgcaaga taaacaacaa 27121 catttacaga caatacgcga tcgcattctc aacaacgagc aacttgcttg ttggtcgctg 27181 gggatgtatc aaaaagtttt acaagctgga gaattagctg ttgatgacag tccagaaatc 27241 atgaaattgc ggttatcgag tttagtcatc aagcaagaag gtaaattaag agtttataac 27301 cgcatctacc agtcagtttt tgataacact tgggtagaaa aggaactaca aaatatgaga 27361 ccatatgctg aagcatttgc cgcttgggaa gcttcaggac gtgatacgtc gcatttattg 27421 cgtggggatg atttaggctt cgctttggcg tgggcgaatg gaagaagttt gagtgataag 27481 
gattatcagt ttttggttgc tagccaagag ttggaactaa cgcaggtgca gcagagaacg 27541 gaaatcgctt tgctgcaaga gaaacaagcg cagcagaatt tcgttgaggc gcaacgaaaa 27601 gccaagcgca gcgcgcgtct tacctttgcc tcaatgattg caagcttaac gatgggtgcg 27661 attacgatta ttttaactcc aggacctcta gcaattttgt taaataatat agcgttccaa 27721 atttacgctg acgatcatct acaaactgct ttacaggttt atgacttagc actcttaatt 27781 aaaccagcct atccagaagc tctttatacc aaaggaagaa tttacgaaga cctgcaagat 27841 tttgataacg cctccaataa ttacgaatct gccaaagacc ataagtttcc tcaagcatat 27901 agtgaattag ctcgcctaca cattcttaat aaaagatact cccgagctgt tgacataatt 27961 tcagagggtt taaaactgcg tcttacagat cgagaaaagt atggtatgtt taaaaatttg 28021 ggctgggcac aattcgagct agcgcgacag ggaaaagcta cctatgagca agcagaaatt 28081 tccctagatg aagctattga tttacaagaa gagtctgctt ccccccactg tttattagct 28141 caacttctcg aagctaaagc tttaaagtca aaatctctta cagaatggcg taaatgcttt 28201 gaacttgggg actctgacca ccctgatgag cgtaagtgga ttgaggatgc acgtcaacgc 28261 ctcagaatgt agccgattga tgcatttata tcgctaaata tatttaaatt atttgaataa 28321 tctccggaat ctgaatttta agcattgaat aaattctcgt caaagttgaa atatttagaa 28381 ttgaccatgc ttaaaggctc acataccata gcatttgcat taacactgac atcactattt 28441 ctcgtaatta atacaaacat tgcctacgca aagcagaacc ctgaacgtgg aacatatgta 28501 attaacgaac ctgaacctgg agatgcacct ggactaccta gaccacccag accttgttct 28561 agacccggtg gcacatttcc atgtactcga cgtcctgctc ctcccattcc aattattgat 28621 cttgaactta atgatgtcac gaatgacgtt gcaattaaaa aacttgaagc gctgaaagct 28681 tcaggaaaag caaccaacga tgattacata cttttgggtt atttctacag tctggagaaa 28741 aaatacgacc tgtctgaggc taattacctc aaagcgttgg aactcgcaac tgatgatggt 28801 agaaaagcca tcatccaaca agaactcaaa aaagtccgtg atcataatgt aaagcaataa 28861 cttaaatagg tggctatagt agcaaacaaa gccgagaata aagccaacaa cctaccttcg 28921 cgatctgccg aacgcgagtg cgtgcgcttt gcgcgtgagg caagcgaagc tatcgccgga 28981 ttgtcttgga agcctacact cacttttgag caggagtgta ggagatgtca ctatattggt 29041 acgggtaact tttggttaaa ctgaggatcg ctttggtaaa gttactatta cccttgttcc 29101 tttgcccacc tgactttcta aagtaatatc accgccatgt aactctacac 
aaactttcgc 29161 aatagcaagc cccaaacctg ttccagggat tgtgtcaacg ttactcccac gactgaaagc 29221 ttgaaacaag ttttcttgct cgtttatagg tataccaata cctttgtctt taacgtaaaa 29281 gataacttgt tcctcacccc caatcagatg aaactctatc tcacctccat ctggggagaa 29341 tttaatcgca ttagatatta agttaacaaa aatttgccgc aagagtcctc catcacccca 29401 gaaccccttg taattgccag ttatctcata gactaaatga tggcgatctg cgctttgcgc 29461 agcagcatag ctatcgctta atttttcctg ctgctcttct atcaagccag aaaaaaagtg 29521 gtataagtcg agtggctggg gcttaaactg ggctttgttt gactcaaatt ggttaaacac 29581 aagcaagtca tcgaccaatt tagccatgta gcgagctttg tgttcaataa tttgtaggaa 29641 cttctcttgt ttagattcat cgagcttcac gccatgttgc ttaagtgtgg atgttgcagc 29701 aagaatagaa gctaatggcg ttcgatactc gtaagaaact gttgtcatca tttgggattt 29761 aaatgcgttc agtttctgtt cctttctcaa agctgcttga gtttgagcaa gcagttctgc 29821 ttgtttgagc atgaagcgct gtctttccct agttttggtc agcacggttt gcaagtccga 29881 gtgctgaatc accttgcgaa ctgccaactg taacacatcg ggtttgagat gttgcttgac 29941 caagtaatct tggacaccct ttttcatcgc ttgtacagcg accgcttcat caccttggcc 30001 tgtcagcata atcactgatg ccgcagtatc caatctttcc cgcgttagtt gctcaagaat 30061 ttctagccca gtcatatcag gtaggtagaa atcgagtaga atgacatcac agagtatttt 30121 ttggcacaag gcaagtccat ccttagcaga ctctgcttcc aaaatctggt aggactggtg 30181 aggatctttc aaaagatatc gccgatagat tcttcgatcc tgtgcacaat catcaatgat 30241 gagtagcgtc ctggtatctg acataaaagg atgaagcaga cttgctgcac tctttcaaca 30301 agcttcttat ctatcataga atttttaaca aaattttaaa caattaaaaa atttgaattg 30361 gaaattatta ttaattaatg ataacatttt ctcctgtcat acccgcttgg tgtgtgtccc 30421 gctatcgggc aacgtagggg aactagagga gacaggaggg caaattgata gaaaagttta 30481 taaatttttg tgacaaagtt tatcatctga aaaaaaatta actggatgag tcatgaaatt 30541 ttcgtccaaa gtcttgtact catgactcat cattcatgac tcatgattgt tcgctaatga 30601 cttatgctac cttgctctgt atctacagca ggtccagctg tcactacaac aattttatct 30661 ggatacagta actcacgagc ggcttgatta acttgttcaa agtgaactgc ttgaattttt 30721 tcggtaaaat cgcgtagttc ctgtttattt agtccgtaca cctcattcat caggatttga 30781 tcgattaatt cttctgggtt tgccagagat 
acgatgtagt tgctgatcaa gatgtgttta 30841 gctgtctcta cttctggttc tgtgacgcct ttttgatgga tttctttgag tagttcacgg 30901 gtactggcga tcgcctcacg agtatcttct ggacttgttt gcatcgaaat caaaaatgtg 30961 ccggaattgt ttccagcaac gaaactgcta taaattccgt aggttagccc ttggcgatcg 31021 cgcacttctg ctcccagtct acttgataga gtatcgccac ctaaaatctg attcaacacc 31081 aaagctgcat aaaagcggga gtctttgcga ttaatcgcag tgtttcccat ataggtaata 31141 gcttgggctt tacctggaag cactgggtta acattgataa ctttttctgg caaagagact 31201 gttggatact ttaatgttgg tggtgaacca gtggctttcc agtgaccaaa ttcacttgag 31261 atgagtgctt ggacttgatc acaagcaaaa tctcctacca gtgcaagcac catcgtatcg 31321 ggacgataat gttttgtctt aaattcaata acatccttgc ggctaatccg tcgtaaactt 31381 tgctctgtag ggaagacatg taaaggatgt tgtttggggt aaatcgactg tacaaatgtt 31441 ctttgtgcaa cttctgaagg atcatctaat tcgtgcttta gggcggttaa tgcttgtttt 31501 ttagtcagtt ttaactcttt tgttgggaaa taagcatttt tgaccacatc tgctaaagtc 31561 tgaatcagaa cgcttaagtc acctgctaag ctatcacctt caatgcgcac accttctcgg 31621 tacgcatcaa aatgaaggct tgctcctcga tcttctaaaa tttttgcaag agtcaaagcg 31681 tcctttgttt tcgtcccatt cattaagttc tcggcgacaa gagacgccaa tccagcttta 31741 tcttgtggat caaattcctt accagctttg acgtagctag aaagcgtcac cgtgggagta 31801 ctcttgtcgg gcacaagcaa tactcgtaaa ccattagata atgtaaattg ctctggtaag 31861 gtatgacttg taggaacact acgactatct gttggaggta agtactttgt tacttcctct 31921 gtcgtcacag atgctgatgt ggcaaaattt tcagtgttgc gtttggagtt gttgatcttc 31981 ttaggatcat tcccctttcc ctttacctga gtcggcttaa agtagcccac tgtccgtgct 32041 tcttgtttca ggtatttttt tgccacacgc tggacatctt gggctgtcac ttggcgaaca 32101 gctgccaagt agcggtctgt ataatgataa tcaccagcag ttgtttcatc attacccagt 32161 tgcaatgcca aacttgtgat atcacggtta ctcaaaagaa cagatgcttc caacattacc 32221 tttgctcgct tgagttcttc tgttgtgact tctttattta tgagtttggc gatcgcactg 32281 tttaaaactg aatcaatttt tttcaaatct tgatcaggat ctgctgtgac taacaactca 32341 taccagccta attccattaa gttaaccaca gaagcggaaa catcagttgc taaacctgat 32401 tccaccaatg cttgctcaag acgagaattc cgtccgtctg ttaaaatccg atccatcagg 32461 tctaatgctg 
gcacatccgg actctttgca tctggtaagg gatagacagc ttgcaccaat 32521 gctgctgcgc ctgattcctt taaaacaatt ggtaagaagt gtggggtgac aggcgtgggt 32581 tgtaaaactt ttgagtgttg acttttgagt gttgactctt gagttttagg tattttccca 32641 aaaatctctt tcaccgcctc aagggttggt tcagattgga aatctccaac aatcactaaa 32701 acagcgttgt caggactgta aaaattgcgg tagtatttct ggacttgctc caaatcaaat 32761 ctttggacat cagcttcagt tccaccaata ggtaacccgt aagcatgatt gggaaaagcc 32821 gcccgcatca gcgcccggtt taagcggtag tcaggactat tttcgtaacc ttgcaactcg 32881 gaaaccacaa ctcgtttttc cagatcaagt tctgtcggat caatccgagc attttgcatt 32941 ctatctgctt ctagcgtcag cagggctgtg agcttgtcag cttgtgcggt attgtagtat 33001 gccgtttgat catagcttgt gaaagcattt gagtcactgc ccaaagcact caacaaccgt 33061 ccaaattgaa tgggacgatt ttttgtgcct ttaaacatca tgtgctctaa ttgatgggca 33121 ataccattca atccaggttc ttcgttacgt gatcccacct tgtaccagac ctgtacagtc 33181 accaccgggc tggagtgaat ttccctagtc aagacagtta gaccattgtc caacagtgtt 33241 ttctgaacat tctcagtcaa tgaaatacgc tcttttttat gagaaactgt gattttttgt 33301 gaatctaaca atgaggaaga agttgttatc tggctataag aaaatttatc tccaagcaac 33361 aacactgtta ttagcgaaaa agttaacagt aataaacgaa aacggtatcg atgcactttc 33421 ggaaacacag acataaattt tactatattg tctgtaaact aagttacaaa a // LOCUS NODE_823_length_33006_cov_4.91144433006 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 33006) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 33006) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..33006 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 186..1151 /locus_tag="DP116_07005" CDS 186..1151 /locus_tag="DP116_07005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314746.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_07005" /translation="MSGNTTRRNFLITGVAVTGSIVGASTLQQKVGDTAKPPATMLER VLGRTGVKVPIFGLGGAGQTPLSWEGKERDAVAIINKALELGIRYFDTAADYGPSEDY FGKVLPSHRSKIFLASKTDKRDRDGAWRELERSLKRLNTDHLDLWQLHHVSSREELNT IFSSSGAVKALEEAMQQKIVRFVGITGHHDPQVIAEGLRRYPFHTTLIPVNAADKHHP RPFLPVVLPIAQQKNVGVIAMKVPAYGRLFKPGGLSGMQQAMGYSLSQAGVNCCVIAA ETVKQLEDNVKVAQAFQPLGDKELAAIEQRSAAVWKDSTFFRAWT" gene complement(1166..2017) /locus_tag="DP116_07010" CDS complement(1166..2017) /locus_tag="DP116_07010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008180259.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="PRJNA477356:DP116_07010" /translation="MINQLKLNDYKKEIADLYSRRSQTYDNSDWHTQIAHRLVEYAQI SPGQHVLDIATGTGMVAIEAAQIVRPEGRVVGVDISTGMLEVAKQKVEGLSFEHVEFQ LADAEALNFPANSFDRVLCSSALIWMADIPAALRQWMRFLKPGGLIGFHAFAQTAFVG GVVVQKVVEKYGVSLAFNKPTGTVEKCQNLLQQAGFEAIEIQPEQYGSYISLEQAKGM WTGNSHVAPGQFPNPVSQLSSELLAQVKAEFETELEALNTDEGVWNDITVFFTFGRKP VDSSEFR" gene complement(2162..3451) /locus_tag="DP116_07015" CDS complement(2162..3451) /locus_tag="DP116_07015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314745.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_07015" /translation="MIDSNKKERPIAVDLFAGAGGMTLGFDQAGFDVLASIEIDPIHC ATHQFNFPFWRILCKSVVDTTAAEIRSRSSIGDREIDVVFGGPPCQGFSLIGKRIFDD PRNSLVKNFINLVIELQPKFFVLENVKGMTLGKHREFISVIINEFEQSGYKVRKDYKV LNAAEYGIPQNRERLFLLGCRHDLELPNYPAALTRPAKLNKAGFQNQLPLSPTVWDAI GDLPEVEDYIELFEKDSVIAEFGKPSDYSRQLRGLSCTDNDYSYERKYDSRILTSSLR TKHNLESIKRFKATLPGKTEPISRFYKLAPEGICNTLRAGTPSNRGAFTSARPIHPLT PRCITVREAARLHSYPDWFRFHVTKWHGFRQIGNSVPPLLAKAVALEIIKILGVCPSQ PRIIQELGNENLLMYDMSEAAEYYGVEPQTIEPRKRN" gene complement(3760..4929) /locus_tag="DP116_07020" CDS complement(3760..4929) /locus_tag="DP116_07020" /EC_number="2.4.1.182" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipid-A-disaccharide synthase" /protein_id="PRJNA477356:DP116_07020" /translation="MRIFISTGEVSGDLQGSLLITALKRQAAAAGLDLEIVALGGEKM ASVGATLIGNTSEIGSVGIWESLPYVLPTLQMQQRAIAYLKQNSPDLVILIDYMGPNL AIGNYIRRKLSHLRVVYYIAPQEWVWSLNSRNTNMVVGITDKLLAIFPEEARYFQERG AKVTWVGHPLVDRVQNFPSREAARAKLGIAEDVISVALLPASRRQELKHLLPVIFQAA QVIQAKFDHVYFWIPLSLEIYREPIEKAIQQYGLQASVVSGKTQEVIAAADLAITKSG TVNLELALLKVPQVVLYRLHPITAWIARTVLKGSIPFASPPNLVVMKPIVPEFLQEKA TPENITQAALEILLNPERKNQMLADYEEMRQCLGEVGVCDRAAKEILEMLPNLKH" gene complement(5065..5883) /locus_tag="DP116_07025" CDS complement(5065..5883) /locus_tag="DP116_07025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866997.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-[acyl-carrier-protein]--UDP-N- acetylglucosamine O-acyltransferase" /protein_id="PRJNA477356:DP116_07025" /translation="MKTLIHSTAVIHPTAQLHPTVEVGPYAVIGGHVKVGPETIIGAH AVLEGPLEIGARNQIFPGAVIGMEPQDRKYDGELSWVKIGDDNRIREYVTINRATGAG KETRIGNGNLLMAYVHVGHNSVVEDNVTISNSVAIAGHVHIESRAVISGVLGIHQFVH IGSFAMVGGMSRIERDVPPYMLVEGNPSRIRSLNLVGLKRAGFSTDEFQILKKAFRIL YRSELLFKDSLEQLELLGDTQQLQHLRRFLLLSQMPGRRGLIPGKGKVAASDES" gene complement(6101..6634) /gene="fabZ" /locus_tag="DP116_07030" CDS complement(6101..6634) /gene="fabZ" /locus_tag="DP116_07030" /EC_number="4.2.1.59" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872819.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ" /protein_id="PRJNA477356:DP116_07030" /translation="MSTVTEQTNTIDAPTPASSNNQPDDTSTNKADNQIIYSIEDIQK LLPHRYPFALVDRIIEYVPGKRAVGIKNVTINEPHFQGHFPGRPIMPGVLIVEAMAQV GGIVLTQLPELEGGLFVFAGIDKTRFRRQVVPGDQLVMTVELLWVKQRRFGKMQARAE VDGQLATEGELMFSLVS" gene complement(6742..7593) /locus_tag="DP116_07035" CDS complement(6742..7593) /locus_tag="DP116_07035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129785.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase" /protein_id="PRJNA477356:DP116_07035" /translation="MQQHTLADEIIQTGVGLHSGVTTQVRIRPDAAGSGRYFVRVDLP DTPIIPAQVAAVSQTVLSTQLGKGEAYVRTVEHLLASLAAMGVDNARIEIDGPEVPLL DGSAKVWTDAIAQVGLMSQNLTKDKAPLVIDQPIWVRQGDAFACALPAPETRFTYGID FDLAAIGNQWHSCSLPSQNENAYGSFVAEIAPARTFGLLHQIEHLQQTGLIKGGTLDN ALVCGPEGWLNPPLRYANEPVRHKILDLVGDLSLLGIFPCAHFLAYKASHNLHIQLAQ RILDFRF" gene complement(7700..10495) /locus_tag="DP116_07040" CDS complement(7700..10495) /locus_tag="DP116_07040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872821.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07040" /translation="MRLSPVIVTVVAISSPFASLVSANAQTLNSSNSNQTAEVLKPTA DPPQKVVGVLPATAAVATSEVVVPSSTEKSTTTQPLKNFHSTAEALKPLSTPQYKVPG EFAASTRVAKPEVIVPSFTEKSTTQPVNSFHSTAEALKPLTTQQYKVSGKLPATATLV KKLLVAKRSSTASKTAQNLPQTPTPNTQQEQISPQTPTTTPAPENNNQLPSIQQQTPA PSNQQQQTLPQTPGTPTTPDGTNQQQTQPGQNSPQTPFPSIQQQTPAPNNQQQQIPPQ TPGTPTTPDGTNQPGTQPTSPQTPGSPQLEQSPEASEPRVLVSEVFIRSETGQQLAAE LEDQVYRVIRTQPGRTTTRSQLQEDINSIFATGFFSNVQAVPEDSPLGVRVSFVVRPN PVLTKVQVQANPGTNVSSVLPGNAADEIFKDQYGKILNLRDLQEGIKQLTKRYQDQGY ALANVIGAPQVSDNGVVTLQVAEGVVGNIRVQFRNKEGQVTNDKGEPIRGRTQPYIVT RELELKPGKIFNKNTVQKDLQRVFGLGLFEDVNVSLDPSKDPSTGAADPSRVDVVVNV VERNSGSIGAGAGISSASGLFGSVSYQQQNLNGRNQKLGAEVQIGTRDEFLFDLRYTD PWIAGDPYRTSYTANLFRRRSISLIFEGKDKNFETFNPNKPDDDGDRPRITRLGGGVN FTRPLSKNPFERSQWTASAGLQYQRVSIKDADGDIRKQGRLEGTTDQFVELSQSGQGE DDLLLLQLGVARDLRNNPLQPTRGSFFRVGLDQSVPIGLGNIFLTRLRGSYSQYFPVS FINFSKGPQTIAFNLQAGTVLGDLPPYEAFSLGGSNSVRGYQEGGLGSGRSFVQASLE YRFPVFSVVSGAVFFDFGTDLGSGTRPAEILNKSGTGYGYGLGVRVQSPLGPIRIDYG INDDGDSRINFGIGERF" gene complement(10743..11483) /locus_tag="DP116_07045" CDS complement(10743..11483) /locus_tag="DP116_07045" /EC_number="6.3.2.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409449.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphoribosylaminoimidazolesuccinocarboxamide synthase" /protein_id="PRJNA477356:DP116_07045" /translation="MSVKTRVYEGKAKILYTTDEPEILLADFKDDATAFNAQKRGSIE NKGNINCSISSKLFQQLEAHGIKTHYVDSPAPHQMRVKALKILPLEVVVRNIAAGSLC QQTGLALGTVLKKPLVEFYYKNDQLGDPLLTRDRLFLLELATPEQVDTIAHLALQINK FLCDFFGRCDITLVDFKLEFGLDAQQQLLLADEISPDTCRLWDNSKGNDPNLRVLDKD RFRRDLGNVENAYQEVLQRVLKAIESSN" gene 11883..12155 /locus_tag="DP116_07050" CDS 11883..12155 /locus_tag="DP116_07050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013192300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07050" /translation="MERKMIDKDDFLYPRGRYYGQVKPENLVFNANLQEFAQRVSYIC NLETAGKVPPEQAYDQIKELWKNLKNSKKQLGIGEHPFENDEGNSE" gene 12148..12432 /locus_tag="DP116_07055" CDS 12148..12432 /locus_tag="DP116_07055" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07055" /translation="MNKLSLTSLENNSKLHELPFMGFADTFSNQQSMRPFVIAVVAHK NSSKALPDDSSILSKINPTSQAWFQASCVLWNKLLANAGNSNNCLITSSA" gene complement(12416..14989) /locus_tag="DP116_07060" CDS complement(12416..14989) /locus_tag="DP116_07060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316988.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="adenylate/guanylate cyclase domain-containing protein" /protein_id="PRJNA477356:DP116_07060" /translation="MTLLNPGSVLATLTELTQVNRTHSLLSRVKNLSVNEFVCLLDFI TAEFQQFLRAIELINNEALETMLEKVLEAITLKIGQILQAEHTTIFLVDYDKGQLWSK LPQDKTQKAIEIRTPMTVGIPGHVANTGECLNISDTSSHPLFNPELEKQMGYKFRNIL CMPVLSSKNQVVAVVQLANKTVGTPFDYEDEVHFRDFASSIGIILESCHSLYVAARNQ RGATALLRATQTLGQSLDLEATLQIVMEQARILMQADRSTLFLYRKEMAELWTKVASA DGKTTMEIRMPSNKGIAGYVASTGLALNIPDAYKDPRFDPSTDQKTGYATRNILCLPV FNSANELIGVTQLINKHQGSFFTASDEEFMRAFNIQAGIALENARLFENVLLEKQHQK DILQSLSDAVISTDMEGRIVTINGAALQLLGCPIREANAKNNQYLWEQNLIGRLVWEV VPIDNLQLRLQESLKTGARHYVPEQTLTVGLYVETLYLKSGQDASVYILAVRDRTNPN IFIPWNQPLTDRLALLDADNVQKIERSINLTVNPLTNPEGGVRGGLVVLEDISQEKRM KTTMYRYLTPRVAEQVMALGEDALMVGERKDVTILFSDIRGYTTLTENLGAAEVVSLL NQYFETMVEAVFNHEGTLDKFIGDALMAVFGAPLPLTENHAWRAIQSALEMRQRLAEF NQGRLILKKPQIHVGIGISSGEVVSGNIGSRRRMDYTVIGDGVNLSSRLEGVTKEYGC DIILSEFTYQLCSELIWVRELDKIRVKGKHQAVNIYELIGDRHTPLDSNTQEFLFHYQ NGRDAYLSRNFQYAIACFKAAKKIRPKDQAVDIHLERSCHYLNYNPSDSWDGVYSMLT K" gene complement(15491..16624) /locus_tag="DP116_07065" CDS complement(15491..16624) /locus_tag="DP116_07065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456118.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_07065" /translation="MRIALFTETFLPKVDGIVTRLRHTIDHLQRHGHQVLVIAPDGGI VEHKGAKVYGVSGFPLPLYPELKMALPRPAIGEALEEFKPDIIHVVNPAILGLAGIFY SKYLNVPLVASYHTHLPQYLQHYGLAMLEGVLWELLKAAHNQAALNLCTSTAMMQELI GHGIERVDLWQRGVDTESFHPDNACEQMRSRLSQNHPESPLLLYVGRLSAEKEIERIK PILEAIPKARLALVGDGPHRQALEKHFAGTNTYFVGYLVGRELASAFASADAFIFPSR TETLGLVLLEAMAGGCPVVAARSGGIPDIVTDGVNGYLFEPEQDVQSAIDATVRLLQQ QQERETIRQSARQEAERWGWAAATRQLESYYQKVIYSGLAKVA" gene complement(16979..18133) /locus_tag="DP116_07070" CDS complement(16979..18133) /locus_tag="DP116_07070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD-dependent dehydratase" /protein_id="PRJNA477356:DP116_07070" /translation="MKVLVIGGDGYCGWATALYLSNRGYEVGILDNLVRRHWDNELGV QTLTPIAPIQQRIQRWHDLTGRSIDLLIGDITNYEFLSKALHRFEPEAIVHFGEQRSA PFSMIDREHAVMTQVNNVVGTLNLLYAMREDFPNCHMVKLGTMGEYGTPNIDIEEGYI TIEHNGRKDTLPYPKQPGSMYHLSKVHDSHNIHFACRIWGLRATDLNQGVVYGVLTEE TGMDELLINRLDYDGVFGTALNRFCIQAAIGHPLTVYGKGGQTRGFLDIRDTVRCIEI AIANPAEPGEFRVFNQFTEQFSVGDLAMMVKKAGNAMGLNVEINHLDNPRVEREEHYF NAKNTKLLDLGLQPHYLSDSLLDSLLNFAMKYQHHVDQNQILPKVSWRRN" gene 18566..18916 /locus_tag="DP116_07075" CDS 18566..18916 /locus_tag="DP116_07075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016953046.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07075" /translation="MQAKQKVTLYLSPELHRKLKIRSAIDSEPMSELAERALDFYLAN PELVEEMEASYGRTHRVYSCPTCESSVVLRDGELVSLGQQPGIIGQQEERLPIDEVNR DQTNRKGKEELVPC" gene 19012..20541 /locus_tag="DP116_07080" CDS 19012..20541 /locus_tag="DP116_07080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208503.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AAA family ATPase" /protein_id="PRJNA477356:DP116_07080" /translation="MKEELNILIQAQYPLIYLVTSEEERAEQAISTIAQSSKPQRKVY VWTVTHGIVEYGQPRNVTQHNTVSPEAAVEWIIRQKEPGIFILKDLHPFIDAPATTRS LRDAIASFKGSHKNVILMSPMQQVPIELEKEVVVLDFPLPDMGDLNKVLSSHLEQNRG RRLTTEAREKLLRAALGLTKDEAEKVYRKAQVTTGRLTEDEVDIVLSEKKQLIRRNGI LEYIEEDATIDAVGGLEELKRWLKQRSNAFTERAREYGLPQPKGMLILGVPGCGKSLI AKTTSRLWGLPLLRLDMGRVYDGSMVGRSEANLRNALKTAESISPAILFIDELDKSFA GSGGSGDSDGGTSSRIFGSFLTWMQEKKSPVFVMATANRVERLPGEFLRKGRFDEIFF VDLPTPEERQDIFTIHLSXXXHLSKRREDISRFDLEQLSKMSDGFSGAEIEQAIVAAM YEAFAQDREFTQLDIIAAIKATLPLSRTMQEQVTALRDWARQRARPAASSVAEYQRME F" assembly_gap 20241..20250 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 20682..21032 /locus_tag="DP116_07085" CDS 20682..21032 /locus_tag="DP116_07085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1257 domain-containing protein" /protein_id="PRJNA477356:DP116_07085" /translation="MSHFSTLRTKITDAEILKSSLRDLGISVKTEADVRGYNGQRVRS DIVAVLEGEYDLGWSRNSDGSFDLIADLWGVAKKHNQTELINSINQKYAVNKTLAEVK QRGLQNANVKLVLQ" gene 21277..21621 /locus_tag="DP116_07090" CDS 21277..21621 /locus_tag="DP116_07090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007313295.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07090" /translation="MKLPHPEQAVIDRQKLSGYCLNPEHPEGRHKARLFKSVLGIALD DEEELEIALRQAIKNYDVIPTKRNQYGQKYVVDFMMVRGEQRAVVRSAWIVRDTENFP RLISCYILLDKG" gene 21636..21854 /locus_tag="DP116_07095" CDS 21636..21854 /locus_tag="DP116_07095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412332.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4926 domain-containing protein" /protein_id="PRJNA477356:DP116_07095" /translation="MKLLDVVALLKDLPEFDLYRGQVGTIVEEYEPGIFEVEFSDTHG RTYAMETLEADNLMILYHQRLAEDRIAM" gene complement(21948..24851) /gene="gcvP" /locus_tag="DP116_07100" CDS complement(21948..24851) /gene="gcvP" /locus_tag="DP116_07100" /EC_number="1.4.4.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycine dehydrogenase (aminomethyl-transferring)" /protein_id="PRJNA477356:DP116_07100" /translation="MVTNVPRPKSNHQQMSGETNQKLSSFQERHIGPSSSDIQQMLEV LGDSTLDEVIDQAVPQAIRLSDSLELPEAQNEYAALAQLKEIASKNQVFRSFIGMGYY DCITPPVIGRNILENPGWYTAYTPYQPEIAQGRLEALLNFQTMIIDLTGLEIANASLL DEATAAAEAMSISYGVCKNKANTYFVSRDCHPQTIDVLQTRAQPLGIDIIVGNHQTFD FSEPIFGAILQYPASDGTIYDYRAFIEKAHAMGALVTVAADPLSLTLLTPPGEFGADI AVGSTQRFGIPLGYGGPHAAYFATKEEYKRQVPGRIVGVSKDAQGKPALRLALQTREQ HIRREKATSNICTAQVLLAVMASMYGVYHGPTGLKRIAENIHQKTVILAEGLKRLGYS IGSEYFFDTLQVDLGERSLDEILQACEAHKINIRILNTTTVGISLDETTTEKDLIDLL DIFAFGDDLLFPPASAAFPASPTLLLPRTTSYLTHPVFNRYHSETELLRYLHKLEAKD LSLTTSMIPLGSCTMKLNATSEMIPVTWAEFGKIHPFAPLSQTRGYQILFQQLEEWLG QITGFAGVSLQPNAGSQGEYTGLLVIRKYHESRGEGHRNVCLIPQSAHGTNPASAVMC GMKVVAVACDAEGNVDLDDLKAKAQKHSKELAALMVTYPSTHGVFEEQIQEICAVVHT HGGQVYMDGANMNAQVGLCRPRDIGADVCHLNLHKTFCIPHGGGGPGMGPIGVASHLV PFVPGHAVVEMGGEQKMGAVSAAPWGSASILVISWMYIAMMGADGLTEATKVAILNAN YIAKRLESYYPVLYKGKNGLVAHECILDLRSLKKSANIEIDDIAKRLMDYGFHAPTVS WPVAGTIMVEPTESESKEELDRFCDAMIAIRQEIAEIESGKMDAQDNVLKNAPHTAES LIVGEWNHPYSREQAAYPAPWTREHKFWPSVGRIDAAYGDRNFVCSCLPMDAY" gene complement(25206..25781) /locus_tag="DP116_07105" CDS complement(25206..25781) /locus_tag="DP116_07105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130564.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR04376 family protein" /protein_id="PRJNA477356:DP116_07105" /translation="MGLFDDFNRFLENRLEEFLRNNPHLELEALLEQLRGQEEDTLKL IADLQVQEKRSQEQVLSTAQEIQRWHIRVEKAKSLNRQDLAAAAAQREAALLREGNQL WGQMQGVKERITQAKELLRKIQQRRQEVQAKAAEAQTARAKTQAQQPLETSGWWGATS GSFSGHDDLEEKFRRWETDEELEQMKRKMGK" gene complement(26011..26190) /locus_tag="DP116_07110" CDS complement(26011..26190) /locus_tag="DP116_07110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207235.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07110" /translation="MCRYCVRFMAKKELHIRITERRMNKLRLYAAQKDKTITQIVEDL LDTLPEPQKPSITVG" gene 26239..27444 /locus_tag="DP116_07115" CDS 26239..27444 /locus_tag="DP116_07115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006106339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_07115" /translation="MLVLEYKVKGKKQQYQAIDEAIRTTQFVRNKAIRYWMDASREAK INRIALNNYSTVLRKEFKFVEELNSMACQAATERAWSAIDRFYGNCKSKKPGKKGFPR FQKDNRSVEYKTSGWALHPTKRRVTFTDKKSIGEVKLLGKWEIHTYPVKSIKRVRLVR KADGYYCQFAINVDAKPEQRTGDSEIGLDVGLEFFYSDSSGHHEPNPRFLRKAEKAIK HAQRAIFKKEKGRNQRRIARQRYAKKHLRVNRQRNEHAKRIARNVCKANALVVYENLN VKGMVKNHCLAKSINDVAWSLFRRWLEYFAVKFNTAVVAVNPKMTSQKCSDCGAIVKK SLSTRTHKCNCGCELQRDVNAAINILNLAKARGGHPQSNATGVGTSTLVGASLLEQVL TMNVESPSL" gene complement(27469..28848) /locus_tag="DP116_07120" CDS complement(27469..28848) /locus_tag="DP116_07120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_07120" /translation="MLAAVEHETARLKTLYQYEILDTPAEADFDDLTKLAAQICQTPV ALITFVDAYRQWFKSKVGMEITNAPLEAGFCPLTVQKGDTLIIPDTLADPQFAKNPVV VSPPHVRFYAAVPLITKDDYSIGTLCVVDFVPRQLEQKQIEALQTLTRQVMAQLERRL FSYRITEKTQQLDQALKELTHTQTQLMHNEKMVSLGHLVAGIAHEINNTLNFIYANLP HANQYTEDLLSLIKLYQKYYPNPVVEIEAATRAIDFNFIEEDVSKLMSSMTIGAERIH EIVLQLRNFSRVEQTEKRAVNIHEDLENTLLLLGHRLKGISEYPKITVIKEYGHLPEI ECYAGQLNQVFMNILSNAIDSLQCNIVSKINDQPTPDSNPCIWITTKVLDSDYVVIQI ADNGAGMTDKTREMIFNPFFTTKPIGYGTGLGLSISYQIINKCGGQLTCVSAPGQGAK FVIKLPIRA" gene complement(28938..29816) /locus_tag="DP116_07125" CDS complement(28938..29816) /locus_tag="DP116_07125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872986.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase M15" /protein_id="PRJNA477356:DP116_07125" /translation="MKTFLKKTFFYILIAFLSCALVASHGISRYKLMADVPNPKDCVT TPSLENNRVLTQSCTNIPQTLPTQTPSQPFTPNPNLTEKERFFSAITNKLPTIPRNNT FEYILLRAYGSVFVNQNPEIKLPQKVLFTNEQETKQFQSSLTLTQVKNTSQCYLQKSA AEAFNQARSQVQIPLKSGYGASDCTRSFATNLRFWQKYANNQTLEQVRQGKETTILGV VAPPGASQHLWGLAIDLRVTTEAQKQVLNQNGWYRTVENDIPHWTYVGLPLEKLTQFG FQNKVVRNVTYWLTPL" gene 30090..32012 /locus_tag="DP116_07130" CDS 30090..32012 /locus_tag="DP116_07130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456719.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG" /protein_id="PRJNA477356:DP116_07130" /translation="MTMHTCVEFQDAYDVIVVGAGHSGCEAALATARLGCRTLLLTLN LDKIAWQPCNPAVGGPAKSQLTHEVDALGGEIGKMADRTYLQKRILNSSRGPAVWALR AQTDKREYAALMKAIVENQDNLIIREGMVTDLVLGANDEVVGVQTYFGVAFECKAVIL TTGTFLGGRIWVGNKSMEAGRAGEFAAVGLTETLNRLGFETGRLKTGTPARVDKRSVD YSKMTPQPGDEDVRWFSFDPEVWVEREQLPCHITRTTPETHRLIRENLQLSPVYGGWV DAKGPRYCPSIEDKIVRFADKESHQIFIEPEGRDIPELYIQGFSTGLPENLQLHMLRS LTGLEKCVMLRPAYAVEYDYLPATQCYPTLMTKKVEGLFCAGQINGTTGYEEAAAQGL VAGINAARFARGQKMIVFAREQSYIGTLMDDLCTKDLREPYRMLTSRSEYRLILRSDN SDQRLTPLGREIGLIDDRRWELFTRKQEKIAAEKERLYGTRVKEHDEIGQAIAQTTQQ AIKGSITLADLLRRPGFHYVDLNTYGLGNPNLARAEKEGAEIDIKYSGYLQRQQNQID QIARQAHRQLPADLNYQAIETLSKEAREKLTKVKPLTIGQAARIGGVNPADVNALLIY LELHKTKSPKEFSVLA" gene 32369..32884 /locus_tag="DP116_07135" CDS 32369..32884 /locus_tag="DP116_07135" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07135" /translation="MSQNANTYLILPEASEEFIVNEPWAIETYADDLMDELFADINLI LDRSDRLSSQTALPQENIPQVPTVTTPPVVIPETQISLVPSGKQPRNNPLSPVGVATP GVPKLIKKRVKKSKRRKAIAVLMGVTAGLAAACIVGVANSGLFNRLVVIKSFQQSLLQ PHVEKCVQLAD" BASE COUNT 9290 a 6980 c 7221 g 9505 t 10 others ORIGIN 1 aggggagcca gtgcggtctt ggggtctccc caagtagagc acctggcgtt ttccctccgc 61 aggcgactgg cgaacccaga gggtgattcg ttttttatca gtgtttatcc gaacataata 121 ttaacccaga gtatctctag gctaacatac gaaaagaaga gcagatttac cccagcaagc 181 aaactatgtc aggaaacacg acgcggcgta acttcctcat cactggtgtt gctgtcacag 241 gcagtatagt aggagcatca actttgcaac aaaaagtagg tgacactgct aaaccaccag 301 cgactatgct agaacgagta ctgggacgca caggcgtgaa agttcccatt tttggtttag 361 gaggagcagg tcaaacgcct ctgtcttggg aaggaaaaga acgtgatgct gtcgcaatta 421 ttaacaaagc acttgaactt ggcatccgtt actttgatac tgctgctgat tatggaccaa 481 gtgaagatta tttcgggaaa gtactaccat cccatcggtc aaagattttt ctggcaagca 541 agacagataa aagagaccgt gatggtgcgt ggcgagagtt agaaagaagt cttaaacgtc 601 taaacacaga tcatcttgac ttgtggcaat tgcatcacgt ctcttcgcgc gaagaactca 661 acaccatctt tagttcatct ggtgcagtga aagctttaga agaggcgatg cagcaaaaaa 721 tcgtgcggtt tgttggtatc accggacatc atgatccaca ggtgattgct gaaggattgc 781 gtcgctatcc gttccacaca acgcttatcc ctgtcaacgc tgctgacaaa caccacccgc 841 gtccattcct gcctgttgtt cttccaatcg cacaacaaaa aaatgttggt gtgattgcga 901 tgaaagtacc agcttatggt cggttgttta aaccaggtgg tttaagtggg atgcagcaag 961 ctatgggata tagtttgtcc caagcgggtg ttaattgttg cgtgattgcg gctgaaacag 1021 tcaagcaatt ggaagataac gttaaggtgg ctcaagcttt ccaacctctt ggcgacaagg 1081 aattagctgc aattgagcag cgtagtgctg cagtttggaa agatagcacg tttttccgag 1141 cttggacatg aaatcttgct atttcttaac ggaactcact gctgtctact ggcttgcgac 1201 caaatgtaaa aaagacggtg atatcgttcc aaactccttc gtcggtgttt aatgcttcta 1261 attctgtctc aaattcagcc ttgacttgtg ctaacagttc agatgaaagt tgcgatacgg 1321 gatttggaaa ttgtccgggt gctacatggg aatttcctgt ccacattcct tttgcttgct 1381 ccaaactaat atagctgcca tattgttcag 
gctggatttc aattgcctca aatcctgctt 1441 gctggagtaa attttggcac ttttcaactg ttcctgttgg cttattaaac gctaatgaaa 1501 cgccatactt ttcaacgact ttctggacaa caacccctcc aacaaaagcg gtttgtgcaa 1561 atgcatgaaa gccaattaat ccacctggtt tgagaaaccg catccattga cgcaatgcgg 1621 ctggaatatc cgccatccaa atcagcgcag acgaacacaa aactctgtca aaactattgg 1681 cgggaaagtt gagtgcttca gcatccgcaa gctgaaattc aacatgctca aaacttaatc 1741 cttcaacctt ttgctttgct acctcaagca ttccagttga aatatccacg ccgacgacgc 1801 gtccttcagg tcttacaatt tgagctgctt caatcgcgac catacctgtt ccagttgcga 1861 tgtctaaaac gtgttgtcca ggactgattt gtgcatattc aacgagacga tgagcaatct 1921 gagtatgcca atctgaatta tcgtaggttt ggcttctgcg gctgtacaaa tccgctattt 1981 cctttttgta gtcatttaac ttaagttgat taatcattta gaaatttcaa caattacagt 2041 gttatatata gcaatcgtaa atcattctta aaattttctc ttctctcttt tcttggcgtc 2101 cttggcgtct tggcggttcg ataaattctc ataactcata ttcaattgct agaaagattt 2161 tctaatttct cttccttggc tctatagtct gtggctcaac accgtaatat tcagcagctt 2221 cagacatatc atacatcagt aaattttcgt ttcctagttc ttgtataatt ctaggttgag 2281 atgggcaaac accaagaatt ttaataattt ctaacgctac tgctttagct aacaatggtg 2341 gaacagaatt tcctatttgt cgaaatccat gccattttgt gacatgaaat ctgaaccaat 2401 ctggataaga atgtaaacgt gcagcttcac gcacagtaat acatcttgga gtcaatgggt 2461 gaattggtct agcagaagta aaagctcccc tattactagg agttcctgct ctaagtgtat 2521 tgcatattcc ttcaggagca agtttataaa agcggctaat aggttctgtt tttccaggaa 2581 gagtagcttt aaacctttta atagattcta aattatgctt tgttcttaaa cttgaggtga 2641 ggattctaga gtcatattta cgctcatatg aataatcgtt atctgtacag gaaagaccac 2701 gaagttgtct actgtaatca cttggtttac caaattctgc aattaccgaa tctttttcaa 2761 aaagctctat ataatcttcg acttcaggta agtcgccaat tgcatcccaa actgttggac 2821 ttaatggtag ttggttttga aaacctgctt tatttaattt tgccggtcta gtcagggctg 2881 caggatagtt tggtaattct aaatcatgac gacaacctag caaaaataat ctttcacgat 2941 tctgtggaat cccgtattca gcagcattta agactttata atctttgcga actttgtaac 3001 cactttgctc aaattcatta atgatgactg aaataaactc tctgtgcttt cctagagtca 3061 ttcctttaac attttctaaa acaaaaaact ttggctgcaa 
ttctataact aaatttataa 3121 aatttttgac tagggaatta cgagggtcat caaaaatacg tttacctatc aatgaaaaac 3181 cttgacaagg tggtccccca aacactacat cgatttctct atcaccaata gaagagcgac 3241 ttctaatttc tgctgctgta gtatcaacga cacttttaca taaaatacgc caaaaaggaa 3301 aattaaattg atgtgttgca caatgaattg gatcaatttc aatagatgca agtacgtcaa 3361 atcctgcttg atcaaaacca agagtcatgc caccagcacc agcaaataaa tcaacagcta 3421 taggtctttc tttcttatta ctgtcaatca tgtttgatgt ttaagctact cacaagtgaa 3481 ttttataaaa acaaataagg gcgctgttag cagcgccgct agcttgaatc attgacttat 3541 agcgtaacct acatcttgca cctggtaaaa gaaatgtcaa agttattaca tattatttga 3601 ctcccttctc gtcacgggac tgtaagtcct aggctcatag acgaagtcca ttaaaatgga 3661 ctggaacggg atataatttt tgagtaaatt taaacactat ttaatataag cattggtggt 3721 ttacaaggta atggtgcaag atatcaggta attgaaaatc taatgtttca aatttggtag 3781 catttccaaa atttctttag cggcgcgatc gcacactccc acttctccca aacactgccg 3841 catttcttca taatctgcca acatttggtt tttacgttcc ggattaagca gtatttccaa 3901 cgctgcttgg gtgatgttct ctggagtcgc tttttcttgt aaaaactctg gcacaattgg 3961 tttcatcaca actaaattgg gtggcgatgc aaaaggtata gaacctttaa gcacagtacg 4021 cgcaatccaa gccgtgatgg gatgaagtcg gtagaggaca acttgaggca ctttcaatag 4081 tgccaattcc aagttaactg taccagattt ggtgattgct aaatcggcgg cggcgatcac 4141 ttcctgagtt tttccggata ccacagaagc ttgcaaaccg tactgctgta ttgctttttc 4201 aatcggttct cgataaattt ctagagataa aggtatccaa aaatacacat gatcgaattt 4261 tgcttgaatc acctgagccg cctgaaatat cacaggcaaa agatgtttca gttcttgacg 4321 acgagaagcg gggagaagag caacactgat cacatcttct gcaattccta acttggcacg 4381 agctgcttct cgactgggaa agttttgaac tctatctact aaaggatgcc ctacccaggt 4441 cacttttgct cctctttctt ggaagtaacg cgcttcctct ggaaagattg ctagcaactt 4501 gtctgttata ccaacaacca tattagtatt acgtgaatta agtgaccaaa cccattcttg 4561 tggtgcaata taatacacaa cacgaaggtg tgacaacttc cggcgaatat aattgccaat 4621 tgccaaattt ggtcccatgt agtcaatcag tatcaccaag tcaggtgaat tttgtttgag 4681 ataggcgatc gcccgttgct gcatttgcag tgttggtaac acataaggca gtgattccca 4741 aatacccacc gagccaatct cactagtatt acctataagg gtggctccaa 
cactagccat 4801 tttttcgcca cccagcgcca caatctctaa atccaagcca gccgccgcag cttggcgctt 4861 gagtgcagta ataagtagcg acccttgcag gtcgccagaa acttcgccag tactgataaa 4921 tatacgcatt tataaattta ggaataggga gagagggaga gagggaggaa gagagagagg 4981 gagagaggga taatttcctt ttctcatttt gtccgctttt ccctcatctg cctgacatat 5041 tttcttatcc ccttgtcttc tgacttaaga ttcgtcactc gcggcgactt tccccttacc 5101 aggaattaag ccacgtcttc ctggcatttg agaaagtagc aaaaagcgac gcagatgctg 5161 taattgttga gtatccccta aaagttctag ctgttctaag gagtccttaa agagcaactc 5221 agaacgataa agaatacgga aggctttttt aaggatttga aactcgtcag tactaaaacc 5281 agcgcgtttg agtccaacaa ggtttagtga acgaattcgc gatggatttc cttctacgag 5341 catatatgga gggacatctc gctcaatacg gctcatacct cccaccattg cgaagctacc 5401 aatatggaca aattgatgaa ttcctagaac tccactaata actgctcgtg attctatgtg 5461 gacgtgaccg gcgatcgcta cagaattaga aatcgtcaca ttgtcttcta ccacagaatt 5521 atgaccgaca tggacgtaag ccatcagcaa gttaccgttg ccaatacgag tttctttacc 5581 cgcaccagta gcgcggttaa tcgtgacgta ttcgcgaatg cgattatcat ctccgatttt 5641 gacccagctc aattcgccat catatttccg gtcttggggt tccattccaa taaccgcgcc 5701 cggaaaaatt tgatttcgtg ccccaatctc caaaggtcct tctagtactg catgagcgcc 5761 gatgatggtt tcaggaccaa ctttaacatg ccctccaatg acagcataag gacctacttc 5821 cactgtcggg tgcaattggg cagtaggatg aataacagca gtagaatgaa tgagcgtttt 5881 caagggtgca tctccagaac agacttgagt gctgagagtt tagttagcta gtagccgaaa 5941 gtgtgaggta agctactcaa gtgtgagggt gtcgaaaaat atagaactca aataataatg 6001 ctattgtggc ttgtgtcttt tactgatatc cttattcagg atttgataaa ggcacgtcac 6061 ggtcatgcct ttaccacgcc tttaccacaa aagcaaattt ttaggatacg agggaaaaca 6121 tcagttcgcc ttcagtagcg agttgaccat caacttcggc gcgagcttgc atcttaccga 6181 aacgacgttg ttttacccac aacagttcca cagtcatcac tagttgatct cctggtacaa 6241 cttgacggcg gaagcgagtt ttatcgatac cagcaaagac aaacagccca ccttccaact 6301 ctggaagttg agtcaaaaca ataccgccga cttgtgccat tgcttctaca atcagcactc 6361 ctggcataat tggacgtcct ggaaaatgac cttgaaaatg gggttcattg atcgtgacgt 6421 ttttaatgcc aacagctcgt tttccgggta catattcaat aatccggtct acgagcgcaa 
6481 aggggtagcg atggggtagc agcttttgga tgtcttctat agagtagatg atttgattgt 6541 ccgctttgtt tgtgcttgta tcatctggct gattgttgct agatgctggt gtgggagcat 6601 caatagtatt ggtttgttcg gtgacagttg acattgacac tgttatgtga tttctatttt 6661 tgtttttccg taggtgccgc tttgaacctc ccccttttga ttttggattt gggattttgg 6721 atttgagaac caatctaaaa tctaaaatct aaaatctaaa atcctttgtg ccagttgaat 6781 gtgtaaattg tggctggctt tgtacgccaa gaaatgagcg caggggaaaa ttccaagtaa 6841 gcttaaatct cctactaaat ccaagatttt atgacgtact ggctcatttg catatcttaa 6901 tggtggattt agccaacctt ctggtccaca aacaagtgca ttatctaagg ttccaccttt 6961 aattaaccct gtctgttgta gatgttctat ttgatgcagt aaaccaaagg tacgggctgg 7021 agcaatttct gcaacgaagc taccataagc attttcattt tgggaaggga gtgaacaact 7081 gtgccattga ttaccaattg ccgccaaatc gaaatcaata ccataggtaa accgagtttc 7141 tggggctgga agcgcacaag caaaagcatc accttgacgc acccaaattg gctggtcaat 7201 gactagagga gctttgtcct ttgttaagtt ttgtgacatt aagccaactt gggcgatcgc 7261 atctgtccac acctttgccg aaccatctaa aagcggcact tctggaccgt caatttcaat 7321 ccgggcgtta tctactccca ttgcagcaag ggacgccagc aaatgctcaa ccgtgcgaac 7381 gtatgcctca cctttgccca actgagtcga aagcacagtt tgactaaccg ctgcgacttg 7441 ggctggaata atcggagtat caggcaaatc cacacgtaca aagtagcgtc cacttccggc 7501 tgcgtctggt cgtatccgaa cttgggttgt gactccgcta tgcagtccta cccctgtttg 7561 gatgatttca tctgctaatg tgtgttgttg catattgtgt ttttgtcctt cgtccctaat 7621 cagttgtcat ttgtcttaaa gaatgaccaa tgaccaatga ccaatgacta atgactaatg 7681 accaatgact aatgactact tagaaccttt cgccaatacc aaagttaata cggctatctc 7741 catcatcgtt gataccgtag tcaatccgaa tcggtcctaa aggagactgt acgcgcactc 7801 caagaccgta gccatagcca gttccgcttt tgttcagtat ctcagcgggt ctggttccgc 7861 ttcccagatc ggtaccaaaa tcaaaaaata ctgcgccact cactacagag aacactggga 7921 accgatactc aagcgatgct tgtacgaaac tgcgtccact acctaagcct ccttcttgat 7981 atcctcggac ggaattgcta ccaccgagag agaaggcttc gtagggaggt aagtcaccga 8041 ggactgttcc cgcttgaaga ttgaaagcaa tggtttgcgg tcctttactg aagttgataa 8101 agctgacagg gaagtattga ctgtagctac cccgcaacct agttagaaaa atattcccta 8161 
gtcctatggg taccgactga tcaagaccaa cgcggaagaa agaaccacga gtcggttgca 8221 aggggttatt tctcaggtcg cgtgcaacac ccagttgcaa taacagcaaa tcgtcttcac 8281 cttgtccaga ttggctcaac tcaacgaatt gatcagttgt gccttcaaga cgtccctgct 8341 ttctaatatc gccatcagca tctttgatag aaactctttg atactgcaaa ccagctgaag 8401 ctgtccattg cgacctctcg aagggatttt tggacaaggg acgggtgaag ttaacaccgc 8461 ctcctaaacg ggtgatgcgg gggcgatcgc catcatcatc aggtttattg ggattaaaag 8521 tctcaaagtt tttatcctta ccctcaaaaa tcaaggaaat cgaccgacga cggaaaagat 8581 ttgccgtata ggaagttctg tatggatcac ctgctatcca agggtctgta tagcggaggt 8641 caaacaaaaa ttcatctcgt gttccgattt gtacctctgc ccctaatttt tgatttctgc 8701 cattgaggtt ttgctgttga tagctgacgg aaccaaataa accactagca gaactaatac 8761 cagcaccagc accaattgaa ccgctgttgc gctcaaccac gttaaccacc acatccaccc 8821 tactcggatc tgctgcgccc gtactggggt ctttactggg gtcaagggaa acattcacat 8881 cttcaaacag ccccagtccg aacacccttt gtaagtcttt ttgcactgtg ttcttgttaa 8941 agattttccc tggcttcaac tccagttctc ttgtcacgat atagggttgt gtccgtccgc 9001 ggattggttc tcccttatcg tttgttacct gaccttcttt attgcggaac tgtactcgaa 9061 tatttcctac gaccccttct gctacttgca aggtgacaac accgttgtca gaaacttggg 9121 gtgctccgat tacgtttgcc agtgcataac cttggtcttg atagcgctta gttaattgct 9181 tgatgccttc ttgtaagtca cgcaagttga gtattttacc atactggtct ttgaatattt 9241 catctgcggc attaccgggt agtacggatg aaacgtttgt gccaggattt gcctgtactt 9301 gcactttagt caggacgggg ttgggtcgta caacaaaact cacccgcact cccaaggggc 9361 tatcttccgg tactgcttgg acattggaga agaaaccggt ggcgaagatg gagttgatat 9421 cttcttgtaa ttgcgaacgg gttgttgttc gtcctggctg ggtgcgaatg actcgataaa 9481 cttggtcttc tagttcagct gctaattgtt ggccagtttc agatctaata aagacttcag 9541 aaaccagtac gcggggttca gatgcttctg gggattgttc tagttgggga cttcctggag 9601 tttgtgggga tgtgggttga gtccctggtt gattggttcc atctggcgtg gtgggagtac 9661 caggagtttg gggaggtatc tgctgctgtt gattgttagg tgcgggtgtt tgttgttgaa 9721 tgcttggaaa aggtgtttgc ggagaatttt gtcctggttg agtctgttgt tgattggttc 9781 cgtctggcgt ggtgggagta ccaggagttt ggggaagtgt ctgctgctgt tgattgctag 9841 gtgcgggtgt 
ttgttgttga atgcttggaa gctgattatt attttctggt gcaggagtgg 9901 tcgttggagt ttggggagat atttgctctt gctgagtgtt gggagtaggc gtttgcggaa 9961 gattttgtgc agtttttgag gctgtggaac ttctttttgc cactaacagc ttttttacaa 10021 gagtggctgt tgctggcaat tttccagaaa ctttgtactg ttgagtggtt aaaggtttta 10081 aagcttctgc ggttgagtga aaactgttga cgggttgtgt cgttgacttt tctgtgaaac 10141 ttggcactat gacttccggt tttgccactc ttgttgaagc tgcaaattcc ccaggaactt 10201 tgtactgtgg agtggataaa ggttttaaag cctctgcggt tgagtgaaaa ttcttgaggg 10261 gttgtgtcgt tgttgacttt tctgtggaac ttggcactac gacttccgat gtcgccacag 10321 cggctgttgc tgggagaacc ccgacaactt tttgcggagg atcagctgtc ggtttcaaaa 10381 cttctgctgt ctggtttgaa tttgaactat tgagggtttg tgcatttgca ctcactaaac 10441 tagcaaaagg cgatgaaatc gccacaactg tgactataac gggagataag cgcattttat 10501 ttagattcct cttcacatcc acacaccatc acaaagtaat catcctttcg agcaaaggtt 10561 atgtaaaaat tacatttcta ctcctgcaat gtacctttat aaggcaaatt gcaaaagaca 10621 taaagtcaat aaaactcatt tgcttgtcct ttttaccaaa agtatgagga attggggatc 10681 accgatgaag gattgggaat aacctaattc tcagtaaaca gtctccaata tcctctctcg 10741 cttcaattgc tgctttctat agctttgagt actcgttgta aaacctcctg gtaagcattt 10801 tctacatttc ctaaatcccg acggaagcgg tctttgtcca gtacccggag atttgggtca 10861 tttcctttgg agttgtccca caatcgacaa gtatcaggac taatttcatc tgccaatagc 10921 aactgctgtt gtgcgtctag accaaactcc agtttgaagt ctactaaggt aatatcgcac 10981 cgcccgaaaa agtcacagag aaacttgttg atttgtaatg ctagatgggc aatagtatcc 11041 acttgttccg gagttgctag ttctagcagg aataggcgat cgcgtgtcaa taatgggtct 11101 ccaagttggt cgtttttata ataaaactcg accaatggtt tttttagcac agtccccagc 11161 gctaatcctg tttgttggca gagacttcca gcagcaatat tcctaacaac gacttctaat 11221 ggcaaaatct tcaatgcctt aacccgcatt tgatgaggag cagggctgtc tacatagtga 11281 gtctttatac catgcgcttc cagttgctga aatagtttac tggaaatgct acaattgata 11341 tttcctttat tctctatgct acctcgcttt tgggcattaa aggcagttgc atcgtcttta 11401 aagtcagcta ataagatttc aggttcatcc gtcgtgtaaa ggattttagc tttgccttcg 11461 tatactcttg ttttaacaga catggctaaa ggtaaagttt atgagctatg aggagttttt 
11521 atgctttcgg gcttaaccat tttatcttta gtcattagtt attagtcatt aataacagac 11581 aaatgatcta agcacccgga tctatgcttg ctttagttaa gactgtagtc tgtaactagc 11641 aaagaaactt ttcaataaga gtatagctgt ttacatcaac ttgttatatt ttttaactag 11701 ataaattatt aatgatgaat tcttaaagag taaaaaatag tacattcttc ggtaacaatt 11761 ttggctgaag ttgaatacaa tcattagttg aaggcataga aagtgagtga gcaaccaaag 11821 gaaactctaa ccttggttac gagttattca cttcaaagcc gaaaacaggg ggtattgagt 11881 taatggagag aaaaatgata gataaagatg attttctcta tcctcgtggt cgctactacg 11941 gtcaggtcaa gccagaaaac ctagttttca atgctaactt gcaggaattt gcgcaaagag 12001 tgagttacat ttgtaactta gaaacagcag gcaaagtacc accagagcaa gcatacgatc 12061 aaattaagga gctttggaaa aacttaaaaa actccaaaaa acaactcggg atcggtgaac 12121 atccttttga gaatgacgaa ggaaacagtg aataagttgt cactgacctc tttagaaaat 12181 aactcaaaat tgcacgagct tccatttatg ggtttcgctg atacattttc aaatcagcag 12241 tcgatgcggc cttttgtaat cgctgttgtt gctcataaaa atagctcaaa ggcactccct 12301 gatgactcgt caattttgag caagattaac ccgacctctc aggcatggtt ccaagcctcc 12361 tgtgtccttt ggaacaaact tttggcaaat gcagggaaca gcaacaattg cctgattact 12421 tcgtcagcat agagtatact ccatcccaag aatcactggg gttgtagtta agataatgac 12481 atgatctttc taaatgaata tcaacagctt ggtctttggg tcggattttt ttggcagctt 12541 taaaacaggc aatagcgtat tggaaattgc gtgacaaata agcatcacgt ccattttgat 12601 agtgaaacaa aaactcctga gtgttgctat ctaggggggt gtggcgatcg cctatcaact 12661 cgtagatatt aactgcttga tgttttcctt ttacccttat tttgtccaac tcacgcaccc 12721 aaatcagttc gctacacaat tggtaagtaa attcgcttaa gataatatca cagccgtatt 12781 ccttggtgac gccttctaga cgtgaactca aattgacacc atctccgatg actgtatagt 12841 ccatccgtct tcgagaacca atattgccag aaacgacttc tccagaacta attccaatac 12901 caacatggat ttgtggcttt ttcaggataa gtcgcccttg attaaactct gccagtcgtt 12961 ggcgcatttc taaagctgac tggattgccc tccaagcatg attctccgtt aacggtagtg 13021 gtgcaccaaa cactgccatc aaggcatcac caataaactt atctaaggtg ccttcatggt 13081 taaaaactgc ttcgaccatc gtttcaaaat actgattcag caaggatacc acttcggcag 13141 caccgagatt ttctgttaac gtggtgtaac ctctgatatc 
agaaaacaaa atcgttacgt 13201 ctttgcgttc gcccaccatc aaagcatctt cccctagtgc cataacttgt tctgcaactc 13261 ggggagtcag gtagcggtac atggtcgttt tcatgcgttt ttcctgactg atgtcttcca 13321 ataccaccaa accacccctg actccacctt ctggattcgt caaggggttg acggtgagat 13381 tgatgctacg ttcaattttc tgaacattgt ctgcatcaag caacgccaag cgatctgtca 13441 gaggttgatt ccaaggaatg aaaatgtttg ggttggtgcg atcgcgtact gccagaatgt 13501 agacagatgc atcctgccca gactttagat acaacgtctc tacatacagt cctaccgtta 13561 aagtttgctc tggcacataa tgtcttgccc cagtttttaa actctcttgt agccgcaatt 13621 gtagattatc aataggcaca acctcccaca ctaagcgacc aatcaagttt tgttcccaaa 13681 gatactggtt gttttttgcg tttgcttctc taatgggaca acctaatagc tgcaatgccg 13741 caccattaat tgtcacaatt cttccttcca tatccgtaga tataacggca tcggaaagac 13801 tttgtagaat atctttttga tgctgttttt ctaataaaac attttcaaat aagcgggcat 13861 tttctagagc aatccctgcc tgaatattaa aagcccgcat aaattcttca tctgatgcgg 13921 taaaaaagct accttgatgt ttattaatta actgcgtgac gccaattaat tcattggctg 13981 agttaaacac aggtaaacac agaatattgc gggtagcata ccctgtcttt tgatctgtgc 14041 tggggtcaaa acgtggatct ttataagcat cgggaatatt tagcgcaaga cctgtggatg 14101 ctacgtagcc tgcaatccct ttgttagagg gcatacggat ttccatcgta gttttaccat 14161 ctgcagatgc taccttagtc caaagttccg ccatttcttt acgatataaa aataaagtgc 14221 tgcggtctgc ttgcatgaga attcgggctt gttccatgac tatttgcaaa gttgcttcaa 14281 gatccaaact ttgtcctaaa gtctgagttg cccgtaaaag agcagttgca cctcgttgat 14341 tacgagctgc aacatacaaa gagtgacaac tttccaagat aataccgata gaagaagcaa 14401 agtcgcgaaa gtgcacttca tcttcataat caaatggagt acccacagtt ttatttgcca 14461 gttgcacgac cgccacaact tggtttttgc tgctcaaaac tggcatacat aatatattgc 14521 ggaacttgta gcccatttgt ttttctagtt ccgggttaaa aaggggatga ctagaggtat 14581 cagatatgtt taaacattca cctgtattag caacatgacc gggaatacca acagtcatag 14641 gagtccgaat ttctatagcc ttttgagtct tatcctgagg aagttttgac cataactgac 14701 ctttgtcgta gtcaactaaa aaaattgttg tgtgttctgc ttgcaaaatt tgaccaattt 14761 taagtgtaat cgcctccagc actttctcca gcattgtttc taaagcttcg ttattaatta 14821 attcaattgc tctaagaaac 
tgctgaaact cggctgtgat aaagtcaagt aagcaaacaa 14881 attcgttaac ggaaagattt ttgacgcgag acagtaagga gtgagtgcga ttaacttgag 14941 tcagttcagt taatgtagcc aggacgctac caggatttag gagtgtcatg gggattttag 15001 attctatggt aaatcgtaga tagtaggttg taggtagtgg ttagtcggtc gtgggtagtg 15061 gttagtcggt cgcaaaacaa ccaacaccta gccactcata accgataact aacaactaag 15121 taggtcggca ggatgaaact aaagtatgta actttttgta aaggaaaagg gctttgatca 15181 aaagtatgta aaactgggct accctggcgc aaagctgtgc gagggaagta ctctgtgcaa 15241 gactttgcaa aaaaagatgg attaattcgt caataagact agttactgag aagaaaccgc 15301 tttttttgcc gccaccacag aaatctgaag tgcccacgat gatcactcgt ggcagatttg 15361 cagttttaca caagttcaac ctaagggaaa gggtaaaagt caaaatagaa aaaaattttc 15421 tatttgaggc tttcccttct accgttccta acatagttct gtttatttgt actcacctac 15481 ttaataatcc ctatgccact tttgctagtc cagaataaat caccttttga tagtaacttt 15541 ctagctggcg tgtagcagca gcccatcccc aacgttctgc ttcctgacgc gcactttgac 15601 ggatagtttc tcgttcctgt tgctgttgca gaagacgaac tgttgcatca atcgcacttt 15661 gaacgtcttg ttctggctca aataaatatc cattcactcc atctgtcaca atatcaggaa 15721 tgccaccaga acgtgctgca acgactggac atccacctgc catagcttct aggagaacta 15781 atcctaatgt ctctgttcga gaaggaaaaa taaatgcatc agcactcgcg aaagcagaag 15841 ctaattctct acccaccaaa tacccaacaa aataagtatt tgttccagca aaatgttttt 15901 ccagtgcttg acgatgggga ccatctccta ctaatgctag ccgtgctttg ggaatagctt 15961 ctaagattgg tttgatacgt tcaatttctt tttcggcgga aagacgccct acgtaaagta 16021 acaatgggct ttctgggtga ttttgcgaca ggcgcgatcg catttgttca caagcattgt 16081 caggatgaaa tgattcggta tccactcctc tttgccacag atctaccctt tcaataccgt 16141 gtcctatcaa ttcctgcatc attgctgtgg aggtacatag atttaaagct gcttgattgt 16201 gagctgcttt tagtagttcc cacaatactc cttccaacat tgctaaaccg taatgctgaa 16261 gatactgagg cagatgagta tggtaagacg cgaccaaggg gacatttaga tatttgctat 16321 aaaatatacc agctaatccc agaattgctg ggttaacgac atgaataata tctggcttaa 16381 actcttccaa tgcttcaccg attgctggac gaggcagtgc catttttaac tctggataca 16441 gtggcagggg aaaacccgac acaccatata ctttagctcc tttgtgctca acaatgccac 16501 
catccggggc aataactagg acttggtgac catgacgttg aagatggtca atggtatgac 16561 gcaggcgtgt tacaattcca tcaaccttgg gtaaaaaggt ttcggtaaac agggcgattc 16621 gcataaacaa ctatttagta caagccaagt atgaactatg aaatgaaggt aaaggtttga 16681 agtataaagc atgtagttaa gaatgcctat ttcatccttg atccttctaa aatatagtcg 16741 tcatactggg tcacaatatg acgactatat tttagacggg caacgttcct taagaaagaa 16801 ctcaccaacc cagaataacc ttacagcatc tactgaagta ttcagcattg aggaaactgt 16861 tgttagattt tttggggact tgcaacctca actcctccgt agcgtggtta gcgactaata 16921 ctgaattggt gagttggtga gttatgtttg tttttcacta cccgtcttca ggatttacct 16981 aattcctccg ccaagagact ttgggtagaa tttgattttg atcaacatga tgctggtact 17041 tcatagcaaa gttcaagaga gaatcgagta gggaatcaga tagatagtgg ggctgcaagc 17101 caagatccaa caatttggtg tttttggcgt tgaagtaatg ttcttctctc tcaactctgg 17161 gattatctaa atgattaatt tcaacattca atcccattgc attaccggct ttcttgacca 17221 tcatagccaa atcaccgaca ctgaattgtt cggtaaattg gttaaatacg cggaattctc 17281 caggttcagc tgggttagca attgcaatct caatacatcg tactgtatct cgaatatcta 17341 agaatccccg agtttggcca cctttaccat aaacagtcaa cggatgacca atcgctgctt 17401 gaatgcagaa acggtttagt gcagttccaa agacgccatc gtaatcaagg cggttaatca 17461 acagttcgtc catgcctgtc tcttcggtta aaacaccgta gacaacgccc tgattcaagt 17521 ctgttgctct taagccccaa atccgacaag caaagtggat attatggctg tcatgaactt 17581 tgctcaagtg atacattgaa ccgggctgct tgggataagg aagggtatct ttacgcccat 17641 tgtgttctat ggtgatgtat ccttcttcga tatcgatgtt gggtgtgccg tattcaccca 17701 ttgtccctaa tttcaccatg tgacagttag ggaaatcttc ccgcatcgca tacagcaagt 17761 tcaacgtacc aactacgttg ttcacttgag tcataactgc atgttcgcgg tcaatcattg 17821 aaaaaggtgc tgaacgctgt tcaccaaaat gcacaattgc ttccggctca aatctgtgca 17881 gcgccttact aagaaattcg taattagtaa tgtcgccaat caaaagatca atagatctac 17941 cggtcaaatc gtgccagcgc tgtatacgtt gctgaattgg ggcgattggg gtcagagttt 18001 gcacccccag ctcgttatcc cagtgccgtc gcactaaatt atctaagata ccaacttcat 18061 aaccgcgatt tgaaaggtaa agtgcggttg cccaaccgca atatccatcg ccaccaataa 18121 ccaggacttt cattttcacc agtttttact cgctgatagc taaatctacc 
aggtttgtgt 18181 cccctctaat catgaaagat ggggcaggtg gggtgattaa gtgatgagga agtcaggggt 18241 gcctttgaac gtaattctgc ctttgttcaa aagagttcag gacttaggca aacaaacccg 18301 gtttctacca aaaatttttg tttcgcaacc aaatacgaac gaagaaaccg ggtttctagc 18361 agatagtgcg taagtcctag agttgtcgaa tttttgacta ttgtattgcc cctgccagta 18421 gtacttgagt tgaattcgtc attcaaaaca gctttgcatt aagaaatgta acaaaaaagc 18481 tgtcaaaacg ctttgacgtc ttgacaagca attggggttc cgatatcgtg atatacatac 18541 ccgggtaaga ccgttaaaaa gtaatatgca agctaagcaa aaggttacgt tgtatctgtc 18601 gccagaactg cacaggaagt taaaaattcg ctcggcaatt gactccgaac cgatgtcaga 18661 actagctgag cgtgccctag acttctacct ggcaaatcct gaattagtag aggaaatgga 18721 agcatcatat ggaaggacgc acagagttta ttcctgtcca acttgcgaga gttcagtagt 18781 attgcgagat ggcgaattgg tcagtctggg tcagcagccc ggaattattg gtcagcagga 18841 agagcgtctt ccgattgatg aagtgaatcg ggatcaaaca aatagaaaag gtaaggaaga 18901 gctagttcct tgctaagcag gaattatgaa tattaagttc ataacttgct tctcttgtac 18961 atagcaaagc gaagtctgta ctgtctgtaa aggtctcaag taggtcgatg tatgaaagaa 19021 gagctcaata tcctcattca agctcaatac cctctaatct accttgtgac ctccgaggaa 19081 gagcgggctg agcaggcaat ttctacaatc gctcaatcgt caaagccaca gcgaaaagtg 19141 tatgtgtgga cagtaaccca cggcatcgtg gagtatggtc aaccccggaa cgtcactcaa 19201 cataacaccg tttctccaga agccgcagtt gaatggatta tccggcagaa agaaccaggt 19261 atatttattc ttaaagattt acatccattt atagatgcgc cagcgacaac aaggtcattg 19321 cgtgatgcga tcgccagctt caaaggatcg cacaagaatg ttatcttgat gtcaccgatg 19381 cagcaagtcc caattgaact ggaaaaggaa gttgtcgttc ttgattttcc gcttccagat 19441 atgggagatt taaataaagt tttatctagt catttagagc aaaatcgtgg ccgacggcta 19501 acaacagaag cgcgtgaaaa acttttgagg gctgctttag gattaactaa agatgaagct 19561 gaaaaagtct accgtaaggc acaggtaacg acagggcgtc tgacggaaga tgaagtagat 19621 atagttttat ctgagaaaaa gcaactcatt cgacgcaatg gtatcttaga atacatagaa 19681 gaagatgcaa ctattgatgc tgttggtggt ttggaagagc tgaagagatg gctgaagcag 19741 cgctctaatg cttttacaga gagggcacgt gagtacggtt tacctcaacc aaaaggcatg 19801 ttaatcttag gagtcccggg ttgtggtaaa 
tccttgattg ccaaaacaac ttctcgactg 19861 tggggtttgc cactgttgcg attagatatg gggcgagtct atgacggctc aatggtagga 19921 cgttcagagg caaatttacg taacgcccta aaaacagcag aatctatttc acccgcaatt 19981 ttatttattg atgagttaga taaatcattt gctggaagtg gaggttctgg agattctgat 20041 ggtggaactt ccagccggat attcggttct ttcctcacct ggatgcaaga aaagaaatca 20101 ccagtgtttg tcatggctac agccaaccga gttgaacgtc tacctggtga atttttgagg 20161 aaaggtcgct ttgatgagat tttctttgta gatctgccca ccccggaaga gcgccaagac 20221 atctttacga tacacttgtc nnnnnnnnnn cacttgtcta agcgccggga agacatctcg 20281 cgatttgacc tagaacaact ttctaagatg tctgacggat tttctggagc agaaattgaa 20341 caagcgattg ttgcggcaat gtacgaagct tttgcccaag atcgggagtt cacacagtta 20401 gatattattg cggccattaa agcaacactg ccgttgtctc gtacgatgca agaacaagtg 20461 acagccctca gagattgggc tagacaacga gcaagaccag ccgcatcctc cgttgctgag 20521 tatcagcgaa tggagttcta aaagctttct cctgctacca cagggggaaa ggctagcccc 20581 acaaagctag cagttatgaa aaaaccgcgt cttggtaaac gcggcttctg ttaaaacaaa 20641 ctgttgtctt tttctctact ttctcattgg aggaaaccca aatgtctcac tttagcactc 20701 tgcgtaccaa aatcaccgat gctgaaattc ttaaatcttc cctgcgcgat ctaggcatct 20761 ctgttaagac cgaagctgat gttcgtggtt acaacggtca gcgtgttcgt tctgacatcg 20821 ttgcagtttt ggaaggcgag tatgacctgg gttggtctcg caacagcgat ggttccttcg 20881 atctgatcgc agacctgtgg ggtgttgcta agaaacacaa ccaaaccgag ttgatcaact 20941 ccatcaacca gaagtatgcc gttaacaaga ccctggcaga agtgaagcaa cgcggtctgc 21001 aaaacgccaa cgttaagttg gtattgcaat agtcatatct ctgcgcgttc ccaaagctgc 21061 acgggttaac caggttaata gcggttagcc cgcttttttt acggactagg aattatccta 21121 gtccgttaat tttttttatc tgttataaat ttactatctc gttattgttg gcgatcgccc 21181 taatatagca gggaccaccc gagcagcctt taaagtcccg tggtttccgt attttctcat 21241 tagttgatgt cctaatctac ctggcaactg ctatatatga aacttcctca tcccgaacag 21301 gcagttattg acaggcaaaa actttccggt tattgtttga atccagaaca tccagagggg 21361 cgacacaaag cgcggttgtt taagtctgtt ttaggaatcg ctttagatga cgaggaagaa 21421 ctagagatag ccttgaggca agctattaaa aattatgatg tgattcctac taaaagaaat 21481 caatatgggc 
agaaatatgt cgttgatttt atgatggtta gaggtgaaca acgagcagtt 21541 gtgaggagtg cgtggattgt acgagataca gaaaattttc ctcgtttaat aagttgttat 21601 attcttttag ataagggctg agtatgaatt cacaaattaa gttacttgat gtagttgcat 21661 tgctgaaaga tttaccagag ttcgatttgt ataggggaca ggttggaaca attgttgaag 21721 agtatgagcc tggaatattt gaagtagagt ttagcgatac tcatggtcgc acttatgcga 21781 tggaaacatt agaagctgac aatttgatga ttttatatca tcagcgattg gcagaagata 21841 ggattgctat gtagcattgc tgaataaatt agtattagga ttagtaagta actcttcatc 21901 tcaaaagtat tgagcgagga ataaaaactc ctcactcctc attctactta ataagcatcc 21961 attggcagac aagaacaaac aaaattccta tctccatagg cggcgtcaat gcgaccgaca 22021 ctaggccaga atttgtgttc gcgagtccac ggtgctgggt aagcagcttg ttcccgtgaa 22081 tagggatgat tccattctcc gacaatcaga ctttctgcgg tgtggggtgc attcttcaag 22141 acattatctt gagcatccat cttacctgat tcaatttccg cgatttcttg gcgaatcgca 22201 atcattgcat cacagaaacg gtctaactct tctttggatt cgctttctgt gggttctacc 22261 atgattgtac ccgccacagg ccaggaaaca gttggtgcat ggaaaccata atccattagg 22321 cgcttggcaa tatcatcgat ttcgatattt gcagattttt tgagcgatcg caaatccaaa 22381 atgcactcat gagcaactaa accatttttc cctttataca aaacgggata gtaagattca 22441 agtctcttgg caatgtagtt tgcattcaaa attgcgactt tggttgcttc cgtcaaacca 22501 tctgcgccca tcatagcaat atacatccaa gaaatcacga ggatactcgc gcttccccaa 22561 ggcgcagcgg aaactgcacc cattttttgt tcaccgccca tttcaacaac agcgtgtcca 22621 ggaacaaaag gtaccagatg ggaagcgacg ccaatcggtc ccataccagg accaccgcca 22681 ccatgtggaa tacagaaggt tttatgcaag ttcaagtgac aaacatccgc accgatatct 22741 ctcggacgac aaagccccac ttgagcattc atattcgccc catccatgta aacttgtcca 22801 ccatgagtgt gaacaacagc acaaatttcc tgaatttgct cctcaaacac accatgagtt 22861 gagggatatg ttaccatcaa tgccgcaagt tctttactat gcttttgagc ttttgcctta 22921 aggtcatcta aatcaacatt accttcagcg tcacaagcaa cagcaactac cttcatcccg 22981 cacatcacag cacttgcagg gtttgttcca tgtgccgatt ggggaatcaa acaaacattg 23041 cggtgtcctt ctcctcgact ttcgtgatac tttctaatga cgagcaaccc tgtatattca 23101 ccttgagaac cagcatttgg ttgcagggaa actcccgcaa atcctgtgat ttgtcctaac 
23161 cattcctcta gctgctggaa caggatttga taaccccgtg tttgtgatag tggtgcaaaa 23221 ggatgtattt tgccaaactc agcccatgtg actggtatca tctcagatgt ggcattcagc 23281 ttcatcgtac atgaccccaa gggaatcatc gacgttgtca gtgacaaatc cttcgcttcc 23341 agcttgtgca ggtagcgcaa caactcagtt tctgagtgat agcggttgaa gacggggtgg 23401 gtaaggtaac tagtggtgcg gggtaagagg agagtagggg aagctgggaa agcagcggaa 23461 gcagggggga agagtaggtc gtcgccaaaa gcgaaaatgt ccaagagatc gattaagtct 23521 ttttctgtag tggtttcgtc taaggatata cctacagtag ttgtgtttaa aatgcgtatg 23581 ttaattttat gtgcttcaca agcttgtaaa atctcatcta aactgcgttc tcctaaatcg 23641 acttgtaggg tatcaaagaa atattcagaa ccgatgctgt aacccaaacg ctttaatcct 23701 tctgccagga tcacagtctt ttggtggatg ttctcggcaa ttcttttgag tccagtagga 23761 ccgtggtata caccatacat actcgccatt actgccagca acacctgggc tgtacagata 23821 ttactagtcg ctttttcgcg gcgaatgtgc tgttcacgag tttgcaaggc aaggcgcaac 23881 gctggtttcc cctgagcatc ttttgataca ccaacaattc gccctggaac ttggcgctta 23941 tactcttctt tggtcgcaaa ataagctgca tgaggtccac cgtagcccaa gggaatacca 24001 aaacgctgag tgcttcctac agcaatatca gcaccaaatt cacctggggg tgttagcaaa 24061 gtcagactta aggggtctgc tgctactgtt accaatgctc ccatagcatg ggctttttct 24121 ataaaagcgc gatagtcgta aattgtgcca tcgctagcgg ggtactgaag aattgcccca 24181 aaaatgggtt cagaaaaatc aaaagtttga tgattgccga caataatatc aattcccaga 24241 ggttgagcac gtgtttgtaa cacatctata gtttggggat gacagtcacg agagacaaaa 24301 taggtatttg ctttattttt gcaaacacca tagcttatgc tcatcgcttc tgcggctgct 24361 gtggcttcat ctagcaatga agcattcgca atttctaaac ctgttaagtc aataatcatg 24421 gtttggaaat tgagcagcgc ttcgagtcgt ccttgggcaa tttctggctg atagggagtg 24481 taagcagtat accaaccggg gttttctaga atattgcgtc caatcacggg tggggtgata 24541 cagtcgtagt accccatacc aataaacgag cggaaaactt gattttttga agcgatttct 24601 ttgagctgag ctagtgctgc gtactcattt tgtgcttctg gtaactctaa tgaatcagat 24661 aaccgaattg cctgcggtac tgcttggtca ataacttcat caagagtact atcacccagc 24721 acctcaagca tttgctggat atcactggat gatggtccaa tgtgccgttc ttgaaaagaa 24781 cttaactttt ggttagtttc ccctgacatc tgctgatggt 
ttgatttagg acgaggaaca 24841 ttagttacca caaactgctc tccgacgcga ctaattcata ttttgcaaca attctgtttg 24901 aggcgttggg tataatttgt caatttatca aaattaatga agtttttcca gaaagccgtg 24961 aaggagacaa ggggaaagac cccacccctt tatcccctcc ccgctcgcgg ggaggggata 25021 aaggggtggg gtgcagtcaa cgtgggaatc ataactaatt agccgaacat gatataaaaa 25081 gcacaacaca tcatagattt caaccctgtt ccccaggttt agacttcttg caaaaccccc 25141 gatatttagc cgcacattat ggtagcttta tttggtgcgc ccctgtacta tgactgtttc 25201 ttggatcatt tccccatctt tcgcttcatt tgttctaact cttcgtcagt ttcccagcgg 25261 cgaaacttct cttctaaatc gtcatgtcca ctaaaagagc cgctagtcgc accccaccaa 25321 ccgctagttt ctaagggctg ctgcgcctga gtcttagcac gcgctgtttg agcttcagct 25381 gctttggctt gcacttcctg tcgccgttgt tgaattttgc gcaatagttc tttcgcttgg 25441 gtgatgcgtt ctttcactcc ttgcatttgt ccccaaagct gatttccttc gcgcaacagt 25501 gcggcttctc gttgtgctgc agccgctgct aagtcttgtc tattcagaga ttttgccttt 25561 tctacacgaa tatgccacct ttggatttct tgagctgtgg agagaacttg ttcttgcgat 25621 cgcttctcct gcacttgtaa atctgcaatc agcttcaacg tgtcttcctc ttgcccacgc 25681 agctgctcca acagcgcttc taactccaaa tgcggattat tgcgcaagaa ttcctccaaa 25741 cggttttcta aaaaccgatt aaaatcgtca aataagccca ctgctagaac tccagaggtt 25801 aggtgcttct tttattgtag taattataag gaaggctagc tgaaaaccct tgaggatgtg 25861 tcctcctgag tcgtacacat ccgtacttta ggcaagggga cgccagatgc ctaggtcggg 25921 agaccctccc gcagcactgg ctcatgaaag ctgctgacgt aggacgccac atgcttcaag 25981 tcggcagagc cgcccaacgc agtggctcct ttagcctacc gtgatgctgg gtttttgtgg 26041 ttcgggtaaa gtatcaagca aatcttcaac tatttgggta atagttttat ccttctgagc 26101 agcatataag cgtaatttat tcatcctacg ctctgtaatc ctaatatgta attctttttt 26161 tgccataaat ctgacacaat agcgacacat ttatgttaac ataggttgag ttcataaaaa 26221 tgaggaggtg ataacgcagt gctagtttta gagtacaaag taaagggtaa aaaacaacag 26281 tatcaagcta ttgatgaagc tattaggact actcagtttg tccgaaataa agcaattaga 26341 tactggatgg atgcttcaag agaagcaaag attaacagga ttgctttaaa taactactct 26401 accgtactgc gtaaggaatt taaatttgtt gaagaattaa actcaatggc ttgtcaggct 26461 gctactgaaa gagcctggag 
tgctattgat agattctacg gtaattgcaa atcgaagaag 26521 ccgggaaaga aaggttttcc acgttttcaa aaagataatc gttccgttga gtataagact 26581 agtgggtggg cattacatcc tacaaaacga cgtgtcactt ttactgataa aaaaagtatt 26641 ggagaggtca agctattggg taagtgggag attcacactt accctgtaaa gtcaatcaaa 26701 cgagttcggt tagttagaaa agctgacggt tactattgcc aatttgcgat taatgttgat 26761 gcgaaacctg agcaaagaac aggtgatagt gaaataggtt tagacgttgg attggagttt 26821 ttctactctg attcaagcgg gcatcatgaa ccaaatccaa ggtttctaag aaaagctgaa 26881 aaagctatta aacacgctca aagagcaatt ttcaaaaagg aaaaaggtag aaaccaaaga 26941 cggatagcta gacaaagata tgcaaagaag catttaagag taaatagaca acggaatgaa 27001 cacgcaaaga gaattgcgcg taacgtatgc aaggctaacg ccttagtcgt ctatgaaaac 27061 ttaaatgtga aaggcatggt aaagaatcat tgtcttgcta agtccatcaa tgatgtggct 27121 tggagtcttt ttcgtcgttg gttagaatat tttgctgtta agttcaacac cgccgttgtt 27181 gctgtcaacc ctaaaatgac atctcaaaag tgttcagatt gtggtgcaat tgtgaaaaag 27241 tccctttcaa ctcgcaccca taaatgtaat tgcggatgtg aactccaaag agatgtgaat 27301 gcagcaataa atattctcaa tcttgcaaaa gctaggggag ggcatcccca aagtaacgct 27361 acaggagttg gaacctctac actagttggt gcaagcctac tagagcaagt tctgacaatg 27421 aatgtagaat ctcctagcct ttaggcagga gagtgtcaat aaggaaattc aagcacgaat 27481 tggcagcttg atgacaaact tagcaccttg tcctggtgct gaaacacacg ttaattgtcc 27541 gccacacttg ttgataattt ggtaactaat cgacaaaccc aaacctgtac cgtaaccgat 27601 aggtttcgtg gtaaagaacg ggttaaaaat catttcgcgt gtcttgtccg tcatacctgc 27661 tccgttgtca gcaatttgga tcacaacata atcagagtct aaaactttcg ttgtgatcca 27721 aatgcatgga ttcgagtcag gggttggctg atcgtttatc tttgatacga tattgcattg 27781 aagggaatca attgcattac tcagaatatt catgaacacc tggttgagct gtccagcata 27841 gcattctatt tcaggcaggt gaccgtactc ttttataact gtgatctttg gatactctga 27901 tatacctttt aggcggtgtc caaggagcaa cagtgtattt tcaagatcct cgtgaatatt 27961 cactgctctt ttttcggttt gctcgacccg tgagaagttc cgcaattgta agacaatttc 28021 gtgaatgcgc tcagccccaa ttgtcattga ggacataagt ttggatacgt cttcctcaat 28081 gaaattgaag tcaatcgccc tcgttgcagc ttcaatttcc accactggat tgggatagta 28141 
tttctggtaa agtttaatta agctcagcaa atcttctgtg tattgattgg cgtggggcaa 28201 attagcatag ataaaattga gtgtattgtt aatttcatgg gcaattcctg caactagatg 28261 acccaaactg accatttttt cgttgtgcat cagttgggtt tgtgtatgag tgagttcttt 28321 taaagcttgg tctagctgct gagttttttc agtaattcga taactaaaaa gccgtcgctc 28381 taattgcgcc atcacttgac gggtgagagt ttgtagtgct tcaatttgtt tttgttcaag 28441 ttgacgcggt acaaaatcga ctacacagag tgtacctatc gaataatcat cttttgtaat 28501 caggggtact gctgcataaa accgaacgtg aggcggtgag acgacaacgg ggttcttggc 28561 aaattgtgga tctgctaaag tatctggaat gattaacgta tcaccttttt gcacagtcaa 28621 cgggcagaaa ccagcctcaa gtggggcatt cgtaatttcc ataccaacct ttgatttgaa 28681 ccattgtcgg tatgcgtcta caaaagtaat caacgccact ggggtttggc aaatctgtgc 28741 tgccaattta gtcaaatcat caaagtccgc ttcagcaggc gtgtcgagta tttcatactg 28801 ataaagtgtt tttagccgtg ctgtttcgtg ttccacagca gctagcatct tatcaggttg 28861 tttctgcatc aattttggca aggtggcgtg gcacttttgc gctgagtatt tcatgagata 28921 acaaaactta attctaatta tagtggtgtt agccagtatg taacatttct tactacttta 28981 ttttgaaacc caaactgagt taatttctca agtggtaaac ctacataagt ccaatggggt 29041 atatcattct caacagtccg ataccagcca ttttgattga gaacctgttt ttgggcttca 29101 gtggtgactc tcaaatcaat ggctaagccc caaagatgtt gtgatgcacc tggtggtgca 29161 acaacaccaa gaattgttgt ttctttgccc tgacgcactt gttctaaagt ctgattgttt 29221 gcgtattttt gccaaaatct caaattagtg gcaaaactgc gagtacaatc actcgcacca 29281 taacctgatt ttagagggat ttgtacttgt gagcgtgctt gattgaaagc ttcggctgct 29341 gacttttgta aataacattg actcgtattc ttcacttgag tcagtgttaa actcgattga 29401 aattgtttgg tttcttgctc gttagtaaag agaacctttt gtggaagttt aatttccgga 29461 ttttgattaa caaaaactga accataagca cgtaatagta tatattcaaa tgtattgtta 29521 cggggaattg ttggtaattt atttgtaatt gctgagaaaa atcgctcttt ttcagtcaag 29581 tttgggttgg gagtgaatgg ctggctggga gtctgagtag gaagtgtttg aggaatattt 29641 gtacaggatt gagttaacac tctgttattt tccaaagatg gagttgtgac acaatctttc 29701 ggatttggaa catctgccat caacttatag cggctgattc catgacttgc gactaatgca 29761 caagaaagaa aagcaatcag aatgtaaaaa aatgtcttct ttaaaaaagt 
tttcattttc 29821 atattcacta cggtaggtac atcctcaaag agaattgcta tattagctgt atatcactgt 29881 gacttgaaaa tagggaacag ggaacaggga acagggaaca ggctaagagg ttttttcatg 29941 tgttctggcg tacgcagttc atgatggcta ctaaaaagaa aaaacgtaga tataacagag 30001 tgatttatct gacatttcca tattcaatct tcagcaatgc aatcaccgta tcagctaata 30061 ttgagattga ctatatgctt gtttaaacta tgaccatgca cacttgtgtt gaattccaag 30121 acgcctatga tgtcatcgtc gtcggtgcag gtcactctgg ttgtgaagcc gcccttgcga 30181 ccgcacgcct cggctgtcga actttgctgt tgacactaaa tttggataaa atagcttggc 30241 aaccctgtaa cccagcagtg ggtggtccag ctaaatccca gttgactcac gaggtggatg 30301 cactcggcgg agaaattggc aaaatggcag atcggactta tttgcaaaag cggattctca 30361 actcttcgcg aggacctgct gtttgggcgt tgcgtgctca aacagataaa agagagtacg 30421 cagcactaat gaaagctatt gtcgagaacc aagacaactt aatcatccgc gaaggaatgg 30481 tcacagattt ggttttgggc gcaaacgatg aagtcgtcgg cgttcaaaca tattttggtg 30541 tggcgtttga gtgcaaagca gttatcttga caaccgggac gttcttggga ggacggattt 30601 gggttggtaa caagtcgatg gaggcgggac gcgctggaga atttgctgct gtcggtttga 30661 cggaaactct caatcgcttg gggtttgaaa caggaagact caaaacagga acccccgcac 30721 gggtagacaa gcggtctgtt gactacagta agatgacacc acagccaggg gatgaagatg 30781 tccgttggtt tagctttgat ccagaggttt gggtagaacg ggaacaactt ccttgtcata 30841 tcacccgcac gacgccagaa actcatcgcc tgattcggga aaatctgcaa ctttcaccag 30901 tttatggcgg gtgggtggat gctaaaggac cacgttattg tcccagtata gaagataaga 30961 tagtccgctt tgctgataag gaaagccacc aaatttttat tgaacctgaa ggaagagata 31021 ttcctgaact ttatattcaa gggttttcta caggattacc ggaaaatttg caactccaca 31081 tgttacggag tctgactggg ttggaaaagt gcgttatgct ccgcccagct tatgctgttg 31141 agtatgatta tttaccggcg acacagtgtt atccgacact gatgacgaaa aaggtagaag 31201 ggctgttctg cgctggacag attaacggca caacaggata cgaagaagca gcagctcaag 31261 ggcttgttgc tggaattaat gctgctcgct ttgctcgcgg tcaaaaaatg attgtttttg 31321 cccgtgagca aagttacatc ggtacactga tggatgactt gtgtacgaaa gacttacggg 31381 agccttaccg tatgctcacg agcaggtctg aatatagatt gatacttcgt tcggataatt 31441 ccgaccagcg cttaacacca ttgggacggg 
aaatcggctt aattgatgac cgacgttggg 31501 aactttttac tcgcaagcaa gaaaagattg cagcggaaaa agaacggttg tatgggacac 31561 gggtgaagga acatgatgaa attgggcagg cgatcgccca aaccacccaa caagcaatca 31621 aaggctcaat taccctagct gacttgttac ggcgtccagg atttcattac gtagatctca 31681 acacatacgg actgggaaac cccaatcttg ctcgtgctga gaaagaaggt gcagaaattg 31741 acatcaagta ttctggctac ctacaaagac aacaaaatca gattgaccaa attgctcgtc 31801 aagcacaccg ccagttacct gcagacttaa actaccaggc gattgaaact ctttctaaag 31861 aagcgcggga aaaactgacc aaggtgaaac cattgacaat agggcaagca gctcgtattg 31921 gtggagtgaa tccagcggat gtaaatgctt tattaattta tctagaattg cataaaacga 31981 agtctccgaa ggaattttca gtcttggctt gaaaacaatt ccacaagagc gagtatagtt 32041 ataggtgaaa tctgaagaga ctagtatcgg tgatgacatt aaattaagac agtgccatca 32101 ttcaaagtct ctatattcaa tgagaatctc aaatccacct gaaatctact cgttatattt 32161 aacagggaac agtcaacgct taacagggaa cgcttaacag ggaacagtga acaattatca 32221 agcaattgat aactgataac tgataactga taactgatga ctgataagtg ttttaagacg 32281 agagattgga ggtttcatct aaggttttta ttcgcgggca gagggggaac aatatttatc 32341 ttgctaggag ttaatagtca agaacgccat gtcacaaaac gccaatacct atctcattct 32401 tccagaagca tccgaagaat tcatcgttaa cgaaccctgg gcgatagaaa cctatgccga 32461 tgatttgatg gatgaactct ttgctgatat taatctcatt ttggatcgtt ctgatcgctt 32521 atcttcccaa acagcattac ctcaagagaa cattcctcaa gtaccaacgg ttacaacacc 32581 gccagttgtc atcccagaaa cacaaatcag ccttgtacca tccgggaagc aacctagaaa 32641 taatccattg agtccagttg gagtcgcgac tccaggggtt cccaaattaa taaagaaacg 32701 agtcaaaaaa tcaaaacgca gaaaagccat agcggttttg atgggagtca ccgcagggtt 32761 agccgctgct tgcattgttg gggtagcaaa ctctgggctg ttcaatcgtc tagttgtcat 32821 caaatcattc cagcaaagcc tactgcagcc gcatgtagag aagtgcgtac agttagcgga 32881 ttgatcagaa ctagggggat aggggagtgg gggtgtaagg gcaaaacaag tcaaaaccat 32941 aaggtaagaa gtcgtcttgg aaagtttgct acggagggaa accctcctgg caactttccg 33001 caaaat // LOCUS NODE_824_length_32975_cov_8.65762532975 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 32975) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 32975) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..32975 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" assembly_gap 213..238 /estimated_length=26 /gap_type="within scaffold" /linkage_evidence="paired-ends" repeat_region 271..1176 /inference="COORDINATES: alignment:crt:1.2" /inference="COORDINATES: alignment:pilercr:v1.02" /rpt_family="CRISPR" /rpt_type=direct /rpt_unit_range=271..307 /rpt_unit_seq="gtttcaatccctaatagggagaaagagaaatttcaac" gene complement(1481..3547) /locus_tag="DP116_07140" CDS complement(1481..3547) /locus_tag="DP116_07140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNB domain-containing ribonuclease" /protein_id="PRJNA477356:DP116_07140" /translation="MEKGTLVEFRLGSDRRLGVVDRPDGKSRFFVVDERGQSHSLAPR QITYTVTGETYKSSQIPGFLEEVKPYLEPTSLEVAWELLVEGGETVTPCEMANLLFSE SQPPHCYAAYYLLSDDKIYFKQKGDAYEPRSAAQVAERKHQLEVEALKAKGQQEFLAR VEQALKGEPVEWQRYDRHRLEALEKYAALVADIVRVGLNYDSLARAYPPPALVLETMN MLGRPATPQGAFQLLVDLGSWSPHENLFLRRSSIPVQFASKVLEVSQQRLESHPPDRD VNRLDLTHLKVYTIDDESTTEIDDGLSWELLPDGKERLWVHIADPTRWLIPEDELDLD ARKRGSTVYLPTGMVPMFPEVLATGPMSLVQGKVCYALSFGIILDDTGAVEDYSIHPS FIKPTYRLTYEDVDEVLDLGVEAESEIAAIANWARRRKAWRYAQGAISINMPEAMIKV KGDNISIDILQDSTSRQLVAEMMIVAGEVAARYGQKYNIPLPFRGQPQPELPPEQELL QLPAGFVRACAMRRCMPKSEMSISPVRHAGLGLDTYTQATSPIRRYSDLLTHFQLKAH LRGEVLPFSAEQLKEVMMTVSSITQEVTMVERQTNRYWALEYLRRQPDEIWEATVLMW LREDSGLALILLEDLGLQLPMLFKRSVKLGEQMLVKVFHADPLRDVIQFQEIIYQEAQ PTANSM" gene complement(3656..3871) /gene="rpsR" /locus_tag="DP116_07145" CDS complement(3656..3871) /gene="rpsR" /locus_tag="DP116_07145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490626.1" /note="Derived by automated computational analysis using gene 
prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S18" /protein_id="PRJNA477356:DP116_07145" /translation="MSYYRRRLSPIKPGEPIDYKDVDLLRKFITERGKILPRRITGLT SQQQRELTLSIKRARIMALLPFINAEG" gene complement(3874..4068) /gene="rpmG" /locus_tag="DP116_07150" CDS complement(3874..4068) /gene="rpmG" /locus_tag="DP116_07150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411241.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L33" /protein_id="PRJNA477356:DP116_07150" /translation="MAKSKGARIIITLECTECRTNSDKRSPGVSRYTTTKNRRNTTNR LELKKFCPNCNRHTVHKEIK" gene complement(4260..4802) /locus_tag="DP116_07155" CDS complement(4260..4802) /locus_tag="DP116_07155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453781.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07155" /translation="MSIEQVPPKHYPKVDFGRRGMASGIDFLCVSVVSSLLGSSQLGV QIVQILVFAIAWVILRVVVPYNNQGQSLGRYAFDIKVLEIERGRVPDLQSLLKREGIV GLGALLVSIALSNIIRNPTAILLFIPLAIDCGAALSDTQLRQALHDRYAKTMIISSRR GYSLDIKIKRLVEKTRRNMR" gene complement(4873..6465) /locus_tag="DP116_07160" CDS complement(4873..6465) /locus_tag="DP116_07160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741895.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_07160" /translation="MNKASINILLVEDNPSDAKLLQQTLWHLGKEKWHVVHLERLSDA LSACSKRVFDIVLLDLSLPDSQGLKTLAKFSTAAPNVPIVVLTGFDDEDIALQAVANS VQDYLVKGQITPKLLEHVIRYAIERGQILNQLQECQHRLRGVFKQTPQSIVLVTCSGM IVEMNQSALNLWGTQQQDCVGKPLWELESWNLSSANPGWLKSIIAKAADGESLRHELH LRGANDAMLWIDFSVRPLKDETGKVVLLIVEAWDISEQKRAEAEMIKAWQQERELNEM KSSFVSMVSHEFRNPMSVIRTAIELLESYNHQLSDPQRSKYFGKIQTAIRQMQQLLDE VLFWGKSDAGKLQYEPTLLDLQNFCSELTQNLQLSANGKHQIIFRFQGKPTPVLVDEN LLRYILTNLLSNAIKYSPQGGVIQFDVICQDDTLTFQIQDSGIGIPIKDQQLLFETFH RASNVGSIPGTGLGLSIVKKCVELHQGQICLESQVGVGTTFTVKLSLNHQSPQSLELI ESGSDLQSLNHKSHRLSAGKNA" gene 6890..8158 /locus_tag="DP116_07165" CDS 6890..8158 /locus_tag="DP116_07165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315688.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molybdopterin molybdenumtransferase MoeA" /protein_id="PRJNA477356:DP116_07165" /translation="MLSVSDAEGIIFNLVQPLNTQQDTELVDLLFAGDSFAAALCADR ILASEVTSQLDFPHWDNSAMDGYAVRYEDVQDCSEDKPAVLEIVEEIPAGVQPKSTIQ SGQAARIFTGAVMPAGADTVVMQEKTRREENRVIILTAPKLQEFVRKRGMYYRAGTQL VPAGIPLKASEIAVLTAVQCTQLKVFRRPRVAIFSTGDELVTPDKPLQPGQIVDSNQY ALATLIRQIGAEPIMLGIVKDKPEALKEAIAYAIAHADVVISSGGVSVGDYDYVEQIL ESLGGEIHVRAVATKPGKPLTVATFKNDPRPILYFGLPGNPVAALVTFWRFVQPAIKK LSGLAHGWEPVFLKALTRQELCSGGKRETYVWGQLYVKNGVYEFQPAGGLQNSGNLIN LAQTNGLAVLPVGTTLIPAGEEVQVLQVGS" gene complement(8253..11816) /locus_tag="DP116_07170" CDS complement(8253..11816) /locus_tag="DP116_07170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phytase" /protein_id="PRJNA477356:DP116_07170" /translation="MVTTNTVRFSQFNASLNRSAEGQLVTDLSTPDNAQAKAVAEIIQ RNNPDVLLINEFDYNQADPLSPVRLFQKNYLGVSQNGANVVDYPYVYIAPSNTGIPSG FDLNNDGTVVTTPGTRGYGDDAFGFGEFPGQYGLLLLSKYPIDTANVRTFQNFLWKDI PGSLLPTIALPDSQTPWYSEEEQAVLRLSSKSHWDVPITVNGETIHALVSHPTPPTFD GPEDRNGKRNYDEIRFFSDYITPGKGVYIYDDAGKKGGLAAGSRFVIMGDQNADPFDG DSYNNAIRQLLLNPNINTNFIPSSSGGSQQAILQGGANLNHRGNSAFDTADFSDTNPG NLRTDYVLPSADLNITNSAVFWPLNTDPLFPLVGTYDSSLSGGFPSSDHRLVWADLQV PPTEAGKTIPDVNFSGQTIFPTGFIPNGAAGTVDGKETPVGGLSGVTYDAANNRFYSI SDDRSQIAPARFYTFTLETASPQKTDLSVTFTDVTTLKDENGKEFLLNSLDPEGIALT KNNTVFISSEGEVNVSAGRVTNPFVNEFSLTTGQQVRSLPVPSKFLPVVQDTNGNGVI DAGDTQISGVRNNLAFESLTITPDQKTLYTATENALFQDGPTATLTDGSRSRILQYNL VSGQPEKEYLYKTDAIATLPNPTTGSGDNGLVDLLALDNRGTLLALERSFSAGVGNTI KIYEVSLQGATDIKYYDSLNNLSAEQLADIKPAEKRLVLNLNSLNLPTGTDNIEGIAF GPKLDDGRQSIVLVSDNNFSQTQFTQILALGADLVPTVAPTVETRPDLLNDPNLPRDQ RADADDPAIYVNSSNPEQSLVLTAVKNAGLRVYDLSGNLLQEFNPGNIRYNNIDLQYG FKLGGDSVDIAVATDRNNDKLAIFKINSSPSTSGQYLEDITDSSIGTLFQSSPFEPPY SPSSRSAYGIALYHSPVTDDYYVFANRRETGDVAQYKLIDTGNGKIGAERVRNFTVPT TAGRDAQLEGTVADQELGYLYIGQEDVGIWKYQAEPNGGTTGTLIDKVKDLGGSYVED DVEGLSIYYAKDGTGYLLASSQGDSTFVVYTREGQNDFVGRFGVGNNGGIDSVQESDG ADVINVPLGPNFPYGVFITQDGRNLPAKIVDGENVNTNFKFVPWENIAYAFPNPLTID TSSYNPRNPNTNLVNGSVSSDITQSSSVLRVDNNLTGDVTW" gene complement(12198..13058) /locus_tag="DP116_07175" CDS complement(12198..13058) /locus_tag="DP116_07175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013192348.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_07175" /translation="MINKSSQKILIIEDNAMSRKIFLDGLEAEGFDTIGAENGTIGIQ KAQEHLPDLVICDIMMPDMDGFGVLRMLRQDPVTAIIPFIFLTGSASNESLRKGMELG ADDYLTKPCTVQQLLRAIAVRLEKQVKIMQYWYTACCQNFSAPVAPDTASSVDSESIF PSVPQLKEVFDYIEANYDQGITLCDVAEAVGYSSAYLTNRVGKITGETVNNWIVKRRM SAARSLLQDTNQTIEQIALALGYQNACHFSRQFRQHHGIPPQTWRKEHQLIRNSKVGD YKTSLNLSQS" gene complement(13096..15051) /locus_tag="DP116_07180" CDS complement(13096..15051) /locus_tag="DP116_07180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118018.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PAS domain-containing sensor histidine kinase" /protein_id="PRJNA477356:DP116_07180" /translation="MLFDTSSQSVLQPEEELKLAQFLINQVADAAFSVGLNAQILYVN DAMCRMSEYSREELLSMTLQDIDVDFSAHNWSDKWIALKEKGSLTFKFRYQTRTGRVF LVEMNLTYIEYQGKEFGCAFARDSSDELVGLSVQQYIDRTSDPKEEFEQEVIEYQRTQ TELETSLSLLRSTLESTANGIVAVNFEGEILCYNQKFLEMWQFPNSVSLSKKSHRAKG FFENQVKYPEIFRQAVWEMPSQSDKQSYDLVELKDGRVFAHYSEPQRLGDKIIGRVWS IWDVTESRKTEEALRLNEARFRTLAETTEASIFLICDSRICYANSAAEVLTGYPKKEL FNNFNIDRLITSKKLRQVHKQNGAGYSEYQEMQIRTKNGVERWLACTVGVLDGMLDFA RKPVELLTAIDITDYKQAESELHQALEHAKRLSELRERFVSMLCHQFRTPLNIVSFSA DLLKRHIHQWTEEKNRSYLDLITVAVQQISELLDEILLYGQAESARLECQPRQLNLER FCTDILAQVQIAGGNQKAINFVSQGNCSTGYLDPKLLQHILTNLLSNAIKYSPTSSTV TFRLYCQNNQVVFQIEDSGIGIPVVDQQQIFEPFYRGSNIDNIPGTGLGLSIVKTLVD LHGGEITVESVVGVGTTFTVNLPTVSS" gene 16294..18165 /locus_tag="DP116_07185" CDS 16294..18165 /locus_tag="DP116_07185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thiamine pyrophosphate-binding protein" /protein_id="PRJNA477356:DP116_07185" /translation="MSQNYTQTNSLVGTQPYDVYQPTDSSKDQTNGSSNGHSNGAAVA QKRSPTVADAIANMMEDLGVSCAFGVNGGAMAGLWGSLSNSLLQVMNCRHEAGAAFAA AEAYFATGRPTVVFTTAGPGITNALTGLFAARGEGAKVILLSACTSSPQRGRWAIQET SSHTLPTEGIFTSGTLFNYATTVESGAQLPQIFRKLALGLSQPGAFVAHLSIPTAVQT SPLDRMPLFQGMDCSLVVPHTEAIVKCTQLLSEGPFAIWVGFGARGAAEEILQLAERT GAAVICSPRGKGIFPEDHPQFVGVTGLGGHGSVMTYMQQQTPLRTLVLGTRLSEPTSF WNEALVPPGGFVHVDIDPTVPGVAYPEAETFAIQSDIKAFLQALLQHDEHLASASVTA LSLPRPEGQTIQPGSDSPVRPEVLMEAIQKVIIDGTDAIVMAECGNSFLWSTHLLRFA NANRYRISTGVGAMGHAAAGVIGAAAARNGKAVAIVGDGAMLMNNEISTAVKYEIPAV WIVLNDARYNMSHQGMEMLGLKGADASIPQADFAAIAHAMGAEGIRINKEMDLFSALE QAMVATGPIVIDVVINPDRRAPSKGRNAGLASQGVKSTPAQKTELHVSFPQVSFPNA" gene 18239..19297 /locus_tag="DP116_07190" CDS 18239..19297 /locus_tag="DP116_07190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748782.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="leucine dehydrogenase" /protein_id="PRJNA477356:DP116_07190" /translation="MNLFETVTEMGHEQILFCHGKDPDIKAIIAIHDTSLGPAMGATR LWPYASEAAALKDALRLSRGMTYKAACANIPVGGGKAVIIANPENKTEDLFRAYGRFV ESLKGRFITGQDVNLTPEDVRTISKETQYVVGVEERSGGPAPVTAWGVFLGLKAAVEF RLQTENLKGLRVAVQGLGNVGQNLCRHLHEHGAKLFVTDISPDKTEQVKRLFGATVVE PDEIYSLDVDVLSPCALGGILNSETIPRIKASVIAGAANNQLGVEVLHGQMLAAQEIL YAPDYVINAGGLINVYNEMIGYNEQRAFKQVNNIYDTLLEIFDRAQKQDITTNDASKQ LAEDRILKARHLKTLAVV" gene 19309..20265 /locus_tag="DP116_07195" CDS 19309..20265 /locus_tag="DP116_07195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748783.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07195" /translation="MVTERNTFGTSAYIETTPESAFEYLAELKNLGEWTLYSRMEEQI DEDTWLGTASGYQTKLYYHVERLDHPNFYGIEWHCGLEYQKYYQVYPVLLFPADYVEP GTDEKGVYLHWVSFVDPKRRSPLIMEGIHTVHTSECRSLKGNLERKAGLKTAAKGRYY VDTDTIYVNAPIEMGIEYLSDLKNMDDWAHLLRADGEISGQSGEFRDEYGQKVKVTQR LQAVNKSYLLEHEFFYPDYGFYQRSPVLLIPTSHAFRDPEAPGFIQHRITFWKTGEQL PHGKLQIEDFGSESLNIKRILEGKAGNLNTLAQGLSYMPQAK" gene 20431..21552 /locus_tag="DP116_07200" CDS 20431..21552 /locus_tag="DP116_07200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ScyD/ScyE family protein" /protein_id="PRJNA477356:DP116_07200" /translation="MKLEQLTITSFTVLADGLDNPKGLSFGPDGSLYITEAGTGGDGA SVPSPSGQGSLLFGTTGAILRVNNATIERIVTGLPSLAFPDGTGAAGPHDIKFDTTGK PYVLVGYAANPALRDSTFGETDLGKIITPDFQTNSWTTVADIANYELTHNPDGGDVVS NPLAFLIDGDKIVVVDPGANDLLSVGTDGSHLEAIAVLPQQPVINPIFPGFNSQNFDR GHVPPPSAYRNATPSQITIQSVPTGIAKGPDGAYYISTFTGFPFPEGGAKIYRVDADG QLTVYADGFTQLIDLAFDAQGNLYALQHMNSSGWKGNLDGSLIKIAQDGTRTTILSGD GLQAPSALTIGPDDALYVINRGGLPGKGQVIRIENPRSV" gene 21766..23130 /locus_tag="DP116_07205" CDS 21766..23130 /locus_tag="DP116_07205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408014.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ScyD/ScyE family protein" /protein_id="PRJNA477356:DP116_07205" /translation="MKFKSFALTVFSVCAAVACGTQAARAASLSVVADQLNNPRNLDF APDGSIYLTESGAGGDGKDGRCIASPSAQYIPLCAGSNGTLVKIAKDGTKTNVISNLT SIALVPSGEQAAGPADFKFDSKGNAYLLTGLAGNPNQRDTVLQSPDLGKLYKVDLKTG SLTTLADFANYEAKYNPDGTDLISNPYAFAIKGDNAYVVDGGGNSIYSVALDGSGIKN VAAIPQKRISPDQLQFPTLPEGTTDPTGGAAPPPGYTIAPNGLPVSNQSVPTGIVVAP DGSLTLSEYTYFPYPENEARIFKVDPDTLQTQVLADGFTQLTGVTYDSDGNLYALQHI NQSEWKGIQQGGVITGDISGSIIKIAKDGTRQTIWSGNGLEAASGLFFGPDGDLYTSN RTRLVAGERGGQLIKIDPRSSGGATKVPEPASVIAVLATAALGAKAMKRKRQEQVLAK VETI" gene 23303..24490 /locus_tag="DP116_07210" CDS 23303..24490 /locus_tag="DP116_07210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408013.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PEP-CTERM sorting domain-containing beta-propeller repeat protein" /protein_id="PRJNA477356:DP116_07210" /translation="MGLVKNLSLALIGASFLVAGAAVQAMALTLQYDRSIGEPGFGPG QLFVPQGIAVDSQGNTLVANGRGVNPVTGAPDYSLGNKIEKFSPSGQYIGAIGTGGTG PGQFDEPTTVDFNPVTGDLYAGDVYNNRINQFDSQGNFIRSFGNGSFTPLVEGRLFFG PSGVTFDKAGNVYVGDFNGERIFKFTPDGQQIGVIGGTNGTALGEFQGVAGVRISPVS GNIYIADQFNNRVQVLDPNGKPLFTFGSAGSGPGQLLQPIGIEVDDQENVYVADSINS RVQVFDKNGKFLTNYGQPALDASGKPVPPPGLTDGPFGNPLDLTPGRFNWTGGTALKD GKLYVSDFFQGRVQVLNVNRSGSTSVPESSSLLGLAVLGVGATATLRKKQQKSPILEK TSV" gene complement(24710..25627) /locus_tag="DP116_07215" CDS complement(24710..25627) /locus_tag="DP116_07215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314398.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histone deacetylase" /protein_id="PRJNA477356:DP116_07215" /translation="MLPVIYSDEFLDHETGSFHPEKPERLTAIATALKQAEFAPHIEW RLPTLPEKRPIISVLEQAHTRRYIKKVQEIAVAGGGYLDGDTPVSPRSYDVALLAVSA WLDGVDVVLETGEPAFVLARPPGHHAESDAGMGFCLFSNAAIAGLYALQQPEINRVAI LDWDVHHGNGTQAIVEKYEQIAYCSLHQYPCYPGTGRHTEQGCHHNVLNLPIPPGGDI EIYQPLFEKKVVPFLSSFQPHLLIVSAGYDANADDPLASINLQPQDYGLFTDYCLGIT RKILFGLEGGYDFDTLSKSVLATIERCIA" gene complement(25676..25897) /locus_tag="DP116_07220" CDS complement(25676..25897) /locus_tag="DP116_07220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314399.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07220" /translation="MSSNETTPNQAEKTNPQSDESQLSPETLDKIKNPPRIDDVILSQ SPEERRSNPAVAPEMLDEPNDEFTGFTQE" gene 26137..26418 /locus_tag="DP116_07225" CDS 26137..26418 /locus_tag="DP116_07225" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07225" /translation="MYPGSNPPPAGCLIVFRTTQLGVFLTALGVPAVVFFPGTLIFLR QEEAVRWGGQCRGEPALREGFPTARRLAKGFPDLRHLSVEPVLEEGLPT" gene 26789..27925 /gene="gshA" /locus_tag="DP116_07230" CDS 26789..27925 /gene="gshA" /locus_tag="DP116_07230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879281.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="putative glutamate--cysteine ligase" /protein_id="PRJNA477356:DP116_07230" /translation="MLLKGFEIEMYTGTPSGDIVGLSDKIVGSLDGFVREPDSRNVEY ITAPLQNYEQLLCALLRPRLQLRNFIKQLGDYTLIPGSTLSLGGGDRFFRSDPTNPYH DYIEQTYGTKVVTASVHINVGISDPEVLMRACRVIRVEAPLFLALSASSPFLNGKATG YHSTRWGVFPQTPPQVPLFESHAHHIQWVENQLMAGTMQNVRHLWVSVRPNGDRRPYD LNRLELRICDLVTDPIALLGIAALVEARLLQVINNPSIDPLTQSTFTPDELISLTYAN ETAAATSSLDAQLQHWQDGRSILARDWIGEIYQDVWAIAKQHGFACFLSPLQKILREG NEAQQWLQLHALGLSERHVLTHAIDVTRECEVQLEDKLCSSLVA" gene 28448..28903 /locus_tag="DP116_07235" CDS 28448..28903 /locus_tag="DP116_07235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208903.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA (uridine(34)/cytosine(34)/5- carboxymethylaminomethyluridine(34)-2'-O)- methyltransferase TrmL" /protein_id="PRJNA477356:DP116_07235" /translation="MPQIVLVNPLIPPNTGNIARTCAATGTELHLVGPLGFEISDRYL KRAGLDYWPYVKLHYHESLEAFKSVHQARGGRLLGFSVKGSCNYVDFQFQPHDWLLFG SETTGLPLEIISACDVTLYIPMNEPNVRSLNLSVSVAVGLFEARRQLGL" gene 29969..32293 /locus_tag="DP116_07240" CDS 29969..32293 /locus_tag="DP116_07240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208904.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M23" /protein_id="PRJNA477356:DP116_07240" /translation="MKRALKNRGRAELENTPGDDVPVEPIDTVNPKVNQCRMPTTAAM IGLAISMGATSLLVTRQSDQATAAEALGYQNTTSTIPASNNVEVKFASTKKLGSQAVS SVSLPETGTVVEPTAISQLTEQRAKWQVAANKVSVQTSPLVGIPNSLTTAQQSIAWEK NNFQQSRKQGVQRLSHADGIASVQTVSSPSAKPSTVEVNNTEEPEVNAQLKAQQEFAR NQLQEKSDRLRKSLTQWQSEHTKDLSQLAATRLVQPMTVAGKMSQTSTITGTSQSNMT SDVSRARLVSKLKQESEAQVATVPAPTVPASTVVAPIAITQTATAVYEVKPGDTIGAI ANDYGISVLELIKANNLNNPHQLQISQKLFIPVAENPTTAQPTVAMNKSAVAGSGSSN TRETTGNSLIADNRNITVPTSVVGNTQFLSYIQSTTTNLPENTITNSQASDSITSYYG VGGDSPMPQVVTEPQLAQIPTVTKTKQVKNNQRLRSLQLEIERLRQKYRSQEAGNTVV PDENEANDAPVTVPDPSRNDGAVPIAVPRPNNPAVQIPVSEQNKAGIPIPVPRPIAPN YVGKPGKPVFGANRRPTNEPINPEFLPNQAIVTPPTGIDASRALGPMQGRTVSPQLPP LAAVDRYLPRVLDENTPIPSSSSTAYMWPAKGTLTSGYGWRWGRMHKGIDIANSVGTP IYASADGVVEKAGWSSGGYGNFVDIRHLDGSMTRYGHNSKLLVQRGQQVHQGQIIASM GSTGFSTGPHSHFEIHPSGKDAVNPIALLSTARL" gene complement(32436..32840) /gene="tnpA" /locus_tag="DP116_07245" CDS complement(32436..32840) /gene="tnpA" /locus_tag="DP116_07245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458219.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IS200/IS605 family transposase" /protein_id="PRJNA477356:DP116_07245" /translation="MAKFTPDEYRHEGNSIALLNYHFVFIPKRRKKVLVNAVADRLQE IICDVCNENRWKIVAMEIMPDHVHLFVNVKVTDCPAAVMNKIKGRASHFLRKEFPELL KLPTLWTPSYFVSTAGNISTEAVKNYIAQQRT" BASE COUNT 9365 a 7134 c 7234 g 9216 t 26 others ORIGIN 1 ttcggttcct tcggtcgcct ataccgcgtt ttaggcgttt caatccctaa tagggagaaa 61 gagaaatttc aacaagctgt ttgggagttt atgtgcatgc caacggtttc aatccctaat 121 agggagaaag agaaatttca actcgttttt tgtgcaaaaa aagctgcttg ccaaatgttt 181 caatccctaa tagggagaaa gagaaatttc aannnnnnnn nnnnnnnnnn nnnnnnnncg 241 tcctgttcag aaagaaaacc aaatcaactc gtttcaatcc ctaataggga gaaagagaaa 301 tttcaaccaa gtgccaggtc tacgagccgc agcaattacc gtgtttcaat ccctaatagg 361 gagaaagaga aatttcaacc ttgggcttgc tccctactgg gacgaggggg agtatggttt 421 caatccctaa tagggagaaa gagaaatttc aactcattgg cagtcattca cctagaacaa 481 gtaggtttca gtttcaatcc ctaataggga gaaagagaaa tttcaacgtt atagatctag 541 agatagtcac aagtcttata gtttcaatcc ctaataggga gaaagagaaa tttcaacgca 601 tggtttgcac ccgatgtctc ccctgcgaaa cgtttcaatc cctaataggg agaaagagaa 661 atttcaacta ttgtatcagg caataaaaaa gctgccactc gctgtttcaa tccctaatag 721 ggagaaagag aaatttcaac taaagaaaag ttaaggctta aagctgaaga gaaaggtcag 781 gtttcaatcc ctaataggga gaaagagaaa tttcaaccat atctgtctcc ccaggggtcg 841 gtttcagtga ggtgtttcaa tccctaatag ggagaaagag aaatttcaac tttcccttgc 901 gccgcgagtc gattatgaat tgggtttcaa tccctaatag ggagaaagag aaatttcaac 961 aaaactgcga gttttgattt tcaatcatta agtagtttca atccctaata gggagaaaga 1021 gaaatttcaa cgacctattg tgggcggcta ccgagggctt tagagatgtt tcaatcccta 1081 atagggagaa agagaaattt caactggttt aagcaacagt tgctgcggtt cgatgaggtg 1141 tttcaatccc taatagggag aaagagaaat ttcaacggag gtactcagaa aactgcacca 1201 tataaagttt tcaaggttct aaatcgcgga tgatctaatg atagcatgaa aaaagtaaat 1261 tgattgacag gcaaatcgct aaaatccagt ctaggcaagg ggcgcggatg ggttatcatc 1321 gaatattctc caaaatcctt gtactgtaag caatccaacc atttttttct taaccctttt 1381 ttctacacct acccatccgc gctatcggca aagaaaatca tgttatcacg 
aggttattga 1441 ccacccatct ttaccaaaac agcaggcgca cagcatggaa ttacattgaa tttgccgtag 1501 gttgcgcttc ttgataaata atttcttgaa actgaatgac gtctctaaga ggatcagcat 1561 gaaagacttt cactaacatc tgttcgccca acttcacaga acgtttaaat aacattggca 1621 actgcaagcc taaatcttct aacaaaatca gcgctaaacc actatcttct cttaaccaca 1681 tcagcactgt tgcttcccaa atttcatcag gctgacggcg cagatactct aaagcccaat 1741 atctattagt ttgccgttct accattgtca cttcttgggt gattgaagat acggtcatca 1801 tgacttcttt caattgttcg gctgaaaagg gcaacacttc accacgcaaa tgtgctttca 1861 gttgaaagtg agtcagcaaa tcgctgtagc ggcggatagg agaagttgct tgagtataag 1921 tatctaaccc taaaccagca tgtctgacgg ggctaatgct catttcgctc ttaggcatac 1981 aacgacgcat agcgcaagcc cggacaaatc ctgctggcag ctgtagcaat tcttgctccg 2041 gaggtaactc tggctgtggt tgaccgcgaa agggcaaggg gatgttatac ttttgaccgt 2101 agcgggctgc aacttcgcca gcaacaatca tcatctcggc gactaattgc cgtgatgtgg 2161 agtcttgtaa tatatcgatg ctgatattat cgcctttgac tttaatcatt gcctcgggca 2221 tgttgatgct gattgctcct tgagcatagc gccaagcttt gcgtcgtcgt gcccagttgg 2281 cgatcgccgc aatttccgat tctgcctcca ctcccaaatc caacacttca tcaacatctt 2341 cgtaagtcag gcgatatgtt ggtttaataa aacttggatg gatgctataa tcttctactg 2401 ccccagtatc atccaaaata atcccaaaac tcaaggcata acagactttt ccctgcacca 2461 gactcattgg tcctgttgcc aaaacttcag ggaacatagg aaccatgcct gtgggcaaat 2521 atactgtgct gcctcgcttt cttgcgtcta aatctaactc atcttctggt atcaaccagc 2581 gcgtcggatc agcaatatgc acccacaacc gctctttgcc atcgggaagt aattcccaac 2641 ttagaccatc atctatctcg gttgtacttt catcatcaat tgtgtacacc ttaagatggg 2701 ttagatccag gcggttgaca tctctgtctg gtggatgtga ttccaatcgc tgttgcgaca 2761 cttctaatac cttactagcg aactgtaccg gaattgacga acgacgtaaa aataagtttt 2821 catggggact ccagctaccc aagtccacta acagttgaaa agcaccttgg ggggttgcgg 2881 gacgtcccag catattcatc gtttccaaga ccaatgctgg gggaggatag gcacgagcta 2941 gggaatcata gtttaaccct acacgtacta tatcagcaac tagagctgca tatttttcta 3001 gggcttctag gcggtggcga tcatagcgtt gccactctac tggttcaccc ttaagcgcct 3061 gttctactct tgccaggaat tcttgctgtc ctttagcttt gagtgcctct acctctagtt 
3121 ggtgtttgcg ttctgccact tgagctgcac ttcgcggctc gtaagcatct cccttctgtt 3181 tgaagtaaat tttatcatct gataataaat aatacgctgc gtagcaatga ggcggttgtg 3241 attctgaaaa cagtagattt gccatctcac aaggcgtcac tgtttcccca ccttctacga 3301 gtaattccca agcgacttct aaactggtgg gttccaaata aggcttgact tcctctaaga 3361 atccggggat ttgcgaagat ttgtaggttt ccccagtcac tgtgtaggtg atttgccgag 3421 gtgcgagact gtgggattga ccacgttcat ctaccacaaa gaaacgactt ttgccgtctg 3481 ggcggtctac tacacccaaa cggcgatcgc taccaagtcg aaattcaact agcgtcccct 3541 tctccacaag cctcgcaatg atttatattc agaattttta ttttggattt tagttaggag 3601 ttaggagtta ggagtcaatt actcattact cttaactcat cactcttaac tattattagc 3661 cttcagcatt gataaacggc aacaatgcca taatacgagc acgtttaatc gataatgtca 3721 actctcgctg ttgctgagat gtcagtccag taatccgtcg aggtaatatc ttgcctcgct 3781 cagtgataaa cttacgcagc aaatcaacat ctttgtaatc gattggttct cccggcttaa 3841 tcggagatag acgacggcga taatagctca ttgttactta atttccttgt gaacggtatg 3901 tctgttgcag ttcgggcaga actttttcag ttctagtcga ttggtggtgt tacgacggtt 3961 cttagtcgtt gtataacgcg aaactccagg agagcgcttg tctgaatttg tacgacactc 4021 agtacattcc agtgtgatta ttatccgggc acctttactc ttggccataa tcttacaaaa 4081 cgttgactct gacgagaatt agcagttatc agttatcagt tatcagttat caagtaccag 4141 ccgcagttat cagttaccag ttatcaggga ggttaagaaa tcgctgactc tgttccctgt 4201 tccctgttcc ctgttccctg ttccctgttc cctgttcact gatttaacac aaatgactat 4261 tatctcatat tccgccgtgt tttttcaact aaccgcttaa ttttaatgtc aagcgaatag 4321 ccacgacgcg acgaaatgat catagtcttg gcatagcggt catgcaaagc ttgccgcagt 4381 tgggtatcag ataaagctgc accacagtca attgccaacg gaataaatag cagtatagca 4441 gtgggattac gtatgatgtt actcagagcg atcgaaacca gaagtgcgcc aagcccaact 4501 attccttctc gcttcaaaag agattgcaaa tcaggaactc taccccgttc gatttctaac 4561 accttgatat caaaagcata gcgccctaaa ctttgcccct gattgttgta aggtacaaca 4621 acccgcaaaa ttacccaagc aatagcgaaa actagtatct gaacaatttg cacgccgagt 4681 tgagaacttc ctaataagga actcactacc gagacacaaa ggaaatcaat gcctgatgcc 4741 attcctcttc gcccaaaatc aaccttagga tagtgtttgg gaggaacttg ttcaatagac 4801 
atatttaaaa tgcgtattta tagagagttt ggaggttggt tttttttccc tctactttaa 4861 tgttaaagtt acttacgcat tctttcccgc agaaagccga tgagatttgt ggttcaaact 4921 ttgtaaatct gatccacttt caatcagttc taaggactgt ggactttgat gattcaatga 4981 aagcttgact gtaaatgttg tcccgactcc cacttggctt tctaagcaga tctgaccttg 5041 atgcagttcg acacactttt tgactatcga tagccctagt cccgttcctg gaatactacc 5101 cacattgcta gcacgatgga aagtttcaaa caaaagttgt tggtctttga tggggatacc 5161 aataccagaa tcttggattt ggaacgtgag ggtatcatct tgacaaatga catcaaattg 5221 aattacgcca ccttgaggag aatacttgat ggcattggaa agtagattag tcaaaatgta 5281 acgcaacaga ttttcgtcca caagaactgg agtaggtttt ccctgaaacc taaagataat 5341 ttggtgtttt ccatttgcac tcaattgcaa attttgtgtc agttcgctac agaagttttg 5401 taaatctagt aatgtaggtt cgtattgtag tttgcctgca tcactcttgc cccaaaataa 5461 aacttcatct aataattgct gcatctgacg tatggcagtt tgaattttgc caaagtattt 5521 actcctttgc gggtcagaca actgatggtt atatgattcg agtaattcta ttgctgtgcg 5581 aatcacggac atcggattgc ggaactcatg agaaaccatt gaaacaaagc tagatttcat 5641 ttcgttgagt tctcgttcct gttgccatgc tttgatcatc tctgcttcag cgcgtttttg 5701 ctcgctaata tcccaagctt cgacaatcag tagcaccact tttccagttt catcttttaa 5761 gggtctgacg gaaaagtcaa tccacagcat cgcatcgttt gcaccgcgta agtgcaattc 5821 atggcggaga gattcaccat cagcagcttt ggcaatgata cttttcaacc accctggatt 5881 tgcggaagat aaattccagc tttcaagttc ccacaacggt ttgccaacac aatcttgttg 5941 ttgagtaccc cacaaattta gggcagattg gttcatttcc acaatcatcc ccgaacatgt 6001 gaccagcact atcgattgag gtgtttgttt aaaaactccc cgcaagcgat gctgacattc 6061 ttgtagttga ttaagaattt gtccccgctc gatagcatat cgaataacat gctctagtaa 6121 cttaggcgta atttgtcctt ttaccagata atcttgtaca ctattagcca ctgcttgtaa 6181 agcaatgtct tcatcgtcaa accccgttag caccactatt gggacgtttg gtgctgctgt 6241 agaaaatttt gctaaggttt ttaatccttg agaatctggt agagaaaggt ctaacaaaac 6301 tatatcaaaa acccttttgc tacaggcact aagcgcatca ctgagacgct ccaaatgtac 6361 cacatgccat ttttctttgc ctaaatgcca aagcgtttgc tgcagcaact tggcatcgct 6421 gggattatct tctaccaaaa gaatgttaat tgatgcctta ttcataaaat aaagttattt 6481 taccgctgtg 
aactgaactt ggggtggttc ttcgcagcga aacataatga cgacgttatt 6541 ataatgttga actagctgta tgaattgtgc tgtgtcagta gtcactgttg ttaagggttt 6601 atcagtaata atggcattgc tttatacaat ctttatggaa actttattta aaaattcctc 6661 ctcagcagcc gcagtataaa atagtaataa gtgagagaaa agctctgagg ctagaactca 6721 tttagtaatt aaccagcaag tagtaaacta attgctgcat ctacattgac cttttcacaa 6781 tgtaactcat gtcgtctaat ggggtcgagt tgagacgatc ttggtgtttt gatagtttca 6841 gttttgtgct tggaaagttt tttggttagg ttattacgag ggcaatatta tgctatcagt 6901 aagcgatgca gaaggcatta ttttcaattt agtacaaccc ctgaatactc aacaggatac 6961 agaattggtt gatttgttat tcgcaggcga tagcttcgct gctgcgctct gcgcagatcg 7021 cattttagca tctgaggtga ctagtcaact agattttccc cactgggaca actcagcaat 7081 ggatggctat gcagtgcgat acgaagatgt acaggattgt agcgaggaca agccagcagt 7141 tttagagatt gttgaagaaa ttcctgctgg ggttcaaccg aaatctacta ttcaatcagg 7201 gcaagcggcg cgaattttta caggtgcagt catgcctgcg ggtgcagata cagttgtgat 7261 gcaagaaaaa acacgtaggg aagaaaatcg tgtgattatc cttacagccc caaaactgca 7321 agaatttgtg agaaaacggg gaatgtacta ccgagcaggg acacaattag taccagcagg 7381 gattccgttg aaggcttcag aaattgctgt attaactgca gttcaatgta ctcaactgaa 7441 ggtttttcgc cgtcctcgtg ttgctatttt ctctacagga gatgaactgg tgacacctga 7501 caagccgttg caacctggtc aaattgtaga ttcaaatcag tatgccctag caactttgat 7561 tcgacagatt ggtgccgaac ctataatgtt aggaattgtt aaggataaac cagaagcact 7621 caaggaagcg atcgcttacg ctattgctca cgctgatgtc gtaatctctt cggggggtgt 7681 ctctgtggga gattatgatt atgttgagca aattctagag tcactgggag gcgaaattca 7741 cgttcgtgct gttgcgacca aacctggtaa acccctgaca gttgctacat ttaaaaatga 7801 tccacgtcca attttgtact ttggtttgcc aggaaaccct gttgctgctt tggtgacttt 7861 ttggcggttt gtacaaccag ccatcaaaaa actttcagga cttgctcatg gttgggaacc 7921 agtgtttctc aaagcgctga cgcgtcagga gttgtgttct ggtggtaagc gtgaaactta 7981 tgtttggggt cagttgtacg tcaaaaatgg agtttacgaa tttcagccag caggtggact 8041 tcaaaattct gggaacttaa ttaatttagc tcaaaccaac ggcttagctg ttctaccagt 8101 gggtacgaca cttattcctg ctggagaaga agtgcaagtc ctacaggtag ggtcgtaaac 8161 tcgtacaaaa agacacaaaa 
acagcaatac tcaatgtgtc tttgtgtcta ctcaacttta 8221 ctgcttgtgg ctaaattcta atgatgaagt actcaccaag tcacatcgcc agtcaaatta 8281 ttatctaccc gtagaacaga agatgattgg gtaatgtcgc tgctaacact gccattcacg 8341 aggttagtat taggattgcg ggggttgtag ctgctggtgt caattgtcaa cgggttaggg 8401 aaggcgtaag caatgttctc ccaaggaaca aacttaaagt tggtgttgac gttttcacca 8461 tcgacaatct tagcaggcaa atttctgcca tcctgagtaa tgaatacacc atacgggaag 8521 ttgggaccca acggcacgtt gataacatct gcaccatctg actcttgaac gctgtcaatt 8581 cctccgttgt taccaacacc aaaacgacct acgaagtcat tttgaccctc acgagtgtag 8641 actacaaagg tgctgtcacc ttgactagaa gctagcagat aacctgtacc atctttagca 8701 taatagatgg agagtccttc cacatcgtct tccacatagc taccgcctaa gtctttcact 8761 ttgtcaatca gcgtaccagt tgtgcctcca tttggttctg cttgatactt ccaaatccct 8821 acatcttctt gaccaatgta aagataaccg agttcttggt ctgctactgt tccctcaagt 8881 tgagcatcac gtccagcagt tgtaggtact gtgaaattac gtactctttc agcaccaatc 8941 ttaccatttc ctgtgtcaat caatttgtac tgtgcaacat ctccagtttc tctacgattg 9001 gcaaagacgt agtaatcatc tgtcacagga ctatggtaca aagctatgcc ataagcactc 9061 cgcgatgatg gtgagtaggg aggttcaaag ggtgatgatt ggaacaaagt accgatactg 9121 ctatctgtga tatcttccag gtattgaccc gaggtgctgg gagaggaatt gattttgaag 9181 atagccagct tatcgttgtt gcggtctgtc gctacggcaa tatcaacaga atcaccgcct 9241 aacttaaagc cgtattgcag gtcaatgttg ttgtaacgga tattacctgg attgaactct 9301 tgtaagagat taccagacaa gtcataaact cgtaagcctg catttttcac tgctgtgagt 9361 actaggcttt gttctggatt agaggaattt acatagatag ccgggtcatc cgcgtctgcg 9421 cgttggtcgc gtggtaaatt tgggtcattc aacaagtcag gacgggtttc cactgtaggc 9481 gcgacggtag gaactaagtc tgcacccaaa gcgagaattt gagtaaattg ggtctggcta 9541 aagttgttgt cactcaccaa aacgatggac tgacgaccat catccagttt tggaccaaag 9601 gcgatgcctt caatattatc tgtacctgtg ggtagattca gcgagttgag atttaacacc 9661 aaacgcttct cggcaggttt gatatctgct agttgttcag cacttaaatt attaagagaa 9721 tcgtaatact tgatgtcagt tgctccttgc aagctgactt cgtaaatttt gatggtattg 9781 ccaactcctg cggagaaaga gcgttccaat gctagtagtg tacctcggtt atcaagtgcg 9841 agtaaatcta ctaagccatt atcacctgaa 
cctgtcgtgg ggtttggtag agtggcgatc 9901 gcatcagtct tataaagata ttccttctct ggctgtccgc tcaccaagtt atattgtaag 9961 atacgagaac ggcttccatc agtcagtgtc gcagtaggac catcttggaa cagggcgttt 10021 tctgttgctg tgtacaaagt cttttggtca ggcgtaatgg tgagactttc aaatgccaag 10081 ttgttgcgaa cacctgaaat ctgagtatcg ccagcatcga taacaccatt tccattagta 10141 tcttgtacaa cgggaagaaa cttactagga acaggtaagg aacgtacttg ctgtcctgtg 10201 gtgagagaaa attcattgac gaaaggattt gtcactcgac ctgcactgac gttcacctct 10261 ccttcagaag agatgaatac cgtattattc ttggttaaag caatgccttc agggtcgaga 10321 ctgttgagta ggaactcctt accattctca tctttgaggg tagtcacatc tgtgaatgtg 10381 acactcaagt cggttttttg aggagatgca gtttctaggg taaatgtgta gaagcgcgcg 10441 ggagcaattt gggagcggtc atctgagata ctgtagaagc gattatttgc tgcatcgtat 10501 gtgactccgg acaatccgcc aacaggagtt tcttttccat ccacggttcc agccgcacca 10561 ttcgggataa agccagtggg gaagattgtt tgtcctgaga agtttacatc cggaattgtc 10621 ttacctgctt ccgtgggtgg tacctgcaaa tcagcccata ctaaacgatg gtcggaactg 10681 gggaaaccac cagataaact agagtcgtaa gtgcctacca gtggaaacag tgggtcagta 10741 ttgaggggcc agaaaacggc tgagtttgtg atgtttaagt cagcagaagg taatacgtag 10801 tctgtccgca aattgccagg attcgtatcc gaaaagtctg ctgtgtcaaa agcagagtta 10861 ccgcgatgat tcagatttgc tccaccctgt aatatagctt gctgtgaacc accggaacta 10921 gaagggatga aatttgtatt gatattgggg ttcagcaata attgccggat agcgttgttg 10981 tagctgtcgc catcaaaggg gtctgcattt tggtcaccca taatgacaaa gcgtgaacca 11041 gcagcaagac cacctttttt acccgcatcg tcgtagatgt agacaccttt acctggagta 11101 atgtaatctg aaaagaaacg aatttcgtcg tagttgcgtt taccattgcg gtcttccgga 11161 ccatcgaacg tcgggggtgt cggatggctg accaaagcgt gaattgtctc gccattgact 11221 gtaattggaa catcccaatg acttttggaa gaaagacgta acacggcttg ttcttcttct 11281 gagtaccaag gtgtctggga gtcaggaaga gcaatagttg gcaagagtga ccctggtata 11341 tccttccaca ggaagttttg gaacgtccgt acattagcag tgtcgatggg gtacttcgac 11401 agcagcaaca agccgtactg accagggaac tcgccaaagc caaacgcatc gtcaccatat 11461 cctcgtgtac caggagttgt gacgactgta ccatcgttat tcaagtcaaa tccagatggg 11521 ataccagtat 
ttgagggtgc gatgtagacg taagggtagt caactacgtt tgccccgttt 11581 tgactgacac ctaagtaatt tttctggaag agtcggactg gcgaaagagg atctgcctga 11641 ttgtaatcaa actcgttaat cagcagcaca tctgggttat tacgctggat aatttcagca 11701 acagcttttg cttgagcatt atcaggggta gacaaatcag tgactaactg accttctgcg 11761 ctgcggttta gagaagcatt gaactgtgaa aagcggactg tattagtggt taccataata 11821 ttcgcctgaa taagtgtatc aactgcaaag acaagcttta gattcggggc aattcgtgat 11881 gtacaaacga caatattgcg cattccttct atgaagaagc gcagattctg aaatgatcgt 11941 cattgctatt tgtattaatt tgtattcagc atcatcaaat tatcttagta acattaagaa 12001 gtagatagca aaaggttaag acatcagtaa aacttgtaac ggtctttacc tcttttgtta 12061 attaacctgc ggaatgtggg atctatctat gcctatctat gccgatgcga aaaatacaga 12121 tcccccactt cgtaagagaa gtcggggatt ttgttttgtt tttcacattt tttgtagcgt 12181 gcgcgtaggc ttatgattta ggactgcgat agattcagtg aagttttata atctccgact 12241 ttggaattgc gaatcagttg atgctctttt ctccaggttt gaggagggat accgtgatgt 12301 tgacgaaact ggcgggaaaa atgacacgca ttttgatagc cgagtgctaa agcaatttgc 12361 tcgattgttt ggttagtatc ttggagtaaa gaacgtgctg ctgacatccg gcgcttaaca 12421 atccagttgt tgactgtttc tcctgtgatt tttccaactc tgttagtcaa gtaagccgaa 12481 gagtaaccaa ctgcttcagc tacgtcacac agagtgatac cttggtcata atttgcctct 12541 atatagtcga aaacttcttt aagttgcggt acagaaggaa agatagattc tgagtccact 12601 gatgaagctg tgtctggtgc gacgggtgct gaaaaattct ggcagcatgc agtataccag 12661 tactgcataa tttttacttg cttttctaat ctgactgcta tagctctgag taactgctgt 12721 actgtacaag gcttggtaag atagtcatct gctcccaatt ccataccttt gcgaagagat 12781 tcgttactgg cgctaccagt tagaaaaata aacggtataa ttgcagtcac aggatcttgg 12841 cgcagcatcc tgagaacgcc aaaaccatcc atatccggca tcataatatc gcaaatcact 12901 aaatcgggta gatgctcttg tgctttttgg ataccaatag taccattttc agcacctatg 12961 gtgtcgaaac cttccgcctc aagaccgtct aaaaagatct tacggctcat ggcattatct 13021 tcaataatca gaattttttg tgacgatttg tttatcatgt ccaattggta gatgttgatg 13081 tcttttagct aagagttaag agctaaccgt tggtagattg acagtaaaag tcgtgccaac 13141 accaaccaca ctttccacgg tgatttcacc gccatgtaga tctacgagag ttttaacaat 
13201 cgatagaccc aatccagttc ctggtatatt gtcaatattg ctgccacgat aaaatggctc 13261 gaatatttgt tgttgatcca ctactggaat accaatacct gaatcttcaa tttgaaaaac 13321 gacttgattg ttttgacagt aaagtctaaa agtcaccgtg ctactagtag gtgaatactt 13381 gatagcattc gagagtaaat tagttaatat atgctgtagc agttttggat ctaaataacc 13441 agtagagcag ttaccttggc tgacaaagtt gatagctttt tggttaccgc ctgctatctg 13501 tacctgtgct aggatatctg tgcaaaacct ctccaggttg agttgtcttg gctgacactc 13561 caatcttgca gattctgcct gaccatacaa tagaatttcg tctaataatt cactaatttg 13621 ctgaacagca actgtaatca aatccagata cgaacgattt ttttcttcag tccattggtg 13681 aatatgacgt ttgagtaaat cagcagagaa tgagacaatg ttgagcggag tgcggaattg 13741 atggcaaagc atagaaacaa aacgttctct gagttcgctc agtcgctttg catgctcaag 13801 agcttgatga agctccgatt ctgcctgctt ataatctgta atatcaatgg ctgtcagtaa 13861 ttcgactggc tttcttgcga aatccagcat tccatccagt actcctactg tacaggctag 13921 ccagcgctcc acaccatttt ttgttctaat ctgcatctcc tgatattcac tgtagccggc 13981 tccattctgc ttgtgaacct gcctaagctt tttgcttgta ataagtcggt ctatattaaa 14041 gttattgaac aattcttttt ttggatagcc agtgagtacc tctgctgcag aattggcata 14101 gcaaatgcga ctgtcgcaaa ttaggaaaat actagcttct gtagtttctg ccaaagtacg 14161 aaatctggct tcattgagcc taagtgcttc ttctgttttt ctggactcag taacgtccca 14221 tatactccat actctaccga taattttgtc tccaagccgt tgtggctcag agtaatgtgc 14281 aaagactctc ccatccttca attccaccaa gtcatagctc tgtttatccg attgactagg 14341 catttcccaa acagcctgac gaaaaatctc tggatattta acttggttct caaaaaagcc 14401 ttttgctcgg tgagattttt tagacagact aaccgagttt gggaattgcc acatctccag 14461 aaatttctga ttgtagcaaa gaatttctcc ttcaaaattc actgcaacta tgccatttgc 14521 agtggattct aacgtcgagc gaagtaggga aagagatgtt tctagttctg tttgtgttct 14581 ttgatattca ataacttctt gctcgaactc ttcttttgga tcagaggttc tgtcaatata 14641 ctgttgtacg ctcaagccta ctaattcatc actactatca cgagcaaagg cacagccaaa 14701 ttctttgcct tggtactcta tataagtaag attcatttct actagaaaga ctcgacctgt 14761 tcttgtttga tatctaaatt taaaggtgag ggaacctttt tccttgagtg ctatccattt 14821 gtctgaccaa ttgtgtgccg aaaagtctac atctatatct 
tgcagtgtca tcgagagtag 14881 ttcctcacgg gaatactcac tcatacggca catggcatca ttcacgtaga gaatttgtgc 14941 gtttaatccc acactgaaag cagcatccgc aacctggtta ataagaaatt gtgctagctt 15001 caattcctct tccggctgta gcaccgattg agaactagta tcaaatagca taatattcac 15061 caaaactcat caccctctac tgacctgaca aaaaaagtat aaattagcac tgtagcgttt 15121 ttgggacttg ttatcaaaat ttctgaaatt tggtgttacg ttttatctgt cgcctaactt 15181 taacttatga aatctataag tatcttaata taacttaatg taaaagtcaa aaaaattcat 15241 ttttatcacc ctgaagcacg atagctgaca gtaccgctgc ttaaacgctc tgttaaatga 15301 cagtggaaaa atcaaataca gaaaacattc aagtataaat cttagccttt aacaacgatg 15361 agcagaatat ttccttttga acatttttac tagttttacc tcgtataagt ttaagcactc 15421 atatcagctt cttagttgca aaatgaataa ctgtattact actgtgtact tttctagaat 15481 atagtgacta aaaacacgca atacagttaa cttacttaat ttgagtaaag agcagacaaa 15541 tttatagatt ttcatggctg aattgtttct agttaaccta atttctttaa tttactcttc 15601 tatgaattca actacaaaat attttacaag atcatctgtc aaaagaaata ccctaatcct 15661 ggcaaaatat gtataggcta atacttatct atcttgagat tgattatagt agaaccgaaa 15721 gaagtgtttt tgataagcaa acttctactt aaagggcgtt gtgctgcaac agcactagaa 15781 tttattccac ttttcaagca gaatcaggca aaacattaaa attttattaa aaatacgatg 15841 aacgctattt tttgtgctta tgttcactgg ttgtccttag tttttcttag gtaactctac 15901 tttcaagtca taattaagac atgttaaaaa agtaactgaa tgaccgatgt aaagtaccaa 15961 aaggtagaga aaagttgttc aaagagtcgt acatatcata tttagtacaa acaaaaaatc 16021 tgacaactca atcaaaagat ttttcattgc agtaaagtat ctagcttttg tagtcttgtt 16081 tccataagca cagcctcaat tgtttaactt cttcgcaaca agttttttta ctgacgaaaa 16141 agcgatcact cgcctttccc agtgttcgtc aaactgggtc gtgaaaatgt agaaaaaatg 16201 gtgacataag ggatacttaa ggctgtgtgt gcgagtgcag cctgagaaat ataactggta 16261 gcctctccaa taatcaggga atatgtagac gcaatgagcc aaaactatac tcagacaaac 16321 tctctagtcg gcactcagcc atacgacgtg taccaaccaa ctgactcttc caaagaccag 16381 acaaatggta gttccaatgg tcactcaaat ggtgcagccg ttgcacaaaa gcgatcgcca 16441 actgtagcag atgcgatcgc gaatatgatg gaagatttgg gagtcagctg cgcctttggc 16501 gtcaatggag gtgcgatggc 
tggtctttgg ggctcgctat cgaatagcct cttacaagta 16561 atgaactgcc gtcatgaagc cggagctgct tttgcagcag ctgaagcgta cttcgcgact 16621 ggtcgtccca ccgtagtttt tacaaccgct ggaccaggta taaccaacgc tctgactggg 16681 ttatttgctg ctcggggtga gggtgcaaaa gtgattttgc tgtcagcttg cacctcctca 16741 ccacagcgcg gacggtgggc aatacaggaa accagcagcc acactctgcc aactgaggga 16801 atttttacct caggaacgct gtttaactat gctaccactg tggagagtgg tgcacaactc 16861 ccacaaattt ttcgtaaact ggctctgggt ttgtcccaac caggagcctt tgtcgcccat 16921 ttgagcattc ccactgctgt gcaaacaagt ccgcttgata gaatgccatt gttccaagga 16981 atggattgtt ctttggtagt gccacatact gaagcaattg ttaaatgtac acagttgcta 17041 tcagaaggac cgtttgccat ttgggttggt tttggtgccc gtggcgcagc agaagaaatt 17101 ctccaactcg ccgagagaac cggggcagca gttatatgtt caccccgtgg taaaggtatc 17161 tttcccgaag atcatcctca atttgttggg gtgacaggct tgggaggtca tggttccgtc 17221 atgacctata tgcaacagca aacaccttta cgaacacttg tgctgggaac gcgccttagc 17281 gaaccaactt ccttctggaa tgaagcattg gttcccccag gaggcttcgt gcatgtcgat 17341 atcgatccaa cagtgccagg agttgcatat ccagaagcgg aaaccttcgc tatccaatca 17401 gacatcaaag catttttgca agcgctgttg caacatgatg agcatctcgc cagtgcatcc 17461 gtaacagctt tatcgctccc tcgtcctgaa ggtcagacaa ttcaaccagg ttcagactct 17521 ccagtcagac cagaggtgtt gatggaagct attcaaaagg tgatcatcga tggcactgat 17581 gccatagtca tggcggagtg tggtaactcg ttcctttggt caacacattt actgcgattt 17641 gcaaacgcaa atcgttaccg aatcagcacc ggagttgggg ctatgggtca tgctgctgcg 17701 ggagttatcg gagcagctgc tgcacgcaat ggcaaagcag tcgccatcgt tggggatggg 17761 gcaatgctga tgaataatga aatcagcaca gctgtgaaat acgaaattcc tgccgtctgg 17821 attgtcctca acgatgctcg ttacaacatg agccaccagg gtatggaaat gttgggactc 17881 aaaggtgcag atgcatcaat tccgcaagca gattttgcag cgattgctca cgctatggga 17941 gccgaaggaa tccgcatcaa caaagaaatg gatcttttct cagcgttaga gcaagcaatg 18001 gtagccacgg gtccgatcgt tatcgatgtc gtgattaacc cagatagacg cgcaccttcc 18061 aaaggacgca acgccggtct ggcatcacaa ggagtcaagt caactccggc tcaaaagact 18121 gaattgcacg tgtcatttcc acaagtatca tttccaaatg cctaatcaat ttgtagtaag 18181 
cgcaaacggc gtttactaca aacacatagg tgatatatcg gtgagcgagg agatagtggt 18241 gaaccttttt gaaactgtta cagagatggg tcatgagcag attcttttct gccatggaaa 18301 agacccagat attaaggcaa tcattgccat ccatgacacg agtctgggac ccgcaatggg 18361 ggcgacacga ttgtggcctt atgccagtga ggctgctgcg ttaaaagacg ctcttcgtct 18421 cagtcgtggt atgacttaca aagctgcttg tgctaatatt ccagttggtg gaggcaaagc 18481 agttattatt gctaatcctg aaaataagac agaagaccta tttagagcct acggacgttt 18541 tgtcgaaagt cttaaaggac gatttatcac aggtcaagat gtgaatttga ccccggaaga 18601 tgtcagaaca atcagcaaag aaacccaata tgttgtaggg gtagaagagc gctcaggcgg 18661 accagctcct gtgaccgcat ggggagtttt tctaggactc aaggctgctg ttgaatttcg 18721 tttacaaacc gaaaacctca aagggttgag ggttgcagtt caaggtttgg gaaatgtcgg 18781 tcaaaatctt tgccgacacc tgcacgaaca tggagcaaaa ctatttgtta ccgatataag 18841 tccagataaa acagagcaag ttaaacgtct ttttggtgcc acagttgtgg agccagatga 18901 aatttactct ctggatgtcg atgtactttc cccttgtgct ctaggcggaa ttctcaatag 18961 tgaaacgatt cctcgcatta aagcttcagt tattgctggt gctgccaaca atcagctagg 19021 agtagaagta ctccacggtc aaatgcttgc agcacaagaa attctttatg ctccagatta 19081 cgttattaat gcaggtgggc taatcaacgt ttacaacgaa atgattggct acaacgaaca 19141 aagagctttc aagcaagtga ataacatcta cgacacgctg cttgaaattt ttgatagagc 19201 gcaaaagcag gacatcacta ccaacgatgc ttctaagcag ttggcagaag acagaatcct 19261 caaagccaga catctcaaga ccttagctgt ggtctaaagg agtgacaaat ggtaactgaa 19321 agaaatacat ttggaacatc tgcctacatc gaaacgactc cagagagtgc ctttgaatac 19381 cttgccgagc taaaaaactt aggcgagtgg actctctata gccgcatgga agagcaaatc 19441 gacgaagata cctggctcgg aactgcctct ggctaccaga caaagctcta ttatcatgtc 19501 gaaagactgg atcatcccaa tttttacggc attgagtggc actgcgggtt agagtatcag 19561 aaatattatc aggtctaccc tgtcctcctt tttcctgctg actacgttga gccgggaacc 19621 gatgaaaagg gtgtgtactt acactgggtc agctttgttg atcccaagcg gcgcagtccc 19681 ttgattatgg agggaattca cacggtacac acttccgagt gtcgttctct taaaggtaat 19741 ttggaacgca aagctggtct taagaccgca gcaaaaggac gttactacgt tgacaccgac 19801 accatctatg ttaatgcccc aatagaaatg gggattgagt atttgtcaga 
cctcaaaaac 19861 atggatgatt gggcacacct actgcgagcc gatggcgaaa tttctggtca gtctggagag 19921 tttcgcgatg aatatggtca aaaggtgaaa gtcactcagc gcctgcaagc tgttaacaaa 19981 tcgtacttgc tggagcacga gttcttttat ccagactacg gattttatca gcgctctccg 20041 gtgctgctaa tcccaacctc ccatgctttc cgggatccag aagctcctgg tttcatccag 20101 catcgaatca ccttctggaa aacaggtgag caactacctc acggtaaact ccaaatcgaa 20161 gactttggct ctgagagctt gaatatcaag cgtatcttgg aaggcaaagc tggcaacctt 20221 aatacattag ctcaaggttt gagctatatg ccgcaggcta agtagtgact tgttagtagt 20281 tagtggttaa taaaacaact aattactaac aactaacaac taataacaaa taaataactt 20341 tatatattca gggtgaaaac gctgccattg cttgttatga gcaatgggag ttctacacat 20401 aaaatctaaa gccgacctct gttgtgacac atgaaactgg aacaacttac tattacgtct 20461 ttcacggtac ttgccgatgg tcttgacaat ccgaaaggtc taagctttgg tcctgacggt 20521 agtctctata ttacagaagc agggacgggg ggagatggag ctagcgttcc atcacctagt 20581 ggtcaaggtt ctttactttt tggcacaact ggtgctattt taagagtaaa taatgctaca 20641 atagaacgta tagtcactgg actcccttcc ttggcatttc cagatggtac tggagccgct 20701 ggtcctcacg atataaaatt tgatactaca ggcaagcctt atgttctcgt tgggtacgct 20761 gcaaatcctg ccttacgcga cagcacattt ggtgagactg acttaggaaa aattatcact 20821 cccgatttcc agacgaattc gtggactact gttgccgata tagccaacta tgaactgact 20881 cataatcctg acggaggcga tgtcgtcagt aatcccctgg cttttttaat agatggcgac 20941 aagattgttg ttgttgatcc gggtgcaaac gatttgctga gtgtaggcac cgatggaagt 21001 catttggagg cgatcgctgt acttccccaa cagccggtta ttaatccaat ttttcctggt 21061 tttaactcgc aaaattttga ccggggacac gtaccacctc ccagcgctta tcgtaatgcg 21121 acaccatcgc aaataacgat tcaatctgta cctacaggaa ttgccaaagg tccagatggt 21181 gcttattaca tcagtacatt cactggtttt cctttcccag aaggtggagc aaaaatctac 21241 cgagtggatg ctgatggtca actaacagtt tatgctgatg gctttacaca actgattgac 21301 ttggctttcg atgcgcaagg caacttgtat gcattgcaac acatgaattc ctctggctgg 21361 aaaggaaatc tggatggaag tctgatcaaa atagcacagg atggcactcg cacaactatt 21421 ctcagtggtg atggattaca agcaccaagt gcactgacta ttggtcctga tgatgctctt 21481 tatgtgatca accgaggcgg tctgccagga 
aaagggcaag tcataagaat tgagaatcca 21541 aggtctgttt gatgagtaaa agacaatagt ctgtgctcaa cacaagaacg agattcctgc 21601 aattttaaca acaagtttca agcgggtgca ataccagttc ctctgaactt ggagggactg 21661 tcgcggtaga tgcttgattt ccacatccta gtcactgcga ctctttatct ctaacaccgt 21721 acaacattcc acaactacca acactcgatc tcaatctaaa taactatgaa attcaagtca 21781 tttgctctta cagttttctc cgtttgtgcc gctgttgctt gtggaaccca agctgcacga 21841 gcagcatcgc tgtcagtagt tgctgaccag cttaacaacc cacggaatct tgactttgct 21901 cctgacggca gtatttatct gacagagagt ggtgccggag gtgacggaaa agatggaaga 21961 tgtatcgcat cacctagcgc gcaatacatc cctttatgtg ctgggagtaa tggtaccctg 22021 gtcaaaattg ctaaggacgg tacaaaaaca aatgtaattt caaaccttac atccatagcg 22081 ttagttccct ctggcgaaca agctgccggt cctgctgact tcaaatttga ttccaaaggc 22141 aacgcttatc ttctaactgg cttggctggc aatccgaacc aacgcgatac cgtcttgcaa 22201 agccccgatc tcggaaaatt atacaaagta gacttaaaaa ccggttcgct gacaactctt 22261 gccgattttg caaactacga agctaaatat aatcctgatg gcactgattt gattagcaac 22321 ccctatgctt ttgcaattaa gggtgataat gcttacgtcg ttgatggggg tggaaactcg 22381 atatattccg tggcactgga tggcagcggt attaagaatg tagcagccat acctcagaaa 22441 cgcatatcac cagatcaact gcaattccca actcttcctg agggaacaac agacccaaca 22501 ggaggcgcag caccacctcc aggttataca attgctccca atggtcttcc agtatcaaac 22561 caatcagtgc caacaggtat cgtagttgcc ccggatggaa gtttgacttt aagtgaatac 22621 acttattttc cttatccaga aaatgaagca cgtatcttta aagttgaccc cgatactttg 22681 caaacacaag tccttgctga tggctttacg cagttgactg gcgtgacata cgactctgat 22741 ggcaatttgt atgccttgca acacatcaat cagtcagaat ggaagggcat tcaacagggt 22801 ggtgtgatca caggtgatat cagtggttct atcatcaaaa tagccaagga tggaactcgc 22861 caaactattt ggagtggtaa tggactagag gcagcttctg gtttattttt cggtcctgat 22921 ggcgatttat atacttcaaa ccgtaccaga ctcgtagctg gggaacgagg aggacagttg 22981 atcaagattg atcctagatc ttctggtggc gcgacaaaag ttcctgaacc cgcttctgtg 23041 attgctgtat tagcaactgc agctttaggc gcaaaagcaa tgaagcgcaa gcgccaagaa 23101 caggtgttgg ctaaggtaga aactatctaa ttccttgttt gtttcttggg ctataccaaa 23161 gaatgctcaa 
gaaataagct gctacctcaa atacaaagct gtctttcaaa aagacagcaa 23221 cacaaaattg aagttcccag actaggtccg ggaactacat aaacctaggt aaaagtataa 23281 ctcataacac agaagattca ctatgggatt agtcaagaat ttgtcacttg ccttaatcgg 23341 tgccagcttt ctggtagcgg gtgcagcagt ccaagcgatg gcgttaacct tacagtacga 23401 tcgctctata ggtgagcctg gtttcggtcc tgggcaactg tttgttcccc aaggcatagc 23461 ggtagatagc caagggaata ccctcgtagc taacggacgc ggtgttaacc cggtgactgg 23521 tgctcctgac tacagcctcg gtaacaaaat tgaaaaattt agtcctagcg gtcagtatat 23581 tggagcaatt ggcacaggcg gcacaggacc cggacagttt gacgagccaa caactgtaga 23641 ctttaatcca gtaacagggg atctgtatgc aggtgatgtt tacaacaacc gcatcaatca 23701 attcgattct cagggtaact ttattagatc ctttggaaat ggatcattta cccctctagt 23761 agagggtaga ttgttctttg gaccatctgg tgtgacattt gacaaagctg gcaacgtgta 23821 cgtcggtgat tttaacggcg aaaggatttt taaattcaca ccagacggac agcaaattgg 23881 tgtcattggt ggcaccaatg gcactgcact tggggagttc caaggtgtag caggtgtaag 23941 aatttcccca gttagtggaa atatctatat agctgaccag tttaacaacc gcgttcaagt 24001 actcgatcca aatggtaaac ctctgttcac atttggttca gcaggtagcg gacctggaca 24061 gcttcttcag ccaattggca tcgaagtgga cgaccaagag aatgtctatg tagctgattc 24121 tatcaatagc cgtgttcagg tattcgataa aaacggtaag ttcctgacga actacggtca 24181 accagccctg gatgcatcag gtaagccagt cccgcctcca ggattaactg acggtccctt 24241 tggcaatccc cttgacctca ctccaggcag atttaactgg acaggtggta cagcccttaa 24301 agatggcaag ctgtatgtta gcgacttctt ccaaggtcgc gtacaagtgt taaatgtcaa 24361 cagaagtggc agcacttcag tacctgaatc tagctcatta ttaggtctag cagtactggg 24421 agttggcgcg actgctacac tgcgcaagaa gcagcaaaaa tcaccgattc tcgaaaagac 24481 ctctgtttaa cagggcaaac gcatctaaag tgtaagaatc cttttattac aagggtttca 24541 gcattttatc ttcaagccta ttttttgagc aaaatcctta gccagtaagc atttcacaaa 24601 tcacgtgcgt tgtggtctat ttacttgaaa attgctgtaa ttaatttttt catgggattt 24661 catgagattt tttttacaat ttcgcactgc gaacaagtct acttactact cacgcgatgc 24721 agcgctcaat tgttgctaaa accgacttag aaagagtatc aaaatcgtaa ccaccttcca 24781 aaccaaacaa aattttacga gttattccca gacaataatc tgtgaataag ccataatctt 
24841 gcggttgcaa atttatacta gccaacgggt catcagcgtt ggcgtcgtaa cctgcactca 24901 caatcagtag atgtggctga aaactggata aaaagggtac tacttttttt tcaaacaaag 24961 gttggtatat ttcgatatcg cccccaggag gaattggcag attcagtaca ttatgatgac 25021 aaccttgctc tgtatgacgt ccagtccctg ggtagcaggg gtactgatgg agagaacagt 25081 aggcaatttg ttcatacttt tctactattg cctgcgtacc attaccgtga tgcacatccc 25141 agtcaagaat cgcaacacgg ttaatttctg gttgttgcag ggcataaaga ccagcgatcg 25201 ccgcattgga aaacaagcaa aaccccatcc ccgcatcact ttccgcgtga tgtcctggtg 25261 gacgcgccaa cacaaaagct ggttcgccag tttccagtac aacatctact ccatccaacc 25321 aggcacttac tgccaacaaa gcaacatcat aactgcgtgg agaaacaggc gtatctccat 25381 ccaaataacc accgccagca acagcaattt cttgaacttt cttgatgtag cgccgggtgt 25441 gtgcttgttc caacacagag atgatagggc gcttttctgg tagagtgggc aagcgccatt 25501 caatatgtgg tgcaaactca gcttgtttta aggcggtggc gatcgctgtc agacgttctg 25561 gtttctctgg atggaaagat ccagtttcgt gatccagaaa ctcgtcggaa taaatcactg 25621 gcagcatagg tcgagtgaag taagcagtaa gatacagatt ttaattgtat cctgactact 25681 cttgagtgaa cccggtaaat tcatcgtttg gttcatccag catttctggt gctacagctg 25741 ggttactcct acgttcttct ggcgattgac tcagtatcac atcatctatt cttggaggat 25801 ttttaatttt gtcaagcgtc tcaggactta attgggattc atcgctttgt ggatttgttt 25861 tttcagcttg attaggagtt gtttcattac tggacataat tatttcctta accttagacg 25921 cctcagctta agctctgctg tggtcggtac acactatctt ggggcgtgaa gtacccacca 25981 ccaaaacgga gtaccgttta tagtgggggc ttccagtttc attcgtcgat gccacgttag 26041 ggatttctat tgctagaaat tcgtcactca agacgatgat ggtcttactc ggcgtccgga 26101 ggtttgagca gcagctcctt actcccgtac acccgtgtgt accctggcag caacccgcct 26161 cccgcaggtt gtttaattgt ttttcgtaca acacaacttg gtgtctttct taccgcactt 26221 ggcgtgccag cggtcgtgtt ctttcccgga actttgatct ttctgcggca agaggaggca 26281 gtgcgttggg gaggacagtg ccggggtgag ccagcgctgc gggagggttt cccgacagcc 26341 aggcgactgg cgaaagggtt tcccgacttg aggcacctgt ccgttgagcc agtactggag 26401 gagggtctcc cgacctaggt atctggcgtc cgggttaagc gacttgtaga cgccctccgg 26461 gcggcttccc gcaatctgcc gcacctgccg ttcgtacctc 
tactttgatg tttgcctgtc 26521 tgtctgcgac acaatccggc gggtcagcaa ccaataatat tatacaatgg cgattcgtct 26581 gattctggag agattagcaa ctctcccggc acgctcacgg agtaccgtta tagtgcggga 26641 cttacggcgg gcaagttaaa ttaagaataa gagaaagatt tcggtactct ctcgtttgtc 26701 aactgttaat taaccctaag gtcataaaca gaatatcgtc gccttcgata caatttaaga 26761 agtagggaga gccaaggaga acagacgggt gctattaaaa ggctttgaaa tagagatgta 26821 cactggcacg cccagtggtg atatcgtcgg actctccgac aagattgttg gatctttgga 26881 tggatttgtc cgagaaccag atagtcgaaa tgtagaatac atcactgcac cattgcagaa 26941 ttacgaacag cttttatgtg ccttgttgcg tcctcggctt caactcagga atttcatcaa 27001 gcaattgggt gattacacgc tgattccagg aagcactcta tcgttagggg gcggcgatcg 27061 cttttttcgc tctgacccaa caaaccctta tcatgactac attgagcaaa catacggtac 27121 caaggttgtc accgccagtg ttcacattaa tgtaggcatt agcgatccag aagtcttaat 27181 gcgcgcgtgt cgggttataa gggtagaagc acctttgttc cttgccttga gtgcatcctc 27241 tccgtttctg aatggtaaag caactggtta tcattccaca cgctggggtg tttttccgca 27301 aacgcctcct caagtacctt tatttgaaag ccacgcccat catattcaat gggtggaaaa 27361 ccaattgatg gctggaacaa tgcaaaatgt tcgtcatctt tgggtgtcag ttcgaccaaa 27421 tggcgatcgc cgcccttatg atttaaatcg tttagagcta cgaatttgcg atttagtcac 27481 agatccaata gctttactgg ggatagcagc tttagtagag gcacgtttat tacaagttat 27541 caataatccc tctatcgatc cattaaccca aagcaccttc accccagacg aacttatttc 27601 cctaacctat gccaacgaaa ctgcagcagc gacttccagt ctcgatgctc aactccagca 27661 ttggcaagat ggtaggagta ttttagccag agattggatt ggtgaaatat atcaagatgt 27721 ttgggcgatc gctaagcagc atggttttgc ctgtttcctt tcaccattac aaaaaatctt 27781 gcgtgaaggt aacgaagctc aacaatggtt acaactgcac gcacttggct tgagtgaaag 27841 acacgttctc actcacgcta ttgacgtaac aagagaatgc gaagttcaat tagaagataa 27901 attgtgttcg tctttggttg cttagcacac ctcaccctcc gggtatgcct atggcacgtc 27961 tatgtccttt ggacacgcta cgcgtacccc taccgggggc gctttgcgcg accgacaacg 28021 gcagtttgct gtagcctgac cgaaccatgc acggcagttc ctcttgggga ggactgcctc 28081 accaccgcaa ctgcctcacc ataaataggg gtcggaaatc taaatgtgaa caaatctgga 28141 cgatgattta tcccttcagt 
acattccgaa cccttcttat ggaaatgtgt acttagcatg 28201 aaaattttgt ctttgtcagc tgtaagtgtt gtccaataca gctacacata acttaataat 28261 ctgttattaa tcttaatatt tagatcatag agtctgggac atactttatg tttgttttaa 28321 atttctaatc aatagattga tttttttatg attaagaata cttatatata atgtatatta 28381 aggtagattg agaaaagaaa aaaaaactgc attttcccag attttcaata tatttcttcc 28441 tccacaaatg ccgcagatcg ttttagttaa cccccttatt cctcctaata caggtaacat 28501 tgctcgcact tgtgctgcca caggtactga attacatttg gtaggacctt tgggttttga 28561 aattagcgat cgctacctca aaagagctgg cttagattac tggccttatg tcaaactgca 28621 ctatcacgaa tctctggaag cttttaaatc cgtacatcag gcacgtggcg gaagattatt 28681 aggttttagt gttaaaggta gttgcaacta cgtggatttt cagtttcaac cccatgattg 28741 gttgctgttt ggtagtgaaa ccactggctt accacttgag ataatatcag cttgcgatgt 28801 taccctctac attcccatga acgaaccaaa tgttcgcagc ttaaatcttt ctgtgagtgt 28861 agcagtgggt ctttttgaag ctcgccgtca attggggtta tagtaattca agtcgtttta 28921 aaacagtacc aatcgtcaac tgtagtcgtg catacgtgtg gtgagacgag tgctggtttg 28981 agggtaagcc gaaggctggt gaccaagaaa ggggtgtagg ggtgtaaggg tataagggag 29041 tgctaaactc ctacaccccc caaagtcgag tccccgactt gagtcgtgga gaaaaagaaa 29101 tatttgtttt gagcggaaaa tgcgggtttt aaagtgtttt ctaccgtgct gtacgaaatg 29161 ttttttaggt aaaaaagaaa tattatacag taaataagaa aaatctctga tacgttagta 29221 gtaaggagaa ctgaatggtg agcgcaaata cacagaaaac gaaactatat tgagtttgta 29281 ctcactccca gtactcctga ttcaccactg ttggagtaag aatgacttac agactggaaa 29341 aaagctataa gaaatttcta tagtaatttt gaaaaacagt tgaggtggaa tgatatattt 29401 tcataaattt gaattttctc ttgagtaggg taagaaaaca tcataaccca gtcagttcaa 29461 gagtttgagg tagtttacct catgtgtttt cttcatctgg gctatataat atgcgagtta 29521 atacggttgt ttttgggtaa taatcaacgt agagtctcgt gtaattgata cggttgatag 29581 aaaaaaaatc tgaaagatat ttacagaagg gtgtagtggt attccgtgga gtagcagcat 29641 ctgattagct caaagcaaga gaacatgcca aatgtgtgat tgtcagtcgt gcagcaaaag 29701 caacaaaagg ctcaagtgtc aaaaagaaaa cttatatgaa ctattaagat gaaaggtgtc 29761 agatgtgagc agaacaagtc aaggcaaaaa tttaatattt aaaataacga tgtgaacttg 29821 
tctaaatcag agaagcaggt taatcttgcg taagtatctc tggtgtatgt ggtgttgatc 29881 tattgcttgt agtgatctaa atcattgact agtactcaga ctggagaatt gagatcggtt 29941 aagcgcaagt gatcatagga ggtcgtcttt gaaacgagca ttaaaaaaca gagggagagc 30001 tgagttggaa aataccccag gtgatgatgt tccagtagaa ccgatagaca cagttaatcc 30061 aaaggttaac caatgtcgga tgccaacaac agccgcgatg attggcttgg caatatcaat 30121 gggagcaacc agccttttgg tgactcgaca aagcgaccaa gccactgcag cagaggcgtt 30181 aggttatcaa aatacaacct caacgattcc tgcttctaat aatgtggagg tgaaatttgc 30241 ttctactaag aaactaggtt cgcaagctgt ctcatcagta agcttgccag aaactggaac 30301 agtagtagaa ccaacagcaa tttcacagct aactgaacaa agagcaaaat ggcaagttgc 30361 agccaataaa gtgtcagtgc aaacttcgcc attagtagga attccaaatt cgctgacaac 30421 agcacaacaa agcattgctt gggaaaaaaa caacttccaa cagtccagaa agcagggagt 30481 acaaagactc tctcatgctg atggtattgc tagtgtgcaa accgtgtctt ctccatctgc 30541 gaaaccttca acagtagaag ttaataacac agaagaaccg gaagtaaatg ctcaactcaa 30601 agcgcagcag gaatttgcgc gaaatcagtt acaggaaaaa tcagaccgtc tcagaaaaag 30661 tttaactcaa tggcagtctg aacacaccaa agatttatca caactagctg cgaccaggtt 30721 agtacagccg atgactgtgg ctgggaaaat gtcccagaca agtactatca ctggtacatc 30781 gcagtcgaat atgactagcg atgtcagcag agcaaggctc gtgtcaaagt taaaacaaga 30841 atcagaggca caggtggcaa cagtaccagc tccaacagta ccagcttcaa cagtagttgc 30901 accaatagca ataacacaaa cagcaactgc agtatatgag gtcaagccag gagatacgat 30961 aggggcgatc gccaacgatt atggcatttc agtcctagaa ctcataaagg caaataatct 31021 taacaatcct catcagctac aaatcagcca aaaactgttt attcctgtgg cggaaaaccc 31081 cactactgcc cagccaactg tcgcaatgaa taagagtgct gttgcaggta gtggtagtag 31141 caacactaga gaaacaacag gaaactcact cattgccgac aacaggaata ttactgttcc 31201 aacatcagta gttggtaaca ctcagtttct gtcatacatt caatcaacaa ctactaactt 31261 accagaaaat actattacaa atagtcaggc ttccgattcc atcacatcat attatggtgt 31321 tggtggtgac agcccaatgc cacaagttgt tacagaacct caattggcac aaataccaac 31381 tgttacaaaa acaaaacagg taaaaaataa tcaacgtcta cgcagcttgc aattggaaat 31441 tgaaagattg cggcagaaat accgttctca ggaagctggt aatacagttg 
tgccagacga 31501 gaatgaagct aatgatgccc cagtaacagt tcccgatccc agtcgaaatg atggtgcggt 31561 gccaattgct gttcccagac caaataatcc agcagtgcaa atacctgtca gcgaacagaa 31621 caaggctgga attcccatac ccgttcccag accgatagcg ccgaactatg ttggcaaacc 31681 aggcaaaccc gtattcggcg ctaatcgaag acctaccaac gaacccatta atccggaatt 31741 cttaccaaat caagcaatag taacacctcc tacaggtatt gatgcttctc gcgcccttgg 31801 gcctatgcag ggaagaacag tttctccaca gttaccgcct ttagcagcag tggatagata 31861 cttgccaaga gtccttgacg aaaatacacc tatcccgtct agctcatcca cagcttatat 31921 gtggcctgca aaaggtaccc tgacctctgg ttatggctgg cgctggggaa gaatgcacaa 31981 gggaattgat attgctaact cagttggcac cccgatttac gcatcagctg acggtgttgt 32041 agaaaaggca ggttggagca gtggtggcta tggtaatttt gttgatatcc gccatcttga 32101 tggtagcatg actcgctatg gtcacaatag caagcttttg gtgcagcgcg gtcaacaagt 32161 acatcaaggt caaatcatag ctagcatggg tagcactggt ttcagcactg gtccccacag 32221 tcactttgaa atccatccaa gtggcaagga cgcagttaac ccaatcgcct tgctgtcaac 32281 agcacgtttg tagtagttgg caatctcgta cactgcttat agtgggtgag atgaaaaccc 32341 tgtggcttta gcccagggac gccacatgcc tcaacggggg gaacccccgc acggcagtgg 32401 ctcatgaaag ccacgacgta gcctttaggc taccgtcaag ttctttgttg agcaatatag 32461 tttttgaccg cctctgtgga tatgtttcct gccgtgctaa caaaatagct aggagtccac 32521 aaagttggta atttcaataa ctctggaaat tcttttcgca agaaatgtga tgcccgacct 32581 ttaatcttgt tcatgacagc cgctgggcaa tcagttactt tgacgttgac aaatagatga 32641 acgtggtcag gcataatttc catcgctaca atcttccacc gattctcgtt gcacacatcg 32701 cagatgattt cttgtaatct gtcggcaaca gcattgacca aaactttttt acggcgctta 32761 gggataaaga caaagtgata gtttaacaaa gctattgagt ttccctcatg cctatattcg 32821 tctggagtaa atttcgccat atttccggaa ttcctgttgc tgcgttgtat acatatattg 32881 tagaagaaac atagcgcata aacaaagagg aggtgattaa gtagtgctgg tactggaatt 32941 aaaagttaaa ggaaatcgtc aacaatacaa agcga // LOCUS NODE_839_length_32518_cov_5.03859832518 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 32518)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 32518)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage       :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference
                                                 protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA;
                                                 ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
source 1..32518 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 147..1637 /gene="crtH" /locus_tag="DP116_07250" CDS 147..1637 /gene="crtH" /locus_tag="DP116_07250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410567.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carotene isomerase" /protein_id="PRJNA477356:DP116_07250" /translation="MQSDVIVIGSGIGGLVTATGLATKGARVLVLERYLIPGGSAGYF ERQGYRFDVGASMIFGFGQNGTTNLLTRALQAVNVSLEVIPDPVQIHYHLPNGLNLKV DKVYEKFLQNLTAYFPNESQGIRQFYDECWKVFHCLNSMDLLSLEEPRYLMRVFFQHP LACLGLVKYLPQNAGDIARRYIKDPQLLKFIDMECYCWSVVPADMTPMINAGMVFSDR HYGGVNYPKGGVGQIAQKLVEGLVKAGGQIQYQARVTKIIIENRRAVGVQLANGQIHR AKRIVSNATRWDTFQKLLTQQELSVGERNWQERYQKSPSFLSLHMGVKAEILPKGTEC HHIVLEDWEKMTEPEGTLFVSIPTLLDPDLAPAGHHIIHAFTPHWIDEWKGLSVREYE VKKEEAAWRMIDRLEKIFPGLNAALDYLEVGTPLTHRRFLGRQDGTYGPIPRRKLRGL LGMPFNRTSIPGLYCVGDSTFPGQGLNAVAFSGFACAHRIAVDLGL" gene 1818..3281 /locus_tag="DP116_07255" CDS 1818..3281 /locus_tag="DP116_07255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994843.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07255" /translation="MDVSSRYWKIYRISTSQRVGYEHCLVPPAQEFLKQVRNLVPTNT QAVLLSYFVDQNSAVDVTTRAKAGLCLRSYVSEPILRACQKIDSLFSGDNFTYQDLLT YVLDDDGKTLVIVDGDGKTQLIVDNNGETRTTDYKFFSVRILQKFNADLESKMSLDNW AYFQTTQNPELKNYLAEFGFKHLSDWALLNRVRAKQFERLSKRDRHLVEVFHAVYRRD RLQKHSRGVRKCPEPSSTQLQEMLNGLRKRDVMINSTVELMNELKQVVTQLRHYDIWS YREPLEVQDPNTGGYTFRTDLPADNLNEVDIEEQEILDFLHEQLSLALKNAIEQEIGD RIKILKKSKNYAVFAQQFLPGLQFYYCQSLSLKDIAPKLEMTSWDQARRVLNPGELLS KVRTTTVKQLLDSILKKAEEKHLTKNPPEPDYLKTLAEYIEAFADDEIFREAAEEIRV GKNRSMNSFYARQLCKYIEQRTQNLRTCPPAVLLKWG" gene 3321..4370 /locus_tag="DP116_07260" CDS 3321..4370 /locus_tag="DP116_07260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198158.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07260" /translation="MLNSQVNSTDIRLLLSEVIWLELEDFDHAIDNSKLVNSEAQQWQ TYLNTLAMIGCEKWLSTRIPQKPISQETNVIEDIYHLEVGDLKICPIAIEHVLDEVIN IPKSAIDKSELAVHFYVVVEVLEEEEQVIIRGFLRYDELMNYCSQFNLELRDGYYQVP LSFFDAELNHLLFYYYFSEPLAIPLPVTSAQSSRVSLQKSLDNTRTKLSEWLEGVFEQ GWQTIDTLINPEANLALSTRITQRGAKKAKLIDLGVQLGYQTVALLVNITEESDEKFG VLIQVHPTGGERFLPPDLKLSLLSKAGKTLQEVTSRLQDNYIQLKFFKGESGKRFSVE LSLGENIKVKEDFEL" gene 4385..7252 /locus_tag="DP116_07265" CDS 4385..7252 /locus_tag="DP116_07265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318553.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07265" /translation="MAGKKPVFFRILKSIVSFKYPGRSWIAQPIVAAIAVFLCVIVSP VLAQAPLVNSSVHQFTIGEGKTQDLVQQGKKLYDAGQFTLAVKVLQQASAAFRTQGDH LREAMTLSNLSLAFQQLSLWNQAEKSIVQTVNLLKGLNNSQESSKILAQAFDVQGRLQ FSQGQAEAALTTWQKAAEIYQQMKDTAGLTRNRINSAQALQVLGRFRQAKKILTEVSQ TIQNQPDSSTKASGLLSLGNIMQVVGDLKQSQQVLQQSLALAKATSSDQMIGETLFSL GNLARAQYNTKMALDYYQQAASASTDSTTRIQAHLNRLSLLVETKQFADALALSSQIQ SEISNLPASRMVVEAKINFAQSLMKLNSCTSCKTLGPKVGNPPRVAKVFSTELNTDSP KFLVHFNGLGLLARNFSSGRATLSPSSPSLIETSAQILASAVQQAQSLLDLRAESYAL GTLGNLYEINQQFNDAQKLTEKALVMAQSLNASDIVYQWQWQLGRIHKQQGNKKGAIV YYSEAFKTLQTLRSDLVAINPNIQFSFRESVEPVYRELVALLLQTENESVQHQESQKP ENKKPKNQKSENQKNLQQARFVMESLQIAELDNFFRSACLTAKQELDPIVDKKDSRSA VFYPIILPDRLDVILKLPNQNLRHYKTVIAEDKVEGVVENLREYLGDVTRTSQVKQLS EQIYDWLIQPAEPELSQSGIKTLVFVLDGALRNIPMSVLYDKQQGKYLVEKYAIAVAP GLQLLDPKPLQQVKLNTIIAGVAAERAIENRKFPRLENVPRELQQIQSEVPKTEELLD QKFTENNLQNQLQSLPFTVVHLATHGEFSSDPEKTFVLTWDKLLKVKEFDNLLRVSDT KRSSTIELLVLSACKTAVGDKRAALGLAGVAVHAGARSTLATLWSVDDEYTADLMSRF YLELKAGVNKAEALRRAQLAVFAHQKNPYFWAPFVLVGNWL" gene complement(7407..8207) /locus_tag="DP116_07270" CDS complement(7407..8207) /locus_tag="DP116_07270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318554.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07270" /translation="MIKNKLFQRHYQLACTILLLSLGLMNNPLKVQANPEQSFQNKIT HTPNLFAQKFPDNGAPKGRRRGGTSRRDGCPSLKTPVTAIVPGEEKNNKSFLGSTVAE YPTFWVYLPELPTNLRSGEFVLQDDQGHDIYRTSLTLPPKAGTIGVSLPPNSQYALKQ NSKYHWFFQVYCGDPQNKPEYFFVDAWLERVTLTPQLQQQLKSAKSQEYKVYAANNLW YDAITNLAELRRTRSDATKLTKDWTSLFKAVDLEELAGVPILQVYSLQ" gene complement(8204..10741) /locus_tag="DP116_07275" CDS complement(8204..10741) /locus_tag="DP116_07275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transmembrane sensor domain-containing protein" /protein_id="PRJNA477356:DP116_07275" /translation="MNRLVVLNLGQGNLYEGFPVVTAYIGEADNLYQMKFSANLPAAR EIPELYHHWKSLYSAFYYRPFLRLGVQEIEEIDDTFEIEEDTVTNISEVEIKQLCEQL YNCLNLWLNSVEFRKIEQQLRTHLKPSEEIRFIVETNDNLLRRLPWHVWNWFDDYPRA ELALSASEYQQPQKFPKNIPKNQKSSMRILAIFGNSQEIDISQDRIFLENLLTGAEIQ FLVEPRLDELNNQLWQQGWDILFFAGHSCCTQEKGCLQLNQTDIITLDQLKYGLKQAI SRGLRLAIFNSCDGLGLAQQLQELHISQVIVMREPVPNVIAQKFLKYFLTEFSQGQSL YTAVRSARERLQGLEREYPCATWLPVICQNPAEPPMIWNRERQTERVDKVDSSVAVSE TKQRSRSYESNTLSGTIKQKLLDRHRFVVAATRRLLTVLLASVLVASSIMGLRHLGIL QPWELHSYDYLIHLRPADEKPDPRLLIITIDEADIQYQINKKMNMRWSLSDQALSQLL QKLEQYQPAAIGIDIYRDFPIDSNYPELGEVLQQDKRIFTVCKVSAPDDGAPLGVPSP SKVPRERISFSDFVADADRVLRRQLVQLTPPLESPCAAEYAFSFQLAQHYLNAQGIKW DINSEKNLQIGNTVFRRLQSHTSGYQGVDASGYQILLNYRSLPSLLKIAEKISLKELL NDEIIPELLHKSVKNRIVLIGVTASSPPPDDWETPYTAYASDQQKQTPGVFLQAQMVS HILSAVLDGRPLMWWWSQWFEVWWVWGWSLVGGIISWRILQPLHIGLAIVIALLTLFS ICFGIFTQAGWIPLVPSTLALVSCAVVLSILAPTHVSNSKKRWNSRI" gene 11458..16266 /locus_tag="DP116_07280" CDS 11458..16266 /locus_tag="DP116_07280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019503834.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07280" /translation="MSKSVFISLGSGDLHNGFPNVTARLWSSEKSLAQQFIGSLPAAP SLVELSKNWQSNYKNICGRQQLRSLLIEEDDELEIDEEGLTNVSVVSFDDLCQKLQEN INDWLKSSGFLNIERQLRSQLQPTEEIRLIIETNDRLIRRLPWHRWDFFNDYPKAEMA LSQTEYKSRELSKSILPRKKVRILAVLGNSQGIDLDREIKFLNSLLDAEIVFIVKPSR QEFNNLLWQNPGWDILFFAGHSQTEGETGRIYINDNKTNNSLTIKQLEEALKAAIDNG LQLAIFNSCDGLGLANALEKLNIPTVIVMREPVPNLVAQEFFKHFLESFAVERRSLYL AVQQARRKLQGLEDDFPGASWLPVICQNPAVEPPTWLKLGGTPPCPYRGLFAFREEDV DLFFGREQFTQNLVTASKRKPLVAVVGPSGSGKSSVVFAGLVPQLRQDSYTDWQIVSF RPGHNPFEALAGALTSSIGEALSLRGSQRDNLNEYINQNPKLTARRLIELKLEIALQQ DHKVLYKIIESFVQQNPKTRLVLIADQFEELYTLCSEEERQGFLDTLLNAVQFAPAFT LVMTLRADFYGYALSYRPFSDALQGAVLNLGPMNREELRSVIEQPAAQMQVRLEKELT KKLINAVEGQSGRLPLLEFALTQLWSKQTDGWLTHAGYEEIGGVEEALAIHAEAVYAQ LDETDRTRAQQVFMQLMRLGEGIEATRRLATRDEVKSENWDLVRRLADARLVVTNRNE LSGEETVEIVHEALIRSWGRLEGWIQVDGEFRYWQEQLRSLIRQWESSGKDQGALLRG KPLSDAEYWQSKRIDELSTGERHFIQLSLALRDNEINKLKRRRQLTILGLTGGLVGAL ILAGVAWWQSHKASISEIQTMTESSEALFASNNTLDALIQAITAKEKLKTIGTVDANI QDRLESVLRQATYTVVEHNRLIGHSEKVNAVAFSPNGQLIATASDDNTVKLWKPDGTL LTSLKGHNSPVFGVAFSPQGNIIATASGDKTVKLWKLDGTLLTTLNGHSDVVNAVAFS PQGNIIATASSDKTVKLWRASDGTLLTTLNGHSDAVSAVTFSPVGVASPQGFGQLIAT ASRDKTIKLWKQDGTLLTTLKGHSDVVSAVAFSPVGVASPQGFGQLIATASWDKTVKL WKRDGTLLTTLNNPSGKVYGLAFSPDGDTIASAGWDRTIKLWRWRDGESALQEGIPPQ ATGEPRSRSVPEGHRGTLLTTLNGHSDTVWGVAFSPDGKTIASASSDKTVKLWKRDKI LLTTLNGHSGAVWGVAFSPQGSIIATAGDDNTVKLWKPNGTLLKTLKGHNAGVWAVTF SPDGQTIASASGDKTIKLWKRDGALLTTLNGHSRQVNAVAFSPQGNIIASASDDYTVQ LHKSDGTLLTTLRHDNEVWGVAFSPVGVASPQGFGQIIASVTRDKTLKLWKQDGTLLT TVKGHNGGVGGVAFSPDGQTIATGSQDKTVKLWKGDGTLLTTLKDHYGTVWGVAFSPD GKMIASASDDKTVKVWKRDGTLLTTLNGHNGTVWRVAFSPDSKTIASTSDDKTVILWN LDRVLDTDKLVSYACDWVKDYLRTNAQHKQSVSGYSVQEASRFCSGIKPQ" gene 16586..16996 /locus_tag="DP116_07285" CDS 16586..16996 /locus_tag="DP116_07285" /inference="COORDINATES: protein motif:HMM:PF00076.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07285" /translation="MTILVGNLSSQVTEANLRELFTKFGTVRKIDIFPDSGFATVAIK GEANEDLAVQELNGVEQFGQKLKLFKSAPVQPSDNQRDGERRLLILHAPTHTARGLGD GPSPSPKPTKPSPPNKPKPRPLKAGLVSTSLSEN" gene complement(17060..18238) /gene="purT" /locus_tag="DP116_07290" CDS complement(17060..18238) /gene="purT" /locus_tag="DP116_07290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872795.1" /note="non-folate utilizing enzyme, catalyzes the production of beta-formyl glycinamide ribonucleotide from formate, ATP, and beta-GAR and a side reaction producing acetyl phosphate and ADP from acetate and ATP; involved in de novo purine biosynthesis; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphoribosylglycinamide formyltransferase 2" /protein_id="PRJNA477356:DP116_07290" /translation="MNNSMKLPQKLMLLGSGELGKEFVIAAKRFGNYVIAVDRYANAP AMQVADCSEVISMLSADDLEGVVSKYQPDLIIPEIEAIRTEKLLEFEQRGITVIPTAA ATNYTMNRDRIRELAHKELGIRTAKYGYATTLEELMSVSNEIGFPNVVKPVMSSSGKG QSVVKEKSEVEKAWNYAIANSRGDSQKVIVEEFINFEIEITLLTIKQWNAPTIFCSPI GHRQERGDYQESWQPAEISEKMIVEAQAIAKKVTDALGGAGIFGVEFFVTKDEVIFSE LSPRPHDTGMVTLISQNLNEFELHLRAILGLPIPNIEQLGYSATAVILASEKSDFIAY TGVAEALSEKDVDIKLFGKPNAHPYRRMGVALAKGSDIQEAREKATRAASKIKLEYQP " gene complement(18316..18819) /locus_tag="DP116_07295" CDS complement(18316..18819) /locus_tag="DP116_07295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995470.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07295" /translation="MIMLNIPTQNLRSRAIQFLEQSPQQRLENLKQLGIARYDFLTKM CLKEANIACVMRFFQNPTQLKFPNLIGADLSCLILDGVNFIRGNLSGANLQGTSLVNA DLIFANFTNADLRNANLNGATLNETIWLNALVEECEFGEAIGLTKVQRQDLQLRGAKF KYLEEDS" gene 19046..19582 /locus_tag="DP116_07300" CDS 19046..19582 /locus_tag="DP116_07300" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07300" /translation="MKGHLLTVATLFGIISLFWISFFAKVVRVKFVEIDFVLKRLEIQ KEAQVEIPYTVAKVDILRRLGSDVKPDPRCLLWATTEVGRGWTNDSEDRDFFIDYYIP PDKKAMICTTPALAAALLAKRHEKPLLYKVYPTEDGFHVRIVEGLSEVREPCKNWTGN VDCADPILSRQGVVRYEP" gene complement(19773..21701) /locus_tag="DP116_07305" CDS complement(19773..21701) /locus_tag="DP116_07305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874605.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_07305" /translation="MSLITLQSVKKDFGIKEILKDASFSLDATDKVGLIGTNGSGKST LLKMIAGLEPIDSGQILANSGAKIVYLPQQPDLDDNRTVLEQVFADSGEQMELVREYE EISDKLAHAAEDKQLMARLSSVMQRMDSLGAWELETNAKIILSKLGITDFHALVGTLS GGYRKRIALATALLSEPELLLMDEPTNHLDANSVEWLQSYLNRYRGALFLITHDRYFL DKVTNRIIEIDRGDIYTYTGNYSYYLEKKALAEESAISSQRKHQGVLRRELEWLKRGP KARSTKQKARIDRVHDLKETEFKQAQGKVDISTPSRRIGKKVIELNNTSKAYDGRTLI KDFTYEFSPEDRIGIIGTNGAGKSTLLDIITERVEPDSGSVEIGTTIHIGYFNQHSEE LQSALNENQRVIDYIKEEGEFVKIADGTRITASQMLERFLFPGNQQYSPIHKLSGGEK RRLFLLRVLMSAPNVLILDEPTNDLDVQTLAVLEEYLEEFSGCVIVVSHDRYFLDRTV DTIFTFEEGGNIRQYPGNYSVYLDYKQAEEAQQQQINSTKEKPKNTETLQTTSSKETE TKKRRRLSNWEKREFEQLEGKIAKMEAEKAEAEKAMANVSPGNYSQVQKLYEQVETLK EEIDEATERWMELAEIES" gene complement(21846..22262) /locus_tag="DP116_07310" CDS complement(21846..22262) /locus_tag="DP116_07310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311750.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1824 domain-containing protein" /protein_id="PRJNA477356:DP116_07310" /translation="MSSNNQSNLTVQDAKKILNKFNCIDIAPNLKPSEKTLIRQALVL LAKISDYQILGICADTPEEAILAMKTYSHAFGYEPPSNLPEIEGPVYIKLNGKNGVCY LDSYSGHHRGVLVSCQSYSQSGINEMYGHLPLDLFV" gene complement(22342..23184) /locus_tag="DP116_07315" CDS complement(22342..23184) /locus_tag="DP116_07315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="prohibitin family protein" /protein_id="PRJNA477356:DP116_07315" /translation="MKNQPLGNWQTLVAAIVVAILVILSLNSFTIINPGQAGVLSILG KARDGALIEGLHVIPPFISRVDVYDLTVQKFEVPAESSTKDLQTLSARFAINFRIDPT EVVDIRRTQGTLENIVNKIIAPQTQESFKIAAAKRTVEESITKRNELKEDFDMALGQR LDKYGIIVLDTSVVDLTFSPEFARAVEEKQIAEQRAQRAVYVAREAEQEAQAEINRAK GKAEAQRLLAETLKAQGGQLVLQKEAIEAWKTGGAQMPKVLVMSGDTKSSVPFLFNFG NMQD" gene 23322..25181 /locus_tag="DP116_07320" CDS 23322..25181 /locus_tag="DP116_07320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_07320" /translation="MVLKSHRGSRRNRRVLGDAPRGSRPKGERQTSSPNHPLMRLLDY GHQYRKRIWLSTTCSILNKLFDLAPPALIGMAVDVVVKQQDSIIAQWGVKDIFGQFFI LSFLTVIIWILESVFEYAYGRLWRNLAQDIQHDLRLDAYEHLQELELAYFEERSTGGL MSILSDDINQLERFLDVGANDILQVVTTVVIISGAFFILAPSVAWMAILPIPFILWGS FAFQKLLAPRYADVREKVGLLNSRLVNNLSGITTIKSFTSEDYEISRFTKESEAYRRS NAKAIKLSAAFVPLIRMLILVSFTALLLFAGMAAANGKISVGTYSVLLFLVQRLLWPL TRLGDTFDQYQRAMASTNRVMNLLDTPIAIHTGEIHLPTQTVRGELELTNVTFAYKDR PSIVTDLSLHVPAGKTIAIVGSTGSGKSTLVKLLLRLYEVQSGTIALDGIDIQQLNLQ DLRRCIGLVSQDVFLFHGSVAENIAYGSFDAVEDEIIMAAKIAEAHEFIVRLPQGYET IVGERGQKLSGGQRQRIAIARAILKNPPILILDEATSAVDNETEAAIQRSLEHITVNR TTIAIAHRLSTIRNASRIYVMEYGQFVESGTHQELLDKNGVYASLWRVQSGLR" gene 25213..25656 /locus_tag="DP116_07325" CDS 25213..25656 /locus_tag="DP116_07325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316522.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07325" /translation="MIFASTQSQSPNNKWRRQLDKFVKENQQELAALSWGLWLENADE KGTVGIYLQPTPHFVYCPRQAIEQLNSKVENRLQELVGIVEHHKPEVEVLMIAISKDE VKLIYFEPQLAPPTCYERVGKDVDTLLVYLEQLLSEQFNAEQSAQ" gene 25761..26435 /locus_tag="DP116_07330" CDS 25761..26435 /locus_tag="DP116_07330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017291004.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_07330" /translation="MNRLIESVHKDLLPRFRIESVDPHDPIKVHHVPKPWQLLGAGNY AAVVYHPDYPELVVKIYAPGRPGFFEEVEVYRRLGSHPAFSECLYANDCFLILKRLHG VTLYDCMQRGLRIPKQVIQDIDQALDYARKRGLHPHDVHGRNVMMYKGRGLVVDVSDF LHEEACSKWDDLKKGYYWLYRPLLSPLGIRIPYFVLDVVRRSYRFLVNLPSRLKQLGS GRRRRD" gene 26656..27633 /locus_tag="DP116_07335" CDS 26656..27633 /locus_tag="DP116_07335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sodium-dependent bicarbonate transport family permease" /protein_id="PRJNA477356:DP116_07335" /translation="MDASLIISNILNPPVLFFFLGMTAVFVKSDLEIPPPAPKLLSLY LLFAIGFKGGVELVKSGITQEVVFTLLAAVLMACFVPIYTFFILKLKLDVYDAAAIAA TYGSISAVTFITASAFLNELGIPFDGYMIAALALMESPAIIVGLILVNLFTVEQGKSR EVAWSEVLRDAFLNSSVFLLVGSVLIGFLTGEHGGKVLEPFTQGLFYGILTFFLLDMG LVAAKRIKDLQKTGFFLISFAILIPILNAAIGLAIAKFLGISQGNALLFAVLCASASY IAVPAAMRLTVPEANPSLYVSTALAVTFPFNIIVGIPVYLYVIKLFWSQ" gene 27635..27946 /locus_tag="DP116_07340" CDS 27635..27946 /locus_tag="DP116_07340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017297552.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="P-II family nitrogen regulator" /protein_id="PRJNA477356:DP116_07340" /translation="MHLVKKIEIIANSFELGKILDRLDKSGVHGHIVIRNVAGKGLRG TAEDLDMTMLDNVYIIAFSTPEQIKPVVENIRPLLNKFGGTCYISDVMEISSVKCVAS M" gene 27954..28676 /locus_tag="DP116_07345" CDS 27954..28676 /locus_tag="DP116_07345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbonic anhydrase" /protein_id="PRJNA477356:DP116_07345" /translation="MNPRKRFMERRDFLKLGATGAFGLMATAGNLLWSVEQAHAAELP PTVPKSLSPDLALQKLMAGNQRFVQHQLRHPDQSEIRLHEVAQAQHPFVTILSCADSR VPAEIIFDQGIGDIFDVRIAGNIATPEALGSIEYSVVLLGTPLLMVLGHERCGAVTAA VQNEALLGDIGSFVKAIKPAIKRVKDQSGDPVENAVVANVHYQIEQLKRSTLLTQRLE SGQLKLVGGRYDLDTGVVSIIT" gene 28821..29018 /locus_tag="DP116_07350" CDS 28821..29018 /locus_tag="DP116_07350" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07350" /translation="MGVKSSLMAIFSQAKAMTKKVVRRSQRRERVSRLAATAVQKPGV RMPPVRVTDDLGVLTAPTLIW" gene 29187..30563 /locus_tag="DP116_07355" CDS 29187..30563 /locus_tag="DP116_07355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741651.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3754 domain-containing protein" /protein_id="PRJNA477356:DP116_07355" /translation="MAVYKNREAFIPHSRTDIIQLCLQDGQLSATSAEKFKNFCQILS AYYHFRFHKTQETIKDNYAPFDPNTNVQPLTQPTFDQYKEMELKVVDAFKHILERANY IPLPASVVQESVGKASLIDLKTQVDFEDFELFCCYYQGDISKKISVKKLFFWEEEKII DVFERIVLLIKFKEEAYFSAKKVKIEELKFTPGKMYVYFYKNIPKLDIDLLFPNVATS MNWKDRLLFGIPAIGAAIPLLLKTLPNLLLLIAAILLVLNASSLVESLHVEQEKVRNV LPILVATLSLGMGLGGFAFKQYTNYKNKKIKFQKDITDTLFFKNLANNAGVFQTLIDI AEEEECKEIILVYYHLLTSPTPLNPEQLDSRIESWMEKKLGTKINFDINGPLNNLENI RGKAQRNASEADSAHQKPLLSYDNQGFCHVLPLENALAVIDDVWDNAFQYNGIALYGG SASALKFH" gene complement(31184..31558) /locus_tag="DP116_07360" CDS complement(31184..31558) /locus_tag="DP116_07360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07360" /translation="MSRIRLYLDEDTMKGALIQALRNADLDVVTVTDADRLGYPDEEQ LIWAVEQGRVIYSFNIRDFCKLHADFVVEQRNHAGIVLAPQQQYSVGQQLRGLLKLAA DKSAKEMVNQLVFLNAYVEKII" gene complement(31555..31848) /locus_tag="DP116_07365" CDS complement(31555..31848) /locus_tag="DP116_07365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016044754.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF433 domain-containing protein" /protein_id="PRJNA477356:DP116_07365" /translation="MQTITDIGTLIVRTPETCGGRPRIAGTRITVQYIVNEIKAGVTP EEILEDKPHLALAGIYSALAYYYANKESLDAEFAAYNEECRRLEAEYKAGNLS" gene complement(31971..>32518) /locus_tag="DP116_07370" CDS complement(31971..>32518) /locus_tag="DP116_07370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015783585.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="peptidase S1" /protein_id="PRJNA477356:DP116_07370" /translation="QGKFDEAIASYRKALQIDPNYATAHNNLGNALYNQGKLDGAIAS WQKALQIDPNNAVAHYNLGNALCKQGKLNKAIISYQKALKIDPNYAYAHYNLGLALYD QGKFDEAIASYRKALQIDPNYATAHNNLGVALKNQGKLDEAIAQLEIAVSLDPSSTLF SENLENYKNKKKGFWGRLFGG" BASE COUNT 9488 a 6658 c 6962 g 9410 t ORIGIN 1 agggaacagg gaacagggaa cagggaacag ggaacaggga acagggaata gggaagagaa 61 aaaatgctgt tggcggtata cctagtttca caaaaatctg cggaggagtc ctatatttac 121 ctaaacagac aaaaaaacta aaatgcatgc agagtgatgt gattgtcatt ggatctggta 181 ttggcggctt agtgacagca actgggctgg cgacaaaagg tgctagggta ctcgtgttgg 241 aacgttatct gattccagga ggcagtgctg gttactttga acggcaaggc tatcgcttcg 301 atgttggggc gtcgatgatt tttgggtttg ggcagaacgg tacgaccaac ttactcactc 361 gtgctttgca agctgtcaac gttagtttag aagtgattcc tgatccagta caaattcact 421 atcatctgcc taacggctta aacctcaagg tagataaagt ttatgaaaaa tttttgcaaa 481 atctgactgc gtattttcct aatgaaagcc aaggaattcg tcaattttat gacgaatgct 541 ggaaagtttt tcattgtctt aacagcatgg acttgctgtc gctagaagaa ccccggtatt 601 taatgcgtgt attttttcag catcctttgg catgtctggg tttggtaaag tatctacccc 661 aaaatgctgg agacatagca cgacgctaca ttaaagaccc tcagttgttg aaatttattg 721 atatggagtg ttattgctgg tcggtggtac cagctgacat gacaccaatg ataaatgctg 781 ggatggtatt ttctgacagg cactatggag gggtcaacta ccctaaaggg ggtgtaggac 841 aaatagccca aaaattagta gaggggcttg tcaaagcagg tggtcagatc cagtaccaag 901 ctagagtaac aaaaattatc atagaaaaca gacgcgctgt gggagtgcaa ctggctaacg 961 gtcaaatcca ccgtgccaag cgcatcgtgt ccaatgcaac acgttgggat acatttcaaa 1021 aattactgac tcaacaagaa ctttctgttg gtgagagaaa ctggcaggaa cgctatcaaa 1081 aatcacccag ttttctcagt ttacacatgg gagtgaaggc agaaattttg ccaaagggta 1141 cggagtgtca tcatatcgtg ttagaagatt gggaaaagat gacagaaccg gaagggactt 1201 tatttgtgtc tatcccaacg ttgcttgacc cagatttagc gccagcagga catcacatta 1261 ttcacgcctt cacacctcac tggattgatg aatggaaagg actttctgtt agggagtacg 1321 aggtgaagaa ggaagaagca gcttggcgaa tgattgaccg tctggagaag atttttcctg 1381 gtttaaatgc ggcgttggat tatctggagg 
tggggacgcc attaactcat cgccgctttt 1441 taggtcgtca ggatggtact tacggaccaa ttccgcgccg gaagttgcgc gggttgttgg 1501 ggatgccgtt taatagaaca tctatcccag gactttattg tgtgggagat agtacgtttc 1561 caggtcaggg tttgaacgca gtggcgtttt cggggtttgc ttgcgcccat cgcattgcag 1621 tggatttggg attgtaaacc atcccccctt cttaagagtt ccctattccc tgttccctgt 1681 tccctattcc ctgttccctg ttaagcgtgt ttcttcagag ttaacgagac tgggagaact 1741 gcataagtta aaaacgccgg cgcttatata tagagtgcaa cgatgtgaga ttggttattc 1801 tggagcctgc actcgctatg gatgtatcat ctagatactg gaaaatttac agaatcagta 1861 ccagtcaaag agttggatac gaacactgtt tagttcctcc agcacaggag tttctcaaac 1921 aagttcgtaa tctcgttcca actaataccc aagctgttct gctatcgtat tttgtggatc 1981 aaaattctgc tgttgatgtc acaactcgcg ccaaagcagg gctttgtttg cggtcttatg 2041 tttctgaacc gatcctaaga gcgtgtcaaa aaattgacag tttattcagt ggtgacaatt 2101 tcacttacca ggatttgcta acctatgtcc tcgatgacga tggaaaaact ttagttattg 2161 tagacgggga tggaaaaact caactcattg tggataacaa tggggaaact aggaccacag 2221 attacaaatt cttttctgtg agaattttgc aaaaatttaa tgccgactta gagtctaaga 2281 tgagtttaga caattgggca tacttccaaa cgacacaaaa ccctgaatta aaaaattatt 2341 tagcagagtt tggatttaag catctgagtg attgggcgct tcttaataga gtgagagcaa 2401 aacaatttga acgtttatca aagcgcgatc gccacttggt agaagttttt cacgctgttt 2461 atcgtcgtga cagacttcaa aaacattcaa ggggagtcag aaaatgtcca gaaccgtcaa 2521 gcactcagct gcaagaaatg ctcaacggct tgcgaaaaag agatgtgatg atcaactcca 2581 ctgttgagtt gatgaatgaa ctcaaacagg ttgttacaca actaaggcac tatgatattt 2641 ggagttatcg agaaccgtta gaagtccagg atcctaatac tggaggttat acttttagaa 2701 cagatttacc tgctgacaac ctcaatgagg tagatataga agagcaggaa atattggatt 2761 ttttacatga gcaacttagc ttagccttga agaatgcgat agagcaagaa ataggcgatc 2821 gcataaaaat tttgaagaaa agcaaaaatt atgctgtttt tgcccaacaa tttcttccag 2881 gtttgcagtt ctattattgt cagagtttat ctctcaagga tattgcgcca aaattggaaa 2941 tgacaagttg ggatcaagct agacgagtct taaatccagg agaactgctt agtaaagtac 3001 gcaccacaac agtcaaacaa cttctggata gcattctcaa aaaggctgaa gaaaaacatt 3061 tgaccaaaaa tcctccggaa ccagactatc tcaaaacatt 
ggctgaatat atagaagctt 3121 ttgctgatga cgaaatcttt cgagaagcag cagaagaaat cagagtgggg aaaaaccgtt 3181 cgatgaatag cttctatgct cggcaacttt gcaaatatat agaacaacgc acacaaaatc 3241 tacgcacatg tcctccagcc gtcctcttga agtggggcta agagaatctg aaacagaaaa 3301 caaaaatcag gagtaaaata atgcttaatt ctcaggttaa ttcaactgat ataagactct 3361 tgctctcaga agtgatttgg ctagagttag aagattttga ccacgccatc gataatagca 3421 aactagtcaa tagtgaagca caacaatggc aaacttatct aaatacactc gcaatgattg 3481 gctgtgaaaa atggctaagt acacgcatac cacaaaagcc aatttctcaa gagacgaatg 3541 ttattgagga tatttatcac ctggaagtag gagacttgaa aatttgtcca attgcgatag 3601 aacatgtatt ggatgaagtt atcaatattc caaaatctgc tattgataaa tcagaattag 3661 cagtccattt ctatgtcgtt gttgaggtgt tagaagagga agaacaagtt attattagag 3721 gatttttgcg ctacgatgaa ctgatgaatt attgcagtca attcaattta gagcttcggg 3781 atggttatta ccaagtaccg ctgtctttct ttgatgccga actcaatcac ttattgttct 3841 actactattt ctcggaacct ttagctattc ctttacctgt aacttccgct caatcttcga 3901 gagtctcact ccagaaatct ttggataata ccagaaccaa gttgagtgaa tggttagagg 3961 gagtgtttga acaaggctgg caaacaattg acaccctgat aaatccagag gcaaatttag 4021 ctcttagtac tagaattaca caaagaggtg ctaaaaaagc caaactcatt gacttgggag 4081 tgcaactcgg ttatcaaact gtggctcttt tggtaaatat cacagaggaa agcgatgaaa 4141 aattcggtgt tttgatccag gtacatccca cgggtggaga aagatttttg ccgcctgatc 4201 tgaaattgag tttgctgtct aaagcaggaa aaacccttca ggaagtcact tcgaggttgc 4261 aagataacta cattcaactg aaatttttca aaggggaatc aggaaaacgc tttagtgttg 4321 agttaagttt aggggaaaat ataaaggtga aagaagactt tgaattgtag taaggggaaa 4381 gagcatggcg ggaaaaaagc ctgtattttt ccgtatccta aaatccattg tttcatttaa 4441 atacccaggt agaagttgga tagctcaacc aatagtcgca gcaattgcgg tgtttttatg 4501 cgtgatcgta tcaccagtgt tagcacaagc tcctcttgtc aattctagtg tccatcaatt 4561 tacaattggc gagggtaaaa cacaagacct tgtgcaacag ggtaaaaaac tttatgatgc 4621 tggacagttt actcttgctg ttaaagtctt gcaacaggct tctgctgctt ttagaactca 4681 gggagatcat ttacgcgaag ctatgacttt gagcaatctc tctttagctt ttcaacaact 4741 cagtttatgg aaccaagctg aaaaatcaat tgttcaaact gttaatttat 
tgaaaggttt 4801 aaacaactct caagagagtt ctaaaatttt ggcacaagcc tttgacgtgc aaggaaggtt 4861 acaattttcc cagggacaag ctgaagcagc attgacgact tggcaaaaag ctgctgaaat 4921 ttatcagcaa atgaaagata ctgctggatt aacccgtaac cgcattaact cggctcaagc 4981 tctacaagtt ttaggacgat tccgtcaggc aaagaaaatt ttaactgaag tttcgcaaac 5041 tattcaaaac caaccagatt catcaacaaa agcatcggga ctattgagtc ttggcaacat 5101 tatgcaagtc gttggtgatt taaagcaatc tcagcaggta ttgcagcaaa gtttagcgct 5161 tgcaaaagcg acgtcatctg atcaaatgat cggtgaaact ctctttagct tgggaaatct 5221 cgcccgtgcc cagtataata cgaaaatggc attggattac taccagcaag ctgcaagtgc 5281 atcgactgat tcaaccacac gtattcaggc acatctgaat cgactcagct tacttgtaga 5341 gacaaagcaa tttgcagatg cattagcatt atcgtcccaa atccaatctg aaatcagcaa 5401 cttgcctgca agtcgcatgg tagttgaggc aaaaattaac tttgcccaaa gtttgatgaa 5461 actgaactct tgcaccagtt gcaagactct tgggccaaaa gttggaaacc caccacgagt 5521 agcaaaagtc ttctcaacag aactaaatac cgactcacca aagtttttag tccattttaa 5581 tggacttgga ttattagccc ggaacttcag ttctgggcgg gcgacgttat ctccctcatc 5641 cccttctctc atagaaactt cagcccagat actcgctagt gcagttcaac aggcgcaaag 5701 tttactagac ttgcgagcag aatcttatgc ccttggaact cttgggaatc tgtatgaaat 5761 aaatcagcaa tttaatgatg cccaaaagct gactgaaaaa gctttggtta tggctcaaag 5821 tctgaatgca tcagatatag tttatcaatg gcaatggcaa ttaggacgta tacacaagca 5881 gcagggaaat aaaaaaggcg caattgttta ctattctgaa gcttttaaga ctctccaaac 5941 tttacgtagt gatttggttg ctatcaatcc aaacattcag ttttctttcc gcgaaagtgt 6001 agaacctgtg tatcgggagt tagtcgcatt gctgttgcaa acagagaacg agagtgtcca 6061 acaccaggag agtcaaaagc cagaaaacaa aaagccaaaa aaccaaaagt cagaaaacca 6121 aaaaaactta cagcaagctc gttttgtgat ggaatcgctg caaatcgcag aattggacaa 6181 cttttttcgc tcagcttgtc taactgcaaa gcaagaactt gatccaatcg ttgataaaaa 6241 agattcacgc tcagcagttt tttatcctat tattctacca gaccgtttag atgttattct 6301 caaattgcct aaccaaaact tgcggcacta caaaactgtt attgctgaag ataaagtaga 6361 aggtgttgta gaaaatttga gagaatattt aggcgatgtc accagaactt ctcaggtaaa 6421 gcagctgtcc gaacagatat atgactggtt aattcaacca gctgaaccag aattaagcca 
6481 aagtgggatc aaaaccctag tgtttgtcct agatggagcg ttgcgtaaca taccaatgtc 6541 agttctttat gacaaacaac aggggaaata tttggttgaa aagtatgcga tcgccgtcgc 6601 ccccggttta caactccttg atcccaaacc gttgcagcaa gtcaagttaa acaccattat 6661 tgctggagtc gctgcagaac gcgccattga aaaccgaaag tttcctcgac ttgagaacgt 6721 accacgggaa ttgcaacaaa ttcagtctga ggtacccaaa actgaagaac ttttagatca 6781 aaagtttacc gaaaacaacc tgcaaaatca attgcaatca cttcccttca ccgtagttca 6841 cctagcgact catggggagt ttagttccga tccagaaaaa acctttgttc tcacctggga 6901 taaactactt aaggtgaaag aatttgataa cttactgcga gtcagtgaca caaaaaggtc 6961 gagtactatt gaattgcttg tcttgagtgc ttgtaaaaca gctgttggag acaagcgagc 7021 cgctttggga cttgctgggg tagctgtaca tgcaggtgca cgtagtacac tagcaacatt 7081 gtggtctgta gatgatgaat atacagcaga tttaatgagt cggttttacc tagaattgaa 7141 agcaggagtg aataaagccg aagcgctccg gcgtgctcaa ctagctgttt ttgcccacca 7201 gaaaaatcca tatttctggg caccttttgt attagtgggg aattggctct aatgggagca 7261 agcagcagaa gccagccaag aagaactatg aggaatcgca ttcgatggtt gtgcgactaa 7321 gaaaatgtcg cccccctgtg gtttatttac ttcaaaagcg ctgtaataaa ccggttttta 7381 aaaaaactga gtttctcaaa tcgttattat tgcagactgt aaacttgtaa aatcggcact 7441 cctgccaact cttccaagtc aacagcctta aacaagctcg tccaatcttt agtcaacttg 7501 gtagcgtccg aacgagtgcg gcggagttct gctaaatttg tgatggcatc gtaccaaaga 7561 ttattcgcag catagacctt atactcttgt gattttgcac tcttcaattg ctgctgaagc 7621 tggggagtca gcgtcacgcg ttccagccat gcgtccacaa aaaaatactc aggtttgttt 7681 tgcggatcac cacaataaac ttggaaaaac cagtgatatt ttgagttttg cttgagagca 7741 tattgtgaat ttggcggtag gctaacgcct atggtacctg cttttggtgg caatgttaga 7801 gaagttcggt aaatatcatg accttggtca tcttgcagga caaactcccc agaacgcaaa 7861 ttcgtaggta attcaggaag ataaacccaa aatgttggat attcagcaac tgttgatccc 7921 aagaacgatt tgttgttttt ttcttcacca ggtactatgg cagtcacagg tgttttcaag 7981 cttggacatc catcacgacg gcttgtccct cctcgccgac gacctttggg agcaccgtta 8041 tcaggaaatt tctgggcaaa aaggtttggt gtatgagtga ttttattctg gaatgattgt 8101 tctgggtttg cttgcacttt tagtgggtta ttcataagcc caagacttag caaaaggata 8161 
gtacaagcta gttgatagtg tctttgaaat aacttatttt taatcatatt cgagagttcc 8221 aacgtttttt gctgttgctg acatgggtgg gcgctagtat gcttaacaca accgcacaag 8281 acactaatgc taatgttgag ggaaccagtg gaatccaccc cgcttgagta aatataccaa 8341 agcagatact aaaaagtgtt aacagcgcta tgacaatcgc taatccaata tgaagtggct 8401 gtaaaattcg ccaagatata ataccgccca ctaatgacca accccatacc caccatacct 8461 cgaaccattg cgaccaccac cacattagag gtctaccatc caaaacagca ctgagaatat 8521 ggctcaccat ctgcgcctgt aaaaatacgc ctggggtttg cttttgttga tctgaggcgt 8581 aagcagtgta tggagtttcc caatcatccg gaggaggact ggaggctgtc acgccaatga 8641 gaacgatgcg attttttact gacttgtgta ataactcagg aataatctcg tcattaagaa 8701 gttctttgag agagattttt tcggcaattt taaggagaga aggtaaagaa cgatagttca 8761 gcaggatttg atatcctgat gcatcgactc cttggtagcc gcttgtatgg gattgtaagc 8821 gccgaaaaac agtgtttcct atttgcaaat ttttttcgga attaatatcc cattttatac 8881 cttgtgcatt taagtaatgc tgtgctagct gaaagctgaa agcatattca gctgcacaag 8941 gagattctaa aggaggggtt aattgcacaa gttggcgacg aagaactcga tcagcgtcag 9001 ccacaaagtc actaaaacta atgcgttctc ttggaacttt ggaaggagac gggacaccta 9061 atggtgcacc atcatcagga gcagaaactt tgcacactgt aaatatgcgc ttgtcttgct 9121 gcaagacttc gcctaattct ggataattag aatcaatggg gaaatcacga taaatatcta 9181 taccaatcgc tgctggttga tattgctcca gtttttgcaa gagttgactg agtgcttggt 9241 ctgataatga ccaacgcata ttcattttct tattgatttg atactgaata tccgcctcat 9301 caatagtgat tattaatagg cgcgggtctg gcttttcatc agctggtcgc agatggatta 9361 aataatcata agaatgcagt tcccaaggtt gtagtatccc taaatgacgc agtcccataa 9421 tactgctggc taccagcaca cttgctagca gcacggttaa cagacgacgg gtagctgcaa 9481 caacaaagcg atgtctatct aacagctttt gctttattgt tccagaaaga gtattgcttt 9541 cataagagcg agaacgttgt tttgtttcag aaaccgccac tgaggagtct accttgtcta 9601 cacgttcagt ttgacgctcc cgattccaaa tcatcggcgg ttcagcagga ttttgacaaa 9661 tcacgggtaa ccaagtcgcg caaggatatt ctcgctctaa tccttggagt ctttctcgtg 9721 cagaacgtac cgcagtatac aaagactgtc cttgtgaaaa ttcagtaaga aaatacttta 9781 aaaatttttg agcaatgacg ttcggtactg gctcacgcat cactataact tgcgaaatat 9841 gtaattcctg 
tagctgctgc gccaacccca agccatcaca agagttaaaa atcgccaaac 9901 gtaatccacg acttatagct tgcttcagac catactttaa ttggtcaaga gtaattatat 9961 ccgtttgatt aagttgcaag caaccttttt cttgagtgca acaactatga ccagcaaaaa 10021 aaagaatatc ccaaccttgt tgccaaagct ggttattaag ctcgtctaac cttggctcaa 10081 ctaaaaattg aatttctgca ccagttaata aattttctaa aaaaattcta tcctgactaa 10141 tatcaatttc ttgactgtta ccaaatattg ctagtattct catactagat ttttgatttt 10201 tcggaatatt tttcggaaat ttctgtggtt gttgatactc ggacgcactc aaggcaagtt 10261 ctgctcttgg ataatcatca aaccaattcc acacatgcca gggtaatcgt cgcaataaat 10321 tgtcatttgt ttcaacaatg aagcgaattt cctcagatgg ttttaaatgt gtgcgtaatt 10381 gctgttcaat tttgcgaaat tctaccgaat tgagccataa gtttaaacag ttgtagagtt 10441 gctcgcataa ttgcttaatc tctacttcag aaatgttagt aacggtatct tcttcaattt 10501 caaaagtatc gtcaatttcc tcaatttctt ggacaccaag acgtaaaaat ggacgataat 10561 aaaacgcaga ataaagtgac ttccaatggt gatagagttc aggaatttct cgtgcagcag 10621 gtaagttagc actaaatttc atctgataga gattgtctgc ttccccaata tatgctgtga 10681 caacgggaaa accttcgtat agatttcctt gacccaagtt taaaactacc aacctattca 10741 tgggtgtctt aggtatctaa tcaaaaaagt gtgatctaat acatttttga aaattaatta 10801 accaaactta attttcaact tttaacaggg aacgcttaac aaaaaacgct taacagggaa 10861 cgcttaacag ggaacgctta acagggagca gggaactctt tagaagggag atgatgtttc 10921 tttcattcgt tgcgagcgtg taaccgccct agttaaaggc tcatatcttg cactcatcgg 10981 tagacaccac gaagaagtta agatctgtgg cagaaaatta catccttttt ttgactttac 11041 ttgaaatcta acgaatggca gccctgagat ttgctcatct aattgacata taatgtggga 11101 gatacaaaat gtcagtcatg aaaacgctta ccaaaagtga aacacatttg tataaaatat 11161 gactacattt tgttaacttt gaaacaatcc tagatgagtt caatcataga acatcataca 11221 agagattgaa ctagactaca atttacttat agacaaaata aaataacaga ttgatttcaa 11281 gatagctgat ctcgaaatca tgataaaaat tattctccac tacactagga gaaatcgcga 11341 ttccgaaatc caatccatta atttataaat cttatctttg atgtgaagat acgaaaaaga 11401 agaaactgaa tgtgtagcaa agagcatacc tgcaaataag acagagaatt cttcttcatg 11461 agtaaatcag tttttattag cctaggaagc ggtgatttac acaacggatt tccaaatgtt 
11521 actgctcgat tatggtcatc agaaaaatcc cttgcacagc aatttattgg tagcttaccc 11581 gcagcaccat ctttggtgga attatctaag aattggcagt caaattataa aaatatctgt 11641 ggtcgccaac agttgcgctc tttattaatt gaagaagatg acgaactgga aattgacgaa 11701 gagggcctaa caaatgtctc tgttgtcagt tttgatgatc tgtgccaaaa attacaagaa 11761 aatattaatg attggctcaa atctagtgga tttctgaata tagaacgaca gttgcgatcg 11821 caactacaac caacagaaga aattcgcctc attattgaaa ccaatgatcg tctgatacgg 11881 cgattacctt ggcatcgttg ggattttttt aatgattatc ccaaagcaga aatggcgctt 11941 tctcaaacag agtataaatc ccgagaatta tcaaagtcaa tattacctag gaaaaaagtt 12001 agaattttag cagttttagg caatagccaa ggtattgatt tagacagaga aatcaagttt 12061 cttaacagct tgttagatgc agaaatagtt tttattgtca agccctctcg tcaagaattt 12121 aacaatcttc tttggcaaaa tcctggctgg gacattcttt tttttgcggg tcatagtcaa 12181 actgagggtg aaacgggcag aatttatatt aatgataata agactaataa tagtttaaca 12241 attaaacagt tagaagaagc tctcaaagct gctattgata atggtttaca actagcaatt 12301 ttcaactcct gtgatgggct aggactagct aatgctctag aaaaattaaa tattccgaca 12361 gtcattgtca tgcgggagcc agtgccaaat ttggtagcac aggaattttt taaacatttt 12421 ttagaatctt ttgcagttga acgacggtct ttatatttag cagtccaaca agcacgcagg 12481 aagttacaag gtttggaaga tgattttccg ggtgcttctt ggttacctgt gatttgtcaa 12541 aatccagctg tggaaccgcc gacttggctg aagttagggg gaactcctcc atgtccttat 12601 cgcgggttat ttgcctttcg cgaggaagat gttgacctat tttttggacg ggagcaattt 12661 acacagaact tggtgacagc gagcaaaaga aagccattgg tggcggtggt tggtcccagt 12721 ggaagtggta agtcgagtgt cgtgtttgct gggttggttc ctcagttacg ccaggattca 12781 tatactgact ggcagattgt ctcatttcgt ccgggtcata atccttttga agcgttagca 12841 ggggcattga cttctagcat tggtgaagcc ttatcgcttc ggggttctca acgagacaac 12901 ttaaatgaat atataaatca aaatcccaaa ttaactgctc gtcgcctgat tgaattaaaa 12961 ctagaaatag cactgcaaca agatcacaaa gttttataca aaattataga aagctttgtc 13021 cagcaaaacc ccaaaactcg tctggtttta atcgcagacc aatttgaaga actctacact 13081 ctttgctcag aagaagaacg ccaaggtttc ttagatacat tacttaatgc agttcagttc 13141 gctccagcat ttactttagt tatgacctta agggctgatt 
tttacggata tgccctttct 13201 taccgacctt ttagtgatgc gttgcaagga gcagttctca atcttggtcc aatgaaccgt 13261 gaggaattgc gatcggttat tgaacagcca gccgcacaaa tgcaggtgag actagaaaaa 13321 gagttgacaa agaaactgat taatgctgta gagggtcagt caggacgttt accattgctg 13381 gagtttgctt taacacaact gtggtcaaaa caaacagatg ggtggctgac tcatgcaggc 13441 tacgaagaga ttggcggtgt cgaggaggct ttggctatcc acgccgaagc agtgtatgct 13501 cagcttgatg aaacagaccg aactcgggcg cagcaagtgt ttatgcagtt gatgcgtttg 13561 ggggaaggaa tcgaggcgac gcggagattg gcaactcgtg atgaggtgaa gtcagaaaac 13621 tgggatttgg tgaggcgctt agctgatgca cgtcttgtgg taaccaaccg caacgagtta 13681 tcgggtgaag aaacggtgga aattgtgcat gaggcgctga ttagaagctg gggacgccta 13741 gaggggtgga tacaagttga tggtgaattt cggtattggc aggagcagtt gcgatcgctc 13801 attcgtcaat gggaaagtag tggtaaagat caaggagcac tgctgcgtgg aaagccactt 13861 tcggatgcag aatattggca gagcaaacgc atagacgaac tgagtacagg ggagagacat 13921 ttcatacagc tttctttagc attgcgggat aacgagataa ataagctcaa gcgtagacgc 13981 caactcacca ttttaggact cacaggtggt ttagtgggag ctttgatcct ggctggggta 14041 gcttggtggc aatcccataa agcaagcatc agcgagattc aaactatgac cgaatcttct 14101 gaagcattgt ttgcctcgaa taacacattg gatgcgctca tacaagcaat cacagcgaag 14161 gagaaattaa aaacaatagg tacagtagat gcaaatatcc aagatcggct tgaatcagtg 14221 ctaagacagg caacttatac ggtagttgaa cataaccggc tgatagggca tagcgaaaaa 14281 gttaacgcag tcgccttcag ccccaatggt cagctcattg ccacagcgag tgatgataat 14341 acggtgaaac tctggaaacc tgatggcacg ttgcttacca gtttaaaagg acacaatagt 14401 ccagttttcg gagttgcgtt cagccctcaa ggtaatatca ttgccacagc aagtggagat 14461 aagacagtca aactctggaa gcttgatggt actttgctta ctactctaaa cggacatagt 14521 gatgtagtta acgcagttgc gttcagtcct caaggtaaca ttatagccac agcaagtagt 14581 gacaaaacag tcaaactctg gagagctagc gatggcacct tgcttactac tctaaacgga 14641 catagcgatg cagtgagtgc agtcactttt agccctgttg gcgtagcgtc cccgcagggg 14701 tttggtcaac tgattgctac agcaagtaga gacaagacca tcaaactttg gaagcaagat 14761 ggcaccttgc ttactactct aaagggacat agcgatgtag tcagtgctgt agcttttagt 14821 cctgttggcg tagcgtcccc 
gcaggggttt ggtcaactga ttgctacagc aagttgggac 14881 aaaacagtca aactgtggaa gcgtgatggc accttgctga ccaccctcaa taatcctagc 14941 ggtaaggttt acggattagc gttcagtcct gacggtgata caatcgcctc ggctggttgg 15001 gacaggacaa taaaactctg gagatggcgt gatggtgagt cagcgctgca ggagggtatc 15061 cctccgcagg cgactggcga accccgttcg cgtagcgtgc ccgaagggca taggggcact 15121 ttgctgacta cccttaatgg acatagcgat actgtttggg gagtcgcgtt cagccccgat 15181 ggtaagacaa ttgcttcggc aagtagcgac aaaacagtca aactctggaa gcgggataag 15241 attttgctga ctacccttaa tggtcacagt ggtgcagttt ggggagtcgc gttcagccct 15301 caaggtagta tcattgccac agcaggtgac gacaacacgg tgaaactctg gaagcccaac 15361 ggcactttgc tgaagactct aaagggacat aatgccgggg tttgggcagt aacattcagt 15421 cctgacggtc agacaattgc ttcagcaagt ggggacaaga caatcaaact ctggaaacga 15481 gatggcgctt tgttgactac ccttaatgga catagtcgtc aggttaacgc agttgcattt 15541 agccctcaag gtaatatcat tgcctcagca agtgacgatt acacggtaca actccacaag 15601 tcagatggca ctttgctaac taccttgaga catgataatg aagtttgggg agtagcgttt 15661 agtcctgttg gcgtagcgtc cccgcagggg tttggtcaga taattgcttc ggtaactcgg 15721 gataagacgc tgaagctttg gaagcaagat ggcactttgc tgaccaccgt taaaggtcac 15781 aatggcggag ttgggggagt cgcgttcagc cctgatggtc agacaattgc tacagggagt 15841 caagataaga cagtgaaact gtggaagggt gatggcactt tgctgactac cctcaaggat 15901 cactatggca cggtttgggg agtagcgttc agtcctgacg gcaaaatgat tgcttcagca 15961 agtgacgaca aaacagtgaa ggtctggaaa cgagatggca ctttgctgac taccctcaat 16021 ggtcacaatg gcacggtttg gagagtggcg ttcagtcccg acagtaagac aattgcttcg 16081 acgagtgacg acaagacagt gattctgtgg aatttggatc gtgttctaga tacagataaa 16141 ctagtgagct acgcttgtga ttgggttaag gattatctca gaacaaatgc acagcacaag 16201 cagagcgtta gcggatattc tgtccaggag gcgagccgtt tctgtagtgg gattaaacct 16261 caatagtttc aagcaacatg attgcatgtt tcccattttt atttcttaaa tagacatctg 16321 aggaaaacaa tgtagagacg ttccatagaa cgtctctaca agggttgcag acaacgcaca 16381 attaatttct ggagatgtct aatgcataag ttagttttgc ttacacctaa taaagagtgt 16441 agaggcaaag ctctctaagc tgaactgacc ttcagttttt cagatcaagc ttcattctat 16501 
taattaaact tttgtcaatg cgtaaatcct acgcattgat tccactttgg attatctttt 16561 tgtgtaactt tgtttggaga taaatatgac tatattggtt ggtaatcttt cttctcaggt 16621 cactgaggca aatttaagag agctttttac aaaatttggc acagtcagaa aaattgacat 16681 ttttcctgac tccggttttg caacagtagc aataaaaggt gaagcaaatg aggatcttgc 16741 tgttcaagaa ctcaatggcg tggaacagtt tgggcagaaa cttaaactct ttaagtcagc 16801 cccagttcag ccatcagaca accaacgcga tggagagagg agattgctga tactacacgc 16861 accaacacat acagcaaggg gactaggtga tgggccaagt ccatcgccca agccgacgaa 16921 gccctcgccc cccaacaagc cgaagccccg tcctctaaag gctggcttgg taagcacatc 16981 attatctgaa aactaattcc aatattaatg agaacgtaag cgttactgat ccactggaaa 17041 ctcgcacctt tctaataaat cagggttggt attccaattt aattttactt gctgctctag 17101 tagccttttc ccgtgcttct tgaatatcac tgcctttagc caaagccact cccattcgac 17161 gataaggatg ggcattaggt ttaccaaata gcttaatgtc cacatctttt tctgatagcg 17221 cctcggctac acctgtataa gcaatgaaat ctgatttttc tgaggctaaa atgacagcag 17281 tagctgaata acctaactgt tctatattag gaataggcaa gcctaaaatt gctctcaagt 17341 gtagttcaaa ttcgttaaga ttttgcgaga ttaatgttac cattcccgta tcatgtggtc 17401 tgggggaaag ctcggaaaaa atgacttcgt ctttggtcac aaaaaactcc acgccaaaaa 17461 ttcccgctcc tcccaatgca tcggtgactt ttttagctat agcctgcgct tccactatca 17521 ttttttccga aatttctgct ggttgccaag attcttgata atctcctctt tcttggcgat 17581 gaccgatagg agaacaaaaa attgtcggtg cattccactg cttaattgtg agtaaagtaa 17641 tttcaatttc aaaatttata aattcttcga caatcacctt ttgactgtca cctctggaat 17701 tggcgatcgc ataattccac gccttctcaa cttcactctt ttctttaact acagattgtc 17761 ccttaccaga agatgacatc acaggtttca ctacattggg aaacccaatt tcattcgaga 17821 cagacatcaa ctcttctaaa gttgtcgcat aaccatactt agcagtgcgg atgcctaact 17881 ctttatgtgc taattctctg attctgtctc ggttcattgt atagttagtc gcagccgcag 17941 ttggtatgac tgttattcct cgctgctcaa attccaaaag tttttctgtt ctaattgctt 18001 caatttctgg tataattaaa tcaggctgat atttgctaac aaccccttcc aaatcatcag 18061 cactcagcat agaaatcact tccgaacagt cagcaacttg cattgctgga gcattggcat 18121 agcggtcaac agcaatcaca taattgccaa accgtttagc agcaataaca 
aattctttgc 18181 ctagttctcc cgaacctagc aacattagtt tttggggtag cttcatcgaa ttattcatta 18241 atttctccta gttagttgca agtcaaaaaa cttaattctt tctttacttt ttttacacct 18301 ttgcacttcg ttatttcaag aatcttcttc caaatactta aatttggcac cgcgaagttg 18361 taaatcttga cgctgaacct ttgtcaatcc aatcgcttct ccaaactcac actcctctac 18421 aagagcattt aaccatatag tctcattcag agtagcacca ttcaaatttg catttctcaa 18481 gtctgcgttt gtaaaattgg caaatataag gtctgcattc actaaactag ttccttgtaa 18541 gtttgcacct gataaattac cacggataaa gttcactccg tctaaaatca agcaagataa 18601 atccgctcct attaaattag ggaattttaa ctgagtggga ttttggaaaa atcgcatgac 18661 acaagctata tttgcttcct tgagacacat cttggttaag aaatcatagc gggctatacc 18721 aagttgcttg agattttcta gacgttgttg gggactttgt tctaaaaact gaatagcgcg 18781 actacgaagg ttttgagtag gaatattgag cataatcatc aaattgtcgt aatactattg 18841 atgtttaaaa tctttaatgg ctagttcttt ggtggaacca ctgaaaagct ttgctattat 18901 cagtagtgtg ttagttatat atatgaaatt taaaaatttt gtgtcaaaat ttcatccggc 18961 taagagtcag taccattagt aaagttaggt aattacaggg tttgttattg aagtcactat 19021 gagaaaaata atcctaccgg ataaaatgaa aggtcatcta ctaacagtcg caacattatt 19081 tggcattatt tcactattct ggatcagctt tttcgctaaa gttgtgagag taaaatttgt 19141 cgaaattgat tttgtactta aaagattaga aattcaaaag gaagcacagg ttgaaattcc 19201 ttatactgtt gctaaagttg atatactcag gcggttaggt tctgatgtga aacctgatcc 19261 tagatgttta ttgtgggcga caactgaagt aggtagaggc tggactaatg attcagaaga 19321 ccgtgatttt tttatagatt attatatacc tccagataaa aaagcaatga tttgtacaac 19381 accagcccta gcagctgcac tccttgctaa acgtcatgag aaaccccttt tatataaagt 19441 ttatccaaca gaagatggtt ttcatgttcg cattgttgaa ggtctttcag aagtgagaga 19501 accttgtaag aattggacag gaaatgttga ttgtgcagat cctatattgt cgcgacaggg 19561 agttgttagg tatgaaccgt agaagataat acggcggaag tcggaagtaa aaaagctttc 19621 tgtgtggctt ttaaagctta ttgtttgtac tttatgcaca aacaatgtgc tgtcaatgca 19681 gctttctctc tactgtgctg cattgaaagc catctgtcaa aaatcttcct cgaattttgc 19741 tacccttaaa aagctgtttc taaaatatag aattaagact caatctcagc taattccatc 19801 caacgttcag tcgcctcatc aatttcttcc 
ttcagggttt ccacctgttc atataatttt 19861 tgtacttgag agtagttacc aggagaaaca ttcgccattg ctttctctgc ttctgctttt 19921 tcggcttcca tcttggcaat tttaccttcc aactgctcaa attctcgctt ttcccaatta 19981 gataacctac ggcgtttttt cgtttcggtt tctttggaag atgttgtctg taacgtctct 20041 gtattttttg gcttttcttt cgtgctattt atctgttgct gttgtgcttc ttctgcttgc 20101 ttataatcca gataaactga atagttacct ggatattgtc ggatattccc accttcttca 20161 aaggtaaaaa ttgtatctac ggtgcggtct agaaaatagc gatcgtgaga aacaacaata 20221 acacaaccgg aaaattcttc taaatattct tccagtacag ccaatgtctg tacatccaaa 20281 tcattcgtag gttcatctaa tattaaaaca ttcggtgcac tcatgagaac gcgcaacaga 20341 aataaccgcc gtttttcacc acccgaaagt ttatgaattg gggaatactg ttgattccca 20401 ggaaacagaa aacgctccaa catttgggaa gcagtaattc tcgttccatc ggcaattttg 20461 acaaattcac cttcttcttt aatgtaatca atcactcgtt gattttcgtt caacgctgat 20521 tgtaattctt ctgaatgctg gttaaaataa ccaatgtgaa tagtcgtacc aatttctaca 20581 cttcctgaat ctggctcaac acgttccgtg ataatatcta gtaacgtaga tttacccgcg 20641 ccattagtgc cgataatacc aatgcggtct tctggactaa actcgtaggt aaaatcctta 20701 attaaagtac gtccgtcata agctttcgat gtattattca gttcaataac ttttttgcca 20761 atgcgacgac taggtgtaga aatatcaact tttccctgag cttgtttaaa ctcagtttcc 20821 ttcagatcgt gaacgcgatc aattctcgct ttttgtttcg tactacgagc ttttggtcct 20881 cgttttagcc attccaactc acggcgcaag acaccttgat gtttgcgctg actactaata 20941 gcagattctt cagctagtgc tttcttttcc aagtaatatg aatagttgcc tgtgtatgtg 21001 taaatatctc ctcggtcaat ttcaatgata cgattggtga ctttatccaa aaagtagcga 21061 tcgtgagtga taagaaagag tgcgccacga taacggttca aataactttg taaccactca 21121 acagaattag catctagatg gtttgtcggt tcatccatca acaacaactc tggttctgac 21181 aacaaggctg tggctaaagc aatgcgctta cgataaccac cagataaagt cccaacaaga 21241 gcatgaaaat ctgtaattcc taactttgaa agaatgattt tagcattcgt ttccagttcc 21301 catgcaccaa gtgagtccat gcgctgcatt acagaagaaa gacgcgccat taattgttta 21361 tcttccgctg catgagctaa tttatcagaa atttcctcgt actcacgcac caattccatt 21421 tgttcgccac tgtcagcgaa aacttgctct aagactgtgc gattgtcgtc taaatctggc 21481 tgttgaggca 
agtatacaat tttagctcct gaatttgcta aaatttgacc actatcaatc 21541 ggttctagtc cagcaatcat tttgagtaat gttgatttac cagaaccgtt agtaccaatt 21601 aagccaactt tatcggtagc atccaagcta aagctggcat ctttcaatat ttccttgata 21661 ccaaaatctt ttttaacaga ttgtagggtg ataagactca tatcaattta gggtttagat 21721 ttgaaccgca gacaaacgct gagaaacgca gataatgatc tattttatca gcgtccgtct 21781 gcgtgtatct gcggttacaa ttttagagta cgctatgggt caaggtgagc aatacaaccc 21841 tactattaaa caaataaatc aaggggtaaa tgcccgtaca tttcgttaat tccactttga 21901 gagtaagact gacaagagac tagtacaccc cgatgatgtc ctgagtaaga atccagatag 21961 catacaccat ttttgccgtt taatttgatg taaactggac cttcaatctc aggtaaattg 22021 cttggtggtt catagccaaa agcatgggaa tatgttttca tcgctagtat tgcttcctct 22081 ggcgtatcag cacaaatacc taatatttgg tagtcagaaa tcttagctag caaaactaaa 22141 gcttgacgaa ttaaggtctt ttctgatggc ttgaggttag gggcaatgtc tatacagttg 22201 aatttattga gaatcttttt ggcatcttga acggtgagat tgctctgatt attgcttgac 22261 ataatttttt tattattcct tgatgttctc tatctgctta ttaagtaggg tacacaccct 22321 aaatcagggt tcaaattgta attaatcttg catattaccg aaattaaaga gaaatggtac 22381 actacttttt gtatcaccac tcataaccag cacttttggc atctgagcac ccccagtctt 22441 ccaagcttcg atcgcttctt tttgcagaac caactgtcca ccttgagctt ttaaagtttc 22501 agccaaaagt ctttgagctt ccgcttttcc tttggcgcga ttgatttctg cttgagcctc 22561 ttgttccgct tctcgggcta cgtatacagc tctttgtgct ctttgctctg caatttgttt 22621 ttcttcaact gctctggcga attctggaga aaaagttaag tcaactacgc tggtatctaa 22681 cacgattatt ccatatttat ctaagcgttg ccccaatgcc atatcaaaat cttctttcaa 22741 ttcatttctt tttgtaatcg actcctcaac tgttctttta gcagctgcaa ttttaaatga 22801 ttcctgagtt tggggtgcaa taattttgtt gacaatattt tctaatgtcc cttgtgtcct 22861 tctgatatca acaacttctg tgggatcaat acgaaagtta atagcaaatc ttgcagatag 22921 ggtttgcaaa tccttggttg aactctctgc tggaacctca aacttttgca cagtcaaatc 22981 atacacatct actctggaga tgaagggcgg tatcacgtga agaccctcga ttaatgctcc 23041 atctctggct ttacccaaga tactaagtac tcctgcttgc cctggattta taatggtaaa 23101 ggaattgagg ctcaaaatga caagtattgc taccacaatt gctgctacta aagtctgcca 
23161 attccccaat ggttgatttt tcaaatcgtt atctcctttg cattttcaga caactcttgt 23221 gaattgtgat atcttactgc tcgcctttgc acctttgtga cacctttgga agctaattgt 23281 ggtaatattt taaatccacg catctacaaa taactgtggt aatggtattg aaatctcatc 23341 ggggatcgag aagaaacaga cgcgtactag gcgatgctcc cagagggagc cgcccaaagg 23401 gcgaacgtca gacatcgtcc ccaaatcatc ccctcatgcg gctgctggac tacgggcacc 23461 agtatcgtaa acgaatttgg ttgtcaacta cttgttctat tcttaataag cttttcgact 23521 tggcaccacc agccttaatt gggatggcgg tggatgtggt agtaaaacag caagattcta 23581 tcatcgctca gtggggagtg aaagatattt ttgggcaatt ttttattctt tcgttcctga 23641 ctgttatcat ctggatacta gaatcggttt ttgaatacgc ttacggaagg ctttggcgta 23701 atttggcaca ggacatacag catgatttgc gtctggatgc ttatgaacat ttgcaggagt 23761 tagaactcgc atattttgaa gaacgcagta caggtggttt gatgtctatc ctcagtgatg 23821 atatcaacca actggaacgt tttttagatg tgggggcaaa tgatattctc caagttgtca 23881 caacagttgt cattataagt ggtgctttct ttattttggc tcctagtgtt gcctggatgg 23941 caatattgcc gataccattt attctttggg gttcgtttgc ttttcaaaaa ttactcgcac 24001 ctcgctatgc tgatgtcaga gaaaaggttg gtctactcaa ctcgcgtctg gtaaataatt 24061 tgagcggtat tactactatt aaaagtttta cctccgaaga ttacgaaatt tcacgtttca 24121 caaaagaaag tgaggcatat cgccgcagta acgctaaagc aattaaactt tccgctgcat 24181 ttgttcctct gatccgaatg ttgattttgg tgagttttac agcattattg ctgtttgctg 24241 gaatggcagc agcgaatggc aaaatatctg tcggtaccta cagtgtacta ttatttctcg 24301 tccagcggtt actctggcct ttaacaagat taggtgatac ttttgaccaa tatcaaagag 24361 caatggcttc cacaaatcga gtcatgaatt tgttggacac tcctattgct attcatacgg 24421 gagaaataca tttaccgact cagacagtac gtggtgaatt agaactaaca aatgtcactt 24481 ttgcttataa agatagacca tccattgtta cagatctgtc tttacacgtt ccagcaggta 24541 aaacaattgc cattgttggt tctactggtt ccggaaaaag cactttggtg aaacttctct 24601 tgcgattgta tgaagtgcaa tcaggaacaa tagcccttga tggcattgat atacaacagt 24661 tgaatctaca ggatttacgc cgttgcattg gtctagtcag ccaggatgtt ttcttgtttc 24721 atggtagtgt agcagaaaat attgcttatg gcagttttga tgctgtagag gatgaaatta 24781 ttatggcagc aaaaatcgct gaagcacacg aatttattgt 
gcgattgcct caaggatatg 24841 agacaattgt cggtgaacga ggacaaaagt tatctggtgg acagcgacaa cggattgcca 24901 ttgctcgcgc catcttaaag aatccaccaa ttttgatttt agatgaagcg acatcagcag 24961 tggataatga gacagaggca gccattcagc gatcgctaga acatattact gtcaatcgga 25021 caacaattgc gatcgcacac cgtctttcca ccatccgcaa tgcttcacgc atctatgtca 25081 tggaatacgg acaatttgtt gaatcaggaa cccatcaaga actattggac aaaaacggag 25141 tttatgcgag tctgtggcgc gtgcagtcag gtttgaggta aacttttcaa caatcaaaaa 25201 tctaaaattg gtatgatatt tgcttctacc caatctcaat caccaaacaa caagtggcgt 25261 cgccagttgg ataaattcgt taaagaaaat caacaggaat tagcagcgct gtcttggggt 25321 ttgtggttag aaaatgctga cgaaaagggt acagttggta tttatttgca accaacacca 25381 cactttgttt attgtcccag acaggctata gaacaactaa atagcaaagt tgagaacaga 25441 cttcaggaac ttgtagggat tgtggaacac cataaaccag aagtagaagt tctcatgatc 25501 gcaattagca aagatgaagt taagttaatt tactttgaac cacaacttgc accaccaact 25561 tgttatgaac gagttggtaa agatgtggat actttattag tatatctcga acagctcctg 25621 agcgagcagt ttaacgcgga acaatcagca cagtaatcaa attgcaacat tgaaaaaaat 25681 caaggtcata aaaacggtag tataggggaa aattccatat gaatttgtag tagctggaac 25741 ccagtatcga tggagcacac atgaatcggc taatcgagag tgtccacaaa gacctacttc 25801 ccaggttccg aattgaaagc gtagaccccc acgacccaat caaagtccac catgtcccca 25861 aaccctggca gcttctgggg gcggggaact acgctgcagt ggtttatcac ccagattacc 25921 ctgaattggt ggtaaaaatt tatgcgcctg gacgcccggg tttttttgag gaagtagagg 25981 tctatcgtcg cttggggtct catcctgctt tctctgagtg tctgtatgcc aatgattgct 26041 tcttaatctt gaagcggtta catggagtga ctctctacga ctgtatgcag cgtggtctac 26101 ggataccgaa gcaggtgata caagacattg accaagccct agattatgcc cgtaagcgcg 26161 gtctccaccc ccatgacgtc cacggacgca atgtgatgat gtacaaaggc aggggacttg 26221 ttgtggacgt ctcggacttc cttcatgagg aggcttgctc aaagtgggat gacctgaaga 26281 agggatacta ctggttgtat cgccccctcc tctcccccct tgggatacga ataccctact 26341 ttgtcctgga tgttgttcgc aggagttatc gcttcttggt caacctacct tcacgcttga 26401 agcagcttgg aagtggacgc agaaggcggg attgagcaat tcgctcattc gctcattcac 26461 tcattcaaaa atattatcta 
ttagcaattg cgtagagcgc actgaccttt gggcgacttt 26521 atggggagca tcgctgatta cccaattgac attaggtcac agtacgcgat acatcatgat 26581 tcaagggtta cataagtttt cgacaatgca aagttttcta aattttcggt tgcatcacta 26641 ctatgaggga tagacatgga tgctagtttg atcatatcca acattttgaa tccgccagtt 26701 ctgttcttct ttttagggat gactgccgtt tttgtcaagt ccgatctgga aattcctccc 26761 cctgcgccca aacttctttc gctttatctg ctgtttgcga ttgggtttaa ggggggagta 26821 gaactggtga aaagtggaat cactcaggaa gtggttttca cactgttggc agcagtgttg 26881 atggcttgtt ttgtcccgat ttacaccttt tttattttaa agctgaagct ggatgtttat 26941 gatgctgcag cgatcgccgc aacctacggc tctatcagtg ctgtcacctt catcaccgcg 27001 agtgcctttc taaatgaact tggcattcct tttgatggct acatgatagc agctcttgct 27061 ctgatggaat ctccagcgat cattgttggt ctcatcctgg taaatctatt tactgttgag 27121 caagggaagt cacgagaggt tgcttggtca gaagtattgc gagatgcgtt tctgaacagt 27181 tcagtctttc tactggtcgg tagcgtccta attggattct tgaccggaga acacggtggg 27241 aaggttttgg aaccctttac tcagggattg ttctatggaa ttctcacctt ctttttactg 27301 gatatgggat tagttgctgc caaaagaatt aaagacttgc aaaaaacagg atttttcctg 27361 atttcatttg caatactgat tccaatactg aatgcagcta ttgggttagc gatcgccaaa 27421 ttccttggta tatctcaagg aaatgcactc ttgttcgccg tactatgtgc cagcgcctct 27481 tacatcgctg ttcccgcagc catgcggctg actgttccgg aagcgaatcc cagcttgtac 27541 gtttctacgg ctttggcagt aacattccca ttcaatatta ttgtgggaat tccggtatat 27601 ctgtacgtaa ttaaattgtt ttggagccaa taatatgcac ttagttaaaa agatagaaat 27661 tatcgccaac tcgtttgagc ttggcaaaat tttagatcgt ttagataagt caggtgtaca 27721 cggtcatatt gtgatccgaa atgttgctgg caaaggattg cgaggaacag cggaagattt 27781 agacatgacg atgctcgata atgtttacat catcgcgttc tctacgcctg agcaaatcaa 27841 gcctgttgta gaaaacatca gacctctgct caataagttt ggaggcacct gttatatctc 27901 cgatgttatg gagatttcct ctgtaaaatg cgtcgcgtca atgtgagata atgatgaacc 27961 caagaaagcg tttcatggaa cgtcgtgatt tcttaaaatt aggagccact ggagcatttg 28021 gtttgatggc gactgctggc aacttgctct ggtctgttga acaagcacac gccgccgaat 28081 tgcctcccac agttccgaaa tctctcagtc ccgatctagc cctgcaaaag ttgatggcag 28141 
gaaatcagcg gtttgtccag catcaactcc gacatcccga tcaatccgag attcggttgc 28201 acgaagttgc tcaagctcaa catccatttg taaccatcct cagttgtgct gactcacggg 28261 tacctgcaga aattattttt gatcaaggca tcggagacat ctttgatgtt cgtattgccg 28321 gaaatattgc cacgcctgaa gccctcggta gtattgaata ttcggttgtc ttgctaggca 28381 cgcctttgct gatggtgctc ggtcatgaac gatgcggggc agttactgcc gccgtacaaa 28441 acgaagcgct gcttggtgat attggtagct ttgttaaggc aattaagcca gccataaaaa 28501 gagttaagga tcaatcgggc gacccggttg aaaatgctgt ggttgcaaat gtgcattatc 28561 aaattgaaca attgaagcga tcaacgcttt taactcagcg attagagtcc ggtcaattga 28621 aactcgtagg aggtcgttat gatttggata caggtgtagt aagtattata acttaatgaa 28681 ttagaagcta aagaaatata aaaacggtgg tgggataaag actgtttgat caatgactgc 28741 ttatgtgtgc aatgttccca cggctgactg atcggcggtt gctagatttc ctactcaaaa 28801 gactagtaat tgagatctaa atgggtgtaa aatcttcact tatggcgata tttagtcagg 28861 caaaggcaat gaccaaaaaa gtagtcagga gaagtcagcg ccgtgagcgg gtttcgcgac 28921 ttgcggcgac tgccgttcaa aagccaggag tcagaatgcc tcctgtgcga gtgaccgatg 28981 acttgggggt actcaccgca cccacactca tctggtaatg agcgcggtca acactcttcg 29041 tggcgcatcc taccgccccc actcattgca tgctgactcc accagatgag cgtaggcagg 29101 gggaacgctc ctgaattagg aattgccgaa ttcttcttat tgacccgtgg catgcacaaa 29161 cctattagtc cgatggattt gagattatgg cagtttacaa aaatcgagaa gcatttattc 29221 cccacagtcg cacagatatt atccaacttt gcctgcaaga tggtcagcta agtgctacta 29281 gtgctgaaaa gtttaaaaat ttctgccaaa tcctatccgc gtattaccac tttcgctttc 29341 acaaaacgca ggaaactatt aaagataact acgcaccctt cgaccccaat acaaatgttc 29401 aaccactaac tcaaccaacc tttgaccaat acaaggagat ggagttaaaa gtggttgatg 29461 catttaagca tattttagaa agagccaatt acattccttt gcctgcatcg gtagtacaag 29521 aatctgtggg caaggcatct ctgattgatt tgaaaactca ggtcgatttc gaggactttg 29581 agcttttctg ttgctattac caaggcgata tttctaagaa aatttcggtg aaaaagctct 29641 tcttttggga agaagaaaag ataattgatg tttttgaacg aattgttttg ctgattaagt 29701 tcaaagaaga ggcttacttt agtgccaaga aagtcaagat agaggaactg aaatttactc 29761 caggtaaaat gtatgtctac ttctacaaaa acattccaaa actagatatt 
gacttgctct 29821 ttccgaatgt tgccaccagt atgaactgga aagatcggct gctatttgga attcctgcga 29881 ttggggctgc aattcccctg ctgttaaaaa cattgccgaa tttgttattg ctcatcgctg 29941 caatattgct agtgctcaat gcgtcatctc tagtcgaatc actacatgta gaacaggaga 30001 aagtccgaaa tgtgttgccg attctagtag caacgctatc gctgggtatg ggcttaggcg 30061 gatttgcctt caagcaatat accaattaca aaaacaaaaa aatcaaattc caaaaagata 30121 ttactgacac gcttttcttc aaaaatttag ccaacaatgc tggtgtcttt cagacgctta 30181 tcgatattgc tgaagaagag gaatgtaagg aaattattct ggtctattac catctcctga 30241 ctagtccaac tccgctgaat ccagaacaat tagactctcg catcgaatct tggatggaga 30301 aaaagctggg caccaagatt aactttgaca tcaacggtcc gttgaacaat ctggaaaata 30361 ttcgtggcaa agcgcaacga aatgcctctg aagcagacag cgcacaccaa aagcctctac 30421 tatcttatga taaccaggga ttttgtcacg ttctcccctt agagaatgcc ttggcagtca 30481 tcgatgatgt ttgggataat gctttccaat ataacggcat agccttgtat ggtggttctg 30541 catccgctct caaatttcat tgactctaat tccccagtca gcgactcctc tgccaactat 30601 tattatcaag ggcgttgtcc tcaaagtggg gaactactga gactaccacg cactcctttg 30661 gtagaggcga tcgcctatag tttgatgcaa caccttggaa ctgatgagtg ttattcctgt 30721 gatatcaagt ccgcttaatt acttataatg cctgcttgac ctcaccccaa ccctctcctt 30781 attaaggaga gggcaagcga ggcgggtgag gttcctcgtt tttataagtg tttatccgga 30841 catgatatga aggcaagatg tacggggtat tactgatttc actgccgact ggtgaacaaa 30901 aaatactcaa agctttctcc ggtcttctga atggcgacag cctagttgag ggctgggtac 30961 cgcccattcc aggacgagat caagttgctt tacacgaagc cagcacacta tagcggttct 31021 catttgaatc acatacacca ccacgaataa tgtagagacg tagcatgcta cgtctctaca 31081 cccgtgtatt gcacgcaacc gagaagcgct atatctaaat tggatgctat caagcaggaa 31141 ctgataaccc tcaagcaact gccagaacgg ctgcagtaag tttctaaata attttctcaa 31201 cataagcatt caaaaacacc agctggttca ccatctcctt ggctgattta tcagcagcta 31261 atttgagcaa accgcgtaac tgctgtccaa ccgagtattg ttgttgcggt gcgagaacta 31321 tgccagcatg atttctttgt tcaactacaa agtcagcgtg caatttgcaa aaatccctaa 31381 tgttgaagct ataaataact cgtccttgct caactgccca aattagttgt tcttcatcgg 31441 gatatcctaa tctatcagca tctgtgactg 
taacgacatc caaatcggca tttcgcagtg 31501 cctggatcag cgcccctttc atcgtgtctt catccaagta taagcgaatt cggctcacga 31561 cagatttcct gctttatact cggcttcaag gcgacggcat tcctcattat aggcagcaaa 31621 ttcggcatcc agcgattctt tgttggcata atagtacgcc aaagcgctat aaatccctgc 31681 aagtgctaag tgtggtttat cttccagaat ttcttcaggc gtgacacctg ctttaatctc 31741 attcacgata tactgaactg taatacgggt tccggcgatg cggggacgac caccgcaagt 31801 ttccggcgta cgaacaatta gcgtaccaat gtcggtaatg gtctgcatag ctggtttgcc 31861 aaatcttgct ccgtcatcaa gaactctaac atttatcaca ggggtgttgg aacaacaatt 31921 gtggtctgat ctgatcgcac accgtttact ttaataaaac tgagaatatt ctacccacca 31981 aataaacgcc cccaaaaacc cttcttttta ttcttataat tctctaagtt ttcactaaat 32041 agcgtactac taggatcgag gctaacagct atttccagtt gtgcgatcgc ttcatctagt 32101 ttgccttgat ttttcagcgc aacccccagg ttattgtgag cagttgcata atttgggtcg 32161 atttgcaagg ctttacggta agaggcgatc gcttcatcga atttgccttg atcgtacagc 32221 gctaacccca ggttatagtg agcatatgcg taatttgggt cgattttcaa cgctttttgg 32281 taagagatga tcgccttatt gagtttgcct tgcttgcaca gcgcattccc caggttatag 32341 tgagcaactg cattatttgg gtcgatttgc aacgcttttt gccaagaggc gatcgctcca 32401 tctagtttgc cttgattgta cagcgcattc cccagattat tgtgagcagt tgcataattt 32461 gggtcgattt gcaaggcttt acggtaagag gcgatcgctt catcgaattt gccttgat // LOCUS NODE_852_length_32001_cov_4.84702332001 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 32001) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 32001) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..32001 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 573..884 /locus_tag="DP116_07375" CDS 573..884 /locus_tag="DP116_07375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408115.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07375" /translation="MLHRKIYQLCCDGREVCVFLRDQQRWIERARIIDIEGDLVTLRY ETEEEDEVCSWEEMVRLESIGAVTQKLASVPRGNVEPLLTEDCPEAERIRNRFTDSNP D" gene complement(1013..2686) /locus_tag="DP116_07380" CDS complement(1013..2686) /locus_tag="DP116_07380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315508.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_07380" /translation="MSDRLSVSSRFDAGETASELECLPFGVHHDDEGVCLLVRMGPHR ILLDCGLADVSTLVQELKKSARRGSSPEPADFVLVSHAHPDHARGLLALHKAFPLLPI YASEVTSKLLPLNWLEQPPQEVPSFCQALPLRSPVEFKDGLVAEIIPGGHLPGAVAIL LTYTTKERSYRLLYTGDFFLSNSRLVEGLRLEELRGIELDVLIIEGSYGTSRHPHRRT QENQLAERINRAIAERCSVLLPTPALGLGQELLMLLRSHHYFTGRDLDIWVDGSVAVG CDAYLELLSHLPPSVQNFARHQPLFWDERIRPRVRRMQPEQRPDVSNPCIILTDSTAD LNEYWQNGTGYWLTLFPEKTNIKINDQYSRVTTVETYLLAQHSDGPGTTQLIHNLRPQ HVIFVHGSPAYLADLTSLEELQNRYHLHSPAAGTLVELPIGETFLQPAAPETNYEGEL TELGTVVTITLPDAITADPRWRHFADTGLIEAKWQGEDLVLRGLSQRELLNQNSDRFT WSDIDCCGTCRHQRGQRCWNPASPLYNFKVTLEGYCPAFERNIEKNPES" gene complement(3193..3963) /locus_tag="DP116_07385" CDS complement(3193..3963) /locus_tag="DP116_07385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Zn-dependent hydrolase" /protein_id="PRJNA477356:DP116_07385" /translation="MKRRQLLGYAGAGLATTLVTSLGSSLQANAQSGGSLSIQWLGHT SFLFTGGGVKILTNPFRTVGCTAGYRAPKVAANLVLISSQLLDEGAVEDLVGNPKLIY EAGVYESNGIKFQGIAINHDRRGGRQFGVNTAWLWKQGGISILHLGGAAAPISIEQKI LMGRPDVALVPVGGGAKAYNPEEARLAIQTLNPKIVIPTHYRTQAADAANCDISPLDD FLKVMNGMTVRPSSGDTITVSSGKLPDKSVIQVLGYKF" gene complement(3960..4568) /locus_tag="DP116_07390" CDS complement(3960..4568) /locus_tag="DP116_07390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315489.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aminodeoxychorismate/anthranilate synthase component II" /protein_id="PRJNA477356:DP116_07390" /translation="MIIIIDNYDSFTYNLVQYLGELAAEFPVAAEIKVFRNDKITVEE IRDLKPNGVVISPGPGRPEDAGISLDVISLLGPSLPILGVCLGHQSIGQVFGGKIVSA PELMHGKTSQVYHTGVGIFNKIETPMTATRYHSLVISRDTCPNVLEITAWLEDGTIMG VRHRNYPHIEGVQFHPESILTTSGKQLLQNFLEQLQSGEMNP" gene complement(4677..5156) /locus_tag="DP116_07395" CDS complement(4677..5156) /locus_tag="DP116_07395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315490.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="diacylglycerol kinase family protein" /protein_id="PRJNA477356:DP116_07395" /translation="MSQQVSSPPKASLPPVPDCIETVVSKQRQLSWQVASNLFISFKY AWCGISYAFQTQRNFRIHVSVGALAIGLSIFLHLKSVEIAVIGLTIGLVLALELLNTA IESIVDLTVKQTYHELAKIAKDCAAGAVLVSALVAVTVAATLLLPPLLALVLGSKSS" gene complement(5298..5825) /locus_tag="DP116_07400" CDS complement(5298..5825) /locus_tag="DP116_07400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196709.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="rRNA maturation RNase YbeY" /protein_id="PRJNA477356:DP116_07400" /translation="MQVELYVQDCYDESSEPEALCSGEKDAHITVETWKDWFECWLEI LQPSISPASAYEVGLRLTDNSEILTLNQQYRHQNKPTDVLSFAALEVDTPELAEIDAQ EPLYLGDIVISVETAQQQAQQQEHPLPTELAWLAAHGLLHLLGWDHPDEESLSQMLKQ QVILLKAIGIDTDFE" gene complement(5866..6021) /locus_tag="DP116_07405" CDS complement(5866..6021) /locus_tag="DP116_07405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873544.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3285 domain-containing protein" /protein_id="PRJNA477356:DP116_07405" /translation="MSDSRSAEPKPSYVKLAMRNMVQKRLTSLKHFVLTTVGLLAVLV GLAYLTR" gene complement(6177..7296) /gene="prfB" /locus_tag="DP116_07410" CDS complement(join(6177..7223,7225..7296)) /gene="prfB" /locus_tag="DP116_07410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216633.1" /ribosomal_slippage /note="programmed frameshift; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptide chain release factor 2" /protein_id="PRJNA477356:DP116_07410" /translation="MEVLELKREIETLSDRLGKTQDYLDIPALSAKIQDLEQIAAQPE LWNDQTQAQKTLQELNDLKAHLQQYHQWQTSLEDTKAVLELLELDTDEGLLQEAESNV DKLKRELDQWELLQLLSGPYDEKGAVLTINAGAGGTDAQDWAEMLLRMYTRWGEKYGY KVSLSELSEGDEAGIKSATVEITGRYAYGYLRSETGTHRLVRISPFNANGKRQTSFAG VEVMPEIDNNIQLEIPEKDLEVTTTRSGGKGGQNVNKVETAVRVVHIPTGIAVRCTEE RSQLQNKEKALARLKAKLLVIAQEQHAKEVAEIRGDMVEASWGNQIRNYVFHPYQMVK DLRTAEETTAIADVMNGEIDMFIQAYLRQENQLVEASA" gene complement(7622..7990) /locus_tag="DP116_07415" CDS complement(7622..7990) /locus_tag="DP116_07415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311500.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phospholipid-binding protein" /protein_id="PRJNA477356:DP116_07415" /translation="MGWLKRLFGMEKPQEAQVNPEPQPVEQAQQTAAPAAATQQVPPE RVGLNGEYDQSGLAKRVALAFDQDQQLDDIETLWVAQTGSTVVLKGKVPSQDILNKMV SVARSVNGATAVDSNQVTVG" gene 8142..8414 /locus_tag="DP116_07420" CDS 8142..8414 /locus_tag="DP116_07420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008048943.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07420" /translation="MNRRGAERSQRVAGVPPAVAVTPDGKYVISGSEDNTLKVWNLET GEVIASFTADSALMCCAVAPDGVTIAAGEVTGRIHFLRLEGMEAKP" assembly_gap 8450..8459 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 8910..9995 /locus_tag="DP116_07425" CDS 8910..9995 /locus_tag="DP116_07425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314536.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aliphatic sulfonate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_07425" /translation="MNLFNLLKPFLGITSFFRQSIKLLKPDSAKSFAVLFTFGLCLSL SFSAFSPSYAKSSTQSSDAKPIANTSIAAVNVVRMGYQKSAVLALVKQQGTLEKQLSP SGVSVKWLEFPSGPPMMEALNANSIDFAAVGEGPPVFAQSAGVPLVYVGNSSASPEGL AILVRNNSPIKTLADLKGKKVAVAKGSSAHFLLVQALSSTGLQYSDIQPTYLSPADAR AVFEQDKIDAWGIWDPFLAAAQKSGGARILRDAKGLASWREFYVTSRSFADANPQLVK QILESVDKVGKWAKKNHRQVAEVLSPQMGIDVASLELAETRRKRYDVLPFSKEVVAEQ QKVADTFLRQKLIPKEIKVTDAVWKPK" gene 10718..11851 /gene="ssuD" /locus_tag="DP116_07430" CDS 10718..11851 /gene="ssuD" /locus_tag="DP116_07430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308644.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkanesulfonate monooxygenase, FMNH(2)-dependent" /protein_id="PRJNA477356:DP116_07430" /translation="MELLWFIPTHGDGRYLATAVGGRECNFYYFQQIAQAVDNLGFAG ALLPTGRSCEDAWVLASSLIAVTRQMRFMIAIRPGLMSPGLSARMAATFDRVSNGRLQ IHVVTGGDPVELAGDGVHLSHDARYELTDEFLTVWKDISSGLETNFSGKYFQIKGGKL LFTSVQKPHPPLWFGGSSFVAQQIAAKHVDVYLTWGEPPQQVAEKIASVRKLAAEQGR TLRFGIRLHVIVRDTETQAWDAANDLIKYVSDEAIANAQKIFARMDSEGQRRMTQLHN GDRKALEISPNLWTGVGLVRGGAGTALVGDSDTVIARMEEYAKLGIDVFILSGYPHLE EAYRVAELLFPRLPLQKPSNTNQTQMFSPVGELIGFENFPKQQ" gene 11876..12658 /locus_tag="DP116_07435" CDS 11876..12658 /locus_tag="DP116_07435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314538.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aliphatic sulfonate ABC transporter permease SsuC" /protein_id="PRJNA477356:DP116_07435" /translation="MKIQFLKRYFNQLAPWCVPLLLVVIWQLLVQFGFVSERVLPTPT NVLLAGIRLAKSGELFFHLGISAQRAVLGFLIGGSIAFVLGLLNGIIPIAEKLLDTPI QMLRNIPHLAMIPLVILWFGVGEEGRIFLVAIGVSFPIYINTFHGVRTVDPNLIEMGK VYGLKPWSLFWEIIFPGALPSILIGVRYALGVMWLTLIVAETISYDSGIGYMAMNARE FMQTDVVVLSILLYALLGKLADVFAKFLENKLLSWHPSYQNL" gene 12685..13485 /gene="ssuB" /locus_tag="DP116_07440" CDS 12685..13485 /gene="ssuB" /locus_tag="DP116_07440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314539.1" /note="part of the ABC type transport system SsuABC for aliphatic sulfonates; with SsuA being the periplasmic substrate-binding subunit, SsuB the ATP-binding subunit and SsuC the permease; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aliphatic sulfonate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_07440" /translation="MKPKRVGVDLKVIGLSKTFGNIRVLQNLDLEIAKGEFVAIVGRS GCGKTTLLKLLAGLEPPSNGSILLDGKQLRKLNPETRVMFQDARLLMWKRVIENVGLG LKEHWRQKAHWALEQVGLAERAFEWPSVLSGGQRQRVALARALVSEPRLLLLDEPLGA LDALTRIEMQRLIEDIWQQQQFTALLITHDVEEAVTLADRIILIERGEVAMNLSVPLP RPRQRSDAIFATLVEKILERVLSTKPPSQKSTKGESTIGQIATYNILP" gene 13493..14647 /locus_tag="DP116_07445" CDS 13493..14647 /locus_tag="DP116_07445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869507.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07445" /translation="MNTDESNNKGNIICGRCAYDANPRGATHCQKCGTPLVIASVPNN DPSPGSDSTLLVGGISVVATVLFFAIGGYFFWQQVRVASTPSNLNNSADNISSDIRLY NSMKEVPKVPEGTFNYGGAVVFAPLRNSLHKAINQVYPKFGLRFTEPKYNNPGQNTGI TMLLDGELSFAQSSKPLEDTHYSKAKQRNFSLQQVAIGIDGFLLFTHPNVSIRGLAVE QVKDIFKGKITNWKQLGGPDLAITAFAFNPKFGSSLNILLGPELDQLSPKVQFMRDYT DGVRKVSSIPGAIGIGSTGAILGQQSIRPLALAAHNSNNYVQPFTDDGKRVNAAALRD GTYPLTRRLFVVIRRDGTIDEAAGVAYANLLLSKEGQQYIEKAGFVPIRN" gene complement(14739..16333) /locus_tag="DP116_07450" /pseudo CDS complement(14739..16333) /locus_tag="DP116_07450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015081546.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="RNA-dependent DNA polymerase" gene complement(16882..17772) /locus_tag="DP116_07455" CDS complement(16882..17772) /locus_tag="DP116_07455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314542.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-sensitive inward rectifier potassium channel 10" /protein_id="PRJNA477356:DP116_07455" /translation="MKKTRRKIPKTRIVNRDGGSSVVRIGVSNNRWRDPYHLLLTLCW YKTLGLVSLSYVLANTLFALAYLAGGDGIENARPGNFFDAFFFSVQTMASIGYGAMYP KTTYTNILVTIESLLGLIGLAMASGLMFARFSLPQARVLFSDVAIITPYNSMPTLMLR VANERQNWILEAQVRMSLVRTEISKEGDVMRRFYDMSLLRSHSPLFALTWTIMHPIDE SSPLYGVSAEEMVEDEMEVIVTFTGLDETVCQTIHARHSYIAKEIVWNMRFVDILSKT RDGRRSIDYSRFHDVMPIEN" gene complement(17894..>18367) /locus_tag="DP116_07460" CDS complement(17894..>18367) /locus_tag="DP116_07460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015174068.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MOSC domain-containing protein" /protein_id="PRJNA477356:DP116_07460" /translation="ATFFTGYLAGIYPSQAARHPNRAPLQLVGEFGKTRYPDREAVHI SLVSQATLNHLSEIAGRQIDVRRFRPNIVLDGVPAWGEFDWVGKEMQLGTARIAITAR INRCLNIEVNPETGERDISLLTLLQKNFQHTQTGVLAQVITNGTVAIGDTLRIAE" assembly_gap 18368..18377 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(18519..19487) /locus_tag="DP116_07465" CDS complement(18519..19487) /locus_tag="DP116_07465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861805.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07465" /translation="MNFSAALPTFVITLREGVEAALVVGIVLALLKKAKQSRLNSWVY AGVGVGIIVSALIGFLFTLAVQALSAINPQYSTVVEPMMEGVFSVLAIVMLSWMLIWM TKQARFMKAQVEGAVTDALKQNKVAGWGVFSLVFIAVVREGFETVLFVAANIQQGLVP GFGAVAGIAVAALIGVLLFRWGVKINIRQFFQVMGVFLVLIVAGLVVTALQKFDEGFA TLALSNRASEGLCFYYERFTRIRSCILGPMVWNTSNILPDEKFPGVILNALFGYKQYL YVVQAVGYVLFLVTVGGLYLRSIIGTGSGRPKKQSAAQKSMGSLKD" gene complement(19571..20008) /locus_tag="DP116_07470" CDS complement(19571..20008) /locus_tag="DP116_07470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493060.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bacterioferritin" /protein_id="PRJNA477356:DP116_07470" /translation="MQELDQKQTINLLNEIIECELSGVVRYTQYSLMVTGPYRITIVD FLKEQASESLLHAQKVGEILTGLEGRPSLQILPADETDKDSVKDILAESLAHEEKALR LYKNLLETVTNSSIYLEQFARNMIGEEEMHNIELKKMLRDFTE" gene complement(20371..21414) /locus_tag="DP116_07475" CDS complement(20371..21414) /locus_tag="DP116_07475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214987.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="helix-hairpin-helix domain-containing protein" /protein_id="PRJNA477356:DP116_07475" /translation="MRKFRYICLTIASAIIFALTSCNGTQTAENPTAPAANSSPQATE AVSHSGHSSKKKININNAILSELDKFEGQLGVPALSNKIQASRPYSSPEDLVSKKVIT QEQFNQIKDQVTTQEVVLTGEPKDVDYMTKLGLMKGHLLVAKELLDQNQPKKAEPHIG HPVEEIYVDVEDQLNERKVKEFKTSLVSLQDFVKANPKSPKVKTDFTSSMQAVDGAVT ALPADQRSQPRFALQVINGLLDAANSEYGAAIADGKISAEIEYQDSRGFVNYADDLYK GISSKVAKDHPQEHKAIETSMAELIKVWPSAIAPAKPVKTPGDVTKLVKTIEENSQKI TDSSSTQASAIVF" gene complement(22023..23195) /locus_tag="DP116_07480" CDS complement(22023..23195) /locus_tag="DP116_07480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873626.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07480" /translation="MTLISREEIKTLLEQPRQNSVSIYMPTQLAGPEVRQNSIRFKNL IKEAEARLIDAGLEQNDVIELLAKSDQLDDSNFWEQNVDQGLAVFISRDIFRYYTLPL TFDELVVVTDRFHIKPLLRILNGDGRFYLLALSQKDVRFFEGTRYSIKELKVENMPKS LDEALNYDDTAQQGQFRIATSKGGTANASVQPGSFHGQGSPDRDQHQKDILQFFQVVN HALEEKLREQTAPLLLAGVEYLMPLYTEANTYQHLIEEGITGNQEILSAQELHERAWP IVEPHYHKSQQEVVERFNELFGGNTGKASNELKEIIPAAYYQRIDSLLVATSQQQWGL FDPTSETVYLHQEEETGDEDLLDFAAAHTLLNGGTVYAVEAEQVPFSTPVAAIYRY" gene complement(23407..24078) /locus_tag="DP116_07485" CDS complement(23407..24078) /locus_tag="DP116_07485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1361 domain-containing protein" /protein_id="PRJNA477356:DP116_07485" /translation="MKAELTELIAKVMQVLRININWMTWNLFLAFIPLALSVWLFRNG RGRSWVWWLGFIVFYSFLPNAPYLLTDIIHLIDDIRTIQSVWIITLILIPLYVLVILA GFEAYVISLINLGYYLHRIGKSKWILWVELITHALCAVGVYWGRFLRFNSWDFVTQPD ALLTRGIEDILGKQPLVIIAITFGILVALHWIMKRVTLGFFSQRGKNLTNNSKHMNSN TPNVG" gene 24453..25493 /locus_tag="DP116_07490" CDS 24453..25493 /locus_tag="DP116_07490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012809693.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_07490" /translation="MERRALGHQGLEVSAIGLGCMGMSEFYGPRDDQESIATIRQAID MGVNLIDTADFYGVGHNEELVRRAIEGRREQVVLSVKFGALRSYDGGWIGFDGRPVAI QNAIAHSLRRLNVDYIDLYFPSRVDPNVPIEETVGALAELVKQGKVKYIGLSEAAPQT LRRAHAIHPISAVQIEYSLWSREIEKELLPTLRELGIGLVGYSPLSRGLLSGKIDETS LQQSGDTRSRMPRYQGDNLTHNLGLVEKLKAIATSKNCTPAQLAIAWVIAQGNDLVSI VGTKRRKYLEENLGAVSSALGGFPDLKQLRTRRAAITLTKGDLEELEHSFPVNVVAGE RYPEAMMAYVYA" gene 25562..26116 /locus_tag="DP116_07495" CDS 25562..26116 /locus_tag="DP116_07495" /inference="COORDINATES: protein motif:HMM:PF00440.21" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_07495" /translation="MTAEQQLRRHGLSKMTVVDIARAAGMSHSNVYRFFPTKAAIFDG ITQRWLSQAEKHLALVVAREVSAALKLEDFVIELHRVKRQKFLTDPEIFATYYAVAKE CNDVVQKHLTYIHSLLFQIINEGIQSGEFQVTDAEAAASVVRSATLRFHHPVLVIEDR ERDVENEARAVMRLLIAGLKTRVL" gene complement(26177..26653) /locus_tag="DP116_07500" CDS complement(26177..26653) /locus_tag="DP116_07500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873623.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07500" /translation="MAVWRRYTVIIPISLSIALLVTACSESKASQCERLVSLINKGSD LIDKNKGQQVTTSLQLSKDLEAVTKEIKELNLKDPKLQQFQSSFVKVFETFSQSIATA GKALGSAKTAQASSEGRVKIQKARGDIDTALTAAADAAKQSDALASEVNKYCSESK" gene 27007..28080 /locus_tag="DP116_07505" CDS 27007..28080 /locus_tag="DP116_07505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112042.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="zinc ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_07505" /translation="MKKISAMVLALSFSVVACSTPGNQTVSPTPEATATPAQARDQKA QNKLLVVTTVAPLTNIISNIAGERVQVTGIIPEGTDSHTFEPRPSDADLLSKAKLIIV NGLHLESPTEKLAKASKPKETNIYELGDNTITQSQWIYDFSFPKEKGDPNPHLWVNPK YAEAYAKLAAQQLTQLDPAGKDYYATNLKNYLQRLDALDKANRAVVESIPAKNRKLLT YHDSWAYWAREYGFQVIGAIQPSDFKEPSAQDVAKLITQIRQVGVPAIFGSEVYPSKV QEQIAREAKVKTANTADDELPGKGSANAMENTNPEHTYIGMMVNNMRIIAENLGGNPE LVKNVNTANVVGPTANETKTSQK" gene 28169..29005 /locus_tag="DP116_07510" CDS 28169..29005 /locus_tag="DP116_07510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017325995.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_07510" /translation="MEQPLLEVKNLTCGYQNKPVFTQVNLSLYRGQLSGLVGPSGSGK STLMKAILGLIHPWAGEIWFRGKRLQPGTSPPRVGYVPQVETVDWNFPVTAEEVVMMG RYQKQRMLPWASRGDRTAARELLNRVGVAHIARQPIGELSGGQQQRVFLARALVGEPE IVLLDEPTSSSDLQVQHELLHLLADLNQQGLTILLSTHDLNSVATHLPWVVCFNHGLI CQGQPLDIFTPANLERTFGAQMVVFHQEDRILIASGGTSLRHQMQRNLPPSLLQGKSK SA" gene 29021..29875 /locus_tag="DP116_07515" CDS 29021..29875 /locus_tag="DP116_07515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="metal ABC transporter permease" /protein_id="PRJNA477356:DP116_07515" /translation="MNFILEPFRYEFFSRAILVGMMAGLLCGMMGVYITTRRMSYIAH GLSHAILGGAVLSYVLGLNFYIGSGIWGFGSAVLIQYLTGRKIYSDAAIGIVTTASFA LGVAVISSYRKFSQNFEAALFGNVLGVSPTDLWVVTGVTVVLLSLVFCFYRPLLFWCF DREVAQVHGVPVFAMDTLFALMLATMLVATLNVLGVTLIISAVVIPASIARLLSNHFG YMMIFSGFLGAAIAFIGIYLSYYFDIASGASVVLLSTMIFACVLLWRSLQYRRKRYLA PLASQHYE" gene 30015..30821 /locus_tag="DP116_07520" CDS 30015..30821 /locus_tag="DP116_07520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07520" /translation="MTQNDRYTLATGELGAYRLSILNIIHRPYTEFLFRRVGLEQGMA VADIGCGTGNVSNWVAQQVGSSGSVVGVDLSAEQVEQARRNAKTLSLSNVTFGLGSAY DTGLPQDSFDLVYCRFLLMHLTRPIDALLQMRSLLKPGGLLVCEEADFSTAFCEPSNP AYNRCFELFLALSHARGQHFSMGIMLHRIFQDSGFVAPEISLAQAVVVRGETKRLVDL SLLEANDALIEAGLTTQEEINQKIAQIKALAADETTAFGIPRVTQVWARK" gene 31230..31961 /locus_tag="DP116_07525" CDS 31230..31961 /locus_tag="DP116_07525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007303485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_07525" /translation="MYSLKLELKLNNQEKSKLAGCAGFARFVYNFGLSMLTSSWDFEG IKAGDSKRLTAIEKVFTNYVKTNADYTWMKQYPSAIYSSALRNLAKAVERWRKGDSGF PQMKSKRRGDSFTVLKKAGIYPAKGESMLPFTNKQVLQPGKRITIPGLGEFRLKRPIP FLCSSQSFTISRTANKWYVSFSLDVEKVPPLFHSVESVGIDLGVKTFATLSDGSTIVA PSSLKKAKTKLNKLQWRNRRKRAIR" BASE COUNT 8838 a 6895 c 6951 g 9297 t 20 others ORIGIN 1 cccctacaca cccctacacc cctagttccc gtcaagcaag ccttgtgtgg ttaaggctga 61 cagacatatc aaatcaagca acgaatttaa agttaaggtg ttgttgttta aaaccatcaa 121 ggttacaaga gaataagcta agatcaaata cataggggta aaagtaaagc aaagtccccg 181 gacttcccag cgatgtcaag gactataagc aggcaattca aacaaccgca actaacgttt 241 gttttggagt ataaaatata acacaatggt tgtcacaact caccaattca tcttatcttt 301 ggcaggaaaa atcccgaaat ctgtgacttc tatagaagta aacatttttt gaatgacccg 361 acaaaaatga agtttagctg gtgcagtaac cgtgaacttt atggctcaaa aagctcacaa 421 agcagacaac gacaaacaag tctttggctt gagtcaagaa agataccttg actggggtcg 481 atccacgaac catgatgggg tatcctatgc tggggataaa gcccagatgt tgttgataga 541 tataactaag tataaaaagg cagaagcagt agatgctaca ccgcaagatt tatcaattgt 601 gttgcgacgg gcgagaagtc tgtgtgttct tgcgggacca gcaacgctgg attgagcgtg 661 cccgcatcat cgatatagag ggagatttag tcaccctacg ttatgaaaca gaagaagaag 721 acgaagtttg ttcttgggag gaaatggttc gccttgaaag cattggcgct gtaacgcaaa 781 aattagcttc agtgccacgc ggcaatgttg aacctctttt aactgaagac tgtccagaag 841 ctgagcgtat ccgtaatcgt ttcactgatt caaatccaga ctaatcagtg tttatgagct 901 agcgttgcag gagaatcact ctgttgtaag cgacacgtaa ctgctgagta aaatttcggg 961 aattggcaaa ccatcactca gcacttatct gtcgaaactt aggactgagc actcaagact 1021 caggattctt ttcgatgttg cgttcaaatg caggacagta accctcaaga gtgactttga 1081 aattatacaa tggggacgca ggattccaac acctctgtcc cctttgatgt cggcaagtac 1141 cgcagcaatc aatatcagac caggtaaagc gatcgctatt ttggttcagc agttctcttt 1201 gggatagtcc tcgtaatacc agatcttcac cttgccactt ggcttcaatt aaacctgtgt 1261 cggcaaagtg tcgccaccgg gggtcagcag tgattgcatc gggtagggtg attgtgacta 1321 cagttcctaa ttccgtcagt tcgccttcat 
aatttgtttc tggggcagct ggttgtagaa 1381 atgtttcacc tataggcagt tccacaagtg tgccagccgc aggagaatgc agatggtaac 1441 gattttgcaa ctcttctaag cttgttaggt ctgccaagta agcaggagaa ccgtggacga 1501 agatgacgtg ctgcggtcgc aaattatgaa tgagttgggt tgtcccagga ccgtcactat 1561 gctgagcaag aagataggtt tcgacggttg tgacccgtga gtattggtcg ttaattttta 1621 tattggtttt ctctgggaaa agggtgagcc aatagccagt accattttgc cagtattcgt 1681 tgaggtcagc cgtggagtca gtcaggataa tacaagggtt gctcacatca gggcgttgtt 1741 ctggctgcat ccgccgtaca cgtgggcgta tccgttcatc ccaaaatagg ggttgatggc 1801 gggcaaagtt ttgtaccgac gggggaagat gggatagcaa ttccaggtaa gcgtcacatc 1861 caacggcaac actaccatca acccagatat ctaaatctct tcctgtaaaa tagtgatgac 1921 tgcgtaaaag catcagcagt tcttgaccca gtcccaaagc tggtgtggga agtagcacag 1981 agcaacgctc tgcgatcgct cggttaattc tctcagcaag ttgattttct tgagtccggc 2041 ggtggggatg acgggaagtt ccatagcttc cttcaataat cagcacatcc aactctattc 2101 cccgcaattc ctccaaacgc aaaccttcta caagccgcga attagacaaa aaaaagtcgc 2161 ctgtatacag tagcctgtaa gagcgctctt ttgtggtata cgtgagtaga attgccacag 2221 cccctggtag gtgaccacca ggaattattt ctgctaccag accatctttg aattctactg 2281 gcgatcgcaa tggcagcgct tgacaaaatg atggtacttc ttggggaggt tgctccaacc 2341 aattgagagg cagcaacttg ctggtcactt cgctggcgta tattggtagc agcggaaatg 2401 ccttatgcag agctaacaac cctctggcat gatctggatg ggcgtgactc accaagacga 2461 aatctgcggg ttcgggcgaa cttcctcgac gtgccgattt tttcaattcc tgcaccaagg 2521 ttgatacatc cgccaaacca cagtcaagca aaatgcgatg tggtcccatc cgtactaata 2581 gacaaacacc ttcatcatcg tgatgaaccc caaaaggcaa acattctaat tcactcgccg 2641 tctcccccgc atcaaatctg gaggataccg acaggcgatc gctcatgctc ttttcccctc 2701 taagctcatt tttatggaca tttgcccaat gggtaataaa aattcttcaa ctttacggac 2761 tgggggcggt gctgattgct tttgtaactt cagtgggcag agccattgcc atgatagtta 2821 tctgaatcgt agaatccatt tttcgtacca aaaaacaagc acgaaactgt gaagacaatt 2881 gttagtgcag ctaaaaccag ctttacatcc atatgtttgt tgccctttac gtaaagactt 2941 tctgatttta atagaatgat tccttgtctc gtagaaatcc ccagaaaatc tttatatact 3001 attgtttctg aaaattatga tataagtaaa tatactcttc 
acaaatatta ttataagact 3061 tttgagtcaa gctgtgtatg cctttgttat tgccccaaag ttcatgagga cagtgctgag 3121 ctagagtccc gcaactaact gccatcccaa tacctcctga tcgtgtcggt gtgggcttat 3181 tttagactga agctaaaact tataacccaa aacttgaatc acactcttgt cgggcaattt 3241 accagaactg acagtaattg tgtcaccact gctaggacgt actgtcattc cattcatcac 3301 ttttaagaag tcatccaaag gtgaaatatc gcagttcgca gcatctgcag cttgtgtacg 3361 gtaatgggtg ggaataacta tcttgggatt gagggtttga attgcaagtc tggcttcttc 3421 aggattatag gcttttgcac ctcctccgac aggtactaac gccacgtcag gacgccccat 3481 aaggattttt tgttcaatgg aaatgggtgc tgcagctcct cctagatgaa ggatactaat 3541 tcccccttgc ttccaaagcc aagccgtatt tacaccaaat tgcctaccgc ctctacggtc 3601 atggtttatg gcaatgccct ggaacttaat gccattagac tcgtaaactc ccgcttcata 3661 aataagtttt ggatttccaa caaggtcttc tacagcacct tcatctagca gttgactgct 3721 aatcagaact aaattagccg caacctttgg cgcacgataa ccagcggtac agccaactgt 3781 ccgaaaggga ttcgtgagaa ttttgactcc accaccagta aatagaaagc tagtatgacc 3841 tagccactga attgataatg aaccgccaga ttgggcatta gcttgaaggg aggaacccaa 3901 actcgtaact aatgttgttg ctaatcccgc ccctgcatag cccaataact gtcgtcgttt 3961 catggattca tttctccaga ctgcaattgt tccagaaagt tttgcagtag ttgcttccct 4021 gaagttgtca ggatactttc tggatgaaac tggacgccct caatatgagg atagttccgg 4081 tgtcgcaccc ccataattgt gccatcctca agccaagcag tgatttctag cacatttggg 4141 caagtgtcgc gcgaaatcac taaactatga tacctggttg cggtcatcgg agtttctatt 4201 ttgttgaaga ttcctacgcc tgtgtgatat acctgagaag ttttgccgtg catcaactct 4261 ggcgcagaaa cgattttacc accaaacact tgaccgatac tttgatgccc taagcaaacc 4321 cccaagattg gtaagcttgg tccaagcaaa gaaataacat ctaaggaaat tcctgcatct 4381 tctgggcgac ctggtcctgg ggaaatcacg actccatttg gcttgagatc acgaatttcc 4441 tctacggtaa ttttgtcgtt acgaaaaact ttaatctcag cggctactgg gaactcagct 4501 gccaattctc ccagatactg cactaagtta taggtaaaac tatcgtagtt atcaataatt 4561 ataatcaaaa cttatttttt ctggttttta tcaatcataa ttatgtgatt ggtagaaacc 4621 agaagatttt aatgtttgtt ttactgatct tgaaaaaaga cacttctgcc cagcacttag 4681 gatgatttcg atcccaacac taaagctaac aggggaggaa gtaagagtgt 
agcagctacg 4741 gtcactgcta ccaaagcaga gacaagcaca gcaccagcag cgcagtcttt tgcaattttt 4801 gccaactcat ggtaagtctg cttaactgtt aagtccacaa ttgactcgat cgccgtattt 4861 agtaactcca acgccaaaac taaaccgatc gtcagtccaa tgacagctat ttccacagat 4921 ttaagatgca aaaaaatgct caagccaatt gccaaagcac caacactaac atgaatgcga 4981 aaattacgtt gtgtttgaaa agcgtagcta attccgcacc aagcatactt aaaactgata 5041 aataaattag aggcgacttg ccacgataat tgacgttgct tactaacaac cgtttctatg 5101 cagtccggta ctggcgggag cgaagctttt ggtggagatg agacttgttg agacataagt 5161 gtaaagacca aacagcagag taggtgggga acagacagca gatattagga gaatagtgca 5221 cttatcagca atttttcaca gaaaaatatc aaatcttaaa tacagaataa cacttagatt 5281 cttgtggtaa ttcattgcta ttcaaagtca gtgtcaatac ctatcgcctt tagtaagatg 5341 acttgctgct tgagcatttg gctcaaactt tcttcatcag gatgatccca acctaaaaga 5401 tgcagtaacc catgagctgc taaccaagct aactctgttg gcaaaggatg ctcttgctgt 5461 tgagcttgct gctgagctgt ttcgacagaa attacaatat caccaagata caacggttct 5521 tgggcgtcta tttctgccag ttcaggagta tctacctcta gtgcggcaaa agataaaacg 5581 tctgtcggtt tattttgatg acgatactgc tggttgagtg tcagaatttc actgttgtct 5641 gttaaacgta gtccaacttc gtaagcagac gctggagaaa tactcggctg tagtatttct 5701 aaccagcact caaaccagtc tttccaagtt tcaacagtaa tatgggcatc cttctctcca 5761 gagcagagag cttctggctc ggatgattca tcataacaat cctgcacata cagttcaact 5821 tgcaccagct acaaagtctc ctaagcctac gtataaattt attcgttagc gagtgaggta 5881 agccagacct acaagtactg ctaaaagacc tacagtcgtc agtacaaaat gcttcaagga 5941 ggtgaggcgc ttttgcacca tgtttcgcat ggcaagtttg acgtagctag gtttcggttc 6001 cgcagaacga gagtcgctca taataatgat ttgttaacaa actcttctca taatgtaaca 6061 gttatgatcg caccaagttg ccatagtgcc cccttggagg agataaacag gatgagagag 6121 atggtaatca tgagcgaatg cgttaattgt cctgtcctcc atctccctca tcccaactag 6181 gcagaagctt ctaccaactg gttttcttgc cgtagatagg cttggatgaa catatcgatt 6241 tcaccgttca tgacatcggc gatcgcagtt gtttcctcag cagtccgcaa atccttcacc 6301 atctgataag gatgaaaaac gtagttccgg atttggttac cccaagaggc ttctaccata 6361 tcaccccgga tttcggcgac ttctttggcg tgttgctctt gagcgatcac cagcagtttt 
6421 gccttcaagc gggcgagggc tttttctttg ttttgtaatt ggctgcgttc ttccgtacaa 6481 cggacagcta tcccggttgg aatgtgaaca acccgtactg cggtttctac tttgttaacg 6541 ttttgtccac ccttacctcc agagcgagtt gtggtcactt ccaaatcttt ctctggaatt 6601 tccagttgta tattgttatc tatctcgggc atcacttcca cgccagcaaa gcttgtttgc 6661 cgcttaccgt tggcattgaa gggtgaaatc cgcaccagac gatgtgtgcc cgtttctgac 6721 cgcaagtaac cataggcata gcgcccggtt atttctacag ttgccgattt gatccccgct 6781 tcatcacctt cggaaagttc gcttaaactg actttatagc catacttttc tccccaacgg 6841 gtgtacattc gcagcaacat ttctgcccag tcttgagcat cggtaccacc agcccctgcg 6901 ttgatggtga gtacagcgcc cttttcatca taaggaccgg aaagcaactg gagtaactcc 6961 cattggtcta gttcgcgctt gagcttatca acattagact cagcttcttg caaaagccct 7021 tcatctgtgt ctaactccaa cagttctaaa actgctttag tgtcttctag actcgtttgc 7081 cactggtgat actgttgcag atgtgctttg agatcgttga gctcttgcag tgtcttctgt 7141 gcctgcgttt ggtcattcca caactctggt tgagctgcta tttgttctaa atcttgaatt 7201 ttcgcagaaa gtgcaggtat gtcaaagata gtcctgggtt ttacccaggc gatcagacaa 7261 cgtttcgatt tcgcgtttga gttctaaaac ttccatactt cccaattgat aataagagct 7321 ttggatatgc aattgcatcc tggcggatgc tttgcactat tgccctctat tttatttgtt 7381 tttactttag caaatataga cctgtgcggc gtttttaccg taacaagatt tacatgaaat 7441 ttacttattc ttgacaagat taaatcagtg aacagggaac agggaacagg gaacagggaa 7501 cagtgaacag tgaacagtga acagtgaaac tgataactgt tgagagttgt actgtaggtt 7561 ttctgcttta aattttgtca ggtgggtcac gttttgttat gcccacctga tggaaaaacc 7621 tttagccgac ggtaacttga ttgctgtcaa ccgcagtcgc accattaacg gaacgcgcta 7681 cagaaaccat cttgttgaga atgtcttggc tgggaacttt gcctttgagg actacagtgc 7741 tacctgtttg agcaacccag agagtttcta tatcatctaa ttgttggtct tggtcaaatg 7801 caagtgccac gcgctttgcc aaaccgctct ggtcatattc tccattcaat cctacacgct 7861 ctgggggaac ttgttgagtt gcagcagcag gagcagcagt ttgctgagct tgttctactg 7921 gttgtggttc tggattgact tgtgcttctt gaggtttttc cattccaaaa agtcttttta 7981 accagcccat aatactattg cctcctatca gggtttcatt acatttgagt atagaaggct 8041 ggataaatgg cgacatctgc ctggaaaaat atatgctgta ttattaatta ctgctttggg 8101 
ggtaaatttc taatttctct gtcgcaggga gggtttgttt aatgaaccgc agaggcgcag 8161 agaggagcca gcgcgttgcg ggggttcccc ccgcagtcgc cgtcacccct gatggcaagt 8221 acgtgatttc tggttcagag gacaacacac tcaaagtttg gaatctggaa acgggggagg 8281 tcattgctag tttcactgcg gatagtgcgt taatgtgctg tgcggttgca cccgatgggg 8341 tgacaattgc agcaggcgaa gtaacaggac gtatacattt cctccgcctt gaagggatgg 8401 aggcgaaacc atgaacactg caccccgacc ccccaaccct gatcaatacn nnnnnnnnnc 8461 ccattcatct tcaccgccat caccaacttt ctccaccgcc ataaccgggg ctagaaactc 8521 ctctacaaat gcgatctgct gaaagcagca gcgctagctc cagcagggag ccgcttcgcg 8581 aacgctatcg cacttcccga taaagccaca caagagtgct agagaattgg ctggagttct 8641 tacagttaaa acgcagaaaa gaacaaaccc gttacagtct atatcattcc agtttttgcg 8701 cctgattaag ttagcaacta aatgtctttt aagcagtttg taaccttcaa tcaacatagc 8761 attatgaaag cgggagtagt aaatgctgcg cctgccattc catgtcttcg agaaaatact 8821 gaacatccta gagaaactat taaattgcat ggtatatttt gacgtctacc atgcgagaat 8881 ctcaatataa tcagggttgt ggtggagcta tgaatctttt caatctgtta aagcccttct 8941 taggcatcac aagttttttt cgtcaatcta tcaagctact gaagccagat tcagctaaat 9001 cttttgctgt gctatttact tttggtttgt gcctaagttt gagcttttct gcttttagtc 9061 ccagttacgc taaaagttct actcaaagtt ctgatgctaa gcctatagct aatacctcga 9121 tcgctgctgt taacgttgtg cggatgggtt atcaaaaatc tgcagttctt gctttggtga 9181 agcaacaagg cactttagaa aaacaattgt ccccttctgg ggttagtgtc aagtggctag 9241 aatttccctc tggtcctccc atgatggaag ccttgaatgc aaatagtatc gacttcgcgg 9301 ctgtaggcga gggaccacca gtatttgctc aatcagctgg tgttccactg gtgtatgttg 9361 gtaactcttc agcaagtcca gaaggtttgg caattctagt tcgcaacaat tcgccaatta 9421 agactctagc cgacctcaaa ggtaaaaaag ttgccgttgc taaaggttca agtgctcatt 9481 tcttgttggt gcaagcgttg tcatctacag gattacagta cagtgacatt cagccgactt 9541 atctttctcc agcagatgct cgtgctgtct ttgaacaaga caagattgat gcttggggaa 9601 tatgggatcc atttctagca gcagctcaaa agagtggggg agcgcggata ttaagagatg 9661 ctaagggttt ggcgtcatgg agggaatttt atgtgacgtc gcgaagcttt gcggacgcaa 9721 acccacagct tgtcaagcaa attctcgaat cagttgacaa ggtagggaag tgggcaaaga 9781 aaaatcaccg 
ccaagttgct gaagtattat caccgcaaat gggcattgat gttgcttcgt 9841 tagagttagc agaaactcgc cgcaaacgct acgatgtact accctttagc aaggaagtgg 9901 ttgcagagca acaaaaagtt gctgatactt tcttacgtca gaaattaata cccaaagaaa 9961 tcaaagtcac agatgctgtt tggaaaccta aatagtgata ctaagttgtg gctgaatgac 10021 cagttaaaac agatatagca atcctaaatg attcgtgaaa tcttccattg agggtgtcaa 10081 aagtctagtg tccttagtcc aaagtaacag ggctatttca ttctatgcac aaccatctgg 10141 aaagaaattt cacaggctaa tagcttgagt cgattaaaac ggactaaaat ttggagttta 10201 agttgcgtta aaactgagac cttgcacttc tgccaacaac cgcctgggat ttaaatccca 10261 gtctcatagc aaaagtcgtc taaagacgac tgcataagcc tttcagtcta ctttagtaga 10321 cttgggctgt gagcctagga attaattcct aggcgggcga ggttgcgttc ttgaaaatgg 10381 tgagccagcg cggtcttggg gagccagtcc tacaggaggg tttcccgacc tagggacctg 10441 gcgttggttt cccccatgtt agcggagcga gccgtaggct gcgactggcg ttagcgcagc 10501 gtgaccggag gtcatacccg gagggtgcaa gatgtgactg aaaacggact tcagctatta 10561 accaggaagt tcagttcctg gcggattaca aaagaatgaa ataaccctgt ctaaagtagt 10621 aaaaaatgac aaataactaa ggacggtcat aacgaggact tttacaactc aaataagact 10681 gctagaaaga cgtaattagt aatgaacaca gagggacatg gaattacttt ggtttattcc 10741 aacacatgga gatggtcgtt atttagcaac agcagttggt ggtcgcgaat gcaactttta 10801 ttactttcaa caaattgctc aagctgtaga taatttggga tttgctggcg ctttgttacc 10861 cactggacgc tcttgtgaag atgcttgggt attggcttcg tctctgattg ctgtgactcg 10921 tcaaatgcgt tttatgattg caatccgtcc agggttgatg tctcctggat tgtcagcacg 10981 aatggcagcg acatttgacc gcgtatctaa tggacgcttg caaattcacg ttgtaactgg 11041 tggtgatccg gtagaattgg caggggatgg tgtgcatctt tctcatgatg cgcgttatga 11101 attaaccgac gaatttttaa cagtttggaa agatatttca agtggtttag aaactaactt 11161 tagtgggaag tatttccaaa taaagggtgg taaactatta tttacttctg tacaaaaacc 11221 tcatccaccc ttatggtttg gaggttcgtc atttgttgca caacagattg ctgctaagca 11281 tgttgatgtg tatctgactt ggggagaacc accgcaacaa gtggcggaaa aaattgcatc 11341 tgtgcgtaaa ttagcagcag aacagggcag aactctacgc tttggtattc ggctacatgt 11401 cattgtccga gatactgaaa ctcaagcttg ggacgcggca aatgacttga ttaaatatgt 
11461 gagtgatgag gcgatcgcca atgcacaaaa aatctttgcc agaatggatt ctgaaggaca 11521 gcgccggatg acgcagctgc acaatggcga tcgcaaagca ctagaaatca gtccaaactt 11581 atggacagga gtcggcttag tgcggggtgg tgctggtacc gctttagtag gagattctga 11641 caccgtcatt gccagaatgg aggagtatgc aaaactggga attgatgtct ttattctttc 11701 tggttatcct cacttagaag aagcttatcg agttgctgaa ttactatttc ctcgcctacc 11761 actgcaaaaa ccgtctaaca caaatcaaac gcaaatgttc agtcctgttg gcgaacttat 11821 cggctttgaa aactttccca aacagcaata acagatagtc caagggactt ggataatgaa 11881 aattcaattt ctcaagcgat acttcaatca attagctcct tggtgcgtcc ctcttctgtt 11941 agttgtcatt tggcagttac ttgtgcaatt tggttttgtt tctgaaagag tattacctac 12001 tcccacaaat gttttgctgg caggaattcg tcttgctaaa tcgggagaac ttttttttca 12061 ccttggcatt agcgcacaac gggcagttct tggcttttta attggtggta gcattgcttt 12121 tgttttggga ttattaaatg gcatcattcc catagcagaa aagctgttgg atacaccaat 12181 tcaaatgttg cgtaacattc cgcatttagc catgattccc ttggttatct tatggtttgg 12241 agttggagaa gaagggagga tttttttggt tgctataggt gtatcgtttc ccatttatat 12301 aaatacgttt catggtgtcc gaaccgttga tccaaacttg attgagatgg gaaaagtata 12361 cggactcaag ccttggtctt tgttttggga aatcattttt cctggcgcat tgccttcaat 12421 tttgattggt gtacgttacg ctttgggagt tatgtggttg actctgattg ttgctgagac 12481 aatttcctat gactctggga tcggttatat ggcaatgaat gctcgtgaat ttatgcaaac 12541 ggatgtggtc gttttgagca ttctactcta tgcattgtta ggaaagttag cagatgtatt 12601 tgcaaaattt ttagaaaata aattgctttc ttggcatcct agttaccaaa atctgtaacc 12661 caaagttaat aggagttttt tatgattaaa cccaagcgcg taggagtcga tttaaaagta 12721 atcggtctaa gtaaaacctt tgggaatatt cgtgtattgc aaaacttaga tttagaaata 12781 gctaaaggcg aatttgtagc gattgttggg cgtagtggct gtggaaaaac taccctatta 12841 aagctattag cgggattgga acccccgagt aacggcagta tattacttga tggcaagcaa 12901 ttgcgtaagc tcaatccaga aacacgagtt atgtttcagg atgctcgtct tttgatgtgg 12961 aagcgagtga tagaaaatgt ggggttggga ttaaaggaac actggcgaca aaaggcacat 13021 tgggcgttag aacaagtcgg acttgcagaa cgtgcttttg aatggccaag tgtgctttca 13081 ggaggacaac gccagcgggt agcactggca agagcattgg 
tgagcgaacc acgtttactc 13141 ttactagatg agcctttggg agcgttggat gccctaactc ggattgagat gcagcgtctg 13201 atagaagaca tctggcaaca acagcagttt actgcattat taattactca tgatgtagaa 13261 gaggcagtta ctttagctga tagaattatt ctcatagaac gaggtgaagt ggctatgaat 13321 ttatctgtgc ccctgcctcg tcctcgtcaa cgaagtgatg ctatatttgc aacattagtc 13381 gagaaaatct tggagcgggt gttaagtaca aaaccccctt cccaaaaatc gacaaaaggg 13441 gaatctacca taggacaaat tgctacctac aacatactcc cctaaaaatt agatgaatac 13501 tgacgaatcg aataacaagg gaaacatcat ctgtggtcgg tgtgcttacg atgcaaatcc 13561 tcgtggggct acacattgtc aaaagtgtgg tacgcccctt gttatagctt ctgtacctaa 13621 caacgacccc agccctggtt ctgattctac actgctcgta ggtgggatta gtgtagtagc 13681 cacagtgttg ttttttgcta tcggaggtta ttttttctgg cagcaagtcc gagtagcttc 13741 aaccccaagc aatctcaata attccgccga taacatttcc tcggatatcc gactctataa 13801 ttctatgaag gaagtcccga aggtgccgga aggaacattt aattacggtg gtgctgtcgt 13861 cttcgcacct ctgaggaaca gtttacataa ggctatcaac caagtttacc caaagtttgg 13921 gctgcggttc accgaaccta agtacaacaa ccctggtcaa aatactggga tcactatgtt 13981 gcttgatggc gaactcagct ttgcccagtc ttctaagcca ttagaggata ctcattacag 14041 caaggcaaaa caacggaatt tttctctaca gcaggtggca ataggtatcg acggcttcct 14101 gttgtttacc catccaaatg tgtctatccg tggacttgct gttgagcaag ttaaagatat 14161 ttttaaggga aaaattacca attggaagca gctgggagga ccggacttag caatcacagc 14221 ttttgctttc aacccgaagt ttggctcctc actcaatatt ctccttggtc cggaattgga 14281 tcaactcagt cctaaagtac agtttatgcg cgattacact gatggtgttc gtaaagtttc 14341 ctcaattcca ggtgcgatcg gcattggctc tactggagca attctcggtc aacagtcaat 14401 ccgccctctt gcactagctg ctcacaactc taacaactat gtgcaaccct ttacagatga 14461 tggtaagcgg gttaacgctg cagccttacg ggatggcaca taccccctaa ctaggcgtct 14521 gtttgtggtc attcgccgag acggtactat tgatgaggct gctggagtgg catacgccaa 14581 tctgttgctg tctaaagaag ggcagcagta tattgagaag gctggtttcg ttcctattcg 14641 taattagtag ttcaaattga tcactatata tatggttaga gtcgagagga ggtattatcc 14701 tccgtccctc tctgttagct ccgtacgtgt aactttcgct acatacggct cccgatgtta 14761 tagggtttcc ctttgctcat 
gtggatgtaa tcatgacagc tttcatgaat agcaacaaga 14821 tttttggttt tccagttgct gtggttgccg tcaatatgat gtaagtgaac tcgttcacct 14881 ggcatcattt ttagaccaca gtgggcacat gaatggtttt gccgttttaa ggttttagag 14941 gtttcaccat cgtagagctt gctattgcgt tcactccagt aggtgatgtc attgtcaaac 15001 ggtgattttt cccctgtgac catgacatgt ttattttcgg agtaggggac tgctgggaat 15061 gctttgtcta tcaagtcttt gcttgagtga cggtcattct ttggttctct attgaactta 15121 tcaaaggttc ttttactaat gaaccagagc gagttacgcg agccgtccat cttacagaac 15181 ctgtgataat tcctccaacc tctaaccaag gatgctagct ttgcagcttt ttcctcggaa 15241 ccatagttag agcaattgac gatggctttt attttcttac ggaatgcttt gaagttatcc 15301 actgaaggga tacacctgaa tttcccgttg gcttgcacct tgaagtgcca gccaagaaag 15361 tcaaatccat ctgtcgtagc ggtaatcttg gtcttctttt cactggtatt cattcctctt 15421 tcttcaagga atttgtttat ccgatcaagt atttctgtcg cgtcatcttg gggtctaagt 15481 atgattacca tatcgtcagc ataccttata gaaggttcgc tgattttata cgcaggggtc 15541 tttgatgtta tcttaaaatc ctcacggtga tatctgtgta tactttcaat cccgtttaac 15601 gcaatgttgg ctagtagtgg acttactaca ccaccttggg gggttccttg ttcgggaaag 15661 tcgggatgta ttcctgcttt aaggcagcgg aagataccga ttttcaaccc ttttggtgct 15721 atcaattgat tcatgatagt tgtgtggctt atcctgtcga agcatttttc gatatcgatc 15781 tctatgactc gtttatctat tccgttgctg ttggagttta ggttagaaaa tatgattttt 15841 tgcgcgtcgt gtgcgcttct accaggtcta aaaccgtagc tcttggcgtg gaaggttgca 15901 acgcgcaagc tggttccagg gcatattttg caaggcactg ccatgctcta tctgctatgg 15961 ttggtacttt caaaatccgg aacgttccgt ctttcttggg tattggtatt tctcttaatt 16021 tgctatgatg ccaattaagg taggatttct tgaggaattc ctcaagtgcg aagcgttctt 16081 caaaattcag ggacgctttg ccatcaatac ccgcagtctt tttaccagca ttgagttgag 16141 atacttgacg aatagccaaa aatcttgcag ctttagattt cagtatcagt ttctggagtg 16201 accgcgcttt ccgcatgtcg cctgctttaa ctgctttaaa caacctaact tgtaggcgaa 16261 ataaatcttt ccggaatttc ttccagggga gattcttcca agattcacta gtcttgtgac 16321 tgtgcctaat catactctac tcctattgat tgttttctga acacctcagc gaaattactc 16381 gctgtcctac ccgaatcaag agagttccgt atctcgtcat acctacctaa gttcgactac 16441 
ctcaggactc ttgattcgtt tttattcgtt cctcggatga gattgtatgt gccgtcaggt 16501 gtaaccactt caactgctag aaccctttac tcttgccgtt tgctttacta acatcggcag 16561 ggttatctct agtggggtca ggattttgtg gtatgccctg ccctgaaata agttgttttt 16621 ctaggttcta tttcaccttg tacactccct attagcgcca gtgtcagccc ataacagccg 16681 tctgattgcg ccctgttccc agcttcagcc tctagaattc cgagtctagt cactgtgggc 16741 agataaggag tcacctctga gtttgagggg gaggactttc acctccatcc tgtccgaagt 16801 tcaacctgtc tgttgacgga ttggacaact tatgtatttg gttgtcaacg aatcgcactg 16861 taaattttca tgcctgatga gttaattctc tatcggcatg acatcatgaa agcgggagta 16921 gtcaatgctg cgcctaccat cccgtgtctt cgataaaata tcgacaaacc gcatattcca 16981 aacaatttcc tttgcaatat acgagtgacg ggcatggata gtttggcata cagtttcatc 17041 aagcccagtg aaggtgacta tcacttccat ctcatcctca accatctcct ctgctgagac 17101 tccgtacaaa ggactactct cgtctattgg gtgcatgatt gtccaagtca gcgcaaaaag 17161 tggagaatga ctgcggagca aagacatatc ataaaatcga cgcatcacat ctccttcttt 17221 gctgatctcg gtgcgtacca aactcattct gacttgtgct tctaaaatcc agttttgacg 17281 ttcattggca acccgcaaca ttaaagttgg catactgtta tatggagtaa tgattgctac 17341 atcgctaaac aagactcgcg cttgaggaag agaaaaccgg gcaaacatta agccacttgc 17401 catcgccaac cctatcagcc ctaacagtga ttcaatggta actaagatat tggtataagt 17461 tgttttggga tacatcgccc cataaccaat agatgccata gtttggacgc tgaagaaaaa 17521 ggcatcgaag aaattacccg gacgcgcatt ttcaatacca tcgcctcctg caagataagc 17581 caaggcaaac aaagtattag caagtacgta acttaaactg acaagtccta gagttttata 17641 ccagcacaga gtgagtagta aatgatatgg atcgcgccaa cggttgttgg atacacctat 17701 acgcacgacg ctggagccgc cgtcccggtt gactatccga gtttttggta tttttcgacg 17761 ggtctttttc atcgacgaca cagttatggt aactagaatt gaggatgtgt actacccatt 17821 atctaattaa ataccagcaa aatcttctag aggagcgatg gatcaccttt gttggctcca 17881 accgttttag tacttactca gcaatcctca aagtatcgcc aattgccact gtaccattgg 17941 taataacttg tgccaaaaca cccgtctgtg tgtgttgaaa attcttttgc agcagagtca 18001 gcaaggagat atcacgttct cctgtctctg gattaacttc tatatttaag cagcgattaa 18061 tcctggctgt aatagcaatt cgtgctgtac caagttgcat ttctttgcct 
acccagtcaa 18121 attctcccca ggcgggtaca ccatctaaaa caatattagg gcggaaacga cgaacatcta 18181 tttgtcgtcc tgctatctca ctcaagtggt taagagttgc ttgactgaca agagaaatat 18241 gcacggcttc ccggtcgggg tatcgagtct tgccaaattc tcccaccagc tgcaagggtg 18301 cgcggtttgg atgtctagca gcttgactag ggtaaattcc tgcgaggtat cctgtgaaga 18361 aagttgcnnn nnnnnnnttg caatgcgata gcttcgctct tggacacaaa tcttagtcat 18421 tgggaattgg tgattggtaa tttggcttgt aatgagcgtt taagcgctca ctacgaaccc 18481 ttcggctacg ctcagggtaa accctttatc cattttggct aatcctttag agaacccatc 18541 gatttctgtg cagctgactg ctttttagga cgaccacttc ctgtaccaat gatactacgc 18601 aaatacaaac caccaactgt tactaaaaat aggacatatc ccactgcttg cacaacatag 18661 agatattgct tgtaaccaaa caaggcatta agaataacgc cagggaactt ttcatcaggc 18721 aagatgttgg aagtattcca aaccattgga cctaagatac aagagcgaat tctcgtgaac 18781 cgttcgtaat aaaaacaaag accttccgag gcgcgattac taagagcaag ggtagcaaaa 18841 ccttcgtcaa acttctgcaa agctgtcacc actaagcccg ccacaatcaa caccaagaaa 18901 acacccatga cttggaaaaa ttggcggatg ttaattttaa caccccatcg aaataacaac 18961 acaccgatta atgctgcgac tgcaatacca gcaacagcgc caaaaccagg aacaagcccc 19021 tgttgaatgt tagcagcaac gaacagaacc gtttcaaaac cttcgcgtac aacggcaata 19081 aatactaaac tgaaaacacc ccaaccagca accttattct gctttaaagc atctgttact 19141 gctccttcaa cttgagcctt cataaatctg gcttgtttgg tcatccagat aagcatccag 19201 ctgagcataa cgatcgctaa cacgctgaac acaccttcca tcattggttc aaccacggtt 19261 gagtattggg gattaattgc actgagtgct tgaaccgcaa gagtgaaaag aaaacctatg 19321 agtgcgctga caataatgcc aacgccgaca ccagcataaa cccaagagtt aagacgagat 19381 tgtttagctt tttttagcaa agctagtaca atgccaacta cgagagcagc ttctactcct 19441 tctcggagtg taatcacaaa agtgggtaag gcagcactaa agttcattat ttgtccttct 19501 ttattttgtc tcggtgctta gtcattagtc atgagtagaa atgaaatgac taataactaa 19561 taactaataa ctattcagtg aaatcgcgta acatcttttt cagttcaata ttatgcatct 19621 cttcctcacc aatcatatta cgagcaaatt gttcaagata aatactagaa ttggttactg 19681 tttccagcag atttttatac agtctcaatg ctttttcttc atgggctaaa ctttccgcta 19741 agatatcctt gactgaatct ttatcagttt 
catctgctgg taaaatttgc aaactaggac 19801 gaccttctaa ccctgtgaga atttctccca ctttttgggc atgaagtaga gactcgctcg 19861 cctgctcttt caaaaaatct acaatcgtaa tgcggtaagg accagttacc atcaaagaat 19921 attgagtata acgcacgact cctgatagct cacactctat aatttcgttc agcagattga 19981 ttgtttgttt ctggtcaagc tcttgcatca tgttagtgat tatagtgata cgtagaagtt 20041 tcttgagtga tgagtgataa gttattcatt aattgacaag tcaaaagtta aagctctcgc 20101 aatttggcta atggtcatac cttgcaccga gcctgtgagc taaagttccg gctaaaaact 20161 caagcccatt gaaatgggct agaatatgtg cttagtaagc tttcgcttac ttgctcgcta 20221 atgagcggaa ataaatttca cgagcgttga aaatgctgat gcaagatatg gcttaaccca 20281 attcttaact cataactctt tgttcactgt gcactcattt taaattctcg cccgagagta 20341 aaatcccagg caagtaccac tacaaacaat tcaaaatact attgcgcttg cttgagtgct 20401 ggaactatct gttatttttt gagaattttc ctcaatggtt tttaccagct tggtaacatc 20461 cccaggggtt ttgactggtt tagctggtgc aatagcagag ggccaaactt ttatcagttc 20521 agccatactg gtttcaattg ctttatgttc ttggggatgg tctttagcca ctttgctaga 20581 aataccttta tataagtcat cagcgtagtt gacaaagcca cgggaatctt gatactcaat 20641 ttctgcagaa attttaccat ctgcaattgc cgcaccatat tctgagttag ctgcatctag 20701 caatccgtta atgacttgta gcgcgaatct aggttgcgat cgctgatctg ctggtaaagc 20761 tgtcacagcg ccatcaactg cttgcatcga agaggtaaaa tcagttttga ccttgggact 20821 tttgggattg gctttgacga aatcttgtaa actcaccaaa gatgtcttaa attctttaac 20881 tttgcgctca tttaattggt cttctacatc aacataaatc tcttcaactg gatgaccgat 20941 atgaggttct gcctttttgg gctgattttg atccaacagt tcttttgcta ccaaaaggtg 21001 tcctttcatg agtcccagtt tggtcatata gtccacatct tttggttccc cagttagaac 21061 gacttcctga gtcgtaacct gatccttaat ttggttgaat tgctcctgag taatcacttt 21121 tttagaaact aagtcctcag gactgctgta gggacgactt gcctgaattt tatttgataa 21181 tgccggaaca ccaagctgtc cctcaaattt atccaattcc gacaagatcg cattgttgat 21241 gttgattttc tttttgctgc tgtgtccgct atgactcact gcttcagtag cctggggact 21301 ggagttagca gctggtgctg ttggattttc tgcggtttgt gtaccgttac aggaagtcag 21361 agcgaagatg attgcactag caatagttag gcagatgtaa cgaaattttc tcatttttgc 21421 tcctaaaagt 
gtttaaaagg taatcattaa aacattcaaa ttagtacatc acggagaaaa 21481 cactctcacc gttaaaaaga cacaaaaagc tttgcaatct agcatttgag cctacccagc 21541 aggaaaactg aacacaaagc gcatacgtgc tcagcctgac agagcgtgct gcttttggca 21601 tagcttttgt acgagtgtca atcaatgaat ttgacagtgg atcttagatt ttgctatgat 21661 actttctcac aaatgatatg atttttcaac tacttgctaa tgttttgatt atgatttaag 21721 ataaattgtc agtatatttg aatatgtaaa aaatacggtg taacagcaac tctgtgtaaa 21781 gttgtgctta attgaaatac accgcgctta tagggggact gttatagctc taccatgttg 21841 tggatgcaag tgaaaatcgc tgtagtgcta gcaagtagtc tagtctctgg tggtgtgcaa 21901 attcaataat gaacaatgaa caatcaaaac taataattga taattgtttt aatacgtgac 21961 agcttaatat ttctaattgg cgttgtatat agcttgttat actttataca acggcaacga 22021 catcaataac ggtaaattgc tgcaacaggt gtgctaaagg gtacttgttc tgcctcaacg 22081 gcgtagactg taccaccgtt taacaaggta tgagctgcag cgaaatccaa caaatcctca 22141 tcaccagttt cttcttcttg gtgcaaataa acagtttctg aggtagggtc aaacagtccc 22201 cattgctgct gactggtagc aaccaacaat gaatcaattc tttgataata ggctgctgga 22261 ataatttctt tgagttcatt ggatgctttg ccagtattac caccaaacaa ttcgttaaat 22321 cgctctacaa cttcttgctg tgatttatgg tagtgtggct caacaatggg ccaagctcgt 22381 tcgtgtagtt cttgtgcgct gagaatctct tgattcccag tgataccttc ttcaattaaa 22441 tgttggtaag tatttgcctc tgtgtaaagg ggcatcaagt attccacccc agccagtagc 22501 aaaggcgctg tttgctctcg cagtttctct tctagggcgt gattaacaac ctgaaagaat 22561 tgtaagatgt ctttctggtg ttgatctcta tcgggactac cttgcccgtg aaacgatcct 22621 ggttgtacag aagcattagc agttccccct ttggatgtag caatgcggaa ttgaccttgt 22681 tgagcagtgt catcatagtt gagcgcctca tccagacttt taggcatatt ttccaccttt 22741 agctctttga tgctatagcg tgttccctcg aagaaacgaa catctttttg actgagagct 22801 aacaggtaga aacgcccatc tccatttaat atccgcagta gtggcttaat atgaaatcgg 22861 tctgtcacta caaccaactc gtcaaatgtc agaggtagag tatagtagcg aaaaatatct 22921 ctagaaataa aaactgctag tccttggtct acgttttgtt cccagaaatt tgaatcatcc 22981 agttgatcag attttgctag caattcaatg acatcgttct gctcaagacc tgcatcaatt 23041 aaacgtgcct cagcctcttt gattaaattc ttaaagcgaa ttgagttttg tcgaacttct 
23101 ggcccagcta actgcgtagg catataaata gaaacagaat tttgtcttgg ctgttctaga 23161 agtgttttga tttcctctct agaaattaat gtcatgccat attcctcgta ttaacgtgag 23221 tggacatcca aacaacaccg ttgatatatt caacggttct atgggacttt gcccccaatt 23281 atttgagctt gatttatttc acgtcaagaa aagttacgtt tcatctttcc tatggcagga 23341 aatttggata caatttctgt tcccgtagag tgtaggaata agaaaaattc aaaaattatt 23401 gattagttat ccaacattag gcgtgtttga gttcatatgt tttgagttgt tagttaagtt 23461 ttttcccctt tgcgaaaaaa aacccaaagt cactcgtttc ataatccaat gtaaagcgac 23521 aagtatgcca aaggtaatag caataatcac cagaggctgt ttaccaagaa tatcctctat 23581 acctctagtt agcaaagcgt caggttgagt cacaaaatcc cagctgttaa aacgtaagaa 23641 acgtccccaa taaacaccaa cagcacagag agcatgggta attaactcaa cccacaaaat 23701 ccatttgctc ttaccaatgc gatgtaagta ataacctaaa ttaattaaag atataacgta 23761 agcttcaaat ccagctaaaa taaccaatac atacaacggg atcagtatga gggttattat 23821 ccatacagac tgaattgtac gaatatcatc tataagatga atgatatcag tcagcaggta 23881 aggtgcattc ggtagaaaac tatagaaaac aataaatccc aaccaccaaa cccaagaccg 23941 cccgcgccca ttgcgaaaca accaaacact caaagccaaa ggtataaaag ctagaaataa 24001 gttccaagtc atccaattta tattgatgcg caggacttgc atgactttgg ctatcaattc 24061 agtgagttcc gctttcatag ctgtttcctt gaagacgtgt aaatgattcc gatttctccg 24121 tatagttata agctagctga ttgactatca aaactgcatc aggtaggaaa aatgactact 24181 gaggaaatgg tcaatcaagc gggctaagcc actagtatag ccaatgcgta aatcagaacc 24241 actttaccat gaacaagttt ctcatctgga atggtgcgct tgtgctggct atgattatag 24301 atttcatctt gccctaatac agtttcataa attcttgtca actgtctaga gaaataccta 24361 tgttcattgg tctgttacta accaatgaca aataattaat tatgttattt gtcatggtac 24421 tctgggaaag taagacatac aaagaggtct acatggaacg gagagcttta ggacatcaag 24481 ggcttgaagt atcagcgatt ggtttaggat gcatggggat gtcagagttc tatggtcctc 24541 gtgacgacca agagagcata gccaccatta ggcaagcaat cgatatgggt gtcaatctca 24601 tcgacactgc ggatttttat ggagttggac acaatgagga attggtccgt cgggcaattg 24661 aaggacggcg agagcaggtt gttttgtcag taaagtttgg tgcactacgt tcatatgatg 24721 gtggttggat tggttttgat gggcgtcctg tcgcaatcca 
aaatgcaata gcccatagtc 24781 tgcgccgtct caatgtcgat tacattgacc tttattttcc gtcccgtgtc gatccaaacg 24841 tccccattga agaaactgtt ggcgctttag ccgagttggt gaaacagggt aaagtgaaat 24901 acattggact ttctgaagct gcaccgcaaa ccctacgccg cgctcatgcg attcatccga 24961 tttcagccgt ccaaattgag tactcccttt ggagtcggga aatcgaaaag gaactgttac 25021 caacgttgcg agaactaggg attggtcttg taggatatag tccgttgtcc cgtggtttgt 25081 tatcaggtaa gatagatgag acttctctcc aacaatcggg agatacacgc agcaggatgc 25141 ctcgctatca aggggataat ttgacacaca atcttggctt ggtcgaaaaa ctcaaggcga 25201 tcgctacttc caaaaactgc actccagcac aactggcgat cgcctgggta atagctcaag 25261 gcaatgatct tgtctcaatt gtcggtacca agcgccggaa atatctggaa gaaaatctcg 25321 gtgcggtgag cagtgcgttg ggcgggttcc ccgacttgaa gcaactgcga acccgaaggg 25381 ctgcgataac tctgacaaaa ggtgacctag aggaactgga acattcattt ccagttaatg 25441 tcgtagctgg tgaacgttat ccagaagcca tgatggctta tgtctatgca taattcaaca 25501 aggggttata gaagtgcccg aaccatcaaa cacttctgaa caaacgcgag aggttatcct 25561 gatgacagca gaacaacaac tgcggcgtca cgggttatct aagatgactg tcgtcgatat 25621 cgcacgtgca gctgggatgt ctcattccaa tgtctatcga ttctttccaa ccaaggcagc 25681 aatctttgat ggaattacac aacgttggtt atcccaagca gaaaagcatc tagcactcgt 25741 tgttgcacga gaagtctcag cggctcttaa gttagaagat tttgtgattg aactgcatcg 25801 tgtaaagcgc cagaagtttt tgacagatcc agagattttc gcgacctatt atgctgttgc 25861 caaggaatgc aatgatgtcg ttcaaaagca tttaacttac attcacagtc ttctttttca 25921 gattatcaat gagggtattc aatcgggaga atttcaagtt actgatgccg aagcagctgc 25981 atctgtagta cgtagcgcta ccttacgttt ccatcacccc gtccttgtga tcgaagatag 26041 ggaacgagac gttgagaatg aagcccgtgc tgtgatgcgc cttctcatcg caggtcttaa 26101 aactagagtt ttgtaaattt agctatagtc taacgattta cagtagcact tgcgtgctac 26161 tgcaaaccca tttacctcat tttgattcgc tacagtactt gttcacctca gacgccagcg 26221 catcagactg tttagcagca tctgcggctg ctgtcagagc tgtatcgatg tctcctcttg 26281 ccttttgaat tttgacccta ccctctgatg aggcttgagc tgtcttagct gaaccaagag 26341 ccttacctgc tgtagcaatt gattgactga aagtctcaaa caccttgaca aaactgctct 26401 gaaattgttg aagcttgggg 
tcttttaaat ttagctcctt aatttctttg gtcacagctt 26461 ccaaatcctt ggatagttgc aagctggttg tcacctgctg acctttattt ttatcaatca 26521 ggtcacttcc tttgttaatg aggctaacca gtcgctcaca ttgggaggct ttgctctcac 26581 tgcacgcagt cactaacaaa gcgatactca agctaatagg aataataacg gtatatctac 26641 gccaaacagc cattaacacc ttaccaccaa taaaaccgaa acaatagtaa cttaatcatt 26701 cagtcatttt tgctgatatt tggataattg cctgatctta agtgtacgta agcaaagtgc 26761 aggtctacct ggttacacct ttgtgacatc aaaacggatg cagtatcaac ctgtagttac 26821 tgttatgtaa agctcttttt agtccgagtc actccaagct gccgtgtata cacttagagc 26881 aaagtctttc aattataagt aattaaatat actataaact tttgtcatat ccccaggtag 26941 gggattgcat tttgatgtag aaaaactctc tattaaagat agagaaattg ctaagattga 27001 ctgccaatga aaaaaatttc tgcaatggtg ttggcactga gcttttcagt ggtcgcttgc 27061 agtacaccag ggaatcaaac agtatctcct actccagaag caacagcaac tcctgctcag 27121 gctagagacc aaaaagctca aaataagctt ttagttgtca ctactgtagc acctttaact 27181 aacattatca gtaatattgc aggagagcgc gttcaagtca caggcattat tcctgaaggt 27241 acagactcac acacctttga gcctcgtcct tcggatgctg atttgctctc caaagctaag 27301 ttaatcatcg tcaatgggtt gcacttggaa agcccgacag aaaaactggc aaaagcttct 27361 aagcctaaag aaactaatat ttacgaactt ggcgataaca ctatcaccca aagtcagtgg 27421 atttacgact tcagctttcc caaagaaaag ggcgatccca atccccacct gtgggtgaat 27481 cccaaatatg ctgaagccta cgccaaatta gctgcccaac agttaactca gttagatccg 27541 gctggtaaag attactacgc taccaatctg aaaaactact tgcaacgcct ggatgcacta 27601 gataaagcaa atcgtgcagt tgtggagagt ataccagcta aaaatcgcaa acttttgacc 27661 taccacgatt cttgggcgta ttgggcaaga gagtacggct ttcaagtcat tggtgcgatt 27721 caaccatcag attttaaaga gccttctgcc caagacgtgg caaagctaat tactcaaatt 27781 cgtcaagttg gcgttcctgc catttttggt tcagaagtct atcctagcaa ggtacaagag 27841 cagattgccc gagaagcgaa ggtgaaaaca gccaatactg ctgatgatga gttacctggc 27901 aaggggtcag ctaatgcaat ggaaaacacc aaccctgaac atacatacat tgggatgatg 27961 gttaacaata tgcggattat tgctgaaaat ctaggcggaa atcctgagct agtgaaaaat 28021 gtcaatactg ccaatgtagt tggaccaaca gcgaacgaga caaaaacttc tcaaaaataa 28081 
ataaaaacca aagataagcc ttaaagactg gacttatcat ttagtcttaa caatgcattg 28141 cgcctgctgc caaggttatt tattgttaat ggaacaacca ctactagaag ttaaaaattt 28201 aacttgcggc tatcaaaaca aacctgtgtt tacacaggta aatctgtcct tgtatcgtgg 28261 tcaactttct ggtttggttg gaccatcagg aagcggtaaa agcaccttga tgaaggcaat 28321 tttggggtta attcatcctt gggctggaga aatttggttt cgggggaaac gtttgcagcc 28381 agggacatca ccgccacgag tgggttatgt gcctcaagtc gaaacggtag actggaactt 28441 tcctgtcaca gctgaagaag tggtgatgat ggggcgatat cagaagcaga gaatgttacc 28501 ttgggcttca agaggcgatc gcactgcagc tagagaactc ctaaaccgcg taggagttgc 28561 tcatatcgcc cgccaaccca tcggtgaatt atctggggga caacagcagc gggtttttct 28621 cgcccgtgcg ctggtgggag aaccagaaat cgtattatta gatgaaccga ctagcagttc 28681 ggacttgcag gttcagcacg aactcttaca tttattagcg gatttgaatc agcaaggttt 28741 gacgattcta ctatccactc acgacctgaa ctcagtcgcc acacatctac cttgggttgt 28801 gtgtttcaac cacggattga tttgtcaagg acaaccgctt gatatcttta ctcctgctaa 28861 ccttgagcgc acctttggtg cccaaatggt tgtcttccac caagaagacc gcattttgat 28921 tgctagcggg ggaacttcgt tgcgccatca aatgcaacgc aacttgcctc caagcctact 28981 tcaaggtaaa tcgaagtctg cgtgatttca acatatttgt atgaatttta tactcgaacc 29041 cttccgctat gagtttttta gccgtgccat tttagttggc atgatggcag gtttgttatg 29101 tggcatgatg ggtgtttata ttacaactcg ccggatgagc tacattgccc acggtttatc 29161 tcatgctatt ttgggtggag ccgtattaag ctatgttttg ggacttaatt tctacattgg 29221 ctctggaata tggggatttg gttctgctgt tttgatccaa tatttgacag gacgcaaaat 29281 ctactcagat gcggcaattg gtattgtcac aactgcgagt tttgctttag gagtagctgt 29341 tatcagcagc taccggaagt ttagccaaaa ttttgaagcc gccttgtttg gtaatgtgct 29401 gggagtttct ccaacagatt tatgggttgt gacaggcgta accgttgttt tgctgagctt 29461 agttttctgt ttttatcgcc ctttattatt ttggtgtttc gacagagaag ttgctcaggt 29521 tcacggtgtt cctgtgtttg caatggatac attgtttgct ttgatgctgg cgacaatgct 29581 tgtagcaacg cttaatgtgt taggcgtaac actgattatt tcggcggtgg tgattcctgc 29641 ttcaatagca cggttgttga gtaaccattt tggttacatg atgatttttt ctggtttttt 29701 gggagctgcg atcgccttta tcggtattta cctaagctac tacttcgata 
ttgcttctgg 29761 agccagtgtt gtgctgctct caaccatgat atttgcttgc gtgttgcttt ggaggagctt 29821 gcaatatcgt cgcaaacgct atttagcacc cctcgcctca caacactacg agtaaaactt 29881 tttggcgcag ggggagccag cagcctagtt aaccatcagg taacaaagga tctgtatgat 29941 aaagcgagtc ttgtatcgga cacaatatca tgtttgaaga attacaatct gttgaagctg 30001 ctcagacaac ggcaatgacg caaaatgatc gttatacact tgctactgga gagttgggcg 30061 catatcgact ttcaatcctc aatataatcc acagacctta tactgagttc ctatttcgac 30121 gagttggact tgagcaagga atggcagtag ctgatattgg ctgcggtact ggaaatgtat 30181 ctaactgggt ggctcaacaa gttggttcta gtggctcagt tgttggtgtg gacttgagcg 30241 cggagcaagt agaacaagcg cgacgcaatg caaaaactct tagtttaagt aatgtaacat 30301 ttggtcttgg tagcgcctac gatactggat tgcctcaaga ctcctttgat ctagtttact 30361 gtcgcttctt gctaatgcat ctaacccgtc ctattgatgc acttctccaa atgcgatcac 30421 ttcttaaacc tgggggactg cttgtgtgtg aagaagcaga ctttagtact gctttttgtg 30481 aaccttctaa cccagcctac aaccgctgtt ttgagctttt ccttgctctc tcacatgcaa 30541 gaggtcagca ctttagtatg ggaattatgc tccaccggat tttccaagat agtggattcg 30601 tagctccaga aatttctttg gcgcaagctg tcgtagtgcg tggagagact aagcgtctag 30661 tggatttgtc actactcgag gcaaatgacg ctttgattga agctggattg acgacacaag 30721 aagagattaa ccagaagatt gcccaaatca aagcgttagc cgctgacgaa acaacagcgt 30781 ttggcatccc tcgtgtaaca caggtatggg cgcgaaaata atggattcgc aaaaaccgag 30841 gtgtgcaagc aactggttgt acactctcag tttttgcgaa tcagtaatca gattgcctca 30901 gatcaggttt cggactgcac ctcatttcag gttttccaag cccatcctga cttataaaga 30961 cgtgcgacta agtaactaaa tcattcaaca attcgctcat ttatcaatat ttctctccca 31021 ttcttctacc cattgttcta gcttttctct caataacgac tgccatccag gtatagcctt 31081 aattctttct cttaagccag atcttacctt tacctgaaaa ggaatcttat ctaaagcttc 31141 atctccttgt ggtttaaatt taggaaaagg cataacacat ttgacagtat aactggtata 31201 ctagcagtat aacaaagcag gcaggcgtca tgtattcact aaagctggag ttgaaactga 31261 ataatcagga aaagtctaag ctagctggat gtgctggctt tgctcgtttt gtttacaact 31321 ttgggttgtc aatgcttact agttcttggg attttgaggg aattaaagca ggtgattcta 31381 agcgtttaac agcgattgaa aaggttttta 
ctaattacgt aaaaaccaat gctgattata 31441 cttggatgaa acaatatcca tcggcaatct attcttctgc gcttcgtaat ttagcaaaag 31501 ctgttgagcg ttggcgcaaa ggagattctg ggtttcctca gatgaaatct aaaaggcgtg 31561 gagacagttt cacggttctt aaaaaagcag gaatttatcc agccaaaggt gagtcaatgc 31621 tgccttttac gaataagcaa gtattgcagc cgggaaagag aatcacgata ccaggattgg 31681 gagagttccg acttaagcga ccaataccgt ttctatgttc tagtcaatct ttcactattt 31741 ccagaacagc taataaatgg tacgtcagtt ttagcttgga tgttgaaaaa gtccctcctt 31801 tgtttcactc agttgaatca gtcggtatag atttaggcgt taaaactttt gcaaccttgt 31861 ctgacggttc tacaatagtt gcaccaagta gcctcaaaaa agcgaaaacc aagctgaata 31921 agttacagtg gcgcaatcgt agaaaacgtg cgattcgttg attgctgaat cacggtaacc 31981 agtccgtaca taagcaatcc a // LOCUS NODE_870_length_31554_cov_5.25911931554 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 31554) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 31554) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..31554 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 496..1692 /locus_tag="DP116_07530" CDS 496..1692 /locus_tag="DP116_07530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="PRJNA477356:DP116_07530" /translation="MIDFSLTLEQRMLQSKALDFAQTEMKPIVQIIEESDNPKIEPWD FCQSVFHKGTELGFTSLLIPKEYGGLGGKCVDLVLVLEELGAVDVSIASSYFNLTAAM SLFVTRAGTSEQQKRILSHVRSGEPHLYSAAESEPNVATSDLFCPIPDPNIGLKTFAQ RDGDGYILNGKKSSLVTNAGIADAYFIIARTALDKPLGESMSIFYVPANTPGLKFGKR TEMIGWKPSHHAEIHLDNVRVPAENLLGKEGEAAKLLMLLPEVAIGLAASYVGLARAA YEYALNYAKKRVSWGRPIIQHQSVALKLADMMINTQAARLTVWEAANTADTNPQLAAM VKAPAAKTFAVDVAIKNAQTAVEILGGYGVTKQAQTGKFLADASIGYSCDFTREVLRL GLVNCL" gene complement(1714..2439) /locus_tag="DP116_07535" CDS complement(1714..2439) /locus_tag="DP116_07535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316245.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07535" /translation="MKSGVVKKKKSLWLLCCSFWLLFSNSLNDAAKAQNHHPSASPKA ERRVVGGERFFCSNQNLEALTTQLLQDLPNYANRASQRARRLRRATDVYSYMVVAGRP EFTPLPLNPSGYTADSVKTASVGVEQVFFTTLERQYTAGKAFQLQQFHWLFLTKTKSD WRFVMMFSQIGLSPKNQPPTPPRDSSNGVIAQGITAWLRDCQAGSVRVRSRNLKGSSQ KPLPSQTPPPPLKPPLSQPPPEL" gene complement(2436..3155) /locus_tag="DP116_07540" CDS complement(2436..3155) /locus_tag="DP116_07540" /EC_number="4.1.1.23" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742877.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="orotidine-5'-phosphate decarboxylase" /protein_id="PRJNA477356:DP116_07540" /translation="MNAQEQIIVPLDVADEQAAIALVERLTSVTFFKVGLELFTSTGP KILEVLKSRQKRIFLDLKFHDIPNTVAGACRAAARYGVDLLTIHATSGNEALRAATEA VQAGAADAGVKPPKLIAITLLTSLSSRQLAFELKIPLELPEYALEMALMAQEMGLDGA VCSPVEVAQLRQSCGDDFLLVCPGVRPTWAQKGDQARSLTPAQAFAAGANYLVIGRPI TAAADPELAWKRICEELVAVT" gene complement(3169..4338) /locus_tag="DP116_07545" CDS complement(3169..4338) /locus_tag="DP116_07545" /EC_number="6.1.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742878.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tyrosine--tRNA ligase" /protein_id="PRJNA477356:DP116_07545" /translation="MTQDFSWLRRGIVEIFPQPTDSDTHVESLEKRLATTERPLRVKL GIDPTGSDIHLGHSIPVRKLRAFQDAGHTAVLIIGDFTARIGDPTGKSEVRRQLTEED VARNAQTYLEQVRPILDFDTPGRLEIRYNSEWLSKLNLGEILELLSTMTVGQMLAKEG FAERYKKENPIFLHEFLYPLMQGYDSVAIKADVELGGTDQKFNLAVGRDLQRHFGQKP QYGVLLPILIGLDGVQKMSKSLGNYVGLLEHQTQKYQKLQQVPDHLLNDYFTLLTDFP LDKLPENPRDRQKLLAQEIVKQYHGEQAIKDIEAGDVPEFSLADVHFPAKLAYILNVS GLCKSSGEGKRKIQEGGVRLDGDRITDIETTFTDSAQLQGRVLQLGKNKFVRLIP" gene 4515..6440 /locus_tag="DP116_07550" CDS 4515..6440 /locus_tag="DP116_07550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317780.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="penicillin-binding protein" /protein_id="PRJNA477356:DP116_07550" /translation="MSSRTFEDKQPQPRASGSGFEFLKGVGQVAGGTLLSATMLTSSI VAGGLVGLAVSFRNLPDVRQLRSFLPSETTYIYDIKGKLLTSIHGEANREVMPLDRIS PDLKRAVLASEDSDFYYHHGINPKGVGRAVVTNWSAGGVREGGSTISMQLVKNLFLSH KRAFTRKIAEAVLAIRLEQILTKDQILEMYLNQVYWGHNNYGVQTAARSYFNKSAEYL NLAESAMMAGLIQAPEDYSPFIHMNKAKEQQKIVLGRMKELNWITQEEYDNALKQPIK LGKIRSFQGSALPYVTNAVAHEIAKKFGREALLKGGMRIQTTVDTNFQNMAEDTVKNW HDILRGQGLSRNQIALVAIDPRTHFVKALVGGVDSKTSEFNRATQALRQPGSAFKPFV YYTAFATGKYGPDTTVYDTPVGYRDGDGWYYPRNYDGGYGGAMSIRTALMQSRNVPVI KVGKAVGMNKVVETCRTLGILSPMEPVTSLPLGAIGVTPLEMAGAYATFANYGWQSPT TIIARVTDSSGNVLLDNTPKPVQVLDPWASAAIVDTMRSVINGGTGKNAAIDRPAAGK TGTTSSEKDIWFVGTVPQLTTAVWVGRDDNRTLASGATGGTMVAPIWRNFMMKALKDV PVEKFKSPYQFPRPKSN" gene complement(6509..6832) /locus_tag="DP116_07555" CDS complement(6509..6832) /locus_tag="DP116_07555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410226.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1825 domain-containing protein" /protein_id="PRJNA477356:DP116_07555" /translation="MGFFDSDIIQQEAKQLFEDYQALIKLGGNYGKFDREGKKLFIEQ MEAMMDRYRVFMKRFELSEDFMAQMTIEQLKTQLSQFGVTPQQMFDQMDLTLQRMKNE LEQQS" gene 7214..7588 /locus_tag="DP116_07560" CDS 7214..7588 /locus_tag="DP116_07560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112201.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07560" /translation="MFNQRKSRSIAAILALSGTVTISGLHKFYLGQPLWGVLYLLLSW TPIPKVASAIEGVWFLAQDEEAFDRHFNLGKSAIKNSQFASNQVSTVAQGLRELESLR QDGLISEYEFEQKRRQLLDHIS" gene 7602..8135 /locus_tag="DP116_07565" CDS 7602..8135 /locus_tag="DP116_07565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317784.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ComEA family DNA-binding protein" /protein_id="PRJNA477356:DP116_07565" /translation="MKNWLPLNSKLQKLRYKLLNDPYYRLQSAEEIAIAAQLGIHIDA NQATVDDWLRLPGLSIHQARSLVQLSRAGVKFYCIEDIAAALSLPVQRLEPLKPILNF SYYDDESLVLDSHKINPNTATVETLAKVPFIDLSLAQAVVENRSSAGTYLNLADFQRR LNISGETISQLMYYLQF" gene complement(8240..8812) /gene="lepB" /locus_tag="DP116_07570" CDS complement(8240..8812) /gene="lepB" /locus_tag="DP116_07570" /EC_number="3.4.21.89" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130837.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="signal peptidase I" /protein_id="PRJNA477356:DP116_07570" /translation="MTHQKSEAKESPASSRGWRTLRENLILIAIALCLALLIRTFVAE PRYIPSDSMLPTLHKGDRLVVEKISSLFHPPQFGDIVVFQPPEELQHRGYPKDQAFIK RIIGTPGKKISVAGGKVYIDGQPIQEDYIAEPPILPMQERQVPPGEFFVMGDNRNDSN DSRYWGFLPKENIIGRAVFRFWPFDRIGVI" gene complement(8880..9740) /locus_tag="DP116_07575" CDS complement(8880..9740) /locus_tag="DP116_07575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_07575" /translation="MKRNKETITLSIPSGTKEHLEEIARRLGIFWGKSPSVSGLLVAI AQQEFEVGEPFALNPTQVAALQQAIGLLKDSGCVEQAQIISSLVLERGKLEPPMRQSL LQQVSQPSEAWRILVDQLRGNQQPFHLLYGNPQGEDLSFTVRFAEISFEEKRFYLNIW CDETDDIKNPGFPELIHNRCLRLDRIKGVVPINGQWRHEGLDSLKVHLHLYRGMVKAY ESKPDDIDNEVIEDVRQIVRRVSNPFWLIREVLRYGEDCVVVSPDSVRSLVKEKLKTL CQHYNLEVSS" gene 9821..11962 /gene="cas3" /locus_tag="DP116_07580" CDS 9821..11962 /gene="cas3" /locus_tag="DP116_07580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872945.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I-D CRISPR-associated helicase Cas3'" /protein_id="PRJNA477356:DP116_07580" /translation="MPANYTVTLKPVYSRTVPTPESVKLPDGLSLSWHQVETLKALQD PNIDVVFNTAMTGDGKSLAAFLSAMTNRTYTLAMYPTNELARDQEKQVQGYKEKFQPK YEPQIHRLTAAILEKYVATGKLPSKLEGMENFSTLYEILLTNPDLFHYIHNFYYLRGK IDNLDRLFRRIDEKYKLFIYDEFHIFSSPQVASVINTILLMKHTANHQKKFLFLSATP NKLLEDCLRNAGIEPKIINPAAVGAYKFESEANNEDWRQISQPIQLSFPQGLEANLRS SYAWLEENAEKVILKFFQEYPNSKGAIILNSIAAVKKLLPKFKDIFEPRWQVRENTGL TGETEKSKSVAEADLLLGTSTIDVGVDFKINFLVFEAADAGNFIQRFGRLGRHPDFET YQAYALIASFLVGRLFEDKSHPLEDGETYDRITFTNAIRDSWVFKNQFEQYPKRWGGI QSAYIYCKLQSNPHMKEKYPGLAQKFGTDIQKALGISIKQMNAQFYRCQSEGKTKIID EARSFRGSSQLDCAIYDLTNPDEPEPERFKMYNLPGILNNFLFELWDETSFRKKAEDA GVITKQFEKALCYLKLRDYREVREDWHFYYPGNIRELAKTTKVQVLKDLEICQPHGYG IQQISDVVERRKFVCFISDRDTPLGVCPSGNRNYLRATLGLPMHFQAYPLTDEPEDRS PRYTITFGQDALLLETLIWHWKPKEDEGWIC" gene 12477..12725 /locus_tag="DP116_07585" CDS 12477..12725 /locus_tag="DP116_07585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872946.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07585" /translation="MPVRVKLKREEAKKRSRDYINNLEISVKGTIVYNGEEFEKPSPL AAKVNGGAANGWEYIEVKKDNQWICLDTLRKIWRNNND" gene 12718..12978 /locus_tag="DP116_07590" CDS 12718..12978 /locus_tag="DP116_07590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07590" /translation="MTRDNLLSRISINPNICSGTPCIRGHRIWVSLILDLLAAGETIE TIIEEYPGIEKEDILACIAYGAEMVRDYVVEIPIGTHKEAKK" gene 12975..13358 /locus_tag="DP116_07595" CDS 12975..13358 /locus_tag="DP116_07595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458808.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07595" /translation="MNIKLDENLGNLRVATWLRLAGHDVATVREQGLTSTPDEALIDI CCAEGRVLVTSDRGFGNRLKYNPSNYTGIVVIRLSSRSNFNDWREAIETLITGLEAAD VTGKLWIIRNGNIQEYQPIEPEDKD" gene 13358..16726 /gene="cas10d" /locus_tag="DP116_07600" CDS 13358..16726 /gene="cas10d" /locus_tag="DP116_07600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458809.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I-D CRISPR-associated protein Cas10d/Csc3" /protein_id="PRJNA477356:DP116_07600" /translation="MTEFDDDLPQEVPDFDVGEDEDEESPVKRELLTIRLFKEAVKKA KGNEGDRILESFADNILPNLIQQLAGATAKGGRFFETTVEMINAKRAAEGKKPVRRDN AGDQSIIAHLLNGLFPTYRILKKLQEHKETNPVKRTCEELQICIFIVSYLLHDYEKFP DYQAWLIANDEEGKFQNRDWEEDTPNKKDAPNFGRGYITKKILDFGLHHLLGEEWQDY IDDIIEISNNSGIKHDADLGLVTRGLKTLDDDRLDGRIRQVLIDLVSLSDLFASVIKH PIDVENGRLPTLVGRLSNHQLKLTYHSLSENRGVLTNILNNALIEAHSEEFYTPLLYL SDGVVYLAHADAPAITTDNIPERVFGKIKSLCAEKLKERQTGFNRDGKGLKFADYYWL FFDVVGLMEVSIDAASRLLPDTKSSSALKRGESLQTYQAQGELPSNLNLQFANEIRID RLAEFGDIICRGIWGGWCEKVNEWQKQQPKAKRKNLPDLDLTQKLAEYLGLSEEIPAI RHIQALKKTGGVPLDWYYLAAKYFQQHPGKDFTQILEVMKGMVNYAASFIQPILKEFP DIPDGWNDLKTYVSRVISLPTGAVSAPETAPFLVELQRYNAAKIIGRGRENVCAMSSS SYSVTEQMESATLFAPQVYSNRQILFNAQAAKRQICSISSIEIMLRQILMNQTNAVGA DFESRKYRYLYLYPTYFFTPETNKFLQKAYNQFSRTRFDAPLRKHFITENQVAKFRIQ DYQQVDSLLIKDNLQLDDSEALLLSADRTFKISFPEKETLTFFFIGLPPGREPTDTES WVTPAWLAFALPLILDVKVVASESPVPPFLSGADFEQTVFLDGEHQAIRSLIQQDNYR LDSILPRASGKREFSPLNALTAAYSIHLEVNRKKDGDPDWGKLADLARDLATSPLYVF HYLNKWLRKQKNLSSVPIAKVRLYLDLYYYFEPEGKSVNRLRELTSLYRRFYRAKSFY AKANAILKPIDEAADVILKVDKALAADTESLIDVVAARLSKLMNNVRRKAAEGKPTLT LVDGKWKPALTPEEERQAVSDFAKYFVETIFEGSFKGDRARLAGTQLNLIRDTCEYLY RLADDEERRNLSQEQPEEIPDLEADETAYACSTL" gene 16915..17922 /locus_tag="DP116_07605" CDS 16915..17922 /locus_tag="DP116_07605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138519.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_07605" /translation="MSPKVSIIVNCFNQGCYLERSVKSVLSQTFPDIECLIVDDGSTD NTRQVAEQLMNLDERVKYYYKENGGLPSSRNFGVEKAQGEWIQCLDADDWIHEDKTRF QLSYLEKVNLNDDTVFYCDYERVFLDANQNIVNTQENVIGSLTKDEFIQRLLIPDFLA DTPHPALQQAMLMKKSILSKTKFPEYLKALGDRYFAVAILMAGANFVYTPMIGTYYTK HQSNRTNSWNYMKNYYIIFYENILKNYPELNNCCQSGLEFFLEEAIMEKEEDDFERLL KIVPMPVRLLNKKITIKNKKQLKVFHTIRKILPSFLLYEKYRGNRTNKIISMLFSKIK L" gene 17943..19094 /locus_tag="DP116_07610" CDS 17943..19094 /locus_tag="DP116_07610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015177233.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_07610" /translation="MKFSLIVSDLSGGGSVRAFLLAQVLKKLNHEVEIIGFLFGKELY AIPPNGIKIVSIPGKNYPEFFSYSRQVLEKIDGDIIYAVKPKVTSFGLSLLKKISSRR PLLLDMDDWELSWYGGMNWKYRPSLKQFARDILKKDGALRFPDHPLYIQWMESLVQKA DAVTIDTQFLKERFGGIYLPNGKDTAMFDPSMYDSKASRIRYDLDDYRILMFPGAPRP HKGVEDVLMALDRLNQSDLRLVIVGGSPYDDYDDKLIQRWGRWIIKLPKCPVEVMPEV VAAAHIVVVPQRDTIIARAQFPLKLTDGMAMAKPVLSTRVGDIPEILDETGYLIDPGS PEQIAEQIDFIFENLESANERGLKARERCIKKYSLETMASTLESVIAGL" gene 19311..21554 /locus_tag="DP116_07615" CDS 19311..21554 /locus_tag="DP116_07615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873806.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_07615" /translation="MSTSKLLLRFAKPYPVLTLLTILLGFSGALFNGVSTALIVPVIL RIVGQEVDFSGAPAILKKLMSPFDNIPENYLLGVMAGTIIFTILLKNLATYASTLASS TLTRKVTSDMRETGLILLLQIDLSYYAKMKVGDIINSLGGEISRAASAVGNTIKLIIL GITILVFVGILLSISWQLTIAATVLLSLVTLINQYAITRAKQFGKQLSDMSRAYSIAI LETLNGIRLVKATGSEQREYQRIRKLIHEREKADFKSQVHSEAIAPLSEVMGVTAIIL IVFLSRTFFANQIVLLSTVLLTYLLVLLRLLPFISQLNTLRSSFASTATSVDTVAEFL SWENKPLMSNGSNVYTKLKKGVRFESVSFSYPGHEKLVLKDVDLYLPQGTTLALVGGS GAGKSTLADLLPRFYDPTSGCITIDDIDLRDFDLLSIRQAMGIVSQDTFLFNDSVWNN IAYGRQSATKEEVITAAKQANAYEFISELPQGFKTIIGDRGVMLSGGQRQRLAIARAL LQDPEILILDEATSALDTVSERLVQEAIDNLSRDRTTLVIAHRLSTVQKADQIAVLDQ GSVVEVGTHEQLLQKGGYYSRLYAMQFADKAQTATKRQQNLVRISVEIRTRLQSIIGS LRLLLDNRQNNPQQREKFIEESYKSGLKVLNSIDVLEDIVHLQTNWTLSAAQPNHSLA TPNQNLIIICNQFRMTLEPILNSLNSLANDSRNTPQSQHKFIQEAYQALTRLLENLDH FEDNLKF" gene 21559..22770 /locus_tag="DP116_07620" CDS 21559..22770 /locus_tag="DP116_07620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308845.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase family 1" /protein_id="PRJNA477356:DP116_07620" /translation="MKNTTLLKNYVFFLTEEIPKPEAHIIVSANAANAAANLGYSSVL AYPRKGLAAFNPIHLARPFQPRKTPEALVKYYNIQEKLKVAPLPMPWPIDYINSKFTD SNTIATKYYFPFHILSTTKLVHAWNWNFIKAAVKNGVPAIYEHHHYEDKQFEPEIVNH PLFQVAATVTDTVREHMIQHGMPPEKVITVHNGYNSSFIIRQPEKIAEWRKKLLKDER SHLVVYAGALRKFKGIDILVDVAALMPNIQIVCAGGDEKEVAHYQQLARDLQVNNITF LGYLLHKDLPSLLQAADILAHPHCSGQAATFTSPLKLFDYLTSGNPIVATEIPSLMEF KNTNAIAAWCEPDSPSKFAEAIAQVLKTHPRKVEGYQDIINFVKQFSWENRAAKILSY VDESLRPPLIA" gene 22814..22993 /locus_tag="DP116_07625" CDS 22814..22993 /locus_tag="DP116_07625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002785997.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07625" /translation="MLTSYQLSVISYQLSVISYLRSTSYQVGNGLVHPLFTVYCSLFP VKSSLKVRICDVTKY" gene 22972..23262 /locus_tag="DP116_07630" CDS 22972..23262 /locus_tag="DP116_07630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213383.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase subunit beta" /protein_id="PRJNA477356:DP116_07630" /translation="MRRDEVLTILQQHWTVLKNFGVRSLSIFGSVARDEARSDSDVDI LIELEPPLTFDRYMEIKFYLEDQLGTKVDLVSWRSLKPSIRDVVEKEAIRVA" gene 23252..23605 /locus_tag="DP116_07635" CDS 23252..23605 /locus_tag="DP116_07635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016953654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotidyltransferase" /protein_id="PRJNA477356:DP116_07635" /translation="MSRNIRFYLEDIVGCCAKVLRYTQGITFEQFIVDEKTFDAVARN LQIIGEAVKNIPVEMREVAPEIEWRKIAGLRDILAHTYFQVENEIIWDIVQNKVQPLQ EQIQQLLESEFGDSV" gene 23797..24768 /locus_tag="DP116_07640" CDS 23797..24768 /locus_tag="DP116_07640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138524.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_07640" /translation="MSIKTIGMISSYPGLDQKPDWLWQQTPHPFGVWGDIQIHSTAPK PDFLLMYNYTSFPEPPQKQLWFWKNRKAQLEYEKAQQTLQNKLINIPKERVIYLLREP PLDEVVELNQKFYQQAQAYCGYISGPDDFAPTPNYMPAIWYYSNSFRELNDMPPPEKI RPCSWVTSGISRTVNHRQRLEFLKLLRNSELKFDLYGRGLPEWAQGGGELSNKWYGMA PYYYNLAIENYADNNWYVSEKLWDALLAWCLPIYYGGPAADKLLPPGSFLRLPSLDEK GLLYIQEVTATTDAWYAAKSAIAEAREIILHKLNLLNWLSDYVKHCS" gene 24802..25824 /locus_tag="DP116_07645" CDS 24802..25824 /locus_tag="DP116_07645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308843.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="PRJNA477356:DP116_07645" /translation="MTEGIYVLANDIVFDQLVAFLNSIEANAGTNYGSNTGKNYPVCI IPYDNRLEKVKDEIKNRNNVEIFADTAAIARWEDFATQIWQTHPSAFQTWEQNGISGV YRLGMHRRFCGFDGPFDKFIYFDADILVLNSLDYIFQQLNQNDFVVYDFQHKDAAHVY NVKSNQLLNVFPLARIDSEIFCAGMYGSKKNIFHQEKRNEIISQLKQDQAEILYMNAP DQTILNYMVMKSGISSYNFAHHLPENERTGCCVTSPHFEARDNILYDKGHPLTYIHYI GLSSQLFTRVCSGENIDFPYREIFLHYRYLHEPDKRPKFKSKPKAYNAPPSLATRILR KLGLAG" gene 25835..26767 /locus_tag="DP116_07650" CDS 25835..26767 /locus_tag="DP116_07650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308842.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methionine synthase" /protein_id="PRJNA477356:DP116_07650" /translation="MNRGIYIIANDKVTDHAIALLNSIRLHDTETPIVMIPYDDNYHN IADTLEKYYGVQIYEDLDFIDRLSIKLHELFGNKFFARPNQFRKQACWFGPFDEFLYI DTDIVVFEKIIENLNYLNQYDFICCDYQHAGGITNVFTPKVLEEKVFTEDEVKDIFNG GFWGSKKNLISEQDLYETFAECAAHPEYFDFSQKTSDQPIINYMLLKRIPRRFNIVRR EGKAPGNWGGSHHFQTQGNILIDPKVNQPLQYLHWAGIRIEPGCPYWEIWEYYRNLNP ELTPAVFPQKPKKSQWEQTLENLKNQLRKMKANL" gene 27034..27309 /locus_tag="DP116_07655" CDS 27034..27309 /locus_tag="DP116_07655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126990.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF485 domain-containing protein" /protein_id="PRJNA477356:DP116_07655" /translation="MDDRTKAIQALTAQRWRVSLMLSGAMMFIYFGFILLIAFNKPLL GSQIIPGLSLGILLGALVIVSAWILIFIYVRWANNNYDDKIARLTRK" gene 27389..28963 /locus_tag="DP116_07660" CDS 27389..28963 /locus_tag="DP116_07660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410933.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cation acetate symporter" /protein_id="PRJNA477356:DP116_07660" /translation="MNSLWCYLPLAEVDIKNLGHFNPLAIFFFFVFVASSLGITYWAA KLTKSTSHFYTAGGNISGFQNGLALAGDFMSAASFLGITGLVALNGFDGLIYSIGFLV GWPIVMFLIAEPLRNLGKYTFADVVAYRLRQAPVRIASAFGSLAVISFYLIAQMVGAG ELIKLLFGFDYELAVVIVGCVMMAYVIFGGMIATTWVQIIKAILLLGGTILLAILVLA KFGFNPMALFSSAAAKYGAGVLAPGKQVSDPLDAISLGMSLMFGTAGLPHILMRFYTV PDAKSARYSVTYATALIGVFYLLTFILGFGAMVLVGQDAIKQIGTGGNMAAPMLAEFL GGDAFLGFIAAVSFATILAVVAGLTLSGAAALSHDLWVNVVRGGHANETEQLKVARFA TMMLGAVAIVLGILFKGQNVAYMVGLAFAIAASANFPALLLSMLWRRFTTYGAVASML VGTLSSLVLIYFSPTIQVTILKHASAPFPLKNPGLISIPLAFLVGIVVSLLASEREAQ EKFAEVENRIHIGFNG" gene complement(29270..29419) /locus_tag="DP116_07665" CDS complement(29270..29419) /locus_tag="DP116_07665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020092723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07665" /translation="MKRPIAVLVGKDGGVKRRETTPVQAKAIFDEIDAMPMRRQEMRE RDKSV" gene 29388..30197 /locus_tag="DP116_07670" /pseudo CDS 29388..30197 /locus_tag="DP116_07670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314316.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="phosphatase" gene 30260..30580 /locus_tag="DP116_07675" CDS 30260..30580 /locus_tag="DP116_07675" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07675" /translation="MNKFYIKKPSCFKKLGFFFGVKLLSRYDNFEKFLLDINTNITAI VVTSKALYYLVRSHKSIFVRKMETSGCRYFIFVSSPFKSRKPAIFWQEKVFLLNFEEI IINM" gene 30659..31450 /locus_tag="DP116_07680" CDS 30659..31450 /locus_tag="DP116_07680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879576.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class II aldolase/adducin family protein" /protein_id="PRJNA477356:DP116_07680" /translation="MLIQSVDPSKLSKGELPHPPEFERVEDERLHRKQRLAAAFRLFA RYGFDEGVAGHITARDPEFHDHFWVNPFGMYFGHIRVSDLVLVNHKGEVVEGNKPVNA AAFAIHSQIHQARPDVVAAAHTHSLYGKTWSTLGRLLDPLTQDACAFYEDHALFEDYT GVVLDLEESQRIANTLGLKKAAILRNHGLLSVGHSVDEAAWWFITMDRCCQSQLMAQA AGKPVLINPDTASLTYRQVGSHFMGWFNFQSLYDMIVRQQPDLLD" BASE COUNT 9232 a 6377 c 6704 g 9241 t ORIGIN 1 cagcttaagc tttgtgagtt ttcttacaaa gacccctact tttcaaacat cctcttaggt 61 attccccaat cttaggaacg tgatggtgta gagcgccatc acgtttagcg ccctaataga 121 ggattaatac agttgatttt tttatatgta attttgacat ctttttatct aaattgctat 181 gtattgaagt aaggcgacaa agactgaaaa ctttatatag taagagttac aagacggaaa 241 aataacacat gagacagaat tatcgagtta taagtaagat ttttctacca tctcaagggc 301 tataggatag ttaactagag ccatatggaa agcaaatcct aaattgctcc tagtgttttg 361 ctgagtaaac ttggcagggt aaaggagaaa gatgtcttat cctttaccct ttccccaacc 421 catacccaag ttttggtttg gcagactact aacctatggc gtcaagaaca tccaatgatg 481 aaggaaaaac aaatcatgat tgacttcagc ctcactcttg aacagcggat gttgcagtcg 541 aaagcgcttg actttgccca aacagagatg aaacctattg ttcaaatcat tgaggaatcc 601 gataatccaa aaattgaacc ttgggatttc tgtcaaagtg tgttccacaa aggaactgaa 661 ctaggattta cgtcattatt aatcccaaaa gagtacggtg gtttgggtgg aaagtgcgta 721 gatctcgttt tggttctgga agaactaggt gcagttgatg tcagtattgc gtctagttac 781 ttcaatctga ctgctgcaat gtcgcttttc gtgactaggg ctggaacaag tgaacagcaa 841 aagcgaatat tgtcgcatgt tcgttccggg gaacctcact tgtatagtgc tgcagagagt 901 gagccaaacg 
tcgccacctc agatttgttc tgtccaatac cagatcctaa catcggactt 961 aagacttttg cacaacgcga tggtgatgga tacatcctga acgggaaaaa gtcatcgtta 1021 gtcacaaatg ctggtatcgc ggatgcatat ttcatcattg cacgcactgc tctcgataag 1081 cctttgggcg agagtatgtc tatattctat gttccagcaa atacccctgg tctgaaattt 1141 ggtaagagaa ccgagatgat tggctggaaa ccttctcatc atgcagaaat tcatcttgat 1201 aacgtacgag tacctgcaga aaatcttctt gggaaagagg gggaggcggc aaaacttctc 1261 atgttgcttc ctgaggtggc tattggactt gcggcttcct acgtcggttt ggctcgcgct 1321 gcatatgagt atgcgctaaa ttatgctaaa aagcgagtca gctgggggcg tccaattatc 1381 cagcatcagt cagtcgcttt gaaacttgct gatatgatga taaacacgca ggctgcgcga 1441 ttaacggtat gggaggcagc aaatactgca gacactaacc cccaactcgc tgcaatggtg 1501 aaggcaccag cagctaagac ttttgctgtg gatgtggcaa ttaaaaacgc gcaaacagca 1561 gttgagattc ttggtggcta tggagtgact aagcaggctc aaacaggtaa gtttcttgcc 1621 gacgcctcta ttggatattc atgtgacttc acaagggaag ttttgcgtct tggacttgtt 1681 aactgtttgt agcttggtac acaccggtag cagttataac tctggcggcg gttgcgacag 1741 tggaggcttt aatggcggtg gcggcgtttg tgatggtagc ggcttttgag atgagccttt 1801 caaattccga gaacgcacac gcacacttcc cgcttgacaa tctcgcaacc aagcagtaat 1861 cccttgggca ataacaccat tactactatc tcgtggtggt gtcggtggct gattcttggg 1921 agacaaacca atttgggaaa acatcatcac gaaacgccaa tcactttttg tttttgtgag 1981 aaacagccag tggaattgct gcaattgaaa tgctttgcca gcagtatact gtcgctctaa 2041 agtcgtaaaa aagacttgct ctactccaac tgaagcagtt ttgactgaat ctgccgtgta 2101 tccactaggg ttaagaggta gcggtgtaaa ctcaggtctt cccgccacta ccatataact 2161 gtaaacgtca gttgcccttc ttaagcgacg ggcgcgttga cttgctctgt tagcataatt 2221 tggtaaatct tgcaatagtt gagtcgttag tgcttctaga ttttggtttg agcaaaaaaa 2281 cctctcccca ccaacgactc tcctctcggc ttttggagag gcgctagggt gatggttctg 2341 tgcttttgcg gcgtcattta gagaatttga gaataacagc cagaagctac aacaaagaag 2401 ccacaagctt tttttctttt ttacgactcc tgacttcatg tcaccgccac taattcctca 2461 caaatcctct tccaggctaa ctctggatca gcagctgcgg tgatcggacg cccaatgacg 2521 aggtaatttg ctcctgcagc aaaagcttgt gcaggagtga gcgatcgcgc ctgatctcct 2581 ttttgggccc aagttggacg 
tacccccgga caaaccagca aaaaatcatc tccacaactt 2641 tgtcgcagtt gcgctacctc aacaggagag caaacagccc catctaaacc catctcttga 2701 gccattagag ccatttccaa agcatattct ggtaattcta gaggtatttt caactcaaac 2761 gccaactgtc tggaagaaag actcgtcaac agcgtaatcg caattaactt tggcggtttc 2821 acacctgcat ctgctgctcc tgcttgtacc gcttcagttg ctgccctcag tgcttcatta 2881 ccagatgtcg catgaattgt caataaatca acgccataac gagcagccgc acgacaagcg 2941 ccagcaacag tgttggggat atcgtgaaac ttcaaatcta agaaaatacg tttctgccga 3001 gattttagca cctccagaat ttttggtcct gtgcttgtaa ataactccaa gcccaccttg 3061 aagaaagtga ctgacgtaag gcgttcgaca agggcgatcg ccgcttgttc atccgctaca 3121 tccagaggta cgattatttg ctcttgggcg ttcatcagct tttctcctct acggtatcag 3181 tcggacaaac ttgttctttc ccagctgcaa aacacgtcct tgcaattggg ctgaatcagt 3241 gaaagtcgtt tcaatatcag tgatgcgatc gccatctaag cgcaccccac cttcttgaat 3301 tttccgttta ccttccccgc tacttttgca caagccactc acattcagaa tgtatgccag 3361 ctttgcagga aagtggacat cagctaggga aaactctggt acgtcacccg cctcaatatc 3421 cttgatagct tgttcgccat gatattgctt gacgatttct tgtgcgagta gtttttggcg 3481 atcgcgtgga ttttccggta gcttatccaa tggaaaatcc gtaagtagtg taaaataatc 3541 attcaggaga tggtcgggaa cttgttgcag cttttgatac ttctgagttt ggtgttctaa 3601 caaaccaaca taattaccta aagacttgga cattttttgt acaccatcca agccaatcaa 3661 aattggcagc aggactccgt actgtggttt ttgtccaaaa tggcgctgta aatctcgccc 3721 gacagcaaga ttaaattttt ggtctgtccc tcctaactcc acatctgcct taatagcgac 3781 agaatcatag ccctgcatca gcgggtacag gaactcatga aggaaaatag gattctcttt 3841 cttatagcgc tcagcaaaac cctccttagc gagcatctgc ccaactgtca ttgtcgagag 3901 taactccaaa atttcgccca ggttcaactt cgagagccat tctgagttgt aacgtatctc 3961 caaccttcct ggtgtatcaa agtctaaaat aggtcgcacc tgctcaaggt aagtctgggc 4021 gtttcgcgcc acatcttctt ccgtaagttg gcgacgtacc tcagattttc cagtcggatc 4081 accaatgcga gcagtaaaat cgccaataat cagtactgcc gtatgaccag catcttgaaa 4141 cgctcgtagc tttcgtactg gtatactgtg accgagatga atatcgctac ccgttggatc 4201 aatacccaat ttcaccctta gaggtcgctc agttgttgct aaacgctttt ctaaactttc 4261 aacgtgagta tcagaatcag tcggttgtgg 
gaaaatttca acaataccac gacgcagcca 4321 agaaaaatcc tgcgtcatac tactaggaga ttgactgtta actatgcttt gcactatcaa 4381 agtaggattg gttatggtca ctgacaattt gtatgacttt cgtttgccaa actaatatca 4441 ttgcaaaaaa ttaaccgcct tgcttgatcc aagacattac actatatatt tactagtgag 4501 gaagtaaaac caccgtgtcg tcaaggactt ttgaagataa gcagccacaa cctcgagcgt 4561 ctggctcagg ttttgagttt cttaaaggag tcggccaggt agctggcggt actctacttt 4621 ccgccacgat gctgacaagt tctattgtag cgggaggact cgttggctta gcggtgagtt 4681 tccgcaactt gccagatgtg agacagttac gcagcttttt gccatcagaa acgacttata 4741 tctatgacat caaaggtaaa ctgttaacga gtattcacgg agaagccaac cgtgaagtga 4801 tgccattaga taggatttct ccagatctca aacgagcagt attagcgagt gaagacagcg 4861 acttctacta tcaccacgga attaatccca aaggcgttgg acgtgcagta gtcaccaact 4921 ggtcagcagg cggcgtgcga gagggcggtt caaccatctc catgcagttg gtgaaaaact 4981 tattcttgtc tcacaagcgt gcttttaccc gtaaaatagc cgaggcagta ctggcaattc 5041 gcttggagca aattcttacc aaagaccaaa ttttagaaat gtacctcaac caagtttatt 5101 ggggtcataa caactatggt gtacaaacag cagcacgcag ttactttaac aaatcggcag 5161 aatatttaaa cttggctgag tcggcgatga tggctggttt aatccaagcg ccggaagact 5221 atagcccctt cattcacatg aataaggcaa aagagcagca gaaaatagtt ttaggtcgaa 5281 tgaaggaatt gaattggatc acgcaggaag agtacgataa cgccctgaaa caaccaatca 5341 aactcggtaa aattagatct ttccaaggaa gcgccttgcc ttacgtgaca aacgctgtag 5401 cccacgaaat cgccaaaaag tttggtcgtg aggcgctgct caagggtgga atgcgaattc 5461 aaaccacagt tgataccaac ttccaaaaca tggcggagga cactgtcaaa aactggcatg 5521 atattttgag ggggcaaggg ttatctagaa accaaattgc tctggttgcc attgatcccc 5581 gtacacattt tgtcaaagct ctagtaggtg gcgtagattc taaaaccagt gagtttaacc 5641 gtgcaactca agctttacgg caacctggct ctgcttttaa gccgtttgtt tattacactg 5701 cctttgcaac tggtaagtac ggtccagata ctacggtcta cgatactcct gtgggttatc 5761 gagatggtga cggatggtac tatccacgca actacgatgg tgggtatggc ggagctatgt 5821 caatccgcac tgcgttgatg caatctcgta atgttcccgt cataaaggtt ggtaaagctg 5881 tgggaatgaa taaggttgtc gaaacttgcc gcaccttggg cattctgagt ccgatggaac 5941 ctgtaacttc tctgccattg ggtgctattg gtgttacgcc 
gttggaaatg gcaggcgctt 6001 atgctacctt tgccaactat ggctggcaat ctccaacgac aattattgcc cgtgtgacgg 6061 atagtagtgg caacgtctta cttgacaaca cccctaaacc tgtacaagtt ctcgatccgt 6121 gggcatcagc agcaattgtt gatacaatgc gctctgttat taatggaggt actggtaaaa 6181 atgctgcaat agatcgccca gcagcaggta agacaggaac aacgtcctct gagaaggata 6241 tttggtttgt cggaaccgta cctcagttaa caactgctgt ttgggtcggt cgggacgaca 6301 acagaacttt ggcgagtggt gcgacgggtg ggactatggt tgctccaatt tggcgcaatt 6361 ttatgatgaa ggcgcttaag gatgtacctg ttgagaaatt caagtcacct taccagtttc 6421 ctcgacccaa atcaaattaa agttaagaaa gttaagaaag ttaaaactga agagtttggt 6481 gtcaaatcca aactcctcat ttttgctctc atgattgttg ttccagttcg tttttcatcc 6541 gctgtagggt gaggtccatt tggtcaaaca tttgttgcgg agttacacca aactgactta 6601 actgtgtctt aagctgctct atggtcattt gtgccatgaa atcttctgat agctcaaaac 6661 gcttcataaa gacccgatag cgatccatca tggcttccat ctgctcaata aacagctttt 6721 tgccctcgcg atcaaatttg ccatagttac cgccaagctt gataagtgct tgataatctt 6781 caaacagttg cttggcttct tgctgaatta tgtcagaatc aaaaaatccc attttggtta 6841 caccccactg agtgatcaga ctcagtagct atcttaattg atgcctcttg tttattctag 6901 tttagggaca taatctagaa aataaacctc ggtttgagta cggtttttta ccgcaatttc 6961 aacacttgac tcacgcgacg cttgaattga gaatttgaag ttcaaaattc aaaaaacagc 7021 gagacacaca gtgcgacttc cttggtatgc tttttgtttc cccaacttca ggtatttgct 7081 gttatagcat ttcacgcttg ggtctgatac aaatgcagag cacacggacc acccgagcag 7141 gctttggggt agttcataag ataggattca gttacgataa cagcaatgct gaatttatta 7201 tggtgctttc attatgttta atcaacgaaa aagccggagc attgccgcaa ttttagcttt 7261 gtctggtaca gtgacaattt ctggattaca taagttttat ttaggacagc cactatgggg 7321 tgtgctttat ctgttgcttt cttggacacc cattcctaag gtagcgagtg ctattgaagg 7381 agtttggttt ttagcgcaag acgaagaagc ttttgatcgt cattttaatt taggtaaatc 7441 agccatcaaa aactcacagt tcgccagcaa tcaagtcagt acggttgctc aaggcttgcg 7501 agagttagaa agtttacgtc aagatggatt gatttctgag tatgaatttg agcaaaagcg 7561 ccgccagttg ctagatcata tttcctaaac caagcaacac tatgaaaaat tggctacctt 7621 taaactctaa gttacaaaaa ctacgctata agttactgaa cgacccatat 
tatcgactac 7681 aatcagcaga agaaattgcg atcgccgcgc aactgggtat ccacattgat gccaatcaag 7741 caactgttga tgattggtta cggctaccag ggttatcgat tcaccaagcg cgatcgcttg 7801 tacaactttc tcgcgctggt gtcaaatttt actgtattga agatattgcc gcagccttaa 7861 gtttaccagt acaaaggcta gagccactca agccaattct gaatttcagt tattacgatg 7921 atgaatcttt agttcttgat agccataaaa tcaaccctaa cacggcaaca gttgagactc 7981 tagcaaaagt gccgtttatt gatttatccc tggcgcaagc agtggttgaa aatcgcagtt 8041 ccgcggggac ttacctgaac ttagctgatt ttcagcggag gttaaatatt tcgggtgaaa 8101 ctatttccca gcttatgtac tacctacaat tttaatctga aagtgcatgt gaggcagcgc 8161 gcgctgcagt ccaagttttt cggtgcggcc agtcgcctag ccgcgctatg gcaacagcca 8221 tagggctacg tctgtagaat tagattaccc caatcctatc aaaaggccaa aagcggaaca 8281 ctgcccgacc aataatattt tctttgggta aaaaccccca gtagcgggaa tcgttactat 8341 cgttgcggtt atctcccata acaaagaatt cgccaggagg aacttgccgc tcttgcatgg 8401 gtaaaattgg tggctcagct atataatctt cttgtattgg ttgcccatca atgtaaactt 8461 ttccaccagc aacactgatt ttctttccag gagtcccaat aatacgctta atgaaagctt 8521 ggtctttggg atatcctcga tgttgtagtt cttctggtgg ctgaaaaaca acgatatcgc 8581 caaactgtgg aggatgaaac aaagaggaaa ttttttccac aaccaggcga tcgcctttat 8641 gcaaggttgg taacatcgag tctgagggaa tgtagcgggg ttcagcaaca aaagtcctaa 8701 tcaagagtgc taagcataag gcgatcgcaa tcaagataag attttcccgt aaagttcgcc 8761 aacctcgcga cgatgcaggg gattctttcg cttcactttt ctgatgagtc atagaaagct 8821 gattaataag ttagacgaat tctagattat tttgatccta aaaggacatt tgttatggct 8881 taggaagaaa cttccagatt gtaatgctga cagagggttt tgagtttttc tttgacgagc 8941 gatcgcaccg aatcaggcga caccactaca caatcttccc cataccgcag tacttctcga 9001 attaaccaga agggattcga cacgcgccgc acgatttggc gcacatcttc aatgacttca 9061 ttatcaatgt cgtctggttt tgattcatag gcttttacca ttccacgata taagtgtaag 9121 tgtactttca aggaatctaa gccctcatgc cgccattgtc cgtttattgg cacaacacct 9181 ttgattctat ctagacgcaa acagcgatta tgtatgagtt ctgggaagcc cggatttttg 9241 atatcgtcag tctcatcgca ccagatgttt aagtaaaatc gcttctcttc aaaggagatt 9301 tcagcaaaac gtactgtaaa tgataagtcc tctccttgag gatttccgta gagtaggtga 
9361 aacggttgtt ggtttcctcg gagttgatcg acaagaatgc gccatgcttc actgggttgg 9421 ctgacttgtt gtagcagtga ttgacgcata ggaggttcaa gttttcctcg ctctaagaca 9481 agactagaga tgatttgagc ttgttcaacg caaccagaat cctttagcaa gccgatggct 9541 tgttgaagtg ctgctacttg tgttgggtta agtgcaaagg gttcgcctac ttcaaattct 9601 tgttgggcga tcgccaccaa caaccctgaa acactgggag attttcccca gaatataccc 9661 aggcgacgag cgatttcttc tagatgctct ttggttcctg atggaattga cagtgtaatt 9721 gtctctttgt tccgcttcat tgacaatttg tatttataaa attgacattc actggtttat 9781 aaggttattg tagctgtatt gaaggtgcaa caaaaaggat atgcctgcaa actacacagt 9841 aactctcaag cctgtttact ctcggacagt gccaacacca gaaagcgtga aactacctga 9901 tggcttgtcg ctttcttggc atcaagtcga aaccttaaaa gcgttgcaag atccaaatat 9961 tgatgttgtc ttcaatacgg caatgacagg agatggtaag agtttggctg cttttctctc 10021 tgcaatgaca aatcgtacgt atactttagc aatgtaccct acaaatgaac tggcgagaga 10081 tcaggaaaaa caggtgcaag ggtacaaaga gaaatttcag ccaaagtacg agccgcaaat 10141 tcatcgccta actgcggcga ttttggagaa atacgtagct acgggtaaat taccttccaa 10201 actagaaggt atggagaatt tttcgacttt atatgaaatt ttgctgacaa accccgacct 10261 ttttcactac attcataact tctactatct gagagggaag atagacaacc ttgatagatt 10321 atttcgccgc atagatgaaa aatataagct atttatttac gacgaatttc atattttttc 10381 atctccacag gttgcaagtg ttattaatac gatacttctg atgaaacata ctgctaatca 10441 tcaaaaaaaa tttttatttc tctccgctac tcctaataaa ttattagaag attgtctcag 10501 gaatgctgga atagagccaa aaataattaa cccagctgct gttggtgctt ataaatttga 10561 gtcagaagct aataacgaag actggagaca gattagtcaa cctattcaat tgagttttcc 10621 tcaaggacta gaagcgaatt tacgttctag ttacgcttgg ttagaagaaa acgcagaaaa 10681 agtcatttta aagttttttc aagagtatcc aaacagcaaa ggagcaatca ttcttaactc 10741 gattgcagca gtgaaaaagc tattacctaa attcaaagat atttttgaac cgcgatggca 10801 agtacgagag aacacaggtt taacgggaga gacggaaaaa tctaagtcag ttgcagaagc 10861 agatttactt cttggtacat ctactattga tgttggtgta gactttaaaa ttaatttctt 10921 agtttttgaa gcagctgatg ccggaaattt tattcaacgc tttggcagac taggtagaca 10981 tccagatttt gaaacctatc aagcttatgc actcatagct agctttttgg 
ttggaaggtt 11041 atttgaggat aaatcccatc ctttagaaga tggagaaact tatgacagga taacttttac 11101 aaacgccatt cgtgactctt gggtatttaa aaaccaattt gaacagtatc caaaacgttg 11161 gggtggtatt caatcagctt atatttactg caaactgcaa agcaaccccc acatgaaaga 11221 gaaatatcca gggttggctc aaaagtttgg taccgatatt caaaaagctt taggaattag 11281 tatcaagcaa atgaatgcac agttttatcg ctgtcagagt gaaggaaaaa cgaaaatcat 11341 tgatgaagcg agaagttttc gtggaagtag ccaattagat tgtgcaattt atgatttaac 11401 gaaccctgat gaaccagaac cagaacggtt caaaatgtat aatcttcctg gcattctcaa 11461 caactttctt ttcgagttgt gggatgagac aagctttcgg aaaaaagctg aggatgctgg 11521 agtcataacc aaacaatttg aaaaagcttt gtgttatttg aaattaagag attaccgaga 11581 agtgcgggaa gattggcact tctattaccc aggaaatatt agagaactgg cgaaaacaac 11641 aaaggtgcag gttctcaagg atttagagat ttgccaaccc cacggctatg gtatccagca 11701 aatcagtgat gttgtggaaa ggcggaaatt tgtgtgcttt atttctgatc gcgatacgcc 11761 tttaggcgtc tgcccttcgg gcaatcgcaa ctatttacgc gcgactctcg gactacctat 11821 gcactttcaa gcttaccctc ttaccgacga gccagaggac agaagtcccc gctacaccat 11881 tacttttgga caggatgcat tgttgctgga aactttaatt tggcattgga agccaaagga 11941 ggatgaagga tggatatgtt gaaaaacttg tggagaaagg gcttttgccc gaaatcagaa 12001 ggctacagtg tttcaatccc taatgggtgt cttttgagta tttatactcg tgcagtactg 12061 cactgcgttc ctttctcctt atatatatag tacacgttat tgtcctcggc aatattcctg 12121 tggctacaac aactttttaa tcattagggc tacagtaact gttgattttt tgatgagtga 12181 atggcaagat agtataaaca tacacctcag gtaggattat tatgcacatt catctcaacg 12241 atgacgatat ctctcggatg ccagaacagt tacgcagctt gtttttagat tggcttcccg 12301 aatgcctgaa gaccaaaaat tcgcgatatg aactagtagc atttcgtcag aagagccaac 12361 ctcaggtatc tcttaagcag ttagatatct ttgaaagtca gactatggag gagaaggcgg 12421 agcactcaca cgtaagactg actcagctat ttgacgcagg tattaccaag gctgggatgc 12481 ctgtgcgagt caagctaaag cgggaagagg caaaaaaacg aagtcgcgac tatataaata 12541 atctggaaat ttctgtgaaa ggaactattg tctacaacgg cgaagaattt gagaagccaa 12601 gtccattagc tgcaaaagta aacggaggtg cagctaatgg ctgggagtat atcgaagtta 12661 agaaagacaa ccagtggatt tgtttagata 
cattacgcaa aatctggaga aataacaatg 12721 actagagata acctgctttc ccgaatttct atcaacccga atatttgttc tggtacacct 12781 tgcattcgtg gacaccgtat ttgggtttcc ttaattcttg atcttttagc tgctggagaa 12841 actatagaga caattataga agagtatcca ggaatagaaa aagaagacat tcttgcatgc 12901 attgcctatg gtgctgaaat ggtaagggat tatgttgtcg agattcctat agggactcac 12961 aaggaagcaa aaaagtgaat atcaaattgg atgaaaatct tggtaacctg cgagtggcta 13021 catggttacg tttagcagga catgatgttg caacagtcag agaacaagga ctgacttcaa 13081 cacctgatga ggcattaatt gatatttgct gtgctgaagg tagggtatta gtcacttcag 13141 atagagggtt tggaaatcgc ctgaaatata atccctcaaa ctatacagga attgtagtaa 13201 ttcgtttatc ttcacgctct aactttaatg attggcgtga agcaatagag acattaatca 13261 ctggattgga agccgcagat gtgacaggaa agttgtggat tattagaaac ggaaatattc 13321 aagagtatca accaattgaa ccagaggata aagattgatg acagaatttg atgatgatct 13381 tccccaagaa gttccagatt ttgatgttgg agaagatgag gatgaagaat cgcctgtaaa 13441 acgagaactg ctgacaattc gcttatttaa agaagcggtg aaaaaagcga aagggaatga 13501 gggcgatcgc atcttagaaa gctttgcaga caatatctta cctaacctca ttcaacaact 13561 agcgggagca actgctaagg gtggtaggtt ttttgaaaca actgtggaaa tgattaatgc 13621 taaaagagca gcagaaggta aaaaaccagt tcgtagagat aatgcaggtg atcaatctat 13681 aattgcacac ttactgaatg gtttatttcc aacctatcgc atcttaaaga aactacaaga 13741 acacaaagaa acaaacccag ttaagcggac ttgtgaagaa ctgcaaattt gtatatttat 13801 cgtttcttat ctgctgcatg attatgaaaa attccctgac tatcaagctt ggttaattgc 13861 caatgacgag gaagggaaat ttcaaaatcg ggactgggaa gaagacacac cgaacaagaa 13921 agatgctccc aattttggac gtggctacat taccaagaag attttagatt ttggtttaca 13981 tcatttactt ggtgaagaat ggcaagacta tattgatgac atcattgaga ttagtaataa 14041 ttccggtatc aaacacgatg cagatttagg tctcgtcacc agaggattaa aaaccttaga 14101 tgacgacagg ctagacggca gaattagaca ggttttaatt gacctagttt cactttcgga 14161 tttatttgct tcagttataa aacatccaat agatgtagaa aatggtcgct taccaacctt 14221 agttggcaga ttaagtaacc atcagttaaa gcttacctat cactcacttt cggaaaatcg 14281 tggtgttctc accaatatcc tcaacaatgc tttaattgaa gcacattctg aagagtttta 14341 tacaccattg 
ctctatctgt ctgatggagt agtttactta gctcatgctg atgcacctgc 14401 aattacaact gacaacattc cagagagagt ttttggaaaa attaaaagtc tttgtgcaga 14461 aaaacttaag gaaagacaaa caggctttaa tcgtgacggg aagggtttga agtttgctga 14521 ttactattgg cttttctttg atgtcgttgg cttaatggaa gttagtattg atgcagcttc 14581 taggttactt cctgacacaa aaagctcttc tgcacttaag cgaggtgaaa gcttacaaac 14641 atatcaagca caaggagaac taccaagtaa tctcaattta caatttgcca atgaaattcg 14701 tattgaccgc ttggcagaat ttggcgatat tatttgtcgt ggtatttggg gtggttggtg 14761 tgaaaaagtc aatgaatggc aaaaacaaca gccaaaagct aagagaaaaa atcttcctga 14821 tttagactta acccaaaagc ttgctgaata cttgggatta tcagaggaaa tcccagctat 14881 cagacatatt caagcactga aaaaaacagg tggtgttcct ttagattggt attatttagc 14941 agcaaaatac tttcaacaac atccaggtaa agattttact caaattctgg aagttatgaa 15001 agggatggtg aattacgctg ctagctttat tcaaccaatc ttaaaagaat ttccagacat 15061 cccggatgga tggaatgatt taaaaaccta tgtcagcaga gtgatttcac taccaacagg 15121 agcagtttca gcgccagaaa ccgcaccctt tttagtagaa ttacaacgct acaatgctgc 15181 caaaataatt gggagaggac gggaaaatgt ttgtgctatg tctagttcct cttacagcgt 15241 gactgaacag atggaatcag caacgttatt tgctcctcaa gtctacagta atcgtcagat 15301 tttgtttaat gctcaagctg ctaaacggca aatttgctca atatcgtcga ttgaaatcat 15361 gttgagacaa attttgatga atcaaacgaa tgctgtcggt gcagattttg aaagtcgaaa 15421 atatcgctat ctttaccttt accctactta cttcttcacc ccggaaacca ataaattttt 15481 gcagaaagct tacaaccaat tttcacggac tcgttttgat gctcccttgc gaaagcattt 15541 catcacagaa aatcaagttg ctaagttcag aattcaagac taccagcaag ttgattccct 15601 actcattaaa gacaatctcc aactagacga tagcgaagcg ctgctgcttt cagcagatcg 15661 cactttcaaa attagctttc cagaaaagga aaccctaact ttcttcttta tcggtttacc 15721 accaggaaga gaacccacag atacagaatc ttgggtgacg ccagcttggt tagcattcgc 15781 tttaccactg attttagatg tcaaagtcgt tgcatcagag tcgcctgttc caccttttct 15841 cagtggtgct gattttgaac aaacagtgtt cttggatggc gaacatcaag caattcgttc 15901 tttaattcag caagataact atcgcctaga tagcatcctt ccccgtgcgt caggaaagcg 15961 tgaattttca ccattaaatg cacttactgc tgcttactct atccacctag aagtcaaccg 
16021 caaaaaagat ggcgatccag attggggtaa attagcagac ttagcgcggg atttagcaac 16081 cagtcctctc tatgtttttc actatctcaa taaatggcta cggaaacaaa agaatctttc 16141 ctctgtaccc atcgccaaag ttcggctgta tttagacttg tactactact ttgaaccgga 16201 gggtaaaagt gtgaatcgac tgcgtgaact cacttcacta tatcgccgtt tttatcgtgc 16261 taaaagtttc tacgccaaag ccaatgcgat tctcaaacca attgatgaag ctgctgatgt 16321 gattttgaaa gttgataagg ctttagctgc tgacactgaa tctttaatag atgtcgtcgc 16381 agcacgttta tccaaattaa tgaataacgt gcggcggaaa gcagcagaag gaaaacccac 16441 cctgacttta gttgatggga agtggaaacc tgctttaact ccagaagaag aacgtcaagc 16501 agtctctgac ttcgccaagt attttgtgga gacgattttt gagggaagtt ttaaaggcga 16561 tcgcgcacgt ttagcaggta cacaacttaa cctcattcga gatacctgtg aatacctata 16621 tcgtttagca gacgacgaag aacgtcgcaa cttatctcaa gaacaaccag aagaaatacc 16681 tgatttagaa gcagatgaaa cagcttatgc ttgttcaaca ctctagatac gagcaatcct 16741 aaatcatttg taaaattcta tattctctct tttcttggcg ctcttggcga cgccagtcgc 16801 ctcaacgggg ggaacccccg cacggcgctg gctcgtcttg gcggttgata aattttacaa 16861 ctcaagtagg actgctatat caatatattt tcacaaataa ggaattattt ataaatgtca 16921 ccgaaagtct caatcattgt caactgcttt aaccaaggtt gctaccttga acgctcagtc 16981 aaaagtgtct tgtcacaaac atttcctgat attgaatgtt tgattgtcga tgatggctct 17041 actgataata ctcgtcaggt agctgagcag ttgatgaatt tagatgagcg agtcaaatac 17101 tactataaag aaaatggcgg tctcccttca tctcgcaatt ttggcgtcga aaaagcacag 17161 ggtgaatgga ttcaatgtct cgatgcagat gattggattc atgaagacaa aactagattt 17221 caactcagtt atttagaaaa agttaatctc aatgacgaca cagtctttta ctgcgattat 17281 gaacgggtgt ttttagatgc gaaccaaaat attgtcaaca ctcaggaaaa tgtcattggc 17341 tcattgacta aagatgaatt cattcaacgt ttactcattc ccgactttct tgcagataca 17401 cctcatcctg cccttcagca agctatgctg atgaaaaaaa gtatcttgag taagacaaag 17461 tttccggaat acctcaaagc actgggggat agatattttg ctgtcgctat tttaatggca 17521 ggtgcaaatt tcgtttacac cccgatgatt ggtacttact acacgaagca tcagtcaaat 17581 cgaactaata gttggaatta tatgaaaaac tactatatta tattttatga aaatattctc 17641 aagaattatc cagaactgaa caattgttgt caaagcggtc 
tggagttctt cttagaggaa 17701 gcaattatgg agaaagagga agacgatttt gaaagattac taaaaatagt tcctatgcca 17761 gttcgcttac tcaacaaaaa aattacgata aaaaataaaa agcaactcaa agttttccat 17821 actatcagaa aaatcctccc aagtttccta ctctacgaaa agtatcgcgg taaccgtacg 17881 aataaaataa tatccatgct gttttcaaaa ataaagttat aaaaaaatca tcaagaataa 17941 ttatgaagtt ttcattaatt gtaagtgact taagcggtgg tgggagtgtt cgcgcctttt 18001 tgctagcaca agtactcaaa aagcttaacc atgaggtaga aattatcggt tttctttttg 18061 ggaaagaact ttatgcaatc cctcccaatg ggatcaagat tgtttccatt cctgggaaaa 18121 actatcctga gtttttcagt tatagtcggc aagttttaga aaaaatagat ggagatatta 18181 tttatgcagt caaaccaaag gttacgagtt ttggtctatc tctactaaaa aaaatcagca 18241 gccgtcgtcc tttgcttctg gatatggatg actgggaatt aagctggtat ggaggtatga 18301 attggaagta tcgtccgagt ctgaagcaat ttgccaggga tattttaaag aaagatggtg 18361 cgttgagatt tccggatcat ccactttata tacaatggat ggaaagctta gtacaaaagg 18421 cggatgcagt aacaatagat actcaatttc tcaaagaacg ttttggcggt atttatctgc 18481 ctaatggtaa agatactgct atgttcgacc ctagtatgta cgactcaaag gctagccgaa 18541 ttcgttatga tcttgatgac tatcgcattt taatgtttcc aggtgcacca cgaccacata 18601 aaggtgttga agatgtcttg atggcgttgg atcggttaaa ccagtcggat ttaagactag 18661 tgattgtcgg tggcagtcct tatgatgatt atgatgataa actcattcaa agatgggggc 18721 gttggattat caagttgccc aaatgtcctg ttgaagttat gcctgaggtt gtagcagctg 18781 ctcatattgt ggttgttcct cagcgagata caatcatcgc tcgtgcccaa ttccccttaa 18841 aattaacaga tggaatggca atggctaaac ccgtgttatc aactagagtt ggagatattc 18901 cagaaatttt agatgaaact ggttatttga ttgaccctgg ttcaccagaa caaattgcag 18961 agcaaattga ttttatattt gagaatttag agtcagcaaa tgagcgaggt ctcaaggcaa 19021 gagaaagatg tataaaaaaa tatagccttg agactatggc atctacctta gagtccgtca 19081 ttgctgggtt atgagtactt ataatgtgtg attttcaagg gggaatttat aaaatttcac 19141 taacgctcaa aatctgattt taaaattatt tactttgatt gatttgatgg cgtgattcac 19201 tgttccctgt tccctgttcc ctgttcccta ttccctgttc cctgttccct gttccctgtt 19261 ccctgttccc tgttccctgt tccctgtgtt tcttgagaga caaatcccca atgtctacca 19321 gcaaactatt actaagattt 
gctaaaccgt atccagtttt gactctcctg acaatattgt 19381 tagggttttc tggagcttta tttaatggtg ttagcactgc tctgattgtt ccagtcattt 19441 taagaatagt gggacaagaa gtggatttca gtggtgcacc agctattctc aaaaaactta 19501 tgtctccgtt tgataatatt cccgaaaatt acctattagg agtgatggcg gggacaatta 19561 tcttcacgat tctgttaaaa aatctagcga cgtacgctag cactttagca tctagtacct 19621 taacgcgcaa agtcacttcg gatatgcgcg aaactggctt aatcttattg ctacaaattg 19681 atttatctta ttacgccaag atgaaagttg gtgacatcat caacagtctt ggtggagaaa 19741 ttagccgtgc tgcaagtgct gttggtaata ctattaagtt aatcatctta gggattacaa 19801 ttctagtttt tgtcgggata ctactgtcaa tttcctggca attaacgatt gctgctacgg 19861 ttttgctgtc tttggtaacg ttaataaatc agtatgctat tacccgtgct aaacagtttg 19921 ggaagcagct cagtgatatg tctagagcct attcaatcgc tatactagaa actctcaacg 19981 ggattcgtct agtcaaagca acgggtagtg aacaaagaga atatcaacgt attaggaaac 20041 tcattcacga gcgcgaaaaa gctgacttta agtctcaggt tcattcggaa gcgatcgcac 20101 cactgagtga agtcatgggc gttacagcta taatactgat tgtctttttg agtcgcacct 20161 tctttgcaaa ccaaattgtc ttgctttcaa cagtattact gacatattta ttggtactac 20221 tacgactgct accatttatt tctcagttaa atactcttcg cagtagcttt gctagtactg 20281 ctactagtgt ggacacagtt gctgagttct taagctggga gaataagccg ttaatgagta 20341 acggctcaaa tgtttacaca aaattaaaga agggagtgcg ttttgagtca gtttcctttt 20401 cctaccccgg tcatgaaaaa ttggtactta aagatgtaga tttatactta ccgcaaggta 20461 cgacactggc tttggtaggc ggttctggtg caggaaagtc aacattggca gatttgttac 20521 ccagatttta tgacccaaca tctggttgta ttaccatcga tgacatcgat ttgcgtgatt 20581 tcgacttgct ctcaatacga caggcgatgg gaattgttag tcaagatact tttcttttta 20641 atgactcagt gtggaataac attgcatacg gacgtcagtc agcaaccaaa gaggaagtta 20701 tcacagcagc aaagcaggca aatgcctatg agtttatcag tgaattgcca cagggattta 20761 aaactattat tggcgatcgc ggtgtgatgt tgtctggggg acaaagacaa cgcctggcga 20821 tcgcgcgtgc tttactacaa gatccagaaa ttttgatttt agatgaagcc acgagtgctt 20881 tagatactgt ttccgaacgt ttggtgcaag aggcgataga taacctcagt cgcgatcgca 20941 caacattagt cattgctcac cgcctttcca cagtgcaaaa agccgatcaa attgctgttt 21001 
tggatcaagg aagtgtggtg gaggtgggaa cccatgaaca actgttacaa aagggtggtt 21061 actattcgcg tctgtatgca atgcaatttg ccgataaggc gcaaactgct accaaacgcc 21121 agcaaaactt agtgcgcatc tctgttgaaa ttcgcacgcg gcttcagtct ataattggtt 21181 ctttgcgctt actacttgat aatagacaaa acaatcctca acagcgagag aaattcatag 21241 aagaatccta caaatcaggc ttaaaagttc tcaacagcat tgatgttttg gaagatattg 21301 ttcacctaca aacgaactgg actttatcag ccgcacagcc aaatcacagc ttagccaccc 21361 caaatcaaaa tttgataatt atctgcaatc agtttcggat gactcttgaa cctatactta 21421 actctctaaa ctccctagcc aatgattcga gaaacacgcc tcaaagccaa cataaattca 21481 tccaggaagc ttatcaagcg ctcacgcgtc tgctagaaaa tttagatcat tttgaggata 21541 accttaaatt ttaaaatcat gaaaaatacg actcttctga aaaactacgt cttcttttta 21601 acagaagaga tacccaaacc agaagctcat attatagtat ctgccaacgc agcaaacgca 21661 gccgcaaact tagggtactc atcagttttg gcatatcctc gcaaaggatt agcagctttc 21721 aacccaattc atttagctcg tccatttcaa ccaaggaaaa caccagaagc actcgtcaaa 21781 tattacaaca tccaggaaaa gctcaaagtt gctcccttac ccatgccttg gcctattgat 21841 tatatcaata gtaaattcac cgactctaac accattgcca caaaatatta ttttccgttc 21901 catatccttt cgacaaccaa acttgtccac gcttggaact ggaatttcat caaagctgca 21961 gttaaaaatg gtgtaccagc aatttacgaa caccaccatt acgaagataa gcaatttgag 22021 ccagaaattg tgaatcatcc actgtttcaa gtcgctgcta cagttactga tacagtccga 22081 gaacacatga tacaacatgg aatgccgcca gaaaaagtca ttacggtgca caacggctat 22141 aattcctcgt ttataattag acaaccagaa aaaatcgcag agtggcgcaa aaaacttctc 22201 aaagatgaac gttcacattt agtcgtttat gcaggagcat taagaaaatt taaaggtatt 22261 gatatcctcg ttgatgttgc tgcactcatg ccaaatattc aaattgtctg tgcaggtggt 22321 gatgagaaag aggttgcaca ttatcagcaa ttagcaagag atttgcaagt taacaacatt 22381 acattcttag gatatctctt gcacaaagat ttaccatctt tgctacaagc cgctgatatt 22441 ttagctcatc cccattgttc cggacaagct gcaacattca catctcccct caagctgttt 22501 gactatttaa cttctggaaa tcccattgtc gcaacagaaa tcccctcttt aatggagttt 22561 aagaatacta acgctatcgc tgcttggtgc gaaccagata gtcctagtaa atttgccgaa 22621 gcgatcgcgc aggttttaaa aactcatccg aggaaagtcg aaggctatca 
agatattatc 22681 aactttgtga agcagttttc ttgggaaaat cgagccgcaa aaatcttgag ttatgttgat 22741 gaatctcttc gtcctccact tattgcttaa ctatagtagt cctatttgat ttttgaacag 22801 ctaggtacag tttatgttaa ccagttatca gttatcagtt atcagttatc agctatcagt 22861 catcagctac ctacgcagta ccagttatca ggtaggaaac ggactcgtcc accccttgtt 22921 cactgtttac tgttcactgt tccctgttaa gagttcccta aaggtaagga tatgcgacgt 22981 gacgaagtat taacaatttt gcaacagcat tggactgttt tgaagaactt tggtgttcga 23041 tcgctatcaa tttttggttc tgtcgcacga gatgaagctc gatctgacag tgacgttgat 23101 atcttgattg aacttgaacc accactcacc tttgatcgct atatggaaat caaattttat 23161 ttggaagatc aacttggaac taaggttgat ttagttagtt ggcgatcgct gaaaccctca 23221 atccgtgatg ttgttgaaaa agaggctatt cgtgtcgcgt aatattcgtt tttatttaga 23281 ggacattgtt ggttgctgtg caaaggtatt acgttacacc caaggcataa ctttcgagca 23341 gtttatcgtg gatgagaaga catttgatgc ggttgcgcgt aaccttcaaa tcattggaga 23401 agctgtgaaa aatattcctg tggaaatgcg ggaggttgcg cctgaaattg agtggcgaaa 23461 aattgcgggt ttaagagata ttctagctca tacttatttc caggttgaga acgaaattat 23521 ttgggatatt gtgcagaata aagttcagcc tttgcaagaa caaatacaac agttgctgga 23581 aagtgagttt ggcgatagcg tttaacccac gcaggtgggt ttcgtctgta taaccgtacg 23641 tgtgcgactg tttgaatgct gaatcaaaga ctgtggagta cacttagtaa atattggcat 23701 cttttttcca ttctgctacg agtggctgaa tcaaattgat acgtttcccc tgtgcgtctc 23761 cttattctct cctcatcatc cacaaattaa tgaaaaatga gcatcaaaac tattggtatg 23821 atcagtagct atcctggttt agatcaaaaa ccagattggc tatggcaaca aactcctcat 23881 ccttttggtg tttggggtga tattcaaatc cattctacag cccccaagcc agatttttta 23941 ttaatgtata actacacatc ttttcctgaa ccaccgcaaa agcagttgtg gttttggaaa 24001 aatagaaaag cacaacttga atacgaaaag gcacagcaaa cgttacaaaa taaacttata 24061 aacattccta aagaacgagt catttatctg ttgcgagaac cacctttaga cgaagtggtt 24121 gaattgaatc aaaaatttta tcaacaggct caagcatact gcggttatat ttctggacca 24181 gatgattttg cccccacgcc gaattatatg cctgccattt ggtattattc taactcattt 24241 cgtgagttga atgatatgcc accgccagaa aaaattagac catgtagttg ggtgacatct 24301 ggtattagtc gcacagttaa ccatcgccag 
cgcttggagt ttttaaaact gctgcgaaac 24361 agcgaattga agtttgattt atacggtcgt ggcttaccag agtgggcaca aggaggcggt 24421 gaattgagta ataaatggta tggtatggct ccatactatt ataacctggc aattgaaaac 24481 tatgctgaca ataattggta tgtgagtgaa aagctatggg atgctttgct tgcttggtgt 24541 ttaccaattt actatggagg tcctgctgcc gataaattat tacctcctgg tagtttttta 24601 agattaccaa gtttggatga aaaaggactt ctttacattc aagaagtgac agcaacaact 24661 gatgcttggt atgcagccaa gagtgcgatc gccgaagcca gagaaatcat tttgcataag 24721 ttaaacttgt taaattggct ctcggattat gttaaacatt gttcataaac atgggaaatc 24781 tcggagtttg tggagacttg aatgactgaa gggatttacg tactcgctaa cgatatcgtt 24841 tttgaccagt tagtagcttt cttaaatagt attgaagcga atgctggcac caattatggc 24901 agcaatactg gcaaaaacta cccagtttgt atcattcctt atgataacag attagaaaaa 24961 gtaaaagacg aaatcaaaaa cagaaacaat gtagaaatat ttgcagatac agcagctatt 25021 gcacgttggg aagatttcgc cacccaaatc tggcaaactc atcctagtgc ttttcaaact 25081 tgggaacaaa acggcatctc gggagtttat cgcctaggaa tgcatcgtcg tttttgcggc 25141 ttcgatggtc cgtttgataa atttatttat ttcgatgctg atattctagt cctaaattct 25201 ttagattata tttttcagca gttaaatcag aatgacttcg ttgtttatga tttccaacat 25261 aaagatgctg ctcatgtcta caatgttaag tcaaatcagt tgttaaatgt tttcccgctt 25321 gctcgcatag attcggaaat cttttgcgcg ggtatgtatg gaagcaaaaa gaatatcttt 25381 catcaagaaa aacgtaatga aattatatca caactcaaac aagatcaagc agaaattttg 25441 tatatgaatg ctcccgacca aacaattctc aactacatgg tcatgaagtc aggcatatct 25501 agctataatt ttgcccatca tcttccggaa aatgaaagaa ctggttgttg tgtcacctct 25561 cctcattttg aagccagaga taatatcttg tatgacaaag gtcatccgtt aacttacatt 25621 cactatatcg gtttatcatc tcagttgttt acccgcgttt gttctggaga aaatattgat 25681 tttccttatc gagaaatttt cttgcattat cgctatctgc atgaaccaga caagcgacca 25741 aagttcaaga gtaagccgaa agcttacaat gcaccaccga gtttagctac acgcatttta 25801 agaaagttag gattagcagg ttgaggatta ttcaatgaat cgtggaattt atattattgc 25861 gaatgataag gtgactgacc atgcgatcgc actactcaac agtatccgtt tgcatgatac 25921 agaaacacca attgtcatga ttccttatga tgataattat cacaatattg cagatacact 25981 cgaaaaatat 
tatggagtac aaatctatga agacctagat tttatcgata gattatcaat 26041 caaattacat gagctttttg gaaacaagtt tttcgctcgt cctaatcaat ttcgcaagca 26101 agcttgttgg tttggacctt ttgatgagtt tttgtacata gatacagata tagttgtctt 26161 tgaaaaaatt attgagaatc tgaattatct gaatcaatac gattttattt gttgtgatta 26221 tcaacacgca ggtggaataa ctaatgtctt tacaccaaaa gttttagaag aaaaagtctt 26281 tactgaagat gaagtcaaag atattttcaa tggtggtttt tggggttcca agaaaaatct 26341 tatttcggaa caagatttgt atgaaacttt cgctgagtgc gctgcacacc cagaatattt 26401 cgatttttcc caaaaaactt ccgaccaacc tattatcaat tatatgctcc tgaagcgaat 26461 cccacgtcgc tttaacattg tccgcagaga gggtaaagca ccaggaaatt ggggaggaag 26521 tcaccatttt cagactcagg gaaatatact gattgatcct aaagttaatc aacccctaca 26581 atatctccac tgggctggta ttcggattga acctggttgt ccttattggg agatttggga 26641 atattatcgc aatcttaatc cggaactgac tccagcagtt ttcccacaaa aaccaaagaa 26701 aagtcagtgg gagcaaacat tggagaattt gaagaatcaa ctcagaaaaa tgaaagctaa 26761 tttgtaaaat aaacagggta gttactatct accctcgttt agcttaattg ggcagaagcc 26821 tcttgctgta tctgtacagt gcgatcagcc tccggcttat cgcatgtgcg tatcgccatc 26881 gcacgatact tgcaccacct ctcaaactta ataaatctta gtatttttta tcactaaaaa 26941 tctcaataaa agtactaaga aactttaaaa gttttttgca gtctagcaat tcagtgagac 27001 aatcaaacct attctcttca attcaagcaa gacatggacg atcgcacaaa ggctattcaa 27061 gccctcactg ctcaacgctg gcgagtctcg ctgatgctaa gtggagccat gatgtttatc 27121 tattttggct ttattctgct gattgctttc aataagccgt tattgggttc acaaatcatt 27181 ccaggactaa gccttggtat tttactggga gcgctggtga ttgtctcagc atggatatta 27241 atttttattt acgtacgctg ggcaaacaat aattatgatg acaaaattgc aagactgaca 27301 cgcaaatgaa gggtagtggt tagtcaaaca agcaataact cacaattctt caattcttta 27361 atcaaaaatt aaaaatccaa aatcaaatat gaatagtttg tggtgttatt tgccactggc 27421 ggaagtggat attaaaaatc tcggtcactt caacccactg gcaatatttt tcttctttgt 27481 gtttgtcgct agttctcttg gtattactta ctgggctgca aaacttacca aaagcacatc 27541 ccacttctac actgctggag gtaatatcag cggtttccaa aatgggctag ctttagcagg 27601 agacttcatg agcgctgcta gctttttggg gattactgga ctagtcgcac tcaatggttt 
27661 tgatggcctt atttattcta tcggcttttt ggtgggttgg cctatcgtta tgttcttaat 27721 cgcagaacct ctgcgtaatt tgggcaagta cacctttgct gatgtggtgg cttatcgttt 27781 gcgacaagca ccagtacgaa tagcttctgc ttttggttca ttagcagtga tcagcttcta 27841 cttaattgcc caaatggtag gtgctggcga acttatcaag ctgctgtttg ggtttgacta 27901 tgaattagct gttgtcatcg tcggttgcgt gatgatggct tatgtgattt tcggtgggat 27961 gatagccaca acttgggtac aaattatcaa ggctattctg ttgctgggtg gaactatttt 28021 gctagcaatt ttggtactag cgaaatttgg ctttaaccca atggcacttt ttagtagcgc 28081 agctgctaaa tatggagcag gtgtcttggc tcctggtaaa caagtttctg atcctttgga 28141 tgcgatttct ttagggatgt cgttgatgtt tgggacagca ggattacccc acatcctcat 28201 gcggttctac acagtgcctg atgctaaatc tgcgcgttat tctgtaacat atgctacagc 28261 tttgattggc gttttttacc ttctcacctt cattcttggg tttggggcga tggtgctagt 28321 cggtcaagat gctatcaagc aaattgggac tggtggtaac atggcagcac caatgttagc 28381 tgaatttcta ggtggtgatg cctttttagg ttttattgct gctgtttcct tcgcgacgat 28441 tttggcggtg gttgcggggt taactctttc tggtgcagca gcattgtctc acgatttgtg 28501 ggtgaatgtg gtgcgaggcg gtcatgctaa cgagacggaa cagctaaagg tggctcgctt 28561 tgcaaccatg atgctgggag cagtggcgat agtgctaggt attttgttta aggggcaaaa 28621 cgtagcttat atggtgggtt tagcatttgc gatcgccgct agtgcaaact tccccgcctt 28681 gcttttatct atgctgtggc gacgcttcac cacctatggt gcagtagcaa gtatgttggt 28741 gggtaccctc tcctcattag tgctgattta tttctcacca actattcagg tgactatcct 28801 caagcacgct tctgcaccct tcccattgaa aaatccaggg ttaatcagta ttcctctggc 28861 gtttttggtg ggtattgtcg tttcactatt agctagcgaa cgagaagcac aggaaaaatt 28921 tgcagaagtt gaaaaccgta tccatattgg tttcaacggt taatactaag ttgcgttcaa 28981 agagataatt ttataaacga accacgaaga cgcgaagagc gctaagcaag aaaaaagaag 29041 attacctgta tgaatgcaac ttggtataag attatgttct tctctcagat gtaggaattt 29101 ggtttcctcc actattttgc ttgttcaaaa tttttgaacg agcgaatagc aatctgaagc 29161 acattcccgc ataaattaac tcctatttcg acccccgatt tctcgcgaga agtcgggggt 29221 cttgtttttt gacaaatcat ttaggactgc tatagtcgtt tgctattccc tagactgatt 29281 tgtctcgttc tcgcatttcc tgtcgtcgca tgggcatagc 
atcaatttcg tcaaaaatag 29341 ctttcgcttg cacaggagtt gtctcgcggc gcttgacacc accgtcttta cccactaaga 29401 cagcgatggg acgcttcagt catgaagctg tcgcagtaga tccaaagacg ggctacatct 29461 acgaaactga agatagagga gatagctgtt tttatcgttt tgtacctgct gtaggtctag 29521 caaatgcgcc tggacagtta gccaaaggcg ggactctcta tgctttggtg ataaaggata 29581 agcccacact gaacacgtca aacaactcta acattggagg cgctgaaggc ttaattccta 29641 tcggtcagcc tttaccagtg gaatgggttc aaatagaaga tgttgaccct gtacaagata 29701 ctgtccgcaa agaagctcaa tctaaaggtg cagcgatctt ttatcgaggt gaaggagctt 29761 ggtacgataa caatcttatc tattttattt ccactcaagg aggtccgcct gcagtggata 29821 gtacgtacgg caatggtcaa gtgtggattt ataatcctag agaagaaaca attacattat 29881 ttgttgaggc ttcccctagc ggagaactcc tcgatgagcc tgacaacatt actgtagcac 29941 ccttcggaga cctcttcctt tgtgaagatg ggggtggtga acaattcgtt gtcggggtca 30001 atcaaaaagg tgagctttat cagttcgcac gtaacgccat tgttagacca gaccgtaatg 30061 gcgaacctga taacagcgag tttgctggcg cttgcttttc tcctgacggt gacacattat 30121 ttgtcaacac tcaaggtgtt ggtattacat attgtatttg gggaccgtgg ccgcgccaat 30181 acagaaagtt taggtagatt ttgaggaaag gataaagtag aatttttaaa attttatcct 30241 ttattcctca tagtttttag tgaataagtt ctacattaag aaacctagtt gcttcaagaa 30301 attgggtttc ttttttggcg tcaaactctt gtcaagatat gataattttg aaaagttttt 30361 acttgacatt aatactaata taacggctat tgttgtaact tcaaaagcgc tatattattt 30421 agtgcgatcg cataaaagca tttttgttcg gaaaatggag acttcaggtt gtagatattt 30481 catttttgtg agttctcctt tcaaaagtcg taaaccagca atattttggc aagaaaaggt 30541 ttttctacta aattttgagg agattataat caatatgtaa ttttacgaaa gtttaagtat 30601 tttgtttaag aaaagttcgg ttgctaaatt tccattaagt tgatgaggag ataaagttat 30661 gctcatacag tcagtagacc cctcaaaatt atccaaaggg gaacttcccc atccacctga 30721 gtttgagcgt gtggaagatg agcgtctcca tcgcaaacag cgtttggctg ctgcctttcg 30781 tttgtttgct cgttacggtt ttgatgaagg tgttgctgga cacatcactg ctcgtgatcc 30841 tgaattccat gaccattttt gggttaatcc ttttgggatg tactttggtc acatccgagt 30901 ttcagacctg gtactggtta accacaaggg tgaggtcgtt gagggtaaca agccagtaaa 30961 cgctgctgct ttcgccatac 
actctcaaat ccaccaagcg cgaccagatg tagtagcggc 31021 tgctcacaca cattccctct acggcaaaac ctggtcaacc ttgggacgtc ttctcgaccc 31081 gttgactcaa gatgcttgtg ctttctatga agaccacgct ttgtttgagg actatactgg 31141 ggtggtactg gatttggaag aaagtcagcg tattgccaat actctgggac taaaaaaagc 31201 ggcgattctt cgcaaccacg gcttactgag tgttggtcat tcagtggatg aggcagcatg 31261 gtggttcatc acaatggatc gctgctgtca atcgcagttg atggcccaag cagcaggaaa 31321 gccagtgctg attaaccccg acactgctag tctgacttat cgtcaagttg gctcacattt 31381 catgggttgg ttcaatttcc agtcactgta tgacatgatt gtccgtcagc aacctgactt 31441 gctggattga gtaagattct tcttgctgct ttgggggagg atccgcgcga acgtcgtatg 31501 cgcaaagcgc acgccttacg gcgttagcgt aagcgcaagc gcacgctaag agcg // LOCUS NODE_897_length_31053_cov_5.09019931053 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 31053) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 31053) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..31053 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 658..1377 /locus_tag="DP116_07685" CDS 658..1377 /locus_tag="DP116_07685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013193213.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07685" /translation="MGNFAGGNLEAAKNELGKALGKGNEIRVAEISLPIPKQIRAKRQ QEEAERLGRQQEAERLKLQEADRRRQQEPVITRQQFLKWVGLGVAGLVTAVVAGKIFI SSLESTSKTTADAPSSATEQATQPPSAQATQAAQKDALSEIRRKQLNADIRAREGRND AFNDGSATKRAARDIESKVRSKLEANIPSGHLTIAASEDGTVTVSGTVAKKDQLAKID TLPKQITGVTKIVNRATVAQE" gene 1401..1949 /locus_tag="DP116_07690" CDS 1401..1949 /locus_tag="DP116_07690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876004.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="formylglycine-generating enzyme family protein" /protein_id="PRJNA477356:DP116_07690" /translation="MGSNPSNFKGAKRPVEKVSWNDAVEFCKKLSQKTGRKYRLPSEA EWEYAARAGTTTPFYFGQTITTSLANYNGSFTYASEPKGEYRQQTTEVGSFLPNAFGL YDMHGNVFEWCQDTWHESYKGAPSDGSAWVDNDNQRYMQRGGCWDYNAVSCSSACRAY NVAGVRYFGNGFRVVWAGAWTI" gene complement(2228..2407) /locus_tag="DP116_07695" CDS complement(2228..2407) /locus_tag="DP116_07695" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07695" /translation="MIGLILGYELIRVVTQYLMLYSSSYGFGSIDADFSGDFDGGLGG AMLHQGAAQGAISGA" gene 2441..3265 /locus_tag="DP116_07700" CDS 2441..3265 /locus_tag="DP116_07700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07700" /translation="MAFDNVCKILAEKYPTDFAHWLLPDEPRKVKLLKTELSIEPIRA DSITFLQTENRILHLEFQTTAKSETPIPLRMLDYFVRLVRKYDVPITQVVIFLQQTSN EIAFTEEYVNEMTNHRYRIIRMWEQDSALFLNNPALLPLAPLTQTDSPQWLLSQVAQS IARISDRETRQNIAAYTEILAGLKFEKDLIQQFLGEEIMQESVIYQDILQKGDKQGEE RTIIRQLNRRFGEIDSSLIDRIRVLSVEKLDDLAEALLDFSEVSDLIAWLDEQEEN" gene complement(3320..3949) /locus_tag="DP116_07705" /pseudo CDS complement(3320..3949) /locus_tag="DP116_07705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010472814.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 4305..5189 /locus_tag="DP116_07710" CDS 4305..5189 /locus_tag="DP116_07710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015079003.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="flagellar assembly protein H" /protein_id="PRJNA477356:DP116_07710" /translation="MKTDSIFYRLFQEFPSIFFELIGEPPQEANAYQFSSVEVKQTAF RIDGVFLPTKESDNPSTAFKLSEAEASAALSTSPLRVNPIYFVEVQFQGDSEIYARLF AEIFLYLRQNQPQNDWRAAVIYPTRSIDTADRKHYREFFSSQRVSPIYLDELGEAVSL PISIATVKLVIENEDTAINKARELIDRTQQEISSQQQQRQLLQLIETILIYKFPTMSR EEIEAMFGLSELKQTRFYQEAFEEGKQEGRLEAQLETVPRFLALGLTVEQIAQALGLS VQEVQQALQQQSSNESSR" gene 5322..7910 /locus_tag="DP116_07715" CDS 5322..7910 /locus_tag="DP116_07715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AAA family ATPase" /protein_id="PRJNA477356:DP116_07715" /translation="MTSQEIVIWLQERTALGILSPEPLNAIREASALWAIAQVIEEQV IPPNHRLVTEGTPPEALYILLEGQLESDSNNKTNPALAIGFLPGSIIHLQELMLDESA QRTITTVTECHLWVVPADKFRELLTQYPEIAQAVSRQMAQELAQLTSALTYEQERSTA LRPYLVPKAERGIVGTSRYAVRLRQEIREAANDRKSVIIFGEPGLGKDTIAALIHFGS KQRREPIIKVNCSILQTSGADLFGRTGGKPGLLEWLGEGTLVLNNIQETPPELLPVLA SLLKTGKYTPVSRSGEATPEPRVSNARILIVAEKTQPQIERCVGHIIKVPPLRVRKAD IKAQIEYYISLYTRSRGISKPKVTPEALRRLQSYDFPGNLKELQNLVERAIVQAGEAK ELTEEIFWAVDTKKKQFRVNLLNAYPELRKFLRSPWWPDRINYGFTLTAFAILVGVLF FGPQTRDRNFALNLFWAWWWPFFLLAFPFLGRVWCAVCPFMIYGEITQKLSQKFFPGK LKRWPRHQAEKWGGWFLFGLFTLIFLWEELWNLENTAYLSGCLLLLITAGAMIFSAIF ERRFWCRYLCPIGGMNGLFAKLSMTELRAQQGICSATCTTYQCYKGGPQKGEGMETGG CPLYSHPAQLEDNRDCVLCMTCLKACPHRSVEFNLRPPGVELWTTHIPHSYEVALLFL LLGGVFLHHLSELQSWLGLHLDLTQFLPHLGLSLLALLIPVAIPFLAYGIMQILYLSN KGLKSTQQNPKPRRFVELAYGYLPLVLGGNLAHYLRLGLGEGGRILPVTFATFGLKSE QLFTLVAHPAVIDFLQGSTLIVSVLLTIVLTQKIARQPMRSLFWQHLAAIGLGASMWA IIVSGL" gene 8169..9350 /locus_tag="DP116_07720" CDS 8169..9350 /locus_tag="DP116_07720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876862.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="low temperature requirement protein A" /protein_id="PRJNA477356:DP116_07720" /translation="MASFIEPPRLRIGEDSEEERRATWLELFYDLVFVVAVSQLAHYL HDHVSLSGVLGFVALFIPVWWSWIGTTFYANRFDSDDVTHRLLTAVQMLAIAGLAVNV HHGLSESCTGFALSYALGRVVLVVEYVRAGWHIPTARPLTNRYATGFTIAAVLWVISG FVPIPWRFVFWTLGLIIDFATPLSARKLQLELPPHSSHLPERFGLFTMIVLGEAIVAV VDGVSEQNWDVLTVIAAVFGLCIAFSLWWVYFDNLGGTPIQKARTEGRVTIFNVWLYT HLPLVMGIVAAGVAVELVLLSKPMLALSDAVRWLLCGSIALCYLGLGILHRIGVIRYC KSRAKFRIGAAPIILAIALFGKDLLPVAVIGLVALVCAVQVVQDVTQSRPTTRLVEPE I" gene 9482..10084 /locus_tag="DP116_07725" CDS 9482..10084 /locus_tag="DP116_07725" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07725" /translation="MRDIITVYIWFPKEFLGHASMQVGIDTYISFWPNKEIINRSSMQ TSQSIFNHKNQNINQLLFQHLADIRSTSYEDDCLILGQADSQREADCKVELFNLPKEP IKTFWREFTNQKNSYHLIKRNCSSVVAEAINEGWKAYIGKGKNAGNKSFADFENQVEY KIPDFEALSFTTAALNFGKLLFWSPKQVLLYAQMVKRLTD" gene 10210..11901 /locus_tag="DP116_07730" CDS 10210..11901 /locus_tag="DP116_07730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="potassium channel protein" /protein_id="PRJNA477356:DP116_07730" /translation="MKPRIIVCGLSRTGYKVFRLLRQQGAFVVGVHHQPIPGESSGDL IVGDLQAASTLTAAGIQQAHTLVIAGSNDELNLSIMMQARVLNPQIRIINRFFNTNLG DRLDKAVPNHLSMSVAGLAAPVFTFAALGNQAIGQIKLFKQTWPIHEEYIDKNHPWLG RKLSDLWNDRSRMMIYYMPVKGEMDLVAAVLSGQELKVGDRLIIGTQPCIGSTRKSVI AKLLKVLTNLRQFKKHAESVVAMTIVLFVIIVIATVTYTSTNTNISIIDALYFAVGMI TGAGGNDKVVQNAPGSIKLFTVFMMLIGAAVIGLLYALLNDFVLGSRFKQFWDAARVP HRHHYIVCGLSGIGIKVVEQLSASGHEVVVIERDSNNKYVNTARGAGIPVIYADASFS ATLKVANLDSAAAVLAVTGNDATNLEVALKAKGMTPQVPVIVHYADPDFARMAQEVFD FEAVLSPAELAAPAFAAAALGGKILGNGITADSLWVAFATLITPVHPFCGQLVKDVAM SADFVPLYLETNYQTLHGWDLLEMSLSAGDVLYLTMPATRLYQLWRSAPPQLMAS" gene complement(12396..13268) /locus_tag="DP116_07735" CDS complement(12396..13268) /locus_tag="DP116_07735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315808.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_07735" /translation="MAHLNLRRPQNVSGDFYVDTSCIDCDTCRWMAPEVFTEVDEQSV VYHQPVNEAQRLAALQALLACPTSSIGTVEKPQDVKVAQVSFPIPVEDNVYHCGYHSE NSYGAGSYFIERPEGNILVDSPRFTPPLVKRLEQMGGIRYMYLTHRDDVADHQKFAEH FQCQRILHEDDITSGTRDVEIQLTGTEPFELAPDVLIIPVPGHSKGHTVLLYKNKFLF TGDHLAWSDSLKQLIAFPHHCWYSWSEQIKSMRDLANYSFEWVLPGHGRRYHADVETM SQQMHKCVAWMESV" assembly_gap 13642..13651 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 14530..14961 /locus_tag="DP116_07740" CDS 14530..14961 /locus_tag="DP116_07740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319744.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Photosystem II extrinsic protein" /protein_id="PRJNA477356:DP116_07740" /translation="MKALVRLLTVFSLLLGCLGWLGTAQTAQAADLSLTAFRSVPVLA AASLRNPADENLAEVYGKKIDLNNTNVRAFQKYPGLYPTLAGKIITNAPYQKVEDVLD IAGLSEHQKQVLQANLDHFTVSEVVPAFTEGDDRFNNGIYR" gene 15229..16905 /gene="nadB" /locus_tag="DP116_07745" CDS 15229..16905 /gene="nadB" /locus_tag="DP116_07745" /EC_number="1.4.3.16" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206923.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="L-aspartate oxidase" /protein_id="PRJNA477356:DP116_07745" /translation="MPQTDIKSQFDVIVVGAGAAGLYTALCMPAELQVGLITKETVSL SASDWAQGGIAAAIAPEDSPSLHIEDTIQAGVGLCDRPAVEFLAKQAPSCIQSLVNLG VAFDRHDSHLALTLEAAHSRNRVLHAADTTGREVTTTLTAQVLRRQNIQVIQQALALS LWLEPETRRCQGISLFYQGHVRWIRANAVVLATGGGGQVFAQSTNPAVSTGDGVAIAW RAGAILRDLEFVQFHPTALTKPGRFLISEAVRGEGAHLVDNDGRRFAFDYHPAGELAP RDVVSRAIFSHLQRTSTDPATAHVWLDMRPIPAEKIRLRFPNIIKVCQHWGVDVFSQP IPVAPAAHYWMGGIVTDLMNRTNIPGLYAVGETTSTGVHGANRLASNSLLECIVFGAQ MADLQIFPDVSRVSNLSETLAIREFRADVTEWKPQQEHLEVLRDKIPRLLWQSAGICR EQSSLESAIATLESWQQNYISLPVSQFLLSLRPTESARLEIPDVERQLRLWAETRNLL DVAFLILKSAAFRTESRGGHYRLDYPQLDPDWQVHTLIQQNNWWKSPVFG" gene complement(17207..18193) /locus_tag="DP116_07750" CDS complement(17207..18193) /locus_tag="DP116_07750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873585.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07750" /translation="MSRRRSIPWIHKWSRILIAAIASCGALTTAYLTVVKLTQGSAAC PTNSCDVVLSSPYATVLGLPLALFGFLAYASMATFALAPLAVDPAKNKASRTKLENWT WLLLLAGAIAMSIFSGYLMYLLFFTIKALCLYCLGSALFSLSLLVLTIIGRTWDDIGQ IFFTAIIVGMVTLIATLGVYSGVNNNGGSATVSNSGQGTQVSFQPKPGNEPKPGVGWE ITTTSTDAEIALARHLTKIGAKEYIAWWCPHCHEQKLVFGKEAYAEISHVDCATADNP YAQTDTCKAAKVESYPTWIINGQTYPGVKTLEELAKISGYTGPRNFKYSLRR" gene complement(18340..19188) /locus_tag="DP116_07755" CDS complement(18340..19188) /locus_tag="DP116_07755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410840.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphorybosylanthranilate isomerase" /protein_id="PRJNA477356:DP116_07755" /translation="MDLYQLFKTRTPIIGVVHLLPLPTSPRWGGSLKAVVDRAEQEAT ALASGGVDGIIVENFFDAPFPKNQVDPAVVSAMTILVQRIQNMVTLPVGINVLRNDAR SAMAIASCVRAQFIRVNVLTGVMATDQGIIEGEAYELLRYRRELGCDVKIFADVLVKH ARPLSAVNLTTAVQDTIDRGLADAVIISGWATGHPTSPEDLELASSAASGTPVFIGSG ASLENIATLIQAADGVIVSSALKRHGRREQPIDPNRVSQFVEAARKGWNSKGETKSIS ELKLYS" gene 20033..21352 /gene="rimO" /locus_tag="DP116_07760" CDS 20033..21352 /gene="rimO" /locus_tag="DP116_07760" /EC_number="2.8.4.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315849.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S12 methylthiotransferase RimO" /protein_id="PRJNA477356:DP116_07760" /translation="MGDKATIAISHLGCEKNRVDTEHMLGLLVEAGYGVDSNEELAEY VIVNTCSFIEAARKESVRTLVELAEADKKIVITGCMAQHFQEQLLEELPEAVAVVGSG DYHKIVDVIQQVEQGKRVKLVSSQPTYIADETTPRYRTTTEGVAYLRVAEGCDYRCAF CIIPHLRGNQRSRTIESIVTEAEQLASQGVQEIILISQITTNYGIDIYGKPKLAELLR ELGKVKVPWIRIHYAYPTGLTPDVIAAIGETSNVLPYLDLPLQHSHPDILRAMNRPWQ GRVNDGIIERIKETLPEAVLRTTLIVGFPGETEEHFEHLSQFIQRHEFDHVGVFTFSK EEGTPAYNLPNQLPQFVMDERRNVIMELQQPISLKKNQLEIGKVVDVLIEQENPVTGQ LIGRSGRFSPEVDGQVFVTGEARLGTIVPVAIQKADAYDLYGQVVTN" gene 21536..23080 /locus_tag="DP116_07765" CDS 21536..23080 /locus_tag="DP116_07765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent helicase" /protein_id="PRJNA477356:DP116_07765" /translation="MNLSFLDLGISQECVQQLEKLGFSAPTNIQAQAIPHLLSGRDVV GQSQTGTGKTAAFSLPIIDQVDVTQKAVQALVLTPTRELAIQVHDAINHFIGNQDLRV LAIYGGQSIDRQILQLKRGVHMVVGTPGRVIDLLERGCLKLDRVKWFVLDEADEMLSM GFIDDVEKILSQAPKERQTALFSATMPPSIRQLVNKFLHSPVTVTVEQPKAAPNKINQ VAYLVPRHWTKAKALQPILEMEDPETALIFVRTRRTAAELTSQLQAAGHSVDEYHGDL SQQARERLLTRFRNRQVRWVVATDIAARGLDVDQLSHVINFDLPDSVETYVHRIGRTG RAGKEGTAISIVQPFERRKQQVFERHNRQTWQVLSIPTRTQIEARQIEKLQTQVREAL AGERLASFLPMVSELSEKYDPQAIAAAALQIAYDQTRPAWLQSEPDVTEEDLVPPTPK PRLRSNNRRGESSGDRNRSSWVASDASNTEGGRGTPKPKLRTGRRETSVSPKKLGSGA ARETAS" gene 23337..23921 /locus_tag="DP116_07770" CDS 23337..23921 /locus_tag="DP116_07770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197870.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_07770" /translation="MTITTAKRFTIAEYERLAELGFFREDERVELINGEIIPMVSKGR PHSVCETRLFRELFKLVGERGTLRGQEPIIIFNYNQPEPDFVIAQNRDDDYLSAHPSP VDILLLIEIADSSLKYDQEVKLPIYAQAGISNYWIFNLVGNSLECYSESYQDLQGKFG YRRKLIVLPNESVCLPCFPDLSLDLSKVFPQQFA" gene complement(23892..24509) /locus_tag="DP116_07775" CDS complement(23892..24509) /locus_tag="DP116_07775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654469.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07775" /translation="MAYSDFSLRRVSQDFKLTLTQGTFLSEYQAITPSAYLAQYLDKS LPLAIALGTEKARSEMIICPILIELREILQREISLFSGIDFTVDQSLGLNGICDFVIS RSPEQILISAPVAVIVEAKKDDLNAGLGQCIAEMVAAQRFNQQQDHPIFNIYGAVTTG SLWRFLKLEGQNVIIDLTEYGVPPVDRILGILVSMVSGKLLGKNF" gene complement(24612..25997) /locus_tag="DP116_07780" CDS complement(24612..25997) /locus_tag="DP116_07780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113084.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="1-acyl-sn-glycerol-3-phosphate acyltransferase" /protein_id="PRJNA477356:DP116_07780" /translation="MSFQEAQPPLEFIPPALDPWFLRVCQLFLPSLIHWRTAISHIEA DNPEVLLDLYRQFQDRKIRFLIAFRHPKAEDPLCLVYLLSHILPKVARHKGVVLQPPI HAHFIYDRGIPLWAGSYIGWVASRLGGTPIVRGKADWTGLRSARDLFANGQFPMAAAP EGATNGLSEIVNPLEPGIAQLGFWCAEDLHGDNRHEQVFIVPVGIKYSYVSAPWDAIA NLLSELEATSGLTVSIESHSENSSFDSLYPRLFRLGEHLLSLMEKFYTRFYHQKLPTQ EEAKDVNELLAFRLNALLDVVLRVAEQYFDLQPKGNFNDRCRRVEQAGWNYIFREEFK DVTALSPLERALGNRIAEEANLRMWHMRLVESFVAVTGKYIREKPTAERFAETTLLIW NMVTTIRGENSLGRPTLGKQSVKITIGEPISVSERYPVYKASRQNAKQAVADLTKDLQ LAMEDLILQGE" gene 26103..27899 /locus_tag="DP116_07785" CDS 26103..27899 /locus_tag="DP116_07785" /EC_number="6.1.1.15" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861785.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="proline--tRNA ligase" /protein_id="PRJNA477356:DP116_07785" /translation="MLFLTLRDDPADAEIPSHKLLLRAGYIRRIGSGVYAYLPLMWRV LQKVSQIVREEMNAAGGQECLLPQLQPADLWKESGRWDTYTKAEGIMFALKDRRDQEQ ALGPTHEEVITAIARDMIRSYRQLPQLLYQIQTKFRDEIRPRFGLMRGREFIMKDAYS FNVDEESLEKTYQDMHIAYSNILRRSGLAFRAVQADSGAIGGSASQEFMVLAEAGEDE VLYTKDGQYAANVEKAVSLPPDAQPSAFTNYEKRETPGTETIEKVCEFLKCSSTQLVK NVLYETIFDNGTSVLVLVSIRGDQEVNDVKLQNELTKLAPKYDAKTVIKLAVPDTEAQ RKWASKPLPLGYIAPDVADDYIKSVKDIAPGFLRLVDKTAADLKNFVTGANESGSHVV GANWGEQFKLPELTVDVRKARPGDRAKHNPEETLESARGIEVGHIFQLGTKYSIAMGA TFTNEQGEEKPLLMGCFGVGVSRLAQAAVEQSYDKDGIIWPVAIAPYHAIVTIPNIND TQQVEIAEKLYAQLNQAGVETLLDDRNERAGVKFKDADLIGIPYRIVTGRAIANGKVE VVERKSRKSHEIAIADVVSTLKQWMKSGNSEQ" gene 27945..28757 /locus_tag="DP116_07790" CDS 27945..28757 /locus_tag="DP116_07790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2993 domain-containing protein" /protein_id="PRJNA477356:DP116_07790" /translation="MEFFALLVSGLLGLVTPVGLVVDQTAENAIRSQLAQVEQLQVRV DNAPTHQLLQGKVAKVRVAGRSLQLKKWQDIRIAALELETDAIELEPRSLGKKRPLFK RPLQAGVRLVLTQQDINKLIQSPQFLVMLQKLKINTGGYSNTAPNSVYHFTKPNVKFL ADNRLSAQVELQDRSLDKPLLIRVESGFRIVGGRNIQLVNPIVAANGEQVPPQFVNTV VNNLNKRLDLSNLEGDGLQVRILKLNMKPEELEIAAFLRVEPSSRFLETPSL" gene 28823..29473 /locus_tag="DP116_07795" CDS 28823..29473 /locus_tag="DP116_07795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126386.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="spore germination protein" /protein_id="PRJNA477356:DP116_07795" /translation="MKEQQGSNRFSPGIIAAITAAVIAVGGGIAFFASKPADNNNNSA RIAPPNNPPVQIPVPAVSQPPVSTNQVGTEQKAEVFWLQNTGSGFKLVPQTVQVKALG KSPNEVLEGAFQQLLAGPTESSETTTIPQGTKLLGVKAASDGVHVNLSEEFESGGGSS SMMGRVGQVVYTATAVDKNAKVYIEVNGKPLETLGGEGLELEQPLTRESFDKNYSL" gene 29982..31034 /locus_tag="DP116_07800" CDS 29982..31034 /locus_tag="DP116_07800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206886.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="D-alanine--D-alanine ligase" /protein_id="PRJNA477356:DP116_07800" /translation="MSVLRILHVAGSPSSDFYRELSRNYAHSCLVATANPSRYEFLIA YITPDGLWRFPSSLSLEDIAVAKPLSLLEAIQFITAQNIDVMLPQMFCLPGMIEYRAL FDLLKIPYIGNTPDIMALTAHKAKTKAIVAAAGVKVPSGEVLRKGDVPTITPPAVIKP ANADNSLGVSLVKEASEYDAALKKAFEFASEVIVETFIEAGREVRCGAIVKDGELIGL PMEEYMIDPEVRPIRSYTDKFKIPIEGDLASTAKNYVPGWIVDINDPITQKVQQEVKK CHLALGCRHYSLFDFRIDPQGQPWFLEAGLFCIFDHNAVVACTANKVGIPLDELFQTM INEALGKIMILDKLPL" BASE COUNT 8934 a 6722 c 6828 g 8559 t 10 others ORIGIN 1 cttcgggatt cataagtaat taagcgaact taatatgact cagtttgagg ggcgcaaaac 61 atgagcaaga taaaacctga agttgctaaa cgacgcattg agtcttttga aaaacgcttt 121 ggagaagcac acctgtattt agcgtatcat gctgcttttc ccttatcact cacaccagat 181 ttgttatacc gtatatgggc taacttccac cgggatattc atggtgaagt gctaggcatt 241 ccctggatag ctgtagcaga tttactgctt tctagtttat gcgatgaagt ggggcatgaa 301 ctttacgaga tgaatgtcgc agtgcgaaat ctactgctga gtcaattgaa ggaagataaa 361 aagttcggtc aacagcggat ttatgaactg tcagactttt tacttgagta tgtacagcag 421 cagcttttga gtaatgatct cgatatccgc gactttgccc aggctcaacg gtggatagct 481 ttagcatata cgcgtacgca acctagtgaa gcagttcgag aactggcgtt agcactttct 541 ggagcctatc aaaaggatag aggggatctc atccgattgg catcattggt agaaacttta 601 gcagaaccac tgggcggatt ggaagaattc cagccgctgt tgatttatgc tcgtgggatg 661 ggaaactttg ctggtggtaa tttggaagct gcaaaaaatg aacttggcaa agcgcttggg 721 aaaggaaacg aaattagagt tgctgaaata agtttgccta ttcctaaaca aataagagca 781 aaacgccagc aagaagaagc agagagatta gggcgacagc aagaagcaga acggctgaag 841 ctgcaggagg cggacagacg ccgacagcaa gaaccagtaa taacccgaca acagttttta 901 aaatgggtag gcttaggagt agcaggctta gtgacagcag tggtagcggg taagattttt 961 attagtagtc tagagtctac ttccaaaact actgcagatg cacctagttc tgcgacagaa 1021 caagcgacac aaccaccatc tgcacaagct acacaagctg ctcagaaaga tgctctgagt 1081 gaaatccgca gaaaacaact taatgctgac attcgtgctc gtgaggggcg gaacgatgca 1141 ttcaatgatg gcagtgctac aaaaagagct gcacgtgaca ttgaaagcaa agttcgctct 1201 aagttagaag ctaatatccc tagcggtcat ttaacaattg cagcttcaga 
agatggcaca 1261 gtaacagtat ctggaactgt agctaaaaaa gaccagttag ctaaaattga cactttgcca 1321 aaacaaatca caggcgttac aaaaattgtt aatagagcca cagttgctca agagtgactc 1381 aggaacagta ccagcaagta atgggtagta atccgtccaa cttcaaaggc gcaaaacgcc 1441 ctgtagagaa agtgtcctgg aatgatgcgg tggagttttg taagaaactc agccaaaaga 1501 ctgggcgcaa atatcgccta ccgagtgagg cggagtggga atatgcagca cgtgcaggaa 1561 cgacaacacc attttacttt ggccagacga ttacgacttc cttagccaac tacaatggga 1621 gtttcactta tgcttctgaa ccaaagggcg aatatcgtca acaaacaaca gaagtaggaa 1681 gttttctacc caacgccttt ggattatacg atatgcacgg aaacgttttc gagtggtgtc 1741 aagatacttg gcatgagagt tataaaggag cacctagtga tggtagtgcc tgggtagata 1801 atgataatca acgttacatg cagcgtggtg gttgttggga ctacaatgca gtctcctgta 1861 gttcggcgtg tcgcgcctac aacgttgcgg gcgtcaggta ctttggcaac ggttttcgtg 1921 ttgtttgggc tggggcgtgg actatttagc ccttgacact ttaacccttt ttacccttgc 1981 cttttttttg tcacgctctg cgcaatcaaa tattttttat tattttcaac ttaacaaaag 2041 gctcaacgat tagtcaacag aaaattcatc agactgaaaa tctaagtttg tcggtaggca 2101 ctacctactt tcctatgtca ttcgtaagca ttggacagta gccagccaac gtgtattcac 2161 tgatgtcaaa attctctaga ttcaataaca attacacctc tgtttgcagt gcgataagct 2221 gaaggcttta agctccgctt atcgcgcctt gcgcggctcc ctggtggagc atcgctcccc 2281 ctaaccctcc atcaaaatcc ccactaaaat ctgcatcaat acttccaaat ccgtaggaac 2341 tcgaatacag cattaaatac tgagtgacta ccctaattaa ctcatagccg agtatcaaac 2401 cgatcatata attttattag tttaatttta ggtataatta gtggcttttg ataacgtttg 2461 taaaatatta gcagaaaaat atccaactga ttttgcacat tggttgctac cggacgaacc 2521 acgaaaagtt aaattattaa aaactgaatt gagtatcgaa ccaattcgag cagattctat 2581 aacattttta caaacagaga atcgtatttt acatctggaa tttcagacaa cagcaaaatc 2641 tgaaactcct attcccttac gaatgttgga ttactttgtc aggttagtgc gaaaatatga 2701 tgttccaata acgcaagtag taattttctt gcaacagaca agtaacgaaa tcgcttttac 2761 cgaagaatat gtaaatgaga tgacaaatca tcgctaccga attatacgaa tgtgggagca 2821 agattcggcg ttatttctga ataatcctgc gttattaccg ttagcacctt taacacagac 2881 agattcaccc caatggttat tatcgcaggt tgcccaaagt attgctagaa tttcggatag 
2941 ggagacaagg cagaatattg cagcttacac agagatatta gcaggtttaa agtttgagaa 3001 agatttaatt cagcaatttt taggagagga aattatgcaa gaatcagtaa tttatcagga 3061 tattttgcag aagggagata agcaaggtga agaacgtaca attatacgcc agcttaatcg 3121 gcgttttggt gaaatagatt catcattaat tgatagaatt cgagtgctat ctgttgaaaa 3181 gctagatgat ttagcagaag cattactaga cttttcagaa gtatctgatt taatagcttg 3241 gttagacgaa caagaagaaa attaatttca ggtgagtgat gcgatggcac ggagtgcccg 3301 ccaaaggcga tcgcctctcc taaccctcca tcaaaatccc caccaaaatc tgcatcaaca 3361 cttccaaatc cgtaggaact cgatacagca ttaaatgctg agtcactacc cgtgaggaac 3421 gatgaaaact cccaaactgc caaatatcgc ctgttgtcac agcaccataa agaatcgagt 3481 cttccgactc cacccaacta tcaagagcaa tcaactcagc cgctaattga gtaaatcccc 3541 gtgttaagtc tgcctacttt gcttccacaa ccagcacact gtgttttgat tatatatagt 3601 agtccaactc tcctctgaga aactgatcaa cctcaattgg gtactcaata tttactgtag 3661 cttctgtgat gtgcgctact tccagaacga taggagcaat taagacttca cgtcgcgccg 3721 cactggttaa gctaacacga gcaattcctt cttctaagcg ctggcgcaag tctgggagtc 3781 tggtaaattg actgaaggac gccattggta aattgattgt ctccctaacc aaacttactc 3841 ccaactcctg cagaatatcc gctggcgcaa atttcatcag aaagtaactg cggaaagtgt 3901 aggtttcatt aggttttagg atagggggac gtggcgtgcg cttagtcatt gcatatgctt 3961 tccatatata tattgatggc agctatcact gttataccaa gttgcactaa attacctcac 4021 cacctgcccc tctctttaat ctaccgtcta cacaccggat cgctttcacc tcaccctcgc 4081 ttgaatcggc gctaaaatct ttccctctcc ttactaagga gagggatgcc cgatagggca 4141 gggtgaggtt taagtcgtga atgcaacatg cgtattacta ttgctaagag tagtgcgatt 4201 gaggaagagc gatgctgcaa agcagccgct acgcgatcgc ctttaaaaag attaaaatcc 4261 gcatcaaagt gatattctat actctatgtt aaaaaactga attcgtgaaa actgacagca 4321 ttttctatcg cctatttcaa gaatttccca gcatcttctt tgaactcatc ggtgaaccac 4381 cacaagaagc taacgcttat caattttcat cagttgaagt caagcaaaca gcctttagga 4441 ttgatggtgt ctttcttcct actaaagaaa gtgataaccc ttcgactgct tttaagctga 4501 gcgaagccga agcttcggct gcgctcagca caagtccgct cagggtgaac cccatttatt 4561 ttgtggaagt ccaatttcaa ggagattcag aaatttacgc gcgactgttt gcagaaatct 4621 
ttttgtacct gcggcaaaat caaccacaaa atgattggcg tgctgcggtg atatatccta 4681 ccagaagtat agatacagca gatagaaaac attatcgtga attctttagc agtcagcgtg 4741 ttagccctat ctaccttgat gaattaggtg aagctgtatc gttacccatt tccattgcaa 4801 cagtcaagtt agtgattgag aatgaagata cagcgatcaa caaagcaagg gagttgatag 4861 atagaactca acaggagata agctcacaac agcaacagcg gcaattacta caattgatag 4921 agactatttt aatttacaag tttccgacaa tgagtcgaga ggagatagag gctatgtttg 4981 gattgagtga gttaaagcag acaagatttt atcaagaagc ttttgaggaa ggtaagcaag 5041 agggacgttt ggaagctcaa ctagaaacag taccccgttt cttggcatta gggttgactg 5101 tagaacagat agcacaggcg ttaggcttga gcgtacagga agtgcaacaa gcactgcaac 5161 agcaatcttc caatgagagt agtcgttgaa aaaggttgat ggggataatt ttcacctttg 5221 atagcaactc aatattaagg cgttccatgc actttcacgc acaggtaccg ataaacgcga 5281 taatttcttc tatacagcag tgctgactag gagagaccac catgacatcg caagaaatcg 5341 tgatctggct acaagaacgc acggctttag gaattctgtc gcctgaacct ttgaatgcga 5401 tacgcgaagc gtcagccctt tgggctatcg cccaagttat cgaagaacaa gttatcccac 5461 caaaccaccg tctagtaaca gaagggactc caccagaagc gctttacatt ctccttgaag 5521 gtcagctaga aagcgatagc aacaataaaa ccaacccagc cttagctatt ggtttcctcc 5581 ccggttcgat cattcatctg caagaactca tgttagatga atcggctcag cgtacaatca 5641 caacagtcac agaatgtcat ttatgggttg tgcctgcgga taaatttcgg gaattactca 5701 cccaataccc cgaaattgct caagctgttt ctcgccaaat ggcacaggaa ttggctcaac 5761 ttacctctgc tctcacctac gaacaagaac gttccactgc gttgcgacca tatttagtcc 5821 ccaaggcgga acgtggaatt gtgggaacaa gtcgctatgc tgtgcgcctg agacaggaaa 5881 ttcgagaagc tgctaatgat cgcaagtcgg tgataatttt tggggaacca gggttaggaa 5941 aagatactat agctgctctg attcactttg gttccaaaca gcgacgagaa ccgattatta 6001 aagttaactg tagtattctc cagacaagcg gtgctgattt gtttggtcgc accggaggta 6061 aaccaggact gctggaatgg cttggggaag gcactttagt tctgaacaac attcaagaaa 6121 caccgccaga gttgttaccg gtgttagcaa gtttactaaa aacgggcaaa tacacccctg 6181 taagccgttc gggagaagca actcctgaac cccgtgtcag taatgctcgc atcttgatag 6241 ttgcagaaaa aactcagccg caaattgaac gctgtgtcgg tcatattatc aaagtcccac 6301 cactgcgggt 
acggaaagct gatatcaaag cgcagattga atactacatc agtctttaca 6361 ctcgctcaag aggtatttct aaaccaaaag tcacgccaga agctttgcgc cgtttgcagt 6421 cctatgattt ccctggcaat ttgaaagagt tgcaaaattt agtggaacgg gcgattgttc 6481 aagctgggga ggcaaaagaa ctcacagaag aaattttctg ggcagttgat actaagaaga 6541 agcaatttcg ggtgaatctg ttgaatgcct atcctgaatt acgaaagttt cttcgtagtc 6601 cttggtggcc tgatcgcatt aactatggtt ttactttaac agcatttgcc atccttgtag 6661 gagtattgtt ttttggtccg caaacgcgcg atcgcaattt cgccctaaat ctattttggg 6721 cttggtggtg gcctttcttc ctactagcat ttccctttct tggtcgcgtt tggtgtgcag 6781 tctgtccctt catgatttac ggggaaataa cacaaaaact ctcccaaaag ttctttcctg 6841 gaaaacttaa acgctggccc agacatcaag cggaaaaatg gggcggatgg tttctctttg 6901 ggctgtttac tctcattttc ctatgggaag aactctggaa tttagaaaat acagcatatc 6961 tttctggttg tttgctgctt ttaattaccg ccggtgcaat gatattttct gccatttttg 7021 agcggcggtt ttggtgtcgc tatctttgtc ccatcggtgg aatgaatggg ttatttgcta 7081 aactctccat gacagaacta cgagcgcaac aaggtatctg ttctgccact tgtaccacgt 7141 atcaatgcta caaaggtgga cctcagaaag gagaggggat ggaaactggc gggtgtccgt 7201 tgtactctca cccagcacaa ttggaagata acagagactg cgtgctgtgt atgacttgtc 7261 tcaaagcctg tccccatcgt tctgttgagt tcaatttacg tcctcctggg gttgaattgt 7321 ggacaactca cataccccat agctatgaag tcgcactgtt atttttacta ttaggtggag 7381 tattcctcca tcacttaagc gaattgcagt cttggctggg cttacatttg gatttaaccc 7441 agtttttacc tcacttggga ttatctttgc tggctctgct tatcccagtg gctatccctt 7501 ttttggcata tggaattatg caaatattat atctaagtaa caaaggttta aagtccactc 7561 agcaaaatcc gaagccgcga cgatttgtag agcttgctta tggctactta ccactcgtac 7621 taggcggaaa cttagctcat tatctgcgtt tgggtttagg ggaaggaggg cggattttgc 7681 ctgtcacttt tgcgactttt ggtttgaaga gtgaacaatt atttacactg gtggctcatc 7741 cagcggtaat tgactttttg caaggtagta ccctgattgt ttcagttctg ttaacgatag 7801 tgttaacaca aaaaattgct cgtcaaccga tgcgttcgct cttttggcaa cacttagctg 7861 ctattggact aggagccagt atgtgggcga ttattgtatc tggtttatag gtttgtagtg 7921 agcacttata cgcgcgttgt attcatacgt attaccctcc ggggaagacc tccggtctcg 7981 ctgcgaatcg cgccagatgc 
agcccggtgg gagaccctcc tgcagcaggt tgtcgtcacc 8041 tcaccccgcc cttacgggca cccctctcct tgctaaggag aggggcagga ggtgagtttt 8101 tgaacggaag tattaagcac cattgctaca caaatttgaa ttttgaatga gcgaattttg 8161 aattatttat ggcaagtttt atagaaccac cacggttacg aattggtgaa gacagcgaag 8221 aagaacgacg cgccacttgg ttagaacttt tctatgattt ggtgtttgtc gttgctgtct 8281 ctcaactcgc ccactatctt cacgatcatg tttcgctatc aggtgttttg ggatttgtag 8341 ctctttttat tcctgtttgg tggtcatgga ttggtaccac attttacgct aaccgctttg 8401 atagcgacga cgtaacacat cggttactaa ctgctgtgca aatgctggct attgctggac 8461 tagctgtcaa tgtccatcac ggcttaagtg aaagttgcac tggctttgcc ctttcctatg 8521 ctctcggtcg agttgtactt gttgtagaat atgtccgcgc tggatggcat attcccacag 8581 cacgtccatt gacaaatcgt tacgccacag gttttacaat tgcagctgtc ctttgggtta 8641 tatcaggatt tgtaccaatt ccttggcgtt ttgtattctg gacattggga ctcattattg 8701 attttgccac acctctttcg gcacgcaagc tacagctaga actacctccc cactcctctc 8761 acttgccaga acgtttcgga ctgtttacca tgattgtctt gggcgaagca attgtcgcgg 8821 tggtcgatgg agtttccgag cagaactggg atgttttaac cgtaatcgcc gcagtgttcg 8881 gtctatgtat cgcttttagc ttatggtggg tgtattttga taaccttggt ggcacaccta 8941 ttcagaaggc gcggacagaa ggacgggtaa ccattttcaa tgtctggctt tacacccatc 9001 tacccttggt tatgggtatt gttgctgctg gagtcgccgt ggaactagta ttgttgagca 9061 agccaatgct agcgctatcc gatgcagtac gatggctact ttgtggctcc atagcattat 9121 gttacctagg tttaggtatt ctccaccgga ttggggtcat ccgctactgt aaaagccgtg 9181 ccaagtttcg cattggagca gcacctatta ttttggcgat cgcacttttt ggtaaagatt 9241 tgttacctgt tgcggtcatt ggactggtag ccttggtttg tgctgtgcaa gttgtgcaag 9301 atgtaactca aagtcgtcct acaacacgct tggttgagcc agaaatttag tttcaaatac 9361 ttgattagaa ttagccgccg cgcaagcccc ctatttcgct ataggtgggg cacggcttaa 9421 aataagtagt aatactcgaa aataagcggc gttggacgag caattaagca tgagtgcttc 9481 catgagagat attatcactg tctacatttg gtttcccaag gagttcttag gtcacgcttc 9541 aatgcaggta ggcatagata cttatataag cttttggcct aacaaagaaa ttataaatag 9601 gtcatcaatg cagactagtc aaagtatttt taatcataaa aaccaaaaca ttaatcaact 9661 tttattccag catttggctg atattagaag 
tactagttat gaagacgatt gtttgatttt 9721 aggtcaagct gactctcaaa gagaagcaga ctgcaaagtt gagttattta acctacctaa 9781 ggagccgata aaaacatttt ggagagaatt taccaatcaa aaaaattctt accacttgat 9841 aaaaagaaac tgctcatcag tcgttgcaga agctataaat gaaggctgga aagcttatat 9901 cggtaagggt aaaaacgcag gtaataaaag ctttgcagat tttgaaaacc aagttgaata 9961 taaaattccg gattttgaag ctttaagctt tacgacagcc gcattaaatt tcggtaaatt 10021 gcttttttgg tctccaaaac aagttttact ttatgcacaa atggtcaaaa gattaacaga 10081 ttgaaaaaaa ataccctcaa atttgcaatt atcgtgttgt cagctacagt tctaccagag 10141 ataaataagc aaaaatgaga agaaatctct aaaaatgcag ctttatcctg taagtcatgc 10201 tgccattaca tgaaacctcg aatcattgtc tgtggattaa gtcgcactgg atataaggtc 10261 tttcgtttgc tgcgacaaca gggggcgttc gtcgttggcg ttcatcatca acccattcca 10321 ggcgaatcat caggagatct gattgtcggc gacttgcaag cagctagtac cctaacagca 10381 gcaggaattc agcaggcaca cactttggtg attgctggat ctaacgatga actgaatctg 10441 tcaattatga tgcaagcgcg ggtgttgaat ccgcaaattc ggattatcaa ccgctttttt 10501 aatacaaatt tgggcgatcg cctagataaa gctgtcccta accacttaag tatgagtgtt 10561 gcaggtttgg cagcacccgt gttcaccttt gcggctttag gaaaccaagc aattggacaa 10621 atcaaacttt tcaaacaaac ttggcctatc cacgaagaat atatagataa aaatcatccg 10681 tggctaggtc gcaagctgag tgatttgtgg aacgatcgct cgcggatgat gatttactat 10741 atgccagtca agggcgagat ggatttggtt gctgcagtgt tgtctggaca agagttaaag 10801 gtgggcgatc gcctgatcat tggtactcaa ccgtgtatcg gttccacccg caaatcagtc 10861 attgcaaaac ttctcaaagt cctaaccaat ttgcgccagt tcaagaaaca cgctgaatca 10921 gttgtggcaa tgactattgt actttttgtt attattgtga ttgctactgt cacctacact 10981 tctacgaata caaatatttc tattatcgat gccctctatt ttgcagtagg catgattacg 11041 ggagcaggtg gtaatgacaa agtagtacaa aatgctcctg gcagcattaa attatttacc 11101 gttttcatga tgctgattgg ggctgctgtc ataggtctat tgtacgcgct gctcaatgat 11161 tttgttttgg gtagtcgctt caagcaattt tgggatgcag cacgagttcc tcaccgccat 11221 cactacattg tctgtggctt gagtggaata ggtataaaag ttgttgagca actctctgca 11281 agcggacatg aggtcgtcgt gattgagcgc gactctaaca acaaatatgt caacactgct 11341 cgtggagcag gtattcctgt 
catttatgct gatgctagtt tctcagccac actcaaagtc 11401 gctaatttag attctgctgc tgcagtgctt gctgtcacag gtaacgatgc gactaaccta 11461 gaagttgccc ttaaggcaaa aggcatgaca ccccaagtgc cagtcatagt ccattacgca 11521 gaccccgatt ttgctcgtat ggcacaagag gtgtttgact tcgaggcagt cttgagtcct 11581 gctgaactcg cagccccagc ctttgcagct gctgcactgg gtggaaaaat actcggcaat 11641 ggcattacag cagatagtct ttgggttgct tttgcaactt tgattacacc cgtacaccct 11701 ttttgtggtc agctggtgaa agatgtagcc atgtctgctg actttgttcc tttatactta 11761 gagacgaatt accaaactct tcacggctgg gatttactgg aaatgagtct gagcgctgga 11821 gatgtgttgt atttaacaat gccagcaaca cggttgtatc agttgtggcg tagtgcgcca 11881 ccgcagctta tggcgagtta gctgcagcga actggaaaca taaacgcccg aaactgcaaa 11941 cgaagtagcg attatctgaa ctaatccgtt caaaaaaata tctgttttat ccaaacatat 12001 atgtttagtt gaaatcccca tcatcgattc caaacaaaat attgaatcga cagtctgtcg 12061 attaattacc tctggttaat cgaacgtgtt ctaaaatctg ctgcttgcgc tcaaccgagt 12121 tggatttaga attattaaag ctgcttgtac tccgagccac gtgctccaga attctttttt 12181 gacggtcaga ctgtgaagcc ataatttacc aagttaacta cagattacta gagcaaaaaa 12241 tcttgattag caaatgcatc catttacgat ggtttgagct ttgccatgat ttttgcttca 12301 tcttaacgcg gagacttaca gccttcataa ttctttagga ggaatataag ataaagtctt 12361 tatctttcat cgttcaacct taccagcaaa caatattaaa cactttccat ccacgcaaca 12421 cacttgtgca tttgctgact catagtttcc acatcagcat gatacctacg cccatgacct 12481 ggaagcaccc actcaaacga gtagttagcc agatcacgca tcgatttaat ttgctctgac 12541 caagagtacc aacagtgatg gggaaaagca atgagttgct tgagactatc agaccatgcc 12601 aaatggtcgc cagtaaataa aaacttgttt ttgtaaagta aaacagtgtg tcctttgctg 12661 tgtccgggaa ctggaataat gagtacatca ggggctaact caaatggttc tgttcctgtc 12721 agttgaattt ccacatcgcg agtcccagaa gtaatatcat cctcgtggag gatgcgctga 12781 cactgaaaat gctctgcgaa tttttgatga tccgccacat catctctgtg agtcaggtac 12841 atataacgaa ttccccccat ctgttccaaa cgtttcacca agggaggagt aaaccgagga 12901 gaatctacca gaatattacc ctcaggtcgt tcaatgaagt agctaccagc gccgtaagag 12961 ttttccgaat gatagccgca gtggtagaca ttatcctcta caggaatagg aaaactgact 13021 
tgagcaactt tgacatcttg gggcttttca actgtaccaa tggaactggt gggacaagct 13081 aacaaagctt gaagcgctgc taatctttga gcctcattaa ctggttggtg ataaaccacc 13141 gactgttcat caacttctgt aaacacctct ggagccatcc accggcacgt atcacaatct 13201 atacaggaag tatcaacata aaaatcgcca ctgacatttt gggggcgacg cagatttaaa 13261 tgagccatat taaacctctt tcacccaacg acttgctggg aagggaacaa accctgtatc 13321 tccaggttag caaatgcttg gaagaactcg tcaaaaaagc aaagcaccca gaggttctct 13381 gagtgtaagt aatgattatt ctgatgtcga aacttgtaat cgttcgatta aattaaagtt 13441 aaagcattag taaaccttag acaggcttaa aaaattagat aaagtttggt agtttgaaat 13501 gaaaatgtta taaaaagcaa gaatttacag aatcaaccct ggtattcgcc gaactaagac 13561 tatttagcct attcaagact atttataaga atcagtcagt tgtgattctt cgctttctaa 13621 gccaatcaag cgataaatga annnnnnnnn naagaatcag tcagttgtga ttcttcgctt 13681 tctaagccaa tcaagcgata aatgaagctt aaaactttct tcatggtcta caccggaatt 13741 ttttttattg tttgaaatgg tcaggtcagg aaaatcaaga atttacagaa tcaaccctga 13801 tacttgtcta tccaagacta tttataagaa tcagtcagtt gtgattcttc gctttctaag 13861 ccaatcaagc gataaatgaa acttaaaact ttcttcatgg tctactcctt aatttttttc 13921 ttggtgtctt gatacatata cattcaacat gtctgcttgc gtttccagaa aaaaactatc 13981 tttttattgg aagtcctaaa atattccaat cattaatatc aacaaaatgt taaaaagtat 14041 tgacaaaacg gacattgttt tatttacccc tctagtaatg ccgcagtaca gcatattcgt 14101 ctgcacagaa actcagttag agtgagagta agtacttttg tcactgaaaa taatttttag 14161 caaaagtctg gtcaagctgg gcggcggtaa gaattgtgga tattaatcgc tgtatattag 14221 tttctgatac tgctgtgttg attaagctgt cggctatcag ccttcagcag ttctgagtac 14281 aaagtaccga gtgcagcaaa ctgcctgcgg agtcgtagag atagctgccc tgcaagcgtt 14341 tacggcaagc ccaacgggct ttacggcaaa atcgcctttc gcagaacact tgtgtctaca 14401 attcttctcc acccttggag agtgtccgtg gcgtctacag cacttcctac tcagaacttt 14461 ctgcacactg actgctttta gcttaagggt tttatttcat tttgagtgga tgaaaagagg 14521 taaaaaatcg tgaaagcatt ggtgcgttta ttaacagtct ttagtttgtt gttgggatgc 14581 ttaggatggc tgggaacagc tcaaacagcc caagcagctg atttgagctt aactgctttt 14641 cgttcagttc cagttctggc agctgcaagc ctccgcaacc ccgcagatga 
aaacctagca 14701 gaggtttatg gtaaaaaaat tgatttgaat aataccaacg tgcgagcttt tcaaaagtat 14761 ccaggactgt atcccaccct cgcagggaag atcataacaa atgctcctta ccaaaaagtc 14821 gaggatgtgt tggatattgc cggattgagc gaacatcaga aacaagttct gcaagctaac 14881 ctagaccact ttaccgtgtc agaagttgta cctgccttca cagaaggaga cgatcgcttc 14941 aacaacggta tctacagata atcccatttg cagatcatta agctgctatg tgatccactc 15001 ctttttggga gtggaaatta gtttatttag gaagagggac aatgaggata agaggacaag 15061 gggaattccc ctactccacg gtagtcgcct ggctcggggg gaacccccct tcgggaacgc 15121 ccgtcaccta cggcgggaaa ccctcattca gtgctggtct caccgcacgg cgctgctccc 15181 ctactccctc tactccctct atttccccta ctccccacac cccctgcttt gcctcaaaca 15241 gatattaaaa gccaatttga tgttattgtt gtcggtgctg gtgctgctgg actatacaca 15301 gcactgtgca tgcctgccga gttgcaagtc ggcttgatta ccaaagaaac agtttctctc 15361 tcagctagtg attgggcgca gggtggaatt gctgcggcga tcgccccaga agattctcct 15421 tctctacaca ttgaagatac aatccaagcg ggcgtcggtt tgtgcgatcg ccctgctgtg 15481 gaatttctcg ccaaacaagc tcccagctgc attcaatccc ttgtcaactt aggggtcgct 15541 tttgatcgtc atgacagcca tttagcttta accttagaag cggctcattc ccgcaaccgc 15601 gttcttcacg ctgcagacac aacaggtaga gaagtcacga ctactctcac agctcaagtc 15661 ctgcgacgcc agaacattca agttattcaa caagccttgg ctttgagtct atggctagaa 15721 ccagagacac gcagatgtca aggtattagc ctgttttatc aaggtcatgt cagatggatc 15781 agagcaaatg ctgtcgtttt ggcaacaggt ggagggggtc aggtgtttgc ccaaagcaca 15841 aaccctgctg tcagcacagg tgatggcgtg gcgatcgcat ggcgtgctgg cgcaattctc 15901 agggatttgg aatttgtcca gttccacccc accgctttaa caaaacctgg tcgctttctc 15961 atcagtgaag ctgtgcgcgg cgaaggggcg caccttgttg ataacgacgg acggcgtttc 16021 gcctttgatt atcacccagc aggtgaactt gcgcccagag atgtggtcag tcgtgcaatt 16081 tttagccatt tacaacgcac ttcaactgat ccagccactg ctcatgtgtg gttagatatg 16141 cgccccatcc cagctgaaaa aatccgcctc cgctttccca acatcattaa agtttgtcaa 16201 cattggggtg tggatgtttt ttcacaacct attcccgtcg cccctgctgc tcattactgg 16261 atgggtggta ttgtcaccga tttgatgaac cggacaaaca ttcctggttt atacgcagtg 16321 ggagaaacta caagtactgg ggtacatgga 
gcgaatcgcc ttgcaagtaa ttccctactc 16381 gaatgtatcg tatttggcgc acaaatggca gatctccaga tttttccaga tgtgagtcgt 16441 gtgtcaaacc tgtcagaaac tcttgctata cgagagtttc gcgccgatgt aaccgaatgg 16501 aaaccccagc aagaacattt agaagtactg cgagataaaa taccacgtct tctttggcaa 16561 agcgctggta tttgccgaga gcaatcaagt ttggaaagcg cgatcgcaac tcttgagtct 16621 tggcagcaaa attatatttc tttgcctgtg agtcaatttt tgctgtcttt gcgtcccaca 16681 gagtcagctc gtttagagat acctgacgtt gaacgccaat tgcggttgtg ggcggaaact 16741 cgcaatttac tggatgtcgc ttttttaatt cttaaaagcg ctgcctttag aaccgaaagc 16801 cggggaggac attaccgtct agattaccct caactagacc ctgactggca agtccacacg 16861 cttatacaac aaaacaactg gtggaaatct ccagtttttg ggtgattgat ctgttatatc 16921 gagttcgcct aattacttac aataaaacca ccttacgttc ctaacccctg cccctaatct 16981 gcattgtgtg agcactgact tcccctctcc ttaataagga gaggggtgcc cgatagcgta 17041 gcgtgccgtt aggcataggg cggggtaagg ttcttcgttt tttataagtc ttcattagga 17101 cataacaata ccacataaaa accaaaaagt caactaagtc tacagttaac atgcatgact 17161 ccagttgact tttttgaaaa aacccaaatt gaaaaacaaa cttgtatcaa cgtcgcaaag 17221 aatacttaaa attacgcgga cctgtataac cggaaatttt cgccagttct tctaaggttt 17281 ttactcccgg ataagtttga ccgttgatta tccaagtggg atagctttca actttagcag 17341 ctttacacgt atctgtttga gcgtaggggt tatccgcagt agcacaatct acgtgactga 17401 tttctgcgta agcttcttta ccaaagacta acttttgttc atgacagtga ggacaccacc 17461 aagcaatgta ttccttggca cctatttttg tgaggtgacg tgctagagcg atttccgcat 17521 ctgtggaggt ggtggtaatt tcccaaccaa ctcctggttt cggctcattt ccgggtttgg 17581 gttgaaagct gacttgtgtc ccttgaccag aatttgagac tgttgctgaa ccaccattgt 17641 tattcacacc agaataaacg cctaaagtag caatcagcgt caccatgcca acaataatgg 17701 cagtaaagaa aatttgccca atatcgtccc atgtgcgacc aatgatagtc aagactaaga 17761 ggctcaggga gaaaagagcc gaaccaagac agtacagaca aagtgcttta attgtgaaaa 17821 acagcagata cattaagtag ccactgaaga tagacatggc gatcgcacct gccaaaagca 17881 ataaccatgt ccaattttcc agctttgtcc ggctagcttt atttttcgct gggtcaactg 17941 ccaaaggagc caaggcgaag gtcgccatac tagcgtaagc gagaaagcca aacaaagcta 18001 gaggcagtcc 
caaaactgta gcataggggc tagaaagtac tacatcacag ctattggtgg 18061 ggcaagcggc agatccttgt gtcaacttaa caacggtcag ataagctgtt gttaatgcgc 18121 cacaagatgc gatcgcggca attaatatcc gcgaccattt atgaatccaa ggaatagaac 18181 gacggcgact cataaactgc aataaaaaat tgggaatgag atactaggaa ttgggaacga 18241 ggaattggaa atgattagct gttagttgtt agttttttta ctcatgactt tccactaacc 18301 actacacttg ttccccattc ccaactcccc attcctcagt tacgagtata gctttagctc 18361 ggaaattgat ttggtttcac cttttgagtt ccaaccttta cgtgcagctt ccacaaattg 18421 actgactcga tttggatcta tcggttgctc acgacgaccg tggcgtttca aagcactgga 18481 aacaatgaca ccatctgccg cctgtatcag tgtagcaatg ttttctaaac ttgctccact 18541 accaataaac actggagtgc cgctagcagc agaagatgcc aattccaaat cttctgggct 18601 tgtaggatga cccgtcgccc aaccagatat aatcaccgca tctgccaaac cgcgatcaat 18661 ggtgtcttgc actgctgtgg tgagattcac agcactcaag ggacgagcat gcttgaccaa 18721 cacatcagca aaaattttga catcgcagcc taactcccgc cgatagcgta ggagttcata 18781 agcttctccc tcaataatgc cctgatcggt tgccataaca cctgtgagga cattcacgcg 18841 gatgaattgt gctctgacac aactggcgat cgccattgca cttctagcgt cgttccgtaa 18901 aacatttatg cctacaggca gcgttaccat attttgtatc cgctgtacca atatagtcat 18961 ggcactcaca accgctggat caacctggtt tttgggaaac ggcgcgtcga agaaattttc 19021 tacaataata ccgtcaaccc ctccacttgc cagggctgtt gcttcttgtt cggcacggtc 19081 aaccactgct ttgaggctac ctccccaacg gggcgaagta ggtaatggta gtaggtgaac 19141 tacgcctata attggtgttc gggttttaaa tagctgatat aagtccacgt ctttaacccg 19201 ctttgcggag tcatcagtcc tgagtcattc atcatttgtt attattactg gtgactattg 19261 actattgact cattattcac tactaattaa tgaagtttgc tttttatcac gatttgtttc 19321 aactagctct taaacaattt tatttgtgat ttgcacaact ctggcatttt cgacaacttg 19381 cctgtactgt tttctttgag tttggatttt gaatcaccta cctcctccta gtagatggga 19441 tttggtgcca aagtttccat aagatatgtt aagatagtat agataaataa gaggagcgtg 19501 ccactcacca gacgcgaggc agcttttcgt ggataaggca tctcgcctat cacttctttc 19561 aggcattgcc tgagtagagg ggtagcatta ccgttatgca ggcgttataa cgcgctattt 19621 tccttagggt agcgacttct caaaaagcgc cacacacagt tggttttggc gagatgaata 
19681 gttcagatca ttaaacctga ccaatgtcca aatcgagagt ggcttcccaa taggtaaatt 19741 aacaacacac acgttgcggt aatgtaaaat accaaaaggc agtttattaa caaagattaa 19801 cgactagtgt caaagttatc aaaactctgc ttattttccc ttgaccaaca gattaataga 19861 gctatcatag catgaactct atcttaatca tggttgtagc tccctgaggg tgtatatgac 19921 tagaaagtca tagcaaaatc aagaggtttc gaccttcctt agggggtttt aagcaaaaaa 19981 ctagcccaat aaccgttgaa gtacatgaaa aaacctaaag atttttgtga atatgggtga 20041 caaggcaaca attgcaatat ctcacttagg ctgtgagaaa aacagagtag atacagaaca 20101 tatgctcgga ctgctagtag aagcagggta tggcgtagat tctaatgaag agttagcaga 20161 atacgttata gttaatacat gtagttttat tgaagcggca agaaaagaat ctgttagaac 20221 tctcgtagaa ctggcagagg cagataaaaa aatcgtgatc acaggctgta tggcgcagca 20281 cttccaagaa cagttgttgg aagagttgcc tgaggcagta gcagtggtgg gaagcgggga 20341 ttatcacaaa atagtagatg ttattcagca agtagaacaa ggtaagcgag ttaagctcgt 20401 tagttcacag ccaacctaca ttgctgatga gaccacaccc cgttatcgca ccacaacaga 20461 aggagtggct tacctgcgag ttgccgaagg ttgtgattac cgatgtgctt tttgcattat 20521 tccccacctg cgggggaacc agcgatcgcg taccatagag tcaatagtta ccgaagcaga 20581 acagctagca agtcaaggtg tacaggaaat cattttgatc tcccaaatta ccaccaatta 20641 cggtattgat atttatggca agccgaaatt agctgaatta ctccgggaat tggggaaagt 20701 caaagtgccg tggattcgta tacactatgc ctatccaacg ggactgaccc cagatgtgat 20761 agcagcaatt ggtgagacat ctaatgtctt accttacctg gatttaccct tgcagcattc 20821 ccatccagac attctccgcg ctatgaatcg tccctggcaa ggacgggtga acgatgggat 20881 tatcgaacgc ataaaagaga cattaccaga agctgtgctg cgaacgacgt tgattgtcgg 20941 ttttccagga gaaacagagg aacattttga gcatctgtcg cagtttatcc agcgtcatga 21001 atttgaccat gtgggtgtgt tcaccttttc aaaagaagaa ggaacacctg cctacaacct 21061 accaaatcag ctgccccaat tcgttatgga tgagcggcgg aacgtaatca tggaactgca 21121 acagccgatt tctttgaaga aaaatcaact cgaaattggc aaagtcgttg atgttctgat 21181 tgaacaagaa aatcctgtca caggacagtt aattggtcgt tctggcaggt tttccccaga 21241 agtagacggg caggtatttg taacgggaga agcacggtta ggaaccatcg taccagtagc 21301 tatccaaaaa gctgatgctt acgaccttta tggtcaagtt 
gtcaccaact aagttgtcat 21361 ttgtgagcca gccgttgcca agttggtttc caacagtggg gcgagtgacg ttcgctcgta 21421 aaacgcacgc gactttgaac aaaagacgtg tcataggcat accggaaggg ttctttgtct 21481 tttgacaaat cacaaatgac ttatcgaaaa agtaaatgaa aatacaggag agttgatgaa 21541 tctttcgttt ttagatttag gaatttcaca agagtgtgtt cagcagttag aaaaactagg 21601 ctttagcgca ccaacaaaca tccaagcgca agcaatacca catctgttat caggtcgtga 21661 tgtcgtcggt caatcccaaa ctggaacggg aaaaacggca gcattttcct tgccaatcat 21721 agatcaggtg gatgtgactc aaaaagctgt ccaagctttg gttctaacac caactcgtga 21781 attagctatt caagttcacg acgcgatcaa tcacttcatt ggaaaccaag atttgcgggt 21841 tttagcaatc tacggtggtc aatcgataga tcgtcaaatt ttacaactca aacgcggcgt 21901 tcacatggtc gtaggtacgc cagggcgagt gatagacttg ctggagcgag gctgtttgaa 21961 gctggatcgg gtgaaatggt tcgtgttgga tgaagccgat gaaatgttaa gcatgggctt 22021 tatcgatgac gtagagaaaa ttctctctca agcccccaaa gagcgccaga ccgctctatt 22081 ctcggcaaca atgcctccct caattcgtca gttggtcaac aagttcttac attcgccagt 22141 cacagtaacc gttgagcaac caaaagccgc tcctaacaaa attaatcagg tggcttatct 22201 tgtaccgcgc cactggacga aagccaaagc actacagccc attctcgaaa tggaagaccc 22261 agaaacggct ttaatctttg ttcgcaccag acggacagca gcagaactca ccagtcaact 22321 gcaagcagct ggtcacagtg tcgatgaata ccatggtgat ttgtcgcaac aagcgcggga 22381 acggttattg acacggttcc gcaatcgtca agtacgctgg gtagtcgcaa cggatattgc 22441 agcacgaggt ttggacgttg atcagttgtc tcacgtcatt aactttgact tacccgatag 22501 tgtagaaaca tacgttcacc ggattggtcg tactggtcga gctggtaaag aaggcacagc 22561 aatttctatc gtgcagccct ttgagcgacg caagcagcaa gtgtttgaac gccataatcg 22621 gcagacttgg caagtgctgt caattcctac acggacacaa attgaagcac gacaaataga 22681 gaaattgcaa actcaggtgc gagaagcatt ggcaggtgag cgtttagctt catttttacc 22741 aatggtgagc gagttgagtg aaaaatacga tcctcaggcg atcgccgcag cagcattgca 22801 aattgcttac gaccaaaccc gtcctgcttg gctgcaatca gaacctgacg ttacagaaga 22861 agatctcgta ccgccaactc ccaaacccag actgcggtca aacaaccgtc gcggcgagtc 22921 ttcaggcgat cgcaatcgct ccagttgggt tgcatccgac gccagcaata cagaaggagg 22981 acgtggtact cccaagccta 
aattgcggac aggacgtcgc gagacttcgg tgtctccaaa 23041 aaaactcggt tccggagcag caagagaaac agcttcctag gttttagtta tgagtcataa 23101 gtcatgagtt cctgactttt ggctaccggc ttttcgttca gcaaaaggtt gataagctta 23161 tcaacccggg caatttcatc atcagtcaaa atcagcccaa tgtgccagca tggtttgtat 23221 tggttgggct gtttttgcat ttgttactaa tgagtgattg tcgtaacttt gcgccattca 23281 aaataagtca taggtggctt ctgccttttt tcaagccctg cactttataa aatgctatga 23341 ctataaccac agcaaaacgt tttactattg ccgaatatga acgtttagca gaacttggtt 23401 tctttcgtga ggatgagcga gttgagctga tcaacggaga aattatccca atggtatcga 23461 aaggtagacc acattctgtt tgcgaaaccc gtttatttcg agagttattc aagcttgttg 23521 gagaacgggg gacactacga ggacaagaac caattattat ttttaactac aaccaacctg 23581 aaccagattt tgtgattgca caaaatcgag atgatgacta tctcagcgct catccaagtc 23641 ccgtcgatat attactgtta attgaaattg ctgactcttc tttaaaatat gatcaagaag 23701 tcaaattacc gatttatgct caagcaggta tttctaatta ttggatattt aatttagtag 23761 gtaatagctt agaatgctac agcgaatctt atcaagattt acaaggtaaa tttggttatc 23821 gtaggaagtt aattgttctg ccgaatgaat ccgtttgtct accgtgtttt cctgatttat 23881 ccttagactt atcaaaagtt tttccccagc aatttgcctg aaaccatact aactaaaatc 23941 cccaaaattc tatctacagg gggaacacca tactcagtca agtcgataat aacgttttga 24001 ccttctaatt tcaggaatct ccaaagactg ccagttgtga ctgcaccata aatgttaaaa 24061 atgggatggt cttgttgttg attgaaccgt tgagccgcaa ccatttcagc aatgcattgt 24121 ccaagtccag catttaagtc atcttttttt gcttctacaa tcactgcaac aggtgccgaa 24181 atcaggattt gttcaggaga acgactaatg acaaaatcac aaattccatt gagccctaac 24241 gattgatcaa cggtaaaatc aatcccagag aataaactaa tttctcgctg caatatttct 24301 cttaactcaa ttaaaatagg acaaataatc atttctgacc ttgccttctc tgtaccaaga 24361 gcaattgcta atggtaaact tttgtctaaa tattgagcaa gataagcgct gggagtaata 24421 gcttgatatt cagaaagaaa tgttccttgc gttagagtaa gcttgaaatc ctgcgatact 24481 cttcttaggc taaaatcgct ataagccata aactgtcaca cctttcattt gcatacagtt 24541 agggactaga cagattgcta acactagttt tattaatttt gaattttgaa ttttgaattt 24601 tgaattattt attactcccc ctgcaaaatc aaatcctcca tcgccagttg taagtccttt 24661 
gtcaaatcag caaccgcttg ctttgcattt tgacgactcg ccttataaac tggataacgt 24721 tcagaaacag atattggctc acctatcgtt atttttacac tttgcttgcc aagtgttgga 24781 cgccccaagg aattttcgcc tctaattgta gtcaccatat tccatatcag taaggttgtc 24841 tcagcaaagc gttctgctgt gggtttttct cgtatgtatt tacctgtgac agcaacgaaa 24901 ctttccacca gcctcatatg ccacattcgt aaattagctt cttcggcaat gcggttaccc 24961 aaagcacgtt ctaaaggtga tagtgctgtg acatccttaa attcttctcg aaaaatataa 25021 ttccagcctg cctgttctac gcgccgacag cggtcattaa aattcccttt gggttgtaag 25081 tcaaaatact gttccgcaac acgtaacacg acgtctaaca aagcattgag acgaaatgct 25141 aagagttcat tgacatcttt tgcttcctct tgagtgggca gtttttgatg atagaatcgc 25201 gtgtaaaact tttccattaa agacaataaa tgttcgccta aacgaaacaa gcgtggatag 25261 agagagtcga aactgctgtt ttctgagtgg ctttctatgc ttacagttaa accgctagtc 25321 gcttctaatt cactcaaaag gttggcgatc gcatcccaag gagccgagac gtaactatat 25381 ttaattccaa ctggtacaat aaaaacctgc tcatgtcgat tgtccccgtg caaatcttca 25441 gcacaccaaa acccgagttg agcaataccc ggttccaatg ggttgacaat ttctgaaaga 25501 ccattcgtcg ctccttctgg tgcagccgcc attgggaact gaccattggc aaacaaatca 25561 cgcgctgaac gtaaccccgt ccaatctgcc ttacctcgca cgataggcgt cccacccaag 25621 cgtgaagcca cccagccaat gtaggaacca gcccatagag gaattccccg gtcgtagata 25681 aaatgagcgt gaatcggagg ctgtagcact acacccttgt gtcgtgctac ctttggcaaa 25741 atgtgggaaa gcaggtagac caaacaaagt gggtcttctg ctttgggatg gcgaaatgcg 25801 attaaaaaac ggattttgcg atcctgaaac tggcgataaa gatccagcaa aacttctggg 25861 ttgtctgctt caatatgact aattgctgtt ctccagtgta ttaagctggg taggaacaac 25921 tggcagactc ttaaaaacca agggtctagc gctggaggaa taaattctag gggtggctgt 25981 gcttcttgaa acgacatcag gtaaatttgt tttctcctga agctgcaagc cttcatttta 26041 aactgtaact gaaatagaga ttatcgaatt tggaaagggg acaaacgatg cgactgtcac 26101 aaatgttatt tctcacactc agggatgatc cagcagatgc tgaaattccc agtcataaac 26161 ttttacttcg tgcaggatac attcgtcgca tcggtagcgg tgtctatgct tatcttccgt 26221 tgatgtggcg agtactgcaa aaagtctccc aaattgtccg ggaagaaatg aacgctgctg 26281 gtggacaaga atgtctctta ccccaacttc aacccgctga cttatggaag 
gagtctggac 26341 gctgggacac ctacactaaa gctgagggga tcatgttcgc cctcaaagac cgccgcgacc 26401 aagaacaggc gctaggaccg actcacgagg aagtcattac agcaattgct cgtgatatga 26461 ttcgttctta ccgtcagctg ccacagctgc tctaccaaat tcaaacaaag ttccgcgatg 26521 aaattcgtcc ccgttttggt ttgatgcgcg gacgagaatt catcatgaag gatgcctact 26581 ccttcaatgt agatgaagaa agtctagaaa aaacttatca ggatatgcac atcgcctaca 26641 gtaatatact gcggcggtct ggcttagctt ttcgtgctgt gcaagctgat tctggtgcaa 26701 tcggtggttc tgcttctcaa gaatttatgg tcttagcgga agcaggcgaa gatgaagtcc 26761 tctacactaa agatgggcaa tacgcggcta acgtggaaaa ggcggtttct ttaccaccgg 26821 acgcccaacc ctctgcattt accaattacg aaaaacgaga aacaccaggg acggaaacaa 26881 ttgagaaagt ctgtgaattt ttgaaatgtt cctccaccca acttgtgaaa aacgttcttt 26941 acgaaacaat ttttgataat ggaacatcag tattagtgct ggtgagtatc cgaggggacc 27001 aagaagttaa tgatgtcaaa ttacaaaatg aattgacaaa gttagctccc aagtatgatg 27061 cgaaaacggt gattaaactt gcagtaccag atacagaagc tcaaaggaaa tgggctagta 27121 agcctttgcc cctaggctat attgctcctg atgttgcaga tgattacatt aagtctgtca 27181 aggatatcgc acctggcttt ttacgtttgg tcgataaaac agcagctgat ttaaaaaact 27241 ttgtcacggg tgcaaatgaa tctggttccc acgtcgtcgg ggcaaattgg ggtgagcaat 27301 ttaagttacc agagttaacc gtggatgtgc ggaaagcaag accaggcgat cgcgcaaagc 27361 acaacccaga ggaaactcta gaaagtgctc gtgggatcga ggtaggtcac atatttcaac 27421 tgggcactaa gtattccata gcaatgggtg caacttttac caacgaacag ggtgaagaaa 27481 aacctctact gatgggttgt tttggtgtag gtgtgtcacg cttggctcaa gctgctgtag 27541 agcaatctta cgacaaagat ggaattattt ggccagtggc gatcgcacct tatcacgcga 27601 ttgttacaat tcctaacatc aacgacaccc aacaagtgga aatcgctgaa aaactctacg 27661 cccaactcaa tcaagccgga gttgaaacct tgcttgatga ccgaaatgaa cgggcaggag 27721 taaaattcaa agatgctgat ttgattggga ttccttacag aattgtcact ggacgagcga 27781 tcgccaatgg caaagtcgaa gttgttgaac gaaaaagtcg caaatcacac gaaattgcca 27841 tcgctgacgt tgtatctaca ctaaagcagt ggatgaaatc agggaacagt gaacagtgaa 27901 cagttatcaa gcaactgata actgacaact gacaaaacga aaacatggaa ttcttcgcac 27961 tccttgtatc tggcttgtta gggttagtga 
ctccggtagg actggtggtc gatcagactg 28021 ctgaaaatgc tattcgttct cagttggctc aagttgaaca attgcaagtg cgagttgaca 28081 acgcccctac tcatcaattg ctgcaaggta aagtggcaaa ggtacgtgtt gctgggcgtt 28141 ctttgcaact caagaagtgg caagacattc gcattgcagc tttagaattg gaaactgatg 28201 caattgaatt agaaccgcgc agtctgggaa aaaaacgacc attattcaaa cgacctttac 28261 aagctggtgt ccgtttggtg ttaactcaac aggatatcaa taaactcata caatcacctc 28321 aatttctggt gatgttacaa aagctaaaaa ttaatacagg cggttattca aacacagcgc 28381 cgaattctgt ctatcatttt accaagccaa atgtaaaatt tttagccgac aaccggttga 28441 gtgctcaagt agaattacag gatagaagtc tggataagcc cttgttaatt agggtggaat 28501 caggatttcg tatcgttggt ggcaggaata tccagttagt taacccaatt gtggcagcca 28561 atggagaaca ggttccacct caatttgtca acacagttgt gaacaatctc aacaaacgat 28621 tggatttaag caatctggaa ggtgacggtc tacaagtgcg aatcctaaaa ttgaatatga 28681 aaccagaaga gttagagatt gccgcatttt tgcgagtaga gccatcttct aggtttttgg 28741 aaacccctag tttatgagta tcgtcgttaa ccaggcttct cctcatacaa atactatttc 28801 agcctaaagg taggttttca atatgaaaga acagcaagga tctaaccgtt tctctccagg 28861 tattattgca gctataacag cagcggttat tgcagtgggt ggtggtatag ctttttttgc 28921 cagcaaaccc gcagataata ataataatag tgctcgtatt gctcctccta ataatccccc 28981 tgtacaaatc cctgttccag cagtaagtca acctccagta tcaacaaacc aggtgggtac 29041 tgagcaaaag gctgaagttt tttggctgca aaatacaggt agtgggttta agttggttcc 29101 ccaaacagtt caagtcaaag ctctcgggaa gtcgcctaac gaagttttag aaggagcttt 29161 ccaacagtta ttagctggac caacagaaag cagcgaaact accacaattc cacaaggaac 29221 aaagctactg ggtgttaagg cagcaagtga tggtgtccat gtcaatttat cagaagaatt 29281 tgaaagtgga ggtggtagtt cttctatgat gggtcgcgtg gggcaagttg tctacaccgc 29341 tacggctgta gataagaatg ccaaagtcta cattgaagta aacggcaaac ctttagaaac 29401 tttaggcggc gaaggtctgg agttagaaca gccattgaca cgtgaaagct ttgataaaaa 29461 ttattcactg taagttaggt aaacagtttc atcccccggt ttcaaccgag ggatgaattt 29521 ttttagaatt cctcaacaga tgactataag ttataacggg tgagcagttc ctatccggag 29581 cataaggggt aaagcgaaag ggggaaagaa ttatgaaaaa agctttccct tttaaccttt 29641 cccctttccc 
ataagagtgg caattgttac ttttgcaatc ttgtttatag ttttttctgt 29701 tcaaccagta tataaacgga cttttggggt aataaatacc atcttactgg agtttagact 29761 aaattctatt tttaaggcta cagcccttgt atagcatgca ttttagactt cagaatcaga 29821 acccgcatat aaattagaaa agcttgtctg gataggggtt tggcggtttg ataagcaaaa 29881 ttagtctaaa ctccaaatct tagtcacctg aaatgtttac tgcttagggc ttgttgtcaa 29941 aattattata tcctgtataa cagtacctaa gaagatgaat catgtcagta cttcgtatcc 30001 ttcatgtagc agggtctcca tccagtgatt tttaccgtga attatcacgc aattacgccc 30061 atagctgtct ggtagctacg gcaaatccat cgcgctacga atttttaatt gcatacatca 30121 cacctgatgg cctgtggcga tttccttcct ctctgagtct tgaagatatt gctgtcgcca 30181 aaccgctttc tctgttagaa gctatacagt ttataacggc gcaaaacatt gacgttatgt 30241 tgccacaaat gttttgtctg cctggaatga ttgagtaccg cgcactattt gacttgctta 30301 agatccctta catagggaat actccggata tcatggcatt aacagctcac aaagccaaaa 30361 ccaaagcaat tgtcgcagca gctggggtca aagttccttc tggagaagtg ctccgcaaag 30421 gagacgttcc cacaattaca cctccagcag tcatcaaacc cgcaaatgcc gacaactctt 30481 taggggtgtc cttagtcaaa gaagctagtg agtatgacgc tgccctcaag aaagcatttg 30541 aatttgcttc ggaggtgatc gtagagacat tcattgaagc cggtcgagaa gtcagatgcg 30601 gtgccattgt caaagatggg gaactcatcg gtttacccat ggaagagtat atgatagacc 30661 ccgaagtcag acccatccgc agctatactg ataaattcaa gatacccatc gagggcgact 30721 tggcttcaac tgctaagaat tatgtccctg gttggattgt agatattaat gacccgatca 30781 cccaaaaggt tcagcaagaa gttaagaagt gtcatttggc tttgggctgt cgccattata 30841 gtttatttga cttccgaatc gacccacagg gacaaccttg gttcttagaa gccgggttgt 30901 tttgtatttt tgaccacaac gcggtggttg cctgtacggc gaacaaagta ggaattcctt 30961 tagatgagtt atttcagacg atgatcaatg aagcgttggg caagattatg atactcgaca 31021 aattacccct ttgaagaaac acaggggcaa aac // LOCUS NODE_911_length_30686_cov_5.16333130686 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 30686) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 30686) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..30686 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 311..1960 /locus_tag="DP116_07805" CDS 311..1960 /locus_tag="DP116_07805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07805" /translation="MTTQLRNVLKLSPVVLAATFFTANSAMAAEVNEQVTSVSVLTSQ SDNIGQVTSVSQFSDVQPTDWAFQALQSLVERYGCIAGYPNGTYRGNRALTRYEFAAG LNACLDRVNELIATATADLVRKEDLATLQRLQEEFSAELATLRGRVDALEARTAELEA NQFSTTTKLVGEAIFNISDIFGSDNRAVPSGVNPATAQDLNSNTIFADRVRLNLLSSF FGSDQLQIRLQSRNITPYGTNVTGTNMTRLGFDGNESNDNLLEKLNYAFKLGDALSVK IDATGGLLYENINTFTPEFNSSGRGAISRYGRFSPIYRVGEGGAGATLVVNPKGPITV SAAYLADRANNPNDGSGFLNGGYAALGQISFQPSQAFNIGLTYARTYQNIGNVANNSI NLFGSTGSQYANNPFGGAALTADHYGVEATLRLGPKVTLGGWYGYSEAEAKSGPREGN NAYFEYWAANVAFKDFGRQGSVLGFVFGQPPKTTGNEFVQANGIRRQDRDTSYHLEAL YRLQLTDNIAVTPGVLVIFNPEHNDRNDTVYVGTLRTTFTF" gene complement(2143..2436) /locus_tag="DP116_07810" CDS complement(2143..2436) /locus_tag="DP116_07810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07810" /translation="MNIDAFAPTPPEWTNTAIHAYEFCCPNCHSSSLEAVQVWINRRS PVMTENYRRKWQEFYQCHCGCVWWAWSSDRPKVKRPSDDDSSGTGSPTNLDPQ" gene 2555..3121 /locus_tag="DP116_07815" CDS 2555..3121 /locus_tag="DP116_07815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453998.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07815" /translation="METNVEIRRLLDIMPASGRMMTKIVIKPEQTKVIDATFPLPWNK ERPIYINFDLWRRLTKPQRDLLLLRTVSWLTQVKWFKPDIYQGMAIAGVLGAFVESAQ ADPVGIVVAGGLTTLSMLRIWRTNRSQQTELDADEAAIHVAQRRGYSEAEAAQHLLSA IEAVAKIEGRSGLEFTELIRSQNLRAIR" gene 3346..4644 /locus_tag="DP116_07820" CDS 3346..4644 /locus_tag="DP116_07820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179738.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_07820" /translation="MSNFVFVIDTNKKPCDPVPPGQARRLLKQGNAAVFRRYPFVIIL KYPASLHPGPHQLKIDPGSKTTGLAIIQQDKVVWGAQLQHRGQKISNDLKARNAVRRS RRNRKTRYRRPPIKNRGKRQRSKGWLAPSLKSRVYNIMTWVLRIKKYVPITGVSQELV KFNTQVLVNPETTGIEYQQGELFGYEVREYLLQKWGRKCAYCSATNTRLEIDHIHPRS RGGSNRVSNLTIACHECNQAKSNQDIRDFLAQKPDVLNRVLSQAKQPLKDAASVNSTR WALFNQLQQTDLPVEIGTGGRTKYNRTRLELPKTHWLDAACVGTQDMLTVLTSQPLLI SAKGWGTRQMCITNKHGFPIKHRERKKVFFGFQTGDMAQANLPRGKFAGTHVGRLTVR KTGVFEMTKHIGKVSPVRHKYCKAIHRNDGYMYAFSTISH" gene complement(4749..6509) /locus_tag="DP116_07825" CDS complement(4749..6509) /locus_tag="DP116_07825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_07825" /translation="MNVIIKTQNSKLKTLLLLAIWLLIAVGLRLTNLSAKPPWTDEFS TLVFSLGNSFLPVPLDQPISVDVLLQPLQPLATAGIQDVWKHLSTESNHPPLYFILTH LWMRLFPTHEGLVSLWGARSLAAFFGAASIPAVYALTQLAFRSRLVSHLAAAMMATSP YGIFLAQEARHYTLAILWVIASLACLVIASRHIQNHTQIPIQIALSWVGINAVGIATH YFFTLTLCCEALVLLFLAWRQWTRKTREKFSTSSSPPNLYSPWFRIYAVALGTFVAGL VWLPVFLQNSYGGKLTEWIQGPRTGMAWINPIFQALAAWITMISLLPVEASQLAVVIV SGLLMLIFFIWAVPILVRGIKVYLKQAQTRLMVQVFAGLIAGAIALFFLFTYFFGIDL TRGARYNFVYFPAVIVLLGASLAVCWRDPILEKGRWGINGKKAVIIILSMGLVSAITV VCNLGYQKYYRPDLFVQLIEQTSHVPVLIATTHKTHVQIGEMMGIAREYKIQNFIQNS IQNSKSPSPLFLLAHQDEDPKTSTVALQNTLKTLPRPFDLWLVNFYAPQPEEVKNCVA ETQSLSAVNGYDYKLYHCGD" gene complement(6678..7916) /locus_tag="DP116_07830" CDS complement(6678..7916) /locus_tag="DP116_07830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453984.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_07830" /translation="MSIIQPNSLLTVPTGSLQILELPAATTEANSEPIVLSLVIPTYK EAANIQKIVSILTNLLDEYIPGKYELIVVDDDSPDGTWNIAQSLMTEYPQLRVMRRQG ERGLSSAVIRGWQAATGNILGVIDADLQHPPHVLLQLLQAIEEGADLAVASRHIEGGG VSSWSFVRRLLSRGAQLLGLIILPGVLSRVSDPMSGYFMLRRHCIVGKTLNPVGYKIL LETIGRGNVGKIAEVGYVFSERKEGESKVTWKQYVDYIHHLIRLRVSTGRLSKFSRNF PIDRFLRFGLVGLSGVFVDMTVLYLLSDPTTLGLPLTRSKIIAGEVAILNNFLWNDIW TFADVSSQQQEWRQRLKRFLKFNIICLAGLVLNVLVLNLVFNFVIRNRYVANLIAIAV ATVWNFWVNLKLSWRVTQVK" gene complement(8049..9428) /locus_tag="DP116_07835" CDS complement(8049..9428) /locus_tag="DP116_07835" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07835" /translation="MIKHISHRIFITVFILIALHATPDGPQTSRYTDAIHSFVNFGTF ALQKGYRSHIDILLLNNNAYSVIPPGIPIILAPLYFIHQMLMRLIGIPEGEVYWAIFN ILSNVCVSAPLLGIVAVIMFKTLDYFTNDLVKKLWVVFIFIFGSLVFFYSTNGIWSHV YTMSFIFLAFYLIINQANSFFIGLFLGLAQMVDYIAIVPISLLIGFWIYLRIQEKDNK SLLTNIFLLLLGYSIFLGVIMFYNQTITGSVFKTPNSLFLKQLNQEDTIQKSMFIVPS LGTIWSLTFSSFRGIFLYFPMTILFLGSFVKKTYQKNNVILFCFIFFAFIFVLNASYY AWSGDVCFGPRHLVVATPFILLPVVYSPLKYIKLLGVLSMFINLAGVSTIPSNNLLIN IVMFLYRGPFLHWQDYLYKVVLPQYYNVRLSLMTPFFIYVATGFLIYLIWKPGINQEE VLLKDKLVS" gene 10173..11312 /locus_tag="DP116_07840" CDS 10173..11312 /locus_tag="DP116_07840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863168.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase subunit CofH" /protein_id="PRJNA477356:DP116_07840" /translation="MVTITVDAILERALSGYDLLPEEAVVLLKQNDQSAVAAIRATAD KLRQRQAGDTVTYIINRNINFTNICEQHCSFCAFRRDEGEEGAYWLDWAHILEKATDA VQRGATEICMQGGLNPQAKVNGKSLPYYLKVVEKIKQEFPQLHLHAFSPQEVQFIARE DALQYADVIIAFQDAGVDSMPGTAAEVLDDQVRRILCPEKINTTTWLEIVSTAHRLGL YTTSTMLSGHIETPEQQIKHLEKLRSLQQTAIHRDYPARITEFILLPFVGKEAPKPLR RRVGHDQPILGDALLLTAVARIFLGNWIPNHQPSWVKLGLAGATEALLWGCNDIGGTL MEEHITTMAGAQGGTNMEVETLQAAITSIGRPYQQRDTLYQTVGL" gene 11437..11835 /gene="psb27" /locus_tag="DP116_07845" CDS 11437..11835 /gene="psb27" /locus_tag="DP116_07845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198052.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II protein Psb27" /protein_id="PRJNA477356:DP116_07845" /translation="MKRYWSRLLALVLLVVVGLMGCNSSPGALTGDYRQDTLAVVNTL RTAIELPQDSPDRASTQAEARKKINDFAARYQRDGSVSGLASFTTMRTALNSLAGHYS SYPNRPVPQKLKDRLEQEFERVEAALSRGA" gene 12322..13821 /locus_tag="DP116_07850" CDS 12322..13821 /locus_tag="DP116_07850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453978.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07850" /translation="MSNRPLNVATIFFWQRLLAALIFSSSLLIPFLKNQPAAAQLTEY CQLPPPQAQEKENLRLSALKGDQQAQSRYQQLLQQNAQVLQECRNRTWPQLQAIWLRL YPCDVQPGMVDQIMDRIVNRGYNEVYLEVFYDGQVLLPKGANPTVWPSVIRTPGTEKT DLLATAIQKGRERGLKVYAWMFTTNFGYTYAQRSDREGAVARNGKGQTSLYVVDNGSQ VFIDPYNLQAKRDYYQMLQEVVRRRPDGVLFDYVRYPRQAGTDSIATKVTDLWLYSDA TQQALFRRALNYKGLELIRRFLTKGYITAGDVNEADKLYPQEGEPLWQGRTPPQEQKS ILPPDQRQPQLQSELWQLAVAHAMQGIVDFLALAAYPAQQQGIPVGAVFFPDGNQMVG QGYDSRLQPWDKFPNTIQWHPMSYANCASADCIAAQVQRVLNMAKPDTQVIPAIAGQW GASISNRPPLEVQMQALRKLAPQIKGVSHFAYSWQDPEYDNQRKFCRVQ" gene complement(14042..14620) /locus_tag="DP116_07855" CDS complement(14042..14620) /locus_tag="DP116_07855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457668.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_07855" /translation="MTAITINLKNIVKLNDDQFYQLCQDNPEVKFERNGNGELIVMPP TGGETGKRNAKLTARFVVWNEQTQLGEVFDSSTCFKLPNGSSRSPDVSWIKLTRWNAL TPEQREKFPPIAPDFVLELMSPSDSLSDTITKMQEYMDAGVKLGWLMERMTRRVEIYR QGQPKEVLESPTSLSGEEVLPGFVLDLQIVWG" gene complement(14808..15635) /gene="map" /locus_tag="DP116_07860" CDS complement(14808..15635) /gene="map" /locus_tag="DP116_07860" /EC_number="3.4.11.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995193.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I methionyl aminopeptidase" /protein_id="PRJNA477356:DP116_07860" /translation="MNILSNLLSLPVQNNPQKRQRRGIEIKSPREIDIMRQSAKIVAT VLKEISELVKPGMTTADLDAHAEKRIREMGATPSFKGYHGFPGSICSSINNEVVHGIP SAKKVIRTGDVLKVDTGAYYQGFHGDSCITIAVGEVTPEAARLIRVAEETLFKGIEQV KAGNDLMDIAGAIEDHVKANKFVVVEDFTGHGVGRNLHEEPSVFNFRTREIPNVKLRE GMTLAIEPILNAGSKYTRTLSDRWTAVTVDNSLSAQFEHTVLVTETGYEILTDRTKL" gene complement(15875..16507) /locus_tag="DP116_07865" CDS complement(15875..16507) /locus_tag="DP116_07865" /inference="COORDINATES: protein motif:HMM:PF03807.15" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07865" /translation="MNIGIIGAGKMGTGLGKLWIKNGHNLMFSYSRDMEKLKSLAESI DPSVRVGTPTEAVQFADVVLLSVPWAAVPDALKAAGSLDGKILFSCVNALTPDMSGMA VGTTTSGAEEIAKLVPGARLVEALPVFAEVLYSASRKFGQQEATVFYCGDDAQAKEIV AGLLREIEVEPLDAGELKNARFIEPAMMLLVQLAYAQNMGGEIGLKLLRR" gene complement(16549..17265) /locus_tag="DP116_07870" CDS complement(16549..17265) /locus_tag="DP116_07870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015154214.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SDR family NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_07870" /translation="MELKGKTAIITGASSGIGEATARELDAAGMNLVLTARNEDKLNK LATSLTHATFVAGEVTDPDLPKRLLEHAVSSFGRADALVNNAGVMVVGSVETVDIEAL CQMVRINVEAAFRMAYVFARHFKQNANGFIVNLSSISGTTNYPTMAAYCGTKHAIESF TDCLRLELAGSGVGVGCIEPGKVATNLYQNWSEEEKQTVAVEQPLVAEDIARAIRFLL DQPSNVNIGRLLITPANQSA" gene complement(17365..18867) /locus_tag="DP116_07875" CDS complement(17365..18867) /locus_tag="DP116_07875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020742190.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="betaine-aldehyde dehydrogenase" /protein_id="PRJNA477356:DP116_07875" /translation="MITCTQAEQPKISARPDFPFTHNKLYINGEWRDSASGKTFSVID PTTEEEIAQIAEGTAEDAEAAITAAYQAFETGPWGQMSGHERGTILWRIGDLFLKYGE ELAYLQAKEMGRLFTDSITVDIPHLANTFHYFAGWASKLEGAVKQTTKNLHTYTLREP LGVVAAITPFNFPLILSIHKFAPALAAGNTIVHKPSSTTPLTALKVAEITAEAGLPSG VFNVVPGPGSTVGHALSTHPMVEKVAITGSTASGIRVIKDSADTLKHLTMELGGKSAN IVFADADLDAAIETAYYGMFYNKGEICYAGSRMLVERSIYDEMVERVAQRAKQIKVGA PLDPDSQMGPIANKSEYDNVLRYLEVGKQNGARLVAGGKTADIGTGKGYFVEPTVFVD VNSDMAIAREETFGPILSVIPFEDFNDAIRIANSHQYGLASGVQTRDIKKAHKAAARL KAGTVWINTYGHFDPSSPFGGYKMSGYGRENGQEALEFYLQTKTVWVDLS" gene 18945..19283 /locus_tag="DP116_07880" /pseudo CDS 18945..19283 /locus_tag="DP116_07880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015169706.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transcriptional regulator" gene complement(19837..20286) /locus_tag="DP116_07885" CDS complement(19837..20286) /locus_tag="DP116_07885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457938.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_07885" /translation="MTTKLNEPLLVVEDSNEDFRMLQRLMRRLAVPNPIYRCTNGDEV LDFLYKEGNYQNPDLAPRPSVILLDLNLPGIDGRDILERLKQDQSFREIPIVVFTTSS NPKDIELCYKKGANGYLIKPMDAQELQKTIQAFVDYWLEVNTSPSAG" gene complement(20336..22618) /locus_tag="DP116_07890" CDS complement(20336..22618) /locus_tag="DP116_07890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316628.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyanobacterial phytochrome A" /protein_id="PRJNA477356:DP116_07890" /translation="MSQFDITPTQATNLIHHEPIHLPNSIQPHGVLLAVSPQLEILLT SNNTQDFLGKEPRDLLNQPLSTLLNAEQVKAIEQFWQGSGSSVYSFKLSIETSKGEQY FDAIAHRTESTVILELEPTDSINQISFLSFHALAQEAIAKMRKTSNLTDFLHTVASEV QKITEFDRVMVYQFDQVGAGSVVAEVKKDNLLPYLGLHYPSTDIPQQARELYKRCLLR FIPDMNAQAVELMAVENPEMHSTSVDLSLSVLRSTHPCCVEYHQNMGVAAILVIALIK EQTLWGLISCHHQTLKFIPYEVRKICEFLAQIVSLELEHKVNQSEFDLMVKLQSIQAD FIESISGAENFREALVYPELRLLDIVNARGAAVCLDDDITLVGATPTIEEVRALIEWV DTQITDNLFSTDSLPKLYPEAVAFKDTASGLLLLRISKVRRYYILWFRPEVIQTVNWA GQPNESIQIKADGSVTLCPRTSFELWQETVRLTSLAWKSCELESAIALRNAIVGIVLS KADELAKINQELERSNRELASFAFVASHDLKEPLRGIYNYSNILLEDYAQLLDEDGVE YLETVVSLSIRMETLINSLLRLSLLGQAQLNLQATDLNGLLHQVIDLVRASRSPSQLD IRIPRRLPMIQCDAVLVSEVFSNLMINAFKYNDKSDKWVEIGYLDINEQMEKGLLQQQ PQTSAPFVFYVQDNGIGIPEHHHQTIFRLFKRLHSQEKFGGGTGAGLAISKKIVERHG GRIWVESTVDTGSVFYFTLE" gene complement(23040..23369) /locus_tag="DP116_07895" CDS complement(23040..23369) /locus_tag="DP116_07895" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07895" /translation="MGLRREPAALVCEQYEPQLIDPSFSKVEPALVIVAILDWVGDLY HGYILELVIFTRFIGHKTINLPTYLLVRACQPNLALCAMTQFKHQFTFVHLGRTPSCS LPLWASS" gene complement(23727..25253) /locus_tag="DP116_07900" CDS complement(23727..25253) /locus_tag="DP116_07900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316627.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_07900" /translation="MNGQVLGDRYEIQQQLGKKAGRRTLLARDLVTGEMVVVKLLSFS SDFEWEDLKLFEREAETLKKIAHPSIPRYLDYFEINSSNIKGFALVQTYIPAQTLDEY IKAGRTFTEAEVKQIAKALLTILIYLHEHKPPTIHRDIKPSNILLTNRSGNSVGQIYL VDFGSVQTVAASEGGTMTVVGTYGYMPPEQFGGRTVPASDLYSLGATLIYLVTGTQPA DLPQKDLRIQFEQAAILSPILTEWLRRMSEPSLERRLSSAREALAALEHPLLLQQQHP EQMHLTDKKRGIWYWDGQHWTSKDQEISQKLNTKSESGRHSLTIKMTKPAGSKITLKK DENSFDLLIPPTGFQPSMTFMVLFAISWNSFILFWTISALAAPFPINIPFALFSLPFW GAGFQMLSWIFFPLCRRTRLRLNRQKIAFIWDFFGLKFNRVASSPTQDITKLVYSPKT FKTDSQGNRVEILPQLIIWAGVHKYQLGGTHGVIKSEPEIEWLAHELSDWLGLPLTQE " gene 25368..25970 /gene="cbiT" /locus_tag="DP116_07905" CDS 25368..25970 /gene="cbiT" /locus_tag="DP116_07905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="precorrin-6Y C5,15-methyltransferase (decarboxylating) subunit CbiT" /protein_id="PRJNA477356:DP116_07905" /translation="MPSQLWPYITPGIPDDLFERLPGIPLSKREIRLQLLAQLRLLPN TVLWDIGAGTGTIPIEAGLLCPKGQIFAVERDEDVANLIRHNCDRFEVKNVEVIEGSA PECLQNLKVAPNRVCIEGGRPIQEIMKAVWHYLQPSGRVVATAANLESLYAISQSFAQ LQARNVEVIQSAVNRLETRGSSATFTAVNPIFILSGEKLD" gene 26075..26959 /locus_tag="DP116_07910" CDS 26075..26959 /locus_tag="DP116_07910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457530.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphatidate cytidylyltransferase" /protein_id="PRJNA477356:DP116_07910" /translation="MPWSRIVSGIVAIALALTATLLGGWYFTLLFAVIVILGQLEYFD LVRAKGIVPTAKTTMFVSQVLLIICTLDRNLADAVMPLAGTFICFYLLFQPKMATIAD IAASILGLFYTGYLPSYWVRLRSLHGATISNLPVSDYLSTIWTNIVNGNFSALPQGLT GTALTFVCIWAADIGAYFFGKFFGKTPLSDISPKKTVEGAVFGIAGSVVVAVVAAYYL NWPKFVFTGVALGLLIGIASLLGDLTESMLKRDAGVKDSGQLIPGHGGILDRTDSYIF TAPLVYYFLTLLLPLVGK" gene 27336..27623 /locus_tag="DP116_07915" CDS 27336..27623 /locus_tag="DP116_07915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130810.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1778 domain-containing protein" /protein_id="PRJNA477356:DP116_07915" /translation="MSNSHKNTVRITARIAVSIQETLERAAELSGATLNQFMIQAALK EAKKIIEDERVIILSQNDADTVFSLIENPPVPNAKLKAALKKHKEFFSESH" gene 27610..28107 /locus_tag="DP116_07920" CDS 27610..28107 /locus_tag="DP116_07920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136623.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_07920" /translation="MRVIELLGKQHDRDSFNCGNEALNQFLKQTARQHIQKGVSRTFV LVNTEQPEAIIGFFTLTLCEVRVDNLSAKFFKKYPSKVPGVKLARLAVDQAYQRQGIG EVLMIEAMQRALIVAENAGSIGLFVDAKDESAKTYYSGYGFVSLEDASLELFLPLSVI EQMLE" gene complement(28356..28619) /locus_tag="DP116_07925" CDS complement(28356..28619) /locus_tag="DP116_07925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213239.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07925" /translation="MMLDVFHEHLRQARQTFNLSVVAVAASLGISVIGAGLLISGKAS EGSVTTATGLISTTLCSQIAKESGEKLEELREDLKALRPSSDS" gene 28827..30665 /locus_tag="DP116_07930" CDS 28827..30665 /locus_tag="DP116_07930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="signal transduction protein" /protein_id="PRJNA477356:DP116_07930" /translation="MSKNKQTVKATNEGLAKAEKALKRLYETQLGLANNLKGSVGRST IQKFFKGEKIQVDKFKEICTALKLESHWEAIAGLTDLPSSVDVPDNNLSESVKEVEDN SIDVDALVQEVREKVTPYIKERCGTMRVLDMTQPIGLDNIYTEVNIFEKITGRTRLGI AELEQNLDVENFDRFGLGKVTQKRVPGLQAVEQHSKLMVLGKPGAGKTTFLKYLAMQC IEGEFQKQRFPIFITLKEFAEAPEKPDILKYISQRLSDCDLTDADVKAKQLLKQGKAL ILLDGLDEVREEDTKKVLNQVQEFSYQFHKNQLIIICRIAAKEYTFQGFTEVEVADFD SEQIATFAQNWFQSRKDPVKGKRFIEKLKENKPIQELATNPLFLTLLCLVFGEAGDFP ANRSELYTEGVDVLLKKWDVKRNIKRDKVYKDLSLQGKENLLSHIALTTFEQKNYFFK KKTVEDDIADFFRNLPNANTAPKALELDSEAILKSIEVQHGLLVARATDIYSFSHLTF HEYFTAREIKERSAWAKLAEHVAEKRWREVFLLTTGMVRSADDLLKLMKHQIDTLLAK DEKLQQFLIWVEQKSKSVEARYKSAAIRAFCLDLSVSPVLSLSRDL" BASE COUNT 9025 a 6516 c 6634 g 8511 t ORIGIN 1 aaatctagtt tcttggtttt tgttgattaa tttattcctt ggaagtccct tattgatttg 61 tttgattcaa gcatttggaa tttcatctta gtatgctaat acttttattt tgattaaagt 121 tgacaaaatg tttcggaaat gcatgtttta cttactgaat ttgtttggtt caaacattca 181 gaatttcatc ctaatttcat cttgatctga gaacctgagt tataggagtg aaatctgtaa 241 atacttactt agatgtgaaa aaacaacaaa tcatatctaa gccgaattta ggtgtgagag 301 agaaaaaatt atgacaacac aattgcggaa cgttctgaag ctaagtccgg ttgttctggc 361 agccacattc ttcactgcga atagcgctat ggctgcagaa gtcaacgaac aggtgacttc 421 cgtctctgtg ctgacatcac aatcagacaa cattggtcaa gtaacatctg tttcccagtt 481 ttccgacgta cagccaacag attgggcatt ccaagcattg cagtctctgg tagagcgcta 541 cggttgtatt gcaggttatc ccaatggtac ctatcgcggt aaccgcgctt tgacccgtta 601 tgagttcgcg gctggtttga acgcctgttt ggatcgagtg aatgaactga ttgcaacagc 661 tactgccgac ttagtcagaa aagaagacct agctaccttg cagcgcttgc aagaagagtt 721 ttctgctgaa ttggctaccc tgcgtggtcg tgttgatgct ttagaagctc gtactgctga 781 gttggaagca aatcaattct ctaccacgac aaaactggtt ggtgaagcga ttttcaatat 841 atctgacatt tttggtagcg ataatcgggc tgttccctct ggtgtaaacc cggcaacagc 901 tcaggacttg aattctaata ccattttcgc tgaccgggtt cgtctgaact tgctctccag 961 cttctttggc tcagaccagt tacaaatccg tttgcagtcc cggaatatca ctccttatgg 1021 tacgaacgta 
acgggtacta acatgacccg tctagggttt gatgggaatg aaagcaatga 1081 caatttgctc gaaaaactta actatgcttt caagttgggt gatgcactaa gtgtcaagat 1141 tgatgctact ggtggtctct tatacgaaaa cattaacacc tttacccctg agttcaacag 1201 ctccggtagg ggtgctatct ctcgctatgg tcgtttcagc ccgatctatc gtgtaggtga 1261 gggtggtgca ggtgcaacat tagtcgtcaa tcccaaagga cctatcactg tatcagcggc 1321 ttatctagcc gacagagcta acaatcctaa cgatggatca ggttttttaa atggtggata 1381 tgcagccctt ggtcagatat ccttccagcc cagtcaagca tttaatatcg gtttgaccta 1441 cgctcgcact taccaaaata taggaaatgt tgccaacaat agcattaacc tctttggctc 1501 tacaggaagt cagtatgcta ataatccttt cggtggcgct gctctgactg ctgaccacta 1561 tggcgtagaa gccaccttga gactaggtcc caaggttacg cttggtggtt ggtatggtta 1621 cagtgaggca gaagctaaga gcggtcctag ggaaggtaat aatgcatact ttgagtactg 1681 ggctgctaat gttgctttca aagactttgg cagacaaggt agcgtgcttg gttttgtctt 1741 tggtcaacct cctaaaacaa ctggtaacga gttcgttcaa gcaaacggta ttcgtcgtca 1801 agacagagac acatcatatc acttagaggc actgtacaga ctgcaactga ctgacaacat 1861 tgctgttact cctggtgtgt tggtgatctt caacccagaa cataacgaca gaaacgatac 1921 cgtttatgta ggtacactgc gtactacctt tactttctaa aattagctct acagcaactc 1981 tcactctttt ggggtggggt tatgtaaaaa aagcctgctt atgcgggctt tttttgtatg 2041 actcaactga tgagtgttgc gtttcttgta aaaatccggt ttaggggttt ttagaacccc 2101 agatggatgc gtgcggagaa acgcagataa tttacaggaa gtttattgag gatcgagatt 2161 tgttggagaa ccagttccgg aggagtcatc gtccgaagga cgcttcactt tggggcgatc 2221 gctactccaa gcccaccata cgcaaccaca gtgacactgg taaaactcct gccatttgcg 2281 acgatagttt tccgtcatca caggcgaacg acgatttatc cagacttgta cagcttctaa 2341 gctactggag tgacagttag gacagcaaaa ttcataagca tgaattgccg tgtttgtcca 2401 ttcaggtggg gttggagcaa aagcatctat attcattagt tataagtcgt ttgtcattag 2461 ctatatcaat tatgaatgac tacattgaaa gcccataaac caggtattct aagcataagc 2521 atatattata agttgtatga aagctgatat taacatggaa acaaacgttg aaattcgccg 2581 tttgctagat ataatgcctg cttctggtcg aatgatgaca aaaatcgtta tcaagcccga 2641 acagacaaaa gtgattgacg caacatttcc attaccttgg aataaggaaa gaccaatcta 2701 tattaatttt gatttatggc 
gtcgcctgac gaaaccgcaa cgcgatttat tactattgcg 2761 gactgttagt tggttgacgc aagtcaagtg gtttaaacct gacatttatc agggtatggc 2821 gatcgctggt gttttgggcg cttttgtaga atcagcccaa gcagatccag tgggtatagt 2881 tgttgctggt ggattaacta cgctgtctat gcttcgcatt tggcggacta accgatccca 2941 gcaaacagag ttagacgctg atgaagctgc cattcatgtc gcgcaacgac ggggttactc 3001 agaagctgaa gccgcacaac acctgttatc tgctattgag gcggtggcaa aaattgaagg 3061 gcgttctgga ttagagttta ctgagttgat tcgtagccaa aatttacgag ctatccgcta 3121 aacagtggac aggactggac aaccagtagc taccgttgta gagatagcca cgcttaagta 3181 ctttgaatac tacgttaaaa aggtgatggc atctctaggt gctttccagc ttagggctac 3241 tgccgtcaag gattaaacag gtctatttag ctaagccagt gttcttgaca tgacaagcct 3301 ttttaacttt ggcgaggaca acataacccg gttaccggag gacaaatgtc taacttcgta 3361 tttgtaattg acactaacaa aaaaccgtgt gacccagtac cacctgggca agcaagacgg 3421 ttactaaaac aggggaatgc agcagtattt cgccgttacc cattcgtaat catcctgaaa 3481 tatccagcct cactgcatcc tggtcctcac caattgaaga tcgatccagg gtcgaagacg 3541 acagggcttg caattataca acaagacaaa gtggtgtggg gcgcacagtt acaacacaga 3601 ggacaaaaaa tcagcaatga cctcaaagca cgcaatgctg ttcggcgtag tagacgtaat 3661 cgcaaaaccc gttacagacg accgccgata aaaaacagag gtaagcgtca aagatctaaa 3721 gggtggttgg cacctagttt gaaatctcgt gtctacaaca tcatgacttg ggttttgcga 3781 atcaaaaagt atgtcccaat taccggggta tctcaggagt tagtcaagtt taacacacaa 3841 gttctggtca accctgagac aaccgggatt gagtaccaac agggcgagtt gtttggctac 3901 gaggtacggg aatacctgtt gcagaagtgg ggtaggaagt gtgcttactg tagtgcaact 3961 aatactcgct tagaaattga ccatattcac ccaaggtcta gaggaggtag taaccgcgtc 4021 tctaacctca ctattgcctg ccatgagtgc aatcaagcca agagcaatca agatattcgt 4081 gatttcttgg cgcaaaaacc ggatgtcctc aaccgtgtac tcagtcaagc aaaacagcca 4141 ttgaaagatg cagcatctgt taactccact cgttgggcgt tgtttaacca acttcaacag 4201 acagacctac ccgtggaaat tggtacagga ggaagaacga agtacaaccg cacccggttg 4261 gaactgccca aaacccattg gctggacgca gcttgtgttg gtacacaaga catgttgaca 4321 gtcctcacct ctcaaccact attgattagc gccaaaggct ggggaactcg tcaaatgtgc 4381 ataaccaaca agcacggttt tcccataaag 
catagggaga gaaagaaagt tttctttgga 4441 ttccaaacag gagatatggc gcaggcaaat ctgccaaggg gcaagtttgc aggcactcat 4501 gtaggacgac taactgttag aaaaactggg gtttttgaga tgaccaaaca cattggtaaa 4561 gtcagcccag ttagacataa gtactgtaaa gctattcatc gaaatgatgg ctatatgtat 4621 gcattttcca ctatttccca ctgaaatgaa gattagaggc gttgcgatcg ctggtttgtc 4681 atcagtgagt gttccagaaa gttttgaaca atagttcacg aaatcaaatc ataactaata 4741 actgatgact aatcaccaca atgatacagt ttatagtcgt agccgttgac tgcgcttaaa 4801 gattgagttt cagcaacaca gtttttaact tcctccggtt gaggagcata aaagtttact 4861 aaccataagt caaaagggcg tggtagtgtt tttaacgtat tttggagtgc aacggtggaa 4921 gtcttggggt cttcgtcttg atgagcaagg agaaatagcg gtgatgggga ttttgaattt 4981 tgtatggaat tttgtatgaa attttgtatt ttatattccc tcgctatccc catcatttcc 5041 ccaatttgta catgagtttt atgagttgtg gcgatgagta caggaacatg ggatgtttgt 5101 tcaataagtt gtacaaataa gtctggacgg tagtattttt gatagcccaa attgcaaaca 5161 acagtgatag cactaaccaa tcccatcgac aaaatgatga tcacagcttt tttgccattt 5221 attccccatc tccctttttc caaaatggga tcacgccaac aaactgctag acttgcccca 5281 agtaagacta tcaccgcagg aaagtaaaca aagttgtaac gagcacctcg tgtcaagtca 5341 ataccaaaaa agtaagtaaa aagaaaaaat aaagctatag cccctgcaat caaaccagca 5401 aacacttgaa ccatcaaacg ggtttgtgct tgtttgagat aaactttaat tccacgcacc 5461 aaaattggta ctgcccaaat gaaaaatata agcatcaaca gtccagaaac aatcacaact 5521 gctaactgtg atgcttccac tggtagcaaa gaaatcatcg taatccatgc tgccaaagct 5581 tgaaaaattg ggttgatcca agccattcca gtacgtgggc cttgaatcca ttcagtcaac 5641 ttcccgccat aactattctg taaaaacact gggagccaaa ctaaacccgc tacaaatgta 5701 cctagcgcaa cagcgtagat gcggaaccag ggagagtaaa gatttggggg agatgaggaa 5761 gtagaaaatt tctccctcgt ctttcttgtc cactgtcgcc aagcaagaaa aagtaaaacc 5821 aacgcttcac agcagagggt gagagtgaaa aagtaatgag tcgcaatacc aacagcattg 5881 attcccaccc acgaaagcgc tatctggatg ggtatttgag tatggttttg gatgtgacga 5941 ctggcaataa ctaagcaggc taaagaagca atcacccata aaattgctaa agtataatga 6001 cgggcttcct gtgctaaaaa aatgccgtag ggtgaagtag ccatcattgc agccgctaaa 6061 tggctgacaa gacgagaacg aaatgctagc tgagttaaag 
cataaacagc tgggatagat 6121 gctgcaccga aaaaagctgc aagcgatcgc gccccccaca acgacaccaa accctcatgt 6181 gtaggaaaca gccgcatcca caagtgagtg agaataaagt aaagcggcgg atgattactt 6241 tcagtagata agtgtttcca tacgtcttgt atgccagcgg tagccagtgg ttgtagtggt 6301 tgtaagagaa catcaacaga tatcggctga tccaacggta ctggcaaaaa gctattacct 6361 aagctaaata ccagagtaga aaactcatca gtccaaggtg gtttagcact caagttcgtt 6421 aaacgcaagc cgacagcaat gagtagccaa atcgccagca gtaatagagt tttgagtttt 6481 gagttttgag ttttgataat gacattcata aattttgaca ctcctcagcc gtcatgcgtg 6541 aggattctta agaatcttaa ttctattgaa tcttgattcg taacggctgt gccaaacagc 6601 ttctagacag tccggtaaaa ccatatgatt taccactcat aacttatcac tcaaaacaca 6661 tggctgtttg ttccgtccta tttcacctga gtcacgcgcc aactaagctt caaattcacc 6721 caaaagttcc aaacagtagc aacagcaata gcaatcaggt tagcaacgta gcggttgcga 6781 atcacgaaat taaacaccaa gttcaacacc aacacattca acaccagtcc cgcaagacaa 6841 attatgttga attttaaaaa ccgcttcaag cgctgacgcc attcctgctg ctgactgctg 6901 acatcagcaa acgtccagat gtcattccac aagaagttat tcaaaattgc cacttcccca 6961 gctatgattt tactgcgtgt caggggtaaa cccaaagtcg ttgggtcact gagtaagtac 7021 agtactgtca tatccacaaa cacccccgat aatcctacca agccaaagcg gaggaagcga 7081 tctatgggga aatttcgact aaatttcgac agtctccctg ttgatacgcg taagcggatt 7141 aagtggtgga tgtaatcgac atactgtttc caagtaacct tactttcacc ctctttgcgc 7201 tcactgaaca catacccaac ttctgcaatt ttgcccacgt ttccgcgccc gatggtttcc 7261 agcaaaattt tgtatcccac cggattgagg gttttaccga ctatacagtg gcgacggagc 7321 ataaaataac cgctcatcgg atctgacact cttgagagaa ccccaggtaa gatgatcaat 7381 cctaatagct gagcacctcg tgataacaaa cgcctgacaa aactccagct actgactcca 7441 ccgccttcta tgtgacgact agcgacagct aagtctgctc cttcctcaat tgcttgcaat 7501 agctgcagca gcacatgagg tggatgttgt aaatcagcgt ctataactcc taaaatattt 7561 ccagtagcag cttgccatcc ccgaatgaca gcgctggata atccccgttc accctgtcgt 7621 cgcataactc gcaattgggg atattctgtc atcaatgact gtgcaatgtt ccatgtgcca 7681 tctggactat catcgtctac cactatcaac tcgtactttc ctggtatgta ctcatcgagt 7741 aagttggtta atatggagac aattttttgg atatttgccg cttctttgta 
agttggaata 7801 actagagaaa gtacaatggg ttcactattt gcctctgttg ttgcagctgg caattctaat 7861 atctgcaatg aaccagtggg tactgttaag agggaattag gttggatgat actcattcaa 7921 aaaggatgat atggtaaaca gatttctagt agttttgtta ataacaccct taccaggttg 7981 tggttagcgt tattttctag tagttaacta agataaattt tattcacaaa gtagcactta 8041 gacgcgcatt agcttaccaa tttatcttta agcagcacct cttcctgatt tataccaggt 8101 ttccaaatta ggtaaataag gaaaccagtt gctacgtata taaagaaggg tgtcattaaa 8161 gagagacgaa cattataata ctgaggcaat actactttgt acaagtaatc ttgccaatga 8221 agaaaaggac ctcgatacag aaacatcact atatttataa gtaaattatt agacggaatc 8281 gtagaaactc cagcgagatt aataaacatg gataatactc ccaataattt aatatacttt 8341 agaggagagt agactactgg aagaagaata aatggggtag caaccactaa atgacgtgga 8401 ccaaagcaga catcacccga ccatgcatag taagaagcat ttaacacaaa aataaaggca 8461 aagaagataa aacagaacag tataacattg tttttttggt aagtcttttt tacaaaggaa 8521 ccaagaaata aaatcgtcat gggaaagtat agaaaaattc ctctaaacga actgaatgtc 8581 aaactccaaa ttgttcctaa agaaggaaca ataaacatgc ttttttggat agtgtcttct 8641 tgatttaact gtttgagaaa cagtgagtta ggagttttaa acactgaccc tgtaattgtt 8701 tggttataga acataatgac tcctaaaaag attgagtaac ccaacaatag caagaagata 8761 tttgtcagca aacttttgtt atctttctct tgaattctta aatatatcca aaatccaatt 8821 agtaaagaga ttggcacgat tgcaatataa tcaaccattt gggctaagcc caagaaaagt 8881 ccaataaaaa agctatttgc ttgattgata attaaataaa aagctagaaa aataaacgac 8941 attgtgtaga catgagacca aatcccatta gtagagtaga aaaaaactaa tgaaccgaag 9001 atgaaaataa aaacgaccca taattttttg acaagatcat tcgtgaaata atctaaagtc 9061 ttaaacatta tgactgccac aattccaaga agaggagcgc taacacaaac gttcgataag 9121 atattaaaga ttgcccaata aacttctcct tctggaatac caatcagcct cattaacatt 9181 tggtgaataa agtaaagagg agccagaata attggaatac caggcggaat aacagagtaa 9241 gcattattat taagcagcaa aatatctata tgagatcgat atcctttttg aagtgcaaat 9301 gtaccaaaat tcacaaatga gtgaatggca tcagtatagc gacttgtttg aggtccatca 9361 ggtgttgcat gaagagcaat caaaataaat acagtgatga atattctatg agaaatgtgc 9421 ttgatcattt ttaaaaactt gccattacaa agtttgccct atcaaaaggg gagattgcca 
9481 aaggctgaaa ctaaattgac agaaaaactt agcggtaaag gcttataccc aaattagagt 9541 gatagtacta tccatctagg agtttacctt ctcactgggt agtctaccaa ttagcaagta 9601 agctaaccgc agcctttaaa ggcggggctt tgtagtagcc ctgagttgtc gatgtgaata 9661 gatgcttgcg gcttactttc tgatagctag ccactttaaa ccacgtataa ttactgttac 9721 accgtttgat gaacaaatta taagaccgtc attctgaaag aaaaatcact tgatttactc 9781 ttgacaaaat cagaaaaaca gtaaagatat agttaaattt tattgattgg attgaaaaat 9841 aaccaattgt atgttttcac tcattaaaaa ttggctaata tttgatgaaa aatagcattc 9901 aagaacaggt tttcagacag caatattttt ggtaaagtta tttggtgccg caagcggttt 9961 gccctaaata gtgtttagga atatggaaaa cagtggcaga agactgaaga attcatcttg 10021 cataaatctc taaatacttt ttaatgatta gagttccaaa atgtgcgaat ggtttcagaa 10081 caataagcaa acttaataat aaggtagata tttttgccta ctgacttatc aaaaatgtca 10141 aaataataaa tgtggtttgc ttttgagaaa ttgtggttac tataactgtt gatgctattc 10201 ttgagcgtgc tttgagtggg tatgatttac ttcccgaaga ggcagtggtt ttattaaaac 10261 aaaatgacca aagtgcagtt gctgctattc gcgctacagc tgacaaactg cgtcaacggc 10321 aagcggggga cactgttact tacatcatta accgtaatat taattttact aacatctgtg 10381 agcaacactg tagtttctgt gctttccggc gagatgaggg ggaagagggt gcttactggt 10441 tagattgggc gcatattttg gaaaaggcga cagatgcagt gcaacggggt gcaactgaaa 10501 tctgtatgca gggaggctta aacccacagg cgaaggtgaa cggaaaatct ttgccttact 10561 acctcaaggt tgtggaaaag attaaacaag aatttcccca actgcatcta catgcttttt 10621 ctccccaaga agtgcaattc atcgcaagag aagatgcact gcaatatgcc gatgtgatta 10681 tcgctttcca agatgctggt gttgattcta tgccaggaac cgcagctgaa gtgttagatg 10741 atcaagtgcg gcgcattctt tgtccagaaa aaattaatac aacaacatgg ctggaaattg 10801 tgagtacagc tcatagacta ggtttgtaca ccacaagcac aatgttatca gggcatattg 10861 agacgccaga gcaacaaatt aagcatttgg aaaaattgcg atcgctccaa caaactgcca 10921 ttcatcggga ctaccctgct cgcataactg agttcattct attacctttc gttggcaaag 10981 aagctcccaa acctctacgt cgtcgtgtag gacacgatca acctattttg ggagatgcac 11041 tgctactcac tgctgttgca cgaatttttt taggaaactg gattcctaac catcaaccaa 11101 gttgggtaaa actgggtttg gcaggtgcaa cagaagcttt attgtggggt 
tgcaacgata 11161 ttggtggcac attaatggaa gaacatatca caacaatggc aggtgctcaa ggtggaacca 11221 acatggaagt cgaaaccttg caagctgcta tcacctccat cgggcgtcct taccaacaac 11281 gagatacttt atatcaaaca gtgggattat gagggaaaga tcttttccca cactccctat 11341 ctcctcgtat tgccgtatcc tgcgatgaaa tgatccccat gataccaatc caggatacga 11401 tggacgttga tttaggtatt gtttcactta acgtttatga agcgctattg gtcacgtctg 11461 cttgccctgg ttttgcttgt cgtcgttggt ttgatgggct gtaacagcag tccgggtgct 11521 ttaacaggag attatcgcca agatacctta gctgtcgtaa atactttgag gactgctata 11581 gagttaccac aagattctcc agatagagca tcaactcaag cagaagcgcg taagaaaatc 11641 aatgactttg cagctcgtta tcagcgagat ggctctgtct ctggtttggc ttcttttaca 11701 accatgcgaa ccgcccttaa ctccctagcc ggacactata gttcttatcc aaatcgtccc 11761 gtgccacaaa aactcaaaga ccgcttagag caagagtttg agcgggtaga agcagcatta 11821 agccgtggtg cttaactatt gcggcgtaag tcccaccact aaacggagta ccgttatagc 11881 ggtgggatta agccaacaga tgaccaaaac atcactgaac taagccttaa aaacaaaact 11941 cagctgactt ataagttcag agttcaaagt tttgagtgct gaacttttac tctataccga 12001 tgataatttt gtcacatcat taagttaaaa aagtcacata atttgggttt ttctttttga 12061 attttgaaat ttcgcgtcag aaaagcttgt agataccaca agattgttgc tctcaacatc 12121 acaaacaatc tgtttatcag gacgttgtat gaaaatttcg tgttacttaa attgagtcac 12181 aaaattcggc aacagaaata gagatttggg agtctatacc gtctgggctt caaaacccta 12241 tttatcaaaa acgcaatttt atacacatta tcctacgtaa cttccgttgg gattgattgt 12301 tttgtctttc tacttgtacg tatgtccaac cgtcctttga atgttgcaac aatattcttt 12361 tggcaacgcc tgcttgctgc tctgattttt agcagtagtt tgctgattcc ctttttaaaa 12421 aatcagccag cagcggcgca gctgactgag tattgccagt taccaccacc tcaggcgcaa 12481 gaaaaagaaa acttacgttt gtctgcactt aagggtgatc aacaggcaca aagtcgttac 12541 cagcaattgt tacaacaaaa cgcacaagtt ttgcaggaat gccgcaatcg tacttggcct 12601 cagttgcaag caatttggtt gcgtttgtat ccttgtgatg ttcaaccagg aatggttgac 12661 caaatcatgg atcggattgt caaccgaggt tataacgaag tttatttgga agttttttac 12721 gatggtcaag tacttctgcc aaaaggggct aatcccacgg tttggccttc agtgattcgc 12781 acaccaggca cagaaaagac tgatttgctc 
gccacagcaa ttcaaaaagg gcgggaacgc 12841 ggtctgaaag tttacgcttg gatgttcacc acaaattttg gctatactta cgcccagcgt 12901 tcagatagag aaggggcagt tgcccgcaac ggtaagggtc aaaccagcct ctacgttgta 12961 gataacggtt ctcaggtctt tatcgatccc tacaacttgc aagccaaacg agactattat 13021 caaatgttac aggaagtggt gcgccgtcgt ccagatgggg tactcttcga ctatgtacgc 13081 tatccacgac aggcaggcac tgattccata gccacaaaag tcacagattt atggctgtat 13141 agtgacgcga ctcaacaggc tttattccga cgagcactga attacaaagg attggaatta 13201 attagacgct ttttaactaa gggatatatc acagccggag atgtaaatga ggctgataaa 13261 ctctatcctc aagaaggaga acccttatgg caaggtcgca ctccacccca agagcaaaag 13321 tcaatccttc ccccagatca aaggcaaccg caactacaat cggagttatg gcagttagca 13381 gttgctcacg ccatgcaagg catcgtagat tttttagcct tagctgcata tccagcacag 13441 caacaaggta ttccagtagg agcggtattt tttcctgatg gtaaccaaat ggtcggtcag 13501 ggatatgatt ctcgcctgca accttgggat aagttcccaa atacaataca gtggcatccc 13561 atgtcttatg caaattgtgc tagtgctgat tgtattgcag cacaggtgca gcgggtttta 13621 aacatggcga aaccagatac ccaggtgatt ccagcgatag ctggtcagtg gggagcatca 13681 ataagcaatc gtcctccctt ggaagtgcaa atgcaggcac tccgcaaact tgcaccgcaa 13741 atcaagggag tcagccattt tgcttattct tggcaagatc cagagtatga taaccagcgt 13801 aagttctgcc gcgtgcagtg attagggagt cgtgaccgag tctaacacga aaaaacagcc 13861 agccttaata cgatatttcg tcttgactcc acgatgaatc gaggtgcttc aaaggctggc 13921 tgtttttagg ggacaaattg gtgagcaata cagttcagat aagatagttg acattttagc 13981 acttatgagc ttatccgaac tgtattgagg aggaggcaag agtgccaggt aataaataga 14041 attagcccca cacaatctgt aaatccaaaa caaaaccagg taaaacttct tctccagata 14101 agcttgtggg ggactctaat acttcttttg gttgcccttg tctataaatt tcgacgcgac 14161 gtgtcattct ttccatcagc caacctaatt ttaccccggc atccatatac tcttgcattt 14221 tggtgattgt atcactcaag ctatcagatg gagacatcag ctctaaaaca aaatcgggag 14281 caatgggagg aaatttttcg cgttgttccg gtgtcagcgc attccatctg gttagtttta 14341 tccaagagac atctggagaa cgtgaagaac cgttgggaag tttaaagcaa gtggaagaat 14401 cgaaaacttc acccagttga gtttgttcat tccaaacaac aaatcgagca gttagcttcg 14461 cattgcgttt 
ccctgtttct ccccccgttg gtggcatgac aatcagttcc ccgttaccat 14521 tgcgctcaaa tttgacttcg ggattatcct ggcacagttg gtaaaattgg tcgtcgttaa 14581 gtttaacgat attttttaag ttgatggtaa tagctgtcat aatgacctgc caagagagaa 14641 attttcaacc aggattttct tgccacgtta ccatcggctg caacaaccgc gctttttacc 14701 tggcgggacg ggaataccgg gaagtcttct gtgtatccca tcaaaaagac tttggaaaat 14761 tcactccgcc aggttgtacg gttttgtgtg tcaaaatcaa agctaaatca aagttttgtc 14821 cgatcagtca aaatctcgta tccagtctct gttaccaaca cagtatgctc aaactgagcg 14881 gatagagaat tatccactgt tacagctgtc caacggtcag ataacgtccg tgtatacttg 14941 gaaccagcat tcaaaatagg ctcaatcgcc agcgtcatcc cctcacgcag tttgacatta 15001 ggtatttcgc gagtgcggaa gttaaaaacc gagggttctt catgtaggtt gcgaccaaca 15061 ccgtgtccgg taaagtcctc taccacaaca aacttgtttg ctttgacatg atcttcaatt 15121 gccccagcaa tatccatcag gtcgtttcct gctttcactt gttcaatgcc tttaaaaaga 15181 gtttcttctg caacacgaat cagtctggct gcttctgggg tgacttcgcc gacagcgatt 15241 gtgatgcaag aatcaccatg aaagccttgg taatatgcgc cagtatcaac ttttaagaca 15301 tctccagtgc ggataacttt ttttgcacta gggatgccat gcacaacttc attgttgata 15361 ctagagcaga tagaaccagg aaagccgtga tatcccttaa aacttggtgt tgcgcccatt 15421 tccctgatac gtttttccgc atgagcatcc aaatcagctg tcgtcatacc tggcttaacc 15481 agctcagaaa tttcttttag cacagttgcc actatttttg cggattgccg cataatgtca 15541 atttcacgcg gcgatttaat ttcaattcct ctacgttgtc ttttttgagg gttgttttga 15601 actggcagag aaagcaggtt actgagaatg ttcatgggaa attaataatt tttgtgcctt 15661 gtctgttgtg tgttaatact tcttctaact ttgttgagaa ataatacttg tatctcaata 15721 tctaaggtaa cttatttttt gctatagcaa tttgaaatca taagcctacg agcttacgtg 15781 aataacaaga ttcccgactt ctcactcgaa gtcaggaatc tgaaccaata gatttatgcg 15841 ctatgcgcag gcactttcac acaaatcaaa cacattaacg ccgcagcaac ttcaagccaa 15901 tctctccacc catgttctga gcataagcca gttgtaccaa gagcatcatt gctggttcga 15961 taaagcgtgc attcttcagt tcaccagcgt caagtggctc aacttcgatt tcacgcaata 16021 atcctgcgac aatctccttg gcttgggcgt catcaccaca gtagaaaaca gttgcttctt 16081 gctgaccaaa ttttcgcgaa gcagagtaca ggacttccgc aaaaacgggt aatgcttcaa 
16141 caaggcgtgc accaggaaca agctttgcaa tctcctcagc ccctgaagtg gttgtcccaa 16201 ctgccatccc gctcatatct ggtgtcagtg cattaacaca actgaagaga atcttcccgt 16261 ctaaggaacc agcagctttc aaagcatctg ggacagctgc ccaaggcaca gacagcagga 16321 cgacatcagc gaactgcacc gcctctgttg gcgtaccaac acgaacactt ggatcgattg 16381 attctgccag ggacttcagc ttctccatat cacgtgagta actgaacata aggttgtgac 16441 cattcttgat ccacaacttt cccaagccgg tgcccatttt gccagcacca atgataccga 16501 tgttcatatt tttgtttcct tgattgaagt acaaattgct tttgtggatt atgccgactg 16561 attcgctggt gtgatcagca ggcgaccaat atttacgttg ctgggttggt ctagcaggaa 16621 tcgaattgct ctggcgatgt cctctgcgac aagcggttgt tcaaccgcta ctgtttgctt 16681 ttcctcttca ctccagtttt ggtaaaggtt cgtcgctacc ttacccggtt caatacagcc 16741 aacaccaacc ccagaacccg ccaattctaa tcgcaagcaa tcggtaaaag attctatggc 16801 atgtttagta ccacagtaag ctgccattgt cgggtagttt gtcgtccctg aaatgctgga 16861 caggttgaca ataaatccat tggcattttg tttgaagtgc cttgcaaaaa cataagccat 16921 gcgaaaagcc gcttcaacat tgatgcgtac catctggcaa agtgcttcaa tatcaacagt 16981 ttctactgaa ccaacgacca tcaccccagc attattaacc agcgcatctg ctcgaccaaa 17041 tgagctaact gcatgctcta acaagcgttt gggtaagtct gggtcagtca cctcaccagc 17101 aacaaatgta gcatgagtga ggctggtagc taacttgtta agtttgtctt cattcctagc 17161 ggtgaggaca agattcattc cagcagcatc caattcgcga gcagtcgctt caccgatacc 17221 gctgcttgct ccggtgatga ttgcagtttt tccctttaat tccatgataa aaatctccta 17281 aattgagttg acaactgtct tttctacttg aaaacaggtc tgaggatgat tttcaggcat 17341 gactcatccc tttgcaaaag cggactatga cagatctacc caaacggtct tggtttgcaa 17401 gtagaactcc aatgcttcct gaccgttttc acgtccataa ccgctcattt tgtagccacc 17461 aaatggactc gatggatcaa agtgaccata ggtgttaatc cacacagtac ctgcctttaa 17521 ccgtgcggct gccttgtgtg ccttcttgat atcgcgagtt tgcacaccag acgcgagtcc 17581 gtattgatgg ctgttggcaa tccgaattgc atcattaaag tcctcgaagg gaatcactga 17641 gagaatggga ccgaaggttt cttcacgagc gatcgccatg tcactgttga catcgacaaa 17701 gacagttggt tcaacgaagt atcctttccc tgtaccgata tctgctgttt taccaccagc 17761 aaccaaccgg gcaccgtttt gtttgcccac ctccaagtat 
ctaagcacgt tgtcatactc 17821 gctcttgttg gcgatcggtc ccatttgact atcgggatca agtggagcac caacttttat 17881 ttgcttcgcc cgttgcgcca cccgttctac catctcgtca taaatcgatc gctccaccaa 17941 catccgcgat ccggcatagc agatttcacc tttgttgtag aacataccgt aataggcagt 18001 ttctatggct gcatcaagat ccgcatcagc aaagacgata ttggcggact ttccccctag 18061 ttccattgtt aggtgcttta aagtatcggc tgagtcttta atcacccgaa taccgctagc 18121 agttgaccct gtaatagcaa ctttttcaac cataggatgg gtcgagaggg cgtgcccaac 18181 cgtacttccc ggtcctggaa caacattgaa cacgcctgat ggtagtcccg cttccgccgt 18241 aatctcagcc accttcagcg ccgtgagagg ggttgttgag gagggtttat gaacgattgt 18301 attacccgcc gctagagctg gtgcgaattt atggattgag agaatcagtg gaaagttgaa 18361 tggagtgata gctgctacca caccgagcgg ttcccgtaaa gtataagtgt gcaagttttt 18421 ggtagtttgt ttcaccgcac cctccagttt ggatgcccac cccgcaaaat agtgaaacgt 18481 attggcaaga tggggaatat ccaccgtgat gctgtcggta aacagccgtc ccatctcttt 18541 ggcttgtaag tatgccaact cttcaccgta cttgagaaaa aggtcaccaa ttcgccagag 18601 gattgtaccg cgttcgtgtc cgctcatctg tccccaagga cctgtttcaa aggcttgata 18661 cgccgctgtg attgccgctt cagcatcttc tgctgtgcct tcggcaatct gtgcaatctc 18721 ttcttcagtc gttggatcta tgactgagaa agttttgccg ctagcagaat cgcgccattc 18781 gccattgatg tagagtttgt tatgagtgaa gggaaaatca gggcgtgctg atattttcgg 18841 ttgttctgcc tgtgtacaag taatcattat gatttatata tctatcttgg taagagatac 18901 cacaggcaat caatagttac caaatagata ctaatgaagt tgttatgaga gcatcacagt 18961 tggcaacaga atataactgt cctgttgagg ttacgcttga ggttattggc ggcaaatgga 19021 agtgtgtcat cttgtggtgg ctaagacgag atgcaaagcg gtttggggaa ttgagacttt 19081 tgattcctag aattactcaa aaggttttga cgcaacagtt acgtgaactg gaacgagatg 19141 ggctgattcg tcgagaaacc tatcgacaga caccgcctcg ggttgaatac tcactcacac 19201 catacggtga gacgattcga ccgatcacag aattgatgtg tgactggggt aagagccata 19261 gaccagagta caacttcggc tatctccggc taaaaggctt acgaatatta gtcgtggcgg 19321 ctgaggctga ggtgcgcgtt aagcgtttgc gaagcgcccc cttaggggct agctctaccg 19381 ttagcggagc gtgtccgcag gacataggta atcgcctacg cacagtgctt gaagaacatg 19441 atgctcaggc gctcgtggtt 
gcatcaactc acgcagcact cgaagtgttg cttcaagagc 19501 cgcccgatgc gctaatagtt gatattggag catcaagcga agacagttat gctctcattc 19561 gtcaggtcag aaatctttca gtggagcaag gcggtcaaat tccagcgctc gcgctaacca 19621 ataatgattt ggagcgctca cgagtcatca aagagggatt tcaagtccat ttagccaaac 19681 cgttcgatcc agtggaattg gttgccatcc tcgctagcct cactagttac tcccaatgag 19741 agtagtattt tgctaccatt tgtgctaatc atgatcctcg ctaagtccga agaacaaggc 19801 ttagggttaa tcaacagaaa tcaccaaaca catacattaa cctgcacttg gggaagtatt 19861 tacctctaac caataatcga caaatgcctg aatcgtcttt tgcagttctt gagcatccat 19921 cggcttgatc agatagccgt ttgcgccttt cttgtagcac agttcaatat cttttgggtt 19981 agatgatgtg gtgaaaacaa cgatagggat ttccctaaag ctctgatctt gcttgagccg 20041 ttccaggatg tcacgaccat caatacctgg caaattcagg tcaagcaaga taacagaagg 20101 tcttggcgct aagtccgggt tttgataatt cccctcttta tagaggaagt ctaaaacctc 20161 atctccatta gtacagcgat atatggggtt cgggacagcc agccgccgca tcaggcgttg 20221 tagcatccta aaatcctcat tgctgtcctc aacaaccagc agtggttcat tgagttttgt 20281 ggtcatgact tcaaatattg tcttaaaatt aaacatttct caataaaatt ttacattatt 20341 ccagcgtaaa atagaacact gagccagtgt ctacagtcga ttcaacccaa atacgaccac 20401 cgtgacgctc aacaatcttc ttagaaatgg ctaacccagc acctgtaccc ccaccaaact 20461 tttcttgaga gtggagtcgc ttaaagagcc tgaaaatagt ttggtggtga tgttctggaa 20521 ttccaatccc gttatcttgt acataaaaga caaaaggtgc tgaagtttgc ggttgttgtt 20581 gaagcaaccc tttttccatc tgttcattta tatccaaata accaatttca acccatttat 20641 ctgatttatc attgtattta aaagcattga tcatcaagtt actaaaaact tcgctaacaa 20701 gaactgcgtc acattgaatc ataggcaaac gtctagggat gcgaatatct aactgagatg 20761 gcgaacgact ggcacgtaca agatcaatca cttggtggag caatccgttg aggtcagttg 20821 cttgcaggtt tagttgggct tgccctaaca aggaaagtct caaaagtgag ttaatgagag 20881 tttccatgcg tatggataag gataccacgg tctctaggta ctcaactcca tcctcatcca 20941 gtagttgagc gtaatcttcc agcaatatat ttgagtagtt gtaaatgccg cgcaaaggtt 21001 cttttaaatc atgagaagcg acgaaggcaa aggaagcaag ttcgcggttg ctgcgctcta 21061 actcttggtt gattttggct aactcatctg ccttggaaag tacaatgcct acaattgcat 21121 
tgcgcagagc gatcgcactc tcaagttcac acgatttcca agctagcgaa gtcaatcgaa 21181 ccgtttcttg ccagagttca aacgatgttc gcggacaaag ggtaacgcta ccatcagctt 21241 taatctgaat tgattcattc ggctgacctg cccagtttac cgtttggata acttcaggac 21301 gaaaccagag gatataatag cgccgaactt tagaaattcg caacagtagc aaaccacttg 21361 cagtatcttt aaaagcaaca gcctctggat aaagcttggg cagagaatcc gtggaaaaga 21421 gattgtcagt gatctgagtg tctacccatt ctataagggc gcgaacttcc tcaattgttg 21481 gtgttgcacc tacaagcgta atgtcatcat ctaaacaaac tgctgctcct cgcgcattga 21541 cgatatccag taagcgaagt tcaggataaa caagtgcttc tctaaagttt tctgccccgg 21601 atatggactc tatgaaatca gcctgtattg attgaagctt aaccattaag tcaaattccg 21661 actgattgac tttgtgttct aattccaacg acacaatctg tgctaaaaat tcgcaaattt 21721 tccgcacttc gtaaggaata aacttcaatg tttgatgatg gcaggaaatg agtccccaga 21781 gtgtttgctc cttaatgagc gctatcacca ggatggctgc tacacccatg ttttgatgat 21841 attcaacaca gcagggatga gtacttctaa gtacggacaa gcttaagtca acagatgtag 21901 aatgcatttc cggattttcg accgccatca actctacagc ttgggcattc atatcgggaa 21961 tgaatcgaag caaacagcgc ttatacaact ccctagcttg ctgggggata tccgtagacg 22021 gataatgtag tcctaaataa ggtaataaat tatctttttt tacttcggca actacggaac 22081 ctgctcctac ttggtcgaat tgatagacca tcactctatc aaactccgta attttttgca 22141 cttcggatgc tacagtgtgc aagaaatctg tgaggttgga tgtctttcgc attttggcga 22201 tcgcctcttg cgccaaagca tgaaagctca aaaagctgat ctgattgata gaatcagttg 22261 gttctagctc aagaatcaca gtactttctg tacgatgagc gatcgcatca aaatattgct 22321 caccctttga agtctcgata gacagcttga aagaataaac gctacttccc gaaccttgcc 22381 aaaactgctc aatagctttc acttgttctg cgttaagcag agtactgagg ggttgattga 22441 gtaaatcgcg tggctcttta cctaaaaaat cttgagtatt attgcttgtc aacaggattt 22501 ctagctgagg actaaccgcc agtagtacgc catgaggctg gatcgagtta ggcagatgaa 22561 ttggttcgtg gtgaatcagg ttggtagctt gggtaggggt aatatcaaat tggctcatag 22621 cttgactcac gagaaccaaa cttatttgtt ttacgatacc gcaattaaac gttatatttt 22681 ttaaaatttt aatgcataag tataggaatt tggtttggtt tatgagcgaa ctcgttgagc 22741 tagggaacac ttaacaggtg agaccagccc ccgtcttggt gaaccagcac 
tgcgggaggg 22801 gagccactgc tcgactaggg tttcccggct tgtagacgcc cggagggcgg tgaaagtcgc 22861 acgcccagat ttaacacaga ggtgcggcgg ataagcgcga cttgaaaggt cgcttagaac 22921 ggagagccga ctctcaagcg agaattcaca ctctctaccc taatgtaaaa gactctccga 22981 agtccggtca cgcccaagta ggtaacgaaa caggcggaat gcagaccaca aacgtaacct 23041 taactgctag cccaaagtgg taaggagcaa gagggagttc tcccaagatg aacaaaagta 23101 aattgatgtt taaactgcgt catcgcacaa agagccaaat taggctgaca ggctctaacc 23161 aaaaggtaag tgggtagatt tatggtttta tgccctataa atcgtgtgaa tataactaat 23221 tccaaaatat aaccgtgata aaggtcgccg acccaatcca gaatggcaac gattacaagg 23281 gcaggctcaa cctttgaaaa agaagggtct attaattgag gctcatactg ctcgcagact 23341 aatgcagcag gttccctcct tagacccatt cgacccaaac tatcgccgcc tacgttatat 23401 tccaatacgg ttcagttaag aagaattgta ggttgggtta agcgcagcgc aacccaacaa 23461 aaacgaggta ggtgttgggt ttcgttccaa gggcccaacc tacgtcagca ttagttttta 23521 gccttatctg aaccgtattg agattcaggg aagaacgaac caacaggtaa aggttgacca 23581 agtaaaaaag caacaataat cgcaaatgtt accgcgccga aaacaactca gaagatgtgt 23641 tggctttttg gctggtggtt gagtgcgtgc tacatgattt agcggtaagt catcataacc 23701 actctcacaa ggtgtagcat cagctttcat tcttgcgtca gaggtaaacc caaccaatca 23761 cttaattcat gagcaagcca ttcaatttct ggttcagatt taatcacacc atgagtaccg 23821 ccaagctgat atttatgcac tcctgcccaa atgatcaatt gaggtagaat ttcaactctg 23881 ttaccctgag agtctgtttt aaaggtcttt ggcgagtaca ctaacttagt gatatcctgt 23941 gttggtgatg aagcaacacg attgaatttc aagccaaaaa aatcccagat gaaggcaatt 24001 ttttgccgat ttaaacgaag tcgagtcctt ctgcacaaag gaaagaaaat ccaagataac 24061 atctgaaaac cagcgcccca aaagggaagc gagaataaag caaaaggtat gtttatggga 24121 aaaggtgcag cgagggcgct aattgtccaa aaaagaataa aagaattcca agaaatagca 24181 aacaaaacca tgaatgtcat cgacggctga aaaccagtcg gtggaatcag aagatcaaaa 24241 gaattttcat ctttcttgag tgtgattttg cttcctgctg gtttcgtcat tttgatggtt 24301 aaactatgac gtccagactc acttttggtg ttaagctttt gcgaaatttc ttgatcttta 24361 ctagtccaat gttgaccatc ccaataccaa atccctctct tcttgtctgt gaggtgcatt 24421 tgttctgggt gctgttgttg taaaagtagt 
ggatgttcca aagcagctaa tgcttcacgc 24481 gcagaactca accgtcgttc taaactaggt tcgctcattc ggcgcaacca ctcagttaaa 24541 ataggactca gaattgccgc ctgctcaaat tgaatccgta agtccttttg aggtaaatct 24601 gctggttgtg tccccgtgac taagtaaatt aaagtcgcac ccaagctgta gagatcagat 24661 gcaggaaccg tgcgtccacc aaattgctct ggtggcatat agccataagt tccaactaca 24721 gtcattgtcc caccttcaga ggctgctaca gtctgcaccg agccaaaatc taccaaatat 24781 atttgaccta cactattacc agagcgattt gtcaacaaaa tattactcgg cttaatatct 24841 cggtggattg ttggaggttt atgctcatgt aagtatatga gaattgttaa aagtgcttta 24901 gctatttgtt tgacttctgc ttctgtaaaa gtacgcccag ctttgatgta ttcgtccaat 24961 gtttgcgctg ggatataagt ttgtacaaga gcaaaccctt tgatatttga tgaatttatc 25021 tcaaaatagt ctaaatagcg aggaatagac ggatgtgcta tttttttgag agtttcagct 25081 tctcgctcaa acagcttgag atcctcccac tcaaagtcac tgctaaaaga cagtaacttg 25141 acaaccacca tttcaccagt tacaagatca cgagctaaaa gcgtccgtcg cccagccttt 25201 tttcccaatt gctgctgtat ttcgtagcga tcgcccaata cttgaccatt cattatgatt 25261 gcttatggat ataaatttaa aacctattta tttattttat tttcttaacc tagctagata 25321 catatctttt tttgaatttc tcctgtggag tgaattttga attgcctatg ccctcccaac 25381 tttggcctta cattactcct ggtattccag atgatttgtt tgaacgcttg ccaggaattc 25441 cgctgagtaa gcgagaaatt cgactacagt tgcttgccca actgcgtctt ttacccaaca 25501 ccgtgttatg ggatattggt gcagggacag gaacgattcc aatagaagcg gggttactat 25561 gcccaaaagg acagatcttt gctgtagaac gagatgaaga tgtcgccaac ttgatccggc 25621 acaactgcga tcgctttgag gtgaagaatg tcgaagtcat tgaaggaagt gcccctgagt 25681 gtttgcagaa cctaaaagtt gctcctaatc gcgtttgtat tgaaggagga cgtcccatcc 25741 aagaaattat gaaagcagtg tggcattact tgcaaccctc aggtcgagtt gtcgccacag 25801 ctgctaatct agaaagtctg tatgctattt ctcaaagctt tgcccagttg caagccagaa 25861 atgttgaagt catccagtcg gctgtgaacc gtttggagac acgaggttct tctgcgactt 25921 ttactgccgt taatcccatt tttattctca gtggtgagaa actagactag aaaacagaat 25981 catcgaagaa agaagtacga agaaacgaag atgcaaaaat ttcttcaatc ttcatttttc 26041 ttttttaact cttgactttt tccttctttt gcccatgcct tggtctcgga ttgttagtgg 26101 aattgttgca 
atagcccttg ctttgactgc gacccttttg ggtgggtggt actttactct 26161 tttgtttgcg gttatcgtta ttctaggtca actagaatat tttgatttgg tgcgagcaaa 26221 aggcatagtt cctactgcta aaaccaccat gtttgtcagt caagtactgc tgatcatttg 26281 cactttagat aggaacttgg ctgatgccgt aatgccactg gctggcactt ttatttgttt 26341 ttacctgctg tttcagccaa aaatggcaac aattgcggat attgccgctt ccattttagg 26401 gctattttac acagggtatc tgccgagtta ttgggtgcgg ttgcgatcgc tccacggtgc 26461 gactatcagc aatctacctg tgagtgatta cctgtccaca atttggacaa atatagtcaa 26521 cggaaatttt agcgctttac cccaaggtct gaccggaaca gcactcactt ttgtatgtat 26581 ttgggcagct gacattggtg cttacttttt tgggaaattt tttggcaaaa cccctttatc 26641 agatatcagt ccgaaaaaaa cagtcgaagg agcggttttt ggtattgctg gcagtgttgt 26701 cgtcgctgtc gtcgcagcat attatctcaa ttggcctaaa ttcgttttta ctggtgtcgc 26761 attaggtttg cttattggta ttgctagtct attgggtgat ttgaccgaat ctatgctcaa 26821 gcgtgatgct ggggtaaaag attcagggca gttaatcccc ggtcatggag gtatcttgga 26881 tcgtacggat agttatattt tcacggctcc tttggtttat tattttctca ctctcttgtt 26941 gccactggtg ggtaagtagt gaagaaactt cttgcaaaag tgaattgcta tcaaatgagc 27001 gccccacgag ggggaagtca aaagtcaaaa ggagccagtg cgttgggcgg gttccccgac 27061 ttgaagcaac tggcgttcaa aagtcaaaag tattaaaccc cctcgcaact tttgaaagaa 27121 gcacattttt cttcactttc tgaaagtctt gctctataag gattttaaaa tgtgcatctt 27181 aagagtattg atttgattgt gtcgcagtca aggaaccgaa gttcgagatt gatggggtgt 27241 ttttgctaaa tgctcccgaa atcccactcg taacttattg gtgccaaaat gacaccaata 27301 agttaaaatt agaagaacat gacaccatta tctccatgtc taatagtcac aaaaatacag 27361 ttcggataac agcgagaatt gctgtgagta ttcaagaaac tctagaaaga gcagcagaat 27421 tatcaggtgc taccttgaat cagtttatga ttcaggcggc gttgaaagaa gcaaaaaaaa 27481 tcatagaaga tgaacgagtg attattttgt cacaaaatga tgctgatacg gtatttagcc 27541 taattgagaa tcctcctgta ccgaatgcca agttaaaagc agccctgaaa aagcacaagg 27601 aatttttcag tgagagtcat tgaattacta ggtaagcagc atgaccgcga tagctttaac 27661 tgtggcaatg aagcattaaa tcaattcctt aaacaaacag caagacagca tatccaaaaa 27721 ggtgtttccc gtactttcgt tctagttaat acagagcagc cagaagcgat aattggtttc 
27781 ttcacattaa cattgtgtga agtgcgtgta gacaatttga gcgcgaaatt ttttaagaaa 27841 tatccttcaa aagttcctgg tgtcaagctg gcaagattag cagtcgatca agcctatcag 27901 cgacaaggaa ttggagaagt tttgatgatt gaggcaatgc agcgtgcctt aattgttgcg 27961 gaaaatgcgg gaagcattgg tttatttgtt gatgcaaaag atgaaagtgc aaaaacctac 28021 tactctggtt atggttttgt cagcttggaa gatgcgtctt tagaactatt tttgccattg 28081 tcagtcattg agcaaatgct tgagtaatgg ttaagctacg tcaaggcgcg atttacagac 28141 gaatgaggaa cctcagaacg atttattggt atcagctatt tcagtgcaag cgatcgccta 28201 ttcaactgat acacagcttt ttctaattca aataaatccc agacggatgt tttgttgtaa 28261 ggaaatgcac ccgacttggg aacgggaatc tcaagagtgc gatcgcttct cttacacaga 28321 agaatcgaag cgcgatcgca caccgataaa ataaattagc tatctgagga aggacgcaaa 28381 gctttgagat cctcccttag ttcctctaat ttctcacccg attctttggc gatttgagag 28441 caaagagttg ttgaaattaa gccggtagcg gtggtgacgc tcccttcaga ggctttacca 28501 gaaatcaaaa gcccagcacc aattacgctg atgccaaggg aagctgcaac tgctacaaca 28561 gagaggttga aagtctgccg tgcctgacgc agatgctcat gaaatacatc tagcatcatt 28621 ttaaggtaag cgtcagagtg cgaatcgtca gatgaaaaat tttgagattt catatgtggt 28681 ttggggtgtg tttgttacct cctctagcat ccagctaaat tccccaaaca ttgattgcac 28741 aatcgtttgg ctaaaattat tctatgattg tcctatgctt ggcttttgat tgtgtactat 28801 agtccataaa cccttagctg caaggtatga gcaagaacaa acaaactgtc aaagcaacta 28861 atgaaggttt agcaaaagct gagaaggctt tgaagagatt gtatgaaaca cagttaggcc 28921 tggcaaacaa tttaaaaggc tcagttggtc gcagcactat ccagaagttt tttaaaggtg 28981 agaaaattca agttgataag tttaaggaga tttgtacagc actaaagctt gaatcgcact 29041 gggaagcgat cgcaggttta acagatttac ctagctcagt tgatgtacca gataataatt 29101 tatctgagtc agtcaaggaa gtagaggata acagtattga tgtagatgcc ttagtgcaag 29161 aggtacgcga gaaggtaaca ccctatatca aagaacgctg cggcacaatg cgggtgctag 29221 atatgaccca gcctattggg ttagataaca tttacactga ggtcaatatt tttgagaaaa 29281 taacaggacg cacgcgacta gggattgctg aactagaaca aaacttagac gtagagaatt 29341 ttgaccgctt tggacttggc aaagtaaccc agaagcgtgt accaggactc caagcagtag 29401 agcaacacag caagctgatg gtgttgggta aaccaggagc 
gggaaagacc acatttctga 29461 aatacttggc aatgcagtgt attgaagggg agtttcagaa gcaacgcttt ccaattttta 29521 tcacactcaa ggaatttgca gaagcacccg aaaagccaga tattctaaag tatattagcc 29581 aacgattatc tgactgtgat ttaacagatg ctgatgtgaa ggcaaagcaa ctgttaaaac 29641 aaggtaaggc attaatttta ctagatgggc tggatgaagt ccgagaagaa gataccaaga 29701 aggttctaaa tcaagttcag gaattttcct accagtttca caagaatcaa ttgatcatca 29761 tttgtcgaat tgctgctaag gaatatactt ttcaagggtt tactgaagta gaagtagcag 29821 attttgactc tgaacaaata gctacttttg ctcaaaactg gtttcaatca agaaaagacc 29881 cagtcaaagg taagcgcttc atcgagaaac taaaagaaaa taaaccaatt caagaactcg 29941 caaccaatcc tttattttta actctacttt gtttagtctt tggtgaagct ggggattttc 30001 ctgccaatcg ttctgagctt tacaccgaag gtgtagatgt gttattgaaa aaatgggatg 30061 ttaagcgtaa cattaagcga gataaagtct acaaagactt atctttacag ggcaaggaaa 30121 acttactctc gcatattgcc ttaactactt tcgagcagaa aaactatttc tttaagaaaa 30181 agacggttga agatgacatt gctgattttt tccgcaactt acctaatgct aacactgcgc 30241 caaaagcatt agaactagat agcgaagcca ttttgaaatc aattgaagtg caacatgggt 30301 tactagtagc acgggcaacg gatatctact ccttttctca cctaacattt cacgaatatt 30361 tcactgctag ggaaatcaaa gaaaggtctg cttgggcaaa attggcagag catgtcgctg 30421 aaaaacgctg gagagaggtc tttttgttga ccacaggaat ggtgcggagt gctgatgatt 30481 tattaaagtt gatgaaacac caaattgata ctcttttagc aaaagatgag aaattgcagc 30541 aatttctcat ttgggttgag caaaagtcaa agtctgtaga ggcgcgttat aaatctgctg 30601 ctattcgagc attctgcctc gacctctccg tctccccagt cctctccctc tcccgagact 30661 tatagtgggt gagatgaaaa ccctgt // LOCUS NODE_931_length_30031_cov_5.18011130031 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 30031) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 30031) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..30031 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 
complement(110..2263) /locus_tag="DP116_07935" CDS complement(110..2263) /locus_tag="DP116_07935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130613.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydroxyacylglutathione hydrolase" /protein_id="PRJNA477356:DP116_07935" /translation="MNNCMMNRTISPSTIAIIGGGLSGSLVAANIMRKATMPLFIKLI ERNQEVGRGVAYGTPFECHLLNVPAGKMSAFPDEPNHFLNWLHRNGHEEVKASTFVPR KVYGDYVQATLSEAEANAPAYVMLERIVDEAIAIRSKNHNLMVHLSSGESLYVQKAVL ALGNFPASLPKPVACVENHNDNIRDAWSSHAIADLNPEDSILLIGSGLTMVDAVVALH AKGFQGKIHAVSRHGLKPCSHKPTIPYPTFIDLETAPKTARELLHLVRQQVRTADEQG QDWRAVIDALRPVTQEIWQTLPLKEQKRFLRHVKAYWEVHRHQIAPEIADVLDAAEES GQLSYYAGRIQTCQQFDNKLTVTISERETQAKIVLQVNRIINCTGSNCNYRSLQHPLL ASLQEQHLIRPNVLSMGIDTAVNGALLDADGNASELLYTLGTPRKGNLWETTAVGEIR VQAANLAQDLLKSLNPIPYAVVENWFAPKPAMLFRQLFDKESSTYTYLIADPQTKEAI LVDPVSEQVERDIQILRELGLTLRYCLETHIHADHITATSQLKGITGCLSIMPENAQT TCADRYIADGKILQLGNIQIQAIATPGHTDSHMAYLVNNTHLLTGDALFIRGCGRTDF QNGDAGLLYDAVTQKLFTLPDDTLVYPAHDYQGQTVSTIGEEKRWNPRFAGHSRSQFM MLMNNLNLPYPKKMSEAVPANQHGGKVLVALDYQI" gene 3160..4320 /gene="ssuD" /locus_tag="DP116_07940" CDS 3160..4320 /gene="ssuD" /locus_tag="DP116_07940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308644.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkanesulfonate monooxygenase, FMNH(2)-dependent" /protein_id="PRJNA477356:DP116_07940" /translation="MQLLWFIPTHGEGRYLGTAIGGRAVNFEYWRQIAQAVDHLGFTG ALLPTGRSCEDAWVLASALVTHTKKMRFLVAIRPGLMSPGVAARMAATFDRVSGGRLL INVVTGGDPTELAGDGLHLSHDDRYKLTDEFLTVWRQIAASEVANFQGDYLNIQDGKL LFPSVQKPYPPLWFGGSSPIAQNIAAKHVDVYLTWGEPPAQVAEKIAAVRQLAEAQGR TLRFGIRLHVIVRETETQAWDAANDLIRYVDEEAIAKTQKAYARMDSEGQRRMQQLHQ GSREALEISPNLWAGIGLVRGGAGTALVGDPDTVAQRISEYADLGIETFIFSGYPHLE EAYRVAELLFPRLPLENIPVVEPQLMSPFGEIIANREFPKQQVKNKTAATVD" gene 4801..5658 /locus_tag="DP116_07945" CDS 4801..5658 /locus_tag="DP116_07945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952071.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_07945" /translation="MTKLTGKTDRLSIPVNFPNPEVILETQELTRCFNKFTAVNTLNI SVISGEVFGLLGPNGAGKSTVIKMLTTLLPPSAGRATIAGYDVTHQQGAVRRVIGYVP QALSADGSLTGYENLLIFAKLYDIPSKQRRERIRDVLAFMGLEQAGDRLVRNYSGGMI RKLEIAQSILHRPQIMFLDEPTVGLDPVARSQVWNLMQELRADYGTTIFLTTHFLEEA DSLCNRVAIMNRGKVIATGTPSDLKAALGKPNATLDDVFIHYTGDELASGVSYRDTAK TRRNAQRLG" gene 5639..6469 /locus_tag="DP116_07950" CDS 5639..6469 /locus_tag="DP116_07950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743661.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="multidrug ABC transporter permease" /protein_id="PRJNA477356:DP116_07950" /translation="MLNGWVEPRLNRRESFIYAIAELASKSLVIAELEVRKLRHDPYD LLIRGVQPALWLLIFGQVFTRTRAIPTGNLSYLDFMTPGILAQSVLFVAILTGGMTLI WERDLGIVHKLLASPIPRAAMVLGKALACGIRSLSQIVIIYGLALLLGVNLNLHPLAL LQVVVIVLLGAGCFCVFSLIIGCLVKNRERFTGIGQLLTMPLFFASNAIYPISLMPKW LQIISHINPLTYQVDALRGTMLVNGSSLYGFGLDCTILLLTLISLTIICGRLYPRVAM " gene 6489..6950 /locus_tag="DP116_07955" CDS 6489..6950 /locus_tag="DP116_07955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206653.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_07955" /translation="MKLGKPSEECAVKVMDTIPLVMRFIRADMRENSVASLSIPQLRA MLFIKRNPGTSLSEVAEHLGVTCATASTTTERLVQRNFIERTDHPQERRRVVLNLTDE GKHHLEQTLAQTRAHIADLLEGLTAEEIVHIEEGLTLLKHVFERSEVKKAP" gene 6913..8184 /locus_tag="DP116_07960" CDS 6913..8184 /locus_tag="DP116_07960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867522.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_07960" /translation="MSSSDQKLKKLLKAEAVKHDPFAALRFRDYRLFTIGRVLLFTGS QMQTVAIGWELYERTGSALALGGVGLAQVLPMIALTLIAGHVADRRDRKHTTLLSIML LVLCSLALAVVSYTKGAIVLVYTCLFFTGVARAFLKPASDALMWHLIPTTAFTNAATW NSTSFQLATVIGPSLGGFGIAALGSATGVYVLAAIASLLCFALTVLIREKKTALSKEP ISLKALAAGAEFVWQNQVILAAITLDMFAVLLGGAVALLPIFAKDILHVGPVELGYLQ AAHSIGALIMAVLLAHLPPLRKAGPALLWSVVGFGVVTIIFGLSRLFWLSLLMLALSG ALDSISVVIRHTLVQIRTPDYLRGRVAAINNVFISASNELGGFESGLTAALFGPVMSV VGGGIGTIVVVMAVAAIWPGIRKLGALQEYE" gene complement(8256..9512) /locus_tag="DP116_07965" CDS complement(8256..9512) /locus_tag="DP116_07965" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07965" /translation="MSALVFWILFGWFADYSFPYGQVIELIICCFRGTIAAKVNNLQK QLQKTDFVSQLDNQQQIQRLNKLDSQAWLCMGLLERLGANTAESNERIAKTIEALKSK KIEILDELNPRRPEKYRKLAKIQKFFEYITLSKAQKDFIQIEKIIDEVVLAVKNSQPS ETIIQESLNKLSRETARNAEKISPYRLRIMYKIGGLLNDISLKGLFNLGDSELANLNE SIINELKEQRKILSKQFNKLIQEKYVMQQELESYSETLNNVNGAIYERETELLRLREE LESYIAANRNRQNQISSLNTELNKLNQKLLESQSQKDSLDKRVNQLIQDVRRKELEIE KLTNQLAKYSQVRILEGDYIGNLSNKSSKYHFNLKCNHWKMLVGEYVLNLDGSREIIS SNSPTVFIKQGLEECDKCVERKNIDL" gene complement(9598..9954) /locus_tag="DP116_07970" CDS complement(9598..9954) /locus_tag="DP116_07970" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_07970" /translation="MLETALATETVKIVLNVLDKVTGGALEEVGVQILQYLKAKFHGT LKLDQVQKDPDLLKTAILEKAIEDQNFKSDLEQLIVKFQKLESNSAKVIQNTQSGVNI NADKSTVVGQQFFRQQ" gene complement(10178..11545) /locus_tag="DP116_07975" CDS complement(10178..11545) /locus_tag="DP116_07975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007915097.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrilotriacetate monooxygenase" /protein_id="PRJNA477356:DP116_07975" /translation="MSTKKRQLRLGAFLMSSGHHVAAWRHPDARADGGLNFQHFKQIA QTAERGKFDMIFFADGVAVRDRGRGTEALSRTSVVHFEPLTLLSALSVVTERIGLTAT VSTTYNEPFHLARKFASLDYLSGGRAGWNLVTSATVAEANNFNREKHMEHTLRYERAK EFVDVVTALWDSWEDDAFLRDKESGIYFDADKLHIPNHKGEHFSVRGPLNVARPIQGY PVIIQAGSSDDGQELAAQTAEVIFTAQQTLAEAQAFYAGVKKKLAKYGRSPDHLKIMP GVFPVIGRTSQEAKDKYEQLQELIHPQVGLGLLSGLVGGVDLSGYPLDGPLPELPETE LAKSRLKLVTDLAQRENLTIRELYLAIAGARGHRTILGTPQQIADQLEDWFVNGGADG FNIMPPYLPGGLDEFVELVIPELQRRELFRTEYEGRTLRENLGLPRPTNKFSTVAASR EQVAV" gene complement(11642..12553) /locus_tag="DP116_07980" CDS complement(11642..12553) /locus_tag="DP116_07980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TauD/TfdA family dioxygenase" /protein_id="PRJNA477356:DP116_07980" /translation="MSSQYFDIQPVAGRIGAEIVGVDLSANLSDDVISEIRKRLVQYK VIFFRNQQLDANGQVAFARRFGEVTTAHPTVPSFPGHPEVLDLNYGRTATRANNWHTD VTFVDRPPLGSILRTLVIPPSGGDTIWANTVAAYQDLPTHLRNLADELWAVHSNAYDY AEAAVDLSEETKAYRKVFTSTVYETLHPVVRVHPESGERILFIGGFVRQIKGLSTTES DDILRLLQSYVTRPENTVRWRWQVGDVAFWDNRATQHYAISDYGDQPRHVQRVTIVGD LPVGIDGKHSEAIKGDSSTYIPSLVTA" gene complement(12684..14297) /locus_tag="DP116_07985" CDS complement(12684..14297) /locus_tag="DP116_07985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amidohydrolase" /protein_id="PRJNA477356:DP116_07985" /translation="MTIALERPTKKSRSAQIREQLGYPIIDTDVHTQEFEPAFLDYLA QVGGSQIADSFRSHLPGAGRYRWFQQTWEERRTYRTARPPFWGRPTKDTLNLATVSLP KLLHERLQEAGTDFAVLYPNLATLAPQIKNEEMRRAVCRAANLYHADIFRDYSVREAR PVAQPSSAWLPQSSALGEPVLPEQARQHTGVRVTPIATIPLNTPQEGIEELEYAVKEL GLKAIQIPAYVTRVIPGFEKYPEEVQREATWIDTFALDSAYDYDPFWAKCVELKVVPT THASGMGWTARRSISNYQYNHIGHFASAGEAFCKALFFGGVTRRFPTLKFAFLEGGSV WGASLYTDIIWHWETRNPEILQNNHPGNVNKEELRELFARYGGPELSVRGASRNENRF EEIGSGLGFHGKYFAPEDPGELNEFAEAGITKPEDVRDRFLNHFYFGTESDDTRVAYA FHQKANPFGDRVKAFLGSDSGHWDVPDITAVTSNAYSFVERGILSEEDLRYFLSIHPL ELYTSLNRDFFKGTAIEKEAEEYLVSVGR" gene 15445..16128 /locus_tag="DP116_07990" CDS 15445..16128 /locus_tag="DP116_07990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864567.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_07990" /translation="MSNIELYFAKGSTFSQRTRVVLLEKGIDFTPIEIDLQNKPDKFT QVSRYGKVPAIKHGDIEIYESAIINEYLDEVFPEPPLLPHDPGAKAIARIWIDYANTR LVPAFNKFLRGKDSSEQEQGRREFLESLLYIEQEGLGKLSGDGQYWLGDKLSLVDISF YPWFERLPLLEHFRKFTLPAETARLQQWWNTLRDRPTIRAVENPVSYYIERFTKILGE PTAVGAAQK" gene 16188..17027 /locus_tag="DP116_07995" CDS 16188..17027 /locus_tag="DP116_07995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867500.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="PRJNA477356:DP116_07995" /translation="MSLSSPILSKPVLEDLPYVEAVLNYLTPMAQKPVNYTYEPPPGV PRQNGVYEAHKLPIRNARAIAQDLSLDQEGFAYAAHKSAVRDFYDEDEVRRVYYPEAE QLLADVTGAKKVLVFDHNLRNNERAKQSENGAKEPVKRVHNDFTAKSGYSRARAVLTA LGTDDPDELLQHRFSIVNVWRPIAKPVQESPLAVCDAQSIAPKDLVAGDLVYRDRIGE TYAITYNPTHRWFYFPQLQPNEALFIKCFDSAEDGRARFAAHTAFDDPTSPPDAPPRP KMG" gene complement(17012..17983) /locus_tag="DP116_08000" /pseudo CDS complement(17012..17983) /locus_tag="DP116_08000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867504.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="aldo/keto reductase" gene complement(18082..18501) /locus_tag="DP116_08005" CDS complement(18082..18501) /locus_tag="DP116_08005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743628.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_08005" /translation="MLVQLAESDSQILGCFPVISQLRPHLQQADFVEQVRYQMKEGYK LAFLQKEEQTLAVAGFRISNCLALGKFLYIDDLVVDELKQSQGYGKQLFQWLIEYAQN HQCQHLSLDSGVQRFQAHRFYLMQRMSITSHHFSMEL" gene complement(18705..18863) /locus_tag="DP116_08010" /pseudo CDS complement(18705..18863) /locus_tag="DP116_08010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320834.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="dienelactone hydrolase family protein" gene 18985..19251 /locus_tag="DP116_08015" CDS 18985..19251 /locus_tag="DP116_08015" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08015" /translation="MQPIIYRVSATDVPLLHEILIACGLDLQVRFGLTHWIPPIYPLE NMLKDAEKLEVYALKVGESLVGTFTLEFASKVPLSYIKYGKIHW" gene 19252..19518 /locus_tag="DP116_08020" CDS 19252..19518 /locus_tag="DP116_08020" /inference="COORDINATES: protein motif:HMM:PF00583.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08020" /translation="MSDVPAVYVHKLAVLPERQGQGLGTWCLGTIEKLALTNGYLTVR LDAVKTYKKLLSFYASRGYLRVGELIFNSDVWVDAFVFEKVLFK" gene 19550..19930 /locus_tag="DP116_08025" CDS 19550..19930 /locus_tag="DP116_08025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128567.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bleomycin resistance protein" /protein_id="PRJNA477356:DP116_08025" /translation="MWLDAIDHIQVTSSPEAEDAMLFFYGKVLGLTEIPKPETIKANG GAWYVLGNIQIHVSTEKKPDNAASRRHICYLVSDLQAFQKHLRSHSVEIIPDQQPIPG HARFFLRDPAGNRIEIAEKHTILS" gene 19950..20666 /locus_tag="DP116_08030" CDS 19950..20666 /locus_tag="DP116_08030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206130.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08030" /translation="MIFDQVEFDLRCEWGAKGVSELAPISDVIVIVDVLSFSTSTEIA TNNGAIIYPYQWRDQSALDYAQSVQAELSKGRLSKDGYSLSPASLTKIPAGTKLVVPS PNGSSLTLLTLNTPTIAGCLRNSEAVAKFAQRYGSRIAVIPAGEKWEDGTLRPAFEDL IGAGAILSYLNGNLSPEAETAVVAFHAFKHDLLTYLKQCSSGKELIAKGFELDVELAA AFNVSDCVPLFNQNAYIRQK" gene complement(20712..21041) /locus_tag="DP116_08035" CDS complement(20712..21041) /locus_tag="DP116_08035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749795.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="divalent-cation tolerance protein CutA" /protein_id="PRJNA477356:DP116_08035" /translation="MKLYYITLNNSDEARHIGRALLEQKLAVCVNWFPITCAYIWKGE ITEEPEVVLIVKTQSGYREQIEEVIRQHINYTNFIAEISPTEINNTFLEWLNAEVPLP LKKNYSD" gene 21154..21912 /locus_tag="DP116_08040" CDS 21154..21912 /locus_tag="DP116_08040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009343406.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" /protein_id="PRJNA477356:DP116_08040" /translation="MMFHTLLLFITAFIAGGLNAVAGGGSFITFPVLIFTGVPPITAN ATNNTALWVAALASAGAYRQNLSIPRRQFFLLCGISLVGGVLGSVALLYTSPDVFQKL IPYLLLLATLVFTFGEPLKTWFQRQSQKSSESPPLLNLMLAQLAIAIYGGFFGAGLGI LMLATLTFLGIKNIHTMNAFKTFLGSCINGIAIIPFIFAGVIAWHQAILMAVGGSLGG YLCANYARRLEPLLIRRVVMVVAFSMTIYFFIHG" gene 21945..22130 /locus_tag="DP116_08045" CDS 21945..22130 /locus_tag="DP116_08045" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08045" /translation="MEVKCTLDAKETRESILDTLAPAGAPKALTSSQIEDLRLAASKM NGVERRAFINRNYSIYG" gene 22123..22674 /locus_tag="DP116_08050" CDS 22123..22674 /locus_tag="DP116_08050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyridoxamine 5'-phosphate oxidase family protein" /protein_id="PRJNA477356:DP116_08050" /translation="MAKLFESITEELQEFIEAQHLFFVGSAPLSPTGHVNLSPKGLGG FRVLSPHRVGYLDVTGSGNETSAHLQENGRITFMFCAFAEPPSILRLYGKGYTVIPSS PEWETLYPLFSEIPGARQIIVADISRVQTSCGFGVPLYEYKGQRETLVKWAKKKGEAG LKEYHQQKNLVSIDGLATPLSKS" gene 22899..23407 /locus_tag="DP116_08055" /pseudo CDS 22899..23407 /locus_tag="DP116_08055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016953650.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 23329..23487 /locus_tag="DP116_08060" CDS 23329..23487 /locus_tag="DP116_08060" /inference="COORDINATES: protein motif:HMM:PF07592.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08060" /translation="MAQTDDIFNNIKEFDQQATSTLVKRLSMDCKATVNIGDYSRGEK TRGDNGIF" gene 23571..24245 /locus_tag="DP116_08065" CDS 23571..24245 /locus_tag="DP116_08065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126733.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="PRJNA477356:DP116_08065" /translation="MTQEQWTAVDRYITDLFVPPDPALDATLQTSAAAGLPPHNVSPN QGKLLLLLARVQGARTILEIGTLGGYSTIWLARSLPADGRLITLEANPKHAEVARANI AHAGLSDVVELRLGRALDTLPQLVAEGRDPFDLIFIDADKPSNPDYFAWALKLSRRGT LIVADNVVRNGAVVDAKSGDPSVQGVRRFNELLAESPHVSATAIQTVGSKGYDGFAIA IVTTEQ" gene complement(24282..24938) /locus_tag="DP116_08070" CDS complement(24282..24938) /locus_tag="DP116_08070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864564.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_08070" /translation="MPKDWLEWHDLYNTEPKLQQRLQIVREYISHSLDASPPGIIRVV SVCAGDGRDLLGTLASHPRAKDVHARLVEINPQLVERGRASIESLGLAQQIEFINGDA TISSNYVGAVPADIVIVCGIFGNLADEAELNRLLGNLSFLSKQGAFVIWTRGHSNGIP YSETVRRFLRESGFEEVNFKLTATGDMGVGLHRYRGENLPTPKEQQLFVFSGVANKAR " gene 25182..26234 /locus_tag="DP116_08075" CDS 25182..26234 /locus_tag="DP116_08075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206811.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="threonine aldolase" /protein_id="PRJNA477356:DP116_08075" /translation="MNFNLEQFASDNNSGICPEALEYMMKANQGSAPAYGNDEWTSLA ADYFRDLFEIDCEVFFVFNGTAANSLSLAALCQSYHSVICHENAHIETDECGAPEFAS NGSKLLLAKGENGKLTPEAIEAIVNKRADIHYPKPKVISLTQSTELGTLYSIDELVAI KSVAQKYNLKIHMDGARFANAVVAMNKSPAEITWKSGVDVLCFCGTKNGMALGEAIIF FNKALAEDFAYRCKQAGQLASKMRFISAPWLGLLETGAWFKNARHANQCAEYLENQLL KIEGVEMMFPREANAVFVKLPEQVITSLREKNWQFYTFIGVGGVRFMCSWNTTQARID ELVSDIKEAICAPRSS" gene 26268..27299 /locus_tag="DP116_08080" CDS 26268..27299 /locus_tag="DP116_08080" /inference="COORDINATES: protein motif:HMM:PF13489.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_08080" /translation="MPLFSPQQDLKFRTNIMSDSSRLQSLLSDEALTHYENGREAHRL SKGVGQLELARTQELLSRYLPPPPAVIFDVGGGNGIYAFWLAQQGYEVHLIDAVPLHI EQAQIYSQTQRAHPLASIAVGDARQLNRADASVDAVVLLGLLYHLIERSDRIAALRET HRILKNGGLVFAVGISRFASTLDGLFRGYLDDPEFVAIVQRDLAEGQHRNPSNHPAYF TTAFFHHPEELKAEVEEAGLSCENILAIEGSGWLLQNFEEHWSQPSRRERLLQSIRWL ETEPSTLGMSAHIMAIALKTEPVGDWLPELRSGEALRASGTGRATRSLKKRQSSRFCQ KRGSKYHQT" gene 27340..27696 /locus_tag="DP116_08085" CDS 27340..27696 /locus_tag="DP116_08085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408593.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4,5-dioxygenase" /protein_id="PRJNA477356:DP116_08085" /translation="MKEDTIEITGFHAHVYFDTASRDTAARVREGLGARFDVRLGRWH EQPISPHPKPMYQVAFSPDQFSQVVPWLMLNHEGLDILIHPSTGDDVQDHTEHSLWLG EKLELNIEFLRQIRTT" gene 28067..28936 /locus_tag="DP116_08090" CDS 28067..28936 /locus_tag="DP116_08090" /inference="COORDINATES: protein motif:HMM:PF04378.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08090" /translation="MSYRHFGRIGDIWKHIPLCEFLAIEQPISYIETNSASPEYQLTG SFEQQYGILHIEKNIKNSQLIRQSVYWKILSSLSENKNGLSKYLGSPALALNILKAST NKFVFFDIEEMCLKQIAVFVKKLNINSKITYKNQDSVDGLIGMLEELGTRDFIHVDPY FIHHTNANEHSYFDAFCLAMRKGVMGMLWYGFNTIKEREVLHNVFSSQTNTSSQTRLQ GIEIASILLNKNILDVNPGVLGCGILIGNLSEDSRQSFKEMAQEVINLYRDSTMFDKY PGELKLKEFEIQI" gene complement(29129..29755) /locus_tag="DP116_08095" /pseudo CDS complement(29129..29755) /locus_tag="DP116_08095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410069.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="MFS transporter" BASE COUNT 8379 a 6470 c 6525 g 8657 t ORIGIN 1 atttatgact gctatagcag ttctaaatta taagctctac gggcacgcta ctttgaacgt 61 gagaaacaag ctccgcgact tcgagcgaaa agtcggggaa ctttgttttt caaatttgat 121 agtccaatgc aactaaaact ttgccaccat gttgatttgc aggtacagct tctgacattt 181 ttttaggata gggcaggttc aagttattca ttaacatcat gaactggctg cggctgtgtc 241 cagcaaatcg gggattccag cgcttttctt caccaatcgt agacactgtt tgtccctggt 301 agtcgtgagc gggatatacc aaggtatcat ctggtaaggt gaatagcttt tgggtaacag 361 cgtcatacaa caatccagca tcaccgtttt ggaagtcggt acgaccgcag ccccggatga 421 atagcgcatc tccggttaac aagtgagtat tattaaccag gtaagccata tgactgtcgg 481 tgtgtccagg agtcgcgatc gcctgaattt gtatattgcc cagttgcaat atctttccat 541 cagcaatata cctgtctgca caagtcgtct gggcattctc aggcatgata cttagacaac 601 ctgttatacc cttaagttga cttgtcgcag tgatgtgatc agcatggatg tgagtttcca 661 gacagtagcg cagggttagc cccaattctc gtaatatttg aatgtcacgt tcaacttgct 721 ctgatacagg atctacgaga atagcttctt tggtttgtgg atcggcaatc aagtatgtgt 781 aggtactcga ctctttatca aacagttgac gaaatagcat ggctggtttc ggagcaaacc 841 aattttcaac aacagcataa ggtatgggat tgagagattt cagtaaatcc tgtgccaaat 901 tcgcggcttg aacacgaatt tccccaacag cagttgtttc ccaaagattg cccttgcgcg 961 
gtgtccctag tgtatacagt agttcagaag cattcccatc tgcatccagc agcgcaccgt 1021 tgacagcagt gtcaattccc attgatagaa cattaggacg aatcaggtgt tgttcttgaa 1081 ggctggcgag taatggatgc tgtaaagagc gataattaca gtttgagcct gtgcagttaa 1141 taattcggtt aacctgtaaa acaatctttg cctgtgtttc tcgttcgcta atggtcacag 1201 ttaatttatt atcaaactgc tgacaagttt gaatccgtcc tgcatagtag cttagttgac 1261 cagattcctc agcagcatcc agtacatcgg caatttctgg ggcaatttgg tggcggtgaa 1321 cttcccaata agccttgacg tggcgtagaa atcttttttg ttcttttaag ggaagtgttt 1381 gccaaatttc ttgagtaacc ggacgaagtg catcgatgac tgcccgccag tcttgtcctt 1441 gctcatccgc tgttcgcact tgctgacgta ctaaatgcag gagttctcgt gcagtttttg 1501 gtgcagtctc taagtcaata aatgtgggat aaggaattgt tggtttgtga ctacacggct 1561 tcaatccatg acgagaaaca gcatggattt ttccttgaaa tcctttggcg tgcaaagcaa 1621 caactgcatc taccatcgtc agcccgcttc caatcaacag aatagaatct tctgggttta 1681 aatcggcgat cgcatgactc gaccaagcat ctctaatatt gtcattgtgg ttttccacac 1741 aagcaactgg ttttggtaaa gatgcgggga aattacccaa cgccagcaca gccttttgaa 1801 cgtacaaaga ctcaccgcta ctcaggtgta ccatgaggtt atggtttttg ctacgaatgg 1861 cgattgcttc atcaacaatt cgctctaaca ttacataagc tggggcgttt gcctctgctt 1921 cgctcaaagt cgcctggaca taatccccat aaactttacg tggaacgaag gtggatgcct 1981 ttacctcttc atgtccgttt ctatgcaacc agttcaaaaa gtgatttggt tcatcaggaa 2041 acgcactcat cttacctgcc ggaacgttaa gcaaatgaca ttcaaatggc gtaccgtaag 2101 caactcctct acccacttct tgattgcgtt caattaactt aatgaacagt ggcatggttg 2161 cttttcgcat gatatttgct gccaccagag aaccacttaa gccgccacca atgatagcga 2221 tcgtagaagg agaaatagtt cggttcatca tacagttatt cataggatgt aagacttact 2281 tattgacaga acttctgttg gtgagttcaa gcattcgtga tttatcgaaa ctcagtaaag 2341 cttagatacg tatgcttaat ttttctttaa taagttgaag tcaaattaac ttctacagtt 2401 tgggaatata aacatcaaac aagctagtca aatctatgaa ctatgctgtt tgctgtatat 2461 ctacaaattt ataacagata aatacttttg tgtatttgtt tggcacaaca catctgaaat 2521 tattttaaac tccacttcta cacgtgaaaa acagggaatt aaatttgagt tttttatttc 2581 gcaacttgag acaagatgag acaaaaaata actgagacta gataagcctt ttgggctttt 2641 acatagttta 
atttcgacaa atccgatgta aatatcgtgg tttgaaaata atccttgatc 2701 cactcctcta gaatagaaaa attacctatt taattgtcaa aaatttctga catctagcca 2761 gcaattttat atgtaaaaat aaaaataatt gagaaacgta gacacgaaat ggctcgccgc 2821 aggctatcac agaggcgctc caaaacccag aggaggagga aacacagagt ttcaccctct 2881 tattatttag gattgatata gataacttat aagagtcttt tggtttttgg tttttggttg 2941 ctaatatata tccgtaaaac tttgtctaaa tcacacttaa gaatattgca taaacggaat 3001 aaatgcgata ggcagaatca agtgtaagat ttgagtgaaa aatgttgctt tactcgttta 3061 gctgtgcaat aattttaaga tggtaaaaat acggcaatgt aacaaagtta ccgttaaata 3121 taataaaatc caattttgca gccagattga atagggatta tgcaactact ttggtttatt 3181 cccacacacg gagaagggcg ctatctcggc actgctatag gcgggcgggc agtaaatttt 3241 gagtattggc ggcaaattgc tcaagcagtg gatcacttgg gctttacagg tgctttatta 3301 cctacagggc gttcttgtga agatgcttgg gttttggcat cagcgctggt aacgcatact 3361 aaaaaaatgc ggtttttggt ggcaattcgt ccggggttga tgtcaccagg agtagcagcg 3421 cggatggctg cgacgtttga tcgcgtttct ggtgggcgct tgttgattaa cgtggttaca 3481 gggggcgatc ctacagagtt ggcgggagat ggtttgcacc tttcccatga tgatcgctac 3541 aaactaacag acgaattttt aacggtgtgg cggcagatag ctgcaagtga agttgcaaat 3601 ttccaaggtg actatctcaa tatccaagat ggcaagttac tttttccatc tgtacagaaa 3661 ccttatcctc ctttgtggtt tggcggttct tcacccattg ctcaaaatat tgccgccaag 3721 cacgtagatg tgtacttgac ttggggtgaa ccaccagcac aggttgccga aaaaattgcc 3781 gcagttcgtc agctagcaga agcgcaaggc agaacacttc gctttggcat tcgcctacac 3841 gtcattgtgc gggaaaccga aactcaagct tgggatgcgg caaatgattt aattcggtat 3901 gtagatgaag aggcgatcgc caaaactcaa aaagcttacg cccgcatgga ttcggaaggg 3961 caacgccgga tgcaacaatt gcatcaaggt agtcgcgaag ctttggaaat tagcccaaat 4021 ttgtgggcag gaattggttt agtgcgcggt ggtgctggga ctgctttggt aggcgatcct 4081 gatacggttg cccagagaat atcggagtat gcagatttgg ggattgagac tttcattttt 4141 tctggttatc ctcatttaga ggaagcgtat cgcgtcgccg aattactctt tcctcgtctg 4201 cctttagaaa atatacctgt tgtagaacca cagctgatga gtccgtttgg tgagataatt 4261 gccaatcgag aattcccaaa acaacaagtt aaaaataaaa cagcagctac cgtagattaa 4321 ctgaaaaatt aatagcagga 
gagtttgaca aaattgctta tactttagcg atcgcctatc 4381 tctatctgac ttttcacggt atctggaaaa cctcgcttcc atttctctct cctacgagga 4441 gagaggaatg gaattttctc cccctttcct acgaggaaag gaggttagac cggttaggtc 4501 actccgtttt tccacatgac gtgaaaagtc agatttctat tggtgagaaa ctactttcat 4561 tgaaaattac tgtaagtatt gatagtcaca gccccgactt ttataataag tcggggctgt 4621 gaaaatttga aatctaagta tttgttacag gtaatatctc cgtatataca tctttaaagt 4681 tatctcattc cacagcaatt gtctgattgt aaaaaaaact aactcttgac ttccacacaa 4741 tacatgagaa actaataatt aggcaatcta aatataccac taaggaagtg catcctactc 4801 gtgacaaaac tgacgggaaa aaccgatcgc ttgtccatac ctgttaactt tccaaaccct 4861 gaggtcatat tagaaactca ggaactcacg cgctgcttta acaagttcac cgctgttaat 4921 accctgaata tctctgtcat atctggagaa gtatttggct tgctaggtcc aaatggggca 4981 ggtaaaagta cagtcattaa gatgttgaca acgctgctac cgccaagcgc tggacgggca 5041 actatagctg gctatgatgt tactcatcag caaggtgctg ttagaagagt cattggctat 5101 gtaccccaag ctctttctgc tgatgggagt cttacaggct atgaaaatct tttaatcttt 5161 gccaaactgt acgacattcc ctctaaacaa cgcagagagc gcattcgtga tgtgctggcg 5221 tttatgggtt tggaacaagc aggcgatcgc ctagtgagaa attactctgg tggcatgatt 5281 cgcaagctag aaattgctca atccatcctg catcgaccgc aaattatgtt tctcgatgag 5341 ccaacagtcg gactcgatcc ggttgctcgc agtcaggtat ggaatctcat gcaagaactt 5401 cgtgcagatt acggcacaac catattttta actacccatt ttttagaaga agctgatagt 5461 ttgtgtaacc gggtagcaat tatgaatcgg ggtaaagtga ttgcgactgg cacacccagc 5521 gatttaaaag ctgctttagg aaaaccaaac gccaccttgg atgatgtctt tattcactat 5581 acaggggacg aattagcatc aggagttagt tatcgtgaca cagcaaaaac cagacgtaat 5641 gctcaacggt tgggttgaac cacgacttaa tcggcgagaa agttttattt atgctatcgc 5701 agaattagcc agcaaatctc tagtcatagc tgaactagaa gtgcgtaaac tccgccacga 5761 tccctatgat ttactgatac gcggggtaca gcctgcgttg tggctgttaa tcttcgggca 5821 agtttttacc cgcactcgcg ctatccctac agggaactta tcctatttag actttatgac 5881 tcccggtatt ttagctcaga gcgtgttatt tgtagcaatt ttgactggtg gcatgacgct 5941 gatttgggag cgagatttag gaattgtgca taaattgctt gctagtccta taccccgtgc 6001 ggcgatggta ttaggaaaag ccctagcttg 
tggaatccga agtttatcac agatagtgat 6061 tatttatgga ttagcgctac tattaggtgt taacctgaat ctccatccgt tagcattact 6121 gcaagtagtg gtaattgtac ttttaggggc aggttgtttt tgtgtttttt cactcatcat 6181 tggctgtttg gtaaaaaacc gagaacgatt tacggggata gggcaattgt taacaatgcc 6241 tttgtttttt gccagtaatg ccatctatcc catctccctg atgccaaaat ggttgcagat 6301 aatttcccac atcaatccct tgacttatca agttgatgct ttacggggta caatgctagt 6361 aaatggctcc agtctctatg gatttggtct ggattgtaca attctcttgc taacattaat 6421 aagcttaaca attatctgtg gacgacttta tccacgggta gcgatgtaat caggagaaaa 6481 acagtcccat gaagctcggt aaaccctctg aagaatgtgc cgttaaggta atggatacga 6541 ttccattggt gatgcggttt atccgagcgg atatgcgtga gaacagtgtc gcatctctat 6601 ctataccgca gttacgggca atgctattta tcaaacgcaa tcctggaacc tctctttcgg 6661 aagttgcgga acatttaggt gtcacttgtg ctactgcatc cacaacaaca gaacgcttag 6721 tacaacgtaa ttttatcgaa cgtaccgacc atccccaaga gcgacggcgg gtggttctca 6781 atctcacaga tgagggcaaa caccatcttg agcaaaccct cgcccaaact cgcgctcata 6841 ttgcagactt gttagaaggt ctgacagcag aggaaattgt acacattgaa gaagggttga 6901 ctctacttaa acatgtcttc gagcgatcag aagttaaaaa agctccttaa ggctgaggct 6961 gtaaaacacg atccctttgc agctttgagg tttcgagatt atcgattatt cacgattggg 7021 cgcgtacttt tgttcacggg ttcacaaatg cagactgtgg caattggctg ggaactctac 7081 gagcgtactg gttcagcgct ggcgttaggt ggggtggggc tggcgcaagt cctgccgatg 7141 attgccttaa ctttgattgc tggacacgta gcagatcggc gcgatcgcaa acacactacc 7201 ctactctcaa tcatgctgct agtcctttgc tcgctagctt tggcagttgt ttcctatact 7261 aagggcgcaa tagttttagt ttatacttgc ttgttcttta caggtgtagc tagggcgttc 7321 ttgaagcctg ccagcgatgc gctaatgtgg catttaatac ctacgactgc ttttactaat 7381 gctgccactt ggaatagtac tagttttcag ttagcaacag tcattggacc aagtttggga 7441 ggatttggga ttgcagcttt gggaagtgcg acaggggtat atgtgttagc agcgatcgca 7501 tcacttttgt gttttgcttt aacagtgcta attagagaaa aaaagacagc cctctccaag 7561 gaaccaatat cgttaaaagc actagctgct ggtgctgagt ttgtctggca gaatcaagta 7621 attttagcgg caattactct agatatgttt gccgtcttgt tgggaggtgc agttgcacta 7681 ctacccatct ttgccaagga tatcttgcat gtcggtccag 
tggagttggg gtatctacag 7741 gcagcacact cgattggcgc actgattatg gcggtacttc tagcgcatct gccaccttta 7801 cgcaaagcag gaccagcttt actgtggtct gtggtaggtt ttggtgtggt cacgattatt 7861 tttgggttgt ctcgtttgtt ttggctgtca ctgctgatgt tggcattgag tggcgcacta 7921 gacagtatta gcgttgtcat tcgccatacc ttagttcaga ttcggactcc tgactattta 7981 cgcggtcgag tggctgccat caataatgtg tttatcagcg cctcgaatga gttgggagga 8041 tttgaatcag gtttgactgc tgctttgttt ggtccagtca tgtctgtggt tggcggtgga 8101 attgggacga tagtcgtggt gatggcggtg gctgcgattt ggccagggat tcggaagttg 8161 ggggcgttgc aggagtatga gtaaaaaatg acttgggttg agaagtaaaa tcaaagattc 8221 tattcttttt tggtttcact attgttcaaa ccaacctaca aatctatgtt ctttctttct 8281 acacacttat cacactcttc aagtccttgt ttaataaaga ctgtaggact attactgcta 8341 attatttctc gtgaaccatc tagatttaac acatattctc ctacaagcat cttccaatga 8401 ttacatttta ggttaaaatg gtatttacta cttttattgc ttaaattacc aatgtaatca 8461 ccttctaaaa ttctgacctg agagtattta gctagctgat tagtcaactt ttctatttca 8521 agttcttttc gacgaacatc ttgaattagt tggtttacac gtttatctaa agaatctttt 8581 tggctttgag attccaatag tttttgatta agtttattta attctgtatt cagactgcta 8641 atttgatttt gtctatttct atttgcagca atataacttt caagttcttc ccgaagtctt 8701 aaaagttctg tttcacgttc gtatatcgct ccattaacat tatttaaagt ttccgaatag 8761 ctttccaact cttgttgcat tacatatttt tcttggatta atttattaaa ttgtttagag 8821 agaatctttc tctgctcttt tagctcattg atgatactct cattgagatt agctaattca 8881 gaatctccaa gattaaataa tcctttgagc gatatatcat taagcagacc tccaatttta 8941 tacataatcc tcaatctata aggactaatt ttttctgcat ttctagctgt ttctctagat 9001 aatttattca gtgactcttg aattattgtt tccgatggtt gactattttt gacagccaaa 9061 acaacttcat ctattatttt ttcaatttgt ataaagtctt tttgtgcctt gcttaaagta 9121 atatattcaa aaaatttctg aattttagct agtttacgat atttttctgg tctacgagga 9181 tttagttcat ctaaaatttc aatctttttc gactttaaag cttctatggt ctttgcaatg 9241 cgctcattgc tttcagcggt attggctcct aaacgctcta aaagtcccat acataaccat 9301 gcttgggaat caagtttatt taatctttgt atttgttgct gattatctag ttgagagaca 9361 aaatcagtct tttgtaactg cttttgtaag ttattaacct tagctgcaat 
cgtaccccta 9421 aagcagcata taatcagttc aataacttga ccataaggaa atgaatagtc agcaaaccaa 9481 ccgaaaagaa tccagaagac aagtgccgac acaagaaaaa gtataacctt aaaatagctt 9541 aaaagttttc ctaataaact caaaggctct aagtctgtat ctaaaagact aaagtaatta 9601 ttgttgacga aagaattgct gacctacaac tgtgctcttg tcagcgttga tgttaactcc 9661 agattgggta ttttgaataa ctttagcact attactttcc aatttttgga acttcactat 9721 aagttgctct aaatcactct taaagttctg atcctctatc gctttctcta aaattgcggt 9781 tttcagcaag tctggatctt tttggacttg atctaacttc aatgtcccgt gaaattttgc 9841 ttttaaatac tgaagtattt gcacacctac ttcttccaga gcgcctcctg tcactttatc 9901 caatacatta agcactatct tgactgtttc ggtagctaat gcagtttcca acatacagtg 9961 gtagccctga gtgtactctt atttttatac cataattaca gtaatttacc aatcctatct 10021 tctctgtcta gaaggttagg ttgggtttga ggaactaaac ccaacctact actcattacg 10081 gaacagacaa acggaaaatt aaagatttaa aacccttgtc ttgcgggttt tgtatgtata 10141 gccgcgactt ccagtcgcca actacaagat accagaacta gactgccacc tgctcacgag 10201 aagctgcaac tgtgctgaac ttgtttgtcg gacggggcaa tccgagattc tcgcgcaggg 10261 ttcgaccttc atactcagtc cggaacagtt cccggcgttg cagttcagga atcaccagtt 10321 ctacaaactc atccaaacca cctggtaaat agggcggcat aatattaaac ccatctgccc 10381 cgccgttgac aaaccaatct tcaagctgat cggcaatctg ttggggtgtt cctaatattg 10441 ttcgatgtcc ccgcgcacca gcgatcgcca aatacaactc ccgaatagtt aaattctctc 10501 tttgggcgag gtcggtcacc agcttcaagc ggctttttgc aagttcggtc tctggtagtt 10561 ccggtagcgg accatccaga gggtagccgg ataaatcgac tccacctacc aaccctgaga 10621 gtaaacccaa tccaacctgc ggatggatca actcctgaag ctgctcatat ttgtcctttg 10681 cttcctggga agttctgcca attactggga acacacccgg cataatcttg agatggtcgg 10741 gagaacgtcc gtatttagcc agttttttct tcacacccgc atagaaagct tgagcttccg 10801 caagcgtttg ctgggcggta aaaatcacct ccgctgtctg tgctgcaagt tcttgcccgt 10861 catcggaaga cccagcttga atgatcaccg gatacccttg gattggacgt gctacattca 10921 acggaccgcg caccgaaaaa tgctcgccct tgtggttggg aatgtgcaac ttgtcagcat 10981 cgaagtaaat ccctgactcc ttatcgcgta aaaaggcatc gtcttcccaa ctatcccaaa 11041 gcgctgttac gacatccaca aactctttgg cacgctcata 
gcgcagtgta tgctccatgt 11101 gcttttcacg gttgaagttg ttcgcttccg caacagttgc ggaggtgacg agattccaac 11161 cagcacgacc accactaaga taatctaatg aagcaaactt gcgggcgagg tgaaaaggtt 11221 cgttgtatgt agttgatacc gtcgccgtca acccaatgcg ctcggttact acagacaaag 11281 ctgataataa ggtgaggggt tcaaagtgta caaccgaggt gcggctcaaa gcctcagttc 11341 ctctaccgcg atcgcgcaca gccacaccat cagcgaagaa gatcatgtca aattttcctc 11401 gttctgccgt ctgcgcgatt tgcttaaaat gctggaagtt caaaccgcca tctgctcgtg 11461 catccgggtg tcgccacgcc gctacgtgat gaccagaact catcaagaat gcacccagtc 11521 tcagttgtct tttttttgta ctcatcttcc ttttccatct tcaactgcat acactaaagg 11581 gctgcaagat ccccgacttc gcaaaagttg tcggggatct gaccaaccta acccctgata 11641 ctcaagcagt gaccaaactt ggaatgtatg tagaagagtc ccccttgatt gcctcactat 11701 gtttaccgtc aatgccaact gggaggtcgc cgacgatggt tactcgctga acgtggcggg 11761 gttggtcgcc gtaatcggaa atcgcataat gttgagtagc gcggttatcc cagaatgcca 11821 cgtcaccaac ttgccaacgc caacgaactg tattttccgg acgtgtcaca tatgactgca 11881 acagtcggag aatgtcgtcc gattcagtcg ttgatagtcc cttgatctgg cgaacaaaac 11941 caccaatgaa tagtatgcgt tctccagatt ccggatggac gcgcactact ggatgtagag 12001 tttcgtatac agtcgatgtg aagactttcc ggtaagcctt agtctcttca gaaaggtcta 12061 ctgcggcttc tgcatagtca taggcattac tatgtacagc ccaaagttcg tcagcgagat 12121 tacgcagatg tgttggtaaa tcttggtatg cagcgaccgt gtttgcccag atagtatcgc 12181 ctcctgatgg cggaataaca agcgtccgta aaatagagcc gagtggtggg cggtctacaa 12241 atgtgacatc agtatgccag ttattcgcac gagtggcggt tcgaccgtaa ttcaggtcaa 12301 ggacttctgg gtgtcctggg aacgatggta ctgtggggtg agctgtggta acttcaccaa 12361 atcggcgagc aaaggcgact tgaccgttgg catcgagttg ctggttacgg aagaaaatta 12421 ctttgtattg aactagacgc ttgcggattt cactgataac atcatcgctg aggttagcac 12481 tcaagtcaac acctacaatc tctgcaccga tacgtcctgc aactggttgg atgtcaaagt 12541 attgagaact catgtttttg tgtctccaag ttttttaacg ttatttttga ctgattgatg 12601 gaggattaat tactgcgtat tgttcaggtg tcagcatggc ttcctgaata ttgagaggtt 12661 taggaagcaa acaaattcct tccttagcga ccaacactca ccagatattc ttcagcttct 12721 ttctcaattg ccgtaccctt 
gaagaagtca cgattgagac tggtgtacaa ttccaaagga 12781 tggatagata gaaagtaacg caagtcttct tcgctgagaa tgccgcgttc aacaaagcta 12841 taagcgttgg aggtgacagc tgtgatatct ggcacatccc agtgaccgga atcagaaccc 12901 aagaaggctt ttactctgtc accaaatgga ttcgcttttt gatggaaggc ataggcaacg 12961 cgggtgtcgt ctgattctgt gccaaagtag aagtgattta agaagcgatc acgcacatct 13021 tccggttttg taattccagc ctcagcaaat tcattcaact cacctgggtc ttctggtgca 13081 aagtatttac cgtggaagcc taaaccacta ccaatttcct caaagcgatt ctcgttccga 13141 gacgctccgc gaacgctcaa ttctggacca ccataacgag cgaacagttc gcgcaattct 13201 tccttattta cgttgcccgg atgattgttc tgtaaaattt ccggattgcg agtttcccag 13261 tgccagatga tatcagtata aaggctagca ccccaaacag aaccaccttc taggaaagca 13321 aacttcaagg tggggaaacg gcgggtaaca ccaccaaaaa acagtgcttt gcagaaggct 13381 tcacccgctg aggcaaagtg tccaatatga ttgtattggt aattagaaat tgagcggcga 13441 gcagtccaac ccataccaga agcatgggtg gtcgggacaa ctttcaattc cacgcacttt 13501 gcccagaagg gatcgtaatc ataggcgcta tccaaggcga aggtatcaat ccaggttgct 13561 tcccgttgca cttcttcagg atatttctca aagccaggaa tgacgcgagt gacataagca 13621 ggaatttgaa ttgctttgag tcccagttct ttgactgcat attctaattc ctcgattcct 13681 tcttggggtg tatttagagg aattgtcgca atgggagtga cgcgaacgcc cgtatgttgc 13741 cgagcctgtt ctggaagaac aggctccccc aaagcggagc tttggggcaa ccaagcggag 13801 cttggttggg caacagggcg cgcttcgcga acgctataat cacggaatat atctgcatga 13861 taaaggttcg ccgcacggca aacagcccgc cgcatttctt cgttcttgat ttgcggtgct 13921 agcgttgcca aattcggata cagcacagca aagtctgtcc ccgcttcttg taagcgttca 13981 tgcagcagct ttggcaaact cacggtagct aaatttaagg tgtccttggt gggacgaccc 14041 cagaagggag gacgagctgt acgataagta cggcgttctt cccaagtttg ctggaaccaa 14101 cgataacgtc cagcaccagg tagatgcgat cggaagctat cagcaatctg agaaccaccc 14161 acttgagcta aataatcgag aaaggctggc tcaaattcct gggtatggac atcagtatca 14221 atgatgggat aacctagttg ctcccgaatc tgtgctgatc gagacttctt tgtgggacgt 14281 tcaagcgcaa ttgtcatgat aatttctcca cagttatgtc actcgttgac tttcacaagt 14341 cgtaaaccct gacgaaaatc gagcttcatg atcgagcaaa aattgaaagt ctggcatggg 14401 
ataacttttt accatgcctg tatatactca caggattttc tggaggaaaa ttcaatactg 14461 tttttgtgcc agcgataccc gaaaaaaacg ggcattgcaa gattgctcaa agtgtctgag 14521 tgattcctct gtgcaagtat tgcacaaagc atttttttac ggcttgtcta ccggactatc 14581 ggatattaat tcagtagtat gacacaccaa gagcgacaag acaagtcttt agcaaacaaa 14641 ctatacgttt ctctaagttt ttgattgact aacttttaga taaaataaat aagaacccaa 14701 ggaggctttc atcacttttc tcaggcaatc tctatatctt tggaatacac tatcagtata 14761 taatactgta aacagacaga catacagtat tatgtagctt ggctggtgta gatacctgcc 14821 agggagaatt ccaaatttcc atagttgacg tactcctcgg cataaatgca cgaggattct 14881 taaaggttgc tactccgaaa gctaaagctt tactccgtag cacttcttct tcctgtttca 14941 ttgactctta actaggcttt gccccatgac tccacccccg ccagaccgtc tcaacaagca 15001 ttttagtttt acatccagga agcgtccctc cccagatgga tctgaatttt attgggctaa 15061 tttctctgca atttctaatt gcttcgtatt acaggtcgcg tcaccattac atgctctagg 15121 gtgatatttg tctcgacaag cccaaacaag gctgagatac ctttaatctg tgggttgttt 15181 accacttgta ataagagtat acgctatctt gtccatacat tctggaaaaa aagaaacgtc 15241 gggttaaaac ccgtcgtgtc gtatttcatc ccctttctaa agaaaagggt tttcatcatc 15301 ccgctcgcag cctgataaat tcgggaggcg ttgcaatcct ataagcaaat tctcccaatt 15361 tctttaagca atacttcgcc ctgcgccaca cccgacgagt gcaactccat aacataagcg 15421 caatatttac aaggaaacca ttccatgagc aacatagaac tttacttcgc caaaggctct 15481 accttctccc aacgaacccg tgttgttttg ctggaaaaag gaattgactt tactcccatt 15541 gaaattgact tacagaacaa accggataag ttcacacagg tttcccgcta tggcaaagtc 15601 ccagccatca aacatgggga tattgagata tatgagtctg ccatcattaa tgagtatcta 15661 gatgaagtct tcccggaacc acctctatta ccccacgatc cgggagcaaa agcaatagcc 15721 cgtatctgga tcgattacgc caacactcgc ttagtacctg cctttaacaa attcctacgg 15781 ggtaaagata gcagcgaaca ggaacaggga cgaagagagt tcctagaatc ccttttgtac 15841 attgagcaag aaggattagg caagctatcc ggtgatggtc agtactggtt aggggataaa 15901 ctgagtttag ttgatatcag cttttatcct tggtttgaac gcttgcccct tttggaacac 15961 ttccgtaaat tcacactacc agcagaaaca gctcgcttgc agcaatggtg gaatacactg 16021 cgcgatcgcc ctacaattcg ggcagttgaa aatccggtca gctattatat 
agagcgattt 16081 accaagattc tcggtgaacc tacagcagtg ggtgccgctc aaaagtaggg caatgctttc 16141 ttacagcgtg gtttgtcacc aaaacattga acacaaggat gcaacctatg agcctatcca 16201 gcccaatttt atccaaaccc gtcttagagg atttgcctta tgttgaggca gttctcaatt 16261 acctgactcc aatggcccaa aagcctgtta actacaccta cgaaccacca ccaggtgttc 16321 caagacagaa cggagtgtac gaggcgcaca agctgccgat ccgcaatgct agagcgattg 16381 cacaagattt atccttagac caagagggct tcgcttacgc cgctcacaaa agcgccgtcc 16441 gcgactttta cgatgaggac gaggtacgtc gcgtctacta cccagaagcc gaacaacttt 16501 tggcagatgt aacgggcgca aaaaaggtat tggtgttcga tcacaacctt cgtaataatg 16561 agcgggcgaa gcagagtgag aacggtgcaa aggagccagt aaagcgggta cacaacgact 16621 tcaccgccaa gtctggctat agtcgcgccc gtgcggtgtt gacagcactt ggtacagatg 16681 accccgatga acttctgcaa catcggttca gcattgtcaa cgtctggcga ccgattgcta 16741 aaccagtcca agagtctcca ttggcagtgt gtgacgcgca aagcatagcg cccaaagact 16801 tggtagctgg cgacctggta taccgcgatc gcattggcga gacttacgcg attacataca 16861 acccgacgca ccggtggttc tactttccgc aattgcaacc gaacgaagcg ctatttatca 16921 agtgctttga ttctgcggag gatggacggg cgcggtttgc tgcccacact gcgttcgatg 16981 atccgacaag tccgccggac gccccgccgc gccctaagat ggggtgaggt ttgtaaggtt 17041 cctcaagcaa cttccgctcg tcttcagaaa gtaccaaatc cacagcctct actgaatctt 17101 tgagatgttc tattttacta gcaccaataa taggagccgt tacacccggt tggtgcaaca 17161 gccaagctag agcaatctgt gttggggtaa cttgacgttg tttagctaag tctacgacgc 17221 gatcgactat ttgaaaatct gattcgtcat agtagagatt gtgggcaaat tcatcggttt 17281 tagcacgaat ggtttcacca taaccttggg gacgccgatt cccagctaaa aagcctcgtg 17341 caaggggact ccagggaatt atcccaatcc cttctgcgcg agataatggt atgacctctc 17401 gttcttcttc ccgataaact aggttgtagt gattttgcat ggaaacaaag cgcgtccagc 17461 cgtgtttgtc tgctgtgtaa agagctttgg caaactgcca tgcgtacata cttgatgcac 17521 caatgtaacg aaccttaccc gatttcacga catcatgcaa tgcctctaaa gtttcctcaa 17581 tcggtgtttc gttgtcccaa cgatgaattt ggtacaaatc tacataatct gtctgtaggc 17641 gtcgtaatga ggcatcaatg ctatcaaaaa tatgtttgcg agatagtcct ctgtcgttgg 17701 gtccatcacc tacttggttg taaactttgg 
tagcaatgat gacttgatct cttctggcaa 17761 agtctttgag tgcccttccg agaatctctt cgctaacacc caaagagtaa acatcggcgg 17821 tatcaaagaa attaatcccc aactccaaag ccagtttgat aaatgggcga ctttcttctt 17881 cttctagtac ccattctcgc catttgcggg agccataagt cattgtacca agagacagac 17941 gcgatacttt cagtccggtt ttgccaagat taacgtattt catgtggtta actttactaa 18001 tttctcaggt gcttgatgaa ttttgtgatc agtaggcaga acaataccta tttaataaac 18061 cgccaagctg ataaattgtc actataactc catcgaaaag tgatggcttg taatactcat 18121 acgctgcatg agataaaatc tgtgagcttg aaatcgctgg actccagaat caaggctaag 18181 gtgttgacac tgatgatttt gagcatattc aattagccac tggaataact gtttaccgta 18241 accttgtgac tgctttaact catcaacaac taagtcatca atgtacaaaa actttcctaa 18301 tgccaaacag ttagaaatac gaaatcccgc cactgctaaa gtttgctctt ctttttgcaa 18361 aaaggcaagt ttatatcctt ctttcatctg ataccgaact tgttctacga agtcagcctg 18421 ctgaaggtgc gggcgtaact gggatatcac agggaaacac cctaaaattt gagagtcaga 18481 ttctgccagt tgtactaaca caaatgttgt ttctaaaaat attacattcc ttcatactac 18541 taaacgtctt agttctctgt tgacgtgagc caagtctcgc tgggttatcg gtatgttgct 18601 tcggcttgca tgacgctttt ccagtagtta atagctttta aatagaactc ttgcacagac 18661 agaggcgaaa agcaagcgat agctaatttc tccgtcactt cctagatgtg aatattcacc 18721 ccaaaaattt cctgaaccac aattacagcc gggaaagttc cttcaccctc aggttcagcc 18781 agataagcat ctatgtgcaa atcaccattg gggattttga ctcttgtagt gctaattgct 18841 gtatttgtcg ctgtgttggt catctttgca aacagttagg atttttcaca atgtctctcc 18901 gtagtataat actgtatatt gataaagata cagtatttga aaatgagtag cgtaattagt 18961 tctacagcca aataaaagtt aagaatgcaa cctatcatct atcgcgtcag cgctactgat 19021 gttccgctct tgcacgaaat tcttatagcc tgtggacttg atctgcaagt aagatttggt 19081 ttaacgcatt ggatacctcc tatctatccc ctcgaaaata tgctcaaaga tgcagaaaag 19141 ttagaggttt atgcactcaa agtaggtgaa agtttggtag gtactttcac cttagagttt 19201 gcatctaaag tacctcttag ctacatcaag tatggaaaaa ttcattggta aatttcagat 19261 gtacctgcgg tgtatgtgca taaattagct gtgctaccag aacgacaagg acaaggacta 19321 gggacatggt gtttgggaac tattgaaaaa ttggcactta ctaatggata tctgactgtg 19381 cgattggatg 
ctgtgaaaac ttataaaaaa ctcttgtctt tctacgcaag tcgaggatac 19441 ctgagagtag gagaactaat ttttaattcg gatgtttggg ttgatgcttt tgtctttgaa 19501 aaagttttgt tcaagtaaag caattctcta aaagttggac taagtaatca tgtggttgga 19561 tgcaattgat catatccaag tgacatcttc cccagaagca gaagatgcaa tgctattttt 19621 ttacggcaaa gttttgggac tcactgaaat tcctaaacct gagacaatca aagcaaatgg 19681 aggtgcttgg tatgttctgg gaaatattca gattcatgtt agtacggaaa aaaaaccaga 19741 taatgcagca tctcggcgac atatttgtta cttagtaagt gatttgcagg ctttccaaaa 19801 acacctgcga tcgcatagtg tggaaattat tcccgatcaa caaccaatac caggacatgc 19861 gcgattcttt ctgcgcgatc ccgctggaaa tcggatagaa atagcagaaa aacacacaat 19921 cctatcataa aaattcacga gcagataaaa tgatttttga ccaagtagaa tttgatctac 19981 gttgcgaatg gggtgcaaaa ggagtttctg aactcgcacc tattagtgat gtcattgtta 20041 tagttgatgt tctatctttc tcaacttcaa cagaaattgc cacaaacaat ggtgcaatca 20101 tttatcccta ccaatggaga gatcagtctg cccttgacta tgcacagtct gtacaagcag 20161 aattatcaaa aggtcgttta tccaaagacg gttattcgct ttctcctgca tctctaacta 20221 aaattcctgc gggaactaag ctggttgtac catctcccaa tggttcctct ctaacgttgc 20281 tgactcttaa cactcccacc atagctggtt gtttgcgaaa tagtgaagct gtggcaaaat 20341 ttgctcaaag gtatggatct cgaattgcag tgattcctgc aggtgaaaag tgggaagatg 20401 gtactctacg tccagcattt gaagatttaa ttggcgcagg ggcaattctt agttatttaa 20461 atggcaatct ttcaccagaa gcagaaactg ctgtagtagc atttcatgca ttcaagcatg 20521 atttattaac gtatttgaaa caatgcagtt ctggaaaaga gttgattgcc aaaggttttg 20581 agttagatgt tgaattagca gcggctttta atgttagcga ttgcgtacct ttgtttaatc 20641 aaaacgctta catccgtcag aaataagaat cccgacttcc caccagaagt cgggattctt 20701 ggtttttaaa tttagtcgct atagtttttt tttagaggta aaggaacttc agcattcagc 20761 cattctaaga aggtattgtt gatttctgtg ggtgagattt ctgcaataaa gttagtgtaa 20821 ttaatatgct ggcgaatcac ttcttcaatt tgctcccgat aacctgattg agtttttaca 20881 atcaaaacga cttctggttc ttctgtgatt tctcctttcc atatataagc gcaggtaatg 20941 ggaaaccaat taacacaaac agctagtttt tgttccaaca aagcacgacc gatatgacgt 21001 gcttcatctg aattattcaa ggtgatgtag taaagtttca tgtcaatcaa gaaagtaggt 
21061 ctagtatttt ctaaaagttt actcttttct actcgctaga aatcaattgt gtaacaagtc 21121 tactgacaaa taccatatca caatttatct tttatgatgt tccatacctt acttctgttt 21181 attaccgcgt ttattgcggg tggactcaat gctgtggcag ggggtggaag ttttattaca 21241 tttccagtcc tgatttttac gggtgtacct ccaatcactg ccaatgcaac aaataatact 21301 gctttatggg tagcggcttt ggcgagtgca ggagcatatc gtcagaattt aagcatcccg 21361 cgacggcaat tcttcttact gtgtggcatc agtttagtcg gtggagtgct tggttctgtt 21421 gctttgttat atacctctcc agatgttttt caaaagctaa ttccgtatct attgctgcta 21481 gcaacgctcg tgtttacctt cggtgaaccg ctcaaaacat ggtttcagcg tcagagtcaa 21541 aagtcatcag aatccccacc gttgttaaac ctcatgttag cccaactagc gatcgccatc 21601 tacggtggtt tctttggcgc aggtttaggt attttaatgc tagcaaccct gacgtttttg 21661 ggaatcaaaa atattcacac catgaacgcc tttaagacgt ttctagggag ttgcattaat 21721 ggaattgcca ttattccctt tatttttgca ggtgtcattg cttggcatca agctattttg 21781 atggctgtcg gtggttctct tggtggttac ttatgcgcta actatgctcg taggcttgaa 21841 ccccttttaa ttcgtagagt tgtgatggtt gttgctttta gtatgactat ttactttttt 21901 attcatggtt aggcttgagg tgatttctat gaaaaaattt accgatggag gtaaaatgta 21961 ctcttgatgc gaaggaaaca cgagaaagca ttttggatac cttagcccca gcaggcgcac 22021 ccaaagcatt aacatcatcg caaatcgaag acttgcggtt ggcagcatcg aaaatgaatg 22081 gagtagaacg tcgggcattt atcaacagga actattcaat ctatggctaa actttttgaa 22141 tccatcactg aagaactgca agaatttatt gaagctcaac accttttctt tgtaggttct 22201 gcacccttaa gtcccactgg tcatgttaac ctttctccta aaggtctagg aggttttcgc 22261 gttctttctc cccatcgtgt aggttacttg gacgtgacag gtagtggtaa cgaaacctca 22321 gcccatctgc aagaaaatgg gcgaattact tttatgtttt gcgcttttgc tgaaccccca 22381 agtattctcc gtctctacgg taaaggatac acagttattc ccagttcgcc agaatgggaa 22441 actctgtatc ccttgttttc agagattccc ggagcgcgtc aaattattgt agcggatatc 22501 tcaagagtgc agacctcttg tggttttggc gtaccacttt atgaatataa aggacagagg 22561 gagactttag tcaaatgggc aaagaaaaaa ggtgaagccg gacttaaaga atatcatcag 22621 caaaaaaatc tggtcagcat tgatggttta gctactccat tgagtaagtc atgaggctat 22681 tgataagttt tggctctctt attggctgat tttggcgcaa 
aggtagttgt atgcgatagc 22741 cttaggtgat ttctaggaaa gaatttaccg atggaggtaa aatgtactct tgatgcgaag 22801 gaaacaggag aaagcatttg gataggttag ccctcgcagg cgcacccaaa gcattaacat 22861 cgtcgcaaat cgaagacttg cggttggcag catcgaaaat gaatggagta gaacgtcggg 22921 gttttcaggc tgaaatggca ttgaaatatt gcaagggtag tgcaaggctg gcagaaacag 22981 tatttggttg gggtagacag aatatagagg taggattggc ccaaaaacga acaggaataa 23041 cttgtatggg attacagtca accaaatgtg gagcaaagcg ttgggaggag aaacaaccaa 23101 aagctgcgtt gtcactgcaa cagcttcttg caatcttatg ctcaacaaga cccgacattt 23161 aaaacatcat tagcctatac ccgactaacc gcagcatcgg cattgaaaga actaaaagag 23221 cagggattta gccaggagca attgccagga gccagtacaa tggcacaagt attgaaccga 23281 atgggctatc gtcaacgcta gcgttgtaaa agccaaacct caaaaaaaat tgcacaaaca 23341 gacgacatct tcaacaacat taaagaattt gatcagcaag caacaagcac gcttgtcaaa 23401 cgactaagca tggactgtaa agctaccgtt aatattgggg attattctcg tggggaaaaa 23461 accagaggag acaatgggat attttgattc gaccacttta gctggtaatt tctttttcaa 23521 aaatcgccta agcgatcgca aatagcttat attcaccgaa acgaggatct atgactcaag 23581 agcaatggac tgcggttgac cgctacatca ccgatttgtt tgtgccgccc gatcccgcgc 23641 tggatgcgac gctccagacc agcgccgcag ccggtctgcc gccgcataac gtttccccta 23701 accagggcaa gctgctgctg ctgttggcgc gggttcaagg ggcgcgcacc atcctggaga 23761 ttggcacact ggggggctac agcacgatct ggctggcgcg atcgctgccc gctgacggtc 23821 gcctgatcac actggaggct aacccaaagc acgccgaagt tgcccgcgcc aacatcgcgc 23881 acgctggtct gtctgatgtc gttgagttac gtctcgggcg agcgctggat acactgccgc 23941 agctcgtcgc ggagggtcgc gacccatttg acctgatttt catcgacgcc gacaagccaa 24001 gcaatccaga ttacttcgcg tgggcgctca agctttctcg tcgcggcacc ctgatcgtcg 24061 ccgataatgt tgtgcgtaac ggagctgtgg tggatgctaa gagtggcgat cccagcgtcc 24121 agggtgtgcg ccgcttcaac gagctgctcg ccgagtctcc gcacgtgagt gctacggcaa 24181 tccagacggt gggcagcaaa gggtacgacg gcttcgcgat cgcgatcgtc accaccgagc 24241 agtagcgctg tgaaagagcg gttctgattt atagatattt tttaccttgc tttattagca 24301 acaccagaaa acacaaataa ctgttgctct ttgggtgtag gcaagttttc acccctgtaa 24361 cgatgaagac ctactcccat 
atctcctgtt gcagtgagtt tgaagttaac ttcctcaaat 24421 ccagattcac gtaaaaatct acgcacagtc tcagagtagg gaataccgtt agagtgtccg 24481 cgagtccaaa ttacaaaagc accttgttta cttagaaaac ttaggtttcc tagcaagcga 24541 ttaagttcag cttcatcagc aagattacca aagataccgc acacaatcac aatgtctgct 24601 ggtactgctc ctacatagtt ggaggaaata gttgcatcac cattgataaa ctcaatttgc 24661 tgtgccaaac ccaaggattc tatacttgcg cgtccacgtt caactagttg gggattgatc 24721 tcaacgagtc gtgcatggac atcttttgca cgagggtgac ttgcgagggt tcccaataaa 24781 tctcgtccat cacccgcgca aacactcact acacggatta ttcctggtgg ggacgcatcc 24841 aaactatgag aaatgtattc ccgcacaatt tgcaagcgtt gttgcaattt tggctcagta 24901 ttgtagaggt cgtgccattc taaccagtct ttaggcataa cgcgattctc tcctcatcca 24961 aagtatgctt ttgcaggtga gcttgtcacc tgaaatgtaa tttacgctaa acttatcaag 25021 taactgtatt atatttgttt taagtagaat atcacacaaa ttaagacaaa atcaatgttt 25081 ggaattatat tttccttgac aaaagccttc cttgtaggaa tccgcccaca actttggtaa 25141 catggaatat ttcctagaaa gcttaaaact ataagtagca catgaatttt aatttagagc 25201 agtttgctag cgataataac tctgggattt gtccggaagc attggaatac atgatgaagg 25261 caaatcaagg tagtgctcct gcttatggaa atgatgaatg gacttcatta gctgcagact 25321 attttagaga cttatttgaa attgattgcg aagtcttttt tgtctttaat ggtactgcgg 25381 caaattcctt atcattggct gcattatgtc aatcttacca tagtgtcatt tgtcatgaaa 25441 acgcccatat cgaaactgat gaatgtggtg caccggaatt tgcttctaat ggttccaagc 25501 tgctacttgc taaaggggaa aatggcaagt taactccaga agcgatagaa gcaattgtca 25561 ataagcgggc tgatattcac tatcccaaac ctaaagtcat tagtctgact caatcgacag 25621 aattaggaac tttgtattca attgatgaac tggttgccat taagtcagtc gcacaaaagt 25681 acaacttaaa gattcacatg gatggcgctc gttttgccaa tgcagtagtt gccatgaata 25741 aaagtccggc tgagattacc tggaagagtg gagtagatgt attgtgtttt tgcggtacaa 25801 aaaatggcat ggcgttaggg gaagcaatta ttttttttaa taaagcatta gcagaagatt 25861 ttgcctatcg ctgtaaacaa gcggggcaac ttgcatccaa aatgcgattt atctcggctc 25921 cttggctggg attattggaa actggtgctt ggtttaagaa tgcacgtcat gcaaatcaat 25981 gtgctgaata tttagaaaat caattgctaa aaatagaagg cgttgagatg atgtttccga 26041 
gagaagccaa tgctgttttt gtgaagttac ccgaacaagt tattacaagt ttaagagaga 26101 agaattggca gttttataca tttatcggtg tgggaggagt gcggtttatg tgttcttgga 26161 atacaactca ggcaaggatt gatgaattgg tgagtgatat taaggaagcg atctgcgctc 26221 cgcgcagcag ctaagctatc gcctaaaata tctaaaattt tcataaaatg cctttattct 26281 cacctcagca agatttaaaa tttaggacaa acatcatgtc agactcgtct aggttacaga 26341 gcttgctttc agacgaagcg ctaacccact atgaaaatgg tcgggaagct catagattat 26401 caaaaggtgt tggtcagctt gaattagctc gcactcaaga acttctcagt cgctacttac 26461 cccctccgcc cgcagttatt tttgatgttg gcggcgggaa tggcatctat gctttttggc 26521 ttgcccagca gggttatgaa gttcatctaa ttgatgcggt tcccttgcat atagagcaag 26581 cccagattta ttcccagact cagcgcgctc atccacttgc aagtatagca gttggcgatg 26641 cccgccaatt aaatcgagcc gatgccagcg ttgatgcagt tgttttgctt ggactactgt 26701 atcacctgat cgagcgcagc gaccgcatag cagcattgcg cgagactcat cgtattttaa 26761 aaaatggagg tcttgttttt gctgttggaa tttctcgctt tgcttccact ttagacggat 26821 tgtttcgcgg atatctcgac gatccagagt ttgtagcaat tgttcaacgg gatttagccg 26881 aggggcaaca tcgcaatccc agcaatcatc ctgcttattt tacaacagca ttctttcacc 26941 atcctgaaga actgaaagca gaggtggaag aagcaggctt atcttgtgaa aatatattag 27001 cgatcgaagg ttcgggctgg ttgctacaaa actttgagga acattggagt cagccaagtc 27061 gccgcgaaag actcttacaa tctattcgtt ggttggaaac tgaaccttct acactaggaa 27121 tgagcgctca catcatggcg atcgccctca agaccgagcc agtcggagac tggctccctg 27181 aactccgttc aggggaggca cttcgtgcct cgggcacagg gcgcgctacg cgatcgctaa 27241 aaaagcgaca gtcttctaga ttttgtcaga aacgaggctc aaaataccat caaacgtaaa 27301 tactgctgag atttgagagc gacaagtaaa gtaaacccga tgaaagaaga taccatcgaa 27361 attacaggtt ttcacgctca tgtctacttc gataccgcaa gtcgtgatac ggctgcccgt 27421 gtacgcgaag gattgggtgc tagatttgac gtgcggctcg gacgctggca cgagcagcct 27481 attagtccac acccaaaacc gatgtatcaa gttgcatttt caccggatca gtttagccag 27541 gttgttccgt ggttaatgct taatcatgag gggttggata ttctcattca tcccagtaca 27601 ggcgatgatg tgcaagatca tactgagcat tctttgtggc tgggagagaa actagaatta 27661 aatattgagt ttctaaggca aattagaact acttgaaaaa aatatgcggc 
tgattcttct 27721 gtgttaggat tttgcgtcag cgtacaccgt cctgttgata tcgcttactc tatttggttt 27781 agttgagcat taataaactg acatcttgca ccaataaaaa ttgatatgag ttgaaaactc 27841 agtactcagc ctaaagctct tattttacct ataaaagcca agaaaagtga tgtaactttc 27901 ttgcgctgct tttgactaag tccaggctta tacggatagg tgcaagatct gagtaaacag 27961 atgccagcac tttatctgct actgtttgaa atacagtagc agtatcaacc tattcaaaaa 28021 tttttgaaaa aaaacttttc aagttatatc tgattaaatt caaattatga gttacagaca 28081 ttttggaaga attggagaca tatggaagca tattccttta tgtgaatttt tggctatcga 28141 gcaaccgatt agctatatcg agacaaattc tgcctctcca gagtatcagc ttacaggctc 28201 ttttgagcaa caatacggaa ttcttcatat cgaaaaaaac atcaaaaatt ctcagttaat 28261 tcggcagtct gtttattgga aaatactcag cagtctatct gagaacaaaa atggcttatc 28321 aaaatacctt ggttcacctg ctttagcatt aaacatctta aaagcatcaa ctaataaatt 28381 tgtgtttttt gatattgaag aaatgtgctt aaaacaaata gcagtttttg tgaaaaaact 28441 caatattaac agcaaaatta cttataaaaa tcaagattca gtagatggac ttataggtat 28501 gcttgaagaa cttggaacac gggattttat tcatgttgat ccctacttca ttcatcatac 28561 aaatgctaat gagcattctt attttgatgc tttttgtcta gcaatgagaa aaggtgttat 28621 gggaatgcta tggtatggat ttaacaccat aaaagaaaga gaagttttac acaatgtttt 28681 tagctctcag actaacacca gtagtcaaac aaggttacag ggaattgaaa tcgcatcaat 28741 tctgcttaac aaaaatattt tagatgtaaa tcctggtgtg ttggggtgcg gaattctaat 28801 tggcaacttg agtgaggata gtcgtcagtc ttttaaagaa atggcacagg aagtaataaa 28861 tctctatcga gactctacta tgtttgacaa gtatcctgga gagctaaaac ttaaggaatt 28921 tgagattcaa atataacagg cacatgtaga tttgttttgt aagtagttca ccaattgaac 28981 cgaaagcaaa ctttgctctt aacctcctta atatatgaag aaataatttc tgaatgcaag 29041 ttcacgctgt tgcttcacca atccctgagg aagctccggt aataattgcc actttttcgt 29101 ctaatttagc tgccataatt tgctccttag cgctgacact gttaaagccc tggctttgtc 29161 cttgagtcat ccctagccca aaactgccca acgtcatcag agctaataag gctccgaagg 29221 ggtcaaagcg ttgtttaccc cctaatacca cagaaggtgg caccacacga gctactagga 29281 aactgcttac tatccccagt ggtatattga ttaggaaaat gctacgccag cctgtccatc 29341 ctaaaagcaa tcccccagcc gaaggaccta 
gagcaattcc caaagatacc acactgccga 29401 tgatacccac agcccgaccc cgctgtgagc taggaaagac ttctgtaatg attgctaaac 29461 caagcccgga gatgaacacc gcgcctagtc cctgaagtgc gcgagcagca attagccaat 29521 ttatactaga tgcaaaaccg cacaatagcg aactgaaggt aaatagaatc agtcccccta 29581 aatagaggtg tttcttacct aacatatcgc ccagacgagt cgcacccaaa actagaccgg 29641 aactgactaa ctggtaactc aacacagtcc actgagcttg gggaaaggag gaatgcaact 29701 gattaactag tgtgggtaga gctacattaa taatacctac atcaagggta gacattagca 29761 cccccagccc gacaccgaac agaacccaat ttttttgtga gtctgacaaa gcctgaggtt 29821 gtgtttgtaa tagtggcaat gtccagcctc ctgaattcct aaatttacgg ttacaaatat 29881 tttgtatacg tattttaaga gtatcttatt ttgttgaccg atatgttaac tatggtttga 29941 agatattaaa gcaagtttcc tccttacacc cctacaccct acaacgccag atgcctcaag 30001 tcgggaaacc cgccctacac ccctacaccc c // LOCUS NODE_939_length_29918_cov_5.01691129918 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 29918) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 29918) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..29918 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 229..672 /locus_tag="DP116_08100" CDS 229..672 /locus_tag="DP116_08100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YjbQ family protein" /protein_id="PRJNA477356:DP116_08100" /translation="MPIVNHLIEVETKQGINIHNITSPIQELIESTSIKNGQALIFSR HTTTALAINEYEERLLEDVKVYLQKLAPESDRYLHNDLHLRKNIPVDEPMNAHSHLMA MTLSTSEVIPIVDGKLALGTYQSVLFFELDGPRKRTVFCQISGES" gene 676..2037 /locus_tag="DP116_08105" CDS 676..2037 /locus_tag="DP116_08105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015187396.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-polyprenyl-6-methoxyphenol hydroxylase-like oxidoreductase" /protein_id="PRJNA477356:DP116_08105" /translation="MRTKMTNNHAIVIGGSMAGLVVARILSDRFEQVTLIERDQFPCG AIARKGIPQSRHLHVLLQQGQLILERFFPGLGEEMIAAGAHLIDVTADMMWLTPAGWG VRFPSNVSMLGFSRDLLDWIIRRRLATINNIRFVEGCDVMGLLSNTDGTCVAGVSLRS RTSEEEQLHADLVVDASGRSSKSPQWLKALGYQPPQETVLNAFLGYTSRLYRLPTDFQ SDWKVVCLQAAPPTRTRAAAFMPLEENRWILTVYGGDSDYPPTDEAGLLEFVRSMPCS SIYNAIKNAEPLSQIYSYRGTENRWRHYERLPRYLEGFLVLGDAACAFNPVYGQGMTI AVLGASTLDECLHQQRQYQPNGDFTGLARRFQKKLAKINAVPWLLATSEDYRYRGTEG KPPSLLTQLMHRYMDRVVQLTTNHADVRLALLEVMHMVKPPATLFQPRIVIPVLKQLF KLN" gene 2171..2872 /locus_tag="DP116_08110" CDS 2171..2872 /locus_tag="DP116_08110" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08110" /translation="MLKSFSAGLILLLIVILPAMAIGYAISYRTIGHQAAQRRLGRTA LLILLAFFVGFGLNSLGKYGSISINFLFAAFVVLWLLSWNWRKRKAGALLLDVGGFSR SKLMLWAGVLEGLFAVFYTWSAINKISTGLESDSNLVEVLARPVFLWSLAIYFLSTGL SRLEFRENGICYMLSVVKWEKLTSYRWNPDKPNIMTIEFKQPPLLLSTGLWSLRIPSA HRDTVEQILAEHVNN" gene 2966..9142 /locus_tag="DP116_08115" CDS 2966..9142 /locus_tag="DP116_08115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878253.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_08115" /translation="MSITISGYNLIEVIYDGATTCVYRALRETEQTSVIIKTLKAEYP TIEQLTRLRHEYKILQALEIEGIIKPLALESYQNGLALILSDFGGEPLKNLINAQKFN FSLNLSNCLQIAIQLSSTLAQLHQNNIIHKDIKPHNILINAKTGQVEIIDFSISSRLS SENQTANNPNLLEGTLAYMSPEQTGRMNRLIDYRSDFYSLGVTFYEMLTGQLPFQAAD PLELVHCHIARTAVSPKELNPEIPQAVSDIVMKLLAKTAEERYQNALGLKADLEECLR KLQATGKVEDFVVGQLDLYSQFIIPQKLYGREKEVATLMDAFERVANPPESLLTKGSQ RLAGVPPVVATGVGHRGVEMMLVSGYSGIGKSSLVNEIHKPIVRQRGYFISGKFDQFK RNIPYASLIQAFQELIRQLLTESADKIAVWKAKLLEAFGSNGQVITDVIPEVERIVGV QPDVPQLGPTESQNRFNRLFQQFIHVFTKLEHPLVLFLDDLQWADLASLKLIQLLACD PNSQYLLLIGAYRDNEVSATHPLMLTLEEIQQKGAVVNNIVLQPLQITHVNQLISDTF RCDTTKTMSLAELVFNKTQGNPFFLTQLLKSLYNDNLLSFNFTPLPYQGEPAPCSGQE SCGDWRGTEGGWQWDIKLLKDIDITDNVVELMINQIHKLSVNTQNILKLAACIGDKFT LDVLGIVNQKSLSETAADLWESLQTGLVLPLDQSYKIPLVISSQQNEQLTNDLEEQAT SRTVEELTIAYKFLHDRVQQAAYALIPDSQKKETHLKIGQLLLQNITPEERKENIFAL VNQLNYGTDLLTLESEKYELAELNLIAGQKAKAAAAYESAMRYLKVGLELLAVNSWQN QYELTLALYESGVETAYLNGDFEQMEKWATVVLQQAKTPIDKMKVYEVKIQACMAQVK QLEAIKIGLQALELLGVSFPESPSASDIEETLTQTARNLSGRNIEDLINLPLMTEVDK LAAVRMLACLGSPTYQAGPALLPLIACEQLNLSIKHGNSPFSAYSYVLYSIMINGLFQ DIESAYQFGKLALSLVEKFNAVELKTSVFFVAGSSAFHGKVHAKETLLLLQDSYSSGL ENGHFEYGGYAAMQKCYYSYLIGQELAKVEREMAATSNVLAQLKQENALSWNQIFQQS ILNLLEPFEKPCCLLGEAYNEEKSLPLLKEANDRTGLHYFYSNKLILCYLFGEHDQAL ENAVQAEQYLDGVKGFLIVPVFHFYDSLAQLAIYPLVPHSQQEHLLSRVIKNQEKMRK WADHAPMNFLHKYDLVEAEKARVLGQYWQATEYYDRAIAGAKEQGYIQEDAIANELAA KFYFERGREKVAQTYLTDAYYGYIRWGATAKVRNLAARYPHIFSQTPNRQTKGLEMNQ TISSTTTGTPLLDLAAVMKASLALSGEIVLDKLLAKLMRIVIENAGAETAFLILEKAG QLLIEASGSVGQDEITVRRSTPVETSQQLPISVINYVRSTQGHVVLHDASSEPVFATD NYIINSKPKSILCTPIVNQGKLIGILYLENNLTIGAFTPERLEVLQLLSSQAAISIEN ARLYNDLEEYNRTLAAKVEERTLELQDKNLQLQQEIKERQRAEETAKTANRAKSEFLA NMSHELRTPLNGILGYTQIFKKDKALTAQQKNGIDVIHQCGEHLLTLINDILDLSKIE ARKMELYPKEFHLPEFIETIVEICRIRAEQKGISLIYKTLSPLPRLIRADEKRLRQVL INLLSNAVKFTEKGSITFTVGYQEEKLRFQVEDTGIGIAQEQLEEIFLPFQQVGDESR 
KTEGTGLGLAISRQLVQMMGSELKVKSTLGKGSVFWLDLDLPEVFQQSDVKSIDEDNI IGFIEPTRKVLVVDDKWANRSVLVNLLQPLGFEVTEATNGLDALDKVREFKPDVILMD LVMSVMDGFEATRRLRMLPDFKEVIVIAISASVFEWDQKQSREVGCDDFLPKPIQKAD LLEKLQVHLGLEWIYEQPENQVKAQSIAPAQTQDSLVVAPPAEELAVLLDLAMRGDLR GIAQRAAKLEELDEQWVPFATHLRQLVKGFKGKQILEFITKF" gene 9162..11186 /locus_tag="DP116_08120" CDS 9162..11186 /locus_tag="DP116_08120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_08120" /translation="MNIDPTQKGVILIVDDTPTNLEVLFDFLADSGFTVLVAEDGESA IARAEYAPPDLILLDILMPRMDGFETCSCLKANELTKDIPIIFMTALSETVDKVKGLN LGAVDYITKPLQHEEVLARIELHLRLRNLTKTLQEQNQQIREQAALLDITTDAILVKD LDNQIRFWNKGAEHLYGWKAIEAIGKNVNQLLYPVETQSQLQNLQESLALSGSWQGEL HQVTKEGKEIIVASRWTLMGEQDGQPKSILTVNTDITEKKQLEAQFLRAQRLESIGTL AGGIAHDLNNILTPILTAAQLLQLKLPNIDERSQQMFTTIETNTKRGAALVKQVLQFA RGVEGKKRTIVQVNHLFSEIEQIVQETFPKSIEFSTNIKSDLWAIVGDATHLHQVLMN LVVNARDAMPDGGTLKISAENVFIDEHYARMNLEASVGSYIMISVADTGIGMSPKIVD RIFEPFFTTKEFGKGTGLGLSTVRGIITSHGGFVNVSSNVGRGTEFKVFLPAVEVTAT PVAENLELPKGNRELVLVVDDESPILETTKISLESYNYQVLTASDGIEALALYAQYKD DISVVLVDMMMPSMDGALTIRALQKMNPHVLIIGVSGLVAGDKLMEQARVKAFLSKPY TTKELLQTLHSVLFQDVRNTKSAKNWETWDRVRTKSIIYYPESTPNNDHF" gene 11614..13074 /locus_tag="DP116_08125" CDS 11614..13074 /locus_tag="DP116_08125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316744.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_08125" /translation="MERIGYLQLASANEASTRNEHTQVPFNLNLFAELNWRKVSSSAA IHLLSVALTLALLSIAERALALQKVGSSGPQISNIQRCLSNLGYYNGPVTGKFASLTQ NAVIRFQQANRLPTDGVVGARTQQLLQSQCQSRRPGGSVSSGLQPGSSGQAVTRLQQD LGRLGYFNGPITGNFGSETQQAVIKFQQARGIRPDGVVGARTEEAIRIVLSRNNPTVG VGGDSLPNALNLGDSSPQVRELQQDLQQLGYFRVNPTDYFGPTTQEAVARFQQDNRIV PSGIADSQTLGAITIALREQSYGQNSEQSSVVQNSGQSYGQSYGQSSVVQSSVVQSYG CSTATGDICQGERSQRVTVVQQRLQNLGFFRGDTFGFYGPATRDAVIQFQRYSGLETT GSVNFQTWQALGLTNNGNNSTELNTTKENRYVVIIPISRNETLNEVRQYIPEAFRAES RLGPYVNAGQFRERSQAEDLSKWLRSRGLDARVEYF" gene 13178..14050 /locus_tag="DP116_08130" CDS 13178..14050 /locus_tag="DP116_08130" /EC_number="2.4.2.28" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-methyl-5'-thioadenosine phosphorylase" /protein_id="PRJNA477356:DP116_08130" /translation="MAETRIGIIGGSGLYKMDALKNIEEVQVQTPFGAPSDALILGTL EDTRVAFLARHGRNHTLLPSELPFRANIYAMKQLGVEYLISASAVGSLKEEVKPLDMV VPDQFIDRTKNRVSTFFGEGIVAHIAFGDPICKNLAGVVAEAIAKLNLPDITVHRGGT YVCMEGPAFSTKAESNLYRSWGAKVIGMTNLPEAKLAREAEIAYATLALVTDYDCWHP DHDSVTVDMVIANLQRNAVNAQKVIQETVRRLSENPPSSDAHSALKFAILTNLDKAPV ATKEKLALLLKKYI" gene 14493..14948 /locus_tag="DP116_08135" CDS 14493..14948 /locus_tag="DP116_08135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316742.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_08135" /translation="MKSKLIALLTLVAPLVLASSVNAANPQHVKKLLSTGECAGCDLS KANLSGAHLIGADLRDANLKGANLTKANLEGADLTGANLAGANMTSALATNVDFKKAN LNRVNFTRATIHDSNVYGASMNDLNITNAEISNTGIGIGGEDAEIPDWK" gene 15821..16270 /locus_tag="DP116_08140" CDS 15821..16270 /locus_tag="DP116_08140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653760.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08140" /translation="MEAEMQEPEIVETKSPEATMANINNQTGSITKLQPTVQSQDQWL KYGEQVSGFLATLPEYLGNFFNRYKQPLVSIGLIVAAIVAVKVVLAILDALNDIPLVS PTFELIGIGYSTWFIYRYLLKASTRQELTDEITTLKSQVVGKQIPES" gene complement(16267..16512) /locus_tag="DP116_08145" CDS complement(16267..16512) /locus_tag="DP116_08145" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08145" /translation="MQDNSLLISYFPCGKDYAIDTAAPFPEPAAVRFALMGILRAKGM PVPFVNKPSCRTGFTQSSNAFRQTIQNQRYAISLAGL" gene complement(16633..17658) /locus_tag="DP116_08150" CDS complement(16633..17658) /locus_tag="DP116_08150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S-layer protein" /protein_id="PRJNA477356:DP116_08150" /translation="MTNMRPPDSESSPNNTLGFDEFIGILVAFLTIGMILFWTLSRKN SNWNFTGLISPSPTSSASPIIPVIPEQQAIPFILPGVKPTVTPSPTDKHTFLDDLFPQ TLNVPPDEAWSKPEQPSFTPPTTSTQQSRVVTPSEKLPTIPPPIAFTDVPADFWGRRF IDILSSRGMIKGFPDYSFRPNQPVNRAEFAAILQQAFDKRDGGNSTNFKDIPPEFWAI PAINRSIATGFLKGYPDQSFKPDQKIPRVQVLVALVSGLDLKVRSSPEKVLSIYKDAK DIPKYAIDKVAAATENRLVVNNPEPQVLAPNQEATRAEVTAMVHQALVRMGRLQPIES QSIVTAP" gene complement(17757..19043) /locus_tag="DP116_08155" CDS complement(17757..19043) /locus_tag="DP116_08155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651921.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="homoserine dehydrogenase" /protein_id="PRJNA477356:DP116_08155" /translation="MGVKLGILGLGTVGTGTVQLLQNSGFRHPLLQEVEIYRVGVRSL DKPRAVTLPQTVLTTDLEAIVIDPEVDIVVEVMGGLEPARSLILKAIQNGKHVVTANK AAISRFGDEIFSAANQAGVYVMLEAAVGGGIPVIQPLKQSLSVNRIHTITGIINGTTN YILSRMQTEGSNFSDVLADAQQLGYAEADPTADVDGLDAADKIAILASLAFGGRIKLE DVYCEGIRQVSKTDIAYAEKLGFVIKLLAIAKRITSSPPISIRVHPTLVPKAHPLASI NGVNNAILVEGEPIGQVMFFGPGAGAGPTASAVTSDILNLVAALQTSTAVPNPLLTCA HQDYCQIVPMAELITRFYTRFLTKDQPGVIGKLGTCFGNHGVSLESIVQTGFQGELAE IVVVTHDVREGDFRQALAEIRTFAGVDSIPSLLRVL" gene complement(19196..20485) /locus_tag="DP116_08160" CDS complement(19196..20485) /locus_tag="DP116_08160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951277.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferredoxin-NADP reductase" /protein_id="PRJNA477356:DP116_08160" /translation="MYIKGAVEGAANTESGSRVFLYEVVGLGQSEETDKTNYPIRNSG SVFIRVPYNRMNQETRRITRLGGKIVSIQPLNALEHLNGKTSTVDANSEAETASSQAN GKATPVAEQQPKQKDKQGNTMTQAKAKKESHADVPVNIYRPNAPFVGKCISNEALVKE DGIGIVQHLKFDISAGDLRYIEGQSIGIIPPGVDKNGKPEKIRLYSIASTRHGDDVDD KTVSLCVRQLEYKHPESGETIYGVCSTHLCFLKPGDDVKITGPVGKEMLLPSDPEAKV IMMGTGTGIAPMRAYLWRMFKDNERAANPEYQFKGFAWLIFGVPTTPNILYKEELEEI QQKYPENFRLTYAISREQKNPEGGRMYIQDRVAEHADELWNLIKDEKTHTYICGLRGM EDGIDAALSAAAAKEGVTWSSYQKDLKKAGRWHVETY" gene 21042..22043 /locus_tag="DP116_08165" CDS 21042..22043 /locus_tag="DP116_08165" /EC_number="2.7.1.19" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015080196.1" /note="Catalyzes a reaction in which the CO2 acceptor molecule, RuBP, is generated via the phosphorylation of ribulose 5-phosphate with ATP; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphoribulokinase" /protein_id="PRJNA477356:DP116_08165" /translation="MTKPERVVLIGVAGDSGCGKSTFLRRLIDLFGEELMTVICLDDY HSLDRKQRKETGITALDPRANNFDLMYEQIKALKEGQAIDKPIYNHETGNIDPPERIE PNHILVVEGLHPLYDERVRELIDFSVYFDISDEVKIAWKIQRDMAERGHRYEDVLAQI NSRKPDFTKYIEPQREFADVVLQVLPTNLIKDDKERKVLRVRMLQREGKEGFDPVYLF DEGSSIQWTPCGRKLTCSYPGMQLYYGSDVYYGRYVSVLEVDGQFDNLEEVIYIETHL SKTSTKYKGEMTHLLLQHREYPGSNNGTGLFQVLTGLKMRAAYERLTSKEAKLAAKV" gene 22338..23600 /locus_tag="DP116_08170" CDS 22338..23600 /locus_tag="DP116_08170" /EC_number="2.5.1.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317638.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methionine adenosyltransferase" /protein_id="PRJNA477356:DP116_08170" /translation="MSHRYLFTSESVTEGHPDKICDQISDTILDTLLSQDPSSRVAAE VVVNTGLVLITGEITTKANVNYVNLARKKIAEIGYTNAENGFCANSCSVIVALDEQSP DIAQGVNTAHETREQNSEELFDSVGAGDQGIMFGFACNETPELMPLPISLAHRIARRL AAVRKTGDLPYLRPDGKTQVTIAYEDGRPVGIDTILISTQHTATIGDITDEAAVQAKI KEDLWSAVVEPVFSDINIKPDEATRFLVNPTGKFVVGGPQGDSGLTGRKIIVDTYGGY SRHGGGAFSGKDPTKVDRSAAYASRYVAKNIVAAGLADKCEVQLSYAIGVARPVSVML DTFGTGKIDDQILLELIKTHFELRPAGIIHAFNLRNLPKERGGRFYQDVAAYGHLGRN DLDLPWERTDKAELLQQAAKNFLSAAIV" gene 24162..25886 /locus_tag="DP116_08175" CDS 24162..25886 /locus_tag="DP116_08175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_08175" /translation="MQTQEPTAVDSNLEITQVSAQEPSTDELPTIEFPSPGKFKASSW RIHQKIGYGYFLAIGIGFLGSLTGLVISNFYLGIETKQLEHAQFQTQVLGGYRNALVN AQLHGSNLVAVIRDPQRRSSKRAELLSSVKDAKNVEQRITKFIDSKPGKLAVPEDTLR TILNDYSRNLESYVSQIDSILQQFDQQPKQQQQISVLRNELLEIMSGGTAMRLDEIRK RLTNNLQIAQQREQKFTTDLAIAKGVERLIAVASMLLSVAIAAIVAWRTSRAIAEPVI LVTQVAQQVAKKHNFDLRAPISTEDEIGSLAKSLNRLIERVSERTKELQQAKELAEAA NKTKGQFLANVSHELRTPLNAIIGLSQLLRDDAVDFGMSEEFIDDLESINTAGRHLLI LINDILDLSKIEDGKMTFYPETFELAPLINNVVLTVKPLVEKNGNLLEVNFDGELGIM YTDQTKLRQVLYNLLSNAAKFTTNGRVRFIIHKEILNVQTSHTPGMITFTVEDTGIGM SYHQQQQLFQRFTQGDASTTKKYGGTGLGLAISRHFCQMMGGEIFVTSEPGVGSIFTV RLPLIAKD" gene complement(25907..26212) /locus_tag="DP116_08180" CDS complement(25907..26212) /locus_tag="DP116_08180" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08180" /translation="MATIIMDQSLLPKTELSIPIPHAENSRIATSDTSGYTLTKQLSA FICTVGATSPGTLVEHCLMHLQFKNFKFVLTFEFERRQDRLSLQLINSKNKPFYSTN" gene 26211..27041 /locus_tag="DP116_08185" /pseudo CDS 26211..27041 /locus_tag="DP116_08185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015180951.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 27950..28888 /locus_tag="DP116_08190" CDS 27950..28888 /locus_tag="DP116_08190" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08190" /translation="MKFSALLKNWIVKSKKAKIAVCFFAVCLLLLILVTPNLPPALTT QVSQIPIAECTTVQSGDPRSPTNPDIPYIISPRRTLLLTDKPKLRWNQVLGVKSYDVS LQKGDSVVWQTKVNTNQVVYPGEPRLETGVEYLLIVKADNGKLSTDEKPNARGFSLLS KDEAPVVKASMSQLNNQKVPDKVTELQRAFYYIGADLKSEAIETLESLISSGIKETSV YRKLGNLYWETGVNVQTEINYLNAHKLARANKDIIQQAQIAEALGDLYIAIDDQKTAR NWFAQARDSYKTLGNKQRVEELDEQIKQLKTNKPTT" gene 29265..29564 /locus_tag="DP116_08195" CDS 29265..29564 /locus_tag="DP116_08195" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08195" /translation="MTDQTLHPNQDNRKTEAKKLLEQANQQLGEALKSYQQALDIYQD MGEETVATLISYLISQSKILTVTGTDKSGSSKINLTAASTILGNRSKVKRPSSRK" BASE COUNT 9053 a 6006 c 6582 g 8277 t ORIGIN 1 agcctatgag cctgggactt acagtcctag gcggacgaaa acgcagtcca ataatatgta 61 agaaatttga catttatttt cctaagtgca agaggtaagt taacgcagtg cgacacacta 121 cctttaaact ataaatcagt aaaaaaatca gacttatcca ccatccttgt ctttggtgga 181 aaatttccat tctgtaactt acaaaattaa aaacacaaaa ttgacggtat gccaattgtt 241 aatcatttaa tagaagtcga aaccaaacaa ggcatcaata tacataatat tacgtcaccg 301 attcaagaat tgatagaatc aacctctatt aagaatggtc aagctttgat tttttctcgc 361 cacactacca cggcattagc tattaacgaa tatgaagaaa gattgttaga agatgttaaa 421 gtgtatttgc aaaaattagc accagaatca gaccgctact tacataatga tttacactta 481 agaaaaaata tccctgtaga tgaaccaatg aatgcccatt ctcacttaat ggcaatgact 541 ttaagtacga gtgaagtgat tccgattgtg gatggaaaat tagctttggg aacttatcaa 601 tctgttttat tttttgagtt agatggacca cgcaagagga ctgtcttttg tcaaatctct 661 ggggagtcgt aagtaatgag aacgaaaatg acaaacaatc atgcaattgt catcggtggt 721 agtatggctg ggttggtagt cgctcgtatt ttgagcgatc gctttgaaca agtcacgctt 781 atagaacgag accaatttcc ttgtggcgca attgcgcgca agggaattcc tcagtctcgc 841 caccttcatg tactattgca gcaaggtcag ctcatacttg aacgattttt ccctggactt 901 ggcgaggaaa tgattgctgc tggtgctcat ctcatagacg tgacagcaga tatgatgtgg 961 ctaacccctg ctggatgggg agttcgcttt ccctccaatg tttctatgct gggattcagt 1021 cgcgatttac tagattggat tattcgtcgc cgcctagcta ccatcaacaa catccgcttt 1081 gtggaagggt gtgatgttat gggacttcta tccaatactg atggaacttg tgtagctggt 1141 gtatcccttc gttcaagaac ctctgaagaa gagcagttgc atgccgattt ggtcgttgat 1201 gcaagtggac gcagctccaa aagtcctcaa tggctgaaag ctttgggtta tcaaccccca 1261 caggaaaccg tactcaatgc ctttttaggt tacaccagtc gcctttaccg actccccact 1321 gattttcaga gtgattggaa agtagtgtgt ttgcaagcag cacctccaac acgcacacgc 1381 gctgctgcat ttatgccatt ggaagaaaat cgttggatac taacggttta cggaggcgat 1441 agcgattatc cacctactga tgaagctggc ttattagaat 
ttgttcgtag tatgccttgc 1501 tcttcaattt ataacgcaat caaaaatgca gaaccactct ctcaaatcta cagttatcgt 1561 ggcactgaaa atcgctggcg tcactacgaa cgactacccc gttatctcga agggttcttg 1621 gttttaggtg atgcagcctg tgcttttaat cctgtctatg gacaagggat gacaattgca 1681 gttttgggtg cttcgacttt ggatgagtgt ctgcatcagc aacggcaata ccaacctaac 1741 ggcgatttta caggtctagc aaggcgcttc cagaaaaaac tcgctaaaat caatgccgta 1801 ccctggctat tagcaacaag tgaggactat cgttatcgag gaaccgaggg taaaccaccc 1861 agtctgctta cccaactcat gcaccgatac atggatagag ttgtgcagtt gacgacaaat 1921 catgctgatg tacgcttagc actactggaa gtgatgcata tggtcaaacc gcctgcgaca 1981 ctgtttcaac cgagaatagt tattccagtc ctcaagcaat tgttcaagct caattgagcg 2041 tttttggcaa gaatctcacg cagtgagcaa gcaggataga gactgcatat cctaacaaaa 2101 ttggctccta catctagatt agaacttaac gcattctaca aatagctttt gtagatgcag 2161 ggagatgatg atgcttaagt ccttttctgc agggttaata ttattattga ttgttattct 2221 gcccgctatg gcaattggat acgcaatttc ctaccgcaca atcgggcatc aagccgctca 2281 aaggcgtttg ggtcgaactg ctttgttgat actgcttgct ttcttcgttg gcttcgggct 2341 taattcactc ggcaaatatg gatcgataag cattaatttt ctatttgcag cttttgtggt 2401 tttatggctc ctaagctgga actggcgaaa gagaaaggct ggtgctttac tacttgatgt 2461 aggagggttt tcacgaagca aattaatgct ttgggctggt gtattggaag gactgttcgc 2521 agtcttctac acttggtcgg cgattaataa aatctcaaca gggcttgaga gtgacagcaa 2581 tctggtggaa gttctagcgc gaccggtttt cctttggtcg ttagccatct atttcctctc 2641 aacgggattg agtagactgg agtttcgaga aaacggcatc tgctatatgc tttcagtagt 2701 gaagtgggag aagctgacat cctacaggtg gaacccagac aaacctaaca ttatgacgat 2761 tgagttcaaa caaccaccac ttctcctatc aacaggattg tggagtttgc ggattccatc 2821 agcacatcga gatacagtgg agcagattct ggctgaacat gtaaacaatt aatgatattc 2881 ctgagaacaa cagaaatata acttaagata gatgcaaaga cagcagttta ctgggtaaga 2941 atctcacaag cgcgagaaga caatcatgag tatcaccatt agcggttaca acctcatcga 3001 ggtcatttat gacggtgcca ctacgtgtgt ttatcgtgct ttgagggaaa cagagcaaac 3061 ctcggtgatt atcaaaactc ttaaagctga gtatcccact atagaacagc ttacccgatt 3121 aagacatgaa tataaaatac tccaagcttt agagatagaa ggaattatta 
aaccgttagc 3181 tttagaaagc tatcaaaatg gtctggcgct catcttgtca gattttgggg gagaacccct 3241 gaaaaattta attaatgctc aaaagttcaa tttcagctta aatttaagca attgtttgca 3301 aattgcaatt caattatctt caacactggc tcagctacat caaaacaata ttattcataa 3361 agatattaaa ccccataata tcctgataaa cgcgaaaaca ggtcaagttg aaattataga 3421 ctttagcatt tcatcgcgtt tatcaagtga gaatcaaaca gccaataatc ccaatttgct 3481 agaaggcacc cttgcctata tgtcaccaga acaaactggg aggatgaatc gcttaattga 3541 ctaccgaagt gatttctact ccttaggtgt caccttctat gaaatgctta caggacagtt 3601 accttttcaa gctgctgacc ctttggaatt ggttcattgt catattgcta gaacagcagt 3661 gtcaccaaaa gaactcaatc cagagattcc gcaagcggtt tctgatattg tcatgaaatt 3721 gttggcaaaa actgctgaag aaagatatca aaatgctttg ggattaaaag cggacttaga 3781 agagtgtcta agaaaactgc aagcgactgg aaaagtagaa gatttcgttg tcggtcaact 3841 tgatttatat agtcaattta tcattcccca aaaactttat ggtcgcgaaa aagaagtcgc 3901 taccctaatg gatgcctttg agcgagtggc aaacccccct gaatcccttc ttaccaaggg 3961 gagccagcgc cttgcggggg ttccccccgt tgtggcgact ggcgtgggac ataggggggt 4021 agaaatgatg ttagtcagtg gttactcagg tattggcaag tcttccttag tcaatgaaat 4081 tcataaaccc atcgtccgcc aacggggtta ctttatttct ggtaagtttg accaatttaa 4141 gcgaaatatt ccttatgctt ccttgattca ggcatttcag gaattaatca ggcagttact 4201 aacagaaagt gctgacaaga tagccgtttg gaaagcaaaa cttttagaag ccttcggttc 4261 taacggtcaa gtgattactg atgttattcc cgaagttgaa agaattgttg gtgttcagcc 4321 agatgttcct caattagggc caactgaatc ccaaaaccga tttaatcggt tgtttcaaca 4381 attcattcat gtatttacca aactcgaaca cccattggtt ctcttcttgg atgacttaca 4441 gtgggcagat ttagcttcct tgaagttgat tcagttgctt gcttgtgatc caaatagtca 4501 atatttgcta ctgattggag cgtatcggga taacgaagtt agtgcaactc atccgttgat 4561 gttgactcta gaagaaattc aacaaaaagg tgcagttgtt aacaatattg tactccagcc 4621 tttgcagatc actcatgtca atcaattaat cagtgatacc tttcgctgtg acacgacaaa 4681 gacaatgtcg ctggctgagt tagtgtttaa caaaactcag ggaaatcctt tcttcttaac 4741 tcagttactc aaatctttat ataatgacaa tctgttgtct tttaatttca cccccctccc 4801 ttaccaaggg gagccagcgc cttgctctgg tcaagagagt tgtggcgact ggcgtgggac 
4861 agaggggggt tggcagtggg atattaagct actaaaagac attgatatca ctgataatgt 4921 cgttgaattg atgattaatc agattcataa gctatcagta aacacacaga atatcttaaa 4981 gttagctgcc tgtattggag ataaatttac cttagatgtt ctgggtattg ttaatcaaaa 5041 atctttgtct gaaacagcag cagatttgtg ggaatctttg cagacgggtc tagttttacc 5101 cttagaccaa tcttacaaaa ttcctttagt tattagtagt cagcaaaatg aacaactaac 5161 aaatgactta gaggagcaag cgacttcccg taccgtagaa gaactgacaa ttgcctacaa 5221 gtttctacat gaccgagtac agcaagcagc ttatgctctc attccagatt cgcaaaaaaa 5281 agaaactcat ctcaaaattg gtcaattatt actacaaaat attacgccgg aagaacgaaa 5341 agaaaatatc tttgctttgg tcaaccaact aaattatggt accgatttac tcaccttaga 5401 gtcagaaaaa tatgagctgg ctgaacttaa tcttatagca ggtcagaaag cgaaagcagc 5461 ggcggcgtat gaatctgcta tgcgctatct aaaggtgggt ttggaattat tagcagtaaa 5521 tagttggcag aatcagtatg agctaacatt ggcactttat gagtcagggg tagaaacagc 5581 gtacctgaat ggcgattttg agcagatgga gaaatgggca acagttgtct tgcagcaggc 5641 aaaaacccct attgacaaaa tgaaagttta tgaggtaaaa atccaagcct gcatggcgca 5701 agtcaaacaa ctcgaagcga tcaagattgg gttacaagca ttggaactac tgggggtaag 5761 cttcccagag tcgcctagcg cctcggatat cgaggaaaca ctgactcaaa cagcaagaaa 5821 tttgagcggg agaaatatcg aagacctgat taacctacca ttaatgacgg aggtcgataa 5881 gctagcagct gtacggatgt tagcatgcct aggttctccg acttatcaag ctggacctgc 5941 cttgttgccg ttaattgcgt gcgaacagct gaatttgtca atcaaacatg gaaattcacc 6001 cttctcagct tatagttatg ttctttacag catcatgata aacggtttat ttcaggatat 6061 tgagtcggct tatcaatttg gtaagttggc tttaagtctt gtagaaaaat tcaatgctgt 6121 agaactcaag acaagtgtct ttttcgtggc aggttcatct gcgtttcatg gaaaagttca 6181 tgccaaagaa acgttgctac ttttgcagga ttcatactct agtggattgg agaacggaca 6241 ttttgaatat ggtggctatg ccgctatgca aaaatgttac tattcatatc tcatcggtca 6301 agaactggca aaagttgaac gagaaatggc agcaaccagt aatgttcttg ctcaactcaa 6361 gcaagagaat gctttgagtt ggaatcaaat atttcagcag tcaattctta atttgctaga 6421 accttttgaa aaaccgtgct gtttattggg tgaagcctac aacgaggaga aatctttacc 6481 actgcttaaa gaagcaaatg acagaactgg acttcactac ttctattcaa acaaactgat 6541 
actctgttat ttatttggag agcacgatca agcattggaa aatgcagttc aagccgaaca 6601 gtatttagat ggggtaaaag gattcttgat tgtgcctgtg tttcattttt acgattcttt 6661 agcacaacta gcaatttatc cattagtacc acactcgcaa caagaacatc tcttgagtag 6721 agtgatcaag aaccaggaaa agatgcgaaa atgggcagac catgccccaa tgaattttct 6781 gcataagtat gacttggtag aggcagagaa agcgcgggta ttagggcaat attggcaagc 6841 aacagaatat tatgacagag ccattgctgg agcgaaagaa cagggatata tccaggaaga 6901 cgcgatcgca aatgaactcg cagccaagtt ttattttgag cgtggtagag aaaaggtggc 6961 tcagacctat ctcacagatg cttactatgg atatattcgc tggggagcaa cagcaaaagt 7021 tagaaatttg gcagcaagat atcctcacat tttctcccag acaccgaacc gacaaaccaa 7081 aggtctagag atgaatcaga caattagctc tacgactaca ggtactcccc ttctggattt 7141 agctgcagtc atgaaagcat ctcttgctct ttctggtgaa attgttttgg acaagttgct 7201 ggctaaattg atgcgaattg tgattgaaaa tgctggagca gaaacagctt ttttaatttt 7261 agaaaaagca ggacagttac tcatagaagc ctcaggaagt gtcgggcaag atgagataac 7321 ggtgcggcgc tcaacacctg tagaaactag tcagcagcta ccaatatctg tcattaatta 7381 tgtgcgaagc actcaaggac atgtcgtact gcacgatgct agctctgagc cagtctttgc 7441 aacagataac tatattatta attctaaacc aaaatctatt ttatgtacgc caattgtcaa 7501 tcaaggtaaa cttattggca ttctttattt agaaaataac ttgacaattg gagcatttac 7561 accagagcga ttggaagttt tacaactttt atcctctcaa gcagcaattt cgattgagaa 7621 tgcacgtctt tacaatgatt tagaggaata taatcgaaca ttggcagcga aagtcgaaga 7681 gcgaacgttg gagttacaag ataaaaattt gcaactccaa caggaaatca aagaacgcca 7741 gcgagcagaa gaaacagcca aaaccgccaa ccgtgccaag agcgaattct tagctaacat 7801 gagtcatgaa ctccgtaccc cgctcaatgg tattttaggt tacactcaaa tctttaagaa 7861 agataaagct ttaactgctc aacaaaagaa tggtattgat gttattcatc agtgtggtga 7921 acacctactg acactcatca acgatatttt agacctctcc aaaattgaag cacggaaaat 7981 ggaactttat ccaaaagaat ttcatcttcc ggaatttatt gagactattg ttgaaatttg 8041 ccgcatccgt gccgagcaaa agggaatttc gttaatttac aaaacgcttt ctcccctacc 8101 aagactgatt cgagcagatg aaaaacggtt gcgtcaggtt ttgattaatt tacttagcaa 8161 tgcggtgaaa tttacagaaa aaggtagtat aactttcaca gtgggctatc aggaggagaa 8221 acttcgtttt 
caagtagaag atacaggtat tggcattgca caggagcaat tagaagaaat 8281 atttttgcca ttccaacaag tgggtgacga gagtcgtaag actgaaggaa caggattggg 8341 attggcaatt agccgtcaat tagttcagat gatgggtagc gaactaaagg tgaagagtac 8401 tttaggtaaa ggcagcgttt tttggctcga tttggattta cctgaggttt tccaacaaag 8461 tgatgttaag agcatcgatg aagacaatat cattggtttt atagaaccca cacggaaagt 8521 tttggtagta gatgataaat gggcaaatcg ctctgtttta gtgaatctac tacagccatt 8581 ggggtttgaa gtcacagaag caacaaatgg tttagacgct cttgacaaag tgcgtgaatt 8641 taaaccagat gtgattttga tggacttagt catgagcgtg atggacggct ttgaagccac 8701 ccgtcgcctc aggatgttac cagacttcaa ggaagtgata gtcattgcta tctcagctag 8761 cgtttttgag tgggatcaaa aacaaagtcg agaagttggt tgcgatgatt ttctgcccaa 8821 accgatccaa aaagcggatc ttttagaaaa attacaagtg catttggggt tggaatggat 8881 ttatgagcag cccgaaaatc aagtcaaggc gcaaagcatt gcacctgcgc agactcaaga 8941 ctcacttgtc gtcgccccac cagcagaaga acttgcagtc ttgcttgatt tggcgatgag 9001 gggcgacttg agagggattg cacaacgagc tgccaaacta gaggagttgg atgaacaatg 9061 ggtaccgttt gctactcatt tacgtcaact agttaaaggt tttaaaggga aacaaatctt 9121 ggagtttatc acaaaatttt aataaattca taagaggttt tatgaatatt gaccctactc 9181 aaaaaggtgt catcttaatt gtcgatgaca ctcccactaa tttagaagtg ttgttcgatt 9241 ttttagccga ctctggattt acagttttgg ttgctgaaga tggtgagagt gcgattgcaa 9301 gagcagaata tgccccaccc gacctcatcc tgttagacat actcatgcca agaatggacg 9361 gttttgaaac ctgcagttgt ctgaaagcca atgaattaac aaaagatatt cccatcattt 9421 tcatgaccgc actttccgaa acagtggata aggtcaaagg attgaatctt ggtgcagtcg 9481 attacatcac taaaccactc cagcacgaag aagttttagc ccggatcgaa ctccatctga 9541 ggctgcggaa cttaactaaa acactccaag agcaaaatca gcaaatccgc gaacaagctg 9601 ctttgctcga tatcaccaca gatgccattc ttgttaaaga tttggacaac caaatccgtt 9661 tttggaacaa aggggctgaa catttatacg gatggaaggc aatagaagca attggtaaga 9721 atgtcaatca gcttttgtac ccagtggaaa ctcaatctca actccaaaac ctccaggaaa 9781 gtttggctct tagtggctca tggcagggtg agttacatca agtgaccaaa gaaggcaagg 9841 aaattattgt tgctagccgg tggactttga tgggcgagca agatgggcaa ccgaaatcga 9901 ttcttacagt caacaccgac 
attacagaga aaaaacaact cgaagcgcag tttcttcgtg 9961 cccagcgact ggaaagcatt ggcacacttg caggcggcat tgcccatgac ctgaacaata 10021 ttctgactcc aattctgaca gcagcgcaac tgttgcagct aaaacttcca aatattgatg 10081 agcggagtca gcaaatgttt accacaatag aaactaacac taaacgcgga gcagctttgg 10141 tcaagcaagt gctacagttt gcacgcggag tcgaaggcaa aaagcgcacg attgtgcaag 10201 tgaaccacct gttctcagaa atagagcaga ttgttcaaga aacatttccc aaatctattg 10261 aattttccac gaatataaag tcagaccttt gggcgatcgt tggcgatgcg acacacctgc 10321 accaggtgct catgaaccta gttgttaacg ctcgcgatgc catgcccgat ggcgggactt 10381 tgaagatttc tgccgaaaat gtgttcattg acgaacacta tgcccgcatg aatcttgagg 10441 caagtgtcgg ttcctacatt atgataagtg ttgccgatac gggaatcggt atgtcgccaa 10501 aaatcgtgga tagaatattc gagccgtttt tcaccactaa agagtttggc aaaggtacag 10561 ggctaggtct ttcaaccgtc aggggtatca tcacaagcca cggtggtttt gtgaacgtat 10621 ccagcaacgt tggcagagga actgaattta aagtgttctt gccagcagta gaagtaacag 10681 caacaccggt ggcagaaaac ctcgaattgc caaaaggcaa ccgagaattg gttctagttg 10741 tggatgacga atccccaatt ttagaaacta ccaaaatctc gttagaaagt tacaattacc 10801 aagtcttgac agccagcgat ggaatagagg cgctcgcgct gtatgctcag tacaaagatg 10861 acattagcgt ggtgttagtg gatatgatga tgccgtcgat ggacggtgca ctaaccattc 10921 gcgccttgca aaaaatgaat ccgcatgtct tgattattgg tgtcagcggt ttggtagccg 10981 gcgacaaact gatggagcag gcacgagtca aagcatttct ttctaagccc tatacgacaa 11041 aggaattatt gcaaacttta cacagtgttc tttttcagga tgtgagaaac actaagagtg 11101 caaaaaattg ggaaacatgg gacagagtga gaacaaaatc tatcatttat taccctgaaa 11161 gtaccccaaa taacgatcat ttttaggaaa gtatgttcac ctgacatctt atttaagttg 11221 agaagaacac agaacacaga actttgcacc taagaataat ttcacgaatg aacctgctta 11281 cttaagaaac tggcatattt ctggtatatt cttcgtataa atcactttgt gcaatgaagt 11341 tcaatcaggg ggcagattgc ttgctgtata aaaccttagt aaagagcctt ataatggttt 11401 tctttctgaa tgggtgtaag cagcaaaagt tttgcacgac taccgtaatt ttacggtaaa 11461 aatgaaattt taattattac aaaacaagtt gtcaggaatt acgtgtcacg gcattgtgtc 11521 aacaattgca tatacaaaca actcacgaac actgtgtgaa ataaaattca caaagcgtgg 11581 
aatcatctaa gataacaaac cgaaataact acaatggaaa gaattggtta tcttcagctt 11641 gcatcagcaa acgaggcatc aacaagaaat gagcatactc aagtcccgtt taatcttaat 11701 ctttttgctg agttgaattg gagaaaagtg tctagtagtg cagcgataca tcttctatct 11761 gtagcactaa ctttggcact cttgagtata gctgagcgtg cgctagcact ccagaaagta 11821 ggaagtagtg gacctcaaat ctcaaatatc caaaggtgtt tgagcaattt aggctattac 11881 aatggtccgg tgacaggtaa gtttgcttcc ttaactcaaa atgcggtgat tcgattccaa 11941 caggcaaata gactaccaac tgatggggtc gtgggtgcaa gaactcaaca attgctgcaa 12001 tctcaatgtc aaagtagaag acctggtgga agtgtcagta gtggtctgca accaggtagc 12061 agcggtcaag ccgtcactag attacaacag gatttagggc gtttaggtta ctttaatggt 12121 ccgataacag gtaactttgg ttcagaaact cagcaagcag tcatcaaatt ccagcaagca 12181 cgtggaattc gtcctgatgg tgttgtcggt gcgagaacag aagaagcaat acgtattgtt 12241 cttagcagaa ataatccgac agttggtgtc ggtggagata gcttacccaa tgctttgaat 12301 ttgggtgact caagtcctca ggtcagagaa ttacaacagg atttacagca gttgggctac 12361 tttagagtga atccaactga ctattttggt ccaacaaccc aggaagctgt agcacgtttt 12421 cagcaagata atcgaatagt acccagtggt attgctgact cacaaacctt gggagcaata 12481 acgattgctt tgagagagca aagttatgga caaaattctg aacaaagttc tgttgtacaa 12541 aattctggac aaagttatgg acaaagttat ggacaaagtt ctgttgtaca aagttctgtt 12601 gtacaaagtt atggttgttc aacagctact ggagatattt gtcaaggtga gagaagtcag 12661 cgagtgacag tcgtacaaca gcgtttgcag aatttgggat tttttagggg tgacactttt 12721 ggtttttatg gtccagcaac tagagatgct gtgattcaat ttcagcgata ctctggatta 12781 gaaacgacag gatctgtcaa ttttcaaact tggcaagcat tggggttgac caacaacggg 12841 aacaactcta cagaattgaa tactactaaa gagaatcgct atgttgtcat catcccgatt 12901 tctaggaacg agactttaaa tgaagtgcgt caatatatac cagaagcttt ccgcgctgaa 12961 tctagactcg gtccctatgt taatgctgga caatttagag aacgctcgca agcggaagat 13021 ttgtctaagt ggttacgctc acgtgggtta gatgcacgag tagaatactt ttaatagttt 13081 tgggcgatga tttcaaagat atcaaagtta tgactcataa ctctgatatc ttgattctgt 13141 aacttgattc ctcacttctt ctcgaaatct aaaaatcatg gcagaaactc gtattggaat 13201 tattggtggc agtggtttat acaaaatgga tgccctgaaa aatatcgaag 
aggtgcaagt 13261 ccagacacct tttggagcac catccgatgc tttgatccta ggaactttag aggatacacg 13321 agttgctttt ttagcgcgtc atggtcgcaa tcacacgcta ttgccttccg agttgccatt 13381 tcgcgccaat atttatgcaa tgaagcaact gggtgtggag tatcttattt ctgcttccgc 13441 cgtaggttct ttgaaagaag aagtcaaacc actggatatg gtggttcccg atcaatttat 13501 tgatagaacg aaaaatcggg tttcaacgtt tttcggtgag ggaattgttg ctcatatcgc 13561 ttttggtgat ccgatttgta aaaatttggc tggagttgtt gcagaggcga tcgccaaact 13621 caacttacca gatatcactg tacatcgtgg tggtacctat gtatgtatgg agggaccagc 13681 attttcgaca aaagcggaat caaatcttta tcgcagttgg ggtgcaaagg taattgggat 13741 gaccaatttg cctgaggcga agttagcacg ggaagccgag attgcctatg ctactctagc 13801 tttggtgact gattacgatt gttggcatcc agatcacgat agcgtgacag tggacatggt 13861 cattgctaat ttacagcgaa atgcagtcaa cgctcaaaaa gtgattcaag aaacagtgcg 13921 gcgtttgagc gaaaatccac cctcaagtga tgcacattca gcgctaaagt ttgcgatttt 13981 gacaaaccta gataaggcac ccgtagcaac taaagagaag ttggcgttgt tgttaaaaaa 14041 gtatatctag aaacgagagt atgagataaa cagcaaaact gcgcgtatgc ctgtggcatg 14101 ctgcgctttt gcccagcgct catctatagt gtctgattgt tcatcaggat tcctgtttta 14161 acggatactc tcatagtgtg tattatccgc aaataccgtt agctccagaa agcttttcta 14221 agcaaccttt tcgtcaattt acgtctcata gcagatatcg gttaatgcct aatggcagaa 14281 ttttccattg aaaccaggtg ttactatttt ttcagtgaag gtagaacaaa gtgcatcagt 14341 taccaatcat ctacagttat caaaaaaacc tgtctgccat cacaaaataa agctgacgtg 14401 tatcacagct aagttatatt cccgagatat aggggataat ataactgtga ggagaaaacg 14461 gaagcaactt gtatcagaaa ggggaaaata ttatgaaatc caagcttatc gcacttttaa 14521 ccttagtagc tcccctagtc ttagctagtt cagtgaatgc agcgaatccg cagcacgtaa 14581 agaagctact ttctactggg gaatgtgcag ggtgtgatct atcgaaggca aacctcagtg 14641 gtgcacactt aattggtgct gacttgagag atgcgaatct caaaggggca aacttgacaa 14701 aggcgaatct tgaaggtgct gatctgacag gtgctaattt agcaggcgct aacatgacgt 14761 cagctttagc gacgaatgtt gatttcaaga aagccaatct taatagagtg aatttcactc 14821 gtgctacgat tcacgactct aatgtgtatg gggcatcgat gaatgacctc aatattacca 14881 acgccgaaat atctaataca ggtataggta 
tcggtggtga agacgcagaa attcctgatt 14941 ggaaatagag cttaccatag gatcgattgt gggaagtaga gaatagggtt cccaacgctc 15001 gttactagcc tcagcctggt aacgagaaca caaaggctca gcctcttgtt gagaatggtg 15061 agccagcgcg ttggggagcc agtacttgat gagggtttcc ctcacttggt atctggcgtt 15121 cgggtctccc gacttgaagc gtctggcgtg gtttccccca tgagcgactg gtgagacagc 15181 gctgcaggag ggtctccctc cgtaggcgac tgcgaacccg aagggcgaac ccggagggtg 15241 cactcatctg agctaaacct acttcccgaa agaaaccccg aaagaaacat gccgaaagaa 15301 acatgtttta tcaaatcgct gtaagaccgt aagacgcaac tttttagaat ttccaccaaa 15361 accctaaact aatggtggtg ataatgtcaa taagtataaa tcttgtaaaa tgtcctcctc 15421 aaagataaga gcttaagata ggctaaatat agggtaaaat cggcatttag acttgtataa 15481 atgaaaggtg cactgatttt tgcatgattt gtacttagac gagtaccatg attagcaact 15541 tacacacctt gggataacac tacatttgag agtgtagtcc ataaaaaaat tgacttggaa 15601 tgctacctct aagtcatgct atctaggtaa tagctactgc cctgttagtg gttttctacc 15661 ttgtagttga tgaaccaatc atggaagcgc aattatggtt ataggtaact agaacctcgt 15721 tgtaccccaa agtcagtagt gctagtgcac cggtgcatgg tctagtcaac aaccttttgg 15781 ttgtaactgt ccggattaaa gtaagtgaag tctgaaaatt atggaagccg aaatgcaaga 15841 accggaaatt gtggaaacta agtctccaga agcaacgatg gcaaatatca acaaccaaac 15901 aggaagcata acgaaactcc agccaaccgt gcagtctcaa gatcaatggc taaaatacgg 15961 agaacaagtt tctggctttt tagcgacact gcctgaatat ctgggaaact tctttaatag 16021 atacaaacag cccctggtta gcattggttt aattgtggca gcaattgtcg ctgtgaaggt 16081 agttttggcg atattggatg ctttgaatga cattcctttg gtatcaccta cctttgaatt 16141 aattggcatt ggttactcta catggtttat ttaccgctat ctcctcaaag cctcaactcg 16201 gcaagagtta actgatgaaa ttacaactct caaatcacaa gttgttggca agcaaattcc 16261 agaaagctaa agacctgcta aggaaattgc atatcgttgg ttttgaattg tctgcctgaa 16321 ggcatttgat gactgggtga atccagttct gcacgagggt ttgttgacaa aaggaactgg 16381 catacccttt gctctcagta tgcccatcag cgcaaagcgc acggctgcgg gttcaggaaa 16441 tggcgcagcc gtgtctatgg catagtcttt tccgcaggga aaataggaaa tcaatagact 16501 attgtcttgc aaaatcagcg tcttcagatg gaagtctgag gttgacaaag caaagcctgc 16561 gtaagcaggc 
ttttggttta tgagtctgtg tagtcagtaa attttggttc agcacttgta 16621 ttgcaaaact tatcaaggtg ctgtcacaat gctctgagat tcaattggtt gcaatcttcc 16681 cattcgcacc aaagcttgat gaaccatagc tgtaacctca gcacgggtag cttcttgatt 16741 tggtgccaaa acttgtggct ccgggttgtt taccacaaga cgattttctg tagccgctgc 16801 taccttatca atggcgtatt taggaatatc tttggcatct ttatagatgc ttaaaacttt 16861 ttcaggggaa gaacgtactt tgagatccaa cccactgaca agagcaacta aaacttgtac 16921 tcgcggaatt ttttggtctg gtttgaaact ttggtctgga tatcctttta aaaatccagt 16981 tgcaattgat cggttaattg ctggaattgc ccagaattct ggtggtatat ctttaaaatt 17041 tgttgaattt ccaccatctc ttttgtcaaa ggcttgttgt aaaatagcag caaattcagc 17101 acggtttaca ggctggttag gtctaaaaga gtaatcagga aagcctttga tcattccacg 17161 agaagagaga atatcaatga aacgccgtcc ccaaaaatca gcaggtacat ctgtaaaagc 17221 aattggtggg ggaattgttg gtaatttttc actaggagtg acgacacgag attgttgtgt 17281 actcgtggtg ggaggagtga atgatggctg ttctggtttt gaccacgctt cgtctggagg 17341 tacattcaaa gtttgaggaa acagatcgtc caagaaggtg tgtttatcag ttggggatgg 17401 agtgactgtt ggtttgacgc caggaaggat aaatggaatt gcttgttgtt cagggattac 17461 aggaatgatt ggagaagctg aggaagtcgg cgagggggat atcaacccag tgaaattcca 17521 attagaattc ttacgggata atgtccaaaa tagaatcatc ccgatagtca aaaaagcaac 17581 aagaataccg ataaattcat caaaaccaag ggtattgttt ggagatgact ccgaatctgg 17641 aggacgcata tttgtcatct tttagtgact aaaggacaag gtgcagcatt aaggttattg 17701 gtaataggtt acaattttca ggttacggat ttgttaaaac tcttataact attatctcag 17761 agtactcgca gcaagctagg aatactgtct actcctgcaa acgtacgaat ttctgctaac 17821 gcttgccgaa aatcgccttc tcgaacatcg tgagtgacaa caacaatttc tgcaagttct 17881 ccctgaaaac ctgtttggac aattgactcc aagctgactc cgtgattgcc aaaacaagtt 17941 cccaatttgc caatcactcc gggttgatct ttggtaagaa aacgagtata aaatcgggta 18001 ataagttctg ccattggcac aatttggcag tagtcttgat gtgcgcaggt tagcagcgga 18061 tttggtactg ctgtactagt ttgaagtgct gctactaggt tcaaaatatc tgatgttaca 18121 gcactagcag ttggtccggc acccgcacca ggaccaaaaa acatcacctg tcctatgggt 18181 tcaccttcaa caagaatagc gttattcacc ccgttaatac tagctagggg gtgtgctttc 
18241 gggactaaag tcggatggac tctgattgag atggggggag aggaggtaat tcttttggca 18301 atagcaagca atttgatcac aaatcctaat ttctcggcat aggcaatatc tgtcttgctg 18361 acttgccgaa tcccctcaca gtaaacatct tccaatttga tgcgtccacc aaaggctaat 18421 gatgcgagga tggcgatttt atctgcggcg tctaagccat cgacatcagc tgttgggtca 18481 gcttcagcat aacctaattg ctgggcatca gctaagacat cactgaagtt gctgccttcg 18541 gtttgcatcc gcgagaggat gtagttagtc gtgccattaa taatgccagt gatggtgtga 18601 atccggttga cgcttaaaga ttgctttaaa ggttgaatca ctggaatacc accacccaca 18661 gcggcttcca gcataacgta gacccctgct tgattagcag cgctaaagat ttcatcacca 18721 aaacgggaaa ttgcggcttt gttagcagtc accacatgct tgccattttg aatggctttg 18781 aggatcagcg atcgcgctgg ttctagtccc cccattacct cgacgacaat atctacctct 18841 gggtcgatga caattgcttc taagtctgtt gttaacaccg tttgtggtag ggtgactgcg 18901 cggggtttgt cgagcgatcg cactcccaca cgataaattt ctacttcttg caacaacggg 18961 tgacgaaaac cgctattttg taacaattgc actgtacccg ttcctacagt gcctaatccc 19021 aatattccta gttttacacc cacaagtctt acaccaattt tagattttag attattttgt 19081 caaaaaaata acaattgtag ggtgagcaaa gcccacccta caattagttt tcttatttag 19141 tttatagctt tttgtcaaaa gcctaatgag tcataactca tgactcatga ctaaattagt 19201 atgtttctac gtgccagcga ccagcttttt tcaggtcttt ttggtaacta ctccaagtca 19261 caccttcttt cgcagctgct gcactcagcg ccgcatcaat accatcttcc ataccccgta 19321 aaccacagat gtatgtgtgg gttttctcat ctttaatcaa attccaaagt tcatctgcat 19381 gttctgcgac gcggtcttgg atatacattc tgccaccttc ggggtttttc tgttcccggc 19441 tgatggcata agtgaggcgg aaattctcgg gatacttttg ttgtatttct tccaattctt 19501 ccttgtagag gatgttagga gttgtaggta caccaaatat caaccacgca aatcccttga 19561 attggtattc tgggttagca gctctttcgt tgtctttgaa catgcgccac aggtaggcac 19621 gcatgggggc gataccggtt cctgttccca tcataataac tttggcttcg gggtcgctgg 19681 gtaacaacat ttctttaccc acaggacctg tgattttcac atcgtcccct ggtttgagga 19741 aacacaggtg tgtagaacag acaccgtaga ttgtttcacc gctttctggg tgcttgtact 19801 ccaactggcg gacgcacaga gagactgttt tgtcatccac atcatcgcca tgacgggttg 19861 aggcgatcga gtatagtctg attttttctg gcttgccgtt 
cttatccact ccaggtggga 19921 taataccgat actttgacct tctatatagc gcagatcacc agcggaaatg tcgaatttga 19981 ggtgctgaac aataccaata ccgtcttctt ttactaacgc ttcattggat atgcacttac 20041 caacaaaggg agcgttcgga cggtaaatgt tgacaggaac gtcagcatgt gattcttttt 20101 tcgcttttgc ttgagtcatg gtgttgcctt gtttgtcctt ttgtttgggc tgttgctcag 20161 ctacaggtgt ggctttacca tttgcttgac tgctagcagt ctcggcttca ctattagcat 20221 ctactgtaga ggtttttcca ttgagatgct ctaaagcatt caaaggctgg atgctaacaa 20281 ttttgccacc taggcgagtg atccgccgtg tctcctgatt catgcggttg taaggcactc 20341 tgatgaatac actgccacta ttacgaattg ggtagtttgt tttatcagtt tcttcgctct 20401 gacctagacc caccacctca tataaaaaaa cacggctacc tgattctgtg ttggcagcac 20461 cttcaacagc acctttaatg tacattcctt ctaccactcc gatgtctact taacctttaa 20521 ttgaaaaatc cttcccagca ctatgcgcca gctttagttt acatgatttt ggcttgacaa 20581 aactgacgct ttttgggcga tcggcatcac tagctaatgc gatgctcggc acactcacca 20641 aagacacgca ccgaatggca aacacacaca cctgttcacc ttctaaggta aaggataagt 20701 cttttggata atgttaataa tgaatcaatt taatcttttt ttttggaagc accaccaccc 20761 atagagggag ataacttctt gaaaggaaat accagcaagg cttgtatgga gtacttcctt 20821 ctgttacaat ccaatataac actaggatta aatacttacc agcaacaaat tcagactact 20881 cagataggta tttactggca ctgtcttgta tacccgtcta gtccgttatg gcggtgctgg 20941 gggcgaaaac gcatattgac gccattggag taaactcctg gctataaaat gctgtaagaa 21001 aaataagtga ttgactcatc tttagtatct agaggagatt tatgactaag ccggaacgcg 21061 tggtattgat tggagtagct ggagactccg gatgcggtaa gtctactttt ttgcgtcgcc 21121 tgatagattt attcggtgaa gagttaatga cagtcatctg cttagatgac tatcactctt 21181 tagaccgtaa gcagcgtaaa gaaacgggaa taactgcact tgacccaaga gcaaacaatt 21241 ttgacctgat gtatgagcaa atcaaagcgc tcaaagaagg tcaagcaatt gataagccga 21301 tttacaacca cgagaccggc aatattgacc caccagaaag aatagagcca aatcacattc 21361 tcgtggttga ggggttacat cccttatatg atgagcgggt acgcgaactc attgacttca 21421 gtgtctattt tgacatcagc gatgaggtca aaattgcttg gaaaatccag cgagacatgg 21481 cagagcgtgg tcaccgctat gaagatgttt tagctcaaat caattcgcgt aaacctgatt 21541 ttacaaaata cattgaaccg 
caaagagaat ttgctgatgt cgttctccag gtattgccta 21601 caaatttaat caaagacgac aaagagcgca aagtcctgcg ggtacgtatg ctccaacggg 21661 aaggaaaaga aggctttgac ccagtctacc tctttgatga aggatcatca attcagtgga 21721 ctccctgcgg acgtaaactg acgtgttctt atcctggtat gcaactgtac tacggctcag 21781 atgtgtacta cggtcgttac gtctctgtgt tagaggtaga tggtcaattt gacaatctcg 21841 aagaggtcat ttatatagaa acacatctaa gcaaaacatc taccaagtac aaaggtgaga 21901 tgactcactt gttactgcaa caccgcgagt acccaggttc caacaatggt actggtttat 21961 tccaagtgct gacaggtttg aaaatgcgtg ctgcttacga gcggttaaca tctaaggaag 22021 caaaactagc agctaaagtt tagttaaagt agatgttttt gtgtctaggc tgttgggggt 22081 gctcacgggc gtcccccccg ttcttttata gttttcttag tcttctgttc cctgtttgtt 22141 cttccggaaa gaacagtaac caagtagagt aacttttttt tatctttaaa ataatcagtc 22201 ttaggttata tttttgaaaa agtaacataa cttacgaatt ttcattgtaa aaatgttgtt 22261 atgattgcag attaaagtgg ctgaactaaa cgcttgcgtt tagtcaatca gaaaaatgtt 22321 ctatggagga actacacttg tctcatcgtt atctatttac ctctgagtcc gtgactgagg 22381 ggcatccaga taaaatctgc gatcagattt ctgatacaat tctagacacc ttactctcac 22441 aagaccccag tagtcgtgtg gcggctgaag ttgtcgttaa cactggtttg gtactgatca 22501 ctggtgaaat tacgactaaa gccaatgtga attatgtgaa tctagctcgc aaaaaaattg 22561 ctgagatagg ttataccaac gctgaaaacg ggttttgtgc caacagctgc tcagtgattg 22621 tcgcattgga cgaacaatca cctgatattg ctcaaggtgt taacactgct catgaaaccc 22681 gcgagcagaa tagtgaagaa ctattcgact ctgttggtgc aggtgaccaa ggtatcatgt 22741 ttggcttcgc ttgtaacgaa acaccagaac tgatgccctt acccattagc cttgcccacc 22801 gcattgctcg ccgactggct gcagtccgta aaacaggtga tttgccatac ctgcgtcctg 22861 atgggaaaac gcaagtcacc atagcatatg aagacggacg tcctgtagga attgatacca 22921 tcctgatttc cacccagcat acggctacca ttggtgacat caccgatgaa gcagcagtgc 22981 aagccaagat taaagaagat ctgtggtcag cggtggtcga acctgtattc tccgacatta 23041 atattaagcc ggatgaggcg actcgctttt tggtcaaccc gacaggtaaa tttgtcgtcg 23101 gtggtcctca gggagattct ggtctaaccg gacgtaaaat cattgttgat acctacggcg 23161 gttactcgcg acatggtggt ggagcttttt ctggtaaaga ccccaccaag gtagaccgtt 23221 
ctgcagctta cgctagtcgc tatgtggcga aaaatattgt cgccgctggc ttggcagata 23281 aatgtgaagt ccaactcagt tacgccattg gtgtagcgcg accagtaagc gtgatgttgg 23341 ataccttcgg cacgggtaaa attgatgacc aaatcttgct ggaattgatc aaaacgcatt 23401 ttgaactacg cccagcaggc attattcatg ccttcaattt acgcaacctg cccaaagaac 23461 gaggcggacg tttttatcag gacgtcgcgg cttacggtca tcttgggcgc aacgatttag 23521 acttaccttg ggagcgcacc gataaagcag aattgttgca gcaagcagca aagaatttcc 23581 tatcggcagc gattgtataa atacaatcag gagttggagg ctaaattagt tagcttctag 23641 ccctcttggt ttgcccagcc cccgtagtgg gggctatttt atgaacgatt tctgagtaga 23701 cacttaggta gtcagtggtg gatggggctt ttcttcggtg cctaccacag agaaaagttt 23761 tttagactca actcttttca tgacaggaac tatgatgata ttctacccta ccctgagtac 23821 aaatagacta tacagtttga ttctaaaacg ttgctaacag cctaattatg gcgatttttg 23881 tttgcagcaa ataaacttaa cttagaaaga agaaatgata aatatgcctt cttggcaacc 23941 aactgtaagc ttaaataagg gtgctttata aaaacttcat aacttcggct ttaaaactta 24001 gcctttattt cacactatca agtacattct ttagttaaaa acgcagtaat cctgttaaaa 24061 aagaagatga taatgtaaga agagttaata gaaaattgca ccaaatctta gctttgaaaa 24121 aaatacggtg atttttgcgg gagggtaagg agatttgagg gatgcaaact caagagccaa 24181 ccgctgttga cagcaatctg gaaataacac aagtgtcagc ccaagagccc tctactgacg 24241 aactcccgac tatagaattt ccctcgccag ggaaattcaa agctagttct tggcgtattc 24301 atcagaaaat aggctacggc tactttctgg caattgggat tggttttcta ggctcgttga 24361 ctgggttagt catatccaat ttctatttag ggatagaaac taaacaatta gagcatgctc 24421 agtttcaaac acaagtactg ggtggttata ggaatgcttt agtaaatgca caattgcatg 24481 gctctaactt ggttgctgtt atacgagatc cgcaacggcg ttctagcaag agagcggaat 24541 tgttaagcag tgtaaaagac gctaagaacg ttgagcaacg aattacgaaa tttatagata 24601 gcaagcctgg aaaattagca gtaccggaag atactctacg aactatatta aatgactact 24661 cccgtaacct agaatcttac gttagccaaa ttgattcaat cttgcagcag tttgaccagc 24721 agccaaaaca acaacagcag atttctgtac tcagaaacga gttgctggaa ataatgagcg 24781 gtggaacagc catgcggcta gatgaaatac gtaaacggtt aaccaacaac ttgcaaattg 24841 ctcaacagcg agagcagaaa tttacaacag atctggcgat cgcaaaagga 
gtcgagagat 24901 taattgctgt agcgagtatg ctgctgtcag tggcgatcgc agcaattgta gcatggcgta 24961 cttcgcgggc gatcgccgaa ccagtcattc ttgtcactca agtcgcccaa caagtcgcga 25021 aaaaacataa ctttgatttg cgagctccta tcagtacaga agatgaaatt ggatcactcg 25081 ctaaatctct caatcgtttg attgagcggg tttccgagcg aactaaagaa ttgcagcaag 25141 ccaaagaatt agcagaagct gctaacaaaa caaaaggtca gtttctggca aatgtgagtc 25201 acgagttacg cacaccgtta aatgcgatta ttggcttaag tcaactgctg cgcgatgatg 25261 cggttgattt tggaatgtca gaagagttta ttgacgatct tgaatctatc aacactgcag 25321 gtaggcattt actgatatta atcaacgata tcctcgacct atcaaaaatt gaagacggga 25381 aaatgacttt ctacccagag acatttgaac tcgccccgct catcaataac gttgttctca 25441 cagtcaagcc tttggtggag aaaaatggca atcttttaga agtgaatttt gatggggaac 25501 ttggtatcat gtacacggat caaactaagc tacgacaggt tctgtacaat ctcctgagca 25561 acgctgccaa gtttaccacc aacggtaggg tgagattcat catccacaag gaaatactaa 25621 acgttcaaac aagtcacact cctggaatga tcacgtttac tgttgaagac acaggcattg 25681 gtatgtctta tcatcaacag cagcagctgt ttcaacgctt tacgcaagga gatgcttcca 25741 ccacgaaaaa atatggtggt actgggctgg gattagcaat tagccgtcac ttttgccaga 25801 tgatgggtgg tgaaattttt gtcacgagcg aacctggagt tggatcaatt ttcaccgttc 25861 ggcttccact gatagctaaa gattagtttt ctgtaataag tcgatttcaa tttgtagaat 25921 aaaatggttt attctttgaa ttaatgagct gtaaactcag tcgatcctgc ctgcgctcaa 25981 attcaaaagt caaaacgaat ttgaaatttt tgaactggag atgcataaga cagtgctcga 26041 ctagggttcc cgggcttgta gcacctaccg tgcagataaa cgcggatagc tgttttgtca 26101 gtgtgtatcc gcttgtatca ctggttgcaa tgcgtgaatt ctccgcgtgg ggtatgggga 26161 ttgataattc agttttgggg agcaaagact gatccatgat gatagtcgcc atgattcgtc 26221 caggaatgtt acttcaagga cgctaccgag ttatctgtca aattggtggc ggtggttttg 26281 gtaaagtgtt tgaagtggat gatggcggta gctccaaagt tttaaaagtt ttgagcttag 26341 agcgctttca taatccaaca attaaacaaa aggcgatcgc cctatttcaa cgagaagctg 26401 aattactagg tcgcctcaag caccctggta tcccgcgtat agagtcagat gggtatttta 26461 cttggtctga tggtaatggt gaaccgttgc actgtctagt gatggagaaa atacccggtt 26521 caaatttgca acagtggttg caagctagag 
gaaatcaacc actcaccaca caacaagctc 26581 acgaatggtt aatcgaatta gtaggaatac ttaacgaact acatcagcat caataccttc 26641 accgagatat taaactatct aacatcatgc taagacctga tgggcaactg gtgttgattg 26701 actttggtgc tgttagggaa gtcacaaact cttacttaga aaaacaagaa ggcaaacaaa 26761 ctggaactgt actcgtttct cctggattta cccctccaga acaagctgaa ggtcacgctg 26821 tcccacagtc agattttttt gctttggggc gaacttttgt ctgtttgctc acaggtaaat 26881 ctcctcttga ttttcttaaa aactcggcaa caggagaact gatttggcgt gaccaagcaa 26941 cgagtgtttc gccacaatta gcaaatttaa ttgaccgcct catagctcct tttcctggac 27001 agcgacctca aaattgccag gaaatcttaa attatcttca aggcaatcaa caccagtttc 27061 cgctatctca agatggaaca gataccttac caccagatac agggtttgga gtcagtactt 27121 taggattaaa gagctttatc aaaaatacat ttactcttgt atctaagtta gacaaaccaa 27181 gaatagatca acacaataac atacaagata aaagtttaca gagccaacaa gataaaagtt 27241 tacagagcca aagtttacgg agccggagtt tgaaattttt gttagctggc tctttattta 27301 tcggaggagg tttcaagttg tggagtctgg caccagaaat agcaggaaaa ctcaataatg 27361 tgggtttatc agaatacaac cagaaaaatt ttacaaaagc taaactattc tatcaaacat 27421 ctctcatttt tcagtcaaat ctgccacaac ctcaatataa tctgggcttg ctgtatgaag 27481 accaaaataa gcttgagcaa gctcgcactg cttaccaaat agctgctctg aagggtttcg 27541 atagagctta taacaacttg ggacgattgt acattttgga gaaacaatat gatttaacag 27601 tacctgtact acacgaaggt ttacaacgta ctgaagataa caaaattaaa tacgcaatgc 27661 tgaaaaactt gggctgggcg tatttagaac tggggaatta ccaagaagct gagaactctc 27721 ttcaaagggc aatcaagatt aatggcgatc gcgctgctgc atattgcctt caagccaaag 27781 tttttgaaaa gcaaaataat aggcaagcag cactcttggc tttgaaaaat tgtatcacat 27841 ctggaaaacc agaaacgaaa gaagaaaaac agtggatctc tgaagcagca aaaaaaatta 27901 aggagatatc aaagttaccc tccacttcta aaaccccaag gactcaataa tgaagttcag 27961 tgcactacta aaaaactgga tagtaaagag taaaaaagcg aaaatagccg tttgtttttt 28021 tgctgtttgc ttattgcttt tgatactagt cacaccaaat ttaccaccag cacttaccac 28081 tcaagttagt caaataccca ttgctgagtg cacaaccgtt caaagtggtg atcctcgcag 28141 tcccactaat cctgatattc cctacattat cagtccccgc cgaacgttgc tactgactga 28201 caagccgaag 
ctacgctgga atcaagtttt gggtgtcaaa agctatgatg tcagtttaca 28261 aaaaggtgat tcggttgttt ggcaaacaaa agtgaatact aatcaagtcg tgtatccagg 28321 agaaccaaga ctagaaacag gagtggaata tttgctgatt gttaaagcag acaacggtaa 28381 attgtcaaca gatgaaaaac caaatgcgcg gggttttagc cttttgtcca aggacgaagc 28441 accagttgtc aaagcaagca tgagccaact gaacaatcaa aaagttcctg acaaagtcac 28501 agagctacaa cgtgcttttt attacattgg agcagattta aaatcggagg cgatagagac 28561 actagaatcc ttgataagta gtggtattaa agaaacttcc gtctatcgta agcttggcaa 28621 tctctactgg gaaacaggag ttaatgtgca aactgagatt aattatttaa atgcacataa 28681 attagcaaga gctaataagg atatcataca gcaagcgcag attgcggaag ctttgggaga 28741 tttgtatata gcaatagatg accaaaaaac agccagaaat tggtttgctc aagcacgaga 28801 tagttacaaa actttaggta ataagcaaag agtagaagaa ctagatgagc agataaaaca 28861 actgaaaaca aacaaaccta ctacataagt tagaaatcaa gacactgatt agtagctaaa 28921 tctaattaag cataaaatat aattaggaac gcgagtgcgt gccctaagta tacaagacaa 28981 gctgagtgta tacgcctaaa cgacttttca ttcgcgccct tggcgttccg tatgcgcaaa 29041 gcgcacgccg gcgtcccttg cgcgtttccc tttgggactt acgctcgtag cgtctgcgac 29101 aggagaagca aaggataggg aaatcattac aaagactcta tttacctttt tcaaggttga 29161 cttacttagc atccggaatc gtgtagtttc tgttggtata catatatgtc tttatgtgaa 29221 attaaatctg ttctataaat cacccaacaa caggagtatg acaaatgact gatcaaacat 29281 tacaccctaa tcaagataac cgtaaaacgg aagccaaaaa actactagaa caagctaatc 29341 aacagttagg agaagcttta aaatcttatc aacaagcact agatatctat caggacatgg 29401 gtgaagagac tgttgctact ctcatctcct atctgatttc tcagagcaag attctgactg 29461 ttactggcac agataagtct ggtagttcta aaataaactt aactgctgct agtactatat 29521 tgggtaatcg ttcaaaggta aaacgtccta gttctaggaa gtaaaggaaa acttagtaaa 29581 tcatgtatct caatgactga atgcgatttg aagaaagttt ctgacaggtg catcccaaga 29641 ttgttcaata aaccattttt aataactaag tgttgcagtt gcttattttc tcaataagca 29701 actcttgatt ttaaaagaga ttaaaacagt gaacagtgaa cagggaacag ggaacgctta 29761 acagggaacg cttaacaggg aacgcttaac agggaacgct taacagggaa cagtgaacag 29821 tgaacagtga acagagtcag cgatttcttg acccccctta acttggtaac tgataactgc 
    29881 ggctggtact tgataactga taactgataa ctgataac
//
LOCUS       NODE_954_length_29603_cov_5.249425 29603 bp    DNA     linear   BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 29603)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 29603)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..29603
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(<1..1063)
                     /locus_tag="DP116_08200"
     CDS             complement(<1..1063)
                     /locus_tag="DP116_08200"
                     /inference="COORDINATES: protein motif:HMM:PF12770.5"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="sensor protein Chase2" /protein_id="PRJNA477356:DP116_08200" /translation="MEIWVNLEFGDGNLERGFGNLNVEVTVANAQRNITQLEVQLPPN AEIPISYQRWKEQYYSLLKHSRGGFKNNQVTHISKTDCYDSAQHLCRHLHQWLYPIQS ELKQALSQDFQPEIRLIINTQKIGSQTTKDILHRLPWQEWDFLAQNFSCEAAVCFHSS VVSTTASEKSSTSNKIRRPRIISIFGDSQNIDTSADKELLQKLQKRAAELIVLTEPNR SDFNALWEEACDILFFAGHSETKGDGQTGIININRNDSLSLSEIKRTLKAAINKGLKL AILNSCDGLGLARQLADLNLPYIIVWREEVPDKLAQKFLKYFLNSFAEGQSLFTAVGE ARDKLKELADDTDIAKQLPG" gene complement(1408..1662) /locus_tag="DP116_08205" CDS complement(1408..1662) /locus_tag="DP116_08205" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08205" /translation="MKSQYVIASLIVGIITVSANYADTHNLLLLQNTPRTVKTIKQNQ IQARIPPNRGVPTRRESGGARSIKQAEFKRIQQQLIAADK" gene 2518..3750 /locus_tag="DP116_08210" CDS 2518..3750 /locus_tag="DP116_08210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870492.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="VCBS repeat-containing protein" /protein_id="PRJNA477356:DP116_08210" /translation="MSESRNTSSSSTLEATGSRAFNIQPTNSFENDSFLNTFSRSGSS QSFLDADQEFSDTALSLPNNLNPNPIFTSAAIFPDFNGDGKKDKLWRNPQTGETAIWL MDGTNVASQGALKTMSSDWDSKVADFNGDGKTDIIWRNIKTGDNTIWLMDGTTVASEA SLPNVPTDWTFSTGEFNGDGKTDLAWRNLKTGENAVWLMNGTQVLASSALEAVDPSWT GTIGDFNRDGKTDTLWRNKTTGENAVWLTNGNTVTKTALPTLGTDWQPSLADFNGDFQ TDILWRNNKTGENAVWLMDGSNVASQTALPTLGAGWTSSVGDFDGNGKTDLLWYNPQT GESKVWIMDGANVVSDTALPTQSAAWKTSISDVDGDGKTDIFLRNYETGENKIWKMNG STPTESALPTFAKEWYTF" gene 4104..5318 /locus_tag="DP116_08215" CDS 4104..5318 /locus_tag="DP116_08215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="VCBS repeat-containing protein" /protein_id="PRJNA477356:DP116_08215" /translation="MVASKTLDSSTGTSAFQSLPITDSFAGTDDLNSALSLGRSSESP LENSLQSTQVVTARENPNPSRLFSSTGIVADFNGDGKTDKFWRNSQTGETAVWLMDGS KPTGELLSNVDSSWDFAYADFNRDGKTDIFWRNKTTGENAIWLMDGTRIASAVSLEKV DPSWTASIADFNGDGRSDIFWRNAKTGENATWLMDGTTVTTAAFLPKTDSQWSYSIVD FDGNGKNDIFWRNQTTGENAIWFVNGTDTSAYSLSKVDPSWNYSLGDFNGDGRTDLLW HNTQTDENSVWLMNGIFINSGSLEKQNSSWKSSVGDFNGDGRTDIFWHNTQTGENTAW LMDGTTVTTAAFLPTTDAAWQPSIGDYDGDGKSDIFWRRYDTGENTIWQMNGTTVSAA STQTVPVEWSVF" gene complement(5394..6890) /locus_tag="DP116_08220" CDS complement(5394..6890) /locus_tag="DP116_08220" /EC_number="6.3.2.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2, 6-diaminopimelate ligase" /protein_id="PRJNA477356:DP116_08220" /translation="MKLRELLATVDDIIVQLPQHPAMDAEIKNLKTNSHACVAGDLFI GMPGTRVDGGDFWQSAIASGAVAAIISPQAAQKHPPTPSACVISSDDMTKACAQLAAT FHDYPGQKLKLVGVTGTNGKTTTTHLIEYFLHQANLATALMGTLYTRWHGFVQTAVHT TPFAVELQQQLATALDAGNEYGVMEVSSHALAQGRVMGCQFEVGVFSNLTQDHLDFHR DMEDYFTAKALLFSPQYLKGRAIINADDTYGKRLIASLKPEKVWSYSVNDSTADLWMS HLSYEPNGVSGTLHTPKGEVAFRSPLVGQYNLENLLAAVGAVLHLGLDLQLIASVIPE FPGVPGRMERVQILPDQDISVIVDYAHTPDSLENLLKATRAFIPGKMICVFGCGGDRD RTKRPKMGRIAAQLADVAVVTSDNPRTENPERILQDILEGVPETVKPIVIGDRATAIR TAILQAQPGDGVLLAGKGHEDYQILGTEKIHFDDREHAQEALEERMKK" gene 7174..7509 /locus_tag="DP116_08225" CDS 7174..7509 /locus_tag="DP116_08225" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08225" /translation="MLPNLKKKLSVNYYNFNEHFWSPQTVCSQSELPKSNKKSNRRKT LTGVLSLVCLAVWTPLLLGLIVPSVHKLKEKRSNFETSNFDGEDSVFKREHFLFYQAA SGRMRTPAF" gene 8186..8695 /locus_tag="DP116_08230" CDS 8186..8695 /locus_tag="DP116_08230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316874.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08230" /translation="MFKQILTGSYIFVFLLVSSLSAHAQAPKPTSPSSSPSPQIPTTP QTKVSPEELQKFANSLKQLRVIKQGAVQQMGDVINKSGLSQERFLEIYKSQQNPPEKL KRAMTSQEKQQYEKTVTSLKSIQEQADTNMQQVLQKEGLGLERFNQIQVAISQDPALQ QKVREMIKS" gene 9189..9794 /locus_tag="DP116_08235" CDS 9189..9794 /locus_tag="DP116_08235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_08235" /translation="MSSIIRLANEQDAEQVLEIYAPFCEHSPVSFEVQPPTLDEMQQR IAKVLEKLPWLVCEHDGKVLGYVYAAPHRDRTAYQWAVDVSVYIHESVRRSGIGRALY TSLLKILVLQGYYSAYAGVTLPNTASERLHELMGFQFIGIYQGVGYKCGAWHDVVWYE LSLQPRRPNPKPPININVLRNTLELEYALASGLLFLKLPVD" gene 9911..11527 /gene="mviN" /locus_tag="DP116_08240" CDS 9911..11527 /gene="mviN" /locus_tag="DP116_08240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864846.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="murein biosynthesis integral membrane protein MurJ" /protein_id="PRJNA477356:DP116_08240" /translation="MTQEQKTSRSFAGIAGIIAVATLISKFFGLVREQAIAAAFGAGA VVDAYNFAYMIPSFMLILLGGVNGPFHSAIVSVLAKRKQEEAAPLVETMTTLVGGLLL LVTIFLVIFAPNLIDLVAPGLDTVRNGAFIKENAIVQLRIMAPMAVLAGLIGIGFGTL NTANQYWLLSLSPLFSSIILIIGLGILALQLGSKISSPQYAILGGMVLAGGTLAGAFL QWLIQQIAQARAGLGTLRLRFNFKQPGVNEVLKIMAPATFSSGMLQINLYTDMYFASF IPSAPSGLRYSNILVQTPLGIISNIILLPLLPIFSRLAAPENWQELKLRIRQGLILTA FTMLPMGALMMALSDPIVRVVYERGAFTKGDSGLVSSILVASGLGMFVYLGRDVLVRV FYALGDGQTPFRVSMINIIFNALLDWILFKPFGAPGLVLATVGVNFISLLMLLWLLDR RLNGLPWRELGLPILGLTAGSMVAGGASYGTLEVLQKFLGENGLLIQLLEISIAGLVG IGVFGVIVAMMKLPEIEIFVSRLRERFLRR" gene complement(11543..14350) /locus_tag="DP116_08245" CDS complement(11543..14350) /locus_tag="DP116_08245" /inference="COORDINATES: protein motif:HMM:PF12849.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207495.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08245" /translation="MNSTLELISDYYHVGGSLKPNAPSYVMRQADQELYDALMAGQFC YVLNSRQMGKSSLRVQTMDRLRHSGVSCAAIDITRIGSEHLTPENWYGGIVSLLWQGF NLLSKVNQKKWLQEHEELSLVQLLDRFIEEVLLVYVPEKRIVIFVDEIDSIISLNFSI NDFFALIRACYNQRVDKPDYERLTFCLLGVATPSDLIRDKARTPFNIGKAIALDGFKF EEAQALEKGLVGKVSHPQEVLREILYWTGGQPFLTHKLCKLMSEGLTVENPLSVEQVV RSKIIENWESQDQPEHLKTIRDRLLTNKHLTQQMLGLYQSILRHKEIDADHSYEQMEL RLSGLVVKHNGKLKVFNPIYKRVFNQKWLNQELEKLRPYAEALSAWVESQYQDDSRLL RGQAFEEGWSWLVDKGLDRQNKFSIDEHRFLIASRVLDKRGTLAEADRQTINTAEELL AKVSNPMPVIRKVLRITKSEPVLAQKLFRLILNEQFPICQGEDEADWVERMVRERIID NWESQDQPEHLRKIRDELVQNKDAENLLQLYRQVLQSGKVVADDNPKQLILLRLGFVV NYQGKLEVGNHIYETIFNQSWVDNELGKLKFQKQWQRLKLIILSGILIAVSLGIYTGI DLLSSASRCPLQEVLLNTCIATFAKVKNVPEGTFFYGGSTTFAPLRSQDIIDAINKAH PSFHLQYVHPAKKKPGSRTGIEMLLNDELSFAQSSDTLKPYELQAAIKKGFELEQKAV AIDGIAIYVNLSLPIRGLTLEQVQQIFTGEITNWQQVGGPNLKITVFSRNPKAGGTVD FFQDTVLLGRDFGSFKEVNNTTESLQRVADTAGGIGYATASEVVNQKTVPVKLLPLAK SADKPYVAPFSGTNIEKVNQLAFIDSSYPMTRKLYVIIKKDGGVNEEAGTAYANLLLS VEGQKLVEQAGFAPIRPLTSK" gene complement(14347..15675) /locus_tag="DP116_08250" CDS complement(14347..15675) /locus_tag="DP116_08250" /inference="COORDINATES: protein motif:HMM:PF14516.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="PRJNA477356:DP116_08250" /translation="MHGKKFNDILDELTPDQKKVLKRFLAGETDEQIASERNCDTSTI RKHLNKVCTKFGLVNREGERFSYRPELVDLFVQDKPKWVNFDYWEDHRPDQIEPDFPG RPISSQSPFYIQRFYNEHLLLEKLCSQKVLQSGALIRIKAPKKTGKTSLVNKILAEAR HCGYRTIRLNLRQAEELILENLDHFLQWFCTNISQQLNLESRIDDYWDNKRLGSMVSC TTYFQAYLLEQIDTPLVLGLDEVDRLFEYEKIAKSFFTLLRSWHEEANNLQVWQKLRL VVAYSTEVYIPLNINQSPFNVGIPVKLPNLTLAQVQQLAKEYGLKSLDNTELERLRAM IGGHPYLIQLGLYHLHQKDLTLEQLLQTAPTLEGIYSSHLQGLWVMLSEHHELLEALR TLLDSGGKVQMQQVPASKLESMGIVQFEANQVKFSCQLYHSYFLSCLGAK" gene 16179..16487 /locus_tag="DP116_08255" CDS 16179..16487 /locus_tag="DP116_08255" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08255" /translation="MNKPFKINMNPQYFSNNIATQWYKERSRQASISFNLAVGLATAT VIFGIATAVSVCRNNVSVATATTAVGLTSGAASRRLFKLYDDTNKKLDDVAKELLDEQ " gene 16673..17209 /locus_tag="DP116_08260" CDS 16673..17209 /locus_tag="DP116_08260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315216.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08260" /translation="MKFTKLLASTIAVASVVLSAGIASAQNVPTQEAATRGMNGSYVG AGVSAGVTNGGRQNDAAVLGGNVQGRYAVPNAPVSVRGSVLFGGDSTAIIPTLTYDAP IAKNTNVYIGGGYAFQTNEGYASQLGNKNAPVLTVGAETQVAKNTVLYGDAKWGIDAY RDSDSDALSLQAGVGYRF" gene 17405..17815 /gene="crcB" /locus_tag="DP116_08265" CDS 17405..17815 /gene="crcB" /locus_tag="DP116_08265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fluoride efflux transporter CrcB" /protein_id="PRJNA477356:DP116_08265" /translation="MLQNPVIRAAVAISLGAIAGALSRYYLGLWFNQLFGTEFPYGTL IINISGCFVMGFFTTLLIRALRTIYLDARLLVTTGFLGSYTTFSTYELDTAKLLQQGN LEIGLFYWLCSAVLGMVCFQLGVICAKFFHIRKE" gene complement(17890..18273) /gene="crcB" /locus_tag="DP116_08270" CDS complement(17890..18273) /gene="crcB" /locus_tag="DP116_08270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315219.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fluoride efflux transporter CrcB" /protein_id="PRJNA477356:DP116_08270" /translation="MNSAISTIFAISLGAIPGALSRYYLTLFFTSRFGTAFPYGTFFI NITGAFLMGFFTSLTSKFGISQVVQLLVAVGFLGSYTTFSTYALDTSNVLQARGYKTA LLYWLSSPLLGFISIELGILLARMI" gene complement(18423..18950) /locus_tag="DP116_08275" CDS complement(18423..18950) /locus_tag="DP116_08275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457727.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08275" /translation="MITHHCKSVSVSLMSADLLIWSVVETPVTLHQKDGNRFHIVLTA PPVTDCEMTNFFSTESIGSQEQGNADVNTSRRILWLEISPKRVVMTMQGNAQMSYRHI WQQGVSGTTYYWLPNDLQQQNSQQPNKPIRLRNFTRHLTVSGHPLPENLSVEYELWVG DLLVGSYVLNLNIKH" gene 19335..19523 /locus_tag="DP116_08280" CDS 19335..19523 /locus_tag="DP116_08280" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08280" /translation="MSQGDTLAPFGGAAQTRTERKKPHQSEFAVWMFLPHSFAVGAEN TEKEIGRGKILCMNATRI" gene complement(19562..22453) /locus_tag="DP116_08285" CDS complement(19562..22453) /locus_tag="DP116_08285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409857.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08285" /translation="MSRDALVVGINTYSYERLTALTAPAQDAEAIATLLQNYGEFNVK RLPGVKDKQNNTIRVGQKTQVTLTQLEEAIVQLFKPEGKNIPDTALLYFSGHGLRKNK GIQEGFLATSDVNPDLGNWGIRLKWLRELLQESPIKQQIIWLDCCYSGELLNFAEADP GDRGKGRDRCFIAASREFEVAYEAISSNHSVLTDALLQALDPKRNSGTWVTNYTLIDV LNQRLQAFPQRPLFANSGGAINLTRTWKVPDIESSHSVVTSICPYRGLQYFDCTPEDA QYFYGREALTDQLLERVRAGNFLAVLGASGSGKSSVVRAGLLYQLRLGRRLSGSETWQ IKIFQPGEHPLQSLALAFVDSELSGIDRASQLAKAEELIAKKAEGLRYLIDTADTKRV LLVADQFEEVFTLCKDITERQQFFECLLGALEKTGDKLCLVLTMRADFFGKCAEQEYS GLAQQIQQNLVTVTPMNSEELRQAIVEPAKQVSLEVEPELVNQIIADVEGSPGSLPLL QFTLTEICQQREDEKLTLSTYTRLGGVKGTLQKRANQVYESLSTDEQVAAKQIFLELT QLGEGTEDTRRQVLQRDLVTSQQSPGLVEEVIQKLADAKLVVTSTLIEKGANSGKVPV VDVAHEALIRHWSLLRKWLDENRDKLRQKRKIETAAFEWRERGKTKDYLLQGKPLQEA RAFQQEQVGNLMLSDLAQDLIQQSLRHKRDNRIKFFGLGLIISLGLAVLLSTVIERFT IRQLQQIIKHAKGKEYSAQRNKALEKLVKMGVIINKINIDLSQTNLKDINLSDGILNG VNFSGAKLIGTNFYKAKLDSANFSKADLRVCFLNGASLNGANFTSANLSFALPRNAHL FKAILTKTDLRKANFSGSDLTAADLSSANLSDADLSGAYFGGANLNGANLNGANLNGA HFNSYPVFGYPVFGSAKNLTPEQVKSARNWEKAYYDDEFRVKLGLPSQK" gene 22581..22964 /locus_tag="DP116_08290" CDS 22581..22964 /locus_tag="DP116_08290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013325582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08290" /translation="MTKLTPIQLDDNTIIYIEATDDVNVPLVIAEEPAEEEEEALIDK GISPEAVRKQIVQNFQIIHTTIRAYTLCSLNAFKQLPIPGVNKVTLEFGIELGGQAGI PYVTKGTAKSNLKVTVECSFPKEMT" gene complement(22966..23346) /locus_tag="DP116_08295" CDS complement(22966..23346) /locus_tag="DP116_08295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006542959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08295" /translation="MKFLVDNALSPLIAQGLQQEGYDAVHIRDYGMQAASDTEVFATA ATEDRIIISADTDFGTLLALRQESKPSVILFRRRSERRPHRQLQILLANLLSIQEALQ QGSVVILEQSRIRIRALPIDSEDE" gene complement(23343..23570) /locus_tag="DP116_08300" CDS complement(23343..23570) /locus_tag="DP116_08300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020250110.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08300" /translation="MKFTRITVNPNQMGGVPCIRGLRIPVATVVGMFAEGMRENEILQ AFPDLEPEDITEALHYAAVAVAERELPLVNI" gene complement(23639..24319) /locus_tag="DP116_08305" CDS complement(23639..24319) /locus_tag="DP116_08305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_08305" /translation="MSVAKDQTTAPDTQEQEWDDIIFPPGDLYSDEPPLETDLHREQI DLLISLLKWWWRNRQDFYVSGNLTIYFSPNKRKSQDFRGPDFFVVLDTERKPRKSWVV WEEEGKYPNVIVELLSPSTADTDRGLKKKIYQDTFRTFDYFWFDPETLEFAGFHLVEG KYQPLEPNSEGWLWSQQLELFLGIHQQQLRFFSVDGQLVPTPEEAAAQAEALLARYRE RFGELPES" gene 24643..25863 /gene="chlP" /locus_tag="DP116_08310" CDS 24643..25863 /gene="chlP" /locus_tag="DP116_08310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015187975.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="geranylgeranyl reductase" /protein_id="PRJNA477356:DP116_08310" /translation="MTLRVAVVGSGPAGSSAAETLAKAGIETYLFERKLDNAKPCGGA IPLCMVSEFDLPPNIIDRQVRKMKMISPSNREVDINLVNEDEYIGMCRREVLDGYLRD RAAKLGANLINATVHKLNFPTNNTDPYTIHYLDHTEGGALGIAKTLEVDVIIGADGAN SRIAKEMDAGDYNYAIAFQERIRLPKDKMVYYEDLAEMYVGDDVSPDFYAWVFPKYDH VAVGTGTMQVNKARIKQLQAGIRARAARKLVGGQIIKVEAHPIPEHPRPRRVVGRIAL IGDAAGYVTKSSGEGIYFAAKSGRMCAETIVEMSNGGNRIPTEADLKVYLKRWDRKYG LTYKVLDLLQTVFYRSDATREAFVEMCDDRDVQRLTFDSYLYKTVVPANPITQLKITA KTLGSLLRGNALAP" gene 26327..27079 /locus_tag="DP116_08315" CDS 26327..27079 /locus_tag="DP116_08315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316321.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PEP-CTERM sorting domain-containing protein" /protein_id="PRJNA477356:DP116_08315" /translation="MKLSSVFAAALSTFAVSLITAFGFSNHAQGLTILGNSSGIWGTP DPGSNTDPVFSGVGTNTFTWGRSRPDDRQNNYGTAANELTFTGNPFSADDVGSLFKVG DLEYYNGKVEQSTSVDSVPLNLTLSFTNPGTFREVFNFGFQLVNTTNLGVNPEDDADI VYIKDNFDTRNFYFEGNEYQLNLIGFSQNGGNTTVNKFSVFEDDRTTAGIYARMTRIT PAKQIPEPAGIVGLSVLGIYLVTHKKSLGVKK" gene complement(27554..28300) /locus_tag="DP116_08320" CDS complement(27554..28300) /locus_tag="DP116_08320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216313.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_08320" /translation="MPRILVIDDDAAISELVAVNLEMAGYDVSQAEDGIKGQALALQL QPDLIMLDLMLPRVNGFTVCQRLRRDERTAEIPVLMLTALSQTQDKVEGFNAGADDYL TKPFEVEEMLARVRALLRRTDRIPQAAKHSEILSYGPMTLVPERFEAIWFTQTVKLTH LEFELLHCLLQRHGQTVSPSEILREVWGYDPDDDIETIRVHIRHLRTKLEPDPRHPRY IKTVYGAGYCLELPSLPQSTEGASATSGVQ" gene 28458..29540 /locus_tag="DP116_08325" CDS 28458..29540 /locus_tag="DP116_08325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316325.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="esterase" /protein_id="PRJNA477356:DP116_08325" /translation="MCYSSYSPPWFLQDGTVQTLYTALWASRDWEHTTENPEPPYEEK IFTGAKGVPIFGWVAIPDNAHGTIVGTYGITGDLDNQWYLKLLGRKAYAQGYAVVLFD WRAHGKTALLSPTLTSDGLYEGEDFVRIAATAKAMGCPGKFWFMGYSLGGQLALWAVK AAVELVKEDADLGLEDSEIGGAAVICPSLDSGRSLSFLVKHPIGKYLERSIARQLNKL AWQIHDAHPGTLEPEAIERANSIWGFDEELVIGRLGFPTVDAYYDASSALPLLPHLSK PTLIIYAEDDPFFEPAIIPDLQAACAKNSAIDLLLTRYGGHVGYISSQKCQRQAQDPD PWWAWNRILEWMGQQQPKSVNYLHGS" BASE COUNT 8302 a 6375 c 6213 g 8713 t ORIGIN 1 cccccggtag ttgttttgca atatcggtgt catctgcaag ttctttcagc ttatctctcg 61 cttctccaac tgcagtaaat aaagattgac cttctgcaaa agaatttaaa aaatatttga 121 gaaacttttg tgctaatttg tctggcactt cttcccgcca gacaatgatg tagggaagat 181 ttaaatctgc taattgtctt gctaaaccta aaccatcgca ggaatttaaa attgctaatt 241 ttaaaccttt gttaattgct gctttcagtg ttcttttgat ttctgataaa cttaaactgt 301 cattacgatt aatatttatg atacctgttt gaccatcacc tttggtttcg ctatgacctg 361 caaaaaatag gatatcacaa gcttcttccc agagcgcgtt aaaatctgaa cgatttggtt 421 cagtcaaaac tatgagttca gctgcccgtt tctgcaattt ttgcagtaac tctttatcgg 481 cactagtatc aatattttga ctatctccaa aaatactaat aatcctgggt cttctaattt 541 tattagaagt ggatgatttt tctgatgccg tagtacttac aacagaagag tgaaaacaaa 601 cagccgcttc acaagagaaa ttttgagcaa gaaaatccca ttcctgccaa ggaagtcgat 661 gtaaaatgtc cttggttgtc tgtgatccaa ttttctgtgt gttgataatt aaacgaattt 721 ctggttgaaa gtcttgtgac aatgcttgtt ttaattctga ttgaataggg taaagccact 781 gatgaaggtg acggcataaa tgctgagcag aatcgtaaca atctgttttg gaaatatgag 841 tgacttggtt gtttttaaaa ccacctctag aatgcttgag taacgagtag tattgctctt 901 tccagcgttg ataggaaatg ggaatttcgg catttggagg taattgcact tctagttgtg 961 tgatgttacg ctgtgcattt gctaccgtca cctcaacatt gagattacca aacccgcgct 1021 caaggtttcc gtcaccaaac tccaagttta cccatatttc cataagccat aaccctaaat 1081 tagtttaatt aaaccatcgc tatttttaaa ataaaaatcc aaaaataatt aacaaattgc 1141 ggtcaataag ctctagcaca ccactcgact actagtgggg gacttttctg cttttatctc 1201 taccctcttg tacaaaaggg tagcttgatg cgagcatcgc agtcttttat ccaatctcat 1261 
atatagctac aagtgtacaa atgttgattt cggatatttt ttatggaaaa ataaatatac 1321 tctatttgaa cagtatttta gtaagcgtct gagttgatat gtttaaaatc tttgtctttt 1381 actgaggtag catgcaaccc tgattttcta tttatccgct gctattaatt gttgctgaat 1441 tctcttgaat tcagcctgct tgatgctgcg agcaccgccg ctctcacgac gagttggaac 1501 tccgcgatta ggtggtattc gagcttgtat ctgattctgt ttaatagttt tgacggttcg 1561 aggtgtgttt tgcaagagca ataaattatg tgtatctgca tagttagctg atacagtgat 1621 tattcctact attaacgaag ctataacata ttgcgatttc atttttttct ccttgagaaa 1681 actcttcata atacaacttc acgagtgcgt atgtgttttt ctgacgagat atacagtttt 1741 attgaaacag taatcaaaag aacacagaac tcagaatgac cattaagata tatctagggt 1801 ttgagtaaag atactatgca ccacaggcag acgctttttt gttggcgcag gctgtggtgc 1861 gtgattctcc atagataaat cgaaggactt tagttagacg cgtttgctga ggtatgtacg 1921 ctttgaataa gtagccgtca tgaactgcgt acaccctatg cctacggcac gccaaaggcg 1981 aacgggtatg tcctccggac acgctgcgct ttagcccttt gggcgtgcgc aaagcgcata 2041 cggcagttgg cactctcgga ggaaagccgc tgagtcccaa agggactcag cggttgggac 2101 gcgtctacaa gtcgggaaac ccggtgttag cacctgcctc accagaacac atgaaaatat 2161 tctccctccc ccttcgggta tgccttcggc acacctatgt ccttcggaca cgcttcgcga 2221 acggtgaacg cagtcgcctc tgtcgggaaa gccgtcattc gcgctgtctc actccctcac 2281 tcactccttg ggacttatgc gctacgcgca ggcactttca caaaaatcaa ataggattac 2341 tatgattatt caaatgttat gaaataacat tttgattgtc gtctttgctt ataatgacca 2401 ctgatttaac aaagtcttaa aaaaacataa tttaaacaca aattatgttc ttttatgaca 2461 aacataaaca tcaaaagacg ttaaaacatt gattaacact agtgatagat aaacaccatg 2521 tcggaatctc gaaatacctc cagttcttca accttagaag cgacaggttc tagggcattc 2581 aatattcaac ccacaaatag ttttgagaat gatagttttt taaatacttt cagccggagt 2641 ggctcttcac aatctttctt agatgcagat caagagtttt ctgatacagc actaagcctt 2701 cctaacaatc ttaatcctaa tcccatattt actagtgcag caatttttcc tgatttcaat 2761 ggtgatggaa aaaaagacaa actctggcgt aaccctcaaa ctggtgaaac cgctatctgg 2821 ctgatggatg gtacaaatgt tgcctctcag ggtgcgttga aaacaatgag ttcagattgg 2881 gactctaaag ttgctgattt caatggtgac ggcaaaactg acatcatctg gcgcaatata 2941 aaaacgggtg 
ataacaccat ttggctgatg gatggcacta cagtcgcatc tgaagcatct 3001 ttgccgaatg tccctacaga ctggactttc agtactggtg agttcaatgg tgacggcaaa 3061 accgacctcg catggcgcaa tctaaaaacg ggtgagaatg ctgtatggct gatgaatggc 3121 acacaggttc tggcttcgtc tgctttggaa gcggttgatc caagctggac aggcactatt 3181 ggtgatttca accgtgatgg taaaactgac accttatggc gtaataaaac aacaggtgaa 3241 aacgctgttt ggttaaccaa tggtaacacc gttactaaaa ctgcattgcc aacccttggt 3301 acagattggc agcctagcct tgcagatttc aacggcgatt ttcaaaccga tattctgtgg 3361 cgtaataata aaacaggtga gaatgctgtc tggctgatgg atggttcaaa tgttgcttct 3421 cagactgcct tacctactct cggtgcaggt tggacatcta gtgttggtga tttcgatggt 3481 aatggtaaga cagatctgct ttggtataat ccgcaaacag gtgagagcaa ggtttggatc 3541 atggatggtg caaatgttgt cagtgacact gctttaccga cacaaagtgc agcctggaaa 3601 acaagtatta gcgatgttga tggcgacggc aagaccgata tcttcttacg caattacgaa 3661 acaggtgaga acaagatttg gaagatgaat ggttctactc ctactgaatc tgctctgcca 3721 acatttgcta aagaatggta cactttctaa gagtttttgt acaattcgta atttgtgatt 3781 cgtaattgat tcgtttttac gagtcatcaa tgacgaatga ttttggtttt cacttcatta 3841 atactgacat tatgttagtt agaaatcgcc gggaattatc ttggcgattt tttataaaac 3901 tcatttctca aataaaaaga cctcaccccc atcccctctc cttattaagg agaggggtgt 3961 tcatctgtca tcttaacaac atgtttgtga tagttaaatg cagttttgtc acaaagaaca 4021 aagataacag atgtaatatc ttaacagaaa cattccaaaa ataaagttaa aaatttgatg 4081 aacactagtc ataaacaaac attatggttg catctaaaac cttggactct tctacaggta 4141 ctagtgcatt ccaatctctc ccaatcaccg atagttttgc gggcacggat gatttaaatt 4201 ctgctttgag tcttggacgt tcttcagaat cgccactaga gaactcgttg cagtcaacac 4261 aagttgtaac tgcacgagaa aatcccaatc ctagtcgttt gtttagtagt acggggattg 4321 ttgctgattt taacggtgat ggcaagacgg acaaattttg gcgcaattct caaacaggtg 4381 aaactgctgt ttggcttatg gatggttcaa agccgacagg ggaattgttg tcgaacgtag 4441 actcatcttg ggattttgct tatgccgatt ttaatcgtga tggcaaaact gacatctttt 4501 ggcgcaataa gactacgggt gagaatgcca tttggcttat ggacggcaca agaatagctt 4561 ctgcggttag tttggagaaa gtcgatccat cctggactgc tagcattgct gattttaatg 4621 gcgatggtag aagcgatatt 
ttctggcgca atgccaaaac aggcgaaaat gccacttggc 4681 tgatggatgg cacaacagta accactgcag cttttctacc taaaactgat tcacaatggt 4741 cttacagtat tgtcgatttc gatggtaatg gcaaaaacga catcttttgg cgtaatcaaa 4801 caacgggaga aaatgctatt tggtttgtaa atggtaccga cacatcagcg tattctctat 4861 cgaaagtaga tccgtcttgg aattatagcc ttggcgattt taacggcgat ggtagaaccg 4921 acttgctgtg gcacaatact caaacggatg agaatagtgt ctggctgatg aatggtattt 4981 tcatcaattc aggttctctg gaaaaacaaa actcttcttg gaagtctagc gttggcgatt 5041 tcaacggtga cggcagaact gatatcttct ggcacaatac tcaaactggt gaaaacactg 5101 cttggttgat ggatggtaca acagtaacta ctgcggcttt tctgcccaca actgatgcag 5161 cttggcaacc cagcattggt gattacgatg gtgatggcaa gagtgacatc ttctggcgta 5221 gatacgacac tggtgagaat actatttggc agatgaatgg tactacagtt tctgcagctt 5281 ctacacagac tgttccggta gagtggtctg ttttctagaa atttgaggtg atgcagatta 5341 aaaacaatat gcacgcagat gataacttca tcatctgcgt gtatgtttct tttttatttt 5401 ttcatccttt cttctaaggc ttcttgtgcg tgttctcggt cgtcaaaatg gattttttcc 5461 gttccaagaa tttggtagtc ttcgtgacct tttccggcaa gcaaaacgcc gtctccaggt 5521 tgcgcttgca atatggcggt acgaatggcg gttgcgcgat cgcctatcac tatcggcttt 5581 actgtttctg gaactccctc taaaatatcc tgcaaaatcc tttctgggtt ttcagtccgg 5641 ggattgtcgg atgtcaccac agcgacatct gctaattgag cagcgattct acccattttc 5701 gggcgtttgg tgcgatcgcg atcgcctcca cacccaaaca cgcaaatcat tttccctgga 5761 ataaacgctc gcgttgcttt gagcaaattc tccaaactat ctggtgtgtg ggcataatcc 5821 acaataacgc tgatatcttg gtccggaaga atttgcactc tttccatccg tcccggaact 5881 ccggggaact caggtataac agatgcgatt aactgcaaat ctaaccctaa gtgtaaaact 5941 gctcctactg ctgctagtaa attttctagg ttatattgac caacaagcgg tgaacgaaaa 6001 gcaacttcac cctttggtgt atgcaatgta ccactgacac cattcggttc gtaactcagg 6061 tgactcatcc acaaatccgc tgtcgaatcg ttgacactat aactccacac tttctctggc 6121 ttcaaggatg ctattaaccg cttaccatat gtgtcatcag cgttgatgat tgcccgtccc 6181 ttaagatact gaggactaaa cagcagtgct tttgctgtaa aataatcttc catatcacgg 6241 tggaaatcca gatggtcttg ggtaagattg ctaaacactc ctacctcaaa ctgacaaccc 6301 atcactcgac cctgcgccaa agcgtgggaa 
ctcacttcca tcactccgta ctcattacca 6361 gcatcaagag cagtcgctag ctgctgttgc agttccacag caaacggtgt tgtatggaca 6421 gcagtttgaa caaaaccatg ccaacgagta taaagagtgc ccattaaagc tgtcgccaga 6481 tttgcttgat ggagaaaata ctcaatcagg tgggtggttg tggttttacc atttgtgccc 6541 gtcacaccta caagtttgag tttttgccca ggatagtcgt ggaaggtggc tgctaattgg 6601 gcacaggctt ttgtcatgtc atcactgctg ataacacaag cagatggtgt tggtggatgt 6661 ttctgtgctg cttgaggaga aataattgca gctacagcac cagaggcgat cgcactttgc 6721 caaaagtccc caccatcaac tcgtgttcca ggcatcccaa taaacaaatc tcccgcaacg 6781 caagcatggg aatttgtctt taaattcttg atttcagcgt ccattgctgg atgttgtggt 6841 aactgcacaa taatatcatc tacagttgct agtaactccc gcagtttcat tttctgaacc 6901 tcgtcacaca tcgtgatttt gcttattttg caccatgttt tctgtttcta caaacacttt 6961 cagatgacag gaagcttttt tgtcaaccta cactaacgta ctacttgaat agctggatac 7021 tgatacacaa accaatactg acgccaataa cctcaaagga taacccttat actggtgttg 7081 tcgcttgtct ataaatcttt cttgttattt atctaacttt gttaaattct gaataaaatt 7141 ttcatacaga taaaaaaagt ttgatatcct ttggtgttac ctaatctcaa aaaaaaattg 7201 tcagtgaatt attacaattt caatgagcat ttctggtctc cacagacagt ttgtagtcaa 7261 tcagaactac caaaatcaaa taaaaaaagt aatcgcagaa aaacactcac aggtgttttg 7321 agcttagtgt gtcttgctgt ttggacacca ctgctccttg ggcttatagt tcctagcgtc 7381 cataagctga aggaaaagcg ctctaatttc gagactagca actttgatgg agaagactcc 7441 gtatttaaac gtgagcactt tcttttctat caggctgcaa gtgggaggat gagaacccct 7501 gccttttaag gaggggatga aatccgacgc aaagggctaa agccctttgt ttttctgata 7561 cgatgcgtga aggtactggg tggcgaccca tacaggatat ccttgaccgg actccgaggg 7621 tagaaacgta gtcttcagaa cggcaggtgc tacaacgggg ggaacccccg caacgcactg 7681 cctcctgtac cgttgtctgc ggtgggacgc gtcgtggacg ttaaattaaa ttgcctgcgg 7741 agacggtgag accagcgctg cgggagggtt tccctccgca ggcgactggc gttagaccgg 7801 agggcgtgcc gcaggcatac ccggagggtc tggcggggat tggcaacaat ctagttaaga 7861 gtcaaagaaa caggaattag ggtggaaatg taaggcttta gcctacattg gaacatcctt 7921 taagaatccc cgtgcattta tgccggggag tacgtcaatt atccatacag cagcgataac 7981 gaaatcaagg gcttccagat atgtgttaag ctgtaccacg 
atagagcata cctagagtcg 8041 ttagcgataa acagtaatta ttgtttgctg tcttacgtgc tgccttaaga tatgccggaa 8101 aaatcgatgg aaaccaaagc tcgattcagg aaattccaga aattcttctg ataccgggct 8161 ggaaacccct aaggaggaac tcaacgtgtt taagcaaatt ctgacaggta gctatatttt 8221 tgtttttctt cttgtcagca gcctgtcggc tcatgcacaa gcaccaaaac cgacatctcc 8281 atcttcatct ccatcccctc aaattccaac aacaccccaa accaaggtga gtccagagga 8341 acttcaaaaa tttgcaaact cactcaaaca gttgagagtg attaagcagg gagcagttca 8401 gcagatgggg gacgttataa ataagtctgg tctgagtcaa gaacgatttt tggaaatata 8461 taaatcacag caaaatcctc cagagaagct aaaacgagca atgacctcac aggaaaagca 8521 gcaatatgaa aaaactgtca caagtctaaa gtcaattcag gaacaagctg atacaaacat 8581 gcagcaagtc ctgcaaaagg aagggctagg gctagaacgt ttcaatcaaa ttcaagtagc 8641 aatcagtcaa gatccggcgc ttcagcaaaa agtgcgggaa atgattaaaa gctagtgtaa 8701 gcaattttgt gaacatgggg atgcgattgc acttagccta ccgttctaac aggtggttgc 8761 acaaagcgca ctgctttgca caaccgccta gaatttaatg aatcccaagt tgcattcatt 8821 ttgataaaat gtcattttat cagtactact caccacccct ctgagtcaat tatggataac 8881 gaggttgttc tttttcttgt cagcgccagt gttctcatcc tctacctcat tttctctgcg 8941 ttaactgaaa tgggcacaaa gttaccttgg aaaaaatagc ggttcgcact ggctccccta 9001 cgatttgtgg tctattctaa ttaaattttg aattttgaga ttgcgtcgca acgcgctccg 9061 caacgctacg cgatgcgtga atttgagcgt ttgcgcagcg cccccttagg ggctagcgag 9121 tgactgctgc tgcatcgtac actgtccata taattccaac aagaacagaa caagagagtg 9181 tgctttctat gtcatccatc atcaggcttg ctaatgaaca ggatgctgag caagttctgg 9241 aaatttacgc ccccttttgc gagcattcac ctgtgtcttt tgaagttcag ccaccaactc 9301 tagatgaaat gcagcagcgg attgcaaaag ttctagaaaa gttgccttgg ctagtgtgcg 9361 agcatgatgg aaaagttctc ggatatgttt atgctgcacc tcatagagac cgaactgcct 9421 atcagtgggc tgttgatgta tctgtttata tccatgaatc agtgcgtcgt tcagggatag 9481 gacgagcttt atatacgtcg ttgcttaaaa ttctggtact tcaaggttat tacagcgcct 9541 acgcaggtgt cactctccca aacacagcca gtgaaaggct tcatgaactg atgggattcc 9601 agttcatagg aatatatcaa ggtgtaggat ataagtgtgg tgcgtggcat gatgtcgtgt 9661 ggtatgagct atctctgcaa ccgcgaagac caaatcccaa gcctcccatc 
aatatcaacg 9721 tattacgcaa taccttagaa ttggaatacg ctttggcaag tggactactt ttccttaagc 9781 ttcctgtcga ttaagaaatt atttcacacg ccgatatttc tgatttttgc caaagtaccc 9841 tgacaaactc ctaaatttca ggttaaatgt ttccagatat gccccttgag aaagttgtca 9901 ggtactattt gtgacacaag aacaaaaaac ctctcgttct tttgctggaa ttgctggcat 9961 tattgctgtt gcgacgttaa taagcaaatt ttttgggctg gtacgcgaac aggcgatcgc 10021 cgccgctttt ggtgcgggag cagttgtcga tgcttataac ttcgcttaca tgatccctag 10081 ttttatgcta atattactag ggggtgtgaa tggaccattt catagtgcga tcgtcagcgt 10141 cctagccaag cgcaagcaag aagaagctgc tcccttagtg gaaacaatga caacccttgt 10201 gggtggtttg ctgcttttgg taaccatttt cctggtaata tttgctccta acctaattga 10261 cttagtagca ccaggtttag acaccgttcg taacggagct tttataaaag aaaacgccat 10321 tgtgcaactg cgaataatgg caccaatggc agttttagca ggactaattg gtattggttt 10381 tgggactctc aacacagcta atcaatactg gctactttca ctcagtcctt tattttccag 10441 cattattctg attattggtc tgggtatcct cgctctgcaa ctaggtagca aaattagctc 10501 tcctcaatat gccattttag gtggaatggt tttagcggga ggcaccctag ccggagcatt 10561 cctgcaatgg ttgatacagc aaattgctca agcacgagcg gggttaggta cattgcgcct 10621 acgatttaat tttaagcagc caggggtgaa cgaggtactc aaaatcatgg caccagcaac 10681 cttttcctct gggatgttgc aaattaatct gtacacagat atgtattttg catccttcat 10741 cccctctgct ccttctgggt tgcgttattc caatatcttg gtgcaaactc ctttaggaat 10801 tatttctaat ataattttac tacctttgtt accaatattt tcccgactag ccgcaccaga 10861 aaattggcaa gagctaaaat tgcgaattcg ccaaggactc atactcaccg ccttcaccat 10921 gctaccaatg ggagcgctta tgatggcgtt gtctgacccc attgtacgag tggtttatga 10981 acgtggtgct tttacaaaag gagattccgg cttggtttcc tccatactag ttgcttctgg 11041 tctgggaatg tttgtttatt tggggcgaga tgttttggtg cgagtgtttt atgctttggg 11101 tgatggtcag acaccatttc gcgttagtat gattaacatc atattcaacg ctttgctaga 11161 ttggattttg tttaaacctt ttggtgcgcc gggtttggta ctagcaacag ttggggtgaa 11221 tttcatctcg ctattaatgt tgttatggtt gcttgatcgc aggctgaatg gtttaccttg 11281 gcgcgagttg ggtttaccaa ttttaggttt aactgctggt agcatggtag ctggaggagc 11341 aagctatggc actcttgagg ttttgcagaa atttttaggt 
gaaaatggtt tactaattca 11401 actattggaa atatctattg ctggtttagt tggcatcggc gtttttggtg tgattgttgc 11461 catgatgaaa ttaccagaga tcgagatttt tgtctctcgt ctgcgtgagc gctttttgag 11521 aagataaaga ataactcaat tactatttag aagtgagtgg tcgaataggt gcaaaaccag 11581 cttgttctac gagtttttga ccctcaacgc ttaagagtaa gttagcataa gccgtaccag 11641 cctcctcatt tacaccaccg tcttttttga taatcacata gagtttgcga gtcattggat 11701 aagaagaatc aataaaagcc agttgattca ctttctcgat gtttgttcca ctgaaaggag 11761 ctacataagg tttatcagca ctttttgcta atgggagaag ttttacaggt acagtctttt 11821 ggttaacaac ttctgaagcc gtggcataac caattccacc agcagtgtca gcaactctct 11881 gtaaagactc agtggtgttg tttacctctt taaaagaacc aaagtctctt cccagtaaca 11941 ctgtgtcttg gaagaaatcc acagtaccgc cagctttcgg gttacggctg aagactgtaa 12001 tcttcaaatt aggaccaccc acctgctgcc aatttgtgat ttctccagta aaaatctgtt 12061 gcacttgttc caaagtcagt cctctgattg gaagactaag gtttacataa atggcgatgc 12121 catcaattgc aacagccttc tgttctagtt caaagccttt ttttatagca gcttgtaact 12181 catatggttt aagagtatca gaagattggg caaagctcag ttcatcgttc aaaagcattt 12241 caatacctgt tcgtgaacca ggcttttttt ttgctggatg aacatattgt aagtgaaatg 12301 aagggtgtgc tttgttaatg gcgtcaataa tatcttgaga tctcagagga gcaaatgttg 12361 tggatccacc ataaaaaaat gttccttcag gaacattttt tactttagcg aaagtcgcta 12421 tacaagtatt aaggagtacc tcttgaagtg ggcaacgact tgcgcttgat aacaaatcaa 12481 tgcccgtata aatccccaaa ctgacggcga taagaatccc agataggata attaacttta 12541 atcgttgcca ctgcttttga aactttaatt ttcctaattc attgtcaacc caactctggt 12601 taaaaatggt ttcataaata tgattgccaa cttctaattt tccctggtaa ttgactacaa 12661 aaccaagtct taataatatg agttgtttgg gattgtcatc agctactacc tttcctgatt 12721 gcaaaacctg tcgatacagt tgcagtaaat tttcggcatc tttattctgc acaagctcat 12781 cgcgtatttt cctcaagtgt tctggttgat cttgactttc ccagttgtcg atgatgcgct 12841 cacgcaccat ccgctctacc caatcagctt catcctctcc ctggcaaatt ggaaattgct 12901 cattgaggat aagtcggaaa agtttttggg caagaactgg ttcagacttc gttattctaa 12961 gtacttttct aatgacaggc atgggattac taaccttcgc aagtaattcc tctgctgtgt 13021 taatcgtttg tctgtctgct 
tccgccaaag tccctcgttt gtctaggact cgactagcaa 13081 tcaagaaccg atgttcatca atactgaatt tattctgtcg atccaaacct ttatcgacta 13141 accaagacca tccttcctca aatgcctgcc cgcgtaacag ccttgaatca tcttgatact 13201 gactttctac ccaagcgctc aaagcctctg cataaggtcg aagcttttcc agttcctggt 13261 taagccattt ttggttaaaa actctcttat atatagggtt gaaaactttc agtttaccat 13321 tatgcttaac caccaatccc gagagccgca attccatttg ttcataagag tgatctgcat 13381 ctatttcttt gtgtcgcaaa atggattggt atagtcctag catctgctgt gttagatgtt 13441 tattagtaag aaggcgatcg cggatagtct ttaaatgttc tggctgatcc tgagattccc 13501 aattctcaat aatttttgac cttaccactt gctcaactga taggggattt tcgactgtta 13561 atccctctga cattagctta cacagtttgt gagttaaaaa aggttgtcct cctgtccagt 13621 acaaaatttc ccttaacact tcctgaggat gactaacttt acccactagc cctttttcta 13681 aagcttgggc ttcctcaaac ttaaacccat ctaaggcaat tgccttgcca atattaaacg 13741 gggtgcgggc tttatctcta atcaaatctg aaggcgttgc tacccctagc aaacaaaatg 13801 ttagcctttc gtaatctggt ttatcgactc gttggttgta acaagcccga atcagcgcga 13861 aaaaatcatt aatcgagaaa ttcaagctga taatgctgtc aatttcatcg acaaaaatca 13921 caatcctttt ttccggcaca tacaccaaca aaacttcttc aatgaatcga tctaggagct 13981 gaactaaaga aagctcttca tgctcctgta accatttttt ttgattaacc ttactcaaga 14041 ggttaaaccc ctgccaaagc agagacacta tgccgccata ccagttttct ggagttaggt 14101 gttcgctacc aattctggtg atatctatag cagcacagct aacgcctgag tgtctcagcc 14161 gatccattgt ctgcacacgt aagctagact tacccatctg ccgactattg agcacgtagc 14221 aaaactgccc agccattagg gcgtcataca gttcttggtc ggcttgtcgc atgacatagc 14281 taggggcatt tggctttaag ctaccgccca cgtgatagta atcggagatt aattctaaag 14341 ttgagttcat tttgccccca agcaagataa aaagtaactg tgataaagtt gacagctaaa 14401 cttgacttga tttgcctcaa actggacaat acccatactc tctaacttag aagctggtac 14461 ttgctgcatc tgcactttac cccctgaatc aagtaaagtt ctcaaggctt ccagcagttc 14521 gtgatgttca ctcaacatga cccaaagtcc ttggagatgg ctgctgtaaa ttccttctag 14581 agtgggggct gtttgcaaaa gttgctctaa agtcaggtct ttctgatgca ggtgatagag 14641 acctagctga attaagtaag gatgtccacc tatcattgct cgcagtcgtt ctagttctgt 14701 
gttatctaac gacttcaatc catactcttt tgctaactgc tgtacctgtg ccaaagtcag 14761 atttggtaat ttaacgggta tacccacatt gaacggcgat tgattgatat tgagagggat 14821 atatacttct gttgagtaag ccacgaccaa tctcaatttt tgccaaacct gtaggttatt 14881 tgcttcttca tgccaactcc tgagtaaagt aaaaaatgac ttggcaattt tttcatactc 14941 gaagagccga tctacttcat ccaaccccaa aaccaaaggt gtatctattt gctccaacag 15001 ataagcctgg aaataagtag tacagctaac catgctgccc aatctcttgt tatcccaata 15061 atcatctatg cgcgactcca gattaagctg ttggctgata ttggtacaaa accactgcaa 15121 aaaatgatct agattctcaa ggattaattc ctctgcttgt cgcagattca aacgtattgt 15181 tcggtaccca caatgtctag cttctgccaa aattttattc accaaagagg ttttgcccgt 15241 tttttttggg gcttttattc tgattaaagc tccagattgc agcacttttt gcgaacagag 15301 tttttctagc aataaatgct cgttataaaa acgctgaatg taaaaaggag actgcgagga 15361 tatgggacgc cccggaaaat caggctcaat ttggtcggga cgatggtctt cccagtaatc 15421 aaaattgacc cacttcggtt tgtcttgtac aaacaagtcc accaactctg gtcgatagga 15481 gaagcgttct ccctctctat ttaccagacc aaattttgta cacaccttat tcaaatgttt 15541 tctaatcgtg gatgtatcgc aatttctctc gctagcaatt tgctcatcgg tctcgcctgc 15601 caaaaatctc ttcagcacct ttttttggtc aggtgtcagc tcgtctaata tgtcgttgaa 15661 cttttttcca tgcatctttt tcaagcgctt tttggctagt aaattggcag aagtctcaga 15721 acttaaatac aatctgtaag ttcgggtttg ggaactttat ctaaccgagt ttttattcgc 15781 ttgatgaata aaattctatg aatttttcag gcgatcgctt tctctgagta acggctaaac 15841 tgtagcatta tactcatggt gactatattt atagactgta aaagtcccat gactgtacgg 15901 cgacaaattt tacctacatt ttaaacaaaa aataatgatt ggaaatattt gtgagtattt 15961 ttgcacaaag tccaaatcaa atgctacttg tgcgtatata acagcggtac ttatgcgctt 16021 atttgctctt cagttatgtt gaatctggat ggttttgtca ctcagcaact ctcagactgc 16081 gaaaaatgtg atttttagtg cctaaaccgc ctactctcca aaattcaagg ttaaatctga 16141 gataagagct tcaatcctag ttttttaagg aaatggtcat gaacaaacca ttcaaaatta 16201 acatgaatcc acagtatttc agcaataata ttgcaacaca atggtacaaa gaacgctcca 16261 ggcaagcgag tataagtttt aatctagctg tcggtttagc aacagcgact gtcatctttg 16321 gtatcgcaac tgctgtttca gtttgcagaa ataatgtatc agttgcaaca 
gcaacaacag 16381 ctgtagggct tacctctgga gctgctagta gacgcttgtt taagttatat gatgatacaa 16441 acaaaaaact ggatgatgtt gccaaggagc tattggatga acagtaaata tcctcaacac 16501 tataacgatt gtagatttct gaaccttaaa ttgtctactt tgttcaccca aatgggtgaa 16561 acagtttcac ccacttggat agcctaagtg cgtgaggtta aattttgtta aaaaagggat 16621 tctagtacta gagaagattg attgcactaa ttcactctag taaaagaaac ctatgaaatt 16681 tacaaaatta ttagcttcta caattgctgt tgcttctgtt gttctttctg ctggtattgc 16741 ttcagctcaa aacgttccaa ctcaagaagc tgcaacacgt ggtatgaacg gtagctacgt 16801 tggtgctggt gtttctgctg gcgtcaccaa tggcggaaga caaaatgatg cggctgtatt 16861 aggtggaaat gttcaaggac ggtacgccgt tccgaatgcg cctgtttctg ttcgcggttc 16921 tgttctgttt ggtggtgatt caacagcaat tataccaaca ctaacatacg atgcacccat 16981 cgctaaaaac actaacgttt acatcggtgg tggttacgct tttcagacta atgaaggtta 17041 tgcaagccag ttgggaaata agaatgcacc tgtcttaact gttggtgcgg aaacacaagt 17101 ggctaaaaac accgttctct atggtgatgc taagtggggt attgacgcct atagggacag 17161 tgattctgac gccctgagct tacaagccgg agttggttac cgcttctaat ccttcatagt 17221 cagtaaagaa ggattttgat gcaaaaccac caaatgtata ggagtagtga agcttttcct 17281 ctgatttgtc ggagcttcac tgctccttcg ctatttttgc agtaatgaac ataagtatta 17341 gattgagcaa caaaacgaaa ttgtataaca tgaaaaagca ttctatgatt ctgaatgtat 17401 tcctatgctg caaaatcctg tcattcgcgc cgctgttgca atcagtcttg gcgcaattgc 17461 tggagcgctt tcgcgctatt atctcggttt atggttcaat cagctttttg gtacagaatt 17521 tccctacggt acactgatta ttaacatcag tggttgtttt gttatgggtt tctttaccac 17581 attgttaata agagcattaa gaacgattta tcttgatgcc cgactgctag tcaccaccgg 17641 ctttttaggg tcttatacta cgttttcaac ttatgagtta gatacagcca agttgcttca 17701 acaaggtaac ttagaaattg gtttgttcta ctggctctgt agtgccgtat taggaatggt 17761 atgctttcag ttaggagtta tctgcgctaa atttttccat attagaaagg aatgaaaact 17821 cattcgacta actgtaaata aacttgtcaa aaaaatgata tgtgtccaga catgaagaga 17881 atcaaaaaat tatatcattc ttgctaacaa gattcccaac tctatactaa taaaccctaa 17941 taacggacta ctcaaccaat aaagtaaagc agttttatag cccctagctt gtaacacatt 18001 tgaggtgtct agggcataag tagaaaaagt 
agtgtatgat cctaaaaatc ctacggctac 18061 taataattgc acaacctgag aaatcccaaa cttggatgtg agactagtaa aaaaacccat 18121 caaaaaagcc ccagttatat taataaaaaa ggttccataa ggaaatgctg tcccgaagcg 18181 agacgtaaaa aataaagtaa gataataacg acttagggca cctgggattg ctcctaagct 18241 aatggcaaaa atagtagaaa tagcagaatt catataagtc actaactggt tgttttatat 18301 ttttatttag tgcatttaag tcaatttctt cagtttttac acttgaagtt agtgatttat 18361 aatatacaac gaatatggca atatgtgact gcttttctaa gatgcccctg ttagatccgt 18421 ttttaatgtt taatattcaa attgaggaca taacttccca ctaggagatc tcccacccac 18481 aattcatatt ctacagataa attctctggc aaagggtgtc cgctcactgt tagatgacga 18541 gtaaaattcc gcaagcgaat aggtttattt ggttgttgtg aattttgttg ttgtaaatca 18601 tttggcaacc aatagtaagt ggtgccagaa acgccctgtt gccaaatgtg acggtaactc 18661 atttgagcgt taccttgcat agtcatgacg actcgttttg gagaaatctc taaccacaag 18721 attcggcgac tggtgttgac atcagcatta ccttgctctt ggctacctat actctctgta 18781 ctgaaaaagt ttgtcatttc acaatctgtg acaggtggtg cagtcagtac tatatggaat 18841 cgattaccgt ccttttgatg cagtgtcacc ggagtttcca caacagacca aattaatagg 18901 tctgcggaca taagtgatac agatactgac ttgcaatgat gagttatcat gggagtgatc 18961 acgggtaaaa acaatattaa aaagattttg taaatacacc tctacttatc tcataggtga 19021 aagcaagagt gacttatgca ctaaattcat tctgataagt agatcaacat tgaaaagcgt 19081 aaaatagagt ctttgcgatt atttagctcg acccgaaacg tggagcgaat taatgacatt 19141 ttacgtttaa ttaggttaag ctactaaaca tctgcttcat catgtttgca aaatgtaaaa 19201 tataggaatc cggtttggtt tataaacgaa ctcgttgagg tagggaacag ggaacgcgtc 19261 tgtgtttcaa gcttccttcc ttagggtctc ggctcataaa ataacttttc ctcaattgtg 19321 atactaagtt gcctatgtcc caaggggaca cgctagcccc gtttgggggc gctgcgcaaa 19381 cgcgaacaga gagaaaaaaa ccgcaccaga gcgaatttgc ggtgtggatg ttcctcccgc 19441 actcgttcgc ggtgggcgca gagaacacag agaaagagat agggagaggg aaaattctct 19501 gtatgaatgc aacgcgcata tgacatagct attttggcat cgcttctgcg gttccttgtc 19561 cctacttctg cgatggcaaa cccagcttga cacgaaactc atcatcgtag tacgccttct 19621 cccaatttcg ggcagattta acttgttcag gagtcaggtt tttggcacta ccaaagacgg 19681 gataaccaaa 
gacgggataa ctgttgaagt gggcaccatt aagattcgca ccattaaggt 19741 tcgcaccatt aaggttcgca cctccaaagt aggcaccact gaggtcagca tcactaaggt 19801 tagcactact gaggtcagca gcagtgagat cgctaccact gaagttggct ttacgaaggt 19861 cggttttggt aaggatggct ttgaagagat gggcatttct aggcaaggcg aagctgaggt 19921 tggcactggt gaagttagct ccgttgagac tagcaccgtt aagaaagcaa acgcgaagat 19981 ctgctttgct gaagttggca ctatcaagct tggctttgta gaagttggta ccgatgagct 20041 ttgcaccgct aaagttgaca ccattgagaa tgccatcgct gaggttgata tccttgaggt 20101 ttgtttgact gaggtcaatg ttaatcttgt taattatgac acccattttt actagttttt 20161 ctaatgcctt atttctttgc gcactatatt cctttccctt agcatgtttg ataatctgtt 20221 gtagttgcct tattgtaaat ctctcaatta cagtacttag aagtactgct aaaccaaggg 20281 aaataattaa acctaatcca aaaaacttga ttcgattatc ccgtttatgt ctcaagcttt 20341 gttgaatcaa atcttgagcc aaatctgata gcatcaagtt tccaacttgc tcctgctgaa 20401 atgcccttgc ttcttgcaat ggttttccct gcagtaaata atccttcgtc tttcccctct 20461 ctcgccactc aaaagccgcc gtctcaattt tgcgtttttg tctgagtttg tctcggtttt 20521 catcaagcca ttttcgtaat aatgaccaat gacgaatcag tgcttcatgg gcaacatcga 20581 ccaccggaac tttacctgag ttcgcccctt tctcaattaa ggtgctggtg acaaccaact 20641 tggcatctgc taacttttga atcacctcct ccactaaacc cggtgattgc tgcgatgtga 20701 ctaaatctcg ttgtaaaacc tgcctgcgag tatcttctgt tccttctcct agctgtgtca 20761 gttccagaaa aatctgcttt gctgctactt gctcgtcggt tgataaagat tcataaacct 20821 gattagcacg tttttgcagc gttcccttga ctccaccaag ccgcgtatat gtggagagcg 20881 ttaatttttc gtcttctcgc tgttgacaaa tttctgtgag tgtgaattgc aataacggca 20941 agcttcccgg tgaaccttcc acatcggcaa tgatttggtt aacgagttct ggctcaactt 21001 ccaaacttac ctgtttcgca ggttcgacaa tcgcctgtct caactcttct gagttcatgg 21061 gcgtaactgt caccagattt tgctgaatct gctgcgctag tccgctatac tcttgctcgg 21121 cacacttgcc aaaaaaatcc gcccgcatcg tcagcaccaa acagagttta tcgcctgtct 21181 tctccaaagc acccaacagg cactcaaaaa actgctgtcg ttcggtgatg tctttgcaca 21241 gggtaaagac ttcctcaaat tgatccgcca ccagcaacac ccgcttagtg tccgcagtgt 21301 caatcaaata tcttaacccc tcagctttct tggcaatcaa ttcctctgct tttgctaact 
21361 gggatgcccg gtcaatacct gataactccg aatccacaaa agcaagtgct aaactttgca 21421 gtggatgctc acccggttga aaaatcttga tttgccaagt ctcactacct gataaccgcc 21481 gccccagcct cagctgataa agtaatcctg ccctgacgac actcgactta ccactcccag 21541 atgctcccag taccgccaag aaattccctg ctcgcactcg ttctaaaagt tggtctgtaa 21601 gcgcttctct accgtaaaaa tactgggcat cttctggagt gcagtcaaag tattgcaatc 21661 cccgataagg acaaatgctc gtgacaactg aatgactaga ctcaatatct ggcactttcc 21721 aagtgcgagt caggttaatt gcaccaccgg agttagcaaa caagggacgc tgggggaatg 21781 cttgtaaacg ttgattgaga acatcaataa gagtgtaatt tgtcacccaa gtaccagaat 21841 tgcgtttggg gtcaagtgct tgcagtaaag cgtctgtcag cacactatga ttgctactaa 21901 tcgcctcata agcaacctcg aattctcttg atgcagcgat aaaacaccta tctcttcctt 21961 tccccctatc ccccggatcg gcttcagcaa aattcagcaa ctcaccactg taacagcaat 22021 ccaaccagat aatctgctgt ttgattggac tttcttgcag tagttcgcgc agccatttca 22081 agcgaattcc ccagtttccc aaatcggggt tgacatcgct ggtagcaaga aagccttcct 22141 gaattccttt atttttacgc aatccatgac cagaaaaata cagcagcgca gtatctggaa 22201 tatttttgcc ttctggctta aacagttgaa cgatcgcttc ttcgagttga gttaatgtaa 22261 cttgagtttt ctgaccaacg cgaattgtgt tattctgctt atccttaaca ccaggtagtc 22321 gcttgacgtt aaattcgccg tagttttgta agagagtggc gatcgcctct gcatcttgtg 22381 caggtgctgt caaagcagtt aaacgctcat aactataagt attaattcca actaccaaag 22441 cgtctcggct cataaaagac agcttcgctc ctgttttata gagtcattaa acagatacaa 22501 tctcccagtc aatattttcg gtataaatgc agtagtatta tgataattta ttttattctt 22561 tgaaattagc aattcttact atgaccaaac tcacacccat ccagctagat gacaacacga 22621 ttatctacat tgaagccaca gacgacgtga acgttccctt agttatcgct gaagaacccg 22681 cagaggaaga ggaagaagca ctgattgaca agggaatcag tccggaagct gtgcgcaagc 22741 aaattgtgca aaattttcag ataattcaca ccacgattcg tgcttacacg ctttgtagcc 22801 tgaatgcatt taagcaactt cccatcccag gagtcaataa agtgacttta gaatttggta 22861 tcgaattggg tggacaagca ggcatacctt atgtgacaaa agggactgct aaaagtaact 22921 tgaaggttac cgttgagtgt tcatttccca aggaaatgac atagtttatt catcttcact 22981 gtcaattggc aatgcacgaa tacgaatgcg actttgctcc 
aagatcacta cgctaccttg 23041 ttgtaacgct tcttgaatag agagcaagtt tgccagcaaa atctgtagct gtctatgggg 23101 acggcgttca ctcctgcgac ggaacaaaat cactgatggc ttgctttcct gtcgcagtgc 23161 taacaacgtg ccaaaatcgg tgtcggcaga aatgataatt cggtcttcag tagcagctgt 23221 tgcgaacacc tctgtgtctg aggctgcttg cattccgtaa tcacgaatat gaactgcatc 23281 atacccctcc tgctgaagtc cttgggcaat tagaggcgat aaagcgttgt ctactaaaaa 23341 tttcatatat taacgagagg caattcacgt tcagcaactg caacggcggc gtagtgtaaa 23401 gcctctgtga tgtcttctgg ttccaaatct ggaaacgcct gcaaaatttc attctctcgc 23461 atcccctcag caaacattcc caccacagta gcgacaggaa tccgcaaccc cctgatacaa 23521 ggaacaccac ccatttgatt tgggttaaca gtgatgcgcg tgaatttcat ggttttcctg 23581 gctcatgaat agttctaaat tcaggatagg cgatcatcgc atcgaaaatt ttttccctct 23641 aactctcagg taattcacca aagcgttcgc gatatcgagc caacaaagcc tccgcctgtg 23701 ctgctgcttc ctctggcgta ggaacgagtt gtccatcaac tgagaaaaaa cgcaattgtt 23761 gctgatgaat acctaaaaaa agctcaagtt gctgactcca cagccaaccc tctgaattag 23821 gttctaacgg ttgatatttg ccttccacca agtgaaagcc tgcaaattct aatgtttcgg 23881 gatcgaacca gaaatagtca aaagtgcgga aagtatcctg gtaaattttc tttttcaaac 23941 ctcggtcagt atcagcagtt gatggagaaa gcaattctac aatcacattc gggtatttgc 24001 cctcttcttc ccacactacc caacttttgc gaggtttacg ttcagtatct aaaacaacaa 24061 agaaatctgg tcctcggaaa tcttgtgatt tacgcttgtt gggactgaag taaatagtga 24121 gattaccgga aacataaaaa tcttgacggt ttcgccacca ccacttgagc aggctaatga 24181 gcagatcaat ttgttctcga tggagatccg tttccaaggg tggttcatca ctataaaggt 24241 cgccaggtgg gaatataata tcatcccact cttgctcttg ggtatctggt gcagtagttt 24301 ggtctttggc aacagacata agtgtctcag caaaggaaga taaacatagt gtagcgcctc 24361 ccaaaataag caatgagcaa aagcgatgag cctgatcgaa gctaaggggt gtaagggggg 24421 aaagacaatt ttacttcttc cactcttgtc cacccctctt cccctctttt cccctacacc 24481 ctcataccct tacaccctta caccctgccc aaagggcatt gaaactcttg cgaaatatta 24541 aggaactttg catttatgtc aaatctggaa accagagcgc gaaattagtc aaaagagcgt 24601 gataccattc tttcggtata tgtcaagatt gggagaagac ctttgacact acgggttgct 24661 gttgttgggt ccggtccagc 
tggttcatca gctgcagaaa cgctagccaa agccggaatt 24721 gaaacctacc tctttgaacg caagctagac aatgctaagc cgtgtggcgg tgcaattcct 24781 ctgtgcatgg tgagtgaatt tgacctgccg ccgaatatca ttgatcgtca agtgcggaag 24841 atgaaaatga tttcaccctc caatcgtgag gttgatatca atctagtaaa cgaagatgaa 24901 tatataggaa tgtgccgccg tgaggtgctg gatggttacc tgcgggatcg tgcggcaaaa 24961 ctaggggcaa atttaattaa tgccactgtt cataaactca attttcccac caacaataca 25021 gacccctaca ccattcatta cctagatcat acagaaggtg gtgctttggg tattgccaaa 25081 accctggaag ttgatgtgat tattggcgca gatggggcaa attcccgcat tgcgaaagag 25141 atggatgcgg gggattataa ttatgcgatc gccttccaag agcgtattcg tctacccaaa 25201 gacaaaatgg tctactacga agaccttgct gaaatgtatg tcggcgacga tgtgtctcct 25261 gacttctacg cttgggtatt ccccaaatac gaccacgttg cagtgggtac tggtacgatg 25321 caggttaata aagcccgcat caaacaattg caagctggta tccgcgcccg tgctgcccgc 25381 aagctggttg gtggtcaaat tatcaaagta gaagcacacc ccattcctga acatcctcgt 25441 ccccgtcgcg ttgtcggaag aatcgcttta ataggtgatg ctgctggcta tgtgaccaaa 25501 tcgtctggtg aaggcattta ttttgctgct aagtccggac ggatgtgtgc ggaaactatc 25561 gtagaaatgt ccaacggtgg taaccgcatt cctacagaag ctgacctcaa ggtctacttg 25621 aagcgctggg atcggaaata cggactcact tacaaggtgt tagaccttct acaaaccgta 25681 ttttatcgtt ctgacgccac tcgcgaagct tttgtagaaa tgtgtgatga tcgcgatgtc 25741 caacgtctca cttttgatag ctatctctat aagacagttg tcccagctaa tcccatcact 25801 cagctcaaaa taaccgccaa gacgcttggt agcttacttc gcggtaacgc ccttgctcct 25861 taaaaggaac acgacaaatt tgtgacaaag acgactctgt cactttcaca atgaacagtt 25921 atttgttatc agcaagcagt tatcatgtca tcctcaaaat ggatacagga aaacgccaat 25981 cagccctcac aaattaattt gtgaataaat caaaaaaacg ctggttgaac tgataacaga 26041 tggctgttca ctattatgta agtatgagtt ttaattttct ttcatcagaa ggtaatacac 26101 tggaagctag atgatttatt attaatacta attattttgt caaaccctta tcgagtccta 26161 attctaacca atactccaac aaaattgctt tgataagatt taaaaaaaat tatccaaaaa 26221 attcacaaaa actttattaa tctattggca aaatttctat aaatttatta aggtagactg 26281 ggtacttatt tgagaaaagc tcctggatag gagtgaaagc ctagctatga aactaagttc 26341 
agtttttgct gctgctttgt caacttttgc tgtgagtttg atcacagcat ttggtttctc 26401 caatcacgct cagggtctaa ctattttagg caactcgagt ggtatctggg gaacacctga 26461 tccagggagc aataccgatc cggttttttc cggtgtggga acgaacacat ttacttgggg 26521 acgttctcgt ccagatgatc gtcaaaacaa ctatggaact gctgcaaatg agctgacctt 26581 tactggaaat cccttttccg cagacgacgt tggttcgctg tttaaggtgg gcgatttaga 26641 gtattacaat ggcaaggttg agcaaagcac gagtgttgat tccgtgcctt taaatctcac 26701 tctatcattc actaatcctg gtactttcag ggaagttttc aacttcggct ttcagcttgt 26761 gaatacaaca aacttaggag tgaatcctga ggacgatgca gatattgtgt atataaagga 26821 caactttgac actcgcaatt tctactttga aggaaacgaa tatcaactga atttaattgg 26881 ctttagccaa aacggtggga atacaactgt taacaaattt agcgtttttg aagatgatag 26941 aactacggca ggcatttatg ccagaatgac tcggataaca ccagcgaaac aaattcctga 27001 gccagcaggt attgttggat tatcagtgct aggcatctac cttgtgaccc ataagaaatc 27061 cttgggtgta aaaaaataat aggctgaggt gaattcacac gtaatttgcc tatttatagc 27121 caggggtgtg aagtcctggc tatttatctt gggaaagtaa gcagttcgtt gctcacagca 27181 gggctttacg gttagtaagc gcgctgcgtg tccctttgaa acttacgcgt agcgtctgta 27241 cacgagatac ccagaaagtc aaacaaattg gctacactac atactattga cactctctca 27301 ttttccgtag cgtagagtgt caattttttg agtgaacaac tgaagtgata agtaaagtga 27361 gggtaacaat ctatctcggg agcgcaagaa tgagtttatc ctctatctac tgaaaccacg 27421 ccccctttgg actcgcttcc aaaccgatat ctaaacacac cttgctatgc gtcgatctat 27481 cggcagtgta cacgactgca tgagacacgt tgcgacgacg agcaggatca gcacctccat 27541 tatttggctg ttgttactga actcctgatg ttgcagaagc accttcagtt gactgaggta 27601 aactgggtaa ctccaagcag taacctgcac catacaccgt ttttatataa cgaggatggc 27661 ggggatcagg ttctagtttg gttctcaggt ggcggatgtg tacgcgaata gtttctatat 27721 catcatctgg atcataaccc caaacttctc gaagaatttc acttggagaa actgtctgac 27781 cgtggcgttg aagcaagcag tgcagaagtt caaactctag atgagtcagt tttacggtct 27841 gagtgaacca tattgcctca aatcgttctg gaaccaaggt cattggtccg taactcagaa 27901 tttcactgtg ctttgcagct tgaggaattc gatccgtacg tcgcaagagt gctcgcaccc 27961 gtgccaacat ttcttcaact tcaaacggtt tagtgagata atcatctgca 
ccagcattga 28021 agccttcgac tttatcctga gtttggctca aagccgttaa catcaacacg ggaatctcgg 28081 cagtacgctc atccctccgc aggcgttggc aaacagtaaa tccatttacc ctcgggagca 28141 tcaagtcgag cataatcaaa tctggctgta gctggagagc aagcgcctga cctttgatgc 28201 cgtcttcagc ttggctgaca tcataaccag ccatttctaa gttgacggca acgagttctg 28261 aaattgcggc atcatcgtct atgacaagaa tcctcggcat tattaaaaat ttattactac 28321 ttattaagga taattaacaa cctttggtag atacaaagat tgctgtattg attataaaaa 28381 atattttaaa tctaacataa agattaaaat taaaggattg tggtgctaaa cccaaactat 28441 actaaaaagc tgtcccaatg tgttactcct cttacagtcc accgtggttt ttgcaagacg 28501 gtaccgtcca gactctctac accgctttgt gggctagtcg tgattgggaa catacaacag 28561 aaaatccaga accgccttat gaagaaaaaa tctttacggg tgccaaaggg gttcctatct 28621 ttggctgggt agccatccct gacaatgctc atggcactat tgtgggcact tacggcatta 28681 caggcgattt ggataaccaa tggtacctca agctattggg acgtaaagca tatgctcaag 28741 ggtatgctgt agtactgttt gattggcgtg ctcatggtaa gacagctttg ttgtcgccaa 28801 cgttaacaag tgatgggttg tacgagggag aagattttgt tcgcatcgca gcaactgcca 28861 aagcaatggg atgcccagga aaattttggt ttatggggta ttctctgggt ggacaattgg 28921 cgctatgggc agttaaggct gcggtggaac tcgtcaaaga ggatgcggat ttagggctag 28981 aagatagcga gattggaggt gctgctgtta tttgtccgag tttggattcc gggcgatcgc 29041 tttcttttct ggtcaaacat cctataggaa aatatttaga aaggtcgatc gcacgccagt 29101 tgaataaact cgcatggcaa attcatgatg ctcatcctgg aactcttgag cctgaggcga 29161 ttgaaagagc taacagtatc tggggttttg acgaggaact cgtgattggg cggttgggtt 29221 ttcccacagt agatgcttac tacgacgcaa gcagcgcttt accacttctg ccacatctgt 29281 ccaaaccgac tttaattatc tacgctgaag atgacccgtt ttttgagcct gctattatac 29341 ctgatttaca agccgcttgt gcgaaaaact cagcaataga tttgttgctg actcgttacg 29401 gtggtcatgt tggttatatc agtagtcaaa aatgtcaacg ccaagcacaa gaccccgatc 29461 cgtggtgggc gtggaatcga attttagagt ggatgggtca acaacagcca aagagtgtga 29521 actacctaca cggttcctga gtacaggtac agtgaacagt aaacagtgaa caaggggtgg 29581 acgagtccgt ttcctacctg ata // LOCUS NODE_955_length_29603_cov_5.08569129603 bp DNA linear BCT25-JUN-2018 
DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 29603) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 29603) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..29603 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..211 /locus_tag="DP116_08330" CDS <1..211 /locus_tag="DP116_08330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319340.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08330" /translation="SAAKANPEVNQAVQELVAAAKTDPKIASKVQELVETNINSQPAT VINNTKLAEEIKNVFQGNTIIGGTF" gene 212..3124 /locus_tag="DP116_08335" CDS 212..3124 /locus_tag="DP116_08335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198487.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="PRJNA477356:DP116_08335" /translation="MQSSERERAKLNPPNNVVFQGSANFVGRQHELSLLQELLQQPGA VAISAVSGMGGVGKTELATQYARQHQADYPGGICWLNARESNLAAEIIQFAQAYMNLE VPQQDFRGKLLSLNEQVHWCWQNWQPPEGLALVVLDDITDLGSCREFLPTAHRFRVLM TTRLRNLDSNIEEISLDVLSEEEALQLLTKLVGERRVQKEEETAKLLCEWLGYLPLGL ELVGRYLAKKPPHWTLAKMLERLKQQRLQDEAINRHQKQLQQTLSTAQLGVLAAFELS WDELDPTTQCVGELLSLFAPDIFACVWVESATGRLNWDASEVETALEQLYSRHLIQWV EDKSGDFDDSYKIHPLIREFLKVKQAASEQTVIASEAKQSQPPFFRSACVSPMKRAFA ETFIAIAKKIPQSPTQERIKSVKDAIPHLAEVAQNLTDAVSDENLISAFLGLGLFYQG QGLYALAKPWLEQCVSVVQSRLGEEHPDVATSFHNLANLYCSQGRYTEAEPLLLKALE LTQRLLGQEHPSLAPSYNNLANLYCYQGRYTEAEPLQIKALELLQPYLGEEHPLIAAS YNNLALLYSDQGRYAEAEPLFIKALELYQRLPGEEHPDLATSYNNLAGLYNSQGRYTE AEPLYIKALELRQRLLGEEHPDLATSYNNLAGLYNSQGRYTDAEPLYIKALELMRRFL GEAHPDIAQSYNNLAVLYCYQGRYTEAEPLHVKALELRQRLLGEEHPDIAQSYNNLAL LYRSQGRYTEAEPLHVKALELTRRLLGDEHPDVTTRYNNLALIYSDQGRYTEAEPLFL KALELRRRLLGEEHPSLANSYNNLAFLYRDQGRYSDAEPLYIKALELRQRLLGDEHPD MAQSYNNLAGLYESQGRYSDAEPLYIKALELRQRLQGEEHPSVATSYNNLASLYKSQG RYTDAEPLYIKALEIDERSLGVNHPKTITIRENLQFLRDNRQSYNSEHFDKLSASQ" gene complement(3183..3596) /locus_tag="DP116_08340" CDS complement(3183..3596) /locus_tag="DP116_08340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307443.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="putative toxin-antitoxin system toxin component, PIN family" /protein_id="PRJNA477356:DP116_08340" /translation="MTNKQRFVFDTNVLISAFLFSQSKPRQALDKAQDIGVIIFSSSV FSELREVLYRPKFDRYLTEERRQELLEDLTQTAQFIDVTEQISECRDPKDNKYLELAL SGQAECIVTGDDDLLVLNSFRGIKILTVQEFLARN" gene complement(3586..3861) /locus_tag="DP116_08345" CDS complement(3586..3861) /locus_tag="DP116_08345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08345" /translation="MKDEKSEQTMITKEEITIKVPSEVAEAYRNASEEEREQLQLKIA VIMQSQFTTDRQEAIARLRNTMDKASLEAQERGLTPEILESILNDDE" gene complement(3967..4545) /locus_tag="DP116_08350" CDS complement(3967..4545) /locus_tag="DP116_08350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199687.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_08350" /translation="MTALTLNLNSVIKLTREQFYQLCEENSDLKLERNAQGELIIMPP TGGETGKSNSTINAQIWFWNDQNQLGEVFDSSTGFTLPNGADRSPDVSWVEKSRWDAL TKEQKEKFIPLCPDFVIEILSPNDSLKKTQNKMQEYMENGCRLGWLINRKKQEVEIYR PKQDVEILKLPQTLSGENVLPGFILNLQKIWG" gene complement(4871..4943) /locus_tag="DP116_08355" tRNA complement(4871..4943) /locus_tag="DP116_08355" /product="tRNA-Ala" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(4908..4910),aa:Ala,seq:ggc) gene complement(5259..5450) /locus_tag="DP116_08360" CDS complement(5259..5450) /locus_tag="DP116_08360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191003.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08360" /translation="MLIPILVFDVALVAWSLHLMERAYESKEFSLMLAGTLVAIAAAA MLVVYFLMGHCISYLLQVS" gene complement(5554..6321) /locus_tag="DP116_08365" CDS complement(5554..6321) /locus_tag="DP116_08365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="imidazole glycerol phosphate synthase subunit HisF" /protein_id="PRJNA477356:DP116_08365" /translation="MLAKRILPCLDVKAGRVVKGVNFVNLRDAGDPVELAKVYNDAGA DELVFLDITATHEDRDTIIDVVYRTAEQVFIPLTVGGGIQSLENVKNLLRAGADKVSI NSTAVRDPDFINRASDRFGNQCIVVAIDARRRLDPNHPGWDVYVRGGRENTGIDALFW AQEVEKRGAGELLVTSMDADGTQAGYDIELTRAIAQSVQIPVVASGGAGNCEHIYTAL TEAQAEAALLASLLHYGQLSVAEVKTYLRDRQVPVRM" gene 6579..7679 /locus_tag="DP116_08370" CDS 6579..7679 /locus_tag="DP116_08370" /EC_number="3.6.4.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Holliday junction branch migration DNA helicase RuvB" /protein_id="PRJNA477356:DP116_08370" /translation="MAIISSKKQPQEPNKEPKQRREVAKAPPKENLLQPEAAVDEGKQ EESIRPQRFADYIGQKDLKDVLDIAIKAAKSRGEVLDHLLLYGPPGLGKTTMAMILAS EMGVNCKITSAPALERPRDIVGLLVNLKPGDVLFIDEIHRLSRMSEEILYPAMEDYRL DITIGKGSSTKTRSLPLSKFTLVGATTRAGALSSPLRDRFGLIQKLRFYEVDELSKIV LRSAQLLQTNITEDGAAEVARRSRGTPRIANRLLKRVRDYAEVKSFKEVNEQVAAEAL QLFQVDPCGLDWTDRRMLSVIIEQFNGGPVGLETIAAATGEDTQTIEEVYEPYLMQIG YLSRTTRGRVATPTAYKHLGFKPPNEQLSLLP" gene 8041..8847 /locus_tag="DP116_08375" CDS 8041..8847 /locus_tag="DP116_08375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128892.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08375" /translation="MMRLIVIFLSLLLVFGWGEVVMAETKSLNFTPEQLQQGEELANK AIAATNKGDFGTAERYWTQILEQFPDNPAAWSNRGNSRVSQNKLQEALADYNKAVELA PNVTDPYLNRGTALEGLGKWDEAIADYNHVLELDPKDAMAYNNRGNAYAGLRKWEEAI ADYKKSTEIAPNFAFARANYVLALYETGKVDEAIREMRNILRKYPNFADVRAAITAAY WVQGKQGEAESNWVSAVGLDGRYKDIDWVANVRRWPPSMVTALDKFLTLK" gene 9358..10281 /locus_tag="DP116_08380" CDS 9358..10281 /locus_tag="DP116_08380" /inference="COORDINATES: protein motif:HMM:PF03321.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08380" /translation="MYEQIKSEMETWFKEYNNIFISQENFLFSLLNKLKDTAVGKERG YADIKTIEQFQAQTPIQHYSDLKPYIERIVAGETNVLYPEPPTHWIQTSGTTGSPKLF PYNTEFEKTFLTSGNAILDCFIYRVGSKALKILEGETLLIHASGNCGTVGTGATKKTL AFVSGWIAKHSSPENPFAPPIGIQSIEDWEERMLQTGVYYVQRNLTRVGGVTSYALMF LQEIENAFGGKLFSVLAENNPERAAELQQFYREDGKLKISRIWPNLLCLQLAGVDPYK YRSWIDENLPNSLIFQSYVGSEGVYGFQTDR" gene complement(10551..12764) /locus_tag="DP116_08385" CDS complement(10551..12764) /locus_tag="DP116_08385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317931.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-glucan phosphorylase" /protein_id="PRJNA477356:DP116_08385" /translation="MNISSVNTAVQVLGEKLPFPLKRLASLAYNYWWSWTSDRVALFQ TIDPQGWERCGHNPVEMLHSATYERLTQLAEDPYYLKQIGSLVREFDEYMSTKDTWVS RVAPQITQEHPIAYFCAEFGIHESLPVYSGGLGILAGDHLKSASDLGVPMVGVGLLYR QGYFRQRLNRGGWQEDYYVDNPFSQMPMELMRNWHGEPITIQLQVRQRMVRVQIWRVQ VGRVSLYLLDSDRQDNDPIDRWLTGHLYGGNQETRIAQEVVLGIGGVKALTALGIQPS VHHLNEGHAAFSTLEVARQEIERTGKSFYDIEADVRNRCVFTTHTPVPAGHDVFSPDL IDSFFAHYWTQLRLSRQQFLALGARRLGDPWEPFGMTVLALRMCRAANGVSELHGHVS RKMWTILYPQRSEDKVPIGYITNGVHAPTWTAPLMAELYAQYLGEDWKTRVIDPKTWE KVDNIPNEELWWRHLVLKERLIAYTRYKVKKAREQRGEEYQKIQVAETLLDPKVLTIG FARRFSPYKRGHLLLRDAERAMRIFGNAQRPVQIIFAGKAHPADEEGKRIIQRLMEWC QHSAIQHRVAFIEDYDIYVGQKLVQGVDVWLNNPRRPLEASGTSGQKVCFNGGINCSV LDGWWCEGYKTDANGQGINGWAIGEDAHTSDQDLQDSIDAESLYKLLEEQIVPLYYDQ DANGTPQRWVQMMKASIKTNAPLFNTDRMIADYVSQVYVPEIATRVGPILAKVLV" gene complement(13634..16246) /locus_tag="DP116_08390" CDS complement(13634..16246) /locus_tag="DP116_08390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="primosomal protein N'" /protein_id="PRJNA477356:DP116_08390" /translation="MYMNGVSLSSLIVAEPGQSYQSEKLVNRWIEVLVDCPGGSGLYT YRLPDHLEVKPGDILSVPFGTQLLGGIAIRFVTQPPVDLPIEKVRNVEDVVSVGFFAS NYWELLNRVASYYYTPLIQVIRIALPPGLLGRSQRRIRLPKNDNNETNLSSGSSSLPF LSSAAQQILKLLQAQADGDYSFAYLQRQVKAAYQGVRELQRRGLVESYLEPPQLTRPK QQKAVIMIGATFERDLTIRQREVLEVLRRRGGELWQSELLQICSASTSILKTLEQKGY IIIQEREVLRTAAVFPPLGDGTQADTPAMDMTKILTSAQSEALGVITHLEGCAIVLLH GVTGSGKTEVYLQAISPLLGKGKSALVLVPEIGLTPQLTDRFRARFGNKVSVYHSALS DGERYDTWRQMLTGEPQVVIGTRSAVFAPLPNLGLIILDEEHDSSFKQDSPIPTYHAR TVAGWRAELENCPLVLGSATPSLETWVSVGEQINSKSRSVASVATQSQSSKLEITSTP PTHYLSLPERVYSRPLPPVEVVDMRLELQQGNRSIFSRSLQSALEQLLERRQQGILFI HRRGHSTFVSCRSCGYVLECPNCDVSLSYHHTEENAPQTLRCHYCNFGRSHPQSCPEC GSPYLKFFGSGTQRVAQELAKQFPQLRFIRFDSDTTRTKGAHRTLLTQFVNGEADLLV GTQMLTKGLDLPQVTLVGVVAADGLLHLSDYRSSERALQTLTQVAGRAGRGEEPGRVI VQTYTPEHSVIEAVQNHDYQSFIHAELEQRQALNYPPYGRLILLRLSSPDPIQVENTA GLIAAALPTHEGLDILGPAPASILRVANRYRWQILLKFAPSALPQLPDWEEVRQLCPG SVSLTIDVDPLNMI" gene 17460..18593 /locus_tag="DP116_08395" CDS 17460..18593 /locus_tag="DP116_08395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114702.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor, RpoD/SigA family" /protein_id="PRJNA477356:DP116_08395" /translation="MNIAELGTMETMGSAADNEELFITLDAVAEDESLVVVENIEAED RDGDQMAAARPSGYNKTEYDDAVGAFFKEMARYPLLKPDEEVELARRVRFIEEIRELQ ASLLEKLENQPSKETVASHIGMTEKQLEHRLYQGRVAKRKMIRSNLRLVVSIAKRYLN RGVPFLDLIQEGAMGLNRATEKFDPDKGYKFSTYAYWWIRQAITRAIANDARTIRLPI HIVEKLNKLKKAQRELKQRLGRNPSEQEMADALDVPAQQLRQLQQLRRQALSLNHRVG KEEDTELMDLLEDEDNLSPEAKMNESMMRQEIWEVLGDVLTPREKDVISLRYGLITSE PCTLEEVGTMFNLSRERVRQIQSKAMRKLRRPHIAKRLKGWLV" gene 18801..20912 /locus_tag="DP116_08400" CDS 18801..20912 /locus_tag="DP116_08400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TonB-dependent receptor" /protein_id="PRJNA477356:DP116_08400" /translation="MKFSFLIFQVILINFALAPVTFAQEKNFEQKKTSIKKTSEIPYL SEIEKPYTSAELLTQTPIPSNKPQTTDNPEPNSTTNESEPKVEDDGAIIIDVTGKKDD LPQSTPTYVIEKEEIEKQGATSVSDVLKKLPGFAINDSGHGADIHTGTYYRGASINQS VFLINGRAINTNVNTYHGATDLNSIPVEAIERIELYSGAASTLYGSSAFGGVVNIITK EGSRVPRLNATAEFGDLNFNNQQLSYGGATGSLRYNLSFERSFIDNRYRVPVGAANRD SQGYFFNADTATSTYFGSLAFDVDPRNTLSLDVTTLSSRRGLIYFGFPLQRDRLDHDN LNIGFSWKTRLGNGNNSVVTTTLGYNQDYFNTYGPSSQFYRTGTLDTQLYTARIDHVW QLTPNYKLRWGLDLQNTELNGDVLSTVPNRIALNQDESESLLNTALFAVNTLNITDNF QVDLGLRQSFDSKFGNYLNPSVGFKYDIAPALAVRGSWADGQRNPGLDQLYIYDTVHG WLPNPDLKPETGSSWTAGVDVRFAEDLTGQFTYFGSSLDNRLGVIQGKWANIGLVDTN GFEAALRWKVASGWSTFINYTYTDAKIKTGSEKDLQLGLIPYSIASAGIGYENRGWQA NLYATYYGGARRAIYTRVGDTATDFSPSFFNLDFSARVPVTKNLGLTVYLENLLDEQY ERVNRIYSPGFTFRLGLTANI" gene 21342..21638 /locus_tag="DP116_08405" CDS 21342..21638 /locus_tag="DP116_08405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997459.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08405" /translation="MRAKIENDVLFLHHEDVPEYKKGGSVVRNSYFWALRSIAGKASR YGDWEYEPEVWFALTRMLLSFAESGYLGLKQTVLEFPLSQGEIPDVLRDVSTWE" gene 22387..22665 /locus_tag="DP116_08410" CDS 22387..22665 /locus_tag="DP116_08410" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08410" /translation="MHKTSEEYNEFTEKYCEFIEEYDKFLQNYEHSSRLVAVMKTSNQ QKTEVGVLYDKALQAHIDTTNTYIQTVAIYRQLVQKWLLMTKSCYKGE" gene complement(22667..23566) /locus_tag="DP116_08415" CDS complement(22667..23566) /locus_tag="DP116_08415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316256.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA pseudouridine(55) synthase TruB" /protein_id="PRJNA477356:DP116_08415" /translation="MQGFLNLNKSFGWTSHDCVAKTRKFLRLKRVGHAGTLDPAATGV LPIALGKATRLLQYLPGEKAYKATIRLGVRTTTDDLQGEIIASQPCTGLSLEVIEPVL QQFVGKIEQIPPIYSAIQVQGKRLYDLARKGEDVEVPARIVEVFKIEVLDWREEEFPE LDVAIACGAGTYIRAIARDLGTILQTGGTLAALTRTQSSGFDLANSLTLTDLEAKVQA GTFQPLLPDAPLQHLGSITLPPTSAQKWCQGQRIPITFNVLEISPKILRIYDEDERFL GIGKLGNLDNEQLLVPEMVFESI" gene complement(23754..24671) /locus_tag="DP116_08420" CDS complement(23754..24671) /locus_tag="DP116_08420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316258.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="EamA family transporter" /protein_id="PRJNA477356:DP116_08420" /translation="MHTTSGRWRLGLALSLVTVFLWGILPIALAVTLQVLDVYTLIWF RFLISFVLLAIYLGWQKKLPRLLKLRSTSWVLLAIAILGLAANYILFTQGLALTAPAN AEVIIQIAPLLMGFGGLFLFRERYTLLQWCGVGILVIGFTLFFHEQLKNLVTAHGQYL LGSGLVVVGAVTWTFYALAQKQLLQSLSSFSIMLILYAGCALLFTPFANPKAIFQLNL FHLGVLLFCAFNTLIAYGAFAESLEHWEASRVSAVLALAPIVTLISVWLVSVIIPTLI LPENISFLGIIGAVLVVTGSVTVALGKTR" gene complement(24741..25913) /locus_tag="DP116_08425" CDS complement(24741..25913) /locus_tag="DP116_08425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872747.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_08425" /translation="MTPKFEYSQVVKEDIQQLGNILEQCFLMSPGESEIFFNRIGLEN FRVIRAGEQVAGGLATIPMAQWWGGERVPMAGIAAVGIGPEHRGSGAALVMMQHAVKE LYAKGVPISTLYPATQRLYRKAGYEQGGISSTWEVPTQSILVKEQPLPVIAVRADHEI FYELYQQHARLNNGFLDRHQGIWQGIFKPHEKEAIYTYLIGRAEQPQGYIIFSQHQDQ DGSFIRVRDWAVLTTAAAQSFWSFLGLHRSQIKNVRWKGSATESLTLLLPEQTAKQKA TSYWMLRVVDVVKALEKRGYPLGIQAELHLEIQDDLLAENNGRFILSVANGRGEVTSG GKGEMKLDIRGLAPLYTGLFTPQQLQLAGQLDATETARFAATQIFAGASPWMADFF" gene complement(26017..26649) /locus_tag="DP116_08430" CDS complement(26017..26649) /locus_tag="DP116_08430" /inference="COORDINATES: protein motif:HMM:PF13302.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08430" /translation="MVRGWILQVKIACKEKMKLLFEHMRIPKTYRINTERCILRCVCE QDIPHVFSATRFQGFNDGMLWEAPQSIEELHEPLQRNLQAWDFGLAYTFTITCVYTST FLGRISIRKHNEVEGVWNLGFWTHPEYQRQGYMTEAARAIVEFGFTVLEAERIEAYHA LWNTGSEKVLKRIGMKFVCYIPQGFQKHGKWVEENLLAIERKDWRALKET" gene complement(26621..27394) /locus_tag="DP116_08435" CDS complement(26621..27394) /locus_tag="DP116_08435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316261.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polar amino acid ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_08435" /translation="MNNSSSAIIFENIEKSYGSLKVLKGISGEIKRGEVVAVIGASGC GKSTLLRCFNRLETIDSGRLLVNDINLSQPNFSNRQLRQLRTQVGMVFQQFNLFPHLS VLENLTLAPRQVLGKSAKESTQLAGLYLEKVGLFDKASAYPEQLSGGQKQRVAIARSL CMNPQVMLFDEPTSALDPELVGEVLQVMQQLAAEGMTMVVVTHEMQFACEVAHQVFFL DQGIVVEQGKACKVLSHPESDRLRAFLSRLNGKGLDITG" gene 27588..28439 /locus_tag="DP116_08440" CDS 27588..28439 /locus_tag="DP116_08440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316262.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="basic amino acid ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_08440" /translation="MFEPKMTRSHFIKHFLTGFAATVILSACDNSASNTSSSLSGNSQ TATAGKTIKVATEPAFPPFESKSSGNELVGFDIDLIKAAGQAGGLTIEFQSLPFDGII PALQANTIDAAISSITITPERAQAVSFSRPYFRAGLAIAIRQDNTTITNLDSLKGKKI AAQIGTTGSKKAKSISGAQVREFDSAPLALQELANGNVDAVVNDAPVTIEAIKSGNIK GLKVVGQLITEEYYGIALPKNSPNLNAINTALAKIISDGTYAQIYKKWFNAEPPQLPE TVPGSSS" gene 28597..29307 /locus_tag="DP116_08445" CDS 28597..29307 /locus_tag="DP116_08445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316263.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nickel transporter" /protein_id="PRJNA477356:DP116_08445" /translation="MIQSLTIILNALPNLLLGAVVTLEITALSVVFGMIGGSLIGIAR LSPILLLRFCTRAYVDFFRGTPLLVQIFMIYFGLPALAQSIGIPLRFDRLLAAVVALS LNSAAYIGEIVRAGIQSIEPGQAEAANSLGMSGVQTMRYIIFPQALRRMIPPLGNEFI SLLKDTSLVSVIGFEELLRRGQLIVADTYRAFEIYTAVALVYLVLTLASSQFFSRLEV WMNPIKRQKASQKKLSSP" BASE COUNT 8673 a 6277 c 6373 g 8280 t ORIGIN 1 atcagcagct aaagctaatc ctgaagtcaa tcaagctgtg caggaattgg tagcagcagc 61 caaaactgat ccaaaaattg cttcaaaggt tcaagagcta gtagaaacta acatcaattc 121 tcaaccagca actgttataa ataacactaa gttagctgag gaaattaaaa acgtctttca 181 aggaaatact ataatcggtg ggactttttg actgcagtcg tccgaacggg aacgggcaaa 241 gttaaatccc ccaaataatg ttgtgtttca aggttctgcc aattttgttg gcagacagca 301 tgaactttcc ctactgcaag aattgttgca gcaaccaggc gcagtcgcaa tttctgctgt 361 ttctggtatg ggtggtgttg gcaaaactga actcgcaacg caatatgcac gtcaacatca 421 agctgattac cctggtggaa tttgctggtt aaatgccaga gaatcgaatc tggctgcaga 481 aattatccag tttgcccaag cttacatgaa tctggaagta ccgcagcagg atttccgagg 541 aaagctgttg agtcttaacg aacaggtaca ctggtgttgg cagaattggc aaccaccaga 601 aggattagcg ttggtggtat tggatgatat tacagatttg gggagttgtc gagagttcct 661 gccaacggct catcgcttcc gcgtgttgat gacaacacgg ttgcgaaatc tagattctaa 721 tattgaggag atatctctgg atgtgctgtc agaagaagaa gctttgcaat tgttgacaaa 781 gctagtgggt gaaagacggg tgcaaaagga agaggaaaca gcgaaactct tgtgtgaatg 841 gctgggatat ctacctttgg gtttggaatt ggtggggcgg tatttggcta aaaaacctcc 901 tcattggact ttagctaaga tgttggagcg gttgaaacag cagcgactcc aagatgaagc 961 gataaaccgc catcaaaagc aattgcaaca aaccttgagt acagcacagc ttggcgtttt 1021 ggctgcattt gagttaagtt gggatgaact cgatccgacg acacagtgtg tcggtgagtt 1081 attaagttta tttgcaccgg atatctttgc gtgtgtttgg gtagagtctg caactgggag 1141 gctgaattgg gatgcaagtg aagttgaaac agcgcttgag caactttact cgcgccattt 1201 gattcaatgg gtggaagata aaagtggaga tttcgatgat tcctacaaaa ttcatccctt 1261 aattcgggag tttttgaaag tcaagcaagc agcatctgaa caaactgtca ttgcgagcga 1321 agcgaagcaa tctcaacccc ctttttttcg tagcgcgtgc 
gtaagtccta tgaaacgcgc 1381 ttttgcagaa acttttatcg caattgctaa aaaaattccc caatcgccaa ctcaggagag 1441 aatcaagtca gtcaaagatg ctattccaca tttagcagaa gtggcacaaa atctgactga 1501 tgcagtcagt gatgaaaatt taatttcggc ttttcttggc ttgggtttgt tttatcaagg 1561 acagggtttg tatgcactgg cgaaaccgtg gttagagcaa tgtgtatcag tcgtccaatc 1621 ccgtttggga gaagagcatc ccgatgtcgc caccagtttc cacaacctgg ctaatctcta 1681 ctgttcccaa ggacggtaca ccgaagcaga accacttctt ctcaaagcat tggaactaac 1741 gcaacgtctg ctgggacaag agcatccctc actcgccccc agttacaaca acctggctaa 1801 tctctactgt taccaaggac ggtacaccga agcagaacca ctgcagatca aagcattgga 1861 actgttgcaa ccctacctag gagaagagca tcccttgatc gctgccagtt acaacaatct 1921 ggctctactc tactctgacc aaggacggta cgccgaagcg gaaccacttt ttatcaaagc 1981 attggaacta taccaacgct tgccaggaga agagcatcct gatctcgcca ccagttacaa 2041 caacttggct ggactttaca attcccaagg acggtacact gaagcagaac cactctacat 2101 caaagcattg gaactaaggc aacgcctgct aggagaagag catcctgatc tcgccaccag 2161 ttacaacaac ctggctggac tttacaattc ccaaggacgg tacaccgacg cagaaccact 2221 ttatatcaaa gcattggaac taatgcgacg cttcctagga gaagcgcatc ccgatatcgc 2281 tcaaagttac aataacctgg cagtactcta ctgttaccaa ggacggtaca ccgaagcaga 2341 accactgcac gtcaaagcat tggaactaag gcaacgcctg ctaggagaag aacatcccga 2401 tatcgctcaa agttacaaca acttggctct actctaccgt tcccaaggac ggtacaccga 2461 agcagaacca ctgcacgtca aagcattgga actaacgcga cgcctgctgg gagatgagca 2521 tcccgatgtc accacccgtt acaacaacct ggctctaatc tactctgacc aaggacggta 2581 caccgaagca gaaccacttt ttctcaaagc attggaacta aggcgacgcc tgctaggaga 2641 agagcatccc tcactcgcca acagttacaa caacttggct ttcctctacc gtgaccaagg 2701 acggtacagc gatgcagagc cactctacat caaagctttg gaactaaggc aacgcctgct 2761 aggagatgag catcccgata tggctcaaag ttacaacaac ttggcgggac tctacgaatc 2821 ccaaggacgg tacagcgatg cagaaccact ctacatcaaa gctttggaac taaggcaacg 2881 cctgcaggga gaagagcatc cctctgtcgc caccagttac aacaacttgg catcactcta 2941 caaatcccaa ggacggtaca ctgatgctga accactctac atcaaagctt tggaaattga 3001 tgaacgcagc ttaggagtca atcatcccaa gacaatcact atccgtgaaa 
atttgcaatt 3061 cttacgcgat aatcgccagt cttataacag tgaacacttc gacaagctca gtgcatcgca 3121 gtgaacactt cgacaagctc agtgcatcgc agtgaacagt gaactgataa ctaatagcgg 3181 tactagttcc tcgccaaaaa ttcttgaaca gtcagaatct taatgccccg aaatgagttt 3241 agcactagta aatcatcatc tccggtaaca atacactctg cttgaccact cagtgccaat 3301 tccagatact tgttatcttt tggatctcgg cattccgaaa tctgttcagt cacatcaata 3361 aactgagcgg tttgagttaa atcctctagc aattcttgcc gcctttcctc ggtgagatat 3421 ctatcaaatt taggacgata taaaacctct cttaactctg agaagacaga actggagaat 3481 atgataacac caatatcttg agctttatcc aatgcctgac gtggcttgct ttgactaaac 3541 aaaaatgcgc taatcaatac gttagtatca aaaacaaacc gttgtttatt cgtcatcatt 3601 caaaatcgat tctaagattt caggcgttaa tcctcgctct tgtgcttcaa ggctagcttt 3661 atccatcgtg tttctcagac gggcaatcgc ctcttgtcta tcagtagtaa attgagactg 3721 cataattacc gcaattttga gttgcaattg ttcacgctct tcttcagagg cattgcggta 3781 tgcctctgct acctcagacg gcactttgat ggtaatttct tctttagtaa tcatagtctg 3841 ttcgcttttc tcatctttca ataagtttcc ccggttttag gatgtacaaa agcatacaac 3901 cataaccatt gaaaccgcca attgacatca gcaattggtg tttttccttc gggtctagca 3961 atgatattat ccccaaattt tttgcagatt gagtataaaa ccaggtaaga cattctcccc 4021 tgaaagagtt tgaggtaatt ttaaaatttc cacatcttgt ttgggacgat aaatttctac 4081 ttcctgtttt tttctgttta tcaaccaacc caagcgacaa ccattttcca tatactcttg 4141 catcttattc tgagtttttt tcaagctgtc attaggtgaa agtatctcga tgacaaaatc 4201 aggacacaga ggaataaatt tttctttttg ttctttagtc aaagcatccc aacgagattt 4261 ttccacccaa gaaacatcag gagaacggtc agcaccattt ggtaaagtaa atccagttga 4321 tgaatcaaaa acttcgccca attgattttg gtcgttccag aaccatattt gagcgttaat 4381 agtagaatta ctttttcctg tttctcctcc cgtaggtggc attataatta attctccctg 4441 ggcatttcgt tctaatttta aatcggagtt ttcttcacac agttgataaa actgctctct 4501 agtcagttta atgacagagt tgaggtttaa tgtaagtgct gtcataggag tatctcccgt 4561 ttttgttggc gcagtgagtt actggagttt agactaaatt cgatttattc ggctacaacc 4621 cttgtttggc aaacatttta gacttctgaa tgaaaaacca gcctagaaat catcaatgcc 4681 tgtctggata aggatttggc ggtttgatag gtaaaattag tctaaactcc aatgagttac 
4741 aaacctatat ccattttgtt ccacaccgta tttcttatat tgtggacgtg accctcagcc 4801 atcaggcaga gtggggagaa tgtcaagaca ttaatccttt caaaatgaca agaattcttt 4861 aatttatata tggagctaag cggattcgaa ccgctgaccc cctcaatgcc attgaggtgc 4921 tctaccaact gagctataac cccgcacttt tacaattcaa aatggactgc gttccgctac 4981 gctaacaaag ttgatggttg ctgagcgcag tcgaagcaaa gttcaaaatg aaaggcatta 5041 gttgcgataa cgcatttgca attttacttt aagaatctgc tatttgtcaa tcccactgct 5101 cgttaattgt taataagtgc cataagccac tagtacaaca aggcaaaagt caaaagtcaa 5161 aagtcaaaag aaagaatgct tatgttgtca gctttttacc tccttcaaat ggtagcttta 5221 tttccgccat gctgtactag ccgatggcaa tgactaactt aagagacttg cagcaagtag 5281 cttatacaat gccccatcag gaagtaaacc actaacatcg ccgcagctgc tatggcaacg 5341 agtgtaccag ctaacatcag agaaaattct ttgctttcat atgctctttc cattaaatgc 5401 agcgaccagg ctaccagcgc tacatcaaat actaaaattg gtatcaacat atctcttaaa 5461 attattaatt attgttaact tatattaaca ctgtttccaa aaattgtgtt gaactactcc 5521 ttgggttgat tgacaaagag ttgcaacttg ggattacatt ctgacaggaa cttggcgatc 5581 gcgcagataa gttttcactt cagctacgct taactgtccg taatgtaaaa gtgaggcaag 5641 taaggctgct tctgcttgag cttctgtgag tgcggtatag atatgttcac aattgcctgc 5701 accgcctgat gcaacgaccg gaatttgtac agattgagcg atcgcccgag tcaactctat 5761 gtcataacca gcttgagttc catcggcatc catacttgtt actagcagtt ctcctgcacc 5821 gcgtttttcg acttcttgcg cccaaaacag ggcatcaatg ccagtatttt ctctaccacc 5881 tcgcacataa acatcccaac cagggtgatt ggggtctagt cgtcgtctgg catcaatcgc 5941 aacgactata cattgatttc caaaacggtc actagcccga ttaataaaat ctgggtcgcg 6001 taccgccgta gaattaatac taaccttgtc tgctccagct cgtaaaagat ttttaacatt 6061 ttctaaggat tggatcccac caccgacagt gaggggaata aagacctgtt ctgcagttcg 6121 gtacactacg tcgataatag tgtcgcggtc ttcatgagtg gctgtaatat ctagaaacac 6181 taactcatct gcgcctgcat cgttgtaaac ctttgcgagt tcgactggat cgcctgcatc 6241 cctcaaattt acaaagttaa ctcctttgac aactcgtccc gccttaacat ccaggcacgg 6301 taagattctt ttagctagca tgagtagttt tacactcctg tgatgactat ctggaattga 6361 aattttaact tagcgaaata ttacagaaac aatattttcg catgtgagta gcggtttttg 6421 
agaaatcggg atatagggga aaatagaagc tgtaggtggt gaaaaatttt tgcataataa 6481 acactcagcg agtcaactgc acgctttgtt ctctctacac cctacagttt tacactccca 6541 acacaaaagt cgtataccaa aagtcaatag cagcaattat ggcgattatc tcctcgaaaa 6601 aacagcccca agaacccaac aaagaaccaa agcagcgtcg tgaggtggcg aaagcgcctc 6661 ctaaagagaa tcttttgcaa cctgaagcag cggttgatga aggtaagcaa gaagaaagta 6721 tacgcccaca acggtttgct gactatatcg ggcagaaaga tttaaaggac gtgctagata 6781 ttgccataaa agcagccaag tctcggggtg aggtgctaga tcacttactg ttgtatggtc 6841 ctccaggatt ggggaaaacc acaatggcta tgatactggc atctgagatg ggagtaaatt 6901 gcaaaattac gagtgcacca gctttagaac gtcctagaga tattgtcggg ctattggtga 6961 atcttaagcc aggagatgtt ttatttattg atgaaattca tcgtttgtcg cggatgagcg 7021 aagaaatttt atacccagcg atggaggatt atcgcttaga tattactatt ggtaaaggtt 7081 ccagcactaa gactcgcagc ttaccactat caaagtttac tttggtgggg gcaacaactc 7141 gtgctggggc gctgagttca ccgttgcgcg atcgctttgg cttaattcaa aaactcagat 7201 tttatgaagt cgatgaactg agtaaaattg tcctacgcag cgctcaatta ctccaaacga 7261 atatcactga agatggtgct gcagaagttg cccgtcgttc ccgaggaaca ccacgtattg 7321 ctaatagatt actaaaaaga gtccgtgatt atgcggaagt aaaatcattt aaagaagtga 7381 atgaacaggt tgcagcagaa gcattgcaac tatttcaagt cgatccatgc ggtttagatt 7441 ggacagatcg tcggatgttg agtgtgatta ttgaacagtt taatggtggc ccggtgggat 7501 tggaaacaat cgctgcagca acgggtgagg atacacaaac gattgaggaa gtgtatgaac 7561 catatctgat gcagattggg tatttaagcc ggacgactcg tggtagagta gcgacaccca 7621 cagcttacaa gcatttgggt tttaaacctc caaatgaaca gttatctttg ttgccgtgat 7681 ttgattttgg ggattggaga atttatgttc ttgtttccaa gttccagaaa acttggatga 7741 aggcttgcag gttgagtcag ttctcgtcat gattaatttt ggattaggag ttacgcaaaa 7801 ataaggcaat tgttgtcatt gcgaccgaag ggaagcataa gccctaagtc cttacggaca 7861 cgctgcgcgt tcgcccttgg cgtgcgcttg cgcttacggg cacgctgagc gccaaaggcg 7921 cacgctgcgc gttagccctc tgggcgtgcg caaagcgcat acgcgaacgc aaagctttat 7981 ttttcacgag tgcgtaagtc ctatgaattt ggaaccgctg atggacgcag ataaacgcag 8041 atgatgagac taattgttat ttttctaagt ctgttgctgg tgtttgggtg gggtgaggtt 8101 gtcatggcag 
aaacgaaatc cctcaatttc actccagaac agttacagca aggtgaggag 8161 ttagcgaata aggcaattgc tgctactaat aaaggtgatt ttgggacagc ggaacggtac 8221 tggacgcaaa ttcttgagca atttcccgat aatccagcgg cttggagtaa ccgaggaaat 8281 tctagggtta gtcaaaataa gttgcaagaa gcactcgcag attataataa ggcggtagaa 8341 ttagctccga atgtgactga tccctacttg aatcgtggta cggcgttgga aggtctggga 8401 aaatgggatg aggcgatcgc agattataat catgttctag agcttgatcc caaagatgca 8461 atggcatata acaatcgggg gaatgcctat gctggtttga ggaagtggga ggaggcgatc 8521 gccgactata aaaaatccac tgaaattgcc ccgaattttg cttttgcccg tgcgaactat 8581 gtccttgccc tttatgaaac tggtaaagta gacgaagcga ttcgggaaat gcggaacatt 8641 ctccgtaaat atcctaactt tgctgatgtg cgtgctgcta ttactgcagc ctactgggta 8701 cagggaaaac aaggtgaggc ggagagtaac tgggtgtcag cagtcggact ggatgggcgt 8761 tacaaggata tcgactgggt ggcgaatgtt cggcgatggc cccctagtat ggtcacagct 8821 ttggataagt ttttaacact caagtaagat atctgtttgg cagagggctt atataccaaa 8881 agtaggttaa ttttgcaagc cttctgtcaa aaaattgcga tttctttaca tttgtgctgt 8941 atcaaagtaa gtgcataagt tgattttatt tgcacttaat aatatagata acgggttttg 9001 agtttaaccg ttgcatggtt gtggctgtag tggttttgtc actttttagc actgaactaa 9061 aggtagtctg gagtgtccgc tgattttgag actgtttaag ggtcaccagc accgacagac 9121 cctctgctag tgccaacaca acgcccttat caaatcagtg cataagttga tttcatttac 9181 acccaataaa taaaaggttt tgactttaac gattgcatgg ttctgcctac agtgattttc 9241 ttactttatt aaagttaacg tgagttcgac ggaactaaaa aacagcagga ggtagtgcag 9301 gtaaggtaaa attatagtta gtaaacttcg tgaagacatc aattgccagg tgctattatg 9361 tacgaacaaa ttaaaagtga aatggaaact tggttcaaag aatataataa tattttcatc 9421 agtcaagaaa actttctgtt ttctctatta aataaactca aagacactgc tgtaggtaaa 9481 gaaagaggct acgctgatat taaaactatt gaacaattcc aagctcaaac ccctattcag 9541 cattacagcg atttaaaacc gtatattgag cggattgttg caggtgaaac caatgtactt 9601 tatccggaac cacctactca ttggattcaa acttcaggca caactggttc acctaaactg 9661 tttccttata atacagagtt tgaaaaaact tttttgactt ctggcaatgc tattcttgat 9721 tgttttatct atcgtgttgg ttcgaaagct ttaaaaatat tagaaggtga aactttatta 9781 atacacgcta gtggtaattg 
tggcactgtg ggaactggag cgaccaaaaa aactctagca 9841 tttgttagtg gctggatagc aaaacatagt tctccagaaa atccttttgc acctcctata 9901 ggtattcaat cgattgaaga ctgggaggag agaatgctgc aaacaggagt ttattatgtg 9961 caaagaaatc tcacgcgcgt agggggagtt acaagttacg cattaatgtt tttacaagaa 10021 atagaaaatg cttttggtgg caaactattt tcagtattgg cagaaaataa tccagaacga 10081 gcagcagagt tacagcaatt ctatcgagaa gatggcaagc taaaaatctc tagaatatgg 10141 cctaatttat tatgtctaca actagctggt gtagatccat ataaatatag aagttggata 10201 gatgaaaatt tgcctaatag tctaattttt cagagttatg ttggttcaga aggtgtttat 10261 ggatttcaga cagatagata attacttcat ccaaacgaga agcgctatat tatccaaata 10321 tagcaatccc aaatcaattc atgaacaaca acattcccga ctcctcgcga gaagtctccg 10381 aatctgaacc aatggatttt caaaggaaca aaccggattc ctatattctt taaaaattct 10441 ttaaaaattc ttgccgaaac tgatacggac atttatttat actggcaata aaataaaaaa 10501 aggcaggttt aaaacctgcc caaaagctga accaagaaac ttaaaattcc ttacactaga 10561 acctttgcta aaatcggtcc aacacgagtg gcaatttctg ggacgtatac ctgtgaaacg 10621 tagtcagcaa tcatcctatc tgtgttgaac aatggtgcat ttgtcttgat tgatgccttc 10681 atcatttgca cccagcgctg aggagtgccg ttagcatctt ggtcgtagta caagggcact 10741 atttgctctt ccaagagttt ataaagagat tccgcatcta tactatcttg taaatcttga 10801 tcactggtgt gagcatcttc accaattgcc catccattta tcccttgacc atttgcatct 10861 gttttgtatc cttcgcacca ccagccatct agaacgctgc aattaattcc accgttaaag 10921 cagacttttt gtccactggt accagaggct tctaaggggc gacgagggtt atttaaccag 10981 acatcaacac cttggacaag tttttgacca acgtaaatat catagtcttc aataaaggca 11041 actcgatgct gaattgcgga atgctgacac cattccatta agcgttgaat aatccgtttg 11101 ccttcttcat cagctgggtg agctttacct gcaaagataa tttgtacggg acgttgtgca 11161 ttgccaaaaa ttctcattgc ccgttcggcg tcacgcaaaa gcagatgacc gcgcttatat 11221 gggctaaagc gtctggcaaa tccgatagtc agcactttgg gatcaagcag tgtttctgcc 11281 acttgaattt tttggtattc ttcaccgcgt tgttcccgtg cttttttcac tttataccga 11341 gtgtaggcaa tcagtctttc ttttaagact aagtgtcgcc accagagttc ctcatttgga 11401 atattgtcaa ctttctccca cgtctttggg tcaatgacac gagttttcca gtcttcgcct 11461 
aagtattgag cgtacaactc tgccattaag ggagcagtcc aagtgggcgc atggacgcca 11521 ttagtaatgt aaccgattgg gactttgtct tcagaacgtt gtggatacaa aattgtccac 11581 attttacgag aaacatgacc gtgcaattca ctgacgccat tggctgcacg acacatccgt 11641 agtgctaaaa ccgtcatgcc aaagggttcc caagggtcgc ccagtcgccg tgcgcctaat 11701 gccaagaatt gctggcggga taggcgtagc tgagtccagt aatgggcaaa gaaggagtct 11761 atcaagtcag gtgagaagac atcatgacct gcgggaacgg gtgtgtgggt ggtgaataca 11821 caacgatttc gcacatcggc ttcgatgtcg tagaaggatt tgccagtgcg ctcaatttct 11881 tggcgtgcaa cttctaaagt ggagaatgca gcatgacctt cgttgaggtg atggacagaa 11941 ggttgtatcc ccaaggctgt taacgccttg acaccaccaa ttcccaagac gacttcttgg 12001 gcaatgcgag tttcctggtt accaccgtaa aggtgtccag ttagccagcg gtctatggga 12061 tcattatcct ggcgatcgct atccaataag taaagactca ctcgccccac ttgcactcgc 12121 caaatttgca ctctcaccat tcgttggcga acttgcaact gtatcgtgat tggttccccg 12181 tgccaatttc tcattaactc cataggcatt tgggagaagg ggttgtcaac gtagtaatct 12241 tcttgccaac cgccgcgatt caaccgttgt cggaagtaac cttggcgata caacaagccg 12301 acaccaacca ttgggactcc caaatctgat gctgatttta ggtggtcacc tgcaagaatg 12361 cctaaaccac cagagtagac aggcaaagat tcatgtatgc caaattcagc acaaaagtaa 12421 gcaatgggat gttcttgggt aatttgtggt gcaacccgac tcacccaagt gtcttttgtg 12481 ctcatgtatt cgtcaaactc gcgcacgagt gaccctattt gcttgagata atacgggtct 12541 tcagcaagct gggtaagacg ttcgtatgtc gctgagtgta acatttccac aggattatgc 12601 ccgcagcgtt cccatccttg aggatcgatg gtttggaaca gcgcgacgcg atcgctcgtc 12661 caactccacc aatagttata agccaaagaa gccaagcgct tgagtggaaa aggtaatttc 12721 tcacccagta cttgcactgc ggtgtttacg ctgctgatat tcataaaact ctaggcactc 12781 tcttagtttt tctgttttgc cgtcaaaatt tgcttcccag tgggcagact tggtctggga 12841 atcatccttt caacagtgaa cagtgaacag tgaacagtga acaaggggtg gacgagtccg 12901 tttcctacct gataactgat aactggtaac tgataactga taactgtttc aagcgtggag 12961 tcgttttttg acggttttta tttttctaga cgtaatccaa atcatccttc ttttttgcca 13021 gttttgacaa ctttatctgc atcaaattaa ttgacgttct tttaatttag tttttattat 13081 tttttcttaa tatatggtga tcaacaagat tcccgacttc atctgaataa 
atagattttc 13141 acagatcaaa ccggattgct atgttagggc ttacgcacaa gagatccctc aaccacgcca 13201 ggtcgcaccg tcggggggaa ccccaacgcc ctatggctta cgccacgcct tacggctatc 13261 ggggaagacc tgagccctat ggctaacgcc acggctagcc ccgttcgggg gcctgcgcta 13321 acgcctatcg ggcacggccc cgtgccgaac ggtctcgcta cgctttagac ctctgggcgt 13381 gcgctttgcg catacgccag tcgcctacgg agggagaacg ccacatgcaa caagagcggc 13441 acgccgacgg cagatgcttc aagccgggaa accccgtcca acgcactgcc tccccaacgc 13501 agtggctccc ctcctgcagc gctggtgtca ccagatacca agtgagggaa accctcatca 13561 agtactggcg cagcaacgca ctggctcccc ttaaaaaggg ggcttttaag attcctttaa 13621 agccagggtt tgctcaaatc atatttaacg ggtcaacatc aattgttaaa ctgacagacc 13681 caggacaaag ttgacgcact tcctcccaat ctggcaactg tggtaaagca ctgggggcaa 13741 attttagcaa tatctgccag cggtaacgat tagcaacccg taaaatagaa gcgggtgctg 13801 gtcccaaaat gtctaaccct tcatgagtcg gcaaagctgc tgcgatcagc ccagctgtgt 13861 tctccacttg aattggatca ggactactca agcgcaataa aatcaatctc ccatagggag 13921 gataattgag agcttgccgt tgttccaatt cggcatgtat gaaagactga taatcgtgat 13981 tttgcactgc ttcaatcaca gaatgttctg gagtgtaagt ttggacaatt actcgtcctg 14041 gctcttctcc tctgccagca cgtccagcaa cttgtgtcaa ggtttgcaat gctcgttcgc 14101 tggagcgata gtctgacaga tgcagcagtc catccgcagc aacaacaccc acgagtgtga 14161 cttgaggcaa atctaatccc ttggtgagca tttgcgtacc gactaacaaa tctgcttcac 14221 cgttaacgaa ctgtgttaag agagtacggt gtgctccttt tgtacgggtg gtatcgctat 14281 caaaacgaat aaagcgcaat tggggaaact gttttgctaa ctcctgtgcg actcgttggg 14341 taccactgcc gaaaaatttt aggtagggag aaccacattc tggacagctt tggggatgcg 14401 atcgcccaaa attacaataa tgacaccgca gtgtttgcgg cgcgttctct tctgtatggt 14461 gatacgacag cgacacatcg cagtttggac attccaaaac atatccacaa ctgcgacaag 14521 aaacaaaagt gctatgtcct cgacgatgaa taaataaaat tccttgctga cgacgttcta 14581 gcaattgctc caaagcggat tgcaaggaac gactaaatat agaacgattt ccctgctgca 14641 attctagccg catatctaca acttccacag gaggtagagg acgggagtag acgcgttcgg 14701 gaagagagag gtagtgagtc gggggagtgg aagttatttc taattttgaa ctttgagatt 14761 gcgtcgcaac gcttgcaacg ctacgcgatt 
ttgaattgat ttgttccccc acactcaccc 14821 acgtctccaa cgagggggtt gctgaaccca acaccagggg gcaattttct aactctgctc 14881 gccaccccgc aacggtgcgg gcgtggtagg tggggatggg agaatcttgc ttaaagctgc 14941 tgtcgtgttc ttcatctaat atgattaaac ccaagtttgg taaaggagca aaaactgcac 15001 tgcgcgtacc gatgacaact tggggttctc cggttaacat ttgcctccag gtgtcgtaac 15061 gttcaccgtc tgaaagggcg ctgtggtaaa cgctcacttt attaccgaaa cgagcacgga 15121 aacgatcagt caactgaggt gtgagtccaa tttctggtac taagacaagg gcagacttgc 15181 ctttccctag tagagggctt attgcttgca aatacacctc tgtttttcct gaaccagtga 15241 caccatgcaa caaaactata gcgcatcctt ctagatgggt aatgacccct aacgcttccg 15301 attgagctga tgttaatatc ttggtcatat ccattgcggg agtgtctgct tgtgtcccat 15361 ctccaagagg agggaacaca gcagcagttc gcaacacttc tcgttcttgg atgatgatgt 15421 atcccttttg ttctaacgtc ttaaggatgg aggtacttgc actacaaatt tgtaacagtt 15481 cactctgcca caactcgccc ccgcgtcttc gcagcacttc caaaacttct cgctggcgaa 15541 tggttaagtc acgttcaaaa gtagcaccta tcattatgac tgctttttgc tgttttggtc 15601 gagtcagttg cggtggttct agataacttt ccaccaaacc ccgtcgctgc aattcgcgta 15661 ctccttggta agcagctttg acttggcgtt gcaggtaggc gaaactgtaa tctccatctg 15721 cttgtgcttg taaaagtttc agaatttgct gtgctgctga ggagagaaag ggtagggaag 15781 atgagccaga tgagaggttt gtttcgttgt tgtcattttt agggaggcga atgcgacgtt 15841 gcgatcgccc taacaaacct ggtggtaagg cgatacgtat gacttgaatc aggggtgtat 15901 aatagtatga tgcaactcga ttcaaaagtt cccaataatt cgaggcaaaa aaacctacac 15961 tgacgacatc ttctacattg cgaacttttt ctattggtaa atctactgga ggttgtgtta 16021 caaaacgaat ggcgattcct cctaacagtt gcgtaccaaa tggcacactt aaaatatccc 16081 ctggttttac ttccaagtga tcaggtagcc gatatgtata aagtcctgaa cctcccggac 16141 agtctaccag tacctcaatc caacgattca ctagtttttc tgactggtaa gattgaccag 16201 gttcggcaac tatcaaagag gacaagctga caccattcat atacatactt tgtgagctaa 16261 tagtacagat attttcttta ccaaaacagt tccaagtccc tagtgcttaa gaagtctttt 16321 ttctttgaaa tcctccagat aactcaagca tttaggaaaa tagttagaag cccccgaatt 16381 aagacttgga aaaaaaattt ccttgttcta tctttcggat tcatccatag ccattcttta 16441 taactttcga 
cttaacaatg agcaccgagc agaaccagtt tgtctatctt aaccaagatt 16501 cctcaactct aaatatatac atcctttgag cgaaaatgta attgccaata ctgccaagcg 16561 acataatact tgataagttt cgttttatgt aatctggtag aaagttactc ttgactgtag 16621 aaggcaagaa agatttcagc ctattcaaac gaaatgagac aatctcagct tttttcagtt 16681 agttttctct gattactaaa tagggaataa tgcttttccc cctattgata gatatttaag 16741 cctaggtgta tgtgagatcc tttatcagga gaagacattg ctaaagtttt tacttgccaa 16801 attgcccatt ttggggaaag ctcaagaacg atgcataacc ccactatgaa aaattagata 16861 gttaatgcga ttttccgaca taaaaagtca gaaaactaag aaaagcacaa gaaaatttaa 16921 gaatctggag agaggcaaat gagtttttct aattgttaaa ataagagcat tttcattgag 16981 gagggttgga aagtcttttt cattaagaaa aaattagcaa gatatgtgtt attagcctga 17041 agaaaatgtt aatggatgca actaccaaga ttagttaaga actgtatcta tctgtaacca 17101 acagatgaaa tcaactgagg tttaagtgtt aagaatgaag tatgaaccaa ccccaaaaat 17161 aaaaaattaa ggggcttgtg tgagagtgtt ccctatttga tcgtacttct accttgatcc 17221 cagcttatgg taagtattga catttttttg aaagtcatcc gcttaataca cttttaccaa 17281 agcgtgagct tccggtgatc acagatagat actgatgttc agaatcacac aacttctggg 17341 caaatatttg tatcttgtca gaggcgggtg ctttgatctt tgaggaaaat taacaagctt 17401 cgccagagta gctgtcatta attatgtacc aaacagagca ataatctctg agggaaacta 17461 tgaatatagc tgaattggga acaatggaga caatgggaag tgctgctgat aatgaagaat 17521 tatttattac tttagatgca gtagcagagg atgagtccct agttgttgta gaaaatatag 17581 aagccgaaga ccgcgatgga gatcagatgg cggcggcgcg tccttcggga tataataaaa 17641 ccgagtatga cgatgctgtc ggcgcgtttt ttaaagaaat ggctcgctat ccgttgctaa 17701 agccagatga agaggtggaa ttagcgcgta gagtccggtt tatagaggaa attagggaat 17761 tacaagcttc gttacttgaa aagctggaaa atcaaccgag taaggaaact gtcgcttctc 17821 acataggaat gacagaaaaa caactggaac atcgcttgta tcaagggcgg gttgcaaaac 17881 gtaaaatgat ccgctcaaat ttacggttgg ttgtctctat tgctaaacga tatttaaata 17941 ggggagttcc ttttctggac ttaattcaag aaggggcgat ggggttaaat cgtgcgaccg 18001 aaaaatttga ccccgataaa ggatataagt tttctactta tgcctattgg tggataagac 18061 aagcaattac gcgggcaata gctaatgatg cgcggacaat tcgcttgcca attcatattg 
18121 ttgaaaaact taacaaactg aaaaaagcgc aacgagaact caaacagaga ctagggcgca 18181 acccatccga acaggaaatg gcagatgctt tggatgttcc cgcccaacaa ctacgccagc 18241 tacaacaact acgacgacaa gcactgtccc tcaaccatcg tgttggtaaa gaagaagaca 18301 cagaattgat ggatttgcta gaagatgaag ataacctttc tccagaggcg aaaatgaatg 18361 aaagcatgat gcgccaggag atttgggaag tcttaggaga cgtacttaca ccacgggaaa 18421 aagatgtcat ctctctgagg tatggtttga ttaccagtga accctgtacc ttggaagaag 18481 ttggtactat gttcaatctt tcccgcgagc gagtacgaca aattcaaagc aaagccatgc 18541 gaaagttacg acgccctcac atagccaaac gcttaaaggg ctggctggtg taatcaaatg 18601 agggagttat ttctccccca gccccccagc ctcccaacac gcaagaaatt tcttatccga 18661 accctattcc acagcctcct gattccccac accctaaacg aagtttgaca gcttgtcttt 18721 tgacaagttg tcttactaat cgtaagttga caaaactgct actaaagttt ttgtcaaagt 18781 tgggtgctgt gaggtagaaa atgaaattta gtttccttat ttttcaagtc atattaatta 18841 actttgcttt agctcctgtc acatttgcac aagaaaagaa tttcgagcaa aaaaagactt 18901 ctatcaaaaa gacatcagaa atcccatact tgagtgagat agaaaaacca tacaccagtg 18961 cagaattact cactcaaaca ccaattccat cgaacaagcc acaaaccact gataacccag 19021 aaccaaattc cacaacaaat gagtcagaac caaaagttga agatgacggc gcaattatta 19081 tagatgtgac agggaagaaa gatgaccttc ctcagtctac ccctacatat gttattgaaa 19141 aagaggaaat tgaaaaacaa ggtgcgacaa gtgtatctga tgtgttgaaa aaattgccgg 19201 gatttgcaat caatgactca ggtcatggtg cagatattca cacaggtaca tattatcggg 19261 gagcctcgat taaccagtct gtatttctta ttaatggtag agctattaat actaatgtca 19321 acacttatca tggtgcaaca gatttaaata gtattcctgt ggaagctatt gaacgaatag 19381 aattgtatag tggtgcagct tccactctct atggatcctc agcttttgga ggagtggtta 19441 atatcatcac caaagaaggt agcagggttc ctcgcttaaa tgcaactgca gaatttggcg 19501 atttgaattt caataaccaa caattaagct atggaggcgc aactggctct ttaagatata 19561 atctcagctt tgaaaggtct tttattgata accgttaccg cgttcctgtt ggtgcagcca 19621 atcgtgattc tcaaggatat ttctttaatg cagatacagc taccagcaca tactttggta 19681 gtcttgcctt tgatgtcgat cctagaaata ctttaagcct agatgtgact acacttagca 19741 gtcgtcgggg attaatttat tttggctttc ctttacaaag 
agaccgacta gaccacgata 19801 atttaaatat tggcttttct tggaagactc gccttggtaa tggtaacaat tctgttgtga 19861 caacgacact cggttataac caagattact tcaacactta cggtcccagc agccaattct 19921 accgtacagg aactttagat acacaattat atacagctag gatagaccat gtgtggcaac 19981 tcactccaaa ttataaattg cgctggggat tagatttaca aaacacagag ttaaatggtg 20041 atgtgttgag tacagttcct aatcgaattg ccttgaatca agatgaaagc gagagtttgt 20101 tgaacaccgc attatttgcc gttaatactt tgaatatcac cgataatttt caggtagatt 20161 taggcttaag acaaagcttt gatagcaaat ttggtaatta tctcaatccg agtgtgggat 20221 ttaaatatga tattgctcct gctttggctg tgcgaggaag ttgggcagat ggacaacgca 20281 atcctggttt agatcagttg tatatttatg atacggtaca tggatggctt cctaaccctg 20341 atttaaaacc cgaaactggc tcatcttgga ctgcaggagt tgatgtcagg tttgctgaag 20401 atttaacagg acagtttacc tacttcggga gtagtttaga taatcgttta ggagttatac 20461 aaggaaaatg ggcgaatatt gggctagttg ataccaatgg ttttgaggcg gcgctacggt 20521 ggaaagttgc ttcaggatgg tcaactttta tcaattacac atatacagat gcaaaaataa 20581 aaacaggctc agaaaaagat ttgcagttag gcttgattcc ctactctatc gcttctgctg 20641 gtattggtta tgaaaataga ggttggcagg caaatttgta tgctacttac tatggtggcg 20701 ctcgtcgagc catatacaca agagttggag atacagctac agatttttcg ccctctttct 20761 ttaatttgga ttttagcgct cgcgttcctg ttacgaaaaa tttggggttg acagtttatt 20821 tagagaattt actcgatgaa caatacgagc gagtcaatcg tatttatagc cctggattta 20881 cttttcgctt gggtttaaca gctaatattt agagatggcg tctatagcaa tccagagtac 20941 taacttgaaa atagggaaca gggaacaggg aacagggaac agggaacagg gaataggtgt 21001 tttcatcttt ggttgtgata tcatgtccgg ctaattagtt gtgatttcca cgttaactgg 21061 accccacacg ccagatccct acggagggaa accctcctgc aggactggct ccccttaatc 21121 ccctccccgc aagcggggag gggaggtaaa gcgcagcttt accggggtgg ggttcttagg 21181 gtgatgataa gtatttaacc ggacttgata tgatattttc tcgtgattga ctgatttgaa 21241 aattaacttc tattcagatg ctcgacttct cgcgagaagt tgggcatctt gttttttcac 21301 gaatcattta ggactgctat atttaagtta aaatcgcagt tatgagagca aaaattgaaa 21361 acgatgtgct ttttcttcac cacgaagatg tgccagagta taaaaaaggc ggttctgtgg 21421 tgagaaatag ctatttctgg 
gcgttgcgtt caatcgctgg aaaagcttcg cgttatggtg 21481 attgggaata tgagcctgag gtttggttcg cattgacgcg gatgctgtta tcctttgctg 21541 agtctggcta tctggggtta aaacaaactg tgctggagtt ccctctgtct caaggggaaa 21601 ttcctgatgt gctgcgagat gtttctacgt gggaatagga taaaactaga ggctgaactt 21661 cagaaaattt atttccaacc tgagcttgga aatgaggaga atgtaccaag ctttacaaag 21721 aaactgaata ttaaattcat tccaaagtat agataataag ggtgcaaaac tagctacgct 21781 agttttgcac ccttgcaaaa aaattaatct actgaggagt tatagttatc caaaagtttg 21841 acattttcaa atacctgatg tgctgcggga tgtttatagg tgggaatagg ataaaactaa 21901 aggcggaact tcagaatatt tatttccaac ctgagtctgg aaatgaggaa aacgtagcaa 21961 cgtatacaga gatagatgtt tagggggcaa aactagctac gctagttttg ccccctcgca 22021 aaaaaattag tccgttggag agttgtggtt acgtcaaaag tttgtttttt ttgaagaagg 22081 tttatgactg aaactaaggg tgtaggggtg aaggggtata ggggtgtagg tgaggagaaa 22141 agtaatcgca aattattttc tttctcttac acccttacac ccttaccccc ctagttcttg 22201 actttgaggc aaacggatac tatagaaatt catctccaaa attagcctag gaacgaggag 22261 aatgtagcac catttacaaa gaaacttaat attcaattca ttctcaagga tagatgttta 22321 gggtgcaaaa ctagctacgc tagttttgca ccctcgcaaa aaaaattaat tttgtggaga 22381 ataataatgc ataaaacctc tgaagaatac aatgagttca ctgaaaaata ctgtgagttt 22441 attgaagagt acgataaatt tttacagaat tatgaacact cttcacgact tgttgcagtc 22501 atgaaaactt caaaccagca aaagactgaa gtcggtgtat tatatgataa agctctacaa 22561 gcacacatag atactacaaa tacatacata caaacagtcg caatatatag gcagttagtt 22621 cagaaatggc ttttgatgac aaaatcctgt tacaagggtg aataatttag atagactcga 22681 acaccatttc aggaacgagt agctgctcgt tatctaaatt tcccagttta cctattccca 22741 aaaaacgctc atcttcatcg taaattcgta agatttttgg tgagatttct aaaacattga 22801 aagttatagg aatacgttga ccttgacacc atttctgtgc tgatgttggt ggtaaggtga 22861 tagaaccaag atgctgtagt ggtgcatcag gaagaagagg ttgaaatgtc ccagcttgca 22921 ccttcgcttc taagtcggtc aacgtgaggc tatttgctaa gtcaaatcca ctgctttgtg 22981 ttcgtgttaa agcagcgaga gttccaccag tttgtaagat tgtacctaag tcacgggcga 23041 tcgcccgaat atatgtaccc gcaccacagg cgatcgccac atccaattcc ggaaactcct 23101 
cttctcgcca gtccaaaact tcgattttaa aaacctccac tatccgcgca ggtacttcta 23161 catcttctcc tttgcgtgcc aagtcgtaca ggcgttttcc ttgaacttga atggcgctgt 23221 aaattggagg aatttgttca attttgccaa caaattgttg caatactggt tctatcactt 23281 ctaaactcaa cccagtgcaa ggttgtgagg cgatgatttc gccttgcaaa tcatcggttg 23341 ttgtacgcac acccaagcga attgtggctt tgtaagcttt ttctcctgga agatattgta 23401 ataagcgagt tgcttttcca agggcgatgg gtaatactcc tgttgctgca ggatctaaag 23461 ttccagcatg tccgactcgc ttgagacgca gaaatttgcg tgtctttgcg acacagtcat 23521 gggaagtcca accgaatgat ttgttgagat tgaggaaacc ttgcatatga gttctgaatt 23581 ttgatttctt tttaaatcat tactatgttt tatatcaagt tcgcctaatt cttacaataa 23641 aaaacctcac ccccatcccc tctccttact aaggagaggg gtgcccagag ggcggggtga 23701 ggttcttcgt tttttagaaa aaattttaaa gcatcgaaag gagtaagttc agattaacga 23761 gtttttccta aagcaacagt gactgaacct gtgacaacta aaacggctcc tatgattcct 23821 aaaaaggaaa tattttccgg taatatgaga gttggtatta tgactgatac gagccaaact 23881 gagattaaag tgacaatagg agctaatgcc aaaactgcac taacgcgtga tgcttcccaa 23941 tgttctaaag attcagcaaa ggcaccataa gcaatgagag tattgaaagc acaaaaaagt 24001 aacacaccca aatggaaaag attgagttga aaaattgctt ttgggttagc aaatggagtg 24061 aataataaag cacatcctgc ataaagaatg agcatgatgc taaaagaaga taaagattgc 24121 aacaactgct tttgtgccaa agcataaaaa gtccaagtta ctgcacctac cacaaccaaa 24181 ccactaccca gaagatattg accatgagcg gtaactaaat tttttaattg ttcgtgaaaa 24241 aagagagtga atcctataac aagaatccct acaccacacc actgaagcag agtatagcgt 24301 tctctaaaaa gaaataaacc accaaaaccc atgagtaagg gagctatctg aataataact 24361 tcagcgttag caggtgcagt cagtgctaaa ccttgcgtga aaagaatata gttagctgct 24421 aagccaagaa tagcaattgc tagcaacacc caagaagtag aacgcaattt taagagccgg 24481 ggtaattttt tttgccatcc taaataaata gctaacaaga caaacgatat caaaaaacga 24541 aaccaaataa gggtgtagac atcaagtact tgcagagtga ctgctaaggc aataggtaga 24601 attccccaca aaaaaacggt cactaacgat aatgctaacc ccaaacgcca gcgaccggaa 24661 gttgtatgca tgttagttat gagttatgag tcattagtca tgagtcatta gtcaaaaaag 24721 aaatgactaa taactaattt ttaaaaaaaa tcagccatcc aaggggatgc 
gcctgcaaat 24781 atttgagtag ctgcaaacct tgctgtttct gtagcatcaa gttgtcctgc aagttgcaat 24841 tgttgcggag taaataagcc tgtgtagaga ggtgctaatc ctctaatatc tagcttcatt 24901 tcgccttttc ctccactggt aacttcaccg cgtccattgg caacagacaa aataaatcta 24961 ccattatttt cagcgagcaa gtcatcttgg atttccaggt gcaattctgc ttgaattcct 25021 agtggataac cacgcttttc tagcgctttg acgacatcaa ccacgcgcag catccaatag 25081 ctagtagctt tttgcttagc ggtttgctct ggtaatagta aagttagaga ctctgtagca 25141 gaacctttcc atcgcacatt tttaatttgg gagcgatgta ggccaagaaa actccaaaaa 25201 ctttgtgcag cagccgttgt gagaaccgcc caatctctga ctcgtataaa agagccatcc 25261 tgatcttggt gctggctgaa gatgatgtac ccttggggtt gttctgcacg accaataaga 25321 taggtgtaaa ttgcttcctt ctcatgtggt ttaaatatac cctgccaaat accttgatgt 25381 cggtctaaga atccattatt gagtcttgca tgttgctgat agagttcata aaagatttca 25441 tgatcagctc taacagcgat aacaggtagt ggctgttcct tgactaagat gctttgagta 25501 ggaacttccc aagtagaaga gatacccccc tgttcatacc ctgcttttcg gtataggcgt 25561 tgagtcgctg gatagagagt agagatgggt actcccttgg cgtagagttc tttaacagcg 25621 tgctgcatca taactaatgc agctccactt ccacgatgtt ccggaccaat ccctactgca 25681 gctattcccg ccataggtac acgttcacca ccccaccact gagccattgg gatagtcgct 25741 agtccaccag caacttgctc acctgcacga ataacacgga agttttctaa acctatgcgg 25801 ttgaagaaaa tttcgctctc acctggcgac atgagaaaac actgctcaag gatattcccc 25861 agctgctgaa tgtcctcttt gacaacttgg ctgtactcaa attttggcgt catagtgtac 25921 accccctaac acggctagac tacaaatata ctacaataac agtcgtgtta gcgcctcacc 25981 caaaaggttt acccaaatag gggaagtttt ttggctttaa gtttctttta aagccctcca 26041 atcttttctt tctattgcca ataagttttc ttccacccat ttgccatgct tctgaaaacc 26101 ttgtgggata taacaaacaa atttcatccc aattcttttt aagactttct cgctacccgt 26161 attccatagg gcgtgataag cttcaatgcg ttctgcttct aaaactgtaa aaccgaactc 26221 gacaatagct ctagccgcct ctgtcatata accttgacgt tgatattctg ggtgtgtcca 26281 aaacccaaga ttccaaacgc cttctacttc gttatgtttt ctgatagaaa ttctgccgag 26341 aaatgtactc gtataaacac atgtaattgt gaatgtatat gctaaaccaa aatcccaagc 26401 ttgaagattt cgttgaaggg gttcgtgaag 
ttcttcgata gattgtggag cttcccaaag 26461 cataccatca ttaaacccct ggaatcgtgt agcggaaaat acgtgtggaa tatcttgttc 26521 acagacgcat ctcaaaatac agcgttcagt attaattcgg tatgtctttg gtattctcat 26581 gtgttcaaac aggagcttca ttttttcttt acaagcaatc ttaacctgta atatccaacc 26641 ccttaccatt cagacgacta agaaaagcac gtaagcgatc gctctctgga tgagaaagaa 26701 ctttgcaagc cttaccttgt tctaccacaa taccttgatc caaaaagaaa acttgatgtg 26761 ctacttcaca ggcaaattgc atttcatgag tgacaacaac catcgtcata ccctctgctg 26821 ctaactgctg catgacttgc aacacttcac ccacaagttc cggatctagg gcgctggttg 26881 gttcatcaaa aagcatcact tgggggttca tacacaaact acgggcaata gcgactcgtt 26941 gcttttgtcc tccggaaagt tgttctggat atgcagatgc tttgtcaaaa agtccaactt 27001 tttcgagata aagtcctgcc aattgtgtac tttctttcgc tgacttcccc agcacttgac 27061 gcggtgcaag tgtaagattt tctagtacgc ttaaatgagg aaacaggtta aattgttgga 27121 aaaccatgcc gacttgtgtt cgcagttgcc gtagttgcct gttgctaaaa ttcggttgcg 27181 ataggttaat atcgttgact agtaaacgcc cagaatctat tgtttccaag cggttgaagc 27241 agcggagcaa agtactttta ccacaaccag aagcaccaat gactgctacg acttctcctc 27301 gtttgatttc accgctaatt cccttgagaa cttttaggga accataactt ttctcaatgt 27361 tttcaaagat aattgcagag gaggaattgt tcattttcta tgtttcttga actacagatg 27421 attctagcta aacactctga tacaagaaaa cataaaagtt atgctagtac atatttgaag 27481 atatttagtt acgttctgga aattcattgt ccgcttctgt agtaccttga aatgtacttc 27541 tttaaaaata tcgtatctgt taacaattat ttatttggga aaaaaccatg ttcgagccaa 27601 aaatgacgcg ctcgcatttc attaaacatt tcttaactgg tttcgctgcg actgtgattt 27661 tgtcagcgtg tgataactca gccagcaata ctagcagttc cttatcgggg aattcgcaaa 27721 cagctacagc cggaaaaaca attaaagttg ccacagaacc tgcttttccg ccgtttgagt 27781 ccaaaagttc gggaaatgaa ctcgtaggct ttgacattga tctgatcaaa gcagcgggac 27841 aagccggtgg attaacgatt gagttccaaa gtctgccttt tgatggtata ataccagcac 27901 ttcaagcaaa tactattgat gctgctatca gttcaattac catcacccca gaacgtgctc 27961 aagcagtatc tttttctcga ccatatttta gggctgggtt ggcgatcgca atccgccaag 28021 acaacacaac tatcaccaac ctagatagtc tcaaaggcaa gaaaattgct gcccaaattg 28081 ggacaaccgg 
atctaaaaaa gccaaaagca tctctggtgc tcaagtgcgg gaattcgact 28141 ctgccccttt agctttgcaa gaattggcaa atggtaatgt ggatgctgtt gttaacgatg 28201 ctcctgtgac aatagaagct atcaagagtg gtaacatcaa aggtctaaag gtagttggtc 28261 aacttatcac tgaagaatac tatggtattg ccctaccgaa aaactctccc aacctcaacg 28321 ccattaacac agctcttgca aagataattt cagacggtac ctatgctcaa atttataaga 28381 agtggttcaa tgcagaaccg ccacaactac cagaaactgt tccaggttct agctcttgag 28441 gatgagggag tgaggagggc agagagagag agagagagag agagaggagc agaggagcag 28501 aggagcaggg agcagcgagc aggggagaat aatctttgac tgttgagtgt tgacagtcct 28561 gacgagttaa ctttatttgt accaacttag ttaacaatga ttcagagttt aactatcatt 28621 ctcaacgccc taccaaattt acttcttggt gctgtcgtca cacttgagat taccgcgctt 28681 tctgttgttt ttggcatgat tggcggttcg ttgataggaa ttgcccgact ttcgccaatt 28741 ttactcttgc gcttttgcac tcgcgcttat gttgattttt ttcgcggtac gcctctgctt 28801 gtgcagattt ttatgattta ctttggttta ccagcactag ctcaaagcat tggcatacct 28861 ctgcgttttg accgtttact tgctgctgtg gttgctttga gcctcaactc tgctgcatat 28921 ataggtgaaa ttgttcgtgc tggaattcaa tctattgaac caggacaagc cgaagccgca 28981 aactcgctgg gtatgagtgg agtacaaacc atgcgctaca tcatctttcc ccaagcgttg 29041 cgtcgtatga ttccaccact cggaaatgaa tttatctccc tgcttaaaga taccagtctc 29101 gtgtcagtta ttgggttcga ggaattatta cgacgtggtc aattaattgt agctgacact 29161 tatcgtgctt tcgagattta cactgcagtt gcgttggttt atctggttct caccttagct 29221 tcatcccaat ttttctctcg tctagaagtt tggatgaacc caatcaagcg tcagaaagcg 29281 agccaaaaaa aattgagttc accataagtc atctttgggc tattattgac agtcgtcgat 29341 tcgtttttaa ttttccaaca cacctgcttc cttctctatt ctgtctaaag ccctcttagc 29401 tttcaacaca actcgcaaat aggctacctg attactccaa gccaagcgag gtaacagggg 29461 tgcttgagag tctacctgag tttgggagtt agtaatagtt gtattttctt cagacataaa 29521 aattggcggt ggaaatctag ttttctttat tttgaacttt gagattgcgt cgcaacgctt 29581 gcaacgctac gcgattttga att // LOCUS NODE_978_length_29166_cov_5.16361529166 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 29166) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 29166) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..29166 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 343..771 /locus_tag="DP116_08450" CDS 343..771 /locus_tag="DP116_08450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315320.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="mannose-6-phosphate isomerase" /protein_id="PRJNA477356:DP116_08450" /translation="MAQIQEATQANTLPLPPSVTPRGVAATELRPWGSFTVLEEGRGY KIKRIEVKPGHRLSLQMHHHRSEHWIVVCGTAKVVCGDEEIFLSNNQSTYVPQCTAHR LENPGVIPLVLIEVQNGEYLGEDDIVRYQDDYARVQEKNQ" gene 1043..1429 /locus_tag="DP116_08455" CDS 1043..1429 /locus_tag="DP116_08455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872456.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="iron-sulfur cluster assembly accessory protein" /protein_id="PRJNA477356:DP116_08455" /translation="MIQLSPSAANEIRRLKSKQHPNVLFRLAVKPGGCSGWYYDMSFE QGLQVGDDPEGSASSGDAALVPSQRNERIFECYDIQIVIDAESLKYVNELTVDYSEDL MGGGFRFYNPISNATCGCGNSFSTTQ" gene 1526..1903 /locus_tag="DP116_08460" CDS 1526..1903 /locus_tag="DP116_08460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129154.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S12" /protein_id="PRJNA477356:DP116_08460" /translation="MPTIQQLIRNEREQARQKTKSPALKQCPQRRGVCTRVYTTTPKK PNSALRKVARVRLTSGFEVTAYIPGIGHNLQEHSVVMIRGGRVKDLPGVRYHIVRGTL DTAGVKDRKQGRSKYGTKRAKAK" gene 2811..3281 /locus_tag="DP116_08465" CDS 2811..3281 /locus_tag="DP116_08465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196399.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S7" /protein_id="PRJNA477356:DP116_08465" /translation="MSRRGVIKKRPVPPDSVYNSRLISMITRRIMRHGKKSLASRIVY AAMKTIEERIGGDPLETFEKAVRNATPLVEVKARRVGGATYQVPMEVRAERGTSLALR WLVQYSRQRPGRTMASKLANELMDAANESGNAIRKREETHRMAEANKAFAHYRY" gene 3446..5524 /gene="fusA" /locus_tag="DP116_08470" CDS 3446..5524 /gene="fusA" /locus_tag="DP116_08470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009341972.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="elongation factor G" /protein_id="PRJNA477356:DP116_08470" /translation="MARTNPLERVRNIGIAAHIDAGKTTTTERILFYSGIIHKIGEVH EGTAVTDWMEQERERGITITAAAISTSWKNHQINIIDTPGHVDFTIEVERSMRVLDGV ITVLCSVGGVQPQTETVWRQADRYKVPRIVFINKMDRTGANFYKVYDQVRDRLRTNAI PIQLPIGSETEFQGIVDLVRMRAYIYTNDQGTDIQDTDIPEDLLSQVEEYRTKLIEAV AETSDDLMTKYFEGEELTEDEIRTALRKGTVKGSIVPVLCGSAFKNKGVQLLLDGVVD YLPAPIDVPPIQGTLPNGETVERRADDNEPLSALAFKIMADPYGRLTFVRVYSGVLKK GSYVLNATKNKKERISRLVILKADDRIDVDEMRAGDLGAALGLKDTLTGDTLSDESSP VILESLFIPEPVISVAVEPKTKNDMDKLSKALQSLSEEDPTFRVHVDPETNQTVIAGM GELHLEILVDRMLREFKVEANVGAPQVAYRETIRKQVNRIEGKFIRQSGGKGQYGHVV IDLEPGQPGTGFEFVSKIVGGTVPKEYIGPAEQGMKESCESGVLAGYPLIDVKATLVD GSYHDVDSSEMAFKIAGSMAMKEAVMKAAPVLLEPMMKVEVEVPENFLGDVMGDLNSR RGQIEGMGSEQGLAKVTAKVPLAEMFGYATDIRSKTQGRGIFSMEFSNYEEVPRNVAE AIIAKSKGNG" gene 5547..6776 /gene="tuf" /locus_tag="DP116_08475" CDS 5547..6776 /gene="tuf" /locus_tag="DP116_08475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015190073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="elongation factor Tu" /protein_id="PRJNA477356:DP116_08475" /translation="MARAKFERNKPHVNIGTIGHVDHGKTTLTAAITMTLAAMGQAVA KGYDQIDNAPEEKARGITINTAHVEYETEKRHYAHVDCPGHADYVKNMITGAAQMDGA ILVVAATDGPMPQTREHILLAKQVGVPSLVIFLNKEDLMDDEELLELVELELRELLSD YDFPGDDIPIIKGSGLQALEAMTANPKTQKGSNPWVDKIYALMDAVDAYIPTPERDVD KPFLMAVEDVFSITGRGTVATGRIERGKVKIGDNVELVGIRNTRSTTVTGIEMFKKSL EEGMAGDNAGILLRGIQKADIERGMVIAKPGSITPHTQFEGEVYVLTEKEGGRKTPFF PGYRPQFYVRTTDVTGTIKEFTADDGTAAEMVMPGDRIKVTVELINAIAIEQGMRFAI REGGRTIGAGVVSKILK" gene 6977..7294 /locus_tag="DP116_08480" CDS 6977..7294 /locus_tag="DP116_08480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015190074.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S10" /protein_id="PRJNA477356:DP116_08480" /translation="MATLQQQKIRIRLQAFDRRLLDTSCEKIVDTANRTNATAIGPIP LPTKRRIYCVLRSPHVDKDSREHFETRTHRRIIDIYQPSSKTIDALMKLDLPSGVDIE VKL" gene 7622..8272 /locus_tag="DP116_08485" CDS 7622..8272 /locus_tag="DP116_08485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315327.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent protease" /protein_id="PRJNA477356:DP116_08485" /translation="MTSSSKIAVCELPLFPLPEVVLFPTRPLPLHIFEFRYRIMMNTI LESDRRFGVLMVDPMKGTIANVGCCAEIVHHERLPDDRIKMWTLGQQRFRLLKYVREK PYRVGLVEWISDNPPTKNLRPLAADVEQLLRDVVRLSAKLTEQNIELPEDLPNLPTEL SYWVASNLYGVATEQQTLLEMQDTATRLEREAEILTSTRNHLAARTVLKDTFNQKL" gene complement(8326..9216) /locus_tag="DP116_08490" CDS complement(8326..9216) /locus_tag="DP116_08490" /EC_number="4.2.1.51" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315328.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prephenate dehydratase" /protein_id="PRJNA477356:DP116_08490" /translation="MNLSIAHLGPIGTYTEQAALFYLNWLTKNTGVEAVLCPYPSNAQ TLRAVAQKEAQLAVVPVENSIEGSVTMTLDTLWQLDSLQVQLALVMPISHTLISCAQS LENIKTVYSHPQALAQCQLWLERFLPNVTLIPMNSNTEALLQLKQDLTAAGISSQRAA QLYNLPIIASGINDYPGNCTRFWVVSQNHLPISHPTVSECARHTSIAFSVPANIPGAL AKPLQVLARLNLNLSKIESRPTKRSLGEYLFFIDLEADASEPKAQSALAEISSYTEIL KILGSYNVLPINALSEAVSG" gene 9655..10242 /locus_tag="DP116_08495" CDS 9655..10242 /locus_tag="DP116_08495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08495" /translation="MSTRFTAYQSIEIPIPEQPIPIQHYLRQPQRLVNALVDPSRIQQ LSEEVFRLKMRPLNFMSLSIQPTVDMRVWAESNGTIYLRSLNCEILGVEYINQRFALN LKGYLLPEQQVTSTVIKGRADLEVLVDFPPPFSYTPKPILEATGNGLLKSVLLTVKQR LLHQLLADYSRWVILQTREKALEDKSMPLFNPEQL" gene complement(10530..11213) /locus_tag="DP116_08500" CDS complement(10530..11213) /locus_tag="DP116_08500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease HII" /protein_id="PRJNA477356:DP116_08500" /translation="MIKTEKTVTSTRLSAPNTEMRWLDFSMVSSTQGLFAGVDEVGRG ALFGPVVAAAVILPESALSQLTAAEIKDSKKLSSSRRVRLAQQICALAIDWKIGFAST AEIDQMNILQATMLAMKRAVLKLKVQPALCLIDGNQLVKDLPLTQQTIVKGDERCIAI ASASIIAKVWRDDLILRLASKYYMYDLERNKGYGSQRHLMALQKYGPSPLHRKSFRPC QITVLSSAT" gene complement(11359..13461) /locus_tag="DP116_08505" CDS complement(11359..13461) /locus_tag="DP116_08505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872464.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribonuclease E/G" /protein_id="PRJNA477356:DP116_08505" /translation="MPKQIIIAEQHQIAAVFSEDQIQELVVATGHHQIGDIYLGVVEN VLPGIDAAFVNIGDPERNGFIHVTDLGPLRLRRTAAAITELLTPQQKVLVQVMKEPTG TKGPRLTGNITLPGRYVVLMPYGRGVNLSRRIRSESERNRLRALAILIKPAGMGLLVR TEAEGKPEEAIIEDLELLQKQWEAIQQEAQSTRPPALLNRDDDFIQRVLRDMYGADVN RIVVDSSTGLKRVKQYLQNWSGGQTPQGVLIDHHRDRAPILEYFRINAAIKEALKPRV DLPSGGYIIIEPTEALTVIDVNSGSFTRSATARETVLWTNCEAATEIARQLRLRNVAG VIVVDFIDMESRRDQLHVLEHFNKALKADKARPQIAQLTELGLVELTRKRQGQNIYEL FGTVCSTCGGLGHIVHLPGETESRPLAPTELPDRFASSSSYKEPRLPVTRPTESRETL EGYGEAYESDNDLAALNLVNHPSYQEMTDSRKRRTRRSTRVERLNGGNGKDEPRVGAN NPLAFLNEPDLEIDDEPELPAPSEVSSPSISKPSWGSDRVIERSHKVVTKVEPIKPVV EPPEIVTVEMSSQEQDVYAQMGVSPLEKLNREVRNPKSVIINVTLPGHSPATPTESTS ESPVSPRVTPVVTPDLPSESIPETTSDVSDDVSDDVSDLPVITEDESEVTSSVSDKRR RRRRSSSVETDTYTASET" gene complement(13653..13985) /locus_tag="DP116_08510" CDS complement(13653..13985) /locus_tag="DP116_08510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874246.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08510" /translation="MLTASAIITRGNHARCFNGTPGASSRETRPTHWLGNPRNALAPQ GRTGLATHKTPPAITGEVSQFPRVGEKPSHGISSPRAQWLLKTHNSGLSTFQRKGIAS KINHGRFI" gene complement(14449..15363) /locus_tag="DP116_08515" /pseudo CDS complement(14449..15363) /locus_tag="DP116_08515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872465.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="B12-binding domain-containing radical SAM protein" gene complement(15616..17361) /locus_tag="DP116_08520" /pseudo CDS complement(15616..17361) /locus_tag="DP116_08520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872465.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="B12-binding domain-containing radical SAM protein" gene complement(17483..17827) /locus_tag="DP116_08525" CDS complement(17483..17827) /locus_tag="DP116_08525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anti-anti-sigma factor" /protein_id="PRJNA477356:DP116_08525" /translation="MQVVLDYPKITVISPQGCLNATNALEFETNITKALAQDGISFLL VDLEYVESLDSAGLMALVSALKLSHKLGRRFSLCSVSPCLRIIFEMTQLDRVFEIFEG KAAFEAACFPMQ" gene 18641..18847 /locus_tag="DP116_08530" CDS 18641..18847 /locus_tag="DP116_08530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08530" /translation="MTMKYSIRRLVEKALDIKKLTPEIENEINSELTQMGHISDVDYE ALELLMAEMDAGRIQLVPSAGCFF" gene 19344..20519 /locus_tag="DP116_08535" CDS 19344..20519 /locus_tag="DP116_08535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652917.1" /note="produces methionine from 2-keto-4-methylthiobutyrate and glutamine in vitro; mutations do not affect methionine salvage in vivo however; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LL-diaminopimelate aminotransferase" /protein_id="PRJNA477356:DP116_08535" /translation="MQFAQRLEKIPPYLFAEIERRHHELVAQGIDIINIAKGDPDKPT PAHIIQAMHEAIDDPSTHDYPPYRGTQEFRKATAMWMEYRFGVTGLNPETEVISSIGS KEGIHNTFLGFVEVGDYTLIPDPGYPVYRTSTIFTGGEPYAMPLKAENKFLPDLNAIP KEVAQKAKLLWINYPNNPTGAVATLEFFEELVAFCKQYDILLCHDHAYSEIAYDGYKP PSVLQVPGAKEVAIEFHSLSKSYNMAGWRIGFVVGNATAIKGLGQVKTNIDSGVFKAV QACAIAAYSTDITEIQSRVSVYQKRRDIIVKGLQSLGWCIETPKGSLYVWVPVPQGYS SKEFVTLLLEKCGILVPPGSGYGAAGEGFFRIALTVSSERMDEAIQRMRDAGIRYQC" gene complement(20549..20905) /locus_tag="DP116_08540" CDS complement(20549..20905) /locus_tag="DP116_08540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315337.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08540" /translation="MNMSVEFKAGQIVYLEHSDRRLYAEVIQVVVSRQLCWVRPWLLV AFTQEMPQIIDLRDASDLLWCINLFQPALDTEVIGFLSEVLAKQPKPGQILDAKQQLN QFLHEIWQAQKESSEC" gene 21109..21390 /locus_tag="DP116_08545" CDS 21109..21390 /locus_tag="DP116_08545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998463.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease adapter ClpS" /protein_id="PRJNA477356:DP116_08545" /translation="MSVETLQKNSTSRKIAPRYRVLLHNDDHNPMEYVVRVLLTTVPN LTQPQAVSIMMEAHSNGFALVITCAQEHAEFYCETLKNHGLTSTIEPEE" gene 21405..22232 /locus_tag="DP116_08550" CDS 21405..22232 /locus_tag="DP116_08550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410250.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CPBP family intramembrane metalloprotease" /protein_id="PRJNA477356:DP116_08550" /translation="MKKVLLRLRQRPAPIRLGCFILTLVGIWLPIAVPIYLLVDDSNL VTILTMVLLAIGFFVLLPIWNKYVYQQRQIFRHYGLERTRLNKVELVRGLAIGLITIL ILFSLEGLLGWLVWQKPNIFLLRVVLEGLITSLGVAFAEELFFRGWILDELQRDYSPS VVLWTDATIFAIAHFIKPLPEVIRTSPQFFGLLLLGLTLVWAKRSSRGRLGLSIGLHG GLVWGYYIINVGGLMKYSRQVPDWVTGVNDNPLAGVIGLVFLGGLALWMRGRAVKFE" gene complement(22377..22778) /locus_tag="DP116_08555" CDS complement(22377..22778) /locus_tag="DP116_08555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002783914.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PIN domain protein" /protein_id="PRJNA477356:DP116_08555" /translation="MLRAVADTHAVIWYIFADARLSITARNMIAQIASAGDQVAFSSI TLAEIVYLSEKGRISPLTLERLLAVVDTTDAVLAEVPFDRHIAQALRLVERTQVPDLP DRIVAATALHLGVPVISRDSKIKLSSINTIW" gene complement(22778..23008) /locus_tag="DP116_08560" CDS complement(22778..23008) /locus_tag="DP116_08560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129173.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08560" /translation="MSLQEVLKQALQLSTVDKVRLIQQIAPEIERELIDNPPTPRKLL WGLCADLGQAPSESEIDVARSEEWANFSREDI" gene complement(23019..23657) /locus_tag="DP116_08565" CDS complement(23019..23657) /locus_tag="DP116_08565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865151.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_08565" /translation="MNIEEIRLGKLKNLPGTNLEDEDLSNSDLSRINLAGAHLVGTLF TGSKLEGGHLEGANLMGANLVETDLRANLMGANLMQADLTGADLRGGNLRGANFMGAR LSEVSLAGAFLSGANLMNVNLQGADFRGADLRGANLTGANLKGADLSRADLQGALLSE ANLEEADLRGANLAGANLTGANLLCAELDGANLSGVNLERACLVGTLVEIVS" gene complement(23751..25148) /locus_tag="DP116_08570" CDS complement(23751..25148) /locus_tag="DP116_08570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315344.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metal-dependent phosphohydrolase" /protein_id="PRJNA477356:DP116_08570" /translation="MLEGSILQKLEAAHRHSKRPIQYGVYYKNTLVSLCHALEDHILT DDSTPLVITAFQQGKWYLQEAQRYADIAQKSCQIAIMASPDAGFAEHPTSQLPNVNLV GLESADPVAQEWHLIIIAPAYTAMVICQELSEADYGTTGVPASDVERKFYGLWTFEAE LVKETAELAISHIGQYNPELAQKLKAHKDAIQPCLATPEDLSAVVSQVVDYLQTGQVN ISVPTATRHQALDRNLVSNEIQAFLRMAQIIDATDITNPMAAAEVVALAETIGQLLDL PAWQMKRLRLAALLHRIDPLQRAQSILSPGTSTRYQEDAPSCPLTCPLVPGAQVLRTM PQLRAIAQIITHQSEWWDGTGEPAGLAGDEIPLESRIMALVADFQWRLNQKKTSQLSR EEIFAQALEECRQLQSYRFDPKLVDTLALLVMGLQQGLDVALVTPKVSASMWLLNSRL ESESKTGEQIRSYGK" gene complement(25218..26810) /locus_tag="DP116_08575" CDS complement(25218..26810) /locus_tag="DP116_08575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210269.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="radical SAM protein" /protein_id="PRJNA477356:DP116_08575" /translation="MEVKTPVMDNRILYVRLPCNPIFPIGVVYLADHVHKLFPTIEQR IFDLGTVPPLDYAFALDQCIDEFKPTLLVFSWRDIQIYAPVGGRGGNPLQYSFEIFYA KNPLVKLHGAFGGLRMLTSYYTELWRNLGLIKRGLKRARKYHPDIRLVVGGGAVSVFY EQLGKSLPKGTIVSVGEGETLLEKLLTGTEFIDERCYIVGESQPRTRLIHEQPTPVEK TACNYDYIESIWSELNYYLQEGDFYIGVQTKRGCPHNCCYCVYTVVEGKQVRINPADE VVAEMRQLYNRGIRNFWFTDAQFIPARKFIEDAEELLQKIVDSGMSDIHWAAYIRADN LTPKTCELMVKTGMNYFEIGITSGSQELVRKMRMGYNLRTVLQNCRDLKAAGFNDLVS VNYSFNVIDERPETIRQTIAYHRELEKIFGADKVEPAIFFIGLQPHTHLEEYAFKEGI VKPGYNPMSHMPWNARKLLWNPEPLGSFFGEVCLEAWRRNPNDFGREVMKILEERVGC ADLEEALSAPIETKEKPLVSVS" gene 27342..27737 /locus_tag="DP116_08580" CDS 27342..27737 /locus_tag="DP116_08580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08580" /translation="MAQILDPLPPEQSGKLFCCYVNATSKIQVARISNIPNWYFERVV FPGQRLVFEAPRQAQLEIHTGMMASAILSDTIPCDRLAISEPNSFVFDTDSSAPVMDS INKKPIVQINTITGDFIKSLQVAGLVTID" gene 27854..28291 /locus_tag="DP116_08585" CDS 27854..28291 /locus_tag="DP116_08585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860886.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4079 domain-containing protein" /protein_id="PRJNA477356:DP116_08585" /translation="MNLPSFLWLWKIAAWSMGLSLLAYLLLAITGVWMFRTRRLQQEE PSWLHSLHYLIGGCIVSLVLLLLLIGIIGTLGHFGSLGHSSHLIAGLTAVVLVLLSAG SALLIHPRRSWAKRIHIGANIALFFGFVWVSLTGWTVVQKYLP" gene 28410..28976 /locus_tag="DP116_08590" CDS 28410..28976 /locus_tag="DP116_08590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456063.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08590" /translation="MTANSADTSSSVFRLSPLIRITLLCLYVALTTPLPFLSQVTAAP VPPALLWVGIVIGFIGLYAALSERVIVDDQGIQVTYPQWVPGFFRKGWSLPWSDVKEL KPRTTGQGGIVYYFLSHEGKAYLLPMRVVGFARLVKFVQQKTQVDTTDVYPLAQPWMY LILLFLTLLLLLIDGWTIATALAQGSIN" BASE COUNT 8083 a 6399 c 6432 g 8252 t ORIGIN 1 ggtaacttat gtgcgggctg agtaccgtga ttcagttacc aacgaatcgc acgttcccta 61 ttccctattc cctattccct gttccctgtt aagagttccc tgttccctct taatcttgct 121 tcttaacttt aaagttgaga ataagttgcc acgtgagtta atgaagactt ggtatagaca 181 cactagtttc agaggtgatt gataacactc ccccacctga tgcgtgttat ctttgccaga 241 ggtggcgtgg attgctgatc tccacctttg caacatcacg ccttgtcttt ttcaatgatt 301 caagagcact gacagtctgt tgtgtcataa cacgaggtaa atatggctca aattcaagaa 361 gcaacacaag ctaacacatt gccccttccc ccatctgtca cgccaagagg tgtcgctgca 421 actgagcttc gtccctgggg ttcatttacc gttttagaag aagggcgtgg atacaaaatt 481 aagcgtattg aagttaagcc tggacaccgc ctcagcttgc aaatgcacca ccaccgtagc 541 gaacattgga ttgtcgtttg tggtactgcc aaagttgtct gtggtgatga agaaattttc 601 ctcagcaata atcagtcaac ctatgtaccg cagtgtacag ctcatcgatt agagaatcct 661 ggtgtgattc ctttggtgtt aattgaagtt caaaatggag aatatttagg agaagatgat 721 attgtgcgtt accaagacga ctacgcccgc gttcaagaga aaaatcaata atagctttat 781 caaagcatga attgaaaata tgcttgttac tcccttccca cttcttaatt acctctgaca 841 gcgaaatcta ttttctggtg cgatctattg cgcctcgtgc ggtttgaaaa tcagtaatct 901 tttggcggga aaggagtagg cagttccggc tgaaataagc aacaagcaaa agttagacaa 961 aatatgctat tttgagaaaa ctcggtttct tcaagaaacc gagtttttca ttgataagtt 1021 tttaaatcgc tttgccgctt ttatgatcca attgagtcca tctgccgcaa atgaaatcag 1081 gcgattaaaa tcaaagcaac acccaaatgt cttgtttcgt ttggctgtta aacctggagg 1141 ttgctctgga tggtattatg atatgtcctt tgaacaaggg ctacaagttg gcgatgaccc 1201 agaaggcagc gcttcttcgg gcgatgctgc tttggtgccg tcccaaagga acgaacgcat 1261 ttttgagtgt tatgatattc aaattgtcat agatgccgaa agcttaaaat acgtcaatga 1321 gttgactgtg gactattcag aggatttgat gggcggtggc tttcgcttct acaaccccat 1381 atctaatgcg acttgtggat 
gtggtaactc tttttctaca actcaatgac taattcacaa 1441 acaagtttga cacaaaagca caaaaaaaga tataattagg atttgtaaaa acttagacta 1501 aaggcagctt ctcgcgctgt aacccatgcc aaccatacag cagctaatac gtaacgaacg 1561 cgaacaagcg cgtcagaaaa ccaagtcccc agctctaaag caatgccctc aacgtcgggg 1621 cgtttgtact agagtatata cgaccacacc gaaaaagcca aactcagctc tacgtaaagt 1681 cgcaagggta aggttaacct ctggatttga agtgacagct tacattccag gcattggtca 1741 taacttacaa gaacactctg ttgtgatgat tcgcggcggt cgcgttaagg acttaccagg 1801 cgtgagatac cacattgtcc gtggaacttt agatacagcc ggagtcaaag accgcaagca 1861 aggtcgttcc aagtatggaa ctaagcgtgc taaagctaaa tagaatagct atgagtcatc 1921 aatcctctag acaccaacgg acactttgcc taacttgggt aacctcctga gtgtgcaagt 1981 gtttggcgta ggagtgcttc ttcttaacgg tagctttccc tacgacccga ggagtctaca 2041 aatcgccggt agcgcaggac atagcaattg gctttgctaa gtgaaatcaa gcactatgtt 2101 ctgttcagca gtttctgatt tagaacagag agctttttga cacaaatttg agtgagattt 2161 cattgcttaa gcaggaagtt ttgttcttaa aaatgagttt gtgtttgcgc caaactcggc 2221 tgagagtcgg cataaaagcg cgataaaatt ttctggtaac acccatcggt cggtcaggca 2281 gacgaggcgt gaatctggta cagaaagtct agcaggactg ttaactgtat ctaccgatgt 2341 gtattgctca aagtacgcgc aactccatat cgtctcgggg tactaggtgg acgttaaatg 2401 aaaaagcctg ctaactcgat ctggcgggga cgggtgaaaa tgctagtttt gctcgtctag 2461 ttaagagggg ataaagcagg aaattctggg agacaactct tggaatcctc aacgaaacct 2521 tcggcacgtc tttagacgaa cagccttcag cctccttagg gaggttgtca aaaaatcaac 2581 accaagagca atagtatatt gcttgtaaga gttgaataag tagggtcagt cacagtatat 2641 aattttgtcg cttcctctcg ttaaggacag cattttttgt cagtcagtgt agctgcaatc 2701 gcagtgaaac ttctgaaaaa aatttggacg gtggcattta gcgttgagaa atgccggtgt 2761 tcgataacat atagcatgta gtctcccgat tcagaactaa aggtttaagt atgtctcgtc 2821 gtggtgttat taaaaagcgt ccggttccac cggactctgt atacaacagt cgcctcatca 2881 gcatgattac taggcggatc atgcgtcatg gcaaaaaatc tctcgcctca cgcattgttt 2941 atgctgctat gaaaacaatt gaggaacgga ttggtggcga tccgttggaa acctttgaaa 3001 aagctgtgcg caatgctaca cctttagtag aagtgaaagc tcgtcgagtt ggtggggcaa 3061 cctaccaagt cccaatggaa gtgcgtgcgg 
aacgaggtac aagcttagca ctacgttggt 3121 tagtacagta ttccagacaa cgtccaggtc ggacaatggc aagcaaactg gcaaatgaat 3181 taatggatgc tgcaaacgaa agcgggaatg cgattcgcaa acgtgaagaa acgcaccgca 3241 tggcagaagc aaacaaagcg tttgctcatt atcgctacta atgcaaagac gcgctatggc 3301 gcgtctaaca tcggttgata tatcgtgact gtagcggaat gtaatgtata tttacttagg 3361 aaaagacggt tttccgtaag agtataatat cttaacaaag tgtaatatac aagatatcat 3421 gaggcgaaaa ctataggagg cagctgtggc acgtacgaac ccgctagaga gagtacgcaa 3481 tataggtatt gcggcgcata tagatgcggg caaaacaacg acaacagaga gaatattatt 3541 ttactctggg ataattcata agattggtga ggttcatgaa ggaaccgctg taactgactg 3601 gatggaacaa gagagggagc gaggaattac gatcaccgct gctgctatta gtaccagttg 3661 gaaaaatcat caaattaaca ttattgatac tccaggacac gtggacttca caattgaagt 3721 ggaacgttcc atgcgggtac tggatggtgt gatcacagtt ttatgttctg taggtggcgt 3781 gcagccccaa acagaaaccg tgtggcgtca ggcagatcgc tataaagtgc ctcggattgt 3841 ttttattaac aagatggatc gcactggcgc gaacttctac aaagtttacg accaagtgcg 3901 cgatcgcctg cgaacaaatg ccattcccat tcagttgccc attggcagtg aaaccgagtt 3961 ccaaggtatt gttgacctgg tgcggatgcg tgcatacatc tacaccaacg accagggaac 4021 ggatatccaa gacacggaca tccccgaaga tctgctctct caggtagaag agtaccgcac 4081 taaattgatt gaggcggtag cagaaactag tgatgacctg atgaccaagt acttcgaggg 4141 cgaggaactg accgaagacg aaatccgtac agctctgcgt aaaggcaccg ttaaaggtag 4201 tattgtgcca gtgctttgcg gctcagcatt taaaaacaaa ggtgtacaac tgctgttgga 4261 tggggtagta gattacctgc cagcaccaat cgatgtaccg ccaattcaag gtacactgcc 4321 aaatggtgaa actgtcgagc gccgcgccga cgacaacgaa cccctgtcgg ctctagcctt 4381 caagattatg gctgacccct acggtcgtct aaccttcgtt cgtgtttatt ccggtgttct 4441 gaagaagggc agctacgtcc tcaatgccac taagaacaag aaagaacgaa tttctcgttt 4501 agtgatattg aaagcagatg accggattga cgtagatgaa atgcgggcag gcgatttggg 4561 agccgcattg ggattaaaag acaccttgac aggtgacaca ctttctgatg aaagctcacc 4621 agtgattctg gaatcgctgt tcattccgga gcctgtgatc tcggtggcgg ttgaacccaa 4681 aaccaagaac gacatggaca agctgtccaa ggctctgcaa tctctctcgg aagaagaccc 4741 caccttccgc gttcacgtcg atccggaaac caaccaaacc 
gtgattgcag ggatgggaga 4801 actacaccta gaaattctag tagaccggat gttacgcgaa ttcaaagtgg aagcaaacgt 4861 tggtgcgcca caagttgctt accgcgaaac aattcgcaag caggttaaca gaattgaagg 4921 taaattcatc cgccaaagcg gtggtaaagg tcagtacggt cacgttgtga tcgacttgga 4981 gccaggacaa ccaggaaccg gctttgaatt cgtctctaaa attgttggcg gtaccgtacc 5041 taaagagtac attggaccag cagaacaggg aatgaaagaa agctgcgaat cgggtgttct 5101 tgctggatat ccgctgattg atgtcaaagc aacgctggtt gatggatcgt accatgatgt 5161 ggactcttca gaaatggctt tcaaaatcgc tggctcaatg gcgatgaagg aagctgtgat 5221 gaaagcagca cccgtcctgt tagagcctat gatgaaagtt gaggtagaag ttcccgaaaa 5281 cttccttggg gatgtgatgg gagatctcaa ttcccgtcgt gggcaaattg aggggatggg 5341 atctgagcag ggccttgcca aagtgactgc taaagtccca ttggcagaaa tgtttggcta 5401 cgccactgat atcagatcaa agacccaagg tcggggcatc ttctcaatgg agtttagcaa 5461 ctatgaagaa gtgcctcgca acgtggctga ggcaatcata gctaaaagca aagggaacgg 5521 ttagttagta aaggaaatta gcattaatgg cacgcgcaaa gtttgaacgg aataaacccc 5581 acgttaacat cggtactatt ggacacgttg accacggtaa aactacgtta acggcagcca 5641 tcaccatgac cttggcagca atgggtcaag ctgtggcgaa aggctacgac caaatcgata 5701 acgcaccaga agaaaaagcg cggggtatta ccatcaatac ggcccacgtt gagtatgaaa 5761 ccgagaagcg gcactatgct cacgtggact gccccggaca tgctgactat gtgaagaaca 5821 tgatcacagg cgcggctcag atggatggtg ccatcctcgt agttgctgct actgatggtc 5881 ctatgcccca aacccgcgaa cacatcctgc tggcaaaaca ggtcggcgtt cccagtctgg 5941 tgatcttctt gaacaaggaa gacttaatgg atgacgaaga actcctagaa ctagtggaac 6001 tggaacttcg agaattgcta tctgactacg atttccctgg tgacgacatt cccattatca 6061 aaggctctgg tctacaggcg ttggaagcaa tgactgctaa ccccaagaca cagaaaggta 6121 gcaatccttg ggtagataaa atctacgcac tgatggatgc tgtagatgct tatatcccca 6181 caccagagcg cgatgtagat aagcccttct tgatggcagt ggaagacgtg ttctccatca 6241 caggtcgtgg taccgtagct accggacgta ttgagcgggg taaagtcaaa atcggcgaca 6301 acgtagagtt agtaggcatt agaaatactc gcagcaccac cgtaaccggt atcgagatgt 6361 tcaagaagag tcttgaagaa ggtatggctg gtgataacgc cggaatactg ttgcgtggta 6421 tacagaaagc tgatatagaa cggggcatgg ttatcgctaa gcctggttca 
atcaccccgc 6481 acacacaatt tgaaggtgaa gtatacgtct taacagaaaa agaaggcggt cgcaaaactc 6541 catttttccc aggctaccgt cctcagttct atgtgcggac aaccgatgtg actggcacaa 6601 tcaaagagtt cactgctgac gatggcactg ctgctgaaat ggtgatgccc ggagaccgta 6661 ttaaggtgac tgtggaactc atcaacgcga tcgcgattga gcaaggaatg cgctttgcaa 6721 ttcgtgaagg tggtcgtacc atcggtgctg gtgtcgtttc aaaaatcctt aagtagcagc 6781 tttgcacctt aactaaaaaa ggagcagaga tggtataata tccactctgc tcctttcgtg 6841 tccatgaaca cgttacaaca attccacaca gaaacgcact ccttcaaggg gtattggaaa 6901 gcttatttcc caatcgttcg tctggagtgg tctctaatcc ctaaatctca actcaccaga 6961 acctggaaaa ttaaagatgg caactctaca gcagcaaaag attagaattc gtttacaggc 7021 ttttgaccgc cgcttgctgg acacatcttg cgagaagatt gtagacacag caaaccgcac 7081 caatgcgaca gccataggac caattccttt acctacaaaa cgccgaatct actgtgtgtt 7141 gcgatcgccc cacgtagata aagactcacg ggaacatttt gaaacccgta ctcatcgtcg 7201 gattatcgac atctaccaac cttcttctaa aaccattgat gccttgatga aactagatct 7261 accatccggt gtagacatcg aagttaagct ttaatgtgaa ttgattatag cggttctcat 7321 ttgaatcaca tacatcacca cgaagaatgt gaaagtgcat gcactttcac tacacccgtg 7381 tattgcacgt aacaccagaa acgctatatt tgctagtgct taggctcaac agacaactgt 7441 tggcaacgca ctcgcttacc aagaactcat aattattttt gtataaatca taacatagcc 7501 ctgaaaaaat atatttcggg gctttttatt agcattaaaa tcaaaatgat agaatgtaga 7561 aaaatgcggg aaaaagcaga gaagaaaagc tttttaagat taaatttata ccaaggtaac 7621 aatgacatcc tcttctaaaa ttgcagtttg cgaactacct ctgttcccgt taccagaagt 7681 agttctattt ccaacaagac ccttacccct gcacattttt gaatttcgct accgaattat 7741 gatgaataca attttggaga gcgatcgcag gttcggggtt ttgatggtag atccgatgaa 7801 aggcacgatt gctaacgttg gctgctgtgc agaaattgtt catcatgaga gactaccaga 7861 tgaccgcata aagatgtgga cattgggtca acaaagattt cgtcttttaa agtatgtccg 7921 tgaaaagccg tatcgagtag gcttggttga gtggatttca gacaaccctc caacaaaaaa 7981 tttgcgacct ttggctgctg atgtagaaca attactaaga gatgttgtgc gtctgtcagc 8041 taagttaact gagcaaaaca tcgaactacc agaagatttg ccaaatttac caacagagtt 8101 atcttactgg gtagcaagta acctttatgg tgttgctaca gagcagcaga cattgttaga 
8161 aatgcaggac actgctactc gtttagaacg ggaagcagaa attctcactt ccactcgtaa 8221 ccacttggca gctcgtaccg ttctcaagga cacctttaat cagaagttgt gagttgtgaa 8281 tggttagtcg ttagtggtta ttagtcgttt cataacaact aaatattaac cactaactgc 8341 ttcactaaga gcgttaattg gtaaaacatt gtaactgcca agaattttta atatctctgt 8401 ataggaagat atttctgcta gagcagattg cgcttttggt tcggatgcat cagcttcgag 8461 atcaataaaa aataggtatt ctccaagaga acgctttgtt gggcgagatt caattttact 8521 gagattaaga tttagacgag ctaatacctg tagaggtttc gccaacgccc ctggtatgtt 8581 agcaggaaca ctaaaagcaa tggatgtgtg acgagcacat tctgaaacag tggggtggga 8641 gattggtaaa tgattttgac tgaccaccca aaaacgagta cagtttccgg gatagtcgtt 8701 aatcccgctt gctattatgg gcaagttgta gagttgcgct gcccgttggg atgaaatacc 8761 cgccgcagtt aagtcttgct ttagttgcag tagtgcttct gtgttggaat tcattggaat 8821 gagcgttaca ttgggaagaa accgctctaa ccatagctga cattgtgcca atgcttgtgg 8881 atgagaataa actgttttga tattttctaa gctttgagca caagaaatta atgtatgaga 8941 aatgggcata accaaagcca actgaacttg caaactatct agttgccata atgtatccag 9001 tgtcatggtc acactgcctt caatagaatt ttccacaggt acaacagcca attgtgcctc 9061 tttttgggca acggctcgta atgtctgagc attgctggga taagggcata aaacagcttc 9121 aacccccgta ttttttgtca accagttgag ataaaaaaga gctgcttgtt ctgtgtacgt 9181 gccaataggt cccaaatgtg caatcgataa attcatgaag tttagttttg tgtggatagt 9241 tcaacactca ttccgcaacg cgagtgtgaa gtcagttgac gtccgcaatc aagtggaggg 9301 aattttattt atattgagtg ttgtgacttg ttatcagaca taataaactt ctttccatac 9361 aaatgagtta tgactcataa cccttatggg tatgccagtc gctcatgggg gaaaccacgc 9421 ggctgtgcta ccctacggga agccgcccag tgggtacggc cacggctacg ccgtgagcgg 9481 cgtctacaag tcgggaaact ctagccgagc gctggctccc caagacgagc cactgcttga 9541 tgagggtttc cctcacttgg catgtggcgt ccgcgctggc tcacttatga ctgatgacga 9601 acacaaatgt ttaagaaaat attacaatta tttaacagat attttaaaaa actcatgtct 9661 actcgattta ctgcctatca atcaatcgaa attcctatcc cagaacagcc cattccgatt 9721 cagcattact tacgtcaacc tcaacgcttg gtcaacgctt tagttgaccc tagccgtatc 9781 caacaacttt cagaagaagt atttcggttg aaaatgcgtc ctctgaactt tatgtcactg 9841 
agcattcaac caactgtaga catgagagtc tgggctgaat caaatggaac gatttatcta 9901 cgatcactaa attgtgaaat ccttggtgta gagtatatca accagcgctt tgcattgaat 9961 ttgaaagggt atttattgcc ggaacagcaa gttactagca ccgtgatcaa aggaagagcc 10021 gatttagaag tgttggtaga tttcccccca ccattttcct atacgcctaa gccaatacta 10081 gaagcaacag gcaatggttt actaaaaagt gttttgttga cagttaagca aagattacta 10141 catcaacttc tagcagatta ctcccgttgg gtgatattgc aaacccgaga aaaagcgctt 10201 gaggataaaa gtatgccgct cttcaaccca gagcaattat gaagaagttt tagtcttttt 10261 gaacacacaa ttgtgatcaa ctacagatgt acataaatct atgcagatgt cacgttattg 10321 agttaattct caatagatta ggatcagcta gttccgaata tacgcttgtg aacaagtaaa 10381 agctgtctgt gttgatagaa acaccaaagt agccgttatg aactgcgtac atcaggatac 10441 ataaaaagac ctcttaagct gttccctgtt aagagttccc tattttcaag ctagataaag 10501 tagttgtcgt gaacaactat ggataagttc tatgtagcag aactcagtac tgtgatttga 10561 caaggacgaa aggatttgcg atgtaatggt gaaggtccat acttttgcag tgccatgaga 10621 tgtcgctgac ttccataacc cttgttgcgt tccaagtcat acatatagta cttagaagcg 10681 aggcgaagta taaggtcatc acgccaaact tttgcaataa tactagcaga ggcaatcgca 10741 atacagcgct catctccctt gactatcgtt tgttgtgtca gtggcaagtc ttttactaac 10801 tgattgccat caattaaaca caaagcaggc tgcaccttta acttgagaac agcccgcttc 10861 attgctagca ttgttgcttg aagaatattc atctggtcaa tctcagctgt tgaagcaaaa 10921 ccgattttcc agtctatagc cagcgcacag atttgttgcg ctagccgcac tcttcgagaa 10981 ctggatagct ttttactatc tttaatttca gctgctgtga gttgtgacaa agcgctctct 11041 ggtagtatca ctgctgctgc taccacagga ccaaatagag cacctcgtcc tacttcatcc 11101 acacctgcaa acagcccttg agtacttgac accatggaaa aatctagcca cctcatctct 11161 gtgttgggtg ctgacaaccg agtagaagtg acagttttct ccgtcttgat cataagtttg 11221 ttagctttta gatgcagttt atgtatctgt ctatacagag gaggaacgaa ggcgacggcg 11281 gcgcttatta cttactctgt gttgggtgct gacaaccgag tagaactgac agttttctcc 11341 gtgttgatca taagtttgtt aggtttcaga tgcagtgtat gtatctgtct ccacagagga 11401 ggaacgacgg cgacggcggc gcttatcact tacgctactg gttacttcac tttcatcttc 11461 tgtgattacg ggaagatccg atacatcatc cgatacatca tccgatacat 
cagatgttgt 11521 ttctggaatt gattcgcttg gtagatcggg tgtgactaca ggggtgactc ttggagagac 11581 aggtgattct gaggttgatt cagtcggtgt tgcaggactg tgtccaggta gggtgacgtt 11641 gataattaca gatttgggat ttctgacctc ccgattcaat ttttccaacg gagaaactcc 11701 catctgggcg tagacatctt gttcctgaga cgacatttct acagttacaa tctccggtgg 11761 ttctaccact ggcttgattg gctctacctt tgtcaccact ttatggctac gttcgatgac 11821 tctgtcacta ccccaagaag gtttgctaat actgggtgag gagacctctg atggagctgg 11881 gagttcaggc tcatcatcta tttccaagtc tggctcgttc agaaacgcca aagggttatt 11941 agccccaacc cgtggctcat ctttgccatt tcccccattt agcctttcta cgcgcgtact 12001 acgccgagta cgacgctttc tgctatcggt catttcttgg tagctgggat gattgaccag 12061 attcaaagca gctaaatcat tgtcactctc gtatgcttcc ccatatcctt ccaaggtttc 12121 ccgtgattca gtcggtcgag taactggtag acgtggttct ttataggacg aggaggatgc 12181 aaagcggtct ggtaattctg ttggtgcgag gggtcggctt tcagtttctc ctggtaagtg 12241 gacaatgtgc cctaaaccgc cgcaagtgga acaaactgtt ccaaacaatt cgtaaatatt 12301 ttgaccttga cgtttgcgag tgagttctac taaacctaac tcagtaagtt gagcaatctg 12361 gggacgagct ttgtctgctt ttagtgcttt gttgaaatgt tctagaacat gcagttgatc 12421 acgccgcgat tccatatcaa taaaatcaac gacaatcact cctgcaacat ttcgcaaccg 12481 cagctgacgg gctatttctg tagcggcttc acagtttgtc cataaaacag tttctctcgc 12541 tgttgccgat cgcgtgaatg aaccagagtt gacatctatc actgttaatg cttctgtcgg 12601 ctcgataata atgtagcctc cagaaggtaa atctactctg ggcttgagag cttctttaat 12661 tgcagcatta atacggaagt actctaaaat tggtgcgcga tcgcggtgat ggtctatcaa 12721 caccccctgt ggtgtttgtc caccactcca gttttgcaag tactgcttca ctcgcttcaa 12781 accagtactg gagtctacca caattctgtt cacatctgcg ccgtacatat cccttagcac 12841 gcgctggata aaatcatcat cccgattcag cagcgctggg ggacgtgttg attgtgcttc 12901 ttgctgaatt gcctcccatt gcttttgcaa caattctaaa tcttcgataa tcgcttcttc 12961 tggtttgcct tcagcttcgg tacgaactaa caaacccatc cctgctggct taatcaaaat 13021 tgccagcgca cgcaaacggt tacgctcgct ttcactccta atccggcgtg ataaattgac 13081 tccccgaccg tatggcatta gtacaacgta gcgtcctggc aaggtaatgt tcccagtgag 13141 ccttggtcct ttagtacctg ttggctcttt 
catcacttgc actaatactt tttgctgcgg 13201 tgttaataat tctgtaatag ctgcagctgt acgtcggagt cgtagtggtc ctaagtcagt 13261 gacatgaata aaaccattgc gctctggatc gcctatatta acaaaagccg catctatccc 13321 aggtagtaca ttttctacga ctcctaggta gatatcaccg atttggtgat gtcctgtggc 13381 tacaacgagt tcttgtattt ggtcttcaga aaatactgca gcaatttgat gctgttccgc 13441 gatgataatt tgttttggca ttcaattatt gtcctcaaaa actggcagca cagatagact 13501 tggcagtttg ttatacctct cgccccaaca agcccgctta tgcgctgctc gtaaaaattg 13561 cgcttcttcg ctcaaaccta gaaagctatt tacgcgctcg attgcgttga caagcctgac 13621 ccatctagaa cggttgcata ttatttaacc aattaaatga accgtccgtg gttaattttt 13681 gatgcaatac cctttcgctg gaaagtgctg agtcctgagt tatgagtctt aaggagccac 13741 tgcgctcttg gggagctaat tccgtgtgag ggtttttccc ccacccttgg aaactggctg 13801 acctccccag ttatagcagg cggcgttttg tgagttgcta agccagtgcg cccttgggga 13861 gccagtgcgt tgcgggggtt cccgagccag tgcgttgggc gggtttcccg acttgaagca 13921 cctggcgtcc cgttgaagca cctggcgtgg tttcccctgg ttataatagc acttgccgta 13981 agcacagcgt atccctttgg gacttacgcc tagcctctgg tggaggagat acccgaaggg 14041 gttagttctg ggttctttac actcaggact cgccactcgt gactcaacac tatctcaaca 14101 ctcggtactc gttactcaga ctacctaaat tgtagattgg gcattcaatt aattttaaaa 14161 atcatagaat cagagaagcc agcgagccta ttgttggtga ttgctgattc acgagccgag 14221 cggaaagaga gtgttattaa tctgcctggt gtttgtctta gataaaaaac ctagaagttt 14281 cactctaata tctttgcttt tgcgccctga gtatcagggt agtgaatcag tccactcaat 14341 gcagcgccag aaaatttctc ttcattccat tctaacgcaa ccttgttaaa aatcaattac 14401 ccaaatgtac tacaataggg caattactag caaattgccc ccaaaaaact aaactgccaa 14461 aattagccga ttccggtgaa tttgcagcag attgagttct atacctgcta cttgttctag 14521 cataaacagg acttgttcgg gacgcaatag caccccgtcg tgacgacagc tacctacata 14581 tcgcagggta actgtaggct gttcctgatg ctctcggtgt tttggcaact ctactaattc 14641 taactcaaat aagcgatcgc gcagatttac caactgagtt ttgcctgact tggttgtgtg 14701 ttcccaccaa aactcatctt tggctttgat ggcttcaatc caaccttgcc attctacagg 14761 tgtagcctca ataaggcaag ccacagttat gagatactca ctcgcctcca gagcttggtt 14821 agctgcactt 
gcttttaaat ctagctgttg cacctgatat ataggaatat ctaatggcag 14881 agcatcagct aatttttggc gaaaagtctc caagtcaact ggttgcgtca gttcaaaatc 14941 tacaatttca ccactgcttg tagttcccaa aggcaaagca tgggcaatag aaatgcgagg 15001 gtttggatga aacccaccag taaaagaaat gggcaatcct gctcgtcgca cagatctatc 15061 aaataagcgc atcaaatcta agtgtccgac taaagccaaa cctccttgct tcccaaacca 15121 cacacgcagc cgttgtgctt tgattgtatt tgggacaaac tcgccagcaa attctgggat 15181 agaaggcggc tcgatcacaa cattatgacc aaagtcagtg ccacacacac cacagtgaga 15241 gcaaccttca aaagagcaat ctggtacagt tgctgcttct aaagcgcgtt gcaaatcttc 15301 ctggagccat tttttgtcaa taccagtgtt aatgtggtcc caaggaaggg gtgaatcgag 15361 ggaggggggt gagagagtga agcagtccgt tgcggaggtt cccgaggcag tgcggtcttg 15421 gggttccacg ccactcccct caagtcggga gacccgccca cgggggtggc tccccaagtg 15481 gagcacctgc cgttccgttg taggaactgc tgttcgccgt aaggcgtgcg cggagcgcac 15541 acccgtaagg gtgagggctt cgtttccgta gtcgctcgtt tcctcctttt ttttctcttc 15601 ctctgctgtt gttggaaaca agttccactc gcccttctcc acttggcgat atttccaatc 15661 taaaccagct tcaaagatgg cttgttccca agctgagaat gctttttcaa cactctcaaa 15721 ccaggaatcc atccctgctc ctaattccca agcacggcgt aatactgggg cgagttggcg 15781 atcgcctcgt ccgataaaat cttccatcgc tgaaatacgg acatcagtga aattcacctt 15841 tacgtccttg atgcggcgga atgcttgccg cagtaactct tgcttgcgct taaattcagc 15901 agtggaaaca gagtgccact gaaaaggtgt atgaggcttg ggcgtaaagt tagagattgt 15961 taagttaaat gacagtgctc ttctactttt cgctcgacac tcccttttta accagcttac 16021 cgtttctgct atgcctaaca catcagcatc tgtttcacct ggcaagccaa tcataaaata 16081 aagtttaatt ttgtcccagc cttgttccca agctgttttt acaccccgca gtagctcttc 16141 atttgtcaaa cctttgttga caatatctcg catcctttga gttcctgctt ctggagcaaa 16201 agtcagtcct ccttgccgca caccaccaag gatgttggca atattttcat caaatctatc 16261 aactcgttgg cttggtaaag aaagagaaat attctcatct ttcagtctat tcttgatttc 16321 catccccact gcgggtaggg ataaataatc agaacaactt agggacagca aagaaaactc 16381 attgtaacct gtttgccgca ttcccgaatc tattgcttcc accacctttt ctggttccac 16441 atctcgtgct ggtcgagtca gcattcctgg ttggcagaag cgacagccac gagtacaacc 
16501 acgcctaatt tcaattgtca ggcggtcatg cactggctgg atatagggaa ctaaccctat 16561 agaataggct ggcattgggc ttgctatgcg ccgcacaact tgttttggca catctgggtg 16621 gttaggatga actgacccat cctgtgccat atcataaaac agaggaacat agacgcctgg 16681 tatctgcgct aaatccagca acaattcttt acgactcagt cctgcttttt taccttcttc 16741 caaaactaag ccgatttctg gcaacagttc ctcaccatct cctaaggcga agaagtcgaa 16801 gaagtcagcg tatggttccg gattggatgt cgcggtctgt cccccggcaa aaatcagagg 16861 gtagttagag gataggggcg tacggggaga tgagaaggaa gatgttgctc cctcatcttc 16921 ctcgtctccc tcatctgatt ttgtctccac ctccagacgt tcttgccaag ttaggggaat 16981 tccagctaag tccaacattt ctaaaatgtt ggttgctccg agttcataac tgaggctaaa 17041 gcctagaatg tcaaattctg ttagtgctcg cctagattct acggcaaaca atggtgtctt 17101 tgttgcccgt agttttgctg ctagatccga tcctggtagg taagcgcgat cacataactg 17161 acgcggttgg gcattcagta tattatagag gataatgtgt cctaaattgg atgcgcccac 17221 ttcataaatt tccgggtagg ttagaaccca acggatttct gccgtatccc aaggcttatg 17281 aactgctaag agctcgttac ctaggtaacg agctggtttt actatatccg atgtgattaa 17341 cttttcaact gcaacactca ctgtgctatc ttttagctac cttaattaca gctactcact 17401 tttaacctta gcattttaat ggtgtttcaa cagactcact gatttttcct taagactttt 17461 ttagacatga gttgagtatg acttattgca tcgggaagca agctgcttca aatgccgctt 17521 taccttcaaa tatctcaaaa accctatcta attgagtcat ttcaaatata attctcaaac 17581 agggtgaaac tgagcagagg ctgaagcgtc gtcctagttt gtgagatagc ttgagtgcag 17641 acactaaagc catcaaaccg gcgctgtcta aggattccac atactctaag tctacgagta 17701 agaaagaaat cccgtcttgt gccagtgctt tcgttatatt tgtttcaaat tctaaagcgt 17761 ttgtggcatt taggcaacct tgagggctaa taacagttat ttttggatag tctagtacta 17821 cttgcatcgt tttaaagtta ttgaacttta aggttagatg tatttgacca taaacttttt 17881 cttctcacat cgaccattcg tcttagttct ctttgctgaa tctttattgc tgtagagtta 17941 ctgtttaaag attgtcaaaa accttagtgt aactgtattt caatatactt tttttttcga 18001 tttaggaaaa tataatcttc acaaattgcc ggtgaggcga ggataaacag ctagtaccgc 18061 tactttagaa gtcaaaattt aaaaaattta tatgacaacc ttttcgctct tcgtttaatg 18121 gtatgtttat ttctgtgagc actgtgctag ttatagctat 
aaactttatt tatctataga 18181 gcttcaattg aattggtaaa ggaagggtta accttacaat cgtcactttg ctcataaacc 18241 tatcttaaaa ataagacaca caggaaatca ccgtaggctg aaagctcctc cgtgagaacc 18301 ctaaagcttt ccgttgttca aagccgtggt agaagggata gagattttca tccgcttcat 18361 cactcgtact gaaaaactgt ggtgcctatt tagtttgtta actgcttagc ttaagtagat 18421 ttttggcttg tttcactcaa ccctgctgct tgccgttttt ccacctgagc aagaagtttg 18481 ttgatcattg acagatagtt gactcactaa tgaactatgt tttcagattt ttctttccag 18541 attattgtga tctcgatcac ttacataaac agcttttgat cacgttttat acctaatgat 18601 accatagata atgattacct atatactggg tgtaatcaga atgacaatga aatactctat 18661 tcgtcgatta gtggaaaaag ctcttgatat taaaaagttg actcctgaaa tagaaaatga 18721 aatcaactca gaattgacac aaatgggtca catttcagac gttgactatg aagctttaga 18781 actcttaatg gcagagatgg atgctggtcg tattcaattg gttccaagtg caggttgttt 18841 cttctaagtg cttgaattga cagcttttta ctgaaaatta agtaaactta agtaaagact 18901 atagtattaa gagggaactc tgaactctga acaggcaaat ctccttgacg taaaagtagg 18961 ggaaatgatt gccgacttct tgtttcattg cactacgtgc cttgaaacaa ttcttttctt 19021 ctgttcataa ataaataatg gtgctaaaaa cctaactttt gggacaagtt ttcttcactg 19081 ttaagcgtta agcattccct tgtccggata gcgtcccaac cgctgtgtcc ccttgggacg 19141 agcgcgtggc ggaacgccat agggacaaga gaacccaact tcaatagcag gaaaaactaa 19201 aattagataa attctagtat caataatcac gcttgaatct tgtcactact gatacttttt 19261 tttaggttaa atctataggt tatagcaagg cgtaaactga aaagacagtg ctttcacatt 19321 tgcattagtc agaagtcatc accatgcagt ttgctcaacg cttagaaaaa attcctcctt 19381 atctatttgc cgaaattgag cgcagacacc atgaactagt tgctcaagga attgacatca 19441 ttaatatagc aaagggagat cctgacaagc caacgcctgc tcacatcatt caggcaatgc 19501 atgaagcgat agatgatccg tcaactcacg actacccacc ttaccgaggt actcaagaat 19561 ttcgcaaggc aactgccatg tggatggaat accgatttgg agtaacagga ttaaacccag 19621 aaacagaagt tatctcttcc attggttcaa aggaagggat tcacaatacg ttcttaggct 19681 tcgtagaagt tggagattac accctaatac cagatccagg ttatccagtc taccgtactt 19741 ctactatctt tactggtggt gagccttatg ccatgcctct aaaggcagaa aataaatttt 19801 tgcctgatct caatgctatt 
cctaaagaag tagcgcaaaa agctaaattg ttatggataa 19861 actatcctaa taatcctact ggggcagtag caacattaga attttttgag gaattggtag 19921 ctttttgcaa gcagtatgat atcctgttgt gtcatgacca tgcctactca gaaatagcgt 19981 acgacggtta caaaccgcca agcgtgctgc aagttccggg agcaaaagaa gtggcaattg 20041 aatttcacag tctgtctaag tcatacaaca tggctggttg gcggattggt tttgttgttg 20101 gtaatgctac tgctattaag ggtttaggac aggtaaaaac gaatattgat tctggagttt 20161 ttaaagcagt tcaagcatgc gcaatcgccg cttactccac agatattaca gaaatccagt 20221 ctagagtctc agtttaccaa aagcgtcgcg atatcatcgt taaaggattg caatctttag 20281 gttggtgtat agaaactcct aaaggaagcc tttatgtttg ggttcctgtt cctcaaggat 20341 attcctccaa agaatttgtg acattgctac ttgagaaatg cggcattctt gtgcctccgg 20401 gcagtggtta cggtgcagca ggagaaggct tttttcgcat agccctgact gtttcctcag 20461 aacggatgga cgaagcaatt caacgtatga gagatgcagg cattcgttat cagtgctgaa 20521 tgagaaatac taagttgtca tttccacctc aacactcaga actttctttt tgtgcttgcc 20581 aaatttcgtg aagaaactga ttgagttgtt gcttggcatc taaaatctgt cctggttttg 20641 gttgttttgc caaaacctca cttaaaaaac ctatgacttc tgtgtctaag gcaggttgaa 20701 ataaattgat acaccaaagc agatcagaag catctcgcag atcaatgatc tgcggcattt 20761 cttgagtgaa ggcaaccagc agccaaggac gtacccaaca caactgacgg gaaacgacta 20821 cttgaatgac ttctgcgtac agcctcctgt cgctatgctc taaatacaca atttgtcctg 20881 ctttaaactc cacactcatg ttcatgcagt ttaggcgcat ttttatcaat tctaagagaa 20941 taaggtgtca acacacacca aagccatgca gctatcaacc cagaagcgca aaaattttga 21001 ctaaccccac atttgtatga gtatgctata aaatagataa ataagtgtta agcaatattt 21061 tgacaaatat tctgctgtac taattcagcc aaaaaagagt aaagagccgt gtctgtcgaa 21121 actttacaga agaattcaac atctcgcaag atcgccccta ggtatcgcgt tttactccac 21181 aacgacgatc acaaccctat ggagtatgtg gtacgggttt tattaaccac agtgccaaac 21241 cttacccagc ctcaggctgt tagcatcatg atggaagcgc atagcaatgg gtttgcctta 21301 gttattactt gcgctcaaga acacgctgag ttctattgcg aaaccttgaa aaatcatggt 21361 ttgaccagca caattgagcc tgaggaatag tagggctatt tgaattgaaa aaagttttac 21421 tccgtttacg ccaacgccct gccccaataa ggctgggctg ttttatatta actttggtgg 21481 
gaatatggtt gcccatagcc gtacctattt atttgttggt agatgattcc aatttagtca 21541 ctatcttgac tatggtactg ctggcaattg gatttttcgt gcttctccct atatggaaca 21601 aatatgtata ccagcagcga cagatatttc ggcactatgg tttagaaaga acacgcctca 21661 ataaagtgga actggtgcgc ggcttggcta ttgggctgat taccattctg attctattta 21721 gcttagaagg actcctgggt tggttggtgt ggcaaaaacc caatattttt ttgctcagag 21781 tcgttttaga agggttaatt accagtttgg gtgtggcgtt tgcagaggaa ttgttttttc 21841 gggggtggat attggatgaa ctgcagcggg attatagccc ctctgtggta ctttggacag 21901 atgccaccat ttttgcgatc gcccacttta ttaaaccact gccagaagtc attcgcacct 21961 caccacaatt ttttggctta ttgttgctag ggttaacact ggtatgggca aagcgaagtt 22021 ccaggggacg cctaggttta tcgattggtt tacatggtgg tttagtttgg ggatactaca 22081 ttattaatgt tggcggattg atgaaatatt ctcgtcaagt tcccgactgg gtaacgggag 22141 tgaatgataa tcctttggca ggagtgatag ggttggtgtt tttgggtggg ctggcgttgt 22201 ggatgagggg gagggcagtg aaatttgaat aatcaaaaat ggagttttgc tgggttacgg 22261 acaaagtact gacacaccct actataaggg attgcgaata tagccgcccc tgagaaaacc 22321 cataagcatg atcaaattgc ccttgaatga ccaaatacta tatttcttta cctccatcac 22381 caaattgtgt tgatgctgga tagtttaatt ttgctgtccc gactaatcac tggcactccc 22441 agatgcaagg ctgtagcagc gacaatgcga tctggtaaat ctggcacttg tgtgcgctca 22501 actaaacgca aagcttgggc tatatgtcga tcaaatggaa cttctgctag tacagcatca 22561 gttgtgtcaa caactgcaag cagtctctca agcgtcagtg gagaaatacg tcctttctcg 22621 cttaagtaga caatttcagc tagggtaatt gaggagaaag caacttgatc gccagcagat 22681 gcaatttgtg cgatcatatt tcttgcagta atcgagagtc gtgcatcagc aaaaatatac 22741 cagataactg cgtgggtatc agcgacagca cgcagtatta aatgtcctcc cgtgagaagt 22801 tagcccattc ctcactacgg gctacatcaa tttccgattc tgaaggtgct tgtcccaaat 22861 cagcacataa accccacaat aatttacggg gagtaggcgg gttgtcgatt aactcgcgtt 22921 ctatttcagg agcaatttgc tgaatcaaac gcaccttgtc cactgttgaa agctgcaatg 22981 cttgcttgag aacttcctgt agactcataa ctttttcctt acgaaactat ctcaaccaat 23041 gttcctacca aacaagctcg ctccaaattc acgccactca aattagcacc atctagttca 23101 gcacaaagta aattcgcacc cgtcaaattc gcacccgcaa gattagctcc 
ccgcaagtcg 23161 gcttcttcca aatttgcctc actcaacaac gctccttgca aatctgcacg actcaagtct 23221 gcacctttga gattcgctcc agtcaagttc gcacctcgca agtcagcacc tcgaaagtca 23281 gcaccttgca agttaacatt catcaaattg gcaccactca agaaagcacc tgcaagcgag 23341 acttcactca gtcttgctcc catgaaatta gcaccacgca aatttccacc ccgtaagtca 23401 gcacccgtca gatctgcttg cattaaattt gctcccataa ggtttgctcg caagtcagtt 23461 tctacaagat tggctcccat caaatttgca ccttctagat gtccaccttc aagtttggaa 23521 cctgtaaaaa gtgtgccaac aaggtgagca ccagcaagat tgattcgact taaatcagag 23581 tttgataaat cttcgtcctc taaatttgtt cctggcagat ttttgagttt tcctaatcga 23641 atttcttcta tgttcatata cattatgcat tatcaagata attaaatttt gcacacagac 23701 acaaaggcgc aaagagtttc tcagcgtctg tgagcttttg tatcaaatga ttatttgccg 23761 taactgcgaa tctgctcacc agttttgctt tcgctctcta acctgctatt gagtagccac 23821 atgctagcac tgactttagg cgtcaccaga gctacatcaa gtccctgttg taaacccatg 23881 accaataaag cgagtgtatc tacaagttta ggatcaaagc gatacgattg cagttgcctg 23941 cactcttcta aagcttgagc aaatatctcc tcgcgactta actgagacgt ttttttctga 24001 ttgagtcgcc attggaaatc cgcaactaat gccatgattc tcgattccag aggaatttca 24061 tccccagcta accctgctgg ttcacctgta ccatcccacc actcgctttg atgtgtgata 24121 atttgggcga tcgctcgcaa ttgtggcata gttcgcagaa cttgcgctcc cggtactaaa 24181 ggacaagtca acggacaact gggtgcatct tcttggtagc gcgtagaagt tccaggagaa 24241 agaatgcttt gtgctctttg caatggatct atacggtgca gcaaagctgc aagacgtaga 24301 cgtttcatct gccatgcggg aagatccaaa agttgcccta ttgtttctgc tagcgccacc 24361 acttctgctg ctgccattgg attggtaata tcagtagcat caataatttg cgccatccgc 24421 aaaaatgctt gaatttcgtt ggaaaccaga ttgcgatcca gtgcttggtg gcgagtcgct 24481 gtagggacgg atatattgac ttgccctgtt tggagataat ctactacctg agaaacaact 24541 gcactcaagt cttctggtgt agcaaggcat ggctgaattg cgtctttgtg tgccttgagt 24601 ttttgtgcca gttctgggtt atattgtccg atgtgggaaa ttgccaattc tgccgtttct 24661 ttgactaatt ctgcctcaaa tgtccacaag ccatagaatt tccgctccac atccgatgca 24721 ggtactccag tagtaccata atcagcttct gatagttctt gacaaatgac cattgctgtg 24781 tatgcaggtg ctataataat caagtgccac 
tcctgcgcca ctggatctgc tgactctaat 24841 cctaccaagt tcacattggg taactgactt gtgggatgtt cagcaaaacc tgcatcagga 24901 gaagccataa tagcaatttg acaggatttt tgagcaatat ccgcatatct ttgggcttct 24961 tgcaaatacc atttcccctg ttggaaggct gtgataacta aaggtgtact gtcatcagtt 25021 aaaatgtggt cttccagagc atgacacagg gaaaccaggg tatttttgta gtaaacgccg 25081 tattgaattg gtcttttact atggcgatga gctgcttcta gcttttgtaa aatagagcct 25141 tctaacatga gtttaatgaa aagaatcgca gacgaacaga cgaaagggga gcaagtttcc 25201 ccttatcccc gtgtcttcta tgatacgctg acgagtggtt tttctttcgt ctcaattggt 25261 gcagaaagtg cttcttctaa atcagcacaa cccactcttt cctctaaaat tttcataact 25321 tcgcgtccaa agtcgttggg gttgcgtcgc catgcttcta agcagacttc accaaagaaa 25381 gaaccgagag gttcggggtt ccagagaagt tttctagcgt tccagggcat atggctcatt 25441 ggattgtacc ccggtttaac gattccttct ttaaaggcat actcttctaa atgggtatga 25501 ggttgtagcc cgatgaagaa aatggcgggt tcgactttgt cagccccaaa aattttttct 25561 aattcgcggt ggtaggcgat cgtttggcga atagtttcgg ggcgttcgtc aataacgtta 25621 aaggagtagt taacagagac taagtcgtta aatccagcag cttttaagtc gcgacagttt 25681 tgtaagactg ttcgcaggtt atatcccatg cgcatcttac gaacaagttc ttgagaacca 25741 ctcgtaatac caatttcaaa gtagttcatc ccagttttca ccatcaactc acacgtcttt 25801 ggtgtgaggt tatcagcgcg gatgtatgct gcccagtgaa tgtcactcat tccagaatcg 25861 acgattttct gtaacagttc ctctgcatct tcaataaatt ttcgggcggg gatgaattgg 25921 gcatcggtaa accagaagtt acgtatgccc cgattgtaaa gttgtcgcat ttcggcaaca 25981 acttcatctg caggattgat acgtacctgt ttaccttcta caaccgtgta aacgcaatag 26041 cagcagttgt ggggacaacc gcgtttggtt tgaacaccga tgtagaaatc tccttcctgt 26101 aggtaataat taagctccga ccagatgctt tcgatataat cgtagttaca agcggttttt 26161 tccactggag tcggttgttc gtggatgagg cgtgtacgtg gttgagattc tcccactatg 26221 taacaacgtt catctataaa ctctgttcca gttaaaagtt tttctagcag ggtttcgccc 26281 tcacccacag aaacaattgt cccctttggc aagcttttac caagctgttc gtaaaataca 26341 ctcacagcac caccacccac aactaagcgg atatctggat gatatttacg ggcacgtttt 26401 aaaccacgtt tgattaatcc gaggttacgc cacagttctg tatagtaact tgttaacatc 26461 cgcaagccgc 
caaaagcacc gtgtaatttg actaagggat ttttggcata gaaaatctca 26521 aaagaatatt gtaatgggtt accaccacgt ccacctacag gggcataaat ttggatatcc 26581 cgccaagaga aaaccagcag tgttggttta aattcatcaa tacactgatc tagggcaaag 26641 gcataatcta aaggtggtac cgtccccaaa tcaaaaatcc gctgttcaat tgtgggaaac 26701 aacttgtgga cgtgatctgc aaggtaaaca accccaatag gaaaaatggg gttacaagga 26761 aggcgaacat agagaattcg gttgtccatt acaggtgttt taacttccat cttgctgttt 26821 tcaagggaaa gcgagaacaa tatctttata aagaaaagtt tattttgttt tttgtcctat 26881 atcttaatat tacttgacat tcactcagat tattgattag tctattgtgt ttattgggat 26941 cggtgaaaat agctaaacgg attttgccat cagatttcta gtctgctgag gcgagatact 27001 ataaaaattt ttcatacata tagctttaca ataacaccaa ctttctttgc caagcgatga 27061 tccacataca ttatttgtgg tttagaacat tagcaaggac acgtcaatca cacaagttat 27121 gatttttttg tgaaccaggg aacattatat ttgtgcgact ataatgtaca attgcgtaaa 27181 aggcatagat ttttacaaaa ttttcatatc atccgcatat cttaagatag aaatcatcta 27241 aaaaatacca gttggtagtc ggggttacag cagctgggga tcacccagtg ctacgttggc 27301 tggtgtaata taaaatagtg gcgtaaagct cctcagcagt tatggctcaa atattagatc 27361 ctctacctcc tgagcaatca ggaaaactgt tctgttgcta cgtaaatgcc acaagtaaaa 27421 tacaggtggc tcgcatctcc aatattccca actggtactt tgaacgggtt gtttttcctg 27481 gacagaggtt agtgtttgaa gccccaagac aagctcaact tgagattcac acgggcatga 27541 tggcaagcgc aattttatcg gatacaattc cgtgcgatcg ccttgcaatc agtgaaccta 27601 acagttttgt gtttgacaca gattcttcag caccagttat ggactcgatc aataaaaaac 27661 caattgtgca gattaataca ataaccggag atttcataaa atccttacaa gtcgctggtt 27721 tagtaaccat tgattaaaaa aaacaatttt gcttagttag agggttgcta aactcagcaa 27781 ccctcttttg ttttgtcttt aataatgtcc tcatcgcctc catcactgtt tctattccac 27841 tacaatgttt actatgaact taccttcgtt tctttggctg tggaaaatag cagcatggtc 27901 gatgggttta tcgctattag cttatttact attagcgata acaggtgttt ggatgtttcg 27961 taccagaaga ctacagcaag aagaacctag ctggctgcat tcccttcact acttaattgg 28021 cggttgcata gtcagtttag tgctgcttct gttactgatt ggcattattg ggactttggg 28081 tcactttggt tctttgggac actcatcaca cctgattgct ggtttgacag cagtcgtact 
28141 ggttttgctg tctgcaggga gtgcattgct gattcatcct agacgatctt gggctaaacg 28201 tatccacata ggtgctaata ttgccctgtt ttttggcttt gtttgggtat cgttaacagg 28261 ctggactgta gtacaaaagt acttaccatg atgaatcaca aatcactcga cagcttcacg 28321 attcagtttg ccaagcaata tcttgaagaa ttactcttac ctttttacca gctaatatta 28381 atttaaagta caacgatcat gatggagccg tgacagcaaa ctcagccgat acttcatctt 28441 ctgtttttcg actctctcct ttaattcgga ttacactttt gtgtctgtat gtggcactta 28501 ccacaccgtt acctttcctc tcgcaggtaa cagcagcacc tgtaccacca gcgttgttat 28561 gggtgggtat tgtcatcggt tttattggtc tgtatgcagc gttgagtgaa cgcgttattg 28621 tagatgacca gggaatccaa gtcacttatc ctcaatgggt accaggtttt ttccgcaaag 28681 gctggtcttt accttggtct gatgtaaaag agttgaaacc ccgcacgact ggtcaaggag 28741 gaatcgttta ctacttcctg tctcacgaag gaaaagctta cttactaccg atgcgtgtgg 28801 ttgggtttgc ccgtttggta aaatttgtgc aacagaagac acaggtagat acgacagatg 28861 tctatccctt agcacaacca tggatgtatt taattttact ttttttgacg ctattgctgc 28921 tattgattga tggctggaca atcgccacag cgttagcgca aggcagcata aattagagtt 28981 tcaccgagtc cgcaacgggc gtgggtgtaa gggtgtaagg gtgtaagggt gtaagggtgt 29041 aagggaaacc ccacacgatc acgcgagaat tcaaaattca acttaggcaa attcaaaaag 29101 gagagattct taattttgag attgctgagt cccaagggga cacgctgcgc gtatgcctgc 29161 tttcgc // LOCUS NODE_1008_length_28525_cov_4.83835628525 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 28525) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 28525) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..28525
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(78..1658)
                     /locus_tag="DP116_08595"
     CDS             complement(78..1658)
                     /locus_tag="DP116_08595"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017316875.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="MFS transporter"
                     /protein_id="PRJNA477356:DP116_08595"
                     /translation="MWTPLRQPVFRALWVASAVSSIGTWMHDVGASWLMTTLAPNSPL
                     LVALMQAASSLPFFLLALPAGALADVVDRRKMLLWTQGWMLVVAALLGVLTIAHITTP
                     WILLGLTFALSIGSSMNMPVWQAVTPELVSKEELPQAVTLSGIVVNLSRSIGPAVAGI
                     IIATAGTGVVFLLNAASFVSVIFVIARWERTHEKSALPTERFVGAMQAGVRYIRYAPV
                     FQSVLIRTIAYIFFASALFALLPLLGRKELGLDALGYGVVLGFWGIGGLAGAFILPKA
                     REKFSIDSLVAIASLVMAAMMLALAYFRIVPLVWGVMLLVGISSLCVMVSLTVTAQTA
                     VPSWVRARALSVQLLVFQGSMVLGSLLWGTLAQHTSISTALTTAAVGLIVCVLLTRRY
                     RLRCAEKLDLRASLHWDQPPIAFEPCPNDGPVLITLEYRIDPANAEEFTKVMQALSQI
                     RRRDGALQWGLYQDLSDPSRFVETALVESWAEHKRQFARVTNTDKVIEERVRAFHIGD
                     EPPKVLQMIYSDPNRRPC"
     gene            complement(1707..1907)
                     /locus_tag="DP116_08600"
                     /pseudo
     CDS             complement(1707..1907)
                     /locus_tag="DP116_08600"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012628063.1"
                     /note="frameshifted; Derived by automated computational
                     analysis using gene prediction method: Protein Homology."
                     /pseudo
                     /codon_start=1
                     /transl_table=11
                     /product="type II toxin-antitoxin system HicA family toxin"
     gene            complement(1904..2152)
                     /locus_tag="DP116_08605"
     CDS             complement(1904..2152)
                     /locus_tag="DP116_08605"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017316887.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_08605"
                     /translation="MTQITFKVQAFWDKDAQVWVATSEDVPGLVTEASTIEVLTQKLR
                     DLIPELILLNRIVPSDYVGSITFELISHRQELISVTNR"
     gene            complement(2211..2918)
                     /locus_tag="DP116_08610"
     CDS             complement(2211..2918)
                     /locus_tag="DP116_08610"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011318059.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="phosphorylase"
                     /protein_id="PRJNA477356:DP116_08610"
                     /translation="MLVNAILVVTGPEYKSVCKGLNRLAVPTPPVFPIPMGSSALTKH
                     LEQWLEAGHLSHHPQPRVLLMGLCGSLSTKYSVGKAVVYRSCLSPGAEEKKNAAEFLC
                     DRSLTEIVQNKLQDRVFTGIGLTSDRLIYSASEKLQLGQTYAADVVDMEGYAALEFLS
                     ELGIAVAMLRVISDDAHHNIPNLSNAFHADGSLQAFPLAMGLLRQPIAATRLVSGSLR
                     GLRILQDLTTFLFSRVG"
     gene            complement(2955..4037)
                     /locus_tag="DP116_08615"
     CDS             complement(2955..4037)
                     /locus_tag="DP116_08615"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017317049.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="4-hydroxythreonine-4-phosphate dehydrogenase
                     PdxA"
                     /protein_id="PRJNA477356:DP116_08615"
                     /translation="MYKLHEEDITSVPSRPRLALTLGDPAGIGPEVILKALADPEISK
                     SCDATVVGSRDLLLEIYTKLKLAKNSQPLADPEELSIFDVPLDKYTQDEILIGTGNAA
                     SGAASFAYMETAIAQTLANQFDGIVTGPIAKSAWKAAGYNYPGQTELLAEKSGAKRVG
                     MLFVARSPHTNWTLCTLLACTHVPLSQVPLVLTPELMTEKLDLLVECLEKDFGLQKAR
                     IAIAGLNPHSGEQGQLGHEEQDWLIPWIEKERKNRPNLQLDGPVPPDTMWVKPGQAWY
                     GNIEPSQAADAYLALYHDQGLIPVKLMAFDRAVNTSIGLPFVRTSPDHGTAFDIAGKG
                     IADATSMKAAIQLAAELVSQRRKARL"
     gene            complement(4048..4341)
                     /locus_tag="DP116_08620"
     CDS             complement(4048..4341)
                     /locus_tag="DP116_08620"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017315654.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="antibiotic biosynthesis monooxygenase"
                     /protein_id="PRJNA477356:DP116_08620"
                     /translation="MVLINVFCVPQGKENEFASMWTEALELIKNEPGFIDAKLHRSLD
                     PNAQFQFVNVAHWENQEAWKAAFDKLQLQELTKQVSFEQIPALYEVEVYLEKG"
     gene            complement(4379..5122)
                     /locus_tag="DP116_08625"
     CDS             complement(4379..5122)
                     /locus_tag="DP116_08625"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016865903.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="3-ketoacyl-ACP reductase"
                     /protein_id="PRJNA477356:DP116_08625"
                     /translation="MASLSGKVAIVTGSSRGIGRAIAERLGRDRANVVVTYAGNRDKA
                     EEVVSAIKANGSDAIALQTDLSKLDDIRALFQKTISHFGKLNILVISGGAPRLTKPLV
                     ETTEEEFDSVFTFNAKGNFFALQEAAKHMADGGRIVTFSTPYTVQPQPNLSVIAGSKA
                     AIEAFTFALALELGNRGITVNAIMPGPTTTESFGEMVSAEEQAQLKQIAPLGRLAEPK
                     DVANAVAFLVSDEAAYITGHTLHATGGLA"
     gene            complement(5236..5565)
                     /locus_tag="DP116_08630"
     CDS             complement(5236..5565)
                     /locus_tag="DP116_08630"
                     /inference="COORDINATES: protein motif:HMM:PF03992.14"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="antibiotic biosynthesis monooxygenase"
                     /protein_id="PRJNA477356:DP116_08630"
                     /translation="MVTNNSESEITLVNLFTVKPEKQQSTANQVAEIYKTVVSKQPGF
                     ICARIHKSLDGTKVAAVARWESQEALEAMQQTSDFQNAIPSLEHEIVSAEPHIYEVIC
                     ILGETGA"
     gene            complement(5572..6108)
                     /locus_tag="DP116_08635"
     CDS             complement(5572..6108)
                     /locus_tag="DP116_08635"
                     /inference="COORDINATES: protein motif:HMM:PF07883.9"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="cupin domain-containing protein"
                     /protein_id="PRJNA477356:DP116_08635"
                     /translation="MSEPISSGPSLTQALSLPPIIQSAESAPAYWFLDILWIVLVDGE
                     QTDGRYSLMEQLMPEGVGPTPHVHPFNDEGFYVIDGAMEMRVGDQTVSAAKGTSVWIP
                     RRTVHAFKVTSPTCRVLNSFAPAGMEQLIKSLARPADRRELPPKGLDTDPKKIAAFAN
                     NYWGMEIEFPVAQTTLSR"
     gene            complement(6351..7253)
                     /locus_tag="DP116_08640"
     CDS             complement(6351..7253)
                     /locus_tag="DP116_08640"
                     /inference="COORDINATES: protein motif:HMM:PF12833.5"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="AraC family transcriptional regulator"
                     /protein_id="PRJNA477356:DP116_08640"
                     /translation="MIQGNIKVIDAVSNKAVPLPVTPGTSLLATTSFQFLNELNIDRL
                     YMSACEMPEVIVREHLLTLQLSPQHLFETWEEGQLKQIHKGVGSVTLAPAGFRFRGRW
                     DRDVEILILTLKPAAIAKCAAQLHDTNQTELVRCTGRLDSQIWHLGLALEAELKEGNP
                     NGRYFWESLTNALAVRVLKQYSAKEPKIQHYCGGLSPHQLRRTIEYINDNLATHLSLN
                     VLAAMLGMSPYYFERLFKQSVGCTPHQYILQRRIERSKQLLRTTQLPIMEIAFQVGCK
                     NHSHFSKLFRKLTGMSPKTYRNSF"
     gene            7572..7685
                     /locus_tag="DP116_08645"
     CDS             7572..7685
                     /locus_tag="DP116_08645"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017317050.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="cytochrome B6"
                     /protein_id="PRJNA477356:DP116_08645"
                     /translation="MSGEMLNAALLSFGLIFVGWALGALLLKIQGGEEESL"
     gene            7829..8818
                     /locus_tag="DP116_08650"
     CDS             7829..8818
                     /locus_tag="DP116_08650"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017746367.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="3-beta hydroxysteroid dehydrogenase"
                     /protein_id="PRJNA477356:DP116_08650"
                     /translation="MTLLIIGATGTLGRQVARRAIDEGYKVRCLVRSAKKATFLKEWG
                     AQVVPGDLCYPHTLTAALEGVTAVIDASTSRPADSLSIRQVDWEGKVSLIQQSVAAGV
                     ERFIFFSILDAEKYPDVPLMEIKRCTELFLAESGLNYTILRLAGFMQGLIGQYGIPIL
                     EGQPVWVTGESSPIAYMDTQDVAKFAIRALKVPETEKQTFPVVGTRAWSAEEIIALCE
                     RLSGKEARITRMPINLLRTVRRVMRFFQWGWNVADRLAFTEVIASGKPLNAPMEEVYQ
                     VFGLDQNETATVESYLQEYFSRILKKLKELDYQKIQPKKQKPKRSPFKKTNTQ"
     gene            8942..9859
                     /locus_tag="DP116_08655"
     CDS             8942..9859
                     /locus_tag="DP116_08655"
                     /EC_number="2.7.1.23"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009459655.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="NAD(+) kinase" /protein_id="PRJNA477356:DP116_08655" /translation="MPKAGIIYNDVKPVASRIAIELKERFTAAGWDICITSAVGGILG YSTPESPVCHTPIEGLTPPGFDSDMKFAVVLGGDGTVLAASRLVAPCGIPILTINTGH MGFLTETYLNQLPQAIEKLLLGDYEIEERAMLTIKVFREDFVLWEALCLNEMVLHREP LTSMCHFEIAIGQHAAVDIAADGVIVSTPTGSTAYSLSAGGPVVTPGVPVLQLVPICP HSLASRALVFPDSEPVNIYPVNTPRLVMIVDGNGGCYVLPDDRVYVEKSPYCARFVRL QPPEFFRILREKLGWGLPHIAKPTSVELP" gene 10127..10813 /locus_tag="DP116_08660" CDS 10127..10813 /locus_tag="DP116_08660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_08660" /translation="MTLAHNPCVLLIESDENLANQLAFDLKEAGYDPIVAHDGTNGIQ YSRDREPALVVIDRMLAGESGLSLCKNFRTTGMRSPVLVLMARDTVDDRVACLDAGAD DYFLKPYRGEDFLNLVRLYLKPEVDTSEQLRFGDLVLDIATRRAILNERAIDLTMKEF DLLKYLMEHPREVLTREQILENVWGYDFMGESNVIEVYIRYLRLKIEDEGQKRLIQTV RGVGYVLRET" gene 10882..11436 /locus_tag="DP116_08665" CDS 10882..11436 /locus_tag="DP116_08665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207455.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF192 domain-containing protein" /protein_id="PRJNA477356:DP116_08665" /translation="MINWLSLLSIVLSILLAGCSVPTPAVPPTATPSSQSQAPMSSEQ KPSSKNLGQQLPISAVAVVPDGTKIELEVAQTPQQQAMGLMYRPTLPDNRGMLFEFPS PFQASFWMKNVPVPLDMVFMLDGKVQYIATSAPPCNTTPCPTYGPQTPINQVIELRSG RAAQLGLKVGSYVKIEPLKSGNIR" assembly_gap 11598..11607 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 12121..12312 /locus_tag="DP116_08670" CDS 12121..12312 /locus_tag="DP116_08670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006100242.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2949 domain-containing protein" /protein_id="PRJNA477356:DP116_08670" /translation="MSPSTYSRFISFLQEDLAISTASIAVALRRREQDPGPLPMILWQ YGLITIEQLEKIYDWLETV" gene complement(12983..13609) /locus_tag="DP116_08675" CDS complement(12983..13609) /locus_tag="DP116_08675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877579.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methyltransferase type 11" /protein_id="PRJNA477356:DP116_08675" /translation="MTETRVRQQYDQMSSVYDQRWKSYISKTLSFLKNWAQISPLDTV LDVACGTGEFERLLLSEYPTQEIVGVDISEKMLEIAKHKCNIYSQVSFQTASALALPF ASNSFDVVISANSFHYFDDPLAALREMKRVLKPEGKVVILDWCKDYFFWKIGDIVLKL FDPAYKQCYTQDEFNHFLAATNFTIRRATRVRFDIIWGLMVATATPQS" gene complement(13854..14102) /locus_tag="DP116_08680" CDS complement(13854..14102) /locus_tag="DP116_08680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858757.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08680" /translation="MEQIPLPSPVHYELILQLLERQTMIAVSNNPNLRHQVNQLIITL RKAAVQQKHLEQVCQSSFEIDHRWSLNHLKSKHIAASE" gene complement(14212..15111) /locus_tag="DP116_08685" CDS complement(14212..15111) /locus_tag="DP116_08685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998877.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="indole-3-glycerol phosphate synthase TrpC" /protein_id="PRJNA477356:DP116_08685" /translation="MQIRRRRPNPAIAVSTVQYQTVMSDAEPNNILEEIVWHKEEEVE RMREKLSLQELQRQVLEAPLSRDFVAALQQGKTKPALIAEVKKASPSKGVLREDFHPV EIALKYQKAGASCISVLTDEKFFSGSFENLAQIRAQVDLPLLCKDFVIYPYQMYMARV RGADAVLLIAAVLSDQDLQYFVKIAKALKMAALIEVHSLAELDRVLTIDGVSLVGINN RNLEDFSVDIQTTCQLLAARGKELQQKNIVVVSESGLHNCDDLTLVQQAGASAVLIGE SLVKHPDPEEAIANLFGKPLAHE" gene complement(15434..16864) /gene="lpdA" /locus_tag="DP116_08690" CDS complement(15434..16864) /gene="lpdA" /locus_tag="DP116_08690" /EC_number="1.8.1.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydrolipoyl dehydrogenase" /protein_id="PRJNA477356:DP116_08690" /translation="MSHEFDYDLVIVGAGVGGHGAALHAVSCGLKTAIIEAADMGGTC VNRGCIPSKALLAASGRVRELRDAHHLKSLGIQLGSVEFDREAIANHANNLVSKIQGD LTNSLKRLGVDIIRGWGKIAGTQKVSVTTDSGEKTITAKDIILSPGSVPFVPPGIEVD GKTVFTSDQGVKLESLPDWVAIVGSGYIGLEFSDVYSALGSEITMIEALDQLMPGFDR DIAKLAERVLITGRDIETHVGIYAKKVTPGSPVVIELANFKTKEDVEVLEVDACLVAT GRIPATQNLGLESVGVELDRRNFIPVNDSMAVLSAGEVVPHLWAIGDANGKMMLAHAA SAQGIIAVENICGRHKEVDYHSIPAAAFTHPEISYVGMTEGAAKEKGSAEGFEVATAK SYFKGNSKALAEGEADGMAKVIYRKDTGVVLGVHIFGIHASDLIHEASAAIANRQSVH TLAHLVHAHPTLSEVLDEAYKRAIAS" gene complement(17424..19766) /locus_tag="DP116_08695" CDS complement(17424..19766) /locus_tag="DP116_08695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316900.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_08695" /translation="MNTLVGWIKGLGVAVGGAIAFYANCVTAQITPDTTLPNNSRVIT QNNIKIIEGGTQAGSNLFHSFEQFSVPTGTTAYFQNATDIQNIISRVTGKSISNIDGI LKANGTANLFLINPNRIIFGPNASLNIGGSFIASTASSLNFADGTKFSATDPQTTPLL TVSVPIGLQFGATAAPIRNQSQASPDGATNIARYPVGLQVQPDKTLAIVGGDVILEGG NLTAARGRIELGSVAANSLVSLNPTNQGWSLGYEGVQNFQNIQLIQRTVNGSQIASII DASGKDGSGNIQLQGKSVELIGNVGLINITNGVKDGGDLTITTDKLIVRDGARLLTIT MGEGAGGNLTVNASKSVQVIGTIANTTIPSALSSTTFASGKAGDVNINTGRLLIQDGA EISAESSGNTRQSQFTPATGKGGNLNINASESVELIGTSAKGFPSSLLTRALGTGDAG KVIIVTEKLLVRNEAAVNVSSQLPKLPENVIYLGDTINLGKAGELNISARSILLDNQG KLTSETNSGQGGNINLEVGDLLLLRRNSQISTNAGKAQAIGDGGNIFINAPSGFIVAK PKENSDITANAFTGSGGRVQINATGIFGIAARSREDLTRQLGTNDPIKLDPQNLLTND ITAISQTNPTLNGVVNINTPEVDVKRGLINLPVELKEPRLAQGCDASVARNQSEFIIT GRGGIPSNPREVLRSNNVQVDWVSLGEDANNPLGTSSRETQRQRSRKDAQNNKDVKYP PNEIVEAQGWVVDSNGDVILVAQVPTTTPYGSWLNSRSCK" gene complement(20178..20603) /locus_tag="DP116_08700" CDS complement(20178..20603) /locus_tag="DP116_08700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015122135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08700" /translation="MRSNGTPVGENSVVKSKPINNHTINQVNGRANCKANLFNSVKQF SVPTGITTYFNNAADLHNISWVTGKSFFNILNIDGTFQTNGTANLFLFNSDRKLSHSD TFSQFLENNQTQDFLKAIPGQQLVIAQCCTRPGCLCCKC" gene complement(21239..21496) /locus_tag="DP116_08705" CDS complement(21239..21496) /locus_tag="DP116_08705" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08705" /translation="MTSEQMEQFKNEIHEELRNAVNNSKLGEVFQKYGITGNKTYQFE CILDLTKIPFQDGISNQLVKSEERVRVDCCGCEPPFFCCSC" gene complement(21647..21865) /locus_tag="DP116_08710" CDS complement(21647..21865) /locus_tag="DP116_08710" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08710" /translation="MAIRMIASLALRDRTTRKDKRMTRMNMLRVLSEKNKYFLFSDDH LFRRKGERSFQIFILKVHKLISLAPIKE" gene 22267..22575 /locus_tag="DP116_08715" CDS 22267..22575 /locus_tag="DP116_08715" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08715" /translation="MSFLANISDFYFKKIAIENVLVESEVGNCWIGLSLAAECGFMHL SYTRDDLYRWGSRLLPRSASGLGGRTFVAINAPKRSRPKGDRSWVDRDNNFHNSNFLI " gene 22639..23460 /locus_tag="DP116_08720" CDS 22639..23460 /locus_tag="DP116_08720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316897.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phytoene/squalene synthase family protein" /protein_id="PRJNA477356:DP116_08720" /translation="MDLYGDALNILKETSRTFYIPIIKLPLGLQEAVASAYLCLRAID EIEDHPELDNFTKAKLLRTISLTLQAGGDGFAIDAFYKGFHSHEHLLPEVSVRIREWA ILAPATIAPRIWDATAAMADRMAYWAQNDWKIRTESDLDRYTFGVAGAVGLMLSDLWA WYDGTQTNRIHAIGFGRGLQAVNIIRNHTEDLVRGVNFFPEGWSAEDLHHYARRNLTL ADAYISALPAGPALDFCQIPLTLAYGTLDALANGKSKLSRSDVLALLEQFTNTSK" gene 23892..25346 /gene="hpnJ" /locus_tag="DP116_08725" CDS 23892..25346 /gene="hpnJ" /locus_tag="DP116_08725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319573.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hopanoid biosynthesis associated radical SAM protein HpnJ" /protein_id="PRJNA477356:DP116_08725" /translation="MKKTLFLNPPSFDGFDGGAGSRYQAKREITSFWYPTWLAQPAAL VPGSKLVDAPPHNQTVEDVLKIAKDYELIIMHTSTPSLANDVTCALAMKEQNPNVQIG FVGAHVAVLPEETLRENSVINFVCRNEFDYTCQELAEGKPWDQIKGLSYRDKNGQLHH NAERDLIHDWDAMPSVLPVYGRDLDITKYFIGYLLHPYVSFYTGRGCPAKCSFCLWPQ TIGGHQYRTKSPEAVGREMEEAKAIFGDKVQEYMFDDDTFTIDKQRAIAISQHMKRLK LTWSCNARANLDYDTLKQLRDNGLRLLLVGFESGNQQVLDGIKKGIKLEVARKFMENC HKLGITVHGTFIIGLPNESQQTIEETIRFACDVSPHTIQVSIAAPYPGTELYQQAQTN GWFSDNSLVASSGIQMSTLQYPNLSSAQIEDAVEQMYRRFYFRPKAIIPIVGEMLTNP QMLVRRLREGREFFSYLKERHTQAAAKEQSVVSR" gene 25405..26265 /locus_tag="DP116_08730" CDS 25405..26265 /locus_tag="DP116_08730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312182.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferritin-like domain-containing protein" /protein_id="PRJNA477356:DP116_08730" /translation="MKLGSEEHKELFCRSFIKSHLEFEPEKLPWPVLDSVALERLHGI PFWREALSTERQAGAMVSTFAATISDPLLREAIALQALEETRHSRLIECLINHYNIQI SQPPEPVLPSNIKTAFIDFGFGECLDSFLAFGLFKIARQANYLPEPLFDIFDPILHEE ARHIMFFVNWVTYQQIQEGRTANWLRGVDALWHYRRALQDKIKAFSGSEEDKQEGFTA TAAGNFMDNLTPELFLSTCLQENAKRMSVFDQQLLQPQLLPTLAKIALRIIRLMPQQQ SNSAAQFSEQ" gene 26276..27154 /locus_tag="DP116_08735" CDS 26276..27154 /locus_tag="DP116_08735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010475138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hopanoid biosynthesis associated protein HpnK" /protein_id="PRJNA477356:DP116_08735" /translation="MQAQKFAIINGDDFGFSHGVNQAIIKAHKEGVLTSTSLMVTGEA FDEAVDLAHAHPTLAVGLHLVLVCGRAALPPSQIPHLVDSTGNFPYSAPISGLRYQFI QATHEELRQEIRAQLEKFRSSGLRLSHVDGHLHMHVHPVVLRILVDLADEFGIRVIRL PCEELGMSLRLDRRNLLTKLVWAGVFGGLRRYGEGLLKSKGIGFAERVYGLLQTGSVT EEYLLGLIPQIEANLVEIYCHPAVAIAGEPLNGPLGAGEAELAATLSEQVSEMLAASG FELTNFEQQRTQPLTY" gene complement(27151..28230) /locus_tag="DP116_08740" CDS complement(27151..28230) /locus_tag="DP116_08740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873107.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="undecaprenyl/decaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase" /protein_id="PRJNA477356:DP116_08740" /translation="MNIYSSLRSLGIADPSGSGWLAVVFTFLLAGTVTWRLIPAVRKF ALRVGWADQPNARRLNREPLPNAGGLAIYAGVIAAVVLASLLRPIELERVLAEVQTIL LGGSILVLVGFIDDQFGLPPLVRLLIQILTALLLVANRITIEFSFGTPIDSTLSVIIT VIWVVGITNAINLMDGMDGLAGGVSFITAISLLAVSAQVPNRAAATLVLAALAGAALG FLRHNFHPSRIIMGDAGAYFFGYVLAATSILGNLQGPTAVSLVAPVLFLLLPVLDTTQ VFIRRLMAGKNPLSTPGKDHLHHRLLAWGLSQRHAALTLWSIALICNVLAMRLQNMTL VQILMTTIGIILFLSFIVLQRIRTT" BASE COUNT 7911 a 6421 c 6121 g 8062 t 10 others ORIGIN 1 tacgagtaga cattgttgtc gggttgtcca tccttgtcta tgaaaaatga agcggcatat 61 tttgaatttt gaattgccta acatggacgc ctgtttgggt ctgaataaat catctgcaaa 121 accttaggtg gttcatcacc aatgtgaaaa gcacggactc gttcctcaat cactttatct 181 gtattcgtca cccgcgcaaa ttgccttttg tgttctgccc aagactccac aagggctgtt 241 tctacaaaac gactaggatc agacaaatct tgatacaagc cccactgaag tgcaccatca 301 cgccgacgaa tttggctaag tgcttgcatc actttagtaa attcttcggc gtttgctggg 361 tcaatacgat actctaaagt aattaaaaca ggaccatcat ttgggcaagg ttcaaaagcg 421 atgggtggct gatcccaatg cagggacgct cgcaagtcca atttttcagc acagcgaaga 481 cgataacgcc ttgtcaacag gacacacaca attaatccta cagcagcagt ggtaagagca 541 gtcgagatac ttgtatgttg agccaaggta ccccacaaaa gactacccaa caccatactt 601 ccttggaaca caagcaattg tacagataaa gctctagcac gcacccagct tggaactgct 661 gtttgggctg ttacagtcag actaaccatc acgcagaggg acgaaatgcc cactagtagc 721 atcactcccc acaccagtgg tacaatccgg aaatatgcta gcgctagcat catcgcagcc 781 attactaagg atgcgatcgc caccaggctg tcgattgaaa atttctctcg tgctttgggt 841 agaatgaacg ctcctgctaa gccaccaatg ccccaaaagc caagaacgac accatatccc 901 aacgcatcca gccccaactc tttacgcccc aacagtggta gtagggcaaa caaggcactg 961 gcaaagaaaa tgtaggctat tgtcctaatt agtactgact gaaagactgg ggcgtagcgg 1021 atgtaacgta cccctgcttg catagccccc acaaaccgtt ctgtgggtaa ggcactcttt 1081 tcatgcgtac gttcccaacg agcaatcacg aagatcacgc tcacaaatga ggcagcattt 1141 agcaggaaaa caacacccgt tcctgctgtg gcgatgataa tccctgctac 
ggctggacca 1201 attgagcgag ataaattaac aacaatacca ctgagagtga cagcttgcgg gagttcttct 1261 ttcgatacga gttcgggtgt gacagcttgc caaacaggca tattcatcga actgccgata 1321 ctcagggcaa aggtcagccc taagagtatc cacggagttg ttatgtgagc tattgtcagc 1381 acacctaata aagcagctac gactaacatc caaccctgtg tccacagcag catcttgcga 1441 cgatctacaa catcggcgag tgctcctgct ggtaaagcca gcaaaaaaaa tggtaaacta 1501 gacgccgcct gcatcagcgc cactaacaac ggtgaattgg gtgcaagagt tgtcattaac 1561 catgatgcgc cgacatcgtg catccaagtc ccaatgcttg acaccgctga tgctacccat 1621 agggcacgaa acactggctg acgcagggga gtccacatgg aaagagcaac tgctgatggt 1681 gatgtcatat aggaagtatt gactggttaa aaggctttag acagcccagc ttgtttcaaa 1741 acagcgatcg ctcgtgtggc gtgacttgat agaactatct accacaaatc ggcgatctgt 1801 aattggactg taccagattt catggtctgg tttaccttgt cgttcaaagt aacaaccagc 1861 cgattcgaaa gaatcttttt caactcagca gtaaatgacg cactcatcta ttagttaccg 1921 aaatcaactc ttgccgatga ctgattaact caaaagtgat agaacccaca tagtcagaag 1981 gcacaatcct gtttagaagg atgagttctg ggatcaaatc tcgaagcttt tgtgtaagaa 2041 cctcaattgt agaagcttca gtgactaatc ctggcacgtc ttcactcgtt gcaacccaga 2101 cttgtgcatc tttatcccaa aatgcttgaa ccttaaatgt gatttgtgtc attatcaagt 2161 gctctgtcga attgacgacg taatttgagt ttaaaacaaa cgtcaaatat ctaccctacc 2221 ctgctaaaca aaaaagttgt taaatcctgt aagatacgta atcctctaag agaaccagaa 2281 acgaggcgag ttgcagcaat tggttgtctg aggagtccca ttgctaaagg aaatgcctgc 2341 aaagaaccat cagcgtgaaa tgcattgctg aggttaggaa tattgtgatg tgcatcgtcg 2401 ctgatgactc gcaacattgc tacggcaatt cccaactcac tcaaaaattc taatgctgca 2461 tatccctcca tgtctacaac gtcagcagca tatgtttgac ctaattgaag tttttcacta 2521 gcagaataaa tcaggcgatc gctcgtcaaa ccaatacccg taaaaactct atcttgtaat 2581 ttgttttgca ctatctccgt taacgagcga tcgcacaaaa attcagcagc atttttcttt 2641 tcttctgcac caggagaaag gcaacttcga tacactacag ctttgccaac actatatttg 2701 gttgacaaac ttccacataa ccccatcaac aaaactctcg gttgcggatg atgagaaagg 2761 tgtccagctt ctaaccattg ttctaaatgt ttggttaatg ctgaagaacc cataggaatt 2821 ggaaaaacag gcggtgtggg aactgcaagg cgatttaatc ctttgcaaac agatttgtac 
2881 tctggtccag taacaactaa aatggcatta acaagcattc tcataatttt ttggcgctat 2941 tgcctacact gccattacag ccttgctttt cgcctctgag ataccaactc agccgccaac 3001 tgtatcgctg ctttcatact cgtagcatca gcaataccct tacccgcaat atcaaacgcc 3061 gtcccgtgat ccggtgaagt ccgcacgaaa ggaagaccaa tagaagtatt aactgctcta 3121 tcaaatgcca tcaacttcac aggtattaaa ccttggtcat gatacagtgc aaggtaagca 3181 tctgcagctt gtgacggctc aatgttacca taccaagctt gacccggttt gacccacatt 3241 gtatctggtg gaaccggacc gtctaattgt aaatttgggc gatttttacg ttctttctca 3301 atccagggaa ttaaccaatc ttgttcttcg tgtccgagtt gtccctgttc gccgctgtgg 3361 ggatttaaac cagcgatcgc aatcctcgct ttttgtaaac caaaatcttt ctccaaacac 3421 tccaccagca agtcaagttt ttctgtcatc aactctggtg ttagtactaa tggtacttga 3481 cttaatggca catgtgtgca agcaagcaga gtgcaaagtg tccagttagt gtggggggaa 3541 cgcgcgacaa acaacattcc cacacgcttg gcacccgact tttctgccaa aagttccgtt 3601 tgaccgggat aattatatcc tgctgctttc catgctgatt tggcgattgg acctgtgacg 3661 ataccatcaa attgatttgc gagtgtttgg gcgatcgctg tttccatata agcaaagcta 3721 gccgcaccac ttgctgcatt acctgtccca atcagaattt catcctgtgt gtacttgtct 3781 aaaggcacat caaaaattga tagttcttct ggatctgcta aaggctgaga atttttggct 3841 aactttagtt tcgtataaat ctctagtagt aaatcccggc tacctacgac cgtagcatcg 3901 caacttttgc taatttctgg gtctgctaaa gcttttaaaa taacttctgg tccaattcct 3961 gctggatctc cgagtgttaa tgctaagcgt gggcggcttg gcactgatgt tatatcttct 4021 tcatgtaatt tgtacattat tggcatatta tcccttctca agatacactt caacttcgta 4081 gagagcagga atttgctcaa aagagacttg tttggtcaat tcttgaagct gaagcttatc 4141 gaatgctgcc ttccaagcct cctgattctc ccaatgagca acgttgacaa attgaaactg 4201 ggcatttgga tcaagacttc gatgcagttt tgcatcaatg aaacctggct catttttgat 4261 caactcaagc gcctcagtcc acatactggc aaattcgttc tccttgcctt gaggaacaca 4321 gaagacatta attagcacaa tcggagactt catttccaac ctcgaatgat ttcttaacct 4381 aagccaaccc acctgtggcg tgaagtgtat gtcctgtgat ataagccgcc tcatcactaa 4441 ctaggaatgc aaccgcgtta gcaacatcct tcggttctgc caatcgtccc agtggtgcaa 4501 tttgtttcaa ctgggcttgc tcttcggctg aaaccatctc accaaatgat tcagtggtgg 4561 
ttggtcccgg cataatggcg ttgacggtaa ttcctcgatt gcctaattct aaagctaacg 4621 caaaagtaaa cgcttcgatc gcagctttgc tacccgcaat cacagataag ttaggttgag 4681 gctgtaccgt atacggagta gaaaaagtga caatgcgccc accgtcagcc atgtgtttag 4741 cggcttcctg gagggcaaag aagttaccct tggcattaaa cgtgaaaaca gaatcaaatt 4801 cttcttcggt ggtttcaacg agtggttttg taagtctggg cgctcctccg ctgatgacga 4861 ggatgttcag tttaccaaaa tggctgatcg tcttctggaa taaagctcga atgtcatcca 4921 gtttggaaag atctgtttgg agagcgatcg catctgaacc atttgcttta atagctgaga 4981 caacttcctc tgccttatct cgatttccag cataggtgac aactacgttt gcgcgatcgc 5041 gtccgagtcg ttcagcaata gcccgtccaa taccacgcga tgaaccagtc acgattgcga 5101 ctttccctga aagagatgcc ataaatattg cttcctattt attactaagc atccgtagat 5161 gcaaaaaccg gagttgtatc tgtaaggcgc tatacccctt aacatcttgt ttggcttgct 5221 cgaaatcgaa taggttcatg cgccagtttc tcccaaaatg caaatcacct catagatgtg 5281 aggttcagcg ctgacaatct catgctccaa gcttgggata gcgttttgaa agtcggatgt 5341 ctgttgcatt gcctctagtg cttcttgtga ttcccaacga gcaactgcag caactttagt 5401 gccatccaaa ctcttgtgaa tcctggcaca gataaagcca ggttgtttac tgacgactgt 5461 tttataaatc tctgctactt gatttgcagt actttgttgc ttttctggtt tcaccgtgaa 5521 tagattcacc aaggttattt cggactcaga gttgttcgtc acgatccatc ctcagcgtga 5581 aagagtggtt tgtgcaactg gaaattcgat ttccataccc cagtaattgt tggcaaacgc 5641 agcaatcttc ttgggatcag tatcaagacc tttgggcggt agttcacggc gatcggcagg 5701 tcgagccaaa ctcttgatta gttgttccat tccagctggg gcgaagctat tgagcactcg 5761 acaagtcgga ctcgtaacct tgaaagcatg caccgttctc cttggaatcc aaaccgaggt 5821 gcctttagca gcacttacag tctgatcgcc aaccctcatt tccattgctc catcaatcac 5881 gtaaaatcct tcgtcgttaa atggatgaac gtgcggtgta ggaccgacac cttctggcat 5941 caattgttcc ataagtgagt atcgaccgtc tgtttgctca ccatctacaa gcacaatcca 6001 aaggatgtct agaaaccagt aggcgggagc gctttcagca ctttgaatga tcggtggtag 6061 gctgagtgcc tgagtcaaac ttggcccaga cgaaattggt tcactcataa atttttctcc 6121 aaatattcat aaataggcta gagcttaccc tgtgactggc ttgccgtcga tttgtaaaag 6181 tcagaacacc tgtaaaactg tatcagtgtt ctgctcctgt tgacttggca aatcttgtga 6241 aactgttcta 
aatttgcaaa atcttttgtc actcaaaaaa agttgtctcg tttatcacta 6301 aggaacaacg aaatcaggaa acgaagactt tttctcagct acctagactt tcaaaatgaa 6361 ttgcgataag ttttaggaga cattcctgtc aacttccgaa acagttttga gaaatggctg 6421 tggtttttgc agccaacttg gaatgcgatt tccatgattg gtagctgggt tgttcgcaac 6481 agttgcttgc ttctttctat tcgacgttgc agaatgtatt ggtgaggcgt gcatccaacg 6541 gattgtttga ataatcgctc gaagtaatat ggactcatac caagcatcgc cgcgagaaca 6601 ttcaacgaga gatgggtagc gaggttatcg ttaatgtact caattgtgcg tctcaattgg 6661 tgaggtgata aaccaccaca gtagtgttga atttttggct ctttggctga atactgcttg 6721 agtactctca cggctaaagc attcgtcaga gattcccaaa agtatcgacc attaggattt 6781 ccctctttta attcagcttc taaggctaac cccagatgcc aaatctgcga gtcaagtctg 6841 ccagtacaac ggacgagttc agtttgattg gtatcatgta attgggctgc acactttgcg 6901 atcgcagcag gtttaagcgt cagaatcaaa atctcaacat cgcgatccca tctaccgcga 6961 aacctgaacc cagccggggc tagtgtaaca ctcccaaccc ctttatgaat ctgtttgagt 7021 tgtccctctt cccaagtttc aaataaatgc tgcggactca gttgcagcgt cagcagatgc 7081 tcacgaacaa tcacttcagg catttcacag gctgacatat agaggcgatc gatgtttaat 7141 tcatttaaga attgaaacga ggttgtggct agcaatgaag ttcctggggt gacaggtaag 7201 gggacagctt tgttgcttac agcatcaatg actttgatgt ttccctgaat catgtttaga 7261 agcgcctgaa tgagaaacca aaatttctga tttcaagaat accaggatca ttaggttcta 7321 cccgcagttc tggaggcaat aaggacgcgc ttgggcgatt gcgcaaagca agagacgctg 7381 cgccgcgcgt atcgccagac cccactcaag gacttccaaa taaaaaaata ttcaatcgga 7441 ggcagacaag gaggacgacg acaggtgtca gcaactttac aatgttttac actagtctaa 7501 tactttggca cctctttcag gctctggtaa acttaactta attaaacaat ttgcaaagga 7561 gaattgcagc aatgagcggc gaaatgttaa atgcagcact gttgtctttc ggtttgatct 7621 ttgtaggctg ggctttaggc gctttgttgc tgaaaattca gggtggagaa gaggaatcgc 7681 tgtaaaaaag catcgcgcta gagacaagag acaactttaa agtctctttg tctcgtatac 7741 gcgctttaaa ggcgtgtaaa cttttgtttg catagtcaag tctgataaga ttttattcat 7801 aaaagtttaa attttgtaaa aagctctcat gacattatta ataattggtg ccactggtac 7861 cttagggaga caagtggctc gtcgtgcgat cgatgagggt tataaagtcc gctgtcttgt 7921 tcggagtgca aaaaaagcta 
cgtttttaaa agagtggggg gcccaagtcg taccaggaga 7981 tttatgctac cctcatacac tgacagcagc actggaaggt gttacagcag ttattgatgc 8041 atcaacatct cgtcctgccg attctcttag tatcagacaa gtagattggg aaggcaaagt 8101 ctctttgatt caacagtctg ttgctgcggg tgtagaacgt tttatctttt tctccatact 8161 ggatgctgaa aaatatccag acgtaccgct gatggagatt aaacgatgta cagaactctt 8221 tttggctgaa tctggtttaa attacaccat attgcgactg gctggcttca tgcaaggctt 8281 aattggtcaa tatggaattc ccatattgga aggacagcct gtttgggtta caggagagtc 8341 gtctcccatc gcctacatgg acactcagga tgttgctaaa tttgccatcc gcgctttgaa 8401 ggtaccagaa accgaaaagc aaacttttcc agtggtggga actcgtgctt ggagtgcgga 8461 ggaaattatc gccttgtgcg aacgcttatc tggaaaggaa gcgcggatta cacggatgcc 8521 aattaattta ctacgtaccg tgcgtcgagt catgcgcttc tttcaatggg gatggaacgt 8581 agcagacaga cttgctttta cagaagtcat cgccagtggg aaaccgctaa atgctccaat 8641 ggaggaagtt tatcaggttt ttgggttaga tcaaaacgaa accgcaaccg tagaaagcta 8701 cctgcaagag tacttcagcc gaattttgaa gaagctcaaa gagttagact accaaaagat 8761 tcagcccaaa aagcaaaaac caaaaagatc tccgtttaag aaaaccaaca ctcagtagct 8821 atcagtcata agtcatgagt caatcgttat gacacaaaga caaaagacta atgactaaat 8881 agcccaaaat gtgcaacgat taacataata aagtgcgtca ccaataattg gatatttcag 8941 tgtgcccaaa gcaggcatta tctacaatga cgttaagccg gtagcaagtc gaatcgctat 9001 cgaactgaaa gagaggttca ccgctgcagg ttgggatatc tgtatcacat cagctgttgg 9061 cggaatattg ggttattcta ctccggaaag tcccgtatgc cacactccaa tagagggact 9121 aacacctccc ggatttgatt cagatatgaa atttgcggtg gtgttagggg gagacggtac 9181 agttctagcc gcttctcgct tggtagctcc ttgtggtatc ccgattttaa cgatcaatac 9241 tggtcacatg gggtttttaa cggaaactta tctcaatcaa ttaccccaag caatagaaaa 9301 gttgcttctg ggtgattatg aaattgagga aagagcgatg cttactatca aagtttttcg 9361 ggaagatttt gtcctgtggg aagctctgtg cttaaatgaa atggtgctgc atcgagaacc 9421 attgacctct atgtgtcatt ttgaaatagc aattggtcaa catgcagcag tagacattgc 9481 agctgacggt gtgatagtat ctacgccgac aggttctaca gcatattcgt tgagtgctgg 9541 tggtccagtc gtgactccgg gtgtacccgt actgcagctt gtacccattt gtcctcattc 9601 tctggcttct agagctttgg tatttccaga 
tagtgaacca gtgaatatct acccggtgaa 9661 tactcctcga ctggtgatga ttgtggatgg taacgggggg tgctatgttc tgccagacga 9721 tagagtttat gtagagaagt caccctattg cgcccggttt gttcgcctcc aaccgccaga 9781 atttttccgg attttacgag aaaaactggg ttggggttta ccacatatcg ctaagcctac 9841 ttctgtagaa ttaccctagg ggaatttggg attttggatg agtgagtggt tggttgtcca 9901 aaaggacaat ccatcagtct gctcacaagc aaggccagct agattttaaa ttagtgatta 9961 aaatgactgt atctgtcgat ttgtatggcg gatacttgta tgtgctgaaa catggtcggt 10021 agaacgcgtg tcaagtcaag atgaattgat cacctaaaat ccgtaaactg cacttctacc 10081 aacgaaatca acatcctaaa tctaaaatcg aacatccaaa atcaatatga cgcttgctca 10141 taacccttgt gtattgctaa ttgaaagcga tgaaaacctt gcaaatcagc ttgctttcga 10201 tttaaaagaa gctggttatg atcccatcgt ggctcatgat gggacaaatg gtatacaata 10261 tagtcgcgat cgcgaacccg ctttagttgt cattgaccga atgctcgcag gagaatcagg 10321 actctcgctg tgtaaaaatt ttagaacgac tggtatgcga tcgcctgtgc tagtgctaat 10381 ggcacgcgat acagtagatg atcgcgtagc ttgcttagat gctggagctg atgattactt 10441 tctcaagcct taccggggtg aagatttttt gaacctggtt cgcttatact taaaacctga 10501 agtcgatact tccgagcagt tacgttttgg tgatctcgtt ttagacatag caacccgccg 10561 cgccatactt aatgaacggg cgattgactt aacaatgaag gaatttgacc tccttaagta 10621 tttgatggaa catccccgtg aggtattaac ccgcgaacaa atactggaaa atgtttgggg 10681 ttacgacttt atgggcgagt cgaatgtgat tgaagtctat attcgctact tgcgactcaa 10741 aatagaagac gaaggtcaaa agcgactcat tcaaacggtg cgaggtgtag gctatgtatt 10801 gagagaaact tgaacacaag cggggagagt cgagaaagtg ggggtgaatc acaaaacaaa 10861 tagctaatga ctaaaatttc tatgattaat tggctaagtt tgctatcaat agttttgagc 10921 attttgctag cgggctgttc tgttcctaca ccagcagttc ctcccactgc tacgcctagt 10981 tctcaatccc aagcaccaat gtcttcagaa caaaaaccat ctagtaagaa tttaggtcaa 11041 caactaccaa tttcagctgt agcagttgtt cccgatggta caaaaattga gttagaagtc 11101 gcacaaacac cccaacagca agcgatgggg ttgatgtatc gcccaacttt gcccgacaac 11161 cgtggtatgc tgtttgagtt tccatcacca ttccaagcca gtttctggat gaagaatgta 11221 ccagtaccac tggatatggt ttttatgttg gatgggaagg tgcagtacat tgcgacttca 11281 gcacctcctt gcaataccac 
cccttgtcct acttatggtc cccagacacc aattaatcaa 11341 gttattgaac tgcgttcggg gcgagcagca cagttaggct tgaaggttgg ctcttatgtc 11401 aaaattgagc ctttgaaatc agggaatata cggtaataaa aattgctctc ttgctcacgg 11461 agtcttacac ttagtatttc tcttgtcaca ttcttgtaaa gatttattac atttttcggc 11521 aaaaaataaa tttaagagat tttttaaatg gatttaactc gcccaccaaa ttatcaaatt 11581 ggtggtctac aatgagtnnn nnnnnnnaac tcaatacaca ttagacctct tgcaaaagtg 11641 agattttacg aggttctgtt aagagttccc tgttcagagt tccctgttca gagttccctg 11701 ttccctacaa ctcccacgaa gtctattcag aaaagcctta ttgttccaag actaggtaac 11761 taattttatc ctagactaca aaaaatcttc cttttgacat acgtgtaaag agtagtcaat 11821 aagggaatat taggctaagg aatcttcctt actacagttg tagctaacta gagcactgaa 11881 gtaaaaaaga gaaagtcata aattcattta aatattgtgt tgcttaagac taaatcagat 11941 acgtagtatt actgtctatt ttttaataaa aaaacttctt tgagcaacac cagacaaact 12001 ttcctgagtt atctattaac caaaggcaca gtgaaaggag aattattggc tacttacaac 12061 taatgagaag tggtgaattt aaacataaaa cacacaccat gtgaggaggt ccaatagaaa 12121 atgtcaccat caacttattc tagatttatt agcttcttac aggaagattt ggcaatttcc 12181 acagcttcca ttgcagttgc tttgcgtcgt cgtgagcaag atccaggtcc tttaccaatg 12241 attctttggc aatatggttt aattaccata gaacagttag aaaaaatata cgattggctg 12301 gagacagtat aggaagtaat gacctcgtgt aggaatgtgg aactgtgctt agatatagag 12361 caagcttttt accgaaaaat tgggtttaaa gccccgtcct tctaggacgg cttttaaatt 12421 tgggaacaaa aacgcccccg ttatgcgaca ataaaatata gcggtaaggc gcacataaaa 12481 aagacgaagt aggtcgagcg gacaactcct gagccaccaa gcctggtagt ataaatccta 12541 gatcttgcac gccacttgct ttatgccggg gaaccctttc ggcagttcct cctggggagc 12601 cagtactgaa ggagggtttc cctccgtagg tatctggcgt tggaaacccc caagaccgga 12661 ctgcctcacc accgcagtgg ctcacaaatg tgggagtaaa agtaagaaat taaataaacc 12721 ttttcaaaag tggtcgggca gtccgtttac gcttgtggac gtatccaaac ggggatcttg 12781 aatgcttgca tttgggtatc tagagaggac ggttgttcgc gcagcgtgcc cgttcgccgt 12841 aaggcgtgcc gcaggcatag ggcttagcaa gaatcctcag catgacggcc caggagtgtc 12901 aattgtcaag catagcgagt ttataattca actcgctttt ttgattgtag agttctgagt 12961 
tcttctttta actcacattt gtttaagatt gcggagtcgc tgtagccacc atcagccccc 13021 aaatgatatc gaaacgaact ctagttgcac ggcgaatcgt aaaatttgta gctgcaagaa 13081 agtgattaaa ctcgtcttgg gtataacatt gtttataagc tgggtcaaat aactttaata 13141 caatatcacc aattttccaa aaaaaataat ctttgcacca atccagaatg acaactttgc 13201 cctcaggttt caagacacgt ttcatttccc ttaatgcagc caatggatcg tcaaagtaat 13261 ggaacgaatt ggcagatata acaacatcaa aactgttact agcaaatggt agcgctaacg 13321 cactcgcggt ttgaaacgag acttgagaat aaatgttaca tttgtgtttg gcaatctcca 13381 gcatcttctc tgaaatgtct actccaacaa tctcttgcgt tggatattca ctcagtagca 13441 gtcgttcaaa ctcaccggtg ccacaagcaa catcaagtac agtatccagt ggtgagattt 13501 gcgcccagtt cttaagaaaa gatagtgtct tggagatgta gcttttccaa cgttgatcat 13561 acacagatga catctgatca tattgctgac gaactcttgt ttctgtcatt ttgtgcttcc 13621 tgtgatcatg acttataact attgcccaaa cacacgagtt gctgctacca cagatgaaaa 13681 atctaggttt taagagttaa gtaaaaggta aagcgagtat cgggagttgg gatttcagaa 13741 aatactgacc aaatctcgac tcgctactca agatttccta cttttgatag ttgccttcat 13801 aactgataag gtcacgaggt tcgtcaaagc gcgaaagtaa atcccttgct tgattattca 13861 gaagcagcaa tatgcttact tttgagatga ttcagcgacc aacgatggtc aatctcaaaa 13921 gacgattggc aaacttgttc caaatgcttt tgttgtacag ccgctttacg cagggtaata 13981 atgagctgat ttacctgatg ccgcaaattc ggattattgc taaccgctat catagtttgt 14041 ctctctaaca gttgcagtat aagttcgtag tgaacaggtg aaggcagagg aatttgctcc 14101 atgaaattaa tatttgattt cagattttgg acaactcaga agttgcttgt cattaaatga 14161 attgttacac aagcatcaac gaattgccta aagcgagacc taaagctaga attattcatg 14221 ggctaaaggt ttaccaaaga ggttggcgat cgcttcctct ggatctgggt gtttgactaa 14281 agactctcca atcagaacag ctgatgctcc tgcttgttgt actaaagtca gatcatcgca 14341 gttatgtaat cctgactcac tcacaaccac aatattcttt tgctgtaatt ccttacccct 14401 tgctgctaag agttggcaag tggtttgtat atcaacagag aaatcttcca agttgcgatt 14461 atttatacct accagtgaga caccatctat ggttaacacg cgatcaagtt ctgctaaact 14521 atgaacttca atcaatgccg ccattttgag agctttggca attttaacga agtactgcaa 14581 atcttgatcg ctgagtacag ccgcaatcaa taataccgca tctgcacctc 
gaacacgcgc 14641 catgtacatt tggtaaggat atataacaaa atccttgcat aataaaggta aatctacttg 14701 agcgcgtatt tgagctaaat tttcaaagct accagagaaa aacttttcat ccgtaagtac 14761 cgaaatacag ctagcacccg ctttttgata ctttaaggca atctccaccg gatgaaaatc 14821 ttctcgtaaa actcctttac tgggagacgc ttttttaact tcagcaatca atgctggctt 14881 tgttttgcct tgttgcaatg ctgctacaaa atcacgggat agaggtgctt ccagtacctg 14941 acgctgcaac tcctgcaaag aaagcttttc ccgcattcgc tcaacttctt cttctttatg 15001 ccaaacaatt tcttctaaaa tgttgtttgg ttcggcatca gacatcacgg tttgatactg 15061 cactgtggat acagcaatag ctgggttagg tctacggcga cggatttgca taattagtca 15121 ttagtcatta gtcattaatc attagtcatt agtcattaat catgagtgat atcaattccg 15181 tttgcaatgg ctatggctga cgccacgggg ctgctataac gcgacgcact cacgtacaaa 15241 atgattgctc tgttctcttc tgttctcttc ctctgcgtcc tctgcgcctc tgcggttttt 15301 tttcatcatt gaaattccat ttgtcaagta caaaggacac agaaaatcgc agatctgcca 15361 aatcatgatt gtacagtgcg ggcatcttac tcgtgtaccg caccgagcaa gatactcccc 15421 aaaggacaaa tgactaacta gcaattgctc gtttgtaagc ttcgtccagc acttcagaaa 15481 gtgttggatg ggcgtgaacc aaatgagcaa gggtgtggac agattggcgg ttggcgatcg 15541 ccgctgacgc ttcatgaatt aagtctgaag catgtatccc aaagatatga acgcccaaaa 15601 caacgcctgt gtctttgcga taaatgactt ttgccattcc gtctgcttca ccttctgcca 15661 aagctttaga gtttcctttg aagtaacttt ttgccgtcgc gacttcaaag ccctcagcac 15721 tacctttttc cttcgctgcg ccttccgtca tacccacata actgatttct ggatgagtaa 15781 atgctgctgc agggatactg tgatagtcta cttccttgtg ccgcccacag atattttcta 15841 ctgcgatgat accttgagca gaagccgcgt gtgctaacat catcttgccg ttagcgtcac 15901 caatcgccca aagatgaggg acgacttcac ctgcagatag aaccgccatg ctgtcgttga 15961 ctgggataaa gtttcgacgg tcaagttcta caccaacaga ttctaaaccc aggttctgcg 16021 tcgctggaat ccttcctgtt gcaaccagac atgcatcgac ttccagcacc tccacgtctt 16081 ctttagtttt gaagttcgct aattcaatga cgacaggtga accaggagtg acttttttgg 16141 cgtatatccc cacatgagtt tcaatatcac gcccagtgat gagaacccgt tcagcaagtt 16201 tagcaatatc gcggtcaaat cctggcatca actggtctag ggcttctatc atcgtgattt 16261 cactgcccaa agctgagtaa acatcggaaa 
attctaagcc gatgtaacca ctgccaacaa 16321 ttgctaccca atctgggagt gattctagtt tcacgccttg gtcgctagta aacacagttt 16381 tgccatcaac ttcaatccct ggaggaacga agggtaccga accaggggaa agaataatat 16441 cttttgctgt aatggttttt tcgccgctgt ctgtggttac agagactttt tgtgtccctg 16501 ctatcttacc ccaacctcgg atgatatcga ctcccaagcg tttgaggcta ttggttaaat 16561 cgccttgtat tttggatacg agattgtttg catggttggc gatcgcttct cgatcaaatt 16621 ccacacttcc cagttgaatt cccagcgact tgagatgatg ggcatctctt aactctcgca 16681 cacgtccaga tgccgccagc agtgctttag atggaatgca gcctcggttg acacaagttc 16741 ctcccatatc agctgcttcg ataatggctg ttttcaaacc acagctaacg gcgtgtaagg 16801 cagccccatg tccgcctact ccagcgccta caatgactaa atcgtaatca aattcatgac 16861 tcacgttagt ttccccgtgt gcttccgcct atttattctg agattgacca agcaatcagt 16921 gcaagtttct taattttagg ctgttgtcat ctcaaatttc tgatttctgt taattcaact 16981 cggggatttg acatcctcac cgccctagaa gtgcggtgat tcctcaccac gccagcaacc 17041 tactcgtggt aggataactg gtcgctgaat ggggttgacg cttcattgag aacagcctgg 17101 aaccaggtct tacactctct ccacatccgt tttgagtctc ggaatgccct tccgcgacta 17161 accgacaatt gaactattga attttgtctt tttcctttgg gttggattta tgtttactac 17221 cattgcccat cagtctttct ctacgcctgc tctcatcgta gctgtttcaa cgcgtcttga 17281 gcctagaggg ggattgctaa ttcctttgta tattttagcg caaaagccgc cctagaagtg 17341 cggggcttgt atcccattat ttttggtcac gcgatattgc ttaaatttca cccatccaat 17401 cacaggtgat agtctttgtg ctttcatttg caagatctag aattcaacca agaaccataa 17461 ggtgtcgtgg taggtacttg ggcgaccaaa atcacatcgc cattactatc caccacccac 17521 ccctgtgctt ctacaatctc attcggtgga tatttgacat ctttattgtt ttgtgcatct 17581 tttctacttc tctgtctttg tgtctctctg cttgaagtac ccaaaggatt atttgcatcc 17641 tctcccaaag acacccaatc tacctgtaca ttattgcttc taagaacttc cctagggtta 17701 gatggtatgc caccgcgtcc ggtgatgatg aattcacttt gatttcgcgc gacacttgca 17761 tcacatcctt gtgccaatct tggttccttt agttccacgg gcaagttgat taatccacgt 17821 ttgacatcaa cctcgggtgt attgatattt actacgccgt ttaaagtcgg attagtttga 17881 gaaattgccg tgatatcgtt tgtgagtagg ttttgcgggt ctagtttgat tggatcatta 17941 gttcccaatt 
gtcttgttaa gtcttctcga ctacgcgctg ctatgccaaa aataccagta 18001 gcattgattt gaactctgcc gccagaacct gtgaaagcat tggctgtgat gtcgctgttt 18061 tcttttggtt tggcgacgat gaagcccgac ggagcattga taaagatatt accgccatcg 18121 ccaatagcct gcgccttgcc tgcattcgtg gatatttgac tgttacggcg tagcaataat 18181 aagtctccca cttctaggtt aatatttccg ccttgacctg agttcgtttc agatgtgagt 18241 tttccttggt tgtccagaag gatagagcga gcagagatat tgagttcgcc tgcttttcct 18301 aggttgattg tatctcctaa gtagattaca ttctctggca gctttggcaa ttgactgctc 18361 acgtttactg cagcctcgtt tcggacaaga agtttctctg tgacgattat cacttttcca 18421 gcatctccgg tacctaaagc cctagttaac aagctgctgg gaaaaccttt tgctgaagtt 18481 ccaattagct ctacggactc ggaggcattt atgttcaaat tccctccttt tcctgttgct 18541 ggtgtaaatt gtgattgccg tgtatttcca ctagattctg ctgagatctc tgccccatct 18601 tggataagca gcctgccagt gttgatattt acgtcacccg cctttccgct tgcgaaggtt 18661 gtactagaca atgcactagg gatagtagta tttgctatgg taccaatcac ctgcacagac 18721 ttggaggcgt tcacagtcaa atttccccct gcaccctcac ccattgtgat agttaataat 18781 cgtgcgccat cccgaacaat taatttatca gtggtaatcg ttaaatctcc gccatctttg 18841 acaccattag ttatatttat caatcccaca ttaccaatga gttccacaga cttgccttgc 18901 agttggatgt taccactgcc atctttgcca ctagcatcta taatagatgc aatttgagaa 18961 ccattaactg ttcgctggat gagttggatg ttctgaaaat tctggacacc ctcatatccc 19021 aaactccaac cttggtttgt tgggttcaga ctgaccaaac tattggcagc aacacttcct 19081 agctcaattc ttccccttgc tgccgttaga tttccgccct ctaatatcac atcaccgcct 19141 acaattgcca aggttttatc tggctgtacc tgtaagccaa cggggtaacg ggcgatatta 19201 gttgcgccat ccggacttgc ttgggattga ttgcggatgg gtgctgcagt tgctccaaat 19261 tgtaagccaa taggaacact gactgtcagc agaggtgtgg tttgaggatc tgtggcacta 19321 aacttggtac catcggcaaa gttaagacta cttgccgtac tcgctataaa tgaaccgcca 19381 atattcaaag aagcattcgg accaaaaata attctgttgg gatttattag aaacaggttg 19441 gctgtgccgt tagctttaag aatgccatca atattagaga tagacttacc tgttacccgg 19501 ctaatgatgt tctgaatatc tgtagcattt tgaaagtaag ctgtagtccc agtaggtaca 19561 gaaaattgct caaaactgtg gaatagattg cttcctgctt gagttccacc ttcaataatt 
19621 ttgatgttgt tctgcgttat aacgcgagag ttattaggta gagtcgtatc tggggtgatt 19681 tgggcagtaa cacagttagc atagaaagct attgcaccac ctactgcaac tcctaatcct 19741 ttaatccagc ctacaagagt attcatccca gatattacaa cccccaagca aactagctat 19801 gactaagcaa cagttacgtt gcacttaata ttcgctagat tacacgactt tggtgacctg 19861 tgtcttcctt tttaccaaag attagctagg agcgttgctg aatttggata tgaaatgcgg 19921 tcttgtaatg ttgtgagtgc tcttccatca ctttcatgag agtgctagaa gagccttaag 19981 taaagcaaca aaattaaacg ttaaatgctt caaaccaact tttttaccta ttttcttcgt 20041 tgtatcttac gttttttaag gttgacctga cttgaaaata gggaacacaa aacaggaaac 20101 aggacacagg tcttttcatg tatcctgttt caaccattaa tgacggcaaa tcaacataat 20161 aattcgggtg cagtgtacta gcatttacaa cacaaacatc caggtctagt gcagcattgt 20221 gctatgacta actgctgacc tggaatggct tttaaaaaat cctgagtttg gttattttcg 20281 aggaattgac taaaagtatc actatgtgaa agttttctat cggaattaaa tagaaacaga 20341 ttggcggtgc cgttagtttg aaatgtgcca tcaatattca aaatattaaa aaatgacttg 20401 cccgtcaccc aactgatatt gtgaagatca gcagcgttat tgaagtaagt cgttatacca 20461 gtgggcacag aaaactgttt cacagaattg aataggttcg ctttgcaatt tgctctaccg 20521 ttaacctggt taatagtgtg gttattgatc ggtttggatt taactacgga attttcacct 20581 actggagtgc cattgcttct catctcactc agagcataat ttccaccaaa gacaagtacc 20641 ccaccaattg ctaaaacact cgctagccct aatttccagc aattaccgct tcggctttga 20701 gacatttttg ctgacctcaa atgtatgcat tacacacccc acttacctgc aatttatatt 20761 gcaggctaag tagaagtgct ctttataagt agactacatt aacgtcagta atccgcttta 20821 cactccttag caaaacttaa tgaatagtga tattgtcaat tgaattgctg aaaagctagc 20881 taattaagaa tgagaacgaa atttcatgaa attgatagga aaatggactt tttgaggtga 20941 tttgattcat tgtactctta ataagttgag atgatagtaa ataaaccacg ggggtagggg 21001 caaaggctag cctttgcccc tacgaaaatc tgcgttttgc tctttatcaa agcttaatgg 21061 ctagcagtac tgtcaattca actgctgaaa aggcaggttg tactaagaat gacgatcaaa 21121 tttcaagaaa tttctaacct aagaatcatt tggctagtgt aacttgatga attgtcttct 21181 taagaagttg aaactatagc tattaaattg attaaatgga caataccgta acggcttact 21241 agcaactaca acaaaaaaac ggtggttcac acccgcagca 
gtctactctc actcgctcct 21301 cactcttaac taactggtta ctaataccat cctgaaatgg gattttagtt agatcgagta 21361 tgcactcaaa ttgatacgtt ttgttgcctg ttatgccata tttttgaaat acctcgccta 21421 acttcgagtt attcacagca ttcctcagtt cctcatgaat ctcgtttttg aactgttcca 21481 tttgttcaga agtcataact attttctcct tgcaagctag tagattgata aacggaatat 21541 tgcactaatg acagtacgtc gtactatcca ttaacagata gctaatatcc aaaaggaaag 21601 tcgttaacac taaacgcgtt tgcgctttcg gtttaccgtc ttccttctac tccttaatag 21661 gtgctagtga aattaactta tgcactttta aaataaaaat ttgaaacgac ctttcccctt 21721 tacgtctaaa taggtggtcg tcagaaaaca gaaaatactt atttttctcc gaaagaactc 21781 gcagcatatt cattcgagtc atacgtttat ctttccttgt ggtgcgatcg cgcaaagcaa 21841 gagacgcaat catgcgtatc gccatattcc tcttccttaa gaagccacag tcggctggtc 21901 tttaatgact tgctcgctac acgactgcta tatcttttaa agtactgaac ttctatctgg 21961 ctatgccttg ctttggatac aacaacgaca agaccagtaa aagcgatcgc aatcgccaaa 22021 aaactcaacc cttctggtgt ctgagtcata agctggtgtg tactgggaac tttcacctat 22081 tactatcgtc aattgacacc aatgcaggtc tatagcattt tgcgatatcg cacacagagg 22141 agaggagctt gcacaagatt aaccctattg ttttagtgtt tatttacgcg gtcattcata 22201 aactcaggag tcagagccaa gaagcatata tgatccataa ctaacaagtc cggcaggtgg 22261 aaactgatgt cattcttggc gaacatttct gatttttatt ttaagaaaat agctattgag 22321 aatgtgttgg ttgagtcaga agttggtaac tgttggattg ggttgagttt ggcagccgag 22381 tgtggattca tgcatttgtc atataccagg gatgatctgt atcgctgggg ttcccgcttg 22441 cttccccgaa gtgcttcggg cttagggggt aggacgttcg tcgcgatcaa tgcgcccaaa 22501 aggagccgcc caaagggcga tcgcagttgg gtagacaggg acaataattt ccacaattcc 22561 aactttttga tttgagtggt gactttttcg ccaacaatat taggcaaaat gagtaatacc 22621 caatggagct gcaatcttat ggacttgtat ggagatgcct taaacatcct caaggaaacg 22681 agtcggactt tctacattcc aatcatcaag ttaccactgg gcttgcaaga agcagtcgca 22741 tcagcgtact tgtgtttgcg agccattgat gaaattgagg atcatccaga actagataac 22801 tttactaagg caaagctgtt acgaacaatt agcttgacat tacaggcagg gggtgatggc 22861 tttgccatcg atgctttcta caaaggattt cactcacatg aacatctcct acccgaagtt 22921 tctgtgcgga ttcgagaatg 
ggcaattctt gcacctgcaa ccattgctcc tcggatttgg 22981 gatgcgactg cggcaatggc agatagaatg gcttactggg cccaaaacga ttggaaaatc 23041 cgtaccgagt ccgatttaga tcgttacacg tttggagttg ctggtgccgt tggcttgatg 23101 ctctcagatt tatgggcttg gtacgatggg acacagacaa accgtattca tgcaattggg 23161 tttggtcgtg gtttacaggc tgtgaatatc atccggaacc acactgagga cttagtgcgt 23221 ggagtgaact tttttccaga ggggtggagt gcggaagatt tgcatcatta tgcccgtcgc 23281 aatttgacgc tagcagatgc atacattagc gctcttcctg ctggtcccgc tttggacttt 23341 tgtcaaattc ccttaacctt agcttatggc accctggatg ctcttgccaa cggcaaaagt 23401 aaactcagcc gcagtgatgt tcttgcacta ctggagcaat ttactaacac cagtaaataa 23461 aacacctgtt gtcaagcttt tccattatgg gaaagtccca cgcgtgttca catccaccca 23521 tacctcaata aactatcaac tatgtttatt taggtctgac taagcgtcca caggttttac 23581 tcaacactgc tcctcgcact tactcacaca ccttaacccg cattgcacga ctgtatattt 23641 acggaaagtt gtgcagcgtc tagaacaagc gaacaactgt tcttaatagg actagaaata 23701 ccaaactttg caaaattatt ttttcttata gacctctgga tgatattcaa agatatactc 23761 agccacagta atgtttttgc attcattgca tgattgagta ctcagttttt ctgataattt 23821 cgactgctgc agactatata aaaccttgca attgactcta ttgccttgct taaaaaaagg 23881 agaaatattt cgtgaaaaaa accctttttc ttaatcctcc ttcctttgat ggatttgacg 23941 gtggtgctgg ttcccgatac caagccaagc gcgaaattac ctccttctgg taccccacat 24001 ggctagcgca gcccgccgca cttgtccctg gaagtaagct tgtagatgct cctccacata 24061 atcagactgt ggaagatgtg ctgaaaattg ctaaagatta cgaactgatt atcatgcaca 24121 ccagcacacc ctcgctggca aatgatgtga cgtgtgctct tgcaatgaaa gaacaaaacc 24181 ccaatgtaca aattggtttt gttggggcgc acgtcgctgt cttaccagag gaaacgctgc 24241 gcgagaactc agttataaac tttgtgtgtc gcaacgaatt cgactacacg tgtcaggaat 24301 tagcagaagg caaaccttgg gatcaaatta agggactcag ctaccgagac aaaaacggac 24361 aactacatca caacgcagaa cgtgatttaa tccacgattg ggatgccatg cccagcgtac 24421 taccagttta tggacgtgat ttagatatta cgaaatattt cataggatat ttactgcatc 24481 cttacgtttc attttacacg ggtcgtggtt gtcctgctaa atgcagtttc tgcctctggc 24541 ctcaaacaat tgggggtcac cagtaccgta ccaaaagtcc agaagcagtt gggcgagaaa 24601 
tggaagaagc caaagccatc tttggtgaca aggtgcagga atatatgttt gatgatgaca 24661 ccttcacaat tgataagcag cgggcgatcg ccatcagcca acacatgaaa cgcctcaagc 24721 tcacttggag ctgtaacgcc cgcgctaacc tagactacga cactctcaag caactgcgtg 24781 acaacggctt acgcctattg ctggtaggat ttgaatcagg taaccagcaa gttttagacg 24841 ggatcaagaa aggaattaag ttagaggtgg cgcggaagtt tatggagaat tgccataaac 24901 tcggcattac cgtacacggc acattcatca tcggcttgcc aaacgaaagt caacagacaa 24961 ttgaagaaac aattcgtttt gcttgcgatg tcagtcctca taccatccaa gtttccatcg 25021 ccgcccctta tcctgggacg gaactttatc aacaagctca gactaacggt tggtttagtg 25081 ataattcctt agttgcttca tctggcattc aaatgtctac actgcaatat ccgaacctct 25141 ccagcgccca aattgaggat gcagtcgagc agatgtatcg tcgcttctac tttcgaccga 25201 aagccatcat cccaattgtc ggtgaaatgc taaccaatcc ccaaatgcta gttcgtcgct 25261 tgcgtgaggg acgcgagttt ttctcttatc tcaaagagcg tcatacacag gcggctgcta 25321 aagagcaatc agtcgtaagt aggtagtgaa cggtagaaca attagcaacc aacaactgac 25381 aactaacaaa taaattgttg ccacatgaaa cttggaagcg aagaacacaa agagcttttt 25441 tgccgtagct tcatcaaaag ccacttagag tttgagccag aaaaactgcc ttggcccgtt 25501 ctcgatagcg tagctctaga acgtctacac ggcattccct tttggagaga agccctcagt 25561 acagagcgac aggctggggc gatggtcagt acctttgccg cgacaattag tgatcccctg 25621 ttgcgagagg cgatcgccct tcaagctttg gaggaaaccc gccactcgcg actcattgag 25681 tgtttgatca atcattacaa tattcaaatt tctcaacctc cagagcctgt cctccctagc 25741 aatatcaaaa ctgctttcat tgactttggc tttggggaat gcctcgattc tttcttagcc 25801 tttggactgt ttaaaattgc ccgccaagct aactatttgc cggaaccatt gttcgacatc 25861 tttgatccaa ttcttcatga agaagcgcgg catattatgt tctttgtgaa ttgggtcact 25921 taccaacaaa ttcaagaggg tcgtacggcg aattggttac gcggggttga tgctctatgg 25981 cattaccgga gagcgctgca agataaaatc aaagctttta gtgggtcaga agaagacaaa 26041 caggagggtt ttactgccac cgccgctggt aactttatgg ataatttgac ccccgaactg 26101 tttttgtcta cttgccttca ggaaaacgcc aaacggatga gcgtctttga tcaacagctc 26161 cttcagccgc aactgttgcc tacacttgcc aaaatcgcct tacgcattat cagactgatg 26221 cctcagcaac agtccaactc agccgcccag ttctccgaac aataaaccag 
aacttatgca 26281 ggctcagaaa ttcgccatta tcaacggtga tgacttcggc ttttcacatg gcgtcaatca 26341 agcaattatc aaagcgcaca aagaaggagt actgacgagt accagcttaa tggttacagg 26401 tgaggcattt gatgaagcgg ttgatttagc acacgctcac cccaccctag cagttggttt 26461 gcacttggtt ctggtgtgtg gtcgagctgc gctaccaccc tcgcaaattc ctcacttggt 26521 tgactctaca ggtaactttc catacagtgc gccgataagt gggttgcgct accagttcat 26581 tcaggcaact catgaggaat tgcggcaaga aatccgcgct caattagaaa aatttcgctc 26641 ctctgggttg cgcctttccc atgtagatgg gcatttgcat atgcacgtcc atccggtggt 26701 actgcgtatc ctagttgacc tagccgatga attcggcatt cgggttatcc gtcttccttg 26761 tgaggaactg gggatgagcc tgcggcttga ccgtcggaac ttactgacta aactggtttg 26821 ggcgggtgta tttggtggac tgcgccgcta cggtgagggg ttgctgaaat caaaggggat 26881 tggctttgct gagcgcgttt acgggttgct tcaaactggt tctgtgactg aggaatactt 26941 gcttggtctc ataccccaaa ttgaggcaaa cctagtagag atttattgtc atccagcggt 27001 cgctatagct ggcgaaccgc tgaatggtcc attaggggca ggtgaagctg aactcgcggc 27061 taccttgagt gagcaagtga gtgaaatgct ggcggcttcg ggatttgaat tgacgaattt 27121 tgagcaacaa agaactcaac ccctgacgta ctaggttgtt cgtatccttt gcaagacaat 27181 aaagcttaaa aagagaatga tcccaatggt cgtcataagt atttgtacca aggtcatatt 27241 ttgtagtctc attgccaaca cgttgcaaat caaagcaatt gaccaaagag taagcgcggc 27301 atggcgttga gacaagcccc aagctaacaa gcggtggtgc aagtggtctt tcccaggtgt 27361 gctaagaggg tttttccccg ccatcaagcg tcggataaac acttgagtgg tatcgagtac 27421 tggcaacagc aaaaacagaa ctggtgcgac cagagagaca gccgtgggtc cttggagatt 27481 acctaaaata ctagtagcag ctagcacata accaaaaaag tacgctccgg catcacccat 27541 aataatgcgc gaagggtgga agttgtggcg caagaagccc aacgccgcac ctgccaatgc 27601 tgctagaact aacgtcgctg ccgcacggtt gggaacctga gctgaaacag ctaataaact 27661 tatagcggtg ataaagctta ctcctcccgc caaaccgtcc atgccatcca ttaagttgat 27721 ggcattagtg attcctacta cccatatcac tgtaataatt actgatagtg tagagtcgat 27781 gggagtccca aaagagaatt caatggtaat acgattagca accagcaaca gagccgtgag 27841 tatctgaatt aacaatcgaa ccagaggggg taagccaaat tggtcatcaa taaagcccac 27901 aagaaccagt atcgaacccc ctagaagaat 
cgtctgtacc tcagccagta ctctttcgag 27961 ttcaataggt cgtaagagac tggctaatac cacggcagca atcactcctg cataaatagc 28021 tagacctcct gcattaggca aaggttctcg gttcagccgt cgcgcattcg gttggtcagc 28081 ccaacctact cgcagggcaa acttgcggac tgctggaatc aaacgccaag tgacggtacc 28141 agctaaaaga aatgtaaata ctaccgctaa ccagccggaa ccgctagggt cagcaatacc 28201 gagggaacga agggagctgt atatgttcat ctcccgattt aaccattagt gttagtgtcg 28261 ataagcatca actgtatgtt agtgaatttt ttcctaacta tctgtaactt cataagaagt 28321 tgcttctaat atagagtcat tctgtgagga taaagtgtcg gagtaagggc ataacacctt 28381 acgtcttttg attagagtgg ggtgtggagt agaaagatta ttttttattt tgtcttttga 28441 aattaaaatt gtcactggtc tgacaccctg aacttcaacg actcccttcg gtgcgacaag 28501 atgaccaaag cgaagggtgt agggg // LOCUS NODE_1016_length_28350_cov_5.14734128350 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 28350) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 28350) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..28350 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(185..1579) /locus_tag="DP116_08745" CDS complement(185..1579) /locus_tag="DP116_08745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="9-cis-epoxycarotenoid dioxygenase" /protein_id="PRJNA477356:DP116_08745" /translation="MRTTQVNPFLDGNFAPVHQETSTNSLQVIGELPPDLSGIFLRNG PNPQWTPIGQYHWFDGDGMLHGVRISDGKAAYRNRYVRTKGWKIENEARKSLWTGFVE PPPKDKPHKQSKNTANTALVWHAGQLLALWEGGAPHAIKVPELNTIGEYTYNGNLVSA MTAHPKVDPVTGEMMFFGYGFTPPYLQYSVVSPQGELLQTEPIDIPAAVMMHDFAITE DYTIFMDLPLTFNIEKKKLGEPMTAFDRDKPSRFGIVPRYGNNSNIRWFESPACYVFH TLNAYEEGDEVVLIACRMNSTNVLGSKDSQPDPEADIPRLYRWRFNLSTGKVSEEMLD DVPCEFPRVNENWLGQKTRYGYTGKLAKGQPVFNSIIKYDFNTGQSQTHEFGKGRYGG EPVFAPRPNAKAEDDGWLMTFVHDTAEDTSELVIVNAQDMTGEPIARVIIPQRVPYGF HGAWVSEEQLRASV" gene 1752..2390 /locus_tag="DP116_08750" CDS 1752..2390 /locus_tag="DP116_08750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213081.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidine phosphatase family protein" /protein_id="PRJNA477356:DP116_08750" /translation="MSLTLYFLRHGQTECSRNNFFCGSVDPELTTDGLEMAKAFATAY SSTPWTAIFCSPMGRTIATAKPLCDAIGMQPQLRDGLKEINYGKWESKTPEAVNQEFH DDYIRWLADPAWYAPTGGEMAIAIASRATQVIEEIKQRYSSGNVLIVSHKATIRIMLC SLLGIDVGRFRYRLGCPVGSVSIVEFGSHGPLLKALADRTHMGEELRNLPGT" gene 2590..3726 /locus_tag="DP116_08755" CDS 2590..3726 /locus_tag="DP116_08755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319348.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="succinylglutamate desuccinylase" /protein_id="PRJNA477356:DP116_08755" /translation="MLPVVSTIPLRHMASGDVLSLQVYKFIGAQPGKKVYIQSNLHGA EIAGNAVIHQLIEFLQTVNDTDLYGEIWLVPVCNPMSTNQRSHIFSSGQFCSYEGKDW NRIFWDYEKHADDFLAFAKSQINFEKEVVRKNYQAKIQQSFAKILEKINSSCGVPYTE RFSYRLQSLSLDADYLIDLHSHANQGLNYLYLYRNREESAKYFLLPFGIQFVEFRCDE DAFDKTFIQPWLALEKHFKQLGREIRFDIEAWTLELGTGMQMNPDSVEKGIRGVKNYL AHKGVLQISGFPLKESESHSMSLKPRTQMKRYYAPTGGMIQSRVELGSSVKAGERIYQ ILSFHKDGTLPTVIDISAEQDGFVFDISSNQAVNEGEFVLGMIE" gene 4000..4974 /locus_tag="DP116_08760" CDS 4000..4974 /locus_tag="DP116_08760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GTPase Era" /protein_id="PRJNA477356:DP116_08760" /translation="MMVEPGRNSSQNDIFSLSGEVVIPQAPPEYKSGFVGIIGRPNVG KSTLMNQLVGQKIAITSPVAQTTRNRLRGILTTPEAQIIFVDTPGIHKPHHQLGEVLV KNAKLAIESVDMVLFVVDGSTNCGTGDRYVADLLTHSTTPVILGLNKIDEQPSNFQTI DNSYTELAREHQWQTRKFSAKTGVGLPELQQLLIEHLELGPLYYPPDLVTDQPERFIM GELIREQILLLTREEVPHSVAIAIDLVEETPAITRVLATIHVERDSQKGILIGKGGAM LKAVGSAAREQMQKLIAGKVYLELFVKVQPKWRQSRIRLAELGYRVEE" gene 4978..5655 /locus_tag="DP116_08765" CDS 4978..5655 /locus_tag="DP116_08765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_08765" /translation="MSQDNTLRLNLPANTSVLRILIVEDDPMMQLGLEQSLMAHPQLE IVGQAEDGYLGVQAALKLKPDLVVMDIGLPRLDGIAATQQIKAALPETHVVMLTSHQT DTEIIAALSSGADAYCIKGASVERLLSAIAAAVEGATYLDPQVARRVIDNLKPPSPSG NTANLSQRELEVLRLMVEGLSNPEIAQKLYLSPNTVKTHVRGIMNKLSVDDRVQAAVV ALRSGLV" gene 5867..6139 /locus_tag="DP116_08770" CDS 5867..6139 /locus_tag="DP116_08770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130203.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08770" /translation="MQSIFWSVEEVALRAKQFYENDIRQHVEYGDNIGKMIVIDAETG EYGIDPTGVETALKLKHQKPNARLFTIRIGYDVAVSFGGAMERIAK" gene 6136..6500 /locus_tag="DP116_08775" /pseudo CDS 6136..6500 /locus_tag="DP116_08775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015083191.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="clan AA aspartic protease" gene complement(6638..7198) /locus_tag="DP116_08780" CDS complement(6638..7198) /locus_tag="DP116_08780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745799.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_08780" /translation="MSATTLHSLTLEEFLKLPETKPASEYINGEIIQKPMPKGRHSRL QCKLCAAVNQVAEDKRIAYAFPELRCSFGERSIVPDVAIFLWKRIPFLIDDQVPDNFE LPPDWTIEILSPEQKPNKVIGNILYCLKHGSRLGWFIDPDDVSILVFLREQQPLLLMG EDLLPVLPDELTLTVNQVFGWLKMGG" gene 7370..7609 /locus_tag="DP116_08785" CDS 7370..7609 /locus_tag="DP116_08785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006530391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB/MazE/SpoVT family DNA-binding domain-containing protein" /protein_id="PRJNA477356:DP116_08785" /translation="MDITIINTEGQIPIPPNIQEQLGLLPGTAIELEVIGDTLHLRKQ PTSSRGAQLITAIRGKATRELRTDEIMQLTRQTND" gene 7602..8006 /locus_tag="DP116_08790" CDS 7602..8006 /locus_tag="DP116_08790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006616204.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_08790" /translation="MTDILVDSNVILDVLTEDRQWFDWSSQMLTEYANRGNLVINPII YAEISIGFNQPEELEAALPQDFFRRDPLPYKAAFLAGQSFLEYRRRGGERRSPLPDFY IGAHAAITAMPLLTRDVNRYSTYFPSVQLITP" gene complement(8021..8305) /locus_tag="DP116_08795" CDS complement(8021..8305) /locus_tag="DP116_08795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318878.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="prevent-host-death protein" /protein_id="PRJNA477356:DP116_08795" /translation="MTSQTTYTEACNNFDKIYNEAITSREPVVVTREGLQSVSVIPTA ELNSIIETAYLFQSPENAARLLDALERVKAKTNQPRTIEDLRQEFGLDEG" gene complement(8302..8796) /locus_tag="DP116_08800" CDS complement(8302..8796) /locus_tag="DP116_08800" /inference="COORDINATES: protein motif:HMM:PF13646.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08800" /translation="MKIGAGNSDAIRALVDLTRNSKDDYTRWQAAQSLENILTADQIA EVVIVLGSDSKPNKKRYQVILHCAQNMPYSAFYQAWHHRSYMRLAITFLQYYWSRIAL VFALVVQGGVVSANPLAYFLGFAEGFALSALGCALIWVFYRTDWKVIRKKIMKVIKKV RSKK" gene complement(8856..10643) /locus_tag="DP116_08805" /pseudo CDS complement(8856..10643) /locus_tag="DP116_08805" /inference="COORDINATES: protein motif:HMM:PF05729.10,HMM:PF13646.4" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 8883..8892 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(10857..11288) /locus_tag="DP116_08810" CDS complement(10857..11288) /locus_tag="DP116_08810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319340.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08810" /translation="MEPVTLTAVATAIATLLLTKALEKTGENLGDAAWQQSRKLIEQL RTKNKLPLLTNATQANEQQRLDYGQAVLELKAAADADPEIAQGVVEVEAAAKGDPKIA AKVQALENDINSQPATVINSTKLADSIKNVFQGNTIIGGTF" gene complement(11607..12638) /locus_tag="DP116_08815" CDS complement(11607..12638) /locus_tag="DP116_08815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412743.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_08815" /translation="MDISRRDLLKVASTSGLVAAGLAGTEGFFSQLQAQNKPTQTRSG EMIYRTLGRTGEKVSVIGLGGHHIGRPKDEQEGIRLIRTAIDRGINFMDNSWDYHNGG SEIRMGKALQDGYRQKVFLMTKIDGRTKQAATQQINDSLKRLQTDRIDLLQHHEIIRM EDPDRVFAPGGSMEAVLEAQKAGKIRYIGFTGHKDPLVHLRMLEVAAQNNFHFDTVQM PLNVMDAHFRSFEQQVLPKLVSNGIGVLGMKSMGDQNILKSNTVKPIECLHYAMNLPT STVITGIESMEILNQAFEAVRTFKPMSQEQVRALLARTRSVAAKGQYELFKTTNQFDS TAKNPEWLG" gene 12909..14861 /locus_tag="DP116_08820" CDS 12909..14861 /locus_tag="DP116_08820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186780.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S9 family peptidase" /protein_id="PRJNA477356:DP116_08820" /translation="MNKKYFSVVAVPLVVAATLLTNISASFADLPPLIPRQILFGNPE KTNPQLSPDGKYLTYIAPDKNNVLQVWLRTVGQNDDRVLTADKKRGIRSYFWTYNGEQ LIYLQDTDGDENFHFYAVNIRSNEVRDLTPYKGVRARMIALEPNFPNEVLVGLNIKDP RKHDAYRINLKTGAAKLELENSVNVTESVADPQLKIRASVASTPDGGSSLSVRQTTNQ PWKIVRKWGPDDEGGAVGFSQDGKTLYITGSHNANATRVLALNLATGKESVIAQDPQY DAGGMFAHPVKRQIQAVSFEKDKLEWQILDKSIAPDFQAISKVSPGEFSVVDRDLADK TWLVAYRTDNGPVYYYTYDRTSKQSKLLFSNQPKLEGLQLAQMKPISYKSRDGLTIHG YLTTPVGIPTKNLPTVLLVHGGPWTRDTWGYNPQAQWLANRGYAVLQLNYRGSTGYGK KFLNAGNREWAGTMHNDLIDGVNWIVQQGIADRKKVAIMGGSYGGYATLVGLTFTPDV FAAGVSIVGPSNLITLLKSIPPYWESGRAEFYNRIGNLEKEPEFLKSRSPLFFVDRIK VPLLIGQGANDPRVKQAESEQIVAAMRKANKPVEYILYPDEGHGFARPQNRLHFYAKA EEFLSKYLGGRVEAVGDIPGNSGVVK" gene 15011..15241 /locus_tag="DP116_08825" CDS 15011..15241 /locus_tag="DP116_08825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NifU family protein" /protein_id="PRJNA477356:DP116_08825" /translation="MELTIDNVETVLDEMRPYLISDGGNVEVVELDGPIVKLRLQGAC GSCPSSTMTLRMGIERRLREMIPEIAEVEQVM" gene complement(15324..15518) /locus_tag="DP116_08830" CDS complement(15324..15518) /locus_tag="DP116_08830" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08830" /translation="MRVSLTWHLVSPALREGFPPQATGVSAASPQEIPEGRTRREVPS VVATAVQEEAGDSCYKGFVT" gene 15575..17803 /locus_tag="DP116_08835" CDS 15575..17803 /locus_tag="DP116_08835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137675.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycoside hydrolase" /protein_id="PRJNA477356:DP116_08835" /translation="MSHPLYVAFIWHQHQPLYKSATNALSSSQHYRLPWVRLHGTKDY LDLVLILERYPRLHQTVNLVPSLILQLEDYIAGTAFDPYLQLSLTPTEQLSDQQRQFI IEHFFDANHHNLIDPHPRYGELYNTRQEKGQAWCFTNWQEQDYSDLLAWHNLAWIDPM FWDDPEIEAWLKQGRNFSLGDRQRIYSKQREILSKIVPQHRAMQDAGQLEVTTSPYTH PILPLLADTNSGRVAVHNMTLPEYRFQWAEDIPRHLQKAWNLYIDRFGTTPRGLWPSE QSVSPEILPYIINQGFKWICSDEAVLGWTLKQFFHRDGAGNVQNPELLYQPYRLQTAA GDVSIVFRDHRLSDLIGFTYGSMSPKQAAADLVGHLQAIARMQKDRQSEQPWLVTIAL DGENCWEYYPQDGKPFLDALYQSLSNEQRIKLVTVSEFIEKFPPTATIPGEQLHSGSW VDGSFTTWIGDPAKNRAWDYLTQARATLANHPEATEENNPEAWEALYAAEGSDWFWWF GAGHSSNQDAIFDQLFREHLYAIYKALNEPIPPYLRQPVEVHEARTDHRPESFIHPII DGKGDEQDWDKAGRIELGGARGTMHNSSAIQRLWYGVDHLNFYLRLDFKTAIQLGQDV PPELNLLWYYPDKTMHNSPVPLAEVPDTSPMNYLFHHHLEINLLTQSIQFREAGEDYQ WYPRASRAQVAFNKCLELAVPWADLQIPPDYPLRLILVLSEEGRFCNYLPENALIPIE VP" gene complement(18203..18928) /locus_tag="DP116_08840" CDS complement(18203..18928) /locus_tag="DP116_08840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131490.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FkbM family methyltransferase" /protein_id="PRJNA477356:DP116_08840" /translation="MLRDIAYSLISAGYKIPKSVEEKIGRFSYQTYLVTLLKKLRINC VLDVGANIGYFAENIRKLGYKGQILSFEPHPEIFPTLQKNFKHDTLWRGYDLGLGSED ALATFNLNTYSELSSFLVPNASMPKTVNSCEVKIKPLDSLLDDILTLVPEPRIFLKMD TQGYDIEVVKGASKCIDKILCLQSEISVRPNYINIPSYLDALRYYESLGFELIDLFPA FRNFDGYVTEYDCLMVRSKTSSV" gene 19796..21877 /locus_tag="DP116_08845" CDS 19796..21877 /locus_tag="DP116_08845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319012.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_08845" /translation="MICCLNPDCSNPLNPSGKKFCRTCSTPLIPLLRNRFHIIKLLSD EGGFGRTYLAEDVDKLNERCVIKQLAPRIQGTWALKKAIELFEKEAQRLQELGTHPQI PTLLAYFEQDKYLYLVQQFIDGQNLLTELQQKKKYNCSEIKKILLDLLPVLKFIHEQG VIHRDIKPQNIIRRQTSLSSKTAVSEIGRNLVLIDFGSAKQLTAQAQMKIGTSIGSQG YSPIEQIRDGAAYPASDLFALGATCFHLLTGVSPFKLWTEHGYSWVKDWQQYLKNPIT EELAQILDKLLQKDIEDRYQSAHQVLADLLIKHQKQSQSTAVTQIKQITKLSITQLQT VSKPYLLLRNLLLAGSAIILLGSGEFWYRQFHNLEMRISASLSQPNSSATNREMIFHQ GKQAPLENFLLASTVKGYTNSILSVAISPDNKAIASNSNDTIKLLSLVTGQEISTLSG HTNTVNFTSFSPDGQILVSASEDKTIKIWNLATGQQMRTLEGHTHSVNTLAFSHDSKI LADGSNDNTIKVWNLATADEIRTLRGHSSSVRSVAFSPNDNTLASGSFDKTIKLWNLA TGQEIRTLEGHSGKVTSVAFSPDGKILASGSFDKTIKLWNLATGQEIRTLEGHSGKVT SIAFSPDGKILASGSFDKSIKLWNLVTGQQIRTLEGHSDGIQSVAFSSDGKTLVSGGN DKTIRIWQTSL" gene 22069..23106 /locus_tag="DP116_08850" CDS 22069..23106 /locus_tag="DP116_08850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319013.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08850" /translation="MTDSKVKVLTSVAIALLGFGCVWYLQSSPAATTSASSHESESQT THDENFSEAFTLAGHSDSILAVAISPDGHTLVSVSGDKTIKLWNLDTGKEIRTLVGHS DWVNSVVFSPDGKTLISASADRTIKVWNLDTGKEIQTLTGHLASVQAVAISPDGSTLA SGSWDQTIKLWKLATGKIIRTLKGGCDVVNTVAFSPNGKTLASGNYFDNSINLWDVAT GKETQTLRGHSEAVSSLIFSPDGKTLISGSWDKTIKLWEIATQREIYTLAGHANKVLS VAVSPDGKTIASSSWDKTIKLWNLAMGKEIRTLKGHTKRVWSVAFSPDGKTLVSGSFD KTVKIWHVVSR" gene complement(23212..23340) /locus_tag="DP116_08855" /pseudo CDS complement(23212..23340) /locus_tag="DP116_08855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747557.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" gene 23502..23783 /locus_tag="DP116_08860" CDS 23502..23783 /locus_tag="DP116_08860" /EC_number="4.2.1.96" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310629.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4a-hydroxytetrahydrobiopterin dehydratase" /protein_id="PRJNA477356:DP116_08860" /translation="MTQLLTEEEIQEKVSHLPNWTLQASTLQCTRQFKDFIEAIEFVN KLVEPAESAQHHPDIEISYNKVKISLTTHDAGGLTQKDFDLAKLISEIN" gene 24354..26168 /locus_tag="DP116_08865" CDS 24354..26168 /locus_tag="DP116_08865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879052.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08865" /translation="MSNLLWKSLVTSPAVLGATLLVSTTAIAAPKNISHEVVTTEVSQ TSNNQEASSSVSTTSTTTTATAVSPKPETLIQAQQTKVNVSQQVSSYSNEGNNNSQSQ VTSVSQFSDVQPTDWAFQALQSLVERYGCIAGYPNGTFRGNRALTRYEFAAGLNACLD RVNELIATATSDLVRKEDLTTLQRLQEEFSAELATLRGRVDVLEARSAELEANQFSTT TKLVGEAIFALSDAFGDTVGKNNNTVFQNRVRLDLQTSFTGKDVLHTRLATGNARRLN TGGNVDVNRNGVIDTSEQNAEGFQTFNLNGDSSNSNDIVLDWLAYYVPIGPAQLYVAA TGGIHSDYAATNNPYFEDYDGGNGALTTFGSENPIYRIGGGAGAALNIPFGKGGGILK PSSLTVSYLGSEPNNPGIGSGIFNGNYAALGQLNFNLGQRIALAATYVHGYHGAGSAL FDAGGFQGANVPVVGTSQANALSSTNASSSNSYGLSAAFRPSDKLSVSGFVSYHDVTG FGRNDDYEAWSYGLGVALPDLGKKGNVLGVFGGAQPYALGRVAGANAVPYQIEGFYKY RVSDNVSITPGVIYQISPGQNSNNDNAFIGTLRTTFTF" gene 26338..26727 /locus_tag="DP116_08870" CDS 26338..26727 /locus_tag="DP116_08870" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08870" /translation="MEENSPAYRQHIPQQDWERTPPSVQKLVEDMWQCIEKLEQQVGM LLEVQQQLLEKINCTSKNSSSPPSTDPPNTPKTQRKQKSGRKRGGQKGHEGHGRSLYP AERCARIVDHINTLYRSEYKKYKKLRI" gene 27770..28324 /locus_tag="DP116_08875" CDS 27770..28324 /locus_tag="DP116_08875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875834.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2'-5' RNA ligase family protein" /protein_id="PRJNA477356:DP116_08875" /translation="MQLSQRLYFIALLPPQEIQDYANQIKQYFADKYASRHAQKSPPH ITLQPPFKWADADVPRLEECLKYFASDRESVPITLSGFGAFAPRVIYINVVRSLELMT LYTDLIMYMESNLGIVDKVGKTRPFAPHMTVAFRDLSRQNFQAAWSEFEKQQLQFEFT ASDLKLLLHDGRRWNGEVPHLAFG" BASE COUNT 8257 a 5992 c 6009 g 8082 t 10 others ORIGIN 1 caatgattga ctaaaccaaa ataatatgca agttaaaagc gtaacagctt atcaagtccg 61 cttgtttagt tatcctatcc gcaaaacctc acccccatcc cctctccttc ctctccttat 121 taaggagagg ggtgcccgac agggcggggt gaggtgatac gcgtgaaggc agtggagtat 181 caatctacac agaagccctc aactgttcct cagaaaccca agcaccgtgg aacccataag 241 gtacacgctg aggaataatt actcgtgcta ttggttcacc tgtcatatcc tgcgcattaa 301 ctatcacgag ttcagaagta tcttctgcgg tgtcgtgaac aaaagtcata agccaaccgt 361 catcctctgc ttttgcatta ggacgtggtg caaagacagg ttcaccacca taacgtcctt 421 ttccaaattc gtgggtttgc gattgcccag tgttgaagtc gtacttaata atactattga 481 acacaggttg accctttgct agtttgccag tgtatccgta tcgtgttttt tgtcccaacc 541 aattctcatt gacacgggga aattcacaag ggacatcatc tagcatttcc tcactcacct 601 tgcccgtact cagattaaac cgccaccgat acaaacgggg gatatctgct tccggatcag 661 gttgtgagtc ttttgaaccc aatacattcg tagaattcat gcgacaagca atcaacacca 721 cctcgtcacc ctcctcgtaa gcattgaggg tgtggaagac atagcaagcg ggactctcaa 781 accagcgaat attactgtta ttgccataac gcggaacaat accaaagcga ctgggtttgt 841 cacgatcaaa cgctgtcata ggttcgccca gttttttctt ttctatattg aaagtcagcg 901 gcaaatccat gaaaattgtg tagtcttcag tgatggcaaa atcatgcatc atcacggctg 961 
ctggtatgtc aataggttct gtctgcaaaa gttcgccttg tggtgaaacc acgctgtatt 1021 gcagatacgg tggtgtgaag ccgtaaccaa aaaacatcat ctcacccgtg actggatcta 1081 ccttgggatg agcagtcatt gctgaaacaa ggttgccatt gtaggtatac tcaccaatag 1141 tgtttaactc aggaaccttg atagcgtgag gtgcacctcc ttcccacaac gccaaaagtt 1201 gacctgcgtg ccacacaaga gcagtgttag cggtattctt gctttgtttg tggggcttgt 1261 cttttggtgg cggttctacg aacccagtcc acagagattt acgtgcttca ttctcaattt 1321 tccatccttt tgtccgcaca tagcggttgc gataggcagc tttaccgtcg ctaattcgca 1381 ctccatgcaa cattccatcg ccatcaaacc aatgatactg tcctatgggt gtccattggg 1441 gattcggacc attgcgcaga aatattcctg ataagtcggg agggagttca cctatgactt 1501 gcaggctgtt ggtagaggtt tcttgatgca ctggcgcaaa gttaccatca agaaaagggt 1561 tgacttgtgt tgttctcatg attggtgtcc caaacccaaa tcaatatgct atcggtactt 1621 cgctatatta gccatagccc cactaactgc gctcaacgca cagtgaggaa ttcaaaattc 1681 caaattcaaa aaaattcatt ccctcaaggg gttgactctt gaccaatgat taatctagta 1741 taccgacttc tatgagttta actctttatt tcctccgtca cggacagaca gaatgcagtc 1801 gaaataattt tttctgtggt tcggtagatc cagaactaac cacagatggc ttggagatgg 1861 caaaagcttt tgcgactgca tacagttcta ccccctggac agcaattttt tgctctccta 1921 tgggacgaac catagccacg gcgaaacctt tgtgtgacgc aatcgggatg caaccacaac 1981 tgcgggatgg cttgaaggaa atcaactatg gcaaatggga aagcaaaaca ccagaggcgg 2041 tgaatcaaga atttcatgat gattatatcc ggtggttagc agatcccgct tggtacgcgc 2101 ctactggggg ggagatggcg atcgccatag catctcgtgc tacccaagtt attgaagaaa 2161 tcaagcagcg ttacagcagt ggcaatgttt taattgtttc tcacaaagca accatcagaa 2221 ttatgctgtg cagtttgttg ggaattgacg tagggcgctt tcgctatcgt ttgggatgcc 2281 ctgtaggttc cgtaagtatt gtagaatttg gctcacatgg tcctttactt aaagcattag 2341 ctgaccgtac tcatatgggt gaggagttac ggaatttacc aggaacctaa cagttagcag 2401 ttatcagtta tcagttttag agatgattta tctctcgttc ccatgctctg cataggaatg 2461 cgatcgccca ggctctgcct ccaaacaatt atattgaggc agacgaggaa tttgcaaccc 2521 agccagtacc agaacaactc accataactg gttttctaaa ttttgaatca gcgaatgagc 2581 gaattactta tgcttcctgt tgtatctacc attccactgc gccacatggc ttcaggcgat 2641 gtcttatccc 
tgcaagtcta caaattcatt ggcgctcaac cgggcaaaaa agtttacatt 2701 caatctaatt tacatggtgc ggaaatcgct ggtaatgctg ttattcacca gctaattgaa 2761 tttttacaga cggtaaatga tacagactta tacggtgaaa tttggttagt tcctgtttgt 2821 aacccaatgt caaccaatca gcgatcgcac attttttcct caggacaatt ctgtagttat 2881 gaaggaaaag attggaatcg tattttttgg gactacgaaa agcacgctga cgatttcttg 2941 gcatttgcta aatctcaaat caattttgaa aaagaggtgg tgagaaaaaa ctaccaggct 3001 aaaattcagc aaagttttgc caaaatttta gaaaaaatta actcctcctg tggagttcct 3061 tacacagaac gctttagcta cagactgcaa tctctgagtt tagatgcaga ttatttgatt 3121 gacttacaca gtcacgccaa tcaaggatta aactatcttt atctctaccg taatagagaa 3181 gaaagtgcaa aatatttctt acttccattt ggaattcaat ttgttgaatt tagatgcgat 3241 gaagatgctt ttgataaaac ctttatccaa ccttggttag ctttggaaaa gcatttcaaa 3301 cagcttggta gagaaattag gtttgatata gaggcttgga cactcgaact tggtacagga 3361 atgcaaatga acccagattc agtggagaaa ggtattcgag gtgtgaaaaa ctatttagca 3421 cataaaggtg ttttacaaat ctctggattt cctctcaagg aaagcgaatc tcatagcatg 3481 agtttgaagc caaggactca aatgaaaaga tattatgctc caacaggtgg aatgattcaa 3541 tcaagagtgg aattgggaag ttctgtaaaa gctggagagc gaatttatca aattttgagt 3601 tttcataaag acggtacatt accaactgtg attgatatta gtgcagaaca agatggattt 3661 gttttcgata tctcaagcaa tcaggcagtt aacgaaggcg agtttgtgct tggaatgatt 3721 gaatagtgtc ttgtgacttc agttttcatt tagaatgagt cattctgttt gaaaaggctc 3781 attgaaaaag tgggcagtga tagataagtc aattaaaata agattaagta attatagcgg 3841 ttatcgtttg aatcacatat acaacggcat atatagaagt gaaatgtaac gtcgaaactg 3901 tattgtatgt aacgattatc cgctatagta taaattcagt ggtagtttga tgtttgattt 3961 gtatttgtac aaaaagttac gttatccgaa atctaaaata tgatggtaga gccagggaga 4021 aatagtagtc aaaatgatat cttctctctt tcaggagaag tggtaattcc gcaggctcct 4081 cctgaatata aatcaggttt tgtcggcatt attggtcgtc ctaatgtcgg taaatctacg 4141 ttaatgaatc aattagttgg acaaaaaatt gctattacat caccagtagc acaaacaaca 4201 cgtaatcggt tacgaggtat tctcaccacc ccagaagcgc agattatttt tgtggatacc 4261 ccaggaattc ataaacccca tcatcaattg ggggaagtgc tggtaaaaaa tgccaaacta 4321 gccattgaat cggtagatat 
ggtgttgttt gtggtggatg gatcgacgaa ttgtggaacg 4381 ggcgatcgct atgtggctga tttactcact cacagcacaa caccagtaat tttagggtta 4441 aacaaaattg acgaacagcc ttcaaatttc cagacaatag acaatagtta caccgaattg 4501 gcaagggagc atcagtggca aacaagaaaa ttttctgcca aaactggtgt cggattacct 4561 gaactgcaac aattattaat tgaacactta gaactaggac cattatatta tccaccagac 4621 ttagtaacag accagccaga acgctttatt atgggcgaat tgattcgaga acaaattttg 4681 ctgttaactc gtgaagaagt accccattca gttgcgatcg ccattgactt agtggaagaa 4741 acaccagcta ttacccgtgt acttgcgact atacacgttg agcgcgattc ccaaaaagga 4801 attctcatcg gtaaaggcgg agcaatgtta aaagctgttg gtagtgcagc gcgcgaacaa 4861 atgcaaaagt tgattgcagg aaaagtttac ctggaattgt tcgttaaagt tcaaccaaaa 4921 tggcgtcagt ctcggatacg tttggcagag ttggggtatc gcgtggaaga ataaaaaatg 4981 tctcaggata ataccttacg cttaaatcta ccagccaata cttctgtgct gcgaatttta 5041 attgtagaag atgatccaat gatgcaactg gggttagaac agtcattaat ggctcatcct 5101 cagttggaaa ttgtcggaca agcagaagat ggctacttgg gtgtgcaagc agcactgaaa 5161 ttaaaaccgg atttggtggt gatggatatt ggtttacccc ggttggatgg gattgctgca 5221 acacagcaaa ttaaagcagc attaccagaa actcacgttg tgatgctgac atcgcatcaa 5281 acagatacag aaattattgc ggcgttgtct agcggtgctg atgcttattg tatcaaaggg 5341 gcgagtgtgg agcgactgtt gagtgcgatc gccgccgcag ttgaaggtgc aacctacctc 5401 gatcctcaag ttgctagacg agtcattgat aatctcaaac ccccttctcc cagtggaaac 5461 acagcaaatc tatctcagcg cgagttagaa gtcttgagac tgatggtaga aggcttaagt 5521 aatccagaaa ttgcacaaaa gctttatttg agtcccaata ctgtcaaaac tcatgttcgg 5581 ggaattatga ataagttatc tgtagacgat cgcgtgcaag ctgcagttgt cgcattgcgt 5641 tctgggttgg tgtgaaaaaa aatccaactt acgatatgtc attgcgagtg gaacgaagtg 5701 gagcgaagca atcgcaagaa tcgtatttta tgttttttat gttgagctac ttataaaatc 5761 tcgtttccag gttctacctg gaaatgcagt ttcagcggct ctgccgcaag taaactatgc 5821 ttgctttgca ataatgatta aaggacaaaa agtaggtgta actgttatgc aatctatttt 5881 ctggagtgtg gaagaagttg ctctaagagc caaacagttt tacgaaaacg atattcgtca 5941 acatgttgag tatggcgata atattggcaa gatgattgtg attgatgcag agactggcga 6001 atatggaatc gatccaactg gcgtagagac 
agcgttaaag ttaaaacacc aaaaaccaaa 6061 cgcgagattg tttactatac ggattggtta tgacgttgct gtaagctttg gtggcgcaat 6121 ggaacgtatt gccaagtgat ttatggaaaa gtaattgatg gtagagcaat agttccagtg 6181 gtttttcgct taccttcaca accagatttt tcgttggatt ttgtaattga tactggattt 6241 aatgaccatc tgactttacc accacaagca gttagtgcta tgaatcttcc tttatattcc 6301 actacatctg caaggttagc cgacggtagc gaagctttat tatctataca tttggcaaca 6361 attgtataga taataaagaa aagttagttc cagttttagc ttctggttat aaacctttgc 6421 ttggaactgc tctgatggta ggatatcatt tagcaataga ttttcaagac aatggtttag 6481 tttcgttaga aaaactctaa gcaattagta ttggcgatac gccgccagaa gaaaaaaata 6541 ccaggcgatt agaaatcgcg gctacaaaaa caaagcctgc ctccgcaggc taaattatta 6601 gtcctaggat tccatttgat ttttgaaaaa gatggcgcta gccacccatc ttcagccatc 6661 caaaaacttg attgacggtt aatgtcagtt catctggtaa cacaggcaat aaatcctctc 6721 ccataagtag taaaggctgt tgctcacgga gaaatactaa gatactgaca tcatcaggat 6781 caataaacca tccgaggcga ctaccatgtt ttaagcaata aagaatgtta ccaatcactt 6841 tgttgggttt ttgttctggc gagagaatct caatcgtcca atctggcggt agttcaaagt 6901 tatcaggaac ttgatcatca atcagaaatg gtatgcgctt ccacagaaat atagccacat 6961 ccggtacaat tgagcgttcc ccgaaactac atcgtaactc gggaaaagca taggcaatcc 7021 tcttatcctc tgcaacttga ttgactgctg cacagagttt acattgcaag cggctatgtc 7081 tccccttcgg cattggcttt tgaataatct caccgttaat atattcgctg gctggcttgg 7141 tttcagggag cttcaggaac tcctctaggg taagagagtg gagagttgta gcgctcatgg 7201 ctcctgaatc agtacgaaga tgtgttctca gtttagcaaa acgcagctgg ctagctcatt 7261 tgacaggcgc gtccttgggg gacactccgt gcgatcgctt cgcttggttc tgctgctaaa 7321 ttaggtatta accaaggaat gacctttacc attgctaaaa tattgccgta tggacattac 7381 aataatcaac accgagggtc agatccctat tcctcccaat attcaagaac agcttggact 7441 tctacctggc acagcaattg agcttgaagt catcggtgat acgcttcatc ttcgtaagca 7501 accaacttca agtcggggag cacaactcat tactgccata cgtggcaaag ctaccagaga 7561 attgagaact gacgagatta tgcaactcac ccgccaaacc aatgactgac attctggttg 7621 atagcaacgt tatcctggat gttttgaccg aagatcgcca gtggtttgat tggtcatctc 7681 aaatgctaac agaatacgct aatcggggaa atttggtaat 
taaccccatt atttacgctg 7741 aaatttcgat tggatttaat caacctgaag aactagaagc agctctacct caagatttct 7801 ttcgtcgcga tccattgcct tataaagcag cattcttagc aggacaaagc tttctggagt 7861 atcgccgtcg tggcggtgag cgccgctctc cattaccaga cttttacatt ggtgcccatg 7921 ctgccatcac agccatgccc ttactaacca gagatgttaa ccgttactct acttattttc 7981 catcagttca actcattaca ccataggaaa ttacaatttt ttatccttcg tctagcccaa 8041 actcctgacg cagatcctca atagttctag gttgattagt ttttgcttta acacgttcca 8101 aagcatctaa aagacgtgct gcattctctg gtgactgaaa caaataagct gtttctataa 8161 tgctatttag ttccgcagta ggaatgacgg atacgctttg caaaccttca cgagtcacaa 8221 cgacaggttc acgacttgtg attgcttcgt tataaatttt gtcaaagttg ttgcaagctt 8281 cagtgtaggt tgtctggcta gtcatttttt actcctgact ttctttatta ctttcataat 8341 tttctttctg atcaccttcc aatctgttct ataaaaaacc caaattaaag cacatcctaa 8401 agcacttaaa gcaaatcctt cagcaaatcc taaaaaatat gctaaaggat ttgctgaaac 8461 aacaccacct tgtactacta aagcaaatac taaagcaatc ctcgaccaat aatattgaag 8521 aaatgtaatt gctaatctca tgtaagagcg atgatgccaa gcttggtaaa aagctgaata 8581 aggcatattt tgggcgcaat gtaagataac ttggtaacgc tttttgttcg gtttggagtc 8641 actgcccaag acaatgacaa cttccgctat ctgatctgct gtcaaaatat tttctaagct 8701 ttgtgccgct tgccaacggg tataatcatc cttggagttg cgggttaaat cgactaaagc 8761 cctgatggca tcagagttgc cagcaccaat tttcactaag ctttgtgccg cttgccaacg 8821 ggtataatca tccttggagt tgcgggttaa atcgactaaa gccctgatgg catcagagtt 8881 gcnnnnnnnn nntcatcctt ggagttgcgg gttaaatcga ctaaagccct gatggcatca 8941 gagttgctag gatcaatttt ccctaagctc tctgccgctt gcctacggat aaaatcatcc 9001 ttggagttgt ggatgaactc gactaaagcg cttatcgcat cagagttgcc agcaccaatt 9061 ttccctaagc tatatactgc ttgccaacag gtataatcag acttggagtt gcggattaaa 9121 ttgactaaag cgcttatcgc atcagagtta ccaggatcaa ttttctctaa gctctgtgcc 9181 gcttgccaac agctatcatc atctttggag ttgcagatga acttgacaag agcattaata 9241 gcttttgggc gatatgtctc tgcgagtact gcctcagctg tttcttgaag agaaaagttt 9301 gtgatatttt cgataggttc gataagtttg tcaatgaggg aagtattaaa accccactga 9361 acgatttgct ctacgatttc atcagcttta gaacaatctc taaactcagc 
aattcccctt 9421 gcagctagaa tataggctcg atacttataa aagttattgc aattatcctc aaactctacc 9481 aatgctttta taaactgctc tttctgctct gtttctagtt cttcgcaacc caaccagagc 9541 aaaatcacct gcttccattg gggttcaaat atgcgataag ttccttggga tgggttgttg 9601 gggacgtggt taagcaaaaa gtgccaatca tcaatcgcta atgccgcaaa atattcctga 9661 aacgtggcat ggtagaaagc ataaaacggt tcatctgttt gtttatctcg atctacctga 9721 ttcagccacc ctaaacgaca agcgcaatta aagagattct ctcccatcac ccggtaagca 9781 aaattctgtc caatgcgaaa tcgggttttt ccagaagcga tcgcctcctg tgccaacttt 9841 cctaaagctg cattcaactc attctgttgg gcgagcgttg ttggaaattc agcttgtttc 9901 cattcatagt ggtatagtgt aaatcgcttg taaagtgcag cttgagtttc tggtaaatcc 9961 ttagtcatgt cccagacttg acacatcaaa gctaatctca gggggttttt taccaagtca 10021 agcaagcgtt cccttccaga ttcattcagc ttttgccata gcagttcccc ttgtcctttt 10081 gttctcttat cctcttcctc agcccgctgg aaccaagcta gaataaaatc ctgcacttgc 10141 ccgctgctaa actcctgagt tttgaaattc tggaagtcaa gcagattttt ggtgctaaca 10201 tcccaaacat tcagccgaca agttaacaca actcgcgccg cagcaagttt gctaatgtct 10261 tggctaatct tatttaaagc gtcaacagaa gatgcagctt gcatctcatc caaaccatcc 10321 agcagaaacc atacccgttg ttgctgcaac aattcttgca gttggtttag tgtgattgga 10381 ttttcgagcc aacttttcag caggtatttt tccagcgtgt ccccagacaa attgctcaaa 10441 ggaatataga ttggcaaggc gttatgctca attaagtact tgccaatctt ttccagcaag 10501 gtagactttc ccgctcctgg ttctccgata atggctatat tcttgtcttt gatgttggaa 10561 gcatccccaa tcacctctgt gagaaatttc tcatgctcat aagtcgtctt gataacttcc 10621 tcagtaagtt gatacgcaag cattcctttt tccggtgaga tatcctcact gcggcgaggc 10681 tgttgtttgc gctcaacaag tcctaaaggg acataaatat cgagttcaaa tccccttggt 10741 gttgctagcc gtcgttgacg attaacatcg agttgctgtt ggaatttgtt ttggcagact 10801 tcgcgccagt tgatgggagt tgactttacc cgattctccc cgtcgggcga attcagtcaa 10861 aaagtgccac caataatagt atttccttgg aaaacgtttt taatcgaatc agctaatttc 10921 gtggaattaa tgacggttgc tggttgagag ttgatgtcat tttctaacgc ctgaactttt 10981 gcagcaattt ttggatcacc tttcgctgct gcttctactt ccacaacacc ttgagcaatt 11041 tctggatctg cgtctgccgc tgctttgagt tccagcacag 
cttgcccata atctaaccgt 11101 tgctgttcgt ttgcttgtgt agcattggtg agtaatggta acttattctt agtacgaagc 11161 tgttcgatta acttgcgact ttgctgccat gcagcatctc cgagattttc gccagttttc 11221 tccagtgctt ttgttagaag tagggtggcg atcgctgtag ccacagccgt caaagttact 11281 ggttccataa attatttcat agtagattct tctgtattat ttataccgta gcatccacta 11341 atttttcgga ataacatctt gcaccgagag cgatgagtac gccttgaact caagttcaag 11401 gcttataaac aaagtcagtt aaaacggact cacagactta cccagtccgt tttaactgac 11461 ttaagctttg agccaagaaa tttatttctt ggcggacgaa aattatggtg caagatctga 11521 gcctttaact agcgatagaa aaccgctagc aacagcctta agaaactaaa tccccctgtt 11581 ccctgttccc tgttccgtaa tagcaattat cccaaccatt ccggattctt agctgtacta 11641 tcaaattgat tcgtagtctt aaatagctcg tactgacctt ttgcagcaac agagcgagtt 11701 cgagcgagca gcgccctgac ttgttcctga ctcattggct taaatgtgcg tactgcttca 11761 aacgcttggt ttaaaatttc catgctttca ataccagtaa tcacggttga agttggcagg 11821 ttcatggcgt agtggagaca ttcgatgggt ttcactgtat tgctcttaag aatattttgg 11881 tcccccattg atttcatacc cagcacaccg attccattgc tcactagttt gggtaaaact 11941 tgctgttcaa aactcctgaa atgtgcatcc atcacattta atggcatctg cactgtatcg 12001 aaatggaagt tattttgagc agcaacttcc agcattctca ggtgaaccag aggatctttg 12061 tgtccagtaa agccaatgta gcgaatcttg cctgcttttt gagcttctag cactgcttcc 12121 attgaaccac caggagcaaa aacgcggtct gggtcttcca tgcgaatgat ttcatgatgc 12181 tgtagcaaat caatacgatc tgtttgcaaa cgtttgagag aatcattaat ttgctgagtc 12241 gccgcctgtt ttgttcgacc gtctatttta gtcatgagaa agaccttttg acggtaacca 12301 tcctgcagag ccttacccat gcggatttcg cttcctccgt tatggtagtc ccaactgtta 12361 tccatgaaat tgataccgcg atcaatcgca gtccggatga gacggatacc ttcttgctca 12421 tcttttggtc gtccaatatg gtgacctcct aaaccaatca cggatacttt ttctccggtg 12481 cgtcccaaag ttctgtagat catttcgccg cttctggtct gagtgggttt attttgggct 12541 tgtagttgtg agaagaaccc ttcggtacct gccaaccctg ctgcaaccaa acctgaggta 12601 gaagctactt tcaataaatc tcgtctgctg atatccacaa atttctgttt ctaaataatt 12661 gtcataattc gttctcatag agtgacacga tgcaaaatag agtgcatcta actgttgaga 12721 gaaaacagca atgatcgggg 
gaaacagaag gctttgcgct tacgcttcac gcaggttgcc 12781 cgaaagctag ccccgtcccg ctcaaagaaa ttttacaatt tttttccaca accctatttc 12841 tcagtccaat gacagatact taaattgtta agaactgtat tttgcagatt ttccagtgta 12901 ttcggtttat gaacaaaaaa tatttttctg tagtagctgt ccccttagtc gttgcagcaa 12961 ctctcctaac caacatcagt gcatcctttg ctgacttacc gccactcatt ccccgacaga 13021 ttctatttgg aaatccagaa aaaacaaatc cacaactatc tccagatgga aaatatctaa 13081 catacattgc acctgataag aataatgtat tgcaggtatg gctacgcaca gtaggtcaaa 13141 atgacgaccg agttctgact gctgacaaaa agcgtggtat ccgcagttac ttctggactt 13201 acaatggtga acaattaatt tacctacaag acacagatgg tgatgaaaat tttcactttt 13261 atgctgttaa tataaggtct aacgaagtgc gtgacctaac gccgtataag ggcgtgagag 13321 cgcgaatgat tgccttagag ccgaatttcc ccaatgaggt gctggtgggt ctgaatatta 13381 aagacccccg caagcacgat gcttaccgta tcaacctgaa aacgggagca gcaaagctgg 13441 aattagaaaa ctctgttaat gtgactgagt ctgttgcaga tccacagttg aaaatccgtg 13501 catctgtagc cagtactcct gatggaggtt ctagcttatc tgttaggcaa acaacaaatc 13561 aaccctggaa gatagttcgt aaatggggac ctgatgacga aggcggtgct gttggctttt 13621 cccaggatgg gaaaacactt tatatcacgg gaagtcataa tgcgaatgcg acacgagttt 13681 tggcgctgaa cttggctaca ggtaaagaat ccgttattgc tcaagacccg cagtatgatg 13741 caggaggaat gtttgctcac ccagtcaaac gccaaattca ggcggtttcg tttgagaaag 13801 acaagttgga gtggcagata ttagataaaa gtattgcccc ggattttcag gcaatatcta 13861 aagtcagccc aggtgaattt tctgtcgttg accgcgacct tgctgataaa acttggctag 13921 ttgcttatcg tactgataac ggtccggtct actactacac ttatgatcgc acctctaagc 13981 aaagcaaact cctcttcagc aatcaaccaa aattggaagg tctacaactc gcccaaatga 14041 aaccgatttc ctataaatct cgggacgggt tgactatcca cggctacctg acaacacctg 14101 taggaattcc gacaaagaat ttacctacag tcctcctcgt gcatggcgga ccttggacgc 14161 gggatacttg gggttacaac ccacaagcgc agtggctagc aaaccgtggc tacgcagttc 14221 tacaactcaa ctatcgaggt tctaccggat acggcaaaaa attccttaac gccggaaatc 14281 gtgaatgggc aggtacaatg cacaatgacc tgattgatgg tgttaactgg attgtccaac 14341 aaggtatcgc agatcggaaa aaagttgcga ttatgggggg ttcctacggc ggttatgcca 14401 
ctttggtagg attgacattc actcctgatg tctttgctgc tggtgtaagt attgttggtc 14461 caagcaactt gataacgcta ttaaaaagta ttccgcccta ttgggagtca ggacgagcag 14521 aattttataa tcgtatcggt aatctagaaa aagagccgga gtttctcaag tctcgttctc 14581 cattattttt cgtggatcgc atcaaagttc ctttgctgat tggacaaggt gcaaatgacc 14641 cccgagtcaa gcaagcagaa agcgaacaaa ttgttgcagc gatgcgaaaa gcaaacaagc 14701 ctgtagagta catcctctac ccagatgaag gacacggttt tgcacgtccg caaaaccggc 14761 tacacttcta tgctaaagca gaagagtttc ttagcaaata tctgggcgga cgagtagaag 14821 cagtaggtga cataccggga aattctggag tggttaagta agaattgttc atggttagtc 14881 atcagtcact tttggttcat aattcaatta gaccgaggag aaaagacaat caagccagca 14941 agctgcaatt ggatctagga ttgtcttaac aagactagaa aataaaaatt ttgggaagag 15001 gtctacaact atggaactta caattgacaa tgtcgaaacc gttttagatg aaatgcgtcc 15061 ttatctcata tctgatggcg gtaatgtgga agtcgtagaa cttgatggcc ccattgtgaa 15121 actacggttg caaggtgctt gtggttcttg tcccagttcc actatgactt tgagaatggg 15181 gattgaacgt cgcttgaggg aaatgattcc tgaaattgca gaagttgagc aagtgatgta 15241 aaagttatta gtcaatagtt aacactcaag agtccaaagt ccatagactc ttgagtgttg 15301 gcgcttgaat ggcagttgct aaactatgtg acaaacccct tgtagcaact gtctcctgct 15361 tcctcttgaa cggcagttgc tacaacggag ggaacctccc tccgggttcg cccttcgggt 15421 atctcctgcg gagacgctgc gctaacgcca gtcgcctgcg gagggaaacc ctcccgcagc 15481 gctggactca ccagatgcca agtgagggag accctcatca agcaggttgt cgtcaccgca 15541 acgcactacc tccccttaac ccttgacttt tgctatgtct catcctctct acgtcgcttt 15601 tatttggcat caacatcagc cgctgtacaa atctgccacg aacgcgctgt ctagttctca 15661 gcattatcgt ttaccttggg tgcgtttgca tggtacaaag gactatctgg atctcgtgct 15721 gattttagag cggtatcctc gattgcacca aacagtgaat ttggtaccat cgctgatatt 15781 gcaacttgaa gattacattg ctggcactgc ctttgatcct tatttgcaac tcagtctgac 15841 accaactgag caactatctg accaacagag acaatttatc atagagcatt tttttgacgc 15901 caatcaccac aatctgattg acccccatcc gcgttatggt gagttgtata atacaagaca 15961 agagaaagga caagcttggt gttttacaaa ttggcaggag caagattata gtgatttgtt 16021 ggcttggcac aatctggcgt ggattgaccc gatgttttgg gatgacccag 
aaattgaagc 16081 ttggttaaag cagggtcgaa attttagttt gggcgatcgc cagcgaattt actcaaaaca 16141 gcgagaaatt ttgagtaaaa ttgtaccgca acatcgagca atgcaggatg ccgggcaatt 16201 agaagtcaca acctcgcctt acactcaccc catcttgccc ttactagccg ataccaactc 16261 tggtcgtgta gcagttcata atatgacatt accagaatat cggtttcagt gggcagaaga 16321 tattccccgg catttgcaga aagcttggaa tttgtacata gatagatttg gaacaacgcc 16381 acgtggtttg tggccctcgg aacaatcagt cagcccagag atattgccgt atattattaa 16441 tcaaggattt aagtggattt gctcagatga agcggtatta ggctggacgc tgaaacagtt 16501 tttccatcgc gatggtgcgg gaaatgtgca aaacccagag ttgttgtatc aaccataccg 16561 cttgcaaaca gcagcaggtg acgtgtctat tgtctttcgg gaccacagat tatcagattt 16621 aattggtttt acctacggtt ccatgtcacc aaagcaagca gctgcggatc ttgtggggca 16681 tctgcaggcg atcgcccgaa tgcaaaaaga tcgccaaagc gaacaacctt ggttagtgac 16741 catagccttg gatggtgaaa actgctggga atattatccc caagatggca aacctttctt 16801 agacgctttg tatcaaagcc tcagtaacga acagcgtata aaactcgtta ccgtttcaga 16861 atttatcgaa aagtttccac caacagcaac tatcccagga gaacaactac acagtggttc 16921 ttgggtagat ggtagcttca ccacttggat cggtgatcct gctaaaaatc gcgcttggga 16981 ttacctaacg caagccagag caactttagc gaatcatcca gaagcaacag aagaaaacaa 17041 ccccgaagca tgggaagctt tatatgcagc agagggttct gactggtttt ggtggttcgg 17101 tgcagggcat tcctcaaatc aggatgccat ttttgatcag ttgtttcgag agcatttgta 17161 cgcaatatac aaggcgttaa atgaaccaat accaccctac ctccgccaac ctgtagaagt 17221 tcatgaggca aggactgatc atcgtccaga aagctttatt catccaatca tagatggtaa 17281 aggtgatgag caagactggg acaaagccgg acgcatagaa cttggtggcg cacgaggaac 17341 aatgcacaac agcagtgcca ttcagcggct ttggtacgga gtggatcact tgaatttcta 17401 tttacgctta gacttcaaaa cagcaattca gctagggcag gatgtgccac cagaattgaa 17461 tctactgtgg tattatcctg acaaaacgat gcacaacagt cctgttcctt tagcagaagt 17521 gccagataca tctccaatga attacctgtt ccaccatcat ttggagatta acttgctgac 17581 gcaatccatt cagtttcggg aagctgggga ggattatcaa tggtatccgc gtgctagtcg 17641 tgctcaagtc gctttcaaca aatgtttgga attggcggtt ccttgggcag atttgcaaat 17701 tccgccagat tatcccttgc gcttgatttt 
ggtgctttcg gaggaaggac gtttctgcaa 17761 ttatcttcca gaaaatgctt tgattccaat tgaagtacct tgagtcacat cttgcacgat 17821 caaaatgatt gcaattttgt agcctgaact tatgattctg tagggtggga actgcccacc 17881 ttactttttt tgagaaggtg caacaaaggt actgctaaac caaaaaagca attaccaaag 17941 tcagcttgac cattttactt ttctgctttc aggtgttcac cgaattaact ccaccgaaga 18001 gttacccccc aaaccccgtg gaaaatcctc tggttagaaa ggtggaaaaa tatgtcaacg 18061 tagaattact tatatgttat aaaaaaaaag actcctaagg cacgtaaacg aaatctggag 18121 tctgcttaaa tatgtagttt atattacctt tcaatccacc ctttattagc ccagtggcac 18181 tgggtcgttt ttcatgtcca gtttaaacag aagatgtttt agaacgcacc atcaaacaat 18241 catattcagt aacataaccg tcaaaattac gaaaggcagg aaataaatca ataagctcaa 18301 aacctaggga ttcataatac cttaaggcat ccaaatatga aggtatgtta atgtaattag 18361 gtcgcacaga gatttcagat tgcaaacaca gaattttatc tatgcactta cttgcaccct 18421 taacaacttc aatgtcatat ccttgagtat ccatctttaa aaagatacgc ggttcaggca 18481 ctaaagtgag aatatcatcc aaaagagagt ctagaggctt gattttgact tcgcaggaat 18541 taacagtttt tggcatacta gcattaggca ctagaaatga actcagttct gaataggtgt 18601 tcaaattaaa ggtcgccaaa gcatcctcac ttccaagtcc caaatcatat cctctccaaa 18661 gggtatcgtg tttaaagttc ttttgaagag tgggaaatat ttctggatgt ggctcaaaac 18721 taagaatctg tcctttatag cctagtttcc tgatgttttc tgcaaagtaa ccgatattag 18781 ctcccacatc aaggacacag ttaattctta atttcttcaa aagagtcact aagtaagttt 18841 ggtaagaaaa tcgacctatt ttttcttcta cagatttggg tattttatag cctgccgaaa 18901 tcaaagaata agctatgtct ctcaacattg tgatgacttc ctttagcaac ctgggtaaat 18961 gtgtcaacaa tcaacacagg aagaatttta gcagcgattg gtactcctgc caaatctcct 19021 aaacctcgtg gaaaatctcc tgtttggtca ccatgtgaac cacgccaacg tagaattcgc 19081 gcttccagtg gtcaaaaaag gtactacacg atgaaagtcg taaatgtcaa aaacggctta 19141 agcctttaat atatcatctt tgaggttttc agttcgttaa cgagcgcact cctgggttat 19201 tttgttgtta tttagtatca atttcaacaa gtaagcattt ttcagttcgc gaacgcgagt 19261 gcttgcgcca taaggcgctt agctcctcgc tttgggaggc tcgcccgctt agtcttaagt 19321 tgagtaaact ccagttctaa gtaactccac ttaatgaaac gtgaaatctc catggcgttc 19381 aacccaagcc 
ccttcttaaa agtgcaagct ccctgtagac gcctcccttg cgcgtgtccc 19441 tttgggactg gtgtaatcgc aaagactcta ttttacgttt tttaatgttg acctgacttg 19501 aaaaagtgcg gagtaaaagt caattttcat agctgctgtg gttttgagtt catgatagtt 19561 acttataagc aaaaatctct atgtatatcc gcgtataact gtcttgaaaa acaaaaaact 19621 atgaattttg ttataaattt attttcagta gtactgatac caattggcaa aattatcagg 19681 acaaaaatat tccgattttc ttgacatttt gtataataag agtaacgatc aatcatcatg 19741 atttttttta gggataactt aagcgtaatt tatcacgcag tcgtgcctac caatgatgat 19801 ctgctgcctt aatcctgatt gttcaaatcc cctaaatccc agtggaaaaa agttttgcag 19861 aacgtgcagc actccactga taccactgct acgaaatcgt ttccatatta ttaaacttct 19921 ttctgatgag ggcggatttg gtagaactta tttagcagaa gatgtagata aattaaatga 19981 acgatgtgtt attaagcagt tagctccaag aatccaggga acctgggcac tcaaaaaagc 20041 aatagaattg tttgaaaaag aagctcagcg gctacaagaa cttggaacac atcctcaaat 20101 tccaacgctt ttggcttact ttgaacaaga taaatacctg tatttggtac agcagtttat 20161 tgatggtcaa aatttgttaa ctgaactcca acaaaagaag aagtataact gtagtgaaat 20221 taaaaaaata ttgctagatt tactacctgt tctcaagttt attcatgagc aaggagtgat 20281 tcatcgggat attaagccac aaaatattat ccgccgtcaa acctctctat cctcaaaaac 20341 agcggtttcg gaaataggca gaaatttagt cctgattgat tttggttctg caaagcagtt 20401 aacagcacaa gcacagatga aaatcgggac ttccattggc tcacaaggct actccccgat 20461 tgaacaaatc agggacggtg ctgcttatcc agccagtgat ttgtttgctc ttggggcgac 20521 ctgctttcat ttactaactg gagtttcccc ttttaagtta tggacagaac atggatatag 20581 ctgggtaaaa gattggcaac agtacctgaa aaacccaatc actgaagaat tagcacaaat 20641 tctcgacaag ctattgcaaa aagacataga ggatcgctac caatcagctc atcaagttct 20701 tgctgatttg ctcatcaagc accaaaaaca atcacaatca acagctgtaa ctcaaataaa 20761 gcagatcaca aagttatcaa ttactcagct gcaaactgta tcaaaaccat atcttttgtt 20821 aagaaatttg ctcttagctg gcagtgccat tatactattg ggttctggag aattttggta 20881 tcgacaattt cataatctag aaatgagaat atccgctagt ttgagtcagc caaatagtag 20941 cgcgacaaat cgagaaatga tttttcatca gggtaagcag gctcctttag aaaacttttt 21001 attagcctcg actgtcaaag gatataccaa ctcaattttg tccgtcgcta ttagcccaga 
21061 taataaggca attgctagta acagcaatga tactattaaa ctgttaagtt tagtcacggg 21121 acaggaaatc tccactctta gcggtcatac caatacagtt aatttcacaa gcttcagtcc 21181 agacggacaa atcctagtga gtgcgagtga agataaaact ataaaaattt ggaatctggc 21241 aactggacaa caaatgcgca ctttggaggg gcatactcac tcagttaata cccttgcttt 21301 tagtcatgat agtaagatac ttgcggatgg tagtaatgac aacacaatta aagtttggaa 21361 tttggcaaca gcagatgaaa tacgcacact aagagggcat tctagctcag ttcgatctgt 21421 ggcgtttagt cccaacgaca atacccttgc cagtggcagt tttgataaaa ccattaaact 21481 ttggaatttg gcaacaggac aggaaatccg cacacttgaa ggtcattctg gcaaagtgac 21541 ctctgttgcg tttagtcccg atggtaaaat acttgccagt ggcagttttg ataaaaccat 21601 taaactgtgg aacttggcaa caggacagga aatccgcaca cttgaaggtc attctggcaa 21661 agtgacctct attgccttta gtcctgatgg taaaatactt gctagtggta gttttgataa 21721 aagcattaaa ctgtggaact tggttacagg acagcaaatc cgcacactcg aaggtcattc 21781 cgatgggatt caatctgtcg cttttagttc agatggaaaa actcttgtga gtggaggtaa 21841 tgataaaact attaggattt ggcaaacgtc tctttaaaat tttttgattt gaagactttt 21901 ctgttgtcaa cgttacagga aatacgcaag agtctaatca caaatctgga atattatcgc 21961 ttcaatcggt aaataagtat attcctacaa aagaatgatg ctctctattt cacagaggta 22021 gcaacatgag agcagcacaa ctttaaattt tgcaagggtt ctgtgaacat gactgattca 22081 aaagttaaag tgttgacgag tgtggcgatc gcgcttttag gatttgggtg tgtttggtat 22141 ttacaatctt cccctgctgc tactactagt gcatcttctc acgagagtga atctcaaaca 22201 acacatgacg aaaatttttc tgaagccttt acccttgcgg gacactcaga ctcaatttta 22261 gctgttgcta ttagccctga tggacacact cttgtcagtg tcagtggaga caagactatc 22321 aagctgtgga atctcgatac aggcaaggaa atccggactc tagtagggca ttctgactgg 22381 gtgaattcag ttgtatttag tcctgatgga aagactctta tcagtgcaag tgctgataga 22441 actatcaaag tgtggaatct ggatacggga aaagaaattc agactttgac aggacattta 22501 gcttcagttc aagctgttgc tattagccct gatggttcaa cccttgctag tggcagttgg 22561 gaccagacta ttaaattgtg gaaattggct acaggtaaaa tcattcgcac cctcaaaggt 22621 gggtgtgacg tagttaacac cgttgccttt agcccaaatg gaaagactct tgctagtggt 22681 aactattttg acaacagcat taatttatgg gatgtagcca 
caggaaagga aactcagacc 22741 ctcagagggc actctgaagc tgtttcctcc cttatcttca gccctgatgg gaaaaccctc 22801 atcagtggta gttgggacaa gacgatcaaa ttgtgggaga tcgctacgca aagagaaatc 22861 tacaccctgg caggacatgc taataaagtt ttgtctgtcg ctgtcagtcc agatgggaaa 22921 accattgcca gcagtagttg ggacaaaact atcaaacttt ggaatttggc tatgggaaag 22981 gaaattcgca ctcttaaggg gcataccaaa agagtttggt ctgtggcgtt tagtccagat 23041 ggtaaaaccc ttgttagtgg tagttttgat aagactgtta agatttggca cgttgtttca 23101 agatgatgaa tgataaatcg tgaagttttg tatcatttat cattcaaaat tcataattct 23161 tttgaaactg aagttttctc cttgcgtttg tactaaaaaa gtccataagc tttatttgtc 23221 ttttcgggca acgtgaacag cagaaagaaa cgagagttga atactgtctc caaagttctc 23281 caacttttcc ctcaactttg caaatagtaa ctccttcgtc tgcggctcaa gcttacgaaa 23341 tagtatgtgt catctacagg tagcctagcg cgatcaatca gtgcactgat aatatcatca 23401 agatttggca agtgccccgt gagtgatgtg gtagcatgtg aacgcccaat caagcgactg 23461 gactttaaac tagtattaaa atcaatttgg gatttgttaa tatgacacag ctacttactg 23521 aagaagaaat tcaagaaaag gtaagccatt tgccaaattg gacactgcag gcgtcaacgt 23581 tacaatgtac gcgccaattt aaagacttta tcgaggctat agaatttgta aataagcttg 23641 ttgaacctgc tgagtcagca caacatcatc cagatattga aatttcttac aacaaagtca 23701 agatttcact gacaacgcat gacgcaggtg gattgacaca gaaagacttt gatttggcaa 23761 agctcatttc cgaaattaat taagttagga taaaaaataa ccaaataaga agtcagaagc 23821 cactagccag aatgacgagt gtgtgagtgg tgattcacta ctacacccag gtttgccgac 23881 agaaatatcg agtgtttaaa gcaccgtctc ggtaggagac gctactgaat cccccggatt 23941 tatccgtggg gaactttttg ctgaattcat atcgctgaat tcttttcaaa ttttttgtgc 24001 aagatatttt gagtggggaa aaagttccca gagaggatat ttttgacatc atgttatatc 24061 taaatcgatt tttggagact gaagtcagag gtgtgacagt ttagaaggcg atcgcttcaa 24121 tcttgtcatc catttttcct aatttataat tgcaagtaaa tagaaatacg tttttgcata 24181 agttaattgc attgtttttt tacctaaact taactgtttg ttagcttggc aaaaactaaa 24241 gtagtgtact gggccagcat gcctaatcca caagtatcaa taaaaacttg caactttgaa 24301 aacagttgca gcaaaaaatt tatctataca aaggtgtgag gagaacagga aacatgtcta 24361 atctcttgtg gaaatctcta 
gtaaccagtc cagcagtttt gggagcaacg ttgttggtgt 24421 ccacaacggc gatcgcagcc ccaaagaaca tcagccatga agttgtgaca acagaagttt 24481 cacaaacgag caataatcaa gaagcttcta gttcagtctc aacaacttca acgacaacaa 24541 ctgcgactgc agtttccccc aagccagaga cgttgattca agcacaacaa acaaaagtca 24601 acgtttcgca gcaagtaagc agctacagca acgaggggaa caataattct cagtcccaag 24661 tgacatcggt ttcccagttc tccgacgtac agccaacaga ttgggctttc caagcgttac 24721 agtccttagt tgagcggtat ggttgtattg ctggttatcc caatgggacg ttccgtggga 24781 accgtgcttt gactcgttat gaatttgctg caggtttaaa tgcatgtcta gaccgcgtta 24841 acgaactcat tgcgacagca acatccgact tagtaagaaa agaagatctg acgactttgc 24901 agcgcttaca agaagaattt tctgcagaat tggcgactct acgcggtcgt gtagatgtat 24961 tggaagcgcg gtcagcagaa ttggaagcaa accaattctc aacgactacc aaactcgttg 25021 gtgaagcaat cttcgccctc agtgatgctt tcggcgatac agtaggcaaa aacaataata 25081 ctgtcttcca gaacagagta cgtttagatt tgcaaaccag cttcacaggt aaggacgttc 25141 tgcatacgcg tttggcaact ggtaacgcca gaagattaaa tacaggtggt aatgtagacg 25201 tcaatagaaa cggagttata gacacctcag agcaaaatgc tgaaggcttt caaacgttta 25261 accttaatgg agatagtagc aacagcaatg acattgtcct tgattggttg gcttactacg 25321 tccccatagg acctgcccaa ctttacgttg cagctactgg tggtattcac agcgattatg 25381 ctgctaccaa taacccctac tttgaagact atgacggcgg taacggcgct ttgactacct 25441 tcggttctga aaaccccatt tatcgtattg gtgggggtgc aggtgcagcg ctcaatattc 25501 cctttggtaa aggtggcggt attctcaaac caagttcgct aacagtgagt tacttgggat 25561 ctgaaccgaa taatccaggt ataggttcag gtatattcaa tggtaactat gctgctttag 25621 gacaattaaa ctttaatctt ggtcagcgga ttgcgttagc agccacctac gttcacggat 25681 atcatggtgc tggtagtgct ttatttgatg ctggtggatt ccaaggagca aatgtccctg 25741 ttgtaggtac ttcacaagct aacgctctga gttcgaccaa tgcatcttcc agcaactcct 25801 atggtttgtc agcagcgttt agaccgagtg acaaactctc tgttagtggc ttcgtttcct 25861 atcacgacgt aacaggtttt ggtagaaatg atgactatga agcttggagc tacggattgg 25921 gagttgcctt acctgacttg ggcaagaaag gtaacgtctt aggtgttttc ggaggtgctc 25981 aaccttatgc ccttggcaga gtagctggtg ctaacgctgt cccatatcaa atcgagggtt 26041 
tttacaagta tcgcgtaagt gataatgttt ccattactcc tggtgtgatt tatcagatat 26101 ctcctggtca gaatagcaat aacgataatg cgtttattgg aaccctcaga acaacattca 26161 ccttctagat agttgacaat atttatcaga tattgtaagc ataataattg ctccgcagat 26221 tgcagcgggg ctttttatat ttttagctaa attgggcatt catctgatcg taaccgttca 26281 gggggttagg aatgcgatcg taaacaatag caaccaaatt ctggtaatct cagaggtatg 26341 gaagaaaata gccctgccta cagacagcat atacctcaac aggattggga aagaactcca 26401 cctagcgttc agaaactggt ggaggacatg tggcagtgca tagaaaaatt ggaacaacaa 26461 gtagggatgt tgctagaagt tcagcagcaa ctgttagaaa aaataaattg cacatcgaaa 26521 aactcatcat ctcccccatc aactgacccg cccaatacgc caaaaacaca gagaaagcaa 26581 aaaagtggta gaaagcgggg agggcaaaaa ggtcacgaag gtcatggtcg ttcattgtac 26641 ccagcagaaa gatgtgctcg tatagttgac cacataaata ccctttatcg aagcgaatat 26701 aaaaagtata aaaaattacg tatttagccc gatagccgga taccaaaaat ccccacacct 26761 cacaaggcat agggaaatga ttgggagcta tcgggaaagg tgctcggtaa tagtgcttca 26821 gagagcaaac ctacctcaag cccttaaaca cctttaaatc gacccgtagc agtgggaaag 26881 cgaatcaaca agaaccatta ccttcgattg gtttacggtt agagtcacat ataaaacagt 26941 ttcccgattg ctctatcacc atgtaattgc cccctttaat ttggttctta agaccaaaat 27001 ctagactaca tatttcgcca ttattatcct gtcttctgac agagacattg ttaacattcc 27061 agcttaagct ttcacttctt tgccctggcg caattgatac agatgtacca gaacttccaa 27121 ctacgaccgt agtgcccgta ttattgtaaa ttcgagctgc ataacaaggg gagacggttg 27181 ctagacatat tactagcatg acgatcatca ctttcagaat tcgttttaca ttcattgtca 27241 ttatccttat cagttgaggc tataggacaa aagctcttat agctatactc tataaatagt 27301 tttatgtgaa attaacttat gcactagtgg tctgtcatca cttgagccaa atcacaatag 27361 taatgaggta aattgcgcca agaaagttga atttacactt cttttagaga gtataaatgc 27421 ttattttaag agtaaggtga aatagtaaaa acccccggcg aacgccctgc cacagccccg 27481 acaaacgcca caggacgacc gcccggaagt agttttgttg ctcggatgtt gactgtggtc 27541 actactctca agtctcaaca acgtaatgtc ctggagttta tgacacacgc ggttgttgct 27601 gctcgtgcag gtaaacctgc tccttccttg cttcctcagg tgacctcttc ttctggcgat 27661 cacgacttga ttgcggctta attttgttac ttcatttttt cttcaacgtt 
aggatattgt 27721 ttcttatttg taccccctga acggttacca tctgatcact gtttcacaaa tgcagctatc 27781 acagagactt tattttatag ccctcctgcc accgcaggaa attcaagatt acgcaaatca 27841 aattaagcag tactttgctg ataaatatgc tagtcgtcac gcccaaaaat ctccaccgca 27901 catcaccctg caaccgccct ttaaatgggc agatgctgac gtaccaagac tagaagaatg 27961 cctgaaatat tttgccagtg atcgagagtc tgtgccgatt acactgagtg ggtttggtgc 28021 ttttgcccct cgtgtgatat acattaatgt tgtcagaagt ttagaactga tgactttgta 28081 tactgatttg ataatgtata tggagagtaa cttgggaatt gttgacaagg ttgggaaaac 28141 tcgtcccttt gctccccata tgacagttgc gtttagagac ttaagcaggc aaaactttca 28201 ggcagcttgg tctgaatttg aaaagcaaca gttacagttt gagtttactg cttctgactt 28261 aaagctgttg ctacacgatg gtaggcggtg gaatggtgaa gtaccccatc ttgccttcgg 28321 ctgaagatgg ggcttccccc aaatccaact // LOCUS NODE_1042_length_27801_cov_5.134614 27801 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 27801) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 27801) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..27801 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..499 /locus_tag="DP116_08880" CDS <1..499 /locus_tag="DP116_08880" /inference="COORDINATES: protein motif:HMM:PF00550.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08880" /translation="RRKNTRVVHPCRGFAQPLTVTGEIDREQLRLLSRGRDTQTRIMP RNELERQISLVWQEVLGIGQIDIHSNFFELGGSSIKAIILVNKLEKQLGRNFHFTLMI EAPTIAQFSSYIQNNYPELNSRVQGSNVGTTNSKTTVLSEKTNVTQIAKTLITVATDI EEGEI" gene 496..1530 /locus_tag="DP116_08885" CDS 496..1530 /locus_tag="DP116_08885" /inference="COORDINATES: protein motif:HMM:PF00975.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08885" /translation="MKTVDRLLTDLEKLEIQLWLDEEVRLRYSAPKGALSSQLRDELR ERKAEIIEYLHHKHQIAKFPPIESILVKIQPAGTKPPLFCIHPAGGNVFWYLELSRHL GLDQPLYGLRPPNLYGEEEPLNSIEDMATAYIKAMQSVQPQGPYHLAGSSFGGLVIYE IAQQLQASGQEVSFVGMLDMALLTSTDLKNQIERVGNEYYLVLTSFADSTAGALGEYS QQIIIDELRSFSELDAQLNYILQKLIKLKLLPVEFTFEQFRHLFDVFEGNVIAGMRYT IKTYPGKVVFFRATEDIIDLFHEHQDSTRGWSKFALGGVDVHEIEGNHYSMLRSPVLA EKIKPYLVKS" gene complement(1945..2127) /locus_tag="DP116_08890" /pseudo CDS complement(1945..2127) /locus_tag="DP116_08890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315473.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" gene complement(2229..3314) /gene="recA" /locus_tag="DP116_08895" CDS complement(2229..3314) /gene="recA" /locus_tag="DP116_08895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865541.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="recombinase RecA" /protein_id="PRJNA477356:DP116_08895" /translation="MAVNTTDNAGKQKALNMVLNQIERTFGKGTIMRLGDATRMRVET ISSGALTLDLALGGGLPKGRVIEIYGPESSGKTTLALHAVAEVQKNGGIAAYVDAEHA LDPAYAGALGVDTENLFISQPDTGEAALEIVDQLVRSAAVDIVVIDSVAALVPRAEIE GEMGDAHVGLQARLMSQALRKITGNIGKSGCSVIFLNQLRQKIGVSYGNPETTTGGNA LKYYASVRLDIRRIQTLKKGTEEFGNRVKVKVAKNKVAPPFRVAEFDIIFGKGISTLG CLVDLAEETGILIRKGAWYSYNGENISQGRDNAIKYLEEKPEFAEKIKQLVREKLEMG AVVSANSVSKTSEDEEDEEELELVEEE" gene 3614..4876 /gene="xseA" /locus_tag="DP116_08900" CDS 3614..4876 /gene="xseA" /locus_tag="DP116_08900" /EC_number="3.1.11.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317265.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="exodeoxyribonuclease VII large subunit" /protein_id="PRJNA477356:DP116_08900" /translation="MIESDRPWDLIAETALSVASLTDYIRFLIEQDEELQRVWVTGEV SSANHHRSGLFFTLQDPDSSAAIKCVVWNSQVTKLAQIPIRGEQLIILGSIRIYRERG EYQLSVWQALPAGVGLQALRHQELRKRLEAEGLFDSQRKRSLPPHPQTIAVVTSPTAA AWGDIQKTLKQRYPGLHILFSPATVQGEQAPESIVNAIRRVEVDGRAEVLLLARGGGA VEELACFNDERVVRAVACCSIPVITGIGHQRDESLVDLVADAALHTPTAAAERVVPAL AELYNQHQQRVVALQQSVRFSLETAQKQLQEKRNRLRRLRLDRQVQQEIQELAWKRQQ LVRATTTQLQQATQHVEMLRQKLATLDPKAVLQRGYAVVRQQNGAISRRLDATRIARS AAELAVGDELLVQLGQGEVKVRVREVKD" gene 5183..5392 /gene="xseB" /locus_tag="DP116_08905" CDS 5183..5392 /gene="xseB" /locus_tag="DP116_08905" /EC_number="3.1.11.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317266.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="exodeoxyribonuclease VII small subunit" /protein_id="PRJNA477356:DP116_08905" /translation="MNQGWNYEVMVAEIERIIARIEAGDLELEEVFDQFAAAVEYLRQ CESFLQERQQKVDLLIETLVVKSEE" gene complement(5592..6164) /locus_tag="DP116_08910" CDS complement(5592..6164) /locus_tag="DP116_08910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748510.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08910" /translation="MSNDLNSYLPAPTKQRFAATFGVVRPISFWVQLALGAVSSLALL LAIFSRSSTVQTTTNSVMGFGVFLGIIGILVLCFRLYWVSRYKRLDKLLQSPNRELHP KKEEVIQVLQTGLIVSLIGLLLAFLASEVTVIAVLSKSLALPQGVAVYRPENVIRSLD LFVVLTNVNLIGAHFFGSMTSLGLLNWLDQ" gene 6894..7670 /locus_tag="DP116_08915" CDS 6894..7670 /locus_tag="DP116_08915" /EC_number="3.1.3.24" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878264.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sucrose-phosphate phosphatase" /protein_id="PRJNA477356:DP116_08915" /translation="MVKLTREVAVAKFLFVTDLDNTLVGDDKALLELNDRLHATRQEY GTKIVYATGRSPLLYQQIKDEKNLLEPDALILSVGTEIYLDGSHTPDSEWSEKLSSGW NREILVSTTTAFSELVPQPDSEQRPFKMSFFLQEESAAKVLPQLESELQKSGLDVKLI YSSGIDLDIVPSGSDKGQAVQFLRQKWKFVAEVTVVCGDSGNDIALFSVGSERGIIVG NARPELLQWHNEHPANYRYLATNFCAGGILEGLKHFGFLE" gene 7667..8302 /locus_tag="DP116_08920" CDS 7667..8302 /locus_tag="DP116_08920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743230.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Holliday junction branch migration protein RuvA" /protein_id="PRJNA477356:DP116_08920" /translation="MISYLKGIVAAIQNNSGHRYTLTLEVNGIGYDLQIPARLAQQLP NTGGEVQIFTHYQIREEVPLLYGFGSPGERDVFRHLLGVSGVGAAIAIALLDTLELPE LVQTIITGNLQLLIQAPGVGKKTAERICLELKGKLVEWRKTAGFFVATGSPAPGIIEE VQMTLLALGYTANEVSHALHVVSEDIGLPKDAYVEEWIKQAIAHLSSEYSQ" gene complement(8494..9357) /locus_tag="DP116_08925" CDS complement(8494..9357) /locus_tag="DP116_08925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743856.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08925" /translation="MQASDYDKIIHTAFMVSKARQFTDIPYAKELAQLVEAQGLVELS EPQNQDKSALLTARVEARYKAINQVMAQYQIGQVLELASGLLPRGLFMSCHPNITFIE TDLPRMIRCKQQLVEQLVGERPNLHFLSIDATSRPSQFLKSAELLKAGQPIIILCEGL LTHLNMAEKQLVCANVREMLQHYGGVWITPDFIHTASLTQSQEFDASLQKLLQTGTKL TGRSLVDNNFATLEQARQFAYEQDFRVAEYSMLNVMDHLSCLKILGIDTEVVRKMLAL WSIFALTLDVA" gene 9507..10670 /gene="bioF" /locus_tag="DP116_08930" CDS 9507..10670 /gene="bioF" /locus_tag="DP116_08930" /EC_number="2.3.1.47" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317271.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="8-amino-7-oxononanoate synthase" /protein_id="PRJNA477356:DP116_08930" /translation="MPTNPYAWIEESLTTIHRADWYRSVQTIHGLPGATVLLEGREVI NFASNDYLGLAGDQRLIAAATIATQELGTGSTGSRLLSGHREIHRELERAIASLKQTE DALVFSSGYLANLGAITAIVGKRDLILSDQYNHSSLKNGAILSGATIVEYPHCDVEAL RIKLSQQRQNYRRCLIITDSVFSMDGDLCPLPALLEIADEFSCMLLIDEAHATGVLGK TGAGCVEHFGCTGRQLIQIGTLSKALGSLGGYVAGSTTIIDFLRNRAPTWIYTTALSP ADVAAALTAIKIVQQEPQRRVQLWQNVAQLKQVMQQQLPKLKLLPSSSPILCFELSSA ADALKAGQHLKSAGIFAPAIRPPTVPTSRIRICIMATHELTHIEKLVEALSSI" gene 12197..12832 /locus_tag="DP116_08935" CDS 12197..12832 /locus_tag="DP116_08935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317272.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08935" /translation="MNIIARLLSTFAISISAVAVMPYFAIAQETLTAPEQQTTQRVCS SDSVENLLPPPVSQESRSPLSYLGEEGFTQNPDGSWTCYVSDSRKQGRYYTLFKVQQI NGKLVASSFLDGGILVEGQDNRSLDFFMMLIEKHTKANQGNRESIRRYLDAFFSFVKQ GKIQLSNRGYLFDQPSGAVVIYHPVTGGKLKGAAITINISSPENLSSSPVS" gene 12865..14142 /locus_tag="DP116_08940" CDS 12865..14142 /locus_tag="DP116_08940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08940" /translation="MKSKVKTIVVTLSSLATVVNTSQSASAQLKVGNYGIQQGLEHNY LQYQISGRPLNQMRGIPACSVGFGAACNKAGAVFQKLVESNGGPTQEQLLIQAAGGEE NYQNFAKFYGNDPNLTQIPYASFWRNDDPNIVDGYRYLVGQTVNQTPQEGLGQVTNNF YWAPQGSGNSLDARNGLLDLKYSYGRLLLEEVAKIPDAQQQIQSLGLAPELTKFYSEK LSSSMRVLNSGNEESLKEAILKDLSIPYSPDGAEIGRPNLGIPPTNSFTEETLAGDVV PWETAIALDPEGVNVELPPSLAEVALFPQTGESSFPTGWLAGLPVLFLLLLAFGGGGD SSSGRGSSAVASTPPISAPPSGGGSIPPSGGGGGSGGYNQIIEVPSKPPVTTPPGQEV KKVPEPATITPLVLLIIVLYVLNHKQWRIQTRG" gene complement(14283..14768) /locus_tag="DP116_08945" CDS complement(14283..14768) /locus_tag="DP116_08945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_08945" /translation="MNKFIPLIISGVIAIGAVGCESPSKTSADAPSGTNENVKAPTQE TAQKTQEDATTQVRKDQIESDIRAREQRNNVNGGDAKRNDDDLKSEVRGKLEANLPAS QLTVDAKEGSVTVAGTVPTQEQLNKIPTLAQQIKGVKSVKVNAKVAPAQPSSNTNNKP Q" gene complement(15084..15722) /locus_tag="DP116_08950" CDS complement(15084..15722) /locus_tag="DP116_08950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863937.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1269 domain-containing protein" /protein_id="PRJNA477356:DP116_08950" /translation="MALGKHNKRAVGVFPSRREAEYALTELRDAGFPMNKVSIIAKDA DRTGDIAGIETQQRIGNKADEGAAAGAVTGATLGGITGLLVGLGTLAIPGVGPILLAG EIATTLATAAAGAGIGAAAGGLLGGLLGLGIPEERARLYNERVSRGDYLVIVDGTDDE IRRAESVLTNQGIQEFGIYNVPGVVDTDTDYTGGVVNNEPSVTIVDRRDTTV" gene 16056..17531 /locus_tag="DP116_08955" CDS 16056..17531 /locus_tag="DP116_08955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197511.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_08955" /translation="MYKWILPSLSEVLANSQSIVAACSSAASEQQWRVSVAATEQLLL NILTTASPEEATQGLVLAAPAPVFSQPRLAGSLQTVSFTAKPFNPLALMPFQMPPDVG VVDAFASSESVLPVLETDPLAREQFCLVFTKQFRLVLVLAEDINGNKTFSFSFEPEVV ELAWRSLGARVVVSNPDLFADLQELVHKYSPVAPDYRVVMEFTRLLLTQFPEQEENTE IAKSGDVGTRARPECVETVPASPHLSLGASSQGHTRPDVELLQAFAHEVRTPLTTIRT ITRLLLKQRDLPANVIKRLELIDRECTEQIDRMELLFRAAELQTSTTVKSTNTQLTAM SLEQVLQQSIPRWEQAANRRNLTLDVVLPQQLPTVVSNPAMLDRILTGLMENFTRSLP AGSHIQVQVIPAGDQLKLQLLPLTQAGDNGKMSGSPCTPPIRKALGQLLMFQPETGTI SLNLNATKHLFQAIGGKLIVRDRPRHGEVLTIFLPLEVTSQ" gene 17669..19624 /locus_tag="DP116_08960" CDS 17669..19624 /locus_tag="DP116_08960" /inference="COORDINATES: protein motif:HMM:PF05419.10,HMM:PF08357.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08960" /translation="MTEQIQTPPPPKVFISYSWDSDEHKNRVLGLANSLKNYGIDSKI DRYQQSPREGWYRWMMNQIEESDFVLVVCTDKYNLRYQNKEKRGQGRGATWEGGLIIA DLYEAQGMNDKFIPILLSPEYEKDIPSSLKTYTVYRLFDPKDDPKIPGGFQELYRRLM DQPECEEPKLGKPIILPPIQPAISNVGGDGEIIKYEDPDNTQKTEQIQTATKVFISYS WDSDDHKENVLKLANTLRAVWGIEADIDRYVRAEPPYTPVKGWDLWMSERIKWAEFVL IIFTETYQRRFEGNEEPEKGLGVSWEGTIIRNDLYNAQLRDTKFIPVVFSQSDLDYVS SVLNPRDKYILTDDKSFTELCYRLRKQKTIIKPEIKGGPLPLPSDPVLFSPQKPPVKS LEQQQTPEIEYRKKVEEYANGVSLSEPYDDIDEIHKTILETIGLDTDKAKAIEDDILQ SRRQRHEIYKKNLQKYQNLLMKNLEKEGFFSDDILSNLKQLQNNLGLIDKDTEELLLY AQLEYSLKKGSWIEADETTTNLLLKVANVKQNYFNVTDFRKIPCKDIRKIDELWTQYS KGNFGYSAQIDIWQQVNGELLDFFVALDWGYKEDHRFVYTENFKYNIRNHPKGHRPVS VLWEGGTHETRQAYINRIQQCCTGLVP" gene complement(19818..20312) /locus_tag="DP116_08965" CDS complement(19818..20312) /locus_tag="DP116_08965" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08965" /translation="MPSNDLDRVQEDRDFLIKQSEMLQSVIKRMADNSLEAKKFGLTV WAAIIGFGFQNRNPILFILAFVSFTLFGLLDIYYLYMEREFRKNFNRLVRIIGGYASN EDYQWVEQMKIKQRNFLIPDFSPDFFRQISSKDSVLKSWANLPYLITFLITIVLMYVP LPSK" gene complement(20464..21459) /locus_tag="DP116_08970" CDS complement(20464..21459) /locus_tag="DP116_08970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215685.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem reaction center subunit H" /protein_id="PRJNA477356:DP116_08970" /translation="MTSEQSIRRSDILNTQVITRDNGKRLGIVSQVWVDVDQREVVAL GMRDSLISISGVPRYMYLNSINQIGDVILVDNEDVIDDVEVDIYSNLINWEVITETGE VLGKVRGFKFNAETGKLYSIAIASLGLPQIPDQFLSTYEFSVDEIVSTGPNRLIVFEG AEERLNQLTVGFLERLGIGKAPWERDADEEYSSYTPRTVTPDKQLPSGVPLEQPKPKI VRTPEPVAQEWDEDYYEEERQERQVMKARQYEPVQYDEDDEEDNWSEATGKDKYQPQP KYDSGSYNKKPYADDYDDYDEDLARDAWDDEPPKPVNIPKKVKERQPEYEEEGGY" gene complement(21518..25192) /gene="smc" /locus_tag="DP116_08975" CDS complement(21518..25192) /gene="smc" /locus_tag="DP116_08975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chromosome segregation protein SMC" /protein_id="PRJNA477356:DP116_08975" /translation="MVHIKRVQLTNFKSFGGTTSVPLLPEFTVISGPNGSGKSNILDS LLFCLGLASSKGMRADRLPDLVNTTQTSKSRSAVEASVTVTFDLSDHEPDDELEEISS VVSIDEDEEVEKPEEAQEEETEDERNRKEKIRAKPGSEWSVTRRLRVTPQGTYTSNYY INGVSCTLTELHEELENLRIYPEGYNVVLQGDVTSIISMNPRERREIIDELAGVAAFD RKINQAKETLDQVKEKEDSCRIVETELTGQRDRLSQDRAKAEKYQKLRTEFQDKQQWE AVLSWRSLQAQQEKLATQIQDGDRTSTELTTQLTSLNSEITQKTAELEQLNAHVKALG EEELLAVQSTLATQEAERKQLQRQQKELETAAQETVKRLQQTQEEIQQHHHSLEQVAQ QQVEQTQFITSRRTERDQTRQELENSREAAAQIASASEAWVQQQTALNRQIETLLQTV EPQRTEQAQLGERNNQLQQQIQEQTELVQTLEPQIAERQAECQQVETEFNTSGEPIQN LAETLSATEQELQIQQETQKRLLQEQREKQRQLDKLEAQQAAQQEVQGTQASKIILQS GMPGVCGLVVQLGRVEPRYQLALETSAGARLGHIVVEDDSIAAAGIELLKQKRGGRAT FLPLNKIQVAKFTQDFTLRYVNGFVDYAVNLIECDRRYKDVFNYVFGNTVVFANLADA RQHMKLYRIVTLDGELLETSGAMTGGSTNQRSSLRFGTVEAAESEEVSSLKSRLTDIE RILERCSSAIHSLSSQTKQLTQEVTEARQARREQQLRLEQLQKEVKSLTAQLETTRSQ LSQNTEKLTTVQSRLEILDRELPDQEQQLQQLRHALTELEQSQTPQEWQQIQARIKIQ EQQLQQREGALREAEQKLKDLENQQQRLQEKIVVGEQRIQEYHQEQIKQQNQRTALQS QHFNLSTAIAETRAALSQMEQNLGEEKQKRDATEQELRSHTMRQQQLEWELQKLQETQ QTRREELAALQTQLRTMGSELPLPLPEVPDKVNLEELQKELRSLAKRLQAMEPVNMLA LEQYERTQKRLEELTQKLQTLEGERTELLLRIENFTTLRQRAFKEAFDAVNENFQSIF AVLSEGDGYLQLDDPEDPFNSGLNLVAHPKGKPIQRLASMSGGEKSLTALSFIFALQR YRPSPFYAFDEVDMFLDGANVERLAKMIKQQAQQAQFIVVSLRRPMIESAERTIGVTQ ARGAYTQVLGIKLQSDQTSA" gene complement(25360..25680) /locus_tag="DP116_08980" CDS complement(25360..25680) /locus_tag="DP116_08980" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_08980" /translation="MPIIGLITAEFLSLLGNQVAAVAIPILVLQFTNSPLVTGIASAG NIVAIILATVLGGRAIDRFGAWNISVTADLLSFCSVLALPLAFIYFDQLSPYQFHIWP YCYS" gene complement(25726..26766) /locus_tag="DP116_08985" CDS complement(25726..26766) /locus_tag="DP116_08985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873036.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_08985" /translation="MLVGLSTPLIVGWFFSHHLYGLHLTPLEGRGFNLGATDNELINK CEVVWNQGVWWSLLIGATVAGTLSYLFARRIVQPLIQMEKITQQFAQGNLTARIPKSE IPELNRLAINFNRMASNLEGVEQRRRELIEDLTHELRTPLTILEGSLEGLADSAIEPK TELFERLARETARLGRLVNDTQELSKAEAGYLPIKIQPIDLHPLLLSLMKRFGDQLLE DGPVLRLEYPPNPPLVSADPERVEQILVNLLGNAVRYTVNGSITLRVWSEPPKLWIAV IDTGHGMKAEDLPFVFNRFWRSERSRIRHPGGSGMGLAISQRLVMLQGGEIMVESELN KGSTFRFSLPLA" gene complement(26856..27563) /locus_tag="DP116_08990" CDS complement(26856..27563) /locus_tag="DP116_08990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_08990" /translation="MEILIVEDEVEIAQLIQLYLEKEGFSCRHCSDGLSALQVFQEYS PDVIILDLKIPGLDGLEVCTRIRQQPSFKDPYILMLTAKGEENDRIVGLSTGADDYVV KPFSLKELVARVRALLRRMRRDGKQSQIYRTQHFVINIEQHSASRILSTNQLTRLDLT NLEFELLATFMSDPGRVWNRTQLIEKLWGSDFYGDERVVDTHIARLRKKIEPDPANPT FVKTVIGVGYKFEDQVI" BASE COUNT 7894 a 5886 c 5894 g 8127 t ORIGIN 1 acgccgcaaa aatacaaggg ttgtccatcc ttgtcgaggg tttgcccaac ccttgacagt 61 tacaggggaa atcgaccggg agcagttacg gcttcttagt cgtggtagag acactcaaac 121 acggattatg cctcgcaatg agttggaacg gcaaatatca cttgtttggc aggaagtttt 181 aggaattgga caaattgaca ttcacagcaa cttttttgaa ctgggtggga gttctatcaa 241 ggcaattatt ttggtcaata aattagaaaa acaactagga cgaaactttc atttcacact 301 catgattgaa gctccaacga tcgctcaatt ttcatcgtac attcagaaca attatccaga 361 actcaattct agagtgcaag gttctaatgt gggtacaaca aacagcaaaa caacggtgct 421 atctgaaaaa actaacgtca cgcaaattgc gaaaacacta attactgttg caaccgatat 481 agaggaggga gaaatttgaa aacagttgat cgattgttaa ctgacctaga aaagttggaa 541 attcaacttt ggctagatga agaagttcgt ctgcgctaca gcgcccccaa gggagctttg 601 tcttcacagt tacgcgacga attgcgagaa cgtaaagcag aaattattga atatctgcat 661 cacaaacatc aaatagcaaa gttcccacct attgagtcta tcttggtcaa gatccagcct 721 gctggaacta agccgccttt gttctgcatt cacccagctg gtggtaatgt cttctggtat 781 ttagaattat cccgtcatct gggtttggat cagcctttgt atggcttgcg accacctaat 841 ttgtacgggg aagaggaacc tttaaactct atcgaggata tggctacagc ttatatcaaa 901 gccatgcaat ccgttcagcc ccaagggcct tatcaccttg caggttcttc ttttggtgga 961 cttgtcattt atgaaatagc acagcaattg caagctagtg gtcaagaagt ttcttttgta 1021 ggaatgttgg atatggcttt actcacatct acagatttga aaaaccagat agaaagagtg 1081 ggtaatgaat actatctagt attaacctct tttgctgaca gtacagcagg tgctttaggt 1141 gagtattcgc agcaaataat aatagatgaa ttgcgaagtt tttctgaatt agacgcacaa 1201 ttaaactata ttttgcagaa attaatcaaa cttaaacttt tacctgttga atttacattt 1261 gagcagtttc gccatctttt tgatgtattt gaaggcaatg tcatagctgg aatgcgctac 1321 acaatcaaaa cataccctgg taaagttgtt 
ttcttccgcg caacagaaga cataatcgac 1381 ttatttcacg aacatcaaga ctcaacaaga ggctggtcaa aattcgcact tggtggggta 1441 gatgttcatg agatagaagg aaaccattac tctatgcttc gtagtccggt tttagccgaa 1501 aaaatcaagc cttatctagt aaaatcttag tttcagcgtc aattaaccgt cttttgagct 1561 tccgctccct attggaaagt cagaagagtc aatgggcgtt aactcttttc tcccctgctc 1621 ctgtgctccc ttgcccccct gcttgccaat acccgcaaat ttcttatccg aaccgtattg 1681 ggacttgata ttctttcagt cctaatacca attcttgagg catttatctg gcaccatctc 1741 aatttgaggc aggatttact tccattgccc acactcaaga agatgttgag cagacgctag 1801 aagtcgcacg ggatgtcatg tcaaacctgt aaattggata ctcagtactg aggtagaaaa 1861 attggggact aggaattttc ctttctagtc cccagccttt aattctcagt ccttttttaa 1921 caaccgcagc cgttatgacc gcagcctttc ataacccgca acttttacat catagatagc 1981 taaaggaaat agagtcggat tcacgtaagc ctcagccaat gatttcacgt tgggtgctat 2041 aaatccctgt tgggattcgt tcacttcaag actgatgcat tcataaaaat tatcaagcgt 2101 gacttcacgt aaatgaacca ttaccaaacc tactcctgta tttggttgtt catgaatctt 2161 gtctggatac agggaagtgg tatgtggagc gatatcgccc cacataccaa actgcaaaag 2221 tactagtgtt actcttcttc gactaactct agttcttctt catcttcttc atcttcactt 2281 gtcttgctca cagagttagc agaaacaacc gctcccatct ctagtttttc acgcaccagc 2341 tgcttaattt tttcagcaaa ttcaggtttt tcttctaggt acttaatggc attgtctcgt 2401 ccttgagaga tgttttcacc gttgtagctg taccaagctc ctttgcggat caagatacca 2461 gtttcttctg ccaaatcgac aagacaaccc aaagtagaaa tacctttacc aaagataatg 2521 tcaaattctg cgactctaaa aggtggtgct actttattct tcgcaacttt gactttgaca 2581 cggttaccaa attcctctgt accctttttc aaggtttgaa tccggcgaat gtctaagcgt 2641 actgaagcat aatacttgag agcgttaccg ccagttgtgg tttctgggtt accgtagctg 2701 acaccaattt tttggcgcag ttggttcaaa aatattactg agcaaccaga tttaccaata 2761 tttccagtaa tttttcgtag ggcttggctc atcaatctcg cttgaagacc aacgtgagca 2821 tctcccatct cgccttcaat ttcagcacgg ggaacaaggg ctgcgactga gtcaatcaca 2881 acaatgtcaa ctgcagcaga acggacaagt tgatcgacaa tttctaaagc agcttcccca 2941 gtgtcaggtt gggaaatgaa gagattttca gtgtcaacac ccaacgctcc agcgtaggcg 3001 gggtcaaggg cgtgttcagc atcaacatag gcagctatac 
cgccattttt ttgtacttcc 3061 gcaaccgcgt gcagtgctag tgtggtctta ccagaacttt ctggaccata aatctcaatc 3121 acccgtccct tgggtaaacc accacctagt gctagatcca gggtgagtgc tccactagaa 3181 attgtctcta cccgcatccg ggtagcatca cccaagcgca tgattgttcc tttgccaaag 3241 gtacgctcaa tttggttgag taccatgttg agcgcttttt gcttgccagc attatctgtg 3301 gtgttaacag ccattcgatc cctctatgta tctgtatcta tatatttgtg gatttgttta 3361 tggatttagc aagtggggtt tgcaagcttg attagaacta atgtactatt tttattgaaa 3421 cacactttgc gagtttagaa ccggatttaa gaggcaagtg tgacaatatc ctaacgcaag 3481 agctgcctcg tgtgggtaag taaaaataac tatgtagcag atttgataaa gaatattctt 3541 cacataatat atggtaatgg tcagtgacta ctcgcactgc ggaaaatatg cttgactgtg 3601 tatctaaact gaaatgatag aatccgaccg accttgggac ctgattgctg agacagccct 3661 ctcagtagct agtttaactg actacatccg ctttttaata gaacaagatg aggaattgca 3721 acgagtttgg gtgactggag aagtttccag cgctaaccac catcgcagtg gattattttt 3781 caccttacaa gaccctgata gtagtgcagc aattaagtgt gtcgtgtgga atagccaagt 3841 gacaaaacta gcacagatac ccattcgagg tgagcagtta attatattag gaagtataag 3901 aatttatcga gaacgcggag agtatcagct ttccgtttgg caagctttgc cagctggtgt 3961 tggtttgcaa gcgctacgtc accaggaact gcgaaaacgg ttggaggcgg aggggttatt 4021 tgattcgcaa agaaagcgat cgcttcctcc tcacccccaa acaatcgccg ttgtcacctc 4081 acccacagct gcggcttggg gcgatattca aaagactctg aaacaaaggt atccaggttt 4141 acacattctt ttctctcctg ctacagtaca aggtgagcaa gccccggaat ctatagtaaa 4201 tgctattcga cgggtggagg tggatggacg cgccgaggta ctacttttag cacggggtgg 4261 tggtgcagtt gaggaattag cttgctttaa tgatgaacgg gtagtgcgag cagttgcttg 4321 ttgttctatt ccggtgatta ctgggattgg tcatcaaaga gatgagtctt tagtagattt 4381 agtggcagat gctgcgctgc atacacctac agctgctgct gagcgagttg ttccagcact 4441 ggcagaattg tataatcagc atcagcaacg agttgttgct ttacagcaga gcgtacgttt 4501 ttctttggaa actgcacaaa aacaactcca agaaaagcga aaccgcttgc gacgtttacg 4561 gttagatcga caagtgcagc aggagataca agaacttgct tggaagcgtc aacaattggt 4621 gcgtgcaaca acaacgcaat tgcagcaagc aacgcagcat gtagaaatgt tacgccaaaa 4681 gttagcgact cttgacccta aagctgtttt gcagcgtggt tatgcggtgg 
tgcgtcagca 4741 aaatggtgcg ataagccgga ggcttgacgc tacgcgtatc gctcgttctg ctgctgagtt 4801 ggctgtggga gatgagttgt tggttcagtt ggggcagggt gaggttaaag ttagggttag 4861 ggaagttaaa gattaaaaac caatacagtt cagttagagc caaaaacctt aaactgtgta 4921 ggtagggtcg ccagcgcgaa tgacggtgag actaatgctg cgggagggtt accctccgca 4981 ggcatctggc gttagccgtc aggcgtgccg taggcatacc cgaagggctt tcccgacaga 5041 gggtaagagc gtttgaggaa cgaaacccaa catttatagg agtttgttgg gtttcactac 5101 gttcaaccca acctacagtt atccttaact gaaccgtatt ggattaaaaa ccgccaagac 5161 gccaaggacg ccaagagttt ttatgaatca aggttggaat tatgaggtaa tggttgctga 5221 aatagagaga attattgctc ggattgaggc gggtgatttg gaattagaag aagtgtttga 5281 ccaatttgca gcggctgttg agtatttacg tcaatgtgag agttttttgc aggagcgaca 5341 acaaaaggtg gatttattga ttgaaacttt ggtagtgaag agtgaagagt gaggagtgaa 5401 gagattttct ttttcttttc acttttcact atttcccatt tcttttcact tttcacgctc 5461 ataaatgcaa gaagtctaat gttaattagg tcaattgttg actttcttgt caactatccg 5521 ggaagttgct cgttacctta atttcaaaaa actttgcctc gtaaggggtt tggctcccga 5581 ttctcgtctt actattgatc taaccagtta agcagcccaa gtgaagtcat acttccgaaa 5641 aagtgagcac caatcaggtt gacgtttgtc aaaactacaa aaagatctag ggaacgaatg 5701 acattttcag gtctatagac tgccactcct tgaggcagag ctagcgattt cgatagtaca 5761 gcaataacgg tcacttcaga tgccagaaaa gctaacaata gcccaatcaa actgacaatt 5821 aacccagttt gtaatacttg aattacctcc tctttctttg gatgtaattc acgattaggc 5881 gactgtaaaa gtttgtctaa acgcttatag cgggaaaccc aataaagcct gaaacacagc 5941 actaaaattc caataatacc taaaaagact ccaaacccca tgaccgaatt agtcgttgtt 6001 tggacagtgg aactacggct aaatatagct aacaacaaag ccaagctaga aacagcgcct 6061 agtgctaact gtacccagaa gctaatcggt cttactacgc cgaaggtggc tgcaaatcgt 6121 tgtttagtcg gtgcaggtag gtatgaattt aagtcatttg acatattcgt tctcctgaaa 6181 tttagttggc aaaagtggaa agctagtatt tattcaaaaa atataggtta aaaacttgat 6241 aaatatatgc tttactaaca gccaatgcag cgcatcctgt ctaggtatca aatattaacc 6301 taagttaagt cagtctatag aaggatgtag cgagcagaca tttttttatt gtaggtagta 6361 atcaatttag acagcagaca cgcgatgcca atctgaaggc gctcccggtc accttcgggt 
6421 tcatcgacca aacgatcaac aacactgagg gcgcgaaacg gtagcctgac ccaacggttc 6481 gggagccaga ccatgatccg caacacaatt tatccgagat ttcgtccctt cagtagacaa 6541 gagagaagtt ggcaatttca tcttggtctg gctgtcaggg aattttcact tgtttgttag 6601 cttcaatcgc aagtcctaac agggctttga gggtgtggag aggattgagc ttagcagcca 6661 agccatgtag aaacggaaag atggttgtcg ccactctaaa aagctccggt gttgtacgaa 6721 tttatataga acctcaagag agtcgtgctt tttttttatt ctctgactat attaaattca 6781 gcacctgtaa atcgtttcgg tttatccaat ctcggtttga gcacaactgg tcaacttgcc 6841 tattgaaatc taaaaacaga tgagattggt gccaaatcag cataaacttt ttgatggtaa 6901 agctaacaag agaggttgct gtggctaaat tcttattcgt aactgattta gacaataccc 6961 tggtaggtga cgacaaggcg ctgcttgaac tgaacgatcg cctacatgcg acacgccaag 7021 aatacggtac caagattgtt tatgccacag ggcgatcgcc tcttctttac caacagatca 7081 aagatgaaaa aaatctttta gaacccgatg ccctgattct atctgtgggg acagaaattt 7141 atcttgacgg aagtcatact cctgattcag aatggtcaga aaaactttca tctggttgga 7201 accgtgaaat tttagtatca acgacgactg ctttttctga gttagttccg caacctgact 7261 cagaacagcg tcctttcaag atgagttttt tccttcaaga agagtcagca gcaaaggttt 7321 taccccaact ggagtcagag ttgcaaaaat ctggtttaga cgtaaagtta atttatagta 7381 gcggtataga ccttgacatt gtaccttctg gaagcgataa aggacaggca gtacagtttc 7441 tccgccaaaa gtggaagttt gtagcagagg tgacggttgt ttgtggcgat tctggtaatg 7501 atattgcttt attctctgtt ggcagtgaaa gaggaattat tgtcggaaat gctcgtccag 7561 agctacttca atggcataat gaacatcctg ctaactatcg ttacctggca acaaattttt 7621 gtgctggtgg aatactggaa ggtttaaaac actttggttt cttagaatga ttagctatct 7681 caaaggcatt gttgctgcta tccaaaacaa tagtggtcat cgctatactc tgactctgga 7741 agtgaatggt attgggtatg atttgcaaat ccctgcacga ctggcacagc agttgccaaa 7801 tactggaggt gaagtgcaga tttttaccca ttatcaaatt cgagaagagg taccattgct 7861 ttatggcttt ggttcaccag gagaacgaga tgtgtttcgc cacttgttag gtgttagtgg 7921 tgttggtgca gctatagcga tcgccctgtt ggacactttg gaattaccag aactcgtgca 7981 gacgattatc acaggtaatc ttcaattact cattcaagcc cctggtgttg gcaaaaaaac 8041 cgcagaacgt atttgtttgg aactcaaagg caaattagta gagtggcgta agacagcagg 8101 
cttcttcgtc gcgacaggca gtccagcacc aggtattatc gaagaagtac aaatgactct 8161 cctagcattg ggttacactg ccaatgaggt gagtcatgcc ttgcatgtcg tcagtgaaga 8221 tattggactt cctaaagacg cctatgtgga agagtggatt aaacaggcga tcgctcatct 8281 cagcagtgaa tatagtcagt agttggatac ctaagctgtg agagtcatct gcctcaggaa 8341 gcgacaaaac cgtgtatgaa tccttgattc acacggcttc tcagtaagtt ggtgcttgtg 8401 acgtacacct ataattccga gtgaaaaaga attaggattg agcaactttg tctgcttcaa 8461 gccatgtcta tgactaagat accccgttta aaatcaagcg acatccagag ttagcgcaaa 8521 gatagaccaa agcgcaagca tttttcttac cacttctgtg tcaatgccca agattttcag 8581 gcagcttaag tgatccatca cgttcaacat actatactct gccacacgga aatcttgctc 8641 atacgcaaat tgccgcgcct gctcaagtgt tgcaaaatta ttatctacta aagacctgcc 8701 tgtaagctta gtacctgttt gtaatagctt ctgtaaagaa gcatcaaatt cttgtgactg 8761 cgttagacta gctgtgtgaa taaagtcagg ggtaatccag acgccaccgt agtgttgaag 8821 catctcgcga acattggcac acacaagttg tttttctgcc atattcagat gtgtcagcaa 8881 cccctcacac aaaattatga ttggctgtcc cgcttttagt agttcagcac tttttaaaaa 8941 ctgacttgga cggctagtag catcaattga aagaaagtgc agattgggac gttccccaac 9001 aagttgttca accagttgct gtttgcaacg aatcattcta ggcaagtcag tttcaataaa 9061 tgtgatgttc ggatgacaag acatgaaaag accacgcggt agcaaaccgg atgctagttc 9121 tagaacttgc cctatttgat attgagccat gacttggttg atagctttat agcgagcttc 9181 cacacgtgca gtaagcaaag cacttttgtc ttgattttgt ggctcagata actctaccaa 9241 cccttgcgct tctaccaatt gcgccaactc tttggcataa ggaatatctg tgaactgtcg 9301 agctttgcta accatgaaag cagtgtggat aattttgtca tagtcactag cttgcatttt 9361 tcgataagtt tttttgaact actatcaagt agtttaaaga taaatttaca caatagtacg 9421 acttttagtt ataaatttgt tttatagtta ataactataa cttaattgta taaagtaaga 9481 aaaataacct cctccacttc tctaccgtgc ccacaaaccc ctacgcttgg atcgaagaat 9541 ccctgaccac aattcaccgt gctgactggt atcgctcagt acaaacaatt cacggtcttc 9601 caggagcaac ggttcttttg gaaggacgag aggttatcaa ttttgccagt aatgattatt 9661 tgggattggc aggggatcag cggctgattg cagcagcaac gatcgctacc caagaattgg 9721 gtacaggtag cactggttcg agattactca gcggacatcg agaaatacat agagaattag 9781 agagagcgat 
cgcatcactc aaacaaacag aagacgctct cgtttttagt tctggctatc 9841 tggcgaatct aggggcaatc actgctattg taggtaagcg tgatttgatt ttatctgacc 9901 agtacaatca ttctagtctg aaaaatgggg caattctcag tggtgcgacg attgttgaat 9961 atcctcattg cgatgttgaa gcattaagaa tcaaactcag tcaacaaagg caaaattatc 10021 gacgttgtct gatcatcact gatagcgtct tcagcatgga cggtgattta tgtcctttgc 10081 cagcactgtt ggaaatagcg gatgaattta gctgtatgct gctcattgat gaagcgcacg 10141 ccactggcgt actagggaaa actggcgctg gatgcgtaga acatttcgga tgtacaggaa 10201 ggcagttgat tcaaattggg actttgagta aagctttggg tagtctaggc gggtatgtgg 10261 ctggaagcac aacgattata gactttttgc ggaatcgggc accaacatgg atttacacca 10321 ccgcactttc cccggctgat gtagcagcag cactcacagc tataaaaata gtacagcaag 10381 aaccacaacg ccgcgttcaa ttatggcaga atgtcgccca actcaaacag gttatgcaac 10441 agcaactacc caagctgaaa ttgttaccgt cctcctcacc cattctctgt tttgaattat 10501 caagtgcagc agatgcgctt aaagctggac aacacctcaa atcagctggt attttcgctc 10561 ctgctattcg tccccctacc gttcctacga gtcggatacg aatttgtatt atggcaactc 10621 atgaattgac tcatattgaa aaattggtag aagctttgag cagtatctaa actgaagaac 10681 cttaatcttt catggtctaa gttaagaata acaatactac gatgtctccg atagtcagag 10741 attttttcac cactttgtaa cttggtactt attgaacttt ttttctagct agcatgaatt 10801 tcttcaatca gattacgcct ttttaaccaa tcaagttcta aaagagcaaa cccaagagtc 10861 actcgcttct caaacattca aaattcaaaa aaggtcatgg ggatacaggg gaatcagggt 10921 gataggtggt ggaatggtaa aacaagaaaa agcagatagc cagtgaagaa agctcgcaat 10981 tattcgtaag ttaataactt ctgtattcac ccagcagtgc tgagtttaca attgtgctga 11041 agaaatttca atgatagtca tccttttgta agcatgcaaa gatgcacaat cagattgtga 11101 ttagagtcac agcgtgatag ttgaaatact tgatataaat atgttaacta ataaatattg 11161 ctggcaaatt tcaaataagc ataataacct cgaacactaa ttgatacatt tttctgtaca 11221 aattagaaca acaaagttaa taagctgtca cacctacaac ttggacattt attagttcgg 11281 ggagtcaaca cttaaatgca aagacgctcc gcctccgctt tgaagacagc cagtgcccga 11341 aagcgggttt tccggcacaa aaaagccagt gaattgagga gacacgcacc aagtgtgaac 11401 tgatgttaat taaacaaaga ctgcttgttc gtatctcctc cacaagatgc tgctttaaac 
11461 gccgttcata tcaaccgttg cgtaagccat aggcatacct gtaacggtta aaaagaactc 11521 aagactgttg acttgaagca aaaaagtgca agatgtacgc gcatctgctg agtatacttt 11581 tcaaaaagtt tatcaaagca aatcaaagca aacaatagta actgtaccta aaactgtcat 11641 gaatcaacta taaactcttt agactacata actaatctgt gttttagaat gtcaatattg 11701 aagccgtaaa gtttttagac ttagacttac atacggattt cctgtgggcg tttctaacaa 11761 agaataagtt gtgtagttag tgtaaagcct gtaaactcaa gtattcttgg actttatata 11821 aacttaaaat ttgcttcatc aaaaatattt gagttaattg attgttaact taagtaccaa 11881 cagtaaaaac acttaatatt tattagggac ttaaacaatt ttctccaaga aaataatacc 11941 gaaaagcctt gtttttactg gttaaaaggt aagttagggg aagtctagct cttgttcaat 12001 atttctactt acaaaaaata ttggtaagtt caattgttct taacaacaga gttttttcat 12061 agattctttt taaagattta tagggacttt atctaaatct ggaaacaatg tctgaataat 12121 tgtatatggg tgtaaagcaa aaaataaaga tttcctctga caaaaggtat tctcaaataa 12181 cttatacgta aagaaaatga atataattgc tcgtttactt agtacatttg ctatttctat 12241 atctgcggtt gctgttatgc cctacttcgc catagcgcaa gaaacgctaa ctgctcccga 12301 acaacaaacg acgcagagag tttgctcttc cgattcagtt gaaaatttgt tgcctccgcc 12361 agttagtcag gaaagtcgtt ctccactttc ttacttgggg gaggaaggtt ttacacagaa 12421 cccagatggt tcttggacct gttatgtgag tgattctagg aagcaaggac gttactatac 12481 cctgtttaaa gtccaacaaa taaatggaaa gcttgtagcc agttcttttc tagacggcgg 12541 tattctcgtg gaaggtcaag acaaccgcag cttagacttc ttcatgatgc tcatagagaa 12601 gcatacaaag gcaaatcaag gaaatcgtga aagtatacgc agatatctgg atgcgttctt 12661 ttccttcgtc aagcaaggta agatacaact ttctaatcgt ggttacctct ttgatcaacc 12721 aagcggtgca gttgttatat accaccctgt cacaggcgga aaacttaaag gagcggcaat 12781 taccatcaat atcagctcgc ctgagaattt atcttcctct cctgtctcat aggctttcaa 12841 aattagttgt tggaggtaaa tcccttgaag tcaaaagtta aaacaatcgt tgtcacatta 12901 tcatccttag caacagtagt taatacgtct caatctgctt ctgctcagtt gaaagttgga 12961 aactatggta ttcaacaagg acttgagcat aactatctcc agtaccagat atcggggcgt 13021 cctttaaatc aaatgcgagg tattcctgca tgtagtgtgg gatttggtgc cgcttgtaac 13081 aaagcaggag cagtgtttca gaaattagtt gagtcaaatg 
gtggtccaac ccaggaacag 13141 ttgctgatac aagctgctgg aggggaagaa aactaccaga attttgctaa attttacgga 13201 aatgacccca atctgactca gataccgtac gcctcattct ggcggaacga cgaccccaat 13261 atcgtggacg gatatcgcta tcttgttggg caaacagtca atcaaactcc tcaagaaggt 13321 ctgggacaag ttaccaacaa cttttactgg gcacctcaag gaagcgggaa ctcacttgac 13381 gcccgtaatg gcttactaga tttgaaatac tcttatggac gtctgttgct tgaagaggtg 13441 gcaaaaattc ccgatgcaca gcagcagatt caatctttgg gtttagcgcc agaactgacg 13501 aagttttact cggaaaaact ttccagttcg atgcgtgtat tgaactctgg gaatgaagag 13561 tctcttaaag aggcgattct caaagacctc tcgataccct attcaccaga tggggcagag 13621 attggacgtc caaatcttgg cattcctcca actaattcgt tcactgagga aactctcgct 13681 ggagatgttg ttccctggga aactgcaata gcgctagatc cagaaggggt aaacgttgag 13741 cttcctccca gccttgcaga agttgcttta ttcccacaaa caggagaaag ttcttttcca 13801 acgggctggc ttgctggcct gcctgtgtta ttcttacttc ttcttgcttt tggtggtggt 13861 ggagatagtt cttccggtcg aggatcaagt gcagtagcga gtacacctcc tattagtgct 13921 cctccttcag gaggtgggtc tattcctcct tcgggcggtg gtggtggttc tggtggttac 13981 aaccaaatca ttgaagttcc gtccaagcct cctgtaacta ctccgccggg acaggaagtg 14041 aaaaaggtac cagaaccagc aacaataacg ccgcttgtgt tgttgatcat tgtcctttac 14101 gtactaaatc ataagcagtg gcgtatacaa actagaggtt aaattctttt gttgtgtatt 14161 cagatcatca gttggggtgt agaaaacgct cgcattcttc tgtttcctta aagaatgcga 14221 gcgttagttt ttgagtctgt gccaggtaga tgcttaagta accttgtcag ctacaaaact 14281 tcttattgag gtttgttgtt tgtatttgat gaaggttgtg ctggagcaac ttttgcattg 14341 accttcacac tttttacacc tttaatctgt tgtgccaaag tgggaatttt attgagttgt 14401 tcttgtgtag gaactgttcc agctactgtt acagaaccct ctttagcgtc aactgttaac 14461 tgactggctg gtaaattagc ctctaactta ccgcgaacct cactttttaa atcatcatca 14521 tttcttttgg cgtcaccacc attcacattg ttacgttgtt cgcgtgctct aatatcagac 14581 tctatttggt ctttacgaac ttgggtagtg gcgtcttctt gagttttctg agccgtttct 14641 tgcgttgggg ccttaacatt ttcatttgtc cctgagggag catctgcact tgtttttgaa 14701 ggggattcgc aacccaccgc accaatagca ataacaccgc taatgatcaa tggaataaac 14761 ttgttcatct tctgtctcct 
actgaaatat ttcaaaaaat tccttgacaa aattccttga 14821 cataggtagc tattaacagc aaacagttac ccagctatcc aaaacacgga gacgagaaca 14881 catagcgtga tagatgcgga gaacacttgt attggcttaa tgcctcctga atggataaat 14941 ttaaacaaat ccttgttgaa atctcctgac tttagttacg aaaattcacg ctattactga 15001 aatactgtta atggtgtgtg aggttcagga gcgaatcttt gcaagtgcac agaaagcagt 15061 tttttgatga tacttatgac ttactaaaca gttgtatcgc gacgatcaac aattgttacc 15121 gaaggctcat tattcacaac gccgccagtg taatctgtgt cagtgtctac aacaccaggt 15181 acattataga ttccaaactc ctgaatacct tggtttgtca aaacactttc ggcacgacga 15241 atttcgtcat ctgtcccgtc tactatcacc agataatcac cacgagatac tcgttcatta 15301 tataatctgg ctcgttcttc aggaattccc aaaccaagga gtccacctaa taatccacca 15361 gccgctgcac caattccagc accagctgca gctgtcgcca gagttgtcgc gatttcacct 15421 gcaagtaata taggaccaac gccaggaatt gccaaggttc ctaagcccac gagtaaaccg 15481 gtaatgcctc cgagtgttgc acctgtcact gcgccagcag cagctccttc atcagctttg 15541 ttaccaatac gttgttgcgt ttcgatacca gcaatatcac cagtgcgatc agcgtcttta 15601 gcaatgatag aaactttatt cattggaaaa ccggcgtctc taagctcggt gagtgcatac 15661 tcagcttcac gacgactagg aaatacgccg actgcacgct tgttgtgttt acccaaagcc 15721 atttcactcc tccattcaat agctaaaaga agcacaaaaa cgtataactt actttgcttc 15781 gtgcttatat tacagttttg ccttgctttg ctctcacttt catcgtccgc aagttaagct 15841 aaatatctat cgatagattg gttctttgcc caaatgggaa tttgaaaaaa tcataacctt 15901 atttcacgtc tttgctgttt acaaggggaa attgcgcttc tattgttcgg ttatactact 15961 ttcactaaca aacaaatact tataacctca taaatagtag tggatgccgc ttttcgacac 16021 tgtcagtcgc tctttattca gggggtgaaa tacttgtgta caaatggatc ttgccaagtc 16081 taagcgaagt tttagcaaat agtcaatcaa tagttgctgc atgttcatct gctgcatcag 16141 aacagcaatg gcgcgtcagc gtagcagcga cagaacaact gttattaaat attttaacaa 16201 ctgcttcacc tgaagaagcc acacaaggat tagttttagc tgcgccagcg cctgttttta 16261 gtcagccaag actggctggg agtttgcaga ctgttagttt tacggcaaag ccatttaacc 16321 ccttggcact tatgccattt caaatgcctc ctgatgtggg tgtggtagat gcatttgctt 16381 ctagtgagtc ggttttacct gtattagaaa cagatccgtt agcaagggag cagttttgtt 16441 
tggttttcac caagcaattt agattagtgc tggttttagc agaagatatc aatggcaata 16501 agactttttc attttctttt gagccagagg ttgttgagtt agcatggcga tcgctaggag 16561 caagagttgt agtcagcaat ccggatctgt ttgcggattt acaagagtta gtacataaat 16621 attctccagt agctccagat taccgcgtag tgatggagtt cactcgttta ttactgacac 16681 aatttccaga acaagaagaa aacacggaaa tcgcaaaaag cggggatgta gggacacgag 16741 cacgtccaga atgtgtagaa accgtcccag cttcgccaca tctgtctttg ggtgcttcat 16801 cacaagggca tactcgccct gatgtagaac tactgcaagc ttttgctcac gaagttcgga 16861 ctcctttaac aacgattcgc accatcactc gtttactgct gaaacagcgt gatttacctg 16921 caaatgtgat caagcgtttg gaacttattg atcgcgaatg caccgagcaa attgatcgta 16981 tggagttact ctttagagca gcagaactac aaacatctac tactgtaaaa tctacaaata 17041 ctcaactcac agcaatgtct ctggagcaag ttttgcagca gagtattccg cgttgggaac 17101 aagcagccaa tcgacgcaac ttgactttgg atgtagtttt acctcaacag ttaccaactg 17161 tggttagtaa tcctgctatg ttggatcgga ttctcacggg tttgatggag aattttaccc 17221 gtagtttacc ggctggtagt catattcaag tacaggtcat cccggctgga gatcaactga 17281 aattacaatt attaccttta actcaggcgg gagataatgg aaaaatgtca ggctctccat 17341 gcacaccgcc aattcgtaaa gctttgggtc aacttctcat gttccagccg gaaacaggta 17401 cgataagttt gaatcttaac gcaaccaagc acttatttca agcgattggg ggtaaactta 17461 ttgtgcgcga tcgcccacgc cacggagaag tgctaacgat ttttcttcct ttggaagtca 17521 ccagtcagtg aagagtcgca taaatattag tctaaaatcc agttttttgc gtggtcagag 17581 ttttgttttg atcaggaatt aaggataaaa tccttgtagt ctcaacaaga cttgaatcgc 17641 aaccagtatc tacgattaag acagaaagat gacagaacaa atacaaacac ctccaccacc 17701 aaaagttttc attagctata gctgggattc tgatgagcat aagaacagag tcctcggtct 17761 tgcaaatagc ctcaaaaatt acggaattga tagcaaaata gaccggtatc aacaatcacc 17821 tcgtgaagga tggtatcggt ggatgatgaa tcaaattgag gaatctgatt ttgtgctcgt 17881 agtttgtact gataaataca accttcgata ccagaataag gaaaaacggg gacagggacg 17941 aggggctaca tgggaaggtg gattgattat tgcagacctt tatgaagcac aaggaatgaa 18001 tgataaattc attcctattt tattgtctcc tgagtatgaa aaagatattc cttcaagcct 18061 caaaacatac actgtctatc gattgtttga tccgaaagac gacccaaaaa 
ttccaggagg 18121 atttcaggag ctatatcggc gtcttatgga tcagccagag tgcgaagaac cgaagttagg 18181 taaacccata atattacctc cgatccaacc cgcgatatca aacgtgggtg gtgatggaga 18241 aattataaaa tatgaagatc ctgataacac tcaaaaaaca gaacaaatac aaacagcaac 18301 aaaagttttc attagctata gctgggactc agacgaccac aaagaaaatg tgttgaaact 18361 agcaaatacc ttacgtgcag tatgggggat tgaggcagat atagatcgtt atgttcgagc 18421 agagcctcct tatactcctg ttaaaggatg ggatctttgg atgtcagagc gaattaagtg 18481 ggctgagttt gtgcttatca tctttactga gacataccag cgacgatttg agggtaacga 18541 ggaacctgaa aaaggtttag gagtttcgtg ggaaggaaca attatcagaa acgaccttta 18601 caacgctcag ttaagagaca ctaaatttat cccagttgta ttttctcaat ctgacttgga 18661 ttatgtttct tctgtactaa atcccagaga taaatatatc cttactgatg ataaaagctt 18721 tacagaactt tgctatcgtc tgagaaaaca aaagaccatt atcaaaccag aaattaaagg 18781 gggacctcta ccactccctt ctgatccagt gttattttct cctcagaaac cgccagtaaa 18841 atctttagag caacaacaga ctcctgagat tgaataccgt aaaaaagttg aagagtatgc 18901 taacggggtt tcactctctg aaccctatga tgacattgat gaaattcata agactatctt 18961 agaaacaata gggctagata ctgataaggc taaagctatt gaagatgata ttcttcaatc 19021 acgtcgccaa cggcacgaaa tatacaaaaa aaaccttcaa aaatatcaga accttctgat 19081 gaaaaactta gaaaaagaag ggttttttag tgatgatatc ctttctaact tgaaacaact 19141 tcagaataat ttgggactaa ttgataagga cactgaagaa ctattgctat atgctcaatt 19201 ggagtattcg cttaaaaaag gaagctggat agaagcagat gaaactacta caaatctctt 19261 gttgaaagta gcaaatgtta aacaaaatta ttttaatgta actgattttc ggaaaattcc 19321 atgtaaagat atacgtaaaa ttgacgaact ctggacacaa tacagtaaag gtaactttgg 19381 ttacagcgcg caaatagata tctggcaaca agttaatggg gaacttcttg atttctttgt 19441 tgcattggat tggggatata aagaagatca tcgattcgtg tatacagaga attttaaata 19501 taatataagg aatcacccga aaggacaccg cccagtatca gttctatggg agggaggaac 19561 tcacgaaacc cgacaagcat acataaacag aattcagcag tgctgcacag gtcttgttcc 19621 ttaattggag ataatttcta tatctgtttc tccaagataa gaacaacaca tctagaacac 19681 cgattcagta ttcaccaatt gtgagaaaaa acccagtttc ttctcattga gataacgaaa 19741 tatcagtttt tccagcaaca gaaactgggt 
ttttggcatt aatgaatcgg tgttctagga 19801 tgctacattt cgcatttcta tttcgatggc aaaggtacat acataagcac aatcgtaatc 19861 agaaaagtaa tcaaataagg aaggtttgcc caggatttta ggacactatc cttgctggaa 19921 atttgcctga aaaaatcagg tgaaaaatca ggtatcaaaa aatttctttg tttaatcttc 19981 atttgctcta cccattgata atcttcgtta gatgcataac ctccaatgat ccgaactaac 20041 cgattgaagt tcttccgaaa ttctctctcc atgtacaaat aatagatatc cagtaaacca 20101 aataaagtaa aagagacgaa tgccaggata aataaaatag ggtttctatt ttgaaaacca 20161 aatccaatga tcgcagccca aaccgttaat ccaaactttt tagcttcaag cgaattgtct 20221 gccatgcgtt taattacaga ttgcagcatt tcactttgtt taatcaaaaa gtctcgatct 20281 tcttgaactc tgtccagatc gttgcttggc atagtactgc actataacga gagccactcg 20341 attgtagaat aaaactagct ctttattagt ccttcggaac ctcaccctgc cctatcgggc 20401 atccctctcc ttataaagga gagggaaagt tttagggtat agcgtagagg aagttcagtg 20461 gaattaatat ccgccctctt cttcgtactc tggctgtctc tctttgactt ttttaggaat 20521 attcactggc tttggtggct catcatccca agcatcgcga gctaaatctt cgtcataatc 20581 gtcataatcg tcagcgtacg gctttttgtt gtaggaacca gagtcatact ttggctgtgg 20641 ttgatactta tctttacctg ttgcttcact ccagttgtct tcttcatcgt cttcatcata 20701 ctgcacaggc tcatactgcc gtgctttcat aacctgacgc tcttgacgtt cttcttcgta 20761 gtagtcttcg tcccactctt gcgctacagg ttcgggtgta cgaacgattt ttggcttggg 20821 ttgttctaaa ggaactccgc ttggtagttg cttgtctggt gtgactgtac gaggagtgta 20881 gctggagtat tcttcgtcag catctcgttc ccacggcgct ttaccaatac ccaaacgctc 20941 taagaaacca acagttaatt gattaagccg ctcttcagcg ccttcaaaaa caatcaatct 21001 attggggcct gtgctgacga tttcgtctac tgagaactcg taggtactca agaattggtc 21061 gggaatttgt ggtaatccaa gagaggcgat cgcaattgag taaagctttc ctgtttcggc 21121 attaaactta aagccccgca ctttgcctaa aacttcacca gtctctgtaa tgacttccca 21181 gtttatcagg ttgctgtaga tatcaacctc aacatcatca ataacatcct cgttatccac 21241 gaggataaca tcaccgattt ggttaatgct gttgagatac atatagcgtg gcacgccaga 21301 aatagagatc aggctgtctc gcatgccaag cgccacaacc tctcgctgat ctacatcaac 21361 ccaaacttga ctcacgattc ctaatcgctt gccgttgtca cgagtgatta cctgggtgtt 21421 taaaatatcg 
gaacgtctaa tactttgttc agaggtcatc ctgtaccgag tcctgatctc 21481 gaatccggtt tatctataca ctattattaa caaaaactca agcacttgtt tggtcagatt 21541 gtagcttaat tcccaaaact tgggtataag ctcctcttgc ttgagtcacg ccaattgtac 21601 gttcggctga ttctatcatc gggcgacgca aactgacaac gataaattgt gcttgttgcg 21661 cctgttgctt aatcatttta gctaatcgtt ctacgtttgc tccatctaaa aacatatcta 21721 cttcgtcaaa tgcataaaat ggtgatggac ggtagcgttg caaagcaaaa ataaagctca 21781 atgcagtcag agatttttct cctcctgaca tagaagcaag tcgctgtata ggtttacctt 21841 tcgggtgtgc aaccaaattt aatccgctgt tgaatggatc ttctggatcg tcgagttgta 21901 agtatccgtc accctcagaa aggacagcaa aaattgattg aaagttttca ttgacagcgt 21961 caaaggcttc tttaaatgca cgttgacgca atgtggtaaa attctcaatt cttaagagta 22021 attcggtgcg ctctccttct aaggtttgca atttttgagt gagttcttca aggcgttttt 22081 gcgtacgctc atattgttct aacgcgagca tattcacagg ttccattgct tgcaggcgtt 22141 tggcaagaga acgcaattct ttttgcaatt cttctaaatt gactttgtct ggtacttccg 22201 gcaaaggtag tggtaattcc gatcccatag ttcgcaattg ggtttgcagt gctgcgagtt 22261 cttctcgccg cgtttgctgg gtttcttgca atttttgcag ttcccattcc aattgttgtt 22321 ggcgcattgt gtgcgatcgc aattcttgtt ctgttgcgtc ccgtttttgc ttctcctctc 22381 ccaaattttg ttccatttga ctcaaagccg cgcgagtttc agcaattgca gtgctgagat 22441 tgaagtgctg agattgaagt gctgtacgtt gattttgctg ctttatctgc tcttgatggt 22501 actcttgaat acgctgctct ccaacaacaa ttttttcttg caatcgctgc tgctgatttt 22561 ctaaatcttt tagtttttgc tctgcttccc ttaaggctcc ttctcgctgt tgcaattgtt 22621 gctcttgaat tttgatcctt gcttggattt gttgccattc ttggggagtt tgggactgct 22681 ccaactcagt caaagcgtgt cgtagttgtt gtaattgctg ctcttggtca ggtaattccc 22741 gatccaaaat ttctaaacgg gattggacgg tagttaattt ttctgtgttt tgcgaaagct 22801 gcgatcgcgt tgtttctaat tgcgctgtca aactttttac ctctttttgc aactgttcca 22861 agcgtagttg ttgttctcgc cgcgcctgtc gtgcttccgt cacctcttgt gtcagttgtt 22921 ttgtttgaga agaaagagaa tgaattgctg aactacaacg ttctaaaatg cgctcaatat 22981 cagtcaaccg actttttaaa gaagagactt cctcagattc cgcagcttcc accgtaccaa 23041 accgcaacga tgaacgctga ttcgtgctac caccagtcat tgcaccactg gtttctaata 
23101 attctccgtc taaggtgaca atacgataaa gtttcatatg ctgacgcgcg tcagcgagat 23161 ttgcaaaaac gactgtgtta ccaaaaacgt agttgaacac atccttatag cggcgatcgc 23221 actcaatcaa attcacagca taatccacaa acccattcac atatcgcagt gtaaaatctt 23281 gggtaaattt ggcaacctga attttattca acggtaaaaa agtcgctcgc ccaccacgtt 23341 tttgtttcag cagttcaatt cctgcggctg ctatgctgtc atcttcgacg acaatatgtc 23401 ccaaacgcgc cccagcagaa gtttccaaag ccaattgata acgaggttcc acccgtccta 23461 gctgtacaac taatccacaa actccaggca tacccgattg taggataatt ttgctcgctt 23521 gagttccttg gacttcctgc tgtgctgctt gttgcgcctc tagtttatcc aattggcgtt 23581 gtttctctcg ttgttcttgc aagaggcgct tttgggtttc ctgctggatt tgcagttctt 23641 gttctgtggc ggatagtgtt tcggctaagt tttgaatcgg ttcgccagaa gtgttaaatt 23701 ctgtttcaac ttgttgacat tcagcttgtc tttctgctat ttgaggttct aaagtttgaa 23761 cgagttcggt ttgttcttga atttgctgct ggagttggtt attacgttct cccagttgtg 23821 cttgttctgt acgttgaggt tcaacagttt gcagcaaagt ttcaatttga cgattcaggg 23881 ctgtttgttg ttgtacccaa gcttctgagg cggaagcaat ttgggctgct gcttcacgag 23941 agttttctaa ttcttgtcgc gtctggtctc gttcggttcg tcgtgaggtg ataaattgcg 24001 tttgttcaac ttgttgttgt gcgacttgtt ctaaagaatg gtgatgctgt tgaatttcct 24061 cttgagtttg ttgtagacgc ttaacggttt cttgtgcggc tgtttctaat tctttttgct 24121 gacgctgaag ctgtttacgt tctgcttctt gggtagcaag ggtagattgt accgctaaaa 24181 gttcttcttc tcccaaggct ttgacatggg cgttgagttg ttcaagttcg gcggtttttt 24241 gggtgatttc tgaatttagg ctggtgagtt gagttgtgag ttcagtcgag gtgcgatcgc 24301 cgtcttgaat ttgcgtcgct aacttttctt gctgtgcttg tagggaacgc catgataaaa 24361 ctgcttccca ttgttgtttg tcttgaaatt cagtgcggag tttttggtat ttttcagctt 24421 tggcgcgatc ttgggagagg cgatcgcgtt gtccagttaa ctctgtctca acaatacgac 24481 aactgtcttc cttctcctta acttgatcca aagtttcttt tgcctgattg attttgcgat 24541 caaacgccgc cacccctgcc aattcatcaa taatttccct tctttcgcgg gggttcatcg 24601 agataatact ggtgacatcc ccttgcagga caacgttgta accttccgga taaatacgta 24661 aattttctag ttcttcgtgt aactctgtga gagtgcaaga aacaccgttg atgtagtaat 24721 ttgaagtata agttccttgc ggagtgactc gcagccttct 
agtgacactc cactcgctac 24781 ctggtttcgc acgtattttt tctttgcgat ttctttcatc ctcggtttct tcttcttgcg 24841 cttcctctgg tttctcaact tcctcatctt catcgatgga gacgacagag gagatttcct 24901 caagttcatc atcaggttca tggtctgaca aatcaaacgt taccgtaacg gaagcttcca 24961 cagcggaacg agatttgcta gtttgagtcg tgttgactaa atcaggaagg cgatcagcac 25021 gcattccttt ggaactggcg agtcccaggc aaaacagcag cgagtctaga atattggatt 25081 tcccagaacc attcggtcca gaaatgactg taaactccgg cagtaagggg acagaggttg 25141 taccgccgaa ggatttgaag ttcgtgagtt gtacgcgctt tatatgaacc atcgcgctgt 25201 tttacggggc ctgtgacgtt caagtgtatc aacactctca aaagattagc gtgaaagtga 25261 ttgaaccgcc aagaggtaaa gaacgttaaa gaggagggaa ggagagttta aaaccagttt 25321 atcaatagaa tttcgtattt cataagcttg ccttatgctt taggaatagc aatatggcca 25381 aatatgaaat tggtatggag acaattgatc aaagtagata aatgccaaag gtaaggcaag 25441 aacagaacaa aaactcagta gatctgcggt tacgcttata ttccacgcac cgaatctatc 25501 aatggcacgt ccgccgagaa ctgtagccaa tatgatcgca actatattac ccgcgctggc 25561 aatgccggta actaacgggg aattagtaaa ctgtaagacc aatataggta ttgcaacagc 25621 tgcaacctga ttccccaaga gggacaagaa ttctgcagtg attaagccaa ttatcggcat 25681 tgatcgaatg tcggatttca taagttggct gcaatcatca gattcttatg ccaaaggtag 25741 ggaaaatcga aacgtacttc ccttatttag ctcactttca accataattt cgcctccctg 25801 caacatcaca agacgctgag agatagctaa acccattcca gaaccaccag ggtggcggat 25861 acgagagcgc tctgatcgcc aaaatcgatt aaaaacaaag ggtaaatcct ctgctttcat 25921 accatgacct gtgtcaatga ccgcaatcca taatttcgga ggttcgctcc aaactcgcag 25981 tgtgatagaa ccattcacag tgtagcgtac tgcattgcca agtagattga ccagaatctg 26041 ctcaactcgc tcaggatcgg cagataccaa aggaggattg ggtgggtact ccaagcgcaa 26101 caccggaccg tcttctagta gttgatcacc aaaccgtttc attagggaaa gcaggagtgg 26161 atgtagatca atgggttgaa tcttgatggg taaatatccc gcctccgcct tagaaagttc 26221 ttgtgtatca ttcaccaacc gacctagtcg ggcggtctct ctggctaatc gctcaaacag 26281 ttctgttttt ggttcaatcg cgctatctgc taaaccctcc agagaaccct ccaagattgt 26341 caacggcgta cgtagctcat gggttaaatc ttcgatgagt tctcgacgac gctgttctac 26401 cccttcaagg tttgatgcca 
ttcggttaaa atttatggca aggcggttca actctggaat 26461 ttcactcttg ggtatacgcg ctgtcaagtt cccctgagca aactgctgag taattttctc 26521 catctgaatc agaggttgta cgatgcgcct agcaaacaaa taactcaaag ttcctgctac 26581 ggttgcacct attaatagtg accaccaaac tccttggttc cagactacct cacacttgtt 26641 aatcaactca ttgtcagttg cacccagatt aaatcctcgt ccttccaaag gcgttaagtg 26701 caacccatac aagtggtgcg agaagaacca gccgactatg agtggggtac tcaatccaac 26761 tagcatcaca ataatgttgc tcaacagtaa tcgcgatgct ccagcaggga gccgcgcaag 26821 gcgcgaacgc aggctaagct tactcacaat tggtcctaga taacttgatc ttcaaactta 26881 taacctactc caatcacagt tttgacaaaa gttggattgg ctggatcggg ttcaattttt 26941 tttcgtagac gtgcgatgtg ggtatcaaca acccgctcat cgccataaaa gtcactaccc 27001 caaagtttct caatgagttg agttcgattc caaacccgac cagggtcact cataaatgtt 27061 gctagtagtt caaactccaa attcgttagg tctaaacgag tcaattgatt tgtactcaga 27121 atccggctgg ctgagtgttg ctcaatatta atgacaaagt gttgggtacg ataaatctga 27181 ctttgcttac catcacgtcg catccgtcgt aggagtgctc gcacacgggc taccagttcc 27241 tttaggctga atggcttgac gacgtagtca tctgctccag ttgataaacc gacgatccga 27301 tcattctcct cacccttcgc cgtcaacatc aagatgtaag ggtctttaaa gcttggttgc 27361 tgtcgaatgc gggtacaaac ttccagtccg tccaaaccgg gaatctttag atctaggatt 27421 attacgtctg gcgaatactc ttgaaacact tgaagcgcac ttaaaccatc agagcagtgg 27481 cgacaagaaa atccttcttt ttccaaatac agttgaatta actgagcaat ttcaacttca 27541 tcttcaacaa tcaaaatctc cattagcctg ataaagatct tttgtgccaa ggattaatag 27601 taaaccactt tatcagacca tatcaactaa ggacgaagtt tgagcaatac gcgaactggc 27661 gggagcacga atattccaca accccgtgac agactttaca gattcttcac aaacgttggc 27721 atacatttca acaagtctta gcgagaatct cacaacttcg ttacaggatt gacatatctt 27781 tattacctat ttgttacaag a // LOCUS NODE_1059_length_27543_cov_5.21049227543 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 27543) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 27543) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..27543 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 49..993 /locus_tag="DP116_08995" CDS 49..993 /locus_tag="DP116_08995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015161664.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor RpoD" /protein_id="PRJNA477356:DP116_08995" /translation="MLASKVREELADEWAEAGAFTEEADDADEPEVEVVAPTEGFTED SVRLYLREIGRVKMIKPDEEIELARRIAKGDLDAKKKLIQANLRLVISIAKKYVNRGL PFQDLIQEGNLGLIRAAEKFDHTKGFKFSTYATWWIRQAITRAIADQSRTIRLPVHLY ETISRIKKTTKLLSQEMGRKPTEEEIATRMEMTIEKLRFIAKSAQLPISLETPIGKEE DSRLGDFIESDGETPEDQVSKNLLREDLEKVLDSLSPRERDVLRLRYGLDDGRMKTLE EIGQIFNVTRERIRQIEAKALRKLRHPNRNSVLKEYIR" gene complement(1199..1378) /locus_tag="DP116_09000" CDS complement(1199..1378) /locus_tag="DP116_09000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654777.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_09000" /translation="MTNATKTVTPVIDDRNAWRWGFTPQAEVWNGRLAMIGFLAAALI ELFSGQGFLHFWGIL" gene 1567..1893 /locus_tag="DP116_09005" CDS 1567..1893 /locus_tag="DP116_09005" /inference="COORDINATES: protein motif:HMM:PF11937.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09005" /translation="MKLGQICFLLLPLLIPQVGVAQTPQQTTIAPPQLPDNLKVPANQ VLLLGQRATGVQIYQCKAKSSNANQFEWTFVAPEAKLFDAQGKNNIQHYAGPTWEAKD VKSVVQ" gene complement(1874..4399) /gene="gyrA" /locus_tag="DP116_09010" CDS complement(1874..4399) /gene="gyrA" /locus_tag="DP116_09010" /EC_number="5.99.1.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA topoisomerase (ATP-hydrolyzing) subunit A" /protein_id="PRJNA477356:DP116_09010" /translation="MAKQLNLLCEGQVITTALHTEMQRSYLEYAMSVIVGRALPDVRD GLKPVHRRILYAMHELGLTPDRPYRKCARVVGDVLGKYHPHGDQAVYDALVRLVQDFS SRYPLLGGHGNFGSVDNDPPAAMRYTETRLAPISHEGMLAEIGEETVDFIGNFDNSQQ EPTVLPAQLPFLLLNGSSGIAVGMATNIPPHNSGEVIDGLIALIDNPDLSDEKLFEII PGPDFPTGGQIVGNAGIKEAYTTGRGSIVLRGVAQIEEVAQSRGNKRRTAIVVTELPY QVNKAGWIEKVADLVNQGRLTGIADLRDESDRQGMRVVIELKRDTNPQEVLQNLYHQT ALQSNFGAILLALVDGQPRQLSLRQLLQEFLSFREQTLNRRYSHELAKAENRLHLVEG LLKALSALDQVVEILRSAADGTTAKIGLQNRLDLSEVQADAILAMPLRRLTNLEQQNL HNEYEQLNEQINSLQKLLQDRRELLKSLKKDLRSLKRKYMDDRRTRIGVMSEEENKGQ GDKATRGQGDKETENPKSKIENSKLEAPAEEVVLEFTHRGYVRRSQPSSRRTKTDNGT SDTDFVIQSVLADTAKELLVLTSGGKVYPVNVADIPPSSGRSARRTPLITLLSNSAQG AQETLINRFILPDHPENGEIILLTKQGRIKRLSLTELTNLTRRGITILKLKDDDELLC TQFTTAGEHLVLASSGGRVLKFEVNDNQLPIMGRAAMGLQGLRVRQQEEMVGCVSLNT KENLLLVTQLGYAKRIPVSGLRAGNRSDIGTQTFKFTNKTDILAGMVRAIASAEVAVV TNHQRVIRLGVETVPILGKDSAGESILQLSREEKIISVIELHS" gene 4657..5061 /locus_tag="DP116_09015" CDS 4657..5061 /locus_tag="DP116_09015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_09015" /translation="MKTVLIVEDDLINARVFSKILTKRGGLQVKHTENVEEVIKIAQS GEVDIILMDVSLSRSVYQGKSVDGIKITQMLKSDPQTATLPIILVTAHAMEGDRENFL KQSGADSYISKPIVDHQQFVDQILALLPQQNN" gene complement(5322..6215) /locus_tag="DP116_09020" CDS complement(5322..6215) /locus_tag="DP116_09020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316794.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c biogenesis factor" /protein_id="PRJNA477356:DP116_09020" /translation="MPKRIGLISLLVVCGLCIPQPSNAQALVPHTLQLDAAKLEQQGL GLAKEAAQLAQFQQYELALPRARLASQLAPKNDKVWFLLGGLYLQSKKLDQSINALKK AQSLNAKNGDVQFALGSAYFQQQKYQEAVNYYQQGLRLKPNDPEGLFDLGNAYYMQSK LPDALIQYKKAVSFNKKFWPAINNVGLILYEQGDIQGAIKQWQDAVGMEKQAAEPLLA LAVALYAKGDQRQALAMGQAAVRIDERYADLDFLKQNLWGTRLLSDTKKFLELPGIQA ALQQREEPSNSTTTRQRMIPQ" gene 6970..8442 /locus_tag="DP116_09025" CDS 6970..8442 /locus_tag="DP116_09025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994627.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_09025" /translation="MTTYLENPVIDKKVKHPLRWLIGLAAAGVLVIGATTTYTVVNRG TSKQDIAALTVPVEVKNVTVRISASGKVQPVQSVNISPKNSGTLVELYVEQGDKVSQG QIIAKMDSANIQARIAEARANLAQNQAQLDQAVAGNRPQEITQAKARLAQAEAQLAQA RAGNRPQEIAQAQAQVDAAQAKAKYTSEQLKRYQSLYQQGAEKKQLLDQAMSEDNAAK ASLQEAQKRLSLQQIGTRSEEISQKEAAVAESRAALQLSQAGSRPEEKQARKAAVAAA EAKLKTEQVNLDNTIIRAPFSGIVTQKYANVGAYVTPTTSASSSASATSSSVVAVARG LEVLASVPEADIGRMKQGQQVEIVADAYPDQVFKGHVRLIAPEAVKEEGVTLFQIRVA IDTGTDKLRSGLNVNMTFLGDKVQDALLVPTVAIVTEKGNTGVLVPDAKNKPQFHPVT IGAQIKDQTQILEGLQGGDRIFLNPPADYKIQQRQQQQKK" gene 8439..9656 /locus_tag="DP116_09030" CDS 8439..9656 /locus_tag="DP116_09030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863253.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_09030" /translation="MNILESMKMAGKTLVSNKLRSALTMLGIVIGNASVIAMIGVGEG GQKFVNNQLESLGPNVLFVIPGNRETQRITTKVPKNLVLEDVEAIASQVPTVAGVSPE LNGRYVANYRNRNTNVNIIGTTPSFLVVRDFETAKGRFFTDIDMKRNNQVVVLGANLA ERLFGTSNPISQQLRIKNASFQVIGVLEAKGSSLGADYDEAALVPITTSANRLVGRNS PYGIALDYIVASARDSNSVDAAEFQVTNLLRLRHKVTSEDDFTIRTQKDALQTVGQIT GALTIMLAAVAGISLFVGGIGIMNIMLVSVTERTQEIGLRKAIGATQQDILSQFIIEA IILSAAGGLIGTAIGVSGIMVVSALTPLKAGISPVAIAVAVGVSGGIGLFFGVVPARR AAQLDPIVALRSA" gene 9763..10527 /locus_tag="DP116_09035" CDS 9763..10527 /locus_tag="DP116_09035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_09035" /translation="MANTQLLTDSRVPNPGNQPVIIRLEDIFKVYGSGEAEVRALNGV NLTIEEGEYCSIMGPSGSGKSTAMNIIGCLDRPTSGHYYLDKLDVAQMEDTKLAEIRN KKLGFVFQQFHLLSQLTALENVMLPMIYAGVNTKERRDRAAEALKRVGLEKRLNNKPT QLSGGQQQRVAIARAIVNRPVLLLADEPTGALDSRTTQEVLDIFGELNASGITVVMVT HEPEVARQTQRVVWFRDGDVVHSHLSPSDVGHLAVS" gene complement(10641..11171) /locus_tag="DP116_09040" CDS complement(10641..11171) /locus_tag="DP116_09040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872653.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="orange carotenoid protein" /protein_id="PRJNA477356:DP116_09040" /translation="MTSSYDQNVPQALSDETQKVVEAFNRLDTDAKLAWLYFVYEKMG DSITPAAPPAAEPELAPVLLGDYYKLSDDEQLAIMRQIVNREDSEYSRAYGAIKENNQ LFVWFDWAQKMGDQVVDIPENYKATDAVNNVLSQIEALDFEGQISVLRTVVGDMGYSD VKPIETQAQTGKTSSL" gene 11387..11758 /locus_tag="DP116_09045" CDS 11387..11758 /locus_tag="DP116_09045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09045" /translation="MPEEVKPNPSEAPTQDAQLAAENIVSGQEKAPSVDIEKDYQAAQ QFSVSEIDRTGEGAKAAQKATAPKQELHDPEQTKIQANSTGNPDDYIDIAKEIGGSKT EAVTNVTDDLVEKAKEKGQSK" gene complement(11931..12572) /locus_tag="DP116_09050" CDS complement(11931..12572) /locus_tag="DP116_09050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191692.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carotenoid oxygenase" /protein_id="PRJNA477356:DP116_09050" /translation="MTQKVIIFDFDGTVADTVDALVSIANRLAGEFGYIPITQEELSL LRNLSSREIIKYSGISVLKIPFLVKKVKAELKNKIKELKPISGIKEALVALNNEGYRL GIITSNSQDNVTDFIKVNDLDNLFEFIYSGVTIFGKTTIINNVLKQKQIKPQEVIYVG DETRDVEASKKANIKVIAVTWGFNSQEVLAKQNPDFLIHHPSQLLDVVRVINF" gene complement(12569..13525) /gene="queG" /locus_tag="DP116_09055" CDS complement(12569..13525) /gene="queG" /locus_tag="DP116_09055" /EC_number="1.17.99.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872656.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA epoxyqueuosine(34) reductase QueG" /protein_id="PRJNA477356:DP116_09055" /translation="MNQSPRTNSSEVKKKALELGFQKVGIAAIDGENITETQRLQAWL ALGYHADMEWMANPKRQDIRKVMPEVRAIISVALNYYTHHQRPGTREYAKISRYAWGR DYHKVMHKKLKAMTTWLQGLDEGIQARYYADTGPVQDKVWAQKAGLGWIAKNGNVITQ EYGSWVFLGEVLTNLELESDRPHTEHCGTCTRCIDACPTGAITQPFVVDANRCIAYHT IENRNEKLPQTVTSHLQGWVAGCDICQEVCPWNQRFAKETDVAEFEPYPRNLAPKLVE LAQISDQEWDRQFPASALRRIKPDMLRRNARANLDAFPQAKE" gene 13729..14151 /locus_tag="DP116_09060" CDS 13729..14151 /locus_tag="DP116_09060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872658.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nuclear transport factor 2 family protein" /protein_id="PRJNA477356:DP116_09060" /translation="MTAAESTPTTVDEKFQISGITELTLLHYFQTLNAGKFEETAALF AEDGVMHPPFESGIVGRDAITRYLQQEAQNVKAYPREGVVETLEGEQIQFQVTGKAQT SWCGVNVLWTFILNQQKEILYTRIKLLASPKELLNLRR" gene 14551..15186 /locus_tag="DP116_09065" CDS 14551..15186 /locus_tag="DP116_09065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007309059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IS607 family transposase" /protein_id="PRJNA477356:DP116_09065" /translation="MSYMWKVAEFGDLIGVSASTLRRWESEGKLIPERTLGNQRIYTE QHLNLARNLKSGKYPTRVIIYCRVSSHGQKDDLTSQVNSMDKFCVANGVVVTDRIEEV GGGLNFKRKKFLQIIQWAIQGEVKSVYVAHKDRLCRFGFDLVEQIIIWGGGTVVVANS EALSPHEELVEDLLSIIHCFSSRLYGLRKYKDKVKLIANGIDPCSNSVILH" gene 15213..16391 /locus_tag="DP116_09070" CDS 15213..16391 /locus_tag="DP116_09070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013324876.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_09070" /translation="MHAIKRELKLNKKEISQMRGNAGFKRFVYNYGLDLIISSWSFED IKASDSKRIDAIKKVFTQVTMRRTEYTWMKQYPSTVYQSAFIDLKNAFSRWRQGLAKF PVKKTKKKGDSFTVYKNAGVYPEKGKPAFPFTNRVVICPGKIIKLPGLKQVRLKERIN FLCSSQTFTVSRIADRWFVCFVLDAEKVPPRIHSIHKIGVDLGVKCLATCSDGSRYEM PVTTCQAKIKLGKHQWRNRNKIMGNKKLKIKASNNAKKCLNQLSKQHAHLANIRKDTT QKMTTDLSRKAYIIRIEDLNVVGMIANQKLAKAVSNNCFYEIRRQLIYKQSHYGTKVE LVERWFPSSKMCSKCHHVQPMTLEDRIFNCQKCGQIQDRDENASKNLENAPLDKIRLA " gene complement(16528..16959) /locus_tag="DP116_09075" CDS complement(16528..16959) /locus_tag="DP116_09075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anti-sigma factor" /protein_id="PRJNA477356:DP116_09075" /translation="MKSELHVPSDLKFLTIVESWLLSCLEVEFKGSVDWSKQSSRLRL VLVEAYSNVVRHAHKDQPLLPVLIRLELKDQDIALEVWDYGKGFNLSDYSPPSPADQQ EGGYGWLIMHRLMDKVEYQLQVDGGNCLKLEVTLPKITTTA" gene complement(16963..18636) /locus_tag="DP116_09080" CDS complement(16963..18636) /locus_tag="DP116_09080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995872.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fused response regulator/phosphatase" /protein_id="PRJNA477356:DP116_09080" /translation="MTEIETGKLKLLVVDDEPDNLHLLYRTFRRDFQVYKASDALTAL DILDQEGEMAVIISDQRMPEMNGTEFLGRTVEHFPDTIRILLTGFTDVEDLVEAINSG QVFKYITKPWKPERLKAVVEQAADIYRVVKKRTQELSRALRRESLFNAVTTAIRESLD YNSMLQKIVATVGQTFEASYCVLRPVEGNRLTPHQFCYQDPKSLSSSCDFDLNLLISE VLETGQLQKALSIHDEKSYQKLVVPLIWQQNLLAILALKQHHHARSWQQEDIELITGV AEQAALALSQAKLYQRLQQKQEQIRAELEVARQIQNNLLRQTMPNINGVRVQACCYPA REVGGDFFEVFVHPKGDLWLAVGDVSGKGVPAALFMASAISVLRRELSQESPPLPNVV MQNLNHTLSDDLISNNCFITVVLARYTPNTRELVYANAGHIYPLLWSYQETVEKKPKY LKVRSVPLGILPVWQVMSSQLVLAPGDILLLASDGITEAEVLNTKDFAQEIVDSAQPV SRSMLNQEGLWQLLNREAQPLHLNHLLARIQADNRVQEDDQTILSLEVL" gene complement(18833..19570) /locus_tag="DP116_09085" CDS complement(18833..19570) /locus_tag="DP116_09085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411421.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_09085" /translation="MTKIRVALIEDHDLTRVGIRTALQQKEEIEIVGEATNAVEGLKM LNTLQPDVAIVDIGLPDKDGIELTREIKSTTDGEELATRVLILTLRDNKEAVLAAFAA GADSYCMKDIKFDNLIEAVRVTYNGNAWIDPAIARIVLQQAQQNPPQPEVTQLDNKIS LQISVDSSPDKESQPQDTIEPYTLTERELEVLQLIVEGCSNAVIAERLYITVGTVKTH VRNILNKLCADDRTQAAVRALRSGLVG" gene 20306..21076 /locus_tag="DP116_09090" CDS 20306..21076 /locus_tag="DP116_09090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_09090" /translation="MSKLPRKFSVLGLPVHVTTDYPGWLLERLQQGMGTHVVTLNAEM TMQAERNSSLAKIIHSAELVVPDGAGVVLYLRWLLQQKVQRTPGIELAETLLQELGQT QSEAKVFFYGGAPGVAAKTSEFWLSQIPGLVVVGTHSGFHSQQEEEQLRQTLAQLQPQ VIFVGLGVPRQELWIANNRHLCPQAIWIGVGGSFDIWSGIKTRAPGWLGDNNLEWLYR LYKEPWRWRRMLALPEFAFKALIYRFFQVSGVGVTSNL" gene 21412..22161 /gene="ftsE" /locus_tag="DP116_09095" CDS 21412..22161 /gene="ftsE" /locus_tag="DP116_09095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017803863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cell division ATP-binding protein FtsE" /protein_id="PRJNA477356:DP116_09095" /translation="MLLLTKRKAPEKSVLTQNSETKQHNGSVEFMVQLRSVSKTYANG CHALANIDLEVKKGEFLFITGASGSGKSTLLKLLYGDELPTQGDVIVNEYNMVPLRGH RLSFFRRRIGVVFQDYKLIPRRTVAENVSLALKTQGYTRKEIQRRLEPSLKLVGLHSK AECLVKQLSGGEQQRVGIARAIAGTPPILLADEPTGNLDPDNSWQVMQIFQKLNSFGA TVIVTTHDEQLVRRCNHRVMQMQDGRLYPRT" gene complement(22764..23348) /locus_tag="DP116_09100" CDS complement(22764..23348) /locus_tag="DP116_09100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865840.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09100" /translation="MAKHASLQSQTKSTIKDHKFSRNEFDSKNQSQSPVSQGQVEKVI DSKVVSVPTCVYLTLEDLKCFEAVERQYEQWGVIFKNCIAIQPSNPAFPTHSGPLVLM GAPKNGYIEATFLHPVHFVSAFVTSSQRLVLSAYGRDQQLLTQTVLPGANLANSDPGE SPNTLLSVSAKEIHRITFCAFDGQFTIDDFSFCY" gene complement(23997..24533) /locus_tag="DP116_09105" CDS complement(23997..24533) /locus_tag="DP116_09105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009344655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyruvate/2-oxoglutarate dehydrogenase complex,dihydrolipoamide dehydrogenase (E3) component" /protein_id="PRJNA477356:DP116_09105" /translation="MAIIDSKGRLFGKINILDLGAALVILLVILGIFIFPGGSGSVAQ VGGKTVPIEVDLIVRGLNVLDTKQLYARGFEKGGKTKVIIRNQPHGQIDIKSVEELPR TILVPQPDGSIKQLPEPERSNNFSKDLRLTLTGNAKITDEGPVLGNSKIKIGMPIELD GFNYNFNATVIDVRLKEN" gene 24804..25670 /locus_tag="DP116_09110" CDS 24804..25670 /locus_tag="DP116_09110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319436.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="M48 family peptidase" /protein_id="PRJNA477356:DP116_09110" /translation="MNRKRFSINLRFSQKPWFYPFISVIIATLICLVTPATTRAIDAG QLLPFILQGVQVIQLSNISPRQEVDIGKQINEQLLTSQVKLNRNSALNRYVQQLGQRL VANSDRPDLPYTFQVVEDDAINAFATLGGFVYVNTGLLKTADNEAELASVMAHEIGHI GGKHLVKQMRQKALASGLASAAGLDRNQAVAIGVDLALNRPRSRGDEYDADTRGLRTL TSSGYAPSGMVSFMEKLLKKGSSVPTFLSTHPATGDRIKALQSAINNLPKSGGAGLDN SNYQANIKALLR" gene complement(25687..26448) /locus_tag="DP116_09115" CDS complement(25687..26448) /locus_tag="DP116_09115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197012.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein phosphatase" /protein_id="PRJNA477356:DP116_09115" /translation="MSATSYRRIIIGDIHGHYEGLMTLLEAIAPTSDDQVYFLGDLID RGPQSAQVVSFVKDSCYQCLLGNHEQMLLNIFTNRHVSTSMLQGWLYSGGQATVASYQ QATIPHEHLEWLKTLPTHLDLGDIFLAHAGVDPSIPIAEQTVEQLCWVRDKFHSIEKP LLPDKLIIVGHTMTFTLPGVDPGKLAQGQGWLDIDTGAYHPRSGWLTGLDITNQLVYQ TNVYNHHFRTLPLKEAVTVVNPAKVEVRGYNKQRV" gene complement(26659..27450) /locus_tag="DP116_09120" CDS complement(26659..27450) /locus_tag="DP116_09120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877236.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09120" /translation="MKIWQADFYRRPQTEGSEQVMWELLICDATRSFEYQATCPQSQA SSSWVASQLQIAALEQLPDVIQVFRPQSLSLIEQAGRSLGISVEPTRHTLALKQWLQE KQYPVSLDKPPPMPLPENLWGEEWRFASTTATDVVEVFSDRLIPILEMPEYLLPIHLG LASTTPVPGVIIYGGRQSMRLARWLQQVHPIALNYVAGAPDGLVLEAGLVDRWIVATF EDQQVTTAAKVYEQRKQQSRGLHFLLVQPDDSGMTYSGFWLLRQE" BASE COUNT 7989 a 5822 c 5741 g 7991 t ORIGIN 1 gggtaaatac ggagtaggtt acattcgttt ttactatcgg ggttcaaaat cttggcttca 61 aaggtaaggg aagagttagc agacgagtgg gcggaagcag gcgctttcac ggaagaagcc 121 gatgacgccg acgagccgga agtcgaggtg gttgcaccga ccgaaggctt tacggaagat 181 tctgtgcggc tctatttacg tgaaatcggt cgcgtaaaaa tgatcaagcc cgatgaagaa 241 attgagttag cgcgtcgcat cgctaaaggt gacctggatg cgaagaaaaa gctgattcaa 301 gcgaacctga gacttgtgat ttcaattgca aaaaaatatg tcaatcgcgg actgcccttc 361 caagatttga ttcaagaagg aaacctgggt ctgattcgag ctgccgaaaa gttcgaccat 421 accaagggtt tcaagttctc gacatacgcg acatggtgga ttcgtcaggc gataactcgt 481 gcgatcgccg atcaatctcg tactatccgc cttccggttc acctctacga aaccatttcc 541 cggatcaaga aaacgaccaa gctgctttct caagaaatgg gtcgcaaacc cacggaagaa 601 gaaattgcca cgagaatgga aatgacaatt gagaaactgc ggtttattgc taaatccgca 661 caattgccta tttcactaga aacgccgatt ggtaaagaag aagattctcg actgggcgat 721 tttattgagt ctgatggtga aacaccagaa gatcaagttt ctaagaacct gctacgagaa 781 gacctggaaa aagtccttga cagtcttagc cctcgtgaaa gagatgttct tagattgcgt 841 tacggcttag atgacggtcg tatgaagacc ttagaggaaa ttggacaaat tttcaatgtg 901 acccgcgagc gaattcgtca aattgaggca aaggcgctgc gcaagttacg tcacccaaat 961 cgtaacagcg ttctcaagga gtacatccgc tagttatttg tcattatgtc atgagtcatt 1021 gattttagac aaatgactca tgagtgagaa aaaaagaaaa agctgtcaaa ccagtgctgc 1081 tttgagggta aaccgaagcc aggctactgg ttattgtgaa aaaacctgga agtattcata 1141 cttccaggtt ttttgtttat ttatgtattt attacgaagc aatccgtaac ctgaaaagtt 1201 acagaatgcc ccagaagtgt aggaaacctt gaccagagaa tagttcaata agagcagctg 1261 ctaaaaagcc aatcatcgcc aagcgaccgt tccaaacttc tgcttgaggt gtgaagcccc 1321 agcgccaagc attacggtca 
tcaattacag gggtaacggt ttttgttgcg tttgtcataa 1381 ctggcactcc aaactgaatt atttaaggtt tgttacctta cgtaaactaa tataacaaat 1441 tttgagaagt tgtgtcacat ttcccataaa attattgcaa tttttcaact gatttttaac 1501 tttccgtgaa gtaaatctct tcaaggtaat tggtatttcc cctttgttga atgaggtata 1561 agtatcatga aactaggaca gatatgcttc ctactactac cattactgat tccacaggtt 1621 ggtgtcgctc aaacgccaca acagaccaca attgcgccgc ctcagcttcc tgacaactta 1681 aaggtaccag caaatcaagt gctactgctt ggccagaggg ctacaggagt acaaatttat 1741 caatgcaaag cgaaatccag caacgccaac caatttgagt ggacgtttgt agcacccgag 1801 gctaagctat ttgacgcgca aggtaaaaac aatatccagc actacgcagg tcctacttgg 1861 gaggcaaagg atgtcaagag tgtagttcaa taacagaaat gattttttct tcacgactga 1921 gttgaaggat actttcacca gcgctatctt tacccaagat tggaacagtt tccactccaa 1981 ggcgtatcac tcgttgatga ttggtcacta cagctacctc agcacttgcg atcgctcgta 2041 ccattcctgc taaaatgtca gtcttattgg tgaatttaaa ggtctgagtt ccaatatcac 2101 tccgattccc tgcccgcaaa ccactcacag gaatccgctt agcgtatcct aattgagtga 2161 caagtaacag gttttccttt gtgtttaagc tgacacaacc aaccatttct tcttgctgac 2221 gcacgcgtaa accttgcaga cccatagcag cgcgacccat aatcgggagt tgattgtcat 2281 tcacctcaaa cttcaagact cgcccacctg aacttgccaa aaccagatgt tcgcccgcag 2341 tcgtgaactg agtacataac aattcatcgt cgtctttgag tttcaagata gttattccac 2401 gacgagtgag gttagtcaat tctgtcaggg acaaacgctt gattcttcct tgcttagtta 2461 ggagaataat ctcgccattt tctggatgat ctggcaatat aaagcggtta atcaaagttt 2521 cttgagcgcc ttgagcagaa tttgagagta aggtaattaa tggtgttctc cgtgcagaac 2581 gtccactgct gggagggata tcagccacat tcactgggta taccttacca ccactggtga 2641 gtaccagcaa ctcttttgct gtgtcagcca acacgctttg gatcacgaag tcagtatcgg 2701 aggttccgtt atcagttttt gtccttcggg atgaaggttg agaccgacgt acatatcccc 2761 tgtgggtaaa ctctaaaacc acctcttctg ctggcgcttc caatttagaa ttttcaattt 2821 tggattttgg attctctgtt tctttgtctc cttgtcccct tgtcgcctta tctccttgtc 2881 ccttattttc ttcctcactc atgacgccta tcctagtacg gcgatcatcc atgtatttgc 2941 gcttgagcga tcgcaagtct tttttgagcg atttcagtaa ttcccgtcta tcttggagta 3001 atttttgcaa cgaattaatt tgttcattga 
gctgctcata ctcattgtgt aagttttgct 3061 gttctaaatt tgttaaacgt cgcaatggca tagccaaaat agcatccgct tgcacctcac 3121 tcaaatccag tcggttttgt aagccaattt ttgccgtagt cccatcagcc gcacttcgta 3181 aaatctccac aacctgatct agcgcggaca gcgctttgag taaaccttca accagatgca 3241 gccgattttc tgccttagcc aattcatgag agtagcgacg gttgagagtt tgctctcgga 3301 aactcaaaaa ctcctgcaac agttgacgca agctcaactg acgaggttgt ccatctacca 3361 aagctaagag aatcgcccca aagttgcttt gcaatgcggt ttggtgatat aaattctgga 3421 gaacttcttg aggattggta tcgcgtttga gttcaatcac gacgcgcatt ccttggcgat 3481 cgctctcatc ccgaagatcc gcaattcccg ttaggcgacc ttgattcacc aagtccgcta 3541 ctttctcaat ccaaccagcc ttattcactt gataaggcaa ttctgtcacc acaattgctg 3601 tccgccgctt gttacccctg ctttgggcga cttcctcaat ttgggcaact cctcgcagta 3661 caatacttcc ccgacctgta gtgtacgctt ctttgattcc agcgttaccc acaatttgac 3721 cacctgtagg aaaatctggt ccaggaatga tctcaaataa tttttcgtcg gataaatcag 3781 ggttatcaat taatgctatc aacccatcaa tcacttcccc tgagttatgc ggtggaatat 3841 tcgtcgccat ccccacagca ataccagaac taccattgag taagagaaat ggcaactgtg 3901 ctggtagtac ggttggctct tgctgagaat tatcgaagtt accgataaaa tccaccgttt 3961 cttcaccaat ttccgccagc attccctcgt gactgatggg cgcgaggcgc gtttctgtat 4021 aacgcatcgc cgctggcggg tcattatcca cactcccgaa attaccatgt cctcccagta 4081 ggggatagcg actggaaaag tcctgtacca gcctgactaa ggcatcataa actgcttggt 4141 caccgtgggg gtggtactta cctaacacgt ctcccaccac acgcgcacac ttacggtagg 4201 gtcgatccgg cgttagaccg agttcgtgca ttgcatacaa aatgcgtcgg tgaactggct 4261 ttaagccatc tcgcacgtct ggtaacgctc gtccaacgat gacactcatg gcatattcga 4321 gataagaccg ttgcatctcg gtatgcaagg ctgtggtgat gacctgtccc tcacagagaa 4381 ggtttaactg ttttgccatg agtttttttc ctaaatttcc actacacaga atttgccgtt 4441 aatgaagact gcagcaacca cagcatcacg gatgaaatac tcttcattac acaatcttga 4501 tggtcacagg atagaaagga gtagtatcat ccttgttgac gaagatttac attgattatt 4561 tcaaataaag cacacaaggt tttaactagg tggtattttc cctttcactg acttgtgtgg 4621 ttttggttaa gatgcataaa caaaagtcaa attctcatga aaactgtttt gattgtcgaa 4681 gatgatctga ttaatgctcg cgttttttcc aaaatcttga 
ctaagcgagg tgggttgcag 4741 gtcaaacata ctgaaaatgt agaagaagtt atcaaaattg cccaatcggg ggaagttgac 4801 attatcttga tggatgtttc tctgtcaaga agtgtttacc aaggtaagtc tgttgatggt 4861 atcaaaatta cacaaatgtt aaaatccgat ccgcaaacag ctaccttacc tattattttg 4921 gtaacagcac acgctatgga gggcgatcgc gagaactttc tcaaacaaag tggtgctgat 4981 agctacatat ctaagccgat tgtggaccac caacagtttg ttgaccagat cttggcactg 5041 ctacctcaac agaacaacta aacgcaccct ttggggtata aaagaatgtt taccccacgg 5101 taaactgcac ctagcacatt tccatagtag tgtccttttt ttaagaacat acctggaaca 5161 gtagctcaac acacagattg cgattgggct actgtttctt ttacagaata atatatatgt 5221 actattagat gagattttca gcaagtggca atgggaaaga tgagggaggg agtcacagac 5281 aatttccctt cttccctcct tttcccctca tgctttgttc tctactgcgg aatcattcgt 5341 tgcctggtgg tagtagaatt tgaaggttct tcacgttgct ggagagcagc ttgaatgcca 5401 ggtaattcta agaatttttt tgtatcagac agcaggcgtg tgccccaaag attttgttta 5461 agaaaatcta aatcagcata gcgctcatca atgcggactg cagcttgtcc cattgctaaa 5521 gcttgtctct gatcgccttt ggcatataaa gctactgcca atgctaataa gggttctgca 5581 gcttgtttct ccatcccaac agcgtcttgc cattgcttaa ttgcgccctg gatgtcaccc 5641 tgttcataca aaattaatcc aacattgtta atggcaggcc agaatttttt gttaaaggaa 5701 acagcttttt tatactggat gagcgcgtct ggaagtttac tctgcatgta gtaagcattg 5761 cccaaatcaa acaatccctc tggatcattg ggctttaacc tcaaaccctg ttgatagtaa 5821 tttacagcct cttggtactt ttgctgctga aagtaagccg aacccaaagc aaactgaaca 5881 tcaccatttt tggcgttgag agattgtgcc tttttcaggg cgttaatact ctgatctaat 5941 tttttacttt gcaaatacaa gccacccaag agaaaccaca ctttatcgtt cttaggagcg 6001 agttgtgatg ctaaccgagc tcttggtaga gctagttcat attgttgaaa ctgcgcgagt 6061 tgtgctgctt cctttgccaa acctaagcct tgctgttcca acttcgctgc atccagttgc 6121 agtgtatgag gtaccagtgc ttgtgcattg ctaggttgag gtatgcacaa accacaaaca 6181 accaaaagag aaatcaaacc aatccgttta ggcacactac cgcctctttg ccaagaatga 6241 aggaatcttt ccgctagctt aaacgatttc cagtgttgac gacagcaaaa ttttacaaac 6301 aagagtacaa gcgtattccc ttactaataa ccaatgaaag ccacttgcac caactccgag 6361 ggaacgttct tataccttga gcatggctgc gccgtatacg ccgtgctcct 
aatgacactg 6421 gactttggcg aagcggcttt tacccatcct catcaccgaa tttgctttgc cgccgctgtt 6481 ttgtaataca gtcgcagctc ttttggatga ctactggatg acaaccgtgg cttagcttat 6541 ttacaaccaa ccataactta aacaatatca cgctatacca gtacccgagt taaattttca 6601 acgacaagct aatgatttag ataaccaacg tcagaatatt aaagatcagg tgcaactccg 6661 ttttcatcct ttttctctgg ctgtaggagt ctacattatt ccaaaaaatg gcaacttacg 6721 caagatttct agtgttcata ataactgcaa aagtcaccca ttcaggtgag gtcataacga 6781 agcatttacc aatattatga agaataatta agctactcta gattctatat gtttacatag 6841 tcattgattt attaataact ctatttcaga aatcactact cattctaaaa gttctaaaaa 6901 ctccaaatta ttagttttgc tcttccactg aaagacactc acttaaagct attttcgtga 6961 caagctatta tgactacata cttagaaaat cctgtcattg ataaaaaagt taaacaccca 7021 ctacgctggt taattggttt agccgcagct ggcgtattgg tgattggtgc aaccacaaca 7081 tacacagttg tcaatcgagg aacaagcaaa caagacatag ctgcactgac tgtcccagtg 7141 gaagtcaaaa atgtgactgt gcgaatttct gccagtggta aggtgcagcc tgttcagagc 7201 gtgaatatca gcccgaaaaa ctctggaact ttggtagagt tgtatgttga gcaaggcgac 7261 aaggtcagcc aagggcaaat tattgccaag atggacagcg caaatattca agcacgaatc 7321 gctgaggctc gtgcaaactt agcacagaac caagcacagt tggatcaagc cgttgctggg 7381 aatcgtcctc aagaaattac tcaagcaaaa gcacgtttag cacaagcaga ggcgcaactt 7441 gctcaagcac gtgcaggaaa ccgtccgcaa gaaattgccc aggcacaagc tcaagtagat 7501 gccgctcaag caaaagcgaa atatacaagc gaacaactca agcgttacca atctctttac 7561 cagcagggag ccgaaaaaaa acaattactc gaccaagcca tgagtgaaga taatgccgct 7621 aaggcaagtt tgcaagaagc tcaaaaacgc ctatctctac aacaaattgg cactcgttct 7681 gaagaaattt ctcaaaaaga agcagcggtt gctgaatcac gagcagcttt gcaattatcc 7741 caagcgggtt cccgtcctga agagaagcaa gcgcgtaaag cagctgttgc tgctgccgaa 7801 gctaagttaa aaacagagca agtgaattta gacaatacga ttatccgcgc tcccttttcc 7861 ggaattgtca cgcagaagta cgccaacgtt ggtgcatacg tgacaccaac aacctctgct 7921 tcctcaagtg catcggcaac ttctagttct gttgttgctg tcgcacgcgg cttagaagtt 7981 ctggcgagtg ttccagaagc tgatataggt agaatgaagc aggggcagca ggtagaaatt 8041 gtcgccgatg cctatcctga tcaagtattc aaaggtcatg tgcgcttgat tgctcctgaa 
8101 gcagtcaagg aagaaggtgt gacattgttc cagatccgag tggcaattga tactggcaca 8161 gacaaactgc gttctggtct gaacgtgaac atgacctttt taggagacaa ggtacaagat 8221 gccttacttg tgccaactgt agcaattgtg actgaaaaag ggaacacagg tgtgttggta 8281 ccagatgcaa agaataaacc ccagtttcac cccgtcacaa ttggagcaca gatcaaagac 8341 caaactcaga ttttagaggg actgcaaggg ggcgatcgca tctttctcaa cccacccgca 8401 gattacaaaa tccagcaacg ccagcaacag cagaagaaat gaacatccta gaaagtatga 8461 agatggcagg gaaaaccctg gtgtcgaata agttacgtag cgccctcacc atgctgggta 8521 ttgtcatcgg caatgcctca gtcattgcca tgattggggt tggtgaagga ggacaaaagt 8581 ttgtcaataa ccagttggag tcattgggac caaacgtcct atttgtgatt cctggtaatc 8641 gggaaactca gcgcattacc acgaaagtgc cgaaaaatct agtgctagaa gatgtggagg 8701 cgatcgcctc tcaagtgcca acagtcgcag gagtgtctcc tgagctaaac gggagatatg 8761 tagccaatta ccgtaacaga aacaccaatg tcaacattat tggcacaact cccagtttct 8821 tagtagtacg ggactttgaa actgcaaaag gtcggttttt taccgacatt gatatgaagc 8881 gcaacaatca agttgttgtg ctaggtgcca atttagcaga aagattattt ggtactagta 8941 accccataag tcagcagttg cgaataaaaa atgctagctt tcaagttatt ggtgtcctag 9001 aagccaaagg ctcaagcttg ggagctgatt atgacgaagc agcattggtg ccaatcacaa 9061 cctcagcaaa tcgacttgtc ggacggaatt ctccctatgg cattgcgtta gattacatag 9121 ttgcttccgc tcgtgatagc aacagtgttg atgcggcaga gtttcaagtc accaatttgc 9181 tgcgcctgcg gcacaaagtt accagcgaag atgactttac catccgcact cagaaggatg 9241 ctttgcaaac tgttggtcaa atcacaggtg ctttgacaat tatgctagct gctgtagcag 9301 gtatctccct atttgtcggc ggtattggca tcatgaatat tatgcttgtc tccgtgactg 9361 aacgtactca agaaattggt ctacgtaaag ccattggtgc aactcagcaa gatattttgt 9421 cacagttcat catagaagcc atcattctct cagccgcagg agggttaatt ggtactgcga 9481 ttggtgtgag cggcattatg gtggtatcag cgttgacacc tttaaaagca ggaatttctc 9541 ctgtagcaat tgctgtcgca gttggtgttt ctggtggtat tggtttattc tttggcgttg 9601 ttcccgcacg tcgtgcggct caactcgatc caattgtggc gttaagaagt gcttaaaaga 9661 attttaaccg caggcggttt taaactgaac aggtgactag agcggcatca cccaattaag 9721 taaactgata gtgggttagc ttaaattagg ttaaaaattt atatggcaaa tactcaatta 9781 
ctaactgact cccgcgttcc caatccggga aatcaaccag tcattattcg gttagaagat 9841 atttttaaag tttacggtag tggcgaagct gaagtgcgag cactcaacgg tgttaatctc 9901 actatagagg agggtgaata ctgttcaatc atgggaccat ctggttctgg taaatctacg 9961 gcaatgaata tcattggttg cttagatcgc cccacttccg gacattatta cctggataag 10021 cttgatgtcg cccaaatgga agatacaaag ttagcagaaa ttcgcaataa aaaactggga 10081 tttgtatttc aacaattcca ccttttatct cagctgacag cgttagaaaa tgtcatgttg 10141 ccgatgatat atgctggtgt gaacactaag gaacgccgtg atcgagcagc agaagctctc 10201 aagcgagtag gtttagaaaa gcgtctgaat aataaaccaa ctcaactgtc tggaggacag 10261 cagcaacggg tggcgatagc ccgtgctatt gttaaccgtc cagtcttact cctcgccgat 10321 gaaccgacag gcgcactcga ctcgcgcaca acccaagaag tattggatat ctttggtgaa 10381 ctcaatgcca gtggtatcac tgttgtgatg gtaacccatg agccagaagt tgctcgtcaa 10441 acacaacgtg ttgtttggtt tcgtgacggt gatgttgtac actctcacct cagtccatct 10501 gatgtgggtc atttggcggt gtcttagctt gctaagtatg cctcataata aaaaatgcaa 10561 cacacatcac atttatagac tttgcaatta attcgctagg atgcgttaag cttgcctaac 10621 gcaccatcac cccctactga ctaaagactt gaggttttgc cagtttgggc ttgagtttca 10681 atcggcttaa cgtcgctgta gcccatatca ccaacaactg tccgcaacac agatatttgt 10741 ccttcaaaat ccagtgcttc gatttgggaa agcacattgt taactgcgtc agtcgctttg 10801 tagttttctg gtatgtcaac gacttgatca cccatttttt gggcccaatc aaaccagacg 10861 aatagctgat tgttttcttt tatagcacca taggcacgag aatattctga atcttcgcgg 10921 ttgacaattt gccgcataat tgccaattgt tcatcatcag ataatttgta gtaatcgcct 10981 aagagtacag gtgctagttc tggttctgca gcagggggag cagcgggagt aattgagtca 11041 cccattttct catacacaaa atacagccaa gccagtttgg catctgtgtc taagcggtta 11101 aatgcctcta ccactttttg agtttcatcg cttagggctt gaggaacatt ttgatcgtaa 11161 cttgatgtca taatttttct ccaaagttat ttcacagatc aaacttaggg gtatcagagt 11221 ttcagaatca gtatcgacta ccaagagaaa gagttttaaa aattcgcaaa ctaggttttt 11281 atctgccttt tagtagaggt aaaaatctat ccacagatac caaggatgga agaattgact 11341 ttgatacaat acgccattga cagagaaata cttggaggag ttaattatgc ctgaagaagt 11401 gaagcccaat ccatcggaag ctcctaccca ggatgcacaa ttagccgctg 
aaaacatcgt 11461 tagtggtcaa gaaaaagcac caagcgtcga tatagaaaaa gattatcaag ctgcacagca 11521 attcagcgtc agtgaaattg accgcacagg tgaaggtgcg aaagcggccc aaaaggcgac 11581 tgcacctaag caagaactgc atgacccaga acagacaaaa atacaagcca actcaactgg 11641 taatcctgat gattacatag atatagcaaa agaaattggt ggttccaaaa ctgaagctgt 11701 aaccaacgtc actgatgatt tggtagaaaa agcaaaggaa aagggtcaat caaagtaatt 11761 ttgattcaat ttgacaaaaa atcaaataag ataagcgttc aaactgtatt taggattatt 11821 tcactctttt ctcgtttcta cagtatgcgg aaaacccctt cgggagggct acttcctccc 11881 gaaatttact ataaatataa tttctctcaa gcaagaaatg agcaattcaa tcaaaaatta 11941 ataactctta ccacatctaa cagttgactc gggtgatgaa ttaaaaaatc tggattttgt 12001 tttgctaaga cttcttgtga attgaacccc caagtgactg caatcacctt aatgtttgct 12061 ttctttgatg cttctacgtc tctggtttca tctcctacat agatcacttc ttgaggttta 12121 atctgttttt gctttaatac gttattaatg attgttgttt tgccgaaaat tgtcactcct 12181 gaatagataa actcaaataa gttatctaaa tcgttgactt tgataaaatc tgtcacattg 12241 tcttgagaat tagaggttat gattcctagc ctataacctt cattgttaag cgccactaag 12301 gcttctttaa ttcctgaaat tggttttaat tctttgattt tattttttaa ctcagctttc 12361 acttttttga ccaaaaaagg tatttttaaa actgaaatac ctgagtattt aatgatttcc 12421 ctagagctta agtttctcaa gaggctaagc tcctcttggg tgattggtat atatccaaac 12481 tctccagcta aacgattggc aatacttacc aaagcatcta ctgtatcagc aaccgtgcca 12541 tcaaaatcaa aaataattac tttctgagtc attctttcgc ctgcgggaat gcgtctaaat 12601 tagcacgggc attccgtcgc aacatatctg gcttaatccg tcgcaacgct gatgccggaa 12661 actgtctatc ccattcctga tctgagattt gggctaattc taccagtttg ggcgcaagat 12721 tcctaggata cggctcaaac tctgcaacat ctgtttcttt ggcaaaacgc tgattccaag 12781 gacaaacttc ttggcaaata tcacaaccag caacccagcc ttgtaagtgg gatgtcacag 12841 tttggggcaa tttttcattg cgattttcaa ttgtgtggta agcaatgcag cgattagcat 12901 ctacgacaaa cggttgggta atagcacctg ttggacaagc gtcaatacaa cgagtacaag 12961 taccacagtg ttctgtatgt gggcgatcgc tctccaattc caaattcgtc agcacttccc 13021 ctaaaaacac ccaagagcca tactcctggg ttatcacatt tccgttcttt gcaatccaac 13081 caagtccagc tttttgcgcc cagactttat 
cttgcactgg acctgtgtct gcatagtatc 13141 gtgcttgaat tccttcatca agtccttgca gccaggttgt cattgcctta agttttttgt 13201 gcatgacttt atggtagtct cgtccccaag cataccgaga aatttttgcg tactcccttg 13261 ttccaggacg ttgatgatgt gtgtagtagt tgagggcgac actaattatc gctcgcacct 13321 ctggcataac cttacgaata tcttgacgtt taggatttgc catccattcc atatcagcgt 13381 gataacccag ggctagccaa gcttgcaatc tttgcgtctc tgttatattt tctccatcta 13441 tcgctgcaat cccaaccttt tggaatccca actctagagc tttttttttc acctcactgc 13501 tatttgttcg cggggattga ttcatttatc ttattttaat ttacctaata ggtactattc 13561 ccattgtaca ctgttgagtc aaagttgtga ctgacatggg cgttggtcta ctatcacgct 13621 agatgatttg tactcgttta ggaaatatac ctcaaaagta gctttcgagt gcggttgctt 13681 ttgtattcaa atgaatgaat tgttagctat cgtgatagag ggaaatctat gacagctgct 13741 gaatctacac ccacaacagt agatgaaaaa ttccagatat cgggaataac agaattaact 13801 ctactgcatt actttcaaac tttgaatgca ggaaaatttg aggaaacagc tgctttgttt 13861 gcagaagatg gtgttatgca tcctccattt gaatcgggaa ttgtgggacg agatgcaatt 13921 actcgatatc tacaacaaga ggctcaaaac gtcaaagctt accctcgtga gggtgtcgtc 13981 gagactttag agggggagca aatccaattt caggtgacag gcaaagcaca aacttcttgg 14041 tgtggtgtta atgtcttgtg gacatttatt ctcaatcaac aaaaagaaat cctttatacg 14101 agaataaaac ttttagcctc tcccaaagag ttactcaatt tgcggcgatg agtaagcact 14161 tgacactccc ctgacctaaa ggctagggga ttctacagtc attgtcgagg cttgctcaga 14221 caggttttca ccaatctgag tacagacctc gaataaatgt agagcacggt ttatgcactt 14281 ttactacaga tcgcgtgtat tgcactcaac accagaagcg ctgtagccct ttgagctaat 14341 gagatgtagc aaaagttata aattttgtta atttatttta aatattatga attttgtatt 14401 ttgaattgcc tcgtctgtgc acaagcacag agagtgtgtt gttaagcgct gactttggtg 14461 gtaatcggca cactcatctt ttgtgacaaa gctagagttc aactaaatag ttaaaaacct 14521 tgctaaactt atctacagtt gcttgatttg atgagctata tgtggaaagt tgcagagttt 14581 ggagatttaa taggtgtttc agcttctact ttacgtcgtt gggaatctga aggtaaattg 14641 attcctgaac gtactttagg aaatcaaaga atttataccg aacaacatct taatttggct 14701 cgtaatctta aatctggcaa atacccgaca cgagtaatca tttactgccg ggtttcttcg 14761 catgggcaga 
aagatgattt aactagtcaa gtaaactcta tggacaagtt ttgtgttgct 14821 aatggtgtag ttgtcactga ccgtattgaa gaagttggag gaggcttaaa ttttaagcgt 14881 aagaaatttc ttcaaatcat tcaatgggct attcaggggg aagttaaatc agtctatgta 14941 gcgcacaaag atcggttgtg tagatttggt tttgatttag ttgagcaaat tataatatgg 15001 ggtggcggaa ctgttgttgt agctaatagt gaagctttat cacctcatga agaattagta 15061 gaagatttgt tgtctataat acattgtttt agctcacgct tgtatgggct acgcaaatat 15121 aaagacaagg tgaaattaat cgctaatggt attgaccctt gctcaaactc ggttatactt 15181 cattaagtaa gcaagtttta accagtctag caatgcatgc aatcaaacgg gagttaaagc 15241 taaataaaaa agaaatctct caaatgcgtg gcaatgctgg ttttaagaga tttgtttaca 15301 actatggact agatttaatt atttctagtt ggtcgtttga ggatattaaa gcaagtgatt 15361 ctaaaagaat agatgcgatc aaaaaagtgt ttactcaagt cacaatgcga aggactgaat 15421 atacatggat gaagcaatat ccatcaacag tttatcaatc tgcatttatc gatttgaaga 15481 atgctttttc tagatggcgt caagggctgg caaaatttcc tgttaaaaaa actaaaaaga 15541 aaggtgactc gttcacagtt tacaaaaatg ctggggttta tccagaaaaa gggaagccag 15601 ctttcccttt tactaatcgg gttgttattt gtcctggtaa aataataaaa ttaccaggat 15661 taaagcaagt cagattaaaa gaaagaatta attttctctg tagttctcaa acgtttactg 15721 tttccaggat tgctgataga tggttcgtct gtttcgtgtt agatgctgaa aaagtaccac 15781 caagaattca ttctattcat aaaataggtg tggatttggg tgtaaagtgc ttagctactt 15841 gttctgatgg ttcaagatac gaaatgcctg ttaccacttg ccaagcgaaa atcaagctag 15901 gtaagcatca gtggcgcaat cgtaataaaa tcatgggaaa caagaaatta aaaattaaag 15961 catctaacaa tgctaaaaaa tgcttgaatc aattgtctaa gcaacatgct catctagcga 16021 atattagaaa agataccact caaaaaatga ctactgattt aagtcgaaaa gcttacatca 16081 ttaggattga agatttaaat gtagttggca tgattgctaa tcagaagtta gctaaagctg 16141 tttctaataa ttgcttttac gagattcgta gacaattgat ttataagcaa tctcattatg 16201 gaactaaagt agaattggtt gagagatggt ttccatctag caagatgtgt tcaaaatgtc 16261 atcatgtcca accaatgaca ctagaagaca ggatttttaa ttgtcaaaaa tgtggtcaga 16321 ttcaagacag agatgaaaac gcatcgaaga atcttgaaaa tgctcctttg gacaaaatac 16381 ggttggctta accgaaattt acgcttgtgg acaagaagga gccgactccc ttggttgaag 
16441 caagaagcag accctctaaa ggaaacttta gttgaatggc tagcttgaag taagtttggt 16501 caagttctgt atagcagcag ttattattta tgccgtagtg gtaatttttg gcaaagtaac 16561 ttctagcttc aaacagttac caccatcaac ctgcagctgg tactcaactt tgtccatgag 16621 acgatgcatg atcagccaac cgtagccacc ttcttgttga tcagcaggac ttgggggcga 16681 ataatcagat aaattgaagc ctttgccgta atcccaaact tctaatgcaa tatcctggtc 16741 tttcaattct aaacgaatga gaactggtaa tagtggttga tccttatgag catgacgtac 16801 cacattagag taggcttcca ccaaaaccaa ccgcaagcga cttgattgct ttgaccaatc 16861 cactgaacct ttgaactcaa cttctaagca acttagcaac cagctttcta cgatggttaa 16921 aaacttcaag tcacttggta catgaagctc gcttttcatg acttacaaaa cctccagcga 16981 aagtatagtt tggtcatctt cttgaactcg gttgtctgct tgaatacgcg ctagtaagtg 17041 gttaagatga agtggttggg cttccctatt tagcagttgc cacaaaccct cctgattcag 17101 catagaacgg ctgactggct gggcgctatc cacaatttct tgtgcaaaat ctttagtatt 17161 taatacttct gcttcagtaa taccatcact agctagcagc aggatatctc cgggagcaag 17221 aaccaactga ctagacataa cctgccatac aggcaagata cccagaggaa cgctgcgtac 17281 cttgagatat ttgggttttt tttccacagt ctcttggtat gaccagagca agggatatat 17341 atgtcctgcg ttagcataaa ctagttccct agtattggga gtatagcgag ctaagacgac 17401 ggtgataaaa caattgttgc tgatcaaatc atcgctgaga gtatgattga gattttgcat 17461 gaccacattt ggtagaggcg gtgattcttg agacaactct cggcgcaaca ctgagatagc 17521 actagccatg aataaagccg ctgggacacc cttaccagaa acgtctccca cagccaacca 17581 caagtctcct tttggatgga caaacacttc aaaaaaatcg cctcccactt cacgagcagg 17641 gtagcaacag gcttgtaccc tcacaccgtt gatgttgggc attgtttgac ggagtaggtt 17701 attttgaatt tggcgagcaa cttccaactc agcgcggatt tgctcttgct tctgctgaag 17761 acgttggtag agttttgctt gagagagcgc aagagctgct tgttcagcaa cacctgtaat 17821 cagctcgata tcttcttgtt gccaagaacg agcgtggtga tgctgcttga gagcaagaat 17881 agcgagcagg ttttgctgcc atattagagg cacaaccaat ttctgataag atttctcatc 17941 atgtatgctt agggcttttt gtaattgacc ggtttcaaga acctcgctaa tcaaaagatt 18001 taggtcgaaa tcacaacttg aacttaggga ttttggatct tggtaacaga actggtgtgg 18061 tgttaggcga ttgccctcca caggtctaag cacgcaataa 
ctggcttcaa aagtctgtcc 18121 taccgttgcc acaatttttt gtagcatact gttgtaatcc agagactctc gaatcgctgt 18181 tgtgactgca ttaaataaag attctctccg caaggcgcga ctcaactctt gtgtgcgttt 18241 tttcacgact cgatatatat cagctgcttg ctcaacaact gctttgagtc gctcaggttt 18301 ccaaggttta gtaatgtatt tgaagacctg accagagttt atcgcttcta ctaaatcttc 18361 gacatcagta aaaccagtga gtaaaatccg aattgtatca ggaaagtgct ccacagttct 18421 acccagaaat tccgtgccat tcatctctgg cattctttgg tcagagataa tcacagccat 18481 ctcgccttct tgatccaaga tgtctaaggc agtgagggca tcacttgctt tatatacttg 18541 aaaatctcgc cgaaaagtac ggtagagtaa gtgtaggttg tcaggctcat catccaccac 18601 caggagcttg agctttccag tctctatctc agtcatactt aatttgactt tcccaagact 18661 ttgatactaa ggcaactatt tgctgtttcc atatcatcat tgaagaacac tccacaaaaa 18721 caaaagtaaa taaaattcgt ccagtttaaa cccctgattt ctctaatttt tcaaaattgt 18781 ctctaaattc ggggaaggtc ccctaagagt tttccatagt ttcatagaaa ctctatccca 18841 ctagtccaga acgtaatgcg cgaaccgctg cttgtgtacg gtcatcggcg cataacttgt 18901 tcaaaatatt tcggacgtgt gttttcacag ttccaactgt gatataaagt ctttctgcta 18961 tcaccgcatt gctacaacct tctacaatca attgcaaaac ttctaattct ctttctgtta 19021 aagtgtaagg ttcgattgtg tcttggggct gactttcttt atctgggcta ctatcaacag 19081 atatctgtaa actaatctta ttgtctaatt gtgtgacttc gggttgtggt ggattttgtt 19141 gagcttgttg caatacaatt cgggcgatcg ctggatcaat ccaagcatta ccgttataag 19201 tcactcgcac cgcttcaatt aaattgtcaa acttaatatc cttcatgcag taagagtcag 19261 ctcctgctgc aaaagcagct agcaccgctt ctttattatc ccgtagggtt aaaatcaata 19321 ctcttgttgc taattcttcc ccatcagttg tagattttat ctcgcgtgtc agttcaattc 19381 catctttgtc tggtaaacca atatctacaa tagcaacatc tggttgtagc gtatttaaca 19441 ttttcaaacc ctcgacagca ttggtcgctt cccctacaat ttcaatttct tccttttgtt 19501 ggagcgctgt ccgaataccc acacgggtca ggtcatgatc ttcaatcaaa gcaacacgaa 19561 ttttagtcat ggtctattac cgcccttata ctcaaaagct gtaaaattga gtttacaaaa 19621 aagtctcgga agatgcagtt taaagatgcc tgtgaaaaaa tataagagag ctttttgagt 19681 tagactaaca tataaaggca tccactcctt gccacagata tttgataaga tacttaagcg 19741 ctaatcgccc actacttgtg 
tgttagacta agatacgctg gagtttataa ctctctttct 19801 aaaacattag cttcaacaat aaaaataaaa acgcgaagtg tttctcctgc ttaggagtac 19861 ctgagattga tagattctat tgtgcatcct gatttctata gggagtagcc taacatctaa 19921 atagatacca caagtcatgc cgcttcattt ttcttgagac ttaacaggac ttccttgcca 19981 gcaggatcgg gttgcggctg aagtctggcg gacagaatca atctttctga aggcgcgagc 20041 ttgacgtttt gtgcagtttt ttttcacggc aggatttact actttgactc gtaagttttt 20101 tgtgcgttag ttttggatac atatcataat ttctttgtac agtgcatcag ttataacact 20161 gagttatgag caacttctta gaatcaccat agcttcccct cacaacatct attttgtaaa 20221 ataccctgat ttaagcttct ggaaacaatt tgataagcta ttgttctatc tatgtagttg 20281 gtgcgaggct cataaagaat aggctatgtc taaactgcct agaaaatttt cagtattagg 20341 actaccagtt catgtgacaa ctgactatcc aggctggttg ctagaacgtt tgcaacaggg 20401 tatggggact catgtcgtca ccctgaatgc agaaatgacg atgcaagcag agcggaattc 20461 ctccttagct aagatcatac acagtgctga gttagtggtt ccagatggag caggcgttgt 20521 tctgtacctg cgatggctgt tgcagcaaaa agtccagaga actcctggaa ttgaactagc 20581 agaaacactt ttgcaagaac ttgggcaaac acagagtgag gcaaaggtat ttttctacgg 20641 aggagcgcct ggtgtagccg caaaaacatc agagttttgg ctttctcaaa ttccaggttt 20701 ggtcgtagtc ggtactcact ctggctttca ttcacaacaa gaggaagaac aattgcgaca 20761 aactcttgcc caactacagc cacaagtgat ttttgttggt ttgggagttc cacgtcaaga 20821 gttatggatt gctaataatc gccatttgtg tcctcaagca atatggattg gtgttggagg 20881 aagttttgat atttggtcgg gaataaaaac ccgtgctcct ggttggctgg gagataacaa 20941 tttagaatgg ctgtatcggc tctacaaaga accttggcgc tggcgaagaa tgttggcttt 21001 gcctgaattt gcctttaaag ccttaattta tcggtttttt caagtgtcag gggtaggtgt 21061 aacgtcaaat ttgtaaacta ttgttagttg tccactataa ccctgcctcc ggcttccctc 21121 tccaaaacat cccagaggga ttgagggtga ggtctgatct tatatttaat tacgcctacc 21181 tacttagcag tcctaaatca tttgtgaaaa acaagatccc cgacaacttt ggcgaagtcg 21241 gggatctgtg tctctcaatt tttacaaatc aaattgctat aactaccaac gattaagtac 21301 tccgaaaaat tttctactac cgaacaattt tgttttggat tcaaggttgc cagatttcac 21361 ttgggaatac tgattgtatt caatctcaaa tctagaacta ctactcctcc aatgctacta 21421 
ttaaccaagc gaaaagctcc tgaaaaatcc gtcctgactc aaaatagcga aacaaaacag 21481 cacaatggta gtgttgaatt catggtgcaa ttgcgctctg tgtccaaaac ctacgccaat 21541 ggttgtcatg ctcttgccaa tatagacttg gaagtcaaaa agggtgaatt tttgtttatc 21601 acaggagcaa gcggttctgg taaatcgacg cttttaaagc tgttgtatgg agatgagtta 21661 ccaacacagg gagatgtgat tgttaatgag tacaatatgg tacctttgag gggtcatcgc 21721 ttatcatttt tccgccgacg cattggtgtt gtgtttcaag actataaact tattcctcgg 21781 cggacagtgg cggaaaatgt tagtctcgcg ctaaagactc aaggatatac gcgcaaggaa 21841 attcaacggc gtttagaacc tagtttgaaa ttagtgggtt tacattccaa agcagaatgc 21901 cttgtaaaac agctttcggg tggagagcaa caacgagtag gtattgcacg agcaatagcg 21961 ggaacaccac ctattctttt ggctgatgaa ccgactggaa accttgatcc agataattcc 22021 tggcaagtca tgcagatttt tcaaaagtta aattcttttg gagcaacagt gatcgtgaca 22081 acccacgatg aacagttagt ccgcaggtgc aatcatcgag tgatgcaaat gcaagatggg 22141 cggctttatc caagaactta aaggtgaaag ttttatattg ataccctccc tcaccaccaa 22201 gccaagtttg gtgagggatt ccaacagttg tccgtggaaa aaacctctac aaaacacgca 22261 atccccagcg tcttcgttag tcgaatcaac acgctcaggc tagctttgtt ctaaaaaaac 22321 atttgaccca atatttaggc gcattactgc caaatactat gattgcacat ttgcttttgt 22381 tgttcccagc aatttggcaa aacctcagtt atcttcctct gtatgtgcgc tatataggaa 22441 attggttaca cagtcgtagc cacgacaaat ctagccttgt ttcagttgtc tgttatgtgt 22501 cctcgttagt cggatacact attaaggctg actttatctt taaacaaatt cgagggaaaa 22561 tgtttgggta tcgccccgcc attcaccaac gaccaacctg cgggcaagat cgcagtactc 22621 tggcggcttt tgataacgaa agaggagtgt agggagatag gtgatccaga aaaattgatc 22681 atatttccta catccttaca cttatattcc gttttaataa aacccttgca tcaattttgc 22741 acaagagttt ttagattaac tcgttaataa cagaaactaa agtcatcaat ggtgaattga 22801 ccatcaaaag cgcagaaggt aatacggtga atttcctttg ctgatacaga taataaagtg 22861 ttgggagatt caccagggtc agaattagca agatttgctc ctggtagtac cgtttgtgtc 22921 agcagttgct gatcgcgacc gtatgctgaa agcacaagcc gctgtgaact ggtgacaaaa 22981 gcactgacaa aatggacagg atgtagaaaa gttgcttcta tatatccatt tttaggtgct 23041 cccattaaca ctaacggacc agaatgagta ggaaatgctg ggtttgaggg 
ctgtattgct 23101 atacaattct taaaaatgac tccccattgc tcatattgtc gctccactgc ttcaaaacat 23161 ttcaaatctt ccaaagttaa gtagacgcac gtgggtactg atacgacttt gctgtcaata 23221 actttttcca cttgaccttg cgaaactgga gattgacttt gatttttaga atcaaattcg 23281 ttacgagaga atttatgatc tttaatggtt gactttgtct gcgattgaag actagcatgc 23341 tttgccacga caccccgctt tgtcattatc tagagaacag ttagatgatt aacgacataa 23401 aaatttattg gttttgatca gctgcataag attctaggta caaaaactag ttgctcagct 23461 agcagcatga agcacacccg ggtttaccta cttaatatat tcaatgagtg tttacacttt 23521 tctggacgct tactcatatc agtgatgttt ctgatagagt gccataacaa attaatgtgc 23581 gctggaaatt ataagtgata gttgctacta gtattaagtg tagattaagg gtatattttc 23641 aaagatatct caaagcacaa aataatttta tcgtcataaa gtttactaaa aaacacagta 23701 tatatgtata tttgacatac acttgattag aggtatgaac gagtagtgtg tgaatgttat 23761 ctctaagcgt aagtaaacta ctgaaggtga ttgggaaact ggaagatgaa gaggtaaaaa 23821 agtagggttg attgcaagac acttgcaatc aaccctattg gatggtcgag gaaccattgt 23881 gactcctggc ttttgaacgc cagaaacgac gccaggtgtt aagcttgtac tttattaagt 23941 tccctcctgg gcgtgcctga ccacggcgct gccttctgtt gactaacaaa tgcaagttaa 24001 ttttctttta atctgacatc aatgacagtg gcattaaagt tgtagttgaa gccatctaac 24061 tcaattggca ttccaatttt tattttgcta ttacctagaa ctggtccttc gtcagtgatt 24121 tttgcattgc ctgtcaaggt taaacgtaaa tctttactaa agttgtttga tctttctggt 24181 tctggcaatt gtttaattga accatctggt tgaggaacta atatagttct gggtaactct 24241 tccacagatt taatatctat ttgaccgtgg ggttgattgc ggataataac tttagttttg 24301 ccgccttttt caaatcctcg cgcgtataac tgtttagtat cgagtacgtt taatccccga 24361 acgattaagt ctacctcgat aggtactgtt ttaccaccaa cttgagcaac ggaaccggaa 24421 ccgccaggga agataaagat gccaagtatc actagcagaa taacgagtgc agcacctaaa 24481 tcaagaatgt taattttacc gaataaacga cctttggaat ctataatagc cataacaaat 24541 ctttccaaat aagagcaaag ttgctgattg taacagtatg gtttattaaa gcacagagga 24601 ttctgagtga ctgcgttagc agtgaatgtg cttgggcgtt acgatcatgc gacagtgtat 24661 cacaacttct tagttttaga tggacttttt gctattttaa gttgctgtca cacagataga 24721 tatatcctta tgcaacttac acggctatga 
atccgtccca tagttttata ttccttcaaa 24781 cctcttttca agccggattt gtgatgaaca ggaaacgctt ttctatcaat ctccgttttt 24841 ctcaaaagcc ttggttttac ccgttcattt cggtaattat cgcgacactc atctgcttag 24901 tgacaccagc gacaacaaga gcgattgatg ctgggcagtt gttgcctttt attcttcagg 24961 gagtccaggt tattcaattg tctaatatat ctcctcgcca ggaggttgat attggcaagc 25021 aaattaatga gcagttgcta accagtcaag tcaaacttaa ccgcaactca gcacttaatc 25081 gttatgtaca acaacttggc cagcgtctgg tagctaatag cgatcgcccc gatctcccct 25141 atactttcca agtcgttgag gatgacgcta ttaacgcttt tgccacttta ggtggttttg 25201 tgtatgttaa cacaggtttg ctgaaaactg cagacaatga agcagaacta gcaagtgtta 25261 tggcgcacga aattggtcat attggcggaa aacaccttgt caaacagatg cgacaaaaag 25321 cacttgcaag tggtttggca tcagcagctg ggttggatcg taaccaagcc gtggcgattg 25381 gtgtagattt agccctcaac cgtccccgta gtcgtggaga tgaatatgat gctgatacta 25441 ggggattaag aactttaaca agttctggtt atgccccatc agggatggtt tcctttatgg 25501 aaaaactact gaaaaaaggt tctagcgttc ccacattttt gagtactcac cctgcaacag 25561 gcgatcgcat caaagctttg caaagtgcta ttaacaacct acctaaaagt gggggtgccg 25621 gtttggataa ttctaattat caggcaaaca tcaaagcttt gctgagataa gttattataa 25681 ctcctactac actcgttgct tattgtagcc tctaacctcg actttagctg gattgacaac 25741 agtcactgct tcttttaaag gcaaggtacg gaagtgatga ttataaacat ttgtctgata 25801 aacgagctga tttgtaatat ctagtcctgt cagccagcca ctccggggat gataagcgcc 25861 agtatcaata tccaaccatc cctgtccttg tgctagttta ccaggatcca ccccaggtag 25921 cgtaaaggtc atagtatgac caacaataat cagtttatca gggagtagtg gtttttctat 25981 actgtgaaat ttatcgcgca cccagcaaag ctgttctaca gtttgttctg ctatcggaat 26041 agagggatca acacccgcat gagccaagaa aatatctcct aaatcgagat gtgtaggtaa 26101 agttttcaac cattccagat gctcatgagg tattgtggct tgttgataac tagctaccgt 26161 tgcttgccct ccactgtata gccatccttg taacattgag gtggaaacat gtctgttggt 26221 aaaaatgttt aataacattt gctcatgatt gcccaacaaa cactggtaac aactgtcctt 26281 gacaaaactc acgacctgtg cactttgagg tccgcgatca attaagtctc caagaaagta 26341 gacttgatca tctgatgtgg gagcgatcgc ttccaacaat gtcattaaac cttcataatg 26401 accatgtata 
tccccaataa taattcggcg gtagctagtt gcgctcatga gctttttggt 26461 tggataactt tacctaatac tgctgctaca ctgtaactgt tgctaatttc gtaaatatgg 26521 ctaaaaaaag ctgaaaaaca taaatgtaat atataactac tttaactctg gtgagtacat 26581 accgttgtat tatgccaaac aataaactca cagatgtgaa agtgaaattg cgcttcgtac 26641 actgctagag gcaatcactc actcctgacg taacaaccaa aagccactgt aagtcatccc 26701 agaatcatcg ggttgcacta acaaaaaatg tagtccacgg ctttgctgct tccgttgttc 26761 ataaacttta gcagcagttg tcacttgttg atcttcaaat gtggcaacaa tccatctgtc 26821 aactaagcca gcttctaaca ccaaaccatc aggtgcgcca gcaacgtaat ttaatgctat 26881 gggatgaact tgttgcaacc accgtgccaa acgcattgat tgccttccac cataaataat 26941 gacaccggga acgggtgttg ttgatgctaa accgagatga atgggtagga gatattctgg 27001 catttctaga ataggtatta ggcgatcgct aaatacctct accacatcag ttgcagtcgt 27061 ggaagcaaaa cgccattctt ccccccaaag gttttctggt aatggcattg ggggtggttt 27121 gtccaaagat acaggatatt gcttttcttg caaccactgt tttaatgcca aagtgtgacg 27181 ggttggttca acagaaatgc ctaaactacg tccagcttgt tctattaaac tcaaagactg 27241 aggacgaaac acttgaatga catctggtag ttgttcaagc gctgctatct gaagttgaga 27301 agcaacccaa cttgaagaag cttgtgactg gggacaagtt gcttgatact caaaactgcg 27361 agttgcatca caaatcagca actcccacat gacttgttca gatccttctg tctggggacg 27421 acgataaaaa tcggcttgcc aaattttcat gggaggtacg catcttaaca attcaaaatt 27481 caaaattcaa aattcaaaat tcgtcttgaa aagttttgcg ccgggaaacc ccttcgggtt 27541 tgc // LOCUS NODE_1071_length_27261_cov_5.02187027261 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 27261) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 27261) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..27261 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..807) /locus_tag="DP116_09125" CDS complement(<1..807) /locus_tag="DP116_09125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130889.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09125" /translation="MTKTPEYCYQTAQRPKQGQEIYLQQWQKIGKIVPFELKNQDMSD PSGMPAARLCPSDTLRERRTQSPTEGNPPAALSHRFLISEKLYGRQHEVETLLTAFER VRDSGTSEMMLVAGAPGVGKTAVVKEVEKVIIAQQRSYFIQGKCNVLQRDFPLSGLIQ AVEDLIGQLLNETDAEIQQWKTKILSALGEQAQIIIDVIPKLELIVGKQPVVTELFGT NAQNRFILLLQKLLKIFIRKEHPLIIFIDDLQWVDAQNLKIIQFLMSGNHI" assembly_gap 1101..1110 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1375..3000) /locus_tag="DP116_09130" CDS complement(1375..3000) /locus_tag="DP116_09130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315299.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycoside hydrolase" /protein_id="PRJNA477356:DP116_09130" /translation="MRLFYTRKRENLKFNHQLLIMTVSTVVFLTNIMSELPAQSQVNE INTEMIMELPLSTRGAKIIDAKGRQVLLRGVNWFGMETETHVPHGLWKRDYKEILTQI RTLGYNLIRLPYSVQALVSPNISGIDFSIGSNKEFEGKTPIEVMDLIIQEAERQGLLV LLDNHCLNDKRIAELWYEDNFTEADWINTWTMLANRYKNQTNVVGADLKNEPHGKASW GTDDLATDWRLAAERAGNAILEVNPNWLIVVQGVEKNVPTQKLPNHWHGGNLEGVKRY PVRLSRRNKLVYSPHEYGPGVADQAWFSDPKFPKDLINRWQIGFHYISSQNLAPIFIG EFGGRKVDTNSKEGIWQNEFVQYIKQKQLSFAYWSWNPNSSDTGGILLDDWQNVDIPK QQLLSQMLPVSFSQVAPVQVEEDTGKIIPPISPSQNPLSPLYRISSSSQLTVTSDIYA NWQTGFCVSFKIMNQGNTKVNNWQMTFDMKQAAINNSWNGNFKPQGATQYTVTPLDWG RIIEPNQVRDVGFCANKLGSQYQPTEVRVKSQR" gene 3727..5037 /gene="hisD" /locus_tag="DP116_09135" /pseudo CDS 3727..5037 /gene="hisD" /locus_tag="DP116_09135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139087.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="histidinol dehydrogenase" assembly_gap 4786..4795 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 5432..5851 /locus_tag="DP116_09140" CDS 5432..5851 /locus_tag="DP116_09140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864930.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="universal stress protein" /protein_id="PRJNA477356:DP116_09140" /translation="MLKTILVALDDSELADRVIQTLQELVLVKDSKVILCHVFSPSES EMELPADRPHPESSTFSYFHIEKQLQSYQTLLPVESNIELVTGNPAEEIIRLANIYKA DLIIIGSRGLTGMNRIVHGSVSSQVVEDANCSVLVVK" gene complement(5806..6336) /locus_tag="DP116_09145" CDS complement(5806..6336) /locus_tag="DP116_09145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein CbiG" /protein_id="PRJNA477356:DP116_09145" /translation="MRQIINKVASNHRVLWVGIGCTRGTSRQLMERAIGQVFRENQLA ESAIAGFATINSKSQELGLLELCQHQNLFLKTFPPDVLSQISVPNPSQVVGKKVRTCS VAEAAALCAASDFTCLESSQNKVITLELEVRLIVPKQIFSLEGLPGMVTIAVAQAPIL DSYLTTNTEQLASSTT" gene complement(6269..7990) /locus_tag="DP116_09150" CDS complement(6269..7990) /locus_tag="DP116_09150" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09150" /translation="MSQNNNLQQTDDSLPLWSAEDVDLSSFNALRSPDDKQGKQVLDM RQLFHKQVIIDDETILDVETIKVPGFIGIADEQKITDPNGKTVSGHEDIVQYLDKYGL LICTYQYNKDRYGEAIDLRKPPEDYELSDLVEIFIKNEGHHSGAIVPALWNGGRTKAF GSLNEPDTYHDGLYGHNGFIAVAQRLVFPEFVTPQQARGYTDSMICWMALMNPFVEFS QNDFNGNDPTGVCDRPTLKEFLQNCALASLGSKEAIDFLNNSQNRAYCAEFIYICLNT PVYPFNKQGLTLLLDGDKAKATEILKIRDKQNSRKKNILSQTSENLQFKEFNIQMPVV PEDLPPLDVLMATNGQPPELNSIPFPPFTLSQVLRRAFRTLLKRQEDVNNTKIAKAQA QMLGYLEPLILRQLGIVTPPTEAESANEGQGLISRIPLEFFSVSPPPNDPKVKAVREF IAFVQQQLQRKFDSYEEFDLNFDQVMAKADELIGKDGVTYFIPPRIYVDLGQNDRDNN LPKGWGFKLETLGALIYRGFIRVTGLSSLPSDGGADPVTNTPSHETNNQQSSKQPPGS VGGNRMH" gene complement(8126..9409) /locus_tag="DP116_09155" CDS complement(8126..9409) /locus_tag="DP116_09155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ankyrin repeat domain-containing protein" /protein_id="PRJNA477356:DP116_09155" /translation="MTKNNDVLLLSAVKNGDIKQVQALLNDGANVDTGDRDGTTPLMF AAHFGYTEIARSLLDAGAHPDLKRKRYKLTALMLAASANQLDIVKLLVSRGADVNATN VDGSTALMVAALKGNAEVVRVLLAAGAKVDVKDKDEDTAFQLAIRAGHAAVVKAILQN HANVNAQDEEGETALMIAADLGHLEVVQALLAAGADVQARNLDGSTALSAAAAAGHSA IAAEILARGADVNLQDQDGETALHLAVVEGYADVVEVLLSQGANVEIKNHLGDTPLLL AALQGHSQIAEALLRQGANLKEKNLGEQPLTLAVIQGKTEMVRLLLDYGADVNTQGDD GKSVLIKAAERNHLGVIQQLIAKGSNVNLQDSAGATALMWAASRGYEEAVQLLLKAGA DVNIKNEGGYTALMLAEFNGYRTVVQSLLAAGAHE" gene 9707..10171 /locus_tag="DP116_09160" CDS 9707..10171 /locus_tag="DP116_09160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320993.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4079 domain-containing protein" /protein_id="PRJNA477356:DP116_09160" /translation="MELSPSVKFSLNFIHPILMWVLLLLSLYAAYLGLQVQRTRNAQG EEKKELIKGQYNVKHYQIGSIILALMVTGAIAAMAVTYINNGKLFVGPHLLIGLGMTT LIAISASLSPFMQKGANWARLTHILLNFAIVGLFVLQALSGVEIVQRLLTQA" gene complement(10319..10993) /locus_tag="DP116_09165" CDS complement(10319..10993) /locus_tag="DP116_09165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196961.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09165" /translation="MVSKQPDYQSFDTTEAILSMTSTQEAGDITQEAIGTPTRFHGRY NDHMEMYAAPGTVAEYLNNHASWFSRCADPMKVEPLGKNGYALVIGRFGSFGYEVEPK IGLELLPPEEGIYRIYTIPIPDYHAPGYDVDYRAALRLQENVANNSSTCLSKMTQVEW ELDLSVYIHFPKFIQRIPKSLVLSTGERLLNQVVRQVSRRLTRKVQEDFHQSLGIPFP DHPKKK" gene 11184..11843 /locus_tag="DP116_09170" CDS 11184..11843 /locus_tag="DP116_09170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SDR family NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_09170" /translation="MKALVAGATGETGRRIVQELIARNIPVRALVRDVEKARSILGAD AELVVGDVLKAESLSAALGDSTVLLCATGAKPSFDPTGPYKVDYEGTKNLVDAAKAKG IEHFVLVSSLAASQFFHPLNLFWLILYWKKQAEEYIQKSGLNYTIVRPGGLKNEDNSN QIVMQSADTLFEGSIPRQKVAQVSVEALFEPAAKNKIVEIVAQENAPAKSFGELFANV A" gene 12130..12801 /locus_tag="DP116_09175" CDS 12130..12801 /locus_tag="DP116_09175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycoside hydrolase" /protein_id="PRJNA477356:DP116_09175" /translation="MKKSLERKGGVEQLIAPIAALLGFVCLLQWSIFGDLRSHLDPTF ANKQPPLVMKGGDPYIRALMRTISASEASSNRPYSVLYGGHHVNNLNRHPEICVTIVK GPNKGNCSTAAGRYQIINTTWYNLSPRYHPNPGRFMFWVSYSFEPDYQDVVVYRWLSD SRFWGTDISQQLRQGRLPEVLRRLSPTWTSLGYGIETNSVSKSLPQIYHKILQEELRA SEKSV" gene complement(12897..13946) /locus_tag="DP116_09180" CDS complement(12897..13946) /locus_tag="DP116_09180" /EC_number="2.5.1.54" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_001168032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-deoxy-7-phosphoheptulonate synthase" /protein_id="PRJNA477356:DP116_09180" /translation="MENNKLSNTHIESFQALLTPDDLKSKLPLTLLAKETVLRYRQEI EDILNFQDRRKFIVVGPCSIHDTEAAIEYSEKLKILAERVKDKLLLIMRVYFEKPRTT VGWKGLINDPDMDDSFHIEKGLLIARSLLLKIAELGLPVGTEALDPIVPQYIGELITW SAIGARTTESQTHREMASGLSMPVGFKNGTDGNINVALNALQSAQKPHHFLGINQSGQ VSIFKTRGNDYGHVILRGGNGQPNFDPANVKLVEEKLKEANLPPRIVIDCSHGNSNKQ YKLQASVLEKIIQQIVDGNTSILGMMLESNLYEGHQLIPRERERLQYGISVTDGCIGW EETEEIILAAYRKLK" gene complement(14117..14923) /gene="menH" /locus_tag="DP116_09185" CDS complement(14117..14923) /gene="menH" /locus_tag="DP116_09185" /EC_number="4.2.99.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2-succinyl-6-hydroxy-2, 4-cyclohexadiene-1-carboxylate synthase" /protein_id="PRJNA477356:DP116_09185" /translation="MTLSNYQFHYSFSGHPDKPLILFLHGFMGNTHEFDEAISLLSDD FYCLTVDLPGHGATKVFGSDECYTMPNTAHAVIHLLDDLKITKCFLVGYSMGGRLALY LTLYFPHRFPKTILESASPGLLTEVERAERVKRDEQIARKLERSIDKNDFIAFLSNWY NQPIFGSIKNHPQFHHLVEVRLQNNPIELAKSLRFMGTGCQPSLWEKLKENTNPIFLL AGEYDEKFVSINTNMTKICDFCHLNIISHSGHNIHFENVRLFVKNVLILW" gene 15230..15430 /locus_tag="DP116_09190" CDS 15230..15430 /locus_tag="DP116_09190" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09190" /translation="MLSVISSQMLHRLSGIARSTSLATQTAKRRERNAVQRGGQPAGA GFNHFPYASSGYLLYMGFFDHI" gene complement(15653..16054) /locus_tag="DP116_09195" CDS complement(15653..16054) /locus_tag="DP116_09195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861728.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09195" /translation="MNGSNRSRKQSSPTSEQDEDLLFDDTDKDHTQHIHNHEQSAHAH VHSEESLRRIVNRLSRIEGHVRGIKTMVQQNSPCPDVLLQIAAVRGALDRVARIVLDE HLTECIARAAKEGDIEVELEQLKAALDRFLP" gene 16347..17564 /locus_tag="DP116_09200" CDS 16347..17564 /locus_tag="DP116_09200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874857.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_09200" /translation="MRFSKRSRSIRQLSTHVLAVILGVLLTVGTLQVSPSQAEPAPSS VIGDSPQLVAQRQSPATAAIGSSSFVTAAVNRVGPAVVRIDTERTITRRAADPFFDDP FFRRFFGNGSPQQLPPEQLRGLGSGFIIDKSGLILTNAHVVDKADKVTVRLKDGRSFE GKVQGVDEVTDLAVVKINAGGDLPIAPLGSSNNLQVGDWAIAVGNPLGLDNTVTLGII STLRRSSAEVRIPDKRLDFIQTDAAINPGNSGGPLVNAQGEVIGINTAIRGDATGIGF AIPIDKAKTVAAKLQRGETIAHPFIGVQMQEITPELARQFNSNPNSPIQLPEINGVLV MQVVPNSPAAAAGIRPGDVILQVDGQPITKGTQLLDIVEASRVGQQLQLKVQRGNRTQ QLSIRTAQMQNPS" gene complement(17621..18007) /locus_tag="DP116_09205" CDS complement(17621..18007) /locus_tag="DP116_09205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995758.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09205" /translation="MLGNYQQSQIRIEVDASFSAIRDSLLRPTELEKWVPTARFATGM PEELHSGFEFTTQTGPLSIHHQVDVARPNCLRLLLSQGIDGFHEWYWGEGWVQSRLEG VTILPLSLGQTLSLLSLRQFLVTKKR" gene complement(18222..18563) /locus_tag="DP116_09210" CDS complement(18222..18563) /locus_tag="DP116_09210" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09210" /translation="MKSTCPGTSTLGSSPITCAISVVDDIRSWLLERFVDFCCKQKNL YYKVQMSLMPFLLLIDVVEKSLCSLKEMTGYIQWEFSETEYLPLKRINRVRLSYLKRK KHEIKCFFMAN" gene 18952..19845 /locus_tag="DP116_09215" CDS 18952..19845 /locus_tag="DP116_09215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320881.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Hsp33 family molecular chaperone HslO" /protein_id="PRJNA477356:DP116_09215" /translation="MADQLIRATAAQGGIRAVGAITTRLTEEARNRHKLSYVATAALG RTMTAGLLMASSMKRTGARVNIRVKGDGPLGGILIDAGLDGTVRGYVENPSVELPPNS KGKLDVGGAVGKGFLYVVRDIGYGYPYSSTVELVSGEIGDDVAHYLISSEQTPSAVVL GVFVGASGVTAAGGLLVQVLPKAARDEALVETLESRVAALSGFTPLLQAGKSLTDIFQ DLLGDMGVAIFPETQILRYHCGCSFDRVLGALKMLGEAELQDMITKDNGAEATCDFCG TVYQASRDDLAQLVVDLQAEA" gene 20021..22108 /locus_tag="DP116_09220" CDS 20021..22108 /locus_tag="DP116_09220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316067.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chromosome segregation ATPase" /protein_id="PRJNA477356:DP116_09220" /translation="MTERNIPESWLAGKAREPDNNMTRGYRRQESSETHRSGVPATSS MIESVNSSKENDSELLSLDKESQKIVSLSKSSWKLPQWTKSWVLWTLLLALVPSAIAF MATGMLLKLPSAPNCPSIFWPLASASVRLHCAQLAASKETTKDLLQAITLVKELPKNH PLRAEIDRLIEEWSRDILQLADQSFQAGRLDEAIATARQVPKEESAYQLVEAKISKWQ SIWSTAEGVYTEAEAQMRDEKWHQAFMLASRLLRVDNKYWSMTKYDQLNRLIVSARED GEQLGKAKTLAESKTVDNLLKAIKVAELIQQDSYVYPKAQETLSEFGEKMLELAQAKL DARNPDQAILIAQKIPPSTRHNKDIEDFITLTEAQRSAWLGTTAGLETAIAQAQQIQA TRPKYEKAQELIASWQVEIQDVAHLEKARYLASQGSVNDFAAAITEAQLIPDGNPRAE EAKKEIGGWVAQIQTIEDRPYLDRADQMAMLEDVNSLQAAIAQATQVRQGRALYPEAR RKIGIWTAKIQRIQDQPYLDQAKNLADSGDLSSAIATAQQIRPGRALSGEAQAVIDEW QGQLRAKQNWNKAREIAVTGTPEALAKAIRLANRVPDNNILRSDASIAIDQWSQQLLD IARAQGESNIPRAIETARLIPRGTDAYSAAREQIRAWQEYLNPQPQEAPQESRQESSQ ESYTEQPMTIERQ" gene complement(22201..22758) /locus_tag="DP116_09225" /pseudo CDS complement(22201..22758) /locus_tag="DP116_09225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198016.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(22748..23185) /locus_tag="DP116_09230" CDS complement(22748..23185) /locus_tag="DP116_09230" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09230" /translation="MDKMLQALENKPSWLNLPGHQRQEACNDACDAFKQAKSERGFAK FKSCKATSQVIKFKVGNYKNGTWYSKTTKGLKYQSSQPVPYQCEYGTQLVYQRGKWFA CFPQVVEFVGTGSDRVIALDPGNRTFLTGYDGENVLEIGKGLY" gene complement(23320..23754) /locus_tag="DP116_09235" CDS complement(23320..23754) /locus_tag="DP116_09235" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09235" /translation="MGRRPTLQIILDLTTLEKTGKFKGLGSLIRVYNGKRGLHLVVLY IALGRWRLPWSFRVYRGKGHPGCVQLGLRLLSTLPKSLTKRYQVLVLVDTAFGSIDFF KQVRQMKFHAVAGQIRKTSPARKGKYYPDTKRARRSYHQTVF" gene complement(24184..24767) /locus_tag="DP116_09240" /pseudo CDS complement(24184..24767) /locus_tag="DP116_09240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198016.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="peptidase S8" gene 24826..25203 /locus_tag="DP116_09245" CDS 24826..25203 /locus_tag="DP116_09245" /inference="COORDINATES: protein motif:HMM:PF13551.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09245" /translation="MCRVLKLEIKETQAELQELLRQQKTGLGKERIQALYLLKTRQVE TVQHLAVMLGRGRITLHRWLKLYRKGGLSSLLELRKSPGRPKTIPVDVRLRGAQTPKA YRYSKKSFPNQKGLKAMRKSVLG" gene 25200..25433 /locus_tag="DP116_09250" CDS 25200..25433 /locus_tag="DP116_09250" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09250" /translation="MRASEGIEASYKVVHEVVRYKLKAKLKAPRPRSVKQNKGVEEDF KKNFTSGLELIKKYLISPLEQHRRVRYWCGEGR" gene complement(25640..26221) /locus_tag="DP116_09255" CDS complement(25640..26221) /locus_tag="DP116_09255" /inference="COORDINATES: protein motif:HMM:NF033545.0" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_09255" /translation="MSSTDLAPSSCAPLIRTYARSLQGKRAYGERPYHHRDNITLIGA IALTGWIGAMTIDGGTNGDIFRFFIESILVPNLWSGACVVMDNLPAHKVDGIRQLIED KGARLIYLSPYSPDFNPIENCWSKIKEYLRSLAARSRENLENGITNAMDAVSLKEIRN WFSHCCYCTSRSLKTAIITKMKCAIINLNNMQI" gene 26184..26856 /locus_tag="DP116_09260" /pseudo CDS 26184..26856 /locus_tag="DP116_09260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007355582.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 7848 a 5841 c 5761 g 7791 t 20 others ORIGIN 1 gatatgattt ccactcatca agaattgaat aatttttaaa ttttgtgcat ctacccattg 61 caaatcatct ataaaaataa ttaaggggtg ctcttttcta atgaatattt tcagaagttt 121 ttgcaacaat aaaatgaaac gattttgggc attagtacca aagagttcag taactacggg 181 ttgcttgcca acaatgagtt ccagtttagg aataacatca ataatgattt gagcttgttc 241 acctaatgct gacaaaattt tagttttcca ctgttgtatt tcagcatcag tttcatttag 301 caactgccca attaagtctt caacagcttg tatcaatcct gacaaaggaa agtcacgttg 361 taacacgtta catttgcctt ggataaagta actacgctgt tgggcgatga tcactttctc 421 aacttctttg accactgctg tcttgcccac accaggggca ccagctacta acatcatttc 481 agatgttcca ctgtctctga ctcgctcaaa agcagtcagt agggtttcaa cctcatgttg 541 ccgaccatag agtttttcag aaatcagaaa acggtgagac agcgctgcag gagggtttcc 601 ctccgtaggc gactgcgttc ggcgttcgcg aagcgtgtcc gaaggacata gacgtgccgc 661 aggcataccc gaagggtctg acatatcctg gtttttcaac tcaaaaggca caatttttcc 721 gattttttgc cactgttgca ggtatatttc ttgaccttgc tttggtcttt gggcagtttg 781 atagcaatac tcaggcgttt ttgtcatttt ttcaaggttt ttggcaatag tcttttgttg 841 atagtcttct caagaagctg tcaaactggc gtggttccgc tagtcttact ttgtaagtta 901 ttttaatggt tgtccggttt tgtcatgatt gcccccaacc ctacagatcc ggctgcttcc 961 agctcccgtt gcctttttgc actagtcagt ttcagcatag ctgcctgatc cctacgtcac 1021 gaattacctg atttaaggtt attattcccg gcgtgtcagt gatatacgta catagaaaat 1081 gaaaattctt cacaccgaac nnnnnnnnnn tcccggcgtg tcagtgatat acgtacatag 1141 aaaatgaaaa ttcttcacac cgaacaagta cttaccgcag aaaaagaagg ttattattcc 1201 cagcgtgtca atgatacatg tacatagaaa atctgtagaa attattaaaa ctgaataagt 1261 acttaccgca gaaaaagaag gttattattc ccggcgtgtc agtgatatac gtacatagaa 1321 aatgaaaatt cttcacaccg aactagtact taccgcagaa aaagtcttat cttttcacct 1381 ttgactcttg actctgactt ccgtcggctg gtactgcgaa ccaagtttat tagcacaaaa 1441 accaacgtca cgtacttgat ttggctcaat tatccgtccc caatctagag gtgtgacagt 1501 atactgtgtt gctccttgag gtttaaagtt accgttccaa gagttattaa tcgcagcctg 1561 tttcatatca aatgtcattt gccaattgtt 
aaccttggta ttcccctgat tcatgatttt 1621 gaagctgaca caaaatcctg tttgccaatt tgcatagatg tcagaagtta ctgttagttg 1681 ggaagaagaa gagatacggt aaagaggaga taggggattt tgtgatgggg agataggagg 1741 aataatcttt cctgtgtcct cttccacttg cacgggcgct acttgagaaa aactaactgg 1801 cagcatttgg gacagtagct gttgcttagg aatatcaaca ttttgccaat catctaataa 1861 aataccgcct gtatcagaac tgttaggatt ccaactccag taagcaaaac ttaattgttt 1921 ctgtttaata tattgcacaa attcgttctg ccaaattcct tctttagaat ttgtatctac 1981 tttccttcca ccaaattcac cgataaaaat aggtgcaaga ttttgactag aaatataatg 2041 aaatcctatt tgccagcgat tgatgagatc tttaggaaat tttgggtcag aaaaccaagc 2101 ctgatctgca actccaggac cgtattcatg aggtgaatag acaagcttgt tgcgacgaga 2161 taaacgtact gggtagcgct tcactccttc taaattacca ccgtgccaat ggttaggcag 2221 cttttgtgta ggaacattct tttctactcc ttgcacaaca atcagccagt taggattaac 2281 ctcaagaatc gcatttccag cccgttctgc agcgagtcgc cagtctgttg ctaaatcgtc 2341 agtaccccag cttgcttttc cgtgaggttc gtttttcaga tctgcaccaa caacgttggt 2401 ttgatttttg tatctattgg ctaacattgt ccaagtatta atccagtctg cttctgtaaa 2461 gttgtcttcg taccacaatt cggcaatccg cttatcattc aaacagtgat tatcgagtag 2521 aacgagtaac ccttgacgtt ctgcttcctg aataattaag tccatcacct ctattggagt 2581 ttttccttca aattctttat tgctaccaat actgaaatca ataccgctaa tattaggcga 2641 aactaatgct tgtacagaat aaggtaagcg aatcaggtta tacccaagtg tcctaatttg 2701 tgtcagtatc tccttataat ctcgcttcca caagccatgc ggtacgtggg tttctgtttc 2761 catgccaaac cagttaacac ctcttagcaa aacttgccga cctttagcat cgataatttt 2821 tgcgccacga gttgaaagtg gtaattccat gatcatttca gtatttatct catttacctg 2881 gctttgagca ggcagctcac tcatgatatt tgtcaaaaaa actactgtgc ttactgtcat 2941 aatgagtaac tggtgattaa attttaagtt ttctcgtttc cttgtataaa aaagccgcat 3001 acttttcgtt accttactta actgacacaa atatctttaa acaagcgtga ggacagacag 3061 cttttaagca tatatcgttc aatgtccaca atttcatgaa agcccagttc ctgcttgttg 3121 ttaactctgt aaatgtgcta gaattaaata ctctgctaaa acttgtcaat tcaaaatcct 3181 tgttaaaatt tcctaacttt agttaaggaa gggttatgat gagtgaacac tgtcaatgta 3241 gttcctccca aagagaaatt ttttgctctc aataggatta 
tgtgaacggt tagggttcaa 3301 aagttaacat acagcaatta agcacatcac ttctgtaaat atgttgataa gttaagtaaa 3361 aatagccaat aatcataatc agtttctact tagactagaa aaggggagtg gatgaataag 3421 catataattg accgactgta aactttggga gttaaagtca ccaagagacc gggaactagc 3481 attcctactc gctggtacta atagcaaatt tggctctcct gccgttaagt caggcaaaag 3541 gcaagtaaaa ataacctggt tttgttgcct tctcagacaa accatgtgcc caaagccatg 3601 gaagtatcaa aaaaatccag gttaagctag agaagagagt taggattagc tttacatcct 3661 ttggggtcaa gagcaaaact tattctcacc gcaggaattt tctcaagttg gcattgtcct 3721 tactccatgc tgcgaattat tactcagcag gcagacgtta gagcagaact acaacggatc 3781 tgcgatcgca cccatgatga acaggtactt cacaaagaag caacagtgcg ggaagtgttg 3841 caggcagtga agcgccaagg cgataacgct gtactgcatt acacagccga attcgataag 3901 caaaccctca agccagatga actgcgcgtg acaggctcag aactggatgc agcctaccaa 3961 caggtatcaa aggaattgct ccaggctatc aggcttgctt gccgccaaat cgatgcattt 4021 caccgccagc gagtgcccaa aagttgggta caatttggcg acgatgagat cgtgttgggc 4081 aagcgctaca caccagtaga caaagcaggg ttatatgtgc caggtggtcg tgccgcctat 4141 ccaagtacgg tattgatgaa tgcaattccg gcaaaggtgg ctggtgtgcc acgtgtggca 4201 atggttacac caccaggacc agacaaagca attaacccag cagtgctggt agcagcacaa 4261 gaagctggag tacaagaaat ttatcgcgtc ggaggcgcac aggcgatcgc cgctttagcc 4321 tatggtacag aaacgattcc taaagttaat gtcatcacag gaccaggtaa tatttacgtc 4381 actttagcca aaaagcttgt ctatggcacc gtaggcatcg attccttagc tggacctagt 4441 gaagtcctca ttattgccga tgaattcgca aatcctgtac acgttgccgc tgatttgtta 4501 gcgcaagcag aacatgaccc gatggcagca gcaattttgc tgaccacgga tgcagctttg 4561 gcaaagaatg tacaagtggc ggtggaaagg caacttgtgg atcacccaag gcgcttgctc 4621 acagaaaaag cgatcgctca ctacggttta atagtgattg tagaatcctt gaaagcggca 4681 gcagaactct caaacgaatt tgcccccgaa catctggagt tggaagtaga agatccttgg 4741 gagttactac cacagattcg gaatgcgggt gccattttct tgggcnnnnn nnnnnattcc 4801 acaccagaag ccgtaggcga ttatttagct ggacctaacc acaccttacc aacttctggt 4861 gctgctcgtt atgcctcagc attgggagtg gaaacattca tgaaacactc tagtattatt 4921 caatactctc agactgcact ggaaaacgtg gctggtgcaa ttgatgtact 
ggcaacagcg 4981 gaaggtttac cttctcatgc tgattctgtc agacgccgag tccaaacgca agagtaattt 5041 tgaatttgaa ttattaattt tttccttaga cattacccaa aactcagcct atgggttcaa 5101 actgaaggtt taaggctata caaaattatg tccacagctt ggatatagtg agtgtagcca 5161 ttaacttatt tcatccttga cttataaggc tatccatgac tcggatcttt ctcaacaaaa 5221 gtgacagtaa tcgctcatga acaatctaac ccttatacct ttgcaagtaa atctcaataa 5281 taaaggtaag gggcgatttg tcaaacaagg ctgaaaagtc aatcttgatg agccgtataa 5341 aattacagtt tcaagattgt taacaaagat ataattaaca gatagtcgga gaaagtcaag 5401 taaaaagtct actcttgtgg agaggacatc ggtgctaaag actattttgg ttgctctgga 5461 tgattccgaa cttgcagaca gagtcattca gactttacaa gaactggttc tggtaaaaga 5521 cagcaaagtt attctctgcc atgtgttttc tccatcagag tcagagatgg aactaccagc 5581 tgatcgtcct cacccagagt catcaacatt ttcttatttc catattgaaa aacagcttca 5641 atcttaccag acactattac cagttgaaag caatatagaa cttgtcactg gtaacccagc 5701 agaagagatt attcgccttg caaatattta caaagctgat ttaattatta ttggcagtcg 5761 cgggttaact ggtatgaacc gaattgtcca cggttctgtg agttctcaag tggtggaaga 5821 tgccaattgt tctgtgttag tcgtcaagta agaatctaag ataggcgctt gggcaacagc 5881 gattgttacc attcctggta gcccctctaa actgaaaatt tgtttaggaa cgattaatct 5941 tacctctaat tctagggtaa ttaccttatt ctgggaagac tccaggcagg tgaaatcaga 6001 ggctgcacac aaagcagctg cctctgccac gctacaggtt cttacttttt ttccaaccac 6061 ttgagatggg ttagggacag agatttgact cagaacatca ggcggaaagg tttttaaaaa 6121 caaattttgg tgctggcaaa gttccaataa acccaattct tgagatttac tattaatagt 6181 cgcaaaacct gcaattgcac tttccgccag ttgattttct ctaaaaactt gcccaatagc 6241 cctttccatc aactgtcgtg aagttcccct agtgcatcct attcccaccc acagaacccg 6301 gtggttgctt gctactttgt tgattatttg tctcatgtga cggcgtgtta gttacaggat 6361 cagcacctcc atcagatggc aaagaactca aaccagttac tctaataaat ccccgataaa 6421 tcagtgcccc aagagtttct agcttgaagc cccaaccttt gggtaaattg ttatcacgat 6481 cattttgtcc caaatcaacg taaatacgcg gcgggataaa ataagtgaca ccatctttac 6541 ctatcaactc atctgctttt gccatcactt ggtcaaaatt caagtcaaat tcttcataac 6601 tgtcgaactt acgttgtagt tgttgttgta caaaagcaat gaactcgcga acagctttga 
6661 cttttggatc attaggggga ggagaaaccg agaaaaactc caaaggaatt ctagagatca 6721 gaccttgacc ttcattagca gattctgctt ccgtaggagg tgtgacaata cccaattgtc 6781 tcagaattaa gggttctaag taccctaaca tttgggcttg cgctttggca attttggtat 6841 tatttacatc ttcttgacgc ttcaatagag tacggaaagc acggcgcagt acttgagaaa 6901 gtgtgaaagg aggaaaggga atgctgttga gttcaggtgg ttgtccattg gttgccatca 6961 gaacgtcgag aggtggcaag tcctcaggta ccactggcat ttgaatgttg aactctttaa 7021 attggaggtt ttcgctggtt tggctgagga tatttttctt tctgctgttt tgtttgtcgc 7081 gaattttgag tatttctgtt gcttttgctt tatcaccatc caataataag gtcaaccctt 7141 gcttgttaaa cgggtaaaca ggagtattta gacagatata aataaactcg gcacaataag 7201 ctctattttg tgaattattg aggaaatcaa ttgcctcttt tgaaccgaga ctagcaaggg 7261 cgcagttttg caaaaattct ttgagagttg ggcgatcgca cacgccagtt ggatcgttcc 7321 cattaaaatc attttgtgag aattcgacaa acgggttcat taaagccatc caacagatca 7381 tgctatcggt atagccacgc gcctgttgtg gcgttacaaa ctccggaaaa actaaccgct 7441 gggctactgc aataaaccca ttgtgtccat atagcccgtc atggtaagtg tctggttcgt 7501 ttaagctacc aaatgccttt gttcttccac cgttccataa tgcaggtaca atcgcaccag 7561 aatgatgtcc ttcattttta ataaaaattt ctaccaaatc tgacaactca taatcctcag 7621 gtggtttgcg caggtcaata gcttcaccgt atcggtcttt attgtattgg taagtacaga 7681 taagtaaacc gtatttatca agatactgca caatatcttc gtgtccggaa acggtttttc 7741 cattcggatc tgtgattttt tgttcgtccg caataccgat aaaccctgga acctttattg 7801 tttccacatc gagaatagtt tcatcatcta tgatgacttg cttgtggaac aactgcctca 7861 tgtccaagac ttgtttgcct tgcttatcat ccggacttct caatgcatta aaactagaca 7921 aatctacatc ttcggcagac cacaaaggca gactatcatc tgtttgctgt aaattgttgt 7981 tttgactcat atgctgtatt tgacaaaagt aatgtataac ataagacctc attttcttct 8041 acagatgagg taccaagatt aactacatct atcaaacact cttttaagaa aaaactcgta 8101 gacaaattga agaaaatttc tctacttact cgtgtgcgcc tgctgccaag aggctttgca 8161 ccacggtacg atacccatta aactctgcaa gcattaaagc tgtataacca ccttcatttt 8221 ttatattcac atctgcccca gctttcagta acaattgcac tgcttcctca taaccccgcg 8281 aagctgccca catcagtgct gttgcacctg ctgagtcttg tagattcaca ttcgatccct 8341 
ttgctataag ctgctgtatc acgcctagat ggttacgttc tgcggcctta attaaaacgc 8401 tcttcccgtc gtctccctga gtattgacat cagctccata atctagtagt aatcttacca 8461 tctcagtctt tccctggata actgctagcg tcaagggttg ctcacccaag tttttctctt 8521 tcagatttgc tccctgacgc agtagtgctt cggcaatttg gctgtgtccc tgcaaagctg 8581 ccaggagtag cggtgtatct cctaaatggt tcttaatttc tacgtttgct ccttggctga 8641 gtaaaacttc taccacatcg gcataacctt caaccacagc aagatgcagg gctgtttccc 8701 catcttgatc ttggaggtta acatcagcac cgcgagcgag tatttctgct gcgatcgcac 8761 tatgtcccgc cgccgccgct gctgataaag ccgtactacc atcaagattt ctcgcctgaa 8821 catcagcccc tgctgctaac agcgcttgca caacttccaa atgtcccaaa tctgccgcaa 8881 tcatcaatgc cgtttctcct tcttcatctt gggcattgac atttgcatga ttttgtagta 8941 tagctttgac aactgcggca tgtccagcgc gtattgctaa ttgaaaagcc gtgtcctcat 9001 ctttatcttt aacatcgact ttggcaccag cagcaagtaa gactcgcacc acctcagcgt 9061 ttccttttaa ggcggctacc attaaggcag tgctaccatc aacattcgtg gcgttgacat 9121 ccgcgcctct agagactaaa agttttacaa tgtcaagttg gttagcactt gctgctaaca 9181 tcaaagccgt caacttatag cgctttcttt tcaagtcggg atgagcacct gcatcaagaa 9241 gcgatcgcgc aatttcggtg tagccgaaat gagcagcaaa catcaagggt gtagtaccat 9301 cgcgatcgcc cgtatctacg ttggcaccat cattaagcag tgcttgcacc tgttttatat 9361 caccattttt cacagccgat agcagcaaga catcgttatt tttagtcatt agtcattagt 9421 gattagtcaa gaggaggcag tgcgttgcag tgagggagtg ggaggctaga gtttcacagg 9481 cttagacgtc ttccaggcag cttccctccg agggtgccac taggcgtgct ttccccggac 9541 ggtgcgaact gccgttcaaa aattattttc ctggtgcttt ttttacacaa gatgctctgt 9601 ttgtcccctt atcctttgtt tgttcatttc tttatcatcc catttcccca ttaccaataa 9661 gcgttatgct aatttttatg aagatttaag aagaggggca ctaggtatgg aactttcacc 9721 atcagttaag ttttcgttaa actttattca cccaattctc atgtgggtgt tattactact 9781 ttccttgtat gcagcgtatc tgggactaca agtccaacgg acaagaaatg ctcaaggtga 9841 agaaaagaaa gaattaatca aaggtcaata caacgttaag cattaccaaa tcggctctat 9901 aattttggct ttgatggtaa caggcgctat tgctgctatg gctgtgactt acatcaacaa 9961 tggtaagttg ttcgttggac ctcacctact cataggactt gggatgacaa ctctaattgc 10021 catctcagct 
tccctgtctc cttttatgca aaaaggggca aattgggcgc gactaacaca 10081 tattctgttg aattttgcaa tcgtaggtct ttttgtattg caagctctca gtggggtcga 10141 aattgtccaa agacttctca ctcaagcata gtcactagtc attagtcatt agtcattagt 10201 aaaaaacaaa tggctattga ctcttgctac tcaaatgtat gccctaaggg cacgctactt 10261 taaacaaagt atgctcgacg gagcatactt tcaaattcaa aggacgtggg tgaggtgact 10321 attttttctt tggatgatca ggaaaaggaa tacccaaaga ttggtgaaaa tcttcctgta 10381 ccttacgagt taggcggcga gaaacttggc ggacaacttg gttaagtaag cgttcacctg 10441 tagatagaac taaagacttg ggtattcgct gaataaactt gggaaagtga atataaacac 10501 tgagatccaa ttcccactcc acctgtgtca ttttacttaa gcaggtggaa gaattatttg 10561 ccacattttc ttgtaaccgt agtgcagctc gataatccac atcataacca ggagcatggt 10621 agtcaggaat cggaatagtg taaatccgat aaattccttc ttctggaggt aataattcca 10681 aacctatttt tggttctact tcgtaaccaa aagagccaaa acgaccaatg actaaggcat 10741 agccattttt tcccaatggt tccaccttca tgggatcagc acagcgcgaa aaccatgagg 10801 catgattatt gaggtattca gccacagttc caggtgcagc atacatttcc atatggtcgt 10861 tataacgacc gtgaaaccgt gttggcgttc ctattgcttc ttgggtgatg tctccagctt 10921 cttgagtcga tgtcatagat aagattgctt ctgttgtgtc aaaggattga taatccggct 10981 gctttgaaac cataaaagcg ttcttttata gtcagagtgt ttgtctaaat taataattcg 11041 gctggtagat aaaacatttc gtactaaatt tccattttgg cggaatctca atcatctgtc 11101 atgtggttca tctaaatccg tgtaagaatc tttaaaatag gaaactgaaa aacagatact 11161 agttttccag gatagcgttt atcatgaaag cattagtagc aggggcaaca ggtgaaacag 11221 gtcgccggat agtgcaagag ctgatagcgc ggaatattcc cgttcgtgcc ttagtcaggg 11281 atgtagaaaa agcaaggagt attctaggcg ctgatgccga gttggtcgtg ggagatgtgt 11341 taaaggcaga aagcttgtct gctgctttgg gagatagtac agtgctacta tgtgccactg 11401 gcgcaaaacc aagctttgat ccaactggac cttataaagt ggattatgaa gggactaaaa 11461 atttggtaga tgccgcaaag gctaaaggaa ttgagcattt tgtcttggtt tcttctttgg 11521 ctgcttccca gttttttcat cccttgaact tgttctggct gattttatat tggaaaaagc 11581 aagccgagga gtacatccag aaaagcggtc tgaactatac aattgtgcga cctggtggtt 11641 taaagaatga agataattcc aaccaaatcg tgatgcaaag cgctgataca ttgttcgagg 
11701 gtagcattcc ccgacaaaaa gtagcgcagg tttctgttga ggcgctgttt gaaccagcag 11761 caaaaaataa aattgtggag atagttgctc aagaaaatgc tcccgcaaaa agctttggag 11821 aactctttgc taacgtcgcc taagtcttgg tattgagttg tgagtcaaaa gtgaagagtc 11881 gaataagtga tgtcccttta agtatgcaca aagcacacgc aatcatgtgt taacgttgca 11941 aaagcgttcg ctttgtgcga cgagtgctta ggcgtggagg agatatgcca gttgtcctcc 12001 cgcagcaagt tgtctcacta atcgctattg acttttgact cattttctgg tgagtttccg 12061 gaattagtgc gtcgaagatt gttgttatat tttgatgaca ataacagctc atttttaagg 12121 agtcggactc tgaagaaaag cttagaacgc aaaggcggcg ttgaacaact cattgcacca 12181 atagccgcac ttcttggctt cgtatgtttg ttgcagtggt cgatatttgg agatttgcga 12241 tcgcatctcg atcccacctt tgccaacaaa cagcctccct tagtcatgaa agggggtgat 12301 ccttatatcc gtgctttgat gcgaactatc tcagcaagtg aagcgagtag caaccgtcct 12361 tattcagtgt tgtatggtgg acaccatgtt aacaacctta accgtcatcc tgagatatgc 12421 gtcactatcg taaaaggtcc gaacaaagga aattgttcta cagctgcagg tagatatcaa 12481 attattaata ctacatggta caatctatcc cctcgttatc acccaaaccc aggacgattt 12541 atgttttggg tttcttatag ttttgaacca gactatcaag acgtggttgt ttatcgttgg 12601 ttaagtgact ctcgattttg gggaactgat atttctcaac agctacgtca aggaaggtta 12661 ccagaagttt tgcggcggtt gtctcctact tggacaagtt tgggatatgg tatagaaact 12721 aattctgtga gcaagtctct gccacagatt tatcataaaa ttttgcaaga ggaattaagg 12781 gcatctgaaa aatcagtgtg aaaagacgct tgtcattcct ctttgaggaa atgaaacaac 12841 cttacctcat ccaacaactt tttctgtact caaaaacttt ttgcaacaca atctttctac 12901 tttaattttc tgtaagcagc caaaataatt tcttctgttt cttcccaacc aatacaccca 12961 tctgttacag aaatgccata ttgtaatcgc tctcgttcgc gaggaatcag ttgatgacct 13021 tcatacaaat tggattctaa catcatgcca agtattgatg tattaccatc tactatttgt 13081 tgaataatct tttccagaac agaagcctgt aatttatatt gtttatttga attgccgtgg 13141 ctacagtcga taacaattct cggtggtaaa tttgcctcct ttaatttttc ttccaccagt 13201 ttgacattag ctggatcaaa gttaggctga ccattaccgc ctcgcaaaat aacatgacca 13261 tagtcatttc ctctcgtttt aaagatgctc acctgtccgc tttgatttat tcctaaaaag 13321 tggtgaggtt tctgagctga ttgaagagca ttcaaagcca 
cattaatgtt accatcggta 13381 ccgtttttga aacctacagg catcgaaagt ccgcttgcca tttcgcgatg agtttgtgat 13441 tcggtcgtgc gtgctccaat tgcagaccac gtaataagtt caccaatgta ctgaggtact 13501 ataggatcga gtgcttctgt accgacaggt aatcccaatt cagcaatttt tagcagtaag 13561 ctacgtgcaa ttaataaacc tttttctatg tggaaagaat catccatatc aggatcgtta 13621 attaatcctt tccatcctac tgttgttctt ggtttttcaa aatataccct catgataagc 13681 agcagtttat ccttaactcg ctcagcaaga atttttagtt tctctgaata ttcaattgct 13741 gcttctgtat cgtggataga acatggacca actactatga attttctcct gtcttgaaaa 13801 tttagaatat cttctatttc ttgtctgtat cttaaaactg tctcttttgc taaaagtgtt 13861 aaaggtaatt tggattttaa atcatctgga gttaataaag cctggaaact ttcaatgtga 13921 gtattagata atttattgtt ttccataaag tacacttcct gattttgtga tatttactat 13981 tgatttttgt tatacagtat aactcaagag tctgacttta ttcttgattt tcagatttat 14041 attttttatg taactaacac ttactgtgat attttcaggt gtcaaacgtg gataacaata 14101 cttttaaaag gctttattac cataaaatga gaacattttt cacaaatagt ctgacatttt 14161 caaaatgaat attatgtcct gaatgactaa ttatgtttaa atggcaaaaa tcacaaattt 14221 tagtcatgtt tgtattgata gatacaaact tctcatcata ctctcctgct aataaaaaga 14281 tagggtttgt attctctttc agtttttccc acaaagaagg ttgacaccca gtccccatga 14341 atcgcagtga tttagctaac tcaattggat tgttttgcaa ccgaacttct acaaggtggt 14401 gaaattgggg gtgattttta atagacccaa aaatgggttg attataccaa ttggacaaaa 14461 aagctataaa atcattcttg tcaatacttc tttctaattt tctggctatt tgctcatcac 14521 gtttgactcg ttctgctcgt tctacttctg ttaacaaacc aggagaagct gattccaaaa 14581 tagttttagg aaaacgatga ggaaaataca gggttaggta taaagctaat ctccctccca 14641 tcgaataacc aaccaaaaag catttggtaa tttttaaatc atccagtaag tggattacgg 14701 catgggcagt gtttggcatt gtataacact cgtcactacc gaaaacttta gttgctccat 14761 gtcccggaag gtcaactgtc agacagtaaa aatcatcaga tagtaatgag atagcttcat 14821 caaattcatg agtgttcccc atgaatccat gtaaaaaaag aattagtggt ttatctggat 14881 gaccactaaa agaatagtga aactgataat tactgagagt catattttag cctatctgat 14941 cacacacatt gccttagtga attccattcg cgcgccagtt gcaccgcatt agtgacaggt 15001 taaggttatg gcgcatttgt 
caatatacct aatcattgaa aaatatgcca aaacctgtgg 15061 tatcaaacat cgtttttgag ttcgcgattg cacggttcgc agtgcctttt gggcaatcgc 15121 gtaatcttta tactcatcat tcacttttct gcaaaagcta ggtacaaagc ttagtttgat 15181 acctgatttt tcactcgctt ttcagcagat tttttaattg acaaaagcaa tgttatctgt 15241 aattagctca caaatgttgc accgactgtc gggcattgcc cgcagcacta gcttggctac 15301 acaaaccgcg aagcgccgcg agcggaacgc agtgcagcgt ggcggtcagc ccgccggtgc 15361 gggtttcaat cattttccct acgcgtcatc tggttatttg ctttacatgg gcttttttga 15421 tcacatatag gacttacgca ctgtacaaaa agattggacc gcccgcaggg tttgcttaac 15481 ccccactcgc tgtctgccgt gcatggtttc ccgacttgag gagccactgc ggtgcgtcgc 15541 tgtcctccgt tgttcgccta gcgtctggtg gaggagatag catgtggcgt gcgactgcgg 15601 tgcgacttct agtcgctggg ttgagtatga aataaaactt ttcaaaaatc ccctaaggca 15661 aaaacctatc caaagcagct ttaagttgtt ccaattcaac ttcaatatca ccttcttttg 15721 cagctcttgc aatacactct gttaaatgtt catccaagac aatccgtgct acgcgatcca 15781 acgctccccg tacggcagca atttgtagta aaacatcagg acaagggcta ttttgctgca 15841 ccattgtctt aataccacga acgtgtcctt ctatacgcga aagccgattg actattcgcc 15901 gcagagattc ttcgctatga acgtgagcat gagccgactg ttcatgattg tgaatatgct 15961 gtgtatgatc tttgtctgta tcgtcaaaga gtaaatcctc atcctgctca gatgtgggtg 16021 aggattgttt tcttgatcgg ttcgatccat tcataagttt tcatataggt gctgccaata 16081 aataagatcg taccgtagag catacaggag ggggtgggca cagatggagt gttaaagagg 16141 atcactaata ttatttgaaa atatttgaaa atttcgtaaa ttaaagaaca aatgcctgat 16201 ttgcaaggat taggcagaaa tatagaaata ttaaactaga aacaaacgtg aatggatagt 16261 tattgccaag tatacctgta aaaatagcta tagagagatt tttcaccata gaaacaacaa 16321 ttaagctgaa atattaggtt gtggttatgc gattttccaa acgatcccgg tctatacgcc 16381 aactaagtac tcatgtgtta gccgtcattt tgggagtttt gctaactgtt ggcactttgc 16441 aagtctcacc ctcacaagca gaaccagcgc caagttctgt gattggtgat tcaccacaac 16501 tcgtcgccca gagacaatca cccgccactg cagctattgg tagcagtagc tttgtgacag 16561 cagcagtaaa tcgcgttgga ccagcagttg ttaggataga cactgagcgt acaattaccc 16621 gacgcgccgc cgatccattc tttgacgatc cctttttccg acggtttttt ggtaacggtt 16681 
caccacagca gttgcctccc gagcaattac gtggtcttgg ttctggcttt atcattgata 16741 aaagtgggtt aattctgact aatgcccatg tggtcgataa ggctgataaa gtcactgtcc 16801 gccttaaaga tggacgcagc tttgaaggaa aagtacaagg cgttgatgaa gtcactgatt 16861 tggcggtagt taaaatcaat gctggtggtg atttaccaat cgcacccttg ggttcttcaa 16921 ataatctaca agtgggtgat tgggcgatcg cagtcggtaa ccccctagga ttagataaca 16981 ccgttaccct aggaattatc agcaccctca gacgttctag cgccgaagtt cgcattcccg 17041 acaagcgctt agatttcatt caaaccgacg ccgctatcaa ccctggtaac tcaggtggac 17101 cactggtgaa tgctcaaggt gaagtcattg gtatcaacac agctattcgt ggtgacgcaa 17161 cgggtattgg ctttgccatc cctatagata aagctaaaac agtcgcagcc aaactacaac 17221 gcggagaaac aattgctcac ccatttatag gtgtgcaaat gcaagagata acgccggaac 17281 tggcaagaca attcaactct aaccctaatt ctccgattca attgccagaa attaatggcg 17341 ttttagtcat gcaagtggtg cctaactcac cagcagcagc agcaggaata cgtccgggag 17401 atgtgattct tcaggttgat ggacagccga tcacaaaagg cacacaattg cttgacattg 17461 tggaagctag tcgtgttggt cagcaattgc agttgaaagt gcaaagaggt aaccggacac 17521 aacagctatc gatacgcacc gctcaaatgc aaaatccctc gtaaaaattt taaatcctag 17581 actgtaaaag cccggcttgt tgaagaagct gggttttttg tcaacgtttt tttgttacca 17641 gaaactgacg caaactcaac aaacttaaag tttgtcccaa agaaagcggt aaaattgtca 17701 ctccctctag acgagactgt acccaacctt ctccccagta ccattcatga aagccgtcta 17761 ttccttgact gagtaacaag cgtaagcagt taggacgtgc aacatccact tgatgatgaa 17821 tcgagagtgg tcctgtctgg gtagtaaact caaatccaga atgcagttct tctggcattc 17881 cagtagcaaa gcgagctgtg ggcacccatt tttctagttc tgtaggacgc agcaaactgt 17941 cgcgaattgc actgaaggat gcatccactt ctatacgtat ttggctttgt tgataattac 18001 ctaacattgt ttgaggatta cggtggcttg ttgattgaag agtgtttttt tagtataacg 18061 aaccgcagtg aagtacctac caccaaaacg gagtaccgtt tatggtgggg gctacttgat 18121 ggcacgaagt ccaggatttt caaaagattt gtattttgac tgacatctgg cacctgaaaa 18181 caagaatgtt atttaagcat taaggattaa gtgttaaatg cctaatttgc cataaaaaaa 18241 cacttgattt catgtttttt tcttttcaag taacttaatc gaacacggtt tatacgctta 18301 agtggcagat attcagtttc ggaaaattcc cattgtatat agccagtcat 
ttctttcaaa 18361 ctgcataagc tcttttcaac gacatctatt aacaaaagga acggcatcaa ggacatttgg 18421 accttgtaat acagattctt ctgtttacaa caaaagtcta caaatctttc caaaagccaa 18481 cttctaatgt catctacaac gctaatcgcg caggtgattg gagatgaccc gagtgtgctt 18541 gtacctgggc aagtactttt cattccaata ctgcaatgaa acaataacca ataactaaag 18601 ccgcattttc tttcccagta gcatctcata gctactgcga accgtgatag tggttttacc 18661 ttatcactcg cccgtggttc tgtattcata cttatcaagg acgaaaatca aataaaaatt 18721 tgacattccc ctgcatttca aacaggggat tctttgttct agcccacatc cgtgtttacg 18781 ttaacaaact tcaccttgat tgttagtttc acttgcttga tggcatcatt cccacccact 18841 catcagccac ctgcttaagt taagttaagt tttgttatac taagataatt atctttcctg 18901 gtttttgcac aggaaatttc ggtaagaaat taaaggtata ggtttttttt catggcggat 18961 caattaattc gcgccactgc agcacaaggt ggtattcgtg cagttggtgc aataaccaca 19021 cgtttaacag aagaagcaag aaatagacat aagctttctt atgtagcaac agccgcgtta 19081 ggccgaacta tgacagcagg cttgctgatg gcttctagta tgaaacgaac tggagcgaga 19141 gtcaatatcc gcgtcaaagg tgatgggcct ttaggtggta tattgataga cgcaggacta 19201 gatggaacag tacgcggcta cgtagaaaac ccatctgttg aattgcctcc taatagtaaa 19261 ggtaagcttg atgttggtgg tgcagttggt aaaggttttc tttacgttgt acgcgatatt 19321 ggatacgggt atccttactc tagtacggtg gaactcgttt ctggtgagat tggggacgat 19381 gtggctcatt acctcatcag ttccgaacaa acaccttcag ccgtagtttt aggcgtgttc 19441 gtaggagcaa gcggagtgac agcagccgga ggattattag tacaagtttt gcccaaagcc 19501 gctagagacg aagctttagt ggaaactttg gaatcacgag tcgctgcgtt atcaggattt 19561 actcctttgt tgcaagcagg gaaatctttg acagacatct ttcaagactt actgggagat 19621 atgggagttg cgatctttcc cgaaacccag attttacgtt accattgcgg ttgctccttt 19681 gaccgggtgc taggagcact caagatgtta ggtgaagcag aactccaaga tatgattact 19741 aaagataacg gagctgaagc aacttgcgat ttttgtggta cagtttacca ggcaagtcga 19801 gatgacttgg ctcaacttgt tgtagatttg caagcagaag cttaggctgg ggtgtaaagt 19861 ataaaaacat acttttttat ctaggatata catgatatga gtggaggaca tatcatcatg 19921 taagaagtaa cttcatccaa atggaaagtt acgatggcag ggacaaaact ttggggctat 19981 aacagatcgc taaagttaat tgtttatttt 
ggtgtgagaa atgacagagc ggaacatacc 20041 agaaagttgg ttagcaggca aagcaagaga gccagacaac aatatgacta gaggataccg 20101 acggcaagaa tctagtgaaa cacataggtc tggagtgcca gcaaccagtt ctatgattga 20161 gtctgtgaac agctcaaaag aaaatgattc cgaactactg tctttagaca aagagtcgca 20221 aaaaatagtt tcactcagta aaagttcttg gaaattgcct cagtggacaa aaagctgggt 20281 gttgtggact ttattgttag cgttggttcc tagcgcaatt gcctttatgg caacaggaat 20341 gttactcaag ctgccttctg cgcccaactg cccatcgatt ttctggcccc tggctagtgc 20401 gtcggtacgg ctgcattgcg ctcagttggc agcttctaag gaaacgacga aagaccttct 20461 gcaagcgatc accctagtca aagaactgcc aaaaaatcat ccgcttcgtg cagaaattga 20521 ccgtttaata gaagaatggt cacgggatat tctgcaatta gcagaccaga gtttccaggc 20581 aggtcggcta gacgaagcga tcgcaactgc tcgtcaagta ccgaaagagg aatccgctta 20641 tcaattagtg gaagcaaaaa tttccaaatg gcaatccatc tggtcaactg cggaaggtgt 20701 ctacactgaa gcagaagccc aaatgcgcga cgagaaatgg catcaagcgt tcatgttagc 20761 ttctagattg ttgcgtgtag ataataaata ctggtcgatg acaaaatatg accagttaaa 20821 tcgattgatt gtcagcgcgc gagaagatgg tgagcagtta gggaaagcaa aaactttagc 20881 cgaaagcaaa acagttgata atctcctcaa agccatcaag gtggcagaat tgattcagca 20941 ggatagttac gtttatccga aagcgcagga aactctttct gaatttggag agaaaatgct 21001 ggagttggca caggcaaagc tggacgcacg aaatccagac caagctatct tgatagccca 21061 aaaaattccc ccaagcactc gacacaataa ggacatagag gactttatta ccttaactga 21121 agcacaaagg agtgcatggt tagggacaac agctggttta gaaacagcaa ttgcccaagc 21181 tcaacaaata caggcaacaa gaccaaaata tgaaaaagcg caagaactta ttgctagctg 21241 gcaagtagaa attcaagatg tcgcccattt agaaaaagca agatacctgg ctagccaagg 21301 ttcagttaac gatttcgcag cagctatcac tgaagcacaa ctgatccctg atggcaaccc 21361 tcgtgcagag gaagcaaaaa aagaaattgg tggttgggtt gctcaaatac agacaatcga 21421 agaccgtccg tatttggatc gtgctgatca gatggcaatg cttgaggatg tgaactctct 21481 acaagcggcg atcgcacaag caactcaagt tcgacaaggt cgtgcattgt atccagaagc 21541 acgaagaaaa attgggattt ggacagcaaa gattcaacgt attcaagatc agccctactt 21601 agaccaagca aaaaacttgg ctgacagtgg cgatttatct tctgctattg caacagctca 21661 acaaatccga 
ccaggacgag cactttccgg tgaagcacaa gcagttatag atgagtggca 21721 agggcaactt cgtgcgaaac aaaactggaa caaagcacgc gaaatcgcag tcaccggaac 21781 tccggaagct ttggcaaagg caattaggct agcaaacaga gtaccggata ataacatcct 21841 acgcagtgat gctagtatcg cgattgacca atggagtcag caattgttgg atatagctcg 21901 tgctcaagga gaatctaata ttcctagggc tattgagaca gccagactga ttccaagagg 21961 aactgatgcc tatagtgctg cacgagaaca aatcagagca tggcaggaat atctcaatcc 22021 tcagccacaa gaagctccac aagaatctcg acaagaatct tcacaggaat cttatactga 22081 gcaaccaatg actatcgaaa ggcaatgatc gggtgttgtg catttttgat gcacaacacc 22141 taatcacaaa ccacgaagaa aaagggttgg ctaattttta ggaggacatt cccccatacc 22201 ttatttccgc acagaccaag aacgaacttt gcccaaagta acaggaataa tatagctaac 22261 atcaatcgta aatcgataca cctttttagc cctgctgcta ttttccgggt caaaaaactt 22321 cagtttcaca tcccaacaat cgctatcaag gcgacagaaa ggactttttt ctacttcgat 22381 gctgtcgagt tccatacctc tagaaacagc ctcagcaaag ctggaagcag cttgaaaggc 22441 attagtagca gcaaaattca acgctctatc tttggaagtt tgtcccaagt ttcgcaaatc 22501 atagtacact ctttgcaaaa agctactgag cgatagcgaa gcgctgctgc tttcagcaga 22561 tcgcctgact ctgacatcat tagcatcaat ttgttgggta gttacacttt cgacggctgc 22621 actcaccaag ctattcactt tccaaccgta cataccccga atgtttggta gtgtaatcac 22681 aggaaccact tgcccagaaa acaactccac ggttctatca gttaaccgtc cgggaaaact 22741 cacccgttca atatagtccc tttccaatct ctaagacgtt ctcgccatcg tagccagtca 22801 aaaaagtgcg gttgccaggg tcaagagcga taactctgtc actaccagtt cctacgaatt 22861 caactacttg ggggaaacat gcaaaccact tcccacgttg gtaaacaagt tgagtaccat 22921 attcacattg gtaaggaaca ggttgtgaag actgatactt caaaccttta gtggtttttg 22981 agtaccaagt accgtttttg taatttccta ccttgaattt gataacttga ctcgtggctt 23041 tacaagattt aaactttgca aaaccgcgtt ctgatttagc ttgcttgaat gcatcgcaag 23101 catcgttaca tgcttcttgt cgttgatgac ctggcaagtt taaccaactt ggcttattct 23161 ccagtgcttg tagcatctta tccattgtgt aagccgatat atcttctaca ccgtcaaaac 23221 actgcaacag agcaattgct tcgttatagt accagcgata tccagctaac caagatttcc 23281 aaatttgatg cagttcctta gatggatata cccgtatttt cagaacactg tttggtgata 
23341 acttcgtctg gctcttttgg tgtctgggta atactttcct tttcttgccg gacttgtttt 23401 tcgtatttgc ccggcgacag cgtgaaattt catctgtcgc acttgtttga agaaatcaat 23461 actaccaaag gctgtatcga ctaataccaa gacttggtag cgtttcgtca gggatttagg 23521 taaagtactt aaaagccgca gacccagttg cacacagcca ggatgaccct taccccgata 23581 gacacggaag ctccaaggta gtcgccatcg tcctagagca atgtagagta cgactaagtg 23641 aagtccccgt ttaccgttgt atactctaat gaggctgccc aaccccttaa acttgcctgt 23701 cttctctaaa gtcgtcagat caagtattat ttgtagagtc ggtctgcgac ccacagattt 23761 tgccgccaat atttgttcta acgctgactt gcgtacagcg cggatgacgg cacgacatga 23821 ccattggtac ttattaagga agcggctgag ggcacttgct gaaagggtct ggctgtgttc 23881 aggtaacgtt aagctagctg cgcctccggc aatcgcccag ttgcttgcag aaatagttct 23941 aatagtgttt gtagactgtt ctgttggtag cgactaggca tcaaggacac cagagtataa 24001 actaaacttt gggcgtaagt aaggaggttt cgcataaacc ttcaatgaat ttctttctac 24061 gccctttttt tcagattttg accaccagcg caaccctgct tgttgagaat ggtgcaagat 24121 ctcagttgaa acggactatg attattttgc tcgttcccag cctgaggctg ggaatgcagg 24181 ttacgaggct ctgcctcgac taccagtacc ggaggcagag gctccaggta gttcattccc 24241 tggctgagcc agggaaccag agaaggttga acactaatta acgcctcgcc agttagtaac 24301 tgataagccc ccggtatatt caacttaccc agcaaacagc gttccgcttc ctccacttct 24361 tcggggttgc aaggaattgc actatttaaa accgcttgac gcaccgcttc cgcattcggt 24421 tgttcccccc gttgcaattg cagactcatc agcaacgcag aaatacccgt aacgataggt 24481 gcagcacaac ttgtcccttg ttggcggatg ggttcgtcag ttcctggctg tgcgcctaag 24541 atattttccc catttgccat cacaccttta ctttgataat ctccaccgta gttgctaaat 24601 ttaaacggct gtccgtcatc gcgcattgca cccactgtca ggacaccaga aataatcgca 24661 gggatgcacc aacattcgcc tttatcatta ccgcctggtg caacaattaa gatgttattg 24721 tcttggcatt gtttgacagc acgggcaaat aaatctggag tgataccatt tcactaaaag 24781 tgtgatacat atagttctag agacatcact caataaaaac caaaaatgtg tcgagttctc 24841 aaattagaga taaaagaaac acaagctgaa ctccaagaac tgttgaggca acaaaaaact 24901 gggttgggaa aagaacgaat ccaggcactg tatttgctga aaacaaggca agtagaaacg 24961 gtacagcact tggcagtaat gttgggtaga gggcgtataa 
cgttacatag atggttaaaa 25021 ctgtacagaa aaggtggttt aagtagcctg ctcgaacttc gtaagagtcc aggacgacca 25081 aaaaccattc cagtagatgt gcgattgcgc ggagcgcaga cgccaaaggc gtatcgctac 25141 tcaaaaaaga gctttccgaa ccagaagggt ttaaaagcta tgaggaaatc cgtacttggt 25201 tgagagcgtc tgagggtatc gaagcatctt ataaagttgt gcatgaggtt gtgcgctaca 25261 aactaaaagc aaaattaaaa gcaccacgtc cacgaagtgt aaaacagaac aagggtgtag 25321 aggaagattt taaaaaaaac ttcacttccg ggcttgaact aataaagaaa tacttaatat 25381 cgccattaga acaacatcgt agagtccgtt attggtgtgg agaaggaagg tgagttcggt 25441 cttcaaacga taacagggag attaatcacg cttacgctcg tcaaaccact tggtttgacg 25501 agcgtggaaa cgtgacaact tccatctgta tggagttgta gaaccattga caggaaagag 25561 ctttatgctt gaattttcgc atttggatac catatgcaat gaacaatgtt tttagagcat 25621 tttacggcag aatatccagc tatatttgca tattattcaa gttgataatg gcgcatttca 25681 ttttagtaat tatagcggtt ttcaacgagc gtgaagtaca gtaacaacag tgtgaaaacc 25741 aattacgaat ttctttaagt gagactgcat ccattgcatt tgtaatacca ttttctagat 25801 tttctcgtga acgagcggca agcgagcgca aatactcctt gatttttgac caacaatttt 25861 caattggatt aaaatcgggg gaataagggg ataaataaat caagcgtgct cctttatctt 25921 caatcagctg tcggatacca tcaactttgt gtgcgggtaa attatccatc acaacacaag 25981 ccccagacca taagttagga actagaatac tttcaatgaa aaatcgaaaa atatcaccgt 26041 tagtaccacc atcaattgtc attgctccaa tccagcctgt tagagcgatc gctccaatca 26101 aagtgatatt atcgcggtga tgataagggc gttcaccgta agcacgtttg ccttgtagag 26161 aacgtgcata cgttcgtatc aacggcgcac aagaggacgg agcaagatct gtcgaggata 26221 tttcaactca ctttaattgg cgagatggtg caaaccgagc gatcttttat ttaggtgacg 26281 aagcgttagt tggtggcggc gatactgaac aagaggatat cgaagcagct aatcgcgcta 26341 ttgaagcagc cagaaatgct ggagttgtcg tccatactta ttttggaaca tctaaaagcc 26401 aaggacaaga caaaacggcg gctgagtacg tccgcctagc acaagagacg ggcggacacg 26461 gttttagaaa tcaaaatgct attggtggtt ttgctgagat tctgaaaacc gttatttgct 26521 cgactagaaa tagagatact acccctcaac accaaaatta tttgtgtgtc cagcgattcg 26581 agggtagagg aaacgggaca gttgattcct gtggtgccat tatatttcgc agcgatcgct 26641 ggcggggcac ttgtcgccca 
atcaacgcag atctcaacgg ttaaacctac cgctccaact 26701 gcttcactgg ctagtcctgg aggattattt tctttagcag cagctaattt agtggctgcc 26761 ccccaaagca gtatctcaac ggcagcagtg aatatgccca tctacatgat ggcagtccag 26821 ccgacaaatg acgaaagcga taagccggag gcttgacgct acgcgtatcg cgccacttaa 26881 ctggaataca gacaaaattt tgaggctcag atttagggtt ggctctgtta atcctaggga 26941 aacagacaaa gatacccttc cgcaggggtc ggatttggcg ttaagcagat gcgagagtca 27001 gcgccctaag tccttacgga cacgctgcgc gaacgggcac gctacacgtt cgccctctgg 27061 gcgtgcgctt tgcgcataca gggaattcaa aggagggcta cgccccgctg cgctaacaaa 27121 attcaagatt caaaaaaaac gttattattt tgaactttga gagaagttgg agcggagagc 27181 agcgccttgc ggagccagta cttgatgagg gtttccctca cgcttgttct ggcgttgggg 27241 ttccccccgt tgtggcgact g // LOCUS NODE_1077_length_27149_cov_5.373035 27149 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 27149) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 27149) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..27149 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 611..1891 /locus_tag="DP116_09265" CDS 611..1891 /locus_tag="DP116_09265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_09265" /translation="MSKTTLNPYVQKSTTVNHCYCRACGSTLKHTFVDLGMSPLCESY VTSEQLNQMEAFYPLHVYVCDQCYLVQLQEYVSPQDIFSEYAYFSSYSDSWLQHAKNY TEKVIARFGLNTSSQVVEIASNDGYLLQYFLSKGIPVLGIEPAANIAEVAKAKGISTV VKFFGRNTANELASVGKQADLLAANNVLAHVPDINDFVAGAKILLKPEGVITMEFPHL MRLMEENQFDTIYHEHFSYLSLLTVEKIFADHGLAIFDVEELSTHGGSLRIYACHAED NSKSMSQQVIELRAREEAAGFNNIEHYFSFARKVKETKFKLLEFLISAKRKGKSIAGY GAPGKGNTLLNYCGIREDFIDYTVDRNPYKQGKFLPGTHIPIFHPDKIQETKPDYLLI LPWNLKNEIMSQMSYVRDWGCQFVVPIPEVSVYS" gene 1954..2808 /locus_tag="DP116_09270" CDS 1954..2808 /locus_tag="DP116_09270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744958.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glucose-1-phosphate cytidylyltransferase" /protein_id="PRJNA477356:DP116_09270" /translation="MKVVLFCGGLGTRLRESSTNVPKPMVHIGYRPILWHVMKYYAHY GHKDFILCLGYKADVIKNYFLNYDECVSNDFTLQEGGKKIKLVNSDIEDWKITFVDTG LTSNIGQRLQAIEQYLEGEEVFLANYSDGLTDLHLPDIIEDFHRHNKIASFLCVKPSQ SFHLVSMEENGLVSDIQDVKQAGIRINGGFFVFNKEIFKYIEPGEELVLEPFQRLMKL QQLIAYKYNGFWACMDTFKEQQQLDDMYCQGNAPWTVWKCLERKGKSLEYTPSHHVAS HSMPVSLA" gene 2921..3574 /locus_tag="DP116_09275" CDS 2921..3574 /locus_tag="DP116_09275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PIG-L family deacetylase" /protein_id="PRJNA477356:DP116_09275" /translation="MLKFDLEKNSDLSYKVLCLGAHCDDIEIGCGGTILRLIENYPNL TFYWVVFSSNEQREKEAYNSANKFLEKIPEKKILIQQFQDGFLPYLGSEVKQFFEQLK RDYNPDLIFTHYRHDLHQDHRLISDFTWNTFRNHLILEYEIPKYDGDLGNPNFFVHLS QENYQNKVKYILDSFPSQNSKQWFTEEIFLSILRLRGMESNAPSKYAEGFYCRKVVF" gene 3833..5149 /locus_tag="DP116_09280" CDS 3833..5149 /locus_tag="DP116_09280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744960.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase M28" /protein_id="PRJNA477356:DP116_09280" /translation="MNITDLKQKVNPNEVSHQMYQLISELYPICRSITGNGFRETLHW ISKHIELTKHEVPSGTQVFDWTVPREWNIKDAYIKNSQGERIIDFNKSNLHVVNYSVP IHQKILLEELKKHLFTLPEHPDWIPYRTSYYKESWGFCLSHNQLLELEDEEYEVCIDS SLENGHLTYGEYYLKGDKPDEVLISCHACHPSLGNDNLSGIALSTFLAKYLTQINLSY SYRFIFIPGTIGSITWLSLNESQVHKIKHGLVLTCVGDSGKSTYKKSRRGDAEIDKAV THVLKHSHQDYDIIDFFPYGYDERQFCSPAFNLPVGCFMRTPHSCYPQYHTSADNLDF VQPQSLADSFSKCLSTLHILENNKKYLNQNPKCEPQLGKRGLYSAIGGQTDTKMTELA MLWVLNLSDGDHTLLDIADRAGMSFDFINKAANMLLEHDLVKYPPA" gene complement(5366..6130) /locus_tag="DP116_09285" /pseudo CDS complement(5366..6130) /locus_tag="DP116_09285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011995189.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="oxidoreductase" assembly_gap 5578..5587 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 6485..6971 /locus_tag="DP116_09290" /pseudo CDS 6485..6971 /locus_tag="DP116_09290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495045.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS982 family transposase" gene complement(7016..7366) /locus_tag="DP116_09295" CDS complement(7016..7366) /locus_tag="DP116_09295" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09295" /translation="MNRLVNRRNALFLVPNWLTTFSHLRFRKAVKQLDQIVYNIINQR RTSEENQGTFLDLLTQVAGYQVEIHHNGSDYLAPKRGAAQTVELSARGAQHERSLNAY NKVAVTLPATAPQA" gene 7509..8714 /locus_tag="DP116_09300" CDS 7509..8714 /locus_tag="DP116_09300" /EC_number="6.3.4.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874241.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="5-(carboxyamino)imidazole ribonucleotide synthase" /protein_id="PRJNA477356:DP116_09300" /translation="MKRVGVIGGGQLAWMMAGAAKKLGVELVVQTPSPNDPAVPISAK DNVFAAIDDANATAELANRSDVITFENEFVDIEALSKLASQGVCFRPRLEALTPLLDK YHQRCYLRDLGLPVPRFVAIEQDWKTTAEVIFANLIGFPAVLKARRHGYDGQGTFIIR EIESLKQKLEVSCTKGVGSQSNSFLLEEFIPFERELAVIAARSVSGDVCTFPVVETQQ EEQVCRRVIAPACVSSQVAVEIEKIASTLLNSLEAVGVFGIELFLTAEGKVLVNEIAP RTHNSGHFSIDACETSQFEQHLRAVCGLQLGNTAMICPSAVMVNLLGYEISQSDYTIK RQQLEQIPQAHVHWYGKTESRPGRKLGHVTVLLDTQSQDQAIAISLRLDALRIAQNIE SIWYPHKTC" gene 8766..8949 /gene="ssrS" /locus_tag="DP116_09305" ncRNA 8766..8949 /ncRNA_class="other" /gene="ssrS" /locus_tag="DP116_09305" /product="6S RNA" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00013" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="Derived by automated computational analysis using gene prediction method: cmsearch." 
/db_xref="RFAM:RF00013" gene 9255..10499 /locus_tag="DP116_09310" CDS 9255..10499 /locus_tag="DP116_09310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312721.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_09310" /translation="MFPASVFKLDNGLTLIHQEIATTPVVVADIWVRAGANFEPEPWF GMAHFLEHMIFKGTARVPPGVFDQKIENQGGLTNAATSYDYAHYSLTTASLHLEETLP YMGELLLNAAIPEDEFTRERDVVLEEIRQAHDDPDWIGFHALISSVYQHHPYGRSVLG SEQELMKQSPVQMRCFHRAYYQPENMTVVIVGSIAAQSALKLVNQTFVNFVERCCDCP QKKEVAKPVIAGIRRQELSLPRLEVARLLMAWIGPGVEQLETCCGLDLLSVLLAQGRT SRLVRDLREEQQLVQGICSHFSLQEDSSLFTITAWLEPKDVERVESLIRLHLQDLIDN GISESEILRAQRLLCNDFAFSTETPNQLAGLYGYYNTIAQAELAVAYPWQIQSYDAKQ LQKLAQEYLSPNHYAVTVLKPS" gene 10832..12118 /locus_tag="DP116_09315" CDS 10832..12118 /locus_tag="DP116_09315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457122.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_09315" /translation="MTTSQQKSHIHRTVLNNGIVVLVVENPAADIIAARIFVRAGSCY ENQEQAGLAYLLSTVLTKGCNGLSSLEIAEQVESLGASLGADIASDYFLLSLKTVTSD FAQMFTLAGRILRSPTFPEAEVELERRVALQDIRSQKEQPFTIAFDQLRDAMYENHPY ARSALGNEATMSRLTRRDLVEYHQTHFRPENIVISIAGRITPENAIALVQQVFGDWQA SPIQPLQKLDLPELNVEPQIKVTPQQTQQSIVMLGYLGASVLSNDYASLKLLSTYLGN GLSSRLFVELREKRGLAYEVSAFYPTRLSRASFVVYMGTAPENTQVALSSLRKEVDLL FTTELEEDALQAAKNKILGQYALGKQTNAQIAQMYGWYEVLGLGIDFDTDFQEAIASL RATDTMAAAHQYLREPYISLVGQEQAVHGAILNYNS" gene 12194..13426 /locus_tag="DP116_09320" CDS 12194..13426 /locus_tag="DP116_09320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748453.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09320" /translation="MISSIKTNPLNDSQFLNPTVNSSGRKTEPRYQSLQMLQILLSCG VSLLFGLSTVSIASQPQQIAQTTSSEGINRPTLQVGSKGERVTELQAALKLLGFYTGT VDGEYNESTVLAVSRFQEAAGLKADGIVDTITWQRLFPGETTVASSGSSPNSTSRPPL ASGTSNTNQVVVPSSNSTASTSTTTNTPSATSKPEPRAVTGNTTATSTTREDNSKPEP RSVTRNTTATTRTREDNSKTEPRSVTRNTTATTRTREDDSKTEPRSVTRNTTANSQRT YVRQSTSTRSGSTRSEQSIRTQQSDRSGSTRSEQSIRTQQSSRPSSTTRTQQVASVQY TSEGLAILRIGMRGPEVVRLQRRLQRLGFLNEDEVDGDFGASTEAAVIALQKRYGLDA DGVAGGGTWEILMRRRGR" gene complement(13446..13796) /locus_tag="DP116_09325" CDS complement(13446..13796) /locus_tag="DP116_09325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015153444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09325" /translation="MLRVLADENFDNTIVRGLLRRNLNIDIVRVQDIGLSGQDDPIIL AWAAQENRVLLTHDVATITRYAYERLTQGQPMPGVIEVSVDAPIGQVIEDILIIVECS LDGELEGQVQYLPL" gene complement(13790..14128) /locus_tag="DP116_09330" CDS complement(13790..14128) /locus_tag="DP116_09330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015153443.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09330" /translation="MALSIIAEPAPLETNADGIVRVGKTRVTLDTVVAVFKQGATAEE IVYRYPSLNLADVYATIAFYLNHQQEVEIYLQQRQQQTHEVRKMNQQRFDPQGLRDRL LARKAEQEAC" gene 14295..14933 /locus_tag="DP116_09335" CDS 14295..14933 /locus_tag="DP116_09335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015083222.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transaldolase" /protein_id="PRJNA477356:DP116_09335" /translation="MAIYLDSAIVSEAEIASRMGWVKGITTNPTLLAKSDNPPETTLK KLTQLTSGPVFYQLMSSDFERMLTEGRKAFEIIGQQTVLKIPATPIGFAVVASLSPEI TCSVTAIYSAAQAAVAREAGARMAIAYVNRATRLLGDGIALVRDMASILNGSNTEILA ASIKSPEEAAASLQAGAHHLTLPLSMLQAMATHEFSDKTVEEFAKNGIGLTI" gene complement(15295..15729) /locus_tag="DP116_09340" CDS complement(15295..15729) /locus_tag="DP116_09340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Hsp20/alpha crystallin family protein" /protein_id="PRJNA477356:DP116_09340" /translation="MLSRWNPRQEFNALSDQLNRLFDETLAPARNWEGFTKFPAAELT EADDAIHLKLEVPGLEAKDIDIQVTENAVAISGERKSETKTEGKGYTRSEFQYGKFQR VIPLPTHIQNTKVTAEYTNGILNLTLPKKEEAKNKVVKVNLE" gene 15947..16855 /locus_tag="DP116_09345" CDS 15947..16855 /locus_tag="DP116_09345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_09345" /translation="MFPTFLPTTVGELTEPSSITLAQSIEQIALTTPLSTQPITTTYV RQGSGGTPLLLIHGFDGSVLEFRRLVPLLALQNQTWAVDLLGFGFTDRPVGVKFSPVS IKTHLYYFWKTLINKPIILVGASMGGAAAIDFTLTYPEVVQKLVLIDSAGLTGGSPLS KLMIPPLDYWATQFLRSPKVRASISRTAYKNRELASLDAQLCAALHLECSDWHKALIA FTKSGGYSAFRFKKLAEIVQPTLILWGDSDRILGIRDANRFKLAIPNSKLIWIKDCGH VPHLEQPQITAQHILDFRDDHLHKLV" gene 16921..17355 /locus_tag="DP116_09350" CDS 16921..17355 /locus_tag="DP116_09350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652823.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty-acid synthase" /protein_id="PRJNA477356:DP116_09350" /translation="MPVRDRYHQSVKNALIKDGWTITDDPLHLKWGKRDMYVDLGAEK LIAAQKQGRCIAVEIKTFRSVSDMTDLEQTLGQYLAYRSVMTRTDPNRSLYLAVHDEV YADLFDEPIAKLLVEDYKVDIVVFKPEQEVILKWIPWTNTGS" gene 17325..17684 /locus_tag="DP116_09355" CDS 17325..17684 /locus_tag="DP116_09355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652822.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_09355" /translation="MDTLDQYRQLIRHILIEHTKIPFSYGEIQFETVFDSEQDRYLLM ILGREPAYDFSPTVTRRVHGCLIHIDIIDGKIWIQRDGTEEGVATELVRAGIPKDQIV LGFRSQELRQDSGFAVA" gene 18158..18469 /locus_tag="DP116_09360" CDS 18158..18469 /locus_tag="DP116_09360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011614180.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09360" /translation="MATKLIELEDGTLVEVEVPEDQAKQISGSAADRVSTTFNKIKPI LVNTCRPIADAWQELNQEMQIEQAEIEIGFSFEGEGNVYVTKAKAGSNLKVKLVLKPK A" gene 18475..19368 /locus_tag="DP116_09365" CDS 18475..19368 /locus_tag="DP116_09365" /inference="COORDINATES: protein motif:HMM:PF13365.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09365" /translation="MSSIFQDSVILITSSDPNLQKRRVFGTGFVIHHTDEASYLLTCA HVVRDVGGEALVLADSIPATVVANGESKGFDLAVLRVQRLWCPALRLSVSSAAKHFAI AGFYAFDQKETRLLREIQGYLGKQSFIPSTDGRDTHKLKPEAYRIKAWDLHIEGEDIL QPGYSGSPVVDRTSGEVLAMVSHQVGKGEKGLAIAIEAIQNIWHSMPYDLLKTDNVIF EPGVDYTRLREFLAAGKWKQADQETEMLILEVAGRRGNLLNVESIKKLPCLDLRTIDQ LWVKFSQGHFGFSVQKQIWER" gene 19400..20128 /locus_tag="DP116_09370" CDS 19400..20128 /locus_tag="DP116_09370" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09370" /translation="MVFISLITGLIIATAIGASVITAVILKQSSTSENNSTASPTRTV AQLPTTSPTPTVAASPTASPTPTVAETPTASPTPIVTESPTASPSGTAVSLLDTECLS STPGVEQSLVKPREPKNIAIGGEALPEIAYLFSESSEPYPYISKSKAAGVSCVLNSKF RQLNLVVGINGKHPSARQDEKIVFDVSVDNKLIATKDLTIAAKQVLNINVENARSVGI KASCLKDNNYSYCPYVAFVEMSLR" gene complement(20295..21059) /locus_tag="DP116_09375" CDS complement(20295..21059) /locus_tag="DP116_09375" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09375" /translation="MTKKRLSDLLQQEAQKFSPPEGETTIDVVAISDDNPSASSDNDT EKEDGSSQLKEEPQAAEITTSRRTTATKAELEVTVKELKEALEKAQQKEASLVNELKE TLEKAQQKEAYLVNELKETVDKAQKKEASVVKELKEALGKAQQKEASLQEQITDLQLD VSKHKKVAAKLKTELDDAKQAAIQLAEANSQLTETISALQQQKQNTQSSQQEKEKTQS SKSIKSYRKSYDLPEKQPIKKTEESVDNSSPMWLLD" gene complement(21052..21690) /locus_tag="DP116_09380" CDS complement(21052..21690) /locus_tag="DP116_09380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ParA family protein" /protein_id="PRJNA477356:DP116_09380" /translation="MPKIIAILNGKGGVGKTTTAVNLAATFAQEKKVLLIDADIQGSA SWWFGRSENSMGFDLSQSTDPKLLGYLRGITGYDLVMVDTPPALRSEALAAVVAVADY LVLPTPCAPMDLAVLIETIQQVVNPVGKPHRVLLTKVDTRSLGEVLEAQNSLKEMGIP ACKAFIRIYKAHERAALEGVPITQWRGKNAQEAASDYRQVAEELKRDWREYD" gene 21845..23224 /locus_tag="DP116_09385" CDS 21845..23224 /locus_tag="DP116_09385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome P450" /protein_id="PRJNA477356:DP116_09385" /translation="MRATNNLPNGPKMPVFLRRMKFIFQPLEYVEDFANKYGDNFTLW SRNDSPAIFFSHPQALQQIFNADSSSLNAGSGNRGLQFLLGSNSLILLDGGRHQRQRQ LLTPPFHGDRMRTYAETIREITRQVSDEWKMGKPFNIRASMQEITLRVILRVVFGLDE GPHLEKIRQLLSSLLDSIGSPLLSAGFFFRFLQKDFGAWSPWGRVLRLRQQIDEMIYA LIRERRVQSPQNRQDILSLMMSARYDDGQPMTDEELHDELMTLMVAGHETTASALSWA FYWIDYLPEVRDKLLRELDTLGDKPDPSIVAKLPYLTAVCQETLRIYPIAMNAFPRIV RSPIEIQGYTLPEGTVIIPNIYLAHHREETYPQSKQFKPERFLERQFSPYEYLPFGGG NRRCIGLAFAQYEMKLVLATILSRFQVSLVNRRPVRPVRRGLTVAPPAGMQMVAMPLE KRVNTPALV" gene 23281..23403 /locus_tag="DP116_09390" /pseudo CDS 23281..23403 /locus_tag="DP116_09390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012625949.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transcriptional regulator" gene 23403..24176 /locus_tag="DP116_09395" CDS 23403..24176 /locus_tag="DP116_09395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876886.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nucleotidyltransferase domain-containing protein" /protein_id="PRJNA477356:DP116_09395" /translation="MREKILETIIAALQPEDFVLALWQGGSAAHGYTDEWSDIDIEVI VEDNYVQQTFDIVEAALQIISEINFKYRVPEPTWHGHSQCFYQLVGVSPFLAIDFAVM KRSSRNDFLEMERHGQAVIAFDKANLIVSTHLNHQEHFSKMKARFEQLKTMFYFWQIF VKKEINRGHLAQAIVNYQSYTLRHLVDTPTYVERQADLQLYQALKQGDFCYIFNSRQM GKSSLLVRTKHRLQQEGFISCPHNRLRLSSDLNPPMKTC" gene complement(24236..24970) /locus_tag="DP116_09400" CDS complement(24236..24970) /locus_tag="DP116_09400" /inference="COORDINATES: protein motif:HMM:NF033545.0" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09400" /translation="MADWLSEVTRTTVSRQRGWEILRQMTKRAASTSPISHRKRLSRA RSLEKKLATEVEELKSAYPEAEIQLWCEDEHRLGLKPILRRVYVPEGETPIANVNWRF QWLWLYGFVHPKSGETYWWILPYVNTELFNQVLADFAREFKLGAKKHVLLAVDALSPK NLAALIKRVPDYAHRSTNRRLPDFLRVRPRDQAGWHISKDLEVPAMFTFNTFTIPFSR ITTLQLDYGLWLMNQSQIAHLKHLMI" gene complement(25085..25333) /locus_tag="DP116_09405" CDS complement(25085..25333) /locus_tag="DP116_09405" /inference="COORDINATES: protein motif:HMM:PF13384.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09405" /translation="MPKRISIVEHLNICELEQLYKHAKEGIESRQYQIIWLLAQGKKT EEVEQITGYSRTWIYALVKRYNELGISGLCDCLRQSYA" gene complement(25680..27008) /locus_tag="DP116_09410" CDS complement(25680..27008) /locus_tag="DP116_09410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874312.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09410" /translation="MTEQRDADLPSADSQHSSESTDVTTKSVDSSLTNQLSGVWNETT ARLMKVLPVDKISQTVVGWFSVSEAQVAEILETIRAELPTTEALLIGKPQAGKSSIVR GLTGVSAEIVGQGFRPHTQHTQRYAYPSTDLPVLIFTDTVGLGDVNQDTEVIVQELVG DLQQESRRAKILILTVKINDFATDTLRQITQQLRQKYPDIPCLLAVTCLHEVYPPGTD DHPEYPPDFEEVNRAFTAMQQAFVGLYDRAVLIDFTLEEDGYTPVFYGLEALRDAIAE LLPEAEARTIYQLLDEGTGKQLGNLYRDVGRRYVLVFATISATIAAVPLPFATMPALT ALQVSMVGLLGKLYGQTLTPSQAGGVVSAIASGFLARAIARELVKFLPGFGSVIAASW AAAYTWSLGEAACVYFGDLMGGKKPDPQKIQSVMQDAFQVAKERFKGIKR" BASE COUNT 7852 a 5657 c 5668 g 7962 t 10 others ORIGIN 1 aagtaagaaa agctagaaaa gcaagagcct tgaacatgaa tttctcaata taattaataa 61 aaaacagcaa aataagttac atatctttat gtattttaaa ggcataaagt aacataagtc 121 aacattgtat ttattgtcca cactaaaaat aatgatagta ggaaaataat ggtagtttga 181 cctcattcaa agggaatata acgattttca ttccacggaa atacacacaa tctcgtttaa 241 gtcaagagat gaaaatagtc aacagtggtc aataaactgt tatcaaactg ataattgact 301 attgataaca ggttactatt taagtgacct ataattttga atacagtatg cttttagtaa 361 agttgcgtaa acactgtgtc gtttgtcgaa ctcatacgtt aagtcataca ttttctagga 421 cgcgatcgca aacagacaag ctgcacactg ttattacgta aacaattgcg ctagtaccgc 481 acaagcgcaa tcaaggcagt attgcaagca cttattcgta ggaaagtggc tcgttacgag 541 gaaatggttt tgtactcagt caacacaggt agtttacaga accaaggttt acctatagga 601 tttgaaatca atgagtaaaa caactttaaa cccctacgta cagaagagta caaccgtgaa 661 tcattgttac tgtcgagcct gtgggtcaac attaaagcat acctttgttg atttgggaat 721 gtcaccgctt tgtgaaagct atgtgacatc tgaacagctt aaccaaatgg aggctttcta 781 tccgctgcac gtgtatgtgt gcgatcagtg ttatttggtt caacttcagg aatatgtcag 841 tccgcaggac atctttagcg agtatgctta cttttcgtct tactctgact cctggctaca 901 acacgccaag aattatactg aaaaggtcat agctcgcttt ggattgaaca cctcaagtca 961 agtggtagaa atagcaagta acgatggtta cttgttgcaa tatttcttat caaaagggat 1021 accagtttta ggtatagaac cagcggctaa tatagctgag gtcgctaaag caaaaggcat 1081 ttccacagtt gtaaagtttt ttggaagaaa cacagcaaat gaattggcta gtgttggtaa 1141 gcaagccgat ttattagcgg ctaacaatgt 
actggctcac gtacctgata taaacgattt 1201 tgttgcaggt gctaaaattt tactcaagcc ggaaggtgtg ataactatgg agtttcctca 1261 cttaatgcga ctgatggagg aaaaccagtt tgacaccatt tatcacgaac acttctctta 1321 cttgtcgttg ctgacagttg agaaaatttt tgcagatcac ggtttagcta tctttgatgt 1381 ggaagaactg tcaactcatg gtggttctct gagaatttat gcttgtcatg cagaagataa 1441 ttctaaatct atgagtcagc aggtgataga gttaagagct agagaagaag ctgctgggtt 1501 taacaacata gaacattatt tttcattcgc tagaaaggtt aaagaaacta aattcaaact 1561 tttagaattt ttaatttcgg ccaaacgaaa aggcaaatca attgctggtt atggtgctcc 1621 tggaaaaggt aatacccttt tgaactattg tgggatcaga gaagatttta ttgattacac 1681 tgtagaccgt aacccataca aacaaggcaa atttttacca ggaactcata taccaatctt 1741 tcatccagac aagattcaag agacaaagcc agactatctg ctgattctac cttggaattt 1801 gaaaaacgag atcatgtcgc agatgtcata tgtacgtgat tggggttgtc aatttgtcgt 1861 acccattcct gaagtcagtg tctattcttg aatcagctac caatcaactc aagcttaatt 1921 tgaattgaat cgtaaggaaa aggagaagtt acgatgaaag tagttttatt ttgtggtggc 1981 ttaggaacaa ggttaagaga atcgtctact aatgttccta agcctatggt tcatattggc 2041 taccgaccaa ttttgtggca tgttatgaaa tactatgccc actacgggca caaagatttt 2101 attttgtgtc ttggttataa agcagacgtt attaaaaatt atttcctcaa ttatgatgag 2161 tgtgtttcca atgattttac tttgcaggag ggaggtaaaa aaataaaact tgttaatagt 2221 gatattgaag attggaagat tacttttgtc gatacaggat tgacttcaaa tattggtcaa 2281 agattgcaag ccatagaaca atatttagaa ggtgaggaag tctttttagc caattatagt 2341 gatggcttaa cagatttaca tctacctgac attattgaag actttcatag acataataaa 2401 atagctagtt ttctctgcgt gaaaccatcc cagagttttc atttagtttc tatggaagag 2461 aatggtttag tctcggatat ccaagatgtc aaacaagctg gtattcggat aaatggagga 2521 ttttttgttt ttaacaaaga aatttttaaa tacatcgaac ctggggaaga attggtactt 2581 gaaccatttc agcgattaat gaaattacaa cagctgattg cttataaata caatggcttt 2641 tgggcttgta tggatacttt caaagagcaa cagcagttag atgatatgta ttgtcaagga 2701 aacgctcctt ggacagtttg gaagtgtctt gaaagaaagg gaaaatcttt ggaatatact 2761 ccgagtcacc atgttgcttc tcattctatg ccagttagtt tagcttaaga gtaaatacca 2821 atcaaatgac agtagatatc ctcaatacac atctactgtc 
attgctgttt ttagtaagta 2881 tgtcagccaa agtattcctt attattaaat aattcgcaat atgctgaaat ttgatttaga 2941 aaaaaatagt gatttaagtt ataaagtttt gtgtttaggg gcgcattgcg atgatataga 3001 aattggttgt ggaggtacga tattaagatt gatagaaaat tacccaaatc ttacatttta 3061 ttgggttgtt tttagttcca atgaacaaag agaaaaggaa gcttataaca gtgcaaataa 3121 gtttttagaa aaaattccag aaaagaagat cttaatacaa caatttcaag atggtttttt 3181 gccctacctg ggaagtgaag ttaaacagtt ttttgagcaa ttaaagcgag attataaccc 3241 cgatctcatt tttactcatt accgccatga tttgcaccaa gatcatcgct tgatatctga 3301 ttttacttgg aacacattta gaaatcatct cattctagaa tatgaaatac ctaagtacga 3361 tggagattta ggaaatccta atttctttgt tcatttaagc caagaaaatt accaaaacaa 3421 agtcaagtat attcttgaca gctttccatc acaaaatagc aaacaatggt ttacagaaga 3481 aatattttta tccattctaa gattgcgagg aatggaatct aatgcaccca gtaagtatgc 3541 tgaaggtttt tattgtcgca aagtcgtttt ttagcttgct tactctactg acactgttat 3601 ctcactgaga acttgcatta ttctccataa gaggcggagc ctctgtattc tcgttaccag 3661 gctcagcctg gtaacgagtg ttgggaggtt ctacctcccc agtactaaaa gccaaataac 3721 aataatgtcc aatgtaacat taattatcct tttctgattg gtctttttac cgtatatgca 3781 agcaagctga atgacattta gcaactaaca atttttagaa aaggtgaaaa acatgaatat 3841 aactgatctt aagcagaagg ttaacccaaa tgaagttagc caccaaatgt accagttgat 3901 ctccgaatta tatcctattt gccgtagtat aactggcaat ggttttcgag aaacgctaca 3961 ttggatttca aaacatattg aattaactaa gcatgaagtt cccagtggta ctcaagtgtt 4021 tgattggaca gtacccagag agtggaatat taaagatgcg tatattaaga attctcaagg 4081 agaaagaatt atagacttta ataaatcaaa cttgcatgtt gttaactaca gcgtacctat 4141 tcaccaaaag atacttttag aggaactaaa gaaacatcta tttaccctac ccgaacaccc 4201 tgattggatt ccttatcgga cttcatatta taaggaaagc tggggctttt gtctcagtca 4261 caatcaactc ttggaattag aggatgaaga gtatgaagtg tgcattgatt cttctctaga 4321 gaatggtcat ctgacatatg gcgagtatta cctcaaagga gacaaaccag acgaagtgct 4381 gatatcatgc catgcttgtc atccatcact tggtaatgat aacctctctg gaattgctct 4441 ttccacattc ctggcaaaat atctcactca aataaacctt tcatactctt atcgatttat 4501 ctttattcct ggaactatcg gttccatcac ttggctttct ttaaatgaaa 
gtcaggttca 4561 caagattaaa cacggtttag ttttaacttg tgttggtgat tcaggtaagt ctacttataa 4621 aaaaagtcgt cgaggtgatg ctgaaattga taaagctgtg actcatgtcc tcaagcattc 4681 tcatcaagat tatgacatca tagatttctt tccttatggc tacgacgaac ggcagttttg 4741 ttcaccagca tttaacttac ctgttggctg tttcatgagg acaccgcaca gttgttatcc 4801 tcaatatcat acttcagcag acaatttgga tttcgtacaa ccccagtccc tcgccgattc 4861 attctccaag tgcttgtcaa ctctacatat tctagaaaac aacaaaaaat acttaaacca 4921 aaatccaaaa tgtgaaccgc agttgggtaa aaggggttta tacagcgcca tcggtggaca 4981 gacggatacg aaaatgactg aactcgcaat gttgtgggtt ctgaatttgt ctgatggcga 5041 tcatacctta ctggatattg cagatagagc cggtatgagt tttgatttta tcaataaggc 5101 agcaaatatg ctcctggaac atgatttagt gaagtacccc cctgcatagg cagacggggc 5161 ttcctgtctc acaggcagat gccaccctag tgtcagtctc gtagtcgttt ggttcagaaa 5221 atctctacgt ttatcagcta tcggcttgtg ataccgcgct agtttggcgt agtacttttt 5281 ggcattgctt gaagcctttt cccccttgtc cattccttca ccaacaggtt ttgaccacac 5341 ccgttaaaat tgcctggcgg ttcatctacc tgcgccttaa gccaagttgg tcccccatta 5401 gcttgtccac cattcgcttg ggaagcgcca tcggaagagt ccagtttaac aggcgttgcg 5461 gcacgactgc atatcggaca cgaggatttg ggacagtcag ggctttccaa acagcctcgc 5521 caatccgctc tggcgaataa ccattccgcc ctttgggaat cgcgttctca cgagcctnnn 5581 nnnnnnnaat tatgacatcg atcccgtaaa gcataagctc acggcgcaag ctctcagaaa 5641 aaccttccaa accatgtttg gaagcggaat aggagccaag aaatggaagg cctatcttac 5701 cggtgacaga actcatatta ataattcgac cgggagcacc ttttagtgag cgatccgccc 5761 ccagccaagg caaaaaggcc tgcgtgacaa tgaaaggacc gactaggttg acttcgagtt 5821 gaaatcgata ctcatcaacc gactggtaca tgagcagtcc agacatggca atacctgcat 5881 tgttcaccag cccaaacaga gtttgaccat tgagacactc acgcacctgg tttgcagctt 5941 tgcggacgct accttcgtct gttacgtcga acaggagtgg tgtaaacgca ttaccgaact 6001 cagcggataa tctttcagcg tcgtttgctt tccgtatgct gccaaagacg tgaacgcctt 6061 tgtttataag aactttcgct gtaccccatc cgatacctga cgatacacca gtaacaacaa 6121 cgcttttcat aatgtcctat tattaaattt aagaacccct gattcaagaa tttaagtact 6181 tacctttcaa cgtttcgcat aaacatagcc gacactatcg caaccgcaga aactaactgc 
6241 gtcaggggaa tcgccattaa tattttcgtg atcgcagtag cagaaaactg tatttgagac 6301 tacaagcttc acgtgggatt atcaaagaat taacctggat gtgcgtctgt gcttgattcg 6361 gaaccatctg agtggcaatt tcccaggctg gatcttctcg attatcaaca ataacccgta 6421 actcgtactg cggataattt tggttcaaca gcgctcgcaa acagtcgagc aaaaatggat 6481 cagaatccgt atttttcaag tccggctgat acgttcagaa gaatacagag gttatatcgc 6541 atcgaaaaaa cgctatttct acggagtcag agtccagtta atcagtacca aaagcggtat 6601 tcctgtagaa tttgcctttt taccaggtag tgcaaatgat gtgcgtggat taaatgcatt 6661 accattcaac ttaccgcttg gtagtgaagt atacgcgcta tgcagcatat actgattata 6721 ccgtagagga tgatatggaa caatcaagcc aaatatctct cagagtcatg cgcaaaaaga 6781 attctaagcg tcaagacccc cagtggaaaa aatacatcaa gcaatgtacg cgacattata 6841 ttgaaaccgt ttttagtagt attacttgtg tttttccgaa atcaatacat gcagtcactt 6901 atgaggggtt tttacttaag ttacaagcat ttatttttgc ttttactata cagcaagctt 6961 ttattgaatg agtaatttta aaaacttaaa agaatttaag tcacttaata cttaatcaag 7021 cctgaggcgc ggttgcaggt agcgtaaccg ccactttgtt gtaggcgtta agcgaacgct 7081 catgctgagc gcctctggcg cttagctcta ccgtttgcgc agcgcccctt ttaggggcta 7141 ggtaatcgct cccattgtga tgtatctcaa cttgataacc cgcaacttgc gttagtaggt 7201 cgaggaaggt accctggttt tcttcactag tacgacgttg attaataatg ttataaacaa 7261 tttgatctaa ctgcttaact gctttgcgaa agcgcagatg actaaaagtg gtcagccaat 7321 taggcaccaa aaacagagca tttctgcggt ttaccaaacg attcatggtc gcatctagag 7381 caacagtaaa aacctctaac agccagttga gggtacggaa gatgggttaa aagagaaacc 7441 cgctagtctt tgagtgccag tgttgtgtac actaaagcgt ggaagctgat gattcttgat 7501 agatgtcgat aaagcgtgtt ggtgttattg gtggcgggca attagcatgg atgatggcgg 7561 gtgctgcaaa gaagttagga gttgaattgg ttgtgcaaac tcctagcccc aacgatccag 7621 ctgtacccat atcagctaag gataacgttt tcgctgcaat tgatgacgct aacgccacgg 7681 ctgaattagc aaacagaagc gatgtcatca cctttgagaa cgagtttgtt gatattgaag 7741 ctttatcaaa gttggcttca caaggcgttt gttttcgtcc tagactggag gctttgactc 7801 cccttttaga taaatatcac caacgctgct atttacgcga tttgggttta cctgttcctc 7861 ggtttgtcgc catagaacaa gattggaaga caactgctga agttattttc gccaacctaa 7921 
ttggttttcc tgctgttttg aaagcccgtc gccacggtta tgatggtcaa gggactttta 7981 tcatcaggga aatagaaagt ttaaagcaaa aattagaagt ctcttgcaca aaaggggttg 8041 gaagtcaatc taatagtttt ttattagaag aatttattcc gtttgaacga gaactagctg 8101 tgattgctgc acgttctgtg agtggcgacg tttgtacctt cccagtcgta gagactcagc 8161 aggaagaaca agtgtgtcgg cgagtcatcg cacccgcttg tgtttcatcg caggtggctg 8221 ttgaaataga gaaaattgct agcactttac taaatagtct agaagcagtg ggagtttttg 8281 gtatagagtt atttttgacg gctgagggca aagtgctggt taatgaaatc gcacctcgca 8341 cccacaattc cgggcatttc tcgattgatg cttgcgaaac ttctcaattt gagcagcatt 8401 tacgggcagt ttgtggtttg cagttgggta acactgccat gatttgccct agtgctgtta 8461 tggttaacct cttggggtat gaaatttccc aaagtgatta tacaataaag cgccagcaac 8521 tagagcagat tcctcaagcc cacgtccact ggtacggtaa aactgaatct cgtcctgggc 8581 gcaaactggg acacgtcacg gttttgctag atactcaaag tcaggatcag gcgattgcga 8641 taagcctccg gcttgacgct ttgcgtatcg cccaaaatat agaatctatc tggtatcctc 8701 acaaaacctg ctagaaattt tcaacgcaaa aaacacaaaa tctttttgaa agctataata 8761 ggtttgttgc actgctacgt tggtgactgc cataaagttt tttgatctat gccttaagaa 8821 taagggagcc ttacaactct cgtggcagta gaccgggcaa agtacccgga aactttaaac 8881 gagtagcatc agttcccacc tggtttatcc aggtcataaa cttaggtaaa acggcgtcgc 8941 ggtgtaccaa aatatctagc tcccttgctc agttaagaca tgggagcttt gatttttgaa 9001 ttgaattagc aattcattaa aacttaaaaa taagagagat aatttctaaa aaaacttcat 9061 atatatttgt gcagatctac acactaatac tcctattcta ggcgtcataa ttgatcaaat 9121 aaagttctca tcatagtaaa aaaatctgtg taaagcgatc ctatataaag gtaccaataa 9181 gagttagaat cgcagaacta agcgatcatt gtggttataa ctgtaaagaa ttgttaaata 9241 aactgataaa gaccgtgttt ccagcgtctg ttttcaaact agacaatggt ttaaccttaa 9301 ttcatcaaga aattgccaca acccctgttg ttgtggcaga tatttgggta cgtgcaggtg 9361 ctaactttga gccagaaccg tggtttggta tggctcactt tctggaacac atgattttta 9421 aaggtactgc aagggtaccg ccaggggtat ttgatcaaaa gattgaaaat cagggtgggc 9481 taaccaatgc cgcaacaagc tacgactacg ctcattattc tctcacgaca gcttccctac 9541 atttagaaga gactttgcct tacatgggag agttactact aaatgcagca ataccagaag 9601 atgaatttac 
ccgtgaacga gatgttgtgc tagaggaaat tcgtcaagca catgatgatc 9661 cagattggat cggatttcac gctcttatct ccagtgttta ccaacatcat ccttacggac 9721 gttccgtact gggtagtgag caagaattaa tgaagcaatc accagttcag atgcgctgtt 9781 ttcaccgcgc ttactaccaa ccagaaaaca tgacagtggt gattgtcggc tcaatcgctg 9841 cacaatctgc tttaaaattg gtgaaccaga catttgttaa ttttgtcgaa cgttgctgtg 9901 actgtcctca aaaaaaggag gtggcgaagc cggttatagc tggaattcgt cgtcaggaac 9961 tttctttgcc tcgcctagag gtggcgcggt tgctaatggc ttggattgga cctggagtgg 10021 aacagctaga gacttgttgt ggattagatt tactttctgt tttactggcg caaggacgga 10081 cttctcgtct cgtgcgtgat ttacgagaag agcagcaatt ggtacaaggt atatgtagtc 10141 atttttcctt acaagaagac tccagtttat ttacaatcac agcctggtta gaaccaaaag 10201 atgtggaacg tgtcgaatcc ttaattcgtc tacatttgca ggatttaatt gataatggaa 10261 taagcgaatc agaaatactt cgcgctcaaa ggcttctgtg taatgacttt gcgttttcta 10321 ccgaaacacc aaatcagctt gcaggacttt atggatacta caacacgatt gcccaagctg 10381 aattagcagt ggcttatccg tggcaaattc aatcgtatga tgccaaacaa ctccaaaaac 10441 tcgcacaaga gtatctttca cccaatcatt acgcggttac agtacttaaa ccctcttagt 10501 gactcgtcat tagcatcaag aactaataga cctcttgcaa aagtgagatt ttacgaggtt 10561 ctgttaactc agatcttgca ccgtaattct cgtccgccaa gaaataaatt tcttggctca 10621 aagtttaagt ccgttaaaac ggactaagta agtttttgag tccgttttaa cggactttgg 10681 ctatgagcct tgaacttaag ttcaaggcgt actaaagtca aggtgcaaga tctgagttaa 10741 gagttaagcg ttcagaatta agcgttaagc gttccctaca actgccacga agtctaatga 10801 ctaacgacca atgactaatg acaaaggact aatgacaaca tcacagcaaa aatcccatat 10861 tcatcgcaca gtattgaaca atggcattgt agtactggtg gtagaaaatc cggctgcaga 10921 tattattgca gcacgaattt ttgtccgcgc tggtagctgt tatgagaacc aagagcaagc 10981 tgggttagca tatttgctat caacggtact tacaaaaggt tgtaatggac tttcgagttt 11041 ggaaattgcc gaacaagtgg agtcgctggg agcaagtttg ggtgcagaca ttgcatctga 11101 ttattttttg ctttcgttga agacagtcac gtcagatttt gcacaaatgt ttacgttggc 11161 gggacgaatt ttgcgatcgc caacatttcc agaagcagag gtagaattag aacgacgtgt 11221 tgccctacaa gatattcgct cacaaaaaga acagccgttt accattgcct ttgatcaact 11281 
ccgggatgcc atgtatgaaa atcatcctta tgcaaggtct gcattaggga atgaagccac 11341 catgagtcgc ttaactcgaa gagacttagt ggagtatcat cagactcatt tccgtccaga 11401 aaatatcgtt attagtatag ctggacgcat cacaccagaa aatgcgatcg ctcttgtgca 11461 acaagttttt ggtgattggc aagcatcccc cattcagccg ttgcaaaaac tagatttacc 11521 agaactcaat gttgaacctc aaatcaaggt tacaccccaa caaacccaac aatctattgt 11581 tatgctgggt tatttgggag catcggtgct gtcaaatgat tacgcctcat taaagttact 11641 atctacttac ttgggaaatg gtctttccag tcgcttgttt gtggaactgc gcgaaaaacg 11701 tggcttagct tacgaggtat cagcatttta cccaacacga ctctcaagag catcatttgt 11761 cgtttacatg ggtacagcac cagaaaatac ccaagttgct ctttctagtt tgcgtaagga 11821 agttgattta ctcttcacca cagaactaga ggaagacgca ctccaagcag cgaaaaataa 11881 gatactagga caatacgcct tgggcaaaca aactaatgca caaattgctc agatgtatgg 11941 gtggtacgag gttttgggac tgggaattga ttttgataca gattttcagg aggcgatcgc 12001 ctctttacgt gctactgata caatggcagc tgctcaccag tatttacggg aaccttatat 12061 ctcacttgta ggtcaagaac aagcggttca tggtgcaatt ttaaattata attcctgaaa 12121 agagttataa atactgctac tctatagttg cattatacac cgattagcat gagataaccc 12181 aggggtaatt tccatgataa gcagtataaa aacaaatccg ttaaatgact cacagtttct 12241 caacccaacc gtaaattcta gtggaagaaa aacagagcca aggtatcaat cattacaaat 12301 gcttcagata ctgctatctt gcggagtatc tctgcttttt ggtttatcta cagtatcgat 12361 tgcatctcaa ccacaacaaa tagcacagac aacttcctct gaaggcatca accgtcctac 12421 tcttcaagtt ggtagcaaag gagagcgcgt cacggaactt caagcagctt tgaaactttt 12481 aggcttctac acaggcacag tagatgggga atataacgaa agtaccgtcc tcgctgtttc 12541 ccgctttcag gaagccgctg gcttgaaagc agatggcatt gttgatacaa taacttggca 12601 acgactcttc cctggtgaaa cgacagtagc atcatctgga tcatcaccga attctacaag 12661 cagacctcct cttgcgtctg gaacttctaa caccaatcaa gttgttgttc ctagttcaaa 12721 ttcaaccgct tcaacatcaa caacgaccaa cacaccatct gcaacctcta aaccagagcc 12781 acgagctgta accggcaata caactgcaac cagcacaacc agagaagata attctaaacc 12841 agagccacgc tctgtgacgc gtaataccac tgcaacgact agaaccagag aagataattc 12901 caaaacagag ccacgctctg tgacgcgcaa taccactgca acgactagaa 
ccagagaaga 12961 tgattccaaa acagaaccac gatctgtgac gcgcaatacc actgcaaatt cacagagaac 13021 atacgtacga caatcaacat ctactcgctc tggctcaact cgttctgagc aaagcattcg 13081 tactcagcaa agcgatcgct ctggttcaac tcgttctgag caaagcattc gtactcagca 13141 atcctctcgt cctagctcaa cgactcgtac tcaacaagtt gcttcagttc aatatacctc 13201 agaaggatta gccattttac gtataggaat gcgtggtcct gaggttgtga ggttgcaaag 13261 acgactgcaa agactcggtt tcttgaatga ggacgaagtt gatggcgatt ttggcgcatc 13321 aaccgaggct gcagtcatag ctttacaaaa gcgctatggt ttggatgctg acggtgtggc 13381 tggtggggga acttgggaga ttcttatgag acggcgagga aggtgataac aattcaaaat 13441 tcaaattaca gaggaagata ctgaacctgt ccttccaatt ctccgtccag actacactcc 13501 acgatgatga gaatatcctc tatcacttgt ccaattggcg catctacact aacctcaata 13561 actcctggca ttggctgccc ctgtgtgagt cgttcataag cataacgtgt aatcgtagcg 13621 acatcgtggg tgagcaacac tcggttttct tgggctgccc atgccaatat gattgggtca 13681 tcctggcctg ataaaccaat atcttggact cgcacaatat caatattcag gttacgtcgc 13741 agtaagcctc taacgatagt attgtcgaag ttttcatcag caagcaccct caacatgctt 13801 cctgctcagc cttacgagca agcaatctat cacgtaaccc ttgcggatca aacctttgct 13861 ggttcatctt gcgaacttca tgcgtttgct gctgtcgctg ctgtaggtat atctcaactt 13921 cttgctgatg attgaggtaa aaggcaattg tggcgtatac atcagccagg ttcagcgatg 13981 gataacggta aacaatttct tctgctgttg ctccttgttt gaaaacagca acgaccgtat 14041 ccagtgttac acgagttttg cctacccgaa ctataccatc agcgttagtt tctaatggcg 14101 caggctcagc tatgattgac agtgccataa aagttgtact aagatttcta ctctgatagt 14161 aacaaacgtc cgtctgcttc accgtcgttg ttatgccgca accctaattg ccttgaggag 14221 aactgctacc actcaagatg ccgaatgtga gattataact gtgttgttgc aacacctcaa 14281 aaagtgagac aatcatggca atctatctgg actcagcaat tgtatctgaa gctgaaattg 14341 ccagtcgaat gggatgggta aaaggcatca cgacgaatcc aacgctttta gcaaaaagtg 14401 ataatccacc cgaaaccaca ctgaaaaaat taacgcagtt gacttcagga ccagtatttt 14461 accagctcat gtcgtctgat tttgagcgga tgctgacgga agggaggaaa gcttttgaga 14521 ttattggtca gcaaacggtg ttaaagattc cagcaacacc aattggattt gctgtcgtag 14581 cgagtctctc gccagagata acttgttcag 
taacagccat ttacagtgca gcacaggcag 14641 cagtggcacg ggaggcgggt gctagaatgg cgatcgccta tgtcaatcgt gctacccgtt 14701 tattaggcga tggaattgcc ttagtacgag atatggctag catactcaat ggcagcaata 14761 cagaaatttt agcagctagc attaaatcac cagaagaagc cgcagcatca ctacaagctg 14821 gtgctcatca tctgacttta cctttgtcaa tgttacaggc tatggcgact catgaatttt 14881 cagacaaaac agtagaagag tttgccaaaa acggcattgg tttaacgatt taatccgttg 14941 tagaggatgt tcaaacaggg aacgcttaac agggaatagg gaacagagga agcgatttct 15001 tgaccccctg ataactggta actgataact caatacgctt gctgttaagg attgatcctc 15061 cctagccctc cttaaaaagg agggaactga gcaagctttt taactctgtt gggggatctt 15121 aacaaaacta aacaccacta actgaaccgt attgactgat aactgataac tgataaaaaa 15181 gcctggtagc tatcatgcaa ccaggcgata tttaagtcac atagtaatga ggaagaagat 15241 tatactgcgc tgtagaaatt tctaacaatt tagcacaaca atttagtggg cagcttattc 15301 gagattaact ttgacaactt tgtttttcgc ttcttccttt ttaggcaatg tcagattcaa 15361 aataccgttt gtatattctg cggtgacttt ggtgttttga atatgagtag gtaaaggaat 15421 gacgcgttga aattttccat actggaattc actgcgcgta tagcctttcc cttcagtttt 15481 ggtttcagac ttacgctcac cgctgatagc aacggcattt tctgtcactt gaatatctat 15541 atctttggct tctaagcctg gaacttctaa cttcagatgg atcgcatcat ctgcttcagt 15601 cagttcagct gcaggaaatt ttgtgaagcc ttcccaattt ctagccggcg ctagagtttc 15661 atcaaatagg cggttgagtt gatcgctgag agcattaaat tcttgtctgg ggttccaacg 15721 acttaacata tatagttctc cctcttcatc tttgaaaatc attttttcca tcaagtcagg 15781 aatcaaatcc ttcctttgat tcccatatta ttgcaagata aaaatactgt cggttcggtt 15841 cttatcacca aagaaatctt aataaccgac cacaaaatta gacataatat aagtcgttca 15901 acaaagggta ttgcccactc cactgccaca aggtaaacta aatcctatgt ttccaacttt 15961 cttaccaact acagttggcg aactcacaga accctcctcg attactcttg ctcagagtat 16021 tgagcagatt gctctgacga ctcccttgag tacacaacca atcactacca cctatgtgcg 16081 tcaagggagt gggggtacac ctttgctgtt gattcatggc tttgatggtt ctgtattgga 16141 atttcgtcgt cttgttcctc tacttgcgtt acaaaaccag acatgggcag tcgatttact 16201 cggttttgga tttacagata gaccagtcgg agttaagttt agccccgtca gcattaaaac 16261 ccatctctat 
tatttctgga aaaccctaat taacaaaccc atcattttag taggcgcttc 16321 gatggggggt gcagcagcga ttgatttcac cctaacttac ccagaagtcg tccaaaagct 16381 ggtattgatt gatagtgctg gtttaacagg tggttcccca ttaagcaagt tgatgattcc 16441 accgttagat tattgggcaa ctcaattttt gcgtagtccc aaagttcggg caagtatttc 16501 ccgcactgct tataaaaaca gggagttagc atcacttgat gcccaattat gtgcagcact 16561 acacctagaa tgctccgatt ggcacaaagc cttgatagct tttaccaaaa gtggtggtta 16621 cagtgctttt agatttaaga aactagcaga aattgtacaa ccaacgctta ttttgtgggg 16681 tgattctgac aggattttag gaattagaga tgccaacagg tttaagctgg cgattccaaa 16741 tagtaaactc atttggatta aagattgtgg tcatgttcct cacctggaac agccgcaaat 16801 cactgctcag catattttag attttcggga tgaccacttg cataagttgg tctaacgcgt 16861 agactcccca taacataaaa ttaatgctct ataccgttga caaagtgagt tgcacatcta 16921 atgcctgtca gagaccgcta tcaccagagt gtcaaaaatg ccctcattaa agatggttgg 16981 acaattaccg acgatccgtt gcatctgaag tggggcaaac gagatatgta cgttgacttg 17041 ggagcagaga aactgatcgc cgcccagaaa caaggacggt gtatagcggt tgaaattaaa 17101 acgtttcgca gtgtatcgga catgaccgat ctcgaacaaa cgctgggaca ataccttgca 17161 taccgttctg ttatgactag aactgaccca aatcgctctt tgtatcttgc cgttcatgat 17221 gaagtgtatg ctgatctctt tgacgaaccc attgctaaac ttttggtaga ggattacaaa 17281 gtagacattg ttgtgttcaa accagaacag gaggttattc tcaaatggat accctggacc 17341 aatacaggca gttgattcgg catattttaa tcgaacacac aaaaattccc ttcagctacg 17401 gtgagattca atttgagaca gtctttgaca gtgaacagga tcgatacctc ttaatgatct 17461 tgggaagaga accagcctat gatttttctc caactgtcac gcgccgggtt catggttgtc 17521 tgattcatat tgacattatt gatggcaaaa tctggattca gcgggatggt acagaagagg 17581 gtgtagcaac agaactagtc agggcaggca tacctaagga tcagattgtg ttaggcttcc 17641 gttctcagga actgaggcag gattcaggat ttgcagttgc gtaactgcgg gattttcatt 17701 tgcatagtta ggacttaggc accatctgct acaaacccgg tttcttcgtt cgtatttggt 17761 tgcgaaacag acgatttttg atagaaaccg ggtttgtttg cgtaatacca atttgaaaaa 17821 agaatgcgac agatacacac tcccaaaccc ttgttatctt aggctttctt cattttgata 17881 gcgcagcgtg ccgcaggcat actttgaatg agaccgccac gctccgctcg cggcgctacg 
17941 cgattttgaa ttggtataag tcctgatagt ctgaagttcc agtcagcccc gattttcggc 18001 aatattttaa gtactattac ttatcgacaa ataaacctta acctgtaagc gatgtactca 18061 cttagtatat tctgggaagc tgtcttggga aactttccgg aaaagctctg ttatctcgtg 18121 ttatttaatg cataatttct cttaaaagca agtagtgatg gcaaccaaac taatcgagtt 18181 agaggatggc actttggtag aggtagaagt gccagaagat caagcaaaac agatttcagg 18241 tagtgctgca gatcgggtta gtacaacttt taacaaaatt aagccgattt tggtgaatac 18301 gtgtcgtccg attgcagatg cttggcagga actcaatcaa gagatgcaga ttgagcaggc 18361 agagattgaa atcggcttca gttttgaggg agaaggtaat gtgtacgtga ccaaggctaa 18421 agctggttct aatttaaagg tcaaactggt acttaaaccc aaggcttaag ttaaatgagt 18481 tcaatctttc aagactcagt gattttaatc actagtagcg accccaacct gcaaaagagg 18541 agagtctttg gcacggggtt tgttatccac cacacggatg aagccagcta cctgctaact 18601 tgcgctcatg tggtgcggga tgtaggtggg gaagcgttgg tgctggcaga tagcatacct 18661 gcaacggtag tggcaaacgg ggaatcaaag gggtttgact tggcagtgct gcgtgtccag 18721 agactgtggt gccctgcatt gcgcttgtct gtgtctagtg ccgcaaagca ctttgcgatt 18781 gctggttttt atgctttcga ccagaaagag actcggttgc ttagagagat tcaaggctat 18841 ttgggcaagc aaagttttat tccctctacg gatggtcgcg atacgcacaa gctaaagcca 18901 gaggcttatc gcattaaagc ttgggatttg catattgagg gcgaagatat cttacaacca 18961 gggtacagtg gctcacccgt ggtggataga actagcgggg aagtgttggc aatggtcagc 19021 caccaagttg gtaaagggga aaagggtcta gcgatcgcta ttgaagctat tcaaaacatt 19081 tggcactcaa tgccatatga cttattaaag accgacaatg taatttttga accgggagta 19141 gattacacgc ggctgcgcga gtttttagca gcagggaaat ggaaacaagc tgatcaagaa 19201 acggaaatgc tcatcctgga agttgctggt cgaaggggaa atttgcttaa tgttgaatcg 19261 atcaaaaaat tgccatgctt agacttacgc accattgacc agctttgggt gaaattttct 19321 caaggacatt ttggcttcag cgtgcagaag caaatctggg agcgttaggg aaaatgagaa 19381 tcaccagaga aaaaatccaa tggtttttat ctctttaatt accgggctaa tcattgctac 19441 tgcaataggt gcatcagtaa ttactgctgt tattcttaaa cagagtagta caagtgaaaa 19501 caattctaca gcctcaccta cgcgtaccgt tgctcaatta cctactactt cacccacgcc 19561 aacggttgcc gcatccccta cagcttcacc cacgccaacg 
gttgcggaaa ctcctacagc 19621 ttcacccaca cctatcgtta ccgaatcccc taccgcttcg ccatctggga ctgctgtttc 19681 tcttctagat acggaatgtc ttagttctac tcctggcgtt gaacaatctt tagtaaagcc 19741 gcgtgaacca aaaaacattg ctataggagg ggaagctttg cctgaaatcg cttatttgtt 19801 tagcgaaagt agtgagcctt atccttatat ttccaagagt aaagctgctg gagtgtcatg 19861 cgttcttaac tctaagtttc gacaattaaa tttagttgtt ggaataaatg gcaaacatcc 19921 tagtgctcgt caagatgaga agatagtatt tgacgtaagt gttgacaaca aattaatagc 19981 tacaaaagat ttgacaatcg cggcaaagca agttctaaac attaatgttg agaatgctcg 20041 cagcgttggc ataaaagcaa gttgtctaaa agataataat tattcttact gtccttatgt 20101 tgcgtttgta gaaatgagtc tacgctaatc taatatcagt tatcagtgaa cacttatcag 20161 ttatcagtta tcagttacca gttatcaggg ggtcaagaaa tcgctgactc tgttcactgt 20221 ttactgttca ctgtttactg ttcactgttt actgtgttgt gttaacgcac cctactgttg 20281 taatttttaa ttccttagtc cagcaaccac attggggagg aattgtcaac agattcttcg 20341 gtcttcttta tcggctgttt ttctggtagg tcgtatgatt ttctatagct ttttattgat 20401 tttgatgatt gagttttttc cttttcctgt tgtgatgatt gagtgttttg cttttgctgt 20461 tgtaatgcac tgattgtttc tgtcagttgt gaattagctt ctgctagttg tatagcagcc 20521 tgttttgcat catcaagctc tgttttcagc tttgctgcta cttttttgtg cttagataca 20581 tctaattgta aatcggtgat ttgctcctga agagaagctt ctttttgctg tgcttttcct 20641 aaagcttctt ttagttcttt gacaacagaa gcctcttttt tctgtgcctt gtccacagtt 20701 tctttcagtt cattaacaag ataagcttct ttttgctgtg ctttttctaa agtttctttc 20761 aattcattaa caagagaagc ttctttttgt tgtgcttttt ctaaagcttc cttcaactct 20821 ttgacagtta cttctaactc ggcttttgtt gctgttgtac gcctagacgt tgttatctca 20881 gctgcttgtg gttcctcctt tagttgagat gaaccatctt ctttctccgt atcattatct 20941 gaggaagcgg agggattatc atcagaaata gcaacaacat caatagttgt ttcaccttct 21001 ggtggtgaaa atttttgcgc ttcttgttgt aataagtcag aaagacgttt tttagtcata 21061 ttccctccaa tcacgcttta gttcttccgc tacttgacgg tagtctgacg ccgcctcctg 21121 tgcattcttt cctcgccatt gggtaattgg tacaccctca agcgcagctc gttcgtgcgc 21181 tttgtagatg cggatgaaag ctttacaagc aggaattccc atctccttaa gagagttttg 21241 agcttccagc acttccccta 
aactccgcgt atccacctta gtcagcaaaa cccgatgggg 21301 tttcccaaca ggattgacga cttgttggat tgtctcgata aggacagcta aatccattgg 21361 tgcacatggc gtaggcaaaa ccaaataatc agctacagca acgaccgccg ctaaagcttc 21421 agaccgcagc gccggaggcg tatctaccat tactaaatcg taacctgtta ttcctcgtaa 21481 ataacctaaa agtttggggt cggttgattg tgataaatca aatcccatac tattttcact 21541 gcgcccaaac caccaactgg cagaaccttg aatatctgcg tcgattaaaa gaaccttttt 21601 ttcttgtgca aaggttgcag ccaggttgac agcggtcgtt gttttaccga cgcctccctt 21661 accgttgagg atagcgatga tttttggcac tgtttgcatc cgacactctt acacagaata 21721 taactcttga aagagaaatg gggaatcggc gcataggcag aataatttca cacttgtgaa 21781 tcacacaaga gttcctgcag aaagtcacaa tagaagcaat caacgctcaa cgccacaagt 21841 gtttatgaga gcaactaaca atctgcctaa tggaccgaaa atgccagttt tcctgcgccg 21901 gatgaaattt atttttcagc cgttggaata tgtggaagat tttgccaaca aatacggtga 21961 taacttcact ctttggagtc gtaacgattc tcctgctatc ttcttcagtc atcctcaagc 22021 actgcaacaa atttttaatg cggattcgag ttccttaaat gctggaagtg gaaaccgagg 22081 tttacaattt ttgctaggtt ctaattccct gattctactg gacggcggtc gccaccaacg 22141 ccaacgtcaa ctcttgactc ctccttttca tggtgaccgg atgcggactt acgccgaaac 22201 tatccgcgaa atcacgcgcc aggtgagcga cgaatggaaa atgggtaagc cctttaatat 22261 ccgtgcgtcg atgcaagaaa ttactttgcg cgtcatttta cgggtggtgt ttggcttaga 22321 tgaaggacct catttggaaa aaattcgaca attgttgagt tcactgttgg actctatagg 22381 ttcccccctt ttgtccgctg gcttcttttt tcggtttcta caaaaagatt tcggtgcgtg 22441 gagtccttgg gggcgagtgc tacgcctgcg acaacaaata gatgaaatga tttatgcgct 22501 cattcgggaa cgtcgcgtcc aatctcctca aaatcggcaa gatatcctca gtttgatgat 22561 gtctgctcgt tatgacgatg gtcaaccgat gactgatgaa gagttacacg atgagttgat 22621 gacgctgatg gtagcgggac atgaaacgac tgcttcagcc ttgagttggg cgttttactg 22681 gattgattat ttaccagaag tgcgcgacaa gttgctgagg gaactggaca cccttggcga 22741 caagccagat ccaagtattg ttgccaaatt gccttacctg acagcagtct gccaagaaac 22801 cctgcggata tatccgattg cgatgaatgc ttttcctcgg attgtgcgat cgcctataga 22861 aattcaaggt tacacactcc cagaaggaac agtgatcatt cccaatatct acttagcaca 22921 
ccacagggaa gaaacttatc cacaatcaaa gcagtttaaa ccagaacgtt ttttagaaag 22981 acaattttcg ccttatgagt atttgccttt tggtggtgga aatcgtcgct gtattgggct 23041 ggcatttgct cagtatgaaa tgaaactcgt attggcgaca attttatcgc gctttcaagt 23101 ctccttggtc aataggcgtc ctgtgcgtcc tgtgcgtcgt ggtcttaccg tagcaccacc 23161 agcaggaatg cagatggttg caatgccttt ggaaaaacgt gttaatactc cagcgcttgt 23221 ttaggtcatg agggtgggcg atgcccacca atattttcgc accaatcttc gcaatttgct 23281 atcttttaaa cagaatttat agtatctgaa gttttaagtg gtcaacgcaa tttgaccgtg 23341 aaccatattc gtcaactggc agagtttttt cacatttcgc cagctgcttt ttttgaggca 23401 taatgagaga aaaaatttta gaaaccatta ttgctgcatt acaaccagaa gattttgttt 23461 tagctttatg gcaaggtggt tctgctgctc acgggtatac agatgaatgg tctgacatcg 23521 atattgaagt gattgttgaa gataattacg ttcaacaaac ctttgatatt gttgaagctg 23581 cattgcaaat aatttctgaa attaacttta agtatagagt tcctgaacct acatggcatg 23641 gtcactcaca atgcttttat caacttgtgg gagtgagtcc ttttttggcg attgactttg 23701 ctgttatgaa gcgtagtagt cgtaacgatt ttttagagat ggaacgacat ggacaagcag 23761 tgattgcttt cgataaggct aatcttattg tttcaacaca tttaaatcac caagagcatt 23821 tttccaaaat gaaagcaagg tttgagcagt taaaaacgat gttttatttt tggcaaatat 23881 tcgtcaaaaa agaaatcaat cgtggacatt tggcgcaagc tattgtcaac tatcaatcct 23941 atacactgcg acatttagtt gatacaccta cgtatgtaga aagacaagca gatttacagc 24001 tttatcaagc tctgaagcag ggtgattttt gttacatctt caactcccgc caaatgggga 24061 aatcttcact tttagtgaga actaagcacc gtctccaaca agaagggttt atatcatgtc 24121 cgcataatcg cttacgatta agctccgatt tgaatccacc aatgaaaacc tgttaagcct 24181 cgaatcaaat cttgctgttg aagtaaagat tggcaacgat gaaataaaac ttcctctaaa 24241 tcatcaagtg ttttaaatga gcgatttgcg attggttcat caaccaaagt ccataatcta 24301 actgaagagt tgtaattctg gagaatggga tggtaaaagt gttaaatgta aacatagctg 24361 gaacttctaa atctttgcta atatgccatc cggcttggtc acggggtctt acacgtaaaa 24421 aatctggcag ccttctgttt gtggatctgt gggcgtagtc gggtactcgt tttattaagg 24481 ctgccagatt tttaggggac aatgcgtcaa cggctaaaag aacatgtttt ttagcaccta 24541 atttgaactc acgagcaaag tcagctaaaa cttgattaaa gagttcagta 
ttgacgtatg 24601 gcaaaatcca ccaataagtt tctcctgact taggatgtac aaatccatac aaccataacc 24661 attgaaatct ccaattaaca ttggcaattg gtgtttctcc ctcaggtacg taaactcgcc 24721 tgagaattgg ctttagccct aagcgatgtt catcttcgca ccacaattga atttcagcct 24781 ccggataagc acttttaagt tcttcaactt ctgttgctag ttttttttcc aggcttcttg 24841 ctctactaag tcgctttctg tgtgagatgg gcgaggtact cgcagctctt ttagtcatct 24901 gccgtaatat ttcccatcct ctttgtcggc taaccgttgt ccgtgtgact tcactcaacc 24961 aatctgccac tttacgacca ttccataatc ctccatctgg agcattgtct tgtaaaacct 25021 gccatagccg tgcttgctca acatcctcaa ttaacggctt tgctccttta ttttgacttc 25081 ggcgttaagc gtagctctgc cgtaggcaat cgcataaccc agaaatgccc aactcgttat 25141 acctttttac caatgcgtaa atccatgttc tgctatagcc cgttatttgc tctacttctt 25201 ctgttttctt gccttgcgct agtaaccata taatctgata ctgacgactt tctattcctt 25261 ccttcgcatg tttataaagt tgctctaatt cacagatatt taaatgctcg actatactga 25321 ttcgttttgg cataactcat atttttactc cttagcttta tcgtaagcga ttatgcggac 25381 atgatatgaa ttcttactga atatttaacc acgaatttcg tggatttcct agatatatca 25441 aaatagctga tttatttttt ttgaatgaaa ttacgtccag tcaatcggaa tttattgaaa 25501 taatattaac tttgtatatt gacaaaaata ttgtttttta aaaacaatta aatagcgatt 25561 tctgattttg cagataggtg agagaatttc ctgttttagt ctgaaccaat aaactggtga 25621 attcgataat ttcttctatg ctgctcatca ttacagttcg caacttttca gacatccttt 25681 cagcgcttaa ttcccttaaa ccgttctttt gcgacctgga atgcatcttg catcacagat 25741 tgaatttttt gaggatctgg tttctttcca cccattaaat caccaaagta aacgcaagca 25801 gcttctccaa gtgaccaagt gtaagcagca gcccaagaag cagctatcac actgccaaaa 25861 cctggtagga atttgactaa ctcacgtgcg atcgctcgtg ctaagaaacc actggcgatc 25921 gcactcacaa ccccccccgc ttgtgatggc gtcagcgtct gtccatacaa tttccccagt 25981 agccccacca tcgagacttg caaggctgtg agtgctggca tcgttgcaaa gggtagcggc 26041 acagctgcga tggtagctga tatagtcgca aataccaaaa catagcgacg tccaacatcc 26101 cggtaaagat taccaagttg tttgccagtc ccttcatcca acaactgata tatcgttcgg 26161 gcttctgctt ctggtagtaa ctcggctatt gcatctctca acgcctctaa cccataaaac 26221 acaggggtgt acccatcttc ttccagggta 
aaatcaatta ggacggcgcg atcatacaat 26281 cctacaaaag cttgctgcat tgcagtgaat gctcggttaa cctcttcaaa atctggtgga 26341 tattcaggat gatcatctgt accaggagga taaacctcat gcaagcaagt caccgccagt 26401 aaacagggaa tatctggata cttctgacgc agttgctgag taatttgtcg caacgtatca 26461 gttgcaaaat cattgatctt aaccgtcaga atcaagattt tggcgcgacg actttcctgt 26521 tgtaaatctc caaccaattc ctgaacaatg acttcagtat cctggtttac atctcccagt 26581 ccgacagtat ctgtaaaaat aagtactggc aagtcagttg aagggtaagc ataacgctga 26641 gtgtgttgtg tgtgtggacg aaatccttga ccgacaattt ctgctgaaac tcccgtcagc 26701 cctcgcacaa tcgaactttt acctgcttgg ggcttaccaa tcaaaagggc ttcagtggtt 26761 ggcagttctg ctcgaattgt ctccaaaatc tctgcaacct gagcttcact aacgctaaac 26821 cacccaacaa ccgtctgtga tattttatct acgggtagca ctttcatcaa gcgtgctgtt 26881 gtctcgttcc aaacacctga aagctggttt gtcagtgaac tatcgaccga cttagttgtc 26941 acgtcagtcg attcacttga gtgttgagaa tcagcggatg gtaaatcagc gtcacgttgt 27001 tcagtcattt gcgtcaaaac ggcagcacgc cctgaataac tttttgtttg ccagtatcca 27061 gatttaggat aactcagatc ttgcactagg attctcgtcc gccaagaaat aaatttcttg 27121 gctcaaagtt caagtccgtt aaaacggac // LOCUS NODE_1089_length_26967_cov_5.41227026967 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 26967) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 26967) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..26967 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..603) /locus_tag="DP116_09415" CDS complement(<1..603) /locus_tag="DP116_09415" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09415" /translation="MKLDPNTINELAEILRPFVESQRERQSFLIAALGNDAPVLQHIS WDGSVATFIPHMLDKLANIGGREAFCKVLEHVRSQISVHEDVQQRIDELLNKLQVQDI QSAPTPLPSSSAEEKYREEVKRVVSDNKISDTSRRILNEFKQKLGISQQTAEKIEAEI LHPFYEYQKHLDEFLQILVEQIYNQLPFSAKTRSELREIQK" gene complement(796..1011) /locus_tag="DP116_09420" CDS complement(796..1011) /locus_tag="DP116_09420" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09420" /translation="MWWGFSYVLCTLEEAQGAGRKKKGEKITHVKSVSPQHPAKLLPS EASSGVSPHPERYNQTKPEKVLAFKDY" gene complement(1690..1989) /locus_tag="DP116_09425" CDS complement(1690..1989) /locus_tag="DP116_09425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006909701.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_09425" /translation="MATYKVTLINVAEGLNETIEVPEDEYILDIAEEKGLDLPYSCRA GACSTCAGKLTKGSVDQSDQSFLDDDQIAAGYVLTCIAYPTSDVTIETHQEEALY" gene complement(2442..2741) /locus_tag="DP116_09430" CDS complement(2442..2741) /locus_tag="DP116_09430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320150.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_09430" /translation="MATYKVTLINDAEGINKTIEVADDEYILELAEDAGLDLPYSCRA GACSDCAGLIKSGTVDQSDGSFLDDDQIDQGYVLTCIAKPTSDVTIETHKKDDIE" gene 3122..4126 /locus_tag="DP116_09435" CDS 3122..4126 /locus_tag="DP116_09435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gfo/Idh/MocA family oxidoreductase" /protein_id="PRJNA477356:DP116_09435" /translation="MYDNSAFSQKRPVRVGLVGTGYTAKLRAEALVHDERSHLVAVVG HTPQKTEAFARDYQIMVVDSWQQLVEQDNVDLVMICTVTRDHGTIARQALESGKHVVV EYPLSVDVAEAEELVELAKAQKKLLHVEHIELLGGLHQALKQHLPKVGQVFFVRYNTI KPEHPAPRKWTYNHELFGFPLIGALSRLHRLVDLFGTVMSVNCHNRFWEIETQYYQSC FCDTHLCFNSGLLAHVVYGKGETLWQAERRFEVHGEDGGLIFDGDKGLLVQHGEKKPI EVGARQGLLAKDTTMVLDHLIDGTPLYVTPEESLYTLKVADAARRSAEMGLTIIVEDE " gene 4201..4677 /locus_tag="DP116_09440" CDS 4201..4677 /locus_tag="DP116_09440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007727230.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_09440" /translation="MDVVLQEINQDNWEECIRLKTTEEQENFVASNLYSLAQSKFYPS CVPLGVYHNQTMVGFVMYETMSTGDQNSGYCICRLMIDKNHQRKGYGKAAMQAVINLL KDKPDCKKISTSYVPKNAVAEKLYYSLGFHATGLIEDGEIVLSLSVESRAGVTTQS" gene 4791..5564 /gene="map" /locus_tag="DP116_09445" CDS 4791..5564 /gene="map" /locus_tag="DP116_09445" /EC_number="3.4.11.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015184340.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I methionyl aminopeptidase" /protein_id="PRJNA477356:DP116_09445" /translation="MKSETIVLLSNRELDKMRKAGRLAAELLHHLAPLVKPGVSTLEL NDEAERWTQAHGAKSAPLGYKGFPKSICTSVNEVICHGIPNAKQILKEGDIINIDVTP LVDGYHGDTSKTFFVGTPSPKAKKLVEVTEECLRRGIAEVKPQARIGDIGAAIQEYAE AQGFSVVRDFVGHGVSNIFHTAPEIPHYGIRGKGKRLRPGMVFTIEPMINEGTWEVEV LGDGWTAVTRDRKLSAQFEHTLAVTEDGVEILTLRESRT" gene complement(5833..6174) /locus_tag="DP116_09450" CDS complement(5833..6174) /locus_tag="DP116_09450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012268344.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" /protein_id="PRJNA477356:DP116_09450" /translation="MAQIARGEVWLADLNPVRGHEQAGKRPCLVISVDLFNQGASGLV VVLPITSKEKGIPFHVELNPPEGGLKVQSFVKCEDVRSISVDRLEKRWGTVSPKTLTA VEDRLRILMGL" gene complement(6174..6419) /locus_tag="DP116_09455" CDS complement(6174..6419) /locus_tag="DP116_09455" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_923576.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="toxin-antitoxin system protein" /protein_id="PRJNA477356:DP116_09455" /translation="MPELTVSISQTTHETLLKLAQTSGETIQTVLDRAIENYRRHVFL VQANQAFAALRQNEALWQEEQAERQVWDQTMADGVQE" gene 6520..6609 /locus_tag="DP116_09460" /pseudo CDS 6520..6609 /locus_tag="DP116_09460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212871.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type I methionyl aminopeptidase" gene 6800..7390 /locus_tag="DP116_09465" CDS 6800..7390 /locus_tag="DP116_09465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008403975.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_09465" /translation="MSWTLTINTSRLILRPQQPDDYKSWYAGFAGRLPQQYKYDEGQI SLDGCDPNWFSTLCKRHQDQALSDYAYIFGVFSRQNNQHLGNVDLSTIQREEKQWANL GYSIHNQYWRQGFGKEAVKAALRVGFENLGYHRIEAAINLDNHLSIALAQSVGLQKEC IRRGFYYENEQWVDHLIYVALPSDLGLVEKPPVIVG" gene 7683..7901 /locus_tag="DP116_09470" CDS 7683..7901 /locus_tag="DP116_09470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015225457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09470" /translation="MNRHTRQVILYKDEDGYWVVECPSLKGCISQGNTKEEALSNIKE AIAGYVNALEEDGLPVPEDNFETFLVVV" gene 7898..8119 /locus_tag="DP116_09475" CDS 7898..8119 /locus_tag="DP116_09475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012593556.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09475" /translation="MSKLPSISGRECIKALEKIGFYQKRRESSHIILRRDEPFAQIVV PDHQELAKGTLRAIIRDVELSVEEFVSLL" gene 8195..8434 /locus_tag="DP116_09480" CDS 8195..8434 /locus_tag="DP116_09480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017299978.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09480" /translation="MPTLKLTNKQVVELIKQLPSEQQVEVFRFLLLQQWGEWEALSRY AVDKVRLIAQERGYNWDIMTEEEREVFVDKLMDED" gene 8434..8847 /locus_tag="DP116_09485" CDS 8434..8847 /locus_tag="DP116_09485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007308497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative toxin-antitoxin system toxin component, PIN family" /protein_id="PRJNA477356:DP116_09485" /translation="MRYIAVFDTNILISALLSTSGDPFRCLALAKIGQIESVTCQQIL DEFAQKLVLKFKFSQKMAQAAVEEIRSCSRLVEIGATLEAVPDDPDNDMVVECAIVGN ATHIVRGEKHLLTLAKYQEIKIVKATEFVALLSQS" gene 9069..9515 /locus_tag="DP116_09490" CDS 9069..9515 /locus_tag="DP116_09490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315197.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF29 domain-containing protein" /protein_id="PRJNA477356:DP116_09490" /translation="MNKAYLTDFNSWIDQTAQFLRERRWHEIDVEHLIQEVEDLGKTE RRGITSQLTRLLLHLLKWQYQPQRRSDSWLDSITDARTQIELAIEDSPSLKNYPTEQL EESYQRARHQAAKQTNIQISVFPKECPYPVEFVLDEDWLPEENRTE" gene complement(9531..10745) /locus_tag="DP116_09495" CDS complement(9531..10745) /locus_tag="DP116_09495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015154703.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_09495" /translation="MSSDIISTRQRLINAAMELFAAQGITETTTKAVAELAKVNEVTL FRQFGNKHGLVLAVISESPVFKELGESLKTQASQMTNVHEALKNYCEDRLKALEQVPE LVRSMVGEAGKYPVENRQALGRSLAQANNYVAEYLATMMEREQLHVQLSPQKLASLLN SMLLGYAVIEFTSEFHELWHDRDEFLETLVALFLKVATQSSNQVNNEFISTEKVVDLP ANLVHLILQRAKKSELRDYALMYVLFGAGLSAEEIVNLERSHQIQDSNQYLLQITQGA TRQVPVNQWIMGKRYGSYTRNPLTQWLKSRKDNHPALFLNDDGIPITELEIQQRWRIL TEGLLTPEGQQPVIEQAQQTWCVEMLMKGMNLEDMSIITGWTRTKLQPYARRAREKLA LEQAIRLDHKPQ" gene 10926..11774 /locus_tag="DP116_09500" CDS 10926..11774 /locus_tag="DP116_09500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863404.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-CoA desaturase" /protein_id="PRJNA477356:DP116_09500" /translation="MTTNSTAPVDKQPLTLSWTNVAFFGTIHALAMLAPWCFSWSALG VMIFLHWLFGSIGICLGYHRLLTHRSLSVPKWLEYAIATLGALAMQGGPMFWVAGHRL HHAHTEDVDKDPYSAKRGFWWSHMLWIFYPRPEFFDEQHYKKFAQELYRDPFYRWLNR YFLLLQIPVGVLLYVLGGWSFVIYGVFVRAVLLWHTTWFINSATHLRGYRRFQLKDNS RNLWWAALLTYGEGWHNNHHAHPNVAKAGYQWWEVDMTWWAIQTLQTLGLANKVVMPP IPKPEV" gene complement(11790..12353) /locus_tag="DP116_09505" CDS complement(11790..12353) /locus_tag="DP116_09505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745316.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="colanic acid biosynthesis acetyltransferase WcaF" /protein_id="PRJNA477356:DP116_09505" /translation="MTSEEPFVDLRKYNQSWFDRGRAGWYILLWWLIQAIAFPLTPHP LNNLRCALLRLFGARVGKGVIIRPTARFTYPWKITIGDYSWIGDDVVLYSLDDINIGE HCVISQKSYLCTGSHDITDPAFGLKTASITIGNGVWIAADCFVGLGVKIGANAVIGAR SNVFTEIPSEQVSWGTPCTPRYLRKRE" gene complement(12571..13530) /locus_tag="DP116_09510" CDS complement(12571..13530) /locus_tag="DP116_09510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015121313.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_09510" /translation="MSFKLPVSVLIPAKNEQANLPACLASVQTADEVFVIDSQSTDKS VEIVQSYGANVVQFYYNGSWPKKKNWSLENLPFRNQWVLIVDCDERITPDLWDEIAQA IQKPQYNGYYLNRRVFFLGRWIRHGGKYPDWNLRLFKHKKGRYENLETGDVPNTGDNE VHEHVILDGKAGYLKNDMIHEDFRDLYHWIERHNRYSNWEAHVYRNLLTGKDDTDTIG ANLFGNALQRKRFLKKLWVRLPFKPFLRFVLFYIIQRGFLDGRAGYIYARLLSQYEYQ IGIKLYELCQYDGRLNTTSTTSGEEPHPLVGVAENRPAAGIKN" gene complement(13546..14478) /locus_tag="DP116_09515" CDS complement(13546..14478) /locus_tag="DP116_09515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318510.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_09515" /translation="MPNTLISAIICTHNRDTYLGAAIDSLLAQDFGGEFEVVVVDNGS SDRTVEVVEQRSHNPRLKYIFEPVIGLSVARNTGARVASAEILAYLDDDAEASSDWLQ ILYSAYQDNPKLAIAGGKVSLLWPEGTQAPPWLSKGLSANLGAYDLGESIVYIDNPGQ TPRGLNYSIRRSFLETVGGFNTHLGRIGTKLLSNEELQMTELALQHQMQVAYLPNAKV AHNVSPERLKRSWFISRGWWQGVSECYREQLAGKAGFGQLQRGSERFLRGLYKTLQHF PDPAERFDKLVYAYGQIGYLKSAIQGLLSKSKKE" gene complement(14854..15627) /locus_tag="DP116_09520" CDS complement(14854..15627) /locus_tag="DP116_09520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09520" /translation="MKAGIKRIEATLHDLGTRNTAGSVEADASTKQSLSFRISIGEAE HSETHSEFCEEEPSRHLEQNLATYNTSVQTFPTQDSASKTPSLPKLKTPSFTSHRNGA NPAFAMNLLQDIQESVGSWQKELQKIVRQIQDLYLEGPIVNGWLESHEREVQPGGTAT LRHAEVDRLMDYVEEICAENTNVSCQSPRAGYRLCGLDASGKVWSRPCPADQVPSVSI AIARYQKLRQLLGRKQYLETRLTQLAETLVILHGHIQSQ" gene 15752..16306 /locus_tag="DP116_09525" CDS 15752..16306 /locus_tag="DP116_09525" /EC_number="2.7.1.156" /EC_number="2.7.7.62" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207205.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase" /protein_id="PRJNA477356:DP116_09525" /translation="MSKVVLVTGPARSGKSEWAEILAMESQKVVIYVATAVENSEDKE WQERILQHKQRRPQDWATLEVPYKLSATLANAKPNICLLVDSLGTWVANLLEQDELVW ENIVQEFLETVELVAADIIFVAEETGWGVVPAYPIGRTFRDRLGHLVRQLGGISQDVY LVTGGHVLNLSLLGSPLPKIRAFN" gene 16495..16926 /locus_tag="DP116_09530" CDS 16495..16926 /locus_tag="DP116_09530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09530" /translation="MASQQDVKKYLAYWFQLGKKVVINNGAATLLPQKVIAGDRYSDE FEECWQRIISPESGDCYIEGTQETIAELLTPAWDVSPCGRCDMPVPVKNVGMPPLVCP CFDLTTWPNTELPAPRSPVNNQEQLRAIRDRLLSVVKSNRE" gene 17623..18123 /locus_tag="DP116_09535" /pseudo CDS 17623..18123 /locus_tag="DP116_09535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320355.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="histidine kinase" gene complement(18303..19052) /locus_tag="DP116_09540" CDS complement(18303..19052) /locus_tag="DP116_09540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873908.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09540" /translation="MAALHDGEKFLVQRYAISFAGSLRSIATTSATTLINAKKVNRKN LRMLAMGLTKDAIVDGRMYEALTNVRQEINQVAAQIPGSKQLLDENFTYTSLQTELRL SVYPIIHIATHGEFSTVSEDTFLITGNNGKLTITDLYTLIRNTLHGSEAVELLALTAC DTAIGDDRIPLALAGSAVNAGVKSALASLWSINDAATVTLVTKFYAEWYENGVSKAEA LRRAQRALIASSKKYSHPYYWAPFILIGNWL" gene 19487..20461 /locus_tag="DP116_09545" CDS 19487..20461 /locus_tag="DP116_09545" /EC_number="3.1.26.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199776.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease Z" /protein_id="PRJNA477356:DP116_09545" /translation="MEITFLGTSSGVPTRSRNVSSVALRLPQRAQMWLFDCGEGTQHQ ILRSDLRVSQLSRIFITHMHGDHIFGLMGLLASCGLAGNVQRVDIYGPSGLNDYLQAA SRYSHTHFSYPVKVHAVRPGVIYEDDEFIVTCGFLHHRVTSFGYRVSEKDRPGRFDIE KAKVLEIPPGRIYGQLKRGEVVTLADGRVIDGKELCGPTEIGRKIAYCTDTVYCDGAV ELAQDADVLIHEATFAHQDAEMAFQRLHSTTTMAAQTALAAGVNRLIMTHFSPRYAPG NEIELKDLLEEARAIFPKTDMAYDFMTYEVPRRRDKELKKAEAIAEIS" gene 21339..23606 /locus_tag="DP116_09550" CDS 21339..23606 /locus_tag="DP116_09550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495711.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chain-length determining protein" /protein_id="PRJNA477356:DP116_09550" /translation="MSLEQRKIIITSPQSKVKRRKLSTILLRRRLQILGVSGVVISVA SVLALSAKPTYQSTIQILVNSKLNEGVRSSHTQQGAESNFTDPHLDIINDTTQQKFML NSNLIQKAVNLLRSEYPNLTVEHIKGKKGQQAPLVITPLEQQTADNKVMNQVFEVSFK DNDPVRAEKVLKSLQKVYQEENIEQRKQSVSKGLTFIKKRLSEVKNKMIQADKNLEQF RKKNHLLDPQVQGKILLESLADVTKQLRTTRAQLKDLQARHDNLKQQLSSLSQNTIVS FRLSQSTRYQTLLNEIQKTDLALTQQRLRYTDNYPEVQKLIQQRRTQAALLQEEIRRT LGDKDVQAISDTLFKKKPLDTEISSGTEKPEQTTREVPQADLKLVQDLIQMQTAILGL RANEKSLADSQAQIRAELTKYPSLITEYNRLLPEVETNRKTLEQLIAAQQSLGLMIAQ GGLNFQVLQQPQVENYLDSNKLFILLGGVLLAPILGIGTALMSECNDAISSPEELQRL TNLRLLGTVPELSQLSTKKRSFRRSLRRVMAHFSSQTISDEKHFNVYGLLPSHETLDM AYQNIQISKSFVHHKSVMVTSAMSGEGKSTLSLGLAVSAARMHQRVLLIDANLRQPNL HKILGLTNDWGLSLLLVEERNSSVKEYIQPIHPSIDVLTAGPTPEDTVKLLSSARMKE LLQFFEQTYDVVLIDTSPILGTVDATILASLCNGIVMVGRMEQVICKKLIQATEILSN LNLIGIIANETKCSS" gene 23698..24090 /locus_tag="DP116_09555" CDS 23698..24090 /locus_tag="DP116_09555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867484.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carboxymuconolactone decarboxylase family protein" /protein_id="PRJNA477356:DP116_09555" /translation="MTKLIEYEQASDEVRVVYDDIRATRKTDYINNFWKAIANHPPTL RRTWETVKEVMTSPGELDPLMRELIYIAVSVTNNCEYCIASHTAGAYAKGMTDVMFGE LMGIIATANTTNRLANGYQIPVDEQFKT" gene complement(24266..24463) /locus_tag="DP116_09560" CDS complement(24266..24463) /locus_tag="DP116_09560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310817.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09560" /translation="MTPKSGLFLAGSCITAIAAVGSMFELSSGQPDLGTEITAIILAI SIPLTGLFFVAAVRDARANIK" gene complement(24472..25047) /gene="def" /locus_tag="DP116_09565" CDS complement(24472..25047) /gene="def" /locus_tag="DP116_09565" /EC_number="3.5.1.88" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptide deformylase" /protein_id="PRJNA477356:DP116_09565" /translation="MPIEIAVEKKKLQNPPLELHYLGDRVLRQATKRISKVDEEIRQV IREMLQTMYSKDGIGLAAPQVGVNKQLIVIDCEPDNPANPPLVLINPTITKASREICM AQEGCLSIPGVYMDVKRPQVIEISYKDEHGRPRTLKAGDLLGRCIQHEMDHLNGVVFV DRVENSLALTQELSKHGFSHQAVKPIDRVVR" gene 25412..25696 /locus_tag="DP116_09570" CDS 25412..25696 /locus_tag="DP116_09570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743923.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09570" /translation="MERPIKKSERQSSEGTDTESENLDSMPTAQPSRKNSKRSDDRSE GRGKKASFGDESRQPVNPALARGPKPVKPKANIKTEPDTELEPISEESQD" gene 26130..26948 /locus_tag="DP116_09575" /pseudo CDS 26130..26948 /locus_tag="DP116_09575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878424.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 26316..26325 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 7818 a 5606 c 5724 g 7809 t 10 others ORIGIN 1 cttttgaatt tctcgtaatt cgcttcgcgt ttttgcagaa aatggtaact gattataaat 61 ctgctcaact aaaatctgta gaaactcatc tagatgcttt tgatattcgt aaaatgggtg 121 taaaatttct gcctcaatct tttcagcagt ttgctgcgaa atacccaatt tttgcttgaa 181 ttcattcaaa atgcgacggc tcgtatctga tatcttgtta tcactaacaa cccttttcac 241 ttcttctctg tatttctctt ctgctgatga tgagggtaga ggtgtgggtg cagattgaat 301 atcctgtacc tgcagcttat taagtagttc atcgatacgc tgctgcacat cttcatggac 361 acttatttgc gatcgcacat gttccaaaac cttgcaaaaa gcttctctcc caccaatatt 421 tgctaactta tccaacatat ggggtatgaa agttgctaca gaaccatccc agctgatgtg 481 ttgcaacacg ggtgcgtcgt tacccaatgc agcaattaag aaagattgac gttctctctg 541 tgattctaca aatggtctga ggatttctgc caactcattg atagtgttgg ggtctaattt 601 catcatcgac tcaacctttc acctgcgcca atgttgtgag taacacgctt aattacaaat 661 ataatgtaga aaatactgct tcaggttgtg agaggtaatg gtgctgtttt tctaagaatg 721 agcgagtgta tgtttgttta ttcctcgctt ccaaatacaa catctttata cagcagttta 781 atcggaaact caaaatcaat aatctttgaa cgccaggact ttttcgggtt tggtttgatt 841 gtagcgttct gggtgagggg aaactccgga gctcgcttca cttgggagca gctttgctgg 901 gtgctgaggg gaaacgctct tcacgtgagt tattttctcc cccttctttt ttctccctgc 961 tccctgtgct tcttctaagg tgcatagtac atatgaaaat ccccaccaca tgatggtagg 1021 gaatagtgtg tatctttttt ccttttattt attagggcta gcccaaaaga atttatgcac 1081 cttgctcatg cattttaggc tttaatatcg aatccgggtg catcggaata attaagtcgt 1141 agtcaaaagc caccagtagc aggcgatctg cgcggagcgc agcagcgttt gcgcagcgcc 1201 cccttagggg ctagctatcg ctttacatgc aaaaagggtc aacttttggt gctgacccat 1261 tttctgatta tttaggatta ttagtttggg gtatgagcgc agatgttgcg agtcggagtt 1321 tgagtcattt gacgggagga gatgagaaaa aacaagaagc cacaacgcca gatagattga 1381 acagctagtc tcgaatcaag atgaaacaaa acatagagtt ttctgtgttc ctacttgcat 1441 aacgtgaatc tagacttcct gccaaagtag ccgtcatgaa ctgcaaacca ccgaacacac 1501 
gaaaacattt ctccctcact ccttcactcc ctcattccct cactatctga ctccctcact 1561 ccttattttc aagctaagtg taagcagcag acacagcaca gggagaatat ttctctgtct 1621 atgttgcttc ttttccatct ttaccacatc aatgactctt gaaagaggtt gattttggct 1681 gcctaacttt taataaagtg cttcttcttg gtgggtttcg attgtgacgt cagacgttgg 1741 ataagcaata caagttagca catatccagc tgcgatttgg tcatcgtcca agaaagactg 1801 gtcactctgg tcaactgaac cttttgtgag cttaccagca caggtagagc aagcaccagc 1861 gcggcaagag tatggtaggt caagaccttt ttcttcagca atgtctagaa tatattcatc 1921 ctctgggacc tcaattgttt cgttcagccc ttcagcaacg tttatcagtg tcactttgta 1981 agtagccatt tagtcaccct ctatgattgc gaatggatcg tataagttat cctaaagcaa 2041 acaccacgac tcatctgatc agatcagccg ttagttaaaa tttgcttcac ttctgatact 2101 acgagaaaaa caaaagcatg taaccaaaaa ttgaagtcga ttagtaatga aattttgtga 2161 gttcttaaat ataacttttt tttatacgac agaacatcaa cgggttcact agagcagcgc 2221 cttaggtagc cacttttgtc gcgactgcct ttaccacggt tgaaggacgg ttgagtcact 2281 acaccgtggt tcgtgtctca acctcgtggg ctttgtgctt aagtgtaagt aacagagcag 2341 acatcgcaca gggagaagat ttctctttct acgctgcttt ttttctattc ataccacatc 2401 agttactttt tggaaaaggt taatttttga gatgcctaac cttattcgat atcatctttc 2461 ttgtgggttt cgatggtgac atcagaggta ggcttggcga tacaggtgag cacatatcct 2521 tgatcaatct ggtcatcgtc caagaaagat ccatcagact gatccacagt acctgattta 2581 atcaaaccag cacagtcaga gcaagcacca gcgcggcaag agtaaggcag gtcaagacca 2641 gcgtcttcag caagttctag aatatattca tcatctgcta cctcaattgt tttatttatc 2701 ccttcagcgt cgttgatcag tgtcactttg taagtagcca tttagtcatc ctctatgagt 2761 gcaaacgcat ctaataagtt atcctaaagc aaataggacc actgatctta tcggatcagt 2821 cgttcgttaa atttgcttca cttgtgatat tacgagaaaa aaaggatgta attggaaata 2881 gaagtataaa aaatagctaa atatatcaat cctaggtaag ttgatttata tcaaccaatt 2941 gaagtcaaaa ataaggaaat aggcttattt tcacccttac acccctttgt gcttatgcac 3001 aagagcaaca aaccaagcag tcgttatgaa ctgcaaacta cctgacacat gaaaatttat 3061 tttttcgtct gcgttttgcg ttctgcgttc tgagttttga gttcaaatca gtgacaaaat 3121 aatgtacgat aattctgctt tttcgcaaaa acgccctgta cgagtaggtt tggttggtac 3181 tgggtataca 
gcaaagctac gagccgaagc attggtgcac gatgagcgct ctcatctagt 3241 cgccgttgtc ggtcatacac cccagaagac agaagccttt gctagagatt accagataat 3301 ggtcgtagat tcttggcagc aactggttga gcaagacaat gtagacttgg tcatgatttg 3361 tacagtgact cgggatcatg gtacaatagc gcggcaagca cttgagagtg gcaaacacgt 3421 tgttgtagag tatcctcttt ctgtagacgt ggcagaggcg gaagaactcg ttgagcttgc 3481 aaaagcacaa aaaaaattgc tccatgttga acatattgaa cttttgggtg gattgcatca 3541 agctttgaaa caacatctac caaaagttgg tcaagtgttt tttgtgcgct acaacaccat 3601 caagcctgaa catcccgcac cccgcaaatg gacatataac cacgagcttt tcggctttcc 3661 tttgattggc gcactttccc gattgcaccg tcttgtggat ttgtttggta cagtcatgtc 3721 ggttaattgt cataaccgat tttgggagat agagacacag tactaccaaa gctgtttctg 3781 cgatactcat ttgtgcttca acagcggact gttagcccat gttgtttatg gtaaaggtga 3841 aactttgtgg caggcagaac gtaggtttga agttcatggt gaagatggcg gattaatttt 3901 tgatggcgac aagggacttt tggtacagca cggagaaaaa aaaccgatag aagttggtgc 3961 tcgtcagggt ttgcttgcca aagatacgac aatggtgtta gatcatctta ttgatggcac 4021 tcccttatac gtcaccccag aagaaagctt gtacaccctc aaggttgcag atgcagcaag 4081 gcgttctgct gaaatgggtt tgactatcat cgttgaggat gagtgattgg ttattatagc 4141 atcaaatctg gaataagtaa tcaagatttt gtagacaata aaacagcttt ggtgaaaacc 4201 gtggacgttg ttttacaaga aattaaccag gataactggg aagaatgcat tcggctcaaa 4261 acaactgagg aacaggaaaa ctttgtggct tcaaatcttt actctcttgc ccaatccaag 4321 ttttatccat cttgtgttcc tcttggcgtc tatcataacc aaactatggt cggctttgtg 4381 atgtatgaga caatgtcaac tggtgatcaa aacagtggtt actgtatttg tcgtctcatg 4441 attgacaaaa atcatcagag gaaaggatat ggcaaagctg ccatgcaggc agtaattaat 4501 ctacttaagg acaagccaga ctgtaaaaaa atttcgacta gctatgttcc aaaaaatgca 4561 gtcgctgaaa aactgtatta tagtttgggc tttcatgcga ctgggttaat agaggatggc 4621 gaaattgttt tatctctttc ggtagagtca agagcaggtg tgacaacaca gtcatgaaac 4681 ccaaccataa aaggatttgt tgagtttcgt gttgctcaaa ccaatataca gttatctgca 4741 caaacacggc aaactatatt tatgggagat tcttctttag acgaaatttc atgaaaagtg 4801 aaacgattgt tcttttatct aacagagaac tagataaaat gcgtaaggct ggacgtttag 4861 cagccgaact cctacaccat 
cttgcaccat tggtgaagcc aggggttagt actcttgaac 4921 taaatgacga agcggaacgg tggacacaag cgcatggtgc aaaaagtgca cctcttggct 4981 acaagggttt tcctaagtcc atttgcacga gtgtgaatga agtcatttgt cacggtattc 5041 ctaatgccaa gcaaattctc aaggaaggtg acatcattaa tattgatgtg acgccccttg 5101 ttgatggcta ccacggtgat acatccaaaa cattctttgt cggtacccct tccccaaaag 5161 ccaaaaagtt ggtagaggtg acagaggaat gcctcagaag aggtattgct gaagttaaac 5221 cacaagcacg tattggagat attggtgcag ccattcagga gtatgctgag gcacagggct 5281 tttctgtggt gcgagacttc gtaggacacg gtgtcagcaa cattttccac actgcgccgg 5341 aaatacctca ttacggtatc cgtggtaagg gaaaacgtct gcgacctggt atggttttta 5401 ccatagagcc gatgattaac gaaggtacgt gggaagttga ggttttgggt gatggttgga 5461 cggctgtgac acgcgatcgc aaactttccg ctcaatttga gcacactctc gccgtaactg 5521 aagatggcgt tgaaatcctg acgttacggg aaagtaggac ttaagcatcg gtatcaaagc 5581 tacaaaacga tttgtggcaa aacaacacag atccgcctgg aaaattaaat aaccgacagc 5641 ttcactcaat cgtattgagt tttagtgttg tgtttgagat ttgatgatgc tgcttgaaaa 5701 ccgccgagtt tgggactatg gcgggatttt tttgtgtggt ttgtgggatg ggaggagagc 5761 gatgcggaag ccgccactac gtgaacgcca taggtgatcg caaagataaa cagtttaatt 5821 tacctagaat cactacaacc ccatcaaaat tcttagcctg tcttcgactg ctgtcagtgt 5881 ttttggagac actgtacccc atcgcttttc aagacgatct acagaaatcg acctcacatc 5941 ctcgcacttg acaaagcttt gtacttttaa tcctccctct ggcgggttca gttctacgtg 6001 aaaagggatt cctttttcct ttgaggtaat cggtaagaca acaaccaaac cagatgcacc 6061 ttgattaaac aaatcgactg aaatcaccag acaaggacgc tttccggctt gctcatgtcc 6121 tctcactgga ttaagatctg ctaaccacac ctcccctctg gcaatttgcg ccactattct 6181 tgcaccccat cagccattgt ttgatcccat acctgcctct ctgcctgttc ttcctgccaa 6241 agcgcttcat tctgtcttaa agcggcaaac gcttgattag cttgtactaa gaatacatgc 6301 ctacggtaat tctctatcgc cctgtctaac accgtctgaa ttgtctcacc agaggtttgt 6361 gccaatttca ataaagtttc atgggtagtt tgactaatac taacagttaa ctcaggcata 6421 gctgtatttt tcaacttcta taagcattgt agccaaagcc aaagcagctt ccactacatc 6481 aaatagttgt ttatggatta tggtcgtttg caagtcggcg atcgcaaact ttccgctcaa 6541 tttgagcaca ctctcgccgt tactgaagat 
ggcgttgaaa tcctcacgtt acgggaaagt 6601 aggacttaac acttcaattg ccttcatctt gatggcatca aaactatagc gccggatctg 6661 tggcaaagcg acacagatcc tccggcggga aaattaaata actgagacag gttgtggcag 6721 tggtttgagg ttctctgttt agcgttcggt gctaagtttg gatcagtgaa ttgggctgat 6781 tacgttcata atgagtgaaa tgtcttggac actaacgatt aacacttcgc gcctcatcct 6841 tcgtcctcaa caacctgatg attataaatc ttggtacgct ggttttgcag gtcggttgcc 6901 acaacagtat aaatatgacg agggacagat tagcttggat ggttgtgacc ccaattggtt 6961 ttcaactttg tgtaagcgcc atcaagacca agctttaagc gattatgctt atatttttgg 7021 tgtcttttct cggcaaaata atcagcatct tggcaatgtc gatttatcaa caatccagcg 7081 tgaagagaaa cagtgggcta atctgggcta tagcatccat aatcagtatt ggcgacaggg 7141 ttttggcaag gaagctgtga aagcagcact ccgagtcggg tttgagaatc ttggatatca 7201 tcgcatcgaa gctgccatta atcttgataa tcatctctcc attgctcttg ctcaaagtgt 7261 tggtctgcaa aaagaatgta ttcgtcgtgg cttttactat gagaacgaac aatgggtaga 7321 tcatttgatt tatgtagctc ttccctctga tttaggtcta gtcgagaaac caccagtgat 7381 cgtcggataa caaccccttg cggcggacgg ttgagagatc ttggtgtagt ggtaaaggtt 7441 atttgccaca tcatttattc aaagtcaatc tgccatgcat gtgtctatgg gcataaattt 7501 aaataactgc aagatgtaag ttatgctgga aattgaaaac cgccgagtct gcgacgatgg 7561 cgggattttt tgtttgtggg gttattggct cacttttccc atgatctgtt gctggtttag 7621 tctaagttaa gacacaccca tattcaaggg taaaatgagg aaagactata tagtttgtaa 7681 ctatgaatcg gcatacaagg caagtcattt tatataagga tgaagatggt tattgggttg 7741 tagagtgtcc aagcctaaaa ggatgtatta gccagggaaa cactaaggag gaggctttgt 7801 cgaacatcaa ggaagctatt gcaggctacg tgaatgcatt ggaagaagat ggcttacctg 7861 tgccagaaga caattttgaa acattcctag tagtcgtgtg agcaagttac ccagtatttc 7921 aggaagagag tgtatcaaag ctttggaaaa gattggtttc taccaaaaac gtagagagag 7981 cagtcacatt attctgcgaa gagatgagcc gtttgctcaa attgttgtcc cagatcatca 8041 ggagttggct aagggcacac ttcgagctat tattcgagac gttgaactta gcgttgaaga 8101 attcgtgtca ttactgtaga gaggcagtcg cggtctaaca ctataatggg aacgttcaat 8161 tagaagtaat tccctaattt tcaaaatata caagatgcct acattaaagc tgacgaataa 8221 gcaagttgtt gaattaatta aacagctacc tagcgaacag 
caagtagagg tgtttagatt 8281 cctacttctc caacagtggg gggagtggga ggcactatcg cgctatgcag tcgataaagt 8341 tagacttatt gcccaagaac gaggctacaa ctgggatata atgaccgagg aggagcgcga 8401 agtttttgtt gataaactga tggatgagga ttagtgcgat acattgcggt ttttgatact 8461 aatattttga tctctgcttt gttgtctaca agtggcgatc cgtttcgatg tttggcactc 8521 gcaaagattg gacaaattga gtctgtgact tgtcaacaaa tcctggatga gtttgcccaa 8581 aagttggtgc ttaagttcaa gttttctcag aagatggcgc aagcggcagt tgaagagata 8641 cgtagttgct ctcgtttagt tgagattggc gcgacattag aggcagtgcc agatgatccg 8701 gataatgata tggttgtcga gtgtgccata gttggcaatg caactcatat tgtcagaggt 8761 gagaaacatc ttttgactct agcaaagtat caagagatta agattgttaa agcaactgag 8821 tttgttgcgt tgttatctca gtcttgacaa tcctggcagc ctcatccttt taatcagaag 8881 ttgatgtaca cccgataatt gttggatgat cgcaatcaat tcttctttat gggttatctg 8941 tttggacgta atcgctaggt gctctgtgga tgatttatcc ctctgaaaac tgttacggta 9001 tcaagcgtta tgattgagat gctctagaca cagtcagcaa tgctgcataa ttcactacag 9061 agcgagtcat gaataaagcg tatttgacag atttcaattc atggattgac cagacagccc 9121 aattcttgcg ggagcgtcgc tggcatgaaa ttgacgtaga gcatctgatt caagaggtcg 9181 aagacttggg taagactgaa cggcgcggga ttactagtca actaactcgc cttctactac 9241 atttgctcaa gtggcaatat caacctcagc gtcggtcgga tagctggcta gattccatca 9301 ctgatgcacg tactcaaatt gaattagcca ttgaagatag tcctagtctt aaaaactatc 9361 ctacagagca acttgaagag agttaccaac gcgcacgcca ccaagcagcg aagcaaacta 9421 atatacagat ttcagtgttt ccaaaagagt gcccatatcc tgtagagttc gtattagatg 9481 aagattggct gccagaagag aacagaacag agtaagtgat gtattcattg ctattgaggc 9541 ttatgatcaa ggcgaatcgc ttgctctaaa gctaattttt ctctagctct acgagcatac 9601 ggctgtaatt ttgtgcgagt ccagcctgta attatactca tatcttctaa attcattccc 9661 ttcattaaca tttctacaca ccatgtctgt tgagcttgct caatgactgg ctgttgtcct 9721 tctggtgtta ataatccctc agttaagatt cgccagcgct gttgtatttc tagctctgtg 9781 attggtattc catcatcgtt gagaaataat gcagggtggt tatctttacg acttttaagc 9841 cactgagtta agggattgcg agtatacgat ccataccgct tacccataat ccactgattt 9901 acaggaactt gtcgagtagc gccctgagtg atttgcaata agtactggtt 
ggaatcttgg 9961 atttggtggg aacgctctaa attcacaatt tcttctgcag ataaccctgc gccaaacaaa 10021 acgtacatca gagcataatc tcgtagttct gattttttgg ctctttgcaa aatcaagtgg 10081 actaaatttg ctggtaaatc taccaccttt tctgtcgaaa taaactcatt gttaacttga 10141 tttgaggatt gtgtagcgac ctttaaaaac agtgccacta aggtttctaa aaactcatct 10201 ctgtcatgcc aaagttcatg aaattcactg gtaaactcaa tcacggcata ccccaacaac 10261 atactattga gtaaacttgc tagcttttga ggtgataatt gtacatgtaa ttgctcacgt 10321 tccatcattg ttgctaaata ttcagcaaca tagttgttgg cttgcgctaa gcttcttcct 10381 agtgcttgac gattttctac tggatatttt cctgcctcac caaccataga acgtaccaat 10441 tcaggaactt gttctaaagc ctttaagcgg tcttcgcaat aatttttgag ggcttcatga 10501 acattagtca tttgacttgc ttgcgttttt aaggattcac ccagttcttt aaacactggt 10561 gattcagaaa tgactgccag aactaatccg tgcttattgc caaattgccg gaatagcgtc 10621 acttcattaa cttttgctaa ttctgcaact gccttggttg tcgtttccgt aattccctga 10681 gctgcaaaca actccattgc tgcattaatc agcctttgtc gcgttgaaat tatatcacta 10741 gacataagtc gaaagtgcaa gtggcacttg cattaaaatt ttatccttgc tagactaaca 10801 gatgtaagta gcacttgcat cctctatttc tactttaact gtcttcgcca gtgttcgtag 10861 agaatgtggt aactagtaca agactgtact agtttattta cggtattcac gtcaaaggaa 10921 aatttatgac aacaaactca acggcgcctg ttgataagca gccgcttact ctgagctgga 10981 ctaacgtggc atttttcggt acaattcatg ccttggcaat gctcgctccc tggtgttttt 11041 cttggtctgc tttgggagtc atgatatttc tgcactggtt gtttggcagc attggtattt 11101 gtttggggta tcaccgactt ttaactcacc gaagtttatc tgtgccgaag tggttggaat 11161 atgcgatcgc caccttggga gcactcgcca tgcaaggagg accaatgttt tgggtagcag 11221 gacatcggct gcatcatgcg catacagaag acgtagataa agatccttac tctgccaagc 11281 gtggtttttg gtggagccat atgctttgga ttttttaccc acgtccagaa ttttttgacg 11341 aacaacatta caaaaaattt gcccaggaac tataccgtga tccattttat cgttggctaa 11401 atcgctactt cttgctgctc caaattcctg ttggtgtcct actgtatgtt ttgggtggat 11461 ggtcttttgt gatctatgga gtctttgtga gggcagtctt gctttggcat accacttggt 11521 ttatcaactc tgcaactcac ctgcgcggtt atcgtcgttt ccagttgaag gacaactctc 11581 gtaatctttg gtgggcagca cttttaactt 
atggagaagg ttggcataat aatcaccatg 11641 ctcacccaaa tgtcgctaaa gctggatacc agtggtggga agtggatatg acttggtggg 11701 caattcagac attgcaaact ctggggttag cgaacaaagt agtcatgcct ccaataccaa 11761 aaccagaagt ttgacgaatg ttaacttatt cactcccgct ttcttaaata gcggggagta 11821 caaggggttc cccagctaac ttgctcagaa ggtatctcag taaaaacatt actacgagcg 11881 ccaatcacag cattagcccc aatttttact cctagtccaa cgaagcaatc agcggctatc 11941 catactccat taccaatggt gatactcgct gttttcaacc caaaagctgg gtctgttata 12001 tcatggctcc cagtacacag gtaacttttt tgagaaatga cgcagtgctc accaatgttg 12061 atatcgtcaa ggctgtagag aactacgtca tctcctatcc aactgtagtc cccaattgta 12121 attttccaag ggtaagtaaa gcgagcagta ggtctaatga ttacgccttt gccaacacga 12181 gcgccaaaca gccgcagcaa cgcgcaacgt agattattga gcgggtgggg agttagggga 12241 aaggcgatcg cctgtataag ccaccataac aaaatatacc aacctgctcg tcctcggtca 12301 aaccaggatt ggttatattt acgtaaatct acaaaaggtt cttcactagt cattagtcat 12361 tagtcaagag tcattagtcc aaggttacaa gcttccttgt ccccttgtct cctaagaaac 12421 atagggaact ctgaacaggg aactctgaac agggaactct gaacaggaaa ctcttaacac 12481 cgaagaaggg aaagagtgtt tgtttcattc ataacgggtg gtgcgccgcc cgtgggatgc 12541 tcctaaagca ctgtttggct gattcaaaaa ttaattttta attccagcgg cggggcgatt 12601 ttctgcgaca ccaactaggg gatgtggttc ctcgccactg gttgtcgatg tggtgttgag 12661 tcgtccgtca tactgacaca attcgtaaag ttttatgcct atttggtact cgtattgact 12721 caaaagccgt gcataaatat acccagctct accatccaaa aaaccacgct gaatgatgta 12781 gaataaaaca aaccgcaaaa atggtttaaa tggtagccgc acccatagtt ttttcaggaa 12841 gcgcttacgt tgtagagcat tcccgaagag atttgcgcct attgtatcgg tatcatcttt 12901 accagtgaga agatttcgat agacatgagc ctcccaattg gaataacggt tatgtcgttc 12961 tatccagtga taaaggtcac ggaaatcctc gtgtatcatg tcgtttttca gatatccagc 13021 ttttccatct aagataacgt gttcgtgaac ttcgttatca ccagtgttgg gaacatcccc 13081 agtctcaagg ttttcgtagc gacctttttt atgtttaaat aaacgcaaat tccaatcggg 13141 atatttaccg ccgtggcgaa tccatcttcc taagaaaaag actcggcggt tgagataata 13201 gccattgtat tggggctttt gaatggcttg agcaatttca tcccacagat cgggggtaat 13261 gcgctcatca 
caatcaacaa ttaacaccca ttggttacgg aagggtaaat tttctaaaga 13321 ccaatttttc tttttaggcc agcttccatt gtagtaaaat tgtacgacat ttgcaccata 13381 actttgaaca atttcgacgc ttttgtcagt actttgagaa tctataacaa agacctcatc 13441 agcggtttga acacttgcca gacaagcagg caaattcgct tgttcgtttt ttgctggaat 13501 gagtacggac acgggtaatt tgaaagacat acaagtcata agttgttact ctttcttaga 13561 tttggacaga agaccctgaa tggcggattt taggtatcca atttgaccat acgcatacac 13621 aagtttatca aatcgttctg ctggatcggg aaaatgttgt aatgttttat ataaaccacg 13681 taaaaaacgc tcgctaccac gctgcaattg accgaaacca gctttgccag caagttgttc 13741 gcggtaacac tcactaaccc cttgccacca accccggcta ataaaccagg aacgtttgag 13801 gcgttctggg gagacattgt gagcaacttt ggcgttggga agataagcaa cttgcatttg 13861 gtgttgaagg gcaagttcgg tcatttgcag ttcttcattt gataataatt ttgtcccaat 13921 tcgaccgaga tgagtattaa aaccccctac tgtttcgaga aagctgcggc ggatagagta 13981 gttcaagcct ctgggggttt gtcctgggtt atcaatgtag acaatactct cgcccaagtc 14041 gtatgctccc aaattggcag ataatccttt tgatagccaa ggtggtgctt gggttccttc 14101 gggccataat agagacactt tgccaccagc aattgctagt ttcgggttat cttgataagc 14161 tgagtataaa atctgtaacc agtcagagct agcttctgca tcgtcgtcaa ggtatgccaa 14221 aatttctgca ctagcaactc tcgcgccagt gttgcgagca acagataaac caatgacagg 14281 ctcaaagata tacttgaggc gggggttgtg gcttctttgt tctacgactt caacagtgcg 14341 atcgctcgat ccattatcca caaccacaac ttcaaactca ccaccaaaat cctgtgccaa 14401 aaggctatct atagcagcgc ctagataggt atctcgattg tgagtacaaa taatcgcaga 14461 gattagtgtg tttggcatag actttaagtt tatctttttt ggactttgtt ggttgttcgt 14521 ctttggttga gagttgttag ctattttact caccgctaac cattaacgac taactaataa 14581 caaagtctaa ctcttgcctg ttgttgctct catctatacc aaacatttcc gactgacaaa 14641 ccgagttttt ttagaaaaga tgaaattatc tttagttctt taggaagatt tttcaggaga 14701 attcaaaaaa acagaactca gaattttctc taggaatcaa tacgaataag tttattttct 14761 gtgctagccg actcattaat aggtgtcaaa gaaccagaag ggtttgtgag tcctgagttc 14821 tgtattcaat cgcagtaaaa tttcagtcaa atttcactgt gattgaatgt gaccgtgtaa 14881 aatcacaaga gtttctgcta gttgggtaag acgagtttct agatattgct ttcgtcccaa 
14941 gagttggcgt aacttttggt aacgcgcaat tgcgatactc acgctaggaa cctggtcggc 15001 tggacatgga cgcgaccaca ctttaccaga agcatccaaa ccacacaaac gatacccagc 15061 acggggtgat tggcaggaaa cattggtatt ttctgcacaa atttcttcta cgtagtccat 15121 aagacggtca acttctgcat ggcgcagtgt tgcagttcct cctggttgga cttcgcgttc 15181 atgggactct agccagccat tgacaatggg tccttctaaa taaagatctt gaatttgcct 15241 aacaattttt tgcagttctt tttgccaaga accaacactt tcctgaatat cttgcaacaa 15301 attcattgcg aaggcgggat ttgctccgtt gcgatggcta gtaaagctag gagttttgag 15361 cttaggtagg cttggtgttt tgctagcgct gtcttgggtt ggaaaagttt gcacagaagt 15421 attgtaggta gctaagtttt gttccaagtg ccgagaaggc tcttcctcgc aaaactcaga 15481 atgtgtttct gagtgctctg cttccccaat gctaatccga aacgacagag actgctttgt 15541 tgaagcatct gcctcgacag atccagcagt gttgcgagtc cccaagtcgt gtagagttgc 15601 ctctatgcgt tttatgcctg ctttcatacg aatatcaaag tgtgttgtgg ttgtagtgtt 15661 caagggtgaa ggcacaagat ccactacatt atttactgat agcctagtag aattgcaccc 15721 cttctcaatt caagatacta ggtaaacagc tttgagtaaa gtcgtattag tgaccggacc 15781 tgcacgctct ggtaaaagtg aatgggcaga aattttggct atggaatcac aaaaagtcgt 15841 tatctatgtg gcaacggcag tagaaaactc agaagataaa gaatggcaag aacgtattct 15901 acaacataaa caacgccgtc ctcaagattg ggcgacatta gaagtgccat ataaattgtc 15961 tgcaactctt gcgaacgcaa aaccaaatat ctgtctattg gtagattctt taggcacttg 16021 ggtagcaaat cttttggaac aagacgagtt ggtttgggaa aacattgttc aagagttttt 16081 agaaacagta gagttagttg ctgccgatat aatttttgta gcggaggaaa caggttgggg 16141 tgttgtgcca gcgtatccta ttggtcgaac gtttcgcgat cgcctaggtc atttagtgcg 16201 tcagttgggg ggtatcagcc aagatgttta cttagtcact gggggtcatg ttctcaatct 16261 gagcttgcta ggttcccctt tgcctaagat cagagccttt aactagcggt ttcttgaggt 16321 agcaacatat gaaataaacg ccccttccct attccctatt ccctgttccc tattctctat 16381 gtttcttgat aaaaattaaa aagtaacaaa tagttaacaa ctcagtaact cttcttaaga 16441 gtttattcaa gattcctcat ataaaaacta aaatctacag tctaaaattc tagtatggca 16501 tcacaacaag acgtaaaaaa atacttagct tactggttcc agcttggcaa aaaagttgtc 16561 ataaacaacg gtgcggcaac cttactgccg caaaaagtca 
ttgctggtga ccgctacagc 16621 gatgagtttg aagagtgctg gcaaaggatt atttctcctg agtcaggcga ctgctacata 16681 gaaggtacac aggaaactat agcagaattg ctgacacctg cttgggatgt ttctccctgc 16741 ggacgttgcg atatgccagt tcctgtgaaa aacgtgggta tgccgccgtt ggtatgtcct 16801 tgctttgatt taaccacttg gcctaataca gaattaccag cgccgcgttc tccagtcaat 16861 aaccaagagc aattgagggc aattcgcgat cggcttttga gtgttgtgaa atcaaatcgt 16921 gaataattgg tttacttggc aatctaagta ctgcatttca taaattcgta tcgcaataag 16981 ggcacggcag tgcccatacg tatcaactta acgttcaagc ctttattcca cgttgattta 17041 accccacccc ccttcgggta tgccagcctt cgcctgatgg ctaacaccag tcgccaagtg 17101 agggaaagcc gtcattcgcg ctggattcac cgtaaagcta cgctatggcg ttccgccacg 17161 ctagtcccta agagggacgc tgcgcaaacg ctatcacgtc ccctcccctt agtaagggga 17221 actcaccggt ctcctctggg gattaggggc agggaggttt tggcgtgaga caaaaccggg 17281 gtggggttgt acgaaaatag tggacaagca taagggcatg atatgacgta attacaagag 17341 tttcaccaat gggcactgcc tttgccccta cgactgagat tactttactc ggttgaaaat 17401 ggctatataa cgctacgaac ttttgcaatg ttgtacttag cataactcct aaatctatcc 17461 cagggtaaga gcatctcaat caggcttgac agaaggaatc agaagaatct aaagtgttgc 17521 gaatgaaaaa acaattgaga acctgaatca tataattatt atccatagcg gtgcgcttga 17581 ttttttgacg gagactacgt ttgtaggatt gctcgcgatt caccaaatga attactgaca 17641 gaacttgaaa aagaattaga aaaattgaac gatgagctac gaggaatata tgaatttttg 17701 caacgagaac cacgaaatca agatagtagt ctttatctag gaaatggtat agaactcaat 17761 ttacaaaacc ctcttcacga gcttctctat caagtttaca accataccct agagcgagac 17821 tttccctgtt ttaaaactct caaggttaaa gtccgtacct ttgagcttat agaagataga 17881 catttgagga ttgaacaaaa gcaagatgtg tgccggtttt tagaagaagc tttgtgcaat 17941 gttggcaaac acgctacagg actaactcgt ttggaagtga tttgtaagaa cggcaatggc 18001 tggtacactt tgagcgttac agataatggt tttggtatga aatcagataa agaaggtcaa 18061 ggaacacggc actttagata tatcgctcga caactcaaag gcaaatttaa gcgatcgcct 18121 ctttaagtga agctctagcc tccactgtag atggacaatt aggaatagcg gcactaactt 18181 taatacaaga cctcgctcaa ttagaagagt ctttacagaa tctggatttt tataatcttt 18241 ccttgagcaa ttgagaaagc 
tatgcattca aaagtctttt actgataact cggactatat 18301 gactacaacc aatttcctat aagtataaaa ggtgcccaat agtatggatg ggaatatttt 18361 ttgctgctgg caattaatgc cctttgcgct ctccgcagag cctctgcctt actcactcca 18421 ttctcatacc attctgcata aaattttgtc accaaagtta cagtggcggc atcattaatt 18481 gaccaaagag aagctaaagc gcttttaacg ccagcgttaa ctgcgctacc tgctaaagcc 18541 aaaggaatac ggtcatctcc aatagcagtg tcacaagctg tcagtgctag taattctact 18601 gcctcagaac catgaagggt attgcgaatg agagtatata gatcagttat agtgagtttg 18661 ccattatttc ctgtgatgag gaaggtatct tcggaaactg tgctaaactc accgtgagtg 18721 gcaatatgaa taatagggta aacacttagg cgaagttctg tctgcagact ggtgtatgtg 18781 aaattctcat ccaataattg cttgctacca ggaatttgtg ctgcaacttg attgatttct 18841 tgcctgacgt tagtgagagc ctcatacatt cgaccatcaa ctattgcatc tttagttaac 18901 cccatagcaa gcatccgtag atttttacgg ttcacttttt tggcatttat gagggtagtc 18961 gctgaggtgg tagcgatgct ccgaagggag ccagcaaagc tgatcgcata cctctgaacc 19021 aagaattttt ctccatcatg cagcgccgcc attagtacac taccaagaat tccatcatga 19081 ataaatggta atctatacag acttgtatca ggaattccaa gaaaactgtt tgctatagca 19141 atcctaaatc ctgagccgct acggacacgc tttgcgtacc ccccttcggg ggcgcttgcg 19201 cttacgtgag caaacttgat ccccgacttc gtaaaagttg tcggggatgt ggatctgtag 19261 ctttcaattc tcacaaatca aataggattg ctataggatt cagaacaatt aagtcaagag 19321 cagattttgc gtataatcaa tcacaaaaaa gccaatattt tccatgaaaa cactagataa 19381 gtatacttct ttttaggaaa gctttttacc acaagaattt tgcattttcc atgccactat 19441 agagaagaga gtggcaaatg cagtaagagg aaggcaaaat aaggctgtgg agataacatt 19501 tttagggacg agttccggtg tacctacgcg atcgcgcaac gtttccagtg tcgccctgag 19561 gttaccacaa cgtgctcaga tgtggttatt cgactgtggt gaaggaactc agcatcaaat 19621 tttgcggagt gacctcagag tcagccaact ctcccgaatt tttatcaccc atatgcacgg 19681 cgaccatatt tttggcttaa tgggtcttct cgccagttgt ggcttagcag gaaatgtaca 19741 acgcgttgat atttatggtc catccggatt aaatgactac ctacaagccg cctcgcgcta 19801 ctcccacacc cacttttcct atccggtcaa agttcacgcc gtacgtccag gcgtgattta 19861 cgaagatgat gaattcattg taacctgtgg ttttttgcat catcgcgtta ccagttttgg 19921 
ttaccgtgtt tcagagaaag accgaccagg acgttttgat atagaaaaag ccaaagtgtt 19981 ggaaattcct cccggtcgca tttacggtca actcaaacgt ggtgaagtcg tgacccttgc 20041 tgatgggcga gtgattgatg gtaaggaact gtgtggacct acagaaattg gacgcaaaat 20101 tgcctattgt acagacacag tatattgtga cggtgcagtc gagttggcgc aggatgcaga 20161 tgtgttaatt cacgaggcga cattcgccca tcaagatgca gagatggctt ttcaacggtt 20221 acattctaca accacaatgg cagcgcaaac cgctttagct gctggggtaa atcgactcat 20281 tatgacacat tttagtcccc ggtatgcccc cggaaatgaa atagagttaa aagatttact 20341 tgaagaagct cgtgcaattt ttcccaaaac tgatatggca tatgatttta tgacttatga 20401 agtaccaagg cggcgggata aagaattaaa aaaagcggaa gccatagctg aaatcagtta 20461 acagtgaaca gttaacagtg aaactgataa ctgtttaata cagccaacat ctctacatct 20521 agagccttta agaaacttga tatcaaggaa tatctttatt ttccagtcat ttgcgcccat 20581 cttcacaaaa catttaattt ttctgctcga agaagtcaaa aatttttcgg tgtttgattt 20641 ctctagaagt agatgaaaac acatatctag aaactgaaaa aagtaaagaa caacattttt 20701 agctgctgac taatcaacat actgtacaca attgttctat ttaccaaact ggtaataata 20761 caaacaaagc aaaaaagttg cttgttatga ttgaactact ggtttgatta tcaaatccta 20821 aatcttccaa ttttcgcttt acttagcttc tgattataag catttagagt tgttagcaca 20881 atataagtat atagattacc agtctctatt taagtctttt ttcatactaa ttttttttaa 20941 aactcaaaaa aagttcactt ttttaataga aacatagtgc gaattatcac aaaagtagca 21001 aaagtgaact gattgtataa acagcttctt caaaaatcac aatctaaaca acttggaata 21061 gtaacaatat ggtaaagata gccagaaaga gttgactatt tcaatgttct tcacttaggt 21121 tgaaatgaaa atcaaaatga agatatttca gcagaggtga catcttgtac ctgaattcaa 21181 aattacgttg aaagagcgtc gggacaggag atcttccgaa ttcataactc cgaattcttc 21241 ttttttgact gtacttgttt gaaatgtaaa atttttttct tacaatcttg tagaaaaagt 21301 gattggatat aattaagagg actttgattg tggctgacat gagtctggag cagagaaaaa 21361 taattattac ttccccacaa agcaaagtta agcgcagaaa actctctaca attctgcttc 21421 gccgacgctt gcaaatctta ggtgtttctg gcgtagtcat ctcggttgcg agcgttctgg 21481 ctttaagtgc gaaaccgaca tatcaaagta ctatacaaat attggtaaat tccaagctga 21541 atgaaggggt acgctcaagt catactcaac agggagcaga aagcaatttt 
actgaccctc 21601 atcttgacat tattaacgac acgactcagc aaaagtttat gctgaattct aacctgattc 21661 aaaaagcagt taatttgctt cgttctgaat atccaaattt gacagttgaa catatcaaag 21721 gcaagaaagg tcaacaagca cctttagtaa ttactccatt agagcaacag acagcggaca 21781 ataaagtcat gaatcaagtc tttgaagttt ccttcaaaga caacgaccca gtcagagcag 21841 aaaaggttct gaagtctcta caaaaagtct atcaggaaga gaacatagaa caacgaaaac 21901 agagtgtttc caaagggctt acttttatca aaaaaagact gtccgaagtc aaaaacaaaa 21961 tgattcaagc tgataaaaat ttagaacagt ttcgcaaaaa aaatcattta ctcgacccgc 22021 aggtacaagg caaaattctt ctggaatctc ttgctgacgt tacaaaacag ctaagaacca 22081 cccgtgcaca gcttaaagac ttacaagctc gtcacgacaa cttaaaacaa caattgtcgt 22141 ctttatctca aaatactatt gtctcctttc gtctgagtca gtcaactcgt tatcaaacac 22201 tactcaatga aattcaaaag actgatctcg ctttaactca acagcggtta cgttatacag 22261 ataactatcc agaagtccaa aaactgatac agcaacgacg aactcaagcc gcgctgctac 22321 aagaagagat tagacgaacg ctgggagaca aagatgtcca agcaatatca gatactttat 22381 ttaaaaaaaa gccgttggat acagaaatat cgtctggtac cgaaaaacca gaacagacta 22441 caagggaagt accacaagct gacttgaagc tggtgcaaga cttaattcaa atgcagacag 22501 caatcttagg actgcgtgct aatgaaaaaa gtcttgctga ctcacaagcg caaattcgtg 22561 ctgaactaac caaataccca agtttaatca cagagtacaa ccgtctgcta ccagaagtag 22621 aaacgaatcg caaaacactt gagcaactta tagcagcaca acaatctcta ggattgatga 22681 tcgctcaagg tgggttgaat ttccaagtct tgcaacaacc ccaagtagaa aactatctag 22741 atagcaacaa gctatttata ttacttggag gagtgctgtt agcaccgatt ttaggtattg 22801 gtacagccct gatgtcagaa tgcaacgatg ctatctcttc tccagaagag ttacagaggt 22861 tgacaaacct tcgtttatta gggacagtcc cagaactatc acaacttagt acaaaaaaga 22921 gatcgtttcg ccggtccttg cgacgagtca tggctcattt ctcatcacaa actatatctg 22981 acgaaaagca cttcaacgtc tacggcttgt taccttccca cgaaacgctc gatatggcgt 23041 accaaaacat tcaaatatca aagtcttttg ttcaccacaa gtctgtgatg gttacttcag 23101 caatgtctgg agaagggaag tcaactttgt cgctgggact tgcagtcagt gctgcccgta 23161 tgcatcaacg ggttttatta attgatgcca acctgcgaca gccaaatctg cacaaaattt 23221 tgggattaac caatgattgg ggtttatctt 
tgttgttagt cgaggaaaga aattcctcag 23281 ttaaagaata tatccagcct attcatcctt ccattgatgt tttgactgct ggaccgactc 23341 ctgaggatac agtaaagtta ttaagttctg cacggatgaa agaattgctt caatttttcg 23401 agcaaactta tgacgtagtc ctcatagata cttctcctat tctcggcaca gttgacgcta 23461 caatcttggc atctctttgc aatgggattg tcatggtagg gcggatggaa caagtaatct 23521 gcaaaaaact cattcaagct acagaaattt tgagtaactt gaatctgatt gggattattg 23581 ctaatgaaac gaaatgttcc tcataagctc cagcccatat ctgaaaattt ttattgtata 23641 aataccacta tgatcgcaaa agctgaaagt ctttgtcctt tacagttatc cattcagatg 23701 accaaactca ttgaatacga acaagccagt gacgaagtgc gtgtcgtcta tgacgacatc 23761 cgcgccactc gtaagactga ttacatcaac aatttttgga aagcgatcgc caaccatccc 23821 cccactctac ggcgaacctg ggagacagtc aaagaagtga tgactagtcc cggcgaactc 23881 gatccgttga tgcgcgaact gatttacatt gctgtgagtg tgacgaataa ttgtgagtac 23941 tgcatcgcct cgcacacagc aggggcgtat gcaaaaggca tgacggatgt catgttcggc 24001 gaacttatgg ggattattgc gacagcaaat acaaccaatc gccttgcaaa tggttaccaa 24061 ataccagtgg atgagcaatt caagacgtag ggcttatctt cactctcatg ctatctcttg 24121 acagcaatga ttctttcaat cgctgagaat cattgttata aataacaaca aaatgatcag 24181 tcttcagtaa aagggaaaag gctaaaagtc aaaagattta tatttccttt tttctttttc 24241 cttttcctta ctttacacct gatgcttact taatattagc tctagcatcc ctgactgcag 24301 caacaaaaaa taatcctgtc agtggaatac ttattgccag gataattgca gttatctcag 24361 tacccaagtc tggttgtcca gaactgagtt caaacattga accgactgcg gctatagcgg 24421 taatacaaga accagcgaga aataaaccgc tttttggagt cattatattc actagcgaac 24481 caccctatct attggtttta cagcctgatg tgaaaaccca tgcttagata gctcttgcgt 24541 caaagctaaa gagttttcta cacgatctac aaataccacg ccgttgaggt gatccatctc 24601 atgctgaatg catcgtccaa gtaaatctcc agcttttaat gtccggggac gtccatgttc 24661 atctttataa gaaatttcta taacttgggg acgcttcaca tccatgtaaa cccctggaat 24721 actcaaacat ccttcttgcg ccatacaaat ttcacggcta gcctttgtga tagtggggtt 24781 aattaaaacc aatggtgggt tagctgggtt atctggttcg cagtcgatga caatgagttg 24841 tttgttcact cccacttgag gcgcagccaa accaatgcca tctttgctgt acatagtttg 24901 tagcatttcg 
cgtataactt gacgaatttc ctcatcaact ttagatatgc gcttagttgc 24961 ttgacggaga acgcgatcgc ccaagtaatg aagttctaat ggcggatttt gtaacttttt 25021 cttttcaaca gcgatttcta taggcatggt cttgtaactc agttgttgac ttaggtttta 25081 atatattttg agttctttaa tcttaacaat ttagttccct agctgaattg ggtaaaacac 25141 accattaagg gatgtaatgc ccatatctgc tctcaagttt cttttagtat aagctgggcg 25201 atacgcatgc aggcgtctgc tttcccttgc aatcgcacgc tcggggtgtc tcccaccggt 25261 acaaaaacta tttactcgta tataaagaca tacatcaggg tgtatgagtc caatccattg 25321 agtgactttc acttatgtca agtaacaaat tatttgttac ctgacagcga tatattatat 25381 aagataagtc aatcaacaaa agttaacaaa agtggaacgc ccaatcaaga aatctgagcg 25441 tcagtcttct gaaggcactg acaccgagtc agaaaatttg gattctatgc cgactgccca 25501 acccagtcgc aaaaactcaa aacggagtga tgaccgctct gagggtaggg gcaaaaaagc 25561 atcctttggg gatgaatcca ggcaaccggt taatccggct ctggcgcgtg gtcccaaacc 25621 cgttaagccg aaagctaata tcaaaactga gccggatacc gaattagaac ctatttctga 25681 ggagtctcaa gattaaaaag tagaattatt aggatagttc aaataatggc agttgaatag 25741 cttgggatgt ttggacatta tcaagatttg atagtgaaat gcctagtagc cgaatagtgc 25801 gatcgcttag ttaaattgct tcaaatagtt cctttgcgac ataagcgata agccagaggc 25861 ttgacgctac gcgtatcgca ctcaactgcc gaacttggga caagtagtcg cacgctactg 25921 agagggttgt taggtttgat gtttattttt tggtcgagta acttaataat gcttttcaca 25981 ctaccttggc aaaaaatctt gttgagatag atagaacttc ctgaccatcg cttcaagttg 26041 gtggttattc accgccagaa tcatgaataa taagaaaatg tactttttct aaggtataat 26101 gtctgattca taactaaaaa attataatta tgcgatcgca atgggaatct ttgctgaaaa 26161 acctaggtga atggcaaggt tcatttaccc gtatctcgcc tcaaggtgaa attctagaag 26221 atgttcccag cgtagtttct ttgcaagggt taaacaacaa ccaaacaatc cgccaaacaa 26281 tccgcttggc ggaaaatgaa aaggcttttg aatacnnnnn nnnnncttta ggacgtagcg 26341 tgctgttttt tgaaaatggg gcgttttctc aaggttcaac acaacttggt ccattttccg 26401 aatttggggc agaactgggt ttaatttggg aaaatcgccg cttacgtttg gttcagttat 26461 ttgacaaaaa ccttcaacta gacaaactca cccttattag agaacatctg gctggaactc 26521 aaccagtaca acgtccacct ttaaaagtag atgacctttt gggagaatgg cagggtgaag 
26581 cagtgacaat atatccagat tggcgttctc cccaaagtta ctccaccaaa atgcaactac 26641 aacttgatgg tgctgggcga ttaacgcaaa gcttaacttt tggcgaacgc acaatcactt 26701 caacagcgac tgtcaaagac tctatcatcc acttcaacca agatccgcac catcaaatac 26761 aagtcctact tctacctgat ggtgcttctg caacctctcc actcaagctg caattacgtc 26821 aaccactttt tcttgaagtc ggttggctta tccaaccaga cttacgccaa cggatgattc 26881 gcagctacga cgacaaaggc gaatgggtta ctctgacttt ggtaacagaa cgtcgagtca 26941 acaactaggg gtataggggt atagggg // LOCUS NODE_1096_length_26867_cov_4.86267326867 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 26867) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 26867) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..26867 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 992..2641 /locus_tag="DP116_09580" CDS 992..2641 /locus_tag="DP116_09580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013193147.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_09580" /translation="MYLYYRLRLIPSQLVTVTVVLTLLLFLPQTWITWEAYYNFNNII KNEFQLQIISDKIIYIDEVLTMSASMNAATNNPDWEQRYHEFEPQLDVAIKEFINLVP KTCKKEDIKKVYAINQRLVAIEYQSFDLVRKGQKEAAQRLFSSPEYKTQKRFYADSVA KRNRTISLQLRQKVAEYRNKLFWSIFASILSLVMLIPAWLLVLRLLQEYLKSRKLAQA ALEKTNQELEIRVATRTKELRRKNIQLQQTLQKLQQTQVQLIQTEKMSGLGQMMAGIA HEINNPITFVAGNLVYAEEYTQNLLRVVELYQQGYPNLSQVVQAEIDSMDLDYLKQDF TQLLKSMKMGTERIQEIVKSLQTFSRLDEAAIKTVDIHEGIDSTLMILQHRLKATDKH PEISVIKEYGLLPLIECNPGLLNQVFMNILANAIEALDEYNSQDTSDEVKANPSYIRI RTEVISKNWIAIRIMDNGPGIPEKICSKLFDPFFTTKPVGKGTGLGLSISYQIVVDKH GGKLYCHSVPGQGAEFFIEIPISRLCANGNKGLAIKKMSNL" gene complement(3065..4543) /locus_tag="DP116_09585" CDS complement(3065..4543) /locus_tag="DP116_09585" /EC_number="3.4.11.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="leucyl aminopeptidase" /protein_id="PRJNA477356:DP116_09585" /translation="MEIITTNTPLLEWTGDGLAIGLFEDAVELTGELATLDEKLAGSI KELIAEEEFKGKQNTSIVTRVGTGSPVRKIILVGLGKPEAFKLETLRRSAASVARLAK KQKCKTLGISLPIWNNDPTQTAQALAEGVQLALYQDNRFKSEPEDKGPQVEKVDLLGL DGQEAAISRANQIASGVFLARQLVAAPANSVTPITMADTAAAIASEHGLQIEILEQED CEKLGMGAFLGVAKASDLPPKFIHLTYKPEGTPKRKLAIIGKGLTFDSGGLNIKGAGS GIETMKIDMGGAAATLGAAKAIGQLKPDVEVHFISAVTENMISGRAMHPGDILTASNG KTIEVNNTDAEGRLTLADALVFAEKLGVDAIIDLATLTGACVVALGEDIAGLFSPDDA LAGELEKASQISGEKIWRLPMEEKYFEGLKSGIADMKNTGPRSGGSITAALFLKQFVK ETPWAHLDVAGPVWAEKENGYNGSGATGFGVRTLVSWVLGDS" gene 4899..5363 /locus_tag="DP116_09590" CDS 4899..5363 /locus_tag="DP116_09590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015201009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dTDP-4-dehydrorhamnose 3,5-epimerase" /protein_id="PRJNA477356:DP116_09590" /translation="MSKFRGIEIRRVESSKGGMVEFFTAQASHETMLVQIPPNTIDDL FVHKTHTDQLLVVKGQFVLVTLLDKQYQYLPFSEDHSVVVTIPPGVLHGAINLSSEPC VLVNAVLRHKPPQARDYIPHKRPFPYDLEAAQAALKNLEIANQVKNYGATPV" assembly_gap 5463..5472 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(5526..6989) /locus_tag="DP116_09595" CDS complement(5526..6989) /locus_tag="DP116_09595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008181076.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1802 domain-containing protein" /protein_id="PRJNA477356:DP116_09595" /translation="MSHSVVISIALCLPASDIEALLQGRIIAAISNKFIAQGREFALC PTDGLMNVLPVERYYHSSFLPIAQTAFSQFGTETVSIKASARCELCQIINSAESLAAL SSLTIWTQEALQEILTQRQNIFLAYLRVYYLPKAIEVPVKKNSQFVALPEGVTVSQAN PVLNDRLFTIRKHQLETRQPPPHPELENLQSAIAFLTITNPAAKQLDQDIKAFLGWTT EELIQQSDPDLAWINDIAALGDRSIEQDKGKSNYQAGTDFENIVRKSLEFLGFTVDYF HKGGAGGVDVFCSKPYPLVAECKAGKKIPNPTAVQLLNLGTLRLQSQELFHQAAKLII GPGEPTTQLKNAATIQGMSIINPQTLQNLVKLQHNYRGSVDLLKLKEYLKPGYSDNEV DKYIQQVYKAIQLRSHLVQLVKKHQDNTGDKNVEVATLFGAYGYSNPPQPLKIEEIYE ILVELSSPLTGYVGRIKGEDWRRDRFYFLRDLPTPQN" gene complement(7065..8879) /locus_tag="DP116_09600" /pseudo CDS complement(7065..8879) /locus_tag="DP116_09600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410279.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(8951..9628) /locus_tag="DP116_09605" CDS complement(8951..9628) /locus_tag="DP116_09605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998438.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09605" /translation="MTSKSFSKIEISAKTRLLLALWDLGGTKQEVKKGELSKRIVTKD KKVADYEGIFKELEKKGAIAISKTKKSVYLISISPLGLEVLSEGLKSPEFRFEGNIVG TWVANALLKWISQMNGAVAATASTNGVKSGIKSYDEFKQVTLEVYDQLNRDYNLDDLV PIYRIRREIGERVSREHFNEWMLEIQANDILQLQGGSLPDNDPAKLEDSITTEVSGLR CYAKRLT" gene 10492..11310 /locus_tag="DP116_09610" CDS 10492..11310 /locus_tag="DP116_09610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019489298.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbon-nitrogen hydrolase family protein" /protein_id="PRJNA477356:DP116_09610" /translation="MKSYLAAAIQMTSVSNLQKNLAQAEEFIDLAVRQGAELIGLPEN FAFMGEENEKMAQAGAIAQETEKFLKTMAQRFQVTILGGGFPVPVDETSTRVYNTALL IDPNGQELARYQKVHLFDVNVPDGNTYRESTTVMAGKELPAVHYSRELGNIALSVCYD VRFPELYRYMAHKGADIIFVPAAFTAFTGKDHWQVLLQARAIENTSYVIAPAQTGINY ARRQTHGHAMIIDPWGVILADAGEQPGVAIAEINPARLEQVRRQMPSLQHRVFV" gene complement(11400..12209) /locus_tag="DP116_09615" CDS complement(11400..12209) /locus_tag="DP116_09615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876106.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TPM domain-containing protein" /protein_id="PRJNA477356:DP116_09615" /translation="MNLLNQSHVFLCLLFLAKRTFFSACLCLTLAFFPLPCYALTVQN VPNPQQFYGRWVMDLAHILSPNTEIRLNQIISKLERQNGDEIAVVTVPETSPARTPKA FTTSLFNYWGIAKSSQNNGVLFLVSVNEHRVEIEVGYGVKNILSNSLVNNIIQQEIIP QFKQGNYEDGILAGTQSLVMRLSTHLSDAKVLTPENLIPLALFLLLFIAVVAFCVKFI LPPNMLSELRLEIDNTELLVSKIKFEGYDTGGGDGSDSGGSSGGDGGGSSW" gene 12865..14709 /locus_tag="DP116_09620" CDS 12865..14709 /locus_tag="DP116_09620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316473.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="polysaccharide deacetylase" /protein_id="PRJNA477356:DP116_09620" /translation="MKLIPNNPILNYIFFNFRSRSIFHTLICGLLTVSVFLPQLAVAR RRPVSKPAEAENTPTTTEACSNNSNNDISLNNQVSRVASNLVLASTWVTKPNWGLENI VEAVGPYVYAFLNRTSWPNINQAAKEARVPILMYHDIIPQKQVFFDVTPEELEQHFQT LKDNGMTPISLDQLMTHLQTGMPLPEKPVVLTFDDGYGGHYQYVYPLLKKYGYPAVFS IYTNGVGNNTGRTHVSWEQLKEMAANPLVTIASHSVSHPPDLTVFPPKQIQIEVVESK AILEAKLGIPIRYFTYPAGKYNEQVASSVQAAGYDLALTMSDVDERFAGASDSLLAVS RFGQSKLQDVIKQAWGGAKLPSWKTGFDFASPVQRTDMTIDKIPLILVSGGKPITIHA DSRYTVKEILARSNTNAVAAVDGGFFSLKFLNSNVMIGPVYSQVTKQFIPGSTWDIQK IAARPLVLISPHEVRFIPFDPLKHNTLEGIQAEMPEVTDAFVAAAWLVKDGQPREPST FNGLYGFDVARYRAFWGINKKKQATVGVSLESVDSISLGKALVKAGLQDVVMVDSGQS TSLVYQGESLVRYEPRPVPHAVALLGSPSTTNTPPCVLVENKTKQRRS" gene complement(15011..16090) /gene="fba" /locus_tag="DP116_09625" CDS complement(15011..16090) /gene="fba" /locus_tag="DP116_09625" /EC_number="4.1.2.13" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_681166.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fructose-bisphosphate aldolase class II" /protein_id="PRJNA477356:DP116_09625" /translation="MALVPLRLLLDHAAENGYGIPAFNVNNLEQIQAIVQAAQETDSP VILQASRGARKYAGENFLRHLILAAVETYPHIPITMHQDHGNEPATCYSAIKNGFTSV MMDGSLEADAKTPASYEYNVDTTREVVKVAHSLGVSVEGELGCLGSLETGTGEAEDGH GAEGVLSHDQLLTDPDQAVDFVEQTQVDALAVAIGTSHGAYKFTRKPTGEILAISRIE EIHRRLPNTHLVMHGSSSVPEDLLALINQYGGAIPETYGVPVEEIQKGIKCGVRKVNI DTDNRLAITAAVREALAANPKEFDPRHFLKPSIKYMQKVCVDRYQQFGTAGNGSKIKQ ISLEDFAAKYAKGELNAVIKKTATV" gene complement(16357..16428) /locus_tag="DP116_09630" tRNA complement(16357..16428) /locus_tag="DP116_09630" /product="tRNA-Lys" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(16394..16396),aa:Lys,seq:ttt) gene complement(16494..17330) /locus_tag="DP116_09635" CDS complement(16494..17330) /locus_tag="DP116_09635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316475.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldose epimerase" /protein_id="PRJNA477356:DP116_09635" /translation="MSKLHIQTNQSQLEIVPERGGIITSWRVQNQEILYLDHERFANP QLSVRGGIPILFPICGNLPNNTYTLNNKQYTLKQHGFARDLPWEFTEDHITPDTASLT LVLSSNGQTRAVYPFDFKLAFTYQIEGNTLEIKQKYTNLSSEKMPFSSGFHPYFLTAD KNQLKFEIPSQEYQDQITKETHSFNGDFDYNRDEIDVAFKQLSGQSATVTDHARKLKL TLEYDDTYSTLVFWTVKGKDYYCLEPWSAARNALNTGEHLSVLEPEATHTATIRLTAN FF" gene complement(17434..18471) /gene="pdhA" /locus_tag="DP116_09640" CDS complement(17434..18471) /gene="pdhA" /locus_tag="DP116_09640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019489774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyruvate dehydrogenase (acetyl-transferring) E1 component subunit alpha" /protein_id="PRJNA477356:DP116_09640" /translation="MVQERTLPKFDAATVQITKEEGLLLYEDMVLGRTFEDKCAEMYY RGKMFGFVHLYNGQEAVSSGVIKGAMRPGEDFVSSTYRDHVHALSAGVPANEVMAELF GKATGCSKGRGGSMHMFSSEHRMLGGYAFVAEGIPVAAGAAFQTKYRREVLGDPKADQ VSACFFGDGAANNGQFFETLNMAALWKLPIIFVVENNKWAIGMAHERATSQPEIYRKA SVFNMAGVEVDGMDILAVRAVAQEAVARARAGNGPTLIEALTYRFRGHSLADPDELRT KAEKEFWFARDPIKKLAAYLVEQNLASGEELKAIDRQIQQEIDEAVKFAESSPEPNAS ELYRFVFAEDE" gene 19195..21531 /locus_tag="DP116_09645" CDS 19195..21531 /locus_tag="DP116_09645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653142.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_09645" /translation="MRIPLDYYRILGIPMAASEEQLRQAYGDRIVQLPRREYSQAAIT SRKNLIEEAYVVLSNPKERKTYDQLYLAHAYGSDSTEGGAVAVESRKQGHIGELDSQA LSIDISQDEFVGALLILQELGEYELVLKLGRPYLVNRNGISSIQSDREDIEEVPFSSE RPDVVLTVSLACLELGREAWQQAHYENAAISLETGEEFLAHEGLFPSVRSEIQSDLYR LRPYRILELLALPEEQTTERKQGLQLLHDILDERGGIDGTGNDKSGLSTDDFLRFIQQ LRNYLTCTEQHKLFEVESKRPSAVATYLAVYALIARGFAQRQPALIRQAKQMLLRLGR RQDVHLEQSLCALLLGQTEEASRALELSQEYEALAFIRESSKDSPDLLPGLCLYEERW LENEVFPNFRDLVDKPASLKDYFANQQVQAYLEALSTEAEPIEPGTGINRQSFQTQQI TRESRRNNSANNNSKVVEGKFPTQKTPHSETPAKSTFSSSSPSHTTTSSSSNTWSSQA ETPVAPVHRIDATPKGTYYNSNRPSKHPTAPPQRQTQRKRKRSPSANSGRGSGNLLGS AYRQRIFARISPAQMRLVRIVSILLGSLLVLWLLIAATFGLLKNLFFPGPSLKGEQLS VELNQPLVPIPNQNSKPQLPAGTLTEATAEEIIQAWLDTKAAAFGPNHEIDRLKYILV GSTLTKWQRWAQQEKTDNQHRKYEHSMKVESLEANTTDQHHAAVEASVSEATQYYTNG QMKKSDKEKLRVRYELVRVEGSWRIRDMSVLNKISMIF" gene complement(21614..22030) /locus_tag="DP116_09650" CDS complement(21614..22030) /locus_tag="DP116_09650" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09650" /translation="MTKVFLGKIVKIKTSAVFDVNGLVTKDLDLTVNQSLMRVSLRRE LVNPIAFWRGVSHRVGILFQGKCLSYSKLNPFTKDQNPQRKYEDQFLSEIEMYQDFRE KITTAYAGNKKFLGHIPHLVLSQSVISQFWEYLSFF" gene 22422..23675 /locus_tag="DP116_09655" CDS 22422..23675 /locus_tag="DP116_09655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653141.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Rho termination protein" /protein_id="PRJNA477356:DP116_09655" /translation="MAKERPPLEEMTLRQLRKVASEFNISRYSRMRKSQLLESVQEAQ RSKVSLTQSRSMEAQETVEAAKFELGQVDRTGGTLADVDEGLADLPEGYGESRIVLMP RDPQWAYTYWDVSNEHKEELRRLGGQQLALRIYDVTDINLEYQSPHSIQEYPSDELAR EWYLPVPVSDRDYVIDIGYRCADGRWLVLARSAPVHVPPVYPSDWIEDVFITVNFDED LRDKTVYELVPPAKKVAPTAPAATAARGNAIYDQIFGLAESAEAMRVAGSVFGSMQHV PGSVIPEQAISSYVFPSGVGMWAVPTTSGLTMSGVGMSGAGFSGDVPMRPRKFWLIAD AELIVYGATEPDATVTIGGRPIKLNPDGTFRFQMSFQDGLIDYPIMAVAADGEQTRSI HMKFNRETPSRNTNTKEEAVLEWLS" gene complement(24058..>26867) /locus_tag="DP116_09660" CDS complement(24058..>26867) /locus_tag="DP116_09660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872535.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="translation initiation factor IF-2" /protein_id="PRJNA477356:DP116_09660" /translation="LEQPAVTDADKDIVPNTNQPVEKVAAQKPEKAPAPRAKPERQPK PQLAAPPTSPAAEKPNSPTADAQPTVSEKPVIILKRDRDQKRDAPEGIQSPAGDGAES LPQKSDAPEGRIIRGERDQVKQQRVNKAATGETPQAALPQKSEKPVRSTTPSPVKPEQ RGNKPSAPVQVGESQRPSRPIVRSGEQPGPAVPVATPPKPMRVSLTKPQAGRQDEDDA PTVTDEVIELKRPTPPRQAKGGKKWQEEEIEEIKESAKPAKGAVKGKRAKPIVDEFED EDLLDEEGLEIPATIQVSLSIARPPKPKASKPTQTAVATAATPVAKARKPASSREQNR RQETEQKQDRPEILEVTGPMTVQELSEALVVADTEIVKILFLKGMAVSITQNLDIPTI TLVANELEIPVETVEPEAEARKVTEMIDVADLEHLLRRPPVVTIMGHVDHGKTTLLDS IRKTKVASGEAGGITQHIGAYHVDVEHDGKEQQVVFLDTPGHEAFTAMRARGARVTDI AILVVAADDGVRPQTVEAISHAKAAEVPIIVAINKIDKPEAQPDRVKQELTEYGLVSE EWGGDTIMVPVSAIRGENLDTLLEMILLVAEIEELSANPDRLAKGTVIEAHLDKAKGP VATLLIQNGSLRVGNLLVAGSAFGKVRAMVDDRGKRVDAATPSFAVEVLGLSEVPAAG DEFDVFDNEKQARSIAAERAEKLRQSRLMQGRVTLTSISAQAQEGELKELNLILKGDV QGSVEAIVGSLRQIPQNEVQIRLLLSSAGEITQTDIDLAAASGAVIVGFNTTYASGAR QAADEAGVDVREYNIIYKLIEDIQGALEGLLEPELVEEHLGQAEVRAVFPVGRGSVAG CYVQSGKLIRNCKVRVRRNGKVINEAPLDSLKRMKEDVREVNAGYECGVGIDRYNDWA EGDIIEAFQMVTKRRTLSSTR" BASE COUNT 7395 a 5895 c 5603 g 7964 t 10 others ORIGIN 1 aggcatgtgg cgtccctggg ctaaagccac agggttttca tctcacccac tataatatca 61 agtttgggtg atgacttttg agagtctcct ggctttttat aggaattttt ctgtcactca 121 ttgccacaaa aactcaggat attctgtaat ctacgtcata ccaagttgcg ttttgttgta 181 gtagtttgtg attagggaag attgctgaat ttctgatacc atttcatgaa attgctgata 241 caaatctttt ctcctctgct cctgaagctc ctctgctcct ctgcggtcta attgtatcaa 301 gcttaaagtg aaacggtatg agtccttacg gacacgcgat cgtcgcaacg cgacacgctc 361 ttgcaacgcg tatggaacag cggtacgctt tgcgattacg ggataccact agcttatacc 421 tgtggcacgc tttgcgcata cgcaattgac ttgtgctatc cgggaattgg gacttggtat 481 tagctgcaag ttttcaagca gatatagctt agattaaact aaaacaagca tattcagtgg 541 tttttggttt agacaacaca gagattcctt ttctggtttt gttactacaa aatcagaaaa 601 aatccttgcc atcacctaaa aaaattagtc ttcgctgtca tgatgatgtt cagatacttt 661 gaagtagagg tatttcacct taagatgaag cctttgtgat tagatttaga atggttcacg 721 attcacaaaa 
agtcatctac atcgaataat atatattttt acaaactgac ttgacttgtg 781 atcctaaaat tgatttcatc gtcattgaaa cttttttatt gagtcttatc ttataggagt 841 aattctcatt aagcccaaac aagcaaattt ttcatgataa ttattctata ggttttttcg 901 gaacgttatt taataggaac atggagcaag cttattgttt ttatttttat ttcttttttt 961 cttgtaaaac attgatataa aatttaaata aatgtactta tactatcgat tgagattgat 1021 tccatctcaa ctagtaactg taacagtggt actgactttg ttattatttt tacctcaaac 1081 ttggatcact tgggaagcat actacaactt caacaacatt ataaaaaatg aatttcaact 1141 acaaataatt agtgataaaa ttatctatat tgatgaagtt ttaacaatgt cagcaagcat 1201 gaatgctgct acaaataatc cagattggga acaacgctat catgaatttg aacctcaact 1261 agatgttgca attaaagaat ttattaacct ggttccaaaa acatgtaaaa aggaagatat 1321 caaaaaagtt tacgcaatca atcagcgctt ggtagcaata gaatatcaat cttttgattt 1381 agtcagaaaa ggtcaaaaag aagcagcaca aagactcttt tctagtcctg aatataaaac 1441 tcagaaacgc ttctacgccg atagtgtagc gaaaagaaat cgtactatct cgcttcagtt 1501 gcgtcaaaaa gttgctgaat accgcaacaa actattttgg tcaatttttg catcaatttt 1561 aagtttagtc atgctcattc cagcatggct tttggtgttg cgcttgttac aagaatactt 1621 aaaaagtcga aaacttgccc aagcagcctt agaaaaaact aatcaagaat tggaaattcg 1681 agttgcgaca aggacaaagg aattaagacg gaaaaatatc caactacaac aaacactaca 1741 aaaactgcaa caaactcaag tacaactgat tcaaactgaa aaaatgtctg gtttgggtca 1801 gatgatggct ggtattgccc atgaaattaa caatcccatc acttttgttg ctggcaatct 1861 tgtttacgct gaggaataca cccaaaactt actaagagtg gtggaactct atcagcaggg 1921 ctaccccaat ctttcacaag tcgttcaagc tgaaatcgat tctatggatt tggattactt 1981 gaaacaagat ttcactcaac tcctcaaatc tatgaaaatg ggaacagaaa ggattcagga 2041 aatcgttaaa tctcttcaga cattttctcg cttagatgag gctgctataa agacagttga 2101 tattcatgaa ggtattgata gtacattaat gattttgcag caccgcctga aggcaactga 2161 taaacatcca gaaatctctg tgattaaaga gtatggtttg ttgccgttaa tcgagtgcaa 2221 ccctggttta ctgaatcagg tatttatgaa tatccttgcc aatgcaattg aagctttgga 2281 tgagtacaac tcacaggaca catctgatga agtcaaagct aaccccagtt atattcgcat 2341 tcgcactgag gtgatttcaa aaaactggat cgcaatccga atcatggaca atggccctgg 2401 tattccagaa aaaatttgtt 
caaagctatt tgaccctttc tttactacaa aacctgtggg 2461 taaaggtaca ggacttgggc tgtctatcag ttaccagatt gtagtagaca agcatggtgg 2521 gaagttatat tgtcattcag tacctggaca aggtgcagaa ttttttattg aaattccaat 2581 tagtcgtcta tgtgcgaatg gcaataaggg acttgcaatt aaaaaaatgt ccaatttgta 2641 gtgatgcaca aaacaatggt gggctacggc gcactttact catcatgtaa catctcaact 2701 attgtgcgcc taacccaccc tacctgatgg ataatttatt ttttggtagt ccctaaagca 2761 gtgaactact tatactccgt tctacaacac ttggacaaac ctagacaaag ttgaaccttg 2821 ttagacagtg cccggattgt ttcctgcttc aacctggcag tgtcggcact aaccagtcca 2881 caggcttaaa ttcggataga acttatccta tgtttcgctc ttgtttcgcg ttagaggaat 2941 aatagcacat ttgaatgttg gacaaaaatg ttaattaatg tctagctttg tccaacaatt 3001 tagttacagt ttctagcccc tgatgggtga gttagtcgct tcccttcatg tccttatcgc 3061 tgtatcagct atcgcctagc acccaactga ctaaggtacg aacgccaaag ccagtcgcac 3121 cggaaccgtt gtaaccattt tctttttctg cccaaacagg acccgcaaca tctaggtgcg 3181 cccaaggagt ttctttgaca aactgcttga ggaacaatgc agcggtgatt gaaccaccag 3241 aacgcggtcc tgtattcttc atgtccgcga tgccagactt gagtccttca aagtattttt 3301 cttccattgg aagtcgccaa atcttttccc cagaaatttg ggaagctttc tctaactctc 3361 cagctaaagc atcatcagga gagaacaaac ctgcaatatc ctctcccaaa gcaaccacac 3421 aagcacccgt gagagtagct aaatcaatga ttgcatcaac tcctagcttc tcagcaaata 3481 ccaaggcatc tgccaaggtc aaacgtcctt ctgcatcggt gttatttact tctatggttt 3541 tgccattaga tgcagtcaga atgtcgcctg gatgcattgc acgaccgctg atcatatttt 3601 ctgtgacagc ggagatgaag tgaacttcaa catctggttt aagttgacca atcgctttgg 3661 ctgcacctaa agtagccgca gcaccaccca tgtcaatttt catggtttca atgccactac 3721 cagctccttt gatgttaagt ccaccagagt cgaaagttaa acctttaccg ataattgcta 3781 gcttgcgttt gggcgttcct tctggtttgt aagtcaggtg aataaacttc ggtggtagat 3841 cagaagcttt tgcaactccc aaaaacgctc ccatgcccaa cttttcacag tcttcctgtt 3901 ccaagatttc tatttgtaaa ccgtgttcgg aagcaatggc ggcggcggtg tctgccatag 3961 taattggtgt cacagagttg gctggggctg cgaccaactg gcgtgctaaa aatacaccag 4021 aggcaatttg attggcgcgg ctaattgctg cttcctgtcc atccagtcct agcaaatcga 4081 ctttttcaac ttgtggtcct ttatcctctg 
gttcagattt aaagcgatta tcctggtata 4141 gtgctaattg aactccttca gcgagtgctt gtgctgtttg tgttgggtcg ttgttccaaa 4201 tcggtaagct gattcctaga gttttgcact tctgcttttt agctaaccgg gcaacgctgg 4261 ctgcgcttcg ccgcaaggtt tctagtttga aagcctcggg tttgcctaaa cctacgagta 4321 taattttacg aactgggcta ccagtaccta ctcgtgtgac aatgctggtg ttttgcttac 4381 ccttaaattc ctcttccgct atcagttctt ttatactacc agccaacttt tcatctaaag 4441 ttgccagttc gccagttaac tctacagcat cttcaaataa tccaattgct aaaccgtccc 4501 ccgtccactc tagaaggggg gtattagtgg taataatttc catactcttg tcttctgtgt 4561 aaaacttctt gtaccagtat tgcgcaaaat atgcgcttta tgcttcaaaa gagcgtatca 4621 gagttggtga ttaaatcagt gaacgcttaa cagggaacag tgaacagtga acagtaaaca 4681 gtgaacagta aacagtaaac aaggggtgga cgagtccgtt tcctacctga taactggtaa 4741 ctgataactg ataactgtta actgatgact gataactgat gtagatccca tatcatattt 4801 tgggcgattt tgttgccgaa gtaacaattt cttataatag agcatctaag attgaaggta 4861 aagcttcaat ttttacgatg ctcttttgga atactgatat gagtaaattc aggggaattg 4921 aaatccgtag ggtggaatcc agtaagggag gaatggtaga gttctttaca gctcaagcca 4981 gtcatgaaac tatgttggta caaattccac ccaacacaat agatgattta tttgttcaca 5041 aaacacatac agaccaattg ctagtcgtaa aaggacagtt tgtccttgtg acattgctgg 5101 ataaacaata ccagtatctt ccttttagtg aagaccattc tgtagtcgtg acaattccgc 5161 cgggagtttt gcacggggcg attaatttaa gttcagaacc gtgtgttttg gtgaatgcag 5221 tgctgcgtca caaaccacct caagcgcggg attacatacc ccataaacga ccatttccct 5281 atgatttaga agcagcccaa gcggcgttga agaatttgga aatcgcaaat caagtcaaga 5341 attatggggc gactccagta taggttgttt gggttaagtt gcattgaggg tgggttaaat 5401 agctcacctt aatttttttg tttttctaaa ttttatacta actttgggat aataaatttt 5461 aannnnnnnn nnccgccaag acgccaagga cgcaaaggta agaaagaaag aaagaaagaa 5521 aagagctaat tttgaggagt tggtaagtcg cgtaggaagt aaaagcgatc gcgtctccaa 5581 tcttctcctt ttattcgtcc gacataacct gttaatggtg aggaaagttc aacaagaatt 5641 tcgtagattt cttctatttt tagtggctgt ggtggatttg agtaaccgta ggctccaaac 5701 agagttgcaa cctcaacgtt tttatcacct gtattatctt gatgtttttt tacaagctgc 5761 actaaatgcg atcgcaattg aattgctttg taaacttgct 
gaatatattt atcaacttca 5821 ttatcagaat agcctggttt tagatactct ttcagcttca ataagtctac agaaccacga 5881 tagttatgtt gaagttttac caaattctgt agtgtttgtg ggttaatgat tgacatacct 5941 tgtattgtgg ctgcattctt aagttgagtt gttggttctc caggaccaat tattaattta 6001 gctgcttggt gaaataattc ttgactttgt aggcgtaggg ttccgagatt gagtaattgt 6061 actgcggtgg ggtttggaat ttttttacct gctttacatt ccgcaactag gggatagggt 6121 tttgaacaaa atacatctac accaccagca ccacctttgt ggaaataatc aactgtaaat 6181 cctaagaact ctaggctttt acgaactatg ttttcaaagt cagttccagc ttggtagttg 6241 cttttgcctt tatcttgttc tatgctgcga tcgcctaatg ctgcaatgtc gtttatccaa 6301 gctaaatctg gatcgctttg ctggattagt tcctcagtcg tccaacctaa aaatgctttg 6361 atatcttgat ctaattgttt agcggctggg ttggtgatgg taaggaacgc aatagcactt 6421 tgcaagtttt ctaattctgg atgtggtggc ggttgacgtg tttctagttg gtgtttgcgg 6481 atggtaaaga ggcgatcgtt caacacgggg tttgcttgag aaactgtcac accttctggt 6541 aaagcaacga attgactgtt tttcttgaca ggaacttcaa ttgctttagg caggtaataa 6601 acgcgcaggt aagcgagaaa gatgttttgc cgttgcgtga gtatttcttg taaagcttct 6661 tgtgtccaaa ttgttagcga cgataaagca gctagtgatt cagcactatt gataatttgg 6721 cataattcgc accttgccga agctttaata gagacggttt cagtaccaaa ttgactaaaa 6781 gctgtttgag caataggtaa gaaactagag tgatagtacc gctcaactgg cagcacgttc 6841 attaatccat ctgttggaca aagtgcaaat tctcgccctt gagcaataaa tttattagat 6901 atagctgcaa tgattcgacc ttgcaacagt gcctcaatgt ctgatgcagg caagcataaa 6961 gcgatactaa tcacaacaga atgactcatt ctttattaaa aaatgtctat gcttacttct 7021 tctatattgc tgactactgg tgtatttacc acgccctgat tactttcttc atcaacagca 7081 tcttcaggta cttctccaga tggatcgctg agaatttcac ggataaggta attatctatt 7141 gctatccgct tcgagcgaat aaaatccaaa atctgctgtt cgcttaactc ataatcttca 7201 cgacctttca taacgaaatg cagtgctaaa agtggcttaa tgtcctctga ttttaacaat 7261 acccattcac ctcctaagtt ttttgacaaa agttcgctca aacactgttg tgcttgagca 7321 gcacctttat taatttgctt agagcgaact aaacagccgc gagttaaatc aaacttttta 7381 tagtcgatta agcgctttaa tgtagctcca actatcctac ctccagaatc ttgaaggaca 7441 gctacaccaa ttttcacagc tttaccattt tccttgccaa taactttaaa 
gtctatgaaa 7501 cctttgtcta cagctcttgc tttaacatct tttatttctt caatttgtac tttttctata 7561 gtttctccaa tcacagcaga gaagcctaac cgtagtgctt gagcaaggct tgctttatcc 7621 tccatataat cttgtatggt agtttctaaa gctgctagtt gttgattgta aacaggctct 7681 actgggtctt ttcgcttgac tgtttctata tttccatcat cctcaagagg tttaaagttc 7741 tctgcacacc attgcaaaac ttttctgaca attggtcttt ccttacccaa tgctctcaat 7801 ttatcttcat caaagggata aactcgatga ggaggagtta atcccttctc ttgataaaag 7861 tcttcaagcc atccagacac caatgcaaca acctcatcag aattaagatg ttttaaatca 7921 atcacttttt gaccaattct gtctacgact gatgcagcgt ttggtaatct tttaacttca 7981 tctctccaag aatctggaaa aatacatgtt agcaaaacac ctcgttttat tttgtcatat 8041 aaatctttag cgaagatagc agtaacttgt tgtgcagtgt atcccaattc attacaggta 8101 ttgttctcca tctcatcaaa acaaaccact attggtttgt aatcactaat caaatcgaga 8161 atttgccgca ccgtgttaaa ggattcagct tctttatcct ttgcactgat attagctaac 8221 cccattgcat cagcttttga ctgtggtaag tgatgaccag atagccatcg aatggcaaat 8281 acttcataag ctggatcagg agatagtgtc cataaaattg cagtcagaat atcgggattt 8341 tcaatatcag gctttatacc aagtactgta tctcggaaaa cttcaataat cttgggattt 8401 ttagctaagt ttcctcgaaa ttgattgact aactgctgag gggcatagtt ctttttgtaa 8461 gcctcattga ctaaagctgt tgctaactcc tgccattgac tcacaccttg actacctatt 8521 tgtttaaggc tgtttgccag agtgcttaaa aattctgttg taatacggtt taagtcattg 8581 cactgactca tgtaaacgaa taaagcacta ccgtcaatct gtaatctatg tcgaattcgg 8641 ctgatgacat gacttttacc taatcctctt tctgccgtaa tggtgatacc cacaacttgc 8701 cgttgtttat tacgaacttt ttctattgcc tcatatacag catcggaagc atgggcattt 8761 aacgatggaa catcaggaaa accttgtccc caaacattat gtgttgtaac tacaaagcgt 8821 ccctcaaatg gattgtggtt tttgatggca ttattaatta agtcgttggc gttggacatg 8881 gatgtatttg caatagtggt gtagtaggag ttagcgcggg aaatagcaaa aattaccaac 8941 aatcgatgga ttaagttaaa cgcttggcat agcaacgtag tccactgact tcagtagtaa 9001 tggagtcttc aagtttggct ggatcattat ctggtaagct accgccttgc agttgcagaa 9061 tgtcattagc ttgtatttcc agcatccact cgttgaaatg ttcacgactg acacgctcac 9121 ctatttctct tctgatcctg taaattggta ccaaatcatc taagttatag tcgcggttaa 
9181 gctggtcata gacttccaac gtcacttgct taaactcgtc ataggattta atcccactct 9241 tcaccccatt agttgatgca gttgcagcta ccgcaccatt catttgactt atccacttca 9301 gcagtgcatt agccacccaa gttccgacaa tatttccttc aaacctaaac tctggacttt 9361 tcaaaccctc actcagtact tctaaaccca gaggagatat tgatattaaa tagacgcttt 9421 ttttggtttt agagatggcg atcgccccct ttttctccaa ttccttaaag atgccctcat 9481 aatctgctac cttcttgtcc ttagtaacaa ttcgcttact gagttcacct ttcttcactt 9541 cctgctttgt tcctcccaaa tcccacaaag ccagaagaag acgagttttc gcgcttattt 9601 cgatttttga aaaacttttt gacgtcatat ttctttattc tatttgacaa aactagtata 9661 attgtgattc acagtcatat taaaatcaag caactacaat aaactgaaaa ctcctgtagt 9721 ctcgaaaaat tacgtaaaaa tacgggaaat tttcatgtga gtcttgtgaa gatgacttga 9781 ctgatatctc ccactgttct ttaagcatca aaagaacgca aattttccta aaatagaaat 9841 ttttctggaa aaactaagtt ttaccacctt ctcagtcaac atatattttg gtaatctaca 9901 tttcatccaa aataagcttc gttaacggct ttactcgtaa accagagtat aagtctggca 9961 ttgcctgcgc cagaaaaatc tggaatttta cggtgtagcc ctcccgcagc gcgcgctgcc 10021 tcacatgcac tttcacatta aaatggacta gaagccttgc tcagtaagta tttagtctga 10081 tgcgcgaaga ctttagttat tagcgatagg tttttaacct atggcggttg ttggcaggag 10141 tgcaaaatat gagttcaagc aattcggtga ctttgtcctt tgttaacgtt ccatcacaaa 10201 atacttgtta agggaaagtt aacttttctt agtttggtag cttaactgta acccaatacg 10261 gttgctgtta aaaattgatc ctccctagcc ctccttaaaa aggagggaac taagcccccc 10321 ttgggaaggg gagccactgc gttggggagc cagtacttga tgagggtctc cctcacttgg 10381 tatctggtga gaccagcgcg aatgacggcg ctggctcact gataactgat cactgataac 10441 tgttttaacc gtttccaatc tcacacaact ttattattcg ttgttgaatg tatgaagtct 10501 tatctagccg ccgcgattca aatgaccagt gtgtccaatt tacaaaaaaa cttggcacag 10561 gcagaggaat ttatcgattt agccgtgcgt caaggtgctg aattaattgg tttgccagaa 10621 aactttgcct ttatgggtga ggaaaatgaa aaaatggctc aagctggtgc gatcgcccaa 10681 gaaacagaaa agtttctcaa aacaatggcg cagcgttttc aagtcaccat tttgggcggc 10741 ggctttcctg ttcctgtaga tgaaaccagt actagagtct acaacactgc cttacttatt 10801 gaccctaatg gtcaagaact tgcacgttac caaaaggtac acctgtttga 
tgtcaacgtc 10861 ccagatggca acacttatcg agaatccaca acagttatgg caggtaagga attaccggct 10921 gtccactact cgcgagaact tggtaatata gcactctcgg tttgctacga tgtccgcttt 10981 cctgaacttt accgatacat ggctcacaaa ggtgcagata tcatctttgt gcctgctgcg 11041 tttactgcct ttactgggaa agaccattgg caagttcttt tgcaagcacg tgctattgag 11101 aatacttctt atgtgattgc cccagcacaa actggcatca actacgcccg tcgtcaaaca 11161 cacggacacg ccatgattat agacccttgg ggagtgattt tagcagatgc tggtgagcag 11221 ccaggagttg cgatcgccga aatcaaccca gccagactag aacaagttcg tcggcaaatg 11281 cctagcctac aacaccgtgt ctttgtgtaa ttttgcctaa tctaggtttt ctatttcctc 11341 tctagtgtat gcaccagagg gttagggatg gtggttgctg tagtcatctt tacctagagt 11401 taccagcttg agcctccacc atcaccacca ctgctaccac cggagtcgct accgtcgcca 11461 ccaccggtgt catatccctc aaacttgatt ttagaaacaa gcaactcggt attgtcaatc 11521 tcaaggcgaa gttcagacaa catattaggg ggtaagatga attttacgca gaaggcaact 11581 acggctatga atagcaaaag aaatagagca agaggtatca aattttcagg tgttaacact 11641 ttcgcgtcag atagatgtgt acttagcctc ataactagcg actgagtacc tgcgagaata 11701 ccatcttcat aattaccctg tttaaattga ggtattattt cttgttggat aatgttgttc 11761 accaaggaat ttgaaagtat attcttaaca ccgtagccaa cttcgatttc cacgcggtgt 11821 tcattaacag agactaaaaa caggactccg ttattctgac tgcttttagc aattccccag 11881 tagttgaata aacttgtagt aaacgctttg ggagtccgtg ctggggaagt ttctggaaca 11941 gtcacaacag cgatttcatc tccattttga cgctccaatt tagagatgat ttggttaagt 12001 cgaatttctg tattcgggct aagtatgtgt gctaaatcca ttacccatct accataaaat 12061 tgctgaggat ttggcacatt ttgcactgtg agggcgtaac agggtagtgg gaaaaaggct 12121 agagttagac acaaacaagc tgaaaagaat gtacgtttgg ctagaaacag caagcataaa 12181 aaaacatgac tttgatttag taaattcata ccaaatatag cggttctcgt ttggatgcag 12241 tacgcacatg atttcactcc taactcctcc ggtgcatgca ctggaagagg ggtaggcgaa 12301 ctttgacgtc agccactcag gtggggttcc tttgtgtagt gcaacgaagt gagaagcgcg 12361 atagttcagc cacaatcgga acctttacag atttatcagt ggtaaaccca aagtttacgc 12421 aattatggaa aaaccagtga tcagcgagat attctttctc tactcagagt gacacaagag 12481 caagcagtgg ttagctgaac aaaaaacaca 
ggaacttcac tagtatcaaa ttcagtctaa 12541 ccaacagaaa agtttcgata gcaacaaata ccaattttct gttaaattgc caatgtaaag 12601 cttaagcaat ttatccttca tctttatatt tttgtctctt atctttataa cgaataggga 12661 ataaagtgta agcgttactc caaccgaagc caatgcaccc gttatatcat gttcggatga 12721 acacttataa aaaacgaatc acctcacccc gccctctggg cacccctctc cttaataagg 12781 agaggggatg ggggtgaggt ctttttattg taagtaatca aacgaacttg atattacaca 12841 acccaagtgt tctaaaaatt aagtatgaag ctcattccta ataatcctat tctgaattac 12901 atatttttca atttccgttc ccgaagcata tttcataccc ttatttgtgg attacttaca 12961 gtctcagtct tcttaccgca actggctgtt gctcgtcgtc gtcctgtttc aaaacctgct 13021 gaggctgaga atactcccac aacaactgaa gcttgtagta acaacagcaa caatgatatt 13081 agcttaaata accaagtttc ccgtgtagct agcaatcttg ttttggcatc tacttgggtg 13141 actaagccta attggggatt agaaaatatt gtggaagctg ttggtcccta tgtctatgcc 13201 ttcctcaatc gtacttcctg gcccaacatt aaccaagcgg cgaaggaggc aagggtaccc 13261 attctcatgt atcacgacat tataccacaa aaacaagtct tttttgatgt caccccagaa 13321 gaattagaac aacatttcca aaccctgaaa gacaatggca tgactcccat tagtcttgac 13381 cagctgatga cgcatttgca aacgggaatg ccactcccag aaaagcctgt tgtcttgacg 13441 tttgatgatg gttacggggg acattatcag tatgtttatc cattgctcaa aaaatacggt 13501 taccccgctg tattttctat ttacaccaat ggtgtcggca acaacaccgg tcgaactcat 13561 gtcagctggg aacaactcaa ggagatggcg gctaatcctt tggtaaccat tgcatctcat 13621 agcgttagtc acccaccaga tttaacagtt ttcccaccga aacaaatcca aatagaagtt 13681 gttgagtcta aggcaatttt agaggcaaag ttaggaattc ctattcgcta cttcacctat 13741 cctgctggaa aatataacga gcaagttgca agttcggtgc aagcagccgg atatgaccta 13801 gcactgacga tgagtgatgt ggatgaacgc tttgctggtg catcagacag tttattggct 13861 gtttcccgct ttggtcaatc caagctacaa gatgtgatca agcaagcttg gggaggtgcc 13921 aaattaccaa gctggaaaac aggctttgac tttgcaagcc cagttcaaag aactgacatg 13981 acaatcgata aaattcctct gattctagtt tcgggtggta aaccgatcac tatccacgct 14041 gatagtcgtt atacggtcaa ggaaatttta gctagaagta atacaaatgc tgttgctgct 14101 gtagacgggg gtttcttctc cctgaaattc ttaaattcca atgtgatgat tggaccagta 14161 tacagccagg 
tgacaaaaca attcattcct ggtagcactt gggacatcca gaaaattgct 14221 gcacgtcctc tggtattgat tagtcctcat gaagtgcgct tcattccctt cgatccactc 14281 aagcacaaca ctttagaagg aatacaagct gaaatgcccg aggtgactga tgcttttgtg 14341 gcggcggctt ggttagtcaa agatggtcaa cctcgtgaac caagcacctt taacgggttg 14401 tatggttttg atgtagcacg ttatcgggct ttttggggca ttaacaagaa gaaacaagcc 14461 accgtcggcg tttctctaga atctgttgac tctatatctt tgggaaaggc gctggttaag 14521 gctgggttac aagatgtcgt aatggttgat tccggtcaaa gtacttcctt agtttatcaa 14581 ggagaatctc ttgtacgtta cgaacctcgt cctgtccccc atgcagttgc acttttagga 14641 tctccatcca cgacgaatac cccgccctgt gttttggtgg aaaataaaac aaaacagcgg 14701 agaagttaag tttgtaataa cccacattcc gcaccttgta ctgtcacaat cggttatttc 14761 gcaattttgg gaatatttca gtttttttat acatattcta ttgagatttg tcttcctgtt 14821 ccctgttccc tgttccctgt tccctgttcc ctaaaaatcc gtagcgtgac acttgagggt 14881 gcggaatgta aataactaaa ataaaaaacc ggggaaccaa aacattaccc ggttaagtct 14941 cagcagcgta tgaatcctca gtaatatagg gacacagcct tgccatgtcc ctatttattt 15001 cttctgcgac ttagacagtc gcagttttct taatcacagc gttgagttcg cccttagcat 15061 acttagcagc aaagtcttcc agtgaaatct gcttgatctt gctaccatta ccagcagtac 15121 caaattgctg atagcgatca acacaaacct tctgcatgta tttaatagaa ggcttgagga 15181 agtgacgggg gtcaaattct ttgggatttg cagccaaagc ttcacgcaca gcagcggtaa 15241 tagccaaacg gttgtcggtg tcaatattta ccttacgcac accgcacttg ataccttttt 15301 ggatttcttc cacaggtaca ccgtaggttt caggaatagc accaccatac tggttaatca 15361 gggcgagcaa atcttcgggt acagaggagg aaccgtgcat caccaagtga gtgttaggca 15421 aacggcggtg aatttcttca atgcggctga ttgccaagat ttccccagtt ggcttgcggg 15481 taaacttgta agcaccgtgg ctggtaccga tcgcaactgc caaagcgtct acttgggttt 15541 gttccacgaa gtctactgct tggtctggat cagtcagcag ttggtcgtgg gaaagaacgc 15601 cttcagcacc gtgaccatct tcagcttcac ctgtaccagt ttccagagaa cccaagcaac 15661 cgagttcgcc ttcaacgctg acacccagtg agtgagccac tttcactact tcgcgggtcg 15721 tgtcaacgtt gtactcgtag ctagcggggg tcttggcatc agcttccagc gaaccatcca 15781 tcatcacgct ggtaaagccg tttttaattg ctgaatagca ggtagcaggt tcattaccgt 
15841 gatcttggtg catggtaatg ggaatatggg ggtaggtttc taccgctgcc aaaatcaagt 15901 gacgcaggaa gttttcacca gcatacttac gagcgccacg agaagcttgt aaaatcacgg 15961 ggctatctgt ttcctgggca gcctgaacaa tagcttgaat ctgctccaag ttattaacgt 16021 tgaaagcagg gatgccgtaa ccgttttctg ctgcgtgatc caacagcagc cgcaggggta 16081 cgagcgccat agatagtcct cctaatatgg ttgtcagcta gtcggtgtga gacaagcgta 16141 tcggtgtacg ctattcttaa tattatttta agcttatagg aaattataac tactcttgtg 16201 tcctttctgt cgaaaaactt taaccaaagt aaggtggaca ctaccaattt tactctatca 16261 ttccaatcac ttagttttgt accttgagtt acgaacctaa atatggctgc tggtgattca 16321 atctttgaaa cttagtactc aaaactaact tttatatggg tcgcccggga ttcgaacccg 16381 gaactaatcg gttaaaagcc gagtactcta ccgttgagtt agcgacccgt tctttgtaat 16441 ttgcaaaacc taatccatca tagcataatt tcagagagat ttgtaaaggg gttttagaaa 16501 aaatttgccg ttaagcgtat agttgctgta tgggtagctt ccggttctaa cacactcagg 16561 tgttcacctg tgttgagagc attgcgagca gcgctccacg gttccaagca ataatagtct 16621 ttacctttaa ccgtccaaaa aacgagagta gaatatgtat cgtcgtattc cagcgtcaat 16681 ttcagcttgc gagcatgatc tgtgactgta gctgattgac cgctgagctg cttaaaagca 16741 acatcaattt catcacggtt atagtcaaaa tcaccgttga atgagtgagt ttccttggtg 16801 atttggtctt ggtactcttg tgagggtatt tcaaacttga gttgattttt atccgctgtt 16861 aaaaagtaag gatggaagcc tgaagaaaaa ggcatttttt cactggaaag attggtgtac 16921 ttctgtttta tttctaaggt attgccctct atttgataag taaaagcaag tttaaagtca 16981 aaaggataaa ctgcgcgtgt ttgcccgttg ctgcttaaga ccaaagtcag gctagctgtg 17041 tctggagtta tgtgatcttc tgtaaattcc caaggcaaat cacgggcaaa accgtgttgt 17101 ttgagagtgt actgcttgtt gttgagagtg taagtgttat taggtaagtt cccacagata 17161 ggaaacaaaa ttggaattcc acccctgaca ctcaactgag gattagcaaa acgttcgtga 17221 tccagataaa gaatttcttg attttgtacg cgccagctcg taataatgcc acctctttct 17281 ggtacaatct caagctggga ctgatttgtt tgaatgtgga gtttactcat tgtattggta 17341 tgattggagt ttttttgcaa acacgttttt agtccggagt attcaagtgg tgagtatttt 17401 ctttccactt caatactcca gactcagcat tttttattcg tcttccgcaa acacaaagcg 17461 atacaactcg ctggcgttgg gttcagggct gctttcagcg 
aatttcactg cttcgtcgat 17521 ctcttgctgg atttgtcggt caatagcctt aagttcttca ccagaagcca agttttgctc 17581 aaccagataa gcagcaagtt tcttaattgg atcacgagca aaccaaaact ctttctcagc 17641 cttggttcgc agttcatctg gatcagccag agagtgacct cggaaacggt aagttagtgc 17701 ttcaattaag gtaggaccat tcccagcacg agcacgggct acggcttcct gcgcgactgc 17761 tcgcaccgcc aagatatcca taccatctac ttccacaccc gccatgttaa agacactggc 17821 ttttcgataa atctctggtt gggaagtcgc ccgttcgtga gccatgccga tcgcccactt 17881 gttattttct acaacaaaga taatcggtag tttccacagc gccgccatat ttaacgtctc 17941 gaaaaactga ccattatttg cagcaccatc accaaaaaaa caagcgctga cttggtcagc 18001 ttttggatca cctaacactt cgcgtcggta tttcgtttga aaagccgcac cagcagcaac 18061 gggaataccc tcagccacaa aagcataacc acctagcatg cgatgttcgc tagagaacat 18121 gtgcattgag ccaccacgcc ccttgctgca ccctgtggct ttaccaaata attctgccat 18181 gacctcgttt gcaggaactc ctgcgctcag agcatgaacg tggtcgcggt aagtgctgga 18241 cacgaaatct tcacctggtc gcatcgctcc cttaatcaca ccgctggata cggcttcctg 18301 accgttgtat aaatggacaa aaccaaacat tttgcccctg tagtacattt cggcacactt 18361 gtcttcaaag gtgcgcccca gtaccatgtc ctcgtacaac agcaatcctt cttctttggt 18421 tatttgtacg gtagccgcat caaatttggg taatgtgcgt tcttgaacca ttatttctaa 18481 gtattcctgt tgtaaaactt caagtaacta tttgagatta agttgcgctc atgcagcgta 18541 agcttttaaa aagatagcgt ttaaatgagt caataatcaa acttacctca attactgtcg 18601 ttcgtcttgt tccacagcaa tcggtagtat gccaaccttg gtttatagat tacctacaaa 18661 aatagcaagt tttctgcatg tttattaaca atttcggcgc ttattcccga aattaagtat 18721 ttttatgtag attattgtac tcaaagaata acttgtgata aactgtaact cgctgtatac 18781 tagcaaacct gaaaagtggt gcttttctca tcatatgctg gagaaagtcc ctctaaacca 18841 attgcaggga acgtcaagcc atacatctca caaagagaat aaaaaaacac tcctgcaaac 18901 agaccataat atatgctgtt atccaaaata ctatagtcaa ttttttgttt gtcaaagagg 18961 ctggaattta gataggccaa aaagaataaa aatattctta aaaagattgg atacttatca 19021 gcaatcaaaa aacactacta ccataacaag attataccat tgtgaaaaca cgagtaaagt 19081 tgttacaaaa catacttgta gtctgaaatt aatcggttat tgtgatggca gttttttaca 19141 agcctggaat gctttaggaa 
aattatgtta atcgtgctgc aggggaaatg agccgtgcga 19201 attccgctag attactatag aattttaggg ataccaatgg cggcaagtga ggaacagttg 19261 cggcaggcat atggcgatcg cattgtgcaa ttgccgcgcc gtgagtattc ccaagcagca 19321 attacatctc gaaaaaactt aatagaagaa gcttacgttg ttctgtctaa tccaaaagag 19381 cgtaaaacat acgaccagct ttatcttgct catgcttatg gttctgacag cactgaaggt 19441 ggggcagtcg ctgttgaaag tcgcaagcaa ggtcatattg gtgagcttga ttctcaagcc 19501 ctgagtatcg acatttccca agatgaattt gttggtgctt tattgatttt gcaggagctt 19561 ggggaatatg aacttgttct aaaactgggt cgtccatacc tagtaaatag aaatggtatc 19621 agtagtatcc aaagcgatag agaggatata gaggaagttc ctttttctag tgaacgtcca 19681 gacgttgtac ttacggtttc cctggcttgt ttagaactcg gtcgcgaggc atggcagcaa 19741 gctcactacg aaaatgctgc gatttcctta gaaactggag aggaatttct cgcacatgaa 19801 gggttattcc ccagcgttcg ctctgaaatt cagtctgact tgtatagact gcgaccctat 19861 cgtattttgg aattactggc gctacctgaa gaacagacca ccgaacgaaa acaagggttg 19921 caattattgc atgacatttt agatgagcgt ggtggcatag atggtacagg aaacgacaaa 19981 tctggtctga gtacagatga ttttctacga tttattcagc aactgcgtaa ctatttgacc 20041 tgtactgagc agcacaaact atttgaagta gaaagcaagc gtccctcagc agtcgccacc 20101 tatttagcag tctatgcttt aatcgcacgg gggtttgctc aacgccaacc cgcattaatt 20161 cgtcaagcaa aacaaatgtt gctccgcctg ggtagacgtc aagatgtcca tctagaacag 20221 tcgctgtgtg cgcttctgct ggggcaaaca gaagaagcaa gtcgtgcatt agaactctcc 20281 caagagtacg aagctctagc ttttattcgc gaaagctcta aagactctcc agacctctta 20341 ccaggactgt gtttatacga ggaacgctgg ttggaaaatg aagtgtttcc aaactttcga 20401 gatttggtag acaagccagc ttccttgaaa gattattttg ctaatcaaca ggtgcaagct 20461 tatttagaag ctttgtccac agaagcagaa cctatagaac caggaactgg aataaacaga 20521 cagtctttcc aaacacagca aatcactcgt gagagtcgtc gcaataattc cgcaaataat 20581 aattctaaag tggttgaggg aaaatttcca actcaaaaaa ctccccattc agaaacacca 20641 gccaaatcaa ccttttcctc ttcctcaccc agtcatacaa caacatcatc ttcttcaaat 20701 acttggagtt cacaagccga aacacctgta gcaccagttc acaggataga cgccactccc 20761 aaaggaactt actacaactc gaatcgtccg tccaaacacc ccaccgcacc ccctcaacga 20821 
caaacccaaa ggaagcgtaa acgtagtcca tctgctaatt caggacgtgg atctggtaat 20881 cttcttggct ctgcttatcg tcagcgaatt tttgcccgta tttctccagc acaaatgcgg 20941 ttagtgcgga ttgtgtctat tttgttgggg agtctgttgg tgctgtggtt gttgatcgca 21001 gcaacttttg gattgttgaa aaatttgttt tttcctggac cttcgttgaa aggtgaacag 21061 ctatcagttg agctaaatca gcctttagtc cctattccta accaaaacag taaaccacaa 21121 ttaccagcag gaacacttac tgaagcgaca gcagaggaaa taattcaggc ttggctagat 21181 accaaagctg cggcttttgg acctaaccac gagattgatc gtttgaagta tattttagta 21241 ggttcaacct tgacaaaatg gcagcgatgg gctcaacaag aaaagactga taaccagcat 21301 cgaaaatatg aacatagcat gaaagtagaa tctttagagg caaatacaac tgaccagcat 21361 cacgctgcgg tagaagcttc ggtgagtgaa gcgacacagt attatacgaa tggtcagatg 21421 aaaaaatctg ataaggaaaa attacgagtt agatatgaac tggttcgggt agaaggttca 21481 tggcgtatcc gggatatgtc agttctcaac aagataagta tgatttttta gggaataggg 21541 aacgcttaac agggaacgct taacagggaa cgcttaatag ggaacaggaa gacaaatttc 21601 cggacattaa gtatcaaaaa aaacttaaat attcccaaaa ttgcgaaata accgattgtg 21661 acagtacaag gtgcggaatg tggcctagaa attttttatt gccagcataa gctgttgtta 21721 tcttttcacg aaaatcctga tacatttcaa tttcactcaa gaactgatcc tcgtacttcc 21781 tttgtggatt ttggtctttg gtgaagggat ttaacttgct gtagctaagg cactttcctt 21841 gaaataaaat cccaacccta tggctgacgc cacgccagaa ggctatcggg ttcaccagtt 21901 ccctacggag ggaaaccctc atcaaggact ggttcaccgt caaatctaga tccttcgtca 21961 ccaacccgtt aacgtcaaac acggcgcttg tttttatttt tacgatcttt cctaaaaaaa 22021 cttttgtcat caaaatttga cataatcgcc ctcgctcaga tgttttagct gcattcaaca 22081 agctagaatg tcatagtatc ataacttatt gatccccaaa gggctgataa atacgtgaaa 22141 aactgaaaaa tgtgtctgta aaccccttga tcccgacgca ggattattag ttaatattta 22201 ttaacaaaga ctcctctgct ccccagtaaa aaatgctgat tttggcatca cagggaattg 22261 aaagggagtg atttgcaatt taaaaacaca ttaaaaaaca taacactcgg caaaaccagg 22321 agtcaggaat aagaagttta tctgggggtc tacggctctt tatcagccat ttaataacca 22381 ggggacatca tcaacctttc aaactactgg aggccaaact catggcaaaa gaacgcccac 22441 ctttagaaga gatgacattg cgccaactac gtaaagtagc tagtgaattt 
aacatctctc 22501 gttacagtcg aatgcgtaaa tcgcaactgt tggaatctgt tcaagaagct caacgcagca 22561 aagtttctct cacccaatct cgttcaatgg aggcacagga aaccgtggaa gcagcaaaat 22621 ttgagttggg tcaagttgat cgtactggcg gtactctcgc tgatgttgat gaaggactag 22681 cggatctccc tgaaggctac ggagaaagcc gtattgttct tatgccccgc gatcctcagt 22741 gggcttatac ctactgggat gtttctaacg agcataaaga agagctacgc cgtcttgggg 22801 gacaacagct ggcactgcgt atttatgatg tcaccgacat caatttagag taccaaagcc 22861 ctcacagcat tcaagaatac cccagtgatg aacttgcacg ggaatggtat ttgccagttc 22921 cggtgagcga tcgcgattat gtcatcgaca tcggttaccg ttgcgctgat ggtcgttggt 22981 tagtactagc tcgttctgca cccgtccacg ttcctcctgt gtatccttcc gactggattg 23041 aagatgtctt catcaccgtg aactttgatg aagatttgcg tgacaaaacc gtttacgaac 23101 tggttcctcc cgccaagaaa gtcgctccta ctgctcctgc tgctacggct gctcgtggta 23161 acgccatcta cgaccaaatt tttggtttgg cagaatctgc cgaagcaatg cgcgttgctg 23221 gttctgtgtt cggttccatg cagcacgttc ctggttccgt cattccagaa caagccatca 23281 gctcctacgt cttcccctct ggtgtaggta tgtgggcagt tccaacaacg tctggcttaa 23341 ccatgtccgg tgttggtatg tctggtgctg ggttctctgg tgacgtgcca atgcgtccgc 23401 gtaaattctg gttaattgct gatgctgagt tgattgttta tggcgcaacc gaacctgatg 23461 cgactgtcac aatcggcggt cgtccaatta agctcaatcc agacggtacc ttccgcttcc 23521 agatgtcctt ccaggatggt ttgattgact atccgattat ggctgttgct gctgatggtg 23581 aacaaacacg ctcaattcac atgaagttta atcgtgagac accatctcgc aataccaata 23641 ccaaggaaga agctgtttta gaatggctct cttaatttga aaattttgaa tttttgattt 23701 tagatcagcc tgattaacct ccgtcacaac tgaactgtgg cggaggtttt gtctgttagg 23761 aaaaatttgt ttttgattaa acttagccag actgtcataa aaagtaattt ctgatgtcac 23821 aagaccaggg aataggggag cgatcgcgcc gtgctgcggt tccctccacc cttggcaact 23881 ggcgtggatt gccgtttatc cctgaacatg gtaagccacg gagcgaacga tattttttga 23941 tgtctttaga ttttattgag cggcacagat catagaaggg agacagagtg caggggaata 24001 aaagttttct atcacctgtc atctgtcacc tgtcacctgt cacctgtcaa ctactcctta 24061 tctagtagat gagagagtgc ggcgcttggt caccatctgg aatgcttcga tgatgtcacc 24121 ttctgcccag tcattgtatc tgtcaatgcc 
gacaccgcat tcgtaaccgg cgttaacttc 24181 acggacgtct tctttcattc gcttgagcga gtcaagggga gcttcattga tcaccttacc 24241 attacgacgc acccgcactt tacagttacg gatgagctta ccagactgca cgtaacaacc 24301 tgcaacagaa ccacgaccaa cggggaagac agcacgcact tcggcttgac ccaagtgttc 24361 ttccaccaac tctggctcca gtagaccttc caaggctcct tggatgtctt ctatgagttt 24421 gtagatgatg ttgtattccc gcacatctac accagcttca tcggcggctt gtcttgcgcc 24481 actggcgtaa gtagtgttga aaccaacgat aactgctcca ctggctgcgg ctaagtcgat 24541 atctgtctgg gtgatttcac cagcagatga caacaacaga cggatttgta cctcgttttg 24601 cgggatttgc ctgagcgatc ccacaattgc ttcgactgaa ccctgtacat ctcctttcaa 24661 gatcaggttg agttctttca actcgccttc ttgagcctga gccgagatac tggttagggt 24721 cacacgtccc tgcatcagac gggattgacg aagtttttca gcgcgttcgg ctgcaataga 24781 gcgtgcttgt ttttcgttgt caaacacatc aaactcgtcg cctgctgctg gtacttcact 24841 taaacccaaa acttcaacgg cgaaggaagg agtagccgcg tcgactcttt tacctctgtc 24901 atccaccatt gctcggactt taccgaaggc tgagccagct accaacaaat tgcctacgcg 24961 cagactaccg ttttgaatca gcaaggtcgc aactggtccc tttgccttgt ctaagtgagc 25021 ttcaatcacg gttcctttcg cgagtcggtc tggattagca gaaagttctt ctatctctgc 25081 taccaagaga atcatctcta ggagtgtatc cagattttcg cccctaatag cgctgacagg 25141 aaccataatc gtgtcaccgc cccactcttc tgacaccaga ccatattctg ttaattcttg 25201 cttaacacgg tcaggttgtg cctctggttt atcgattttg ttgatggcta ctatgatggg 25261 aacttcagcc gctttcgcgt ggctaatcgc ctcaacggtc tgaggacgaa cgccatcatc 25321 tgccgccacc accaaaatag cgatgtcggt cactcgcgct ccacgcgccc gcatagctgt 25381 aaaggcttcg tgaccaggag tatcaaggaa taccacttgc tgctccttac catcgtgttc 25441 tacatctacg tggtaagcac cgatatgctg ggtaatacct cctgcttcac cagatgccac 25501 ttttgttttg cgaatcgagt caagcagcgt agttttaccg tggtctacgt gacccataat 25561 tgtcacaact ggcggacggc gaagcaggtg ttccaggtct gccacgtcga tcatttccgt 25621 gactttgcgg gcttctgctt ctggttccac ggtttcgact ggtatttcca actcgttggc 25681 taccagtgta attgtgggaa tatccaaatt ttgggtgata ctcaccgcca tgcctttcag 25741 gaacagaatt ttcacaattt ctgtatcagc cactaccaaa gcctcagaca gttcttgcac 25801 tgtcatcgga 
cctgtgactt ccagtatttc tggacgatcc tgcttttgtt ccgtttcttg 25861 acgacggttt tgctcacgag aagaagcagg ctttctcgct ttagcaactg gcgtagcagc 25921 cgtagctact gctgtttgtg tcggcttgga agccttaggt ttcggtggac gggcgataga 25981 gagactgact tggatagtag caggtatttc cagtccttct tcatctaaga gatcttcgtc 26041 ttcaaactca tcaactattg gcttggcgcg ttttccttta acggctccct ttgcgggctt 26101 agctgactct ttaatttctt ctatctcttc ttcctgccac ttcttaccac cttttgcttg 26161 acgtggtggt gttgggcgct tcagttcaat gacttcatca gtcactgttg gggcatcatc 26221 ctcatcttgt cgcccagcct gcggttttgt cagtgagact cgcattggtt ttggaggcgt 26281 agcgactggt accgcaggtc ctggttgttc accagaacgt actataggtc tgctcggtcg 26341 ttgcgattct ccgacttgta caggtgcgga aggcttattc cctctctgct caggtttgac 26401 aggtgatggg gtagtcgaac gcacaggttt ttcagatttt tgcggcaaag cagcctgtgg 26461 tgtttcccct gttgccgctt tgttgactct ttgctgcttg acttggtcgc gttcgccccg 26521 aatgattctt ccttcaggag catcactttt ttggggtaag gattctgcgc catcgcctgc 26581 tggggattgg atcccttcag gtgcatcgcg tttttggtcg cgatcgcgtt tgagaattat 26641 aactggtttc tcgctcacag tcggttgggc atcggctgtg ggagaatttg gtttctccgc 26701 agccgggctt gtgggcgggg ctgccagttg tggctttggc tgtctttccg gttttgctct 26761 gggtgcaggt gctttttccg gtttttgggc tgctactttt tcaaccggct ggtttgtatt 26821 gggtactatg tctttatccg cgtctgtcac tgctggttgt tcaagtg // LOCUS NODE_1102_length_26825_cov_5.00171826825 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 26825) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 26825) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..26825
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(218..415)
                     /locus_tag="DP116_09665"
     CDS             complement(218..415)
                     /locus_tag="DP116_09665"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458130.1"
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3252 domain-containing protein" /protein_id="PRJNA477356:DP116_09665" /translation="MILPGATVRVKNPQDTYYRFEGLVQRLSDGKVAVLFEGGNWDKL VTFRLSELELVETTAGRKKAK" gene complement(584..1897) /locus_tag="DP116_09670" CDS complement(584..1897) /locus_tag="DP116_09670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868307.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rod shape-determining protein RodA" /protein_id="PRJNA477356:DP116_09670" /translation="MLVKRSLSRSRWKFWIKPWDQVDLLLFCLPISLTVFGGIMIRST ELNQGLTDWWWHWLMGGIGLVIALFIARSRYDQLIQLHWVTYAITNLSLIAVMVVGQS AKGAQRWINIAGIAVQPSEFAKLGLIITLAVLLHKRTASNIDNVFRALAITAVPWGLV FLQPDLATSLVFGAIVLGMLYWANAHPGWLILLVSPIIAAILFSISWPLSEQPIVLFN QISLTALGLIWSATMGLVGFLTLPRRQNVIGGIGAIALNLLGGELGIFAWNHVLKEYQ KNRLTVFMNPEHDPLGAGYHLIQSRIAIGAGEWSGWGLFKGPMTQLNFVPEQHTDFIF SAVGEEFGFIGCLVVLIVFCLICQRLLHIAQTAKDNFGSLLAAGVLSMIVFQMIVNIG MTVGLAPVAGIPLPWMSYGRSAMLTNFIALGLVESVANFRQRQKY" gene complement(2008..3078) /locus_tag="DP116_09675" CDS complement(2008..3078) /locus_tag="DP116_09675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740545.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sodium:proton antiporter" /protein_id="PRJNA477356:DP116_09675" /translation="MYDVLDTRSVLKVLQPVQDPELRKSLVDLNMIRNIKIESGKVSF TLVLTTPACPLREFIVEDCQKAVKQLPGVKEVSVEVTAETPQQKTLSDRTGVPGVKNI IAVSSGKGGVGKSTVAVNVAVALAQTGAKVGLLDADIYGPNDPTMLGLGDAEMIVRST DKGEILEPAFNYGVKLVSMGFLIDRDQPVIWRGPMLNGVIRQFLYQVEWGEIDYLIVD MPPGTGDAQLTLAQAVPMVGAVIVTTPQNVALLDSRKGLRMFQQMNVPILGIVENMSY FIPPDMPDKQYDIFGSGCGSTTAAELGVPLLGCVPLEISTRIGGDKGVPVVIAEPESA SAQALKAIALTIAGKVSVAVLT" gene complement(3353..3874) /locus_tag="DP116_09680" CDS complement(3353..3874) /locus_tag="DP116_09680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SRPBCC family protein" /protein_id="PRJNA477356:DP116_09680" /translation="MGWLSKSVHRKRRRFCLSLVRTYREISSASVDELWQKVVDIADV SWHPLLKSTNVPYGLVPKPGLIYQAVTRLSPIPIRIFVESVNPRELLSVRVLAIPGVE ERVTYKVESTVCGTCLSYSVTLRGWLSPLIWPFSRPYADRVARALVQAVEEATLQAVS RKRKSLKDGCFDC" gene 4662..5711 /locus_tag="DP116_09685" CDS 4662..5711 /locus_tag="DP116_09685" /EC_number="1.3.3.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136576.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxygen-dependent coproporphyrinogen oxidase" /protein_id="PRJNA477356:DP116_09685" /translation="MVINSQTPTMRAESTQTLPTDAKARVSQFMKQLQDKITQTLAEL DGVGKFQEDAWERPEGGGGRSRILREGAIFEQAGVGFSEVWGSHLPPSILAQRPEAAG HDFYATGTSMVLHPRSPYVPTVHLNYRYFEAGPVWWFGGGADLTPYYPFAEDAVHFHK TLKQACDLHHPEYYPVFKRWCDEYFYLKHRGETRGIGGLFFDYQEGRGSLYRGPHNDG AAATYSNQIGTPEPRSWEQLFAFAQECGNAFLPAYVPIVERRHKMEYGDRQRNFQLYR RGRYVEFNLVYDRGTIFGLQTNGRTESILMSLPPLVRWEYGYQPEPNSPEAQLYEIFL KPQDWANWIPSSPKS" gene 6060..6428 /locus_tag="DP116_09690" CDS 6060..6428 /locus_tag="DP116_09690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118637.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anti-sigma factor antagonist" /protein_id="PRJNA477356:DP116_09690" /translation="MYYIDQKTYTTQNGNTVTVLTPTGRLDITTAWQFRLKLQECISK LSRHLVVNLGQVNFIDSSGLTSLVAGMRDADKVKGSFRICNVHPEAKLVFEVTMMDTV FEIFETEEDALESESGSIAS" gene complement(6634..7230) /locus_tag="DP116_09695" CDS complement(6634..7230) /locus_tag="DP116_09695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868312.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chorismate mutase" /protein_id="PRJNA477356:DP116_09695" /translation="MSLAAQLLTLANYLSGEFDNREQALAEPAWYVHLRLWHRPLSLF SEDSLTIFAEQANIVNLERPYRQRIMRLLQGSDPDVPFKVQYYMIKDHNTLIGAGQNP ALLNTLTPDHLELLPGCVLNLTQQLLAPNSYKFNATPPPDTRCCFSVDGNTVQVSLGF EVTDDKFLSYDKGIDSTTGKATWGALLGPYCYTKRQQY" gene 7437..8144 /locus_tag="DP116_09700" CDS 7437..8144 /locus_tag="DP116_09700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113336.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II biogenesis protein Psp29" /protein_id="PRJNA477356:DP116_09700" /translation="MNNLRTVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNV DYSYDPIYALGVVTAFDRFMEGYQPERDKESIFHALLQAVEQDPQRYKHDAQRLQALA TSISASELTAWLSQQTPLDRDADFQGSLQAIANNPKFKYSRLFAIGLFTLLEYSEPEL VKDEKKRNEALKTIAKGLNISDDKLNKDLEIYYSNIDKMSQALTVMSDMLLADRKKRE QRAQQSNTKVAPPKANE" gene complement(8293..8586) /locus_tag="DP116_09705" /pseudo CDS complement(8293..8586) /locus_tag="DP116_09705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006100985.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(8716..9339) /locus_tag="DP116_09710" CDS complement(8716..9339) /locus_tag="DP116_09710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877847.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="biopolymer transporter ExbD" /protein_id="PRJNA477356:DP116_09710" /translation="MKINLQTPVEDVQIQIIPLIDVVFCILTFFLLAALQFTRQEEIS INLPKSTTSTPSITNGSNAKKLSATAQRQILTLTIDAIGQTYVENDSVKREQLKETFK SHLQQTPNAILALKVSQTATYNDVISMIDLWRQVGGDRISFVTTPSFSNQPITPINPQ TNPNFNLPTIPNPPQTNQGVTPVNPASPVLPKVPTAPSGQTNPTPKR" gene complement(9376..10158) /locus_tag="DP116_09715" CDS complement(9376..10158) /locus_tag="DP116_09715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315552.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MotA/TolQ/ExbB proton channel family protein" /protein_id="PRJNA477356:DP116_09715" /translation="MDIVDLFKKGGSAMWPLLALSILSLSVIFERLWFWLRILSQEKE IVARVLDAARVDWMSAADIARQATKQPIGRFLYAPLSLPKSDLESFRLALEATAEDEL AGMRRGEKLLETVIALAPLLGLLGTVLGLIQSLRSIRIGDLGTESTAGVTTGIGESLI STATGLIVAIVSLAFYRLFQGLVVNQVKIFRRAGNDMELLYLQSPPDYTKMRSENIIP PVTIIRDSPPDNLNSPRKRGKPRFSETPEPPSESDLSEPKDN" gene 10376..10705 /locus_tag="DP116_09720" CDS 10376..10705 /locus_tag="DP116_09720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130006.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1232 domain-containing protein" /protein_id="PRJNA477356:DP116_09720" /translation="MKFSIQSLYTWYRNVLRNPKYRWWVILGTLLYFVSPIDIAPDFI PIVGELDDVFLLTLLVTELSGLMIEGFKARKGQVDAQATNTTSNTTTEGPTASPNTID VDAVSVK" gene 10937..11740 /locus_tag="DP116_09725" CDS 10937..11740 /locus_tag="DP116_09725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dTDP-glucose pyrophosphorylase" /protein_id="PRJNA477356:DP116_09725" /translation="MTKRQIIGLLPAGGQATRISPLPLSKELYPIGFQDFGVKSNWRP KVVSQYLLEKMQLAGIDKAYFILRSGKWDIPAYFGDGTMLSMSLGYLIMGLSYGVPFT LDQAYPFVQDAIVALGFPDILFQPEDAYVRILTRLEVSHADVVLGLFPTDKPQKAGMV DFDDEGRVRLIIEKPRQSDLRYMWSIAVWTPAFTQFLHEYLTTLKVNSNLSQLPEIPI GDVIQAAINKGFHVEAEVFADGTYLDIGTPDDLVSAVRQFAALVGEENL" gene complement(11765..12634) /locus_tag="DP116_09730" CDS complement(11765..12634) /locus_tag="DP116_09730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877851.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_09730" /translation="MPKSADISTKKLISLAPDNWVKWVTQISDITAQEILNSEFQWIS RESDVLIRAESPQYGQFLVLNELQLRYKPEMPKRMRAYAALAEEKYNLPTYPVLINIL KESDVEIPTRYQSEFAGLQARQDYRVINLWEVDVEIAFQQPVLSLLPFVPILKGGAEE TTIQQALQILRADEQLNQLETVLAFFASFVLDSALVQQIMRWDMAVLNESPWYQQILR EGEARGEARGEERGRREEKLSSIEMGLEVKFGTEGLQLMPEIAQISDLERLKAIQRAI LTVSTLDELRQLI" gene complement(12961..13720) /locus_tag="DP116_09735" /pseudo CDS complement(12961..13720) /locus_tag="DP116_09735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318726.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(13823..14023) /locus_tag="DP116_09740" CDS complement(13823..14023) /locus_tag="DP116_09740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015150573.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_09740" /translation="MPASVRVSVLHKDSYKCVFCGRSSQQVQLEVDHIVPFSQGGSNN LNNLQTLCTDCNRGKGARLLKK" gene complement(14197..14604) /locus_tag="DP116_09745" CDS complement(14197..14604) /locus_tag="DP116_09745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004161543.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09745" /translation="MTDNQNEELKNKVEGTALVAGGTIAGAGVSAVVGGMGLVGGFGG VAIGMAPVTAAGAVIGTATYGAKKAIEEGDATAISAVAGGAAVGVGVSAVVGGMGLAV GGTAVAIGAAPVIAAGAVVGLAAYGLKKLFDKS" gene complement(14896..15438) /locus_tag="DP116_09750" CDS complement(14896..15438) /locus_tag="DP116_09750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09750" /translation="MNKRQKSQYLFRNRWIDRQRIVRNMEAFQDLIVIVLCFGLFAYM VMQLWEIFTELTLPLDHQQVTAKILFLLILVELFRLLMVYLQEHSIAVGVAVEVSIVS VLREVIVHGALEITWVQAASICGLLFILGALLLVCAKTPHMDHMINKTKHSALNHSDD EQQENGNKMSHSTQYEEIVN" gene complement(16065..19280) /locus_tag="DP116_09755" CDS complement(16065..19280) /locus_tag="DP116_09755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl hydrolase family 15" /protein_id="PRJNA477356:DP116_09755" /translation="MKTATQLQARLECYYQQIKTIILARQNPITGLLPASTAITAHGD YTDAWVRDNVYSILAVWGLAIAYRKVDEDKGRTYELEHSVVKLMRGLLFAMMRQSAKL EQFKHTQSPLDALHAKYNTATGDIVVGDDEWGHLQLDATSIFILILAQMTASGLQIIY TFDEVNFVQNLVYYIGRAYRTPDYGIWERGNKINHGNAELNASSIGMAKAALEAINGL DLFGVHGCQASVIHALPDEIARARITLESLLPRESSSKEIDAALLSVISFPAFAVEDV ELRERTRNDIINKLEGKYGCKRFLRDGHQTVLEDKNRLHYEPLELKQFEHIECEWPLF FTYLFLDGLFRGEQEQVKHYQERLESLLIERDGLHLVPELFYVPEESVEAEKLEPQSQ PRLPNENLPLVWAQSLYFLGQMLSEGLIAVGDIDPLGRYLCIGKNQQAVVQIALLAED EDLQAKLAVHGIETQTPKQVEPIQVRQADDLSAIYTQVGRNDKLGLTGRPVRRLRSLT TSKIFRIRGETVVFLPSFLDAQVFYLTLDYHFLVYQIKSELAYIQKYWIDLGRPILTL MLTHTMLETGSEALLALMQELKEGVCNGVRVKLGRLNQLMLTAGTQRIDFLQEDFEYS LSSIKDAAPRCSYLIYNPEGNWLLKHTQEFQMECETNLGLLLSSLHSSDNLYEQIELL QTLVRLEGLEFDTGFGGPGRPVTVADLLDEVYTKAGDSGVWAVVRRAAGLRQMSDIGL SDVVTSIVVRGKQIAVGKAYSEDSLITLPLSHSEIVEKINDFCREDISDRVLTQEILI YLSALIKTEPELFKGLLTFRVGYMILLITSELVQELSLTQDEAYEHLMQLSPLEVKTR VRQVLFEYVSLSQLLRQQESLHVKQKESDIDWMVAPSKGDEIDVPSGGWRRFRQAEGA TGRVPKDFFKQVWLVMEHCKGLVIGDKLERRNRLDSEVILSEMTAGEKNFALQVEHLL NKIEAPEYRQVNIEALTELAAIASNNPNLQIEEYIVLDVLIGHAVRLAWLESHPKRGD RYDEDKASAWRAFYNTSPKECASYVVKAFRFLIEFGQDIAA" gene 20524..21879 /gene="mgtE" /locus_tag="DP116_09760" CDS 20524..21879 /gene="mgtE" /locus_tag="DP116_09760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium transporter" /protein_id="PRJNA477356:DP116_09760" /translation="MLMQDVRDSVIDIADLNQLKLDLNGTQPVDVGEYITQLPEQQRA IAFRLLNKHQAIDVFEYLPSEVQEQIINSLHDVQVAQIVEAMSPDERAELFDELPAGV VKRLLQELSPEERQATATILGYAEGTAGRVMTTEYVRLRDGLTVGEALSKIRRQDEDK ETIYYAYVTDDNHKLVSVVSLRQLLFTFPEVLITDIASTRVIKVRTQMPQEEVAQIMK RYDLIAVPVVDREERLVGIVTIDDVVDILEEEATEDIQKLAGVSGGDEQAFSSPLITL RKRLPWLFGVMLLYIGASSAIAPFQSTISMVPVLAVIMPLFSNTGGTVAIQALTVTIR GLGVGEVTPQDTFKILRKELQAGLGTAVALGLTMIMLSLIWAPPHERWVSLVAGTVMA VNTLVAVTLGTLLPMGLKRLKLDPALVSGPLVTTILDAVGFMIFLTLISTALHIFHLK P" gene 22073..22240 /locus_tag="DP116_09765" /pseudo CDS 22073..22240 /locus_tag="DP116_09765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454378.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 22321..22923 /locus_tag="DP116_09770" CDS 22321..22923 /locus_tag="DP116_09770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139284.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_09770" /translation="MQVQTQKRYYTPEEYLELEEKAEYKNEYRDGEIVPMTGGTTNHN EIALNLAASLKFAVKGQNYRVYIGDVRLWIPRYRQHTYPDVMVIQGQPVYTGTNTTTV MNPLLIAEVLSKSTKNYDQGDKFLYYRSIPEFKEYILIDQYHYHVMQYVKTAEGQWSF TELEGESATLSLQTIDFKILLSDLYEQVDFTVSSEEDSLS" gene complement(22959..23399) /locus_tag="DP116_09775" CDS complement(22959..23399) /locus_tag="DP116_09775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002795967.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PIN domain nuclease" /protein_id="PRJNA477356:DP116_09775" /translation="MSYLVDTNVLLRSAQETHSMHKSSVQAVRILLEQGKRLCIIPQN LIEFWVVATRPIEVNRLGLSVADALNELEQLKNCFVLLPDTASIFPVWESLIAKYKVT GKPSHDARLVAAMIVHNLTHLLTFNTSDFRRFSEITALDPHSIF" gene complement(23396..23668) /locus_tag="DP116_09780" CDS complement(23396..23668) /locus_tag="DP116_09780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869104.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09780" /translation="MAILLELEPEIESRLMAQAAAQGTSVEVLLKTLVESLLASSQPT PLTLSPQERAERFVNWARSHSSIKAPPLCDDAISRESIYTREDEMV" gene complement(23808..25475) /locus_tag="DP116_09785" CDS complement(23808..25475) /locus_tag="DP116_09785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859067.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09785" /translation="MRKTVETEIISVTVYTDQALVTRRGAISRRQDSERIAPVSSARP SRVITLTGQERELKIIQLPVTMETESVRVSGTGEVVVHLSGVSTERVFSTEPFAERLS HLTRQIQQLEAEWNLLQAQIDALALQSKFIEGLREKTEEPFAQSLSRKNLSLSETLDF LNFMGSQYSEYAIATGECKVQQQELDKQLQVLRQQWQQVQTPSPKESFSLSVAIEPAG AGEFELEVSYVVSCARWTPLYDLRVNTSSNSINLTYLAEVTQSTGEDWMDVSLTLSTA KLGLGTLPPKLEPWYIDILRPPEVLRMRRVAPIQTPSVAMAPSASGDGSTTLETEQLE ENLVSAQTLIAEVSREGSAVTFEVKSSGNIPSDGAPHKTTIFNDDFPCSFEYVAIPRL ISFAYLQANVKNSSNGVTLLPGKANIFRDNAFVGTTQLENVAPGQEFKLNLGIDEGFK IERDLVERQVDKKLIGNNRRITYSYRIVITNLQNQEANLKVIEQLPISRNEQIKVRLN RSNPQIQLGEMGILEWSLVLPPEAKRDIYYQFVVEYPPELTVVGLDF" gene complement(25690..26628) /locus_tag="DP116_09790" CDS complement(25690..26628) /locus_tag="DP116_09790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865004.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LysR family transcriptional regulator" /protein_id="PRJNA477356:DP116_09790" /translation="MNQATLHQLKVFEAAARHGSFTRAAEELFLTQPTVSMQIKQLTK SVGLPLFEQVGKRLYLTEAGRELFTTCRQIFDTIAQFEMKVADLKGLKQGQLRLAVIT TAKYFVPRLLGSFCQLYPGIDIALQVTNHEGILERMSSNMDDLYIMSQVPEHLDVNYQ PFLENPLVVIAPANHPLAMQKNISIERLADEPFIMREPGSGTRRAVQKLFDEHKVMVK VKLELGSNEAIKQAIAGGLGISILSRHTLTTESKDLTILDVENFPIQRNWYMVYLAGK QLSIVARTYFEYLLDAAKQFANQTISSSVFHADENR" BASE COUNT 7595 a 5876 c 5518 g 7836 t ORIGIN 1 ggtcacagtt gtttaactca tgacccttcg ggtatgcgca aagcgcacgc cttacggcta 61 acgccagatg cctgttgtcg ggaaaacgcc acatgctacc ctgcgggaag ccgccctccg 121 ggcgtctaca agccgggaaa cccgtccaac gcagtggctc ccctcccgca gcactggtct 181 cactaatgac taacgactaa tgactcatga ctaataacta tttagctttc ttgcgtccgg 241 ctgtggtttc tacaagttct aattccgaga ggcgaaaggt aacgagttta tcccagttcc 301 caccttcaaa caggacagca actttgccat cactcagccg ttgcacaagt ccttcaaagc 361 gatagtaggt gtcttgtgga tttttgacgc gaacagttgc tcctggtaag atcatcatct 421 tttcccttgg ttgtttcctg ttaatagctt aatctttctc tagctaggtt tgtattagta 481 ctatttactc atgctattga ttaatcatta taatttgggc taagaaaacc cactcctaca 541 agaagtggga cagacagtcg cagcctgcgg ggagtgtcaa caactaatac ttctgccgtt 601 gtcgaaaatt tgcgaccgat tctaccagcc ccaaggcaat gaagtttgtt agcattgccg 661 aacgaccata actcatccaa ggaaggggaa tccctgccac tggtgctaaa cctacggtca 721 taccaatgtt aacaatcatc tgaaacacaa tcatagacaa aacgcctgca gcaagcaaag 781 aaccaaagtt atctttggca gtttgggcaa tatgtaatag ccgctgacaa atcaagcaga 841 agacaattag tacgactaga cagccaataa aaccaaattc ttcgcctaca gcggagaaga 901 taaagtctgt gtgctgttca gggacaaaat ttaattgagt cataggacct ttgaaaagac 961 cccatcccga ccattcacca gcaccaatgg caatgcgcga ttgaatgagg tgatatcctg 1021 cgcctagagg atcgtgttca ggattcataa atacagtcag tcggtttttt tgatactcct 1081 tcaaaacgtg gttccaggca aatataccta attcaccacc caaaaggtta agagcgatcg 1141 cgcctatacc gccaatgacg ttttgtctcc taggcaaagt taaaaatcca accagtccca 1201 tagtcgctga ccagattaac cctaacgctg ttagggatat ttggttgaat aaaactatcg 1261 gttgttcaga taacggccaa 
gatatgctga acaaaatcgc tgcaataatc ggagaaacga 1321 gtagtattaa ccagccggga tgggcatttg cccagtacaa cattcccaga acaatcgcac 1381 caaacaccaa tgatgttgct aaatctggct gtaagaagac caagccccaa ggtacagcgg 1441 tgattgctaa tgcccggaaa acattatcaa tattagaagc agtgcgcttg tgtaacagca 1501 ccgctagggt gataattaat cctagtttgg caaattctga aggttgtacg gcaataccag 1561 cgatgttaat ccaccgttgt gctcctttag cactttgacc aactaccatc acggcaatca 1621 agctcaagtt ggtgatggca taagttaccc agtgcaattg aatcagctga tcataacggc 1681 ttctagcaat aaataaagca ataaccaagc caatgcctcc catgagccag tgccaccacc 1741 agtcagtgag tccctgattc aattccgtac tgcggatcat gataccgcca aatacggtga 1801 ggcttattgg taaacaaaat aataacaaat ctacttgatc ccaaggtttg atccagaact 1861 tccaacgaga tctggaaagc gaacgtttta ccaacattgt atgcgtttac actttaagta 1921 tgagatttga aatgaaaaaa taacccaaaa tctgtattcc tggctgagca acttcaacca 1981 ggcagaaatt ggaatttttc tgtcaagtta tgtcagaaca gcaactgata ctttgccagc 2041 aattgtcagg gcgatcgcct ttaacgcttg ggcactagct gattctggtt cggcaataac 2101 cactggtaca cctttgtcac caccaattct agtagaaatc tccaatggta cgcaccccag 2161 caacggcact cctaactcag ctgcggtcgt ggaaccacag ccggaaccaa aaatgtcata 2221 ctgcttatct ggcatatctg gtggaataaa gtagctcata ttttctacta ttcctaggat 2281 gggcacattc atctgctgga acattcgcaa gccctttcgg gaatccagca gtgcgacatt 2341 ttggggtgtg gtgacaataa ctgctcccac cattggcact gcttgtgcta atgtcagttg 2401 agcatcgcct gttccaggtg gcatatccac aatcagatag tctatttctc cccattccac 2461 ttggtagaga aactggcgaa tcactccatt gagcattggt ccacgccaaa tgactggctg 2521 atcccggtcg attaaaaaac ccattgacac taatttgacg ccgtaattga aagcaggttc 2581 cagaatttca cctttgtctg ttgaacgcac gatcatttca gcatcaccca gtcctagcat 2641 agtgggatcg ttaggaccat agatatcagc gtctagcaaa ccaaccttcg cccctgtttg 2701 agccaaagca actgcgacat tcacagcaac cgtacttttg ccaacgcctc ctttgccact 2761 agaaactgca ataatattct taacaccagg aacaccagtg cgatcgctca aagttttctg 2821 ttgcggtgtt tcggctgtca cctccacaga tacctcttta actcctggaa gttgtttcac 2881 ggctttttgg caatcttcca caataaattc acgcaaagga caggcaggtg ttgtcaacac 2941 caaggtaaag ctaaccttcc cggactcaat 
tttgatgttg cgaatcatgt tgagatctac 3001 cagacttttg cgaagttctg gatcttgcac tggttgcaac acttttagga ctgaacgagt 3061 atcaaggaca tcatacataa tgtttttatc tttctcttgc aataattttt aacaaaacgt 3121 cttgttatct actagaatct taacttttaa gcgctattta tcgttcacaa tctcaagtca 3181 atagccagta atctgtagtc ggtgagtcca aactcagtcg ttcttttact catgaccctt 3241 acgggtatgc ctgctttcgc ctgacccaag agcgccagtt ccctacggcg ggagacccgc 3301 ctgcaggact ggacttacta atgactaatg actaatgact aatgactaat gactaacaat 3361 caaaacaacc atctttaaga gattttcgct ttctggatac cgcttgtaat gttgcttcct 3421 caactgcttg gactaacgct cgtgcgacgc gatccgcgta gggacgagaa aaaggccaga 3481 ttaagggcga caaccaacca cgtaatgtta cagaatatga caaacaagtg ccacagactg 3541 tagattctac tttataagtc acccgttctt ctacaccagg aattgctagt actctgacac 3601 tcagtaattc cctaggattg acgctttcta caaaaatgcg aatgggaatt ggcgacaagc 3661 gtgtaaccgc ttgatagatg agtccaggtt tgggtactaa tccatatgga acgttagtgc 3721 ttttgagtaa tggatgccaa gaaacatctg ctatgtcaac gactttttgc caaagttcat 3781 ccacagaagc agaactgatt tctcggtaag ttcgcaccag agaaaggcaa aaccgacgac 3841 gtttgcggtg gacggatttg gataaccatc ccatttttct attcctcctc cctacagcct 3901 ttttggtaac tctggcatag tcagcatttg cgttcagccc ttgtatgaaa ctttttcaaa 3961 attgtcctca acatagtaag tcgccatttc atatatacgc taatttgaga actgaagcat 4021 aactgagact ttttcattaa ttttcattaa caaacacgct ctttaagaaa gaaggtatca 4081 acgaatgcac attcgctttt aggcgttgtc gttcgcgtag cgtctccgac aaaagaaggc 4141 taggaaaatc cttagcacat tattgaaatt ttggctcata actcgattat tactgaagtg 4201 ttgattttgg gagttaatat aagattcgac agagcaatct tgttgaactg caaggctcga 4261 tttttgatgc tgtgagcaga aacgctctgg gaggaacaac tctattcata tatatcagtc 4321 cgcgccccct gcggatatgg tcggagattt ttctttagac agaggttgag atttgccagg 4381 tggaaagtgc tcgtagattt atcatttgaa gtgttgtatc aatcctagaa ctttacataa 4441 acataaacaa cttgaaaaaa atcaataaaa gcacaagtac acacccattt gagggaaaat 4501 aactatctgt agtcatatcc ctcagtcgtg attaatgctt gactatagag agaatagctc 4561 tttgggcaat tttagcctgc tttcttttgt cagttgctca gaaattttca gttgaaaaat 4621 tcttgctttt accaaaattg taataaatgg ttcttgagtc 
tatggtcatc aactcccaaa 4681 ctccaacaat gcgggcagaa tctactcaaa ctctaccaac tgacgccaaa gccagagtta 4741 gtcagttcat gaaacagttg caagacaaaa tcactcaaac gttggcagaa ctggatggtg 4801 tggggaaatt ccaagaagat gcgtgggaac gcccagaagg aggcggaggg cgatcgcgca 4861 ttctgcgtga gggtgcaata tttgagcaag ctggggttgg tttttccgaa gtttggggtt 4921 cccatttacc gccttcaatc ttagcgcagc gcccagaggc agcaggtcac gatttttatg 4981 ccacaggcac ctcaatggtg ttacatccac gtagtcctta tgtacccacc gttcacctca 5041 attatcgcta ctttgaggcg ggtccggtgt ggtggttcgg tggcggtgcg gatttaactc 5101 cttattaccc ctttgccgaa gatgcagtcc atttccataa aacactaaag caagcctgtg 5161 atcttcatca tccagagtat tacccagtgt ttaaacgctg gtgtgatgaa tatttctacc 5221 taaaacaccg tggggaaacg cggggtattg gcggactgtt ttttgattac caagaaggtc 5281 ggggttcctt gtatcgcggt cctcacaatg atggtgcagc ggctacttat agtaaccaaa 5341 ttgggacacc agaaccacgc agttgggaac agttgtttgc ttttgcacaa gaatgtggta 5401 atgccttttt accagcctac gtacccattg tagaaaggcg acacaaaatg gaatatggcg 5461 atcgccaacg aaattttcaa ctgtatcgcc ggggaaggta tgtagaattc aacttggtgt 5521 atgaccgagg cactattttt ggattgcaaa ccaacggacg cactgagtca attctgatgt 5581 ccctaccacc cttggtgcgc tgggaatacg gttatcaacc tgaaccaaac tcccctgaag 5641 cccagttgta cgaaattttc ctcaaacctc aagactgggc aaactggata cccagtagtc 5701 caaaatcatg agtcctgagt gttgagtata aatcatctgc aactccaaat tcccttgaga 5761 ggctaaatac gaagaatttt acactcagta ttcagcactc accacttctt tagtgagtta 5821 gactactaaa gagaagcact tagcaataaa tatgacagtt gttgctgtag cttttcaatc 5881 acaagctaca gcatcacaga accctgtcgg aggtttacgt tcttcactta gtctggaact 5941 cattccagta cgaaaaaaga ggaaaagagc cgtaataaca agggttttgg ctttgaccat 6001 gttaaggcga aactataccg tagtctcttt atagaggagt gtcaagggaa ggttcagtga 6061 tatattacat agatcaaaaa acttatacaa cccaaaacgg caatactgtt accgttttaa 6121 caccgactgg tcgcttggat attacaactg cttggcaatt tcgtctgaag ttgcaagagt 6181 gtatttccaa actcagtcgt catcttgtag taaatctcgg tcaagtcaat ttcatagata 6241 gttccggtct gacctcttta gtcgcgggaa tgcgcgatgc tgataaagtt aaaggcagtt 6301 ttcgcatttg taatgtgcat ccagaagcaa agctggtgtt tgaagtcacc 
atgatggata 6361 cagtttttga aatctttgaa accgaggagg acgctttaga aagtgaatct ggtagtattg 6421 ctagctaata ttgctagcta gtactgagtt gcattcaaag atagtaagta ggatgaccag 6481 accgtgccgg ggttcgcctt cgtgcatcca acacaagcgt ggggatgaga caattaacgc 6541 attcgctcat tcagtaaatg ctatttctga atgcaacttg gtaataaggg cattgctgag 6601 tattgagcaa cagaagaatc ttcacttcat cactcagtac tgttgtctct ttgtatagca 6661 gtagggaccc aaaagtgctc cccaagtggc ttttccagtc gtagagtcaa ttcctttgtc 6721 atagctgaga aatttatcat cagtgacctc aaagcctaag gaaacttgca cagtattacc 6781 atcaacacta aaacagcaac gagtgtctgg aggtggggta gcgttgaact tataactatt 6841 gggggctaat aattgctggg taaggttaag tacacagcca ggtagtaact ctaaatgatc 6901 gggtgtcagt gtgttgagta aagcaggatt ttgaccagca ccaatcaggg tattgtggtc 6961 tttgatcata taatactgca ctttgaaagg tacatcaggg tcacttcctt gtaaaagccg 7021 cataatgcgc tggcggtaag gtcgctctag gttgacaata ttggcttgtt cagcaaatat 7081 ggtaagactg tcttctgaaa aaagagaaag tggtctgtgc cacagacgaa ggtgaacgta 7141 ccaagcaggc tctgcaagag cttgttctcg attatcaaat tcaccagaaa gataattagc 7201 cagggtaagt agttgtgctg ccagactcat aggtgtcatg ttcacgattc catgcaagca 7261 atttccgccg tggaatgtga tgaattgttg ctataatgac agcctacacc ttctgtgatc 7321 aaagaactcc acactatcaa agtaattttg tctggcagtt ttaggtagaa aactgaactt 7381 ggcaaagtgg cattaaagcg agacaataag agtacgtcgc cctccaaatt tgctttgtga 7441 ataacctccg taccgtatct gatactaagc gaactttcta caaccttcac acccgtccga 7501 taaacactat ttatcgtcgg gtggtggaag aattaatggt ggaaatgcac ctactgtcag 7561 taaatgtcga ttatagctac gatcctattt atgccttggg cgtcgtcact gcttttgacc 7621 gcttcatgga aggctatcag ccagaacgag ataaggagtc tatttttcat gcactccttc 7681 aagctgtaga acaagatcca caacgctaca aacatgatgc tcaacgtttg caagcattgg 7741 cgacaagcat aagcgcttcc gaactcactg cctggttaag ccaacaaaca cctctggatc 7801 gagatgctga ctttcaagga tctctacaag caatagcgaa caatcctaaa tttaaataca 7861 gccgcttgtt tgcgatcggt ttattcactt tattagaata ctcagaacca gagttagtga 7921 aggacgaaaa gaaacgcaat gaggcactaa aaacaatagc caaaggcttg aatatctctg 7981 atgacaaact caataaagat ttagagatct actactctaa tatagataaa atgtcacaag 
8041 ctttgacagt gatgtcagat atgcttttag cagatcgaaa aaagcgggaa cagcgtgcac 8101 agcaatcaaa caccaaagtt gctccgccaa aggctaacga atagttataa gttgtaacta 8161 tgaaatatat aatgatgagt tatgaaactg ggtttcttat tcacaattca tcattcatgc 8221 tcgtttatcg cttactttag aaacctagta gcttgtatat aaagctacta ataatcaaaa 8281 gagcttgtgc tcctataata gggccgcaga tacgaaaact tgacactccc gtttctcgcg 8341 gctccgggat tcttgaatca taggggttct tgtgaactta cctcctgcgc gcatttttac 8401 agtctgcccg acggctgcac accctctgtt gaggacgact tgagcggcta caacatctct 8461 atcagttgtg tagccacagt tcccgcaaac atgaacacgt tctgacaaat cttttttacc 8521 cgtttcagtg ccacagctag gacaaatttg acttgtcttt gtagtatcga ctttaacgaa 8581 actaaatcta ttttttgtaa aataatagat tttgctcctc tgagtgcgat ttattttata 8641 tatctttagg tttcggcaac gagaaagcga gggaaccagg ggaagcaggg agagcaaata 8701 cttcccctca tctcattacc tcttgggagt aggattggtt tgtccagaag gcgctgttgg 8761 tactttaggc aatacaggag aagctggatt aactggggtt acaccttggt tagtttgagg 8821 tggattgggt attgtgggaa gattgaagtt ggggttagtt tgtgggttga tgggagtgat 8881 gggttgatta gaaaaagatg gtgttgtgac aaaagatatg cgatcgcccc ccacttgtcg 8941 ccacaaatct atcattgata tgacatcatt gtatgttgct gtttggctga ctttcaaagc 9001 taaaatagcg ttgggagttt gttgcaggtg gctcttaaaa gtttctttga gctgttcccg 9061 ttttactgag tcgttctcta cataagtctg accaatagca tcaatagtca aagttaatat 9121 ttgccgttga gctgttgcac taagtttctt tgcattagat ccattagtaa tagatggcgt 9181 actggtagtt gatttaggta aattaatact aatttcctct tgacgagtga attgcaaagc 9241 tgccaaaaga aaaaatgtca gtatacaaaa aacgacatct attaagggaa tgatttgaat 9301 ttggacgtct tctactggag tctggagatt aattttcatt gtcttgctct catacctgag 9361 ttaggggctt ttttgttaat tatctttcgg ttccgacaaa tcagactctg agggaggctc 9421 aggtgtttca ctaaacctgg gtttgcctct ttttcgagga gaattcaagt tatctggtgg 9481 agaatcccgt atgattgtta caggtggtat tatattttca gatctcatct ttgtgtagtc 9541 aggtggagac tgcagataca gcaactccat atcattccct gccctgcgga aaattttaac 9601 ttggttaaca acaaggcctt gaaataagcg gtagaatgcc aaactgacga tcgcaacgat 9661 aagtcctgtt gcggtactaa tcagagattc cccaatacca gtggtaactc cggctgtaga 9721 
ttcggttcct aaatcaccaa tgcgaataga gcgtagagat tgaattaacc ctaaaaccgt 9781 acctaacaat cctaacaagg gcgcaagagc aatcacagtt tccaggagtt tttcgccccg 9841 tcgcattcca gccaattcgt cttctgctgt tgcttctaac gccagtcgga aactttctag 9901 atcacttttg ggaagactta gaggtgcata gagaaaccgt ccgatgggct gctttgttgc 9961 ttgtctggct atgtcagcag ccgacatcca atctacacgg gctgcatcca aaacacgagc 10021 aactatttcc ttttcctggc ttaaaattcg taaccagaac cacaaacgct caaaaatcac 10081 acttaaagat aaaatcgaca gagcaagcaa tggccacatt gctgatccgc ccttcttaaa 10141 caagtctaca atatccactg ttgtaggttt cctcctttca tttgtagtac atttgtcctg 10201 aagcacgatt attttagaga ttaaagcaca actatcataa tcgtttacca ttcaagagtt 10261 gagatgagtg caataactta cgagtagcat cttccagacc aactagggcg gttattccca 10321 caggaagtgt aggtcataat ttgccagaat taaaagtaac tggaggttac aatctatgaa 10381 attttcaatt caatcacttt atacttggta ccgcaatgta cttcgtaacc ctaagtatcg 10441 ttggtgggta attttaggaa cactacttta ttttgtcagc ccaattgata ttgctccgga 10501 cttcattccg attgtaggag aacttgatga tgtctttctg ttgacattgc tagttactga 10561 gctgtctggg ctaatgattg aaggcttcaa agcgcgtaaa ggtcaggttg acgctcaagc 10621 cactaatact acctctaata ctactaccga gggtcctact gctagcccga atacaatcga 10681 tgttgatgcc gtttctgtta agtagtattt ataacattat aaaacctaac aacccccctc 10741 ctttttctag gtgctgacaa atgcactttt gctggaaggg ggatattttt ttagtcatta 10801 atcataagtt attagctctt tgggcaaggc agaaggtttg aatttgctaa ccactagcta 10861 ttatgtgttc ctatctagtt aaaatatcta ccaaatagag gatagacaat ttttaactat 10921 ttatttttct ttacctatga ctaagcgcca aattatcgga ctgctacctg ctggtggaca 10981 agcaacacga atttctcctt taccacttag taaagagctg tatccgattg gttttcagga 11041 ttttggtgtt aaatctaact ggcgtccaaa ggttgtttct caatatctgc tagaaaaaat 11101 gcaactagca gggattgata aggcgtattt tatactgcgt tctggtaaat gggatatacc 11161 agcatatttt ggtgatggta cgatgctttc catgagtttg ggctatctga ttatgggctt 11221 gtcttatggt gtgcctttca ctttggatca ggcttatcct tttgttcaag atgcgatcgt 11281 agccttgggt tttcctgata ttttgtttca gcctgaagat gcttatgtac ggatattgac 11341 acgccttgag gttagtcatg ctgatgtcgt cttaggatta ttcccgactg 
ataaacctca 11401 gaaagcgggt atggttgact ttgacgatga aggtagagta aggctgataa ttgaaaaacc 11461 tcgccagtca gatttgcgtt atatgtggag tattgcagtt tggacacctg ctttcacaca 11521 gtttctgcac gagtatctta cgactcttaa ggtaaatagt aatctatcgc agctaccaga 11581 aataccgatt ggcgatgtga ttcaagctgc gattaataaa ggttttcacg tggaagcaga 11641 agtatttgcc gatggtactt accttgatat tggtacaccg gatgatttag taagcgctgt 11701 gcggcaattt gctgctttgg ttggtgagga aaatttgtag aggcgatggc ttaatctggg 11761 tatgtcagat gagttgccgt aattcatcta gagtactcac cgttaggata gctcgctgaa 11821 ttgcttttaa tcgctctaaa tcggaaattt gggcgatttc tggcatcagc tgtaatcctt 11881 cagtgccgaa ttttacttcc aaacccattt caatactcga cagcttctcc tctcgccgtc 11941 ctcgttcctc gcctcgtgct tctcctcgtg cttctccttc tctcaaaatt tgctgatacc 12001 agggagattc gtttaacact gccatatccc acctcataat ttgctgaact aatgcactgt 12061 ctaatacgaa gctagcaaaa aatgctagaa cggtttctaa ctgattcaat tgctcgtctg 12121 cgcggagaat ttgcaaggct tgttgtattg tggtttcttc cgcaccaccc ttgagaattg 12181 gtacaaatgg aagcaaagat aacacgggtt gttgaaaagc tatttccaca tcgacttccc 12241 aaagatttat tacgcggtag tcttgacgcg cttgtaaacc tgcaaattcc gattgatacc 12301 gtgtaggaat ttcaacatca ctttctttga ggatgttgat gagtacggga tacgtgggta 12361 ggttgtactt ctcttctgcc agtgctgcgt atgcacgcat tcgcttaggc atttccggct 12421 tgtatcgcaa ttgtaactcg ttgagtacga gaaattgccc atattgggga ctttcggcgc 12481 ggattaatac gtcactctcg cggctaatcc attggaattc tgagttgaga atttcttggg 12541 ctgtgatgtc ggaaatttgt gtgacccatt ttacccagtt atcgggtgcg aggctgatta 12601 attttttggt gctgatgtca gcggattttg gcatggggga gttgatgcag gagatgagat 12661 cactaatttt actcctgtcc cacccaaaga atacctaaat gaaagaaccg cagaggcgca 12721 gaggtggcag gtgctaacag cgggtttccc gacttgtaga cgcccgaagg gcggcttccc 12781 gaagggtagc acttggcgtg gcgcagagaa aagacatcaa ttagataggt tatcttatag 12841 gactcaaggg agtagttggt taggggtagg cgatcgccat acactgagca tccgattacc 12901 cccccaattt ctgcttattt ttaagcaata ttgcttattg acaaatgcat atcaccttaa 12961 aacagataaa accgcctctt caagccccgc gaaagtgtaa agccattttc ctgtttttaa 13021 gttccataga cgaactgttc tgtcattgct 
tgcactcgct agggtttggc tgtcgggact 13081 gatggcgata gcttcgctgc tgccttcggc agatcgcatt cactgcagct gtatgtcccc 13141 tgagagtctg tacacatctc caagtttcag ataatgcttc tttgggaaag acaacttctg 13201 ctttaatatt aggaggagaa gtttgagctt ttttgtgcaa ttctttcttg agcatctgta 13261 actcatcctc aacttccaga gcggcaaatt tttgattgag atcttcacca gtcaaaaatg 13321 cgtctaactc gaacacagca gcagaataag cttccatctg caagattttc tcttccatcc 13381 tgtcgaaaac ctgtgctggt gtttctcttg ttccggtttc gtctagcaac ttggcaatac 13441 cataggcggc gagtccaacc acagcaccaa ttcctgccat cgggattgtg ccaattccaa 13501 aagctaaacc tattttgggt gcgactaatc ccataccacc aactacccca gaaacaccag 13561 caccgcctat tgcaccaacc cccattgcac cataagcagc tgcatctccc tctgctattg 13621 ccttgaaagc accataagca gcagcaccaa caacagcacc agcaccaaca acaggaactg 13681 tgccaattcc aactgctcca aatcctccgg ctaatcccat cccgcctact gttgtggaaa 13741 taccagcgcc tgctatggtt cctcctgcaa taaatgttgc accgattgtg gaattttgtg 13801 tcacttgttt ttgtccttgg tcttactttt tcaacaaacg cgctccttta ccacggttgc 13861 aatctgtaca cagcgtttgc aaattgttga ggttattact gccaccttgg gaaaaaggca 13921 cgatatgatc aacttctaat tgcacttgct ggctgctgcg accacaaaaa acacatttgt 13981 aactgtcttt gtgtaaaact gacacacgaa ctgatgcggg aatatggcgg gagcgtttac 14041 tgctatcttg ctttggttga actcgggatt tgatagcagc agactttttc acctttggct 14101 ttttagattt atcagctttt tcactgttag cctgatgaat caaatcccga ataaaaccac 14161 gcactaattt ctggacagga ttgggagtct gttttgtcac gacttatcaa acagcttttt 14221 caaaccataa gcagctagtc caactaccgc accagctgct ataacaggcg cagcaccaat 14281 agcgacagct gttcctccaa ctgctaaacc catacctcca actacagcag aaactccaac 14341 tccaactgct gctcctcctg caactgcact tatagctgtg gcgtctccct cttctattgc 14401 cttcttagct ccataagtag ctgtgccaat aactgcacca gcagcagtca caggagccat 14461 gccgattgct acgcctccga aacctccaac taatcccata ccaccaacta ctgcggatac 14521 accagcaccc gctatagttc ctccagctac caatgctgta ccttctactt tatttttcag 14581 ttcttcgttc tgattatctg tcattatttg atttacgtat gcttaagtat attacatata 14641 acctaaacaa gaaaactgtt actgtccagg aataatcaat aaaatattat ctcatcttct 14701 taaagtttgt 
ttaaagatat aagtagtgtt tttaacatca aaatttcact tggcacagat 14761 gactttagaa attccagaca taccttgcag tataatctgg aattctgtca acttatttac 14821 aatcaactct aataacaatc acggtgagta ttcatttcta ggtagacaag aaaatgaaag 14881 aacaatagct tctacctaat taacaatctc ctcgtattga gttgaatgtg acattttatt 14941 cccgttttcc tgctgctcat catcactgtg gttaagagcg ctatgcttag tcttattaat 15001 catgtgatcc atgtgtggtg ttttggcaca aactagcagt agcgccccca aaataaacag 15061 taagccacaa atcgatgcag cctgaaccca agtaatttcc agtgctccgt gaacaatgac 15121 ttctcgcagt acagatacta ttgatacttc aactgctacc cctactgcga tgctatgctc 15181 ctgtaagtac accattagga gtcggaataa ctcgaccaaa atcaacaaaa acagtatttt 15241 tgcagtcact tgttgatgat ctaatggcag tgtaagctcg gtaaatatct cccacaattg 15301 cataaccatg taggcaaaca aaccgaaaca cagaacaatt acaattaagt cttgaaaagc 15361 ctccatattg cgaacaattc gttgtctgtc aatccaacga ttgcgaaata aatattgact 15421 cttctggcgc ttatttattg gcattttaat tctcaccggg acaactacag agcaaattaa 15481 tcagtctacc taatgctatt aacaagaatt agagtttctt tgagaattat ctgaatcaca 15541 agcatcaaac aaaagttatt tgccattgct tgcaaaatct cttcaaacaa gcttattcgt 15601 tattttatga tctacataaa atgacgaaat gctttaaaaa tatagctggc acaatctcac 15661 aacagaagtg tggtacaagg aacacttttt atctcttgtg aaagattatt tgagactaaa 15721 atctgggttt aatccccctt aatgttatag cccgcgtagg ctggcttggt cagtgtagct 15781 gaggtagtac atttcgcaat tcatccccaa gatgtggcat tctggactaa aggttctgca 15841 atgcgaaagt ctgcttgtgc taatgcagct aacagtatat ggcaatccta tttgatttgt 15901 gtgaaagtgc ctgcgcgtag cgcataaatc gggcactctc agattcccga caacttttgc 15961 gaagtcggga atcttgttgt tcacgaataa tttaggactg ctggagagtt tttaaaaatt 16021 ttgttttcag aaggtgcctt aacgaaagta cggcacccta cgtcttaagc tgcgatatct 16081 tgcccaaact ctatcaagaa cctgaacgct ttcacaacat agctggcgca ctctttggga 16141 gaggtgttgt aaaacgctcg ccatgcactc gctttatcct catcatagcg atcgcccctt 16201 ttcggatgac tttctaacca tgctaatcgc acggcatgac caattaacac atccaaaacg 16261 atgtattctt caatttgtaa gttggggttg ttggatgcaa ttgctgccaa ttctgtcaat 16321 gcctcaatat taacttgtcg gtattcggga gcctcaattt tatttagcaa atgttctact 
16381 tgcaacgcaa aattcttttc tcctgctgtc atttctgaca aaatgacttc gctatctaaa 16441 cggttgcggc gttctagctt atcgccaatc actaagcctt tgcaatgttc catcactagc 16501 caaacttgct tgaaaaagtc ttttggaacg cgtcctgttg ctccctcggc ttggcggaac 16561 cgtcgccaac cgcctgatgg tacatctatt tcatctcctt tggatggtgc taccatccaa 16621 tctatgtcac tttctttttg tttgacgtgc aaagattctt gctgacgtaa tagctgactc 16681 aagctgacgt attcaaataa tacctgacgt actcgtgttt tcacttccaa aggtgacagt 16741 tgcatgaggt gttcataagc ttcatcctga gtaaggctta actcttgtac cagttcgctg 16801 gtaatcaaca gaatcatata gccaactctg aaggtgagta gtcctttaaa cagttctggt 16861 tctgtcttga ttaaagcact gagataaatc agaatttctt gagtcagaac gcgatcgctt 16921 atatcttccc gacaaaagtc attaattttc tcaacaattt cgctatgcga cagaggaaga 16981 gtgatgagtg aatcttcact gtaagcttta cccacagcaa tttgcttacc gcgcaccaca 17041 atacttgtca ccacgtccga caagccaata tcactcattt gtcgtaaccc agcagcacgg 17101 cgcacaactg cccaaacgcc ggaatcacct gctttagtgt aaacttcatc cagcaaatct 17161 gccacagtaa cgggacgtcc cggtccacca aaacctgtat caaattctaa gccttccaaa 17221 cgaactaaag tttgcaataa ctctatttgc tcgtagagat tatctgatga atgtaaagag 17281 gagagcaaca atcctaagtt tgtttcacac tccatttgga attcttgagt atgctttaga 17341 agccaatttc cttctggatt gtaaatcaga tacgagcagc gaggcgcagc atctttgatg 17401 gaagatagag aatattcaaa atcttcttga agaaaatcaa tccgttgtgt tcctgctgtc 17461 aacatcaact gattaagacg tcccaatttt acgcgtacac cgttacaaac gccttctttc 17521 agttcttgca tgagagcaag taatgcctca ctacctgttt ccaacatggt gtgagtcaac 17581 attaaagtga ggatcggacg ccctaaatca atccagtatt tttgaatgta agcgagttcg 17641 cttttaattt gatacaccaa aaaatggtaa tcaagagtta agtaaaacac ttgagcatcc 17701 aaaaacgagg gcaaaaagac aactgtctcg ccacgaatac gaaaaatctt agatgttgtc 17761 aaacttctca aacgtcggac tggacgtcct gttaaaccaa gtttatcgtt acgtccaact 17821 tgggtataaa tagctgaaag atcatctgct tgtcggactt gaattggttc gacttgcttg 17881 ggtgtttgtg tttcaatccc atgaactgcc agttttgctt gtaaatcttc atcttctgca 17941 agtaaagcga tttgtacgac agcttgctga ttttttccga tgcataaata ccttcccaaa 18001 gggtcaatat ctccaacagc tatcaagcct tcacttaaca 
tttgaccgag aaagtacaaa 18061 ctctgcgccc acacaagagg aagattctcg ttaggtaaac gcggctgact ttgaggttct 18121 aatttttctg cttctacaga ttcttctgga acataaaaaa gttctggaac tagatgtaaa 18181 ccatcacgct caatcaataa agattctaaa cgctcttggt aatgtttcac ttgctcttgt 18241 tcgccgcgaa acaaaccatc tagaaataaa taggtgaaaa ataaaggcca ttcacactca 18301 atatgctcga attgtttgag ttccaatggt tcgtagtgca agcggttttt gtcttctaaa 18361 actgtctggt gtccatcacg cagaaagcgc ttgcagccgt attttccttc gagtttgttg 18421 ataatatcat tgcgagtgcg ttctcgcagt tccacatctt ctacagcaaa agctggaaaa 18481 ctaataacgc tcaacagcgc cgcatcgatt tcttttgatg acgattctct tggtaataga 18541 gattctagcg taatcctggc acgagcgatt tcatctggta gcgcgtgaat aaccgatgct 18601 tgacaaccat gtacgccaaa taaatccaaa ccattaatcg cttccaatgc agcttttgcc 18661 atacctatag aactggcatt taattctgca ttgccatgat taattttatt acctcgttcc 18721 caaataccgt aatctggagt acggtacgct cgtccaatgt agtaaaccaa attttggacg 18781 aaattcacct cgtcaaacgt gtaaatgatc tgcaatcccg aagcagtcat ttgcgccaat 18841 atcaggataa atattgatgt cgcatctaat tgcaaatgtc cccattcgtc atcaccaaca 18901 acaatatcac ccgtagcagt gttgtactta gcgtgtaaag catccaacgg agactgagtg 18961 tgtttgaatt gctctaattt agcactttga cgcatcatcg caaacaacaa gccccgcatc 19021 agcttaacaa cgctgtgttc tagctcatag gtgcgtcctt tgtcttcgtc aactttgcgg 19081 taggcaatcg ccaatcccca cactgccaga atactgtaaa cgttatcacg tacccaagca 19141 tcagtgtaat cgccgtgggc agttatcgcc gtacttgctg gtaacaaacc tgtaattggg 19201 ttctgacgag caaggataat tgtcttaatt tgctgataat aacactcaag acgagcttgc 19261 agttgagtgg ctgttttcat agttgatttt ctacgttgtt tccggaatgc aaccctgaga 19321 cggtgaagta aattatgtca gcaaagccaa aactaccttt gtagtaggca gttgttttgt 19381 ggatcgaaat gaaaaatgaa tcatcgtctc ctgtcaatat acaagttagc tttcccattt 19441 ttaaagaaag atactctaaa tgtcaaatgt tttcaaattt ttatcagcaa gtcagctact 19501 ttctctttat cttgaaatta gtttgattaa tactcaacaa ggaaaaactt ctacataatg 19561 ctcatcagac gcgggaatta actagataaa ttgtagttgc taaaaacaga aaacacatat 19621 agaattactc tttttgaaaa agagttaatt tctgtttata cgttataacc aaaataattt 19681 ttcactaagt aaaagttgtt 
tactcccaag ttaaagtcaa cggcaagact gaactgatac 19741 caatcgagtt atatgccgat ggctcacttg agccgattat tgtataaaaa ctaaaatttc 19801 agattcctcc tcggattatc ctcccataac ctttggttat gggaggttac ttaatatttc 19861 ttgcaagcaa cgttcggtaa tatcaagtgc agcaaataag agctcgcttc gcttgctcgc 19921 agggagcaag ggagaaaaag aattattgtc cattagctct tttctccccc gctcccttcc 19981 cctctcccct ctctcttttg cttcttcgtt gacaacatgc tccaaaagtg tggaatttgc 20041 cgcgaattct caatctacat ccactggctt atggttgtga tctcatatag gagcagtaga 20101 aaaggactac tccgcaagtt gtacgcgaac gccgcctttg cgctcgccta aagtgaaaat 20161 tttaccaaaa aataatatta tcactagctt caggtaaaaa tgtttacttg agaaagattc 20221 tatttaaatc aacttgtacg cttgacacag atactcagag tagtagaatc tacacaaatc 20281 aacaggcatc agggtgatga aagattcaac tgaagtggaa ttgaactttc attttctgac 20341 tttcgacatc taccggatta cactattccg ggttaggatt atgaattagc gatgccattg 20401 attggtaatc gatgttgaat ttctgagatt ttatggaccg aagtccacaa gacggcagta 20461 ctaacctcct ctgatctggc gagtttgtct cccagcgcgg gggaggcatt tggaggtatt 20521 atcatgctca tgcaagatgt tcgcgattcg gttattgaca ttgctgactt aaaccagctc 20581 aaattggatt tgaatggtac acaaccggta gatgtgggag agtacatcac acaattgccc 20641 gaacaacagc gggcgatcgc attccgttta ctcaataaac accaagcaat tgatgtcttt 20701 gaatatctac cttcagaagt acaggaacaa atcataaatt ccctgcacga tgttcaagtt 20761 gcacaaattg tcgaggcgat gagtccggat gaacgcgcag aattgtttga tgagttaccc 20821 gctggggttg tcaaacgcct gttgcaagaa ctgagtccag aagaacggca agcaacagca 20881 acgattctcg gctatgctga aggcaccgca ggacgcgtga tgacaacaga atacgtgcgg 20941 ttgcgggatg gattgactgt cggagaggct ttgagtaaaa tccgccgtca ggatgaggac 21001 aaggagacga tttactacgc ctacgttacg gacgataacc acaagctggt gagtgttgtc 21061 tcgctacgac agctactgtt tacctttcct gaggttctga tcacagatat cgctagcact 21121 cgcgttatca aagtcagaac acaaatgccc caggaagaag ttgcccaaat catgaagcga 21181 tatgatttaa tcgctgtacc tgtggttgac cgagaagaga gactggtggg tattgtcacg 21241 attgatgatg tggtggatat tttggaagaa gaagccacag aagatattca aaaactggcg 21301 ggtgtgagtg gtggtgatga acaggctttt tcgtctcctc tgataactct tcgcaagcgc 21361 
ttaccgtggc tatttggagt tatgttgctc tatattggag catctagtgc gatcgccccc 21421 tttcaatcaa caatctcaat ggtgccagtg ctggctgtga ttatgccgct tttctccaac 21481 acaggcggta ctgtagctat tcaagcctta acagtgacaa ttcgtggact tggtgttgga 21541 gaagtaacac cccaggatac tttcaaaatt ctccgtaagg aacttcaagc aggactgggt 21601 acagcggtag cactaggact gactatgatt atgctttcct tgatttgggc accacctcac 21661 gaacgttggg tgtcgttagt tgcaggaaca gtcatggctg tgaatactct tgtggctgtc 21721 actttaggaa ctttattacc aatggggttg aagcgactaa aacttgatcc cgctttggtt 21781 agtggacctt tagtgacaac aatactagat gccgttggtt ttatgatttt cttaacactg 21841 atttctacgg ctttgcatat tttccactta aaaccatgag ctaatagact cagaggacgc 21901 gaagcatttt ccttgtcttt gtgatgcgtc ctctgtgaca ctcctcagtc taaccgaatt 21961 gggcgagttc cagggacttt gtggacgata ggcgttaagc gtaagcgcaa gcgcacgcca 22021 agggcgaacg cgtagcgtgc gcctttggcg cttagctctg ccgtaggcaa tcgccgtgcg 22081 gtttattgaa gttaagggaa gatctacagt cggtgaagtg gcgctgacga ctaacgagta 22141 taaaacagcc gaaagattaa aacaggatta ttggttgtac gttgtgttta attgcgcttc 22201 tactccagaa gttcatccta tagaagaccc agtacgataa gtaggggaac cgttagtgaa 22261 gattgagcat tatcacgttt gggcccaaaa aattttaaca gcacaatagg ttgtatattt 22321 atgcaagtac aaacacaaaa acgctactac acacctgaag aatatttaga acttgaggaa 22381 aaagcagaat ataaaaacga ataccgcgat ggagaaatcg ttccaatgac tggaggaact 22441 acaaatcata atgaaattgc actgaattta gcagcttcct taaaatttgc agtgaaaggg 22501 caaaattatc gagtttatat tggtgacgta cgtttatgga taccccgtta tcgtcaacat 22561 acttatccgg atgtgatggt aattcaagga caacctgttt atacgggaac aaatacaact 22621 accgtcatga atcctttgct gattgcagaa gttttatcta aatcgactaa gaattacgac 22681 caaggcgata agtttctcta ttatcggtct atccccgaat ttaaagaata tattttaatc 22741 gaccaatatc actatcatgt tatgcagtat gtaaaaactg cagaaggtca atggtcattt 22801 actgaacttg aaggggaatc tgcaacctta tcacttcaaa cgattgattt taaaattcta 22861 ctgagcgacc tttacgagca agtagacttt actgtcagca gtgaagaaga ttcattatct 22921 taaatattat taacactaat ttatgcttaa agtctttttt agaaaatgct atgaggatca 22981 agtgcggtta tttcagaaaa ccttctaaaa tcacttgtgt taaacgttag 
cagatgagtt 23041 agattatgaa ctatcatcgc cgcaactaat cgagcatcat gagaaggttt cccggtgact 23101 ttatatttag cgatgagact ttcccaaact ggaaaaattg atgctgtatc tggtagaagc 23161 acgaaacaat ttttaagttg ctctagttcg tttaaagcat ccgcaaccga taaacctaat 23221 cgattaactt ctattgggcg agttgcaacc acccaaaact caatcagatt ttgcggtata 23281 atacaaagcc tcttcccttg ctccaacaaa attcttactg cttgaacgga agatttatgc 23341 atcgaatgag tctcttgggc actccgcaac aaaacatttg tatcaaccag gtagctcata 23401 ccatttcatc ttctctggtg taaatacttt cgcggctaat tgcatcatcg caaagaggtg 23461 gcgcttttat tgatgagtgg cttctagccc aatttacaaa tctttccgct cgttcttgag 23521 gtgatagtgt cagaggtgtg ggttgcgatg atgcaagcaa gctttcaact aaagttttca 23581 acaatacttc aacggatgtt ccttgagctg cggcttgagc catcaagcgt gattcaattt 23641 caggttctag ttcaagtagg atagccatag taggtgtcaa acagttaaag aaataaaaga 23701 aatcaattaa aattagtata ctttaccata tttatcctaa atgataagcc ctacaggcac 23761 gctccgctta tgatttagga ttgctatagt tataagaaac aacttggcta aaaatccaaa 23821 cctacaactg ttaactcagg tggatattca acaacaaact gataatatat gtcacgttta 23881 gcttctggtg gaagaactaa agaccattcc aatatcccca tttcaccaag ctgaatttgc 23941 ggattactac ggttcagacg gactttaatt tgttcgttgc gactaattgg taattgttca 24001 atgactttta aatttgcttc ttgattttgt aagttagtta taactatccg ataactataa 24061 gtaatccttc gattgttgcc aatcagtttt ttgtctacct gacgttctac taaatcacgc 24121 tcaattttga aaccttcgtc aattcctaaa ttcagcttga actcttgccc tggtgcaaca 24181 ttctctaatt gagttgttcc cacaaatgca ttatctcgga aaatattcgc tttccctggt 24241 aacaaagtta caccattgct actatttttc acatttgctt gtagataagc aaagctaatc 24301 aagcgtggta ttgccacata ctcaaagcta catggaaaat cgtcgttaaa aattgtcgtt 24361 ttatgaggtg cgccgtcact gggaatgttt ccactactct tcacctcaaa agtgactgca 24421 cttccttctc tagaaacttc tgcaatcagg gtttgtgcac ttaccaaatt ttcctctagt 24481 tgttcggttt ctagtgttgt gctaccatct ccactggcag aaggagccat agctacagaa 24541 ggagtttgta ttggtgctac tcttcgcata cgcaaaacct cgggtggacg cagtatgtcg 24601 atataccaag gttcaagttt gggtggtaaa gttcccagtc caagttttgc ggtagacaga 24661 gtcagagata cgtccatcca atcttcacca 
gtactttggg tgacttcagc aaggtaggtg 24721 agattgatgg aattgctact tgtattgact cgcaagtcat acagcggagt ccaacgagcg 24781 caactgacta cgtaagagac ttctaactca aactctccag cacccgcagg ctcaattgcc 24841 acactcaagc taaaactttc cttaggtgat ggtgtttgta cttgttgcca ttgttgacgc 24901 agaacttgta gctgcttgtc tagttcttgc tgttggactt tgcactctcc tgtagcaata 24961 gcatattcgc tgtactggct tcccataaag ttgagaaaat ccaaagtttc gctcaagcta 25021 aggtttttcc gcgacagact ctgtgcaaaa ggttcctctg tcttttcacg taaaccttca 25081 ataaactttg actgcaaagc caaagcatcg atttgtgctt gcaggaggtt ccattctgct 25141 tctaattgct ggatttgcct tgttaaatgg ctcaaccgtt cggcgaaggg ttccgtggag 25201 aaaactcgct ctgtgctgac tccggatagg tgtacgacaa cttcacctgt accactaacc 25261 ctgactgact cagtttccat agtcactggc agttggataa tttttaattc tcgttcctgt 25321 ccggtgagtg tgatcacccg tgaaggtcgt gcactactaa caggtgcgat acgctcgctg 25381 tcttgcctcc ggcttatcgc acctcgtcgt gtcacaaggg cttggtcagt atagactgtc 25441 acagatataa tttcagtttc tactgttttt cgcacagccg atgcttcctt tgttagtcac 25501 tgtcagtttt ctctcattgt gcatgctttt gctggttatt taatggtatt gtggactgtc 25561 ccaaatatgt cctccatgtt tgctgaaatg gggaattgct aaatcgttga tgatttcaat 25621 ccaaaggcag atgctttact gatttaaaaa tgttaataaa gtcagtctta taaataaact 25681 tatacctctc taacgatttt cgtctgcgtg aaaaacacta gaacttattg tttgatttgc 25741 aaattgcttt gcagcatcta ggagatactc aaaataggta cgagcaacga tagaaagttg 25801 cttaccagct aagtaaacca tgtaccaatt ccgttggatg ggaaagtttt ctacatccag 25861 aatggttaaa tctttactct ctgtcgttaa agtatgtcga gataaaatgg agattcctaa 25921 gccaccagca attgcttgct ttatcgcctc gttactcccc agttccagct taaccttgac 25981 catcacttta tgctcatcga ataatttctg cacagctcgc cttgttcctg aacctggttc 26041 gcgcataata aaaggttcat cagcaagacg ttcaatggaa atattttttt gcattgccag 26101 aggatgattc gctggtgcta taacgaccaa ggggttttct aaaaatggct ggtaattgac 26161 atccagatgt tctggaactt ggctcatgat atacaaatca tccatattgc tgctcatgcg 26221 ttccagaatg ccttcatgat ttgtcacttg cagggcgata tcaattcctg ggtaaagttg 26281 gcaaaatgaa cccaataaac gtggtacgaa gtacttagct gtagtaatga ctgccaagcg 26341 caactgtcct 
tgctttaacc ctttgagatc tgccactttc atttcaaatt gggcaattgt 26401 gtcgaaaatc tgccgacagg tggtaaacag ttcccgtcct gcttccgtaa gatagaggcg 26461 ctttcctacc tgctcaaaca aaggcaaccc aactgatttt gtgagttgct taatttgcat 26521 ggaaacagtg ggttgagtga gaaagagttc ttcagcggca cgagtaaagc taccgtgacg 26581 cgccgcagcc tcgaacacct tcaactggtg cagcgtcgct tggttcaagg gctattctcc 26641 tttaagattg atttttatag atataaatct agtatcactg aacacttgaa attgatatag 26701 ctttgctcat ccaaaagtta caagtgttga cgctctcagc ctgccttacg gctcttgatg 26761 ggcattcttg cttcggcaat ataagacaat acagttgctg ttagtggtat ttagttttgt 26821 aatat // LOCUS NODE_1114_length_26675_cov_5.116379 26675 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 26675) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 26675) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..26675 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(627..1397) /locus_tag="DP116_09795" CDS complement(627..1397) /locus_tag="DP116_09795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell-cell signaling protein" /protein_id="PRJNA477356:DP116_09795" /translation="MSFLEELNNANAFIIGASQGIGLGFVQKLLQDDRFAKIYATYRQ PESAADLLTLEGEHPDKLTCLSMDITDESQIIDVVQKISAAVNKLHLVVNCVGLLHEG ALQPEKSLKRINSEYLMHYFQVNSIGAILLAKHLLPLFRHHERSIFASISAKVGSIGD NQLGGWYGYRASKAALNMLMRTAAIEYKRTSPKTVIVMLHPGTTDTRLSRPFQANVPP EKLFSVERTVNQLFGVIEQLQDGDSGQFFSWDGSRLPW" gene complement(1727..2785) /locus_tag="DP116_09800" CDS complement(1727..2785) /locus_tag="DP116_09800" /inference="COORDINATES: protein motif:HMM:PF01764.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09800" /translation="MFNKLKQLITNQQNVMETLQDDPFYFQPQAVGYHPNNAYTLGLI SSLAYKNGSQLNPNLQKDVPRWISKFNVAALALDKTKPNWFDCDTWSLTDTQAIVLEN QEVVIIGFRGSQEIIDWLTDSQIIQRTKGPGGYGVHFGIYYALMSIWDKIEPYIKNKG DKTLWFTGHSLGAGLAVMATAHCLFELNIIPNGLYNYGQPKVGSEDFVEAFNKKFVNQ TFRFANNNDLVPFLPLSQSDLSVKVPNLVKYLPRTQKLNIVNAPTIIDYYHVGHLKYF DKDGVLHDEGLGIGDKWLDRIAGHLNSLLKIAPGSLTVGGLFDRADLLGDHMIKQYIV NLKKHREEWEMNKKSVSS" gene complement(3082..4464) /locus_tag="DP116_09805" CDS complement(3082..4464) /locus_tag="DP116_09805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457321.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE" /protein_id="PRJNA477356:DP116_09805" /translation="MSEIFATTGTIAAIATAVVPQQGSVGIVRVSGSQAIHIAQTLFC APGHQVWESHRILYGYIRHPQTQQLVDEALLLIMKAPRSYTREDVVEFQCHGGIMVVQ EVLQLCLENGARLAQPGEFTLRAFLNGRLDLTQAESIADLVGARSPQAAQTALAGLQG KLASPIRQLRAIALDILAEIEARIDFEEDLAPLDDKVIISEIERVSAEISQILATADK GELLRTGLKVAIVGRPNVGKSSLLNAWSRSDRAIVTDLPGTTRDVVESQLVVGGIPVQ VLDTAGIRETQDEVEKIGVERSRRAASAADLVLLTIDATAGWTALDQEIYSQVQHRPL ILVINKVDLASPEMVEYPTNINHVVKTAAAQNQGIDALETAILEQVKSGKVQAADLDL AINQRQAAALIKAKTSLEQVQATIAQQLPLDFWTIDLRDAIHALGEITGEEVTESVLD RIFSKFCIGK" gene 4823..5665 /locus_tag="DP116_09810" CDS 4823..5665 /locus_tag="DP116_09810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128705.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09810" /translation="MDELRAALELASDEELQDLTAILFSRKFNPLDYVHTPEPIEVQS KGRKAWLDALEQRFRFLAADGITVLRGRSRQVTYRQALIQVCKYLKIPYSSELTTVDL EAEVFLHLLGQVWRKLPEKDKQKLTVRIQRQLAKAELTEPLPLTLQRDPLGLLLKGGS ALAVTSVLQPLLLKQIARQFAMHFATYQVAKEAVIQGSGAVANQFQNYATLQMAKQGM SVNAARYGVTRSVFAFLGPVMWSWFLADLGWRAIATNYGRIIPTIFALAQIRLTRAEC WEPA" gene 5647..7038 /locus_tag="DP116_09815" CDS 5647..7038 /locus_tag="DP116_09815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="polymerase" /protein_id="PRJNA477356:DP116_09815" /translation="MLGACLKPAFRHPNPSLQSSWNYAQLGILVFPLLPFLGAVGIVL AILGSWFTQYRAIIRRPLHWGFALLSVFMLITVSFANDKTAAFLGLFNFLPYFLVFAS FSTLIQTPVQLRHMSWILVLGSLPVVILGFGQLFLNWSSKIRILWFVVEWEIEPGGQP LGRMASVFMYANINACYLMIVFILGIGLWLESYRRLRRHVDNSKREQHRERVTEGGGK LPSLPQSPYRTFFFLTGVMIAIFVALILTDSRNAWAIAIFACLAYAVYEGWHILVAGV AGIVASVLCAAFAPLPIAQLFRKLVPAFFWARLNDQLYPDRPIALLRKTQWQFALSLT QQRPWTGWGLRNFTPLYDAQMHIWLGHPHNLFLMLFAETGFPTTLLFCGLLVWIFIAG VQLLQKLKSVDQAEDKLICFSYLLVFVGWVLFNTADVSLFDFRLNTISWLFLAAISGV VYRQNHYQRLQVK" gene 7416..7604 /locus_tag="DP116_09820" CDS 7416..7604 /locus_tag="DP116_09820" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09820" /translation="MVSRLGLKTMTGRTITLKGVKPEGIVVACEKTFIYMVLLNQQQE IVSFGSFLTWIVCVFKIF" gene 7657..7854 /locus_tag="DP116_09825" CDS 7657..7854 /locus_tag="DP116_09825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307870.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09825" /translation="MAFHKSLDLKWPDNVIPIFQPPNSPELNPIERLWEHIKYELSWE HCTTLDQLRQKLYQFAFKSVI" gene complement(8142..8798) /locus_tag="DP116_09830" CDS complement(8142..8798) /locus_tag="DP116_09830" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09830" /translation="MKSGGSNSLWEIFTAFQMVGLYSFDVDKYRGINVKVRFTLGIIG CTLLTVLTPTILTAQTLPYGTARQPQTCSSRVEPKKGAPSVEQAKMYFLCDQESQFGT PGQGGFSSFIRLISDLTLQIAPKSRPANVTDLEYNTNQRGEHLSINMDKPVYDIRGSY TNHTCYEIRGRSHLPGKNCNVEQYTSAGICFQNTFGDWHCRMKGSTKKVGDNLPPPEK " gene complement(9206..10704) /locus_tag="DP116_09835" /pseudo CDS complement(9206..10704) /locus_tag="DP116_09835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318000.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" assembly_gap 9353..9362 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(10925..11047) /locus_tag="DP116_09840" CDS complement(10925..11047) /locus_tag="DP116_09840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127337.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09840" /translation="MAGIVHHLPAMNALTTPNVKYYRCIHPHNWSGAFRCWAKA" gene 11008..11235 /locus_tag="DP116_09845" CDS 11008..11235 /locus_tag="DP116_09845" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09845" /translation="MHSLLVDDVQSQQYLFLPQQADVKRAMGNRALTGDRHFGKILVS HRSGAQAGSSFIKFYKKKKMAKAHTKYVKHP" gene 11380..13956 /locus_tag="DP116_09850" CDS 11380..13956 /locus_tag="DP116_09850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869998.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-glucan phosphorylase" /protein_id="PRJNA477356:DP116_09850" /translation="MQPIRTFNVSPSLPQRLEPLRKLAYNLHWDWNVETKDLFRRLDP DLWESSRHNPVLMLGTISQARLLEVVEDEGFLAQMDRAARQLEDYLQERTWYEKQRIG HASDEKGQEYYAYFSAEFGLVDCLPVYSGGLGVLAGDHLKSASDLGLPLVGVGLLYQQ GYFAQYLNADGWQQERYPINDFYNMPLHLERNPDGSELRIGVDYPGRTVYARVWRVQV GSVPLYMLDTNIEPNNSYDHNITDQLYGGDIDMRIHQEIMLGIGGVRMLKALGYNITA YHMNEGHAAFSALERIRILLQEEGLNYPEAKQVVASSNIFTTHTPVPAGIDLFPPDKM LYYLGYYADVFGLNKDQFLALGRENTGDLSAPFSMAVLALKMATFSNGVAQLHGVVSR QMFKNLWLNVPTEEVPIKAITNGVHARSCVAKSTQELYDRYLGPSWSSAPPDNPLWER MDAIPDEELWRNHERCRLDMIMYVRERLVKHLRDRGASPSEIASAQEVLDPKVLTIGF ARRFATYKRATLWMRDVERIKRILLGNKDRKVQFVISGKAHPKDIPGKELIRDINHFI DEQGLEKQIVFVPNYDIYIARLMVAGCDIWLNTPRRPREASGTSGMKAAMNGLPNLSV LDGWWDEADYVRTGWAIGHGEMYDDPSYQDEIEANAFYDLVEKEVVPLFYDRDVDGLP RRWVDKMKDAIQLNCPFFNTARMVREYAERAYFPASDRYHTLTADKYAPAKELAAWKD NLTAHWYDIKIKDVEVSSGADIEVDQIVNVTAKVDLATLTNKDVQVELYQGAIDADGQ IVNGVPVVMNYQGEDKDGLSIYTAEIVYNISGLQGLSLRVLPNHKYLSSPYEPRVIVW AE" gene complement(13978..14718) /locus_tag="DP116_09855" CDS complement(13978..14718) /locus_tag="DP116_09855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744736.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF429 domain-containing protein" /protein_id="PRJNA477356:DP116_09855" /translation="MKFLGIDLGWKSQPSGLCYLEFIDGKLQIQDIDRKESIADILTW IDICVTPEEPAIIGVDAPTLIPNATGSRLPDKLTHKHFGKYHAGCYPANLGLPFAERT VKFGLELEARGFAHAPTIEPQKPGRYQIEVFPHPAIVHLFGLERILKYKKGRLSDRRL ELIKLYNYITEILPSLHPPLCSLRLSCSFLPEIPTTGAALKQVEDKLDSLICAYVAAY WWYWGEQRNLVLGERTTGYIVIPSKSQK" gene complement(14901..15485) /locus_tag="DP116_09860" CDS complement(14901..15485) /locus_tag="DP116_09860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316329.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_09860" /translation="MSALILQLPPSLKLTDEEFEQLVAVNQELRLELDAQGELIIMSP TGGETGNRNFEFYIDLGIWNRKNNLGKAFDSSTGFKLPNGATRSPDASWITIERWERL TPQQRKKFLPLCPDFAVELVSESDDIEDTQAKMREYIENGLRLGWLIHPQEKRVEIYR PHVAVEVLNSPKSLSGEDVLPGFVLDLERIFGCC" gene complement(15520..15738) /locus_tag="DP116_09865" CDS complement(15520..15738) /locus_tag="DP116_09865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09865" /translation="MIRPKLLSRRQYRDVACNVSTEASQVTIEQTMQFAGCWEDMSDK MFADFNEEVITRRQQAFLVRRSHESSII" gene complement(15752..16003) /locus_tag="DP116_09870" CDS complement(15752..16003) /locus_tag="DP116_09870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741147.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09870" /translation="MTQSFLSSADNANAPIPNQTNSQASSKREPIKHTLVGSPKAVKS TIRVLHQLKYASIGDWSPLVPTGNTGEVMSILIRSILVQ" gene 16296..16556 /locus_tag="DP116_09875" CDS 16296..16556 /locus_tag="DP116_09875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006098510.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09875" /translation="MRILRRTIGCESDDGDSDVSKRFTVTLPDSVFEDLEVLADAQGR PTANLAAFLIEIGIKETKERGEFPEKPEKPKTSKAKKAKEEG" gene complement(16623..16811) /locus_tag="DP116_09880" CDS complement(16623..16811) /locus_tag="DP116_09880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09880" /translation="MPFQKNNKLGAKSFNEEPFDKSPLCFAVRKGVKEKLKAVPNWQE RLRKLVDELIEENGDDCQ" gene 16871..17947 /locus_tag="DP116_09885" CDS 16871..17947 /locus_tag="DP116_09885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948836.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_09885" /translation="MLLSIKTKLKLNEVQKTVMSKHAGIARFTFNWGLATWNSFVKDG LKPNKFILKKFFNNHVKPEFEWIKEKGICQKITQYAFDNLGDAFSRFFSKKGDYPKFK KKGHHDSFTIDASGKPIPVGGKSIKLPTIGWVRTYEGLPHTTCKSITISRVADSWFIA FAYEQEHEPTVKQHDVVGVDIGVKELATLSTGVVFPNPKHYKTHLEKLRRLSRKFSIK TKGSNNRYKAKIQLAKHHAKVANLRKNTLHQITTFLCKNHAKIVVEDLNVSGMLSNHK LAQVIADCGFHEFKRQLEYKAKKFGCEIIIADRWFPSSKTCSNCEHIQDMPLKERTYN CKSCGHSMDRDLNAAINLSRLAKA" gene 18076..18207 /locus_tag="DP116_09890" CDS 18076..18207 /locus_tag="DP116_09890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875212.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09890" /translation="MQLQTEDSTITVPVPAHDELRTGTLRSIIRQSGLPRALFEVDS" gene complement(18408..19313) /locus_tag="DP116_09895" CDS complement(18408..19313) /locus_tag="DP116_09895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952411.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase S1" /protein_id="PRJNA477356:DP116_09895" /translation="MASWASVNAQVNRVDTKSPTAAPPKFIKATDAGQFKIQGTGKPF QPSGLARSEKPIDDTRAIIGPDDRIPMISRKYPWSTIGRIVGESTDGNAYTCTGTLIA ENIVLTNSHCVINPETHQLSKRVAFMPNVINRELQDDNDTALAEKILYGTDFTNDAVT NQTNDWALIQLNKPIGQKYGYLGWKSLPSSTLIKNQKKFIFVGYSGDFPNPKKRGYDF LSAGPGWTASVQHGCSILRDEQNILFHDCDTTGGSSGGPIIAVINGQPYIVALNNAEI KRRDGTGVVNLGVKIDVLDRLSRGN" gene complement(19709..22688) /locus_tag="DP116_09900" /pseudo CDS complement(19709..22688) /locus_tag="DP116_09900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017718769.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 21693..21702 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 21959..21968 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(22940..23134) /locus_tag="DP116_09905" CDS complement(22940..23134) /locus_tag="DP116_09905" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09905" /translation="MNKKDESRQDICWILLNCARYSLTLLLGAVLLSDSVGAISHSSG LLVTNRFKGANTGIKNTCTH" gene 24548..25627 /locus_tag="DP116_09910" CDS 24548..25627 /locus_tag="DP116_09910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126862.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_09910" /translation="MRIAQIAPLWERVPPPGYGGTELVVGLLTDELVRRGHEVTLFAS GDSISLAKLESVYPRALRLDQTVKEHSVYEMLNLARVYEQADEFDIIHSHAGHISLSY ANLVKTPTVHTLHGIFTPDNEKLYQYAKNQPYISISDAQREERLGVNYVATVYNGINV SSYKFHPQPEEPSYLAFLGRISPQKGTHLAIQIAKETGWRLKIAGKVDVVDVEYFESQ VKPFIDGKQIQYLGEANHEQKNALMGGAVATLFPITWREPFGLVMIESMASGTPVIAM NMGSTEEVIAHGKTGFLCNNVEECISAVSKVTELDRSACWDYVWERFSVQHMTDGYEA VYRKILGEPFACSNGHLPNPVISGN" gene complement(25875..26381) /locus_tag="DP116_09915" CDS complement(25875..26381) /locus_tag="DP116_09915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875867.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09915" /translation="MNYLVAVLPDRLKVEEAYTALEKEGLPTNQIDILGRGYKSADEY GFINPNQQAKKGINRLVSWLIPFGFVAGYAFNLLTSIEVLPIGSLGNHLIGGLLGAAS GALGAYFVGGGVGLTVGSGDALPYRNRLNAGKYLIVIKGSQELTNQATRILRQFEPEN LQGYAEQT" BASE COUNT 7418 a 5551 c 5650 g 8026 t 30 others ORIGIN 1 taaaaataag catcaatctc tagaaagatt tattattctt gctaacactt ataaccatat 61 taaggattca ttaacttatc attaagtcat cccattcatc tcgaatcgga tgacttgata 121 ctcacaataa ttcgattgac agcacttagt caatgccaaa attattaagt atttattagc 181 ttctctggct tttgaactca tagaagttat gaagcgtaga ttcaaaaagc tttgagaaag 241 gttttgccta tttcaaacaa tgaggtgaac agttttagac tttttatgaa taaccatttt 301 ttaaccttaa ttgcctagat atttaacctt atattttgat accataatgc aatctgaaat 361 agtagaaaaa tctggagtag tgtagggtgg gcactgctaa tcagatcctg catactatat 421 tcgggtttta gtggcagtgc ccaccctacg tgtgtttctt tttatcaata accatttttt 481 aaccttgatt gcctagatat ttaaccttat ttttgatata tttaaaaacc tatagcaatc 541 ataaatattg ataaacaaga tccccgacaa cttctgcgaa gtcggggatc tgtcgctttc 601 agacaactct ttgattttca ctatttctac caaggtaagc ggctaccatc ccaggaaaaa 661 aactgtccgc tatcgccatc ctgaagctgt tcgatgacac caaataattg attaactgtg 721 cgttccactg aaaacaattt ttcaggaggt acatttgcct gaaatggacg agaaaggcgt 781 gtatcagttg tcccagggtg taacatgact atcacagttt tgggactcgt tcttttatac 841 tcaatggctg ctgttcgcat taacatattg agtgcagctt tggaggctcg atacccatac 901 catcctccga gttgattatc accaatacta cctaccttgg cagaaatact ggcaaaaata 961 ctgcgctcat gatgacggaa taaaggtaac aggtgtttgg caagcagaat agccccaata 1021 ctgttgactt ggaagtagtg catcaaatat tctgaattga tacgttttaa acttttttct 1081 ggttgcaaag caccctcatg cagcaatccc acacagttaa caactaagtg caacttattt 1141 actgctgcac ttatcttttg aacaacatca ataatctgcg actcgtcagt aatatccatc 1201 gacagacaag ttaatttgtc tggatgttcg ccttcaagag ttaataaatc agcagcggac 1261 tctggctgac gataagttgc ataaatttta gcaaacctgt catcttgcag taatttttgc 1321 acaaaaccca aaccaatgcc ttggcttgct cctatgatga atgcgttagc gttgttaagt 1381 tcttcgagaa aagacatttg ttttaattaa 
taaaagatca atctttttaa gattatgaat 1441 aatcagggtt gggtaacttc attacaatat agcaatccga attgaatcat gagaaaacac 1501 gtaaacttta catccaatgt aaaagtcaaa ttttattcaa ccttggcgat tagaaatcgc 1561 ggctatacag acgaaacccg cctacgcggg ttattctata aagtccacgt aggtggactt 1621 tgtttgtgta gtagcgattt ctaatcgccc aattatttct ttctcatgag gagtgattta 1681 ggattgctat aggacacccc aacccacaac ttgagttatt acgtgactaa ctagagacag 1741 atttcttatt catttcccat tcctctctat gttttttcaa attgacaata tattgcttta 1801 tcatgtgatc gcccaaaagg tcagcgcgat caaacagacc tccaactgtc aaagaacctg 1861 gggcgatttt taacaaacta tttaaatgtc cagctatcct atcaagccat ttatcgccaa 1921 tacctaaacc ctcgtcatga agaacaccat ctttgtcaaa atacttcaaa tgtccaacgt 1981 gatagtagtc aattattgtg ggggcattaa ctatattcaa cttctgagtt cggggcagat 2041 atttaactaa atttgggact ttgacagaaa gatccgattg actcagagga agaaaaggca 2101 caagatcatt attattagca aaacgaaatg tctggtttac aaacttcttg ttgaatgctt 2161 caacaaagtc ttcactacca actttaggtt gtccatagtt atataacccg ttcgggataa 2221 tgttgagttc aaacagacaa tgagcagtag ccatgactgc taaaccagct cctaaactat 2281 gccctgtaaa ccacagagtt ttgtctcctt tatttttaat gtaaggctct attttatccc 2341 aaattgacat cagagcgtaa tatattccaa aatgtacgcc gtaacctcca ggtcctttag 2401 tacgctgtat gatttgggaa tctgttagcc aatctatgat ttcttgagaa ccgcgaaaac 2461 caataatcac tacctcttga ttttctaata cgatcgcttg ggtatctgtt aaactccaag 2521 tatcacagtc aaaccaattt ggctttgttt tatctagagc aagagctgca acattaaatt 2581 ttgaaatcca acgcggtaca tctttttgaa ggtttggatt tagttgtgaa ccatttttat 2641 aagccaaaga agaaatcagt cccaaagtat aggcattatt tgggtgatag ccaactgctt 2701 gtggctgaaa gtaaaaagga tcatcctgca atgtctccat cacgttttgc tgattggtga 2761 tgagttgttt caatttatta aacatgacag atccttgtaa tgagttgcgt ttaacttagt 2821 attcccctat acccgcaact cctaggattt tggttatatc atgttaggat gaacacttca 2881 ggacttacgc aaaatcatga aaaattaacc gcaaagacac aaagaacgca aaggtaagag 2941 ggtttcaaag ggtttttgcg taagtcctac acttataaaa aacgaagaac ctcaccccgc 3001 ccttcgggca cccctctcct tgctaaggag aggggcaggg ggtgaggttg tttgattgta 3061 actaattagg cgaacttgat attacttacc aatacaaaac 
ttactaaaaa tccggtcgag 3121 aacagattct gtcacttctt cgcctgtaat ttcccctaaa gcatgaatcg cgtctcgtaa 3181 gtcaattgtc cagaaatcaa gaggtaactg ctgggctatt gtcgcctgga cttgttctag 3241 ggatgttttt gcttttatca atgcggctgc ttgtctttgg ttaattgcta aatctaagtc 3301 tgcggcttgg actttgcctg atttgacttg ttctaaaatt gcagtttcta gagcatcaat 3361 accttggttt tgagcagcgg ctgttttgac aacatggttg atatttgttg gatattccac 3421 catttcagga gatgctaagt caactttatt aatcaccaaa attaagggac gatgttgcac 3481 ttgagagtag atttcttggt ctagtgctgt ccaacctgct gtggcgtcaa tggttaaaag 3541 cactaagtcg gctgcactgg ctgcacggcg cgatcgctca acccctatct tttctacttc 3601 gtcttgtgtt tcccgaattc ccgcagtatc caatacctgc acaggaattc caccgacaac 3661 taactgcgat tccacaacat cgcgggttgt cccaggcaaa tcggtgacaa tggctctatc 3721 gcttctactc caagcattca acaaactcga ttttccgaca ttcggacgcc caacgatcgc 3781 cacttttaaa cctgtccgca gcaattcccc tttatccgca gttgcaagaa tctgagaaat 3841 ttctgctgag actctctcaa tttctgatat tatgacttta tcatccagcg gagctaagtc 3901 ttcctcaaaa tctatcctgg cttcaatttc tgccaagata tctaaggcga tcgcccgtag 3961 ctgacgaatc ggagaagcta atttcccttg taaaccagcg agtgctgttt gtgcagcttg 4021 aggcgatcgc gctcctacca aatccgcaat actttctgct tgagttaaat ctagtcgtcc 4081 attcaaaaac gcccgcaaag taaattctcc tggttgtgcg agtcttgcac cattttccaa 4141 acacagttgc aaaacttctt gcaccaccat aattccccca tgacactgga actccaccac 4201 atcttcacgc gtgtaagaac ggggtgcttt cataatcagc aaaagtgctt catccaccaa 4261 ttgctgcgtt tggggatggc ggatgtaacc ataaagaatc cggtgacttt cccaaacttg 4321 atgtcctggt gcacagaaaa gagtttgggc gatgtggatt gcttgtgaac cagaaacgcg 4381 gacaattcca acactaccct gctggggaac aacagcagtg gcgatcgcgg caatggttcc 4441 agtagtagca aaaatttctg acatttgcgc ccttttaacc aagacattca attctttctc 4501 gattgtagag actctcagaa gaaacagggt gtaggggagc cagtactgaa ggagggtttc 4561 cctccgtagg tatctggcgt gtgtaaggga aaaagctgtt tcttttatcc ataacgggtg 4621 gcgcgcctaa gagtgcgggc acccgaaggc tcatccaatt tcgctcacct catttaaacc 4681 cttgtgggta tgtctgaagc ccatttacac actctaaatt taacaaaaga aaggataaga 4741 gacacggtag acggctcacg agtccaaaaa gctaatagta gaataaagta 
cagaaaagct 4801 gtatggaacg gaggaaggga aattggatga actaagggca gcactagagc tagcaagcga 4861 cgaagaattg caagatctca cggcaattct gtttagtcgt aagtttaatc ctctagatta 4921 tgttcacaca ccagaaccta tagaagtgca aagcaaaggt cgcaaagctt ggctagatgc 4981 actagagcaa cgctttcgtt ttttggctgc agatggaata acggtcttgc ggggacgcag 5041 tcgtcaggtg acttaccgac aagcgctgat tcaagtgtgt aaatatttga aaatacctta 5101 ttctagtgaa ctgacaacag ttgatttaga agctgaggta tttctgcacc tgttgggtca 5161 ggtatggaga aaattgccag aaaaggataa gcaaaaattg actgtgcgga tacaacgtca 5221 gctagcaaaa gcagaattaa cagaaccgct accactgaca ttgcagcgag accccttggg 5281 gttacttctc aaaggcggta gcgcccttgc tgtcacttct gttctccagc cactgttgct 5341 caaacaaatt gcccgtcaat ttgcaatgca ctttgcgaca tatcaagttg ccaaagaagc 5401 ggtaattcaa ggttctggag cagtagcaaa tcagtttcaa aattatgcca cactgcagat 5461 ggcaaagcag ggtatgagtg ttaatgccgc tcgttatgga gtcactcgca gtgtgtttgc 5521 ctttttagga ccagtgatgt ggagttggtt tcttgcggac ttggggtgga gagcgatcgc 5581 cactaactac ggtcgaatta tacctaccat atttgcccta gctcaaattc gtctgactcg 5641 tgctgaatgt tgggagcctg cttgaaacca gcttttcgtc atcccaaccc tagtttgcaa 5701 tcttcttgga actatgccca actgggaata ctcgtcttcc cattgcttcc gtttctcggc 5761 gctgtgggta tagttttagc aatcttggga tcttggttca ctcaataccg cgcaattatt 5821 cgccgtcccc tccactgggg atttgcgctt ttgagtgtct tcatgctcat caccgttagt 5881 tttgccaacg acaaaacggc ggcttttctg ggcttattta atttcctacc gtacttttta 5941 gttttcgcta gctttagcac gcttattcag acacctgtac aactgcgaca catgtcttgg 6001 attttggtgc ttggttccct accagtggtg attcttggat ttgggcagtt gtttttaaac 6061 tggtcttcta aaataagaat tttgtggttt gtcgtggaat gggagataga acctggagga 6121 caaccactag gtcgcatggc ttccgtattc atgtacgcca acatcaatgc ttgttatctg 6181 atgatagttt tcatcctagg gatagggttg tggttggaaa gctatcggcg tctgaggaga 6241 cacgtagaca acagcaaaag ggaacaacac agggagagag tcacagaggg aggaggaaaa 6301 cttccctcac tccctcagtc tccctaccga acgttctttt tcctaacagg agtgatgatt 6361 gccatttttg tggcgttgat tttgactgac tcgcgcaatg cttgggcgat cgctattttt 6421 gcttgtttag cttatgcggt ttacgaaggt tggcacattc ttgtggctgg tgttgctggt 
6481 atcgtcgcaa gcgtgctttg tgctgctttt gctcctttgc ctatagctca attgtttcgt 6541 aagcttgttc ccgctttctt ttgggcacgt ttaaacgacc aactgtatcc tgatagacca 6601 atcgcgttac ttcggaaaac acagtggcaa tttgcattgt ctttaactca gcaacgtcct 6661 tggactggtt ggggtttacg caattttact cctctgtacg atgcacagat gcatatttgg 6721 ttgggacatc cccacaattt gtttttaatg ctatttgctg aaactggttt tcccacgact 6781 cttttattct gtggcttact cgtttggata tttattgcgg gtgtccaact tttgcaaaag 6841 ttaaaatctg tagatcaggc agaagataaa ttgatatgtt tcagttatct tctggtattt 6901 gtgggctggg tgttattcaa cacagcagat gtcagcttat tcgattttcg cctcaatacg 6961 atctcgtggt tatttttagc tgctatttct ggagttgtct atcgccagaa ccattaccaa 7021 aggctacagg ttaaataaaa actcattata taggtcattg ctcactggcc agatttttcc 7081 gaagatacac tgtggtttat caagttatac gttaataaaa ttacttttat acgactgtga 7141 ccgtgtatgt ggctcatccc gcatttcgtc acgttcgctg attgttggga agatgtggga 7201 ataccttttt gtagtgtttt cctacatgac gggcatctgt tgtagatgcc cttacttttt 7261 taagaagaac tcagaatttg tataatatca tgtcatagca gcgctttcat cgaaacctga 7321 agtccaagaa gcctttaaaa aaaacttcca gcaaattttt caagtgttgc tgcgttattt 7381 ggggtcagat aaacaagtac ggtactggtg tgatcatggt aagtcgttta ggacttaaaa 7441 caatgacagg tcgcacaatc actctcaaag gggtaaaacc tgaaggtatt gttgtcgcat 7501 gtgaaaaaac ttttatctat atggtcttgt tgaaccagca acaggagata gtttcttttg 7561 ggagttttct cacttggata gtatgtgttt tcaagatttt ttagaccttt tttcaaaagc 7621 ataccctgag gtgttaaatt tggttcagat ggacttttgg catttcacaa gtccttggat 7681 ttaaaatggc ctgataatgt tatcccaatt ttccagccac caaatagccc tgagttaaat 7741 cctatagaac gtttatggga acacatcaag tacgaattgt cctgggaaca ttgcacaact 7801 ttagatcagt taagacagaa gttataccaa ttcgcgttca aaagcgtaat ttagaaattt 7861 tgattgtcta gttccatcaa gaactatcgc tatttgaaag ttacggttta gaatgcagtt 7921 tagtattaaa acaggttcac agattctatc tcatccgaag cgattgccta cggcagagct 7981 acgcttaacg cctcaatttg tggatgggat tacatcacct gagctttatt aagtgcaact 8041 tcatagagaa ttggtattag gtttaggaga aagcggaaga tgataatccg agtccaacaa 8101 tgcttttaag cctctcaaaa tgttgcactt tgcccagttc actacttctc aggtggtggt 8161 
aagttgtcgc ccactttttt tgttgaacct ttcatacgac agtgccagtc gccaaaggta 8221 ttttgaaaac agatacccgc gctggtatat tgctccacat tgcagttctt tccaggcaaa 8281 tgacttctac ccctaatctc ataacaagta tgattagtgt agctgcctcg aatatcgtat 8341 acgggtttgt ccatatttat gcttaaatgc tcccctcttt ggttagtgtt atattccagg 8401 tcggttacat tggctgggcg agatttaggg gcaatctgaa gagtgagatc gctgataagg 8461 cgtataaaac tgctaaaacc accttgacca ggagtcccaa actgcgattc ctggtcgcac 8521 agaaaataca tctttgcctg ctcgacgcta ggcgctcctt tttttggttc taccctcgat 8581 gagcaagtct gaggttggcg tgctgttcca tacggaagag tctgagcagt tagtatggta 8641 ggcgttaaaa ccgttagcag cgtacagcct attatcccaa gagtaaacct tactttcacg 8701 ttaattcctc tatatttgtc aacatcaaaa ctgtaaagac ccaccatctg aaatgctgta 8761 aaaatttccc ataaggaatt tgagccacca gatttcattc ctccctacaa agcataattg 8821 tgtgcttccg gaaaaatacc aaagtttctg gtcaggaatg tgtctaagtt acggattttg 8881 tgtgaaatta catattgaaa agttatatca ttcgcctacg gctccgttcg cttcgcgaaa 8941 tgcccaaacc tccaaccacc aaaataaact ccacccctgg tgtatccttc cactgccaaa 9001 ctaccggtct ttagcccttg aagtagttca ctcctaaaga actctgattt tcagcgcaga 9061 cccgcgtagt ttacttttta gcaacatacc taacaccgag taaatttcca tatagtcctt 9121 atttttacag tagtggaagc gtcaatgatg acagcaattt tccgatctat agacgaagtt 9181 tgaaacaggg agtgagggag tgtaatcact tgaaaagcgc tgaaattttg ctggtgatcg 9241 ccccaaagga tttgatatga cttgttgttt tctgaatgcg ttctttaaat ttttgtgctt 9301 tttgctcgtc aggaatttct gccatcaaag tttctataag ttgggtttga gtnnnnnnnn 9361 nnggtttaaa gcatctttga tgagaatgac gctagaaaaa ggaccaacaa aactaatgag 9421 ttctcgccga caacgatcta aaaactccgg atcgctaatt gcaggatagt ttccaagggc 9481 ttggtatgaa ctgctaggat ttgcaagttt tgctaagtta ggttcaacaa caagttggag 9541 gcgatttcta aactcttgtg ccttttttgc atcaaaaatt tccctagcta gcgcttcggt 9601 gaattccttt tgggctatct gtgggctttg ttccaaagta tttttcatca aaacactggc 9661 gattggacca acaaaactag ttaattctcg cccgacaaat tctaaaaatt ctgggttgag 9721 agagactggt gtttgtgtat gcgattgagt atttgtttcg ctgttttgct tttcttgccc 9781 atgtgtagtg ttagtttgaa tttgcggtgt tttcggtaac ggactcgaaa ccccacccag 9841 tgacaatgat 
gagttattat cattaagttc aatgaaaact tcttttgctg actggtatcg 9901 ctctgtgggt acttctgcta acattttttc tagtatgcga gcaaaagagt cactgatatt 9961 gacataagag cgccattgcc agtttagaga accatccatc aacacatgtg gcattttcgc 10021 agtcaaaaga ataaccgcag aaacagcaag tgagtaaata tcacttgagg ggtagcaatg 10081 tcctacacgc agctgttctg gtgcagagta gccaattttg cccactactg aactgtgaac 10141 tgagtattgg taattcggag attgggctga taatatctcg gtcaatttct ccttaaccac 10201 gccaaaatca atcagcatgg gtttagattc gttatggcga agcataacat tttccagtga 10261 aatatcccgg tgaataatct tgcgatcgtg gatatattcc aaaactggta gcatatctag 10321 tagccatgct ctgacttcag tttcggaaaa tggtctacct ttattagaaa gacgttccga 10381 taaaatttct gagtaatttt tgccatcaat atattctagg acaataaaca gccgttcgtt 10441 ctccgtcaac caagccagaa atttaggaat ttgaggatgc tgaatttggt ataaaacttt 10501 agcttcccgc tcaaataact ctcttgattt acgaataatt tctggtttcg tggttgcagg 10561 taaaaactct ttgagaacac aagcttctcc aaagcgctga gtatccaatg ctaagtaagt 10621 tcgtccaaac cctccttgtc caagaatttt cttaatcaga taacggttat tgattaaagt 10681 tcctgggttt atttctgtag tcattttaag ttcttattta attatgagta gcattttgga 10741 tttcataggg ttagaatttc aagattatgc attttgggaa aagaggaaaa aaagtgacat 10801 acgcccttgc gggactgcgt gaggcttctc gtggtttaac gacttgagtg tcggacatga 10861 gcgctccggt gtgttccgtt ccgtagctac ggcgaatgag acgcgtatct cctgcaccag 10921 acgcttacgc tttagcccaa cagcggaacg caccactcca attatgggga tgtatgcaac 10981 ggtaatactt gacattagga gtcgttaatg cattcattgc tggtagatga tgtacaatcc 11041 cagcaatacc tgttcctccc gcaacaggcg gacgtcaaaa gagctatggg caatagggcg 11101 cttacaggcg atcgccactt tggaaagatc cttgtcagtc atcgtagcgg agcccaagca 11161 ggaagctcat ttataaagtt ttataagaag aaaaagatgg caaaagctca taccaagtat 11221 gtaaaacacc cgtagtcttt gctttcgtca gaaaactgag aatgcttttc aaagcttaac 11281 aatgtcatcg taactttaca aacatagtat aaagaattgg ttggataaca aaaatacagt 11341 caacaataac tatagccaaa gctatagttt cgctaaccta tgcagccgat tcgcactttt 11401 aatgtttccc cttcactacc gcagcgactt gagccactgc ggaagctggc atataacctg 11461 cactgggatt ggaacgttga gacgaaagac ttattccgac gcttagaccc tgatttatgg 
11521 gagtctagcc gtcacaaccc ggttttaatg ctcggtacta ttagccaagc gcgtctgttg 11581 gaagtcgtgg aagatgaagg cttcctggca cagatggatc gagccgcgcg ccagttagag 11641 gattatttgc aagagcgtac ctggtatgaa aaacaaagaa ttggtcatgc ctcagacgag 11701 aaaggacaag aatattacgc ctatttctcg gcggagtttg ggctggtaga ttgtttgcca 11761 gtctattctg gaggcttggg agttcttgct ggggatcacc tcaaatctgc tagcgattta 11821 gggctacctt tggttggtgt gggcttactc taccagcaag gctattttgc tcaatatctg 11881 aatgctgatg gttggcagca agaacgttac ccaatcaacg acttttataa tatgcctttg 11941 cacctggaac gcaatcctga tggctcagaa ttgcggattg gagtcgatta tccagggcgt 12001 actgtctatg ctagagtttg gcgtgtacag gtgggatcag ttcccctgta tatgctcgac 12061 actaacattg aacccaataa ttcttacgac cacaacatca cagaccagct ttacggtggc 12121 gatatcgata tgcgtatcca ccaagaaatt atgctgggta tcggtggtgt gcgaatgcta 12181 aaagcccttg ggtataacat caccgcgtat catatgaatg aaggtcacgc ggcattttct 12241 gccctagaac gtatccgcat tcttcttcag gaagaggggc tgaattatcc cgaagctaaa 12301 caggtggtgg cttccagtaa tatctttaca actcacacac cagtaccagc gggaattgac 12361 ttgttccccc cagacaaaat gttgtactac ctgggatatt acgctgatgt ctttgggtta 12421 aataaagacc aatttttagc tttgggacgc gaaaatactg gagatttgtc tgcgcctttt 12481 agtatggcgg ttctggcact gaaaatggcg acattctcta acggtgtggc acaactgcat 12541 ggtgtggtgt cacggcaaat gtttaagaat ttgtggctaa atgtgcccac agaggaagtg 12601 ccgattaaag caattacaaa cggcgttcat gctcgcagtt gtgttgctaa atctacacag 12661 gagttgtatg atcgctacct aggaccaagc tggtcatcag caccgcctga taaccctttg 12721 tgggaacgga tggacgcgat ccccgatgag gagttgtggc gtaatcacga gcgctgccgc 12781 ttggatatga ttatgtatgt ccgggagcgt ctggtgaagc atttgcgaga ccgtggggct 12841 tctccctcag aaattgcctc tgcacaagaa gttcttgatc ccaaagttct aactattggc 12901 tttgctcgtc gctttgcaac ctacaaacgt gctactctgt ggatgcgcga tgttgagcga 12961 atcaagcgga ttttgttggg caacaaagac cgtaaagtgc agtttgtgat ctctggtaaa 13021 gcacacccca aggatattcc tggtaaggaa cttatccgtg atatcaatca cttcatcgac 13081 gaacagggct tggaaaagca gattgtgttt gttcccaact acgatattta tatcgctcgg 13141 ttgatggtcg ctggttgcga tatatggtta aacacaccac 
gccgtccacg cgaagcttct 13201 ggtacgagtg ggatgaaggc agctatgaat ggtttgccga atttaagcgt acttgatggt 13261 tggtgggatg aagctgatta tgttcgtact ggttgggcga ttggacatgg ggaaatgtat 13321 gacgatccca gttaccaaga tgaaatagag gcaaatgctt tttacgattt ggtggaaaag 13381 gaagttgtcc ctctgttcta cgaccgggat gtcgatggtt tgcctcgccg ctgggttgat 13441 aaaatgaaag acgcaatcca gttaaattgt ccgtttttta atacggcacg gatggtgcga 13501 gaatacgcag aacgggctta cttcccagca agcgatcgct accacactct cactgctgat 13561 aaatacgccc cagctaaaga gttagctgct tggaaagaca atctcactgc acactggtac 13621 gacatcaaaa tcaaagatgt tgaagtatcc tcaggagcag atattgaggt tgaccaaatt 13681 gttaacgtca cagctaaagt tgacttggca actttgacga acaaggatgt acaggtggaa 13741 ctgtatcaag gtgccattga tgctgatggt caaattgtca acggtgtgcc tgtggtcatg 13801 aattaccaag gagaagataa ggatggttta agtatttaca ctgctgaaat agtgtataac 13861 atatctggtt tacaaggctt gtctttacga gtgttgccaa accacaaata cctctccagt 13921 ccttatgagc caagggtgat tgtttgggca gagtaaagaa gtgaaaagtc attattttta 13981 cttttgactt tttgagggga tgacaatgta acctgtggtg cgttcgccaa gcactaggtt 14041 acgttgctcc ccccaatacc accagtatgc agcgacataa gcacagatga ggctatcgag 14101 tttgtcttcg acttgtttga gggctgcgcc tgtggtggga atttcgggaa gaaatgaaca 14161 agagaggcgc agagaacaca gagggggatg gagagagggt aagatttctg tgatgtaatt 14221 gtagagtttg atgagttcga ggcggcgatc gctcaaacgt ccttttttat atttaagaat 14281 tcgttctaag ccaaataaat gaactatggc tggatgggga aagacttcta tttgatatct 14341 gcctggtttt tgtggttcta ttgttggtgc gtgggcaaaa ccacgggctt ctaattctaa 14401 gccaaatttt actgtccttt ctgcaaaagg taggcccaga tttgctggat agcatccagc 14461 gtgatatttg ccaaagtgtt tgtgggttaa tttgtcgggt aggcgacttc cagtagcgtt 14521 aggaatgagg gtgggtgcgt ctacaccgat aatggcgggt tcctctggtg taacgcaaat 14581 atcaatccag gttaggatgt ctgcaataga ttctttgcgg tctatatctt gtatttgcag 14641 ttttccgtct atgaattcta agtagcataa gccgctgggt tgggatttcc agcctaagtc 14701 aattccgaga aatttcattg ttgaattaat gagacaattt caatcatatc ctgatgatga 14761 gttctaataa aattttcggc gttgctgaat ttggtaatga atttgatagt gcacgatgtt 14821 caattttcgg caaatttatg 
tagagacgtt gcatccgaaa gttctacatc aatggaaatt 14881 ttgaaattca tatcaagatt tcagcaacac ccaaaaatcc gttccaaatc taacacaaaa 14941 ccaggtaaca catcctcacc cgataaactt ttaggagaat tgagaacttc caccgcaaca 15001 tgaggacgat aaatttccac tcgtttctct tgaggatgaa tcaaccaacc aagacgcaaa 15061 ccattctcta tatattcccg cattttagct tgagtatctt ctatatcgtc actttccgaa 15121 accaactcca ccgcaaaatc aggacacaaa ggaaggaact tctttctttg ttgtggtgtt 15181 aatctttccc aacgctctat agttatccaa gatgcatcag gagaacgagt tgcaccattt 15241 gggagtttaa aacctgtcga agaatcaaaa gcttttccga gattattttt acggttccaa 15301 atgcctaaat caatataaaa ttcaaaattg cgattccccg tttctcctcc cgttggtgac 15361 attataatta attccccttg agcatccaat tctaaacgta attcttgatt aacagcgaca 15421 agttgttcaa attcttcatc agttaatttt agcgatggag gtaactgtaa aattaacgcg 15481 ctcataatta agtactacca tgacaagtga aatgacaagt taaataatgc tgctttcatg 15541 acttcttctc actaaaaatg cttgctggcg acgtgtgatg acttcttcgt taaaatcagc 15601 aaacatttta tctgacatat cctcccagca accagcaaac tgcatcgttt gctcaattgt 15661 aacttgagat gcttctgtag agacgttgca tgcaacgtct ctatattgac gtcttgataa 15721 cagttttggt ctgatcacca agcgtattgg actattgcac caaaatcgag cgtatcaaaa 15781 tgctcataac ctcacctgtg ttgccagttg gcacaagggg actccaatca ccaatagagg 15841 cgtatttgag ttggtgcaga acgcgaattg tgctctttac ggctttagga gaaccaacca 15901 gtgtatgttt gatgggttct ctttttgagg atgcttgaga gttagtttga ttggggatgg 15961 gtgcatttgc gttgtcagca ctggacaaaa agctttgagt tatcgattgt aaagaataat 16021 ttgcaccata tttgggtacc accgcgtttg cttccccttg cacgtattgt tccaactcca 16081 attgagtaat tacttgctcg tcatacccga tcgcttgtac ttgtgtcatg atttatgtat 16141 ctccattcaa tatgggggtt tataaagcga tcgcttgtcc aattggccct ggaagcgatc 16201 gctttttgct gtatcttgca gcaatatatg tatcataact cataacgcac tatatcgcaa 16261 tctttttagt atcttaaaag atgactttgc tctctatgcg aatcttaagg cgtactatag 16321 gatgcgaaag cgatgatgga gatagcgacg tgagcaaacg atttactgta actcttccag 16381 actcagtatt tgaagattta gaggtattgg cagatgcaca aggcagacca acagccaacc 16441 tcgcagcttt tttaattgaa attggtatca aggagactaa agaacgcggc gaattcccag 16501 
aaaaaccaga aaaaccaaaa acctccaaag caaagaaggc taaggaggag ggatgaacca 16561 aaatactatt gatctataag agcttgttct aagtagcaaa taacctagac aatcgtcgtc 16621 atttattgac aatcatcccc attttcttct attagttcat caacaagctt tcttagccgt 16681 tcttgccagt tggggacggc ttttagcttt tctttaacac ctttccttac agcaaagcac 16741 agtggtgact tatcaaaagg ttcttcatta aaagattttg caccaagttt attatttttc 16801 tggaatggca tacaaatacc ttgtatatgt tattctattc acatatagca caagtattga 16861 gggtcgaaca atgcttttat ccatcaagac caagttaaaa ctgaacgaag tccaaaaaac 16921 agttatgagc aaacacgcag gtattgcgcg ttttacgttc aactgggggc ttgctacttg 16981 gaatagtttt gttaaagatg gattaaagcc taacaagttt atcctaaaga aattctttaa 17041 taaccatgtc aaacctgaat ttgaatggat taaagagaaa ggtatttgtc aaaagattac 17101 tcaatacgct tttgataatt taggtgatgc attctctcgg tttttctcca agaaaggtga 17161 ttatcctaaa ttcaagaaaa aaggtcatca tgactctttc actattgatg ctagcggcaa 17221 gcccatcccc gttggtggta aatcaataaa actacctact atcggctggg taagaaccta 17281 tgaaggtctg cctcatacca cttgcaaatc aatcacaata tcgcgagttg cggacagttg 17341 gtttattgct tttgcttatg aacaagaaca cgagccaact gttaagcagc atgatgttgt 17401 aggagttgat ataggtgtca aggaattagc tacactctca acgggcgtag tgtttcctaa 17461 tccgaagcac tacaaaaccc atttagaaaa acttcgccga ttatctagaa agttttcgat 17521 aaagacgaaa ggttctaata atcggtacaa agctaaaata caactggcta agcatcatgc 17581 taaggtagcc aatctcagaa agaacactct tcaccaaatc actactttct tatgcaagaa 17641 ccacgcaaaa atagtagtag aagatttgaa cgtttctggg atgctatcta accataaatt 17701 agctcaagtc atagctgatt gtgggtttca tgagtttaaa cgccagttgg aatacaaggc 17761 gaaaaagttt ggttgtgaaa taatcattgc tgaccgttgg tttccatcaa gtaaaacctg 17821 ttccaattgt gagcatatcc aagatatgcc actaaaagaa agaacttaca actgcaaaag 17881 ctgcggacat tcgatggaca gggatttaaa cgcagcaatc aatctatcac gtttggctaa 17941 agcgtgaaag cttactgagg gatagccgct cccatgctcc ctttgaagta agaagtaaat 18001 gtctagactt gtctaggatt tatatagcag cagcatggct ttgtacaagt acgccagcgt 18061 ggtagtcata taattatgca attgcaaaca gaagactcaa caataactgt tcctgtccct 18121 gctcacgatg agctacgtac tggcactttg cgctctatta ttcgccagtc 
aggacttcct 18181 cgcgcccttt ttgaagtgga ctcatgacgg cgatcgcccc cagtagacgc tttgcactat 18241 cgcactactg cattaacagc caaaagaagt gcgatacgct agcagcgtct gctttcccgc 18301 agcgatagcg aagcgctact gcctaagcgc aaagcgcacg ctacgcgaac ggcagatcgc 18361 tccccaccat cacactcctt ttcgcttcac atatccatac tgtttatcta gtttccacga 18421 gatagccgat ccaaaacatc aatcttcacc cccagattca ccactccggt gccatcacga 18481 cgtttaatct cggcattatt taatgcaaca atataaggct gaccgttaat cacagcaatg 18541 attggtccac ctgaagaacc acctgttgtg tcgcagtcgt gaaataatat attttgttca 18601 tcccggagaa tgctgcatcc atgttgaaca ctcgcagtcc agccaggacc agcgctgagg 18661 aaatcatagc cgcgtttttt ggggttagga aagtcgccgg aatacccaac aaagataaat 18721 tttttctggt tcttgattaa agtggacgaa ggcagggatt tccaacccaa gtaaccgtat 18781 ttttgaccga taggcttgtt cagttgtatt aatgcccagt cgtttgtctg gttagtgact 18841 gcatcattag tgaaatctgt accgtaaaga attttttctg ctaaagcagt atcattatca 18901 tcttgtagct cacggttgat aacgttgggc ataaacgcga ctctcttgct caattgatgc 18961 gtttctggat tgataacaca atgagaattg gtgaggacaa tattttcagc aattagggtt 19021 ccagtgcaag tataagcatt accatcggta ctttcgccaa caatacgacc tattgttgac 19081 cacgggtatt tacggctaat catggggatg cgatcgtccg gaccaattat tgcgcgagtg 19141 tcatcaatgg gtttttctga acgtgcgagt cctgatggtt gaaatggctt accagtacct 19201 tgtattttga attgacctgc atcagtcgct ttgatgaatt ttggcggcgc tgctgttggc 19261 gatttggtat caactcggtt aacttgggcg ttcactgatg cccaactagc cattgtcact 19321 gctgccagaa gacctgcaaa aatacgcgca caaggttggt tgtagaaagg ttttgtagcc 19381 atgataaata ctccttaatt gcttaaaatt gttgaactcg tttctcaatt tgaactagaa 19441 aagaagggaa aatgcatatc atctaggagc atcccaattt tgcaagaaca tgtttgtcat 19501 tgcgttagcg tagcgtgtcc gaaggacata cgaagcgaca gcgtagtgaa gcaatcgcaa 19561 gatgtgggat tgcttcgctc caccctccgg gttcgccagt cgcctacgga gggaaaccct 19621 cctgcagcgc tggtctcact acgttgcgct cgcaatgaca attcatcacc tggatttggt 19681 ataacttaca catttgggat gctccccatc agtgccactc accctgcaaa gtgaaagccg 19741 cccaataata gggattttgt aaatcttttt gtttccacat atgtatttgt gtagctctaa 19801 gcgcagcagt aggagatttc cgttgctgca 
acatttgttt gtaaaattcc tgcatcagta 19861 gtgatgtact cttgtcgtta acgctccaca atgaaactgc tacacgcgct ccacctgcat 19921 acattagccc ccgtgtcaac cccatcaacc cttcgccctc gacttgctga ccgagtccgg 19981 tttcacacgc acttagtaca attaactctg ctgggtaatc gaggttgaag atgtcgttta 20041 gccgcagaaa ggatctttgg gatttacctt gtttatctac cgtagacagc actattccag 20101 ataattctgg gttttgctca tcaaagcaac cgtgagtaga gaaatggagt aggcggtatt 20161 ggctcaactg tggactcgtt gcaaatttgt aattcgcatc aaaaccaaaa gcccagaggc 20221 gttccggtga tgagactagt gccataatgt tttcagcctc ttctttagaa tgttgcagtt 20281 tggaaaactt tccactacaa gcccgtctaa gggcagaccg ttctaggttg agttccatag 20341 atgaagaact atcctggggt ttattagtcg tgtcatccgc actgaatatt ggatcggcaa 20401 ggatggctaa agttttggga gctattttcc gccctttgag tttctgccga tgagtcgcaa 20461 ggctggatac tgaaggcaga ttgattattt catgattgac aatcaagggt tgataattga 20521 cctctgccct gctgctaggg gaggggagtg tttcttgctt cccgtttcct tttacagaag 20581 agggctgcgg tgtaagatca gccagcactg caaagggaat tttatataaa atcccatcag 20641 taacaacgac taagcgtttt tgtcccagct tattagccac aggtgcaaga ataagttgac 20701 tgagttcgtt agcagctttg gcagtctgcg caagcgcttt ggctttgtct tgtgcgcttg 20761 ctccgtcaat tgaccaaccg tgtaaaagtg tgtataaagt atctgctgct ttttttatct 20821 cttcttgttt gggaagttcg tagctttgaa aagaatcagg agttactgcc caaagatagc 20881 tgtgttcttt acccaaggaa tactgcaaca acagcgtatc tttatccagt tgttgttgga 20941 ttcctggcaa cttcagcaag ttttttggat tagttaattc tgcacgttct gggttaatag 21001 cacggatttt ttcttcaatt tctttctgtt ggtttaagaa atttttaatt tctttttcgg 21061 tagtcgctat cagttgggcg ggtggtgatt tttgattcag tagccctgat aactgttttt 21121 ctctagcatc aagtagcagc cgtaaacggc tttcttctgc caagagtttt ggatcaactc 21181 ctttgcggat ttctgcacca gctgagttta aaagttctat caaaccacgg gcgcgggaac 21241 gttcgctgat gtgcagtgca agagcatcat acccttttga tgggttcttt ttatgcaact 21301 gcattagcag gtcgatgtag aatttataat aatattgcac tgaggcaaag taggaagttc 21361 gtaagtcttg gttgacaact ttcgtgcgta aatcttcgat gatttcaata gcagatgaga 21421 tttgtttgag ggattgttct aagttgcctc ggttgcgttc tagggaagct aggttaaaaa 21481 gggtattagc 
ttctccaccc ctgttgccca ctgcacggtt tagtggaagt gcttggttta 21541 ggtattcaag tgctttttgt ttttctccta aatcggagta aactagcccg atatttttga 21601 gtgttgtagc ttctccacgc ctgtcaccca ccgcacggtt tagtggaagt gcttggttta 21661 ggtattcaag tgctttttgt ttttctccta aannnnnnnn nngtttagtg gaagtgcttg 21721 gtttaggtat tcaagtgctt tttgtttttc tcctaaagca gagtaaactt tcccgatatt 21781 attgagtgtt gtagcttctc cactcctgtc gcccactgca cgatacaacg gtactgcttg 21841 attgtagaat tcaagtgctt tttgtttttc tcctaaaact agcccgatat tattgagtgt 21901 tgtagcttct ccacgcctgt cgcccaccgc acgacgcagt ggaagtgctt ggttgtagnn 21961 nnnnnnnnag cttctccacg cctgtcgccc accgcacgac gcagtggaag tgcttggttg 22021 tagtattcaa gtgctttctg attttcccct aaattggagt aaattcccgc aatattattg 22081 agtgtttgag cttctccacc tctgtcgccg actgcacgaa acaacggtag tgattggttg 22141 tagtattgca gtgctttctg attttctcct aaatcggagt aaactagccc gatacccgtg 22201 agtgttgtag cttctccacc tctgtcgccg actgcacgaa acaacggtag tgattggttg 22261 tagaattgca gtgctttttg tttttctcct aaatttaagt aaactccccc gatattattg 22321 agtgtaagag ctactccact cttgtcgccc actgcatgat acaaaggtag tgcttggttg 22381 tagaattcaa gtgctttttg tttttctcct aaatcggagt aaactgttcc gatcgcaagg 22441 gaagtaagag cttggtttgc tttatcccca actttctgcc acagttgcaa agctagttcc 22501 aattttttta ctgcctgtct cagtgattct gcgctacctt ctttgtaaaa tgcaaccgct 22561 tcatcatagg ctttttgcgc cgcagcacga gttaattgtt gagaattcgt ttctggttgc 22621 tgtgctatct gcaactctgt atttttcagt gttgccctca ctgagtcgga cagcaaaacc 22681 acacccatca acacagctaa actgtaacga gaaaagtttg aaaatgagca gcaaaaaata 22741 cacttaagct ttatatttct gttcattctt aagccggaat aatttatcaa aggcagcgat 22801 gttcttgcag acgcgaatcc tgatcttgat tgtttcttta attgtttaag agattctccg 22861 gagaatgtaa tgttgattgt ataacattca tttccggaag ccaatatcgt ttctgttttc 22921 gtaacaaaat taacttattt tagtgagtgc aggtgttttt tattcctgta ttagcacctt 22981 tgaaacgatt agttaccagt aagccagaac tatgagatat tgcccccact gaatctgaca 23041 acaaaactgc accaagcagc aaagtcaaac tgtaacgagc acaattcagt aaaatccagc 23101 agatatcctg ccgagactcg tcttttttgt tcatactgaa gtctcaacct attaatgagt 
23161 tacagtcaga attctgccgg cacaagaatc ctgtcatttg tatattattt agttatcttc 23221 caacaaattg tggaaatttt attgttattt tatgagtaat ctcacataaa tctattttgg 23281 tttcgttttt tgcaaccgat ggaaggagta aggattttct cgaaggtttt tgaacaaagg 23341 tatagcgctt ttcattgagt gtatagaagt accttcaaag cagcatgccc gaagcgcgat 23401 aaagcattac ctgatatctt gcatcactac gagtaaacca cgaatactta attaaaatcg 23461 tatttaaaaa tactcaaaaa ctagttccag tctcagtcca ttttaatgga cttcgcctat 23521 tagcctggga tttaaatcct aggcggacaa gaacccagtc aaataatatg taataaattc 23581 tactattttt ctcaaggcaa aatgtaagtg accaccaaac cccctgtgca atacttccca 23641 cagaaacgct atgcgattcc cacaccacct catcccccat caaaaagctg atgcaacctt 23701 tgccctcttt gtgtgaaaaa acaagatccc cgacttctcg cgagaagtcg gggattgtaa 23761 taggagttaa ttgtcactgc tttacatcac cagattaacc agaagccagt aaccgagaag 23821 cacaagaaaa acgcttaacc acaacacaat tggtctgtcc ataggcgtat gttttagagt 23881 gatttttact gctttaacga atacacagtg taaaaactgg gtaatagcaa ataccttttt 23941 ctcgactctc taagtcatgg agtacgagag ctaaagttta attgctattc ccgatttttc 24001 tcaaatttta ctcaaacgtt accagtattc cgcaggactt ttatccttcg gttctggttt 24061 atcaaccaca ctaaactctg tggataaaac tattttagcg atcttaataa cgttttgcag 24121 gaataaatca ctttatatgt gtttacaata cacttctcaa aattgtctta aaagggtagt 24181 tgacaacatc gcggataatg atcaatcaat tttttatatt tatattgttc tattctttgc 24241 gaggtagaaa ggctagttat aacaacattt aattattctg cataaactga aatctatcaa 24301 tgggaataag ttttgtagtt tttctcacga ttaatcacaa aagtatattt taatcacata 24361 gataatagct tccttacctc agaaagacga acaaactcta gagctacata taaaattatt 24421 cttaaattat tactagacta cttttttatc taaaaaatca gatttttcac tctgaaaata 24481 tgggaattat tgattcgtcg gagagaaaac tcttttgact taagatttac tttttaggag 24541 aattgttatg cgaattgccc aaattgctcc actgtgggaa agagtaccac ctccaggtta 24601 tggcggtaca gagttagtag ttggactgtt aaccgatgaa ttagtcagac gtggacacga 24661 agtcacccta tttgcatcgg gagattctat cagtttggca aagcttgaat cagtttatcc 24721 tcgtgcccta agacttgatc agactgtgaa agaacatagt gtctatgaga tgctgaattt 24781 agctcgagta tatgagcaag cagacgagtt tgatattatt 
cattctcatg cgggacacat 24841 cagcttgagc tacgccaatc tggtaaaaac acccaccgtt catacattgc acggaatttt 24901 tactccggac aacgaaaaat tgtatcaata cgcaaaaaat cagccttaca tcagtatttc 24961 cgatgcacag cgagaagaaa gattaggggt gaattatgta gcgacagtct acaacggaat 25021 taatgttagc agttataaat ttcatcccca acctgaagag ccgtcttatc tggcttttct 25081 aggtcggatt tctccacaga aaggaacgca tttggcgata caaattgcta aagaaacagg 25141 ctggcgcttg aagatagcag gtaaagtgga tgtcgttgat gtggaatatt ttgaaagtca 25201 agtcaagcct tttattgatg gtaagcaaat tcaatatttg ggtgaagcca accatgagca 25261 aaagaatgct ttgatgggag gtgcagtagc cactttgttc ccgattactt ggcgagaacc 25321 gttcgggttg gtgatgattg aatcaatggc atctggtact ccagtgattg cgatgaatat 25381 ggggtctaca gaagaggtta ttgcccacgg taaaacaggc tttctctgca acaatgttga 25441 agaatgtatc agtgccgtta gcaaggttac tgaattggat cgttctgctt gttgggacta 25501 tgtctgggag cgttttagcg ttcagcacat gactgatggc tatgaagcag tttatcgcaa 25561 gattctgggg gagccgtttg cttgcagtaa tggacatttg cccaatccag tgatttcagg 25621 caactagtct atttagaaaa agcaaatgcg ctctttgggg cgatcgctcc ctcgtgctag 25681 tgccccaaag tcacagagca aatttgctgc tagccagagt agggggaggg tagttttcca 25741 aactgccctc ctttgcaaac aagggcgata agccggaggc tttagcttgt gcgtatcgca 25801 aaggagcgta atcgcctcca cagtattctc ctctgaacca aaagcgatcg cagactccgg 25861 acggagtcta gatcttaagt ttgctcggca tagccttgca agttctctgg ctcaaattgg 25921 cgcagaatgc gagttgcctg attggttaac tcttgactgc ctttaataac aatcagatac 25981 ttgcctgcat tgaggcgatt tcggtagggt aaagcatccc cactccccac agttaaacca 26041 actccaccgc cgacaaagta tgcacccaaa gcaccagaag ccgctcccaa cagtccacca 26101 atcaggtgat taccaaggct gccaatagga agtacttcga tgcttgtcaa aaggttgaag 26161 gcgtaaccag caacaaaacc aaagggaatc agccaagata ccaaacgatt tatccctttc 26221 tttgcttgct gatttgggtt gatgaatcca tactcatcag cacttttgta tcccctgcct 26281 aaaatatcaa tttgattggt gggtaaacct tctttttcta aggcagtgta cgcttcttct 26341 acttttaacc tgtctggtag aactgcaaca agataattca tagtgtcatc ccaagttata 26401 ttgtcattgt tgagtcaaaa aatctttagc tcacttgtgt ggacgattaa attaagtgag 26461 ccaactcacc ataaacaata 
agtatcgaag cagtaagaaa cgtcaatctt caggttgaaa 26521 taaaactgaa gcagttctcc atatttgcct atttctcata gaaatgtcat ttttatgact 26581 tgttctatgt cttgccagag aactcgtaat aatgataaaa ctttccctct ccttaataag 26641 gagagggatg cccgataggg cagggtgagg ttccg // LOCUS NODE_1117_length_26641_cov_5.231739 26641 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 26641) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 26641) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..26641 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(69..1601) /locus_tag="DP116_09920" CDS complement(69..1601) /locus_tag="DP116_09920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409066.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium chelatase" /protein_id="PRJNA477356:DP116_09920" /translation="MLARVWSGSIVGIDAVKVGVEVDVSGGLPGIVVLGLPDTAVQES RERVKATLKNAGFAFPMRKIVINLTPADLRKEGPCFDLPISVGILAASEQVSAYLLGD HLFLGEVSLDGGLRSVAGVLPIAATAQKMGITGLVVPADNAQEAAVVEGLAVYGFKNL SEVTDFLNNPGRYKPVQLDNTVERIQAMSLEAADLNDVKGQAHARRALEIAAVGGHNL IFVGPPGSGKTMLARRLPGILPGLSFSEALEVTRIHSVAGLLKNRGSLVRDRPFRSPH HSASGPSLVGGGSFPRPGEISLSHRGILFLDELTEFKRDVLEFLRQPLEDGYVTISRT KQSVMFPAQFTLVASTNPCPCGYYGDTIQQCTCSPRQREQYWAKLSGPLMDRIDLQVA VNRLKPEEITQQPTGESSATVRERVQQARDYASHRFKNEQNLSCNAQMQSRHLQKWCK LDDSSRNILEAAIRKLGLSARASDRILKVARTIADLAGDDELKAHHIAEAIQYRTIDR MQ" gene 3050..3235 /locus_tag="DP116_09925" CDS 3050..3235 /locus_tag="DP116_09925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878499.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_09925" /translation="MNEPAAHTFPSSATLSQKSAVKEITVDLTPSEASTLEKYCNQTG KAATDVIRELIQELCLT" gene complement(3399..3557) /locus_tag="DP116_09930" CDS complement(3399..3557) /locus_tag="DP116_09930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_09930" /translation="MNKKWAVKRITVNLALNEASKLEKYCDQTGRPATDVIRELIRAL PVTRSEVQ" gene 4521..5384 /locus_tag="DP116_09935" CDS 4521..5384 /locus_tag="DP116_09935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09935" /translation="MRNEFFIDTESGCIMPNEVAVETANRALVHRPASTGSGDSFIPK MIDADSIAKTQKLLDRAIVLAWRNIKSSRRPPALTRTRWVWRMAGAYHSSRHTTRLME EARDRFAASGRESLAQWAAQKAREEAGHDRLALLDIQSMGYDAEAVVQALVPSPIQAL VDYFFQSVQTTDPIGCVGFFYTAERLGTFQGEQYIQSVEALLPSGTHATRWLRIHTGV GAEVKHVEETVEVVAHLSSEELTRVARACYETALLRFSPPKEDYISDEELQHILKPLE LRTLVQVKSAG" gene complement(6007..9606) /locus_tag="DP116_09940" CDS complement(6007..9606) /locus_tag="DP116_09940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09940" /translation="MALQNWRRKRGVALTNKGLQKLQDAKRKLETKENFGNSYTLEEM STLSGLYPSTISKVLNREGGVDKKSLEKLFSVFCIKIDKIDYLNSKNYLDWGDATFIL VFYGRTEELTTLQQWILDERCRLVLLLGMGGIGKTALSVKLAQQIQEDFEYVIWRSLR EAPPINTIVANLIQVLSAQQETENNLPEKLSEKISRLVYYLQNNRCLLILDNVESILR SGSRAGQYREGYEGYGELFRQLGEANHQSCLVLTSREIPQEVALLKGQTLPVRSLQLS GLKVNEGQEILKVKGLSAAEEEWKAIIKLYTGNPLALKMVATTIVDVFDGNVTEFLQQ NTAVFGDIRDILDQQFERLSDLEKEIMYWLVINREPVSLSELREDIVLPIPPQKLLEA LESLIRRSLIEKATLREELRSTPMLVEKSAATFTLQPIVMEYVTQVLIELVCEEIVTE NIKLFRYHALMKATAKDYIREGQIRLILQPVIDGLLTVLRSKRSIKNQLTKILVRLRE ESPQEPGYTAGNILNLLCHLKTDLSGYDFSHLAVWQADLRNVKLHDVNFQNANLAKSV FAETFGGVLSVAFSPVSAASPEGIGELLALGDTNGEIRLYQVSDWKQLLSCKSHNNWV TSLVFSPDSRTFASGSVDCTVKLWDISTAQCLQTLQEHDDEVWSVAFSPDGNTLASSS DDYTVKLWSVSTGQCLRTFQGHTSWVCSVTFSPDGQTLFSGSDDHTVRLWDINIGECL KTFRGHDDGIRAITVSRDGKMLASGSEDQTVKLWDVSSGECLKTFQGHFNEVYSVTFR SQGDILASGSFDQTVRLWSVSTGECLKTFQGHSSWVYSVAFSPQGDLVASGSYDQTVR LWSVRTGECLKTFQGYTHQVLSVAFSPDGQTLASGSHDSSVRLWDVSEGKCLKTFQGH RAAIQSVAFSPDGQTLASGSEDRTVRLWDVNTGQVLQIFQGHRAAIRSVAFSPDGQTL ASGSEDQTVRLWDVNTGQALRTCQGHRNQVWSVAFSPQGMMLVSSALEETLKLWDVST GECLKTLEGHTGWVWSVAFSPDGELLASTSADRTLRLWSVSTGECLRLLRVDTGWLLS VAFSPDGRTLASSCQDHTVKLWDVSTGKCLKVLEGHRGGWLRSVAFSPDHQILASGGE 
DETIRLWDVTTGECLKILKAEKPYERMNIMGVTGLTTAAIATLKVLGAFAGEK" gene 9798..10148 /locus_tag="DP116_09945" CDS 9798..10148 /locus_tag="DP116_09945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137043.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidine triad nucleotide-binding protein" /protein_id="PRJNA477356:DP116_09945" /translation="MSETTETIFSKIIRREIPADIVYEDDLALAFRDINSQAPVHILV IPKKPIPRLADAESGDDALLGHLLLTAKRVAEQAGLANGYRVVINTGPDGGQTVYHLH LHILGGRQMAWPPG" gene complement(10324..10662) /locus_tag="DP116_09950" CDS complement(10324..10662) /locus_tag="DP116_09950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002707096.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" /protein_id="PRJNA477356:DP116_09950" /translation="MQRGEIWWADLPTPVASEPGYRRPVLVIQSDDFNRSRIRTVIVA VLTTNLRLAEAPGNVLVTTDETGLPQDSVVNVSQIITVDKSFLTERVSQVSDRVILLV EDGLRVVLAL" gene complement(10662..10898) /locus_tag="DP116_09955" CDS complement(10662..10898) /locus_tag="DP116_09955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ChpI protein" /protein_id="PRJNA477356:DP116_09955" /translation="MKTAISIPDNLFEAAESFAKQMGLSRSELYAIALQEYLQVHRCD RITEQLDAVYADEDSSVDPFFVQLQAHTLPKETW" gene 11043..11735 /locus_tag="DP116_09960" CDS 11043..11735 /locus_tag="DP116_09960" /inference="COORDINATES: protein motif:HMM:PF00563.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09960" /translation="MLFNCITELDLYILETALKTYKEATEKAKAERYDDKKMLSINVY PSSILRTAYENLLIKLIRRQKLISGKKLMLEISEKTLLPSIVDHEGKGLESFRSIARK YRKNYDVKFAVDDFGVGNASMSRLEEVDPTYVKVDRDILHFEKKLGRSIIEYLVDLKY DFNCFTILEGFDECSNFSLRELVVELGVEYIQGHSLGIATPEIKARLDKEQCENIFEI LNWRQSSNRNSG" gene 11802..12842 /locus_tag="DP116_09965" CDS 11802..12842 /locus_tag="DP116_09965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_09965" /translation="MSPESPVYRDKSASDYSIISVQNLSKLYPVAVKEPGIVGTMTHF FRRTYREIKAVEDVSFEIAPGEVVGFLGPNGAGKTTTLKMLTGLIHPSIGQVRVAGHI PFRRQEAFLQKITLVMGQKQQLIWDLPALDSLKINAAVYNISDKEFRQRVGELTEMLT LEGKLTQPVRKLSLGERMKAELLAALLHRPQVLFLDEPTLGLDVNAQVGVRDFLRDYN QRYQATVLLTSHYMADITALCQRVLLIHQGHLIYDGSLDGLQESFAPYREVHLELAND LPKEKLMLYGDVRQIEGRSVCFMVPQEALTRTVAKILADLDIVDLTVTEPPVEEVIGR VFQAGVVVAPSK" gene 13045..15150 /locus_tag="DP116_09970" CDS 13045..15150 /locus_tag="DP116_09970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="M3 family peptidase" /protein_id="PRJNA477356:DP116_09970" /translation="MSTNAILSESPLLKGSGLPSFGKISAESVVPAFNQLLAELDQEL TTLEANVNPTWNGLVEPLEKLTERLNWSWGIVNHLMGVKNSSELRQAYESVQPHVVQF SNKLGQSQPIYNAFKALRASDTWTTLDSAQQRIVEAAIRDAELSGVGLQGEAKERFNA IQMELAELSTKFSNHVLDATKAFSMTLTNQEEIDGLPPSLLNLAAQAAQAAGEENATP ENGPWRITLDFPSYGPFMQHSTRRDLREKLYRAFISRASSDELDNNPLIERILELRQE LAELLGFKTYAELSLASKMAPNVQAVESLLEELRRASYDAAIKELEELKAFAASKGAP EANDLQHWDISFWAERQREEKFAFTAEELRPYFPLPQVLDGLFGLVQRLFGVTITPND GQAPVWHQDVRYFQIANETGHPIAYFYLDPYSRPEEKRGGAWMDVCINRGKVTENGVT TTRLPVAYLVCNQTPPIGDQPSLMTFYEVETLFHEFGHGLHHMLTKVDYAGAAGINNV EWDAVELPSQFMENWCYDRPTLFGMAKHYQTGEPLPEHYYQKLLAAKNYMSGSGMLRQ IHFSSLDIELHHRYRPGGGETPKDVRDRLAKTTTVLPPLPEDSFLCAFGHIFSGGYAA GYYSYKWAEVLSADAFAAFEEAGLDNEKAIKNTGNRYRDTVLALGGSKHPMDVFKSFR GREPSTEPLLKHNGLAAAA" gene 15832..16674 /locus_tag="DP116_09975" CDS 15832..16674 /locus_tag="DP116_09975" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09975" /translation="MKQRLRFVRRVVFVTPFIASSALGIAPSQAATFAYSEGNFNFTN INQTPLGIGTEANANTLAIGKGGMVNTLAEATATFVTGPTPPEASNFSLSKALGENKG YLGQAESEATVIGNFVVDVNTPFSFDFTTNLTLETSIDNPPAENARAAGDISFALIDT TNNTVLDFFDLTGNVETLGDNDFIAFQKSDNVTLSNPVTTSNFGGNKESATASIQGSL QRSFANETNVALVEVKRNRTRVIAPEPSTSLALLSFCSIVGLVGKAKRKAITLVRSLK ESNC" gene complement(17002..18426) /locus_tag="DP116_09980" CDS complement(17002..18426) /locus_tag="DP116_09980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873903.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hemolysin D" /protein_id="PRJNA477356:DP116_09980" /translation="MMHTHNQKLLPSHKISSVESDGFLPPVSPWTSLAGVFLVGTVAT IFSLASSIKYNVTVKASANVRPTGDLRLVQPEIEGTVKDILVKENQMVKQGDAIAVLN DQQLQIKKSQIKGNIEQSKLQLIQMYAQIKSLDVQILAEKRVIEQIISSAKAELARNQ REYQERQVTTTSDSLVAEANLQKAEADLQKAKVDLDFATVDRDRYQQLSQTGAIGRRE FEQKKLVVEQTKLILQGQQKAVDIAKAKVKSAKAALNPSNATVDIAKKRVDQEIAKGE ATIATLIKEKEALIQRRSETQNQINQSRKDLQQLDTQLKSSIIRATSNGIILKLNVYN PGQVVLKREAIAQIVPLNAPLVIKAIIPSADVKKVAVAQKVQLRVDACPYPDYGTLQG TVSTISPDAITSQSNSGTGTTGSLTAGTSYFEVTVKPETLSFGHGEYKCHLQAGMAAT ADIISKEETALQYLLRKARLTTDL" gene complement(18494..20647) /locus_tag="DP116_09985" CDS complement(18494..20647) /locus_tag="DP116_09985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873902.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase domain-containing ABC transporter" /protein_id="PRJNA477356:DP116_09985" /translation="MKYAHILQYSEEDCGAACLASIARHHGRTFTLNHIREMVGTGQL GTTLLGLKRGAETLGFNARHVKTSAELLNRMNEAPLPAIIHWKGCHWVVLYGKKGKKC VVADPAVGIRYLSKKDLAEGWTDWLMLLVEPDPVRFFAQKNDQVGGFWRFFRRVWIFR GILAQALPLNFILGLLSLASPFLLQILTDDVLVRGDTRLLTTVAIAVVVMNLIASSLS WVQSNLIAHFAQRLQLGLVLEFGRAILRLPLSYYESRRSGEIVSRLRDINEINQLVAQ VVISLPSRFFIAVISLSLMVFYSWKLTVVAMLISVVMSLSTIVFQPSLRQKTRELLVT EAETQGILVETFKGALTLKSTTAAPQFWDEFQTRFSRLATLTLRTVQISIINNSFSSL VSAIGSIILLWFGGNLVSNPAENLSIGQLLAFNSMNANFLGLISTIINFVDRFTRAKT ATQRLTEVIDATPEHSEDDAKKPFARIQPNADIICRQVNFHYAGRLDLLEDFSITIPG GKVVALIGKSGCGKSTLAKIMAGLYPLQSGNIRIGLYNLDDLALDCLRQQVVLVPQDA HFWSRSILENFRLGRPQLTFEQIVQACQIAEADEFISKLPDKYQTILGEFGANISGGQ RQRLAIARAIATDPAILILDESTSGLDPVSENLVLDKLFKHRHGKTTILITHRPKVIN RADWVVMLDQGRLKLQGTLSDLRSKEGDHLDFLIV" gene complement(20934..21164) /locus_tag="DP116_09990" CDS complement(20934..21164) /locus_tag="DP116_09990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: 
GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09990" /translation="MNPLFTEIAESQATSVVGGAYKNTAKTKASSGASTESPIGFASS FTITSTFITDSSVSSVSESEANIDYIPEFAEA" gene complement(21287..21517) /locus_tag="DP116_09995" CDS complement(21287..21517) /locus_tag="DP116_09995" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_09995" /translation="MNPLFTEIAESQATSVVGGAYKNTAKTKASSGASTESPIGFASS FTITSTFITDSSVSSVSESEANIDYIPEFAEA" gene 22269..23081 /locus_tag="DP116_10000" CDS 22269..23081 /locus_tag="DP116_10000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115393.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="spermidine/putrescine ABC transporter permease PotC" /protein_id="PRJNA477356:DP116_10000" /translation="MKINLSRQKPRVFTWQAVFSLLMFVFMYLPILVLAFYSFNQSPY SARWQGLTLEWYGKLFRDERILSALQNSLIVAFCAVGISAILGTLMAVGLARYQFLGK SVYRGVSYLPLIIPDIAIAVATLVFLAAFAIPLSIWTIVCAHVVFCLAYVGLVVSSQL TNLDPHLEEAALDLGATPVQAFIKVLLPQLMPGIIAGCLLAFVLSLDDFLIASFTAGS GSNTLPMEIFSRIRTGVKPDINALSVILILVSGIVAFIAELIRASGQRKNSH" gene 23410..25275 /locus_tag="DP116_10005" CDS 23410..25275 /locus_tag="DP116_10005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318439.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_10005" /translation="MQICQNPNCSNPFNSDSNRFCTSCGQNNFGNFLRNRYRVLRLLG EGGFSRTYAAEDVDRLDAPCVIKQFFPQVQGTSERTKAAQLFKEEAKRLYELGENHWQ IPRLLAYFEQGSSLYLVQEFIQGQTLLQELQQQPFSEKKIRELLEDLLPVIQFIHERN VIHRDIKPENIIRRQTDGKLVLIDFGGAKQVTQTSLSRQATVLYTIGYAPSEQMAGFA CQASDLYALGVTCARLLTQCLPIQNPDGGQIQDTLYNPMNGQWLWREYLQEKGITISN DLREILDKLLKHLAKDRYQSATEVLQELNATKFFAQQIVAIPQFQSPLFEQQKVTTPI SELKASSELDSLQTFNFDVVTVDAQGHEIRRECRSAKFYAENLGSQVTLEMVGIPGDT FMMGSKDNDGDADERPQHPVSIKPFFMGKFPVTQAQWKAVAALPKVKQSLNPYPSKFK GANRPVENVSWHEAIEFCTRLFAKTGRQYRLPSEAEWEYACRAGTTTPFHFGETITTE LANCSDDHTWEQKAKNRKETTPVGSFQVANAFGLYDMHGLVWEWCADPWHKNYDGAPT DGSVWEVGGDDNRRVLRGGSWSFSSALCRSASRSWNEPDGGLRICGFRVVASWSV" gene complement(25349..26353) /locus_tag="DP116_10010" CDS complement(25349..26353) /locus_tag="DP116_10010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10010" /translation="MRKKLQQATKPLLKSLLLLCLVFTLAFSHTDGALAASGGRIGGG SFRMPSSSRPYSSPRTYAPPGGGYGGGYYAPYPGGGFGFPFLLPFWGIGGGFGGLFSI LIFIAIANFLVRSFRRVASNNSEADYNSEVGYNSNPSVSVTRLQVGLLGSARGLQTEL NHIAETADTNSPEGRAEILQEASLALLRHPEYWVYAGGGTEQARLNAAEAQFNRFSLA ERSKFTGETLSNVNNQLKATAPKDALPGGGELDNPTRLITEGPGEYIIVTLLAATLGK LQLGDINSADDLRQALRQFGSIPGEQLLAIEVLWTPQAEGDTLTSDDVLAEYPDLKLV " BASE COUNT 7742 a 5716 c 5626 g 7557 t ORIGIN 1 aggagagggg tgcccagagg gcggggtgag gttcttcgtt ttttataagt gttcatccga 61 acatgatatt attgcattct atcgattgtg cgatattgaa tggcttcagc tatgtggtgt 121 gcctttaact cgtcatcccc agctaaatct gcaatagttc gggctacttt gagaatgcga 181 tcgctcgccc ttgccgacaa acccaatttt ctaattgccg cttctaatat gttgcgacta 241 ctatcatcta gtttgcacca tttctggagg tgacggctct gcatttgagc attgcaagaa 301 agattttgtt catttttaaa tctatgggat gcatagtcgc gtgcttgttg cactcgttcc 361 ctcactgttg ctgatgattc tcctgtcggt tgttgagtaa tctcttctgg tttgaggcga 421 ttgactgcaa cttgcaaatc aattctatcc atcaaaggtc cagaaagttt tgcccagtat 481 tgttctcttt gccttggcga acaggtgcat tgctgaatcg tatctccgta gtaaccgcaa 541 ggacagggat tagtactcgc aaccaaagta aattgagctg gaaacatgac agattgtttc 601 gttctcgaaa tcgtcacata gccatcttcc aaaggttgac gtagaaattc taaaacatct 661 cttttaaatt ctgttaactc atctaggaat aaaatacctc tgtgtgataa agaaatttca 721 ccagggcgag gaaagctacc gccaccgacg agggaaggac cagaagccga gtgatgggga 781 ctgcgaaaag ggcgatcgcg caccaatgat cctctatttt tcaacaaacc agcgaccgag 841 tggatgcgag tcacttccag cgcttctgaa aaagacaatc ccggcaaaat acctggtaaa 901 cgtcgcgcca gcatggtttt accactacca ggcggtccca caaaaattaa attatgtccg 961 ccaaccgccg caatttctaa agcacgacga gcatgagctt gtcctttgac atcattcaaa 1021 tcagctgctt ccagagacat tgcctgtatc ctctctacag tattatccaa ctgtacaggt 1081 ttgtaacgcc ctggattatt caaaaaatca gtgacctcag atagattttt gaagccataa 1141 actgccaatc cttcgaccac agcagcttct tgtgcattat cagcagggac aactaaacca 1201 gtaattccca tcttttgggc agtcgcagcg atcggtagga cacctgcaac tgagcgtaaa 1261 ccaccatcca 
gagacacctc acctagaaaa agatggtctc ctaacaaata agcgctaact 1321 tgttcagaag ccgccaaaat tcccacactg ataggcaaat caaaacatgg accttccttg 1381 cgtaaatcag caggtgttaa atttatgaca atctttcgca taggaaaggc aaaaccagca 1441 ttctttagag ttgctttgac tctttcccga gactcttgca ccgctgtgtc tggaagtcct 1501 aaaacgacaa ttcccggcaa acctcctgaa acgtccactt caacgcccac cttaacagcg 1561 tcaatgccga caattgatcc actccaaact ctggcaagca tgtgctgaat atttgatcaa 1621 gaacactaaa cctatagaat ctctagcaga tcaacaggta aattggcatg agtgaaacca 1681 cagagacaat tttcagcaaa atcattcaca gatgatgctg ttattgtaat aatattttat 1741 tgcttaaaaa aaatattatt tgtagattta ctttgctcgc tatagatgca aaggtgaatg 1801 gtgtttcttt catatgttcc tggcggtttt caagcgaagc gtgagaagct cacaagcagc 1861 aagtgaaatc tcattattag tctttgagaa aatcaatcga gagctttaga caatttataa 1921 cgattcaagg agtatatagc aatgaccttg ttcaaaacag cgttaccgta cactgagttg 1981 acaattagtg gtgcgatcgc ctattaaaca gttattagct atcagttatc agtgagccag 2041 cactgtgcac aagggtctca ctccctgaat ttatttctgg agagaaataa tagagtccaa 2101 actttactaa atgccccgaa ttcattcttg gggatacctt ctgacttctt aaggttttat 2161 cttttgacca gtcaaacggt tgaaatcaat acgtataaag cgatgaagtt tttttagctt 2221 tacaatacat ttttgcaaac tttaagcaag tgaaaatcag gaagttaaga atcaacttaa 2281 atagatatgt aaaaaagttt gttgaagcac acaaggaatc ttgttagtgt tatttataac 2341 ttttgtttaa atatactgaa ttgattgctt atctttcttt tacttaaagt gtagaaattc 2401 cgtttgtttt tccaaaaact ttttctatgt agtaactttg atactgtatg ttttataaaa 2461 cacatagctt gagcactgat ctagccattt tgtgtcctta ttatttttta gttgcgaaag 2521 aagagtattg agtgctactt gtcttattcc tttactcgta tcccttcgta gcaaagtagg 2581 gtggtttcat acaaacagag gaaaactata actcagtgcc agagtacgta agcgtttcat 2641 tttgagtcaa ataaaaaaca taagtaactg aggcaagcaa gtttgttagg tgcacagata 2701 aatctgtact tcattcaaac gagaagtacc atatgagcgc ccaaggggcg cgattacacg 2761 cccccaacga atgaaagaaa caccatcccc caaattctgt gttaagagtt aagcgttaag 2821 agtgtttctt aagagcgccc cacgggcggc gcgccactcg ttatgaatga aagaaacagc 2881 atcctccttc ttcagtctta agagttaagc gttcgggtgt tccctgttcc ctgttaagag 2941 ttccctgtgt ttcttaagag 
ttaagttcaa tctcaaaatc tagattttcc ctggtaaaaa 3001 tggttcagaa ttggcaatat ttagtaatct taataaaaat atagaaataa tgaacgagcc 3061 agccgcccat acatttccca gttctgcaac cctcagtcaa aaatcggcag tcaaagaaat 3121 caccgtagac ttgacaccaa gtgaagcttc gacgctagaa aagtattgca atcagacagg 3181 caaagcagct acagatgtaa ttcgggaact tattcaagag ttatgcttaa cctaactttc 3241 cacaccaact gtgaaacgtt agcgtctaga agggttgtag gtcgggtata ccatctaaag 3301 aaaagtaaaa gtcacagatt gtaagggcga tcacacctct aagtgcaaaa cacaatagag 3361 aaaaagtcgc ctgttagtat caaaaagaaa cccagtgctc attgtacttc agacctcgtc 3421 actggtaacg ctcgaatcag ttcccgaatc acatctgtag ctggtctccc tgtctggtca 3481 caatactttt caagcttgga agcctcattt aatgccaagt ttacggtgat tcgcttgact 3541 gcccattttt tattcatggt aaaaatctag ttgtattaca caaatttctg gaacacacaa 3601 caaaaatctg cattctatag aatacaatta tgaaaccata tgaagtatct aggcaacaag 3661 gtttacttga gccagaaaat atctttttgt cacttcgtga atcgatgcct tgcagatcgt 3721 gttcattact tacctgcaat ctcctgtatt atcctttcaa ctttcgttat atggttttag 3781 acatcctcaa aaaatgtcaa agcctatcca ggactgaatt tttcccatga gttttggaaa 3841 aagccatagc gatcgcccgt atacgcccaa ccaagcgaag cttggttccc tgaacgaagt 3901 tcaggggagg cactttgtgc ctcgggcgta gggcgcgcca agggcgatcg catcagccaa 3961 ggggttatta aaatctctgc ataatccaaa taaaaaaatg tccaacctaa gccctatacc 4021 tgcggcacgg cttaaggcct gtaccgtgcg gtctggtgca ggagatacgg gcacgcactt 4081 tacaacaacg gaactacgac tgattaacag gtaagggctg tttcccgctt gaagggtggg 4141 aaacaaggtt ggatattttt ttcgcatgtt cgtgcttaaa gaatcgttac agcaaaaggg 4201 tattaactca aatagctcaa ttttgaaggg tttaagcaat cctactcatg ctagttgagg 4261 ggcattttat tttttgtatc tccttaactt gtcggatcat gtcgttctta tgttattctt 4321 tatttaaata aaaataaaag aatcaaattt gaggatttac tttacccgct atggatgcga 4381 aattgaagca tctaaaagcg aatacagcat gaacagttga atttacccaa aaaaccatct 4441 cataacaagg catctgcata tcttttggta gcagaggcac tggttaagat gctcaaacat 4501 atgggagtgc aaaacgaacc atgcgtaatg agtttttcat tgatactgaa tcggggtgta 4561 ttatgcctaa cgaagtagcg gtggagactg ccaaccgcgc tttggtgcat cgacccgctt 4621 ctacggggag cggcgattcc ttcatcccaa 
agatgattga cgcagacagc atagctaaaa 4681 cccagaagct tttagacagg gcaatagtat tagcctggag gaatataaag tcgagccgca 4741 gaccaccagc cctgacccgt acacgttggg tatggcgaat ggcaggtgcc taccattcaa 4801 gccgccatac cacacgactg atggaggaag cgcgagaccg cttcgctgca tctggtcgcg 4861 agagtttggc acagtgggct gcacagaaag ccagagagga agcgggtcat gaccgacttg 4921 ccctgcttga tatccagtcg atgggatatg acgcagaggc tgtagtgcaa gcgcttgtgc 4981 catctcctat acaggctttg gtagattact ttttccaaag cgtgcagacg accgatccaa 5041 ttggttgcgt tggttttttt tatacagccg aacgcttagg gacatttcaa ggggagcagt 5101 acatccagag tgtagaggct ttgttaccgt cgggtaccca tgccacacgc tggctgcgga 5161 tacataccgg tgttggtgct gaagtgaagc atgtagagga gactgttgag gttgttgcac 5221 atctgtcttc tgaagagctt acccgcgtag caagagcctg ttatgaaacc gcattgttgc 5281 gctttagccc acccaaggag gactacatat cagacgagga actccagcat atattgaaac 5341 cgctagagtt acgtacactt gtgcaagtca aatctgctgg ttaaccgctg acgcaaaaaa 5401 tcgagagcct tagacagttt atctaagaca ccatagtgta ctgtcccaaa caacattaag 5461 gagaattcaa caatgaccac ggcaactatc actattgagc agcaatcttt gagccaggat 5521 ctcaccagct actttcttga gacctgtttc gatatattga atggcatcga acttgatgat 5581 ttggaaagaa ttatgaagaa acaattcccg caaggattgc caagcgacac cgaattcggt 5641 tcaatgtaga ttcagcatct atatccattt agtggcttaa acttgaaaca gtcggtagcc 5701 ctgattgtct agtccctgtt aaactaggag cgcccagcaa cttcaaggcg ggttagccct 5761 agctaccgta acagttagac agtcagggat taacacttat tgggcagaaa aaattttccg 5821 ttcattacag ctattttcag ttaattgaac cacagatttt cgtagggaca cggctagcct 5881 ttgcccctac ctgtggttta tttacttgaa aaacgctgtg agctgatgtt ttagacaagg 5941 tttgcggtga gctgatctct ctcggtggtg aaatctccat tggtttcata taattgcttg 6001 cggagactat ttttcaccag caaatgctcc caaaactttc agcgtagcaa tagctgctgt 6061 agttaaacct gtaaccccca taatgttcat ccgctcatag ggtttctcag ctttcagaat 6121 ctttaagcac tcacctgttg tgacatccca cagtctaatt gtctcatctt caccaccgct 6181 agccagaatt tgatgatccg gactaaaggc gactgacctg agccaacccc ccctatgtcc 6241 ttccaaaact ttgaggcatt taccagtgct gacatcccac aacttgactg tgtgatcttg 6301 acagctactc gctagtgttc ggccgtcggg actaaaggca 
actgatagta accaccccgt 6361 atccacccgc aaaagtctaa ggcattcacc agtgctgaca ctccataacc ttaacgttcg 6421 atctgcacta gtactcgcca acagctcacc atccggacta aaggcaactg accaaaccca 6481 acccgtatgc ccctctaatg tcttgaggca ctcaccagtg ctaacatccc acagcttgag 6541 tgtttcctcc aacgcgctgc tgactagcat catcccttga ggactgaagg caactgacca 6601 gacctgattt ctatgtccct ggcaagttct caaagcttga cctgtattga catcccatag 6661 cctaactgtt tggtcttcac tgccactcgc cagcgtctga ccgtcaggac taaaggcaac 6721 tgaccgaatc gcagcacgat gtccctgaaa aatctgcaag acttgacctg tattaacatc 6781 ccacaacctt acagttcgat cttcactgcc acttgctagg gtctgcccat ctggactaaa 6841 agcaactgac tggatcgcag cacgatgacc ctggaacgtt ttgaggcact taccctccga 6901 gacatcccac aacctcactg aagaatcgtg actgccactt gccagcgtct gtccatccgg 6961 actaaaggca actgagagta cctgatgagt atatccctga aacgttttga ggcattcacc 7021 agtgcgaaca ctccacagcc gtaccgtctg atcgtaactt ccactagcca ctagatcgcc 7081 ctgtggacta aaggcaaccg aatataccca actggaatgt ccctggaagg ttttgaggca 7141 ttcaccagtg ctaacgctcc atagccttac cgtttggtca aaactgccac tagctagaat 7201 atcaccttgt gacctaaaag tgactgaata tacttcattg aaatgtccct ggaacgtttt 7261 gaggcattca ccgctactga catcccataa ctttactgtc tggtcttcac tgccactcgc 7321 cagcatttta ccatcacgac tgacggtaat tgcccttatt ccatcatcgt gtcctcggaa 7381 agttttgagg cattcaccga tgttaatatc ccacaaccga actgtgtggt catcactgcc 7441 actaaagagc gtttgcccat ccggactgaa ggtaacagag catacccaac ttgtgtgtcc 7501 ttggaacgtt ctgagacatt gaccagtgct gacgctccat aacttcactg tgtagtcatc 7561 actactactc gctagcgtgt taccatccgg actaaaggca actgaccaaa cttcgtcatc 7621 gtgttcttgc aaagtctgta ggcattgagc tgtactgata tcccacagtt tcacagtaca 7681 gtcaacacta ccgctagcaa aggttctgct atcaggacta aagacaagcg atgtgaccca 7741 attattgtga cttttacagg acagaagttg tttccaatcc gaaacttggt acaagcgaat 7801 ctcaccattg gtatcaccca aagctaaaag ttcgccaatc ccctccgggg acgctgcgct 7861 aacaggacta aaggctactg acaaaacacc gccgaaggtt tcagcaaaaa cagatttagc 7921 tagattggcg ttttggaaat tgacatcgtg caatttcaca ttccgcaaat cagcttgcca 7981 aacagctaga tgagaaaagt catagccgct taaatctgtc tttaaatgac 
aaagcaaatt 8041 gagaatattt ccagccgtat accctggttc ttgcggcgat tcctctcgca gccttaccaa 8101 aattttggtt aactggtttt taatgcttct tttgctcctt aaaacagtga gcagcccatc 8161 tatgactggt tggaggatga ggcgaatttg accttccctg atgtagtctt tcgcggtcgc 8221 tttcatgaga gcatggtatc taaaaagctt aatattttca gttacaatct cttcacaaac 8281 caactctatc aatacctgag tgacatactc cataacgata ggttgtaggg tgaaagttgc 8341 tgcacttttc tcaactagca taggcgtaga gcgaagctct tcccgcaggg tagccttctc 8401 gattagcgat cgcctaatca gagattctaa agcctccagt aatttttgtg gtggtattgg 8461 taatacaatg tcttctcgca attctgatag cgaaactggc tcgcgattaa tcactagcca 8521 atacattatt tctttttcta aatctgacaa gcgctcaaac tgctggtcta aaatatcacg 8581 aatatctcca aaaacagctg tattctgttg caaaaattca gttacattac catcaaagac 8641 atcaacaatt gtcgtagcaa ccatcttcaa agccaatgga ttgcctgtat agagtttaat 8701 tattgctttc cattcttcct ctgctgcaga tagcccctta acttttaaaa tttcctgccc 8761 ttcgttcacc tttaaaccac ttaattgtaa ggagcgaacg ggtaatgttt gtcctttgag 8821 taatgccact tcttgaggta tttcccgact agtcagcact aagcagcttt ggtgatttgc 8881 ttctcctaac tgtctgaaaa gctcaccata cccttcatat ccttctcgat attgtccagc 8941 tcgacttcca ctgcggagaa ttgattctac attatcaagt atcaacaaac agcgattatt 9001 ttgtaaataa tagactaatc gcgatatttt ttcacttaat ttttctggta agttattttc 9061 tgtttcctgt tgagcagata gaacctggat caggttagca acaatagtgt taataggtgg 9121 ggcttctcgt agcgatcgcc agataacata ttcaaagtct tcctgaatct gttgagcaag 9181 cttgacagac agagcagttt taccaatacc acccattcct aatagtaaca ccaatcggca 9241 gcgctcatca agaatccatt gctgtagcgt ggtaagctct tctgtacgtc cataaaaaac 9301 taatataaag gtagcatcgc cccaatctag ataatttttt gagtttaaat agtcaatttt 9361 gtctattttt atgcaaaaaa ctgaaaacag cttttcaaga ctttttttat caacccctcc 9421 ttcccgattt agtacttttg agattgtact gggatataac ccagaaagtg tgctcatttc 9481 ttcaagggtg tagctgttgc caaaattttc ctttgtttct aacttacgct ttgcgtcctg 9541 aagtttttgc aaacccttat tagtcagtgc aacgccgcgc ttgcgtctcc aattctgtaa 9601 agccatgtat taaaaataca gtcatcataa cttaactttg actgacttgg ctttgtgatt 9661 aaataaaaat tttaccgagt ttgacagcgt gaataccggg caattgatcc actcccaact 
9721 ctggcaagta tgtactgaat atttcattaa gaacactaaa cctataaaat ctctagcaga 9781 tgaacaggta aatcgatatg agtgaaacca cagagacaat tttcagcaaa atcattcgtc 9841 gagaaattcc agctgacata gtttacgaag atgacttagc actggcattt agagacatca 9901 attcccaagc ccccgttcac attctcgtca ttcccaaaaa acccataccc agactcgctg 9961 atgccgaatc tggtgatgat gctcttcttg ggcatcttct attaacagcc aagcgagttg 10021 ccgaacaagc tggacttgca aatggctatc gcgttgtcat caacacaggt cctgatggtg 10081 gtcaaactgt ttaccacttg catctgcaca ttctgggtgg acgccagatg gcatggcccc 10141 ctggttgata gagagatagg gaacatatga cacgtagggt gcgtttcact tacacaaacg 10201 cgccctatga gaaaaaatga gtcaaaaaac tttacaaaat atcatttaga tttttatcta 10261 gatagataaa tctaggctgt catagattga cactctcagg gctgatagcg tatcctatga 10321 agctcaaagc gcaagaacta cccgaagacc gtcttcaact agcaatataa cacggtcgct 10381 aacttgactt acccgttccg tcaaaaacga cttatcgact gtaatgattt gtgaaacatt 10441 cacaacagaa tcttgtggca aaccagtctc atcagttgta actaaaacgt tacccggagc 10501 ctcagctaac cgaaggttgg tagttagtac agctacgatt acagtgcgaa ttcggctacg 10561 gttaaagtca tcagattgaa taactagaac gggtctacga tatcctggtt cagaagcaac 10621 tggtgttgga aggtctgccc accaaatttc acctcgctgc attaccaagt ttcctttggt 10681 aaagtatgag cttgtagctg aacgaaaaat gggtcaacac tactatcttc gtcagcatac 10741 acagcatcta actgctctgt aatgcgatcg caacgatgaa cttgtaagta ctcttgaagt 10801 gctatagcgt ataactcact tcgggataac cccatttgtt ttgcaaagct ttcagcagcc 10861 tcaaagagat tatcgggaat tgaaatagct gttttcatag ggatggtata accaaagtta 10921 taccactatc ttatcacttg caatcaatgc caactcatat taatagtgct gcgctagaag 10981 aataggcgat tcaggaagtt gactaagttg cttacgaaac gctttggtta cactcgactt 11041 cattgctatt caactgcatt acagaactcg atctttatat tttagaaacg gcacttaaga 11101 cttacaagga agctacagaa aaggcaaaag cagagaggta tgatgataaa aaaatgcttt 11161 caattaatgt ttatccttca agtattcttc gcactgcata tgagaattta ttaataaaac 11221 ttataaggag gcagaagtta atctcaggaa agaaactcat gcttgagatt tcggaaaaga 11281 ctctcctccc atcaattgtt gatcatgaag gaaaaggttt agaaagcttt agaagtatag 11341 caagaaaata caggaagaat tacgacgtca agtttgctgt tgatgacttt 
ggagtcggca 11401 atgcttccat gtctaggtta gaagaggtcg atcctacata tgtgaaggtt gacagagata 11461 tccttcactt tgagaaaaag ctaggaagga gtattataga atatcttgtt gatctaaagt 11521 acgattttaa ctgttttaca attcttgaag gatttgatga gtgcagcaac ttctcattga 11581 gagaactggt ggttgaactt ggggtcgaat acattcaggg acatagtctt ggcatcgcaa 11641 ctcctgaaat taaggcaaga ttggataaag agcaatgtga aaatatattt gagatattaa 11701 attggagaca aagtagtaac cgaaattcgg gctaatcttt cccatttggc aactcacaaa 11761 aatgttacag taaaagacaa atcgtctcta tatggcggct tatgtctcca gaatctcccg 11821 tataccgtga caaaagtgca agtgactaca gtataatatc agtccaaaac ctgagcaaat 11881 tatatccggt cgctgttaag gaaccaggca tagtcgggac aatgacccac tttttccgcc 11941 gcacttaccg agaaatcaaa gcagttgaag atgtttcctt tgaaattgca cctggtgagg 12001 tggtgggctt tttgggacca aatggggctg gtaaaaccac gacactgaaa atgctcacgg 12061 ggttgattca tccatctatt ggtcaagtca gagtggctgg acacattccc tttcgtcgcc 12121 aagaagcgtt tttgcaaaaa attaccctcg tgatggggca aaaacagcaa ctaatatggg 12181 acttgcctgc tctagattct ctcaagatta atgctgctgt ctacaacatt tctgacaaag 12241 agttccgcca acgggtaggg gaattaactg aaatgctgac gctggaagga aaactaacgc 12301 aacccgtgcg gaagctgtct ttaggtgaac ggatgaaggc ggaattatta gcagcacttt 12361 tacaccgtcc acaagtcctg tttttggatg aaccgacact agggttagat gtgaatgctc 12421 aggtgggggt gcgtgatttc ttacgcgatt ataatcagcg atatcaagcg acggtgctgt 12481 tgacaagcca ttacatggca gatatcacgg ctttgtgtca gcgggtactg ctgattcacc 12541 aaggacattt gatctatgat ggcagtttag atggtttgca ggaaagtttt gccccttacc 12601 gggaagttca tttggagtta gccaatgatc tgcccaaaga aaaactgatg ttatacggtg 12661 atgtacgaca aatagaaggg cgctccgtgt gttttatggt accacaagag gcactcactc 12721 gcactgtcgc gaagatttta gcagatttgg atattgtaga tttgaccgtg acagaaccgc 12781 cagttgaaga agtgattggg cgagttttcc aagcgggagt ggtcgtagca cctagtaaat 12841 aacctgtctc gttggttact tcaatgtggt cgtgcccttt gttgatttgg tgatcggtct 12901 caccgaccaa attcggaaat cagtgcctat gttacaaatt ttacaaaaaa aacacccgca 12961 agagagaaag ctattgctac attagatgat atctaaagaa aaatcgcttt tgactgcgta 13021 ttgacccgct cttcagacaa aactatgagt 
acaaatgcca ttctttcaga gagtccttta 13081 ttaaaaggct ctggtttgcc ttcctttgga aagatttcag cagagtctgt ggtaccagca 13141 tttaaccaac tcctggcaga actcgatcag gaacttacta ctttagaagc taatgtaaat 13201 cctacttgga acggtttagt agaacctcta gaaaagctga cagaacgcct caattggagt 13261 tggggaattg taaaccattt gatgggtgtg aaaaatagct ccgaacttcg ccaagcatac 13321 gaaagcgtac agccgcacgt cgtgcagttt tccaacaagc ttggtcaaag ccaaccaatt 13381 tataatgctt ttaaggcact ccgtgccagt gatacttgga caacacttga ctcagctcaa 13441 cagcgcattg tagaagcggc gatccgagat gcggaacttt ctggtgttgg cttgcagggc 13501 gaagcaaagg aacgttttaa tgccattcag atggagttgg cagaactgtc tacgaagttt 13561 tctaaccatg tgctagatgc aaccaaagcg tttagtatga ctttgacaaa tcaagaagaa 13621 atcgacggtt tacctcccag tttactgaat ttagccgcac aagcggcaca ggctgcggga 13681 gaagaaaacg cgactccaga aaatggtcct tggcggatta ctttagactt tcccagttat 13741 ggtcctttca tgcagcatag cacccgacgg gatttgcggg aaaaattata cagagccttt 13801 attagccgtg cttcttcaga tgagttggat aacaacccct taattgaacg cattttggag 13861 ttgcgtcaag aactagcaga attacttggc tttaaaacat atgctgaatt aagcctagct 13921 agtaagatgg ctcctaatgt tcaagcagta gagtccttat tagaggaact gcgtcgtgct 13981 agttatgatg ctgctattaa ggaattggaa gaactcaaag cctttgctgc atctaagggt 14041 gcaccagaag caaacgattt gcagcactgg gacatcagtt tttgggcaga acgtcaacga 14101 gaagagaaat tcgctttcac tgctgaagaa ttacgccctt atttcccact tccccaagtg 14161 ctggatggct tgtttggact ggtgcagcgg ctgtttggcg tcactattac cccgaatgat 14221 gggcaagccc cggtttggca tcaagatgtg cgttacttcc aaattgctaa cgaaacaggt 14281 cacccaattg cctactttta tcttgatccc tacagccgtc cagaagaaaa gcgtggcggt 14341 gcttggatgg atgtctgtat taaccgtggc aaagtcacag aaaatggagt aacgactacc 14401 cgcttacctg tggcgtattt ggtgtgcaac caaactcccc caataggtga tcagcctagc 14461 ctcatgactt tctatgaggt ggagactttg ttccatgagt ttggtcatgg cttgcatcat 14521 atgctcacca aggtagacta tgctggagct gcaggcatca ataacgttga gtgggatgcg 14581 gtggaattgc ccagccagtt catggaaaac tggtgctacg acagaccaac tttgttcggt 14641 atggccaagc attaccaaac tggtgaacct ctaccggaac attactacca aaagttactg 14701 gcagcaaaga 
actacatgag tggtagtggt atgttgcgac aaatccactt tagcagcctt 14761 gacatagaac tgcaccaccg ctatcgtcca ggtggcgggg agactccaaa agatgtgcgc 14821 gatcgccttg ccaaaaccac aactgttttg ccaccactac cagaagattc atttttatgt 14881 gctttcgggc acattttctc tggaggatat gcggctggtt actacagtta taaatgggct 14941 gaagttttaa gtgctgacgc ttttgccgct tttgaagaag ctggtttaga taacgaaaag 15001 gctataaaaa atacaggaaa ccgttatcgt gatacagtac tagctcttgg tggtagtaag 15061 catccaatgg acgtgtttaa atccttcagg ggtcgcgaac ccagtactga acctttgctc 15121 aagcataatg gcttagcagc agcggcgtaa ctctcaacac ctcagttagc tacagaaatg 15181 gatattgcta actgaggaaa catgaatcaa aacaaaaagc aaagcaagaa tactatagcg 15241 cttggtagtg ttgggtgcaa tacacggctg ttgtgaaagt gcatgcactt tcacattatt 15301 cgtggtgatg tatgcgattc aaatgagaac cgcgcttatg aagccgcagc taactttact 15361 tgagcctggt gaataagaac tgctgactaa gcctgccata gcggacttag tcaagcagag 15421 gaactacttg aaaaacaaac ttagtgtggc tgaactctag cagtacccgc atccattatt 15481 ccctaacatt ttttgtacac aataactcag aaattttgtc cgtgttttca cgaagttttg 15541 gtaaagaaga aaattatcat cactttacaa aaaagctcag aaaatactta ggtttttctg 15601 ttaatattct tactcagatg cgcattgaaa aataagagct ataccatcgt cacactgtaa 15661 cggttattgc tgttacttgg tgcaaataag ttacattatc tacccaataa aagaataact 15721 gcagacacag cagagtagat gctttgaaaa ttcaaagcag atttgtgctg cgtacaggtt 15781 tttacaggcc aagttaccca atctaatgac caagtttagg caaggaatcg tatgaagcaa 15841 cgtttacgtt ttgttcgacg tgtggtgttc gttacgccat tcatagctag ttctgcctta 15901 ggaattgcac ctagtcaagc tgctactttt gcttactctg aaggaaattt caattttacc 15961 aacattaatc aaacaccatt aggaattgga actgaggcta acgctaacac ccttgccatt 16021 ggcaagggtg gtatggtgaa tactctagct gaagcaacag caacttttgt aactggtcct 16081 actccaccag aagcatccaa tttttctttg agcaaagcct tgggcgaaaa caaaggttac 16141 ttgggacaag cagaaagtga ggctacagtt attggtaatt ttgttgtaga cgtgaataca 16201 cctttttctt ttgacttcac aacaaattta actctagaaa catcaataga taatccacca 16261 gcagaaaatg caagagcagc gggagatata tctttcgcat taatcgacac cacgaataac 16321 actgtcttag atttcttcga cctaacagga aatgtagaga ctctaggcga taacgatttt 
16381 atcgcatttc aaaaaagcga caatgtgact ttgagtaatc cagtcactac gtctaatttt 16441 ggagggaata aagaatctgc tacagcttcc attcagggtt ctttgcaacg ttcttttgcc 16501 aacgaaacaa atgtcgcttt agtagaagtt aagaggaatc ggactagagt tatagcacca 16561 gaaccttcca ctagtttagc tttgctctct ttttgtagca tcgttggtct tgttggcaaa 16621 gcgaaacgta aagcaattac tttagtacgc tcattgaaag aaagtaactg ctgacggtta 16681 atacgctata gtgaccaaaa gcgatgatgc agacttttgg tcactctatc tgtatccact 16741 gatttagcat cagaagtaga gcttattaac agttaccagt taccagttat cagttatcag 16801 ttaccagtta ccagttatca gttaccaagt taaggggggt caagaaatcg ctgacactgt 16861 tcactgttca cccttcgggt atgcgcaaag cgcacgccaa aggcgttagc gaagcgtctg 16921 gtggaggaga tacgccagtc gcctacggcg ggagaacgcc tcatgcgcgc tggactcact 16981 gttcactgat ttaatagcta ctcacaaatc cgttgttaat cttgcttttc tcaacagata 17041 ttgcaatgct gtctcttctt tagaaataat atcagctgtc gcagccatcc cagcttgaag 17101 atggcatttg tattcaccat gtccgaagga aagtgtttcc ggttttactg tgacttcaaa 17161 gtagctagta ccagcagtta aagaaccagt ggttcctgta cctgaattgc tttgagatgt 17221 tatggcgtct ggagaaatgg tgctaacagt accttgaaga gttccgtaat caggataggg 17281 acaagcatca actcgcaatt gcactttttg agcaactgcg acttttttaa catcagcaga 17341 aggaattatg gctttaataa ctagaggagc attaagagga acaatttgag caatagcttc 17401 gcgctttagt acaacttgac cagggttata gacatttagc ttaagaataa tgccgttact 17461 ggtggcacga atgatgctac ttttgagttg agtatcaagt tgttggaggt cttttcgaga 17521 ttggttgatt tggttttgtg tttcagatcg ccgttgaatt aaggcttctt tttctttgat 17581 caaggtcgcg atagttgctt cacctttagc aatttcttgg tcaacgcgtt tttttgcaat 17641 gtcaaccgtt gcgttgctgg gattaagagc agctttagct gatttgactt tggctttggc 17701 gatatcaacg gctttttgtt gaccttgcag tattaatttt gtttgttcaa caactagttt 17761 tttctgttca aattcacgcc gaccaattgc tccggtttgt gacaattgct gatagcgatc 17821 gcgatctacc gtagcaaagt ctaaatctac cttggctttt tgcaagtctg cttctgcttt 17881 ctgcaaattt gcttctgcaa ccagtgagtc gcttgtcgtt gtaacttgtc gttcttgata 17941 ttctcgctga ttacgtgcca attccgcttt cgcagaagaa attatctgtt ctatgactct 18001 tttttcagca agaatctgaa catctaaact tttgatttga 
gcatacattt gaatcagttg 18061 taacttgctt tgctcaatat tgcctttgat ctggcttttt ttgatttgca gttgttgatc 18121 attaaggacg gcgatcgcat ctccctgctt gaccatttga ttttctttta caaggatatc 18181 tttaacagtt ccttctattt ctggttgtac tagccggaga tcccctgtgg gacgcacatt 18241 agcactagct tttactgtga cattgtattt tatagaagag gcgaggctaa atatagtagc 18301 gacagttcca acaagaaata ctccagctaa agatgtccaa ggactaacag gagggagaaa 18361 cccgtcactt tcaactgagg aaattttatg cgaagggaga agtttttgat tatgtgtatg 18421 cataatgact aattgctaat tcttaattgc tacttgagtt tataaattta ttctctgagc 18481 cattagccat taactaaact atcaaaaagt ccaaatggtc tccttctttt gagcgtaaat 18541 ctgaaagagt tccttgaagt tttaatctgc cttgatctaa cattacaacc caatcagcac 18601 gattaataac tttgggacga tgggtaatca aaatcgtggt tttaccgtga cgatgtttaa 18661 acagtttatc cagcacgagg ttttcgctca cagggtcgag tccagaggta gattcatcta 18721 aaatgagaat tgctggatct gtggctatag ctcgtgctat tgctagtcgt tgacgttgtc 18781 caccagaaat atttgcgcca aattcaccta aaatagtttg atatttgtcg ggtaatttac 18841 taataaattc atcagcttca gcaatttgac aagcttgcac aatctgctca aaagttaatt 18901 gtggtcttcc taagcgaaag ttttcaagaa tagaacgact ccaaaagtga gcgtcttgag 18961 gaactaaaac cacttgttga cgcagacaat caagagccaa atcatctagg ttatagagtc 19021 caatacgaat attacctgat tgcagtggat ataacccagc catgatttta gctaaagtgc 19081 ttttcccaca gccggattta cctattagag caacaacctt accaccagga attgttatag 19141 aaaagtcttc taataagtca agccgacctg catagtgaaa gttaacttga cgacaaataa 19201 tatcggcatt tggttgtatt cgagcaaatg gctttttggc gtcatcttca ctatgttctg 19261 gggtcgcatc tataacttct gtgaggcgtt gtgtggcagt tttggcacgg gtaaatcgat 19321 caacaaaatt aatgatagta ctaattaagc ccaggaaatt agcattcatc gagttaaatg 19381 ctagcaattg cccgatactt aaattctccg ctgggttact caccaaattc ccaccaaacc 19441 atagtaaaat aatactgcca atagcagaaa ctaaactaga gaagctattg ttgataatgc 19501 taatctgaac agtccggagt gtgagggtag caagacggct aaatcgggtt tgaaattcat 19561 cccaaaattg gggtgcagct gtggtgcttt tgagggtaag tgcgccttta aatgtttcta 19621 ctaaaatgcc ttgtgtttct gcttctgtaa ccaaaagctc acgggttttt tgccgtaaac 19681 taggctgaaa cactattgta 
gataggctca tcaccacaga gattaacatc gctactacag 19741 tcagtttcca gctatagaaa accatcaaac tcaaagaaat gactgcaata aaaaatctac 19801 tgggaagaga gataacaact tgagcaacta actgattaat ctcgttgata tctcgcaacc 19861 gactcacaat ttccccgctg cgtcgagatt catagtaaga aagaggtaat cgcagaatag 19921 ctcttccaaa ttctaagact agccctaatt gcagacgttg ggcaaagtga gcgattaagt 19981 tagattgcac ccatgaaagg ctgctagcaa ttaaattcat gactacgacg gcgatcgcca 20041 cagtcgtcag taacctcgta tcaccacgca ccagcacatc atcagtcagg atttgcagca 20101 gaaaaggaga agctaaagat aataatccca agatgaaatt taacggcaaa gcttgcgcca 20161 aaattccgcg aaaaatccag acacgtcgga aaaaacgcca aaagccccca acctgatcgt 20221 tcttttgggc aaaaaagcga actggatctg gctctaccaa aagcatcaac caatccgtcc 20281 aaccctcggc taaatctttt ttagaaagat aacggattcc tactgctgga tcggcgacga 20341 cacatttttt gcctttcttg ccgtataaaa caacccagtg acaccctttc cagtgaataa 20401 ttgctggtaa gggcgcttca ttcatccgat ttaacaattc tgccgaagtt ttcacatggc 20461 gagcattaaa cccaagtgtt tctgctcctc gtttcagccc taataaagtt gtgcctaatt 20521 gtccagtgcc caccatctca cggatgtgat tcaatgtgaa agtacgtcca tgatgtctgg 20581 cgatagaagc tagacaagca gcaccgcagt cttcttcact gtattgtaaa atatgagcgt 20641 atttcataaa gaaatgcttg tctctacgtc ctctgtgttt gaaatcatta gtgaacactc 20701 aacactaaac agtgatcact cgcttctcaa acattcacgc atcgcgttca aaaaagacaa 20761 agggcgtggt acaattacag agagtaagaa agatggaaac cattgttgcc cacgcctgag 20821 tgttaattga tattctcctg ccgttgatac atacccagta ttaacgacag gagtgtgcaa 20881 aggatgcaat accatctaac ttcttactgt tttaagaagt cgcgtgatcg ctgttacgcc 20941 tctgcaaatt cggggatata gtcaatgttt gcctcagact ctgataccga actcacagag 21001 gagtcagtta tgaatgtgct tgtaatagta aaagaactgg cgaagccgat tggagactca 21061 gtgctagccc ctgatgaagc ttttgtctta gctgtgttct tgtaagcgcc accgacaacg 21121 ctggttgctt gagattctgc aatttcagtg aatagtggat tcatgacttt tttccctcaa 21181 gttaatcgac tgaattgatt taagtgatct cgaaaccgac accgagtgta caaaggatgc 21241 aataccagtc aacttcttac tgttttaaga agtcgcgtga tcgctgttac gcctctgcaa 21301 attcggggat atagtcaatg tttgcctcag actctgatac cgaactcaca gaggagtcag 21361 
ttatgaatgt gcttgtaata gtaaaagaac tggcgaagcc gattggagac tcagtgctag 21421 cccctgatga agcttttgtc ttagctgtgt tcttgtaagc gccaccgaca acgctggttg 21481 cttgagattc tgcaatttca gtgaatagtg gattcatgac ttttttccct taagttaatc 21541 gactgaattg atttgagtta tctggaaacc gatgtctttg tttccgatac tgaaaatgta 21601 gcattaaaaa ttaccgtctg aaaagatcta atactgggtt taacgtaaac aatatgaaca 21661 aatgtactta cttaggtaag tagatagttt gttgataatt attatgaata attaaggagg 21721 tgcatccgtt atattacact tagaaaaatt tttgccatgc gctagcgtta gcctaaactc 21781 aatttaaaaa atattgagtg tgtaatatta ctaagctttg tttgaaagaa atgtgccaaa 21841 gagaattggc tgttagtcaa aaaaatgact tgaaactcat cgtttgctac tcagaacttt 21901 cattaccttg tggtttcgca aacaagcatt tttacgaaaa attaccagta ctggttagct 21961 gcgaaatcat gactagtgat tcaccagacc aactttgatg gatcgtgacg actagaaacc 22021 gggtttcgcc aaagttacct ggtttggtga taacctgtca aaatttgtgt ggtggagtac 22081 tagtactcca ccaaggcaac tttgaggggt aaaggaacga accgcagagg cgctccagag 22141 cgcagagtga agagaaaaga ggatgagtct aaaaatgagt ttacttggca aagttgctgt 22201 ggcgaactac taggttctgt cattgtctca aaagatggta taatagtgcg aaattgaaac 22261 aacaaaacat gaagataaat ctatctcgtc agaaaccgcg tgtctttaca tggcaagcgg 22321 ttttctcact gctcatgttt gtgttcatgt acctgcctat actggtactg gcgttttata 22381 gtttcaacca gtcgccttac agtgcacgtt ggcaaggatt aactttggaa tggtatggca 22441 agttgtttag ggatgagcgg attttatcag ctttacaaaa cagcttgata gtagcctttt 22501 gtgcagtagg gatttctgca atactaggaa cgttgatggc agtggggtta gcgcgttatc 22561 agtttttggg taagagcgtg tatcgtggtg tttcttactt accgttgatt attcctgata 22621 ttgcgatcgc agtcgctacc ctagtttttc tagcagcgtt tgccattccc ttaagcatct 22681 ggacaatcgt ttgtgctcat gtcgtctttt gtcttgccta tgttggtctt gttgtgtctt 22741 ctcaactaac taatttagat ccccatttag aagaagcagc actggatctc ggagcaacac 22801 cagtgcaagc cttcatcaaa gtcttattac ctcagttaat gcctggtatt atagctggtt 22861 gtttactagc ctttgtcctc agcttagacg actttctcat tgccagtttc actgctggta 22921 gtggttccaa caccctacca atggaaattt ttagccgcat cagaacagga gtcaaaccag 22981 atattaatgc tctcagtgtt atcttgattt tagtctctgg gattgtcgct 
tttatagctg 23041 aattaattcg cgcttcagga caaagaaaaa atagtcatta gtcattagtc attagtcatt 23101 ggtcattagt tattagttat tggtcatgat cccgatcaat ttttcaaagc acttgacaat 23161 attaggagtt ttatcataat attcttttat agaatgggaa tccttgcgct catatatact 23221 tatatttaga aacttctaac ttagaaactt ttcaccctac ataggcagat acacagaact 23281 ttttcatctg tctaccttgt gttgtgggtg tgaacagata gtcagaaatg tgtatgtcaa 23341 tatagatttc tgtattttaa cagctcaagt tagagccaac tttggacaag atgcgataca 23401 tggctcacta tgcaaatttg tcaaaatccc aattgctcca atcccttcaa ctctgacagc 23461 aatagatttt gcaccagctg tggacaaaac aactttggca atttcctcag gaaccgctac 23521 cgtgtcttga gattgttagg cgaaggtgga tttagtagaa catatgcagc cgaagacgtt 23581 gatagactag atgctccatg cgtcatcaaa caattcttcc cacaagttca agggacttca 23641 gaacgtacaa aagctgcaca attgtttaag gaagaggcga agcgacttta tgaactggga 23701 gaaaatcact ggcaaattcc cagattactt gcttactttg aacaaggttc tagtctctat 23761 ctggtgcaag aatttattca agggcaaact ctgttacaag aacttcagca acaacctttt 23821 agtgaaaaga aaattcgaga acttttagaa gatttattac ctgttatcca attcattcac 23881 gagcgtaacg ttattcatcg ggatattaaa ccagaaaaca ttatccgccg ccaaactgat 23941 ggcaaactcg ttttgattga ctttggtggt gctaagcagg tgacgcaaac cagtttatca 24001 agacaagcta cagtgctata tacaatcggt tatgcaccta gcgagcaaat ggctggattt 24061 gcttgtcagg cgagtgattt atacgcttta ggagtcactt gtgcgcgtct tctgactcaa 24121 tgtttgccta tacaaaatcc tgatggagga cagattcaag acactcttta caaccccatg 24181 aatggtcaat ggttatggcg agagtattta caggaaaaag ggattactat tagcaacgac 24241 ttgagggaaa ttctggataa gttgctgaaa catttggcaa aagatagata tcaatcggca 24301 acagaagttt tgcaagaatt gaatgcaaca aaattttttg cacaacaaat tgtggcaatt 24361 cctcaatttc aatcaccatt atttgaacaa cagaaagtca caacgccaat ttctgaacta 24421 aaagcttctt cagaactcga ctccttacaa acctttaatt ttgatgtggt gacagtagac 24481 gcacaaggtc atgaaatcag gcgtgagtgt cgaagtgcaa agttttatgc agaaaacttg 24541 ggaagccaag tgacgttgga aatggtagga attcctggcg atacttttat gatgggttca 24601 aaagacaatg atggggatgc ggatgaacgt ccacaacacc cagtgagtat caaacctttt 24661 tttatgggca agtttcccgt cacccaagca 
caatggaaag cagtcgcagc tttacctaaa 24721 gtcaaacaat ctttaaatcc ttatccatca aaatttaaag gtgcaaatcg accagttgaa 24781 aacgtttctt ggcacgaagc aatagaattt tgtactaggc tgtttgcaaa aactggacga 24841 caatatcgct tacccagtga agccgaatgg gagtatgctt gtcgtgctgg aacgacgaca 24901 cccttccatt ttggcgaaac aattacgact gaattagcaa actgtagtga cgatcacact 24961 tgggaacaaa aagccaaaaa ccgaaaagaa acaacacctg taggtagttt tcaggtggca 25021 aatgcctttg gcttgtatga tatgcatggg ttagtttggg agtggtgtgc tgatccttgg 25081 cacaaaaatt acgatggcgc acccacagat ggatctgttt gggaagttgg tggggacgat 25141 aatcgtcgcg tgcttcgcgg tggttcttgg agttttagtt ctgcgctttg tcgtagcgcc 25201 agtcgtagct ggaatgaacc ggatggcggg ctgaggatat gcgggtttcg ggtagtggcg 25261 agttggtctg tttgaaatct taaaacaaaa aaatacgggt gagcatcttg cccaccccac 25321 aaaagttatt ctttgactac acaacgcctt aaactaattt caaatctgga tactccgcga 25381 ggacatcatc agaggtgagg gtatcacctt cagcttgtgg agtccaaaga acttcaattg 25441 caagcaattg ctctccagga atactaccaa attgacgtaa agcttgacgc aaatcgtcag 25501 cgctgttaat gtcgcctagc tggagtttac ccaaggttgc agctagtaaa gtgacgataa 25561 tatattctcc aggtccttca gtgatcagac gggttgggtt gtcgagttca ccaccaccag 25621 gtaaagcatc tttgggagca gttgctttga gctggttgtt cacattagaa agagtttctc 25681 ctgtgaattt gctgcgttct gccaatgaga aacggttgaa ttgagcttca gctgcgttta 25741 aacgcgcttg ttcggtacca ccacccgcgt aaacccaata ttccggatgg cgtagcaaag 25801 ctaggctggc ttcttgcaga atttctgctc taccttctgg ggagttcgta tcagcagttt 25861 cagcaatgtg gttgagttcg gtttgcaaac cacgggcgct acctaacaag ccaacttgta 25921 aacgagttac ggaaacagaa gggttgctgt tgtagccgac ttcactgttg taatccgctt 25981 cactattatt agacgcaacg cgacggaagc ttcgcactaa gaagttggcg atcgcaatga 26041 aaattaaaat gctaaacaaa ccgccaaatc ctcccccaat accccagaac ggaagcagga 26101 agggaaagcc aaaaccaccg ccaggatagg gagcataata acccccgcca taacctccac 26161 caggaggtgc gtaggtgcga ggtgaggagt agggacgact tgatgagggc attctgaagg 26221 aaccaccgcc gattctaccc ccactggcgg ctagcgctcc atcagtgtga ctgaatgcca 26281 atgtaaaaac aaggcatagg aggagcagag attttaaaag gggtttggta gcttgttgta 26341 gttttttacg 
catgacgaca cagtggcttg aaaaacagta tttggttatc tttatctccc 26401 catgctggtt ggagttttgt gcttgtttaa taacgccagc ccgtagcctt aaacagtcgt 26461 tggcacttgc tctccagtta cagactacaa ttccaattta ccgtcaactg tattgtacgg 26521 gcagcatgct ttgggcggat ttctggagta gataaccgta catcacaagc cattgaaact 26581 tctaaattca atcttcacaa accgagttcc ctcttcgttg tgccagaatt ggtatctgca 26641 c // LOCUS NODE_1128_length_26543_cov_5.01091126543 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 26543) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 26543) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..26543 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..384 /locus_tag="DP116_10015" CDS <1..384 /locus_tag="DP116_10015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318239.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_10015" /translation="AIAIANPYAIANDEFINWATELEKIKIFHSVDFPVLIAQLRKSK STIPNDKQPQEVRRAFVEQLLQTWLNAFNLTPEMVNLSKEELQALDNYFYANYFIIQC KQAAVRVSPKTWEAIEEGMLLVPNN" gene 656..850 /locus_tag="DP116_10020" CDS 656..850 /locus_tag="DP116_10020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007354027.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_10020" /translation="MSSLQQAKEKKTQTFAAILYWEEDVYVAQCPEVGTASQGETIEE AVANLKEATELYLEEFLDNL" gene 955..2409 /locus_tag="DP116_10025" CDS 955..2409 /locus_tag="DP116_10025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741730.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10025" /translation="MHTIYKKSTKKAWAQALAQPAKEFPPTQLQILSGRIPEGLHGTL YRNGPGRLERGGINVGHWFDGDGAILAVNFNSPYEAGKKGGATAVYRYVQTDGYKEEA AAGQLLYGNYGMTAPGPIWNKWLKALKNCANTSVLALPDKLLALWEGDNPHALDLQTL HTLGKDDLGALKNGLAYSAHPKIDPKSGSIFNFGISVGLNGILNIYKSDATGTIQQKT SFELDGFPLIHDFVLAGQYLVFFVPPLRLNVLPVLTGFSCYGDSFEWKPQLGTQVLVF DCETLSLVSRGETEPWFQWHFGNAYVDDSGLIVVDVVRFADFQTNEYLREVATGETHT PAVSTLWRIHLEPSTNIVKGIEEIIDRHCEFPVVPPQESGQYTNQTYLTVHRQGVDAS KEIFGAIACFNHKTNTLTIADCGENRYPSEPIYAQNPQNSEQGWIITVVYDGNSDCSE VWVYDAARLDDSPVCRLALPSVIPHSFHGTWKPY" gene complement(2668..6828) /locus_tag="DP116_10030" CDS complement(2668..6828) /locus_tag="DP116_10030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="non-ribosomal peptide synthetase" /protein_id="PRJNA477356:DP116_10030" /translation="MDNINQKIAALSPAKRALLELKLKNKTTEVKTQQTIPKRTNEST ALLSFAQQRLWFLEQLEPNSPLYHIAEVLRLQGDLNVDVLQQSLDAIVAHHEVLRTNF IAQDGDPIQVIGEPKSVELKVIDLKDCPLSERSTQVQHLLQNESQRPFDLTSDFMLRA CLLQLEPQEHILVLVMHHIASDAWSRSILFEQLRTLYQAFKEGLPNPLPKLPIQYADY AVWQRQWLSGEVLENQLKYWKQQLAGATPVLELPTDRQRPPVQTYRGAKHFVVLPQSL SQALSTLSRQEGVTLFMTFLAAFQILLYRYSGQEDILVGSPIAGRNHPEIERLIGFFV NTLVLRTDMSGNPSFRELLQRVRAMAMSAYVNQDLPFEKLVEELQPERSLSYNPLFQV MFAFQNTPQQTFELSGLTITSTYVDRLTSKFDLTLFIVETEQGVEEIWEYNTDLFDAS TISQMSGHFQTLLEGIVANPQQHISKLPLLTAAQQHELLFEWNNTQKDYPHKCIHELF EQQVNLTPDAVAVVFENQQLTYQELNNRANQLAHYLRDLGVGPDVPVGICVQRSLEMV VGVLGILKAGGAYVPLDPAYPQDRLSFMLSDSQVAVLLTCENLMRVLPKHEGHVVCLD TDWQAIAQASEENLVNGVEPENLAYVIYTSGSTGMPKGVAMKQLPLSNLISWQLENST VSTGARTVQFSPISFDVSFQEIFSTWCSGGTLILITDDLRRDATALLDFLNHKAINRL FLPFVALQQLAEVAVGSESLPIHLQEIITAGEQLQITPALTNLFSKLPGCTLHNHYGP SESHVVTAFTLADSVQSWAALPPIGRPIANTSIYILDSNLQPVPIGVPGELYIGGVAL ARGYLNRPELTAEKFIRDPFSQKDKAYMYKTGDLARYLKHGNIEYFGRCDNQVKIRGF RIELGELEAVLSQHPAVHQAVVIVRQDIPGDKRLVAYVVPDQDSVPTTGELRNFLKEK LPEYMVPSAFVLLDVLPLTPSGKVNRRSLPAPDITKLEPEASYVAPRNDTEHQLTEIW AEILGIQPVGVRDNFFDLGGHSLLAVKLFAQIEKKFAKKLPLATLFQSGTVEALAQLL SPKEKTVDNQLLTGAHEHTSEASWSCLVPIQTKGSKPPLFCIHPLGGETLCYRNLSLH LGQAQPLYGVQPQGLDGKLSLLTRIEEMAALYIKEIQTIQPNGPYFLGGYSMGGIIAY EIAHQLNRQGQKVALLAMFDSGIPGAATRLPLISRIFMHINNLLQRGPSYLRKKLIGW IEWSTYHLRAKYTHFLGIKEPLPQDDTHWDIIDANVLAWNEYTYQPYSGQITLLRVDE NSDDSQDDAVGVKSEPLLGWDKLVTGGIDVHYIPGSHYTLFEEPNVRVLAEKLRECLE KTVAVSHT" gene complement(6943..12156) /locus_tag="DP116_10035" CDS complement(6943..12156) /locus_tag="DP116_10035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314620.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="non-ribosomal peptide synthetase" /protein_id="PRJNA477356:DP116_10035" /translation="MSKLIIDQDSYLQPATGLNDLTEAEDYLIQTDYPRNLCIHQMFE AQVQQTPNAVAVVFENQQLTYRQLNQRANQLAHYLRTLGVAPEVMVGICMERSLDMVV GLLGILKAGGAYVPIDPKYPQERLHFMLSDTQVSVLLTTKQLANELFEQESRVVCLDT DWESIAKESQENPLSELTQEHLAYVMYTSGSTGKPKGVQITHANVAHYIPAVSQVLQV QPEDVYLHVASFSFSSSVRQLMVPLYRGATSIIATGEQTKNPISLFELIQKQGVTICD GVPSIWRYGLQGLETLDKRHTEALQKSKLRTIVLSGDLPPVQLYKQLRDLFQDRVSIF NVYGQTETIGNCAYLVPKDFDPELGYIYIPVGYPYAHNQTYILDEHLQAVAPGEVGEL YIGGACLARGYLNHPKLNAEKFISNPFSQDRKQRLFQTGDLARYWSDGSIELLGRTDF QVKIRGMRVELGEIESILVQHPTVKEAVVIATEDVPGEKRLVAYVVPNLSLSEIIQNV FIKELRGLSYQKLADYMVPSAFVLLDSLPLTPNGKIDRLALPVSKVVSPELEVAYVKP QTETEEIIATVWQEVLQVEKVGIHDNFFELGGHSLLATQIISRCRQAFGVEISLQSLF ETPTIAELASAITTSQNQGTQEHQIISRQTNRQSIPLSFAQARLWFLAQLEPDSAAYN VVDAMQLQGNLNVDVLQQSLDAIATHHEVLRTNFIVEDGNPVQVIREPQSVELKLISL MDCPETERTTVVQKLLQQEAQRPFNLSSDLMLRACLLEISPQEHILQLTIHHIATDGW SMSILFEQLTTLYQAFLEGKPNPLPQLPIQYADYAVWQRQWLSGEVVENQLNYWKQQL AGAIPVLELPTDKPRPPVQTRRGAKQSFVLPKNLSASLSALSRQEGVTLFMTLLAAFQ TLLYRYSGQQDILVGSPTAGRNREEIEGLIGFFVNTLVLRTDLSGNPSFRELLQRVRS HAMSAYANQDLPFDKLVEELQPERSLSYHPLFQVMFVLQNVPTQTLKLPGLSISTIEV DNFASQFDITLSIEETEQGLRGLWEYSTDLFDADTITRMSGHFQTLLEEVTANPQQHV NELPLLSATERQQLLEWNDTQAEYQEQCIHELFEVQVEKTPHAVAVMFEGEQLTYQQL NTRANQLAHYLKTLGVGADVLVGICMERSLEMVVGLLGILKAGGAYVPIDPAYPKERL TFILEDTQTPVMLTQEKLVNSLQNLGSQVICLDSDWELIANNSQENPVCEATVDDLMY VIYTSGSTGKPKGVMVPHRGISNQLHWRQATFQLTEQDKVLQTISLSFDPSVWQIFWP LLFGGQLIMARPDGHRDPAYMVKMIIEQQITVAALVPSIIRVLLEEKGIENCTSLKHV TSGGEALAVELIERFVERLKLENVLINCYGPTEASIDTTFWTCQRGTDYTIAPIGRAI ANVQVYILDDNLQPVPVGQSGELYIGGTGLARGYLNRPELTAQKFIRNPFSSEPGARL YKTGDLARYLSNGNIEFLNRIDYQVKIRGFRIELGEIEAILGEYPGVQQTLVSVREDV PGDKRLVAYIVAKQVPPSASELRDFLQGKLPEYMVPKAFVFLDVMPLNPNGKVDRRAL KAPEPADFSDANSFVAPRTPTEEVLATIWTQVLSLDQVGIYDNFFELGGHSLLATQVI SRVCQTLGSEIPLQLLFETRTIAGLAQAIVQSQADKRDDDEISRLLNELEELSDEQAE SLLAQLMQQPD" gene complement(12264..12962) /locus_tag="DP116_10040" CDS complement(12264..12962) 
/locus_tag="DP116_10040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314621.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10040" /translation="MQLYTPIELNQLQQDLPYLINIMQWVQSFLAKPHPDLGRSGPVC PFVPYAIKSNTIRFAVIHAKNMEPQQLEEMVLCYRDTFLELEPRDRESAINKTILLIF PELDREETSKLVDGVQQKLKPLFVDLGLMIGEFHKHNESPGLHNENFRPLRSPIPMLA IRFMVESDLPFLINADDIDSRIKFLEAYIQRFENEARDQKNLNKACQALALAKEQIQQ EKVLHLNCEYAAAK" gene 14429..14713 /locus_tag="DP116_10045" CDS 14429..14713 /locus_tag="DP116_10045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209672.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HU family DNA-binding protein" /protein_id="PRJNA477356:DP116_10045" /translation="MNKGELVDAVAEKASVTKKQVDAVLTAALETIIEAVSSGDKVTL VGFGSFESRERKAREGRNPKTNEKMEIPATKVPAFSAGKLFREKVAPPKS" gene 14740..15885 /locus_tag="DP116_10050" CDS 14740..15885 /locus_tag="DP116_10050" /EC_number="4.1.1.81" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="threonine-phosphate decarboxylase" /protein_id="PRJNA477356:DP116_10050" /translation="MRQPAHGGNLAWAAALAGCPPSAILDFSASISPLGPPKSAIAAI ESQLDHLRHYPDPDYRELRLALGHFHQLPPEWILPGNGSAELLSLAGRELAQLAATAL ITPAFGDYYRALAAYDAMVLEFPLSLVRQSVAEVSSLSGTAEPVRVMSHLSFVMSDLS LIFDKGQQTKDKGLLLNNPHNPTGVLFSREAILPYLKEFALVVVDEAFMDFLPPGQEQ SLIQVVQEYPNLVILRSLTKFYSLPGLRLGYAIAHPDRLRRWQSWRDPWPVNTLAAAA AVAVVQDKEFQEQTWAWLPPARNQLFAGLASIPGLQPLESAANFLLVESQQSSSQLQQ NLLKHHQIFIRDCLSFPELGDRYFRVAVRSWSDNQRLLEALSLVVSH" gene 16017..17498 /locus_tag="DP116_10055" CDS 16017..17498 /locus_tag="DP116_10055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315589.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="mercuric reductase" /protein_id="PRJNA477356:DP116_10055" /translation="MTIDYDVVIIGGSLAGRYAALTASQLKAKVALVEPITNGALTIV NAPLEFIYHHALSHTSNLSKQLGDAALLGLHTSCADTLEKCQISVDTPQAMLYAHSIV SNLQEQNSLRLLAAQGVDVIVGNGQFQSSPHLSFAVDTRLLHARTYLLATGSSPAIPE IEGLQRTGFLTIPQVWQSLSSSTPPKQWVILGGVPQSIQLAQTLARFGYDVTLVMERP NLLPNIDSEMAQLLCSLLEAEGVRVFIKTSITQVRKIEDKKWLQVGDKAIETDEIVVA TAQQPNIQSLNLAAVDVKWNQRRLLVNEKLQTTNPRIWACGDVIGGFEFANIANYEAK IALQNALFFPRFKVNYRHIPWAVFTNPMFAQVGLTEAQAKHQYSPDEVIVVQQFFKTL FAAQIQDETTGICKFIVLRNGEILGASIFGAQATELINVIALAIAQTIKLHRLADLAP LCPSFSEIFEQIVETWKQQKLNSNIAWQDCLESFFQFRRNWNF" gene 17539..18168 /locus_tag="DP116_10060" CDS 17539..18168 /locus_tag="DP116_10060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740851.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LysE family translocator" /protein_id="PRJNA477356:DP116_10060" /translation="MQFLGDWLTIFTSGCLVIMSPGPNFVLTLRNSLAHSKQAGIYTA LGVTAGDLIHVICWLIGIGVIISKSILLFNLLKWLGAAYLIYLGIKSLQAKHQDNFVE EQTSSELKSLTAFKNGFLTCLLNPKVTLFLLALFTQIIRPDTPLALQIIYGLTIVGIE FTWLAFVATVVCVAAIKRRFLSISHWFERIMGAVLIFLALRLALAKAHD" gene 18493..19266 /locus_tag="DP116_10065" CDS 18493..19266 /locus_tag="DP116_10065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310344.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10065" /translation="MSTNKRSSTCLKKENRGLAINRWKQDLKNDLIAGLLVVIPLATT IWLTITVATWVVNFLTQIPKQLNPFQGMDPILVNLLNLLVGLMVPLLSILLIGLMARN IAGQWLLDVGERLLQAIPLAGQVYKALKQLLETLLKDSNGKFRRVVLVEYPRRGMWAI AFVTGMISTDIQTQMSRPVLSVFIPTTPNPATGWYAVVPEDEVVNLSLSIEDAFKIIV SGGIVAPNTSPTPLVIPKERKLEMPPLEAKRQILPLDET" gene 19409..20059 /gene="nusB" /locus_tag="DP116_10070" CDS 19409..20059 /gene="nusB" /locus_tag="DP116_10070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315592.1" /note="Regulates rRNA biosynthesis by transcriptional antitermination; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="N utilization substance protein B" /protein_id="PRJNA477356:DP116_10070" /translation="MQERKPRQISRELALLSLSQLPLNSKKLEKIAPEQLVPKLVLAA VRTLRSEVQDTLDNAAGELQRSNDRLLSSQTRASDLNTARTMLKEAVVYTQTAINKLA AAIEFPELIQLANQDKEVREYAKEIIITVHENRNTIDEDISTALVDWQVTRLAHIDRD ILRIAVAEMKYLKVPDSIAINEAVELTKRYSEEESHRFINGVLRRVTEQRKIASTL" gene 20327..22030 /locus_tag="DP116_10075" CDS 20327..22030 /locus_tag="DP116_10075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="signal recognition particle-docking protein FtsY" /protein_id="PRJNA477356:DP116_10075" /translation="MVFNWFRRQHNDPSATPSQQKQEETPAAKQPQPQSAETSTAETA PEVATDLLAYAKAAYKNIQQRQQTEPAETPATEVTADVVASPAQTETAQPQPAEEIQS EESADVTATETTPEEPEGTEELSTAPAVAITEEPDAESVTTESVAPPELTEPLTTESV ALPEPSEPPTTQEVTQPTAPATLSFLERAAAERQAKQERLIATAIEVTQPKVVQPAAQ TSTAPEVIEEMTGLDFDEGFLWSTEVLAAQGRRPEDISFEEITWLKKLRQSLDKTRRN IVNQLKSIVGQGPLNQAAVTEIEALLLQADVGVEATDYIISTLQNKLRQEVLPAEEAI AYLKQILRDMLDAPVKKSNKPIFVPEKETLNVWLITGVNGAGKTTTIGKIAHLAQKSG YKCLIGAADTFRAAAVEQVRIWGERSGVQVIFNPAKNADPAAVVFDAISAATARETEL LLVDTAGRLQNKKNLMDELSKIRRIIDKKAPNAKIESLLVLDATLGQNGLRQAEVFSQ AAQLSGVVLTKLDSTAKGGVALAVVQQLGLPIRFIGAGEGIEDLRPFSSYEFVEALLN G" gene complement(22051..22197) /locus_tag="DP116_10080" CDS complement(22051..22197) /locus_tag="DP116_10080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015196399.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_10080" /translation="MKKLTIRCSDEEYKILVEYCKETDRTQNDVLRELIRKLKKSRPR RAGL" gene 22236..23384 /locus_tag="DP116_10085" CDS 22236..23384 /locus_tag="DP116_10085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999645.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_10085" /translation="MKTLKFKLYEHKRNRYLKRTINAAGVIYNHCIALHKRYYRIWGK HLNCAKLQSHIAKLRKRNPLWQSVGSQAVQDICQRIEKAYQLFFKHNKKGVRPPGFKK VKKYKSFTLKQAGYKFLGGNRVKIGSRVYQFWKSREIEGTVKTLTIKRTPLGELFMVV VVDNCVACKVNSTAGKIAGFDFGLKTFLTCSDGSRIDSPQFFKQSLNAIRKASKQHSK KLKGVSSGDATRTSNRERARLNLVRSYEDICNRRRDWFWKLAHELTDRFDVLCFETLN LKGMQRLWGRKISDLAFGEFLQILEWVAKKKHKQLVFVDQWYPSSKTCSSCGHILESL DLSVRVWRCPSCQSVNGRDDNAAKNIQMVGASTIGLGDVRRALPAVAV" gene 24027..25430 /locus_tag="DP116_10090" CDS 24027..25430 /locus_tag="DP116_10090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315595.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="guanylate cyclase" /protein_id="PRJNA477356:DP116_10090" /translation="MVTVPQPPSQPTDSNSRTPTEVTPVVALKELVARLHREQNKIQD LLSSLGFALRSFNNLNQFLELIPLMATRVTDADGSALFVYKPNGQVRLEQLHWQDSHQ RKNIRKALETATSQIVCVPNNTQPLAIMSGILDDYMHRSLGPGIQIFGTAILVKHTER GWLYVLSRDPEYTWTETRQKLVRLVADQTAVAIENDELAVELRKKERLDQELEIGAEI QRRLLPRQCPTIPGVTIAARCKPANRVGGDYYDFIPTNHNQIQLNNKESLDADGRWGL VIGDVMGKGVPAGLIMTMMRGMLRGEVLHGHSPQRILQNLNQVMYADLENSHRFITLF YSEYDPQTRMLSYSNAAHNPPIWWHAATKTVTRLDTFGMLIGLDANSQYEDAQVKLEL GDIILYYTDGLTDAAAASGDRYDEENLITIFNYACRICNGPQEILDYLFDQVQEFIGA DKQNTDDMTLVVLQVNS" gene complement(25489..25974) /locus_tag="DP116_10095" CDS complement(25489..25974) /locus_tag="DP116_10095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152606.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NUDIX hydrolase" /protein_id="PRJNA477356:DP116_10095" /translation="MHKPGEIRVIALGLIRDTQHAKRGVSGACPQDIGIRIFVSEGYD PVKQQTFYRAMGGGVDFGETSLEALKREFQEEIQAELTNIRYLGCLENIFTFNGQPGH EIIQLFESDFVDAKFYEIEKLVFSEGERQKTALWVDINRFQSGELRLVPEQFLDYLSS S" gene complement(25983..26435) /locus_tag="DP116_10100" CDS complement(25983..26435) /locus_tag="DP116_10100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872413.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3531 domain-containing protein" /protein_id="PRJNA477356:DP116_10100" /translation="MQVQFREINPFDLWIWLEFSTIPSAQEKQYVEEAFNSWFYLGKL GAFNAENLQVQETGLDLSYMNYDSQGYDKSLLALMHNMGEFEYEGTWARCWFDLGTSD AIALDILINALKQLGEEYVTIEQLYIGGENEDWPIEESESRPSFIYDN" BASE COUNT 7643 a 5788 c 5670 g 7442 t ORIGIN 1 gccatagcca tagccaaccc ctacgccata gccaacgacg agtttattaa ctgggctaca 61 gaattggaaa aaataaaaat cttccattcc gtagactttc cagtgctgat tgcccaactt 121 agaaaatcca aatctacaat tcctaatgat aaacaacctc aggaagtgcg tcgggcattt 181 gttgagcagc ttctacaaac ttggctcaac gcctttaatc ttaccccaga aatggtcaat 241 ttatccaagg aagaattgca ggcactggat aattattttt acgccaatta cttcatcatt 301 cagtgcaaac aagcagcagt acgggtgtca cccaaaacct gggaagcgat tgaagaggga 361 atgttgttgg ttccaaacaa ttgaaatctt atgtgaaagt taaagtcaaa atagggagca 421 tcccaaatgt gtaaaaacat attttgtcat tgcgttagcg cagcatgccc ggagggcata 481 cgtaatgaaa tggagtgaag caatcgcaac atctagactt tgcgttcgcg aagcgtgtcc 541 gaaggactta tgcttcattc cgcttcgctg cattcgcaat gacaatctat tgtcttggta 601 catttgcaaa attgggatac tccctcaaaa gaggaaatgg aaagagaaat attctatgag 661 ttcattacaa caagctaagg aaaagaaaac tcaaactttt gcagcaatat tgtattggga 721 agaagatgtt tatgtagcac aatgtccaga ggtgggaact gctagccaag gagagacaat 781 agaagaagcc gttgccaatt tgaaggaagc aacagaactt tatttagaag agtttctaga 841 caatctgtag agtgattcaa cacacactcg attctggtgt accttctggc agaggaattg 901 taacaagaat ctaaaataat ctagttgttt tattgataca gttccaaacg 
cctgatgcat 961 acaatctata aaaagtcaac aaaaaaagcc tgggcacaag cccttgccca acccgcaaaa 1021 gaattccctc ccacccaact gcaaatcctt tctggcagaa tacctgaagg cttacacggc 1081 acactttacc gcaatggacc tggaaggtta gaacgcggcg gtataaatgt gggacactgg 1141 tttgatggag atggggcaat tcttgctgta aatttcaact ctccttatga agcagggaaa 1201 aaggggggtg caactgcagt ctatcgctac gtacaaactg atggttataa agaagaagca 1261 gcagcaggac aactgcttta cggtaattat ggaatgactg caccaggacc aatttggaat 1321 aaatggttga aagcactcaa gaattgtgcg aatacttctg tgctggcgtt acctgataaa 1381 ttactagcac tgtgggaagg cgacaaccct catgccctgg acttgcaaac gctccacact 1441 ttgggcaaag atgatttagg agcattgaag aatggattgg cttattctgc ccatcccaaa 1501 attgacccga aatctggctc gatttttaac tttggtatct ccgtaggatt aaatggaata 1561 ctgaatatat ataaaagcga tgctacgggt acaattcagc aaaaaacatc gtttgaacta 1621 gatggtttcc cattaataca tgattttgtt ctagctgggc aatatttagt ctttttcgtt 1681 cctccgttgc ggttaaatgt cctaccagtg ttaactggat ttagctgcta tggtgattct 1741 tttgagtgga agccgcaact gggtactcag gttctggtat ttgactgtga aactctctct 1801 ctggtaagtc gtggtgaaac tgaaccttgg tttcagtggc attttggtaa tgcgtatgta 1861 gatgatagtg gattgatagt tgtggatgtc gtgcgctttg cagactttca aaccaacgag 1921 taccttaggg aagtcgccac aggtgaaact catactcctg ctgtaagtac tttgtggcga 1981 atacatcttg aaccgagtac caatatagtc aagggaattg aggaaattat agatcgccac 2041 tgtgaatttc cagttgtacc accgcaagag agcggacaat atactaacca aacttatctt 2101 acggtgcatc ggcaaggggt ggatgcgagc aaagaaatat ttggggcgat cgcctgtttc 2161 aaccacaaaa ccaacaccct cacaattgct gactgtggcg aaaaccgcta tccaagtgaa 2221 cccatttatg cccaaaatcc ccaaaattct gaacaaggat ggatcatcac agttgtgtac 2281 gatggcaatt ctgattgtag tgaagtttgg gtatatgatg cggctaggtt agatgattca 2341 cctgtgtgca gattggcatt acccagcgtt attccccaca gcttccacgg cacttggaaa 2401 ccttactgac actgagttga tgcctgttaa gagttccctg ttccctgata tatttaacca 2461 gcgcaagaaa agcctgataa actcgtttcc agcctcaggc tggaaatgcc agcgcatagg 2521 ctctgcctca acctaaatac cggaggcgga gcctcctagt agcgcattcc ctggctcagc 2581 cagggatcga gttgcttagg gggtgtaact cctcatttat tcgcgctcca acatcattaa 
2641 tcactgagct tttgttgtca atctgtgcta tgtatggcta acagctaccg tcttctccaa 2701 gcattccctc aatttctctg ctagtacccg tacattgggt tcctcgaaca gcgtatagtg 2761 agagccagga atgtaatgga catctattcc tccagtgact agcttatccc aacctaggag 2821 cggctcagat ttaacgccaa cagcatcgtc ttgtgaatca tcgctgttct cgtcaacccg 2881 taaaagagtg atttgccctg agtaaggttg gtaagtatat tcgttccaag ccaacacatt 2941 agcgtctatg atatcccaat gcgtgtcatc ctgaggtaga ggctccttta ttcctaaaaa 3001 atgcgtatat tttgctcgga gatggtaagt gctccactcg atccagccaa taagcttttt 3061 ccttaggtag gaaggacctc gttgtaacaa attatttatg tgcatgaaaa ttcgcgaaat 3121 caatggcaat cgtgtggcag cacctggaat accgctatca aacatagcca gaagagctac 3181 tttttgacct tgcctgttaa gttgatgagc tatttcgtat gcaatgatac ctcccatcga 3241 gtacccaccg agaaaataag gaccattggg ttgaatggtc tgaatttctt taatgtaaag 3301 cgctgccatt tcttcaatcc gggttaagag agacagtttt ccatctaatc cttgtggttg 3361 taccccataa agtggttgag cctgtcccaa atgcagtgac aaattgcggt aacacagagt 3421 ttctccacca agcgggtgga tacagaataa aggcggcttg gaacctttgg tttgaattgg 3481 aaccaagcat gaccaactag cttctgatgt gtgctcatgt gcccctgtca aaagctgatt 3541 atcaactgtt ttttcttttg gagaaagcaa ttgggcaagg gcttctactg taccggattg 3601 aaagagggta gccagaggaa gttttttggc gaatttcttc tcaatttgag cgaataactt 3661 gactgcaagc aaagaatgtc cgccgagatc aaagaagttg tccctcacac caacgggttg 3721 gatgcccaaa atttctgccc aaatttcggt tagctgatgt tctgtatcat tacgaggggc 3781 aacataactt gcttctggct ctagttttgt gatgtcaggc gctggtagag agcggcggtt 3841 tactttgcca ctaggagtga ggggcagcac atccaacaat acaaaagccg aaggaaccat 3901 gtactctggt agcttttcct taaggaaatt acgtaattca cctgttgtag gtactgagtc 3961 ttgatccggc acaacatagg ctaccaagcg tttatcaccg ggaatgtctt gacggacaat 4021 aacgacagct tggtgtacag cagggtgttg gctcaagaca gcttctagct ctcccaattc 4081 aatacgaaaa cctcgtatct ttacttggtt gtcgcaacgc ccaaagtact caatattacc 4141 atgcttcaag taacgagcta agtcccctgt cttgtacata tatgccttgt ctttctggct 4201 gaaggggtcg cggataaatt tttccgcagt caactccgga cgattgagat aacctcgcgc 4261 tagcgcaaca ccaccaatat acagttctcc aggtacaccg ataggtactg gttgtagatt 4321 
tgagtcgaga atatagatag aagtgttggc aatagggcga ccaattggtg gtagagcagc 4381 ccaactctgt acggaatcag ccaaggtaaa ggcggtgact acatggcttt ctgatggtcc 4441 ataatgattg tgtaaagtac aaccgggtaa cttgctaaac aagttagtca aggcaggtgt 4501 aatctgcaac tgttcgcctg cggtaatgat ttcttgcaaa tggattggta gtgattctga 4561 accaacagca acttcagcaa gctgctgtaa agcaacaaag ggtaagaata gtctatttat 4621 tgctttatga ttgagaaaat ctaataaagc cgttgcatcg cgtcgcaagt cgtctgtgat 4681 taatatcaat gttccaccgg aacaccaggt agaaaatatt tcttggaacg agacatcaaa 4741 gctgatagga gaaaattgca cagttcttgc tccagtagaa actgtgctat tttcaagttg 4801 ccacgagatg agattggaaa ggggaagttg tttcattgct accccttttg gcattcctgt 4861 ggaacctgaa gtataaatga cgtaagctag gttttctggc tcaaccccat tgactaaatt 4921 ctcttcgctt gcttgagcga tcgcctgcca atctgtatct aagcaaacta cgtgtccctc 4981 gtgtttgggt aatacgcgca ttaaattctc gcaggttaac agcaccgcta cctgtgaatc 5041 tgacagcata aaactcagtc gatcttgcgg atacgctgga tctaaaggta cataagcccc 5101 acctgctttg agaattccca acactcctac gaccatttct aaggaacgtt gcacacagat 5161 gccaactggt acatctggtc ccactcctaa atccctgaga tagtgcgcca actggtttgc 5221 acgattattc aactcctggt aggttaattg ctggttttca aacaccactg ctactgcatc 5281 cggggtcaga tttacctgtt gttcaaacaa ctcatggatg catttgtgtg gatagtcttt 5341 ttgggtgttg ttccattcaa acagtaactc gtgctgttga gctgccgtaa gcagtggcaa 5401 tttgctaatg tgctgttgtg ggttagcaac aattccctca agtaatgtct ggaaatgtcc 5461 actcatttgg ctaatggtgg aagcatcaaa cagatccgtg ttgtattccc atatttcctc 5521 tactccctgt tccgtttcca caataaacaa cgtcagatca aatttcgatg ttaatctatc 5581 tacatatgtg gaagtaatgg tgagccctga caactcaaat gtttgctgag gtgtattttg 5641 aaaagcaaac ataacttgga acaagggatt gtaactcaag gagcgctctg gttgcagttc 5701 ttcaaccagc ttttcaaacg gcaaatcttg gttaacgtat gcggacattg ccatcgctcg 5761 tactctttgc aacagttccc ggaagctggg gttgccagac atatcggtac gaagcactaa 5821 ggtattgaca aaaaatccaa ttaaccgttc tatttctgga tgatttcgcc ctgcgatcgg 5881 agatccaact agaatatctt cttgtccgct ataacggtat agtaagattt ggaatgccgc 5941 caaaaacgtc atgaacaacg tcacaccctc ttgccgtgac agtgttgata gcgcttggga 6001 taaactctgg 
ggtagtacaa cgaaatgctt tgcacctcgg taagtttgga ctggtggtcg 6061 ttgtctatct gtgggtaatt ccagtaccgg cgttgcaccc gctaactgct gtttccaata 6121 cttgagctga ttttcaagca cttcacctga aagccattga cgttgccata ctgcatagtc 6181 agcatactga atgggtagtt ttggcaatgg attgggcaaa ccttctttga acgcttgata 6241 cagagttctc aactgctcaa ataaaatact tcttgaccac gcatcagagg cgatgtggtg 6301 catgactaac actaggatat gctcttgcgg ctcaagctgc agcaagcaag cacgaagcat 6361 gaaatctgat gttaaatcga aaggtcgttg cgactcattt tgtaacagat gttgtacttg 6421 agtggaacgt tcgctgagcg gacaatcctt gaggtcaatc accttgagtt ccacagactt 6481 aggctcacca atgacctgta ttgggtcacc atcttgtgca ataaagttgg ttcgtaatac 6541 ctcgtgatga gcaacaattg catctaatga ctgttgcaaa acatctacgt tgagatctcc 6601 ctgtaacctt aaaacctctg caatatgata cagtgggcta tttggctcta gttgctccaa 6661 aaaccacagt cgctgttgtg cgaaggataa aagagcagtt gattcattcg ttctttttgg 6721 aatcgtctgc tgtgttttta cctcagtagt tttgttctta agttttaact ctagtaacgc 6781 tcttttggct ggtgaaagag cagcaatttt ttggttgata ttatccatac ttactaatcc 6841 ttatttaaat tgcagataca gatgaaaact ttttgctaca tatttgattg gatgagggtg 6901 ttgggagcgg tgacaatttt tcactagctc ccctacaccc tgttagtcag gttgctgcat 6961 aagttgcgct aaaagacttt ccgcttgctc atctgatagc tcttccaact catttaacaa 7021 gcgactgatt tcgtcatcat ctctcttatc cgcctggctt tgaacaattg cttgggctaa 7081 acctgcgatt gttcgtgtct caaaaagcaa ctgcaacgga atttctgaac ctaaagtttg 7141 gcatacccga gaaatcactt gcgttgctag tagtgaatgt cctcctaatt caaagaagtt 7201 atcgtaaatg cctacttgat ccaaactcaa aacttgggtc caaatagtgg ctaatacctc 7261 ttcggtgggg gtacgaggtg ctacaaagct gttggcatcg ctgaaatctg ctggctctgg 7321 agctttcaat gcgcggcgat ccaccttacc attggggttt aatggcatca catctaagaa 7381 gacaaaggct tttggcacca tgtactcagg taatttacct tgcaaaaagt cgcgcaattc 7441 acttgcgctt gggggaactt gtttcgccac gatatatgcc actaggcgtt tgtcaccagg 7501 aacatcttct cttactgaaa ctaaagtctg ttgcactcca ggatattcac ctagaatagc 7561 ttcaatttct cccaactcaa tgcggaagcc acgaattttc acctggtagt caatacggtt 7621 gaggaattct atgttcccgt tgcttaagta acgtgccaaa tccccagttt tgtaaagacg 7681 tgcacctggt tcactactaa 
aggggttgcg aataaacttt tgtgcagtca gttcgggacg 7741 gttaagatag cctcgcgcta aaccagttcc accaatgtat agttcacccg attgaccaac 7801 agggactggc tgtaaattat catctaagat ataaacttgt acgttggcga tcgcccgacc 7861 aataggagca atcgtataat cagtcccccg ctgacaagtc caaaacgtgg tatctataga 7921 agcttctgtt ggaccatagc aattaatcag aacattttcc aattttaaac gctcaacgaa 7981 gcgctctatg agttcgactg ctaaagcttc accgccactc gtgacatgct tgaggctggt 8041 acaattctcg attccctttt cctcaagtaa aacgcggatg atggatggta ccaaagcagc 8101 tacagtaatc tgctgttcga taatcatctt aaccatgtaa gcaggatctc gatgtccatc 8161 ggggcgagcc ataattaatt gtcccccaaa taacaacggc caaaatatct gccatactga 8221 ggggtcaaag ctcaaagaaa tcgtctggag aactttgtct tgttctgtta attgaaatgt 8281 tgcttgtcgc cagtgaagct gattggaaat tccacggtgc ggaaccatca cacccttggg 8341 cttacctgta gaaccagagg tatagatgac gtacattaaa tcatcaactg ttgcctcaca 8401 cacaggattt tcttggctgt tgttggcaat cagttcccaa tcagaatcca aacatatcac 8461 ttgtgaccca agattttgta agctgttgac caatttctct tgggtgagca tcactggtgt 8521 ttgggtatct tctaaaatga aggttaaccg ctcttttgga tacgctgggt caattggcac 8581 atacgctcca cctgccttga gaattcccag taatccgact accatctcta aggaacgttc 8641 catacaaatg ccaactaaga catcagcacc aacacccaaa gtttttaaat agtgcgccaa 8701 ttggttagca cgagtgttca actgttggta agtcagttgc tcaccttcaa acataaccgc 8761 cacagcgtga ggcgtttttt ctacctgcac ctcaaacaac tcatgaatac actgctcttg 8821 atattctgct tgagtatcgt tccactccaa taactgctgt ctctcagttg cacttagcag 8881 tggtaattcg ttcacgtgct gctgtgggtt agccgtaact tcctcaagta gtgtctgaaa 8941 atgcccactc atccgggtga tggtatcagc gtcaaacaag tcggtgctgt attcccatag 9001 tcctcgcagt ccttgttctg tttcctcaat tgacaaggtt atatcaaact gcgatgcaaa 9061 attatccact tctatggtag aaatactgag tcctggcaac tttaatgttt gtgtgggtac 9121 attttgtaag acaaacatga cttgaaacaa aggatgatag ctcaaagagc gttctggttg 9181 caattcttcg actagcttgt caaaaggtaa gtcttggttg gcgtaggcgg acattgcgtg 9241 cgatcgcact ctttgcaaca gttcccgaaa actcggatta cccgacaaat cagtacgcag 9301 taccaaagta ttgacaaaaa aaccaattaa cccttctatc tcctcccgat tgcgtccggc 9361 ggtaggagaa ccaactaaga tatcttgttg 
tccgctgtaa cggtaaagca gagtttggaa 9421 tgctgccaac aatgtcatga acagcgttac tccctcttgt cgtgacagtg cactcaatga 9481 tgcagataaa tttttcggga gtacgaaaga ttgttttgca ccccggcgag tttgtactgg 9541 tgggcgcggt ttatcagtgg gcaattccag aactgggatt gcacctgcta actgctgttt 9601 ccaatagttg agttggtttt ctaccacttc acctgaaagc cattggcgct gccaaacagc 9661 ataatcagca tactgaatgg gcaactgtgg caaaggattg ggctttcctt caaggaatgc 9721 ttggtacagc gttgtcaact gctcaaataa aatgctcatc gaccagccat ctgtagcgat 9781 gtggtgtatg gtcaattgta gaatgtgctc ttgtggtgat atttccaaca agcacgcacg 9841 cagcatcaaa tctgatgaga ggttgaaagg acgttgcgct tcttgttgta acagcttttg 9901 tacaacggtt gtgcgttcag tttctggaca atccatcagg gaaatcagct tcagttccac 9961 tgactgaggt tcacgaatca cctgcactgg attgccgtct tctacgataa agttggttcg 10021 taagacttcg tggtgagttg cgatcgcatc caatgactgt tgcaaaacat ccacattgag 10081 gttgccttgc aattgcatcg catccacaac gttatatgct gcactgtctg gttccagttg 10141 tgccaaaaac cacaatcttg cttgtgcaaa ggacagagga attgattgtc ggttcgtttg 10201 tcgagaaata atttggtgtt cttgagtccc ttggttttga cttgtggtaa ttgcgcttgc 10261 taattcggca atagtcggtg tttcaaacag agactgtagc gaaatttcta caccaaaggc 10321 ttggcggcac cgagatataa tttgtgttgc taataatgag tgtcccccta attcaaagaa 10381 gttgtcgtga atacctactt tctctacctg gagtacttct tgccaaacag tagcaatgat 10441 ttcctcagtc tcagtttggg gtttgacata agcaacttct agttctggag acaccacttt 10501 gctgacaggc agagcaaggc gatctatttt accgttaggc gttagaggta aagaatcaag 10561 cagcacaaaa gcagaaggca ccatataatc tgctaacttc tgataagata agccgcgcag 10621 ttcttttata aaaacatttt ggataatttc ggataaagat aagttgggaa ctacataggc 10681 tacaaggcgt ttttcccctg gtacatcctc tgtggctatg accacagctt ctttgacagt 10741 aggatgttgt acaagtatgg actcaatttc tcctaattca acccgcatcc ccctgatctt 10801 gacttgaaaa tcagtgcgac ctaataattc tatcgaacca tctgaccagt aacgagctaa 10861 gtcgccagtt tggaataatc gctgctttcg atcttgactg aaagggtttg agataaactt 10921 ttcggcattc agctttggat gattgagata gccacgagcc aaacaagcac cacctatata 10981 aagttcgccg acttctcctg gtgcaacagc ttgaagatgt tcgtcaagga tataagtttg 11041 gttatgggca taaggatagc 
ctacgggtat gtaaatatat cctagttcag gatcaaagtc 11101 ctttggaacc aaataagcac agttaccaat ggtttctgtt tgaccataaa cgttgaaaat 11161 ggatactcga tcttgaaaca ggtctcgtag ttgtttgtaa agttgaactg gcggcaagtc 11221 accagaaagc acaatagttc gtaatttaga tttctgaagt gcttctgtgt gtcttttgtc 11281 cagagtttct aatccctgaa gtccatagcg ccaaattgag ggtacgccgt cacaaatcgt 11341 tactccttgt ttctgaatca gctcaaacaa actgatgggg tttttggttt gttcgccagt 11401 agcaataata ctggtggctc ctcgatacag aggcaccatt aattgcctga ctgaggagga 11461 aaacgagaaa gatgccacgt ggagatagac atcctcaggc tgaacttgca atacttggct 11521 gactgctgga atataatgcg caacattagc gtgagtgatt tgtacgcctt tgggttttcc 11581 agtcgaaccc gaagtataca tcacgtaggc aagatgctct tgtgttaact cactgagtgg 11641 gttttcttga ctttctttag caatactctc ccaatctgtg tcaagacaga ccactcgtga 11701 ttcctgctca aaaagctcat tcgccaactg cttggtcgtt aacagcaccg acacctgagt 11761 atccgataac atgaaatgca agcgttcttg aggatacttt gggtcaattg gcacataagc 11821 cccacctgcc ttgagaattc ctaacagtcc taccaccata tccaaggacc gttccataca 11881 aatcccaacc atgacctccg gggcaacccc caaagtacgc aggtagtgcg ccaactgatt 11941 tgcccgttgg ttcaactgtc gatacgtgag ttgctggttt tcaaatacca cagcaacagc 12001 attcggagtc tgctgtacct gagcctcaaa catctgatgg atgcacagat tccttgggta 12061 atctgtttgt atcagataat cttccgcctc ggtcaagtca tttagcccag tagctggctg 12121 aaggtatgaa tcttggtcaa tgatgagctt ggacatggtc ttaagtaaat tcctaaacta 12181 ttttttatac acaacaaatt ccagaaagtt tccaattttg ttggactaag caaaaaaatc 12241 tcttgagcta ctgctcataa gagctactta gctgctgcat attcacaatt gagatgcaat 12301 actttttctt gttgtatttg ctcttttgcc aaagctaatg cctgacaagc tttatttaaa 12361 tttttctgat ctctcgcctc attttcaaac cgttgtatat aagcttcgag aaacttaata 12421 cgtgaatcga tatcatctgc atttataaga aaaggcaggt cagattcaac cataaagcgg 12481 atagctaaca tgggaatagg gctgcgaagt ggacgaaaat tttcgttgtg caaaccagga 12541 ctttcattat gcttgtgaaa ttcccctatc atcagtccta gatcaacaaa taaaggcttc 12601 agtttttgtt gaacaccatc aaccagtttg gaagtctctt ctctatcaag ttcaggaaag 12661 atgagtaaga tggttttgtt tatggcgctt tctctgtctc gtggttctag ttcaaggaaa 12721 
gtatctcggt agcacaaaac catctcttca agttgctgtg gttccatatt cttagcatga 12781 ataactgcga acctgatagt gtttgacttg attgcgtaag gtacaaaagg gcaaacagga 12841 cctgaccggc ccaaatctgg atgaggtttt gccaaaaagc tttgaaccca ttgcatgatg 12901 ttgattaagt aaggaaggtc ttgttgaagc tgattaagct caattggtgt gtagagttgc 12961 attttgacta ccttatgaaa actaacggat gaagaaaaca agacgaatct aacctataga 13021 aaatgaagta agttatagct aatgagccaa aagctatatt cattttttat aggatatttt 13081 tcccttagct ggcgctcaac taaaggatgg ataaatatga ctgcaacaag attcttagtt 13141 tcgcatctct caagtaacct cacagacgga atttacttgt atgtagatgt ctttgtcctg 13201 gttgtgggag tttataaagt atattaaggc tgagccataa ttactttttt tgatagaagg 13261 atagatgtta ttgactgaag gactgactga gtattagaag tccgcataat ataagagtct 13321 accaagcgaa ctgtaaagtt ggtgtttcaa tatccatcca gagtatgaca ttttcgtgtg 13381 gaataaaaca acttttatgg acgagatccc ttgaatatag tctaattatt taaatctcaa 13441 tctagaaata ataaaagact attgttacgc tgccttttat gagaaatgtt atagcagttg 13501 ccaggtagat taggacatga actaatgaga aaatacggac accaccagac catcaagccc 13561 gtcaagagtt ccctgttaag agttccctat ttcctgctat aactaattcc gtggaacaaa 13621 cctgtacaaa acatgaagtt cgcgcacaga cttcaggcgc aaccctgtgc aggtaatagt 13681 tctgcgctct cgtgttgaat cgtgagaggc gcagccgtgc cgcaggcata gtacctcaaa 13741 ttcaggtgtt tcttagaatt ttttttcaat taagactcaa aagcctaatg tatttttgat 13801 tgagatatca tcaaaaatac atcgtgcaaa tttagttaaa aaatcactca attttaactt 13861 tagcaaatca actgtgattt tgcgtaatag tttgtttgat ttaatactta attagtgtta 13921 ctcattttgt ttatatattt taagatgcaa ccaataatac attatatatc agtatagaca 13981 atactggtag taaaaatttt tctaatgctg aatgagccag ccaacgccca acattgttag 14041 gacgatccac aaatcgttaa tgattgatct ctaattcttt cccgcttcat caaatccgga 14101 aaaactgctg tgcccttccc tgttcatctg acaacagggg atttcactaa gtattttttc 14161 ctactaaaag ggggttttta gtcaaacaat tgcccttttt tcaaaaaaaa tcctgaaatc 14221 tatataaatt aatggtttca tctatccact tctggctgtg acactttaga atgattcatt 14281 gaaaagtcga atatacagca attacagccg ttttgagcat aaatactcag aaagtccgac 14341 ttttgtaaat ctctaaacag agatttaagt aacgatgcat ctttgatgca 
tctgtagaca 14401 atctcaagta aacctcaagg agtttgacat gaataaaggt gaattggttg atgccgtagc 14461 tgaaaaggct agtgttacca aaaagcaagt tgatgccgtc ttaactgcgg ctttggaaac 14521 gattatcgaa gctgtttcct ctggcgataa agtgacgttg gtgggattcg gctcatttga 14581 atcacgggaa cgtaaagccc gtgaaggtcg taaccccaaa accaatgaaa agatggaaat 14641 tccggcgaca aaggttcctg ccttctctgc tggaaaactc tttagagaaa aggttgcacc 14701 cccaaaatct taggttctca cctaagtgac actttttcta tgcggcaacc agcacacggg 14761 ggaaatctag cctgggcagc agcactagct ggctgtcccc caagtgctat tctggatttt 14821 tctgcaagca ttagcccttt gggaccacca aaaagcgcaa tagcggcaat tgagtcccaa 14881 ttggatcatc tcaggcatta tccagaccct gattatcgtg aactgagact tgctctgggt 14941 cactttcatc aattaccccc tgaatggatt ttgccgggta acggctcggc agaattgctt 15001 tctctggcag gtcgggaatt agcacagtta gctgcaacag ctttgataac tccagccttt 15061 ggcgactact acagggcgct ggcggcatac gacgctatgg tactggagtt tcctttgtca 15121 ttggtgaggc agtccgttgc ggaggtctcc tcgttgagtg gaactgccga acccgtaagg 15181 gtcatgagtc atttgtcatt tgtcatgagc gatttgtcat tgatttttga caaaggacaa 15241 cagacaaagg acaaaggact attgctgaat aacccccata acccaacggg ggtgttattt 15301 tcacgagaag ccattctgcc atatctcaag gaattcgcct tggtggtggt agatgaagct 15361 tttatggatt ttctgccccc aggacaggaa caaagtctga tacaagttgt gcaggaatat 15421 ccaaacttag tgattttgcg atcgctgacc aagttctaca gtttgcctgg tttgcgacta 15481 ggatatgcga tcgcccaccc cgaccgttta cggcgctggc agtcctggcg cgacccctgg 15541 cccgtaaata ccctggcagc agcggcagca gtcgctgtcg tccaggataa agagtttcag 15601 gagcagactt gggcatggct cccacctgca cgaaatcaac tcttcgctgg tttagcttca 15661 atacccggat tgcagccttt ggaaagtgcc gctaactttt tactcgttga atcacagcaa 15721 tcaagttcgc aattgcagca aaatttactc aagcatcacc agattttcat tcgtgattgt 15781 ctgagttttc ctgaacttgg cgacagatat tttcgggtag ctgtacgctc ttggtcagat 15841 aaccagcgct tactagaagc actgtcatta gttgtgagtc attagtcatt agttattagt 15901 cattagtcat tagtcattaa ggacgaagca caattgaaaa gaaagccaaa accgtgcccc 15961 aggcatcggg catcgggtta ggtaggacaa ttgacaaagg actagggaca aataacgtga 16021 ctattgacta cgatgtcgtg attattggcg 
gcagtttagc gggacgctac gctgccctta 16081 ctgcttctca actgaaagct aaggttgcct tggtagaacc aatcacaaac ggtgcattaa 16141 cgatagttaa cgcaccatta gagtttattt atcaccatgc tctaagccac accagtaacc 16201 ttagcaagca actaggtgat gcagcactac ttgggcttca tacctcatgt gctgatactc 16261 tagaaaaatg ccaaatttct gtagatacgc ctcaggcaat gctgtatgct catagcattg 16321 tttccaatct ccaagagcag aattcacttc gcctcctggc tgcccaaggg gtggatgtta 16381 tcgtgggtaa tggtcaattt caatcctcac cccatctttc atttgccgtt gatactcgct 16441 tgctacacgc acgcacttat ttacttgcaa cgggttcaag tcctgcaatt ccagaaattg 16501 aaggattaca aagaactggc tttcttacca tacctcaagt ttggcaatct ctcagcagtt 16561 ctacaccgcc taaacagtgg gtgattcttg gtggtgttcc ccaaagcatc caactcgctc 16621 aaactctggc gcgctttggt tacgatgtga cgctggttat ggaacgtccc aatcttcttc 16681 ccaatattga ctcagagatg gctcaactgc tttgcagttt gttagaagca gaaggtgtgc 16741 gcgtcttcat taaaacatca atcactcagg tcagaaaaat tgaggataaa aaatggcttc 16801 aggtgggaga taaagcaatt gaaactgatg aaatcgtagt agctacagca caacagccaa 16861 atatacaatc cttaaatttg gctgcggtag atgttaaatg gaatcagcgt cgtttacttg 16921 tgaatgaaaa actacaaacg acaaatcccc gcatttgggc ttgtggtgat gtgattggtg 16981 gctttgaatt cgccaatatt gctaattatg aagcaaaaat agctctgcaa aatgcgctct 17041 ttttcccgag gtttaaagtc aattatcgtc atattccttg ggcagtattt actaacccga 17101 tgtttgcaca agttggttta acagaggcgc aagctaaaca ccaatacagt cctgatgaag 17161 ttattgttgt acagcagttt tttaaaacac tctttgctgc ccaaatccaa gacgaaacca 17221 cgggtatctg taaattcatt gtcctacgta atggtgaaat tttaggagct tcgattttcg 17281 gtgcacaggc aacagagttg attaatgtta ttgctttggc gatcgctcaa acaatcaaac 17341 tccatcgcct cgctgattta gctcctctgt gccccagttt ttcagagatt tttgagcaaa 17401 ttgtagagac gtggaaacag cagaaattga atagtaatat tgcttggcaa gactgtctgg 17461 aaagcttttt tcaattccga cgaaattgga atttctaaac tttgatagag ctaatgaact 17521 atatgcaaat caatccgcat gcaattttta ggcgattggt tgactatctt tacatctggg 17581 tgtttagtta ttatgagtcc tggaccaaac tttgtcttaa cactccgcaa tagcctagct 17641 cattctaaac aagccggaat ctacacagcg ttgggagtaa ctgctggcga cctcattcat 17701 gtcatctgct 
ggttaatcgg tattggtgtt attatttcta aatcaatcct attatttaac 17761 ctgctcaaat ggcttggtgc tgcttattta atttaccttg ggattaaatc cttacaagct 17821 aaacatcaag ataattttgt tgaggaacaa acttcgtcag agttgaaatc tttaacggcg 17881 tttaaaaacg gtttcttgac ttgtctgctc aatccaaagg taacgttatt tttactggca 17941 ctgtttactc aaatcattcg tccggataca cctctggcgc tacaaataat ttacggttta 18001 acaattgtgg gaattgaatt cacctggctg gcatttgttg ctactgttgt ctgtgttgcg 18061 gcaatcaaac gtcgattttt gtcaatctca cattggtttg agcgaatcat gggcgcagtt 18121 ctcattttct tagctctgcg tctggcactt gctaaagcac acgattaatt ggtttagcag 18181 cttttcatcc accacagtag tattaataat acgttaaaat atttgtaaca atcctttgca 18241 attcttcaaa agaaagtaac aagtttgaga gtagccaacc gctaccctaa ttggtatgct 18301 tctatctaca ttaagtagaa aaatagctga gtttgccgtc aaaaacgtgt atttttgctg 18361 cataagctta gctacatgcc ctaatggaca gtaattaatg gcattttgcc catcaagcaa 18421 aaagcaggag cggctaaaac caccacttct ttgtgcacaa aactgtcttc cccatttgac 18481 gaatctactg gaatgagtac caataaaaga agttccactt gcctaaaaaa ggagaatagg 18541 ggcttggcaa tcaatcgctg gaaacaggat ctcaaaaatg acctcatagc cgggttgttg 18601 gttgtgattc ccctggcaac taccatttgg cttacaataa ccgttgcaac ttgggtcgtc 18661 aactttctca cccaaattcc caaacaactg aatccgtttc agggaatgga tccgattttg 18721 gtcaatctac tgaatctctt ggtgggactc atggtgccac tcctgagtat actcctgatt 18781 ggcttgatgg ctcggaatat tgcagggcag tggttgctag atgtgggtga acggctattg 18841 caggcaattc ccttagcagg acaggtttac aaagctctca agcaactttt agaaacactc 18901 ttaaaagaca gcaatggcaa gtttcgccgt gttgttttag tagaataccc tcgacgggga 18961 atgtgggcga tcgccttcgt tacaggtatg atcagcaccg atatccaaac tcaaatgtct 19021 cgcccagtat taagcgtttt catcccaaca acccccaacc ccgccactgg gtggtatgca 19081 gttgttccag aagacgaagt tgtcaacctc tcattatcta ttgaagatgc atttaaaatt 19141 atagtctccg gtggcattgt tgctcccaat acctccccga ctccattggt tatccccaaa 19201 gaacgcaaat tagaaatgcc acccttagaa gcaaagcggc agattcttcc tctcgacgag 19261 acttaaaatc agtagtcagt agtcattagt cattaggagg cagtgcggtg gacgggtttc 19321 ccggcataaa gcacctgccg ttcattagtc aggagttttc actttactcc ctccctcttt 
19381 ttatcccctt cccccctaac tcacttttat gcaagagcgt aaacctcgtc aaatttctcg 19441 tgaattggct cttttaagcc tcagccaact gccgttaaac tcaaagaaat tagaaaaaat 19501 agcaccagaa caactggtac caaagttagt actcgcagca gtacgtacgc tgcgatccga 19561 agtccaagat actctagaca atgctgcagg cgaactgcaa cgcagtaatg atcggctttt 19621 gagtagccaa acacgcgcca gtgatctaaa tactgccaga acaatgctca aagaagcagt 19681 tgtgtacacg caaacagcca tcaacaagtt agctgcagct atagaatttc ctgaactcat 19741 tcaactggca aatcaagata aagaagtccg cgagtacgct aaagagatta tcattactgt 19801 ccatgaaaac cgaaacacca tagatgagga catttccaca gctttggtag attggcaagt 19861 cactcgcctt gcccacattg accgagatat actgcgaatc gctgtagcag aaatgaagta 19921 tcttaaagtg cccgacagta tcgcaattaa cgaagctgta gagcttacga aacgatacag 19981 tgaagaagaa agccatcggt ttattaatgg tgttctgcgc cgagttaccg aacaaagaaa 20041 gatcgccagt actctatagc caaaagcttt attcttaaga agaacacaga acacagaata 20101 tcttcacaaa taaatagagt cttgcggcga tcagcgcgtg acaccctgga gttttggcaa 20161 agaaagaatt ctttaatata gttctttttt tgcccttaca tccctacacc cttagttttg 20221 atcatgacaa catgattaca ttcttcctaa aaaatcctct tgtcacgaga agcagaagtg 20281 tcaaaattta gataaaagct tatagtttca taacacttta tctgcaatgg tcttcaattg 20341 gttccgtcgt caacataacg atccttctgc tactccctcg caacagaaac aggaagaaac 20401 tcctgctgca aaacaacccc aaccccaaag cgccgaaacg tcaacagcag aaactgcacc 20461 cgaagtagcc acggatctgc tggcttacgc gaaagcagcg tacaaaaata ttcagcaaag 20521 gcaacaaacc gaaccagcag aaactccagc aactgaagta acagccgacg tcgtagcatc 20581 accagcacaa acagaaacgg cacaaccaca accagcagaa gaaatacagt ccgaagaatc 20641 agctgatgtg actgcgactg aaaccacccc agaagaacca gaaggtacag aggaactcag 20701 cacagcacca gcagtagcaa ttaccgaaga accagatgct gaaagtgtga caacagaaag 20761 tgtcgctcca ccagaactca ccgaaccact gacaacagaa agtgtcgctt taccagaacc 20821 cagcgaacca cccacaacac aagaagtcac tcagccaaca gcaccagcaa ctttatcatt 20881 tttagaaagg gcagctgcag aacgacaagc aaagcaggaa agattgatag ccaccgccat 20941 tgaagtcaca caaccaaagg tggtacagcc agcagcccaa acaagtaccg cgccagaagt 21001 catagaggaa atgactggac ttgattttga tgaagggttc 
ttgtggtcaa cagaagttct 21061 tgctgcacaa ggtagacgtc cagaagatat ttcttttgaa gaaattactt ggctgaaaaa 21121 gctgcgacaa agcttagaca aaacccgtcg taacatcgtt aatcaactca agtcaattgt 21181 tggacaggga ccactcaacc aagcagcggt gacggagatt gaggcattgc ttttgcaagc 21241 tgatgttggt gttgaagcaa cagattacat tatcagtact ctacagaaca agcttcgtca 21301 agaagtcctg ccagcagagg aggcgatcgc ctacctcaaa caaatcttgc gagatatgct 21361 ggatgcacca gtgaaaaaat ctaacaagcc aatctttgtc ccagaaaaag aaactctcaa 21421 tgtttggtta atcactgggg taaatggcgc tggtaaaacg actaccatcg gtaaaattgc 21481 tcacctagca cagaaatctg gctacaagtg cttgattggt gcagcagata ccttccgcgc 21541 cgcagccgta gaacaggtca ggatttgggg agaaagaagt ggtgtgcaag tcattttcaa 21601 tcctgctaaa aatgcagacc ctgcagcagt tgtgtttgat gccatctctg ctgctacagc 21661 acgagaaaca gaattacttc tggtggatac cgcaggacga ctacaaaata agaaaaattt 21721 gatggacgaa ctcagtaaaa tccgccgcat cattgataaa aaagccccca atgctaaaat 21781 agagtccctt ttggttttag atgcaactct aggtcaaaat ggattacgcc aagctgaggt 21841 tttttcccaa gcagcacaac tgagtggtgt tgttttgact aagcttgata gcactgccaa 21901 aggaggcgtc gcccttgctg ttgtgcagca gttaggctta ccgattcgtt ttattggtgc 21961 tggagaagga attgaagact tgcgtccttt ttctagctac gagtttgtcg aagcactttt 22021 gaatggttag ccgttaccga aaaactgggt ttaaagcccc gcccttctag ggcggctttt 22081 ctttaacttc ctaattagtt ctctaagcac atcgttttga gttcgatccg tttctttgca 22141 gtactccaca agtatcttgt actcttcatc agaacaccga atggttaatt ttttcatgcc 22201 gtcataatgc cgtctatttg gggtacaatg ttagcatgaa aacactgaag tttaagctct 22261 acgaacacaa acggaataga tacctcaagc gcacaattaa cgctgctggg gtgatttata 22321 accattgcat tgctctccat aagcggtact accggatatg gggcaagcat ttaaattgtg 22381 caaaacttca gtctcatatt gccaagttgc ggaagagaaa cccgctttgg caatcggtag 22441 gctctcaggc agtacaagat atctgccaac gcatcgagaa agcgtaccaa ttatttttta 22501 aacacaataa aaagggggtt cgtccaccag gatttaagaa ggttaagaag tacaaatcat 22561 tcacccttaa acaggcaggt tataagtttc tgggtggtaa tcgagtaaaa attggcagta 22621 gagtttatca attctggaag tccagagaaa tcgagggaac ggtcaagaca ttaactatta 22681 aacgtacccc gttgggtgaa 
ttgtttatgg ttgtggttgt tgataattgt gttgcgtgta 22741 aagttaattc cacggctggt aagatagcgg gttttgactt cgggctaaag acattcctca 22801 cctgctcaga tggttctagg attgattctc cccaattctt caagcaatcc cttaacgcca 22861 ttagaaaagc tagtaagcag cattccaaga agttaaaagg tgtctcctcc ggagacgcta 22921 cgcgaacatc caaccgggaa cgggcaagat taaacctggt gcgctcttat gaggatattt 22981 gtaatcgtag gcgtgattgg ttttggaaat tagcccatga actaactgat aggtttgatg 23041 tactttgctt tgaaacgttg aatcttaagg gtatgcaacg cctttggggt cgtaagatat 23101 cagacttagc ttttggtgag tttctgcaaa tcctagaatg ggttgccaag aagaagcaca 23161 agcaactggt atttgtggat cagtggtatc catccagtaa aacctgttct agctgtgggc 23221 atatcttaga aagtcttgat ttgtcagtaa gagtttggcg ttgtccatcc tgtcaatcag 23281 tcaatgggag agacgataac gcagctaaga atattcaaat ggttggggca tcaaccattg 23341 ggttaggcga tgtaagacgg gctttgcctg ctgttgctgt ttgaccccag aatccccacc 23401 cttcaagggt ggggagtatg tcaaggacac acacagcaag cctgcctctg caggctttgt 23461 tcgtatcgcc acacgatacg cgcctgtagg cttatccgaa ccgtattgct tttgagttgt 23521 ttttttgtca aaatttcctg acttagagtt taacaatatt ctctagtggt aaattaaaga 23581 tttgaagagt aacagttaaa caacggttcc ccgacctgag gaaactaccc gtcccgacgg 23641 atagcagcct ctaggctttt attgtcaggt tttatgaccc tcagacagtg aaacacaagc 23701 agactttttt ggaacctaat ggtttacttt tttcaatcaa atgtttatat ctaaagaggt 23761 atgacaaagt gaagaatttc taagataata gaacccctat taacctctgt aacaaaaagc 23821 agtttttgct aatcgttttt tcctaaaaag acttaactct aaattttaat attaagttaa 23881 agaagcacat gaaatatgaa agacttttaa aattttcacc ctacgttttt ctactcttgc 23941 taaaccgata aacttgtcac aagatggttt agctattcaa aaacgaaagt attaaaagaa 24001 tctaatactt gctgaagtaa atcacaatgg taactgtgcc tcaaccaccg tctcaaccta 24061 ctgatagtaa tagtcgtacc ccgaccgagg tcacaccagt tgtggcactc aaagaactcg 24121 tggcacggtt acaccgagaa caaaataaaa ttcaagattt actcagttct ttgggatttg 24181 ctctgcgaag cttcaataat ttgaatcagt ttttggaact gattccgctg atggcaacaa 24241 gagtcactga tgcagatggt agcgccctat ttgtgtacaa accgaatggt caagtgagat 24301 tagaacagtt acattggcaa gatagtcacc agcgtaaaaa tatccgcaaa gccctagaaa 24361 
cagcaacgag tcaaatcgta tgtgtaccca acaataccca gcctctagca ataatgtcgg 24421 gtattttgga tgattacatg caccgcagct taggaccagg tatacaaatt tttggtacag 24481 ctattctcgt taagcataca gaacggggat ggctctatgt cttaagccgc gatccagaat 24541 atacttggac agaaacgaga caaaagttag ttcgcctagt ggcagatcaa accgcagtcg 24601 caatagaaaa tgatgaactt gctgtagaac tcaggaaaaa agaacgccta gatcaagaac 24661 tagaaattgg ggcagaaatt caaaggcggc ttctgccacg tcagtgccct acaattccag 24721 gtgttacgat tgcggcacgg tgtaaacctg ctaatcgtgt tggcggagac tactacgact 24781 ttatccccac aaaccacaac caaattcagc taaataacaa agagagttta gatgcagatg 24841 gtcgttgggg tttggtgatt ggtgacgtga tgggcaaagg agtcccagca ggactgatta 24901 tgaccatgat gcggggaatg ctgcgaggag aagtgctgca cggtcattct cctcaacgaa 24961 ttctgcaaaa cttaaatcaa gtgatgtacg cggatttgga aaattctcac cgctttatca 25021 ctttgtttta ctcagagtat gacccccaaa cccggatgtt gtcttacagc aatgcagccc 25081 ataatcctcc catatggtgg catgcagcaa cgaaaacagt cacgcgtcta gatacttttg 25141 gaatgctgat tggtttagat gccaatagcc agtacgaaga tgctcaagta aaattagagc 25201 ttggagatat tattctttac tacacagatg gcttgaccga tgcagctgca gcaagtggcg 25261 atcgctatga tgaagaaaac ctcatcacga tttttaacta cgcttgcaga atttgtaatg 25321 gtccgcagga aattctagat tacctgtttg accaagtgca agaatttatc ggtgccgata 25381 agcaaaacac cgatgatatg acattagttg tgctgcaagt taatagttag tcgttagttg 25441 ttagcaagct gacaagtaac aactgtagtt aaaagcctta tctagtgttt aactacttga 25501 caaataatct aaaaactgct ccggcactaa cctcaactct cctgactgga aacggtttat 25561 atctacccac aaagcagttt tctgtcgctc accttcagaa aaaactaact tttcaatttc 25621 ataaaatttg gcatcaacaa agtcgctttc aaaaagttga ataatttcgt gacctggttg 25681 accattgaaa gtaaagatgt tttccaaaca acccaagtaa cgaatatttg ttaattctgc 25741 ttggatctct tcttggaatt ctcgtttgag ggcttctaaa cttgtttcac caaagtcaac 25801 tccaccaccc attgcgcggt aaaaagtttg ttgtttgact ggatcgtagc cctcagaaac 25861 gaagatacgt atgcctatgt cctgcggaca cgctccgcta acgccacgct tcgcgtgttg 25921 agtatcgcga atcagtccta aagcgatgac acgaatttcc cctggcttat gcatttttgt 25981 tgctagttat cgtaaataaa tgaaggacgg ctttcgctct cctctatggg 
ccaatcttca 26041 ttttcaccgc cgatgtacaa ttgctcaatg gtgacatatt cctcacctag ctgcttaagg 26101 gcgttaatga gaatatcgag ggcgatcgca tcagaagttc ccaagtcaaa ccagcaacgc 26161 gcccatgtcc cctcatactc aaactcaccc atattgtgca tcagcgccag caaactttta 26221 tcatatcctt gggagtcgta attcatgtag ctcagatcta atccagtttc ctgtacctgg 26281 agattttcgg cattaaatgc acccaattta cccaaataaa accaggaatt gaaagcctct 26341 tctacatact gtttttcttg tgcagaagga attgtgctga actccaacca aatccataaa 26401 tcaaaaggat taatttcgcg aaactgtacc tgcattttct ttagaaacta aatctagatt 26461 actttttatc aaatctcatt gtaaaagagt aaggtgcaaa gggtgtgcct ttcctaatcg 26521 agaagtgggg aagcttcggg agc // LOCUS NODE_1159_length_25985_cov_5.11438525985 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 25985) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 25985) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..25985 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(257..1465) /locus_tag="DP116_10105" CDS complement(257..1465) /locus_tag="DP116_10105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hemolysin D" /protein_id="PRJNA477356:DP116_10105" /translation="MKLPFASSPAQARQTKEQFANPDAYLDYELGKAVQELPPLYTRL VGVTLSLAVFGAIAWAGLSKVDEVAVAPGKLIPGEQDVQPVRSPSSGKIKYINETKVK EGQPVQKGDILVALDSESSQTEIQRLNNQAQLMKQDILRATKAAEESQKARIKEAEIE YSRLRNNLNSAQRKADKECPFFGPITRVKCEDAKVELNNSKKSFEAQEQKIKQLQQNY KTGSLSDLSKRREELQTIERQLAQAKNQWQNQTITAPITGRVYNVKVNPSQGTVQPGE ELLSILPEGKEPLLEVDLPNQYRGFVDEQMNAKVKIEAFPYQEYGVIEGTVVYVSPYA VVKDKNSGKEVYPTRIKLHKITIRRRGQDKTLTPGMEASGEIVMRQKSILSLLIEPVT RKFDEVFSVK" gene complement(1478..1984) /locus_tag="DP116_10110" /pseudo CDS complement(1478..1984) /locus_tag="DP116_10110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744908.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(2068..2589) /locus_tag="DP116_10115" CDS complement(2068..2589) /locus_tag="DP116_10115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870148.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acetolactate synthase small subunit" /protein_id="PRJNA477356:DP116_10115" /translation="MKHTLSVLVEDEAGVLSRISSLFARRGFNIESLAVGPAEQSGIS RITMVVPGDDRVIEQLTKQLYKLINVLKVQDVTEIPCVERELMLLKVNATSSTRSEIV ELSQIFRARVVDVAEDSVTLEVVGDPGKMVAIVQVLQKFGLREIARTGKISLTRESGV NTELLKSLEAKAS" gene complement(2797..3414) /locus_tag="DP116_10120" CDS complement(2797..3414) /locus_tag="DP116_10120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740694.1" /note="MobA; links a guanosine 5'-phosphate to molybdopterin to form molybdopterin guanine dinucleotide; involved in molybdenum cofactor biosynthesis; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdenum cofactor guanylyltransferase" /protein_id="PRJNA477356:DP116_10120" /translation="MTNHLTAIVLAGGKSSRMGRDKALIPIQGVPLLQLVCRIAESCA DTVCVVTPWQERYQHLLLSTIEFIKEVPLSGETGSEQLPHGPIIGFAQALAYVQTDWV LLLACDLPKLRVEVLQEWTNALDSVEGEAIAALVPQAKGWEPLCGFYRRRCLPSLVEY INRGGRSFQEWLKQHPVQALPLPDPEMLFNCNTQDDLFRFQSQVQ" gene 3490..3774 /locus_tag="DP116_10125" CDS 3490..3774 /locus_tag="DP116_10125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319973.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_10125" /translation="MIVSFKSEETKFIFEGFTSSQYPSNIQKTALRKLLILDAATSIN DLRLPPGNRLEKLVGDRIGQYSIRINDQWRICFVWTDENNALEVEMVDYH" gene 3790..4098 /gene="higA" /locus_tag="DP116_10130" CDS 3790..4098 /gene="higA" /locus_tag="DP116_10130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002773507.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="addiction module antidote protein, HigA family" /protein_id="PRJNA477356:DP116_10130" /translation="MNNNRLPNIHPGEILQLEFLEPLNITPYRLSKDIGVAQTRISEI LSGKRSITADTALRLSRYFGNNAQFWLNLQTQYDLRQALEENEEVYNQIPKLPLNDVA " assembly_gap 4432..4441 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(4451..6526) /locus_tag="DP116_10135" CDS complement(4451..6526) /locus_tag="DP116_10135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875902.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thioredoxin domain-containing protein" /protein_id="PRJNA477356:DP116_10135" /translation="MTNRLAQAKSLYLRKHAENPIDWWSWCDEALATAKTQNKPIFLS IGYSSCHWCTVMEGEAFSNLAIAEYMNANFLPIKVDREERPDLDSIYMQALQMMSGQG GWPLNVFLTPDDLIPFYAGTYFPVEPRYGRPGFLQVLQAIHHYYDTEKQDLSERKTAI LESLLTGAVLQQEGITESQDKQLLHNGWETSTGIITPTQYGNSFPMIPYAELALRGSR FLFPGSASERSRYDSKQVCTQRGLDLALGGIYDHVGGGFHRYTVDPTWTVPHFEKMLY DNGQIVEYLANLWSLGVQEVAFERAIAKTVQWLKREMIAPTGYFYAAQDADSFSDPTE VEPEEGAFYVWSYSELEQLLTPEELTELQQEFTITPEGNFENKNVLQRRNAAKLSETV ENTLAKLFAIRYGAAPESLETFPPARNNQEAKTGNWKGRIPAVTDTKMIVAWNSLMIS GLARAYAVFQQPEYLELAATAANFILDHQFVDGRFYRLNYESEPTVLAQSEDYAFFIK ALLDLQACSPEEKNWLERAIAIQEEFHEYLWSVELGGYYNTSSDASQDLIVRERSYMD NATPSANGVAIANLVRLALVTDNLHYLDLAEQGLKAFRSVMSRAPQACPSLFTALDWY RNCTLVRTSAEQIQSINSMYLPSTAFVAVSKLPEGSLGLVCQGLKCLAPAQSLEKLLQ QVQQSQVRG" gene 7093..7698 /gene="clpP" /locus_tag="DP116_10140" CDS 7093..7698 /gene="clpP" /locus_tag="DP116_10140" /EC_number="3.4.21.92" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013190934.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent Clp endopeptidase proteolytic subunit ClpP" /protein_id="PRJNA477356:DP116_10140" /translation="MIPIVIEQSGRGERAFDIYSRLLRERILFLGQPIDSNVANLIVA QLLFLDAEDPDKDIYMYINSPGGSVTAGMGIFDTMKHIRPNVCTICTGLAASMGAFLL SAGTKGKRMSLPHSRIMIHQPLGGAQGQATDIEIQAREILYHKRKLNEYLAEHTGQPY DKIAEDTERDFFMSPEESKEYGLIDQVIDRHAAGIRPMAVV" gene complement(7763..8572) /locus_tag="DP116_10145" CDS complement(7763..8572) /locus_tag="DP116_10145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875900.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10145" /translation="MLAQFQSKYPQGSLISELVQIYHGKYIVRASVQIDGVTRATGMA AAETVEEAEDQARNRALVVLGMSNSPDSVVVSPEPVKQVQPITTTTTRLNESAYPAAL KTFPQTEATPTSPVISSVVSKNDIKNEEIIERFSTTDNQETQSLEKSPFQNLEMMFGN QAENEISEISPSNVTPFPSRSYSSSVEDVPTQTTTGKKKKKSEPVDQSDDIAKIGVEM QRLGWTTEQGRDYLIKAYGKRSRHLLTDEELHDFLRYLESQPTPPDPLAGF" gene 9346..11706 /locus_tag="DP116_10150" CDS 9346..11706 /locus_tag="DP116_10150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129251.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="iron ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_10150" /translation="MEFSIATLLANFTDDKLVARKLLEKKLGCEDEVSLQKLHIALEI LEKIGVLVKERGKYRRVTEEGLIEAKLRCSSKGFCFAIQDVEGSEDIYIRESHLSNAW NGDRVLVRVLKEGSRRRSPEGEVKLILERSNHTLLARIKQVESGFRAVPLDDRLLFEL KLQQNSPSLEQAIDHLAHVEVLRYPLAQYPPLGRVVQILGSDAEAAADIDLVTCKHDL SRTFPESVQEAASKLPKKLLKADLKNRLDLRPLLTLSIIGNSNDSTMIENAFTLDKTS EEHWQLGFHIADLSHFIQPDEALDREALKRGRSVYLGELVLSMLPEGVAERCALLPKS DRLAISFLITIDSKSGQVGEWEVQPSVVKVDASVSEEQVEAILTNKSTKISSSLVEMV QQLDSLAHLLKQVRSSRGCLQLNLPPNQNPYYDEGAMGCVMVNDLPVHSLLTEFVLLV NQLIATHFNALGIPAIWRVQGAPDAEDVQEMLKLAINLGVELSLDPETDVQPLDYQQL TRVFAESASEQVLTYLLQDTLKPATYSTTKGPHFGLSLPEYVHFTAPLRRYPDLLMQR VFYTLLEHGRDRRTTRVKERVNLRHSNSHGEINWNVLPPELQQELQSDLTRVIIQLND REKEVQEAEADLAGLQRASLMKQRIGEVFTGVITGVQSYGFFVEIEVSPTVSNSVSNP RVPLRVEGLVHVSSLKDDWYEYRARQQALFGRKNRASYRLGDRVAVQVKSVDYYRQQI DLVTVGSDGLPVFSKDVNLSNGEDTNPYLSHEDIDPDDLDAYPDEE" gene 11978..12601 /locus_tag="DP116_10155" CDS 11978..12601 /locus_tag="DP116_10155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319362.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aromatic acid decarboxylase" /protein_id="PRJNA477356:DP116_10155" /translation="MSTHTKPLILGVSGASGLIYAVRALKFLLEADYRIELVASKSTY MVWQSEQNIRMPVEPAQQEQFWRQQAGVEGIGKLCCHSWSNVGANIASGSFRTLGMIV MPCSMATVGKLAAGLSSDLLERAADVQLKEGRKLILVPRETPFSLIHLRNLTTLAEAG VRIVPAIPAWYHNPKTIEDLVDFVVARALDQLDVDCIPIQRWQGRQD" gene 12878..13585 /locus_tag="DP116_10160" CDS 12878..13585 /locus_tag="DP116_10160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875896.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LapA family protein" /protein_id="PRJNA477356:DP116_10160" /translation="MAVFRLILLVTVLGGLTLLLVQNWSPVLPLVFLGMKSKALPLAI WILFSTAAGAFTTLFVTSLFNFSNFFAGQQRQTPLRSPTTSTARSQTRKEEPTPRPSP PPSSSKTESTRTSDPLNDWETDDSTDDWDFEEKQQQAPTPNSQNTQVRDSNTYERQQE PSSSYKSDSVYSYSYREPKNSGVGKTESVYDADYRVIIPPLQPPTTNQAQTNQESDDD WGFLDEDIEDQDKRPRR" gene complement(13618..13797) /locus_tag="DP116_10165" CDS complement(13618..13797) /locus_tag="DP116_10165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859680.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10165" /translation="MIRIRDVVQKALATGYLTVEAENQLRQLLTTRYDLEDLNAFMSL QEAAMTGKVKQESRG" gene 14280..15887 /locus_tag="DP116_10170" CDS 14280..15887 /locus_tag="DP116_10170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209182.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NB-ARC domain-containing protein" /protein_id="PRJNA477356:DP116_10170" /translation="MMVSDFNKVDKEFTEAKNHWEVEKLYVDLGSAKGKSLTPVEKKF LRGLLCGYSPAEIANTVYQSRSSSTVRVYLSNGLYKYIEEMLSYQVGYPVEVKSWSRV THLLEQAGYKKALFQKELASNQVKTEKNEFTIIKAAQTEDWGEAVDTSLFHGRTTELA VMTQWIVEEECRLVVLLGMGGIGKTALSIKQAEKIKDKFEYVIWRSLHLASPPEVILN QLIQTLSPTQQTIGVEDLNSSISQLIDCLRSSRCLIVLDNFDSILCSEYSTSDSDIYP TSSTNHSTINNSSAYHLPQIRYRLGYEGYGELIRRVGDSQHQSCLIVTSREKPQEVAA LEGDKLPVRCLKLKGLSHTESIKILKDKGFDNSTAEEYKLLLDRYTGNPLFIKLVATA IQELFAGNIYDFLEQDTIVFGDIRAILDKQFNRLSHLEKLIMYWLAFNQDLGSMRKLQ RDIVPRVSQRLILEAIELLHRRSLIEWQVSSFCQTPVLMEYVAERLIEENFKLMGEKP SSIPMLQTIFEVQIKNYIRDLRLNNEI" gene 15940..16500 /locus_tag="DP116_10175" CDS 15940..16500 /locus_tag="DP116_10175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209183.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="shikimate kinase" /protein_id="PRJNA477356:DP116_10175" /translation="MSIDLLKGVNLYLIGMMGVGKTTVGRLLGQHLDYGFVDIDTVIE KAAGGKSITELFAELGEPAFRQLESQVLSQVCAFTKLVIATGGGIVVQQQNWSYLHHG LIVWLDASVEILCARLAEDTTRPLLQNVDRKAKLQFILEQRQHLYRQADLRITINEGE TPEDIATRIIEQIPSVLKPGVLSTEC" gene 16598..17494 /gene="argB" /locus_tag="DP116_10180" CDS 16598..17494 /gene="argB" /locus_tag="DP116_10180" /EC_number="2.7.2.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860095.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acetylglutamate kinase" /protein_id="PRJNA477356:DP116_10180" /translation="MVSETEYMIKQDAASRVQVLSEALPYIQQFSGRTIVVKYGGAAM KDSNLKDKVICDIVFLSCVGLRPIVVHGGGPEINSWLGKLGIEAQFKNGLRVTDAPTM DVVEMVLVGRVNKEIVTLISKAGGSAVGMCGKDGNLIKARPQGEEGIGFVGEVSSVDT KILETLVNNGYIPVVSSVAADETGQPYNINADTVAGEIAAAIGAEKLILLTDTRGILK DYKDPSTLIQRVDIQEARELITTGVVSGGMIPKVNCCVRSLAQGVRAAHIIDGRIPHA LLLEIFTDVGIGTMLVGSQFTT" gene complement(17613..17798) /locus_tag="DP116_10185" CDS complement(17613..17798) /locus_tag="DP116_10185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017719828.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10185" /translation="MFGLSELKQTRFYQEVFAEGKQEGKLETIPQLLALGLSIEQIAQ ALGLDEQVVRQAAQPKS" gene complement(17850..18630) /locus_tag="DP116_10190" /pseudo CDS complement(17850..18630) /locus_tag="DP116_10190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875071.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="flagellar assembly protein H" gene 19511..20068 /gene="efp" /locus_tag="DP116_10195" CDS 19511..20068 /gene="efp" /locus_tag="DP116_10195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999184.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="elongation factor P" /protein_id="PRJNA477356:DP116_10195" /translation="MISSNDFRPGVSIVLDGSVWRVVEFLHVKPGKGSAFVRTKLKNV QSGNVMEKTFRAGETVPQANLEKSTMQHTYKEGDEYVFMDMETYEEGRLSASQIGDRV KYLKEGMEANVVRWGDQVLEVELPNSVVLEIVQTDPGVKGDTATGGSKPATLETGATV MVPLFISQGERIRIDTRNDTYLGRE" gene 20201..20752 /locus_tag="DP116_10200" CDS 20201..20752 /locus_tag="DP116_10200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319098.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acetyl-CoA carboxylase, biotin carboxyl carrier protein" /protein_id="PRJNA477356:DP116_10200" /translation="MPLDFNEIRQLLTTIAQTDIAEVTLKSDDFELTVRKAVSVSPML SAPPQAALGAVGTPNTPVPPPIPLVMSPQTVTVTSPNRPTDSNTLGLQSPPTGSSVIE QKFVEIPSPMVGTFYRSPSPGEAPFVQVGDRIRSGQTVCIIEAMKLMNEIEAEVSGQV IEILVQNGQPVEYGQPLMRINPD" gene 20817..21161 /locus_tag="DP116_10205" CDS 20817..21161 /locus_tag="DP116_10205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_10205" /translation="MKQALPVPPEVVQQVAEYFSLLSEPMRLRLLHLLRDEEKCVQEL VEATQTSQANVSKHLKVMWQAGILSRRSEGTCAYYRVEDEMIFELCNRVCDRLAFRLE QQARNFRILNTK" gene 21444..21893 /locus_tag="DP116_10210" CDS 21444..21893 /locus_tag="DP116_10210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010873056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ADP-ribose pyrophosphatase" /protein_id="PRJNA477356:DP116_10210" /translation="MQKQFPLTTVGALAVNPHGQVLIVKTTKWRGTWGVPGGKVEWGE TLEAAVKREFREEVGLELTDVRFGLLQEAVLDSQFVREAHFIMVNYYAFSASETITPN EEIEEWAWVTPQRATEYPLNTYTRVLISDYLQKQIDKLYNNKVQEQC" gene 21887..22606 /gene="tmpR" /locus_tag="DP116_10215" CDS 21887..22606 /gene="tmpR" /locus_tag="DP116_10215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013014560.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional dihydropteridine reductase/dihydrofolate reductase TmpR" /protein_id="PRJNA477356:DP116_10215" /translation="MLKKALVTGSSGGIGRAIALKLASQGFDVAFHYNRSGEAASKAS QEAATHGVKAIALQADVTNPDQAKSLVERTAENLGGLSVVVNTVGNYLGKPTSQTSIE EWHEVIDSNLNSTFYITQAALFYLKAANWGRIVNFACASAQNVVARRTNTAYVIAKTG VIIYTKSLAQELIKDNITANVVSPGIAENSFDVEEMIPKLPAKRAATLEEISNAVWFF ISPDADYITGQILEVSGGWSL" gene 22697..23584 /locus_tag="DP116_10220" CDS 22697..23584 /locus_tag="DP116_10220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009787003.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydropteroate synthase" /protein_id="PRJNA477356:DP116_10220" /translation="MLNLEDLYAIHEHYKDALNAKVEEFTIGNKNFNFNSKKAILGVI NLSSDSWYRESVCLSSEQAIRRGIVLNAQGADIVDIGAESTLEKAERVEDVRQKSQIL PVLEALNQEGVLTSIETYYPEVARECLRVGANVINLTGPEKSEEIYQAVSEFDAGVII CYVQGKNVREVGDFDFGDDPTDLMYDYFAKEIEIATKYGIRKIFIDPGLGFYYKNLQD SSLRIRYQMRTFLNTFRLRKLGFPVCHALPHAFECFGEEVRSAEPFFAVLAALGKTDL FRTHEVPKTKAVLDTLSFF" gene 23864..24985 /locus_tag="DP116_10225" CDS 23864..24985 /locus_tag="DP116_10225" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10225" /translation="MATGESPTCSVICSQAPEWNSSKLIFPFLIDNWKHFYEIEFPLE FKNTDKLHPSVLIGIAMAEIYRLCEICTPEIVQVCWYSCLAWESDWWNEDIYWYLQQK FYLEKWGWDKMPRIEFSQSAIEIENNWLPSVNDTYLLAVSGGKESTFGFEWMQQANLP MEAFTLHHAGGILGNNWQEKFPVFDYIRNRTRLWEILTHPREDPAEHFGYKGVRNDPT ITNALFLMMVIAAQQGHRFLVLANDKSSNESNATYEGREVNHQSAKGTAYIERFNSFL ERKGLPFRYVSICEEVYSIATVHQLSLWDKSILNVLTSCNEAQWAPGSCRWCCQCPKC AFSYALIEAATDYRFAVQVVGKDLFNLTTLEEVWRRATK" gene 25237..25968 /locus_tag="DP116_10230" CDS 25237..25968 /locus_tag="DP116_10230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007304232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_10230" /translation="MYSLKLELKLNNKEKSRLAGCAGFARFVYNFGLSMLTSSWDFEE IKASDSKRLTAIEKVFTNHVKTNPEFAWMQQYPSAIYSSALRNLAKAVERWRKGESGF PQMKSRKRGDSFTVLKKSGVYPAIGEPMLPFTNRQVLQLGKRITIPGLGDFRLKQPIP FLCSSQSFTISRTADKWFVSFSLDVEKIPPLIHEVESVGVDLGVKTFATLSDGTTIVA PSSLKKAKTKLNKLQWRNRRKRAIR" BASE COUNT 7483 a 5473 c 5602 g 7417 t 10 others ORIGIN 1 gctagggacg ggcagtcccg aagtaacgct acaggagttg gaacctctac gctacgaagc 61 atatgcaaac gatttaaagc agccgtagct tcaatagcaa agggagaagt agttcatagc 121 gaccgttgtc tggtttccaa tttgcaaatt cttcaaagct tactagttta ggtaaagctt 181 gagtcatata tacctcagat aactcttctt cagaaccccg acttctctaa gaagttgggg 241 ttcttatttc gacgattcac ttaacagaaa acacctcatc aaatttgcgc gtaactggtt 301 caatcaacaa gctcaaaatc gacttctgcc gcatcacgat ttccccacta gcctccatcc 361 caggagttaa tgtcttatct tgaccccgcc tcctaattgt gattttatgc agtttaatcc 421 tggttggata aacctccttg cccgaattct tatccttcac aactgcgtag ggactaacgt 481 aaactacagt cccttcaatc acaccatatt cttgataagg aaaagcttct atcttcacct 541 ttgcattcat ctgttcatca acaaatcccc gatattgatt tggcaaatca acttctagca 601 aaggttcttt accttctggt aaaattgaca ataactcttc accaggttgg actgtcccct 661 gacttgggtt gactttgacg ttataaactc tgccagtaat tggagctgta 
atagtttgat 721 tctgccactg atttttagct tgagctagtt gcctttctat agtctgcaat tcttcgcggc 781 gcttgctcaa atcagaaagg gatccagttt tataattttg ctgcagttgc ttgatctttt 841 gctcttgggc ttcaaaactt ttttttgagt tatttagttc aactttagcg tcttcacact 901 ttactcgggt gataggaccg aagaaaggac actccttatc tgctttgcgt tgggctgagt 961 taaggttatt tcgcagacgg gaatattcaa tttccgcctc cttaatgcgt gctttctggc 1021 tttcttcggc tgctttggta gcacgtagta tatcttgctt catcaattgg gcttgattgt 1081 ttagcctctg aatctcagtt tgagacgatt ctgagtcaag tgctactaaa atgtcacctt 1141 tttgtacagg ctgaccttct tttaccttag tttcgttgat atatttaatt ttgccactac 1201 taggcgatcg cactggctgc acatcttgct caccgggaat caacttaccg ggagccacag 1261 ccacttcatc aactttactc aaaccagccc aagcgatcgc cccaaatact gccaaactta 1321 aagttacccc taccaatctt gtataaagcg gcggtagctc ttgcaccgct ttacctaatt 1381 cataatctag gtaagcgtct ggattagcaa attgctcttt agtttgccgt gcttgagcag 1441 gagatgaagc aaagggcagc ttcattgttt ttgcaggtta tcgtacagtt ctcgcagatt 1501 tatctatagt atcactttct tgtgtgttat attcctcagt aaagtagtag tggctcagac 1561 taagattttt aggaagaact gcattcccca ccagtcccaa gacattctga ccagattttt 1621 ccaacagatc acgagcaata gtagcattca cagaatcaac tacccctggt cgaacgacta 1681 gcaaaacacc gtcagccatt tgacccaaag tagcagcatc agcggctacc gttaatgagg 1741 gagcatcaat gatcacgaag tcgtagttgg tagcaaaact ttccatcaat ctagccattg 1801 gtttcgagtc tatgagggat gctggactgg gaggcacaac tccacaagtg aggacataca 1861 aattatccat gacttgtgcg atacgctttg cgtcaagcct ctggcttatc gcgccttgcg 1921 cggctccctg gtggagcatc gctgccctga tttcagcatg atcgacaatc acattactca 1981 gacctgggca gcctgttgcg cgaagaacag gggcgatgca aaacgggcga tagcttcgcg 2041 ctctgcgctc ctttgcagat cgccttctta cgaagctttt gcttccaaag acttcagcaa 2101 ttcagtatta acaccagact cacgagttaa agaaattttg ccagtgcgag ctatttctct 2161 caagccaaat ttttgcagca cttgtacaat tgctaccatc ttacctggat ctcccacaac 2221 ttcgagggta acagaatctt ctgccacgtc tacgactctc gcccggaaaa tttgagatag 2281 ttcgacaatt tcagaacggg tgctgcttgt tgcattcacc ttgagtaaca tcaattctcg 2341 ttccacgcaa ggaatttctg taacgtcttg cactttcaaa acgttaatga gtttgtatag 2401 
ttgcttggtg agttgttcaa tcacgcggtc atcaccaggg actaccattg taatccgaga 2461 aatacctgat tgctcagcag gaccaacggc aaggctttct atgttgaagc cgcgacgggc 2521 aaataagcta gaaatacggg aaagaactcc cgcttcatct tcgacaagta ctgataaggt 2581 gtgtttcatc gtcgccaaca ttaggctaga ggaagtaagt tatgaataac agagaaatta 2641 agtcgctcaa cccaattaaa cgtgaaatgt cattacgagc gcaacgcagt gcagcgaagt 2701 aatcgcaaag actttattct acgtttttca atgttgacct acttagtatg tagccgcgct 2761 acacacacta ttgcacaaga ttgcaattgt aatcccttat tgtacctgag attggaatcg 2821 aaataaatcg tcttgtgtat tacagttaaa aagcatttct ggatctggta aaggcaaagc 2881 ttgcacggga tgttgcttca accactcctg aaaggatcgc ccccctcgat taatatactc 2941 caccagactt ggtaaacagc gacggcgata aaaaccgcac agtggttccc atcctttagc 3001 ttggggaact aaagcagcta tagcctcacc ttccacacta tccaatgcat tcgtccattc 3061 ttgcaacacc tcaacgcgca atttgggtaa atcgcaagcc agcagcaaca cccaatctgt 3121 ttgtacatac gccagtgctt gagcaaatcc aataattggt ccgtgcggta gttgttcact 3181 accagtttca ccagacaaag gcacttcttt gataaactca attgtagata gtaacaagtg 3241 ttgatagcgt tcttgccagg gtgtgactac gcaaactgta tcagcacaac tttctgcaat 3301 tcgacaaacc aattgcaaca ggggtacacc ttgaatggga atcaatgctt tatcccgtcc 3361 catgcgagaa ctctttccac ctgctaaaac tatggcggtt aagtgattag tcattattca 3421 taactctatg cgagctcacc ttgaagtcta tactaacgtt tgacgatagt aacgcatagc 3481 gatagtattg tgattgttag ttttaaatct gaagagacaa agtttatctt tgagggcttt 3541 acatcttctc aatatccgtc caatatccaa aaaactgctt tacgaaaact gcttatcctt 3601 gacgcagcaa cgtcaattaa cgatttgcgc cttccacctg gtaatcgttt agaaaagcta 3661 gttggcgata gaattggaca gtacagtatt cgtattaacg atcaatggcg aatctgcttt 3721 gtttggacgg atgagaacaa tgctttggaa gttgaaatgg tagattacca ttaacgcatg 3781 gaaataatta tgaacaacaa ccgtctgcca aatatccatc ctggcgaaat cttgcagcta 3841 gaatttttag agccactcaa tatcacccct tatcgcttga gcaaagatat cggtgtagct 3901 cagacacgaa ttagtgaaat tttatctgga aaacgcagta ttacagcaga tacagctttg 3961 cgtttatctc gctattttgg taacaacgct cagttttggt tgaatttaca aacacaatac 4021 gatttgcgtc aagctcttga agagaatgaa gaagtttaca atcaaatacc taaacttccg 4081 ttgaatgacg 
tagcctgatt ttgtctagca aaattttaac ttttgtcccc atctcataac 4141 tcaacagtat actaatcccg acttccccgc agcttgaagc caacgcgcat caggtttgca 4201 agccccattt ttttcttcac ggcgacgggt gtagacgtta aattgaaccg tgacagatga 4261 gagcaacatg gtgcgttact gcaacttggt atgaggcagc aagcagcgca cagttatgca 4321 ttccccggca gagccaggga acgagaaaag agccagggaa cgagaaaaaa ctgttttcaa 4381 ttttgaattt ggaattttga tagcgcagcg tgcccgtagg gcatattttg annnnnnnnn 4441 ntttgaattg ttaacccctc acctgactct gctgcacttg ttgcaacaac ttctccaaac 4501 tctgggctgg tgcaagacac ttcaaacctt ggcaaactaa tccaagactt ccctcaggta 4561 atttcgacac cgcaacaaac gcagtgctcg gtaaatacat ggagttaata gactgtatct 4621 gctctgcact ggtgcggact aaggtacagt tacgatacca atctaaggct gtaaacaaac 4681 tgggacaagc ttggggagcg cggctcatga cacttctaaa tgcttttaag ccttgctcgg 4741 ctaaatccaa gtaatgaaga ttgtcggtta caagagccaa acgaacaagg ttggcgatcg 4801 caactccatt cgccgaaggt gtggcattat ccatataact gcgctctcgc acaattaagt 4861 cttgacttgc gtcacttgat gtgttgtagt atccgccgag ttctacactc caaagatatt 4921 cgtggaattc ttcttggatt gcgatcgctc tttccaacca gtttttttcc tcaggagaac 4981 aagcctgtaa atccagtagc gctttgataa aaaaggcgta atcttcagac tgggctagga 5041 cagttggttc actctcataa ttcaatcggt aaaaacgccc atcgacaaac tgatgatcca 5101 aaataaaatt cgccgctgtt gctgctagtt ccaagtactc tggttgttgg aagactgcat 5161 acgcccttgc taaaccggaa atcatcaagc tattccaggc gacaatcatc tttgtatctg 5221 tcaccgccgg aatgcgtcct ttccagtttc ctgttttcgc ttcctggttg ttgcgggcgg 5281 gagggaaagt ttccaatgac tcaggtgctg caccatagcg aattgcaaac agctttgcga 5341 gtgtgttttc gactgtttca ctcagttttg cagcattacg cctttgcaac acattcttat 5401 tttcaaagtt accctcaggc gtaattgtaa actcctgttg taattctgtt agttcttccg 5461 gagtcagcag ttgttctaat tcgctgtaac tccagacata aaacgcgcct tcctccggtt 5521 ctacttctgt gggatcgcta aagctatctg catcttgcgc tgcatagaag tatccggttg 5581 gcgctatcat ttctcgcttc aaccattgga cagttttggc gatagccctt tcaaaagcca 5641 cctcttgtac gcccaagctc cacaagttcg ccaaatactc tacaatttgt ccgttgtcgt 5701 agagcatctt ttcaaagtgc ggtacagtcc aggtggggtc aactgtgtag cgatgaaacc 5761 caccacctac atgatcataa 
atgccgccca gtgcgaggtc tagtccccgc tgcgtacaaa 5821 cttgcttgct atcatatcgc gatcgctcac tcgctgaacc tgggaacaag aatctgctcc 5881 ccctcaatgc gagttccgcg tagggaatca tcgggaagct gttaccgtat tgagttggtg 5941 taatgatacc tgttgaagtt tcccagccat tgtgcaacag ttgtttatct tgagattctg 6001 ttattccttc ctgttgcaac acagctccgg tgaggagtga ctcaagaata gctgttttac 6061 gttcacttaa atcttgtttt tctgtatcgt agtaatgatg aattgcttgc aggacttgca 6121 aaaatccagg acgaccgtag cgtggttcaa ccgggaagta agtaccagcg tagaatggta 6181 ttaaatcatc tggggtgaga aaaacgttca aaggccaacc cccttgaccg ctcatcatct 6241 gcaaagcttg catataaatg ctatcaaggt ctggtctttc ttctctatct acttttatcg 6301 gaagaaagtt ggcgttcatg tactcagcaa tagccaggtt agaaaaagct tcaccttcca 6361 taacagtaca ccagtggcaa ctggagtagc caatagaaag aaaaatcggt ttattctgcg 6421 tctttgccgt tgcgagtgct tcgtcacacc aagaccacca atcaataggg ttttcggcgt 6481 gtttgcggag gtagaggctt ttagcttgag ccagacgatt agtcatgata ggatacttga 6541 tgacagcctt ccttgctcag tctagcttgt aacagagata tttacttcgc agcctttaag 6601 cactccatag ggtaagaagc taacaatgta agaagctaac aattttctca tcctagacga 6661 aataataata ttattcatct tattcaaaat tgttggacag cgttctggca attggtgctg 6721 acagcgtatt aaaagctggt gtcgcttctg ttttgatagt tgttgggcta agacctgcta 6781 acaggtggag taaaaaagta atgatggcgg cttgcacaaa tttttctagc atggcgactt 6841 cctcaggctt taggcgggtg acatcttaat tagaatgtgt tacaagattg ctccctatat 6901 tttcatctac cgcacgaagt agttattttc cggaaaagac tagtaataat gctgacaaaa 6961 ttctaaaaaa gtcacaaaat gaagctttct tcataaactt ggttgattgt aaaaattggg 7021 tttatggttc tttgaaaaca atcagccgtt gcaatcgata aaatgtatgt aaattagtga 7081 caagcagtct gcatgattcc tatcgttatt gaacaatcag gtcgtggcga acgcgccttt 7141 gatatttact cacgactgtt acgtgagcgc atcctttttt tgggacagcc gattgatagc 7201 aacgtcgcta acttgattgt tgcccagctg ctgttcttgg atgcagaaga tccagacaaa 7261 gacatttata tgtacataaa ttctcctgga ggttcggtaa cagcgggtat ggggatattc 7321 gatactatga agcacatccg cccgaatgtc tgtacaattt gtaccggatt ggcggcgagt 7381 atgggcgctt tcctcctcag cgctgggact aagggtaagc ggatgagttt acctcattct 7441 cggattatga ttcaccaacc tttgggtggc 
gctcagggac aagcgactga tatcgaaatt 7501 caggcgcgcg aaatcctgta ccataagcgg aagctaaacg aatatttagc tgaacacaca 7561 ggtcagccat atgacaaaat tgctgaggat acagaaaggg acttctttat gtcgcctgag 7621 gaatcaaagg agtatggatt aatcgaccaa gttattgacc gtcacgctgc tggtatccgt 7681 ccaatggctg tggtgtagtc agttatcagt tatcagttat cagtaaataa ctgataactg 7741 ttgagtgttg agtgttgagt gttcaaaatc ctgctaatgg atctgggggc gtgggttgag 7801 attctaggta tctgaggaaa tcgtgcaatt cttcgtcagt gagtaaatga cgcgatcgct 7861 tgccataagc tttaatcaga taatctcgcc cctgttctgt tgtccaacct aggcgctgca 7921 tttcaacccc aattttagcg atatcgtcgg actgatccac aggttcgctc ttcttctttt 7981 ttttccccgt ggttgtctgg gttggcacat cttctactga actgctgtaa ctccgtgacg 8041 gaaagggtgt cacattactt ggagaaatct ccgatatctc attttctgct tggttaccaa 8101 acatcatctc caaattttgg aatgggcttt tttccaatga ctgagtttcc tggttgtcag 8161 ttgtagaaaa gcgttcaatt atctcctcgt ttttaatatc gtttttactg acaacagatg 8221 agatgactgg ggatgttgga gttgcttctg tttgaggaaa agttttgaga gcagcgggat 8281 acgcagactc atttaatctg gttgttgttg ttgtgattgg ctgaacctgc ttaaccggtt 8341 ctggggaaac gacaaccgag tctggcgaat tgctcattcc caaaacgaca agcgctctat 8401 ttctggcttg gtcttctgct tcctccactg tttctgctgc agccattcca gtggcgcgtg 8461 ttacgccatc aatctgtacg cttgcccgga caatatactt accgtgataa atttgtacta 8521 attcagaaat aagactgcct tggggatact tgctctgaaa ttgagccaac ataatactgc 8581 taccaatcta aaatctgtaa aacaaagggt ttgcgctgac ctgtgagtcc aagctttgac 8641 tctcgttccg ttatggacta tactccaaat tgattttgcc aagtttaacg cttgaacagt 8701 taccagttag cagtttcatc cgcctgctta ctgataactg atgaccgacg actgatttca 8761 tccgtgtcct gattgtagca tgaggttttg gagtaaattt ttggcgattg cttgttgagc 8821 aagagtggtt gcatggtcgt tgctatacta attacctaat tagtgatgtg gaactatttt 8881 ctgtaagtgc gctatatatc tacaataatg aggtaatggt tcatcctcac ctattatgga 8941 ctgctgacct aattgtttgg gatcctacat atgggtcaat tagatttatc ggcagtttct 9001 cctagttaaa aatcagagtt acatggcgta aaagtacctt cacacaagac aatcaaacac 9061 ctgcggtttc cctaaagaga gcttttgggc agcgtggaaa attctgcaac gtacgtcact 9121 caaaaacccg ttacgtttta gcgagggggc aaaatctcaa 
acgggcgtac tttggaagat 9181 agtcttgcac aaggatcttg aacacatcaa tactctggat gttcgtgcag tgccctctat 9241 gattgggcaa ttcgtttgcc aaagggtgtt tttagtcgtg agggtacacg agcaaagaac 9301 cagattaaaa aaaaatgatg ttttgaccaa aggccggttc actgcatgga attttcaatc 9361 gctacacttc tcgccaattt cactgatgat aaattggtgg ctcgtaaact cttggaaaag 9421 aaacttggtt gcgaagatga agttagttta caaaaacttc atattgcttt agagatactc 9481 gaaaaaatcg gggttttagt caaagaacgt ggcaaatatc gtcgagtcac agaagaaggg 9541 ctgattgaag caaaactccg ttgttccagt aaaggctttt gctttgcaat tcaagatgtg 9601 gaaggatccg aagatattta catccgcgaa agtcatctca gtaacgcgtg gaatggcgat 9661 cgcgttttgg ttagggttct aaaagaaggt agtcgtcgcc gctccccaga aggagaggtt 9721 aagttaatct tagaacgctc caatcacact ttactagcgc ggattaagca ggtagaaagt 9781 ggatttcgtg ctgtaccttt ggatgatagg ttactgtttg aactcaaact gcaacaaaac 9841 agccctagct tggaacaagc aattgaccac ctggctcatg ttgaagttct acgttaccca 9901 ttagcacaat atcctccact tggtcgggtt gtgcaaattc ttggcagcga tgctgaagcg 9961 gcggcggaca ttgatttagt gacctgtaag catgatcttt ctcgtacttt tccagaatcc 10021 gtacaggaag cagcctcaaa attacccaaa aagctgctaa aagcagattt aaaaaatcgg 10081 ctggatttgc gacctctgtt gactttgagc attattggca acagcaatga ctcaacgatg 10141 atcgaaaatg ccttcacctt ggacaaaaca agcgaagaac attggcagct gggctttcat 10201 attgcagatc tttctcattt tattcaacca gacgaagccc tggatcgaga agcactcaag 10261 cgaggtcggt cagtttatct gggagaattg gtactgtcaa tgttgccaga aggtgttgcc 10321 gaacgctgtg ctttattacc caaaagcgat cgcttagcca tctctttctt aatcacgatt 10381 gattcaaaat caggacaagt tggagaatgg gaagttcaac ccagcgttgt caaagtagac 10441 gcctcagtca gtgaagaaca agtagaggca attctcacaa ataaatcgac caaaatttca 10501 tcgtccctgg tagagatggt gcaacaactc gatagtttgg cgcacttgtt aaaacaggta 10561 cgctcttctc gtgggtgctt gcagttaaat ttgccaccaa accaaaaccc atactatgat 10621 gaaggcgcta tggggtgtgt gatggtgaat gatttacctg tgcactcgtt gttaactgag 10681 tttgtgctac tggttaacca actcatagca acccacttta atgctctggg tatcccggct 10741 atttggcgag ttcaaggcgc acctgatgcc gaagatgtgc aagaaatgct gaaattagca 10801 attaacttag gcgtcgaatt gtcactagat 
ccagaaacag atgtccaacc cctcgactat 10861 caacagttga ccagagtttt cgcagaatca gcatccgagc aagttctgac ctatttgttg 10921 caagacacac tcaagccagc tacgtatagt acaaccaaag gacctcactt tgggttgtca 10981 ttaccagagt acgttcattt cactgctccc ttgcgacgtt acccagattt actgatgcaa 11041 agggtatttt atacgttact tgaacacgga cgcgatcgcc gcaccacccg tgtcaaagag 11101 cgtgttaacc tgcgccactc aaacagccac ggtgaaatca actggaacgt cctaccgcca 11161 gaattgcaac aagaactcca aagcgatttg acaagggtga ttattcagct caacgataga 11221 gaaaaagaag ttcaagaagc agaagctgat ttagcaggat tgcaaagagc ttcactgatg 11281 aaacagcgga ttggtgaggt tttcactggt gtgatcacag gtgtccagtc gtatgggttc 11341 tttgtagaaa ttgaagtttc accaactgtg tcaaactcag ttagcaatcc tcgcgtacct 11401 ctacgggtag aaggacttgt ccacgtcagt tctctcaaag acgattggta tgagtatcgc 11461 gccagacagc aggcgttgtt tggtcgtaaa aatcgtgcat cctatagact gggcgatcgc 11521 gtagccgtac aggttaagag tgttgattat tatcgccagc aaatcgattt agtcactgtt 11581 ggcagcgatg gattaccagt ttttagtaag gatgtcaacc tctcgaatgg agaagataca 11641 aatccttact tatcacatga ggatattgat cctgatgact tagacgcgta tcctgacgag 11701 gaataaaggg acgagtaccg ctttgcggaa ttcaaaatat gcccggagcc gctacgggca 11761 cgcactttac aaaattcaaa aacctttgtg ttctaggttt ttgatatttt gttaaatagt 11821 gtgttgattt ccgcactcga catatgagtt agtcgttagt gggtgactac aaaagttaac 11881 taataatcaa caattaacaa ctaactcaat aactaacaat tcataaatga caacatccct 11941 aatccccaat ctccggtaag tcccgctaga aaaatacgtg tcaactcata caaaaccact 12001 catcttaggc gtatcaggtg catctggtct gatttacgcg gttcgcgcgc tcaaatttct 12061 gctagaagcc gactatcgaa ttgaattagt tgcttccaag tcaacttaca tggtttggca 12121 atctgagcag aatattcgta tgccagtaga accagctcaa caagagcaat tttggcgaca 12181 gcaagctgga gtagaaggga tcggcaaact atgttgtcat tcttggagta atgttggggc 12241 aaacattgcc agtggttctt ttcgcaccct aggaatgatt gtgatgccat gcagtatggc 12301 tactgtaggg aagctagcag ctggtttaag ttccgactta ctcgaacgag cggcggatgt 12361 ccaactcaag gagggacgaa agttgattct cgttcctcgt gagactccct ttagcttaat 12421 tcaccttcgt aacttaacaa ccttagccga agccggagtt agaattgttc ctgccattcc 12481 tgcctggtat 
cacaatccca aaaccattga ggatttagtt gactttgtag ttgcccgtgc 12541 attggatcaa ctagatgttg attgtatacc cattcaacga tggcaaggtc gtcaagatta 12601 gtcaaaatag ggaatatggg gtaataaaga gaaagtcaac agtcaacagt gagccagcgc 12661 ggtcttgggg gtttccacgc caggtgctac aacgggggga accccaacgc caggtcccta 12721 cggagggaaa ccctcctgca ggactggctc cgcaacgcac tggctcccca tgagcgactg 12781 gcgaaccccg aaggggtcaa cagtcaagca ctttttgatt aatgaccaat gaccaatgac 12841 caatgactaa tgactattga ccaaccacta aacaactatg gctgtatttc gcttaatttt 12901 gttagtaaca gtgttaggag gactaacgct tttactggtg caaaactggt cacctgtcct 12961 cccactggta tttttaggaa tgaaatctaa agcattacca ctggcgattt ggattttgtt 13021 cagtactgct gcgggtgctt ttaccactct attcgtcaca agcttattta acttctctaa 13081 ctttttcgcg ggacaacaac gtcaaactcc gctaagatca cccacaactt cgacagcaag 13141 gagtcaaact cgtaaggaag aaccaacacc ccgtccttct cccccaccat caagtagtaa 13201 aactgagtca acacgaacta gtgatccact taatgattgg gaaacagacg acagcacaga 13261 tgattgggac tttgaagaaa agcaacaaca agcgcctaca ccgaattctc aaaacacaca 13321 ggttagggac tctaacactt acgaacgtca acaagaacca tcaagcagtt acaagtctga 13381 ttcagtttac tcctacagct accgcgaacc aaaaaattct ggagtgggga aaactgaatc 13441 cgtttacgat gctgattatc gagtgattat ccccccatta caaccaccaa cgaccaatca 13501 agctcaaacc aatcaagaat cggatgatga ttggggattt ctcgatgagg atattgaaga 13561 tcaggacaag cgcccccgtc gttaagtata ttaaaatatt gtttactaaa ttccacatca 13621 cccacgtgac tcctgcttga ctttcccggt catagcagct tcctgcaagc tcatgaaagc 13681 atttaaatct tccaggtcat accgggttgt tagcagctgc cgcagttgat tttctgcctc 13741 aacggtcaaa tagccagttg ctaaagcttt ttgtacaaca tctcgaatcc gaatcatggt 13801 tggaacctca aacgaccaca cttatggcac acttatttca tcaggttgtt ctcgataaaa 13861 tatgcaatgg ctttgtgcct caacggctta gttggcagat cttttttgta tctcttctga 13921 taaattgtca tcttttacac ctttaatcaa aatactgatt tgctgagtac aagttactgc 13981 aacttattcc taggacactg atatagtgtg tcgataaaca tggtgtcatt tcgtacaaac 14041 acttgtgtca tccagtagta gtcacgctca tttctgttga ccgacatagt caggataaag 14101 tttcaatcaa gatttcttaa atttgtatat aagcccaaag atacctaagt aacaataggc 
14161 aagtcaccta acaaatgtta ctaaaatagt tagtcttcta tggtgtattt aatttgactt 14221 gccagaaaag atagtatagt tatgatgttt tcttaggtaa cttcaaataa ataacggtag 14281 tgatggtgtc tgattttaat aaagtagaca aagaatttac agaagctaaa aatcattggg 14341 aggtagaaaa gttatacgtt gatttaggtt ctgcaaaagg caaatctctc acacctgtag 14401 aaaaaaaatt cttacgaggc ttactttgcg ggtatagtcc tgcagaaatt gctaacacag 14461 tttatcaaag tcgtagcagc agtactgtta gggtttatct ttccaatgga ttatataaat 14521 atatagaaga aatgttgagt taccaggtag gatacccagt tgaagtaaaa agctggagtc 14581 gggtgactca tttgctagaa caagcaggtt ataaaaaagc tttattccaa aaagagctag 14641 ccagtaacca agtcaagaca gaaaaaaatg aattcactat tataaaagca gcccaaacag 14701 aagattgggg tgaagcagtt gatactagtc ttttccacgg aaggacgaca gaactcgctg 14761 tgatgacaca atggattgtt gaggaagagt gtcggcttgt tgttctcttg gggatgggag 14821 gaattggaaa aacagctttg tcaattaagc aggcagaaaa aataaaggat aagtttgagt 14881 atgtgatttg gcgaagtctc caccttgctt cccctccaga agttattttg aaccaactca 14941 ttcaaacttt gtcaccaaca caacagacca tcggtgtaga agacctcaac agcagtattt 15001 cacaattgat tgattgtttg cgttcttcac gttgtctgat tgtattagat aattttgatt 15061 ccattttgtg cagcgaatat tctactagcg actcagatat atatccgact tcttcgacaa 15121 atcactcaac tatcaacaat tcctctgcat accatctccc tcaaatccgt tatcgcttag 15181 gatatgaagg ttatggagaa ttaattagac gagtgggtga ctctcaacat caaagctgtt 15241 taattgtgac aagtcgggaa aaacctcaag aagttgctgc tctcgaagga gataagttac 15301 ctgtacgttg tttaaaatta aagggtttaa gtcatactga aagtataaaa attctcaaag 15361 ataaaggatt tgataactct acagcagagg aatacaaatt attacttgat aggtatacag 15421 gtaatccctt atttattaaa ctagttgcta ctgctattca ggaattattt gcaggcaata 15481 tctatgattt tttagagcaa gacaccatag tgtttggaga cattcgagca attttagaca 15541 aacagtttaa tcgcttgtct catttagaaa agctcattat gtactggtta gctttcaatc 15601 aagatttagg ctcaatgcgt aaattgcaaa gagacattgt cccacgagta tcgcaaagac 15661 taatattgga ggcgatagag ttattgcaca ggcggtctct catagagtgg caagtatcta 15721 gcttttgtca aactccagtc ttaatggagt acgttgcaga acgattaatt gaggaaaact 15781 ttaaattaat gggagagaaa ccaagttcaa ttccgatgct 
ccagacaatt tttgaagtac 15841 aaataaaaaa ttatatccga gatttgcgtt taaacaacga gatttaaagt aaagtgagaa 15901 gtagtttgtc ttaatgctta agcgcaacta caaacaatca tgagtattga cttgttaaaa 15961 ggggtaaatc tgtacctaat tggcatgatg ggcgttggca aaacgactgt cggacgcttg 16021 ttagggcagc atttggacta cggatttgtt gatattgaca ctgtgattga gaaagctgct 16081 ggtggtaaat caataactga attatttgcg gaacttgggg aaccagcgtt tcgccagtta 16141 gaaagtcagg tactgtcaca agtgtgtgct ttcaccaagc ttgtcatcgc aacaggtgga 16201 ggtattgtcg tacagcaaca aaactggagt tacctacacc acggtttaat tgtctggctg 16261 gatgcatcag tggaaatact ttgcgctcga ctagcagaag atacaacaag accactactg 16321 caaaatgttg accgcaaagc aaagctgcaa tttatcctgg aacaaagaca acatctttac 16381 cgacaagcag atttgcgaat caccataaac gaaggagaaa cgccagaaga cattgccaca 16441 agaattatag agcaaattcc cagcgtcctc aaacctggag tattatctac tgaatgctaa 16501 gaatttttac tttgtaggta gataatgttt taactataat attaggactt tattgaccca 16561 gactaaaacc gcaatatcta tacaatagct tcccatgatg gtcagcgaaa ctgagtacat 16621 gataaagcag gatgccgcta gccgcgtaca agtgctaagc gaagcactgc cctacattca 16681 acaatttagt ggtcgaacca ttgtcgtcaa atacggtggc gcggcaatga aagatagtaa 16741 tcttaaagat aaagtcatct gtgacatcgt attcctatct tgtgttggtt tacgaccaat 16801 tgtggtgcat ggtggtggac cagaaattaa tagttggctg ggtaaactgg gaatcgaagc 16861 ccaatttaaa aatggtttgc gagtgactga cgcccccaca atggatgttg tggaaatggt 16921 gttagttggt cgagttaata aagaaatagt taccttgatt agtaaagctg gtggttcagc 16981 agtgggaatg tgcggcaaag atggtaactt aatcaaggct agacctcaag gtgaagaagg 17041 tatcggtttt gtgggagaag tttcctctgt tgataccaaa attttggaga cactagttaa 17101 taatggctat attcctgttg tgtctagcgt cgcagcagac gagacaggac aaccctataa 17161 cattaatgca gatacagtcg ctggagaaat agcagcagca atcggcgcgg aaaagttgat 17221 tttgctgacg gataccagag gaattctcaa agattataaa gacccttcta ccctcattca 17281 aagagtagac attcaagaag cccgcgagtt gattacaaca ggtgttgtca gtggtgggat 17341 gattcctaag gttaattgtt gtgtgcgatc gctcgcacaa ggagttcgtg cagctcatat 17401 cattgatggt cgcattccac acgcactgct actggaaatc ttcactgatg ttggtatcgg 17461 tacaatgctt gttggttccc 
agtttacaac ttagaatatt agttcaaatg ttgcatcatg 17521 caggcttgca caagtcatcg tacgcatctt gacgaaatta tttcagtggg gcggtaaaat 17581 ctataacaac ggtagctgag agttgataat gcttacgatt tcggttgtgc agcttgccta 17641 acaacttgtt cgtccaaacc taacgcttgg gctatctgct caatactcaa ccccaacgcc 17701 aataactggg gtatcgtttc caatttccct tcttgtttac cttctgcaaa aacctcttgg 17761 taaaatctag tttgttttaa ctcgcttaat ccgaacattt ttccgatctc ctccctgatc 17821 tagcgcggca atttgtagat taataatgct tacgatttcg gttgtgcagc ttgcctaaca 17881 acttgttcgt ccaaacctaa cgcttgggct atctgctcaa tactcaaccc caacgccaat 17941 aactggggta tcgtttccaa tttcccttct tgtttacctt ctgcaaaaac ctcttggtaa 18001 aatctagttt gttttaactc gcttaatccg aacatttttc cgatctcctc cctgatctag 18061 cgcggcaatt tgtatattaa tatcgtctct atcaattcta taatttccct ctgttgggcg 18121 gcatctctaa tctcttttcg ggcgctgttt actatctgta tagcttttgt tgtcgccgtt 18181 gattctggtt cgatgattag tttaactgta tcaatgccga ttgatgatgt ctctgttgag 18241 cttaattcat caagatagac gcgactcact ctggttgagt taagtaattc tgcgtatctt 18301 tctgtatctc cattgtcaat gctacggttg ggaaagatga tcacactacg ccaagtatta 18361 gttaattcag tttgattgag atagttaaag atttctgtga acaaacgtga gtagaatttt 18421 ttgtctggtt gaaattgcac ttccacaaaa tagatgggta gttctgctgt atttggaagg 18481 aaaacaccat caattctaaa tgccagttgt ttgacttcta agtcgtgaaa attggtaagc 18541 tttagctaat tgtggtggtt ggttaatcag ttcaaagaaa gtgctaggga tgctctggaa 18601 tatacgataa aagatactgt ctgttttcaa ggctggtttt tcaggggaat gagtttgttt 18661 tagcgcgaaa ctataaaaaa tcgccattgc ttaatagcag tatgaacgac tcttgtacct 18721 ttttttaaaa acttagatat ataaaacttc tctgtgcgtc tgtgggtcat gaaatcaaac 18781 tctcattcag caacgccaac aaatcacagt gtttaacttg atgaacgcag attcggtatt 18841 acgttgttgg atgagattaa gagcgatctg cgccaatgcg cagcgctagc tcgcctctgg 18901 gagccgctac gcgtatctcc tgcggagacg ctgcgctaac gccctctggg cgtgcgcttg 18961 cgcttacgct atcgcacttc cccccaccac tacgatcaac accattgttg attcccgaac 19021 caatcgtcag catatgcata cagatggtta ttagtcaaga ttaacttaac aattattgag 19081 ataagagact gacatattga gaagcaaagc atttagcaag agttgttcat tcagcagcag 19141 
gatgatggaa aatttaggca gttggacaac tggaatttaa tcaattccaa aaaggttgtc 19201 gtctttttga cgaaaaacaa tttttgattt ttgtttttag attttatgag tctgtctgat 19261 tttcccaatc tctgggtacc tttggcgctt ttagggctaa ttgtcattgc tgctatcttt 19321 ttcagccgca gcagctaata tcgtgttgct gacatactaa aaaagtcatg gtgaggtagg 19381 agagtccggg aggaaggata actgctagtt tttgactaag tactaataac taatgagcaa 19441 agaaataact aatcactcct attggaggtt aaaataaatc ccgattgtct cccaaaattt 19501 taaaagcttc atgatctcca gcaacgactt tcgacccggt gtttcgatag tattagatgg 19561 atccgtatgg cgagtggtag aattcctcca cgttaagcca gggaaaggtt ctgcttttgt 19621 gcgaacgaaa ctgaaaaatg ttcaaagtgg gaacgtgatg gaaaaaacgt tccgcgcagg 19681 tgaaacagtt ccgcaagcta acctggaaaa aagcacaatg cagcatacct ataaagaagg 19741 tgacgaatac gtctttatgg atatggaaac atatgaagaa ggcagattga gcgcctcaca 19801 gattggcgat cgcgtaaaat acctcaaaga aggtatggaa gccaatgttg ttcgctgggg 19861 cgaccaagtc ctagaagtcg aactgcctaa ctccgtagta ttggagattg tgcaaacaga 19921 tccaggtgtt aaaggtgaca ctgcgactgg tggttctaag cctgcaactt tggaaacggg 19981 agcaacggtg atggttcctt tgtttatttc tcaaggagaa cgcatccgta tagacacccg 20041 taacgataca taccttggca gggagtaact ttcttctcac ctctagatgc aaatcctgat 20101 tttcatctat gacagggatt tacatccctg cctgactata caaattcatt tcaaaagata 20161 aagtctatct acgacctaat tgcatgaggt aaggaaagct gtgccattgg actttaatga 20221 aatccgccaa ctgttgacaa ccattgcaca aaccgacatt gcagaagtca cgctcaaaag 20281 tgacgatttt gaactcacag ttcgtaaggc tgtaagcgtt tctccaatgt tgtcagcacc 20341 acctcaagcg gcgttagggg cggttggtac accgaacaca ccagttccac cgcctattcc 20401 cttagtgatg tcgcctcaaa cagtaacggt cacctctcca aatcgcccca ctgacagcaa 20461 tactcttggc ttacagtcac caccaactgg ttcgtctgtc atagaacaga agtttgtgga 20521 aatcccttcc ccaatggtag gaacgtttta tcgttcgccc agtccaggcg aagcaccatt 20581 tgtgcaagtg ggcgatcgca ttaggagcgg tcaaacagta tgtatcatag aagccatgaa 20641 actgatgaat gaaatcgaag ctgaagtatc cggacaagtc atagaaattc tcgtacaaaa 20701 cggtcaacca gtagaatatg gtcaaccttt gatgcgaatt aacccagatt aagtattaat 20761 ctatatatcg atgcctagtt gttaaatttt taatgagtcc ttacgggtct 
gccctgatga 20821 aacaagcgtt gcctgtacca ccagaagtcg tgcaacaagt ggctgaatac ttcagcctct 20881 tgagtgaacc catgcgccta cggctgttgc atttgctgcg agatgaagaa aaatgcgtgc 20941 aagaattggt agaggcaaca cagacttctc aggctaatgt atcaaaacat ttgaaggtga 21001 tgtggcaagc aggaatcctt agtcgtcgca gtgaaggaac ctgtgcctat taccgagttg 21061 aagatgagat gatttttgaa ttgtgtaaca gggtttgcga tcgactagcc ttcaggttag 21121 aacagcaagc ccgtaatttt cgcatcttaa ataccaagta attcttgaga agaactcaga 21181 atgcaaaacg cagaacacaa aataacccca cggcttaatc catgcttttt cagaattaaa 21241 ctgaacttag cctggtgcaa atgtactaaa caaatcgaat agacttatgt cttcgtcttg 21301 actttatatc tggaacacct caaacagaaa gtttgttgta tagcacaaaa caaactttgt 21361 gtaaaaccgt agatcctaga tgttactcta gatggttagc aggcaaagtg atgcaaaatc 21421 aaaccccaat tggacaaata tctgtgcaaa agcagttccc cttaacgact gtaggggcac 21481 ttgctgttaa tcctcatgga caagtcttaa ttgtcaaaac gactaagtgg cggggcacat 21541 ggggtgtacc gggaggtaaa gtggaatggg gcgaaacgct agaagctgca gtcaaaagag 21601 agtttcgaga agaagtcggt ttagagttga ctgatgtacg ctttgggttg ctacaagaag 21661 cggtgctaga ttcacagttt gtgcgggaag cccatttcat tatggtcaat tactatgcat 21721 tctctgctag cgaaaccatc acacccaacg aagaaattga ggaatgggca tgggtgactc 21781 cccaacgggc aacagagtat cccctcaata cctacacccg cgtattgatt tcggactatc 21841 tgcaaaagca aatcgacaaa ttatataaca ataaagttca agaacaatgc taaaaaaagc 21901 actcgtgact gggtcgtcgg gaggaattgg acgggcgatc gccctcaagc tagccagtca 21961 aggatttgat gtcgcatttc actataaccg gagtggagaa gcagcaagca aagctagtca 22021 ggaagcagcg actcatggag tcaaagcgat cgcccttcaa gcagatgtca ccaatcccga 22081 tcaagctaag tccctagtag aacgtacagc cgagaacctg ggaggcttat cagtggtggt 22141 taatactgtg ggtaactact tgggaaaacc taccagccaa acatccatag aggagtggca 22201 cgaagttatt gattccaacc tcaattccac tttctacata actcaagcag cacttttcta 22261 tctgaaggca gctaattggg gacggattgt caacttcgct tgcgctagtg cccagaatgt 22321 ggtagcacgt cggacgaata cagcttatgt aattgctaaa actggtgtca tcatctacac 22381 gaagtcctta gctcaagagc tgattaagga caacattact gccaatgtgg tatccccggg 22441 aatagctgag aattcttttg acgtagagga 
gatgatccca aaactacctg ccaaacgcgc 22501 agcaaccttg gaagaaataa gcaatgctgt ttggtttttt atcagtcccg atgctgatta 22561 catcacagga caaatattgg aagtatcggg gggctggagt ttgtagtctc gcgctattgc 22621 catagtatag atgagtgcag gtcgttagac aggctggtct ccgataattc ctattttttg 22681 tcaacggttt ttaaatatgc ttaatcttga agacttatac gcaattcatg aacactacaa 22741 agatgcctta aatgcaaaag tagaagagtt cacaattgga aacaagaatt tcaatttcaa 22801 ttctaaaaaa gctattttag gagtcattaa tctttctagt gattcttggt atcgagaaag 22861 tgtatgttta agttcagaac aagctatccg gcgtggaatt gtcttaaatg ctcaaggtgc 22921 tgacatcgta gatattggcg ctgagtcaac tttagaaaaa gcagagagag ttgaagacgt 22981 tagacaaaaa agtcaaattc tgcctgttct tgaagctttg aatcaagaag gtgttttaac 23041 ttcaattgaa acttattatc ctgaagttgc tagagaatgt ttgcgggttg gggcaaatgt 23101 catcaactta acaggaccag aaaaaagtga agaaatttac caagctgtct ctgagtttga 23161 tgccggggtc attatctgtt atgttcaagg aaaaaacgtt agagaagttg gtgactttga 23221 ttttggagat gacccaacag atttaatgta cgattatttc gccaaggaaa tagaaatagc 23281 aactaaatat ggcatcagaa aaatttttat tgatccaggg ttagggtttt attacaaaaa 23341 tctccaagac agttctcttc gtattcgtta tcaaatgaga acgtttttaa atacttttcg 23401 actcaggaaa ttaggttttc cagtttgcca tgccctaccc catgcttttg aatgttttgg 23461 cgaagaagtc agaagtgctg aacctttttt tgctgttctt gctgcattag gaaaaacaga 23521 tctgtttaga actcatgaag tccctaaaac taaagctgtt ttagacactc ttagtttctt 23581 ttaaagtgtt gggattaaga gggaactctt aacagggaac aaggaacaag gaacaaggaa 23641 caaggaatag ggaaatcccc ataattaaaa cgcgcagcgc cgcgacttga gagtggcggt 23701 ctagtgctag tatcctttat ctatggcgtt agccataggg acttggtcag aaatatttca 23761 tcaccagttc tagatactct cagtttcttt taaagttttg gtagtcatgt cctcaatttt 23821 tgaacaattc aaaagtcgat atcacgagtc gcgtatacag tacatggcta caggagaatc 23881 cccaacctgt tctgtgattt gttctcaagc acctgagtgg aacagttcta aattaatttt 23941 tccatttctg atcgacaact ggaagcattt ttatgaaatc gaattcccac tcgaatttaa 24001 aaatacagat aaactccatc catctgtctt gattgggatt gcaatggctg aaatttatcg 24061 cctttgtgaa atctgtaccc ctgaaattgt gcaagtctgt tggtatagtt gtctggcgtg 24121 ggaatcagat 
tggtggaatg aggacattta ctggtatctc caacaaaagt tttatttgga 24181 gaaatggggt tgggataaaa tgcctcgaat tgagttttct caaagtgcca tagagataga 24241 gaataattgg ttaccatcag ttaacgatac ttacctcctt gcagtcagtg gaggtaaaga 24301 aagtaccttt gggttcgagt ggatgcagca agctaacctt cctatggaag ccttcactct 24361 acatcatgcc ggaggcatac tagggaacaa ttggcaagaa aagtttccag tatttgatta 24421 catacgaaat cgaacccgcc tttgggaaat actaacccat ccaagggaag atccagctga 24481 acattttggt tataagggcg ttcgcaacga tccaacaatt actaatgcgc tgtttctgat 24541 gatggtaatt gcagcacagc agggtcatcg gttcctcgtc ttggcaaatg ataagagttc 24601 caatgaatca aatgccacat atgaaggacg cgaagtcaat catcagagtg ccaagggaac 24661 tgcttacata gaacgtttta acagtttttt ggaacgcaag ggtttgccat ttcgctatgt 24721 cagcatttgt gaagaagtgt actcaattgc cacagttcac caattatccc tttgggataa 24781 aagtatcctg aatgtcctta cctcatgtaa cgaggcgcag tgggcaccag gatcgtgtcg 24841 atggtgttgc caatgtccca aatgtgcttt ttcatacgct ttaatcgaag cagcaacaga 24901 ttaccgtttt gctgtgcaag ttgttggtaa agacttattc aacctaacga cacttgagga 24961 agtttggaga cgtgctacta agtagccata ttgctcaaca attcgttcat ttctctacat 25021 ttcgctccca gtcttctacc cactcttcta gcttttctct caataacgac tgccatcctg 25081 gtatcgcttt tattctttct ctcaacccaa atttaacttt aagctgaaaa ggaatcttat 25141 ctaaagcttc atctccttgt ggtctaaact ttgggaaagg catagtgtat tgcatgtgta 25201 gctggtatac tagtagtgta acgaaacggc agttcaatgt attcactaaa actagagtta 25261 aaacttaata acaaggaaaa atctagacta gctggatgtg cggggtttgc tcgttttgtt 25321 tacaacttcg ggttgtcgat gcttacaagc tcctgggatt ttgaggaaat taaagcaagt 25381 gattccaagc gcttaacagc gattgagaag gtttttacca atcacgtgaa aaccaatcca 25441 gaatttgctt ggatgcaaca atatccatca gcaatatatt cttctgcttt acgcaatttg 25501 gcaaaagctg ttgaacgatg gcgcaaggga gagtctggat ttcctcaaat gaaatctaga 25561 aagcgagggg atagttttac cgttctcaag aagtcggggg tttatccggc tattggggag 25621 cctatgctgc cttttacgaa taggcaggta cttcagttag ggaaacgaat aacaatacct 25681 gggcttggtg attttcggct taaacaacca atcccgttct tgtgctctag tcaatcattt 25741 actatttcca gaacagccga caagtggttt gtgagcttta gcctagatgt tgaaaaaatc 
25801 cctcccctaa ttcatgaggt tgaatcggtt ggcgtagatt taggcgtcaa aacttttgca 25861 actttgtctg acggtaccac gatagttgcg cctagtagcc ttaaaaaagc gaaaaccaag 25921 ctgaacaagc tacagtggcg caatcgtaga aaacgtgcga ttcgttgatt gctgaatcac 25981 ggtaa // LOCUS NODE_1189_length_25403_cov_5.01187525403 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 25403) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 25403) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..25403 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 565..774 /locus_tag="DP116_10235" CDS 565..774 /locus_tag="DP116_10235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862633.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_10235" /translation="MEISARNTFKGTVKEIVTGSVNDEITLEIAPGVEVTAVITKTSA ESLGLKEGKEAYAIIKASDVMVSVD" gene complement(1150..1668) /locus_tag="DP116_10240" CDS complement(1150..1668) /locus_tag="DP116_10240" /inference="COORDINATES: protein motif:HMM:NF033545.0" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10240" /translation="MKTLTGKVITAPGIKPTVEVKWNRENFWVYGAIEPLTGDHFLHE YPQLNGDYFQEFLNWLSHELGSDYAILQIDQAPAHISSAIRWPKNVIPLLQPPHSPEF NPIERLWQFLKRSPKNELFSDLQALRDRLQEMFDQLTLQQVMSVSSYNFILEALFYAA SHYSREQGRLNA" gene 1833..2216 /locus_tag="DP116_10245" CDS 1833..2216 /locus_tag="DP116_10245" /inference="COORDINATES: protein motif:HMM:PF08881.8" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10245" /translation="MNKKTLSILCFVSMAAACFNPLGKEALAGNFSRSCGDISLDGTT LRANCKDRSANFRSTSLDLNRRIENRDGNLRIGGGFASTCQNIQLTGTSLQADCKTFN GDFRSTSLDLNSVITNNDGNLTFDR" gene 2637..2936 /locus_tag="DP116_10250" CDS 2637..2936 /locus_tag="DP116_10250" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10250" /translation="MLKQGRQWADLSIARTTPALLGLFSLVILVAHHLQNSQKFSIRQ AAWYAKPLPTFVDAITLVRQSLWSSTFSMSHSPSDMVKIPRALLERLTDTLGYAA" gene complement(3195..3857) /locus_tag="DP116_10255" CDS complement(3195..3857) /locus_tag="DP116_10255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316359.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10255" /translation="MRFTSRIAIVSLTLSILGFWQATLAEPGNRASWEENTCSAGLSQ TLSLGGEVKNRTTFDLQKLIDLQEDLKKQDPTVVTEVTVSFQTGSGPRTETYYGVPLW ELINNEKAGGGLQPANSGQNTKNAFLRQYVLAEATDCYEAIVSIGEIHPNFEAKQVLV AYAKKASDGSIQYLTDTDEGFARLVVPGDKAGGRYVSNVRNILVLSAPASPLKPKYFR QP" gene complement(3906..4896) /locus_tag="DP116_10260" /pseudo CDS complement(3906..4896) /locus_tag="DP116_10260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320746.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="molybdenum ABC transporter substrate-binding protein" assembly_gap 4049..4058 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 5438..6355 /locus_tag="DP116_10265" CDS 5438..6355 /locus_tag="DP116_10265" /inference="COORDINATES: protein motif:HMM:PF04313.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I restriction endonuclease subunit R" /protein_id="PRJNA477356:DP116_10265" /translation="MNSALSRIATMKASRLTTRLRRRFDIKPSRTAPDLLTLFEVIGY TVLLESDINIGRHKAQRSSNVMVVLCTRLRNALKQINPKVPCEVIETVISGLTSTHSS DLLENNRRFHKLLTEGVDVVYQNGNQIVHDKVWLIDWFNLLSNDWLVIHPFTIVQGYH SHCLDVVVFINGLPLAVIVLMDSKHEKAKLSEAYQRLKTYQQQIPMLFSYNAFIALGV GNSARVGTLTSGWKEFLPWRSIEGEDFPYQGETELEILIQGIFDKRRFLELVKHFIVF EETGTSISKTLLRHPFCTTQNPKIRSRMI" gene 6352..6726 /locus_tag="DP116_10270" CDS 6352..6726 /locus_tag="DP116_10270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181019.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10270" /translation="MNTNTSWYEQELVKEFTTIVNHSFPQVGELLNQCYLKVIQSFWG QHNVHFLPYIAIYCSKNTIAAVKAEIYVFREVAFFLGLSKVVCLNATCLLHDPKSKLP QENPHLWLELQWIVTQEKGSVS" gene 6723..7730 /locus_tag="DP116_10275" CDS 6723..7730 /locus_tag="DP116_10275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949633.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LLM class flavin-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_10275" /translation="MKTGLFCNYENYHSDARRAILEQIALVKHAESLGFEEAWVTEHH FSDFSVSPSILLLIAHLAGVTKTIRLGSAAVLLAFHDPILVAEDITTLDNLCNGRLAI GIAKGGPFPEQNKHFNTPMSESRAKTLEAMMLIHKLLYHTDVSFLGKYYQCDRVSIYP KPLQKQIPVYVATSDEEAIGFAALNSFGLMGGAPFTLDRLKSNVTKYRAINSSGSDKF MVARFFFVARTYDEAVSEALPFIRTFSQRMKALSAKAQKYGNNSQHLQANDGQKSAFD EDELLENSIIGDVVTCRDKIKRFQDELNLGTLALKPASLNLQKNFESLTLYNQEVQGY V" gene complement(7940..9163) /locus_tag="DP116_10280" CDS complement(7940..9163) /locus_tag="DP116_10280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="restriction endonuclease" /protein_id="PRJNA477356:DP116_10280" /translation="MSVTSPQIIELTEYVPKLLGRLQLPDVVGEMLWRDYETQVSVDF PSPKTGDRWRLTNQGCVGHIPLTPDFHIILRPKVKLDNLLRMLEYAYQLKSFRFLSGL VDCQTLLEFYQRLADILARRILNRGRQGFYCAYIPKTEHLPYVRGRVDVRQMITRPWD TKIQCYYEEHTADVEENQILAWTLWCIARSGLCTERVIPTVRRAYHLLQGLVTLQPCH PRTCVGRQYNRLNEDYRPLHALCRFFLQQTAPSHETGANATLPFLVNMSQLYEHFVAE WLKAHRETALLTQALDIQSQERVYLGQGQGLYLDIDLVLYDQATGIARYVLDTKYKAA SKPATADIYQMVAYAEAKGCQEAILIYPTPLSEPLNIKVGSIRIRSLTFSLAGNLEQA GYCFLQDLLGIGNRE" gene complement(9160..10794) /locus_tag="DP116_10285" CDS complement(9160..10794) /locus_tag="DP116_10285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655431.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_10285" /translation="MSKTKFNKDKKVLLLEKKHEFIRLFEEFISSYPYTPAGLRHQKA YHEQREKGRGNFQAIASNYESTENIADVIVLQQLLPYPPTANNSQKNFWIHHASTTDK DIKRWFEDAVWTKLQDWSNIAQAIFHFVRHCYQNPTQLREACQDFSKLPSTKSFQIGM LTPILNALRPDDFLLITNTSRQVINYLTGKSYTQKLTEYPAINATGQKLIEELAPEMR QTGVPSMRDDDLFDMFCHWLVAVKKYDNACKERPPIDEVSELSSDVEAVEMQPEYSLS QCALDTGIEQETLLNWLKAIERKKQAIFYGPPGTGKTYIAKKLAKYLTGGSDGFVDLV QFHPAYTYEDFVQGIRPQRIDGELDYPLVNGRFLDFCYKACCRQDICVLIIDEINRAN VARVFGELMYLLEYRDEKIPLAAGEVFSIPANVRIIGTMNTADRSIALVDHALRRRFA FIPVYPNYEILRRRHLNTGFPVQKLIQTLIKLNQAISNPHNEVGISFFLRADLADQIE NIWQLEIEPYLEEYFFDQPSKVEQFRWNEVKKQIYP" gene complement(10899..12047) /locus_tag="DP116_10290" CDS complement(10899..12047) /locus_tag="DP116_10290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314318.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine hydrolase" /protein_id="PRJNA477356:DP116_10290" /translation="MFSAFVRRFWLSFAAVILLTLMLLPLKAQEPSLSPLSSSITESS SQQLEQQIHQTSSLADGMMGVSPELAIAQSNTTLFQERLKNLDISAAQGRVGIGVLDL NNGQSWFLNGKQRFPMQSVYKLPSAIAILKQLDEGKISFKQLVTIMRRDLAPGSSPII KEFKGDRVQLPLRNVLERSVGMSDNTAADALVRVLGGPKQVNAILNKLQIRNVRVDRL EQQLQPDCVGLKNFQPELADEQKWAEAVQNIPDRVKKAALEKYLRDERDTATPEGMVD LLARLNSNKLLSQNSTTLLLKMMTDSPTGQKRLKAGLPNNWSIAHKTGTGPDVLGIGT ATNDVGIASSPDGKRVAIAVFIAGSKAPLEVREKVMSEIASAVIQAIQ" gene complement(12205..12684) /locus_tag="DP116_10295" CDS complement(12205..12684) /locus_tag="DP116_10295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10295" /translation="MPCNILLLSIKPKYAHKIFEERTKQVELRRVRTRLNKGDLVFVY VSSPTKSFLGFFEVDFVIEKEATTDELKHFWKEVKDHAGINYQDFYKYYEGATVAVGI FLRNVKKFENPIDLYRLREKLSYLRPPQSYRYLNEREYKIIMSLGGENPAAITNQSE" gene complement(12704..14827) /locus_tag="DP116_10300" CDS complement(12704..14827) /locus_tag="DP116_10300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015173919.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_10300" /translation="MTTNAQIKIEAIDNHSPYLATVIKLWRANSKTLGNLPKGAFEER ATHRQILVALDSEAGCIGYLLYRRSYDWFTITHLCIDPSCRGKGVAKQLVDRLKQITT SSRGIKLSCRRDYNLQGMWSSFGFIACEDLEGRSKKKKTILTRWVLEHNPLPLFSRII EQQLESKLCVMIAPDIFFDLHKDENFDTEEPFFLLADWIQTELTLCINDEIFNQINKI EKTHERNSLRSFAETFPCLPCDNQKMEINQILLQNFLKKYQVEFNEYKLRYIARAITS DSHLLLTKDEELLDLGNRIHESFILSVIHPDELINQLDELRHKPDYQPVRLSGTCLEQ TRVKIGEEDFLVNYFQNSQRDEYQTEFQQHLRRFLAESDKFECVLVREEVNQPLALIV YGRQKKDELEILMLRVGNNKLSSTLAHHLIFKSICLSAREQRQFTRITDPYLEEAVLT AIQEDAFVQVKNGWLKANLPVVKSSSEFSEYLTHLVSHLGDEYNFLMVIAKNLEEHII KDAQALVDIERFLFPAKISDSGIPTFIIPIQPIWAKDLFDEHLANQMLPFPDFGAKPE LSFNREAVYYRSVKNSRGLNAPCRILWYVSETQGERKGKGFYDVGYIRACSYVDEVII GKPKELYRRFQRLGVYKLSHVEKIPTDKNGDIMAIRFSDTELFKNPVSLQELHRILEQ DKLLLACPYKLPENGFMRVYNLGIR" gene complement(15177..15959) /locus_tag="DP116_10305" CDS complement(15177..15959) /locus_tag="DP116_10305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317474.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glucosamine-6-phosphate deaminase" /protein_id="PRJNA477356:DP116_10305" /translation="MSTAKNSFRVDALQVQVYNSEVELAQDVAGIVQKHLQHTLQQKD TAALLLATGNSQMKFLDALIALSGVDWSRITCFHLDEYLGISADNSASFRRYLRERVE MRVSPKEFHYIEGDAMQPLAECDRYTKLLQAQPIDLCCLGIGENGHLAFNDPAVADFN DPHTLKLVKLDTVNRQQQVNTGYFSSLESVPQYAFTVTLPMICSAKKIICLAPGKRKA KIVKQILEGDITADCPASILRTQQQATLFLDVYSTGLLKSEE" gene 16369..17052 /locus_tag="DP116_10310" CDS 16369..17052 /locus_tag="DP116_10310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HAD family phosphatase" /protein_id="PRJNA477356:DP116_10310" /translation="MLAAILFDLDGTIANTDPIHYQAWREMLMGYDMDIDETFYKSRI SGRTNPQIIEDLLPQLSPEEGAKFADEKEALFRQKAKTILKPLSGFSELIAWTDAHQL KRALVTNAPRLNVQFVLEVLEIKEVFHTVVIAENEIAAKPDPAPYQVSLNRFGITAEQ AIALEDSPSGIRSAVGAGIRTIGVTTTQESKVLLSLGAFMTVPDFTDLQLWTLLNSSV QEDVACLDL" gene 18200..18772 /locus_tag="DP116_10315" CDS 18200..18772 /locus_tag="DP116_10315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950441.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="helix-turn-helix domain-containing protein" /protein_id="PRJNA477356:DP116_10315" /translation="MIFEISDFYESKFLTPCQRQVLLKNLQANLQPEYRRRIEIMLLA DMGKSQTQICKILGCSQEMARYWITVAQLGLADKWQERPIGRPKIVNDQYIQRLKELF SHSPRKYGYAFSSWTSQWLSKHLATEFGIEISDRHINRLLKQMGLSTQQKRSSKKQAT KDTKETGIRICDLQSHSEPSFHWLFNHHAN" gene 18802..21880 /locus_tag="DP116_10320" /pseudo CDS 18802..21880 /locus_tag="DP116_10320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314975.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" assembly_gap 19666..19675 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 21985..23520 /locus_tag="DP116_10325" CDS 21985..23520 /locus_tag="DP116_10325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_10325" /translation="MAYPSNNSSSPFTPTDDQQLSTPPNVVEENNNVAVAKDWFYGTE ELLDALPRLWTRSLLYLLIGFSAIALPWAMLSQVDETGTARGRIEPLGATQRLDSQVT ASITAVRVKEGEQVRAGQLLVELQSDVMQTDLKQAQAKLEGLINRQAQLELIKNQLLL AIRVQEQQNQSQESEKIAQVNQAKQNLDAKQSTYNLQKLEKLALVDQAKQQINSTQND QKSAQSRLSIDSKQVKRFSKLVKDGAVSANQIDQLRKEEEESKRLNQKTQSDIKQAQL RFQEEQNRYQATMRQAQVDIQVAKLRLQESQSSYQSIIQAGKLAVFKNQEQLKDLHTQ IVTLKSEIAQTKNQIDSLKLQLEQRVMRSPVDGVIFDLPIKKPGVVVQPGQIIAQIAP KQTPFVLKANMPSQQSGFLKLGMPVKIKFDAYPFQDYGVVPGRVIRISPDSKIQETPQ GKIETFELEISLNQPDIQSGNKHIPLTPGQTATAEVIVRQRRVIDFILDPFKKLQKGG LEL" gene 23828..24610 /locus_tag="DP116_10330" CDS 23828..24610 /locus_tag="DP116_10330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748590.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_10330" /translation="MLKVLNVSGKEMLEQLKLSCQIPGLLDAIATRKIIFDAARTAGI KVEVQELQQSADSLRTANNLLKAEDTWAWLQKHHLSLEEFEQLAEINLLSAKLANHLF ADKVESFFYEHQLDYLAAVTYEVILDDEDLAWELFYALTEGEMSFQDMTRQHIQNPEL RRTGGYRGIRPRKDFKPDIAAALFAANPPQLLKPIVTTQGIHLLRVEEIIRPELDQQL RLKIMSDFFSTWLKEQMAQIEVIPHFESDSYSPPVQELLKLA" gene complement(24769..24975) /locus_tag="DP116_10335" CDS complement(24769..24975) /locus_tag="DP116_10335" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10335" /translation="MAKITIFDINSTSCQILSESETFLNELSETESASVMGGNPFSIF ANTLLKAFAAYLIYLIVTDSSFKK" gene complement(25093..25305) /locus_tag="DP116_10340" CDS complement(25093..25305) /locus_tag="DP116_10340" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10340" /translation="MSKITVSDLAFDSKSFINEVAETESSSVNGGGDALTFASSYGGF VIEISKILQNSFLKGTAILAIAELAK" BASE COUNT 7628 a 5243 c 5151 g 7361 t 20 others ORIGIN 1 gcatggaact ggtgctgcgt gaaaagcgct gtcgtcattt atgtgaactc tttgagcaaa 61 tcgatccaga agccgaaccg gaaaaatggg aatcttattg taaggttttc taccaagaaa 121 aacagcaact tcaggaatta gagcgtcagc ggcaattttc ccttgctgaa ttgctgtaac 181 acaagtttca aaggggttgc atggtggtag gggtgtaggg gaggacagtg ccgtgggcgg 241 gtttcccgac ttgaggcacc tgtccgttga gccagtgccc ttataccctt accccctagt 301 ttttgtaacc ctagtaagtg cctatttgat ggataaacaa ctcatctacc tggtaaaatt 361 atcagattat ttagtttgat gtagaaccaa aagaaagttg attgcagata taatgtaaga 421 gattttcata acacaattaa tgacacagta atcaaggggc taatcaacag tacaacttaa 481 gttaaatttg caagcagcaa caagccaaga caagacaaga tagaatgaaa tatgagtgaa 541 tgaaaaaata gggagcataa gaatatggaa attagtgctc gtaacacttt caaaggtact 601 gtgaaagaaa ttgtcaccgg atctgttaac gacgagataa ccttagaaat cgcaccagga 661 gtagaagtaa ctgcggttat cacaaaaact tcagcagaaa gcctaggact caaagaagga 721 aaagaagctt acgccatcat caaagcatca gatgtgatgg tttctgttga ttagacgact 781 cttttctacc taagtctggc ttgactactg atatgatgtc tgcttaaata ccctgaatat 841 aatctcatac aaaaacgtac cagacgccaa gaaacagaga gattttcatt atgggttatt 901 tagcggacat catattaaat agaaccacta ataacgtaag ttgctagtta gtgcctataa 961 acggtatcat caagatacaa aaacgctttt taaagtattg caaagtacta taatttgaga 1021 ctttgtactt aaaattaggc actttttgat aactagcaac ttgggttaat accaatttct 1081 agcagccgcc aggtagatta ggacacgaac taatgataaa acacggtcac caagagactt 1141 tcaagtctgt taagcgttaa gccttccctg ttccctgcta taatgtgaag ctgcatagaa 1201 aagggcttct aaaataaagt tataagaaga aacagacatc acttgttgaa gcgtgagttg 1261 gtcaaacatc tcttgaaggc gatctcgtaa agcttgcaag tcagaaaaaa gttcattttt 1321 gggcgatcgt ttgaggaact gccaaaggcg ctcaatcggg ttaaattcag gagagtgagg 1381 tggctgaagt agaggaatca cgttcttagg ccaacgaatt gctgaactga tatgagcagg 1441 agcttggtca atctgtaaaa tggcataatc agagcctaac tcgtgggata accagttcaa 
1501 gaactcttga aaatagtccc cattcagttg agggtactcg tgaaggaaat ggtctcctgt 1561 caggggttca attgcgccgt acacccaaaa attctcccgg ttccatttca cttcgacggt 1621 tggttttatc ccaggagcag tgatcacttt gcctgtcagg gtcttcatcc cgacccgagt 1681 ttcgtcttgg cacagatagc ggacgcgctg ctacttcctg gcgaatactg gcattttttt 1741 tcatctctat cccaacaact gcataagttg atagtactca cacctatata taaatacagt 1801 agtctaataa aaaattaggg agtgtcaaaa taatgaacaa aaaaacatta tcaatattat 1861 gttttgtctc tatggcagct gcatgtttta accctttagg gaaagaagct ttagcaggaa 1921 atttttctcg tagctgtgga gatatttcac tggatggtac aactttgagg gcaaattgta 1981 aagaccgtag tgctaatttc cgtagtactt ctttggattt gaatagaaga atagaaaata 2041 gagatgggaa tttaagaata ggtggaggtt ttgctagtac ttgccaaaat atacaactga 2101 caggaacttc attacaagca gattgtaaaa cgtttaatgg cgactttaga tcgacaagct 2161 tggatttaaa tagtgtcatt actaataatg atggaaactt aacatttgat cgttagtttc 2221 ttcgccgtaa gccccgcact ataacggtac tccgtttagt gcgggaagta gtaaagcacg 2281 tcgcaaactt gccaagcgcc aggcgcgtga acatcaacgt attgccaaag caaggaaaga 2341 ccacgcattc aaaaccgctc atgagttggt tcgcactgga aagaaagtct ttgtccaggg 2401 tgtggtcaaa agtcgttggt aaagggaaaa gttgacaggt cgtggtttca cataaagaga 2461 aatagccaaa acatggaatt agctaacgga aaccgtcagc taattccatg ttagtgctcg 2521 acaaaatcca tcttttctat gtaccaatcc tgcgattgct ccacttcaaa tccttgattg 2581 gtttgtacgt cgttagcaaa tagaagtcac ctttcaggaa gtacgcgcac atttaggtgt 2641 tgaaacaagg gcggcaatgg gctgatttgt caatcgctcg aactacacca gccttactag 2701 gattgttctc actggtgatc ctagtggcac accatctaca aaattcacag aagttttcga 2761 tccgacaggc agcttggtat gccaagccct taccgacatt tgttgatgcg atcactcttg 2821 tccgtcagtc tctgtggtca agtacttttt caatgtcgca ttcaccaagc gatatggtaa 2881 aaattcctcg tgctttgttg gaacggttga ccgataccct aggctacgct gcttaaatgg 2941 attttgtcga gcttagaact ttattttaga agcccttttc tatgcagctt cacattaaat 3001 tggtataagt aagtcggcac aataaaacca atgtatgttg agttttgtaa aaactgtgag 3061 attacctatt tctaagcagt ttactgattt tacatttcgt tacataagtg gattttttaa 3121 cgccgactta cttaaattct ggagttcttt tgatttagat aaagtctgct tttattcaga 3181 
gaaatcaaag tggattaagg ctgccggaaa tatttgggtt ttaaagggga agcaggagca 3241 gagagtacta agatatttct aacgttgcta acgtaacgtc ctcctgcttt atcaccagga 3301 accactagac gagcaaatcc ctcatctgta tctgtcaaat attggatgct accatcgctt 3361 gctttttttg catatgcaac tagcacttgt ttggcttcaa agttggggtg aatttcacca 3421 atacttacga tcgcctcata gcagtcagtg gcttcagcaa gaacgtactg tcggaggaaa 3481 gcgtttttcg tattttgacc ggaattagca ggttgaagtc ctcctcctgc tttctcatta 3541 ttaattaact cccacagagg gactccatag tatgtttctg ttctcggacc ggaacctgtt 3601 tggaaagaca cagtcacttc ggtcacgact gttgggtctt gtttctttaa atcttcttgc 3661 aaatctatga gcttttgcaa gtcaaaggtt gttcgatttt tgacttcacc acccaagctc 3721 aatgtctggg acagacctgc agaacacgta ttttcttccc aagaagcacg gttcccaggt 3781 tcagctaaag tcgcttgcca aaaccccaat atcgacagtg ttaaggatac aattgcgatc 3841 cgactagtaa aacgcataaa atatgcctct gtaattgtta aaaattaatt gttaaaaatt 3901 cgtttctagt gacagcaagt agccactttt ttgagcgttc gctttttgga agaagctgat 3961 gttttcttca gaaccaaggc gataccaaca ccaagaacca taccgcccca attttgggat 4021 tcaggaacgg aaaccaaaga tggagaacnn nnnnnnnnga attttttgtc caaatggaga 4081 aaggatatat tcagcaagct tcttcccatc tggactggca tctttaagaa ctgttaaacc 4141 gtaatcagct ttgactgcga gattatctgg taattccacc agttgtaagt caggtgcggt 4201 ttccaaggct gcgatacgcg aagcgtcaag cctccggctt atcgcactcg tgtaataagc 4261 caaaaacaca tcagcctgct tagtttcctc caaaaagtac accaaactat tttttccatc 4321 gggaactggt ggagaactag gtccaccaac caaacgtaag gcttttgcat ctagggtttc 4381 aaaactacct ggtctcaatt catcagcttt gcgaaatatt tcttgtgcat aatccccaga 4441 gggatcggat atgggggttg aagtgcctag tttaatgttt ggatctagca gcacgtctaa 4501 aaggttatct gacgtgaccg acaaccccgg tttcacaaca agagtcatgc gattgctagt 4561 gaagttcacc acagatccac tcaaaccttc ttgatacagt ttgagaggat ttccaatgtc 4621 agcactagca aaaacatcag cctcttcccc gttttccaaa cgtccccgca gtgttccaga 4681 tggaccaaat tctgttttca caggatttcc gtattctgtg ctaaaagcgt tagcgacttc 4741 tgacagtgca cctctcaaac tgccagcccc gtataaagtg accgtgggcg gttcagactg 4801 ggttgaattt gttagtgagg cagcgaaagc cccagaagtc ggggtgatat aggaaacaca 4861 ggttacgaga 
gagacggcaa gtaaacattt tctcatattg ttttagatga acattttgag 4921 tgttaagtct tggtagaatc tgaaaacggt gttatcattt cctaactttt gactaaattt 4981 caggttctgt cacacagagc aatttctacg aaaaaaccct gatatatatt taggtttttg 5041 tacgttgtta taaattacca ataaaattgg taaaaaatca agctatagtg tagagacgtt 5101 acccaacgag ggagcctcca agagggtgtt aactcacgtg taacgtttgt acatacagtc 5161 gtgtatgtga ttcaaaccaa aactgctata ttttcttttt tggatatctt taagaagaaa 5221 ttaacaatac ttccataaac tgttacaggt agttaccaac aatgttggaa aatagcagct 5281 attgatacag ttttggtaat cgatctaggc tttgatgtta ctagggggtg gtatctcctc 5341 ttttgtggat ataccaacca attttgcaaa tcttcaatta atttggtggg ttaggcgaaa 5401 actctgcggt cactaaaaac tacatctgtt gctaagcatg aattccgcac tatcaagaat 5461 agcaactatg aaagcgagcc gtctgacaac aaggcttcgc cgacgctttg atatcaaacc 5521 ttcaagaaca gcgccagatt tgctcaccct gtttgaggtt atcggctaca ctgttctttt 5581 ggaatcagat attaacatcg gtagacacaa ggctcaacgt agtagcaatg tgatggtagt 5641 tttgtgcact cgccttcgta acgccttgaa gcagattaat cccaaagttc catgtgaggt 5701 catcgagaca gtcatatctg ggctgactag cactcacagt tctgacctac tggaaaataa 5761 ccgtcgtttt cacaaactcc taaccgaagg agttgatgtt gtctaccaaa acggtaacca 5821 aatagttcat gacaaggtgt ggcttattga ctggtttaat ttgctctcca atgactggct 5881 agtgatccat ccgtttacca ttgtccaggg gtaccatagt cactgtcttg atgtcgttgt 5941 cttcatcaat ggattacctc tagcagtcat tgtcttaatg gactcaaagc atgaaaaagc 6001 taagctgagt gaagcttatc aacgcctcaa aacatatcaa caacaaatac cgatgctatt 6061 ttcttacaac gcatttattg cgctaggtgt aggaaattca gcacgagttg gtactttaac 6121 ctctggttgg aaagagttct taccttggcg ttcaattgag ggcgaagact ttccatatca 6181 aggagaaaca gaactagaaa tactgattca aggtattttt gataagcggc gtttcttaga 6241 gttagtgaag cactttatag tttttgagga aactggaacg agtattagca aaacattact 6301 tcgtcaccct ttttgtacaa cacaaaaccc gaaaattcgt tcgagaatga tatgaatacg 6361 aacacaagtt ggtatgaaca agaactcgtc aaagaattca caacaatcgt caatcactcg 6421 tttccacaag ttggtgaatt actaaaccag tgttatctca aagtcatcca atccttctgg 6481 ggacaacaca atgttcactt cctgccttac attgcaattt attgttctaa aaacacgatt 6541 gctgctgtca aagcagaaat 
ctatgtcttt agagaagtcg ctttctttct gggattaagt 6601 aaggttgttt gcctaaatgc cacatgccta ctgcatgatc ctaaatcaaa gttaccgcaa 6661 gagaatcccc acttgtggtt agaattgcag tggattgtca cacaggaaaa aggaagtgtg 6721 tcatgaagac aggacttttc tgcaattatg aaaattatca ctctgatgcc cgtcgcgcca 6781 tcttggaaca aatagcgcta gtcaaacacg cagaaagttt gggttttgaa gaagcttggg 6841 taacagaaca tcattttagt gatttcagtg ttagcccatc aattttgctc ttgattgcac 6901 acttagcagg cgtgactaaa actatccgat taggttcagc ggctgtgcta ttggcatttc 6961 atgacccaat tcttgttgca gaagatatta ccacactgga taatctttgc aatgggcgac 7021 ttgctatagg tattgctaag ggtggtccat ttcctgaaca aaataagcac tttaacactc 7081 caatgagtga atctcgtgct aaaacgctag aagcaatgat gctgattcat aaacttctat 7141 atcacactga tgtgtcgttt cttggcaagt actatcagtg cgatcgcgtc tcaatttacc 7201 caaaaccttt gcaaaaacaa attccagttt atgtagcaac tagtgatgaa gaagcaattg 7261 ggtttgctgc tttaaattct ttcggtttga tgggtggagc accatttact ctggatagac 7321 tcaagagtaa tgttactaaa tatcgcgcta tcaattccag tggttctgat aagtttatgg 7381 tggcacgatt cttttttgtt gctcgtacat atgatgaagc ggtgagtgaa gctttacctt 7441 tcattcgcac ttttagtcaa agaatgaaag cgttgagtgc taaagcacaa aagtatggca 7501 ataacagtca acatctccag gcgaatgatg gtcaaaaaag cgcttttgat gaagacgaat 7561 tgctggaaaa ttcgattatt ggtgatgttg ttacttgcag ggacaaaatt aagcggtttc 7621 aggatgaact gaatttgggc actttggcac tcaaacctgc ttcgttgaat ttgcaaaaga 7681 attttgagag tttgacgctc tataatcagg aggtacaagg ttatgtgtaa ttgttaacta 7741 taggactcct ccaaagattg ttactcagct aggtacagtt tatgttaacc agttatcagt 7801 tatcagttat cagctatcag tcatcagcta cctacgcagt accagttatc aggtaggaaa 7861 cggactcgtc caccccttgt tcactgttta ctgttcactg tttactgttc actgttcact 7921 gttcactgtt ccctgttccc tattccctgt tccctattcc tagtaaatcc tgtaaaaaac 7981 aataacccgc ctgttccaaa tttcctgcaa gagaaaaagt caaactacga attcggatgc 8041 taccaacttt gatgtttaac ggttctgaca agggcgttgg ataaattaaa attgcctctt 8101 gacacccttt cgcttctgca taggcgacca tctgataaat atcagcagtt gctggcttag 8161 aagcagcttt gtacttggta tctaacacat acctagctat ccccgttgcc tgatcataga 8221 gtaccaagtc aatgtcaagg tataatcctt 
gaccctgacc caaatagact ctttcttggg 8281 actggatatc taaggcttgt gtaagcagtg ctgtttcccg atgtgctttc agccactctg 8341 ccacaaagtg ttcgtagagt tgcgacatat taaccaaaaa gggtaatgtc gcgtttgcac 8401 ccgtttcatg actaggtgca gtctgttgca aaaagaagcg acaaagcgca tgtaggggac 8461 gataatcctc attcaggcgg ttgtactgtc gtcctacaca agttctggga tggcatggtt 8521 gcaatgtcac caatccctgt agtagatgat acgcgcggcg caccgttggt atgacccttt 8581 ctgtacacaa accgctacgg gcaatacacc aaagtgtcca agcgagaatt tgattttctt 8641 caacgtctgc ggtatgttct tcgtaataac actgaatttt agtatcccac ggtcgagtga 8701 tcatctggcg tacatcaaca cgtccccgaa cataaggcaa atgttctgtc ttggggatgt 8761 aggcgcagta gaaaccttga cgacctcggt tgagaatgcg acgggctaaa atatcagcca 8821 gccgttgata aaattctaag agggtttgac aatcaactag tcctgacaaa aagcgaaaac 8881 ttttgagctg ataggcatac tccagcattc gcagaagatt atctagcttg actttggggc 8941 gcaggataat gtgaaaatca ggagttaggg gaatgtgacc gacgcatcct tggttagtta 9001 gtcgccagcg atcgcctgtt tttggagaag gaaagtccac actcacctga gtctcgtagt 9061 cacgccacaa catttctccc actacatcgg gaagttgcaa tcgccccaac aacttgggaa 9121 catactcagt taactctata atttgtggtg atgttacact catgggtaaa tttgcttctt 9181 gacttcgttc cagcgaaact gttcgacttt actcggttgg tcaaaaaaat attcttccaa 9241 atacggctca atctccagtt gccaaatatt ctcaatctgg tcagctaaat cggctcgcag 9301 gaagaagctg atacccactt cgttgtgagg attactaatt gcttggttca gctttatcag 9361 cgtttgaatc aacttttgga ctggaaaacc agtatttaag tgacggcggc ggagaatctc 9421 atagtttggg tacacaggaa taaaagcaaa gcggcggcgg agggcatgat cgacgagtgc 9481 aattgagcga tctgctgtgt tcatcgtacc gataattcgt acattggcgg gaatgctgaa 9541 cacctcacct gcagccaaag gtattttttc gtcgcgatac tccaacagat acatcaactc 9601 accaaaaacg cgggcaacat tggcacggtt aatttcatct ataattagta cgcaaatatc 9661 ctgacgacaa catgccttgt aacagaaatc aagaaaccgt ccattaacca gtggataatc 9721 cagttctcca tctattcgtt gtgggcgaat accttgtaca aagtcctcat aagtataggc 9781 ggggtgaaac tggacaagat ccacaaaacc atcactacca ccagtcaaat atttagccag 9841 ttttttggca atatatgttt tacctgtgcc tggaggtccg taaaaaattg cctgcttttt 9901 acgctctatt gcctttaacc agtttaacag tgtttcctgt 
tcaattccag tatctaatgc 9961 acattgactg agggaatatt ctggttgcat ttctactgct tcaacatcag aagagagttc 10021 gctaacttca tcaattgggg gacgttcttt acaagcattg tcatactttt taacagcaac 10081 cagccaatga caaaacatat caaataagtc atcgtctcgc atagatggta caccagtctg 10141 acgcatctca ggagccaatt cttcaattaa tttttgtccg gttgcgttaa tagctggata 10201 ctctgttagc ttctgagtat aagattttcc tgttaagtaa ttgataacct gtcgtgatgt 10261 gttggtaatg agtaaaaagt catccggtcg gagtgcattc aggataggtg tgagcattcc 10321 tatttggaaa cttttggtag aaggtaattt tgaaaaatcc tgacaagctt ctcgcaactg 10381 agttgggttt tgataacagt gacggacgaa atgaaaaatg gcttgagcta tattcgacca 10441 gtcttgtaat tttgtccaaa cagcatcttc aaaccacctt ttaatgtctt tgtcggttgt 10501 tgaagcatga tgtatccaga aatttttttg actgttgtta gctgttggtg ggtaggggag 10561 aagttgttgc aggacgatga cgtcggctat attttctgtg gactcatagt ttgaggcgat 10621 cgcttgaaaa ttcccccttc ctttctcacg ctgctcatgg taagcctttt gatgacgcaa 10681 tccagcagga gtataagggt aagaactgat aaattcttca aataacctga taaattcgtg 10741 cttcttctct aaaagaagaa cttttttgtc cttattgaat ttagttttac tcatataact 10801 gttgtttata ctaaaaagta taaaacgacc cagcgatact gagtcgaaaa gaagcaaacc 10861 aaaaaataaa aaattggaaa tcaagctgat tgttgggttt attgtatcgc ctgtataact 10921 gctgaggcga tctcagacat gactttctcg cgcacttcca gaggggcttt tgaacctgcg 10981 atgaatacgg cgatcgccac acgctttcca tctggggaac tcgcgattcc aacatcattt 11041 gtcgcagtac ctatcccgag aacatctgga ccagtacctg tcttgtgtgc aattgaccaa 11101 ttattcggta gccctgcttt tagccgcttc tgtcctgttg gagaatcagt catcatcttt 11161 agtaacaaag tggtagagtt ctgggacaat agtttattcg agttcagcct tgccaaaagg 11221 tcaaccatac cttcaggagt cgcagtatct cgttcatcgc gaaggtactt ttccagagca 11281 gctttcttga cgcgatctgg aatattctgc acagcttctg cccatttttg ctcgtcagcc 11341 aattcgggtt gaaagttttt gagtcccaca caatccggct gtagctgctg ctctaatcga 11401 tcaacgcgaa cgttgcgaat ttgcagctta tttaagatag cattgacttg cttaggtcca 11461 ccaagcactc gcaccaaagc gtcagctgca gtattatcac tcattcccac agagcgttct 11521 agaacgtttc gcagtggtaa ctgtactcta tcacctttga attccttgat gataggactc 11581 gatccaggag caagatcccg 
acgcataatg gtgactaatt gcttaaacga aatcttgccc 11641 tcgtctagtt gttttaaaat ggctatggca gaaggaagct tatacacgct ttgcatggga 11701 aagcgctgct taccattgag aaaccagctt tgaccgttat taagatccaa aaccccaata 11761 ccaaccctac cttgagcggc tgaaatatcg agatttttta accgttcttg aaataatgtc 11821 gtgttggact gtgctattgc taactcaggg ctaactccca tcattccatc tgcaagtgac 11881 gaggtttgat gaatttgttg ctcaagttgc tgtgaggatg actcagtgat gcttgaagat 11941 aaagggctta gagatggttc ttgtgctttg agtggaagca gcatcaatgt cagcagaatc 12001 actgctgcaa acgaaagcca aaaacgacgt acaaaggctg aaaacatttg ctgttaaatt 12061 tctgggtgtg aatatttgtt ctaccgtatc atcgacttta ttgatgtctt ttatgtttct 12121 ggtgaaaact gtagtaatca ctgttagtct aaactccaga tacaagtatc ggaggataat 12181 agaaataagt agcaaccttg atactcactc tgactggttt gttatagctg ctggattttc 12241 cccgcctaaa ctcataataa ttttatattc tctctcatta agatagcgat agctttgagg 12301 tggacgcaaa taagatagtt tctcccgtaa acgatataaa tcaataggat tttcaaactt 12361 tttgacattt cttaaaaaaa ttccgacggc aacggttgct ccttcataat atttataaaa 12421 gtcttgataa ttgatgccag catggtcttt aacttccttc caaaaatgtt ttagttcatc 12481 agtagtagct tctttctcaa tcacaaaatc tacttcaaaa aaacctaaaa aactttttgt 12541 aggggatgat acataaacaa agacaagatc gcctttattc aatcgagtcc taactcgacg 12601 tagttccacc tgcttagttc gttcttcaaa aattttatgg gcatatttag gtttaattga 12661 aaggagaagt atgttacatg gcatgaaaaa aatatattta aattcatcta atacctaaat 12721 tatacactct cataaaacca ttctccggaa gtttataagg gcaagcaagt aagagtttat 12781 cttgttctaa aatccgatgt agttcttgta aggagacagg attcttaaat aattccgtat 12841 cactaaatct aattgccatg atgtcaccat ttttatctgt tggtatcttc tcaacatgac 12901 ttaacttata aacacctaat ctttgaaaac gtcggtaaag ttctttgggt ttaccaataa 12961 tgacctcatc tacataagaa caagcacgaa tataaccaac atcgtagaaa cctttacctt 13021 ttctctcacc ttgagtttca cttacatacc agagaattcg acaaggagca ttcaagcccc 13081 ttgagttttt tacggagcga tagtagacag cttcacgatt aaatgataat tccggcttag 13141 ctccaaaatc agggaaagga agcatttgat ttgctaaatg ttcatcaaat aaatcttttg 13201 cccatatagg ttgaatagga ataataaatg taggaattcc cgaatcactg atttttgcag 13261 
gaaaaaggaa gcgttcaata tctactaagg cttgagcatc tttgattata tgttcttcta 13321 gattcttcgc aatgaccata agaaaattat attcatctcc aagatgtgat accaaatgag 13381 tgagatattc cgagaattca gaagaagatt tgacaactgg taaatttgcc ttcaaccatc 13441 cgtttttaac ttgcacaaaa gcatcttctt gaatagctgt gagcacagct tcctctaggt 13501 aggggtcagt gattcttgta aattgtcgct gttctctagc agacagacat atggatttaa 13561 aaatcaaatg atgtgcaaga gtagaggaaa gtttgttatt tccaactcga agcattaata 13621 tttctagctc gtcttttttc tgccttccat atacaattaa agcaagcggt tgattgactt 13681 cttctctcac gagaacacat tcaaacttat ctgattctgc taaaaaccgc cgtaaatgtt 13741 gctgaaactc agtttgatat tcatctcttt gagagttttg aaagtagttt accaaaaagt 13801 cttcttcccc aatttttact cgtgtttgct ctaagcatgt gccagaaaga cgtacaggtt 13861 gatagtcagg tttatgccga agttcgtcta gttgatttat cagttcgtca ggatgaataa 13921 ctgataatat aaaactttca tgaattctat ttcctaagtc tagtaattcc tcatctttcg 13981 tcaacaataa atgagaatcg gatgtaatag ccctagcaat ataacgcaac ttatattcat 14041 taaattcaac ttggtacttt tttaaaaagt tttgtaaaag aatctggttt atttccattt 14101 tttgattgtc acaaggtaag cagggaaaag tttctgcaaa gctgcgcagg ctgtttctct 14161 catgagtctt ttcaattttg ttaatttgat tgaaaatttc gtcattgatg cacaacgtta 14221 actcagtctg tatccaatca gctagtaaaa aaaatggttc ttcagtatca aaattttcgt 14281 ctttgtgtaa gtcaaagaaa atatcaggag caatcatcac acataacttt gactcaagtt 14341 gctgctcaat aattctagaa aatagaggaa gaggattgtg ttccaaaacc caacgagtaa 14401 gaattgtttt cttcttttta ctcctccctt ctaaatcctc gcaagcaata aagccaaagc 14461 tcgaccacat tccttgtaga ttataatccc gacgacaact tagcttaatt ccgcgagaag 14521 atgttgttat ttgtttgagg cgatcgacaa gttgtttggc tactccctta ccccgacatg 14581 aaggatcaat acacagatga gtgatagtga accagtcata cgaacgtcga tacaacaggt 14641 aaccaatgca gccagcttca gaatcaagcg ctacaaggat ttgacgatgc gttgctcgtt 14701 cttcaaatgc tcctttaggc aagttaccaa gtgtttttga gtttgctcgc cacagtttta 14761 tgactgttgc taggtatgga gaatgattat ctattgcttc aatcttgatc tgtgcgttag 14821 ttgtcacttt ttggtttcct tggacagttt catagctgca atatctttca tagtaaatcc 14881 gtcaccttgt gaatatagtt ggtattacag cgttccgtgc ggagattatt 
tgtggggata 14941 ttctgagttg gtgtgttctt cttataccaa ttttaaaaaa gaatgcgaca gatacacact 15001 cccaaaccct tgcgatgtct gactttcttc attttgaact ttgaattttg aattttgaat 15061 tggtattagc taccctacgg gaagccgcta gcccctaagg gggcgctgcg caaacgcgtc 15121 tacactccgt tacgctcgta atgacatttt acgtttaatt aggttgacct acttatctac 15181 tcttcactct tcagtaaacc cgtagaatat acatctaaaa ataacgtcgc ttgctgctgc 15241 gtgcggagaa tagaagcagg acaatctgca gtaatatccc cctccaatat ctgtttcact 15301 atcttcgcct tacgttttcc tggtgcaaga caaatgattt tttttgccga acaaatcatt 15361 gggagagtaa cagtaaaagc gtattgcggc acactttcca aactggaaaa atatcctgta 15421 ttgacttgtt gctgacggtt cacagtatcc agctttacaa gtttcaatgt atggggatcg 15481 ttaaaatctg ctactgctgg atcattaaaa gccaagtgcc cgttttcacc aatacctaga 15541 cagcataaat caatgggttg tgcttgcagg agtttggtgt agcgatcgca ctctgccaaa 15601 ggttgcatag catcaccttc tatatagtga aattccttgg gactaacccg catttcaacc 15661 cgttcccgca gatagcgccg aaaactcgcc gaattatccg cagaaatgcc caaatattca 15721 tccagatgga aacaggtaat ccgtgaccaa tctacaccac tcaatgctat caaagcatca 15781 agaaatttca tctgggagtt gcccgtcgct agcaataaag ctgcagtatc cttttgctgg 15841 agggtgtgct gcaaatgttt ttgtacaatt cccgctacat cctgcgccag ttcaacttca 15901 gaattataaa cttgcacctg taaagcatcg acgcgaaaag aatttttggc ggttgacatc 15961 ggagtataca gaatattttc agacaaactc aagaatgcag acagattcac cgacctcgcc 16021 ctaaaaggga cgaggattgc ccgaaccaat tcgggcaacg caagcggtga atagcccagc 16081 ccttgcctaa ttctggcaca gacctccgga tgcttctcca gtccggattc atctctaagc 16141 ttgattggtc aagcgttggg taatgccaag acatcttgaa ttaggtgggc gaggagacta 16201 actacgggtt tgctggttcc tgggattata tccatttaaa aaccagcatt caaatataaa 16261 gccgtgctaa aagcacgggg tttctaccca ggtttttcga tgacaaacta gaacaaaaca 16321 atgtaaaatc tgtaaagaaa ctttacaaga gtttacacct taaataccat gctagctgca 16381 attctctttg acctagatgg aacgatcgcc aacactgacc ccatacacta ccaagcttgg 16441 cgggaaatgc tgatgggcta cgacatggac attgatgaaa cattttataa atcccgaatt 16501 agcgggcgga cgaatccaca aattatagaa gacctcctgc cacaattatc acccgaagaa 16561 ggtgcaaagt ttgcagacga aaaagaggct 
cttttccgcc aaaaagccaa gactattctc 16621 aaacctctaa gcggattttc agaactcata gcatggacag atgcgcatca actgaaacgt 16681 gctttggtaa cgaatgctcc tagattaaat gtccaattcg tgctagaagt tttggaaata 16741 aaagaagtct ttcacacagt tgtcatagca gaaaatgaaa tagctgcaaa accagatcct 16801 gcaccttacc aagtttcctt aaacaggttc ggtatcacag cagaacaagc aatagcacta 16861 gaagattctc cctctggaat tcgttctgct gtgggcgctg gtattcgcac tattggcgtg 16921 acaacgactc aggagtcaaa agttctcctg tcacttgggg catttatgac agttccagac 16981 ttcactgatt tgcaactgtg gacacttctc aactcgtcgg tacaggaaga tgtggcttgt 17041 ctagatttat aaagtttggc gggatgtgtg gaaacctgat cgaagaggca ggggcaggga 17101 gcacagggag caacaccaga aaaaattacc ctgccatgta ggtttgatgc cccttctttc 17161 aagaatggag aagaaaaaat tgttgagaga cgcaaggcag agccagcgct tgactagggt 17221 tgcaaataga ccaaaatctg atttttgcca agaatatatt attttcaacc tttagataga 17281 tgcttatcta gtaaattgtg aattatgcga aatttcccag aatgacttaa gaaaatataa 17341 atttagtgaa aaatttgatt ttttactaaa tcactcttga caaaaccttt gattatagaa 17401 taaaacaaaa cagtacattc atatacagat ttagttagaa cttgtcttct gtgataactt 17461 ctcaatttat caaaatcttt tgatgaaaaa agagttagtg gttggcatct tcaactagca 17521 gatattgcag ttaagtttaa ctgttgctat ttttacaccc ttgaagctgt tttcaaactt 17581 tttgcaacgc cacttcatca ccctacattt ctggaggata agttattaaa cagttaggta 17641 gaattcatga aaatcatcaa aacttatgca gaggagtttt gtattattta tccatatagg 17701 atgaatagta gaacatgatt ctatctgtgc acagacatga aaataagctg tgcagttctt 17761 gggttacagc ggtttttagg taaggtgggc attgcttatc aacatctcaa atgtccatca 17821 tataagcttt taggagcact ggctccccta ggatttgtgg tctaaccata gaatcagcgc 17881 tgcaactagt aataccaaaa aaatcccacc tagaactact tttttcccct ggcaaatgtc 17941 atttgcagct ttgataacag gggtagttaa acctgatttg ggatgttctt ttcaaggaac 18001 gtaatcatct ttttgctaac cactatagtt aattagataa ataacagcta cctatctgat 18061 gtacttttca gacaaattga ttctacgtct gtaaaaactg cgcgtccgag atttcagaac 18121 agatgtggaa gcatttttcc acctgcttaa atttgacaat cactgctcca gaattctgtt 18181 aaaaaattca ggtaatgtaa tgatttttga gatttctgac ttttatgaaa gcaagttttt 18241 aacaccttgt 
cagcgtcagg ttttgttgaa gaatttgcaa gctaatttgc aaccagaata 18301 ccgtcggagg atagaaatta tgttgctggc agatatgggt aaatcgcaaa cccaaatctg 18361 taagatttta ggttgttctc aggagatggc gcggtattgg atcaccgtcg cacagctagg 18421 tttggcggac aaatggcagg aacgaccgat aggtagaccg aagattgtca acgaccaata 18481 tatccaaagg ttaaaagaat tgtttagtca tagtccgcgt aaatatggtt atgcatttag 18541 ttcttggaca tctcaatggt taagcaaaca tttagcaact gaatttggga ttgaaatcag 18601 cgatcgccac atcaatcgcc tgctaaaaca aatggggctt tctacacaac agaaacgctc 18661 ttctaaaaag caagcaacta aagacaccaa ggaaactggc attcggatct gtgacttgca 18721 atctcacagt gagcctagtt ttcattggtt attcaatcat catgcaaatt aataactaac 18781 tcttgggatg gagaagaaat catgtcatca gggttttccc aagaacaact gagtcaacaa 18841 ctcatgactt tcttaggaga gaagctttca ccaaaggaag tgatgaattg tttaaaagaa 18901 gtggaaattg tcgaaccacc tgtagcaaag ctgttttggc aatcaacaga ctctcagccg 18961 ggaatttata tagtccttgc gggtaaagtg cgactgctgg atagttctgg taacttaatc 19021 tccactcttg cagcaggatc atcatttggt gaggtgactc tgtttgcaga agaaggcttt 19081 attccttacg ctgctagagc ttcacataac ttaaaactct gttatatcag cgcagatgca 19141 ttgcagattt tgatggagaa ataccccaaa atccgcgatc gcctgttgaa aagcacagaa 19201 ctttgggatt tgcagttgtt gtatcaaaat caacacccaa atacacctgt tcataacata 19261 tttcaagcct tctccctgtt tgaaaggcat tcattcaata aatttcaaga aattgaggta 19321 gatccagata ctaagttatg gctgttacat cgaggcaaac tacagcgttc tgacggtagt 19381 tgtttgactt caggcaaaat ctatgctgtt ccacaagata cttgttggca agcgactcaa 19441 ccaacaattc tgtacagtct ccggaattct gcttggcttg cagcagtgca acacttgccc 19501 caattaacag aattgattgc ttctgattct aaacaaatta caactgagga aaacgaagcg 19561 ccgcctcaga cttcatttcc acaaagacgt accaatagca ggaaagtcat tccatttccc 19621 tcgcacgccg cccccccaaa acaaaagcga cagcgacgtg tctacnnnnn nnnnngtcta 19681 cttccctagc cctcaggtaa aagcaggaca tttatgggca cacctgacga aacgctatcc 19741 cttctttgaa caacaaagtg cctcagactg tggtgcagcg tgccttgtga tgatgagtcg 19801 ctattggggt aaacgcttta gcatcaacat attgcgggat ttagctaacg tcaggcgcac 19861 tggggcatct ctacaaggtt tagcagtagc agcagaaagt atcggttttg cgactcgtcc 
19921 agtgaaagcc agcctagaca aattggcaca acaacctcta ccagcgatcg cccactggga 19981 aggcaagcat tacatcgtcg tctatgaaat cactccaaaa caggtaattg tcggcgatcc 20041 tgccatcggt caacgcaccc tcagttatgc tgagtttaaa gcagggtgga ctggttatgc 20101 cttattagtg caaccgacag cagaactcaa agaaagccaa gaagcgagta caccattctg 20161 gcagttctgg gagttagtca aaccacattg gcaagtcctg ctggaagtct tcatcacttc 20221 agtctttatc cagctgtttg gactcgtcac gcctctgttc acccagttac tgttagacag 20281 ggtgattgtc caaggtagta ccctcacctt aactgccatt gggttagggt tgctgatttt 20341 tggtttgttc cgcgtcgcca tgaacggact acgacaatac ctgctagatc atacagcgaa 20401 ccggatcgga gcagccctga tggtaggttt tatcaaacat accttccgcc ttcccctagc 20461 cttttttgag tcgcgttacg tcggcgatat tgtttctcgc gttcaagaaa accaaaaaat 20521 tcagcgtttc ctgactggcg aagcactgtc aatcattctt gatttactga cggtgtttat 20581 ctatgtgggt ttgatgtttt ggtatagctg gcagatggca ttgttagtgc tgttgattgt 20641 gccgccattt tttattctgg cgctggttgc tacacctttt tttcgccgca tcaaccgtga 20701 agtttttaat gccttagccg acgagaacag ttatttaatt caagccctga caggaattcg 20761 ctcgattcgt tcaatggcaa ttgaacaaac ggtgcgctgg cgttgggaag aactgctaaa 20821 taatttgatc aaaaaaatgt ttggcggaca agtcattgcc aaccaactgc aagttctcag 20881 ttctaccatc gaatcagtgg caaatacagc attattatgg tttggagcat ggctggtgat 20941 tcacaaccaa ctgacaattg ggcaacttgt agccttcaat atgttgttgg gtaacatcat 21001 tcatcctttc caacgcctga cagtgctgtg gaatcaagta caagaagtga tggtttccac 21061 cgaacggatt aatgatgttc tagaagccga accagaagaa gacttacaac atcaaccacg 21121 ccaatctttg ccaagactac gaggttacat tcgctttcgt gatgtgactt tccgctatca 21181 cccagaaagt gatatcaacg tactggaaaa cctcagtttt gaaattcagc ctgagcaaac 21241 tgtagcggtt gtagggcgta gcggttccgg gaaaacaact ctttccaagc tgattttggg 21301 tttatatacc ccgacaaatg gcaaagtatt gattgatggt catgatgtga caactcttca 21361 gatgcgatcg ctacgacaac aaattggtgt tgtcgatcaa gaaacctttt tgtttggtgg 21421 tacgattcgg gaaaacatta gcatcgctca cccagaagcc actttagaag agattaccga 21481 agcagcgaat cttgcagggg cttcagaatt tattcagcaa ttacctatgg gttacgaaac 21541 ccaaatcggt gaaggcggcg gaatgctttc tggtggacaa 
cgccaacgtc tagcaattgc 21601 tcgcgcttta ttaggtaatc cccggttttt aattttcgat gaagcaacca gtcacctcga 21661 tgcggaatca gaacgcatca ttcaaaacaa tttgaaaacg attctccaag gacgtaccag 21721 cttgattatt gcccatcgac tttctacgat tcgcaatgct gacttgattc tggtattaga 21781 tcaaggtgtg ttggtggaaa gcgggactca caaggaatta atcgccaaaa aaggtcatta 21841 ctactacctc aatcaacaac aacttgctca agtaggctaa atcagttatc agtgaacagt 21901 taccagtcat caatcataaa tagtggtaac tgctaattga tagctgataa ctgctaatta 21961 ataactgata actgaaatta agctatggct tatccatcca ataattcatc atcaccattc 22021 accccaaccg atgatcagca actgagcact ccacctaatg tagttgagga aaataataat 22081 gtagcagttg ccaaagattg gttttacggt actgaagaac tactagatgc tttacctcgt 22141 ctttggacgc gctctttgtt atatttgctg ataggtttta gtgcgatcgc cttaccctgg 22201 gcaatgctgt ctcaagttga tgagacagga actgctagag gacgtatcga accactaggc 22261 gcaacacaaa gattagattc tcaagtcacc gccagtatca ctgctgtcag ggtgaaagaa 22321 ggagagcaag ttcgagccgg acagttactg gtagaactac aatcagatgt aatgcaaacc 22381 gacttgaaac aggcgcaggc aaagctagaa ggactcataa atcggcaagc acaactagaa 22441 ctcatcaaaa accaattgct actagcaatt cgcgtccaag agcaacaaaa ccaatcccaa 22501 gaatcagaaa aaattgccca agttaatcaa gcaaagcaaa acctggatgc aaaacagagt 22561 acctataact tacaaaaatt ggaaaaactg gctttagtag atcaagctaa gcagcagatt 22621 aacagcactc agaatgacca gaagtcggcc caaagtcgtt tgagtataga ttcaaaacaa 22681 gtcaaacgct ttagcaaact cgtgaaggat ggtgcggttt ccgcaaatca aattgaccaa 22741 ctcagaaaag aagaagaaga aagcaaacga ctcaaccaaa aaacgcaatc ggatatcaaa 22801 caagcccagc tacgcttcca agaagaacaa aaccgttatc aggcaactat gcgtcaagcc 22861 caggtagata ttcaggtagc aaaactgaga ctgcaagaga gtcaaagcag ctatcaaagt 22921 attattcaag ctgggaaact agctgttttt aaaaatcagg aacaattaaa agacctgcac 22981 acacaaatag tgactctgaa atcagaaatt gctcaaacca aaaaccaaat cgattcctta 23041 aagctgcaat tagagcaacg agttatgcga tcgcccgttg atggtgtcat ttttgattta 23101 cccatcaaaa agcctggtgt cgtcgtacaa cccggtcaga tcatcgccca aattgcccct 23161 aaacaaactc cctttgtact taaagccaat atgcctagtc aacaaagtgg tttcttgaaa 23221 ttaggaatgc cagtcaaaat 
caagtttgac gcctatccct tccaagatta tggagtcgtt 23281 ccagggcgag tgattcggat ttcaccagac tccaaaattc aggaaactcc ccaaggcaaa 23341 atagaaactt ttgagttgga aatatccttg aatcagcctg atatccaatc tggtaacaaa 23401 catattccct taacccctgg tcaaacagca acagcagaag ttattgttcg tcagcggcgc 23461 gtgattgatt tcattttaga tccgtttaaa aagctgcaaa aaggtggttt agagctttaa 23521 ccaacagtta accagttatc agttatcagt tatcagttac caatgataac gctacggtgt 23581 acacacatct cgctcaaaac ctcaccctcg cttgaatcgg cgctaaaatc tttccctctc 23641 cttaccaagg agagggatgc ccgacagggc agggtgaggt tccgaaggaa ttttgattaa 23701 ttcgatgact tgtgtgtaca cccttacccc cttaccccct taccccctta cactcctaat 23761 tgttgactct aaacctgatt tctttcgtac ccatcaaaaa attgttaata atcagaggaa 23821 taagatgatg ttaaaagtgt tgaatgtgtc tggtaaagaa atgctagagc aactgaaact 23881 ttcctgccag attcctggtt tattagatgc gatcgcaaca cgcaaaatta tttttgatgc 23941 agcaagaact gcagggatta aagtagaagt acaagaactc caacagtcag ccgatagcct 24001 gcggacagcg aataacctac tgaaagcgga agatacttgg gcgtggctac aaaaacatca 24061 tctttctttg gaagagtttg aacaattagc cgaaattaac ctactatctg ccaagttagc 24121 gaatcattta tttgcagata aagtggaatc attcttttat gaacaccaac tggattacct 24181 ggcagcagtt acttacgaag tcatcttaga tgatgaagac ttagcttggg aactttttta 24241 tgcactcact gaaggtgaaa tgagtttcca agacatgact cgtcaacaca tccaaaatcc 24301 agaacttcgc cgtactggcg gatatcgtgg aataagacct cgtaaagatt ttaaaccgga 24361 tattgccgct gcgctctttg ccgctaatcc tcctcagtta ctcaaaccca tagtcactac 24421 acaaggaatc catctcttga gggttgagga aatcattcgt ccggaattag atcagcagtt 24481 acgcttgaag attatgagcg actttttttc cacttggtta aaggaacaaa tggcgcaaat 24541 agaagttatt ccacattttg aatcagattc ctattctcca ccagtccagg aattactgaa 24601 gctagcttaa tcggtataaa acaattttta aaaaccagat acaattggta tctggttttt 24661 aattcacaac aattatcaat gacttctgaa cagaaaaact cggttttgca aaacttaccg 24721 agtttataat gtatgatgta tgagtcttga gttattgaga ttttcttatc attttttaaa 24781 tgaactatct gtgactatga ggtaaatcag ataagcagca aatgctttta ataaggtatt 24841 tgcgaatata ctaaaaggat taccacccat gacagatgca gactcagttt cagatagttc 24901 
atttagaaat gtttcagatt ctgataaaat ttggcaactc gtcgaattga tatcaaaaat 24961 tgtgatttta gccattgtga ttttttgagt tataaaagtt aattagtgat cagaattttt 25021 gggaaacatg aaatttatag agataggttt tacccatccc tataaacaat actaataaac 25081 tagttattta cgttatttcg ctagttcggc aatcgctaaa attgctgtac ccttcaagaa 25141 tgaattttgc aaaattttag aaatttctat tacgaatcca ccgtagctgc tcgcaaaagt 25201 tagcgcatca ccaccaccgt taacagagga agactctgtt tcagctactt cattgataaa 25261 ggattttgag tcgaatgcca ggtcggaaac tgtgatttta gacataaatt tttcctgatt 25321 ttgtagttac ttggattaag ctcttgctta acccttgcat ctatttttaa cgatataaaa 25381 tcgaaattca aggagttgaa att // LOCUS NODE_1197_length_25215_cov_5.311526 25215 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 25215) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 25215) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..25215 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 67..1095 /gene="cas7d" /locus_tag="DP116_10345" CDS 67..1095 /gene="cas7d" /locus_tag="DP116_10345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872950.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I-D CRISPR-associated protein Cas7/Csc2" /protein_id="PRJNA477356:DP116_10345" /translation="MNLLKTVDAKFFHTEIPAKPMGNYVHFITIRVTESYPLFQTDGE LNKAKVRAGIENKEPISRLAMFKRKQSTPERLTGRELLRKYEIGDAKNCDYNVDFSKT TPDCILYGFAIGDSGSEKSKVVVDTAYSITPFEDSHLNVTLNAPFENGTMSRQGEVTS RINSQDHILPQIFFPSIVTLKDPTEAGFIYVFNNILRTRHYGAQTTRTGRVRNELIGV IFTDGEIVSNLRWTQKIYDLMQHKGEINLPDPLDEDEVVKAATEAMIALMKDECITHT DFIGESFTSLLNEIKSITSHETQLQAMLTQANAESSAYAQTWVLKSAKKESKKDSKSQ KKAGAVAE" gene 1101..1820 /gene="cas5d" /locus_tag="DP116_10350" CDS 1101..1820 /gene="cas5d" /locus_tag="DP116_10350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740218.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I-D CRISPR-associated protein Cas5/Csc1" /protein_id="PRJNA477356:DP116_10350" /translation="MAIIYHCQLELHDSLYYATREIGRLYETELVIHNYALCYALGLV DSEIYSTTVAEEHSYRYFCPEQVPKYEEHLTPLNHQSIYITPAHSINHTTILNTWKYA NNNYHVEMEKTQKNIPSFGRAKEIAPESQFEFFVISQKQLRLPKWIRLGKWMSKAEVQ TQEVTKLELKTGNFSFPYPLNPLDVMFTHQVVSYDVVNMPPVSLIQNVSIRQGQYYEF ENPNKSEKLRLPAKMQYRFKG" gene 1882..2742 /gene="cas6" /locus_tag="DP116_10355" CDS 1882..2742 /gene="cas6" /locus_tag="DP116_10355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740217.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CRISPR-associated endoribonuclease Cas6" /protein_id="PRJNA477356:DP116_10355" /translation="MPHSLVLNLIPQSPIYPEFLTGRHYHALFLTLISSVDKDLGDYL HTSNADKPFTLSPLQVTRNHKSHQHKHHTLQFSHQRLIPPGTPCWWRISLLDDNLFSK LTPLWLNLNPEHPWHLGSADLYITSILGTPQSMQPWANACTYTQLYEQAKESDRPNHP LNLTLATPVAFRQGGYDTILPIRECVFNSLLSRWNKYSGIEFTNIPIESIYPSFVNIH TEVIRNYDNKFVGCVGEISYRILGDIEPIAIKQINALADFALYAGVGRKTTMGMGMTR RLSSSVENYE" gene 2735..3328 /gene="cas4" /locus_tag="DP116_10360" CDS 2735..3328 /gene="cas4" /locus_tag="DP116_10360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740216.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CRISPR-associated protein Cas4" /protein_id="PRJNA477356:DP116_10360" /translation="MNETEYIPIASLNQYAYCPHRCWRMFCAGEFIDNQYTIEGTSLH ERVHTLGEGHREETWQVRAIWLKSDKYKLIGKSDLIESENGELYPVEYKRGRKGEWDN DELQVCAQALCLEEITGQTVTTGYVYYAHSHQRQLVEITEELRQSTIATIEAVQMLLL TGIMPKPVKTKRCVGCSLYTRCLPEIVDKVGRYQEVY" gene 3550..4554 /locus_tag="DP116_10365" CDS 3550..4554 /locus_tag="DP116_10365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872956.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I-D CRISPR-associated endonuclease Cas1" /protein_id="PRJNA477356:DP116_10365" /translation="MGTLYITQDDAFIGKIDERLHVKFDKKTILDVPLIKIDGVVVLG RATVSPAAVNELLERQIPLTFLTETGRYLGRLEPEVTKNIFVRKAQWQAAGDTAQAIH VVQGFVRGKLKNYRNTLVRRQRECNNLDLSASIERLDHVIVPIDSTQNIDSLRGLEGA GSAAYFGCFNQMIRNTGFTFTKRVRRPPTDPVNSLLSFGYSLLCHDVQSAVNIAGFDS YLGYLHCDRYGRPSLALDLMEEFRPLVVDAVVLSALNKQFLKVEDFVTEPLSGAVSLT NEPRKTFLRLYGQKKLSEFKHPVLGRKCTYQEAFELQARLLAKYLMGEIEKYPPLVLK " gene 4737..5024 /gene="cas2" /locus_tag="DP116_10370" CDS 4737..5024 /gene="cas2" /locus_tag="DP116_10370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CRISPR-associated endonuclease Cas2" /protein_id="PRJNA477356:DP116_10370" /translation="MNVVVSYDISEDKRRTKIHKVLKSYGQWVQYSVFECQLSDTQYA KLRSRLHKLIKPDTDSIKFYFLCACCFGKVERIGGEPPRDDTIFFAECADG" repeat_region 5237..7306 /inference="COORDINATES: alignment:crt:1.2" /inference="COORDINATES: alignment:pilercr:v1.02" /rpt_family="CRISPR" /rpt_type=direct /rpt_unit_range=5237..5273 /rpt_unit_seq="gttgaaatttctcttactccctattagggattgaaac" gene 7462..8163 /locus_tag="DP116_10375" CDS 7462..8163 /locus_tag="DP116_10375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309811.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_10375" /translation="MSLAKELDTPQDIPEDVILPPGDLYSDEPPLETELHLRQIILLF KCLEWLWRDKTDFYAAGNLTIYYSLSKRKSEDFRGPDFFVVLDTERKTRKSWVVWEEE GKYPNVILEILSESTANTDKEFKKKLYQNTFRTPDYFWFDPYTSEFAGFHLLDGKYQP LEANNQGHLWSQQLELYLGIHQGLLRFFTASGQLVPTPEEEAESQRQQKELAISKAER LAAKLRELNIDPDTI" gene complement(8258..8938) /locus_tag="DP116_10380" CDS complement(8258..8938) /locus_tag="DP116_10380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112189.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_10380" /translation="MTQALTKQVTFDEFIAWYPENPQRRYELHDGVIVEMAPPTGDHE EVVGFLATKLTLEYSRLNLPYFIPKTTFIKPIEGKSAYSPDVLLLNRPNLINEPLWKK ESTISDAASVPLVVEVVSQCVARVPRVEATDEPVRVSSNWRDDYHKKLADYEEMGIPE YWIVDYAALGGRAFIGSPKQPTISVYQLVEGEYQVAQFRGSNRITSPTLSELNITAQQ IFDAASNF" gene complement(9167..11008) /locus_tag="DP116_10385" CDS complement(9167..11008) /locus_tag="DP116_10385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316233.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10385" /translation="MANRKFYLHYLALVGAIALGAILRFWNLDLKPLWMDEVITAIFS LGKSYNDLPMEVVLPLERVSEIFTVQPGVSCSEIAKNIASQSTHPPLFFCGMYSWLKW LSPLGGEWVGKLRSPAVFFGVAEIVAIYYLNRIAFFPSAGIIAAAFMAVSPLAVYLSQ EARHYTLPMFLITLALLGLVQIVKDIEKRQKVRFWVVLGWAIINSISFYVHYFCILAF IAQIATLLVLMCWRSGNILNKRQIWLALILCVSVVVISFLPWLPVVFHDYNRSETGWL DPPQHISPIYQTLISWLLMVISLPVENQPLAIAVVSGLLMLLFGIWVGWQVFKGLKQL WYKQSTHLATLTLLSFCVWVLLEFLAIAYFLGKDITAVPRYHFVYYPSFCALVAASFA HTKLQSKVVAVIASRSREAKPWRETKWREAISPKAPLQQNATWYQINKNEKTRNHSIF LPKSSFVILFLVSLLSCVFLVFNLVFQKPFEPEQVARKMNQNPSIPILMVVGYRDYQD VALGLSFALALEPLREAEEYSSSHVAFFKQSPDFAPVLQKLSQLPPPTATHLNLWLVG PGRKRRDYPQQITLSQKMTCAIDSSQHYRVGVPYQLYRCGVSVSNRK" gene 11145..13520 /locus_tag="DP116_10390" CDS 11145..13520 /locus_tag="DP116_10390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA helicase UvrD" /protein_id="PRJNA477356:DP116_10390" /translation="MPQESKSIEPSTLPVSSPREAAIANIRNTLRQGQQRMADWNSGS LAVSAVPGAGKSTGMAAAAAIAIAHQYDQSTKSRRNIRRQLIVVTFTRSAAANIKIKI REYLKELSLPQTGFIVNTLHGLALNIASRYPNLSGLELENVTLITPNQSHRFIRTAVE QWVASHPGRYSRLLEGIEFDGEETERLRRQSVLRTEVLPELATIVIHEAKSSGLLPED LREFGKQTTEEYDILSIAAGLYEQYQNLMRSRDFIDYDDMILAALRVLENENTRKIEQ NQVFAVFEDEAQDSSPLQTRLLTILATNPDNPNEPPNLLRVGDPNQAINSTFTPADPI YFRQFCKECDAQQQLVTMDQAGRSTKIIIDAANFVLEWVNSLYVKTGQDNSIQNSKFK IPNLESPSSPTPFRFQRIRPVEPNYRKADVNPQAVGLGLELYTPRDIHHTVELLSERI VELFSQEPNAISAAILVRENRQGRWLAQALTSVCKEHNILLYDVGERDRHSHVPEEVL ALLQFCDRPHSPDYLKRALEVFVQRRLIETQDLNALATLPEEFLYPSPLASPQSESVQ KAAHLCRSLLRARLELPLYQLISFIALTLNYEQAELATADKLAERVIQQIAGNTSMGS MLNALSEIVSSERFEPVETEDLEKRYTRNGQLTIITMHKAKGLDWDYVFMPFLHENLI PGRFWVPPQSQFLGNFTLSEVARAQIRAALHGQSTLPSITVAWEQAKHLKTAEEYRLL YVAMTRAKRLLWMSAAQKAPFTWSKPENLQEQTPCPVFPALKRQFPKSVRN" gene 13887..14348 /locus_tag="DP116_10395" CDS 13887..14348 
/locus_tag="DP116_10395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008187059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_10395" /translation="MQNIEVRPVKDKQELDDMFHQRWLVLRSPLGMDKGTEKDKHEDS AFHLVAVCDHKVVGSARLRLLSKELGSIAYLAVLPEFRHQGIGTKLMEKLIEIAHEKN LNTLRLMSRVHAVNFYKRLGFCEVGEPFYYLDVSHIFMQCKLQQSQKNAKD" gene complement(14302..15279) /locus_tag="DP116_10400" CDS complement(14302..15279) /locus_tag="DP116_10400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873676.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sucrase ferredoxin" /protein_id="PRJNA477356:DP116_10400" /translation="MNTFFCSDHSRQIGEDIIGSGTNNQTYILIECRTPWTAEALNSR WVPENLRLLIQKCKSGKIPVKFLLIANNLSHKVDGTTVLIYQKKEGLSNGYHKQEFHL AHIEQAAGIVRKWLSGQSSNYEVKNSATRDILVCTHGSHDLCCARYGNPFYAQAVAMS EDLCLDNVRIWKSSHFGGHRFAPTIIDLPEGRYYGVLDQDSFRSILTRTGDITCLNKV YRGWGILPTSIQVLERELILRHGWDWFNYRVAGRIIEESSDKTTILAELTVEKPDCSL DTYQAKLVKNDSKTLEMRGSCNAKQESLFVKYSVASLSHFSETAVACTA" gene complement(15308..16471) /locus_tag="DP116_10405" CDS complement(15308..16471) /locus_tag="DP116_10405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748176.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cupin" /protein_id="PRJNA477356:DP116_10405" /translation="MFGDILKPYTVEEFFKKNWTSEAVFISSGDQTKFAHLFSWEKLT YLLNFHQFKYPDLRLALDEKVLDESANANIIQRCQEGATLILNGVHKLIPELATFASE MKYDFGYGVQVNAYCSWPHKQGFSSHYDTHEVFILQIDGTKQWYVFYDTIKYPLPEQK SSSFSPPEGEAYLTCTLKPGDVLYIPRGHWHYAVAVDEPSLHLTLGVHCKTGVDFLEW LVNELRQQEEWRKSMPLRVETAAVENYVDGLIEKLSKHIAHSNLSDDYMNYLDGLGKA IAPYSLPYQAGFHIFEQGTQTKFKSAKFQRTRITELPDSSGYKIVVAGKEVSLKGVPI SLVENLFTGEIFTGEDVINWLPDYDWEIDIAPLLSRLVTEGIIFVSSSVSYPH" gene complement(17522..17802) /locus_tag="DP116_10410" /pseudo CDS complement(17522..17802) /locus_tag="DP116_10410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309420.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 17968..18771 /locus_tag="DP116_10415" CDS 17968..18771 /locus_tag="DP116_10415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951336.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10415" /translation="MKDINKLYPALNKILKSPKGILLTLTLTSLFSSGLGVVGNYPAA ASGEDEQIQSSYSSVQISQTTNQNSSYSRIQISQTTNQNSNGLPRRIENAILRDASKR SGVPIRELQITEATAKTFSNPCIFKFGEVCTREFNPIKGWEVVVRVQDNSWTYHVNES GSEIVLDPKVSTSQSTTLPKEIEDAILGDASKRSRIPTSNLKVTKATAKTFGNPCEFK FGEICTKEYNPIKGWEVVVQVGRQSWTYHVNESGSQLVLDPKISGGLKN" gene complement(18853..20679) /locus_tag="DP116_10420" CDS complement(18853..20679) /locus_tag="DP116_10420" /inference="COORDINATES: protein motif:HMM:PF05239.14" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10420" /translation="MRKGSDLIGKVVITYDTGEQIDRVQDLIFDQDSHQLLGLLVDEG GLFRATRVIPFTSIQAIGPNAVVVPSKKAVVKLRAVPEIKAIMKRNNVLRATRIFTVN GRDLGTMVDLYFDEETGSVEGYEVSGGLFADAYSGRSFVPAPQTLKIGEDVAFVPVET AQLMQEQIGGIKGAVQAAGDSLQVTTAMAGLKLQEATQTATEKLQETTEAVGKRLQDV NQATVTSITNAIVDPAAQKAFVIGKVADQNVIAADGTLLIMQGETVTFLIAHSAERLG VLDQLYRATGGRVAKELNRKIQQTANTTSAQFQEIVDTTSEKLQQGAGVANEKLREMT RTAAARFTNAVVDPEEQKALIIDRVVDGDVIAPDGTLLIAQGQLVTLEIADEAERQGV LDQLFRAAGGSLSTELSNLANNFLAGHVVEQALGRRVHRIVQTNEALVVAAPGQIVTP QVIKRAQTYQLEQALLNAVGLTSTEAAYANANKTFADTGERVVDGVIQMRDNASTLLV VLAERFEHLRKQALQVLEEQRMKQALGRPVNRIILDREDNIILSPGDIITHRAIELAR QADVLDILFSSIWKTPELPATVLLNSSPKAEVKHLKSVVLSS" gene complement(20690..20899) /locus_tag="DP116_10425" CDS complement(20690..20899) /locus_tag="DP116_10425" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10425" /translation="MPRLVGRRSNGSLYVLSAFIVAIAFAGVLRYSGVINIQNLVEQT KIRFHKSSLPAEFSIANLLDKQISA" gene complement(21143..21346) /locus_tag="DP116_10430" CDS complement(21143..21346) /locus_tag="DP116_10430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015168991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CsbD family protein" /protein_id="PRJNA477356:DP116_10430" /translation="MGLEDRIKATAKNIEGKIQEVVGDITGNTQDQVEGKAKQAEAQA RHVVENVKDKLEEIKNQVKKGLE" gene 22009..23481 /locus_tag="DP116_10435" CDS 22009..23481 /locus_tag="DP116_10435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314974.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_10435" /translation="MSSEFSHQTEATDEGQDWYFGTEELLDALPQAWTRSSLYLLLAF AVTVLPWAMLSKVDETGSARGRIEPKGATQKLDSPVGASVTAVKVKEGETVKAGQVLL ELDSEILKTELRQVQTKLEGLKNRRESLQLLKNQLMLSVQTQQQQNQAQLLAKQSQVD QARRNLDTLKTLYNLQKEEKQAKVDQVQEALYSSKAAYKLAEVRFQASQGKVPRYKKA YEDGVISQERFLEVEQLAKENYEHLVQAQSDIAQAQSSLKEQQSSYQKTIQQAQSQIE QAELRFKEEQRNYQSRVHAGELAQLKTEEELKQLQSQINSVHSEIAQTGSQMVSYKIQ LQQRVVRSPINGVIFEFPTTKPGAVLQPGQRVAQIAPLRAGLVLKAQMPNQHSGFLKL GMPVKAKLDAYPFQEYGIVSGKVNWISPDSKVQQTPQGNVENFELEITLNHQYIQNGK KRIQFIPGQTATAEVIIRQRRVIDFILDPFQKLHKGGLDV" gene 23641..23880 /locus_tag="DP116_10440" CDS 23641..23880 /locus_tag="DP116_10440" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10440" /translation="MHGGLQVVQQSFSTPTSSVSTYAASDSSSGQNVSYFLDPATGKY GYTIDYGYKYAVAAAAGSAANEPTYTTSYASARAS" gene complement(24166..>25215) /locus_tag="DP116_10445" CDS complement(24166..>25215) /locus_tag="DP116_10445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA-dependent DNA polymerase" /protein_id="PRJNA477356:DP116_10445" /translation="GTNKRVIELDIEKCFDRINHSAIMNNLIAPQSMKRSIFRCLKAG VNPEFPEQGTPQGGVVSPLLANIALNGIESIHRYHINGGKARITPATSPSDIIEPTIR YADDMVTILRPEDNAEEILGKINQFLADRGMNVSKKKTKITATTDGFDFLGWHFKVQS NGKFRCTPSVDNFKAFRQKVKTIVNCSNYGSKIKAEKLAPLVRGWRNYHRFCKMDGSR NSLYHIQHRAFRVFNKEVKNDRESSSILIKKAFPTVPYSENKFINVKGNKSPFDGDIT YWSKRNSKLYDGETSKALKRQNHTCGHCGLKMLSEERVHLHHIDNNHDNWKTKNLLAV HESCHDYIHMSKEKS" BASE COUNT 7609 a 5524 c 4928 g 7154 t ORIGIN 1 aaataggact gctatatgat caatcactca cttattgcaa tattctacta aactttggag 61 acacatatga acttactcaa aaccgttgat gccaaattct ttcacactga aatccctgct 121 aaaccaatgg ggaactatgt tcattttatc acgattcgcg tcactgaatc ctatccttta 181 tttcaaacag acggtgaact caataaagca aaagtccgtg ctggaattga aaataaagaa 241 cctattagcc gtttagcaat gtttaagcgc aagcaatcca caccagaacg tttaacagga 301 cgggaattac tgcgaaaata cgaaattggt gacgcaaaaa actgtgatta caatgtagac 361 ttcagcaaga cgactcctga ttgtattctt tatggttttg cgattggtga ttccggttcc 421 gaaaaatcta aagtcgtcgt agatacagct tactctatta ctccatttga agactcacat 481 ctcaacgtta cccttaacgc tccttttgaa aatggaacaa tgagtcgtca aggggaagtg 541 actagcagaa ttaacagcca agatcatatc ttacctcaaa tctttttccc aagcattgtt 601 actttaaaag accccacaga agccggattt atctacgttt ttaataacat tctcaggact 661 cgtcattatg gagcgcaaac aacccgtact ggtagagtga gaaatgaatt aatcggagtg 721 atttttacag acggtgagat tgtcagcaat ctacgatgga cgcaaaaaat ctatgacttg 781 atgcagcaca aaggagaaat taatcttcca gatcctcttg atgaagatga agttgtaaaa 841 gctgctactg aagcaatgat tgctctgatg aaagacgaat gtatcactca tactgatttt 901 attggtgaat catttacatc ccttttgaat gaaatcaaat ctatcaccag tcatgaaact 961 caactgcaag ctatgttgac acaagcaaac gcagaatcat ctgcatatgc tcaaacatgg 1021 gttttaaaat cagcaaagaa agaaagcaaa aaagattcca agagtcagaa aaaagctggc 1081 gctgttgcgg agtaagcaat atggcaatta tttaccactg tcagctagaa ctccacgaca 1141 gcttatatta cgcaactcgt gagataggaa gactctacga gacagaacta gtcattcaca 1201 attacgcact ttgctatgca ttgggattag tagatagtga aatctactct acaactgttg 
1261 cagaagaaca ttcctatcgt tacttttgtc ctgaacaagt tcccaaatat gaggagcatc 1321 taacaccgct caatcaccag agcatttaca tcactccagc gcattcaatt aaccatacta 1381 caattctcaa tacctggaag tatgctaata acaactacca cgttgaaatg gagaaaactc 1441 aaaagaatat ccctagtttt ggtagggcaa aagaaattgc accagaaagt cagtttgagt 1501 tttttgtaat ttctcaaaag caactcagac tgccaaaatg gattcgctta ggtaagtgga 1561 tgagtaaagc agaagtacag actcaagaag tcaccaaatt agaattaaaa actggtaact 1621 tctcttttcc ttacccattg aatcctctag atgtgatgtt tactcatcaa gtcgttagct 1681 acgacgtagt taatatgcct cctgtgagtc tgattcaaaa tgtcagcatt cgccaagggc 1741 aatattacga gtttgaaaac cccaacaagt ctgaaaaact acgacttcca gccaaaatgc 1801 agtatcggtt taagggttaa atcgtgcagg gtgggtaata ctcaccctaa aatccaataa 1861 tactaaaatc caaaattaaa tatgccccac agtctcgtcc tcaacctcat cccccaatct 1921 cccatctacc ccgaattcct cactggtaga cactaccacg ccctattcct caccctcatc 1981 agttcagtcg ataaagattt aggagactat ctacacacat ccaacgccga taaaccgttc 2041 accctttccc ccttacaagt cacacgaaat cataaaagcc atcaacacaa acaccacacc 2101 ttgcaatttt cccatcaacg cctaattcct ccaggtacac cctgttggtg gcgtatctca 2161 ctcctagacg acaacctatt tagcaagctg actccactgt ggctaaatct taaccctgaa 2221 cacccttggc atttgggttc agcagattta tacattacta gcattctcgg tacaccacaa 2281 tcaatgcaac cttgggcaaa tgcttgtact tacacacaat tgtatgagca agcaaaagag 2341 agcgatcgcc caaatcaccc tctcaacctc accctcgcca cacccgtagc cttccgccaa 2401 ggaggatacg ataccattct cccaattcga gaatgtgtct tcaacagtct tctcagtcgc 2461 tggaataaat atagtgggat tgaatttacc aacatcccca tcgagtcaat ttatccaagt 2521 tttgtcaaca tccatacaga agttattcgt aactatgaca acaaatttgt tggttgtgtc 2581 ggcgaaataa gttatcggat tttaggagat attgaaccaa tcgcaatcaa acaaattaac 2641 gcccttgctg actttgcatt atatgcagga gttggacgta aaacaacaat gggtatgggt 2701 atgacacgtc gcttatcatc aagtgttgaa aattatgaat gagacagaat acattccaat 2761 tgcttcctta aaccaatatg cctactgccc acatcgctgc tggcggatgt tttgtgcagg 2821 agaattcatt gataaccaat acacaattga aggaacaagt ttacatgaac gcgtccacac 2881 acttggagaa ggacatcgtg aagaaacctg gcaagttcga gcgatttggc tcaaatcaga 2941 
caaatacaaa cttattggta aatctgattt aattgaatca gaaaacggtg agttatatcc 3001 agttgaatac aagcgaggac gtaagggtga atgggataac gatgagttac aagtttgtgc 3061 ccaagcttta tgtctagaag aaatcacagg acaaacagtg acgactggct atgtttacta 3121 cgcacactca catcaacgtc aattagtaga aattacggaa gaattgcgtc aaagcacaat 3181 cgccacaatt gaagctgttc aaatgcttct actaacaggt attatgccaa aacctgtcaa 3241 gacaaaacgt tgtgttggat gcagtttgta cacacgatgc ttgccagaaa ttgtggacaa 3301 agtaggacgg tatcaagaag tttactaaac cagagatttg acatttttgc atcactttca 3361 atgagcctcg aattgctcgc ctcggaatta attccgaggc tcaaagacca agtctactaa 3421 agtagactga aaagcttatg cagtcatctt tagatgactt tgactatgag cctgggattt 3481 aaatcccagg cggacgaaaa tactgtttac tgttcactgc ttcatctcaa ctaaatctta 3541 acactcacta tgggcacact ttacatcaca caagacgatg catttatcgg caaaatcgat 3601 gagagacttc acgtcaaatt cgataaaaag acaattttag atgtaccgct tatcaaaatc 3661 gatggtgtcg tggtgttagg acgcgccaca gtttcacccg ctgctgttaa tgaattatta 3721 gagcgtcaaa ttccgctaac attcctgaca gaaacaggtc gttatttagg acgtttagaa 3781 ccagaagtta ccaaaaatat ttttgtgcgt aaagcgcaat ggcaagcagc aggtgataca 3841 gcccaagcaa ttcatgtcgt tcaagggttt gtacgtggta aactgaaaaa ttaccgtaat 3901 acccttgttc gtcgtcaacg tgaatgcaac aacctagatt tatctgcttc tattgaacgc 3961 ttagatcatg tcattgtacc catcgactca actcaaaaca ttgattcttt acgcggatta 4021 gaaggtgcag gtagcgccgc ttactttggc tgctttaatc aaatgattcg caatacagga 4081 ttcaccttta ccaagcgcgt tcgtcgtcca cccaccgatc cagtgaattc tttactcagc 4141 tttggttatt ccttgctgtg tcatgatgtc caaagtgcag tgaatattgc tgggtttgat 4201 tcttatttgg gatatttaca ttgcgatcgc tacggtagac cctcactcgc gttagattta 4261 atggaagaat tccgtccttt ggtggtagat gcagtcgtgt tgtctgcgct aaacaagcaa 4321 tttctgaaag ttgaggattt cgtgacggaa cctttaagcg gtgctgtttc tctcaccaac 4381 gaaccaagaa aaacttttct gcggctgtac ggacaaaaga aattatccga atttaagcat 4441 cccgtcttgg gacgcaaatg tacttaccag gaagcgtttg aacttcaagc tcgattactg 4501 gctaaatatt taatgggtga aatcgagaaa tatccaccat tggttttgaa gtaggcgcaa 4561 gagtcatcag ttatcagtta tcagttacca gttaccagtt accagttacc agttatcagt 4621 taccaagtta 
aggggggtca agaaatcgct gcgtaggtag ctgttcactg ttcactgttc 4681 actgttcact gttcattgtt cactgttcac tgttcattga ttcggtcaat ttgcctatga 4741 atgttgttgt gtcttacgat atttctgaag ataagcgccg tacaaaaatc cacaaagtcc 4801 tcaagtctta tgggcagtgg gtgcagtata gtgtatttga gtgccagctg agtgatactc 4861 aatatgcaaa gttgcgatcg cgcttgcaca aactcattaa gcctgatact gatagtatca 4921 agttttattt tctgtgcgcc tgctgttttg gtaaagtcga acgtatcggc ggcgaaccac 4981 ctcgcgatga caccattttc tttgccgaat gcgcggatgg gtaggtgttg gaaaataggt 5041 tttgaaaaaa atggctggat tgcttacagg acgaggattt ctggggaaaa ttgtttataa 5101 ccatccgcgc tccttgccta gactggattt cagccatttc ctgtcaatca atttactttt 5161 ttgcatgcta tcattagatc acccgcgatt tagaaccttg aaaactttat atagagcagg 5221 tctcagactt ggatcggttg aaatttctct tactccctat tagggattga aacaacattg 5281 atcctagtga ccctgaaaag ggcatcagtt gaaatttctc ttactcccta ttagggattg 5341 aaactggaac cagaacaaaa ctaaacgaat tgcatatgag agttgaaatt tctcttactc 5401 cctattaggg attgaaacaa ggaagatgaa ttattgactt atagcatgtt tcatcgttga 5461 aatttctctt actccctatt agggattgaa acttaaaagt caatggtcca gcatcaataa 5521 ttgtggcgtt gaaatttctc ttactcccta ttagggattg aaaccgcgcg tacaacaatc 5581 gtgaaaaaaa gaaaatagag ttgaaatttc tcttactccc tattagggat tgaaacttac 5641 attccagtta ccgtagagga ggcgatcgcg agttgaaatt tctcttactc cctattaggg 5701 attgaaacga tttggaatat tgctcagtac atatatcaag agcagttgaa atttctctta 5761 ctccctatta gggattgaaa caaattgttt ctagatctac acctttttcc gcccaagttg 5821 aaatttctct tactccctat tagggattga aacagctaac aaagatgggt ttcaaggttt 5881 gcggtttcct gttgaaattt ctcttactcc ctattaggga ttgaaacttt tcaacataaa 5941 acaccacatt agtaatactg ttggttgaaa tttctcttac tccctattag ggattgaaac 6001 tgctagatga tacaggaaat gaaagggaag acatgttgaa atttctctta ctccctatta 6061 gggattgaaa cttgagggta ttagtggcgc gaccaaaccc caagggttga aatttctctt 6121 actccctatt agggattgaa acaggtagta cccatacata ttgatagggt ttgaacaggt 6181 tgaaatttct cttactccct attagggatt gaaacaatat ctgagaattc agtaaatgca 6241 gaaatcaagt tgaaatttct cttactccct attagggatt gaaacacggt ttatacacat 6301 ttgaaatttc taaaattccg 
ttgaaatttc tcttactccc tattagggat tgaaactcta 6361 taccttcaca ttgcacccat ttaccattta cagttgaaat ttctcttact ccctattagg 6421 gattgaaact aaagtcaagg tcaaaaagcg caattcaacc atgagttgaa atttctctta 6481 ctccctatta gggattgaaa caagttcgct cttgtggtta actctcctgc tgtatagttg 6541 aaatttctct tactccctat tagggattga aacaggcttc taaaactgtc tactaaggaa 6601 gaaaccaagt tgaaatttct cttactccct attagggatt gaaacgtaag cagtttctgt 6661 gtgtcaattc taagtaatta gttgaaattt ctctttctcc ctattaggga ttgaaactca 6721 atgaggtaga ccgagtatgt atgctatggc ttcaggttga aatttctctt tctccctatt 6781 agggattgaa acatgaagtc tgagaaattt ggagtatcta tagtagagtt gaaatttctc 6841 tttctcccta ttagggattg aaacaatgcc gtactcagca caatttgaaa tatcaacaca 6901 tgttgaaatt tctctttctc cctattaggg attgaaacag atccaccaag tccatcaaac 6961 gcgcacggat gtggttgaaa tttctctttc tccctattag ggattgaaac taattgggat 7021 tgctgcaaag tttatccaac tgccaggttg aaatttctct ttctccctat tagggattga 7081 aacgaaagga tgatcgtatt taaagttttc aaatttgcac cgttgaaatt tctctttctc 7141 cctattaggg attgaaacga acatcctgtt gcggcagtcg ataaacctgt tgttgttgaa 7201 atttctcttt ctccctatta gggattgaaa ccagcgatcg cccctatcgc catcccctcc 7261 ggcacatccg ttgaaatttc tcttactccc tattagggat tgaaactaac gggattgaaa 7321 caggcgattt ctatggaaag cttttcttct cctgtcgcag acactgcgcg ccgagagttc 7381 ctgtctgcag gacttacgct aatgctcaac tacgacagcg ttaaaataaa cccataaatc 7441 gtaacgccat acccactcgc catgtcccta gctaaggaat tagacactcc ccaagatatc 7501 ccagaagatg ttatccttcc tccaggtgat ttatacagcg acgagcctcc cttggaaacc 7561 gaactacatc tacggcaaat aatcctactt ttcaaatgcc tggaatggtt gtggcgagat 7621 aaaacagatt tctacgctgc tggcaatctt actatttatt acagcctcag caaacgcaaa 7681 tcagaagact tccgaggacc agattttttt gtcgtgttgg acaccgaacg caaaactcgt 7741 aaaagttggg tagtttggga agaagagggc aaatatccga atgtaattct agaaattctt 7801 tctgaatcaa ctgctaatac tgataaagaa tttaagaaaa aactttatca aaataccttc 7861 cgcacccctg attatttttg gttcgatcca tacacatcag aattcgctgg ttttcactta 7921 ttagatggca aatatcaacc tttagaagcg aataatcaag ggcatttgtg gagtcagcaa 7981 ctagagttat atctgggaat tcatcagggt 
ttattgcgat ttttcacagc atcaggacag 8041 ttagttccaa caccagaaga agaagctgaa tcgcaacgtc agcagaaaga attagcaata 8101 agtaaagcag agagattggc tgctaaattg cgggagttaa atattgaccc agacacaatt 8161 tagcttaaac gggcatcata ttataatgtt ggcagagcgt tcgcaattgc aaatgccatc 8221 aatggaaaag aatatcaaaa ggctaaacga attgatatca aaagttgctg gcagcatcaa 8281 aaatctgttg tgcggttatg tttagttcgc ttaatgttgg ggaggtaatg cgattgcttc 8341 ccctgaactg ggcaacttgg tactcacctt caaccaactg gtagacagaa atagttggtt 8401 gtttggggct gccaataaac gcccgtccgc ccaatgctgc ataatccaca atccagtatt 8461 caggaatacc catttcctca tagtcagcaa gttttttatg ataatcgtca cgccaattac 8521 tgctgaccct tacgggttcg tcagttgctt caacgcgggg aacccgcgca acgcactgac 8581 tcacaacttc aaccactagg ggtactgatg ctgcgtccga tatagttgac tcttttttcc 8641 acagaggttc gtttatcaaa ttgggacgat ttaataacag cacatctggt gaataagctg 8701 atttgccctc aattggttta ataaaagtgg ttttggggat gaagtaagga aggtttaagc 8761 gactatattc caaagtgagt tttgtagcta aaaatccaac tacctcttca tgatcacctg 8821 taggtggtgc catctcaaca attactccat catgcaattc atagcgtcgt tgcgggttct 8881 ctggatacca ggcaataaat tcatcgaagg ttacttgttt ggttaaggct tgagtcatga 8941 tagactacgt attctgctag tggttctttt ttaaatgaac tatcaatcta aatcgtgacg 9001 attttggctt catcggcgga ttgtttacct ctctagtttc gcacaggtat agcttagtgc 9061 tgccaatctt ccacagtaac attaacatgc agaagatgca gagacttgtt tttaaactaa 9121 aattgcttcc ttattcccag ggctgaatca aaaagtctcg cctgccctac tttctattag 9181 aaactgaaac tccacaacga tacaactgat aaggaacgcc aacgcgataa tgctgcgagg 9241 aatctatggc gcaagtcatt ttctgagaaa gtgttatctg ctgcggatag tctcggcgtt 9301 ttctacctgg accaacaagc cataaattaa gatgagtcgc agttggtgga ggaagttgag 9361 aaagtttttg caacacaggt gcaaagtctg gtgattgctt gaagaaagca acgtgagagg 9421 aggaatattc ttctgcttcc ctaagtggtt ccaatgctaa agcgaaactt aaccctaatg 9481 ctacatcttg gtaatctctg tatcctacca ccattaaaat aggaatagaa gggttttgat 9541 tcatctttcg cgcgacttgt tccggctcaa atggtttttg aaatactaag ttgaaaacaa 9601 gaaaaacaca actaagaaga ctgacaagaa aaagaatgac aaaggaagat ttaggaagga 9661 aaattgaatg gtttcttgtt ttttcattct tgttaatttg 
ataccaagtt gcgttttgtt 9721 gtagtggcgc tttcggggag attgcttccc tccacttcgt ttcccgccac ggcttcgcct 9781 cgcggcttcg actcgcaatg acagctacta cctttgattg caacttggta tgagcgaaac 9841 tcgctgcgac tagggcacaa aaactgggat aataaacaaa gtgatagcgg ggaacagcgg 9901 taatatcttt gccgaggaaa taggcgatcg ccaagaattc caataacacc cacacacaga 9961 aacttaacaa agtcaatgtc gctaaatgcg ttgattgttt ataccatagc tgctttaaac 10021 ctttgaaaac ttgccaaccc acccaaatac caaacaaaag cattaacaac ccagatacga 10081 ctgcaattgc aagtggttga ttttctacag gtagagaaat caccatcagc agccagctaa 10141 taagagtttg ataaattggg gaaatatgtt gcggaggatc cagccaacca gtttcagaac 10201 gattgtagtc atgaaagaca actggtaacc acggcaaaaa actgatgaca acaacactca 10261 cacataaaat gagtgctagc caaatttggc gcttgttaag gatgtttccc gaacgccagc 10321 acatcagcac cagtagtgtt gcaatctgcg caatgaaagc aagaatacaa aagtaatgaa 10381 cataaaaact gatactgttg atgattgccc atcccagcac aacccaaaat ctcacttttt 10441 gccgcttttc aatatccttg acaatttgca ccagcccaag taaagctaaa gtgatcagga 10501 acatgggcaa tgtgtaatgt cgtgcttctt gggaaaggta aacagccaga ggggaaacag 10561 ccataaaagc tgctgcgatt atccccgcag acgggaagaa agcaatacga ttgagatagt 10621 aaattgcgac aatttcagcg acaccaaaaa acacggctgg cgatcgcaac ttcccaaccc 10681 actcacctcc caaaggactc aaccacttca accaactgta catcccacaa aaaaacagcg 10741 gcggatgagt agactgactc gcaatatttt tagcaatttc agaacaactc acccctggtt 10801 ggacagtaaa aatctctgac acgcgttcca gaggtaacac cacttccatt ggtaaatcat 10861 tgtagctttt acccaaactg aaaatcgcag taatcacctc atccatccac agaggtttga 10921 gatccaaatt ccaaaagcgt aaaattgcac caagggcgat cgccccaacc aaagcgagat 10981 aatgtagata aaatttacga ttagccattc cttctaaatt acaaccgcag atcaacgcgg 11041 atgcacgcag ataaaaatct aaatattatc tgcgtcgaga gtgcgtcgag ggtggggttc 11101 caaatctaaa atcttattag ataagacgct tatattaaaa atcagtgcct caagaatcga 11161 aatctataga accatctaca ttacctgtct cttccccgcg agaggcagcc atagcaaata 11221 ttcgtaatac tctccgtcaa ggacagcagc gtatggctga ctggaactcc ggttcactcg 11281 ctgtttcagc cgttcccggt gcaggtaaat ctacaggaat ggctgcagca gctgcaattg 11341 cgatcgccca tcagtatgat caatctacaa 
aatcacgtcg aaatatccgt cgtcaactca 11401 ttgttgtcac ctttactcgc tcagctgctg ctaatattaa aattaaaatc cgcgaatacc 11461 ttaaagaatt atctttacca caaactggct ttattgtcaa taccctacat ggtcttgcat 11521 tgaacatagc cagccgttat cctaatttat caggtttaga gttagaaaat gtcactttaa 11581 ttaccccaaa tcaaagtcat cgattcatta ggactgctgt agaacagtgg gttgctagcc 11641 atcccggacg ctattctcgg ttattagaag gtattgaatt tgacggagaa gaaacagaaa 11701 gactgcggcg acagtcggtg ttgcggacgg aagttttacc agaactagca actatagtga 11761 ttcatgaagc aaagagttct ggcttattac cagaagattt gcgtgaattt ggcaaacaaa 11821 ccacagagga atatgacatc ttaagtattg cggctgggtt gtacgagcaa tatcagaatt 11881 tgatgcgatc gcgtgacttc attgactacg acgacatgat attagccgcc ctgcgcgtat 11941 tagaaaacga aaacacccgt aaaatcgagc aaaatcaagt tttcgctgtc tttgaagacg 12001 aagcacaaga ttcgagtccc ttgcaaacgc gcctactcac aattctcgcc accaaccccg 12061 acaatcccaa cgaaccacca aatctactcc gcgttggcga tccaaaccaa gcgattaact 12121 caacctttac cccagccgat ccgatttatt ttcgccaatt ctgcaaagag tgcgacgccc 12181 agcaacaatt agtgacaatg gatcaagcag gtcgtagtac gaaaattatc atcgacgccg 12241 ctaactttgt tttggaatgg gtgaatagtc tttatgtgaa aacgggacaa gacaattcaa 12301 ttcaaaattc caaattcaaa attccaaatt tggaatctcc ctcatctccc actccctttc 12361 gtttccaaag aattcgccct gttgaaccta attaccgtaa agctgatgtc aatccgcaag 12421 ctgtgggact aggattggaa ttgtacacac cgcgtgacat tcatcacaca gtcgagttgc 12481 tgtccgagag gattgtagag ttattttctc aggaaccaaa cgctattagt gcagcgattt 12541 tagtacggga aaaccgtcag ggacgatggt tggcacaagc gctgacttca gtgtgtaaag 12601 agcataatat tcttttgtat gatgtggggg aacgcgatcg ccattctcac gtaccagaag 12661 aagttttggc acttttacaa ttttgcgatc gcccccattc tcccgactac ctcaaaagag 12721 cattagaagt ttttgtacaa cggcgtttga ttgaaaccca agatctcaac gccctcgcga 12781 ctttaccaga agaatttttg tatcctagtc ctttagcttc acctcaatca gaatcagtcc 12841 aaaaagctgc tcacctgtgt cggagtttac ttcgtgcccg tttagaacta cccctctacc 12901 agttaatttc attcatcgcc ctaacactca attatgagca agcagaactc gcaacagccg 12961 ataaactagc agaaagagtc atccagcaaa tagctggcaa tacttcaatg ggttcaatgc 13021 tgaatgcatt 
gagtgaaatc gtcagttccg aacgctttga accagtagaa acagaagact 13081 tagaaaaacg atacacccgt aacggtcaac tgacaatcat caccatgcac aaagcaaaag 13141 ggctagattg ggactacgtt ttcatgcctt ttctgcatga gaatctaatt cccggtagat 13201 tttgggttcc tccccaaagt cagtttttgg gtaactttac tttatcagaa gttgcccgcg 13261 ctcaaatccg tgctgctctt cacggtcaat ctaccttacc cagcatcacc gtagcttggg 13321 aacaggcaaa acatctcaaa acagctgaag aataccgctt actttatgtt gccatgacac 13381 gagcaaaacg cctgttatgg atgtctgctg cccagaaagc cccatttacc tggagtaagc 13441 cagaaaactt acaagagcaa accccttgtc ctgtttttcc agcactcaaa cgccagtttc 13501 ctaaaagtgt acgaaactaa gggtgtaggg ctgtaggggt gtaggggtgg aatccattcg 13561 cttgacaaca gttgatcatc caatacggtt gctatgcttc tggtgttgcg tgcaatacac 13621 aattgtaatg aaagtgcatg cactttcaca ttaattcttg aaattatcca agttatacca 13681 attctccatg aagatgcact taatatttat taaacgttag cgtagcgtgc gcccttggcg 13741 cataccgcca agacgcaaag aacgccaaga attaaagagt gagtattaag tgcaagttta 13801 cagagaactg gtattagatg tgttttagct tattaatcaa acaaatacct ctttccctta 13861 cacccttaca cccctgtttt ttttcaatgc aaaacatcga ggtgcgtccc gtcaaagata 13921 aacaagaact agatgatatg ttccaccaaa ggtggcttgt tctaagatca cctttaggaa 13981 tggacaaagg aacagaaaaa gacaagcatg aagatagcgc ttttcatcta gttgctgttt 14041 gtgatcataa agtcgttggt tcagcgagac tgcgtttact atcaaaagaa ttgggaagca 14101 ttgcctatct cgcagtgcta cctgagtttc gccatcaagg cattggtaca aaactcatgg 14161 aaaaattgat agaaatagct catgaaaaaa atctcaacac tttaagatta atgtcacgag 14221 ttcacgccgt caacttttac aagcgactag gattttgcga agtaggggag ccattttatt 14281 atttggatgt gagtcatatt tttatgcagt gcaagctaca gcagtctcag aaaaatgcga 14341 aagactagct acagagtatt taacgaacaa tgactcttgc ttggcattgc aagaacccct 14401 catctcaaga gtcttactgt catttttcac aagcttagct tggtaagtgt caagagaaca 14461 atcaggtttt tcaacagtta gttcagccag aattgttgtc ttatctgaac tctcctcaat 14521 aattctgcct gcaactctgt aattaaacca atcccatccg tggcgaagaa tcagttctct 14581 ttccaaaacc tgaatagaag ttggcagaat tccccaacct cgatagactt tattcaagca 14641 ggtgatatca ccagttcgcg tcaaaatcga ccgaaatgaa tcttgatcga gaacaccata 
14701 gtatcttcct tctggcaagt ctattattgt tggtgcaaat cgatgtccac caaagtgact 14761 tgatttccaa atccgaacat tgtccaagca caaatcttcc gacatcgcta cagcttgggc 14821 ataaaaggga tttccatacc tagcacagca aagatcatgg ctaccgtggg tacagactaa 14881 aatatccctg gttgcactgt ttttcacttc ataattgcta gattgaccag ataaccattt 14941 tctgacaatc cctgctgctt gctcaatatg tgccagatga aactcttgtt tgtggtatcc 15001 gttactgagt ccctcttttt tctgataaat taatacagtt gtaccatcta ccttatgtga 15061 taagttatta gcaattaaga gaaattttac cggaatttta cccgacttac acttttgaat 15121 taagagcctc aaattctctg gaacccacct agaattcaag gcttcggctg tccaaggagt 15181 gcgacactca atcaatatgt aagtctggtt attggtgccg ctgccaataa tatcttctcc 15241 tatttggcgt gaatgatcgg aacaaaagaa agtgttcatt tagtggtaat gattgttttt 15301 tttgagatta atgaggatat gacacagagg aagaaacgaa aatgatgcct tcagtgacca 15361 agcgagacaa aagaggagca atatcaattt cccaatcata atctgggagc caattgataa 15421 catcctcccc tgtaaagatt tctccagtga ataaattttc cacaagagag ataggtactc 15481 ctttgagaga tacttcttta ccagctacaa caattttgta gccactgcta tctggcagtt 15541 cggtaattct agttcgctga aacttggcgc ttttgaactt ggtttgcgtc ccttgttcaa 15601 atatatgaaa tccagcttga taaggtagag agtagggagc gatcgccttg cccaagccat 15661 caagataatt catataatca tcagaaagat tgctatgagc aatatgttta cttaattttt 15721 caattaagcc atctacatag ttctccacag ccgccgtctc tacacgtaat ggcatacttt 15781 tgcgccactc ttcctgctga cgcagttcgt taactaacca ttccagaaag tcaacacccg 15841 ttttacaatg tacccctaaa gtcaggtgca gtgatggttc atcaacagca actgcatagt 15901 gccaatgacc acgaggaata taaagaacgt ctccaggttt gagagtgcaa gttaaataag 15961 cttctccttc tggaggcgag aaagaagatg atttctgttc tggtaaagga tacttaatcg 16021 tgtcataaaa tacataccat tgctttgtac catctatttg taaaataaag acttcatggg 16081 tgtcatagtg tgaagagaat ccctgtttat gaggccagga acagtaggcg ttgacttgta 16141 caccatagcc gaaatcatac ttcatttcgg aggcgaaagt agccagttct ggaatcagtt 16201 tatgtactcc gttgaggatg agagttgcac cttcttggca tcgctggatg atattagcat 16261 ttgcactttc atccaagact ttttcatcta atgctaaacg gaggtcagga tatttgaact 16321 gatgaaagtt caaaagatat gtgagttttt cccaagaaaa 
caagtgagca aatttcgttt 16381 gatctccact agagataaac actgcttcgc ttgtccaatt ctttttgaaa aactcttcaa 16441 cggtatatgg ctttagtata tctccaaaca taattaattt ataattattc tcaataaaat 16501 gacaaaaaaa taattagatc caatcatgaa cgggagtagg aggtatcata gcacctactc 16561 ctgagtacaa ctcatcatta atacggttcg gaaatcaaaa ttatgccaaa acagggaaat 16621 gaataatctg ttaagccaat tccgaacccg ttttataatg acgtgtactt agtgtttaag 16681 tgtaatggga taattagcaa ctattagctg cttaaatcat caccatcagc atcagtattt 16741 aagtcatcac catcagcatc agtatttaat tgaccagtag tatttttact gagttcctga 16801 gccaactctt tgacttcaac ttgaattctg ctggaacgga aatgttttgg agcatttttt 16861 tcaaattgat ttgacatgtt gcctccctga gtgtcttagt cagcttttca ccgagtttca 16921 gttcctgatc gtttcaccaa gaatcttgtc taggtgttca ccttgactgt aaccgaactc 16981 aataccaaat gcaagtcatt actaagtaat atagctaatt ctaatacctt tcgcctccat 17041 acagcagtaa tattaagctt atctggtagc ttaactggta tttaactggg aaagtgttgc 17101 catagtgacg atttgcttat atctagttgt actctaagaa taatagataa attttctcat 17161 ttttttccta agacataaac aattatattt tggtatatag taatatatta tacttacata 17221 tcgaaaaatg ctaatgcgac agatgcttca gttgggtaga gcataaacgt tgtaggcgtt 17281 gggttgaggt actctaccca accaacttac attggtatta atttgatatt tttatgtaag 17341 caataaataa taataattca caagagtatg tcaaaaatct gcaattagcc gagcctttct 17401 caggaaaaat cgcagtacag tctcttcgcg ggaaataata tcaactctac cctccattcc 17461 caattgaatc tgacactgat tttcccctct acctaagccc tcccttgccc gctgcgcgaa 17521 caaagaaatt ttggtatgca cgggctttgc attgagactt acagattgca aaacctgtga 17581 ataacactct cctagccatt cagtttcttt atcttttttg agtgctggca acattgaatt 17641 taatccagat tgagacaagc ttttgccagt tgctttatag gtttcaatac atttgtttaa 17701 cccataattc caccaccacc gcgcacatcc aaaatgctgt gcaagtatct tgacttgttg 17761 ttctgtgggg tagaacctaa tttttacggc ttgtcgtctc atctaaactt ctccaaagaa 17821 gacagaaaca aaaccggcac atcagccgaa actttcaaac ttgagtttgt agtctctaat 17881 catagagcaa aaacaaaaca gacaaaaacc gcgctaaggc tatcaccctc agtcgcgatt 17941 tgattgattc aatcttcaag gtgttctatg aaagacataa acaaactata tcctgcactt 18001 aacaaaatcc tcaagtcccc 
caaaggaatt ttattgactt taaccttaac gagtttgttc 18061 tcaagtggac tcggagtggt tggaaactat cctgctgctg cttctgggga agacgagcaa 18121 atacagagca gttatagcag cgttcaaata tcccaaacta caaaccagaa ctcaagctac 18181 agtagaattc aaatatccca aaccacaaac cagaactcaa acggtttgcc aagacggata 18241 gaaaatgcaa ttctgcgtga tgcttccaaa cgttccggcg taccaattcg tgaattacaa 18301 ataaccgaag ccacagcaaa aaccttcagc aatccttgca tcttcaaatt tggagaagtt 18361 tgtaccaggg aattcaaccc cattaaaggt tgggaagtgg ttgttcgggt gcaagataat 18421 tcttggactt accacgtcaa cgaatccggc tcagaaattg ttttagatcc aaaggtgagt 18481 acatcacaat caaccacact cccaaaagaa atcgaagatg cgattttggg tgacgcctcc 18541 aagcgttcga gaataccaac ttctaaccta aaagttacca aagccacagc aaaaaccttt 18601 ggtaatccct gcgaattcaa atttggcgaa atttgcacca aagaatacaa ccccattaaa 18661 ggttgggaag ttgttgtcca agttggcaga caatcttgga cgtaccatgt caacgaatcc 18721 ggttcgcaac tcgtattaga tcccaaaatc agcggggggc ttaaaaacta agtcagagtg 18781 tagggtgggc attgtaatgc ccaccctact tagatgttat ttaattaatt gacgccgaac 18841 aacacagtcg gctcaacttg acaagacaac cgattttaga tgtttcacct cagcctttgg 18901 cgacgagttc aatagtacgg tagctggaag ttcaggagtc ttccatatag agctaaataa 18961 gatatcaaga acatcagctt gacgagcaag ttcgatcgca cgatgagtga tgatatcacc 19021 aggactcaga atgatgttat cttcccggtc aaggataatt cgattcactg gacgtcccaa 19081 ggcttgcttc atccgttgtt cttccagcac ctgaagtgcc tgtttacgta aatgttcaaa 19141 cctttctgcg agcactacta acagagtact tgcattgtcc cgcatttgaa tgacaccatc 19201 cacgactcgc tcacccgtgt ctgcaaatgt tttgttggca ttggcataag ctgcttcagt 19261 tgaggtcaaa ccaacagcgt tgagtaaagc ctgctccagt tgataagtct gcgcccgttt 19321 tatcacctgc ggtgtcacaa tttgtcccgg tgcagccacg actaatgctt cattagtctg 19381 aacgatacga tgtacacgcc gacctaacgc ttgttcaacc acatgaccag caaggaagtt 19441 atttgctaag ttgctcaact cggttgataa gctacctcct gctgcccgaa acagttgatc 19501 cagcacacct tgacgctcag cttcatcagc aatctctaat gtgactaact gaccttgagc 19561 aatcagcaac gttccatctg gtgcaatgac atcgccgtca actactctgt caatgataag 19621 tgccttctgc tcttctggat caaccactgc atttgtgaac ctagcagcgg cggtacgagt 19681 
catttctcgt agtttttcat tcgcaacgcc agcaccctgt tgcaactttt cactggtagt 19741 gtctacaatt tcttggaatt gtgcgctggt tgtgttggct gtctgttgta tcttgcggtt 19801 taactccttt gcaacacgac ctcctgttgc ccggtagagt tgatcaagca caccaaggcg 19861 ctcagccgaa tgtgcaatca agaaggtaac tgtctctccc tgcataatca atagtgttcc 19921 atccgcagcg atcacatttt gatccgcaac tttaccaatc acgaaggctt tctgtgcagc 19981 cggatcaacg atcgcattcg taatagaggt aacagttgcc tgattaacat cttgtagtcg 20041 cttaccaaca gcctctgttg tttcttgcaa tttttctgtt gcagtctgag tcgcttcctg 20101 aagcttcaaa cccgccatag cagtcgtgac ttgcaagctg tcaccagcag cctgaacagc 20161 ccccttaata ccgccaattt gctcctgcat caattgcgct gtttctactg gtacgaatgc 20221 cacatcctca ccgattttca gggtttgagg agcaggaaca aaagaacgac cagagtaggc 20281 atcagcaaac aaaccgccag aaacttcata gccttcaaca gagcctgttt cctcgtcaaa 20341 gtaaaggtca accatagtac cgagatcacg cccattgacc gtgaaaatcc gggttgccct 20401 gagcacgtta ttgcgcttca taatcgcctt tatttcaggc acagccctga gtttcacaac 20461 cgcctttttg gatggtacaa caactgcatt cggaccaatt gcttgtatac tggtgaatgg 20521 aatgacgcgg gtagcacgaa ataaccctcc ctcatccaca agtagtccca gaagctgatg 20581 gctatcttga tcgaaaatta aatcttgaac ccgatcaatt tgttctcctg tgtcgtatgt 20641 aataacgacc ttaccgatta aatcgcttcc tttacgcatc ttcaaacctc taagcactta 20701 tttgtttatc aagtaaatta gcaatactaa actcagcagg caggctactc ttgtgaaacc 20761 tgattttcgt ctgctcaacc aagttctgaa tattaatgac accagagtac ctaagtactc 20821 ctgcgaaagc gatcgcaaca ataaacgcag ataaaacgta caaactaccg ttagaacgcc 20881 gaccgactag tcttggcatg acattgctcc tttgctccaa aattcagatg acaagataac 20941 tatccacata agaaaaactt agtgagttat gagaacttca attcaaccga cttaaagcat 21001 cggatgatct gtataaaaaa taaaccacaa gatttctcac cagacatgat gtatctgttg 21061 gtgagagtgt atctttataa cagcaatgga ttatggttgg aacactgtta gactctactc 21121 tagactcagc cgagtacttt aattattcaa gccctttctt gacttgattt tttatttctt 21181 caagtttatc tttcacgttt tcaaccacgt gacgagcctg agcctcagct tgctttgcct 21241 taccctccac ttggtcttgt gtattgccag ttatatcacc tacaacttct tgaattttgc 21301 cctcaatgtt cttagccgtt gctttgatgc ggtcttctaa acccatagaa 
tcattccttg 21361 ctgaaatacc agacttaaaa cagtttgttg tttcgtctct tacttaatca aatccttata 21421 tgtcagtatt ttactgacgg tggattagta acagtagtca caatatatgc tagtaatcgt 21481 aactagggct tccctcccag gaaggagctt ttgttacatg tggtatattt agagtcaaag 21541 cctgattcta cttaagaaat tgtgattttt aagtaaagta ttatatttta ttaagataaa 21601 ggttattctc agaatttcct caaatttttt tttaagatta gttttaaaag ttaaatcgaa 21661 aacaaaaggt gcggcaaaaa atgttatgtt acacaggata actcaacggc aagctatagc 21721 aagcttagat agcagatctt ctaagagtat gcatcgcgat gggcaactag tccttgaacc 21781 agagtgtact ttagagtgta ctttacggtc aaccaatcta gtactccacc aatctaaaat 21841 taccagatta aggcaagcaa ggcaagcacg ggaagctgag tccagaggac acgctcttgc 21901 aacgagagca cagaattaag atttttttct gtaagcaacc cgtcgaagtt cctaattgtg 21961 aaccactagt tcagaccaaa gactgattct tgtggtggag gaaagaaaat gtcatcagaa 22021 ttttcgcatc agacagaagc aacagatgag ggacaagatt ggtatttcgg aaccgaagaa 22081 ctattagatg ctttacctca agcctggaca cgttcttcgc tgtacttgtt attagccttt 22141 gcagttactg ttttaccttg ggctatgttg tcaaaggtgg atgagacagg gagtgctaga 22201 ggacgtattg aaccaaaagg cgcaacccaa aaattagata gtccagtggg tgcaagtgtg 22261 actgctgtca aagtcaaaga aggcgaaacc gtaaaagcag gtcaggttct actggaactg 22321 gactcagaaa ttctcaaaac tgaactcaga caagttcaaa caaagttgga gggactaaaa 22381 aatcgccggg aaagcttgca gctactcaaa aatcaattga tgctttctgt gcaaactcag 22441 caacaacaaa accaagctca actattggca aaacagtctc aagtagacca ggcgcgacgc 22501 aatctagata ctctcaaaac tctctataac ttacagaaag aggaaaaaca ggcgaaagta 22561 gaccaggtac aggaagctct gtattctagt aaggcagctt ataagttagc agaagttcgt 22621 tttcaagcct ctcaaggaaa agtcccacgc tacaaaaaag cctatgaaga tggtgtgata 22681 tcacaagagc gctttctgga agtggaacag ttagcaaagg aaaactatga acaccttgta 22741 caagctcagt ctgatatcgc tcaggctcag tccagcttaa aagagcaaca aagcagttat 22801 caaaaaacta ttcagcaagc tcaatcccaa atcgagcaag cagaactccg cttcaaagaa 22861 gaacaacgca attatcaaag ccgggttcat gcaggtgaac ttgcacaact caaaactgaa 22921 gaagaactca aacaactgca atcacaaatc aactccgtac actctgaaat tgcccaaact 22981 ggaagccaga tggtatctta caagattcaa 
ctgcagcaac gagtggtgcg atcgcctatt 23041 aatggtgtga tttttgaatt tcccactacc aaaccaggag ctgtactaca accaggtcag 23101 agagttgctc aaattgcacc attacgagct ggtttagtac tcaaggctca gatgcctaat 23161 cagcacagtg gcttcttaaa actgggtatg cctgtgaaag ccaagcttga tgcctatcct 23221 tttcaggagt atggcatcgt atcaggaaaa gtgaattgga tttctccaga ctccaaagtt 23281 cagcaaacac ctcaagggaa tgtagaaaac tttgagttag aaatcacttt gaatcaccag 23341 tatatccaaa atggcaaaaa acgtattcaa ttcattcctg gacagactgc aaccgctgag 23401 gtgattatcc gccagcggcg tgttatcgac tttatcctag atccatttca gaaattgcac 23461 aaaggtggtt tagacgtcta gtcaaaagtc aacctcaaca gctacctacg cagtgaacag 23521 ctacctacgc agtaccagcc gcagggaaca aggggtggac gagtccgttt cctacctggt 23581 aactgataac tggtaactga taactgataa ctggtaactg aacttccgag tgatactcaa 23641 gtgcatgggg gacttcaagt agttcaacag agtttcagca cacctacaag cagtgtatca 23701 acctatgctg cctctgattc aagttcaggt cagaacgtat cctatttctt ggaccccgca 23761 acaggtaagt atggttatac cattgactat ggatataaat acgctgttgc tgctgctgct 23821 ggttccgctg ctaatgaacc tacttacacc acttcctatg cgagtgcacg tgcttcatga 23881 agcttgtttt actaagtttt agttcaagaa gctattgaat agaagcccca caaaacaagt 23941 ggttttcttc aatagctttt ctgttatctt gacagcaact cagaggaatg aactagcaat 24001 tctcaggaga acaatatcat gattatcaaa gacttgtctc acgtagaaat tgttgctgaa 24061 caagcaaaag aagttcaagg ggcggagtcg ggagggggca ttaccccccg tctctctcct 24121 ttagatccgt acgtgcccgt ttccgtgcat acggctcccg atgttctagg atttctcctt 24181 gctcatgtgg atataatcgt gacagctttc atgtacagct aggaggtttt tggtcttcca 24241 gttgtcgtga ttgttgtcaa tgtggtgtag atgtactcgc tcttcgctga gcatctttaa 24301 accacagtgt ccgcatgtat gattttgccg ctttagggct ttagaggttt ctccatcata 24361 gagtttgcta ttacgcttgc tccagtaggt gatgtctccg tcaaagggtg atttatttcc 24421 tttgacattg atgaatttat tttcggagta aggaactgtc ggaaatgcct tcttaatcaa 24481 gatactgctc gattctcggt cattctttac ttccttattg aataccctaa acgctctgtg 24541 ttgaatgtgg tatagcgagt ttcgcgaccc gtccatctta cagaagcggt ggtaatttct 24601 ccatcctcta accaaggggg ctagcttctc agcctttatc ttggaaccat agttcgagca 24661 gttgacgatg 
gtttttactt tctgacggaa tgctttgaag ttatccactg aaggggtaca 24721 tctaaacttt ccgttgcttt gtactttaaa gtgccagcct aggaaatcaa atccatctgt 24781 cgttgcggtt atcttggtct tcttcttact gacgttcatt cctctgtctg caaggaactg 24841 gtttattttt ccaagtattt cctctgcatt atcttcgggt ctaagtattg tgaccatatc 24901 gtccgcgtat cggatggttg gctctatgat gtcacttggc gatgttgcag gcgttattct 24961 tgccttacca ccgtttatgt ggtatctatg tatactttcg attccgttga gtgcgatgtt 25021 ggctaaaagt ggacttacca cgcctccctg aggagttccc tgttctggaa actctggatt 25081 aaccccagct ttaaggcagc ggaagatact tctcttcatg ctctgtgggg cgatgaggtt 25141 attcattatg gcagagtggt ttatcctgtc gaagcatttt tcaatatcga gttctataac 25201 tcgtttattt gttcc // LOCUS NODE_1199_length_25162_cov_5.20149825162 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 25162) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 25162) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..25162
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            449..1321
                     /gene="aroF"
                     /locus_tag="DP116_10450"
     CDS             449..1321
                     /gene="aroF"
                     /locus_tag="DP116_10450"
                     /EC_number="2.5.1.54"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009460239.1"
                     /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="3-deoxy-7-phosphoheptulonate synthase" /protein_id="PRJNA477356:DP116_10450" /translation="MNSAKLAAKSHPDHQTVVNLSEKVAFGGTELVIIGGPCTVESAE QMETVAQKLSAAPVQALRGGVYKPRTSPYAFQGMGEDGLEVLAKVQAHYNIPVVTEVM SISQIEAIATHVDMLQVGSRNMQNFDLLKALGQAGKPILLKRGLAATIEEFVMAAEYI LSHGNPDVVLCERGIRSFDNYTRNVLDLGAVAALKQITHLPVIVDPSHAVGKRELVAP LAKAAIACGADGLIIECHPEPEKSVSDARQALSLEDMVNLVHSLKPVAASVGRRISED MGAGFKPAPLCCAA" gene complement(1425..2897) /locus_tag="DP116_10455" CDS complement(1425..2897) /locus_tag="DP116_10455" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="TIGR04222 domain-containing membrane protein" /protein_id="PRJNA477356:DP116_10455" /translation="MNTHELDLYQRIQQFSLDDIDAKLSFSKRLARDNNWTVEYAQRV IDEYKKFAFLAIVAGHPVTPSDQVDQVWHLHLVYTKSYWENFCAHVLQTPLHHNPTKG GQQEGYKFNNWYGKTLASYEQFFQQLPPPDIWSPPHIRFGRDIHFVRVNTQQNWIIQK PDFSFLTKFRFSQSAVWMLSLLALVSIITWDVPALASFPNPINRSLPEFFKFYLLVGC IGLLFIGLLGLLLQRFKNDSRTAWVLFAILAFVLFGLGSINLATGILNLAGKEFIGFY ILSAIASFLFDFAILRWKQTKLFIRPSGIRSLDDTPSWEEFLQGNKLSLQESIATWLT MLSIFCLYSLGIARIIIGLSRHKPIGYLVVLCLCVGVYLLWLLSREDDNFNSFIKTLI NVTVLLSGIVLFFLGFQIVIWSIFALIFFVGIFSSRGGGGYTGGGGSDAGGDGGGDGG GDGGGGGGCGGSDGGGDGGGGDGGGCGGCGGCGGCGGCGG" gene 3280..4653 /locus_tag="DP116_10460" CDS 3280..4653 /locus_tag="DP116_10460" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10460" /translation="MTLLESDPNVNERFLAQHFYNNLHSSPLALKHMSAYLEKFGYDA ANRVHRELSRFQKYNQLLYDKRDILQMAWLFASTPDKFFKNFNPQHPLENYACKTMEY KIKQEIFRIQMGQKQFSDWGLLKHCTRKSLQQALQCQGCTQPQLGCYLLAYNCFKEIY APQRATSNRSLQPPTDTQFQEITNLYNHLVQQSALATLDTVHQQKIKQWLLECIQALR NFRSRQIISLDAPKGEDENSLPLSQITPDPTSESLWEKIIVQELTPQMMTTLSELLTQ LDRYTNNHLLLKYGFNLDYRSIAPFFFVDSTTISRHCNKTTQKLLGQLTQWAQEELHI TPDSENLKEINTLLKQCLNQYYQDFIFRSVFQDAWQQLDSECRHILYLRYFRRIDEAA IAHNLQLSELEVTNGLVTGTQKLAAAVCHWISNHLTVSLDFVNPLADKIAIFVQTLIA NYPEHEF" gene 4673..6211 /locus_tag="DP116_10465" CDS 4673..6211 /locus_tag="DP116_10465" /inference="COORDINATES: protein motif:HMM:PF08852.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10465" /translation="MSTMLEAFITLYPEQTWLEVSPKQLEETMPSQQEYSYDVARSTA FTNRLTLNAFVDWFKAESGIDEQLKVWPSLDALPSIWEMVNGTAIQLGETRVVLIPSQ EIDINEFCVPAEWVDIPSWAADYYYLAVQVSPEDCWLRICGYTTHKKLKQAGLYDTIK RTYSLEQEELIEDLNVMWVARSFGCDTRLRSRSVPEGLSAPYACGTATPIGASVCPLG NRKAAIKPLPSLSQAQAENLLAQLSQRSCYSPRLKVDFEQWATLMENEKWRQHLYERR IAQAVAPQPVLVNLSQWFSNVFEEYWQIVEDIFAPTELIPVLSSRSPEPDRESKLKAI ASIIPLLNPHHEELLRCQAAGVLGKIGVGSHDVSIALTELLHTARDEKTRWQAAISLG KVNPDHPEAGRQRAKLIDLGMQLGGYPVALIVAVMPRTDGKVNVFVQVQPTQVETLLP PGMKLGLLSESEEIIREVEARKNALGQGKDESIQLVFSCSSATHFRVKVTCNNVNFTE SFVI" gene 6238..8274 /locus_tag="DP116_10470" CDS 6238..8274 /locus_tag="DP116_10470" /inference="COORDINATES: protein motif:HMM:PF12770.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10470" /translation="MGRVVVLRIESVTPERVSVSLQIWRDHGPPIVHRIDATLPALLD LIGIYQSWYQEYRNLIINPLNPEIIIDETIPTQTSTSGVDACRIFSRLLRDGIRDRLG NSENPGWARIRERVLEQLFNVSDEIRIVIQADDSQLWKLPWHEWDLLQRDSVRTNGVE VAFSNLHDSASPSNPVTPQGWVKILALVENIPGLQQTVQTTIRSLPEVIPEYPSNLDA LQSQLRQGCEILVFAGHGYTGGDGIGRIIYGNSQITVDCFAEALKEAVSKGLKLLFLC CCDNLGLVKDLKDAGVNIPVIIAMREEISVEAAQEFFANFFQEYAQQKQPLYKSFRRA RVALENWEERLPGTKRIPIIYQTLSVTPPTWEELIASGPEPQLPEPVVSPPIELPEPV INPLLRFIRSFRRFVQRRFVRKFLLLMLVGLGIAFIIWRIPILSPPPEVALCPTGLPD NISYGDKILNENDRNLDPAQSGTKKLQEACTKFGNGQDKNAVAAAIPLFEQAVKDLTT AHKQNTKNAEILTYKNNAEVYLKYAEDISQSTPKLDRPKILAVAIPVNPRNSPLPGEI KQSANVINLGVAIAQKEFNQTQKNNQKFLVLIADDNSEMSTAKPIIESFKKVQHEIVG VVGHASSGLTLNASRFYNQKKIVLISPTSTAEDIRPKDVKPIDNYIFRVCPTNK" gene 8581..8967 /locus_tag="DP116_10475" CDS 8581..8967 /locus_tag="DP116_10475" /inference="COORDINATES: protein motif:HMM:PF01094.26" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10475" /translation="MYSESIINTSTRKPCPEGLMLAAPAYDSEFTNKVNGILNNQYAT IEDWRVPYSYDAALALFTAASNTNQQFSSNSSQIKDLLHNLNITGVTGQIQFDANGDR IQPNNSDDIELFRVVNNGNQCSFQRL" gene complement(9102..10040) /locus_tag="DP116_10480" CDS complement(9102..10040) /locus_tag="DP116_10480" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10480" /translation="MRKNRIPVQLVWAVVVICVLPSLLNLLGVDFGSPSQTFDASLSN TTKDKVIDVMHYTLSGSFTHTILEWSAFCTAIFTVILSLIHFYLKADVVTPIIGIALF SSGCMDAFHTLAADRLIQGVADNQNLIPFTWAICRLFNALIMLGGAGFFLIAQPRRWK SGITLVVLSSLVFGVIAFWIVSICAHSATLPKTIFPNSMVTRPWDVAPLLLFIFAGFV VFPRFYNKYPSLFSHALVISVIPEVATQLHMVFGSKVLFDNDFNIAHFLKIIAYLVPF LGLILDYIRTYREEADTAKQLKQAMGVFSSILSESR" gene 10388..11530 /locus_tag="DP116_10485" CDS 10388..11530 /locus_tag="DP116_10485" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10485" /translation="MLDNRRDVKPSLASLWKLDCTRVVKVLRGSIVLKVTALAITIGT LPLLTVRTTTHEFANQSRSKEITEVQQTRTSAKANEVPQTMLEQHQKIQIAAHLSILV NPLVRHNIFEQQQAVLNRYIEVLVNLDKIVVKIDNAYELINYVVWKTPKGLPNEHEDV VFAVNKVTASANQQDSLLSLLLWIGVIGSLVVAIAIFLVQWCISPILNTTNVLKKLGL KEPHQIDAQQEDEFASLGSTINLMADQLKVLVKEQAEYSSRQEAEAKRTQIFTEITLR IRQFLRQQDILQTTVEEIRNLLATERVVVYRFNDVGSGNIVSESVAAGWSKIIDKQMN DPCLSERYIQHYKNARVRAIDNIYQAGLTNCHGEELSRNAGISKSA" gene 11689..12822 /locus_tag="DP116_10490" CDS 11689..12822 /locus_tag="DP116_10490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310908.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_10490" /translation="MNAGAVPSNPFVAGGMLDDPQLFVGRKDELHAIASRMSGAQPTS VNVVGEKRIGKSSLLYHFFQTWEQRVQDPSRYVVIYLSLQSVVCQREEDFYLAVAQEL LRRPAVRAIPSLMTALQLNPLNRLAFSTAIALFKNQIQNQNNNQNQNNNQNLLPVLCL DDFKTLFENKSEFDDGFYNNLRSLMDSNSLMLVVASQKELDFYARRHQLTSSFFNLGH VLKLRELSEEEVTELVLLPSSTVTHAQPALGMHEQKLTRQLGGRHPFLLQLAGMLVYQ ARQQGKDENWVKTRFKEQSRRLPRPGLGSRRWWRSLRWLVWNLPIRIGSVVKSIGVSV SDMSSWILGVVIVCVIILVVFGLVSHSDAWRWIQKLVNKVLGG" gene 12825..15197 /locus_tag="DP116_10495" CDS 12825..15197 /locus_tag="DP116_10495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317926.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AAA family ATPase" /protein_id="PRJNA477356:DP116_10495" /translation="MTRQRSASDLWTYIAECIRLLYWIYLKPYTFANWLRDIHPELKP GTNPFAKRAEFSTNPRLRRYAGQVWWLTAVIPLVAVLIAAPIYTFVTPLVAPINGFED SVESFNWLASGLIWLGWLIGLMIVARGDNKQKILVLIFATLTILLPFILSFTTPSSVA FVGVAFNVAVGMVLGVPFGVAFGVTFGVASSLAISVAFGVTLGVASGLAFNIVLAFGV VVLGVAFGVVCILGVLRVYFWVPELLWMLILRLLTQQKNPVLALRYLPPRFDELIRLP LPFMGEMIVKAYRENPTAARETIDYLINFTNQQQVALQAMANIAVTRLNSCQTLSDIA NMTEELAWIPSPPPKELSSALPQLLDISQDVSAASQATSAHRQSELLNRPISALRLLK NSLAFGKNAEVATTFGSIIQRWQSILETAQRTLEEQGRFSKEFRQVYIAGNALDPQTA ENRFKGRMDLFREIETFALSESPPVLLLYGGRRTGKTSSLKYLPDKVGASLIPLLVDL QGAASATTLQGLANNLVSQMTEAARRLPRRLDLPYTDLSRTDPFIALQNWLVQIERTL PDKRFLLCLDEYERLSEVVETTRSRVPLNFLRNILQHRTSWTLLFSGSHELSELPTYW SDYLINTRALRLTYLHESEARELIEKPVEDFPKIYEPEAVDAIIRLTRCQPYLVQLMC YEVVELLNRGIRQNKRDAATIKATVHDVETVIPTVLERGDQYFRELWTGFTEGDRNLL QRLLQGETPTKQDKASVRKLVRKEILYKEGVEFQVPLVQKYVEQRLEDET" gene complement(15408..15713) /locus_tag="DP116_10500" CDS complement(15408..15713) /locus_tag="DP116_10500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199994.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_10500" /translation="MKFIVIHTEARKELDAAIAYYEKQKFGLGLDFLSEVEKVLGNIQ QNPNLGTPYKIEGVRRYAIKRFPYLIFYIELEEVIWVIAIAHGKRKPDYWKKRNMEI" gene complement(15710..15940) /locus_tag="DP116_10505" CDS complement(15710..15940) /locus_tag="DP116_10505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002784951.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="addiction module component" /protein_id="PRJNA477356:DP116_10505" /translation="MTETAEKLKLELSQLSAQERAEIAHFLIQSLDGDVDNDVEAAWD TELTKRLEDIHRGTAIGEPLNQVFSELREKYS" gene complement(16290..17567) /locus_tag="DP116_10510" CDS complement(16290..17567) /locus_tag="DP116_10510" /EC_number="3.3.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995586.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenosylhomocysteinase" /protein_id="PRJNA477356:DP116_10510" /translation="MTATSPRLKHEVKDLGLAALGRQRIEWAGREMPVLRQIRDRFAQ EKPFAGIRLVACCHVTTETAHLAIALKAGGADALLIASNPLSTQDDVAASLVADHEIP VFAIKGEDNETYSRHVQIALDHRPNIIIDDGCDVVATLVQQRQHQLADIIGTTEETTT GIVRLRAMFRDGVLTFPAVNVNDADTKHFFDNRYGTGQSTLDGIIRATNVLLAGKTVV VVGYGWCGKGTALRARGLGANVIVTEIDPIKAIEAVMDGFRVLPMAEAAPQGDLFITV TGNKHVIRGEHFDVMKDGAIVCNSGHFDIELDLVALGSKAKEVKTVRPFTQEYRLQSG KSVVVLGEGRLINLAAAEGHPSAVMDMSFANQALGCEYLVKNKGNLEPGIHSIPTEVD QDIARLKLQAMGIEIDTLTEDQIEYTNSWTSGT" gene 18051..19595 /locus_tag="DP116_10515" CDS 18051..19595 /locus_tag="DP116_10515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869047.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="mechanosensitive ion channel family protein" /protein_id="PRJNA477356:DP116_10515" /translation="MPKATAQQIPFLPNLPTPNLLIHESEKATETDWVSLDGRRLFQI AAPKGSLTQRLQDIQQRLDQQSQDYFQKSDAEPNVQIKTEKKSTTLKGKTTETVLTVI LLNGQYLMTVNDQDASLQQDNTSTVANQIVEKLKEGLKQAKQERQSKFLIQQGQIAAA AGVAMIVISWGVYSYQHRPKKYKAQPVTTASAAARPMTLQLNQQQHQHITEVKRRLYQ LAQTTIWGGGSLFILGLLPYTRPLQLWIMIAAQVPLTLGVIALGTYVAIRLSYALIDR FNSAFVNSAVLLTPEAPERIQLRVSTISGVTKSLITISCVAVGTLLALGALGVELVPL LAGVSIVGVAVSLASQSLIKDAINGFLIILEDQYALGDYITVGTVGGLVENLNLRMTQ VRDGEGRLITIPNSEIKIVANLSSRWSRADLTIPVAYQTDVDEAIKLIQKVGVEMDQD PQWDHQILETPQVLGIDNFAVAEKGFLIRVWIKTQPLKQWSVAREYRRRLKNAFDQAG ISIFIP" gene 20064..20330 /locus_tag="DP116_10520" CDS 20064..20330 /locus_tag="DP116_10520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320653.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10520" /translation="MMKHYILSLNPTAKHEWDRCILRDPLTAKRPDIAKLVAEAVGAD TGSYLVSVNIEVQVLEQAAVPHAEQLSLNVGEVTIQAPQLREVA" gene 20476..20841 /locus_tag="DP116_10525" CDS 20476..20841 /locus_tag="DP116_10525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10525" /translation="MGAALVEDIAQAAQRVNEVDSQLMAVQLQINRFEGNADRAAAFD MDLKNDAQRKARRFEVLLVNQEYQKAMDTLIQLTTEKANAVAHLEYLRNLFSVAKLEA RLTIAQQLTDFESHELVGL" gene complement(21041..22177) /locus_tag="DP116_10530" CDS complement(21041..22177) /locus_tag="DP116_10530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316815.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_10530" /translation="MKILFLDQSGKPGGAELCLIDIAKPYKDSCLVGLFADGSFKNLL EQNHIPVQVLTNQAIQVRKESSLAQGLGSLAQILPLISKVVQKAREFDLIYANTQKAL VVGALASFFSRRPLVYHLHDILSPEHFSQTNRRIAVTCANRFASLVIANSQASQAGFV AAGGRLDITEVIYNGFDPKNYQSHESDVSQIRQQLGLDGQFVVGHFSRLAPWKGQHIL IEALTQSPVDVTAILVGDALFGEQEYVQHLHQQVTALGLENRVKFLGFRSDVPLLMAA CDLVAHTSTAPEPFGRVIVEAMLCGRPVVAAKAGGAVELVEHGVNGFSVTPGKPDELA QVITTCLKEPQKIATIAHHAQTTASQRFDVTAINQQIAVLLNKL" gene complement(22528..23700) /locus_tag="DP116_10535" CDS complement(22528..23700) /locus_tag="DP116_10535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_10535" /translation="MENNTKNALESKSILCLGVGWFPKTPGGLERYIYELTQKLAANQ DKVELCGVGLPEAEPNLPIKLTNLASPSSPIWKRLWSIRTNFQKTRTEKPDVINLHFA LYSFPILDLLPKGVPVTFNFHGPWAFESQEEVEERKLSVLLKAKLIEQRTYNRCDRLI VLSKAFGNILHKQYQVPWNKIHVIPGGVDITQFQPNLSPQEARTKLDWPQDRPILFTA RRLVNRVGLDKLLDALAIIKPRIPDVMLAIAGRGPQQATLQQQATELGLDQHVKFLGF IPDELLPVAYQAADLSVMPSQSFEGFGLAVVESLACGTPVVCTPVGGMPEILEPFSPD FITSSTEVPAIAEKLEQVLLGKIPTPSREACRHYAVTNFNWDKIAQDVRQVLLAQQ" gene complement(24432..24890) /locus_tag="DP116_10540" CDS complement(24432..24890) /locus_tag="DP116_10540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488151.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10540" /translation="MNIKAKLLGAVAVISGISGLLFVAPTPASANTFIEQVGMQLMRA AIHGGLNGYQLTHEPFIDGIRSGGSDYLTINLRAGQQYGIVGVCDGDCQDLDITLYND RGQVIAADTDDDDIPTVGVTPTRSGTYRVKVDLPSCNSAMCYYGVGVFGK" BASE COUNT 7377 a 5369 c 5397 g 7019 t ORIGIN 1 ggtatctgtc gcaaacattt tttgaatcgg tataagctta acgctttttc acgcgttatg 61 gctttcttga cggaactcgt tttctatgct agcgtcataa attatgagtg taaactaagt 121 tatccaacaa aacgccgaac aaatgtattc cagtcattcc tatttttatt tttggtttag 181 ggtggttcac cgggggtgaa ttcagtttcc taatggaaac cgcccggtga accgcagagc 241 gcactctgcg gtttttgttt ttttaaggag ttaaatttca tcgtcatgat ctatctcaat 301 acctggtttt atggctttta tttttggtcc cgaataagtt ccgggtagat agcatcatgt 361 cgatgaatct ttcaagcccg gaacaacaaa ggttctgggc ttttttagtg tcaacctctc 421 atttgttgtt ttttaggaga atccaaccat gaatagtgct aaactagccg ccaaatctca 481 tcctgaccat caaacagttg tcaatctttc agagaaagtc gcctttggtg gtacagaact 541 ggtgattatc ggtggacctt gtactgttga aagcgccgaa caaatggaaa cagtagctca 601 aaagctctcc gccgcacctg tgcaggcttt acgtggcggt gtttacaaac cccgcacctc 661 tccctacgct tttcagggaa tgggagaaga tggactggaa gttttggcaa aagtgcaagc 721 ccactacaat atcccagtcg tgactgaggt gatgtcaatt tctcaaatag aagcgatagc 781 cacccatgtt gatatgcttc aagtcggtag tcgcaacatg caaaactttg atctcctcaa 841 agcattggga caagcaggta agccgatact cctcaaacgc ggcttagcag cgacaattga 901 agaattcgtt atggcagctg aatacatctt aagtcacgga aatcctgatg tcgtactgtg 961 cgaacgaggt atccgcagct tcgataatta cacccgcaac gtactagact taggagcagt 1021 ggcagctctc aagcagataa ctcacttacc tgtgattgta gatccatctc atgcagtcgg 1081 taaacgggaa ctcgtggcac ctcttgccaa ggcggctata gcttgtgggg cagatgggtt 1141 aattattgag tgtcatccag aaccagaaaa atctgtttct gatgcacgtc aagccctgtc 1201 tttagaagat atggtgaatt tggttcatag cttaaagccc gtagcagcat ctgttgggcg 1261 aagaatatcg gaagacatgg gggcgggttt caaacctgcc cctctttgtt gtgcggcttg 1321 aaatgtgaat aaaggctctt gctacctgtt ttctatcaca tttttgattc tggctgttca 1381 aactccttga tgccgtaagg taactctaca gtatcaagga gacgttaacc 
accacagcca 1441 ccacagccac cacaaccacc acaaccacca caaccacctc catcaccacc gcctccgtca 1501 ccgcctccat ctgaaccacc acaaccacct ccaccacctc catcaccgcc tccgtcaccg 1561 cctccgtcac cgcctgcatc tgaaccacca ccacctgtat aaccccctcc tcctcttgag 1621 gagaatatac ctacaaaaaa tatcagggca aagattgacc aaattacaat ctggaatccc 1681 aggaaaaaca aaactatacc cgagagtagg acagttacat tgatcaatgt tttgataaaa 1741 gagttaaaat tatcatcctc acgacttaga agccatagga gatatacacc cacacacagg 1801 cataaaacaa caagataacc aatcggtttg tgtcgagaaa gtccgatgat aatcctggca 1861 attcctagag agtataggca aaaaattgag agcattgtta accaggttgc aatgctttct 1921 tgcaaagata atttatttcc ttgcaaaaat tcttcccaag atggtgtatc gtctagcgat 1981 ctaattccag aagggcgaat gaaaagttta gtctgtttcc aacgaaggat agcaaaatca 2041 aagagaaaac tggcgatcgc agataaaata taaaacccaa taaactcctt tcccgctaga 2101 ttcaaaattc ctgtcgctaa attaatactc cctaaaccaa ataaaacaaa cgccaagatg 2161 gcaaataaca cccaagccgt ccgagaatca tttttgaatc gttgtaataa taagccgaga 2221 agacctataa acaaaagccc aatacatcca actaacaggt aaaacttgaa aaactcaggt 2281 aatgaacggt taattggatt tggaaagctt gcgagtgcgg gaacatccca agtgatgata 2341 ctcacaagtg caagcagcga cagcatccag acagcactct gactaaaacg aaacttggtt 2401 aaaaaactga aatccggctt ttggataatc caattctgct gtgtattgac gcgaacaaaa 2461 tgaatatctc tgccaaaacg gatatgggga ggcgaccaaa tatcaggtgg tggtagctgt 2521 tggaaaaact gttcataact cgctaatgtt ttaccgtacc agttattaaa cttataccct 2581 tcctgttgcc cgccttttgt aggattgtga tgcaaaggcg tttgtagaac atgagcacag 2641 aagttttccc agtatgattt tgtataaaca aggtgtaaat gccagacttg atccacttga 2701 tccgaagggg tgactggatg accagcaaca atagctaaaa atgcaaattt cttgtattca 2761 tcaatcaccc gttgagcata ttcaactgtc caattattat ccctcgcaag gcgtttacta 2821 aaagataatt ttgcatcaat gtcatccaag gagaactgct gaatacgctg atagagatcc 2881 aattcgtgag tattcataga taaatgggtg aattcggtta ataggagaga tctttgctca 2941 ttgcctctgt tggcttgttt gtttacccac ctgttgatga actcgtggat ttgtatttgg 3001 ttttcagttt gtttagagga acttatacgt tcttgttaca aatctttaca ttcctctgaa 3061 gttgcgataa atatcaaagt tgaaccgata aataaaaaag gagcatgact taacggttta 
3121 ctgcagcctg agaatttttc aggaaggagg cgcatgacga ttatctggaa gttttccacc 3181 cacgccaaaa ttaacgatag taacgggctt tcaattactt ggaagtgcga taagccggag 3241 gcttgacgct acgcgtatcg cgaattagaa aacaacatca tgactttgct tgagtcagat 3301 cccaacgtca atgaaagatt tttggcacag cacttttata acaatcttca ctcttcacca 3361 ctagcgctca agcatatgtc ggcttactta gaaaagttcg gctacgatgc tgctaataga 3421 gttcacagag agttaagtag gttccagaaa tacaatcaac tgctatatga taaacgtgat 3481 atcttgcaga tggcttggtt gttcgctagc actccagata aattttttaa aaactttaac 3541 ccccagcatc cattggaaaa ctatgcgtgt aaaacaatgg agtacaaaat taagcaagag 3601 atttttcgca tacagatggg acaaaagcag ttttcagatt gggggttatt gaagcactgt 3661 actcgaaaat ctctacaaca agcgctgcaa tgtcaaggtt gcactcagcc gcagcttggc 3721 tgctatctgt tagcttacaa ctgttttaag gaaatttacg ctcctcaaag agcaacaagc 3781 aaccgttctc tgcaaccacc tacggataca caattccaag aaataacaaa cctatacaac 3841 cacctcgtgc agcagtcggc actcgccact cttgacacag ttcatcaaca gaagattaaa 3901 caatggttgt tagaatgcat tcaggcgtta agaaactttc gatccagaca aattatctcc 3961 cttgatgcgc ctaagggtga ggacgaaaat tctcttccct tgtcacagat aactcccgat 4021 cctacctctg agtctctgtg ggaaaagatt attgtgcaag aattgactcc gcagatgatg 4081 actactttgt cagaacttct gacacaatta gacagatata ctaacaacca cctgctactg 4141 aagtacggtt ttaacttaga ctaccgttca attgctcctt ttttcttcgt tgattctaca 4201 acaattagtc gtcactgcaa caaaacaacc cagaaattac ttggacaact tacccaatgg 4261 gcgcaagaag agctacatat tacgcctgat tcagaaaatc tcaaggaaat aaatactcta 4321 ctaaagcagt gcctgaatca gtattaccaa gatttcattt ttcgatctgt gtttcaggat 4381 gcatggcagc agcttgactc agaatgccga catatactgt acctacgcta ttttaggcga 4441 atagatgaag cagcgattgc ccacaacttg caactctctg aattggaagt cacaaacgga 4501 ctagtcacag gtactcaaaa gttagctgct gctgtttgtc actggatttc aaatcaccta 4561 acagtttctc tggatttcgt taatccactt gctgacaaaa ttgcaatctt tgtacaaaca 4621 ttgattgcaa attaccctga acatgaattt tagataacac tgaagagtca aaatgtcaac 4681 tatgttagaa gcatttataa ctttataccc agaacagaca tggttagaag tttcaccaaa 4741 acagcttgag gaaacaatgc catcacagca agaatattct tatgatgtcg ctcgttcaac 4801 
tgcttttacc aaccgcttaa ctctgaatgc ttttgttgat tggttcaagg cagaatcagg 4861 aatagacgag cagttaaagg tgtggccaag tcttgatgct ttacccagta tatgggagat 4921 ggttaatggt acagcaattc aactgggaga aaccagagtt gtcctgatac caagccagga 4981 gattgacatc aatgaatttt gtgtgcccgc tgaatgggta gatattccca gttgggctgc 5041 tgactactac tacttggcag tgcaagtaag tcccgaagac tgttggttgc gaatttgtgg 5101 atacacaacc cataaaaagt tgaaacaagc ggggttatat gatacaatca agcgcactta 5161 ttctctagag caagaagaat taattgaaga tttaaatgtc atgtgggtgg cgcgctcctt 5221 cggttgcgat acgcgtttgc gttcgcgtag cgtgcccgaa gggctcagcg ccccctatgc 5281 ctgcggcacg gctacgccta tcggggctag cgtctgccct ttgggcaatc gcaaagcagc 5341 aattaagcct ttgccaagtt tatcacaagc acaggcagaa aacctgttag cacagctcag 5401 tcaacgctct tgttactctc cacgtttgaa agtggatttt gaacaatggg caacattgat 5461 ggaaaatgaa aaatggcgac agcatctgta cgaaagaagg atagctcaag ctgtagcacc 5521 acaacctgta ttggtaaact tgagtcaatg gtttagcaac gtctttgaag aatattggca 5581 gattgttgaa gatatctttg ctccaacaga attaattcca gttttgtcgt caagaagtcc 5641 agaacctgat agagaaagta aacttaaggc gatcgccagc attattccct tattaaaccc 5701 tcatcatgag gaattgcttc gttgccaagc tgcaggagtg ttgggaaaga ttggcgtagg 5761 aagtcatgat gtcagcattg ccctgactga attattgcac acagcaagag atgagaaaac 5821 tcgttggcaa gcggcgatta gtttggggaa agtaaaccca gatcatccag aagctggtcg 5881 tcagagagct aagttgattg acttggggat gcaattaggg gggtatccag tcgccctgat 5941 tgtggctgtc atgccaagaa ctgatggaaa agtgaatgtt tttgtgcaag tccagccaac 6001 tcaagttgaa acacttctac caccaggtat gaaacttggt ttgctttctg aatcagagga 6061 aattattcgg gaagttgaag ccagaaaaaa cgctcttggg caaggtaaag acgagagtat 6121 tcaactagtt tttagttgtt catcagcaac tcattttcga gtcaaagtga cgtgtaacaa 6181 tgtcaacttt actgaaagct ttgtaatata agcaaataaa ctcttgtcgg ggcagttatg 6241 ggtagggtcg tagtcttgag gatagaatct gttactcctg agagggtttc cgtctcgtta 6301 caaatatgga gagatcatgg acctcccatt gtgcacagaa tagatgctac tctacctgcc 6361 ctcttagacc ttattggtat ttatcagagt tggtatcagg aatatagaaa cctgattatc 6421 aatcctctta atcctgaaat tattattgat gaaactatac caacccaaac cagcacctca 6481 ggtgttgatg 
cttgtcgtat attctctagg ttgttgagag acggcataag ggatcgactt 6541 ggaaatagtg aaaatcctgg ctgggcaaga attcgagagc gtgtgttgga gcagttattc 6601 aatgtatccg atgaaattcg catcgttatc caagctgatg attctcaact gtggaaattg 6661 ccttggcatg aatgggattt gcttcagaga gactctgttc gcacaaacgg tgttgaagtt 6721 gcttttagca acttgcatga ttcagcatca ccttcaaacc cagtgactcc tcaaggctgg 6781 gtaaaaatcc tagcactagt agaaaatatt ccaggtctgc aacaaacagt gcagacaact 6841 ataaggagtt tacctgaagt cataccagaa tatccatcca atttggatgc attacaatct 6901 cagctaagac agggctgcga aattctcgtt tttgctggac acggttacac aggaggtgat 6961 ggaattggtc gaattattta tggaaatagc cagataactg ttgattgttt tgccgaagct 7021 ttaaaagaag cagtcagtaa aggtttaaaa cttttatttc tatgttgttg cgacaacttg 7081 ggcttagtaa aagacttgaa agatgcaggc gttaatatcc cagtcatcat cgcgatgaga 7141 gaggagatat cagtcgaagc tgcacaagaa ttttttgcta atttttttca agaatatgct 7201 caacaaaagc agcccttata taaatcattt cggcgagcaa gagttgcatt agaaaactgg 7261 gaagagcgtt taccaggaac aaaacggata ccgataattt accaaacttt atccgtcacg 7321 cctcctacgt gggaagagtt aattgcatct ggaccagaac cacaactgcc tgaaccagtg 7381 gtatctcccc caatagaact gcctgagcca gttattaatc cactcttaag gtttatacgt 7441 tcatttcgtc gatttgtgca gcgtcgtttt gtaagaaaat tcttgctgct gatgttagta 7501 ggattaggaa tagctttcat aatttggcgt atacctattc tttctccacc cccagaagtc 7561 gcactttgtc ctacaggact tcctgacaat ataagttatg gagacaaaat tttaaatgag 7621 aacgatagaa acttagatcc agcacaaagc ggaactaaga aattgcaaga ggcgtgcaca 7681 aaattcggta atgggcaaga taaaaatgct gtagctgctg ctattccttt atttgagcag 7741 gctgttaaag acttgaccac tgctcataaa cagaatacga agaatgctga aatactgact 7801 tataaaaata atgctgaggt atacctcaag tatgccgagg acatttctca gtccactcca 7861 aagctagata gacctaagat tttagcagtt gctattcctg tcaatccaag aaattctcct 7921 ttaccgggag aaataaaaca atcagcaaat gtaatcaact taggcgttgc tatagctcag 7981 aaagaattca atcaaactca aaagaataac cagaaattcc tagttttaat agctgatgat 8041 aattctgaga tgagtactgc taaaccaatt atcgagagtt ttaaaaaagt acaacatgaa 8101 attgtaggtg ttgttggtca cgcttctagc ggcttaacat taaatgctag tagattttat 8161 aatcaaaaaa aaatagtctt 
aatatctccg actagtacag cagaagatat tcgtccaaaa 8221 gatgtgaaac ctattgataa ttacatattc agagtttgcc ccactaacaa ataaattgcc 8281 aaaaaaatag ccgaatatat cgacaagaat gttgcttata acaataaaaa gaaggtcatc 8341 atcggttacg tcaatgaccc ttatgtacag tcgttaaaga cagaagttgc agatctgcta 8401 tctaaaaggc aaagaaaggt tgatcctctt aatttagata cagacaacat tcagacacat 8461 ctaaaagatg ctgtagcgct tgttctcatc ccaaatcttg ggagacgcga gtcaatttta 8521 aattggtttt acacagctaa taattctcct tatcaattac ctgttatcgg tagtgatagt 8581 atgtatagtg agtcaattat aaacacgtca acacgtaaac catgcccaga aggactaatg 8641 ctggctgctc ctgcttatga ttctgaattt acgaacaagg tcaatggtat tttaaacaac 8701 caatatgcaa ccattgaaga ttggcgtgta ccatactcct atgatgcggc tttagcctta 8761 ttcacggcgg ctagcaatac aaatcaacaa ttttcatcaa acagcagtca aatcaaggac 8821 ttattacaca atctaaacat tacaggcgtt acagggcaaa ttcagtttga cgccaatgga 8881 gaccgaatac agcctaataa cagtgacgat attgaattat ttcgtgtcgt aaacaatggt 8941 aaccagtgta gctttcaacg attataaata caggcggatg agacaggaaa ctcacgcata 9001 caaataagat cgtcactgat tcacttttgt tgtgaagccc caaccgggcg cgaaacgctc 9061 gttggggact tcacctatac tggttgatct cctgtacgcg tctagcgaga ctcactcaaa 9121 atagaactaa acacccccat agcttgtttt aactgcttcg ctgtatctgc ctcttctcga 9181 taagttcgta tatagtccaa aatcaatccg agaaacggca ctaggtaagc aataatcttg 9241 agaaagtggg cgatgttgaa atcgttgtca aacagcactt ttgaaccaaa aaccatatgg 9301 agctgagtcg ctacttcggg aataacgctg ataaccaacg catgagagaa cagacttgga 9361 tatttgttat aaaagcgagg gaaaacaaca aaacctgcaa agataaatag taacagcgga 9421 gcaacgtccc acggacgagt caccatagag ttaggaaata tggttttggg tagagttgcg 9481 ctgtgggcgc agatagatac aatccaaaaa gcaatgacac caaaaactag gctcgaaagt 9541 accaccaaag taataccgct cttccatctt cttggctgag caatcaaaaa aaagcctgct 9601 ccaccaagca taatcagagc attgaaaagt cggcagattg cccaagtgaa ggggatgaga 9661 ttttggttat ctgccacacc ctgaattaat ctatcagccg ccagagtgtg aaatgcatcc 9721 atgcatccag agctaaaaag tgctatgcca atgatgggag taacgacatc agccttaagg 9781 tagaaatgta tcaaagacaa aataactgtg aagattgctg tacagaaggc actccactct 9841 aaaatggtgt gcgtaaaact gccagataaa 
gtgtagtgca taacatcaat cactttgtcc 9901 ttagtagtat ttgacagtga ggcatcaaaa gtctgtgagg gcgaaccaaa atctactccc 9961 aatagattca gcaagctagg cagcacacag atgacgacta ctgcccaaac caattgaaca 10021 ggaatcctat tttttctcat ataaatttct taagtatctc gcctgataag atatagattg 10081 ctaaactaag attacagcaa atatcgggat tcttttacaa aatatattgg taaatttccg 10141 taaaagaagc acaggagatg cgcctacccg gagtgtaacg tgatacagag ctacaatgca 10201 aacggcgaaa gtaaacccga tacgcgccgg cttcagacgt agcaaaatac agcggttcac 10261 atcaatacat acatctgtca ctatttttag tgactattat cacagtggtc tcttcgtaag 10321 tattgttatg atgcgaacta tttaaaagaa ttagttcagt attttcacac aaacttaagg 10381 tagagcaatg ttggataatc gtcgagacgt caagccaagt ctagcgtctt tgtggaagct 10441 tgactgtact cgtgttgtta aagttcttag agggagtatt gtcttaaaag tgacagcttt 10501 agctatcacc attggcacac ttccattact gacagttagg acaacgactc atgagtttgc 10561 gaaccaatcg agaagtaagg aaataaccga agttcaacaa accagaacct ccgctaaagc 10621 aaatgaggta ccccaaacga tgcttgaaca gcatcaaaaa attcagatag cggctcattt 10681 atcgattttg gtaaatcctc tggtacggca taatattttt gaacaacagc aagcagtgct 10741 aaaccgctac atagaggttc tggtgaacct tgacaaaatt gttgtaaaaa tcgataacgc 10801 atacgagttg ataaactacg tagtttggaa aacaccaaaa ggtttgccaa acgaacatga 10861 ggatgtcgtc tttgctgtaa ataaagttac tgcatctgcc aaccaacaag actcactcct 10921 gagcctgcta ctttggattg gggtgatagg atctttggtc gttgcgatcg ctattttcct 10981 agttcaatgg tgcataagtc caatcctgaa tacaacgaat gtgctcaaaa aactaggttt 11041 aaaagaaccc caccagattg atgcacaaca agaagacgaa ttcgcttctt taggctctac 11101 aatcaacttg atggcagacc aactcaaagt tctagtcaaa gaacaagcag aatactcctc 11161 tcgccaagaa gctgaagcca aacgaacaca gatctttaca gaaatcacac tgcggattcg 11221 tcaattttta agacagcaag atatcctcca gacaacagtt gaagaaatcc gaaacttact 11281 tgcaactgag cgagtagtcg tatatcgctt taacgatgta ggaagtggaa atatcgtatc 11341 agaatctgtt gctgctggtt ggtcaaaaat catcgataaa cagatgaatg atccctgttt 11401 gagtgagcgt tacatacaac actacaaaaa tgctcgggtg cgtgctattg ataatattta 11461 ccaagcgggg ttgacaaatt gtcacggtga ggaattgagc cggaacgcgg gtataagtaa 11521 gtcggcgtaa 
aaatttcaag gtatgtcatt gcgagtggaa cgaagcaatc gcaaggattt 11581 tgtggatttt acattctgtt acatagttcg atttatttgt gctgacttac ttaacttttt 11641 taggattatt tctctagtgt tagaatctag tcactagaaa atttttttat gaatgctgga 11701 gcggttcctt ccaacccttt cgtagctggc gggatgttag atgatcccca gctatttgtt 11761 ggtcgaaaag atgaattaca tgcgatcgca tcacgaatga gtggtgcgca gccaacaagt 11821 gtcaatgttg ttggtgaaaa acgaattggc aaatcctcgt tgctgtatca ctttttccag 11881 acttgggagc agcgagtgca agatcccagc cgttatgtag tgatttatct gtcgcttcaa 11941 agtgttgtgt gtcaacggga agaggatttc tatctcgctg tggcgcagga gttgctgcgt 12001 cgtccggctg tgcgagccat accaagttta atgactgctt tgcaactcaa tcctcttaac 12061 cgcttggcat tttcgacggc gatcgcatta tttaaaaatc aaattcaaaa ccaaaataac 12121 aaccaaaatc aaaataacaa tcaaaactta ttgcctgttc tgtgcttgga tgattttaag 12181 acattatttg agaacaagag cgaattcgat gatggatttt ataacaactt gcgctctttg 12241 atggatagca acagtttgat gctggtggtt gcttcccaaa aggaactcga tttttatgct 12301 cgtcgccatc aactgacttc ttctttcttt aatttaggtc atgtgctcaa gctgagggaa 12361 ctgagtgaag aagaagtcac agaattagtt ctcttaccta gcagcactgt tactcatgcc 12421 caaccagcct tgggtatgca tgagcagaag ttaactcggc aattgggtgg gcgtcatcct 12481 tttttgctgc aattagcagg tatgctggtg tatcaagccc gtcagcaagg gaaagatgaa 12541 aactgggtaa aaactcggtt taaagagcag tctcgccgtt taccccgtcc tggattgggt 12601 tcccgccggt ggtggcgttc tctacgttgg ttagtttgga atctcccaat tcgtataggt 12661 agcgtagtaa aatccatcgg cgtgagtgtg agcgatatga gcagttggat tttaggagtt 12721 gttatagttt gtgtcattat cttggttgtg ttcggtcttg tgagtcattc tgatgcgtgg 12781 cggtggatac agaagttggt gaataaagtc ttagggggat agcgatgact cgacagcgtt 12841 cagcatcaga tttgtggact tacatcgctg aatgtattcg cctcctttac tggatttacc 12901 tcaagcccta cacctttgcc aactggttgc gggatatcca tccagaattg aagccaggta 12961 caaatccctt tgcaaaacga gccgaatttt ccactaaccc gcgcctgcgt cgttatgctg 13021 ggcaggtttg gtggctaact gctgtcatac ctttagtagc tgtgttgata gcagcaccaa 13081 tttatacctt tgtcacgcct ttagtagcac caattaatgg ctttgaggat agtgttgagt 13141 cgtttaactg gcttgctagt ggtctaattt ggcttggttg gttaatcggt ttgatgatag 
13201 tagcacgtgg tgacaacaaa cagaaaatat tagtgttaat ctttgctacc cttacaatac 13261 tcttgccatt catcttgtca ttcaccacac cctcaagcgt ggcgttcgta ggtgtggcgt 13321 ttaacgtggc ggttggcatg gtgcttggcg taccgtttgg cgtggcgttt ggcgtgacgt 13381 ttggcgtagc gagcagcctg gcgattagcg tggcgtttgg cgtgacgctt ggcgtagcga 13441 gcggcctggc gtttaatatc gtcctggcgt ttggtgtggt ggtgttaggg gtggcgtttg 13501 gcgtggtgtg cattcttggg gttttgcgtg tttatttttg ggtgccagag ttgctgtgga 13561 tgcttatcct gcgtttgctc acacagcaaa agaatccagt gctagcactg cgctacttac 13621 cacctcgctt tgacgaactc atccgcttac ccctaccttt catgggtgag atgattgtca 13681 aggcttatcg agaaaaccca accgctgctc gtgaaacaat tgattatcta ataaatttta 13741 ctaaccagca gcaggttgca cttcaggcaa tggctaatat cgctgttact cgcctcaaca 13801 gttgtcaaac tcttagcgat atcgcgaata tgactgaaga actcgcttgg attccctcac 13861 ctccgccaaa agaactcagt tcggcactac cacaactgtt agacatcagc caagatgtat 13921 cagcagcaag tcaagccact tcagctcatc gtcagtctga gttacttaac cgtcccatca 13981 gtgcattgcg tctactaaag aacagtctgg cttttggcaa gaatgctgaa gttgccacga 14041 cttttggcag tattatacag cgttggcaga gtattctgga aactgcacag cgtactttgg 14101 aagaacaagg acggttttca aaagaattcc ggcaagttta tattgctggc aatgctctag 14161 atccccaaac ggcagagaat cgttttaaag gacggatgga cttgtttcgg gaaatcgaaa 14221 cttttgcgct ttctgaatca ccgcctgttt tactactgta tggcgggcgt cgcactggca 14281 aaacttcttc tttgaaatat ctgcccgaca aagtaggagc aagtttgatt cctttgttgg 14341 tggatttaca aggtgcagca tcagcgacaa cgcttcaggg attggcaaac aatctggtgt 14401 cacaaatgac ggaagctgca cggcgattac cccgtcgtct tgacttgcct tacacagatt 14461 taagcagaac agatccattc atcgccctcc aaaattggct cgtacaaatt gaacgcactt 14521 tgcctgataa gcgatttctt ctgtgtttgg atgaatatga gcgcttgagt gaggtggtgg 14581 aaacgacacg tagccgagtt cctctcaact ttctccgcaa tattctgcaa caccgcactt 14641 cttggacttt acttttcagt ggctctcacg aattgtcgga acttcccaca tactggagtg 14701 actacctaat taataccaga gctttacggt tgacatattt gcacgaatca gaagcgcgtg 14761 agttgattga gaaacctgtg gaggacttcc ccaaaatcta cgaaccagaa gcagtagatg 14821 ctattatccg tctgactcgc tgtcaacctt atctcgtgca 
gttgatgtgt tacgaagtcg 14881 tggagttgct caaccgtggt atcaggcaaa ataagcgaga cgcggctact atcaaagcga 14941 ctgtacacga tgtggagaca gttattccca cagtgctgga acggggagat cagtatttta 15001 gggaattgtg gacaggtttt acggagggcg atcgcaactt actccagcgt ctccttcaag 15061 gagaaactcc cacaaaacaa gacaaagcat ctgtgcgaaa actcgtgcgc aaagaaattt 15121 tgtataagga aggtgttgag ttccaagtgc ctttagttca gaagtatgtg gaacagcggc 15181 tagaagacga aacttgatga catggtgagc tatttccggc aagctaaagt agtgaaaagt 15241 gattcttttg taaatttaga aatataggac tcgtatttga tttttgaaaa aaactccgta 15301 cgcctttatt acttcttccc tgttccctgt taagagttcc ctgccctgac gagttagtat 15361 ggctccgcca cgcaagctaa cagaaatcaa accggattcc tatagcatca tatttccata 15421 tttcgctttt tccaataatc gggtttgcgc tttccatgtg caatcgctat cacccaaata 15481 acttcttcaa gctctatata aaagataaga taagggaaac gtttaatagc ataacgtcgc 15541 accccttcaa ttttatatgg tgttccgaga tttggatttt gctgaatatt tccaagaact 15601 ttctccactt cagataagaa atcaagtcct aaaccaaatt tttgcttttc atagtaagca 15661 attgcagcat caagttcctt tctggcttcg gtgtgaataa cgataaattt cacgaatatt 15721 tttctcgcaa ctcagaaaat acttgattta aaggttcgcc aatagctgtc ccacgatgaa 15781 tatcttctag gcgcttagtt aattccgtat cccaagcagc ttctacatca ttatcaacat 15841 ctccatctaa agattgaatc aaaaaatgag caatttcggc acgttcttgt gccgaaagtt 15901 gagaaagttc aagcttgagt ttttccgcag tttcagtcat tttatcgttt ctcttgagca 15961 attgagatac gaggcttaac tcttcaattt taactgcgtt cacgcttggt tcaccctccg 16021 ggtatctcct gcggagacgc tacgcgttag ccctctgggc gtgcgctttg cgcatacgca 16081 gtcgctacaa cggggggaac cccaacgcca ggtccctctg tcgggaaacc ctcctgcagg 16141 actggctccg caacgcgctg ctcacagtgc tgcgcggaac gcagatcgct accttgcaaa 16201 aaaagcggga taggcaactt gcctatccca ctaaagtcac actaaaaaag cgatcgctcc 16261 tctagacgct ctaaaactct aacctacgct taagttccag aagtccaaga gtttgtgtac 16321 tcaatttgat cttctgtgag ggtatcaatc tcaattccca ttgcctgcaa tttcagccgt 16381 gcaatgtctt gatcgacttc tgttggaatt gagtggatac cgggttccag gttaccttta 16441 tttttcacga ggtattcgca acccaaagct tggttggcga agctcatatc catcaccgcg 16501 ctaggatgtc cttctgctgc 
agccaggtta atcaagcgtc cttcacccag aactacgaca 16561 gatttgccac tttgcaggcg atactcttgg gtaaagggac gtactgtctt gacttccttg 16621 gctttacttc ccaaggcgac gagatccaat tcaatgtcaa agtgaccgga gttacagaca 16681 atcgcgccgt ctttcatcac gtcaaaatgc tcaccacgaa tgacgtgctt gttaccagtc 16741 acagtgataa acaaatctcc ttggggtgca gcttctgcca ttggcaggac gcggaaacca 16801 tccatcaccg cttcaattgc tttgatgggg tcaatttcgg tgacaatcac gttagcaccg 16861 agtcctcgcg cccgtaatgc tgttccttta ccgcaccagc cgtaaccaac aacgacaacg 16921 gttttaccag caagcagaac gttggtagcg cggataatac cgtctagggt tgattgtcca 16981 gtaccgtagc ggttgtcaaa aaagtgtttg gtgtcagcgt cgttgacgtt gactgcgggg 17041 aaagtgagaa cgccatctct gaacatggcg cggagtcgca caattccagt tgtcgtttct 17101 tctgtggtcc caataatgtc agcaagttgg tgctggcgtt gttgtaccag ggtagcgacg 17161 acatcacaac cgtcatcaat gatgatgttg gggcgatgat ctaaagcaat ttgtacgtga 17221 cggctgtatg tttcgttatc ttcacctttg atggcaaaga ctggaatttc atgatctgcg 17281 acgaggctag cagctacgtc gtcttgagtt gataggggat tgctcgcaat cagcagggcg 17341 tctgcaccac cagctttgag tgcaattgct aggtgtgctg tttctgttgt gacatggcag 17401 caagccacaa ggcgaatgcc cgcaaagggc ttttcttggg caaagcgatc gcgaatctgc 17461 cgcaaaactg gcatctcacg tccagcccat tcaatgcgct gtcttcccaa ggcagctagg 17521 ccgaggtctt taacctcgtg ctttaatcgg ggagaagttg cggtcatcaa atagttcctc 17581 aattacaaag aaattggcgt aaattcttac gcactctact agagtattcc aagactggca 17641 atctgaaaca agatttgctc acaaaaaagc cagttaatgg ttctttgcac tagcttgacg 17701 tttgagagcg cgaatgtcaa gaactagggg tgtaggggta taggggagag gaagaatcaa 17761 tggacgttaa ctcttgtttc tccctgctaa aagcgccctg cccccgactg cctacttctg 17821 ctgacaactg acatcccata actagattta ttaataaata tgtagtgtga ttaatttttc 17881 ttaagaacaa acgaagaaac acctaactgt ggacaggtat aattactagg attgcttagg 17941 caagatagat ttcagctaag ctagcagcgt tgaaataaca aaggaggtgc ccattgcgct 18001 ctcaatttgt ggcgatcgct ttttcttcaa tagtgatcac tgttgggaca atgccaaaag 18061 ctacagcgca gcagattccc tttttaccta acctaccaac ccctaacttg ttgattcatg 18121 aatcagaaaa agcaacagaa acagattggg tttctttgga tggtaggcgt ttgtttcaga 18181 
tagcggcacc aaaagggagc ttgacacagc gtttacaaga tatccaacaa aggttggatc 18241 aacaaagtca agattacttt caaaaatctg atgcagaacc taatgtacag atcaaaactg 18301 aaaagaaatc aacgactcta aagggaaaaa cgactgaaac cgtattaaca gtaattcttc 18361 ttaatggtca atacctgatg accgtcaatg accaggatgc cagtttgcaa caggataata 18421 cctcaactgt ggcaaatcaa atcgttgaaa agttgaaaga aggcttgaaa caagcaaagc 18481 aagagcgaca aagtaagttt ttaattcaac aaggtcaaat cgctgctgct gctggagtag 18541 cgatgattgt catcagttgg ggggtttatt cttaccaaca ccgccccaaa aaatataaag 18601 cacagcccgt caccacagct tcagcagcag ctagaccgat gacgttacag ctaaatcaac 18661 aacaacatca acatattaca gaggttaaga ggcggctgta tcagctagcg caaacaacga 18721 tttggggggg tggaagtctt tttatactgg gtttattacc ctacacacga ccgcttcagc 18781 tatggattat gattgcagcg caagttcctt tgacattggg tgttatagct ctgggaactt 18841 atgtggcaat acgtctaagc tatgccttga ttgaccgttt caactctgct tttgtcaaca 18901 gtgctgtctt gttgactcca gaagctcccg aacgtataca attgcgagtt tctaccattt 18961 ctggcgttac taagagtctt atcaccatta gctgtgtagc agttggtact ctgctagctc 19021 tgggggcgtt aggtgtagag ctcgttccct tactagcagg tgttagtata gttggtgttg 19081 cagtgtccct ggcatcccaa agtttgatta aagatgcgat taatggtttt ctcatcattc 19141 tagaagacca gtacgcctta ggtgattata ttacagttgg aactgtaggt ggtttggtcg 19201 aaaatttgaa tctgcggatg acgcaagtgc gcgatggaga agggcgctta attaccattc 19261 ctaacagtga aatcaaaatt gttgccaatc tttctagtcg ctggtcaaga gccgatttaa 19321 ctatcccagt tgcataccaa accgatgttg atgaggcgat aaaattaatt caaaaagttg 19381 gtgtggagat ggatcaagat ccgcaatggg atcatcaaat tctggaaaca cctcaagttt 19441 tgggaataga taattttgca gttgctgaga aaggtttctt aattcgtgtg tggattaaaa 19501 cacaacccct caagcaatgg agtgtggcac gagagtatcg ccgtcgccta aaaaacgcct 19561 ttgaccaagc tggaatttcc atttttattc cgtgacaagc cataggactt acggaacctc 19621 accctgccct gtcgggcatc cctctggtgc gatttcacca gagggaaaga ttttagcaag 19681 tagggtgtgt tgtcgccaag cgccgcacca tctatgattt tcggtgcgtt aggacgttcc 19741 gtccataacg caacgccaga tgctacaacg gggggaaccc ccgcaccgca ctggctcccc 19801 taccagatta aaacgaatgt tagttagatt ttccgacttg tgtgtacacc 
gtagccctta 19861 ataaagggtg ggctagtttt tttgtgatta attatacgga catcatatga gtggtaggtg 19921 gtgcaactaa ccactcacca atccccatct cactcgccaa aacagtacaa aaatactata 19981 ataaagatag cgttcacaaa gcccagtaaa gtataaaacg ggtgcgtgac agtcttctgg 20041 agatgggttc aacggtttgt cctatgatga agcactatat tctcagcctt aatccaactg 20101 ctaagcacga atgggatcgg tgtattttac gtgacccact aactgctaaa cgcccggaca 20161 tcgccaagtt agtcgccgaa gcagttggcg ctgatacagg cagttattta gtcagtgtaa 20221 atattgaagt tcaagtgttg gagcaagctg cagtacctca cgctgaacag ctttccttaa 20281 atgttgggga ggtgactatc caagcacctc aactgaggga agtggcgtaa agggagacag 20341 ggggagaaaa tactccctca cctttcccgt ttaccgcagc tactcagtac catatctttg 20401 ctattttgta tgtaaatttg cgtaacaaaa aactgatcta tgttaacaaa gcaactgaat 20461 cattatccag ctgcgttggg cgcagccctt gtcgaagaca tcgctcaagc tgcccaaaga 20521 gtaaatgaag tagattctca gttaatggct gttcagctgc aaattaaccg atttgaaggt 20581 aatgccgatc gcgccgccgc ctttgatatg gatttgaaaa atgatgctca acgcaaagca 20641 cgtcgctttg aagtactact tgtgaatcaa gaatatcaaa aagcgatgga taccctgata 20701 caattaacta ccgaaaaggc gaatgctgtt gcccatttag aatatctgcg caacctgttt 20761 agcgtagcga aactagaagc aagactcaca atcgctcaac aactcacaga ttttgaatcg 20821 catgaattag tgggtttgta gttgtgttgg agaaagtact ttgtggggtt gattagacgg 20881 aacctcaccc tgccctatcg ggcatccctc tccttattaa ggagagggaa aagttagcgt 20941 agctaaaagt cagatctgtg tacaccgtag cttaggggag aatcttaaaa gccccctttt 21001 taactctgtt gggggatctc ttgtgcgtaa gtcctataga tcataactta ttcagcagca 21061 cggcaatttg ctgattaatg gctgtgacat caaaacgctg gctggctgta gtttgtgcat 21121 gatgagcgat agtcgcaatc ttctgtggtt ctttgagaca ggtggttatc acttgtgcta 21181 attcatcagg ctttcctggt gtgactgaga aaccattgac accgtgctct accaattcta 21241 ctgcaccacc agcttttgca gccacaacag gtcttccgca gagcatagcc tctacaatca 21301 ctctaccaaa tggctctgga gcggtcgagg tatgtgcaac taaatcacaa gccgccatca 21361 gcaggggaac atctgaacga aatcctaaaa atttgactcg attttctagt cccagtgctg 21421 tcacctgttg gtgtaagtgc tggacatatt cttgttctcc aaacagtgca tcacctacta 21481 agattgctgt gacatcaaca ggagattgtg 
ttaatgcttc tattaatata tgctgccctt 21541 tccaaggtgc gagacggcta aaatgtccga caacaaattg tccatctaac cccagttgtt 21601 gtctaatttg actgacatcc gactcgtggc tttgataatt tttcgggtca aagccattgt 21661 aaatcacttc agttatatct aaacgtcctc ctgctgctac gaagcctgct tgacttgctt 21721 gagaattagc aatgacgagt gatgcaaaac gattggcgca agttacggca atgcgacggt 21781 tggtttggct aaagtgctct ggggaaagaa tatcatgcaa atgatagacc aaaggacgac 21841 ggctgaaaaa actcgctagt gcgccaacaa ctaaggcttt ttgtgtatta gcgtaaatta 21901 aatcaaattc tctggctttt tgcaccactt tagaaatcaa aggcagaatt tgagccaggc 21961 tgcccaatcc ttgtgctaaa ctgctttctt tacgaacttg tatggcttga tttgtgagaa 22021 cttgaaccgg aatgtgattt tgttctagta aatttttaaa agaaccatca gcaaataatc 22081 ccactaaaca actatcttta tagggtttgg cgatatctat taaacagagt tcagccccac 22141 cgggtttacc actttggtct aggaatagaa ttttcatata attttcatat aattttcata 22201 aaatatttaa taatttcgct cagcagtagc catcaacagt aagttcttgt tcttgcaaaa 22261 atatgatgaa tgaatattta gaaccacaga tgcacaacag acacttgcgc gactgtcggg 22321 aaactccccc accgcagtgt cctccagatt aacacagata atttatctgt gtgtatctgc 22381 tcccaaagct tcgcttgagg gcaaggttta tctgttgttt aattttattc aaattgactt 22441 ttgcaagagg tttagtatat gctaggctgc aaaggaaaaa gttgccaaag cagcctcaag 22501 agattttagt tttttctact tattgtgtta ttgctgagcc aatagaactt gccgtacatc 22561 ttgagctatc ttatcccagt taaagtttgt gacggcgtag tggcgacaag cttctcgtga 22621 aggtgttggt atttttccca atagcacttg ttctaatttt tctgcaatgg ctggaacttc 22681 tgtggaagac gtaatgaaat ccggtgaaaa tggttctaaa atttctggca ttcctccaac 22741 cggagtacac acaacaggag taccgcaggc taaagattct actactgcta atccaaatcc 22801 ttcaaaagac tgactgggca taacactcaa gtcagcggct tggtaagcaa caggtaacag 22861 ttcgtcaggt atgaatccta aaaatttcac atgctgatca agtcctaatt cagtagcctg 22921 ctgttgtagt gttgcttgct gtggtccacg tccggcgatc gccagcatca catctggaat 22981 cctcggctta ataatagcca aggcgtccaa caatttgtct agtccgactc gattcaccaa 23041 gcggcgggcg gtgaataaaa ttgggcgatc ttgcggccag tctagttttg ttcgagcctc 23101 ttgaggtgat agattaggtt gaaactgagt aatatcaact ccaccaggaa tgacatgaat 23161 tttgttccaa 
ggaacttgat actgtttgtg taatatattc ccgaatgctt tgctaagaac 23221 aatgaggcga tcgcaacggt tatatgttct ttgttctatc agcttagctt tgagaagaac 23281 acttagtttc ctctcttcca cttcttcttg actttcaaaa gcccaaggac catgaaagtt 23341 aaaggtgaca ggtacacctt ttggcaaaag atccaaaatc gggaaactat atagtgcaaa 23401 atgtaaatta atcacatccg gtttttcagt tcttgttttc tgaaaattag tgcgaataga 23461 ccataatcgt ttccaaatcg gactacttgg tgatgctaaa ttagtcaatt ttattggcaa 23521 gtttggttca gcttctggta aaccaactcc acataattca actttgtctt gatttgctgc 23581 tagtttctga gttagttcat agatatatcg ttctaatcct ccaggggttt taggaaacca 23641 gcctactcct aaacacagaa tagacttaga ttctaaagca tttttggtat tattttccac 23701 ttacatttct ccgagtacgg gttgtgcctg tgtctacaag tcaattaaca gttatcaaga 23761 aagcgcggga tcaatttcat cgcttgggta aggatttcgc tacgtttgac aacgaaggtg 23821 ttcagacgtg ccgcaggaga tacccgtaag ggcgggtgca aatccccctg catacgctgc 23881 ttcactgcgg ctgacaactg gtgctgtttt catgactaat gtcttttttt gggttcattc 23941 agtacggtta ctcatgctga ctatacatat attctcttac ttgatatata atccgaattt 24001 ttccctatta ctatatattt cattccttta tatcatctct ctatagagtg acaagttcat 24061 aatagaaatt tatctcaaca tgatgattac agaaataatc tatagcaatc ttaaatcatt 24121 cttcataaac aagatccccg acaactttta cgaagtcggg gatctgtggc tttcagtatc 24181 aaaaagaaat gaggcgtaat ttgccataat tcagaggaag gttaagctcc ttaggctgat 24241 ataaaggagt ttaaaacccc tgcacaatta aagattttga cggtatacaa tgaggtgatg 24301 agcgcaaatt atacctaatt gcataatgtg aaattattgc ttcgccgact ccagtttttc 24361 gttctccttc aaaagcacaa agacacaaaa taaacaggag tgactttgtg cctttgcgtc 24421 aaaatatttt cttacttacc gaaaacccca actccgtagt agcacattgc agaattgcaa 24481 ctgggcagat caaccttgac tcgatacgtt ccactccgag tcggagtcac accaacagtc 24541 ggaatgtcat catcatcagt atcagccgca atgacttgac cacggtcatt ataaagggta 24601 atgtctaaat cttgacaatc tccgtcgcac acgccaacga tgccatattg ctgacctgca 24661 cgcaaattta tggttaaata atcagagcca ccactacgaa ttccatcaat aaaaggttca 24721 tgggttagct gatagccatt tagaccacca tgaattgcag ctcgcattaa ttgcatccct 24781 acttgctcaa taaaagtgtt agcactcgcg ggagttggtg caacaaaaag caaaccggag 
24841 atgccagaga taacagctac tgcacctaat aatttagctt tgatattcat aagttaagct 24901 tccttttgtg gtaattttgt actgtaattt atgcacttga aatcaagcac aaggattttg 24961 tagagacggg tttgatcacg tttctacaag cgaatatgta tcttgcacct gttgaagttc 25021 agataggcga gacgaccctg aaacaagatg tttttttatt tgtgggatga gtttattgtt 25081 ggaactatct gttggtgtaa gatgtgggat aaaagaatgc gagcgctaaa taaatccttt 25141 taacctcttc acttacatct ac // LOCUS NODE_1200_length_25154_cov_4.844257 25154 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 25154) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 25154) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..25154 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1092 /locus_tag="DP116_10545" CDS <1..1092 /locus_tag="DP116_10545" /EC_number="2.7.9.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878705.1" /note="catalyzes the formation of phosphoenolpyruvate from pyruvate; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoenolpyruvate synthase" /protein_id="PRJNA477356:DP116_10545" /translation="AIVGCGNATQVLKNGQEITISCAEGEEGRVYAGLLPFEVQEVPL EDLPRTRTQILMNVGNPSEAFSLSAIPNDGVGLARTEFIIANHIQTHPMALIHHDEIN DDFVKAKISEITALYDDKPQYFVEKLAQGIGRIAAAFYPKPVIVRMSDFKSNEYANLL GGRQFEPEEENPMLGWRGAARYYDEGYKEAFALECHALKRVRDEMGLTNVIAMIPFCR TPDEGRLVLAEMAKNGLKQGVNDLQVYVMCELPNNVILAEDFAEVFDGFSIGSNDLTQ LTLGLDRDSGLVARLFDERSEGVKRMVKFAIAAAKKHNRKIGICGQAPSDYPEFAQFL VEQGIDSISLNPDSVLKTMVEVAKVEGRS" gene complement(1153..1719) /locus_tag="DP116_10550" CDS complement(1153..1719) /locus_tag="DP116_10550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_10550" /translation="MNTVVLNLEPIAHLTDEQFYQLCIANRDLSLEMNATGELIIVPP VGGESGNQEADLITDLNNWNRQTKLGKVFSSSTIFILPNGAKRSPDVAWVKLERWEAL TLEQRKKFPPLVPDFAIELRSETDRLAPIQAKMKEYKENGLRLGWLINPQDAKVEIYR PGKVLEVIQMPTILSGEDILPGFELQVF" gene complement(1842..2159) /locus_tag="DP116_10555" CDS complement(1842..2159) /locus_tag="DP116_10555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotidyltransferase" /protein_id="PRJNA477356:DP116_10555" /translation="MTEIITQLPIEIPKDKIAELAQRYHIRKLSLFGSILRNDFRPDS DIDILVEFEPGYTPGFAFIDIQDELSQLLGRSVDLNTPQDISRYFRDQVLREAQVQYV KNG" gene 2217..2375 /locus_tag="DP116_10560" /pseudo CDS 2217..2375 /locus_tag="DP116_10560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745755.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="dolichyl-phosphate-mannose--protein O-mannosyl transferase" gene 3023..5329 /locus_tag="DP116_10565" CDS 3023..5329 /locus_tag="DP116_10565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyanobacterial phytochrome A" /protein_id="PRJNA477356:DP116_10565" /translation="MVSDLQLQGINLTNLKEPKIHIHSQIQPHGILFVLKEPDLTILQ VSSNVSSIFGISPENLLQTKLEDLLDSFQIERIQTGLLEESLDLINPTKIWVRKKGDD YVVFDAVFHRNSEGFLILELEPAISQENIPFLSFYHLAKASINQLHETANLKDFCQII VQEVRKVTGFDRVMLYKFDHDGHGSVIAEEKLESQEPYLGLHYPESDIPKPARKLFTS NWIRLIPDTHAEPVEIVPNNNPETQHPLDLTHSILRSVFPCHIEYLHNMGVGASLTIS LIKEGKLWGLIACHHQTPKYVSYELRKACEFLGRVIFSEISTREETEDYDYRMQLTYI QSALVEYMSQEDSFIDGLVKHQPNLLNLTGAEGAAICFGGHYTLIGETPKEEDLNFLV QWLKSNVNEEVFYTNSLPSIYPDADKFKNVASGLLAIPISKRNYVLWFRPEVIQTVNW GGDPNQAFELSQSEGNLRLRPRKSFELWKETVRLTSLPWKPVEIKAALELRKAIVNIV LRQADELAQLAQDLERSNAELKKFAYVASHDLQEPLNQVANYVQLLEMRYQDKLDEDA TEFITYAVEGVSLMQTLIDDVLAYSKVDMQAIEFQLTEVETALERALTNLRKRVKETG AVVTYSELPTVMADSTQLMQLFQNLIGNAIKFRSDKPPEIHVEATRMEDEWLFCVRDN GIGLHPQFSDRIFVIFQRLHTRDEYPGTGMGLAICKKIVECHRGRIWVESQLGEGATF YFTIPVGGRDRERRNGRKAQNNLFGGRQ" gene 5274..5669 /locus_tag="DP116_10570" /pseudo CDS 5274..5669 /locus_tag="DP116_10570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012406853.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="response regulator" gene 5670..5954 /locus_tag="DP116_10575" /pseudo CDS 5670..5954 /locus_tag="DP116_10575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319230.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" gene complement(6239..7066) /locus_tag="DP116_10580" CDS complement(6239..7066) /locus_tag="DP116_10580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410742.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR01548 family HAD-type hydrolase" /protein_id="PRJNA477356:DP116_10580" /translation="MTEQLSPKAIVVFDIDGVVRDVSGSYRRAIADTVQHFTEGVYRP TPLDIDQLKSEGVWNNDWEASQELIYRYFETLSYTREQLQLDYNAIVTFFQSRYRGPD PHRFTGYICNEPLLLDSKYLEELTKAEIPWGFFSGATRGSATYILEQRLGLKSPVLIA MEDAPGKPNPTGLFATVRLLESSDNRNNDHLTPVIYVGDTVGDMYTVKQARLEEPSRT WIGVGVLPPHVQQIVARRDAYAEKLIDAGASMVFTNVQHLSLLRISELVNETGFGQV" gene 7383..7682 /locus_tag="DP116_10585" CDS 7383..7682 /locus_tag="DP116_10585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209803.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein Ycf65" /protein_id="PRJNA477356:DP116_10585" /translation="MSKFILKILWLDENVALAVDQVVGKGTSPLTKYFFWPRNDAWEE LKKELESKHWITELDRVELLNKATEVINYWQEEGRNRPMAEAQLKFPEVGFTGSA" gene complement(7816..9195) /locus_tag="DP116_10590" CDS complement(7816..9195) /locus_tag="DP116_10590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aminopeptidase P family protein" /protein_id="PRJNA477356:DP116_10590" /translation="MIQTTSSSLIETLRDRRQRLASLIDFPAILWSGTSSPRNYPANV FPFRANSHFLYFAGLPLPNAAIRLAAGKLELFMDDPQPESALWHGEMPDREEIAQIIG ADAAKPMAELELWTQEAATIAVQDAATWTQQSQLLNRWVLPLKSPQDIDLELAKAIIS LRLIHDDGALAELRKAGAVTVEAHKAGMAATTHAQIEAQVRAAMEQVIIANNMTTSYN SIVTVHGEVLHNEQYHHSLQPGDLILADVGAETETGWAADVTRTWCVNGKFSSTQRDI YDIVLAAHDACIAKVRPEVEYQDLHLLACTVIAEGLVNLGILQGNPQDLVEMNAHSLF FPHGIGHLLGLDVHDMEDLGDLAGYEEGRTRSNHFGLSYLRLNRPLRPGMLVTIEPGF YQVPAILNNANVRLQYQDVVNWDRLSQFADVRGIRIEDDVLVTQEGSEVLTAALPTQA NQIENLVCG" gene 9355..9804 /locus_tag="DP116_10595" CDS 9355..9804 /locus_tag="DP116_10595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4332 domain-containing protein" /protein_id="PRJNA477356:DP116_10595" /translation="MSLKQKVSVSRISSSDWCIEQLPGLSQQEQAQLQNCGITTTAAL VKQGKTPADRLVLANKLQIHLQYVNKWVALADLARIPGVGTQYCGLLLHAGIASVVQL AATPIHRLHQQLLRLVVATMQQRDLCPSIEQVQQWSQQAKRVLSNER" gene complement(9852..10466) /locus_tag="DP116_10600" CDS complement(9852..10466) /locus_tag="DP116_10600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458512.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_10600" /translation="MRIFNAPPSSETQTRTRILDAARRLFASQGFDGTTTRDLAQAAG VAEGTLFRHFSNKKAILIEVATQGWVEILTDLLTELSEMGSYKAVAQVMRRRMWNMQK NADLMRVCFMEAQFHPDLRDRIQSEVIVKMTDVAEAFFQTAMDKGIYRRMDAKLVAKV FLGMFAVAGFSNNTLMEPNASPQQMQEMAEGLADIFLNGVLATE" assembly_gap 10681..10690 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 10755..11036 /locus_tag="DP116_10605" CDS 10755..11036 /locus_tag="DP116_10605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015153736.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF433 domain-containing protein" /protein_id="PRJNA477356:DP116_10605" /translation="MVQATEHLYIVRDNHILSGEPIIKGTRTPVRAIVEIWRMGIAPE EIPKGMPHLTLAQVFNALSYYSDHQDEINDYIEHNRIPDNLIDPLVKDL" gene 11033..11380 /locus_tag="DP116_10610" CDS 11033..11380 /locus_tag="DP116_10610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002750655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10610" /translation="MSSLFIRLYLDEDVNILVADLLKARGFDVVTTRDAGQLHASDSQ QLAYAISQGRALVTHNRTDFEALIQAYFVSVQMHCGVILAVRRSPQDIAQRLLVILNQ VTADELQNQVRYI" gene 11381..11518 /locus_tag="DP116_10615" CDS 11381..11518 /locus_tag="DP116_10615" /inference="COORDINATES: protein motif:HMM:PF03128.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10615" /translation="MIQPLVISRIFVNNILYQDILLTISEYHDKEPSCRIWNPDTCLC F" gene 11700..13280 /locus_tag="DP116_10620" CDS 11700..13280 /locus_tag="DP116_10620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_10620" /translation="MNKRSYLYQRKVKNKASQLFVSFVIAVLLWWELLPGIALAKTQT PAPAENSIQPYLNRVMKQLSEFRLDNGMKFIVLERHQAPIVSFLTYADVGGVDEPDGK TGVAHFLEHLAFKGTTRIGTSNYQAEKPLLNRLDQLAEEIIAAKAANKKDEVAKLETQ FKKIEAQAAKLAKQNELGRIVEQAGGVGLNANTSTEATRYFYSFPSNKLELWMSLESE RFLDPVFREFYKEKDVILEERRMRVENSPIGIMMEKLIDTSFKVHPYKRPVIGYDQDI RNLTREDVQKFFNAYYVPSNLTIAVVGDVKPAEVKRLAQIYFGGYKAKEKAVAQISVE PKQTQTREVSVELRSQPWYLEGYHRPAMTHPDHAVYEIIGSLLSSGRTSRLYKSLVEK QQLALTAQGFSGFPGDKYPNLMLFYALTAPGHTVDEVAVALRQEIERLKTEPVSDTDL ERVKTQARASLLRILDSNMGMAQQLLEYEVKTGSWRNLFKELDQIVAVTPADIQRVAT ATFTTENRTVGKLLSKQS" gene 13380..14990 /locus_tag="DP116_10625" CDS 13380..14990 /locus_tag="DP116_10625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458508.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="insulinase family protein" /protein_id="PRJNA477356:DP116_10625" /translation="MNRFRGRRNKRKVSQCGGEAVRSWWFPQGGTAERVPRHKASGEP VRVRSLYLLLVCFVLVPLLCFGVYNTSWAATAAKHYTELQLPPLPQVKIPKYERYVMN NGMVVYLMEDHELPLVSGSAIVRTGDRFEPAQKVGLAQLTGVVMRSGGTTKHTPDQLN QILEQRAAMVETGIGETAANASFQSLSEDLETVFGLFGEVLREPVFAQEKLDLAKTQL RGSIARRNDDPDDIASREFQKLIYGKESPYARTQEYATLNNISRADLIKFKQQSFYPN NMILGIVGDFDSKKMRSLIQAQFADWKPNPNLVKPQLPKVSQNKRGGVFFVNQPQLTQ SNILIGHLGGQFNSPDYPALDVLNGVLNGFGGRLFNEVRSRQGLAYSVSGAWSPRYDY PGMFVAGGQTRSNATVQFVRALQQEIKRLQTQSVMPQELAFAKDSTLNSFVFNFEDPG QTLSRLMRYEYYGYPSDFLFRYQKAVAATTAADVQRVARKYLKPDNLVTLIVGNQSAI KPPLTQLATQVTLIDVTIPGSPTQAATN" gene 15469..16506 /locus_tag="DP116_10630" CDS 15469..16506 /locus_tag="DP116_10630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114811.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GDSL family lipase" /protein_id="PRJNA477356:DP116_10630" /translation="MKKQLMTAGFVLFGLTLPLKASAAGFSQFNVFGDSLSDTGNVFT VSQQNSPTGPIPPDPPYFQGRFSNNKIWVDYLGQDIGLTPTLFANPSTKTPTQGINFA FGGSLSGEDNAFFPGAPGVLKQVGSFVGNNQKVDPNALYAVWGGGNDYLFGQNPNPNQ TVSNLSNAVGALAQAGAKNILVFNLPDLGKTPLAVRTGNTSNLTTLTNAHNAALASAL GQLNNNNPSVNIIPVDINSLFNRVIANPGEFGFKDVNTSCVVYDIRNNVVLKTCNNPN DYLFFDEVHPTTNAHKLVAETALAAIRAKSVPESSTALALLSLGALGAAAMLKRRQKR VSPEFSSKSNL" gene complement(16632..17111) /locus_tag="DP116_10635" CDS complement(16632..17111) /locus_tag="DP116_10635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746434.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10635" /translation="MVQYTLAQSPEIILTVAGKDSAKARDKAMDQLMKLLDEGKLPTE LEDGFGPKQLIEVKETPTDTAGDEDAITQAVQILSNLATLKLKVQESRTEAMEVRKAV DVLFSDNTVTEEEIARLKEGFKILKNYAQANLRYQEARAKAEQARQVLDEALKSPEK" gene 17482..18768 /locus_tag="DP116_10640" CDS 17482..18768 /locus_tag="DP116_10640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10640" /translation="MAPNPTIMQAVEQLGYRVTVGDVATQAGLNVAQASQGLLALASD AGGHMQVSDTGDIIYLFPKDFRSILRNKYLQLQLQEWWKKVWKILFYLIRISFGIFLI VSIALISISIILMISALNSDRDNDNRGGGFSGGFFYFPDLFWYLSPDYDARQRERRRE KSDLNFFEAVFSFLFGDGNPNTKLEERRWQEIAAVIRHNRGAIVAEQIAPYLDDIGEG SAREYEDYMLPVLTRFNGQPKVSPEGEIVYDFPELQVSAAKKYRQSVSAYLEEFPWRF SAASSGQILLSAGLGVANFVGALVLGSLLRNGTVAAQIGGLVAFVHGIYWLLLAYGTG FLVVPLIRYFWIGWRNSKIAERNGDRLSRARQLTDPDSALQQKIAYAQQFAAEKVLGD EDLVYSTETDLLEQEVERSQKIDAEWQKRLERGSGE" gene complement(18830..19978) /locus_tag="DP116_10645" CDS complement(18830..19978) /locus_tag="DP116_10645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sporulation protein" /protein_id="PRJNA477356:DP116_10645" /translation="MKFQVFLASLFSNVKGRHWWLGILIWIAMVAPSQASSVILRVAI QRDVNQVKVGSSTTAVIKDSSGRTVGELPAMSPYSAQAVPGGVALDKWRSNLFWVEPT GKGFVYIGDRWYRGRTLVVPSQKGLSVVNWVALEEYLYSVIGGEMDSRWPLEALKAQA IAARTYALYERERQRNNPLYDLGASPDRWQIYKGVSSESPSTYAAVDATDGKVLTYRN RVILSVFHACSGGHTENSEDVWGSAQPYLRAVPDFDQNIRECNWSRTFSPAEISAQMP EIGNVKQMIPESTSPFGSVKALKIVGDKGEKVLRGEDVRTALRLKSTRFTVSNGNGSF TLTGLGYGHALGMSQWGAYNLASRGYNHLQILNYYYKGVALAPIQVKK" gene complement(20138..20932) /locus_tag="DP116_10650" CDS complement(20138..20932) /locus_tag="DP116_10650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872779.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10650" /translation="MKMILQPMKLLLAVVVGYLSLLPSTTSVIAQSASQSKPLTTTSS RKQPLRLVLPPLPPGQPPGGRRYGGASRGQCPVAKPGLTALVPLIEQPTSVMNVWGQT TAERPTFWFYTPYVKDSAYPADFVLLDAESNPVYRQEITLPSQPGVINVSLPATVSPL ITGKQYRWFLNVYCERQKQQSPVYVEGVIQRVNLNQGIVQQLQQAQPRQQVAIYASNG IWYEALTTVAQLRQKNPQDLTLEQEWQDLLSMIGLGDFAAKPLISR" gene complement(21070..23445) /locus_tag="DP116_10655" CDS complement(21070..23445) /locus_tag="DP116_10655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130769.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor protein Chase2" /protein_id="PRJNA477356:DP116_10655" /translation="MGKLVVLKFADGSFEQGFPVTLQIGEEGERPSTEITGKLPAAGE MPLYYSHWQSSYLKLGNCYRLSADKMQVTNVSVTQDCDNIAHILRSRFNTWLRTEEFR PIREKWLEKLLPTDEIRVILQTEDTQLQRLPWHLWDLLERYPKAEFALASPTYEKISS QKTLNNKVRILAIVGNSQGIDTEVDRALLQQLTDADVTFLVEPQRKELTDFLWGNSWD ILFFAGHSSSHGKDGSGRIYLNKTDSLTISELRYALRKTVQHGLQLAIFNSCDGLGLA RELADLQIPQMIVMREPVPDLVAQEFLKYFLQGFAGGESFYQAVREARERLQGLEDKF PCATWLPIICQNLAQIPPTWPEITGRVEVEVPEPLERSPLPPPPPPRFAVALLSSVVI SVLVCGLRFLGLVQMSELQAFDQMMRLRSLIFHEESDPRLLVVAIDDADIDAQRQRGE DVIGKSLSDISLNKLLEKLQQYEPYAIGLDIYRDFKAQYPNLTFRLKQTENLIGVCKH SDAAIPVKSTAPPPEIPKERLGFSDFIHDPDGVVRRHLLFMEQEAVSACAADYAFSAQ LAFRYLLADKGIQPKFTTQGDLQLGNTVFPRLNSHSGGYQGIDANGGQILLNYRSSKN IAQQVTLTQVLSNQVHPDAIKNRIVLIGVVSKGERSDYWGTPYENHFDQRTSGLLVQA HMVSQLLSAVLDKRPLLQVWSLWGEVIWICGASVVGGVLAWRVRVLPRLTLFVFVCSG VLYVVCFGLLIQGYWVPFVPSALALVGTMSIVFFQNSRLFIVKTQQLSRQT" gene complement(23577..24545) /locus_tag="DP116_10660" CDS complement(23577..24545) /locus_tag="DP116_10660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10660" /translation="MRRTNHNLDDLALTLPITQAACRTAQQFANQQPTPEKAEQVRLN TLCVWVVNEYLEMMDIPTQLTKSDSWNSVLRLCTDVADLELLGIGRLECRPVRFAQEI CYIPPETWEERVGYVVVQIDESLQQAKLLGFVRNVATEELPLSQLQPLEDFIEYLAQL RQTPIRTLVNLSQWLVGIFDAGWQTVESLWNQPEIRPGYAFRNGDTLVQNDTNKTQAL TRRAKLIDLGIQIANQPVILIVEIRPKTDQQTGIRLQLHPTGNQNYLIPGVQLTVLDE SGAVFLETQARSADNYLQLQFRGEPAEQFSVKVSLNDASVTENFVI" gene complement(24997..>25154) /locus_tag="DP116_10665" CDS complement(24997..>25154) /locus_tag="DP116_10665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872393.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10665" /translation="GAIEAEVQQQIRSLSITQLEELAEALLDFFNPSDLVNYLTNISS PQLNPRD" BASE COUNT 7243 a 5235 c 5539 g 7127 t 10 others ORIGIN 1 gcaatagtgg gatgtggtaa tgctacgcaa gttctgaaaa atggtcaaga aatcacgatt 61 tcttgcgcgg aaggggaaga aggacgggtt tatgcagggt tattgccctt tgaagtccaa 121 gaagttccgc tagaagactt accccgcacc cgtactcaga ttctcatgaa tgtgggtaac 181 ccttcagaag cattcagttt gtctgcgatt cccaatgatg gagttggttt agcgcggaca 241 gaatttatca tcgccaatca tatccaaacc catccaatgg cgctgattca tcatgacgaa 301 ataaatgatg actttgtcaa agctaaaatt tctgaaatca ctgcgcttta tgacgacaaa 361 ccccagtatt ttgttgagaa attagctcaa ggtattggta gaattgctgc tgcattttat 421 cccaaaccag ttatcgtcag gatgtcggac tttaagagta acgagtacgc gaacttgttg 481 ggcggtagac agtttgaacc cgaagaagaa aatcccatgc taggctggcg aggcgcagcg 541 cgttactatg atgaaggtta caaagaagct tttgcgttag aatgtcacgc cctcaagcga 601 gttagggatg aaatggggtt gacaaatgtc atagcaatga ttccgttttg tcgaactccc 661 gatgaggggc gtctggtttt ggcagaaatg gcaaaaaatg gtttgaagca gggtgtgaat 721 gacttgcaag tgtatgtgat gtgcgagtta cccaacaatg tcatcctggc tgaggatttt 781 gccgaagttt ttgacgggtt ctcgattggt tctaatgact taactcaact gacacttggt 841 ttagatagag attcagggtt agtggcgcgg ttatttgatg aacggagtga gggtgttaag 901 cggatggtga agttcgcgat cgccgccgcc aaaaagcaca atcgcaaaat cggtatttgt 961 ggacaagcac ccagcgatta cccggaattt gcccagttct tagttgaaca ggggattgac 1021 tctattagtt taaatccgga ttcggttttg aaaactatgg tggaggtggc taaagttgaa 1081 gggcgatcat aaacctcacg ctgcctgcct tcccctcttt gatggcttgg agaggggaca 1141 atacgcctat ttttaaaata cttgcaactc aaatccaggt agtatatctt ctccagaaag 1201 aattgttggc atttgaataa cttctagtac ttttcctggt cggtaaattt ccacttttgc 1261 atcttgagga ttaatcagcc aacccaaacg caatccattt tctttgtatt ccttcatttt 1321 cgcttggatc ggtgcaagtc gatctgtttc agaacgcagt tcaatcgcaa agtcgggtac 1381 aagtggcgga aatttttttc gctgttctag tgttaaagct tcccaccgtt ccaactttac 1441 ccaagccaca tcaggggaac gttttgcacc atttggaagt atgaagatag ttgaagaact 1501 aaaaactttg 
cctaatttgg tttgacgatt ccaattattc aagtctgtaa tgagatctgc 1561 ttcttgattt ccactctctc ctcctactgg tggcacaatt attaattctc ctgttgcatt 1621 catttctagg cttaaatcgc gattggcgat acacagttga taaaattgct catcggttaa 1681 atgagcgatc ggttctagat ttagcacaac agtattcatg aaaactttcc ttgtgccttt 1741 gttgctaaca tcagttatct ttagttaaaa gtttagcaga agaatctttc caatctggcg 1801 atgcaattgg aagtcactaa tttcattatc agagacttac ctcatccgtt tttgacatac 1861 tgcacttggg cttctcttag tacctgatcc cggaagtaac ggctaatatc ttgaggcgtg 1921 ttcagatcaa ctgatcgccc caaaagctgg gagagttcat cctgaatgtc gatgaacgca 1981 aaaccagggg tgtatccagg ttcaaactca actaaaatgt caatgtcact gtcagggcga 2041 aagtcattcc gcaggatgga accaaacagg gaaagcttac ggatatgata acgctgtgca 2101 agttcagcga ttttatcttt ggggatttca ataggtaatt gagtaattat ttctgtcatc 2161 ttggttctgg cgctaacaag tgtgttgggg gttaagctta ttttaatttc tgttggattt 2221 tcccgattga tgtctttgat gaggtttact acgctaaatt tggcaatgac tatcttaccc 2281 acacaccatt ttttgatggt catccaccat taggtaaata tatgattgca ctggggattt 2341 cgattggcaa gcatattccc ttttggcaag atgagggctg ctgcctccct tacttgcggc 2401 atccccgcac ccatcgcatt tccaggtgga accaccgaaa cgagatttta accagatagt 2461 aggggcataa tatcatgtcc gctcaattac ttataattcg tcattgcgaa cggagcgaag 2521 cgaagtgaag cataagccct tcgggcacgc tgagtcccaa ggggacacgc gcaagggacg 2581 cgaacgcaag gttctgagat tgcttcactt catttcatta cgctcgcaat gacaactaat 2641 catccggaca tgatataagt cagctgtacc cctcagagca aacttgtgtt ttactcaatt 2701 gaaaactgct gtaacagagc tacataatat atagtgcagt tcataacatt ttgttaatat 2761 ttattattgt caagtcaggt tttttgaccg ttacactact ataggcaggt ataccaattc 2821 aggcaaatat gtaatgtttt atctgttaat tgaggtaaaa acttaacaca ttaacgttgt 2881 tatgactatc tatgaagtca tctatgaagt aaattgatag agtgttaaga aactgctgaa 2941 acctggtgta tcatacctgc ttagatgtta aacaagatta gtattttctt ttgaagaatc 3001 ctggaaaaac aggagaattg cgatggttag cgatttacaa ttgcaaggca ttaatttgac 3061 taatttgaaa gaaccaaaaa ttcatattca tagtcaaatc caacctcatg gaatactttt 3121 tgtcctcaaa gaacctgact taacaatatt acaagttagc agcaatgtct cttctatttt 3181 cggcatatct cccgaaaatc 
tgctgcaaac caaactagaa gatttactcg actcttttca 3241 aatagagaga atccaaacag gactcttaga ggaaagtctt gatttgatca accccacgaa 3301 aatttgggtg agaaaaaaag gtgatgatta cgtagtattt gatgcagtct tccaccgcaa 3361 ttctgaagga tttttgattc tggagttaga gccagcaatt tctcaagaaa atatcccatt 3421 tttaagcttt tatcacctcg cgaaagcttc aattaaccaa ttgcacgaaa ctgccaatct 3481 caaagatttc tgtcaaatca ttgtccaaga agtacgaaaa gtgacggggt ttgatagggt 3541 catgctttat aagtttgatc acgatgggca tggctcagtc attgcagaag aaaaactaga 3601 aagccaagaa ccttatttag gtctgcacta cccagagtca gatattccca aaccagcaag 3661 aaaattattt acttctaatt ggattagatt aataccggat actcatgccg aacctgtaga 3721 aattgtccca aacaataatc cggaaactca acatccgctt gatttgactc actcgattct 3781 cagaagtgtc tttccttgtc atatcgagta tttgcacaat atgggtgtgg gtgcttcttt 3841 gacgatttct ctgattaaag aaggaaaact ctggggactc attgcttgtc accatcaaac 3901 accaaagtat gtttcctatg aattgcgaaa agcgtgcgaa tttttaggtc gagtcatatt 3961 ttcggaaatc tctacgagag aggaaactga agattacgac tatcgtatgc aattgacata 4021 tattcaatca gctttggttg aatatatgtc ccaagaagac agctttattg atgggttagt 4081 taaacaccaa ccaaatctcc tcaacttgac tggtgctgaa ggcgcagcta tatgttttgg 4141 tggtcattac acattaattg gtgaaacacc caaagaagaa gacttaaatt ttttagttca 4201 atggctaaaa agtaacgtta acgaagaagt cttctacaca aactctttgc ccagcattta 4261 tccagatgct gacaagttca agaatgttgc cagtggtttg ttggcgattc ccatttctaa 4321 gcgaaattat gttttatggt ttcgcccaga agtgattcaa actgttaatt ggggaggtga 4381 tcccaatcag gcgtttgagt taagtcagtc tgaggggaat ttacgcttgc gtccgcgtaa 4441 atcatttgaa ttgtggaagg aaacagttcg cctcacctct ttaccctgga aaccggttga 4501 aataaaagca gcactggaac tgcgcaaagc gattgttaat attgtgctgc gtcaggcaga 4561 tgaattagca cagcttgcac aagatttaga acgttccaac gcagaactga aaaagtttgc 4621 ctacgtcgcc tcccacgact tgcaagaacc actcaatcaa gtggctaact atgtacaatt 4681 gttggagatg cgctatcaag acaaacttga cgaagatgca actgagttta ttacctacgc 4741 cgtcgaggga gtcagcttga tgcagacgct gattgatgat gtgctggcgt actctaaggt 4801 ggatatgcag gcaattgagt ttcaactgac tgaggtagaa acagctttgg aacgcgcttt 4861 gacgaatttg cgaaaacgcg ttaaagaaac 
aggggctgtt gtgacttata gtgagttacc 4921 aactgtgatg gctgacagta cccagttgat gcagttgttc cagaatctca ttggtaatgc 4981 tattaagttc cgcagtgaca aaccaccaga aattcatgtg gaagcgactc gaatggagga 5041 tgagtggctg ttctgtgtac gggataatgg tattggtctt catccgcagt ttagcgatcg 5101 catcttcgtc atctttcaac gcctacacac acgcgacgag tacccaggta caggtatggg 5161 tttagctatc tgcaagaaaa ttgtagagtg ccataggggg aggatttggg tagaatcaca 5221 acttggagaa ggagcaactt tctactttac gattccagtc ggaggacgcg atcgtgagcg 5281 cagaaatgga cgaaaagcgc aaaacaatct ttttggtgga agacaataaa gctgatattc 5341 gtctgatcca agaagcattg aaaaatagct cagtaccaca tcaagtcgta actgtcagag 5401 atggtatgga tgctatggct tatttgcgcc aggaagggga gtatgcagaa gcaccacgcc 5461 ccgacctcat tctgctggat ttaaacttac ctaagaaaga tggtagagaa gtcctggcgg 5521 aaataaaagc tgaccccaaa ctaaaacgca ttcccgtagt cgtgctaaca acatcacaca 5581 atgaagatga cattttccac agctacgact tacacgtgaa ttgctacatt accaaatctc 5641 gaaaccttag tcagctattc aaaatagtca atggtggtac agtgcgattt caacttattg 5701 gtcaagaaca aatagtcgtt tttcagatac aagacgaggg aattggtatt cctaaagaag 5761 accaaccgct gttgttcaag cctttctatc gtgcttccaa tgtggataga attcctggta 5821 ctggcttagg gctagcgatt gtgaaaaagt gtgtagacac acttggtggt gagatttcag 5881 ttcatagtga gattggggtg ggtacgacat ttgctgttac gctacctata accaagtacg 5941 atcagaaaaa ttgagaattt taattattaa gatatattgt gccaacagta tccttaattc 6001 cctccccact cgcttcgctc aaattcaaaa ttataaattc aaaattcgtc ttgaaaagtt 6061 ttgcgccggg aaaccccttc ggattcgcag tcgcctacgg agggagagcc gtcattcgcg 6121 ctgtctcacc ggacgcgcaa acttttcgca aaatgaaaga cagtgaactg ttggggattt 6181 aaaccccaac agttcactga ggtgcaataa cttgcaccct acttacacaa aatagcacct 6241 acacttgtcc aaaacccgtt tcgttcacta gttctgaaat cctaagcaga ctcaggtgtt 6301 gtacattggt gaaaaccata gatgcgcctg cgtctatcaa tttctcagcg tacgcatcac 6361 gacgcgctac tatttgctgt acatgaggag gtaaaacacc aacaccaatc caagtccgag 6421 aaggttcttc caaacgcgct tgcttcaccg tgtacatatc acctactgta tctcctacat 6481 atattactgg cgttaaatgg tcattatttc tattgtcgga agactctaac aagcgaacag 6541 tggcaaaaag tccagtggga tttggtttac ctggtgcatc 
ttccattgcg attaatactg 6601 gagacttcaa cccgagacgt tgttccaaaa tataagttgc agaaccgcga gttgcgccac 6661 tgaaaaatcc ccagggaatt tcagcttttg tgagttcctc tagatattta gagtctagca 6721 ataatggttc gttgcaaata tatcctgtga atctgtgggg atcaggtcct cgataacgcg 6781 attgaaaaaa ggtgacaatg gcattgtaat cgagttgtag ttgctcgcgc gtgtaagaga 6841 gagtttcaaa atagcggtaa attaattcct gggatgcttc ccaatcgtta ttccaaacac 6901 cttcagactt gagctgatct atatcgagtg gtgttgggcg atatactcct tctgtgaaat 6961 gctgtacagt atctgcgatc gctcgtcgat aagaaccact gacatcgcgc acaacaccgt 7021 caatatcgaa aaccactatc gcctttggtg agagttgttc agtcatagaa cgcggacatt 7081 ttagatgagt cattagtagg cagtgcgttg cgcagccagt gcttgatgag ggtttccctc 7141 acgcttgctc tggcgtcggg ttaagcgcgt tgtagcacct gccattcatt agtcattagt 7201 caatagtcat tagtcaatag ttattagtca caacaagcaa tttacttttt aaaatttcat 7261 tctctgtagc ccactcttga ctggctactt tccacttccg tttggaaagc tggtagatat 7321 tgcgttacaa taggcgatcg caaagttagt gagtgttgcc cggatactcc ccggaggtgt 7381 gattgtcgaa atttattttg aaaatactgt ggctagatga aaacgttgcc ttggcggttg 7441 atcaagttgt aggcaaaggc acaagtcctt tgactaagta ctttttctgg ccaaggaatg 7501 atgcctggga agaattaaaa aaggagctag agtcaaaaca ctggattact gaactggatc 7561 gggtggagtt gctcaataaa gctacagaag ttattaacta ctggcaagaa gaaggcagaa 7621 accgcccgat ggctgaagct cagttaaaat ttcccgaggt tggctttaca ggtagcgcct 7681 aaagttgtat aattcactac cctactgttg aacaagctcc cagagtttga tagttttatc 7741 cctgctgcca ctaacctgaa gttgtgcaac agggctaata cttatttttt atgtatttaa 7801 ataacacaac aacaattacc cacaaactaa attctctatc tgattggctt gtgtcggtaa 7861 tgccgcagtt aagacttcgc taccttcttg tgtcactaaa acatcatctt caatgcggat 7921 tcctcgcaca tccgcaaatt gagataggcg atcccaattt acaacatcct gatattgtaa 7981 ccgtacatta gcattattta gaatcgctgg aacttgataa aatcctggtt caattgtgac 8041 taacattcct ggacgtaagg ggcgatttag acgcaggtag cttaaaccaa aatgattgct 8101 ccttgtacgt ccctcttcat aacctgctaa atctcccaaa tcttccatat catgaacatc 8161 taaacccaac aaatgaccga ttccgtgagg gaagaagagg ctgtgggcgt tcatttccac 8221 taaatcttgc ggatttcctt gcaaaatgcc taaattgacc aagccctctg 
caatcacagt 8281 acaagcgagt aaatgtaaat cttgatactc aacttcaggg cgtaccttgg caatacaagc 8341 atcatgagcg gctaacacaa tatcataaat atctctttgg gtagatgaaa actttccatt 8401 aacacaccaa gtacgtgtga catcagcggc ccagcctgtc tctgtctcag caccaacatc 8461 agcaagtatc aaatcgcccg gttgcaaaga gtggtgatat tgctcgttgt gcagaacttc 8521 accatgaacg gtgacgatac tgttgtacga cgtagtcatg ttattggcga tgatcacttg 8581 ttccatcgcg gcacgaactt gtgcttcaat ttgagcatga gttgtagctg ccatgccagc 8641 tttgtgtgct tctacagtca cagcaccggc tttgcgtaat tcagctaatg cgccatcgtc 8701 gtgaatcagg cgtaacgaga taattgcttt ggctaactcc aagtcaatgt cttgtggtga 8761 cttcagcggc aaaacccatc tattcaaaag ttgtgactgt tgcgtccaag tggcggcgtc 8821 ctgcacagca attgttgctg cctcttgtgt ccacaattcc aattctgcca ttggtttagc 8881 agcgtctgct ccaattatct gagcaatctc ctcacgatcc ggcatttctc catgccaaag 8941 agcgctttct ggttgaggat catccataaa cagttctagt ttacctgcag cgaggcgaat 9001 tgctgcattg ggtagtggaa gtcctgcaaa gtagaggaaa tgactgtttg cacgaaacgg 9061 aaagacgttt gctggatagt tacgcggact gctagttcct gaccaaagaa ttgctggaaa 9121 atcaatcagg ctggcgagtc gttggcgtct gtcgcgtaga gtttcgatga gagagctaga 9181 ggttgtttgt atcataatct gggaaatgaa agcaaatatc tcaaagtttt gttagtttgc 9241 agagcaaatg aagatttttc aacagccaaa attttgctta agattacgat ggaataaaga 9301 tatcaatgag tgtgtttatc actaattgat atagcgtatt tattttttga aaaaatgtct 9361 ctcaaacaaa aagtaagcgt aagtcgtatc tcatcctctg attggtgtat tgagcaatta 9421 cctggattga gtcagcaaga gcaagcccaa ctgcaaaact gtggaattac aaccaccgca 9481 gcacttgtca aacaaggaaa aacacctgca gatagactag tgctggcaaa taaattacag 9541 attcatcttc agtatgttaa taaatgggta gcattggctg atttagctcg tatccccggt 9601 gtcggaacac aatattgtgg attattgctg catgcgggta ttgcttctgt ggtacagtta 9661 gcagctaccc ccattcacag attgcaccaa caacttctac gcttggtggt agcaaccatg 9721 caacaacggg atttatgtcc ttcgattgag caagttcaac agtggagtca acaagcgaaa 9781 agagtactga gtaatgagcg atgaatttag tatgcaaaac aagaaattgt tgactgttga 9841 gtgttgactc attactctgt tgccaagact ccattcaaga aaatatcagc caatccttct 9901 gccatttctt gcatttgttg gggtgaggcg tttggttcca taagggtgtt gttggaaaag 
9961 cctgcaacgg caaacattcc tagaaaaacc ttggcgacaa gtttggcatc catgcggcga 10021 tagattcctt tatccattgc agtttgaaag aatgcctcag caacatcagt cattttgaca 10081 atgacttcag actggatgcg atcgcgcaaa tccgggtgaa actgtgcctc cataaaacaa 10141 acccgcatta agtcggcgtt tttttgcata ttccacatcc ggcgacgcat aacctgagca 10201 acagccttat aactgcccat ttcactcaat tctgtcagca aatctgtcag aatttccacc 10261 cacccttgag ttgcaacctc aatcaaaatc gcctttttat tggaaaaatg acgaaacagg 10321 gtaccttcag caactcctgc tgcttgtgcc aagtcgcggg tggttgtgcc atcaaatcct 10381 tgagacgcaa acagtcgtct tgccgcatct agaatgcggg tgcgtgtttg tgtttctgag 10441 gatggaggag cattaaaaat tcgcataaaa gttatagtga tatcgctgga atagaatgtg 10501 tagcattatt gtctaatgtt aaatgcaata acaaccgaaa gcttaataca agattaagac 10561 cttgttgact gacgacgcca agtcatagac taacttttcc ccgcaagggg acgaaaacgc 10621 tttatgcagt tgaacgtagc gaaaacgttt tagccgtggg agcgtcaagc tatgcgacct 10681 nnnnnnnnnn atgcgactca tcaacaaaat cgttacaatt aatcctgacg gttttctcta 10741 gataaaaagc agcaatggtt caagcaacag aacatcttta tatagttaga gacaatcata 10801 ttctaagtgg agaaccaatt atcaagggaa ctcgtacacc tgttcgggca attgttgaaa 10861 tttggcgaat gggtattgct cctgaagaaa ttccgaaagg tatgcctcac ttaacgttgg 10921 cacaagtatt caatgctttg agttactaca gtgatcacca agatgagatt aacgattata 10981 ttgaacacaa tcgaatccct gacaatctaa tagatccgtt agttaaagac ctatgagtag 11041 tttgttcatc cgcttatatt tggatgaaga tgtcaatata ttggtggcag acttattaaa 11101 agcaaggggt tttgatgttg tgactacacg agatgcagga caacttcatg caagtgattc 11161 acagcaactt gcttatgcta tcagtcaagg aagagcgtta gttacccata atcgaactga 11221 ctttgaagca ctcatacagg cttatttcgt atccgttcaa atgcattgtg gtgtaattct 11281 tgctgttcgt cgctcccctc aagacattgc acaacgacta ttagtcatat taaatcaagt 11341 gacagcagac gagctgcaaa atcaggtgcg gtacatctga gtgattcagc ccttggtgat 11401 atccagaatt tttgtaaata acattttata tcaggatatt ttgctgacaa tttctgaata 11461 tcatgacaaa gagccgtctt gccgcatctg gaatccggat acgtgtttgt gtttctaaga 11521 atagaggaaa attaaaaatt cgcagaatca aaagcttgtg agatatctgg aacaggatgt 11581 ctcgcattat tgtctaaggt ttaatgcaag aacgacagaa 
aacttaatac aagactaaaa 11641 ccctcttgac tgactacact gtttctattg tcaagaaaac tttactttta ttttcgtgta 11701 tgaacaagcg cagttactta taccaaagaa aagttaaaaa caaagcaagt caacttttcg 11761 tatcttttgt tatagcagtc cttctttggt gggaattact accgggaatc gctttggcaa 11821 agactcagac tccagcgcct gctgaaaatt cgattcaacc ttatttgaat cgggtgatga 11881 agcagttgag tgagttccgc ctcgacaatg gtatgaagtt cattgtcttg gaaagacatc 11941 aagcgccgat agtttctttt ctgacttatg ctgatgttgg tggtgtggat gaaccagatg 12001 gaaagactgg tgtagcccac tttctggagc atttggcttt taaaggtaca acgcgcatag 12061 gtacttctaa ttaccaagca gaaaaaccac tacttaatcg tttggatcaa ttggcagagg 12121 aaattatagc agcaaaagcg gctaataaaa aagatgaagt tgctaaatta gaaacccagt 12181 tcaagaaaat agaagcacaa gcggctaaac tcgccaagca aaacgaacta ggacgaattg 12241 tggaacaggc gggaggcgtg ggtttaaatg ccaatacctc tacggaagcg actcgttatt 12301 tttatagttt tccttctaat aagttggaac tgtggatgtc gttggaatca gaaagatttc 12361 tggatcctgt gtttcgggag ttttacaagg aaaaagatgt cattttagaa gagcgacgta 12421 tgcgggtgga aaactctccc atcggaatca tgatggaaaa gttgattgat acatctttca 12481 aagtccatcc ctacaagcgt ccagtgattg gttacgacca agatatccgt aacttaacac 12541 gggaagatgt gcagaagttt ttcaatgcgt actatgtccc cagtaatttg actattgctg 12601 ttgttggtga tgtcaagccc gctgaggtta aaagactagc acaaatttac tttgggggtt 12661 ataaggcaaa agaaaaagca gttgctcaaa tctctgtaga accaaaacaa acacaaacgc 12721 gagaagtgag tgttgaacta cgctctcaac cctggtatct ggaaggttac catcgtcccg 12781 caatgactca tccagatcac gcggtttatg aaattattgg tagcttgctc agtagtgggc 12841 gtacgtcgcg actgtataag tctttggtgg aaaaacagca gttagcgctc acagcacagg 12901 gttttagtgg attcccagga gataagtatc caaatttgat gttgttctat gctctcacag 12961 ctccgggtca cacggttgat gaggtggctg tggctctgcg tcaggaaatt gagaggttga 13021 agacagagcc agtgtctgat actgatttgg agcgcgtaaa aacacaagcg cgggcgagtt 13081 tgttacgcat acttgattcc aatatgggaa tggcacagca gttgttggag tatgaagtga 13141 aaactggctc ttggcggaat ttgtttaagg agttggatca aattgtagct gtgacacctg 13201 ctgatattca acgggtggcg acagcgacgt ttacaacgga aaatcgtaca gttggtaaat 13261 tgttgtcgaa gcaaagttga 
ttgacggata agaggaatga accgctaaga cgagccagcg 13321 ctgcaggagg gtttccctcc gtaggcgact ggcgtagcaa agggcacgaa ggaagagata 13381 tgaacagatt taggggcaga aggaataaga gaaaggtgag ccagtgcggt ggtgaggcag 13441 tccggtcttg gtggtttccc caaggaggaa ctgccgaaag ggttccccgg cataaagcat 13501 ctggcgaacc cgtaagggtg agaagtcttt atttgttgct agtttgtttt gtgttagttc 13561 cgttgttgtg ttttggagta tacaacacat cttgggcggc gacggcggca aagcattaca 13621 cagagttgca gcttccacca ctacctcagg tgaagatacc gaagtatgag cggtatgtga 13681 tgaacaatgg catggttgtg tatttgatgg aggatcatga gctaccattg gtgagtggtt 13741 cggctatagt acgtacgggc gatcgctttg aacccgcaca gaaggttggt ttggctcagt 13801 tgacgggtgt tgtgatgcgt tctggaggaa caaccaagca tacaccggat caactcaatc 13861 agatcttgga gcaacgcgcg gcaatggttg aaactggcat tggtgaaact gctgctaatg 13921 ctagttttca gtcactcagt gaagatttgg aaacggtttt tgggttgttt ggtgaagttc 13981 tccgcgagcc agttttcgcg caggaaaagt tggatttggc gaaaacacag ttgcggggtt 14041 ctatcgctcg tcgcaatgac gatccagatg atattgcaag tcgggaattt cagaaactaa 14101 tttatggcaa agaaagtcct tatgcccgta ctcaggagta tgcaacgctg aataatattt 14161 ctcgtgcgga tttaatcaag tttaagcagc agtcttttta tcccaataat atgattttgg 14221 gaattgtggg ggattttgac tctaagaaaa tgcgatcgct tattcaagct caatttgctg 14281 actggaaacc caatcccaat ttggttaaac ctcagttacc gaaagtgtcg caaaataaac 14341 gaggtggtgt gttttttgtc aatcagccac aactgactca aagcaatatt ctcattggac 14401 atttgggagg acagtttaat agtcctgatt atccagcgct ggatgtgttg aatggagtgt 14461 taaatggctt tggtggacgc ttgtttaatg aagtgcgatc gcgtcaaggt ttagcttact 14521 cagtttccgg tgcttggagt ccccgatacg actaccctgg aatgtttgtt gccggtggac 14581 aaacacgctc gaatgcgaca gtgcaattcg tcagggcact acagcaagaa atcaagcgcc 14641 tacaaactca aagcgtcatg ccacaggagt tagcttttgc gaaagactct accctcaact 14701 cgtttgtttt caattttgaa gatcctgggc aaactttatc acgcctcatg cggtacgaat 14761 attatggtta tccgtctgat tttctctttc gctatcaaaa agcagtagcc gcaaccacag 14821 cggctgatgt acagcgagtt gcgagaaaat atctcaaacc agacaatttg gtgactttga 14881 tagtcgggaa tcaaagcgct attaaaccac cactaacaca attagcaaca caagtaacgt 14941 
taatagatgt gacaattcct ggttcaccaa cacaagcagc gacaaattaa tcaatatctt 15001 aacatttcca cacagtaggg gtgattcaga aatagcccct atttttttaa tagcatgagc 15061 ggaattcaaa agacaaagct tttttatagc aggaaactct taacgcttaa ctcttaacag 15121 gcgtgtgatt gtctcttagt gtccgtgttt tctcattagt tcatgtccta atctacctgg 15181 caactgctat agtttttagt tttgtttcgt taagtgactt gaagtattca cagtttgttt 15241 accaaattat tgtaaattat aaaaaagtgg taaaaataaa cattgataac tatcatatgt 15301 ggtaactttt ggattaaact cttggtacat tagtacataa aattactcaa acaaaagtta 15361 tcaggatttg tttttcaagc atatacaggg aaacttttta acaaagctag ggcgctcgct 15421 atttatcaag cttagtaagt aatcaaaaat acgaggaact gaataagtat gaaaaaacaa 15481 ctgatgacag caggatttgt cctcttcggt ttaactttgc cactcaaggc ttcggctgct 15541 ggttttagtc aatttaacgt attcggtgac agcctttctg atactggtaa tgtattcact 15601 gtttctcagc aaaactcacc tacaggtccc attcccccag atccgcccta ttttcaagga 15661 cggttttcta ataataagat ttgggtggac tatcttggac aagatatagg attgactcct 15721 acattatttg ctaatccaag tactaagact ccaacccaag gtatcaactt tgccttcggt 15781 ggttctctct ctggtgaaga caatgctttc tttccaggcg caccaggagt gctcaagcaa 15841 gtcggctcct ttgtgggaaa taaccagaag gtagatccaa atgcactcta tgctgtgtgg 15901 ggaggtggaa atgattactt gttcggtcag aatcctaatc ctaatcaaac agttagtaat 15961 ttatcaaatg cggtaggagc acttgcccaa gctggtgcaa aaaatatttt ggtatttaac 16021 ttgccagatt taggcaagac tccgcttgca gtgagaactg gaaatactag caatttaacg 16081 actttaacta atgctcacaa tgcagcattg gcatcagctt tgggtcaatt aaataacaac 16141 aaccctagtg tcaatatcat ccctgttgat attaactctc tatttaatag ggtaatcgcc 16201 aatccaggag aatttgggtt caaagatgtt aacacttctt gcgtcgtgta tgacatcagg 16261 aacaacgtag tgttgaaaac ctgtaataac ccaaacgatt atttgttctt tgatgaggtt 16321 catcccacta caaatgctca caagcttgta gcagaaacag cactggcggc gattagggct 16381 aagtccgttc ctgaatcctc cacagcatta gctctattat cgcttggtgc tttgggtgca 16441 gccgcaatgc tcaaacgcag acaaaaaaga gtcagccctg agtttagtag caagtcaaat 16501 ttataactgg gttactatag cagatttcgt ttgaatcaca tacacaatgt cgtatgtaga 16561 gaccgtacag tacggtctct actgtattgc acgcaacttt gtttttaatt 
cacctcagtc 16621 aacagctcaa actatttctc cggagatttc aaagcctcat ctaaaacttg tctagcttgt 16681 tctgcttttg ctcttgcttc ttgataacgc agattagctt gagcataatt tttgagaatt 16741 ttaaaacctt ccttcaggcg agcaatttcc tcctcagtca cagtattatc tgaaaataaa 16801 acgtcaactg ccttacgaac ttccatagct tcagtgcgtg actcctgcac tttcaactta 16861 agcgtggcta ggttgctcaa aatttggaca gcctgtgtaa tagcgtcctc atcgccagca 16921 gtatctgttg gtgtttcttt gacttcaatg agttgtttgg gaccaaatcc atcttccagt 16981 tccgttggta atttcccttc atccagcagt ttcattaact gatccatcgc tttgtcgcgg 17041 gctttggctg aatctttacc agcaacggtg aggataattt ctggactttg agcaagagtg 17101 tactgaacca taattcggca gaaaatttgc aagtaaacca agccagggta ataatcttaa 17161 cgcgaaatca agctgcaaga ccattaaatg atccactgtg caagaataac tgttcgttaa 17221 atcagtgaac agctacctac gcagctacct acgcagtgaa caggaactgg taactgataa 17281 ctggtaactg gtaactggta actgataact gataactgat aactgttgaa caggatcggg 17341 ataatttccc ttaattgagg aaaatcaaag agcgatgtca aggtttttct ttatcgcctc 17401 tggtaagata tttccaagtt tttaaaaatc aaactttata ctggaagagg taagagtatt 17461 tttcgctgtg agggaagagc aatggctcca aatcccacca tcatgcaagc tgtcgaacaa 17521 ttgggttacc gcgtcactgt tggagatgtg gcaactcagg caggattaaa tgtcgctcag 17581 gcaagtcaag ggttattagc tcttgcgtct gatgctggcg gacatatgca agtgagtgat 17641 acgggtgaca ttatttattt atttccaaaa gacttccggt caatattacg gaataagtat 17701 ttgcaattac aattacagga gtggtggaaa aaagtttgga agattctatt ttacctaatt 17761 cggatatctt ttggaatttt cttgattgtt tccatcgcct taataagtat tagcattatt 17821 ttaatgatct cagcgctgaa ctcagaccgt gataacgaca atagaggtgg cggttttagt 17881 ggtggcttct tctattttcc agatttattc tggtatctca gtccagatta tgacgcccgt 17941 cagagggaaa gacgccgcga gaaaagcgat ctcaatttct ttgaagccgt attttcgttt 18001 ttgtttggcg atggtaaccc caataccaaa ttagaagaac gccgttggca agaaattgct 18061 gcggtgattc gtcataaccg gggtgctatt gtagccgaac aaattgcacc atatcttgat 18121 gatataggag aaggttctgc aagagagtac gaagactata tgctacctgt tctcactcgc 18181 ttcaatggac aacccaaagt tagtcccgaa ggagagattg tttatgactt tccagagttg 18241 caggtgagtg cagccaaaaa atatcgtcag 
tctgtgtcag cttatttgga agaatttcct 18301 tggcgcttta gtgcggcgag ttcaggacaa attctgctga gtgcagggtt gggtgttgca 18361 aattttgttg gtgcattagt gctaggaagt ttgttgagaa atggtacagt agctgcccaa 18421 ataggtggac tcgttgcttt tgtacatgga atttattggc tattactggc ttacggaaca 18481 ggttttttag ttgttccgtt aatacggtat ttttggattg ggtggcgcaa tagcaaaatt 18541 gctgaacgta acggcgatcg cctatcccgc gccagacaac taacagaccc agactctgcg 18601 ttacagcaga aaatagctta tgctcagcag tttgcagcag aaaaagtcct tggtgatgaa 18661 gatttggtct actctactga aactgactta ttagaacagg aggttgaacg ttctcaaaaa 18721 attgacgctg agtggcaaaa gcgattggag cgagggagtg gggagtaggg taaagatgag 18781 ggagatgaga attttgaatt ttgaattgtc ttatccccct catcctcgtc tacttcttaa 18841 cctgaatagg tgctaaagca acacccttgt aatagtaatt caaaatttgc aggtggttgt 18901 atccccgcga tgctagatta taagctcccc actgactcat gcctaaagca tgaccatagc 18961 caagccctgt gagtgtaaag cttccattcc cgttgctgac ggtaaagcgg gtgcttttta 19021 gcctaagagc tgtacgcaca tcttcacccc tgagtacttt ctcacccttg tcaccaacta 19081 ttttcaaagc tttcacgctt ccgaaaggtg atgtactctc aggaatcatt tgcttgacat 19141 ttccaatttc tggcatttgg gcgctaattt ccgctgggga aaaagttctg ctccagttac 19201 attctctgat attttggtca aaatctggaa cagcacgcag gtatggctga gcacttcccc 19261 aaacatcttc cgagttttca gtgtgtccgc cagaacatgc gtggaagaca gagagaataa 19321 ccctatttct gtaagtcagt actttcccat ctgtagcatc gactgcagcg taggtactag 19381 gagattcact gctgacacct ttgtaaattt gccaacggtc tggcgaagca cccaaatcat 19441 aaaggggatt attgcgctgt ctctcacgct cgtaaagagc atatgtacgg gcggcgatcg 19501 cctgagcttt cagggcttcc aagggccacc tagagtccat ttcgccaccg ataacgctgt 19561 agagatattc ctccaaagca acccagttaa caacagataa acccttttgt gaaggaacaa 19621 caagagttct gccgcgatac cagcgatcgc caatataaac aaatccctta cctgttggct 19681 caacccaaaa cagattagag cgccacttat ccaaagcaac tccaccagga acagcttgag 19741 cagagtaagg actcattgct ggcaattctc ccacagtacg accactacta tctttgatca 19801 ccgcagtggt agaactaccg actttcacct gattgacgtc cctttgaatt gccacacgca 19861 ggatgacaga tgatgcttga gatggggcaa ccatagctat ccatatgagg ataccaagcc 19921 accaatggcg 
tcctttgacg ttagaaaata aagaggctag gaacacttgg aatttcatgt 19981 tgacaattac agttacacct atattaggtt tgacgcttgc ttttgcagta gtttcctttc 20041 aggtaggatt ttaactccat caaatgatag atgaactacc acaagcaagc aacactatgt 20101 tgtatagcgt ctgtcatttg agcgctcaat tttgaaatta cctggaaatc agcggtttgg 20161 cggcaaaatc acccaagccg atcatggaaa gcaaatcctg ccactcttgc tcaagagtca 20221 gatcttgggg atttttttga cgcaattgtg ccactgttgt taatgcctca taccatattc 20281 cattgctcgc ataaatagca acttgctgtc ttggctgtgc ttgctggagt tgctgaacaa 20341 ttccctgatt caagttgact cgttgtatga ctccttcaac ataaaccggg ctttgttgct 20401 tttggcgctc gcagtagaca ttcaaaaacc accggtactg tttacctgta atcagtgggg 20461 aaacagtagc aggtaaagaa acgtttatga ctccaggttg acttggtaag gtaatctcct 20521 gtcggtagac tggatttgat tctgcatcaa gcaacacaaa atcagcgggg taagcagaat 20581 ctttgacata aggtgtgtag aaccagaaag ttgggcgttc tgctgtggtt tgtccccaaa 20641 cattcatcac cgaagttggt tgctcaatca aaggaactaa agccgtgagt ccaggtttag 20701 caactggaca ttgaccgcga ctcgctcccc catagcgacg accaccagga ggctgtcctg 20761 gcggaagtgg aggtagaacc aggcgcagag gctgcttacg agaagaggtt gttgttaagg 20821 gttttgattg tgatgcagat tgagcgatca ccgatgttgt gctggggagc aaacttaggt 20881 acccgacgac aactgccaac aacagtttca tcggttgtaa aatcattttc ataaatcact 20941 tttaactgtt gagtgttaag tagatgggtg taaataaata taagatggac tgtcattgcg 21001 aggaggaacg acgaagcaat cccattgatt gcgattgctt cgttcctcgc aatgacatat 21061 tacatttaat tatgtctgcc tacttaactg ttgagttttg actatgaata atctcgaatt 21121 ttgaaagaaa acgatgctca ttgttccgac aagtgctaac gctgaaggta caaacggcac 21181 ccaatatcct tggatgagta gaccaaagca gacgacgtag agaactccag aacagacaaa 21241 gacgaaaagt gtcagtctgg ggagtacacg cacccgccaa gccagaactc ctccaaccac 21301 agaagcgcca caaatccaga tcacctcacc ccacaatgac caaacttgca acaaaggacg 21361 tttatcgagg acagcgctga ggagttggct aaccatgtgt gcttgtacaa gcaatcctga 21421 cgttcgctga tcaaagtgat tttcatatgg tgtgccccaa tagtcactac gctctccttt 21481 agatacaaca ccaattagga caattcggtt ttttatagca tctggatgaa cctgatttga 21541 taaaacttgt gttagcgtca cctgttgagc aatgttcttg gaagaacgat agttcagcaa 
21601 aatttgacca ccattagcat caatgccttg atatccgcca ctgtgagagt tcaaacgagg 21661 aaaaacagta ttgccaagct gcaaatcccc ttgtgtggtg aactttggtt gaattccctt 21721 gtcagcaagc aaataacgaa atgctagctg tgcactgaat gcataatcag cagcacacgc 21781 agaaacagcc tcctgttcca taaacaggag atgtcgccgt accacgccat caggatcatg 21841 aataaagtcg ctaaatccca ggcgttcttt tgggatttcc ggtggcggcg cagtactctt 21901 aacaggtatt gcagcatcac tatgcttgca aaccccaatc aaattttcag tttgtttgag 21961 acgaaaagtt aaatttgggt attgcgcttt aaaatctcgg tagatatcca agccaatagc 22021 atagggttcg tactgctgaa gtttctccaa aagtttattg agagagatat ctgagagaga 22081 ttttccaatc acatcctcgc ccctctgccg ttgagcatca atatctgcat catcaattgc 22141 taccaccagc aggcgtggat cgctttcctc gtggaagatg agggaacgta gacgcatcat 22201 ctggtcaaaa gcttgaagtt cactcatttg cactaacccc aaaaacctca gtccacaaac 22261 taggacactg atcaccactg aggacaacaa cgctacagca aatctgggtg gaggcggtgg 22321 gggaagaggc gatcgctcta atggttctgg tacttccacc tcaaccctac ccgtaatctc 22381 aggccaagta ggaggaattt gtgcgagatt ttgacaaatg attggtaacc aagtcgcgca 22441 gggaaacttg tcctcaagac cttgcaaccg ttcccgtgct tcccgcactg cttgataaaa 22501 ggactcacct ccagcaaacc cttgtagaaa atacttcaaa aattcctgtg cgactaagtc 22561 tggaaccggt tcgcgcataa caatcatctg cggaatttgt aaatcagcca gttctcgcgc 22621 cagtccgagt ccatcgcaag agttgaagat tgccagttgc agtccatgtt gcacagtttt 22681 tcttagggca tagcgtaatt cgctgatggt gagactatcg gttttattca ggtagattcg 22741 cccagatcca tctttgccat gacttgaact gtgtccagca aagaaaagaa tatcccagct 22801 gttaccccag agaaagtcag tcaattcttt gcgttgaggt tccaccaaaa acgtgacatc 22861 tgcatctgtt agctgttgca gtaaagcacg gtcaacctct gtgtcaatcc cctgactatt 22921 gccaacaata gctaaaatac gaactttgtt attaagtgtt ttctgggaag agatcttttc 22981 gtatgtaggc gatgctagcg caaattcagc ttttggatat cgttctagta aatcccacaa 23041 atgccaaggt aaacgttgta actgagtatc ttcagtttgt aaaatgaccc gtatttcatc 23101 cgtaggcaat aatttttcta gccatttctc gcggatgggg cgaaattctt ctgttcgtag 23161 ccacgtgttg aagcgcgatc gcaaaatatg agcaatattg tcacagtctt gggtgacgga 23221 cacatttgtc acctgcattt tatcagcaga caaacgataa 
caattaccca actttagata 23281 actagactgc cagtgactat agtacagtgg catttctcct gccgcaggta acttacccgt 23341 gatttcagtt gagggacgtt caccttcttc accgatttgt agcgttacgg gaaacccctg 23401 ctcaaagcta ccatctgcaa atttcagaac cactagcttg cccataattg aagcaaaagt 23461 acacgaagtt gggaagcaga atcatttttg agttttcaac ggcagttaca ctcaacagcg 23521 cgctgactca ccaagggtgc actgccttgt cttgactttt gactcttgac tcttgactaa 23581 atgacaaagt tctctgtgac acttgcatca ttcaatgata ctttaacgct aaattgctct 23641 gctggttcac ctctaaattg taattgcagg taattatctg cacttctggc ttgagtttcc 23701 aaaaaaacgg ctccagactc atctagcacc gttagttgta ctcctggtat aagataattt 23761 tgattaccag taggatgcaa ctgaagacga atgccagttt gttgatcagt tttcggtcga 23821 atttcgacaa ttaatatgac gggttgattg gcaatttgga ttcctaaatc aatcaacttc 23881 gcccgtctgg taagtgcttg tgttttatta gtatcgtttt gcacaagcgt atcaccattg 23941 cgaaaagcat aacctggtct aatttccggc tgattccaca aagattctac tgtttgccaa 24001 ccagcatcga agattccaac caaccactgg ctcaaattca ccaatgtcct gatcggtgtt 24061 tgtcgcagct gtgcgagata ttcgataaaa tcttctaagg gttgcagttg acttaaaggt 24121 aattcttcag ttgccacatt acgaacaaac cctagtaact ttgcttgctg aagcgattca 24181 tctatttgga cgacgacata acccactcgt tcttcccaag tttctggggg aatgtaacat 24241 atttcttgtg caaaacgcac gggacgacac tctagacgcc ctattcccag caactccaag 24301 tccgccacat cagtacataa acgcagaaca gagttccaac tgtcactttt tgttaactgg 24361 gtgggaatat ccatcatttc caaatattca ttcaccaccc atacacaaag cgtatttagt 24421 cgaacttgtt cggctttttc aggagtcggt tgctgattgg cgaactgctg ggcagttctg 24481 cacgctgctt gagttatagg caacgttaaa gctaaatcat ctaaattgtg gttggtgcgc 24541 ctcatatgaa atccccagtg aagaacgctg ctacatacat atcgtttaaa ggcatgctat 24601 ttacgcaaga gcaaaaaaaa gcaatatttc aatttaaaag ctaaattaag cgtcttacca 24661 atagagttgg gcaaaatatt gtaataattt ttaacaattt atgacaaaaa aatcccccac 24721 atctgtaccg gcaaacacag gattagtaat caagtgttga cagatgtagg ggaataattt 24781 actcggattt tggttaacca actataggaa gcaaagaacc ttatttagta ttcccatcgg 24841 aaattgaaaa acgatcctcc acttctgctg agaagtcggg gatctttttt ctcgtgcctg 24901 tagggttgag tactcccctt 
gggcaaaaat ggacatccta atgcctgtga aagtaatagt 24961 gcgagcataa agagcgaaca tttgcgatgc tcgctgttaa tctcgcggat taagttgagg 25021 tgaagaaata tttgtgaggt aattgactaa atcgctggga ttaaagaaat ccaacagcgc 25081 ctcagctaac tcctctagct gagtaatgga taaactgcga atctgctgtt gtacctccgc 25141 ctcaattgca ccaa // LOCUS NODE_1204_length_25114_cov_5.072748 25114 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 25114) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 25114) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..25114 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..385) /locus_tag="DP116_10670" CDS complement(<1..385) /locus_tag="DP116_10670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division protein FtsQ/DivIB" /protein_id="PRJNA477356:DP116_10670" /translation="MAGIVSVSRSDLKGRRKKLRQKRQMKIIQTIWQTIVVSSLAGGL LWTAIQPIWVLKTPKDIEVLGNHVLSGEAIQSLLVVSYPQSLWRIEPHRIAESLKQQP MIVQANVNRRLFPPGLIIQIKERVPV" gene complement(887..1237) /locus_tag="DP116_10675" CDS complement(887..1237) /locus_tag="DP116_10675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10675" /translation="MSNGKKLELTKLSCQLQTIASAVYDTAQDCQGNAKALLALLRQL EQLHREIRDGAFQQSLPDNRQGLYSLLRDIEAEGGWPYIERMRLQAFLTKAEQQTAVE NSSEVLENDSSLNW" gene complement(1751..2071) /locus_tag="DP116_10680" CDS complement(1751..2071) /locus_tag="DP116_10680" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10680" /translation="MLLPLAQALNLKLPKPLHLLRCVVKGWSKNKQFYIQPIVLWENQ SDHQPDHTIFFNSKSKLSDIYQQSGIASSDPPVIARDFRNASLLFYLSRQNFNCCSTG SFPV" gene 2179..3012 /locus_tag="DP116_10685" CDS 2179..3012 /locus_tag="DP116_10685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Photosystem II manganese-stabilizing polypeptide" /protein_id="PRJNA477356:DP116_10685" /translation="MRFRALIVALLALCLGLLTACSEGPSSSSTEFLTYDQIKGTGLA VKCPQLAETSRGFIPIDNSQSYTIKELCLEPTQYFVKEEPLNKRQQAEYVAGKLLTRR TYSLDQITGDLKINPDKSLTFVERDGFDFQAITVKLPGGESVPFLFSLKGLEAQTQPG LTSINTSTDFEGTFRVPSYRGSAFLDPKGRGVTSGYDNAVALPASSDDKELTNANVKR GDVLKGKIFLQIAKVDSSTGEIAGTFESQQPSDTDLGAKKPEEVKIRGLFYGRVEPAQ V" gene 3998..4771 /locus_tag="DP116_10690" CDS 3998..4771 /locus_tag="DP116_10690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997994.1" /note="sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released; this sigma factor controls the expression of genes coding cell surface proteins involved in motility and growth on nitrogen; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigF" /protein_id="PRJNA477356:DP116_10690" /translation="MPITVTNELKHEIWQLLREYQLSGSPDFRNQLVKLNFGLVRKEA HYWMNQCHESYEDLLQVGCLGLIRAIERFTISKGYALSSFAIPYIRGEIRHYLRDKGV TVRIPRRWLALQQQAIKVSRSLREKYNRQPTDSELAAALEISLDEWQEIKLAWTNRAP LSLDLPVQNGDESATLLGDLVPDNRYHSFQLAQEDQILIQQALMQLEQPTRQVLEFVF LDDLTQKQAAERLGISVVTVSRRLRKGLDLLKHLMYVPE" gene 4791..5900 /locus_tag="DP116_10695" /pseudo CDS 4791..5900 /locus_tag="DP116_10695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314483.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 5903..6148 /locus_tag="DP116_10700" CDS 5903..6148 /locus_tag="DP116_10700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10700" /translation="METKETIEFAGLPLAVYREIAAHLCQVEGVEVSLIPQTSQQFDY SQSQIAGLWISYTPECGAQSRQRVQQILTYYRSRYVA" gene 6396..7148 /locus_tag="DP116_10705" CDS 6396..7148 /locus_tag="DP116_10705" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10705" /translation="MYLEITDKIFVLLTKIMQSNRTNFLRFLLQSEPGELAIANGLRA ALGLGIPMLLGQLIDQRQNGLFVALMAFSVNLANVGGPYRIKATAMAIATLGIAVSAF VGTLVAGVPVLSVVLTFLLMTLLVYMGRFTKAVTVLAVHLEHFQRTQPLPELETFVRQ ISLLLEQLAQSAQQEVAPPPLPDLEETLQKIQPHLQALRIARIQELAVNQGHTPRRQA VIDYSILDMEIDQIVRRLSAMHSAMVRLSTQL" gene 7339..9828 /locus_tag="DP116_10710" CDS 7339..9828 /locus_tag="DP116_10710" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10710" /translation="MEHWQFLIQKQGDRSWSPIESPNIEIVEGRYRVLARSDLTNTDM YVQITHSSTMEFPPKRRIQKRSGSTNSEGLIAVLPFSYLKPGVWELCCSGNLMSHFVD QPWQHCVELKVLPILVIEVGRQEEDTCVPRVDEEEFCLFTDELLEEEAMISQPISPVW LKGEKVEQILQNLIEIALPDSQLVPAIEDFSNQTLEPPLVLKLQEDFYTVPWGQTLTI NGRVEQKETTNLDLSLTSNYERVYAGEIRIELRSPQNSKVIRRVRQSLSEKLIPFQIR CSIEVPADCESKLILGEISLYGILTMDGKTTLLTSQFFTMTADVTELLAISAKAKKNE LDTIEHQAVSSAALATSVAKKLSTPLDLQLFNLAKTPKKAKSHLLQPSPKKSLPPKID PPLRRKPVGISPQLPSFIPHQNRIITPTVVWEPSSKFESDRDDSTITVVSKVIRMETT LPYLRRLKTSQNTTKGIMRYESVDEQQYTTEIHNEDAAKSILEETQSQIESFDEDAAK LATDNAQPQDTFVKLVIPHNSQLITTGSPNISPLIRKWMHNQGYSLPEPINLQNQDDD IYIVASQNHVSDEVNKETQTHADANDLERTQIEEQVGIETQEDKTIQNILYPSGTPAA KGYAKRIKYPLLAREANKISPSPSPQPSNSLDGRILSARLVHEIVVEDIFDETEPKTF KNQPSKQKEESVSDVSVGLPVLAEITEPLPVPQLHLPSGELISGKFVRICVQLAPVAD EVAVKLWILDCQTRGLIGEPRLLTNLRLNPAGVLEEITHLRVPFGCLEICLEAIAVNK TTQQESHKVSILRTVMPPDLPRLQLEEVFGT" gene 9941..10087 /gene="psaI" /locus_tag="DP116_10715" CDS 9941..10087 /gene="psaI" /locus_tag="DP116_10715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997990.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit VIII" /protein_id="PRJNA477356:DP116_10715" /translation="MLTASFLPLILTYPASYLSSIFVPVIGWVLPSVTFAFLLLYIER DDIG" gene complement(10338..11669) /locus_tag="DP116_10720" CDS complement(10338..11669) /locus_tag="DP116_10720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875754.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10720" /translation="MDVILERSHQLKQDLTDFVYDAEGELAQALESYAAVKSRRGSGD NFQQELIIDSFITQGRVGDSSPLDLFIQSYQELSKVDRELIKSWHRTFSGLFAITKIL PDGFELRNWLTDKYYIVKPNTSQKLNEMSRLKEGEILLTRISPITDSYWTFFSSYTQM GKLGKPKLAVAIGNFKENYKENLYGDAPDLLEEAWQSVGIYHQQFLDFFASDEVTLPG YQLNKKIAEFQEILTKKRLEEAGIDPSKSLGDVAQEAGLGEEEIKAVAEEFGADSQAV SQMFDKKNSGSKMVMPKVDLPAELRKAEQVTAISHPRWGQMFLPTYSKIKTILEAEDW QSVEGAEKLIRFYLEDKSINAFIWHRLAQEYPTQLEKVLQTFLQRPNFQLQNDLDTLL QEYNKPIEPELPEIASVPIHLHNLFQEALMEVSKSKPKGKSQNQTVKGFGK" gene 11840..13159 /locus_tag="DP116_10725" CDS 11840..13159 /locus_tag="DP116_10725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006633896.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-isopropylmalate dehydratase" /protein_id="PRJNA477356:DP116_10725" /translation="MGMTLVEKILAKASGRSVVEPGENIWVNVDLLMTHDVCGPGTIG VFKREFGADAKVWDPEKIVLIPDHYIFTADQRANRNVDILRDFAKEQSIKYFYDITDL SNFKANPDYKGVCHIALAQEGHTRPGEVLFGTDSHTCNAGAFGQFATGIGNTDAGFIM GTGKLLIKVPATMRFVLDGEMPPYLLAKDLILQIIGDISVAGATYRTMEFAGEAVERM TMEERMTLCNMVIEAGGKNGTIAPDETTFEYVRARTDKPFEAVYTDSDAKFYSEHHYD VSKLEPVVAKPHSPDNRATARECSDVKINRAYIGSCTGGKTEDFFHAAQVLKGHKVKV PTYIVPATQKVYEDLFKIKYQEQTLSEIFLEAGCIEPAAPSCAACLGGPKDTFGRMNE PEICVSTTNRNFPGRMGHKEAGIYLASPFTAAASALTGYVTDPREFL" gene 13387..13872 /locus_tag="DP116_10730" CDS 13387..13872 /locus_tag="DP116_10730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458321.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gluconokinase" /protein_id="PRJNA477356:DP116_10730" /translation="MIILVMGVSGSGKSSIGQLLADSLHWGFSDADAFHSPENIEKMR HGIPLNDLDRVPWLLALEQAIQQWLQENKNMVLACSALKASYRQVLVLDEERVKVVYL KGPFELIQKRLQQRHGHFMGEKLLKSQFDTLEEPSGAVTVDVSEPPEVIVQKIRVSLG I" gene complement(13940..15412) /locus_tag="DP116_10735" /pseudo CDS complement(13940..15412) /locus_tag="DP116_10735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748913.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="SAM-dependent DNA methyltransferase" gene complement(15419..16072) /locus_tag="DP116_10740" CDS complement(15419..16072) /locus_tag="DP116_10740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="restriction endonuclease" /protein_id="PRJNA477356:DP116_10740" /translation="MKPSLTLDILLEEAVKFAEIESIYDEPLLFGVTDGKAVGTYLEQ KFRNYLALLYEYVLGNSASGIDFPDLNIDIKVTSIRQPQSSCPFKSARQKIFGLGYGL LIFVYEKRDDHQHKTGRLNMQHTVFIDQRRTADYQMTRGILSILANEGNADDLVAFIM DRNLPVDEIEARNIAEEILRNPPEQGYLTISNALQWRLQYSRIIQMAGEVNGIIRVR" gene complement(16050..16247) /locus_tag="DP116_10745" CDS complement(16050..16247) /locus_tag="DP116_10745" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10745" /translation="MSLVQPLPDNLKQGDEVKVLMTPISPKYSFPTLKLGIKEEYLSK EKIYEPEEHWRRGNNEAFADT" gene complement(16321..17457) /locus_tag="DP116_10750" CDS complement(16321..17457) /locus_tag="DP116_10750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007062209.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_10750" /translation="MAITCGTVVANLYYNQPLLVVIAQNLQASNHATGWIPMLTQIGY AVGLFLLVPLGDLMERRRLILMMLVLTSVALGAAAASLNLAWLLVASLAIGITSVSAQ LIIPFAAQLAKPEHRGRIVGTVMSGVLIGILLARTVSGFVGASLGWRAMYWLASGLMI VLAVVLLRMLPKSQPSLRVSYPQLVGSLFKLIQQQPILREASLAGAMSFGAFSAFWST LVFFLAQPPYHYGSEVTGLFGLVGVVGATAAPVAGKIADKRNPKITVALGLSITTLSF LIFWVFGYHILGLIVGVILLDLGAQSTHISNQARIFSLPLEFHSRLNALYMTFSFMGG ALGSFLSAYAWSRWEWHGVCAIALLMLGVAFMTFIKRRRQPLVS" gene 17667..18422 /locus_tag="DP116_10755" CDS 17667..18422 /locus_tag="DP116_10755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876617.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YebC/PmpR family DNA-binding transcriptional regulator" /protein_id="PRJNA477356:DP116_10755" /translation="MAGHSKWANIKRQKAVVDAKKGKTFTQWSRAIIVAARSGVPDPV GNFQLRTAIEKAKAAGLPNENIERAIAKGAGTLSGGASSLEAIRYEGYGPGGVAILLE ALTDNRNRTAADLRVAFSKNGGNLGETGCVSWMFSQKGVCTVEGIVDEEQLLEASLEG SAEFYEMTEEQIAEVFTEVVNLENLSQTLKEKGFKVTDAELRWIPGNNVEVTDPDQAR SLLKLIDTLESLDDVQNVTANFDMSEELMAVMA" gene complement(19158..19448) /locus_tag="DP116_10760" CDS complement(19158..19448) /locus_tag="DP116_10760" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10760" /translation="MDGVLEGDVVGVLPVGVVAELELLGRREVFFPFLVVSDFFFLGV LVVLLGLVVVGVVVAGVVVAGEDICAFTAGAVRATGAATTRDRSKAFAILVM" gene 19683..20939 /locus_tag="DP116_10765" CDS 19683..20939 /locus_tag="DP116_10765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315711.1" /note="Oxygenase that introduces the hydroxyl group at carbon four of 2-octaprenyl-6-methoxyphenol resulting in the formation of 2-octaprenyl-6-methoxy-1,4-benzoquinone; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-octaprenyl-6-methoxyphenyl hydroxylase" /protein_id="PRJNA477356:DP116_10765" /translation="MAQAQLQTLNPPQTPTHTRGYDYDLVIVGGGIVGLTLASALKDS GLSVLLVEAKVESAAVAKGQAYAVHMLSALIFQGIGIWNKILPNIETYRRVRLSDADY PAVVEWETGDINTKDLGYVAEHQALLHPLQEFVKNCPNVTYLCPAQVVNIQYQQDVVT MDVKIADQMQTVRTKLVVAADGSRSRIREAAGIKTRGWKYWQSCIVAFVKPEKPHNNT AYEKFQSSGPFAILPLPGNRCRIVWTAPHEEAKALCALDDEQFLRELQARYGNQMGKL ELLGDRFIFQVQLMQSDRYVLHRLALIGDAAHNCHPVGGQGLNLGIRDVAALAQVLQE ADAQGEDIGNIQILKRYERWRQRENLTILGFTDLLDRMFSNNILPVVIVRRLGLWMMQ RVPMVKVFALKLMIGLKGRTPKLAQH" gene 21872..24061 /locus_tag="DP116_10770" CDS 21872..24061 /locus_tag="DP116_10770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128107.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tail length tape measure protein" /protein_id="PRJNA477356:DP116_10770" /translation="MLKKLQTKHISLIVGAGLCAFLAGAIVSAPEIGKSVSQWLKPRK TQPEKLSEESKAKSPVFALVSQSPQERGAKLQALKEASKPADQNRARYLLASDLIEQK QAQPALKLLDGLDRDYPVLAPYVLLKQAQAHEILGQEGKASDLRQKVLKDYPKEAAAA KALYLIGVPEYQDKAIAEFPSHPLTWEIIRKRLETNPNQPQLQLVLAKYAYDQPGTVP VVDQLAKNSTLKPEEWDIVGTAYWENNEFGKVTTAYAKGTKTPRNLYRVGRGLQVSGK RAEALAVYQELVKAFPEARETSTALLRLAEMATVRKDALPYLDQVITKFPDKAGTALV EKVKILQTQDQKAASEATKLLITKYANSEEAAEYRWKIAQQKAKAKDYKAAWQWAEPI AKNNPNSILAPRASFWVGKWAAKLGKKEEAKQAYEYTLSNFPHSYYSWRAGATLGLNV GNFNSVRQMNPNLVPFQRPVPTAGSETFKELYLLGQDRDAWLQWQTEFQNKMQPTVAE QFTEGLMQLAKGENILGINTVSKLEDRETPQEQAQYQALSKQVTYWQARYPFPYQKEI EQWSQKRQLNPLLVTGLMRQESHFEPKVRSTAGAVGLMQVLPSTAKWIAPQIQLDSTK INLENPNENIMFGTWYLDHTHEQYRNNSMLAIASYNAGPGNVSKWVQTLPKEDPDEFV ESIPFDETKNYVRQVLGNYWNYLRLYNSETSQLVGKYSTVHPQLPVQ" gene 24155..24964 /gene="tatC" /locus_tag="DP116_10775" CDS 24155..24964 /gene="tatC" /locus_tag="DP116_10775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878398.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="twin-arginine translocase subunit TatC" /protein_id="PRJNA477356:DP116_10775" /translation="MTPSQNVDTTTSPSVDFEVDGDSKPEIDPMDPLDELPGEVEMSI FAHLEELRQRIFYSLIAVAVGVVGCFLAVKPIVQLLEVPAQGVKFLQLAPGEYFFVSM KVAGYSGLLLCSPFVLYQVIQFVLPGLTRRERRLLGPVVLGSSFLFAGGLVFAYLLLV PAALKFFISYGADVVEQLWSIDKYFEFVLLLLFSTGLAFQIPIIQVLLGALGIVSSGQ MLSGWRYVIIGAVVLGAILTPSTDPFTQSLLAGAVLGLYFGGIGLVKLVGK" BASE COUNT 7452 a 5355 c 5208 g 7099 t ORIGIN 1 ctacagggac tcgttcttta atctggatga ttaaccctgg tggaaacaga cgacgattga 61 cattcgcttg tactatcatc ggttgctgtt tcaaagattc ggcaatccta tgaggttcta 121 tccgccataa agactgagga taggaaacca ccagcagtga ctgaatcgcc tcaccagata 181 gtacgtgatt acctaaaacc tcaatatctt tgggtgtttt cagtacccaa atcggttgga 241 tagccgtcca cagcaaacct cccgccaatg aactcacgac aatggtttgc caaattgtct 301 ggataatttt catctgccgc ttttgacgta atttcttacg gcgacctttt aaatctgagc 361 gggaaaccga tacaatacca gccattcaaa cccttccata cttttaatcc tagtcgtgat 421 tgcgcttgta gtattacgta attttagtca atgctgttag cgggacttag atctgctctc 481 cagatgaatc tagccttttg atttttttgt ggcaaccatt tcctacttca aaaagctaca 541 aaaaaaatag tattatctgc atactacttt ttcattgtcc gaccgagaac tcaggatcta 601 aaatcacaca agagtactga aacggataaa atttgttgat atttaaaaaa atttaatcag 661 agtatatacg ctgtcatatt tttaacaatt tacaggacaa acagggtaaa ttcccataat 721 atcatgacta gcgtgatatt gctgaatata atgcccacac gggcatgtga tgcgggagca 781 agccttagag atggtcatgg ggattggtca gaggtttgtc gtcaagaaca attaacgaaa 841 aataactgac aatacccaat ctcaaatcca caatccaaaa ttactactac cagttcaagg 901 agctatcgtt ttccaaaacc tcactgctgt tttcaacagc cgtctgttgt tctgcctttg 961 ttaaaaaggc ttgtagcctc atacgttcaa tatacggcca gcctccttca gcctcaatat 1021 cgcgaagcag tgagtaaagt ccttgacgat tatcgggcaa actctgttga aaagccccat 1081 ctcggatctc tcggtgtaac tgctccaact gtcttagtaa ggctaaaaga gctttagcat 1141 tcccttgaca atcttgggct gtatcataga ctgctgatgc aattgtttgc agttgacaag 1201 acaatttcgt taactctaac tttttaccgt tgctcataca cccctgtccc tatgaatcag 1261 gtttgatcaa atttactgga gattttgtgc tttcacctgt taccagtcaa ccattagttg 
1321 aaaaggttaa ttgatttatg aagtgttgtt gtgacaaatt ttctttgcat aacttaaaga 1381 aaataagttt ttgggagtct ctgactttgg tcacaaagac tcactacccg aactcgcctc 1441 ctgctagtac tttatcaacc acaaattgac aggtattggt ctgggaaagt ggggtttgtt 1501 gtgaggatga ggcaaaagcg taaagttaat aagcatttcc ccatccgcca aagttccttt 1561 ggtgggctac tagtgattat tttactgaaa gcacagttat aattttagtc tctgtaacca 1621 tagcaatact taatacattt taaacatctc tattgtcaat gagtgttaat caccagttta 1681 agaatagttt ataaatttta tatattgatc acatttgtgt gtgacattag tattatataa 1741 tacactaatc ttagaccgga aaactacccg tagagcaaca gttgaagttc tgacgactca 1801 gataaaaaag caacgaggca ttgcggaagt ctctggcgat cacgggcgga tcgctacttg 1861 cgatcccgct ttgttggtaa atatcagaaa gctttgattt tgagttaaaa aaaatcgtat 1921 gatctggctg atgatcgctt tgattttccc ataaaacaat gggctgtatg taaaactgct 1981 tattctttga ccacccttta accacgcatc tcaacagatg cagtggcttg ggtagtttga 2041 ggtttaaagc ttgggctaaa ggaagcaaca cgtttaaaaa cgcataactc ctgataagtt 2101 taagtcacgg cttgggctta ccaaagcttg ggcgcagttt tacaagttaa atattaattt 2161 ttgacattga ggttaaccat gaggtttcgc gctttaattg ttgcactctt agctttatgc 2221 ttgggtttac ttactgcttg tagtgaaggt ccgtcatcga gtagtacaga gttcctaacc 2281 tacgaccaga ttaaaggcac cggcttggct gtcaaatgcc cccaactggc agaaacaagt 2341 cgtggcttta tccccataga taatagccag tcctacacca taaaagaact gtgcttagag 2401 ccaacccaat actttgtcaa agaagaaccc cttaataaac ggcaacaagc agaatatgtt 2461 gctggcaaat tgttgactcg ccgcacttac tccttggatc aaattactgg cgatctcaaa 2521 atcaatccag ataaaagcct cacctttgtg gaacgagacg gctttgactt tcaagctatt 2581 acagtcaaac tgcctggcgg tgaaagcgta cctttcctgt ttagcctcaa aggcttagag 2641 gctcaaacac aacctggctt gaccagcatc aacacctcca cggactttga aggtaccttc 2701 agagttccct cctatcgagg ttctgccttc ctcgacccca aaggtcgcgg tgttaccagt 2761 gggtatgata acgcggtggc tctccctgcg tcatctgatg ataaagaact gactaacgct 2821 aacgttaagc gaggtgatgt tctcaagggc aaaatttttc tgcaaatagc aaaagtagat 2881 agctctactg gtgaaattgc aggcactttc gagagtcaac agccttcaga taccgactta 2941 ggcgccaaaa aacccgagga agttaagatt cgtggtctgt tctacggtcg tgtagaacct 3001 
gctcaggtct agattgttct gataaaccta agaggtaact tcttaccctc ttaacaggag 3061 aatttctgga acgaggtttt ggagttacct ctgttggtat gtagtttaca aaaatggcaa 3121 aaagaggcga tatgcacaag tacgactttg ggtgtatcgt cttttttatt tgagcatgtg 3181 aatatcttta attattttaa atttaaatta tgaattgtaa agtcaattta ataagctaat 3241 gcgcgtacca gactgcactt ttttcgttca gggaggcgga gcatctccgg ttccgttctt 3301 gagtgttgac ttccttgaca aataaatatg atcttttggc gcgacagcta atcatgtctt 3361 tcaaagttaa gtattatccc tgttagaatc tgtttttatt tctgatatgt atattctttt 3421 tttcccagat ttccgatgta ttcacgactt taattcagta atattacggt tttttattga 3481 gtgataaatt ttttatttta tttctatgtg gttttcggaa agtcacagga cataccacag 3541 agttaactaa aaaacaaaag aacccctcca atttaaatta agctgcattg tcaatcgagt 3601 tatggtagct gttttcagca caaaattctt tatctgccat tggagcctct gtgttatgta 3661 agtatcgaat accgctgtat ttggcgtaag ttcgctattt tgcacaaagg cgtccagaag 3721 gcgttcgcgt gtctgtagcg tttctgagtt gcataaagta caaaacccca gcctctctac 3781 taagcatcct cacgtcacgt gctacaacag ggagggttgg ggaacttaat ctcggggatt 3841 aggggcagat tgtactgtat ccaaaagaga accgctatat tgtgcggagt ttcatcattg 3901 aatgggtatg ctgcaaaaaa tctgacatat aactgcaatc gcagcttggt ttaaaagtga 3961 gtatttcttt gtttattcga ccaagaaatg cgatgttatg ccaatcacag tcaccaacga 4021 actgaagcat gagatttggc agttgttgcg agaatatcag ctatctggct cacctgactt 4081 tcgcaatcaa ttggtgaaac ttaattttgg acttgtcaga aaagaagctc attactggat 4141 gaatcaatgt catgaaagct acgaggactt gctccaagta ggttgtttgg gtttgattcg 4201 agctatagaa agatttacaa tttccaaagg gtatgcttta agctcctttg ctattcccta 4261 tattcgcggt gaaatacgac actatcttcg agataaggga gtcaccgtgc gaattcctcg 4321 acgctggctt gcgctgcaac agcaagcaat aaaagtttcg cgttctctac gggaaaaata 4381 taatcgccaa cctactgact ctgagttagc agcagcattg gaaatttctc ttgatgaatg 4441 gcaagaaatt aaattggcgt ggactaaccg cgccccctta agccttgatt taccagtgca 4501 aaatggagat gaaagtgcaa ctttattggg agatcttgtt ccagacaacc gctaccacag 4561 ctttcaacta gcccaagaag accaaattct tatccaacaa gcattgatgc agttggaaca 4621 acccacccgc caagttttgg aatttgtttt tttggatgat ttgacgcaaa aacaagcagc 4681 agaacgcctt 
ggtataagtg tagtcacggt ctctcgtcgg cttaggaaag ggctagactt 4741 gctgaaacac ttaatgtatg tgccagaata atggcttttg agaacagttc atgggcagaa 4801 atttaaaaat tgtaatagca gctattctga ctttggtgac tttggcgatc gcctcttgtt 4861 cctcaactcg tgaacaatcc agtcatccta cacctaccga taaaccagca cctcgccagc 4921 cctttcataa cccagtggtc tctgttaaca aaaaagtcac gaatcaaccg tttcataacc 4981 cagtggttgt taccaaacaa actccaccag ttgttcccat caccactgct aacttgattc 5041 aaccaaccga ttctacaaaa cgggtaagta tcgtgtcaaa aggtcggaat gatccgtttg 5101 caaaaattgt ggttccttat tccataaaag tcacgaatca gcctgcacag gtaaaacctg 5161 ttcctaagtc acctcctcta agcgcagcta ctcctttccc gcccgaagat taccgtttag 5221 gtagcacggg agccggggag ccggggagcc ggggagcaga aatgacatcg gtaatcttac 5281 accgggaagg gagtaagagg aaacagcaca gtgcgagcgc actcttaagt aaaaatcctt 5341 taaaacaaaa gctaaatcaa caaaaacaca gaattaacaa aaaagcgatt cgccttgcga 5401 tagcttcgct tcacgcccaa agtgcagcgc ccaaaagggt gattaagcat aactctctgc 5461 cgccagtggc gcgtgcgtct cataataaaa ttaaacccaa ccgtgccttg actcgtgtgg 5521 tgccaaaagt tttaccgcaa gctatcccta accctgcctt agcatccata acaccaccac 5581 aaccagagtt agcaaaagca gtatttgttt ctggtgttat tctcattggt aaaaaatccc 5641 aagcaattat caaagtacca aatgaaccaa caagtcagta tgtacacgca ggacaacagt 5701 tagcaaatgg agtgctgatt aaacgtattg aaataaatga aggctccgaa ccagtcgtaa 5761 ttctggaaca atatggtatt gaggttgcga aaatggtagg acagagacta actacttaaa 5821 cgccacgaac tgcttctggg gaaaatcctg tttctaacac agcgctaccc caaaaccctg 5881 ttccagagga agatacttaa agatggagac aaaagaaaca attgaatttg ctggtttacc 5941 tttagcagtc tatcgagaga tagcagctca tttgtgtcag gtagaaggag tagaagtgag 6001 tttaattcct caaacttctc aacaatttga ttatagtcaa agccaaattg ctggtttatg 6061 gatttcgtat acgcctgagt gtggcgcaca aagtcggcaa cgggtacagc aaattttgac 6121 ttattatcgc agtcgctacg ttgcttgacg aattgtcaca ataaatttta gtaaatatgt 6181 ctatatttaa gaatttttgg gttgaggttt ctccacccaa caattctatt tttttgttga 6241 ggaaatgcgc ttattttact tgacaaaata acgtttttgt gtataaagat cagttagaaa 6301 tctgttggtt tgcgagaact ctctgatttc caacgagcga tctccctatg tcgctacgcc 6361 atcagagcat acactcctca 
gcttaatcct actgagtgta tctggaaatt acagataaaa 6421 tttttgtttt gttaacaaaa atcatgcagt cgaaccgaac taattttctt aggtttcttc 6481 ttcaatcaga accaggggag ctagctattg ccaacggctt gcgagccgcg ttagggttag 6541 gaattcccat gctgttgggg caacttattg accaaaggca aaacggattg tttgttgctt 6601 taatggcttt ctctgtcaac ttggcaaatg ttggtggccc ttatcggata aaagctacag 6661 cgatggcaat tgccacgttg ggaatagctg tttcggcgtt tgtaggtact ctcgtagctg 6721 gagtgccagt gctgagtgtg gtgttgacat ttctcttgat gacgctgctg gtttacatgg 6781 gacgttttac caaagcagtc acagttttgg cagtacatct cgaacatttt caaaggactc 6841 agccactccc ggaattggaa acctttgtcc gccaaatttc cctgctgcta gaacagttag 6901 cgcaatcagc acagcaggaa gttgccccac cacccttgcc tgacttggaa gaaacgctgc 6961 aaaaaattca accgcacctg caagcactgc gtatagcccg catacaagaa ttagctgtaa 7021 atcagggaca tactcccaga cgtcaggcgg taattgatta cagcatattg gatatggaga 7081 ttgaccagat tgtccgccga ttgagtgcta tgcactcagc tatggtgcgc ctgagtacac 7141 aattatagat tttatactaa atttgtaacc aattttcata gttcagccag cagcttgcca 7201 cacgtaatcc caaccgtgtc agtgtttgaa tgaaaacaac agaatccggt gatgaaattt 7261 taaaatactg agttaaactg gcaagcaaga taatcctagt ataaccagcg catctctaca 7321 gtctgggcat cactcaacat ggaacactgg caatttctga tacagaaaca gggcgatcgc 7381 tcttggagtc ctatagaatc cccaaatata gaaattgtag aaggtcggta tagagtttta 7441 gctcgctctg acctgactaa cactgatatg tacgtgcaga tcacccatag ttcaacgatg 7501 gagtttccac cgaagcggcg aattcaaaag cgatcgggtt ctacaaactc tgaaggctta 7561 atcgcagttc ttccctttag ttatctcaag ccaggagttt gggagttatg ttgttccggt 7621 aacttgatgt cgcactttgt cgatcaacct tggcaacatt gtgttgagtt aaaagtccta 7681 cctatattag ttatagaggt gggacgacaa gaggaggaca cttgcgtgcc aagagtcgat 7741 gaggaagaat tttgtctgtt taccgatgaa ttgcttgaag aagaagcgat gatttcgcag 7801 ccgattagcc cagtttggct caaaggtgag aaagtagagc aaattttaca aaacttaata 7861 gagattgctt tacctgattc tcaattggta ccagcaattg aggatttttc aaatcagacg 7921 ctagaaccac cgctggtgtt gaaacttcag gaagattttt acactgttcc ttgggggcaa 7981 actctcacaa tcaatgggcg tgtagagcaa aaagaaacga cgaatttaga tttgagccta 8041 acatcaaatt atgaaagagt ctacgcagga 
gaaatcagaa tagaactgcg ctcaccccaa 8101 aattctaaag ttatcaggcg agtgcggcaa tccttaagcg aaaagttaat accatttcaa 8161 atcagatgtt ccatagaggt tccggctgat tgtgaatcca agctgatttt gggagaaatc 8221 agtttatatg gcatcctgac gatggatggt aaaaccacac tgttaactag ccagtttttc 8281 acaatgacag cagatgtcac agaattactg gcaattagtg ctaaagccaa aaaaaatgaa 8341 ctagacacaa tagaacatca agcagtatcc tctgcagcac tggcgacgtc agttgccaaa 8401 aaactatcca cacctctaga tttgcaactt ttcaatttag caaaaacgcc aaaaaaagcg 8461 aaatcacatc tgttgcagcc atctcccaaa aaatctttgc caccgaaaat tgatccgcca 8521 ttacgtcgta aaccagtagg aatttcacct caactaccaa gttttattcc gcatcaaaat 8581 cggataatta ctcctacagt tgtttgggaa ccttcttcaa agttcgagtc agacagggac 8641 gacagcacaa tcacggttgt tagtaaagtc atcaggatgg agacaacttt accctacctg 8701 agacgactca aaacatccca gaatacaaca aaaggcatca tgcgttatga gtcagttgac 8761 gaacagcagt atacaactga gattcataac gaggatgccg cgaaatcaat tctagaagag 8821 acacaatctc agattgaatc ttttgatgaa gatgcagcaa aattagccac agacaacgca 8881 caacctcaag acacgtttgt caaattagtc attccccata attctcaact catcacgaca 8941 ggaagcccca acatctcgcc cttgattaga aagtggatgc ataatcaagg atactccttg 9001 cctgaaccga ttaatttgca aaaccaagac gacgatattt atattgtcgc cagccaaaat 9061 cacgtatctg atgaggttaa caaagaaacg caaacacatg cagatgctaa tgatttagaa 9121 cgcacgcaaa tcgaggaaca ggttggtata gagactcaag aggacaagac aattcaaaat 9181 atcctctatc cctccggtac gcctgcggcg aaagggtacg caaagcgaat aaaatatccc 9241 cttttggcac gcgaagcgaa caaaatatcc ccctcaccat ctccccaacc ttctaactcc 9301 cttgatggca gaatattgtc cgctaggttg gtacacgaaa ttgttgtcga ggatatattc 9361 gatgaaaccg aacctaagac tttcaaaaat caaccttcca aacaaaaaga agaatcagtc 9421 tcagatgtat ctgttggttt acctgtacta gcagaaatca cggaaccgtt accagttccg 9481 caactgcatt tgccatcagg ggaactgatc tctggaaaat ttgtgaggat atgcgtccag 9541 ttagcccctg tagctgatga agtggctgtg aagttatgga ttttggattg ccaaactcgt 9601 gggttgatag gtgaaccgcg tttgttaaca aatttacgac ttaaccctgc tggagtgttg 9661 gaagaaataa ctcatctgag ggttcctttt gggtgtttgg aaatttgctt ggaggcgatc 9721 gctgtgaaca agacgactca gcaagaaagc cacaaagtct 
ctatcctccg gactgtgatg 9781 cccccagact tacctaggtt gcagctagag gaagttttcg gtacgtgaac agtctttctt 9841 ttgaacggtg ctgtgttaaa aatctttaga atttgaaaaa tcatgcaaga tttgcagtaa 9901 gctcattttg aattgttgtt tgattggata ggaatatttt atgctcacag cttcattttt 9961 acctctcatc cttacgtacc cagcttcata tctctcttcc atttttgttc cagtcattgg 10021 ttgggttctt ccaagtgtaa cttttgcgtt tctgttgtta tacattgaac gcgacgatat 10081 tggctaaatt cgtcaaccga cttcatcatt tggcactccc acgcctccac gaggaataag 10141 gcgtccttga tttcagctga gtcccataga aattaccaaa aacttaacca ccctttctac 10201 ctctgtttgc tatgcaaaaa attggttgaa agggtgtcaa ataacttggc tgtgccaagg 10261 ttgcagcact ctgcaggttt ttggtagctg cttcccttaa ggaggcagct gttttttatt 10321 ggaaccaatt gaaactctca cttcccaaag cccttaactg tttggttttg actctttcct 10381 ttgggcttag atttactaac ttccatcaaa gcttcttgga ataaattatg tagatgtatt 10441 ggtacactgg caatttctgg taattctggt tctataggtt tgttgtattc ttgcaagagt 10501 gtatccaagt cattttgaag ctgaaaattt ggtcgttgaa gaaatgtttg cagtactttt 10561 tctaattgcg ttggatactc ctgggctaaa cgatgccaaa taaaagcatt aatactttta 10621 tcttccagat aaaaacgaat cagtttctca gcaccttcta cactttgcca atcttctgct 10681 tctaatattg tcttgatttt agagtaggtt ggcagaaaca tctgtcccca tcgaggatga 10741 gatatagctg ttacctgttc tgcttttctg agttcagcag gtaaatcaac ttttggcatg 10801 accattttac taccgctgtt tttcttgtca aacatttgtg aaactgcttg ggagtcagca 10861 ccaaattctt ctgcgactgc tttgatttct tcttcaccta aaccagcttc ttgtgcaaca 10921 tctcctagtg atttggaagg atcaatgcct gcttcttcca aacgtttttt cgttaatatt 10981 tcttgaaatt cggcaatctt tttgttgagt tggtatcctg ggagtgtgac ttcatcagaa 11041 gcaaaaaaat ctaaaaactg ctgatgatag attcctacag attgccaagc ttcctctagt 11101 aagtctggag catctccata aagattctct ttgtaattct ctttaaaatt accaatagct 11161 actgctagtt ttggtttgcc caatttaccc atctgagtgt aggaactgaa aaaagtccag 11221 tagctatctg tgataggaga aatgcgagtc agtaaaattt ctccttcttt caaccgagac 11281 atttcgttta gtttttggga ggtgtttggc ttgacaatat agtatttgtc tgtcaaccag 11341 ttccttaatt caaagccatc aggtaagatt tttgtaatag caaataagcc actaaaagtg 11401 cgatgccaac ttttgataag 
ttcgcggtca accttagaca actcttgata gctttggata 11461 aacaaatcta atggtgagga gtcgccgaca cgcccttgtg tgatgaatga gtcaataatc 11521 agttcttgtt gaaaattatc accgcttcca cgacgtgact tgacagcagc atagctttcc 11581 aaagcttgtg caagttcgcc ttcagcgtca tagacaaaat cagtgaggtc ttgtttgagt 11641 tggtgcgatc gctctaatat aacatccaca agttaatctc tcggactcga ttgttgttgt 11701 acgttagagg ttagcagaga atttctgcat ttggctattg caggatatat tcggatattt 11761 acgtgtcaag atttggttga cagactacta gccatgtagc atataatttg tacccttgca 11821 caaaaagaac gcgatcgcta tgggcatgac ccttgtagaa aaaattttgg cgaaagcctc 11881 tggtcgttcg gttgtcgaac cgggggaaaa tatctgggtt aatgttgatc ttttaatgac 11941 acatgatgtt tgtggacctg gaaccatcgg cgttttcaaa cgcgagtttg gtgctgacgc 12001 caaagtctgg gatcctgaaa aaatcgtgtt aattcccgac cattatattt tcacggctga 12061 ccaacgcgct aatcgcaacg ttgatatttt acgtgatttt gccaaagagc aaagtataaa 12121 atacttttac gatattactg acctgagcaa ctttaaagcc aacccagact ataaaggtgt 12181 ttgccacatc gccctagccc aagaaggtca cacccgccca ggcgaagttt tatttggtac 12241 tgactctcac acctgtaacg ctggtgcttt cggtcaattt gcgactggta tcggcaacac 12301 agacgctggc ttcattatgg ggactggtaa gttgctgatc aaagttcctg ccaccatgcg 12361 ttttgtgttg gatggtgaaa tgcctcctta cttgttggca aaagacctga ttttacaaat 12421 tattggggat atcagcgtcg ctggcgcaac ctatcggacg atggaatttg caggggaagc 12481 cgtcgaacga atgacgatgg aagaacggat gactctgtgc aacatggtga ttgaagctgg 12541 tggcaaaaat ggtaccattg ctcctgatga aaccacgttt gagtatgtgc gggcgcgtac 12601 cgataagcct tttgaggcag tttacacaga ctcagatgcc aagttctaca gtgagcatca 12661 ctacgatgtt tccaaactgg aaccagttgt tgccaaacct cactcccctg ataaccgcgc 12721 cacagcacga gagtgcagcg atgtcaagat taaccgagct tatatcggtt cctgcacggg 12781 tggaaaaaca gaggactttt tccatgcagc acaagttctc aaaggtcata aggtgaaggt 12841 tcccacatac atagtccctg ctacccaaaa agtttacgaa gacctattta aaatcaagta 12901 ccaagagcaa accctctcag aaattttctt agaagctggt tgcattgaac ctgctgcgcc 12961 ttcctgcgcc gcttgcttag gtggtcccaa agacaccttt gggcgcatga atgagccaga 13021 aatctgtgtt tccaccacca accgcaactt ccccggacga atggggcata aagaagcggg 13081 
aatttatctt gcttccccct ttaccgctgc agcttccgca ctcactgggt atgtgacaga 13141 tccgcgtgag tttttgtaag aggacaaggg gacagtagga caatacggtt gctgattaag 13201 ataatttgca agaacgataa cctctgtaga gacgttgcat gtgagtccag cgctgcggga 13261 gggtttccct ccgcaggcga ctggcgttag gcgaagccgt gccgcaggca tacccgaagg 13321 gcaacgtctc tacacgtgtc aaatctctcc ccctatctcc caattaggtt aattaattcc 13381 actgagatga taattctcgt aatgggtgtc tctggttctg gtaagtccag cattgggcaa 13441 ctgttggcgg actctttaca ctggggattt agcgatgctg atgcttttca ctcgccagaa 13501 aatattgaga aaatgcggca tggcattccc ctaaatgatt tagatagggt gccttggcta 13561 ctggcgttgg aacaggcaat acaacagtgg ttgcaggaga ataaaaatat ggtgctggcg 13621 tgttctgcgc tcaaagccag ttatcgccag gttttggttt tggatgagga acgcgtcaag 13681 gtggtttatc ttaaaggacc atttgagttg attcaaaagc ggctgcaaca gcgtcatggt 13741 cactttatgg gtgaaaaact tcttaaaagt cagtttgata ctcttgagga gccatctggt 13801 gcggttacag tggatgtttc tgagccacca gaggtgattg tgcagaagat tagagtgagt 13861 ttggggattt aggcgagttc atgggagaag ttaggcccag atcatggcaa gacagagatt 13921 gctggtgata actgcttaag acataactgt tgtattcctc ttctctaccc aaaacctgcg 13981 ctaacgttcg caaatttagc cgcttgagta attcaattgt tattggtctc ttatcactcc 14041 aaaaaatcat tgattctaaa aactcttttg ctggttgcga atttagcaaa gcagcaataa 14101 aagatgcttc agatttagac ctacatgata aaaagtaaac agtatcgtca aaaataacag 14161 gtttgccgtc aatttgacct acttgcacaa aattcagagt tttgtaaaag ccagatattg 14221 ccactttcca aggtgcaaaa gtatattccc caacaccaaa tatagaaaat tctggacgat 14281 tccgataaat agaactgcta cgcttcaaca aatactctcg ataattctga agataaaccc 14341 atattttagg agcatcaaac tttaagtgtg ttgtatcctc cccaatataa cgctgagtaa 14401 caagaacata ttttctacaa gtagatgtac gactatttcc tacatctgaa cttttcaaaa 14461 acgggtaaac ccaatcttgc tcaagagaag aaacataacc atatccgttg taaaatttgt 14521 cttcttgtcg ttcaagttcc atgaccttgg aacaatcatg tttcaagcca gaacgccaaa 14581 tataagctgg atcagttcct cgtagatgca accaacgctt gtaatctatt acattagaaa 14641 taactgtatt atcgtggtaa ccaagaactt gtaatggttc gtgagcagat aaattctcga 14701 atatctcgca agtatttgat aggttcccat ctccaatcaa caaaacaaaa 
aaacaggcat 14761 ctacagatgc attaaaattc ttcagtgtat ctattccata tatttgagcg cggataactc 14821 tatgcttctg ttgccaaaca tgaacaagaa tttttcgggc gacagcagtc ttacataaca 14881 ctgcaattat gcctgtgcgt tctttcagcc aatctaaatg tttgatcaac atccattcgg 14941 aaatatcaaa attacttttg ccagttaatg cctcatatcc gcttctgcct tgaaaatttg 15001 atttttctgg taagttagaa ctttttaata aacttagttc agaactcgtt acccatggag 15061 gattaccaat aatcagtaat ggttcggcta atgtgttaag aatttctgac caatttaaat 15121 caaaaaaatt accattaata atttctattg atgcagaata gttttctgct gcaacgcttt 15181 ccttaagata gcttagatgt tgtgtattaa tatcaacacc gacaaatttt ttcacttgtg 15241 aaaatacttc acaagaagct aacaaaaacg ctccgcgacc acaagtaggc tctaaaacag 15301 acctgggtgt cacacctagc ctgttaacga catttgtcgc ctctaacgct aaagctattg 15361 gagtttgaaa atccccaaat tcccatatat tcttagcttt ttgttgcgcc attttagttc 15421 atcgcactct aataataccg ttaacctcgc cagccatttg tataatacgg ctatattgaa 15481 gtcgccattg tagagcattt gagattgtta aataaccttg ctcaggtgga ttgcgtagta 15541 tctcttcggc aatattacgt gcttcaattt catcaactgg tagattgcga tccatgatga 15601 aggcaacaag gtcgtcagca ttaccttcat ttgcaagaat gctaagtata ccacgagtca 15661 tttgatagtc agcagtacgc ctttgatcaa tgaagacagt atgctgcata ttgagtctac 15721 cagttttatg ctgatgatca tctctctttt cgtaaacaaa aatcagcaaa ccatacccaa 15781 gaccaaaaat tttttgccga gcagatttaa aaggacatga agattgaggt tgccggatgc 15841 tggtaacttt tatatctata tttaagtctg gaaaatctat tccagaagca gaattaccca 15901 gcacatactc atacaataaa gctaaataat tccgaaattt ttgctccagg taagttccta 15961 cagcctttcc atcagtaaca ccaaacagca atggctcatc gtagatactt tctatctcag 16021 caaactttac agcctcctct agaagaatat caagtgtcag cgaaggcttc attgttcccc 16081 cttctccaat gttcctccgg ctcataaatt ttttctttac ttaaatattc ttcttttata 16141 ccaagtttca aggttggaaa agaatatttt ggtgagatag gagtcatcaa gacttttact 16201 tcatctcctt gtttgagatt atctggtaaa ggttgtacta aactcatatc ccagctacct 16261 atgctaaccg ctcaaagttg atgcggtcat tgtgcagaag attagagtga gtttggggat 16321 ttaagaaacc aaaggttgac gccgccgctt aataaaggtc ataaaagcca ctcctagcat 16381 cagcagtgcg atcgcacaaa ccccatgcca 
ttcccagcga ctccaagcat atgcactcaa 16441 gaacgaaccc aacgctcctc ccataaagga aaaggtcata taaagcgcat tcaaacggct 16501 atgaaactct aagggtaaac tgaaaattct tgcctggttg gaaatgtgtg tgctttgcgc 16561 acccaagtcc agaagaataa ccccgacaat tagccccagg atgtggtaac caaacaccca 16621 gaaaatgaga aatgacaagg ttgtaatcga aagacccaag gccacagtta tcttgggatt 16681 ccttttatct gctatcttgc ccgcaacagg agccgctgtc gctcctacca caccaaccaa 16741 cccaaacaac cccgtcacct cactaccgta atgatagggt ggctgtgcta aaaagaatac 16801 gagagtactc caaaaagcac tgaaggctcc aaaggacatc gcccctgcaa gggatgcttc 16861 acgtaagata ggctgctgtt gtatcaactt gaataaagat ccaaccaatt gaggataaga 16921 aactcgcagc gacggttgac tttttggcag catccgtaat aggacaacag ccagaacaat 16981 catcaaccca ctcgccagcc aatacatcgc ccgccagccc aaacttgctc ctacaaagcc 17041 gctcaccgtt ctagctaaaa gaattccgat caacacccca ctcatcaccg taccaacaat 17101 tcttcctctg tgctctggtt ttgccagttg tgctgcaaaa ggaattatca gctgagctga 17161 aacactagtt atgccgatcg ccaaactcgc cactaacaac caagcaaggt taagtgatgc 17221 tgctgctgct cccaatgcta cagatgtcaa caccagcatc atcaaaatta accttctcct 17281 ctccatcaag tcgcccaatg gaacaaggag gaaaagccct acagcataac caatttgcgt 17341 caacatggga atccaccctg tcgcatggtt agaagcttga aggttttggg caattaccac 17401 caacagaggt tggttgtagt ataagttagc aacgacagtt ccacaggtga ttgccattat 17461 caagactaga ctacgcccca aaggctcttg gctagttcta tcagtagatg cactctttga 17521 gtatttgtta acacttccca caggagaaaa ctccaattac gtgtaaaaca gtctgtttat 17581 atttcgctat caaaaacgga actatgcaag agctaaaatc tagctatatc gtatttttga 17641 gtagatagta ttattcattt cttaatatgg caggacacag taaatgggca aatattaaac 17701 gccaaaaagc ggtcgtggat gcaaaaaagg gtaagacttt cacccagtgg tcgcgtgcga 17761 ttattgtggc agccagaagt ggggttccag atcctgtagg aaattttcaa ctgcgcacgg 17821 caattgaaaa agcaaaagcg gcgggtctac ctaatgagaa cattgaaaga gcgatcgcca 17881 aaggtgcagg tacactctca ggcggtgctt ctagtttaga agcaattcgt tacgaaggtt 17941 acggtcctgg gggagttgcg atcctccttg aagccttgac agataatcgt aatcgtactg 18001 ctgcagactt acgtgttgct tttagtaaaa atggtggtaa cctcggtgaa acaggttgcg 18061 tcagttggat 
gttttcccaa aaaggcgttt gtactgttga gggaatcgtt gatgaagaac 18121 agcttttaga agcttctctt gaaggaagtg ctgagtttta tgaaatgact gaagaacaga 18181 ttgctgaggt atttactgaa gtggttaatt tagaaaacct gagccaaact ctcaaagaaa 18241 agggctttaa agtaactgat gccgaactac gctggattcc tggtaacaat gtagaagtca 18301 ctgatccaga tcaggcacgc tcacttctga agttaattga tactttagaa agtttagatg 18361 atgtacaaaa tgttactgct aattttgata tgtcagaaga attgatggca gtgatggctt 18421 gaactttgat atgacatctg acttataaat tttcaagagc ggtggaagta gaacaaatta 18481 ggtactggca accgcaaagc ttagaagatg tgattgttaa ttatgagatg aaaaaatgct 18541 gtcagcttgc aacttttgtc tgcaaggata atttgcaggc tttgagatgt caagcctgca 18601 aaaatcgaca caaaaactcc acctcaacaa catgagactg acataacttt ctgaaatact 18661 tagattgttg attctcagat gaaaattttc acaggaacaa gtttgctaaa gctttactca 18721 tcgctgttgt gtttgctgcc cctctagccc tgagtgcaac tgtggtaagg ttacaactgt 18781 tgctgccaaa cctgttactg gtaccaaagc tactaaagtt catcatcaga aacatcatgg 18841 aaagcacaaa ggctcccata ctggaaacac tatgaaaaaa atagttactg ttaacgtata 18901 ttaagttgta gtagcctgcg aaaacttgaa ttctaattcc tcctcatcaa gacatataac 18961 acaatttaac cccgttgcgt tcctcaagct ccagcggggt tttttctatt tagcaaatac 19021 ttgaccaaag taacctaatc tgttattcag ttaactcaaa aaaggcaaaa atttacacaa 19081 aaactccaca tcaataacac gacaatgtca catcttcttg ggatagttaa atggttgatt 19141 cttgagtgat aaaagtttca catgaccaag attgcaaaag ctttacttct gtctcttgtg 19201 gttgctgctc ctgtagccct aactgcacct gctgtaaacg cacaaatgtc ttctcccgcc 19261 actacaacac ctgctacaac gacaccaact actaccaaac ctagtaatac tactaacacc 19321 cctaaaaaga agaaatctga tactactaag aagggaaaaa agacttcccg tcgtcctagc 19381 aattctaact ctgccacaac acctacaggt agtacaccca caacatcacc ttctagcact 19441 ccatccacac ctcctgcttc aaccccagct caataagcaa tagctaataa gcgggcttaa 19501 tcaggcaagc atctcatcaa aactatatat cctcatgctc atcaaacttt tgctcgctgt 19561 ttataatagt gagtattttt atttatatat ttacttcacc ccaacttgaa catctcctca 19621 acttgctctt cctagccact tgcgtcaaaa taagaatgaa aacgctgtaa cttatcttaa 19681 caatggcgca agcgcagctt caaaccctta accctcccca aacaccgaca catacgcggg 
19741 gatatgatta tgatttagtc atcgtcggcg gtgggattgt tggcttaacc ctagcctctg 19801 ccttgaaaga ttccgggtta agtgtgctgc tggtggaggc aaaagtagaa tcagcggcgg 19861 tagccaaagg acaggcttac gcggtgcata tgctttcggc gctcattttc caaggaattg 19921 gaatttggaa caaaatattg cccaacattg agacttaccg ccgtgtgcgt ctttctgatg 19981 ctgattatcc cgctgtggtg gaatgggaaa ccggggatat aaatacaaaa gatttaggtt 20041 atgtggcaga acatcaagca ctgttgcatc ccttacagga gtttgtcaaa aattgtccca 20101 acgtaacata tttgtgtcct gcccaagtgg ttaatatcca gtatcagcag gacgtggtaa 20161 cgatggatgt taaaattgca gaccagatgc agacggttcg cactaaacta gtcgtagccg 20221 cagatggatc gcgttcccgt atccgtgaag ctgctggaat taaaacccgt ggttggaagt 20281 attggcagtc atgtattgtt gcctttgtca aaccagaaaa accccacaac aacacagcat 20341 acgaaaaatt ccagtcaagt ggcccgtttg ccattttacc tttgccagga aaccgttgcc 20401 gaattgtgtg gactgctccc catgaagaag caaaagcttt gtgcgcgttg gatgatgagc 20461 aatttttaag agaattacaa gcgcgttatg ggaatcagat ggggaagttg gaattgttgg 20521 gcgatcgctt tatttttcag gtacaactta tgcagagcga tcgctatgtt ctccaccgac 20581 tcgctttaat tggcgatgca gcacacaatt gccatcctgt aggcggacaa ggtttgaatt 20641 taggaattcg cgatgtggct gctttggcgc aagtgttgca agaagcggat gctcaaggtg 20701 aagatattgg taacattcaa atcctcaaac gttacgaacg ctggcgtcag cgggagaact 20761 tgacaatttt aggttttact gatttattag accgcatgtt ttctaacaac atcttgccag 20821 ttgtgatagt tcgtcgcctt gggttatgga tgatgcagcg agtacctatg gtaaaagtat 20881 ttgctctcaa gctgatgatt ggtttgaagg ggaggactcc taaattagca cagcattgaa 20941 gaaccgcttc tttcctgtta agagtttcct cttaagaatc attcaggcaa gcatcccaaa 21001 tgtttaaatt tgtggtgtat tgtagtgcag gcttctagcc cgctttggtt taaatttggg 21061 agcaagatgc ttccacgact cgtcttgttt cacgcattgg gatactccca tcattcaaga 21121 ttgctataac catttgtctt tagacaaaac aaagacaaat gtaaagaaag tcatgtaaaa 21181 attatcgctc agttatgagt cactaagctc ttttaaatac tctgaccata aattattagc 21241 atggctattt gtcctatttt gacatcttcc attagaggga taaatattta aatatgagtt 21301 catttaagac tataacaatt gatagcattc ttctctgaat actagtatta ttgatgactg 21361 actttcctct gagaattgtt tatttaagta aaaaaaataa 
cgaattgagg ctgtcatgag 21421 tgatttatag caacggacat attggttagg acatcatatt ttgatctcaa gggaacaggg 21481 aacgcttaac agggaacaga gaaatgtcct aacaattgtg gcgactgcta tagtaagagt 21541 tttaggcgaa catcggtcta tttgtttcta ctgaattact tagtatgaat gagtgatttt 21601 taaaagcctt atggagtcat tttaatgaca aactcataaa aaaaaataag tgtttactgt 21661 taaaaaaaag ctttgagatg aaaagtagca tgtcttagtt atttaacaaa gattacttac 21721 gcaaagattg tgtcaataat ttctaaaagt tcaaatttac atctcatctt tgttcccgaa 21781 gataaaaaat acaaaagtta aaactgaaga cgttgaccac cagtagtgtt aaagtggcaa 21841 gaaattcatg tgaaattttc agcgggaatc aatgctgaag aagctacaga caaagcatat 21901 ttctctcata gtgggtgccg gactatgtgc ctttttagca ggggcaatcg tgtcagctcc 21961 cgagatagga aaatcagtaa gtcaatggct taaaccgcgc aagactcagc ctgagaagct 22021 ttcggaagaa agcaaagcga aatcacccgt ttttgccttg gtatcacaat ctccgcaaga 22081 acgtggtgca aaactacaag cattaaagga ggcgtcaaaa ccagcagatc aaaatcgcgc 22141 tcgttatctt ttggcaagtg acttaattga acaaaagcaa gctcaacctg cactgaagct 22201 actagatgga ttagacagag attatccagt ccttgcgcct tatgtgttgc taaaacaagc 22261 gcaggcacac gaaattctag gacaagaagg caaagcctcg gatctgaggc agaaagtgct 22321 caaagactac cccaaagaag cggctgctgc taaagccttg tatctcatcg gagtcccgga 22381 gtatcaggac aaagcaatag ctgaatttcc ttctcatccg cttacctggg aaatcatccg 22441 taagcggttg gagacaaatc ccaatcagcc gcaattgcag ttagttttgg caaaatatgc 22501 ctatgatcaa ccaggaactg tacctgtggt ggatcagctg gcgaagaatt ccacccttaa 22561 acccgaagag tgggacatcg ttggtacagc ttactgggaa aacaatgaat ttggcaaagt 22621 aacgacagca tacgccaaag gaactaaaac accccgcaac ctctaccgtg ttggcagagg 22681 attgcaggtg agtggaaaac gagcagaagc ccttgctgtc tatcaagaac ttgtaaaagc 22741 atttccggaa gccagagaaa ctagcactgc tttgttacgc ttagcagaaa tggcaacagt 22801 acgtaaagat gctttgccgt atcttgatca ggttattact aaatttcctg acaaagcagg 22861 tacagcgctg gtagaaaaag tcaaaattct tcaaactcaa gatcaaaaag cagcgagtga 22921 agctactaag ctcctgatta ccaagtacgc taactctgag gaagcggcag aatatcgctg 22981 gaaaatcgca caacaaaaag cgaaagccaa agattataaa gctgcatggc aatgggcaga 23041 acctattgcc aaaaataacc 
ccaacagtat tttggctcct agagctagct tttgggttgg 23101 aaaatgggca gcaaaactcg ggaaaaagga ggaagcaaaa caagcttatg aatataccct 23161 cagcaatttt ccccattctt actattcctg gcgagcaggg gcaactttgg gactcaatgt 23221 tggcaatttc aactccgtgc gccagatgaa cccaaacctc gtcccgttcc agcgtccggt 23281 gccaactgct ggttctgaaa catttaaaga attatacctc cttggtcaag accgcgatgc 23341 ttggttgcaa tggcagaccg aatttcagaa caaaatgcaa ccaacagtgg cggagcaatt 23401 tactgagggt ttgatgcagc tggcaaaggg agaaaatatc ctcgggatta atacagtttc 23461 caaattggaa gaccgagaaa cgccgcaaga acaagcacag taccaggctt taagcaaaca 23521 agtcacttac tggcaagctc gttatccctt cccctatcaa aaagaaattg agcaatggtc 23581 ccaaaagcgt cagctaaatc ctctgctagt gactgggctt atgcgtcagg aatcacattt 23641 tgagccaaaa gtccgttcta cagctggtgc ggttggttta atgcaggtgt taccaagtac 23701 agctaaatgg attgccccac aaattcagtt agatagcaca aaaataaatt tggaaaatcc 23761 caacgagaat atcatgtttg ggacttggta tttggatcac actcatgagc aatatcgtaa 23821 taattcgatg ctggcgatcg ccagttacaa tgcgggtccc ggtaacgttt ccaaatgggt 23881 acaaactctg ccaaaggaag atccagatga atttgtagaa tcaattccct ttgatgaaac 23941 caaaaattac gtgcgtcagg tgttaggtaa ctactggaat tacttaagac tgtataactc 24001 agaaacttcc cagttggtag gaaaatattc gactgtacac ccacaactac ctgttcagta 24061 attttcaaaa gtagagacgc tttcattgcg cgcgtctcta catccacctg cttcccagat 24121 acaatagtca cagatgacgc cttctcgctg actcatgaca ccttcacaaa acgtagacac 24181 aacaacatct cctagtgttg actttgaagt agatggtgac tcaaaaccgg aaatagatcc 24241 tatggatcct cttgatgagt tgcccggtga agtcgaaatg tcaattttcg ctcacttaga 24301 agagttacga caacgaattt tttactcgct cattgctgta gcagtgggtg ttgtcggctg 24361 ttttcttgcc gttaagccga ttgttcagct gctagaagtg ccagcacagg gagtaaaatt 24421 tttgcaactg gctcccggtg aatatttctt tgtttccatg aaagttgcag gctacagtgg 24481 gttgcttttg tgtagcccgt ttgtgctata ccaagttatc cagtttgttc tccctggact 24541 cactcgtcgt gaacggcgtt tgctgggacc tgtggttttg gggtcgagtt ttttgtttgc 24601 gggagggtta gtctttgcct accttttgtt ggtaccagct gctttgaaat ttttcatcag 24661 ctacggtgca gatgtagtag aacaactgtg gtcgattgac aaatattttg aatttgtctt 24721 
gttgctgcta tttagtactg ggttagcatt tcaaattccg attatccaag ttctacttgg 24781 tgctttggga atagtctctt ctgggcaaat gctttctggt tggcgctacg ttattatagg 24841 cgcggtggtt ttaggagcaa ttctcacacc ttcaactgac ccctttaccc aaagtctgtt 24901 agcaggggca gttttgggac tttattttgg tggtattggt ttagttaagt tggtagggaa 24961 atgagggtaa gaaactgtaa cttaattcct gctcacctct ggtcataact catctttttg 25021 aagaggttct atggcgttcc gccacgcttc gctatcagtc aatacggttc agttaaggct 25081 gaaaacgtta ttgaggacac gaaaagaagg agct // LOCUS NODE_1217_length_24899_cov_5.03852024899 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 24899) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24899) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24899 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(397..1143) /locus_tag="DP116_10780" CDS complement(397..1143) /locus_tag="DP116_10780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferritin-like domain-containing protein" /protein_id="PRJNA477356:DP116_10780" /translation="MNFLTYVMHLVGSGAMAYYSALQIRDLKTRPNILAGFQLAESGS VPFLTKLGDRAAAEGDTWLAEKLAKHASDETRHGQIFAHALKQLNKRVIDFKSSPQNT EEKKTQQRRSPFFAAYYEGYSQEQLKPETIDWDVFMASTYILELDASKDFTRMANVLP EDDPIARNLKQGMLSIAKDETGHAAYLYEAMTRRMSASKVQQLVDEWRTRKVNALFAF AGNMLQGNDNRSSLVQDGAPSESESELIVA" gene complement(1359..1625) /locus_tag="DP116_10785" CDS complement(1359..1625) /locus_tag="DP116_10785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996886.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="four helix bundle protein" /protein_id="PRJNA477356:DP116_10785" /translation="MPILLRVIASHFLSPPANIAEGYGRRTRGEYIQFLYIAQGSLKE LETHLLLSIRIELASSATITPVLNQCESVGRLLLLLIRSLENKG" gene 2897..3282 /locus_tag="DP116_10790" /pseudo CDS 2897..3282 /locus_tag="DP116_10790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015364308.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(3386..6064) /locus_tag="DP116_10795" CDS complement(3386..6064) /locus_tag="DP116_10795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756836.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA helicase" /protein_id="PRJNA477356:DP116_10795" /translation="MNYPALSPELEPSSIFPFELDQFQKDAIASLNAGRSVVVCAPTG SGKTLIGEYAIYRALSRGKRVFYTTPLKALSNQKLRDFREKFGFDHVGLLTGDASINR DAPILVMTTEIFRNMLYGTPIGQIGISLVDVEAMVLDECHYMNDRQRGTVWEESIIYC PREIQLVALSATVANSDQLTDWLNQVHGPTDLIYSDHRPVPLEFHFGNIKGLFPLLND DKTQINQRLLRKKKKGEKNKSKANVRAEAPSMIQVLSQLQERDMLPAIYFIFSRRGCD KAVAEVGSDMWLVDQEEAQQLRWQIDEFLSRNPEAGRSGHVGPLYRGIAAHHAGILPA WKVLVEELFQQGLIKVVFATETLAAGINMPARTTVISTLSKRTDSGHRLLNASEFLQM AGRAGRRGMDERGHVVTLQTPFEGAKEAAYLATSQSDPLVSQFTPSYGMVLNLLQTHT LEEAKELVERSFGQYLANLHLRPQREYITHLQTELAQLQAQIAAIDEQDLAVYEKLRQ RLRVERLNLKMLQEQAQEARQDELGMMLDFAVSGTLLSLKSKNFTMPAPITAVLVGKT PGSGQTCYLVCLGQDNRWYVATASDVIDLFAELPRIDIPDELVPPPEMPLKRGQSLLG DEYTALITQEIPEASEAVFMPPEVLEQLRRVTAVQEQLEAHPLHQSGDAVSLFKRRTQ LVELEAEIQLMQAQIEQQSQRHWEEFLNLIEILQHFECLDKLVPTRLGQMAAAIRGEN ELWLGLALESGELDNLDPHHLAAAIAALVTETPRPDSMVRFDLSEEVAEALSKLRGIR RKIFQLQRRYNVALPIWLEFELIALVEHWALGMEWIELCENTSLDEGDVVRILRRTLD LLSQIPHVPHLSQSLQRNAHRAMQLIDRFPVNEVGG" gene complement(6268..7083) /gene="proC" /locus_tag="DP116_10800" CDS complement(6268..7083) /gene="proC" /locus_tag="DP116_10800" /EC_number="1.5.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742426.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyrroline-5-carboxylate reductase" /protein_id="PRJNA477356:DP116_10800" /translation="MTTIKFGLIGGGVMGEALLTRLIARKIYQRSEVLVSEPQTSRQG FLEDEYGVNVTADNRLVFTPTTEVVFLAVKPQVFSAIAQELADVITTVSKPLVISILA GVTLNQLEAAFPHFPIIRAMPNTPATVGAGVTAICPGAYTKANHHETAKLLFSAVGEV VEVSENLMDAVTGLSGSGPAYVALAIEALADGGVAAGLPRTIANQLALQTVLGTAKFL HETKIHPAELKDRVTSPGGTTIAGVAQLERAGFRSALIEAVKAATARSQELGK" gene complement(7311..7904) /locus_tag="DP116_10805" CDS complement(7311..7904) /locus_tag="DP116_10805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cell division protein SepF" /protein_id="PRJNA477356:DP116_10805" /translation="MNNIFSKLRDFVGLNEQVEYEYYEEEPETESYRNLYQEQNAQQP VAQEPPEQNRRWREPVPTMENVSAAGSKPMSNVIGMPGVINGISEVLVLEPRTFEEMP QAIQALRERKSVVLNLTIMDPDQAQRAVDFVAGGTYALDGHQERIGESIFLFTPSCVQ VSTQSGIIHEVPQPTVRPSRSTGPNPAWGNEVNRMAQ" gene complement(8092..8277) /locus_tag="DP116_10810" CDS complement(8092..8277) /locus_tag="DP116_10810" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10810" /translation="MVLLNTLLQYILRSKTLYKTLLKKISEDTGSIGDKGYSIDKSKS QYNIRGKQLPKNLLVKI" gene complement(8320..8991) /locus_tag="DP116_10815" CDS complement(8320..8991) /locus_tag="DP116_10815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860525.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YggS family pyridoxal phosphate-dependent enzyme" /protein_id="PRJNA477356:DP116_10815" /translation="MSSSISERIITLGSSLPDSVRLIAVTKQVSAEIIRCAYAAGIRD FGESRIQEAASKQAVLQDLPDITWHFIGHLQHNKAKKAIEQFQWIHSVDSLKLAQRLN QLAQELKITPQVCLQVKIRPDPNKSGWSASQLLADLPALDQCENLQIQGLMTIPPLGL NDYEVISVFNSTSKLAKEIREQNWSNVQMHHLSMGMSGDYKLAVQAGATMVRLGTILF GERSV" gene complement(8988..9266) /locus_tag="DP116_10820" CDS complement(8988..9266) /locus_tag="DP116_10820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006199162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3539 family protein" /protein_id="PRJNA477356:DP116_10820" /translation="MNLDNSETYINHPTWGLLYRICMVDENQELFTTLYAQRLFFIVT TDVKGMKFQSIGRTEARMMIENRLRSLRRTGQSQEYDQLQTVFQRTFQ" gene 9280..9471 /locus_tag="DP116_10825" CDS 9280..9471 /locus_tag="DP116_10825" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10825" /translation="MNIYIFAAEEKHFDDKGRISPNKIFAKMPKTPKLRASSTLPTEV GSGACAVARVFCGGGYESR" gene 10154..10225 /locus_tag="DP116_10830" tRNA 10154..10225 /locus_tag="DP116_10830" /product="tRNA-Val" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:10186..10188,aa:Val,seq:cac) gene 10482..11372 /locus_tag="DP116_10835" CDS 10482..11372 /locus_tag="DP116_10835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319063.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphonate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_10835" /translation="MKTMTSRYFLLIHLLVLIGLLGTGCNNQQPDINLEKLTVGVVSY GEGTVSIDKYERFKDYIAKQTQSIVELEPAYNELQALEQIQRQNWEIVFAPPGLAAIA IGKQLYVPLFSMEGTRGRQRSLLIVRDDKPIKTIPDLANKTVALETAGSAAGYYVPLY DLYGLTLAQIRFAPTPKTVLQWISEGSVDAGALSQTDFELYQREFSTTKFRILHTSRS IPPAVVLLAPTVERNQQQQIQKAMTQAPGDIIADAGYVPAAKVPSYEQFIKLVDKVRP LEEQVKKTPAVLLNHQSTSQ" gene 11686..12942 /locus_tag="DP116_10840" CDS 11686..12942 /locus_tag="DP116_10840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872545.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_10840" /translation="MSQSPIVKAEIPSGTMIDNRYIIQKLLGQGGLGRTYLAFDTRRF NEACVLKEFAPTGSGENALEKCRNLFKREARILHQLEHPQIPRFLACFEGDGRLFLVQ EFVDGKTYSLLLRERQDLEESFSENEVIQWLKNLLPILEYVHQHNIIHRDISPDNIML PDGKNLPVLIDFGVGKQIANLNEDTTPYHMSFVGKMSLVGKVGYAPREQISLGLCSPS SDLYALGVTAVVLLTGRDPSFLMDQYSLEWRWRSYTRVTNDFARVLEKLLADTPKLRY QSSKEVLTDLDRLGQFQVMRQSTVVDLPTTVIEQKSQPTFPTRQPPNQYPETIASSAR SSSRQQQFPVIESQPQPQQSSLNPAFIQRCQQQLAYYIGPIASLVVEEILAETPQISP YQFIEFLAREIPDASAALEFRKHLFS" gene complement(13030..13953) /locus_tag="DP116_10845" CDS complement(13030..13953) /locus_tag="DP116_10845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319065.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_10845" /translation="MLKNLKLRQKFTILLVIILVVGLSFSGLALSTLLRQNAINEIAS RALTLIDTMTSLRGYTLTQIYPELADKLEEKFLPQVVSAYSAREVFEILRKKPEYRDL FYKEAATNPTNLRDKADSFETTILEGFQQDKNLKEVSGFRSLPGGDIFYIARPLTISQ ESCLKCHSTPDVAPKTMIERFGAVNGFNWQLNQIVAAQFISLPASKVIEKANQSSLLI VGLVSTIFIVVILLVNVFLQRQVIIPLKRITRVAEEVSTGHLEVDFDQISNDEIGNLA KAFKRMKLSLEMAMKRIRRTHGGNTGGTAGN" gene 14544..16421 /locus_tag="DP116_10850" CDS 14544..16421 /locus_tag="DP116_10850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319066.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="protein kinase" /protein_id="PRJNA477356:DP116_10850" /translation="MVWNPGQALLGGRYIIERKLGEGGIGITYLAGNERNELRVIKTL RVDILNQETWKPHQSKLKQDFRDEAVRLALCRHPHIVQIENIFDEGDLPCMVIEYIEG EDLGNRLRRMKVLPEVEALLYIRQIGDALKVIHSKGLLHRDIKPRNIMLRAGKSEAVL IDFGIAREFIPNVIQRHTVYRTPGFAPPEQYELEAPRGEYIDVYALAATLYSLITGVT PRNADERARHNIPLEPPKHFNRNITDTVNQAIIRGMDLQPNLRPQSVQEWLDLLDVEI REEPPTQVIVHRPVLPNNQSPPQNQQRIIPSVATSGNWQCVRTLRGHSSMVLAVAISS DGQTIASGSNDHTIKLWQLETGKLLRTLGGLFSGHSSIVHTVAFSPDGQLLASGSWDE TIKLWQVSRGKEIRTLTSHTSWINSVAFSPILPNFSYQQEEAYVRAWMLASGSADSTV KLWIVSTGIEISTFTGHLDSVWSVAFSPDGELLASGSADSTIKLWQVNTGRKIHTFRG HSFFVNFVTFSPDGRLLASASADGTIKLWQVSTGKEIHTFRGHSDAVCSVTFSPNAQL LASGSWDKTIKIWQIRTGTEICTLSGHSNYVRSVAFSPDGQTIVSGSDDDTIKIWRCY S" gene 16491..16715 /locus_tag="DP116_10855" CDS 16491..16715 /locus_tag="DP116_10855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652075.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_10855" /translation="MKHIKLIVEKHSDGYVAYPLGIKGVVVGQGDSYEEALADVKSAI RCHIEIFGQDVLEEESPVLEAFVAEAEVSV" gene 16715..16954 /locus_tag="DP116_10860" CDS 16715..16954 /locus_tag="DP116_10860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877318.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicA family toxin" /protein_id="PRJNA477356:DP116_10860" /translation="MAKFPVDAPKARVIKTLELLGFNIIREREHIVMVRQNNDGTRTP LTMPNHSYIKGSTLRSICTQASISREEFLAAYEQM" gene 17202..18101 /locus_tag="DP116_10865" CDS 17202..18101 /locus_tag="DP116_10865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10865" /translation="MGFGIGDLFWIFLLLSSLQPLWQKRQVEYRRIRALQEFQQQRKS RVILLIHRQESISFLGIPVSRYITIEDSEQILRAIRLTPPEIPIDLILHTPGGLVLAT EQIARALIRHPSKVTVFVPHYAMSGGTMLALASDEIVMDANAVLGPVDPQLGNFPAAS IVKVVEQKPIGEVDDQTLIMADLSRKAIDQVQRFVRTLLKDTVPTQKVKPENIENIID ALTTGRVTHDYPITVEEATEMGLPITAGLPRVIYDLMDLYPQPQGGRPSVQYIPMPYD DRRPILPTPKGRPLEEPSPTQMS" gene complement(18111..18863) /locus_tag="DP116_10870" CDS complement(18111..18863) /locus_tag="DP116_10870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494272.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_10870" /translation="MWCGFGKSNITVATACLITASVVISDVAFAARTYTPRQFRAVLR GLGYSVKVSDAPLTTADTKNAIQEFQKGHKLQPADGIVGPKTQDVAAKIVETLQANLN LVAKPNPALPSNQYYGPQTEVAVKQYQKKLGLQETGIADLAFRQRLSQEAKDIKKKPT SAPTTKPRKPTTSPTTQPTPTAKPTKTPTSTPRTTPTAKPTTTPTSTPEALPTATPTS TPEALPTASPTSTPGVSPTATPTPTPTPTPRR" gene 19223..19582 /locus_tag="DP116_10875" CDS 19223..19582 /locus_tag="DP116_10875" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10875" /translation="MTSPLPLNTQPVAGAIFTALLLWRGVVVTGGKGNDILWGGTGAD RFVYSSPSEGIDIIKDYSYLQGDRITISKTGFGATSTNQFSYNSITGALFFQGTQFVT LENKPADFSTSLGIELV" gene 19925..21325 /locus_tag="DP116_10880" CDS 19925..21325 /locus_tag="DP116_10880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196595.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10880" /translation="MAIIASVNLGLVLFDFSYIPWRDFYKRNLPIITQRYDPVKGIEP HRETENYLVTVDALIEQVSQTGLQSPQVAKELEELRGLSSEMIDSNPFEAANKSGTLE KIKNRMRDRIGVKSSKQAFSTFWSQAYLSKRGWKQEIDFFNQRIRPLIIVNYYRSLGE NGEFVDNFWIIDLPFVILFGLELLARSFYIQRQHPGFRLFNAIFWRWYDLFLLLPFWR PLRIIPILIRLDHARLLNLHPLRQQIHQGIIANFAEELTEIVVIRVINQIQGSIQRGE LKRWLSQNENLRSYIDINNINEVEAIGAILVKTIVYQVLPQIQPEVTAILQHNIESAI KQVPIYHNLQILPGLDAMQTQLSEQLATQITTNLYSAVVSAAEDPVGAKLSGQLVQRF SEALGTEMQKKHVLSEIQNLLIDFLEEVKLNYVQRLSKEDMWKILEQTKQLRTQVSVQ PGEDKSSTMLVRRETS" gene 21747..22208 /locus_tag="DP116_10885" CDS 21747..22208 /locus_tag="DP116_10885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210359.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribosome maturation factor RimP" /protein_id="PRJNA477356:DP116_10885" /translation="MTHPLVPQIIELATPVAEELGLEVVGVVFHTNQNPPVLRLDIRN PQQDTGLDDCERMSRAFEATLDAVEIIPDAYVLEVSSPGISRQLTSDREFISFKGFPV IVQTAPFYEGQQQWIGQLIRRDETTVYLNQKGRVVQIPRSLITRVDLDERR" gene 22410..23681 /gene="nusA" /locus_tag="DP116_10890" CDS 22410..23681 /gene="nusA" /locus_tag="DP116_10890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872537.1" /note="modifies transcription through interactions with RNA polymerase affecting elongation, readthrough, termination, and antitermination; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcription termination/antitermination protein NusA" /protein_id="PRJNA477356:DP116_10890" /translation="MVTLPGLKDLIENISRERNLPRVAVQSAIREALLKGYERYRRAQ NLERKQFDEDYFDNFDVQLDVEDEGFRVVATKTIVEAVANSDHEIALQQVQDMGGDEA QLGQEVVLDVTPDQGEFGRMAAMQTKQVLAQKLRDQQRQMVQEEFQDLEGTVLQARVL RFERQSVIMAVSSGFGQPEVEAELPKREQLPNDNYRANATFKVYLKKVSQGQQRGPQL LVSRSDAGLVVYLFANEVPEIEDEVVRIVAVAREANPPSRHVGPRTKIAVDTLDRDVD PVGACIGARGSRIQVVVNELRGEKIDVIRWSPDPATYIANALSPARVDEVRLMDPETR QTHVLVAEDQLSLAIGKEGQNVRLAARLTGWKIDIKDKAKYDHAGEDAKFADVRAKYI PEVDDSDEEDLEDKNHEELFEDEIFDENNDE" gene 23730..23975 /locus_tag="DP116_10895" CDS 23730..23975 /locus_tag="DP116_10895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015083448.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF448 domain-containing protein" /protein_id="PRJNA477356:DP116_10895" /translation="MKPNYRRCISCRKIGLKQEFWRIVRVFPSGQVQLDEGMGRSAYI CPQHNCLLAAQKKNRLGRALRASVPEALYHTLWQRLH" gene 24179..24431 /locus_tag="DP116_10900" /pseudo CDS 24179..24431 /locus_tag="DP116_10900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196599.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="translation initiation factor IF-2" gene 24428..>24899 /locus_tag="DP116_10905" CDS 24428..>24899 /locus_tag="DP116_10905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318995.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="translation initiation factor IF-2" /protein_id="PRJNA477356:DP116_10905" /translation="MNNGKVRIYELSKELNLDNKELLAICDQLNIAVKSHSSTITESE AERIRAQAEKLAQTSVTPRIGNGTNSHRPNSPQDGGRNRPAAPNKQQILEIRKPTVLK NPISNAPEASVAINTVASEVNPPSPPKSIAPPVSPMKPTAPIRSVPRNQSETTLE" BASE COUNT 7105 a 5308 c 5331 g 7155 t ORIGIN 1 cccctcccct aagctcgcgg ggaggggcgg ttttggcaag agacaaaacc agggtggggt 61 tcttcgggaa tggtatcaaa tccggcggcg tctgaatttt taaaaaccgc agaggcgcag 121 agaacgcaga gaaagagaag agagagaaga gggagaaata attctgatac aaacggattt 181 gatattatat catgttcgac taattagtta taattcccac gttgactgaa ccccacacgc 241 cagtcgccac aacgggggga acccccgcaa ggcgctggct cccctttatc ccctccccgc 301 ttgcggggag ggaagcctaa gcgcagcttt ggcggggtgg ggttcttcag gaatgataag 361 taatcaaccg aacctgatat tactcccctt aagttatcag gcaactatca actcagactc 421 actctccgat ggtgcaccat cctgcaccaa ggaagatcta ttatcattgc cttgcaacat 481 attacccgca aacgcaaaca aggcattgac tttacgagta cgccactcat ccacgagttg 541 ctggacttta cttgcagaca tccgccgtgt catggcttca taaagataag cagcatgacc 601 tgtttcatct ttagcaatgc tcaacatccc ctgtttgagg ttacgagcaa ttggatcatc 661 ttcaggtaac acgttcgcca tgcgagtaaa 
gtctttgcta gcatccaatt ctaagatgta 721 agtgctagcc atgaatacat cccaatcaat tgtctcaggt ttgagttgtt cttgagagta 781 accttcatag tatgctgcaa agaaggggct gcgtcgttgt tgagttttct tttcctctgt 841 attctgaggc gaacttttga aatctataac tcttttatta agttgtttta aggcatgagc 901 aaagatttga ccgtgtctgg tttcatcgga tgcgtgtttt gccagctttt cggctagcca 961 tgtgtcgcct tctgcggctg cgcgatcgcc cagcttagtc aaaaatggaa cagaaccaga 1021 ctctgctaac tgaaagcctg caagaatatt ggggcgggtt ttgagatcac ggatttgcaa 1081 agcagagtaa taagccatcg caccagaacc aactaagtgc atgacataag tcaagaagtt 1141 catgagtaag tcgcgcggat tgtgaggaag ttacatttct tatgctaatc ctcatttttg 1201 attttgtgag caaaactaag ggtgtagggg tataggggtg taagggtgta ggggagaaga 1261 agtaggggtg taggggaaga aaaattaatg aacgtaaact ctttcctcac ttacaccctt 1321 attcccttac ccccctacac ccctaatttt tgacacctct accccttatt ttccagtgag 1381 cgaatcagga gcaacagaag cctacctact gattcacatt gattgagaac tggagtaata 1441 gttgctgatg aagcgagttc gatacgaatg gacaataaca aatgtgtttc caattctttg 1501 agtgaacctt gggcaatata caaaaattgg atgtactcac cgcgtgtcct tcttccataa 1561 ccttcagcaa tattggctgg aggggataaa aaatgggaag caataaccct caacaatatt 1621 ggcattgtct actccgcttt aggaaacaaa cagagtgccg cttcattttt catagacaag 1681 gatggacaac cctccacaat gtctactcgt aaaacccgag atctggatgt atttttgcgg 1741 cactaccaaa aaatttgaat caaactagtt gaaaggatac gccgcaaaaa tacaagggtt 1801 gtccatcctt gtcgagggtt tgcccaaccc tacgcccaat ccggctttgc cggattgccc 1861 tctgctttca gcacaggggg agggttttct ggaggaaaac cctccgggcg tgatacgggc 1921 gatcgctttg ctacccttta tgtttaattt ggttaattga cttattcact ggtaattgtc 1981 aacaaaattt ttttatgggt tactaaagtc tctagttcag tagagatttt agtataccat 2041 tagtctctaa acttatattt cgcatctaca ggatttactg tggaacaacg taactaaagt 2101 ccgcagcaga tggctgtaac tgtgtttagc cgtgggtgtg gttcatgcag cctctcaaag 2161 accaaatttt tggtactgga taggtgaact ggtgagtggg tatagctgcg cccaatggaa 2221 gtgtcggcga cagatctcat ttttgaaagg atattatttt tgtcttacct tttagaaaaa 2281 gaagtggcaa agccacttct gcgtaattcc caggtagaac ctaaagaata gtataacttt 2341 tctaaaaaga tgaaacggta tgattccaca cttgttaagt 
aatatgtttg ctttaattgc 2401 cttcgtgatg ttgcggatat ttgaagcggc tgactaaaac aagcgaaaaa tctgccagca 2461 gaggtatgaa agggaaaaat aagtcaagcc tcataagaac tgcgactaag caggaagaca 2521 aaccaatcaa tcatgttcag cgaaccctct ggcgtgcgct ggcgcaagag gtgtaccgaa 2581 ggcttaggct tggctgattt ttttgtgttc atgagagatt catgtctcac aaatcaaata 2641 ggactgctat agtaccgcta gcattcgtca aaagtgaagg taacaagtca aaagtccttt 2701 ttttgagctt ttgacctcta aaagatggta gctttatttg gaccgcactg tactagcttg 2761 ttttctgcta tctaagaaag atttattggg taaaaacgaa tatgaaaaat tcttggtatt 2821 cctgataaaa actctattga cctgtaacag gtaattagat tatcttagca gtcagcaaaa 2881 ttggttaggg aaacccatca acagaaaact tatacttaat ttattgtcta gttcctctat 2941 ttttgtatcg ctgatgtcta cgctagcagt catcaatcct gcccacgctg ctaaaaccga 3001 ggaattaatc cacaaaaatg gtcagacttg tgtactaagt ccccacgctg ttaacaagat 3061 ggtgtgtatc aaggattcag agaggaagaa agcatctggt tctacacctt catcatctac 3121 accttcgtca actgtgacta caggcagtct gcccagaata ttgcaacgct tgcatttaca 3181 gaagaagaaa gcgatcgtgc cattcagtta tttggctgtg actgtccagc ttgtataaac 3241 tctttacgcc agttgcgtgg gtctggaaac ttggtgtact agaatttccc ctcgattgtg 3301 atcactactg cgactttggc gtgaaagtta actgtatagg acttaggact caggactcaa 3361 aaactcccaa ttcctaactc ctgatttatc cgcctacctc gttcactggg aacctatcaa 3421 taagctgcat agcacgatgg gcattgcgct gcaaggattg tgataaatgg ggtacgtggg 3481 gaatttgcga tagtaaatct aacgtccggc gtaaaattcg caccacatca ccttcatcca 3541 aactcgtatt ttcacaaagt tctatccact ccatccctag cgcccaatgt tccacaaggg 3601 ctattaactc aaactctaac catatcggta aagcgacatt ataacgacgt tgcagttgaa 3661 agatcttgcg acgaatcccc cgcaattttg ataacgcttc tgccacttct tcactcaagt 3721 caaagcgaac catgctatct ggacgtgggg tttctgtcac caaagctgcg atcgccgctg 3781 ctaagtggtg tggatctaag ttgtccaatt ctccactctc aagtgctaag cccagccaca 3841 actcattttc tcctcgaatg gctgcagcca tttgtcctag tcgcgtggga acaagtttat 3901 ccagacattc aaagtgctgc aaaatctcaa taagattgag aaactcttcc caatggcgct 3961 gagattgttg ctctatttgt gcctgcatga gttgaatttc ggcttcgagt tcaaccaact 4021 gtgttctgcg tttgaaaaga ctgacagcgt cgcctgattg atgtaaggga 
tgtgcttcta 4081 attgctcttg cacagcagtg acgcgacgaa gttgttctag gacttctgga ggcataaata 4141 cagcttcgct tgcctcagga atttcttgag ttataagtgc ggtatactca tcaccaagaa 4201 gagactgtcc ccgtttcaat ggcatttctg gtggtggtac caattcatca ggtatgtcaa 4261 ttcgcggtag ttctgcaaac aaatctatga catcagaggc tgtcgctaca taccagcgat 4321 tatcttgccc caagcatacc aagtaacaag tttgaccaga acctggcgtt ttcccaacta 4381 acaccgctgt tatgggtgca ggcattgtaa agtttttact cttgagactc aacagtgttc 4441 ccgacactgc aaagtccaac atcatcccca attcgtcttg tcgggcttcc tgcgcttgct 4501 cttgcagcat tttcaagttt agccgttcta ctcgcaggcg ttgccgcaat ttttcataaa 4561 cagccagatc ttgttcgtca atagcagcga tttgtgcttg aagttgagca agttctgttt 4621 gtaagtgggt tatgtactca cgttgcggtc gtaggtgcaa gttcgccaag tactgtccaa 4681 agctgcgttc tacgagttct ttggcttctt ccaaagtgtg agtttgcagc aagttcagca 4741 ccatgccata acttggtgta aattgactaa ctagaggatc gctttgggaa gttgccaagt 4801 aagctgcttc tttcgctccc tcaaagggag tttgcagcgt cacaacatga ccccgttcat 4861 ccattccccg gcgacctgcc cgacctgcca tttgcagaaa ttccgaggca ttcaataggc 4921 gatgtccact atcagtacgc ttagaaaggg tagaaataac cgttgtccgg gctggcatat 4981 taattccagc tgccaaggtt tcggtggcaa agacaacttt aattaacccc tgttgaaaaa 5041 gttcttcaac aagtactttc cacgcaggta aaattccagc atggtgtgcg gctattcctc 5101 ggtaaagcgg tccaacgtgt ccagaacgtc cggcttccgg attacggctt aaaaattcgt 5161 caatctgcca gcgcaattgc tgcgcttctt cctgatccac taaccacata tcactgccca 5221 cctccgccac cgccttatca catccccgac gactgaagat gaagtaaatt gctggcagca 5281 tatcccgttc ttgtagttga cttagaacct gaatcatgct gggggcttcc gctctcacat 5341 tagctttgct tttgtttttc tctccttttt tcttctttct tagcaggcgc tggttaattt 5401 gggttttatc atcattcagc agggggaata accctttgat gttgccaaag tgaaattcca 5461 aaggtactgg gcgatgatcg gagtatatga gatcagtagg accatgaact tgatttaacc 5521 agtcagtcag ttgatcacta ttagcaacag tcgctgagag ggctacgagt tgaatttcac 5581 ggggacagta gatgatggac tcttcccaaa ctgttccccg ttggcgatca ttcatgtagt 5641 ggcactcatc cagcaccata gcctctacgt ctactaatga aatgccaatt tgcccgatag 5701 gtgtgccata gagcatatta cggaaaattt ctgtggtcat caccagaatc ggggcatctc 
5761 ggttaatgga ggcatctcca gttaacagtc caacatggtc aaagccgaat ttttcgcgaa 5821 agtcacgtag tttttgatta gagagcgctt ttaggggagt ggtgtaaaat actctttttc 5881 cacgcgatag agcgcgataa atagcgtatt ctccgatcaa tgttttgccc gaacctgtgg 5941 gcgcacacac aacaacggaa cgtccagcat tcaaggaggc gatcgcatcc ttctggaact 6001 gatccaattc aaatgggaat atcgaacttg gttcaagttc tggagacagc gcagggtaat 6061 tcactcaatc acatttgcaa ccaaaactgt ttactatggt aactactctt ttgtcaactt 6121 agttatgagt catgagtcat gagtcatgag gaggcagtgc ggtcttgggg agccagtgcg 6181 ttgcgggggt tccccccgtt gaagcacctg gcgtggtttc ccccatgagc acctgccgtt 6241 cattggtttt taactcatga ctcatgacta ttttcccaat tcttgtgagc gggctgttgc 6301 tgctttgact gcttcaatta aagctgagcg aaacccggct cgttctaact gagctacacc 6361 agcaattgta gtaccacctg gactggtgac gcggtctttg agttctgcag ggtgtatttt 6421 tgtttcatgt aggaacttgg ctgttcctaa gacagtttgc aaggcaagtt gattggcaat 6481 tgtccttggt aaacctgctg cgactcctcc atcagcaagt gcttctatag caagtgctac 6541 gtatgctgga ccagaacctg atagtcctgt caccgcatcc atcaggtttt ctgaaacttc 6601 cacaacttct cccacagccg aaaaaagtag ttttgcggtt tcgtgatggt ttgctttggt 6661 gtacgcgcct gggcatatcg cggtgactcc tgctcccaca gtagctggag tattaggcat 6721 tgctcgaatg attggaaagt ggggaaaagc cgcttccaac tgatttaatg tcacgcctgc 6781 caagatggag atcaccagtg gttttgacac tgtcgtgatc acatctgcta attcttgggc 6841 gatcgcacta aacacttgag gttttactgc caaaaatacg acttctgttg ttggtgtgaa 6901 aaccagacga ttatcggctg tcacattaac accatattcg tcttctagaa aaccttggcg 6961 tgaggtttgt ggttcgctga ctaggacttc tgaacgttga taaattttac gagcgataag 7021 gcgggtaagg agcgcttctc ccattacccc gccaccaatc aagccgaatt taatagtagt 7081 catttttcgt tagtcataag tcatgagtca ttagtgagcc agcgcgcaga acagggggtt 7141 ctcccccaat ccccacggag tggggaccgg tgagtccccc atgagcgact ggcgttagac 7201 cgcagggcgt gcgctttgcg catacccgta agggtcatga cttatgactc attagtataa 7261 agacaaagga ctttgtccaa agtcctctgg actaatcact cattttacgt ttattgtgcc 7321 attctgttaa cttcgtttcc ccaagcagga tttgggcctg tggaacgtga gggacgtact 7381 gttggttgag gtacttcatg aataattcct gactgggtgc taacttgcac acagcttggt 7441 
gtgaacaaaa atatactttc gcctatccgt tcttgatgcc catcaagcgc gtaagtacca 7501 ccggcgacaa agtccactgc tcgttgtgct tgatctggat ccattattgt taaatttaat 7561 actactgact tacgctctcg caatgcttga attgcttgag gcatttcttc aaaggtgcgt 7621 ggttctagca ccaagacttc tgatattcca ttaatgactc ctggcatacc aatcacattg 7681 ctcatcggct ttgatcctgc tgcgctgaca ttttccattg taggcacggg ttcccgccag 7741 cgtcgatttt gctctggagg ttcttgtgct actggttgtt gagcattttg ttcctgatac 7801 agatttcggt aactttctgt ttctggttct tcttcatagt actcgtattc tacttgctcg 7861 tttaaaccga cgaagtctct aagtttggaa aatatattat tcatggttta cgctcctagg 7921 gctaatcact cgtaaggata tgccaaaaac taatgaaatc aatttttacc cttataaagg 7981 gcagatcttt caggcgttct attagtatct gttctaccca ctaccggttt ttctttgaac 8041 aaacgcttct agaaggtctg tatatgtttt ttaggtaaca atgagtcaag attatatctt 8101 aaccagtagg ttttttggaa gttgttttcc tctgatatta tattgacttt tgcttttgtc 8161 aatactatat cctttgtcac cgattgaacc agtatcttct gaaatttttt tcaacaaagt 8221 tttataaaga gttttggagc gcaaaatata ctgtaacaag gtatttaata aaaccattaa 8281 ataccttgtt ataccgtaat aatactttga ctgccaatat taaacagaac gctctccaaa 8341 caaaatagtt cctaacctta ccattgttgc gcctgcttgg actgccaatt tgtaatctcc 8401 tgacattccc attgaaaggt gatgcatttg aacattagac cagttttgtt ctcggatttc 8461 ttttgctaat ttactggtac tattgaatac gctgatcacc tcataatcat ttaatcccaa 8521 aggtggaatt gtcattaagc cctgaatttg taaattttca cattgatcta gagctggtaa 8581 gtcagccaat agttgtgatg cactccaacc cgacttgttt ggatcaggtc gtattttgac 8641 ttgcagacaa acctgaggag tgatttttaa ctcttgggcc aattggttta agcgttgggc 8701 aagcttcaaa ctatctacgg agtgaatcca ctgaaattgt tcaatggctt ttttggcttt 8761 gttgtgttgc aaatgtccaa taaagtgcca agttatgtct ggtaagtctt gcaacacagc 8821 ttgtttgcta gccgcttctt ggatgcgact ctcgccaaaa tcccgaattc ctgcggcgta 8881 agcacagcgt atgatttcgg cagagacttg cttggtaaca gcaatcaacc ggactgagtc 8941 tggtagagag gaaccaaggg taataatacg ttcgctaatc gaactactca ttggaaggtg 9001 cgttggaaaa cagtctgaag ttgatcgtac tcctgagatt gccctgtacg acgcagacta 9061 cgcaagcgat tttcaatcat cattctagcc tcagtgcgtc caatggactg aaacttcata 9121 cctttgacat 
cagttgttac tatgaagaat aagcgttggg catacagtgt tgtgaacaac 9181 tcttggttct cgtctaccat acaaatccta tagagcaaac cccaagttgg atgattgatg 9241 taagtttccg agttgtctaa gttcatttaa tactaaggga tgaatatata tatttttgca 9301 gctgaagaga aacattttga cgacaaagga cgcatttcgc caaacaaaat ttttgctaaa 9361 atgccaaaaa ccccaaagtt aagagcaagc agtaccttgc caacagaagt aggatcaggg 9421 gcttgtgcag tagcgcgtgt tttctgtgga ggaggttacg aatcgcgctg agcgtgtcct 9481 tagggacata gaaagctctg aactatgccc tgctttgtac gtcgcttgcc tctcgttaga 9541 atgccgatta gcttaacaca aagtgcatgc agcgcatacg aaaagtttcc cttcatgcat 9601 gcaaactgcc gtttatcact caataggtgt aaattataga agaacgaact acgccgcatc 9661 ggttctggtt cttcaacaat ttcgatcaag cggactttac gacaacctgt gagcgtgcca 9721 acgctaacaa atggcgcgaa tcaagtcaag cacgttgcaa tcagatttta agctacgttt 9781 gatgttgctg acaaggtgca taacgtggct attacctcaa ttcttagcaa ttcattaaca 9841 gcaacagttt cccattcaca cgctcgccct aattcattgc gtttcaactt tactagcatg 9901 tcggtgttca gcgggttcaa tggttgtgtt cagtaaagtc ttgtgcgagt aatcgtgaac 9961 atgtagagtg atacacaaga cttaacaatc ttattcccct tactactctt ttgatctttt 10021 gcgctttgtg cgaaacgatt ttagaatgct ttgtacgtac ggtcgtgcaa ctgtaaatta 10081 tagaacactt gtgtcttcat tccattcttg taccttggaa tttactgtca tttgtgctat 10141 tctagatgaa attgggcgat tagcacagtg gtagcgcact tccttcacac ggaaggggtc 10201 actggttcaa atccagtatc gcccatttct ttaattctat ctaagcttaa gaagcaacaa 10261 gggtggtttg ttcaaatcta ccactcatga aattgtttaa ctagctgttg tttctaaaga 10321 tgaaccgaaa caatgtttgt tcgcaaatca tattttcaaa tagcaacaca agctttatga 10381 aaaattaatt ttttagatga attgctatat gaagtacttg aaatagcgat aatcaaaatg 10441 tctccatatt agagaatttt ccggaaaaag tcaattgaag aatgaaaacc atgacctcac 10501 gatatttttt gctcattcat ctcctagtgc ttataggatt actaggaact ggatgcaaca 10561 atcaacaacc agatatcaac ttagaaaaat tgactgtagg tgtggttagc tacggcgaag 10621 gaactgtttc catagacaaa tatgagcgct ttaaagacta catagcaaaa caaactcaat 10681 caatagtaga attggaacca gcttacaacg aattgcaagc tttagagcaa attcagcgcc 10741 aaaattggga aattgttttt gcgcctccag gtttagcagc aatagcgatc ggcaagcaac 10801 
tttatgttcc tctgttctct atggagggaa cacgtggtag acaacgctct ttacttatag 10861 tccgagatga taaaccaata aaaacaatac cagatctggc taataaaact gttgctttgg 10921 aaacagcagg ttctgcggct ggctattatg ttcctttgta cgatttgtac ggtttaactc 10981 ttgctcaaat ccgttttgct cctactccca aaacagtatt gcagtggatc agtgaaggta 11041 gtgttgatgc tggggcatta tctcaaacag attttgaact ttatcagcga gagtttagta 11101 caaccaagtt tagaatttta cacactagcc gatcaattcc tcctgcagtt gttttactag 11161 caccaactgt ggaacggaat caacagcagc agattcaaaa agccatgacc caagcacctg 11221 gagatatcat agcagatgct ggttacgtcc ctgcagcaaa agtacctagt tacgagcagt 11281 ttattaaatt agtagataaa gttagacccc tagaagagca agtcaagaaa acacctgctg 11341 ttttgctcaa tcaccaatct accagtcaat agagagttga tggacaagtt attaacaaaa 11401 gacaacttca taaaacttca tcatgttttg gcgttttttt ggaaaaatct gccgatatat 11461 tattaagttc tcaccagctt ggagtcacgg ggaatcaact gagtataata taaattatct 11521 atctaccaaa cagatttttg taaggaaata ctgtatctgg cgctgtagtt atcagactgc 11581 ccaaatgcga taaatctatt aaaactttta cctaatagac tttcttttga aaaacaccaa 11641 attacttata aagaaaatag ctaagatagc taggagtggt gccttatgtc tcaatctccc 11701 attgtaaaag cagaaatacc ctctgggacg atgattgata atcgttatat tattcaaaaa 11761 cttttgggac aggggggctt aggaagaacc tacttggcgt ttgatactcg tcggtttaac 11821 gaagcatgcg ttctcaaaga atttgcaccc acaggttcag gggagaatgc tttagaaaaa 11881 tgtcgcaatt tatttaaaag agaagcaaga attcttcacc agctggaaca ccctcaaatt 11941 cctcgtttct tggcttgttt tgaaggagat ggtcgcctgt ttttagtaca ggagtttgtg 12001 gatggtaaaa catattcact tctgttgcga gaacgccaag acttggaaga aagtttttcg 12061 gaaaatgaag ttatacagtg gctaaaaaat ttgttaccta tattggaata tgttcaccag 12121 cacaacatca tccatcgaga tatttctccg gacaacatca tgttacctga tggcaaaaat 12181 ttgccggtgc tgattgattt tggtgtggga aaacagatag ctaacctgaa tgaagacacc 12241 acaccttacc acatgagttt tgttggcaaa atgtcgcttg tggggaaagt aggatatgcc 12301 ccccgtgagc aaattagctt ggggttgtgt tctccctcca gtgatcttta tgccttgggc 12361 gtgactgcag ttgtcctact cacaggtaga gatccatctt ttcttatgga tcagtactct 12421 ttggagtgga gatggcgttc ttacaccagg gtcaccaatg attttgccag 
agtacttgaa 12481 aaattgctag cagatacgcc aaagctgcga taccaatcgt caaaagaagt tcttacagat 12541 ttagaccgtc ttggacagtt tcaagtaatg cgacaatcaa ctgttgttga tttgcctacc 12601 accgttatag aacagaaatc tcaaccgaca ttcccaacac gacagccacc caaccaatat 12661 ccagagacta tagcaagttc ggctagaagt tcatctcgtc aacagcaatt ccctgtgatt 12721 gagtcacaac cacaaccaca acaatcttca ctcaatccag ctttcataca acgttgtcag 12781 caacaactag cttactacat tggaccgata gcaagtctag ttgtggaaga gattctggcg 12841 gaaactcctc agatctcgcc ttaccaattt attgaatttt tggcaaggga aattcctgat 12901 gcatcagcag ctcttgaatt tagaaaacac cttttttcgt aagtgttttg gaggaaccca 12961 ctcacctaag tgtgctcaag aaaatcaaaa ctgtcataaa tatagttgcg acttagtttg 13021 agttgctgct taatttcctg cggttcctcc agtatttcct ccatgagtac gtctaattct 13081 tttcattgcc atttctaaac ttaatttcat ccgtttaaaa gctttggcaa gattaccaat 13141 ttcatcattg gaaatctgat caaaatcaac ttctaaatgt cctgtgctaa cttcttcagc 13201 tacacgagtt atccgcttga ggggaataat aacttgtcgc tgcaaaaaga cattgactaa 13261 aagaataacc acgataaaaa tagtagagac aagtcctact atcagcaaag aagattgatt 13321 ggctttttca ataactttac tagcaggtag tgaaataaat tgagccgcta caatttgatt 13381 caactgccaa ttaaatccat taactgcacc gaaacgctca atcatagttt ttggtgcgac 13441 atcaggcgtg ctatgacatt tcagacaact ttcttggcta atggttaagg gacgagcaat 13501 gtaaaatata tcgcctcccg gaagtgagcg aaatccactc acctctttta aatttttgtc 13561 ttgttggaaa ccctctaaaa ttgtcgtctc aaaactgtca gccttatccc gaagatttgt 13621 gggatttgtt gctgcttctt tataaaataa gtctcggtat tctggttttt ttcggagaat 13681 ttcaaagacc tctcgtgctg agtatgcaga tacaacttgg ggcaaaaact tttcttccag 13741 tttatcagca agttctggat agatttgggt aagtgtgtac ccacgaagag aagtcattgt 13801 atcgatgagt gttaaagctc gtgaggcaat ttcatttata gcattctgcc ttaacaaagt 13861 agaaagagcc aatccactaa agctcaaacc tactacaaga atgatgacta gcaaaattgt 13921 aaatttttgt ctcaatttta aattttttaa catcatccca gttcatctca agataacagt 13981 acaacgcaca aataattgaa gccatctatt acccattata gggacagatc aagcagaaat 14041 aactaagaat gtaaatgatt acaaaataac tttgctgcat tccagatttt gtcaacaaat 14101 ttttacaaaa aatatattga tgaattgcaa 
gctcccattt atgattaagt tcatcaatca 14161 agtttatgac ttttttgttg aaaggatact atatttatca acttgagctt ttgcattcgt 14221 tacattgaag tctttgagaa attttatccc tcagttttac tgataaaaca taaccagtct 14281 taaagttgaa tttttaaaga aaaattaaga tacgagccac ttgctaatag cctgcttgtt 14341 cagttgagtc agatgactgt tgacagtgac agaataagac taggagataa ctcagttcaa 14401 aatatgcctt acgggcacac taagcctagc gtgcaactcc cgtgtaggag gcacaaattt 14461 agaatctcaa aatcttcccc tgtttcctaa cccagcttta ataccgtgcc aaaataaata 14521 gcatgcggag agttattaat ctaatggtct ggaatccagg acaggcgttg cttgggggac 14581 gctacatcat cgaaagaaaa ttaggcgaag gtggaattgg tatcacctat ctcgctggaa 14641 atgaacgaaa tgaactgcgg gtcattaaaa ctctaagagt tgacatcctt aatcaggaga 14701 cctggaaacc gcatcaaagc aaattaaagc aagacttccg cgatgaagct gttcggcttg 14761 ccttgtgtcg ccatcctcat atagtgcaaa tagaaaatat ttttgatgaa ggcgatttac 14821 cttgcatggt gatagagtac atcgagggtg aagacttagg caaccgcctt cggcggatga 14881 aagtgttacc agaagttgaa gcgcttttgt acatccggca aattggcgat gctttaaagg 14941 tgattcacag taaaggtttg ctgcataggg atattaaacc acgtaacatt atgctccgcg 15001 ctggcaaatc agaagctgtc ctgattgact ttggtattgc cagagaattt attcccaatg 15061 ttatccaaag gcatacagtg tatcgtacac caggttttgc ccccccagaa cagtatgaat 15121 tggaagcacc acggggagaa tatattgatg tctatgcact agctgccacc ttgtatagtt 15181 taatcactgg agtgacacca aggaatgccg atgagagagc acgccataac attcccctag 15241 aaccaccgaa acacttcaat cgtaatatta ctgacacggt caatcaagca atcatccggg 15301 ggatggattt gcaacccaat ctccgcccgc agtccgtaca ggagtggctg gatttattgg 15361 atgttgagat ccgcgaggaa ccgcccacac aggtaatcgt acaccgccca gtacttccta 15421 ataatcagtc tccaccccaa aatcaacagc gaatcatacc atctgttgcc acttctggaa 15481 actggcaatg cgtacgtacc ctcagaggtc attccagcat ggttcttgct gttgctatca 15541 gctcagatgg acagacgatc gctagtggta gcaacgatca tacaattaaa ctgtggcaac 15601 tagaaactgg taaacttctg cgtacccttg gtggtttgtt ttctggtcat tccagcattg 15661 ttcatactgt cgcttttagt cccgatggac agttacttgc tagtggaagc tgggatgaaa 15721 ctattaagtt gtggcaggtc agtagaggaa aagaaattcg tacattgact agtcatacca 15781 gctggatcaa 
ttctgtagcc tttagtccaa tacttccgaa cttctcttat caacaagagg 15841 aagcttacgt gcgagcatgg atgttagcca gtggcagtgc tgatagcact gttaaattat 15901 ggatagtaag tacaggcata gaaattagca ctttcacagg tcatttggac tcagtatggt 15961 cagttgcttt cagtccggat ggggaacttc tagcgagtgg tagcgctgat agtacaatca 16021 agctttggca agtgaataca ggtagaaaaa ttcacacttt tagaggtcat tccttttttg 16081 ttaatttcgt taccttcagc ccagatgggc gacttttggc aagtgccagt gctgatggta 16141 ctatcaaact ttggcaagta agtacaggga aagaaattca cacttttaga ggtcattccg 16201 atgcagtgtg ttcagtcact tttagtccga atgcacagct tcttgctagc ggtagttggg 16261 acaaaactat caaaatctgg cagataagga cgggtactga aatttgcact ctttcgggtc 16321 attccaatta cgtcagatcc gttgccttta gtccagatgg acagaccatt gtaagtggta 16381 gtgatgacga tacgatcaaa atttggcgat gctattccta atttgggtaa tgcacttaag 16441 attgatgtat tgattgttgc ttataaaaat atctgctact caagaaactc atgaaacaca 16501 tcaaacttat tgtagaaaaa cattccgatg gttatgttgc ttatcccctt ggtattaaag 16561 gggttgttgt tggtcaggga gacagttatg aagaagcttt agctgatgtg aaatctgcta 16621 ttcgctgcca tattgagata tttggtcaag atgttttaga agaggaatca cccgttttag 16681 aagcctttgt cgctgaagct gaggtttccg tctaatggct aagtttcctg tagatgcgcc 16741 caaagccaga gttattaaaa cattggaatt acttggattt aacattatcc gggaaaggga 16801 gcatattgtc atggtacgac aaaataatga tggaacgcga actccattga ctatgcctaa 16861 ccactcttat attaaaggtt caacgttgag atccatatgt actcaagcaa gtatctcgcg 16921 agaggaattt ttagctgcct atgagcaaat gtaaaattac cttgacagta aaccataacc 16981 ctcgctaaaa ctcaatcact cttccctctc ccctaatgcg gttaaatcct tctaactctc 17041 tagatagact caatatttcc caaaaattga aacaatgagg agagtcatcc tgattttggc 17101 aatctattct taactgagac gcgtttacag aaaccaagca gatgagatgc ttgcctgagt 17161 taggcaattg attttaattg tgttttaatt tggaattgca tatgggcttt ggcataggag 17221 atttattctg gattttcctc ctcctttctt ccctgcaacc cctgtggcag aaacgtcaag 17281 tagagtatag gcgtattcgc gccctacagg aatttcagca acaacgcaaa agccgagtca 17341 ttttgctgat tcaccgccaa gaatctatca gctttttggg aattcctgta tcgcgatata 17401 tcaccatcga agattcagaa cagatactgc gggcaattcg gcttaccccg ccagagatcc 
17461 ccattgactt gattttacat actcctggtg gtttggtttt agcaacagaa caaatagcca 17521 gagcattaat tcgccatcct tccaaagtga ctgtctttgt ccctcactac gccatgagtg 17581 gtggtacaat gcttgcccta gcttctgatg aaattgtcat ggatgctaac gctgttcttg 17641 gaccggttga tccacaactg ggtaacttcc ccgcagccag catcgtgaag gttgttgaac 17701 agaagcccat tggtgaagtt gacgaccaga ctttgattat ggcagatcta tcgcgcaaag 17761 cgatcgacca agtgcagcgc tttgtacgga ctctgctgaa agatacagta cccacacaga 17821 aagttaagcc agaaaacatt gagaatatca tcgatgccct aacgactggg cgcgttactc 17881 acgactaccc aatcactgtt gaagaagcaa cggaaatggg gctacccata acagccggac 17941 tgccccgtgt tatttacgac ctgatggatt tgtatccaca accacaaggc ggacggccca 18001 gcgtacagta tattcccatg ccttacgatg accgccgccc cattttacca actcctaagg 18061 gtagaccatt agaagaacct agcccaactc agatgagtta agccaagtta ttatcttctc 18121 ggtgttggtg ttggtgttgg tgttggcgtt gctgtcggtg agactccagg tgtggacgta 18181 ggactcgctg taggcaacgc ttctggtgta gaagtaggtg ttgctgtcgg taacgcctct 18241 ggtgtagaag taggcgttgt ggttggtttt gctgttggag tagttctagg tgtagaagta 18301 ggcgttttag ttggttttgc tgttggtgtt ggctgggttg tcggtgaggt tgttggtttt 18361 cgtggctttg ttgttggcgc ggatgttggc tttttcttga tgtcctttgc ttcttggctg 18421 agtctttgcc tgaatgctag gtcagcaatt cctgtttctt gtaaacccag ttttttctga 18481 tattgcttaa ctgctacttc tgtttgagga ccataatatt gattactggg caaagcagga 18541 ttgggcttag ctaccaagtt taaattcgct tgcaaagttt caactatttt tgcagcaaca 18601 tcttgagttt ttggtcctac tattccatcc gcaggttgta gtttgtgccc tttttgaaat 18661 tcttgaattg cgtttttagt atccgctgtg gtcaaaggtg catctgatac cttgacgctg 18721 taacctagtc cccgtaacac agcacgaaat tgccgtgggg tataagttcg tgcagcaaaa 18781 gcgacatctg aaatcactac actagctgtg atcaaacagg cagtcgcaac ggtaatgttt 18841 gattttccaa acccacacca cataagtaaa actcctcttg gtcaaatcaa ccggatgttg 18901 attttgacag atagatgcat tttaagtttc tacaatgtct ttttttcgct ttattcatga 18961 tttttgataa gtttaagtta tattaatgaa atttagatga gaactgattt ttgccatcta 19021 tagcggttct caactgcatg cactacacgg tagggcgtga agaaacgtaa ctatatgaaa 19081 atgtgcctac atacacattg caatgcaggc gatcgctcca 
aaaagtacat taaaccatat 19141 tgccactttg ccgtttcacc gttgatagca agtagcagga gtagacccca aaccattggc 19201 atcaggcgca tgagagatca gcgtgacttc gcctttgccg ttgaacaccc aaccagtagc 19261 aggtgcaatt ttcacggcat tattgttatg gcgtggagta gtggtgacag gtggcaaggg 19321 taacgacatt ttatggggtg gtactggagc agatagattt gtctattcct ccccatctga 19381 aggtattgac attatcaaag actacagcta tttgcagggc gacaggataa caatttccaa 19441 aacagggttt ggcgctactt ccaccaacca gttcagctac aacagcatca ctggtgcctt 19501 gttctttcaa ggaacccaat tcgtcaccct tgagaacaag cctgctgatt tctctactag 19561 tctgggcatt gaactagttt aagaatgtga tgatttttag ttaagaatta aatttgaatg 19621 ggaaatataa gcctaggttc aatgcctagg ctttttaagt agaattatta tttaaaaaga 19681 caactttgtt ttaactgatc gcaaatacca ttctgcccga cctaacctga taaataacta 19741 ttttctgttt tctgacttcc acccaactta aaaataaagc agctatgctg agggcagaag 19801 attttttatg cttggtaatg tatgttgatg acaactgctt atctaaaatc tacacttttc 19861 catacacaaa caaatgacta acgtaaaacc tctaacacgg cgtaacttat ggtttgagcg 19921 gttgatggca attattgcgt ctgttaatct agggttagtt ttatttgatt tcagttatat 19981 tccttggcga gatttttaca aacgaaatct ccccatcatc actcagagat acgatccagt 20041 taaaggaatt gaaccccatc gagaaacaga aaactatttg gtcacagtag acgcacttat 20101 agaacaggtc agtcaaacag ggttgcaatc acctcaggta gcaaaagaac tagaagaact 20161 caggggtctg agtagtgaga tgattgatag caatccgttt gaagcggcta ataagagtgg 20221 taccttggaa aaaatcaaaa atcgcatgcg cgatcgcata ggcgttaaat cttccaagca 20281 agccttttcc acattctgga gtcaggcata tttatcaaaa cgtggttgga agcaagaaat 20341 tgattttttc aaccaaagaa ttcgtcctct aattatcgtc aactattacc gaagtcttgg 20401 agaaaatggc gaattcgttg ataatttttg gataattgac ttaccatttg tgattttatt 20461 tggtctagag ttactagcac gtagcttcta tatccaacgt cagcatcctg gttttcgctt 20521 gtttaatgcg attttctggc gctggtatga cctattttta ctgctaccat tttggcgacc 20581 tttgcgaatt atacctattt tgattcgtct agaccacgca cgtttattaa atctccatcc 20641 actacgtcag caaattcatc aaggaattat cgcgaatttt gccgaagaac tcacagaaat 20701 tgtcgtgatt cgagtcatta accaaatcca gggatcaatt caacggggtg agctaaaacg 20761 ttggctgtca caaaatgaaa 
acttgcgctc ctacatagat atcaacaata tcaatgaagt 20821 agaagcaatt ggagctattt tggtaaaaac aattgtttac caagtactgc cacaaatcca 20881 acctgaagtt accgcgattt tgcagcacaa cattgaaagc gctatcaaac aagttcccat 20941 ttatcacaac ctacaaatct tacctggatt ggatgcgatg caaactcaac tgagcgagca 21001 actcgcaact caaatcacaa ctaatcttta tagtgcagtt gtcagtgcag cagaagatcc 21061 agttggagca aaactttctg gtcaacttgt gcaacgcttt agcgaagctt tgggaacaga 21121 aatgcagaaa aaacacgtcc tctcagaaat tcagaactta cttattgact ttttagagga 21181 agtcaagctt aattatgtcc aacgcctgtc aaaagaagat atgtggaaaa ttctggagca 21241 aactaagcag ctacgaacac aggtatcagt tcagcctggg gaagataaaa gctctaccat 21301 gctggtgcgt agagaaacaa gttaattgtt aatttgcaac agcagtggaa aatgaacaaa 21361 cagacaacaa gacaaaggga ggaattttct ggagtggtac gacctctcat aatgcctgat 21421 ttcaaatcag ccaatcacgg gaaaatccca caaacctagc catgaaaatc tggctttccc 21481 ctacacccct atttttaagg cataaagatt tttccgtaca cccctacacc cttacaccct 21541 tataccccta cacccttaca ctcttacacc cctacaccct tacaccccta aacccatttt 21601 caagcgagtt tttgcttctc gctccaaaaa aaatacgcta aacttagatc aaaggcgaac 21661 ctgttaggtt cgcttgaaga cttcttggaa gatatcccta tcgcaggaag tgggtaacga 21721 cccacttttt ttgttggaat agccgcatga ctcaccctct cgttccacag attattgaat 21781 tggcgacacc agtggcagaa gaactgggat tggaagttgt tggcgtggtt tttcacacca 21841 atcaaaatcc gccagtttta cgcttagaca ttcgcaatcc tcaacaggac actgggttag 21901 atgactgtga gcgaatgagt cgtgcttttg aagctacctt agatgcagtg gaaataattc 21961 cagatgctta tgtattggaa gtttccagtc ctggtatttc gcgacagctg acaagcgatc 22021 gagaatttat ttccttcaaa ggatttcctg ttattgtcca gactgctcca ttctatgaag 22081 gacagcaaca gtggattggt cagttgattc gccgggatga gacaacagtt tatttaaatc 22141 aaaaaggtcg tgttgttcaa attcctcgct ccctaattac tagggtagat ctggatgagc 22201 gcagatagaa aaactatcag cattctacag ttgtgagtac caagtcataa ggagcgacag 22261 cgtcccttgt gaggttcacc cgttcagcaa gtggcgctgc tgagtcaggt tttagacaat 22321 tccagaaatc actcattttc agcacttagc actgttttta tgctgaccat tttttgcata 22381 gctatataag gagattgttt attatgtcaa tggtaacttt acccggatta aaagatttaa 22441 
tcgaaaatat aagtcgtgag cgtaatttac ctcgtgttgc agttcaatca gctattagag 22501 aagcattact caaaggttac gaacgttatc gtcgtgctca aaacttggag cgaaaacagt 22561 tcgatgaaga ttattttgac aattttgatg tccaattaga cgttgaagat gaaggctttc 22621 gcgttgttgc taccaaaact atcgttgaag cagtagccaa ctctgatcat gaaattgcgc 22681 tccaacaagt tcaagacatg ggaggagacg aagcgcaatt aggacaagaa gttgtgctgg 22741 atgttacccc cgatcaaggg gaatttggtc gcatggcagc aatgcaaact aagcaagtct 22801 tggcgcaaaa actgcgggat caacagcgcc agatggtgca agaagagttc caagacttag 22861 aaggaacagt tctacaagca agagttctcc ggtttgaaag acaatcggtg attatggctg 22921 tcagcagtgg ttttggtcag ccagaagtgg aagccgaatt gcccaagcgt gaacaattac 22981 ctaatgataa ttatcgggca aatgccacct ttaaggtcta tctgaaaaaa gtttctcaag 23041 gtcagcaacg aggacctcaa ctactggttt cacgttctga tgccggtttg gtggtttatt 23101 tgttcgccaa cgaagtgcca gaaattgaag atgaagtcgt gcggattgtc gctgtagcgc 23161 gggaggcaaa tcctccttct cgtcatgtag gacctcggac gaaaattgct gttgatactc 23221 ttgatcgtga tgtagatcca gttggtgctt gtattggagc gcggggatca cgaattcaag 23281 ttgtcgtcaa cgaattgcgc ggcgaaaaaa tagatgtgat tcgctggtct ccagaccctg 23341 cgacttacat tgctaatgct cttagccccg cacgagtcga tgaagttcgt ctgatggatc 23401 ccgaaacaag acaaactcac gtactggtag cagaagacca attgagtttg gcgatcggta 23461 aagaaggaca aaatgttcgt ttggcggctc gtctgactgg ttggaaaatt gatatcaaag 23521 ataaagctaa gtatgatcac gcaggagaag atgctaagtt tgcagatgtg agagcaaaat 23581 acataccaga agtagatgac tctgacgagg aggacttaga agataaaaat cacgaagaat 23641 tgttcgagga tgaaattttt gacgaaaata atgacgaata agagtaaagt tatcaagaaa 23701 catagtcctg agtaacgaac tcagcgctta tgaaaccaaa ttatcggcgt tgtattagtt 23761 gtcgaaaaat tggattaaaa caagagtttt ggcgaattgt ccgcgtgttt ccttcaggac 23821 aggtacaatt agatgagggc atggggcgtt ctgcctatat ttgtccgcag cacaactgtc 23881 ttttggcagc tcagaaaaaa aatagattag ggcgagccct acgtgcatca gtgccagaag 23941 cactgtatca cacattgtgg cagcgtctac actaaaggaa taaccaaaaa caaacttaat 24001 gaaaactacg ttgtagcggt aattgccaaa gaaattgtga tgagggccga attatttgtg 24061 ctatcaacac gttctctttt atcgctcatg tagtgccaaa atagtaaaag 
ggtgtcctta 24121 gcgttgctct tggttatcca aaggggcgaa gcaagactcc taataagttt tactcgatgt 24181 gtcgtggaag actcagaatc aacaaaacaa aacagtacag aatggcaacc tggtagcgcc 24241 caagcgcagt tcggcgtcaa ccatcaatcc ttgtggggat gccaccaata aaccgaaaca 24301 tctctctttc ctgaatggag gcaccggcat tcattagcag tggcaaccta acacagtaat 24361 cgaaggatgg agatgtcgcc aaccaggcaa ccatccgcta aaactgtaaa ttaaagggga 24421 agagtggatg aacaacggca aagttagaat ttacgaatta tcaaaggaat tgaatttgga 24481 taacaaagag ctattagcaa tttgcgacca gctcaatatt gcggtcaaaa gccatagcag 24541 cacaattaca gaatccgaag cggaacgcat tcgggctcaa gcagaaaaac tagctcagac 24601 aagtgtgacg ccgagaatag gaaatggcac taacagccat agaccaaatt caccacaaga 24661 tggcggacgt aaccgacctg ctgcacctaa taaacaacaa attttggaga ttcgcaaacc 24721 tacagttttg aaaaacccca tctctaacgc cccagaggcg tcggttgcta tcaatacagt 24781 tgcttctgaa gtcaatcctc cttcaccccc taagtccata gctccccctg tctcacccat 24841 gaagccgacg gcaccaattc gatctgtacc ccggaatcag tctgagacca cacttgaac // LOCUS NODE_1218_length_24868_cov_4.962439 24868 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 24868) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24868) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013).
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24868 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(210..653) /locus_tag="DP116_10910" CDS complement(210..653) /locus_tag="DP116_10910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874338.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10910" /translation="MEYQLQVEKDIETLKNDLPTLFQKDISYDIYTQDIYFQDPVNKF KGKLNYRIIFWTLRFHARLFFTTIYFDLHEVYQSAEDIILAKWTVRGVLRVPWKAGVF FNGYSTYKLNKEGLIYEHIDTWDRKPGEILQQFFRKGGDISNSKF" gene complement(817..1722) /locus_tag="DP116_10915" CDS complement(817..1722) /locus_tag="DP116_10915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10915" /translation="MTLGVTVNIPSEYLEKIRAVYPNISFEHLDFNQDGMVNVLVVVN HDLVCRFAKDDWGKEVLSHEVMVLEVVRNYVDLRVPHFEHQEVGFVSYRFIKGEPLSR NTLLKLSEASQARIISQLARFHQQLHSIPNEVLVNAEVPSSGAARSREDWLELYEQVQ ETLFPHLWRHQQTWIHELFAPVITGELDLSYTPVLIDGDKAVYHILFDSVSESISGVI DFGTAGLGDPACDIAVQLGNYGEGIVRRMESDYPMLPEVIDRARFWVGTLELQWAFAG IKYKDISLSLAHIGLAREVQPVGTR" gene complement(1881..3035) /locus_tag="DP116_10920" CDS complement(1881..3035) /locus_tag="DP116_10920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10920" /translation="MARVEIRGVWLTTTDSKVFHSRENIASAMDFLAQTGFNVVFPVV WNQGATMYPSQLMRQTFDTEINPQFVGRDPLAEVVTEARRVGLKIIPWFEYGFASSYN LNGGQLLAKKPEWAARDCEGNLLKKNGFEWLNALDSEVQSFMLNIILEVVKNYDVDGI QGDDRLPALPSEGGYDRGTLERYRQQFGEDPPKNFKDGSWLQWRADILTEFLARLYQE VKAINSNLLISITPNIHDWAYKEYLQDSPTWLQRGLVDIIHPQIYRRDFFTYKSIIDK LVNEQFTDSTLSKLAPGILMKVGSYRISSEDLLQAIEYNRTCGISGEVFFFYEGLREE NNALAEVLRKSPYAQSAAFPSLEDLNQGRVSKKISLSIGQRLLRFLKNLF" gene complement(3043..3855) /locus_tag="DP116_10925" CDS complement(3043..3855) /locus_tag="DP116_10925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748442.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10925" /translation="MTQQLIHHHSLWIERLARFGYISKGIVYAIVGLLAAQAAFSSGG KITDTKGALREIVNQPFGEFLLALVAIGLIGYVILRFVQAINDPENKGTDAKGLVQRV GYVINGLIYAGIALSAVQIILGSNSGNSNSRQDWTARLLSQPFGQWLVGAAGAFVIGL GFYQFYQAFSSQFRRNLNLNELDDSERKLVMGISRFGLVARGIVFCITGWFLIQAATQ SDASQAGGLGEVLQTLAQQPLGPWLLGIVALGLIAYGVYMVIQARYRQVVTR" gene 4065..4931 /locus_tag="DP116_10930" CDS 4065..4931 /locus_tag="DP116_10930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874341.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_10930" /translation="MSVLEGSWQHKYITANGVRLHYVTQGNGPLMVMLHGFPEFWYSW RHQIPEFARNFKVVALDLRGYNDSEKPKNQSAYVMDEFVRDVEGLIKGLGYEKCILVG HDWGGAIAWNFAYSHPQMVERLIVLNLPHPAKFAEGLRTPQQLLRSWYMFFFQLPGIP EFLIQSLDYQLIETAFQGMAVNKSAFTQADINAYKDAAAKRGALTAMLNYYRNIFQQR IVNQNWGVLSVPTLMIWGENDTALGKELTYGTQAYVKDLQIKYIPNCSHWVQQEQPQL VNQYIREFLGEN" gene complement(5004..7193) /locus_tag="DP116_10935" CDS complement(5004..7193) /locus_tag="DP116_10935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872565.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase C14" /protein_id="PRJNA477356:DP116_10935" /translation="MKRRTFLQSFGSIFAVLSAIETGWLTFLSSNYQAFAQSAPRKLA LLVGINQYPQSPALSGCLTDVELQRELLIHRFGFHPSDIISLTNEQATRKAIETTFFE HLVKQAKPDDVVFFHYSGYGTRVKLEMQETEQNALVLFNETDVQESKKVNYLLEETLL LMLRSLATNHVAAVLDTSYYTNSNSLPTNLRIRALPEPEQAILLPEELELQKQLQEKV SSEESVIVLCATSTPKELAREGQLSGFSAGLFTYALTQYLWEATPATTIRVCLSHVGG SIQQLGGTKQQPGLLLDSKNQLSRTPIGDILLPDSTCAEGVVTGVEDDGKTILLWLGG LPAQVLEYYGVNSRLTLITKEGSNIQLVLKSRVGLTAKAQIFSSDGTTSPQVGQLVQE TVRVIPRNIGLTIALNGLERIERVDATSAFATATHLLTVVTGEQPADYVFGRVPEANN KDVAATNSTTVSPSPYGLFSVGGELIPNTAGEAGDAVKVAAQRLKPKLGTLLAAKLWG LTENTGSSRLGIRATLEVISPLTPRSLIQRETVRTQSTETSNKKSFNPELGDIPTVSL GSKIQYRVENKSDRPVYLMLLALNSSRTAIALLPWYKDTQPDSSQVKLLLKDIVVSPG ETLTVPQTTSGFEWVVQAPTSLSETQLIFSTAPFTLSLAALEAAKYPQAEQHRIQPLR NPLEVAQAVLQDLHNASAVKDMNTTATDSYVLDVNHWASLSFVYQVV" gene complement(7299..8486) /gene="sat" /locus_tag="DP116_10940" CDS complement(7299..8486) /gene="sat" /locus_tag="DP116_10940" /EC_number="2.7.7.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318027.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfate adenylyltransferase" /protein_id="PRJNA477356:DP116_10940" /translation="MSYHYDGIAPHGGQLVNRIATPEQKQVFLSKADFLPRVQLDERA VSDLQMIAIGGFSPLTGFLNQEDYNGVVAQMRLANGTVWSIPVTLSVGEQVAASLKEG DLVRLDNPAGQYIGVLELTQKYHYDKTREAVHVYRTNDSKHPGVQVVYNQGSVNLAGD VWLLEREPHPLFPKYQVDPVESRQVFQELGWKTIVGFQTRNPIHRAHEYIQKCALEIV DGLFLHPLVGATKEDDIPADVRMRCYEILMEKYYPRNRVLLAINPSAMRYAGPREAIF HALIRKNYGCTHFIVGRDHAGVGNYYGTYDAQHIFYEFKPEELGIVPMMFEHAFYCTC TQQMATDKTSPCLPEERIHLSGTKVREMLRRGELPPPEFSRPEVAAELARAMRVAVHT YEI" gene complement(9106..9885) /locus_tag="DP116_10945" CDS complement(9106..9885) /locus_tag="DP116_10945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FHA domain-containing protein" /protein_id="PRJNA477356:DP116_10945" /translation="MQNPGRSQTTELSLELFHVQTDTPLELAQNFSVIRIGKPKDQTM PEINVAGLPNANFASRLHAEIHVEKSTYYLVDVGSVNGTYLNNIKLEPTKRYPLKFGD KIDLGHGGKLTFIFLQKQEQELATHSDNTMLNHSPTVIQVELVANTNQSRMEIFTKLI GLVLMIAGIFILANTQIADSVRIPGVLSCAVGVVVLICRHAFRHLAWSLMALGITVMF FNGNAFVPASLLAILASCALLVVGCQLFTSGKFLNFSLRKN" gene complement(10237..13209) /locus_tag="DP116_10950" CDS complement(10237..13209) /locus_tag="DP116_10950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129665.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pyruvate phosphate dikinase PEP/pyruvate-binding protein" /protein_id="PRJNA477356:DP116_10950" /translation="MLELWGIVVVFIVCPLLGALPIIAWLTQATTGRQLAQIGTGNIS VSSAFYHGGTLVGILAVFSEAIKGIAAVLLARIFFGEGSAWELIALIGLVLGRFLAGK GAGTTNAVWGFSVHDPLAAIFVLLLASVSFTVVRSRQLAKFGILVLFPAIVALLHPEA PARILAAAALAGLLGWIYKQIPDDLNLPVEDAQSDSQAAFKFFRGNQSILTLDDELEA AFVGQKAARLAEVKRWGYPVPKGWIIGPYQNPEPLIEMLQPSDLLPFVVRSSAIGEDT ELASAAGQYETVLNVTSKQELRDAIARCRASYNSERAIQYRRDRTVTKPVVSKQDSPF QKNAAKSQTDFGQNTGERAAFDAMAVLIQQQVQGVYSGVAFTRDPITKQGDAIIIEAL PGNATGVVSGRVTPEQYRAYVVETDNFTSIQLEGEGNVPPILIKQVAYLSRHLEERYH GIPQDIEWTYDGQTLWTLQTRPITTLLPIWTRKIAAEVIPGLIHPLTWSINRPLTCGV WGEIFALVLGERSVGLDFNQTATLHYSISYFNATLIGEIFRRMGLPPESLEFLTRGAK MSQPPIESTLRNFPGLVRLLLREFSLTLDFNRDNRKRFVPALSQLAKEPVENFDPARL LARINFILELLRRATYYSILAPLSAALRQAIFRVKDGDLDNSRTPEVAALRAIQDLAI NARQILPEFNPDKVFEQLRRTPGGQEIVNKFDKLLECYGYLSEVGTDIAVPTWKEESE AVKQLFVQFMQTNEPLKHQKCRRKKKRGFVQKRVHLKGRVTEVYSRLLAELRWCFVAL EQVWLKSGLLKQPGDIFFLEFEEVRRLAESFEPAFIQQLQGLVQLRRTQFAQDNQRNP VPLLVYGNNPPPFSLQTAIPSPNRVLQGIPASPGQAEGRIKVLRNLQSVADIDRDVIL VVPYTDSGWAPLLVRAGGLIAEAGGRLSHGAIIAREYGIPAIMDVTNAMSLLQDGQKV RIDGYRGIVEISDDLRLQ" gene 14059..15030 /locus_tag="DP116_10955" CDS 14059..15030 /locus_tag="DP116_10955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SGNH/GDSL hydrolase family protein" /protein_id="PRJNA477356:DP116_10955" /translation="MKVVLVISLVVVIGLFVIIELGLRLLFGFGNPLTYIRDEQIGYL LSPNQRTRRFGHRITINQYSMRSEPIAQTPSPSTLRVLLLGDSIANGGWWTDQDNTIS SLISRLLTSAVSSNSQQVEVLNASANSWGPRNELAYLEKFGSFDAQALVLLINTDDLF ATAPTSLPVGCDRNYPDKKPPLALAEIFQRYIFKPTPIPGIEAVQNEGGDRVGFNLEA IGKIQALALQTNTQFLLVMTPLLREIGEPGPRDYEVIARNRLSEFCQAQQITYLDFLP IFNSNQDPKALYQDHIHLNLQGNQLVSEVISRSALCAAAFAQRPLRG" gene 15186..15881 /locus_tag="DP116_10960" CDS 15186..15881 /locus_tag="DP116_10960" /inference="COORDINATES: protein motif:HMM:PF13599.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10960" /translation="MVKESFQELLNLSQRVDEAQTHNLSAENFAGEDLREANLSGTNL FNANLSGANLNNAKLDTTNLSTANLASSDLSGANLSGANLNFANLSDANLSGANLSGA NLSGANLSFANLSKADLNGADLSGANLRRADLKNSDLKNVNLSLANLDSANLKGANLN DAYLGGIQLTGANLTGANLTGANLNVSNLNGANLSNADLRSANLSNASLKLTNLYGAK INDSTKFDDQSLD" gene 16234..17817 /locus_tag="DP116_10965" CDS 16234..17817 /locus_tag="DP116_10965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316992.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_10965" /translation="MPEVKVPTITSTYDIICFGDEVPGILALVCAAREYNRQKNQYPR TLLLLKGNSKLGLGGHLVRGGLSYLDRSVVPAAIRQSRNLDTFGDPAAIYKEFLKRAG VAFIALDPVKADSALRAMLQEVRADIISDIEIKSVIKEGQKITAIELTKGETYAGKQF IDCTVNAELAQAAGVKKLKGFETFGLPDSELAVTLVFETQGLSIEKLKNVEFQYLKRF TNKADTEAQKWLNVAAGGDPARADELRKDLVDSAGKLKTMYAGQDYIDVRSKALSIAY HAFRGTPVSLQSSGTMLDNGNIAILSQGRLSWNALLYKVNADQAEALTRAKSKPTSDM LREMVFVRKWFESIGASQVKSVEELYIRHAGNITGAVDPLSGSEMLAGGVPQSEALGT FGYHFDIRGGINGLDERAAGKGFNDFAFLNPPLFNIGIRHALVKNVPNLAVISPGSGF VGYASSAGRIVEFNCGVGQGVGIAAGIAIAQGRNLAEISNAQVRTILAQTGKLSQIYG TDNPLLASKLGSFENQMIA" gene 17989..19278 /locus_tag="DP116_10970" CDS 17989..19278 /locus_tag="DP116_10970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874044.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_10970" /translation="MYLTQKNQIRELNKPEFVALRELCRLSKNLYNVGLYTVRQYFFQ ERKHLRYESAYHLCKVNENYKLLNTDIAQQTLKVVDRTFKSFYGLISAVKNGSYQQKV KLPNYLSKDGYFLLIIPRIVVKDGKFRVPMSNAFRKQYGEVWIPFPKRLNINQIKEVR IHPKYNARYFEVEFISEVEPEPVEVKSDRAISIDLGVDNLAACVDTNGASFLVDGKPI KSINQWFNKRNAQLQSIKDKQNINGITNQQVKLTSLRNNQVRDYLNKTARFIVNHCIT NGIANLIVGYNPGIKQEINIGGRNNQNFVQIPFHSLRSKLKAMCERYGLNYQEQEESY TSKASAIDGDEIPVYADNPKEYQFSGIRIKRGLYRTKDGHLVNADLNGSLNIGRKSKH DGFTGVSRAALTQPRRINLLKLEKWRATAHTEFGTTS" gene 19347..20222 /locus_tag="DP116_10975" CDS 19347..20222 /locus_tag="DP116_10975" /inference="COORDINATES: protein motif:HMM:PF08450.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SMP-30/gluconolactonase/LRE family protein" /protein_id="PRJNA477356:DP116_10975" /translation="MSHTQETGIPGVLAKNTKITKLADGMSFCEGPVWDKKHHRLIFS DTGADEHRCWSDAQGLQTFRKPSYQPNGNVFDLQGRLLTCEHESRAISMTNIDGQRTI VVDNYAGKHFNSPNDLEVKSDQTIWFSDPTYGLGNRTKEMDFQGVFRFDPKANKLTLI ADDLSMPNGIAFSPDEKKLYVGDSAEDKRQIRAFTVNPDGTVSGGEVLCTTENPIWGP DGVDVDANGNIYTGCGDGVNIFSPAGLLLGKILTEAPISNFAFGGNDGKMLFMTSEHA LYRVNLLVAGAVKRW" gene 21505..21687 /locus_tag="DP116_10980" CDS 21505..21687 /locus_tag="DP116_10980" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10980" /translation="MRIYWKPIICEVILEILLELAPGSFDMMATISEYLVESTWEASR TSIQLVLGMNSQVAFH" gene complement(21795..21923) /locus_tag="DP116_10985" CDS complement(21795..21923) /locus_tag="DP116_10985" /inference="COORDINATES: protein motif:HMM:PF16536.3" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_10985" /translation="MREFALGVEALERFVNRVPLRRVHECVFGILAMESEPVDSRL" gene complement(21995..22429) /locus_tag="DP116_10990" /pseudo CDS complement(21995..22429) /locus_tag="DP116_10990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997873.1" /note="internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="polynucleotide kinase-phosphatase" gene complement(22462..23850) /locus_tag="DP116_10995" CDS complement(22462..23850) /locus_tag="DP116_10995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318409.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3' terminal RNA ribose 2'-O-methyltransferase Hen1" /protein_id="PRJNA477356:DP116_10995" /translation="MLLSITTTYYPATDLGYLLHKHPDRCQSFPLSFGQAHIFYPEAN EQRCTATLLLDIDPVKLVRGRSANLEQYVNDRPYVASSFLSVAIAQVFSTALAGRCKD KPELVQTPVPLVAKLSVLPCRGGEGFLRQLFEPLGYTVTAQNHALDETFPDWGQSQYY TVELQHTLPLAELLSHLYVLIPVLDDDKHYWVGDEEIEKLLRHGEGWLTEHPAREAIT RRYLKRLHRLTRTALAQLAEEDELDPDSTQENHAEEEAAVEKPISLNQQRLNAVVTTL KESGAKRVIDLGCAQGNLLRTLVKDSFFEQVTGVDVSYRSLEVAQERLDRLHLPRTQL ERLQLIQGALTYQDKRFTGYDAATVIEVIEHLDLPRFAVFERVLFEFAQPKTVIVTTP NIEYNVRFEKLPAGKLRHRDHRFEWTRQEFQTWAKQVAERFGYAVKFQAIGEEDPEVG SPTQMGVFSHQI" BASE COUNT 7276 a 5341 c 5178 g 7073 t ORIGIN 1 gattatggtg ggggcgagga gccgaattaa tcaattcaaa aaaagagtta ttattttgat 61 agcgcagcgt gcccgtaggg catactttga attcaaaaaa ggtcaataga taactgaaac 121 cactactata gcaatcctaa atcattacca gtcgcgacaa ttcgacgtag gaattgcatc 181 cttctttgta ggttgcctaa agggcatatt taaaattttg aattgctgat gtctcctcct 241 ttgcggaaga actgctgcag aatttctcct ggcttgcgat cccaggtatc aatatgctcg 301 taaattaaac cctctttatt aagtttgtaa gttgagtagc cattgaaaaa cactccagct 361 ttccaaggaa cgcgcaaaac gcccctgact gtccacttcg ccaaaattat gtcttcggct 421 gactgataga cttcgtgtaa atcaaagtag attgtcgtaa aaaataatcg agcgtgaaat 481 cgcaaagtcc aaaagataat gcgatagttc aattttcctt taaatttatt gactgggtct 541 tgaaagtaaa tatcttgcgt gtagatgtcg taagaaatat ccttttgaaa aagtgttggt 601 aaatcgtttt taagagtttc aatatctttt tccacctgca attgatattc catgcgatcg 661 ctccgaaaaa atgagttttg ttgctgttca agtcatctgt atgcccctat ggaaaaaact 721 ttctccccct gcaacgcctg ctttgggagc attacaagcg tgtcattcgt ttctagacac 781 aagggtggat atgcccattg gctatggcgg gttaatttag cgtgtaccta 
caggttgaac 841 ctctctcgct aaaccaatat gtgctaacga tagtgagata tccttatact ttatcccagc 901 gaaagcccac tgaagctcaa gtgttcccac ccagaagcgt gcccggtcga taacttccgg 961 taacattgga tagtcgcttt ccatacgtcg cacaattcct tcaccgtaat ttccgagttg 1021 cacagcaata tcgcaagctg ggtcgcctaa accagcagta ccaaaatcga taacaccact 1081 aatgctttct gacacagagt caaacaggat atgatacaca gccttatcgc catcaataag 1141 tacgggcgtg taactcaggt caagttcgcc tgtaattacg ggcgcaaaca attcatgtat 1201 ccaagtttgc tgatgacgcc acaagtgtgg aaaaagagtt tcctgaactt gctcgtacaa 1261 ctccaaccaa tcttcacggg aacgcgcagc accagaggaa ggcacctcag cattaactaa 1321 aacttcattt gggatgctgt gcaattgttg gtgaaatctg gctagttggg agatgatacg 1381 cgcttgtgat gcttcactca gcttcagcag agtgttacgc gaaagcggtt cacctttaat 1441 aaatctgtag ctgacaaagc caacttcttg gtgttcaaag tgtggaaccc gcaaatcaac 1501 atagtttcgc accacttcca acaccattac ttcatgggaa agaacctctt ttccccagtc 1561 atctttagca aagcgacata caaggtcatg attaacaact accaacacgt taaccatgcc 1621 atcttgatta aagtccagat gctcaaaaga tatattaggg tagacagcac gaattttttc 1681 gagatactca gaaggaatat tcacagtaac tcctaatgtc attacaacaa acaatgagaa 1741 ttagtcggac atttagcccg attgcagcga cttaactaat ttgcgtgttt gttgtatcaa 1801 cgtatcactg tgatttaact ccctagaaca tcgctacaga taagtctatc gccattttac 1861 tactggctgc tgcttcttga ttaaaaaaga ttttttaaaa acctcagtaa ccgctgccca 1921 atagataatg atattttctt gcttactcta ccttgattca aatcttctag agaaggaaat 1981 gcagcagatt gagcataagg actttttcgc aacacctcag ctaaggcgtt gttttcttct 2041 cgcaaacctt cgtagaaaaa gaaaacttcc ccagaaatcc cacaagtacg gttatactca 2101 atagcttgca acaggtcctc tgaacttatc cggtaagaac caactttcat cagaattcct 2161 ggtgctaact tcgagagagt agaatccgta aattgctcgt ttactaattt atcaataata 2221 cttttataag tgaagaaatc acggcgatag atttggggat ggataatatc taccagccct 2281 cgttgcaacc aagtaggaga gtcttgcagg tattctttat atgcccagtc atgaatattc 2341 ggagttattg atattaacag gttagaatta atcgctttga cttcttgata gaggcgggca 2401 agaaattcag tgagaatatc agcacgccac tgcaaccagc ttccatcctt aaagtttttt 2461 ggtggatcct caccaaactg ctgacgatag cgttccagag ttcccctgtc ataaccacct 
2521 tcactgggca atgcaggcaa gcggtcatca ccttggatgc catctacatc gtaatttttt 2581 acgacttcca atattatatt taacatgaaa ctttgaactt ctgaatcaag ggcattcagc 2641 cactcaaagc catttttttt cagcaagtta ccctcacaat cacgggcagc ccattcaggt 2701 ttttttgcca gcagttgtcc accatttaaa ttgtaggaac tggcgaatcc atattcaaac 2761 caaggaatga tttttaaccc aactcgccgc gcctctgtaa cgacttccgc caagggatcg 2821 cgtcctacaa actgtggatt gatttctgta tcaaaggttt gacgcatcaa ttgacttgga 2881 tacattgtcg ccccctgatt ccagacaaca gggaagacaa cattaaatcc tgtctgggcg 2941 aggaaatcca tagctgacgc aatgttttct cgggaatgga aaactttgct atctgttgta 3001 gttagccaga caccgcgtat ttctactctt gccatgcact tttcaacgtg tgactacctg 3061 acgataccgt gcctgaatca ccatgtaaac accataggca attaaaccca gcgccacgat 3121 acccaaaagc caaggaccta atggttgttg tgctagcgtc tgtaacacct cacctaaacc 3181 tcctgcttga ctagcgtcag actgagttgc tgcttgaatc aaaaaccagc cagtaataca 3241 gaagactatc cctcgggcta ctaaaccaaa tctagaaata cccattacca acttgcgttc 3301 tgaatcgtcc aattcattca agtttaagtt tctccgaaac tgactactaa aggcttgata 3361 aaactggtag aaacccaaac caataacaaa cgcccccgca gcaccaacta accattgacc 3421 aaagggttga gaaagtaatc ttgctgtcca atcttgtcta gagttactgt taccactgtt 3481 agaaccaaga ataatttgca cagcactcaa agcaatacca gcatagatca aaccattgat 3541 gacataacca actcgctgta ccaaaccttt agcatctgtg cctttatttt ctggatcgtt 3601 tattgcctgt acaaaacgaa gaattacgta tccaatgagt ccaattgcca ctaaagctag 3661 tagaaactct ccaaatggct gattgacaat ttcccggaga gcgcctttag tatcagtgat 3721 tttaccacca ctgctaaatg ccgcctgcgc tgccaatagt ccaacgatcg cgtaaactat 3781 tcctttagat atataaccaa atctagctag tctttcaatc cataaagaat gatgatgtat 3841 tagttgctgt gtcataagac ttggatttca tcatggctaa gtatttagca gattttatcc 3901 caaacataca tctaccaaaa gaggaattat caattcttaa attaagaatg gttgaagact 3961 cattttattt tcaagttttt atttcatatg gataattttc agatattagt tattattaat 4021 acttgtattg agagtcccct tatctaacta gaagagtata aattatgtct gtactagaag 4081 gttcatggca gcacaagtat atcactgcaa acggcgttag attgcattac gttactcaag 4141 ggaatggtcc tctgatggta atgttgcatg gatttcctga gttttggtac tcgtggcggc 4201 
atcaaatacc agaatttgcc cgaaatttta aagttgttgc ccttgacttg cgtggctata 4261 acgatagcga aaagcctaag aatcagtcag cttatgtgat ggatgaattt gtcagagatg 4321 ttgagggtct tattaaggga ttgggatacg aaaaatgtat tttggtggga catgattggg 4381 gtggtgcgat cgcctggaat ttcgcttact ctcatcccca aatggtagag cgtttaattg 4441 tacttaatct gcctcatccc gctaaatttg ccgaaggctt acgcactcct caacagttgc 4501 tacgtagctg gtatatgttc ttttttcaat taccaggaat accggaattc ctcatacaat 4561 ctttggatta tcaactcatt gaaactgctt ttcaaggtat ggcagttaac aaaagcgctt 4621 tcacgcaagc ggatatcaac gcttacaaag atgctgcagc caaacgtggt gccctgacag 4681 caatgttgaa ctattaccgc aacatttttc aacagagaat agttaatcaa aattggggcg 4741 ttttgtcagt accaacgttg atgatttggg gagaaaacga tactgccctt ggcaaggaat 4801 taacctacgg cactcaagct tacgtgaaag atcttcaaat caaatatatt cctaactgta 4861 gtcattgggt acagcaggaa cagcctcagt tagtcaatca gtacatacga gagtttttgg 4921 gagaaaatta gttcatagaa ctaaactcaa ctctgtcagg cagacgaaga aattaatgct 4981 agcctgaaag ctagcaattt ttgtcaaacc acctgataaa caaaactaag gcttgcccaa 5041 tgattcacat ccaacacata ggaatcagtc gctgtagtgt tcatatcctt gactgcactg 5101 gcattatgca aatcttgtaa aacagcttgt gcgacttcta gggggtttct caagggctga 5161 atgcggtgtt gttctgcttg gggatactta gcagcttcta aagcagcaag ggacagggta 5221 aaaggtgcag tactaaaaat aagttgggtt tcacttaagg aagttggtgc ttgcacaacc 5281 cattcaaaac ctgaagtcgt ttgtggtact gtcaaagtct cccctgggga gacgactata 5341 tctttgagaa gtagtttgac ttgcgatgag tctggttgag tatccttgta ccagggaagc 5401 agggcgatcg cagttctact gctatttaat gctagtagca tcagataaac agggcgatcg 5461 ctcttgtttt ccactcgata ctgtatctta ctgccaaggc tgacagtggg aatatcacca 5521 agttcaggat tgaatgactt cttgtttgat gtttcagtgc tttgagttcg cacagtttct 5581 cgttgtatga gggaacgagg tgttagaggg ctgattacct ctaatgttgc cctaatcccc 5641 aagcgagaag atcctgtatt ttctgttagt ccccataatt ttgccgccag taaagttcct 5701 aattttggtt ttaaccgctg tgcagccacc ttcactgcat ctcctgcttc tccagcagtg 5761 tttggaatca actcaccacc aacagaaaat aaaccgtaag gactgggaga aacggttgta 5821 gagtttgtcg ctgctacatc tttgttgttc gcttcaggta ctctcccaaa cacataatct 5881 gctggttgtt 
ctccggtgac tacagttaat aaatgggtgg cggttgcaaa agcacttgtt 5941 gcatctactc gttcaattct ttcgagtcca ttcaaggcga ttgttaaacc aatatttcgg 6001 ggtatcactc gcacagtttc ctgaacaagt tgtccgactt ggggagaagt tgtaccatca 6061 gaactaaaaa tttgcgcttt tgctgttaac ccaacacgcg atttcaaaac cagctgtata 6121 ttcgatcctt cttttgtaat gagcgtgagt cgtgaattga ctccgtagta ttctagtact 6181 tgtgcaggta atcctcccag ccacagtaaa attgttttcc catcatcctc aactcctgta 6241 accactcctt ctgcacaagt gctgtctggg agcaaaatat ctccaatcgg tgtacgagaa 6301 agctgatttt tgctatctag caataaacct ggctgctgtt tggtaccacc caactgctga 6361 atagaacctc ccacatgaga caaacagact cgaattgttg ttgctggtgt ggcttcccat 6421 aagtactgag ttaaggcata ggtaaataat ccagcactaa aaccagacaa ctgtccctcc 6481 cttgccaatt ccttgggggt agaggttgcg caaagcacaa tcactgactc ttcactagag 6541 actttttctt gtagctgttt ttgcaactca agttcttctg gtaacaggat agcttgttct 6601 ggttctggga gggcgcgaat tcgcaggttt gttggtaaag agttggagtt ggtgtagtag 6661 cttgtatcta acactgctgc tacatggttt gtggcaagcg atcgcaacat gagcaacagt 6721 gtttcttcta ataaatagtt gactttttta ctttcttgta catctgtctc attaaataga 6781 acaagagcgt tttgctctgt ttcttgcatt tccaatttga cacgagtgcc atagccacta 6841 tagtggaaga aaacgacatc atcaggttta gcttgcttaa ccagatgctc gaaaaaagtc 6901 gtctcaatgg cttttcttgt ggcttgttca ttagttaacg agataatatc agacggatgg 6961 aaaccaaagc ggtgaatcaa aagttccctt tgtagttcca catcagttag acaaccgctc 7021 aaagctggag attgtgggta ctgattgatg ccaactaata aagccaactt acgcggtgca 7081 gattgtgcaa aagcttgata gttgctgctc aagaatgtca accacccagt ttcaatcgca 7141 ctcaacactg cgaatataga gccaaaactt tgtaaaaacg ttcgccgctt cataagttaa 7201 attaataatc agccctcagt ctatcaggta tagcttaaag ggctgaggtc tgggtgtaaa 7261 gcaacctttt acactttact atctgccttt tgtgtgcttt aaatctcgta agtatgaact 7321 gctactcgca ttgcccgtgc caattctgcc gccacctctg gacgagaaaa ttctggtgga 7381 ggtaactcgc ctcggcgcag catttcccgc acttttgttc ctgaaagatg aatgcgctct 7441 tctggaagac aaggacttgt tttatctgtc gccatttgct gggtacacgt gcagtaaaag 7501 gcgtgttcaa acatcatcgg tacaatgccc agttcttcag gcttaaactc ataaaagatg 7561 tgctgagcat cataagtgcc 
gtagtaattc cctacgccag catgatcccg cccgacgata 7621 aagtgagtgc aaccgtagtt cttgcgaatc aaagcatgga aaattgcctc ccgaggacca 7681 gcataacgca tagccgaggg attaattgct aaaaggacgc ggttacgtgg gtagtatttt 7741 tccatcaaaa tttcgtaaca gcgcattcgc acatctgcgg gaatgtcatc ttctttcgtc 7801 gccccaacca atgggtgcaa aaatagacca tccacaattt ctagggcgca cttttggata 7861 tattcatgag cgcggtggat cggattgcga gtttggaacc caacaatagt tttccagccc 7921 agttcctgaa acacctgccg agactcgacg ggatcaactt ggtattttgg aaacagagga 7981 tgaggctcgc gttctaataa ccacacatca cctgctaaat ttacagaacc ttgattataa 8041 accacctgta caccaggatg ctttgaatca ttggttcgat aaacatgaac cgcttcacga 8101 gtcttgtcgt agtgatattt ttgtgttagc tctaatacgc cgatatattg accagcggga 8161 ttatccaagc ggactaaatc accttctttt aaagaggcgg cgacttgctc accaaccgaa 8221 agtgtaacag gaattgacca aacagtaccg ttggcgagcc gcatttgagc aacaacgccg 8281 ttataatcct cctggttcag aaaaccagtt agtggactaa aaccaccaat tgcaatcatt 8341 tgcaaatctg atactgctcg ttcatcaagt tgaactcgcg gtagaaagtc ggcttttgag 8401 agaaatactt gtttttgctc aggtgtagcg atgcgattca ccaattgccc accgtgaggg 8461 gctatgccat cgtaatgata actcaaggct gtcctttgtt tgcgataact acattttttc 8521 ttaatctatg tattaactta tcattttccg tttcgcagat tcttaaaaaa gtttctgaaa 8581 ttggtgtact attttattta gatcagctta ttagcggcat acgtaaaaaa tagggaacag 8641 agaacaggga acagagaaca gggaacagag aacagggaac agggaacagg gaacagggaa 8701 cagggaacag ggaacaggga actcttaacg cttaactctt aacagacaac tattaacgct 8761 taactcttaa tagagaagaa ggaagaaaag tgtacctagc tgagaaaaaa tttttggagg 8821 agtcctagtt gagactcaga agtagcaagg aaattttgat gagcgttaag cgaagctcta 8881 ccgaaggtaa tcgcttctcc ctattctgac taaaagagag tcaatcccat atcatttttt 8941 gagaccactg ctgtcggatg taaaggcgat tcgataaact atattattat ggttagcaaa 9001 aatgtttatc gcagacagag aaataatttt atgtattgac gaaacggcac tttccaaaaa 9061 agggaagttt acagattatg ttgcacgtca ataactattt ctctattaat tctttcgcaa 9121 actaaagttc aaaaatttcc cagacgtaaa tagttgacat ccaacgacta acaaagcaca 9181 ggatgcgagg atagccaaaa gagatgctgg tacaaaggca ttgccattaa aaaacatgac 9241 tgtaattcct aatgccatca aagaccatgc 
taaatgacgg aatgcatgac gacagataag 9301 gactacaacc cctactgcac atgaaagtac acctggaatg cgcacagaat cagcaatctg 9361 agtattcgct aaaataaaaa ttcctgctat catcaatacc aagccaatga gcttagtaaa 9421 aatctccatt cgagattggt ttgtgttcgc aaccaattca acctgaatca ctgtgggtga 9481 atgattcagc atggtattat cagagtgagt tgctaattct tgttcttgtt tctgtagaaa 9541 gataaatgtg agcttacccc catgacctag gtctatctta tctccaaact taagtggata 9601 gcgctttgtt ggctctagct tgatattatt gagatatgta ccattgacgc tgccaacatc 9661 aacgagatag taagtacttt tttcgacatg aatctctgca tggagtcgag aagcaaaatt 9721 ggcatttggt aagcctgcaa cattaatctc aggcatagtc tgatctttgg gcttgccaat 9781 gcgaataaca gaaaaatttt gcgccaactc aagaggagtg tcggtttgaa catgaaaaag 9841 ttctaaactc agctctgttg tctgtgatct gcctgggttt tgcatgcctt atctttaggg 9901 agcaattttt caatcaatta caattacctt gtttggtcgt taagatgcgt caggtgataa 9961 ctctcgattt attgtcttaa ttttaatact tatatgaact gtacaatatt atattcgctc 10021 aggaacactc tgccgtcttg tagacgatag attcataaca attttttaag aaccactgta 10081 gaaaacggga ataatttaca ctttctagca ccacattaaa taaatttaat atggcaaaag 10141 tccctgcctt aggcggctgc agtgtcaatt ttccactagc aatctcaaat aggtgagaac 10201 caattctttc gccaagtcat tcaggggaga taaaccttat tgtagtctta agtcgtcgga 10261 tatttccaca atacctctat acccatcaat ccgcactttc tgaccatctt gcaataatga 10321 catagcattt gtcacatcca tgatagcagg aataccatac tcgcgagcaa tgattgcccc 10381 gtgggaaagt cgtccacccg cttcagcaat caaccctcca gccctaacga gtagaggtgc 10441 ccaaccggaa tctgtataag gtacaaccag gatgacgtct cggtcaatat ccgcaacact 10501 ctgtaaattc cgcaacacct taatccgtcc ttctgcttgt ccagggctgg caggaatgcc 10561 ttgtaaaaca cggttaggag aaggaattgc agtttgcaaa gaaaatgggg gaggattatt 10621 gccgtaaact aaaaggggta ctggatttcg ctgattatct tgtgcaaatt gtgtgcgcct 10681 cagttgtact aaaccctgca actgttggat aaaagcaggt tcaaaacttt cagcaagacg 10741 ccgtacttcc tcaaactcta agaaaaagat atctcctggt tgtttaagta agcccgactt 10801 taaccaaacc tgttccaacg ctacaaaaca ccagcgcaac tcagctaaaa gtcgagaata 10861 tacctctgtg actcgccctt taagatgcac acgcttttgg acaaaccctc tcttcttttt 10921 ccttctacac ttttggtgtt 
taagcggttc atttgtctgc ataaactgta caaacaactg 10981 cttaaccgcc tctgattctt ctttccaagt gggaacagca atatcagttc ccacttcgct 11041 caaataaccg taacactcaa gcaatttgtc gaacttgttt acaatttctt gccctccggg 11101 ggttcgtctt aactgctcaa acactttatc tgggttaaac tcaggtaaga tctgcctagc 11161 atttatggct aaatcttgaa tggctcgcaa cgccgctacc tctggcgtcc gactgttatc 11221 aagatcccca tccttcaccc gaaatatcgc ttgtcgcaac gccgcactca acggagctaa 11281 tatactgtag tacgttgcac gacgaagtaa ctccagaata aagttaattc ttgccaacag 11341 ccttgctgga tcgaagtttt ctacgggttc cttagctaac tgagacaaag ccggaacaaa 11401 ccgcttgcga ttatccctgt tgaaatccag tgtgagagaa aactcccgca acagcaaacg 11461 cactaaacca ggaaaattcc gcaacgtgct ttctatagga ggctgactca tttttgctcc 11521 cctagttaaa aactccaagc tttcaggggg taaacccatg cggcggaaaa tttctcctat 11581 tagggtggca ttaaagtaag atatagaata gtgtaatgtc gctgtttggt taaaatccaa 11641 cccgacagag cgttcaccca acaccaaagc aaaaatttct ccccagacgc cacaagttaa 11701 aggacgatta atcgaccaag ttaatgggtg aatcagtcca ggaatcacct ccgcagcaat 11761 tttgcgcgtc catataggta gtaaggtggt tatgggtcga gtttgcaatg tccaaagagt 11821 ttgaccatcg tatgtccact caatatcttg aggaataccg tggtaacgtt cttccagatg 11881 tcgagataga tatgcgactt gcttgattaa tattggtggt acatttccct caccctctaa 11941 ttgaatagaa gtgaaattat ctgtctcaac tacataagcg cgatactgtt caggcgttac 12001 tcgtccagaa acgacacccg tggcgtttcc tggcaaagct tcaatgatga tcgcatctcc 12061 ttgtttggta atgggatcac gagtaaaagc aacgccagaa tagacacctt ggacttgttg 12121 ttgaatcaac accgccattg catcaaaggc ggcgcgttca cccgtatttt gcccaaagtc 12181 cgtctgcgac tttgctgcgt tcttctggaa tggagaatcc tgttttgaaa caacaggctt 12241 ggtaacagtg cgatcgcgtc gatactgtat tgcccgttca ctattgtatg aggctcgaca 12301 gcgggcgatc gcatctcgca actcctgttt actcgtcaca ttcaaaaccg tttcatactg 12361 tcctgcagcg gaagcaagtt ctgtatcctc tccaattgct gaagaacgta ccacaaacgg 12421 taataaatct gaaggttgaa gcatttctat caacggttca gggttctggt acggaccaat 12481 gatccatccc tttggaactg gataacccca tcgtttgact tcggctaatc gagctgcctt 12541 ttgtcccaca aaagccgcct ccaactcatc atctaaagtg agaatgcttt gattcccgcg 12601 
aaaaaattta aacgctgctt gtgagtcgct ttgtgcgtct tcaacaggaa gattcaaatc 12661 atcaggaatt tgtttgtaaa tccagccaag taacccagca agagcagccg cagccaagat 12721 tcttgctggg gcttcaggat gcaagagggc tacaattgca ggaaatagaa ctaaaattcc 12781 aaattttgcc aattgcctag agcgcacaac ggtaaaactt acgctggcaa gcaataacac 12841 aaagattgcc gcaagtggat catgtacgct aaatccccac acagcgttag tcgtgccagc 12901 tcctttaccc gccaagaatc tacctaatac taagccaatc agagcaatca attcccaagc 12961 tgaaccctct ccaaagaaaa tacgagccag caagactgca gcaatacctt tgatcgcttc 13021 tgaaaaaaca gccagaattc caaccagtgt accgccgtgg taaaaagcag atgaaacact 13081 aatattgcct gtaccaattt gtgctaactg ccgccctgtt gtagcttgag tgagccaagc 13141 tatgattggt agcgcaccta acagcgggca gacaatgaaa acgactacaa taccccaaag 13201 ttctaacatt tttgcttttg gattttagat gtgttttcaa tctaaaatct caaaccttaa 13261 atttaataat tatttttctt cgtcattaca ccataatggt acacacttgg ctgtgatgga 13321 aatcgtcttt gccaatgtct tttaccactt gctcaattaa actttctctg cttttgacta 13381 tttaatatga tttatagttc cagtcatacc tgagcagagc atttacgccc atacagaagc 13441 gcaaaatgat actcaggatg acttgttatg atacatctag ctccctgaag ttgggagttc 13501 cattatgtaa tttagtgaac tgcgtaggct gcttgcagtg cccaaatgat cagcgccaaa 13561 attgacccta gaagcagcac ggaagagatg gtgactactt ttggggaatc agcgccctca 13621 aacttcataa ttcctcggtt taagtcggac atagctttgt aatcctcctt ctgttacagt 13681 gctaacttgc ccgtttcgat tctagtcctt atgtatgcat ctgatgcaaa gtttatatta 13741 caattttgtt taagcttcgc aacagtttca tcgttataaa tacttatatt aatagagaca 13801 tggtcttgtt ttagacattt tgattactga attcctcaca gaaaattgaa taagttagaa 13861 cgtgacaatt gtcaattttt tcaaaaaaac tactgatagc agtttgtatt tggttgcaat 13921 acacgctcga attaagtgaa taggcatgag caatgcccac cctacagcag taccaagcca 13981 gcgttgtgta ggaacatctc cattgactgg cattacgagt gtaaaatata cggaacttag 14041 aacaaacgga cttaacaagt gaaagtggtg ctggttatca gtttagtcgt cgtgatcgga 14101 ttgtttgtga tcatagagtt gggattaaga ttgctgttcg ggtttggcaa tcctctaaca 14161 tacattcggg atgaacagat tggttatctg ttaagtccta accaacgcac ccgtcgcttt 14221 ggtcatcgta ttacaattaa tcagtattcg atgcggagtg aacccatcgc 
ccagacgcct 14281 tcaccatcta ccctgagggt gctactttta ggagattcta ttgctaacgg tggttggtgg 14341 acagaccaag ataatacaat ttctagttta atatcgcgct tgttaacttc agcagtgagt 14401 agtaattctc aacaagtaga agtgctaaat gcttcagcca actcttgggg accaagaaat 14461 gaattagcct atttggaaaa gtttggtagt tttgatgctc aagcacttgt gctactcatt 14521 aacaccgatg atttgtttgc taccgctccc acatctttac ccgtaggatg cgatcgcaat 14581 taccccgaca agaaaccacc tttagcatta gcagaaatct tccaacgtta tatattcaag 14641 ccaaccccaa taccaggaat agaagcagta caaaatgaag ggggcgatag agttggtttt 14701 aatctagagg cgattggcaa aatccaagca cttgcgcttc aaaccaatac tcaatttctt 14761 ctcgtcatga ctcccttact ccgagaaatt ggtgaacctg gaccccgcga ttacgaagtt 14821 atagcacgta accgcctgag tgaattttgt caagcccaac aaatcactta tttagatttt 14881 ttaccaatat ttaactcaaa tcaagacccc aaagccttgt atcaagacca catccacctt 14941 aacttgcagg gaaatcagtt ggtgagtgag gtgatttcgc gatctgcgct ctgcgcagca 15001 gcgtttgcgc agcgccccct taggggctag ctatcgctac ttgagttact ggggcaaaaa 15061 taactaatac tttacaattt acaatttttt gtaaagaaat aaaaacaacg aaagatagat 15121 atgacgccaa gtaaatatct cgtaaagtta aggaggagtg tatctacgtc ggtcaaagtt 15181 gcccaatggt taaagaatct ttccaagaac ttctgaactt gagccagcgc gttgacgaag 15241 cacaaactca taacttgtct gctgaaaact ttgcaggaga agaccttaga gaggcaaacc 15301 ttagcggtac caacctattc aatgctaacc ttagtggggc aaatctcaat aatgccaagc 15361 ttgacactac taatcttagt actgccaacc ttgctagttc tgacctcagt ggtgcaaatc 15421 tcagtggcgc aaatctcaac tttgctaacc tcagtgatgc aaacctcagt ggtgcaaatc 15481 tcagtggtgc aaacctcagt ggtgcaaacc tcagctttgc caacctcagt aaagctgacc 15541 tgaatggtgc agatcttagt ggtgcgaacc tcagacgtgc agatctcaaa aatagcgatc 15601 tcaaaaacgt gaacttgagc ctagccaatc tcgacagtgc caacctcaaa ggtgctaatc 15661 tcaatgatgc ctatcttggc ggtattcagc tcactggtgc gaatctcact ggtgcgaatc 15721 tcactggtgc gaacctcaac gtttctaacc tcaacggtgc taacctcagt aatgctgacc 15781 tccgcagcgc caacctcagc aacgcttctt taaagcttac taacctctat ggtgccaaaa 15841 ttaacgattc aacgaaattt gatgatcaga gtttagattg atactctccg ggaagctacg 15901 gaggttctta agatatcggt ttatcgctaa 
gatgaagcag ttcactacgt acccgggcgt 15961 ggtgatgtat cattacgttg tttcgggcgt ttcaactcat tgtcaagaaa ggattactag 16021 aataaaatca tacaaaatat tgcgatagct gaaacaaaac agtagtcgca attattactc 16081 agctttcaca cttgttagct gtagtgctga cgtccagtca tttaaacagt acttgtaaag 16141 gaactgattc atttctggaa taacaagaag taatattgtt gactttatct agaaaatcaa 16201 atcttacaaa attgagggtt aagcagaaaa cctatgcctg aggttaaagt tcctaccata 16261 actagcacgt atgacattat ttgcttcggc gacgaagttc ctggtattct agctctagta 16321 tgtgcggcac gcgaatacaa ccgtcaaaaa aatcaatatc cacggacttt gctactgttg 16381 aaaggaaatt ctaaattagg gcttggcggt catttagtac gtggtggatt gtcttatctt 16441 gaccgttctg tagttcctgc tgctattcgt caatcacgta atttggacac ttttggtgat 16501 ccagcggcta tttacaagga atttttgaaa cgagctggtg tagcgtttat tgctctcgat 16561 ccggttaaag ctgatagcgc tttgcgagca atgttgcaag aggtacgtgc agatattatt 16621 agcgatattg aaattaagtc cgtaatcaaa gagggacaga aaatcactgc tattgaactg 16681 actaaaggtg aaacctatgc gggcaaacag tttattgact gcactgtgaa tgcagagttg 16741 gcgcaagcgg ctggagtgaa aaaacttaag gggtttgaaa cttttggctt acctgactcc 16801 gaactagcag tgactttagt ctttgaaact caggggttga gtattgaaaa actcaaaaat 16861 gtagagtttc aatatctcaa acgcttcacc aataaagcag atactgaagc tcaaaaatgg 16921 ttaaatgttg ccgcaggagg agatccagca agagcagacg agctgagaaa agacctggtt 16981 gattccgcag gcaagctgaa aacgatgtat gcaggtcaag attacatcga cgttcgctct 17041 aaagctctct ccatagctta ccatgctttc aggggtactc ctgtttccct acagtctagt 17101 ggtaccatgc ttgacaacgg aaatatagca attctgtctc agggaaggtt atcttggaat 17161 gcactgttat ataaagttaa tgctgaccaa gcagaagctt taacacgcgc caaatccaag 17221 ccaacatcag acatgctgcg tgagatggta tttgtgagaa agtggtttga aagcattggc 17281 gcgagtcagg tgaagtctgt agaagaactt tatattcgtc atgctggtaa tattacaggg 17341 gcagttgatc ctctgagtgg atctgagatg cttgcaggtg gtgtaccaca aagtgaagct 17401 ttaggaactt ttgggtacca ttttgatatt cgtggcggta tcaatggctt agatgaaagg 17461 gctgctggta aaggttttaa cgattttgca tttctgaatc cgcccttgtt caacattggt 17521 attcggcacg ccctggttaa aaatgtaccc aacttggctg ttattagtcc cggttctgga 17581 tttgttggct 
atgcttcttc tgctgggaga attgtggaat tcaactgtgg tgttgggcag 17641 ggagttggaa ttgcagcagg aattgcgatc gctcaagggc gtaacctcgc tgaaatctct 17701 aacgcacaag tgcgaaccat cttggcacaa acaggaaagc tatctcaaat ttatggtacg 17761 gacaatccac tactagcaag taagctaggc agttttgaaa atcaaatgat tgcttgagct 17821 tgctcgcaat accatatagt aatcataaaa aacctcgcga gagcttggaa cgagcgtggc 17881 ttttagccct gggaggaaaa gcgacacagt gggctttagc tcacagtctc tgttgttgac 17941 tgggttatac tttcgctgta aactagcgtc tctagcgaaa tttgttttat gtacctgacg 18001 caaaagaacc aaatcagaga actgaataag cctgaatttg ttgctctgcg cgaattgtgc 18061 cgactgagta agaacctcta caatgtaggt ttgtacactg tgcggcaata cttttttcag 18121 gagcgtaagc acttgcgtta cgagtctgca taccatttgt gtaaggtcaa tgagaactat 18181 aagcttctca atacagacat tgcacagcaa acattgaagg tcgtagacag gacttttaaa 18241 agcttttacg gattgataag tgcggttaag aatgggagtt atcaacaaaa agtgaagctt 18301 cccaactacc tgtccaaaga tggatacttc ttgctgataa ttcccagaat tgttgtcaag 18361 gatgggaaat ttcgagttcc aatgtctaat gcttttagaa agcagtatgg cgaagtttgg 18421 attccatttc ccaaaagact taacatcaat caaatcaagg aagtaaggat tcacccaaaa 18481 tataatgcta gatattttga ggttgaattt atcagcgaag tagaacccga gcccgtagaa 18541 gttaaaagtg atcgtgctat tagcatcgat ttgggagttg ataatctcgc tgcttgtgtt 18601 gataccaatg gggcatcctt tcttgtggat gggaaaccaa ttaaatctat taaccagtgg 18661 ttcaacaagc gtaacgccca actgcaatct attaaagata agcagaatat taatggcatt 18721 acgaatcaac aagtgaaact cacgagcctt cgcaataacc aagtccgaga ttacctgaac 18781 aagacagcac gatttatcgt gaaccactgc atcactaacg gtattgctaa cttgattgtt 18841 gggtacaacc ccggcatcaa gcaagaaata aatattggtg gacgcaataa ccaaaacttt 18901 gtccagatac cttttcacag tttgcgttct aagttgaagg cgatgtgtga aaggtacggg 18961 ttgaactatc aagagcaaga ggaatcttac acctctaaag caagcgcaat tgatggtgat 19021 gaaatccctg tttatgccga caatccgaaa gaataccaat tttctgggat acgaattaaa 19081 cgcggattgt acagaacaaa ggatggacat ttggttaatg cggatctgaa tggttcacta 19141 aatattggta gaaaaagtaa gcacgatggt tttaccggag tgtctagggc tgcgttgacc 19201 cagcctagaa ggattaatct tcttaaattg gagaagtgga gagcgacggc acataccgag 
19261 ttcggaacaa cttcttagaa tctccgtggc tttagcccgg agagtgtcaa ttgaaaagca 19321 ctgtaactca aagagcattt ttgacaatga gtcatacaca agaaactgga attccaggtg 19381 tgttagcaaa aaacaccaaa atcactaaac ttgccgatgg aatgtccttc tgcgaaggtc 19441 cagtttggga taaaaaacat catcggctca tttttagtga cactggtgca gatgaacata 19501 gatgctggag tgatgctcaa gggttacaga cttttcgcaa gcctagctat cagccaaacg 19561 gaaatgtttt tgatttgcaa ggcaggcttt tgacttgcga acacgaatcg agagcgatta 19621 gcatgaccaa catagatggt cagcgaacaa tagtggtgga taattatgca ggcaagcact 19681 tcaattcacc aaatgacttg gaagtaaaat ctgatcagac aatttggttt agcgatccaa 19741 cctacggtct tggtaacaga accaaggaga tggattttca gggcgttttt cgtttcgacc 19801 ccaaagcaaa taagctgaca ttaattgctg atgatttgag tatgccaaat ggcatcgctt 19861 tttctccgga tgaaaagaag ctttatgttg gagattcagc agaggataag cgacaaatac 19921 gcgcattcac ggtgaaccca gacggaacgg tttccggtgg cgaagtgcta tgcacgactg 19981 agaacccaat ttggggtcca gacggagttg acgtagatgc aaatggaaat atctacacag 20041 gttgtggtga tggcgtaaat attttttctc cagcaggttt gctgctaggc aagatattaa 20101 ctgaggctcc aatctcaaac tttgcctttg gcggaaatga tggcaaaatg ctgtttatga 20161 ctagtgaaca cgctttatat cgagttaatt tgctagtagc aggtgctgtc aaacgatggt 20221 gatttcggta gaaagtaagg aaatttacct tcattaagtg ctattactgt tttttgcttt 20281 tcttacagtt gatagccttt tttaattcgt agaattcacg ttgttgtacg gagtgtgctg 20341 ttatctaatc ctacaccata cttcaagata attccggaaa aatctataca tgagtactga 20401 attgctctcc tagtcgttac ggttcggttt ttgttgatag tgctagtttc atgcgatcgc 20461 acctcttgct tattttgcat agcctagcac tgcggttgtt gattacgttc acagcgtcac 20521 ggcttcgcta gctgaatgcc tgaatgtgct actgggtaag cgtttcatcg gtgtcaagag 20581 ggttacattt aaccctcttg accaatgtag cgaaaaacgc gaccgaccat ctctaaagtg 20641 gctctggata agcattagag caatgtggcg acaaacgccg acatcaaaac atttttatgt 20701 cgagatcata gtttcaatct cttcgtgggg tattagtagg cggaaagttt tttaatgctg 20761 gtaacaactt ccatctcatt accccaacga agaagtctga atggcttgtt tcaatttcct 20821 tacggcgaag taaataattt caactctctc tgtgtcctct gcgcccaccg cgaactttgc 20881 gggaggaaca tccacaccgc aaattcgctc tctgcggttt 
ttttatcact cagatgctac 20941 cggatttgat ataaaacgga atgcccattt tagtttgggg tgattgcact gaggggtgtc 21001 tcccctcggt acgaattctg tttactcgca tataaaaatt acatcagggt gcatgagtcc 21061 aattcattaa gtgaaagtca ctctttgttt tggtttctcg tatgaaaaag ctaaggttct 21121 tgagtaatta cttaaacgag taattactca atccacatca actggagtgc ggaaggtggg 21181 tttcacaagt tcgtagcgtt attgagcctc ttgaagcttt accagtaagt gattcagttc 21241 gtcacggatt tatgaaatcc tgtactaagc ccccttttta tataactctg acgtgagttc 21301 gacggttaga aaaaggcaaa aaatttgcct gcactacctc ctgccatttt ttagttccgt 21361 cgaactcacg ttaactctgt tagggggtct taacaaaact aaacaccact aactgaactg 21421 tattgatttt tatcaaagta aggcaggttg cataacttgt caaaagccac tgtatttatc 21481 ttcagaggtt tttggagcaa gtttatgcga atctactgga agccgattat ttgtgaggtg 21541 atattagaaa ttctgctaga actagcccca ggaagctttg acatgatggc gactattagc 21601 gagtatttgg tggaatcgac ttgggaagct agcagaactt caattcaact agttttggga 21661 atgaattccc aagtagcttt ccactaatct ctctaaaagg agtaagtgcc atctgctcct 21721 caaaacactt actttacaag acttgcagac aaattttgtt tttccccaaa atgagctact 21781 aactacaaaa ttttttacaa tcttgaatct acaggttcgc tctccatcgc caaaatccca 21841 aaaacgcact catgcacacg acgcaaagga acacggttaa caaatcgctc caaagcctcg 21901 acacccaaag caaactcccg catcgccaaa gaccgcttga cagatagtcc tcgttgacgt 21961 aaatgtcaat tgtattattt aaccattctg cttctaaaac gggtgtgtga cggcgcacga 22021 catgagaacc aaattggcgg tttggtcgtt gttggttacg ttggtgacac agttgttctg 22081 gtaaattgag ggcgattgcc tacggcagag ctacgcttaa cgccactgaa aaacaatgat 22141 actgccgtgc taatgccacc aatgctttac ggtcttctgg ctggacgtta gtagcatcta 22201 tgactgttag tctacctgct gccagtcgct tggcagtaat atagtgcagc acatcaaacg 22261 catcaccaga agcagattga ttattttcat catcagacac taagccgcga caaaaatctg 22321 aagataaaat ttcaaacggt gcaaagtgat tgtgggcaaa tgttgatttc ccggaaccgg 22381 aagcaccgat gaggacaact aaggaaagtt caggaaaagt tattttcatc tcacttataa 22441 aaacttttgt gattatgttg cttaaatttg atggctaaac acccccatct gcgtcggaga 22501 acctacttct ggatcttcct ctcctatcgc ttgaaatttc acagcgtagc cgaagcgttc 22561 tgcaacttgt tttgcccaag 
tttggaattc ttggcgtgtc cattcaaagc ggtgatctct 22621 atgtcgcagt tttccagcag gtaacttctc aaatctgacg ttgtattcaa tattgggtgt 22681 tgttacaatc acagttttgg gttgagcaaa ttcaaacaac acgcgctcaa aaacagcaaa 22741 gcgcggtaag tcgagatgtt caatcacctc aataactgta gcagcatcgt atcctgtaaa 22801 acgtttatct tggtaagtga gtgcgccttg aatgagttgt aggcgttcta attgggtacg 22861 aggtaagtgc aggcgatcta atctctcttg tgcgacttcc aatgaacggt aggaaacatc 22921 aacaccagtt acttgctcaa aaaagctatc tttaaccaga gttcgtaata aattcccttg 22981 agcacaacct aaatcaatca ctcgttttgc gccgctttct tttaaagtag taacaacagc 23041 atttaaccgc tgctgattta gactaatggg cttttctaca gctgcttctt cttctgcatg 23101 gttttcttgg gtactgtctg gatcaagttc gtcttcctct gcaagttgtg ctaaagcagt 23161 gcgagtcaga cgatgtagtc ttttgaggta acgacgagta attgcttccc gtgctggatg 23221 ttcagttaac caaccctctc catgacgcaa cagcttttcg atttcttcat cacctaccca 23281 atagtgttta tcgtcatcta agacgggaat caaaacgtac agatgactca acaactcagc 23341 aaggggaagg gtatgctgca gttctacagt gtagtactgg ctttgtcccc agtcaggaaa 23401 tgtctcatct agagcgtgat tttgagcagt gacagtgtaa cccaatggtt caaaaagctg 23461 tcgtagaaat ccttcaccac ctcgacaagg taaaacagac aatttcgcaa ccagagggac 23521 gggagtttgt accaattcag gtttatcttt acaacgccct gcaagtgcag tactaaaaac 23581 ttgggcgatc gccacactta aaaatgaaga agcaacataa gggcgatcat tcacatattg 23641 ttctaaattt gcactacgtc cgcgcactaa cttcactgga tcaatatcca aaagtagtgt 23701 cgcagtgcat cgttgctcat ttgcttctgg atagaaaatg tgcgcttgtc caaaggaaag 23761 aggaaaagac tgacaacgat ctgggtgttt atgcagtaag taccctaaat ctgtagctgg 23821 ataatatgtg gttgtaatcg acagtagcat tcacaataaa cagaggttta ctctcggtag 23881 aataatcatg actactgctc aatagctcgt gattacggaa aagatttatt taatgctgcc 23941 atagcacttt ttacttaata tttcttaaca gattcttttg aaaataagaa tattttgttt 24001 ggctgaatat ttgtgatacg ctagcttatt ttcctgtttg tctttctata acttgtatat 24061 agaattaaac actctagatc attagggagc atcttgtgaa tatcgacact gatcgcggta 24121 gcggcaggtt tttcctgtaa aatttctgtt aatcttgaac ctacacacca aactttttgt 24181 taagactgca agtgaggagt gaagtgtgta gttatggtga ggattgtgat actttctcta 24241 
cacccgtaaa ggcgcaatcc attgtacctc taaccttaca ccttacattc tttaaagacg 24301 ttaattttcg tttcattgtg taaatcccgt tcatcgaaat cgaaaatttg ggtctgaaac 24361 cccgtccttc taggacggct tttactgtta taattagata agccacaaag cgccattatt 24421 ggtgggtaaa actcaagcgt ggcgggacgt tctagttaaa aggcttttaa ctgaaatagt 24481 taaatcggtg ctaagcactc tagttccagc ccccacaagc agaaacaatt gagcgcaagt 24541 gttgcaataa acatacaaaa gtagacccaa tggtaagttt acgtcttgcc ggagggagta 24601 ggggcaattg acaccgccca gtgaactgcc accgatttcg agtaaccgga ttggtgaagt 24661 ggttgttccg agcaatggct aacaattgtt agttacctcc tcttaaagaa tcctcatgcc 24721 ttttggccca ggagtgtcaa caatacccca cctacacagc aactaggtgg ggttgggaac 24781 tgtaacttaa aatctgtaaa tatctggaca taacttccgg aatgggtaca taatgaagtg 24841 tagtccttac aagttgcact tgaaatgg // LOCUS NODE_1231_length_24731_cov_5.72633324731 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 24731) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24731) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24731 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 730..935 /locus_tag="DP116_11000" /pseudo CDS 730..935 /locus_tag="DP116_11000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006508267.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="response regulator" gene 1148..2479 /locus_tag="DP116_11005" CDS 1148..2479 /locus_tag="DP116_11005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748701.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_11005" /translation="MKEKLRILIVDDDEVDRMAVCRVLTKAGVEMELSEVGDGKDAIA LLSDTLYDCVFLDYRLPDQDGLSLLQNLRLNNIEVPVIVLTDQGDEQIAVELMKAGAT DYISKSRLSSEILVKVLRNAIRVHRAEMQVALVNQQLRQSNELLIRQNQELEVQRQHI ELQNLRLIEASHLKSQFLATISHELRTPMNAIIGFSQLLLRPKFSQLTHQQKDMLQRI LNNAKHLLMLLNEVLDFSKLEAGRLDLKPQIFDLSKVVSATVEEMRSLAEEKNLSLLI QMDLQNSLVFNDPTRIRQILTNLLSNAIKFTESGAIKVEVKELPEIRVEIAVHDTGIG IARADLQKIFEPFRQVDQSITRKYPGTGLGLPIIKALVQMMDGNISVESQLGNYSVFR IQLPRQISSLSQQGSDGTSNFTHASARKHFWKQQAIPRFAQEGAHSQSRDS" gene 2565..3689 /locus_tag="DP116_11010" CDS 2565..3689 /locus_tag="DP116_11010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314583.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_11010" /translation="MLAQDNSQVEHILAVDDNPDNLILLETILESEGYKVELVSNGRS ALQQIAQAPPDLILLDVMMPEMDGYEVTRRIRQNREISYIPILLITAYHDASVVEGLD AGADDFIRKPFDHEELLARVRSMLRLKHSIDEQRKMARQREDFVSRLTHDLRTPLVAA DRMLHLFHQEMFCAISPDMKEAIFAMIRSNQNLLEMVNNLLEVYRFEAGKKNLQLESW NMRQIVEEIVQELTPLVIEKGLAINIDSSNLDQQDETAAVVKGDRLELRRVIANLVGN AIKFTDTGSVNIRVSETPIKPEGKTWVIIEVQDTGFGIAPEDQATIFERFRQGKNKRA GSGLGLHLSQRIIESHKGKIDVFSELGKGSVFTVRLPKQA" gene 4071..4508 /locus_tag="DP116_11015" CDS 4071..4508 /locus_tag="DP116_11015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11015" /translation="MTTPTPTRGQVERTLAQGIQALYRDQLGHQPSKVTCQLSDQNVV VIIENSITKPEKLLITTGHEELAEEVRSDLDDAIRPQLKALIEETLNVTVIELLSDAT LQTGRTGIIAILTDSPTIRTPTPESNKVRTTTSSNRSNTPKSA" gene complement(4514..4966) /locus_tag="DP116_11020" CDS complement(4514..4966) /locus_tag="DP116_11020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136789.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_11020" /translation="MEQAQLFSHQKKQNSQPPLILAVEDNEDNLLLLSYALESLGCKL IRQNDGSTTSLVAKEYQPDLILLDILLPGISGIDIVRSLKQEPLTSHIVVIAVTALAS TEDRERILSAGFNDYISKPYMIEDLEALVGRYLCQELNPDLAYNLCED" gene 5862..7064 /locus_tag="DP116_11025" CDS 5862..7064 /locus_tag="DP116_11025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867866.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AAA family ATPase" /protein_id="PRJNA477356:DP116_11025" /translation="MDFEYYRNDEGTPANSNRQSLLASGWRPFHRELDWEFVWQLLSH DRQEFTQKSLNLASNFAEILGRNNYTWWANLLSVVSDNTRYEVEKFWNYITPDPLSPD YRYKDILSTETPIVQFVSRNSIPIDYVLNRLQEIAVLRVLDVLGCPDIITQYYLERDF YFPVEKFVNWERLDVVNTVYAYWSKNDIWLQINAFDRGRRQYTLLAKNLAPLINKATR DLAIMLSGYQTRVGKLYSQFPIRSFPGDIQNFTDLVQQAILNQKQLAVLVHGEPGTGK TAWTQAVAKEILVPLGYVIFILDHDAIANFVPPTYLERICIIINEADNLAQNRATEVA QYNNKTEHILSLLDGTLYQSVIDDSGIHIQQQLVVLMTCNTTERLDPAMLRKGRVDLM CEFTQRFV" gene 7435..8670 /locus_tag="DP116_11030" /pseudo CDS 7435..8670 /locus_tag="DP116_11030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859915.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="ATP-binding protein" gene 8721..9569 /locus_tag="DP116_11035" CDS 8721..9569 /locus_tag="DP116_11035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_11035" /translation="MKINRFEDASQFYDQVKDYLLNHEALHNVQLALCNNLIQNPERF DEKPYLATVEVDGDVIAVAMKTPGRKLLLSKIEDFGSIEVIAQDIHLTQELLSGVNAP VTEAKAFVEAWHSLTGQSYHLKMALRAFQLEQVQPIPKTTGELRLATQSDRQFLIPWY EAFALEALGNVESEAERKVERLLERGIAYIWEDKIPVSMACHVRVMPNGAAVSLVYTP PEHRRKGYASACVAALSQTLLNQGHRYCFLFTDLANPTSNRVYQAIGYQPVGDLSEYS FTENTS" gene 9569..10534 /locus_tag="DP116_11040" CDS 9569..10534 /locus_tag="DP116_11040" /EC_number="4.3.1.16" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876767.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="threo-3-hydroxy-L-aspartate ammonia-lyase" /protein_id="PRJNA477356:DP116_11040" /translation="MSQHNSVTITDVEAAAKRLAGIAHRTPVLTSRTVNERTNAQVFF KCENFQRTGSFKFRGAYNALSQLSEEQKQKGVLTFSSGNHAQATALAGQLLNIPTTIA MPDDAPAVKLSATRGYGGEVVLYNRKQTNREELAQTLLTERGGVMIPPYDHPHIVAGQ GTAAKELIQEVGELDLLLVCCGGGGLLSGSAIATKAVLPNCRVIGVEPELADDATRSF HTKVLQTVNNPDTIADGARTPYLGKITFPLVLHYVDDMVTVSEEAILRTMFFLWERLK IVVEPTGVLAAAALLEGVVKVPGARVGVIISGGNVDLAKVGQLFS" gene 10662..10886 /locus_tag="DP116_11045" CDS 10662..10886 /locus_tag="DP116_11045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949656.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11045" /translation="MTLQEIINSINSLSIEERDYLFEFLRKKKEESRGDNFWEGIQKF RKVIQSEGIIFTDEDFADLRDKSVGREIDL" gene complement(11040..11960) /locus_tag="DP116_11050" CDS complement(11040..11960) /locus_tag="DP116_11050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11050" /translation="MFHIQWFITRLLKPIVSLILVIILVVNYPSTVLAQTNPFQNIWE RIRSLPESLQTRTQGAPVAGRQKAGAGRGRCPALIPLDEDNEIPLTAFVPAIQEEQPT LSNSDNVSPSKLTYVWGRTIEQYPTFWFYIPYGSEESKTEYGKFVLLDKDKHIISGQI FVKIPIGNNPSLAKFTLPKSENPLEINQEYNWYFSIVCNPLKPSRNPGVTGWIERVNL PSFSLGNYRYYAEKGIWYDTVTRFVESADPQTLSQQLDWLLLIKFVFRNVENVEDVSM NDNDFNQIVNKIANFPIQTLTPVPNPELQR" gene complement(11978..13912) /locus_tag="DP116_11055" CDS complement(11978..13912) /locus_tag="DP116_11055" /inference="COORDINATES: protein motif:HMM:PF05226.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315038.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11055" /translation="MYSDTPIKKILILAANPKGTSPLRLDQELRDIAEGLQRSQKRDQ FSLEQKLAVRSRDIQRAMLDINPHIVHFSGHGAGEEGLVFEDDTGQSKLVTGTALAGL FELFADQVECVVLNGCYSLEQATAIAQHIKCVIGMSKAIGDEAAIEFAIGFYDALGAG RSIEFAYKLGCNAIQRQGIEEHLTPVLLKKVEIIGSAARLPINEVIKKWLRRIPWSGI RTTLLMSVGVTTLVVLTRFSAILEPFEFFFFDQMMRFQSAEKQDDRLLLVQITKEDIA NFGSGNINSLPDKVMSDLLTTLINNKPKAIGLDLYREVATDKKSKLYSLLKNTPNLFA VCKVANPEVDSEGNTHPPEVPEERIGFSDLLEDKDHVVRRQLLRMDVKEIENNPCSNK QKTMESFSFKLAQHYLGKEKKYKQTKNGLASGNVVLEPLGGSMTGGHQSTKWLYGYQV LLNYRSVCTSEKSRLPPCSPHKIAKAIRVQDVIEYGPKKELKQFPKDKLKQEDVKNKI ILIGTKREGVDILAGTPFSFGGGEPEISGVTLQAQMVSQLVSAVEDGRHLLRVWFIGH EILWILFWSLVGGFLTQFIRAPRKLILSITVTFLSLYVFCLIFFTSIKLWIPFVPPAL SLLSTSGVVVYIRLKSRKLS" gene complement(13977..16817) /locus_tag="DP116_11060" /pseudo CDS complement(13977..16817) /locus_tag="DP116_11060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315034.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="CHAT domain-containing protein" gene complement(16922..18841) /locus_tag="DP116_11065" CDS complement(16922..18841) /locus_tag="DP116_11065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ShlB/FhaC/HecB family hemolysin secretion/activation protein" /protein_id="PRJNA477356:DP116_11065" /translation="MENKYFWNMCLLKFLFTVATLTFSLPVQIARAVEVAQVPSADLL SQPNLAQTLPPPQDVLPPPSQPPQQPTVPSPPPDPGKLLPTTPAPNLPQQPAPEVVIK FFVNKFQFEGGSVFKNEKLLRAMEVFLYPSEPASEQLNDKAKCDALKQIDKKAPPRKL PRSESDPPVELTFAQLLVARSAITQLYICKGYITSGALIPAEQEFPPPPEAGALKIQI IEGTLEDILVVGNRRLNRNYIRSRLARANKKPLNRNQLLEALQLLQLNPRIGNLSAEL AAGTRFGTNLLVVRVDEARTFHGEIALDNNRSPGVGTFQRSVRLREENLLGIGDTVSL SYANTDGSHQFDVSYAIPLTPEETTLAFSYGRSWNNVIEKPFNILDIFSDSEEYQLSL RHPVIRNPRQEFTLEFALSHQRTQSSLGIDDTGPFPLSPGADDQGRTRISALRFIQQW VERGDRSVLALRSQFSVGLGILGATNQEVPPDTNFFAWRGQAQWVRRLAKDTLLVVRG DLQVADRRLVSLEQFRLGGQASVRGYRQDAVLADNGVFGSVELRLPIIRFAENQGLLQ VIPLVDVGTAWNNFEGSDLKPNPLVSVGLGLSLQIGESLDARLDWGIPLVSVEGDKKT LQEQGLYFSVLWRPF" gene complement(18831..24350) /locus_tag="DP116_11070" CDS complement(18831..24350) /locus_tag="DP116_11070" /inference="COORDINATES: protein motif:HMM:PF05860.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11070" /translation="MTIYIRCKQLWQIAIAFVATSGVIFSWNSAVAQIVPDKTLGTES SVVTPNVDIKGLPSDRIDGGAIRGANLFHSFQEFGIREGRGAYFSNPAGIENIFSRVT GNNSSNINGKLGVLGNANLFLLNPNGIIFGPNASLDIGGSFVGSTANSLNFGNGQQFS ATNPTAPPLLTVTVPLGVQFNQAQPSAIVNSGNLLAGTGQNLTLLGGTVVSTGQLSAP GGQVAVAAVPGGSVVKLSPSGQLLNIDTSSSVVSVGSSPLTELLRSVDEKSHPGLTVN SNGQVELTGSGLPVVDGDVVARNVTAKTATLTANHNLTLAESQLGTTGDLNLLAGDTV RVRDSVANPFVAQAGGKLLVQGRQGVDIFALNNSSSGLISGGDMVLRSLNTVGGDAHY WAGGNFRIEKLDGSLGGLFSPYDPIIRASGDVSFASYTGASLHILAGGSVTIPGTVTI TGADTTNGLSKETVTLSDGKQITIDGKGSRTLDIRAGTTDFGTVGLTPNPISGIVNPS FGSVPTSANIRIGSINIQRDQGDGQVFLTNQFQPNNLQGSIEITGGINIGKSFISVLA FGDPVTIDAKGGVTIGNIITGIGSGIGGDVKILGERNITTAGIVSLIDANGSGNGGDI SLISRSGTINTKGKIVSSGTNGNSGNIFLKARGNITIENEVTSRILGTRQGTETAGKI EITSESGNIINLSLEAIPPTTPVLDISGPPLIQQITSSTPNGTGGNIKLNAPQGSITT TGWTISAKSDKGVAGNVTLTAGRDIRSGNIEASSNSSSGDQKALSTITLNSSAGSVFI DGVKLSTTNTGTDYAGIININGKNIQITNNSKIESRGVDGLVFIGIDVENKVTADNVT IKNSTIDAIRNVESDLKPIKKDDGIQIASNGGIQIEGSNLSTTLKTTKSKLRSSGSIT LDAKGSVSLSNGAQLRVNTAGQGDAGKISIKAEQLSLRENSGLIGDVDPNGNGNGASV NLDVTGEILLEGDKDGGFSGTGESTRITVGILAEKEQQGEGKGGEIKINAGSLVLKNG AIIKNSTQGKGDAGPISVDATKEVNISGSVAGSGFPSGFFTSTSTDGKAGDITVKTGK FTIACGAALSARTTGNGQGGTITVNANSFEAIDGGQLVTTTSGSGQAGSIIVNATDQV TISGNDPNYSDRIAKFPKVIDKRVANDIKEGSASGLFANTTENSTGKGGDISISIPTG SLFVTNGAQLRAFTNGKGDAGNITINAGDTVTFDNAFAFATVEESGIGKGGDISISIP TGSLFISNGAQLQALTRGRGNAGNVIINAGDTVSFDRGYAITKVEGNVQTQDGQKRQG GDITITTGSLSVTNGAQLQALTQGRGNAGNITIKARDISFVGTSGNGLFPSAALVEVD KNAVGNAGNIDITTDSLSITNGARLFADTQGQGNAGIIQINATGSVKISGSNSINGRP SGLFTSTNPKSTGAGGDITIGNIIKPNLFHISDGAVLSASTSNNDKAGNITVNANRFE AINGGQLITTTSSTGAAGTITVNAGRRVTISGRDATFAERRSKLTLRNLPVENLPVNS ESEVFSGFFVRSGGSGQAGNIEINSPIVLLDNQGRLDANSNSQPGGNITLNKGNLLLL RRNSLITTSAGAGNGGNIEIKYPKGFIIAGFGRNNDIVANAFTGAGGKVTIKALGTVN IRALKKDELERQLRLSSNTSPEKLDPQELPTNDITAISQARGPDLQGTVNLNAPDVEP TRGTIQLPEDLGDSSKLIANSCPVGVQSAASRFVVTGRGGLPPNPGSVLSTDTFLGSA ANAPSGKNATASTSIPPQEAQGVEIGPRGEIILTARPSTLTPHTPWQRLTGCYGK" BASE COUNT 
7136 a 5104 c 5053 g 7438 t ORIGIN 1 aatgataagt atacttcgcc aataggcaca ctaccacctg ttgccagggc aagtgcacca 61 aatggaagct ctcaccaacg gcttgttagc ataatcacga gttggcagca cctaaactca 121 attgtcttgt catatgttgt agttctgact aatggctgaa gacgggcgct tgagtgtcag 181 tttagaattg taatgagcaa tactaagtat tttattatac cacgtttttg ttctgattcc 241 cacacaaata atcatttcta tgcctgtgag tgatgtttgt tgttatggtt actgcaattg 301 acttgcttag aggatgtctc aaaaaaggtg cgtcttgtgt aggagatacg acactctgca 361 atgacaatgc gaacagcata acattttaga ctgacttcaa gagacatcct cattacaagc 421 aaataagttg caattcattt acagcagatt gcatctttat tgaggtacac cttttcaact 481 caagagccag ataatcaagc aaattttact tctgacttct gacttggtga cctatgctgt 541 aagttgcacg tttaccaatt aggctaaccc tttgtcatta atcgtttgtc tttacaaaat 601 gtaaagcaca atcttataga aaaagagtgc aattgcgatg attttcatca ttactatcat 661 ctgaaatttc ggtgatttat acaaatcaat tgaaatgtac gtaaagcaga ttgtatgaaa 721 tcaaaggata tgtcaaaaag aacgattaac atactacttg tggaggatga cgaggttgat 781 atgatgaatt tgaaacgggc attcaaaaaa gttaaggtca caaatccact ttttgttgct 841 atcaatggaa tcgattggga aatgatcaag cgccaaaagt tcctgctaag cggtccttaa 901 ttttgctaga tttgaatatg cttaaaatga aagggaacag ggaagagcga acaggaaaca 961 gggaacagaa actgtatcta gtttcgcaaa aataaaaaaa gaatttcata gttattaaat 1021 gttgactatg ttactaacca agagcaacta gtaaatctaa aataaattta tttatttacc 1081 tctactgtca aaaaataaga aatcataaaa ataaaaacaa tgtatttaat agcgatggct 1141 aaaagcaatg aaagagaaac taagaatttt aattgtagat gatgacgaag tagaccggat 1201 ggcagtgtgt cgtgttctaa ccaaagcggg cgttgaaatg gaactgtctg aggtaggtga 1261 cggcaaggat gcgatcgctc ttttgagtga tactttgtat gattgtgtct ttctcgacta 1321 tcgtttacca gaccaagacg gtttaagtct actccaaaac ctgcgtttaa acaatatcga 1381 agtgcctgtt atcgtcctga cagatcaagg agatgaacag atagcagttg aacttatgaa 1441 agcaggtgct accgattata tatccaagtc gaggttatca tcggaaattt tggtaaaagt 1501 tttgcggaat gctatccgag ttcaccgggc agaaatgcaa gtcgccttag ttaatcaaca 1561 actgcgccaa agcaatgaac tcctgattcg tcaaaaccag gaactggaag tacagcgaca 1621 acatatagag ttacaaaacc ttagactcat cgaagcatca cacctcaagt 
cgcagttttt 1681 agcaacaata tctcatgagt tgcggactcc aatgaatgct attatcgggt tttctcagtt 1741 gttgttgcgt cctaagttta gtcaactaac acatcagcaa aaagatatgc tacagcgtat 1801 cctcaataac gctaaacact tgctgatgct gctgaatgaa gttcttgact tttccaagct 1861 agaggcagga aggttagatt taaaaccaca aatatttgat ttatcaaagg tggtcagtgc 1921 tactgtggaa gagatgcgtt ctctagcaga ggagaaaaat ctgtctttgt taattcaaat 1981 ggacttgcaa aactctctgg tgttcaatga tccaacacgt atacggcaaa ttttaaccaa 2041 cctgctttca aacgctatca agtttacaga gtctggcgca ataaaagttg aagtgaagga 2101 actgccagaa ataagagtcg agattgcagt tcatgatact ggtataggta ttgctcgtgc 2161 cgatttacaa aagatttttg aaccttttcg acaagtcgat caaagtatca ctcgcaaata 2221 tcctggtaca ggtctgggtt tgccaattat caaagcgttg gtgcaaatga tggatggtaa 2281 cattagcgta gaaagtcagt tgggtaatta ttcagttttt cggattcagc taccacgtca 2341 gatatcatcc ttgagtcaac aaggttcaga tggaacatca aactttactc atgcttcagc 2401 cagaaaacat ttttggaaac agcaagcaat tcctcgtttt gcccaagaag gtgcacacag 2461 tcagagtaga gatagttgat gttctccccc aaactaaata aagtttgggg attgttgggc 2521 tgatccagcc tgtgtacaag tttcgcttgt taaaataatt agttatgcta gcacaagaca 2581 attctcaagt agaacatata cttgctgttg atgataatcc cgataatctc atcttacttg 2641 aaacgatttt agagagtgaa gggtataaag ttgagttagt ctcaaatggt aggagtgctc 2701 tccagcaaat tgcccaagca ccacccgact tgattttatt agatgtcatg atgccagaaa 2761 tggatggcta tgaggtgaca cgccgcattc gtcagaatcg agaaatatca tatattccta 2821 tactgctgat cacagcctac catgatgcga gtgttgtcga aggtttggat gcaggagcag 2881 atgattttat tcgcaaacca ttcgatcatg aagaattgct ggctagagta cgatcaatgc 2941 tgcgcctcaa gcacagtatt gatgaacaac gaaaaatggc acgccaacga gaggactttg 3001 tttcacgtct gactcacgat ttgcgaactc ccttagtagc tgcagatcgg atgcttcatt 3061 tgtttcatca ggaaatgttt tgtgcaattt cgccagacat gaaagaggcg atattcgcca 3121 tgattcgcag caaccagaac ttgttagaaa tggtgaataa cctactagaa gtctaccgct 3181 ttgaggcagg aaaaaagaat ctgcaacttg aaagctggaa tatgcggcaa atcgtagaag 3241 aaattgttca agaactcact cctcttgtga ttgagaaggg cttagctatc aatatagact 3301 ccagcaactt agatcaacaa gacgaaacgg ctgctgtggt taagggcgat cgcttagaac 
3361 tacgacgagt catagccaac ctagttggaa atgccattaa atttacagat acaggaagtg 3421 tcaacattcg tgtatctgaa acacccatta aacctgaagg taaaacttgg gtgatcattg 3481 aagtccaaga tacaggattc ggcattgccc cagaagatca agctacaatc tttgagcggt 3541 ttcgtcaagg taaaaataag cgtgctggca gtggcttggg tttgcatctg tcgcagcgta 3601 tcattgaatc acataaggga aaaattgacg ttttttctga acttggtaag ggcagtgtat 3661 ttactgtacg tttgccaaag caagcttaag ttgcttggat tcctaagttg cttactcccc 3721 gtgatctgca atggcacttt acgggggata tatattagga tttatgccta gaggttaaga 3781 aaaggttaca gaaacatttc tagacattga tggtaaatac tgcatttgga tggtgatttt 3841 caacaatcct gaccaatata atgctctcaa ttgttcggga tttcagagtt tactacactt 3901 gatagtaaag ttagactact attgacaact caaaaagtca acagtgaaaa tcgctggtat 3961 gtcaacatca aatctgagtg acgcgtgaga agttaagtaa ctcatagctc ataacatcac 4021 tttttgactt gtcacttgtg gtctaaatat ctacaaaaaa caacaagcac atgactacac 4081 caacaccaac tcgcggtcaa gtagaaagaa cattagcaca aggtattcaa gccctgtatc 4141 gtgaccaact cggtcatcag cctagcaaag tcacctgtca actgtcagac caaaatgtag 4201 tagttatcat agaaaattct atcactaaac cagagaaatt gctgattaca acaggtcatg 4261 aagaattagc tgaggaagtt cggtcagatt tggatgatgc aattaggcca caactcaagg 4321 cattgattga agaaactttg aatgttacag tgatagaact cttaagtgat gccacattac 4381 agactggtcg tactggcatt attgctattt tgacagattc accgactatc cgcactccta 4441 cacctgagtc taacaaagtt aggacaacaa cttcatctaa tagaagtaac actccaaaat 4501 ccgcgtgata tggttaatcc tcacatagat tataagctaa atctggattc aactcctgac 4561 aaaggtaacg gccaacgaga gcttctaggt cttctatcat gtagggtttg ctgatgtagt 4621 cgttaaaacc tgcactcaaa atgcgttctc tatcctctgt actagccaat gctgtcaccg 4681 caataactac aatatgactg gttaggggtt cttgttttag agaacgcaca atatctatgc 4741 cactaatacc tggtaacaaa atatctagca gtatcaagtc tggctgatac tcttttgcaa 4801 caagtgatgt tgttgaacca tcgttttgac gaataagctt acaacccagg gactcaaggg 4861 catagctcaa cagcagtagg ttatcctcat tatcttccac cgctaatatt aagggtggtt 4921 gagaattttg ctttttttga tgactgaata attgtgcctg ctccatgtct tctccagaag 4981 attcatgtga aactcatcat ctgttttgag aaaataaacg aggcactccg taagcggtta 5041 
agtagcgctc acattagact ggctcgtgtc tcaactctac taagttgatt atctttgctc 5101 acccaatgaa ctttaaggaa agaatttatt cagctaaaga agtgagaata cacttcctaa 5161 ttataggcaa aatggataag aatgatagcg aataaattat aagcacttaa tatatttaag 5221 aatgatttaa cgtgataccc ttgaatgtac atatctgcat ggcagtttgt tatacaaagt 5281 tcaccaaaac ttttggtgag tatagggtta taagcgtatg ggtgtaggtc gcaaagcgtt 5341 acacctctga gtctcttatt tttcgtgttg gtataaaaat aagaaaattt ttatagagcc 5401 ttgacatatt ttgtagtcag tatgtagtat ataagtatca ataaacaaaa accgaaaatt 5461 tttaaagtga ctagtaccgc tacgcggaat tcaaaattcg gaatatctcc tccaccagac 5521 gctcttgcaa cggaattcgg aattaccttt tggcaagggt ttggtgacat ttccaatggg 5581 tgatttattt atgccgtact gtactagttt agtaggtcgg gcgcggaaat aaagctacca 5641 tattctgcgg gaagcatgcg tgggacttac gcgctcttgg cgtgtgcgta gcttccacac 5701 agaggagcaa aaagcttgta atgtaaacat gggtgaattc cgccgttgcg cactagtcgc 5761 ttctcaaggt acagttcctt tcaagtgaga aacgataaaa cttttgccaa atctcccata 5821 tcaggagatc acaactccac cactgcggca caactgacaa tatggacttt gaatactata 5881 gaaacgacga aggcactcct gctaatagta accgtcaaag tttactagca agtggctggc 5941 gacccttcca tagagagttg gactgggagt ttgtatggca actgttgtct catgacaggc 6001 aagaatttac acaaaaaagt ttgaatttag caagtaattt cgctgagata ttaggacgga 6061 acaattatac ttggtgggct aatttattga gtgttgtctc tgataatacg cgctacgaag 6121 tagaaaaatt ttggaattac atcacacccg atcctctgtc accagactat cgttacaaag 6181 atattttaag taccgaaaca cctatcgtcc aatttgtcag ccgcaacagt attccaattg 6241 attatgttct gaatcgtctt caagaaattg ctgtattgcg cgttttagat gtgttgggat 6301 gtcctgacat tatcacacaa tattatttgg aacgagattt ttattttcca gtggaaaaat 6361 ttgtgaactg ggagcgtttg gacgttgtta acaccgttta tgcttattgg tctaaaaatg 6421 atatttggct acaaataaac gcttttgaca ggggacgacg acagtatact ttattggcaa 6481 aaaatctcgc tccactcatt aacaaagcga cacgtgattt agcaattatg ctgagtggat 6541 atcaaactcg tgtgggtaaa ctttatagtc agtttcccat tcggagtttt cctggggata 6601 ttcaaaactt tactgattta gtacagcagg caattctgaa tcaaaagcag ttagcagttt 6661 tggtacatgg agaaccaggt acgggtaaaa ccgcttggac acaagcagtt gctaaagaaa 6721 ttttggtgcc 
tttggggtat gtgattttta ttttggatca tgatgcgatc gccaactttg 6781 ttcccccaac atatctagaa agaatttgca tcattattaa cgaggcagat aacttagcac 6841 aaaatcgtgc taccgaagta gcgcagtaca ataacaaaac cgaacatata ttgagtttac 6901 tagacggcac gctgtatcaa agtgttattg atgattctgg tattcatata cagcagcagt 6961 tagttgtgtt gatgacttgt aacaccacag aaagattaga tccagcaatg ctacgtaagg 7021 ggagagtgga tttgatgtgt gagtttacgc aacgatttgt ttaaagacgg tttttgatga 7081 ttggattaat tagcaagcaa attcttgggt agtaaatatc tggaatttgt cccggaggtt 7141 ttgtataaac tatgaagcga atgtctttct agacaataac ccaatacagt tgctgttagt 7201 ggtatttagt tttgtaagat cccccaacag agttaaaaag ggggcttagt tccctccttt 7261 ttaaggaggg ctagggagga tcaatcctta acagcaaccg tattggacaa taaccaacta 7321 aaaaacagaa tatagcagtt tgcagttgga tggatacgcc attgtatgaa gtcaggtagg 7381 caatgcctac tctaccaaaa tatgtatttc agtgattttc aaagcaataa aaccgtgaaa 7441 attcaacaac tgattttaga agcacttaac cttcctacta acgccattac ctaccatgta 7501 agtcaggaat tagcagcgtt ttatccgaaa aaagcgttgc tcgaaggaag tgattctgcg 7561 tttgatgtcg aaaggtacgc ccaggcaaac ctttgtacta ttaagtatga cacctctatt 7621 tacaaccaga ttatttctgg ttgggatggc atggaaaata aaatttacaa ttcgactgaa 7681 aatgccagct ttgaagtgac atgggaagaa cataaactgg atatcctact cataagtttt 7741 caggagggtt attgtaaaac cagatatcac tggattttgg ctgaaagtaa agatattgct 7801 gaaaagtttt ttgccgcagt ttgtgagtgg aactccgaaa tcagaagtga aattttagtg 7861 tttgaagaag gctattggtc gaaaaatgaa gagttgtttg agtcgattca gaacgcgact 7921 tttgataatc tgattttgcc tagtactctc aaacaataaa ttcaagatga cttgacaaac 7981 ttttttgctt tgcgggaaac ttacgaagct tacggtttac cttggaagcg aggaatttta 8041 tttattggct ctcctggaaa tggcaagact cacgcagtca aagctttgat taataaaatg 8101 cagcagcctt gcttatacgt caaaagcttc aaatcggagt atagtaatcc tcatcacaat 8161 attcgtcaag tctttcggga agcacgacag tctgcaccct gtattttagt gttggaagat 8221 cttgattctt tagtagaaca agaaaatcgc tcgttctttt tgaatgaact tgatggtttt 8281 gccgcaaatc ttggaatagt catattagcg acgactaacc atccagatcg tatagatgca 8341 gcgattttag aacgtcccag tcgctttgat cgtaaatatt actttgagtt gccgactttg 8401 gcagaacgta tagcctatat 
caacttgtgg aacgacaaat ttaaaccgac aatgcgcttg 8461 tctgaagcca caatatctca gattgccgag atgacaaatg gcttttcctt cgcttacctc 8521 aaagaactat ttgtatcatc tatgatgctt tggatgcaag ggatggaacc aggtggaatg 8581 gataaaagta taatttcact agtagctgtt ttgcgacaac aaatgagcag tgcaactggt 8641 gaaaacacag ccgcagtaag tacagcataa ttctgtcacc accatacagc ataattctgt 8701 caccaccata actgatgttg atgaaaatta atcgttttga agatgcaagc caattttacg 8761 atcaagtgaa agactacttg ctcaatcacg aagcactgca taacgtgcag cttgcacttt 8821 gcaacaattt gattcagaat cccgaacgct tcgacgaaaa gccctactta gcaaccgtgg 8881 aagtagacgg agatgttatt gctgtggcga tgaaaacacc tgggcgaaaa ttgctgttgt 8941 caaaaataga ggattttggg agtatagagg tcatcgccca agatatacac ctgactcaag 9001 aattattgtc aggagtcaat gctcccgtta ctgaggcaaa agctttcgtg gaagcttggc 9061 attccctaac tggtcaatcg tatcacctaa aaatggctct gcgtgctttt caattggagc 9121 aggtgcagcc gattcccaaa acaacaggtg agttacgttt ggcgacacag agtgatcgcc 9181 aatttctcat cccttggtat gaagccttcg cgctggaagc cttgggcaat gttgagtcag 9241 aagctgaacg caaagtagag cggcttttgg agcggggtat tgcttacatt tgggaagata 9301 aaatccctgt ttcaatggcg tgccatgttc gtgtaatgcc caacggtgca gcggtgagtt 9361 tggtttacac accaccagaa caccgcagaa aaggttacgc tagtgcttgt gtggcggctt 9421 tgagccaaac tttactgaat cagggacatc gatactgctt tttgttcact gatttggcaa 9481 atccgacttc taatcgcgtc taccaggcga tcggttacca gcctgtaggt gatttgtctg 9541 aatattcttt caccgaaaac actagctgat gtcacagcat aattctgtca ccattactga 9601 tgtagaagcc gccgccaagc gtttagctgg tattgctcac cgcaccccgg ttctcacttc 9661 cagaactgtt aatgagcgca ccaacgctca ggtgtttttc aagtgcgaga acttccagcg 9721 caccggatct ttcaaattta gaggtgcata caacgcacta tcacagttgt ctgaagaaca 9781 aaagcaaaaa ggcgtcttga ctttttcttc tgggaatcat gctcaagcaa cagcacttgc 9841 tggacaactg ctaaatattc ctactactat tgctatgcct gatgatgctc cagccgtcaa 9901 gttatctgcg actcgtgggt atggcggtga ggtggttttg tataaccgca agcaaaccaa 9961 cagggaagaa ttagcccaaa cactattaac cgagcgagga ggtgtgatga ttcctcctta 10021 cgaccatcct cacattgtag cgggacaagg tacagctgcc aaagaactta ttcaagaagt 10081 tggtgaactg gacttgctgc 
tggtttgttg cggtggtggt ggattacttt ctggttctgc 10141 cattgcaacc aaagccgtat tacccaactg tcgggtgata ggagtagaac cagaacttgc 10201 tgacgatgca acccgctcgt ttcataccaa agtcctgcaa actgttaata atccagatac 10261 tattgctgat ggtgctcgta ccccttattt aggtaagata actttcccac tggtgctgca 10321 ttatgtagat gatatggtca cggtatcaga agaagcgatt ctacgcacca tgttcttttt 10381 gtgggaacgc ctcaaaattg ttgttgaacc gactggggtg ttagctgcag cggctttgtt 10441 ggaaggagtg gtgaaggtac cgggggctag ggttggtgtg attattagcg gtggtaatgt 10501 ggatttggcg aaagttgggc aattgttttc ttgacagata aaatgcttgt agataaatgt 10561 gatatgaaaa atgtcagaac cgcagattga aatcaagttt aatgtagggc ttacagagac 10621 cgaatctctg gttatacata ccttttagag gagcttctat gatgacttta caagaaatta 10681 ttaattctat taacagtttg tccatagaag aacgggacta tttatttgag tttctgcgga 10741 aaaaaaagga ggaatctaga ggagataatt tttgggaggg aatacaaaaa tttagaaagg 10801 taatccaaag cgaggggatt atctttactg atgaggattt cgctgattta cgagataaga 10861 gtgtgggaag agaaattgat ctatgatctg gaaattcttg cttgatacca atattctaac 10921 tgattttctt tccgttatta cggatatgag aaaacccaca gcctggacat tccatcagga 10981 ataacctcaa ttcataccta tgttatgcaa cgcctattgg tttaattaca cttacctact 11041 taccgctgta attccggatt tggaacagga gttaatgtct gaataggaaa atttgcaatt 11101 ttatttacta tctgattgaa atcattatcg ttcattgata catcttccac attttccaca 11161 tttctaaata caaatttgat aagaaggagc cagtctagtt gttgggataa ggtttgtgga 11221 tctgcacttt ctacgaaacg tgttactgta tcataccaaa ttcctttttc agcataatat 11281 cggtagtttc ctagagagaa acttggcaag ttaactcttt ctatccagcc tgtaacacca 11341 ggatttcgcg atggtttcag aggattacaa acaattgaga aataccaatt gtattcctga 11401 ttaatttcta gaggattttc actttttggc aaagtgaatt tcgcgagact aggattattt 11461 ccaatgggta ttttgacaaa aatttgtcca ctaataatat gcttatcttt atctagtaaa 11521 acaaattttc catactcagt ttttgattct tcagatccgt aaggaatgta aaaccagaag 11581 gttggatatt gttcgattgt ccgaccccat acataagtta atttagaagg cgaaacatta 11641 tctgagttag ataacgtggg ttgttcttcc tgtatagcag gtacaaaagc agtcagtgga 11701 atttcgttat cctcatctaa aggtataagt gcaggacaac gacctctacc cgctcctgct 11761 
ttctgacgac cagcaacagg tgctccttgt gttcttgttt gtagcgactc aggaagagac 11821 cgtatccgct cccaaatatt ttgaaaagga ttggtttggg ctaggactgt tgagggatag 11881 ttgacaacga ggatgatgac aagaatgaga ctaacaatag gtttgagcaa acgagtaata 11941 aaccactgaa tatgaaacat aaagaacagt ttagagatta agataatttt cgagatttca 12001 atctaatata aactacgact ccactagtac ttagcaagct taaagcagga ggtacaaatg 12061 gaatccaaag cttgattgag gtgaagaata ttaagcaaaa aacatataaa ctgaggaagg 12121 taacagttat agatagtatt agttttcttg gtgcgcgaat aaactgagtt aagaagcctc 12181 cgaccagcga ccaaaacaaa atccaaagta tctcatgccc aataaaccaa actcgtagta 12241 ggtgtcgccc atcttcaact gcactcacaa gctgactgac catttgtgct tgcagagtca 12301 ctcctgagat ttctggttcc cctcctccaa aagaaaaggg agtgcctgcg agaatatcaa 12361 caccttcgcg ttttgttcca atcaatataa tcttgttttt tacatcttcc tgtttcagtt 12421 tgtcttttgg aaactgtttt aattctttct taggtccata ttctatgacg tcttgaaccc 12481 ttattgcttt tgcaatcttg tgaggagaac aaggaggaag acgagatttt tcagaagtac 12541 acacagaacg gtaattcagc agcacctgat agccatacaa ccattttgtg gattgatgac 12601 caccagtcat tgaaccacca agcggttcta agacaacatt accagatgct aaaccatttt 12661 tagtttgctt atattttttt tctttgccta agtaatgttg agctaattta aaactaaagg 12721 actccattgt tttctgttta ttgctacatg gattgttttc aatttcttta acgtccatcc 12781 gcaaaagttg acgacgaaca acatgatctt tatcctctaa taaatcacta aaccctattc 12841 gctcttcagg aacttcaggg gggtgagtgt taccttcact atctacttca gggtttgcta 12901 ctttacaaac agcaaaaagg ttaggtgtat tttttaagag gctgtataac ttcgattttt 12961 tatcagtagc aacctcacga taaaggtcta aacctatagc ttttggctta ttatttatta 13021 gtgttgttag taaatctgac atgactttgt ctggtagaga attgatgtta ccactaccaa 13081 agttggcaat atcctctttt gtaatttgaa caagcaatag tcgatcatct tgcttttctg 13141 ctgattgaaa tctcatcatc tgatcaaaga aaaaaaactc aaatggttcc aatattgcag 13201 aaaagcgtgt tagtactact agagtagtta cccccacact cattaacagc gtagtacgaa 13261 tgccactcca aggaatgcgc cgcagccatt tttttatgac ttcattgata ggaagtcgtg 13321 ctgcagaacc aattatttca acctttttaa gcaaaactgg tgtcaagtgt tcttcaattc 13381 cctgcctttg tatggcatta cagccaagtt tgtaggcaaa ctcaatcgaa 
cgtcctgcgc 13441 ccaaagcatc gtaaaatcct atagcaaatt caattgcagc ttcatcacct attgccttac 13501 tcatgccaat cacacactta atatgttggg cgatcgccgt tgcttgctcc aatgagtaac 13561 agccattaag tactacacac tcaacttggt cggcaaatag ttcaaacaat ccagccagtg 13621 cagttccagt gacgagctta gattgtccag tgtcatcctc aaagactaat ccctcttccc 13681 ctgcgccatg tcccgaaaaa tgaacaatgt gtgggttaat atctaacatc gccctttgga 13741 tatccctgga acggactgct aacttttgct ccaaactaaa ctggtcccgc ttctgggaac 13801 gttgcaatcc ttctgcaata tctcgcaact cctgatctag ccgcagagga gaagtacctt 13861 ttggatttgc tgctaaaatc aaaatttttt ttattggagt atcactatac atataattac 13921 taaagggaat attttgggtt agtagtgagt ctatctggag ttaactgaaa ctcaacttac 13981 agccagtttc caagtaaaat atacggtgcc caatatcttg gatgttcacg accaggtaaa 14041 tcctttagtt gtttttgagc ttcctgcaaa gcttgagctt ttgttacctt ctgtttaatt 14101 aattgcttgt aaaaaatctt agtaaaatca acactgattt catcatccaa tgtccacagc 14161 gaagctatag aactacgcgc accggctcga acacttacac cagaaatacc taaagttgct 14221 cgcttatcac cagcagcggt ttcacacgca ctcaatacaa gaatttctat aggttcaggt 14281 tgtctttgag cttgtgttcg gaatatgtcg ccgatttgat taatgttaat aggctcgtct 14341 ctcgctaaaa gaaatgtttt ctcaggacta gagctaaact caccgtgtgt agctaagtgg 14401 aggactttaa aagcagaagt atcgatttct ttttttaaat tctttatttt aaacttatta 14461 tcaatcagtt cataaactga aaaattagaa ttagctattt ctttgattgc ctttatttct 14521 ttttcaacat aattcaactt aggaaatttt agtttgttgt tgtcgtctgg ttcctttagc 14581 cctgctgcca aaattttgag tcctttacct tggagagtat taggcttggg aatatttaat 14641 cttggagcta atgccagtgc ataattatca attagatact tgggtatttt ttccttatca 14701 ttaaccagca gtgctgctaa tggaatattc cgcaaattag tatctagagc aaacactaat 14761 gtaacaattt ctttttcttt gaggtattgt tctgctcctt taagaagcca gtcatacacc 14821 agtttgcctt cttttttaac atcctcaaat gtatactctt cttcaaaatc tagttgcagc 14881 tttttaagaa caccattaaa tgtatattct ttcacgtcta tctgcagttc tctgatttgt 14941 tctttaactg tagcctcgtc tacctgagtg ctgtaatgtt ttaaattcgg atcgtcaggt 15001 aacttgatga tgacttcaat tctatctttg acttcgattt tctcttctac ttcgattcta 15061 tcttctacaa gaatcggata gaaaaaagct 
gtctttacgg gttgctcatc aataatttta 15121 tcaatttctt ctatattgga ctctggacaa gctagacgta agaaattttc caactctacc 15181 gcttgtaaag aagaaataac ttcccgtgct ttcgagagat tttcttgatg tggattacta 15241 tcccataaaa gcaaattgac gtaatctcga taaatatcct cgatatcatt ttgaaaagat 15301 aattcggcat caggattacc tgctactaac tctctacgaa gagattgtaa agtgagatac 15361 gcctgttgat acatttgacg tgcatcatcc aagttttttg gactctcttg ctgtatctct 15421 ttggtttgag cctgtgctct tggtggaagt tgagatttgt aaatccgccc taactgccac 15481 tcccaaagat attcaatctc tgacggtgat gccgcagtag ctaaagattg agcaatgaac 15541 aaagctttct gagtattatc tttagctaat tcccattcat taggtttttc atggagttca 15601 ccaagatacc cgtaagcgta ggattctact ttcaagtcgc cgatatctct agcttgttta 15661 attgtttctt ttagaaattc ttctatgata aattcttcta tttttgaaga tcgattaaga 15721 cgaatcaagc ttttagcgaa attaagccta ataaataatg ctgcatggct ctttggtagt 15781 tgctcaattt gatgttgcaa gttatcaatt tgctgtaata atattccccg atcttggatt 15841 gctaactggt taactatctc attaagaatc atatcaattg attgaaaatc aaaaatctta 15901 aataaacggg gtgtattctt ttttttagtc ttacgattcc actcatcaca tattctgttg 15961 gcagaaccat ctttcatatt tttcgtgatt gtctcacgcc agtcacttaa attttctaca 16021 agagtaagct gattaagctg tgcctgtact tgcatcaaag ctggagaacc tgaagttttt 16081 ttgctgtagt ctagaacctg ttgatagaca ctattggcca cgcccgcata tgttaacgca 16141 caattaaaat tttcgtgttg ttcggtgcgg ttataataat cttgctctct attacttcaa 16201 gtacggaata tgttgcctaa atccaacaaa attgctcctg cttgttcagg gaaagagttg 16261 ttattacttg ctaataaaac taatatttgg tatgaccaat ctagctcacc tagtgagcga 16321 aagacgttag caagaagacg taatccagcg attttagtag gagaatttaa gagttttctt 16381 aatccttctg ggttttctat tttttcgcta aagtgtttac gtttatcttg gtcttgtatt 16441 attaaaaagc agtctttttc tcgataaatc ggcagcaaga tattacaagc tcttggatat 16501 aatcccattg cgaatgcagc ttgagcttga ttgatttggt tattaattgc ctccgcctca 16561 tttttatgtt gtcggtaggc tgcaacagat ttttgccaac aatttatcgc ctcgtcaaaa 16621 cgaccaatct cgtaattcgc tttacctcgt acacttaaat cttctggact gagagttgct 16681 gaacaactta attccccagt ttgaggcttt acaccgatta ccattttcgg agagaaaagt 16741 tgaatggtca 
gcgatgaaat tagcccaaaa agggctagta agaatatggt aattaccctg 16801 tgaagctgtt tttgcataga gtataaatgt tttgtatcaa gttcaaatga ttagtcacat 16861 tcctatagtc aaaggctcat atgttgcaac aattagtata agtattcaag ccaacgcgac 16921 attagaaggg tctccataga acagagaaat acaagccttg ttcttgtaaa gtttttttgt 16981 cgccctcaac agacactaaa ggaatacccc aatccaagcg ggcatcaaga ctttcgccga 17041 tctgcaagct caagcctaaa cccacagata ctagaggatt cggctttaga tctgaaccct 17101 caaagttatt ccaagctgta cctacatcaa ctaaaggtat tacttgtaac aagccctggt 17161 tttcagcaaa gcgtataatt ggtaggcgga gttccaccga accaaagact ccgttgtctg 17221 caagcacggc gtcttgacga tatcctctaa cacttgcttg accaccaaga cgaaattgct 17281 ctagagatac aagtcggcgg tctgctactt gcaaatctcc ccgaacgact aacaaggtat 17341 ctttcgcaag gcggcgcacc cattgagcct gtcctctcca agcaaaaaag ttggtgtcgg 17401 gaggaacttc ctgattggta gcacccaaga taccaagtcc tacactgaac tgcgatcgca 17461 atgccaaaac agagcgatcg ccccgctcaa cccattgctg aatgaaacgc agtgcagaaa 17521 ttcgagtccg tccttggtca tcggctcctg gtgagagtgg aaaaggtccg gtatcatcaa 17581 ttccaagaga actttgagtt cgttgatgag atagggcgaa ttccaaggta aattcttggc 17641 gaggattccg aatcactgga tggcgtaaac tgagttggta ttcctcggaa tctgagaaaa 17701 tatctagtat gttaaagggt ttttcaatga cattgttcca actccgacca tagctaaaag 17761 ccagagtcgt ctcctcaggt gtgaggggta tagcgtaact gacatcgaat tgatgactac 17821 cgtcggtgtt agcgtaagac aaactcacag tgtctccaat tcccaacaaa ttctcctctc 17881 gcaaccgaac gctacgttgg aaagtaccca cgccaggaga gcggttatta tcaagagcga 17941 tttcaccgtg aaaggttctc gcctcatcca cccgaactac cagcaagttg gtgccaaagc 18001 gtgtaccagc agctaactcc gccgacaaat ttccgatgcg gggattgagt tgcaacagtt 18061 gtagtgcttc tagcaactgg ttacgattca gtggtttttt gtttgctctg gctaagcgac 18121 tgcggatata gttgcgattt aaccgccgat tcccgacaac cagtatgtct tctaaagtac 18181 cctcaataat ctgtattttc aatgcaccag cttccggagg tggcggaaac tcttgttctg 18241 ctggaatcag cgctccagag gtaatgtagc ctttacagat ataaagctgg gtaatggctg 18301 aacgtgctac cagcagctgg gcaaatgtga gttccactgg tggatcactc tcgcttcttg 18361 gtagtttccg aggaggagct tttttatcta tttgttttaa tgcatcacat ttggctttgt 
18421 cattcaattg ttcggaagca ggttcggaag gataaagaaa cacttccatc gctcgtagta 18481 atttttcatt cttgaagacc gaaccaccct caaattggaa cttgttgaca aaaaatttga 18541 taacaacctc cggagctggt tgctgaggga ggttgggagc aggagtcgtg ggtagcaact 18601 ttcctggatc tgggggaggt gaaggaactg tgggttgctg cggaggttga gatggtggag 18661 gtaaaacatc ttggggaggg ggcagagttt gcgctaagtt aggctgtgat agcaagtcag 18721 cactggggac ttgagccact tccacagcac gtgctatttg cacgggtagg ctaaacgtca 18781 gtgtcgctac ggtgaataag aactttagca gacacatatt ccagaaatat ttattttcca 18841 tagcaacctg ttaacctctg ccaaggagta tgaggagtga gtgtcgatgg acgagcagtc 18901 aaaataattt cccctctcgg accaatttct accccttgag cttcttgtgg cggtattgag 18961 gtggaagctg tagcattttt accggaggga gcattggctg cacttcctaa aaaggtgtca 19021 gtactaagga cagaaccagg gttgggcggt aatcctccac gtcctgtgac aacaaaacga 19081 ctcgctgcac tttgtacacc caccgggcaa ctgttggcaa ttaattttga agaatcacct 19141 aagtcttctg gtaactgaat tgtcccgcgt gtaggctcta catccggtgc gttgagattt 19201 actgtaccct gtaagtcggg accacgcgcc tgggaaattg ctgtgatgtc gttcgttggt 19261 agctcttgag ggtctagttt ttctggtgag gtgttggacg acaatcgcaa ttggcgctcc 19321 aattcgtctt tctttagtgc tcgtatgttg acagtaccaa gagccttgat tgtgaccttg 19381 ccgccagcac ctgtgaaagc attagcaacg atgtcgttat ttctcccaaa acctgcaata 19441 atgaagccct tagggtattt gatttcgata ttgccgccat tgcctgctcc cgcactggtg 19501 gtgatgagac tattacggcg tagaagtagt aagtttcctt tattcagtgt tatgttgccg 19561 ccaggttgtg aattagagtt agcgtccaac ctgccttggt tgtccaaaag cactatgggt 19621 gagttaattt cgatgttgcc tgcttgccct gaaccaccag aacgaacaaa gaagccacta 19681 aaaacttcag attcagaatt tacaggtaaa ttttcaacag ggagattgcg taaagttaat 19741 ttgctacgtc gctcagcaaa agttgcatca cgtccactga tagttacccg tctgccagca 19801 ttgactgtaa tcgtacccgc agcgccagtg gaggaggtgg tagtgatgag ttgtcctccg 19861 ttgatagcct caaatctatt agcgttgact gttatgttgc ctgctttgtc attattagag 19921 gtacttgcag ataacacagc accatctgat atgtggaaaa gattgggttt aatgatattg 19981 cctatagtga tatcaccacc agcgcctgtg gacttaggat ttgtggaggt aaataatccg 20041 ctaggtcgtc cgttgattga gttgctccca gaaatcttga 
cagaacctgt tgcattgatt 20101 tgaattattc ctgcattccc ctgtccctgt gtgtcggcaa agagtcgagc gccattggta 20161 atagaaagtg aatcagtagt gatgtcgatg ttgccagcgt taccaactgc gtttttgtct 20221 acctcaacta aggcagcact gggaaataga ccattaccac ttgtaccaac gaaggagata 20281 tcccgggctt tgatagttat attgcctgca tttccccttc cttgggtaag agcctgcagt 20341 tgagcgccat tggtaacaga gagtgaacct gtggtgattg taatgtcccc accctgtctt 20401 ttttgtccgt cctgtgtttg tacgttgcct tctactttgg tgatggcata acccctatca 20461 aacgagacag tatcgccggc attgataatc acattgcctg cattcccccg tcctctagta 20521 agggcttgca gttgagcgcc attgctaata aagagtgaac cagtagggat ggagatgcta 20581 atgtctcccc ctttacctat gccgctttct tcgacagtag cgaaggcaaa agcattatca 20641 aaggtgacag tatcgccagc attgatagtt atattgcctg catctccctt tccattggta 20701 aaagctcgga gttgagcgcc attggtaaca aagagggagc cagtagggat ggagatgcta 20761 atgtctcccc ctttacctgt agagttctcc gttgtgtttg caaatagacc actggcagaa 20821 ccttctttga tgtcattggc aacacgttta tcaataacct taggaaattt agcaatgcga 20881 tcagagtaat tcgggtcgtt accagaaata gtcacctggt ctgttgcatt gacaatgata 20941 ctccctgctt gtccactacc agatgttgta gtcaccagct gtccgccatc gattgcctca 21001 aaagagttgg cattcactgt aattgtgcca ccttgcccgt tccctgtggt gcgggcgctc 21061 aaagctgccc cacatgcaat tgtgaatttg ccagttttaa cggtaatatc gccagctttt 21121 ccatcggtac tggtactggt aaagaatccg ctgggaaagc cgcttccggc aacactgcct 21181 gagatattaa cttctttagt agcatcaaca ctgataggtc ctgcgtctcc ctttccttgg 21241 gtactgtttt tgataatcgc accgtttttc agtactaaag aaccagcatt aattttaatc 21301 tctccacctt taccttcccc ctgctgttcc ttttctgcca ggataccgac ggttattctg 21361 gtcgattcgc cagtcccaga aaatcctcca tctttatctc cctctagcaa tatctcacct 21421 gtgacatcta aattgacact tgctccgttt ccgtttccat ttggatctac atctcctatc 21481 agcccagaat tttctctcaa agaaagctgt tcagctttta tgctgatttt gcctgcatcc 21541 ccttgtccag cagtattaac gcgcagttga gcgccattac tcaaggaaac agagccttta 21601 gcatctaaag tgatagaacc agaagatcgc agtttcgact tggttgtctt cagcgtcgta 21661 ctcaggtttg agccttcaat ttgaattcct ccgttagaag ctatctgaat accatcatcc 21721 ttctttatag gcttcaaatc 
actttctaca ttcctaatag catctatggt gctatttttg 21781 atagttacgt tatcagcagt taccttattt tcaacatcaa ttcctataaa aacaagtcca 21841 tccacacctc ggctctctat cttactgttg ttagtaatct gaatattctt gccattaatg 21901 ttaataattc ctgcataatc agttccagtg ttagtagtac ttaatttaac tccatctata 21961 aacactgagc ctgctgatga gttaagtgta attgtgctca aggctttctg atcaccacta 22021 ctactgttac tagaagcttc gatattgcca gagcgaatat cgcgaccagc agtgagagta 22081 acatttccag ctacaccttt gtcagactta gcgcttatag tccacccagt tgttgtaatc 22141 gaaccttgag gagcattaag ttttatattt ccaccagttc cattaggagt agaagatgtt 22201 atttgctgaa taagtggagg accgctaatg tccagaacag gagtagtagg aggtattgcc 22261 tctaagctta gatttattat attgccactt tcactggtaa tctcaatctt tcctgctgtt 22321 tctgttcctt gtctagttcc aagtatacga gaggtaactt cattttcaat tgtgatgttg 22381 cctctagctt tgagaaagat gttcccactg ttaccattag ttccagagga aactattttt 22441 ccttttgtgt taatagttcc acttctgctt atgaggctaa tgtctccgcc atttccacta 22501 ccattagcat ctattaagga aactattcca gcagtagtta tatttcgttc gccaagtatc 22561 ttgacatctc cacctatgcc tgaacctatg ccagtaataa tatttccaat cgtgacgccg 22621 cctttagcat ctatcgtgac aggatcgccg aaggcaagaa ctgagatgaa gctcttacca 22681 atatttattc cacctgtaat ttcaattgaa ccttgtagat tatttggctg aaattgattg 22741 gttaaaaata cttgaccatc gccttgatcg cgttgtatgt taatacttcc tattcgtatg 22801 ttcgcgcttg ttggaacgct accaaagctt gggttaacaa ttcctgagat gggatttggt 22861 gttagtccta cagtcccgaa gtcagttgta cctgctcgaa tatctagagt ccgagagcct 22921 ttaccatcaa ttgttatttg tttcccatct gatagtgtta ctgtctcctt agaaagtcca 22981 ttcgttgtat ctgcacccgt aatcgtaacc gtaccaggta tagttacact tcctccagcg 23041 agaatgtgca gcgaagctcc tgtgtaactg gcaaaactca catctccact agcccggata 23101 atcggatcgt aaggactgaa caaacctcct aaactcccat ccagtttttc aatccgaaaa 23161 tttcctccag cccagtagtg agcatcgcct cccacagtgt tgagcgatcg caacaccata 23221 tcccctcccg aaataagccc actggaagaa ttatttaagg caaaaatatc aactccttgc 23281 ctaccctgaa ctaataactt tcctcctgct tgagcgacaa aaggattcgc cacactatcc 23341 cgcacccgca ccgtatcccc cgccagcaaa ttcaaatcac cagtcgtacc aagctgactc 23401 
tccgccaacg tcagattatg attcgctgtc agtgttgcag tcttagccgt gacgttgcgt 23461 gcaaccacat caccatccac tacaggcaaa cccgatcccg tcaactccac ttgtccgtta 23521 ctgttcactg ttaaccctgg atgcgacttt tcatcaacgc ttctcagtaa ctcagtcaga 23581 ggcgatgaac ccacactcac cacagatgac gaagtatcaa tatttagcaa ttgtcctgat 23641 ggactcaatt tcaccacact cccaccaggc actgctgcaa ctgcaacctg tcctcccggt 23701 gctgacaact gcccagtgct aaccaccgta ccacccaaca gcgttaggtt ttgtcctgtg 23761 cctgctaata aattccctga attcacaatg gcacttggct gtgcctggtt aaattgcact 23821 cccaaaggta ctgtcacagt tagcaatggt ggagctgtgg gattcgtagc gctaaactgt 23881 tgcccattcc caaaattcaa actgttcgct gtactgccta caaacgaacc accaatatct 23941 aacgaagcat tgggaccaaa aataatcccg ttggggttga gcaagaatag attggcattc 24001 cccagcacac ctagtttccc attaatattt gaggaattgt ttcctgtgac tcggctgaaa 24061 atgttctcaa tcccagccgg attactaaaa taagctcctc ttccttccct gatgccaaat 24121 tcctgaaaac tgtggaacag attcgcacct cgaattgctc ctccatcaat gcgatcgctt 24181 ggtaaaccct tgatatccac attgggcgta acaacagaac tttcagtacc cagcgtctta 24241 tctggaacaa tttgggcaac tgcactattc caactgaaaa tcacaccgct agttgctaca 24301 aatgcgatcg ctatttgcca caactgctta cacctgatgt atattgtcat gaaatttacg 24361 ttctccaata ttcgatttga gttgaattag cttttatttt gtatttttca agacaaatgt 24421 tcgtaagctg ttagacattt agtttgcaat agcagcaact acagcctaac cagtcactaa 24481 tttattggta agtttttcca tcatctatca gtaagaacat taatattctc tgcgtcaact 24541 gtggattatt gctgtctgat agagcttttt gtaaactttc tttctacaat ttatttaggt 24601 atcctttggc aggcatagct ctagttaact agcttatttc atttccattt tatgcaattt 24661 agatattgaa cagcttaaca ttaatattct ctgcgtcaac tgtggattat tgctgtctga 24721 tagagctttt t // LOCUS NODE_1255_length_24454_cov_5.02746024454 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 24454) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24454) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24454 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(547..1413) /locus_tag="DP116_11075" CDS complement(547..1413) /locus_tag="DP116_11075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197452.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="branched-chain amino acid ABC transporter permease" /protein_id="PRJNA477356:DP116_11075" /translation="MNAQFAQLIVNGIALGSIIALASVGLTLTYGILRLSNFAHGDFL TLGAYLTLLVNSFGVNIWLSMAIAAIGTVAAMLLSEKLLWSHMRSIRATSTTLIIISI GLALFLRNGIILIWGGSNKTYDLPVVPAMDILGVRVPQNQLLVLFLAVAAIVALHYIL QNTKIGKAMRAVADDLDLARVSGINVDQVILWTWVIAGTLTSLGGSMYGLIGAVRPNM GWFLILPLFASVILGGIGNPYGAIAGAFIIGIAQETSTVLLGAQYKQGIALLLMILVL LIRPKGLFKGTI" gene 1652..2479 /gene="larE" /locus_tag="DP116_11080" CDS 1652..2479 /gene="larE" /locus_tag="DP116_11080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207283.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent sacrificial sulfur transferase LarE" /protein_id="PRJNA477356:DP116_11080" /translation="MILTEKFEKLKALFLEMEQALIAYSGGVDSTLVAKVAYDVLGDR ALAVTAMSPSLLSEELEDASLQAAMIGIRHQIIQTHEMENPNYTSNPVNRCYFCKSEL HDTLKPLAIKLGYPYVVDGVNADDLHDYRPGIQAAKERGARSPLAEVGITKAQVRQIS KELGLPWWEKPAQPCLSSRFPYGEEITVAKLQRVGRAEIYLRKLGYQNLRVRSEGDTA RIELPPEQIKDFVLKTDLQSIVSAFGEFGFIYVTLDLEGFRSGKLNQVLNREVASMK" gene 2720..3340 /locus_tag="DP116_11085" CDS 2720..3340 /locus_tag="DP116_11085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_11085" /translation="MIQPLRKLVTFEEFAKWKPEDGRYELHDGVIVKMPQPLGEHEEI ILFLVEKLTLEYSRLNLAYGIPKTVLVKPPENESCYSPDVLILNRSNLVNEPLWKKES TVIQAASVPLVVEVVSTNWRDDYYKKLADYEGIGIPEYWIVDYLALGARKFIGNPKQP TISIYNLVEGEYQVNQFRGDERIQSLTFSELNLTAQQIFEANTISL" gene complement(3374..3565) /locus_tag="DP116_11090" CDS complement(3374..3565) /locus_tag="DP116_11090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312783.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11090" /translation="MNTKQKIQAEIDSLNEDYLDELYLLLKDFTQSKQHSKKPSFMSK LKQIKIDAPEDFSTNIDEK" gene complement(3681..4826) /locus_tag="DP116_11095" CDS complement(3681..4826) /locus_tag="DP116_11095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215553.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CapA family protein" /protein_id="PRJNA477356:DP116_11095" /translation="MLNQKLIKLVSFSFISFCFCLGIGIGLFIRVEQLQRTNASTTST DTQPIPFSTPEYTSGDTITIQAVGDIIPGTNYPNYKLPRNRNLLLPNSVRAYLQKADI LFGNFESSLTNYPHSAKDISRGQVFAFRSPPKYAQLFADVGFDVMNIANNHALDFGYV GFQDTVKNLKAVGIETLGHKNQILLLKVNNIVVGMIAFAPYEFYNSIHDLETAKALVQ KAKTQANVVIVSMHAGAEGTNALRVHDKTEFFYGENRGNSIQFARTMIDTGADLVIGH GPHVPRAIEIYNRKLIAYSLGNFLGYRTLSTTAEAGYSMILEVKLNSKGDFVFGKIIP VHLNAQGIPHIDQRFRTVGLLRYLNNKDFPDDLVKINKKGEIVVADK" gene 5259..6083 /locus_tag="DP116_11100" CDS 5259..6083 /locus_tag="DP116_11100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="protein-glutamate O-methyltransferase CheR" /protein_id="PRJNA477356:DP116_11100" /translation="MLKEQLEDIEIQLLLEGIHRYYGFDFRNYALASIKRRIWNIIRA EGLTTISGLQEKVLHHPECMERFLCSLSVNVTTMFRDPNFFLTFRQKVVPILRTYPFI RIWHAGCSTGEEVYSLAILLHEEGLYHRCRLYATDINEMVLKKAKAGIYPLDAMQDYT QQYLQAGGKKAFSEYYTAAYDHAIFSSCLKENMVFSQHNLATDNSFNEFHIIFCRNVL IYFNKILQERVINLFHESLVPFGILGLGRQETLRFTPHERDYEELEGGEKLYRRIR" gene 6109..6693 /locus_tag="DP116_11105" CDS 6109..6693 /locus_tag="DP116_11105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315802.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chemotaxis protein CheB" /protein_id="PRJNA477356:DP116_11105" /translation="MTIQLIAIGASLGGLQALEVLLAGLPRNFPVAVAIVQHRYKSSD KKLRVALQQYSALVVVEPQDKEEILPGYIYLAPADYHLMVEVISDTVSYPSFSLCVDA PVTYTRPSIDVLFETAADTYAEKLIGVLLTGANHDGTQGMKKIKARGGKTVVQEPATA ICATMPKAAIAAGVADKVLPLADIAPYLVKICHF" gene 6860..8728 /locus_tag="DP116_11110" CDS 6860..8728 /locus_tag="DP116_11110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872398.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_11110" /translation="MPFEPKVNVLLVDDHPENLLALEAILDSLGQNVVRATSGAEALR CLLNQDFAVILLDVQMPDMDGFETAALIRQRERSRHTPIIFLTAFSTSDNMVFKGYSV GAVDYLFKPIHAEILKSKVAAFVDLFQKTVEVKRQAAQLAQMNTELRKREEMFRSLSA CSPVGIFLTDTSGKCTYVNPRYQAIYGMTLEESLGDGWTQTIHPEDRGRVIADWYAIS GEGREYTGEFRILTSTGIERWVHMSSSPMLSDQSEVIAYVGTIEDITERKQAQEEHIK FIRAQAAREEAETANRLKDEFLATLSHELRTPLTSILGWSKLLRQRKMDEKAIVRALE TIERNATLQAQLIDDILDVSRIMRGKLQLNLCQITLTSVIATVVNSVRLEAEKKNIQL EYIIEHTKTEEEERREREREGEREGGRERERENILSLSSSPEGTLNSSTSFVVCGDPN RLQQIVWNLLSNALKFTPQDGRVEVRLLLSAESSSETLQPQQHQSPAAKSSLALIRVS DTGSGISPDFLPHVFERFRQADGSITRHQGGLGLGLAIVRYLVEMHGGSVHAESPGVG QGATFTVKLPLIGTRKTPQSEEEKEDEEEGSSEEHLSAKLITDYTTKSENTSCSAG" gene complement(8686..9777) /gene="ald" /locus_tag="DP116_11115" CDS complement(8686..9777) /gene="ald" /locus_tag="DP116_11115" /EC_number="1.4.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194415.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alanine dehydrogenase" /protein_id="PRJNA477356:DP116_11115" /translation="MEIGIPKETKDQEFRVGLSPSSVRVLQENGHRVFVETQAGIGAG FTDDDYINAGAKIVTTTDEAWKCELVVKVKEPLAAEYKFLQKGQILFTYLHLAADRTL TEHLIDCGVTAIAYETVEQPGNNKLPLLTPMSIIAGRLSVQFGARFLERQQGGRGVLL GGVPGVKPGKVVILGGGVVGTEAAKIAVGMGAAVQIIDVNVERLAYLETLFGSRVELL YSNSAHIEAALKDADLLIGAVLVPGRRAPILVSRNLVKQMRPGSVIVDVAVDQGGCVE TVHPTSHTHPVYLEEGVVHYGVPNMPGAVPWTSTQALNNSTLPYVLQLANSGIKALEK NQALAKGVNVQDHRLVHPALQEVFSDLVV" gene 10230..10397 /locus_tag="DP116_11120" CDS 10230..10397 /locus_tag="DP116_11120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410338.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11120" /translation="MTTQPSVTPKLEEPKFGFNEYAERLNGRAAMIGFLMMVVIEYVT NQGVLSWLGLK" gene complement(10390..10587) /locus_tag="DP116_11125" CDS complement(10390..10587) /locus_tag="DP116_11125" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11125" /translation="MRGLYPFIFGQQLGGVGEVGGVGEVGEVGRVGGFYLYFHRPAHP TLNEYITAWGFYQKKLSKNPT" gene 11092..11388 /locus_tag="DP116_11130" CDS 11092..11388 /locus_tag="DP116_11130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315806.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11130" /translation="MNAVFPRFLKSSYRREPLISVLITMGVIDALIGGLDDSWSLFAF GLGTTGMALAYKWWRIQQRQPLPEEPVVQHYLPSRSSSAALPMLSVSKKKPPYQ" gene 11660..12658 /locus_tag="DP116_11135" CDS 11660..12658 /locus_tag="DP116_11135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194418.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="WD40 repeat domain-containing protein" /protein_id="PRJNA477356:DP116_11135" /translation="MSVAGAVAITSLIQIPSASPVEFQVTPVAQPSSVANPQLLYTFT EHSGTIQSLAFTPDSRILISGGYDNEGIIRLWDMTTGRRKGIIKRAHKTAVESIVISP DGQTLASCSDDNTINLWNLKSYKFTRSFVEHTSNVLSLAMTPNGKVLVSGALDGIRMW DLLQQRPLATLTRFDNSINTVAISPDGQTLASGDNVGVIKLWDLNSGRLIRTISGAHS NTVTKIVFTPDGRSFISASRDRTIKLWSVTTGQSVRALTGHNNWVNDIAINPNGQTLA SAAKDGIKLWNLTTGELITTLYGHSDWVSAIAFSPDGTKLASGGFDTRVNVWLLGL" gene complement(12737..14698) /locus_tag="DP116_11140" CDS complement(12737..14698) /locus_tag="DP116_11140" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11140" /translation="MTTSTVDTLNSVIPSAIDVPSATELAGAGSYSAVGNYSAADYGV SDPLTSGGSSLPVDTYSPSTSAPTGGSSNPISGGASGSLTPISGSASGSSNPISGSTS GSSNLFSASQGAGSFGGSSQGAGGLQSLGLGTGTTNTFAGTANNPYAGGGGQPFGSLG NVVVGESQWIIGTEITPERFSKAINYTVEELVDSVIGKLSNRIYQSGITISSSGGNPF AGGNNPFGTTNAPALDYLKQAYGTNFPIPKSAESLVTSVYSQTLPSGSATTSGAGAGT SGASAGTSVAGTGTTSGASAGILGASGGNPLVPTNNTAIAPGSIPQQPSVDIVQVLFG QDLKGAGGSTSPSDILRIVQDDLLNFTKTVNSATGKKLFSGNNTPLKSPSDLLTLYKN DIVGANSSINTGTQNAGGDPYNNVFANSGSQAVQPPTDIISTVMSGLMPFSGSDNIFN TPNGGIPIGYGNKDFGGNNAAIGNANWNYGQSNASIGNANWNWDSTKNNTTIGNGNWH LDSSQNNRTIGNGNWYWESTKDNVTLGNGNWDFGNNNTTIGNGNWDFGSNNTIIGNGN HVFTSNSVVIGDGNWSVIIDKSNTAAGDFLGKLDNLVLSVGIKDAADSLVNSLFSKFA DALYPLTSDLGESGLKTYNQLFYYGANNT" gene complement(14890..15456) /locus_tag="DP116_11145" CDS complement(14890..15456) /locus_tag="DP116_11145" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11145" /translation="MATSNTNSAASSDSVFPRLPSNVKSDIQTNISTFTDIVNQYRSS NAAASDDQPLAPVVDLVNAAYGSNSPFLNGNASGSSGNNTVAGSGSVSSGSSLVTSGS SVFPLRSFPWDGSFGASPSEVLKLVEAEISSSYNLGDQNFGVGSQLPQSRSDVLELLH SDITTFSKAIDSLQSGNNPLASTSTPIV" gene complement(15664..16218) /locus_tag="DP116_11150" CDS complement(15664..16218) /locus_tag="DP116_11150" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11150" /translation="MATDNNNSGNQSSGDLGSVLQQSPWGRVINEVASPDTVIQGFSN ATGGGTIGGAPSGAGSGSPFTNFGNPNASGSPLTGGINPWAAIGTGGSSASSGGSSTD VLTGAPSGGGSTGSFGGIDFGSASQSPSFQSRAEIDGIVYSSLNEAGVSVPLSGAGSF DGGASGFGGGSTGGFGDSNQIAGF" assembly_gap 16536..16545 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(16790..17869) /gene="aroF" /locus_tag="DP116_11155" CDS complement(16790..17869) /gene="aroF" /locus_tag="DP116_11155" /EC_number="2.5.1.54" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317609.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-deoxy-7-phosphoheptulonate synthase" /protein_id="PRJNA477356:DP116_11155" /translation="MIIIVKNGTPAEEITRISQELHDTWGLTVEKSVGQNKVVLGLIG ETATLDPMQIQNNSPWIEQVLRIQKPFKRVSREFRHGEASQVVVPTPNGPVYIGEQHP VVLVAGPCSVENEEMIVETAKRVKAAGAKFLRGGAYKPRSSPYAFQGHGESALDLLAA AREATGLGIITELMDAADLPALSRVADVIQIGARNMQNFSLLKKVGAQDKPVLIKRGM SATIDEWLMAAEYILAAGNPNVILCERGIRTFDSKYVRNTLDLSVIPVLRQLTHLPIM IDPSHGTGKSEYVTPMAMAALAAGTDSLMIEVHPNPAKALSDGPQSLTPEKFDRLVQE LSVLGKVVGRWSTPDQRSLVALGSV" gene complement(18095..19327) /gene="trpB" /locus_tag="DP116_11160" CDS complement(18095..19327) /gene="trpB" /locus_tag="DP116_11160" /EC_number="4.2.1.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017804458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tryptophan synthase subunit beta" /protein_id="PRJNA477356:DP116_11160" /translation="MSDINNKVVSTTARPDSLGRFGKFGGKYVPETLMPALSELETAY EQYRNEASFQEELQKLLKDYVGRPNPLYFAERLTAHYALTDGTGPQIYLKREDLNHTG AHKINNALGQVLLAKRMGKQRIIAETGAGQHGVATATVCARFGLQCVIYMGIHDMQRQ ALNVFRMKLMGAEVRPVEAGTGTLKDATSEAIRDWVTNVETTHYILGSVAGPHPYPKL VRDFHAIIGQETRAQSQEKWGGIPDILLACVGGGSNAIGLFHEFVDEPSVRLIGVEAA GEGVDTEKHAATLTRGRIGVLHGAMSYLLQDDDGQVIEAHSISAGLDYPGVGPEHSYL KELGRAEYYSITDSEALGGLQLLSQLEGIIPALETAHAIAYLEKLCPQLEGSPRIVIN CSGRGDKDVQTVATILNP" gene complement(19429..20256) /gene="trpA" /locus_tag="DP116_11165" CDS complement(19429..20256) /gene="trpA" /locus_tag="DP116_11165" /EC_number="4.2.1.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408006.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tryptophan synthase subunit alpha" /protein_id="PRJNA477356:DP116_11165" /translation="MTSISQRFQSLRNSNQCALIPFITAGDPDLETTKEALRILDRNG ADFIELGVPYSDPLADGPVIQAAATRALHNGTKLEHVLEMVADVSPNLQAPIILFTYY NPILNRGIKSFLAQVASVGVRGLVVPDLPLEEAEDLINTGTEFGIEVILLVAPTSSQE RIEAIAKYSQGFIYLVSVTGVTGVRSQLQERVKTLLTQMRSITDKPIGVGFGISGPEQ ARQVSDWGADGVIVGSAFVQRLANGSPTEGLQAIEQLSQELKAAIAQPSLQTVGSIQ" gene complement(20298..21131) /locus_tag="DP116_11170" CDS complement(20298..21131) /locus_tag="DP116_11170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317605.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="indole-3-glycerol phosphate synthase TrpC" /protein_id="PRJNA477356:DP116_11170" /translation="MKQVATTNVRPRHILEEIVWYKKQEVAQMRQDLPLATLQQQLNA APSVRNFLTALKQNPHQPSLIAEVKKASPSRGVLRADFNPVAIAQAYEQGGAACLSVL TDEKFFQGSFDNLRAIREQVALPLLCKEFIIDIYQIYYARVAGADAVLLIAAILSNEQ LQDFLRVIHDLGMKALVEVHTLAELDRVLKLDDLRLVGINNRNLEDFTVDIGLTQQLV EERRSKLQSFGITIVSESGLYTPADLSLVAQAGARAVLVGESLVKQADLEQAVRSLLH S" gene complement(21189..23390) /locus_tag="DP116_11175" CDS complement(21189..23390) /locus_tag="DP116_11175" /EC_number="4.1.3.27" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317604.1" /note="trpE(G); catalyzes the formation of anthranilate from chorismate and glutamine; contains both component I and II; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="anthranilate synthase component I" /protein_id="PRJNA477356:DP116_11175" /translation="MIVDSHCYTTLGGIKISRSISEVQTDTALDEILFHLNSQRGGLL KSSYEYPGRYKRWAIAFVNPPLEVTTRERAFTLRALNKRGEVLLPILFERLSVHSQLQ EVKLENDHISGFVKPTVQLFAEEERSKQPSAFTVIREILHAFSSDEDEHLGLYGAFGY DLVFQFESIPKRLERPTDQRDLVLYLPDELLVVDYYLQRAYRIQYDFETAHGSTKNLP RTGDSVDYKGERHVPTLTSDHQPGKYANQVEVALDYFRRGDLFEVVPSQSFFESCEEP PSKLFQTLQQINPSPYGFIFNLGGEYLIGASPEMFVRVEGRRVETCPISGTITRGQDA LDDAEQIRQLLNSGKEEAELTMCTDVDRNDKSRICEPGSVQVIGRRQIELYSHLIHTV DHVEGLLRPEFDALDAFLSHTWAVTVTGAPKRSAIQFIENHERSARRWYGGAVGYLNF NGNFNTGLILRTIRLKDSIAEVRVGATILYDSLPAAEEQETMTKAAALFETIRRAKQG KKSEESQPTKLNTYIPNVESGKRILLIDYEDSFVHTLANYIRQTGATVTTLRHGFSES LFDTERPDLVVLSPGPGRPNDFRVPETVQACLRRQIPLFGVCLGLQGIVEAFGGELGV LNYPQHGKSSRIFVTDSNSVLFKGLPESFCVGRYHSLFALPEKLPAQLKVTAISNDDV IMGIEHQTLPVAAVQFHPESIMTLAGELGLAIIKNVVRAYTQTQESLVISQ" BASE COUNT 7134 a 5286 c 5209 g 6815 t 10 others ORIGIN 1 gtacaacaca aaagcagcca tctcaaaagc cacctgtcaa agagataata ccagcacagg 61 gttggatata taacgaaaaa ggtgaggtgt tgctggttgg ttatgatccg actaagactg 121 gtccacaacg taagcagcca gcaccgacta gtaattgtgc tgcggtgaaa tagatatgta 181 gaacgtttaa ggtttgattc cacgttgatt tgacctcacc ccgcccttcg ggcacccctc 241 tccttattaa ggagagggga ggttttggcg taagccaaag ccggggtgag gtaacacgcg 301 tgttggattc atgagtatta agtagtcgtc agaaattccg tacaccaatt aagatatgta 361 gaacgtttaa gcgctttgat tcggtgttga tttaacctca ccccgccctt acgggcaccc 421 ctctccgcga gttcggagag gggaggtttt ggcgtaagcc aaagccgggg tgaggtcaca 481 cgcgcgttgc attcatgggt attaagtagc cgtcatcaac tgcgttcatc atcgctactt 541 aatttctcaa atcgtacctt tgaacaaacc tttgggacga atgagcaaca ctaaaatcat 601 cagcaacaac gctatccctt gtttatactg agcacccagc aaaacagtac tggtttcttg 661 tgcaatacca atgataaaag ctcctgcgat cgcaccgtaa ggattcccaa ttcctcccaa 721 aatcactgaa gcaaacaaag gtaaaatcaa gaaccaaccc atgtttggtc ttacggctcc 781 aatcaagccg tacatacttc cgcccaagga tgtgagagta ccagcaatca cccaagtcca 841 taagatgact tggtccacat taataccaga aacccgagct aaatctaaat cgtcagcaac 901 agcccgcatt 
gctttgccaa ttttagtgtt ttgtaaaatg tagtgtaatg caacaattgc 961 tgctacagct aggaacaaaa ccaacaattg attttgtgga actctcacac ccagaatatc 1021 catagccgga accacgggca aatcgtaagt tttgttacta ccgccccaga taaggataat 1081 cccattacga agaaataagg caagcccgat agagataata ataagagtgg tggaagtagc 1141 acggatagaa cgcatatgag accatagcaa cttttccgac aacagcattg ctgctactgt 1201 gccaattgca gcgatcgcca tagacagcca aatattaaca ccaaagctat tcacgagcag 1261 cgtcagataa gctcccaacg tcaaaaaatc accgtgagca aagttagaca accgtaaaat 1321 cccgtaggtg agagtaagtc caacagatgc taaagcaata atgctaccca aagcaatacc 1381 gttaacaatt aattgagcga attgtgcatt catagtcgtt acatctaaaa ttcataactt 1441 gtatagcagg aaatagggaa cagggaacag ggaacaggga acagggaata cggaacgctt 1501 aacagtgaac agtgacaagt gataactgat cgctggtaac tggtaactgg taactggtaa 1561 ctggtaactg ctgtatcttt ttcccttatt cctttattgc ccacctaaaa tggaattgat 1621 ggggcgcgat accatttgga caagattgga aatgatacta acagaaaaat ttgaaaaatt 1681 aaaagcatta tttttagaaa tggagcaagc tttgatagct tactctggtg gagtggatag 1741 cactttggta gccaaggtgg cttatgatgt tttaggcgat cgcgctttag ctgtcacagc 1801 tatgtctccc tccttgttaa gcgaagagtt ggaagacgcc tcccttcaag cagcgatgat 1861 tgggattcgt catcaaatca ttcaaacaca tgagatggaa aatcccaatt acacttccaa 1921 ccctgtgaac cgctgttatt tttgcaaaag cgaactacac gacactctca aacccttagc 1981 gattaaattg ggttatccgt atgtggtgga tggagtcaac gcggatgact tacatgatta 2041 tcgcccagga attcaggcag caaaagaaag aggtgcgcga tcgcccttag cagaagtggg 2101 tatcaccaaa gcccaagtcc gccaaatttc caaagaactc ggtttacctt ggtgggagaa 2161 acccgcgcaa ccttgtttga gttcacgctt tccctacggt gaagagatta ccgtcgctaa 2221 gttacaacga gtcggtaggg cagaaattta tttacgcaag ttgggttacc agaatttgcg 2281 ggtgcgttct gagggagata cagcacgaat tgaattaccg ccagaacaaa tcaaagattt 2341 tgttttaaaa actgatttgc aatctattgt ttctgctttt ggtgagtttg gttttattta 2401 cgtcacatta gatttggaag gttttcgtag tgggaagtta aatcaagttt tgaatcgaga 2461 agttgcgagc atgaaatgat aatttgttga catcttcgag caatctcgtc cgcccagaac 2521 tcaagttctg ggctaatagt ataagtctac ttaagtagac tgaaaataaa atcaccgaaa 2581 gaatttagtc ctcttgagag 
gacttttgct attagcctgg ggtttctaac cctaggcgat 2641 cgcacaaagc ctgctaatcg atattgacaa caaaatcctc aaatgagtat ccggacaaga 2701 gaatgaacgg gagagaaaaa tgattcaacc cttacgcaaa ctagtaacat ttgaagaatt 2761 tgcaaagtgg aaaccagagg atgggcgtta tgaattgcat gatggagtca ttgttaaaat 2821 gccgcaacca ctaggagaac atgaggaaat tatactattt ttagttgaga aactcacttt 2881 ggaatatagt cgcctcaatc tagcctatgg tatccccaaa acagtgttag tcaaaccacc 2941 tgaaaatgaa tcatgttatt caccagatgt gctgatacta aaccgttcta atttggtaaa 3001 tgaacctctg tggaaaaagg aatcaaccgt tattcaagcc gcatcagttc ctttagttgt 3061 cgaagttgtt agtacaaact ggcgtgatga ttactataaa aaattggctg attatgaagg 3121 aattggcatt cctgaatact ggattgtaga ctatcttgct cttggcgcga gaaagttcat 3181 cggtaatccc aaacaaccaa ccatttctat ttacaaccta gtcgaaggcg agtaccaagt 3241 taatcagttt cggggtgatg aacgcatcca gtctttaaca ttttcagagt taaacttaac 3301 ggcacaacaa atttttgaag ctaataccat ttctttgtga ggcggcgcca aattttcttg 3361 acttcttctt aacttatttt tcatcgatat tagtactaaa atcttcgggt gcatcaatct 3421 ttatttgctt cagtttcgac atgaaactag gctttttaga gtgttgtttg gattgtgtaa 3481 aatctttgag caagagatag agttcatcaa gataatcttc gttgaggctg tcaatttctg 3541 cttggatttt ttgttttgta ttcatgttta tttttctgtt ttatcctaat tatagacaat 3601 tttagcatag ggaattaaca tagctgaatg agcaattctc tgtgccctta caaacaatga 3661 tcgcactcgt agctactcta ctacttatcc gccactacaa tctcaccctt tttattaatc 3721 ttcactaagt catcaggaaa atctttatta ttcaaataac gcaacagtcc cactgtccta 3781 aaacgctgat caatatgagg aattccttga gcattaaggt gaacaggaat aattttcccg 3841 aacacaaaat ctccttttga gttcagtttc acttctaata tcattgaata gccagcttca 3901 gcagtcgtgg ataaagtccg gtatcctaaa aagtttccca aagaataagc gatgagtttt 3961 ctattataaa tctcaatcgc tcttggaaca tgaggaccat gtccgataac taaatctgcg 4021 cctgtatcaa tcattgtccg ggcaaactga atagaatttc ctcggttttc tccataaaaa 4081 aattctgttt tatcgtgaac gcgcagtgca ttagttcctt ctgctcctgc atgcattgat 4141 acaatcacaa catttgcttg tgtcttagct ttttgtacga gtgctttggc tgtttctaaa 4201 tcgtgaatcg aattataaaa ttcataggga gcaaacgcaa tcatcccaac gacaatatta 4261 ttcactttta ataagagaat ttgatttttg 
tgacctaatg tttcgatacc aacggcttta 4321 agattcttaa ctgtgtcttg aaaccccaca taaccaaagt ctaaagcatg gttatttgct 4381 atattcatca catcaaaccc aacatcagca aaaagttgag catactttgg tggggagcga 4441 aaagcaaaaa cttgtcctcg actgatatcc ttggcgctat gggggtagtt tgttaaacta 4501 ctttcaaaat taccaaataa aatatcagct ttttgtaaat atgctctcac agagtttggt 4561 aaaagcagat tgcggttacg tggaagtttg tagttagggt aatttgtgcc aggaataatg 4621 tctccaacag cttgaatcgt gatggtgtca ccagatgtat actctggtgt agaaaatgga 4681 attggttgtg tgtcagtaga tgttgtagag gcgtttgttc gttgcaattg ttcgacacgg 4741 atgaataatc ctataccaat acccaagcaa aagcagaaac ttataaaact gaatgagact 4801 agttttataa gcttttgatt tagcatattt atattgttta tattatggtg ccagtatatc 4861 gcagaagtct tgatgaatga acagctttac agaaaattgg tattagaggt tgactttata 4921 acgagctata ccaaatccgg atctaatacc ccctttatta tttagtccgc gtaggcggac 4981 tttgtttgtg tagtcgcgat ttctaatcgc cacagtagta gggtgcgtta tgcctctggc 5041 taacgcacca tcatcaggat ttggtgcgtt atgcactctc acacaacgcc agatactaca 5101 acgggacgcc aggtgctaca agtcgggaaa cccgcccaac gcactggctc gggaaccccc 5161 gcaccgtact ggctccccta cagatactgt tcacttttcc ctgttaagag ttccctaact 5221 caactagtaa attcataaac caaacctgat tcctatatat gcttaaagag cagcttgaag 5281 atattgaaat tcaattgtta ttagaaggta tacatcgtta ttacggattt gattttagaa 5341 attatgcctt ggcttcaatt aaacgacgaa tttggaacat cattcgagcc gaaggtttaa 5401 ccactatttc tggattgcaa gaaaaagttc tccatcatcc cgaatgtatg gagagatttt 5461 tgtgtagtct ctcggttaac gtaacgacta tgtttcgtga ccccaacttt tttctaacat 5521 ttaggcaaaa agtggttcca atattacgaa cttatccttt tattcgcatt tggcatgcag 5581 gatgttctac tggtgaagaa gtttattctc tggcaatttt actacacgaa gaaggtcttt 5641 atcaccgttg tcgtttgtat gcaactgata tcaatgagat ggttttaaaa aaagcaaagg 5701 ctggaattta tccattagat gcaatgcaag attatactca acaatattta caagcgggtg 5761 gaaaaaaagc tttttcagaa tattatacag ccgcctatga ccatgctata ttttcttctt 5821 gtttgaaaga aaacatggta ttttcacaac ataatttagc gactgataat tcttttaatg 5881 aatttcatat tattttctgt agaaatgtct taatttactt taataaaatt ttacaagaga 5941 gagtgattaa tctttttcat gaaagtcttg ttccgtttgg 
aattttgggc ttgggacggc 6001 aagaaactct gagatttact ccccatgagc gagattatga ggaattagaa ggtggggaaa 6061 agctttatcg tagaattcga taagaattaa gaatgagtcg ttgggagtat gacaattcaa 6121 ctcattgcta ttggtgcatc tttaggaggg ttacaagccc ttgaagtttt actcgccgga 6181 ttgccaagaa acttcccagt cgctgtggca attgttcaac atcgttacaa atcttcagat 6241 aaaaaattga gagttgcatt acagcagtat agtgctttag ttgtcgttga gccgcaagat 6301 aaagaagaaa ttttacccgg ttatatatat ttagctccag cagattacca tttgatggta 6361 gaagtaataa gcgatacagt ttcttatcca agtttttctt tatgcgttga tgctcctgtg 6421 acttatacac gaccgtcaat agatgtgcta tttgaaactg cggcagatac ttatgctgaa 6481 aagttaattg gagttctctt gacaggagca aatcatgatg gtacacaagg aatgaaaaaa 6541 attaaagcac ggggtggcaa aactgtagta caagaacctg caacagcgat ttgtgcgaca 6601 atgcctaaag cggcgatcgc agcgggagtc gcagataaag ttctaccttt ggcagatatt 6661 gcaccttacc tagtgaaaat ttgtcatttt tgagagagtt cttgccgaat tcttcataaa 6721 caaaagttct caactactgt cattcaagaa tcactaaatt tctaacttaa gatagtgtca 6781 ataaaataag cccaaacgtc acagtcaata gaaagacgtg aaaacgtttt tatctattac 6841 ctaaagcata cagatagaaa tgccatttga accgaaagtt aacgttctgt tggtagatga 6901 ccatccagaa aatttgttag ccctagaggc aattttagat agtctgggtc aaaatgtcgt 6961 gagagctaca tccggcgcgg aagcactgcg atgtttgctc aatcaggact ttgcagtgat 7021 tctgctggat gtgcaaatgc cggacatgga cgggtttgag acagcagcct tgattcgaca 7081 gcgagagcga tcgcgtcaca caccaattat ttttctcaca gcatttagca ccagtgacaa 7141 tatggtgttt aaaggatatt ctgttggtgc cgtagactac ctcttcaaac ccattcatgc 7201 agaaattttg aaatccaagg tagccgcgtt tgttgatttg tttcaaaaaa ctgtcgaagt 7261 caagcgacaa gcggcacaat tggcacagat gaatactgaa ctgagaaagc gcgaagagat 7321 gtttcgctct ttaagtgctt gctcgccggt gggcattttt cttacagata catcaggtaa 7381 atgcacttat gttaacccac gatatcaagc tatctatggg atgacgctgg aagaaagttt 7441 gggcgacggt tggacacaaa caattcatcc agaagataga gggcgagtga ttgctgattg 7501 gtatgctata tctggtgaag gaagggaata cacaggagag tttcgcatac tcacctcgac 7561 aggaattgag cgttgggttc atatgtcctc gtcgcctatg ctttccgatc aaagcgaagt 7621 tattgcatat gttggcacga tagaagatat tacagagcgt aaacaagcac 
aagaagaaca 7681 catcaagttc atccgcgcac aagcagcccg agaagaagct gaaacagcaa accgtcttaa 7741 ggatgagttt ttagcaactc tttcccatga acttcgtaca cctttaactt caatactagg 7801 ttggtcaaaa ctgctgcgcc agcgaaaaat ggatgaaaaa gcgattgtac gcgctctaga 7861 aacgattgaa cgcaacgcga ctttgcaggc acaactcata gacgatattt tagatgtctc 7921 gcggattatg cgtggtaaac tacagctcaa tctctgtcag attaccttaa catctgtcat 7981 tgcaactgtt gtaaatagcg tacgcttaga agctgagaaa aaaaatattc aacttgagta 8041 cattattgaa cacactaaga cagaggagga agagagaagg gagagagaga gggagggaga 8101 gagagaggga gggagggaga gagagaggga gaatattctt tcactctctt cttccccaga 8161 gggaactttg aattcttcca cttcttttgt tgtttgtggt gatccaaacc gcttacagca 8221 aatagtttgg aatttactga gtaatgcgct caagttcaca ccccaagatg gaagggtaga 8281 ggtacgattg ttactcagtg ctgagtcttc ctcagaaaca cttcaacctc agcaacacca 8341 gtcacctgcg gcgaaaagtt ccttagcact gattcgggtg agcgatacag gtagtggtat 8401 cagcccagat tttcttcccc atgtttttga gcgctttcgt caagctgacg ggagcatcac 8461 cagacaccaa ggtggactgg gtttaggact cgcaattgtg cgttatctcg tagaaatgca 8521 tgggggaagc gttcatgctg aaagtccggg agtcggacaa ggagccactt ttactgtgaa 8581 gctaccgctg attggtacaa gaaaaacacc acaatcagaa gaagaaaaag aagacgagga 8641 ggaggggagt agtgaagaac acttatctgc taaactgata actgactaca caactaagtc 8701 agaaaatacc tcctgtagcg cagggtgaac taagcgatga tcttgtacgt tcacaccttt 8761 ggctaatgct tggtttttct ccagtgcttt tatccccgaa tttgctaact gcaagacata 8821 gggtaaagtg ctgttgttca atgcttgagt tgaagtccaa ggtactgccc ccggcatgtt 8881 aggaacccca taatgaacca caccctcttc gagatagaca ggatgagtat gggatgtggg 8941 gtgtacagtt tccacgcaac cgccttgatc aacagcaaca tcgactatga cagaaccagg 9001 acgcatttgt ttaactaagt tgcgggacac aaggatgggt gctctacgtc ctggtactaa 9061 aacagcaccg atgagcaagt cagcatcttt gagcgcagct tcaatatggg cggagttgct 9121 gtaaagcagt tctactctag aaccaaacaa ggtttccaga taagcgaggc gctcaacatt 9181 aacatcgata atctggacag cagcacccat acccacagca attttcgccg cttctgtacc 9241 aacaacaccg ccgcctagaa tcacgacttt gcctggtttc actccaggta caccgcctag 9301 caaaactcct ctaccacctt gctgacgttc tagaaatctt gccccaaatt gtacggataa 
9361 ccgacctgca ataatgctca tgggagtgag cagaggtaat ttgttgttgc ctggttgttc 9421 tacagtttca taggcgatcg ccgttacacc acaatcaatc agatgttccg ttaacgttcg 9481 atcagctgct aaatgcagat atgtaaacaa aatttgcccc ttctgcaaaa acttatactc 9541 cgcagccagt ggctctttaa ctttaacaac gagttcgcat ttccaggctt cgtcagtggt 9601 ggtgactatt ttcgcgccag cgtttatgta gtcatcatct gtaaatcctg ccccgatacc 9661 tgcttgagtc tctacaaaaa ctctatgacc attttcttgt aagactctca cactagaagg 9721 acttaaccca actcgaaact cttgatcttt tgtttcttta ggaataccaa tttccattta 9781 ttgcctctta tgaatttttg tatagctata gtttgtcagt ctcattcata atctgttgaa 9841 tcagaaacta cgaattttag aactcctact catccatagt ttgttgaatc agaaactacg 9901 aattttagaa ctcgtattga agtagaaagt attgaagtgc aaagactgca ctaagctttg 9961 agaaggcagc gctgttttac cgtcttccgg agcatacctt tatggggggt taacctttgg 10021 gactctacac ttcaatactc tggatgactc cagttgcttt gataccacca ctgtagttca 10081 aacttttgct cttgacgatt gccaaagtac ttctacaata aactaagaat cgtaaataaa 10141 tgtaaacagt ggcaatcctg caaaatagcc taggcgtgtt gtcactcaaa tacccagtta 10201 aaagttcatt tgtcaaaaag aggttttcaa tgacgacaca accaagcgtt actcctaagt 10261 tggaagaacc taagttcggg tttaatgagt atgctgagcg tttgaatggt cgagctgcaa 10321 tgattggctt cttaatgatg gtggtaatag aatatgttac caaccaagga gtattatcat 10381 ggctcggtct taagtaggat ttttacttag ctttttctga tagaagcccc acgctgtaat 10441 gtactcattc agcgtgggat gagcggggcg atggaaatac aaatagaaac ccccaactct 10501 tccaacttcc ccaacttctc caactcctcc aacttcccca actcccccca attgttgacc 10561 aaaaataaag ggatacaagc cccgcacttc tagggcggct tttgcgctaa aatatacaaa 10621 ggagttagca atccccctct aggctcaaga cgcgttgaaa cagctacgat gagagcaggc 10681 gtagagaaag actgatgggc aatggtagta aacataaatc caacccaaag gaaaaagaca 10741 aaattcaata gttcaactgt cagttagtcg cgtccggaca ttccgagact caaaacggat 10801 gtggagagag tgtaagactt ggttccaagc atttctcaat gaagcgtcaa ccccattcag 10861 cgaccagtta tcctactaca agtaggttgc tggcgtggcg aggaatcacc gcacttatag 10921 ggcggtgagg atgtcaattc atcaaacctt tgctgaagca tataaggtag tagaacggaa 10981 ccgaactata atgatattga ggctctatca caaaagcgtc aattgctctt 
gaggtgctct 11041 caaaaaatat aaatatacat ataaaggtga acaggaacaa aaaagcttgt gatgaatgct 11101 gtatttcctc gttttttaaa atcaagctat cgtagagaac cacttatcag tgtattgatt 11161 acaatgggag tcatagatgc attgattggt gggttggatg atagctggtc gttgtttgct 11221 tttggcttag ggacaacagg aatggcgcta gcttacaaat ggtggcggat tcagcaacgc 11281 caacccttgc ctgaggaacc tgtggtgcaa cattatctac cttcgcgatc gtcaagtgct 11341 gctttaccga tgctgagtgt ttctaagaaa aagccaccat atcagtagat tttggctcta 11401 aaatttgagt catgatattt attattgatt gttgctaaaa aaaaataaaa catggtatgc 11461 tttgctacct atagcgccag tatgatcatt gacgggcgga gctttactgc ctgtcaatct 11521 ggtacaggct atcttgagat atttgaaata tatgaataat cacaataaat tcctcttact 11581 tatccaaaat gcaaagcttt ctgggcgtag aaaggaattt cagtctcaaa taaccagtgc 11641 caagaatttt gtgatactca tgagcgtagc aggtgctgtt gccatcactt cgctcatcca 11701 aataccatct gcaagccctg ttgagtttca agtgacacca gtggctcaac ccagcagcgt 11761 tgctaatcca caacttttgt acacattcac agaacattcg gggactattc aatctctcgc 11821 cttcactcca gatagccgaa tccttatcag tggtggctat gataacgaag gtatcattcg 11881 cttgtgggat atgacaactg gcagaaggaa gggaattatc aagagagcac ataaaactgc 11941 agtagaatct atcgttattt ccccagatgg tcaaactctt gccagttgca gtgatgacaa 12001 tactattaac ctttggaacc tgaaaagcta caaatttact cgctcctttg tggaacacac 12061 aagcaatgta ctatctttag ccatgactcc aaatggtaaa gttcttgtca gtggcgcact 12121 agatggtatt cggatgtggg atttgctaca gcagcgccca cttgcgactt tgacacgttt 12181 tgataactcg attaatacag tggcgattag tcctgatggt cagacgctgg caagtggtga 12241 taatgttggt gtgattaagc tatgggattt gaatagtgga agattaatcc gcacaatttc 12301 aggggcacat tctaatacag ttactaaaat agtttttacc ccagatggta ggagttttat 12361 cagtgcgagt cgcgatcgca caattaaact ctggagtgtt accactggac aatcagttcg 12421 cgccctaaca ggacacaata actgggtaaa tgatatcgcc atcaacccaa acggacaaac 12481 cctagctagt gctgccaaag atggaattaa gctgtggaat ttaacaacag gcgagttaat 12541 aacgacactt tatggacatt ctgactgggt aagcgccata gcctttagtc ctgatggaac 12601 aaagcttgcc agcggtggat ttgatacaag agttaatgtt tggctactcg gtttgtaaat 12661 cacaccaaaa ttcaacgatc aaaacttttt 
tgggttttga tcgttgaact ttggggctag 12721 ttgtctctcg cgttatttag gtgttgttgg caccgtagta gaagagttga ttatatgtct 12781 tgaggccaga ttctccaagg tcacttgtta aggggtacaa agcatcagcg aatttgctga 12841 agagggaatt caccaaacta tcagcagcat ccttgattcc tactgacaac acaagattat 12901 ccaacttacc gagaaagtcg ccagcggcag tgtttgattt gtcaataatc acagaccagt 12961 tgccatcacc aataactacg ctgttgctgg tgaagacatg gttaccatta ccaatgattg 13021 tgttgttact gccaaagtcc cagttaccat taccaatggt tgtgttgtta tttccgaaat 13081 cccagttacc attaccgaga gttacgttgt ccttggttga ctcccaatac cagttaccat 13141 taccaatggt tctgttgttt tgactagagt ctaaatgcca gttgccatta ccaatggttg 13201 tgttgttctt agtagagtcc caattccagt tagcattacc tatgctggcg ttggattgtc 13261 cgtagttcca gttggcatta ccgatggcag cgttgttacc accaaagtcc ttattgccat 13321 aaccaatggg tatgccgcca ttaggagtgt taaaaatgtt gtcactgcca ctgaagggca 13381 tcagaccact cataacagtg gatatgatat cagtgggtgg ttgaacagcc tgagagccac 13441 tattagcaaa cacgttgttg tacgggtcac ccccagcatt ttgagtacca gtattaatac 13501 tgctattggc acccactatg tcattcttat aaagcgtcaa caaatcggaa ggtgatttca 13561 gaggcgtatt gttaccactg aacagttttt tacctgtagc agagttgact gttttggtga 13621 agtttaacag gtcatcttgg acaatcctta agatgtcgga gggtgaggta cttccgccag 13681 cgccttttaa gtcctgacca aagaggactt gcacaatatc tacagatggt tgctgaggta 13741 tactgccagg agcaatagcc gtgttattag taggtacaag cgggttgcca ccactagcac 13801 ctaggatacc agcactagca cccgacgtag taccagtacc agcaaccgag gtaccagcac 13861 tagcacccga ggtaccagca ccagcacccg acgtagtagc agacccactt ggcaaggtct 13921 gactgtagac ggatgtcacc aaactctcag ctgatttagg tatggggaaa tttgtaccat 13981 aagcctgttt caagtaatcg agagctggcg cattggtagt accaaacggg ttgtttccac 14041 cagcaaatgg gtttccacca ctggaggaaa tagtaatgcc actctggtat attctgttgc 14101 tcagtttacc gatgacagag tcaaccaatt cctcgacagt ataatttatc gcttttgaaa 14161 atctttctgg tgtgatttct gtaccaataa tccattgact ctcaccaaca actacgttgc 14221 caagactacc gaaaggttga cccccaccac cagcataggg attgttcgca gtaccagcaa 14281 aggtatttgt ggtgccagtt cctaagccga gactttgtaa gccgccagca ccttggctgc 14341 taccaccaaa 
gctgccagca ccttggcttg cactaaatag gttgctgcta ccactagtgc 14401 tgccactgat tgggttgctg ctaccactag cgctgccact gattggagtc agactaccac 14461 tagcgccacc actgattggg ttgctgctac caccagtagg agcagaggtg cttggagaat 14521 aagtgtcaac cggtagagaa ctgccaccac tagttagggg atctgagaca ccataatcag 14581 cagccgagta attgccaaca gccgagtaac tgccagcacc agctagctca gtagctgagg 14641 gaacatcaat agctgaggga ataactgaat tcaaggtgtc aacggtagaa gttgtcatta 14701 aaaataatcc tcgtaatttt taggaatgaa atctcgtaaa atggggaaat tagatacata 14761 aagcctgcct tattgctttg ctctttggtt ttttaagtgc aggctgttta ttgattagta 14821 gctcttggaa tgttgttggg tttccccgta aaagggtcaa cccaacattt tttatttcaa 14881 actgggacgt taaacaatcg gagtgctagt actagcgagc ggattgttcc cagattgcaa 14941 tgaatcgata gctttactga aggttgtgat gtcactgtgc aagagttcca agacatcaga 15001 acgtgattgc ggaagctgac taccgacacc aaaattctga tcgccaaggt tgtaagagct 15061 tgagatctca gcttctacca gcttcaaaac ctcagaaggc gatgctccaa aggagccgtc 15121 ccaagggaat gagcgcaaag gaaatacact gctaccacta gtcactagac tgctaccact 15181 tgatacactg ccactaccag caactgtgtt gttaccacta gacccactag cattgccatt 15241 taagaagggt gagttgctgc cataagccgc attcaccaaa tcaaccacag gtgccagagg 15301 ctgatcgtca cttgcagccg cattgctact tctatattgg ttgacaatgt cagtaaaggt 15361 tgagatgtta gtctgaatgt cacttttaac attagatggt aaccgtggaa acacactgtc 15421 actactagca gcagagttag tgttagaagt tgccataact aaaaatcctc ctatcgttag 15481 aacaaaaatg ttgttgaact aaggaaacta aaagaagcaa gctagattcg tatgtaactt 15541 tcttctttgg tttcatggag gctagttaat ggctagtagc tcttgaaatc atgttgggtt 15601 gccatgggta ttggttaacc caacaatttt tctttcaagg tgaaacccta agcaaacagg 15661 ttgctaaaaa cctgcaattt ggttgctatc accaaagccg ccggtgctgc cgccaccaaa 15721 tccgctagcg ccaccatcaa agctgccagc gccactcaag ggtacactga cacctgcttc 15781 gttgaggctg gagtaaacta ttccgtcaat ctcagccctg ctttgaaacg atggcgactg 15841 gcttgcgctt ccaaaatcga tgccaccgaa gctgccggtg ctaccgccac cagagggagc 15901 gccagtcaac acgtcagtgc tactgccacc agaggaagcg ctgctaccac cagtgccaat 15961 agcagcccaa gggttgatac caccagtaag cgggctacca gaggcattcg ggttgccaaa 
16021 gttagtaaag gggctaccgc taccagcacc actgggagcg ccaccgatgg taccaccacc 16081 agtagcattg ctaaagcctt gtattacggt gtcaggagaa gcgacttcat taatcacgcg 16141 accccaggga gattgttgca gcacactccc gagatcacca ctagattgat taccactatt 16201 attgttgtca gttgccatta gagaaaatcc tcttacttct aggttaaaaa ttacgaaagc 16261 acaggcatat aaaataggca gaaaatcagc ctatattttt aattttttat tgctctgtta 16321 ttgccaaaac tagtaattat ttatctgggc attacaaatc aagaatggct ttgtttcgga 16381 aattctccaa cgaatgtaga gatcacttaa gtcaaactgt aattaacgct actgtctata 16441 caaaaataag caattttaag aaatttatca atcaaaattc ttttgattga tttgtcaaat 16501 ttttagattt ctctatttta tttttctctt agttannnnn nnnnnatata aaaagtatca 16561 acataaaaat atatcgttgg aaagatgtaa gataccaaat attagaaaaa atttctctga 16621 gcattgatag cagtatctct ggcaatactt ttcggataag tactactcat ttcccttgct 16681 atttaggaaa acttaaagga aaagagggaa aggatgaatt tttccctttc cccctaactg 16741 aaaagtattg gtacctctgt tacaaacaca aaatactaca ctctacttcc tacaccgatc 16801 ctaaagctac caaactacgt tgatcgggtg tagaccagcg accaacgact ttacccaaaa 16861 ctgacaattc ctgaaccaat cggtcaaact tttcaggggt cagggactga ggtccatctg 16921 atagagcctt agcaggattc gggtgaactt caatcatcaa tgaatctgta cctgctgcta 16981 aagctgccat tgccatcggt gtaacatact ctgatttccc tgtaccatga ctagggtcaa 17041 tcatgatggg taaatgtgtt agctgtctca acacaggaat cactgataaa tctagagtgt 17101 tacgaacata tttggaatca aaagttctaa taccccgttc gcaaagaatc acatttggat 17161 ttcctgctgc cagaatatat tctgctgcca tgagccactc atcaatcgtg gcggacatcc 17221 cgcgtttgat cagcacaggc ttatcttgag cgcccacttt ctttagcaag gagaagtttt 17281 gcatattcct ggctcctatc tggatcacgt cagcaaccct agacagtgct ggtaaatcgg 17341 cagcatccat aagttctgtg atgataccca aacccgtagc ttcacgggct gcggctaaca 17401 aatctagagc actctcacca tgtccttgaa atgcataggg tgaagagcga ggtttgtaag 17461 cgccgccacg cagaaactta gctccagctg ccttaacgcg ctttgccgtt tccacaatca 17521 tttcttcatt ttccacggag caaggtccag caaccaacac aacgggatgt tgttcaccga 17581 tgtagacagg accatttggt gtgggaacaa caacctgact ggcttctcca tgacggaatt 17641 ctcgactgac gcgcttgaaa ggtttctgaa ttcgtagtac 
ttgctcaatc caaggactgt 17701 tattctgtat ctgcattgga tctagggtgg cagtttcacc aattaagccc aaaactactt 17761 tgttctgtcc aacacttttt tccacagtga ggccccaagt gtcatgcaat tcttggctga 17821 tgcgagtaat ttcctcagca ggtgtaccat ttttgacgat aatgatcatg aattttgatt 17881 ctccgtatta tttgttggtt tgttttgtat aaactactaa cgagtagcta ctgaatatag 17941 cggttctcat ttggatgcag tacgcttttt gaccccacac gccggtcgcc gtgagagcac 18001 cgaaacaagg actgcagcac tggctcccct tacccctgcc cgaactcgcg gggaggggcg 18061 gttaatagtc aatagtcaat agtccctaaa aagactaagg attcaggatt gtagcaacag 18121 tttgcacatc tttatcccct ctgcctgagc agttgatgac aatccgagga ctaccttcta 18181 actgaggaca cagcttttct aaataggcga tcgcatgagc agtttccaaa gctggaataa 18241 tcccttctag ttgagacaga agctgcaacc ctcctaaagc ctcagagtcg gttatactgt 18301 agtactcagc gcgaccaagc tccttgagat agctatgctc tggaccgacg ccaggataat 18361 ctaaaccagc actaatcgaa tgtgcttcga taacctgacc atcatcatct tgtagcaaat 18421 agctcatagc accgtgcaac acaccgattc gtcccctagt caaagttgct gcgtgctttt 18481 cggtgtcaac accttcgcct gctgcttcaa ctccaatcag gcgcacagat ggctcatcca 18541 caaactcatg gaataaccca atcgcattgg aaccaccacc cacacaagcc aagagaatat 18601 ctggtattcc tccccatttc tcttgagact gagcgcgagt ttcctgacca atgatagcgt 18661 ggaagtcgcg tactaacttg gggtaaggat ggggacccgc aacggaacca aggatgtagt 18721 gggttgtttc cacgtttgtc acccagtccc gaatcgcctc ggaagttgca tccttaagtg 18781 ttccagtacc agcctccacg ggacgaactt cagctcccat cagtttcatc cgaaacacat 18841 tgagggcttg ccgttgcata tcgtggatgc ccatataaat cacgcattgt aaaccaaaac 18901 gagcacatac ggttgcagtt gctaccccat gctgacctgc tcctgtttca gcgataatgc 18961 gttgcttacc catacgcttt gccagcaaca cctgacctaa agcgttatta attttatgag 19021 cacctgtatg atttaaatct tcccgcttga gatagatttg cggccctgta ccatcagtaa 19081 gagcgtagtg tgctgtgaga cgttcagcaa aatataaggg attcggtcgt cccacgtagt 19141 ctttaagtaa tttttgtaac tcttcttgaa aacttgcctc gttgcggtat tgctcatatg 19201 ctgtttccaa ttcacttaat gcgggcatta aggtttcagg aacatactta ccgccgaact 19261 ttccgaatct tcccaaggag tcgggtcttg cagttgtgga tacaacttta ttattgatat 19321 ctgagatgct aaccactgga 
tataacctct tagcttctag taaattttca ggagtcttaa 19381 caacagttaa cagtcaacag ttgtttttga ctgataactg atcactgttc actgtataga 19441 acctaccgtt tgaagagacg gttgggcaat tgcggctttc aattcttgag aaagttgttc 19501 aatagcttgc agtccttctg tgggactacc atttgctaat ctttgaacaa atgcactacc 19561 aacaataaca ccatctgcgc cccaatcgct gacctgacgc gcctgttctg gtccagagat 19621 gccaaagcca acaccaatcg gtttatcagt tatacttcgc atctgagtga gtaaagtttt 19681 gacgcgttct tgtagttgag aacgaacacc tgtcacacca gtcacactca caaggtagat 19741 aaatccttga gagtatttgg cgatcgcctc aattcgctct tgagaactgg taggagccac 19801 aagtaaaatg acttctattc caaattctgt gcctgtgtta attaaatctt ccgcttcttc 19861 taaaggtaag tctggtacga ctaatccccg cacccctaca ctggcgactt gcgccaaaaa 19921 tgacttaata cccctattta aaattgggtt gtagtaagtg aatagaatga tgggtgcttg 19981 caagtttgga ctcacatctg caaccatctc taacacatgc tctaacttag ttccgttgtg 20041 taacgcacga gttgcggctg cttggataac aggaccatct gcgagtggat cagagtaagg 20101 aacacccagt tcaataaagt cagcgccatt tctgtctaag atccgtaaag cttcttttgt 20161 ggtttctaag tcaggatctc cagctgtgat gaagggaatc agggcgcact ggttgctatt 20221 tcttaaagat tgaaaacgtt gggaaatgga agtcatcgga taatgcacct acgtaatcag 20281 tgtttttgta tgacgagcta agaatggagg agacttcgca cagcctgctc taaatcagct 20341 tgtttcacta acgactctcc taccaaaacc gcacgcgcac ccgcttgggc gacaagtgat 20401 aaatcagcag gagtatataa accagactca ctgacgatgg taatgccaaa gctttgcaat 20461 tttgagcgtc gctcttctac aagttgctga gtgagtccga tatccaccgt aaaatcttct 20521 aaattgcggt tgttgatccc taccaagcgc aaatcatcaa gttttagcac ccgatccagt 20581 tcagctaaag tatgaacttc taccaaagct ttcatcccca aatcatgaat gactcgtaaa 20641 aagtcttgga gttgttcatt tgaaaggata gcagcaataa gtagtacagc gtctgcgcct 20701 gcaacccgtg cataatagat ttggtagata tcgatgataa actctttgca cagtagtggg 20761 agtgcgactt gttccctgat agcacgcaaa ttatcaaaac tgccttgaaa gaacttttcg 20821 tcagtcagaa cagatagaca agctgcacca ccttgctcat aggcttgggc gatcgccact 20881 gggttaaagt cagcccgcag aactccacga ctcggcgaag ctttcttaac ctctgcaatc 20941 aggcttggtt gatggggatt ttgctttaaa gcagtcagga aatttcgtac actcggagca 21001 
gcatttaact gctgttgcaa agtcgctagt ggtaaatctt gccgcatttg tgcgacttct 21061 tgctttttgt accacacaat ctcttcgaga atgtggcgtg gacgaacatt tgtagtagca 21121 acttgtttca tagactcctg gtgagatgag ggagtaggga gcagggagta gggagtgagg 21181 gagaataact attgactaat gactaatgac tcttgagttt gagtgtatgc acgcactacg 21241 tttttaatga ttgccagacc aagttctcct gccagagtca taatggattc tggatgaaat 21301 tgcacagctg cgacaggaag tgtttgatgc tcaattccca tgatcacatc gtcattagaa 21361 atggctgtta ccttcagttg tgctggcaat ttttccggta aagcaaacaa cgaatggtat 21421 ctacccacac aaaaggattc tggtaaacct ttgaagagaa cagaatttga atcggtgaca 21481 aaaatccgtg aagacttgcc atgttgagga tagttgagaa ctcccagttc tccaccaaac 21541 gcttcaacaa taccttgcag tcccagacag actccaaaaa gtggaatttg ccgacgcaag 21601 caagcctgga cagtttctgg aacacgaaaa tcattcggtc taccaggacc tggagataaa 21661 acaactaagt ctggacgttc tgtgtcaaac agcgattccg aaaacccatg acgtaatgtt 21721 gtgactgtcg cacccgtttg gcggatgtaa tttgccagtg tgtgaacaaa tgagtcttcg 21781 tagtcgatga gtaagatgcg cttaccagat tcgacatttg gaatgtaggt gttcaatttt 21841 gttggctgag attcttcaga ttttttcccc tgcttcgcgc gacgaattgt ctcaaataaa 21901 gccgcagctt tagtcattgt ctcttgttct tctgctgcgg gcaaggagtc ataaagaata 21961 gtcgcgccaa ctctcacctc agcgattgag tcttttaacc gaattgtccg caaaatcaat 22021 ccagtattaa aattgccgtt gaagtttaaa taaccgacag ctccaccata ccagcgtcga 22081 gcactacgct catggttttc gataaactgg attgctgagc gtttgggtgc gcctgtcact 22141 gtgactgccc aagtatggct caaaaaggca tccaaagcat caaattctgg tcttaataat 22201 ccctcgacat gatccactgt gtgaatcaaa tggctataca attctatttg acgacgacca 22261 atcacttgta ctgagccagg ttcgcaaatc cgtgatttgt cattacgatc cacgtctgta 22321 cacatggtca gttcggcttc ttctttccca gagttaagga gttgacgaat ctgttcggca 22381 tcatctagag catcttgtcc tcgggtaatc gtcccactga ttgggcaagt ctctacacgc 22441 ctaccttcaa ctcgcacaaa catttctgga gatgcgccga taagatactc tccacctaag 22501 ttaaatataa acccataggg actgggatta atttgctgta gtgtttggaa tagtttgctc 22561 ggtggttctt cacaggattc aaaaaagctt tgactgggaa caacttcaaa taagtctccc 22621 cgacggaagt aatctagtgc gacttctact tgatttgcat acttccctgg 
ctgatggtca 22681 gaagttagag tgggaacatg gcgttctcct ttataatcaa ccgagtcacc tgttcgtgga 22741 agatttttgg tactgccatg cgctgtttca aagtcgtact gtatacggta tgctcgttgc 22801 aagtagtagt caacaacgag tagttcatcc ggtagataca gtaccaaatc ccgctgatct 22861 gtaggacgct ctaaacgctt gggaattgat tcaaactgga acaccaagtc atatccaaat 22921 gcgccataca atcccagatg ctcgtcttca tcactagaga aagcatggag aatctcgcgg 22981 ataactgtaa aggcagaagg ttgcttactg cgctcttctt ctgcaaatag ttgaacagtt 23041 ggcttgacaa aaccactgat gtgatcgttc tctaacttga cttcctgaag ttgagaatgc 23101 actgacaatc gctcaaaaag tattggtaga agcacttcgc ctcgcttgtt gagcgctctt 23161 agcgtaaaag ctctctcgcg tgttgtcacc tctagtggtg gattgacaaa tgcgatcgcc 23221 caccttttat atctccctgg atactcataa ctacttttga gtaaacctcc acgctgagag 23281 ttcaggtgaa acaaaatttc gtctagagct gtgtctgttt gcacttcgct gatggagcga 23341 gatatcttta taccgccaag agttgtgtag cagtgagaat caacaatcat gcgcaatttc 23401 tctgaagtag ttatgagtca ttaggaggca gtacgcccag gagggaaacc ctcacttggt 23461 atctggcgtt cattagtcat gcatcaaaat tttgactaat gactttcaac tatcatgtgt 23521 ttgtactatg tcaattaaaa acttacatac tacaaccaaa atgcaagatc ttggaagcaa 23581 aaagtagtat attgatcttg tcccaaattg ggcgttattg tagttgtcta ttcaaacaag 23641 agccttattg tcaaaaggga acgagccgac aaacctattt attatgaata tgtcgcattc 23701 aactaagtca caaggttaaa ccatcaatta ccgactacta acagcctaca attaacagtc 23761 aaaaccttaa gcaaagtttt agatttgaat gctacttggt atgagattag cgtatctaaa 23821 tgttaaagaa tgagatatga acaatttttt gataattgcg tcaatttttt tgcttttgct 23881 tttatagtta agtaaaatct ttataatggt atgaaagtat ttttccgctt ctacctaggg 23941 ttagaaagat tcctttcttt tatgaataga taaagatgga agtttataga ctcaattcta 24001 aaatgtccga atcgtgtcgc ctcgttatag gaagacggtg agaaagcccc gtcccataag 24061 agtgcgggga tgaatcacct caccgtctta gtcctgccgt cattggtcaa tagtccgacg 24121 tatgattaaa cactaaactt ttaaaatacg tacgtagggg cgcaaggcat tgcgccccta 24181 ctaatgactc gaaatccgag ttaatggaaa atacattttg accgccccct ggacatttga 24241 gaacaagacc cccatctgcg ctcatggagg taaaaagtat aatcaatcac taaaatccgg 24301 gcttttaata aaaggggcgg cttgaaaata 
acttaaatta tcagggtagt gctatcaaca 24361 ccagtcaagg tcgctaagaa aatatcgcta atattgtagt tggtgtctcc ggctaccagg 24421 ttgaaagcac tagaatcaac cgccacaaag cgcc // LOCUS NODE_1263_length_24329_cov_5.341600 24329 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 24329) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24329) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24329 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..286 /locus_tag="DP116_11180" CDS <1..286 /locus_tag="DP116_11180" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11180" /translation="CFKPGNPSNAVAPLPQRWTHIYVLYIMHGGVFDSNENRYIFPLK SEADEAGEADEREIFSFLLMCKCPFGVITHSLAMNGGSYALQILLDVVEI" gene 637..3222 /locus_tag="DP116_11185" CDS 637..3222 /locus_tag="DP116_11185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314794.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11185" /translation="MSRLVVLSLGQGNLHEGCATVTAQLGEADNPYRMKLTASLPAAP KIYELYQKWQLVYYAFYQRLSWRLDNEIDDNFEIEEDSVTNISEADIHDLCQLLSKKI NTWLNSTKFRKIDQQLRTQLKPSEEIRFIIETNDHLLRRLPWHLWKFFEDYPYAEVAL SAPEYQKPKKLLVNNRTSQVRILAIFGNSQGIDISQDKTLLDQLSTQAEVKFLIEPNI GRLNEQLWQQNSDILFFAGHSFSQEKGFLQINQTDTITLEQLKYGLRQAISRGLKLAI FNSCDGLGLAQELQDLQIPQVIVMREPVPDVVAQEFLKYFLAEFSSGKSLYTAVRSAR ERLQGLEGEYPCATWLPIIFQNPAEPSMIWTQQSVWTKRKQKKNSSMLVSTVQARGMQ TNIRLPVATSALSKTTALDSLIEMLNGNLLKYQNLPLNHTEILVLRGIWQSQTYNQIA QQGGYSTSYLTKTVAPRLCHKLSDLIGNRVTKKNCRTLLESYVTTQASLKTTLEQHPP VKDFPNFQQEMSPRFPSGLVPLGSPYYIERPEIATQVYEEIIKPGALIRIKAPQEMGK TSLLLRILEYVNGMGYHIVSLNLQEQVDRAILSDLNRFLRWLCANISCQLELEPRLDE FWDEDIGSKVSCSLYLRNYVLEQIDSPVVLALDEVNQIFEQPQVAKEFLPLLRSWYED AKRQPIWQKLRLIVVHSTEVYVPLQLNQSPFNVGLPIELNYFSLEQVQELAQRFGLNW TDDEARFLMDIVGGHPALVHLTLYHLSRGEVALGQLLQSAPTPTSIYYRHIQPYWATL KVQPELAFALSAVMSATKPVKLEPVLAHKLRSMGLIKLDDNLATPSCRLYQEYFQLKC EISAS" gene 3451..4332 /locus_tag="DP116_11190" CDS 3451..4332 /locus_tag="DP116_11190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743829.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydroxyquinol 1,2-dioxygenase" /protein_id="PRJNA477356:DP116_11190" /translation="MKNITLENITQAVIEHGDGGKTHPRLYEIYTSLIRHLHAFVQEV NLTEQELSLLRDFLIRADRYTKEIPNGEIHMLLDLLGISELVVLLHNKSSTATESNLE GPVYVADAPERNMGDRLGIDPDGNTLFLSGRVLDLNNKPIANALLDVWQSNSKGLYDL HDASQAKGNFRGRFRTNSDGSYSIETVVPIGYTIPSSGPCGEMLQLLGRHTLRAGHIH FKLSAAGYIPLTTQIHIDGDPHLDSDTTFAVRSAILKLQKHEAPDKLNAYNQSKPFYT AEFDFVLQPTDQQTDAA" gene 4724..4942 /locus_tag="DP116_11195" /pseudo CDS 4724..4942 /locus_tag="DP116_11195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740103.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(4915..5385) /locus_tag="DP116_11200" CDS complement(4915..5385) /locus_tag="DP116_11200" /inference="COORDINATES: protein motif:HMM:PF06051.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11200" /translation="MPVYSNSEFVFGTAIASHPTFWFYVPYQSSFPAKFVLRDKEGKL IYQIDVTLPKTPGVTSFSLPSTVAPLEMNKQYHWYFKIYCKAQEPPAFADGWIQRTSL NPALKSQLEKATPQQRVALYATNGIWFEALSTAGELRRRNPRDTSWAALFTIFV" gene complement(5434..5685) /locus_tag="DP116_11205" /pseudo CDS complement(5434..5685) /locus_tag="DP116_11205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740114.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(5689..8835) /locus_tag="DP116_11210" CDS complement(5689..8835) /locus_tag="DP116_11210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459142.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AcrB/AcrD/AcrF family protein" /protein_id="PRJNA477356:DP116_11210" /translation="MSFNISAWSIKKPVPTIVLFLILTVVGWFSFLALGIDTNPNIDV PAVQVRVTQPGAGPAELEFQVTKKIEDAVAGLGNIDELRSTVIDGVSTTTINFVLGTN TDRATNDVRNAVAQIRQNLPQDINDPIVQRIEFAGGAIMTYVVKSDKRSVEELSNLVD QTISRALLAVKGVSQVKRVGGVDREIRINLNPDRLQSLSITATQVNDQIRAFNINLPG GRAQIGGSEQSIRTLGSAASVEVLKNYEIVLPNGGSVPLSSLGTVEDSFGEVRQAASL NNQPVVAFQVLRSTGSVMVTVEQGVKAAVEQLQKTLPTDVKLELIFTRATFVEKSYQS TIDELIQASILAVIVIMLFLRDWRATLITGVALPLSIIPTFAVQYALGYTLNNMSLLA LALAVGNLVDDAVVEIENMERHMAMGKSAWQAAFDSSDEVGLAVIASAATIIAVFMPV AFMGGVPGQFFQPFGVTVAVSTIFSTLVARMITPMMGAYLLKDKQPKQGREETKEMRE QHIATGPTSVRRRFQPYKSLLKWALTHKLTTLGVAVAFFIASLMLVPFIPKGLVDSSD IGISTISMELPPGSTLEDTKKVVTQTTNIIKQNPNVVSVLATQEVNSATLVVKLKSKE EGRKISQVEFQQKVRPLFAQIPGTRISFQSAGSVGSRKDLTILLRSDNPKALTQAADA LEKQMRSIPGLVELSSSASLLKPEILVVPNPQRAADLGVTVQSIARTASLATIGDNDA NLAKYNLSDRQIPIRVQIDPKARQDINTIKNLQVPSQNGSLVPLIAVADIRFGSGPAT IDRYDRSRQVSLEANLQGISLGDGLKAVNQLPALQNLPPGVKLQNSGDAKIMADIFSR FGAALGLAVMCIYAILVLLYNNFLHPLTIMAALPFCLGGALIGLMLAQKALGLYALIG IVLLMGIVTKNSILLVDYTIINLEEGKTQRQALLEAGVSRLRPIMMTSLATIAGTLPL ALGIGAGAEVRQSMGIAILGGFTTSTLLTLVVVPVLFSYVDSFQCWILDVAKYGFGKK PQRKIAEDKEVVNLPPAS" gene complement(8922..10442) /locus_tag="DP116_11215" CDS complement(8922..10442) /locus_tag="DP116_11215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux RND transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_11215" /translation="MVKESVSELKVEEPVAFEDANWQKKSVYRKRHSWLIPLLVGTGL GGIIAFGGMRLSSHSVSEKTTLAEKKTVQKVAPTMSVTVTPVETTRVARTLSTKGTVA ARDLTPVLPQANGLQIKKILVNIGEIVKAGQVMAVLDDSVLQDQIRQAKADVEAKQAD VASKQADLASKQAVVVSTRATVVSNQAIVQQKQADFAQAQAKLRDAQINFRRTQELTS QGAISQQQLDTATTNLATATEAVSLAQANIKSAQANVSSAQANIGSAEANVKIAQANI NSAQASVKSSTAKMEQLKTQLGQTLLRAPVSGVVAEKLARVGDVTGIAPQTQIATVVG GTQKLFSIIQDGKLELQAQVPEIQLTQVKIGSKVQVTSDVDNRVKLQGRVRDIEPMIN QEKREATVKIDLPQTTLLKPGMFARAAISTVTTIGLAVPQKAVLPQSDGSAIVFILSG VDTVRAQKVELGEILVGGRVEIKSGLQQGNASRVRVVVDGAGYVKDGDTVRVVNTQ" gene 10864..11739 /locus_tag="DP116_11220" CDS 10864..11739 /locus_tag="DP116_11220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metal-dependent phosphohydrolase" /protein_id="PRJNA477356:DP116_11220" /translation="MFNATEILIDAFVAQIREGYRRTYGCLKNDYQDIIAWAGNMALE NIANSDALYHNVEHSILVTLVGQEILRGKHIREGGVSNEDWLHFIISLVCHDIGYVKG VCRQDREEEGLYATGKDGKMISLRSGASDASLTPNHVDRAKLFIDERFGGHKLIDSEV IKSNIELTRFPVPAVDDHQDTNNFAGLVRAADLIGQLSDPRYLNKITSLFYEFEETGM NKVLGYQNPADLRKNYAKFYWNVVYPYIKDALRYLSLTQQGKQIVANLYSNVFVVEHE KLQEEHLYLIEKLRA" gene 11838..12956 /locus_tag="DP116_11225" CDS 11838..12956 /locus_tag="DP116_11225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318683.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_11225" /translation="MNWWHRLKKNPLARFGAILLLVFYIAVILADFVAPYNPYTSQPN GSLLPPTRVYWVSKESGQFIGPHIYPTTQGDTNLETGDRQIIVDYKKPSPLRLFVTGP EYRLLQLNLPLPPKWDEVEIFGGIPLNIHLFGAVGEAKFNVLGTDDQGRDQLSRLLYG GRISLFIGIIGVALTFPLGMLIGGISGYFGGWLDSVIMRFSEVLMTFPSIYLLVTLGA VLPPGLSSTERFLLIVVITSFISWAGLARVIRGQVLSIKEREYVQAAKAMGANPLYII VRHVLPQTATYIIISATLAVPSFIGAEAVLSLIGLGIQQPDPSWGNMLSLATNASILV LQPWLIWPSAALIILTVLAFNLLGDGLRDALDPRSLRR" gene complement(13057..13140) /locus_tag="DP116_11230" tRNA complement(13057..13140) /locus_tag="DP116_11230" /product="tRNA-Leu" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(13104..13106),aa:Leu,seq:caa) gene complement(13119..13994) /locus_tag="DP116_11235" CDS complement(13119..13994) /locus_tag="DP116_11235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140378.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase M48" /protein_id="PRJNA477356:DP116_11235" /translation="MFPQKTQLIGLKADSFRHPLDLDATKALKQIPGLDMMVRNLLGP LAEQIFYVENIASSVLVGEKQLPHLHKSLLEACKILDIEPPQLYIRQHPAPNAYTFAM RGKQPFVVLHTSLIDMLEQEEIQAVIAHELGHLKCDHSVYLTPVNILILAAAVVPTVG TVLAQAIQTQLLEWVRCAEFTCDRAALLATQNPKVVMSVLMKLAGGSPTLAPQLNLDA FIAQARAYDDISKTEMGEMVKAARTAQLTHPVPVLRAREIDRWASSQDYQKLLQNQKI EYNSEVAPKGGWRNW" gene complement(14148..15557) /gene="murA" /locus_tag="DP116_11240" CDS complement(14148..15557) /gene="murA" /locus_tag="DP116_11240" /EC_number="2.5.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140377.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-N-acetylglucosamine 1-carboxyvinyltransferase" /protein_id="PRJNA477356:DP116_11240" /translation="MIPSGGLPNAKSSSTADSSVLQISGGHRLQGHVKISGAKNSALV IMAGALLCSGECRLRNVPLLADVTRIGQVLSALGVRLKQTGDILEIDARVIKTSKAPY ELVTQLRASFFAIGPILARLGVAQMPLPGGCAIGARPVDLHVRGLQAMGAEVQIEHGV CNAYVPGSSRRLKGAKIYLDIASVGATETLMMAATLADGETIIENAAREPEVVDLANF CIAMGAKIYGAGTSTITIVGVPQLHSTEYTIIPDRIEAGTFLVAAAITRSELILSPVV PEHLTPIIAKLQDIGVPIIEEAPNCLRTLSAETLRASDIETLPHPGFPTDMQAPFMAL LALAEGDSVINESVFENRLRHASELNRLGADIRVKGNVAFVRGVPMLSGAPVLGTDLR ASAALVLAGLAAEGKTIVQGLQHLDRGYDRLDTKLQQLGARIQRIPLAQVNAEVASNP TVEELPTSGATNRENVQSQ" gene 15903..15984 /locus_tag="DP116_11245" tRNA 15903..15984 /locus_tag="DP116_11245" /product="tRNA-Leu" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:15937..15939,aa:Leu,seq:cag) gene 16096..16905 /locus_tag="DP116_11250" CDS 16096..16905 /locus_tag="DP116_11250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873973.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA methyltransferase" /protein_id="PRJNA477356:DP116_11250" /translation="MLTSLQNPLVKQIRKLHSSKERHKQQLFLVEGTHLLEEACAANY PLEAVCSTPEWQDSHAVLWEKACDLCERAEIVSEEVLKAIATTVQPDGVVATAKRRER VGELPFTGIMLALETVQDPGNLGTIIRTAAAAGASGLWLSEDSVDLDNPKVLRASAGQ WFRLAMAMSPDLKTTVQQSREARMQVVATLPNATLTYWDVDWRKPSLILLGNEGAGLS KELTATADLQVKIPLSPGVESLNVAMTAALMLYEAQRQIFKSKEGTLNREQ" gene complement(17113..18324) /locus_tag="DP116_11255" CDS complement(17113..18324) /locus_tag="DP116_11255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315695.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11255" /translation="MNLAKKLDVNSYHNKKMTDKATQAKLVFENALLRSAEESSTFDY QVGGSLPMNAPTYVVRSADCLLYQALKRGEFCYILNARQMGKSSLMVCIMHRLQQEGF SCAAVDITRLGTENVTPDQWYKGFAVELWQNFNLFGKVNLKAWWNEQKDLSSIQCLSR FIEEIILPQVESEKVFIFVDEIDTVLGLKFPVNDFFALIRFCFNQRSINPEYRRLTFA LFGVATPSDLITDHQRTPFNIGQAIQLNGFQIHEAQPLLYGLTEKVSNPQVVLKEVLA WTSGQPFLTQKLCKLIRRCSSPIPTNGEAQWVENLVRNYVMKNWESQDEPEHLRTIRA RILNTKLKSSRLLEIYRQILSQQEVVSVDNPEEKELLLSGLVVKQQGFLKVHNRIYQS IFDHTWISEHL" gene complement(18269..20080) /locus_tag="DP116_11260" CDS complement(18269..20080) /locus_tag="DP116_11260" /inference="COORDINATES: protein motif:HMM:PF05990.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11260" /translation="MDLLIDDLLKNQKIDGKDLTILGYFLRSTAPINIEDDESKVDET LPTIDDVIDHLYRHSQSDTEELGLVIQIHGFNTGVKDGQEDYVREDWARVCKYLNQQD KALKDRKSSFVYLGYRWSSESVPSNLKNAFYSLPSFLQFLLYSGLAITALGIFFLILF SSPWFWIFIVLGVCLASFIGTLFILRVIVYFRDGYRARYFAVPDLIEFIRQLDQGLIQ RSKDVLGDAQKAQEEWNQKRIKLTFIGHSMGGFITTEVVRVLSDAFDPEAIGNVDNLN KRPSSNIGRVFSLGRLILVSPDIPVNTILSGRTNFLRSSLRRFEEAYLFSNEGDVALR LASTAANYFSFPASTRTQGYRLGNVTVNLPDTKVYGIVNLKQILSAEHQLFYSNQKFD HLLKYLGVKVLNKRQERNLLQNKKDPESLVPKGKDPESIADLFTYFDCTEYQDETDYP GREGKTINVLICPNQKSPLNFWQYIKILKAFFDSFSNSPTGIDVHGGYFHGSFCKLVI YRLAFVGFQGLLDSLILEQPNEFNLVAPPDLQEKLNRSGELNTVEKHKVALEYFSWIC EQKSIQVAASSERYYVDVLNESREKVRCEFLSQQKND" gene complement(20544..21017) /locus_tag="DP116_11265" CDS complement(20544..21017) /locus_tag="DP116_11265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869715.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11265" /translation="MSDLSAAQTNPLLGTWTLISASAINPDGTVTPQVYSPNPIGYIT YTPEARMMVIFSRSDRLPLSGDIRSPFSKDIRSLPTEECVQAFSTFNAYAGTYTLNGN SVTHHVEVASIPNRVGKDLVRTFTLNGNRVTLKTPPTNTDGVLKVFELVWERVEL" gene complement(21062..21463) /gene="tnpA" /locus_tag="DP116_11270" CDS complement(21062..21463) /gene="tnpA" /locus_tag="DP116_11270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012263857.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS200/IS605 family transposase" /protein_id="PRJNA477356:DP116_11270" /translation="MATRLLKERHSVSDLNVHLVCVVKYRRPILTAESLLVIEKSFNE VSEKMNFIVQEFNGETDHVHALIKYPPKLSISQIVNSLKGVSSRRYGQGGYPKPYGKD ALWSPSYFVSSVGGAPLEVLKRYILDQQKPS" gene 21514..22140 /locus_tag="DP116_11275" /pseudo CDS 21514..22140 /locus_tag="DP116_11275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015163149.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 22864..23973 /locus_tag="DP116_11280" CDS 22864..23973 /locus_tag="DP116_11280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11280" /translation="MLFDAIIEREESLQELICDFDSTAAKIIQSCGLCLMSKLLNNLS HQVTQEAKTKELVIHRVKKIKYAVIFGELEVPSPYLWNKKAKSGFCPVPKSLGIEHGG RSKAVKKALTDFGIEESFGQASKRFVEHYGWEVERASVRREVEAIASTALDYVEKRLE SASDEPIQQKEETVSRLLVELDGSHVRTGVVQPKPEQKDSKRRNKTRLRIAKKHNKIA DTRKDFLHKLSSKIVRENQAIVLEDLNVSGMVKNRKLARAISLQGWREFRVLCEAKSD KFGRTFKVISRWEPTSQTCFDCGYRWGKLDLKVRTVTCLQCSATHDRDENAAKNIEKV GIGNCHDSKRTHRGRKTVLAADLNEASRITAPLGR" BASE COUNT 7053 a 5168 c 5206 g 6902 t ORIGIN 1 atgcttcaag ccgggaaacc cgtccaacgc agtggctccc ctcccgcagc gctggactca 61 catttacgtg ctctacatta tgcatggtgg tgtatttgat tcaaatgaga accgctatat 121 tttccccttg aagagtgagg cagatgaggc aggtgaggca gatgaaagag aaatattttc 181 attcttgctg atgtgcaaat gcccattcgg ggtaattact cattctctag cgatgaatgg 241 ggggagttac gcccttcaaa tcttgctcga tgttgtcgaa atttagtata ctacaagaaa 301 atattttaaa taaacttaac gcttctattt tctacaaata taagaaatta tgaattgtta 361 taagaaactt tcaaaacaat gttatatttc tatcaaattc tattaaaata accaatgtgc 421 ctaactgatt cgtaaaaatc gatagatttg ttattgtgac caaaaaagga tgaaaagtca 481 ggtaaaaagt ctgtaaaaat atttacttca ttttacagac tttcgttcta tgctaaagga 541 aaccgtccta aaagtaagat aatgtacgct gcattctctc catccttgcc aacgattcgt 601 cacgtctaca gttgaccttt atttgattaa actcctatga gtaggttagt tgttttaagc 661 ctaggtcaag gaaatctaca cgaggggtgt gctacggtga cggctcagct tggagaagca 721 gacaacccct accggatgaa attgactgcc agcttaccag ccgcaccaaa aatttatgaa 781 ctctaccaaa aatggcagtt agtttattat gctttttatc aacgcttgag ttggcgtctg 841 gataacgaaa tagatgataa ttttgaaatc gaagaagatt ctgtcactaa tatttcagag 901 gctgatattc atgacttgtg tcagctgttg tcaaaaaaaa taaatacatg gctcaactca 961 acaaagtttc gcaaaataga ccagcagtta cgtacacaat taaaaccctc tgaagagatt 1021 cgcttcatta ttgaaaccaa cgaccattta ctgcgacgac ttccctggca tctgtggaaa 1081 ttttttgagg attacccata tgcagaagtg gcgttgagtg caccagagta tcaaaagccg 1141 aagaaattgc tggtaaataa ccgcacaagt caagtgagaa ttttggcaat ttttgggaat 1201 agtcagggta ttgatattag ccaggataaa actcttttag atcaactatc 
aactcaagca 1261 gaagttaaat ttttaatcga gccgaatatc ggtcgtttaa acgagcagct atggcaacaa 1321 aattcggata ttctcttttt tgcaggtcat agttttagcc aagaaaaagg ttttttgcaa 1381 attaatcaaa cagatacgat tacccttgag cagttaaaat atggtctgag acaagctatt 1441 agtcgtgggt tgaagctagc gattttcaac tcctgtgatg gtttagggtt ggcgcaggaa 1501 cttcaagatt tgcaaattcc acaagtgatt gtcatgcgtg agcctgtacc ggatgtggtc 1561 gctcaagaat ttttaaagta ttttttggca gaattttcaa gtggaaagtc tttatacact 1621 gcggtgcgtt ctgcacgcga aagactgcaa ggattagagg gagaatatcc ctgtgccact 1681 tggctaccaa tcatcttcca gaatccggct gaaccgtcta tgatttggac ccagcaaagt 1741 gtatggacta aacggaaaca gaagaaaaat tcctcaatgc tggtttccac cgtccaagcc 1801 agaggtatgc aaacaaacat ccgattgcca gtagcaactt cagctctctc caaaacaacg 1861 gctctcgatt cactcattga gatgctcaat ggcaacctgc tgaaatacca aaatcttccc 1921 cttaatcata cagaaatttt ggttttacga ggtatttggc aaagtcaaac ttacaaccaa 1981 attgctcaac agggcggcta tagtactagt taccttacta agactgtcgc tccacgatta 2041 tgccacaaat tgtctgatct gatcggcaac cgtgtgacta aaaaaaactg tcggacgcta 2101 ctagagtcgt atgtgactac acaagcttct ttaaaaacaa ccctagaaca acatcccccc 2161 gtaaaagatt ttcccaattt ccagcaggaa atgtcacctc gttttcctag tggtttagtc 2221 ccgcttggtt ctccctatta cattgaacga ccagagattg cgacacaggt ttatgaagaa 2281 attattaaac caggagcctt aattcgcatt aaagctcctc aagaaatggg taaaacctcg 2341 ctgctgttga ggattcttga gtatgtcaac ggcatgggct accatatagt gagtctaaac 2401 ttacaggagc aagttgatcg agcaattttg agcgatttga atcgtttttt gcgttggtta 2461 tgtgctaaca tctcttgtca gcttgagctt gaaccaagat tagacgagtt ttgggatgaa 2521 gatattggca gcaaagtgag ctgctcgctt tatttacgaa attatgtgtt agagcagatt 2581 gattctcctg tggttttggc gttggacgaa gtcaaccaga tttttgagca gccgcaagtg 2641 gcaaaagaat tcttaccctt gttgcgttcc tggtacgaag atgccaaaag acaacctatt 2701 tggcaaaagc tgcgcctaat tgtggttcac tcaactgagg tttatgttcc tctccagctg 2761 aatcagtctc catttaatgt cggactaccg attgagctaa actatttcag tttagagcag 2821 gtgcaagagt tagcccagcg ctttggactc aattggacag atgatgaagc aagatttctc 2881 atggacatcg ttggaggtca tcctgcatta gtacatctga cactttatca cctcagtcgt 
2941 ggagaggttg ctttaggaca attgctgcaa agtgctccca cacccacaag catttattac 3001 cgtcatatcc agccttattg ggcaactttg aaagtgcagc cagaattagc atttgctctt 3061 tctgcagtta tgagtgctac caaacctgta aagttagaac ctgttcttgc tcataagtta 3121 aggagtatgg ggttgattaa gttggatgac aatctagcta cacctagttg tcggttgtat 3181 caagagtatt ttcaattaaa gtgtgagata agtgcaagct aaaaacccga gccagccgcg 3241 tggtgaggta gtgctgcacc gaaggtaggg aactggtatc cagcaatgcc tgatatcgtg 3301 tccggttaaa gacttatcat taaaatacag cagtaggaag taaacaccaa actagtctgt 3361 aaaatgagtt gattcatgct aaaagtactt agaaaactag actctaacta aagtagtcaa 3421 ctatggataa gtttcactct tcagataaag atgaaaaaca ttactctcga aaatatcact 3481 caagccgtga ttgagcacgg ggatggaggc aagactcatc cacgccttta tgagatttat 3541 accagcctta ttcgacatct gcacgccttc gtgcaggagg taaatctgac ggaacaggag 3601 ttatcgctct tgcgcgattt tctcattcgg gcagatcgct acacaaaaga gataccgaat 3661 ggggaaattc atatgttgct cgacttgctg ggaatctcag agctagtggt tctattgcat 3721 aacaagagta gcacagctac tgaaagtaat ctggagggtc ctgtctatgt agcggatgcg 3781 ccagagcgca acatgggcga tcgcctcgga atcgatcccg acggtaatac gctcttccta 3841 tcaggacgcg tcttagacct gaacaacaaa ccaattgcca atgcgcttct tgatgtctgg 3901 cagtcaaatt caaagggact ctatgacctt catgatgcat cccaggcaaa gggtaacttc 3961 cgaggtcgtt tcagaaccaa ttcggacggc tcctactcca ttgaaacggt tgtgccaata 4021 gggtacacta ttccatcaag cggaccgtgc ggtgaaatgt tgcagctttt gggacgacat 4081 accctgcgag caggccatat ccattttaag ttgagtgctg caggctatat acccctgact 4141 acacaaattc atattgatgg agatccgcac ctcgattcag atacaacgtt cgcggtcaga 4201 tctgccatcc tcaagctgca aaaacacgaa gcacccgaca aactcaacgc atacaaccaa 4261 agcaaacctt tctatacagc cgagtttgac tttgtgctac aaccgaccga ccaacagaca 4321 gatgcggctt agagcttgga aggtcattgg ttgttgatta tgagttttta gtggttaatg 4381 gtgagccagt gcgttgggga gccagtactt gatgagggtc tccctcactt ggtatctggt 4441 gagaccagcg cgaatgacgg ctctccctca cttctgtacg ttcgctcccg aatgtcaggg 4501 tgagaggaat taatgtctga cagggtaagg tttcaatacg attgtggatc tgctaaaacc 4561 taacccgtgt attacaggga aaaagctgac cccgcctcat cgtgtattag atggagtaag 4621 
ggaacttagg tgttttcatg taatacacgg tgcagacttc gtggtcacgg ggtcttacac 4681 gtaaaaaaat agtccccatc gactcaaact atgaagccca actcagcaac tgggacagtg 4741 gagtgaagca gaaagtgcga ttcgccttgc gatagcttcg cttgacgccc aaaacataaa 4801 cacaaaagag cgatcgcaaa tccttgccca actcctagat atcaaaggtc gtttgcaatt 4861 ggctcaaagc aaagccgaag acgccttaaa tacttggcaa caagcacttg acatttatac 4921 aaaaatagtg aacagcgccg cccaagaagt atctcgcgga ttgcggcgac gcaattcgcc 4981 tgcggtactc aacgcctcaa accaaatacc gttggtggca taaagagcaa cgcgctgttg 5041 tggtgtcgct ttctctaact gactcttgag ggcaggattg agtgaagttc tttgaatcca 5101 gccatcagca aaagcaggtg gctcttgcgc cttacaataa attttgaaat accaatgata 5161 ctgcttgttc atttctagcg gcgctaccgt agacggaagc gagaagctgg taactcctgg 5221 tgttttgggt agcgtaacat caatctggta gattaactta ccctctttat cccgcaatac 5281 aaactttgct ggaaatgacg actgataagg aacataaaac cagaaagtgg gatgcgaggc 5341 gatcgctgtt ccaaaaacaa attctgaatt tgaataaaca ggcactaagg ctgctagccg 5401 cttatcccct ccacagccac ggcttcctac accctacacc cttacaccct gttctcggta 5461 aatcgccaca gtaccgccag tcattactaa aactaaggct gacggaacga gaggaaccca 5521 actgccagag cacaagaaga ctaaacaaag aacgtataaa atcaagagtg cgcctccacc 5581 cgttagtatg agatataatc ctgagcgaaa gcgccaagca attgcacctc caacgataga 5641 ccaaccccaa acccataaca cttcgcccca aacaggcaaa actgaatttc aactagcagg 5701 tggtagattc acaacttcct tgtcttcggc aatcttgcgc tgcggtttct tgccaaagcc 5761 gtatttcgcg acatctagaa tccagcattg gaagctgtca acgtaagaga acagcacggg 5821 tactactacc agggtgagca acgtagaggt tgtaaaacct cctaaaatgg ctatacccat 5881 cgactgacga acttctgcac ctgcaccaat tcccaatgcc aagggaaggg taccggcgat 5941 cgttgcaagg gaagtcatca taattggacg taaacgagac actccagctt ctaacaacgc 6001 ttgacgttga gttttgcctt cctcaaggtt aataattgtg tagtctacca agaggataga 6061 gtttttcgtc acaatcccca tcaacagcac aataccaatc agggcgtata gtcccaaagc 6121 tttttgtgcc agcatcaagc ctatgagtgc gccacctaaa cagaaaggta acgctgccat 6181 aattgtcagg ggatgaagaa agttgttata caataagaca agaatagcat aaatacacat 6241 gactgcgagt cccaatgcag caccaaagcg actgaagatg tcagccataa tcttggcatc 6301 accagagttt 
tgcagcttga ctcctggcgg taagttttgc aaagctggga gttgattgac 6361 tgctttgagt ccatccccca aagaaatgcc ttgcaagttg gcttctaggg aaacttgacg 6421 ggagcgatcg tagcgatcta tagttgcagg accactacca aagcggatat ctgcaactgc 6481 aatcagagga accaagctac cattttgact aggaacctgg agatttttga tggtattaat 6541 gtcttgacgt gctttggggt caatttgcac gcgaatgggg atttggcggt cagaaagatt 6601 atatttcgcc aaatttgcat cgttatcgcc tattgtagca agggaggctg tacgagcaat 6661 tgattggact gttaccccca aatctgctgc ccgttgtggg ttaggaacga ccaaaatctc 6721 tggtttgagc aaactcgcac tagaagacag ttccacaagt ccgggtatgc ttctcatctg 6781 cttttcaagt gcatctgctg cttgagtcaa cgctttgggg ttatcactgc gcaggagtat 6841 tgttaaatcc ttgcgactac caacagaacc tgcactttgg aaactaattc ttgtccctgg 6901 aatttgcgca aatagaggac gtactttttg ttgaaactct acttgagaaa tttttcgtcc 6961 ttcttcttta gatttgagtt taacaacaag cgtcgcagag tttacctctt gtgtagctag 7021 tacgctgacg acattgggat tttgcttaat gatatttgtt gtttgagtaa ccaccttctt 7081 ggtatcttcc aaagtagaac cagggggaag ttccatagaa atagtggaaa ttccaatgtc 7141 actactatca actaacccct taggaatgaa gggaacaagc attaaactag caatgaagaa 7201 agcaacagca acacccaaag ttgtcaactt gtgagttaac gcccatttga ggagagattt 7261 atagggttga aaacgacgac gaacagaagt gggcccagta gcaatatgtt gctccctcat 7321 ctccttcgtt tcctctctcc cctgctttgg ctgcttatcc ttgagtaaat acgcgcccat 7381 catcggcgta atcatccgtg caaccaaggt tgagaaaatg gtggaaacag caacagtcac 7441 accaaagggc tggaaaaatt gaccaggaac accgcccata aaagcaacag gcataaatac 7501 ggcgataatt gttgctgcac tggctatcac tgctaaaccc acttcgtcag atgagtcaaa 7561 agcagcttgc caagcagact tacccattgc catatgccgt tccatgtttt caatttctac 7621 gacggcatca tctaccaagt tacctactgc tagtgctaat gccaataagc tcatgttgtt 7681 gagggtataa cctaaagcat actgtacagc aaaggtagga atgattgaca gtggcaacgc 7741 gactcctgta atcaatgttg ctcgccagtc acgcaaaaac aacataatga ctatgactgc 7801 gagtatcgaa gcctgaatta attcatctat agtgctttgg tatgattttt caacaaaagt 7861 tgctctggta aaaattaatt ccagtttgac atcggtcggt agagtttttt gcagttgttc 7921 tacagctgct ttcactcctt gttccactgt caccataaca ctgcctgtgc tacgcaacac 7981 ttggaatgct accacgggtt 
ggttgttcaa acttgctgct tgtcgaactt caccaaagct 8041 atcctcgact gttcccaagc tggacaaggg tacagaacca ccgttaggca agacgatttc 8101 atagtttttt aaaacctcta cacttgcggc gctaccaaga gtacgaatgc tttgttcact 8161 accgccaatt tgggcacgtc caccaggtaa gttaatgtta aaagctcgta tttggtcgtt 8221 aacctgggta gcggtgatac ttagagattg taaacgatct ggatttaggt tgattcggat 8281 ctctcggtca accccaccga cacgcttaac ttgtgatact cccttaacag ccaataaagc 8341 gcggctaatg gtttggtcaa caaggttact taattcttct acagaacgct tgtcagattt 8401 taccacgtag gtcatgattg cacccccggc aaattctatg cgttgcacaa ttgggtcgtt 8461 tatgtcctgt ggaaggttct ggcgaatttg cgccactgcg ttacgaacat cgtttgtggc 8521 gcggtcagtg tttgttccca agacaaagtt gattgtggtt gtggagacac catcgataac 8581 cgtcgatcgc agttcatcga tattacccaa cccagcaact gcatcttcta ttttttttgt 8641 gacttggaat tctaattccg caggacctgc acctggttgt gttaccctga cctgtactgc 8701 tggaacatca atatttgggt tcgtatcaat tcccaaagca agaaaggaaa accaaccgac 8761 tacagttaaa atcaagaata aaactattgt tggaacaggt tttttaatcg accaagctga 8821 aatattgaag gacatgcaat ctgttatgag ttatgaatca agaatttgag tgatgaatta 8881 agaatcaaag tccaaagtca agagtgacta ttgactatca actattgagt gttgacgact 8941 cgtactgtgt caccatcttt gacatatcct gcaccgtcaa cgactacacg gacacgggat 9001 gcattaccct gctgcaatcc acttttgatt tccactctgc caccaacaag gatttctcct 9061 agttctactt tctgagcgcg aacggtgtct acacctgata ggatgaagac aattgcactc 9121 ccatctgatt gaggtagaac tgctttttgg ggtacggcta aaccgattgt ggttacagtg 9181 cttatcgcag cacgagcaaa cattcctggt ttgagtaaag ttgtttgtgg caagtcaatt 9241 ttgactgtgg cttcacgttt ttcttgattg atcataggtt ctatatctct gacgcgtcct 9301 tgcaacttca cacggttatc aacatcagag gtgacttgca ctttggagcc aatttttacc 9361 tgtgtaagct gaatttcagg aacctgtgct tgtaactcta attttccatc ctgaataatc 9421 gagaatagct tttgtgttcc accaaccaca gttgctatct gtgtttgtgg tgcaatacct 9481 gtgacatctc ccactcttgc caatttctct gccacaactc ccgaaactgg ggcacgaagc 9541 aatgtttgtc ccagctgagt tttcagttgt tccattttgg cggtgctact tttgacactg 9601 gcttgggcac tgttgatatt ggcttgggca attttaacat tagcttcggc actaccaata 9661 tttgcctggg cactactgac attagcttgg 
gcacttttaa tattcgcttg ggcaaggctt 9721 actgcttctg ttgctgtggc taaatttgtg gtagcagtat ccaattgttg ttggctaatc 9781 gccccttgag atgttagttc ttgagtgcgt cggaaattga tttgggcatc tcgtaatttt 9841 gcctgcgctt gagcaaaatc tgcttgtttt tgttggacga tcgcctgatt tgataccaca 9901 gtcgcccgag ttgagactac aactgcttgc ttcgatgcca aatccgcttg ttttgaagca 9961 acatctgcct gctttgcttc cacatccgct tttgcttgcc gaatttggtc ttgcagtacc 10021 gaatcatcca gcacagccat cacttgacct gctttgacaa tctctcctat attgacaaga 10081 atttttttaa tctgcaagcc gtttgcctga ggcaaaactg gagtcaaatc gcgtgctgca 10141 acagtccctt tcgtactgag ggttcgcgca acacgagttg tttctactgg ggtgacggtg 10201 actgacattg ttggtgcaac tttttgtact gttttttttt cagctagtgt tgttttttca 10261 gatacactgt gactggagag acgcattccc ccaaacgcaa taatacctcc caagcctgta 10321 cctactagca gtggtatcaa ccatgaatgt cgctttcgat acacactttt tttttgccag 10381 tttgcatctt caaatgccac tggttcttcc actttaagtt ctgaaacact ttcctttacc 10441 acagttgtcc ctccagtttc ccaaccgcca ttaagcagtt gtgcttaaga aaattttaaa 10501 ttaattaatg tatattaact atgacgtatt ttctctgctg cgagttctaa tattcctcaa 10561 atttacgcat gactatcgtt acgattttgt caagtagggg taaaggtaat aaagacaaac 10621 ctgtatcacc acttccgtag tcgtttgggg ctatgtataa ctacgaaggc atggttgatg 10681 ttttgctcaa ttcgtaaaaa tttgcgctta ccctgcatac accaattggt agcagatatg 10741 cttaaactat aaacagttat aaagagttat ttgcctttat ttgagtgctc caacgatcat 10801 caaattatcc aaatttgaga taaaccaata atacataatt tggtatataa cagagtcacc 10861 aatatgttta atgccactga aatcttgatt gatgcgtttg tcgctcaaat tcgtgaagga 10921 taccgtcgca cctacggctg cctgaaaaac gattaccaag atattatagc ctgggcaggc 10981 aatatggcgt tggaaaatat tgccaacagt gacgcccttt atcataacgt agaacattct 11041 atcctagtta ctttggttgg acaagaaatt ttgcgcggca aacatatccg tgaaggaggt 11101 gtttctaatg aagattggtt gcattttatt atatccttgg tctgtcatga tattggttat 11161 gtcaaaggag tttgccgaca agaccgagag gaggaaggtt tgtacgctac tggtaaagat 11221 gggaaaatga tttctcttcg ttctggagct tctgatgcga gtctgacacc aaatcatgtt 11281 gatcgggcaa aactttttat tgatgagcgt tttggcggtc acaaattgat agattcagaa 11341 gtcattaaga gcaatattga 
attgactcgg tttccagtgc cagctgtaga tgaccaccaa 11401 gatacaaata attttgctgg tttagtacgt gctgccgatt tgattggaca attgagcgat 11461 ccacgttatt taaacaagat tacctcttta ttttatgagt ttgaggagac tggcatgaat 11521 aaggtgttgg gttatcaaaa ccctgctgat ttacggaaga actatgccaa gttctattgg 11581 aatgttgtct atccttatat taaggatgca ttgcgctatc tctcgcttac acagcaagga 11641 aagcaaattg ttgctaatct ctactcaaat gtgtttgttg tagaacacga aaaacttcaa 11701 gaagagcatt tgtacctgat tgaaaaactt cgtgcttaaa aattgatagt tataaattat 11761 caacaacgaa aaagctttca caattcatca attataattt ataattcatc atttatgact 11821 cataactcgt aattactatg aattggtggc atcgactaaa gaaaaatcct ttagcaaggt 11881 ttggggcaat tttactgttg gttttttata tagcagtcat tttggctgat tttgttgccc 11941 cttacaaccc ttacacctca caaccaaatg gttctctact accaccaacg cgtgtttact 12001 gggtttctaa agaatcaggg cagtttattg gtcctcacat ttatccaacg actcagggag 12061 acacaaattt agaaacgggc gatcgccaaa tcatcgtaga ctataaaaag ccttctcccc 12121 taagactatt tgtcactgga cctgagtatc gtctacttca gttgaattta cctctacccc 12181 caaagtggga cgaggtagaa atatttggtg gtatcccgct gaatatacat ttgtttggtg 12241 ctgtcggtga ggcaaagttt aacgttttag gtacagatga ccaaggtcga gatcaactca 12301 gtcgcctgct gtatggtggt cgtataagtc tgtttatcgg tattattggg gttgccctaa 12361 cctttccctt agggatgttg attgggggaa tttctggcta tttcggtggc tggcttgaca 12421 gtgttattat gcggttcagc gaagtgctga tgacttttcc cagtatttat cttttggtaa 12481 ctttaggtgc ggtactaccg ccagggttat ccagtaccga acgctttttg ctgattgtcg 12541 tgattacttc gtttatcagc tgggctggat tagcacgagt tattcgcgga caagtcctct 12601 ctattaaaga aagagaatac gtccaagcag caaaagcaat gggtgccaat ccactttaca 12661 ttatcgtgcg tcacgtattg ccgcaaaccg ccacttatat cattatttct gctacacttg 12721 ctgtccctag ctttattggt gcagaagcag tactcagtct tatcgggtta ggaattcaac 12781 aaccagaccc ctcatggggt aatatgttat ctcttgcaac taatgcttca attttagtat 12841 tacaaccttg gctaatctgg ccttctgctg cgttgattat tctcacagtg ttggctttca 12901 acttgctggg tgatggactg agggatgctt tagatcctcg cagcttaaga aggtagcttc 12961 tactaagttg tttaacagtt attagttatc agttatcagt tatcagttat caaggaattg 13021 
ataactgttc actgttcact gttcactgat ttcaagtggg caggaggaga atcgaactcc 13081 tatgaccgca aggtcgccac attttgagtg tggtgcgtct accagtttcg ccacccgccc 13141 ttgggtgcaa cttcactatt atactctatt ttttgatttt gcaacaactt ttgataatct 13201 tggctacttg cccaacggtc aatttctctg gcacgcagaa cgggtactgg gtgagttaac 13261 tgagctgtgc gggcagcttt gaccatttca cccatctcgg ttttgctaat atcatcataa 13321 gcacgagctt gggcaataaa agcatcaagg ttaagttgcg gtgcgagggt gggcgaacca 13381 cctgcaagtt tcatcaacac cgacatgaca actttcgggt tttgggttgc cagtaatgcg 13441 gcgcgatcgc atgtaaactc agcacagcgt acccattcca aaagttgcgt ctgtattgct 13501 tgagcgagaa ctgttccgac ggtaggtact actgctgcgg ctaatatcaa tatattgact 13561 ggagttaggt aaacactgtg gtcacacttc aggtgtccca attcatgagc aatcacagcc 13621 tggatttctt cttgttccag catatcgatg agggaggtgt gcagcacaac aaaaggctgt 13681 tttccccgca tagcaaaagt gtatgcattt ggtgctggat gctgcctaat ataaagctgg 13741 ggaggttcaa tatccagtat cttgcaggct tctaacagcg acttatgaag atggggcagt 13801 tgtttttcgc cgaccagaac actagaagcg atattttcta cataaaaaat ctgctccgcc 13861 aatggaccca aaagattccg caccatcata tccaaaccag gtatctgctt cagtgctttg 13921 gttgcgtcta gatctagcgg atgacggaat gaatctgctt tgagaccaat cagttgagtc 13981 ttttgcggga acatggcgga atcttgctgg ggatataagg gattgaaaaa taatctagga 14041 ctgctctttt gaacagccct tctccaccag tttaacatct accccaacta agagtgcaag 14101 gtaggttggg gattccaaca gtcacctgaa gaagtccaaa gcgctatcta ctgtgattgc 14161 acgttttccc tatttgttgc cccagaggtt ggcagttcct caactgtcgg gttagaagca 14221 acttccgcat tgacctgtgc tagtggtatc cgctggattc ttgctcctaa ttgctgcaac 14281 tttgtatcaa gtctatcgta gccccgatct aagtgttgca atccctgaac tatggttttt 14341 ccttctgccg cgagtcctgc caaaactaac gctgctgatg ctcgcaaatc tgttccgagt 14401 acgggtgcgc ctgataacat cggcactcct cgaacaaagg caacgttacc tttaacgcga 14461 atatctgccc ctaagcgatt taactcagaa gcgtggcgca aacgattttc aaatacagat 14521 tcattaatca cactatcacc ttcagctagt gccagcaaag ccataaatgg cgcttgcata 14581 tctgtaggaa aacctggatg aggcaaagtt tcaatatcac ttgccctcag ggtttctgcg 14641 ctcagagttc gtaagcagtt cggtgcttcc tcaataatgg gcactccaat 
atcttgcagc 14701 ttagcaatga ttggtgtcaa atgctctggt accacgggtg agagaatcag ttctgaacga 14761 gtaattgctg cggcgaccaa aaacgttcct gcttcaatgc gatcagggat aattgtatac 14821 tcagttgagt gcaattgtgg aacaccgaca atcgtaattg tactggttcc tgccccgtat 14881 atctttgctc ccattgcaat acagaagttc gccaaatcaa cgacctctgg ctcgcgtgca 14941 gcattttcga taatggtttc gccgtctgca agagtagccg ccatcatcaa agtttctgtt 15001 gctcccacgc tggcaatatc caagtatatc tttgctccct tcagtcggcg actgctacca 15061 ggaacatagg cattacaaac gccatgctca atctgtactt cagctcccat tgcttgcagt 15121 cctcggacat gcagatccac aggtcttgcg ccaatagcac agcctcccgg taaaggcatc 15181 tgcgcgactc ccagtcgtgc taggatcgga ccaatagcaa agaaacttgc tctgagctgt 15241 gtcaccagtt catagggagc ctttgatgtt ttgataactc tagcatcaat ttctaaaatg 15301 tctcctgttt gcttcaagcg aacgcctaaa gctgataaaa cctgacctat ccgtgtcacg 15361 tctgccaata acggcacgtt acggagccga cattcgcctg aacagagcaa cgctccagcc 15421 atgattacca gtgctgaatt tttggctccg ctaattttta catgaccttg caaacgatgc 15481 ccacccgata tttgcaggac tgaggagtct gctgtcgagg aagatttggc atttggtaag 15541 ccgccagaag gaataataaa cttaccctcc aaaaaaactg ttgattgtat aagtttgaag 15601 ttggtttcga ttctacacga tagtttttga ttgtctacat ttttgaactg aaattgataa 15661 atttcattta tgtctggtct aagagcagat acagggaaca gggaactctt aacgcttaac 15721 tcttaacgct taacataaac tgtcatgaat caacaattta tcatacaatt ttttttcaga 15781 aaagcttaac actggtaaca ataaatactt aaaagcaaag aaaccgttga tgtcagaaat 15841 tagggcttga caaatcaaaa gaattgagta ataataaaaa gctgttaatc aaaaatcaat 15901 tcgcggaact ggcggaattg gcagacgcgc tagattcagg ttctagtgcc gaaaggcttc 15961 cgggttcaag tcccgggttc cgcatacgag tgagaagtgg tgagtaacga gtaaaagacg 16021 gtatttcttt gtgggccaaa catttaactc gtgagtatca gtattttgac tcaccactta 16081 tgaccagaac ctttaatgtt aacaagttta caaaatcccc tagtcaagca aattcgcaaa 16141 ctccactcct caaaagaaag acacaagcag cagttgtttt tggtagaagg gacgcacctg 16201 ttagaggaag cttgtgcggc gaattaccca ttggaagcag tgtgcagtac accagaatgg 16261 caagacagcc atgctgtgct atgggaaaaa gcatgtgatc tttgcgaacg agcagaaatt 16321 gtgagtgaag aggttttaaa ggcgatcgcc 
accacagtac aaccagatgg cgtagttgcg 16381 acggcaaaaa gaagggaacg cgttggggaa ctgccgttta ctggtataat gctagcgttg 16441 gaaactgtac aagatccagg caacttgggt acaataattc ggactgctgc agccgctggg 16501 gcttctgggt tgtggctgag tgaagacagt gtggatttag acaatccaaa agtattacgt 16561 gcttctgctg gacagtggtt ccgcctagca atggcgatga gtcctgattt gaaaacgaca 16621 gtccagcaaa gtagggaagc acgaatgcag gtagttgcaa ccttacccaa tgctactttg 16681 acttattggg atgtagactg gcgcaaaccc agtttaattt tgttggggaa tgaaggtgct 16741 ggtttgtcaa aagaattgac agcaacagca gatttacaag taaaaattcc cctgagtccg 16801 ggggtagaat ctttaaatgt agcaatgacc gccgctttga tgttgtacga agcgcaaagg 16861 cagattttca aaagtaaaga gggaacgctt aacagggaac agtaaacagt gaacacaacg 16921 accacgctca gtgcatcgca gtgaacaagg ggtggacgag tccgtttcct acctgataac 16981 tgataactga taactgatga ctgatgactg ataactgata actgataact gataactggt 17041 taacataaat tgtacggagt tctgttcaaa aatcaaatat aaatcctata ttatgtatta 17101 tgataatttt gtttagaggt gctcactgat ccaagtatga tcaaatattg attgataaat 17161 ccggttatgc acttttaaaa aaccctgttg cttgacgaca agacctgaca acagcaactc 17221 tttttcttct ggattatcaa cagaaacaac ttcttgttga gacaaaattt gtcgatagat 17281 ttctagtagc cgagatgatt ttagcttggt gttgagaatg cgagcgcgta tcgttctcaa 17341 atgttctggc tcatcctgag attcccaatt cttcatcaca taatttcgca ccaaattttc 17401 aacccattgc gcttcaccgt ttgtggggat aggagatgag caacggcgta tgagtttaca 17461 aagcttttga gtcagaaaag gttgaccgct tgtccaggct aacacttctt tgagcacaac 17521 ttgtggatta ctcactttct ctgtcaatcc ataaagcaaa ggttgagctt catggatttg 17581 aaagccattg agttgaatag cctgaccaat attgaaaggc gttctttgat gatcagttat 17641 taaatctgaa ggtgttgcga ctccaaacag agcaaaagtt aaacgtcgat actctggatt 17701 aatactacgc tgattgaagc agaagcgaat aagagcaaaa aagtcattaa caggaaattt 17761 taagcccaaa acagtatcaa tttcatcgac aaaaataaat accttttcac tctcaacttg 17821 aggtagtatg atttcctcta taaaccgact caagcactga atagaagata aatctttttg 17881 ctcgttccac catgctttca gattcacctt tccaaacaaa ttaaaatttt gccataactc 17941 cacagcgaat cccttatacc attgatcagg agtaacgttt tcagtaccaa ggcgagtgat 18001 atctactgct 
gcacagctaa acccctcttg ctggagacga tgcattatgc ataccataag 18061 gcttgactta cccatttgcc gtgcattcag aatgtaacaa aactctccgc gcttgagtgc 18121 ttgataaagg agacaatccg cagatcgtac cacataagta ggagcattca tcggtaaact 18181 acccccaact tgataatcaa aggttgagga ttcttctgca cttctaagca aagcgttttc 18241 aaaaaccaat ttagcttgag tcgctttatc agtcattttt ttgttgtgat aagaattcac 18301 atctaacttt ttcgcgagat tcatttaaaa catcgacata gtagcgttct gaagaagctg 18361 caacttgaat acttttttgc tcacatatcc aagaaaaata ttccagagca actttatgtt 18421 tttccaccgt attcaattca ccacttcgat tgagcttttc ctgtaaatca ggaggtgcta 18481 caagattaaa ttcattaggt tgttctaata ttaaagagtc tagtaaacct tgaaatccca 18541 cgaaagccag acggtaaatt accaatttac aaaaagaacc gtgaaagtaa ccgccatgaa 18601 catctatccc tgttggactg ttgctaaaag agtcaaagaa agccttgaga atctttatgt 18661 actgccaaaa gttcaaggga gatttttgat tcggacaaat gagaacattg attgttttgc 18721 cctcacggcc tggatagtca gtttcatctt gatattctgt acaatcaaaa taggtaaaca 18781 aatcagcaat cgattctgga tctttacctt tggggactaa agattctgga tctttcttat 18841 tctgtaatag gtttcgctct tgacgcttgt ttaaaacctt aactccaaga tatttcagta 18901 ggtgatcaaa tttttgatta gaataaaata gctgatgctc tgccgacaat atctgcttta 18961 aattcacaat tccataaacc tttgtatctg gcaaattaac tgtcacatta ccaagtcgat 19021 atccttgagt acgagtacta gctggaaaag agaaataatt agctgctgtt gaagcaaggc 19081 gtaatgcaac atctccctcg ttactaaaga gataagcttc ttcaaaacga cgcaaagaag 19141 agcgcaaaaa gttagtccgt cctgaaagga tagtatttac tggaatgtct ggagaaacca 19201 aaattaaacg acctaaactg aaaacacgac caatattaga tgatggtcgt ttattcaaat 19261 tatcaacatt cccaatcgct tctgggtcaa aggcgtcaga gagaactcgc acaacttctg 19321 ttgtgatgaa tccacccata ctatgaccaa taaatgtcaa cttaattctt ttttggttcc 19381 actcctcttg ggctttttga gcatcaccta aaacatcttt tgaacgctgg attaaacctt 19441 ggtctaattg tctgataaat tctattaagt ctggtacagc gaaatatctt gctcgatatc 19501 catctcgaaa ataaacaata actctcaaaa taaataaagt tccaataaaa gaagctaaac 19561 agacacctag cacaataaaa atccaaaacc aaggagacga gaataggatt aagaagaaaa 19621 taccaagagc agtaattgct aagccactat atagtaaaaa ttgtagaaaa ctcggtaaag 
19681 aataaaatgc attttttagg ttagaaggaa cactttccga tgaccatcga taacctagat 19741 aaacaaagct actttttcta tctttaagag ctttgtcttg ttgattaaga tatttgcaaa 19801 ctcgtgccca atcttcgcgt acatagtctt cttgtccgtc cttcacacca gtattaaagc 19861 catgaatttg aataaccagt cctagttctt ctgtgtcaga ttgactgtga cgataaagat 19921 ggtctataac atcatcaatt gtgggcaaag tttcatccac tttcgactca tcatcttcta 19981 tattgatagg tgctgtgctg cggagaaagt aaccgagaat agttagatct tttccatcaa 20041 ttttttgatt ctttaaaagg tcatctatca aaagatccat atttgctacc taattttgtg 20101 ttaaggttta aaatttttga attttgaatt cttggtcggg agcgtggggt cttgaatgcg 20161 tgaactttgt aagcgcaaag tgtgccgtag gcatacacgt agtgtgtcca cagggcttag 20221 gggctgacgg tcttctccac aggaaagctt tctgcgcttg ccgtgagcag aaaattaagc 20281 gtatgtggac tcttgaaagt ttggtatctc acattttgca ccaataaaaa tttattgtat 20341 tagatcactc agtattaaac caaaaatcct ttatgagcgc gagaaaagcc aagaaaaatt 20401 atgtagctta cccttacatc tttttagctt atacggtttt ctacggatag gtgcaagata 20461 tgatgtatgc ttgtgacaac tgacaagcga tggcacggtg tgcgggcact ccgtgccatc 20521 gctttatata cgagtcgata tcattatagc tccacacgtt cccacaccag ttcaaagacc 20581 ttcaggactc catcagtgtt ggtcggtggc gttttcagtg tgactcgatt cccattcaat 20641 gtgaacgttc ggactaaatc cttaccaacc cgattaggaa tcgatgctac ctcgacgtga 20701 tgagtgacgc tgtttccatt caacgtgtat gtccctgcat aggcattgaa tgtagaaaac 20761 gcctgtacac attcctcagt cggtaaagag cgaatgtctt tgctgaatgg tgagcgaata 20821 tctccactta agggtaggcg atcgctcctt gaaaagataa ccatcatccg cgcctcgggt 20881 gtataagtga tgtagccaat tgggttagga ctatacacct gaggagtcac tgtcccatca 20941 ggattgatgg cgctcgcact gattagcgtc caagttccaa gcaacggatt cgtctgagca 21001 gctgacaaat cagacatgac tcactcctta tcaaaaactt tgggtctaaa accccgtcct 21061 tctaggacgg cttttgttga tcgagaatat aacgcttcag tacttcaagc ggcgctccac 21121 caacggaaga tacaaaataa ctaggtgacc acagagcgtc tttgccgtag ggttttggat 21181 aaccaccttg accatatctt ctgctggata ctccttttaa agagttaact atctgggaaa 21241 tagaaagttt gggcggatat ttaatcaacg cgtgaacgtg atcagtttcg ccgttgaatt 21301 cttgtacaat aaaattcatt ttttcggaaa cttcattgaa 
cgatttttct atgactaata 21361 gactttctgc tgtgagtata ggacgtctat attttacgac acagaccaag tgaacattta 21421 aatctgaaac gctgtgtctt tcttttaata gccgtgttgc cattttgtcg cgaccaactt 21481 ataatgacta acagactcac tttaacataa gcaatgaaag cacgctatca gttcagatgt 21541 tacccaacag accaacaacg ccaagctttg gcacagttgt ttggctgtgt tcgggttgtc 21601 tggaatgacg cgctcgcttt gtgcaaggct tccgaaaagt tgccagggta taacagtctt 21661 tctttggcat taactcaagc taaaaagact gaagcccgga tttggttgaa agacgtgtct 21721 tccgttcctt tgcaacaatc tcttagacat ctggatgtag cttacaaaaa cttttttaat 21781 tctcgcaacg ggaagcataa aggtaaaaag gtcggttgcc ctaggttcaa gaaaaaaacg 21841 aatgcacaat ctgctgaatt tacgaaaaca ggtttttcaa taactggcaa tgaagtatat 21901 ctagccaaga tcggaaacat aaagccaatt tggtctaggg aactgccttc ggaaccaagt 21961 tctgtaaccg tacttaagga ttgcgctaat cgctatttcg taagttttgt tgtagaagtt 22021 gaaccaattc aaatagagcc gattaaccaa tctattggaa ttgatttagg gctgaaaacc 22081 tttgcagtac ttagtaatgg tgaaactttc aagagtcctg actattcacg gcatgaccgc 22141 aaaggggtgt ggtcaaaagt ggttggttgc cagcacccgg aagggatact agacaaagca 22201 gagaaaataa ctgacaatga taatcataga gaagcttcat gattttctgg aggcaaagaa 22261 tctcagaaat accatgtcat acttggataa gtccgggtcg gggagcagtg tattacatgg 22321 gtaaagttag cgctttctcc atgtaatact taccaagata ttgactttca tcggtgtaat 22381 acacgttagg aagaacatcc agtaatagat gccatcgtta gagagcgaac aagcacgttg 22441 cttgagatgg caacacttga aactttacgc aaccaagttt tgtttgaact gaagttaggt 22501 aaacaagctc ctggttacaa agctgctcaa aaagccctga atcatttcat tgttaaacta 22561 agcagctaac gtgggatttt acgatttcgc cagcagccct ggcgctcgcg atcgcccact 22621 aggacaaata ggaatttatt ttcaagtcag tatcagcaac tgggcttaat gtctatatag 22681 ttagactagg aaaaaaacgg ggtagaactt accgtgtgat acatggataa atacaattaa 22741 aatagttact cgaataagac ttgaagtgta ggaaatatgg cagtgttaaa aaaagacgtt 22801 tgagaaaatt tcgatactgc catgagtcaa tattatactg atgttattga acaaattagt 22861 tcaattctgt ttgatgccat catagaaagg gaagagtctt tacaagaatt gatttgtgat 22921 tttgattcaa ccgcagccaa gattattcaa tcttgtggtt tatgtttaat gtcaaagctg 22981 ctgaataatt taagccatca 
agtgacccaa gaagctaaga caaaagagct agttatccat 23041 cgtgtcaaaa aaataaaata tgctgttata tttggagagc tagaagttcc ttcaccttat 23101 ttgtggaata aaaaagccaa gtctggattt tgtcccgttc ctaaaagttt aggcattgaa 23161 catgggggac gttcaaaggc tgttaaaaaa gcgttaacag attttggtat tgaagaatct 23221 tttggtcaag catcaaagcg ttttgtagaa cattacggtt gggaagtcga acgggcttct 23281 gtacgtcgag aggtagaagc tattgcatcc acggctttag attatgtaga gaaacgtctt 23341 gaatcagctt ctgacgaacc aatacaacaa aaagaggaga ctgtttcaag gctattagtt 23401 gagttagatg gttctcatgt acgtaccgga gttgtccaac caaaacccga gcagaaggat 23461 tctaagcgaa gaaataaaac acgtttaaga attgcaaaaa aacataacaa gattgcagat 23521 actcgaaaag attttctgca taaactttcg tctaaaatag tgcgtgaaaa ccaagccatt 23581 gttttagaag atcttaatgt ctcaggcatg gtcaagaata gaaaactagc ccgagctatt 23641 agtctccagg gatggcgtga gtttagagta ttgtgtgaag cgaaatctga caagttcggt 23701 cgcacattta aggtaattag ccggtgggaa ccgaccagcc aaacttgttt tgactgcggt 23761 taccgttggg gcaaactaga tcttaaagtg cgaaccgtaa cttgtttaca atgcagcgct 23821 actcacgata gagatgaaaa cgctgccaaa aatatagaga aagtcgggat agggaattgc 23881 cacgactcta aacggacgca cagaggacgt aagactgttt tggcagcaga cctcaacgaa 23941 gcgtcaagaa tcaccgcgcc tttaggcagg tgagtatttc aaaagtagct catttttagt 24001 ggttagtttt tagtcgctgc tcttcggtag agctacgagt tattcactaa cccttcgggt 24061 tcgccagttg ctaccctacg ggaagccgcc ctccgggcgt ctacaagtcg gcagagccgg 24121 acgcccggag ggtatctcct gagccctatg gctaacgcca cggcttgagg ccgaacggag 24181 acgctgagtc ccaaagggac acgctgcgcg ttcgctctta gcgtgcgctt gcgcttacgc 24241 gaacgccagt cgccaagtga ctgtacgttc gctcattttt gaagagagag aggaattaat 24301 acctgacagg gtaaggaaaa acctcataa // LOCUS NODE_1271_length_24270_cov_5.07251724270 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 24270) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24270) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24270 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(71..1132) /locus_tag="DP116_11285" CDS complement(71..1132) /locus_tag="DP116_11285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412266.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="agmatinase" /protein_id="PRJNA477356:DP116_11285" /translation="MSHQLQEYNPSGIGQINGNLFALPFDYESANLIIFGVPWEVTVS YGAGTANGPQRVLDASPQLDLFDFDNPDGWKQGIFMVEIPQDILEKNEYYRTLAAKII ERLEQGKPLTNTPDLTAVLAEINQACQQVNQWLFEQSKQAIEKGKRVAVIGGDHSSPL GYFQALAASYANYGILHIDAHADLRDAYEGFEFSHASIMFNAMKLPQISKLVQVGLRD ICHDEVQMINQSHGRIVAYYDPAMKQKLYSGTTWMDLCREIISHLPEYVYISFDVDGL DPKYCSSTGTPVPGGLELEQAFCLFRELINSGRKIIGFDLCEVGDAEWDGNVGARIVY KLANLIDLSQRRSSAFIQD" gene 1447..1767 /locus_tag="DP116_11290" CDS 1447..1767 /locus_tag="DP116_11290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11290" /translation="MSRTRSAINLNIIFSSFYNFILRKTEKKHSKTYDYTQYVSGRDY IFESIDNQAKGYMTGQGTGIKRGDYIILCHSSRTCRYQVEDIDYYSEPPNMWIALLQE VDFE" gene 2737..3051 /locus_tag="DP116_11295" CDS 2737..3051 /locus_tag="DP116_11295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137049.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PqqD family protein" /protein_id="PRJNA477356:DP116_11295" /translation="MTSKSLNGKISESSIIVAAVEQISSDLGGEAVILNLRSGVYHGL NEVGARIWNFIQQPKAVKDIKQKLLEEYEIEPEVCVADILTLLEELKAVELVEVKNET VA" gene 3035..3490 /locus_tag="DP116_11300" CDS 3035..3490 /locus_tag="DP116_11300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317180.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lasso peptide biosynthesis B2 protein" /protein_id="PRJNA477356:DP116_11300" /translation="MKQLRNFLKLSGSDRYLLAITFLLLGAIRLGLFFLAFRNLLKLL QKINKQNIRFPFENHGSQISVGKIVWAVNVTTRYMPGGAKCLARALTTQFLMNRYHYS SELRIGVAKEQGGQLEAHAWIEYEGRVAIGNLADLSRFIPLPSLEGVKL" gene 3487..5502 /locus_tag="DP116_11305" CDS 3487..5502 /locus_tag="DP116_11305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_096660590.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lasso peptide isopeptide bond-forming cyclase" /protein_id="PRJNA477356:DP116_11305" /translation="MSGIMGIYNLDGCPVDPEDLGRMVDILAHRGPDGADIWVDGSVG FGHRMLWTTPESKLEKLPLANGTGNLVITSDARIDNRDELINTLEFDNFLPEKITDSE LILAAYEKWGEQCPENLLGDFAFAIWDKQQQSVFCARDHFGVKPLYYYHQLDKAFVFA SEMKALFRLPQVPRRLNEVRIADYLALMMEDKAITIYQDILRLPPAHSMVVSQSGMKM WSYWELDPHCEIKMDSDEAYAEKFREIFTEAVRCRLRSAFPIASQLSGGLDSSAVTCV ARDLLAETKKTSLHTISTIFDKITECDERPFINAVLEQGGFIPHYVQGDEFGPLSNLD HIFRYEDEALLGPSHFYPWIVNRALKELGLRISLDGFDGDTTVCHGVTRLTELARQGN WKTCIQEVKAFSPHFNVSPYAAFCNYGLPHLKELAKKFRWIAFFQGVQLIHKHFGVSR KLLIRNHAIKPFLEQVRQWQHKHRKFANPFVSQTPLVKRNFAERIGLDERIQKLDALN EEPLTVREQHWRSLTQGVLPYTLERADQYAAMFSLEARHPFMDKRLIEFCLALPSEQK LYQGWGRMVMRRGLEGILPEKVQWRGGKADLTANFDDGLLNRNRQILDEVMSNQIEYL EKYIDSDFLQAAYQRLISGTEVRDEDITPIWQAVTLALWFDYKQVTP" gene 5644..5841 /locus_tag="DP116_11310" CDS 5644..5841 /locus_tag="DP116_11310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative RiPP precursor" /protein_id="PRJNA477356:DP116_11310" /translation="MKSSYTAPKLTVHGDVAQITQILGDTTRQDFVFLNGTPISGGND IGSKDICSGTTPKGPDCDPRF" gene 6170..7069 /locus_tag="DP116_11315" CDS 6170..7069 /locus_tag="DP116_11315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11315" /translation="MKSYKAYNLCIASELQLPELIESEGAPDAIVRFGKVDNATAMQH DSGQNFVGEIPEVGEFFIHDGREILMNPLPGVNEALLRTVLLGPILCVLLRQRGLLVL HASCIDMNNKGVAFMGGSGWGKSTLATAFHNHGYNVLTDDVLPIQIKTGQPVVFPSYP QFKLWPEAATSLGQDTKSLLPVSQNSFKVAYKLSRGFQQTPLPLHHIYVLDKGSEHKI TKIKPQEAFVELVRHTRAISSMTEQEFIADHLHLCSELIKNVSFCRFTRKPSLEDLPT LLKLIEDDLAQVSQKNEVHYTLL" gene 7192..9036 /locus_tag="DP116_11320" CDS 7192..9036 /locus_tag="DP116_11320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019489790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_11320" /translation="MTLISKLRSVNTTLSHLIKTFHLVWAASGYWTLAWMVMLLLQGL LPAMSITLTREAVDNLVAVSGAGLSSESIQKIVTPVGLMAGIVVVMELLNAAGEWVRT AQSELIHDYMTALIHKQSVAVDYGYYEYSEYNDKLDRARDGASGRSLALLESTGSLLQ NSVTLVAMAAVLIPYGLGLPMILLISAFPAFSVLLYISKIQYRWSQRTTSDRRWLMYY DYLLTNNTTAAEVRLFDYADYFQSSYQNLRKRLRREQFNLLRLQTLGRIVAAIIALTI TGGALAWMGRQVLLGILTLGDLALFFQAFNQAQSIVKDLLSNLGKIYRNSLFIGNLFE FLQIQPKIIDPPSPVPVPTKLKQGIRFRQVSFRYPGTKEPVLENFNLTLPAGKIVAVV GDNGAGKSTLIKLLCRFYDPDSGSIELDGIDLRHFSVKAFRRLVTVLFQSPIPYYTTA GENIALGDISAASNYSEIQAAAKASGIHDKIKRLPLGYNAMLGKLFPEGSDLSGGQWQ RLALARAFFRRAQIIILDEPTSAMDPWAEHDWLERFRTLASGRTAVVITHRFTLAMRA DIIHVMRAGQIVESGSHDQLLALDGLYAQSWKSQMQADSSNPVESRIV" gene 9052..10335 /locus_tag="DP116_11325" CDS 9052..10335 /locus_tag="DP116_11325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317175.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11325" /translation="MKILSKPNNQNQAINARSEIELLLCCARTHLNYLTTERIKALVQ EDIDWEYLIQTAYIHGVIPLLYLNINKICPKSVPATLLQQLQQDQYGRTQRNFILTSK LLKLLEFFEANSIPVIPFKGPTLAIWAYGDISRRDFYDLDILVRKQDFLKTKELLASQ GYRPYSNSSEKEAIYLSTLNSEQQKAYLESHWELHLVDECDGQSPAKGDRVTIDVHHG ILPKQFSSLFDTEWLWEDAHLKPFANKMVLNFTLEDLILVLCSQGAKDCWLQLNRVCD IAQVIRTSCEINWERICERAAKLRMTRILLLGLLLAHELLEVELPKSILQQIQASPLL KSLSSQIYTQLFCTTEYYLENWKVRSSFFHLKMIEHPWDKIRYCYEHLLVPTVADRVI LPLPNFLSFFYYLVRPIRLITMYLIGALPSKRYSK" gene complement(10423..11631) /locus_tag="DP116_11330" CDS complement(10423..11631) /locus_tag="DP116_11330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_11330" /translation="MKILIAHNRYVYAGGEDVVVQAERELLESYGHNVLIWEVNNDSI IGIQGKVKAALSVVYSSTSREQIKEKISHFRPDIVHVHNFFPLLSPAVYDACRDANIP VVQTLHNYRLACPKAMPLRDGKICEDCIGKVVPWSGVAHGCYRGSRVQSAAVATMLTF HTFRGTWQNRVDAYIALTNFHKDKMVQAGLPRDKIHVKPNFVLPPKFKSETHKLKNYA LFVGRLSEEKGVSTLIDAYAQGHLSIPLKIVGDGPLDEALRQQVQTRGLGEVITFLGR QTKATVLELMYNAKFLIFPSIWYEGFPLTIAEAFACSLPVIAAKLGSMAEIVEDGVTG LHFEVRNPLDLAAKINWAITHPEAIDTMEINARCTYEAKYTPEANYKQLMEIYNQVMN QAQTKSENVL" gene complement(11767..13131) /locus_tag="DP116_11335" CDS complement(11767..13131) /locus_tag="DP116_11335" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11335" /translation="MKRNEHPKNLPENLVLTSPQAGILKQSTMIVLFLASAMFPRVLS ALKFPSVVNFLHFAFALFLFVWVLSIIRSRIAVQLLVGVLILLAISTTSALLNKAGLI NVILSFLMLAEPFILLIVITSTKLPQDSIKTMQKWISRFVYIHFAFVYFQKFVLGYEN DFIKGIFLNMGAGHHVGGDIGLTFSLYFLLASGVRSIWLRIFVFILGVGNIIYADVKQ VIIAFLVSWLILIITQLQEVWKAIMYSAIAIPVTVFVLNLINNMYAGIRYISDMEVVT EAFQYKLSVFSIITSFYKSDWNFLFGFGPGHTISRLGWLMKDYIQFLKPLGVTGTSVF NTVWVAQESYYWSNSITGSSMYSLLFSWAGIWGDLGFMGLGIYLYLWFLVWKYICVDK LSKFFLITVFVAGLIFSWMEEPAFMLFVVSLIGLQWQKHQAEKSQRMFEKYSFYHAET SAVE" gene complement(13138..14400) /locus_tag="DP116_11340" CDS complement(13138..14400) /locus_tag="DP116_11340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_11340" /translation="MKVLVSAYACEPGRGSEPGVGWNMACQIAKHHQVWVLTSNTHRP AIEAELARHQPPNLNFVYLDPFGWVYDWSYEGKRSQWSVYLHHYLWQIWAYFVARRLD QEISFDVAHHITYGRYSSPSFLVFLRIPFVWGPVGGAESTPKPFWQDFSRRAKIYETL RNLLRWVGELDPFVHLTARKSTAAIVTTPETATRLKVFGAKRIEFLPGQTGINQQEFA RLEQLAVSSDETIIRFVSIGRLLHWKGFHLGIKAFAQTGLEQSEYWIIGEGPERERLE ELANQLGIADRTFFLGRISREETFHTLAKCHVVVHPALHDFSPTVCLEAMVARRPVVC LNLGGPAIQITEETGFKIDAQTPEQAVTDMAKAMTLLAKDPDLRLRMGQAGHKRVREA FSWESKGQFFAQLYKEISTQDKSYVKLG" gene complement(14422..15558) /locus_tag="DP116_11345" CDS complement(14422..15558) /locus_tag="DP116_11345" /inference="COORDINATES: protein motif:HMM:PF00534.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_11345" /translation="MSHKSIAIFTWGLDGGAFTNLPVALTKGFWNLGVKDLYILYLSK GPAEDITFPEGVKLVSLGVERSMASPIALSRFLKTAKPDVLIAMPTIISIPAIMGWLL AQQRQTKFVIYQGDTLTSDIAIDHKHNLRMQIMPWLARLLYPTANGLTTVSQGVLDIL KQDHIPIPRNRVAVIPNPVDVENFLVRSQAEEPEHPWLRHKDSPVIVSLGRLGKRKNY PMLLQAVAKVRSHLKVKLIIFGEGPERKNLQELIAKLGLQEDVSLPGHVANPWSHIAK ADVFVMSSLDEAFCLALVEAMACGVPVISTDAIGGGPRSILEDGKYGNLVPTDDVQAL ADAIHKVLTSQSWRSHLIDVSKQRCQAFEPEAVAKQWLSFLQQV" gene complement(15600..16706) /locus_tag="DP116_11350" CDS complement(15600..16706) /locus_tag="DP116_11350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869256.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase family 2" /protein_id="PRJNA477356:DP116_11350" /translation="MQELQTEKQQPIELPKLPENPLVSILIANYNYAKYIGETLDSVL SQTYPHFEAIICDDGSKDNSCEIVEIYANKDSRIKLVRQPNGGVASALNTAYRNSSGE IICVLDADDIWMPSKLQKVVEAFQSDAKGGFVIDNLINIDGDSKIIKSTPIFSELSSG WKAPFAMENGGFAYNIPPASGLSIRRKVADFIFPMNEAFTRNADSLISYLAPLITKIV AIPEVLTQFRLHGSNLTSSRSYVLKDFERDVSNWERTHQEQKQLLTRVYGKEVAEKLT GLEKSVKYLHFRYVAMRLKRLPRKETKKAHQQLTAHQHFGYSAPERWFLPWAEYLPDV LFAKMFGVVYGANPVKHFLKNLVGDLTNPRYVLR" gene 17418..19505 /locus_tag="DP116_11355" CDS 17418..19505 /locus_tag="DP116_11355" /inference="COORDINATES: protein motif:HMM:PF12708.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11355" /translation="MIQFERSKKFWKRLSSLCGILVLSLIVVVITPTVFQNISVSVLG VSEGKSLKKGESKCVVDKGSTGDKLFPKNFVINVKTQYGAKGDGVADDTQAIQKAIQE NVGTNRIVYFPKGTYLVSDRLEWRDKNGKWQSQLWLQGQNRATTIIRLKDNAPGYNDP KNPKAVVYTASGLYVEQPNGGGKDYPAKGEGNEAFANYIEDLTIDSGKNRGAIALDYL ANNTGAVRNVRLRGQGLVGLDMTRRWIGPAMVKNVIVEGFDYGVRIASEVNGVTVEHM TLCNQKTVGLENSGNVVAIRDLSSNNRVPAVRNINSTSLMTLVDGKLYGGDRSMSAIE NNSRLFARNVRSSGYKSVLKQDKKDLKQTVLKEFTSNAPFTLFTSSLKTSLNLPIQES PNFHDNNPSNWANVEDFGARGKKADGGEDWDDDTAAIQKALDSGKSTVYFPPGRYFVS DTLRVKGNVRKITAISATLSSSGSAFENANKPKPFIRIENGKASDVTIEHLSIINLHQ NAPPRLGFIGFEQATSRTLFLKDMTCCSLKPNDQKYVFRNTPKAGKLFIEDVSAESWQ FEHPQQVWARQLNPEGSSKKIFNNGGKLWVLGLKTEGGNVNTVLHTKGGGASELLGAL LYVTGNIPPNEIAFINDNSRVALSYTTMSYGAKDFQIHIQEKRKSNNRQLTRDKLLSN GSGRAVPLYMGGQ" gene complement(19745..20641) /locus_tag="DP116_11360" /pseudo CDS complement(19745..20641) /locus_tag="DP116_11360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861755.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(20772..22595) /locus_tag="DP116_11365" CDS complement(20772..22595) /locus_tag="DP116_11365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_11365" /translation="MNVISRIQFPRMPETSGLYMKFDEGTSLSPCVDETKVMFHQNTI VSFNTYFHSFYEKFYGKYTELKSVYYLLALEGHFIVSLYREHHEQDERELIYKESFEN CQPGEPIKILLPHSWLDEDAGRVYLEIMCLSKEGFFTEGYVATDEKPLREVSLGIITC TFKKEAYIKKTVDNLLKDNFIKCKNFKIFIVDNGKTLKEQDFYHSNVQLIPNRNLGGS GGFTRGLYEALQEDKYTHFLFMDDDIELNSESIYRLFTLYEYAKDDFAVAGGWLDLAK KHMLYEAGAFYGEKHDSLGYKAFSVNPLKRNLDLQNSTFLNSLLKEEDFDYGGFWFFS CSQEVVKKIGLLMPFFIQRDDIEFCLRIKKYAGNKIIFFPPISVWHEPSYAISKQPAW LIYYVWRNSLITSCIHSSLKYMDAIKHILRSLIIELFFFNYSYAVMLVKGFEDCMKGP AFLQSIDAETLHSTILELSKRYESQTLSCDNYLVEDFYQKPEESSWKKIISILTLNGH LLPSFLISDSQALVVSAPGHSKEWLKGFAKRKVLMFKEEKNTFYQYELNQWAGIKLLS RFLKIAMTGVIKWSFISRHWKNASKELVSIKFWQAYFGLTT" gene complement(22807..23928) /gene="glf" /locus_tag="DP116_11370" CDS complement(22807..23928) /gene="glf" /locus_tag="DP116_11370" /EC_number="5.4.99.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317167.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-galactopyranose mutase" /protein_id="PRJNA477356:DP116_11370" /translation="MKFDWLIIGAGYSACVIAERVANELGQRVLIVEKRDHIGGNAYD YYNEHGILVHKYGPHIFHTKSKKVWDYLSQFTEWRHYYHHVLGVVEGKKVPVPFNLNS LYALFPPKYAEKLENLLLENFGFGVKVPILKLREHASGDLTFLAEYIYENVFLRYTMK QWGVKPEELDRAVTGRVPVYISRDNRYFQDPYQAMPKLGYTEMFRRMLAHPNIKVLLN TDYREIINEVKFNRILCTGPIDTFFDYMYGELPYRSLRFQFETLDQEQYQEVGTVNYP NDYDITRITEQKYLSGQTSPKTTLVMEYPQAYVPGKNDPYYPILNEENRERLELYLKE VEKLNGSVLFAGRLADYKYYDMDHAVLRALGVFEKEIAK" gene 23954..24142 /locus_tag="DP116_11375" CDS 23954..24142 /locus_tag="DP116_11375" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11375" /translation="MDIEAPSIKKIFNKKVVRSQEPEFRMNLGYVVDDLGGLSPLLNQ KFSSLQLVAGQNPPLKDC" BASE COUNT 7226 a 4892 c 4891 g 7261 t ORIGIN 1 ccctgttccc tgttccctgt tccctgttcc ctgttccctg ttccctgttc cctcttaatg 61 tcgcccagta ctaatcttgt ataaaagcgc ttgatcgtcg ctgagataaa tcgatcaaat 121 tcgccagctt gtaaacaatc cgcgcaccaa cattaccatc ccactcagca tcaccaactt 181 cgcacagatc aaaaccaata atttttctac cactatttat caattcacgg aacaaacaaa 241 aagcttgctc taattccagt ccacccggaa caggagttcc tgtacttgaa cagtattttg 301 gatctagacc atctacatca aagctaatgt aaacatattc aggcaaatga ctgataattt 361 ctcgacataa gtccatccaa gtcgttcccg aatacagctt ttgtttcatg gctgggtcat 421 aataagcgac aattcgacca tgagattggt taatcatttg cacttcatca tgacaaatat 481 cacgcaagcc tacctgcact aacttggaaa tttgcggtag cttcatcgca ttgaacatga 541 tggacgcatg ggaaaactca aatccctcat aggcgtcgcg taaatctgcg tgtgcgtcta 601 tgtgcaaaat gccatagttt gcgtaactgg ctgctaatgc ttggaagtaa cctaatggtg 661 aactgtgatc gccaccaatc acagcaactc gtttaccttt ttcaattgct tgcttagact 721 gttcaaacaa ccattgattt acctgttgac aagcttgatt aatttctgct aacacagctg 781 ttaaatctgg tgtatttgtc aggggtttac cttgctcaag tcgctcgata atttttgccg 841 ctaaggtgcg ataatattcg ttcttctcta aaatatcttg gggaatttct accatgaaaa 901 ttccctgttt ccatccatca ggattatcga aatcaaacaa atccagttgc ggcgaagcat 961 ccagaacccg ttgtggtccg ttagcagttc ctgcgccata ggaaacggtc acttcccagg 1021 gtacgccgaa gataatcagg tttgcagact cgtaatcaaa tggtaaagca aaaaggttgc 1081 cattaatttg acctatgcca ctgggattgt actcttggag ttgatggctc atgtgggtag 1141 aaaaaacacc gcttggtgga cacctgcact attgtagcga accattagct tatggtatct 1201 tgcttgtcag acgcacccta ggcaatgtgg ctatccacaa ttattaaggt agggcgcgcc 1261 aagagcgatc gcccttcttc gcccagaggc ttgggtctaa acgtgtaaaa atccggcaac 1321 ctatgccaac tgatagcttg gcgataggct tttatgcgtc tgttactttg cgcaatcgcc 1381 ctgataatag aggtgaggca ttgtccccta aaaatccaat cgcctttatt tctttgccaa 1441 tcgcttatgt ctcgtaccag gtcagccatc aaccttaata ttattttttc gtcattctat 1501 aactttatat 
taaggaagac agaaaaaaaa catagtaaaa cttatgacta cactcaatat 1561 gttagtggac gtgactatat atttgaatcg attgataatc aggcaaaagg ttatatgact 1621 ggacaaggta caggtattaa acgcggagat tatatcattt tatgtcacag ttctcgtact 1681 tgtcgatatc aggttgagga cattgattac tactctgagc cacccaatat gtggatagct 1741 ttacttcaag aagttgattt tgaatagtag agattttcat ttttctagga tttaaagcag 1801 aaagcccacg gcgcgtcttt ccatgtgcgc accaatatat catgtctgtg ggactgtcaa 1861 tagattgaaa ctccatcgct tagcaagtgg agttctcctg actctaccaa ttctggattc 1921 tgactccttt tattgaactt taaaattctt tctgtgaatt ccaagacaaa ggtgagaatt 1981 tattttttca tataaaatac aaaagcgata attatgttga gaaacataca gtttttttta 2041 attgagagac tcttttgagt cattgcaaag attagatatg tcaattgaca catacttaac 2101 agtattgtgt tgtttttatg aactttagga aagttacttg catgatcttg tattaattat 2161 cgtagaatat caggtaagca ttcgttgaaa taagcatata caaacttgag tacaattaag 2221 tttgttggaa aagccatcta ccactacaga taaaaaatcc agtttgtaat ctagatgatt 2281 gcatgttttg agatcaggga cagtttataa gtaggtagac acaagtaaac ctaacgacgt 2341 aacataatat aaagtggtca aaactattgc gattgcgcca gattgcggaa agccctctcc 2401 ggggtctacg ttccactcgc aataatatac tgaaaaattt tcccgcttac ctacttaaca 2461 gttaagactg tgacagtttt tcttatcaga gtaaagcttt ggaaactcaa ggaattggta 2521 tagttcgagt agtctactag acaagtaaaa ccctagaaat agcatagagt tagttgctgt 2581 gtgtcaatca ctcatcttga ttatttcaag ttgcataaaa ttttaataat taatgtacct 2641 gttgagagat ttccagtgtc tttactgatt tgaaaatcaa tttctacaaa taactttttg 2701 aataaaatca agcatcaaaa ggaaacccct gtaatcatga cctcaaaatc attgaacggc 2761 aaaatctcag agtcgtcgat aatagtagct gctgtagagc agatatcgtc tgatcttggt 2821 ggcgaagcag ttattctgaa tctcagatca ggagtctatc acggactcaa tgaagtagga 2881 gcaagaattt ggaatttcat tcagcaaccc aaggcagtga aagatattaa acagaagctt 2941 ctggaagaat atgaaataga accagaagtt tgcgtggcag atatcttgac actgttggaa 3001 gaattgaaag ctgtagaact ggttgaagtt aagaatgaaa cagttgcgta actttctgaa 3061 actttctggg agcgatcgct atcttttagc aatcactttt ctcttgctag gagcaattag 3121 attagggctg ttttttctcg cctttcggaa tttactaaag ctcttgcaaa agataaataa 3181 gcaaaatatt cgtttcccat 
ttgagaatca tggaagtcaa atttcagtag gcaaaattgt 3241 ttgggctgtt aacgtaacaa cacgctatat gcctggtggg gcaaagtgtc tggctcgcgc 3301 tttgacgact caatttctca tgaaccgcta tcattactca tctgaactac gtataggtgt 3361 tgctaaagaa cagggaggac aattagaggc tcacgcttgg attgagtatg agggacgggt 3421 tgcaattggg aaccttgcag acctttcccg gtttattcca ctcccatctc tcgaaggagt 3481 aaaattatga gtgggatcat gggcatttac aatttagacg gttgccctgt agatccggaa 3541 gatttgggac ggatggtaga tatcttggca caccggggac ctgatggtgc agatatctgg 3601 gttgacggat ctgtcggttt cgggcatcgg atgctgtgga cgactcctga atcaaagctc 3661 gaaaaattac ctttagctaa tgggactggt aatttagtca tcacgtctga tgctcgcata 3721 gataaccgag atgaacttat aaatacgtta gagtttgata attttctccc tgaaaaaatt 3781 acggatagcg agttgatatt agctgcctat gaaaaatggg gcgagcaatg tccagagaat 3841 cttttgggtg attttgcatt tgccatttgg gataagcagc aacagtctgt tttctgcgct 3901 agagaccatt ttggggtcaa gcctttatat tactaccacc aacttgataa agcctttgtg 3961 tttgcctcgg agatgaaagc tctttttcgt ttaccgcagg taccacgccg actcaacgag 4021 gtgagaattg ctgactactt agcattgatg atggaagata aagccatcac tatttaccaa 4081 gatattctgc ggcttccccc tgctcacagt atggttgtca gtcagtcagg aatgaagatg 4141 tggtcttact gggaacttga cccccattgt gagattaaga tggattctga cgaggcttac 4201 gcagagaaat ttcgggaaat ttttaccgag gcagtacgtt gtcgcttacg tagtgccttc 4261 cctatcgcca gccagttgag cggtggatta gattcttcgg ctgtgacttg tgtggcgcgg 4321 gatttacttg cggaaacaaa gaaaacttcg ctacacacta tctccactat ttttgataaa 4381 attactgagt gtgacgagcg ccctttcatc aatgcagtgc tggaacaggg tggatttatt 4441 cctcactacg tccagggtga cgagtttggt ccgttatcta acctagacca tatttttcgg 4501 tacgaagatg aagctctttt gggtccaagt catttttatc cttggattgt aaaccgtgct 4561 cttaaggagc tgggactgcg gatttccctg gatggttttg atggggacac cacagtttgc 4621 cacggcgtga ctaggcttac cgaactcgca cgtcaaggaa attggaaaac atgtatacaa 4681 gaagttaaag cattttcgcc acactttaat gtctcacctt acgccgcatt ttgcaattat 4741 ggactgccac accttaagga attggcaaag aagttccgtt ggatagcgtt ctttcaagga 4801 gtacagctaa tccataaaca ttttggtgtt tcgcgcaaac tgttgatacg aaatcacgca 4861 attaaacctt ttctggaaca agtacggcaa 
tggcagcaca aacatagaaa atttgccaat 4921 ccttttgttt cccagactcc ccttgtcaag cgtaactttg cagaacgtat tggtttggat 4981 gagcgaattc agaagttgga tgcattgaat gaagaacccc tcaccgtgag agaacaacat 5041 tggcgcagtc ttacccaagg tgtattgccc tataccttag aacgagcgga tcaatatgcg 5101 gcaatgtttt ctttagaagc gcgtcatccg tttatggata agcgactaat tgaattttgt 5161 ttagcattac cctccgagca aaaactttac caaggatggg gacggatggt gatgcgtcgt 5221 ggcttagaag gaattctacc agaaaaagtt caatggcgtg gaggaaaagc agatttaaca 5281 gccaatttcg atgatggact gttgaatcgc aaccgccaaa ttctggatga ggtgatgtcc 5341 aatcagattg aatacttgga aaagtacatt gattcagatt tcctgcaagc agcttatcag 5401 cgattgatat caggaaccga agtcagagat gaggatatta caccaatttg gcaggcagtt 5461 actttagctt tgtggttcga ctataagcaa gtgacgccgt aagtccagca gtcttggcag 5521 gttatggctg gtaatattgt ctgtcgtatt tacgacaacc atgtcacacc gtaagtccaa 5581 ttatggggca ctaccttcaa caactagtag atcaacttta gactcaaaat caggaaatag 5641 atcatgaaaa gttcgtacac tgctcccaaa cttactgttc acggtgatgt tgcccagatc 5701 acacaaattc ttggtgatac aacaaggcaa gattttgtgt ttctcaatgg cacccctatt 5761 tccggtggta atgacatcgg ttcaaaggac atctgctctg gcacaactcc aaaaggaccc 5821 gactgcgatc ctaggttcta ataagtgaat gaagtgtttg tcttcatgaa gtgtgtgttt 5881 agcacatagc gcctgctcta agaaactagt ggttcatgtc atgaccttcg tgacggacat 5941 gattacgttt atttatgtcc atttacttag gctttaaaag tagagagcaa aaagcgcaat 6001 tttgtaggtt ggttagggtt tggaaactca acagattttg cttgggttta aataccctct 6061 tatctaacct acaattattt actatctgca agcgtaataa tacggacaca agacggtcgg 6121 tgaatttgtt tgaaatcgct gttttcattg cactaaataa agttaagata tgaaatctta 6181 taaagcctac aacttgtgca ttgcttccga attacaactt cccgaactga ttgaaagtga 6241 aggtgctcca gatgcaattg tgcggtttgg caaagtagat aatgcaaccg caatgcaaca 6301 cgatagcggt caaaattttg tgggagaaat accagaagta ggtgagtttt ttattcacga 6361 tggacgggaa attctgatga atcctctacc aggtgtgaac gaggctcttc ttcgcactgt 6421 acttctaggt cctatattat gcgtactgct gcgacaaagg gggctgctag ttttacatgc 6481 tagttgtatt gatatgaata ataagggtgt agcgtttatg ggtggttcag gttggggaaa 6541 gtctactttg gcaactgctt ttcacaatca cggctacaat 
gttttaacag atgatgtact 6601 gcctatccaa attaagacag gtcaacctgt tgtttttccc agctatcccc agtttaaact 6661 ttggccggaa gcagcaactt ctttaggaca agatacgaaa agcttattgc ctgtttctca 6721 aaactcattc aaagtagcct acaaactctc tcgtggtttc caacaaactc cccttccatt 6781 gcatcatatt tatgtattag acaagggaag cgaacacaaa attaccaaaa ttaaacccca 6841 agaagccttc gttgaactgg tgcgtcatac tcgtgctatt agctcaatga ccgaacagga 6901 atttatcgca gatcatttac atctttgtag cgaacttatc aaaaatgtca gcttttgtcg 6961 ctttacgcgc aaaccttctc tagaagattt accgacacta ttgaaattaa ttgaggatga 7021 tttagcacaa gtttctcaaa aaaacgaagt gcattacaca ttgctataga taagggttta 7081 ggaaatagtt agtagtggaa agtaaacaag tttagacttt tgactttatg ctgcgtgctg 7141 tttcaatttg tcaattaaat atttatcagt ttaatgaatt tataggattt gatgactttg 7201 attagcaaac ttcgcagtgt caatactaca ctttcacact taataaaaac ttttcactta 7261 gtctgggctg catctggtta ctggacttta gcttggatgg tgatgctgtt gctgcaagga 7321 ttgcttccgg ctatgtctat caccctcacg cgtgaggctg tggataattt ggttgcagtt 7381 tctggtgctg gtctttcttc tgagagtatt cagaaaattg ttacaccagt aggattgatg 7441 gcaggtattg tggtggtgat ggagttgtta aatgctgcgg gggaatgggt tcgcacggcg 7501 cagtcagaac taatacacga ttatatgact gctttgatac acaagcagtc agttgctgta 7561 gactacggtt attacgaata ctcagagtat aatgacaaac ttgatcgagc gcgggatggg 7621 gcgagcgggc gatcgcttgc tttgttagaa agtactggca gcttgttgca aaatagcgtc 7681 actttggtag cgatggcagc ggtgttaatt ccctacggtc tagggctacc gatgattcta 7741 cttatcagtg cctttcctgc tttttctgta ctattgtata taagtaaaat tcaatacagg 7801 tggtcgcagc gaacaacaag cgatcgccgt tggcttatgt attatgacta tctgctcaca 7861 aacaatacca cggcagcgga agtacgattg tttgactatg ccgactactt ccaatcgtca 7921 tatcaaaatt tgcgtaagcg actcagacgc gaacaattca atttactgag actgcaaact 7981 ttaggtcgca ttgttgctgc tattattgca ctgacaatca caggtggggc gctagcctgg 8041 atgggtcggc aggtgttgct aggaattttg actttaggag atttagcgct atttttccaa 8101 gcattcaatc aggcacagag cattgtcaag gatttgctca gtaatttagg aaaaatatat 8161 cgcaatagct tgtttatcgg taatttattt gagttcttac agatccaacc caaaattata 8221 gatccgccca gtcctgttcc tgtaccgaca aaactcaagc aaggaattcg 
ctttcggcaa 8281 gtcagcttcc gctaccctgg tactaaggaa ccagtattag agaattttaa cctgacgctg 8341 cctgctggca aaatcgtggc ggttgttggt gataacggtg ctggaaaaag taccctaatt 8401 aaacttcttt gccgcttcta tgaccccgat tctggaagca ttgagctcga cggtatcgat 8461 ttgcgtcact tttcggtgaa agcattccgc agactagtta cagttttgtt ccagtcacca 8521 attccctact acaccacggc tggagaaaat attgctttgg gtgacatatc ggcagcatca 8581 aattactcag aaattcaagc agctgccaaa gcgtcgggta ttcacgataa gattaagcgc 8641 ttgcctctgg gttataacgc tatgttgggt aagttatttc ccgaaggaag tgatctcagc 8701 ggcggtcaat ggcaacgttt ggcgttagca agggcatttt ttagacgcgc ccaaatcatt 8761 attttggatg aaccgactag tgctatggac ccttgggcgg aacatgattg gttagagaga 8821 ttccgcacct tggcgagtgg tcggactgcg gtggtcatta ctcatcgttt cactctggcg 8881 atgcgggctg acattattca cgttatgcgt gctgggcaga ttgtggaatc agggagtcat 8941 gatcaattgc tggctttaga tggactttat gcccagtctt ggaaatcaca aatgcaggct 9001 gattcaagca atcctgttga gagcaggatt gtttagtagt gagaagcacc aatgaaaatc 9061 ttatctaaac ccaataatca aaaccaggca atcaatgctc gctctgaaat agaattactt 9121 ctttgttgtg cccgcactca cctaaattat ttaactactg agcgtattaa agccttagtt 9181 caggaagata tagactggga ataccttatt caaacagcat atatccacgg tgtcataccg 9241 ctcctttact tgaatataaa taaaatatgt ccaaaatcag ttccagcaac tttgctacag 9301 cagctgcaac aagaccagta tgggagaact cagcgcaact ttatcttaac aagtaagctg 9361 ctcaagctat tggagttttt tgaagcaaac agcatacctg ttattccatt caaaggtcct 9421 acccttgcca tttgggccta tggtgatatc tcacgtcgag atttttatga tttagatatt 9481 ttggttcgca aacaagattt tttgaaaaca aaagaactac tggcatctca aggatatcga 9541 ccatactcca acagcagtga gaaagaagca atttatctta gtactcttaa ctcagaacaa 9601 cagaaggctt atctggaatc ccactgggaa ttgcatctag tggatgagtg cgatgggcaa 9661 agccccgcca agggcgatcg cgttacgata gatgtccacc acgggatatt accaaagcaa 9721 ttctcatctt tgtttgatac tgaatggcta tgggaagatg ctcacctaaa gccttttgcc 9781 aataagatgg ttttgaattt taccctggaa gacctcatct tggttctctg ttctcaaggt 9841 gctaaagact gttggctaca gctaaaccgg gtttgtgata ttgctcaagt catccgcact 9901 tcttgtgaga ttaattggga gagaatatgt gaacgggctg caaagttgcg tatgacgcga 
9961 atcctcttac ttggtctttt actagcgcat gaacttttag aagtagagct tccgaagagt 10021 atattgcagc aaatacaagc gagtccattg ttgaagtctc tttcttccca gatttacacc 10081 cagcttttct gcacaaccga gtattatcta gaaaattgga aagtgagaag ttcatttttt 10141 catttaaaga tgatagaaca tccctgggat aaaatccggt attgttacga gcatttgctt 10201 gttccaacag tagcagaccg agtgattcta ccactaccta actttttatc tttcttttac 10261 tatttagtcc gaccaatccg tttaataaca atgtatctaa taggcgcgtt gccttcaaag 10321 agatattcaa aatagagaga gcaacattaa tgaacagtta tcaatgaaca atgagcaatg 10381 agcaatgagc aattataaaa tcgataattg ataattgatt ggtcataaca cgttttctga 10441 tttggtttgc gcttgattca taacttggtt ataaatttcc atcaactgtt tgtaattagc 10501 ttcaggagtg tacttagctt cgtaagtgca gcgagcattt atttccatag tatctattgc 10561 ttcagggtga gttattgccc aatttatttt agctgccaaa tccagtggat tcctaacttc 10621 aaaatgcaga cctgtgactc catcttctac aatttctgcc atacttccta atttggcagc 10681 tatcacaggt aaactacagg cgaacgcctc tgctattgtc aatggaaaac cttcatacca 10741 aattgaaggg aatatcagaa atttggcatt gtacatgagt tccaatacag tcgctttagt 10801 ttgtcgccct aaaaatgtga ttacctcccc aagtcctctg gtttgtactt gctggcgcag 10861 tgcttcatcc aatggtccat cacctacaat ttttagtggt atgctcaaat ggccttgtgc 10921 ataagcatca atcagcgttg acacgccttt ttcttccgaa agtcgcccga caaagagtgc 10981 gtagtttttc aacttgtgag tttcagattt aaatttagga ggcaacacaa aattaggttt 11041 gacgtgaatt ttgtccctgg gtaagcctgc ctgaaccatt ttatctttat gaaagttggt 11101 caaggcgatg taagcgtcca cgcggttttg ccaggtacca cgaaatgtat gaaaagtcaa 11161 cattgtggct acagcagcgc tttgtacccg tgaaccacgg tagcatccat gagcaacacc 11221 agaccaagga acgactttgc ctatgcaatc ttcacagatt ttaccatctc gtaatggcat 11281 tgcttttgga catgcaagtc gatagttatg gagggtttgt acgactggga tgtttgcatc 11341 tcggcaagca tcgtatactg ccggagaaag aagaggaaaa aagttatgta cgtgaactat 11401 atcgggacgg aaatgagata ttttttcttt gatttgctct cttgatgtag atgaatagac 11461 aacagaaagt gcagctttta cttttccttg aattccgatg atgctatcat tattaacttc 11521 ccatattaag acgttatgtc cgtaagattc taacaactcg cgttcagctt gtacaacgac 11581 atcttcgcct cctgcgtaaa catagcggtt atgggcaatg 
agaattttca taggatgaat 11641 ctttaactga gaatttttac aaaggctttt gatgaaagtt aaatttatac gaaataaggt 11701 ttgtaattgc gcttaagcgc tactacaaac agctttaatt tcgagaattg gtattatcca 11761 gatgcttcac tccactgcgc tagtttcagc atgataaaaa ctatactttt caaacatcct 11821 ctgagatttt tctgcctgat gtttttgcca ttgcagccca attagactca cgacaaaaag 11881 cataaaagca ggttcttcca tccaagaaaa gattaaacca gctacaaata cagtgatgag 11941 aaaaaatttg gaaagtttat ctacacaaat atacttccac actaaaaacc agagatataa 12001 atatattcct aaccccataa atcccaaatc tccccaaata cctgcccaag aaaataataa 12061 agagtacata ctagagccag taatactatt agaccagtaa tagctttctt gggcaaccca 12121 tactgtatta aagacagaag tgccagtcac acctaatggt ttcagaaatt gtatgtaatc 12181 tttcattagc caacctaagc gactaatagt atgaccagga ccaaatccaa acaaaaaatt 12241 ccaatctgat ttatagaagg aggtaataat agaaaataca gataatttat attgaaaagc 12301 ctctgtcacg acttccatgt cacttatata gcggattcca gcgtacatgt tattgataag 12361 attcaaaaca aaaacagtca ctggaatggc aatagctgaa tacattattg ctttccaaac 12421 ctcctgaagt tgcgttatga tcaagataag ccaggagact agaaatgcaa taattacctg 12481 tttgacatca gcataaataa tatttcctac acctaagata aacacaaaga ttcttaacca 12541 gattgaacga actccagaag cgagaagaaa ataaagacta aaagttagac caatatcacc 12601 tcctacgtgg tgtcccgctc ccatgttcag aaaaattcct ttaatgaaat cgttttcata 12661 tccgagaacg aatttctgaa aatagacaaa agcaaagtga atatagacaa atctagatat 12721 ccatttttgc attgttttaa tgctatcttg aggaagcttt gtactggtaa ttactattaa 12781 caatataaaa ggctcagcca gcattaaaaa actcaggata acattgataa gtcctgcttt 12841 attgagcagg gcactcgttg tgcttattgc taataaaatt agcaccccta ctaatagttg 12901 tacagcaatt cgactacgaa ttatagataa tacccataca aataaaaaga gggcaaaagc 12961 aaaatgcaaa aagttaacga ctgaaggaaa ctttagtgct gaaagaactc gtgggaacat 13021 agcagaagcg aggaatagca caatcatggt agactgtttg agaattcctg cttgcggtga 13081 tgttaagact aagttttctg gtaaattttt cgggtgttcg ttccgcttca tgattttcta 13141 acccaatttc acgtaggatt tatcttgggt tgaaatttct ttgtaaagtt gagcaaagaa 13201 ttgtccttta ctttcccaac taaaagcctc tcttaccctc ttgtgtccag cttgacccat 13261 acgcaatctc aaatctggat 
ctttagctag cagcgtcatg gctttagcca tatcagtgac 13321 tgcttgttca ggtgtttgag cgtcaatctt aaaaccagtt tcttctgtaa tttggatagc 13381 tggtccacct aaattcaggc aaactactgg acgccttgcg accatagctt ctaagcaaac 13441 agttggtgaa aaatcatgca gggcaggatg aacgacaaca tggcattttg ctagagtatg 13501 aaaggtttct tcacgagata ttcttcctaa aaagaaggtt ctgtcggcaa ttccgagttg 13561 atttgcaagt tcttccagcc tttctctctc tggtccttct ccaataatcc agtactcact 13621 ctgctctaaa ccagtctgag caaatgcctt tatgccaaga tgaaatcctt tccagtgtaa 13681 gagacgtcct atactgacaa atcgaataat agtttcgtcc gatgaaaccg caagttgttc 13741 aagacgggca aactcctgtt gattaattcc agtttgtcca ggcaaaaatt caatcctttt 13801 agcaccgaaa actttcaaac gggttgcggt ttctggagtc gttactattg cagcagtact 13861 tttgcgagct gttaggtgta caaatgggtc aagttcacct acccaacgca aaaggttacg 13921 cagagtctcg taaattttag cgcgtctgct gaagtcttgc cagaatggtt tgggtgtaga 13981 ttccgcgcct cctactggac cccaaacgaa gggtatgcga agaaagacta gaaaacttgg 14041 gctggaatat ctaccatagg ttatgtgatg tgcaacatca aagctaattt cttgatcaag 14101 tcgtctggct acaaaatatg cccaaatctg ccataagtag tggtgaagat agactgacca 14161 ttgcgatcgc tttccttcat aagaccagtc ataaacccaa ccgaatggat cgagataaac 14221 aaagttcagg ttaggcggtt gatgacgagc aagttctgcc tcaattgctg gacgatgagt 14281 attagaagtc aaaacccaga cttgatgatg tttagcaatc tggcaagcca tgttccagcc 14341 tactcccggc tcagagccac gaccaggttc acaagcataa gcagacacaa ggactttcat 14401 tgatggaatt ctccttatca tttagacttg ttgcaaaaaa gataaccact gtttggcaac 14461 tgcttctggc tcaaaggctt gacaacgctg tttactgaca tcaattaagt gcgatcgcca 14521 actctgtgaa gtcagcacct tgtgaattgc atcagcaagc gcttgcacat catctgtagg 14581 tactaaattc ccgtatttgc catcttctaa gatgctgcga ggtcctccac caatcgcatc 14641 agtggaaata acaggtactc cacacgccat tgcttccact aatgctagac aaaacgcttc 14701 atctaaagac gacatcacaa aaacatcagc tttagcaatg tgactccaag gattagcgac 14761 atgaccaggt aaactgacat cctcttgcaa acccagctta gcaatcagct cctgcaaatt 14821 ttttctttct ggaccttctc cgaaaataat cagcttaact ttgagatggc tgcgaacttt 14881 tgcgactgct tgcaatagca tgggatagtt tttgcgcttt cctaaacgcc cgaggctgac 14941 
aataaccgga ctgtccttgt gtcgaagcca aggatgttca ggttcttctg cttgactacg 15001 cactaaaaag ttttcaacat cgacaggatt cgggatgact gctacacgat tgcggggaat 15061 tgggatatgg tcttgtttga gaatatcaag aacaccttgg ctgacagtag ttaaaccatt 15121 ggctgtagga tataaaagcc gtgctagcca gggcatgatt tgcatgcgca ggttatgctt 15181 gtggtcaatt gctatgtctg aggttaaagt atccccttga taaatcacaa attttgtctg 15241 tcgctgttga gctaatagcc atcccataat tgcagggata gagataattg tgggcatagc 15301 tatcagcaca tctggcttag ctgttttcaa aaaccttgat agagcaatcg gagaagccat 15361 cgatcgctca acacccagtg acactaattt aacaccttct ggaaaagtga tatcttctgc 15421 tggtcctttt gagagataca agatgtataa gtctttgact cccaaattcc aaaacccttt 15481 agtcagcgcg actggaaggt ttgtgaatgc tccgccgtct aaaccccaag taaagatagc 15541 aattgatttg tgactcatct atcgtccttt gcaattatcc ttaaacatct ggtttggaat 15601 tacctgagaa cataacgcgg attggtcaaa tctccgacta agtttttgag aaagtgcttg 15661 actggattgg ctccatatac aacaccaaac attttggcaa acagaacatc aggtaaatac 15721 tctgcccaag gtaaaaacca tcgttcaggc gcagagtaac caaagtgttg atgagcagtt 15781 agttgctggt gggctttctt ggtctctttt ctgggtaagc gctttaagcg cattgctaca 15841 tagcggaagt gcagatactt cacgcttttt tccaaaccag tcagtttttc tgcaacttct 15901 ttaccataga ctctagttaa aagctgtttt tgctcttgat gagtacgctc ccaattactc 15961 acgtctctct cgaaatcttt taacacgtaa ctccttgaac ttgtcaggtt agagccatgt 16021 aagcgaaatt gggttagcac ttctgggata gctactatct tagttattag gggagctaag 16081 tagctaatca gactgtctgc gttgcgtgta aacgcttcgt tcatggggaa gatgaagtca 16141 gcgactttgc gtcgaataga aagtcctgaa gctggcggta tattatatgc aaatccacca 16201 ttttccattg caaatggtgc cttccaacca gaagatagct cactaaatat aggcgtcgat 16261 ttaatgattt tgctatcacc atcaatattt attaaattgt cgataacaaa gcctccttta 16321 gcgtcagatt gaaacgcttc gacaactttt tgtaacttgc tgggcatcca tatatcatct 16381 gcatccagta cacaaataat ttcaccgctg ctgtttctgt aagctgtgtt taatgctgag 16441 gctacaccgc catttggctg acgtactagc ttaatccggg agtctttatt tgcatagatc 16501 tcaacaattt cacaagagtt atctttagat ccatcatcac aaataattgc ttcaaaatgg 16561 ggataagtct gagaaagtac actatctaaa gtctccccaa tatacttggc 
gtagttatag 16621 ttagcaatca gtattgaaac tagggggttt tcaggtaatt tgggtagttc aattggttgt 16681 tgcttctcgg tttgaagttc ctgcatttac tcctcggttt tcattatatg ttattgctgt 16741 aacattgaat ttttcaatac atgctatggt taaccacttt atgaactgac agatttcacc 16801 tggaagtgca attatttttg acctgcgact tccttctcga acaaacccaa tgccggtaac 16861 acaacgtact ccatgtcgtg atattgatat tctgctggtc tgtaaaaaac agctttatcc 16921 catttggctt tctaaactca acacatattt agttagattg caccgaaata ttgcggcata 16981 ctagaaagta gtgtttctac atttcccaat gtagcaactt tcgggaagtg agttggtcat 17041 tttccacaga atcttctgtc atactttttc ggcaaattag caaatattcc ggattctttt 17101 atcagaagaa gtactgataa acgtctttca tttcatatca aagtgctgaa atgcttttat 17161 actcatactt ttagccatgt gaaaaattca gaaatttttt ggaaactatg taagttattt 17221 gtacatttgt atttttttcc ggatagagtt gctatgggtt tgaataacta aggcttaaaa 17281 cagtgaacag tgacaactgg taactggtaa ctggtaactg gtaactgata actgaataac 17341 tgaataactg ttttgaaagg aagcaagagg acttttaaag gtaggaagct tgttttgtca 17401 aacctaaaga ggtgatcatg atacaatttg agcgcagtaa aaaattctgg aaaagacttt 17461 ccagcctctg tggaatctta gtactttctc tgatagttgt tgttataact cctacagttt 17521 ttcaaaatat atctgtttct gtgttagggg tttctgaggg aaaatcttta aagaaaggag 17581 agagcaaatg tgttgttgac aaaggttcaa caggggataa gctctttccc aaaaattttg 17641 tgattaatgt gaaaactcaa tacggagcga aaggagatgg tgtcgcagat gatactcagg 17701 caatccagaa ggcaattcaa gagaatgtgg ggacaaatag aattgtttac tttcctaaag 17761 ggacttattt ggtgagcgat cgcctagaat ggcgagataa aaacggcaaa tggcaatcgc 17821 aactatggct gcaaggacag aaccgtgcca ccactattat tcgcttgaag gataatgcac 17881 caggttataa tgaccccaaa aaccctaaag cagttgttta caccgcatca ggactctatg 17941 tagaacagcc caacggagga ggtaaggact atcccgccaa gggagaggga aacgaggcgt 18001 ttgcaaacta catcgaagac ttgacgattg acagtggtaa aaaccgagga gcgatcgcac 18061 tcgattatct agcaaacaac actggtgctg ttcgtaatgt tcgacttcgc ggtcaaggac 18121 tcgtcggact tgatatgact cgacggtgga ttggtccggc gatggtcaaa aacgtcattg 18181 tcgaaggctt tgactatggt gttcgtattg cgagtgaggt caacggtgtg actgttgagc 18241 acatgactct ttgcaatcag aaaactgttg 
gtttagagaa ctcgggcaat gttgtcgcaa 18301 tccgagactt atcgagcaac aatcgtgttc cagccgtgcg taacataaac tcaacgagcc 18361 tgatgaccct tgtggatggg aagctttatg ggggcgatcg ctcaatgagt gcgattgaga 18421 ataacagccg tctgtttgct cgcaatgtcc gctcatccgg ttataagtca gttcttaagc 18481 aagataagaa agacttgaag cagactgtgt taaaggaatt tacctcgaat gctcctttta 18541 ctttgtttac atcttcttta aagacatctc tcaatcttcc gattcaagaa tctccaaact 18601 ttcatgacaa caatccttca aactgggcaa atgttgagga ttttggtgcg agagggaaaa 18661 aagcagatgg aggtgaggat tgggacgatg atacagctgc aattcaaaaa gctcttgact 18721 caggaaagtc aactgtttac tttccacctg gtcgttattt tgtgagcgat actctacgag 18781 tcaaaggaaa tgttcgcaaa atcaccgcga taagtgctac gctttcttct tctgggtcag 18841 cttttgaaaa tgcaaacaag ccaaaaccgt tcattcgtat cgagaatgga aaagcaagtg 18901 atgtcacaat tgaacatctt agtattatta acctccatca aaacgctcca ccacgacttg 18961 gatttattgg ttttgaacaa gcaacttccc gaacactctt tttaaaagat atgacttgct 19021 gctccttaaa acctaacgac cagaagtatg tgtttcggaa cacaccgaaa gccggaaagc 19081 tttttattga agacgtatct gctgaaagtt ggcaatttga gcatcctcaa caggtgtggg 19141 cgcgtcagct taatcctgaa ggtagtagca agaagatatt caataacggc gggaagctgt 19201 gggtattagg tcttaagacc gaaggaggta acgttaacac agtgcttcat accaaaggag 19261 gcggtgcgag tgagttgtta ggggcgctgt tgtatgtgac tgggaatata ccaccaaatg 19321 agattgcttt tatcaatgac aactctcggg ttgcactctc ttataccacg atgagttatg 19381 gggctaagga tttccagatt cacattcaag aaaaacgcaa gtctaataat cgtcaactaa 19441 ctcgcgacaa gctactgtcg aatggtagtg gtcgtgcggt accgctttac atgggaggac 19501 agtgaaggga aaacaaaagt ttccacaatt actcccttct gggcagaaga atacctagtt 19561 tatgcaaaca aagcccgcat tagcgggctt taaatatata tatatctata ggactcgtat 19621 ttgatttttg aaaaaaaatc agtacacttt tattgcttct tccctgttcc ctgttccctg 19681 tctccacgag ggattcagaa atcaaaccgg attcctatag agcaagggta ttaaaaacca 19741 agctctaaca taccaattta ctttcatgac ttatctcttt gagacaaagt tgagagtata 19801 tttctggcaa atactctttc attgcagctg attgaggacc aaatgaaaaa tgtccagcaa 19861 acaaattgtg aataacaatc atcactctag acatctcaat acaatcttga caaagatagc 19921 cttcatcgat 
accaagacat ccctctttga gagaaactct aaagccacct attgcttccc 19981 aaaactctct ttcaaaaaga agagctccaa tactgaatct atgtggtgat gctgagtatt 20041 taaaaggctg ggaggcaata tattgagcca aatcatcaaa tggcaaactc tttgcccaca 20101 accatttcgc tgcctcacca tcatcaaaag ccttgatgtc tctggctgcg tgtttgagtt 20161 cgccaaattt cgctttgtat tcatcatcag cattaatact tttaagaaag ctgatgtatg 20221 agtagccgtt aacattaagc atcggagaac aaaaaccagg gttatataat ccctcatttc 20281 tgacgtataa atacccttct aaaagttttt caaagaagtt ttcagatata aaaatatcct 20341 catctagctt gtaaatccac ttagcttttt gatgtctgag aattgctaga ttctgaacta 20401 gtgagacctt gttaagtttt gtataaagat aagaccagcc ataattagct gccattttat 20461 ccaaaggttc agaatacagc ccagaagaaa ggatacagac atcaacatct gacgggacaa 20521 acttagcaat tctagtaagc gttagaggcc acaaaaaact cttgtagcca gctagcacaa 20581 tcagcaggcg atcagatccc ttacttctat caataaaggt atgagagcca ttaaagttta 20641 gtcgctggac gcagaattta ttgagtttta aacctgctgt gattaataac caaaggtacc 20701 aatcgcgatg atttaggagc ctttttaatc tgacaataaa gctgtctgcc atttttccct 20761 cctaatacat tttatgtcgt taatccaaaa taagcttgcc aaaacttgat ggatactagc 20821 tcttttgagg catttttcca gtgtctacta atgaatgacc atttaataac acctgtcatg 20881 gcaattttca aaaatctgct taataacttg atgccagccc attgattcag ttcatattga 20941 tagaaagtat ttttttcctc tttgaacata agaacttttc tcttggcaaa acctttgagc 21001 cattcttttg aatgaccagg agcagataca accaacgctt ggctatcact tattaaaaaa 21061 cttggtagca gatgaccatt caaagtcaag atactgataa tttttttcca gctgctttct 21121 tctggctttt ggtaaaaatc ctcaacgaga tagttatcac aggacaaggt ttggctttca 21181 taacgtttgc tcagctctaa aatagtagaa tgtaatgtct cagcatcaat gctttgcaga 21241 aaggctggac ctttcataca atcttcaaac ccttttacaa gcattactgc ataactataa 21301 ttaaaaaaga ataattcaat gattaagctt ctgagaatat gcttaatcgc atccatatat 21361 tttaaagagc tatgaatgca agacgtaatc aggctgttac gccagacata gtaaattaac 21421 caagcaggct gtttacttat cgcatatgat ggttcatgcc atacagatat aggtgggaaa 21481 aagattattt tattgccagc atacttctta attcttaaac aaaactctat gtcatctctt 21541 tgaataaaga aaggcatgag taaaccaatt tttttgacaa cttcctgaga acaagagaaa 
21601 aaccaaaacc ctccataatc aaaatcttct tctttcaaaa gagaattgag gaaggtagag 21661 ttttgtaaat caagatttct cttcagagga ttaacagaga atgccttata tccaagagaa 21721 tcatgctttt ctccataaaa agccccagct tcataaagca tgtgtttttt agctaagtct 21781 aaccaaccac ctgctactgc aaaatcatct tttgcatatt catataaagt aaaaagtcga 21841 tagatagatt cgctatttag ctctatatca tcatccataa ataggaagtg tgtgtatttg 21901 tcttcttgta gagcttcata caaacctcta gtaaatccac cacttcctcc taaattccta 21961 ttaggaatca gttgaacatt tgaatgataa aaatcttgtt cttttaaagt tttcccatta 22021 tcaactataa aaattttaaa gttcttacat tttataaaat tatctttgag tagattatct 22081 actgtctttt tgatataagc ttctttctta aatgtgcagg tgataatacc taaagatact 22141 tcccttagag gtttttcatc agttgcaaca tatccttctg tgaaaaaacc ctccttactt 22201 aggcacatga tttctaagta aactctacct gcatcttcat ctagccagga atgaggtagt 22261 aaaatcttta taggttctcc tggctgacag ttctcaaaac tctccttata aatcaattct 22321 ctttcatctt gctcatgatg ttctctatag agagaaacta taaaatgacc ctcaagagct 22381 aaaaggtaat atacagattt cagctctgtg tatttaccat agaacttctc atagaatgaa 22441 tgaaaataag tattaaaaga tacaatagta ttttgatgaa acataacttt tgtttcatct 22501 acacaaggac ttaatgatgt gccctcatca aatttcatgt acaagccaga agtttcaggc 22561 atacgtggaa actgaattct gcttattaca ttcatgtata tttacctcat gttatatata 22621 ctatatttat ctttttgaga aacttaatac ctcattgaca gagaaactgt tcagtgaaca 22681 aagacactat tattgagtat gtagtacttt ttccaatgtc taatttatta gtttttggta 22741 gatgcagaac ccagatttca gcaagttttt aactccctca atgttacttg tcagtcctaa 22801 cagcgtctac tttgctatct ctttctcaaa cacaccaagt gctcgcagca ctgcgtgatc 22861 catatcgtag tacttgtaat ctgcaagtcg tcctgcaaac agtactgaac cgttgagttt 22921 ctctacttct ttaagataaa gttccagacg ttcacggttc tcttcattga gaatgggata 22981 atatgggtca ttttttcctg gtacataagc ttgaggatac tccatcacca aagtggtttt 23041 gggtgaagtt tgtcccgaca aatacttttg ctcagtgatg cgtgtgatgt cgtagtcatt 23101 ggggtagtta actgtaccaa cctcttgata ctgctcttga tctaaagtct cgaattgaaa 23161 gcgcaggcta cggtatggca attcaccata catataatcg aagaaagtat caatcggtcc 23221 agtgcagagt attcggttaa acttaacttc gttgataatc 
tcgcggtaat ctgtgttcag 23281 aagtacttta atattggggt gagccaacat acggcggaac atttcggtat aaccaagctt 23341 gggcatagct tgataagggt cttggaagta acggttatcc cgactgatat aaactggaac 23401 acgtcctgtg actgcccgat ccagttcctc tggtttcact ccccactgct tcatcgtgta 23461 gcggaggaag acattttcat aaatgtattc agctaagaaa gtgaggtcac cactagcatg 23521 ttcacgcagt ttcaaaatcg gtactttgac gccaaaacca aagttttcta gaagcaaatt 23581 ctctagcttt tctgcatatt ttggaggaaa aagagcatag agcgaattga gattaaaagg 23641 gacaggaact ttctttcctt ctactactcc cagcacatgg tgatagtagt gtctccactc 23701 agtaaattga gagagataat cccaaacttt tttagatttg gtgtggaaga tgtggggacc 23761 gtatttatga accaggatgc catgctcgtt gtaataatcg taggcattac cgccaatgtg 23821 atcccgtttc tctacaatca gtactctctg cccaagctca ttagcaactc gctccgctat 23881 gacacaagca gaatatccag cgcctataat tagccaatca aatttcatgt atttttacct 23941 tacttgatgt aatttggata tcgaggctcc atcaataaag aaaattttta acaagaaagt 24001 agtcaggagt caggagccag aattcagaat gaatttgggc tacgtggtag atgacttggg 24061 gggtttaagt cccctactga atcaaaaatt cagtagtctt caattagtgg cgggtcagaa 24121 tcccccacta aaagattgct gactcctgaa ttctgacttc tgacttcttc ttcaatagtt 24181 tctaagttcc aatactattc ggttagaatt atctcaaata ttcaatggtg catcaaaact 24241 tataggcatt ttatatcagc taatacatag // LOCUS NODE_1272_length_24245_cov_5.29611424245 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 24245) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24245) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24245 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 199..1797 /locus_tag="DP116_11380" CDS 199..1797 /locus_tag="DP116_11380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkaline phosphatase" /protein_id="PRJNA477356:DP116_11380" /translation="MKSNFPFKLNRRQFLLTSAITTGGIIATNVLSKSTGFAQAPAII TSEKARPSIPYGVASGDINGNRAVIWSRSDRPARMIVEYATDESFRGAQRVVGPTAID ASDYTARVYLKDLPSDERIFYRVSFRDLSDTKIYSDPVEGSFRTPAAYRQNVFFAWSG DTAGQGWGINPDFGGMKIYETIRNVKPDFFIHSGDNIYADGPIQAEVKLDDGSIWKNI TTPEKSKVAETLAEFRGNYIYNLLDENVRRFNAEVPILAQWDDHETRNNWYPGQTIVD DDRYTVKDVSLLSQRANQAFLEYLPIRINADDPTRIYRSFKHGPNLEIFMLDERSYRG PNTTNRQTEQSAETAFLGSAQVRWLKNQLRKSTATWKVIASDMPIGLVVPDGPTNFEN LANGDGPALGRELELADLLRFIKQNNIKNVVWLTADVHYAAAHYYDPNKAQFQDFKPF WEFVAGPLNSGTFGPNKLDNTFGPEVKFLAIPEDLKANRPPSEGFQFFGTVKIDGYTE VMRVALVNLEGKTLYSVDLPPEKD" gene complement(2061..2585) /locus_tag="DP116_11385" CDS complement(2061..2585) /locus_tag="DP116_11385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454872.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11385" /translation="MGLFDQIIGAIGNSSQQGNIGQIGNILNTVQQLSNNAGTDPSTM QSALGIVGGYVRSALQDKRDTEGPEAAQEVVNQFGDTSPNPQAVDSLFAPFLQQQLAN TVAQRTGLNAGLIQQLLPTLVPLVLNLLKSGANSQNPQTGGNPVLNSFLDADGDGDID ISDAMRMASRYLGR" gene complement(2785..4512) /locus_tag="DP116_11390" /pseudo CDS complement(2785..4512) /locus_tag="DP116_11390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015083604.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="histidine kinase" gene 4712..6271 /locus_tag="DP116_11395" CDS 4712..6271 /locus_tag="DP116_11395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316867.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GTP-binding protein" /protein_id="PRJNA477356:DP116_11395" /translation="MPLSRIITLIVGLIVILGLSLWLIDSFSRLYFQFSYSAPLLGNL LLFLLIILIGVLIAAFVYYVFVLQSGENRQRQRRERPSVQVPAAKSEAASSTLEAVKQ QVAQIQDEVTRQALLSQSQEIETNLTRGEIQVVVFGTGSAGKTSLVNGVMGRVVGKVD AAMGTTQVGETYCLRLKGLERRILITDTPGILEAGVAGTEREQLAREVATDANLLLFV VDNDIRMSEYEPLRALAQIGKRSLLVLNKSDLYTDEDKEAILARLRQRVRGFIAPSDV VAIAANPQSVELENGGIFQPEPDILPLLRRMAAILRAEGEDLVADNILLQSVRLGEQA RKLIDTQRRRQADKIVDRFQWIGAGVVSVTPLPVVDLLATAAVNAQMVVEIGRVYGCE LNMENAKELALSLAKTLASLGIVKGALQLLTTALQLNVATFIVGRAIQGVTAAYLTRI AGKSFIEYFRHDQDWGDGGMTEVVQRQFQLNRRDEFIKIFIQQAIQRIVQPLQGNSEV EEQELNNKSLQ" gene 6560..8374 /locus_tag="DP116_11400" CDS 6560..8374 /locus_tag="DP116_11400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874370.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_11400" /translation="MSNHMIGKVLQGRYQVVQTLSAGVFGETYVAVDIENPENPKCVV KQLKVISSQPSYLQTLRLRFLTETETLRQLGHHKQIPQLISCFEEHERFYLVQEFIEG HALTAELPIDRNLGYLWSESQVINFLQDVLLILDFVHSQGVIHCDIKPENLIRRACDN KLVLIDFGSIQSIDFEIIDEILPLDSIPVSSLGYIPPEQFIGLTQPNSDIYALGMIAI QAMTGVTPLQLKVDPQTNEILWRSPSTPVSDYLATIISQMIRYNYEDRFHSARVALRA LQQMPLETQYSYIVDVDFTVSSEDSEKNQPQTKLNTNPKDSTSPNSSPLLKGMKVGLV ANSLVMGFGTYFIIHNTPSYSEKEALYKATEEYQAGDLDRAIALAKSIPSNSNVYPDA QASIEEWQKQWHNAAEQYKAAEKAFQENRWSDVLRAASQVPDILHWQTKTNKLVQQAQ VNIEAQTKDLLTKAYEKASLKDFSGALDYLSQIPEESSAGAIVQQKLTEYNHKKSVRA AYFLQKAYNKAAEGDFKSAVEFLQQVPKDTPVYATAQEKLVEYTQKQHVVQAKSQKIA SSKAPAFTNMKKPLSNSNYAKNLESFDPSNQMEEVNIR" gene complement(8419..10527) /locus_tag="DP116_11405" CDS complement(8419..10527) /locus_tag="DP116_11405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316865.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="protein kinase" /protein_id="PRJNA477356:DP116_11405" /translation="MLGTILVGRYQIISHLGGGGFGETFVACDTQLPGSPQCVVKKLK PQASDPDTLQTARRLFDTEAKVLYKLGIHDRIPQLLAYFEEKQEFYLVQEFIEGHDLS QEIIPGKPLNQDQVICLLEEILEILDFVHQQQVIHRDINPRNIIRRKLDDKLVLIDFG AVKQITTQVIIPSGKTKFTVAIGTPGYIPSEQAQGNPKFTSDIYALGILGIEALTGLS PEQLEKDAETGEIIWQNQTSVSQDFAKVLNKMVCYDFRERYSSAKLALQGLKDLKKSQ SHMMTLNFAISNNKISNFLKIRSQPKKKNIKKWLALISLIGIGVGSSIYIAHAINSVN ATELYKQANTLYELQRYQDALSTYAKAVNIRPDYAQGWNGQGKTLYELKKFQDALVAY DKAIQIEPDYLEAWSGRGFALNKLQRYQEAIASFEKTLQFQNNYPEVWNAKGEALAKL NEYDQAFKSYDKAIELNKEYYEAWYNKALLLQNLKRYDDAIAAYDKVLEFKPDHERAW YNRGNVLVNLRRYQDAVGAYEKAVQYKPSFYQAWLSKGNILINLQRYPEALESFQQVI KYNRRNYQAWYGSGWSLHQMKRYDDAVSSYNKAIELDRRNYQVWYSRGNSLYNLKKYQ EAISSYNKAINYDYKNYDIWYSKGNALFNLKRYKEAIASYEQVLKIKPDYQPAINARN QAQQIQQDLVPSPESRFKNL" gene 10648..11355 /locus_tag="DP116_11410" CDS 10648..11355 /locus_tag="DP116_11410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316864.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rhomboid family intramembrane serine protease" /protein_id="PRJNA477356:DP116_11410" /translation="MIPISDRIFFFKRRKPIIIYCLIGINIGLFLWELTLELGGTLGN FVNSWGVVPAQISAAFANALAGNPAAWIVVLKGLTSLLVGMFLHGSFSQILGNLIFLW VFGKTIESILGYGRFLVFYLVSGILTGFIQILAEPSLTVPLIGANGAITAILGAYVFK FAKAKIDTILPLLIVFIPIQVPAYFYLFWWFVQQISYGIGSLNIPGGVNPVSVGYLAQ FSGLFIGVAFIKLLQRF" gene 11643..12317 /locus_tag="DP116_11415" CDS 11643..12317 /locus_tag="DP116_11415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015189432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="PRJNA477356:DP116_11415" /translation="MKGNQPKSVIIPRSLDVQAADVIREQILNGTLVPGTRLLEINLA EQFNLSRATIRSALQQLTYEGLVIQLPYKGCTVSGLSSQDAWELYTLRSALESLAARL AAVAITPSQAKELNAALQQLVKAAHKGSWSEVADGDFALHKTIIQLAGHRRLQEQYKI VEQQIRLYIISCNALHPDLDDIIQQHQELVNAICSGDASRAEKIAQDHNTDGKALVEH LQEIEK" gene 12357..13061 /locus_tag="DP116_11420" CDS 12357..13061 /locus_tag="DP116_11420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015189433.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="haloacid dehalogenase type II" /protein_id="PRJNA477356:DP116_11420" /translation="MINLNQYEVLTFDCYGTLIDWEKGVLEALQPVLQSHKIQLSEKE ILEWFARFESSLEQGEYRKYKDVLRGVVQKFGEQFGFTPSSGELNALADSIKNWQPFA DTVEALKRLKQRFKLAIISNVDDDLFAFSAKHLQVEFDEVVTAQQVQSYKPSVQNFQV AIARLAEIGIPSEKILHVACSVYHDIVPANSLGLSTVWVNRRLGQEGSGAALPAQGKP DLEVPDLKSLAAKINF" gene complement(13138..13473) /locus_tag="DP116_11425" CDS complement(13138..13473) /locus_tag="DP116_11425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_11425" /translation="MDTLENYRQIIQKVLSEYAQLPYAYGQLERQLIIDKNANHYLLL TLGWENKQRVHGCLVHIDIINDKIWIQRDGTEYGIANELVNAGIPKAQIVLGFQPADV RKYTEFAVI" gene complement(13461..13877) /locus_tag="DP116_11430" CDS complement(13461..13877) /locus_tag="DP116_11430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015175525.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty-acid synthase" /protein_id="PRJNA477356:DP116_11430" /translation="MPAKDIYHYAVRNALEKESWKITKDPFILKWGTRDLYIDLGAEK LIAAEKSGQKIAVEIKSFVGASPVADLENALGQYILYYDILSRLESDRRLYLAIRQET YSELFTEPIGKILIENQRLCLLVFDSEQEVILQWIP" gene 13992..15035 /locus_tag="DP116_11435" CDS 13992..15035 /locus_tag="DP116_11435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200380.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_11435" /translation="MLKLKRGLSLLATLPLVFSLSACAGKLGGSGGSLLDTIKSRGKL ICGVSGQLPGFSYVKSNGEYAGLDVDICRAIAAAIFNDPNKVEFRNLNSKERFTALQT GEVDILSRNTTWSMSRDTSIGIKFAAVIFYDGQGIMVKKNSGIQKLEDLKGKSICIGT GTTNEQNLTDQMRQRGVNYKPLVFEDANTVFATYEQGRCEGVTADRSQLVSRRTTLSK PDDHIVLDTVLSKEPLAPAVVNGDSKWLDMVRWTIFALVNAEELGVNSQNVSQLANSN NPEVKRLLGTEGNLGKGMGLTNDFVVHIIKNVGNYGEIYERNLGKNSELKLERGPNKL WNQGGILYAPPFR" gene 15289..16206 /locus_tag="DP116_11440" CDS 15289..16206 /locus_tag="DP116_11440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756643.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid ABC transporter permease" /protein_id="PRJNA477356:DP116_11440" /translation="MTPIWRNRKFFPILGQLIAAFIVAIVVMILWHNLIYNLQRLGLQ LGFDFLQFQASFDIGETPIPYKSSDSYSRALLVGLVNSLRVIVFGITLATIIGITVGV ARLSDNWLVRQLALVYVETLRNTPLLLQLFFWYFAVFLSLPKTENQISLLGFININNR GVTLPFGIELSSELSTLILGLTLYTGAFIAEIVRAGILSVAKGQWEAARALGFKPHLI LRLVIFPQALRLIIPPLSSQYLNLAKNSSLAIAIGYPDIYFVASTTFNQTGRSVEVIL LIMVTYLTISLSISLGMNLLNRSVQLKER" gene 16217..17251 /locus_tag="DP116_11445" CDS 16217..17251 /locus_tag="DP116_11445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744805.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amino acid ABC transporter permease" /protein_id="PRJNA477356:DP116_11445" /translation="MSSFKSQKPTILQWLQKNLFNNWYNSILTVVCLWLLFFGIKGIL TWVLTQAKWQVITANLSLFFVGRFPQQLYWRLWLALFIILGLVGLSWGTFTKRLPHRM NSWLPLGWALSFPIILWLIGGGFGLQPVESNLWNGLLLTLVMAVISIVLSFPLGVLLA LGRQSQLPVVRWFSILYIEIIRGLPLIGVLFFAQVMLSLFLPVEYRLDRVLRGIAGLI FFSAAYLAENVRGGLQSVPRGQIEAAKVLGFNAPLTVLLIVLPQALRAVIPALVGQFI GLFKDTSLLAIVGLLELTGISRAILAQPQFIARYAEVYLFIAFIYWIFCYSMSLASRR LEKELGVGQR" gene 17346..18086 /locus_tag="DP116_11450" CDS 17346..18086 /locus_tag="DP116_11450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_11450" /translation="MNQQKPIIIAQDIHKWYGKFHVLQGVSLTVNRGEVVVLMGPSGS GKSTFIRTFNALEEYQEGRIEIDGIDLTNDLKNIEAIRREVGMVFQQFNLFPHLTVLQ NITLAPTWVRKWTKVKAEETAMQLLERVGILEQAQKYPGQLSGGQQQRVAIARALAMQ PKVILFDEPTSALDPEMVREVLDVIKTLAADGMTMVVVTHEVGFAREAADRVILMDSG TIVEEATPSIFFQNPKHDRTRKFLSQIL" gene complement(18186..18689) /locus_tag="DP116_11455" CDS complement(18186..18689) /locus_tag="DP116_11455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859217.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty acid hydroxylase" /protein_id="PRJNA477356:DP116_11455" /translation="MFEAIACAWILLVIGDFLSTFCYHVPEHVFGMLHLRTHHSYKKS FRHYAILTFNSQVLLDGILGALPYLLVAAVLWFFSPIGVVCGLLFGQFHVWWRHTTTL GWQTPKFIEILCRILFITTPERHWEHHQKTNQGYGDIFTFFEQPAKGWLRLLRLLRLR FSHLLVS" gene 19255..19893 /locus_tag="DP116_11460" CDS 19255..19893 /locus_tag="DP116_11460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314422.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipase" /protein_id="PRJNA477356:DP116_11460" /translation="MSKYRPQIRICFLGESFVNGTGDPEFLGWTGRICVDAYRKGYDI TFYNLGVRGETSTELRQRWLKEVSYRLPKQYNGRVVFSFGVNDTTLINSKPRVELTES IENVRSILSTAKQLYPVLMVGPPPCADEEQDKRNQRIANLSKQFTLVCDELDIPYLDI FPILEKSNIWQDEARANDGAHPRSAGYAEFAEIVQSWNAWLNWFPPFSSSDF" gene complement(20049..20285) /locus_tag="DP116_11465" CDS complement(20049..20285) /locus_tag="DP116_11465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015177033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="stress-induced protein" /protein_id="PRJNA477356:DP116_11465" /translation="MADTSKRGFASMDEDKQREIASLGGQAAHEKGTAHEFTSEEAKE AGRKGGETVSQDREHMSEIGREGGKNSHKGAKDE" gene 20797..22113 /locus_tag="DP116_11470" CDS 20797..22113 /locus_tag="DP116_11470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208344.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tyrosine transporter" /protein_id="PRJNA477356:DP116_11470" /translation="MKAVLNSDTPQVTRLFSNLELYGNKLNHQPGSVLGSTALVAGTT VGAGILALPAVTLPSGVVPSTVLLVAVWLYTLVSGLLIAEVTLNTMRLVGSSSSGLLV MVERTLGKPSARVAGGAYLFLHYALLVAYVSQGGEILVSAVEKVLGVQNNLPASVGTT AFTLLFGGIMYLGRERFIEKLNSAFVAIVLASFVGLLVLGATQLKTSSFSFQNWSALP AAVSVMFVALFYSNVVPTVVTQLEGDVRKIRQSIFIGSAIPLIMFLAWNAVILGSVST DMLQGSSGGGTVFDPLQILREGGAGEWLGVLVSVFSEFAIVTSFIGFVYGLLDFFKDT SAPNEPSKRLPLFSLILFPPMSLGALNPSIFLAALDYAGTFSISVLGGIIPALMIWKQ RQEQQQSNSMSQPLVPGGKVTLIAMIGVALVVIAKQVVSIWGSFGY" gene 22502..22885 /locus_tag="DP116_11475" CDS 22502..22885 /locus_tag="DP116_11475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Nif11-like leader peptide family natural product precursor" /protein_id="PRJNA477356:DP116_11475" /translation="MSLEDVQAFYQRLGTDEAFRTQIQGVNSKDECSQMVKSAGYDFT QEELEEYTAELLELSADEDGLADLDEKELATVFGGIVAQPLYGGIIYEPPTDWSPIKP PIKWPPKKWPPIDPQPLYGIVVSPE" gene 22981..>24245 /locus_tag="DP116_11480" CDS 22981..>24245 /locus_tag="DP116_11480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nif11-class peptide radical SAM maturase 3" /protein_id="PRJNA477356:DP116_11480" /translation="MTYRRVSYAVWEITLKCNLACQHCGSRAGHTRAKELSTEEALDL VKQMAEVGITEVTIIGGEAFLRPDWLEIASAITDAGMRCGMTTGGYGITLDTARRMKE AGIKVVSVSVDGLEATHDRLRGRKGSWQWAFKTMSHLKEAGIIFGCNTQINRLSAPEF PQIYERIRDAGIFAWQIQLTVPMGNAADNSDILLQPYELLDVYPMIARVAQRAQEEGV QVQPGNNIGYYGPYERLLRGGHAWAFWQGCNAGLATLGIEADGAIKGCPSLPTSAYTG GNIRDHSLRTIIEETEELRFNLGADTPKGTEHLWGFCKSCEFAQLCRGGCSWTAHVFF DKRGNNPYCHHRAITQVNRGVRERVFLKHQAEGNPFDNGEFALVEEAIDAPLPANDPL QFSGDRIQWSKSWQQEPLSNSKSRSVASVA" BASE COUNT 6868 a 4800 c 5215 g 7362 t ORIGIN 1 ttgctttcag ttagattttg aagttcatag ccaagagaat ttgtaaactt gggagcaaag 61 cgtattcaac agttatcagt tatcagtgat tagctcaaga aagctgcttc tgcttgataa 121 ctgttcactg ttcactgttt actgttcact gttcactgat tcaattccac atagcagttt 181 taaccgaccc ttaattttat gaaatctaac tttccattca agctaaaccg tcgccagttc 241 ttgctaactt cagccatcac cactggtgga attattgcga caaacgtttt atcaaaatca 301 acaggttttg ctcaagcacc tgctatcatc acatctgaga aagcacgtcc ctccatacct 361 tatggagtgg ctagtggtga catcaatggg aatagagctg ttatttggag tcggagcgat 421 cgcccagccc gaatgattgt agaatatgca actgatgaat cttttcgtgg cgcacagcgt 481 gttgttggac cgacagcaat agatgccagc gactatacag cacgggttta tctcaaagat 541 ttaccatcag atgagcggat attctaccgg gtgagtttcc gggatttatc tgatacaaaa 601 atttacagtg atcctgtgga gggtagtttc cgaactcctg ctgcatatcg gcaaaacgta 661 ttctttgctt ggtcaggtga tactgctggt caaggatggg gtattaatcc ggattttggt 721 
ggaatgaaga tttacgaaac catccggaat gtcaagccag actttttcat tcattctggt 781 gataatatct atgccgatgg tccgattcaa gcggaagtca agctagatga cggtagtatt 841 tggaaaaata tcaccacacc agaaaaatca aaagttgctg aaaccctagc agaatttcgt 901 ggtaactata tatataactt attggatgaa aacgtccgtc gctttaatgc tgaagtaccc 961 atactagcgc aatgggacga ccacgagact cgtaataact ggtatcctgg acaaacgatt 1021 gtagatgatg accgctacac agtcaaggac gtatctttac tgtcacagcg ggcaaatcaa 1081 gcatttttag aatacttacc gattcggatt aatgctgatg atccaaccag aatttaccgg 1141 tcttttaagc atggtcctaa cttggaaatt ttcatgctag atgaacgcag ttaccgggga 1201 ccaaacacaa ctaatcgtca gacagaacaa agtgcagaaa cggcattttt aggaagtgcg 1261 caggtgcgtt ggttgaaaaa ccaattgcgg aaatcaacag caacatggaa agtcattgca 1321 agtgatatgc ctattggact agtcgttcca gatggtccca ccaactttga aaacttggct 1381 aacggagatg gtccagcctt aggacgggaa ctagaacttg cggatttact acggttcatc 1441 aaacaaaaca atatcaagaa tgttgtctgg ttaactgcgg acgtacatta cgcagccgca 1501 cactattacg accccaacaa agcacagttt caagacttca aacccttctg ggagtttgtt 1561 gcaggtcctc tcaactctgg cacatttgga ccaaacaaac tcgacaatac ctttggtcca 1621 gaggtgaagt tcctagccat acctgaagat ttgaaagcaa atcgaccccc aagtgaaggt 1681 ttccaattct ttggtaccgt caaaattgat ggttatacag aggtgatgag agtcgcattg 1741 gtgaacttgg aaggcaaaac tctctacagt gtagatttac caccagaaaa agactgaaat 1801 aaacatttac tgtagggtgg gcataagtaa tcccacttta cttattagca atgagaaatt 1861 agcatttctc tactatatca agttcgccta attacttaca ataaaaagac ctcaccccca 1921 cccctaatcc ccacttcgtg gggaccggtg agttcccctc tccttaataa ggagaggggt 1981 gcccgatagc gtagcgtgcc gttaggcata gggcggggtg aggttcttca ttttttataa 2041 gtgttcatcc gaacatgata ttacctaccc agataccgac tagccattcg cattgcatca 2101 gaaatatcaa tatcgccatc gccgtctgca tcaagaaaag aattcagcac tggatttcct 2161 cctgtttggg gattttggga atttgcacca gatttcaaca aattcagaac caaaggaacc 2221 aaagttggta gtaattgttg aatcaaccct gcatttaacc ctgtgcgctg ggcaacagta 2281 ttagccaact gctgttgcaa aaaaggagcg aaaagagagt ctacagcttg aggattaggg 2341 gaagtatcgc caaactgatt tacgacctct tgcgcggctt caggaccttc tgtatctcgt 2401 ttgtcttgta 
aagcagaacg cacataacca ccaacaatcc ctaaagcgga ttgcattgta 2461 gaggggtctg taccagcatt attacttaat tgctgcacgg tgttgaggat attcccaatc 2521 tgaccgatat tcccttgctg actagaattg ccaattgcac cgataatttg gtcaaaaagt 2581 cccataagct ttttctccct gtgtactcat gccaacaaaa ccaaattctg acacaccagc 2641 gacgacttgt tccaaagtat tcagtaaaaa ataagctttt tctatttttt ctatttgaag 2701 acgcttcttt tgttattaag cttaacaagc actgtcagca gaagtcggaa cgtgaagagt 2761 aagatataac tttattttta agtattagct tctatcaata ggtagaaaaa tggtgaactc 2821 tgtaccttgg ttcaactcgg acttaaaagt catttcccca acatgctttt caacaataat 2881 ctgatagcta attgataaac ccaatccagt accaagacct cgcggtttgg tcgtaaaaaa 2941 catttcaaat attttatcct ggttttgggg tgcaatacct gagccattgt ctgcaattgt 3001 tatcttcacc caattgttgt cctggcgttc ggtacttacg gtaattgtag gagaaaattc 3061 cgaagtgaca ggagactttt ccttcacagc atcgatcgca ttactcaaaa gattcatgaa 3121 cacctgatac agtagtcctg tatagcctgg aatgggtgga atatcaccat agtgacaaat 3181 aacgttgatg ccggttttta agcggttttg tagaattaac agcgtactgt tgagacatat 3241 gtgtacatca actagctgaa cttctctttc atccagacgg caaaagtctt tcaaactctg 3301 ggacatttcg cggatgcgtt cagccccaaa ctctactgaa tttaggagat ttggcaaatc 3361 agctttgaga aactctaaat caatttcctc tgctaatact tgtacagcgg tgggggagtt 3421 aggaacttca gcttcgtatg tttgcagcag cgctaggaga tcatcaatgt agttttttgc 3481 gtgtacaagg ttaccagaaa taaaattgac gggattgatg atttcatgag cgacgccagc 3541 caacatcttg cctaaactgc tcattttttc actctgaagt agtcgcgctt gagtttcggc 3601 tttttgttct tctaaaagtt gcttgacttg ttgaatgagt tggttgagag aggtggctaa 3661 ttcgcccact tcatcttttg tcgtgacggg tgcttgtaaa tcaaaattgg catcttgggt 3721 cacacgttga gcaatgttcg ttactgcttt taggggacga gcgataagcc ggaggcttga 3781 cgcttcgcgt atcgcacgac ttgtatataa agcaaaaatc gtcgccagta ctgttgacag 3841 caacatacta cagataataa ttagcccttt tagtacatag gtatcatttg aaacgacatc 3901 agctttgtcc tgacgctggc gaactaatat gacaaagttg tttaactcgt gggtgaattg 3961 ataaaacttc agtgttgctt gagtttgatg aaattgccaa atcagttttc ttgcttttga 4021 aactctgtct gaattggagt taagagagga aatttgctgt agcaggactc gtagttgctg 4081 aatatattct actagtgtgt 
tgtcatactt gttgagcaaa gcttgcaaat ccccttgctc 4141 gttggtttgg ctaaattttc gcaactcatt cagcaacgtt tcagtgtcct tgagatgtgc 4201 cttgatatca gaatgtttct gttgtaaagc tttttgctct aaaaaaaaca ctgtttcttg 4261 ctgatgagtg tcaatttcca acaacaaccc ttgtaatcta ttcaataaac atccttcctg 4321 atctgccatt ctcatctgtt gtttggcatg ttggaagtag cgatcgccta tcaccaatcc 4381 cactgttgtt cccaaaacag cagtcccaaa ggaaagaaca tacccacaga taattttttg 4441 tcgaatacta agttggtgga acactcgtta tatcccagtc aaagtcatac aaaatttagc 4501 tgcccaagtc atgctgcaac ttgctgctca acttttctaa agaaagcttt tccttattaa 4561 aaaggaattc tatcaaactt atttacgcta aattttatac ataatcggtt atttcttcaa 4621 caaaataact ttctcaacac tgtggcgaaa aagactgaaa ccattaacat gagaagcatc 4681 tgagtacgta tccacgcttt tcttcccaac catgcctctg tcacgtatta tcacgctgat 4741 tgttggtctt atcgtcattc taggactaag tctatggctg attgattctt tcagtcggct 4801 ttacttccaa ttctcctatt ctgcgccgtt gctgggtaat ctgctattat ttttgctgat 4861 tatccttatt ggagttttaa tcgcggcgtt tgtctattat gtgttcgtgc ttcaatcggg 4921 cgagaatcgt cagcgccagc gcagagaacg cccaagtgta caagttcctg ctgcaaaatc 4981 tgaggctgcg tcttctactt tagaagctgt caagcaacaa gtcgcgcaaa ttcaagatga 5041 agttacacgc caagctttac tgagtcagtc gcaggaaatt gaaaccaatt tgactcgtgg 5101 tgagattcaa gttgtcgtgt ttggtactgg gagtgctggt aaaacctccc tggtgaatgg 5161 ggtgatggga cgtgtggtgg gtaaggttga tgcagcaatg ggaacgactc aggttggaga 5221 aacttattgt ctgaggttga aaggattaga acgcaggatt ttaattacgg atacaccagg 5281 tattttagaa gcaggggtgg cgggaaccga gagggaacaa ctcgcacgag aggtggcgac 5341 ggacgcaaac ttactgttgt ttgtggtgga taatgacata cggatgtccg aatatgagcc 5401 gttacgagca ttggctcaaa ttggtaaacg ttctttactt gtcctcaata aaagtgattt 5461 gtatacagac gaggacaaag aagctatttt ggcgaggttg cgtcagcgag tgcggggatt 5521 tattgcaccg agtgatgtgg tggcgatcgc cgcaaatccc caatctgtag aattagaaaa 5581 tgggggaatt ttccaacctg aacctgatat cttaccctta cttcggcgaa tggcagcgat 5641 tttacgggcg gagggtgaag acttggtagc agataacatt ctcctacaat ccgtacgatt 5701 aggcgagcaa gcgcgaaaac tcatcgatac tcagcgtcgc cgtcaagctg acaaaatcgt 5761 ggatcggttt cagtggattg gtgctggtgt 
ggtttcggtg acgcctctac cagtcgttga 5821 tttgctagca acagccgctg tcaatgctca aatggtggta gaaattggca gagtctacgg 5881 ctgtgaatta aatatggaga atgcaaaaga attagcgctg tctttggcaa aaactcttgc 5941 tagtttgggt atagttaagg gagcattgca gttacttact accgccttgc agcttaatgt 6001 cgctactttt attgttggta gggcaattca aggcgtgaca gccgcttatt tgacgcggat 6061 tgctggcaag agttttattg agtattttcg tcatgatcaa gattggggtg atgggggaat 6121 gacggaagtt gtccagcgac agtttcaact caatcgccgc gatgagttta ttaagatttt 6181 tatccagcag gcaatacaac ggatcgtgca gccgttgcaa ggaaattctg aagtagagga 6241 acaggaactt aacaataaat ctttacagta ggaatactgc ttagaccaga tacatcaact 6301 ggcacacaga gcagactagt aatattgtca gtgcttgaca gcagtggcgc tattccatat 6361 tttaagctag tctgtgataa agtgttttta taaatacatt tttaagcctt gattacagta 6421 gtaacactta gaaaaatcca actcctattt aaccactcag cgctttattc agcgctcctt 6481 tcatttaatt taatcagggt tccggaaata gggagcaatg ttagaaaacc tatataagca 6541 gtgactaagt aactagctca tgagtaacca catgatcggt aaagtactac aaggacgtta 6601 ccaagtcgtc caaaccctaa gtgcaggcgt gtttggcgaa acatacgtcg ctgtagacat 6661 tgagaatcca gagaatccca aatgcgttgt taaacagctt aaggttatca gttcccaacc 6721 aagctactta caaactctga ggttaagatt tcttacagaa accgaaacac tgagacaatt 6781 aggacaccat aagcaaattc ctcaattgat atcctgcttt gaagaacatg aacggttcta 6841 cttagtgcaa gagtttatag aaggacatgc actcacagca gaactaccaa ttgatcgcaa 6901 tttaggttat ttgtggagtg aaagtcaagt tatcaacttt ttacaagatg ttctgttaat 6961 tttagacttt gttcactctc aaggtgttat tcactgtgat ataaagccag aaaacttaat 7021 cagacgtgct tgtgataaca agttagttct cattgatttt ggttcaattc aatcaatcga 7081 ttttgaaatc attgatgaga tattaccttt agatagcatt cccgtcagtt cactgggtta 7141 tataccacca gagcaattta ttggtctaac acaacctaat agcgatattt atgctttagg 7201 aatgattgcc attcaggcaa tgaccggggt gacaccgctg caactcaaag tagatcccca 7261 aactaatgag attctctggc gttctccaag cacaccagtg agcgattatt tggctaccat 7321 tatcagccaa atgattcgtt acaactacga agaccgattc cactcagctc gtgttgcatt 7381 acgggcgctt cagcaaatgc cgttggaaac tcaatactcg tacattgtag atgtcgattt 7441 taccgtttct tcagaggact ctgaaaaaaa tcaaccacaa 
actaagttaa atactaaccc 7501 taaagattca acatctccaa attcatctcc actcttgaaa ggaatgaagg tggggttggt 7561 tgctaattct ttggtcatgg ggtttggaac ttattttatc atacataaca ctccaagtta 7621 ttcagaaaaa gaagcactat acaaagcaac tgaagaatat caagcaggag atttggatcg 7681 agcaattgct ctagccaaat caattccctc gaatagtaat gtctatcctg acgcacaagc 7741 cagtattgaa gaatggcaaa agcaatggca taatgctgct gaacaataca aagccgcaga 7801 aaaagccttt caagagaatc gttggtcaga tgtattgcgt gcagcctctc aagttcctga 7861 tatattacat tggcaaacaa aaacaaataa attagttcaa caagcacaag ttaatattga 7921 ggcacaaacg aaagatttat taaccaaggc ttatgaaaaa gcatcattaa aagatttcag 7981 cggcgcttta gattatctga gtcaaattcc tgaagaaagc tctgcagggg caattgttca 8041 acaaaaattg actgagtaca accacaaaaa gagtgttaga gcagcttact tcttacaaaa 8101 agcttacaac aaagcagcag agggagactt caaaagtgct gttgaatttc tccaacaagt 8161 tcctaaagat actcctgtat atgccacagc tcaagagaaa cttgtcgagt acactcaaaa 8221 acaacacgtt gtgcaagcta aaagtcaaaa aatagcttca tcaaaagcac cagcttttac 8281 gaatatgaaa aaaccattga gcaatagcaa ttatgctaag aacctagaat cctttgatcc 8341 aagtaaccaa atggaagagg tgaatataag atagtcaagg tcaagagtca agagtcaaaa 8401 attaaagaac actttggact atagattttt gaatctagac tcaggacttg gaacgagatc 8461 ctgttgtatt tgttgtgctt gattgcgggc gtttattgct ggttgataat ccggcttgat 8521 tttaagaacc tgctcgtatg aagcgatcgc ttctttataa cgtttcagat taaacaaagc 8581 attgccctta ctataccaaa tatcgtaatt tttataatcg taattaattg ctttattata 8641 agatgaaatt gcctcttggt atttctttaa attgtagagt gaatttcctc gactatacca 8701 tacttgataa tttctacgat ctaattcaat tgccttatta taagatgaca ctgcatcgtc 8761 atagcgtttc atttgatgta gtgaccaccc gctaccatac cacgcttgat aatttctacg 8821 attatattta ataacttgct gaaaagattc aagggcttct ggataacgtt gcaagttgat 8881 aagtatatta cctttagata accaggcttg gtagaaactg ggcttatatt ggacggcttt 8941 ctcatacgca ccaacagcat cttgatagcg tcgcaaattc actaaaacgt tgcctctgtt 9001 ataccaagct ctttcatggt ctggtttgaa ttcaagcact ttatcataag ctgctattgc 9061 gtcatcgtaa cgttttaaat tttgtagcaa caatgcttta ttgtaccaag cttcataata 9121 ttctttattc aactcaatgg ctttgtcata agatttaaat gcttggtcat 
actcatttaa 9181 tttagctaaa gcttctccct tagcattcca aacttccgga tagttatttt ggaattgcaa 9241 tgttttttca aaagaggcga tcgcttcttg gtatctttgc aacttattca atgcaaatcc 9301 acgcccactc caagcttcta aataatctgg ttctatttga attgctttgt cataagctac 9361 aagtgcgtct tgaaattttt tcaactcata cagagtttta ccttgtccat tccatccttg 9421 agcgtaatct ggtctgatgt taaccgcttt tgcatatgtt gataaagcat cttgataacg 9481 ttgtaactca tataaggtat tggcttgctt atacaattca gtagcattga cagaattgat 9541 cgcatgagca atatatatag atgatcctac acctatccct attaaactta ttagcgctaa 9601 ccacttttta atgttctttt ttttaggctg acttctgatt tttaaaaaat tgcttatctt 9661 attgtttgat atggcaaaat tcaatgtcat catgtgagat tgagattttt ttaaatcctt 9721 caaaccttgt agggcaagtt ttgcagaaga ataacgttcc cggaagtcat aacacaccat 9781 tttatttaaa actttggcaa aatcctgact gactgaggtc tggttttgcc agataatctc 9841 accagtttct gcatcttttt ctagttgttc aggagataat ccagtcagtg cttctatgcc 9901 aagtatccct aaagcataga tatcactggt gaattttggg tttccctgtg cttgttcgct 9961 gggaatatac cctggagtac cgatcgcaac ggtaaatttt gttttcccac taggaataat 10021 aacttgggta gtgatttgtt tgacagcgcc aaagtcaatc aaaactaact tgtcatccag 10081 cttgcgtcta atgatatttc gtgggttgat gtcacggtga ataacttgct gctgatggac 10141 aaagtctaaa atctctaaaa tctcttctaa aagacaaata acttggtctt gattaagcgg 10201 ttttcctggt atgatttctt gactgaggtc atgaccttct ataaattcct gtacgagata 10261 gaattcttgc ttctcctcaa agtaagctag gagttgagga atgcggtcat gaatacctaa 10321 tttatacaga actttcgctt ctgtatcaaa taagcgtctg gctgtttgca aagtatcagg 10381 gtcactagct tggggcttga gttttttgac gacacattga ggagaaccag gtaattgagt 10441 atcacaagca acaaaagttt caccaaatcc cccacctccc aagtgactaa taatttggta 10501 tcgtccaaca agtatggttc ccaacatttg gttagcctag tttcatgtag tacagtctga 10561 ttccaatctt gccacagctt ctcttaaggg tcgataaaag gataagccag ataaagagaa 10621 taactatgta actaatgact tattgatatg attcctatta gcgatcgcat tttctttttc 10681 aaacggagaa agccaataat tatttactgt cttattggca ttaacattgg tctattttta 10741 tgggaactga cacttgagct tggtggtaca ttgggcaatt ttgtcaatag ctggggtgtc 10801 gttcccgcac aaattagtgc agcttttgca aatgcacttg 
ctggaaaccc agctgcttgg 10861 atagttgtgc tgaagggttt gacttcactg cttgtgggaa tgttccttca tggcagtttt 10921 agccaaattt tagggaattt gatatttttg tgggtttttg gcaaaacgat tgaaagtatt 10981 ttgggatacg ggcgattttt agtattttac ttagtttctg gcattcttac tggattcata 11041 caaattctag ctgaaccgag tttgacagtg ccattgattg gagctaacgg ggcgattaca 11101 gctattttag gagcttatgt ctttaagttc gctaaagcaa aaattgacac tattctgccc 11161 cttttgattg tgtttattcc catacaagta ccagcctatt tttatttatt ttggtggttt 11221 gtgcaacaga tttcttacgg aattgggagt ttgaatattc ctggtggcgt aaaccctgta 11281 agtgttggtt atttggcgca attttcgggg ttatttattg gtgttgcgtt tattaagcta 11341 ttgcagagat tttagtaaca aggggatcat ctctattttg ccaaaacatg tttgtctgta 11401 gcgatcgtac gaaatgaagc gtaagcgcaa tgacaagtca tcatctgagt ttgatatcac 11461 ttacacattt gggatactcc cgttataagg acttccataa aataagcttg agtggaattg 11521 tcgattgttg acaatctaca atttgaggtt tagagtcaga gtatcaatag tttgcaatca 11581 aaccagcagt cacagggctt gtaactgtgg ctcaatttat tcgcatttct acaagaatcg 11641 aaatgaaagg gaatcaaccc aagtctgtca tcatacctcg aagcttagat gttcaagcag 11701 ctgatgtcat tcgagaacaa attctcaatg gtacgcttgt accggggact cgtttgctag 11761 aaatcaatct ggcagagcaa tttaatctga gtcgtgcaac tatacgctca gcattgcaac 11821 agttaaccta tgaaggcttg gttatacagc tcccctacaa aggttgtact gtttcgggat 11881 tgagttccca ggatgcttgg gagctttata cgctacggag tgcacttgag agtttggcgg 11941 cgagactagc agctgttgca ataacaccga gtcaggcaaa agaactcaat gctgccttgc 12001 agcaattggt caaagcagct cacaaaggca gttggagcga agtggctgat ggagactttg 12061 ctctacataa aacaattatt caacttgctg gtcatcggcg gttacaggag cagtacaaga 12121 ttgttgaaca gcaaattcgt ctttacatca tttcctgcaa tgcgctgcac ccagatttgg 12181 atgacatcat acagcagcat caagaattgg taaacgcaat ttgctcaggc gatgcgtcca 12241 gagcagagaa aatcgctcaa gaccataata cagacggtaa agcactggta gaacacctgc 12301 aagaaataga aaaataaaat tcagctgaca aagcattgag ctagggagca taaaccatga 12361 ttaacctgaa tcaatatgaa gtcttaacct ttgactgcta tggaactttg attgattggg 12421 aaaaaggtgt gctggaggcg ttgcaaccag ttctgcagtc ccataaaatt caactgagtg 12481 agaaggaaat ccttgagtgg 
ttcgctcgtt ttgaatctag tcttgagcaa ggagagtacc 12541 gcaagtacaa ggatgttctt aggggggttg tgcaaaaatt tggagaacaa tttggattta 12601 cgccttcttc tggggagtta aatgcacttg ctgattctat caagaattgg caaccttttg 12661 ctgatactgt tgaagcactc aagaggttga aacagcgatt caaactagca attatttcta 12721 atgttgatga tgatttgttt gctttcagtg caaaacattt gcaagttgaa tttgatgagg 12781 ttgtgactgc acagcaggtg caaagctata aaccttctgt gcaaaatttt caagtggcga 12841 tcgcccggct cgcagaaatc ggcattccct ctgaaaaaat tctacacgtc gcttgtagtg 12901 tttaccacga cattgttccg gcaaattccc ttgggttatc gacagtttgg gtgaatcgta 12961 gattaggtca agaaggttca ggagctgctt tacctgctca aggaaaacca gatttggaag 13021 taccggattt gaaaagtttg gcggcaaaaa ttaatttttg aagtaccatc atcagtttca 13081 cgtgttcgca ttcgtccagg ttgacgaatg cgtcataaag attgctaaat ttaggagcta 13141 aataaccgca aattccgtat actttctcac atcggctggt tggaaaccca gaacaatttg 13201 agctttggga attccggcgt tgacaagttc gttagcaata ccatattctg taccgtccct 13261 ttgaatccag attttatcgt tgataatatc aatatgaact aaacaaccat gcacccgttg 13321 tttgttttcc cagcctagtg tcagcagcag ataatgattg gcatttttat caatgattag 13381 ttgtctttct aattgtccgt aagcgtaagg aagttgggcg tattcgctta atactttttg 13441 gatgatttgg cgatagtttt ctaaggtatc cattgtagaa tgacctcttg ttctgaatca 13501 aagacgagta aacaaagacg ctgattttct atcaatattt taccaatcgg ttcagtaaac 13561 agttctgagt atgtttcttg acgaatggca aggtataagc ggcgatctga ttctaggcgg 13621 ctgaggatat catagtataa aatatactgt cccaaagcat tttctaagtc agctactgga 13681 gaagctccaa caaagctttt tatctccact gctattttct gtccgctttt ttctgcagct 13741 atcagctttt ctgcacctaa atcaatatat aaatctcttg taccccattt taagataaat 13801 gggtctttgg taatcttcca actttctttt tctagagcgt ttctcaccgc ataatggtaa 13861 atatctttag ctggcataga gagcgtacgt agcttgccat aacattttta gtataaactg 13921 ctcacctctg caaccatttg ggaaacaata aagtatctta gtctttattg agcagaggaa 13981 aaaattggta aatgttgaag ctcaaacggg ggttatcact gctagctacg ttacctcttg 14041 tcttttcact gagtgcttgt gctggaaagc ttggaggaag tggcggaagt ctgttggata 14101 caatcaaaag ccgtggtaag ctcatctgtg gtgtcagcgg acaattaccg ggatttagct 14161 
atgttaagtc taacggtgaa tatgcagggc tggatgtgga tatttgtcgg gcgatcgccg 14221 cagcaatttt taatgacccc aacaaagtag agttccgcaa cttaaactcc aaagaacgtt 14281 ttacagcctt gcaaacgggc gaagtagata tcctcagccg aaataccact tggagtatga 14341 gccgagatac ttctatcggt ataaaatttg ctgccgttat attttacgac ggtcaaggca 14401 taatggtgaa aaaaaatagt ggtattcaga agctagaaga cctcaagggc aaatcaatct 14461 gtattgggac aggaaccacc aacgagcaaa acctgacaga ccaaatgcgg caacgtggtg 14521 ttaactataa acctttagtc tttgaagatg ctaacacggt ttttgccacc tacgagcaag 14581 gtcgctgcga aggtgtcact gcagatcgtt ctcaactcgt ttcccgtcgc accactctat 14641 ctaaacctga tgaccatatt gttttagata ctgtgctgtc aaaagaacct ctcgcacctg 14701 ctgtagtcaa tggagattct aagtggttgg atatggtgag atggacaatt tttgctcttg 14761 tcaatgctga agaactggga gtcaactccc aaaatgttag tcaattagca aacagtaata 14821 acccagaagt gaagcgcttg ctaggtacag aaggtaacct cggtaaaggc atgggtttaa 14881 caaacgactt cgtcgttcat attatcaaaa atgtcggcaa ttatggtgaa atttacgagc 14941 gtaatttagg gaaaaactct gagttaaaac tagaacgtgg tccaaacaaa ctttggaatc 15001 aaggtggtat tctttacgct ccccccttcc gataaacaag ttatcagtta tcagttatca 15061 gttaccagtt atcagttacc agttatcagt tatcagttat caggtaggaa acggactcgt 15121 ccaccccttg tttactgcga tgcactgagc gtggtcgttg tgttcactgt ttactgttca 15181 ctgttcactg ttccctgtta agagttccct gttcactgtt cactgttccc tgttaagagt 15241 tccctgttaa gagttccctg ttcactgttc actgttcact gtttcactat gactccaatt 15301 tggcgcaacc gaaagttttt tcctattctc ggtcagttaa tagctgcatt catagttgcc 15361 atcgtcgtga tgatactatg gcacaacctg atatacaatc tccagcggct aggtctccaa 15421 ctaggatttg attttctaca attccaagca tccttcgata ttggtgaaac gcccattccc 15481 tataaatctt ctgactccta cagtcgtgct ttattagttg gattggtgaa ctccctgcga 15541 gttatagttt ttggcatcac tttggcaaca attattggca ttaccgtggg agtggcgcgg 15601 ctatcagata attggttagt acgtcagcta gctctagtgt atgtcgaaac cttacgcaac 15661 acacccttac ttctgcaatt gttcttttgg tactttgcgg ttttcctcag tttgccaaaa 15721 acagaaaatc aaatttccct acttgggttt ataaatatta acaatcgagg agttaccctt 15781 ccttttggaa tagaactctc ttctgaatta tcaacactca ttttaggact 
aacgctgtac 15841 actggtgcct tcatcgccga aattgttaga gctggaattt tatcagtcgc taaaggacaa 15901 tgggaagcag cacgggcgtt aggttttaag ccgcatctca ttttacggct agtgattttt 15961 ccccaagctt tgcggctgat tattccacca ttgagcagtc agtatcttaa cttagcgaag 16021 aattctagtt tagcaattgc gatcggttat cctgatattt actttgtcgc ctccacaact 16081 tttaaccaaa caggtcgatc tgtagaagtt attctgttaa ttatggttac ctatctgacc 16141 attagcctga gcatctcttt agggatgaat ttgttaaacc gtagcgtgca actcaaggaa 16201 agatgagttt tttcaaatga gttcttttaa aagtcaaaaa cctactatcc tacaatggct 16261 acagaaaaat ctgtttaaca actggtacaa cagtattctc actgttgtct gtctctggtt 16321 gctctttttt ggtatcaaag gtattctgac ttgggtattg actcaagcaa aatggcaagt 16381 catcacagcc aacttatctt tattttttgt tggtcgtttc ccacaacaac tttactggcg 16441 attgtggcta gcactcttta ttatcctcgg tctcgttggc ttgtcatggg gaacttttac 16501 gaaacgttta ccccatcgaa tgaacagttg gttaccacta ggctgggcgc tttcttttcc 16561 aattatccta tggctaattg ggggaggctt tggactgcaa ccagtagaaa gcaacttgtg 16621 gaatggtttg ctgctgactt tggtgatggc ggtgattagt attgttcttt cttttccctt 16681 gggtgtttta ttagcattag gtcgtcaaag tcagctacct gttgtgcgtt ggtttagtat 16741 cctttacatt gaaattattc gaggactgcc actgattgga gttttgttct ttgctcaagt 16801 gatgttatcc ttattcctac cagtagaata ccgtttagac agagttctga ggggaattgc 16861 tggtttaatt ttctttagtg ccgcttattt agcagaaaat gtacgtggtg gtcttcaatc 16921 agttccacgc ggacaaattg aagctgccaa agtactggga tttaatgctc ctttaacagt 16981 actactcatt gttcttcccc aagccctacg tgctgttatc ccagcgcttg ttggtcaatt 17041 tatcggtttg tttaaagata cttccttgtt agccattgtt ggattattgg aattaacagg 17101 aatttctcgt gctatcctgg ctcaacctca atttattgct cgctatgcag aagtctattt 17161 attcattgca tttatttatt ggatcttttg ctattccatg tccctagctt cacgacgtct 17221 agaaaaggaa ttgggtgtag gtcaacggta atggggatat cttgagcagg ggagaataat 17281 ttttgattct tgactaatga ctaatgacta atgactaatg actaatgact aatgactaaa 17341 taattatgaa ccaacaaaag cctattatta ttgcccaaga tatccacaag tggtatggta 17401 aatttcacgt actccaaggg gttagcctga cagtcaatcg tggagaagtc gtggtactga 17461 tgggaccatc gggttcagga aaatcgacct 
ttattcgcac ctttaatgct ttagaagaat 17521 atcaagaggg acgaattgaa attgatggaa ttgacctcac aaatgaccta aaaaatatag 17581 aagcaattcg gcgagaagtg gggatggtgt ttcaacagtt caacttgttt cctcatttga 17641 cagtattaca aaatatcact ttggcaccaa cttgggtgcg caagtggacg aaagtcaaag 17701 cagaagaaac ggcgatgcag ttactagaac gagtgggaat tttggaacaa gcacagaaat 17761 atccaggaca attgtctggt ggacagcagc aacgagtagc tattgcaaga gcactagcta 17821 tgcagcctaa ggttatactt tttgatgaac ccacttccgc tttagatcca gagatggtgc 17881 gagaggtttt ggacgttatt aaaactctcg ctgctgatgg tatgacaatg gttgtggtta 17941 ctcatgaagt tggatttgct cgcgaagccg ctgaccgcgt gattttaatg gatagcggta 18001 cgattgtaga ggaagccaca ccaagtattt tctttcaaaa cccaaagcac gatcgcactc 18061 gcaagttctt atctcaaatt ctctaacaaa acctcaagca tactattcaa acagttatca 18121 gttatcagtc atcagctcat cttagcttga taactgttca ctgtttactg ttcactggta 18181 actggttaag acacaagcaa gtgcgagaaa cgcagcctca acaggcgtag cagacgcaac 18241 cagcctttgg ctggctgttc aaaaaatgta aaaatatcgc cataaccttg attggttttt 18301 tggtggtgtt cccaatgtct ttctggagtc gtaataaaca gaattcggca taaaatctct 18361 atgaactttg gagtttgcca acccaaggta gtggtgtgtc tccaccatac atgaaactga 18421 ccaaacagta gcccacacac aacacctatc ggggaaaaaa accatagaac tgctgctact 18481 aataaataag gtaaagcacc cagaatacca tccagaagaa cttgggaatt gaaagtcagg 18541 atagcataat ggcgaaagct ctttttgtag gagtggtgag ttcgtaagtg aagcatacca 18601 aaaacatgtt caggtacgtg atagcaaaag gtagaaagga agtcaccaat aacaaggagt 18661 atccaagcac acgcgatcgc ctcaaacatt atttaatcct caaattttag taaaaaccac 18721 cacacaaaaa acaatctgaa tcttctttct cagtgttatc acatacattt ttttacaagc 18781 tataaaaaga cacaaattag ccccgtgaag cactttacaa ggattctcgt tctcgactgg 18841 aattcttagg ttctgttgat tttgctttaa atgacagttt tagactgatt tgttttgtgg 18901 aacatttatt gacaatctcg catatatcta aaaaatgata gtatttccgt agtttatcac 18961 agttatacta aatccgcttt cataaccctc tcttaaccct ggcacagtgc ggctatacga 19021 gacttgtcag tctatgcacg aagcgcacgc tttgcgaacg cggagtcatg tccaaggggg 19081 gtattgacct agacttggta ttatttctta attctttaga tagagtaaga tactaagagc 19141 aaaagttaag 
ttaaaatttg cacaaatttc ggttaaagtt tcctcgtggg gggaaagctt 19201 tcggtagtcg cctaccttcg ggttaccctc caaagacgga atttcgattg aattatgtca 19261 aaatatcgac cacaaatcag aatttgcttc cttggcgaat catttgtcaa tggtactggt 19321 gatcctgagt ttcttggatg gactggaaga atctgtgttg atgcttatag aaaaggctat 19381 gatatcacct tctataattt aggagttagg ggggaaacga gtacagagct aagacaacgt 19441 tggttaaaag aagtttctta tcgtttacca aaacagtaca acggcagagt tgtattttct 19501 tttggggtga atgatacaac actgataaat agtaaacctc gtgttgagtt aacagaatca 19561 atagaaaatg ttcgtagtat tttgagtaca gcgaaacagt tgtatcctgt tttaatggtt 19621 ggtccaccac catgtgcaga tgaagaacag gataagagaa atcagagaat agcgaattta 19681 tctaaacaat ttactttagt ttgtgatgag cttgatatac catatctaga catttttcct 19741 attttggaaa agtcaaatat ttggcaagat gaggcgagag cgaatgatgg tgctcatccc 19801 aggtcagcag gttacgcaga atttgcagaa attgtgcaaa gttggaatgc ttggttaaat 19861 tggtttcctc ctttttcatc atcagatttt tgaatatagt agtagcgcta ccgcgcggta 19921 gcgctacttt gtgtgatctg acagagttgt caggtcaagt tctaacttag ttttgttgtc 19981 tgatagcgta agtcttgact tccgtcattg aaagtatgct tcggtggtct agcgtgcgat 20041 ttgtgaaact attcgtcttt tgcgccctta tgagagttct taccaccttc acgaccgatt 20101 tctgacatat gttctcgatc ctggcttact gtttcaccac ctttgcgacc agcctctttg 20161 gcttcctcgg aagtaaactc gtgcgctgtt cctttttcgt gagcagcttg cccaccaagg 20221 ctcgcaatct cgcgctgttt atcctcatcc attgaggcaa atccacgctt acttgtgtct 20281 gccataatat ctctcctttt gactcaagtt cttgtgcttt acaaagtgaa aacttgcgag 20341 gcttttttgc ccatttgtaa ttattatttg caagttctgg agtcatttta aaaagccgaa 20401 aatatgagta cctctgtcta aaagtgggct ttaatttgaa cgagttctga gtcggagggc 20461 tgattttttt ggtaagttca tcggttagcg ataagccgga ggctttagct tttgcgtatc 20521 gcaagcgttc attaaaagca tatagaagtc ctaaatcatt tgtgaacaac aagattctcg 20581 actttgccaa agttgtcggg aatctgagca tgcgcgattt atgcgctacg cgcaggcact 20641 ttcacacaaa tcaaagagga ttgctatagc ctatgaaaat agcagtgctc ataagcacca 20701 gggtgataaa aatcaccgtt attgctaaaa taatttacat tactttatgt tatgttactg 20761 attgtttcgg aatgtgatca ctttagcaac actcatatga aggcagttct caactcggat 
20821 accccacaag tcactcgatt attttctaac cttgaactct atggaaacaa actcaatcat 20881 caaccaggta gtgttttagg aagtactgca cttgttgctg gaacaactgt tggtgctggt 20941 attctcgcat tgcctgctgt cactttaccg tctggcgttg tgccatcgac agttctactt 21001 gttgctgttt ggctctacac cttagtttca ggtcttctga ttgcagaggt gactttaaac 21061 acaatgcgtc ttgtggggag ttcaagttca ggtttgttgg tgatggttga gaggacgctt 21121 gggaagccga gtgcgcgggt tgctggtggt gcgtatctgt tcttgcacta cgccttgctt 21181 gtggcttatg ttagccaagg tggagagatt ttagtgtctg ctgttgaaaa agtgttaggt 21241 gtacagaata atctgcctgc gagtgtgggg acaacagcgt tcactctctt gtttggtggc 21301 atcatgtacc ttgggcgaga aagatttata gagaaattaa acagcgcgtt tgtggcaatt 21361 gtccttgctt catttgtagg actgttagtg ttgggagcaa cgcagcttaa gacttcctct 21421 ttctcgtttc aaaattggag tgcgcttcct gcagccgttt cagtgatgtt tgtggcgcta 21481 ttttacagta acgttgtgcc aactgttgtc actcaactcg aaggtgatgt ccgcaagata 21541 cgtcagtcca tcttcattgg ttctgcgatt cccttaatca tgttcttggc ttggaatgcc 21601 gtgattttgg gaagcgtcag cactgatatg ctacaaggta gttctggtgg cggaactgtt 21661 tttgacccac tgcaaattct acgtgaaggt ggcgcgggag aatggttagg agtgttagta 21721 tctgttttct ctgagtttgc aattgtcaca tcatttattg ggtttgtgta cggcttgcta 21781 gatttcttca aagatacttc agcacccaat gagccttcta aacgtctacc gcttttttca 21841 ctgattcttt ttccaccgat gagtcttggg gcgcttaatc ctagcatctt cttggctgct 21901 ctagactatg ctggaacgtt cagtatttca gttttaggtg gaattattcc tgcattgatg 21961 atttggaagc aacgtcaaga acagcaacaa tcaaatagca tgagtcaacc gctggttcct 22021 ggtggtaaag tgacgctgat tgcaatgatt ggcgttgcgt tggttgtgat cgccaaacaa 22081 gtggtgtcaa tctggggttc gtttggatac tgataatttg gactacttcc ctccgtctcg 22141 tccttatgtc ctattggaca ggctgcactc agcgcgtaat atcaagcctg gtacgtagca 22201 tttggcttga tattatctcc tccgtaggag gcaaaaattg aaaaataatt aagacatatt 22261 gaattgaaac tgtctaagtg agaagattat tagtcagtga tgagactgct ttagtgtttt 22321 cacttaacaa gtagaaaata attgatgcgg tgtaaaactg acgactgttt cagcaaaact 22381 actagtgcag cacggcagaa atatctgttt tgtttttgaa agctcaaaac tagatgttat 22441 gagcttagtt ctgttttgcc tgatggcact agttgtaatt 
taagtaatcc caggagaaat 22501 tatgtctcta gaagacgtcc aagcattcta ccaaagacta ggaactgacg aagctttccg 22561 tacccaaatt caaggggtta acagtaagga tgaatgtagc caaatggtca aaagtgctgg 22621 ctatgacttc acccaagaag agttggaaga atatacagcc gaactgttgg agttaagtgc 22681 tgatgaggat ggtctggcag atttagacga aaaagaattg gcaactgttt ttggtggaat 22741 tgttgcacag ccactgtatg gaggaatcat ttatgaacca cccactgact ggtcacccat 22801 taagccaccc attaagtggc cacccaagaa gtggccaccc attgatcctc agcctttata 22861 tggaattgtg gtatcgccag aataataatt tcaaggaagc ttgagcaata agcttaagtt 22921 tattgggacg gatacaagta agtccgtccc tatctgaata aattagctga ggatatttta 22981 atgacatatc gcagagtgag ttatgcggtt tgggaaataa cgctaaagtg caatttagct 23041 tgtcagcact gcggttctcg tgctgggcac acaagggcga aagaactttc tactgaggaa 23101 gcccttgatt tagtcaaaca aatggcggaa gtcgggatta cagaagtcac cataattggg 23161 ggtgaggctt tcctccgtcc tgattggttg gaaattgcct ctgcgattac cgatgctgga 23221 atgcgttgtg gtatgactac tgggggttac ggtatcacct tagacacagc acgccggatg 23281 aaggaagctg gaattaaggt ggtttctgtt tcggttgatg gcttagaagc gactcacgat 23341 cgcctcaggg gtagaaaagg ctcttggcaa tgggcgttta agaccatgag ccatcttaag 23401 gaagctggta ttatctttgg ctgcaacact caaatcaatc gcctatctgc accggaattt 23461 ccccaaatct acgagcgtat ccgcgatgct ggcatttttg cttggcagat tcagttaaca 23521 gtcccaatgg gcaatgcagc ggataatagt gacattctgc tgcaaccata tgaactgctg 23581 gatgtctatc caatgatagc tcgtgttgct caacgcgccc aagaagaagg ggtgcaggtg 23641 cagccaggaa ataatatcgg ctattacggt ccttatgaac gattgctgcg gggaggacat 23701 gcttgggctt tttggcaggg atgcaacgcc ggactggcta ctttgggcat tgaagcggat 23761 ggtgcaatca aaggttgtcc ttcactgccg acttcagcat acaccggagg taacatccgc 23821 gaccactcac tgcgaacaat cattgaagag acagaagaat taagatttaa tctcggagct 23881 gatactccta aagggacaga acatctttgg ggtttttgta agagttgtga atttgctcaa 23941 ctctgtcgtg gtggttgctc ttggactgct cacgttttct ttgacaaaag aggtaataat 24001 ccttactgtc atcaccgtgc tatcacgcaa gtaaaccgtg gtgttcgtga gcgagtgttt 24061 ctcaagcatc aggcggaggg aaacccgttt gataatggcg agtttgcttt ggttgaggaa 24121 gcaattgatg ctcctttgcc 
agcaaacgat ccacttcagt ttagtggcga tcgcattcaa 24181 tggtcaaaaa gttggcaaca agaaccatta agcaattcaa aatcgcgtag cgttgcaagc 24241 gttgc // LOCUS NODE_1288_length_24055_cov_5.24862524055 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 24055) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 24055) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..24055 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 497..687 /locus_tag="DP116_11485" /pseudo CDS 497..687 /locus_tag="DP116_11485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002733701.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 870..1829 /locus_tag="DP116_11490" CDS 870..1829 /locus_tag="DP116_11490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655562.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acetyl-CoA carboxylase carboxyl transferase subunit beta" /protein_id="PRJNA477356:DP116_11490" /translation="MANNEESRGLKSLLDWFANRRKSGSTILEPQEREIADGLWHKCS NCGVLTYTKDLRANQMVCVECGHHNRVDSDERIRQLIDMNSWRSIDEHLRPTDPLQFR DRKPYIDRLRETQDKLNLVDAVKTGFGQINGLSVGLGVMDFRFMGGSMGSVVGEKLTR LIEQATQRRYPVVIVCTSGGARMQEGMLSLMQMAKISAALQRHQQARLLYIPVLTNPT TGGVTASFAMLGDIIIAEPKATIGFAGRRVIEQTLREKLPEEFQTAEDLLKHGFVDEI VPRTQLKQSLAQLIALHQPPTPTTTHNNMVVWETRTLSSTSVE" gene complement(2477..3892) /locus_tag="DP116_11495" CDS complement(2477..3892) /locus_tag="DP116_11495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878239.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="low-complexity protein" /protein_id="PRJNA477356:DP116_11495" /translation="MSAKKTDVLRNWLIVVAAMFILSPIVFFITSSKMEELSNQQKIE NRTQVLSTTATFFLGLAVMIHAYVVAKRAQALQESAISQRLDAERIAAQKSNEIAIKN VHLAEERLMTERFMAAITTLGHQSVATRTGAIYALERIAQDSPQQYWIIMEILTAFVR ENAAGKPQNEQIQQTARIATDIQAALSVIARRDAQKDQPNQKIDLRYADMSGADLHKA NLQQADLRGANLCQANLQGANLCEANLDGAKLCGSILYEANLQSTNLTDANLCGANLN CAQAYGANLRSANLTGATLRGANLQRANLYKANLQWSNLKAANLQEAKLFLANLQGAK LGKANLQGTGLIGANLQQANLNGANLQQANLNAAKLQHTEVFFANLSEASLRETDLCG ANLMGSNLQMAILYEANLCGANLMGANLNMTNLSHVKLEGAILTGAKNLKLHQITLTE GDLKNRLPENVQSPGDRTLIS" gene complement(4523..5048) /locus_tag="DP116_11500" /pseudo CDS complement(4523..5048) /locus_tag="DP116_11500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195015.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" gene complement(5073..5671) /locus_tag="DP116_11505" /pseudo CDS complement(5073..5671) /locus_tag="DP116_11505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126933.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="tRNA-(ms[2]io[6]A)-hydroxylase" gene 5966..6531 /gene="ssuE" /locus_tag="DP116_11510" /pseudo CDS 5966..6531 /gene="ssuE" /locus_tag="DP116_11510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867817.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="FMN reductase (NADPH)" gene complement(6683..7717) /locus_tag="DP116_11515" CDS complement(6683..7717) /locus_tag="DP116_11515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318540.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_11515" /translation="MLLGDVSNSDVLIIGSGPAGSAAAIACAQRGLRVILIEREQFPR SHPGETLHPGVEPLLKQLGMIEPVLDAGFLRHTGNWVQWEAQRHFVPFGEDDSGAWLG FQAWRADFDTILLNRARATGVEVLQPCQALRLLVDGDRVVGVETSLGTLRASKVVDAT GSHHWLARQLQVQINYHSPRRIVYYGYACGECPVRDDAPAIIADSGGWTWTARVQPQL YQWTRLSLVDQKIPKDWLPDEFHGLKIHQKMQAADVTWRIVSQPAASGYFMVGDAAMV LDPASSHGVLKAIMSGIMAGHLLTAELLGGLSPAQAIHHYCQWIHNWFEHDVEKLSKL YAILPNSKNY" gene complement(7695..9887) /locus_tag="DP116_11520" CDS complement(7695..9887) /locus_tag="DP116_11520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11520" /translation="MSLIEDVKKVCDRLAPLGWRDLLLAHGVDITAANLKQELTKELP NINRTIKGFEDFAFEGKRGIESGNPARSLLFHAFASPNVVSENLREYPTLGEIEVIEN YVYGVQPPSLEDLRVRAQGQLLAIVVFAYEYRPASETVQRKHADLCFSRTGVARVGTA QALYDTKLRGFLPFVKEDSQLFRVLPARYSAYIAVQRRGNSTTFGPLRFQNGDDGRLF WVPLHKLFNGKECIRNLDLQVTLNARHINEKLRRVHLVLKNTGWGEPDISNPPFIFSD GIAELTTKPEFGNGLLVPVVQKNLVEAATYQGKPLTFKVPPNSETLSSSLSISADRTT GARRAPEYVHARHKFSNGQIENLNNLPNVTEVVQKGDYNALHYVDFTGDGSIEAVCPQ LAVAIPRRVPAYSLVTAPDFFPNCDQRELLEWTQQSVPTALRQNLWRVPPETLSDNRF APNLQLKNANFRPEDKTVTAIVCLPLSGSVKQMPLDGSPTMRHAYLPDGASGVFAPGW DISFDTVNGVEHLAAYGLGSPFPEDSKLCAALSTFWPAVAPDATRTFAPNPSWPTVSP LTDEEIGQVGNLAWDGIPGPRRVTRNGKNFIEYTDFAHADYVENSVQKKFSLSLTGKV DVREYEARVLAMANVYRVIPEARSQWSVFSFRQVQPTDSELQQAQTQTTTRLAGIIYR FELYRFGNSSPNPDDFRKQLVEIQETATLFVTPLNILLKRNNAAWRRV" gene 10362..10919 /locus_tag="DP116_11525" CDS 10362..10919 /locus_tag="DP116_11525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318659.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acireductone dioxygenase" /protein_id="PRJNA477356:DP116_11525" /translation="MAILQLEDGTRYTDLQDISRELAPLNIQLNRWAVGESQQLRELL SQDSLNEDEKQQVLTFLDGYFEQLQQTAGYQTRDLIVLHPGIPNLDTMLTKFDKIHTH SENEVRYIIDGEAIFGFVRPDDSQVELTVHPEEYINVPAGTEHWFYLTPARRVKAVRY FTGSEGWTPEYTGREIHTRQVVTQV" gene 10931..11581 /gene="mtnB" /locus_tag="DP116_11530" CDS 10931..11581 /gene="mtnB" /locus_tag="DP116_11530" /EC_number="4.2.1.109" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858716.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methylthioribulose 1-phosphate dehydratase" /protein_id="PRJNA477356:DP116_11530" /translation="MNSPKLIDPRLELILAARHFYQQGWMVGTAGNLSVRLPDDSFWI TASGRSKGELELGDFVRIYADGTVEKPSPDVKPSAQTVIHQALYTLFPEARSCYHVHS VEANLVSRFVQGDTLPLPPFEMLKGLGIWQENPDCAMSIFANHSQVSCIADEIKERFT TIPPQLSALLNRDHGVTIWAPSAKTARNYIELVEYIFRYMVAARGTGFCRAGEERE" gene 11618..11917 /locus_tag="DP116_11535" CDS 11618..11917 /locus_tag="DP116_11535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127682.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="antibiotic biosynthesis monooxygenase" /protein_id="PRJNA477356:DP116_11535" /translation="MTHQTIRVVARIIALPEKVETVKAVLLEIIEPTRQEAGSIKYEL LQNQSDPTDFTFVEEWASDDALDTHLATPHLNEAGAKLASLLAAEPDIRRYSVLA" gene complement(12056..13741) /locus_tag="DP116_11540" CDS complement(12056..13741) /locus_tag="DP116_11540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11540" /translation="MDWITLLRSLQSDFIKRLTSGCLLHCETEGQYSELTVISGERLK ALREFCWQMAEKYKRTSPVRDVFISNLKGKLGEEVVKERLAAFVTEVDYEKRFGIGDG KVDFTLTFDPSVGIAVKSRHGSLNKVRWSINSEEVQKNAVVVCIFIQEEVNEAQPEYH LFFAGFLPTQMIKLKTGKISFGIEQLLYGSGLRCYLQQLESLTPVNSHNQKSQLEKSN PKQEASYQQHKQHFTKTNFSPEQQQPSYQLHKQQSLKTSFSPEQNDYPRFHNEDLNLL SVKLGDECFEKEQYNAAINNYNQALKLNSKDAETYYKRGLARYHLRDYEGAIADYAQA IQINPYYGKVYTKIALARYHLGDYEGAIADYTQAIRMNPNDAVAYRNRADIRYELGDY QAAIEDYNQAIKINPNSSNLANSKKAIELFSQSIELKPNDINSYKNRGNSRFDLGDYQ GAIEDYTQVLKINAHDIDAYYNRGQARCHLEDYQGAIEDYTQIIKRNPNDADAYYHRG HARYNLGDKQGVVDDFQKAADLYRKEGKLEEHKKTREKILDLEIEESLDILNF" gene complement(13983..15491) /locus_tag="DP116_11545" CDS complement(13983..15491) /locus_tag="DP116_11545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995281.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UbiD family decarboxylase" /protein_id="PRJNA477356:DP116_11545" /translation="MARDLRGFIKILEERGQLRRISAIVDPDLEIAEISNRMLQKGGP GLLFENVKGASFPVAVNLLGTVERICWAMNMQHPQELEELGKKLAMLQQPKPPKKISQ AVEFGKVLFDVLKAKPGRDFFPACQQVVIQGDELDFNKLPLIRPYPGDAGKIITLGLV ITRDCETGTPNVGVYRLQLQSKNTMTVHWLSVRGGARHLRKAAERGKKLEVAIALGVD PLIIMAAATPIPVDLSEWLFAGLYGGSGVQLAKCKTVDLEVPADSEFVLEGTITPGEI LPDGPFGDHMGYYGGVEDSPLIRFHCMTHRKDPIYLTTFSGRPPKEEAMMAIALNRIY TPILRQQVSEIVDFFLPMEALSYKAAIISIDKSYPGQARRAALAFWSALPQFTYTKFV IVVDKDINIRDPRQVVWAISSKVDPTRDVFILPNTPFDSLDFASEKIGLGGRMGIDAT TKIPPETEHEWGAPLESDPEVAAMVERRWAEYGLGDLQLGEVDPNLFGYDVR" gene 15793..16452 /locus_tag="DP116_11550" CDS 15793..16452 /locus_tag="DP116_11550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009634462.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MotA/TolQ/ExbB proton channel family protein" /protein_id="PRJNA477356:DP116_11550" /translation="MTQAYDIFLAGGLVMWALLGLFTVTSVCILERSWFWLRLIVQEK KVVREVLTTAKVDLEKAEEIAENAQVLAIGRFLVEPLRLKKPSPETFHLAIKAACDRE FVEMRKGGMLLKSVVAIAPLLGILGTAEGLVTTFTNLKTNSFNITDLSTVTLGLTQAL FSSTTAIAVAVFAFIFFRLFLCLQARQMGFFSQVGNDLELIYLQYWYEPSTQTNTQTN N" gene 16912..17946 /gene="pstS" /locus_tag="DP116_11555" CDS 16912..17946 /gene="pstS" /locus_tag="DP116_11555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter substrate-binding protein PstS" /protein_id="PRJNA477356:DP116_11555" /translation="MTFSTTVLNRVVSASAIMAAVTLGPIFTAIAQAETINGAGATFP APLYERYAREVKKKYPDLGVNYQAVGSGAGVNQVIAGTVDFGASDSAMTDAEIAKVKN GVILVPTAGGAVAIAHNLPIDNLKLSRKTLPAIFSGQITKWDDPQIKADNPGVNLPSQ PIKVVVRADSSGTSFIFTNHLNSISPYFKGRIGVSKSPNWTIPNVTKAKGNPGVASSV TRTPGAIGYVEYEYALKNKLKVAQVQNKQGQFVAPSLQTANEALSTVNFPDNFRVFVD DPAQGYPIVGLTWLLVYKSYPNAAKGTAVKNFLNWVLTDGQQINDDLNYSRIPAPVAQ KVIQTVNSIK" gene 18033..19025 /gene="pstC" /locus_tag="DP116_11560" CDS 18033..19025 /gene="pstC" /locus_tag="DP116_11560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198598.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease subunit PstC" /protein_id="PRJNA477356:DP116_11560" /translation="MANSFEPANSNQPSLIDESIDITASSGKNFWIDQGFTWLVYVFA ALTVIVLFWMSLIIFQKALPAIEKFGLGFLWNQQWDTGNLVFGALPYIYGTLVSSAIA MLFAVPVGIAVALVTSENFIPPSARTTIAFIVELIAAIPSVIIGLWGVFVFIPALVPL QTWLSSFFGWIPLFHTPGPAGFNILTAGIILAIMILPTMAAISREVLLVVPKELRSGS MALGSTRWETIFRVILPAGFSGIVGAAMLALGRALGETMAVTMVIGNSAQISPSLLDP AYSIPSVLANEFAEAQDPLHIGALTYLGLILFAVTLVVNILALVIVQFVGGKGK" gene 19158..20036 /gene="pstA" /locus_tag="DP116_11565" CDS 19158..20036 /gene="pstA" /locus_tag="DP116_11565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease PtsA" /protein_id="PRJNA477356:DP116_11565" /translation="MTSHQEFNTDKSVAAEICSPLSPVRMIFDYGMTVVAFFLSALAL IPLLSLLWEIIGRGITSIKPSMFVTSVINDGFANAIVGTLEMVIIAALFSIPTGVMTG IFLSEIGKGNRIGRAVRFVASILTGVPSIVVGVFAYAVIVLITKQFSAIAGGFALAVL MLPVIVLTTEEALKLIPTSVRLGSAALGGTRFQTTFRVVVAAAIPAITTGVLLAVARA AGETAPLIFTALFSLDWSSDLFGPTASLSVLIFNLYNDPSPEKTALVWTASLVLVGII MLISILSRVFTRKRNV" gene 20086..20865 /locus_tag="DP116_11570" CDS 20086..20865 /locus_tag="DP116_11570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016953405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_11570" /translation="MSKLNPAIKVKNLSFYYGTSKALEGVTMDVYENQVTAIIGPSGC GKSTFIKCLNRISELEGPVKVDGSVEFYGQNIYSSRVNLNRLRRQIGMVFQKPNLFPI SIYDNIVYGIRIAGWRPRAELDEIVEYALRGAAIWDEVKNKLNKSALGLSGGQQQRLC IARALAVKPKILLMDEPCSALDPIATMKIEELIHSLREELTIAIVTHNMQQAARVSDF TAFFSTDESRIGQMVEFGQTNQIFSNALDSRTRDYTSGRFG" gene 21556..23865 /locus_tag="DP116_11575" CDS 21556..23865 /locus_tag="DP116_11575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321389.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycerophosphodiester phosphodiesterase" /protein_id="PRJNA477356:DP116_11575" /translation="MANVTLKGFASLPADTFAEGRSSGNFITGNTNGRTVPFLTQPVQ GFSAVQFADANTFWFLPDNGFGAKNNSADFQLRIYRLNPSFQGTEGGNGRVEVLNFIQ LSDPNNQVPFKIVNEGTTDRLLTGADFDTESFVLGSDKSIWIGDEFGPYLLHVDKTGQ LLEAPISTPNYYKLNTLNGQPPIVIGHRGVAGERPEHTLESYKVAIERGADFIEPDLV STKDGVLIARHEVNITETTDVGTHPEFADRYTTKTIDGVTEKGWFADDFTLEEIKTLR AKERLSFRDQSYNGLFEIPTFQEIIDLVKQVEAETGQKIGIYPETKHPTYHDSVGLSL EEPLVETLKKNDFTDPTRVFIQSFEVSNLKELNQKIDVPLIQLLDAEDVSLDGTLIEK QPYDFVVSGDPRTYGDLRTAEGLKEIATYADGIGPWKRMIVSVKGVDADGDGKADDVN GDGAVNDADKTTLPPTTLVKDAHAAGLLVHPYTFRNESQYLAADYNKNPELEFQQFIK LGVDGYFTDFAGTGDKVRDQITGEFVRSPDNPDVLAGLAYSNLASSKGFEGLAINPDK TKLYPLLEGSVLGDPNDALRIYKFDVASKQYEGLVGYYRLENSSYAIGDFTVVNDNEY LVIERDNGQADTAKFKKIFKVDFSEKDANGYVAKEEVADLLNIQDPNDLNQDGSTKFT FPFQTIENVLVIDKNTILVANDNNYPFSVGRPPAIDNDEIILLGLEKPLSLDPRVGLA GLDNNTSISQGHDLLGTQNWAQLTNVNII" BASE COUNT 6888 a 5026 c 4934 g 7207 t ORIGIN 1 ttacaaacca gatgattttg ggctttggtt ttcagcacta tcactaaagt atcagaaagc 61 tacaaacaag tcaaacttct ctttatgaat aattatgaaa gagtttagta aagtatttgg 121 ctatttagtc aagctctact tgcagataca ctagaattgt agatttaatc accactacta 181 tctttaagct tttgataaca tcgtcttgtg aattcaacct gctaggctgg ctgaccacac 241 cttgtagact ggtgtaagcg caagcgcacg caatcatgcg aacaccattg aaagtaacgt 301 atcagttttg ttcgcgcaag 
ccagcgcaga acttggaagc gatagctgta acattagtca 361 cagatgtatc ggatcttctc caaacaggct gcgcgtgcac tggcacaaaa gccgtagcgt 421 taggcacctg agactacccg cagggtgagt tggatgatcc tcttaatagc gagtactctt 481 aacacgcgca aaattgatgc aacgactgct tgcctttcct aacataccaa caaatattat 541 tgatttaggg ggtgaaatcc tacttgtggg cagggacttg gtgcagcttt gcaggtacat 601 ttacaagaga aatggactgc tttgctactt tttaagtttc ccaaaaggcg agcaagagac 661 ttgaattggt ttacgtattt tcgttaagct gacaaccaaa gatctaaaat atggtttcat 721 ttgcttacaa gacaattttg aatatgttgt cattagttat gagtcattca tccttaatat 781 ggacaaagga gtcatgataa aaaactaaga ctttgacagc atcggtttga tctattttca 841 acatactatc cacgtaagat aagatacaaa tggcgaacaa cgaagaatca cgcggtttaa 901 aatctttatt agattggttt gcaaatcgac gcaagtcagg atctaccatc ctggaacccc 961 aagaacgcga aattgccgat gggctgtggc ataaatgttc taactgtggt gtgttgacat 1021 ataccaaaga cttgagagca aaccagatgg tttgtgttga gtgtggacat cataatcgag 1081 ttgatagcga tgagcgcatc cgtcaattga ttgatatgaa cagttggaga tcgatagatg 1141 agcatttgcg tcctaccgat ccactgcagt ttcgcgatcg caaaccttat attgatcgtc 1201 tgcgggaaac ccaggacaaa ctgaacttag tagacgcggt taaaactggt tttggccaaa 1261 taaatggttt atccgttggt cttggcgtta tggacttccg gttcatggga ggaagtatgg 1321 gttctgtcgt cggagaaaaa ctgacgcgcc tgattgaaca agctactcaa cgacgatatc 1381 ctgtggttat tgtttgtacc tctggtggcg ctaggatgca agaaggaatg ctgtcgctga 1441 tgcagatggc gaaaatctct gcagctttgc agcgtcatca acaagcgcgg ctactgtaca 1501 ttcccgtttt gacaaatcct accacaggtg gtgtaactgc tagttttgcc atgttggggg 1561 atattatcat agcagagcca aaggctacta ttggttttgc tgggcggcga gtgatagagc 1621 aaaccttacg ggaaaaactt ccagaggagt ttcaaactgc ggaagattta ctcaagcacg 1681 gctttgtcga tgagattgtt cctcgtactc agctaaaaca atcattagca cagttgattg 1741 ccttacacca gccacctaca ccaacgacaa ctcataataa tatggtggtt tgggagacaa 1801 gaactttgag ttctaccagt gttgaataac catcccctct caaatattag acttgttgca 1861 aaagtcgaaa atttgtgatt ttgaatcaca ccaatgactt acgatatgtc tgccacgagt 1921 ggaacacaac gccccccttc aaagggtagc ctctgtgaac aagggctttg ggttctcctc 1981 ttggaagatc gccctcccta gaagagatga aaggctaggg 
gaatagttct tagcctcgct 2041 cttttgatgc ttagtagaac ttgcgcactt gtattggaaa ctggagcaca atagacacat 2101 cttggcacgg gagtagtttg gctgtacaga caaagcctgc cgcagcaggg cgattcagct 2161 tatcgctgaa atatttttgc aagaggtcta ttcaaaagtt aaaatatgcc caaatagcat 2221 attcacaatt ttggatttcc ctatagggtg tggggatatt ctgatttatg tgttgttcat 2281 ttttcttaaa taaatataaa aatataaagt gcaatgaaaa agttgaataa ctaaaatttt 2341 ctgtaatagg tagctgcaaa agactttagt gagatcggtc acttttggtg tattgaaaag 2401 taggaaagca ccctgtttgc atatgaaatg caaatgaatt atcaggcagt taattatgct 2461 tacaacttgc tgttagttaa ctaatcagtg tcctatcccc tggtgattga acattttctg 2521 gcaagcgatt tttaagatcg ccctctgtca atgtaatttg atgcagtttc aggtttttag 2581 ctcctgttaa gattgcccct tccagtttga catgacttag gtttgtcatg ttaaggttag 2641 ctcccatgag gtttgcccca cagagatttg cttcatagag aattgccatt tgtaagttgc 2701 ttcccatgag atttgctcca cagaggtcag tttctctaag gcttgcttca gaaaggttgg 2761 caaaaaacac ttctgtgtgt tgtagtttgg ctgcattgag gtttgcttgt tgcagatttg 2821 ctccattaag gtttgcttgt tgcaaattgg ctccaatcaa acccgttcct tgcaagttgg 2881 ctttacccaa ctttgctcct tgcaaattag ctaaaaacag ctttgcctct tgtaagttag 2941 cagcttttaa attagaccat tgcaaattgg ctttatataa attggctctt tgtagattag 3001 ctccacgcaa agttgctcct gtaagatttg cgctacgtaa gtttgcccca taagcttgag 3061 cacaattgag gtttgcccca cagaggttag catcagttaa atttgttgat tgcaagttgg 3121 cttcatatag aattgaccca cagagtttag caccatccaa atttgcctca cacaagttag 3181 ctccttgtag attggcttga cagaggttag ctcctcgtaa gtctgcttgt tgcaaattgg 3241 ctttatgcaa gtctgctcca ctcatatcag catagcgcaa atcaattttc tgatttggct 3301 ggtctttttg ggcatcacgc ctggcgatga cactcaaagc tgcttgaata tctgtagcaa 3361 ttcttgctgt ctgttgaatt tgctcatttt ggggttttcc agcagcattc tctcgcacaa 3421 aagctgtgag aatttccatg attatccagt attgttgagg agaatcctga gcaattcttt 3481 ccaaggcata aattgcaccc gtccgtgtgg caacactttg atgcccaaga gtcgtaattg 3541 cagccataaa gcgttctgtc atcagccttt cttcggctag atgaacattc tttattgcaa 3601 tttcattgct cttttgggct gcgatacgct ctgcgtcaag cctttggctt atcgcacttt 3661 cttgaagagc ttgcgctcgt ttggctacca catatgcatg aatcatcact 
gctaatccta 3721 agaaaaaggt agcagtcgtt gataaaactt gagttctatt ttcgattttt tgttgatttg 3781 ataactcttc cattttggaa gatgttatga aaaatactat tggtgataat ataaacattg 3841 ctgctactac tatcaaccaa tttctgagta cgtctgtctt tttagcagac ataatgtttg 3901 tgtccctcaa aacacagaaa ttttaggtag attacaggaa aaaaggttat agcatttgtc 3961 tggtaaattt caacatctga cgagataatg gtagatgtag caacttgtta ttaaaaatca 4021 atggcagcct aaagtggttc cattttttta caaaaagttg tccgatgact atccctgaag 4081 ttggtttaaa atacttatta ttaatttagt ttgataaaga atgcaaccct ctcaagagag 4141 attttttgta ttcctatttt catttatttt aatgtttctc catactccat agatgaagtg 4201 agaattgcaa atcagatctt taagaatcaa gtagtgattc ttaatacttt tgtctaagta 4261 tacgcttttg tatagatatt ttacattata aagattgtat aaagttagca attgtctctc 4321 atttttaaac ttgacttttt tgccaaaaaa ggcaaaaaga caaagtatag agaaagctag 4381 agactgattt cggagtattt tttttagagc ttaagactca gagtcggtgc atgctaagtg 4441 agggttatgt ctcgcttgct atgctcaaat taatgcattc aaactatgtc gtaggagtac 4501 gcacttaaca aagttcaaat aactaatgac tagagcgtag cgattttaca taaactctat 4561 cgcaacgagc tgttctcaca ccatttgcct tctgataccc gttgctttca tatagtttga 4621 ctgcttcggt caaaacactg gcagtttcta tccaaatttg gtcaaaacca cgatatgcga 4681 tcaccccaaa ggggtggctc ctaaaggagc atcgctgttt ctcattgttg tagcaaatat 4741 tttcctaatc ctatacccct aacgcgaggc aaaagataca ttttgcggcg gatttctaca 4801 gctttttccc ctcgttttat aggatagtat cccccagtac ctacaagttt attttggtat 4861 tcaataattc aaaactcttc cccaactgct aagtagcatt cttccacttg cagcacatct 4921 cgatctgcac ggttcgcttc ccaacccaaa ccatattctg ataatacaga ctggataact 4981 tcactagccc aggtgcgatc cctttcctcc cagtcacgaa tgagaaaatc cttgtagtaa 5041 tttttcattt tcaatatctg ttcttggtac ttttagctgt gaatgcgagg ttctgaatgc 5101 aaatttgcta gcaattgact ttcaacaacc gctaactcat ccaaccgttg tctgacaatt 5161 tcacggtcta ggtaagtatc taaaaaactc agtacattcc agagtgctgt gcctcagatg 5221 ccatcaagct gtgataaaat ttggctaaat cacattctgg gcaatgagcc gctaacaacc 5281 caaggccttc gtgactacga gcttcaatta aaccacttac cagcagggaa tctaaaaacc 5341 gttcaggttc cttggggcga atttccgcct ttaaaccagc accataggga ggtggttgta 
5401 agggacgcaa agggatattt cgacgttcca gccattgatt gaccaattca aaatgttcta 5461 gttcttcacg ggcaattttg gttaactccc gcaccatttt cgcattggaa gggtagcgaa 5521 atatcatgtt caaagcgact cctgcggctt tacgttcgca ctgggagtga tcgaggagaa 5581 tggtgtctag gttagagatt gcttgttcca cccaagcaaa acttgtgggt gttttgaggg 5641 cgttgatcgt tggtaactca gatgaaagca cgtttaggaa ctacaaaata ctctacatat 5701 tcaatattag gtactagcga tcgccccaac aagccaagaa tatccagtcg cagaggattt 5761 aacgtccaaa cctatttttc ataaaaaagt cttgctagac ttaacaggac gtgcaacaat 5821 tttatcttaa gattccaaaa aaaacggtta attgatataa ttgccgtagt ataaagaatg 5881 agatacgcag tcccttggct gaagattatt atcagcttat atttttgtta acaaacgtta 5941 gcgaaaaagt atcggttcaa aaaatatggt taacatcata gcaattgctg gcagtccatc 6001 tcatccttcc agaacctaca cccttctgga acactccact aaaattttgc agcaagaggg 6061 cttccaaaca gatattatct ctgttcgtga tgtccctgct gaagttctag cctatggacg 6121 ttacgatagc cagatctaga acaaccaaaa gcactgatag aaaaggcaga tggtattatt 6181 attgccactc cgatttacaa ggcagcttat acgggagttc tcaaagcgtt tttagacttg 6241 ctacctcaaa gagcgttgtc agggaaagtc gtgttaccta ttgccactgg tgggactcta 6301 gctcatttgt tggcaattga ttatgccctg aaacctgtgt tgtccgagtt aaaagcacgg 6361 ttcgtcttgg gcggtgttta tgcagtagac aaacaactgc aaatacaaca agatggcagc 6421 cttcaatttg atgaggaaat tgaccaaaga ttgaaggaag ctctcaatga gtttgcagaa 6481 tctgtgaaga gtttgcaacc caatgtcaaa gaattggctt ctgcaaatta atttatcact 6541 gttttcctac tactgagtag tgaatcacac atcaaacctg gtataatgta gagacgtagc 6601 atgctacgtc tctacattac gcattgcacg caattgagaa gcgctatatt agcggacgtt 6661 ttggcaagga ctatatcatg tcctaataat ttttagagtt tggcaggatg gcatagagtt 6721 tgctcagctt ctctacatcg tgctcaaacc aattatgaat ccactggcaa tagtggtgaa 6781 ttgcttgtgc ggggctaaga ccacccaaaa gttcagctgt taataaatgt cctgccatga 6841 tcccagacat gattgctttg agcaccccat gactggatgc gggatcaaga accatcgccg 6901 catccccaac catgaagtaa ccagaagccg caggttgcga aacaatgcgc caagtgacat 6961 cagccgcctg cattttctga tgaatcttaa gtccgtggaa ttcatccgga agccagtctt 7021 ttgggatttt ctgatcaact aacgataggc gcgtccactg gtaaagttga ggttgcactc 7081 
tagcagtcca tgtccagcca ccagagtccg caattatcgc tggagcatca tcccgcactg 7141 gacattctcc acaagcataa ccgtagtata caattctacg cggagaatga tagttgattt 7201 gcacttgtaa ttgtcgagct aaccaatgat gacttcctgt agcatcaacc actttagatg 7261 ctctcaaagt tccaaggctt gtctcaacac caaccactct atccccatcc acaagcaaac 7321 gcaaagcttg gcatggctgc aaaacctcaa cacccgtagc tcttgctcga ttaagcaaaa 7381 tagtatcaaa atcggctctc caggcttgaa atcccagcca cgcgccggaa tcgtcttctc 7441 caaaggggac aaaatgcctc tgggcttccc actgtaccca gtttccagtg tggcgtagga 7501 agccagcgtc tagcactggc tcgatcatac ctaactgctt caaaagcggc tctacacctg 7561 ggtgcagagt ttcaccgggg tgagagcgtg gaaattgctc gcgctcaatg aggataactc 7621 gtaaacccct ttgagcgcag gcgatcgcag cagcagatcc cgcaggtccg ctacctataa 7681 ttagcacatc agaattagac acgtctccaa gcagcattgt tacgtttcaa caaaatgttg 7741 aggggtgtga caaaaagcgt tgcggtttct tgaatttcaa ccagttgttt gcgaaaatca 7801 tctggatttg gcgatgaatt tccaaaacga tacagttcaa accgatagat gataccagca 7861 agtctagttg ttgtttgagt ttgtgcctgt tgcaattctg aatcagttgg ctgcacttga 7921 cggaacgaaa atacactcca ttgacttctt gcttcaggga taacgcggta aacatttgcc 7981 atcgccaaaa ctctggcttc atattcacgc acatcaactt tacctgttag cgacaacgag 8041 aatttctttt gcacagaatt ttctacataa tctgcatggg caaaatctgt atactcaata 8101 aagtttttgc cgttccttgt gactcgacgc ggaccaggga taccgtccca cgcgaggtta 8161 ccaacctgac caatttcttc atccgtgaga gggctgactg tgggccatga gggattaggc 8221 gcaaatgtcc gtgtcgcatc gggagcgact gcgggccaaa atgtactgag tgctgcacac 8281 agtttggaat cttctggaaa aggactgcct aatccataag ctgctaaatg ttctactccg 8341 ttaactgtgt caaaactgat atcccaacca ggggcaaaca ccccagatgc tccatctgga 8401 agataggcat gtcgcattgt tggggagcca tccaacggca tttgttttac cgaaccactt 8461 aggggtaaac agacaattgc agttacggtt ttatcttctg gacggaaatt agcattcttt 8521 aactgaagat taggagcgaa gcgattgtct gatagggttt ctgggggaac tcgccagaga 8581 ttctgtctaa gtgcggtagg taccgactgc tgagtccatt ccaatagctc tcgctgatcg 8641 caattgggaa agaaatccgg tgctgttaca agcgaataag caggaacacg gcgcggaatt 8701 gctactgcta actgtggaca gactgcttca atcgatccat cacctgtgaa gtcaacgtag 8761 tgaagggcat 
tataatcacc cttttgtaca acttctgtga catttgggag attattgaga 8821 ttttcgattt gaccattgga gaatttgtga cgggcgtgaa catattctgg agcacgtctt 8881 gcccctgttg ttctatctgc agaaatgctt aagctggagg atagagtttc gctgtttggt 8941 ggaactttaa atgtgagtgg ttttccttga taagtcgcag cttcaactag attcttctga 9001 acgactggaa ctagaagtcc attaccaaat tccggctttg ttgtcaattc agcaatgcca 9061 tctgaaaaaa tgaatggcgg attactaatg tctggttcac cccatcctgt gtttttcaaa 9121 actaaatgaa cccgccgcag tttttcattg atatggcgag cattaagtgt cacctgtaaa 9181 tctaaattgc gaatacattc tttaccgttg aacagcttgt gcagaggaac ccagaaaagt 9241 ctgccatcat ctccattttg aaagcgcaat ggaccgaagg tagttgaatt ccctctgcgt 9301 tgaacagcaa tataggcaga atagcgagct ggtaaaacgc gaaaaagttg tgagtcttcc 9361 ttaacgaagg gtaaaaatcc acgaagtttt gtgtcataca gtgcttgggc tgttcctaca 9421 cgcgccacac cagtacgcga aaaacacaaa tcagcatgct tacgctgaac agtttcagat 9481 gcaggacgat attcataagc aaatacaaca atcgctaaaa gttgtccctg tgcccgaact 9541 cgcaaatctt ccaacgaagg gggctgcacg ccgtaaacat aattttcaat aacttcaatt 9601 tcccctagcg tcggatattc cctcaggttt tcactcacaa cattgggtga agcaaaagca 9661 tgaaacagta aactccgagc cggatttcct gattctattc ccctttttcc ttcaaaagca 9721 aaatcttcaa atcctttaat agttctgttg atatttggca gttcctttgt taattcttgc 9781 ttaagattag ctgctgtgat atcaactcca tgagcgagaa gtaagtcgcg ccagccaaga 9841 ggagctaggc gatcgcacac ttttttgaca tcttcaatta aggacataac tgaaataaac 9901 tccttgtttt cagatataag acgaggactt acgtcttgac aaaatttcaa ttactcagta 9961 ataggtacct gatttgtgga aggaaagtta accaaggaaa actttgcttt gacttattaa 10021 atgtaaacaa aatcaactta tgcactttcg gatagaaata ggaaaaatgc cagaaaaatt 10081 gcaaagataa ctgattctta ggggttttaa tggtgaagct acaactttga tagataaatc 10141 atattggtaa gacagtgaaa taccaattca actttattcg cgatcacctt tggtggctcc 10201 ctgtgtccgg aggacacgct acgcgaacgg agcatcgcct tgttcgccta gagggttttc 10261 taaagaaaaa ctctctccct aactcgaaaa tttggggcaa acgcacgcgc cgcgtcgctc 10321 ataagatacc gtaggatata tatctttgcg cgaggtgaga tatggcgatt ctacaattgg 10381 aagatgggac acgatacact gatttgcaag atatttcccg cgaactagca cctctaaata 10441 ttcaacttaa 
tcgctgggct gtcggagaga gtcaacagct gcgtgaactg ctatcacaag 10501 atagcctcaa tgaagatgaa aaacaacaag tcttaacatt tctggatgga tatttcgagc 10561 aactgcaaca aacagcagga tatcaaacac gcgacttaat tgtcctacat ccaggaatcc 10621 cgaaccttga taccatgctg acaaagtttg acaaaatcca tacccattca gaaaacgaag 10681 ttcgttacat cattgatgga gaagccattt ttggttttgt ccgacctgat gatagccaag 10741 tagaactcac agtacacccc gaagagtaca tcaatgtgcc tgcgggaact gaacattggt 10801 tttatttaac tccagcgcgg cgagttaaag cagtacgtta tttcactgga agtgagggtt 10861 ggactcccga atatacgggt agagaaattc atactcgtca ggttgtgact caagtataga 10921 tgaggaaaca atgaatagcc caaagctcat tgatcctcgc cttgaactca tcttagctgc 10981 ccgccacttc taccaacaag gatggatggt ggggactgcg ggaaatctct cagttcgtct 11041 acctgatgat agcttttgga ttacagctag tggtcggtct aaaggagaat tagaacttgg 11101 tgattttgta cgtatctatg cagatggtac ggtagagaaa ccttcgccgg atgtgaagcc 11161 ttcagctcaa actgtcattc accaggctct ctatactcta ttccctgaag ctagaagctg 11221 ctatcatgtt cactcagtag aagcaaattt ggtttctcgt tttgttcaag gagatacctt 11281 gcccctacca ccgttcgaga tgcttaaagg actgggaatc tggcaagaga atcctgattg 11341 cgctatgtct atctttgcca accattcaca ggtttcctgt attgcagatg agattaaaga 11401 gcgttttaca acaattcctc cacaactaag cgctttactt aaccgtgacc acggtgttac 11461 catttgggca ccttctgcaa aaactgcccg taattacatt gagttagtgg aatacatttt 11521 ccgctacatg gttgcagcca gaggtacggg tttttgccgt gccggcgagg agagggaata 11581 gaaaaagtag gaaaagaaaa caattgacat ttgaccaatg actcatcaaa ctattcgcgt 11641 tgttgcccgg attattgctt tacctgaaaa ggtggaaact gtgaaagctg tgctgctgga 11701 gattattgag ccaactcgac aggaagcagg ctctatcaag tatgaattgc tacaaaatca 11761 gtctgaccca acagacttca cttttgtaga agaatgggct tctgatgatg ctcttgatac 11821 acatttagcg acaccccatc ttaacgaagc aggtgctaaa ctggcgagtt tgcttgctgc 11881 agaacctgat atccgccgct atagtgtatt ggcttagccg actacaaggt gcggaatgtg 11941 ggtttgaaag ttggaaaaga taattcttga caagtgacat ctcctacact aagtgcaacc 12001 acttcagatt tatttttctt cttcttactc aattctcaac ctctactaga ggtttttaaa 12061 aatttaaaat atctaacgat tcttctattt ctaaatctaa aattttttct cgtgtctttt 
12121 tatgctcttc taatttacct tctttccgat aaaggtctgc tgctttttga aaatcatcaa 12181 caactccctg cttatctcct aagttataac gagcatgacc tcgatggtaa tatgcgtcag 12241 catcgttggg atttctcttt ataatctggg tgtaatcctc aattgctccc tgataatctt 12301 ctaagtgaca acgagcctga ccacgattat aatatgcatc aatgtcatga gcattaatct 12361 taagaacctg tgtgtaatct tcaattgctc cttgataatc tcctaaatca aaacgagaat 12421 taccacgatt tttgtaactg ttaatatcat taggttttag ttcaatggac tggctaaaaa 12481 gctcaatagc tttctttgaa tttgccaaat ttgaactatt tggattaatc ttgatggctt 12541 gattataatc ttcaattgct gcctgataat cacctagttc atagcgaata tcagcacgat 12601 ttctgtaagc tactgcatca ttaggattca ttctaattgc ttgagtataa tccgcaattg 12661 ctccttcgta atctcctagg tgatatcgag ctaaggcaat tttcgtgtaa actttgccgt 12721 aataaggatt aatttgtatt gcttgagcgt aatcggcaat tgctccctcg taatctctta 12781 gatgataacg agccaaacca cgtttgtaat aagtctcagc atctttagag ttcagtttta 12841 aagcttgatt atagttatta attgcagcgt tgtactgttc tttctcaaag cattcatcac 12901 cgagtttgac agaaagcaga tttaaatctt cattatggaa acggggataa tcattttgtt 12961 caggagaaaa gctggttttg agagactgtt gcttgtgtag ctgataactg ggttgctgtt 13021 gttcaggaga aaagtttgtt ttagtaaagt gttgcttgtg ttgctgatag ctggcttcct 13081 gcttagggtt agatttttcc aattgagatt tttggttatg agagttgact ggtgttaagg 13141 attctaactg ttgtaaatag caccgtaaac cacttccata cagtaattgt tctatcccaa 13201 atgaaatctt tccggtcttc agcttaatca tttgagtagg aagaaaacca gcgaagaaaa 13261 ggtgatattc aggttgtgcc tcattgactt cctcttgaat gaatatgcag acaacaacgg 13321 catttttttg gacttcttct gaattaattg accatctgac tttattaaga ctgccatgac 13381 gagatttaac tgcaatacca actgaggggt caaaagtcag cgtaaaatca actttgccat 13441 cgccaatgcc gaatcgtttt tcataatcta cttcagtcac gaaagcagcc aaacgttctt 13501 tgacaacttc ttcacctaat tttcctttca aattactgat aaaaacatca cggactggtg 13561 aagtgcgttt atatttctca gccatctgcc agcaaaattc tcgtagtgct tttaacctct 13621 cgcctgagat gactgttaac tcgctatatt gcccttctgt ttcacaatgc agcagacaac 13681 cagatgttaa cctcttgatg aaatcagatt gtagcgatcg cagtaaggta atccagtcca 13741 tttataattc tgcgaatctt tttgtttatt gtattaaaat 
aacaaattag cgaaaagttg 13801 tctaagttgt atttaatgca acataaaccc atagaacccc attcccaacc ccctccccgc 13861 aagcgaagag ggggctaata atgtggagag tagtgttgtc tggtaagtat taatatcaaa 13921 tgctcccgtg gaaacgtcaa gtaatgtcta gctttgtcta gattttctgt agcagtagga 13981 tattacctca catcgtaacc aaacaaattc ggatccactt cccctaactg caaatcacct 14041 aaaccatact ccgcccaacg cctctcaacc atcgctgcga cttctggatc tgattccaaa 14101 ggtgcacccc attcatgttc agtttctggg ggaatctttg tggttgcatc aattcccatt 14161 cttccgccca aaccaatttt ctcgctggca aaatctaaac tgtcaaacgg tgtgtttggc 14221 aaaataaaca catctcgggt ggggtcaact ttagaactta tcgcccacac aacttgacgc 14281 ggatctctga tattaatatc tttgtctacc acaatcacaa acttggtgta ggtgaactgt 14341 ggcaaagcac tccaaaaagc aagagctgcc cgtcgcgctt gtcctggata tgatttatct 14401 atagatataa ttgccgcttt gtaactcaaa gcttccattg gtaagaagaa atcgacaatt 14461 tctgaaactt gttgtcgcaa aatgggggtg tagatgcggt tgagtgcgat cgccatcatc 14521 gcttcttctt tcggtggacg cccgctaaat gtggttaaat aaattgggtc tttgcggtgc 14581 gtcatgcaat ggaagcgaat caggggcgaa tcttccacgc cgccataata acccatgtgg 14641 tcgccaaagg gaccatctgg caaaatctcc cctggtgtaa ttgtcccttc tagaacaaat 14701 tctgaatctg ctgggacttc caaatctact gttttacatt ttgccaactg tacgcctgaa 14761 ccgccgtaaa gtccagcaaa caaccattct gataagtcta caggtatcgg tgtggcagct 14821 gccatgataa tgaggggatc tacaccgagg gcgatcgcca cttctaattt cttcccacgt 14881 tctgctgctt tgcgtaaatg cctcgcccca ccccgcaccg acaaccaatg aacagtcatc 14941 gtattttttg attgtagttg caggcgatac acacccacat tcggtgtccc tgtttcacag 15001 tccctggtaa tcacaagtcc cagagtaata attttgcccg catcaccagg ataagggcgt 15061 atcaaaggta acttgttaaa atctaactcg tcaccttgaa tgaccacctg ctgacaagca 15121 gggaaaaaat cccgtcctgg tttcgccttt aagacatcga acagcacctt gccaaactct 15181 accgcctgag aaatcttctt gggtggtttc ggttgttgca gcattgccaa ctttttccct 15241 aattcctcca actcctgtgg atgctgcata ttcatcgccc agcatatccg ttccactgtc 15301 cccaacaagt ttaccgccac tgggaaggaa gcgcctttga cgttttcaaa tagaagtcct 15361 ggaccacctt tttgcagcat ccggttagaa atttcagcaa tttctaaatc tgggtcaact 15421 atggctgaaa ttcgccgtaa 
ttgtcctctt tcttccagaa ttttgataaa tccccgtaaa 15481 tctcttgcca tcgttccaaa aaattaatat ttaagaagtg tgaggcgctc ttctaatatt 15541 atgagttatg tgtcaaaagg aaaattgtgc aagtagtcat ccccaccgat attttcccat 15601 cgccccttga atggggttag agtttccatt gattcagttt tggactctgg acgccccgca 15661 tgacttcacg acctcatctc ctttcaagga caataacttg ccttaactac ctattaacaa 15721 aaataatgaa ttcggatatt ctaaccaaga tatgcattaa tttttaccaa gaaacgttta 15781 cctgatgaaa atatgactca agcatacgac atcttcctag caggcggtct tgttatgtgg 15841 gctttgttgg gtttatttac agtaacttca gtatgcatat tggaacgtag ttggttttgg 15901 ttgaggctaa tcgttcaaga aaaaaaagtc gtgcgtgagg ttttgacaac agcaaaagta 15961 gatttagaaa aagcagaaga aattgcagaa aatgctcaag ttttagccat tggtcgcttt 16021 ttagtggaac cactaagact aaaaaagccc tctcctgaga catttcattt ggcaataaaa 16081 gccgcttgtg atagagaatt cgtggaaatg cgcaagggcg gtatgctcct gaaaagtgtg 16141 gtagcgatcg cccctttgct aggtatatta ggtacagctg agggtttagt caccactttc 16201 actaacctta aaactaattc tttcaatatt actgatttat ctacagtgac gcttggtctt 16261 actcaagcac tatttagctc tacaaccgca atagctgtag cagtttttgc ttttatcttt 16321 ttccgacttt ttctatgttt gcaagcgcgg caaatgggct ttttttctca agttggcaac 16381 gatctagagc ttatttactt gcaatactgg tatgaacctt ctactcaaac aaacactcaa 16441 actaataatt aattgatatt agttattagt ttgaattttt tttgatcaca aggtcaaaat 16501 attttctaaa tcatatttaa aaaatatata tttcaagaga tattttatat ctaaaaaaat 16561 atattttaat attgatgaag atagtcatca tatttgattt gtaaaaatca gcatgttaag 16621 attctcgact tagccaaagt tgtcgggaat cttatcttgt tgttcacgaa taatttagaa 16681 ctactataga attcctattt aatttttgct cagctaggta cagtttatgt tcaaaaataa 16741 aataagaatt ctatagatta atcatctgtg tctctaatcc cttttaaaca gatgttaaga 16801 aacatttaat tttttgttaa aataaactta acctccgatt aactctgaac atagtaggtt 16861 atttgcggaa gagttaatca acaaaaattt tcttattaag aacgaggcaa gatgacattt 16921 tcaactaccg ttttgaatcg tgtagtgagt gcttcagcta tcatggctgc tgttactttg 16981 ggaccgattt ttacagcgat cgcccaagca gaaaccatca atggcgcggg agcaactttt 17041 ccagctccgc tctacgaacg ttacgctcgt gaagtcaaaa agaagtatcc agacttagga 17101 
gttaactatc aagccgttgg tagcggcgct ggtgtgaatc aagttattgc cggtactgtt 17161 gactttggtg ctagcgattc tgcgatgacc gatgcggaaa tagctaaagt gaagaatggt 17221 gtcatcctcg taccgactgc aggcggtgct gttgcaattg ctcacaatct tccaatagat 17281 aatctcaaat tgtcccgcaa aacattgcca gcaatttttt ctggtcaaat tactaaatgg 17341 gacgaccctc aaatcaaagc tgataatccc ggtgtcaatc taccaagtca gccgattaaa 17401 gttgttgttc gcgctgatag tagtggtacc tctttcattt ttacgaacca tttgaattcc 17461 ataagtcctt attttaaggg acgaattggt gtcagcaaat ctccaaattg gactattcca 17521 aatgtcacta aggcaaaagg caatccagga gtagcctctt cagtaactcg cactccaggt 17581 gctattggtt atgttgagta tgaatatgca ttgaaaaata agctcaaagt agcacaggta 17641 cagaacaagc aaggacagtt tgtagctcct tctttacaga ctgcgaacga agctttatca 17701 actgtaaatt ttccagacaa tttccgggtt tttgttgatg atccagcaca aggttatcct 17761 atcgttggtc taacttggct gttagtttat aagagttatc ccaatgctgc gaaaggaact 17821 gctgtgaaga actttttgaa ttgggtgcta acagatggtc aacaaattaa tgatgacctg 17881 aactactctc gcattccagc tcctgtggct caaaaggtca tacaaacggt caacagcatc 17941 aaataatctg tcatctcaca ctttttgctc taccagggtg ggctatttgc ccaccattag 18001 gtcatgaaaa taatttagtt tcccgattag taatggcaaa ttcttttgaa ccagctaata 18061 gcaatcaacc tagtttgatt gatgaaagta tagacataac agccagcagt ggaaaaaatt 18121 tttggataga ccaaggattt acatggctag tttatgtgtt tgctgctcta acggtgatag 18181 tcctattttg gatgagtttg ataattttcc aaaaagctct accagctatt gaaaagtttg 18241 gactggggtt cttgtggaat caacaatggg atacaggtaa tttagttttt ggtgccttgc 18301 catatattta cggcactttg gtcagtagcg cgatcgccat gttgtttgcc gtaccagtag 18361 ggatagcggt agctttagtc acgagtgaaa attttattcc tccttcagca cgcaccacca 18421 tagcttttat cgttgagctg atcgctgcta ttcctagcgt cattatcggt ttgtggggtg 18481 tttttgtttt tattccagct cttgtacctt tacaaacttg gctatccagc ttctttgggt 18541 ggataccact atttcataca ccaggtcctg ctgggtttaa tatattgact gctggcataa 18601 ttcttgcgat tatgattttg cctacgatgg cagcaattag tcgtgaagtc ttgttagttg 18661 ttcccaaaga gttacgaagc ggctcaatgg ctcttggttc tacccgttgg gaaaccattt 18721 ttcgagtcat actacctgct ggattttctg gaattgtagg tgcagcaatg 
cttgctcttg 18781 ggcgagcttt gggagaaaca atggctgtca ctatggtgat tggtaactcg gctcagatta 18841 gtccgtcact actcgatcca gcttactcaa ttccttctgt gttagccaat gaatttgctg 18901 aagctcaaga tccattgcac attggtgctt taacatattt gggcttgatt ctgtttgctg 18961 tgactttagt tgtaaatatt cttgctttgg tgattgtaca gtttgttggt ggcaagggga 19021 aatagaaagc taatttttaa gacactaggg acatatatta atagtggaaa acactgataa 19081 aatgtcctat gtagtctatt tccaaaggta gacgccaaag ttaccaaatc atttgaagat 19141 gtaaatgcgt aagggatatg accagtcatc aagagtttaa tacagataaa tctgtggcag 19201 cagaaatatg tagtccttta tcacctgtaa ggatgatatt tgactatgga atgactgtag 19261 tagcattttt tctctcagct ttagccctga ttcctttgct gtcattattg tgggaaatta 19321 tcgggcgggg tataacaagc atcaaaccat ctatgtttgt gacatcagtt ataaacgatg 19381 gatttgcgaa tgcgattgtt ggtactctgg aaatggtaat cattgctgca ctgtttagta 19441 ttcccactgg agtcatgaca ggcatatttc tctcagaaat tggtaaagga aatcgcattg 19501 gtcgtgctgt tcgctttgtt gcatctatcc ttactggagt cccttcaata gttgtcggtg 19561 tttttgctta tgctgttatt gtattgataa ctaagcaatt tagtgcgatc gcaggtggtt 19621 ttgcattagc tgttctcatg ctccccgtta tcgtattgac aacagaagaa gctctaaaac 19681 ttattcccac ttccgtacgt cttggatcag ccgctttagg agggactcgc tttcagacga 19741 cttttcgggt tgttgtcgcc gcagcaattc ctgcaattac tacaggcgtt ttgttagccg 19801 tagcccgtgc tgctggtgaa accgcaccct taatttttac tgctttgttc agtttagatt 19861 ggtcaagcga tttattcggt ccaacagctt cgttatcggt gttaattttc aacttataca 19921 acgatccctc gccagaaaaa acagcattag tctggactgc ttctcttgtt ttagtcggga 19981 taattatgtt aatcagtatt ctctctcgtg ttttcacaag gaagagaaac gtttaatcac 20041 aactgaagtt atgttgtgaa ttattgttgt agtacataac acaatatgag caagttaaac 20101 ccagccatca aagtcaagaa cctcagcttc tactacggca cttccaaagc actggaaggc 20161 gtaacaatgg atgtctacga gaaccaagtc accgcaatta ttggtcctag tggttgtggt 20221 aaatctacat tcattaaatg cctaaatcgc attagtgaac tagaaggacc agtaaaagta 20281 gatggcagcg tagaatttta tggtcaaaat atctacagtt ctcgtgttaa cttaaatcgg 20341 ctacgccgcc aaataggtat ggtgttccaa aagccaaacc tttttcccat aagcatttac 20401 gataatatcg tttacggtat aagaattgct 
ggatggcgtc ccagagcaga attagatgaa 20461 attgtcgaat atgctttgcg aggtgctgct atttgggatg aagtcaaaaa taaactgaat 20521 aaatcagctt tagggctttc tggtggtcaa caacagcgtt tatgtattgc ccgtgcttta 20581 gctgttaaac caaaaattct tttaatggat gagccttgtt cggctcttga ccccattgca 20641 acaatgaaaa ttgaagaact catccacagt ttacgcgagg aactgacaat tgcaattgtg 20701 acccataata tgcagcaagc cgctcgcgtt tctgatttta ccgctttctt cagcactgat 20761 gaaagtcgaa ttggtcaaat ggttgaattt ggacaaacaa atcaaatctt cagcaacgcc 20821 ctcgattctc gcacccgcga ttacacttct ggacgtttcg gataaagagt ttaaccaaat 20881 tcaacaaatt gattagtttg tgtcaaataa gctagaataa gcgtttattt acgctcatct 20941 acttatatct tcaagaaatt taacaagctt atcactttaa tattattttt gttcccagat 21001 gataagctag ctcctcactt ttaaatacct gtgatattga gattcccaca actttaagtc 21061 acgtttcaac gtgaaactgt gaaattattg ctacctctgt aaagttttta tttgagtgac 21121 acttctacac tttcttgccg ggtaggtgta gacttctcag tcattcagac gaatgagtat 21181 aacttctata tttcggtaaa aagaatattt ttaatcaaag catttcttat tcttaaccta 21241 tatgtgagaa cttcgatgga gatgtacaga atgtaaattt tttgctttgt gtacatcttg 21301 ctcttgacag atgtctgtcg cttgtatccc ataagcattt atggaaaaaa gagaggaaat 21361 ctggattttt gtgctgtaaa tatagcggtt ctcatttgaa tcacatacac catcacgaag 21421 aatgtgaaag tgcatgcagt ttcactaaac atatgtattg catgcaacac cgtaagcgct 21481 ataaataccc aattccaggt acatagtata tctaccgctt aagaaatgta agttgcgaca 21541 aactagggaa aaactatggc aaacgtaaca ctcaaaggat ttgcttcact accagcagat 21601 acatttgctg agggtcgttc ctctggaaac tttattacag gcaatactaa tggcagaaca 21661 gtaccctttc taacacaacc tgttcaaggc tttagtgctg tgcagtttgc tgacgcaaac 21721 actttctggt tcctcccgga caatggtttt ggggcaaaga acaatagtgc tgattttcag 21781 ttgcgaatct accgccttaa ccccagtttt caaggtacag aagggggtaa tggtcgtgtt 21841 gaggttctca actttattca actatctgac ccaaataacc aagttccatt caaaatcgtt 21901 aacgaaggta ccacagaccg tctgctgaca ggtgcagatt ttgacacaga atctttcgtc 21961 ttaggttctg ataaaagtat ctggattggg gatgaatttg gaccatacct actacacgtt 22021 gataagacag gtcaattatt agaagctccc atttctaccc ccaattacta caaacttaac 22081 accctaaatg 
gtcaacctcc cattgtgatt ggtcaccgag gagtcgctgg tgagcgtcca 22141 gagcatactt tagagtcata taaagtagcg attgagcgag gggctgactt tattgagcca 22201 gacctagttt cgaccaagga tggagtgtta attgctcgtc atgaagtaaa tattacagaa 22261 actactgacg tgggaactca cccagagttt gccgatcgct acactactaa gaccattgac 22321 ggtgtgaccg aaaaagggtg gtttgccgat gacttcaccc ttgaagaaat caagactttg 22381 cgagcaaagg aacgtctttc cttccgggat caatcctata acggtctatt tgaaattccc 22441 acctttcaag aaatcattga tttggtcaag caggtagaag ccgaaacagg tcaaaaaatc 22501 ggtatctacc cagaaaccaa gcaccccact tatcacgatt ctgtcggctt gtccttagaa 22561 gaaccactgg ttgaaactct gaaaaaaaat gatttcactg acccaactcg ggtatttatt 22621 cagtcttttg aggtcagcaa cctcaaagaa ctcaaccaga agatagatgt accgctgatc 22681 caattattgg atgcagaaga tgttagcttg gacggtacac tgattgagaa gcaaccctac 22741 gattttgtcg tcagtggtga tcctcgcact tatggtgact tacgcactgc cgagggtttg 22801 aaagaaattg ccacatacgc ggatgggatt ggcccttgga agcggatgat tgtctcggtt 22861 aagggtgttg atgctgatgg tgatggcaaa gcagatgatg tcaatggaga tggtgcagtc 22921 aacgacgcag acaaaaccac attgcctcca acaacgctag taaaagatgc tcatgctgca 22981 ggtttgctgg tgcatccata caccttccgt aatgaaagtc aatatctggc agcagattat 23041 aacaaaaacc cagaactgga gttccagcag tttatcaagt taggggtcga tggctacttc 23101 actgatttcg ctggtaccgg agacaaagtc cgcgaccaaa ttactggtga gtttgtgcga 23161 tcgcccgata accccgatgt tctagcaggt cttgcctact ctaaccttgc cagttccaag 23221 ggctttgaag gcttagcaat taatccagat aaaactaaac tgtatcctct gctagaagga 23281 tcagttctcg gtgatcccaa tgatgcctta cgaatttata aatttgatgt tgcaagcaaa 23341 cagtatgaag gtctagttgg gtactatcgt cttgaaaact ccagttatgc gattggtgac 23401 ttcactgttg tcaacgataa tgaatactta gtgattgagc gtgataacgg acaagcagac 23461 accgctaaat tcaagaaaat tttcaaggta gacttttcag aaaaagatgc caatggatac 23521 gtcgcaaaag aagaagtcgc ggatctactc aacattcaag atccaaatga cttaaatcaa 23581 gatggcagca caaaattcac cttccccttc caaaccatcg aaaatgtgct ggtgattgat 23641 aaaaacacta tcttagttgc caatgacaac aactatcctt tttctgtggg acgaccacct 23701 gcgatcgaca atgacgaaat catcctccta gggttagaaa aacctcttag ccttgaccca 
23761 cgtgtcggtc ttgccggact tgacaacaac acgtcaatta gccaaggtca tgacttgcta 23821 ggaactcaga attgggcaca gctgactaat gtgaatatta tttagtcaaa atctgtaaca 23881 taagcttcac gacttccact ttcgtttgcc ctcaccttaa ctgtgggggc aattttttat 23941 aaactgtttt tctaccacga agtaaaacta aagcagataa taaagttact gtcaatagtt 24001 agatacttaa actggaaaga ctattattaa gatatcttga taaaatctga agagt // LOCUS NODE_1294_length_23948_cov_4.88268523948 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 23948) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 23948) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..23948
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            <1..411
                     /locus_tag="DP116_11580"
     CDS             <1..411
                     /locus_tag="DP116_11580"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_002798711.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="transcriptional regulator, XRE family protein"
                     /protein_id="PRJNA477356:DP116_11580"
                     /translation="AVGRDLSNANLSAANLSGTNLSGANLSGANLIAANLIVANLSGA
                     NLSGANLSYANLIVANLSGANLSGAYLSGANLSGTNLSGANLIGVIVVNAFFGGTKGF
                     TEDNKRDLKQRLAIFGDEQRENVFSDRPRVPSPK"
     gene            complement(819..1148)
                     /locus_tag="DP116_11585"
     CDS             complement(819..1148)
                     /locus_tag="DP116_11585"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11585"
                     /translation="MTSDQISLLLQLQLRQTLALEKISDALERLKEGKFAIPNKNVDP
                     VSKAYVPPYEIQIGARIKVINEKLFGESKEGIVTDVRRGGTSWLVQVNVEGMTRVVRP
                     WDVEVQP"
     gene            complement(1442..2515)
                     /locus_tag="DP116_11590"
     CDS             complement(1442..2515)
                     /locus_tag="DP116_11590"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012412416.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="phosphate ABC transporter substrate-binding
                     protein"
                     /protein_id="PRJNA477356:DP116_11590"
                     /translation="MSQKNETLSLFLAIVITLGLTFAGLWFLIERWAQISRRNEVNST
                     LVINPSVNNNPVNNSPIPNPTTTSCSVPNLPAGTFNYGGSTTWAPIRKDVDPVLQSIC
                     PQFTLRYVQPAVEKPGSGTGIRMLIDNQLAFSQSSRSIQGEENQKAVQKGFSLKEIPV
                     AIDGIAIAVNPNLNIPGLTVSQIKDIYTGKITNWQEVGGSDLPIIPYTRSKEAGGTVE
                     FFIENILNKENFGNNVSYVNTTTEALRKLASSPGGIYYASAPEVVPQCTIKSLPVGRI
                     SGQFVPPYQEPFIPLSECPGKKNQLNAQAFRTGDYPITRNLFVILKQNNQTEQQAGDA
                     YANWLLTPQGQELIEKAGFVRIK"
     gene            complement(2542..3969)
                     /locus_tag="DP116_11595"
     CDS             complement(2542..3969)
                     /locus_tag="DP116_11595"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016860787.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="serine/threonine protein kinase"
                     /protein_id="PRJNA477356:DP116_11595"
                     /translation="MEIYCTRPRCPRPQNYFSDLDDHGTMRTVQQKYCTACGMPLMLD
                     GRYLPTKLLGRGGFGAAFLARDRRTPGMRQCVVKQFQPAGDLSPTQLQLAQDLFEREA
                     VVLEQIGRQHEQIPDLYAFFPVIIPSLRAGEEDQFFYLVQEYIDGQNLEEELAQKGKY
                     SEQEVLEILQEILKVLKFVHEKGIIHRDIKPSNIMRHRNGKLFLLDFGAVKQVTNAAT
                     SVSGVSTGIYSLGFAPPEQMSGGQVFPSTDLYALAVTTLMLLTGETEITQLFDAYSNQ
                     WKWRGHVSVSPHLADILDKMLISAASQRYQSAQEVLNALNAPQKHIPSTYINPPQPSV
                     TPQPQSQQTQTQPQVQKQSLSQTPPVKQQFSLVKLLVRAGFSGFEGALITIGLSTILP
                     SPIVTLVAACVILSGLIFAQWKRWIGGSDLLIFPAITLGIVLFLLRGNLTVGEIVILS
                     VAAALMAIAVTALFRLIYKLLSLLL"
     gene            complement(4368..5381)
                     /locus_tag="DP116_11600"
     CDS             complement(4368..5381)
                     /locus_tag="DP116_11600"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015137175.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="LysR family transcriptional regulator"
                     /protein_id="PRJNA477356:DP116_11600"
                     /translation="MSDLPFTLDQLRILKAIAAEGSFKRAADSLYVSQPAVSLQVQNL
                     ERQLDVPLFDRGGRRAQLTEAGHLLLNYGEKILSLCQETCRAIEDLQNLQGGTLIIGA
                     SQTTGTYLLPRMIGMFRQKYPDVAVQLHVHSTRRTTWSVANGQVDLAIIGGEIPGELA
                     ESLEILPYAEDELSLIIPAFHPFAKLETIQREDLYKLQFIALDSQSTIRKVIDQVLSR
                     CDIDTRRFKIEMELNSIEAIKNAVQSGLGAAFVSTSAIAKELQMGVLHCASIEGVIVK
                     RTLRLVFNPNRYRSKAAEAFSREILPQFAAPGWNLEMLKTSRKPIMVSTVDLGAHNSD
                     DDD"
     gene            complement(5811..6533)
                     /locus_tag="DP116_11605"
     CDS             complement(5811..6533)
                     /locus_tag="DP116_11605"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11605"
                     /translation="MKKLPSELKLAIIGNLKLIRTSEKENLLQGVSLTEQSCLNDTAW
                     MAFENGKVSSDIRLQKYLAVIKSEKEVLHKAVSRNDHPLTTAKHKGTPSIKCRLMALS
                     RTGKQFTADINNHITKQVVIGKNFIKLEKSTRSLRIFPSLVSKQIYNDRVQRTDNRTQ
                     SCHLSPVFYDHPGGKAKVSQSKWSYAELDGFIHYKVMNNSSAIRVDANCSSQSYPCRG
                     HTNKHNCPEPWINDSVWWKSWI"
     gene            6695..7411
                     /locus_tag="DP116_11610"
     CDS             6695..7411
                     /locus_tag="DP116_11610"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011318552.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11610"
                     /translation="MMQPTWLTLSHFVMLGLLFGFAIVHSGGAALRPWAEKHIGCRLY
                     RIFFAFSSLPLAVILIIYFFNHRYDGLQLWYVQNLPGVQQFVWVLSAISFLFLYPATF
                     NLLEIAAIQKPQVHLFETGIIRITRHPQMVGQIIWCVAHTLWLGTSFTLVTSIGLVLH
                     HLFGVWHGDRRLSDRYGEAYTQVQQRTSIIPFAAVLDGRQSILWQEFLRPAYLGVAIF
                     VLLLWWSHPLLLEATSKIGL"
     gene            complement(7711..7968)
                     /locus_tag="DP116_11615"
     CDS             complement(7711..7968)
                     /locus_tag="DP116_11615"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11615"
                     /translation="MHCQDFVNIVHCKTDKKVSQTFKTVTLSNFLVLIVVYVKVFFAN
                     LLLTLPMETTYLQKPIITVGSFLLLQQKETLPYLPQALFWN"
     gene            8024..8410
                     /locus_tag="DP116_11620"
     CDS             8024..8410
                     /locus_tag="DP116_11620"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011318551.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="thioredoxin"
                     /protein_id="PRJNA477356:DP116_11620"
                     /translation="MVLSVSERTFSQEVLESSIPVLVNFEAPWCGLCRIIHPLLLQFK
                     VQCGDQIKLVGVNADDNFKLATKYKLKSLPTLLLIENGVVRQRLESFRGREDLFLALE
                     EIKVCYTNRPKNYHRPKTADLGCRSA"
     gene            8720..10819
                     /locus_tag="DP116_11625"
     CDS             8720..10819
                     /locus_tag="DP116_11625"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016868126.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="NADH-quinone oxidoreductase subunit L"
                     /protein_id="PRJNA477356:DP116_11625"
                     /translation="MEIIYQYAWLIPVLPLLGAMLVGLGLISLNQVTNSLRQLNAAFI
                     ISLMGLAMALSFALLWSQIQGHAPYTRTLEWAAAGNFHLNMGYTVDHLTALMLVVVTT
                     VAVLVMVYTDGYMAHDPGYVRFYAYLSLFGSSMLGLVVSPNLVQVYIFWELVGMCSYL
                     LVGFWYDRKAAADACQKAFVTNRVGDFGLLLGILGLFWATGSFDFDVMGDRLGQLVQT
                     GAVSNLLAILLAILVFLGPVAKSAQFPLHVWLPDAMEGPTPISALIHAATMVAAGVFL
                     IARMYPVFEHVPAAMNVIAFTGALTAFLGATIAITQNDIKKGLAYSTISQLGYMVMAM
                     GVGAYSAGLFHLMTHAYFKAMLFLGSGSVIHGMEGVVGHDPNLAQDMRLMGGLRKYMP
                     VTAITFFIGCVAISGIPPFAGFWSKDEILGKVFAVNPALWAVGWLTAGITAFYMFRMY
                     FVTFEGKFRGNQTNLWQKLKSPAGMGIVTGFDAAPAFGPGAMTKGELEATEEHHHDDH
                     SHGHGHSEYPHESPWTMTLPLAILAVPSILIGLLGTPFANYFEEFIFPPSETLAEVVE
                     KAAEFNPSEFYIMAGASVGISLIGITLASLTYLLGKINPVAIAAKIQPLYEFSVNKWY
                     FDDIYHRVFVIGLRRLARQVMEVDFRVVDGAVNLTGFFTLITGEGLKYLENGRAQFYA
                     LIIFGAVLGLVIVFGVT"
     gene            11033..12604
                     /locus_tag="DP116_11630"
     CDS             11033..12604
                     /locus_tag="DP116_11630"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017315061.1"
                     /note="shuttles electrons from NAD(P)H, via FMN and
                     iron-sulfur (Fe-S) centers, to quinones in the respiratory
                     chain; subunit D, with NdhB and NdhF are core membrane
                     components; Derived by automated computational analysis
                     using gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="NAD(P)H-quinone oxidoreductase subunit 4"
                     /protein_id="PRJNA477356:DP116_11630"
                     /translation="MNTANFPWLTTIILFPIAASLLIPFIPDKDGKTVRWYGLIIGLI
                     DFALIVIAFYTGYDFSNPDLQLFESYSWLPQIGLNWSVGADGLSMPLIILTGFITTLA
                     MLAAWPVTLKPKLFYFLILAMYGGQIAVFAVQDMLLFFLVWELELVPIYILLSIWGGK
                     KRQYAATKFILYTAGGSLFILLAGLTMAFFGDTITFDMQTLAAKGFSLNLQLWLYAAF
                     LIAYAVKLPIFPLHTWLPDAHGEATAPVHMLLAGVLLKMGGYALIRMNAQMLPDAHAY
                     FAPVLVILGIVNIIYAALTSFAQRNLKRKIAYSSISHMGFVAIGIASFTDLGLNGAML
                     QMVSHGLIGASLFFLVGATYDRTHTLMLDEMGGVGKKMSKMFAMWTTCSMASLALPGM
                     SGFVAELMVFVGFATSDAYNPTFKVIVVLLMAVGVILTPIYLLSMLREIFYGEENKEL
                     VSHQKLIDAEPREIFIIACLLIPIIGIGLYPKMLTQVYDATTVQLTARLRNSLPALAD
                     QKAPALSFNAPEIGN"
     gene            complement(12856..13281)
                     /locus_tag="DP116_11635"
     CDS             complement(12856..13281)
                     /locus_tag="DP116_11635"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017315062.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="HNH endonuclease"
                     /protein_id="PRJNA477356:DP116_11635"
                     /translation="MPSKRVTEAIRRIVAARAQNYCEYCRCSGEFATESFTVEHIKPR
                     QAGGETTLENLAWSCFGCNSHKHTKIYGTDPTTGQQESLFNPRQQSWNEHFSWNSDFT
                     EVIGNTPCGRATVEALRLNRPGVVNLRRLLFMAELHPPK"
     gene            complement(13271..13660)
                     /locus_tag="DP116_11640"
     CDS             complement(13271..13660)
                     /locus_tag="DP116_11640"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015128937.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11640"
                     /translation="MTAPTEIRQRAIALVEKLPEETLFKAVDLLEALCIENGVVTLGE
                     ETLLQRIQRRLPTDDQTRLTYLRQRNETGEITEAEHQELLGYVDRVERQDAERAEALI
                     QLARLRNVELKTLLNEFLPVYTKPNAL"
     gene            complement(13841..14197)
                     /locus_tag="DP116_11645"
     CDS             complement(13841..14197)
                     /locus_tag="DP116_11645"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012408593.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="4,5-dioxygenase"
                     /protein_id="PRJNA477356:DP116_11645"
                     /translation="MQEDTIEITGFHAHVYFDTASRDVAERVREGLGARFEVRLGRWH
                     DKPIGPHPQAMYQVAFLPNQFAKVVPWLMLNREGLDILIHPETGDDVKDHTEHALWLG
                     EKLELNISFLQRISTN"
     gene            complement(14287..14619)
                     /locus_tag="DP116_11650"
     CDS             complement(14287..14619)
                     /locus_tag="DP116_11650"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015209884.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="MazF family transcriptional regulator"
                     /protein_id="PRJNA477356:DP116_11650"
                     /translation="MPSYSKNDIILVRYPFSDLSSSKVRPAVVINAPHISQDIIITPL
                     TSKTGSLLEGEFVLYDWAAAGLNVVTAVKRGLYTVHESLIVTTIGKLANSDVEQLEQS
                     LRSWLGLL"
     gene            complement(14597..14800)
                     /locus_tag="DP116_11655"
     CDS             complement(14597..14800)
                     /locus_tag="DP116_11655"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017653039.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11655"
                     /translation="MLKTLWATVRQGKIELLESAEIPEGTRVLVTLLPDDEAEFWLQA
                     SQTSLDEVWDNAEDDVYAQLLQK"
     gene            complement(14937..15200)
                     /locus_tag="DP116_11660"
     CDS             complement(14937..15200)
                     /locus_tag="DP116_11660"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012267867.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11660"
                     /translation="MKSAVLPSFWVEYRRLSDDVRQSARKAYRLWAQNPFHPSLHFKC
                     INSQEDIWSVRVTRGYRALGVLEGDTVTWFWIGSHDDYERFFS"
     gene            complement(15197..15427)
                     /locus_tag="DP116_11665"
     CDS             complement(15197..15427)
                     /locus_tag="DP116_11665"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016873761.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_11665"
                     /translation="MSSPAITTVVKMMESLPVDVQEQIAHHLREYINELQDEIQWSKS
                     FERTQQKLVAAAQRAKQEIAEGKATTLDYDQL"
     gene            complement(15629..16708)
                     /locus_tag="DP116_11670"
     CDS             complement(15629..16708)
                     /locus_tag="DP116_11670"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017655680.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="tRNA dihydrouridine synthase DusB"
                     /protein_id="PRJNA477356:DP116_11670"
                     /translation="MVTLSPDLKAKLSTPLKIGSFEVKSRVLQSPLSGVTDMVFRRLV
                     RRYAPESMMYTEMVNATGLHYVKQLPKIMEVDRNERPISIQLFDCRPDFLAEAAVKAV
                     EEGADTVDLNMGCPVNKITKNGGGSSLLRQPEVAEAIVREVVKAVDVPVTVKTRIGWS
                     DKEITILDFAKRMQDAGAQMITVHGRTRAQGYNGNARWEWIARVKEILSIPVIGNGDI
                     FSVEAAVKCLEQTGADGVMCSRGTLGYPFLAGEIDYFLKTGEELPAPTPIQRLECARE
                     HLQALYEYKGDRGVRQARKHMTWYSKGFAGAADLRGQLSLIENVQQGVDLIDRAIEQL
                     ANGYELIEENQLAIRLSGVCEAIAQ"
     gene            17199..19112
                     /locus_tag="DP116_11675"
     CDS             17199..19112
                     /locus_tag="DP116_11675"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015111643.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="molecular chaperone DnaK"
                     /protein_id="PRJNA477356:DP116_11675"
                     /translation="MAKVVGIDLGTTNSCVAVMEGGKPTVIANAEGFRTTPSVVAFAK
                     NGDRLVGQIAKRQGVMNPENTFYSVKRFIGRKYDEITHEATEVSYKVQRDSNGNVKLE
                     CPQVGKPFAPEEISAQVLRKLVEDASKYIGETVTQAVITVPAYFNDSQRQATKDAGKI
                     AGIEVLRIINEPTAASLAYGFDKKSNETILVFDLGGGTFDVSILEVGDGVFEVLATSG
                     DTHLGGDDFDKKIVDFLAEQFKKDEGIDLRRDKQALQRLTEAGEKAKIELSSVTQAEI
                     NLPFVTATQDGPKHLDTTLTRAKFEELCSDLIDRCRIPVENALRDAKLSKSDIDEVVL
                     VGGSTRIPAVQQVVKQVLGKDPNQSVNPDEVVAVGAAIQAGVLGGDVTGILLLDVTPL
                     SLGVETLGGVMTKIIPRNTTIPTKKSEVFSTAVDGQTNVEIHVLQGEREMSNDNKSLG
                     TFRLDGIPPAPRGVPQIEVVFDIDANGILNVTAKDKGTGKEQSISITGASTLDKSDVE
                     RMVREAEQNASTDKDRREKIDRKNQADSLAYQAEKQLQELGDKVPAADKTKVEGLVKD
                     LREAVSKEDDEQIKKVMPELQQALFAVGSNIYQQAGGTEASTSTGANTSGGSSSSSGS
                     GDDVIDADFTESK"
     gene            20026..23049
                     /locus_tag="DP116_11680"
     CDS             20026..23049
                     /locus_tag="DP116_11680"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016875652.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="peptidase M43"
                     /protein_id="PRJNA477356:DP116_11680"
                     /translation="MKKFIYFVILLYCLLLGIPPSSAKSKTLKIFEAGHFADKESGGQ
                     GERAAKEEKSNRVRETSVVPSFVLWSSQSRQSKLENVTFFEKQGFSKDQVWVIDGKQQ
                     AQRQSFTWVVEDSKKVEQPFLQVEKIVENPIKPEEKTEVIAKPSKKEELEPFDKVVKD
                     AEILPGLFTLYRDKEKNKIYLEINPDQLKKNYLATGTLESGIGEAGIYSGMPLHDFLF
                     YFQRVNNKVQFVVRNVNFRTREGDPQERSLARSFSDSVLYSVSIKSIHPVRKTILIDL
                     GDLLLTDLAGLSLSLGVAPATDKSYFGTAKAFPLNMEIESVLNFTNTRANSDKLRFGG
                     TLADLRGFSLRVHYSLSQLSENNYRPRLADERVGYFITAYQDISNDEHRDPFVRYINR
                     WNLEKQNPSAALSPPKKPIIFWIDNAVPLEYRDAIKEGVLLWNKAFEKAGFKDAIQVQ
                     QMPDNATWDPADIRYNTIRWINTVDGYFAMGPSRVNPLTGEILAADILVDGSFIRALK
                     NDYSRVGQFKQTQNQTPLSALMQNSLLCTNRTEAESSDTSVESMAMEGLASRLSKLAG
                     QYDLCYGMEAANQFAYGSLAMKLLQNSPPSREQMKDYIHQYLRLIIAHEVGHTLGLRH
                     NFRGSTMLTPEQINNQDITRTKGMISSVMDYIPPNIAPQGTKQGDYFPKMIGPYDEWA
                     IQYGYAPIPVVTPAAEKPFLDKVAQQSNKPELSYSTDEDMFDLDPTANAWDNSSNVLV
                     YSHWQLDNALVMWQRLNKVDLLSGESFSDVKEQFGTVFDQYFKNIFYVTKYIGGQSFY
                     RVQPGDPQGKLPFQPVPVEEQRQALDLVQKYVFAEDALSFPPELLNKLAPSRWRHWGS
                     SPRIGRLDFPIHDWVLYMQSSVLWDLLSSDRLSRLKDIELKTARDQALTLPELFDTLQ
                     NDIWTEVVKPNGKPIVISSLRRGLQRQYVDKLTAMVLRKERVPEDARSLAWYKLKQLN
                     EKLDGLSSNDEYTKAHLLETRDRISKILNAEVQAN"
     gene            23163..23804
                     /locus_tag="DP116_11685"
     CDS             23163..23804
                     /locus_tag="DP116_11685"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015122674.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11685" /translation="MGNDSSWLKSEMELVFFPKKQSDISKGSFQDMLEALAVRETGLS SGNPNQYQFENPLSFIGKYQFGEILLIRLGYYKADVYYGHGADKNYWRGTWTKKKGID SKLKFLNSPNVQEEAIREALKVYWKDINDILTQQGKSINNYLGQKKTFNDKGKSKTIT VTLSGILAGAHLRGPDRVVDLLVKGKVSEDEFGTSILEYMEIFGGYDMKIENL" BASE COUNT 6498 a 5203 c 5294 g 6953 t ORIGIN 1 gctgtaggtc gagatttaag taatgccaat ctgagcgctg ccaacctgag cggtaccaac 61 ctgagcggtg ccaacctgag cggtgccaac ctgatcgctg ccaacctgat cgttgccaac 121 ctgagcggtg ccaacctgag cggtgccaac ctgagctatg ccaacctgat cgttgccaac 181 ctgagcggtg ccaacctgag cggtgcctac ctgagcggtg ccaacctgag cggtaccaac 241 ctgagcggtg ccaacctgat cggtgttatt gttgtgaatg ccttttttgg aggaactaaa 301 ggttttacag aagacaacaa gcgtgatcta aagcagcgat tggcaatttt tggcgatgag 361 cagcgagaga atgtctttag cgatcgccct cgggttccaa gcccaaaata acttaaaaat 421 ctgttaatac cagttctgga gaaaacttgc acttaatact tactctttaa ttcttggtgt 481 tcttggcacg gcaggtgcta caacgggggg aacccccgca acgcactgcc ttgtcttggc 541 ggttcgttta ataaatatta agtgcttcat ggagaattaa tattaggcga ttacttcgct 601 tcacttcgtt ccgctcgtaa tgacatttta cgtttactcc cttgccagtc taagatgacc 661 tataattatt tactcctctc tctgtgtact ctgcgcctct gcggttcaaa aataggtatt 721 cttctggtgg ttcgggagta attgggttga gctaattatc tacatcccca agcgcgatac 781 gcttaacgta tcgcactcac cccactacgc caaaaccatt aaggttgtac ctcaacatcc 841 caaggtcgaa ccactcttgt cattccctca acattcactt ggacaagcca ggaggttcct 901 ccgcgccgca catcagttac aattccctct ttactttcac caaagagttt ttcgttaatg 961 actttaattc ttgctcctat ctgaatttcg tatggcggta cgtaagcttt ggaaactgga 1021 tcaacatttt tattagggat agcgaatttg ccctctttga gacgctctag agcatcggat 1081 attttttcta aagctagcgt ctggcgaagt tgaagttgaa gcaataaact tatttgatct 1141 gatgtcataa ttttatattg taaggttgct aagaattatt agatttcccc tcagtgcttc 1201 agttgctagg aacaattgta ttaatactca gttgcattca tacgtattac ccttcgggta 1261 tgaccttcgg tcacgctgcg ctaacgccag atgcctacgg agggagaccc tctccgaagt 1321 tgccacaacg cggggaaccc gcgcaaggca cttctctctg cagcactggt ctcacctcac 1381 
tcccgccctt acgggcaccc ctctccgcga gttcggagag gggaaggggt gaggttgtgt 1441 atcacttaat tctgacaaat ccagcttttt caatcagttc ttgaccctga ggtgtgagca 1501 accaatttgc ataagcatcc cctgcttgct gctctgtttg attattttgt tttaaaatca 1561 caaacaaatt gcgggtaatt gggtagtccc cagtacgaaa tgcctgagca ttcaactgat 1621 tcttcttgcc tggacattca gacagaggta taaaaggttc ttggtaggga ggtacaaatt 1681 gccccgaaat acgccccact ggtagggact taatcgtgca ttggggcaca acctctggcg 1741 cagaagcata gtaaattcca ccaggactcg aagccagttt tctcagagct tcagtagtcg 1801 tattaacata actcacgtta ttgccaaagt tctctttgtt aagaatattt tcgataaaaa 1861 actctactgt accgccagct tctttgctgc gagtgtatgg tataattgga agatctgaac 1921 cgcccacttc ttgccagttg gttatcttgc cagtgtaaat gtctttgatc tgggagacag 1981 ttaaaccagg aatgttaagg tttggattga cggcgatcgc aataccatca attgccactg 2041 gaatttcttt aagactaaat cctttctgta ctgccttttg gttttcttca ccctgaattg 2101 agcgagaaga ttgagaaaaa gcgagttgat tatctatcaa catccgaatg ccagtgccag 2161 aacctggttt ttcaacagcg ggttgaacgt agcgtaaagt aaactgaggg cagatgcttt 2221 gaagcactgg gtctacgtct ttacggatgg gtgcccatgt tgtactacca ccgtagttga 2281 atgtcccggc aggtagattt ggtacactac aacttgttgt cgtagggttt ggaataggag 2341 agttattaac tggattatta tttaccgaag gattgataac aagagtagaa ttaacttcgt 2401 ttctacgact gatttgagcc caccgttcta taagaaacca taagccagca aatgttaagc 2461 caagcgttat aacaatggct aggaaaaggc taagcgtttc gtttttctga gacataaatg 2521 ggtgttagca ccagatggtg tttatagcag cagagataat aatttgtaaa taagacgaaa 2581 tagtgctgtt actgcgatcg ccattaaagc cgccgcaaca gacaaaataa caatctcccc 2641 aacggtaaga ttacctcgta ataagaataa aacaatccct aaggtgattg caggaaaaat 2701 taataaatca gatcctccaa tccatcgctt ccattgagca aaaattaacc ctgataaaat 2761 cacacaagca gcaaccaaag tcacgattgg cgatggcaag attgttgaaa gaccaattgt 2821 tatcaatgct ccttcaaacc cactaaaccc tgctctaact aataatttaa ctaaagaaaa 2881 ttgttgtttg acaggcggag tctgagaaag cgattgcttt tgtacttggg gttgagtctg 2941 tgtttgctgt gattgtggtt gtggggtgac agaaggctga gggggattga tgtatgtaga 3001 aggtatatgc ttttgaggtg cattgagggc attgaggact tcttgcgctg actgataacg 3061 ctgactggcg 
gcgcttatga gcatcttgtc tagaatatca gcaaggtgtg ggctgacact 3121 cacatgaccc cgccacttcc attggttact atacgcatcg aacagttgag ttatctctgt 3181 ttcgcccgtt aataacatga gggttgtgac agctaacgca tacaaatctg tagatggaaa 3241 gacttgacct ccggacattt gttcgggtgg agcaaatccc agagaataaa ttcctgtgga 3301 aacacctgag acactagtag cagcgtttgt aacctgttta actgcgccaa aatctaacaa 3361 gaaaagtttg ccattacgat ggcgcatgat gttagaaggt ttgatgtctc tgtggataat 3421 acctttttca tgaacaaact tgagtacttt gagaatttct tgcagaattt ctaaaacttc 3481 ttgttctgaa tacttgcctt tttgggctaa ttcttcctct aagttttgtc cgtcaatata 3541 ttcttgtact aaataaaaga actggtcttc ctctcctgct cgcaagctgg gaattattac 3601 tggaaagaaa gcgtataagt ctggaatttg ctcatgttgg cgtcctattt gctctaagac 3661 gactgcctct cgttcaaaca gatcttgtgc tagttgtagt tgagttggac ttaaatctcc 3721 tgcgggttga aattgcttca ccacacattg gcgcattcct ggggtacggc gatcgcgtgc 3781 caaaaacgct gctccaaatc cgccccgtcc taaaagcttt gtcggcaaat agcgtccgtc 3841 taacatgagc ggcattccac aagcagtaca gtatttctgc tgcactgttc tcatggttcc 3901 atgatcatct aaatctgaaa aatagttttg tgggcggggg cagcgtggac gagtacaata 3961 aatttccata aatcttgagt caacagtcaa gaatcaacag tcaacagtca atagtcaaca 4021 gtcaatagtc aataggcctt aactaatgaa cgccacttgg tatctcctgc agcctagtgg 4081 cacggctacg gcgtaagcat tgctcttgtg caggagacac gcgaacaatg gggggctcgg 4141 ggttcccccc tggggagctc gatcccttgg ggatgagggg cagaggagga aagccgttgg 4201 aaacaaggac tgcatacgct gctcaccctt tgggtatgcg cagagcgcac gcgtctacaa 4261 gtcgggaaac ccggtggaac gcacctgcct caccgcaaca cttacggtgt gcaacgcagt 4321 ggcttctaat gactaatgac taatgactaa tgactaatga ctacatctta atcatcgtcg 4381 tcagaattat gcgctcctaa atccactgta ctaaccatta tgggtttgcg tgacgttttt 4441 aacatttcta gattccatcc aggtgcagca aactggggta gaatttcccg actaaacgct 4501 tctgctgctt tagatctgta acgattggga ttaaaaacca gccttagtgt ccgtttgact 4561 atcacacctt caatggaagc acagtgtagc acccccattt gtaactcttt agctattgcc 4621 gaggtcgaaa caaaggcagc ccccaaaccg gattgcacgg cattttttat agcttctatg 4681 gagtttagtt ccatttcgat tttaaaacgc cgcgtatcaa tgtcacagcg tgatagtact 4741 tggtcaatga ccttacggat 
agtagattga gaatcgagag cgatgaactg taatttatat 4801 aggtcttctc tttgaatcgt ttcaagtttg gcaaagggat gaaatgctgg tataataagt 4861 gacaactcat cttcggcgta tggaagaatt tccaaagatt ccgcgagttc accaggaatt 4921 tcaccaccga tgattgccaa atccacttgt ccgttagcta cactccaagt ggttctgcgg 4981 gtggagtgga cgtgcaattg cactgctaca tctggatatt tttgtcggaa catcccaatc 5041 atccggggta aaagataggt gccggtggtt tgagaagcac caataatcaa agtaccgcct 5101 tggagatttt gtaaatcctc aatagcgcgg caggtttcct gacacaggct gaggattttc 5161 tcaccgtaat tcagaagtaa atgtccagct tcggttagtt gtgcgcgacg tcctccacgg 5221 tcaaataatg gaacatccag ctgccgttct aggttttgca cttgcaagct aacggcaggc 5281 tgggatacat agaggctatc agcggcacgc ttgaagctgc cttctgcggc gatcgctttg 5341 agaatacgta actgatctaa agtgaaagga aggtcagaca taaggctaaa cccacaaact 5401 ttgagcatga gcggcagtct tctacaaacc agcgtaaaaa gccttgacag cagggcgtga 5461 ttttttgcat cttaaattta gagactttac tcgaaaaaga cagtagcata ccatcttgac 5521 taaagttctc ttttctactt tcctgcccta ataagtagat tttcaaggga attgtgtgta 5581 aactccactc tgggtttcaa aagcttgaac gctttttttg ttttcgttac aaaatccaat 5641 gcacccctct acggttctcc cagcaaggag agagtacgat tttgttgccc cggaatagag 5701 agactttcag ggggtaggct gttaccagaa tttccgtttg acttgcgtcc tgccggagta 5761 acaacgttct cattgcgaca ttttgccctc caactaaatc tgcacatagc ttaaatccac 5821 gacttccacc acactgaatc gttaatccaa ggttcgggac aattgtgttt gtttgtatgc 5881 ccgcgacaag gataagattg actgctacag ttagcatcca ctcttatagc tgaagaatta 5941 ttcatgactt tatagtggat gaaaccatct aattcagcat acgaccattt ggactgggag 6001 acttttgctt ttccccccgg atggtcataa aaaacagggg atagatgaca ggattgtgtc 6061 ctgttatctg tcctctgcac cctgtcatta tatatttgtt tactgactaa actggggaag 6121 atacgcaaac tccgagtcga tttttccagt ttgatgaagt tcttgccaat gacaacttgt 6181 ttcgtaatat ggttattaat atcagcagta aattgtttac ctgttctaga aagtgccatt 6241 aatctacatt taattgaggg agtccctttg tgctttgctg ttgtcagagg gtggtcattg 6301 cgactgactg ctttgtgcag tacttctttt tctgatttga tcacagctaa atacttttgt 6361 aacctaatat ctgaggacac tttaccattc tcaaatgcca tccatgctgt gtcgttcaga 6421 caactttgtt cagtcagaga cactccctgt 
aacaaatttt ctttttctga agttctaatt 6481 aacttcaggt tgccaatgat agctagcttc agttcagacg gtagtttttt catcaggtta 6541 ttctatttta tggtgcggat gttggtcctc aagttgtgtt catctaacgc acatactgta 6601 tggtgacagg atatcctgcc tatgattttt ttacatagtt tgtggtattc gttgtactaa 6661 attaggcttt ctttaagtta cttcatttgt gtctatgatg cagccgactt ggttgaccct 6721 tagtcatttt gtcatgctag gactactatt tggttttgca attgtccata gtggaggcgc 6781 agccctgcgc ccgtgggcag aaaagcatat tgggtgtagg ctttatcgca ttttttttgc 6841 atttagcagc ctgccgttgg ctgtcatctt aattatttac ttttttaacc accgctacga 6901 tggcttgcaa ctttggtatg tccaaaattt accgggagtc cagcaattcg tttgggtact 6961 gtcagctatt tcgtttttgt ttttgtatcc cgcgactttc aatctactag aaattgctgc 7021 tatccaaaag cctcaggtac atctttttga aacaggaatt attcgcatta ctcgccatcc 7081 acagatggtg ggacagatta tctggtgtgt tgctcatacc ctgtggttag gtacaagttt 7141 tacccttgtc acttcgattg ggttggtgtt acaccattta ttcggagtat ggcacgggga 7201 tcgccgtttg agcgatcgct acggagaagc ttatactcaa gttcaacagc gtacttccat 7261 catcccgttt gcagctgttc ttgacggtcg tcaatctatc ttatggcagg aatttttgcg 7321 tccggcttat ttgggggttg caatttttgt gctgttactt tggtggtcac accctctatt 7381 gctcgaagca acaagtaaaa tcggattatg attttaacat aatcgcattt atctacagaa 7441 gttgatgaga taaaatattg atttgatctc atcaactgcc accaaatcaa tgtaggatga 7501 attgaagtag ctgtcagtgg gggtgcgctt agtcgttttg aaattttaga tttggtaatt 7561 gctggtcaac tgccaaaata gggaaaaact tgataattcg ctattttctt ttctataatc 7621 agaagacggt tgattaaatt tggcgaagaa aacaagatag ccaacagcga ccaaaacttg 7681 agaaaatcct caacacttgt gcgagagcgt ctagttccag aataaagcct gtggcaaata 7741 aggtagtgtc tccttttgct gaagcagaag aaaactacct acggttatta tgggtttctg 7801 gagataagtt gtctccatag gtaaggtcaa cagcagattg gcgaaaaaga cctttacata 7861 gacgactatt aagaccaaaa agtttgagag agttaccgtc ttgaatgtct gactgacttt 7921 tttgtcagtt ttgcaatgga caatgtttac aaaatcctga caatgcactc ccaaagctga 7981 cagttagttt tgatatcaaa accatataaa ttccagaggc gtcatggtgt tgtcggttag 8041 tgagcggaca ttttctcaag aagttttaga atcttcaatt cctgttttag taaattttga 8101 agcaccttgg tgtggcttgt gtcgaattat ccacccttta 
ttattgcaat ttaaagttca 8161 atgcggcgac caaattaaat tagtcggggt gaatgctgat gacaatttta aactggctac 8221 aaaatataag ctgaaatcac tgccaacttt actgttgatc gaaaatggcg tagttcgcca 8281 gcgtttggaa agttttcgtg gacgagaaga tttattctta gctttggaag aaataaaagt 8341 ttgttacact aaccgcccta aaaattacca tcgcccaaaa acagcggatt taggttgtcg 8401 ttctgcgtga agagtcatta gtcatcagtt atcagtcatc agtcatcagt cattaactca 8461 ggatcttttt gtttaacagg taaggctttt aaccaaaaac tggagctttc gagtctaaat 8521 tgctgtgtgt gctccagaca accaactcta tactcttgac taatgaccaa tgactaagaa 8581 ctgataacca aatctaaaac caagcttaat aaccccaccc aataactatt gggtggggta 8641 ttttatccta aagtgtgtgt gtcacaatta ttaacagttc cgacaaatta gtcaagaatt 8701 ctaaaagtag gcgtcaatga tggaaataat ctatcagtat gcctggctga ttccggtatt 8761 acctcttctt ggggcaatgc tggtcggtct agggttaatc tcgttgaatc aggtgacgaa 8821 cagcctgcga cagctcaatg cggcatttat tatctcctta atgggactcg cgatggcgct 8881 gtcgtttgct ttgctgtgga gtcaaattca aggacacgct ccttacaccc gcaccttaga 8941 atgggcagca gcaggcaatt tccacctgaa catgggctac actgttgacc acctaacagc 9001 cctgatgcta gtggttgtaa caaccgtagc cgttctagtc atggtttata cggatgggta 9061 catggctcat gacccaggct atgtccggtt ttacgcctat ctcagcttgt tcggctcctc 9121 aatgttgggt ctggtggtta gccccaactt agtacaggtt tatatcttct gggagctggt 9181 cgggatgtgc tcgtacttgc tggtcggctt ctggtacgat cgcaaggcgg cagcagatgc 9241 ttgtcaaaaa gcatttgtaa cgaaccgtgt gggtgacttt ggattgctgt tgggcattct 9301 agggcttttc tgggcaacag gaagctttga ttttgatgtg atgggcgatc gcctagggca 9361 acttgtccaa accggtgctg tcagcaattt gctcgccatc ctgttggcga ttctcgtctt 9421 cctgggtccg gttgcaaaat ccgcccaatt tcccctgcac gtctggttac cagatgcgat 9481 ggaaggtccc acccccatct ccgccctcat ccacgcagca acaatggtgg cggcgggtgt 9541 tttcttaatt gctcggatgt acccagtttt tgaacacgtt ccagcagcaa tgaacgtgat 9601 tgcattcact ggggcattga cagcattttt gggtgcgaca attgccatca ctcaaaacga 9661 catcaaaaag ggcttagctt actccaccat ttcccaactc ggttacatgg tgatggcaat 9721 gggagtaggc gcatacagcg ccggactttt ccacttaatg acccacgcct actttaaggc 9781 gatgctcttc ctgggttctg gttctgtaat tcacggaatg gaaggagttg 
tcggtcacga 9841 ccccaacttg gcgcaggata tgcggttgat gggtggactg cggaagtata tgccagttac 9901 ggcaattacc ttcttcattg gttgtgtggc aatttctggt atccccccct ttgctggttt 9961 ctggtcaaaa gatgaaatcc tagggaaagt atttgcagtg aacccagctc tctgggcagt 10021 tggttggtta accgctggaa ttaccgcttt ctacatgttc cggatgtatt tcgtaacctt 10081 tgaagggaaa ttccggggca atcaaactaa tttgtggcaa aagctcaaat ctccagcagg 10141 aatgggaatt gtaacaggtt ttgatgcagc ccctgctttt ggtcctggtg ctatgaccaa 10201 gggagaattg gaagcaactg aggaacatca ccatgatgac cacagtcacg gtcacggtca 10261 cagtgagtat cctcatgagt caccttggac aatgactctg ccattagcaa ttttggcagt 10321 tccttccata ctgattggtt tgcttgggac tccttttgcc aactactttg aggagtttat 10381 tttcccaccc agcgaaactt tagccgaagt cgtagaaaaa gcggctgaat tcaacccgtc 10441 agaattttac attatggcgg gtgcttcagt tgggatttcc ttgattggca ttaccttggc 10501 ttccttaacg tatctgttgg gtaaaattaa cccagtggcg atcgcagcaa aaatccagcc 10561 gctttacgag ttctccgtca acaaatggta ctttgatgac atttaccatc gtgtttttgt 10621 catcggcttg cgtcgtctag caagacaagt gatggaagtt gacttccgcg ttgttgatgg 10681 tgctgtaaac ctcacaggtt tcttcactct gatcactggt gaaggtctga aatacctgga 10741 aaacggtcgt gctcaattct atgccttgat tatctttggg gctgtgttgg gcttagtgat 10801 tgtttttggt gtaacttgat caccctgtag ggtgaataag gtgagtccac tctgtggctc 10861 aaaatgacaa atgcacacag aggatgaagg cagggcaact cgtttacctc ccttcatctt 10921 tctttatttt tattaaataa acctaacaac gaagaccaat ccctgcaaaa actgctagta 10981 aatagctagt agttgatagc cgaccattat tattgctgac taacaacctg cgatgaatac 11041 agcaaatttt ccttggctga cgacgattat tctgtttccg atagcggcgt cactgctaat 11101 tcccttcatt ccagataaag atggcaaaac agtgcgctgg tatggcctga tcatagggct 11161 gatagatttt gcactgatcg tgattgcttt ttatactggg tatgatttct ccaatccaga 11221 tttgcaactc tttgagagtt acagctggct tccacagatc ggtttgaatt ggtcggtagg 11281 ggcagatggc ttgtctatgc ccctgattat tttgactgga ttcattacca cgctagcgat 11341 gctagcagct tggccggtga cactcaagcc caagctgttt tactttttga tactagcgat 11401 gtatggcggt cagattgccg ttttcgccgt ccaggatatg ctgttgtttt tcctggtgtg 11461 ggaactcgaa ctggtgccga tatacattct 
gctttcgatt tggggaggca aaaagcggca 11521 atacgcagcg accaagttta ttttatacac ggctggcggt tcgctgttta ttttgctcgc 11581 tggcttaacg atggcgtttt ttggcgatac gatcactttt gatatgcaaa cgcttgctgc 11641 caaaggtttc agcctcaatc tccaactttg gctgtatgct gctttcttga ttgcctacgc 11701 cgtcaaactt ccgatcttcc cgttgcatac ctggttacca gatgcccacg gtgaagcgac 11761 agcccccgtg catatgttgc tagcaggtgt tctgctgaaa atgggtggtt acgcgcttat 11821 tcgcatgaat gcccaaatgc ttcccgatgc ccacgcttat tttgccccgg tgttagtgat 11881 tttggggata gttaatatta tctacgctgc cctgacatca tttgcccagc gcaacttgaa 11941 acggaaaatt gcctactcct caatttctca catgggcttt gtggcgatcg gtattgcttc 12001 cttcaccgac ttaggcttga atggggcaat gttgcaaatg gtttcccacg gtttgattgg 12061 ggcgagtttg ttcttcctag taggggcaac atatgaccgt actcacaccc tcatgttgga 12121 tgaaatgggc ggtgttggta agaaaatgag caagatgttt gccatgtgga caacatgttc 12181 tatggcatcc ttagccttgc caggaatgag cggtttcgtt gctgagttga tggtattcgt 12241 tggctttgcc accagcgatg cctataaccc caccttcaaa gttattgtgg tgttgttgat 12301 ggcagttggg gtgattttaa ctccgattta tttgctgtca atgctgcggg agattttcta 12361 cggtgaagaa aataaggaat tagtttctca ccagaaactc atagatgccg aaccccgcga 12421 aatcttcatc atcgcctgct tgttaatacc aattattggt attggtttgt atcctaagat 12481 gctgactcag gtgtacgatg cgacaactgt gcagttgacg gcgcggttac gaaattctct 12541 tcctgctttg gcggatcaaa aagcgccagc gctatcattt aatgcgccgg agattgggaa 12601 ttaggttgtt gatttcaaat caatatttct ttaactgagc gggttttgca cctgctcttt 12661 tttgtatggt gtgatactaa atccgctttt ctaaccccct tttaacccag gcgattagaa 12721 atcgcggcta cacaaacgaa gtcccttcgg gttcgccagt cgcctgcgga gggaaaccct 12781 cccgcagcgc tggactcacc gcctacgcgg actaaataat aaaggggtat tagatccgga 12841 tttagtatga ttgcttcatt ttggtggatg cagttcagcc atgaatagca gacgacgcaa 12901 gtttactacc cctggtcgat ttaaacgaag tgcttctact gtcgctcgac cacaaggagt 12961 gttcccaata acctccgtga aatcgctatt ccagctaaaa tgctcattcc atgattgctg 13021 acgcgggttg aacaaagatt cttgttgccc tgttgtagga tctgtaccgt aaatcttggt 13081 gtgtttgtga ctgttgcagc caaagcaact ccaagctaaa ttttccaaag ttgtttcccc 13141 acccgcttga 
cgaggtttta tgtgttcaac ggtaaaactt tcagttgcaa attcaccaga 13201 acaacggcaa tactcacagt aattttgtgc tcgtgctgca acaatccgcc taatcgcttc 13261 cgtaactcgc ttagagggca ttaggctttg tataaactgg taaaaattcg ttaagtagag 13321 tttttaactc aacattgcgg agtcgagcta attgaattaa cgcttcagct ctttcagcat 13381 cttgacgttc tacacggtct acgtagccta atagctcttg atgctcagct tcagttattt 13441 ctcctgtttc gttacgctga cgtaaataag ttagtctggt ttggtcatct gtaggtaatc 13501 ggcgttggat tctttgcaac aaagtttcct ctccaagagt caccacgcca ttttcaatgc 13561 ataaagcttc taataagtca acagctttaa ataaagtttc ttctggtaat ttttcaacca 13621 atgcgatcgc tctttgacga atttctgttg gtgcagtcat ggaacatctg aattaactca 13681 tgcctagctc gattgtactc aatggtatga gttgggtttt tgtacctgcc cttttttgta 13741 tgcagcgata agcgtccgag agtgcgcgcc ctatgcctgc agcacggcta cgcctatcgg 13801 cgctagcttg ctgccaaagg cagatcgctg tatgattttg ctaatttgtg ctaatccgtt 13861 gcaaaaagct tatattcaac tctagttttt ctcctagcca cagagcatgt tccgtatggt 13921 ctttgacatc atcgcccgtc tcagggtgga ttaaaatatc caacccctca cgattgagca 13981 ttaaccaggg aacaaccttg gcaaactgat ttggtaagaa tgcaacttga tacatcgctt 14041 gtggatgtgg accaatgggt ttatcatgcc agcgcccaag ccgcacctca aatcttgcac 14101 ctaatccttc gcgtacacgt tcagctacat cacgacttgc ggtatcaaag tagacatgag 14161 cgtgaaaacc agtgatttcg atagtatctt cttgcatcgc tctaaactcc agttttcact 14221 gtgactgatt tcactcatta cctccatttt aatggctatc ctaatctaag ctgttcaaac 14281 gctgcgtcac aacaacccta accaggatcg taaggattgc tctaactgct cgacatcgga 14341 attcgccaac ttaccaatcg ttgtcacaat taagctctca tgtactgtgt acaaacctcg 14401 cttgactgct gtaaccacgt taagccctgc tgcagcccag tcataaagta caaattcacc 14461 ctcaagtaac gatcctgttt tacttgtcag tggcgtgatg ataatgtctt gagaaatatg 14521 cggcgcatta attacaactg caggtcttac ttttgaacta gacaaatccg agaaggggta 14581 ccgaaccaag atgatatcat ttttggagta gctgggcata aacatcatcc tcggcgttat 14641 cccaaacttc atcaagtgat gtttggctgg cttgaagcca aaactcagct tcatcatcag 14701 gaagaagtgt taccagcact cttgttcctt caggtatttc cgctgattct agcagttcaa 14761 tttttccttg ccgaacggta gcccaaagtg tttttaacat attcatctta taactggtca 
14821 tgctttgatt ttatgccttg atgccggatc ttgccgtccc actaacaacc tcaaaggaca 14881 cagtcggcac cttcactagg ggttccgatg gagtgaagtg ttgggctagt gccaacttac 14941 gagaagaagc gctcataatc gtcgtggcta ccgatccaaa accatgtgac tgtatcacct 15001 tctagtactc caagtgctcg ataaccccgc gttactctca cagaccaaat atcttcctgg 15061 ctattgatac acttgaaatg caaagatgga tgaaatggat tctgtgccca taaccgataa 15121 gctttcctgg cactttgtct aacatcatca cttaaacgcc gatattcaac ccaaaatgag 15181 ggaagcacgg cggacttcat aactggtcgt agtccagtgt agtcgctttc ccctcagcaa 15241 tttcttgttt ggcgcgttgt gcagcagcta caagtttttg ctgcgttctc tcgaaggatt 15301 tgctccattg aatttcatct tgtaactcat tgatgtactc tcgaaggtga tgtgcaattt 15361 gctcctgtac atcaacaggt aaagactcca tcatctttac tactgtagtg attgctgggg 15421 acgacatagt gtgcttttca tgcgatattc tgtttttaaa gctagcacca attgtcgcct 15481 aactgttcaa ctaactcgct ttgagagcat atctcacagc ctaatactgc gttcaactgc 15541 ataaacacac gtcggcttat ctttccacac cctcttttta actaatctca ttccattttt 15601 atcagctacc cgacatgagg tactgaactt actgtgcgat tgcttcgcag acgcctgaaa 15661 ggcgtatcgc cagctgattt tcttcaatca gctcataccc attcgccaac tgctctattg 15721 ccctatcaat caaatcaaca ccttgctgaa cattctctat caggctcaac tgtccgcgca 15781 aatcagcagc acccgcaaag cctttcgagt accatgtcat gtgtttgcgt gcttgacgca 15841 ctcccctgtc acctttgtat tcgtacaaag cttgtaaatg ttctctagca cattccaagc 15901 gctgaattgg tgtcggtgct ggtaattctt ccccagtttt caggaagtaa tcaatttctc 15961 ctgccaaaaa cggataaccc agagttccac gggaacacat caccccgtca gcaccagttt 16021 gttccaaaca cttcaccgct gcttcaaccg agaaaatatc cccattacct attactggaa 16081 tagaaagaat ttccttaacg cgggcaatcc attcccaacg ggcattgcca ttgtaacctt 16141 gggcacgggt gcgtccgtgt accgtaatca tttgtgcgcc tgcatcttgc atccgcttgg 16201 caaagtcgag aattgtgatt tccttgtcac tccaaccaat acgggttttc actgttacag 16261 gaacatcaac tgccttcacc acttccctaa cgattgcctc agcgacttct ggctgacgca 16321 acagggaaga accaccaccg tttttagtaa ttttatttac cgggcaaccc atattaagat 16381 caacggtatc agctccttcc tcaacagcct ttaccgctgc ttctgccaga aaatctgggc 16441 gacagtcaaa caactgaata ctaattggtc gctcattgcg 
gtctacctcc atgattttcg 16501 gcaactgttt gacatagtgc aatcccgttg cgttcaccat ctccgtgtac atcatcgact 16561 caggtgcata gcgacgcacc agccgacgaa agaccatatc ggtcacgcca gacaaaggcg 16621 actggagaac tcggcttttc acctcaaaag aaccaatttt tagaggagtc gaaagtttgg 16681 cttttaggtc aggagatagc gtaaccatat atcatgtcga ggcatagaag cttttctatt 16741 gtacagagct ttaagaggga acagggaata gggaacaggc aacaggcaac agggaatagg 16801 gaacaggcaa ctcttaacag ggtgaatggt gtttctttgg tagtggtttt ctaacgaagc 16861 gtgggactca ggactactaa agacggcaaa caccacatta acgatacggt tcacccgaat 16921 tactcctagt attaattctc atacattaga caaagaaagt gaaagcaaat tcagtccatg 16981 aacaaaccaa aaaaaatttt ccagcataac cgccctaacc aaggttcgga aaaccaccct 17041 tcatggactg gctacacagc ctcagtatat atatcagcag taaattaaac caaaggtatt 17101 catgctgaac tggctgcttt ctaaatccag ttggttcaaa aatttagtcc cttatactcc 17161 ctttattcat tagtagtgat acggagccag cacctacaat ggcaaaagta gttggaattg 17221 atttaggtac aacaaactcc tgcgtggcag ttatggaagg tggtaaacca acggttattg 17281 ccaacgcaga aggttttcgg acaacacctt cagttgttgc atttgcaaaa aatggcgatc 17341 gcctagttgg tcaaatcgcc aaacgccaag gggtgatgaa cccagaaaac accttttatt 17401 ctgtaaaacg ttttattgga cgtaaatacg acgaaattac tcacgaagca actgaagttt 17461 cttacaaagt tcagcgcgac agcaacggaa acgttaaact ggagtgtccg caagtcggta 17521 agccttttgc tcctgaagaa atttcggcac aagttctccg caagcttgta gaagatgcaa 17581 gcaagtacat tggtgaaact gtaacccaag ctgtcatcac cgttcccgca tacttcaacg 17641 actctcaacg gcaagcgacc aaagacgctg gtaagattgc tggtattgaa gttctgcgga 17701 ttatcaacga acctaccgct gcttctctgg catacggttt tgacaagaag agcaacgaaa 17761 caattctcgt atttgacctt ggtggtggta cattcgacgt atccatcctg gaagttggcg 17821 acggtgtctt tgaagtgtta gcaacttctg gtgatactca ccttggtggt gatgacttcg 17881 ataagaaaat cgttgacttt ttagcggaac agttcaaaaa agacgaaggt attgacctcc 17941 gcagagacaa acaagcctta caacgtctga ctgaagctgg agaaaaagcg aaaattgaac 18001 tttctagcgt cacccaagcg gaaatcaacc tgccatttgt gacggctacc caggatggtc 18061 ccaagcactt ggatacaacg ctgactcgcg ccaagtttga agaactttgc tccgacttaa 18121 ttgatcgttg ccgaattcct 
gttgagaacg cgctgcgcga tgccaaacta agcaagagcg 18181 atattgatga agttgtctta gttggtggtt ctactcgtat tcctgcagtg caacaagtgg 18241 tgaagcaggt gttgggtaaa gacccgaacc aaagcgttaa ccctgatgaa gttgtagcag 18301 ttggtgcagc aattcaagcg ggtgtccttg gtggtgatgt cactggtatc ttgctgttag 18361 acgtaacacc actgtctttg ggtgtggaaa ccttaggcgg tgtcatgacc aagattattc 18421 ctcgcaacac cacaattcct accaagaagt cagaagtctt ctccaccgct gtggatggtc 18481 aaactaacgt ggaaatccac gtcctccaag gcgaacgcga gatgtcaaac gacaacaaga 18541 gtttaggtac cttccgtctc gatggaattc ccccagcacc acgtggtgta cctcaaattg 18601 aagtcgtatt cgatattgac gccaacggta ttctcaacgt taccgcgaaa gacaaaggta 18661 ctggtaagga gcaatccatc agtattactg gtgcttctac cttggataaa tctgacgttg 18721 aacggatggt tagagaagct gaacaaaacg cttcaactga caaagaccgt cgtgagaaaa 18781 tcgaccgcaa aaaccaagcc gactctttgg catatcaagc cgagaagcag ttgcaagaat 18841 tgggtgataa agttcccgct gctgataaga ccaaggtaga aggtttagta aaagacctgc 18901 gtgaggctgt ttctaaggaa gacgacgagc aaatcaagaa ggttatgcca gaactgcaac 18961 aagcactctt tgcagttggt agcaatatct atcaacaagc aggtggaact gaagcttcta 19021 caagcactgg tgctaacacc tctggtggtt cttcctcctc ttctggcagt ggtgatgacg 19081 tgattgatgc tgatttcact gaatctaagt agttattctt ccggtgggta gtacccacca 19141 gacaatcaaa attctcatcc agaacaattc ttgggtggga attttttttt gcaaagatat 19201 agcagggaac agggaactct taacagggaa ctcttaacag ggaataggga acagggaact 19261 cttaacaggg aacagtgatc aaatgattgt tcattgataa ctgcggctgg tacttgataa 19321 ctggtaactg gcaactgcta taatctcaaa ttgggtttgt actgcaggtc gaaagtcaat 19381 tgccaggata gttctaagca attgacttgc gattgttgag cgcttcatat gctttctaag 19441 cacacactcg ctttggcggc agacaaaagt ttttcgtttg ttggtaacct cctgttttta 19501 tttagcctgt tatagttcca tcagtcatag acaaatataa acagtcgttt gatttacggc 19561 ttagacccaa gtaaccacag aacgctgtta gaaagcacga tgttatctat taggcatagc 19621 atgcactacc acacatcttc tatgcctcga cagttaataa ctgaatgaat cgacactact 19681 tgaattgttg tttgaagtca aaactcaaac atttttgagg tgggtcgaaa cgttattttt 19741 agagtctatt ataccatatt ttgcgattcg tgtctacaaa tttcgtgatt tttcataata 19801 
cgtactctgt gcgacaaacg catttattgg gaaaactact agcataagat attgcaacaa 19861 agctagaaaa tcgtatcgac aattgactga gttcactgag aaactactgc ttgcaagtag 19921 ttaacaaatt tttcagaact taacacttag ttaaagagag tatgaattag ctttgtaaca 19981 caagtattgg aaataaaata ggaatcaata ctaggtatgt attggatgaa aaaatttatt 20041 tattttgtta ttctgttgta ttgtttgctt ttaggtatac caccttctag tgctaagtcg 20101 aaaactctaa agatatttga agctgggcac tttgctgaca aggaaagtgg gggacaagga 20161 gaaagagcag caaaagagga aaagagtaac agagtccggg aaacatctgt tgttccctca 20221 ttcgtcttgt ggtcatcgca gtcacgtcaa tcaaaattag agaatgttac tttttttgag 20281 aaacaggggt tctcaaaaga ccaagtctgg gtgattgatg ggaagcaaca agcacagcgt 20341 caatctttta cttgggttgt tgaagatagt aaaaaagttg aacaaccgtt tttacaagta 20401 gagaagattg tagagaaccc gattaagcca gaggaaaaaa ctgaggtcat tgcgaaacca 20461 tcaaaaaagg aagaattaga gccgtttgat aaagttgtca aagatgctga aattctgcct 20521 ggactcttta ctttgtatcg agacaaggaa aagaataaaa tttatctaga aattaatcct 20581 gaccagctca agaaaaatta tttagcgaca ggaacgttag aatctgggat tggtgaggct 20641 ggaatctata gtggaatgcc gttgcatgat tttttatttt actttcagcg ggtgaataat 20701 aaagtgcaat ttgtggttcg caatgtgaat tttcggactc gtgaaggaga tccacaggag 20761 cgatcgctcg cccgatcttt tagtgactca gttctctact cagtatccat taaaagtatt 20821 catccagtac gcaaaacaat tctcattgac ttgggagatc tactactaac agatttagct 20881 ggattatctc ttagtttggg agttgcgcca gcgacagata aatcctattt tgggactgct 20941 aaagcttttc ccctaaatat ggaaattgag tcggtgttga acttcactaa cactcgtgcc 21001 aatagtgata aactccgttt tggtggtaca ctagcagacc ttcgaggctt tagcttgcgg 21061 gttcattaca gtctgtccca acttagcgag aacaattatc gaccgcgttt agccgatgag 21121 cgtgttggct attttatcac tgcctaccaa gatatatcta acgatgagca cagagatcca 21181 tttgtccgct acatcaaccg ttggaattta gaaaagcaaa accctagcgc agccctttct 21241 ccaccgaaaa aaccaattat cttctggatt gataacgcag tacccctaga ataccgcgat 21301 gcaatcaaag aaggtgtcct gctgtggaat aaagcctttg aaaaagcagg attcaaagat 21361 gcgattcaag tacagcaaat gccggataat gcgacatggg atccagcaga tatacgctac 21421 aacacaattc gctggattaa cacagtagat ggatatttcg ctatgggacc 
atcccgtgtt 21481 aacccattga ctggagaaat tttagctgca gacattctgg tagatggcag ttttatccgg 21541 gcactcaaga atgattattc tagagttgga cagttcaagc aaacccaaaa tcagacgccg 21601 ctgtctgcgt tgatgcaaaa tagcttgctt tgcaccaata gaacagaagc agaaagtagc 21661 gacacttcgg ttgagagtat ggcaatggag gggttagcga gtcgtttatc aaaactggca 21721 ggtcagtatg acctttgcta tggtatggaa gctgctaacc aatttgctta tggttcactg 21781 gctatgaagc ttttgcaaaa cagcccgcca agtcgtgagc aaatgaaaga ttatattcat 21841 cagtatttac gcttaattat tgcacatgaa gtgggacaca ccttagggtt gaggcataac 21901 ttccgtggta gtacaatgct tactcccgaa cagataaaca accaggatat cactcgcacc 21961 aagggtatga tttcgtcagt gatggactat attccaccaa atattgcacc tcaaggtact 22021 aagcagggag attattttcc taagatgata ggaccttatg atgagtgggc aattcaatac 22081 ggatacgcac caattccagt agtaactcct gcagcagaaa aaccattttt ggataaagtt 22141 gctcagcaat ctaacaaacc tgagttaagt tattctacag atgaagatat gtttgacctt 22201 gacccaacgg cgaatgcatg ggataacagc agtaatgtgc tggtctattc tcattggcag 22261 ttggataatg cattggttat gtggcaacgt ctcaacaagg ttgacctctt gtctggcgag 22321 agtttcagcg atgtgaagga acagtttggt acagtctttg accagtattt caaaaatatt 22381 ttttatgtta caaaatacat tggtggacag tctttctaca gagttcaacc tggtgatcct 22441 caaggaaaat taccatttca acctgtgcct gtggaagaac aacggcaggc gttagatctt 22501 gtacaaaagt atgtattcgc tgaagatgct ctgagttttc ccccagaatt gctgaataaa 22561 ttagcacctt cccgttggcg acattggggg agttctcctc ggattggtcg tctggacttt 22621 ccaattcatg attgggtatt gtatatgcag agttcggtgt tgtgggattt gctgtcaagc 22681 gatcgcctct cccgtctcaa agatatcgaa ctcaaaactg ctcgcgatca agccctaact 22741 ctaccagaac tatttgatac attacaaaac gatatttgga ctgaggtagt gaaaccaaat 22801 ggtaagccta tcgttatttc tagtttgcgt agaggtttgc aacgccaata tgttgacaaa 22861 ttgactgcga tggttttacg taaagaacgt gtcccagaag atgctcgctc actagcttgg 22921 tataaactca aacagctaaa tgaaaagctg gatggactga gttcaaatga cgagtacacc 22981 aaggcacatt tgttagaaac acgcgatcgc attagtaaaa tcttgaacgc cgaagtgcaa 23041 gcgaattagg aaaatttgtg tcaaagatag ctcgcgatct catcttatta cttatagttt 23101 gtcaactact ttatacaaat ggcaatttac 
caagtctgag ttcttacagt aatgacttat 23161 taatgggaaa tgattctagt tggttgaaat ctgagatgga acttgttttt tttccaaaga 23221 agcagtctga tatcagcaag ggaagtttcc aagatatgct ggaagcactt gcagttcgtg 23281 aaacagggct atcatcagga aatcctaatc agtaccaatt tgaaaatcca ctctcattta 23341 ttggcaaata ccaatttgga gaaatcctgt taatacgtct tggttactac aaagccgacg 23401 tttattatgg acatggtgct gacaagaatt actggcgagg tacatggaca aaaaagaagg 23461 gtattgatag caaattaaaa tttctcaatt cccctaatgt tcaagaagaa gcgattcgtg 23521 aggctttgaa ggtgtattgg aaggatatca acgatatcct cactcagcaa ggtaagtcta 23581 taaacaacta ccttggtcag aaaaaaacgt ttaatgataa agggaagtca aaaacaatca 23641 cagttaccct ttctggcatt ttagcagggg cacacttgag aggtcctgat cgagtggtag 23701 atcttctagt aaaagggaaa gtatctgagg atgaatttgg cacttctatt cttgaatata 23761 tggagatatt tggcggttat gacatgaaaa tagaaaattt atgaatcttt accgaaaaag 23821 tactcctcag ctaggaactc agaactcaga atagagaatc aagatttgga attcccagct 23881 tcgagtaagg ttgctgaaga agcgggtgta agctgttacg cttttaactt gcatattatt 23941 ttggctgt // LOCUS NODE_1298_length_23876_cov_5.21825323876 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 23876) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 23876) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
            Information about the Pipeline can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..23876
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            145..1035
                     /locus_tag="DP116_11690"
     CDS             145..1035
                     /locus_tag="DP116_11690"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009458478.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_11690" /translation="MTITKSPQLRFLRFPKSPTLSQQLMLIGLTMTLFFVFLALFAPL LQLLGLVQNPIDSLSNPIQEPPSLQHWFGTSREGYDVFSRTLFGAQAALQVVLLATAI SMFVGVPLGMLSGYAGGRLDKALLFIMDSIYTLPGLLLSITLAFIVGRGILNAALAIS IAYVPQYYRVVRNHTVSVKTEVFIEAAQAMGANTWQVLSRYLFFNVIQSVPVLFTLNA ADAILILGGLGFLGLGLPRETPEWGRDLQQALQALPVGVWWTALFPGLAMTIMVVGLS LFGEGLNEYINPRLRKDIRK" gene 1157..1360 /locus_tag="DP116_11695" CDS 1157..1360 /locus_tag="DP116_11695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317741.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11695" /translation="MKDNFSLIAAAAGGFMLSVALTGILRGAPVASLQGNPGFHSLTV ATLQPAPKTSENVEFFGSKTGKK" gene 1520..1768 /locus_tag="DP116_11700" CDS 1520..1768 /locus_tag="DP116_11700" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11700" /translation="MEPQLTIQNSLIRLGKFALSGNQRPSGSPVAYGGFPDRGIWRWG SPKVCRGKQPKRRALTVVATAEVPPRVHLAWLGGSSKY" gene complement(1949..4849) /locus_tag="DP116_11705" CDS complement(1949..4849) /locus_tag="DP116_11705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314438.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CAAX protease" /protein_id="PRJNA477356:DP116_11705" /translation="MNVKRVSIFLGIVFIAVLCTVLFPELSHLWLGWYNALGGALKLT VDLLQISFIAVLFAGLLAPLEALGWWAGWYGDKVDTKLNLGILEQEIPPQTNVVRYVI YLDGIGQASFKYFPDGDQFLHELAIALPDNTVLIKGIIPYSVFNRPLTEGGILSPFWR FAERRSQSKTGSIFNALLTLAINIRNLLVVAVSADQRFGPIYNQGTAQVMYNSLINHG YKPGSAVPITLIGFSGGGQIAMGALSYLKQALDKAPIEVISLAGVISGNTNALLAEHL YHLVGDKDPVERLGPILFAKRWKVFFLSYWNRAKRLGKISFVSLGPVGHMGAGGPLDD KKFLSDGRSYLRQTVDIISGILRDEYPYNEELVITKLSNYERYREAAFNRPEYYPLNQ SVNTEFYEPIAPWMGRLILPPKEQRQSGVLFEVHHADTKHQHLVGQVVYLQWIDDPES KVSVQSTKKDVHFNAEALYNYTRGKIVPIRINHWRQVTPLESLAGSRPNDDIIVKLRE PVVVEQNGKITLYITSEPVQISGRFYGLVRFLQPIQPGSEQFRVIHFNRASRKFNSVE EVVLLPEVIPSEQNISSSTKDGLEKSPLNETGWYIYGAKNAAGMFVVQALAPRALLRV QPREVVVGRKLAMEYLKKRSWNNIITKKGQIQSVILNRKSKDIPQAVSKWREGECALL LHVYGGIGGKKREIAAKAPVYFGHSAYGVAEVVREPLTGELRFEIEYHQVYTHNVDGL IAGTLSWTRYMGDRQFGWLGIRPVCDTLIKLDALTNDYDADEVKRTLLGVLVRELEIM TARYRIGDGMGATYVGPANNCAQDSNQALYAAIKVIQAAIQFDAKDIAYAIKNNPEFK NWLLRHPEYATSFKQLVKLDKAIRHDLLPFGIARADWENSSTTTIGSSLADSPLQQIL KALVSWRSILPRKASDSVTEIFHKQGASLWILSTSQVGNSDPEIAAIAPFTF" gene complement(4846..5541) /locus_tag="DP116_11710" CDS complement(4846..5541) /locus_tag="DP116_11710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11710" /translation="MTQTALDRFWELAGGAIALNPEAFNLIQTLPQASKAAFYIVLVA GFSQAVGQGTVLFINRVRPIRFLLSLFISSVLFVFTVLFWGLSTWLVSFILFRANIPY NIVWSTLGFAYAPLILSFLVALPYLGVAIQVLLSIWTLLIFVMGLRVAIGVGIWQALW CGVLGWVVFQIVQRTIGRPVAILGKWLSNTVAGTHLVTDMQGVEQLLQAGPQILRRIN QNNAGDKGDNTRV" gene complement(5592..6302) /locus_tag="DP116_11715" CDS complement(5592..6302) /locus_tag="DP116_11715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11715" /translation="MKVLKPAKVKITSGICTSLFLTFALGTPLASAANTAGDYRQLGL LYRQQGRLPEAITAMQKSVELEPKNLMGRVNLGWTLHLAGHDQQAAQSLWQAIYQKPT FVPAYNALGIVYLVDSNLTGAVLVHTLAAILKPDNEIAYFNLSLALHRLQIYNLAIVT GNRAAILEPNNPHPLVASAIAYWDSGNQNTAKKVYTKAIYLDSRYSNRTFLAHLKQAA FSQEQIKKTELILNFKVN" gene complement(6423..7217) /locus_tag="DP116_11720" CDS complement(6423..7217) /locus_tag="DP116_11720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_11720" /translation="MTFVTTATDSVLSLPDHTQLPDSDGTFVKNLQEHPQSILLTDSI KPILEQLHPDGQYCIGQDSGIYWRLTDPPQKGAEAPDWFYVPNVPPTLNGKIRRSYVL WKEYVAPLIVIEFVSGDGSEERDRTPPSLREDGTVGKFWVYEQAIRVPYYAIYEVEKA LLEVYRLLDNTYQLMRPNERGHYLIAPLAVELGIWQGRYQNAELPWLRWWNAQGNLLL TGEERAEVERQRAEVERHKRETIVEKLRSLSPEQLDALGIDPKMLD" gene complement(7554..8276) /locus_tag="DP116_11725" CDS complement(7554..8276) /locus_tag="DP116_11725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412489.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11725" /translation="MNLKSLTYSVFAKPLIYMILGSLSVIDTLPLVLADTTYPQKNNN DSVVTHDTLSTLPQNKRANASNFILAQIGDRNEQERSRLIQEANVSYNQRNFAAAEEN LRKLIKKFPKDAFAHYQLGNVLYQQDKAEEAIGEYKQAIRLNSSYALAHNAMGIALAS QTRWEEAIAEFQKALKINPEYADALASLGQVLWQKGNRDEALASVNKALNIFKTQNRP DKVYQVQQLLQKMKATEDPSVS" gene 8820..10727 /locus_tag="DP116_11730" CDS 8820..10727 /locus_tag="DP116_11730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017710780.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11730" /translation="MSKGGLSVFLALLSLFFGTFVVMTIVRYISELLFPNTDVESSSE LYWEVFVQLIGLRDTGDNANLATKIIGVITIFLGLVLFSSLVAFITQEFESRLQILRQ GQSPVIEENHTLILGFSDRVIDIIKELVVGNESEADAVVVILSQKDKEEMDNFLRNNL GDLKTTRVVTRNGIITNLNELDKVGIKVAKSVIILNDAKTSDPDELKTLADARVVKAV LAVVAANEEDSVPSIVVELHSQQYRRLAENIAPGAVTTLNEADILARILVQTSRSVGL AAVYLNLVGFEGNEFYFYRPEKGWQSVNFGELPFHFSNGIPIGVRHANATLTIKPSKD YQLVESDEAIVLAEDDSSIQFHPQSVVQPKNFSYSDCCKTLEQKPERHLIIGWNSKTP ITLREYAKYLISGSEVNLVVQDLTSQVKAEFDTIAKNYSQIKMDALQVNLDSVEQLLR LKPYEYNSISILALRGENSEEIDAKTLTILLELRQIFREYTAETKNQVTTELIAEIID SQDTDLVIKAGVKDFLLTNQFVSKILAQVSQEPSVMSIYDDLFSVDGSELYIKPISLY FSSKELGRLTFADCVKAAQERDELCLGVKISALAQNKNQNFGIDLVPSLDKPLNLTFN DALITLAEDET" gene complement(10933..11454) /locus_tag="DP116_11735" CDS complement(10933..11454) /locus_tag="DP116_11735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115868.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydrofolate reductase" /protein_id="PRJNA477356:DP116_11735" /translation="MRKIRLFIASSLDGYIARTSGEVDWLFTDQDYGYNEFSTQIDTV LMGSKTYYQVLTFGEYPYKDKKGFVFSKTVQVERDNNVEFVKENWKDFIHALRQSSGH DIWLVGGAQTIHYFMKYGFVDELILSIHPILLRNGIPLIVNDPSLETALELKDVKTYD SGLLQVSYDLKKI" gene complement(11543..13963) /locus_tag="DP116_11740" CDS complement(11543..13963) /locus_tag="DP116_11740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741309.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="exonuclease" /protein_id="PRJNA477356:DP116_11740" /translation="MDFVVLDTEGNPELSELAIVDSQGRVIYEAFSKNHPNNTSNVPN LKSLKTLLTEFLTIVQGKKIVCHYAEHDLKVLQSSFRSAGLKLPNLEFECTWNKAKNY WLDLESYSLEYLSKYLNLRVNNRYFIKGLAHAARYDAEFTYHLYRKLMNEQLKEQPNP FSGSRVDTPFQNHPDYLDTYHKEFITLESILKDIKLDSNRQSKGAVVIGEPGSGKTHL MMRLAQARLSSNRLLFIRQPNHPKSVLYHIYSRILESLVERVGTFTQLDYLIVNSFHK IVATFPNLTNKDQDILKALKDKNIDALGGEGTLRKREYWQHIEKRFNEWWVSYYSAGG FAPSILKGIIKYCSYTDSKRKEIVTRWLAANILSTEEAEYVGLPNWGEDLSQEAFSLE AISVLGKLSILDEPLIIIFDQLEGLGLPHNREILLNFGETIKEIFTHVPNSLIILNMF PERWEQFQNSFDNSIIGRVSQYQIFLQRPDETELKAILEVKLKTLNVPLEQIFFPEDL EDILEQKSIRAVLNRAADYYNYRVRQIPLPAIRENVRKLDEEEKLIIQLRVVQQQQQI LTEVFLNIIEEMQKPGSVDMRELRKKLVPDVKTQEQQIEEYVVEYLTRQKASLEQQYV NIPIISDSDDIGKLKTIAEAFNHLKPIKLTLYRLGKRVLPEHIVIETDHENYVIAFLQ IPPNTTSFTSRIGNFNELVSSHPQDRFGLFRDERLTKIKGQVAQERVEQLKNSPNGNF GLFTKEDRIHLELTYKLIIDVQNKDLDVDLESALKVFITNEQWYHWLFSIFGFTKPKP SAQTEKYI" gene 14669..15244 /locus_tag="DP116_11745" CDS 14669..15244 /locus_tag="DP116_11745" /inference="COORDINATES: protein motif:HMM:PF01565.21" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11745" /translation="MKKRIRTVLFIVLATVTLVIAIVGRPIIHISLTAWNDRSEIQPL PPGLIDDASRLNSTPVLVREVPDNPKKAVQVLQELLKVARSTGKKVALAGARHSMGGH TIYADGISLDMSNFKHMALDEKTNVLHVGSGARWADIIPYLNAHGRSVALMQSNNDFS IGGTMSANAHGWQHNSPHECLHRKQLSTYAS" gene 15171..16157 /locus_tag="DP116_11750" CDS 15171..16157 /locus_tag="DP116_11750" /inference="COORDINATES: protein motif:HMM:PF04030.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11750" /translation="MPTVGSTIALMSASTVNSFRLMLADGSVVHCSRSENAELFSLVL GGYGLFGIILDVDLQVVKNEMYLAQRVIIPTQQYVDIFEELVNRVTDIGMVYGRISVA PDYFLKEAILTIYRRNPSKDGKISPLNEHSRRGLTRTVFRGEVGSNYGKNLRWQLEKA LGGEAGSNVSRNQILNRSSKLLENQKLASTDILHEYFIPPKSMETFLEKCRTIIPKHN GDLLNITVRNVHRDSDSFLRYADQNLFGLVMLFHQHRTPNAEVQMKAMTTELIDAVLS VGGRYYLPYRLHATKEQFARAYPQAQEFFALKRKYDSKEVFQNQFYLKYGKQ" gene 16473..17267 /locus_tag="DP116_11755" CDS 16473..17267 /locus_tag="DP116_11755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315536.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2993 domain-containing protein" /protein_id="PRJNA477356:DP116_11755" /translation="MLGGLTGLTDPKGTDWGEQMLNTVASQTIRRLFTQSESVEVSVR CYPSSKLLQGSIDSFKMKGRGLVIRRQFAAEELSVETDAVAIDFSSVLSGKLRLKQPT QAIAQVVLLEAGINESFKAELVRKRLENLTAPALTALSGGQPVSFPEVQIKLLPQNRL RILAKADLNNGTLVPLNMTVTIGIERRRRVSFKDPEIDLNEVPEAQKEISRTLSLALV EILDNMVDLDRFDLDGVKMRLNRLETEGERLIFSGYAEIERIPKNP" gene complement(17354..18745) /locus_tag="DP116_11760" CDS complement(17354..18745) /locus_tag="DP116_11760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458121.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-glucose 6-dehydrogenase" /protein_id="PRJNA477356:DP116_11760" /translation="MRVCVIGTGYVGLVTGACLAHIGHDVVCIDNNEEKVKIMKSGQS PIFEPGLSDIMQSAISTGKIEFSSDLAAGVAHGEILFIAVGTPPLPTGESDTRYVEAV ARGIGAHLDGGYKVIVNKSTVPIGSGDWVRMIVLDGIAERQNTLITAGGAPTYDKLPE ITAQFDVVSNPEFLREGSAVYDTFNPDRIVLGGNSPRAIAMMKELYTPIVERKFATDP SLPPVPVLVTDLSSAEMIKYAANAFLATKISFINEVANICDRVGADVTQVAKGIGLDS RIGNKFLNAGIGWGGSCFPKDVSALIHTADDYGYETQLLKAAVSVNERQRLLAIEKLQ QTLKILKGKTVGLLGLTFKPDTDDMRDAPALNVIEQLNRLGARVKAYDPIVSQTGMRH GLSGVLVETDAERLADGCDALVLVTEWQQFKNLDYAKMAQLMSHPVMIDGRNFLDPEI MVRAGFQYVGIGR" gene complement(18878..19828) /locus_tag="DP116_11765" CDS complement(18878..19828) /locus_tag="DP116_11765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215768.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SDR family NAD-dependent epimerase/dehydratase" /protein_id="PRJNA477356:DP116_11765" /translation="MRILVTGGAGFIGSHLIDRLIANGDDILCLDNFYTGHKRNILKW LDHPSFEMIRHDITEPIRLEVDQIYHLACPASPVHYQYNPVKTVKTNVMGTLNMLGLA KRVKARLLLASTSEVYGDPEVHPQSEEYWGNVNPIGIRSCYDEGKRIAETLTFDYYRQ NKVDVRVARIFNTYGPRMLENDGRVVSNFVVQALRGIPLTVYGEGLQTRSFCYVSDLV DGLIRLMNNEYVGPVNLGNPDEYTILELAKTVQDLVNPDAQIKFEPLPSDDPRRRRPD ITRAKTWLNWEPSIALQQGLKLTIEDFRERVAPPSSSQPS" gene complement(20635..22785) /locus_tag="DP116_11770" CDS complement(20635..22785) /locus_tag="DP116_11770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sodium:proton antiporter" /protein_id="PRJNA477356:DP116_11770" /translation="MELISQVLALESTSQVFGKEPIVPFVILLVVILVVPILFERLQL PGLVGLLVSGVVLGPYGWNLLHTESAFRTLLSDIGLVYLMFVAGLEIDIEQFRRTKNQ SLGFGSLSFLVPLLVGTLVGRFFDFDWNSSILIGSLFASHTLLAYPILSRLGVVNNEA ITVTIGATIFTDIAALLVLAICVAMKAGPFSFGRLMTLVTLLALYSIVVLVGFDWAGK QFFRRSGDEEGNQFLFVLLSVFLATVGAQLIGVEKIVGAFLAGLAVNETVGEGPVKEK VVFVGSVLFIPIFFVNLGLLINVPAFLQSIETLQFTFFIIIGLVASKFIAAFLAKLVY RYNWQETLTMWSLSVPQVGATLGATIMGYRAGLLDSRILTSVIVLMLITSTLGPFITS RVAAGLTNSGIKDQQSVNPAYQKQEENDSSYTIVVPVHNPQTQQYLIEMAALLARESH GRIIPLAIATAFAHMDAPQLDASVQRSHRLLAKATALSKVLGVEAQPLLRIDDAFAQG ISRASREQKASLIIMGWGKRTGFKARLFGNVIDNVLWASHCPVAVTRLVESPKKFQRI LVPLENLMTPSLQPVKFAQILADANQAQVTILNVCERRTSSSKIAWRRSQLSLLISRL ALQNPPEIQIIAHENTAQAILQAARLYDLVVLPFIRNRTVPGGLAISDVTTQLAKQLT CSIVMLGEPQRTQDVVVQTEVTSSTTAAMSEGIV" gene complement(22930..23586) /locus_tag="DP116_11775" CDS complement(22930..23586) /locus_tag="DP116_11775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196844.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11775" /translation="MRLPLPQFDRSDRHPNHIAEVIETSSCEFLAQCLEPDDLSFPSM PPFGSWVRSVDEESGNQIYAVVYSATTMPVDSVHRAVALGLSLQDLREEQPQIFAMLK TEFRAAIVGFEQSPDAHFSNARVFQYLPPRPPQVHQAVYRCESEAIIKFTEQLDFLRT LLSVNGAPVESLTAAAIREVYQLRKADRQWIIKAGRTLSVLLKDDYDRLRFILSQIHP " BASE COUNT 6882 a 5184 c 4898 g 6912 t ORIGIN 1 atgttaaata atgttacaaa actttataag taatttaatt tttgatattg gacaccgtga 61 cttgtctttg gatcattgca tgaaaccatc tggtatgtag aatggtcatt tatagctcat 121 agccgcgaac tcctccctcc gtttatgacc attacaaaaa gtccccaact aagattttta 181 cggtttccca aaagccccac cctttcccag caactgatgc tgattgggtt aactatgact 241 ctatttttcg tgttcctagc tcttttcgct cccttacttc agctattggg tttggtgcaa 301 aaccccatag attccttaag taaccccatt caagaaccac cctcacttca acattggttt 361 ggtactagcc gcgaaggcta tgacgtcttc tctcgcactc tattcggtgc tcaagctgcc 421 ttgcaagttg ttttattagc tacggctata agtatgtttg ttggtgtacc tctgggtatg 481 ttgagtggct acgcgggggg tagattagat aaagcattac tattcattat ggatagtatc 541 tacactttac cggggctgct actttcgata acgctagcat ttatcgtggg acgtggaatc 601 ttaaacgcgg ctcttgctat cagcattgct tatgtaccac agtattatcg tgttgttcgc 661 aaccacactg tgagtgttaa aactgaagtg ttcattgaag cagcacaagc aatgggtgcc 721 aacacctggc aggttctttc tcgatacctg tttttcaacg tgattcaaag cgtacctgtg 781 ctttttaccc ttaacgctgc tgacgcaatc ttgattctgg gcggtttggg ctttttaggg 841 ttagggcttc caagagaaac gccagaatgg ggacgcgatt tacaacaagc tttgcaagcg 901 ctacccgttg gtgtttggtg gactgcgctt ttccccggat tagcaatgac aataatggtc 961 gtgggactgt cgttatttgg tgaggggtta aatgagtata tcaatccgcg tttacgaaaa 1021 gacatccgga aatagtcgag agttatttgt tattaaatag tggttagtcg tcaatacaac 1081 aactaacgat aagaaaagca tacttgagat cgtttgctac cctaatatct gttgtacttg 1141 tcattgagag atttgtgtga aagataactt ttctttaatt gctgccgctg ctggtggctt 1201 catgctttcc gttgccctga ctggcatttt aagaggtgcg ccagtcgcat ctttgcaggg 1261 taaccccggt tttcattccc tgactgtcgc tactttacaa cccgcaccaa agactagtga 1321 aaatgtagaa ttttttggta gcaaaactgg taaaaaatga agtagcagat ttggcagcgc 
1381 tggcaataaa ttttatagct gatgtagtgg attgtttttt gagcgtacga ataactagat 1441 ttacctgaaa agtctagttt cattcatttt tgtgagcgtt tgctcatgct gcctcttgca 1501 aaagtcaatt ttttggaata tggaaccaca gttaacaatt caaaattcgc tcattcgtct 1561 tggaaagttt gctctgtcgg gaaaccaacg cccttcgggt tcgccagtcg cctacggagg 1621 gtttcccgac agagggatct ggcgttgggg ttcccccaaa gtctgccggg ggaagcaacc 1681 caagcggcga gctttgaccg ttgtggcgac tgcggaggtt cctccgaggg tgcacctggc 1741 gtggttgggg ggatcttcaa aatattaaaa cgttaaccga accgtattgt cccctctcct 1801 taataaggag aggggtgccc gtcagggcgg ggtgaggtga gaccagtgct gcgggagggt 1861 ttccctccgc aggcatctgg cgttagcgca gcgtgcgctt tgcgcatacc cgaagggtaa 1921 tacatatgaa tgcaacgcgc gtataaaact aaaaagtaaa aggagcgatc gccgcaatct 1981 ccggatctga gttacccacc tgactcgtgc taagtatcca taaagacgct ccctgtttat 2041 gaaatatctc agtcacgcta tcactggctt ttcgtggcaa aatgcttcgc caactcacca 2101 aagctttcaa gatttgttgt agaggactgt ctgcaagaga actgcctatc gttgtagtag 2161 aagagttctc ccagtcagct cgtgctatcc caaagggtaa cagatcatgt ctgatagctt 2221 tgtctaactt caccaactgc ttaaaactag tggcatattc gggatgacga agtaaccagt 2281 ttttaaactc aggattattt ttgatcgcat acgcaatatc tttagcatcg aattgaatgg 2341 cagcctgaat aactttaatc gctgcatata atgcctggtt agaatcttga gcacagttgt 2401 ttgcaggacc gacataagtt gcacccatcc catctccaat ccggtaacgc gctgtcatga 2461 tttccaactc acgcaccaaa actcccaaga gtgtgcgttt gacttcatca gcatcgtagt 2521 catttgtgag agcatctaat ttaatcagcg tatcgcaaac gggacgaatt cctaaccaac 2581 caaactggcg atcgcccata taacgagtcc aagaaagcgt acctgcaatc agcccatcaa 2641 cgttgtgagt gtaaacttgg tgatactcaa tctcaaagcg taactctcct gtcaggggtt 2701 cgcgcaccac ttctgcgacc ccataggcac tatgaccaaa gtagacaggg gctttcgctg 2761 cgatttctcg ttttttgccg ccaataccgc cataaacgtg taaaagcaaa gcacactcac 2821 cttcacgcca tttggacact gcttgcggaa tatctttact cttgcgattc aaaatcacag 2881 attgtatttg tcctttcttt gtgatgatat tattccaaga gcgtttcttg agatattcca 2941 tcgctagttt tctgccaaca accacctctc ggggttggac tcgcaacaaa gcacgggggg 3001 ctaaggcctg cacaacaaac attccggcag cgttttttgc accgtaaata taccagcctg 3061 
tctcgttcag gggagacttt tccagaccat ccttagttga tgaagaaatg ttttgctcag 3121 aaggaatcac ctcaggcaaa agcacgactt cctcaacaga gttaaactta cgcgaagcgc 3181 gattgaaatg tataacccga aactgttcgc ttcctggctg aatgggctgc aaaaacctca 3241 ctaatccata aaagcgtcca gaaatttgaa ctggttcact tgtaatgtaa agggtaattt 3301 tcccattctg ttccaccaca acaggctcgc gcagctttac gattatatca tcattcggtc 3361 ttgaaccagc cagcgactcc aatggtgtca cctgacgcca gtgatttatc cgaatgggaa 3421 caattttccc ccgtgtgtag ttgtatagtg cttccgcgtt gaagtgaaca tcctttttgg 3481 ttgactgaac cgatactttt gattcgggat cgtcaatcca ttgtaaatac acgacttgtc 3541 cgaccaaatg ttgatgttta gtatctgcgt gatgcacttc aaacagaaca cctgattggc 3601 gttgctcttt tggcggtaaa attagtcttc ccatccaagg ggcgattggt tcataaaatt 3661 ctgtgttgac cgactgattt aaaggatagt attctggtcg attaaacgca gcttctcgat 3721 accgttcgta attgctaagc tttgtgatca cgagttcttc attgtaggga tactcgtctc 3781 gcaaaattcc tgatatgatg tcaactgttt gacgcagata gctgcgacca tcacttaaga 3841 atttcttgtc atctaaagga ccacccgcgc ccatatgacc gactggaccg agtgaaacaa 3901 aactgatttt acctagccgt ttggcacggt tccagtagga taagaagaag actttccacc 3961 gctttgcaaa caaaatcgga cctaagcgct caactgggtc tttgtcaccc acgagatggt 4021 aaagatgttc tgccaatagt gcattagtat ttccactgat gacgccagct aaagaaatga 4081 cttcaatggg tgccttgtct agtgcttgct tgagataaga aagtgctccc attgctatct 4141 gtccaccacc gctgaaacca atcagtgtaa ttgggactgc gctaccaggt ttgtaaccgt 4201 ggttaatcag gctattgtac ataacctgcg ctgtaccttg attataaatt ggtccgaagc 4261 gctggtcagc agaaaccgca acaacaagca aattgcgaat attaatcgct aaggtgagca 4321 aagcgttgaa tatactacca gtcttggatt gactgcgtcg ttcagcgaag cgccaaaagg 4381 gcgaaagtat tccaccttcg gtcagtggtc gattgaagac tgagtaaggg atgattcctt 4441 taatgagcac ggtattatct ggtaaagcga tcgctaactc atgcaaaaac tgatcgccat 4501 ctgggaagta cttgaatgaa gcttgaccaa tgccatcaag gtaaataaca taacgaacaa 4561 cattcgtttg cggtgggatt tcttgttcta atattcctag gttgagttta gtatcaactt 4621 tatcaccata ccaaccagcc caccaaccca aggcttcgag gggtgctagg agtccagcaa 4681 agaggacagc aataaaacta atttgtaaca aatcgactgt tagctttaac gcacctccca 4741 aagcgttgta 
ccaacctaac cacagatgac tcagttccgg aaagagtaca gtacaaagta 4801 cggcaatgaa aacaattcct aaaaatatgc tcactcgctt cacattcata ctcttgtatt 4861 gtctccttta tcacctgcgt tattttggtt gatcctacgt aatatctgcg gtcccgcttg 4921 cagaagttgt tctactcctt gcatatctgt caccaagtgt gttcccgcca ctgtattaga 4981 cagccacttg ccaagtattg caactggtcg tccaatagtg cgctgcacaa tctgaaagac 5041 aacccatccc aacacaccgc accacagcgc ttgccaaata ccaacaccaa tggcgacacg 5101 caagcccatg acaaatatta acagtgtcca aatagaaagc aagacttgaa ttgccacacc 5161 caaataaggc aatgccacta aaaaactcaa tatcagcggt gcgtaagcga atcctaatgt 5221 agaccacacg atgttatagg ggatattagc acgaaatagt ataaagctca ccagccatgt 5281 acttaaaccc caaaataaca cggtgaatac aaataacaca gatgaaatga atagactcag 5341 aagaaagcga attggtctga ctcggttaat gaatagtacc gtcccctgac ctacagcttg 5401 agaaaaccca gcaacgagta cgatgtaaaa ggctgctttt gaagcttggg gtagggtttg 5461 aatcagatta aaggcttcgg ggttgagggc gatcgctcca cctgcaagct cccaaaatct 5521 gtcgagggca gtttgtgtca tggcaaatgg cataaaaaag ctgataactg ttttagttgt 5581 tatagtgtat gttaattcac cttaaaattc agtattaact ctgttttttt tatttgctct 5641 tgactaaacg cggcttgctt gagatgagct aaaaaagttc tattgctata cctggaatct 5701 aggtaaatag ctttggtata aacttttttg gcggtatttt gattaccact gtcccaatag 5761 gcgatcgcac tagcaactaa agggtgagga ttatttggtt caagaatagc agcacggtta 5821 ccagtgacaa tagctaagtt ataaatttgt aatctatgca acgctaaact taagttaaag 5881 taggcaattt cattatctgg cttgagaata gctgccaaag tatgaaccag aactgctcct 5941 gttaagttgc tatcaacgag atagacaatt cccaaagcat tgtaagcagg aacaaaggta 6001 ggcttttgat atattgcttg ccataaagat tgtgctgctt gttgatcatg tccagctaaa 6061 tgcaatgtcc aacccaaatt aactcgtccc ataagatttt tgggttcgag ttccacagat 6121 ttttgcattg cggtaattgc ttcaggtagc cgtccttgtt gacgatacaa aagccctaat 6181 tgtcgatagt caccagcggt attggctgca ctggctaaag gtgtaccgag agcaaaagtc 6241 aagaataaag atgtacaaat tccactggtt atctttactt ttgctggttt caacaccttc 6301 attttgagtt tcctttatca gttatcagtt atcagttatc agttatcagt tatcagttat 6361 cagttatcag tttcactgtt cactgttcac tgttcactgt tcactgtttg aaggagttta 6421 tcttaatcca acattttggg 
gtcaattccc aaagcatcga gttgttcagg actcagggag 6481 cgcaacttct ctacaattgt ctcgcgcttg tgtcgttcaa cctcagccct ttgtcgttca 6541 acctcagcgc gttcttcacc tgttaggagt aaatttccct gtgcattcca ccatcgtaac 6601 caaggtagct ctgcattttg ataccgccct tgccagattc ctaattctac cgccaatggc 6661 gcaatcaggt aatgtcctcg ttcgttgggt ctcatgagtt gataagtatt atccaggagg 6721 cgatagactt ctaagagtgc cttttctacc tcataaatcg cataataagg gacgcgaatt 6781 gcctgctcat aaacccaaaa cttgcccact gtcccgtctt ccctcagaga tggcggcgtt 6841 ctatctcgtt cctcacttcc atctcctgat acaaactcaa tcacaatcaa cggcgcgacg 6901 tactccttcc acaaaacgta ggaacgccga atttttccat ttagagttgg cgggacattc 6961 gggacgtaaa accaatccgg tgcttctgct cctttttgtg gcggatcggt taatcgccag 7021 taaataccag aatcctgccc gatacaatat tgaccatcag gatgcagttg ctctaggatg 7081 ggtttaattg agtctgtgag taagatactc tggggatgtt cttgcaggtt tttcacaaat 7141 gtcccatcag aatctggcaa ctgcgtgtgg tcgggtaaac ttaataccga gtcggtagct 7201 gtagtgacga aggtcatagg cttcaccagg tatccatctg attttcattt tatttatttt 7261 tcggaaatgt ctagcaaaca agatttcaac ttctgataaa cctgcacctg ctggttgttc 7321 tgcaccagat actagtgttt caccacagga acttcgcccg gtacagaaac ccggtttcga 7381 taaagttacc gggtttctca gagatctcag gtcctgcaaa gtagctttga cagactacta 7441 gtctaaactc cagttgtttt aggtaggcaa agcataattt gaatttgctg cgcctataaa 7501 gactttagca agtcttattc tcgactcttt ggcatctacg ccaaagagtt tttttatgac 7561 acagaaggat cttcggttgc tttcatcttt tgtaaaagct gttgaacttg gtaaacttta 7621 tcaggtctat tttgtgtctt gaaaatattc aaagctttgt tcacagatgc caacgcctca 7681 tctcggttgc ctttttgcca tagcacttgt cctagacttg ccagcgcatc agcatattct 7741 gggttaattt ttagtgcttt ttgaaactct gcaatagctt cttcccaccg agtttggcta 7801 gcaagagcaa tgcccatcgc attatgagca agggcgtagc ttgaattgag acgaatggct 7861 tgtttgtatt caccaattgc ttcttcagct ttgtcttgtt gataaagaac attccctagt 7921 tgatagtgtg caaaagcatc tttgggaaat tttttaatca acttgcgtaa attttcctct 7981 gcagctgcga agttccgctg attataggaa acatttgctt cttgaatgag acgcgatcgc 8041 tcttgttcgt tccgatctcc tatttgtgct agtataaaat tgctcgcgtt agccctcttg 8101 ttctggggca acgttgagag cgtatcatga 
gtcacaacag aatcattatt atttttttgg 8161 ggatatgtag tatcggctag tactaatggt aatgtatcaa taacagataa tgaccccagt 8221 atcatataaa taagaggttt agcaaatacc gagtaagtta ggcttttcag gttcatcacg 8281 cacattacct gaataattta tcgttaatgg ttttagggaa gcacattcag catcaagtag 8341 aatcagcata tgctgttgag aacattttac ctaaaatttc cactttcaaa aattaaccct 8401 taatgggata tttttgtcta tatgtgaatc agacatcagg gtaagagcgt tcgcccttgt 8461 tttgccaagc ttgttcctcc cgcaacacgc ggactcaaag cagagacttt gggcaatagg 8521 ggggttaaga tgtgatcatc gcatcctaaa gatcatactg tagaaataaa aagctttagc 8581 gatagaaagt cattcaactc agttgaagta taattaacac gtattgattg cacgcgatca 8641 gtaaatggta actgctaatg tggtaacagg taatggctca tgcctcatcc aattaccgat 8701 taccgattac taacaattac tgataaaggt taggagtaaa attgactagc aatatcaata 8761 gaaattttca acatcaagtc tcttggcaga agcgcctaca gtacaagttc gacaacttta 8821 tgtcgaaggg agggctgtca gtatttttgg cattgttatc cctatttttt ggaacttttg 8881 ttgtcatgac aattgttcgt tatatatctg aattgttgtt tcccaacaca gatgtagaaa 8941 gtagttcaga gttgtactgg gaggtattcg tacagcttat tgggctgaga gacacaggag 9001 acaatgccaa tttagcgact aagattattg gtgttatcac aatttttctc ggtttagttt 9061 tgttctccag cctagtagct ttcattaccc aagaatttga atcaagactt cagatattgc 9121 gtcagggtca aagcccagtc atagaagaaa atcatacact gattttagga tttagcgaca 9181 gagttataga cattatcaaa gagttagtag ttggtaatga atcagaagca gacgcagtag 9241 ttgttatcct atctcaaaaa gataaagagg agatggataa tttcctccga aataatctcg 9301 gcgatcttaa aaccacaaga gtggtgactc gcaatggcat aatcactaat ttaaatgaat 9361 tagataaagt tggaataaaa gttgctaaat cggtgatcat tttgaatgat gccaaaactt 9421 cagatcccga tgaattgaag actttagcag atgcacgggt tgtcaaagca gtcttggcag 9481 tggtggcagc taacgaagaa gactccgtgc catctattgt cgtagaatta cactcgcagc 9541 aatatcgccg actcgctgaa aatattgctc ctggagcagt caccactctg aatgaagctg 9601 atattttggc tcggatttta gttcagactt cgcggagtgt aggactagca gcagtttatt 9661 taaacttagt gggttttgaa ggtaacgaat tctatttcta tcgtccagaa aaaggttggc 9721 agagcgtgaa ttttggtgaa ctaccatttc atttttcaaa tggtataccc ataggtgtgc 9781 gtcatgcaaa tgccacccta acaataaaac caagcaaaga 
ttaccagtta gttgaaagtg 9841 atgaagctat tgttttggca gaagatgact caagcatcca atttcatcca caatctgtcg 9901 tgcaacctaa gaattttagc tatagtgact gttgcaaaac cctagagcaa aaacctgaaa 9961 ggcatcttat tattggctgg aacagtaaaa ctcctataac tttaagagaa tatgccaaat 10021 acctgatttc tggttcagag gttaacctgg tggtacagga tttaacatca caagtgaaag 10081 ctgaatttga tactattgcc aaaaattact cacagatcaa gatggatgct ctgcaagtca 10141 atctggactc agttgagcag cttcttcgcc tcaaacccta tgaatataac agcatctcaa 10201 ttttggctct tagaggggaa aattcggaag aaattgatgc taaaactctc accattttgt 10261 tagaattgcg gcaaattttc cgagaatata ctgccgaaac caaaaaccaa gtcaccaccg 10321 aattaattgc agaaataatt gactcacaag acaccgatct ggtgattaaa gcaggtgtga 10381 aagactttct cctcaccaac caatttgtct ctaaaattct cgctcaggta tctcaagagc 10441 ctagtgtgat gtcaatctat gacgatttgt tttcagtaga cgggagcgaa ctgtatatca 10501 agcctatatc gctttacttc tcaagcaaag aacttggtag gttgactttc gctgactgtg 10561 taaaagctgc acaagaacgt gacgaattgt gtcttggtgt aaaaattagc gcactagctc 10621 aaaataaaaa tcaaaatttt gggattgact tagtaccaag tctagataag ccgctaaatt 10681 taacttttaa tgatgcactg attactttag ctgaggatga aacatgacaa gctaatttac 10741 aagttctcat actaattccg cttttataac ccccttttta tcctggcaca ggtgtagtta 10801 taccagactt gtccgtctgc gcggactaat gaaaaagggg gggtttaaga tttttgactc 10861 taacagcaac cgtattggga taaagatgac aactacacta aaaatatcac agcaacagtc 10921 aaactgattt cgttatattt ttttcaaatc ataagacact tgcaataatc cggagtcata 10981 ggttttcaca tctttcaact caagtgctgt ttccaagctt ggatcgttca caatcaatgg 11041 aattccattt cttaaaagaa ttggatgtat cgagagaatc aattcatcga caaaaccgta 11101 tttcatgaaa taatgaattg tctgtgctcc cccaactaac caaatatcat gaccgctcga 11161 ttgacgcaac gcgtgaatga agtctttcca attttccttc acaaattcca cgttgttatc 11221 tctttcgact tgcacagttt ttgagaaaac aaaacctttt ttatctttgt atgggtattc 11281 cccaaatgta agtacttggt aatatgtttt acttcccatc agtactgtat caatttgggt 11341 agagaactca ttgtaaccgt agtcttggtc ggtaaatagc caatctacct ctcctgatgt 11401 cctcgcaatg tatccgtcaa gactagaggc aatgaataac cgtatttttc gcatggtctt 11461 attgcttccc tttcaaggac 
tttgcagatt atagcggttt tcaactcagt agaatacaac 11521 atacccctta cggggtacgt tactagatat atttttccgt ctgtgcagaa ggtttcggtt 11581 tagtaaaacc aaatatacta aacagccagt gataccattg ttcgttagtg ataaaaacct 11641 ttaatgctga ctccaaatcg acatccaaat ctttattttg aacatcgata attagtttat 11701 atgtcaattc caagtgaatt ctatcttcct tagtaaataa accaaagtta ccattgggag 11761 aattttttag ttgttctact ctttcttgag ctacttgacc tttaatttta gttaagcgtt 11821 catcgcgaaa taacccaaag cggtcttgag gatgtgagga tacaagctca ttaaaattac 11881 caattcgact tgtaaaagag gtggtatttg gaggaatttg aagaaatgct atgacatagt 11941 tttcgtggtc agtttcaatg acaatatgtt ctggcaacac tcgtttacct agccgataca 12001 aagttaattt gataggtttg agatgattga aagcctcggc aatggtcttt agtttcccta 12061 tatcatccga gtcagaaatg ataggaatat tgacgtactg ctgttcaaga gaagctttct 12121 gccgagttaa atattcaacg acatattctt ctatttgctg ctcttgggtc ttaacatcag 12181 gaacaagctt tttacgcagt tctctcatgt caaccgaacc aggtttttgc atctcctcta 12241 ttatattgag aaatacttcc gtcaatatct gctgctgttg ctgcacgact cgcaactgaa 12301 taattagttt ttcttcttca tccaatttgc gaacattttc ccttatagca ggtaaaggaa 12361 tctgacgaac tctgtaattg tagtagtcag cagctcggtt taaaaccgca cggatagatt 12421 tttgttcaag aatatcctct aagtcttctg ggaaaaatat ctgctccaaa ggaacattga 12481 gagttttaag tttgacttca agaatggctt ttagttctgt ttcgtcaggt ctttgtagaa 12541 aaatttggta ttgagatacc cgaccaataa ttgagttatc aaagctgttc tgaaactgtt 12601 cccaacgttc gggaaacata ttaaggataa ttaagctatt tgggacgtga gtgaaaatct 12661 ccttgatagt ctccccaaaa ttaagcaata tctctcggtt atgaggaagt cccaatcctt 12721 caagctgatc gaagatgata attaaaggct catccaggat agatagttta cctaaaactg 12781 aaatagcttc taaagaaaac gcttcctggc taaggtcttc cccccagttc ggcaaaccaa 12841 catattctgc ttcctcagta gataaaatgt tggctgctaa ccaacgagtc acaatttctt 12901 tgcgttttga atcagtgtag ctacaatact tgataattcc tttgaggatt gaaggtgcaa 12961 aacctccagc ggagtaataa ctgacccacc attcattgaa tcgtttctca atgtgctgcc 13021 agtactctcg cttacgcaga gttccttctc cgccaagggc atctatattt ttatccttga 13081 gtgctttaag aatgtcttgg tctttgttgg taagatttgg gaaagtagcg acaattttat 13141 
gaaagctatt aacaattaaa tagtctaatt gagtaaatgt tccaactcgc tcgactaaag 13201 actccaaaat ccgactgtaa atatggtaca gcactgactt aggatggttg ggttgacgga 13261 taaaaagcag tcgattactc gataatcgtg cttgtgctaa tctcatcatc agatgagttt 13321 taccagaacc tggttctcca atgaccaccg cccctttact ttgacgatta gagtctagct 13381 taatgtcttt taagattgac tcaagggtaa taaactcttt gtgataggta tcaaggtaat 13441 caggatgatt ttgaaaaggg gtatcaaccc gactaccact aaaaggatta ggttgctctt 13501 ttaattgctc gttcattaac ttacgataaa ggtgataggt aaattcagca tcatatcggg 13561 cagcatgagc cagacccttg ataaagtagc ggttattaac ccgtagattt aaatatttac 13621 tgaggtactc taaagagtag ctttcaagat ctaaccaata atttttagct ttattccaag 13681 tacactcaaa ttctaaattg ggcaatttca aaccagcaga acgaaaacta gactggagga 13741 ccttgagatc atgctcagca taatgacaaa ctattttctt accctgaaca atagtcaaaa 13801 attctgttaa cagagttttc agacttttaa ggttaggtac attgctggtg ttattcggat 13861 ggtttttaga aaatgcttca tatatgacac gtccttgact atccacaatt gccagttcac 13921 ttaattctgg gttgccttca gtatcgagaa ctacaaagtc catatgacgc taactctaaa 13981 tactttggta acaaatccga tcgcgacaag gatgggtatt cattttctaa cagtttattg 14041 tggttgtgga tactcaactg tttcgcctca tctttgcgtc aacccaatta aaaagcgaag 14101 ttcacgaatg aactgactaa tttctagctg gttgattgtt aagggttttt gaatggtcat 14161 gtagtgtaaa atgttccagt tcataatgat gtcgcaagta cttggctaca cttaagctac 14221 attcaaactt caatatcatc cggaaatata cagaaaaaat ttttgatgag agccatgtta 14281 agtggatttg cggggcagag atgacgggcg atatcgcacg cacggttaac cgcttgtccg 14341 tgcgactagt gagaagagcc aagccagcag caatgtcagc aaatttgctg agagtcaagt 14401 agagttgtaa ccttctcaac aattctatct gggtcagtga taaactattg ggctagattg 14461 cgattgctta caactggagt ttagactatt aatcgctgcg tttgttatag gttgttttga 14521 atgtaacgcg cgtatatcac tgatagtatt gtggaaaaaa tagactctct taccacagaa 14581 cggttctatt ttggagatat ctagtagtta tttgagtttt agctgtggtc aagataactg 14641 ttttgagatc attgaggcaa actgagtcgt gaagaagcga attcgcactg tgctgtttat 14701 cgtcttggct actgtcacgc ttgtaattgc aatagtagga agacccatca tccatatcag 14761 cttaacagct tggaacgacc gaagcgaaat tcaacctcta cccccaggct 
tgattgatga 14821 tgccagtcgc ctcaactcta cccccgtcct tgttcgagag gttcctgaca acccgaaaaa 14881 agctgttcag gtgcttcaag aactattaaa agtagcacgg tctactggaa aaaaagtcgc 14941 acttgcagga gcacgccata gcatgggtgg tcacactatc tatgctgatg gcatctcgct 15001 agatatgtct aacttcaaac acatggcact tgatgagaaa accaacgtcc tccatgtggg 15061 atcaggagcg agatgggcag acattattcc ctatctcaac gctcatggtc gttcggttgc 15121 actgatgcaa tccaataacg atttttcaat cggcggtacc atgagtgcga atgcccacgg 15181 ttggcagcac aatagccctc atgagtgcct ccaccgtaaa cagctttcga cttatgctag 15241 ctgatggctc tgtcgttcat tgctcaagaa gcgaaaatgc ggaactcttc tctctcgttc 15301 tgggaggtta cgggctgttc ggcatcattc tagacgttga tctgcaagta gtcaaaaacg 15361 aaatgtactt ggctcaacga gtgattattc caactcagca atatgtggac atttttgaag 15421 aactcgtcaa ccgagtaacg gacattggga tggtgtacgg caggatttct gttgctcctg 15481 actacttcct caaggaagcc atattgacta tttatcgtcg aaacccgtct aaagatggca 15541 aaatttcacc cttaaatgaa cactctcgta ggggtttgac acgaactgtc tttcgtggcg 15601 aagtcggaag taactatggt aaaaaccttc gttggcaatt ggagaaggcg ttaggaggag 15661 aagccggaag caacgtctct cgcaaccaaa ttctcaaccg ctcatccaaa ttgcttgaga 15721 atcaaaagct tgcatcaact gatatccttc atgaatattt cattccgcca aagtcgatgg 15781 agacattttt ggaaaagtgt cgtactatta ttcccaagca caatggcgat ttgctcaaca 15841 tcaccgtgcg taacgttcat cgggattcag acagttttct gcgttacgcc gaccaaaatt 15901 tgtttggtct ggtcatgctg ttccaccagc accgcacccc caacgccgaa gttcagatga 15961 aagcgatgac gactgaactc attgatgccg tgctttcggt gggaggtagg tactatctcc 16021 cgtatcgcct gcatgctact aaggaacaat ttgctcgtgc ctatccccaa gcacaggagt 16081 tttttgccct aaaacgcaaa tacgactcaa aggaagtctt ccagaatcag ttctatctaa 16141 aatacggtaa gcagtaacct aagttaatta gtataaaaaa aacatctata ttaataaggc 16201 caatcgcttg ttccttctgt taacacgctc tgcctgcctg caaccacaca agactcccct 16261 ctactttttt atacctcatc ccagttctgc ttaacagtcc aattgaacta ttaaagggtg 16321 aaactccact tttagtctaa actccacaag aaagaactga gatggtagtc atcctacgcc 16381 ctttccatca gcatattcaa tcttttgaaa cttgctgaat catccacaat ttgttacaaa 16441 agtacaaaga acttctataa acttaaacta 
gaatgctagg cggacttact ggtttaacag 16501 atcctaaagg cactgactgg ggcgagcaaa tgctcaacac tgtcgcgagc caaacgattc 16561 gccgcttgtt cactcaaagc gagtcagtgg aagtctctgt tcgctgctat ccttcaagca 16621 agctcttgca gggtagcatc gatagcttta aaatgaaagg ccgtggctta gttatccgca 16681 gacaattcgc tgctgaggaa ctttctgtag aaactgatgc cgttgccatt gactttagct 16741 ccgttttgag cggtaagcta cgtctgaagc aacctactca ggctattgct caagttgtct 16801 tactagaagc aggtatcaac gaatctttta aagcagaatt ggtgagaaag cgcttagaaa 16861 atcttactgc accagcactg acagcgttat ctggtggtca accagtctct tttcccgaag 16921 tccagataaa gttattgcct caaaaccgat tgcgaatttt agccaaggca gatttaaaca 16981 acggtacact tgtaccacta aatatgactg tgactatagg tattgaacgg cgacgacgtg 17041 tttcttttaa agacccagaa attgatctta acgaggtacc agaagcacaa aaggaaatct 17101 cacgaacctt gagtttagcg ttggtggaaa ttttagataa tatggtggat ttagaccgct 17161 ttgacttgga tggggtgaaa atgcgtctta atcgtttaga aacagagggt gaacggctca 17221 tttttagtgg ttatgcagag attgagcgta tccccaaaaa tccctgattt ccattcatct 17281 ttatctgctg gaggcagagc ttctaaaata gcattcccag gctcagcctg ggaacgagaa 17341 attaagtgtt gagttagcga ccaataccta cgtactggaa gcccgctcta accattatct 17401 caggatcaag gaaattacga ccgtcaatca tgacagggtg gctcatcaac tgtgccattt 17461 tcgcgtaatc caagtttttg aactgctgcc attcggtcac aagcactaaa gcgtcacagc 17521 catcagcaag tctttcggca tcggtttcta ccagcacacc agaaagacca tgacgcatac 17581 ctgtttgaga aacaattggg tcgtatgcct taactctagc tcccaatcta ttcaactgct 17641 caataacatt aagcgctgga gcatcccgca tatcatcagt atcaggctta aaggttaaac 17701 ccagcaatcc cacagtttta cctttaagaa ttttcagggt ttgttggagt ttttcaatgg 17761 caagcaaccg ctggcgttcg ttgacactca cagcagcttt gagcaattgt gtctcgtaac 17821 cgtagtcatc tgcggtgtga atgagcgcag aaacatcttt ggggaagcaa gaaccacccc 17881 aaccaatacc agcgtttaag aacttgttac caatacggga gtctaaaccg ataccttttg 17941 ccacttgagt gacgtcggca ccaacgcgat cgcaaatatt tgcaacttcg ttaataaaac 18001 taattttggt tgccaaaaaa gcattagcag cgtacttaat catctccgct gaactcaagt 18061 ctgtgaccag cactggtact gggggtagag atggatcagt tgcaaacttg cgctctacaa 18121 tgggggtata 
cagttctttc atcatggcga tcgccctagg actgttgcct cctaggacga 18181 tacgatcagg attaaaagta tcatatactg ccgagccctc acgtaaaaac tctggattac 18241 tgacaacatc aaattgagct gtaatctcag gtaacttatc gtaggttggc gctccacctg 18301 ctgtgatcag tgtattttgg cgttcggcta tgccatctaa cacaatcatc cgcacccagt 18361 ctcccgaacc aatgggcact gtggatttat tgacaatcac cttataccca ccgtctaaat 18421 gggcaccaat accacgggcg actgcttcaa catagcgggt atcgctttcc cctgttggta 18481 aaggtggcgt tcccacagca ataaataaaa tttccccgtg ggcgactcct gcagctaagt 18541 ctgatgagaa ctcaatctta cctgtactaa tggcagactg cataatatct gacagtccag 18601 gctcaaagat gggcgactgc ccggacttca ttattttgac tttctcttcg ttgttatcta 18661 tacaaacaac atcatgcccg atgtgagcta aacatgcacc tgtgactaaa ccaacataac 18721 cagtaccgat gacacaaaca cgcattttgt gctaatcctc gctttttggg gttaattttt 18781 tgagaagata ttttccgttt tcaagtcctg agtatcaaac tctccgaatc attactcttg 18841 aggcgatatc gcccttgtgc atcgaggggt ttcccttcta ggaaggctgg gagctactag 18901 ggggcgctac acgctcgcgg aaatcttcta tagtcagctt taacccctgt tgcaaagcaa 18961 tactaggttc ccaatttaac caagtttttg cccttgtaat gtcgggacga cggcgacgtg 19021 gatcgtcaga aggtaatggt tcaaacttga tctgtgcgtc ggggttcact aaatcttgta 19081 ctgttttcgc caattctaga atcgtatact catcaggatt tcccaaattg actggaccaa 19141 cgtattcgtt attcatcagt cgtatcagtc cgtctaccaa atctgatacg taacaaaaac 19201 tacgggtttg caaaccctct ccgtaaacgg ttaaaggaat accccgcaat gcttgaacga 19261 caaagttgct cacaactcga ccatcgtttt ctagcattcg aggtccataa gtgttaaata 19321 tgcgggctac ccgaacatcg actttatttt gtctataata atcaaatgtc aaagtctcag 19381 caatcctttt gccttcgtcg taacatgagc gaattccaat aggattaacg ttaccccagt 19441 attcctcact ttgaggatga acttctggat caccgtaaac ttcacttgta gaagctaata 19501 aaagccgtgc cttgacacgt ttagccaacc ctaacatatt aagcgtcccc atgacgttag 19561 ttttgacagt tttcacgggg ttgtactgat aatgtactgg ggaagctgga caagccagat 19621 ggtaaatttg atcaacttcc aagcgaattg gttctgtaat gtcatggcgg atcatttcaa 19681 aggaaggatg atctaaccac ttgagtatgt tccgtttgtg tcctgtgtaa aagttatcca 19741 agcacaggat atcgtctcca ttagctatta atcggtcgat tagatgggag ccaataaacc 
19801 cagcaccgcc cgtcaccaaa attctcatag ttccccaatt acttacagat atttgcagta 19861 gtgattgcgc ttgtcttcag atgacccagc cttgaaacca aagtacacta cttgtgtctc 19921 attgagaagg ttttgttaaa cctagcagtt tgaccatgaa tgcttttttg ttatctttgc 19981 atcttttggc ttcaaatttg aattatttca gtcacaaata cctttgtttc cattagaaaa 20041 atagcaaaat catctgcaaa atgtaagctt tatcgctaga tttcaccaaa tctttaagaa 20101 tataaagttt tccgtagaat ttcaaaactt ttactcacgt aaagttggtc aaattttcac 20161 ccacaaactc aagaagctaa gtaaaacaag ctttggctta tattcagtat gaattctgta 20221 taactgagtc aagaaattca gatctatgct cttgtcaaca atctttgatt tgagagtctt 20281 cagttaaggg ggattaccaa tcttccttta atgacattca cagattctga gttttgtgtt 20341 ctgctttctt gttcatagaa gaggaaaact ttttccgttt catttttcca gtcctgttaa 20401 taggtgtgaa atgtgggaaa agtaagtttt ctagattatg atccttgctg aagtttttgt 20461 tctttgaatt cagaccaggt tgcgttctga tagaggtcaa aaaaataaca tgctcaccag 20521 ttcatatata tcttggtaaa accgacgatt tttgaggggt tctttgagga tcacagtcaa 20581 aagttaaaaa gaactcttct taactgaact gtactgacat ataaccatgt tgaattacac 20641 aattccctct gacatagcag cggtggtgct actggtaact tcagtttgta caacgacatc 20701 ctgagtacgt tgaggttctc ctagcataac aattgaacag gtgagttgct tagccaattg 20761 agttgtgaca tcgctaatgg ctaatcctcc gggaacagtg cgattgcgta taaaaggtaa 20821 tacgactaag tcatatagtc gcgctgcttg caaaatcgct tgggcagtat tttcatgagc 20881 tataatttgg atttctggtg gattttgcag tgctaatcta gaaatcaata gagatagttg 20941 cgatcgcctc caagcaatct tacttgaact agtacgtcgt tcgcagacat ttagtatggt 21001 aacttgagct tgatttgcat ctgctaaaat ctgggcaaat tttacgggct gcaatgaagg 21061 tgtcatcaaa ttttccaatg gtaccaagat gcgctgaaat ttttttggtg attccaccag 21121 acgtgtgact gctactggac aatgggatgc ccacaaaacg ttatcaatga cgttaccaaa 21181 taaacgtgct ttgaatccag tacgtttacc ccaacccatg ataatcaaac tagccttttg 21241 ttctcgagaa gccctgctaa ttccttgggc aaaagcatcg tcaattctca gcaatggttg 21301 tgcttcgact cccaagactt tactcagtgc tgtggctttt gccagtaacc ggtgacttct 21361 ttgcacagaa gcgtctaatt ggggtgcatc catgtgagca aaagcagttg cgatcgctag 21421 tggtataatt ctgccatgag actctcgtgc caatagcgcc 
gccatttcaa taaggtactg 21481 ttgtgtttga ggattgtgga caggtactac tatagtgtat gagctatcgt tttcttcttg 21541 cttctggtaa gctgggttga cactctgttg gtccttaatt cctgagttcg tcaaacctgc 21601 agcgactcga ctggttataa atggtcccaa ggttgaagtg atgagcataa ggacaataac 21661 gctagtcaat atcctagaat caagcagacc agctcgatat cccataatgg ttgctcctaa 21721 cgttgcaccc acttgaggca cagatagtga ccacattgtt agcgtttctt gccagttgta 21781 gcgataaacg agttttgcca aaaaagcagc gataaattta cttgctacta aaccgataat 21841 aataaagaac gtgaactgta gggtctcaat gctttgtaaa aaggcaggca cattaatgag 21901 caagccaagg ttcacaaaga agatgggaat aaacaacaca ctgccaacaa acacgacctt 21961 ttctttaact ggaccttctc ccacagtttc attaacagcc aaccctgcta aaaaggctcc 22021 aacaattttt tctactccaa tcaattgtgc tcccacagta gcaagaaaaa cagagagtag 22081 cacaaacaaa aactgattgc cctcttcatc accagaacgc cgaaaaaatt gtttccctgc 22141 ccaatcaaaa cctaccaaaa cgacaattga gtagagtgct aataaagtta ccaacgtcat 22201 cagccgtcca aagctgaatg gtccagcttt cattgctaca cagatggcta acaccaacag 22261 ggcagcaata tcagtaaaaa tagtggctcc aatagtaaca gttatagctt cgttattgac 22321 cacacccaaa cgactaagaa tgggatatgc caaaagggta tgagatgcga acaaagagcc 22381 aattaatatt gacgaattcc agtcaaagtc aaaaaaccgc ccaacaagag ttcccacgag 22441 taaaggaacc aaaaagctca aactgccaaa tcccaacgac tgattttttg tgcgacgaaa 22501 ctgctctata tctatttcta accctgcgac aaacattaag taaactaacc caatatctga 22561 taacagagtc ctgaacgctg attctgtgtg taataaattc cagccgtaag gaccaagcac 22621 gacaccggaa actagcaaac ccactaatcc tggtagttgc aaccgctcaa acagaatcgg 22681 tacgactaaa atgactacca gtaaaatcac aaaaggaacg attggttcct tgccgaaaac 22741 ttgggaagtt gattccagcg caagaacttg tgagatgagt tccataggta ggacgcagaa 22801 gtgggcatag tgaggctttg attacttaag cgtaatcgct gaatttattc aactgcaaaa 22861 gtcaccccaa tgaaaaaaat caaaaaacct tgcctagatt acctaaacga tgatcactga 22921 actaactacc tatggatgga tctggctcaa tatgaaccgt aagcgatcgt agtcgtcttt 22981 aagcaacacg ctcaaggtac gtccagcttt gatgatccat tgtcgatcag ctttgcgtaa 23041 ctgataaact tctcgaatgg cagctgcagt taaagattct acaggagcac cattgacaga 23101 aagtagtgtt cgcaaaaaat 
ctagttgttc ggtaaactta ataattgctt cgctttcaca 23161 tcggtaaact gcttgatgaa cttgtggggg acgaggaggc agatactgaa ataccctagc 23221 atttgaaaaa tgagcatctg gtgattgctc aaatcctaca attgcagcac gaaactctgt 23281 ttttagcata gcaaatatct ggggttgttc ctctcgcaag tcctgcaatg acaatcctag 23341 agccactgcc cgatgtacgg aatccacagg catagttgtt gcagaataca ccacagcata 23401 aatttgattg cctgactcct catcaactga acgtacccag ctgccaaagg gaggcataga 23461 gggaaagctc aaatcgtcgg gttccaaaca ctgagctaaa aattcgcaac tagaagtctc 23521 gataacctcc gcgatatgat ttggatggcg atcgcttcta tcaaactgtg gtaaaggaag 23581 gcgcataaac tattcaaaaa aaagaattca gcaattccac cagtcaccaa gtcacaagtc 23641 agaacgcaat tagtgaggtc aagaaaccct caagtcagga accacttgca ctccgcagcc 23701 attctgactc ctgagtcctg aatggcagtc gcacgccaca tgctatctcc tgcacagacg 23761 ctacgcgaac aacggaggac agcgacgcac cgcagtggct cctcaagtcg cgaaaccctt 23821 ttggcagagc cagcacgagg aaaacccgcc cacggggctg cttcctcgtg actaac // LOCUS NODE_1303_length_23777_cov_5.43980323777 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 23777) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 23777) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..23777 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(76..1233) /locus_tag="DP116_11780" CDS complement(76..1233) /locus_tag="DP116_11780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212752.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11780" /translation="MSYSTQQQIISIPIPVKFRETALEFAQEQPTQQKAKQVYVNTLA VLVVNSYLEMLDIPTDLEASHCWNQYGRLMANVADLLLTGVGRLECRAIRTGDRLCYV PPEVWDDRIGYVVVELNKTCTEGKVRGFLPDIKTSQIDIEELQPLERLIESSHLVHLR QWLEGIYTSQWQSMEELSRQRSPQLAFRFRRIGGFQLNTSEEVWKLITQLYPHRSWEN NLPSELLEKMSGNQVEDVESPQHNINSLSVIDVLTHLLKTTEDEEKRWKFAETLWTIE PNHPAISARRIMDLGMQLVGHSVALMVAILSKPDKTVAVLLRVYPIGNQPYLPPGLQL AGLYENGQPFLEVKARVVDNYIQLKFCAEFGERFGVEVSINNASITEHFVI" gene complement(1251..1916) /locus_tag="DP116_11785" CDS complement(1251..1916) /locus_tag="DP116_11785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015083270.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sigma-70 family RNA polymerase sigma factor" /protein_id="PRJNA477356:DP116_11785" /translation="MDELEAQILQLVQETCQHPRGSIERQKGLHKIFLLIQQTGKLLR GTGVPDAEEALQKTWLYFCRNLCEADTVKEPYNSGKGSVITWLNGYLKYRLEDSRRVG SNNTIHPIQDQNGEELDPVNLIPARPEAPPILEEIQQWLKKEANNLRRIHVQDRPDIN CLVLIERRLPPETSWKDLSDEFGVSIPTLSGFYQRQCFPRLLNFGKSQGYINSEGYLN IES" gene complement(1998..3269) /locus_tag="DP116_11790" CDS complement(1998..3269) /locus_tag="DP116_11790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868490.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flotillin family protein" /protein_id="PRJNA477356:DP116_11790" /translation="MEIIALLLGIFGTGTAAGWWVIRNLYYICQPSEVLIFAGSRTSL GDGNSVGYRLVKGGSRIQNPLLEKTFRMDLTNMIIELRVSHAYSKGGIPLTVEGVANI KIAGEEPTIYNAIERLLGKSRKDIEQLAKETLEGNLRGVLANLTPEQVNEDKIAFAKS LLEEAEDDLEKLGLVLDNLQIKNIFDEVRYLDSIGRKQQAELLRDARIGEAEAKAEAI IKTSANNRITKLRQIERDLEIAKADAERRVRDTLTKRTAMIAEVEAVVNAQVAKVQAE VGVQTERIKKVENQLQADVVAPAEAESQRAIAKAKGDAARIVEEGKAQAAGTQYLAES WQNAGASAREIFIYQKLEPLLRMLAAGVPEVQVDNLTVIDATNGGSVPKMASFLEQLR QTTGVDVTKVINQLKSEERNSKYEIRSHLDK" gene complement(3514..4107) /locus_tag="DP116_11795" CDS complement(3514..4107) /locus_tag="DP116_11795" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11795" /translation="MNTLKTFAITIIAVTLGTQNPSAALINKSASHQNLVFKAPLQPD VIANDVTYSTAQGDTKIWMPGRITKNKPQELASENPTTKTAYFVTYIDLSEDVEYLST VGIRKVMQNEFRKQFSSKSEFSGKVVRSTDLVIDGYPGIEFLVQHPNAAWGQYRFFLV KRRMYFLGSVAPVELTTETANFFDSFRVYPEQIHYSH" gene complement(4153..5493) /locus_tag="DP116_11800" CDS complement(4153..5493) /locus_tag="DP116_11800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868489.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flotillin family protein" /protein_id="PRJNA477356:DP116_11800" /translation="MKSQQNIATVPVAQINIDTPTNKNNNGVEGLLFSSIPIAVSIFS AVVFVWFVKSFLCICKPNEVLVLSGRRWRTQDGQEVGYRVLLGGRAIRIPIVETVKRM DVTTMPVRVEVRNAYAKGGTPLNIQAIANVKISTDPGVVGNAIERFLDRDRSELTRVS RETLEGYLRGVVATLTPEELNEDRLSFAHRIASDVSRDLTKLGLQLDTLKIQSVSDDV DYLKSWGRKQIALVVKNAEIAESNAIAQAEQIEAQCEEHAQVAKTQDRIIVLEKENEL RTIKAKLEQKARCEEEITTAVAQEKKAKAEQVLQALRSELERLRLQADEVLPAQAQRQ AQEIRAKGEAAPLEENAKAAALVNDILSQVWQQTGTDASKLFLIQQIETVLQEAVQIP KRIQLKKVNVIDNGDGKSLASLVNVYPEIVLQFLESVNQTLGIDVTGTLNQGKD" gene complement(5578..5898) /locus_tag="DP116_11805" CDS complement(5578..5898) /locus_tag="DP116_11805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868488.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NfeD-like protein" /protein_id="PRJNA477356:DP116_11805" /translation="MKNLILIIAGIAVFVGIFGGVLIVGLISLQRRRQVVDSLVRSDN IIGCFATVEIPLNYNSPGKVRVNLKGSLVDFVAFTDETQQLHQGARVVVVGMKGNKVW VVSV" gene 6327..7259 /locus_tag="DP116_11810" CDS 6327..7259 /locus_tag="DP116_11810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875340.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1338 domain-containing protein" /protein_id="PRJNA477356:DP116_11810" /translation="MNPEIARHLWELLWQDYSIRVSYARTYAQMITDAGGTVANDHIA FRSLRMEIDSPQGKVNLGIDYLGQLAEALGYEAAGEYSFPETHLYARHYRHPQQEEFD LPKLFISELIVDELPEKTVNLIQQTVSGVNLFHSSAILHTFATKTERLAKELHRIFTS HWQPPRRSVVEEVNKVTQYGAWVLLHGYAVNHFTGYVNRQNTPKYPDIETTARGLANL GVPMKAEIEGDIASGLRQTATQAVTEIVTVLEDNVDVEIQLPWTYAYYEIAERYMVEV EPDQQVLFDGFLGRNAQQLFEMTRLNINKNNVET" gene complement(7439..8587) /gene="tal" /locus_tag="DP116_11815" CDS complement(7439..8587) /gene="tal" /locus_tag="DP116_11815" /EC_number="2.2.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458796.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transaldolase" /protein_id="PRJNA477356:DP116_11815" /translation="MASNPLLTIKDYGQSIWMDNLTRDLIESGELQRMISSRGIRGIT SNPAIFEKAIAGNALYDADIEAGIRAGLPVSKIYEYLIFEDIRNACDIFRPIYEESGG LDGYVSIEVPPTIAHDTKATIDEAQRYYHEIGRENVMIKIPGTLSGAPAVEQVISDGI NVNVTLLFSIQSYEEAAWAYIRGLEARVNKGQDISKIASVASFFLSRIDIKVDQMVEE RIKTIGTEDLAMQARLVAVKGKVAIANAKIAYQKYKEIIHSERWKAIAEKGANVQRLL WASTSTKDPAYSDVMYVNELIGSDTVNTLPPATIEACADHCDVANRLETGVKEAYQLI DSLKEPDININLDEVMTLVLDEGIDKFIQPYQSLISSMEEKIKQLSPA" gene complement(8590..8919) /locus_tag="DP116_11820" CDS complement(8590..8919) /locus_tag="DP116_11820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11820" /translation="MKTQQKSEIDQPVSIDKHTKAEPSVKERNLTVNPGDVIAEKPQS IQERAKQIAVDSPDITGDWIKVPTYFIVEYPNGEKKALHHVRDAKQISDVIRQARVDE NGNRIWW" gene complement(9276..9485) /locus_tag="DP116_11825" CDS complement(9276..9485) /locus_tag="DP116_11825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007353634.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acetyltransferase" /protein_id="PRJNA477356:DP116_11825" /translation="MLLQDKQTGQLVEIRDFEALISPLKPTVAGQLQAGEEEQEPEDF KKEQLLFPSGESLPRCWIDENYKDS" gene 9939..10481 /locus_tag="DP116_11830" CDS 9939..10481 /locus_tag="DP116_11830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009631655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11830" /translation="MIVQDYLFKSTLSFLTCVSVLLYSNSAIAAEKIVLKYGIFRESV SVEKLTKFAETGEVSPMLNLLLNQARQDPQTVRNVLTKEVNASPVVLDRLLNNPIGEF LVDQIGQTIHTPSSEANPQALRSALVLSANKDNKVSLIEIIQNYPTQEVYVEGERLVR TYDQLSLLAERLQNLLSWRE" gene 10574..11071 /locus_tag="DP116_11835" CDS 10574..11071 /locus_tag="DP116_11835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315372.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11835" /translation="MNTVHSQRKKASFPAAIFLSALLTLPVLSGCGGGSRTAAPPPPV EDSVTRTVNNPTPANQPQTKKGLSTGQKVAITLVGAAALYYLYNRHKNSQKEGAQGKY YLSKNGRVYYRDEQGRPHWVTSPSGGIQVPESEAQQYRDFQGYNGRTTGRSLSDVAPA NAPAF" gene 11357..11692 /locus_tag="DP116_11840" /pseudo CDS 11357..11692 /locus_tag="DP116_11840" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: GeneMarkS+." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 11466..11475 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(12036..12467) /locus_tag="DP116_11845" CDS complement(12036..12467) /locus_tag="DP116_11845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872989.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11845" /translation="MDSELQQAEYTDAKIKEGIKELEGSEPGSLAMLPPATQTEEEAQ QFGRQISTYLAELPEYLGSFFNEYKLPVISFALLVVTVTLLRLMLAVLDAVNDIPLLG PLLELTGIGYTIWFTSRYLLKGSTRQELGAQLNSIKKEIIG" gene 13577..13935 /locus_tag="DP116_11850" /pseudo CDS 13577..13935 /locus_tag="DP116_11850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212489.1" /note="frameshifted; internal stop; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="MFS transporter" gene 14402..15190 /locus_tag="DP116_11855" CDS 14402..15190 /locus_tag="DP116_11855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873986.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="multidrug ABC transporter permease" /protein_id="PRJNA477356:DP116_11855" /translation="MKRFFRKALTLLSVYYAYMLEYRAELILWVLSNSLPIILMGVWI QAAQGGRFNLSPVDFARYFLAVFIVRQITVVWVVWEFEREIVEGKLSPRLLQPIDPVW HHIAAHVSERSARFPFTLFLLVVFFLLYPQAFWVPSFTKIFLFLLASVLAFVLRFIMQ YSFAMLAFWTERASAVENFWFLFYLFLSGMIAPIEVFPESVRAIVLCTPFPYMVDFPA SILVGLPVDLARGFLSMVGWILVFLGLNRLLWRRGLKRYSGMGA" gene 15710..16483 /locus_tag="DP116_11860" CDS 15710..16483 /locus_tag="DP116_11860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11860" /translation="MDWQEISGNWVLVPRNPVGVIHFLGGAFVATAPHLTYRWLLEQL ASKGYVVIATPFVNTLDHSAIAKSVLLNFERALERLQDSSAIRKRYLPIYGIGHSMGC KLHLLIGSLFSVERAGNILISFNNYAARDAIPLIEQFNLAAVEFTPSPLETNKLVQDR YNVRRNLLIKFSNDTLDQSTALTELLKQRFPEMVIAQTLNGTHTTPLGQDIKWQTGES FTPFDAFGQWFKQEVYRDLNQLKSIILFWLNPLAHPIIN" gene 16557..17702 /locus_tag="DP116_11865" CDS 16557..17702 /locus_tag="DP116_11865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747161.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="regulator" /protein_id="PRJNA477356:DP116_11865" /translation="MNSAFQILVIDDDPAVQILLLRMLERQGYKVLVASNGKDGITKA MACSPALIICDWMMPGLNGLEVCQRIKTDPQLSTTFFILLTSLDSVADRVKGLDAGAD DFITKPIEQNELQARVRAGLRLHQLSKDLQTQKEILEAELAEAAEYVRSLLPIPMTHP IQINFQFIPSRQLGGDCFDYYWLDSDYLAIYLLDTAGHGLRATLPSLSVLNLLRSHAL AGLNYYQPSDVLQALNNTFQMNYRNDKYFTIWYGVYNRVKRQLIYASAGHPPAVLVSG KSPTKPKVQRLKTPGMPVGMFPEVKYVDDFCQIEESSSLYIFSDGAYEITQSDGTIWT LDAFIQMLVTSQLQSEGKLDHILNGIISLNSKEAFDDDLSMIQVKFD" gene complement(17699..18034) /locus_tag="DP116_11870" CDS complement(17699..18034) /locus_tag="DP116_11870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anti-sigma factor antagonist" /protein_id="PRJNA477356:DP116_11870" /translation="MVEEIKVIQPSGILDAAQSQELRKKIDEILQKESSTKTVLLDLK EVTFMDSSGLGALVLAFKALKSVDRNMVICSINEQVRILFELTGMDKIFKIYSNREEF NKTVSSGIN" gene complement(18210..18884) /gene="grpE" /locus_tag="DP116_11875" CDS complement(18210..18884) /gene="grpE" /locus_tag="DP116_11875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872128.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotide exchange factor GrpE" /protein_id="PRJNA477356:DP116_11875" /translation="MSHIDFTSQLHDLMQRVGISSFKALSRTAGVSERQLVRLRRLGV EQMRVDVLLKLAQALQITFNELVTIFSQQEFVKREDTSTNESQQLLQQIAELKKEYER SQQQIQQQRQVLLQEFQQSSLQLLESLLVQFPTAAHKARENPQLSAINIVPLVHKPLE RLLQQWGVEAIARVGAELPYDPQLHQLLEGTAQVGELVKVRYIGYHQGDKLLYRAKVS PIPPQA" gene complement(18995..19258) /locus_tag="DP116_11880" CDS complement(18995..19258) /locus_tag="DP116_11880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131976.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chromosome partitioning protein ParB" /protein_id="PRJNA477356:DP116_11880" /translation="MIKVQEIPLNQIRRPLPRTNDPHKVKALMESIAAIGQQEPIDVL EVDGQYYGFSGCHRYEACQRLGKETILAKVRKAPKSVLKMHLA" gene 19595..21190 /locus_tag="DP116_11885" CDS 19595..21190 /locus_tag="DP116_11885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315201.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amidase" /protein_id="PRJNA477356:DP116_11885" /translation="MKIIWRISYTVLTTFCLLGLTGASGPASNTKDVEIKIGIVQRFG DNSTEQLQLEPTKGDRLRLKFQDGNRQQTLVTTNPVKLEVLMEALPAAVVKERVVLGT YRTYETAEDSAKHWRDKGIEVEIAQPERWQVWAKREVYNTPLLRRLLLSSVETAAQKT AFVDSQVLQNVPRVTSLLNGKRYTIDNLEISSDKNLIRVNESKKPEKARLYAGRMKLQ PNAYGSYTLVNEVALETYLRGVVPYEIGASAPTAAMEAQTVLARTYALRNLHRFVIDG YQLCADTHCQVYYGLNGVTPNTDRAIATTRGMVLTYRNELVDALYSSTTGGVTASFSD VWNGDDRPYLQPIVDAPSDKVWNLSGQNLADENQFQRFISIKIGFNESQKDLFRWRKE SSLKDITKGLQKFLKVKNSPYAKFQTIRDMKVVERSKSGRILKLDVKTDIGTFSLHKD EVRSAFAAPVSTLFYLQPLNKGESELWGYAFVGGGLGHGVGLSQVGAQNLAQLGWESQ KILQFYYPGTKIQILGNGISETN" gene complement(21528..21827) /locus_tag="DP116_11890" CDS complement(21528..21827) /locus_tag="DP116_11890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320150.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_11890" /translation="MATYKVTLINEAEGLNQTIDVEDDEYILDAAEEAGLDLPYSCRA GACSTCAGKITEGEVDQSDQSFLDDDQIQSGYVLTCVAYPRSNSTIETHQEEELY" gene 22458..22994 /locus_tag="DP116_11895" /pseudo CDS 22458..22994 /locus_tag="DP116_11895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015158520.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="helix-turn-helix domain-containing protein" gene 22994..23560 /locus_tag="DP116_11900" CDS 22994..23560 /locus_tag="DP116_11900" /inference="COORDINATES: protein motif:HMM:NF033545.0" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_11900" /translation="MSQYAAIVLPTYKNIRYFVQDESRFGLKTIEGRRITLKGVKPTG DWQWQFQAFWLYGAVEPLTGESFFWQFSHVDTECYQQFLNKFAASNPNSLNIIQVDNG LFHKAKKLQIPENIILLFQPPHSPELNPIERVWEHLKRNLKWELFDNLEHLRIKVAEL LAQLTSEVAASLTGYDFILNALSVANIF" BASE COUNT 6750 a 5000 c 4864 g 7153 t 10 others ORIGIN 1 gtctcaaaac tgccttctcc aatctcgaag ataacccgca tgcccacttt tctcacccta 61 tactcatttg tctagctaaa tgacaaagtg ttccgtaatg cttgcattat ttatactaac 121 ctcaactccg aatctttcgc caaattctgc acaaaatttt aattggatgt aattatcaac 181 aactcttgct ttgacttcaa gaaacggctg tccattttca tacagtccgg ctaattgcaa 241 gcctggtggt aggtagggct gatttcctat cggatatact cgcaacagga cggcaacggt 301 tttgtctggc ttagataaaa tcgctaccat taaagccaca gaatgaccca cgagttgcat 361 ccctaaatcc atgatgcgtc tagcactgat agcagggtgg ttgggttcta ttgtccacaa 421 agtttccgca aacttccaac gcttctcttc atcttcagtt gtctttaaaa gatgagtgag 481 tacatcgata acactaagag agttaatgtt atgctgaggt gattctacat cttcgacttg 541 attaccagac attttttcta aaagttctga tggtaagttg ttttcccaac tgcgatgggg 601 atacaattgg gtaatcagtt tccaaacttc ttctgatgta tttaactgaa atcctcctat 661 ccgtctaaaa cgaaaggcta attgaggact tctttgacgg gatagttctt ccatactttg 721 ccattgggag gtgtagatac cttctagcca ttgccgcaag tgaacaagat gagaagattc 781 tattaatctt tccaatggtt gcaactcttc gatgtcaatt tgtgatgttt taatatcagg 841 taaaaatccc ctgacttttc cttctgtgca ggttttattt agttctacaa cgacataacc 901 aatacgatca tcccaaactt cagggggaac ataacaaagg cgatcgcccg tcctgattgc 961 tctacattct aaacgtccca ctccggtgag cagtaaatct gcaacatttg ccatcaagcg 1021 cccatattgg ttccaacaat gactggcttc taagtcagtg ggaatatcca gcatttctaa 1081 gtagctatta accactagta cagctaaagt 
atttacatac acttgtttag ctttttgttg 1141 agttggttgt tcttgggcaa attccagagc agtttctcga aatttcaccg gaatgggaat 1201 agagatgatt tgctgttggg tagagtaact catagccgta gattcaaaaa ttaggactcg 1261 atattcaaat aaccttcgct gttaatataa ccttgggatt tgccaaagtt gagaaggcgg 1321 ggaaaacact gacgttgata gaaaccgctg agggtgggta tggatacgcc gaattcgtca 1381 gatagatctt tccaagaagt ttcgggtggt aagcgacgtt caatcaagac tagacagttg 1441 atatctgggc gatcttgtac gtggatgcga cgcaaattat tggcttcttt ttttaaccat 1501 tgctgaattt cttctagtat gggtggtgct tccggtcgtg ctggaattaa atttactggg 1561 tctagctctt caccattttg atcttgtatg ggatgaattg tattattact accaacccta 1621 cgactatctt ctagcctgta cttcaaatac ccgttgagcc atgttatgac actacctttt 1681 cctgaattgt agggttcctt aacggtatct gcctcgcaaa gattgcggca aaaatatagc 1741 caggtttttt gtagtgcttc ctccgcatca ggtacacccg taccccgcaa cagtttacct 1801 gtttgctgaa tcagtaaaaa aattttatgc agtccttttt gacgttcaat actacctctc 1861 ggatgctgac aggtttcttg aacaagttgg aggatttgcg cttctaattc atccatactg 1921 tcatcgtcgg tctgatttga ttgttcaggg ttacctggta acagtcttgg cataacttac 1981 ctttacaagt cttatggcta cttatccaga tgtgatctga tttcgtactt gctatttctt 2041 tcttcagact taagttgatt gataaccttg gtaacatcga caccagtcgt ctgccgtaac 2101 tgctctaaga aagacgccat tttgggaaca ctaccaccat ttgtagcatc aatgacagtc 2161 aaattgtcta cctgtacctc tggcactcct gcagccaaca tgcgcaacaa tggttctagt 2221 ttttgataga tgaagatttc tcgggcagat gcaccagcgt tttgccaaga ttcggcgaga 2281 tactgcgtcc cggcggcttg ggctttacct tcttctacaa tgcgggctgc atctcccttg 2341 gctttggcga tcgcccgttg ggattctgcc tctgctggtg caaccacatc agcttgtaac 2401 tggttttcta cctttttgat gcgttcagtt tgcacgccaa cttctgcttg cacctttgcg 2461 acttgggcat tgacaaccgc ttctacctct gcaatcatag cagtccgctt tgtcagggta 2521 tctctaactc tgcgttccgc gtctgctttg gcaatttcca aatcccgttc aatctgcctg 2581 agtttggtaa tgcggttgtt tgccgaagtt tttattatcg cctcggcttt cgcttcggct 2641 tctcctatgc gtgcatctcg taacaattct gcttgctgtt tgcgtccgat agaatccaga 2701 tagcgcacct catcaaagat attcttgatt tggaggttgt ctaatactaa tcccagcttt 2761 tccaaatcgt cttcagcttc ttcaagtagg cttttggcga 
aggctatttt atcttcgttc 2821 acctgttctg gcgtcagatt tgccaatact ccccgcagat ttccctccag cgtctccttt 2881 gccaactgtt caatatcctt gcggctttta cccaacagtc gctcaatcgc attgtaaatt 2941 gtcggttctt ctccagcaat tttgatattt gcgactcctt ctactgtcag gggaattccg 3001 ccttttgagt atgcatgcga tacccgtagt tcaataatca tgtttgtcag atccatgcgg 3061 aaagttttct ctagtagagg gttttgaatc ctactacctc cttttactaa ccggtatcca 3121 accgaatttc catcaccaag agaagtgcga ctgccagcga aaatcagtac ttcgcttggt 3181 tgacaaatat agtataagtt acgaatcacc caccaaccag cggcagttcc agtaccaaaa 3241 attcctagta gtaaagcgat aatttccata atctttcttc cgaatttaag tgtaatttgt 3301 agaacattca aacagtgaac tcttaatagg gaactcttaa cagggaacag gaaataggga 3361 acaggaaata gggaacagta gcagccgtag taaaaactaa taactgataa ctgttaaaac 3421 agtgaacagt agcagccgta gtaaaaactg ataactgata actggtgact gataactgtt 3481 aaaacagtga acaggcgttg agtgagggta cgttcaatgc gaataatgaa tttgctctgg 3541 atatactcgg aaagaatcaa agaaattagc tgtttctgtg gttaattcaa caggtgctac 3601 agaacctaga aaatacatgc gtcgtttaac tagaaaaaat cgatattgtc cccaagctgc 3661 attaggatgt tgcaccagaa actctatacc aggatatcca tcaatcacaa gatcggtact 3721 ccgaacgact ttaccagaga actcgctttt actagagaat tgctttcgga attcattttg 3781 cataactttc cgaatcccca cagtagatag atattcaacg tcttcagata aatctatgta 3841 ggtaacgaaa tatgccgtct tagttgttgg attttcagaa gctaattctt ggggtttatt 3901 ctttgttata cgcccaggca tccaaatttt agtatcgccc tgtgctgtgg aataagtaac 3961 atcgttggca atgacatctg gttgtaatgg tgctttaaaa actaaattct gatgggatgc 4021 tgatttgtta atcaatgcag cagatgggtt ttgcgttccc aaagtcactg caattatggt 4081 gatagcaaaa gtttttaagg tgttcataaa aaacctcggg agtttggaaa ttgtcgtcta 4141 aaaccgtcat ctctagtcct ttccttgatt gagagttcct gtgacatcaa taccaagggt 4201 ttgattgacg ctttctagga actgaagcac gatttcggga taaacgttga ctaagctggc 4261 taaagatttt ccatcaccgt tgtcaatcac gttcactttc ttgagttgaa tgcgcttggg 4321 aatctgcact gcttcttgga gtacagtttc aatttgttgg attaggaata atttggaagc 4381 atctgttcca gtttgctgcc aaacttggga gagaatatcg ttaactaaag ctgcggcttt 4441 agcattttct tccaaggggg cggcttctcc tttggcgcgg atttcttggg 
cttggcgttg 4501 ggcttgggct gggagaactt catccgcttg taagcgcaag cgttctaatt ctgaacgtag 4561 tgcttgcaga acttgttcag cttttgcctt cttttcttga gcaacagctg tggtaatttc 4621 ttcttcacag cgggcttttt gttctagctt tgcctttatt gtgcggagtt cgttttcttt 4681 ttccagcacg ataattctgt cttgggtttt ggcgacttgg gcgtgttcct cacattgggc 4741 ttctatttgt tctgcttggg cgatcgcatt tgattccgca atttccgcat ttttaacaac 4801 aagagcaatt tgcttgcgtc cccaagattt gagataatct acatcatcag aaacactttg 4861 aattttcaag gtgtcgagtt gtaatcccaa tttggtaagg tcgcggctga catcagaggc 4921 tatacggtgg gcaaaactca gacgatcttc attcagttct tcaggtgtga gagtggcgac 4981 gacaccccgc aaatagcctt ctagagtttc gcgagagaca cgagttaatt ctgagcgatc 5041 gcgatccaaa aagcgttcaa tagcattacc cacaacccct ggatctgtgg aaattttcac 5101 attggcaatt gcttgaatat ttaaaggtgt tccccccttg gcgtaagcat ttcgtacctc 5161 cacccgtacc ggcatagtgg taacatccat acgcttaaca gtttccacaa tcggaatacg 5221 aatagcgcgt ccacctaata gcactcgata accgacttct tgtccatcct gagtacgcca 5281 tctgcgtcca gaaagaacca atacttcatt gggtttgcag atacacaaaa aggacttaac 5341 gaaccataca aatacaacag cactaaagat agaaactgca atggggatac tgctaaaaag 5401 taatccttca acaccattgt tgttcttatt agtaggtgtg tcgatattaa tttgtgctac 5461 tggcaccgtc gctatgtttt gctgagattt catagttagt agttagtaat tagtggttag 5521 ttgttgcttt ctattgcatg gtgtttggtg ttatttttcc cctacttccc tcactttcta 5581 cacagacacc acccatactt tattgccctt cataccaaca acaacgacac gagcaccttg 5641 atgaagttgt tgagtttcgt cagtaaaggc aacaaaatct actaaagaac ctttgagatt 5701 aacacgcacc tttccaggac tgttgtaatt caaaggaatt tccaccgttg caaaacaacc 5761 gatgatatta tcactacgta ccaagctatc tactacttga cgtcgtcgtt ggagagaaat 5821 taatccaacg attaatacac caccgaaaat acccacgaat actgcaatac ccgcaattat 5881 caggattaag tttttcattt cctcctgtcc agttgatgaa taaatcccag ttaaaacttt 5941 tcatttcaag acaatcctct tgaagtgctt ttgtataact ctgctggttg aactttcata 6001 aaatgttgct atcgccaata caccagaaac agtgagactg ttcaaacagt gaacagtgat 6061 aactggtaac tgataagtaa gacagaggat atagcagaaa tgctatatct ctgccgtcac 6121 ggaacgcagc tcaaaatgaa catcacttag cgtgacattt ggattagcgt tggttctata 
6181 tgttcttctg ttggagtaga acaaagttct ctcttttgtt acaaaacttt acaagatagc 6241 gaaacctcag taacaaaaga aagtatgtcg ttttggatcc ttgccagagt tatcataaag 6301 tttgatatgc atcgccagca aacaccatga accccgaaat tgctcgtcat ctttgggaat 6361 tactttggca agattacagc atccgggtga gttatgcccg cacctacgca caaatgatca 6421 ctgatgcggg tgggactgtt gctaatgatc acattgcttt tcggtcttta cgaatggaga 6481 tagacagtcc acagggtaag gttaatttgg gtattgacta cttaggacaa cttgctgagg 6541 ctttgggcta cgaagcggct ggggaatact cttttccaga aacccatctt tacgcccgtc 6601 actatcgcca tcctcagcaa gaggaatttg atttaccgaa gttgtttatc agcgagttga 6661 ttgtcgatga attgccagaa aagactgtca acctcattca acaaactgtt tccggcgtga 6721 atttatttca ttcttcagca attctccaca cattcgctac taaaactgaa cgccttgcca 6781 aagaactgca tagaatcttt acgtcccatt ggcaacctcc ccggcgttct gtagtggaag 6841 aagtgaacaa ggtaactcaa tatggtgctt gggtgctgct gcacggttat gctgtcaatc 6901 actttacagg ctacgttaat cgtcagaata ccccaaaata cccagatatt gaaactactg 6961 ctcgtggttt agctaacttg ggtgtgccaa tgaaagctga gattgaggga gacattgcta 7021 gtggtttgcg gcaaacggca actcaagcag ttactgaaat tgtcacagtc ttagaagata 7081 atgtagatgt agaaattcaa ctcccttgga cttatgccta ctatgaaatt gccgagcgct 7141 atatggtaga agtggaacca gatcagcaag tgctgtttga tggtttttta ggaaggaatg 7201 cccagcagtt gtttgaaatg acgcgattga atataaataa gaataatgta gagacgtagc 7261 atgtgagtcc aggcgtaggt tgggttgagg aacgaaaccc aacaatatca cacgtttgtt 7321 gggtttcgcg ctccgctcta cccaacctac cattctctta actgaaccgt attggaatat 7381 aaataagaat aatgtagaga cgtagcatgc tacgtctcta catctgtata ttgcacaact 7441 aagcgggaga caattgtttg attttttctt ccattgaaga aatcagtgat tggtatggct 7501 gaataaattt gtcaatccct tcatctaaga caagagtcat cacttcatcc agattaatat 7561 tgatatctgg ctctttgaga ctgtcaatca actgatatgc ttcttttacc ccagtttcaa 7621 gccgattagc aacgtcgcag tgatctgcgc aggcttctat tgtggctggg ggtagagtat 7681 taacggtatc tgaacctatt aactcgttga cgtacatcac atcgctgtag gcggggtctt 7741 tggtactggt agaagcccac agtagtcgct gtacgttcgc tcctttctcg gctattgctt 7801 tccaacgctc actgtggata atttctttgt atttttggta agcaatttta gcgttagcaa 7861 
ttgcaacttt gcctttgact gcaactaagc gagcttgcat tgccaaatct tcagtaccaa 7921 ttgtcttaat tctttcctca accatttggt caactttaat gtcaattcga ctgaggaaga 7981 aactagcaac agaggctatt ttgctaatat cttgaccctt attgactctt gcttccaatc 8041 ctcgaatgta tgcccaagcc gcctcttcat agctttgaat cgagaacaat agggtaacgt 8101 tgacattaat tccgtcgcta atcacctgtt ctactgctgg tgcgccgctg agagtaccgg 8161 gaatttttat catgacattt tcccgcccaa tttcgtgata atatcgctgc gcttcgtcaa 8221 tcgtagcttt tgtatcgtgg gcaatagttg gaggaacttc gatgctaacg tagccatcca 8281 agcctccaga ctcttcgtag ataggtcgga aaatatcaca agcgttgcgg atatcttcaa 8341 agatgaggta ttcgtaaatt ttagaaacag gtaaacctgc tcgaattcct gcttcgatat 8401 cagcatcata gagggcgttt ccggcgatcg ctttttcaaa aatcgcaggg ttagaagtaa 8461 ttccacggat accccgactc gaaatcatgc gctggagttc accagactca atcaagtccc 8521 gagtcaaatt atccatccag atactttgac cgtaatcttt tattgttaac agtgggttgc 8581 ttgccatttt taccaccaaa ttcgatttcc gttctcgtca actcgtgctt gtcgaatcac 8641 atcagagatt tgcttagcat cccttacgtg gtgaagtgct tttttttcac catttgggta 8701 ttccacaata aagtaggtag gaactttaat ccagtcacct gtaatatccg gtgaatctac 8761 agcaatttgc tttgctcttt cttgaatgct ttgcggcttt tcagcaatca catctcccgg 8821 atttactgtt aaattccttt ccttgactga tggttctgct ttcgtgtgtt tatctatact 8881 tactggctgg tctatttctg atttttgttg agtcttcata gcttgaaaca agttgtcaac 8941 ggtcttacta ttcttaaatc aagcctaaaa aaacaaacac attaactcct ctttcaagag 9001 agggacttta aaaatcacaa taaaatcttc cggtatacat tttcggtata atatttattt 9061 catctcaatt ttgttgaata acacagactt gattttcatc aaaaagatcg agatgaaata 9121 taagtaaaga tttgactatt ttgctctgct taaataagtt tcgatcaaag gtgtcatcaa 9181 cagattgcag tcgaatttac aaccaaatta tgagcagtat ttttcaatga tagatcatga 9241 aaaaccttgc tcataatgac tgctatgatg gaaaattaag agtctttata attttcatct 9301 atccaacagc gtggcaagct ctcaccagaa ggaaaaagta attgctcttt cttaaaatct 9361 tctggttctt gttcttcttc tcccgcttga agttgcccag cgacagtagg tttcaatggg 9421 ctaatcagtg cttcgaagtc gcgtatttct actaattgac cagtctgctt gtcttgaaga 9481 agcatggcga aaatggtttg aatgaatgtt taacaaatta attagctaga cagcttctaa 9541 cagtagccta 
ttcttttctg tgtttcactc accctaagat agaattgacc actgaccttt 9601 catgtagtat ttgttaatac catactgtga cgttcagcat aaaatgaact atcaatacag 9661 cttttaatac caattgctca tgatagcttg cccgtaaagt gccataatag atgatattaa 9721 tctttcttaa tacatttgtc tgcaagactt acacactgta gttgtttgtc accaaatatt 9781 aagctcaaat ttacagagaa aatgagaaaa ttacgtatct taaatataac ttttggagga 9841 gtattaaaga gcatgcttcc tatagactta aaaatcatac taaactcaat taacttgttt 9901 tcatggtata caaaatttta aaaaattgta catttgaaat gatagttcaa gattatcttt 9961 ttaaaagcac gttatctttc ctaacatgtg tcagtgtttt attgtacagt aatagtgcga 10021 tcgcggcgga aaagattgtt ttaaaatacg gtatttttcg tgaatcagtg tcagtagaaa 10081 aattaacaaa atttgccgag acaggagaag tttcgccaat gctaaattta ctcttgaatc 10141 aggctcgtca agatccgcaa acagttcgta atgtcttaac aaaagaggtc aacgctagcc 10201 cagttgtgtt agatcgacta ctgaataacc cgattggtga atttttagta gaccaaatcg 10261 gtcaaacaat tcatactccc tcaagtgaag ctaacccaca agctttgcga tcagctctag 10321 tgttatcagc taacaaagat aataaagtgt cattgattga aattattcaa aattatccga 10381 cgcaagaggt gtatgtagag ggggagcgtc ttgtccgcac atatgatcaa ctcagtcttc 10441 tggcagaacg tttgcaaaac ctgctctctt ggcgcgaata atatttaaaa tatttgtgag 10501 gtttgccaat ctcaaatctt ggtaagggcg ctaagaaaca gcatttagca aattagggtg 10561 gagaagaaat atgatgaaca cagtacactc acagcgcaaa aaagcttcgt ttcctgctgc 10621 aatatttctt tcagctttgc taacacttcc agtactttct ggctgcggag gaggttctcg 10681 aaccgcagcg ccaccaccac ctgttgagga tagtgttact agaacagtaa ataatccaac 10741 cccagcaaac caaccccaaa ctaaaaaggg attgagtaca gggcagaaag tagcaattac 10801 attagtcgga gcagctgctc tatactacct ctacaacagg cataagaatt cccagaaaga 10861 gggagcgcaa ggtaagtact acctttcaaa gaatgggcgt gtttactacc gggatgagca 10921 aggtcgccct cactgggtaa cctcaccatc tggaggaatt caagttcccg agtctgaagc 10981 gcaacaatac cgggattttc aaggctataa cgggcgtact acaggtcgca gtctcagtga 11041 tgtagctcct gcaaacgctc cagcatttta aatcaacaag cacacccctt aacagttcaa 11101 aattcgtctt ggaaagtttg ctctgtcggg aaacgctctt ggcacgccag ttcgctcaag 11161 tcgggaaacc ctttcggcag ttcctcatgg ggagccagtg cgttcgttgc ccagagcagc 11221 
gccttgcggt gaataagggc gcgcatgagg cgttttcccg ccgtaggcga ctggtgttag 11281 ccgtcaggcg aaagcaggca tacccggagg gagccaggga aagcagtcct tgtttcccga 11341 cagagggaca agcgcgttgg ggttccccac gccagtccgc tcaagtcggg aaacccgccc 11401 acgcggctgg ctcccgttgt ggcgactggc tgttgggttc ccccctactc tcctgattta 11461 tccccnnnnn nnnnntttat cccccataga actttgttgg tcgaaattga agcaatttct 11521 ccgttctcgt gaagcacgca cactggaatt actcgatcaa gcaatgactg atgccgtaaa 11581 ttatattacc gaagatgatg cctacgagaa ggttcaatca ttgtggtcta tttacctgaa 11641 aattgctgta aagacagttt cacttgagga tttaaaccgc aacagaaact gagacgacgt 11701 gtagacgtag agatcttaaa ccgtttaatt tacgatactt taaaagttat gttaagaaag 11761 cgccctgccc cccaagagcc ttaatgataa gtctttaacc ggacacgata taaaggtgcg 11821 atcgccattt acgacgatcc agcccgaagt tagagcgaac ttcgacaatt cacctcgctt 11881 cacgtgaagc aatgacaagc aaataaccat tgattcaaaa aatggtcaag cagaaaaagc 11941 aatttctact tgaccacaac gggtgtaaat gatataatcc aaacaatatt tttgttagaa 12001 ctatactgaa gttagctcaa aggttttgag atatttcagc cgataatttc ttttttgatt 12061 gagttcagtt gtgcacctaa ttcttgccga gtagaacctt tgagtaaata gcgggaagta 12121 aaccaaattg tgtaaccaat ccctgttaat tccagcagcg ggcctagaag tggaatatca 12181 ttgactgcat ctagcaccgc caacattagt ctgagtagag taaccgtcac cacaagtaaa 12241 gcaaagctaa taactggcag cttgtattcg ttgaaaaagc tacctaaata ttcaggcagt 12301 tctgctaaat atgtagaaat ttgtctacca aactgctgcg cttcttcttc agtttgagtc 12361 gcaggtggta gcattgccag acttcctggc tctgaacctt caagttcttt gatgccttct 12421 tttatcttgg catcagtgta ttcagcttgt tgtagttcgc tgtccatagt atttcctcac 12481 tctcaaataa caacaatttt ttctacttaa atttgtggta ctagcctacc atgtcagaaa 12541 gtgccgcaac tacaccatat tcgtcagctt tatcgccaac acgttctgtg atttcagcac 12601 cagcaatctg atcattgcga cttgtattct gcacaaccac aaacaatcta taccattggg 12661 aaaccagaat ctctcaattc atgcagccca ggtttggcac tggatgatga gtaaatacgc 12721 ctacagcaca cctatgcaaa cctaagacca tccttacttc ctcctacaag cgaaaattat 12781 ttgtacaaga atcttctaaa tgcagttaac catattgata ctatttttaa acgaaatttt 12841 cacccatctt ttgggataag ttttctcttt atcaaattga aaaattattc 
gttttcagat 12901 aaagtccaac caaatgtttt taacaaaaag tcaactatac tggaattcct gatttagcaa 12961 tctttctggt agggattggt gagcgctatc taaattgtag accacatgca tcaaagattt 13021 tggtggggga gactaaccca ttactcatcc agacgtcgca ctgtccaaca ttgctgtctc 13081 aagcgcgact cacggctgtg ttcttctcta ttcaatctga aggtttagct catttcctgc 13141 aaatggtgat catttattta tttttgtgac tgtaatgctt tacaattttt gccagaaagt 13201 agtcacaaca atactactag ttcaccacac aaattttcac aggtaatcag agtcttgata 13261 aactagtttc aaactcaaca tctaccaatc gcagaaaaat acatagtcaa ggaatactcc 13321 tgtagattac aaaactatat gcatatatag tcttacaatc tatcaacttc gacatttttc 13381 taattgcgag tcacaaaaaa tcttaaaaaa ataaatcatt ctctcgggag agggttcaaa 13441 ataactaaat tcataatcga ataaatagga gagattttga aggataaagc gaataatgcc 13501 actttctata attagataat tcgattgtta ttacagtcag acgatggata gtttttatgt 13561 ttgtgaattt taatgcttgc tgttgtagcg gcaagatttg agccaagaac aggaaaacca 13621 aaacattgac tctttacttg ctgttcggga caatgttttg caaccacctc aaaaagtagc 13681 gggtaaagcc aaaaagcagt atgagcaaac tacaaccgct attgctgagt atctccgcaa 13741 tacgaatctg gaagaactga atcctgaagg aattcagcaa gacttgcaaa aattactagc 13801 cgatcctaaa gaaggtgcga atgcattacg cgatctgccg ctatgctgca gcgaagctat 13861 cgcctctggc aagttgatcg cgaaacctta gtcaaactcg tgacgcaacg aggagatttg 13921 agtgaagagc aagtaatacc gtttcacttt aaggttgata caaataggca gcaagcagca 13981 atgcttgggg ctttcttgca gggcagtatg agcgcgtttt gtgtagataa atgcgtcgtc 14041 cttcgttaac tcttgagtta acatcattga ctctacttac ccttcaactg attgctgtag 14101 gggtgtcgag ttttccgggt aggtttatgc agagtttttg aaaattataa gtagcgatct 14161 acctaatcaa acgtaaaatg tccgcagcgt aagcgcaggc gcacgcaagc gagggcgtta 14221 gcgtatccgt cgtgcccagc ccaggatttg tggtctaatc ataagaataa gcactgctct 14281 aagacaagtc tcatatagcc gcacctgtgc caggataaaa agggggctaa aaaagtggat 14341 tcgatatcac ctgatttcac tgcggagttc gtaaaattat gcaatgccac taatctttat 14401 gatcaagcga tttttccgaa aagccctgac attattgtca gtgtactacg cctatatgct 14461 tgagtatcgg gcggaactca ttttatgggt actgtcaaac tctctgccaa ttattttaat 14521 gggagtgtgg atacaggcgg cgcaaggagg 
acggtttaat ctgtcacctg tagattttgc 14581 tcgttacttt cttgcagttt tcattgttag gcaaatcaca gttgtttggg tcgtttggga 14641 gtttgaaagg gaaatagtag aaggaaaact ttcaccacgc ttgttacaac ctattgatcc 14701 cgtgtggcat catattgctg ctcatgtttc tgagagatcg gctcgttttc cttttacatt 14761 gttcttgtta gtcgtctttt ttctccttta tccgcaagct ttttgggtgc caagtttcac 14821 aaagattttt ctatttttgt tagcgagtgt actggctttt gttttacgct ttatcatgca 14881 atactcattt gcaatgcttg ctttttggac ggaacgagca agtgctgtag aaaatttttg 14941 gtttttgttt tatttatttt tgtctggaat gattgcacca attgaggttt ttcctgaatc 15001 ggtgcgcgca atagtccttt gcacaccatt tccttatatg gttgattttc ctgcaagtat 15061 tttggttgga ttacccgtgg atttggcacg gggatttttg tcaatggtgg gttggatttt 15121 ggtgtttttg ggtttgaatc gtttactgtg gcgacgggga ttaaagcgat actcagggat 15181 gggagcatag acaattcaaa atcgcgtagc gttgcgagtg aagcgtaagc gcaaagcgca 15241 cgcggcttgg gcgttagccg taaggcgaaa gcaggcatac gcgcagcgtg tccccttggg 15301 actcagcaat ctcaaaattc aaaatatgtc ctatgggcac actactttgg aattcgtaag 15361 tttttccttt aggaattccc cacgtgtagc tattgcaacc tgttacgaat caaagaaaca 15421 cctccctcac tcactcactc cctccctcaa gaggtacaaa gggcaatgct tgcggctacg 15481 tcctgcctca gtcacctggg gtgcgcaatt gacttgtgtt tatctggact ttatcatgaa 15541 ggataaaagg tttatttctg acttaactgg cgatcgctct ggtacgccgc tcccaaaggg 15601 attatcacca cctgcaagct tgagatcatc ccacaagaac agtcaacaag gtttaccatg 15661 aagaagtatt aatttttaaa ctttttttct aactttttac cttgtttaca tggactggca 15721 agaaatttct ggaaactggg tacttgttcc gcgaaatcct gttggtgtga ttcactttct 15781 tggtggagca tttgttgcta cagcacctca tctgacgtac cgttggttgc tagaacaact 15841 ggcaagtaaa ggttatgttg tgatcgcgac accgtttgtg aacactttgg atcatagtgc 15901 gatcgcaaaa tctgtcctgt taaactttga acgcgcccta gaacgcctac aggatagttc 15961 ggcaatacgc aagcgttacc ttcccattta tggaatcggg cacagcatgg gttgtaaact 16021 ccacttgctg atcggcagcc tcttcagtgt agaacgcgct ggcaatatcc tcatatcctt 16081 caacaactat gctgcacggg atgctattcc cttaatcgaa cagttcaact tagccgccgt 16141 cgagtttacc ccctcaccct tagaaactaa caaactcgtg caagatcggt ataacgttcg 16201 ccgtaaccta 
ctgatcaaat ttagcaacga caccctagac caatcaacag ctttaactga 16261 actattaaaa caacgctttc cagagatggt gatagcacag acgctaaacg gaactcacac 16321 cacaccctta gggcaagata ttaaatggca aacgggagaa tctttcacgc cgtttgacgc 16381 ttttggacag tggtttaagc aagaagtgta ccgtgatctt aatcagctta aaagcatcat 16441 cctcttttgg ctaaaccctc ttgcccatcc cataattaac tagggatatc atcaaaaagc 16501 cagattttca caacagtcta tactctaaat tattattaat taatgattat tgattaatga 16561 atagtgcgtt tcaaatttta gtcattgatg atgatccagc agtacagata cttcttttaa 16621 ggatgctgga aagacagggt tataaagtat tggtagccag caacggcaag gatggaatta 16681 caaaagcaat ggcttgttct ccagcgctaa ttatttgtga ctggatgatg cctggattaa 16741 atggattaga agtttgccag cgaatcaaga cagaccctca attatctacc acgtttttta 16801 ttttattaac atctttagat tcggtagctg atcgtgttaa agggctagat gcaggtgctg 16861 atgattttat cactaaacct attgaacaaa atgaattaca agcgagggta agagcaggat 16921 tgcgtctaca ccaactcagt aaagatttgc aaactcaaaa ggaaatttta gaagcagaat 16981 tggcagaagc cgcagagtat gtgcgttccc ttttgcctat cccaatgacg catcctatac 17041 agattaattt ccaattcatt ccctcgcgtc aactgggggg tgattgtttt gattactact 17101 ggcttgactc tgattatctg gcaatttacc tgttagatac cgctggacat ggactcagag 17161 caactctccc ctctctttcc gtgctgaact tactgcgatc gcatgctctt gctggtctga 17221 attattatca acccagtgat gttttgcaag cgttgaataa tacttttcaa atgaattatc 17281 gaaacgataa atactttact atttggtatg gagtttataa ccgagttaag cgacagttaa 17341 tttacgcaag tgcaggtcat ccaccagcag tcttagtatc tggaaaatct ccaactaagc 17401 cgaaagtcca aagattgaaa acccctggta tgccagtagg catgtttcca gaggtaaaat 17461 atgtagatga cttttgtcag attgaagagt ccagttctct ttatattttc agtgatggtg 17521 cttatgaaat tacccaatca gatggcacta tttggacttt ggacgctttt attcagatgc 17581 tagtcacctc acagcttcaa tctgagggta aactggatca cattttgaat ggtatcatct 17641 cattaaactc caaagaagct tttgatgatg atttatctat gatccaggtt aaatttgatt 17701 aatttattcc agaactaact gttttattga actcttctcg attggaatat attttaaata 17761 ttttatccat acctgtcagt tcaaacaata tcctaacttg ctcattaatc gagcagataa 17821 ccatatttct atctacagat tttagagctt taaaagctaa caccaaagca cccaagccag 
17881 aactatccat aaaagtgact tctttcaaat caagtaacac tgttttggta ctactttctt 17941 tttgtaaaat ttcatcaatt ttttttctca actcttgcga ttgagccgca tctaaaattc 18001 cactaggttg aataactttg atttcttcta ccatcataag gaaacacttt ttgtataatt 18061 aataacaaat tactcagtct atacccatac taattcattt caaaaacact tgccacatca 18121 gtatgtataa attcaaaata ataagtcaat catcatatca agatattgac aagttcttga 18181 ttttttcata ccgtgctcag cgaaagactt tatgcttgtg gtggaatggg actcaccttg 18241 gctcgatata gcagtttatc accttgatgg taaccaatgt aacgtacctt aaccagttca 18301 ccaacttgtg cagtaccttc taaaagttgg tgtagttggg gatcataagg tagttctgct 18361 cctacacgtg ctatggcttc cactccccac tgctgcaaaa gtctctctaa gggtttgtgc 18421 actaacggta cgatgttgat cgcgcttaat tggggatttt cccgtgcttt gtgtgctgca 18481 gtgggaaatt gcaccaacaa agattccagc aattgtaaac tagactgctg gaactcttgg 18541 agcaatacct gtcgttgttg ttgtatttgc tgttgcgatc gctcgtattc ttttttcaat 18601 tctgctattt gttgtaacaa ttgttgagat tcgttagttg atgtatcttc acgttttacg 18661 aattcttgtt gagagaagat tgtaactagc tcattaaaag taatttgtag tgcttgcgct 18721 aacttgagga ggacatccac ccgcatctgc tcaactccca gccgccgtag ccgcacaagc 18781 tgacgctctg agacgccagc agtgcgactc agagctttaa aactagaaat acccacacgt 18841 tgcattaagt catgcaactg tgaagtgaaa tcgatgtggg acatcgttgg caatgactaa 18901 ctgctggttc ctcagaacta cagtttagtg gttagtggtg ctttttgaca agcagatgag 18961 ttattaagga ttagggaata acaactgacc aacttcatgc caaatgcatc ttcaaaacgc 19021 ttttgggggc tttgcgaact ttggctaaga tagtttcttt acccaaacgc tggcaagctt 19081 catagcgatg gcatccagaa aatccataat actgtccatc gacttctaat acatcgattg 19141 gttcttgttg cccaatggca gcaatcgatt ccatgagagc ttttaccttg tggggatcgt 19201 ttgtacgtgg caaaggtcgc cgtatctgat ttaaaggaat ctcttgaact ttaatcataa 19261 ataagttttg ttgataaatc gtgtcttgtg gttcttgata tatcatagtc gttatgacta 19321 tgatttgcaa gaaactaaat tgccaaaact aatattagtt tgatcaaata acttgtgcca 19381 aaaactacta aactacatag gagtcaactt ttcaaggaac cccaaaggag aaacaagggt 19441 cttcatttta ggatcctcca ctgttatttt ttctccaagg atattccggt ttgggggctt 19501 acaaccaaat aaatctagca tgatccttct gacctcatac 
tgtttttgcg gcttatccac 19561 cccaatacag ttgtagtgga agaacaaagg ttttatgaaa attatctggc gtatttctta 19621 caccgtactg accacgtttt gtttattagg tttgacagga gcctcaggtc ctgctagcaa 19681 cacaaaagat gtggaaatta agattggcat agtgcagcga tttggagata attccacaga 19741 acagctacaa ttagagccta ccaaaggcga tcgcctgagg ctaaaatttc aagatggtaa 19801 tcgtcagcaa accctagtga ctaccaaccc tgtcaagctg gaagtcttga tggaagcttt 19861 accagcagca gtggtcaagg aaagggttgt gctgggcacc taccgcacct acgaaacagc 19921 cgaggacagc gcgaaacact ggcgtgataa gggaatagag gtagaaatag cccaaccaga 19981 acgctggcaa gtttgggcaa agcgcgaagt gtataacact cccttactca ggcgtttact 20041 tctttcgagt gtagaaacag ctgctcaaaa gactgctttt gtagatagcc aagttttaca 20101 aaatgtaccg cgagtcacga gtctattgaa tggcaaacgc tacaccatag ataatttgga 20161 aattagtagt gataaaaatt taattcgcgt taacgaaagt aaaaaaccag aaaaagcacg 20221 tctttacgct ggacggatga aattgcagcc aaatgcttac ggtagctata ctctggttaa 20281 cgaagtagct ttagaaacat acttgcgtgg agttgtgccg tatgaaattg gtgcttcagc 20341 ccctacggca gcaatggaag cacaaacggt tcttgctcgg acttatgctc tgcgaaattt 20401 acataggttt gtgatagatg ggtatcaatt gtgtgctgat actcactgcc aagtgtatta 20461 tgggttgaat ggagtgacac caaacacaga tcgggcgatc gcaacaacac gcggtatggt 20521 tctcacctac agaaacgagc tcgtagatgc cctatactct tctacaacag gcggcgtcac 20581 tgcttctttt agcgatgtct ggaatggtga tgatcgtcct tatctgcaac ccatcgtaga 20641 cgctcccagc gataaggtat ggaatttatc tgggcaaaat ttagcagatg aaaaccaatt 20701 tcaacggttt atcagtatca aaatcggatt taacgaaagc cagaaagact tgttccgttg 20761 gcgtaaggaa tctagcttga aagatattac caagggttta cagaaatttc tgaaagtgaa 20821 aaatagtccc tacgcaaagt ttcaaactat cagagatatg aaagtcgtag aacgaagcaa 20881 gagcggacgc attcttaaac ttgatgtcaa aacagatatt ggcacctttt ctttacataa 20941 agatgaggtt cgcagcgctt ttgctgcccc tgtgagcaca ctgttttact tacaaccctt 21001 aaataaaggt gaatctgaac tatggggata cgcttttgtt ggtggcggtt taggacatgg 21061 agttggctta agtcaagtag gtgctcaaaa tctcgctcaa ctaggttggg agagtcagaa 21121 aattctccaa ttctactatc ctggaactaa aattcaaatt ctcggtaacg ggatttccga 21181 aaccaattga gtgatttgtt 
atactccaga tcccctttcg catttgcagg gtgactgagg 21241 gctttttctg gagtaaacta gacagtcaaa taatttacgg ttaattaaca aagttaggtt 21301 tgttatcttt gttagtagca gattgctaga attaaattaa tgaagcagga acacgggaca 21361 ccctgtgctc ctgcttacat aacgtgaatc tagattgaat gccaaattcg gcgttggagc 21421 agacacacca cagaacgatt tctgttgtta gttctgcttc ttcttctttc gtgtagctac 21481 gacatcgaag actcttgcaa gaggttaatt tttgagaacc ctaactttta ataaagttcc 21541 tcttcctggt gggtttcgat tgtggagtta gaccttggat aggcgacaca ggtcagcaca 21601 tatccagatt gaatctggtc atcgtctaag aatgactggt cagactgatc tacttcacct 21661 tctgtgatct taccagcaca ggtagagcaa gcaccagcgc ggcaagagta gggcaggtca 21721 agaccagctt cttcagcggc gtctagaata tattcatcgt cctcaacgtc aattgtttgg 21781 ttcagccctt cagcttcgtt gatcagtgtt accttgtaag ttgccatttt gtaactctct 21841 atgagtgcaa atagactgta taagttatcc tgaagcaaaa agtacggctt ttgttatcgg 21901 aacagctgcc aattaaaatt tgcttcacct ttgatactac gagaaaaata aaacgatgta 21961 gccagaaatt gaactcgatt agtaatgaaa ttatgcgagt taattatgat aaatttatct 22021 tataaaaatc agtatccact tcctagattg ttaagtaaag ttacaggtgt gagatgcctg 22081 taagcgtctt gatagttgtg aacttgcctg cactattggg aggaatgaag gtgatgctcc 22141 caaaggaagc cttacctggc gcaaacgcct gaaaaaacaa gatacccgac aactcttacg 22201 aagtccggaa tctgaaattc cgaggttaac ggatttgaca tcactcagga agtgcgatag 22261 cgaagcgctg ctgctttcag cagatcgcgt acttatccga tgatggtttc cccctacctt 22321 gtcgccagca gttatccgct tcccatgcac cattttcaga ttgtcgcttg cggctgcgga 22381 cggtgagtgt gttataccaa tttgaaaaaa gaatgcgaca gatggtaaat atatagaggt 22441 agcatttagg gtgagtgatg gcaggggtaa cacatatcga aatccaagaa agtgtcgaag 22501 aactggaggc gttggtacgt caacagaata atccacgact caaagaacgg ttacaagcgc 22561 tttacatgat aaaaaatcaa ggtatcaccg tgtgtgcgat tgcgcggagc gcagacgcct 22621 aaaggcgtat cgctaaaata ctcggaaagc atcgaagcac cgtacaacgg tggttagcag 22681 attatcgtga aactggaatc gagacgatgc ttgaatttgg tgtaagtcca ggtcgaacac 22741 gagtaatacc gaattgggcg gttgaaagtc tcaacaaaca actagaggag cctgaaattg 22801 gattcgctgg gtataaacaa attcagcatt ggcttggtac tgttttagga atagaagccg 22861 
aatatgccac ggtacatcat ttagtacgtt atcagctcaa agctaaacta aaagtcccac 22921 gtccgcgcaa ttgcaagcaa gatatacaaa aacaggaagc ttttaaaaaa accttggtga 22981 cgacttacaa ctaatttctc aatatgcggc tattgtatta ccaacttata aaaatatccg 23041 ttattttgta caggatgaaa gtcggtttgg actcaaaaca attgaaggaa gaagaattac 23101 ccttaaaggc gttaagccaa caggcgactg gcaatggcaa tttcaagcct tctggcttta 23161 tggggcagtt gagcctctaa caggtgaaag ttttttttgg caattctctc atgttgatac 23221 tgaatgttac caacaatttt taaacaaatt tgctgcctct aatcctaaca gcctcaacat 23281 cattcaggtt gacaatggat tattccacaa agccaagaaa ttgcaaattc cagaaaatat 23341 tattttattg tttcaaccac ctcattcccc agaactaaat cctatagaac gagtttggga 23401 acatcttaaa cgaaatttga agtgggagtt attcgataat ctagaacatt tacgcattaa 23461 agttgctgaa cttcttgctc aacttacctc tgaagtcgct gcttctctga ctggctatga 23521 tttcattttg aatgccttat ctgtcgcaaa cattttttga attggtatta ccttcaaatt 23581 ggaacagatg atattgccaa ggcgtagcag tcacaagttc tttgatggat gcaccaaagg 23641 ttccagcaca gatgcggtaa agtccgcgtt cagcgtcgta ttcattgaaa cctttttgtg 23701 cttggtggat gtgcccgtgg aggaagaagc ggaaaccact gataccattt cacgaaaagc 23761 ctgatacaaa tagagcg // LOCUS NODE_1307_length_23747_cov_5.06812423747 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 23747) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 23747) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..23747 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..720 /locus_tag="DP116_11905" CDS <1..720 /locus_tag="DP116_11905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317384.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_11905" /translation="CEWCMSGSHNLCLTAEGTIVGRHGGFANKVRAHHAWVVPLPSDI DPVTAGPLFCGGITVFNPIVQFDVKPTDRVGVIGIGGLGHMALKFLHAWGCDVTAFST SPDKEAEARELGANHFINSRDPNALKSVENSFDLILSTVNADLDWSTYIACLRPKGRL HFVGVVPNPVSSLVFPLIAAQKSISGSPLGSPATVAKMLDFAARHKIEPVIETFEFDQ VNEALERLNSGKARYRLVLKH" gene 1429..3045 /locus_tag="DP116_11910" CDS 1429..3045 /locus_tag="DP116_11910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407098.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_11910" /translation="MQKKSIVLAENLAYELSSTRTLFKGIQVSIGEDERIALVGSNGV GKSTLLKILAGQLSPTTGSVTRHETIYYLPQISTIKQDIKADTVFNYLSSISEEWWKI EEILEARFSTTLDLSLPLLNLSGGELTKLFLAIGLSAEPKVLLLDEPTNHMDLMALES LRNFLNDFNGAFVIVSHKPFFLDQVTDMTWELSPEGVKVYGGNFSLYREQKEIELEAR VRSHEVARKELKRTKASALQEQQRAAQSSKNGRAKFLNGSVDRAAAGLLKTKAQVSAG IAKKRHEAAVAKATQRVAQTKVKTTKATSLQLEERSQKRRNLIDIQGANLKVGEHLLI SNIQLHVSSGDRIVIAGGNGSGKSSLAKAILGIENTTAILESGEILLASAMKAVYLDQ TYELVNRKQTILENMQAANPNLNYQLLRQQLGHFLFKEDDVNKSASVLSGGELARLAI AIISISQIDLLILDEPTNNLDIETVDQMIEGINEYQGALWVISHDLDFLSRINITQAY KLNEQALQQTMYLPSESEQYYEELLACQDG" gene complement(3033..3410) /locus_tag="DP116_11915" CDS complement(3033..3410) /locus_tag="DP116_11915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015180373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11915" /translation="MSTAQITDYFNFLTPGDIRLKDSRIGIETILYEYIDRGRTPEEI AQLYTSLTLEQVYATILYYLQNKEVVSAYMKNWIEHGHTMREEQRLNPPPVSKKLQQL RAERKAKKIANVSATLQPRHHPS" gene 3818..4144 /locus_tag="DP116_11920" CDS 3818..4144 /locus_tag="DP116_11920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198363.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11920" /translation="MSDPLLQAFFVGRAAAEVLTERLEFAITDALSELGKWDAETREQ LRQFTDEVLERANRAADAAGATQTTSYGETRSAPVDLQATIDELRAEIATLRTELQRY RSGNSL" gene 4255..5940 /locus_tag="DP116_11925" CDS 4255..5940 /locus_tag="DP116_11925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015121452.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AarF/ABC1/UbiB kinase family protein" /protein_id="PRJNA477356:DP116_11925" /translation="MEKGYSDKAYRWNRENYSRRRRFLDIWSFVLTLMFKLWRYNKSW SYPGGVTEAKQAARRKAQAVWIRNTLLDLGPTFIKVGQLFSTRADIFPSEYVEELAKL QDKVPAFSYEQVEAIIEQELGKKIPQLYQSFEPIPLAAASLGQVHKAVLHSGEAVVVK VQRPGLKKLFEIDLQILKGITRYFQNHPKWGRGRDWMGIYEECCRILWEEIDYLNEGR NADTFRRNFRAYDWVKVPRVYWRYTSSRVLTLEYVPGIKISQYEAIEAAGLDRKIIAR QGAQAYLLQLLNNGFFHADPHPGNIAVSPNGALIFYDFGMMGRIKTNVREGLMDTLFG IASKDGDRVVQSLIDLGALAPVDDMGPVRRSVQYMLDNFMDKPFENQSVAAISDDLYE IAYDQPFRFPATFTFVMRAFSTLEGVGKGLDPEFNFMEVAKPYAMQLMTDMNGSDSNS FLNELSRQAVQVSSTALGLPRRLEDTLEKLERGDVRVRVRSLETERLLRRQTSIQLGM TYAVIVSGFTLSATILLVKEYIWLAMLAGFIAVAVSGLLIRLLMRLDRYDRMS" gene 5957..6679 /locus_tag="DP116_11930" CDS 5957..6679 /locus_tag="DP116_11930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Stp1/IreP family PP2C-type Ser/Thr phosphatase" /protein_id="PRJNA477356:DP116_11930" /translation="MKLNFTGVTDPGLIRSNNQDAHYIDPEGRFFIVADGMGGHAGGE QASHIATGEIQTYLSAHWSSSKPSEQLLKEALLKANEAILLDQQSHSERSDMGTTIVV VIFRPKEPPFYAHVGDSRLYLFRDSQLQQITEDHTWIARAIKMGEITQQEARIHPYRH VLSRCLGREDVNQIDVQQLDVKRGDRLLLCSDGLTEELADPKIADYLQLPVVKKAALS LVEAAKEHGGHDNITVVIVALE" gene complement(6721..7239) /locus_tag="DP116_11935" CDS complement(6721..7239) /locus_tag="DP116_11935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872553.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF456 domain-containing protein" /protein_id="PRJNA477356:DP116_11935" /translation="MQILYWLLVAVMAVGIIGAVVPAIPGASLILIAIIIWGFISGSF VAIKIPLIVTVIVLLLSVGVDFLASYLGARKAGASQWGQIGAIVGLVLGFFGLLPTLP FGGPLLGMLLGPLLGAIIGEYLYRRDLGLAIKAGIGIVAGTLVGNLIQGLLAIAAVVV FLVTTWPQVFGS" gene complement(7419..8057) /locus_tag="DP116_11940" CDS complement(7419..8057) /locus_tag="DP116_11940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872552.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cofactor assembly of complex C subunit B" /protein_id="PRJNA477356:DP116_11940" /translation="MTKSDPNRVLRRLPIVVGGLGAVLLLMNRLLSSELTNSQARADV LGVILSAVLILVGLIWQQVQPRLPEAVQLIGEEGFVLAPDLPPSVKTELAWASHLLLT NTVTRSLVVFYKNKVLLRRGILSPKSEVVPGPILKRVLEKHKPVYLVDLKVYPGRVEF DYLPENTQGVICQPIGEDGVLILAANAPRSYTKQDEKWIAGIADKLAVTLNQ" gene complement(8095..8823) /locus_tag="DP116_11945" CDS complement(8095..8823) /locus_tag="DP116_11945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210368.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_11945" /translation="MESHKEKILVVDDEASIRRILETRLSMIGYDVVTAGDGEEALDT FRKAEPDLVVLDVMMPKLDGYGVCQELRKESDVPIIMLTALGDVADRITGLELGADDY VVKPFSPKELEARIRSVLRRVDKIGASGIPSSGVIHVATIKIDTNKRQVYKGDERIRL TGMEFSLLELLVSRSGEAFSRSEILQEVWGYTPERHVDTRVVDVHISRLRAKLEDDPS NPELILTARGTGYLFQRIIEPGEE" gene 8993..10567 /locus_tag="DP116_11950" CDS 8993..10567 /locus_tag="DP116_11950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742703.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA repair protein RadA" /protein_id="PRJNA477356:DP116_11950" /translation="MAKPKSYYVCNECGAESPQWFGKCPACGTYNSLEEQINIQSSTD VPSRGVSGWHAQAGGGKTTTTKPAKPAKPRASLTFDQISDRQVTRWASGYEELDRVLG GGVVPGSMVLIGGDPGIGKSTLLLQVSNKLAQRYRILYVSGEESGQQVKLRASRLGVT KALSLIGDGNGNAKVAQETPEATPQEMPKVDEAEGIGADLYILPETDLEEILREMDSL KPNVTIIDSIQTVFFPALTSAPGSVAQVRECTAALMKVAKHDDITMLIVGHVTKEGAI AGPKVLEHLVDTVLYFEGARFASHRLLRTVKNRFGATHEIGIFEMVDNGLREVPNPSE LFLGNRDDPAPGTAIVVACEGTRPIVVELQALVSPTSYPAPRRAGTGIDYNRLVQILA VLEKRVGIPMSKLDSYVASAGGLNVEEPAVDLGIAIAIVASFRDRIVDPGTVLIGEVG LGGQVRSVSQMELRLKEAAKLGFKRAIVPKGQKYPDYNIEILPVSKVIDAIIAAIPPQ QGLTEEDLAPDEEDEE" gene 10645..10755 /locus_tag="DP116_11955" /pseudo CDS 10645..10755 /locus_tag="DP116_11955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314999.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" gene 10814..11488 /locus_tag="DP116_11960" CDS 10814..11488 /locus_tag="DP116_11960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_11960" /translation="MRILIVEDDDRIAKPLAEYLRRQHHIVDITSDGIEGWEWSQSGL YELILLDLMLPKLDGITLCQRLRAASSNTLILMLTARDTTGDKIIGLDAGADDYLIKP FDLKELAARIRALARRSQEIRPPILIHGEMQLNPASQQVTYAGNLLSLTPKEYMILEC FLRNPNQVLTRSAILDKLWEFDKSSGEQTIKTHITNLRNKLRAAGSSEDFIESIYGIG YRLCHK" gene 11565..12797 /locus_tag="DP116_11965" CDS 11565..12797 /locus_tag="DP116_11965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015364270.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_11965" /translation="MFQKIRYRLLFSYLVVFASLLGIFAIAVRVAFTRSLTQQTTDKL IAIGQGAAANAEFEKDHLTVESDFRPQDLITRHQALQWFDTQGNLIAQQGQTVLTLPL LPSTMVQVQSGKVPIQAVTIPIIGSDNNQLVGYVRVSQSLEEFEETLEKLDWGLGGGI IMTLILSGVAGILLTRQAMQPIEESFQKLKQFTADASHELRSPLMAIKINADLALEYQ EEIGPKDVEKFQAIASATNQMTRLTEDLLLLARTDKVPNRDWETLNLMSILKNLVQLY KPQAQAKQINLISQLTENFYMMGDSVQLTRLFTNLIENALYYTPSSGVVEIKMSRVGS QLYVNVQDTGVGIAPEDIDKVFERFWRADQSRSYWGGGSGLGLAIAQAIAQNHGGLIT VTSQLGVGSCFTVRLPAS" gene complement(12818..13267) /locus_tag="DP116_11970" CDS complement(12818..13267) /locus_tag="DP116_11970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015162916.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2127 domain-containing protein" /protein_id="PRJNA477356:DP116_11970" /translation="MNNKRPPGLLAIVIYKTFVASLLAVTSITLLLALKNYQNLAAFS ESYVLETKLTIIEWLIDKILNISPTKLKFSGIAIGVYAIVTAIEAVGLWYEKRWAKLL VLGLVGISIPPEIFELITGITILKFIVFIVNVTIFWYLLLHFTKHER" gene complement(13449..13943) /locus_tag="DP116_11975" CDS complement(13449..13943) /locus_tag="DP116_11975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412428.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase" /protein_id="PRJNA477356:DP116_11975" /translation="MKTSTKIILATAFFGTLGLAGLSRVVSAKQPQSPVAIVRQHHSV IQVAQASDGDGETNDDAQEQQEATKLQPLAKITAQQAQQAAETSVGGKASRVKLENEN GNLVYAVEIAQQDIKVDAGNGKVLYTENANQEDEKNEATRPKSSIQVQKGNDGDGESK NDGK" gene 14293..15060 /locus_tag="DP116_11980" CDS 14293..15060 /locus_tag="DP116_11980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11980" /translation="MKKMLNRVPEITLVFWIIKVLATTVGETAADYLSVTLNFGLSAT SYIMSGILLIVLLNQFRLKRYIPLSYWIVVVLMSITGTLITDRLVDELGVSLMTTTVI FSVSLLVVFALWYSNEKTLAMHSINTAKRELFYWVAILFTFALGTATGDLLAEALRVG YAESALIFGASIAIIASSYYYFHMNAVLAFWLAYILTRPLGASIGDLLSQPASKGGFG LGTVSTSMLFLSIITSLVIYLSLKQKKSVSLPINSQD" gene 15311..16054 /locus_tag="DP116_11985" CDS 15311..16054 /locus_tag="DP116_11985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015364272.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_11985" /translation="MNKVAKVTIFFWIMKIIATTLGETAGDFISMSLGLGYYIAFAVT FAILAIFLFFQIQSDRYRPVLYWAAIIATTTAGTEVSDLMDRSFGLGYAVGSLILVVG LLSVLAIWYYRDRDLSVYPIMRKDAETTYWLAIVFSNSLGTAFGDFLTSNLGLSYIEG AFVTASVIGVVIALHYLTKLSDVLLFWLAFIFTRPFGATFGDFLTKPVKDGGLSLPRG YASVIAFILLAVVLFFSVRKEKKVRQPLE" gene 16261..16533 /locus_tag="DP116_11990" /pseudo CDS 16261..16533 /locus_tag="DP116_11990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015144397.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 16543..17232 /locus_tag="DP116_11995" CDS 16543..17232 /locus_tag="DP116_11995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015177556.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphatase PAP2 family protein" /protein_id="PRJNA477356:DP116_11995" /translation="MLRKISTFWLRHIHPHLAPLIATIGTVGLISCLLILFVLAKLAE EVLEREAFAFDTNFLLWLHQFANPTLDNLMLFITHLGNPNIVVIVAGVTLLLLWWRRY REEAKAFVLACLGAFILNTGLKLFFSKPRPELWHRLISEKSFSFPSGHALGSMVLYGF IAYLLAIHYPKLSRVIYSFAAIFIAAIGISRLYLGVHWPTDIIAGYGVGFLWLMICIT MLKLQILRLSQ" gene 17281..17466 /locus_tag="DP116_12000" CDS 17281..17466 /locus_tag="DP116_12000" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12000" /translation="MEAPLAVKLGRIRKQGIAIALWGAALCAIRTSRSPEAYRLLLQT RYAPECLCAGLTAVFFA" gene complement(18636..19403) /locus_tag="DP116_12005" /pseudo CDS complement(18636..19403) /locus_tag="DP116_12005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315000.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="NTPase KAP" gene 19563..20396 /locus_tag="DP116_12010" CDS 19563..20396 /locus_tag="DP116_12010" /EC_number="4.1.3.36" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015157203.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="1,4-dihydroxy-2-naphthoyl-CoA synthase" /protein_id="PRJNA477356:DP116_12010" /translation="MQINWQTAKTYEDILYHKADGIAKITINRPHKRNAFRPKTVFEL YDAFCDAREDTSIGVVLFTGAGPHTDGKYAFCSGGDQSVRGHAGYVDDDGIPRLNVLD LQRLIRSMPKVVIALVAGYAIGGGHVLHLICDLTIAADNAIFGQTGPKVGSFDGGFGA SYLARIVGQKKAREIWFLCRQYNAQQALEMGLVNCVVPIEQLEAEGIQWAREILEKSP IAIRCLKAAFNADCDGQAGLQELAGNATLLYYMTEEGREGKQAFLEKRPPNFHQYPWL P" gene complement(20688..20987) /locus_tag="DP116_12015" CDS complement(20688..20987) /locus_tag="DP116_12015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12015" /translation="MKQMKSLTSACRYCRYYNLEGRRGGFCQQLGAPVRGNWKACSLA LPPFAPSWENLEDVWSLPDATSVMAAGLNKPALDPVEEIATPCSSEKTKAEAILI" gene complement(21599..22159) /locus_tag="DP116_12020" CDS complement(21599..22159) /locus_tag="DP116_12020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF721 domain-containing protein" /protein_id="PRJNA477356:DP116_12020" /translation="MSLKSINDILEVLAAQPEWQEPPFQRLLACWAQVVGSMVVAHTK PLSLQRDVLRVATSSAAWAQNLTFERQRLILKLNQKLSIGLKDIHFSTAGWQRPQNIP KKQQTISRQEHPSYVGDGMSIDDGDVTPKAKDANAAFQNWARTVQGRSHNLPLCPLCH CPTPPGELQRWGVCSLCAAKQSLKSC" gene 22460..23668 /locus_tag="DP116_12025" CDS 22460..23668 /locus_tag="DP116_12025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874627.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12025" /translation="MSENPLIDTSGAQSSKTIGLKPPLAAALASLEVQLDQELARYRR TRIGYRTPNQPRASISTSSKLQPFTVVSATASKTKSLPEENLKETFRKYGLATEKTHT PNTLLPQESILSETKINPAIPPVQIQNQVSASDTILDALLSPEISQSQTSASSKTENL QSHVDSIPSNFAEAQISLPSNAIVVRTQIQEPSQSQRENISNSQENTSKKPNDYLKSS EALLRSLVEDEPPKTQAASNSNNSLLSPLGIGSMLLLFLGSLTLGYIVFNFKTFPQIS FDRLFQQNAPPTGQKTPESNKNVKTVAQPTFTPIPNLGTNSTPNSTATTSQKPDTEIK PSADGFYHVIIDNQGDSAFASARRVVPDAYLSPDGKIIYLGALKKKEQAQTLLQELKA QGINAKIQQP" BASE COUNT 6862 a 4922 c 5030 g 6933 t ORIGIN 1 tgcgagtggt gtatgtccgg cagccataat ctttgcttga cagctgaggg tactattgtt 61 ggtcgtcatg gtggctttgc caacaaggta cgcgcccatc acgcttgggt tgtgccttta 121 ccttctgata ttgaccctgt gactgctgga ccattattct gtggtggtat cacagttttt 181 aatccaatag tacagtttga tgtcaagcca acagatcgtg ttggtgtgat tggtatcggt 241 ggattaggtc atatggcatt gaagtttctt catgcttggg gctgtgatgt aactgcattc 301 tctacaagtc ctgataagga agcggaagca agagaacttg gtgcaaatca tttcatcaac 361 tcccgcgatc caaatgcact caaatcagta gaaaattcct tcgatttaat tctctcaacc 421 gtcaacgccg atttagattg gagcacttac attgcctgtt tacgtccaaa aggacgacta 481 cattttgttg gcgtggttcc taaccccgtt tccagtcttg ttttcccatt aatagcggct 541 cagaagtcaa tctctggtag tccgcttggt agtcccgcga cagtcgccaa aatgctcgat 601 tttgctgctc gccataagat tgagccagtt atcgaaactt ttgaattcga tcaagtcaac 661 gaggcattag aacgactaaa tagtgggaaa gcaagatata gattggtgct gaaacactaa 721 attttaagtc aggcaatagc tgctgataag ataatttgca agaacaatca tctttgtaga 781 gacgttggat gcaacgtctc tacacgtgtg aaatctcaaa aaacaatcct tatttaaaaa 841 tatttaaaca gcgttccgga atcctttact gctatgatgt agctagtatc aaacctcaac 901 acgcctcata tagctgacat gaagtagtag tagaaaactg ttgctcgaaa gcaacgagtg 961 ttaaatcaaa tcttgcacct catccaaggg tacgccttga acttaagtac cctacgggaa 1021 gccgcccaaa gggcgtctac aaggctaata gtcgaagtcc gttaaaacgg actggaattt 1081 ttatgcagta agctaattcc atactttggc tattcgcagt agaaatttat tttttggcga 1141 tgctccaaag gagccgctac gcgataagcc agaggcttga cgcaaagcgt atcgcgcaga 1201 gaaagagact tctggcggcg 
tatcgcaaag tttggtgcaa gagatgaatc aacaaagaga 1261 aatcagcaaa aactcagcag gtttggtaaa aagcagccat gcgctaaata ctttttaaaa 1321 acttatctct acaaagagca gccctaaata tgtctgtggc tggctctatt tgctcgctcc 1381 tattgtgcgc gttcgatagg cgctatttgg tcatattgag gtttgagtat gcagaaaaaa 1441 tcaatagttt tagctgagaa tttagcttac gaactcagtt caaccaggac tttatttaaa 1501 ggcatccaag taagcattgg agaagatgag cgcattgctt tagtcggttc caacggtgtt 1561 ggtaaatcta cgctgttaaa gattcttgca ggtcaactta gccccacgac aggttctgtt 1621 acacgtcatg aaacgattta ctatttgcct caaatcagta ctattaaaca ggacatcaaa 1681 gcagatacgg tattcaatta cctgagttct atttctgagg aatggtggaa aattgaagag 1741 attttagagg caagattcag cacgacgctt gacttatcct taccgctttt aaatttaagt 1801 ggtggggaac tcacaaagct atttttggct attggtttat ctgctgaacc aaaagtgtta 1861 ctgctggatg agccaacgaa tcatatggac ttgatggcgt tagaaagttt gagaaatttt 1921 ctgaatgact ttaatggagc gtttgttatt gtctctcaca agccattttt tctagaccaa 1981 gttactgata tgacttggga actgtcacca gagggtgtga aggtttacgg aggaaatttt 2041 tctctgtatc gagagcaaaa ggaaatagaa ttggaggcga gagtgcgatc gcacgaagtt 2101 gcgagaaaag aactcaagcg caccaaagct tcagcgttgc aggaacagca acgagcagcc 2161 cagtctagca aaaatggtcg agccaaattt ctcaacggta gtgtagatag agcagctgca 2221 ggactgctta aaacaaaagc ccaggtttca gcaggaattg caaaaaagag acatgaagcg 2281 gcggtagcaa aggcgactca aagagttgca caaaccaaag tgaaaacaac gaaggcgaca 2341 agtcttcaac tagaagaaag aagccaaaaa cgtagaaatc tcattgatat tcaaggtgca 2401 aatctcaagg taggagaaca cctgctgatt tcaaatatcc agcttcatgt gtcctctggt 2461 gacagaatag ttattgctgg tggtaatggt tcaggtaaat ctagtttggc aaaagcaatc 2521 ttgggaatag agaatacaac agcgattttg gagtcaggag agattttact tgcttctgcg 2581 atgaaagctg tctacctcga ccaaacttat gaactggtga atcgaaagca gacgattctg 2641 gaaaatatgc aagctgctaa tcctaatctc aactatcagc ttttgcgcca gcagttggga 2701 cacttccttt ttaaagagga tgatgtcaac aaaagcgcat cggtgttgag tggaggtgag 2761 ttggcaaggt tggcaatcgc catcattagt atttcacaaa tcgacctgct gattcttgat 2821 gagccaacga ataacctgga tattgaaacg gttgaccaaa tgattgaggg tatcaatgag 2881 taccaaggag cgctttgggt tatttcccac 
gatttggatt tcttaagccg gattaatatc 2941 actcaagctt acaagttgaa tgagcaagcg ttgcaacaga caatgtattt gcctagtgaa 3001 tctgagcaat attatgaaga attactagcc tgtcaagatg gatgatgtcg aggctgaagc 3061 gttgcagaaa catttgctat tttcttggct tttctttctg ctctgagttg ctgaagtttt 3121 ttggaaactg gtggaggatt aagtcgctgt tcttccctca ttgtgtgacc gtgttctatc 3181 cagtttttca tataagcact cacaacttct ttattctgta agtaataaag aattgttgca 3241 taaacttgtt ctagggtcag tgatgtgtat aattgagcaa tttcctctgg agtgcgacca 3301 cgatcaatgt attcgtaaag aattgtttca ataccaattc ttgagtcctt gagccgaata 3361 tcaccaggag ttaggaagtt gaaataatct gttatttgcg ctgtggacat cgtttttctc 3421 ctttgatgtt ttttgtcaaa tgcattaata gtttttccgc ttgtttggca gttaagcgcg 3481 gtaatttagg caacttgtac ctccaaagtc ctggtcaata tttccttgtt taaaagtgct 3541 tttttctcca tacataggag cgctcgccac tgacagtagc atcagcctga ttcttgcccg 3601 aaaaagtcag tacgtcttta ttttatactg attactatca tatataagca taattactaa 3661 gataaaattg gttacaaata gacttagtag tgtgtgctac aaaacaagaa ttgcaaaaaa 3721 caaatttcta tctccacaac agtccagcac tcagggataa gcagtaaagc agacaggcaa 3781 gttataattt ttttgcagat gaacgtaaca aaaatttatg agtgaccccc ttttacaagc 3841 ctttttcgtt ggtagagcag ctgctgaagt cctcaccgag aggttggagt tcgctattac 3901 agacgctctc tcggaactcg gcaaatggga cgctgaaact agagagcaac tgcgccaatt 3961 tacagacgaa gtcctagagc gagcaaatcg tgcagcagat gctgctggtg ctactcaaac 4021 aacaagctat ggagaaactc gctcagcgcc agttgactta caagcgacga ttgatgaact 4081 acgagcagaa atagccacat tgcgaacaga attacaacgc tatcgcagtg gcaattcttt 4141 gtagagacgt tgcactgcaa cctttctacg agaaaattat aaaggaacag ttttggaatc 4201 agagtgtctt ttcttcctac tgactcggtc aaaaaccagc ggtacgcaaa gcatatggaa 4261 aaaggttatt cagataaggc ataccgttgg aatcgcgaaa actactctcg tagacggcgc 4321 tttttagaca tttggtcttt tgtcttgacc ttaatgttta agctttggcg gtacaacaaa 4381 tcttggagtt acccaggtgg agttaccgaa gctaaacaag ctgctcgacg gaaggctcaa 4441 gcggtatgga ttcgcaatac tctgctagat ttaggaccaa cgtttatcaa agttgggcaa 4501 ctattctcta cccgtgctga tatcttccct agcgagtatg tcgaagaact tgccaaacta 4561 caagataaag tgccagcatt tagctatgag caggtagaag 
cgattattga gcaagagtta 4621 ggcaaaaaaa tcccccaact ctaccaaagc tttgaaccca tacccctagc tgctgctagc 4681 ttgggacaag tacacaaagc tgtactgcac tccggagaag ccgtagttgt caaagtacaa 4741 cgtcctggtc ttaagaaact ctttgaaata gatttacaaa ttcttaaggg aattacacgc 4801 tactttcaaa accatcctaa atggggacgg ggacgcgatt ggatgggcat ctatgaggag 4861 tgttgtcgca ttctttggga agaaattgat tatctcaatg aaggtcgcaa cgctgatact 4921 tttcgtcgta attttcgtgc ttacgactgg gtaaaagttc cacgtgtcta ctggcgttac 4981 acttcatcgc gggtattgac tttagagtat gttcctggta taaagattag ccagtatgaa 5041 gcaatagaag cagcgggttt ggatcggaaa attatcgctc gtcagggtgc ccaggcatac 5101 ctacttcagc tactcaataa tgggtttttc catgctgacc cgcatcccgg caatattgcc 5161 gtcagtccta atggtgcttt gatattctat gattttggca tgatggggcg cattaaaact 5221 aatgtgcgtg aaggactgat ggacactctg tttggcattg cttcaaagga cggcgatcgc 5281 gttgttcaat ctctcattga tttgggagcg ctcgcgccgg tggacgacat gggaccagtg 5341 cgacgttctg tccagtatat gctggataat tttatggata agccctttga aaatcaatcg 5401 gtagctgcta ttagtgacga cttgtacgaa atagcatacg accagccgtt tagatttcca 5461 gcaaccttca cttttgtgat gcgagctttc tcgacactcg aaggcgtagg aaaagggtta 5521 gatccagagt ttaattttat ggaagttgcc aaaccatatg ccatgcagct tatgaccgat 5581 atgaatggtt ctgatagcaa tagctttctc aatgaattga gtcgtcaagc agttcaggtt 5641 agcagtactg cattaggact accacgcagg ctggaagata cactagaaaa gctagaacgg 5701 ggagacgtac gtgtgcgtgt cagatctctt gaaacagagc gcctgctacg acggcaaacc 5761 agtattcagc tggggatgac ttatgctgtt attgtgagtg gttttacgct ttcggctacg 5821 atcttattgg taaaagaata tatatggttg gcaatgcttg caggtttcat cgcagttgca 5881 gtgtcaggtt tactgattcg actgctgatg cgcctcgacc gctacgaccg tatgtcataa 5941 atttagtgaa tggtatatga aacttaattt taccggtgtt acggatccgg gacttattcg 6001 ttctaataat caggatgctc actatatcga cccagaaggg cgtttcttca ttgtcgctga 6061 tggtatgggt ggtcatgcag gaggcgaaca agcaagccat attgctactg gggaaattca 6121 gacgtacttg agtgctcatt ggagttcctc taagccttca gagcagttat taaaggaagc 6181 tttattaaag gcgaatgaag caattttgct tgatcagcaa agtcattcgg aacgttcgga 6241 catgggcaca acaattgtag tcgtcatttt tcgtcccaaa gagccaccat 
tctacgctca 6301 tgttggcgac tcaaggctat acctttttcg ggattcccaa ttgcagcaaa tcacggaaga 6361 ccatacttgg atagcacgcg ctattaaaat gggtgaaatt acgcaacaag aagcccggat 6421 tcatccctat cgccatgtgt tgtctcgttg tttagggcgt gaagacgtca atcaaattga 6481 tgtgcaacaa ctagatgtga aaaggggcga tcgcctactg ttatgcagtg atggtttaac 6541 agaagaactc gccgatccaa aaattgctga ctatctccaa ctccctgtgg ttaaaaaagc 6601 tgctctttcc ctagttgaag ctgcaaaaga gcacggcgga cacgataaca tcactgttgt 6661 tattgtcgcg ctggagtagc aaagaggttt ttaaaaaccc gcttttgcgg gttttatatt 6721 ctaactccca aagacttgtg gccaagttgt gactaggaaa acgacaactg ctgcgatcgc 6781 caacaaccct tgaatcaaat ttccaaccag cgtacctgcg acaattccaa taccagcttt 6841 aatcgccaac cccaagtctc gccgataaag atattcaccg ataattgctc caagcagtgg 6901 tccgagtaac attccgagta atggaccacc aaaaggtaat gttggtaata atccaaagaa 6961 acctagtact aacccaacaa ttgcaccaat ttgtccccac tgacttgcac ctgcttttct 7021 tgcacccaag tagcttgcta aaaaatcaac tcccacactc aggagtaaaa caataactgt 7081 gacaatgagt ggtattttta tagcgacaaa cgaaccactt ataaatcccc aaatgataat 7141 tgctattaaa attaaactgg ctcctggaat cgcaggaact acagcaccga taatacctac 7201 agccattaca gcaacaagta accaataaag aatttgcata attgttgagt tgttgatttt 7261 gctatacacc acctaataac agttatcagt taccagttac cagttatcag ttaccagtta 7321 ccagctatca gttatcagtt gttagtgttt actgtttact gtttactgtt tactgttcac 7381 tgttccctgt tccctgttcc ctgttccctg ttccctgttc actgattcaa agtcactgct 7441 aatttatctg caattcccgc aatccacttt tcatcttgtt tagtgtaact gcgaggagca 7501 tttgctgcca aaattaagac tccatcttcg ccaatgggtt gacaaatgac accttgagtg 7561 ttttcgggca aataatcaaa ttcaaccctt ccgggataaa cttttaaatc taccagataa 7621 actggcttgt gtttttctag cactcttttt aagattggtc ctggtacaac ttccgacttg 7681 ggacttagaa ttccccgacg taacaaaact ttatttttgt agaaaacgac gagcgatcgc 7741 gtcaccgtat tagtcaacaa caaatgtgat gcccaagcta gttctgtttt caccgatggc 7801 ggtaaatccg gtgcaagtac aaaaccttct tcgccaatga gttgtacggc ttcaggtaaa 7861 cgcggttgta cttgctgcca aattaaccct actaaaatca agacagcact caaaatcaca 7921 cccagaacat ctgcacgcgc ttgagaattt gtgagttcgc ttgagagcaa acggttcatg 
7981 agcaaaagga cagcgcccaa cccacctact acgataggta gacgccggag aaccctattg 8041 ggatcggatt tagtcatcag tataagtccg tattatcccc ttatcccctt gtttttattc 8101 ctctcctggt tcaataatgc gttggaataa ataaccagtt cctcttgcgg tgaggattaa 8161 ctctggatta cttgggtcat cttccaattt tgcccgtaaa cgggagatat gcacatctac 8221 tacacgggta tccacatggc gttctggtgt atatccccag acttcttgta aaatttctga 8281 acgggagaaa gcttctccag aacggctgac taacaactct agtaagctaa actccatgcc 8341 tgttaagcga atgcgctcat cgcctttata cacttgccgc ttattcgtgt caattttgat 8401 agtcgcaacg tgaatcactc cagaactagg aataccagag gcacctattt tgtctactcg 8461 ccgcaacact gagcgaatgc gggcttctag ctctttaggg gaaaaaggtt tgactacgta 8521 gtcatcagca cctaattcta aaccagtaat tcggtcagca acgtccccca aggctgttaa 8581 cataataatt gggacatctg attctttacg taattcttga cacacgccgt agccatctaa 8641 cttcggcatc atgacgtcta aaacgaccaa gtcaggctct gctttgcgaa aggtatccaa 8701 agcttcctca ccgtcaccag ccgtcaccac atcgtagcca atcatcgaaa gccgcgtttc 8761 taaaatccgg cgaatactgg cttcgtcatc taccacgaga attttttctt tatgactttc 8821 caagtttctc aacgctcctt aactaaaatt tttcatgatt aatttttact accatattat 8881 taagatatca tttcacaaat tgattgaaaa aaacccgcgc atactacaat tattggattc 8941 tagtttcctc aaaaacttaa gtaaagatta agattattta ataatattaa gaatggcaaa 9001 gccaaaaagt tattacgttt gtaacgaatg cggagcagaa tcccctcaat ggtttggtaa 9061 atgtccagct tgcgggacgt acaattcttt agaagagcag attaatatcc agtcatcaac 9121 agatgtacca agtcgcggag tgagtggatg gcacgctcag gcaggaggtg gcaaaacgac 9181 tacgaccaaa ccagctaaac cagccaaacc acgagcttct ttaacttttg accagattag 9241 cgatcgccaa gtcacacgtt gggcttctgg ctatgaggaa cttgatcggg tgcttggcgg 9301 tggagttgtt cctggttcaa tggtgctgat aggcggtgat ccaggaattg ggaaatctac 9361 gctgctgttg caagtgtcga ataaactcgc gcagagatac cgcatcctct acgtatctgg 9421 agaagaatca ggacaacagg tgaagttaag agcctctcgt ttaggagtga caaaagccct 9481 aagtttaata ggtgatggta acggtaatgc caaagtagca caagaaactc ccgaagcaac 9541 tccccaagaa atgcccaaag tagacgaagc tgagggtata ggtgccgatt tgtatatttt 9601 gccagaaaca gacttggaag agattttgcg ggagatggat tcactcaaac cgaatgtgac 9661 
aattattgat agtattcaaa cagtgttctt cccggctctg acttctgcac caggttctgt 9721 cgcccaggta cgcgaatgta ccgcagcttt gatgaaggtg gcaaagcacg acgacatcac 9781 aatgttaatt gtgggacacg ttaccaaaga aggggcgatc gccggaccaa aggtcttaga 9841 acacctagtt gatacagtgt tgtattttga aggcgctcgc tttgcctcgc atcggttatt 9901 acggacagtc aagaaccgtt ttggggcaac tcacgaaatc ggcatctttg aaatggtaga 9961 caacggactg cgagaagtcc ccaatccttc agagctattt ttgggtaacc gcgatgatcc 10021 ggctcctggt actgcaattg tcgttgcttg tgaaggaaca cgcccgattg tcgttgaatt 10081 gcaagcttta gtcagtccta ccagctaccc cgcaccgcgt cgggcgggta caggtataga 10141 ttataaccgc ttagtgcaaa ttttggcggt gctagaaaag cgggtaggga ttccaatgtc 10201 aaagctggat tcttatgtgg cgtccgcagg tgggttgaat gtagaagaac cagcagtgga 10261 tttaggaata gcgatcgcca ttgttgccag tttccgcgac aggatagttg accccggtac 10321 ggtacttata ggggaagttg gcttaggagg acaagtccgc tcagtatcgc aaatggaact 10381 gcggttaaaa gaagcagcaa agttaggatt taagagggca attgtcccaa aagggcaaaa 10441 ataccccgac tataatatag aaattttgcc agtctcaaag gtgatagatg cgattattgc 10501 agcgataccg ccgcagcagg ggctgacaga agaagatttg gcaccggatg aggaggatga 10561 ggagtagaat tatacctctg gtcgcttgca ccggagaggc tttacaggac aaacagcgtt 10621 tgaacaccca aaggaacacc cgtcatgata gccactgcta cccaagatta ccagataact 10681 tgggaaaagt tacccgatga ttacgtacta ccagacgaac cagtggataa cattaaccag 10741 cctcccctta tcgcacttgc cactcatact tacccaattt tgtcaagcct taacaagtta 10801 aaagtgggtc attatgagaa ttttgatagt tgaagatgat gatcgcattg ccaaaccatt 10861 agctgagtat ttaagacgcc aacaccatat tgtggatatc acaagcgatg gaattgaggg 10921 ctgggaatgg tctcaatcag ggttatacga actcatttta ttagatttaa tgctccctaa 10981 attagatgga attactctgt gtcagcgttt acgtgctgct tcatccaaca ctctcatctt 11041 aatgctgaca gcacgagata caacaggcga taaaattatt ggactcgatg ctggagctga 11101 tgattattta atcaaaccct ttgatctaaa agagttagca gcacgcatca gggctttagc 11161 taggagaagt caggagattc gcccaccgat tttaatccac ggggaaatgc aactcaatcc 11221 tgctagccaa caagttactt atgcagggaa tcttctatca ttaactccta aagaatatat 11281 gatattagaa tgttttttga gaaatccaaa tcaagttttg actcgttcgg 
caatccttga 11341 taaactgtgg gaatttgata agtcttctgg agaacaaacc atcaaaactc atatcaccaa 11401 tttacggaat aagctcagag ccgctggaag ttcagaagac tttattgaaa gtatctacgg 11461 cattggttat cgtctctgtc acaaatgaaa caattggtta gtttttagca tttttatcaa 11521 caaagctctc agctaataaa tgataatttg atagtcacaa caatgtgttt caaaaaattc 11581 gttatcggtt attgttctct tacttggtgg tgttcgcatc gctgctagga atatttgcga 11641 tcgcagtccg agttgctttc actcgcagtc tcactcaaca aacaacagat aaactcatag 11701 ccataggaca aggtgcagct gcaaatgcag agtttgaaaa agatcacctg acagtagaaa 11761 gtgactttcg tccacaagac ttaataactc gtcatcaggc attgcagtgg tttgatactc 11821 agggaaattt gatagcccaa caaggacaaa ctgtcttaac tttacctttg ttaccaagca 11881 caatggtaca agttcaaagt ggcaaagtcc ctattcaagc agtgactata ccaattattg 11941 gtagcgataa taatcaacta gttgggtatg tcagagtaag tcaatcccta gaagaatttg 12001 aggaaacgct cgaaaaattg gactggggat taggcggtgg aattattatg actttgattc 12061 ttagtggagt tgcgggaatt ttactcactc gtcaagcgat gcagccaatt gaggagagtt 12121 ttcaaaaact taaacagttt actgctgatg cttcccacga actacgcagt ccgttaatgg 12181 caattaaaat taatgctgat ttggcgctag aatatcaaga agaaatagga ccaaaagatg 12241 tggaaaagtt tcaggcgatc gccagcgcta ctaaccagat gactcgcctc acagaagact 12301 tactattgtt agcacgtacc gataaagttc caaatcggga ttgggagact ctcaatttaa 12361 tgtctatttt aaaaaactta gtacagctgt ataaacctca agctcaagct aagcaaatta 12421 acttgatatc tcagttgact gaaaattttt atatgatggg tgattcagtt caactaacgc 12481 ggctgtttac gaatttaatt gaaaacgccc tttattacac accatcatcc ggcgtagttg 12541 aaatcaaaat gagtcgtgtt ggttctcagc tttatgtgaa tgtgcaagat acgggtgtgg 12601 gaattgcgcc agaggatatc gacaaggttt ttgagcgctt ttggcgggcg gatcagtctc 12661 gttcttattg gggaggtggt tctggtttgg ggttagcgat cgctcaagcc attgctcaaa 12721 atcatggtgg attgattact gtcacaagtc agttaggagt tggtagttgt tttacagtac 12781 gcttaccagc ttcttgacta tcaagacttg ataaaaatta cctttcatgc ttggtaaaat 12841 gaagcagcaa gtaccaaaat atagttacat tcactataaa tactatgaac tttagtattg 12901 ttattcctgt aattaactca aatatttctg gaggtatgct aataccaacg agtcccagca 12961 ctaagagttt agcccaacgt ttttcatacc 
acaaaccaac tgcttcaatt gcagttacaa 13021 tagcatatac tccaatagct attccgctga atttcagttt tgttggactg atattgagaa 13081 ttttatctat aagccattca ataattgtca gttttgtttc taaaacataa gattcagaaa 13141 aggcagctaa gttttggtaa ttttttagtg ctaatagcaa ggtgatagaa gtaacagcta 13201 ggagtgaagc aacaaaagtt ttataaatga caattgctaa taaaccaggt ggacgcttat 13261 tgttcacgta atctcttcaa attgtttcac atctaattcg ttctgattat ccagattcta 13321 caaaataggg attgtaaagg ttgtcttcaa caatccccat ctagttaata ttttgttagt 13381 atgcgcaaca gcagtatttt acttaactgc tgatacttgc aactaaaata gtgtgggttt 13441 caaattcgct acttaccatc attcttgctt tcaccatcgc catcattgcc tttctgaact 13501 tgaatactac tcttggggcg agtggcttca ttcttctcat cttcttgatt agcattctca 13561 gtgtagagaa ccttaccatt accagcatca actttaatat cttgctgggc aatttctaca 13621 gcataaacta aattgccatt ttcattttcg agtttgacgc ggctagcttt acctcctaca 13681 gatgtttcag ctgcttgctg tgcttgttgt gctgtaattt tagctagagg ttgtagttta 13741 gttgcttctt gctgttcttg tgcatcgtcg ttggtttcac catcgccatc actggcttgc 13801 gcaacttgga taacgctgtg atgctgacgc acaattgcta ctggagattg tggttgtttt 13861 gcagacacaa ctcgcgataa tccagcaagt cccaaagtac caaaaaaagc tgttgccaaa 13921 atgatttttg ttgaagtttt catagttgta gaataccttt cattctgggt ttgtcttttc 13981 acggtacgta ataaaagttc agaaatagtc aagattagct tgagttcaca aaattcaata 14041 tcaaatagtg acatcaagtg atgattagat atttaagcta ttttctgtcc atctattaag 14101 ggtaaattcc gcttgagata aaatcttgac tttttcttga cctttggtgt gtatagtgca 14161 atcactatgc acaaaataaa gttataaaaa tttttttctg ctctattgat aggagtgttt 14221 ttcggtactt ttgtaactca attgatatca ataagctata actcattact ttaagaaagg 14281 aggaggaata cgatgaaaaa gatgttaaat agggttccag agattacact tgttttctgg 14341 attattaaag ttctcgcaac cacggtaggt gaaacggcag cagattattt gtcagtaacc 14401 ctgaactttg gtttaagtgc tacatcctat ataatgagcg gcatattatt gattgtgctt 14461 ttgaatcagt ttaggctgaa acggtatatt ccattaagtt actggattgt agttgttttg 14521 atgagtatta ctggcacact gatcactgat agattggtgg acgagcttgg agttagtttg 14581 atgacaacta ctgtaatttt tagtgtttcc ttgttggtag tttttgcgct ctggtattca 14641 aacgaaaaga 
cattagccat gcattccata aatacagcta aaagagaact tttctactgg 14701 gtagctattt tatttacgtt tgccttagga accgcaacag gagatctctt ggcagaggct 14761 ttgagagtgg gttatgcaga atcagcactg atatttggcg cttcaattgc aatcatagcg 14821 agcagttatt attacttcca tatgaatgct gtcttggcat tttggctggc ttacatcttg 14881 actcgtccgc ttggagcatc tataggcgat ttactatccc aacctgcttc aaaaggtggc 14941 tttggtttag gtacggttag cactagtatg ctttttcttt ctataataac gagtttggtt 15001 atttacttaa gcctgaagca gaagaaatca gtatcattgc cgatcaactc acaagactaa 15061 ttggcgatac tcccttggcg tctgcgccct gcgcgatctg ccgctatgct gcagcgtttg 15121 cgcagcgccc cctaaggggc tagctatcgc ctgctcaata ttgggtattt ttctgtttgt 15181 gatctggcga tgatgtattt tttttcctga cttctgactc cttgtactag caacttacgt 15241 tattagtgcc atttagctct tagttgtaaa taccagactt cgctaatctg atttttattg 15301 tgaaaaagaa atgaacaaag ttgcaaaagt cacaattttc ttctggatca tgaagatcat 15361 cgctacgacg ctcggcgaga cggcaggtga cttcatctca atgtctcttg ggctggggta 15421 ttacatagcc tttgccgtaa cgttcgccat cctggctatt ttcctgtttt ttcaaattca 15481 atccgacaga tatcgtccag tcctttattg ggcggctatc attgcgacaa ccacagccgg 15541 aactgaagtt tcagacctga tggatcgatc tttcggattg ggctacgcag tgggatcgct 15601 tatcctggta gtcggtctct tgagcgttct tgccatctgg tactaccggg atcgggatct 15661 gagcgtttat ccgattatga ggaaagacgc agagacaacc tattggctgg ccatcgtgtt 15721 ctccaacagt ttgggaacgg cctttggtga ctttctgaca agcaacttgg gactgagcta 15781 tattgagggc gcattcgtga cggccagcgt cattggtgtt gtcatcgccc ttcactacct 15841 aacaaagttg agcgatgttc tcctattctg gctcgcgttc attttcacgc gacccttcgg 15901 agccaccttc ggagattttc ttaccaaacc agtcaaagat ggcggtctat cactgccaag 15961 gggctatgct tcggtaattg cctttatcct gctggcagtt gtcctattct tttccgtgcg 16021 aaaggagaaa aaagtgcgtc agccacttga gtgatgcgtt taagcagata gatgattcac 16081 cccataatag gaacaccgga cttggatttg cggggggaac gtttctaata acccaagttg 16141 ctagttatca aaaagcgtct tgttttaagt acgaaattca aaaatctagt attttaaagt 16201 actgaaaatt gtgctttatt tattattatg atacggtttc ggcgcactaa ctagcaactt 16261 atgttaatag ctggtttgct tgccctggta ttggcagcca attattatac ccaaatttcc 
16321 gcagtcatgt tgttttggat tgcatttgtc ctaacccgac catttggtgc aacactgggg 16381 gatgttctta caaaaccaca cgaaaaaggt ggcctcggtt ttggtactat cggctcatct 16441 atagtccttg tgtcaatact tggagtctgt atcttattca caactctaaa gcagagaaga 16501 ctaaccgtag ctgggccttc taacgatgaa tgacattaaa gtatgctccg aaaaatttca 16561 accttctggt tgcgtcatat acatcctcat ttggctccct taattgccac aatcggtact 16621 gttggactta ttagttgtct gcttatcctt tttgttttag caaagttagc tgaagaggtt 16681 ttagagcgag aagcttttgc atttgatact aattttctgt tatggctaca tcagtttgcc 16741 aatccaactc tagataattt aatgctattt attacacatc ttggtaatcc taatatagta 16801 gttatagttg caggggttac tctcttgtta ctttggtggc gacgttaccg agaagaagcg 16861 aaagcttttg tacttgcttg cttgggagca ttcattttaa atacaggact aaagttgttt 16921 ttttctaaac ctcgccctga actttggcat cgcttaattt ctgaaaaatc ttttagtttt 16981 cctagtggtc atgcactagg ttctatggtg ctatacggtt ttatcgctta cttgctagca 17041 attcattatc ctaagttatc cagagttatt tacagttttg ccgctatttt catcgcagcc 17101 attggcatca gtcgcttata tttaggagta cattggccca cagatatcat tgcaggttat 17161 ggagtcggtt tcttatggtt gatgatatgt attacgatgc taaaattgca aatattgaga 17221 ctgtctcaat gatgctttag aaattttctc attacggctg ctctgcttgg gaatgccctc 17281 atggaggctc cccttgcagt aaaattaggc agaatcagga aacaaggaat tgcgatcgcc 17341 ctctggggcg ctgcactttg cgcgatacgc acaagccgaa gtccagaggc ttatcgcctt 17401 ctccttcaaa cacgttacgc cccagaatgc ctgtgtgcag gacttacggc agtgttcttt 17461 gcctaatcgc tatcagaagc tgaaggacta gctttttcag aacattctgc gttttgtgcc 17521 tgctttcttt tggcaacaat ttcttccagt tctttcagat gttgtgcgct aattaactct 17581 tcatcttcaa ggttaaatat atctgaactg tactcttctt gttgctcatt taaattttct 17641 aattcttttg taattattga taatgctttt ccttgagcca ttgaggtttt taacacttca 17701 aaccgtgccg cttcatcaag acgacataat aaccgtaaga ttatacgatg aattctaatt 17761 tcattcccaa aatcaaacat tgtggtatga ggttcatcgt cagaacgcaa cagttcttct 17821 cccacatcaa acaaagcttg tacaattgag gcaatgcaat cagttggaat ttcgttgagc 17881 gtgctatctt caagtttttc taaaaatgcc ctaacttgag ttgtactatc aggacgtatt 17941 tgctcagaca attctataag tttctttcca aatagcttgg 
gatcacaagc tgaagcaaga 18001 atagcttgaa tctgagtatt agataaatca tcttctgata aagttaaacg gaagtagata 18061 ggaaatattt ctgaacagca tacacgctgt tgacttcgcc actttaattc ttgttgttca 18121 tcataacatc tctttcccca aacagcttgt aatttaggaa aaataatttt tagcaaattt 18181 ttaacaggtt gcttatcttt gtcttgcaat tgggcaagcc aagaattatg aaagtttttc 18241 agagcatcta aggaagaatc tatttttcct acaaaatagt ttttgttttt acgaattatg 18301 tcatatacca ttgggtaaaa aacccgcagt gattctaaag caataaaatc tattgggttc 18361 acttcgcctt tcactaccgg atatgtcatt gttagggtgt cagtcagatg aacaatgtcg 18421 cggaggttcg taatgaagtg gtctattcct tgaaagtaaa cattgctcca gtaggtttgc 18481 tcaaatacct cagccagcgg cgtggatggg ttttccacat cagcttcctt caagtcagca 18541 acctgctgca gttctggttc ttcagcgctt ggacttgttt catcacacgg ttggctggag 18601 ttgaggtcat cctcgtcgtt tggtgtacca atcagaatac tatcgagttt ttcaaaaagt 18661 agccgacgaa gtgaagtttt atctggatta ggtaactcaa aagcaacttg aatagatttc 18721 tccaaatatg cttctggtga tttctcctga gtatcggaaa gtgctttgtt aacaacttct 18781 ttatcaaaaa ccagaagata gacaacgtta gtaaaattcg gaattgcttt aaaaatgcga 18841 aatagctgtt tgatatcctc tgtgggaagc ctatcaatat cgtcaattgc gacgactatt 18901 cgtcgctgct gctgcaccac tgtgttttcg acttcttctt ttaactcgga agcttccttt 18961 tgttgatcgt taaataatgt tgctactgcc ttaccagctt gagcgtaagg taagggaatt 19021 tcagaaataa ctttagcaaa accggctatt cgctctctca aacctttggg tacatacatt 19081 acagacgtta aaacgctatg taattggtca aaaaagcgcc ttgtaatgtc ttggtttcca 19141 gtaaacaacc aagcactaaa gggaacgatg attggttgct cctcatcagg cttttgctta 19201 agataatgaa ctacaaagtt caacagtgtc gattttccgg aaccccaaga accgtacact 19261 gcaatcacga acccttcagt aacggtcatt ttgcaaatac tgtctgccaa atgcttggcg 19321 aagggcgcat atcctaatcg gtctttttct ggttcgatga agaggtcatc ttgcaggtgg 19381 ttatgagttt cagtctgtgc cataaaaaac ttttgtgagt tgtactgcaa tcgtgacaca 19441 tatagtaggt agcgctatgc caccctacga tagaattctg tgcgaatcgc cgtacatatc 19501 ataggtatgc atatgcctgt ccaaggataa aatttttaag ctgaagtaaa tactttttcg 19561 cgatgcaaat taactggcaa actgctaaaa cctacgaaga cattctttac cataaagctg 19621 atggcattgc aaaaattacc 
atcaaccgtc ctcacaaacg caatgccttt cgtcccaaaa 19681 ccgtttttga actatatgac gccttctgcg acgcccgcga agatactagt atcggcgttg 19741 tgctatttac tggtgctggt ccacacactg atggtaagta cgccttttgt tctggaggcg 19801 accaaagtgt gcgaggacac gctggctatg tagacgatga tggcataccg cgtttgaacg 19861 tgctggactt acaacgtctg attcgttcca tgccaaaagt ggttattgcc cttgttgctg 19921 ggtatgcaat tggtggagga cacgtcctgc acttaatttg tgatttgact attgcggctg 19981 ataacgccat ttttggacag actggtccga aagtcggcag tttcgatggt ggttttggtg 20041 ctagctatct tgcccggatt gttggacaaa aaaaggcgcg agaaatctgg tttctctgcc 20101 gacagtacaa tgcacagcaa gcactagaaa tgggtttagt caattgtgtt gttccaatag 20161 aacaacttga agctgaaggt atccagtggg cccgagagat tttagaaaaa agccccatcg 20221 ccattcgttg tctaaaagca gcatttaacg ctgactgtga tggacaagct ggtttgcaag 20281 aattagctgg caatgccact ttactttatt acatgacaga agaaggacga gagggaaaac 20341 aagcattttt agaaaaacgt ccacctaact ttcatcaata tccctggctg ccctaaaaac 20401 aaaaaaatcg cacctttgat aaaaggggcg attttttgag tgttcaggtc ttttgacgac 20461 ctataaatat taaaagcgaa ttttattttc gatgaactac tttcgtaggt aataattggt 20521 agctaagcct atgtgtgtca tttctacaaa gtgaaatgtg aaatattcgc cctcggaggg 20581 tcatgatcaa tttctcttcc acaaaaatag ctatgaataa attcgctcaa aacagcacta 20641 aaagggtgct tctgcaattt gacttttcct gtttttgcaa aacacagcta tattaagata 20701 gcctcagctt ttgtcttttc agaagagcaa ggagtcgcta tttcctcaac gggatcaagg 20761 gcaggtttgt ttaaacccgc agccatgact gatgttgcat ctggcaaact ccaaacatct 20821 tctaaatttt cccaagaggg agcaaacggt gggagcgcca aagaacaagc cttccaattt 20881 cctcgaactg gtgctcccaa ctgttggcag aatccgccgc gacgtccctc taggttgtaa 20941 taacgacaat atctacaggc agaagtcaat gatttcatct gtttcatatt tagtgccttt 21001 agtgccgttt tagggctaat tttcttcctt tattttgcgt ctacatacag aatgtgggtt 21061 ttagctggaa cctttgaagt acctaggttc tagcgatccc aacgaaaaaa ttaactttat 21121 aatgttttta ggtttgtgtt acatttttca ttttttttgt taacaaagtg atacattaat 21181 tgcgatggat taagcatact cggtcttgac tgaaagtcat tggggatcaa agctaaatat 21241 aaatataacg atgtgagtcg ctcttgaact tcagccttag gagacacttt gtaagatgca 21301 
attaactgta tctttagaaa gaaataacac aattgaatcg caactaagat cagatatcaa 21361 taaatataaa aactctctct ttagggagag gaagttagtg gtaaaaaaca tagtcttttt 21421 agcgattcat cagatcatga aaaaactttt ttgagagata tataatcaaa aaccgactgc 21481 acaatactaa gttattagtc ctgagtagga ttaacgaaag tagaaaaaag aatcaataaa 21541 gaattccgga gaattcctga attttgaatt ctgaattctg actcctttgt ttaaatactc 21601 agcaggactt taaagattgt ttagcagcac atagagaaca gacaccccag cgctgaagtt 21661 cgccgggtgg agtgggacaa tggcagaggg gacaaagggg taaattgtgc gatcgccctt 21721 gcactgttct agcccaattt tggaaagcag cattggcatc ctttgctttg ggcgtcacat 21781 caccgtcgtc tatactcatg ccatcaccca cgtaactcgg atgttcttga cgcgatattg 21841 tttgttgttt cttgggaatg ttttgaggac gctgccatcc agcagtagaa aaatgaatat 21901 ccttcaaacc aattgacagt ttttgattca actttaaaat taagcgctgg cgctcaaaag 21961 ttaggttctg tgcccaagcc gcactcgatg ttgccacccg caaaacatcg cgctggagtg 22021 acaacggttt agtgtgagca actaccatcg atcctacgac ttgggcccaa cacgcgagca 22081 aacgttgaaa tggcggctct tgccattcgg gttgagctgc caaaacctct aaaatatcat 22141 taatggattt caacgaaatt tacctgacct tcgcaaaaac tacagcacag ctgtgcaata 22201 tatttatagc aatcctaaat gataagccct atgcctgcgg caaacgggca tacacaaagc 22261 gtatgcctac ggcacgctag gctttgcgct tacgcttgcg tgccgtaggc atacgtgaaa 22321 tttttcatga agaacactgg gcaagctcaa cagttgtttc acaactaaaa taggactgct 22381 atatgatatt tatcgcgctt gattggcaaa aatatatttt taaattaatc ccgtactcca 22441 ggtttgaggc agataggcaa tgagtgaaaa ccccctaata gatacgtcgg gtgctcaatc 22501 ttctaaaaca atagggttga aaccaccact agcagccgca ctggctagtt tggaagtcca 22561 acttgatcaa gaattagctc gataccgacg cacacgaatt ggatacagaa ccccaaacca 22621 accccgtgca agcatttcca ctagcagtaa acttcaaccg ttcactgttg tgagtgcgac 22681 agcaagcaaa acaaaatcac tacctgagga gaatctcaaa gagactttcc ggaagtatgg 22741 gttggcaaca gagaaaacac atactccaaa tactttgctt ccgcaggaat ctattctttc 22801 agaaacgaaa attaacccgg ctattcctcc agtacaaatt caaaatcagg tgagcgcatc 22861 agatacaata ctggatgctt tgttatcacc agaaatctca cagtcacaga cctcagcttc 22921 ctcaaaaaca gaaaatctac aatctcatgt agattcaatc ccaagcaact 
ttgccgaagc
    22981 acaaatctct ctaccgtcaa atgccattgt cgtacgcaca caaatccaag aaccaagtca
    23041 gagtcagagg gaaaatattt caaactcaca agagaataca agcaaaaaac caaatgacta
    23101 cttaaaatcg tctgaagcac tactgagaag tttagtagaa gatgaaccac caaagactca
    23161 agcggcaagc aattctaaca acagtctgct atctcccttg ggcattggct cgatgttact
    23221 actgttctta ggaagtctga cgctgggcta tatcgtcttc aatttcaaga ccttcccaca
    23281 aattagtttt gatcgattat ttcaacaaaa cgcgcccccc actgggcaaa agacgccaga
    23341 aagtaacaaa aatgtgaaaa ctgtagctca accaactttc acacccatac ctaatcttgg
    23401 aacgaactct acgccaaact caacagcaac aacttcccaa aagccggata cagaaatcaa
    23461 accgtcagca gatggatttt atcatgtgat tattgacaat caaggtgata gcgcttttgc
    23521 ctctgcacgg cgagtcgttc ctgatgctta cttatcaccc gacggtaaaa ttatttatct
    23581 gggtgccctc aagaaaaaag agcaagcaca aacactccta caagaactaa aagcacaagg
    23641 cattaatgcc aaaattcagc agccttaagg cagttgacgc agctctaaaa ggggtaagta
    23701 actgaaagat aatatatggg catatctcgg caactcttgc ggaatgc
//
LOCUS       NODE_1334_length_23291_cov_4.576046 23291 bp    DNA     linear   BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 23291)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 23291)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013).
            Information about the Pipeline can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..23291
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(239..2491)
                     /locus_tag="DP116_12030"
     CDS             complement(239..2491)
                     /locus_tag="DP116_12030"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_019495103.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="alkaline phosphatase"
                     /protein_id="PRJNA477356:DP116_12030"
                     /translation="MIGDGMGWEMARAAAIYKQIQEGKTGATLSDFYTTGEGTGLSYQ
                     NLTGYTLATTYGTTIADSNGVFSTGNSALDNSDPATGGSPVRSGFSFNPAFNPGSTAT
                     GQAKVSDGAVGNLVGYDPERGGINPWTPGNDPEYIKWSYPDSANTATTLYTGVKSYNN
                     AIGVDIFEQPLETILTTANLQGKSTGLVSSVPIDHATPGAAAASVNRRTKYDDEFPNL
                     DNILQQELRIYQPTVLLGGGHPLSTPGNPLPEGVEPPRTNEFITESTYKELSSNPTNN
                     IYDYTFLERGADAAAKLAEIAGAVDPNKGDRLLGLYGARGQNGNLPVSSANGDYSTTG
                     LDNFSVFSTQGKNPDTKRPLLPGETDESFIAREVNENPTLKDLTNAALEVLGKDQDGF
                     WLMVEGGDIDWSAHDNNIDNLIGTVLDFDKAITSTINWIENNGGWEDNLLIVTADHDH
                     YLTLNPNYPSLVRSLGGQALTDLDTPTEAGHFWGSDSNVKYGWGSHSNRPVPVYYQGA
                     ESEVLTSFVGQGYESYGYQIPGIPGLVDQSQIYRTMLAAVTGSSEKPLLQEPQTYYGS
                     AGNDVVYTGSESDTIYAGEGDNRIFVNNGDNKVFAGSGNDLVYAGIGNDVIYVGEGDN
                     TVFAGAGNDEIYGGAGDDQFYAGAGDDLIYAGEGNNIISAGTGNDTVYVGSGENGFIL
                     NAGIGSVTIYGFTSDDYISRGGGLTAPNATTLPPELSLRISGNDTLISLGNDLLATLK
                     WAQLDTVTIV"
     gene            2984..3943
                     /locus_tag="DP116_12035"
     CDS             2984..3943
                     /locus_tag="DP116_12035"
                     /inference="COORDINATES: protein motif:HMM:PF08450.10"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="superoxide dismutase"
                     /protein_id="PRJNA477356:DP116_12035"
                     /translation="MSRLKFILLLVVVVVSLITFDFASTQAKVISLNRYILPGETVFP
                     EGIAYQPKTKDFFVSSTTDGAILRGNLQEESAKVFLPGGADGRTTAVGLKVDDKNRLF
                     VAGGNTGQIFVYDTRSGELLGQFKNQKASTFINDVAISANGEAYFTDSSDPTLYKVSI
                     NSANEIQFEAWLDFTGTSLEYQSGFNLNGIAASLDGKYLVVVQSNTGKLFRIDIDSKE
                     VTEIDLGGERLNNGDGILLSRGKKQILYVVRNQQKLIVKVELEEDFSRGTVVSSTTDP
                     SLAYPTTIAQVSSQLLVVNSQFDKRAPGLTPELPFTVSTIPSP"
     gene            complement(3992..4213)
                     /locus_tag="DP116_12040"
     CDS             complement(3992..4213)
                     /locus_tag="DP116_12040"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_12040"
                     /translation="MNKLTKKNNKGIRTALWAIAAINAIAQSKRRPWRIADFACILTV
                     THMPHLKEAFQARIEVNKTQSGSQVRLLI"
     gene            5208..6377
                     /locus_tag="DP116_12045"
     CDS             5208..6377
                     /locus_tag="DP116_12045"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006195041.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_12045"
                     /translation="MATSENSNQTVGSQGTSNNLTGNGNFSFTDGAWQSTDGSTLNSA
                     ASGGFGAGGAGGAGGAASGGFGGAADGSNTFASFIGGGNSSTTSADGQYTYNYERPLD
                     ERALTDTNNPFNQLIGVVGGDTSALGSSNPFAGGSASGGNSASGSSNPFAGGSASGGN
                     SASGGSNPFAGGSASGGDSASGANNAVAGGNLPGNLPFGSTPPSNQSGSNLPATGNNG
                     LPVPYNSDNWISDIQKLDGANATGNTGQGNGNWNYGSNNTSDGNGNWNFGSGNTTNGN
                     GNWNLGNNNDILGNANTQTGSNNDILGSGNTASVSNSNLLGNRIEASGDGNTLIGNEG
                     WNFSVSGGLISLGSSRPSQAIGTDVSNLLSSPDLVTSAISNPGYNNPNYDFSGIG"
     gene            complement(6888..7538)
                     /locus_tag="DP116_12050"
     CDS             complement(6888..7538)
                     /locus_tag="DP116_12050"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017319243.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_12050"
                     /translation="MPVALTLLSFGANVQSVTAQTTENIYEFSITYDALAIFGPVFNE
                     EKNILDVAVLGESIGDAPFGLTNFDSRTYGRFEDRTTQIFSTFDADPSVFGIEGNVRG
                     DRYFGGSNELFGRASDSAVIDLVQQTIVGGGIINVTGGTGIFSNATGLITFTQEDRLT
                     PGSTDGPLTSRGVAVLNFSIRTPQRVPEPAATTALLGLGVTGAVLLHQYRRRLNNF"
     gene            complement(7862..10944)
                     /locus_tag="DP116_12055"
                     /pseudo
     CDS             complement(7862..10944)
                     /locus_tag="DP116_12055"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015129569.1"
                     /note="frameshifted; Derived by automated computational
                     analysis using gene prediction method: Protein Homology."
/pseudo /codon_start=1 /transl_table=11 /product="ATP-binding cassette family protein" gene complement(11615..12601) /locus_tag="DP116_12060" CDS complement(11615..12601) /locus_tag="DP116_12060" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12060" /translation="MVSKSKTKAESQKSKARTAANSKKTVIEPKIQEANNAKSTAPRA RSKQTDEPALKTQATKKVKSPNGKANSTTGKATKQANSKTKVQPTKKAKVAAGKSSST TNKRTQKVVSQIDIKPTEQPTFSTAKASPTGAKSVSQTKVQPIPKAKTSTAKTRHKNK KNSFENNFFSEPEVDRKSGRKANLDLKASTVQAAFKKFQSKIVDLERSTTEKGRTSRD YLFKQVKRLVKNHSDFPPLKGNYIPFGSFARKTKIRPLDDIDILLLLNGRGINVEISY QQKPSGYGVNMTAVCRVKITDARSPLQAFADDKGYVNSIKILNRIKHHHRRQ" gene 13046..14161 /locus_tag="DP116_12065" /pseudo CDS 13046..14161 /locus_tag="DP116_12065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655380.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Zn peptidase" assembly_gap 13283..13292 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 14148..14663 /locus_tag="DP116_12070" CDS 14148..14663 /locus_tag="DP116_12070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655379.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12070" /translation="MQSLKGIILDATALIDFRWLNEWVWLQRYYSPLYIAQELLDSDQ LEPPTRQAANQYLTPLALSTEEMFASFLEFSVRAPLLSVADRSTIAIARHQLLICASD DGLVVETCKAYGVAYTRTLRLLTEMVETAHKTVIEVTEMADSLINERGKHISPKVLTD WTTSLQKYSTS" gene complement(14699..15420) /locus_tag="DP116_12075" /pseudo CDS complement(14699..15420) /locus_tag="DP116_12075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002059879.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" gene 15501..16508 /locus_tag="DP116_12080" CDS 15501..16508 /locus_tag="DP116_12080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="site-specific DNA-methyltransferase" /protein_id="PRJNA477356:DP116_12080" /translation="MNPILHLQDINPAYFTNNGASYCGDSRELLQKLPDSSVNLVITS PPFALQRKKEYGNKEQHEYVDWLTEFAALVYKKLRDDGSFVVDLGGAYEKGVPVRSLY NYRVPIRFCDDIGFLLAEDFYWYNPSKLPSPIEWVNKRKLRAKDSVNTVWWFSKTEFP KADVTKVLAPYSDRMKKLLEDPDKFYKPKVRPSGHDISKAFAKDNGGAIPSNLLQISN TESNGQYMDGCKAVSVKQHPARFPAKLPEFFIRFLTDPGDLVVDIFAGSNTTGQVAQT ENRLWLAFEEQPEYLAASAFRFLTKENTTLEMQEIYNLICAGKSVDLNTYQKQTLLSL F" gene 16934..18115 /locus_tag="DP116_12085" CDS 16934..18115 /locus_tag="DP116_12085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ISAs1 family transposase" /protein_id="PRJNA477356:DP116_12085" /translation="MQSFFSEIEDPRVPRTRAHLLTDILIIGIFSAIAGGKGWEDMEN YGLSKHDWLKEFLALPNGIPCPDTFRRVFERINPKAFERCFRRAWVQSVVETVGAQVV SIDGKTLKGSYNREQGKSALHLVSAWASEHRLVLGQVKVTDKSNEITAIPALLELLDL AGCIITIDAMGTQTAIAKRCCEAQIATQIYNAKADYVLALKANHPTLHGQIKTWFDQA AADQFQGITVSYDERIEKGHHRTEKRQVWSVPVSQLPPLHNQDDWVGLQTVVMVVRVR HLWNKTTREVQFYLTSLESDACKLGQAIRLHWGVENGLHWTLDMTFSEDACRVRTGHA PQNLALLRRIALNGLNREQSLKRSNRQKSNRAAMDNNYMLTILAACLSQHNDTSKPAC Q" gene 18564..19253 /locus_tag="DP116_12090" CDS 18564..19253 /locus_tag="DP116_12090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318575.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="conjugal transfer protein TrbI" /protein_id="PRJNA477356:DP116_12090" /translation="MSQFHRWKSGAAGVMAMAITTGTIAPMFMFAPASAQSIFRGQGS QAPTGRVSIPAGFTLPITYDKDKIVVTPDETTRIKLKVAKNLVDSSRNVLVPEGSEIE GQLQPITRNGEKGIYFVAQDLILPNGERQSINATSRVITRKEKISKGGRTDKIIQDAA IGAGAASVISLITGDRKIQALEPILGAGAGAAASVLLRRKNTEVIVIEPQRGDLDLTL RSSLFVSRGYY" gene 20297..21550 /locus_tag="DP116_12095" CDS 20297..21550 /locus_tag="DP116_12095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459849.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S-layer homology domain-containing protein" /protein_id="PRJNA477356:DP116_12095" /translation="MFGHFRWQSGCAAFMILGITTGTIAPLVIPTASFAQTSFIDVQS NYWAAEFIRELAQRGIVAGFPDGSFRPEQAVTRAQFAAMIGKAFRKAPERQAVRFDDV PSSYWASSAIQEAYTTGFLSGYPGNRFEPNQNIPREQVLVSLSSGLDYKVSGNTDTIL QSYDDTNNISGYARSPVAAATEKQIVVNYPNIKFLNPKVTTTRAQVAAFIYQALVSSN QASAINSPYIVALAERTPSKPVAVTIPEGTVIPIRYEKAEKILVTKEEIVPLTLTVAQ NVVTDKGTLVIPAGSQVVGELRPVKDKNGSQFVAQKLVLTNNGQEYPINATTDVITKT ETVKKGINTKTILRNTALGAGAAAAVSIVTGDKAISAGEVLGGAAIGGLVGLFFGKKS VDLIAIDPKTDLQMTIGENFQVSLK" gene complement(21661..23091) /locus_tag="DP116_12100" CDS complement(21661..23091) /locus_tag="DP116_12100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743105.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_12100" /translation="MSASSEFVALCREQLALLAQGLEASLSVVYLTQELIEASTSEAK LIPVVVYPETTVEQQGASALVLPTSIPISNPNWASYRSASVTLDKTLHSRTGDVFNQK HNRNNRRLLKAAEDFPAPSRETATGDTAPPHSTEEEEYLLKGDQIVLPLVHEDVMMGL LVTVREDRVWNEQEKSQIERIAQTLSLACILDQRRAWLHQQLQQQQIFQEKQQDLLDN LLHQLRNPLTALRTFGKLLLKRLRPGDANREVATSIVRESDRLKELLQHLDEVIDLTT EDLEPLTLPQKEVLVEANVQKDFIAPLLLPGAGEKEVNCSVLDILQPLLVSARAIAQE RNLQLISEIPPNLPLVRANTKALREVLSNIIDNALKYTPTGGKILIQTRQEKPHFQGI AISDTGPGIPQQDLEHLGERHYRGVQAQTEIPGTGLGLSIAKQLIEQMQGEIQVFSPA LTSLRSSPDTPGTTFIIWLPVAHREY" BASE COUNT 6345 a 4971 c 5022 g 6943 t 10 others ORIGIN 1 ccttcttgtc cacgggcgta aatttcgctc aagccaagcg tattttatct ggtggtctct 61 tcttcggtaa attccacatt ggtgacttga tttggttgag gtttctgtgc taccttcacc 121 tgtgttttag gtgcagtcat agaaaagcct ctgaagtctt gctataaaac ggcttcagaa 181 gctcagacaa acaatcacat attttttgtc ttccttcact tcccagttaa aaataagact 241 agacaatagt cacggtgtca agctgcgccc acttgagggt tgccagtagg tcgttaccaa 301 ggctgattag ggtatcattg ccgctgatgc gaagacttag ttcgggagga agagtagttg 361 cgttgggggc agttagacca ccgccacgac taatgtagtc atcggacgtg aagccataaa 421 
tggtgacaga accaatcccg gcgtttagga taaagccatt ctcgccactg ccgacataca 481 ctgtatcgtt gcctgttccc gcagagataa tgttgttacc ttcaccagcg tagatcaggt 541 catcgccagc ccctgcataa aactggtcgt ctcctgcgcc accgtagatt tcatcgttac 601 cagcacccgc aaagactgtg ttatcacctt cgccgacgta aataacatcg ttaccaatac 661 cagcatacac aaggtcgttg cctgaaccag caaacacctt gttgtcaccg ttgttcacga 721 aaattctatt gtcaccttca ccggcgtaga ttgtgtcact ctcagatcca gtgtaaacga 781 catcattgcc agctgagcca tagtaggttt gaggttcttg tagcaagggc ttttctgaag 841 aacctgtgac cgccgccagc atggttctgt agatttggga ttgatctaca agtccgggta 901 tgccaggaat ttgatagccg taactttcat aaccctgacc cacaaagctg gtaagcactt 961 ccgattcagc accttgatag taaactggta ctggacggtt ggaatgactt ccccacccgt 1021 acttaacatt ggagtcagaa ccccagaagt gccccgcttc tgtcggtgta tctaaatccg 1081 tcagcgcttg accaccgagg cttcgcacca aagaggggta gttcggattt aatgtcagat 1141 agtggtcgtg gtcagctgta actatcagca ggttgtcttc ccagccaccg ttgttttcaa 1201 tccagttaat tgtagacgta attgctttgt cgaagtccaa cacggtgcca atcaggttat 1261 ctatgttgtt gtcgtgggca gaccagtcaa tatccccacc ttcgaccatc agccaaaaac 1321 catcttgatc cttacctaaa acctctaatg ccgcatttgt caagtctttg agggttgggt 1381 tttcgttgac ttctctggcg atgaaggatt catctgtttc tccaggcagt agcggtcgct 1441 tagtgtcggg gttcttgcct tgagtggaga acacgctgaa gttatcaagt ccagtcgtgc 1501 tatagtctcc attggcagaa cttacaggta agttaccatt ctgcccacga gcgccatata 1561 aacccaatag gcgatcgcct ttattagggt caacagcacc agcaatctca gctagcttgg 1621 cggcggcgtc agctcctcgc tccagaaagg tgtaatcgta gatgttatta gtggggttag 1681 aactcagttc tttgtaggtt gattctgtga tgaactcatt tgtacgaggt ggttctacac 1741 cttctggtaa agggtttcca ggcgtggaga gtggatgtcc tccacctaac aacaccgttg 1801 gctggtagat tcgtaattct tgctggagga tgttgtctaa gttggggaac tcatcatcat 1861 actttgtacg gcggttcacg ctggcggcgg ctgcaccagg ggtggcgtgg tcaataggta 1921 ctgacgatac cagacctgtt gattttcctt gcagattagc tgtcgttaga atggtttcga 1981 ggggttgctc gaaaatatct acgccaatag cattgttata gctcttcaca ccggtataca 2041 gggtggtggc agtgttcgca gaatcgggat agctccactt gatatattct ggatcattgc 2101 caggagtcca 
ggggttgatc ccaccacgct ctgggtcata accgaccaag ttacctactg 2161 cgccgtcgct gacttttgct tgaccagttg ctgtggaacc tggattaaag gcagggttaa 2221 agctgaaacc agaacgaacc ggactacctc ctgttgctgg atcagagtta tccaaggcgg 2281 agttaccggt actaaaaact ccgttactat cggcaatagt tgtgccgtaa gtcgtagcta 2341 gggtgtagcc agtcaaattt tggtagctta gtcccgtacc ctcccctgta gtgtagaaat 2401 cacttagggt tgcgcccgtt ttcccctctt gtatctgttt gtagatagca gcagcacgag 2461 ccatttccca gcccatgccg tcaccgatca tgatgataac gttctttgcc acgagtacta 2521 cggtctccta atactgagaa ataaaaatta ctggcacaaa aacttgagat cagctttctg 2581 cacctttctc tctaaatctt tcaaacttag gttaaaaatg aagaatgttt tgattaaaaa 2641 taagtatttt tttaaattgc gatgtagcca agtggaaatt tcctcttatt gactggagtt 2701 tagactaaaa gtcggcgatc actcctgaaa tagtttaaat gtactaaaaa tgtggaaaca 2761 aggtacaggt ataaaagttg caagagaaac ataacctaaa agcatttgtc tattattaga 2821 cgcattgcca aatcttccta aatcaacaat cgtacccaat acggttggtg ataagcacgc 2881 ttgacagaaa cccggtttct ctgagcaaca attacgttaa ccgaactctt gatttttctt 2941 ctctggtttg agattgttaa ctaaagtata cgtaaaaaaa ctgatgtcaa gattaaagtt 3001 tattttactg ctcgtagtcg tagtagtctc attgattact ttcgattttg caagcactca 3061 agctaaggtt atcagtctca accgctatat ccttcctgga gagactgttt ttccagaagg 3121 tattgcctac caacccaaaa ccaaggactt ttttgtgagt agcaccacag acggcgcaat 3181 tttgcgagga aatctccaag aggagtcagc caaggtgttc ttacccggtg gtgctgatgg 3241 gcgcacaaca gctgtcggct taaaagtgga cgacaaaaat cgtttgtttg tagcaggagg 3301 caatactggt caaatatttg tctacgatac ccgcagtggc gaactactcg gtcagtttaa 3361 gaaccaaaag gcttctacat tcattaatga tgtagctatc tctgcgaatg gagaagccta 3421 tttcactgac tcatctgatc caactttata caaagtttcg atcaattcgg caaatgaaat 3481 tcagtttgag gcttggctcg attttactgg tacgagtcta gagtaccagt caggatttaa 3541 cctcaacggt atcgccgcaa gcttagacgg taagtacctg gtagttgtgc agtcaaacac 3601 aggaaagctg tttcgtatcg acattgatag caaggaagtc accgaaattg atctaggcgg 3661 tgaacgacta aacaacggcg atggtatctt gctctcgcgc ggtaaaaagc aaattttgta 3721 cgttgtgcgt aatcagcaaa agctgatagt caaagttgaa ctagaagagg atttctcccg 3781 aggtactgtt gtctccagca 
cgactgatcc gtcgttggct tacccgacta ctattgccca 3841 agtgagttcg cagctgctag ttgtcaactc tcaatttgac aagcgtgcac cgggactgac 3901 acctgagtta ccgtttactg tttcgaccat tcctagtcct taaaacagtg aacagtgaac 3961 agtgaacagt gaaactgttt actgttcact gttaaatcaa caatcgcacc tgtgaaccac 4021 tctgtgtttt attcacttca attcgggctt gaaaggcttc tttgaggtga ggcatatgag 4081 ttacggttaa gatacaggca aaatctgcga tacgccaagg gcgtctcttg ctttgggcga 4141 tcgcattaat tgctgcgatc gcccaaagcg cagttcttat tcccttgtta ttcttcttcg 4201 tcaatttatt cactcgttag cggaacttgc gaactatgac gtacattttg aattcgcgat 4261 gctcctaaag ggagcctttg caaggcgcga tcgcggcata gttgatggca gtctttatct 4321 tccatcaaat aatactatct ttggcttcgt ataaagtagc gatatatctg ctagcatttg 4381 gaaactatga acaggtaaat tatcctcggt atgttttttt agctttagct aaagctcgtt 4441 ctttcaggat tatttggcac taattcttca gcaataaatg gcatctttaa cagtgacaag 4501 tacgcaataa ggcacaaaca agtacattgt gaccagcgcc aattatcaca agatcataag 4561 cttcaatata tagtaacttt cggaatacag tatcccctta aatgctagaa aactcttaag 4621 aagtataaat ctgtcggcgg ttagaattca gagatttggc atctgtgtga ttaatatgtt 4681 tgtttactat ctgtctgtag gaggatgcta actacagatt agccagcact aatgtgtgga 4741 agttacaatg cacattgcag gagaatcgcc aaacttattt attattagat aatgcctgtt 4801 ttcaagagtg agcgagcaaa tccggaaagt atcgctttta aggacacaga gagacctgaa 4861 ttctaccgtg catgatattc tgcctgaacg aaatcaaaag aattaacaat caaatctaaa 4921 ttttgttcat ttcaaacgga aatattattt cgtatacttt tgttgagtgt ggatgtaact 4981 gttctttagt ttgattttgg gttaattaag gcttgatgaa atcaaactat tttcattgat 5041 atttaactga attccaagca aattctaact agctgtcaaa aatagttatg tagttagaaa 5101 acagtatttc tttattgaag atgaaagtca aaaaatgctt tcattcatgt cgttgatctt 5161 cctctttctg ggggaatttg agttttagtc ataggagttt aattctaatg gcaacttctg 5221 agaatagcaa tcaaactgtt gggtcacaag gtactagcaa caatttaact ggtaatggca 5281 atttttcttt cactgatggg gcatggcagt ccaccgatgg cagcacttta aatagtgccg 5341 ctagtggtgg atttggtgct ggcggtgctg gcggtgctgg cggtgccgct agtggtggct 5401 tcggtggcgc tgctgatggt agtaatacct ttgcttcctt catcggtggt ggcaattcat 5461 ctacaaccag tgcagatgga cagtatacct 
acaattacga gcgaccactc gacgaacggg 5521 ctttaactga tactaataac ccgttcaatc aattgatcgg agtcgttggc ggtgacacca 5581 gcgcattagg tagtagcaat ccctttgctg gtggtagcgc atctggtggt aatagtgctt 5641 ccggtagtag caatcccttt gctggtggta gcgcatctgg tggtaatagt gcctccggtg 5701 gtagcaatcc ctttgctggt ggtagcgcct ctggtggtga tagtgcctct ggtgctaaca 5761 atgccgtcgc tggaggcaac ctccccggca acttaccatt tggcagcact cccccctcga 5821 accagagtgg tagcaaccta cctgcaactg gcaacaacgg tctacctgta ccttataaca 5881 gcgacaactg gatatctgat attcaaaagt tagatggtgc aaatgccact ggtaacactg 5941 gtcaaggtaa cggtaactgg aactacggta gtaacaacac aagcgacggt aatggtaact 6001 ggaattttgg tagtggcaac acaaccaacg gtaacggtaa ctggaactta ggcaataaca 6061 acgacatcct tggtaatgcc aatacacaga ctggtagtaa caacgacatc cttggtagtg 6121 gcaacacagc tagtgtaagt aacagcaatc tccttggaaa ccggattgaa gccagtggtg 6181 atggcaatac acttattggt aatgaaggtt ggaatttctc tgtgagtggc ggactgatat 6241 cgctaggtag tagtaggcca agccaagcta taggcactga tgtcagcaat ttactcagca 6301 gtcctgacct tgtaacatct gccatatcaa atccaggcta caacaacccg aattatgatt 6361 tctctggcat aggctaaaag ttgtagcaca tcgccagttt ttgcattgat aagcctgtct 6421 ataagcttgc gggcgcatag cgcccgcagt accactaagc catactttgt tgttgtatta 6481 ctaatcaaat tgagtgttta agcatttcta tgtaagttca aaatatgtag cagtcctaaa 6541 tcaacaatcg cacttgtgaa ccttgttgag ttttactcac ttcaattcgg acttgaaagg 6601 cttctttgag gtgaggcata tgagttacgg tgaggatgca ggcgaaatca gacgcgatcg 6661 cgccttgggc gatgctccca aagggagccg cccaaggcgc gatcgctttg gacactgcgc 6721 tgcttatttt cgctcaatgc aagagttgct gtcaccgaag tgccaacgtc attgaaaccc 6781 tttcatacag gaaatttcag cataaccttg tcataaattt cttgagcttc tccgaaaggg 6841 gtaaaaatag cgaatttttg cgcttttttc aacagttagt actcacgtta aaaattatta 6901 agccgacgac gatattgatg tagcaacacc gctccagtaa cacccaagcc gagtaacgcc 6961 gttgtagctg ctggttctgg aactctttga ggagtccgta tagagaaatt tagtacagct 7021 acacccctag aggttaaggg tccatcagtt gaacctggag tcagcctgtc ttcttgagtg 7081 aaagttatta agcctgtagc gtttgagaat atacctgttc caccagtgac atttatgatg 7141 ccaccaccga caatagtctg ttgaaccaga tcgatgacag 
cgctatcgct tgctcttcca 7201 aataactcgt tagaaccacc aaagtagcga tcgccacgta cgttgccttc tataccaaat 7261 acggatggat ctgcatcaaa tgtgctgaat atttgggttg tgcgatcctc aaaccgcccg 7321 taagttctgc tatcaaagtt agttaatccg aaaggggcat caccaatact ttcacctagg 7381 acggctacat ctagaatgtt tttttcctca ttaaaaactg gaccaaagat agctagtgcg 7441 tcgtaagtaa tactgaattc ataaatattt tccgtagtct gtgcggtgac ggattgtaca 7501 ttcgctccaa aactaagtaa agttaaggct acgggaatga accatgtgac tttttgacag 7561 agcataatag tcctttgctt tttaggtgaa aacacagact gtaaataaaa tacgtgtctt 7621 acccagaata atacgtaaaa atcaagagcg gcttaatgaa tttttcataa aataaaacga 7681 aaccgctgca aatcttaact cgtctcccgc ttcatcgtct tctccatcgg caactacaac 7741 tttgactgca caagctaagg ttctccaaaa agcacaacaa gagttagcta agttagaggc 7801 tgcttgaaag aacgacatat tcaacagcct ttgtaaataa ctgaaaaaac acaggtgttg 7861 attaaatcaa caagcgcacc tgtgaaccac tttgtgtttt attcacttca attcgggctt 7921 gaaaggcttc tttgaggtga ggcatatgag tcactgtcag gatgcaggca aaatcagaag 7981 cgatacgccc ttggcgtctg cgctttgcgc aatcgcatta atcgctgcaa tcaggcgatc 8041 tgcgcgcagc gcagcagcga agctatcgca tccttctgca tcttgtgtcc caaatccttc 8101 atccacaatc aacaattgca acgccgcccc cgcccgttgc gctaataatt tcgccaaagc 8161 taaacggatt gcaaagttaa ttctaaacgc ttccccacca gaataagtct cataagcccg 8221 cgtccctctg gcgtctgcaa tcaaaatgtc caacgtatct atcagcttgg catttttctt 8281 gcttgcctta ccactctttc cggctctttg tgtcacaaat tgaacgtgta attgattggc 8341 actcaatcgc gaaagcagtt gatttgtctc agcttccaat tgtggcaaga cgttctcaat 8401 catcaatgct tggataccat ttttaccaaa agcttgtgat aactcctgat atatacggta 8461 ttgtcgcctt acttcttgga gttgctgctg ctgttgatcg tactgaattt gcaatgtttc 8521 cagttggtga gactgctgct gcaagcgtcc taactgggca atttgttcat caagttgtcg 8581 tctgcgtgat gttatctgct gctctaatac ttgaatttga gtagatgggt ttgctgattc 8641 ttctagctgt tgaacgatgc tttcaatttg tgctccatat ctttgtcgtt ctgtcaatct 8701 tgcttgcaaa gaaatctcta actcttgtaa tcttgttttg agttggggat actgttgctg 8761 tgtaccaaca agttgttgat agcgtaattc ccaagcttga gttttgcgta cagctatacg 8821 aagattgtta tgttgttctg acacgtagcc aatttcagta atctggcgct 
caagggcggc 8881 gatttctctg gcacattcag attcagtggc ttcttgctgg attctggtgt gtaattgggt 8941 aatttgtgcc tcaatttctg gttttcgtcc tgctaattgt gcttgacgtt tagctgcgtc 9001 ttttatttgg cttaatctaa tttctgccca tcgccaccgc tcaacttcgt tacgagcaag 9061 ggcgtggtct tgttcattat agttgagttg ttgtagatat tgttcgagtt ggtggagttc 9121 ggcgtgtgtg tcgtgggcgt agtcatttat ttggagcgat cgctccaagt gctgcttttc 9181 agcaacaatt tgttgtagct gttgctcaac atcacttgtc acctccaact gtgctgccaa 9241 ttgccctctt tgttcgcgta aggtatcata aggagacaat tgttgcgata tttctttata 9301 ttcctgtctg agaacctgaa tttctctgtc tgaaacagcc attcgttctc gaactatcca 9361 caactcatcc tctgcatctt tatactctac tttcgttttg tctactacac ggttccagtg 9421 atgttcatct agaggacgtt cgcacaaagg acagatagca ttagggttct gaagcatttg 9481 cagcttttgc tggagttctc ccagtagttt ttcaaagtcc cgttgatgcg cttgcaaacg 9541 ttcgataagg tgacgccttt cctgtccttt ttcctgcact cgttgcagat aaactcgctt 9601 tttctccagt tccgaaatct gcatcgccac ttccaccact gcttgttgca gttgcggttg 9661 gcgtctatgc tgacgttgga gttggttttc tgtcgcttgc aattgttcaa atcgagccac 9721 taaaccagca tgagcacgat ccaattgact ttgcaatctc acccgttgtt gcaaaagagg 9781 agtgacttgt aattgcagct gatccatttg agcaacacgg ttgcgtgcgg tcgcaagttg 9841 tgctaatgct gcttcgactt catctgactt gctcagagtt tgccgaagtt cttgctcttg 9901 ttgctgtaaa gccgcgagtt gtgcttctgc gtgttgtaat tgccgttcaa tttcgttaat 9961 ttgttgagtg agttgctgtt gcttttgttg tcgtgcttgt tgggcgcgag tgtgctcttc 10021 aaatttagcg cccattgctt cttcttgaga ttgcagaatt tggtattgat tgtacccggc 10081 ttgaatttca gcttcttgag ttaggagttc ttctaaagaa gagagttgat tgtcaattgc 10141 tgagtgttct tgagtcaggc gatcgcaatc ttgagtcaga ttttgatact gctgcctaac 10201 aaaactgagt tgttgctccc agttttgtcg ctggtgctgg acaacttgca aactttgtaa 10261 ttgaatattc tcaaacgctt gtacttgttg caattgattg atttgccctt ctaactcggc 10321 ttgtttttgg gcagttactt cacgctgttg cagttgagat tttaacgaat ccaaaaggcg 10381 ctctaactct tccgctcttg ctttaaactg acgtgagagt tcttttgccc gttcttctaa 10441 ttcatcatac tgattaagtt ttaataactc tgctaatatt tccttacgct cagttggtcg 10501 cttcagcatg aattcatcag cacgaccttg acgcaggtat gctgaattaa 
tgaaagtgtc 10561 gtaatctaac ttaatatgtt ctaaaatgag atcttgcgtt acccttatgc ccttaccagt 10621 cagtgcgcgg aaacctgtag gcgtttgtat ttgaaattct agagcaccat ttccaccacg 10681 cgatcgcgtg cgaatcactt tatatgtttg ctggtttgct tcaaaaatga aatcaactct 10741 aacttctttt gcacctgtat ggataacatc atcttctgaa ggagctcgac tttcacccca 10801 aattgcccag gtaatcgctt cgagtaggga agattttcca gcgccattgg aaccacaaat 10861 acaagccgta tgtaacccac gaaaatctaa agtcgtgtca cagtaactga ggaagttttt 10921 gagggtgagt tggattggaa tcattgagga ttaggcacca cacctttttt tagaagtaac 10981 agtacaaatg ctcttattgc tagtttagct aggttattat catgtttata gtccttaaca 11041 ctcgaatttt acgaatgtta acaatatagt gagcaaattt ttcttattag gcaaacttct 11101 tttactgtcg tcgtggaaaa tttaatcttt catcaggttc tttcctcttt ccttgtgata 11161 aacaataagt ttgtttgatg ctttaaactc tttataatgt caggcaaaaa agaagacatg 11221 gggataagga aaaagacgag caaaacacat agagtcacac agtacgggaa atttctctgg 11281 taccccttat ctgactcaac ccatctcccc cattttccct tacctcttat ttgcctttta 11341 caggtttatt cgctcaggtg cattccggtg aatttaagaa ctttggcaaa acgtggttgc 11401 gaaaacgtca ttataattca cttaaaaaaa tggtgctgct tggaactgcc atgactttgc 11461 tgaacaaact gtaattgaaa ttagacctga agagagaatt gaggggatat gcatgaacgg 11521 cgtctgccct gagtccttac ggacacgctg ctttgttggc gtagcctgcg ctttgcgcat 11581 aggggcgatc gcgtttttac gttatttgca caactcattg ccgacgatga tgatgcttaa 11641 tccgattcaa aatcttgatt gagttgacat accccttatc atctgcgaat gcttgaagag 11701 gagaacgagc atccgtaatt tttacccgac aaactgctgt catgttgact ccgtagccgg 11761 atggcttctg ttgataggaa atctcgacat taattcccct tccgtttagc aggagcaaga 11821 tatcgatgtc atctaagggg cgaatctttg tctttctagc aaatgagcca aacggaatat 11881 agttaccctt aagcggtggg aaatccgagt gattcttgac tagtctttta acttgcttaa 11941 acaagtagtc acggcttgta cgtcctttct ctgtggttga ccgctctaag tcaacaatct 12001 tggattgaaa cttcttaaat gcagcttgaa ctgtgctagc ttttaggtca aggttagctt 12061 tccgtcctga ctttctatca acttccggtt cgctaaaaaa attattttca aagctatttt 12121 ttttattctt atgtctagtt ttggcagtag aagttttagc ttttgggatt ggttgtacct 12181 tggtttgact gacactttta gctccagttg 
ggctagcttt tgcagtagaa aatgtaggct 12241 gttcagtcgg ttttatgtca atctgactga ctactttttg agtacgttta tttgttgttg 12301 agctagattt accagcagca accttagctt ttttagttgg ctgtaccttt gtcttactgt 12361 ttgcttgttt ggttgccttg cctgtcgttg aattagcttt tccgtttgga ctcttcacct 12421 ttttagttgc ttgtgtctta agagcaggct cgtcggtctg tttggatcta gctctcggag 12481 cagtggactt agcattatta gcttcctgta tcttgggttc aatgactgtt ttttttgaat 12541 tagcggcagt gcgagccttg gacttttggc tttctgcttt tgttttagat ttactgacca 12601 tttttttgta aatggtgatg ctttgtattt tgaagccgaa gagggagata atgaagacaa 12661 ctgaaggagt gggctatgag tgtaatacca ccccttcaat aaactaaacc taagacctat 12721 accgagttgc gttcaatctc tacaattgaa cgtaatttgg tattagtctt ttttacgaac 12781 acctttaaaa ggttctccgt tttgcttcac gtccataaac cgtccggttt cagcatcacg 12841 cttcacccat tgttcggtca cagggttgta agtttggcta cggttgtcca cagcaccacg 12901 gcgatagccc ttgcctgtat tgtttgccat atcttcatca cctccttaag agagatgtaa 12961 aagcgctttt acattttata ttataattag agaaaatagc atcaacaagt ttgacacctc 13021 tagcgatcgc tctcaagaat cagccatgaa agagatcatt gctgcgaacc tgatccgcta 13081 ccgtaaaagt ctaggcttgt ctcaagaaca acttgcagaa caagctgggg taactcgcca 13141 gagcatcaat aactacgaga acgccaaaac tttaccagac agcaaaatcc tttctgctct 13201 ggctagtgct ttgggcatta cactcgatga cttgctacgc tcacaaggtg aaggactacc 13261 caactttcgg ttccgcgctc atnnnnnnnn nnaggtttcc tttgacaaaa atgcccagtt 13321 tgcggcccaa gtgctacgaa tgctgcaaac ttataacgcc ctagaacaag ccgttggctt 13381 accgacctac accccagaaa gtacaccttg ccatcaagta gaaggcaatg aaaagcacat 13441 tcagacaata gctgctttgt ttcgtcatcg cctgggcttg ggagatgctc ccattgccaa 13501 tctgtttcag tctgtagaag aaatcggctt aaaagtttta cgctcctctg ttcccattaa 13561 aggtttcttt ggtctaagtg cttgtagtga tattgaaggt gcttttgttc tggtaaatac 13621 ccataacatc actattgaac gtcaattgtt tacccttgca catgaaattg cacaccttat 13681 cttccatcgt gtagagtacc aagacaccct gattgaagaa ggaaccaaag aagaagaaaa 13741 agcacgagaa aaagtagctg attactttgc tagtcatcta ctagttcccc aagctgaatt 13801 tgaacggatg tatgcactta cccaagatat tgtcaaactt aaacggcatt ttagggtgag 13861 ctacctagtc 
atcttgaatc gtttagcaga aatgaaaatc attgattttg ctaaagagaa 13921 agccaaaatc tgtgcaattt ataaaaagca gcatgatggt gcatctttgc aaaactcaat 13981 ggaattacca ccagcactag ctgctgcgga ttatccagaa aatgaacgtt atgaatttct 14041 aatttggcag tccttaaaat cgggcaatat ttcagagatg aaagcagcag aacttctcaa 14101 cttaactgtt gaaaaactgc gggtgcgtcg tcaagaaaat gaggtttatg cagtcgctta 14161 agggaataat tctcgacgca accgcactga ttgattttcg ctggctgaac gagtgggtgt 14221 ggctacaacg gtactacagc ccgttgtaca ttgcccagga gcttctagac tcagatcaac 14281 tggaaccccc aactcgccaa gctgctaacc aatacttaac gcctctagct ctttccacag 14341 aagagatgtt tgccagtttt ctagaattta gcgttagagc gccccttttg agtgttgcgg 14401 atcgatctac aattgctatt gcccgtcatc aattgctgat ttgcgctagt gatgatgggc 14461 tagttgttga aacctgcaag gcatacggtg ttgcctacac cagaacactg cgattattaa 14521 ctgagatggt agagacagca cacaaaacag tgatagaggt gacggaaatg gctgattcat 14581 taatcaacga gcggggtaaa cacatttctc ccaaagtttt gacagattgg acaacaagtt 14641 tacagaaata tagcacgagc taaaaaatta aatgtggtta aactaagtgc tttagatgtt 14701 atcgtcttct tttcggagag atgctaggag gagtgccgta tagtaattca ccattctcga 14761 cgacatattt taagggaatt tttgggttgt taatatcctt ctgtccattg gtgtgccact 14821 tgatttccca aacagtataa aactgttcca attttgctgg tttgagacga taaattgctt 14881 gtctttcaat atttttataa atggcaaaaa tccaatctac tttgcgatat ttgttaataa 14941 taatagggtt catgtgatgg tgggttgaga accctttcgt cagttcaata ttcattgatt 15001 tcaattcgta ttcatttcca tccagatcga tagcatcgtt cccctctcgc cccggaagga 15061 cagtcaaacc aagtatcagg agaacttgca ataatttgcc accattatcc tgaaaaatgt 15121 cgttaatacc gtgcttcgat gctaattctt ggtactgctg tatggcagga aacagtatct 15181 ccagcgtttc tagatcagga tgtggtttta gcactccttg tccctcccta gcaaattcac 15241 tgctttcaaa cctaatgctg tagcaatccg cttaatgtta tctatcgata tgttcctttc 15301 atctcgctcc actgagccaa tgtaagtacg gcgtaggcca accagttctg ccagcctctg 15361 ctgtgagatg ccacgagcct gccgtgcttt tcgtagattg tccgcaaata acttcctcat 15421 tttggtaatt gttttttttc ggttatatac cggaattgct ctgacgactt tacgtctaca 15481 gattataagt atcatttgtg atgaacccaa ttttgcactt acaagacata aaccctgctt 
15541 atttcaccaa caatggagct agctactgtg gtgactcccg tgaacttttg caaaaattac 15601 cagatagtag tgttaattta gttatcacca gtccaccctt tgcccttcag cgtaagaaag 15661 aatacggcaa caaagaacag cacgagtatg tggactggct taccgagttt gccgcactcg 15721 tttacaaaaa actgcgtgac gatggtagtt ttgtagtgga tttgggtggg gcttatgaaa 15781 aaggggttcc tgtccgcagc ctgtacaact accgagtgcc tatacgcttt tgcgatgaca 15841 ttggtttttt gttagctgaa gacttttatt ggtacaaccc gtctaaactg cctagcccta 15901 ttgagtgggt taacaaacgc aaactgcggg ctaaagattc ggtaaatact gtatggtggt 15961 tcagtaaaac cgagtttccc aaggctgacg tgaccaaagt ccttgcaccc tacagcgatc 16021 gcatgaagaa attgctggaa gatccagaca agttttacaa accaaaagta cgaccttctg 16081 gtcatgatat tagtaaagcc ttcgccaaag ataatggtgg agctatcccg tctaacctgt 16141 tgcaaatttc taatactgaa tctaacggtc aatatatgga cggttgtaaa gcagtttcgg 16201 ttaagcaaca ccccgctagg tttcctgcca aacttcctga attctttatc cgtttcctga 16261 cagatccagg tgatttggta gtagatattt ttgctggttc taacactact ggtcaggtgg 16321 cacaaactga aaatcgtctg tggttggctt ttgaagaaca gccagaatat ctggctgctt 16381 ctgcgttcag atttttaacc aaagaaaata ccactttgga gatgcaggaa atttataatc 16441 tgatttgtgc aggaaaatca gtcgatttga acacttatca gaagcaaacc cttctcagtc 16501 ttttttaact accaccagta cagggttaat tcgccgttcg atagcgtgtt gtgggcatga 16561 cgcaacttac gggtttattc gctcaggtgc attccggtga gcctctcacg cttcgctttg 16621 ggcgcaagca acgctaatta agaaacatat ttgatttaaa tctctaaaag ccttgctgca 16681 tattgaaaaa tctgccagcc tccggaaaaa tcgcccatgt ataaatactc atttttttag 16741 aaaaatccac gtttggggca tcttagcgcc cagatcgttg gcgctaaacc aaccaattag 16801 taatgaggca ccgcagtttt tacaggtgaa atgaagcggc agtctcgtgg catcaggatt 16861 caaatctcat caagtacaaa ctaaaagctc taagcctttg agtatagccc atgcgactgt 16921 gctcacacag caaatgcaat cattttttag tgagatagaa gacccgcgtg taccaagaac 16981 ccgtgcccat ttgttaacag atattttaat aattggaata ttttcagcga tcgctggagg 17041 taaggggtgg gaggacatgg aaaactatgg gttgagcaaa catgactggt taaaggaatt 17101 tttagctcta cctaatggta ttccttgtcc agatacattt cgacgagtgt ttgaacgtat 17161 taacccaaaa gcatttgagc gatgctttcg tcgagcttgg 
gtgcaatcgg tagttgagac 17221 agtaggggca caagtagttt ccattgatgg taagacttta aaaggttcct acaatagaga 17281 acagggaaaa tcggctttac atttggtaag cgcatgggca agcgaacatc gtcttgtgct 17341 gggacaagtt aaggtgacag ataaatctaa cgaaatcacg gcaattcctg cattactaga 17401 attgcttgac ttggcaggat gcatcattac cattgatgca atgggtacac aaacagcgat 17461 agcgaagcgc tgctgcgaag cgcagatcgc tactcaaata tataatgcca aagctgatta 17521 cgtcttagca cttaaagcca accatccaac acttcatggg cagatcaaaa cttggtttga 17581 tcaagcggct gcggatcaat ttcaaggcat aaccgtaagc tacgacgaac ggatagaaaa 17641 agggcatcat cgcactgaaa agcgacaagt ttggagcgtt ccggtttctc aactgccacc 17701 tctacataat caagacgatt gggtgggact tcaaaccgtt gtcatggtgg tgcgagtaag 17761 acatctgtgg aataaaacta cgcgcgaagt tcagttttat ttaactagtt tggagagtga 17821 tgcttgtaaa ctggggcagg ctatccgcct ccattggggc gttgaaaatg gattgcattg 17881 gactctagat atgactttta gcgaagatgc ttgccgtgtt cgtacaggac atgcgccgca 17941 aaaccttgcc ttactacgac gcattgctct taatggtttg aatcgagaac aatctttgaa 18001 acgcagtaat cgccaaaaat cgaatcgagc cgcaatggat aataattata tgcttactat 18061 tcttgctgct tgtttatccc aacataatga tacctctaaa cccgcttgtc aatagcattt 18121 gagacgcgct taccctggac taaaactgtc ttcacctttc ttcttagctt tataggaaaa 18181 tataagcttt ttccttttct tttcattaac taaagatgta ctagtcttca aatttaccat 18241 tccaataata aaagaataca atagaaacaa gagattagat tcagcctatc gttacctatt 18301 gctgaacata tatcgtcgga catagagcat gtcaaagcat tcaggaacta tttcacaata 18361 gttgcgtcat atatatcctt cgttactttt cataagagca gatgaacggt ataagaatta 18421 aagaggaaca attgcatctt agttagcgtc gcacaagtca cttttcactc ggaatatgcc 18481 tactgatagg tttaaaaaga gccagaaaca atctcagtaa cagcacgtct cttacattca 18541 gaaaaaaaca aaaggacttt actatgagtc aatttcatcg ttggaaatct ggagcagctg 18601 gagttatggc aatggcaata acaacaggca ccattgcacc catgttcatg tttgctcctg 18661 cttctgcaca aagcatcttt cgtggtcaag gaagccaagc accaacagga agagtttcaa 18721 ttcctgctgg atttacacta ccaataacat acgacaaaga taaaattgtt gtcactcctg 18781 atgaaacgac acgtataaag ctgaaagtag caaaaaacct tgtagacagt tcacgaaatg 18841 tgttggttcc tgaaggaagt 
gaaattgaag gacaattgca accaatcacc agaaatggtg 18901 aaaaagggat ttattttgtc gctcaagatt taatacttcc taatggtgaa cgacaatcta 18961 ttaatgcaac ttctcgtgtc attaccagaa aggaaaaaat tagtaaagga ggtagaacag 19021 ataaaatcat ccaagatgct gctattggtg caggtgcagc tagtgtgatt tcattgataa 19081 caggcgatcg caaaattcaa gcgctcgaac ctatacttgg tgcaggtgct ggtgcagcag 19141 caagtgttct actcagacgt aaaaacactg aagttatcgt tattgaaccc caaagaggag 19201 atttggatct gactctacgc tctagtttgt ttgtatcccg tggctactat taggcttatc 19261 ctaattattt aggtcgtgag ttccatctgt ttgcagtata actgctccaa aggggcaagg 19321 tgctttaccc aattgccctc acactaccaa ggggctgcca gaagagcaac ccctcctctt 19381 tcacagaaag taagacaaag taagtcagca attttaaagt taaatccgga agaaaacaac 19441 ctgctagtaa atggtctagc ttctagctgt atgcaattta aatgcgacac agatgaggat 19501 attgcgtgtg attgggcaaa gccgcgatat cgcttattct aatttatgta aaatttgtca 19561 tttttctgca actattttat cgctcgcacg tcttacgact ccatataaat caggagcatc 19621 ccaattttcc aaaagtcttc gttcgccacg ttcaaaccag ggtgcccaga gggcataagt 19681 aacgtattag agcaaagcaa tcatccttcc tgtgaattta cacatttggg atgcttcgga 19741 gaaatcgtct tggcgaaaca tttaggaaaa taaataaact aatacgcatc taccgttgcg 19801 tatttatttg ttaagacagt catgaatcat ttgttattac aaatgactca tgactaatga 19861 ctcatgacaa acgagaagcg gcacgaatta gtgtgcaaat tttaggcgcg actcgttatg 19921 aaccaataca gttcagttag tggtatttcg ttttgtaaaa tcccccaaca gagttaaaaa 19981 gcttgctcag ttccctcctt tttaaggaga gccagcgcgt tgcggaggtt ccctccgttg 20041 tagcgactgg cgtgggctag ggaggagcaa tccttaactg aaccgtattg cgttatgaac 20101 tgaataaaat cccaagagtg aaacaaaaaa gaaagatgaa aaatgaagtt attcacactt 20161 cattctttat cctttaattg actcttttgg ctgaacgaaa gtattaacca aaacaaagtg 20221 ttttggaacc tcttcagtca aaggtagtct attttactaa cagaaaatag caggttgtgt 20281 ggggagcaag aataaaatgt ttggtcactt tcgttggcaa tctggatgtg ctgcattcat 20341 gattttgggt atcacaacag gaactatcgc acctctggtg atacccacag catcttttgc 20401 ccaaacaagt tttattgacg ttcaatctaa ctattgggca gcagagttta ttcgagaatt 20461 ggcacagcgg ggtattgttg ctggctttcc agatggcagt ttccggccag aacaagcagt 20521 
cacacgtgct caatttgctg ctatgattgg caaagctttc cgaaaagcgc cagaacggca 20581 ggcagtcaga tttgacgatg taccaagcag ctactgggca tcgagtgcga ttcaggaggc 20641 atataccact ggttttttgt ctggatatcc tggcaatcgt ttcgagccta accaaaatat 20701 tcctcgcgaa caggttttag tttctctgtc tagcggttta gactacaaag ttagcggtaa 20761 taccgacacc attctacaat cttacgacga caccaataac atttctggct atgcccgtag 20821 ccctgttgct gcagctactg aaaaacaaat tgtcgtaaac taccctaata ttaagttctt 20881 aaatccgaaa gtaactacaa ctcgggcaca agttgcagct tttatctacc aagcattagt 20941 gagttccaat caagcctcag caattaactc accctatatt gtggctcttg ctgagcgtac 21001 tccatcaaaa ccagtagcgg tgacaattcc tgaaggaact gttattccca taaggtatga 21061 aaaagcagaa aaaattcttg tgactaagga ggaaatagtt cctttaaccc tcaccgtagc 21121 gcaaaacgtt gtcactgata aaggaacctt ggtcattcca gcgggcagtc aagtggttgg 21181 tgagctgcgt cccgtcaaag acaagaacgg ttctcaattc gtcgcccaaa aactcgtttt 21241 aaccaacaat ggtcaagagt atccaatcaa tgcgacaact gacgtgatta ccaagaccga 21301 aaccgtcaaa aagggcatca atacaaaaac aattcttagg aataccgcac ttggtgcagg 21361 tgcagcagca gcagtttcaa ttgtgactgg tgataaggct attagtgctg gagaagttct 21421 aggcggggct gcgattggtg gattggttgg tctgtttttt ggtaagaaga gtgtggactt 21481 aatcgccatt gacccaaaga ccgacctaca aatgactatt ggtgagaact ttcaggtttc 21541 gttgaaatag ttagtagtca agattgacca cgtaataaca tcagtttaga atcctggttt 21601 ttcaaaacca ggaaagttca attagaattc ttgcgcattc attgcaggaa gacttcaaat 21661 ttagtactca cgatgagcaa caggtaacca aataataaaa gtcgttcctg gtgtatcagg 21721 tgatgatcta agcgaagtga gagcagggct gaaaacttgt atttcgccct gcatttgttc 21781 tattagctgt ttggcaattg ataatcccaa tcccgtacca ggaatttccg tttgagcttg 21841 aacacctcgg taatgacgct ctccaagatg ctctaaatct tgttgtggaa taccaggacc 21901 agtgtcactg atggcaattc cttgaaaatg aggtttttct tgtctggtct gaattaaaat 21961 cttgccgcca gtgggagtgt atttcaaagc attgtcaatg atattgctga gtacttctcg 22021 taatgctttg gtgttagcgc gtaccagagg taagttaggg ggaatttcgg atatgagttg 22081 caaatttctt tcttgagcga tggctctggc tgatactagc aatggttgca atatatctag 22141 tacagaacaa ttaacttctt tctctcctgc accgggcaac aagagtggcg 
ctataaagtc 22201 tttttgtacg tttgcttcca caaggacttc tttttgcggt agcgttaatg gttctaaatc 22261 ttctgttgtc aagtcaataa cttcatccaa gtgttgcaac aattctttga ggcgatcgct 22321 ttcccgcaca atactagtag ctacctcgcg gttagcatcc cctggtcgca gtcgcttcag 22381 caacagtttg ccaaaggtac gcaatgctgt taatgggttg cgaagctggt gcaataaatt 22441 atctagtaaa tcttgctgtt tttcttgaaa aatttgttgt tgctgcaatt gctggtgcaa 22501 ccatgctcga cgctgatcca aaatacaggc aagggatagt gtttgggcaa ttcgttctat 22561 ttgacttttt tcttgttcat tccatactcg gtcttctcta actgtgacga gcaaccccat 22621 catcacatct tcatgaacaa gaggtaaaac aatttggtcg ccctttaata aatattcttc 22681 ttcctctgtt gaatgcggtg gtgctgtatc tcctgtcgca gtttctcgtg atggtgcggg 22741 aaagtcctct gctgctttca gtaaccttcg gttattgcgg ttgtgtttct ggttgaacac 22801 atcacccgta cgagaatgta gtgttttatc caaggtgacg gaagctgaac gatacgaagc 22861 ccagttagga tttgagatgg gtattgaggt gggcaatacc aaagcactag ctccctgctg 22921 ttccactgtt gtttctggat aaacgaccac aggaatgagc ttggcttcac ttgtcgaagc 22981 ctctatcaat tcttgcgtta aataaacaac gctcaaagat gcttccagcc cttgggctaa 23041 cagtgccaat tgctctcggc atagagcaac aaactcagaa ctggcagaca ttaacatttc 23101 tcagcccttg tttaaaatga cagactatac ggacgaaaag gaacaaaagt tggcttttcc 23161 catttaacaa agcttaactg aatcttgcaa ctaacagttc gttccaagga attatattgt 23221 tcaacgcttc tggtttttat acaaaaaaat aaattaaccg gattgtatca aaagattaaa 23281 aactattgat t // LOCUS NODE_1344_length_23187_cov_4.51262323187 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 23187) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 23187) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..23187 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 110..463 /locus_tag="DP116_12105" CDS 110..463 /locus_tag="DP116_12105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493431.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rhodanese-like domain-containing protein" /protein_id="PRJNA477356:DP116_12105" /translation="MRMTNLLLGDFTLQTDAHELKSRLQWGQPAFTIIDVRERQTYNQ GHISGAIPLPVNELVKRATVSLAKDRDIYIYGDSDEQSASAAKTLRDAGFARVSELTG GFSAWKAAGGSTEEI" gene complement(551..1594) /locus_tag="DP116_12110" CDS complement(551..1594) /locus_tag="DP116_12110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenosine deaminase" /protein_id="PRJNA477356:DP116_12110" /translation="MTLYAELHRHLGGSVVPRILWRYFERHSPELISGFADYSAFEDF YTRPRNTLDEYLQLHTLVESVQTVETLPYFIYRLLRGAYIFENLAYLELRYTPYLRTP DHLSQSQRIDMMAEIVEVVGKASKLPEYPIVTSQILCMHSRLPFEVNKAIVDLAAQSR QYVCAIDVAGGDSYSAERINEWIRLYDYARSLGLNTTGHLYETTAGCDPQLLPYLMRI GHGIQIPLLYPELLKDLALRNQCLEVCPTTYIKTGTLQDIRQLKLVFERCFDAGVDIA ICTDNAGLHNVRLPFEYENLLTYDIINFEQLQACQDAAFRHAFAWPHRQRPASLLNGL LNPQPTKALAMME" gene 1899..4187 /locus_tag="DP116_12115" CDS 1899..4187 /locus_tag="DP116_12115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317980.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transmembrane sensor domain-containing protein" /protein_id="PRJNA477356:DP116_12115" /translation="MAKLVVLKLDGDLEHQGFRVTLEIGSESAPPEIEISGKLPPCPD LATYLCQWQHKYRSLGMPSRIKPQEIIYDGSITTRISECRQKAKVLINCLQRWFDSQE FRCINNTLREELKRNEAIRVLIRTDNRELQRIPWHLWDFFDRYSLAEFALTTTASQTP PKPPRKPRVRVLAILGNSSGIDVQIDRQLLENLPLSETVFLVEPQRKELNNYLWHQWD ILFFAGHSETAENTGRIYINKTDSLTIEELEFGLKRAIAQGLQLAIFNSCDGLGLAWQ LASLNIPQMIVMREPVPDEVAQEFLKHFLKAFSGGKSLYLASREAREKLQGLEDKFPN ATWLPVLIQNPRVVPPTWKELSTQKSDSLRQILTVLMASVLVTSIVMGVRYFGMLQPW ELQAYDHLMQLRPAYESSDPRLVIVTVDEADIRYQNQKGMKRTGSLSDKALTQLLQKL EQYQPTTIGIDIYRDFSVDSSYPDLITRLKQDNRIFAVCKVSADADDAPDGIHSPDEV PTERQSFSDFVVDDDEIPRRQLLYLTPSVKSPCTAENALSLQLARHYLLSKGIKADIT PSGDLKIGNVLFKQLKEHTSGYQAVDASGYQVLLNYRSLRPQENIAETVSLRDILNNN SQINSELLKNRIVLISVTAPSDTDYWKTPSHNKKIPGVFVQAQMISHILSAVLDHKPL LWWWSWWVEVLWVWIWSLVGGMIARRYSKPEYLGFAVAAALLLLFGICFGIFTQLGWI PLIPPALTLILTAIAVVWKIRR" gene 4253..5035 /locus_tag="DP116_12120" CDS 4253..5035 /locus_tag="DP116_12120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12120" /translation="MKLAGVIPVILLCLASYSPQVQAQVAQPVKNKIANTQNRLEKLN FPRSGAPVGRRQGGARRNGCPDLKQPVTALVPGEKTVNDEAISFLALTVSEYPTFWVY LPDLPTNLRSGEFVFQDERGKNIYKTPLKLPDRSGIIGITLPQNPQYALKQDNKYQWY FKVYCGNPENTSDYYYVKAWVQKVALTPNLESQLKAAKPGEYTVYAVNQIWQDAVTNL ADMRRTNSGSSVLARDWNDLLTAVGLEEFATAPIVGRYGLER" gene 5350..7173 /locus_tag="DP116_12125" CDS 5350..7173 /locus_tag="DP116_12125" /inference="COORDINATES: protein motif:HMM:PF05860.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_12125" /translation="MTNFSGSKAQGNANLFLINPKGIIFGENARLDIGGSFLATTANL IQFPGGGEFSMTSPVNPQNPLLTVNPSAFLFNQIASEPISSIQVNEARLSVKDSQSLL LVGGDVKLDRGRLRATSGRIELGGLAGVGTVGLNVDNNNLRLSFPVGVQRADVSLSNQ ATVNAPDSAGSIQVQGRRVTLTDNSEIRIINTFGAQPAGTLAVTASESVELLGGSRLL TGTEGTGEDAGNLRIETGRLIVQDGAQVSASTRSQGRGGTLSVTAKESTQLIGTSIDG IESGLFVRTTATGDAGNLEIETGQLIVQDGARISASTAQESTGQAGSITLKTGELNLQ DGAQVTVSSAGKGNAGNLEIISGSIKLDNSGKLLGFTNFGEGGNINLQVRDLISMRRQ SQISAEARNNAKGGNITINAKDGFIVAVPGENSDIIANANQGRGGNINITTSGIYGLE SRPTLTEFSDINASSQAGPQFNGTIEINTPDVDLNSDLVNLPSVPIDTKLAQGCNSPN YAKSSFIYTGRGGLPPNPKDILTPDAVQVDWVTLNPNIDSHNSPSVSTSTNPTPEPIV EATGWVFNAKGEVVFTASAPTTTPRSSWQTPTDCGRSKPNP" gene complement(7188..8072) /locus_tag="DP116_12130" CDS complement(7188..8072) /locus_tag="DP116_12130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861950.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12130" /translation="MTRFIYDQFSKDYLEELLKPYGEVQAPKRVAGEVREIDVFFAPS SQQNSNLETLGLLGRFAATPAIFEPFRNAASKDEMCDCLLKLLEIRGELQREANRNKT SIAESAIPKLWILTPTASTTLLSGFGATQRVDWVSGVHFMAEYLRTAIVAIHQLPRTP ETLWLRVLGRGTVQNQAIDEFVALPANHPFRKAALELLYNLQKNLLVAQNPEEDDREL VMRLAPLYQQDREQAIQEGEQRLIIRQLNRRFGEIDSSLIERVRELSIEQLEALGEAL LDFSALTDLEAWLTQQEG" gene complement(8140..10772) /locus_tag="DP116_12135" /pseudo CDS complement(8140..10772) /locus_tag="DP116_12135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318553.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 9306..9315 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(10727..12219) /locus_tag="DP116_12140" /pseudo CDS complement(10727..12219) /locus_tag="DP116_12140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198098.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ShlB/FhaC/HecB family hemolysin secretion/activation protein" assembly_gap 11220..11229 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(12512..15724) /locus_tag="DP116_12145" CDS complement(12512..15724) /locus_tag="DP116_12145" /inference="COORDINATES: protein motif:HMM:PF05860.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-layer family protein" /protein_id="PRJNA477356:DP116_12145" /translation="MWFTQKRVQLQSSAGGCRLLRMNALVCRFFLKLGIAPWCALSAA VLLPLAFSADSTLAQELKPVADKTLGAESSVVTPTSPQADRIDGGAIRGANLFHSFSQ FNIGEGREAYFANPNGIENILSRVTGSDASRIFGKLGVLGNANLFLINPNGIIFGKNA SLDLRGSFVASTASSLKFADGTEFSAKGAQSKPLLTVSVPLGLQFGTDSGKILVQGSG NSLSQDPQINASGIVIGTTLNRSTRPVGLQVPPDKTLALVGGDVMLEGGNLTATSGRI ELGSIAGTNFVSLNPIDKGWALGYQGVQNFGNIQLSQAASVDVSGTGGGDIQVRGRRI TLTDGSVIAASTLGAKPGGTLTVAADDAMELTGTDSNYQISSSLLTETLNSGSAGDIT ISTDKLIVRDGAQVASTSISSGAAGKLNVSAPKSVELVGTGNVVINAVGSLEIPSTLA STPTGTGRGGNVTIATNKLSAISGAQIGSVTFGTGDAGNLVVNAKESVELIGISADGQ FPSALGTSVQLGATGAGGSLTLETGRLSVRDGAQIQTTTFGIGDAGNSVVNAKQSVEL IGISVNGQGFSALSTSTQPGATGASGNLTLETGRLSVRDGAVIQTTTFGTGDAGNLQV NAKESVELIGRTPNGRFSSSLATSTERGATGSGGNLTLETGRLSVRDGAVIQAATFGT GAAGNLVVNAKESVELIGTSANGQTSSGLFTSVQPGATGSGGNLTLETGRLSIRDGAQ IGSSTLGSGSAGILIVNAKELVELIGISANGQRPSLLTARTTSSGNASNINIETGKLT VRDGARVSVSSEGSGSAGNLSVQARSIQLDNKGAIIATTRSGNGGDITLQTQDLLLLR 
RDSQISTTAGTEGAGGNGGNITINTPNGFILAVPRENSDITANAFTGTGGRVDINSFG IYGIQPRQKPTSLSDITASSEFGVNGTVQLNTPDIDLNSALINLPSVPVDTKLAQGCN SPNYAKSSFIITGRGGLPPNPKDILTPDAVQVDWVTLNPNLEKNSSTNISTNPNPTTP APIVEATGWVFGPKGEVIFTAQAPTQHQSSWQTPPKCHEK" gene complement(15911..16315) /locus_tag="DP116_12150" CDS complement(15911..16315) /locus_tag="DP116_12150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015079300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_12150" /translation="MKKRVTLTFPKRVIQMPVTYRLARDFNVAANIIRAQVAPNQIGK LVVELSGDIDELDAAIEWMRSQNITVSQALGEIVIDEDICVHCGLCTGVCPTQALNLD PQSYKLTFTRSRCIVCEQCIPTCPVQAISTNL" gene 16607..17197 /locus_tag="DP116_12155" CDS 16607..17197 /locus_tag="DP116_12155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874984.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thiol:disulfide interchange protein" /protein_id="PRJNA477356:DP116_12155" /translation="MSNDTPVNSSETSESKVGTRLRNFLIAIVAIALSVALILGLRTE TTSATLTQLDKQSTPFEVALTNGKPSFVEFYADWCTVCQKMVPDVAQLKQQYADKVNF VMLNVDNTKWLPEMLQYRVDGIPHFVYLNQKGEAIAEAIGDQPRTILSSNLEALLTAS PLPYAQANGRVSQFNAPVAPIDTQEDPRSHGAQVVN" gene complement(17474..17710) /locus_tag="DP116_12160" CDS complement(17474..17710) /locus_tag="DP116_12160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12160" /translation="MSEQKPLSPWKYKPWWCQPWSILLTGVTLISGSWVLFRTVWLTI LVCVPVLTWMGFFLIIWPQLMIRSGILESYEDQV" gene 17966..18190 /locus_tag="DP116_12165" CDS 17966..18190 /locus_tag="DP116_12165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006199192.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB/MazE/SpoVT family DNA-binding domain-containing protein" /protein_id="PRJNA477356:DP116_12165" /translation="MYTLKVCRIGNSLGTTLPEEILQKLRVDEGDTIFVTETADGVYL TTSNPDFDKAMEAYNKVSTKYRNALEELAK" gene complement(18281..19246) /locus_tag="DP116_12170" CDS complement(18281..19246) /locus_tag="DP116_12170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010855179.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_12170" /translation="MNYVDKINHVLKPAFSRSPLVVDLFAGCGGLSVGFEAQGFATHG FEMDADSCVTYRKNLKGNCTQVILTPETELPSAKVLIGGPPCQPFSVGGKQKGLQDSR DGFPIFISAVKRLCPEIWLFENVRGLLYKNKWYLDEIVQTFQDLGFIVEWRLLNAVDF GVPQNRERLIVVGHKGNFKFPKLFEKKVSAGEALGELAFMTPEESKFLTPSMDEYVAK YEKASSCKHPRDLHLDKPARTLTCRNLAAATGDMHRIKLPCGRRRRLFLREAARLQSF PDWFEFVGTQTSCFNQVGNAVPPLFAFYLAGSVRDYLHIVGVDSP" gene complement(19297..20370) /gene="murG" /locus_tag="DP116_12175" CDS complement(19297..20370) /gene="murG" /locus_tag="DP116_12175" /EC_number="2.4.1.227" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873852.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase" /protein_id="PRJNA477356:DP116_12175" /translation="MVETPIRLLIAASGTGGHVFPAIALAEHLRDYQIEWLGVPNRLE TQLVPKQYRLNTIAVEGFQQGFGISSLRVLGKLIGSILEVRKLLRQGNFQGVFTTGGY IAGPAVIAARSLGLPVILHEANALPGKVTRFLGPWCSAVAVGFEAASQYLPRVKTIYT STPVRSQFLEEEIEASLDLPIPKDVPVIVVFGGSQGAVAINKLVRQSANAWFDAGAWV VHLTGDNDPEAGTFKHHQYISLPFYNNMAALLQRANLAITRSGAGSLTEMAVCGTPAI LIPYPFAAEDHQTYNAKVFTSVGAASLFKQSELTAEVLQSQVLDLLKHPEELRKMREA AKAIAVPDSAEKLAQLVREVVEK" gene 20455..21315 /locus_tag="DP116_12180" CDS 20455..21315 /locus_tag="DP116_12180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nuclear transport factor 2 family protein" /protein_id="PRJNA477356:DP116_12180" /translation="MPNFASFLHKRQTMLQPTRLLVYSLLTLSLLTSWKTAQASTPQH LVQAGEPLQPSSASQNAPSQVKNLLAQIDAAASQGNIKGVMQFYSPNFVHGDGLTRQN MEKALTAFWQRYPRLKYTTQVQSWQSEGNAIVAETLTNITGVPSTNSENSTFNATIRS RQRIAAGKIIRQDILSERTELTSGAKPPKVEFKLPQQVKVGQQFNLDAIVQEPLGDDY LLGTALEEPIKPEKLLTATPVDLELLSSGGIFKVGRAPAVPGSQWVSAVIMRGDGMVM ITQRIQVVKN" gene 21490..22263 /locus_tag="DP116_12185" CDS 21490..22263 /locus_tag="DP116_12185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994651.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_12185" /translation="MLSIQNQIVLITGASSGIGAACAKVFANAGAKLILAARRLERLQ ELADEVSKNSATDIHLVELDVRDRTAVESAISTLPPSWSEIDILINNAGLSRGLDKLH EGDIQDWEEMIDTNIKGLLYLTRYVVPGMVKRDRGYVVNIGSIAGHQTYPSGNVYCGT KAAVRAISEGLKQDLLGTPIRVSSVDPGMVETEFSDVRFHGDSDRANKVYQGVKPLTP DDVADVIFFCVTRPSHVNINEVVLMPVDQASATLVNRRS" gene 22300..22956 /locus_tag="DP116_12190" CDS 22300..22956 /locus_tag="DP116_12190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459166.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="steroid 5-alpha reductase" /protein_id="PRJNA477356:DP116_12190" /translation="MQNTANAEKTGVTVLTTINIAKVLTISCLVAFAVVFGIQDWRQV IYMCLHISYCLWWLIEQWFYPQRRQMFNDPVGVGLFVFILLSVGVFYALPGYLAFTNP VPLSMTTAAVALSLYIFGTLINATADIQKLTAKQYGAGLVNDNIWRFSRNVNYFGDLL RYLSFSVVAGSLWAYLLPAYILVFYLRLMSNKEQSMSQKYSEYPDYKQSSARLIPFIW " gene complement(23011..23133) /locus_tag="DP116_12195" /pseudo CDS complement(23011..23133) /locus_tag="DP116_12195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495178.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="serine hydrolase" BASE COUNT 6487 a 5143 c 5010 g 6527 t 20 others ORIGIN 1 gtaaagttct aaaaatttaa aaaataaccg agaaatctgt attaatttta aaaaaaacct 61 gaaaaaaatc caatttttgg tagatttcag gtgatgtgac aaatggaaaa tgcgtatgac 121 aaacttgtta ttaggagact tcaccctcca aaccgatgca catgagctaa agtctcgttt 181 gcagtgggga caaccagctt ttactattat tgatgtgcgt gaacgtcaaa cctacaatca 241 gggacacatt tcaggagcga ttccgcttcc cgtaaacgaa cttgtcaaaa gagcaacagt 301 ttctctagcg aaagatcgcg atatctatat ttatggtgac agtgacgagc aatcagcgag 361 tgctgcaaaa acgcttcgag atgctgggtt tgctcgtgta tcagaactga ctggtggttt 421 ctctgcttgg aaagcagctg gtggttccac agaagaaatt taacagttat cagttatcag 481 tgaacagtga acagtgaaca aaaaactgat aactaataac tgatttcggt caaactccca 541 tgagagttca tcactccatc atggctaggg ctttagtggg ttgcgggttt agcaacccat 601 tcaacaggga tgctgggcgt tgtctgtggg gccaagcaaa agcatgacgg aatgctgcat 661 cctgacaagc ttgtaactgt tcaaagttaa taatgtcgta agttaagagg ttctcgtact 721 caaaaggtag acgcacattg tgcaagccag cattatcagt acaaatagca atatctaccc 781 ctgcatcaaa gcaacgctca aatactaact tgagttgacg tatatcctgc aaagtaccag 841 ttttgatata ggttgtgggg caaacctcta aacactgatt tcgcaaagct aaatctttga 901 gcaattcggg atacagcaga ggaatttgga taccatgacc aatccgcatc agatacggta 961 aaagttgggg atcacatcca gcagtcgttt cataaaggtg tcctgtggtg ttgagtccaa 1021 gcgatcgcgc ataatcatac aaccttatcc attcgtttat gcgttctgca gagtaactat 1081 ctcccccagc tacatctatt gcgcaaacat actgtctgct ttgcgctgct aaatcaacaa 1141 tcgccttatt cacctcaaag ggtaagcgcg agtgcataca aagaatttgg ctggtaacaa 1201 tcggatattc tgggagtttg cttgctttac ccacaacttc cacaatttcc gccatcatgt 1261 caattctctg tgattgactc agatggtctg gtgttcgtag ataaggggta tagcgcaact 1321 ccagataagc caaattttca aaaatataag cacccctgag caagcggtag ataaaataag 1381 gcaaagtctc cacagtttgc acgctttcca ccaaggtatg caactgcaaa tactcatcta 1441 gagtgttgcg tggacgcgtg tagaaatctt caaatgctga atagtcagca aagccagaaa 1501 tcaactcggg cgagtggcgc tcaaagtacc gccataaaat acgaggtaca accgatccgc 1561 ccagatgcct atgtaattct gcgtataaag tcatagtcgt 
cttttaacaa aactagaaat 1621 tttaattatt attaacagta acaaattaaa aaacaagttg tgggtgcgta aaatttcata 1681 agactttctt atcctgcaag gcactgcgat ttatcttcaa aagaggactc aaaaagttat 1741 aggtataaaa gtttgctcat tggatagcgc ttctcgtttg aataaagtag agatcgcttc 1801 ttaagattta ctttataaat tacctctaag tgtaaaaagc tatcacgatt tgttagtgtt 1861 attttgacac ttcaaaatcc caagtctcaa atgggtctat ggcgaagtta gttgttctga 1921 aattagatgg tgatttagaa caccagggat ttcgggtgac gctagagatt ggctcagaat 1981 ctgctccccc agaaatagaa atctctggca aattaccacc atgtcccgac ttggctacct 2041 atctgtgcca gtggcaacat aagtatcgca gtttggggat gccaagtcgc atcaagccac 2101 aagaaattat ttacgatggt tctatcacaa cacgaatttc agaatgccgc caaaaagcta 2161 aagtattaat taactgctta caaagatggt ttgattctca ggaatttcgc tgtataaaca 2221 acacattacg cgaagaatta aaacgaaacg aagctattcg agtcttaatt cgcaccgata 2281 atcgtgaact acaaagaatt ccctggcatt tgtgggattt ctttgataga tactcgttag 2341 cagaatttgc cctaacgaca acagcgtcac aaacccctcc aaaaccaccg cgtaaaccga 2401 gagtcagagt tttggcgatt ctgggcaata gtagtggaat tgatgttcaa atagaccgtc 2461 agttgttaga aaatttacca ttatccgaaa cagttttttt agtagaaccg caacgcaaag 2521 agttaaacaa ttatctttgg caccagtggg atattctatt tttcgccgga catagtgaaa 2581 ctgctgagaa tacaggtcgg atttatatca ataaaacaga tagcttaaca atagaagagt 2641 tagagttcgg tttaaaaaga gcgatcgcac aaggtttaca gctagcaatt tttaactcct 2701 gtgatggttt aggattagca tggcagttag cttctttaaa tatcccgcaa atgattgtca 2761 tgcgggaacc cgtaccagat gaagtagcac aggaattttt aaagcatttt cttaaagctt 2821 tttcgggtgg taaatcttta tatctggctt cacgggaagc aagagaaaaa ttgcaaggct 2881 tggaggacaa atttcccaat gcgacttggt taccagtcct aatccaaaat cccagagttg 2941 tcccccctac atggaaagaa ttaagcactc aaaaaagcga ttcattacgt caaattctga 3001 ctgtgcttat ggcgagtgtg cttgtcacaa gcatagtcat gggagtgcgc tatttcggaa 3061 tgctacaacc gtgggaattg caagcttacg accatcttat gcaactgcga ccagcatatg 3121 aaagttcaga tccacgtcta gtgatagtta ctgttgatga agcagatatt cgctatcaga 3181 accagaaggg gatgaaaagg acaggatcgc tttcggacaa agcactgact caactgttgc 3241 aaaaacttga acagtatcaa ccaacaacta tcggtataga tatttatcgt 
gatttctctg 3301 tagattcgag ttatccagat ttaatcactc gcctaaagca ggataaccgt atttttgcgg 3361 tgtgcaaagt ttctgctgat gctgatgacg caccagatgg tatccattcc cctgacgaag 3421 ttccaacaga acgccagagt tttagtgatt ttgtggtaga tgacgatgaa atcccacgtc 3481 gccaattatt gtatttaact ccttctgtaa aatctccctg tacggcagag aatgctttga 3541 gccttcagct tgcacgtcac tatttgcttt ctaaaggtat taaagcagat atcaccccct 3601 ccggggactt aaaaattggc aacgttttgt ttaaacaatt aaaagaacac actagcggtt 3661 atcaagcagt cgatgcatca ggatatcaag tgctgctgaa ttatcgttcg ctgcgtcctc 3721 aagaaaacat tgctgagact gtatctctaa gagatatcct taataacaat agccaaatta 3781 actctgagtt attaaaaaat cgtattgtcc ttattagcgt tacagcccca agtgatactg 3841 attattggaa gactccttcc cacaacaaga aaataccagg agtgttcgtg caggcgcaga 3901 tgattagcca tatcctcagt gccgttcttg atcataaacc tttgttgtgg tggtggtctt 3961 ggtgggtgga agtactctgg gtttggatat ggtcgttggt gggaggaatg atagctaggc 4021 gctactcaaa gccagagtac cttggatttg cagtagcagc agcgctacta ttattatttg 4081 gaatttgttt tggcatcttt acacaattgg gatggatacc actgattccg cctgccttga 4141 cattaatatt aacagcgatc gctgtggtgt ggaaaatacg aagataaaat tccggaatac 4201 aggtattgcg tcttgatgtt atatagtagc atcggcatat ggatatttaa ttatgaaact 4261 tgctggcgtt atccctgtga tacttctgtg tcttgcgagt tactctccac aagtgcaagc 4321 acaagttgct caaccagtca agaataagat tgcgaataca caaaaccgcc ttgaaaaatt 4381 gaactttcca cgtagcggtg cacctgtcgg tcgccgtcaa ggaggggcaa gacgtaatgg 4441 ctgtccagac ctgaaacagc ccgtaaccgc tttagttcca ggtgagaaaa ctgtgaatga 4501 tgaagcgata tctttcttag cattgacagt ctccgagtat ccaacctttt gggtttatct 4561 tcccgactta cccacgaact tgcgttctgg agagtttgtg tttcaggacg agaggggtaa 4621 aaatatctac aagacacctt tgaagctgcc tgataggtct ggtatcattg gcattactct 4681 gccacaaaat ccgcaatatg ccctcaagca agacaataag tatcaatggt acttcaaagt 4741 ttattgtggc aacccggaaa atacatctga ttattattac gtaaaagcat gggtgcaaaa 4801 agttgcgctt accccgaatc ttgaaagtca gttaaaagcg gcaaaaccag gagaatatac 4861 agtatatgct gtgaatcaaa tttggcagga tgctgtcacg aatttagctg atatgcgtcg 4921 tactaattca ggttcgtcgg tgcttgcacg ggattggaat gatttgttaa cagctgttgg 
4981 cttggaagaa tttgccactg cgccgattgt tgggcgttat ggtttggaaa gataatcgtg 5041 attttgcgct gaggcgtaga taaatagagt gttataattt tgatagagaa aatgttatat 5101 aattaacctt ttgtaagtat tttgtcaatt agaaccaagc agttaacagg gcgctgactt 5161 aaccacaact tattaaagag ccatatttgg gatgaatcga tacgcgagtg tgttggtgtt 5221 ggaggttagt gttaaagaat ttgttagtgg tggggggagc gatcgcccaa gggggagggg 5281 caaagctgat cgcctcgttt tgtaccctaa accagatcac ctggaatgct agtcaagata 5341 aatagttgaa tcactaattt ctcagggagt aaggctcagg gtaacgctaa tctctttctc 5401 ataaatccca agggcatcat atttggagaa aatgcccgat tagatattgg cggttcattt 5461 ctagcaacaa cagcgaacct gatccagttt cctggcggtg gagaattctc tatgacttca 5521 cctgttaatc cgcaaaaccc cttgctgacg gtgaatcctt ctgcatttct attcaatcaa 5581 attgcatctg aacctattag ctcaattcaa gttaatgaag cacgtctatc agttaaagat 5641 agtcagagtt tattgctagt gggcggcgat gtgaagttgg atcgggggcg actacgggct 5701 acaagtggtc gaatcgaatt aggaggattg gcaggagtgg gaacagtagg gttaaacgtt 5761 gataacaaca atctccgcct aagtttccca gtgggagtcc aacgagcaga tgtatctctt 5821 tctaatcaag ctacagtaaa tgctcctgat agcgcaggta gtatccaagt gcaaggcagg 5881 cgcgttacgc tcaccgataa ttcagagatt agaattatta atacttttgg agcccaacca 5941 gcaggaacat tggcggtgac cgcatcagag tcagttgaac ttctgggagg aagccgcctg 6001 ctgactggaa ccgaaggtac tggagaagat gctggaaact taaggataga aactgggcgg 6061 ttgattgttc aagatggggc acaagtctca gcttccactc gtagccaagg acgaggagga 6121 actttgtccg tgaccgcaaa agaatctacc caactaattg gaacatcaat agacggcatt 6181 gaaagtggct tatttgttag gactacagca actggagatg ccggaaactt agaaatagaa 6241 actgggcagt taattgttca ggatggggca cggatttctg cttccactgc tcaagagtct 6301 actggacagg cgggaagcat cacacttaag actggagaat taaatcttca agatggcgct 6361 caagtaactg tgagcagtgc aggaaaaggc aatgcaggca acttagaaat tatttctggc 6421 tctatcaaac tggacaactc aggaaaacta ctaggtttta ccaacttcgg tgagggtggc 6481 aatataaatt tgcaggtgcg agatttaatc tcgatgcgcc gccaaagtca gatatctgcc 6541 gaagctcgta acaatgccaa gggcggcaat atcaccatca atgccaagga tggcttcatc 6601 gtcgccgtcc ctggtgaaaa cagtgacatt attgccaacg ccaaccaagg gaggggaggt 6661 
aacattaata ttaccacctc tggcatctat gggcttgagt cccgcccaac gctaacagaa 6721 tttagtgaca tcaacgccag ttctcaagca ggacctcagt ttaatggcac tatagaaatc 6781 aacacacccg atgtagacct caacagtgac ttagtaaact taccatcagt accaattgac 6841 accaaactcg cccaaggctg taactctccc aattacgcga aaagcagttt catttacact 6901 ggacgcggcg gcttaccgcc taacccaaaa gatattctca cacctgatgc cgtgcaagta 6961 gattgggtca ctctcaaccc aaatattgac agccacaaca gtccatctgt ttccacatca 7021 acaaacccca cgccagaacc catagttgaa gcgactgggt gggtgttcaa cgccaaaggc 7081 gaggtagtct tcacagcctc ggcacccacc actacgcctc gcagttcttg gcaaacgccc 7141 actgactgcg gaagatctaa accaaatcct taatattact ggttctttta tccttcttgt 7201 tgcgttaacc aggcttctaa atcagttaag gcggaaaaat ctaacaacgc ctctcctaac 7261 gcctccaatt gctcaatgga taattctcga actcgctcaa ttaatgatga atcaatctca 7321 ccaaaacggc gatttagttg acgtatgatt agacgttgtt ctccttcctg tattgcttgt 7381 tctctgtctt gctgataaag tggtgctaat ctcataacta attccctatc atcttcttct 7441 ggattttgag ctactaatag gtttttctgc aagttgtaca acaattctaa cgctgctttc 7501 cggaatggat gatttgctgg tagcgcaacg aattcatcaa ttgcctggtt ttgtaccgtt 7561 cctcgaccaa gaactctcaa ccacagtgtt tctggagttc ttggtaactg gtgaatggca 7621 actatggctg ttcgcaaata ctctgccata aagtgtactc cagacaccca atctactcgt 7681 tgtgttgcgc caaatccaga taacaatgtt gtggatgctg tgggtgtgag aatccataat 7741 tttgggatgg ctgattctgc aatggatgtt ttgtttctat tggcttctcg ttgtaattca 7801 cccctaattt ctaatagttt cagaagacaa tcacacattt cgtctttgga tgcagcattg 7861 cgaaacggtt caaagatagc aggtgtagcc gcaaatctcc ccaacaatcc cagtgtttct 7921 aaattagaat tctgttgtga tgaaggagca aaaaagacat caatttctct tacttctccc 7981 gccacgcgct tgggtgcttg tacttcccca taaggtttta gcaattcttc aagatagtct 8041 ttggaaaatt ggtcataaat aaatcgagtc attcaatttg attggaaaaa aattaagtaa 8101 tgaatattat tttaattcct tcaattacga attgtttcgt cataaccaat ttcccaacag 8161 cacataaggt gcccaaaaat aaggactatt ctccttttta aaaaccacaa gttgagcacg 8221 ctgaagtgct tcagctttac tcaccccatt atttaactgt cggtaaaact cactcatcaa 8281 atctgcggta gactggtcat ccaccgacca taaagaagcc agtgtactgc gtgcacccgc 8341 ccgcacagcc 
actcccgcta atcccaatgc tgcacgcttg tctcctgctg ctgttttaca 8401 agcactcagg acgagtaatt caatattgct agacctgcta gattcactca ctcgcagtaa 8461 gctatcaaat tccttcacct taagtagttt atcccaagca agaataaatg ttttttctgg 8521 atcagaacta aactcgccat gagtcgccaa gtggactacc gagaaaggat ctgattttag 8581 ctcgttttgc aagttgattt cagtaaactt ctgattcaac agttcttcac tcctgggtac 8641 ctcagacttg atttcttcta actcccgtgg cacattttgc aatgcaggaa acccaaagcg 8701 ttgttcgccc actccagcag ttaaagcatt taactgtact ttttgtaaag gtttcgggtc 8761 aaaaagttgc aaaccaggcg tgagagctaa agcatatttt tctatcagat atttctgctg 8821 ctgtttatca taaagaaccc ccatcgggat attctgcaat tcactatcga gtacaaacac 8881 taaagtttta atttcattct tggttaactc tggttccatt ggtcgaatta accaatcata 8941 tatttcctga gactgctcct gaacctgaac agtttgttcg acttgcaata agtctttccg 9001 caccgaagcg acagtccttt ctactttctc ctgagttata gccgttgtat atttgcgaag 9061 ttctttttgt ccaggcaacg tgagaataac ctcaatacgg tctggtaaaa taattggata 9121 aataatcgcc gcttttttgt cctgcccaag cgcagtgtca attgttaccc taggttccac 9181 acagggtgag cggaagaagt tgttcagttc cgcctgctga agcgattcaa tcacattccg 9241 ggcttgcgta agattgtatt gcatagtttc agcgttacct atgttttgca actgcaaatc 9301 caccannnnn nnnnnactaa atcccgacgc agagactcaa gggttttgaa tgctgaatcg 9361 tatgctgcag ttgctccttt tatatctcct ttggctttca gcaaacgtcc taactgccat 9421 tgccattgat aagcaatgtc gggcggctca agagtttgag ctataaatag ggcttgttgc 9481 gttaggtttt gtgcctcaga taactgccct gttttttcgt acaacccgcc aagagtccca 9541 agtgcataag attctgctcg ctggtctttt aagtttcgtg cttgctcaac agcattggac 9601 aacatctgag caatatccag ccatgaagga ttatctgcat tcgttttctg tcttaattgt 9661 gttatgcttt ggacatagtt aatctgggca tatattgacg ttcggctggc gggcaacttg 9721 gttatttgag gttgaatttg agacgccaaa gccaaagccg cagaaaattg ctgattttcc 9781 agaagtagac taagttgatt gagttgtgct tggatgcgcg tggtgggttg tgcgacaagg 9841 gcagcttgtt gatagtattt taatgcagct tgagtttctt tctcggcttg ggttgtattt 9901 cccagagtga gttgagtttt cccctgggca agagctgtgt tacctagact gagcaaagtc 9961 tcaccagtag ctgttgatga ttgcactcgc tgcgctactg ctaaactttc cttcaagact 10021 tccctagagg tgtttaaatc 
accgacaact cgcaagatat tgccgagact acgcaatcct 10081 gtcgctttga gaggagagtc tggaagcttg gtgagagatt gctgaacctc agttaaagtc 10141 ttgctcgcct gacggtaaag tcccaaagct tgcatcgctt gagctgtatt aatgcggtta 10201 cgagttaatg tcgcttcatc gcctatttgc ttgtaaattt ttgcggcttc ttgccatgta 10261 gaaagtgcag cttgagtttg tcctcttgcc aattgtaacc gcccttggac atctagagct 10321 tgggcgaaga ttacagaagc ctgttttgag ttgtcgccat tttttaagag attcaaactt 10381 tgggtaattg cactttcagc ttgagtccac tgtcccagtt gctgataaac tagggaaaga 10441 ttgctcaaag tcatcgcttg gttgagttta tcgccactgg ctttaaaggc agcagctgct 10501 tgttccaaaa ctttaactgc ttcagcaaat cttccagcat catacagtgc tttaccctgt 10561 tgcacagaat tttcactttg agccgtctga cttggtggag tctgggtagt agaattaaca 10621 gccaatacgg ctgtcattaa tggtgaaagt actgtgcaca tcaacgcagt cagcagtgcc 10681 aatccagcct gagtcagcca ctttctacgc actcgcccag acatcatcaa gggttgaatt 10741 gtatggaaaa atacagacca ttttcttgcc atgttctttc ccttgaatca acagatatta 10801 aaggaatacc ccagtcaaga cgagcagtaa aacgctctcc ctgtgaccag cgtaaaccaa 10861 gaccgacgga tgctaaagtg ttagggtttg gattgtctct atttgaactg ttccaaccaa 10921 caccgaaatc aacaaaaggg atgatttgta aaacgccatt tatctcgcgt acccggaaaa 10981 ttgggacttg aacttcagca gaagcaaaag cgccattatc tgtaagtaag aaatcttgac 11041 ggtagccgcg aacattatcc aaaccgccta agccaatctg ctctaaaggt aggagtgctc 11101 ttgatgccag ttgcatatct gcacgaagta atagtacggt gtcaggagcc aaaagacgca 11161 cccactgagc ttgtccttgc catgagaaaa agcggctatc gggagggtta ctattaattn 11221 nnnnnnnnna tctaatgcac ccagtccaat gttaaattga gagcgagcgg cgataacttc 11281 acggctgtta cgactcgtcc attcttggaa aaatcgcaat gctgatacgc gagtgcgtcc 11341 ctcagaatca gcgccaggcg aaagtaagtt agggggaact ccctcttgtt gaaacagccg 11401 ggaagaaatt tcactttctc gtcttgaggc ggtaaagcca agagcaaatt cttgcgtggg 11461 agtttggatg actggctgac ggaatgtgag ttcgtagtaa cgggaagctg attgtatatc 11521 tagttcgttg aaaggacgtt caatgatgtc gctggatgtc gtaccgtagt aaaaagagag 11581 agtgccattt tggggattga cgggaaacgt gtagctacca tcaaaggcgt tgctgccatc 11641 ggtgttggtg tatgctaaac ttagaccgtc tccaaatccg agtaagttag ctgagttaag 11701 
ttgtaatcgg cggcggaaac taccaacact gggcgatcgc ctattatcca aaacaatttg 11761 gctactgaaa gtttttgctt cttgtatctc gacttgtaat atagttgtac cgggacgtgt 11821 tccagcggtg agttcagcag acaggttttg aatcaaaggg ttaagttgca gcagttgcaa 11881 ggcttcaagc aagcgttctc tattgagagg tggtgatgtt gctcttgcta gacggctgcg 11941 tacataattg cgtctgagtc gcctggtacc tgtgacttgg atgtcttcta atttaccctc 12001 gattacctgg attttgacaa caccggactc cattgtttga gcagggagat atgctccaga 12061 agtgatgtaa ccttttttga tgtataattc tgtgataacg gtacgagcct gatacacttg 12121 ggtcagtgag ataggtttgt tagtaaaggg agcgagttct ttggctaact ctgcggggct 12181 aaatacggta ctgccaataa cttcaaatcg tttaacaatg atagtttcag gaaacttacc 12241 aggaattggt tcactcttgt tgggagttgg gcgagaaggg ggaagtaagt cttgtggtgg 12301 aggaagtggt tgtggtggtt taggtgaggg aaatggagaa ggtggaggtg gttgcacatc 12361 ctgcggtgaa gggagttgtt ggctttggaa agaattgccg tttggtagtt gtgctggagt 12421 cacaacttgt gcgtgtaggc gtgaatgagg gaagctcagc aatgcaacta tatttatata 12481 taaacagtag gttgtaaggt tgtctggaaa tttacttttc atggcacttt ggtggtgttt 12541 gccaagaact ttggtgttgt gttggggctt gtgctgtaaa tatgacttcg cccttgggac 12601 cgaataccca gccagttgct tctactattg gggctggtgt ggtaggattt ggattggtgg 12661 aaatatttgt acttgaattt ttttctagat tagggttaag tgttacccaa tctacctgca 12721 cggcgtcagg tgtgagaatg tctttggggt tgggaggtaa gccgccgcgt ccggtgataa 12781 tgaaactgct tttcgcgtag ttgggagagt tgcagccttg ggcaagtttg gtgtcaacgg 12841 ggactgaggg taggttgatt aaggcgctgt tgaggtcgat gtcgggtgtg tttagctgaa 12901 cagtgccgtt tacgccaaat tctgaactgg cggtgatgtc acttagagaa gttggttttt 12961 gacggggttg aatgccatag atgccaaagg agttaatgtc tactctacca ccagtgccag 13021 tgaaagcatt ggcggtgata tcgctgtttt ctcttgggac ggcgaggatg aagccgttag 13081 gggtgttgat ggtgatgtta ccgccgtttc cacctgctcc ttcagtgcct gctgtggtag 13141 atatctgact gtcgcggcgc agcagcaaca aatcctgtgt ttgcagtgta atatcgccgc 13201 catttcctga cctagttgtg gctatgatgg ctcctttgtt gtccaactgg atggagcgag 13261 cttggactga taggttgcct gccgatcctg aaccttcact gctcacagat actctagccc 13321 catcccggac agtcaacttt ccagtttcaa tgtttatgtt actggcatta 
ccagaactgg 13381 tggttcgagc cgtcaagagg ctggggcgct gaccattagc tgagatgcca atcagttcta 13441 ctaactcctt tgcgtttact attaaaatac ctgccgaacc cgaacccaac gtactagatc 13501 ctatctgtgc cccatcgcgg atgcttaagc gccccgtttc caaagttaaa ttgccgccag 13561 atcccgtggc tcctggctgg actgaagtaa acaagccgct ggaagtttga ccattagctg 13621 aagtgccaat cagttccact gactctttcg cgttcaccac caagttgccc gcagctcctg 13681 tgccaaacgt cgcagcttgt atcactgccc catcccggac gcttaagcgc cccgtctcca 13741 aggttaaatt gccgccagat cccgtggctc ctcgttcggt tgaagtcgct aagctactgg 13801 aaaatcgacc attgggtgtc ctgccaatca gttccactga ctccttcgcg ttcacctgca 13861 agttgcccgc atctcctgtg ccaaatgtcg tagtttgtat cactgcccca tcccggacgc 13921 ttaaccgtcc cgtttccaaa gttaaattgc cgctagcgcc cgtggctcct ggctgggttg 13981 aagtagacaa ggcgctgaaa ccctgaccat taactgagat gccaatcagt tccactgact 14041 gctttgcgtt caccaccgag ttgcccgcat ctcctatgcc aaatgtcgta gtttgtatct 14101 gtgccccatc ccggacgctt aaccgccccg tttccaaagt taaactgccg ccagcgcctg 14161 tggctcctag ctggactgaa gttcctaagg cgctgggaaa ttgaccatca gctgagatgc 14221 caatcagttc cactgactcc ttcgcgttca ctaccaagtt gcccgcatct cctgtgccaa 14281 acgtcacact tcctatctgt gccccactga tggcgcttag tttatttgtt gcaatagtta 14341 catttccacc cctcccagtt cctgttggtg tagatgccaa tgtgctagga atctccaaag 14401 aacctacagc attgattacc acatttcctg ttcccacaag ttctacggac tttggtgcgc 14461 tcacgttcaa ctttcccgct gcaccggagc taatgctagt agatgctacc tgtgctccat 14521 cccggacaat caacttgtcg gtagaaatcg ttatatctcc tgccgaaccg gagtttaaag 14581 tctcagtcaa caagctgctt gaaatttgat aatttgaatc agttcccgtt aattccatag 14641 catcgtcagc agctacagtt aaagtcccac ccggttttgc tcccaatgta gaagccgcaa 14701 tcaccgaacc atcagtaagt gtaatgcgtc tgcctcgtac ttgaatatca ccgccgcccg 14761 taccactaac atctacggaa gccgcttggg acagttggat attcccaaag ttttgcaccc 14821 cttgatatcc caatgcccag cccttgtcta ttgggttcaa acttacaaaa ttagtaccag 14881 ctatactccc caattcaatc cgtcccgatg tagccgtcag attaccaccc tctagcatta 14941 catcaccacc tacaagcgct aaagttttat ccggcggcac ttgtaaacca actggtctag 15001 tactcctatt caatgtcgta cctattacaa 
ttccacttgc atttatttgt ggatcttggc 15061 tcagggagtt acctgatcct tgcaccaaaa tctttcctga atccgtgcca aattgcaagc 15121 caaggggaac actcacagtc agcagaggtt tgctttgggc accctttgca ctaaactctg 15181 tgccatccgc aaacttcaaa ctactcgccg tactcgctac aaacgaaccc ctaagatcca 15241 aactggcatt tttcccaaaa ataatcccat tcggattaat caaaaacaga ttagcattac 15301 ccaaaacacc aagcttccca aaaatcctag atgcgtcact ccctgttacc cgactcagaa 15361 tattctcaat cccatttgga tttgcaaaat acgcttctcg tccttcaccg atattaaatt 15421 gtgaaaaact gtgaaacaga ttagcgccgc gaatcgctcc accatcaatt ctgtcagctt 15481 gtgggcttgt tggtgtcacc acagaacttt ctgctcccag agtcttatca gctacgggct 15541 tgagttcttg agcaagagta gaatctgcag aaaatgcgag tgggagcaac acggctgcgc 15601 taagcgcaca ccaaggagca atccctaact tcaagaagaa cctgcacaca agagcattca 15661 tccttaacaa gcgacacccc ccagcactgc tttgcagttg cactcttttt tgagtaaacc 15721 acattttccc ttcgtcaaag taatatctac tgacttctca gctctgacac aagttttatg 15781 caaattttac taaatattta caaaacttta taaatggagt aatcagaaaa accccgcgct 15841 gtcaaataac attgcggggc tggtgaatac taatcagact cgaaaaaacc ttttctgccc 15901 tgtgtcttct tcatagattg gttgaaattg cctgtacggg acaggtggga atacactgtt 15961 cacagacaat gcagcgcgat cgcgtaaacg tcaacttata actctgggga tcgagattga 16021 gggcttgtgt tgggcaaact cctgtacaca agccacagtg gacgcatatg tcttcatcaa 16081 tgacgatttc tcctaaagct tgggagacgg tgatgttttg cgatcgcatc cattcaatcg 16141 ctgcatccaa ttcatcaata tctcccgaga gttccaccac cagtttacca atttgatttg 16201 gtgcaacctg agcacgaata atatttgcag caacgttaaa atctcttgcc aatcggtaag 16261 tgactggcat ttgaatgaca cgtttgggga aagtcagtgt gactcgtttt ttcacagttt 16321 cattttgggg gtatcccaag cgtattttac cagttaacag ttatcagtta tcagttatca 16381 agtaccagcc gcagttatca agtaccagcc gcagttatca aggaagcgca gtaggaaacg 16441 gactcgtcta ccccttgttc actgttcact gttcactgtt cactgttcac tgttcactgt 16501 tcactgttta aaatccgtgg tttgatcaat cgtcaggaat gctaacagag atcacgagga 16561 ggttaaactg aagaaagtag tacttaataa gttttaataa attgttatga gtaacgatac 16621 acctgttaat tcttcagaaa catcagaatc caaggttggg acgcgcttga gaaatttttt 16681 aatcgcaatc 
gtggcgatcg ccctcagcgt tgccctaatt ttggggctga gaaccgagac 16741 aacctctgca actctgactc agttagataa gcagtctaca ccttttgaag ttgctttaac 16801 caacggtaag ccatcgtttg tagagtttta tgctgattgg tgtactgtct gccaaaaaat 16861 ggtaccagat gtcgcccaac tgaaacagca gtatgctgac aaagtgaatt ttgtcatgct 16921 gaatgtggat aataccaagt ggctaccaga gatgttgcaa tatcgggtgg atggtattcc 16981 tcattttgtg tatctgaatc agaaagggga ggcgatcgca gaagctattg gtgatcaacc 17041 tcgtaccatt ttgtctagta atctagaagc tttgcttact gcttcccctt tgccttatgc 17101 tcaagcaaac ggcagagttt cgcaatttaa tgcaccagtc gcaccaatag atactcaaga 17161 agatccccgc agtcatggcg ctcaagttgt gaattaaaat acttgcaaat agccacactc 17221 agatcttgca cctccacctg agtatgcctt gaactgaagt accctacggt tcgccgtatc 17281 ttgtgggcgt ctacaaggct aatacctcaa gtccgttaaa acggacttaa atacttaccc 17341 agtccgtttt aacggactta aattttgagc caagaaattt atttcttggt ggacgaaaat 17401 tatggtgcaa gatctgagca cagcttagaa gttgtggcta cacctactaa gctcgcgtga 17461 ccaagtttta tattcaaacc tgatcctcat atgattccaa aattccacta cggatcataa 17521 gctgaggcca aattatcaaa aagaaaccca tccatgtcag tactggtaca cagacaagaa 17581 tagttaacca aactgtccta aatagtaccc agctaccact tatcagcgtc acacctgtga 17641 gaagaataga ccaaggctga caccaccacg gtttgtattt ccaaggactg aggggctttt 17701 gttcagacat ttgtttgaca gctagtttat aattatttct ttaactatta ctaaattacg 17761 caacttaagc aagcagaact agcaatgtga tgggtgcaaa ggagcaatgc ttaatctaca 17821 cacaagtgat cgtatgtcag aattatggag tggaagcagt tacagcagac gattgaagaa 17881 aactaatgac taaaaagcgg catttcaatg taatagtagt tgaatggtgt tattacaata 17941 gtattaccat taacaacaaa tgactatgta tacgttaaaa gtttgtagaa ttggaaattc 18001 tttaggaact actctacctg aagaaattct gcaaaagcta agagttgatg aaggtgacac 18061 catatttgtg actgaaactg ctgatggagt ttaccttact acttctaatc ctgattttga 18121 taaagctatg gaagcataca ataaagtaag cactaagtac agaaatgcat tggaagaatt 18181 agcaaagtga atgaacgatt ttggttggaa gagggaataa ttagagctat gcatgcagac 18241 caactatttc agtatggagg ttgataaaga atttatttat ttacggtgaa tcaacgccta 18301 caatatgtaa ataatctcta acacttccag ctaaataaaa tgcaaataat ggtggtactg 
18361 cattacccac ttgattaaaa caacttgttt gtgttcctac aaattcaaac caatctggaa 18421 aactctgtaa tctagcagct tctcttagaa ataaacgtcg ccgcctacca catggtaatt 18481 taattctgtg catatcacct gttgcagcag ctaagtttct acaggtcaaa gttcttgctg 18541 gtttatctaa atgaaggtcg cggggatgtt tgcaagagga tgctttttca tattttgcga 18601 catattcatc catgctgggt gtcagaaact tcgattcttc tggagtcata aatgcaagtt 18661 ctcctaaagc ttcgccagca ctaacttttt tctcgaatag tttcggaaat ttgaagtttc 18721 ctttgtgtcc tactactata agacgttctc gattttgagg aacaccaaag tcaactgcat 18781 ttaataatct ccattctact atgaacccta aatcctggaa tgtttgaaca atttcgtcta 18841 agtaccactt gtttttatag agaagcccac gaacattttc aaacaaccaa atttctggac 18901 aaagcctctt gactgcacta ataaaaattg gaaatccgtc tcgtgaatct tgtaaacctt 18961 tttgttttcc tcccacactg aaaggttgac aaggaggtcc acctataagt actttggcag 19021 atggtaattc tgtctcaggt gtgaggataa cctgagtaca gtttcccttt aagttttttc 19081 tatacgtaac acaagaatca gcatccattt caaacccgtg agttgcaaaa ccctgtgctt 19141 caaacccaac agataaacca ccgcagccag cgaataagtc taccaccagt ggactacggg 19201 aaaatgcagg tttgagtaca tgattgattt tatctacgta gttcattctg agttaggtaa 19261 ctgatctgtt gggtctttgg cgtatatgta gtttttttac ttttctacca cctcacgcac 19321 caactgtgct aatttttctg cgctatcagg aactgcgatc gccttcgccg cctccctcat 19381 ctttcgcaat tcttccggat gcttcaacaa gtccaaaacc tgactttgca aaacctcagc 19441 cgtcaattct gattgtttaa acagactcgc cgcaccaact gaggtaaaaa cttttgcatt 19501 ataagtctga tggtcttctg ctgcgaatgg gtacggaatc aaaattgctg gtgttccgca 19561 cacagccatc tccgttaaac tacctgctcc cgaacgagta attgctaaat tcgcccgttg 19621 tagcaacgca gccatattat tataaaatgg taatgaaata tactgatgat gtttgaaagt 19681 gccagcctca ggatcattat cgcctgtcaa atgcacaacc caagcaccag catcaaacca 19741 agcattggca gactgtcgca ccagcttatt gatagcaact gctccttggc taccaccaaa 19801 gacaacgatc acaggaacgt ctttaggaat cgggagatcc aaggatgctt caatttcttc 19861 ttctaaaaat tgagaacgca ccggagtact ggtgtaaatt gttttaacgc gaggcaagta 19921 ttgagaagcg gcttcaaaac ccaccgccac agcactacac caaggtccta aaaagcgagt 19981 cactttacct ggtaaggcgt tggcttcgtg gagaataaca 
ggtaaaccca aagaacgtgc 20041 tgcaataaca gccggtccag caatgtaacc tcctgtcgtg aacactccct gaaagtttcc 20101 ttgtcgtaaa agtttcctga cctctaaaat tgaaccaatc agttttccca aaacgcgaag 20161 tgaagaaatt ccaaaacctt gctgaaaacc ttcaactgca atagtattca atcgatactg 20221 tttgggaaca agctgagttt ctaatcgatt ggggacacca agccattcga tttgataatc 20281 cctcaagtgt tcagctagtg cgatcgccgg aaacacgtgt ccgcctgttc cactagcagc 20341 tattaacaac cgtataggag tttctaccat caccctccac cctgttagcg cttagcttaa 20401 cctaaaataa aacaacttcc accccttgtg tcgtcaagtt gcctaatttt actcatgcca 20461 aattttgctt cctttttaca caaacgtcaa acaatgttgc aacctactcg cttactagtt 20521 tattctctgc ttacactaag tttattaacg agttggaaaa ctgctcaagc cagtacacca 20581 cagcacttag tacaagccgg ggaacccctc caacccagta gcgcctctca aaatgcaccc 20641 tctcaagtaa aaaacttatt ggcgcaaatt gatgcagcag caagtcaggg gaatatcaaa 20701 ggcgtcatgc agttctatag tcccaatttc gttcacggag atggcttaac ccgtcagaat 20761 atggaaaaag ctttaactgc attttggcaa cgataccctc gattgaaata cactacccag 20821 gtacaatctt ggcaatctga aggtaatgcc attgtcgccg aaacactcac caatataact 20881 ggtgtgccct ctacgaacag tgagaactcg actttcaatg cgacaattag gtcgcgccag 20941 cgcattgctg caggtaaaat tatccgccaa gacattttgt ctgaacgcac tgaattgact 21001 tccggtgcca agccacctaa agtagagttc aagttaccac agcaagtaaa agttggtcag 21061 cagtttaatc ttgacgcgat tgtccaagaa cctcttggtg atgactatct cttaggaaca 21121 gcgttggaag aacctattaa accagagaaa ttgctgactg ccacacctgt ggatttagaa 21181 ttactctcat ctggcggaat ttttaaagtt ggacgcgcac cagctgtccc tggtagtcaa 21241 tgggtttctg ctgtgatcat gcgcggagac ggtatggtca tgatcactca gcgcatacag 21301 gttgtgaaaa actaattatt tagtcattgg tcatttacta atgactaatg attaatgact 21361 aagtagctca acctaattaa acgtaaaatg tcattacgag cggaacacag tggagcgaag 21421 taatcgcaaa gactctattt tacgcttttc aatgttgacc tacttaatga ctcatcacta 21481 ataactaaaa tgctttctat tcaaaatcaa attgttttga ttactggtgc aagcagcggt 21541 atcggtgcgg cttgtgccaa agtattcgca aatgcaggtg caaaactcat tttagccgca 21601 cgacggttag agcgcttgca ggagttagca gatgaggtga gcaaaaactc tgcgactgac 21661 attcacttgg tagaactgga 
tgtgcgcgat cgcactgctg tagaatctgc catctcaacc 21721 ctacccccct cctggtctga aattgacatc ctcatcaaca acgctggtct gagtcgtggt 21781 ttagacaaac tccacgaagg cgacatccaa gactgggaag aaatgattga taccaacatc 21841 aagggtttgc tgtacctgac gcgctatgtt gtccctggaa tggtgaaacg cgatcgcggt 21901 tatgtcgtca acattggttc catcgccgga catcagacat atcccagtgg taacgtctat 21961 tgtggcacta aagccgctgt cagagctatt tccgaaggtt tgaaacaaga cctgctagga 22021 acccctatcc gcgtcagttc tgttgaccct ggtatggtag aaacagaatt tagcgatgta 22081 cgctttcatg gcgatagcga tcgcgccaac aaagtttacc aaggagtcaa gcctctgact 22141 ccagatgatg tcgccgatgt gatatttttc tgcgttacgc gaccaagcca tgtcaatatt 22201 aacgaagttg tacttatgcc agttgatcag gctagcgcta ctctggtgaa taggcgaagt 22261 taaaattagg caataacaag ataaatttgg tatggcttta tgcaaaacac tgccaacgct 22321 gaaaaaactg gagttacagt tctcacgacg attaacatcg caaaagttct cacaataagc 22381 tgcttggttg cttttgctgt cgtcttcggc attcaagact ggcgacaagt catttatatg 22441 tgcctgcaca tcagttattg tctttggtgg ttaattgagc aatggtttta tccccagcgg 22501 cgacagatgt ttaatgaccc tgtgggcgtt ggtttatttg ttttcatact attatctgtc 22561 ggggtttttt acgcacttcc aggataccta gcatttacta atcccgttcc cctatcaatg 22621 accacagctg ctgtggcatt atcactttac attttcggca ctctgatcaa tgctaccgcc 22681 gatatccaaa agcttaccgc caagcagtat ggggcagggc tagttaatga caacatctgg 22741 cgtttctccc gcaatgtcaa ttactttggg gatctgctac gttatctgag ttttagtgtc 22801 gtcgctggtt cactctgggc ttatttgctg cctgcatata tcttagtttt ttacctccgg 22861 ttgatgtcta acaaagaaca gtcaatgtct caaaaatact cagaatatcc tgattacaag 22921 caatctagcg cccgtttgat tccgtttatc tggtgaagtt taagcgagga gttagtaggt 22981 agagtggagg gtatttcctc cactcttttg ctatgcagct tcaagaacac gcaataaatc 23041 attgaccatt ccataaggat gattgacgac agaggtatta caaaaaacag ctaacgtcag 23101 tttgcgtccg tgaaaatcgg gtaaatgcat agcgcttcat ttttcatgag ccttggatgg 23161 acaacccgac aacaatgtct actcgta // LOCUS NODE_1360_length_23018_cov_4.65457523018 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 23018)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 23018)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage       :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
source 1..23018 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 657..1319 /locus_tag="DP116_12200" CDS 657..1319 /locus_tag="DP116_12200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864629.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_12200" /translation="MTWLISTLVIGISAAFATTFDDNLYLTAFFGKVNRTFRPKHIVL GEFLGFTALVFASLPGFFGGLVIPEAWIGLLGLLPIAIGISHLMSREDQQEVVLQTVS VDLPSPAKSRPHKKSLLETLRDPQTYRVSSVTIANGGNNIGIYVPLFASSNLPSLGVI LCVCYFTVGVWCCLSYFMTRNPLMAPLLARYGRKVFPLVLIWLGFSILMKSESYRLFI PS" gene complement(1736..1936) /locus_tag="DP116_12205" CDS complement(1736..1936) /locus_tag="DP116_12205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010873917.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_12205" /translation="MNFEMGGKTLLSGVTTYISPELKAELEAWAQEEERSISWLLAKL IENKLQERRQKVSQALSAGNQD" gene 2041..2856 /gene="map" /locus_tag="DP116_12210" CDS 2041..2856 /gene="map" /locus_tag="DP116_12210" /EC_number="3.4.11.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995193.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I methionyl aminopeptidase" /protein_id="PRJNA477356:DP116_12210" /translation="MNILDNLFTQPVTQRQSRGISLKSQREIEIMRQAASIVATVLKE ISQIVQPGMTTGDLDAYAEKRIREMGATPSFKGYHGFPGSICASINHEAVHGIPSCKR VIRPGDVLKVDTAACYQNFHGDSCITIAVGKVSPKAEKLIQVAEEALYKGIEQVKAGA YLMDIAGAVQDCVEAHGFSVLENYTGHGIGRNMHEEPSVFNFRTREMPNVKLRAGMTL TIEPIVTAGSKQTRTLSDRWTVVTLDKSLAAQFEHTLLVTDNGYEILTDRTKV" gene 3181..4359 /locus_tag="DP116_12215" CDS 3181..4359 /locus_tag="DP116_12215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312917.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphorylase" /protein_id="PRJNA477356:DP116_12215" /translation="MKPGGRTQAAIEGSCSAQADKVPRIALISAFSGEADRLVSEMQL NNGANRFDGCTVINGHRFSKGKLRGKNVVVLLTNISVVNASMVTQLTLDKFRITNVIF SGVAGGIGGIGANDDNPDTPNETPIGSVTIPERWGFHQEMYFNNTRDTVPCALSVGLQ LNYTLQEPSQEAQTCNFISGTAQSLGFANTETVFAPDAKNAFLRNTNVSSDDIPQYFL DANNVQQLRSVPFPGVPTNPNTDQNLKFWFTVDPQMFAKARQINVELLNCAPVDTSGK CNSTPLDPAPRLIVGGNGLTGPTFVDNAAYRKYVATNLNFDERGNKNNNTEVLVADME TTASAMVAFSNRVPFIAVRSVSDLAGGGEQSAAAQLQTFFAVAAENQARVVLKLVELL " gene complement(4405..7674) /locus_tag="DP116_12220" CDS complement(4405..7674) /locus_tag="DP116_12220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316658.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AcrB/AcrD/AcrF family protein" /protein_id="PRJNA477356:DP116_12220" /translation="MQQVNQTSGFSISAISIRQHIGTLMLTVAVIVVGIFFLTTIQVD LLPSITYPRIGVRLEAPGISPEVAVDEITRPLEEALSATENVVQVFSRTREGQVSLDL FFQPGGDIDQALNDATAAFNRGRGQLPDTIEEPRIFKFDPSQQPIYELALTSPSLQGK DLRVFADEELSRELSVVQGVASVDISGAAEEEVRVLVDLRRLQALGVGLTDVLNQLTA RNQDISGGRILGKNSEPLTRTVGRFKNADEISNLSFQVSSSPSSSSSPSTSSTSSPPR RVYLRDFAEVSDGTEEQRIFVYLNRQPAVKVSIQKQPDANTINVVDGVKKRIEQLRGS GLIRSDMVLTPTTDESRFIRNSLNDVITSAVSGALLAAAAVLLFLGSIRQTFIISLAI PLCTLAAIALMKLFGLTLNVFSLAGLTLGIGQAIDTSVVILENVAEKTGMTPNQKQTE KLADKEMGKKPNSKFFIETTIASSQEVESALVAATAANLVSVVPFLLIGGFISLLFNE MILTIGFAVAASLVVAVTVVPMLCSRLLAIPWSSRIREFWLLRQFNRRFEDATILYAK LLKNVIRYRIIAVTIVFLILGGGSLFMAGQISQEILPRINTGQANLRVQFPPGTPLAT SQKVMQVVDDILMKQPETDYVFSTVGGFLFGSNTTENPLRASSTINLKPDKDVEKFVQ KVNQEFNKLNLAGILLRLSPGQVRGLILSNTPAQGSEVDVILQGNDEQNLQQAGRQLL QALEEKATLARFRPDADPRQPEVQIRPDWERVAALGLTAQQIGETIQTAIEGSVPTQI QRGNRLVDVRVELNQEAIERPSQLEGLPLFTQNNQQVRLLDVARIEEGQAPGEVQRIN QRQVFVIAGNLSEGASLGDAIAQVNEIVKEIQLPDGVTIIPSSAQETNQQLQNSLKTM GALATFLIFVVMAVQYNSLIDPLIIMLTVPLALAGGIFGLYVTKTAIGATVIVGAVLL VGIVVNAGILMVELANQIREEEGCDRRTAILKAAPQRLRPIMMTTVTTILGLFPLALG IGEGSEFLQPLGIVVFFGMAIATLLTLFLIPCFYILLHDLLGGKWAKPVFGRLGKLRK W" gene complement(7866..9218) /locus_tag="DP116_12225" CDS complement(7866..9218) /locus_tag="DP116_12225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740602.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_12225" /translation="MQLLRKKILITSVLSIGLLTAGCGSLPNESAEAESQRPGGKQQG GGAVPVDASIARTGVLRQDPEYTGTTTPFRTVSLRSRVEGSLLALNVDVGDAVKQGQI IAQIDDALLRTAQNQAEAELAALKSEVARERARVSNARAEVEKARAQLVQAQADSQRQ QKLVKEGAIAQQTAEQARTEARTAAQALRAAQEQVRTQQQALAAAQGRVVAQQAVVAE AKERRSYAKLTSPITGAVLEKMTEPGNLLQPGNEALRIGDFSRVKVVVQVSELELGKI RLGQSVKVQLDAFPNQTYTGQVTRISPAADTTARLIPVEVVIPNRDGKIGSGLLARVN FETQTQARVIIPEVALQGTSGDKGTRGQGDKGQSASSSSSSSQSPVSKRQGTVFVVTQ AGDKVTVTARAVTLGERADANVEVLSGLQPGERFVTRSGRPLKDGSIVRLSILSEQAS " gene complement(10463..11371) /locus_tag="DP116_12230" CDS complement(10463..11371) /locus_tag="DP116_12230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876283.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem reaction center subunit H" /protein_id="PRJNA477356:DP116_12230" /translation="MGLYKIADFDPDYKDTFQGNDIKGMGVYAEGSDEKVGTVGDVLV DEDGHFRYLVVDLGFWIFGKKVLLPVGRSRIDYNVDRVYAIGITREQAERLPEYNEHD VLDYDYEERVRGTYRTPVDTATPLESSAALIDPTYSAATAGYQPTPAAVNPTYDRNTY KYEQDAPLFNLNEQDHQTLRLYEERLVANKRRQKAGEVTVGKHIETETARVAVPVEKD RVVVERVTPADAGRAVAPGEATFREGEVARVEIYEETAEVRKEAFVREEVRVRKVVDQ DTVEAQDTIRREELDINAPGLPVDER" gene complement(11656..12498) /locus_tag="DP116_12235" CDS complement(11656..12498) /locus_tag="DP116_12235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318303.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem reaction center subunit H" /protein_id="PRJNA477356:DP116_12235" /translation="MPLYKLADFDPNYRETFGGDDIKALDLYTEGGVRVGSVADALVD ADGRFRYLVIDTQYNSSHKRILLPIGLSQIDYNQRRVYVDGLSKEQVQSLPVYKDDIT VDYDYEEQLRKTYRPSSSGLTYDRDTYNYQQDPSLYNLNDQNHQTFKLYEERLIANKS RIKTGEVTVGKHIETETASVSVPIERERVVIERVNPTNAGTVVNPSELKFQEGEVARI EIYEETPEIRKEAFVREEVRIKKVVETETVEAQETIRREELDVRTQGDLPIAQTDVTP NNLV" gene 12846..13478 /locus_tag="DP116_12240" CDS 12846..13478 /locus_tag="DP116_12240" /EC_number="4.1.2.14" /EC_number="4.1.3.16" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318304.1" /note="catalyzes the formation of pyruvate and glyoxylate from 4-hydroxy-2-oxoglutarate; or pyruvate and D-glyceraldehyde 3-phosphate from 2-dehydro-3-deoxy-D-glyconate 6-phosphate; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="keto-deoxy-phosphogluconate aldolase" /protein_id="PRJNA477356:DP116_12240" /translation="MSGQAWLSQLKQHKVIAVVRAPEVSLTRQMALAVASGGIQLIEI TWNSAGATELITQLRVELPNCTIGTGTLLNLQQMQEAIAAGAQFLFTPHVDPVMIQAA VDIGVPIIPGALSPTEIVTAWSCGASCVKVFPVEAVGGVSYIRSLRGPLGHIPLIPTG GVTLENAKEFLQAGAIAVGLSSQLFPKEFVDTQNWQAIAQKAASLMQKMS" gene 13667..14179 /locus_tag="DP116_12245" CDS 13667..14179 /locus_tag="DP116_12245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876286.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12245" /translation="MNSLTSPISMRPLKLFITGMALTLNVFLSASNQVWAQSKSQTIK GQVVVLQQSQQRWIQIDLSSQRLTAWEGDKPVYTFIISTGKKSTPTPTGVFQIQSKHK SARMQGEDYDIPDVPYTMYYSGSYGIHGAYWHKKFGTPISHGCINVAPKKAKLLFNWA SIGTPVVVQR" gene complement(14402..16000) /locus_tag="DP116_12250" CDS complement(14402..16000) /locus_tag="DP116_12250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128702.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Hsp70 family protein" /protein_id="PRJNA477356:DP116_12250" /translation="MAIAIDFGTSNTVIARWNPVTQQPETLNIPGLSVQQSLNPPLIP SLVYVEDASKGQVLLGQQVRDRGFDLKNDPRFFRSFKRGIGADIQGFLPELDGQIVTF EQVGQLFLSQIIEQLAPLQGGIDSLVLTVPVDSFEAYRHWLGQVCQTLNVEQVRMLDE PTAAALGYGLADQEILLVIDFGGGTLDLSLVRLDQTVQRTQKPVGFLLKWGNKSLAED SKQKVKTARVLAKAGQNLGGTDIDNWLVDYFAKTQGLVVSPLTTRLAERVKIQLSTQT QASEVYFNDETFESYELDLNRDTLNTILTEHSFFERLDESMTSLLQQARRQGIEVSDI NAVLLVGGTVQLSAVQTWVKQYFEPEKIRCERPFEAIAQGALQVAQGVEVKDFLYHSY GIRYWDRRNKCHSWHPIIKVGQPYPMTQPVELVLGASVESQPSVELIMGELGADTGGT EVYFDGDRLITRSAASCQTNVKSLNDKDGARSIAQLTPPGFPGSDRIKVGFQVDEQRF LRITVEDLLTSETLLENQVVAQLS" gene complement(16254..17219) /locus_tag="DP116_12255" CDS complement(16254..17219) /locus_tag="DP116_12255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744513.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UV DNA damage repair endonuclease UvsE" /protein_id="PRJNA477356:DP116_12255" /translation="MSAVRESKSPSGVAINNSPAQQLQKTAVVHLGLVCITFSKDVRF RTITRKRYLELPEEQRETALKVIYRENLQRLDLALTFCVRNSIRLYRMSSGLFPMSDL EDQVGATVLEKMSADLAKIGQRALKLGIRMVLHPDQYVVLSSDSPQVVALSIKILDRH ARTLDLLGLPRSSWSLMNIHGGKSQRRDQLVDVISELPENIKSRLTLENDEYAYSASE ILEVCQRAGIPLVFDAHHHICHESLDSYDDPSVAQMFYAARETWANPDWQLVHISNGE EAFRDRKHSEFINAMPSVYREAPWIEVEAKHKEEAIAHLRSWWLQ" gene 17746..17964 /locus_tag="DP116_12260" CDS 17746..17964 /locus_tag="DP116_12260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995434.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12260" /translation="MKNRHSSESKQFVGNIKNGIWVFGISSWLFGITDRSIASFSDGY LSALDLTQLFTAAIFLVAWWFLKPTSRV" gene 19143..20879 /locus_tag="DP116_12265" CDS 19143..20879 /locus_tag="DP116_12265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S8" /protein_id="PRJNA477356:DP116_12265" /translation="MRHQKLSPGLLLAFEDYQREGQEALIPQVRMLSIVPPKNNLKPI RSIVFIYCDENADLSHLSQHGIEVNQNRGHVRTAFLPVQSLDALSDDPAIERIKPSRK LKLHMDVAKSTVHIPEFRKKNSNLTGKGVIIGIIDSGIDPKHPAFKGRILRIWDQTLS GSGVIEGKYGAELTGSLLTVSQDTNGHGTHVAGIAAGMDATYGGVAPEAELLIVKSDL DEGHIADAVRYIFRVARELGRPAVVNMSLGGHFDPHDGSDSLSKVIDSETGPGRIVCC AAGNEGNDNIHAQAIVPPGKTHTMRFNVPLNQASIATLNSWYSSAGQLEVSLRSPNGF VTPFQPVIADGNYIKEYTLQDSQVQVATPKRDPGNGDYNVLVQIRGKGKGNYTPPVQG GIWQLRFKNTSAKDVRLDVWTLDGSSLFTGQSIADSVKIGSPGCASSAITVAAYTTKE KYTDIDNKLEEMGFSLNTISDFSSEGPLRNDAKKPDVAAPGAMIVSTLSSNANSDRSM IINSKFMALAGTSMAAPFISGLVALLLQRNPKLDPVAIKELLRKNSSIPKKPPGTFDS KWGYGLIDAQNL" gene 21003..21308 /locus_tag="DP116_12270" CDS 21003..21308 /locus_tag="DP116_12270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12270" /translation="MNYQKLDTALAVALNDVNNSQEPSLKVFIHTQNDANYTETTAVL ENLGVADVTPEKDVFTATLSPNAISQLSEQPWVQYIKLSQNLHLVNQKIKGGMKFFK" gene 21671..22960 /locus_tag="DP116_12275" CDS 21671..22960 /locus_tag="DP116_12275" /EC_number="2.7.7.27" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948852.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glucose-1-phosphate adenylyltransferase" /protein_id="PRJNA477356:DP116_12275" /translation="MKKVLAIILGGGAGTRLYPLTKLRAKPAVPVAGKYRLIDIPVSN CINSEIFKIYVLTQFNSASLNRHIARTYSFAGFTEGFVEVLAAQQTPDSLNWFQGTAD AVRKYLWLIEEWDVDEYLILSGDHLYRMDYRQFVHRHRETEADITLSVLPIDERRASD FGLVKIDESGRIINFSEKPKGDALKGMRVDTTVLGLTPEQAQEQPHIASMGIYVFKKE VLIKLLKEASERTDFGKEIIPDAANNYNVQAYLYDGYWEDIGTIEAFYDANLALTKQP QPSFSFYDEEAPIYTRSRYLPPSKILDSQITESMISEGCILKNCRIEHSVLGVRSRVE AGCVIQDSLIMGADFYQPFAERQSDCETKGVSLGIGANTTIRRAIIDKNARIGCDVQI INKDNVQEAERENQGFYIKSGIVVVLKNAIIPDGTII" BASE COUNT 6444 a 5043 c 4841 g 6690 t ORIGIN 1 atgtcattag tctagttata ggtggcgatc gcctgtgaca gttgcgtaag tcctggatcg 61 aaatgattgc cgctcaacag agctagcgct ggcgttgcat aacgctttgc acaactctat 121 gcaggagtat tgcctttgat tgatttcctc taagcttaca cgaacggttg cagcaatatg 181 caatagcaaa acactcaaag cgcaaatact gttttattcc tacatttcct cgctaaaagt 241 ttcacataat aatcagtata ccattgttga atgagtatat acaatatttt caatcaactt 301 tcagcataat agaagtaagc tatagaaacg aatttgcttg tttgaagtca gcagtgtgca 361 ttctgaattc aataaagctc acttctagcc aataactaaa acccaactct tcaaatgtcg 421 tcactttatc gttaagttat tcatcacttt tatttatcgt tttgaaaaaa aagaagattt 481 gctttcattt agttgaaacc cttaatattg aagccatacg cggtcaaggt aagtaccttc 541 cggagaaaaa aaatcaccaa tatcattgac gattgagcaa tctcctgaac tgccctcact 601 tcgagaagca gagaaggaca aggaaaaaca gtattgaata atagtaaaca acgatgatga 661 cttggctaat tagtacatta gttattggaa tctctgcggc ttttgcaacc acttttgatg 721 acaatttata cctgacagct ttcttcggaa aagtcaatcg cacttttcgt cctaagcata 781 ttgttctggg tgaatttctt ggattcactg cattagtgtt tgcaagtttg cctggtttct 841 ttggtggttt agtcattcca gaagcatgga ttggattgct gggcttgctt ccgattgcta 901 ttggtatcag tcatctcatg agtcgagaag accaacaaga ggttgtattg cagacagtct 961 cagttgactt accctctcct gccaaatcta gaccccataa gaaatcactg ttggaaactt 1021 tacgcgatcc ccaaacttac cgtgtttctt cagtgactat tgccaatgga ggaaacaaca 1081 ttggcatcta cgtgccatta tttgctagta gcaatctccc aagtttgggg gtaatacttt 1141 gtgtttgcta tttcacagtt ggggtgtggt gctgtctgtc 
ttacttcatg actcgtaatc 1201 ctctgatggc tcctcttctg gctcgttatg gtcgaaaagt cttcccactt gtcttaatct 1261 ggctgggatt ctctattctg atgaaaagtg aaagctatcg actttttatc ccgtcttgag 1321 ccgtttttca gttgggcaat tgcgcaaagc ccacggtttt ttatcattca aatataacag 1381 gaaacgatat gatatttttt ctggataaac acttataaaa acgaagaacc tcacccgcct 1441 ccggcaccct ctccgcgatt tcggagaggg ttgaattaat atttgatatt taagtcagtc 1501 aattgcttga gtttgtctgg attgacaaca atgaaaacag attgaatgcg gttatcccta 1561 aaatcaaatg tcataacact gtggaggcga tcgctttgct cacatatttt gcacctggaa 1621 atttgcagcg gagacagcct ttgtttacat agccgcacat gaatagtgtg tcgggtcaag 1681 tatttgttgt gcaaatttgc gatgctccca aacagaggca tcctgcttgg cgcagctaat 1741 cctggttgcc cgcactcaac gcttgtgata ctttttgccg tcgttcctga agtttatttt 1801 cgatgagttt ggcaagaagc caggagatag agcgttcttc ctcttgtgcc catgcctcca 1861 actctgcttt cagttccggg gaaatgtacg ttgttacacc agacaataga gttttgccgc 1921 ccatttcaaa atttatgtat tgcgtagata gtacttacca tagcatttta ctgtagcttt 1981 tataattata tcatatgcta tgctatgcta caacagcata ccttcagttg acatctcacc 2041 atgaatatcc ttgataacct cttcacccag cccgttactc aacggcaaag tcgtggtatt 2101 agcctcaagt cccaacgtga aattgaaatt atgcgtcaag cagcttcgat tgttgcaact 2161 gtgctaaaag agatttctca aatagtgcaa cctgggatga cgactggtga cttagatgct 2221 tacgcagaga aacgcatccg ggaaatgggt gcaacaccaa gtttcaaggg ataccacgga 2281 tttcccggct ctatctgcgc ttcgattaat catgaagctg tgcatggcat ccctagttgt 2341 aagcgggtga ttcgacctgg agatgttttg aaggttgata cagcggcttg ttatcagaac 2401 tttcacggtg attcttgtat cactattgct gtgggtaagg tgtccccaaa agcagaaaag 2461 ttaattcaag ttgcggaaga ggcactctac aagggaattg agcaagtcaa agcaggagca 2521 tatctgatgg atattgcggg tgctgttcaa gactgtgtag aagcacacgg cttcagtgtt 2581 cttgaaaact acactgggca cgggattggt cgcaacatgc acgaagaacc ttcagtgttc 2641 aactttcgta ctcgtgaaat gccaaatgtg aaactccggg ctggaatgac attaacaatc 2701 gaaccgattg tgactgctgg ttctaaacag acccgtacgc tatcagaccg ttggactgta 2761 gtcacattag acaaatcttt agctgctcag tttgagcata cattgcttgt gactgacaat 2821 ggatacgaaa tcttaactga ccgcacaaag gtttagtaat tcgtcaaacc 
aagcaggtag 2881 tcaggcaata cggttcggta gtgtcggtat ctatcccctt gcaagcaccg aggaagaggg 2941 acaacagaag caggggtgga gaaaagaact gatttttctt ctcctgcctc aagagcctca 3001 agactcaaat tagaacgtaa aaaaatagat tataaatctt tgctgtttta gtatattctt 3061 ctttattgag agaagcttga aaaatgtaat tttaattact acataaagat aaaaatcatg 3121 caaaaataca ggactttgct gaggttgttg cttacgtcgt gtatatgtgt aggaagcgtg 3181 atgaaaccgg gcggtaggac gcaggcggct atcgagggtt cttgtagcgc ccaggcagat 3241 aaagtaccac gaatcgccct gatttcagca ttttctgggg aagcggatcg cctcgttagc 3301 gaaatgcaac tgaataatgg agccaaccga ttcgacggct gtacggtgat caatggtcat 3361 cgcttcagca aagggaagct gcgtggcaaa aatgtggtgg tgctgctaac aaatattagt 3421 gttgtgaatg cctctatggt tacccagttg accttagata aatttcgcat cacaaatgtt 3481 atcttcagtg gagtcgccgg aggaattggc ggtatcggag caaatgacga caatccagat 3541 acgcccaacg agactccaat cggatcggtt accattccgg aacgctgggg ctttcatcaa 3601 gaaatgtatt tcaacaatac ccgagataca gttccctgtg ctttatcagt cggattgcaa 3661 ctcaactata cgctacaaga gccatcacag gaagctcaaa cctgtaactt tatatccgga 3721 accgcacaat cgctagggtt cgctaatacc gagacagtct ttgcaccgga tgcgaaaaat 3781 gcctttttac gtaacaccaa tgtcagttcc gatgacatac cccagtattt cttagatgca 3841 aataatgtac agcaactccg ctcagtgccg tttccaggtg tacccaccaa tccaaacaca 3901 gatcagaatc tcaaattctg gtttaccgtc gatccacaaa tgtttgccaa ggctcgccaa 3961 attaatgttg aactcctgaa ctgtgcacca gtcgatacga gcggtaagtg taacagtacg 4021 ccgctcgatc ccgctcctcg tctgattgtg ggtggaaacg gacttactgg tcctactttt 4081 gtcgataatg ctgcctatcg caaatacgtc gcgacaaatt taaacttcga tgaacgcggt 4141 aacaagaaca acaatactga ggtgctggtt gcggatatgg aaaccacagc ttcggcaatg 4201 gtagcttttt ccaatcgcgt accctttatt gcagtacgga gtgtctcgga tcttgcgggc 4261 ggtggtgagc agtcagcagc agcacaactg cagactttct ttgcagtagc ggctgaaaac 4321 caagcacgtg tcgtgttgaa gctggtagag ttactgtagc aggaatttga ctgtacaaaa 4381 ggctggttac tgcaacggag tggctcacca tttcctcaac ttacccagcc taccgaacac 4441 gggctttgcc catttcccac ccagcagatc gtgcagcaga atgtaaaaac aggggatgag 4501 aaacagcgtc aagagtgtgg cgatcgccat cccaaaaaac acgacaatcc ccaatggctg 
4561 caaaaactct gaaccttcgc caatacccaa agctaacgga aacagtccca aaatagtcgt 4621 gactgtagtc atcataatcg gacgcaaacg ttgaggggca gctttgagga tggctgtgcg 4681 gcgatcgcaa ccttcctctt ctcgaatttg attagccagt tctaccatga gaattcccgc 4741 gtttaccaca atacccacca gcagcaccgc accaactatc accgttgcgc caatagctgt 4801 tttagtgacg taaagtccaa aaatcccccc agctaaagct agtggtacag tcaacataat 4861 gattaaaggg tcaatcagcg agttgtattg cacagccatg acgacgaaaa ttaagaacgt 4921 cgctagcgca cccatagttt tcaatgagtt ttgcaactgc tgattcgttt cctgtgcgga 4981 acttggtata atcgtaacgc catcaggtaa ctgaatttcc ttgactattt cattcacttg 5041 tgctatagcg tcacctagac tagctccctc gctaaggtta cctgcaatca cgaaaacttg 5101 acgttggtta attcgctgaa cctctcctgg tgcttgaccc tcctcaatac gggcaacatc 5161 taaaaggcgg acttgttggt tgttttgtgt aaataacggt aatccttcta actgggaagg 5221 acgctcaatt gcttcttgat ttaactctac gcgtacatca actaggcggt taccgcgttg 5281 aatttgcgtg ggaactgaac cttcaatagc agtttgaatt gtttcaccaa tttgttgggc 5341 tgtcagtccc aaagctgcaa ctctttccca gtcaggacga atttgtactt ctggttgacg 5401 cggatcagca tctggtcgaa atctagcgag tgtagccttt tcttctaaag cttgcaggag 5461 ttgtctacct gcttgctgta agttttgttc atcgttacct tgcagaataa catcaacttc 5521 tgaaccttga gctggggtat tactcaagat taaaccccgt acctgaccag gactcaagcg 5581 cagcaaaatt cctgctaaat tgagtttatt aaattcctga ttaacctttt ggacaaactt 5641 ctcaacatct ttatctggtt tgagattgat agtactgctg gcgcgcaaag gattttctgt 5701 tgtgttgcta ccaaagagaa aaccacctac ggttgagaaa acatagtcag tttctggttg 5761 cttcatcaag atatcatcca caacctgcat gactttctga gaagttgcca aaggagtacc 5821 tggaggaaac tgcactctta aattggcttg tcctgtattg atgcgcggta agatttcttg 5881 agaaatttga cctgccataa acaaactgcc accgcctaaa atgagaaaaa cgatagtgac 5941 ggctattatc cgataacgta taacgttctt taacaactta gcgtataaaa ttgttgcatc 6001 ttcaaagcgg cgattaaact ggcgcagtag ccaaaactct ctgatacgac tagaccaagg 6061 aattgccaga agtcgagaac acagcatcgg tacgactgtg acagcgacaa ccaaagaagc 6121 agcgactgca aagccaatcg tcagaatcat ctcattaaat agcagtgaga taaagccgcc 6181 aatgagcaag aatgggacta cagaaactaa gttggcagca gtcgcagcga ctaatgcaga 6241 
ttcgacttct tgggaagatg caattgtagt ctcgataaag aattttgaat tgggtttttt 6301 ccccatttcc ttatctgcca atttctccgt ctgcttctga ttgggagtca tgcctgtttt 6361 ttcagcaacg ttctccaaaa tgacgactga tgtgtcaatt gcttgaccga tgcccaaagt 6421 aagacctgct aaactgaaca cattcaaagt caagccgaat agtttcatta aggcgatcgc 6481 cgccaaagta cacagcggaa ttgccaaact aataataaat gtttgccgta ttgatcccaa 6541 aaataacagc actgctgctg ctgctaacaa cgccccagaa actgctgatg taattacatc 6601 attcaacgaa ttgcggatga agcgagattc atctgttgta ggggtcaaaa ccatatcaga 6661 tcgaatcaag ccagaacctc gcaattgctc aatccgtttc ttcacgccat ctacaacgtt 6721 gattgtattg gcatcaggct gcttttgaat tgaaactttc accgctggct gacggtttaa 6781 ataaacgaag attcgctgtt cttctgtacc atcactgact tcagcgaagt ctcgtagata 6841 aacgcggcga ggaggagatg aggtggatga ggtggatggg gatgaggagg atgagggaga 6901 tgaggagact tgaaaagaga ggttgctgat ttcatctgcg tttttgaagc gtcccaccgt 6961 gcgggttaat ggttcagaat ttttccctaa aatcctacca ccagatatat cttggttgcg 7021 ggctgtgagt tgattgagta catcagttaa cccaacaccc aacgcttgca agcgccttaa 7081 gtcaacaagc acccgtactt cttcttcagc agcaccagag atatcaacag aagcaactcc 7141 ttggacaaca ctgagttcgc gagacagttc ctcatctgca aatacccgca aatctttacc 7201 ttgtaaagaa ggagatgtca gcgccaattc gtagattggt tgttgggaag ggtcaaattt 7261 aaatatacgc ggttcttcaa tagtatctgg cagttgtcct ctgcctcggt taaaagcagc 7321 agtagcatca tttaaagctt gatcgatatc gcctcctggc tggaaaaata aatcgaggct 7381 aacctgtccc tcacgagtgc gggaaaaaac ttgcactaca ttctctgtag ctgataaagc 7441 ttcttccaaa ggtctagtga tttcatctac cgccacttca ggtgatatac caggtgcttc 7501 cagccgcaca ccaattcgcg gataagtaat tgatggtaat aaatctactt ggatcgttgt 7561 cagaaaaaat atcccaacaa caatcaccgc cacggtgagc ataagtgtgc ctatatgctg 7621 gcgaattgaa atggcactaa tactaaatcc gctagtctgg tttacctgct gcatgatact 7681 aaatgctgag tactgagtat tgttgtctcg gtatattaga cctctgttgg aaaacaatgt 7741 agagacgtta catgtaacgg agccactcgc gtgcgcggtt ttcccgcgtt gagcgatgtg 7801 gcgttctcta caagagtttc acgtaacgca taattaattt ctggagatgt ctattgaata 7861 ctgctctacg atgcttgctc agaaagaata gaaagacgta caatactgcc atctttcaat 7921 ggtctgccac 
tacgagtcac aaatctttct cctggttgca agccagataa aacttctacg 7981 ttggcatcag ccctttctcc aagcgttacg gctcttgctg tcaccgttac tttatcacct 8041 gcttgtgtca caacaaatac cgtcccttga cgcttagaga ctggggattg agaagatgag 8101 gaagatgagg aagcagattg ccccttgtct ccttgtcctc ttgtcccctt gtctcctgaa 8161 gtcccctgaa gtgctacctc aggtatgatg actcgcgctt gtgtctgggt ttcaaaatta 8221 actcttgcca acaaaccact accaatctta ccgtccctgt tgggaatgac gacttcaact 8281 ggtatcaaac gagctgttgt atcggcggct ggggaaatgc gtgtcacttg cccagtgtat 8341 gtttggttag ggaaagcatc taactgcact tttacagatt gccctaagcg aattttcccc 8401 agttctaatt ccgaaacttg aaccacgact ttgacgcggc tgaaatcacc tattctcaag 8461 gcttcattcc ccggttggag aagatttcct ggttctgtca tcttttctaa aactgcccca 8521 gtaatcgggg atgtgagctt tgcataagaa cggcgttctt tcgcttctgc aaccacagct 8581 tgttgagcaa cgactctacc ttgggctgcg gcgagggctt gctgttgtgt gcgaacttgc 8641 tcttgtgcag cacgaagtgc ttgggcagct gttcgggctt cggtgcgtgc ttgttcggct 8701 gtttgttgtg cgatcgcccc ctctttcaca agtttctgct gcctttgtga gtctgcttgt 8761 gcttgtacca gttgtgctcg cgctttttct acctccgcac gggcattgct aactctggct 8821 ctttcccgtg cgacttctga cttgagtgct gccagttctg cttctgcttg attttgagca 8881 gttcttaaga gggcatcatc tatctgtgcg atgatttgtc cttgcttgac agcatcacct 8941 acatcgacat ttaatgccaa aaggcttcct tctacgcgcg atcgcaatga cactgtacgg 9001 aatggtgtgg tagtccctgt atattctgga tcttgtcgca acacaccagt acgggcaatc 9061 gatgcatcga caggaaccgc accgcctccc tgttgcttac ctccaggacg ctgtgattca 9121 gcttctgctg attcatttgg aagcgaaccg caacctgctg tcagtagtcc tatgcttagt 9181 acagaagtaa ttaaaatctt ctttcttaag agttgcattt ttttagatgt tggctgtctc 9241 aaaaaacagt ctctattttt ctacctttcg ctgtgtgtga atttgtgtgc tactttccgc 9301 tgctgatttc cttacttaat catctatgca cgctcctatc aggctgtttt gggttagcct 9361 agtcgtttga ttttctaatg attttcttaa atttaatact ctaaataaaa atgactacta 9421 aagataaaca cctcctgtac tttctacagt tttgcgagaa ctattgtttt ttatgttaga 9481 ataatgcaaa ttttcttaag ttgattctaa ctatttttaa attttattga ctctatctat 9541 aggtcagcta tctcatctcg tctgaaagga aaatacgaca ggcttaaata gcaaaaaggt 9601 ggttaaactc tatatataac 
agagattccc cgtcttggtg acgaagaaag acaaggaaga 9661 cgacgagtga gcaaaacaag tcaaaagtca aatgaaaaca agctctgtct atacagatga 9721 cactttctat ctactggttc aaagactgca aatttttatt tggataaact ctacttttgt 9781 gtttttgctt tttgcttcaa ataattctca aagaaacaaa aacaagctgt aacgccttaa 9841 cgttaatact gatttgtgag agccggcaag aatcaaaagt cctaagcaat tattgacttt 9901 ttactcttga caaatgatct gtcataacta tatcaaaaga ttgtttgaac tcgtgactca 9961 ctatcaaaga ctcgattttc caaaaatgtt tatgtacgac ccggtgtttt tacaactcct 10021 gcgcggattg ctatatacgc gcaacagctg atctatttct cctgcgagca tcccaaatgt 10081 gtaagttata agctgttacg cttttaactt gcatattatt ttggtttagt caatcattga 10141 aaccctcttc ccttctgcct tctgccgtct gccttctgcc ttgctgttgt gcatttttaa 10201 tgcacaacag cttatcaaac ccagataata aattgtcatt gcgagcgcaa cgtagtgaga 10261 ccagcgcgaa tgacggcttt cccgacagag gcgactggcg ttagcccgga gggcgtgcgc 10321 tttgcgcata cccgaagggt gaacagtgaa actgataact gataactgta aaaaaaatct 10381 tgattacatt gactttacta atgatgagta gagacccgga attccgcgtc tctacttcta 10441 tcacttatgt aaaaaatttg atttagcgct catcaacggg aagaccagga gcgtttatat 10501 ccagttcttc acgacgaatt gtatcttgag cctcaaccgt gtcttggtct accactttcc 10561 tgactctgac ttcttcacgt acaaaagctt ctttgcgaac ctcagctgtt tcttcgtaaa 10621 tttccacgcg agcaacttct ccttcacgga aggttgcttc accaggagca actgctctac 10681 cagcatctgc tggagtaact cgctcaacaa caactcgatc tttttctact ggaaccgcaa 10741 ctcgtgcagt ttcagtttca atgtgtttac caactgttac ttccccagct ttctggcgtc 10801 tcttgttagc aactagccgt tcttcataca atctgagggt ttgatgatcc tgctcgttca 10861 gattgaacaa cggagcgtct tgctcgtatt tgtatgtatt acggtcgtaa gtgggattga 10921 ccgctgcagg tgttggctga tatcctgctg tcgcagctga atatgttggg tctattaaag 10981 cagcagatga ttccagagga gttgctgtat ctacaggagt ccgatatgta ccgcgtaccc 11041 gctcttcata gtcgtaatca agaacgtcgt gctcattgta ctcaggtaat ctttcggctt 11101 gttctctggt gattccaatt gcatagacgc gatcaacgtt atagtcgatg cgagaacgac 11161 caactggtaa taagactttc ttgccaaaaa tccagaaacc taagtcaaca actaaatagc 11221 ggaaatggcc atcttcatca actaaaacat caccgacagt gccgaccttt tcatcgcttc 11281 cttctgcgta 
aacgcccatt cctttgatgt cattaccttg aaaagtatct ttgtagtctg 11341 ggtcgaagtc tgcaatttta tataaaccca tgttaatttc ctctcaacct attttttact 11401 ttcttatcta tgatgttaaa aggttatcta atgtgctcca tctccctaac gactgaatca 11461 gttgatgtta ctagtattcc aaagtgatag tttttttgta atgaattgac ttgctttttt 11521 ttaattttta tacttttttt ttaaaacaag actttcaaat aacaaagaca ggcaaaagat 11581 gagtcttcct cctcttcctc ttgcctgtct gattttctag tacagctcta ttcaaaaagt 11641 tttggaaaaa gcaatttaaa caaggttatt tggagttaca tcagtttgag ctatgggtaa 11701 atctccttgg gtccgaacat ctaactcttc ccgacgaatt gtttcttgtg cttcaaccgt 11761 ctctgtctca actacttttt tgatgcggac ttcttcacgt acaaacgcct ctttacgaat 11821 ctcaggggtt tcttcgtaga tttctatacg cgctacttcc ccttcttgaa acttaagttc 11881 actgggattt actactgtcc ctgcatttgt aggattaact cgctcaatca caactcgttc 11941 tctttcaatt ggtactgaaa cacttgctgt ttctgtttca atgtgtttac caactgttac 12001 ttctcctgtc ttgatacggc ttttattggc aatcagccgc tcttcgtata gtttgaaagt 12061 ttgatggttc tggtcattca agttgtacag ggatggatcc tgttggtagt tgtaggtatc 12121 gcgatcataa gtcaaaccac tagagcttgg acgataagtc ttacgtagct gctcttcata 12181 atcgtaatca acagttatgt cgtctttgta cacaggcaaa ctttgcactt gctctttgct 12241 cagcccatcg acataaacgc gccgctgatt ataatcaatc tgggatagac caattggtag 12301 caatattctt ttatgagaag aattgtattg agtatctata actaaataac gaaaacgacc 12361 atctgcatca actagcgcat cggcaactga gccaactctg actccacctt ctgtatataa 12421 gtctaaagct ttgatatcgt cgccaccaaa agtttctcga taatttggat caaagtctgc 12481 aagtttatag agaggcataa ttttgttacc acattaagaa aagtgtgtat ccttttttaa 12541 actaagaaca aattgaatct ttgtcatctt gctggcggct taatttatct aattaataat 12601 cagtcattag tcatcactca ttaattagta gactattgat gaaagggtaa atagctgtat 12661 atatattccc aaagataggc tgataatcga aatgtaactc gttatgatta aaactaaata 12721 tgtctatggt aagatacaaa agttatgttc taatcattaa gacataactt tagagattca 12781 aaagatagag agctacaatc tattgtgatg aaaactacta atctaaaata taaaacttga 12841 aaaaaatgtc tggtcaagct tggctatcgc agctcaaaca acataaggta attgcagttg 12901 tccgcgcccc agaagtttcc ttgacacgcc aaatggcttt ggctgtggca tctgggggaa 
12961 tacaattaat tgaaattacc tggaatagtg ctggtgcgac tgaacttatt acacaactaa 13021 gagtagaatt acctaattgt actattggaa ctggtacgct gctaaattta caacaaatgc 13081 aggaagcaat tgcagcgggg gcgcaatttc tcttcactcc ccacgttgat ccagtgatga 13141 ttcaagcagc agtcgatata ggcgtaccga ttataccagg agcactctcc ccaacggaaa 13201 tagtgactgc ttggtcttgt ggggcaagct gtgtcaaggt gtttcctgta gaagcggtgg 13261 gaggagtcag ttatatcaga agtttgcgag gacctttggg tcacattccc ttgattccga 13321 cagggggtgt gactttagaa aatgccaagg aatttttaca agcaggggcg atcgcagtcg 13381 gtttgagtag tcaattgttt cccaaagagt ttgtagatac acaaaactgg caggcgatcg 13441 cccaaaaagc agcaagcttg atgcaaaaaa tgagttagtc gtgttatgtc cttgcaaagg 13501 gctgacttca agtatgcctc cttatctaaa ctttggcgga tattgtagct gatgcatcaa 13561 tttgtctaaa attaaacaaa acagcagtat tttgatgcaa taaaatagga attataggag 13621 aaagcgaaaa ctttccccaa acggctgtta tttgagtgta tctatgatga acagtctaac 13681 ttctcctata tcgatgcgtc ccttgaagct ctttataact ggaatggcac tgactttaaa 13741 cgtttttttg agtgcttcaa atcaggtttg ggcacaatca aaaagccaga cgattaaagg 13801 acaagtagta gtgctacaac aatctcagca gcgttggatt caaattgatc tttcaagcca 13861 acgcttaaca gcttgggagg gtgacaaacc tgtctataca tttattattt cgacaggtaa 13921 gaaatccacc ccaactccta ctggtgtttt tcaaattcaa tccaagcaca agtctgcccg 13981 aatgcaaggt gaggactatg acattcccga cgttccatat actatgtatt acagcggaag 14041 ttatggaatt catggtgctt actggcataa aaaatttggg acaccaataa gtcacggctg 14101 tataaatgtc gctcccaaaa aggctaagtt gctgtttaac tgggcatcga taggaacgcc 14161 tgtggtagtg caaaggtagg aactgggtta tacgtgcgca gtgatacctc accccgccct 14221 cacgggcacc cctctccttt gttcgtacgt atacctcacc ccgccctcac gggcacccct 14281 ctccttatta aggagagggg ataaggttga ggtcactgat gcacacttta gtataaaagt 14341 ttactaaaag ccatgtttat cagttatcag ttatcagtta tcagtttcac tgttcactga 14401 tttaactcaa ctgtgccaca acttgattct ccaaaagcgt ttcacttgtc aacaagtcct 14461 caactgtgat tcgtaaaaag cgttgttcat caacttgaaa gccgacttta atgcgatcgc 14521 tccccggaaa ccctggtggt gtcagttggg caattgatct tgccccatct ttatcattaa 14581 gggatttgac gtttgtttga cagctggctg cactgcgagt 
aatcaggcga tcgccatcaa 14641 aataaacttc cgtaccgccc gtatccgcac ccaactcacc cataattaat tcaacgctgg 14701 gctgactctc cacagaagcg cccaaaacta attctactgg ctgagtcatt gggtaaggct 14761 gaccaacttt aataatagga tgccagctat ggcatttgtt gcgacggtcc cagtaacgga 14821 taccatagct atggtagaga aaatctttca cctccacgcc ttgagcaact tgtaaagcac 14881 cttgagcaat agcttcaaaa ggacgctcac aacgaatttt ttcaggttca aaatactgct 14941 taacccatgt ttgtactgcg ctaagttgca cagttccacc gacaagtaaa acagcgttaa 15001 tatctgaaac ttctatccct tggcgtcgtg cttgctgcaa cagagaagtc atcgactcat 15061 caagtcgttc aaaaaatgag tgttctgtaa ggatagtgtt tagagtatcg cggttgagat 15121 ctaactcata actctcaaac gtctcatcgt tgaaataaac ttcgcttgct tgggtttgag 15181 ttgaaagttg aatctttact ctttctgcga gtcgcgttgt cagaggactc accaccaacc 15241 cttgagtttt ggcaaagtaa tcaactagcc aattatcaat gtcagtccca cccaaattct 15301 gcccagcttt cgccaacacg cgagcggttt ttaccttttg ttttgagtct tcggctaaag 15361 atttattacc ccacttgaga agaaacccta cgggcttttg agttctttgc acggtttgat 15421 ctagccgcac caaagataaa tctaaggttc cgccgccaaa gtcaattacc aagagaattt 15481 cttggtctgc caagccatag cccaaagctg ctgctgttgg ttcatccagc atccgcacct 15541 gttcaacgtt gagggtttgg caaacttgtc ccagccagtg gcgataagcc tcaaagctat 15601 ctacaggtac agttaacacc agagaatcta ttcccccttg cagtggtgcc aattgctcta 15661 tgatttgaga gaggaacaat tgtcctacct gctcaaaagt gacaatttgt ccatccaact 15721 ctggtaagaa accttggatg tctgcaccaa taccgcgttt aaagctgcgg aaaaatcgcg 15781 ggtcattttt gaggtcaaaa ccgcgatcgc gcacctgttg acccaataac acttgcccct 15841 ttgaggcgtc ttcaacataa accaagctag gaatgagtgg cggattgaga ctttgttgaa 15901 cagataaacc aggtatgttg agcgtttctg gttgttgggt aacaggattc caacgagcga 15961 tcacagtgtt gcttgtacca aaatctattg ctattgccat aattttttaa tcatctcacg 16021 caaagaagca aagcatgaga ttattttctc atctcaaacc tccgcaaggg gacttctgcc 16081 ataagcacac tagccttggc ggtagtggcg tcggcacgat tagtttttct ctggcttctc 16141 tcgttcctat gctctgcatg ggaatgccta aaagtgggct gctgcctccc aatgattata 16201 ttgaggcagc agctatgctg ccacctctcg ttagtgttgc aagggagcgg aatttattgc 16261 agccaccaag agcgtaaatg 
ggcgatcgcc tcttccttgt gtttcgcttc gacttctatc 16321 caaggcgctt ctcggtaaac actgggcatt gcgttaataa actcactatg ttttctatcc 16381 cgaaaagctt cttcaccgtt ggaaatgtgg actaactgcc agtctgggtt agcccaagtt 16441 tctcttgcag cataaaacat ttgggcaaca cttggatcgt cgtaactgtc taaactttca 16501 tgacagatat ggtgatgagc atcaaatacc aatggtatgc cagcccgttg acaaacttct 16561 aaaatttcac tcgcactgta ggcgtattcg tcattttcca aagtcaagcg gcttttgatg 16621 ttttctggta attcagaaat aacatccacc agttgatccc gacgttgaga tttgccacca 16681 tgaatgttca ttaacgacca agaagaacgc ggtaaaccta gtaaatctag cgtgcgggcg 16741 tgtcgatcta aaattttaat acttaatgcc accacttgcg gtgaatcaga actgaggacg 16801 acgtattgat ctgggtgcaa taccattcgg atacctaatt ttagcgctct ttgcccgatt 16861 ttggctaaat ctgcgctcat cttctctaga acagttgcgc cgacttgatc ctctaagtca 16921 ctcatcggga ataaaccaga ggacattcga taaagccgta tagaattgcg cacacagaaa 16981 gttaacgcca aatccaagcg ctgtaagttc tcccgataaa tgacttttaa ggcagtttca 17041 cgttgttcct ctgggagttc taagtagcgc ttacgcgtta ttgtccggaa gcgcacgtct 17101 ttggaaaatg taatgcaaac caatcctaag tgcacaacag cagttttttg caactgctgt 17161 gcaggtgaat tgttaattgc aacccctgat ggggatttcg actcccgcac tgctgacata 17221 tgacatattg ctatcacttt cttgatttaa ctcatttctg cccaggaatg atgagtatcc 17281 aaaggcagaa gtcacccgtt acagagttga gtttaaacgt gcaaaaatct tttttccgct 17341 tgtcttgctt gcgatcttct caggatccaa aagcttcttt cgtagagacg ataaaacaag 17401 gttgatcgag catatctgtt tacacaaaga ttaatagcga aaaattaatg attattaagt 17461 tatttatcca aaaataaatc aatatttaca ttgaaggatg gaaaatctat tgacaaaaag 17521 tacatctaga ttatgttatc ctcacaacat gaggatatgt accttttggc agttatgtca 17581 agtaccttaa agattgttaa atatatgaaa gaaatataaa gctcaacagc cagagtcaga 17641 aaaactttca gctgagggaa caataacacc tgaaatttac tagtggctgg tatccgccgc 17701 tggtgcatcc tctaaccatc agattcttca ccaaggtcga aaatcatgaa aaatcggcac 17761 tcatcagagt ctaagcaatt tgtcggaaat ataaagaatg gaatttgggt gtttggtata 17821 tcgtcgtggc tatttggtat aactgatcgc agtattgctt cgttttccga tggttatctg 17881 tctgctttgg atctaacgca actatttaca gctgcaatct tccttgtggc ttggtggttt 17941 
ctcaagccta catctagggt atagttccat tttctatgtt tgatttggaa tgggatgcac 18001 tcttttgcag cgataaaact aaaataggat actggaactt tagtcatgat tggtgcgatt 18061 tttctcaaaa aaagtcaagg ttgaatgtct gatctcgaca aaatccattt tctaccgatt 18121 gcgctcgcat acgccaaaac cttatttgtc aatacctttg actttttcta ttgtcactag 18181 aattaatgac aatagaaaag gttttgtgcg actggttcta acttttgcca aataacatga 18241 gttaagttat ttgatttgga ttttgttgag gttagaggat gtttgggaag catcgttttg 18301 taccaaaatt tatccgggac tcccctaaat ccccgatgtc ttgggaggat cccaattttc 18361 tgaacatatt gcgagctttc cggctcgcac tgcatttggc agaaatttga aataaagttc 18421 tttttacgac aatacagtta ctgagaagag actttgtaat aacgataacc aatgctagag 18481 acgttgccct tcgggtatct cctgcggaga cgctgcgcgt tcgccctctg ggcgtgcgct 18541 tgcgcttacg gcagttgcta actcctgcgg agacgctgcg cgaacaagtc ggggaacccg 18601 ctgttagcac ctgcctcaca tgcaacgtct ctacatgtgt gagaccaagt tgcaattccc 18661 agatagcaca agtcaattgc aagtgaaacg tagaccccga gggggcttcc ggctttcttc 18721 gccatctctt tgccagctac tacaacatca tgcaacttga tatgaaatct ccaaaaactt 18781 tttaagctct tccgtaaaga ttcacgctgc cattgtattt agttaaagtc gctgaagaat 18841 ttgtcaatca tgaataatga ataaattaga ttcgcaagct ctgcaaagaa aaatcctgat 18901 tttggtaaat aaggtaattc ctgtaagtag gattttctga aaaatcatta gtgttttagc 18961 agtttacatt tggatgtaac accttattgc tgaaaaaaat gtgtatttta cccaagtgat 19021 aactgccata attattatcc aaacatcagg gaaatctaaa attattcaca attcctaatt 19081 cttcagtttg tcttgctaaa atgttgcaga tattcaactt caatagattc aagaaacgtc 19141 aaatgagaca tcaaaaacta tctcctgggc tgctattagc atttgaagac tatcaacggg 19201 aaggacaaga agctttaatt ccgcaggtca gaatgctgag tatagttccc ccgaaaaaca 19261 atctcaagcc aatccgtagc atcgttttta tctactgtga tgaaaatgca gacttgagtc 19321 atttatcaca acacggtatt gaagtcaacc aaaatagagg acacgtgcgg acggcgttct 19381 tacctgtaca aagtttagac gccttatctg atgaccctgc tattgagcgc atcaagccat 19441 cgcgcaaact taaattgcac atggacgttg ccaaaagcac agtacacata ccggagttta 19501 gaaagaaaaa tagtaacctt actggcaagg gagtgattat cggtattatc gacagcggca 19561 ttgatccgaa gcaccccgcc tttaaaggac gcattttacg catctgggac 
caaacactgt 19621 ctggttctgg agtgatagaa ggtaaatatg gagcggaatt aactggctcg ctactgacag 19681 tttcccaaga tacaaacggt cacgggactc acgttgctgg aattgctgct ggtatggatg 19741 ctacctatgg tggtgtcgca ccagaggcag aattgctcat cgtcaaaagc gacttggatg 19801 aaggtcacat tgccgatgct gtccgctaca tcttccgcgt cgcccgagaa ttgggacgtc 19861 cagccgttgt gaatatgagt ttggggggac actttgaccc tcatgatgga agtgactcgc 19921 tatcaaaagt tattgactct gaaactggtc ccggacgcat agtttgctgt gctgctggta 19981 acgagggcaa cgataacatt cacgctcaag caattgttcc tcctggtaaa actcatacca 20041 tgcgcttcaa cgtaccatta aatcaagcca gcatagcaac gttaaacagt tggtactcta 20101 gtgcaggtca attagaagtg tctttgcgta gtcccaacgg tttcgttacc ccgttccaac 20161 cagtcattgc tgatggcaac tatataaaag aatacacctt acaagattcg caggtgcaag 20221 ttgcaacacc aaaacgcgat ccaggcaacg gcgattataa cgttttagta caaattcgcg 20281 gtaaagggaa aggtaattat actccacctg tccagggtgg aatttggcag ttgcggttca 20341 aaaacacttc agccaaagat gtgcggttgg atgtgtggac attagatggg tcaagcttat 20401 tcaccggtca aagtatcgct gactcagtga aaattggttc acctggatgt gccagcagtg 20461 caattacagt tgctgcttat acaaccaaag agaagtacac tgacatagac aataaactcg 20521 aagaaatggg cttctctttg aatacaattt ctgatttcag cagtgaagga cctctgcgga 20581 atgatgctaa aaaaccggat gtcgcagcac caggagcgat gattgtttct actctttcct 20641 ccaatgccaa ttctgatcgc tcaatgatca ttaattccaa gttcatggcg ttagctggta 20701 cgagtatggc tgcacccttc atcagtggct tagtcgcact gctgttgcag cgtaacccca 20761 agcttgaccc agttgctatt aaagaactgc tacgtaaaaa tagttccata ccaaaaaaac 20821 cacctggaac ctttgatagc aaatggggtt atggactgat tgatgcacaa aatctgtagt 20881 tggacgtaca gataagtgca ttaagtctgt ataaggatag gtaatgataa cctaataatt 20941 tgataggttg ataatagggt aagctatttt gcttgccctt gttgacttac tggaaggttt 21001 ttatgaatta tcaaaagcta gatacggccc ttgcagtagc gctcaatgac gttaataact 21061 cacaagagcc tagtttaaaa gtttttattc atactcaaaa tgacgcaaat tatacggaga 21121 caactgccgt attagaaaat ttaggcgttg ctgatgtgac tcctgaaaag gatgttttca 21181 ccgcaaccct atcaccaaat gcaatttctc agttgtcgga gcagccttgg gtgcaatata 21241 ttaagctgtc tcaaaatttg catttggtaa 
atcagaaaat taaaggcggt atgaaatttt 21301 ttaaatagca ttaagagtaa tgtaagtacg agcgccgaag acgactgctc cggagcatcg 21361 cgcttttgaa ctcacgactg acggggaaca catctccgtc ttttttatta tgtaaagtta 21421 aaattattgc cagtacctta agttgttttg tatttttaag ctttgattcc ggaaaccgta 21481 gtaatactag tgaagttaaa aacaacaatt tgtgctgaag ctgtgaaaac ctatctgctt 21541 acccttacca gattgttggt catttgaaaa ttttttataa ataccataat cttccctccc 21601 cccctttatt cgcggtctca gctagcccta aactgtaggt gagagttgaa aggcagtcgg 21661 gagaaatctt gtgaaaaaag ttttggcaat aatccttgga gggggtgcag gtacccgcct 21721 ttatccactc accaaattac gcgctaaacc agccgtacca gtggctggga aatatcgttt 21781 gatcgatatt ccagtcagta actgtataaa ctcagaaata tttaaaatct atgttctgac 21841 tcaattcaac tctgcttcgt tgaatcgtca catcgcccgt acctacagct ttgctggctt 21901 cacagagggt tttgtagaag ttctagctgc acagcaaacc ccagatagcc tcaactggtt 21961 ccaaggtaca gctgatgcag ttcgcaagta cttgtggttg atagaagagt gggacgtaga 22021 tgaatatctt attctttctg gtgaccactt gtaccgcatg gactaccgtc agttcgtaca 22081 ccgtcacagg gaaactgaag ctgatataac tctctcagtg ctaccaattg atgaacgccg 22141 tgcctctgac tttggcttgg tgaaaattga tgagtctggt aggataatca atttcagcga 22201 aaaacccaaa ggtgatgctt taaaggggat gcgcgttgat acaactgtac tgggattaac 22261 tccagaacag gctcaagaac agccccacat tgcctcaatg gggatttatg tgtttaaaaa 22321 agaagttttg attaagctgt tgaaagaagc ttcagaacgg acagattttg gaaaagaaat 22381 tattcccgat gccgcaaaca attacaacgt tcaagcctac ttatacgatg gatactggga 22441 agatatcgga acaatcgagg cattttatga tgcaaattta gcactgacca aacaacctca 22501 gccgtcgttt agcttctacg atgaagaagc gccaatttat actcgatctc gttacctacc 22561 tcccagtaag atcttagatt cccaaatcac agaatcaatg attagtgaag gttgcattct 22621 gaaaaattgc cggattgaac attcggtgtt gggagtgcga tcgcgcgttg aagctggctg 22681 cgtcatccaa gattccctaa tcatgggagc agatttctat cagcctttcg ctgaacgcca 22741 gtccgactgc gaaactaagg gagtttcttt aggcattggt gctaacacta caattcgtcg 22801 tgccatcatt gataaaaatg ctcgcatagg ttgtgatgtg caaattatca ataaagataa 22861 tgtccaagaa gctgagcgcg agaatcaagg attctatatc aaaagtggta tcgttgtcgt 22921 gctaaaaaat 
gccatcattc ccgatggaac cattatttag tcattagtca ttagtcatta 22981 gtcaacaatt aggggggtca cgggggtaag ggaataag // LOCUS NODE_1367_length_22984_cov_5.153517 22984 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 22984) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 22984) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..22984 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(11..1243) /locus_tag="DP116_12280" CDS complement(11..1243) /locus_tag="DP116_12280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868763.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12280" /translation="MLRRLSVTITATATFCLFCNSLTLAASKKPQQPDKFPPNPLEIT TPDPLLPRSPKDKQPLTPQESQDLQAALEKLNQQAAATLQAGDKITAFEIWNRELRLR RFLGSLAEVQALSRVGEIAWRENDRMEVRYITQRLQLIQKQAKSQKTVDLQLLQALGQ AYQQVRSPKVALPIYDQVLTVVRQQGDTAAEVDTLKTIGELHLSWFDYASAAATYEKL LTFASSKSESVNEIAYLQQLAYIYKQLKQPEQSINIRNKLAEVYQRENNLIQLAALKE SIGSDYESLARENPSFLQEAFKNYKEAYTIAWQLQQYVRVSEALQKLIALYRSQGQVE EALQASQILLQAQEQAVNFYGLMNAYDQIGQIHLQRKDYPQALTAFQKGLQVAQELNY DQTYFSQQIQKISGQTSR" gene 1526..2128 /locus_tag="DP116_12285" CDS 1526..2128 /locus_tag="DP116_12285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999463.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chorismate-binding protein" /protein_id="PRJNA477356:DP116_12285" /translation="MTHSTDIATLARWMAADFSNQEQAIENPPFYAHIRVCMRPVSLG LSSGVSLFLEQAYDYMLNNPYRLRVLNLRNAQNHIILENYTVKEEQRFYGASRNLERL KTLSADDVEIMSGCNMIVEWTGNSFKGRVEPGKGCIVFRDGKKTYLDNEFEVDEEKLI SLDRGRDIETDEHAWGSIAGPFYFVRRANFADEVKLTPES" gene 2402..2473 /locus_tag="DP116_12290" tRNA 2402..2473 /locus_tag="DP116_12290" /product="tRNA-Cys" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:2434..2436,aa:Cys,seq:gca) gene complement(2784..3287) /locus_tag="DP116_12295" CDS complement(2784..3287) /locus_tag="DP116_12295" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12295" /translation="MKKSDSQFKKIIDFLTTASVFFGTLNALVINLTEFLKNGQNFLS IINPIAYYIGLSEICVLTSFFILCCFLFVLSPDTFIILTEADSSVLSQISIFLTFFAI VIGIFLIVDVPSHFELLSNFFHHLNLPTRDNIIKAVNPIIFEVCGVATLLIFVYFAYR SYLNETE" gene complement(3491..4708) /locus_tag="DP116_12300" CDS complement(3491..4708) /locus_tag="DP116_12300" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12300" /translation="MVNEREPSPQEQLLNKKASYSEKTHRIFALSQFLKSISPIIWTL VILIVVISLWGKISIGSIGGKYISKSSGDISNGRVVITVPSIPPQINSDLVLALKNAR ASSETYASEKLDKWIAQLRERVDNDFLNWYFNYFTQLDIGFRAIITDFSSLITRGFNP NQSTPEAQKAEKLTEEFQREFTRRVLKPEIAQIKLERFTRETINAYVSELSEQLAGIQ SEYKIPQADWERYLSGVSTTILDSAGNNQDLSLRVLSRGTEYLVALPLSKATVKLGSG LATKFAENAAAKAAAKTTAKMTTKISTKAAGKVASEFSIPTLSTIGLELIDPLAAVGI LAWDVWDNYHTAQVERPKMREAILEYLDEMKQVLLYNPHDSIMSTIYSFEGGLIDKLA EKKLNHNAVFVQP" gene complement(4848..6152) /locus_tag="DP116_12305" CDS complement(4848..6152) /locus_tag="DP116_12305" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12305" /translation="MNHEQDNEHKPEVVDETATSSIKTTPKSEKPELKTNLENEQTTE VVNETATTSVDINLESGKPPSVWSNFITALSQFFTAYRQFYNRIKPVFILLWIIVIIT IFMPLLGKVVTAYSFSRRIEAPKQQYHSENFLNQQRDKNLVLLSQTTQNNSQINQAIL DTLNISHTKAENFASERLDFWIELLQNRVDNDFLNWYFNWFNKKWREDSAFILGIFGQ DIAKRDIENYTKEFSQRVLSPAESQAKFKELAEKTVNVYVSDLNYRLSKVRLQYDIPQ VKWDKYLNSITFNIPGEESNIALKEVLAIGGYKILAKPIIIPAAKVGTVVVIESVEKT LGVLGVKAVGKLGATVVAETVAKVLDPLAIVGLGAWDYFSYRGEVATQKPALREQILE SFQEMKNSLIYDSQTGILTVIDQVEKNLRNSISSSSISQLST" gene 6324..7328 /locus_tag="DP116_12310" CDS 6324..7328 /locus_tag="DP116_12310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496597.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="PRJNA477356:DP116_12310" /translation="MNQQEFEKIFEHLSPRRKEVLRRILAGETDAVIAKAMGIGESTV RKYIERICQEFGLENEHSDGRRYKRSDLVALFAKYKPELLQQRAFTSTNLSENFLLQL FSANIPVQEYLEKQLQLSDDEEKTQTAKSLNKIGHHDYLNGDFKSAVCYLKWAITFNP DFAEAHYNLGAAYEKLEELSSAYHHYEIAMKYSNRAADAAINNLARLSILKGNSAAAV EMIQPILSRVQDSMVKASLHKNLGWAYFEQKLYKQAKKHLLMSLKLDSDRPLTHCLLA KVQEAQGEKQSALESWKDCLKSDYDNQQLKGQDCKLPELNFWQLEARRVLDDEIDLGN " gene complement(7384..7626) /locus_tag="DP116_12315" CDS complement(7384..7626) /locus_tag="DP116_12315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015195691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12315" /translation="MPVTLVQAFGWDPSNAAYKVLIQSRNGNQYFVWYDNLIGAKVGS VITLTYEGSYSSLWFYKLINTGNGKESNIRRYLRAN" gene 8378..8992 /locus_tag="DP116_12320" CDS 8378..8992 /locus_tag="DP116_12320" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12320" /translation="MNSSSDQLCQECTEVENVLEIDIKRLLKDLATYKRGKELTKQEK LYLCLSFLGNEPIDIARIENYQRLYVEQKSENPSLTEEKIHKLVEQKLRNRAHDISHY LSQSINQYILGLISTFDNSISDKNPRPSWFKIFFLLKANGYKKVAASSQKPSSLKRIV IESEDENNLQNIIELIKMVNQKFGNGSLSIEEIESEGDEDNEQK" gene 8970..9767 /locus_tag="DP116_12325" CDS 8970..9767 /locus_tag="DP116_12325" /inference="COORDINATES: protein motif:HMM:PF08852.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12325" /translation="MKIMNKNKTRQKLVLSGSSEAIEYLQTLFQSGELSKLLDVNVLN ISITSEKTPTEVSLVNLSQCLQKNFVTAIAAGFEVIQDILEPPQLVLGYQRRSTRSST SGESDEYIRLQSAQRLLETNPDNSTAIATLFEIIRTTQEEEVRWRAIESLPKNARHRL TDVIGLKKEELRLANHPIKLTVSAIKISDEEVSVFIKLYPAGEQTTLPIGIKLIVLDE SGKIFDQVPNEDEEYDEIKYKLICNLGEIFSVRVALGSDSITESFSF" gene 9786..13484 /locus_tag="DP116_12330" CDS 9786..13484 /locus_tag="DP116_12330" /inference="COORDINATES: protein motif:HMM:PF05729.10,HMM:PF12770.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12330" /translation="MSDLLVLKFRYESDNQNLLVNFVIESNGELRDTEIRGTLSSADN LLKIQNQWIEAYRSFLKEKEIDRMKKVEKQSIAIIKTTFIDCQKAAYELENTFHNWLN SEQFHPFKQELEQILKSQKTSEENRVLIQTNQPWLWKLPLHRWEIFQRYSCEVGLSLT ENKRVERPLPPKPRAKVRILAIIGDSTKIDTKVDRELLEKIPDAEIIWSQQPQRRDLY QKLWNPHGLDIFFFAGHSKSEANGDSGRISINENDSLSIGELKNALRFAIRLGLKLAI FNSCDGLGLARELADLYIPHLIVFRESVDDKVAREFLQHFLDAYSSDNSLYTSVRLAR EKLQTLEEDIPCASWLPVICQNPSDAPLTWQKLRGSQTEATWRACCQVMLALSTYKRL LNNPLTDKNELGLELEEIYVPLGLVERRKQDRRIGDVSPEQGSQFYQQEPEYEITKTF KPQEFFEQVLHQGQSPKSQGKRLVIIGEPGAGKTTLLQKIGDWVLENTIEDVPIWISL ADLQRGQRLEGYLHQNWLELAMQTVRVPGATSSTLVQMFQSGRVWLLLDGVDEYTANS VNPILELVNQLTDWVQRARVVLTCRLNLWEANKNALETFDTYRTLDFSYGNPQTSDQV GLFIRRWFIANSESGRRLRSHLDEPGRSRIKNLVKNPLRLSLLCRTWQWQQGKLPETR AELYRQFTEAVYKWKQGIFPTTKATEEELNAALGRLARRSLDLEASWFRLPHHLVVEE LGEFDAPLFQLALQLGWLNQVGVAAQNPEEPVYAFFHPTFQEYFAASATGNWHDFLQH DNHQPAPMQGIYRIFQPQWKEPMILWLGQSEQEVLKQEKEEFIKALVTFTDGCNNFYW YRAFSLAAAGIAEYPDCSRADLIVEQLVTQAFRHQSVNSPEYSARLSLIADKAKEALK ETDHPRVINALKGIEPIPIYFLVEIATSSPDDIRALAELVQNHQQEDVRLIVARWLLQ VDPGNQDATDTLLNMLRYSRSPWIYHRAARGLPGNSEAISILQEILGRPRHPFDNFEI KNILAEIGVGNPNATDSLAASAPTKNRTFNSLLNLLSKKILIIVYINFSNTKANLLQN RTFRGIFLGLRQSFIFLLRNACNTLVYLHQKANHDENFHLQFIEKLAKFDPGNPKLIL SLVELLRTGQSPSVYKQAAQSLKEILRGQMCLVVVSGLRDCLQKGTSEERYDCCYEVI WHCAQHSAYPDFYQAWHGQPFISDAQRN" gene 13622..13951 /locus_tag="DP116_12335" CDS 13622..13951 /locus_tag="DP116_12335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015180064.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12335" /translation="MAKTPSSTKSESVTLRIPNEVFDQILNYAAANTKGNRSVAIVEL INMGLVVAKQSTQTSLTAQQSMKQKDLSEAVTLLKSQMNQLTQLLQETVVQRLTVLET ELGELNA" gene 14203..14496 /locus_tag="DP116_12340" CDS 14203..14496 /locus_tag="DP116_12340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12340" /translation="MKSVTFLLLLILSFAVAFPALASVCRNWDGHQICILDIKRSAKN YWEYRAAVSIDGVKTPIEVYNCRSKVKVQQDGTALPFEHNDPGELICSFFKKS" gene 14872..15594 /locus_tag="DP116_12345" /pseudo CDS 14872..15594 /locus_tag="DP116_12345" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: GeneMarkS+." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 15526..15535 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 15582..16571 /locus_tag="DP116_12350" CDS 15582..16571 /locus_tag="DP116_12350" /inference="COORDINATES: protein motif:HMM:PF13103.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12350" /translation="MPTVVAPKPKPLALKLRQRTVTPETIPPEPLPSPTAVAPTTKPL ALKLRQRTVTPETTPPEPEPLPTTVAPKPKPLALKPRQTLVTPETTQPVPLPSPTAVA PTSKPLALKPQQTVVTPETTQPVPLTRRENSSRLATKSSPEQFENRTNSLVSPETQRI APSRSQSQSTTSPRKLSRSQASSKSGGASSLGGPMSLSSRDFGSNNLAALPNSNRLNQ GTQGIDARQDVDMSAYLQQLQEQVKQQWIPGLTQSSQQTVLSFIVSRAGLVSNLQVVQ SSGLTMTDEAALNAVNRAVPFAPFPTEYPQDYIKIQFTFNINVYGQLELWSDQ" gene 16714..17091 /locus_tag="DP116_12355" CDS 16714..17091 /locus_tag="DP116_12355" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12355" /translation="MSTNVVTHPALANEENVSPVITNQGQAAALVTQPVTINSGTVSV TDHTNNLRKVSDSIGVVRESQCKKINPLELIKSPGNTLKQCLEETNKQADQINQTSQP AEQFEYFKVPKLESGVNVTVTKF" gene 17202..18233 /locus_tag="DP116_12360" CDS 17202..18233 /locus_tag="DP116_12360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_12360" /translation="MAWVAGDRLQGGKYTIERELGRGRFGITYLVKNSNSDRLVVKTL NDDLLRPLTQQERERLETMYLQEALKLQKCKHKHIVQVTEVFKEGEHSCLVMEYVDGD SLADLRPPILSEADALRYIQQIGEALIVVHQNELIHRDVHPGNILLRNREGQLEAVLI DFGLALDFDHILTTSRTKETSDGFTPPELYTKRTIARAYSDIYSLAATLYKLLTGRTP VNAVKRKVDGEHLVSPKEFNPQISDRVNGAILTGMKLDHKERSQSMREWLDSLGLSGE TPQPVSTVLPNPNPNREKKINWTLVIAAIAAIGALLSGIAALIPILKQSPSPSPSSLS PSPSSTKTP" gene 18308..19618 /locus_tag="DP116_12365" CDS 18308..19618 /locus_tag="DP116_12365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410624.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_12365" /translation="MPWTAGQRLQGGKYVIGEVLGQGGFGITYKALHVELNQTVVIKT PNEHLRHDPEYDKYIDRFIKEGQILARLSEEPHPHIVRVIDLFKEGAIHCLVMDFVPG ENLFEAVRRRGALPEAEIVPCIRQIGEALMMVHQAGLVHRDAHPGNIMLRNNGKAVLI DFGIAKELLPKTLSSTGNAGNHGFAPYEQITRGSREPTVDIYCLAATLYYAVTGQNPT TSLARKLDDASLTPPKQIIPDISEQLNYAILKGMALEAEDRPQSMQEWLTMLEALKAS PPPLVEPVHKIEVVRRQFDESKSKPATKSPRIIAWGWLVGVLLNYTNISYFLAALNAP RIIWAVAVAWVVAWAGLAAVAVAWAGLVAVAVAVAGVVAVAVAGALSGVVAVAWAGAL ATVWAGKKLQKSFSKVHTFLILASTSSLGLGLGLLGHRIFHTGS" gene 20298..21242 /locus_tag="DP116_12370" CDS 20298..21242 /locus_tag="DP116_12370" /inference="COORDINATES: protein motif:HMM:PF00353.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12370" /translation="MVVIYGATGNDYLKGTQDNDFIYGNLGNDVLIGLAGDDFLEGKE GNDVLIGGDGNDILYAGLSKVPYAPGYGGDLNSVNLLYGGKGDDQLFGSLGKDFLFGG DGNDRIIGLSGDDYLDGGDGNDRLDGGFGNDILIGGNGDDILYDGSYKTGGGNDILLG GNGDDILNSGSGNDILIGGKGNDILTGAGYIPTGSPGGIYGVDEIDILIGGEGRDTFN LGGPNAAGNFSVYYDDGNKLTTGENDYALIADFNPSEDIIQLVGTASDYILGSSPAGL PNGVTINLKKPDSELNELIAIVPGVSGLSLDSSYFRFI" gene complement(21381..21809) /locus_tag="DP116_12375" CDS complement(21381..21809) /locus_tag="DP116_12375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994741.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12375" /translation="MSRLLYENSVSYKGYLIIPFVFGTIDSQDIYSYKLLSEIANQSQ FHKAENPAEIYGTSIENIVDIAKEHIDEYSDFVSHCDNFKFRYVYRNNLIIVFQEAGK YFYDHYPSDSLNNIAAPKLFPSEYDCLSWIKQGMDGLHTR" gene complement(22059..22925) /locus_tag="DP116_12380" CDS complement(22059..22925) /locus_tag="DP116_12380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF72 domain-containing protein" /protein_id="PRJNA477356:DP116_12380" /translation="MNFFIGCAVWAYKGWVGELYPQGTRQSDFLQLYSRRFTTVEGNT TFYAVPNQETITRWATQTPQNFEFCLKLPRDVTHNGRLQPQIPDALKFLEGMRLLGKR LGPVFAQLPPSYPPTLIDDLGVFLEALQQTDVRLALEVRHQDWFTEPLAGELTALLER LGVGRVLLDSRPIYTGDDDPQIQSERRKPKVPVQFSVTASVLAPFSLIRFISHPNLSV NQPFMEEWVTQIKQWLQQGTQIYFFVHCPVEEHSPGNARHFQKLLEQSGVAVPPLPWN ILDNPPNQLSLW" BASE COUNT 7000 a 4607 c 4784 g 6583 t 10 others ORIGIN 1 ctccccctcc ctaccgcgaa gtttgcccag atattttttg aatttgctgg ctgaagtacg 61 tttggtcata gttcaattct tgcgctactt gcagcccttt ttgaaaagcg gttaatgctt 121 gtggatagtc tttgcgttgc aaatgaattt gcccaatttg gtcataagca ttcattaaac 181 cgtaaaaatt gacagcttgc tcttgtgcct gcaagagaat ttggctggct tgcaaagctt 241 cttccacttg tccttgagaa cgatacaacg caattaactt ttgcaaagct tcactaacac 301 gaacatattg ctgtaattgc caagctatgg tataagcttc tttatagttc ttgaaagctt 361 cttgtagaaa gctgggattt tctcgtgcca aagattcgta gtctgagcca atagattcct 421 ttaatgctgc taactgaatg aggttgtttt cgcgttggta gacttctgct aacttattac 481 gtatatttat tgattgctct ggttgtttta attgcttgta aatgtaagcg agttgttgta 541 aatatgctat ttcattgaca gactcgcttt tagaagaagc aaaagtgagc aatttctcat 601 aagtggctgc ggctgatgcg taatcaaacc aactcagatg taattctcca attgtcttta 661 gggtgtctac ttctgctgca gtatctcctt gttgtcgcac gactgtcaaa acttggtcat 721 aaattggtag ggcaacttta ggcgatcgca cttgttggta agcttgacct aaagcttgca 781 acagttgtaa atctaccgtc ttttgggatt tcgcctgttt ttggatcagt tgcaatcgct 
841 gtgtgatgta gcgcacttcc atgcgatcgt tttctctcca agcaatttca cccactcgcg 901 acagtgcttg cacctctgct aatgaaccta aaaagcgacg caagcgtaac tctcggttcc 961 aaatctcgaa tgctgttatc ttgtctcctg cttgtagtgt agctgcagct tgttgattga 1021 gcttctctag cgccgcctgc aaatcttgac tctcttgcgg ggtcagcggc tgtttatctt 1081 ttggtgaacg tggtagtagt ggatcaggcg tggtaatctc taacggatta gggggaaatt 1141 tatctggctg ttgaggcttt ttgctcgctg ccaatgttag cgaattgcaa aacagacaga 1201 atgtagcagt cgcagtaatg gtaacactta agcgtcgtag catggatatt ttaccttttc 1261 gtcggttcac aaagggacaa ggcagataaa tcaattcaaa atcgcgtagc gttgatcgcg 1321 aagcgacgca atctcaaaat tcaaaatttg caatctcccc ctacttgacg atagtcagca 1381 aaaaattctc ctgagaaaac agtcaactac aagctataca atacttttta atacaattta 1441 tgtttatcca gctacacatt ggtagctttg aataccaagt agcctggttg acccaatcca 1501 aaatccagaa ttcaagattg gtataatgac tcattctact gatattgcca cgttagcccg 1561 ttggatggca gcagatttca gcaatcaaga acaagctata gaaaacccgc ctttttatgc 1621 ccacatccgt gtttgtatgc gtcctgtgtc tttgggatta tcatcaggcg tgagtttgtt 1681 tcttgagcaa gcttatgact acatgctcaa taacccttat cgtttgcggg tgttgaactt 1741 gagaaacgca cagaaccata ttatattgga aaactacact gtaaaagaag aacaacggtt 1801 ctacggtgca tcccgtaacc ttgagcgctt gaaaactctc agcgccgacg atgtggaaat 1861 aatgtcaggc tgcaatatga ttgtcgagtg gacgggtaac agcttcaaag gcagggtaga 1921 gccaggaaaa ggttgtatcg tgtttcgcga tggtaaaaag acttatcttg acaatgaatt 1981 tgaagtcgat gaagaaaaat tgattagcct tgaccgagga cgtgatatag aaactgatga 2041 gcatgcttgg gggtcgatcg ctggaccatt ttactttgtc cgccgggcta attttgcaga 2101 tgaagtaaaa ctcactcctg agtcgtagag ttctaagttt tgagtgagtg caaaagcact 2161 ctaaagaacg gtgagtgttg agtgttgagt tttgaaccac atccaatcac ttggaactta 2221 gtactcaaca ctcctctggt atcctgttgc tgccaaattc aggcgtgact ttaactttca 2281 cagcaaagct tgtaattggt catgagttaa actataatgg attagcactc aatacatata 2341 gctattcaaa agccaatata taaagcagtg caatttgtca agtgcgacta aacgcacaaa 2401 cggcgacata gccaagtggt aaggcagagg tctgcaaaac ctttatcccc cggttcaaat 2461 ccgggtgtcg cctttaagtg ttcagtaact aattatgtgt gactgttgac agattgatgt 2521 
caattttcac tcgtttttca ctcgttgctg tcactgttaa cggattatgt gtcactgctt 2581 gtatattttg agtatcaata ttttctcaga aatgattccg gattgctata ctcaaatctt 2641 gcaccagttg ctttattgca atttttgtag ggtgggcact gtttatcgtt cgctgaaact 2701 cctattttta cgttgtgggc agtgcccacc ttacagttat tgagaatggt gcaagatctc 2761 agtatagaga tttagatagt ttctcattca gtttcattta gatagctacg atatgcgaag 2821 tagacaaaaa ttaacaatgt tgctacaccg caaacctcaa aaataatagg attaacagct 2881 ttaattatgt tatctcttgt tggtaaattc aagtgatgaa aaaaattact taaaagctca 2941 aaatgacttg gcacatctac tatgagaaaa atacctataa caattgcaaa aaaggtcaag 3001 aatatagaaa tttgtgagag cactgaacta tcagcttcag tgagtatgat gaaagtatct 3061 ggagacaaaa cgaaaagaaa acaacacaaa atgaaaaaag acgtgagtac gcaaatttct 3121 gataacccta tgtagtaagc tataggatta ataatactta aaaagttctg cccatttttc 3181 aaaaattcag tcaaattaat aacaagagca ttaagagtac caaagaaaac actagctgtc 3241 gttaaaaaat caattatttt cttgaattga gagtcagact ttttcataag aaaaaatacc 3301 gccaacaaaa tttaattcat gtaatatcgc tcaacaaaat ttaattcatg taatatcgct 3361 gaacccggtt ttctgctcca atgattcttg taccccgctt attattgctc cctttcttaa 3421 aaaagaacga gcctgccatc cgtttttatt ccaatcgtct gaaaaatttg ctagtggtta 3481 aaccacagca ttaaggttga acaaatacag cattatgatt aagtttcttc tctgccagtt 3541 tgtcaattag accaccttca aaactataaa ttgttgacat aatgctgtca tgaggattgt 3601 acagtagaac ctgcttcatt tcatctaaat actccaagat agcttcgcgc atttttggtc 3661 tttctacttg agccgtatgg taattgtccc atacatccca agcaaggata cccacagctg 3721 ctaacgggtc tattaactct aatcctatag ttgaaagagt gggaatcgaa aattcacttg 3781 caaccttacc agctgctttc gttgatattt ttgtagtcat ctttgcagta gtctttgcag 3841 ctgcttttgc tgccgcattt tccgcaaatt tcgtggctaa tccgcttccc agttttactg 3901 tagccttgct taagggtaag gcaactaaat attctgttcc ccgtgataat actctcagtg 3961 acagatcttg gttattacct gcactatcga ggatggttgt tgacactcca ctgagatatc 4021 gctcccagtc tgcttgagga attttatatt cgctttggat tcctgccagt tgttcgctta 4081 gttcagagac ataggcattg atagtctccc tggtaaagcg ttctagttta atttgtgcga 4141 tttctggttt caatacacgc cttgtgaatt ctctttgaaa ttcctcagtc agcttctcag 4201 ccttttgtgc 
ttctggagtt gactgatttg gattgaaacc acgagtgatg agggatgaaa 4261 aatctgtgat tatagctcga aatccaatat caagttgagt aaagtaatta aagtaccaat 4321 ttaagaagtc gttatcaaca cgctctctca attgagcaat ccatttgtct agtttttcag 4381 aagcatacgt ttcagaactg gcacgcgcat tcttgagagc aagaacaaga tcgctgttaa 4441 tttgaggagg aattgatgga acggtaatta ccacccttcc atttgatata tcaccagatg 4501 atttagaaat atattttcct cctatactgc caatagagat ttttccccat agactaatca 4561 caacaatcaa aataaccagt gtccagatta tgggactgat tgatttaaga aactgactca 4621 gtgcaaaaat tcgatgagtt ttttcagaat aagaagcttt cttatttaac agttgctctt 4681 gtggggatgg ttctcgttca ttaaccattt ttattttcct tactatacta tattgcggtc 4741 ttgtagttac agaatattat aaaatagttt cagtggaaac cttggatacc taacttagag 4801 ggtgactttc tgggttaggc atcctgttac taaatcacat cagtgcttca ggtacttaat 4861 tgactaatag aacttgaaga aatggaatta cgaagatttt tttccacttg atcgataacc 4921 gttaaaatgc cagtttgaga atcgtatata agggaatttt tcatttcttg aaaactttct 4981 aaaatctgct cgcgcaaagc tggtttctga gttgccactt ctccccgata actgaagtaa 5041 tcccatgccc caagcccaac gattgcaagt ggatctagaa cttttgctac agtttctgca 5101 acaactgtag caccaagctt acccacagct ttgacaccta atactcctag ggttttttct 5161 acactttcta ttactacaac agtaccaact ttagctgcgg gaataatgat aggttttgca 5221 agaattttat aaccacctat agctaatacc tctttcaaag ctatatttga ctcttcacct 5281 ggaatgttga aagtaatgct gttcaagtac ttatcccact tcacttgagg aatatcatat 5341 tgtaatctca ctttagaaag tctgtagttt agatcggaaa catagacatt gacggttttt 5401 tctgctaact ctttaaactt agcttgtgat tcagccggac taagcacacg ttgagaaaac 5461 tctttggtat aattttcaat atctcgttta gctatatctt gaccaaaaat tcccagaata 5521 aaagcgctgt cttctctcca ctttttgtta aaccagttga aataccagtt taggaaatca 5581 ttatcaactc tgttttgtaa taattctatc cagaagtcta gtctttcaga ggcaaaattt 5641 tctgcttttg tatgggagat gttcagtgta tctagtattg cttgattaat ctgactattg 5701 ttctgcgttg tttggcttaa taagactaag tttttatctc gttgttggtt taaaaaattt 5761 tcggaatgat attgttgctt tggcgcttct atacgccttg aaaatgaata tgcagtaact 5821 actttaccta aaagaggcat gaaaattgtg ataatcacga ttatccaaag cagaataaac 5881 actggtttaa ttcggttgta 
aaactgccta taagctgtga aaaattgact taaagccgtg 5941 ataaaattac tccaaactga aggaggtttt ccgctttcta aatttatgtc aacagaagta 6001 gtagctgttt catttactac ttctgtcgtt tgttcattct ctaagtttgt ttttaattca 6061 ggcttttcac ttttcggggt tgttttaata gaagaagtag ctgtttcatc tactacttct 6121 ggcttgtgtt cattgtcttg ttcgtggttc atatgcattt tgattgtaat tagtgactta 6181 agtctcttaa ttgtgtcact ttttgtgaca cttttttatg gttctcaact ataataatta 6241 ctagcataga agatgacccg ttttttaaaa tttcatataa tgtcacaaat gtggcttaag 6301 attggagcga tttgaagtca aggatgaacc agcaagagtt tgaaaaaata tttgagcacc 6361 tgtcaccccg acgtaaagaa gtgttaagaa gaattttggc aggtgagaca gatgctgtaa 6421 ttgcaaaagc tatgggcatt ggagaatcta cggtcagaaa gtatattgaa agaatttgtc 6481 aggaatttgg gttagaaaat gaacattctg atgggcgtcg ctacaaacgc tctgatttag 6541 ttgccttatt tgctaaatac aaaccagagt tacttcaaca acgtgctttt acgagtacaa 6601 atttatctga aaatttcttg ttacagttat tttcagcaaa tatacctgtt caagaatacc 6661 tggaaaagca gttgcaactc tcggacgatg aagaaaagac tcaaaccgct aaatctttga 6721 ataaaattgg tcaccacgac tacctaaatg gggatttcaa aagtgcagta tgttatttga 6781 aatgggcaat cacttttaac cctgattttg cggaagctca ctacaatctg ggagcagctt 6841 acgaaaaact ggaggaattg tccagtgctt accatcacta cgaaattgct atgaaatata 6901 gcaaccgggc tgctgatgct gcaatcaata atctagctcg tttatcgatt ctcaaaggaa 6961 atagtgctgc agccgttgag atgattcagc caattttgtc gcgggttcaa gatagcatgg 7021 tgaaagcttc attgcataag aaccttggtt gggcgtactt cgagcaaaag ctatacaagc 7081 aagcgaaaaa acatctgctc atgtcactta aattggatag cgatcgccct ctaactcact 7141 gtttattagc aaaagtccaa gaagctcaag gagaaaagca aagtgcctta gaatcatgga 7201 aagactgcct gaaatctgac tacgataacc aacaactaaa aggacaagac tgtaaactgc 7261 cggaattaaa tttttggcaa ttagaagcgc gtcgagtgtt ggatgatgaa atagatctcg 7321 gaaactgaac cgaacccata gctagaagct agaggggttt ggttcagtta tagctgtggg 7381 tacttaatta gctcttaaat atcgacggat gtttgattct tttccattcc cagtgttgat 7441 taatttgtag aaccagaggg aagagtaaga accttcgtat gtgagtgtaa tcaccgaacc 7501 aacctttgcc ccaatcaaat tgtcatacca cacaaagtat tgattgccgt ttctagattg 7561 aatcaagacc ttatatgcag cattcgatgg 
atcccaacca aatgcttgaa cgagcgtaac 7621 gggcattgat tcggcagcat gtgcaatctc ggcagtaaca aatgcagaag ctgtaagcag 7681 cagcccaacg ggaatcgaag cttgccaagt tcctattcct acagcaagaa gcagtgcaag 7741 gaaattgcga tcttttttgg aaaagctacc catcagtttg ttggaattta tattcatcat 7801 ttttgttctc cttgagtaag ttacagaagt tgtttaggtt gtaaaatttt ggctgtggca 7861 gattaattaa ctttttttaa ttaggcggaa caattggatc atcatcatct ccaaacttgg 7921 ggatttctgt agttttatcg ggtaaattac ttatggaaaa aaccaattct ttcaacataa 7981 tgattcaatc agcaacaata tgggagatgt tttctaaata ataaaaactg gaatttgaaa 8041 ttcctaccaa ctactgtttg ccaaaaaaag accaacgagt attggcagta gaagaagtgc 8101 tttatatgta agatttgttg ttatctattc aaaagccaat ttgtatcaag cctgttactg 8161 gtgctcaccc gttaactcag caatgcacca caggttcatt aagagtttgt aataaagaga 8221 caaggataat ttttttttga tagaatatgg cacaatgtta ttaagtcggt taatagttgg 8281 ctttaattgt gcctacttac tgaacgagct ttcattactt tccccagaaa atatctggaa 8341 gagttatcgt ataaaatatt actaaaaaac acacaagatg aacagctcat ctgatcaact 8401 ctgccaagaa tgtacagaag tagaaaatgt tttagagata gatatcaaac gtctccttaa 8461 agatttagct acatataaaa gaggaaaaga gcttacaaag caggaaaagc tttatctctg 8521 cctatcattt ctaggtaatg agcctattga catagcaaga atagaaaatt accaacgttt 8581 gtatgttgag caaaaaagtg aaaatcctag cttaacagaa gaaaaaatac ataagttagt 8641 agagcaaaag ctcaggaata gggcacatga tattagtcat tatctcagcc aaagtataaa 8701 tcaatatatt ctcggtttga tatcaacatt tgataatagt atatctgata aaaaccctcg 8761 tccatcttgg ttcaaaatat tttttttgtt gaaagcaaac ggttacaaaa aagtggcagc 8821 atctagtcaa aaaccatcaa gcctaaaaag gatagtcatt gaatctgaag atgaaaacaa 8881 tttacaaaat attattgaac ttataaaaat ggttaaccaa aaatttggta atggttcttt 8941 aagtattgaa gaaatagagt cagaaggaga tgaagataat gaacaaaaat aaaacgagac 9001 agaagctggt attgtcaggc tcatcagaag caattgagta tctccaaacg ctattccaat 9061 caggagaact gagtaaatta cttgatgtca atgttttgaa tataagcata acttcagaaa 9121 aaacaccaac agaagtttct ttagtcaact taagtcaatg tttacaaaag aacttcgtca 9181 cagcaatagc tgctggtttt gaggtgattc aagatatttt agaaccccca caattagtat 9241 tgggatatca acgacgttca actcgtagtt ccacgtcagg 
tgaatcagat gagtatattc 9301 gcctacaatc tgctcaaaga ttgctagaaa ctaaccctga taattctact gcgatcgcta 9361 ctttatttga aataatacgt acaactcaag aggaagaagt tcgctggcga gctattgaaa 9421 gtttaccgaa aaatgctcgc catcgtctta ctgatgtaat cgggctgaaa aaagaagaac 9481 tgcgattagc taatcatccg attaaactaa ccgtgtctgc aatcaaaata agcgatgaag 9541 aagtgagtgt ttttatcaaa ctgtatcctg ctggtgaaca aactactctg ccaataggta 9601 tcaagctgat tgtgcttgat gaatctggaa aaattttcga ccaagttccc aatgaagatg 9661 aagagtacga cgaaattaaa tacaaattga tatgtaactt gggagaaatt tttagtgtca 9721 gagtcgctct tggctcagat agcattacag aaagtttttc attttaattg ctttcaataa 9781 atattatgag tgatttactc gttttgaaat tcaggtatga gagtgataat caaaatttac 9841 tagtaaattt tgtaattgaa tcaaatggtg agcttagaga tacagaaatt agaggtacat 9901 tatcatcagc agataatctt ttaaaaattc aaaatcagtg gatcgaagct tatagaagct 9961 tcttaaaaga aaaagaaatt gaccgaatga aaaaagtaga aaagcaaagc attgcaatta 10021 taaaaacaac attcattgat tgccaaaaag cagcttatga attggaaaac acatttcata 10081 attggcttaa ctcagaacag tttcatccct tcaaacagga attagagcag attctaaaaa 10141 gccaaaaaac atctgaagaa aatcgggtgc tgattcaaac aaatcaaccg tggttgtgga 10201 aactcccttt gcatcgctgg gagattttcc aacgttactc ctgtgaagtt ggtttgagtc 10261 ttacagaaaa taaacgagta gaaagaccat taccccctaa gcctagagct aaagttagaa 10321 tcttggcgat tattggtgat agtactaaaa tagatacaaa ggtagatagg gagttactag 10381 agaaaatacc tgatgcagaa attatctggt cgcaacagcc tcaacgccga gatttatacc 10441 agaaactgtg gaacccacat ggcttggata tcttcttttt cgctgggcat agtaagagtg 10501 aggcgaatgg cgatagcggt cgaatttcca tcaatgaaaa tgatagctta tcgattgggg 10561 aacttaaaaa tgccctgaga tttgctatcc gattaggttt aaagctggca atttttaact 10621 cctgcgatgg cttgggttta gcaagggaat tggcagattt gtacatccca catttgattg 10681 tcttccggga atcggtggac gataaagttg cccgagaatt tttacaacat tttctcgatg 10741 cttactccag tgataattct ttatatacct ccgtgcggtt agcaagggag aaactgcaaa 10801 ctctagagga agatattccc tgcgcttctt ggctaccagt catctgtcaa aatccatctg 10861 atgcaccttt aacctggcaa aagttgcgcg gtagtcaaac agaggctacc tggcgggctt 10921 gttgtcaagt aatgctcgca ctatcaacct 
ataaacggtt gctcaacaat cccctaaccg 10981 ataagaatga gttagggctg gaactggagg aaatttatgt gcctctggga ttggtggagc 11041 gacgcaaaca agacagacgt atcggggatg tttccccaga gcagggttcg cagttttacc 11101 agcaagaacc ggagtatgaa atcactaaaa cctttaagcc gcaggagttt tttgaacaag 11161 ttcttcacca agggcaaagc cctaagagtc agggcaaacg gttggttatt attggcgaac 11221 ctggggctgg taaaaccacg ctgttgcaga agattgggga ttgggtgtta gagaacacca 11281 ttgaggacgt gccaatttgg atatctctgg cagatttaca gaggggacaa aggttggagg 11341 gctacttaca ccagaactgg ctagaattgg caatgcagac tgtacgtgtc cctggtgcaa 11401 cttcctctac tttagtgcaa atgttccaaa gtggaagggt gtggctgttg ctcgatggag 11461 tggatgagta cacagccaac tctgtcaacc cgatattgga actcgtcaac caactcactg 11521 actgggtgca aagggcacga gttgttctga cttgtcggtt aaatctttgg gaagcaaata 11581 aaaacgctct ggaaactttt gatacatatc gtactttaga ctttagttat ggtaaccctc 11641 aaacttcaga ccaagtagga ctattcatcc gtcgttggtt tatagctaat tcagagagtg 11701 gtaggcggtt gcgatcgcac ttagatgaac ctggacgctc acgcatcaag aatttggtca 11761 aaaatcctct gcgattatca ttattgtgtc gcacttggca atggcaacaa ggtaagttac 11821 ccgagactag ggcggaactt tatcggcaat ttacggaagc agtttacaaa tggaaacaag 11881 gaatcttccc tactactaaa gcgactgagg aggaattaaa cgcagccttg gggcgattgg 11941 caaggcgatc gcttgactta gaagcatcct ggtttcggct accgcatcac ttggttgttg 12001 aagaactggg ggaatttgat gctcccctat ttcaattagc actccaactg ggctggttaa 12061 atcaggtggg tgttgcagcc caaaacccag aagagccagt ctacgcattt ttccatccca 12121 ctttccaaga atattttgca gcatcagcga ctggcaattg gcacgatttt ttacagcatg 12181 acaatcatca acccgctccc atgcagggta tttaccggat ttttcaaccg caatggaaag 12241 agccaatgat actgtggctg gggcaatctg aacaggaggt actaaagcaa gaaaaagaag 12301 agtttatcaa ggcgttagta acttttacag atggatgtaa caatttttat tggtatcggg 12361 cattttctct agctgctgct ggtattgctg agtatcccga ctgtagccgt gcagacttga 12421 ttgtggagca gctcgtcaca caagccttcc gtcatcaatc ggtcaactca ccagaatact 12481 cagcacgctt gagcctcatt gcagacaaag cgaaagaagc actaaaagaa accgaccatc 12541 cacgggttat caatgctctg aagggaattg aaccaatacc tatctatttt ttggtagaaa 12601 ttgcaactag 
cagcccagat gatattcgcg ccttggctga attggtgcag aatcaccaac 12661 aggaagatgt tcgtttgata gttgcgaggt ggttattgca agttgacccc ggtaatcagg 12721 atgctactga tactctcctt aatatgttgc gctacagtcg aagtccttgg atttatcacc 12781 gagctgccag gggtttgcct ggaaactctg aagcaatcag catcttgcag gaaatactag 12841 gcagacctcg acatccattt gataattttg aaattaaaaa tatattagct gagattggtg 12901 ttggtaatcc gaatgctacc gacagcttag cagcgagcgc tcccactaaa aacaggactt 12961 ttaactcctt gttaaattta ctatctaaaa aaatcttgat aatcgtttat ataaacttca 13021 gcaatacaaa agcgaactta cttcaaaata gaacatttag ggggatattt ctagggttga 13081 gacaatcatt tatttttctt ctgagaaatg cctgtaacac tttagtttac ttgcatcaaa 13141 aagcaaatca tgatgagaat ttccacctac agttcattga gaagctagca aaatttgacc 13201 cgggaaaccc caagcttatc ttgagtctag ttgagttgct gcgtactggg caaagtccat 13261 ctgtatataa gcaggctgct cagagtttaa aagaaatcct tcgaggacaa atgtgtctag 13321 tcgtagtctc aggcttgaga gattgcctgc aaaaaggaac ttctgaagaa cgatacgatt 13381 gttgctatga ggttatttgg cattgtgccc aacattcagc ttaccctgac ttctatcaag 13441 cttggcacgg tcaacctttc atctctgatg cccaaagaaa ctgaccacat ccccaatggt 13501 cttacttttc acccatgctc aacctatcac gactcaacag agatgagttt tccaaatcaa 13561 acaacatctg caaacaatgt ggtaatgtca atatgttagt tgatgttagg tagtgtgaga 13621 catggcgaaa acaccatcat ccaccaagtc agagtcagta acgcttagaa tcccgaatga 13681 agtgtttgat cagattctaa attatgcggc tgcaaatact aaaggcaatc gttctgttgc 13741 aattgttgag ttgataaaca tggggctagt ggttgccaaa caatctactc aaacaagttt 13801 gacagcacaa cagagtatga aacaaaaaga tttaagtgaa gctgtaacac tgctaaagtc 13861 acaaatgaat caacttaccc agcttctaca agaaacggtt gtgcagcgtc taacagtttt 13921 agagacagaa ttgggggagt taaacgcctg agggaagaga actcctctct caggcaacca 13981 attcagcagc accttcttga aactgcacgc gaacgtatcc ttaccagtct aaaagtgggc 14041 aaacaagcct caaagtacaa aagtatccga tcgctgctaa accgctttat ctccgaacta 14101 cgctcactgt aatacacaaa agattaattg agcattttct aaccaaccgc attttattta 14161 actcctgctt gtgatttttt caataagaaa aagttcaaag cgatgaaatc tgttaccttt 14221 ttgttactat taatactctc tttcgcagta gcatttccag ctcttgcatc ggtgtgccgc 
14281 aattgggatg gtcaccaaat ctgcattctt gatatcaaac gaagtgccaa aaattattgg 14341 gaatacagag cagctgttag catagatgga gtgaaaacac ctatcgaagt ctacaattgt 14401 cgttctaaag ttaaggttca gcaagatggg actgctttgc catttgagca taacgatcct 14461 ggtgaattga tatgcagttt tttcaagaag tcttgaggaa caatataaga ctcctagctg 14521 tacaaccctc cccaaactca ggagaggagc ggttttggcg tccgtcaaac cctgggtgag 14581 gttagtttaa cactcagcat tcccgctaat tttttagcaa taatacttat taataaaagc 14641 gttgctgctt taaattttca caatgcggtg gaatgaactg aatttgatgc tatgattcca 14701 cagggaataa ccgcgctatt cgcgctatcc tgccgattaa tgtagatggt aaatctttca 14761 agtcaaagat tgccaccaaa caagtgcaaa gtgctgatat ttgccagtca cgacaccaac 14821 attgttaaat aagacttgtt gaggctgttg ttaggaagaa agcagaataa catggctaag 14881 tctcatgcat cacagcgcaa cagaagtaat ggcgcaaagc atcgcgcctt aaaaggcgat 14941 agcccagaaa atcgcccaca aaatctcatc ttctatctaa ttacttccat cctgctgcac 15001 tctgttctgt ttcttggaag tgattattgg cttcgggctt ttgcacccaa gcaagaactc 15061 tccgaaacaa taccaattga gttcgttgaa gttcctccta atgagacaaa gacgcccccg 15121 gaaacttcac aacgggctgc caaaaattct gttgcaggtg gaaaagcaaa acctgaaaaa 15181 cctatttctg ctggtaactc agcatccaca actacaccca aaactaagag tgcttctgaa 15241 ccctctgagg tattactccc acaacgaaca cagcaaaaaa cagtatcatc aaatccacct 15301 cctcaaaaac tgcaacccaa acctcagaaa atagcgattg ctcctgtcac caaaccactt 15361 gtacctgaac caaagtcaac agcagttaca cctgagacga taccaccaga accattacca 15421 tcacctacag cagttgcacc taccaccaaa ccacttgcac tcaaactgcg acaaagaaca 15481 gttacacctg aaacgacacc accagaaccc gaacctttgc caacannnnn nnnnnacaaa 15541 gaacagttac acctgaaacg acaccaccag aacccgaacc tttgccaaca gtagttgcac 15601 caaaacctaa accactcgca ctcaaactgc gacaaagaac agttacacct gagacgatac 15661 caccagaacc attaccatca cctacagcag ttgcacctac caccaaacca cttgcactca 15721 aactgcgaca aagaacagtt acacctgaaa cgacaccacc agaacccgaa cctttgccaa 15781 caacagttgc accaaaacct aaaccactcg cactcaaacc ccggcaaaca cttgttaccc 15841 ctgaaacaac acaaccagta ccattaccat ccccaacagc agttgcaccc accagcaaac 15901 cattagcact caaaccccag caaacagttg ttacacctga 
aacaacacaa ccagtacctt 15961 taactcgtcg ggaaaatagc agtcggcttg cgacaaaatc ctctcctgaa cagttcgaga 16021 ataggactaa ctcgctagtg tccccagaaa cacagcgtat agcaccatcg cgttctcaaa 16081 gtcagtcaac aacatcacca cgaaaactgt ctcgatctca agcatcttca aaatcaggtg 16141 gcgcaagcag tttgggtggt ccaatgagtt tatctagtcg tgattttgga agcaataatt 16201 tggcagccct gcccaattcc aaccgcctca atcaagggac acaaggtatt gatgctcgtc 16261 aagatgtaga catgagcgct tacctgcaac aattacaaga gcaagtgaag cagcagtgga 16321 tacccggact cacccaatct tcccaacaga cagtacttag ctttatcgta agccgagcag 16381 gtttagtcag caatctccag gttgtacaaa gttctggatt aaccatgact gatgaagcag 16441 cactgaatgc tgtgaaccga gcagtacctt ttgctccttt tcccacagaa tatccacaag 16501 actatatcaa gattcaattt acatttaata tcaacgtcta tgggcagcta gagttatgga 16561 gcgatcaata acgtagctca ccaaagctgc tgaaacaggt ttttagctgc ctggaacttt 16621 tctgtttgga ttgtgtcatc atatttatgt agtttctgtg agtgtcgagc acatacagtc 16681 atgagacgct tgctgacagc cttattcgta atcatgagca caaatgttgt gactcatcct 16741 gccttagcca acgaagaaaa tgtttctcct gtcataacta accagggtca ggctgcagct 16801 cttgtgactc aaccagttac gattaatagt ggaacagtta gtgtcacaga ccacaccaac 16861 aatctcagaa aagtttctga ctcaatcggg gtagtacgtg agtcacaatg caaaaagatt 16921 aacccgcttg agctgatcaa atctcctggc aataccctca aacaatgttt agaggaaaca 16981 aataaacaag ctgatcaaat caaccaaact tctcaacctg cagagcagtt tgagtacttc 17041 aaggttccaa aactcgaatc tggtgttaat gtcacagtga ctaagtttta agagcgtgtt 17101 agcagagcat cgattttttg caagcaaaga ataataccag ttaatcttat cgtaaaaatg 17161 tgattggtat aatttatgca gatattttgg gagttgcaaa tgtggcttgg gtagcaggcg 17221 atcgcttaca aggtggaaaa tacaccattg aaagagagtt gggaagggga cgctttggca 17281 tcacttattt agtcaaaaat agtaacagcg atcgcctagt cgttaaaacc ttaaatgatg 17341 acctgctcag accccttacc cagcaggagc gcgaaaggtt agagacgatg tatttgcagg 17401 aggcattaaa actacaaaag tgtaagcata agcatattgt acaggttaca gaagttttta 17461 aggaaggaga gcattcgtgt ctagtaatgg agtatgtgga cggggacagt ttggctgacc 17521 tccgtccacc aatactttcg gaagcagacg cactacgtta cattcagcaa attggggaag 17581 cgctgatagt tgtacatcaa 
aatgaactga ttcatcggga tgtgcatccg ggaaatatct 17641 tgttgcggaa ccgagaaggg cagttagaag ccgtgttaat cgattttggt ctagctttag 17701 attttgacca tattttaacc acaagccgaa ccaaggaaac ttctgatgga tttacgcccc 17761 ctgagcttta cactaaacgt actatagctc gtgcgtatag cgatatttat tcactggcag 17821 ctacgctcta taaacttcta acggggagaa cacctgtcaa tgcggtaaag cgcaaagtgg 17881 atggtgagca tttagtgtct ccaaaagaat tcaatcctca gattagcgat cgcgttaacg 17941 gagcaatttt aacagggatg aagctagatc acaaggagcg atcgcaatca atgcgagaat 18001 ggctcgattc tttagggtta agtggagaaa ctccccagcc tgtatcaact gtactcccta 18061 accctaaccc aaatcgggag aaaaaaatca attggacact tgttatagct gcgatcgccg 18121 cgattggagc cttactctca ggcattgctg ctttaattcc tattctcaaa caatctccgt 18181 ctcctagccc atcgtcactg tctcccagtc cgtcatctac caaaactccg tagaattagc 18241 taaactttca tcagggtgct ggcattaaag taatttttta agtatttttt cagacttaga 18301 ggtgttcatg ccttggacag caggacaacg cttgcaaggt ggcaaatatg taattgggga 18361 agtcctgggg caggggggat tcgggattac ttataaagca ttgcatgttg agttaaacca 18421 gacagttgtt attaaaacac ccaatgaaca tctcaggcat gatcctgagt atgacaagta 18481 catagaccgg tttatcaaag aagggcagat actagcacga ttatctgaag aaccccatcc 18541 tcacattgtt agagtgattg acctgtttaa ggaaggtgca atccactgct tggtgatgga 18601 ttttgtaccg ggagaaaatt tatttgaggc agtcagacgc aggggagcgt taccagaagc 18661 tgagattgtc ccctgtattc gccaaatcgg agaagccctg atgatggtgc atcaggcagg 18721 gctagtacac cgagatgctc accccggaaa catcatgctg cggaacaatg gcaaagcagt 18781 tctgattgac tttggtatcg ctaaagaact tctacctaaa actttgagtt caacaggtaa 18841 tgcgggtaat catggatttg cgccctatga gcagatcact aggggcagtc gggagccaac 18901 ggttgatatt tattgtctgg ctgcgacact ttattatgcg gtgacaggtc aaaaccccac 18961 aacttctctg gctcgaaagc ttgatgatgc ttccctgact ccacccaaac aaattattcc 19021 agacattagt gaacaattaa attacgcaat tctcaagggg atggcgctgg aggcagaaga 19081 ccgtcctcag tcaatgcagg aatggttgac aatgttagaa gcactaaaag cgtcgcctcc 19141 acctttagtt gagccagttc ataaaataga agttgttcgt cgccaattcg acgagtcaaa 19201 gtcaaaacca gctactaaat cacctagaat tatcgcttgg ggctggttgg ttggtgtatt 19261 
gttgaattac acaaatataa gctatttctt agcagcgctt aacgctccgc gaataatttg 19321 ggctgtggct gtggcttggg ttgtggcttg ggctgggctt gcggctgtgg ctgtggcttg 19381 ggctgggctt gtggctgtgg ctgtggctgt ggctggggtt gtggctgtgg ctgtggctgg 19441 ggctctgagt ggggttgtgg ctgtggcttg ggctggggct ctggctacgg tttgggctgg 19501 aaagaaatta caaaaatctt ttagcaaggt tcatactttt ctaattttag ctagcacatc 19561 tagtctaggc ttgggtttgg gattgttagg acataggatt tttcatacag gttcatagct 19621 ttatcatctg aagctaactg gcaaaagatt ttgtagctgt tgtatcgact ggagaagcga 19681 tgtagtagcg acaagctgag aaagcgtcta cggatcaaca agccaaagtc caaaaatatt 19741 aggcttctgg ttcaacacca agcgattgat tgctcttaag agtttcaaca atccatctgt 19801 cgcattcttt tttcaaattg gtataacttt ctgtgtcacc ctgtgatatc atgtccggtt 19861 aattgcttat aaatccagaa gaaccccacc ccggttttgt cttgcgccaa aaccgcccct 19921 ccccgcaagc ggggagggga ttaaggggag ccagcgccgt gcgggggttc ccccccgttg 19981 aggcgactgg cgtgtggggt gcagttaacg taggaatcac aactaattag tcggacatga 20041 tatgagcaga tcactccaca atatctgcgc tatgtataca agtgtaaggg agaaatagcg 20101 acacaggaac ttgctgctat gcgatgtttg aaagacttgt caaatcccac ccttttttcg 20161 tcactctgga taagcgccaa agattgtttt cgtaacaaaa ataaagacat gtgaatagca 20221 tgtattattt tttacatata taatgaaaca gcttcacaac tatcaccact ccatctgatt 20281 cctagaaaag gtgctttatg gtagttattt acggtgctac agggaacgat tatttaaagg 20341 gaacgcaaga caatgatttc atttatggta atttaggaaa cgacgttttg atcggtcttg 20401 ctggtgacga ctttttggaa ggtaaagagg gtaacgacgt acttataggc ggtgacggta 20461 acgatattct atatgctgga cttagtaaag taccatatgc cccaggttac ggaggagact 20521 taaatagcgt aaatttactg tatgggggca aaggtgacga ccagttattt ggttcactgg 20581 gtaaggattt cctgtttggt ggtgatggca atgatcgaat tataggtttg tctggtgatg 20641 actatttaga tggcggcgac ggtaacgaca gattagatgg tgggttcggc aatgatatcc 20701 tgattggtgg taatggagat gacatcttat acgatggaag ttacaagact ggcggtggca 20761 atgatatcct tttaggtggt aatggagatg atattctcaa cagtggcagc ggcaatgata 20821 tccttatagg tggaaaaggg aatgatattc tcactggcgc tggatacata cccactggtt 20881 cccctggtgg tatctatgga gtagatgaaa tagatatcct aattggaggg 
gaagggagag 20941 atacattcaa ccttggaggg cccaacgctg ctggtaactt ctcggtttac tatgatgacg 21001 gtaataaatt aaccactggg gagaatgact acgctttgat tgctgacttt aacccaagtg 21061 aggatattat ccagcttgta ggaacagcat ctgactatat cttgggatca tctcctgctg 21121 gcttaccaaa tggagtaact atcaacctga aaaaacctga tagtgaattg aatgaactta 21181 ttgcaattgt accaggtgtt tcaggtttga gtcttgacag tagctacttc agatttatct 21241 aacgcttgtt gtgatttcct attgccttat ggtgagggtg aattatataa gtaaaattta 21301 taaccagatg aacagaaaat caatggtgcg ttaggactaa tgtccgtaac gcaccctaca 21361 tgttactgaa cagtttatct ttaccgtgta tgcaatccat ccataccttg tttaatccaa 21421 ctcaagcaat cgtattctga tggaaataac tttggtgctg caatattatt cagtgagtct 21481 gaaggataat ggtcatagaa atattttcca gcctcttgaa acacgataat taagttgttg 21541 cggtaaacgt agcgaaattt aaaattatcg caatgactga caaaatctga atattcgtca 21601 atatgttctt tggcaatatc aacaatattc tctatactag tgccatagat ttctgctgga 21661 ttttctgctt tgtgaaattg gctttgatta gcaatttctg ataatagttt ataggaataa 21721 atatcctgac tatcaattgt accaaagaca aatggaatga tgagataacc tttatatgaa 21781 acggaatttt cataaagaag acgactcatc taaccctcct ttggctcttc tactcaaatt 21841 accacagtac ggctaagtac ggtaagcata aaaatataag atcagatttc accctcaatc 21901 cctctccttg ataaggagag ccagtgcgtt gcggagccag tcctgcagga gggtttccct 21961 ccgtagggac ctggcgttgg ggttcccccg acggtgcgac ctggcgtgag ggaagccgga 22021 ggcagggtga ggttttatat ttaatttgac ccacttactt accacaaact gagttgatta 22081 gggggattgt caaggatatt ccaaggaaga ggaggaacag ctacaccact ctgttctaat 22141 aacttttgaa agtgacgtgc atttcctggt gagtgttctt ccacaggaca atgaacaaaa 22201 aaataaattt gcgttccctg ctgcaaccac tgcttaatct gagtcaccca ctcttccatg 22261 aaaggctgat ttactgataa attgggatga gaaataaacc gaatcaaact aaaaggcgct 22321 agtacgctcg ctgtcacgct aaattgtaca ggaactttag gtttacgccg ttcagattgt 22381 atttgcggat catcatctcc ggtgtagata ggacgtgagt ctagcagaac ccttccaaca 22441 ccaagccttt ctaaaagtgc tgtcaactca ccagctaatg gttcagtaaa ccagtcttga 22501 tgccgaactt ctaacgccaa acgtacgtct gtttgctgta aagcttcgag aaatacgccc 22561 agatcatcaa tcagtgtagg cggataactg 
gggggtaact gagcaaaaac aggtccaaga 22621 cgtttaccta aaagacgcat cccttctaaa aattttaaag catcagggat ttggggttgc 22681 aatcgtccgt tgtgggtgac atctcgcggt aatttgagac aaaattcaaa attttggggt 22741 gtctgtgtcg cccaacgggt gatagtctcc tgattaggta cggcgtagaa ggtggtgtta 22801 ccctcaacag tggtgaagcg tcggctatag agttgcaaaa aatcactttg gcgagtccct 22861 tgagggtaaa gttcgcctac ccagccttta tatgcccaaa cagcgcaacc aatgaaaaag 22921 ttcacaaatt ctttaaataa taaaaaactc gaatgtagtt atatctacta taattttttg 22981 gata // LOCUS NODE_1378_length_22829_cov_5.148678 22829 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 22829) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 22829) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..22829 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 357..560 /locus_tag="DP116_12385" CDS 357..560 /locus_tag="DP116_12385" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12385" /translation="MPDASELLKSTSHFLDASLYEVYKEIETSEKSGGATVESNHQAI QDGLVQWRSLLPCDERGSQLIIK" gene 1053..2039 /locus_tag="DP116_12390" CDS 1053..2039 /locus_tag="DP116_12390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210064.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12390" /translation="MDTVYLAQVHLHLNSWTQSNGQSTLKSKTPHGLDHSHNLYSEPV QDWTIEAMKFLSCVARCKERFGMLHIIDVLRGGKTQKITQHEHDKLSTYGIGKDKTLD EWRMLGRSLLEQGLLEQTGDRYPVLKLNTLSWEVMHKKRTVSITIYVVQEMTWEDDNE KAAEVEMLMERLRSLRQELAEEQSLPPHFIFQNSTLELMAQVQPQTKEEFSKLSGVGH HKLVQYGDKFLGEICAYRTEQGLLEQADLDRQIQKVEFPKNSEDFDRVDKLIEQLRQS LESLSSDKIPEDVQKFLKAAAKSGATIDLLTLEVKEWLIQHRLAESLRIRLT" gene 2495..3427 /locus_tag="DP116_12395" CDS 2495..3427 /locus_tag="DP116_12395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319278.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_12395" /translation="MNVGNWVYLGVGLAIGIGVRGFLARPEVSSSSSTVLSQNKQQAT PILLQQMKQTQLAYHMAREISQFKAGFLARTTHVLRSPLNGLIGLHQLILSNLCENPE EEREFIAQAHERALKLLKLIDEILSVARVEHGTNKLHIQPLDLSELLQEVHDLTYMLA ENRNFPFQVLPPDPETYILADPNWLRQVLVTLVESCIAQMEEGSICISAHVAHTSGYV DIWLDVPTHATPLSEPIDFMTSKNKSPEVGEKNDILLPGMKLLLNQTLLEVMAGKLQI VPFPAPQEQASDMTRLQVSIPLVIPEVELLQEEN" gene complement(3378..3851) /locus_tag="DP116_12400" CDS complement(3378..3851) /locus_tag="DP116_12400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_12400" /translation="MNSSHNQIRFSDRKSKIDLYQLQQLLNISAFWAKNRSIEDLGIA IANSDPVITVWDGERLIGFARATSDCVYRATISDVAIHPEYRGVGLGSKLVETVLSHP RMNRVERVYLMTTHKQRFYERIGFQQNSTTTMVLYNQPKFNSLPVEVQLQESLGG" gene complement(3959..4372) /locus_tag="DP116_12405" CDS complement(3959..4372) /locus_tag="DP116_12405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319280.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12405" /translation="MDQPEEPPKTSWYTDESDVSFAKQVQKLHQLEVYSRWLFVGFLW LTITPLCVWNLRGEIALWRQYFTWVAVRYGLYYHPLASLGLFFCLGMTVAVLIWQSRN ILFGLPQQEKQRLEQQVFRIRQQGQSHPFWKWVCS" gene complement(4429..5349) /locus_tag="DP116_12410" CDS complement(4429..5349) /locus_tag="DP116_12410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007356383.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="protein translocase subunit SecF" /protein_id="PRJNA477356:DP116_12410" /translation="MKISVIKQRFLWWALSGAIILSGIIAMVISWQTLGAPLRPSLDF VGGTRLVFERDCTIAQNCAKPIELSSVREVMDAQGLGNSSIQVVGQNQQAVSVRTETL NVEARTKLQTALSEKLGAFDPKATQIETVGPTLGQELFTSGLKALIVSFIGIVIYLSV RFKTDYAVIAIIALLHDVLITTGIFSILGLVQGVEVDSLFVVAILTITGFSVTDTVVI YDRIREILNQHPNDPINQVVDDAINQTLTRSLNTTFTVLLTLFALFIFGGETLRNFAL ALIIGFAVGAYSSIFVAGPLLALWRGRSPT" gene complement(5346..6764) /gene="secD" /locus_tag="DP116_12415" CDS complement(5346..6764) /gene="secD" /locus_tag="DP116_12415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="protein translocase subunit SecD" /protein_id="PRJNA477356:DP116_12415" /translation="MQRQRSLLVLILVLVIAAIAVIATIPASLGLDLQGGSQLTIQVK TTPEIKQITDRELEAVKRVVEGRINGLGVSEPVIQTAGQDKILVQLPGVNDPQQAERV LGGTAQLEFKKQKPNTEIQLNTFFASKTQLKAKQAELRKTNDKAAIEKNQQDLKKSNE AIAELFESTNPPLIGKYLKDAYGEPTQSNNWNVAIRFDDQGGQLFADLTKQLAGTGRS IGIFLDNELISFPTVGPEFATSGISGGAAVITGRFTSQEANDLGVQLRGGALPVPVEI VENRTVGATLGKDSIQRSIYAGIGGLVLVLIFMVAYYRLPGIIADIALVIYALLTYAT FVLLAVTLTLPGIAGFILSIGMAVDANVLIFERTREELRAGKSLYRSVEAGFYRAWSS ILDSHVTTLISCAALFWLGAGLVKGFALTLALGLLVNLFTSIICSRTLLLVAIGFPNL RKPELFCPNLAMYTKSEPEAVQ" gene complement(6944..7927) /locus_tag="DP116_12420" CDS complement(6944..7927) /locus_tag="DP116_12420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459052.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha-ketoacid dehydrogenase subunit beta" /protein_id="PRJNA477356:DP116_12420" /translation="MAETLFFNALREAIDEEMAHDATVFVLGEDVGHYGGSYKVTKDL YKKYGELRLLDTPIAENSFMGLAVGAAMTGLRPIVEGMNMGFLLLAFNQISNNAGMLR YTSGGNFKIPMVIRGPGGVGKQLGAEHSQRLEAYFQAVPGLKIVACSTAYNAKGLLKS AIRDDNPVLFFEHVLLYNLKEDLPEKEYLLPLDKAEVVRSGKDVTILTYSRMRHHVTQ AVKTLEKEGYDPEVIDLISLKPLDFDTIGASVRKTHRVIIVEECMRTGGIGAELTASI NERLFDELDAPVLRLSSQDIPTPYNGTLERLTIVQPEQIVEAVEKMVALRV" gene complement(8151..8495) /locus_tag="DP116_12425" CDS complement(8151..8495) /locus_tag="DP116_12425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319286.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" /protein_id="PRJNA477356:DP116_12425" /translation="MDMQVLRERAGLSRAEVAFRLAISETSVRNWEAGRTEPTMTPKK YLDALRLFKCTPEELAAASEKSINQRHKRKPGRPRRFPESSVNPSPPVNQVSQVNQVN DSPSFQLKEVGY" gene complement(8860..9840) /locus_tag="DP116_12430" CDS complement(8860..9840) /locus_tag="DP116_12430" /EC_number="4.2.1.24" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127225.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="porphobilinogen synthase" /protein_id="PRJNA477356:DP116_12430" /translation="MFPTHRPRRLRTHPQLRRMVRETVLTTNDLIYPLFAVPGEGIAN EVRSMPGVYQLSIDKIVEEAKEVYDLGIPAIILFGIPTDKDVDATGAWHDCGIVQKAA TAVKEAVPDLIVIADTCLCEYTSHGHCGYLQVGDLTGRVLNDPTLDLLGKTAVSQAKA GADIIAPSGMMDGFVQAIRSALDEAGFHDTPILSYAAKYASAYYGPFRDAADSAPQFG DRRTYQMDPGNSREALKEIALDIAEGADMLMVKPALAYMDVIWRVKEASNLPVAAYNV SGEYSMVKAAALNGWIDEQRVVMETLIGFKRAGADLILTYHAKDAARWLR" gene complement(10193..10534) /locus_tag="DP116_12435" CDS complement(10193..10534) /locus_tag="DP116_12435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459045.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4870 domain-containing protein" /protein_id="PRJNA477356:DP116_12435" /translation="MYDPDKRKILSAVSHGAIFLSATFVSVGLPIAILFVSEDPVVRE NAKESINFHFNVWVYGAIVTGLTWITFGVLLPLAGLWFIAHWGLTIWAIFHVLTDPDK PFRYPFIFRVF" gene 10669..12300 /gene="prfC" /locus_tag="DP116_12440" CDS 10669..12300 /gene="prfC" /locus_tag="DP116_12440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319290.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptide chain release factor 3" /protein_id="PRJNA477356:DP116_12440" /translation="MSTELESELHKAVDSRRNFAIISHPDAGKTTLTEKLLLYGGAIH EAGAVKARRAQRKATSDWMAMEQQRGISITSTVLQFEYQDCQINLLDTPGHQDFSEDT YRTLAAADNAVMLIDVAKGLEPQTRKLFEVCKLRGIPIFTFVNKLDRPGREPLELLDE IEQELGLVTYAVNWPIGMGDRFKGVYDRKEQQIHLFERSAHGSREAVDTVVDLGDARI EELLEQDLYYQLKNDIELLEGVGPELDLELVHQGKMTPVFFGSAMTNFGVELFLKYFL DYALKPGPHSTTVGEVTPIYPEFSGFVFKLQANMDPKHRDRVAFIRVCTGKFEKDMTV SHARTGKTIRLSRPQKLFAQERESIDVAYAGDVIGLNNPGVFAIGDTIYTGQKLEYEG IPYFSPELFAVLRNPNPSKFKQFQKGVSELREEGAVQIMYSVDEAKRDPILAAVGQLQ FEVVQFRLQNEYGVETLLELLPYSVARWVEGGWEALNKIGRVFNTTTVKDSMGRPVLL FRNEWNCNQLQEDHPELKLSSIAPVVSGQATNSNS" gene complement(12483..14102) /locus_tag="DP116_12445" CDS complement(12483..14102) /locus_tag="DP116_12445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131438.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GMC family oxidoreductase" /protein_id="PRJNA477356:DP116_12445" /translation="MRFSDLNECEDNLLIETDLCIVGSGPAGLSIAKEFAGTKIKVWI LESGGFEKESDTEALCKIENVGVSRKDNTRNRLYGGTSHTWTGRCAPFNPTDFQKRSW IPYSGWDLKYEELEPLLERAGKNLGLGPNCYDNQLWKLFKVLPATPNLDSKFLDTTFW QFSKSPKERGEPIRFGRDFFIDNSPNIEVLLHANVTHINTNELGTCFESVEVRTLEGK RAFVKAKALVLSCGGIENARLLLASNRTIPNGIGNQNDVVGRFLMDHSLCVIGSFDHH AAYSVRNRFGHYLIKNKQRTHVYLHGLSLSPKIQEQENLLRCDAYLEEYNPLPNDSWS AMRRLKTTLRKRECIKQSTQDIFTILSNFREISSGFHRRYVKRRPPLLKVERLELHCM LEQLPDPESRITLSTNQKDRLNMPLAKINWKVNSLERQTAQRMSQLITQEFQRIGFPA PHLSSWLNKEENWTQNFEDKAHHTGTTRMSANPKKGVVNVNSQVHEVNGLFVAGSSTF PTSGTANSTLMIVALALRLADHLKYNYFRAS" gene 15567..17183 /locus_tag="DP116_12450" CDS 15567..17183 /locus_tag="DP116_12450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="B12-binding domain-containing radical SAM protein" /protein_id="PRJNA477356:DP116_12450" /translation="MKTLLLYPCFPQSFWSYDRFLKIAGLKAFIPPLGIITVAALLPK DWEIRFFDRNVNPETDDDWQWCDMVILSAMLVQKPDFHALIQKAVRLGKKVVCGGPYP TSIPQDALASGAHYLVLDEGELTIPQFLEALAQGKDQGIFRASDKPDVTKSPIPRFDL LKLDAYFMMAIQFSRGCPFNCEFCDIITLYGRKPRTKEPSQTLAELQTLYDLGWRGSL FIVDDNFIGNQRNVKRFLLSLIPWMKERNYPFTFITEASVNLAEDPELLHLMVEAGFN AVFLGIETPDQDSLKVTHKFQNTRNPLLEACRQINSAGLLIYAGFILGFDGERSGAGD RIQAFVEQSSIPQPMLGILQALPNTALWNRLKAEQRLLEGIGVVEVGDQNTLMNFKPT RPLTEIAREYVEGFWTLYEPANYLRRCFQQCLNIGLPPEKRQTMRFPAGKGLRLVAQL IWHQGWQRSEIRLQFWQQLWTILRTKPQVLNMYLGLCAAGEHFWEYRVLARERIAQQL GYDPLTVPGSLTLESIKSYDCTKTKNNAVS" gene complement(17155..18876) /gene="ureC" /locus_tag="DP116_12455" CDS complement(17155..18876) /gene="ureC" /locus_tag="DP116_12455" /EC_number="3.5.1.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870741.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="urease subunit alpha" /protein_id="PRJNA477356:DP116_12455" /translation="MNRREYTNTFGPTIGDRIRLADTDLFIEVEQDYTTYGGTVYGEE VKFGGGKVIRDGMGQSPISNADGAVDVVITNALILDWSGIIKADIGIKDGKIFKIGKA GNPYTQDNIDIIIGPGTEAIAGEGKILTAGAIDTHIHFICPQQIETAIASGTTTLIGG GTGSATGTLATTCTPGPWNIYRMLQAADGFPVNVGFLGKGNTSKPEGLIEQVAAGVMG LKLHEDWGTTANAIDTCLSVADEYDVQVAIHTDSLNEAGFVENTIAAFKNRVIHTFHT EGAGGGHAPDILKVCSLANVLPASTNPTRPYTINTFDEHLDMLMVCHHLDKNIREDVK FAESRIRKETIAAEDILHDMGIISIMSSDSQAMGRVGEVITRTWQTAHKMKVQKGLLP NPGHPHETHDNFRAKRYIAKYTINPAIAHGVADHIGSVEEGKLADLCLWKPAFFGVKP ETVIKGGMIAWSQMGDPNASISTPEPVHMRPMFGSFGGAMAETSITFVSNEALKQGVP EKLGLKTRTVAVSKTRELRKADLKLNDATPQIEVDPETFHVRVDGKLLTCEPATSLPM TQRYFLF" gene 19168..19380 /locus_tag="DP116_12460" CDS 19168..19380 /locus_tag="DP116_12460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002785630.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapB family antitoxin" /protein_id="PRJNA477356:DP116_12460" /translation="MQITLNLDESLLNEAFQLTNLTSQEELVNLALQELVRLRRKKNL LDLAGQIQFTEDFNHKALREARHAAD" gene 19367..19768 /locus_tag="DP116_12465" CDS 19367..19768 /locus_tag="DP116_12465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012626637.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PIN domain nuclease" /protein_id="PRJNA477356:DP116_12465" /translation="MLLIDTSVWISVFRDSSGQIGKQLETLIAEREVLLTRFTQLELL QGSLNEKEWILLSIYLETQDYVELTSHSWQAAARIYYDLRRQGLTVRSPIDCCIAQSA LENDLLLIHNDRDFETIAQVRRSLQHFRFQP" gene complement(19868..20572) /locus_tag="DP116_12470" CDS complement(19868..20572) /locus_tag="DP116_12470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019507901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="urease accessory protein UreF" /protein_id="PRJNA477356:DP116_12470" /translation="MSLPASAIAQQLALMQLSDSFFPSGSFTLSHGLESLVQENQLRG SEDLQTFLRLLLRNKVGVTDTVALIHAYRGSAIADLKAVRAADARLFAQTLVEKTRET QRKSGRALLMVASSTWQDSQLEILNQDAAIGKIHCLHPVIFAVVGRAALLSEQDTVLG FLHSFVTGLLGAAIRLSVVGHLQAQRVLLQLAPEIEIAYETAASMSLEEMWSCTPAID IAQMRHQKLAQRLFAS" gene complement(20569..21093) /locus_tag="DP116_12475" CDS complement(20569..21093) /locus_tag="DP116_12475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="urease accessory protein UreE" /protein_id="PRJNA477356:DP116_12475" /translation="MTELAEIYLGNSKENISLSERIEKARSSALCLEVHISQTDSRKG RIHAHTTTGAAVGIVKSRDWALREGDVLETEDGKLLLIHLQEQKVMVLSFSEPVTEHT IELVHLGHVLGNHHWPIIVQDGKLYIQLAADIEVMESTIRDFQIPGLQIDFELRSPQQ HLDFSPHTHHDHGS" gene complement(21090..21986) /locus_tag="DP116_12480" CDS complement(21090..21986) /locus_tag="DP116_12480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132491.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="urease accessory protein UreD" /protein_id="PRJNA477356:DP116_12480" /translation="MNADERFHRDEKRYNLELRLKCVTPLGVCPSGNRFGQTILSHQY TAYPLRVSRVFYLDDADFQRAYLYVMSTSPGLLAQDELNVSLQLAPHTNVYLTEQAAT KVHSMPIGGSKATTNYEIEIGEGATLEFVPEPLILFADATLEQTISIKIHPTGRLFLS EMILPGRLARGEFYQFNHYFSRLQVSSTCGELWFTDAMRLEGKGNIFTNSNLFADSPV IGNLIIVLPETNLELLSKSVEDLEAANCSGLTVASSILPRNKGLLIRAMASGTHELKN YLKYALNCVRRCIDEPLLPNDL" gene complement(21983..22660) /gene="ureG" /locus_tag="DP116_12485" CDS complement(21983..22660) /gene="ureG" /locus_tag="DP116_12485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015956361.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="urease accessory protein UreG" /protein_id="PRJNA477356:DP116_12485" /translation="MLKSAARLGIGGPVGSGKTALIESIVPILMNQGIEVAVVTNDLL TTEDADRLKSRGFLPAERIIGVETGSCPHTAIREDPTMNLLAVKDFELLIPELDLILI ESGGDNLASTFSYDLVDSYIFVIDVGAGDDIPRKNGPGFVQADLAVVNKIDIAPYVGA DLELIRKEAPIYRRGKPIAYTNCKTGEGLDEVVDFILETVLFRSASATHQRVEEPQMD AEVDGRR" BASE COUNT 6722 a 5038 c 4577 g 6492 t ORIGIN 1 ctaatttaac gaaccgccaa gacgccaagg acgccaagaa aatcaaaaag aagatcggta 61 atcttgcacc gggaagggag taaaaagcag caagcacaat cctcctcggt taagcccaaa 121 aatcgttatc cgagcagtat tgagagccgg catattttgg ggttacggga aatatagaaa 181 tactttatgc aagggataga gagctttgat gacctcttta tactctgaaa atagttgatt 241 tgtactaatt aataattatg tagatacaat gaagtgcgta caaatagaga gagaaaattc 301 agataataaa ataagcaagc tgcttgctgt atttgagcga aaatcatcaa aaagctatgc 361 cagatgcttc agaacttctc aagagcacaa gtcattttct tgatgcttct ctctacgaag 421 tatacaaaga gattgagaca tcagaaaaat ctggtggtgc aacagtcgaa tcaaatcacc 481 aagcaattca ggatgggtta gttcaatggc gtagcttgct tccgtgtgat gaaagaggaa 541 gtcaactgat aattaagtag ttcaaatcaa tgttttacgt aactcaagct tcggaaatcc 601 aagcagttat aacccgacta gcttcgcgtc aaattctctg gttagataca gaagttgctg 661 actatgatac acgctttcca aaactatctc tgattcaggt ttcggtagaa ccattagatt 721 tagctggcga taatgtttat atattagatg tctttcaaaa gcctgatgtt gtaaaatact 781 ttatcaatca aattatggca aatgccaaga ttgaaaaggt atttcacaac tccagctatg 841 acctgcagta tctaggtaag attcaggcgc agaatattac ctgtacgtta aaaatagcaa 901 gaaagatatc acgcgatcgc ctaggaacat caaatttaaa acttaaaacc ttagccaaag 961 aactttgcaa ttttactgat gtagatgaag aggaaggaac aagtgactgg ggaagacgac 1021 ctctgagcca aaaacagcta aactatgcca agatggacac tgtttatcta gctcaagtac 1081 atctgcatct aaatagttgg acacaatcaa acggacagtc aactttaaaa tctaaaacgc 1141 cacatggatt ggatcacagc cacaatcttt attctgaacc agtgcaagac tggacaattg 1201 aagcaatgaa gtttttatct tgtgtggcgc gttgtaaaga aagatttgga atgcttcaca 1261 ttatagatgt gttgcgggga ggaaaaaccc aaaaaattac ccaacatgaa cacgacaaac 1321 tttctaccta tggtataggt aaagataaaa ctctcgatga 
gtggcgaatg ctggggcgtt 1381 ctctattgga gcaagggttg ttagaacaaa ctggcgatcg ttacccagtt ttgaaactta 1441 acaccctcag ttgggaagtt atgcacaaaa agcgaactgt gtcaataact atttatgtgg 1501 ttcaagagat gacctgggaa gatgataatg aaaaagcagc agaagtggaa atgcttatgg 1561 agaggttgcg atcgctccgc caagaacttg ctgaggaaca atctcttcca cctcacttta 1621 tctttcagaa ttctacccta gaattaatgg cgcaggtgca accccaaact aaagaggaat 1681 ttagtaaact atctggcgta ggtcatcaca aacttgtgca atatggtgac aagttcctag 1741 gtgaaatttg cgcctaccgc actgaacaag gtttactaga gcaagccgac ttagatagac 1801 aaatacaaaa agttgaattt cccaagaaca gcgaagattt tgacagagtt gataaattaa 1861 ttgagcaatt gaggcaatct ttagagagtc tgagttctga caaaatacct gaggatgtac 1921 aaaaatttct caaagctgct gcaaagagtg gggcaactat tgacctctta actctggagg 1981 tgaaagaatg gctcattcag catagactcg ctgaatcttt acgaattcgc ttgacttaat 2041 aatataggaa tccggtttga tttatcaact tactcgtaaa gacaagtcct gttaagcgtt 2101 ccctgttaag cgttccctgt tccctattcc ctgtataagt atatttgctg gggcaacttc 2161 agcgcaaaag acgctataac ttgatttcta acaaaaatca ggttattact agtgaccttt 2221 gatgcatcaa gtcatgtttg taagcttggt ttgtcggatg ttctccatca actcagtatt 2281 aagaataagg aaccgcagtg ttccgaacaa agtgctgtgt ctaaacaatt ccaagttttc 2341 tgttcaacct caaactgggt gagaaacaga tgaactttca tctgtaccta gaaattcagt 2401 ttttgaggac tctgttggaa acgggtgtta tggaaactcc ggaactttgt tttatgtctg 2461 tatggtcaat gaaaaaggtc attattgcta aattatgaat gttggaaact gggtctatct 2521 gggagtagga ctagcaatag ggattggtgt tcgtgggttt ttggcacgac cagaggtgag 2581 ttctagctca tcaacagtgt tatcgcaaaa taaacaacag gctacgccaa tacttctgca 2641 acagatgaag caaacgcagc tagcgtacca tatggcacgg gaaataagcc agtttaaggc 2701 ggggtttttg gcgcgaacga ctcatgtatt gcgatcgccc ctaaatggtc tgattggttt 2761 gcatcagtta atcttatcca acttatgtga aaatccagag gaagaacgag aatttatcgc 2821 tcaagcccac gagagagcgc taaagttgct taagctgatt gatgaaattc ttagcgttgc 2881 tagagtcgaa catggtacga acaaattaca cattcagcct ttagacttat ccgaattgtt 2941 gcaagaagtt catgatttga cttatatgct ggcggaaaat cgcaattttc cctttcaagt 3001 tctaccgcca gatccagaaa catatatact ggcagatccc aactggctgc 
gacaagtatt 3061 agtcactttg gtggagagtt gcattgctca aatggaggaa ggaagcatct gtatttctgc 3121 tcatgttgca cacacaagtg gttacgttga tatttggttg gatgtaccaa cccatgcaac 3181 tcccctaagt gagccaatag atttcatgac gtcgaaaaac aagtcacctg aagttggcga 3241 gaaaaacgat attcttttac caggaatgaa gcttctactc aatcaaactc ttttggaagt 3301 tatggcggga aaattacaaa tcgttccctt tcccgcccct caggaacaag cctcggatat 3361 gacgaggcta caagtctcta tccccctagt gattcctgaa gttgaacttc tacaggaaga 3421 gaattaaatt taggttggtt atatagcacc attgtggtgg tagaattctg ttggaaacca 3481 atcctttcgt agaatcgctg cttgtgagtt gtcatcaaat agacacgctc aacacgattc 3541 atgcgcgggt ggctcaaaac agtttccacc aacttacttc ctagtccaac accacggtac 3601 tctggatgaa tggcaacatc cgaaatggta gcgcgataaa cacaatcaga agttgctctg 3661 gcaaagccaa ttaatcgttc tccatcccaa acggtaatca cagggtcgct gtttgcaata 3721 gctatgccta aatcttcaat gctgcgattt tttgcccaaa aagcagaaat atttagcagc 3781 tgttggagct gataaaggtc gattttagac ttgcgatcgc taaatcgaat ctgattgtga 3841 ctagagttca tggacgttac cgaattgttg gagtatattc gctacctttt tatccgattt 3901 tgtaagatag tgctagacct gtatcaattg taagcttaaa aaaagttaat attccaattc 3961 aactacaaac ccatttccag aaaggatgac tttgaccttg ctgacgtatc cgaaatacct 4021 gttgttctag gcgttgcttt tcctgctgtg gtagtccgaa tagaatattc cggctctgcc 4081 aaatcagcac agcaacggtc ataccgagac aaaaaaataa accaagacta gccaagggat 4141 gatagtatag cccgtatcgc acagctaccc aggtaaaata ttgtcgccac agtgcaattt 4201 cgcctcgtaa attccataca caaagaggcg taatagtcaa ccataaaaag ccaacaaata 4261 accacctgct atacacctct agctggtgta gcttttgtac ttgttttgca aaagacacat 4321 cactttcgtc tgtgtaccag ctagtttttg gtggttcttc cggttgatcc atagattaag 4381 gagtgactaa taactaataa tgatgaatca tctgctgtgt cgattgattc acgttggtga 4441 tctccctcgc cacaaagcga gcaagggacc agcgacaaaa atactagaat aagcgcccac 4501 agcaaaacca ataatcaaag cgagggcaaa gtttctcaga gtttcgccac caaatatgaa 4561 taaggcaaac aaagtcagca gcacggtgaa agtggtgtta agcgatcgcg tcagtgtctg 4621 attgatggca tcatctacaa cttggttaat agggtcattg ggatgttggt tgagaatttc 4681 ccgaatccgg tcataaatga ctaccgtatc tgtcaccgaa aaaccagtaa tcgtcaatat 
4741 cgcaacgaca aacaggctgt caacctcaac accttgtact aatcccaaaa tcgagaaaat 4801 ccccgtggta attaagacat catgcaacaa ggcaataatc gcaatcacgg cataatcggt 4861 tttgaagcgc acgctcaagt agataacaat gccgataaac gagacaatta aagcttttaa 4921 accactggta aacaattcct gacccagagt aggaccaaca gtctcaatct gagttgcttt 4981 ggggtcaaat gctcccaatt tttcgctcaa agctgtttgt aactttgtgc gtgcctcaac 5041 atttaacgtc tcagtgcgga ctgatactgc ctgttggttt tgaccgacaa cttggatgct 5101 gctgttgcct aatccttgag catccatgac ttcccgcaca gacgaaagtt caattggctt 5161 agcacagttt tgcgcaattg tgcaatcccg ttcaaacact aagcgcgtac caccgacaaa 5221 gtccagacta ggacgaagag gtgcgcctaa cgtctgccaa gaaatcacca tcgcaatgat 5281 accgctgaga ataatagcac cggaaagtgc ccaccataga aaacgctgtt tgatgacact 5341 aattttcatt gtaccgcctc cggctcagac ttagtataca ttgccagatt aggacaaaac 5401 agttcaggct tccgcaagtt aggaaatccg attgccacca acagcaacgt acgactacaa 5461 atgatcgatg taaacaagtt cacgagcaat cctaatgcca atgtcagagc aaagcctttg 5521 actaaacctg ccccaagcca aaaaagcgct gcacaggaaa ttaaagttgt cacgtgagag 5581 tctaaaatac tagaccaggc tctgtaaaat cctgcctcta ccgaacgata caggcttttg 5641 ccagctcgta gttcttcacg agtacgctca aaaatcagca cgttcgcatc aactgccatc 5701 ccgatactaa ggatgaatcc agcaatccca ggcagagtta aagtcacagc cagtaaaacg 5761 aaggtagcat atgtcagcaa tgcgtagatg actaaagcaa tgtcagcaat tataccgggt 5821 agtcgataat acgctaccat aaagattaat actaacacca gaccgcctat gccagcatag 5881 atgctacgtt gaatactgtc tttacctaag gtagctccta ctgtacgatt ctctacgatt 5941 tccacgggta caggtagtgc accaccgcgt agctgcacgc ctaagtcatt ggcttcttgt 6001 gaagtaaatc gacctgtaat cacagcagct cctccactga taccagatgt ggcaaattct 6061 ggaccgacgg taggaaagct gataagttcg ttatctagaa aaatacctat actgcgtccg 6121 gtaccagcaa gttgtttcgt cagatcggca aatagttgac caccctgatc atcgaagcga 6181 atggctacat tccagttgtt gctttgggtt ggttcaccat aagcgtcctt gaggtactta 6241 ccaatcagtg gtgggtttgt gctttcaaat aattcagcga tcgcctcgtt actcttttta 6301 agatcttgct gatttttctc aattgcagct ttgtcgttag tctttcgtag ttctgcttgc 6361 tttgctttca attgtgtttt tgatgcgaaa aatgtattca gttgaatttc tgtatttggc 6421 
ttttgctttt taaactctaa ctgcgctgta cctcctagta ctcgttctgc ttgctgtggg 6481 tcgttgaccc ccggcaattg tactaatatt ttgtcttgac ctgcagtttg gataactggt 6541 tccgaaacac ctaaaccgtt gatgcgacct tcgacgactc ttttaacagc ctctaattct 6601 cggtcggtga tttgtttaat ttctggtgtc gtttttactt gaattgttag ctgtgagcct 6661 ccctgcaagt ctaatcccag agatgcagga attgtagcaa tcaccgcaat tgcggcgatt 6721 actaaaacta aaatcaaaac taatagcgat cgctgtcttt gcataccctt acttgctgcg 6781 acaaaggcta tgatagcgct ctttggaaga gttgtgacct ctcagttcag agttttgagt 6841 cacaactttt acaaagaatt ggggattgtt gattgttagt ctttcgctgc tgatatttta 6901 catcatcaag cactcacttc taaccagtcc ccaattctac ctcctaaacg cgcaaagcca 6961 ccatcttttc cacagcttca acgatttgtt ctggttgaac gatagttaat ctttctaaag 7021 ttccattgta aggtgtggga atatcttggg aagaaagtcg caatacaggt gcatccaatt 7081 catcaaataa gcgttcatta atagaggcag ttaattccgc tccaatgcct cctgttctca 7141 tgcactcttc cacaataatc actctatgtg tttttcttac agatgcacca atagtatcaa 7201 aatctagcgg tttgagtgat atcaggtcaa tgacttctgg atcataacct tctttttcca 7261 aagttttgac agcttgcgtc acatgatgtc gcatgcgtga gtaggtcagg attgtgacat 7321 cttttccaga acgtacaact tctgctttat ccaaagggag gagatattct ttttctggta 7381 aatcttcttt caagttgtaa agcagaacgt gttcaaagaa caacactgga ttatcatcac 7441 gaatagctga tttcagcaat cctttagcat tgtaagctgt agagcaggca acaatcttca 7501 accctggtac agcttggaag tatgcttcta gacgctggga atgttctgca cctagctgct 7561 tgcctacgcc tccaggacca cgaatcacca ttggaatttt aaagttacca ccggaggtat 7621 agcgcagcat tccagcattg ttagatattt ggttgaaggc aagaagcaag aaacccatgt 7681 tcataccttc aacaattggt cgcaacccag tcattgctgc cccaacagct aaacccataa 7741 agctgttttc ggcgattggg gtgtctagaa gccggagttc gccatatttt ttgtataagt 7801 ctttagtcac tttgtaggaa ccgccataat gtcctacgtc ttcaccaaga acaaatacag 7861 ttgcgtcatg cgccatttct tcatcaattg cttcccgcaa agcattgaag aacagtgttt 7921 ctgccattag acttttatct tgcgactatt gttctagaat cttaacgcgc atctgctccc 7981 taagatggtg agcgatgcag caaaccagct gcactctgcc acgagttgct tgtattgcgt 8041 ttcaaattat aagctatgtg gccgcaagtc cttgaacagt tagcagttac cagttaccag 8101 ttaccagtgt 
ttgttcccca taaccttgag cactgttaag agttccctga ttaatatcca 8161 acctctttta attggaaact cggtgaatca ttgacttgat ttacctggct cacctgattc 8221 accggaggag acggattcac cgaactttcg ggaaatcttc tgggtcttcc tggtttacgc 8281 ttatgtcgtt gattgataga tttttcgctg gctgcagcca gttcttctgg tgtgcattta 8341 aacagccgca atgcatctaa atattttttg ggtgtcatgg ttggttctgt acgccctgct 8401 tcccagttac gaacactagt ttcacttatt gcaagcctga aggcaacctc tgcgcggctg 8461 agtcccgcac gctctctcag gacttgcata tccatactaa atgctccctt tgcaaaacta 8521 tagtacttat ttattaaatc aattgacgta aaaaagccaa ctgagcatca tttgttaact 8581 atattatttt ataatctatt agttaggtat ctggcaaacg ggcgatcgcc ctcacccccc 8641 ctaaacttcc cgtataacaa cgtttgctct actaacactg taagtctatc tgtcatcaaa 8701 aaaatcttgg cattcaaatt tatcttccaa ttaaaggata atctcacacc cagccacaaa 8761 gacacctaga ttggctctac gcccttgcag tttggtgtga gattatcaag caaagttatc 8821 aaaacttcac gttctagcac ttgaggcaaa agtcactacc tatcgcaacc accgcgctgc 8881 atctttggca tggtaggtta aaatcaagtc tgctccagcg cgcttaaagc caatgagagt 8941 ttccatcacc acacgctgtt catcaatcca gccgtttaaa gctgcagctt tcaccataga 9001 atactcacca gagacgttat aagcagcaac aggcaggttg ctcgcttcct taacgcgcca 9061 gataacatcc atgtatgcca aagctggctt caccatgagc atatccgcac cttcggcaat 9121 atccagagca atttctttta aagcttcgcg agagttaccg ggatccattt ggtatgtgcg 9181 gcgatcgcca aactgtgggg cagaatctgc tgcatctcga aatgggccat agtaagccga 9241 agcatattta gcagcatagg ataaaatcgg ggtatcatga aatcccgctt catccaaagc 9301 gctacgaatt gcctgtacaa atccatccat catcccggaa ggagcaatga tatctgcccc 9361 agccttcgct tgagaaaccg ctgttttccc aagtaaatcc agtgttggat cattcaaaac 9421 tcgtccggtt aaatcaccca cttgtaaata accgcagtgt ccgtgacttg tgtactcaca 9481 caaacaagta tcagcaatca caatcaaatc tggtactgct tctttaacag cagtagctgc 9541 tttttgaaca ataccgcaat catgccatgc gccagtggca tcaacatctt tatctgtagg 9601 aataccaaac aaaataatcg caggaattcc taagtcataa acttcctttg cctcttccac 9661 tattttgtct atcgaaagtt gatagactcc aggcattgat ctgacctcat tcgctattcc 9721 ctcacctggt acagcaaaca gagggtaaat taaatcattg gtggttaaaa cagtttcgcg 9781 taccatgcgg cgtagttgtg 
gatgggtacg cagacgacga gggcgatggg taggaaacat 9841 aaatttttgt gaacttacta atacaaaatg gacacaaaac aaagcgttac aaataatcta 9901 cacaaacgct ctgtagtcag atctacaaaa agcgcctaaa tgtcacgaca tttgcaccaa 9961 gtcggggcaa acctccggtg gctcctcttc cctaaagctt gtaacgctgt gggtaatttt 10021 gtattttaca gctatatcgt caaacaaaaa tcttaaaaat tccctgtatt tctccctctg 10081 gacgtaatta aaaattaatg gtgatgtgcg cgtaattagt ccaaaatgaa agactcactc 10141 gtctacagtt cttttgatca atgattaacg gctaatgggt gatgacaaat aactaaaaaa 10201 cccggaaaat gaagggatag cggaagggtt tatctggatc ggtaagaaca tgaaaaatcg 10261 cccaaattgt cagtccccaa tgcgctataa accataaacc tgctagtggt aacagtacac 10321 caaaggtgat ccaggtcaat cctgtcacaa ttgctccata aacccataca ttaaaatgga 10381 agttgatgga ttctttcgca ttttctctca ccacaggatc ttcagacaca aagagtatgg 10441 cgatgggtag acctacagac acaaaagtag cgctcaagaa aattgcacca tgactcactg 10501 ctgataaaat ctttcgctta tctggatcgt acactgtttt ttctccttca gtttgacttg 10561 gttattatat aacgaccact acataggtgt tgttgcctta agattaaaag acttactaaa 10621 gttggaaaaa attaagttaa gttaagttac aagtaatcaa ataacattat gtctactgaa 10681 cttgagtcag aactacacaa agcagttgat agccgtcgca actttgcaat tatttctcac 10741 ccagacgccg gaaaaacaac actgacagaa aagctactac tgtacggagg agccattcat 10801 gaagctggcg cagtcaaggc aagacgggca cagcgtaaag caacctctga ctggatggca 10861 atggaacagc aaagaggtat ttcgattacc tctacagtat tgcagtttga gtatcaagac 10921 tgtcagataa atttgttgga tacgccggga caccaagatt ttagtgaaga tacttatcgt 10981 accttggcag cggcagataa tgctgtgatg ctgattgacg tggcgaaagg tttagaaccc 11041 cagacacgga agttgtttga agtgtgtaag ctgcgcggta tacccatctt cacgtttgtc 11101 aataagctgg atcgtccagg aagggaacct ttggaattgt tggacgaaat tgagcaggaa 11161 ttgggcctgg taacctatgc tgtgaattgg ccgattggca tgggcgatcg cttcaaaggt 11221 gtctatgatc gcaaagaaca acaaatacac ctgtttgaaa gaagcgccca cggaagtcgc 11281 gaagctgtgg atacagtcgt tgacttaggc gatgccagaa tagaagaact tttagaacaa 11341 gatctctact accaactgaa aaacgatata gaacttttag aaggagttgg accagaatta 11401 gatttggaat tggttcatca aggcaaaatg acacccgttt tctttggtag tgccatgacc 11461 
aacttcgggg ttgagttatt tttgaaatat tttcttgatt atgccctcaa acctggtcct 11521 catagtacca cagttggtga ggtgactcct atatacccag aattttctgg ctttgtcttc 11581 aaacttcagg caaacatgga cccgaaacat cgcgatcgcg tcgcttttat ccgtgtttgt 11641 acaggcaagt ttgaaaaaga tatgacagtc agtcacgccc gtactggcaa aacgattcgc 11701 ttgtcccgtc cacagaaatt atttgctcaa gaacgggaat ccatagatgt tgcttacgct 11761 ggtgatgtga ttggtttgaa caatcctggt gtgtttgcga taggtgatac catttatacc 11821 ggacaaaaac tggaatatga gggaattcct tatttctcgc cagagttatt tgcagttctg 11881 cgtaacccca acccttctaa attcaagcag tttcaaaaag gtgtttccga attgcgggaa 11941 gaaggggctg tgcaaattat gtactccgtg gatgaagcga aacgcgaccc aattttggct 12001 gcagttggtc agttgcaatt tgaagtcgta cagttccggt tgcaaaatga gtatggagtg 12061 gaaactctgc tagaattgtt gccttacagt gtcgcccgtt gggttgaggg cggctgggaa 12121 gccttgaata aaattggacg tgtctttaat acaacaacag taaaagacag catggggcgt 12181 ccagtgttat tattccgcaa tgaatggaat tgtaaccaac tgcaggaaga ccatccagaa 12241 ttaaaattga gtagtattgc tcctgtggtt tctggacaag cgacaaattc aaatagttag 12301 tggttagtgg gtagtggtta gtggttaaca aacgactaac ccgactcccg tgtgaaccgt 12361 tgctcaacac gggagtcggg ttaatacggt tcggttaaga ctgaagttta gaccctacat 12421 ttcgcactta agcgaacaaa ttgatggtat tttttattga tgcattacgc aacaccattt 12481 ttttatgagg ctctaaagta gttatatttt aaatgatctg ctaaacgcag tgctaacgca 12541 acaatcatta acgtagaatt agcagtacct gaagtaggaa atgtagagct tcccgcaaca 12601 aacagaccat ttacttcatg aacctgagaa tttacattca caacaccttt cttcgggtta 12661 gctgacatcc gcgttgtccc cgtgtgatga gctttatcct caaagttttg cgtccaattt 12721 tcttccttgt ttaaccaact agaaagatgc ggtgcaggaa agccaattct ttgaaattct 12781 tgcgtaatca attgactcat tctttgagca gtttgtcgtt ctaacgaatt tactttccaa 12841 ttgatcttgg ctagtggcat attcaatcta tctttttggt tggtagaaag agtaatacga 12901 ctttctggat caggtaactg ctctagcata cagtgaagct caagacgttc aactttgagg 12961 agtggaggac ggcgcttgac ataacggcgg tgaaagcctg agcttatctc cctaaaatta 13021 gacaatatgg tgaatatatc ttgagttgat tgcttaatac actcccgttt tcgtaaagta 13081 gttttaagtc gcctcattgc tgaccaagaa tcgtttggta gtggattgta 
ttcttctaga 13141 taagcatcac atctcaatag gttttcttgc tcctgaattt ttggacttag tgataatcca 13201 tgtagataaa catgagtccg ttgtttattc tttatcaaat agtgtccaaa tctatttcgt 13261 acactgtagg cggcgtgatg atcaaaagaa ccaatgacac acaaactatg atccattaaa 13321 aatctaccaa ccacatcatt ttgattccct attccattag gaatggttcg gttggaggct 13381 aaaagtagtc gtgcattttc aataccgcca caggatagca ctaaagcttt tgctttaaca 13441 aacgcacgct taccctctaa ggttctcact tcaactgatt caaagcaagt ccccaattca 13501 tttgtattaa tatgggttac attggcatgc aaaagaactt caatatttgg actattatca 13561 atgaagaaat cgcgtccaaa ccttattggt tctcctcgtt ccttaggact cttactgaat 13621 tgccagaatg tcgtgtctaa aaattttgag tccagattcg gtgtagcagg caaaacttta 13681 aacaacttcc aaagttgatt atcataacaa tttggtccaa gtcccaaatt ttttcctgct 13741 cgttctaata atggttctaa ttcctcgtat ttaagatccc atcccgagta ggggatccaa 13801 gatcgttttt gaaaatcagt gggattaaaa ggggcgcatc tacctgtcca tgtatgtgaa 13861 gttccaccat aaaggcgatt acgagtattg tcttttcgtg aaactcctac attttcgatt 13921 ttacataaag cttccgtatc actctctttc tcaaagccac cgctttctag aatccacact 13981 ttgatcttcg tcccagcaaa ctcttttgca atagatagac cagccggtcc gctacctaca 14041 atacataaat ctgtttcaat taacagatta tcctcacact cattaagatc agaaaatctc 14101 atattaattt cttgcttatc cctgatttac gaaattgaaa aagtcaatga tttgcaatgt 14161 ctatcagcca agttctaaga gaatgtttga aaagtcatta gggaaaaaag tcatgccaat 14221 cacctcacca caatacgcat catggcaatg tagacaaata tttctactgt ttggaggtaa 14281 tagcttgtag tcttcgttca aaccccgata ctggttgagc aagcttttgg taccctccac 14341 tacccactgt ttaggtaata agacaaaacg cttgaaccct tgcagtcgca gcagttccta 14401 tacaatccag tgacaagatt catcacccat tgcataaacg gattgccatc ataaaccgcc 14461 atctacccag atggttacaa ccgagacaca ggttttgcca tttgtttgac cttctgaagt 14521 gcctgtttag ctccctctag cttcgttcaa actgctcttg ggtaagattg atcgagtatg 14581 attgagttat ggctctctgg gtgctgtact acgtatttgt agctgacacc tcaagagctt 14641 tttttactat gctgccaact ttgcaaacat cctctaagag cgcgaaatat tatcgaacgc 14701 actccattaa gaagtttaat tgaaattcat gatcaatggg agcgttgata cagaaaagat 14761 atcagccgct gggatcacaa aatatttatt 
caactttatt gtcaactcct taattaaata 14821 tttttagcac aaaatcccaa aaataataca ctcaaaagac tttctttatg attgctttat 14881 cctgtgacaa ttaaattgaa aggttgtgct aagaagttta atttaaattc atgatcaacg 14941 agagtgttga tacagaaaag ataccaacag ctagcatcac aaaatattta ctcagctttg 15001 ttatcactcc ttaactaaat attgttagga gaaaatacca aaaagaatac agtcaaaaga 15061 cttcttttgt gactatttta ttctgtgaga attaaattga aagcttgtgc aaccactgtt 15121 aatttttaga attatcatga attcttggac aaatcaatac tttgtataag atatctactt 15181 atcctactta tcctacttat cctacttatc ctacttattt tatttatccc acatatctta 15241 cttatcccct tttctcagtc agataaaagc ttaataaaat ataatattca ttgatttcag 15301 tttgtgctaa ttagatgagg cgtcctgata tcttggggaa actttttcac agattttata 15361 ttagtgtttg ataataccca taacacaagt cactataggg gtattgccat agttgccctt 15421 ttaagttagc tccactcagt tcaaacacat tgcaaataat tgttgcagaa tttggttttt 15481 ttgtttatta acttgccaat aaatgctata gcagacatag gtaaattccg catcagaaac 15541 aagtcattaa tttgaggact gagccaatga aaacgctgtt gctctaccct tgttttcccc 15601 agtctttttg gtcttatgac cgatttttaa aaatagctgg acttaaagct ttcattcctc 15661 cattgggaat tattacagtt gccgcattgc taccaaaaga ttgggaaatt agattttttg 15721 accgtaatgt caacccagaa acagatgatg attggcagtg gtgtgacatg gttattctgt 15781 ctgcaatgtt ggttcaaaaa cctgacttcc acgccctgat tcaaaaagct gtgcgattgg 15841 gcaagaaggt ggtgtgtggt ggtccttatc caacgtcgat tcctcaagat gctcttgctt 15901 ctggagcgca ttacctagta ttggatgagg gagagctaac tatcccgcaa tttttggaag 15961 cacttgccca aggtaaagac caaggaatct ttcgcgctag tgataaacct gacgttacaa 16021 aaagccccat tcctcgcttc gacctgctga agctagatgc ctacttcatg atggcgattc 16081 agttttctcg gggctgtccc ttcaactgcg aattctgtga cattatcact ctttatggtc 16141 gcaaaccgcg tactaaggaa ccgagtcaaa ccttagcaga gttgcaaacc ctctatgact 16201 taggctggag aggatcactt ttcattgtcg atgacaactt tattggaaat cagcgcaatg 16261 tcaaacggtt tctgctttca ttgattcctt ggatgaaaga acgcaattat cccttcactt 16321 tcatcactga agcttctgtc aatctggcag aggatccaga actgctgcat ctaatggtgg 16381 aagctggctt caacgctgtc tttctcggca ttgaaacccc agaccaagat agcctcaaag 16441 tcactcacaa 
gtttcaaaat acccgcaatc cactcttaga agcctgtcgc cagattaact 16501 cagcaggatt actgatttat gctggattta ttctcggctt tgatggagaa cgttctggcg 16561 ctggcgaccg gattcaagct tttgtcgaac aaagcagcat tcctcaacct atgcttggga 16621 tactgcaagc gttgcccaat actgctcttt ggaatcgcct caaagcagaa caacgtttat 16681 tagaaggcat cggagttgta gaggtgggag accaaaatac tttgatgaac tttaaaccta 16741 cccgaccact caccgagatt gctagagaat atgtagaagg tttctggacg ttgtacgaac 16801 cagctaacta tctcagacgc tgctttcaac aatgcctcaa cattgggctg cctccagaaa 16861 aaaggcagac catgcgcttc cctgcgggca agggactgcg actagtagca caattaattt 16921 ggcatcaagg ttggcaacgc tctgaaattc gcctgcaatt ttggcaacaa ctgtggacaa 16981 tcctacgtac taaacctcaa gttctcaata tgtatttggg cttgtgtgct gctggggaac 17041 atttctggga gtaccgtgtt ttagcgagag aacgcattgc ccagcagcta ggctacgatc 17101 cactaacagt gcctggatcc ttgacactgg aatctataaa aagctatgat tgtactaaaa 17161 caaaaaataa cgctgtgtca taggtaaaga ggtagcaggt tcacaagtta acaacttccc 17221 atctaccctg acatggaaag tttctggatc gacctcaatt tgtggggtag cgtcgttgag 17281 tttcaaatca gcttttctca actcacgtgt tttggaaact gctaccgtcc gagtttttaa 17341 accaagtttt tcgggtacac cctgtttcaa agcttcattg gaaacgaaag ttatagaagt 17401 ttctgccatt gcaccgccaa agctaccaaa catcggacgc atatgaactg gttcgggagt 17461 ggagatacta gcgttaggat cgcccatttg tgaccaagcg atcattcctc ctttgatcac 17521 agtctctggt ttgacgccaa aaaatgctgg tttccacaaa cacaaatctg ctagcttccc 17581 ttcttcaacc gaacctatat gatctgcaac cccatgagca atagcagggt taatcgtgta 17641 tttagcaatg tagcgttttg ccctaaaatt atcatgggtt tcatgaggat gaccaggatt 17701 cggtaataat cctttctgta ctttcatctt atgggcagtt tgccaagtgc gagtaatcac 17761 ttcccctacc cttcccattg cttgagaatc ggaagacata atactgataa tccccatatc 17821 atgcaaaatg tcctcagccg caatcgtttc cttgcgaatt ctcgattcag caaatttgac 17881 atcttcacga atgtttttat caaggtgatg acacaccatc aacatatcga gatgttcgtc 17941 aaaagtattg atggtgtagg gacgagtggg attagtagaa gcgggtaaaa catttgctag 18001 actacagacc ttgagaatat caggcgcgtg accaccacct gcaccttctg tatggaaggt 18061 gtgaataaca cggtttttaa aagcagcaat ggtattttcc acaaatcctg cttcattcag 
18121 ggagtcggtg tggattgcga cttgtacatc gtactcatct gcaaccgaaa gacaagtatc 18181 aattgcattc gcggtggttc cccagtcttc gtgcagcttt aaacccataa ctcccgcagc 18241 aacttgctct atcaatcctt caggtttgct agtattacct ttgcccaaga aaccaacatt 18301 gacagggaag ccatcagccg cttgtagcat gcggtaaata ttccaaggac caggggtaca 18361 agtcgtcgcg agagtccccg tagcagaacc agtaccaccg ccaattaaag ttgtcgtacc 18421 agaggcgatc gcagtttcga tttgctgagg acagataaaa tgaatgtgag tatcaatcgc 18481 cccagcagtc agaattttgc cttctcctgc gatcgcttcc gtcccaggac caataataat 18541 atctatattg tcttgtgtgt aaggattacc tgctttacca attttgaaaa tcttcccgtc 18601 tttaatacca atatctgctt taatgatgcc tgaccagtcg agaattaagg cgttggtgat 18661 caccacatct accgcaccat cggcatttga aatcggggat tgtcccattc catctcgaat 18721 caccttacca ccgccaaatt taacctcttc cccgtaaacc gtcccaccat aggtagtata 18781 gtcttgttcc acttcaatga acaaatccgt atcagcaaga cggatgcgat cgccaatcgt 18841 aggaccaaaa gtattcgtat attcccgtcg gttcataaaa taactcatgt ttcttttgct 18901 aagtagatca acatcgaaaa acataaaata gagtttttgc gttcgcgaac gctcatgctg 18961 agcgctttgc gcttacgcaa tgacatttta cgtttaatta ggttgaacta cttacatcaa 19021 gttaaaaatc gtgagttgaa aatttgaaac gttcaggact taggcactag gttttatcga 19081 cttttatcgt catttttcgt gaaacttgac atcagttata ctggttctag gatatatcag 19141 ccattcgtca attgtccgac acacccgatg caaatcaccc tcaacctcga cgaatccctt 19201 ctcaacgaag cctttcaact tactaacctt actagccaag aagaattggt gaacctggct 19261 ttgcaagaac ttgtgcggtt gcgccgcaag aaaaacctcc tcgaccttgc ggggcagatt 19321 caatttactg aggacttcaa tcacaaagcc ctgcgtgaag ctcgtcatgc tgctgattga 19381 tacgtctgtt tggatcagcg tgttccgaga tagcagtggt cagataggta agcaacttga 19441 aaccttgatc gctgagcgtg aggttttact cacgcgcttt actcagcttg aactgttgca 19501 aggtagcttg aatgaaaaag agtggattct cctgtccatc tacctcgaaa cacaagatta 19561 tgttgaactt acgagtcatt cctggcaagc ggctgctcgt atctactatg atttacggcg 19621 tcaaggactc acagttcgca gtccaattga ttgctgtata gcgcaatctg ctttggagaa 19681 tgacttactg ttgattcaca atgatcgcga ctttgaaacc attgctcaag tgcgacgatc 19741 cctccaacac ttccgctttc aaccttgagt ctcacgaaaa 
atgacaagaa agcagaggaa 19801 atagggggaa caaagagctt cgcgtccccc aacacaccaa acctgcaacc gctttcatta 19861 aaaacaacta actagcaaat aatctctgag ccagtttttg atgtcgcatt tgagcaatat 19921 ctatagccgg agtacaagac cacatttcct ccagactcat tgatgcagca gtttcgtagg 19981 ctatctcaat ctctggtgct agctgaagta aaacccgttg tgcttgcaga tgtcccacaa 20041 cacttaaacg gatagcagcc cccaacaacc cagtcacgaa gctatgcaaa aaacccagca 20101 ctgtatcctg ttcactcaaa agagctgccc taccaactac agcaaagata actggatgca 20161 aacaatgaat tttaccaata gccgcatctt gatttaatat ctctaattga gagtcttgcc 20221 aagtcgaact agcaaccatg agtaaggcgc gtccgctttt gcgctgggtt tcgcgtgttt 20281 tttctacgag ggtttgagca aatagtcgag catctgcggc tcgtactgct tttaaatcag 20341 cgatcgcact tccccggtaa gcgtggataa gtgctactgt atctgtaact cctaccttgt 20401 tacgcaatag tagccgcaga aaagtttgca aatcttctga cccacgaagt tgattttctt 20461 ggactaagga ttctagcccg tgagataaag tgaaagagcc agagggaaag aaagaatcag 20521 acaactgcat taaagcgagt tgttgggcga tcgcagatgc aggtaaactc atgagccgtg 20581 atcgtgatga gtgtgtggag aaaagtcgag atgttgttgt ggcgaacgca gttcaaagtc 20641 aatctgcaag ccaggaatct gaaaatcccg gatggttgat tccatgactt ctatatctgc 20701 tgctaattga atgtagagct taccatcttg cacgataata ggccaatggt gattacccaa 20761 gacatgacct agatgaacca attcaatagt atgctcagtc actggctcac taaaactaag 20821 taccatcacc ttttgctctt gcagatgaat cagtaatagc ttaccatctt ccgtttctaa 20881 aacatctcct tcccgcagtg cccaatctcg acttttcaca attcctacag ccgcaccagt 20941 ggtagtgtga gcgtgaattc gacctttgcg gctatcagtt tgactgatat gaacttccaa 21001 gcaaagtgct gaggaacgag ctttttctat ccgttcagat aagctgatat tttcttttga 21061 attgcccagg taaatttctg ccagttcagt cataaatcat tgggtaaaag aggttcatca 21121 atacagcgac gcacgcagtt taaagcgtat tttaaataat ttttgagttc atgagttcca 21181 ctagccatag ctctaatcag cagacccttg tttcggggta agattgagct tgctacagtc 21241 agaccggagc agttagctgc ttctaagtct tctacacttt tactcaataa ttcgagatta 21301 gtttcaggta aaactataat taaattacca ataacaggtg aatcagcaaa taggttactg 21361 ttagtaaaga tatttccttt tccttccaag cgcatagcat cagtaaacca tagctctcca 21421 caagtagaac tcacttgcaa 
gcgactgaaa tagtggttaa actgataaaa ttctccccta 21481 gccagccttc caggtagaat catttcactc aaaaacaatc ttccagtcgg atgaatcttg 21541 atacttatag tttgctccaa agtcgcatca gcaaacaaaa ttaatggttc tggtacaaac 21601 tctaacgttg ctccttcccc aatttcaatt tcataattgg ttgttgcttt tgagcctcct 21661 ataggcatcg aatgaacctt agtagcagcc tgttccgtaa gataaacgtt agtatgaggt 21721 gctagctgca aagatacatt caactcatct tgtgcgagta aacctggaga tgtgctcata 21781 acatagagat aagcacgctg aaaatcagca tcatcgagat aaaatacccg agataccctc 21841 aaaggataag ctgtgtactg atggctgaga atagtttgac caaagcgatt gcccgaaggg 21901 cagacgccta aaggcgtaac gcacttcagt cttaactcca agttatagcg cttctcatcc 21961 cgatgaaatc gttcatctgc attcatctgc gtccatctac ttcggcgtcc atctgcggtt 22021 cctcaactct ttgatgtgtt gcggaagcac tacgaaacag aactgtctcc agaataaaat 22081 cgaccacctc gtccaatcct tctcccgtct tgcaattagt atatgcaata ggtttacctc 22141 gacggtatat cggcgcttct ttacggatta actccaaatc agcaccaaca taaggagcaa 22201 tatcaatttt attcaccact gctaaatcag cttgtacaaa gccaggacca tttttgcgag 22261 ggatatcatc tcccgcaccc acatcaatca caaaaatata cgaatccacc aaatcatagc 22321 taaaagtgga agcaagatta tcaccaccac tttcaatcaa gattaaatct aattcaggaa 22381 tcagtaactc aaaatccttc actgctaaaa gattcatagt tggatcttcc cgaattgctg 22441 tgtgcggaca gctacctgtt tctactccta taatccgttc cgcaggcaaa aacccccgac 22501 ttttgagtcg gtctgcatct tccgtagtta gtaagtcatt ggtgacaact gctacttcaa 22561 tgccctgatt catcaatatt gggactatag actctattag agctgttttt ccgctaccta 22621 ctggtccccc aatccccagt cgagctgctg actttagcat cgctaatcta tcctgcttga 22681 actcttggag actgctagta taattagact ttagctatct tcattattct acagtagtcg 22741 ttacatacaa acgcaggtag atcgcgctaa atccttaact cacatcttgc acctggaaat 22801 atggatgtca aaaccaaaac tacgtgcaa // LOCUS NODE_1384_length_22779_cov_4.91669622779 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 22779) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 22779) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..22779 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..314 /locus_tag="DP116_12490" CDS <1..314 /locus_tag="DP116_12490" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12490" /translation="AAFAGQNQSVDQYTEQNGAAVNGSSVTQTSITNSRQGQNARTTR PGYGYGYYGGSRRTPQGQGARQTTIQNGSAVDGSTTDQYSETNNDQRQNATNRRSLYH R" gene 427..762 /locus_tag="DP116_12495" CDS 427..762 /locus_tag="DP116_12495" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12495" /translation="MVKKTLTLGLLVAAVMVIPTVAFAGEAVTKQEINQNSTVGGENS RVNQAADQYSIQQKTGYPRSGRQNAKQRIDQNGAAYDNSSVDQYASQKSEQLQRNRQN RGSYIPYRR" gene 1067..1441 /locus_tag="DP116_12500" CDS 1067..1441 /locus_tag="DP116_12500" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12500" /translation="MPVRQTISLLVVLSALIALPAKIAIADDIDVEGSTSRVTIDEDG DITIRDTRPTYRTTYPSRRIPNSRRNIRQYSLDQRLRSLRYPRTTQSYCNGRRVVRTN TVRSGSNSANSTYSSTTTTTCQ" gene 1458..1838 /locus_tag="DP116_12505" CDS 1458..1838 /locus_tag="DP116_12505" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12505" /translation="MKSKLLSLGFFSLTAVSPFFIPSLTPSASAGCVGVTAGTQIAIS REASQSTRVNRQSSPGCSGSTSVSTGKQVCINNDGACEQRRTVNQNFQGGNRRGGYDP VKDINVDVNVPVHVKSPRAPFSNR" gene 2219..2755 /locus_tag="DP116_12510" /pseudo CDS 2219..2755 /locus_tag="DP116_12510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317405.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(2906..3172) /locus_tag="DP116_12515" CDS complement(2906..3172) /locus_tag="DP116_12515" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12515" /translation="MGDFSGGWQIFHMPTARLTAIVVTQPYGKPSTGFRASPIPALHS LDAFVGLASREEIEVCLWYSTQAADPEIRKNSLLVRKPKSDSWK" gene complement(3362..3814) /locus_tag="DP116_12520" CDS complement(3362..3814) /locus_tag="DP116_12520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015122696.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12520" /translation="MFRILVDADLILEALMNRNGSVGEVSELLDKVHPWIQMYITDIG WQKIYTYASRLRNTNIAELVVNWLQDKIQVCTIDQTILQQARFSPLQDFESAVELVCA SYEKLDAIVTHNYDNFAQVRNKFWIWSVAELWVRANLESQLGTTRSIS" gene complement(4767..5996) /locus_tag="DP116_12525" CDS complement(4767..5996) /locus_tag="DP116_12525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873680.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="murein transglycosylase" /protein_id="PRJNA477356:DP116_12525" /translation="MQPLGHLALSPPECRVTKWQLPVSSQTQLQDTIVQQQTAPPPLV PIQKSPFPCCEGDSSCLDNQLWGENGQSPDKKALLTSVDRSLQYLFSSDAAAAYQQYQ VVGMTRDHVIKSLQRFRELVLKSKSAVELNAAIAKEFVFYQSVGGDKKGSVLFTAYYE PVYTASRVPTLEYRYPVYRIPPDISSWQKPHPTRLELEGADGLQASIGRLRGLELFWF RDRLEPYLAQIQGSAKLQLPDGTQTTIGYAGNTAYNYKSIGRELANDGKLPLEGLTMP KILNYFQKYPQQLNVYIPRDPSFVFFQENHGAPAQGSIKVPLTAERSIATDKSLMPPG ALALIRAPFPFVNSVTKQMEHRIVSRYVLDQDTGGAIKGAGRVDYFLGTGKIAGDRAG VTVSYGQLYYLLLKSHQ" gene complement(6366..7691) /locus_tag="DP116_12530" CDS complement(6366..7691) /locus_tag="DP116_12530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195964.1" /note="involved in light-induced Na+-dependent proton extrusion; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="proton extrusion protein PcxA" /protein_id="PRJNA477356:DP116_12530" /translation="MISSVFRQKIYPFLVASYRWYLRTSERSLDEAYKAALQIKAIED EYFNGNQIDFGSNKYSGSVMDYFESELKKQLKIVRMRLTEFKASRFFQSESNQKVAKE FAIEYPQPALILEKLKFIDEVTAKYTRLDELHSNHPINREKSLTRVDPSTNQLKQNNL SIQRLDNKTPQSKTDTIGVLPRSIFSTFSRLQVELDPNAEQDVVKNFRNTQRRTIISV RFLLLLIIVPILTHQLSKALVVGPIVDRYLEKNSAEETAIFLNYEMEEEALEELERFE ERIKFDNLLSNTEPLSLEQIENQIKTKAHEIAEGFRGSSSNAIKNVFADILSVITFTW LLLISKKEIAVLKDFFDHIVYGLSDSAKAFIIILFTDIFVGFHSPHGWEVILEGISRH WGLPANHDFIFLFIATFPVILDTIFKYWIFRYLNRISPSAVATYHNMNE" gene 7779..10025 /locus_tag="DP116_12535" CDS 7779..10025 /locus_tag="DP116_12535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317400.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="recombinase family protein" /protein_id="PRJNA477356:DP116_12535" /translation="MIWDSLWVTGSSRSGKTARLVKEFCNWVLSETSSKESYYTKNKG EKNSENIPRRLYLHQTEPGVLILAANDENRRDLADRIVRATAGKYPVRCKTPLGFIQD EVTLFWPLLIQLLKLKAQFPVRLRPETEQELATKLWRPQLDEETLRRAGDPEYRLVRR ILDLLQLGAYSGIACEKIDDVLLSAIAQQENSTNLEPELIISLLLNWRNWCLDRGFLT YGIMTEVYGKDLLPNPIYQQQLTKRYQAVLADDIQDYPALSRHLFELLLNSGAVGAFS YNPDSVVRLGLGADPNYLEGLSAHCRQEILTQPPFLSIGEASTIPMLELITEQIIPLS LPAAVQSIQTISRAQLLRQTAEVIVKAVKDRHIQPNEVAIIAPGLDAIARYTLTEILT KQNIQVQPLNDQRPLISSPIIRGLLTVLALVYPGLGRLVDRDAVAEMLVVLSRGVSVI RGEKEKEQNPHSLVSSSPSFPHIDPVRAGLIADYCFVPHPEEPKLLPVTAFDRWDRLG YAATTAYGNILQWLENQKTQQELRLIPTPISLLDRAIQRFLWNGNNLPYDQLAALREL LETAQHYWEIDTRLQRTPFVSPPLKEETQGSGTSPHSTIGEFIQLLRRGTITANPYPV HPIGPKSNAVALATIFQYRSSRRSHRWQFWLDTGSPLWAKGGAATLFGAPFFLQERLG EPWTVEDEKSAEEQRLRTILADLLARVSQRVYLCHSDLAVNGQEQLGPLLPLVHACVS VVSDPAVV" gene 10212..10874 /locus_tag="DP116_12540" CDS 10212..10874 /locus_tag="DP116_12540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318025.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptide-methionine (S)-S-oxide reductase" /protein_id="PRJNA477356:DP116_12540" /translation="MVLFGFGKKLSLPTPEEALPGRAEKMPVPDRHYVNGHPLKPPFP EGMEIATFGMGCFWGAERKFWQLEGVYSTAVGYAAGVTPNPTYKEVCSGRTGHNEVVQ VVFDPKVISYSELLKVFWENHNPTQGMRQGNDVGTQYRSGIYVYSENQRKLAEASQQA YQKALSASDYGKITTEILDAPEFYYAEEYHQQYLAKNPNGYCGLGGTNVACPVGIAEL NA" gene 11210..12115 /locus_tag="DP116_12545" CDS 11210..12115 /locus_tag="DP116_12545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743179.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_12545" /translation="MLTTKSVTGNLPKSSELNQRLNNKTLMLSSRQMGWNGILVEQYQ NLPAPAEKELSALSTHWLILPGHPGHLHWKFDDRLRESIFQKGDSLLVPAGQSGYWRC QNSKSSQTELHIHLQPELVEQVTQASEMDTERLDLVNHFCKQDLHLQHIAMLLLAELC SDGMMGQLYVESLTQVLVIHLLRHYSKSAQIITSENRSLTHAQLQQAIDYIHTHLNRD LSLAEIAEVINISPTYFASLFKRATGISPHQYVIQQRVERAKSLLSKTDLAIADIALQ VGFSSQSHLTQQFKRLTGMTPKQVR" gene 12303..12701 /locus_tag="DP116_12550" CDS 12303..12701 /locus_tag="DP116_12550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006514327.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RidA family protein" /protein_id="PRJNA477356:DP116_12550" /translation="MSKKLINPAALYDGAPSGMSQATVDTKSGLVFVSGQVDWNHQYS TTEQTVEGQFKKSLDNLKIALEEAGSSIEQLLQVRIYIRGELGDYMEVLAPIFSNFLG ESRPAVTGIGVASLASPETLVEVEAIASIR" gene 12922..13131 /locus_tag="DP116_12555" CDS 12922..13131 /locus_tag="DP116_12555" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12555" /translation="MALVTLASVANPHPFFIVGDKTSLIFTRGMHKPSTPIPIKMQKR KNESKTPNKLTADKDTNQNQNYFMR" gene 13137..14231 /gene="aroF" /locus_tag="DP116_12560" CDS 13137..14231 /gene="aroF" /locus_tag="DP116_12560" /EC_number="2.5.1.54" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120492.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-deoxy-7-phosphoheptulonate synthase" /protein_id="PRJNA477356:DP116_12560" /translation="MIVVIKSQTPASEIERIIQELHQYSVTTEKVIGKDKVVIGLVGD TSVISPEHVQQISPFIEQVIRVKQPFKRVSLEYRYGEPSEVVVPTPNGPVTFGQNHPL IVVAGPCSVENEEMIVETAQLVKATGAQFLRGGAYKPRSSPYAFQGHGESALSLLATA REVTGLGIITEVMDTADLDKVAEVADVVQIGARNMQNFSLLKKAGATGKPILLKRGLS ATIEEWLMAAEYILAAGNPNVILCERGIRSFDKQYTRNILDLSVVPVLRGLTHLPIMI DPSHATGYSRFVPTMAKAAIAAGTDSLMIEVHPNPSKALSDGPQSLTPPDFVELMQEL APLAQFFGRGTKSAFTTTPFATKTHSTAGV" gene 14782..16977 /locus_tag="DP116_12565" CDS 14782..16977 /locus_tag="DP116_12565" /EC_number="4.1.3.27" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120490.1" /note="trpE(G); catalyzes the formation of anthranilate from chorismate and glutamine; contains both component I and II; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anthranilate synthase component I" /protein_id="PRJNA477356:DP116_12565" /translation="MSFDTQSYTTKGGIFVSRSVTQTSIDSAIDEVLMRLDSQRGGLL ASSYEYPGRYKRWAIGFVNPPLELVTCDNTFTITAHNERGKILLDYVADCLLDQPYLQ AVSKENKRIIGSVKPTERLFSEEERSRQPSVFSVVREILSVFHSPVDENLGLYGAFGY DLVFQFEQMPKHLERSADQRDLVLYLPDELLVVDYHLQQAFRVQYDFVTKNGSTRDLP RTGEVIDYRGQRLSVTQGSDHAPGEYEQQVIKALDYFRRGDLFEVVPSQSFYQTCEKS PTELFRTLKEINPSPYGFIFNLGGEYLIGASPEMFVRVDGRRVETCPISGTIRRGETA IDDAAQIQKLLNSRKDESELTMCTDVDRNDKSRICEPGSVRVIGRRQIELYSHLIHTV DHVEGVLQPEADALDAFLTHLWAVTVTGAPKRSAMQFLEQNERSPRRWYGGAVGGLSF KGNLNTGLILRTIRLKDSVAQVRVGATVLYDSEPEAEAQETITKASALFQTLRVVEHK SGDLSSLSGLDIQEANPGVGKRVLLVDYEDSFLHTLANYIRQTGATVTTLRHGFAESV FDTEKPDLVVLSPGPGKPSDFRVAETIAACKKRQIPIFGVCLGLQGIVEAHGGTLGVL DYPQHGKVSMVSVVDPDSVTFKGLPQSFEVGRYHSLYALPESLPAELKVTAMSADGVI MGIEHRTLPIAAVQFHPESIMTLAGGIGLTIVKNVVREFMATIPSSAIA" gene 17218..18066 /locus_tag="DP116_12570" CDS 17218..18066 /locus_tag="DP116_12570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317605.1" /note="Derived by automated computational analysis using gene prediction 
method: Protein Homology." /codon_start=1 /transl_table=11 /product="indole-3-glycerol phosphate synthase TrpC" /protein_id="PRJNA477356:DP116_12570" /translation="MLNYHHQPTQATASTKPRHILEEIVEHKQQEVALMKEQLPITEL KHQISLAPPVRNFLNAIRQNSTPPSIIAEVKKASPSKGVIRADFEPVKIAQAYERGGA TCISVLTDEKFFQGGFKNLQLIRNNVGLPLLCKEFVIDPYQIYLARVNGADAVLLIAA ILSDETLQDFLHIIQDLGMTPLVEVHTLTELDRVLALSNVQLVGINNRNLEDFTLDIE TTKRLLVERQQKLSDLDITVVSESGLYTSSDLAFVAQAGAKAVLIGESLVKQADLEQA VKELRM" gene 18136..18951 /locus_tag="DP116_12575" CDS 18136..18951 /locus_tag="DP116_12575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408006.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tryptophan synthase subunit alpha" /protein_id="PRJNA477356:DP116_12575" /translation="MTSISEHFESLRTKNQCALIPFITAGDPDLETTAEALGILDANG ADFIELGVPYSDPLADGPIIQAAATRALKGGTRLAHVLEMAQSATRSLRSPIILFTYY NPILSLGIQQFLKQVADSGIKGLVVPDLPLEEADDVLKSADSVGVEVTLLVTPNSSKE RIEAIARQSRGFIYLVSVTGVTGVRSQVQTRVKDVLHEIRSVTNKPISVGFGISTPEQ AHQIKDWGADGVIVGSAFVQRLAETTPDKGLRAIKDFCGDLKAAITPTAGIKN" gene 19077..20303 /gene="trpB" /locus_tag="DP116_12580" CDS 19077..20303 /gene="trpB" /locus_tag="DP116_12580" /EC_number="4.2.1.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120488.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tryptophan synthase subunit beta" /protein_id="PRJNA477356:DP116_12580" /translation="MLADTPRTTQQPDALGRFGQFGGKYVPETLMPALSELEAAYQKY RNDPDFQNELQGLLRDYVGRATPLYLAERLTAHYTKPDGTGPQIYLKREDLNHTGAHK INNALAQVLLAKRMGKQRIIAETGAGQHGVATATVCARFGLKCMIYMGVHDMERQALN VFRMRLMGAEVRPVEAGTGTLKDATSEAIRDWVTNVETTHYILGSVAGPHPYPMIVRD FQAIIGKETRSQSQEKWGGLPDILLACVGGGSNAIGLFHEFVNEPSVRLIGVEAAGEG INTEKHAATLTLGRVGVLHGAMSYVLQDEDGQITEAHSISAGLDYPGVGPEHSYLKDV QRAEYYSVSDAQALEAFKRVSLLEGIIPALETSHAIALLETLCPQLTGSPKIVINFSG RGDKDVHTVAKFLDTF" gene complement(20426..21316) /locus_tag="DP116_12585" CDS complement(20426..21316) /locus_tag="DP116_12585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012789645.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NmrA family protein" /protein_id="PRJNA477356:DP116_12585" /translation="MKQTILVAGGTGNLGGRIVKALIKRGAEVRVLSRNEIDPVKIKK LTELGAEVITVDMSDVEELKNACQGVSCVVSALAGLHDVIVDSQTLLLDAAIAAGVPR FIPSDFSSDFTKMPEGENRNFDLRKQFHKYLDKSPIAATSILNGAFSDILSYNTPLYN PNDHSVAYWGDNPDWKVDFSTMDNTADYTSAAALDSATPKILRIASFQISPNELIAIG QEVKKTNFKLVPMGSLEDFSVSNKRERAAHSEGEKQIYPHWQASQYLYSMFSVQNKPL DNERYPDFQWTSAIDVISKI" gene complement(21729..22613) /locus_tag="DP116_12590" CDS complement(21729..22613) /locus_tag="DP116_12590" /inference="COORDINATES: protein motif:HMM:PF12833.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_12590" /translation="MPENQNSPLIVNNVHTLLSANVSLPLKWKGLNVTYLYTAFSRYP VHEHRAIQITIPLLTSGFNAVTLSATDSRQNSQKLTLENVFLAPAHQPHTLLWDQDTE LVMFDLEPEFIERAAGESFRGSQVEITAIGGIRDSLITQLGGALYAEFKRASAIGRIY VESLAVTLAVHLIRNYSASEQNVRELSGGLTGAKLRRAVEYINSNLDQNLSLQQIAET VGMSPYYLSRALKKSTGFAPHAYLTHQRMERAKQLLTQTRLPIIDIALAVGFVNHSHF STQFRKLVGISPKAYREG" BASE COUNT 6546 a 4971 c 4878 g 6384 t ORIGIN 1 ccgctgcatt cgcaggtcaa aaccaaagcg ttgaccaata taccgagcaa aacggtgctg 61 ctgtcaacgg tagtagcgtg actcaaacct ccatcaccaa cagccgtcaa ggacaaaatg 121 caagaaccac tcgccctggc tatggctatg gctactacgg tggttcccgc agaactcctc 181 aaggtcaagg tgctcgtcaa accaccattc aaaatggttc ggctgtcgat ggttctacca 241 ctgatcaata cagcgagacc aacaacgacc agcgtcaaaa tgctaccaat cgtcgtagct 301 tgtaccatcg ctaatcattg atgtcatcaa aaggcgttgc ccaatctttg gtatcagtta 361 attcttttac tattcaagca acgccactgt ttcacaatca attcataatt taggacgtag 421 ttaaaaatgg tgaaaaaaac cttgactctg ggtcttttgg tcgctgctgt gatggtaata 481 ccaactgtag ctttcgctgg cgaagccgtg accaagcagg agattaatca aaattctaca 541 gttggtggtg aaaatagccg agtgaatcag gcagctgacc aatacagcat tcagcaaaaa 601 actggttacc cccgttctgg aagacaaaat gctaagcagc gcatcgacca aaatggtgcc 661 gcttatgata acagtagtgt tgaccagtat gcttctcaaa aaagcgagca actgcaacga 721 aatagacaaa acagaggatc ttatattcct taccgccgtt aagcctcaag tactagtctg 781 tatttagttt gctgtcccaa gctgctgttg gtgttgggtt acgcatctgc caaacttctg 841 atgagttcaa ctcccgagat atagcggttc cggctaataa gccattgggt gttgctggat 901 ctacgtgaca aagcaaaccc aaataactca ccagcttaca gaaaattgca agtgtctgtt 961 cacagtgcgg tggtgtgtta taggtgagga gttgatatag atgtgaccga tttcctaatt 1021 tgttatatga aatccggttg ttttttacga acagaggaat taagccatgc ctgtcagaca 1081 aactatttca ttactagtgg tactctcagc gctgattgca ctacctgcga aaattgcgat 1141 cgcagatgac attgatgtcg aaggtagtac tagcagggta acaatcgatg aagacggtga 1201 tatcacaatt agagacacaa gacctactta cagaaccacc tatccttctc gtcggatacc 1261 caattctaga agaaatattc gacaatacag tcttgatcag 
cgtctgcgga gtttgagata 1321 tcctcgcaca acacaaagct actgtaacgg cagacgagta gtccgaacaa acactgtacg 1381 cagtggctct aacagcgcaa atagtactta cagttcaaca accacaacaa cttgtcaata 1441 ataccaaggg tacatacatg aaatccaaac tactttctct aggctttttc tctctaacag 1501 ccgtatcccc cttctttatt ccttccttaa ccccctctgc ttcagcagga tgtgttggtg 1561 tcacagctgg gacacaaata gcgatttcca gagaagcaag tcaatctaca agagttaatc 1621 gtcaaagttc tccaggttgt agcggtagca cttctgtcag cacaggaaag caagtttgta 1681 tcaacaatga tggtgcttgc gagcaacgtc gcactgtgaa tcaaaatttt caaggtggta 1741 accgcagggg gggttatgac cctgtaaaag acattaatgt tgatgtcaac gttcctgtgc 1801 acgtaaaatc accaagagca cctttttcca acaggtaaga tagtcctaaa cgactcctat 1861 caaaactctg atctttccga agtccctacc aggcgagaac ttcggtttca ttgctagcta 1921 gggatcactt ttgtgaacgc agaaacttct gctgagttta caatagcaag tgaaaatttt 1981 cttcccgcct aacatcttct ccagctatca gcaggtgacg caatttttaa aataattgaa 2041 ttggctaatt gtatacctgc tgttacttca atctggttat gtcatcatga gtaaattagt 2101 gattcttgca agctgtctag tacttctaag tgttagtaca gcttttgcac aaaatttcgt 2161 ccgtagtagc cagaaaactt cctctgttat tattcaaaca gacagcaata gcggacatag 2221 cagccagagg agttcctctg ttattgttca gacagacaac gatgacgaaa atgttcacga 2281 ttcttccgtt gttgttccaa cagacagtta tgacgatgac gacgaggaaa gtagcgaaga 2341 tttttccgtt gttgttccaa cacacaggag tagcgatatg aacagatcca caacattcac 2401 tagccaacaa tctagctttg gtgattcctt tggatatcaa aaaagctcac ttcagctaag 2461 tgcggcgaat ttaagtcaac cgcatatctt gagcatcaat acatcaaatg ctcaaatgac 2521 gggtgaagtt actgtaaatg gtaaagttgt caagcaaatt aacaacgata aaaacgttaa 2581 ggttaatttg tcaccttatt tatcagttgg tgaacacaag gtggaagtat cagcacgata 2641 caatcctcca tcatcttctg tgagtgtaga actgagcggt cctggtacta acgtagccca 2701 acaaaatagt ggtaatggtg ccttaaacta cactatggat gttaacgtac gctaagatca 2761 tgcgatacgc gcagcgtcaa gcctccggct tatcgccaat tgtgtagcta gccttgaatt 2821 cattcactga atctataaac aagcacacga actcgtagtc acttattcgc cccaatagct 2881 taatggctgc cttgtaaagc ctctgctatt tccagctatc gctctttggc ttgcggacta 2941 gtagcgagtt ttttcgtatt tctggatctg ctgcttgtgt actgtaccaa 
agacatacct 3001 ctatctcctc ccttgacgcg agtccgacga acgcgtcaag ggagtgtagc gcaggtatag 3061 gacttgctct aaaccccgta gagggtttcc catagggttg ggtcacaacg atagccgtca 3121 ggcgtgccgt aggcatatga aaaatctgcc agcctccgga aaaatcgccc atgtataaat 3181 actcattttt tccgaaaaat ctacgtttgg ggcatcttag cgcccagagt gtcctatgcc 3241 tgcggcacgc cttacggcga acggacacgc actcgcgttc gttggcgcta aaccaacgcg 3301 cttagtaatg aggctggcag atttttccgg ttttaatgaa gcggtatggt attttgaatt 3361 gctatgaaat agatctagtt gtacctagtt ggctttccag gtttgcacgc acccacaatt 3421 ctgcaactga ccaaatccaa aacttatttc gtacttgagc aaaattgtca tagttatggg 3481 tgacaattgc atccaatttt tcataactgg cgcaaacgag ttctacagca gattcaaaat 3541 cttgaagagg tgaaaaacgt gcttgttgaa gaatagtttg atcgatagta cagacttgga 3601 ttttgtcttg cagccaatta actactaact cggcaatatt tgtatttcgc aaacggcttg 3661 catatgtgta aattttttgc caaccaatat ctgttatgta catttgaatc cacggatgta 3721 ctttatccaa taattcactg acttctccta ccgagccatt acgattcatc aaagcttcta 3781 atattaaatc agcatcaact aatatccgga acactttcaa accccattat taacgtattc 3841 atatattgcg gtgaaataag atattagact cactgagtca ggaatatact tatatacaaa 3901 ctgaaaagat agtatttccg atttacgcac aattcatgag aaacataaac tagctttcgc 3961 accactttac tgtctggcta ggacgatcag cttttggttg agtggctaac attctacgta 4021 cgtaagacgt cagataaatt tttgtagttc caaacaagag taaaagttat ttctacccaa 4081 aatttcagtc aatataacat catatcattt taattttctt gaattttcac aaagttaata 4141 agtgcttatt gccaacaagt tataaaatta acaaagcaat aagcgtgata ttactaatat 4201 ctcataaaac attcaatcaa ctaataaaaa tttatgaagt gttttggttt ttagaaataa 4261 aatgtataaa ctcttcttta aaaagaaaat tccaaatttc tatctcaaga aggattcata 4321 ttaacataac acttcatgag attccaaaac ccatgactaa agtcacggct aaacaaataa 4381 agtctgcgtg atatctcaaa cttttgctaa acaaggaaaa tacagcagtt tgaaatcaga 4441 tcatggtaca gaacctaggt cccaaagcca gataagaagc ttgttttagt tgtgacttct 4501 gacgattgac acctctattt tgctgtatat aaaatcgagt taagttaagg atagacacca 4561 ttacgatcta aaaactttac tatatctcaa tgccgttcac ttaaggattg tttttaaaga 4621 tttaacacgt gtagagacct tgcatccaac gtttctacaa aagttactgt tcttacaaat 
4681 tattttatct taactgtatt gtactatatc taagtttgag gttattgtga acttagtttg 4741 ttacaaaata tgactaataa tgactattat tgatgagatt tcagcaacaa ataatataac 4801 tgcccataac tcacagtcac gccagcgcga tcgcctgcta tcttaccagt acctaaaaaa 4861 taatctactc gccccgcacc tttaattgca cctcccgtat cttgatcaag aacataccga 4921 ctgacaattc gatgctccat ttgtttagta acagaattaa caaaagggaa gggggcacga 4981 atcaacgcca aagcaccagg aggcataagc gatttatccg ttgcaattga gcgttctgct 5041 gttagtggta ctttgataga accttgggca ggtgcaccgt ggttttcttg aaaaaacaca 5101 aagctgggat cgcgagggat ataaacattt aactgttgtg gatacttctg aaaatagttg 5161 aggatcttcg gcatcgtcaa accctcaagc ggtaacttac catcgttcgc cagttctcgc 5221 ccgatactct tgtagttgta agcggtgtta ccagcataac ctattgttgt ttgagtgcca 5281 tcaggaagct gaagtttggc tgaaccttga atttgggcta agtagggttc taggcgatcg 5341 cgaaaccaaa acaactctaa cccccgcaat ctaccaattg aagcttgcaa cccatcggca 5401 ccttctagtt ccaagcgtgt gggatgaggt ttttgccatg agctaatatc aggtggtatg 5461 cgataaacag gatagcgata ctctagagtt ggaacacgac tagcagtata aactggttca 5521 tagtatgctg taaacagcac cgagcctttt ttgtctcccc ccactgactg gtaaaaaaca 5581 aattcctttg ctattgctgc attaagttct acagcagatt tagattttaa gaccaactcc 5641 cggaatcttt gcaaactttt gatcacgtgg tcgcgtgtca ttcccaccac ctgatactgt 5701 tgatacgccg ccgcagcatc gctagaaaaa aggtactgta aactccgatc tacagacgtc 5761 aacaaagctt ttttatctgg tgattgaccg ttttcgcccc ataattgatt atcgagacaa 5821 gaactatcgc cctcacaaca aggaaacggt gacttctgaa tgggtactaa tggaggtggc 5881 gctgtttgtt gttgaacaat tgtgtcctgt aattgagttt gagaggatac tggtagttgc 5941 cattttgtca cccgacattc cggcggactc agtgctaaat gccccaaagg ttgcataagt 6001 aataacagaa ttataggagc tatagggagg ctgatggcga tacgcgtagc gtcaagcctc 6061 cggcttatcg ctcgtaagat tagcccactt tgccccattg ggacaaatgt ctttttcata 6121 attttttcta aagtgcgtgc catagggcgt atctttaagc aacgataggc aagcgtgggg 6181 tgtgacgaaa atggcttcca tcttacccct aaccactgat gaccattcaa ttatcagtta 6241 tcaactgtca attgtcaggc atttaacccc acgaattcat ttatagggga attgagatca 6301 gatgtccgac aactcttacg aagtcggaca tcttgttttt tcacaaatcc tttaagactg 6361 
ctatctcact cgttcatatt gtgatatgta gcaacagcag aaggtgatat acgattcaaa 6421 taacggaaga tccagtattt gaatatagta tctaaaatta ctggaaatgt agcaataaat 6481 aggaatatga aatcgtgatt tgctggtaat ccccaatggc gcgaaatacc ttctagaata 6541 acttcccatc catgaggaga gtggaaacca acaaagatat cggtaaacaa aatgataata 6601 aatgccttgg cactgtcact gagaccgtag acaatatggt cgaaaaagtc ttttagtacc 6661 gcaatctctt ttttgctaat aagcaacaac caggtaaacg taatgacaga caatatatcg 6721 gcaaaaacat ttttaatagc atttgaactg ctaccacgaa atccttcagc aatctcatga 6781 gcctttgttt ttatctgatt ttctatctgc tcaagggata gtggttcagt attgctcagc 6841 aagttatcaa atttgattct ctcttcaaat ctttctagct cttctagagc ttcttcttcc 6901 atttcataat tcaggaatat ggcagtttcc tcagcagaat ttttctccaa ataacggtca 6961 acaataggtc caacaaccag agcttttgac aattgatgag tcaaaatagg aactatgatg 7021 agcagaagga ggaatctgac agatattatt gttcttcttt gtgtattacg gaaattttta 7081 acaacatctt gttccgcgtt gggatctaat tcaacttgca ggcgagaaaa agtactaaaa 7141 atcgaacgcg gtaacacccc aatcgtatct gtcttacttt ggggtgtttt attatctaat 7201 ctctgaattg ataaattgtt ttgttttaat tgattagtcg aggggtctac tcttgttaaa 7261 ctcttttctc gattgattgg atgattggaa tgtaattcat ccaatctggt gtattttgct 7321 gtgacttcat caatgaattt gagtttttct aaaatcaaag caggctgtgg atactctatg 7381 gcgaattcct ttgccacttt ttgattagat tcactctgaa aaaaacggct tgccttaaac 7441 tctgttagcc gcattcggac aatttttaat tgttttttta gctctgactc aaaataatcc 7501 atgacactac cgctgtactt atttgagcca aagtctattt gattaccatt gaaatattca 7561 tcttctattg ctttaatctg taatgccgct ttgtaagcct catctagaga acgctctgat 7621 gtccgtaagt accatctgta agatgctaca agaaaagggt atattttttg gcgaaaaaca 7681 gaactaatca ttgttgggaa taatccagca ttgttttatg actcttaaac ttaagttaac 7741 gtacgaagtg tctgaaaaaa tctctaagga gatggtttat gatttgggat tccctttggg 7801 taacaggatc tagccggagt ggtaagactg ctcgcttggt aaaggagttt tgtaattggg 7861 tgttgagtga aacgagtagc aaagagtcat attatacgaa aaacaaaggt gaaaaaaata 7921 gcgagaatat accaagacgc ttgtatcttc atcaaacaga accaggagtc ttaattttgg 7981 ctgccaatga cgaaaatcgc cgggatttag cggatagaat tgttagagca actgcgggaa 8041 aatatccagt 
acgttgcaag acgcccttgg gttttattca ggatgaagtg actttgtttt 8101 ggcctttgct gattcaatta ttgaagttaa aggcgcaatt tccggtacgg ctgcgtccag 8161 aaactgaaca agaactagca accaagcttt ggcgtccgca attggacgag gaaactttgc 8221 gtcgcgcggg agatccggag taccgattag tacgtcgtat tttggattta ttgcaactag 8281 gggcttacag tggcatagcg tgtgagaaaa ttgatgatgt tttgctcagc gctatcgctc 8341 agcaagaaaa tagtacaaac ctagaacctg aactcattat atctttgcta ctcaattggc 8401 gtaactggtg tctagacaga ggattcctca cctatggaat catgaccgaa gtttacggca 8461 aagacctctt accaaaccca atttatcagc aacagcttac caaacggtat caagcagtcc 8521 tggctgatga catccaagat tatcccgccc tatcacgcca cctgtttgag ttgcttctaa 8581 atagcggtgc agtgggggca tttagctaca atcctgatag tgtagtgcgt ttgggactag 8641 gagctgaccc caactattta gaaggattat cagcacattg tcggcaagaa attttgactc 8701 agcctccctt tttatcaatt ggagaagcta gcacgatacc gatgctggag ctgataacag 8761 aacagatcat tcctttaagc ttaccagcag cagtgcaatc tattcaaacc atctcccgcg 8821 cccaattatt gcggcaaaca gcagaggtga ttgttaaagc tgttaaagac agacacatac 8881 agccaaacga agtggcgatc attgcgccgg gtttagatgc gatcgcccgt tacaccctca 8941 cagaaatcct caccaagcaa aacatccaag tccaacctct caacgaccaa cgccccttga 9001 taagttcgcc gatcatcaga ggattgctga ccgtccttgc cttagtttat cccggcttag 9061 gacgcttagt ggatcgggat gctgtggcag agatgttggt tgtcttaagc aggggagtct 9121 ctgtcattag aggagagaaa gaaaaagaac aaaaccctca ctctctggtt tcctcatccc 9181 cctcatttcc ccacattgat cccgtccggg ctggcttgat agcagactac tgctttgtac 9241 cccatccaga agaacccaaa ttgctgcccg tcacggcgtt tgaccgctgg gatcggctgg 9301 gctacgccgc caccacggct tatggtaata tattgcagtg gttagaaaac caaaaaacac 9361 aacaggaact gcggttgatt cctactccca tttctctttt agatagggca attcagcgtt 9421 tcttgtggaa tggcaacaat ctcccctacg accaattagc agcactgcgc gaactattag 9481 aaactgcaca gcactactgg gaaattgata cgagattgca acgaacccct tttgtttctc 9541 ccccattaaa agaggaaact caaggttcgg gcacttcacc tcactcaaca attggcgaat 9601 ttatccaact actgcggcgt ggtaccatta ccgctaatcc ctacccggtt catcccatcg 9661 ggccgaaatc aaacgctgtc gcattggcta ccatcttcca atatcgctct tctagacgat 9721 ctcaccgttg gcagttctgg 
ctggatacag gttcgccttt gtgggcaaaa ggcggtgcgg 9781 caactttatt cggtgcacca ttcttcctcc aagaaaggtt gggcgaacct tggacagtag 9841 aagatgaaaa atcggcagaa gaacaaagac tgcgaacaat tctggcagat ttacttgctc 9901 gcgtgtccca gagagtctat ttatgtcata gcgatttagc ggttaatggg caagaacaac 9961 ttggtccttt gctacctttg gtacatgctt gtgtttcagt tgtttctgat cctgccgttg 10021 tttaacgcat ttcggcggaa ttcaagagtc cgccatacga tgaccgagtc taacacgaaa 10081 aaatctagca gcacgccagg tgctacaagt cgggaaaccc gcccaccaca ctggctcctt 10141 actacaacga gtatctgtac aatattaagt aaggtaaaga atactctcat aaaaacccag 10201 agtaaaaatc aatggtgtta tttggatttg gtaaaaagtt aagcctgccc actcctgaag 10261 aagctttgcc aggacgagca gaaaaaatgc cagtaccgga tcgtcactat gttaacggtc 10321 atcccctcaa gccaccattt ccagaaggaa tggaaattgc aacgtttggc atggggtgct 10381 tttggggtgc agaacgcaaa ttctggcagc tagaaggagt ttacagcaca gcagttggtt 10441 atgctgctgg tgtcactcct aacccgactt ataaagaagt gtgttctgga cgcacaggtc 10501 acaatgaagt ggtacaagtc gtctttgatc caaaagtcat tagttattct gagttactaa 10561 aagtcttctg ggaaaaccac aaccccactc aaggaatgcg tcagggtaac gacgtgggta 10621 ctcagtaccg ttcaggaatt tacgtgtatt ctgaaaatca aagaaagcta gcagaagcat 10681 cacaacaagc ttatcagaaa gctctcagtg cttcagatta cgggaaaatt accactgaaa 10741 ttctggatgc tcctgaattt tactatgccg aggaatacca tcagcagtac ctggcgaaaa 10801 accccaatgg ctattgcgga ttaggcggga caaatgttgc ttgtcctgtg ggaattgccg 10861 aacttaatgc ttaagtctgt gacattgtgt cccgttatag cgcaagctta acgggacata 10921 ttcttttcta tagcaatcct aaatattcgt gataaacaag atccccgact tcgtagaagt 10981 tgtcggggat ctgtcgctta cccgagggta ggcgcagttg agcattgggg atatcagcga 11041 taaagctatt accactactg cgattcacca gttgcaacgc ttgcccgagg ctgtacatta 11101 ttttgacccc tccaaacctc cgcaaacctg cggggagggc agtaaacggc gatggggtta 11161 tttctatgca tcttcatatg aaaatggtat aagttaaatg ctgatataaa tgctgacaac 11221 aaaatctgtg acagggaacc ttcctaaatc gagcgaactc aaccaacggt tgaataataa 11281 aactctcatg ttgtccagtc ggcaaatggg ctggaatggc atcttggttg agcagtatca 11341 gaatcttcca gctccagccg aaaaagaact ttctgccctg tcaactcatt ggcttatctt 11401 
acccggacat cccggccatt tgcattggaa gtttgacgat cgcctgcgtg aatcaatctt 11461 tcagaaaggt gacagcctct tggttcctgc gggacaatca ggctactggc gttgtcaaaa 11521 cagtaagtca tcccagacag aactacatat tcacttacag ccggagttgg ttgaacaagt 11581 tacccaagcg tctgaaatgg atacagagcg cctcgatctc gtgaaccatt tctgtaagca 11641 ggacttgcat cttcaacata tcgccatgct gctgttagct gagttgtgtt cagatggcat 11701 gatgggtcaa ttgtatgtcg aatcattgac ccaagtgtta gtcattcatc tgttacgaca 11761 ctattctaag tctgcacaaa tcatcacatc cgagaacaga agcttaactc acgctcaatt 11821 acagcaagca attgactata ttcacactca ccttaaccga gatttatcac tagctgaaat 11881 tgcagaagtc atcaatatca gtcccactta ttttgccagt ttgttcaaac gcgctacagg 11941 gatttctccg catcagtacg taattcaaca gcgagtggaa cgggcgaaat cgctgctgtc 12001 gaaaactgat ttggcgatcg cggacattgc attacaagtg ggtttctcca gtcaaagtca 12061 tttgacacaa cagtttaagc gcctcactgg aatgacaccg aaacaagttc gctaatacca 12121 taagaatctg acaaattatt gtaagaattt gaaagcagtc aagcattgga actcaataca 12181 ctttaaatag atagaaacag gagaatccaa tacagacagc accgtccttt gctgaagcag 12241 tgttccgaga ctttcttgcc tctatcgaat gtcaaaattc ctaatttctg tgggagaact 12301 gcatgtctaa aaaactaatt aacccggcag cgttatacga tggcgctcct tctgggatgt 12361 ctcaggcaac agtcgataca aaatctggtt tggtgtttgt atcgggtcag gttgattgga 12421 atcatcagta cagcaccacc gaacagactg tagaaggaca atttaaaaaa tcactagata 12481 accttaagat tgccctggag gaagccggct cttccattga gcagttgtta caagtccgaa 12541 tttacattcg tggcgaactt ggtgattata tggaggtgct tgcgccgatc ttttccaatt 12601 ttttaggaga atctcgaccc gcagtcactg gaattggtgt agcttccttg gcatccccag 12661 agacgcttgt agaggtagag gcgatcgcca gcatcagata acttagtctg caagtttttt 12721 tatgagaatc tggcttgtct aatttatgac tccgggttcg cccattttgg aatgagttag 12781 gaaaaatgag cagatacgct cgcaatgaca aaaatacgtt acctttgcgt aagtcctgaa 12841 caacttgcgc atttttgtgt caccatctgt ttgcactcgc gttcgctcgc gttcccggct 12901 acaagacgca gtcgagcgct tatggctctc gttacgctcg cgtctgtcgc gaatcctcac 12961 ccttttttca tcgttggcga taaaactagt ttaattttca ccagaggaat gcataaacct 13021 tcaaccccta tccctatcaa aatgcagaag cgaaaaaacg agtccaagac 
gccaaacaaa 13081 cttacggcgg acaaagacac aaaccaaaac caaaactatt tcatgaggta aagaaaatga 13141 ttgtcgtcat caaatcccaa actcccgcat ctgaaattga aagaatcatt caagaactgc 13201 atcagtattc cgtaacaacc gaaaaagtta ttggtaaaga caaagttgtc attggcttag 13261 tcggtgatac aagtgttatt tctcctgaac acgtccaaca aatcagccca tttatcgaac 13321 aggtgattcg cgtgaaacag cctttcaaac gggtttcgtt agaatatcgc tatggcgaac 13381 caagcgaagt cgttgtccca actcccaatg gtccagtcac ttttggtcaa aatcatcctt 13441 taattgtagt cgctggacct tgttcagtcg aaaacgaaga aatgattgtg gaaacagcac 13501 aactcgtaaa agctacaggt gcacagtttc tccggggagg agcgtacaaa cctcgtagtt 13561 caccttatgc tttccaagga catggcgaaa gcgctctatc cctgttagca acagcccgcg 13621 aagtcaccgg actggggatt atcacagaag tgatggatac agccgactta gacaaagttg 13681 cagaagttgc agatgtggtg caaattggtg cccgcaatat gcagaacttc tccttactga 13741 agaaagcagg cgctactggt aaaccaattc tgctcaagcg ggggctgtct gcgacaattg 13801 aagaatggtt gatggctgct gaatacattc tggctgctgg taatccaaat gtgattttgt 13861 gtgaacgcgg aattcgcagc tttgacaagc aatatacaag gaatattcta gacctatctg 13921 tggtacctgt attgcgtgga ctgacccatt tgccaattat gattgacccc agtcatgcca 13981 ccggctactc taggtttgta ccaacaatgg caaaggcagc tatagcagct ggtacagatt 14041 ccttgatgat tgaggttcac cccaacccct ccaaagcact atcagatggt cctcaatctc 14101 tgacaccacc ggattttgtg gaacttatgc aagaacttgc tccccttgcc caattcttcg 14161 gacgtggaac aaagtctgcc ttcacaacta caccatttgc taccaaaact cattcgactg 14221 cgggtgtcta agttgatcaa tcaagagtgg tttctcaagg gtaggtaata tcatgtttac 14281 accaatgact cagcgatttg cctacctcag attctttaat cattctgaaa ttatttggtt 14341 aaggcaatca tgtgtgcgta ttattctctc agtttgacta gaaatacagt acgaaaatac 14401 gcacactcta atttgctccg attttctaat tccaaaacta taattgacac tgctgttaaa 14461 ttttaagaaa tcttataccg acaacaactt gacaaaaccg ctgaatttct aaccaatctt 14521 ttgtagatac cagagatgac aataggagca gtacatattt tcatcctaaa cgctccaaaa 14581 tcatttcttt ctcgaaactg cataagtcaa attaaattac atctatctaa gatcaggccg 14641 ttgttgtcaa gcaacaatca ctcagagaaa catgatagtc aacgttagat tttaaccagt 14701 gtcctgcact caactttgtc accaagtcgc 
caaactatcc tggtgaggat ttttttagtc 14761 agctttaaga gggtcatatc tatgtctttt gacacacagt cttacacaac taaaggtgga 14821 atcttcgtat cgcgctccgt gactcaaact tcgatagatt cagcaattga tgaggttttg 14881 atgcgtctcg attcccaacg cggaggcttg ctggcaagta gctacgaata cccagggcgc 14941 tacaaacgat gggcgatcgg ttttgttaat ccacctttag aacttgttac ctgtgataac 15001 acttttacca ttacggctca taatgagcga ggcaagatac tgctagatta tgtggcagac 15061 tgcctgttgg atcaaccata tcttcaagct gttagtaaag agaacaagcg tattatcggc 15121 tctgtcaaac caactgaacg tctcttctca gaagaagaac gcagcagaca accatctgtt 15181 ttcagtgttg ttcgcgagat attatctgtc ttccacagtc cggtagatga aaacttggga 15241 ctttatggtg catttggcta tgacttagtt ttccagtttg agcaaatgcc aaagcacctc 15301 gaacgttcag cagaccaaag agatttggta ctatatctac ctgatgaatt gttagttgtt 15361 gactaccacc tccaacaggc attcagagtg cagtatgatt tcgtcactaa gaatggtagc 15421 actcgggatt tacctcgcac tggtgaagtt atagattacc gaggtcaacg tttaagtgtg 15481 actcaaggat ctgaccatgc accaggtgaa tatgaacaac aagtcatcaa agccctcgat 15541 tacttccgtc gcggcgattt atttgaagtt gttcccagtc aaagcttcta tcaaacctgc 15601 gaaaaatccc caaccgaact tttccgcact ctcaaggaga ttaatcccag tccctatgga 15661 ttcatcttta atttaggtgg agaatatctg attggtgcat ctccagaaat gtttgtgcgt 15721 gttgacggca gacgtgttga aacttgccca atcagtggta caatccgtcg cggtgaaact 15781 gcgattgatg atgcagcgca aatccagaag ctgctgaact cgcgtaagga tgaatccgaa 15841 ctcaccatgt gtaccgacgt agaccgcaac gacaaatcgc ggatttgtga gccaggttct 15901 gtgcgagtca ttggtcgtcg ccaaattgaa ttgtacagcc atcttatcca tacagtagac 15961 catgtggaag gcgtgctgca accagaagct gatgctttgg atgcattcct gacgcatctg 16021 tgggcagtca cagtcacagg agcaccaaaa cgctcagcca tgcagtttct cgaacaaaat 16081 gagcgcagtc ccagacgttg gtatggtgga gccgttggag gcttaagttt taaaggcaac 16141 ttgaatactg gtctgatttt aagaactatc cgactgaagg actcagttgc acaggtgcga 16201 gttggtgcta cagttcttta tgactcagag ccagaagcgg aggctcaaga aactattact 16261 aaagcgagtg cgttattcca aacacttcgc gtcgttgaac acaagagtgg agatttgtca 16321 agtttgtcag gtcttgacat tcaagaagca aatccagggg tgggcaagcg tgtcttgtta 16381 gttgactatg 
aagactcatt tcttcatact ctagcaaatt acattcgtca aactggtgcc 16441 acagtcacaa cactacggca tggttttgca gaatcagttt ttgatacaga aaaacctgac 16501 ttggttgtat tatctcctgg tcctggaaaa ccaagtgatt ttcgtgtagc agagacgatc 16561 gcagcctgca aaaaacggca aattcccata tttggtgtgt gcttgggctt acaaggaata 16621 gtcgaggcac acggtggaac actgggagtt ctggattatc cccagcatgg taaagtatct 16681 atggtatctg ttgttgatcc agattctgtt accttcaaag gtctaccaca atcgtttgaa 16741 gttggtagat atcactcgct gtatgcactg ccggaaagcc tacccgcaga actcaaagtg 16801 accgccatgt ctgcagatgg tgtgattatg ggcattgagc atcgcacatt accaattgcc 16861 gccgttcagt tccatccaga atccatcatg actttggcag gaggaatagg tttaacaatt 16921 gttaaaaatg tagtccgtga attcatggct acaattccct cctcagccat tgcataatca 16981 cctctgttgt aaccctttct ggttgtagcc taccgtctac acaccggatc gcttttcacc 17041 tcaccctcgc ctttagctac gctaaaatct ttccctctcc ttaataagga gagccagtgc 17101 gttgcggggg ttccccccgt tgtagcacct ggcgtgaggg atgcccgaca gggcagggtg 17161 aggttaaaaa ttttgggtta actctccctt tcttgaacta ataaccacta taaaaagatg 17221 ctaaattatc accatcaacc aactcaagca actgcgtcta ccaaaccacg ccatattctc 17281 gaagaaattg ttgagcataa acagcaagaa gttgccctca tgaaagagca actgccaata 17341 accgagttaa aacatcagat cagtcttgct ccccctgtac gaaatttctt gaatgccata 17401 cggcaaaatt ctactcctcc cagcattatt gccgaggtta agaaagcttc accaagtaaa 17461 ggagtgattc gtgctgattt tgaacctgta aaaattgccc aagcatatga gcgaggcggt 17521 gctacttgta tttctgtctt aactgacgag aaattttttc aaggcggttt taagaatttg 17581 caacttatta gaaataatgt aggattgccg ctactttgca aagagtttgt tatagatcct 17641 taccaaattt atctggcacg agtcaatggt gcagatgcag tgctcctcat tgctgcgatt 17701 ttaagcgacg aaacgctgca agattttctg cacattattc aggatttggg catgactcca 17761 ttggtggaag tgcacacctt gactgaactt gaccgagtac tggcattatc aaatgtgcag 17821 cttgtcggaa ttaacaaccg caatttggaa gactttactc ttgacatcga aacgacgaaa 17881 cgccttttgg tagaacgtca acaaaagtta agtgatttag atataacagt cgtgagtgag 17941 tctggtttgt acacatcttc tgatttagct tttgtcgccc aggcgggagc taaagctgta 18001 ttgatcggag aatctttagt caagcaagct gatttagagc aagctgtcaa ggaattaaga 
18061 atgtaaatga ggtttgcatt tttcattctc cttttgccct attaatttat tgttcgcttt 18121 cggtaattca ttaagatgac ttctatctct gaacactttg aatctttgcg cacaaaaaat 18181 caatgtgctt taatcccttt tattacagct ggcgatcctg atttggaaac gacagccgaa 18241 gccctaggaa ttttggatgc taatggtgct gattttattg aattgggagt tccctattct 18301 gatccattgg cggatggacc tatcattcaa gcggctgcta ctcgcgccct caagggggga 18361 actcgactag ctcatgtttt agaaatggcg cagtcagcta ctcgtagttt gcgatcgccc 18421 atcatcctct ttacctacta caaccccatt ttaagtctgg gcattcagca gttcctcaag 18481 caagttgctg attctggaat caaggggtta gtcgtacctg atttaccctt ggaagaagca 18541 gatgatgtgt tgaagtctgc tgatagtgta ggagtagaag tcacattgtt agtgacacct 18601 aacagttcaa aagagcggat tgaggcgatc gcccgccaat ctcgtggatt tatttaccta 18661 gtgagtgtca caggtgtaac aggtgtacgc tcccaagtgc aaacgcgcgt gaaagatgta 18721 ctccatgaaa tccgcagtgt caccaacaag ccaatttctg ttggttttgg tatttctacc 18781 ccagaacaag ctcatcagat taaagactgg ggcgcagatg gagtgattgt gggaagcgcc 18841 tttgtgcaac gcttggcaga aacgactcca gataaagggc tgcgggcgat taaagatttc 18901 tgtggcgatt tgaaagcagc aattaccccg actgcgggaa ttaaaaatta acaatttcgg 18961 acttacgcat tttcttatct tctctgtacc tgctgtagtg aaattaattg aactttttga 19021 tagttttagc gtaagtccta aaaattcaat tccaaatttc aaaatcagga gattccatgt 19081 tagcagatac acctagaaca actcagcaac ccgatgcatt aggcagattt ggtcaatttg 19141 ggggcaagta tgtacccgaa acattaatgc ctgcactgag tgaattagaa gcagcgtatc 19201 aaaaatatcg caatgatcca gattttcaga atgaactaca gggactacta cgcgactacg 19261 taggacgtgc taccccccta taccttgccg aacgcctgac agcacactac accaaaccag 19321 atggaaccgg accgcaaatt tacttaaaaa gagaagactt aaaccataca ggcgcacata 19381 aaattaataa cgccttagct caggtattac tcgccaaacg catgggcaaa cagcggatca 19441 tagctgaaac tggagcaggt caacatggtg ttgcaaccgc tacagtctgt gcccgttttg 19501 gcttaaaatg catgatatac atgggcgtcc atgatatgga acgacaagcc ttgaatgtgt 19561 ttcggatgcg cttgatggga gcagaagttc gtccagtaga agcaggaaca ggaaccctga 19621 aggatgcaac atcagaagcg attcgggatt gggtgacaaa tgtggaaacc acccattaca 19681 ttttaggttc tgttgcaggc cctcatccct acccgatgat 
cgtgcgagac tttcaagcaa 19741 tcattggtaa agaaactcgc agccagtctc aagaaaaatg gggtggatta ccagatattc 19801 tgcttgcttg tgttggcgga ggttctaatg caattgggct tttccatgaa tttgtcaatg 19861 aaccatctgt acgcttaatt ggagttgagg cggcgggtga aggaattaat actgaaaaac 19921 acgcggcaac tttgacactt ggcagagtcg gtgttttgca cggcgcaatg agttatgttc 19981 tgcaagatga ggatggtcaa ataacagaag cacactctat cagtgctggg ttagattatc 20041 ctggggttgg tccggaacat agttatttga aagatgttca acgtgctgaa tattacagcg 20101 tgagtgacgc gcaggcgttg gaggcgttta aacgtgtatc actactggag gggattattc 20161 cagcactgga gacttctcat gcgatcgctt tgttggaaac tctttgtcca caactgactg 20221 gttctcctaa aattgtcatc aatttttctg gacgtggtga caaggatgtg catactgttg 20281 caaaattctt ggacactttt taaatgtctt gaagggtttg aaacaggttt gtagtaaaca 20341 ttgttattaa aaaacaactg acgggggcgt tctccgtcag tttcatagta accgtttaca 20401 ggagttgaaa cactatggtt gtacgttaga tttttgaaat tacatcaata gcgcttgtcc 20461 actggaaatc aggataccgc tcattatcca gcggtttatt ttggacgcta aacatactat 20521 acaaatactg gctagcttgc caatgcggat agatctgttt ttctccttcc gagtgtgctg 20581 cccgctccct tttgtttgag acagaaaaat cttccaagct tcccattggg acaagtttaa 20641 aatttgtctt tttaacttct tggcctatag caatcaattc gtttggacta atctggaaac 20701 tggcaatccg tagaattttt ggagttgcgg agtctaacgc tgcggctgaa gtataatctg 20761 ccgtattgtc catagtcgaa aagtccactt tccagtcagg attatctccc caataggcga 20821 cactatgatc atttggatta taaagcggcg tgttgtagct caatatatct gaaaaagcac 20881 catttagtat agaggttgca gcgattggcg atttgtctaa atacttatgg aactgcttcc 20941 ttaaatcaaa atttctgttc tcgccttcag gcatttttgt aaagtcagat gaaaaatcgg 21001 acggaatgaa gcggggtacg cccgcagcta tggctgcatc aagcaatagg gtttgggaat 21061 ctacgattac atcgtgtagc cctgccaacg cagagacaac acaggatacg ccttggcatg 21121 cattttttag ctcttcaaca tctgacatat ccactgtaat tacctcagca ccaagttcgg 21181 tcagtttctt tattttcact gggtcaattt cgttacggct aaggacacgt acttccgccc 21241 ctctttttat caaagcttta acgattcgtc caccgagatt acctgttccg cctgctacta 21301 aaattgtttg tttcatatta ttctccaaaa tttaactttt gtgctgtgta aatggttatg 21361 gttctatatg gctgcttttt 
ttaagcgctc tgttttctaa gtagtaacaa gcgtgcaatt 21421 aaatgaaact tctgtactat ctcagtattg agcaaatttg aaacgtcgaa gataaaaggt 21481 tgagtcgaga ggagctatta tgtaccgtga ttcagctaac aacgaatcgc acaacacatc 21541 gccgtcaaac tcagctgttt gcgtcgccat tggacgcgat gggaggtttc cctcttcatc 21601 aaaagattcg tcttttaaca agtattttag gtaaccttct cgctcaaagc tttcggtttt 21661 cgagataatt gttgtctttt tgcgctttta aaattttttc tttgagatct gctgataatt 21721 gaaagatatt atccttcacg ataggctttt ggagagatgc cgacgagttt gcgaaactgc 21781 gttgaaaaat gactgtgatt gacaaagccg acagcgagcg caatatcgat gattggcagc 21841 cttgtttggg tgagcagttg cttggcgcgc tccattcttt gatgagtaag ataagcgtgc 21901 ggggcgaaac cggttgactt ctttaaggca cgggataagt aatacggact cattccgact 21961 gtctcggcaa tttgttgcag cgagaggttt tgatcgaggt tagaattaat atattcaaca 22021 gctcggcgga gtttcgctcc tgtcaatccg cccgataatt cgcgcacatt ctgttctgat 22081 gcggaatagt tccggattag atgcacggcg agggtaacag ccagtgactc gacgtaaatt 22141 ctgccaatcg cgctcgccct cttaaactct gcgtaaagcg cgccaccaag ctgggtgatt 22201 aaagaatcgc gaatgccgcc gattgccgtg atctcaacct gactgccgcg aaatgattcg 22261 cccgccgcgc gctcgataaa ttcgggttct aaatcaaaca tgacaagttc agtgtcctga 22321 tcccataaaa gggtatgcgg ttgatgggcg ggagcaagaa agacattttc caaggttagt 22381 ttttgagaat tttgcctgct gtctgtcgcc gacaatgtga cggcattaaa tccggaggtc 22441 agcagcggaa ttgtaatctg gatggcacgg tgctcgtgta ccggataacg gctaaaagcg 22501 gtatagaggt aagttacatt cagtcccttc catttgagtg gcagtgagac gttcgccgat 22561 agaagagtat gaacattatt aacgatcagc ggcgaatttt ggttttctgg catggggtat 22621 aagtcgtaaa cgaacagaaa gttacgcgct tggctggaac gctttctaga taagcatcaa 22681 tacggttcac ataagaaatt tgctcttgaa ggcaagcagg ggtgcagggg ggctccaggg 22741 gcagaggaga attattagtt cacgtggatt ggcgtctcc // LOCUS NODE_1386_length_22754_cov_4.95347822754 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 22754) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 22754) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..22754 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..319 /locus_tag="DP116_12595" CDS <1..319 /locus_tag="DP116_12595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411915.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12595" /translation="LWTVNDVSTSFLMIKFYENLFKLDNLEAGDVAIALNQAQKWLRN LTIEGLDRFLEEHKPQIEKVLAQLRVGQRKNIEESLKLIKQRQPLPFANPYYWAGFTA SGR" gene complement(316..684) /locus_tag="DP116_12600" CDS complement(316..684) /locus_tag="DP116_12600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411916.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12600" /translation="MITVTPEEISQFRSQLADNPQALVALDTIEECEGYLDDAVPLLV MRETAQEADRGLNDWLEKCRQFICQEEVRQALESGLIAPIIEPLAASAGIPLGTATAL SICVFKLGAKKFCKVPGSGA" gene complement(1176..2204) /gene="rfaE1" /locus_tag="DP116_12605" CDS complement(1176..2204) /gene="rfaE1" /locus_tag="DP116_12605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="D-glycero-beta-D-manno-heptose-7-phosphate kinase" /protein_id="PRJNA477356:DP116_12605" /translation="MSLDPDFAILLRASADRLFFLLEKFASARVLVIGDLTLDEFLTG QVERVSREAPVLILRHETTKQVPGGGANAIYNFAQLGAKVLAVGLLGKDEQGKALRSL LEAAGINTDGVFVDSSRPTVTKTRISGHSRQSVTQQIVRLDRKSDDLPDLDLQVQISQ YIKEQINSVDAVVCSDYGDGTLTQPVISAALSASRTIVDAQKHLERYRGGTLFTPNLP EAELAVGYAITDEATLTQAGQDLLALTQAQHILITRGEQGMSMFNREASAFHIPAFNR TKVFDVTGAGDTVVAALTLGLTVGASAWEAAVLGNLAASIVVRQFGTTTTTPEEMRVA LQRLLEES" gene 2404..2742 /locus_tag="DP116_12610" CDS 2404..2742 /locus_tag="DP116_12610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_12610" /translation="MFQYKGYTGELEIDIESGMLSGRVIDIKDVVSFKGKTVEETRQA FEESVDDYLEFCAELGEEPDKPFSGKLPFRTSPEHHRKIFIASRKVGKSINAWMDETL VKAAEEIIHA" gene complement(2829..3488) /locus_tag="DP116_12615" CDS complement(2829..3488) /locus_tag="DP116_12615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316208.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12615" /translation="MNTFASAKGSLSFYNFFNSDFTVPQPTQDADLLRQLSFLPGLKE ILMLRQVHALEHATVWLLSASKNVQASKGRPRPSNFQVDNELLGGLSTERGFYLYGEV NISDLRRAVTLALHRLTNGEWDLAVHPRCGTNVSVEMLLTAGLAVGMHLLLPRGLIEQ LIGLGVATTTAAELAPDIGALAQRYLTTSIPFNLKIENITHTRDLWGRQAYFVQVQWR E" gene complement(3908..4177) /locus_tag="DP116_12620" CDS complement(3908..4177) /locus_tag="DP116_12620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139971.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="helix-turn-helix transcriptional regulator" /protein_id="PRJNA477356:DP116_12620" /translation="MAGGESQTPVSLSDRELQIIDLVAAGLTNQEIAGKLEISKRTVD NHISNILTKTGTDNRVALVRWALQWGKVCLNDVNCCTLPLPEQKE" gene 4608..5585 /locus_tag="DP116_12625" CDS 4608..5585 /locus_tag="DP116_12625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbohydrate kinase" /protein_id="PRJNA477356:DP116_12625" /translation="MSNPRVLCLGEILFDRLADQLGRKLEEVESWTAYPGGAPANVAC ALVKLGTPVAFIGTLGEDAPGHELVELLEKIGVDTTGVQRHPTAPTRQVNVVRSLEGD RTFAGFKDYETTEFADTRLKADQIPQELFQAADFLALGTLGLAYPESGQAIHRALQMA EQYDVKILLDVNWRPVFWTNPDIAPQKIQELFKHIDFIKLSKEEAEWLFDTADPGAIT YRLDTVEGVLVTDGENGCAYCLGENEGKLPAFSLPVVDTTGAGDSFVAGFIHQLLTHG IQSLNDSETARRIVTYASATGALTTMKPGAIASQPTADEVEAFLASHQV" gene 5686..6576 /locus_tag="DP116_12630" CDS 5686..6576 /locus_tag="DP116_12630" /inference="COORDINATES: protein motif:HMM:PF12833.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_12630" /translation="MSKTDCAVEKAILKSFNRQSLLSNDDSFWNGIRFQFTQSRSASA LPDEILFPENAIHIYTDMPSGYVLEARINGRCQKSSLVTGHSLIIPRGTAYWQSDNHK SQGITLGLNFSFLERTLSESIDLGCLKLHPQFPTFDPLIYQIGLALKAELEHNPHSSR LYAESAATFLATHLFHHYAQGKQKELITTSGLPKYKLQQVIDYIHANLECNIGLTELA DLAQMSLSHFSRLFKQSTGYSPHQFVIKCRVERARELLLKGELSIADITYKVGFANQG HLTSHFKRLLGVTPKVVRGK" gene 6766..7188 /locus_tag="DP116_12635" CDS 6766..7188 /locus_tag="DP116_12635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867545.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldehyde-activating protein" /protein_id="PRJNA477356:DP116_12635" /translation="MAIKFTGGCLCGSVRYECSAEPIAMGNCHCRDCQRATGSAYASG LLVPRSAVTITGDVKYYDVIGDSGSIVGRGFCPNCGSRLFSKPPIPELLGILAGSLDD PSWFQPAMDFYTASAQPWDYMNPDLPKFDKMPVMQQSL" gene 7923..8537 /locus_tag="DP116_12640" CDS 7923..8537 /locus_tag="DP116_12640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015191258.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase" /protein_id="PRJNA477356:DP116_12640" /translation="MIVVHHLNNSRSQRVLWLLEELGIEYEIKYYERDANTMLAPASL RQIHPLGKSPVITDADLTIAESGAIIEYIVDRYGNGRLVPGSGTSERLRYTYWLHYAE GSAMPLLVMNLIFNKFGTGDSAAQDAFIAPQIKLHFDYIEGELRKSTWFAGEEFTAAD IQMSFPLEMLAQLPEQVDNRPKIKEFVEHIHERPAYKRALERGA" gene complement(9035..10060) /locus_tag="DP116_12645" CDS complement(9035..10060) /locus_tag="DP116_12645" /EC_number="2.7.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213632.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glucokinase" /protein_id="PRJNA477356:DP116_12645" /translation="MTLLLAGDIGGTKTILRLVETSQKPSLQTICEERYRSGDFPDLV PMVQQFLLKANTQTPEKACFAIAGPVVQNTAKLTNLVWFLDSERLQQELGISHVTLIN DFAAVGYGVLGLDNQDLLTLQAGKLNPEAPIAIIGAGTGLGQGFLIKQGENYQVFASE GGHVDFAPRTELEFQLLKYLLDKHDIQRVSVERVVSGMGVLAIYQFLRDRKIAAESPE IAQIVRTWEQEAGRQEKSVDPGAVIGKAALEKSDPLSEQTMQLFVEAYGAEAGNLALK LLPYGGLYIAGGIAPKILPLIQEDRFMLNFTQKGRMRHLLEDIPVYVVLNQEVGLIGA AICAARL" gene complement(10072..10719) /locus_tag="DP116_12650" CDS complement(10072..10719) /locus_tag="DP116_12650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459424.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine phosphatase family protein" /protein_id="PRJNA477356:DP116_12650" /translation="MNQIVWIARHANRLDFVNPDWFLTAERRYDPPLSEDGMVQAQQL AKRLKGENIAHIFASPFLRTVQTANAVAEVLDLPIKLETGLSEWLNIVWMTEEPQRLS TRVLAELFPRIDTSYTSRIAVNYPETYEKMQERSGQTARCLTTEFSPEDILLVGHGAS VLGATIGLVGEIARTEVKASLCSLVKVVRQHPEWLLELKGDTSHLTQVEEVIRFN" gene complement(11007..11804) /locus_tag="DP116_12655" CDS complement(11007..11804) /locus_tag="DP116_12655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011433595.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid amidase" /protein_id="PRJNA477356:DP116_12655" /translation="MKIYISADIEGIAGISHWDEATLGKQEYDIFQEQMTREVIAACE GATAAGATEIIVKDAHDTGRNLDPYRLPLPAQLIRGWSGHPYSMVQELNSSFTALVLI GYHSRSGSGKNPLGHTLSEELNCILLNDQPIAEFHLVAMTAAYEKVPVVFVSGDSAVC EAVKVYDTNIETVTTKKGIGESVWGIHPEEAVKQIQAGVESALQHQHQIGVKPLPENF KLEVEYKHPPDAYENSFYPGAKLQGEQSVLFETDDWFEVIRALCFIV" gene complement(12009..16019) /locus_tag="DP116_12660" CDS complement(12009..16019) /locus_tag="DP116_12660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409659.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptide synthetase" /protein_id="PRJNA477356:DP116_12660" /translation="MENKYCSNTQHYSETIDSINAQSYQLEIPPQRLHYFFERRCDAN PDAIALICDTERLSYAELDARANQLAHYLLRRGISSGNRVGILLERSVNTYVTLLAIL KCGAAFVPLDPSFPQDRIAFIAENASLNLLVTTSQFVDITAGACCEVLLLDAVAAAIA IGPANRIAIADTADELCYIIYTSGSTGRPKGVAVNHSNICNYFTFCTPIYKVTQQDRV YQGITIAFDFSIEQIWVTFSVGATLIAAPGNYQLVGSDLAVFLKEQEVTVLSCVPTLL ATIDCDLPSVRLLIVGGEACPRDLVNRWSKPGCRMLNTYGPTETTITATWTEMQPDLP VTIGRPLPTYKVYILDENMHPVPPGESGEICISGIGVAQGYINLPEATAAKFVLDPFE QNSSNARMYRTGDLGRFTSNSEIEYLGRIDHQVKIRGYRIELTEIEAVLLENPEVENA IVSLVSNAVQELAAYITLRVNVADPKDLKNCLYTSLRSRLPSYMVPAFIEILDTIPTL PNGKADRSKLPAPTQRMQRQHGSDQTLPLSLSKEESAISQRLDATRIANIWQQLFPNA QISIKDDFFLDLGGHSLLAASLVSQLREQPEFSHVSMLDVYQCPTIAGLAARLTQQTA LPNVPTAPLPFHRTSRKRYLRSVTVQALGLIVILFCFALQWLLPYLTYSWTQENDASN LQSAFLSIAALAGIIPLMLGFSIVAKWLLLGRVKPGKYPLWGSFYLRWWFVKNLLSIT PSHLLSGTPLLNIYYQLLGAKIGTNVYLNSVNIDVPDLVSIGADSSLGYEARLLNATV EQGWLEIGSIKIGNRCFVGASAVLSGNTIMEENASLEDLSMLPPLQRIPAREVWAGSP AAPVGINESKMISRPARLRRIYFGTLQAILLLTLPILELLPILPGVEQMYSISDQHNW LLFSPIIALSFVILMALQIAALKWLIVGRVKPGSCRLDSHRYVRLWYVDKLMALSLDI IRPLYATLYLLPWYRLLGAKLGWRTEISTPSSVVPDLVTIDDESFIADGVIMGVPRVE RGRMSLESTRIGKRAFIGNSALLPAGVIIGDDTLIGCMSVPPADKSLVAKPDTSWFGS PAINLPQRQIVQGFSVQSTYKPPISLVVQRMSFEAVRVLFPLSCIVVLSSVLIDTMIR LNDDWDESALILILPFLYLGFGLAATAITIIAKWLIIGRYKSTERPLWSNFVFRSELV TCIHETLAVPLLVDMLRGTPFINWYLRLMGCKIGRQVYTDTTDITEFDTIEVGNDVAL NSNCGLQTHLFEDRVMKISTVTIGDRCSVGSGAIVLYDTVMEPDSSLGNLSMLMKGES LPASSSWVGSPARIAE" gene complement(16641..17969) /locus_tag="DP116_12665" CDS complement(16641..17969) /locus_tag="DP116_12665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188502.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12665" /translation="MPPDSQNDFWVFHGILAIASLLPSADRIQFDIERDRILAQEQQA RAEAERANRIKDEFLAVLSHELRTPLNPILGWSKLLQQGKLNPAKTAHALETIHRNAK LQVQLIDDLLDVSRILQGKLSLTITPVDVKAVICAALETVRLAAEAKSLHIQTTIPDA VGTVNGDAGRLQQVVWNLLSNAVKFTPPGGQIAVELAQVGTHAQIQVRDTGKGISSDF LPYVFEHLRQQDGATTRKFGGLGLGLAIVRQIVEMHGGTVTVDSPGEGQGATFTVQIP LAPQLNELPTPQQLSPNTSDLSGIRILVVDDEADSRQFIAFILEQANAIVTTVGSGID ALQAFSQSIPDIVVSDIGMPEMDGYMLMRQIRALPVEQGGQVPAIALTAYAAELDQQR AIRMIASALCAIAAGFLHHIAKPIDPDVVLAIVVNTILSRPRISSCSISS" gene complement(17878..19257) /locus_tag="DP116_12670" CDS complement(17878..19257) /locus_tag="DP116_12670" /inference="COORDINATES: protein motif:HMM:TIGR00229" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12670" /translation="MIHPEDLASFQASVAYAIKHFLPWDWVGRVITPSGQLKWIQGRS SAELTSDGSAWDGWLVDITERKRVEAALGESEQQLRLALQTSKLGSWQLDLKTNVLSV SNQCKVNFGVPLSAEFSHQVLMERIHPDDRAWVQAAIQDSIVNRTDYDVEYRTVWDDA SIHWALIRGCSLYDKTGNPERMIGMSMDITARKQAQAALRESEARLRFVLDSSQIGEW DLDLTRQPYTARRSLRHDQIFGYESLLPEWNYEIFLTYVHPDDRASVAQKFQHTLSTW ADWDFECRIIRSDGELRWIWAKGSVYRDAHNLPSRLIGVVVDFTERKQADALLRESEE LNRKILESSYECIKVLDLDGKILYINSGGQRLLKICDFSCYVNSDWIEFWQGEDRQAA QAAIAAALSGGVGRFQGYCPTADGIPKWWEVMVSPILDSTGQPERLLGISRDISDSFA AAFGRSHSI" gene complement(19291..20463) /locus_tag="DP116_12675" CDS complement(19291..20463) /locus_tag="DP116_12675" /inference="COORDINATES: protein motif:HMM:PF01590.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12675" /translation="MLPEDSLPRLPLPITDTNRLAALRRYDILDTLPEAAFDDLTQLA AQICQTPVALITFVDADRQWFKSSVGDKMTESPLSVGFCPAVVQNGDSLIIPDTQADP QFTTNPAVCQQDVRFYAGVPLITSDGYVVGTLCVLDFTPRQMSEEQINGLRILSRQVM TQLELRLSARKVAQTNAALTDVSAGVAANVGEMFLYSLVQHLSKALGVKYTYIGLLAN REPEAIDVIAVCADGRIAENFEYLLRDTPCQEVIQQRKLCCYPRNVQQLFPDAPLLAP FQVESYAAIPFFDSTGVPLGLLGVMDNKPLEARCLRHATANALTESLLTIFAVRIATE LERQQTEKLLHETQDRVQTLLSNMPGMVYRYVPGVDGSDRFVFVNYGCCDLFEVEP" gene complement(20586..21326) /locus_tag="DP116_12680" CDS complement(20586..21326) /locus_tag="DP116_12680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311029.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_12680" /translation="MKTDILNKKPDFKILEIKDLFVNYGGIEALQNINLFVNNGEVVT LIGANGAGKTTTLRAISKIVNPRHGTIIYNGRNITRRQPHEVVQLGIAHCPEGRRVLA RQTVLDNLLLGAYIRPNQTEVRNDIQYQFDMFPRLAQRRSQLAGTLSGGEQQMLAIAR ALMSQPKLLLLDEPSLGLAPTIVREIFSIIENLRATGVTILLVEQNANLALQIADRGY VLEAGCMTLSGAASELIIDERVKKAYLG" gene complement(21421..22218) /gene="livG" /locus_tag="DP116_12685" CDS complement(21421..22218) /gene="livG" /locus_tag="DP116_12685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875621.1" /note="Part of the ABC transporter complexes LivFGHMJ and LivFGHMK involved in the high-affinity transport of branched-chain amino acids; LivFGHMK is specific for the transport of leucine, while LivFGHMJ is a transporter for leucine, isoleucine, and valine; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="high-affinity branched-chain amino acid ABC transporter ATP-binding protein LivG" /protein_id="PRJNA477356:DP116_12685" /translation="MIHKKSNQNLFGLPILEVKNLTRRFGGLVAVNDVSFTIIQYEIF GLIGPNGAGKTTLFNLMTGFITPSSGQLLYQGAEISQQHPHQIASLGIARTFQNIRLF GELSALENVRIARDCRINSNTIKGILGLPPAPREEEKSKQKALELLDLVGLSERIHEK AKNFSYGDQRRLEIARALALEPQILLLDEPAAGMNPSEKQQLSKFIRNLRDLFNLTII IIEHHVPLVMDLCDRIAVLDFGQLIALGEPSVVRNNPAVIEAYLGND" gene complement(22224..>22754) /locus_tag="DP116_12690" CDS complement(22224..>22754) /locus_tag="DP116_12690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459496.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="branched-chain amino acid ABC transporter permease" /protein_id="PRJNA477356:DP116_12690" /translation="IEYLWIALPLLLISIVLVYRLERIRVGRAFIAIREDELAASAMG VNLTYYKVWAFTLGCILAGMVGAMSAHFLNTWNARQGTFDASIIYLTFVLIGGSRTFL GPVVGGMVFTALPEVLRAIADTSGLPSWFAQFLRDGRLIIFGLLIVIGTIFFPQGLVH PDLFKKGKNRKNDILK" BASE COUNT 6468 a 5137 c 4951 g 6198 t ORIGIN 1 tctctggact gttaatgacg tttccactag ctttttaatg attaaattct atgaaaactt 61 attcaaactt gataacctgg aagcgggaga tgttgctata gcccttaacc aagctcaaaa 121 atggctgcga aatctgacca ttgagggatt agacagattt ttggaggaac acaaacccca 181 gattgaaaaa gtccttgctc aactaagagt aggacaacga aaaaatattg aggaatcttt 241 gaaactaatt aagcaacgtc aacccttacc ttttgcaaat ccctactact gggctggatt 301 tacagcttcc ggacgttaag caccagatcc aggtacttta caaaactttt tcgcacccag 361 tttaaacacg catatactaa gtgcagtcgc agtaccgaga ggaatgcctg cactagctgc 421 caacggttct ataattggag caattaatcc agattctaat gcttgtctga cttcctcttg 481 acagataaat tggcggcatt tttccaacca atcatttaat cctctatctg cttcctgtgc 541 agtttcccgc ataaccagca gtggaacggc atcgtctaaa tagccttcac actcttctat 601 agtatccaaa gctactaaag cctggggatt atccgccaac tggctgcgaa actgactaat 661 ttcttctggt gtaacggtaa tcatttaata gttttaacca gtaaataaat atccgaactc 721 ttcagcaatt atagatctaa tgcttaatct agaaatgtgg cgatggcgat 
gcggaagccg 781 ccacctccgg tgatcgcact cgtaccaaat ccggatctaa tacccccttt attatttagt 841 ccgcgtaggc ggactttgtt tgtgtagccg cgatttctaa tcgcctgggt taaaaggggg 901 ttataaaagc cgatttagta tcacacaaca tcaacagtag gaataagtag ttggacagaa 961 ttaattacac aatgtcattg cgaatggagc gaagcggaat gaagcaatcc caaagtctct 1021 gcgattgctt cacttcgttc gcaatgacaa ctactccctt cccggtgtaa gattacctat 1081 gttctttttg attttcttgg tgtccttggc gtcttggcgg ttcgttaaat taggtattct 1141 tctggcggga agggagtaac cagccggaca tgatattaac tttcctccaa caaccgttgc 1201 aacgccaccc tcatttcttc aggagtcgtc gttgtggtgc caaactgtcg caccacaata 1261 ctagctgcca aattacctaa aactgcagct tcccaagctg acgccccaac agttaaacct 1321 aaagttaacg ccgccactac agtatctcca gcacctgtga catcaaacac cttcgtccga 1381 ttaaaagctg ggatgtgaaa tgcgctagcc tcgcggttaa acatactcat cccttgctca 1441 ccacgggtaa tcagtatatg ttgcgcttgc gtcaacgcca gcaaatcttg tcctgcttga 1501 gtcaaagttg cttcatcagt aattgcataa ccaacagcaa gttctgcttc cggcaaattt 1561 ggagtaaaca atgttcctcc acgataccgt tccagatgtt tttgagcatc cacaatagtt 1621 cgagaagcag acagtgctgc tgaaatcacg ggctgagtta gagttccatc cccgtagtca 1681 gaacagacaa cagcatctac ggaattaatt tgttctttaa tatattgaga tatctgaact 1741 tgtaaatcta agtctggcaa atcatcggat tttcggtcta agcgaacaat ctgctgtgtc 1801 accgattgac gagaatgacc agaaatccgc gttttggtta ctgttgggcg actggagtca 1861 acaaaaacgc catcagtatt gattcccgca gcttccagaa gactacgtag cgcctttcct 1921 tgttcatctt tgcccaatag ccctactgcc aaaacttttg ctcccagttg ggcaaagttg 1981 taaatcgcat tggccccacc tcctggtact tgcttggtgg tttcatggcg caggatcaac 2041 actggtgctt cccgagaaac tctttccact tgaccagtga gaaactcgtc taaagttaaa 2101 tctccgataa ccagtacccg tgcagaagcg aatttttcca ataaaaagaa caagcggtca 2161 gccgaagcac gcagtaagat ggcaaaatca gggtctaaag acataaagca caagaatatt 2221 caacacacaa gccaatgatt ctacaaccca aactcaggaa atggcgagat gagaagtgat 2281 atcatgtctg gctatattct tgacgaccac gggtgcgatt aacgttcccg caaaatatga 2341 aaacagaaaa accacaaata attttttgtg gctactagac aaaaaggagt agtaattaga 2401 tctatgtttc aatataaggg gtatacaggt gaacttgaaa ttgatataga aagtggaatg 2461 
ctttctggac gagtgattga tattaaagac gttgtgtctt ttaaaggaaa aactgtggaa 2521 gaaactcgtc aagcttttga ggaatctgta gatgattact tagaattttg cgctgaatta 2581 ggagaagaac cagataagcc tttctcaggt aaacttcctt ttcgcacatc tcctgagcat 2641 caccgaaaaa tttttatagc ttctagaaag gttggaaaga gtattaatgc atggatggat 2701 gaaacattag tcaaagcggc tgaagaaatc attcacgctt gagataaaag gtaaaaacag 2761 tgtatgagtg tgggaaaaga gtgatttcgc attaactttt ccctccctct ctccctttct 2821 ccctccttct actcccgcca ctgcacttga acaaaatacg cctgtcgtcc ccaaaggtca 2881 cgggtatgtg tgatattttc aattttgagg ttaaagggaa tggaagttgt caggtaacgc 2941 tgtgctaaag caccaatgtc aggagcgagt tcagcagctg ttgtcgtcgc gactcccaaa 3001 cctatcagtt gctcaattag tcctcgcggt agcagcagat gcatccccac tgctagccct 3061 gctgtcagca gcatttctac tgatacattt gtcccacaac gggggtgtac agctaagtcc 3121 cattctccgt tggtgaggcg atgtagagcg agtgtgactg cacgtcgcaa atcgctgata 3181 ttaacttcac catagaggta aaatccacgc tccgtggaca aaccgcccaa gagttcgttg 3241 tcaacttgga aattacttgg tcttggtctt cctttggaag cttggacgtt ttttgatgca 3301 ctcagaagcc aaactgtagc gtgttctaga gcatgaacct gacgcagcat taaaatttct 3361 tttaatccag gcaaaaacga cagctgtctg agtaaatcag catcttgtgt gggctgaggt 3421 actgtaaaat cagaattgaa aaagttatag aaagacaagc tacctttagc agaagcgaaa 3481 gtattcattg ctacactgct tcctaagcta ataaaccaat agagatttct ttttaatgta 3541 gcgttttagc ctggagatag ttgctaaatt ttgtttacta tttggtttac tttgattgac 3601 taaaggcaaa atctttataa ggacagatag caacggcaag gttcttccat ttaaaagcca 3661 agaaagcttg gtttctctgc tctggtggac aaacttaaaa aagagcaaag ttgcgctttc 3721 tgccgcgctt ttcgtgtcct tatgtataca gttggaataa tctgaggaaa tacaacctct 3781 acccaagaca acatagtcga gtactctgtg gtgcttgatt ttgagaggtt tacacttata 3841 cgttaagttg tgaggagaac aataagtaac cagtcaaaat gagcttggca gtatactagc 3901 agtttatcta ttctttttgc tcaggcagag gcaaagtaca gcaattaaca tcattcaggc 3961 agactttgcc ccattgcaaa gcccagcgga caagagccac tcggttgtcg gtacctgttt 4021 tggtgagaat attgctgata tgattatcaa ctgtacgctt gcttatttct aatttacctg 4081 caatctcttg gttagttaag ccagcggcca ctaagtcgat aatttgcagt tctctgtctg 4141 acagactaac 
gggggtctga gactcgccac cagccataga gttactttat cctccttggt 4201 atatgtactc aatcgctcta tcccattcta gaagattctt ttgtaggtag atgtttcaag 4261 tgtaagcata aatactattt ttctttgaga gttgggttta cagagtagag cagaagatag 4321 aaaaagttca agattgtaaa ctggaaatta taaggacgct atccttgctc ccccacacca 4381 ttacacttca cctcatccca aatcagttat ttataagtaa tatttataaa atttccatta 4441 acaataatag aaataatact gatggcaaat aagcagtgct caccatcgtt aatggctaaa 4501 ttgattcata gaaattgcta tttcattgta tgaacgactg ccgacatagt gacacaaatt 4561 attgtagatt ttatactggc aattcctaat ccaaaatctg aaattcaatg agtaatcctc 4621 gtgttttgtg cctgggtgaa attttattcg accgtttagc tgatcaactg gggcgaaagc 4681 tagaagaagt tgagtcatgg actgcctacc ctggaggagc accagctaac gtcgcctgtg 4741 ctttggtgaa gctggggaca ccagtggcat ttattggcac cttgggtgaa gatgcaccag 4801 gtcatgaatt ggtagaattg ttagaaaaaa taggtgtgga tacaactgga gtacagcgtc 4861 atcctacagc accaacgcga caagtgaatg ttgtgcgttc tcttgagggc gatcgcactt 4921 ttgctggatt taaagattat gaaactacag aatttgctga tacgcgtctc aaagccgatc 4981 aaataccgca ggagttattt caagcagcag attttctggc tttaggtact ctgggactag 5041 cgtatcctga aagtgggcaa gcaattcacc gcgcacttca aatggcagag caatatgatg 5101 tcaaaatttt gcttgatgtc aactggcgtc cagtattttg gacaaatcct gacatcgctc 5161 cacaaaaaat tcaagaatta tttaagcata tagactttat caaactttca aaagaagaag 5221 ccgaatggct atttgacacc gcagatcctg gggcaattac ttatcgcctc gataccgttg 5281 aaggtgtgct ggtgacagat ggagaaaacg gttgtgccta ctgccttggc gaaaacgaag 5341 gcaaattacc tgctttttcc ctccccgttg ttgatacaac aggtgcaggg gatagctttg 5401 tggctggctt catccaccag ttgctcactc atggaattca aagcttaaac gactcagaaa 5461 ccgcgagaag aatcgtgaca tatgcaagtg caacgggggc attaacgaca atgaaaccag 5521 gggcgatcgc ctctcaaccc accgcagatg aagtagaagc ttttttggct tcgcatcaag 5581 tttagggggt agcacagcta actggaataa cagtacaaca agaaggcggg aagaaaatct 5641 tcaccgcctt ttgcccagca aagccgagcc aaaaaggtga gagtcatgtc aaaaacagac 5701 tgtgctgtag aaaaggcaat cttaaaaagc tttaatcgtc aatctctact ctccaacgac 5761 gattcgtttt ggaacggtat ccgcttccag ttcactcaaa gcaggagcgc aagtgcattg 5821 ccagatgaga ttctctttcc 
agagaatgcg attcatattt atacagatat gccctctggg 5881 tacgttcttg aagcacgaat taatggacgg tgccaaaaaa gctctcttgt cacgggtcat 5941 agtctcatca ttccacgcgg cacagcttat tggcaatcag acaaccacaa aagccaaggc 6001 ataaccttag gtttgaattt cagctttctt gaacgcactc tctctgaatc gattgactta 6061 ggttgtctga aactgcatcc tcaatttccg acgtttgatc cacttattta tcaaattggg 6121 ctagcgctca aagctgaact agaacataat ccgcattcta gccgcttata tgcagaatct 6181 gccgcgacgt ttcttgctac ccatcttttc caccattacg cccaaggcaa acaaaaggag 6241 ctaatcacaa caagtggctt gcctaagtac aaattacaac aggtcattga ttacatccat 6301 gccaaccttg agtgtaatat tggcttgact gaattggctg acttggctca aatgagtctg 6361 tctcacttct ctcgattatt caaacaatca accggatact ctcctcatca gtttgtcatt 6421 aagtgtcgcg tcgagcgtgc tcgtgaactc cttctcaaag gtgagttgtc aatcgcggat 6481 atcacttata aagtcggctt tgccaatcag ggacatttaa ccagtcattt taagcgcctg 6541 cttggagtca cacctaaagt cgttcgaggt aaatagcaca aacgtgtaaa aaaagcaaaa 6601 acttgtaaga ccaaataaga gtgtgatcgc tactttagaa atcaggacgc gccgcaagtg 6661 tcatatcttg cttggtctta caaactacga gttaaagaat catgatacct gcactcccgt 6721 cctagaagta acaaaacgag aaagctctca aggagagata ctcttatggc gataaaattt 6781 actggcggtt gtctatgcgg atctgtccgc tacgaatgtt ccgccgaacc cattgcgatg 6841 ggaaattgcc actgtcgcga ttgtcaacgg gcgacaggaa gtgcatatgc ctccgggctt 6901 ctcgtcccgc gaagtgcagt caccatcacc ggagatgtga aatactatga cgtgattggt 6961 gatagcggaa gcattgttgg tcgaggcttt tgcccaaact gcggttcccg attatttagc 7021 aaacccccaa tccctgagct tctgggcatt ttggcaggga gtcttgatga cccaagttgg 7081 ttccaacctg cgatggattt ttacacagct agtgcccaac cctgggatta catgaaccca 7141 gatctaccca agttcgacaa gatgccagtg atgcagcaga gtctctgaag aaacacaggg 7201 gtatagggat gtaagggtgt aagggtgtag gggtgtaggg gtctaaggga aaaggtattt 7261 ctttcattcg tcgcgggtgg tgcgccgcca gtgcacctgc tctgaagaaa cacaggggta 7321 taggggtgtg aggatgtagg agagttgggg aagttggggt gagggaaaaa ggtgtttttg 7381 tcagctgaaa tgctaagcgt caagaacgca cgctacgcta aaagcataac gaaaactata 7441 ctttgaaaac atcctcttaa aagaactact ctatgactgt aaaaatccag accgaaaaag 7501 gctttcttac ggcaagcaat agtatcaaga 
caaatcaaga aacagaaaac gaaaaagttc 7561 tatgttctca ttgtcagcgt accgctacaa acggaattaa atgtaaaggg atgtgtgtag 7621 ctgacaatga ctactaacat aatgattgac aaaaaacact ttttatgaaa atctcacgaa 7681 tgattacaga gtcacccgtt actaactttt cttcaatttt tttctagaaa tgacctctaa 7741 agaatgaatt tttcagaaga gtctcaagct aagcttaggg tcagataatg ccatcaaaaa 7801 attaggctga aaactgtttg gctaagcacc caattgggga gtcacgcaaa aaaactgttt 7861 ctagtaaata gttacagatt agccgaaata gaacatcaat cagccgctag gaggagggtt 7921 ttatgatcgt tgtccatcat ctcaacaact cgcgatcgca gcgcgtacta tggctgctcg 7981 aagaattagg tatcgaatac gaaatcaaat actatgaacg cgatgcaaac acaatgctag 8041 caccagcgtc gctgcgccaa atccatccac tcggcaagtc accagtgatc acagatgcag 8101 atctgacaat cgctgagtca ggtgccatta tcgaatacat agtggatcgc tacggcaatg 8161 gtcggttagt cccgggatca ggtacgtcgg agcgtctgcg ctacacgtat tggttgcatt 8221 acgccgaagg ctctgcaatg ccactgctgg taatgaactt aattttcaac aagtttggca 8281 caggggacag cgccgctcaa gacgcgttca tcgcccctca gattaagctt cactttgatt 8341 atatagaagg cgagctgcgt aagagtacat ggtttgcagg cgaagaattc accgccgctg 8401 atatccaaat gagctttcct ctcgaaatgc tcgcccaact tcctgaacag gttgataacc 8461 gaccaaaaat taaggaattt gtcgagcaca tccatgagcg ccccgcttat aaacgcgccc 8521 ttgaacgtgg agcttgacta ggtacccata aactttgttg aacctgctca aagttcagca 8581 attcaaataa tttttttcgt tgcattatat cgtctgtatc tgtatactaa gaagaatgcc 8641 gaaaaccaag gctaaacttt cgccctttgc aaaccacagt cgattcaaaa ctagctagcc 8701 tgtaaaaagg tcgaacgaaa taccacgatt cgttaaatta gctgaaaata acggttgaac 8761 aatggctaaa agcttcaatc ttttggttgt ttgtttgaaa aatagggtgg gctacacccg 8821 aatttacgct tgtggacaag aaggagccga ctcccttggt agaagcaaga agcaaaccct 8881 ctaaaggaaa ctttagttga atggctagct ttaaataagt ttggtcaagt tttgtatagc 8941 aggtgcatac gtgctaacag cattcaatag ttgatgatct ttgacactcc caggcgtaaa 9001 cgcactggga ttctaagaaa tatcctacta atttttataa ccttgcagca cagatagcag 9061 cacctatgag tcctacttct tgatttagaa caacatacac gggtatatct tcaagcaggt 9121 gacgcatcct acctttttga gtgaaattga gcatgaatct gtcttcttga atcaaaggca 9181 gtattttagg agcaattcca ccagcaatat acaaaccacc 
gtagggtaga agtttgaggg 9241 caagattacc tgcttctgca ccgtaagctt ctacaaataa ctgcatcgtt tgttccgaaa 9301 ggggatcgct tttctctaat gctgctttcc caatcacagc accgggatcg acgctttttt 9361 cttgtcttcc tgcttcttgt tcccaggttc tgacgatttg ggcaatttct ggtgattcag 9421 cagcaatttt gcggtctcgt aaaaattgat aaattgctag aactcccatc ccagaaacga 9481 ctcgttctac agaaacccgt tggatatcat gtttatccag caggtatttc aaaagctgaa 9541 actccaactc ggtacggggg gcaaagtcaa cgtgtccacc ttcagaagca aaaacttgat 9601 agttctctcc ctgcttaatt aaaaatcctt gtcccaaacc agtaccagca ccaataatgg 9661 caataggggc ttcggggtta agtttgccag cctgtaaagt tagcagatct tggttgtcta 9721 aacccaaaac accataacca acagcggcaa agtcgttgat gagagttaca tgagatatac 9781 cgagttcttg ttgtaaacgt tcgctatcaa ggaaccaaac taaattagtc agcttcgcgg 9841 tattttgcac cactggtcct gcaatggcaa aacacgcttt ttctggagtt tgtgtgttag 9901 ccttgagcaa gaactgttgt accattggta ccaaatcggg aaaatctcca ctacggtacc 9961 gttcctcaca gatagtctgt aatgaaggct tttgtgatgt ttcaaccaat cgtaagatgg 10021 ttttcgtgcc gccgatgtct cctgccaata gtaatgtcat gtgagttgtt gttaattaaa 10081 ccgaatgact tcttctacct gagttaaatg ggaggtatct ccctttagtt ctagcaacca 10141 ctctgggtgt tgacgcacaa cttttactaa ggaacatagg gaagccttca cttccgttct 10201 ggcaatttca cctactaagc ctattgtcgc ccccaatacc gatgcaccat gtcccactaa 10261 aagaatatct tctggggaga attctgtcgt taggcatcta gcagtttgtc ccgaacgttc 10321 ctgcattttt tcgtaagttt ctgggtaatt tactgcaatg cgggaagtgt agctggtgtc 10381 tattctgggg aataattctg ctaacactcg ggttgagagt cgctgtggtt cttctgtcat 10441 ccagacgata ttcagccatt cgcttaagcc tgtttctagc ttgattggca aatcaagaac 10501 ttctgccaca gcatttgctg tttgtaccgt ccgcaggaag ggcgaagcga aaatatgggc 10561 aatgttttct cccttcaggc gcttagctaa ctgttgcgcc tgcaccatac catcttcaga 10621 cagtggtgga tcgtagcgtc gttctgctgt gagaaaccaa tcagggttaa caaagtcgag 10681 acggttggca tgtcttgcga tccaaactat ttgattcata gtatgatatg gcgtttttgc 10741 tagagtaaag cctgtagctc cacgtatgca gcctttcagc gtaggctgtc agttaaataa 10801 tattacaaag tgtgtaattt ctccctgatc ttgctccaaa gataagtaag ctggctaaaa 10861 aaattcaatt gtataaacaa atgtaaaatc 
actttccggg tgcgttgtca cgcagcacaa 10921 cgcaccttat gcagccgttg atgggttagc ttgaaaatag cagagagtag gaatcagttt 10981 tgatgtacgc aattcatgag agctacttac acaataaagc aaagtgcgcg tatcacctca 11041 aaccaatcat ctgtctcgaa gagaactgat tgttcgcctt gtagttttgc tcctggataa 11101 aaggaatttt cgtaagcatc gggtggatgc ttgtattcca cttcaagctt aaaattctca 11161 ggtaagggtt tcaccccaat ttgatgttga tgctgcaatg cgctttctac acctgcttga 11221 atttgcttaa cggcttcctc aggatgaatt ccccatacag actcaccaat tcccttcttg 11281 gtagtcactg tttcgatgtt tgtgtcataa acttttacag cttcacacac agcagaatct 11341 ccagacacaa atacgacagg aactttttca tacgccgcag tcatcgctac taaatggaac 11401 tcggctatgg gttgatcatt cagcagaata cagtttaatt cttcactcaa agtatgccca 11461 agaggattct ttccagatcc agatcgtgag tggtagccaa tcaaaactaa tgcagtaaag 11521 ctgctattta actcttgaac catgctgtag ggatgaccac tccatcctct aataagttgt 11581 gctggtaatg gtaatcgata gggatcaaga tttctacctg tgtcatgggc atccttaaca 11641 ataatctcag ttgcaccagc agctgttgcc ccttcgcatg ctgcaatgac ttcacgggtc 11701 atttgctctt gaaagatatc gtattcttgt ttacccaggg tcgcttcatc ccagtgactg 11761 attccagcta ttccttcgat atctgcgctg atatagattt tcatagggtc tgttaattgc 11821 catcttaatt gattgttgtc tactctcaag aaacataggg aacagggaac gcttaacagg 11881 gaacagggaa gaagggaaac agtgtttttt tcattcgtag cgggttgcaa tgccacctat 11941 tatcaatgaa acaaacacgt ctttccccta cacccctata cccctacacc cttacactct 12001 tacaccctct actcagcaat ccgggcagga gaacccaccc aagaggaact agcaggtagg 12061 gattcgcctt tcatgagcat ggacaggttg cctaggctgg aatcgggttc catgacagta 12121 tcgtagagaa cgatcgcacc actgcctacc gaacagcgat cgccaatagt cacagttgat 12181 atcttcatca cacggtcttc aaacaggtgg gtttgcaaac cacaattact attgagtgcc 12241 acatcattcc caacctctat tgtgtcaaat tcggtgatat cggtggtatc ggtgtaaacc 12301 tgtctaccaa ttttacaacc cattaagcgc aaataccagt taataaaggg agtgcctcgc 12361 agcatatcca ccagtaacgg aactgccagg gtttcatgga tacaagtaac tagttccgag 12421 cgaaacacga aattcgacca caagggacgt tctgtcgatt tataccgacc gatgattagc 12481 catttagcaa tgatggtgat cgccgtggct gccagtccaa agccgagata gaggaacggc 12541 aatatgagta 
tcaatgctga ttcatcccaa tcatcgttca gcctaatcat tgtatcaata 12601 agtactgaac ttagaacgac gatgcagcta agagggaaca gcactcgaac tgcctcaaag 12661 ctcatgcgtt gcaccaccag actaatcggt ggcttgtaag tcgattgtac tgaaaaaccc 12721 tggacaattt gccgttgtgg taggttaatg gctggggaac cgaaccagga ggtatcaggt 12781 ttagcaacta agcttttatc ggcgggagga acagacatac agccaatgag ggtgtcatcg 12841 ccaatgatca cgcctgctgg taataaagca ctgttaccaa tgaaagcacg cttaccaatc 12901 cgtgtcgatt ccaaagacat ccgtccacgc tcaactctgg gtacacccat aatcacacca 12961 tctgcaataa agctttcatc gtcgattgtg actaaatcag gaaccacaga ggatggtgta 13021 gagatttctg tccgccatcc tagctttgct cccagcaggc gataccaagg tagcagatac 13081 agcgtagcgt aaagcggacg aataatatcg aggcttaatg ccatcaactt atcaacatac 13141 cacaatctta cgtaacgatg actgtctaac cgacatgatc ccggtttaac acgtccaacg 13201 attaaccact tgagtgcggc aatttgtaat gccatcaaga tgacaaacga taaggctatt 13261 attggtgaaa acagcagcca attgtgctga tcactgatgg agtacatttg ctcgactcca 13321 ggaagaattg gcagaagttc caaaatcggt agcgtcaaca ataaaattgc ctggagagtg 13381 ccgaaataaa ttcgtcgcag tcgcgcagga cgcgatatca ttttgctttc gttgattccc 13441 acgggagcag caggagaacc agcccaaact tcccttgctg gaatccgttg tagcggtggc 13501 agcattgata aatcttccag actggcattt tcttccatga ttgtgtttcc gcttagtaca 13561 gcactagcac cgacaaaaca acgattgcca atcttaatcg aaccaatttc cagccaaccc 13621 tgttccaccg tagcatttag caatcgtgct tcgtagccaa ggctactatc tgcaccgata 13681 cttaccaagt caggaacatc gatattaacc gaattcaggt aaacattagt tccaatttta 13741 gctcccagca gttggtagta gatattgagc aacggagtac cactcaacaa gtgcgatggt 13801 gtaatcgata gcaagttttt gacaaaccac cagcgcaagt aaaaactgcc ccaaagcgga 13861 tattttcctg gtttgactcg tcccaaaagc aaccacttcg caacaatgga gaacccaagc 13921 atcaagggaa tgatccctgc caaagcagcg atggacaaga aagcggactg aagattgctg 13981 gcgtcgttct cttgcgtcca actgtacgtc agataaggca gcaaccactg taaggcaaag 14041 caaaatagaa tgactattag tcctaatgct tgaacggtta cgctcctcag gtagcgtttg 14101 cgactcgtac gatggaaggg gagaggggca gttggaacgt ttggcaaagc tgtttgttgc 14161 gtcaaccggg ctgctaatcc agcaatggta gggcattgat agacatcaag catggagacg 
14221 tgactaaact caggttgctc tcgcagttga gacactaagc tggcagccaa aagtgagtgt 14281 cctcccaagt cgaggaagaa gtcatccttt atagaaattt gggcgtttgg aaagagctgt 14341 tgccagatgt tggcgatacg cgtagcgtca agcctctggc ttatcgcgga ctcctccttt 14401 gataagctaa ggggcagcgt ctgatcactg ccatgctgac gttgcatcct ttgggttggc 14461 gcaggcagct ttgagcggtc tgccttaccg ttgggtagcg tgggaatggt gtccaaaatt 14521 tcgatgaatg caggcaccat gtagctgggt aggcggctgc gtagcgacgt gtacaggcaa 14581 tttttcaaat ctttgggatc agcaacgttg acgcgcaatg tgatgtatgc tgccaactct 14641 tgtactgcgt tagatacaag cgaaacaata gcgttttcga cttctggatt ttccagtaaa 14701 accgcttcga tttctgtaag ttcgatgcgg tagccacgta tctttacttg atgatcgatg 14761 cgaccaagat attcaatctc cgagttggag gtgaatcgac ctaagtcgcc agttcgatac 14821 attctggcgt ttgaggaatt ctgctcaaac ggatcgagga caaacttggc agcggtagcc 14881 tccggaaggt tgatatagcc ctgtgccacc ccaatcccac tgatacaaat ttcccctgat 14941 tctcctggtg gaaccggatg catattctca tcaaggatat aaaccttgta tgttggtagc 15001 ggtctaccga tggtcactgg taagtctggc tgcatctcgg tccaggttgc agtaatcgtt 15061 gtctcggtgg gaccataggt attgagcatc cggcaacctg gtttgctcca gcggtttact 15121 aagtcacgcg ggcaagcttc accgccaacg attaaaaggc gcacagaagg cagatcacaa 15181 tctattgttg ccagcagtgt aggtacacaa cttagcactg tgacttcttg ctctttgaga 15241 aaaacagcta gatccgatcc cacgagctgg taattgcctg gagcggcaat taaggtcgca 15301 ccaacactga aagttaccca gatttgctct attgagaaat caaaggcaat ggttattcct 15361 tgataaaccc gatcctgttg agtgactttg taaattggtg tgcaaaatgt gaagtagttg 15421 caaatgttgg aatgattaac agcaacaccc ttggggcgac cggttgaacc tgaggtgtag 15481 atgatgtagc aaagttcgtc tgcggtgtcc gcaattgcaa tacggtttgc tggcccaatg 15541 gcgatcgctg ctgctacagc atcaagcaac aagacttcac agcacgctcc ggcagttatg 15601 tcaacaaact gactggttgt taccaacaaa ttgagggagg cattttcagc aatgaaggca 15661 atcctgtctt ggggaaaaga cgggtcaagc ggtacaaagg cagcaccaca ctttagtatt 15721 gccagcagtg tcacataagt gttaacagag cgttcgagca aaataccaac tctgttccct 15781 gaactgatac ctctacgcaa aaggtaatga gctaactgat tcgctcgtgc gtcaagttca 15841 gcgtaactaa gacgttccgt atcgcagata agagcaatgg 
cgtcggggtt agcatcacat 15901 cgtctttcaa aaaagtagtg aagacgttgt ggcggtattt ctagctgata agactgagca 15961 tttatagaat caattgtttc ggaatagtgt tgcgtgttac tacaatattt attttccatt 16021 acactcctcg gaaaattgct aatacgcttc tgtgactggc ttctgtggtg cattgcttaa 16081 ctcaaaaagc agccagtttc tacggttcaa ggaaatgtgg agatagttat agttaagaac 16141 ttgaagcaaa gtatatcata atcaacgatt tttgtttgaa aagtctgcgc tcatttcaag 16201 taactaatca agttaaacct aacctttatg aaatttgtat aacaaaggtg ttttgctgtt 16261 ttttttaagc tcccgacctt ccaaaagatt tgctgctacg ttcatgcatc ttcaaatctt 16321 aaaattttat ttaaaatcaa gtatttgtag acagctatat tgtaggcagc accaaaaagc 16381 taggtacgct taaactgctt gctatctttt ctttaggaag atattgcaat agatagcaat 16441 gtgtaggaaa aaacagctta tatacactac ttcagaagtt taatcgtttt gtgtggcgag 16501 agaatggcaa taaaatggca atgttgtgac aagaaaaaca cagtattaat actcacttgc 16561 agtgcatcac aacttgttat gaaaatccat cagtatctgc ctacgctaac ctacaatttt 16621 cttaactcaa ccgtattgag ttaggaggaa atagaacacg aggaaatgcg aggtcttgat 16681 aatatggtgt tgacaacgat cgcaagcaca acatctggat caattggttt ggctatgtga 16741 tgtagaaatc ctgctgcgat tgcgcaaagc gcagacgcaa tcatgcgtat cgctcgctgt 16801 tgatcgagtt ctgccgcata agcagtcagg gcgattgccg gaacttgtcc accttgctca 16861 accggaaggg cacgaatttg gcgcatcagc atgtagccat ccatctctgg cattccaata 16921 tcgctgacga cgatatcggg aattgattgg gaaaaagctt ggagtgcatc aattccagat 16981 ccaacggttg taacgatcgc atttgcttgc tctaaaataa aagcgatgaa ctgccgcgaa 17041 tctgcctcat catcaactac taagatgcga atgccgctca agtcgcttgt attaggtgat 17101 aattgctgtg gagtaggtag ttcatttaac tggggagcaa gcggaatttg aacagtaaag 17161 gtggctcctt gtccttcacc gggactgtct actgtgacag ttccaccgtg catctcaaca 17221 atttgccgca caatcgccaa tcctaacccc aacccgccaa acttgcgcgt tgttgctcca 17281 tcttgttggc gcaaatgctc aaacacatac ggcagaaagt cagaactaat tccttttccg 17341 gtgtctctga cttggatttg ggcatgggtt cccacttggg ctaactcaac ggcaatttgt 17401 ccgcctggag gagtaaattt cactgcattg cttaagagat tccacaccac ttgctgcaat 17461 cgtcccgcat cgccgttgac tgtgccgact gcgtccggga tggttgtttg aatgtgcagc 17521 gactttgctt ctgcggctaa 
tcgaaccgtt tcgagggcgg cacagatcac cgctttaaca 17581 tcaacgggtg ttattgtcaa gctcaactta ccctgcagaa ttcgggagac gtcaaggaga 17641 tcgtcaatca gctgcacctg caatttggcg ttgcgatgaa tcgtctcaag ggcatgagcg 17701 gtttttgcag gattgagttt accctgttgc aaaagttttg accatcccag aattggatta 17761 agcggtgtac gtaattcgtg ggagaggact gccaaaaatt cgtctttgat acggtttgct 17821 cgttcagctt ctgctcttgc ttgttgttct tgagcaagga tacgatctcg ttcaatgtca 17881 aattgaatgc gatctgccga aggcagcagc gaagctatcg ctaatatccc gtgaaatacc 17941 cagaagtcgt tctggctgtc cggtggaatc tagaatcggt gaaaccatca cttcccacca 18001 tttcggaatt ccatcggcgg taggacagta accttgaaat cgacctactc cgcccgataa 18061 cgctgctgcg atcgcagctt gtgctgcttg tcggtcctcg ccttgccaaa attctatcca 18121 atcgctgtta acataacagc taaagtcaca aatttttagt aaacgctgcc cacctgaatt 18181 tatgtaaagt atcttcccat ctaagtcgag tactttgata cactcgtagc tgctctccaa 18241 aattttccga ttcaattctt cgctttcacg cagtaacgca tctgcttgct tacgttcggt 18301 aaagtcaacc accactccta tcaatcggga cgggaggttg tgtgcatctc ggtaaacgct 18361 acccttcgcc caaatccaac gaagttctcc atccgagcgg ataattctgc attcaaaatc 18421 ccaatctgcc caagttgaga gcgtgtgctg gaacttttga gcaacagatg ctcgatcatc 18481 agggtgaacg tatgttaaga agatttcgta gttccattct ggcagcagcg actcataacc 18541 aaaaatttgg tcgtgtctga gcgatcgcct cgcggtgtag ggttgtctcg tcaaatctag 18601 atcccactcg ccaatctggg aagagtcgag cacaaaccgt aatcgcgctt cactttctcg 18661 cagggcagct tgagcttgtt ttcgagcagt gatgtccatt gacataccta tcatacgctc 18721 tggattgcct gttttgtcat aaagtgaaca accccgtatc agtgcccagt gaatactagc 18781 atcgtcccaa acagtgcggt actccacatc gtaatcagtg cgattaacga ttgagtcttg 18841 tattgctgct tgcacccaag cccgatcatc tggatgaatc cgttccatca gaacctggtg 18901 cgaaaactca gctgaaagtg gaacaccaaa gttcactttg cattggttag aaacagacaa 18961 aacgtttgtc ttgagatcaa gttgccaaga gccaagctta ctggtttgta gtgccagtct 19021 cagctgttgt tcgctttctc ccaaggcagc ttcgacgcgc ttgcgttcag taatgtcaac 19081 tagccagcca tcccatgcac ttccatctga ggttaactct gcactggatc ggccctgaat 19141 ccacttgagt tgccccgacg gggtgatcac gcgtcccacc caatcccacg gcagaaagtg 19201 
cttgattgca taagcaaccg atgcttgaaa ggaggctaag tcttctgggt ggatcaacct 19261 ccaaattgaa ttagcatctc ggagcgcggt ttatggctct acttcaaaca agtcgcagca 19321 gccataatta acgaagacaa agcgatctga accatctaca ccaggaacat aacgatacac 19381 catccctggc atgttactga gcagggtttg aactcggtct tgagtttcat gcagcaattt 19441 ctctgtttgc tgtcgctcta attctgtcgc aatccgtaca gcaaagattg tcaggagtga 19501 ttcagttaat gcgttcgcgg tagcgtgccg gaggcatctc gcctctaagg gtttgttgtc 19561 catcacaccc agcaacccaa gtggaacacc tgtcgagtcg aagaagggaa ttgcggcata 19621 gctttccacc tgaaatggag ccaacagggg ggcgtccggg aacagctgct gcacattgcg 19681 gggataacag cacagcttgc gttgctggat aacttcttgg caaggggtgt cgcgcagcaa 19741 atactcgaag ttttctgcga tgcgtccatc tgcacaaact gcgatgacgt caatcgcctc 19801 tggctcacgg tttgctaaca acccaatgta agtatacttc acgcctaacg ccttgctcag 19861 atgttgcacg agcgagtaga gaaacatttc tcctacgttt gcagcaacac cagcagagac 19921 gtcggtcaac gctgcattcg tttgagccac tttccgagca gacagccgta attccagttg 19981 cgtcatcact tgacgactta agatgcgtaa gccattgatt tgctcttcac tcatctggcg 20041 tggagtaaag tccagcacac acagcgttcc aacaacatag ccatccgagg taatcagggg 20101 aacgcctgca tagaaacgga catcctgttg acagactgct ggatttgtgg taaactgcgg 20161 gtctgcctgc gtatctggaa taattaacga gtccccattt tgcacaacag caggacaaaa 20221 tcccacactc aacggcgatt ccgtcatttt gtctccaaca ctcgacttaa accactggcg 20281 gtcagcatcc acaaaggtaa tcagcgccac tggagtttgg caaatctgag ctgctaattg 20341 cgtcagatca tcaaacgcag cttccggcag agtatccagg atatcgtacc gacgcagtgc 20401 tgccaaccgg tttgtgtcag taatggggag agggagtctt ggcaaagagt cttcaggcaa 20461 catttagttc taggggaact cgtgaaaaaa tcaagatttg aaaatcgaca actggctggt 20521 gtgagtcatg atcatgaaag ctttctgtaa agaattatga actacccagg ctctagttag 20581 tttcatcaac ccaaataagc ctttttcact cgctcatcga tgattaattc cgaagctgca 20641 cctgacaggg tcatacaacc agcttctaat acataaccac ggtctgcaat ttgtagggca 20701 aggttagcgt tttgttcaac taaaaggata gtaacacctg tcgcacgcaa attctcaata 20761 atactgaaaa tttcacgaac aattgtagga gctaaaccta agctaggctc atctaaaagc 20821 aaaagtttcg gttgactcat taaagcacgg gctatggcta acatttgttg 
ttcaccacca 20881 ctaagagttc ctgcaagttg actacgtctt tgtgctaaac gtgggaacat atcaaattgg 20941 tactgaatat catttctgac ctctgtttga ttgggacgaa tataagcacc taatagtaaa 21001 ttatctaaaa ctgtttgccg agctaatact cttcttcctt caggacagtg ggcaatacct 21061 aattgtacaa cttcatgagg ttggcgacgg gtaatgttac gtccattgta gataattgta 21121 ccatgacgag gattaactat tttggaaata gcccggagag ttgtcgtttt gccagcacca 21181 ttagcaccta tgagggtaac gacctcacca ttgttcacaa ataagttaat attttgtaga 21241 gcttcaatgc ccccatagtt aacaaaaaga tcttttattt ctaatatctt gaaatctggt 21301 tttttattta atatatctgt cttcatttgt ttacatatgt agttatcaat tatcagttat 21361 cagttatcag ttaaaaatta tttgataact gttcactgtt cactgttcac tgatttctcc 21421 ttaatcgttg cctaagtatg cttcaatgac cgctgggtta tttctgacaa cagaaggttc 21481 ccccaaagca attagctgac caaaatctaa aacggcaata cggtcacaca aatccataac 21541 taaagggaca tgatgctcaa tgataattat cgtcaagttg aatagatcac ggagattacg 21601 aataaattta ctcagttgtt gcttttcact agggttcata ccagccgcag gttcatctaa 21661 cagtaaaatt tgcggttcta aggcaagagc acgggcaatt tctagtcgtc tttggtcacc 21721 gtaggagaag tttttggctt tttcgtggat acgctcactc aatccaacaa ggtctagtaa 21781 ttctaaagct ttttgcttac ttttttcttc ttctctcgga gcaggtggca atcctagaat 21841 accttttatt gtattgctat tgatacggca gtcgcgcgcg attctcacat tttctaacgc 21901 cgatagttca ccaaataagc gaatattttg aaaggttcgt gcaataccca aacttgcaat 21961 ttgatgggga tgctgttgag aaatttcagc accttgataa agtaattgtc cgctagaagg 22021 tgtaatgaag cctgtcatta aattaaacag agttgtctta ccagcaccgt tgggaccaat 22081 cagtccaaag atttcatact ggatgattgt aaaagagaca tcatttaccg ccactagtcc 22141 cccaaaacga cgagtgaggt ttttcacctc taatattggt aatccaaata aattttgatt 22201 tgatttttta tgaatcattg catttatttt aatatgtcat ttttgcggtt tttccctttt 22261 ttaaataagt ctggatggac aagaccttga ggaaaaaaga ttgtaccaat aactataagt 22321 aatccaaaga taattaatct accatctcgt agaaattgcg caaaccaaga aggtaaaccg 22381 ctggtatctg ctattgctct caacacttct ggcagagctg taaataccat acctccaaca 22441 actggaccga gaaaagttct ggaaccacca attaacacaa aagttaagta gataatactc 22501 gcatcaaagg tgccttgacg tgcattccaa 
gtgttgagaa agtgagcgct cattgcacca 22561 accatccctg ccagaataca ccctaatgtg aatgcccaaa ctttatagta agtcaggtta 22621 actcccatcg cactggctgc taattcatcc tcacggatag cgatgaatgc ccttccgact 22681 cggatgcgtt ctaaacggta aaccaataca atgctgatga gcaacaatgg cagggcaatc 22741 cataaatatt caat
//
LOCUS       NODE_1392_length_22683_cov_5.355003 22683 bp    DNA     linear   BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema.
REFERENCE   1  (bases 1 to 22683)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 22683)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..22683
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            261..2297
                     /locus_tag="DP116_12695"
     CDS             261..2297
                     /locus_tag="DP116_12695"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015196789.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_12695" /translation="MLCCLNPACHNPPNPDGTIFCSNCGVGLVVLRNRYRPIKSLGAG GFGKTYLAEDIDKLNKKCVIKQLAPQVQGTGALQKATELFEQEARRLEQLGEYPQIPT LLAYFEEDNCLYLVQQFIDGQNLLKELQQQGTFSEGKIRDLLQDLLNILKVVHQHKVI HRDIKPENIIQRGDRKLVLIDFGASKQLTKTVMTAKGTMIGSLGYAPLEQMQGGEAYP ASDLYSLGATCFHLLSGIHPWELFTKQGYTWVSSWRQHLQQPVSLELRRILDKLLQED YQQRYQSAEEVLQDLNPAPPSTKVSSQPPISTPRASAKSPVKLKAWQQRLLAGVAITL GGLVLTQFVGYVRYGLFPTNPISVIASQDSDVFLTRTLTGHSNSVSSVAISPDGKTLA SGSDDKTIKLWNLVTGEQIRTLTEHSSPVNSVAISPDGNTLVSGSDDRTIKLWNPATG EQIRTFRGHTGYVNFVAISPDSKTFASGSSHKTIKLWNLTTGKQIRTLTGHFKSVDSV ALPVIIRPQSLEPDKQQVISVAISPDGKTLASGSYDNTIERWNLETGEPIRTLTGHSS QVNSVAISPDGKTLASGSGDKTIKLWNLTTGEQIRTLTGHSDWVSSVAISPDGKTLAS GSWDNTIKLWNLTTGEQIRTLTGHFNRVYSVAFSPDGKTIASSSYNTIKIWRLR" gene 2340..3119 /locus_tag="DP116_12700" CDS 2340..3119 /locus_tag="DP116_12700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="flagellar assembly protein H" /protein_id="PRJNA477356:DP116_12700" /translation="MKTDSIFYRLFQEFPNIFFELIGNSPETANLYQFSSVEVKQTAF RIDGVFLPIQDEEVPIYFVEVQFQPDTDIYLRLVSEISLYLRQNKRKNSWRGVVIYPS RDIDKGEKQDLWEFFHSQRISLIYLDELGEAASLPLGIATIKLVIEEEDTAISTAREL INRIQQSVNLQLPQKQLLELIETILVYKFPKMSREEIEAMFGLSDLKQTRVYQEGRLE AKLEAVPKLLVLGLTVEQVAQALDLDVAQVQQVEQRTQSNE" gene complement(3282..3470) /locus_tag="DP116_12705" CDS complement(3282..3470) /locus_tag="DP116_12705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875884.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12705" /translation="MLDHREDIEALEILFSRRTPDSQAIIYPSMFTEDGKPIEENNRI IEEAIAKRVQQEKNNQDQ" gene complement(3508..3714) /locus_tag="DP116_12710" CDS complement(3508..3714) /locus_tag="DP116_12710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874192.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12710" /translation="MSQLRKRIYTLQPGKHLCVIICQYLSNFYREIQLFRFSDTTGNV FILAGDELQILVFRDGTWRFVNET" gene 3950..4162 /locus_tag="DP116_12715" CDS 3950..4162 /locus_tag="DP116_12715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12715" /translation="MFDLQKHYVMDENQQPVAVQIPIADFEKIEEILENYGLVHLMTE LEDNERLSKEEALKYYQSLKGKNVAS" gene 4149..4421 /locus_tag="DP116_12720" CDS 4149..4421 /locus_tag="DP116_12720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_12720" /translation="MWQVEYTKRFLKDLAALPEDMQARIEPIVFEEIQSDNPFDLGYL EKMKGYLDKYKIRVGDYRIGITVDKQTNTLICQRVAHRKDIYRIFP" gene complement(4494..6521) /locus_tag="DP116_12725" CDS complement(4494..6521) /locus_tag="DP116_12725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995520.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfite reductase, ferredoxin dependent" /protein_id="PRJNA477356:DP116_12725" /translation="MVKSAPLTTSRKPSKVEAIKERSNSLREPVATEILQDTTHFSED AIQILKFHGSYQQDNRDNRVKGQEKDYQFMLRTKNPGGLVPPQLYLTLDKLADEYGNH TLRVTTRQGFQVHGILKKNLKAAFAAIIKNMGSTLGACGDLNRNVMAPPAPFKNKPEY QYAWEYADNIASLLTPQTGAYYEIWLDGEMAISAEENPEVKAAREKNGTGTIFHDKEE PLYGTYYMPRKFKVCVTVPGDNSIDLYSQDLTLVVITNDKEELEGFDVFAGGGLGRTH NKEETFARVADPICYVEKDDVYDLVKAIVATQRDYGDRTDRRHARLKYLIHDWGVDKF RSQVEEYFGKSLAPFKPLPEFKYHDFLGWNEQGDGKLFLGISIDNGRIKDEGDFQLKT ALREIVEKYNLPIRLTPHQNVVFYDIQPKNKQAIQKILSSHGVIFELEKIDPLVRYAM ACPALPTCGLAITESERAIPGILERIRALLDKVSLQNEHFVVRMTGCPNGCARPYMAE LGFVGSAPESYQVWLGGSPNQTRLALPYMERLHDNDIETQLEPIFVYFKQQRQTEETF GDFCDRIGFDAIREFAANYESQTVAPPEITDDADGLVETMADSTTAGITEESERQEVA IANTTIATSKNRHRVTLQDEVYDRLKEISSRQNKPMTNLVNEALLDYLKNL" gene complement(6851..7396) /locus_tag="DP116_12730" CDS complement(6851..7396) /locus_tag="DP116_12730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013193140.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12730" /translation="MKRLLKSFVTISALSSVVVAPILLSAGLASALPRPTQNGTDASY VGVGVSGGLTNGGQKGDAAAFGGNVTGRYKLGDTPLSARGQLLWGEETTAIIPQISAD LGIGKGTNIYLGAGYSFVEKDGKASPLGNKDGVALTAGVESELGNSFMLYGNANLGVD AYKNSPASAFNVSGGVGYRFR" gene complement(7676..8539) /locus_tag="DP116_12735" CDS complement(7676..8539) /locus_tag="DP116_12735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197476.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12735" /translation="MNHHPLFTNRVAVLATMHHKEKVIAPILEQELGIKVVVPKYFNT DRFGTFTREIKRVGTQIEAARLKAETVLTMTGETLAIASEGTFGPHPGSPFIPCNREV VILLDKTNGLEIIGQEISTETNFNHRIVNSLEEAQDFADTIGFPEHALVVSLSSSPRS QDEIIKGIKTREQLVEAVQFGLNSSKDGKVYIETDMRALYNPTRMKNIAASTHDLVRK LNHACPNCSCPGFELVQRIKGLPCAYCHAPTQLTIAAVYKCQKCGFSKEKLFPDGLEK ADPAQCLYCNP" gene 9220..10581 /locus_tag="DP116_12740" CDS 9220..10581 /locus_tag="DP116_12740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873234.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bicarbonate-binding protein" /protein_id="PRJNA477356:DP116_12740" /translation="MSDFSNQFSRRKFLLTAGVSAVGSVFLKGCLGNPPDTASTSPVQ QAVATNVSAADKPETTKVKLGYLPIVEAAPLIIAKEKGFFAKYGMTEVDISKQANWGS ARDNVEIGAAGGGIDGGQWQMPMPYLISEGRITKGNRKIPMYILAQLNTQGNGIAVAA KHQGKGLSVKIAGKETFFAQLKSAKNIFKAAMTFPGVNQEFWIRYWLAASGVDPDKEV QLLTVPAAQTVANMKTGSMDAFSTGDPWPYRIVKDKIGFMPVLTAEIWKNHPEEYLAM RADWVDQNPKATKALLKGLMEAQQWCDNFNNRKELAQILSKRNYFNVPADVLNDPFMG KYDMGDGRTVNDKKMASLYWKDEKGNVSYPYKSHDLWFLTESVRWGFLPEDTLTNAKT LIDKVNREDLWRQAAKELGVPAADIPTSTSRGVEEFFDGVKFDPENPKAYLQSLKIKK VKI" gene 10632..11471 /gene="ntrB" /locus_tag="DP116_12745" CDS 10632..11471 /gene="ntrB" /locus_tag="DP116_12745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318771.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrate ABC transporter, permease protein" /protein_id="PRJNA477356:DP116_12745" /translation="MVTLQRRSRANTVDNAWLSRLQKQYPGLIPPIIAIGIFLVLWQL FAWIPGATLPGPIQVVQDTWILILYPFYDRGGTDKGLFWQILASLQRVAIGYSFAAII GISLGIVIGTSKVMSRALDPMIQLFRTVPPLAWVPISLAALRQNEPAALFVIFITAIW PILINTAVGVKQIPQDYNNVAKVLQLSRREYFFNILIPAALPYIFTGLRIAIGLAWLA IIAAEIIMSGIVGIGFFIWDAYQNNKVSEIILALVYIGVVGLILDKLMIWVQSRILPE EQK" gene 11704..13710 /locus_tag="DP116_12750" CDS 11704..13710 /locus_tag="DP116_12750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652999.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bacitracin ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_12750" /translation="MSVFVAVDQIDKVFSLANGGQYIALKGIDLQIKRGEFISLIGHS GCGKSTLLNMIAGLDLPSEGVVTLEGRRITRPGPDRMVVFQNYSLLPWRTVRENIALA VDSVLNDYSVDERKAIVEQHIDMVGLRPHADKAPAMLSGGQKQRVAIARALAIRPKLL LLDEPFGALDALTRGNLQEQLMQICQENQVTAVMVTHDVDEAVLLSDRIVMLTNGPES KIGDILEVDIPRPRKRMEVVEHPSYYSLRSEMIYFLNQQKRIKKIRARKTAAIARHGL EKVNLEIGFLPLTACAPLAIAKEKGFFAKHGLDEVTLVREASWRGITDGISGGYLDAA QMPSGMPVWMTVGGDKGRPVPVVSALTLTRNGNAITLDKRFYDQGIYSLADFRRLLQY STDQRHTLGMVHPSSMHNMLLRYWLAAGGIDPDHDVSLKTIPPAQMVVDLQAGTIDGY CVGEPWNLRASMEGIGFTIATDLEIWQGHPGKVLGVREDWATAYPNTHIALVKALLEA CRYCADENNFQDVREILSRQEYVSTTEDYIQLGDPNSYVCSLEQPMRQYAHHLFFGDG VNRPSRTEHLWMMTQMARWEDIPFPRNWLEILERVCRVSVFSTAARELGLLDIKYNRG PIELFDGSVFNADDPIAYLNSLDIKRNFSVAEIVINSRQVSAVA" gene 13770..14612 /locus_tag="DP116_12755" CDS 13770..14612 /locus_tag="DP116_12755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_12755" /translation="MYERTFSNTYRQSGVTATRQNPFLLIENVSKIYPTSKGPYTVLQ DVNLTVNEGEFICVIGHSGCGKSTLLNMVSGFATPTHGSVLLNSKPITQPGPDRMMVF QNYALLPWLTTSENIYLAINAVFPDKPKAQKSAIVREHLALVGLTEAADKKPTQISGG MKQRVAIARALAIRPQVLILDEPFGALDAITKEELQEELLKIWNERRCTVLMITHDID EALFLADRLVMMTNGPAAKIGEDIQIPFSRPRDRARIMEDPEYYRLRNHVLDYLYRRF AHDE" gene 14750..15238 /locus_tag="DP116_12760" CDS 14750..15238 /locus_tag="DP116_12760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319816.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12760" /translation="MRILILKTTITTATALIIALTGITPAQAITLNFTWNGNNNYSAL GSFSYDENTAPAIFSEKGAGQTDVLQSLNISFFDPLNNLIATYNNVVDGVSIPNYFQF NFNTVTQEIFGLIDLGGEIAGDTYLKGTVNSDLSLFQVPQSDSDARIDSNSGAIVVKP IP" gene complement(15398..16660) /locus_tag="DP116_12765" CDS complement(15398..16660) /locus_tag="DP116_12765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316113.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cysteine desulfurase" /protein_id="PRJNA477356:DP116_12765" /translation="MTFTQEKTLADQVRADFPILHQEVNGKPLIYFDNAATSQKPLLV LNTIRDYYEQYNSNVHRGVHALSAKATEAYEVARDKVAAFVNAASRQEIVFTRNASEA INLVAYSWGMSNLQPGDEIILSVMEHHSNLVPWHFVAQKTGAVLKFVELAAEETFDLE QFKKLISDKTKLVSVVHVSNTLGCINPVKEICEIAQKHGARVLIDACQSVPHMPINVQ QIGCDWLVASGHKMCAPSGIGFLYGKLDLLRSMPPFLGGGEMIAEVFLDHSTYADLPH KFEAGTPAIGETIALGAAIDYLSSIGMDKIYTYEAELTGYLFEQLEQIPQLRIYGPKP KVAGLGRAALAAFTAGEIHANDISTLLDQEGIAIRSGHHCTQPLHRHLVVPATARASL SFYNTREEIDVFVKALKETLDFFGGFLA" gene complement(16697..18148) /gene="sufD" /locus_tag="DP116_12770" CDS complement(16697..18148) /gene="sufD" /locus_tag="DP116_12770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316114.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Fe-S cluster assembly protein SufD" /protein_id="PRJNA477356:DP116_12770" /translation="MSVLVSPSPVPNSNPVDLASAVLDRDAELTGLLNQIIDDHTSFD GADAWLQELQERATRVVRKSVLPTTGDEEWRFTDLSSLRKVKFEGVGSQPADISLSDI HSVPEAANRLVFVNGVYAPELSMVAGLPDGVVVSNLAGLPVGDRQRVQQYLAQTEGAH EVFTALNTSGIKDVAVVWVGKNVVVETPIHLVFVAAAECATISLPRCLVVAETGSSVT LVEEYTNRRGAEGAEEEEEKRVYFSNAVTEIWLEENAQVNHTRLELENAEAFHIGKTA VSQARYSRYTCHAITFGGRLSRHNLEILQAGEQTETTLNGLTVIGGNQLADTHSAIAL NYPYGRSQQLHKCIIGVRVRAACPQGSAAPSEHRAHAVFNGKVFVPKPAQLTDAAQLN RNLLLSPKARVDTKPQLEITADNVKCAHGATVSQLEDDEIFYLQSRGIDQDNARNLLI NAFAAEVINQIPVPSLQERLTQTVNSYQSITND" gene complement(18148..18936) /gene="sufC" /locus_tag="DP116_12775" CDS complement(18148..18936) /gene="sufC" /locus_tag="DP116_12775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112309.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Fe-S cluster assembly ATPase SufC" /protein_id="PRJNA477356:DP116_12775" /translation="MIIENSEVVLSVRDLTADVDGTPILKGLNLEVRSGEIHAIMGPN GSGKSTFSKVLAGHPAYEVTGGEVIFQGQNLLEMEPEERARSGIFLAFQYPLEIPGVS NLDFLRVAYNSRRKAQGLEELDAFDFDDLIEEKLDVVKMNPAFLSRSLNEGFSGGEKK RNEILQMALLEPKLGILDETDSGLDIDALKIVANGVNQLTSPENATIMITHYQRLLNY IVPDFVHVMANGRILTSGGKELALELESRGYDWVLEDALDEVGV" gene complement(19041..20480) /locus_tag="DP116_12780" CDS complement(19041..20480) /locus_tag="DP116_12780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996648.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Fe-S cluster assembly protein SufB" /protein_id="PRJNA477356:DP116_12780" /translation="MSATVKTLVNQPYKYGFITDIEADTIARGLNEDTIRLISLKKNE PEFMLDFRLRAFRQWQKMAEPTWPSVKYPVIDYQNIIYYSAPKQKKEKLNSLDEVDPA LLETFEKLGLPLSEQKRLANVAVDAIFDSVSIGTTFKEKLLKEGVIFCSISEALQEYP ELVQKYLGSVVPTADNYFAALNSAVFSDGSFVYIPKGVKCPMELSTYFRINNGETGQF ERTLIVAEENSYVSYLEGCTAPMYDSNQLHAAVVELVALDNAEIKYSTVQNWYAGDAN GKGGIYNFVTKRGLCQGVNSKISWTQVETGSAITWKYPSCVLVGDNSVGEFYSVALTN NMQQADTGTKMIHVGKNTRSTIISKGISAGNSSNSYRGLVKINPKAQGARNYSQCDSM LIGDNAHANTFPYIQVQNNTAKVEHEASTSKIGEDQLFFFAQRGISSEDAISMMISGF CKDVFNQLPMEFAVEADKLLSLKLEGSVG" gene 20713..21390 /gene="sufR" /locus_tag="DP116_12785" CDS 20713..21390 /gene="sufR" /locus_tag="DP116_12785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="iron-sulfur cluster biosynthesis transcriptional regulator SufR" /protein_id="PRJNA477356:DP116_12785" /translation="METTHQSSTKQDILEYLLKHSQATAFELAEALDISPQAIRRHLK DLEGEELISFSLSVQGGMGRPQHVYQLTRSGRDRLRPDGADGYGQFAVSLLDTLAETV GRDQVSSILRKQWERKAEEYRDRLGNTSLSERVAILVKLRKAEGFMAEYHPVESSESS QGDGFILTEHNCAISNVAESFPSICGHELEMFATVLPDCTVERTHWIINGEHRCGYLV QERKNKL" gene 21550..21810 /locus_tag="DP116_12790" CDS 21550..21810 /locus_tag="DP116_12790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655183.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12790" /translation="MELPSPQSTSQFITPEELVKVDAALLSSPEKFLTRLTISSLRLL KQIAQENEVNIEDLTAQQVIDWFEKDGKIRREQGIEAAYLKW" gene complement(21939..22649) /locus_tag="DP116_12795" CDS complement(21939..22649) /locus_tag="DP116_12795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315038.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CHAT domain-containing protein" /protein_id="PRJNA477356:DP116_12795" /translation="MPETPKKQSNFNLQGAQFAGGFVNADSVNAHQIGGDITNYNQVA RTEANNSALKTILILAANPKNSLPLRLDEEVREIEAGLQRAKKRELFDLKQRWAVRVQ EVYQSLLDFKPQIVHFSGHGTGDHGLALEDETGNVRLVDTQALAGLFELFASHIECVV LNACYSEVQAQAIVKHIPYVIGMNKQIGDQAAIKFATAFYNVLGAGESIEFAYNLGCN VIQLEGIPEYLTPVLKKK" BASE COUNT 6602 a 4986 c 4690 g 6405 t ORIGIN 1 ttaggggcta gcggtgagac agcgctgcag gagggtttcc ctccgtaggc gactgcgaac 61 ccgttcggcc tcaagccgtg ccgaaggcat agggcttccc gttcgccgtt aggcgttccg 121 ttggcgcagc ctctccgata ggagaaggaa agggtagcga agtcatcgca acaacaagat 181 attctaaata aatcaataaa tttacttaaa aatcttggta gttcgtagat aatatatcta 241 tctgtcagtg gatgcagctc atgctctgtt gcctcaaccc agcctgtcat aatccgccga 301 atcctgatgg cacaatattt tgctccaact gcggtgtagg attggtagtg ctgagaaacc 361 gctatcgccc gataaaatcc ttaggtgctg ggggatttgg caaaacttat ttggcagaag 421 atattgacaa attgaataaa aaatgcgtaa ttaaacaact tgcgccacaa gttcaaggaa 481 ctggcgcact gcaaaaggca acagaattat ttgagcaaga agcaaggcga ctcgaacaat 541 taggcgaata tccacaaatt ccgacgctgt tagcttattt tgaggaagat aattgtttgt 601 atttggtgca gcagtttata gatgggcaga atttattgaa ggaattacaa caacaaggaa 661 ctttcagcga gggaaagata cgggatttat tgcaggattt gttaaatatt ctcaaggtag 721 tgcatcaaca caaagttatt caccgggata ttaagccaga gaatattatt cagcgtggcg 781 atcgcaaact tgttttaatt gactttggtg catctaagca actgacaaaa acagtgatga 841 ctgcaaaagg aacaatgatt ggttctcttg gttatgcacc actggaacag atgcagggag 901 gagaagctta cccagcaagt gatttatata gtttaggtgc aacttgtttt catctgttaa 961 gtggaattca tccttgggaa ctgttcacta agcagggtta tacgtgggtg tcttcttgga 1021 gacagcattt acagcaacca gtgagtctgg aattaaggcg gattttagat aagttgctgc 1081 aagaggatta tcagcagcgt tatcagtcgg cggaggaagt tttacaagat ttaaatccag 1141 cgccaccgtc tactaaagta tcttctcaac ctccaatatc tacgccaaga gcatcagcaa 1201 agtcaccagt aaaactaaaa gcatggcaac aaagattact tgcgggtgta gctattactc 1261 tagggggatt agtgttaacc cagttcgttg gctatgttcg ctatggctta tttccaacta 1321 atcccatatc tgtaattgct agccaagata 
gcgatgtttt cttaacaaga actttaacag 1381 ggcattccaa ctcggttagt tccgtcgcca tcagcccaga tggcaaaacc cttgccagtg 1441 gtagtgatga caaaaccatc aaactgtgga atctagtaac tggagagcaa attcgcaccc 1501 tcactgagca ttctagcccg gttaattccg tcgccatcag cccagatggc aacacccttg 1561 tcagtggtag tgatgacagg acaatcaaac tgtggaatcc tgcaacggga gaacaaatcc 1621 gcactttcag agggcatact ggctatgtta attttgtcgc tatcagccca gatagtaaaa 1681 cctttgccag tggtagttct cacaaaacca tcaaattgtg gaatctgaca acgggaaaac 1741 aaatccgcac cctcactggg cattttaaat cggttgattc cgtcgccttg ccagttataa 1801 taagacctca aagcctggaa cctgataaac aacaagttat ttccgttgcc atcagcccag 1861 atggcaaaac ccttgccagt ggtagttatg acaacacaat cgaacggtgg aatctggaaa 1921 caggagagcc aatccgcact ctcacaggtc attctagcca agttaattcc gtcgccatca 1981 gcccagatgg caaaaccctt gccagtggta gtggagacaa aaccatcaaa ctgtggaatc 2041 tcacaacagg agagcaaatc cgcaccctca cagggcattc tgactgggtt agttccgtcg 2101 ccatcagccc agatggtaaa acccttgcca gtggtagttg ggacaacacc atcaaactgt 2161 ggaatctcac aacaggagag caaattcgca ccctcacagg gcattttaac cgtgtttatt 2221 ctgtcgcctt tagcccagat ggcaaaacca ttgccagtag tagttacaac accatcaaaa 2281 tttggcggtt acggtagtga taaaatcgct acagctaaaa actacgccaa aacctaagtg 2341 tgaaaactga cagcatcttt tatcgtctct ttcaagaatt tcccaatatc ttctttgaac 2401 tgattggtaa ctctcccgaa acagcgaatc tctaccaatt ttcttcagtt gaagtcaagc 2461 aaacagcctt cagaattgat ggtgtatttc ttcccatcca agatgaagaa gttcctattt 2521 attttgtcga ggttcagttt caaccagaca cagatattta cttacgtctt gtgtcggaaa 2581 tttctttata cttacgccaa aacaaacgca aaaattcttg gcgaggagtc gtgatttatc 2641 ctagcagaga tatagataaa ggtgagaaac aagatttatg ggaattcttc catagccagc 2701 gtatcagtct aatttactta gatgaattag gtgaagctgc atcactgccg ctaggcattg 2761 ctacgataaa attagtaatt gaagaggaag atacagccat tagcaccgct agggaattaa 2821 tcaaccgtat ccagcaatca gtaaatttgc aactgccaca aaaacaatta ctagaattaa 2881 tagaaacaat cttagtttac aagtttccga aaatgagtag agaggagata gaagcaatgt 2941 ttggcttaag cgatttaaag caaacgcggg tttaccaaga aggacgttta gaagcaaaat 3001 tagaagctgt acctaagctt ttagtattgg gcttaacggt 
ggaacaagta gcacaggctt 3061 tagatttaga tgtagcacaa gtccagcaag tagaacagcg aacgcaatca aatgaatagt 3121 aaggtaatac tgttttcaaa attcatttca gcgatattaa agtatgagta gggaattagc 3181 atcagtcaag attatctaga gctatagcaa gcgatgctcc agcagggagc cagcaaagct 3241 gatcgcacag gtaacaagta aaatatacca acaacacagg actattgatc ctgattattc 3301 ttctcttgct ggactctttt ggcgatcgct tcttcaatta tccgattatt ctcttctata 3361 ggtttaccat cttcagtaaa catcgacgga tatataattg cttgtgagtc gggagtacgg 3421 cgactaaaca ggatttccaa tgcttctata tcttcccgat gatccagcac atatgccctc 3481 aattcttttg tggacatttc tgcaaaatta ggtttcattt acaaatctcc aagttccatc 3541 acgaaataca aggatttgca gttcatcacc tgccaaaata aacacattgc cagttgtgtc 3601 actaaaacgg aatagctgaa tttcgcggta gaaattagac aagtactggc aaataatcac 3661 acataagtgc ttgccgggtt gtagggtata tattcttttc ctcaattgtg acataaataa 3721 tattaagcct tataccattt cacgaaattg gtgatacaaa ttctttcacc cctgctcccc 3781 tgctcccctg ctgcctattt gtatcaacat taaagtaaaa cggtattaca ggttgaacaa 3841 tctcggttag cccaaatgac gacttccatg ttttatgcta tgtattaccc aagcagctgt 3901 gttactccgt tggcgaggac atcatgtgcc aactccttaa gaggtaaaaa tgtttgatct 3961 tcaaaagcat tacgtaatgg atgaaaatca gcaacctgta gccgttcaaa ttcccattgc 4021 tgattttgaa aaaatagaag aaatcttaga aaactatggt ttagtgcatt taatgacaga 4081 attagaggat aacgaacgtc tttccaaaga agaagctttg aaatattacc agtctcttaa 4141 gggtaagaat gtggcaagtt gaatacacca aaagatttct caaagacttg gcagctttac 4201 cagaagatat gcaagctcgt attgaaccga ttgtatttga agaaatacag tccgacaatc 4261 cttttgattt aggatatctg gaaaaaatga agggatatct tgataaatat aaaattcgtg 4321 ttggcgatta tagaattggc attacggtag ataaacagac gaatacacta atttgtcaac 4381 gtgtagctca tcgcaaagac atatacagaa ttttccctta attccagggt ttttcaagta 4441 aataaatcac aggtagaggc acagcatcct gtgcctctac cttatgctat gcgctacaga 4501 ttctttaaat aatccagaag cgcctcattc accaaattag tcatcggctt gttctgacgc 4561 gacgagattt ccttcaatct gtcgtaaact tcatcctgaa gggtgacgcg atggcggttt 4621 ttgcttgtcg caatcgtcgt gttagcgatc gccacttcct gacgttcact ctcttctgta 4681 attcctgctg tggtagaatc tgccattgtc tcaactaaac catcagcatc 
gtctgtgatt 4741 tctggtggtg caacagtttg agattcgtag ttagcagcaa actcgcggat agcatcaaaa 4801 ccaatgcgat cgcaaaaatc accgaatgtt tcttctgtct gacgctgctg cttgaagtag 4861 acaaaaatcg gctctagctg ggtttctatg tcattgtcgt gcagccgttc catataaggt 4921 aatgccagcc gtgtttgatt gggcgaacct cctagccaaa cttggtaaga ttccggtgcg 4981 ctacccacaa agccaagttc tgccatgtaa ggacgagcgc agccgttagg acaacccgtc 5041 atccttacca caaaatgctc attttgtaaa ctcactttat ccaaaagggc gcgaatccgc 5101 tctaaaattc ccggtattgc tcgttctgat tcggtgattg ccaaaccgca agtcggcaaa 5161 gctgggcaag ccattgcata ccgtacaagg gggtcaattt tctctagttc aaaaatgact 5221 ccgtgactgc taagaatttt ttgaatcgct tgcttgttct ttggttgaat atcgtaaaag 5281 acaacgtttt ggtggggtgt caggcggata ggcaagttat acttttcgac aatttcccgc 5341 aaagctgttt tgagttggaa gtcgccttca tccttaattc gaccgttgtc gatggaaata 5401 cccaggaata gcttaccgtc gccttgttca ttccaaccga gaaagtcatg atatttaaac 5461 tctggcaagg gcttaaaggg tgcgagtgac ttaccaaagt attcctcgac ttgggagcga 5521 aacttatcaa cgccccaatc gtgaatgaga tattttaacc tggcgtgacg gcggtcagtg 5581 cgatcgccat aatctctttg agtcgcaaca atcgccttaa caagatcgta aacgtcatct 5641 ttttctacgt agcaaatcgg gtctgctact ctagcaaaag tttcttcttt attatgtgtt 5701 cgtcctaaac caccgccagc aaagacatca aatccctcta actcttcctt gtcattggtt 5761 atgactacca aagtcaagtc ttggctatac aaatctatcg aattatctcc tggaaccgtc 5821 acacaaactt tgaacttgcg cggcatatag taagtgccgt aaagaggttc ttctttgtcg 5881 tggaaaattg tgccagtgcc atttttctct cgcgccgcct tcacctctgg gttttcctca 5941 gcactaattg ccatttcccc atctagccaa atttcgtagt aagcgcctgt ttggggtgtt 6001 agcaagctgg caatattgtc ggcatattcc caagcgtact gatactcagg cttattttta 6061 aacggtgctg gaggtgccat gacgttgcgg ttcaagtccc cacaagctcc cagcgttgaa 6121 cccatattct tgattatggc agcaaatgct gccttaagat ttttctttaa aatcccatgc 6181 acctgaaacc cttgacgagt ggttactcgc aaggtgtggt taccatattc atctgccagc 6241 ttatctaaag tcagatacag ctgcggcggg actaacccac ccggattttt cgtccgcagc 6301 atgaactggt agtctttttc ttgtccctta acccgattat cacgattatc ttgttggtaa 6361 gaaccatgaa actttagtat ttgtatagca tcttcactaa aatgagttgt atcctgaagt 
6421 atttcggttg caacaggttc acgcaaagaa ttgctgcgtt ccttgattgc ttctactttg 6481 gaaggcttac ggcttgtggt cagaggagca gatttaacca ttagattggt acacagtttt 6541 ttcaaaaaat gatcacaatg tgccgtaaag tacttcagaa gcaccctttc ggctgattta 6601 tttccgataa tccggtcgga attaggagaa atataatgat tttaacacgc caaaaagccg 6661 ggtttttcag cgatgcaaca cttagataaa cccagcaata ttacttatac taaagttatg 6721 agtcttgact tatcatttgc cgcttgaaaa taggcaaatg attcgtctat aattcctcaa 6781 cctaacagga tatcgttcct gtcaggattt gctaatggct gtagttcaac ccgacaattg 6841 aactacagct ttacctaaag cgatagccaa caccaccact gacgttgaaa gcagaagcag 6901 ggctattttt gtaagcatca actcctaagt tagcgttgcc ataaagcata aagctattac 6961 ccagttctga ttctacgcca gcagtcagcg cgactccatc tttgttgccc aatggacttg 7021 ctttaccatc tttctccaca aaagagtaac cagcaccgag gtaaatatta gtacctttgc 7081 caatgcctaa atctgcggaa atttgtggga tgatagcagt tgtttcttca ccccagagga 7141 gttgaccgcg tgcagacaaa ggtgtgtctc ctaatttata gcgacctgtg acatttccac 7201 caaatgcagc tgcatcaccc ttttgcccac cattggtgag accaccagaa acaccaacac 7261 caacatagct agcatcagta ccgttttgtg ttggtcgagg aagagcagaa gcaagacccg 7321 ccgagaggag aataggagca acgacaacag aggacaatgc agaaattgtc acaaaagatt 7381 taagcaaccg tttcataatc tttaccgata ttttgtaggt attgatttta aatacatgtc 7441 gaaaaactaa ctacaatgtt ccagatattt tctaaaaatt tttcttgtct gttgttatta 7501 atgcagaata tatcaagtag tttctctaga tataaataag agttttataa ttacatggtg 7561 tcattttttt atccggaact ttcttttgaa aagtagacac ttgtttgact ggcacataat 7621 tctcatccca cagatacggg tctgggagga ttttgcgaca ttcagtcagt tttgattaag 7681 gattgcagta cagacattgt gctgggtctg ccttttctaa tccatctgga aataactttt 7741 ctttagaaaa tccacatttt tgacatttat atactgctgc aatagttagt tgtgtgggtg 7801 cgtgacaata cgcgcaaggt aatcctttta tcctttgaac taattcaaaa ccgggacaag 7861 aacagttggg acaagcatga tttagttttc tgactaagtc atgagtcgat gcagcaatat 7921 ttttcatcct tgtgggatta tataaagccc gcatatccgt ttctatataa accttgccat 7981 ctttggaact gtttaaacca aattgaactg cttctacaag ttgctctcta gtcttaatac 8041 ctttgataat ttcatcttga cttctgggtg aggaactcaa gcttacgact agggcgtgtt 8101 
cgggaaagcc aattgtatct gcaaaatctt gcgcttcttc taaactgttc acaatcctat 8161 gattgaagtt cgtttcagta gaaatttctt gaccaataat ttctagcccg tttgttttgt 8221 ctaataagat gacaacttcc cgattacaag gaataaatgg gctaccaggg tgaggaccaa 8281 atgtaccttc actagctatt gctaaagtct cacctgtcat agttaatact gtttctgctt 8341 ttagccttgc tgcttcgatt tgcgttccta cccgtttgat ttccctggta aatgtaccaa 8401 aacggtcagt attaaaatat ttgggaacta cgactttgat gcctaactct tgttcaagga 8461 taggagcaat gaccttttcc ttatgatgca ttgtcgctag tactgctact cgattggtaa 8521 atagaggatg atgattcata actctttaca attcctcaga taagattaac cccagagcac 8581 tcttgttttg aagagagtga cgaaatatta gatttttttc ttgactttat agtgattttg 8641 tgtgatttga aattttcttc tcaaatgatg aaactcctat cttacaccta tccgtataaa 8701 taaagggtaa gtgaaaagca gcgcgataaa gttacattac tttttgggaa ttttatcgct 8761 aaattaaggg ttatggaatt catactgagt taatccaata cattgtttgt ggggatcaaa 8821 atgtcataag aatcattaca aaaatacaat agttactata agtcaacagt tatggcttac 8881 tattggtcgt ggtaaaatct aatagttact atcgaattgc ttctataaga tagagtaaat 8941 tcaattagga tacgtatttt attaaatttt ttcttatagg aaccatataa gattttttta 9001 agattttggt tgagatatta accactgctg attcaaaagt gataatctta tctacaacag 9061 taaaatctgc tcattcaaaa cacctcagtt catgtaattg aggcggcgat aggtttgaaa 9121 ttaggtgtga ataacacatt aagaaagcaa tgaacataat tgttctgttg ccttgatttg 9181 catacccaaa aacaattgta atttacaaaa tgtaatgtta tgagtgactt ttcaaatcaa 9241 ttttctcgac gtaaattttt gttaaccgct ggagtttctg ctgtagggtc tgtattcctt 9301 aagggttgtt tgggaaatcc tcccgacact gctagcacaa gtccagtaca acaagctgtt 9361 gcgaccaatg tcagtgcagc ggataaacca gaaacaacta aagttaagct agggtatcta 9421 ccgattgttg aagctgctcc tctgattatt gcaaaggaaa aaggcttctt tgctaagtat 9481 ggaatgaccg aagttgatat atccaagcaa gctaactggg gttcggcaag agataacgta 9541 gaaattgggg ctgctggagg tggaattgat ggtggtcaat ggcagatgcc tatgccctat 9601 ctgatttctg aaggacgcat caccaagggt aatcgaaaaa tacccatgta tatcttggct 9661 cagttgaata ctcagggtaa tggaattgcg gtcgcagcta agcaccaagg caaaggtctg 9721 agtgtgaaga ttgctggcaa agaaacgttt tttgctcagc tcaaatcagc aaaaaatatc 9781 ttcaaagctg 
ctatgacttt cccaggagtg aatcaggagt tttggattcg ctactggtta 9841 gctgcatcgg gtgtcgatcc agacaaggaa gttcaactat taacagtacc agcagctcaa 9901 acggttgcaa atatgaagac ggggtcgatg gatgccttta gtactggtga cccctggcct 9961 taccgaattg tcaaagataa gattggcttc atgccagtat taactgcaga aatctggaaa 10021 aaccatccag aagaatatct tgcaatgagg gcagattggg ttgaccaaaa tcccaaggcg 10081 actaaagccc ttttgaaggg tctgatggaa gcccaacaat ggtgcgataa ctttaacaac 10141 agaaaagaac tcgctcaaat cctttcaaag cggaattact ttaacgtgcc tgcagacgtt 10201 ctcaacgatc cattcatggg taagtatgac atgggtgatg gtcgcactgt taacgataaa 10261 aaaatggcat ctctgtattg gaaagatgaa aaaggtaacg tctcctatcc atataagagc 10321 catgacctct ggtttttaac agaaagtgtg cgctggggct ttttaccaga agatacctta 10381 acaaatgcca aaacactgat agataaagtt aaccgtgagg atttgtggag acaagctgca 10441 aaagaattag gtgttccagc tgctgacatt cccaccagta catcccgtgg cgttgaagaa 10501 ttttttgatg gtgtcaagtt tgacccagaa aatccaaagg cctacttgca aagtctgaag 10561 attaaaaaag taaaaattta agctcagttt tgactcagta agaaaaacca aatttcaaga 10621 ggaaagtaac aatggtaact ctgcaaagac gctctcgggc aaatactgtt gataatgctt 10681 ggttgtctcg tctgcaaaag caatatcctg gtttgatacc accgattata gcgatcggca 10741 tctttttagt tctatggcag ttgtttgctt ggattcctgg cgcaacacta ccaggaccaa 10801 tccaggtcgt acaagacact tggatactga ttctgtatcc tttttatgac cgaggtggta 10861 cagacaaggg tttattttgg caaatcttgg caagtttaca acgggtagca attggttact 10921 catttgcggc aattatcggt attagcttgg ggattgtgat tggtacaagt aaagtcatgt 10981 ctagggcttt agatccaatg atccaattat tccgaactgt accacctcta gcttgggtgc 11041 cgatttctct tgcagcttta cgacaaaacg aacctgctgc actattcgtc atttttatca 11101 cagcgatttg gcccatctta ataaacacag cagttggtgt aaaacaaatc ccccaagatt 11161 acaacaacgt cgccaaagtt ctgcaacttt ctcgtcgaga atatttcttc aacatcctca 11221 tacctgctgc tctcccttac atttttacag gcttaagaat tgcgattggt ttggcttggt 11281 tggcgattat tgcggcagaa attatcatgt ccggtattgt tggaattggc ttttttatct 11341 gggatgcgta tcagaataac aaagtcagtg aaattatttt ggctctcgtt tacatcggtg 11401 tggttggttt gatactcgac aagttgatga tttgggtgca gtcgagaatt ttgccagaag 
11461 aacaaaagta gttgttagtt gcttctcgct cgaagaatag tcatttaacg aaccgccaag 11521 acgaggcagt gcgttgcggg ggttcccccc gttgtagcac ctgccgtgcc aagagcgcaa 11581 agaaagaaga aaaagagaag atagtaatct tgtcgcaccg aaggcagtag ttattcatct 11641 tttgtcatga ggaaatggca aagggtgaaa aacaaataac aaagaacaaa ggataaaaga 11701 aaaatgagtg tttttgttgc tgttgaccag attgataaag ttttttcctt agctaatggt 11761 ggtcaatata ttgctcttaa aggaattgac cttcagatta aaagaggaga atttatctct 11821 ctgattggtc actctggttg tggtaaatcc accctgttaa atatgattgc tggtttggat 11881 ttaccaagtg aaggtgtggt gacgctagaa ggaagaagaa ttactagacc tggtccagat 11941 aggatggtag tgtttcagaa ttattcactc ttaccttggc ggacagtacg agaaaatatt 12001 gctcttgctg tagactcagt gctgaatgat tactctgtgg atgagcgcaa ggcaattgta 12061 gagcagcata tcgatatggt ggggttgcgt cctcatgctg acaaagcccc agcaatgtta 12121 tcgggtggac aaaaacaacg ggtggcgatc gcccgcgcct tagcaattcg tcccaaatta 12181 ctcttgctag atgagccatt tggtgcgtta gatgcactca cacgcggtaa cttgcaagaa 12241 cagttgatgc aaatttgtca ggaaaaccaa gtcaccgctg ttatggtgac tcacgatgtc 12301 gatgaagcgg tgctgttgtc cgaccgaatc gtgatgttga caaatggacc agaatcgaag 12361 attggagaca ttttggaggt ggatattcct cgacctcgta agcggatgga agtggtagaa 12421 catcctagct actacagctt gcgcagtgag atgatatact tcttgaatca acaaaaacgg 12481 attaagaaaa ttagggcaag gaaaacagca gcgatcgcgc gtcatggttt agaaaaagtg 12541 aacctggaaa ttggttttct tcctctgact gcttgcgccc ctctggctat cgctaaagaa 12601 aaaggtttct ttgccaaaca tggtctagat gaagtgactc tagtgcgcga agcgagttgg 12661 cgcggaatta ctgatgggat tagtggcgga tatttagatg cagcgcaaat gccttctggg 12721 atgccagttt ggatgactgt aggaggcgat aagggtcgtc ctgtaccagt tgtcagcgcc 12781 ctcacactca cccgtaatgg taatgctatt accctggata aacgttttta tgaccaagga 12841 atatattccc tggcagactt tagacgattg ttacaatact ctacagatca acgtcatact 12901 ctcgggatgg ttcatccttc ctcaatgcac aacatgctgt tgcgttactg gttagcagct 12961 ggtggaattg atcctgacca cgatgtttcc ctgaaaacga ttcctccggc tcaaatggta 13021 gttgacttgc aagcagggac tattgatggt tactgtgtgg gtgaaccttg gaacctccgg 13081 gcgtcaatgg aaggaatagg ctttactatc gctacagatt 
tggaaatatg gcaaggacac 13141 ccaggcaaag ttctaggagt tcgggaagac tgggcgactg cttatcctaa cacccacatc 13201 gccttagtca aagccttgtt ggaggcctgt cgatactgtg ctgatgagaa taattttcag 13261 gatgtccgtg agattttatc tcgacaggag tatgtcagca caactgagga ttacatccaa 13321 cttggcgatc caaactccta tgtgtgtagc ctagaacaac caatgcgcca gtatgcccac 13381 cacctatttt tcggagatgg tgtgaatcgt cccagccgaa cagaacatct gtggatgatg 13441 actcagatgg cgcgttggga agatattccc ttcccccgga actggttaga aatcctggaa 13501 cgagtttgtc gagtcagtgt gtttagtacc gccgcacgag aactcggtct gcttgatatt 13561 aagtataatc gcggtccgat tgagttgttt gatggcagtg tgtttaatgc tgatgatcct 13621 atcgcttatc tcaacagtct tgacattaaa cgcaattttt cagtagcaga aattgtcatt 13681 aactcacgcc aggtttcagc agttgcttga gaattgtaaa tttttgattc tgaattttga 13741 ttctgaattt tgaattggga aatactgata tgtacgagcg cactttttct aacacttaca 13801 gacagtccgg ggtaacagct acacgccaaa atcctttctt gttaattgag aacgtttcca 13861 aaatctatcc cacatcgaaa ggtccttata cagtcctaca agacgtgaac ctcaccgtga 13921 atgagggcga gtttatctgc gtcatcggtc actctggttg tggtaaatca actctactga 13981 acatggtttc tgggttcgcc acacccaccc acggatcagt gctgctcaac tccaaaccca 14041 tcactcaacc aggaccagat agaatgatgg tattccagaa ctatgctttg ttaccctggt 14101 tgacgacttc tgaaaacatt tacttagcga ttaacgccgt tttcccagat aagccaaaag 14161 cacagaagtc agcaattgta cgcgaacatt tagcactggt gggattgaca gaagctgctg 14221 ataaaaagcc gacgcaaatt tctgggggga tgaaacagcg ggttgcgatc gcccgcgctt 14281 tagcaattcg tccccaagtt ttaatcttag atgaaccttt tggggcatta gatgcaatca 14341 ccaaagaaga attgcaagaa gaattgttga aaatttggaa cgaacggcga tgcacagtgc 14401 tgatgattac tcacgacatt gatgaagcat tgttcctagc agatcgtttg gtaatgatga 14461 ctaacggacc tgctgccaaa attggtgaag atatccaaat tcctttctct cgtccgcgcg 14521 atcgtgcgag gattatggaa gatccagaat actacagact gcggaaccac gtccttgatt 14581 acctctaccg ccgctttgcg cacgacgaat aaatctaaca aacctaaaat ctcgtcaaaa 14641 tctcgtttcc aggtagaacc tggaaacgca atcaaagcgg cttcgccgcg agtaagggaa 14701 gcggtttgat ttcgtttata gtattctcta ttcccactaa atgaaagcta tgagaatact 14761 tattttaaaa actaccatta 
caactgcaac agctttaatt atcgcattga cgggaataac 14821 tcccgctcaa gctattacat tgaacttcac ctggaatggt aataataatt attccgctct 14881 tggttcgttt agttatgatg agaacaccgc gccagcaatt ttttctgaaa aaggggctgg 14941 acagactgat gttttacaat ccctcaatat ttctttcttc gaccctctta ataatcttat 15001 cgctacttat aataacgttg tagatggcgt atcaatacct aactatttcc aattcaactt 15061 caacaccgtc acgcaagaaa tttttggttt aatagattta ggaggcgaaa tcgcaggcga 15121 tacttattta aaaggaacag ttaatagcga cttatcacta tttcaagttc ctcaatctga 15181 ttctgatgca agaatagaca gtaattcagg agctattgtt gttaagccaa taccataacc 15241 gagttgttca gtgaagcatg aaagtgttaa attgaaactc ttaaacaggc aaatccaagg 15301 taggatgggc actgcctacg ttactgtgtt attggtgtac cgtgaaccga cagtgcccac 15361 tctacaaatt aagattttta acaattcgtt aacaatatca cgctaagaaa ccgccaaaga 15421 aatctaacgt ttccttcaac gctttgacaa aaacatctat ctcttcacga gtattgtaga 15481 aagataaact tgcccgtgca gtcgctggaa caactaaatg acggtgcaat ggttgagtac 15541 agtgatgtcc agaacgaata gcaatacctt cttgatctaa taatgtagat atgtcattag 15601 catggatttc cccagcagtg aatgcagcaa gagccgctct acccaaacct gctactttag 15661 gtttaggacc gtagatacga agttgaggaa tttgttctag ctgctcaaac agatagccag 15721 ttaactcagc ttcatatgta tagattttat ccataccaat gctgctaagg tagtcaattg 15781 ctgcaccaag agcgattgtt tccccaatag ctggtgttcc cgcttcaaat ttatgtggta 15841 aatctgcata ggtagaatgg tctaaaaaga cttccgcaat catctcacca ccacccaaga 15901 aaggaggcat tgatcgcagc aagtctaact tgccgtacaa aaagccaatc ccacttggag 15961 cgcacatttt atgaccagaa gccaccaacc aatcacagcc aatttgctgt acattgatag 16021 gcatatgagg gacactttgg caagcatcaa ttaatactct tgcgccgtgc ttttgagcaa 16081 tttcgcaaat ttccttcact gggttaatac aacccaaagt gttagaaaca tgcaccaccg 16141 acaccaattt cgttttgtca gaaatcagct tcttgaactg ttccaaatca aaagtttctt 16201 ctgcagccag ttcaacaaat ttcagcaccg cacccgtctt ttgggcgaca aaatgccaag 16261 gtactaaatt actgtggtgt tccatcaccg agaggatgat ttcgtcccct ggctgcaaat 16321 tgctcattcc ccaactgtaa gcaaccaaat tgatcgcctc agaagcatta cgagtgaaaa 16381 caatttcctg gcgcgatgca gcattgacga acgcagcaac tttgtctctg gcgacttcat 16441 
aagcttccgt cgctttggcg ctcagggcgt gcacaccgcg atgtacattg gaattgtact 16501 gctcgtagta gtctcgaatg gtattgagga cgagcagagg tttttgtgat gtagcagcgt 16561 tgtcgaagta gattaagggt ttgccgttaa cttcttggtg taatattggg aagtcagcac 16621 ggacttgatc agcaagggtt ttttcttggg taaaggtcat attagttatt agtcattagt 16681 catgggtcat tagtcattag tcattggtga ttgactgata actattgaca gtttgtgtga 16741 gtctttcttg taaagaaggg acaggtattt ggttgataac ttcagctgcg aaagcattaa 16801 ttaataagtt acgagcatta tcctgatcaa taccccggct ttgcaagtag aagatttcat 16861 catcctccaa ctgactaacg gtagcaccgt gagcacattt cacgttatct gcggtaattt 16921 ctaattgcgg cttagtatca accctagcct taggcgatag taacaaattc cgattcaact 16981 gtgctgcatc agtcaactgt gcaggcttgg gaacaaaaac cttaccgtta aacacagcat 17041 gagcgcgatg ctccgaagga gccgctgagc cctgcgggca cgctgcgcga acgcgaacgc 17101 caataataca cttatgcaat tgctgacttc taccatacgg ataattgagt gcgatcgcac 17161 tatgagtatc cgccaactgg ttccccccaa tcaccgtcaa cccattaaga gtcgtttccg 17221 tttgctcacc agcctgtaaa atctccaaat tgtgacgcga cagccttcca ccaaaagtga 17281 ttgcatgaca agtataccga ctgtaacgag cttgagaaac agcagtcttt cctatgtgga 17341 aagcctccgc attttccaac tcaagccgag tgtgattcac ttgagcattt tcctcaagcc 17401 aaatctccgt caccgcattg ctaaagtaaa ccctcttctc ctcttcctcc tctgcgccct 17461 ctgcgcctct gcggttcgta tactcttcca ccaacgtcac actactacca gtttccgcca 17521 ccaccaaaca acgcggtaaa gaaatcgtcg cacactcagc agcagcaaca aacaccaaat 17581 gaatcggtgt ttccaccacc acattcttcc ccacccaaac cacagctaca tctttgattc 17641 cactcgtatt gagcgcagta aacacctcat gcgccccctc agtttgagcc aaatactgct 17701 gcacacgctg gcgatcgcct acaggtaaac cagccaaatt actaacgaca accccatctg 17761 gcaaacctgc aaccattgat aactcaggcg cataaacccc attcacaaac accaaacggt 17821 tcgcagcttc tggtaccgaa tgaatatccg ataaactgat atctgctggc tgactcccaa 17881 caccttcaaa ctttactttc cgcaaagacg acaaatcagt aaaacgccat tcctcatcgc 17941 cagtggtggg gagaactgat ttacgcacca ctcgcgtagc gcgttcctgt aactcctgca 18001 accaagcatc agcaccgtca aaagacgtgt gatcatctat gatctgattt aataacccag 18061 tcagttcagc atccctatcc aacacagcag acgccaaatc aactggattt 
gaattaggaa 18121 ccggactagg agaaactaga acgctcatca cacacccacc tcatcaagcg catcttccaa 18181 cacccagtca taaccgcgtg attctaactc caacgccagt tccttaccac cactcgtgag 18241 aattcgtcca tttgccatca catgaacgaa gtctggcaca atataattca gcaaccgctg 18301 atagtgagta atcataatcg tggcattttc cggactcgtc agctgattca ccccgttcgc 18361 cacaatcttc aaagcatcaa tatccaaacc cgaatcagtc tcatccaaaa ttcctaactt 18421 tggttctaga agcgccatct gcaaaatctc attacgcttc ttctcaccac cggaaaaacc 18481 ttcattcaaa ctccgactca ggaaagcagg attcatcttt accacatcca gcttttcctc 18541 aatcaaatca tcaaaatcaa acgcatccaa ctcctccaaa ccctgcgcct tacgccgaga 18601 attataagcc acccgcagaa aatccaaatt actcacaccc ggaatttcca acggatactg 18661 aaacgccaaa aatataccac ttctggcgcg ttcctccggt tccatctcca gcagattttg 18721 cccctggaaa atcacctcac cgccagtcac ctcatacgcc ggatgtccag ccagaacctt 18781 agaaaaagta ctcttaccag aaccattcgg tcccataatc gcatggattt cacccgaacg 18841 tacctcaaga ttcaaaccct taagaattgg cgtcccatca acatcagccg tcaaatctcg 18901 taccgacagc acaacttcac tattctcaat aatcatgaat attctctctc ttctcttctc 18961 tttcttcctc cttcgcgccc ttcgcgcctt cgcggttaaa tattttttaa accacgaaga 19021 cacaaagaac acgaagagaa ttacccaaca ctaccttcca acttcagact caacagctta 19081 tcagcctcca ccgcaaactc catcggtagc tgattaaaca catccttaca gaagccacta 19141 atcatcatcg aaatagcatc ttctgaggaa atgccccgct gtgcgaagaa gaacaattga 19201 tcttccccaa tcttagaagt cgaagcttcg tgctcaacct tcgcagtatt attttgtacc 19261 tgaatataag ggaaagtatt ggcgtgagca ttatccccaa tcagcatcga gtcgcactgg 19321 gaatagttcc gcgccccttg cgccttcgga ttgattttca ccaaaccacg gtaactgtta 19381 ctagaattac cagcagaaat ccccttagaa ataatcgtgc tgcgggtgtt cttaccaacg 19441 tgaatcattt tggtacccgt atcagcttgc tgcatattat tcgtcagcgc taccgagtaa 19501 aactcaccta cagagttatc acccaccagc acgcagctag gatacttcca ggtgatagcc 19561 gaaccagttt ctacctgtgt ccaagaaatc ttagaattca caccttgaca caaaccgcgc 19621 ttggtgacga agttgtagat accgccttta ccattcgcat ccccagcgta ccagttctgc 19681 acggtggagt atttaatctc ggcattgtcg agggcgacga gttctacgac agcagcatgt 19741 agctggttac tgtcgtacat tggtgcagta 
cagccttcca ggtaggagac atagctgttt 19801 tcttcagcaa caatcagtgt ccgctcaaat tgtccagttt cgccattgtt gatccggaag 19861 taggtagaca gttccatcgg gcatttcaca cccttgggaa tgtagacgaa ggaaccatca 19921 ctaaagacgg cggaattaag cgcagcaaaa taattatctg ctgttgggac aacgctaccc 19981 aaatacttct gcaccaattc tggatattct tgtaaggctt cagaaatgga acagaatatg 20041 acaccttctt tgaggagttt ttctttgaag gtggttccga tagaaacgct atcgaaaatc 20101 gcatcaacag cgacgtttgc cagtcgcttc tgttctgaca ggggaagtcc cagcttttca 20161 aaggtttcca acaaagcggg atcaacttca tctaagctgt taagcttttc tttcttttgt 20221 ttcggcgcgg agtaataaat gatattttgg taatcaatta ccggatactt aacgcttggc 20281 caagttggtt ctgccatttt ttgccactgg cgaaaagctc tcaggcgaaa atccagcatg 20341 aactctggct cgtttttctt gagcgagata aggcggatgg tgtcctcgtt tagtccacgc 20401 gcgattgtgt cggcttcgat gtcggtgata aagccgtact tgtatggctg gttgactaat 20461 gttttgacag tggcactcat tggtagtttt ctcttgcgtt cgctttgtag aatctcttta 20521 aggaagggag acgggctttt ttaaccgatt cccaggtttc gtcaccgagt ggttacgaga 20581 ctgctaaaat aactatatag actaaaacaa caacattgtt gtttaatgta tattcatttt 20641 aagctagatt aacaacaaca ttgttgtcaa acttatattt ttctgcaaat caaatgaact 20701 tttgggacga cgatggagac tacccaccag tcctcaacaa agcaagatat tctagaatat 20761 cttttgaaac actcgcaggc tacagctttt gagctagccg aagcgctcga tattagcccc 20821 caagcaattc gtcgccatct taaagattta gagggcgagg agttgatttc tttctcattg 20881 tcagtgcaag gaggaatggg gcgtccacag catgtttatc aattgactcg tagtggacgc 20941 gatcgcctgc gtccagatgg tgctgatggt tatggtcaat ttgcggtttc tctattggac 21001 acattagcag aaacggtcgg acgcgaccaa gttagttcaa ttttacgcaa gcaatgggaa 21061 cgcaaagcag aagaatatcg cgatcgcctg ggaaacactt ccctctcaga aagagtggcg 21121 attttggtaa aactgagaaa agctgagggc tttatggctg agtatcatcc tgtagaatcg 21181 agtgagtcat cacaaggcga cggatttatt ttaaccgaac acaactgcgc gatttctaac 21241 gttgcagaat ctttccctag catttgcggt catgaattag aaatgtttgc tacagtttta 21301 cctgattgta cagtagaacg tactcactgg attattaacg gcgaacatcg ctgcggctat 21361 ttagtacaag agcgaaaaaa caaactttaa gtttttccat taagtgagta gccgcaacaa 21421 gtgagcattt 
ttattaagtc taaaccaaag attataggta gttagtaatc ggatttacga 21481 accagtgatt tctaacccgt tacaaccgca acatttgagt aaaatttagg caaatcttaa 21541 acaaaagtta tggagttacc atcaccacaa tcaacctcac aatttataac tccagaagag 21601 ttagtaaaag tcgatgcagc tttgctctcc tcaccagaga aatttttaac tcgattgact 21661 atttcttctc tgaggctttt gaaacaaatc gcccaagaaa acgaagtgaa tattgaagat 21721 ttaacagccc agcaggtgat tgattggttt gaaaaagatg gaaagattcg acgagagcaa 21781 ggaattgagg cagcttattt aaagtggtaa gcttatcagt tatcagttat cagttatcag 21841 ttatcagtta tcagttatca gttatcactg tttactgttc cctgttccct gttccctgtt 21901 ccctgttccc tgttccctgc cttattttaa cgacctactt atttcttctt caatacagga 21961 gttaaatatt ctggaatacc ctctagctga ataacgttgc acccaaggtt ataagcaaat 22021 tcaattgatt ctccagcacc taatacgtta taaaaagctg ttgcaaattt aattgcagct 22081 tgatcaccaa tctgcttgtt cattccaatc acgtagggaa tatgtttaac aatcgcttgg 22141 gcttgaactt ctgagtaaca agcgttgaga acaacgcact caatgtgaga agcaaataac 22201 tcgaatagtc cagccaaagc ttgtgtatcc actaatcgca cattcccggt ttcatcctcc 22261 agcgccaaac catgatcacc tgtaccatga ccgctaaaat gaacaatttg tggcttaaag 22321 tccaataacg attggtaaac ttcttgaaca cgcacagccc aacgctgttt taggtcaaat 22381 aattctcgtt tttttgctcg ttgcaatcca gcctcaattt ctcgcacctc ttcatctaat 22441 cgcaggggca aactgttttt tggatttgct gccaaaatta gaattgtttt gagtgcagaa 22501 ttgttggctt ctgttcgtgc aacttggttg tagttggtaa tatcgccacc tatttgatgg 22561 gcgttcactg aatcagcgtt tacaaaacct ccagcgaatt gagcgccttg taagttgaaa 22621 ttagattgtt ttttaggagt ttctggcata gatttacaaa aattacgggg gtaagggtgt 22681 agg // LOCUS NODE_1432_length_22071_cov_5.57580922071 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 22071) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 22071)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage       :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..22071
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            529..1146
/locus_tag="DP116_12800" CDS 529..1146 /locus_tag="DP116_12800" /inference="COORDINATES: protein motif:HMM:PF01471.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12800" /translation="MATVNETAIETAATSKDMPVLQNGDNPSKGRAVAYLQYLLISYG EDVGTSGVDGEYGPDTKAAVEEFQRERNQVSDVNPGNLKINGVVALSTWRALGDNFYR TCRTSADNSPVESDFISGGIDLPRLSRNDKGDAVRFLQQLLLGYDDITGYTNDSFDAD FGPETERAVKAFQSSSGLDDDGIVGRDTWAKLFEGSRERCNGSVS" gene complement(1330..1542) /locus_tag="DP116_12805" CDS complement(1330..1542) /locus_tag="DP116_12805" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12805" /translation="MQTSPITCSIAQAQKTLATIWHVPDELWKKLSSILAEYHVSKPI GHKRIDARAALTLLLKKKPQISRKAQ" gene 1650..2204 /locus_tag="DP116_12810" CDS 1650..2204 /locus_tag="DP116_12810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206540.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_12810" /translation="MLTSTVVSNVSSTKDFSLEEWMQNPPDHTEWVNGELVEKKGVTL KHSRIQANLAYYWRNYKDSSGQGGQVYTEVPCRTNKQGRVPDVAYLTPELFNQFGEPA VLPQSFPLIAEIVSPTDLAEEVIAKSQEYLQSGGEEVWLVFPENRWIIVTTKNQRFVF TSGEIVSTQIVLKGFSVAVDELLA" gene complement(2423..3961) /locus_tag="DP116_12815" CDS complement(2423..3961) /locus_tag="DP116_12815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126970.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FAD-binding oxidoreductase" /protein_id="PRJNA477356:DP116_12815" /translation="MSLTENVLSQLPGNVLDKLRQADRILTSLREDNPTVPTVVKETQ QPLGTVDWDVIICGGTLGILIGCALAVKGVRIALIERSILRGREQEWNISRKELQVFL ELNLLTDAELEQAIATQYNPARVSFSGGTEVWVRDVLNIGVDPVYLLETLKQRFLTSG GKLFENTPFTEAVVHPDGVMVNNQFKTRLLIDAMGYLSPIIQQARQGKKPDALCLVVG TCAQGFPENHTGDLLLSFTPVQNQCQYFWEAFPARDGRTTYMFTYMDANPQRLGLEAL FEDYLRLMPEYQGVELSQLTFKRALFGFFPSYRSPLKTPWNRILPVGDSSGSQSPLSF GGFGAMVRHLRRLTLGIHEALEIDQLSAKQIAPLQPYQPSLAVTWLFQRAMSVDVNQN IDPNQINQLLSAVFQQMEQLGESVLKPFLQDVVQFPALTKTLIKTFFTHPILVAKVIP QVGLRALLDWMVHYGNLGIYSILFLFSQMLEPWKKNLPSIPKYYWNRWTESWKYGSGS DYSD" gene complement(3981..4190) /locus_tag="DP116_12820" CDS complement(3981..4190) /locus_tag="DP116_12820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MbtH family protein" /protein_id="PRJNA477356:DP116_12820" /translation="MYQDDKEDTTIYKVVVNHEEQYSIWPADRENALGWKDAGKNGSK QECLDYIKEVWTDMRPLSLRKKMEC" gene complement(4252..5625) /locus_tag="DP116_12825" CDS complement(4252..5625) /locus_tag="DP116_12825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744602.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MATE family efflux transporter" /protein_id="PRJNA477356:DP116_12825" /translation="MTSQKQSQLTNEILQGNLVKLMFKLSIPGILGMLLLGLNIFVDA LFAGQLIGETAVAGISLALPLTNILTGFAMLVGVGSASVLSRAIGSFDIKTQSKIFGN LIVMSVVISLVITIISYSFGEELILFMGGSGKVASAGAEYFKTYMLGSVFYILAVASN QIIKSEGKIRIATIFTVIYVIVNIIFNFIFVSVFQWGIQGIALATVLAMVVHSIVNLT YFLSGKSSIPVHPKKFVLAVDLLPAILSVGIPALLTQVMGLVQSSVVFKSISYYGTQN DIAFYGATLTLTSLTYVPLNGFTQALQPVIGINYGAGNYDRLKKAYLTFVNGGITVLT FFWLLLQLSPNTFLGWLLPDVAFTSNDFLNFRILSLLIPIMPLVFLGATLFQSLGKGK IVTIIILLRSLFLFIPLVLILSNLIAVTGIYYGMLMTDILVILIVLILILIKFEHLTK MQLNKYK" gene complement(5627..5764) /locus_tag="DP116_12830" /pseudo CDS complement(5627..5764) /locus_tag="DP116_12830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454431.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="sucrase ferredoxin" gene complement(6051..7430) /locus_tag="DP116_12835" CDS complement(6051..7430) /locus_tag="DP116_12835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454434.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome P450" /protein_id="PRJNA477356:DP116_12835" /translation="MNLPAGPKTPYRLAIQQFLADPFGYVDGICKRYGDIFTIMSGST PIIYVSNPSGIKQILTNTKEITASGALNRDFALTTGNQGILQLDGLRHKHRRKLLMQA FHGERMQACGRRICELTEKIISQQAIAKPFIAYPIIEDITLRVGIEVVMGLREGERYD KIKHLFVSVLKYGQSPLFQFATKLPFGQRDLGRWSPWGYRLYLRRELFGLLYAEVQER RKQADTSCANILSDLIFAHDETGELLSDEEVRDLLLSPIFAAQDASATALAWSLYWIH RFPTIRERLLEELDRLGENPEITSIVALPYLNAVCCEVLRIYPTQLFTFPRLVESPVE IMGYELSPGTVLIGNIYSTHQREDLYPEPKEFKPERFLERQYSNYEFLPFGGGARSCI GGAFALFEMKLVLATILSRYQLALVDNRPEKPKFGGLICYPKSGVKMVMHGRRQHQGQ SQPLVVGSI" gene complement(7453..11595) /locus_tag="DP116_12840" CDS complement(7453..11595) /locus_tag="DP116_12840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="non-ribosomal peptide synthetase" /protein_id="PRJNA477356:DP116_12840" /translation="MDKQYSNLSPAKKALLEKWKGGKFQAETIPKRQTSNNIPLSFSQ QRLWFIDQLYHESSFYNIPSAFHLTGHLNITALEKSLNEILRRHEVWRTTFALVNGEP VQEIAPELTWDLPIINLEHLSGQNWEGEIKQFAVQEAKKPFNLGTGPLVRASLLRLGE QEHVFLLTMHHIITDGWSVGVFFRELTMLYAAFSTNQPSGLPELPIQYADFAIWQRVK LSSTVSGACPQDIGNRIQGTLLETQLNYWKQQLSGELPVLQLPTDRPRPNVTTFTGAK QYFTFLKTLTDALNQLSQREDATLFMTLLTAFNILLYHYTEQEDILIGSPIANRNRAE LEGMLGLFVNTLVLRNNLSGNPSFRELLHQVREVTLNAYAHQDLPFEMLVEELQPERD LSRNPLYEVMFVLQNTPMSVQSVSGLSLRTLQFDSGTAQLDIFLSMSESQEGLTGFLE YNTDIFEDATITRFVNNFQALLERIVANPEQRICELSPLTDLEREQVLFEWNQTSADY PQDSSLHQLFEQQVEQSPDALALISQTEQLTYRQLNQRVNQLAHYLQKQSVTEETLVA ICLERSIDMIVGILAVLKAGGAYIPLDPSYPVERLGFMLSDSQAEVLITQQEILEKLP TSSAKTVCLDIQFDEIAEESEENLISPSKADNLAYIIYTSGSTGTPKGVLGTHRGTVN GLHWLWKTYPFTKEEVCCQKTAISFVDSVWEIFAPLLQGIPTVIIPDAVVKDSQLFVE TLAHHKVTRIVLVPSLLRLLLDSHSHLTKNLSHLKLWITSGEALSVHLVQTFLQLMPF AKLINLYGSSEVSANVTYYDTSLLPKQATSIPIGRPIDNTQVYVLNRHLQPTPVGVVG ELYIGGDGLARGYLHRTELTQERFIDNPFVSGTKLYKTGDLVRYLKDGNLEYLGRRDH QVKIRGFRVELGEIAAAIGQHPDVRESVVIAGDDAQGSKRLIAYVVTDKQDIVSQLLH 
SLQRKLPNYMIPSAFVVLDTLPLTPNGKVDKRALPTDDFIRPNATKSFVAPRNFTELA LVKTWETLLNTSPIGVTDNFFDLGGHSFLAVRLMAQIHDRFGHNLPLSTLFENPTIEK LATIVSHPVRENSNSHLVTIQSSGSKIPFFCMHGAGGGVRQYFNLSRRLGEDYPFYAL QHTPDQEEPEIISVEETASRYLKEIRQVQPNGPYLLGGHCYGGVLAFEMAQQLQRQGE RVSLLVVIDAILPETVIKPADDDDAKFLLRMAESIKTDSNIDFSLPFEELRDLPLNEQ FHLVNKKANFIFSDKEIQDFLSYYKLFKAHVQAMRDYVPQAYPHSMTLLRAKEEIIHD FESPEFHTDDPLLGWGKCSSQPIDVIEVPGDHFSMLVEPNIQELARQLRISIDNALCN VV" gene complement(11689..16407) /locus_tag="DP116_12845" CDS complement(11689..16407) /locus_tag="DP116_12845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181757.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polyketide synthase" /protein_id="PRJNA477356:DP116_12845" /translation="MNVLDNLNNLNDKDEIAIIGMAGSFPGSKNVDSFWRNIRDGVES ISFFTNEELVSAGIDSTVVNDPLYVKAGSLIEDIELFDASFFGFSPKEAEITDPQHRL FLECVWSALENAGYDSQTYTGQIGLFAGVTYSSYLLSNIYSNRGLIESVDGFQIFIGN DKDHLPTQISYKLNLKGPSINVQTTCSTSLVAVHLACQSLLNGESDIVLAGGVSIQVP QKSGYRYQEGGINSPDGHCRAFDAKARGTIFGNGLGVVVLKRLEDALADGDFIHAIIK GSAINNDGSLKVGYTAPSVDGQREVILEALALAGVEPETITYVETHGTGTPLGDPIEI KALTQAFRASTNKKGFCGIGSVKTNVGHLNTAAGVTGLIKTIQALKHKQIPPSLHYEQ PNPEIDFANSPFYVNTKLSEWKTNGTPNRAGVSSFGIGGTNAHVILEEAPVVAPSSTS RPWQLLLICAKTTTALETTTANLATHLQQHPDINLPDVAHTLQVGRRAFDHRRMVVCH DCEDAVKVLTSQDPQRVFTYHHKPSHCPVIFMFSGQGAQYVNMGRELYDTEPTFKKHI DTCAQILQPHLLLDIRHILFPKEEQIETALHRLQQTAITQPALFIIEYALAQLWMEWG VHPQAMIGHSIGEYVAATIAGVFSLEDALAIVAKRGQLMQQLPTGSMLAIPLGEKDVQ SFIENVETFHGTSLHTDGTSVEIAAINSPSSCVVSGSREAIATLQNQLSSQEIECRLL HTSHAFHSVMMEPILEPFVQAVKKVKLNPPRIRFISNVTGTWITDNEATNPTYWGQHL RQTVKFSDGISQLLQQFEGVFLEVGPGRTLSTLTTQHLKPGAKQQVLTSLRHVKEQQS DVGFLLQTLGRLWLSGVEIDWSGFYAHEQRHRLPLPTYPFERQRYWIDAKSPSSSSNK PVTLDKRQDIADWFYIPSWKRSLLPNSTSLSGEETGNERWLLFIDECGVGSELVNRLQ QSGKNVIVVKVGQQFTKLSEGIYVINPQNRNDYDTLFQELIALGKIPQNIAHLWSVNN FGTHQGKYLEFNSLLFLTQALNNQKITDRLQLWVISNNIQEVSGNETLDPEKATMLSL CKVIPQEYSNITCRSIDVVLTNRQDEDTCVRGFPPLSKVSVAKGAEIGGGDCDGHIID 
QIINEFTAFSSELVVAYRDRYRWVQTFESVHLESAVEEKTLLRKQGVYLFPGGLESLG VVLAQYLAKTLQAKLIFIEDWAFPEKDEYSQWLETHTPEDEVSRKIQKLQEFGDLGAE VLVVRADTTNYEQMHQSLAPNNIGQIHGVIYSTGRTRENIFGSIPEIGQTELEQLLDS QRQNLTVLEQVLESIKLDFCIIFYSLSSILGGFGLALYSGVNQLTDTFSQRHNQTNSL PWIVINWDKLQLNTTQEQKTVGQASGVELAITSTESVEVFKRILSLGEGTQVVVSTVD LKARCERTFHLDSKPDSKSSSQVNSSSSYSRPNLSNSYVDPTNELEKQITEIWQEVLG IAQVGIYDNFYELGGDSLIATQLVSRLRAKFPVELPLRDLLVQAMIPRLQAEMIEQLL LEKIEELSEEEVAVLLANES" gene complement(16443..18095) /gene="panP" /locus_tag="DP116_12850" CDS complement(16443..18095) /gene="panP" /locus_tag="DP116_12850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866922.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative pyridoxal-dependent aspartate 1-decarboxylase" /protein_id="PRJNA477356:DP116_12850" /translation="MTLSKQEMPKKSSFFETEQALNYEIEEQVMQLFASSSHVTSIES QIDEITNKFSQDFLSTLDANTHIDLDYLLSNFSDSQIPVQPASLESYLKYLDNNIVAH SIHTSSPRFIGHMTSALPCFVRPLAKLMTAMNQNAVKIETAKALSFCEREALAMLHRL IYNLDDNFYAQHIQNNLSTLGILVSGGTVANITALWCARNTALGPKDDFLGIEKEGLT AALDFYGYKGAAIIGSELMHFSFDKAADLMGIGTHRLIRVPADCNNRVDIQALRQAVA QCRAQNLLIIAIVGVAGTTDSGGVDSLAEIAEIAQSANVHFHVDAAWGGPLIFSQQHR HKLAGIEQADSVTIDGHKQLYLPMGIGLVFFRDPYMAKAIEKQASYTMRKGSFDLGKR ALEGSRPGMALFLHAGLNLIGLKGYEFLIDEGIRKTQYMADRICTMPEFELLAEPDTN LLLYRYIPEHLRELVVKKQLTEIDNQLINECNERLQKIQRQIGRTFISRTTKTTTSFG KEIPIIVLRAVIANPLTTEDDINAVLNEQIQLASEISIETFL" gene complement(18168..>21560) /locus_tag="DP116_12855" CDS complement(18168..>21560) /locus_tag="DP116_12855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744615.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="non-ribosomal peptide synthetase" /protein_id="PRJNA477356:DP116_12855" /translation="TIAGLAKDIEKATKAGLGLETADIERISRSQKLPLSFAQQRLWF LTQLEPNSPFYNIPGAVRLQGQLNLKALQQSFNEILRRHEALRTNFQTIEGQPVAIIS SVTSLPLPIFDITELPSNQQQAEVRKLADKEAQRPFDLNNDLLLRVKLLRLGEQEHIV LLTMHHIASDGWSKGVLVREVATLYQAFCTEQPLPLLELAIQYVDFAAWQRQWLVGEV LKSQISYWCKQLEGAPSVLELPTDHPRPAVMTFAGATYSFELSQELSVSINKLSQQQG STLFMTLLAAFVTLLWRYTGQEDIVLGSPIANRNRAEIEGLIGFFVNTLVLRTNLAGN PTFEELLTRVREMALGAYAHQDLPFEQLVEKLQPQRSLSHTPLFQVMFVLQNAPMSPL ELPGLTLSPLESDSGGAKFDLTLYMTETGHGLVGTLEYNIDLFEQSTVSRMAGHLQTL LEGIVANPQQRLSELPLLTELERQTLLVEWNDTSVEYPQQQCIHQLFEDQVERSPNAV AVVFEDQQLTYRELNARANQLAHYLQNQGVGPEVLVGICVERNAGGAASRSLHMIIGL LGILKAGGAYVPLDPAYPQERLAFMLSDSQVSVLLTQQKLVERLPEHKAQVVCLDSDW DIISKYSQDNPTNISKVDNLAYVIYTSGSTGQPKGVFGLHRGAINRFHWMWQNYPFVQ GEMCCQKTSLNFVDSVWEIFGPLLQGVPTVIVPDEVVKNPQQFVVTLAHNNVTRLVLV PSLLRILLNTYTDLQSRLPQLKLWVSSGEALSIDLLQQFRQSLPDSTLLNLYGSSEVS ADVTCYSLSPNAPLPKCVSIGHAIANTQIYILDANRQPVGVGVPGELHIGGDGLARGY LNRPELTKEKFIPNPFSDFSAARLYKTGDLARYSPSGEIEYLGRIDHQVKVRGFRIEL GEIEAHLSQHPKVRENVVIVRSDEADSQRIVAYVVSHSGQTLTVTELRDFLESKLPNY MVPAAFVMLEALPLTPNGKVDRKALPAPDDSRPELEAVYQQPQTEVEQTIARILQELL QIENVGIHDNFFELGGHSLLLVQVHSKLQKIFQRDFPLVEMFQYPTISHLTRYLGQES SEQESFTQHSHRPESRTASVQRRKQARIEHRTGTKQKGVSSQ" gene complement(21554..22071) /locus_tag="DP116_12860" /pseudo CDS complement(21554..22071) /locus_tag="DP116_12860" /inference="COORDINATES: protein motif:HMM:PF00550.23" /note="too many ambiguous residues; incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=3 /transl_table=11 /product="hypothetical protein" assembly_gap 21561..21570 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 6373 a 4599 c 4583 g 6506 t 10 others ORIGIN 1 cactctatgt ccaccttacc agagtttgtg gttcaattac ctgaaaattg ctgtaagcac 61 tcatatcgct tcgcttctat attcaaaata gatgtaagca tccaccactt atgcacttat 121 ctttaaaatc aaacaacaat gcaaatccac caaaccctga ctaaacctga ctttcaaaat 181 gttttaattt tgaatttttc attttgaatt ggtaaggatt caggtcatac taaaatctct 241 aaaatcgttt cgtgataagg ggttacggga attagccgta aatttatatt ttcaataaga 301 atcacgtaaa gcttatgttt tggtagttta ttgcgactaa atgtatgtta ataagccaaa 361 ctaaattttg aaactggctc tgtgactggt ttaaaacacc tgaatcctta cttgaattgg 421 tataacctcc ccctctgttg tgcataagtc aaataaagcc atatctatta cttgtcagag 481 ggaagaacaa tcccacaact caaacaaaaa agacaaggag aataaatcat ggcaactgta 541 aatgaaactg caatcgaaac tgctgctacc tctaaagata tgccagttct gcaaaacggt 601 gacaaccctt ccaagggtcg tgctgtcgcc tacttacaat atctgctcat tagttatggc 661 gaagacgttg gtacgagtgg agtagacggc gaatatggtc cagacacaaa agcggctgtc 721 gaagagttcc aacgggaacg caatcaggtg agtgacgtca accctgggaa tctgaaaatt 781 aatggagtag tcgctttaag cacttggcgt gctttaggag ataacttcta tcgtacttgt 841 aggacatcgg ctgataactc acctgtagaa tccgacttta ttagtggcgg cattgatttg 901 ccacgtctga gcagaaatga caaaggtgat gctgtcagat tcctacaaca actgttgctt 961 ggttatgacg atattactgg ttataccaat gactcgtttg atgccgattt tggcccagaa 1021 acagaacgtg ccgtgaaagc ctttcaaagt tctagcggat tagatgacga tgggatagtt 1081 ggccgggaca cttgggcaaa attgtttgaa ggctctagag agcgttgtaa tggcagcgtg 1141 agctaggaag tttcagggtg aagtcagtca ccccaaaact tgatttttaa aaccacactt 1201 agcataacca cagataaacg tgtagtcaat atatgtcttc agtccttccg tagtccctta 1261 tactaaggac aaggcaacag gcaacatgag aaaagctttg agtcttatgt ttctttacat 1321 agttagattt cattgtgctt ttctacttat ctgtggtttt ttcttcagca ataatgttag 1381 agcagcacgt gcatcaatgc gtttatgacc aattggtttt gacacgtgat actcagctaa 1441 aatggaactg agttttttcc acagttcatc aggtacatgc caaatggttg ctaatgtttt 1501 
ttgtgcttga gctatggagc aagttatcgg tgaagtttgc acaggtgttg aaaattaatt 1561 gattaacact aatctgagcc taaccttttc agcttgtgct tattttgaga tagtgtctta 1621 gtaagtgctg cgaaggagaa tatatagtaa tgctaacttc tacagtcgtc tcgaatgtat 1681 caagtacaaa agatttttca ctagaagagt ggatgcagaa tccccccgat catacagaat 1741 gggtaaatgg ggaattagtg gagaaaaagg gagtgacact caagcatagc cgaattcaag 1801 caaatttagc ctattactgg agaaattaca aagactccag tggacaagga ggacaagttt 1861 atacagaagt accctgtcgt accaacaagc agggacgtgt tcccgatgtt gcttatctca 1921 caccagaact tttcaatcaa tttggtgaac ctgctgtgtt accccaaagt ttcccactca 1981 ttgctgagat tgtttcccct acagatttgg ctgaagaggt tattgctaaa tctcaagaat 2041 atttgcagtc tggaggcgag gaagtttggc tagtgtttcc cgaaaatcgt tggattattg 2101 tgacaacaaa aaatcaacgg tttgtgttta cttctggtga aattgtcagc actcaaattg 2161 tgttaaaagg tttcagtgta gcagttgatg aattgttggc ttgatgaaaa gttctgctta 2221 aagctttttt acgctccttt tgtcccacca ctacaaaaat ggctgatact cgtaattgcc 2281 acaggactcg atttaatcaa tcccaaacat ctaaaaaact ggtttacaaa ctgctgttac 2341 cgtacttcat aaacctgcaa atagtatcct cacaacctta ttcttgtaga gactcgatat 2401 atcgcgtatg actgtaaaat catcaatcag aataatcact accagaacca tatttccagc 2461 tttcagtcca gcgattccaa taatatttag gtatactagg caagtttttc ttccacggtt 2521 ctaacatttg actgaacaaa aataaaatgg agtaaattcc taagttgcca taatgcacca 2581 tccaatctag taaagctcgc aaaccaactt gaggaatcac ttttgcaact aatattggat 2641 gtgtgaagaa agttttgata agtgtttttg tcagtgctgg gaactgtacc acatcttgca 2701 aaaatggttt gagtacagat tcaccgagtt gttccatttg ttgaaatacc gcagacaata 2761 gttgattaat ttgatttgga tcaatatttt gattgacatc aacactcatt gctctttgga 2821 atagccaagt cacagctaag ctaggttggt atggttgcag tggtgcaatt tgttttgcag 2881 ataactgatc tatttctagc gcctcgtgaa tacctaatgt taaacgccgt aagtgacgca 2941 ccatcgcacc aaaaccgcca aaactcaagg gagattggct tccactgcta tctccaacag 3001 gtaaaatgcg attccaaggt gtcttgagtg gagaacgata ggagggaaaa aagccaaata 3061 gcgctcgttt aaatgtcagc tggctcaact ccacgccttg atattccggc atcaaacgca 3121 gataatcctc aaatagagct tctaaaccca agcgttgtgg attggcatcc atgtaagtaa 3181 acatgtaagt 
ggttctacca tctcttgctg ggaaagcttc ccaaaagtat tggcattgat 3241 tttgcacagg tgtgaatgat aacagtaagt cgcctgtgtg attttctgga aaaccttgag 3301 cacaagttcc cacaactaaa cagagtgcat ctggtttctt tccctgacgt gcttgctgga 3361 ttatgggaga aagatatccc attgcatcaa ttaacaacct ggttttgaat tggttgttta 3421 ccatcactcc atcaggatga acaactgctt ctgtaaatgg agtattttca aacaactttc 3481 caccagatgt aagaaatcgc tgctttaaag tttccagcaa ataaactgga tcgacaccaa 3541 tgttcagaac atctcttacc caaacttctg taccaccaga aaaactgact cttgcggggt 3601 tgtattgtgt ggcgatcgcc tgttccaatt ctgcatccgt cagtaaattt aactccagaa 3661 aaacttgtaa ttctttacga gaaatattcc attcttgctc tctaccacgt aaaattgagc 3721 gttctattaa cgctattcgc actcccttca cagctaaggc acaaccaatt aaaatgccta 3781 atgtaccgcc gcaaatgata acatcccaat ctacagtacc caaaggttgc tgagtttctt 3841 ttacaactgt tggtacagta gggttatcct ctcttagtga tgtcaaaatg cgatcagctt 3901 gacgtaactt gtctaaaaca tttcctggta gttgcgaaag aacattttca gtgagggaca 3961 taacactaaa ttttattata ttaacactcc atctttttcc ggagactgag cggtctcatg 4021 tcagtccaaa cttctttgat gtaatccaaa cattcttgtt tggaaccatt tttaccagca 4081 tccttccagc caagggcgtt ctcccggtca gcgggccaaa tggaatactg ttcttcatga 4141 ttgactacaa ccttgtaaat cgtcgtgtct tctttgtcgt cttgatacat atttatcctc 4201 caggatgaga taaaaaacaa ctcttctcaa agcttcacaa aacttcattc attatttata 4261 cttgtttagt tgcatttttg tcaaatgctc aaattttata agtatcaata ttaatacgat 4321 taaaatcact aaaatatctg tcatcaacat tccatagtat ataccagtta cagcgattaa 4381 atttgataaa attagtacta gaggaataaa tagaaataga cttcttagaa gaattataat 4441 tgtgactatt ttccctttac ctagagattg aaataaagtt gctcccaaga aaactaaggg 4501 catgattggt attaataaac taagaatccg aaagttgagg aaatcattac tagtaaaagc 4561 aacgtctggc agtagccagc ctagaaatgt gtttggagat aattgtagaa gtagccagaa 4621 gaaagttaat acagtgattc caccattcac aaaagttaaa taagcttttt tgagtctgtc 4681 ataatttcct gctccataat ttataccaat cacaggttgc aaggcttgcg taaagccatt 4741 taaaggaacg tatgtcaatg aagtcaatgt cagtgttgct ccgtagaagg caatatcatt 4801 ctgtgttccg taataagaga ttgatttaaa aaccacagaa ctctgaacta agcccataac 4861 ttgagtaagt agcgctggta 
ttcccactga taaaatagca ggcagtaagt ctactgctag 4921 aacaaacttt ttgggatgaa ctgggataga acttttacca gaaaggaaat aagttaaatt 4981 tacaatacta tgaacaacca ttgcaagaac tgtagcaagc gctattccct gaatgcccca 5041 ttgaaaaaca ctaacaaata taaaattgaa aatgatgtta acaatgacat aaattacagt 5101 aaatattgtg gctattctaa ttttgccttc tgatttgatg atttggttag aagctactgc 5161 taggatataa aatactgaac ccagcatata agtcttaaaa tattctgctc ccgcagaggc 5221 aactttacca ctccctccca tgaatagaat taattcttca ccaaagctgt agctaataat 5281 tgtgataacc aaggatataa caacactcat tactattaag ttgccaaaga ttttggactg 5341 ggttttgata tcaaaagaac caatggctcg actgagaact gaagcagaac caaccccaac 5401 taacatagca aatccagtta atatatttgt gagcggtagt gcaagtgaga tacctgcaac 5461 agcggtttca ccaatcagtt gccctgcaaa taaggcatca acaaaaatgt tcaagcctag 5521 taacagcatt cctaaaatac cgggaattga taacttgaac atgagtttaa cgagattacc 5581 ttggaggatc tcattcgtca gttgagattg tttttgggaa gtcattttaa agttttcggc 5641 ttgttgctgt tgattcaact ttgatcaaac ggctaacaca atactgtttg acagcttgta 5701 attccatctg ttttgctgag tttaatgcac ttcttacctc actacaaact tcaactctgg 5761 cttcaaccaa cgagtaaaaa cgacaatttg tcggagaact taatcagtct gttgagtctg 5821 cttcctttgt gagtgagttg ctctttttaa gtggaactac taacaagctg ttgcgatcgc 5881 ccctggcgtg gaagctgacc aagcattacc atttccaagc cactagcagg aggatagtta 5941 acacctcgcg ccttgggttt ttctactctt ttttcatcga agagtgctag ttgatagcgt 6001 gaaagaatag ttgctaaaac tgataactga taactgataa ctgataacta ttaaattgaa 6061 ccaacaacca atggctgcga ctgtccttga tgctgacgcc gaccatgcat gaccatcttt 6121 acgccacttt ttgggtaaca aatcagacct ccaaacttgg gtttctctgg tcgattatca 6181 accagcgcta attgatagcg tgaaagaatc gttgccaaca ctaacttcat ttcaaacaaa 6241 gcaaaagccc caccaataca actacgagca ccaccgccaa agggaagaaa ttcgtagttt 6301 gaatattgtc tttccagaaa gcgttctggc ttaaattcct tcggctctgg gtataaatct 6361 tcacgctgat gcgtcgaata aatattacca ataagcactg tccccggact taactcataa 6421 cccataatct caactgggga ctctaccaac cttgggaatg tgaacaactg agtcgggtaa 6481 attcgcaaaa cttcacaaca aacagcatta aggtagggca atgcaacaat gctcgtgatc 6541 tctggatttt caccaaggcg atcgagttcc 
tcaagcagtc gctcgcgaat tgtaggaaaa 6601 cgatgaatcc agtacaaaga ccaggctaga gcagtggcag aagcatcctg ggctgcgaat 6661 atcggcgata atagaaggtc gcgtacctct tcatcgctca atagttcacc tgtttcatca 6721 tgagcaaata tgagatccga gaggatgtta gcgcatgagg tatctgcttg cttacggcgt 6781 tcttgaactt cagcgtacag caaaccgaaa agttctcgcc tcaggtaaag acggtatccc 6841 cacggactcc accgacctaa atcccgttgc ccaaagggaa gttttgtggc gaactgaaac 6901 aagggagatt gtccgtattt gagcacagaa acaaatagat gcttgatttt gtcatagcgc 6961 tctccctcac gtaatcccat cacaacctct atacccaccc gcagggtgat atcttcgatg 7021 atggggtagg caatgaaagg ttttgcgatt gcctgttgac tgataatttt ttctgtgagt 7081 tcgcagatac gtcgcccaca ggcttgcatc cgctctccgt gaaaagcctg cattaagagc 7141 ttgcgccgat gtttgtgacg caagccatcg agctggagta ttccttggtt tcctgtcgtc 7201 aaggcgaaat cccgattcaa tgcaccggat gcagtgattt ctttggtgtt ggtgagaatt 7261 tgctttatcc ctgaaggatt acttacatat attattggtg tagaaccaga cattatggtg 7321 aagatgtcac catagcgttt gcaaatacca tccacatagc caaaggggtc agccaggaat 7381 tgctgaatcg ctagtctata gggggttttc ggtcctgcgg gtagattcat tggtcttctt 7441 tactcgcgtt ctctatacta cattacacag agcattatcg atagaaattc ttaactgtct 7501 ggctagctcc tgaatattcg gttcgactaa cattgagaaa tgatcccctg gaacttcaat 7561 gacatcaata ggttgactcg aacatttgcc ccaacctagt aaagggtcat cagtatgaaa 7621 ttctggactt tcaaagtcat gaataatttc ctcttttgct ctcaacagag tcattgagtg 7681 aggataagcc tgcggaacgt aatctcgcat agcttgaaca tgagctttga atagtttgta 7741 atagctcaga aaatcctgaa tttctttgtc gctaaaaatg aagttagctt ttttattcac 7801 caaatgaaat tgctcattga gtggcaaatc tcgaagttcc tcaaaaggaa gagaaaaatc 7861 tatgttacta tcagttttta tcgattccgc catacgaagt aagaattttg catcgtcatc 7921 gtctgctggt ttaatgacag tttctggtaa aatagcatcg atgacaacta ataaacttac 7981 cctttcaccc tgcctttgca attgttgtgc catttcaaaa gcaaggacac caccgtaaca 8041 atgaccacct aaaagatagg gaccattcgg ttgcacctga cgaatttctt tgaggtagcg 8101 agatgctgtc tcctccactg aaattatttc aggttcttcc tgatcaggag tatgttgcaa 8161 ggcataaaat ggatagtcct caccaagtct tctggacaag ttaaagtact gacgaacacc 8221 tccaccagcg ccatgcatac aaaagaaagg aattttagaa 
ccagaagact gaatcgttac 8281 cagatgagaa ttggaattct cacggactgg gtgactgaca atagtcgcta gtttttcaat 8341 tgtcgggttt tcaaaaagag tagacagggg aagattgtgc ccaaatcggt cgtgaatttg 8401 agccattaag cgaacagcta aaaacgagtg accacccaag tcaaagaagt tatctgttac 8461 tccaatagga ctagtattca aaagagtttc ccaagttttc accaaagcta attccgtaaa 8521 attccgggga gcaacgaaag atttcgtagc gtttggtcga ataaaatcgt cggtaggaag 8581 tgctcgctta tctactttgc cattgggtgt gagcgggaga gtgtccaata ctacgaaagc 8641 agacggtatc atgtaattcg gcagtttccg ttgcaaggag tgcaacaatt gtgacacgat 8701 gtcttgctta tccgtgacga catatgcaat cagacgcttt gagccttgag catcatcacc 8761 agcgatgacg acagattctc gcacatctgg atgttgtcca attgcagctg caatttctcc 8821 caattccacc cggaaaccac ggattttaac ttggtgatca cggcgaccta aatactcaag 8881 attgccatcc ttaagataac gcaccaagtc gccagtttta taaagcttgg ttcctgaaac 8941 aaaggggtta tcaataaacc gttcttgagt taattcagtc cgatgcaaat atccccttgc 9001 taatccatca ccaccaatat ataactctcc aacaactccc acaggcgttg gttgtaaatg 9061 gcgattcaac acatagactt gagtattgtc aataggacga ccgataggaa tactagttgc 9121 ttgttttggc aataaactcg tgtcgtaata agtaacatta gcagaaactt ccgatgatcc 9181 gtagaggttt atcagctttg caaatggcat caattggagg aaagtttgga ctaagtgaac 9241 agacagcgcc tccccactcg ttatccagag ttttagatgc gataaatttt tggtaagatg 9301 actgtggcta tctaggagta agcgcagtag tgaggggaca agtacaatcc gcgtaacttt 9361 gtgatgtgcc aaagtttcta caaacaattg tgagtctttt acaactgcat ctggaataat 9421 gacagtagga atgccctgaa gtaagggagc aaaaatttcc catactgagt ctacaaagct 9481 aatagctgtt ttttgacaac aaacttcctc ttttgtgaag ggataagttt tccataacca 9541 atgcaagcca ttgactgtac ctcgatgagt cccgagaaca cctttaggag ttccggtaga 9601 gccagaagtg tagataatat aggcgagatt atcagctttt gaaggactga taagattttc 9661 ctcactttct tcagcaattt cgtcaaattg gatgtccaag caaacagttt tagctgaaga 9721 agtcggtagc ttttctaaaa tctcctgttg ggtgattaac acctcggctt gagaatcaga 9781 aagcatgaaa ccaagacgtt ctactggata actcggatcg aggggaatgt atgcaccacc 9841 agctttgagg acagctaaaa tccctacaat catgtctatg gaacgttcca gacaaatggc 9901 gacgagggtt tcttctgtga cactttgttt ttgtaaataa tgagcaagct 
gattaactct 9961 ttggttgagt tgacgataag tcagttgttc tgtttggcta attaacgcta aagcatcagg 10021 agattgctcg acttgctgct caaataattg atgaagggag ctatcttgag gataatctgc 10081 actagtttgg ttccactcaa ataatacttg ctctcgctca agatctgtga gtggtgacaa 10141 ttcgcagatg cgctgttctg gatttgcaac aatcctttcc aataaagctt ggaaattgtt 10201 aacaaatcgg gtgattgttg catcctcaaa aatgtctgta ttgtattcca aaaaccctgt 10261 taatccctct tgggattcag acattgatag gaaaatatcc aactgtgctg taccgctgtc 10321 gaactgcaaa gtacgcaaac ttaaaccaga aacggattgt acggacattg gagtgttttg 10381 caggacaaac atcacctcat acagtggatt ccggcttaag tcacgctcag gttgtaattc 10441 ttctacaagc atctcaaaag gcaaatcctg atgtgcataa gcattgagag ttacctcacg 10501 cacttgatgc aatagttccc ggaagctggg attaccactc agattattac gcaacaccaa 10561 agtattcaca aacaaaccca gcatcccttc tagttcggct cggtttctgt ttgcaattgg 10621 agaaccaatg agaatgtctt cttgctctgt gtagtggtat agcaatatat taaatgctgt 10681 cagcagagtc atgaataaag tggcgtcttc tcgctggctc aattgattta acgcatctgt 10741 tagggttttt aaaaaggtaa aatattgctt tgcaccagta aaagtggtaa cattcggtcg 10801 cgggcggtct gtgggtaatt gtaggacagg taattcacca ctcagttgtt gcttccagta 10861 attgagttga gtttcgagta gtgtaccttg aatgcgatta cctatgtcct gcggacacgc 10921 tccgctaacg gtagagctga gcttaacgcg ctgccaaatt gcaaaatctg catactgaat 10981 aggaagttca ggtaaaccag aaggctgatt tgtggagaac gctgcatata gcattgtcaa 11041 ctctcggaag aacacaccaa cagaccatcc atccgtaata atgtggtgca tggtcaaaag 11101 gaaaacgtgc tcttgttcac ccaagcgcag taaacttgct ctgacaagag gtccagtccc 11161 taaattgaag ggttttttgg cttcctgaac tgcaaattgt ttaatctctc cttcccaatt 11221 ttgaccagat aaatgttcaa ggttgatgat gggtaaatcc caagttaatt ctggtgcaat 11281 ctcctgtact ggctctccat ttacaagtgc aaaagtcgtg cgccaaactt catgacgcct 11341 gaggatttca ttaagacttt tttcgagtgc tgtaatatta aggtgtccag ttaaatggaa 11401 agcacttgga atattataga aagaactttc gtggtaaagt tggtcaataa accagagcct 11461 ttgttgagag aaagacaagg gaatattatt ggatgtttga cgcttaggaa tagtttcagc 11521 ttgaaatttt cctcccttcc atttttctag aagggctttt ttggcaggtg ataaattaga 11581 gtattgttta tccattgtta atagtttttt 
tgataattgt taagtcataa tttgtctgac 11641 aagcatcttg cctaagtggg tagtcattct agcaggcggg acgccccact acgactcatt 11701 agccaaaaga actgcaactt cctcttcaga taattcttca atcttttcca gaagaagttg 11761 ttcaatcatt tctgcctgta atcgtggtat cattgcttgt actaaaaggt cacgcagcgg 11821 taattccact gggaactttg ctcgcaatcg agaaactaat tgagttgcaa tcagcgagtc 11881 tcctcccaat tcataaaagt tgtcgtaaat accaacttgt gcaattccaa gaacttcttg 11941 ccagatttcg gtaatttgtt tttccaactc attagtcgga tcaacatagg aattactgag 12001 gttaggtcta gaatagcttg aggatgagtt tacttggctg gaagattttg aatctggttt 12061 ggagtcgaga tgaaatgtac gttcgcacct ggctttaaga tctaccgtgg aaacaacaac 12121 ctgagttcct tctcctaaag agagtatccg tttaaatact tctacgcttt ctgtcgaggt 12181 gatggctaat tccaccccag atgcttgtcc aactgttttt tgttcttggg tggtattgag 12241 ttgtaattta tcccaattta tgacaatcca aggcaaagag ttagtttgat tatgtctctg 12301 actaaaagta tctgttaact ggttaactcc tgagtataaa gctaacccaa atcctcccaa 12361 aatggaagac aaagaataaa aaatgataca aaaatctaac tttatactct ctaagacttg 12421 ttctaataca gtcagattct gacgttgaga atcaagcagt tgttctaact ctgtctgacc 12481 aatttctgga atcgagccaa agatattttc gcgcgttctc ccagttgaat agataacccc 12541 atgaatttga ccaatattat taggtgctaa actctggtgc atttgttcat aattagttgt 12601 atctgcacgc actaccaaaa cttctgcacc caaatccccg aattcttgga gtttttggat 12661 tttgcgactc acctcatctt caggagtatg agtttctaac cattgcgagt attcatcctt 12721 ttctggaaaa gcccaatcct caataaatat gagttttgct tgtaaggttt tcgccaaata 12781 ttgagcaagg acaactccaa gactttcgag tcctccagga aacaagtaaa caccctgttt 12841 tctgagtagt gttttttctt caactgctga ctctaggtgc actgactcaa atgtttgcac 12901 ccagcggtag cgatcgcgat atgcaacaac caactcagac gaaaaagcag tgaactcatt 12961 aataatttgg tcaataatat gaccgtcaca atctcctcct cctatctctg cgcccttggc 13021 gacggacact ttgctcaacg gggggaaccc ccgcacgcaa gtgtcctcgt cttggcggtt 13081 cgttaaaaca acatcaatac tccgacaagt gatattggaa tattcttgag gaatcacctt 13141 acataaactt aacatcgttg ctttttctgg atcaagagtc tcattaccac tgacttcttg 13201 gatgttgttc gatataaccc agagttgcaa gcgatcagtg attttttgat tgttaagggc 13261 ttgcgttaaa 
aacagcaaac tgttaaattc tagatactta ccttgatgag tcccaaaatt 13321 attcacactc cataaatgag caatgttttg gggaatttta cccagtgcaa tcagttcttg 13381 gaacaacgta tcatagtcat tgcggttttg gggattaata acataaattc cctcactcaa 13441 tttagtaaac tgctgtccta ccttcaccac aatgacattt ttaccacttt gttggagtct 13501 attaactaac tccgaaccca ccccacattc atcaataaac aataaccatc gctcattccc 13561 tgtttcttcg ccagataaag aggttgaatt cggtagtaaa gaacgtttcc atgaaggaat 13621 atagaaccag tcagcaatgt cttgtctttt gtccaatgtt actggcttat tacttgaaga 13681 tgatggcgat ttagcatcaa tccagtatct ttgtcgctca aaagggtaag taggcaaagg 13741 taagcgatga cgttgctcat gagcatagaa ccctgaccaa tctatttcta caccagaaag 13801 ccacaaccga cctaatgttt gtaataaaaa gccaacatct gattgttgtt ctttgacatg 13861 gcgtaaagaa gttaaaactt gctgttttgc tcctggtttg agatgctgtg ttgttaatgt 13921 acttaatgtc cgccctggtc ccacttccag aaaaacacct tcaaactgtt gcaatagttg 13981 agagatccca tctgaaaact tcacagtttg tcgcagatgt tgaccccagt aagtaggatt 14041 tgttgcttca ttatctgtaa tccacgtacc agtgacattg gaaatgaagc gaatacgcgg 14101 tgggtttagc ttcacttttt tcacggcttg tacgaatggt tccaagattg gttccatcat 14161 tacagaatgg aacgcatgag atgtgtgcaa cagccgacat tctatttctt gtgaagataa 14221 ttggttttgg agtgtggcga tcgcctccct tgaaccggaa actacacacg aggatggact 14281 gttaattgct gctatctcta cagacgttcc atcggtgtgt agagacgttc catggaacgt 14341 ctctacattt tctatgaagg attgcacgtc tttttcccca agtggaatgg caagcatact 14401 tcctgtgggc aattgttgca tcagttgtcc cctttttgcc acaatcgcta aggcatcttc 14461 tagagagaat acacctgcaa ttgttgctgc gacatattct ccgatgctat gaccaatcat 14521 agcttgtgga tgcactcccc attccatcca caattgggcg agggcgtatt cgatgataaa 14581 tagtgctggc tgagtaattg ctgtttgttg cagtctatga agtgcagttt caatttgttc 14641 ttctttgggg aagagtatat gacggatatc aagaagaagg tggggttgaa gaatctgcgc 14701 gcaagtatct atatgttttt taaacgtggg ttcagtatca tacagttccc gccccatatt 14761 cacatattgt gccccttgtc cggaaaacat gaagatgact ggacaatggc tgggtttgtg 14821 gtggtaagtg aaaacacgtt gtggatcttg ggatgtcagc acttttactg catcctcaca 14881 atcgtgacag actaccatgc ggcgatggtc aaatgctcgg cgacctactt gtagggtatg 
14941 agctacatca ggcaggttaa tatcgggatg ttgctggagg tgagtagcta agttggctgt 15001 ggtagtttct agggctgtgg tggttttggc acagattagc aacaattgcc agggacgtga 15061 agtacttgag ggcgcaacaa ctggggcttc ttccaagata acatgggcat tagtccctcc 15121 tataccaaaa gaactgacac cagctctgtt aggagtcccg tttgttttcc actctgataa 15181 tttggtattg acgtagaagg gactgttggc gaaatctatc tcaggattgg gttgctcata 15241 gtgcagactg gggggtattt gtttatgttt tagtgcttgt attgttttaa ttaacccagt 15301 tacacccgct gctgtgttca aatgtccaac atttgtcttg actgagccga ttccacaaaa 15361 gccctttttg tttgtgctag cacgaaaagc ctgtgtaaga gccttgattt caatgggatc 15421 tcccaaagga gttccggtac catgtgtttc tacatatgta atagtctcag gttcaactcc 15481 agctaaggca agcgcctcta gaataacctc cctttgacca tccacgctag gagctgtata 15541 accaaccttt aaagaaccat cgttattgat agctgaacct ttaattattg catgaataaa 15601 gtcgccgtct gcaagagcat cctctagtcg ctttaataca acaacaccca aaccgttgcc 15661 gaaaatagtt cctctagctt ttgcatcaaa agctcgacaa tgtccatcag gagaattaat 15721 tcctccttct tgatatcgat aaccactttt ttgtgggact tgtatcgaaa cgccaccagc 15781 aagtacaata tcgctttcgc catttagcaa actttggcag gctaagtgga ctgcaactaa 15841 tgatgtagaa caggtagtct gaacattaat gcttggcccc ttaagattta atttataaga 15901 aatttgtgta ggcaggtggt ctttatcatt gccaataaaa atttggaaac cgtctactga 15961 ttctatcaag ccacggttag aataaatgtt tgaaagaagg taagaactat aagtgacacc 16021 agcgaaaaga cctatttgac cagtgtaagt ttgggagtcg tagccagcat tttctagagc 16081 tgaccaaaca cattctagaa agagacgatg ttgtgggtct gttatttcag cttctttagg 16141 agaaaagcca aaaaatgaag catcaaataa ttctatatct tctataagac tacctgcttt 16201 tacatacaga gggtcattaa ccactgtaga gtctattccg gcagatacta gttcctcatt 16261 agtgaaaaaa gaaatagact ctacaccatc tcggatattt cgccaaaaac tatcaacgtt 16321 tttagaccca ggaaaactac ctgccatgcc aatgatagct atttcgtctt tatcatttaa 16381 gttattcaaa ttatctaaca cattcataac taaaaacctt tcaaggttat ttttttactt 16441 gcttatagaa aagtttcaat tgaaatttct gaagcaagct gaatctgttc gtttaaaaca 16501 gcgttgatat catcttccgt agtcagtggg ttggcaatga ctgcacgcag gacaataata 16561 ggaatttctt tcccaaagct cgtagttgtt tttgtcgtcc 
gtgagataaa agtacgacca 16621 atttggcgct ggattttctg taggcgttca ttacattcat taatgagttg attgtcaatt 16681 tctgtcaact gctttttaac aacaagctct cttaagtgtt ctggaatata tcgatagaga 16741 agcaagttag tatcaggttc tgctaacaac tcaaactcag gcatggtgca gatacgatct 16801 gccatatatt gagttttccg aattccttca tcaattaaaa attcatatcc cttaagtcca 16861 atcagattta gcccagcatg taaaaataaa gccataccag gtcgagaacc ttctaaagca 16921 cgtttaccta aatcaaatga ccccttacgc atggtgtagc tcgcttgctt ttcgatggct 16981 tttgccatgt acggatcgcg gaaaaagacc agaccaatgc ccattggcag gtatagttgt 17041 ttatgtccgt caatggtcac tgaatcagct tgttcgatac cagcaagctt atgccgatgt 17101 tgttgggaaa aaatgagtgg tccaccccaa gcagcatcta cgtgaaaatg aacattcgcg 17161 gattgtgcaa tctctgcaat ttctgcaagg gaatccactc cacctgagtc tgtggtgcca 17221 gcaacgccta caattgcaat tataagcagg ttttgggcac gacactgggc aacagcttgg 17281 cgtagcgcct gtatatcaac tcgattgtta cagtcagcgg ggactctaat caaacgatgt 17341 gtaccaattc ccatcaaatc cgcagcttta tcaaaagaga aatgcatcaa ttcagagcca 17401 ataatcgctg ctcctttgta accgtagaaa tctagggctg cggttaagcc ttctttttct 17461 ataccaagaa aatcatcctt tggtcccaaa gcagtattcc tcgcacacca aagcgccgtg 17521 atatttgcca ctgtcccgcc agaaaccaag attcctagtg tacttagatt attttgaata 17581 tgctgagcgt agaaattatc atccaagtta taaatcagtc gatgcaacat tgccaaagct 17641 tcacgttcac agaagcttaa ggctttagct gtctctattt tgacagcgtt ttggttcatt 17701 gccgtcatga gtttggcaag ggggcgcaca aaacaaggaa gcgccgaggt catgtgaccg 17761 ataaatcgtg gtgaagatgt gtgtattgag tgagcaacaa tattgttgtc taggtatttg 17821 agatagctct ccaaagaagc aggttgaaca ggtatttgac tatcagaaaa gttacttaat 17881 aagtagtcta aatcaatgtg ggtgttggca tcaagggtgc ttaaaaaatc ttgagaaaac 17941 ttattagtta tttcatcgat ttgtgactca atagatgtca catgactaga tgaagcaaac 18001 agttgcatca cttgttcttc aatctcataa ttcaaagctt gctcagtctc aaaaaaagaa 18061 cttttctttg gcatttcttg tttacttaat gtcatcttgt gttcttgtaa taatttggtt 18121 ttttactcta ttgccaatgc tcgtttaatt cttacagttt atttgcctta ttgggatgaa 18181 acaccttttt gctttgttcc tgttcgatgt tcaatccttg cttgttttcg gcgctgtact 18241 gaggcggttc tgctttcagg 
gcgatgagaa tgttgtgtaa aagactcttg ttcgcttgac 18301 tcctgaccaa gatatctagt taggtggctt atggttggat attgaaacat ttcaaccaag 18361 gggaaatctc gctgaaatat tttttgcaac ttgctatgaa cctgaaccaa aagtaatgaa 18421 tgaccaccaa gttcaaagaa attatcatgg attcctacat tttcaatctg aagtaattct 18481 tgcaaaattc tcgcaatagt ttgttcgact tcggtttggg gttgctgata aaccgcttct 18541 aattcgggac gtgaatcgtc aggtgcaggc agtgctttac gatcaacctt accattaggt 18601 gtgagtggta gtgcttctag catgacaaaa gctgctggca ccatgtagtt tggcagcttc 18661 gactccaaga agtctcgcag ttcagtcact gtcagtgttt gccctgagtg agagacgaca 18721 taagcgacta ttcgttgaga atcagcttca tcagaacgaa caatgactac attttctcgt 18781 actttggggt gttgactcag atgtgcttca atttctccaa gttcgatgcg gaaaccgcga 18841 actttgactt ggtggtcgat acgtcctaag tattcaatct ctccgctagg tgaataacgt 18901 gctaggtctc ctgttttgta taaacgtgct gctgaaaaat cgctaaaggg attggggatg 18961 aatttttctt tagtcagttc cgggcggttt aagtagccac gtgctagtcc atccccacct 19021 atgtgcagtt ccccaggtac tccaactcca acaggttgtc tgtttgcatc cagtatatag 19081 atctgggtat tggcgatcgc atgaccaatt gaaacgcatt tgggcaaagg tgcattcgga 19141 ctgagactat agcaagtgac atccgcagat acttccgacg aaccgtaaag gtttaacaaa 19201 gtactatccg gcaaactctg ccgaaattgt tgtagcagat caattgagag tgcttcacca 19261 ctgctgaccc acaacttcaa ttggggtagc cgtgactgta ggtcagtgta tgtattcaac 19321 agtatacgta ataaggaggg aacaagtact agccgtgtca cattgttgtg agcaagagtc 19381 actacaaact gttgtggatt tttgacaacc tcatcgggca caatgactgt gggtacgcct 19441 tggagtaaag gtccaaaaat ttcccacact gaatccacaa agtttaagga tgttttctgg 19501 caacacattt ctccctgcac gaaaggataa ttctgccaca tccagtgaaa gcgattgatt 19561 gcacccctat gcagtccaaa aactccctta ggctgacctg ttgagccaga ggtgtagatg 19621 acataagcta ggttatcgac ttttgaaata ttggtggggt tatcttggga atatttgcta 19681 ataatatccc agtcggagtc caagcaaact acttgtgctt tatgttcagg aagtctctcc 19741 accagcttct gctgagttag tagtactgac acttgagaat ccgacaacat gaatgctaga 19801 cgttcttgtg ggtatgctgg atcaagaggt acataagcgc cacccgcttt gagaatgcct 19861 aaaagtccga tgatcatgtg gagcgatcgc gaagcggctc ctcctgcgtt tcgctccaca 19921 
caaattccca ccagtacctc tggtcccact ccctggtttt gcaggtagtg tgccagttga 19981 ttggctctgg cattcagttc tcggtaggtt agttgctggt cttcaaacac tactgctaca 20041 gcgttgggcg atcgctctac ttgatcttca aacaactgat ggatgcattg ctgttgggga 20101 tactcaaccg aggtatcatt ccactccacc agtagagtct gccgctctag ctctgtcaac 20161 agtggtaact ccgacaggcg ctgctgtgga ttggcaacaa tcccctcaag caacgtttgc 20221 aaatgcccag ccatccggct tacagtgctt tgctcgaata agtcaatatt atactccaga 20281 gtgccaacga gtccgtgtcc tgtttctgtc atgtacaagg ttaaatcaaa cttggctcca 20341 ccactgtcac tttctagagg acttaaagtt aaaccaggta attctaaagg cgacattggt 20401 gcattctgaa gcacaaacat gacttggaaa agcggtgtat gactcaaaga acgctgtggt 20461 tgcagttttt ctaccagttg ctcaaaaggc aaatcttgat gagcatacgc tcctagtgcc 20521 atctcccgca cacgagttag tagctcctca aaagtgggat tacctgccaa gttagttcgc 20581 agtaccaaag tattgacaaa aaatccaatt aacccttcta tttccgcccg gttgcggttg 20641 gcgatgggcg aaccaagtac aatatcttct tgccctgtgt aacgccatag caaggtcaca 20701 aaagctgcta acagggtcat aaataaagta cttccctgct gctgactcaa tttattgata 20761 cttacagata gctcttgaga tagctcaaat gaataggtag cacccgcaaa ggtcataacg 20821 gcgggtcggg gatggtcagt aggtaactcc agtactgatg gtgcgccttc taactgctta 20881 caccaatacg atatctggga ttttaacact tctcccacaa gccattgtcg ttgccaagcc 20941 gcaaaatcta cgtattggat tgccagttcg agcaatggta agggttgctc ggtacaaaag 21001 gcttgataaa gtgttgctac ctcacgcacc agtacgcctt ttgaccaacc gtcagaggca 21061 atgtggtgca ttgtcagtag tactatgtgc tcttgctcac caaggcgcag tagtttcact 21121 cgcaacagaa ggtcgttgtt gaggtcaaaa ggtcgttgag cttctttatc tgccagtttt 21181 ctgacttcgg cttgttgttg gtttgaaggt aactcagtga tgtcaaaaat gggtaacggt 21241 agcgatgtga ctgatgagat aattgctacg ggttgccctt ctattgtctg gaaattggtt 21301 cgcaacgctt cgtggcggcg taaaatttcg ttgaagcttt gttgtagtgc ttttaaattc 21361 aattgtccct gtaggcgaac agcaccaggg atgttgtaga aggggctgtt tggttctaac 21421 tgtgtcagga accataaccg ttgctgggcg aatgataggg gtagcttttg tgagcgtgaa 21481 atacgctcaa tgtctgctgt ttcgagtcct aaacctgctt tggtcgcttt ttcaatgtcc 21541 ttggctagcc cagctattgt nnnnnnnnnn caaataaacg acgtaaagga 
agctcttgct 21601 gaaaaacttg tcgtacttgg gaaatcacgc gagttgcaag cagcgagtgt cctcctaagg 21661 aaaagaagtt attgtgaatg cctactttct caatactaag cacctctgcc caaattaagg 21721 ctagcatctc ctcaattggt gtggaaggag caacaaaatt tgattcggat aatagctgtg 21781 tgacttcggg tgcaggcaat gctttacgat caaccttacc attaggtgtg agtggtagag 21841 attctagcat gacaaaagct gctggcacca tgtagtttgg cagcttcgat tccagaaagt 21901 cccgtaattc agtaatcgtc agtgtttgct ctttttgagg aacgatataa gcgactattc 21961 gttgagaatc tactgactct tcacgaacga caactacact ttcttgcact ttggaatgtt 22021 ggcgaacaag tgcttcaatt tctccaagtt cgatgcggaa accgcgaact t // LOCUS NODE_1434_length_22058_cov_5.53220022058 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 22058) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 22058) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..22058 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(83..655) /locus_tag="DP116_12865" CDS complement(83..655) /locus_tag="DP116_12865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407134.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3727 domain-containing protein" /protein_id="PRJNA477356:DP116_12865" /translation="MFSSSFPEENDQTRTGSITLTDDKGRTLECYIEHSLTVDAQEYV LLLPVDSPIEIFAWQGDEEEEEAILVEDETIIDQIFGIAQAVLSEQNLLLKHTAYALT VAGELPPVEESELFTLEIEDEEADLEPEQLQLLSSFYHEEQEYAIYTPLDPLLFFARI SQAGKPELLSPEEFRQVQPLLEEHLFNEVE" gene complement(790..1350) /locus_tag="DP116_12870" CDS complement(790..1350) /locus_tag="DP116_12870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997251.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Holliday junction resolvase RuvX" /protein_id="PRJNA477356:DP116_12870" /translation="MRGQPDKGTQGQYLPRSPASAVPLSSPTPPTDEKNSRQFVSALG LDVGRKRIGVAGCDRTGLIATGITTIERTSFDRDVEQIRQLVNQRQVQVLIVGLPYSM DGSLGFQARQVQKFTTRLAKVLKLPVEYVDERLTSFQAEQMLIAENRSPSRHKQLIDR KAASIILQQWLDTRRSCLKSTLVSAD" gene complement(1390..2649) /locus_tag="DP116_12875" CDS complement(1390..2649) /locus_tag="DP116_12875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_12875" /translation="MAARMKMTSLLQRNLSVVIRPVQYRDLDGIERLTQESFAAQTPK GACDAMRQMQWLRGWYGLLKCLSWFPNPLRYRFCSYVAEQGRMLLGMIQISPFNRTHS TWRVDRVMIERAIDKQAVGSQLLRYCFESILEARTWLLEVNVNDKDALALYRKNGFQR LAEMTYWEIEPELLEELAKSEPDLPNLLPVSNADAQLLYQLDTASMPPLVRQVFDRNA DDFKTSLFGALTEAVKQWVTKIEVVSGYVFEPQRKAAIGYFQLRLDRKGQQPHVATLT VNPAYTWLYPELLSQLARIAQDFPPQALQVISADYHPEREEYLERMGASRIEHTLMMS RSVWHKLRESKFVSLEGIQLPEMLQGLQPARKPIPGGMSWAQQGQPVSPDRKAQPNTT KETVVFSCNNCNVETSSQQESADARQE" gene complement(2848..3048) /locus_tag="DP116_12880" CDS complement(2848..3048) /locus_tag="DP116_12880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651616.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12880" /translation="MESNLLTSTNAVRDDFSEYVAHLQLHMALQARNLVPPLKQSLED SREQLLHQTQAHFEKLVSRRAI" gene 3735..5873 /locus_tag="DP116_12885" CDS 3735..5873 /locus_tag="DP116_12885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313738.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_12885" /translation="MPIYHFFNETVEQRVNSRTVGRGVKLQASYAKLALSQRIIMPFL LVFFSIMVLLIISFAFWFSHKMEQQITSSVEKTASIVLQELYREKQHLSSWVQLMADR DDVRLALKQANTLALIKLFVPQKTTLELDFLKIVNQNGRVLLDLGQRQLGNSTLEDKR SFSQALSGLYLSDVVNFSTKEGQTQSVLVGLAPVKSKEEVIGAIEIGILIKQELFQHL ETTNSEHIVGFNIDKTAISDNEDLICVYASTLPAACETHWQLPPAFRPPQRLIIAGED YLAKRVTVSGLSNSYLTIVLLKSLFTLNNTLQFLWLGLWSFFVLSALITIFVGRNIAR KISDPVLAIAKFAQKVTMESNFDLRSPVMTHDEVGILATSLNSLISRIAEYTQLLQLA RQTLERRVQERTQQLLQKNQELNQAYEQLSQALNELQQTQAQLIQTEKMSSLGNMVAG VAHEINNPINFVYGNLTYAKQYTEELLKILFFYQREYPQPSAVIQNQLSEIDIDFISE DLPKLMSSMQMGAQRVHDIVLSLRNFSRLDEAEMKAVNIHEGIDSTLLILNHRTKEKI KVIKEYGNLPLVECYPAQLNQVFMNILSNAIDALEERMGNRKSAIVKETLPTPTIRIY TEIIFAASGKDSVPCVCIRITDNGCGIQSNFKDKIFEPFFTTKTVGKGTGLGLWICYQ IIQKHQGKIEVNSNPVQGTTFLITLPLSQL" gene complement(5985..7571) /locus_tag="DP116_12890" CDS complement(5985..7571) /locus_tag="DP116_12890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="B12-binding domain-containing radical SAM protein" /protein_id="PRJNA477356:DP116_12890" /translation="MRILLVYPIFPKTFWSYEKILALVDRKVLLPPLGLVTVAAILPQ EWEFKLVDRNIRPVTEEEWAWADVVVFSAMIVQKQDLLEQIREAKQRGKLVAVGGPYP TSVPGGVQEAGADFLILDEGEITLPMFVEAIKRGETSGTFRATEKPDVTNTPIPRFEL LDFDAYDMMSIQFSRGCPFQCEFCDIIVLYGRKPRTKTPAQLLAELDYLYKLGWRRGI FMVDDNFIGNKRNVKLLLKELKVWMAEHKYPFRFDTEASIDLAQDPELMELMVECGFA AVFLGIETPDEDSLQLTKKFQNTRSSLTEAVQTIIKAGLRPMAGFIIGFDGEKSGAGD RIVRFAEQAAIPSTTFAMLQALPNTALWHRLNKEGRLRENKDGNINQTTLMNFIPTRS LEDIAREYIEAFSYLYDPVCYLDRTYRCFLIMGAPSWKAPFKMPEWVVVKALLLIIWR QGIKRETRWKFWHHLFSILKRNPGVAEHYLATCAHNEHFLEYRQIVREQIESQLAEYL AQGVEKPYEIVNQTVAAVAS" gene complement(7790..8644) /locus_tag="DP116_12895" CDS complement(7790..8644) /locus_tag="DP116_12895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015176237.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4231 domain-containing protein" /protein_id="PRJNA477356:DP116_12895" /translation="MTTIEQKNQTEISDTQEQSSSSSSQKMLFTLKVFEYFFLAAFIG SGVITLVFPDNPRVILTGSISLAIFVFLFLINQQVFRVSSNAASKLELQKKAELYLMN SNNHSTKNPIAPARENALQYCQELIDDYKKTRNLSRNLYYGLQMSTIVFSGVTPILVL VDKLEPGQGFLKWLPVIFPAIASIVASVVTSFPFQENWISANTTVESLEAEQEKFVLG ITPSYRCYDSADESEQQQKAKEAIENFVIQVNNIHLNQLQAVGEGQKKEEKTQPADQS SQSNSQQS" gene 8732..8941 /locus_tag="DP116_12900" CDS 8732..8941 /locus_tag="DP116_12900" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12900" /translation="MQIREYLKSLMSDFHLGQSLPKIHAKPILQTIPASACAYGNKDC NAGLTRLPHHRKWLTILAQGDFSSA" gene 10156..10998 /locus_tag="DP116_12905" CDS 10156..10998 /locus_tag="DP116_12905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407814.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12905" /translation="MVNIGYHTAKFQGQFIRSLYRDVVVKLVNLPIKQNRKVPINVYS LSCERDLPEQVVSIRSFIRHVGIPDTFTVISDGSYTDDSCRLLRRIHPCVEVVQLKKL LRNDLPQCVSEYAQQHAMGRKLSALMSIPVNGPTIYTDSDILFFPGGIDLVQLATSDD KYTRYLLDCSNSLDQRIIVNDSEKLNPINAGFILFKHELDWNFAIERLANIEGLPSYF TEQTVVHLTIHHNHGIPLCSQRYVLNVEDQFIYPDKFASKEIALRHYVRDVRHKFWFN LALA" gene 11152..13242 /locus_tag="DP116_12910" CDS 11152..13242 /locus_tag="DP116_12910" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12910" /translation="MVSVAVSVRSLIFTKNEYINTMQISSFDVFDTVLTRVVGTPKAF FLLLGKQLAGQSLINCTPEAFVHARTTAEFRAHSNVGEKYSLHQIYTEVAIALRLTDE QREKIMHIECALESELIRPIPIARDLIQTARKQNKRVVFVSDMYLPAEFIKEQLVRHS FWVDGDELYVSYEYGKSKATGELFRELLNREGVSPAEVSHYGNDLRIDVQGAKKVGLK AQHFSEGNLNRYEQKLESHSYATEGLSSAMAGASRLVRLQVPVSSSKEEALRDVAAGV VAPTLVGYVLWILQQAQLMGLKRLYFVSRDGQVLLEIARRLVGKLNFSCELRYIYGSR LSWNLPAVVSLDPQQALEMLKRPSWILDGTSTLSIRDFLARVSIAPQEIRDSLAAIGF KEEDWSRILSPQEVQALHPVLDKPEVSELILQKAVQKQQVLMKYLDQEGVLDSIPKGL VDVGWFGSSYDSLAALVNANGATLDVGFFYGLKSNSKGNQSDSKKGYFFDQRTRTGFK DVLPELGIVPLEMFCSADHGTVLGFMEEGDQVRPVFKEEHNQRIIDWGLPLVRKAVYS FTENLLLDPNLVNPWADVRQASADVLQSFWLSPSYTEAKAWGDFPWEAGHSENTNSLA QSYSWINVAKSFLTARFAYNQGLWFEGSVAQSSLPVQKGIQGFRRYRRLLLSIKSKVL TPRLKLVKRSLQTP" gene complement(13599..14558) /locus_tag="DP116_12915" CDS complement(13599..14558) /locus_tag="DP116_12915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316161.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_12915" /translation="MTQKRILVTGASGCIGHYISEALIQETEHELYLLVRNPKKLQVD TQARPGVTVLQGDMQEIEQFADLLKTIDVAVLTATAWGGEKTFDINVHKTHELLNLLD PDKCEQVIYFSTASVLDRNNQPLKEAGELGTDYIRSKYDCLHKISQLAIAPKVTSVFP TLVLGGDRNKPYSHITSGIPEVTKYINLIRYLKADGSFHFIHGQDIATVIRYLIENPP KKEETRSIVLGEEKLTVNQAIEEVCAYLGKKIYFRIPLSLSLANLIIVLFRIQMAAWD RFSMNYRHFTYQNVVNPASLGLPNYCATMSDILKISGVKPFIK" gene 14646..15020 /locus_tag="DP116_12920" /pseudo CDS 14646..15020 /locus_tag="DP116_12920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410846.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(15217..16272) /locus_tag="DP116_12925" CDS complement(15217..16272) /locus_tag="DP116_12925" /EC_number="4.1.1.37" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874895.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uroporphyrinogen decarboxylase" /protein_id="PRJNA477356:DP116_12925" /translation="MGISSNAPYLLRAARGEVLDRPPVWMMRQAGRYMKAYRDLREKY PSFRDRSEIPEVAIEISLQPWKAFQPDGVILFSDIVTPLPGLGIDMDIAEGKGPIIHS PIRTSEQIDNLHTLEPEESLPFIKTILQALRQEVGNESTVLGFVGAPWTLAAYAVEGK GSKTYSVIKNMAFSNPTLLDKLLTKFADAIAIYVRYQIDCGAQVVQMFDSWAGQLSPQ DYETFALPYQQRVFQQVRETHPDTPLILLVSGSAGLLERMAISGADVITVDWTVDMAD ARERLGKHLKVQGNLDPGVLFGSKEFIRDRIYDTVRKAGNKGHILNLGHGVLPETPEE NVAFFFETAKQLSSAVV" gene complement(16765..17079) /locus_tag="DP116_12930" CDS complement(16765..17079) /locus_tag="DP116_12930" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12930" /translation="MYMNNFRRFIVLMGLALTLLVGSFTAPAFAQETLAQKDAPTTAP IPVTQPVFDPPPPPDPTDKNECWSGRNCEGRILNNRDAHNCKNSGGKSWRSRLTGDCT NL" gene complement(17644..18567) /locus_tag="DP116_12935" CDS complement(17644..18567) /locus_tag="DP116_12935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LD-carboxypeptidase" /protein_id="PRJNA477356:DP116_12935" /translation="MQSKTQNLKSKLVPPPLKPGDLLRVISPSGALREFEAFQKGVEI WRSRGYRVEIFPEIEDKWGYLAGKDEFRREQLATAWQEPECRGVLCTRGGFGSTRILE NWTWQNLENSAHPLWLIGFSDITALLWSLYTAGISGVHGSVLTTLACEPDWSIQRLFD CVEGRPLAPLKGCGWGGGVVNGILLPGNLTVATHLLGTPMLPDMDGVVLALEDVTEVP YRIDRMLTQWRMSGVLSKVRGIALGSFSRCEAPANVPSFSVEEVLRDRLGDLGIPIVS NLPFGHEAPNAALPVGVPVQLDADKGILNIL" gene complement(18961..20127) /locus_tag="DP116_12940" CDS complement(18961..20127) /locus_tag="DP116_12940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315703.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_12940" /translation="MRRRYSSIIFIACLACLGFDQRPASAVKPEIVFKNLELAQASSK DAAKQSILRLGSTGVEVKSLQTLLKKLGYYDGEIDGRYGISTSRAVTKFQQAKGLSVD GIFGDATRQSLQTAINKKLPPSSIAISSTTEPKTKKETETDIVWWSLVGTGVLGSIGA LLYIVKKIEKGTKVVKYPENYSQLNTSSVPQTIEKFHTKNDDEEFHNTSVISTQETET TIPPSTEFLPMETTSRLAKVDIIDQLISDLRSPDITQRRKAIWDLGQKGDSRAIQPLV DLMVDADSQQRSLILAALAEINTRTLKPMNRALAISLQDESPQVRQNAIRDLTRVYDM MAQMSQILSHAVRDPDADVRSTAKYALSQMNRIRALPGDASEEGKDEEEEDKNF" gene complement(20333..21592) /locus_tag="DP116_12945" CDS complement(20333..21592) /locus_tag="DP116_12945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017658650.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_12945" /translation="MLTLNYRYRIYPNAAQEQILIEWMELCRGAYNYALREIKDWCES RKCLIDRCSLEKEYILPADLKFPCVVKQLNNLPKAKENFPKLREVPSQVLQQTIKQLH RGWEYFQERGFGFPRFKKYGHLKSLLFPQFKESPVTDVHLKLPKIGIIPINLHRPIPN GFVVKQVRVLRKANQWYASINIQCDVNVSSPMPHGYPIGVDVGLEKFLATSDGVLVKP PKFFKTMQSKLKLLQRRLSRTQKRSKNYEKQRMKVARMHHTIDNSRKDFHFKQAHALC DTGDMVFMEDLDYSKMAKGMLGKHMLDAGFGQFRTIVKYVCWKRGKFFAQVDSRGTSQ ECPGCGAVAKKDLKTRVHHCFNCGYTTDRDVASGQVIRNRGIASISTPGLGGMETACA ADLPGTKISLSRQVAKSRKGKTRKSSE" BASE COUNT 6303 a 4695 c 4629 g 6431 t ORIGIN 1 taaaagatga taatgaaaaa tccctcatta aaaattaaaa atttttgatt cttttaattt 61 ttaatgaggg atttttttta tcttactcta cttcattaaa caaatgttct tccaacagag 121 gttggacttg acgaaactcc tctggtgaga gcaattctgg tttaccagct tgagatattc 181 gtgcgaaaaa cagcagtgga tcgaggggag tgtaaattgc gtactcttgt tcttcgtggt 241 agaagctcga tagcagttgt aattgctctg gctctaaatc tgcttcttcg tcttcaattt 301 ctagagtaaa aagttctgac tcttccactg gtggtaattc acctgcaaca gtcagagcat 361 aagctgtatg cttcaaaagg agattttgct cagaaagaac agcctgggct atgccaaata 421 tttggtcgat tatcgtttca tcttccacta agatggcttc ttcttcctct tcatcacctt 481 gccaagcaaa aatttctatt ggtgagtcaa cgggtagaag caaaacgtac tcttgagcat 541 ctacggtgag tgaatgctca atgtaacatt cgagggttct ccctttgtca tcagtcaaag 601 tgatggaacc agtacgagtc tgatcatttt cttcaggaaa ggatgaggaa aacatagtgg 661 agatatcgac tttttactaa aaactaccag caactgttat aatacccgaa ttctttcaac 721 agatattatt aatcaaacta ggggtaaaac tgattaataa atttcagaat atcatgttaa 781 aagcttgcat caatcagcag aaactagtgt gcttttcaaa caagagcgcc ttgtatctag 841 ccattgttgt aagatgatgg atgctgcttt acggtcaata agttgtttgt gacgagaggg 901 ggagcgattc tctgctatga gcatctgctc tgcctgaaat gaagttaatc gttcatcaac 961 atattccaca ggcagtttga gaaccttggc aagtctggtc gtaaatttct gcacttgacg 1021 agcctgaaat cctaaagaac catccataga atagggtaac ccaacaatta ggacttgcac 1081 ttggcgctga ttgacgagtt gtcgtatttg ctccacatcc cgatcaaacg atgtgcgctc 1141 tatagtagtg ataccagtgg caatcaaacc agtgcgatcg caccctgcta caccaatacg 1201 tttgcgccct 
acatctaatc ctaaagcaga aacgaactgc cgtgagttct tttcatcagt 1261 cggtggagtt ggggaggaga gagggacagc ggaagcaggg gagcggggga gatattgtcc 1321 ttgtgtccct ttgtctggtt gtccccttat atccttctcc ttgtgtcctt gttcaaaact 1381 cactgcgaat tactcctgcc gtgcatctgc cgattcttgc tggctagatg tttctacgtt 1441 acagttattg cacgaaaaga caacagtctc ttttgtcgta ttgggttgtg cttttctgtc 1501 tggcgacact ggctgtcctt gttgtgccca tgacatacca ccaggaatcg gtttacgtgc 1561 aggttgtaat ccttgcaaca tctctggtaa ctgaattcct tctaaggaga caaattttga 1621 ctcccgcagt ttatgccaca cagagcgaga catcatcaag gtgtgttcta tacgacttgc 1681 acccattctc tccaaatact cttctctttc tgggtgatag tctgctgaga tgacttgtag 1741 agcttgtgga ggaaaatcct gtgctattcg agctagttga gacagtaact ctggatacaa 1801 ccaggtataa gctgggttaa ctgtcaaggt tgccacatgg ggctgttgac ctttacgatc 1861 cagtcgcagc tgaaaatagc cgatcgcggc tttgcgttgg ggttcaaaaa cgtaaccact 1921 cacaacttct attttcgtca cccactgctt aactgcttct gttaaagcgc caaacagact 1981 tgttttaaag tcatcagcgt tacggtcaaa cacttgtctt actagaggtg gcattgatgc 2041 cgtatcgagt tgatacagta gctgggcatc agcattactt acaggcagca aattgggtaa 2101 gtctggttct gattttgcca attcttcaag caactctggt tctatttccc aatatgtcat 2161 ttctgccaag cgctgaaatc cgtttttgcg ataaagcgcc aatgcgtctt tgtcattgac 2221 attcacttct aaaagccaag tacgagcttc caaaattgat tcaaagcaat agcgcaaaag 2281 ctgcgagcca actgcttgct tatctatcgc acgctctatc atcacgcggt caactctcca 2341 ggtgctgtgc gtgcgattga acggcgatat ttgaatcatt cccagaagca tacgcccttg 2401 ttctgctacg tatgaacaaa aacgataccg taatggattg ggaaaccagc tcaaacactt 2461 gagtaatcca taccagccgc gcagccattg catttgccgc attgcatcac aagctccctt 2521 gggggtttgg gctgcgaatg actcttgagt taggcgttca attccgtcca agtcccggta 2581 ttggactggt cggataacaa cgctaagatt tcgctgaagt aatgaggtca ttttcattcg 2641 agccgccatt cagcttagag taactaggtc aaaggaactg ctgcaataat cttaacggct 2701 ttgtgctact ttctattaaa ccgaaagggt gattttcgta actttaagat attaaaaagc 2761 agatttgtag cagaacagca ttagccttgt cttaacaaaa tctgctttca gcttacttta 2821 gattaacatt aatttaacat tttggaacta gatggctcta cgagacacta gtttttcaaa 2881 atgagcttgg gtttgatgga 
gtaattgctc ccggctgtct tccaaagact gcttcagggg 2941 tggaaccaga ttgcgagcct gcagagccat gtgaagctgt agatgggcaa catattcact 3001 aaaatcatca cgcactgcat ttgtgctcgt taataaattt gattccattg ccactgtgtt 3061 ctccttcaat ttttatagtt gttttacatt acaaaatatt tatcatgtta ttttgcctat 3121 cttcttactt cgcaatgaag tttaatgtat ggcaggaaga tttcagcgct tggcacatag 3181 cgcgtaaata agtatccacc cgtcacaaaa gaagcaccag gagacggagt tgggagagag 3241 ggagaaaccc cttggaggtc tcctcgttga gtgcaagtgg cgtggaaacc accaagacga 3301 gccagcactg cggacgggtt tcccgccgca ggtgactggc gtcctcggcg actacccacc 3361 gcaactgcct caccattaat aggggtcgga aatccaaatg tgaataaatc tggacaagaa 3421 tttatcactc ccaaaaattc cgaacccttc ttatgggaat gtgtactaag gaactgcgca 3481 cgaataagtt acccgttctc atacccaaaa gccttgataa acaaaggtta ttttttctat 3541 aacagataag ttatttctgc gcgactccgt aagtcatgag tagtaatgac tgatgatcta 3601 ctagacggtg ggcggcggtt catgaggaca gcacaatggc tgggggatct cccaacacag 3661 gtgagtgtcc cgcgtgtgaa agggtctttc ttgccatctt tggtaactgt tgctgattta 3721 ttcttataca aacaatgcca atttaccact tcttcaacga aacagtagag caacgggtca 3781 actcaagaac tgtgggacga ggtgtgaagt tacaagccag ttatgcaaaa ctagctttaa 3841 gccagagaat catcatgcca tttcttctgg tgttctttag catcatggtg ctattgatca 3901 tcagctttgc tttctggttt agccacaaga tggaacaaca gataacctcc tctgtagaga 3961 aaactgcttc aatagttttg caggaattgt atagagaaaa acaacactta agttcttggg 4021 tacagctgat ggcagatcgg gatgatgtgc gtcttgcact caagcaggca aatactttag 4081 cacttataaa gttgtttgta ccacaaaaaa cgactttaga acttgatttc ctcaagattg 4141 ttaaccaaaa tggtcgcgtt ctgttggatt taggacagcg acaattagga aactcaaccc 4201 tggaagacaa aagaagcttt tcccaagcac tgagtggttt gtatctttcg gatgtcgtca 4261 acttttctac aaaagaagga caaacgcagt cagttcttgt tgggttagca cctgttaagt 4321 ccaaagaaga agtgattggt gcaattgaaa ttggtattct tatcaaacaa gaattgtttc 4381 aacatctaga gactacgaac agcgaacata tagtaggatt caacatagac aaaacggcaa 4441 tatctgacaa cgaagatctg atatgtgttt atgcttctac gttaccagca gcttgcgaaa 4501 ctcattggca actaccccca gctttcagac cgccacaacg tctcatcatt gcgggtgaag 4561 attacttagc taaaagggta actgtttcag 
gattgagcaa ttcttatctt actattgtac 4621 tgctgaaatc actttttact ttgaacaata cactacagtt tttgtggtta ggactatgga 4681 gtttttttgt gttgtcagct ttgatcacaa tttttgtagg taggaacata gcgcgaaaaa 4741 tttctgaccc tgtgttagct atagctaaat tcgcgcaaaa agtcacaatg gaatcgaatt 4801 ttgatttgcg atctcctgtt atgactcatg atgaagtggg gatattagcc acttctctca 4861 acagtcttat tagcagaata gcagaatata ctcaactatt acagttagca cggcaaacat 4921 tagaaagacg agtccaagaa cggactcagc aacttttgca aaaaaatcag gaattgaatc 4981 aagcttacga acaactcagc caagctttaa acgaacttca gcaaactcaa gcacaattaa 5041 ttcagactga aaaaatgtct tcgctgggta atatggtggc aggagttgcc catgaaatca 5101 ataatcctat caattttgtc tacggcaatt taacttatgc taaacagtac acagaagaac 5161 ttttgaaaat actgtttttt tatcaaagag agtatcccca gccaagtgct gttattcaaa 5221 atcaacttag tgaaattgat attgatttta tcagtgagga tttacccaaa ctgatgtcct 5281 caatgcaaat gggtgctcaa cgtgtccacg acattgtttt atcattgaga aatttctcac 5341 gtcttgatga agccgaaatg aaagctgtga atatccatga aggaattgac agcactttat 5401 taatcctcaa tcatcgaacg aaagagaaaa ttaaagttat taaagaatat ggaaatttgc 5461 ccttggttga atgctatcca gcacaactca atcaagtctt tatgaatata ctcagcaatg 5521 caatagacgc attagaagag agaatgggaa ataggaagtc ggcaattgta aaagaaacac 5581 ttcctacccc cacaattcgg atttatactg agattatttt tgcagcctct gggaaagact 5641 ctgttccatg tgtttgcata agaattactg ataacggctg tgggatacag tcaaatttca 5701 aagataaaat ctttgagcct tttttcacga caaaaacggt aggtaaaggg actggtttag 5761 gactgtggat ttgctatcaa attattcaaa agcatcaggg gaaaattgaa gttaactcta 5821 acccagttca aggaacaaca tttctcatca ctctgccgct ttcacaatta taaatagcct 5881 agattaaata ggcttgtagt agcgcttcaa ctccaggaca aaactactgt tgactatgtc 5941 agttgtttac aaatgatgca ggctgaacaa actattatcc atactcaact agctacagca 6001 gccactgtct gattcactat ttcatatggt ttttctacgc cttgtgccaa atattcagca 6061 agctgagatt ctatttgctc ccgtacaatt tggcgatact ccaagaaatg ttcattgtgg 6121 gcacaggtcg cgagataatg ctctgcaacc ccggggttac gcttgagaat actaaacagg 6181 tggtgccaga acttccagcg agtttcgcgt tttatccctt gtcgccaaat aatcaacagc 6241 agcgctttca caacaaccca ctctggcatc ttgaatggtg 
ctttccaact cggtgcaccc 6301 atgattagga aacaacggta ggtacgatcc aagtagcaca ctgggtcata taaataacta 6361 aatgcttcaa tatattctct ggcaatatct tccaaagaac gagtggggat aaagttcatc 6421 aaagttgtct gattgatgtt accgtcttta ttttctcgta gtcgtccttc tttattgagg 6481 cgatgccaaa gggcagtatt tggtagcgct tgtaacatgg caaaagttgt ggagggaatt 6541 gctgcttgtt ctgcaaagcg gacaatgcga tcgcctgcgc ctgatttctc tccatcaaaa 6601 ccaataataa acccagccat cgggcgcaaa ccagctttga tgatggtttg cacagcctca 6661 gttaatgaac tacgcgtatt ttgaaacttc tttgtcagtt gcaaactgtc ttcatccggt 6721 gtttcaattc ccaaaaacac cgcagcaaaa ccgcactcaa ccatcaactc catcaattct 6781 ggatcttgtg ctaagtcaat agaagcttcg gtgtcaaacc ggaatggata cttgtgttct 6841 gccatccaaa cctttaactc tttgagcaac aacttaacat tgcgcttgtt gccgataaag 6901 ttgtcatcca ccataaaaat accacgccgc caacccagct tatagaggta atctaactct 6961 gccaacagtt gtgcaggagt tttggtgcga ggtttgcgtc cgtaaagaac aataatgtca 7021 caaaattcgc actggaaagg acaaccccgc gaaaattgaa ttgacatcat gtcgtaagcg 7081 tcaaagtcca gcaactcaaa acggggtatt ggggtgtttg tgacatcagg tttttctgta 7141 gcgcggaaag tcccagaggt ttcaccccgt ttaattgctt cgacaaacat cgggagggta 7201 atttcccctt catctaaaat taggaaatct gcaccagctt cttgaactcc cccaggaact 7261 gaggtggggt aaggaccgcc cacagcaact aacttaccac gttgttttgc ttcgcgaatt 7321 tgctctaata aatcttgttt ttgaactatc atcgcagaga agacgaccac atccgcccaa 7381 gcccattcct cttctgtcac tgggcggata ttgcgatcaa ctagcttaaa ctcccattct 7441 tgaggcaaaa ttgctgcaac tgtcaccaaa cccaatggtg gtaacaaaac tttgcggtca 7501 actaatgcca agattttttc ataagaccaa aatgttttgg gaaaaatggg ataaactagt 7561 aaaattcgca tgaaccgtca accctcacaa acaaaagtgt tttttcgcgc tcaactatca 7621 taaaaagttg attggtacac tgacgaagtt ttcttctttg aagtttctgt ccgatagcac 7681 gagtgtagct agttcacata ggtgaacaat agcgcctaag tgcatcgagt tgcaacccaa 7741 cccagagtgt ggcacttaga cgctatttat tgtcaaaagc aaaaattcct taactctgct 7801 gtgagtttga ctggctagat tgatctgcgg gctgtgtctt ctcctctttt ttctgaccct 7861 caccgactgc ttgcaattga ttgaggtgaa tattattcac ttgaataaca aagttttcta 7921 ttgcctcctt tgccttttgt tgttgttcgc tttcgtcagc tgagtcatag 
caacgataag 7981 atggagttat tcccaagaca aatttctctt gttcagcttc taacgattca actgtcgtgt 8041 tagcagaaat ccaattttcc tgaaaaggaa acgaggtgac gacactagca actatagagg 8101 caatagctgg aaaaattacg ggcagccact tcaggaaacc ttgacctggt tccaatttgt 8161 ctaccaaaac taaaattggt gtcactccgg aaaaaacaat ggttgacatt tgcaaaccat 8221 agtaaagatt tctcgataga tttcgagttt ttttataatc gtcaatcaac tcttggcaat 8281 actgtaaagc gttttctctc gctggcgcta tcgggttctt tgtcgagtga ttgttactgt 8341 tcatcagata gagttcggct tttttctgca gttcgagttt agaagcggcg ttactagaaa 8401 cccgaaatac ttgttgattg atgaggaaca gaaaaacaaa gatagccaaa gaaatagatc 8461 cagtgagtat gactcttgga ttgtctggga aaacaagagt aataacacct gaaccaataa 8521 aagcagctaa aaagaaatac tcaaagacct tcagggtaaa caacattttt tgacttgatg 8581 aggacgatga ttgttcctgc gtatcggata tttccgtctg gtttttctgc tcaatagtcg 8641 tcattgctca gttgccaatt cggtgatcac atacaagtat agtaagtttt ctggcaaaat 8701 gtttttcagc atacaaaatg ccaggattaa gatgcagatt agagaatatt taaaaagttt 8761 aatgagtgac ttccatttgg ggcaatctct cccaaaaatt cacgcaaagc cgattttgca 8821 aacaattccg gcgtctgctt gcgcttacgg aaacaaggac tgcaatgctg gactcaccag 8881 actgcctcac caccgcaagt ggctcacaat tttagcacag ggcgattttt cctcagctta 8941 ggcagatttt tcgcttgttt tacccaacgc ttgaaattct agccgagcac tgcctcagct 9001 atagaaagcc aacctctgta ggctttctgt agatccacct gcgtggacgc gtgtttgtat 9061 agcctttgaa ttttattcgc tgggtggctg ggtttatttt tcaaaatgag acgctttcac 9121 agctttagca ccggatgttc aagagtgcat ctactcaaag tattatatct tagaccagaa 9181 tcacttttct gtcaatgcat aactcctaaa attaggaata ccgacatata ttgtcttgcg 9241 aatctaaaaa atctgtaaat caaacaaata tttgttgacc atgtatttcc acgcaattaa 9301 tgtattgtac atatgcaaat ttatatattg catatacaca taataaaagt acttaattca 9361 tcccctggct gcccttgact gcttctactt gactatcacg aaaagccgcg atgtttcgag 9421 tcattggtca ttagtactaa ttactaaaga ctaagaatcg agacagtggg cggcggttga 9481 tgaggacagc actgcaccga cgtctcccgc actttggtga ttgtccgtcc gatctctaga 9541 agaaatgggc tttccggctg tctttggtaa gtattaagta caaaatcata tttttttgag 9601 gcatcacagc ttcagtatta aacatgaaaa tacaaggtac agactatctt gaaaatgagg 
9661 agttagggag ggaaagagtg agagagtgaa ggaaagaaga tttttatgtg tattggtgaa 9721 cgtagttgat gagagctact tagaggatgt ctgttgaagt cagaagtgtt atgtccaaca 9781 gagtgctctt cgtagtgttt gcgtatctgg tgatctctat gtctacggca ccgttgcttt 9841 gcgttccccc aggccagcct agcgtctggg caggagatac gctcgtgttc caactaagca 9901 tgctcaggac tacgttcata atgacaaaat gtacagctta gatatagttt aaacttctaa 9961 ctgaccagat agatttgcta aagttaataa ataaactatt aggaactcat ttctaagttt 10021 atatttagca tctatcaagt ctaatacaac ccgaaatgat atcaggctgg gtagaattgt 10081 agccattaat ctacagttcc aaatctgact ctagctatat tcgtaacaat tatcgttttt 10141 gatgaggatt aagaaatggt aaacataggc tatcatacgg ccaaatttca aggtcaattt 10201 attcgttctc tttatagaga tgtagttgtt aagttggtga atcttcccat caaacaaaat 10261 cgaaaagttc caataaatgt ctattcttta tcttgtgagc gcgatttacc agaacaagta 10321 gtcagtattc ggtcattcat ccgtcatgtt ggcattccag atacattcac tgtgatttct 10381 gatggtagct acactgatga tagttgtcgt ttactccgcc ggatacatcc ctgtgttgag 10441 gtggtgcaat taaaaaaact tcttagaaac gatttgcctc aatgtgtttc tgaatatgcc 10501 caacagcatg caatgggtag aaagttgtcg gcattaatgt caattcctgt gaatggtcca 10561 actatctaca cagattcaga tattttattt tttccaggtg gaattgactt agttcagtta 10621 gccacttcag acgataaata cactagatat ttacttgatt gctctaattc actagatcag 10681 agaatcattg taaatgattc tgagaagtta aatccaatca atgctggatt tattttattc 10741 aagcacgaat tggattggaa tttcgctatt gaacgcttgg cgaatatcga aggacttccc 10801 tcatatttta ctgaacaaac agtggtgcat ttgacaatcc accacaatca tggtatacct 10861 ctgtgctcgc aacgatatgt tctcaatgta gaagatcagt tcatttatcc agataaattt 10921 gctagtaaag aaattgcttt aagacattat gttagggatg tcagacacaa gttttggttt 10981 aatttggcgt tagcttgatg agtcaatact ttttgagtaa gcctgcaggt taaacaaact 11041 aagcctacat aagtgggtaa tttctatggg ctgcagaagc agcctttgtt agtgtgtccg 11101 taactctgca ctatgactgg gtgacagata ctattaaccg gcacgtattg aatggtaagt 11161 gtcgcagtta gtgtacgttc gctaattttt actaaaaacg aatacatcaa cacaatgcaa 11221 atctcttcat ttgacgtctt tgacacggtt ctgacacgag ttgtaggaac accaaaagcg 11281 ttctttcttt tgctaggtaa gcaactggcg ggtcaatccc tgattaactg 
tacacctgaa 11341 gcgttcgttc atgctcgcac tacagctgag ttccgggcac acagcaatgt gggtgaaaaa 11401 tattctttgc accaaatcta cacagaagtt gcgatcgccc tacgactcac tgatgagcaa 11461 cgcgaaaaaa tcatgcacat tgaatgtgcc ttggaatctg aactaattcg cccgatcccc 11521 attgcaagag atctcatcca gactgcccgc aagcaaaaca agcgtgtggt tttcgtgtcc 11581 gatatgtatt tgcctgctga gttcatcaaa gagcagttag tgcgtcattc gttctgggtg 11641 gatggtgatg aattgtatgt atcttatgaa tacggaaaat ccaaggcaac aggtgaactt 11701 ttccgagaat tgctcaatcg tgagggtgtt tcgcctgccg aagtctctca ctacggtaat 11761 gatttgcgta ttgatgttca aggtgcgaaa aaagttgggt tgaaagctca acatttttcc 11821 gaaggcaact tgaatcgata tgaacaaaaa ctggagtcac attcctatgc gacagaaggt 11881 ttatcttctg caatggctgg tgcttccaga ttggtgcgct tgcaagtccc ggtgtcgtcc 11941 tcaaaagagg aggctctacg tgatgtcgct gcaggcgtag tagctcctac attagtcggc 12001 tatgttctct ggatactcca acaggctcag ctgatgggct taaaacggct gtacttcgta 12061 tcaagagatg gtcaagtttt gctggaaata gcgcgtagat tagtcggcaa gctcaacttt 12121 tcttgcgaac ttcgctatat ctacggcagc agattatcgt ggaatctgcc agcagtcgtc 12181 agtcttgatc cacagcaggc attggagatg ctgaaaagac caagctggat tttggatggc 12241 acgagtaccc tctctattcg agactttctg gcgcgggttt ctattgctcc ccaagagatt 12301 cgtgatagct tggctgctat cggattcaaa gaagaagatt ggtctagaat tctgagtcca 12361 caagaagtgc aggcattaca cccagtgctt gacaaaccgg aagttagtga actcattctt 12421 cagaaagctg ttcaaaagca acaagttctg atgaaatatc tcgaccaaga gggggtgctt 12481 gactcaattc caaaggggtt agttgatgtc ggttggtttg gcagctcgta tgattcccta 12541 gctgcacttg tcaacgctaa cggtgcgaca cttgatgtgg gtttctttta tggtcttaag 12601 agtaattcca agggaaatca atctgactct aagaaaggct atttcttcga tcaacggaca 12661 agaactggct tcaaagatgt tttacctgag ttgggtattg ttcctttgga aatgttttgt 12721 agtgcagatc atggtaccgt gcttggcttc atggaggaag gggatcaagt tcgaccggta 12781 ttcaaggaag aacataatca aaggatcatc gactggggac taccacttgt taggaaggca 12841 gtttactctt ttacagagaa tctgttactt gatccaaact tagttaatcc atgggcagat 12901 gtgcgccaag cttctgcaga tgtccttcag tcgttttggt tgagtccttc ttatacagag 12961 gcgaaggctt ggggtgactt tccctgggag 
gcaggtcata gcgagaatac taattctctt 13021 gctcaatcct actcttggat aaatgttgct aaatcttttt tgactgcgag attcgcttat 13081 aatcaaggtc tatggtttga gggatcagtc gcccaaagtt ccttaccagt ccaaaaggga 13141 atacaaggtt ttcgccgcta tcggcgcttg ctcttgagta tcaagtctaa agttctcact 13201 cctagactca agctagttaa gcgctcacta caaacccctt aagaacgaaa attccttaac 13261 ttgtctatag taggaagatc ccactcctaa cccctccccg gaagcagtgt acacacaagt 13321 catcgaatga ctcaaaattc tgggaaaacc tcaccctgcc ctgtcgggca tccctctcct 13381 tattaatggg aaaacctcac cctgccctgt cgggcatccc tcacgccagg tgctacaacg 13441 gggggaaccc ccgcaacgca ctggctctcc ttattaagga gagggaaaga tttttgcgta 13501 gcaaaaagcg agggtaaggt tttgagcgag atgtgtgtac accgtaagct tgcagggaaa 13561 gttggtaacg gttctgaggt cattaaacga aaccgatatt acttgataaa tggcttgaca 13621 ccactaattt tcaaaatatc gctcatcgtc gcacagtaat ttggcaaacc caagctggct 13681 ggattgacga cattttggta agtaaaatgc cgatagttca tactgaatct atcccaagca 13741 gccatctgaa tacggaacaa aacgataatt aaattggcta aagataaaga taaaggaatg 13801 cgaaagtaaa tctttttgcc taaataagcg caaacttctt ctattgcttg atttacagtt 13861 aatttttctt cacccaaaac aatagagcgt gtttcctctt tttttggagg attctcaatc 13921 aaatatctta ttactgtggc aatatcttgt ccgtgaatga agtggaaact gccatctgct 13981 tttaaatagc gaatcaaatt tatatatttc gttacttctg gaataccaga tgtaatgtga 14041 gagtagggtt tattgcgatc gccccccagc actaaagtag gaaaaacact cgtcacttta 14101 ggggctattg ctaactgcga tatcttatgc aagcaatcgt acttggaacg gatgtaatct 14161 gtacccagtt cccctgcttc cttcaaaggt tggttgttgc gatccaaaac gctagctgtg 14221 gaaaaataaa tgacttgttc gcatttgtct ggatctaaca agttcagcaa ctcgtgcgtt 14281 ttatggacat taatatcaaa tgttttttca ccaccccaag ccgtcgctgt caacacagca 14341 acatctatcg ttttcagcaa gtcagcaaat tgctctattt cttgcatatc accctgcaag 14401 acagtcacac caggacgtgc ttgagtatcg acttgcagtt tctttggatt cctaaccagc 14461 agatacagtt cgtgttctgt ttcttgaatt aaggcttcac ttatgtagtg accgatacaa 14521 ccacttgcac ccgtgactaa aatccgtttc tgggtcatag agagtttcaa ataagattta 14581 ggagttatca attatgaatt gtgaatgaga aaaaagtttc atattagtag gcaaggatta 14641 ttcatgagcc 
agttggcatt aatttgatgc agttgacaat tgcgtcggat gaaacgatgg 14701 caaagcaagc aaagcaatta attgagcgag tacaattaga agaaacaggc acgctgccga 14761 aaaacgaaat actaaatatt attactacca ttgctgttta taagttttcg accttgagta 14821 gagaggaagt tgaagctatg ttaggactga ctttagagca aacaagagtt tatcaagaag 14881 cgaaagccga gggtcgagaa gaaatgttga gagcagcgat acctctgtta ctaaaaactg 14941 ggatgagcgt ggaacagatt gctcaacagc ttaatgttga tgtagaagct gtccgcttag 15001 ctgcacagca aagcacatag aatgaacatt gaagaaacac aaatctacaa agacttggag 15061 cgttaaacca agttaaaggc agcttcacgc cttttacagc aattttcagg taattgaacc 15121 acaatttgcg ggaaagtaga catggagtga ttgtttaatt cgatagcgat gagacagcca 15181 ggggacacaa aacattgcgc ccctaacctc agctgatcat acaactgcac tactaagttg 15241 cttagcggtt tcaaagaaga aagcaacatt ttcctctgga gtttccggta aaacaccgtg 15301 acccaaattt aaaatgtgtc ctttattgcc agctttccgc acagtatcgt aaatgcgatc 15361 gcgaataaat tccttagaac caaacaatac accagggtca agatttccct gcactttgag 15421 gtgcttacct aatctttccc gtgcgtctgc catatctact gtccagtcta cagtaatcac 15481 atcggcacct gatattgcca ttctctcaag caaacctgca ctaccactca ccaatagaat 15541 cagaggtgta tcaggatgag tttctctgac ttgttggaaa acccgctgtt gatagggtag 15601 tgcgaaggtt tcataatctt gaggactcaa ttgtcctgcc caagaatcaa acatttgcac 15661 aacttgtgcg ccacagtcaa tttggtagcg gacgtagatg gcgatcgcat ctgcgaattt 15721 tgtcagcaac ttatccagca gcgtcggatt tgagaacgcc atgtttttga tgacggaata 15781 ggtttttgaa ccttttccct ccaccgcata agcggctaat gtccaaggtg caccgacaaa 15841 gcccaacaca gttgattcgt tgcccacttc ctgccgcaat gcttgtagga ttgttttaat 15901 gaaaggcagc gattcttccg gttctaaggt atgcaggtta tcgatttgct cactcgtgcg 15961 aatgggcgag tgaatgattg gtcctttacc ttcagcgata tccatatcaa tacctaaacc 16021 aggtagcggt gtgacgatat cagaaaacaa aatcactccg tctggttgaa aggctttcca 16081 aggttgcagg gatatttcaa tcgctacttc cggtatttcc gagcgatcgc gaaaagaagg 16141 atacttctcc cttaaatctc ggtaagcttt catatatcgt cccgcttgtc gcatcatcca 16201 tacgggggga cgatctaaca cttcaccacg agctgccctg agaagataag gagcgttgga 16261 agaaataccc atttaacagt tcatcctaaa ctctttttta tgtcattgtt tagcttatca 
16321 tcctaggatt gccctttttt gataagactc gtctttgaag cgttgctcgt aaaaattcta 16381 ttcaaattgg actcaaaact tcataaatat taaagaataa gtcaggagga gccagcactg 16441 caggagggga gccactgcgt tggacgggtt tcccggcttg aagcatgtgg cgttctccct 16501 ccgtaggtgt ctggcgttca gaatgcaatt gagtggagat gcgtgataaa cttatgtaaa 16561 gcaccataaa cgctttaaaa ataagctttc aagcgattta catttcttaa tctagatgtg 16621 ttttttagcg cccacctact taggacatga ggtggaaatc ttcagggaca tggtgtccca 16681 ttggaggcga ccttatcaag tcacactgaa ctacctacac cgtacttgta ttcaagtaaa 16741 tgtaagtagt tcacagctat tccattataa attggtacaa tctcccgtca gtctacttcg 16801 ccaagacttt ccaccagaat tcttgcaatt atgtgcatcc ctattgttta agattctccc 16861 ctcacaattt cttccgctcc agcactcgtt tttatccgtg ggatcaggtg gtggtggtgg 16921 atcaaaaact ggttgagtca caggtattgg tgctgtagta ggtgcatctt tttgtgctag 16981 tgtttcttga gcaaatgctg gtgctgtaaa acttcccact agtagagtca aagctaatcc 17041 catcagaacg atgaaacggc gaaagttatt catgtacatg gttgattttc cttaagtgat 17101 tggattttgt tagagcttac acattgacaa aagtttaatt atgcgattgt ttgtagccct 17161 tcctttttta attagttgta agcaaaatta acttatgcat gagtgaaatt gaaatgagaa 17221 attcttgctc gatctgagtg ttccagaatc gtatgcaagt tggctatatg cgtttgcatt 17281 tgcgtagcac tccttttggg gacgggggtg agagaggagt agagaataat atataaatat 17341 aaaaagttag tgacatctac cctgtctacc ctgtctacct gaggtcttga aagcttgaaa 17401 gctttatgga acaagagttt cggaggtaga cagggtagta gacatgaata catatgtact 17461 atgaatacca tgttttaggc aaaattgggt ataagataag ctgcgtatgt tgtcagcttg 17521 ggacagtttt gtgtttaaga cataaaaaaa aacctcatac gttcacttaa cgtagagggt 17581 tctagaaatg attttagtaa gggttaggta tacttaagaa taaagattaa caccgctgct 17641 tggttagagt atattcaaaa tacctttgtc tgcatctaat tgaactggga ctcccacagg 17701 taaagctgca ttgggagctt catgaccaaa aggcaagttt gagacaatgg gaatgcctaa 17761 atctcctagg cgatcgcgca acacttcctc aacactaaag ctaggaacat ttgcaggcgc 17821 ttcacaacgg ctaaagctac ccagtgcaat tccacgaact tttgataaaa caccactcat 17881 ccgccattgt gttagcatgc ggtcaatgcg gtacggaact tctgtcacat cctccaatgc 17941 caaaaccaca ccatccatat ctggtagcat tggcgtacca 
agcagatgag ttgccaccgt 18001 aagattacct ggtaacaaga taccattaac cacacctcca ccccaaccac aaccctttaa 18061 aggtgcaaga gggcgacctt ccacacaatc aaataacctt tgaattgacc aatctggttc 18121 gcaagccaga gttgtcagca cggaaccatg aacaccagaa attcctgctg tgtaaagact 18181 ccacaaaagc gcggtgatgt cagaaaaacc aatgagccac aatggatgtg ctgagttttc 18241 taaattctgc catgtccaat tttccaaaat acgggtgcta ccgaaaccac ctctagtaca 18301 aagaacacca cgacactctg gctcttgcca tgctgttgct agttgctcgc gcctaaattc 18361 atctttacca gctaaatatc cccatttgtc ttctatctct gggaatattt cgactcgata 18421 gccacgcgat cgccaaattt ctaccccttt ttggaaagct tcaaattccc gtaaagcacc 18481 gctaggagaa atcactcgta ataagtcacc tggtttgaga ggtggaggaa caagtttaga 18541 tttgagattt tgagttttgg attgcatcaa taaacttatt tcagagcaaa taatacaaaa 18601 tgttaatttc gtattcgcta tttatatctt cttaaccgtg caaaatccaa tatcagagca 18661 aaaaatatgg ctcatgaatt caaaaatcta aaattttcag aaagcatcgt tgagagcgat 18721 agagtgcagg cggcttccgt atcgcccgta tcacgcccta cgccccagag aaaattcaag 18781 tgaacagtga acagtgagta agcgtgcgaa tgacggcttt cccgacagag gcatctggcg 18841 taagcgcaag cgttcgccgg aggcgtgcgc gcagcgtctg ggcaggagat acccgaaggg 18901 tgaacagagg aagcgatttc ttgactttct tgataactga taactggtaa ctggtaactg 18961 ttaaaaattc ttgtcttctt cttcctcatc tttaccttct tcacttgcat caccaggcaa 19021 agcacggata cgattcatct gtgaaagagc atactttgct gttgatcgca cgtctgcatc 19081 cggatctcgc acagcatgag ataagatttg actcatttga gccatcatgt catatacgcg 19141 agtcaaatcg cgaatggcat tttgccgcac ttgtggactt tcatcttgca atgaaattgc 19201 caaagcacgg ttcattggct taagtgtgcg ggtgttaatt tctgctaaag cagctaaaat 19261 caagctgcgt tgctgagaat ctgcatccac catcaagtcc accaaaggct gaattgctcg 19321 tgagtctcct ttttgaccta aatcccagat agctttacgt cgctgtgtta tatcagggct 19381 acgtaaatct gaaatcaact ggtcaatgat gtcaactttg gcaaggcgag aagttgtttc 19441 cattggtaaa aattcagtag acggtggtat cgttgtttct gtttcttgtg tactgatgac 19501 actagtgtta tgaaactcct cgtcgtcgtt ctttgtgtga aacttttcta ttgtttgcgg 19561 aactgaactg gtgtttagct gactataatt ttctgggtat tttacaactt ttgtaccctt 19621 ctcaatcttt tttactatat 
aaagtaatgc cccaatactt cccaaaacac cagtacccac 19681 aagtgaccac caaactatgt ctgtttctgt ctctttcttg gtcttgggtt cagtggtaga 19741 actaattgca atggaagaag gcggtaattt cttgtttatt gctgtctgaa gactctgcct 19801 agttgcatca ccaaagatac cgtctacact taaacctttt gcttgttgaa atttagtcac 19861 agcacggctt gtacttatgc cataccgtcc atctatctcg ccatcatagt atcctaactt 19921 cttaagtaag gtttgcagtg acttcacttc tacacctgtg ctaccaagtc taagaatgga 19981 ctgcttagca gcatctttgg aactggcttg agcaagttct aaatttttaa aaacaatttc 20041 aggtttaact gcacttgctg gacgttggtc aaatcctaag caagcaaggc aagcaatgaa 20101 aataatagag gaatagcgtc gcctcatata gcaagtttca gccagaacgt gcattggaag 20161 ccattgatac cacagtgaca tcctcccaca cccttcgggt atacctactt tcgcctgacg 20221 gctttagccc agagggcgtg cgctttgcgc atacgccaga tgccaagtca gggaaacccg 20281 tcattcgcac tggtctgacc aaatcagaga ttatggtgtg ggcttcccca gatcactctg 20341 aagatttcct ggttttcccc tttcgggatt tcgccacttg cctagacaaa gaaattttcg 20401 tccccggtag atcggctgca caggcagttt ccattccgcc aagccctggc gtactaattg 20461 aggctatgcc tcggtttcta atgacttgac cactagctac atctctgtcc gtggtataac 20521 cacagttgaa acaatgatgt accctagttt tcaagtcttt ttttgcgact gcaccacatc 20581 ctggacattc ctgagaggtt cccctagagt caacttgtgc aaagaacttt cctcgtttcc 20641 agcatacata cttgacgata gttcggaatt gaccaaatcc tgcatcaagc atatgcttgc 20701 ccagcattcc ttttgccatt ttggagtaat ccaagtcttc catgaagacc atatctccag 20761 tgtcacaaag ggcatgagct tgcttaaaat gaaagtcttt ccgactgtta tctattgtgt 20821 ggtgcattct tgcaaccttc atacgttgtt tttcgtaatt cttagaccgc ttctgtgttc 20881 tagacagtct gcgttgcagc aatttcagct tactttgcat tgttttgaag aacttaggcg 20941 gttttacaag aacaccatcg ctggttgcta aaaacttttc tagccccacg tccactccaa 21001 taggataacc gtggggcatc ggtgaagaaa cattgacatc gcattgaata ttgatagatg 21061 cataccattg attggctttc cttaaaacac gcacttgctt taccacaaat ccattaggga 21121 tcgggcggtg caagttaata gggattattc caattttagg gagcttcaaa tgaacgtctg 21181 tcacaggact ttctttgaat tgaggaaaaa gcagtgactt caggtgacca tattttttaa 21241 aacgaggaaa tccaaatcca cgttcttgaa aatattccca tccacgatgc aactgcttaa 21301 
tcgtctgctg caaaacctga gaaggcactt ctcgtaattt cgggaagttt tctttggctt 21361 taggcaggtt gtttagttgc ttgacaacgc aaggaaactt tagatcggca ggaaggatat 21421 actctttttc taaagagcat ctatcaatta aacacttacg actctcgcac caatccttga 21481 tctctcgaag tgcatagttg tatgcacctc gacaaagttc catccactca attaatattt 21541 gctcttgagc ggcattagga taaattcgat agcgatagtt aagtgttagc atgaaatgaa 21601 ttgtagcata tttactagac ttgtagtgtc tagtaaatgc ttacctcatc cctgagggac 21661 taagcgacaa aaaatcggca gccttcttat tacgagtacc cgtgcgcacc gacagatcaa 21721 tcaaacaaac aaaggctgcc gatttttcta gcaggatgcg acgccattga ggggaatcgg 21781 taagtgcgga gaccgatgta tcccacgcgc ccgttgagta caactatggc gtgagtctta 21841 tagcccccga acccccacaa gataactata agctacataa atatcaaatg tcttcttggc 21901 ttcggtatat tgaatacact ctggctaata ctcaaattta agcacatgag cgggagttgt 21961 tgtgcttgag tcggttattt atttttgaga tattttttga ctaccctccg ggttcgccag 22021 tcgcctgcgg agggaaaccc tcccgcagcg ctggtctc // LOCUS NODE_1452_length_21814_cov_5.56422621814 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21814) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21814) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
            Information about the Pipeline can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..21814
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(38..340)
                     /locus_tag="DP116_12950"
                     /pseudo
     CDS             complement(38..340)
                     /locus_tag="DP116_12950"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009460325.1"
                     /note="incomplete; partial in the middle of a contig;
                     missing stop; Derived by automated computational analysis
                     using gene prediction method: Protein Homology."
/pseudo /codon_start=1 /transl_table=11 /product="alanine--glyoxylate aminotransferase" gene complement(562..753) /locus_tag="DP116_12955" CDS complement(562..753) /locus_tag="DP116_12955" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12955" /translation="MHLKSEKIDNDSAGCGTWVTEKKKRCISSGKCTRNTASLIIRPL LVVWESLALQQGYKETLPP" gene 1121..4144 /locus_tag="DP116_12960" CDS 1121..4144 /locus_tag="DP116_12960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878004.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PAS domain S-box protein" /protein_id="PRJNA477356:DP116_12960" /translation="MNETVKALYTPNSQQQALPLLEKIATNLPGMIFQFLKRRNGSQF VIYASSGCRELFELEPQVLQADFQVLNKLTHPQDIKAFEESIAACCATGEPWRWEGRI ITPSGKLKWIQGTSRAEQQLSGDMIWDGFLIDITDRKLAEEKLRESEARYKAILDAIP DLMFRISRDGKYLDFKGEGANVTIPRHEIVGKTLQELLPPDIALKSQKAIAKALDSKM LQTCEYQLPTPLGIRDYEARLVVSGQDEVVAIVRDITERKQTEVTLRNLAQKFATVFH CSPNPISISTLAEGRYIEVNDSFVKQSGYEREEVIDRTDFDLHIWVNRSDRTTVLQQL QKHGVVRNMEFEFRRKSGEIRTGLFSAEVIHLDGIPCLLSVNHDISERKKAELALRES EEKFSKAFCSSPNPISILTLKDERYLEVNDAFVQITGLSREEVIGRTRSSFMWVNLPD NTIQTLQEQGFLRNLEIELYTKSGELRVMLFSAEVITLGGEPCMLCVTNDITERKQAE ELLRLSSKRDRLLTQTLSRIRQSLNLDQILQTTVNEVRQFLAADRVFIVLNDTNVHSG VFAESVDPKYSSVLNWKNEDKTLIEELKTILKSNRVRIVEDTTKISASAKLKGLYKQF QIRANLAVPIMQGKELFGALIANQCSKPRHWSAIEIDLLQQMSEQVAIAIQQAQLYQQ LEQFNTNLERTVEERTAQLQQKMLELQELNSVKDVVLHTVSHELRTSVMGNLMVLNNL LNRGQQEDTSVQKSTTLRGVNKGLRGGHPLSEIEGSIPRLGNDTRLLKFPRLRGMSLD KKHFLCISPPSAVTAFSIPVSGSIIERMIQGNDRQLEMIESLLELHSSEAQGIVLHRE VVSFHSLIGRIIRDLEPMLTQQKSTVKNLVPDDLPSVIADPAQLQKVFVNLLTYSWQH NPPGVKFTLKAIVETGRIRLQIQDDGVGMSKLECDRLFDLYVRAPQAPCSTRIGLKMY LCRQIIKAHGGEIGVSSNRKRGLTFSFTLPLAIST" gene 4429..6624 /locus_tag="DP116_12965" CDS 4429..6624 
/locus_tag="DP116_12965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316264.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PAS domain-containing sensor histidine kinase" /protein_id="PRJNA477356:DP116_12965" /translation="MKKAKAFLAVGRFGQRLRATVFRTVQKCLKKVFVFTSSLPSYTI ASDYEAWRDRFLWQRLHLALWIAIILVLTFTFRDIYDAFFPLKELAEIPKEVKRLFFV LYAAIVLSLLSCLAVHRTQFGRRHPAFVFLGLGWSITIVEQIWATLNNFALPDFFAWF VMFLSQATLIPVRFPLHLVSQLGVLVYYFGVNTALGLKLPPPDSNRPIYSVTMSLALF WFCFICNLGVYLYDRLQRSEFFARKELETAYQKLGATEEKYRSIFENAVEGIFQSTPD GHYITANPALARIYGYSFPEEVIANLTDIEHQLYVDPNRRTEFMRLIEEHGSVSEFES QIYRKDGSIVWISEKAYAVRNQSGKLLYYEGVISDITKRKQVEEALQEQLNFLQVLVN TIPTPVFYQNPQGLYVGCNKAFEEDLGLSKKQILGKSVYEIAPKDLADKYHQADRELL EQGGVQTYESSAIYKDGTKHDVIFYKATFCKADGTLGGLVGVILDISQRKRTEEALRV FFHAVSHDLRNPVLGTLMVLKNLLSHPEEKILLSRSILERMVQSGDRQLNLINSLMEA HVSEVQGIVLQRQTLQLHTVVEAAITDLEPMLKENQALLTNLVSANLPSVNADSTQLW RVYSNLIVNAVKHNLPGLSITLDATIEGDKMCCTVSDNGVGMSQQQSEHLFDLYFRGS NVRNSLSLGLGLYLCRQIITAHGGEIGVKSDLGAGATFWFTLPLEDNLK" gene complement(6700..8400) /locus_tag="DP116_12970" CDS complement(6700..8400) /locus_tag="DP116_12970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317370.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="guanylate cyclase" /protein_id="PRJNA477356:DP116_12970" /translation="MNLQPPLPNNELDRLNALRQYQILDTPSEKVFDDLTFLAAQICH TPIALISLIDGNRQWFKSKVGLTVPETSRDLAFCAYTILQQKPLIVRNALADSRFATN ALVLGDPNIRFYAAAPLITPEGFGIGSLCVIDIMPRDLTLEQVEALRILSSQVMAQFE LRRHAITLSRTIIQQQQTAEQLRQQHDFIEVFYRRGEEKARKGDYEGAIVDFNEFLRL NPNGFKAYYNRGLARQKLGDYEGAIIDFDKYLRFNPNDVEARQNRGLLRFELGDYKGA IADYSSAMNPHSDNSIPGDMGFTYSEVENKEAVVEHTHYLELHSSDIDIDTDINITQT HTRSVLENNTNTFDDSIQFLPFHSDDGEVYVTLSNTQYQPEESTQALLLSSDQAKLYI QRGHARCELNDYSAAIEDYTQSLKMNPHDPQAYISRGNARFILKDYTGAIDDYTQCLW INPNHAKTYISRGNVYCELEDYSAAIKDYTRCLELNSNNAEAYINRANARSRLKDYHG AIEDYTHFLQLNPNDAKAYISRGNARSKLKDYSGAIEDYKRVWSKAIEQLPKLSSEEI " gene 8969..9718 /locus_tag="DP116_12975" CDS 8969..9718 /locus_tag="DP116_12975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015154871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_12975" /translation="MIVLMVEHPPERNTSEEERFQDDYYSYFESPENAPRYKPAVSFL DDAFEQQSFSLLDLGCGNGALCKYLPERCDYFGVDHSELAIEYCLKVYPNRNFVAKDL SVALPQLVAENKRFDAVVLCGLLFHTTDKETLEKKDDQELIQFCLDKLLTEKGYLVII VPFAYGDHPSHNLYARAEWLQNSVEEMLETAKAKIVYGNISLLIGLDKKIRQQKTTPD WFVPDSNADYSNKYAGTYMATWTFLAIPSER" gene complement(9834..10529) /locus_tag="DP116_12980" CDS complement(9834..10529) /locus_tag="DP116_12980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873086.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_12980" /translation="MKDIFSCPEESNFYSNCLENLVFRNCKKSECVVEFGSGDGSPVI NSLLRNRFDGVIHGFELNPSAWKAANSTIDEYSLGQKYIISNSCLFDSSQPEAEYLVA NPPYLPAPDNDIYMPLLFGGEDGATVTNELLSLGYDNVLLLVSSFSNPESTIHHARAN GYFVKNFVVLPLQFGYYSSEPKVKKHIEKLRKNKMAFYSGDYYFLAGVLFQKSQESKV DLSNELAQVMTSL" gene complement(11065..12180) /locus_tag="DP116_12985" CDS complement(11065..12180) /locus_tag="DP116_12985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869235.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="iron-containing redox enzyme family protein" /protein_id="PRJNA477356:DP116_12985" /translation="MQSNLVMSPLAEVEAKSTQIEEKISIDYELAEQQFIELLATENL DKKLDADPSQKNEFEHTLAVAISAAYQNGASDDAAHRFLQRILYRINRLNLFWYDDLR HYVNERSPYLYKVRDQIETAWQEWELGQIDVAALQQLDVKQALIERAAYDLEPPLSED SRYIQEEMSEAGYRHLLAIGSFEGLVESSHLPRILGGAANAVQCTLMRVFQEEYGNGR LPRKHSTFFAQMLNEFGMNTQPEGYFDLIPWELLACSNNSYLISERKRYFLRYNGGLT YFELSVPATFRSYVAAAQRLGLSNAAMSYWEVHIREDERHGRWMLDDVALPLAQMYPN DAWELVLGYDQEKLMGDRASKAVVRSIREAEQVASSM" gene complement(12477..14321) /locus_tag="DP116_12990" CDS complement(12477..14321) /locus_tag="DP116_12990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="penicillin-binding protein 2" /protein_id="PRJNA477356:DP116_12990" /translation="MQKSSSRTKLKNFNFPKPSFKKRQQKGSRRESNSKLPDSTQEQP PNMKVRLFIVWSFLIAAGVGLGVNLYNLQIIRGSKLTEKARNQQMVNLRPFIPRRPVV DRNKDLLAIDRPVYTLYAHPKLFEKSNHEMAELLSPIMDQDAAELEKKFDSKKSGITL ATALSEEIADRIASLHLNGLESIQKYSRLYPQQDLVAEVVGYVNLDRRGQAGVEYSQE KLLERSMHMVRLSRTGNGSLMPDYAPDGLFNSDDLRLQLTIDSRLQRVARFALKAQMQ KYRAKRGAVIVMDAWDGSLLALVSQPTYNPNEYSKADISLFKNWTVADLYEPGSTFKP LNVAIGMEAGIIRPEDVFNDPGQIQVADRTIRNAENKRYGRINIAQILQHSSNIGMVK IIQKLKPSVYYGWLERLGLGQSVDTDLPFTVNSQLKSQQEFLATPVEPATTSFGQGFS LTPLQLVQMHGALANGGKLVTPHVVQGLIDTKGQMHYSPNRAAPRQIFSYVTARKVVE MMETVVDDGSGKASQIPGYRIAGKTGTAEKASRAGGYIIGAKITSFVGILPVESPRYV VLALIDDPRGKNAYGSTVAAPVAKSVMEALITIEQIPPSKPMTQIPAN" gene complement(14394..14930) /locus_tag="DP116_12995" CDS complement(14394..14930) /locus_tag="DP116_12995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_12995" /translation="MPAVLTPVSTKKQRASSKKSSLSPQELTREASSAKKQTTTSVSS GKQKASTVPVMPSPESVPSWLLRLYTLHRHSSIMAFALVAVTLVVYGWTVYSQQLWSQ AYRKMQNLQRHERQLMTTNEVLKNKMAQEAERPPANLVSPSPDKMIFWTPAPVEPNSL PTTPNSERQQQTPNPVGY" gene 15575..15847 /locus_tag="DP116_13000" CDS 15575..15847 /locus_tag="DP116_13000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317375.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13000" /translation="MKVKIFSFALILGLATVLGACDGGGGAPEGGAGGAGGAGGAATT PAEPADTGATPPAAGGTATTPPAAGGTTATPPTSATTPATPTKSPK" gene 16495..16812 /locus_tag="DP116_13005" CDS 16495..16812 /locus_tag="DP116_13005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015080544.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13005" /translation="MRRVIKFVLFIVCTSILVELNTSQGALVMALPPEQDLPEEILRT EIITTARSPVDGKPLTAGEYAQLQVQLQKAPPPKLAPRIQDQVFLIRLRKTLLQLFPF LDI" gene 17059..19347 /locus_tag="DP116_13010" CDS 17059..19347 /locus_tag="DP116_13010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323003.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="1,4-alpha-glucan branching enzyme" /protein_id="PRJNA477356:DP116_13010" /translation="MTTIAPEQVNNIVWNQHQDPFEVLGAHPIEQNGKSIWAVRAYLP NANQAWVILPEERKEYSMEAVHHPHFFECTIETTELANYQLRIKEGEHERVIYDPYAF RSPRLTEFDLHLFSEGNHHRIYEKLGAHPTEINGVKGVYFAVWAPNARNVSVLGDFNL WDGRKHQMRKGQTGIWELFIPAISVGEGYKYEIKNIEGHIYEKSDPYGFQQEPRPKTA SIVTDLDAYTWSDEEWLEKRRHSDPLTEPLSVYELHLGSWLHGSSAEPPLLPNGETEP VVTVSELNPGARFLTYRELAQRLIPYVKDLGYTHIEVLPVAEHPFDGSWGYQVTGYYA PTSRFGRPEDFMYFVDQCHKNGIGVIVDWVPGHFPKDGHGLAFFDGTHLYEHADPRKG EHKEWGTLVFNYSRNEVRNFLVANALFWFDKFHIDGIRVDAVASMLYLDYCRENGEWL PNQYGGRENLEAADFLRQVNHTIFSYFPGVVSIAEESTAWPMVSWPTYTGGLGFNLKW NMGWMHDMLDYFSMDPWFRQFHQNNITFSMWYHHSENYMLALSHDEVVHGKSNIIGKM PGDRWQKFANVRCLFAYMFTHPGKKTMFMGMEFGQWSEWNVWSDLEWHLFQYEPHQQL KEFFKQLNHLYRSEPALYTQDFAREGFEWIDCSDNRHSVVSFIRHDKDSDDFVVVVCN FTPQPHSHYRIGVPELGFYTELFNSDARQYGGSNMGNLGGKWTDNWSLHNHPYSLDLC LPPLGVLILKLDKQKTAQVMES" gene 19523..19729 /locus_tag="DP116_13015" CDS 19523..19729 /locus_tag="DP116_13015" /inference="COORDINATES: ab initio prediction:GeneMarkS+" 
/note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13015" /translation="MTYQTPKEKLEALLAQIENDLDKDRQKACQNLSNVEKSYINQKL LDSFTDLWQRYIILKSYCSRNRQY" gene complement(19738..20319) /locus_tag="DP116_13020" CDS complement(19738..20319) /locus_tag="DP116_13020" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13020" /translation="MGDFEQKYKDIATFVKKFKDIIREEIESLDLTEEPVKSVLIEKL QQVFKYDFALEDIENSYKACQEASTWLSKNRSKLVIKIQNLISQRKIISNKLNTSVLA PQRIEQFSKDIELYLKWIGHYMAIGDAPTPLPNGVISFVLPPVAYLEVFKFIRNQIIS TKNGLSEEAVVEAQGYFDRFLIEPLSQFNFYDS" gene complement(20384..20815) /locus_tag="DP116_13025" CDS complement(20384..20815) /locus_tag="DP116_13025" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13025" /translation="MNVVFLAQLAVNFFLGKALEGALQSIGSDAYKASIERLRGFFQW KFAGKPELTQAIENPKALEALVEKKATEEEDFRKELEKLVAELQEAIKNTSASGTNYN NVGSITEQDIESVSGVNAGHNTVTGHQTNVGGNQTNHNFRT" gene 21178..21471 /locus_tag="DP116_13030" CDS 21178..21471 /locus_tag="DP116_13030" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13030" /translation="MLWLGMHKLSLDHNKLLLHHNNLSLDHNKPLLHHNNLSLDHNKL LLHHNNLSLGYDNLENIQWAIQGGCGQNQLVDIGESTASPTPRVTASVSTNYL" gene 21565..>21814 /locus_tag="DP116_13035" CDS 21565..>21814 /locus_tag="DP116_13035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996206.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative thioesterase" /protein_id="PRJNA477356:DP116_13035" /translation="MKTSSFNSWVICPKPNPQAKLRLFCFHHAGGGALSFRPWLNYLP SSVEVCLIELPGRGTRMKEALFTQFEPLIQALEKALLPS" BASE COUNT 6181 a 4554 c 4716 g 6363 t ORIGIN 1 gactagctta tgtacggact gattaccgtg attcagctag taacgaatcg cacggttcta 61 ttgtgttggc gatcgccgct tccattgcag ctgatcctgt cccactaatt gcgatcgtcc 121 actggttttt tgtctgccac acatagcgca gcaaagactg aatttcatcc atcaatgtga 181 ggaaagctgg atcaagatgc ccaactggcg gagtattcat tgcttccaga actgcgggat 241 gggcatttga gggtccaggt cccaacagta ggcgggatgg tatcgtcagg ggtgaaagtt 301 gtaaccgctc gttgttgtcg atagagagtg tttgcatgat tattttttta atggttaatg 361 ataattaatg catatatgat aaatttttat tgtgcattat ttttgaaaaa aaattcagga 421 gtcagcaatg cttgtagtgg acactgaacc tagcactcct cacccaggag cactttccaa 481 cattgctgac tctttcaagg tcagtcgcct aaatagttat tattttgaca aaattagcat 541 tggtgctcaa caaatagggg tttaaggggg tagggtttct ttatacccct gttgtagtgc 601 aagggattcc cagaccacaa gtagggggcg tatgattaag gacgccgtat ttctcgtaca 661 ctttccgctc gatatacatc tttttttctt ttccgtaacc cacgttccgc acccggcact 721 gtcattatcg attttttctg attttaaatg cattttttat accttgttac gtacaatcgt 781 cgttctgctc cctgctccct gctccctatt ccctattccc tgttaagcgt tccctattcc 841 ctattcccta atagcagatt tcatttattg acttgacatc acattcaaat tacgttaaaa 901 attctcaaaa ctcagttgac ttattgttgc ccaaatcact cttgtgcaag tggttcatcc 961 aattgggtga atgcgaaatt aaactgggga atataacctg aaagaagaac gactttttaa 1021 agttttgttt ggttgctatc caaataccgg aaaaactgcc aaagagtcga taatggtctt 1081 gcgtgataca 
tttattactt catgagaaaa aggtcagagt gtgaatgaaa ccgtgaaagc 1141 gctgtataca ccaaattccc aacaacaagc tttacccctg ctggaaaaaa ttgctaccaa 1201 tttgccgggg atgatttttc aattcctgaa gcggcggaat ggatctcagt ttgttatata 1261 tgccagttct gggtgtagag aactgtttga attggaaccc caagtgctac aggcggactt 1321 tcaagtactc aacaaactga ctcatcctca ggatatcaaa gcgtttgaag aatctattgc 1381 tgcttgttgt gctacgggag aaccttggcg atgggaaggt cgcattatca ctcctagtgg 1441 taaactcaag tggattcaag gtacttctcg cgcagaacaa caactttcag gagacatgat 1501 ttgggatggt ttcttgatag atattacaga tcgcaagtta gcggaggaaa aattacgaga 1561 aagtgaagca cgctataagg cgattttaga tgctatccct gatttaatgt ttcgcataag 1621 tcgtgatggc aagtatctcg actttaaagg tgagggagca aatgtcacta ttcccagaca 1681 tgaaattgtt gggaaaactt tgcaggaatt attacctcca gatattgcct taaaaagcca 1741 aaaggctatt gctaaggctt tggattcaaa aatgttacaa acgtgtgaat atcaattgcc 1801 aacgcctttg ggaatcagag attatgaggc gcggttggtt gttagcggac aagatgaggt 1861 tgtggcaatt gtacgggata ttacagaacg caaacaaacg gaagttactt tgagaaattt 1921 ggctcaaaag tttgcgacag tttttcattg tagtcctaat ccgatctcga tcagcactct 1981 tgcagaagga cgctacatag aagttaatga tagttttgtc aaacagtcgg gctatgaacg 2041 agaggaagtt attgatcgta ctgattttga tttacatatt tgggtgaatc ggagcgatcg 2101 caccactgtg ttacaacaat tgcaaaaaca cggcgttgtt cgcaacatgg aatttgaatt 2161 ccgccgcaag tctggtgaga tacggacggg actattttca gccgaagtca ttcatttaga 2221 tggcattccc tgcttattat cagtgaatca cgacatcagc gaacgcaaaa aagcagaact 2281 tgccttgcgc gagtcggagg aaaagttttc taaagccttt tgttctagtc ctaacccaat 2341 ttctatcctg actcttaagg atgaacgcta ccttgaggtt aacgacgctt ttgtgcaaat 2401 cactggctta agtagagaag aagtgattgg tcgtacccgc agtagtttca tgtgggtcaa 2461 tcttcctgac aataccatac aaaccttaca agagcaaggg tttcttcgca atttagaaat 2521 agaactttac acaaagtctg gtgaactcag agtaatgctg ttctcagccg aggtgattac 2581 tttaggtgga gagccttgta tgctttgtgt gactaatgat attacagaac gcaagcaagc 2641 ggaagaatta ctgcgcttgt cttcaaaacg cgatcgcctc ttaactcaaa cactatcgcg 2701 aattcgtcaa tcgctgaatc tagaccaaat tctacaaacg actgtcaacg aagttcggca 2761 atttttagca gcggatcgag 
tttttattgt tttaaatgat acgaatgtcc actcgggagt 2821 ctttgctgaa tcagtagatc ccaaatattc atcagtccta aattggaaaa atgaggataa 2881 aactcttata gaagaattga aaacaatcct gaaaagcaac cgagtccgta tagttgagga 2941 taccacaaaa atttcagctt ctgcgaaatt aaaagggctg tacaaacaat tccaaatcag 3001 agctaactta gctgtaccca tcatgcaagg taaggaattg tttggcgcgt tgattgcgaa 3061 ccaatgcagc aaaccacgcc attggagtgc gatagaaatt gacttgctgc agcagatgtc 3121 tgaacaagtg gcgatcgcca ttcagcaagc acaactttat caacaactag agcaatttaa 3181 cacgaatttg gagcgtactg tagaagagcg tactgcacag ctacagcaga aaatgctgga 3241 acttcaagaa ttaaacagcg tcaaagatgt tgttttgcac accgtttccc atgaactgcg 3301 gacttcagtg atgggtaatt tgatggtatt gaataattta ctcaatagag gacaacagga 3361 ggacacctcc gtgcaaaagt ccacgacttt gaggggagta aataagggac tccggggagg 3421 acacccctta agcgagattg aggggagtat cccaagactt ggaaacgaca ccagactact 3481 caagttccct cggttgaggg gaatgtctct agacaaaaag cactttttgt gcatatcccc 3541 accttccgcc gtgactgcat tctctatccc agtatctggc tcaatcattg agcggatgat 3601 tcaagggaac gatcgccaac tggaaatgat tgaatcactg ctggaactac actcaagcga 3661 agcacaggga attgtcctcc atcgtgaggt agtttctttt cactcattga ttggtagaat 3721 tatcagagat ttggagccaa tgctgacaca gcaaaaatca actgtgaaaa atttggttcc 3781 agacgattta ccatcagtga tagccgaccc agctcagctg caaaaggttt tcgtcaactt 3841 attgacttat agttggcaac ataatccacc tggagtcaag tttaccctaa aagccatagt 3901 tgagacagga aggattcgct tacagattca agatgacggt gtgggtatga gtaaattgga 3961 gtgcgatcgc ctctttgatt tgtacgtgcg cgccccgcaa gccccttgtt ccacaaggat 4021 tggcttaaaa atgtatcttt gtcggcaaat tattaaagca cacggtggtg aaattggcgt 4081 cagttccaac cggaagcgcg gcttaacttt ttctttcaca ttacctctgg caatctcaac 4141 ttaagatgat aatacgcgcg ttacagctca attacctcac ccctatcccc tctccttaga 4201 aaggagaggg gtgtccgtta gggtagggtg aggtgagacc agcgctgcgg gaggggagcc 4261 agtgcggtct tggggtctcc ccaagtagag cacctggcgt tttcccgaca acaggcgact 4321 ggcgttagcc gtcaggcgtg cgcgcagcgc atacccggag ggtgatatct gattgatttg 4381 taaacaacat ccgcagtcca agttcaataa gagggtgcga tggcaaaaat gaagaaggca 4441 aaagccttcc ttgctgtggg aagatttggt 
cagaggttaa gggcgactgt atttagaaca 4501 gtacaaaagt gtttgaaaaa agtttttgtc ttcaccagct ctctacctag ttataccatt 4561 gcatctgatt acgaagcttg gcgcgatcgc tttttatggc agagattaca tctggcgttg 4621 tggattgcga tcatcttagt cttgactttt accttccgag acatttatga tgccttcttt 4681 cctctcaagg agttggcaga aattcccaag gaagtgaaac gccttttttt tgtcctgtat 4741 gctgcgatcg tgctcagttt actgagctgt ttagctgtcc atagaactca attcggtcgt 4801 cgtcatccag cgtttgtctt tcttggctta ggctggtcga taactattgt cgagcaaatt 4861 tgggcaacac tcaacaattt cgctctacca gattttttcg cttggttcgt gatgttcctc 4921 agccaagcca ccctcatccc ggtacgcttt cccttgcatc ttgtgtctca actgggtgta 4981 ctggtatact attttggcgt gaatacagca ctgggactca agctaccgcc acccgatagt 5041 aatagaccaa tttacagtgt gacgatgagc ttagctctgt tttggttctg ttttatctgt 5101 aatcttggtg tctatttgta tgaccgtttg caacgttctg aatttttcgc ccgcaaagag 5161 ctagaaacag catatcagaa actaggagca actgaagaaa agtatcgcag catttttgaa 5221 aacgcagtcg aagggatttt ccaaagtaca ccagatggac attacattac ggcaaatccg 5281 gcgctagcac gtatttatgg atattctttt ccagaagaag tcatagcaaa cttgactgat 5341 attgaacatc aactgtatgt tgatcccaat cgtcgcacag agttcatgcg cttaattgaa 5401 gagcatggta gtgtgtcgga atttgaatct caaatatatc gcaaagacgg cagtattgtc 5461 tggatttcgg aaaaagcata cgccgtgcgt aatcaaagcg gaaaacttct ttactacgaa 5521 ggagtaattt cagatattac caagcgcaag caagtggaag aagcgctaca ggaacaatta 5581 aattttttac aagttttagt taatactatt ccaactcctg tgttttatca gaatcctcaa 5641 ggtctatatg tcggctgcaa taaggctttt gaggaagact tgggtttgag caaaaagcag 5701 attctgggta aatcagtgta tgaaattgca ccaaaagatt tagctgataa ataccatcaa 5761 gcagatagag agctattaga acaaggggga gttcaaactt atgaaagctc tgctatttat 5821 aaggatggca caaagcatga tgtcatattt tataaagcaa ctttttgcaa agcagatgga 5881 acgcttggcg gtttggtggg agtgattttg gacatcagcc aacgcaagcg cactgaggaa 5941 gcattacgag tttttttcca tgcagtttcc catgatttac gcaacccagt gctgggaact 6001 ttgatggttt taaaaaactt gctttctcat ccagaagaaa aaatattgct ttctcggtcg 6061 attttagaga ggatggtgca aagtggcgat cgccaattga acttaattaa ctcgttgatg 6121 gaagctcatg ttagcgaagt ccaaggtatt gttttgcaac 
gccaaactct acaattgcac 6181 acggtggtgg aagctgctat tacagattta gagccaatgt tgaaagaaaa tcaagctctc 6241 ctaacaaatt tagtttcagc aaatttaccc tcagtcaatg ctgattctac acaactgtgg 6301 cgagtatatt ctaacttaat tgttaatgct gttaaacata atctccctgg attaagtatt 6361 acccttgatg ccacgattga gggtgataag atgtgttgca ctgtttctga taacggtgtt 6421 ggtatgagtc agcaacagag tgagcatcta tttgatcttt attttcgggg tagtaatgtt 6481 cgcaattctt taagtttagg attggggttg tatctgtgtc ggcaaattat tacagcacac 6541 ggtggggaaa ttggcgtgaa aagtgatttg ggagcaggtg cgactttttg gtttactttg 6601 ccgttagagg ataatctcaa gtaataccaa ttccctattc cctattctct attctctatt 6661 ctctattccc tgttccctgt taagcgtttc ctgttccctt cagatctcct cgctagaaag 6721 cttaggtagt tgctcaatag cttttgacca aactcttttg taatcttcaa ttgcgccgct 6781 ataatctttt agtttagagc gagcgttacc ccgactaata taagctttag catcgttagg 6841 attgagctgt aaaaaatgag tgtaatcttc aattgcaccg tggtagtctt ttagccggga 6901 gcgagcatta gctcgattga tgtaagcttc cgcattgttg gaattcaatt ctagacatcg 6961 agtgtaatct ttaattgcgg cactgtaatc ttctaactca caataaacat tgcctcggct 7021 aatgtaagtc ttagcatgat taggatttat ccacaagcac tgagtataat catcgattgc 7081 accagtgtag tctttgagta taaaacgagc attaccccgg ctgatatatg cttgtggatc 7141 gtgaggattc atcttcaagg attgtgtata atcctcaata gcggcactat aatcattcaa 7201 ctcacaacga gcatgacccc gctgaatata taatttggct tgatcagaac ttagtagcaa 7261 agcttgagtg ctttcttcag gctgatattg ggtgttactt aaggtaacgt aaacttcgcc 7321 atcgtcagaa tgaaatggca gaaactggat gctatcgtca aatgtgttag tgttattttc 7381 caacacagag cgagtgtgag tctgggtgat attgatatca gtgtcgatat ctatatcact 7441 ggaatggagt tccaaataat gggtgtgttc tacaactgct tctttgttct caacttcaga 7501 gtaagtaaac cccatatcac cgggaatcga attgtcagaa tgagggttca tggcactact 7561 gtaatctgca attgctcctt tgtaatctcc tagttcaaag cggagcaacc ctcgattttg 7621 gcgggcttca acgtcgttag gattaaatcg caggtactta tcaaaatcta tgattgcacc 7681 ttcgtaatct cctaactttt gacgggcaag ccctcggtta taataagcct taaagccgtt 7741 gggattgagc ctcaagaact cgttaaagtc tacaatggct ccttcgtagt ctcctttgcg 7801 agccttttct tctcctcggc ggtagaagac ttcaataaaa tcgtgttgct 
gacgaagctg 7861 ttctgctgtt tgctgttgtt gaataatagt acgtgataaa gttatcgcat gacgccgcaa 7921 ctcaaattgt gccatcacct gactactcaa gatccgtagt gcttccacct gttcgagagt 7981 caaatcccga ggcataatat caatcacgca tagcgatcca attccaaatc cctcaggtgt 8041 aattaagggt gcagcagcgt aaaaacgaat gttgggatca ccgagtacca aagcgtttgt 8101 tgcaaaccta gagtctgcca acgcatttct gacgattaat ggtttttgct gcaaaatggt 8161 gtaagcacag aatgcaagat cccgacttgt ttcaggtaca gttaaaccca ccttcgattt 8221 aaaccactgg cgatttccat caataaggct aataagcgca attggggtat gacaaatttg 8281 tgcagctagg aaggttaaat cgtcgaagac tttttccgaa ggagtgtcaa gaatttgata 8341 ctgacgcaag gcattgagtc tatccaattc attgttaggc agtgggggtt gtaaattcat 8401 cttttttgat agaccttcaa aggttggttt ttgtagagta aacaatattg aaatgtatct 8461 tctcaacaac aggtcgtttt ctctaaattg tgtggtgctt cctaacgact tcaagacact 8521 caatgcactt gtgtaaggcg agtgaaaaag ccctttgaac tggcttatta cttgcggcaa 8581 aagcggttac agtcgtatcc tgttaaaaac cagatccaaa tgatgactca tttcaaattt 8641 agtatctaac attaaatgtg tttttatata caataacaag acaagtattt tatttgacat 8701 ctacttttag atagattttc tagtggaaat cacctttttg tttcatgctc aatgagaaca 8761 gaaggctagg tatctaaaca ataacgacac ttacaccctt attaacttaa cattctcccc 8821 cgctacgccg attcatccaa aggattttag tatttagcta gtaaacctat cgggcaattt 8881 tggtatctac cgtattaagt ataaatgaga tgactatgaa ggctgatgaa attcttggat 8941 gaatgtttta aaaaaattat agttatcaat gatagtactt atggtagagc atccaccaga 9001 acgcaatacg tctgaagaag aacgttttca agatgactat tattcctact ttgagtcacc 9061 ggaaaatgcc cctcgatata aaccggcagt ttcttttctg gatgacgctt ttgagcaaca 9121 atcattttca ttattggatc taggatgtgg taatggtgct ctgtgtaaat atttacctga 9181 gcgctgtgac tacttcggag tcgatcacag tgaattggct attgaatact gtttaaaggt 9241 ctaccccaac aggaacttcg ttgctaaaga tttatcagta gcactacctc aacttgttgc 9301 ggagaataaa agatttgatg cagtggtttt atgcggactc ctatttcata ccactgacaa 9361 ggaaacccta gagaaaaagg acgatcaaga actcattcaa ttttgtttgg ataaattact 9421 aactgagaaa ggatatctag tcatcattgt accttttgcc tatggcgatc atccatctca 9481 taatctgtat gcccgagcag aatggctaca aaattcagta gaagaaatgc tggaaactgc 
9541 caaggcaaaa attgtttacg ggaatatttc tctgctaatt ggcttggata aaaagattcg 9601 acagcagaaa acgacacctg actggtttgt tccagatagc aatgctgatt attccaataa 9661 gtacgccggt acatatatgg caacttggac atttcttgca ataccctcgg aacgttaaat 9721 cctaatcttt agtttttgtt aaaaaacttc ttgaagcagg acagaaacac ttcaagaagt 9781 ttttgtttaa ttgttagctc gaaaattagt tgatgaattg ttaatcgctt cttttacaaa 9841 cttgtcatga cttgagctaa ctcattagat aaatcaacct tagattcttg agacttttga 9901 aataaaacac cagccaaaaa gtaataatcg ccagaataga atgccatttt gttttttcgt 9961 aatttctcta tgtgtttttt cacttttggc tcagagctat agtagccaaa ctgcaaaggt 10021 aaaaccacaa aattctttac aaaataaccg ttggctcttg cgtgatgaat tgtactttct 10081 gggttagaaa agctagagac taaaagtaat acattgtcat aacctaatga taaaagttcg 10141 ttggtaactg ttgctccatc ttctccacca aataacaaag gcatataaat atcattatct 10201 ggtgctggga gataaggtgg attagctact agatactctg cttctggttg agaagaatca 10261 aacaaacagg aattagagat gatgtatttc tgaccaagac tatattcatc tatcgttgaa 10321 ttagcagctt tccaagcaga aggatttaat tcaaatccat gtatcacacc atcaaacctg 10381 tttctcaaga gtgaattaat aacaggactg ccatctcctg agccaaattc aacaacacat 10441 tcagattttt tacaatttct aaaaaccaag ttttccaagc agttggaata gaaattagat 10501 tcttctgggc aagaaaagat gtctttcatt tgactctcac tatctgtaga ttacaattgt 10561 tatgatggac agtgctcacc ctaggaggta ttggtaaaaa ggcaagcaca aactgttcct 10621 aagccaagca attgatgact aaagaaatag tttgctctta ggtaaagacc aatgagtttc 10681 taacttttaa gatgtcttac cgaaaagctg ggtctaaagc cccatccttc gtttgacggc 10741 ttttccagga ttgacagcgt atctaaaaat acgggataat aatattgccc atagtctaaa 10801 agactctcgt gaggcagagt tgggtgctcc ccatctacaa tgctcagctt cgtgctcttg 10861 ttgtaggaaa gagtaaaaag caattgaata aatcttttca aacgcggtcg ggcagtccgt 10921 gtacgcttgt ggacgtatcc aaacgggggt attttgaatg cttgcatttg aatacctagg 10981 gaggacggat gttcgcgcag cgtgcccgaa gggcttagca agaatcctcg caccgcgata 11041 gtcgaggagt gtcaataaga tgtcttacat agatgaggcg acttgttcgg cttcacgtat 11101 tgatcggaca acagctttgc ttgcgcgatc gcccatcagc ttctcctggt cgtaccccag 11161 caccaattcc cacgcatcat tggggtacat ttgagctaaa ggtaaagcca 
catcatccaa 11221 catccagcgt ccgtggcgtt cgtcctccct gatgtgcact tcccaataac tcatagccgc 11281 gttagagagt cccaagcgct gcgctgctgc aacatagctt ctgaaagtcg caggtaccga 11341 gagctcgaaa tatgtcaacc caccgttgta acgtaggaaa tagcgcttgc gttcacttat 11401 cagatagcta ttattagaac aagctagaag ttcccaaggg attaaatcga agtatccctc 11461 tggttgagtg ttcatcccaa actcgtttag catttgagca aaaaacgtag aatgcttgcg 11521 aggtaagcga ccgttgccgt actcttcctg aaagactcgc atcagcgtac actgtactgc 11581 attagcagcg ccacccaaaa tgcggggaag gtgactgctt tccactaagc cttcaaatga 11641 cccaatagcc agcaagtgac ggtagccagc ctcgctcatt tcctcctgga tgtagcgact 11701 gtcttctgat aaaggcggtt ctaaatcgta agcagcacgc tcaattaggg cttgcttaac 11761 atctaattgt tgcaatgcag ccacatcgat ttgtccaagt tcccactctt gccaagctgt 11821 ctcaatttgg tcacgcacct tatacaagta aggagaacgc tcgtttacgt agtggcgtag 11881 atcatcgtac caaaacagat ttaatcgatt aatgcgataa agaatacgct ggagaaaacg 11941 atgagctgcg tcatcactag caccattttg ataagcagca gaaattgcta cagcgagtgt 12001 gtgttcaaac tcattcttct gtgagggatc tgcatctagt tttttgtcta aattttctgt 12061 tgctagcagt tctataaatt gctgctcagc aagctcgtaa tctatactta ttttttcttc 12121 tatttgtgta ctctttgctt caacctcggc gagaggagac atcactaaat tgctttgcat 12181 ggaacatttg gttagtaaag gctggcatga aattatttca gcttgtacta tctagatttc 12241 cattaagtca atttaagtac ctctcccgca agtgataaag aaaaattaag ttcttgagcc 12301 aaaatttagt atacaggcat acaaagatga ggagatgggg agattgaaaa ttttgaattt 12361 tgaattgatt tgtctgcttg tcacgccagt ccgcaagggc ggaggacacc ggagcaaggg 12421 acaactcttg gcacaaaggc tgactccgct tttcctgaac gcggattggt aggtactcaa 12481 ttggcaggta tttgagtcat tggtttactt ggtggaattt gttcaatagt aatcagtgct 12541 tccatgactg acttagcaac gggtgcagct acggtactac cataagcatt tttacctcta 12601 gggtcatcaa tcaatgccaa cacaacatag cggggagatt ctacaggtaa aatacccaca 12661 aagctggtga ttttagctcc gatgatgtag ccaccagcgc gactagcttt ttcggctgtg 12721 ccagttttgc ctgcgatgcg atacccagga atttgtgatg cttttcctga tccatcatcg 12781 acgacagttt ccatcatctc tacaactttt cgggcagtca catatgagaa gatttgacgc 12841 ggtgcagctc ggttgggtga gtagtgcatt 
tgccctttgg tatctatcaa tccctggact 12901 acgtgaggtg tgaccagttt acctccattc gctaaagctc catgcatttg caccagctgt 12961 aacggtgtca acgaaaaacc ttgaccaaaa gaggtcgttg ctggttcaac tggtgtagca 13021 agaaattctt gctggctttt gagctgactg ttaacagtga agggtaaatc cgtatcaacg 13081 ctttgcccta aacccaaacg ttccaaccaa ccgtagtaga ctgagggctt taatttttgg 13141 ataattttca ccatgcctat attactggag tgttggagaa tttgagctat gtttatccgc 13201 ccgtaacgtt tgttttcggc attcctgatt gttctgtcag cgacttgaat ctgaccagga 13261 tcattaaaca catcttcagg tctaataatg ccagcttcca taccaatagc cacattcaaa 13321 ggtttgaaag ttgaacctgg ttcatatagg tctgctaccg tccagttttt aaataaagag 13381 atatcggctt tagagtattc gtttggatta taggtgggct gagaaaccaa agccagtagc 13441 gaaccatccc acgcatccat aacaataact gcccctcgct ttgctcggta cttttgcatt 13501 tgtgctttaa gagcaaagcg ggcaactctt tgcagacggc tgtctatagt gagttgcagt 13561 cgcaaatcat cagaattaaa taaaccgtct ggagcataat ctggcatgag ggatccgtta 13621 cctgttctgc tgagccgtac catgtgcata gaacgttcca gcaacttctc ttggctgtat 13681 tccacaccag cctgaccacg acggtcaagg ttaacataac ctaccacttc tgcaactaaa 13741 tcctgctgcg ggtataatcg ggagtatttt tgaattgact ctagaccatt cagatgtagt 13801 gaagcaatac gatctgcaat ttcttcgctt aaagcagtag caagtgtaat accgcttttt 13861 ttactgtcaa actttttttc taactcagca gcatcttgat ccattattgg cgaaagcagt 13921 tctgccatct cgtggttaga cttttcaaaa agtttgggat gagcatacaa agtatatacg 13981 ggacggtcaa ttgccagcaa atctttattg cgatccacca ctgggcgacg gggtataaaa 14041 ggtcgcaaat tcaccatttg ctggtttcgc gccttttctg ttagctttga tccccgtatg 14101 atttgtaggt tgtacaaatt aacacccaag ccgactccag ccgcaatgag aaagctccaa 14161 acaatgaaaa gtctaacttt catgttgggt ggttgctctt gcgtggaatc aggaagtttg 14221 ctattcgact ctcgtctcga acccttctgc tgtcgcttct tgaatgatgg tttcggaaag 14281 ttaaagttct ttaattttgt tctgcttgat gatttttgca tgaaaatagt cctttgtcgt 14341 ttgtcttttg tcttttgtct aagtgttaat gactaatgac taatgacaaa taactaatac 14401 cccactggat tcggggtttg ctgttgtctt tctgaatttg gcgttgtggg taatgagtta 14461 ggctcaacag gtgctggagt ccaaaaaatc attttatcgg gagatggcga tacgaggttg 14521 gcaggtggac 
gttctgcttc ttgtgccatt ttgtttttta gtacctcgtt cgtcgtcatc 14581 aattggcgct catggcgttg caagttttgc attttacggt atgcttgact ccagagttgt 14641 tgagaataca cagtccaacc ataaaccact aatgttacag cgaccagtgc aaacgccata 14701 attgaggaat gacgatgtaa agtatacagg cgcagcagcc aggaaggtac ggactcagga 14761 gatggcatta caggcacagt tgacgctttt tgtttgcctg atgacactga tgttgttgtc 14821 tgtttcttgg cgctagaagc ttcccttgtg agttcctgtg gagagagtga cgattttttt 14881 gacgaagccc tttgtttttt ggtcgatact ggtgtaagca cagcaggcag atcgactttt 14941 gaagttttta actgctttgt ctctggggct ttttctggtt tggtggacgc ttttttttgt 15001 tgagcagagc gacgttttct gcgacttttc acttctgtcg ggagcctact gctcgttgca 15061 taaacatctg atttacgtgc aacagccata attgtttaga atttttggat tatggtactt 15121 tactctatga acattgtctg tttttactct tgttattatt gccgatcgga caataaccac 15181 aaataacacc actaactcag tcaatatatt gtgataggta ggcgcactaa tctttgtatc 15241 cctaagaaac atggtttctt agagaaattt agatttcttg gtaattttgg aattaaccct 15301 gaatatcttg agtgttcagg tatcgtattt aaaagtatga caacctattg tcactctaag 15361 caagagtatg cgaataaaat cgtaattttt ggtatgatat cgagctccgt gacttaattt 15421 ggtaatgcta ccctaaacaa ttaagttacc ctacaataaa gtcaggaatt ttgctgaaat 15481 ctgtgatatc aagtcactta aaaaaaagca ataaataggg tatcttaatc aacagactaa 15541 aatagttctt gccgcaattg gtgaaagagg agttatgaaa gttaaaatct ttagttttgc 15601 attaattcta ggtctggcga cagttctcgg agcttgtgac gggggcggtg gagcacccga 15661 aggcggtgct ggaggtgctg gaggtgctgg aggtgctgct actacgcctg ctgaaccagc 15721 agacaccggt gctacccctc ccgccgctgg tggtacagcg actacccctc ccgcagctgg 15781 tggcactaca gctacccctc ccacttctgc taccactcct gctacgccaa ccaagagtcc 15841 taaataacag tctcaaaaca gctaaaacac tttgttgtag tgcacctcac tacattgctt 15901 gagttgatgc tctcaaattt atagatgtgg aaggctgtgt tttgagagtg cattgagtta 15961 gtcacgctca ctgagcctta caggtatgcc tgcagcacgg ctttgccgta tgcgcaaagc 16021 gcacgccctc tgggctaaag cggagcgtct gcaaaggtga tacgcagtcg ctatctcctc 16081 caaagacgct acgcgaacaa gggggagaac cacgccactc tccccaagcc ggggaacccg 16141 tcccttgggg gtggctcccc aagggcgcac tgctcacaat ttgtattgac ctacctcatc 
16201 tgaaaaagtg cccaggtatt agcacctgaa cttgtcagac tttgacaaca acttccccaa 16261 caaacccggt ttctctatga aaccgggttt tttggcaggt aagcccctgg aaagataaaa 16321 atatagaaag taggggatta agaacagttc aaaattcaca gcataaaaat atgcctgttg 16381 taccctctgc gcttacaaaa agaaaaaacc ctgttttggc aagcatcctg cctgttgact 16441 gtacacaact tttaggcgtg acaccgcaag cttttgactt ggagtagaga gaaaatgaga 16501 agagtgataa agtttgtgct tttcattgtc tgtactagca tattggtaga gttaaacact 16561 tcacaggggg cattggtcat ggcattaccg ccagaacaag atttacctga agaaatattg 16621 cggacggaga ttatcacaac agcgcgatcg cctgttgatg gtaaacccct gactgctggg 16681 gaatacgccc aattgcaagt ccagttacaa aaagctcccc caccaaaact tgctccgcga 16741 attcaggatc aagtttttct gatacgactg cgtaagacat tgcttcagct tttcccattt 16801 ttagatattt gaagggttta ccctgagcgg agccgaaggg tatgggaatc aaatgatgaa 16861 gtgcgcatca tcccacactc ggtactctac aaccgatgaa gcacattcct ccaaaggttc 16921 tctccaaagg tactctttcc aaggtaggat cactatcaca acaaaaattg caaataaatg 16981 taaagaatat atatagatgt tttacaaagc gaatttactt actcacttta ttcaacgaac 17041 gtaggtagct ccatgtctat gaccacgata gccccagaac aggttaacaa catcgtttgg 17101 aatcaacatc aagatccgtt tgaagtacta ggtgcccacc ccatagaaca gaatggtaaa 17161 agcatctggg ctgtgcgagc ctacctacca aatgcgaatc aggcgtgggt tatccttcct 17221 gaagaacgca aggaatattc aatggaagcg gtgcatcatc cccacttttt tgaatgcacc 17281 atagaaacaa cagaactggc gaactaccag ttacggataa aagaagggga acacgagcga 17341 gtcatttatg atccttacgc tttccgttct ccccgtctga cggaatttga tttgcactta 17401 tttagcgaag gtaaccatca tcggatttac gaaaaactgg gagcgcatcc cacagaaatc 17461 aacggtgtaa aaggcgttta ctttgctgtt tgggcaccca atgcccgtaa tgtgtcagtt 17521 ctaggagatt tcaatctttg ggacggtcgc aaacaccaga tgcgtaaagg tcagacaggg 17581 atttgggaat tgtttattcc cgcgatcagc gtaggagagg gttacaaata cgaaattaaa 17641 aacattgaag gacacattta cgaaaaatct gatccttacg gttttcaaca agaaccccgc 17701 cccaaaactg catccattgt caccgattta gatgcttata cctggagtga tgaagagtgg 17761 ctggaaaaac ggcgtcactc cgatcccctc accgaacccc tttcagttta cgaactacat 17821 ttaggctctt ggttacatgg ctctagtgca gaaccaccac 
tactgcctaa cggtgaaacc 17881 gaacctgtcg ttactgtttc agaactcaat ccaggcgcac gcttcctcac atatcgggaa 17941 ctagcacaac ggcttattcc ctacgtcaaa gatttaggat acacccacat tgaagtgctg 18001 cctgttgctg agcatccctt tgatggttct tggggttatc aagtcactgg atattacgct 18061 cccacctccc gttttggtag accagaagac tttatgtact tcgttgacca atgccacaaa 18121 aatggtattg gggtgattgt agattgggtt cccggtcact tccccaaaga tggtcatggt 18181 ttagcattct ttgatggtac tcacctttac gaacacgctg acccccgcaa aggcgaacat 18241 aaggaatggg gtactctcgt gttcaactac tcccgtaacg aagtccgaaa ttttttggta 18301 gcaaatgccc tcttttggtt tgacaaattc catattgatg gtattcgcgt tgatgctgtc 18361 gcttcaatgc tttatctcga ctattgccgc gaaaatgggg aatggcttcc caaccaatac 18421 ggcggcagag aaaacctgga agcagcagat ttcctgcgtc aggtaaatca taccattttc 18481 agctatttcc ccggggttgt ttcaattgcc gaagaatcga ctgcatggcc aatggtatct 18541 tggcctacct acaccggagg cttgggcttt aacttaaagt ggaatatggg ttggatgcac 18601 gatatgctgg actacttcag catggatccc tggttccgtc agttccacca aaacaacatc 18661 acttttagta tgtggtatca ccacagcgag aactacatgc tggcactgtc ccacgatgaa 18721 gtggtgcatg gtaagagcaa tattatcggc aaaatgcctg gggatagatg gcagaagttc 18781 gctaatgtgc gttgtttgtt tgcctatatg ttcactcacc caggtaagaa aaccatgttt 18841 atgggcatgg agtttgggca gtggagtgag tggaatgtgt ggagtgactt ggaatggcat 18901 ctattccagt atgagccgca ccaacagcta aaagagtttt tcaaacagct gaaccatctt 18961 taccgatccg aaccagcttt gtacacccaa gattttgcac gggaagggtt tgagtggatt 19021 gactgtagcg ataaccgcca cagcgtggtt tcttttatcc gtcacgacaa ggattctgat 19081 gatttcgttg ttgtggtttg caactttaca ccccaacccc attcccacta tcgtatcggt 19141 gttccagaac taggatttta tactgagttg ttcaacagcg atgctcgtca gtatggcggt 19201 agcaatatgg gcaacttagg cggtaagtgg acggacaatt ggtcattgca caatcatcca 19261 tactcgctgg atttgtgtct accacccttg ggagtattaa tcctcaagtt ggataagcag 19321 aagacagcac aggtgatgga atcttaagta acgagggtgg gctgtttgcc cacccttgac 19381 ttgttaaatc tattacgtat tatattttca cactttatac gtagagattt aaggaaatct 19441 tacaatatga atcctggtga tgccacataa aaaacgaata agtgaatagt acttattcat 19501 tgttttaatt gtaacatcac 
ctatgactta ccaaacacct aaagaaaagc ttgaagctct 19561 cctggcacaa atagaaaacg atctagataa agatagacaa aaagcgtgtc agaatctatc 19621 aaatgtagaa aaaagttata ttaaccagaa attacttgat agttttacgg atctctggca 19681 gagatacata atattaaaga gctattgcag tcgaaacaga caatactaaa tagttatcta 19741 tgaatcataa aagttgaact gcgaaagggg ttcaatcaaa aatctatcaa agtaaccttg 19801 tgcttctaca actgcttcct cagaaagacc attcttggta gaaatgatct gatttcttat 19861 aaacttaaag acttctaagt aagctacagg aggcaagaca aagctaatta ccccgttagg 19921 cagtggagtg ggtgcatcac ctattgccat gtaatgtcca atccacttca ggtagagttc 19981 aatatctttg ctgaattgtt caattcgctg tggtgctagc actgaagtat tcagtttatt 20041 ggaaataatc tttctttgac taatgagatt ttggatcttt ataactaact tactccgatt 20101 cttggataac catgtagaag cttcttggca agctttgtaa gaattttcta tatcttctag 20161 ggcaaaatca tacttgaata cttgctgtaa tttttcaata agaacggatt taacaggttc 20221 ctctgttaag tctaagcttt ctatctcttc tcttataata tccttaaact ttttaacaaa 20281 ggtagcaata tctttgtatt tttgctcaaa gtcacccaca gatccttgaa ggttttcaat 20341 tcgttcttca agaaatttgt tttgaacctg aagtgatgat aaatcaagta cgaaagttgt 20401 gattagtctg attacctcct acgtttgttt gatgacctgt tacagtgttg tgacctgcat 20461 tgacaccaga aacagactct atgtcctgtt cagtgatgga gccgacatta ttatagttag 20521 tgccagaggc ggaggtattt tttatcgctt cctggagttc ggcaaccagt ttctctagtt 20581 cttttctgaa atcttcctct tctgtagctt tcttttcaac cagtgcctca agggctttgg 20641 ggttctcaat agcttgagtg agttcaggtt taccagcaaa tttccattgg aagaaacctc 20701 gcaagcgttc aatactagct ttataggcat cactaccaat gctttgaagc gcaccctcta 20761 aagcttttcc cagaaaaaag ttcacagcaa gttgagcaag aaataccaca ttcatttggt 20821 aagcaccaca gtaaaaatac tatgtatact attacaatac actaagggta atgtcaacag 20881 agtgcttaaa acttattcag gtaaggctct atcaaaaccg ctagcagaca aatacccact 20941 gcgatcgcac tccatccctg caatcaagta acataaaaga gtccgcaatg gctatgcaac 21001 aggtgaaaag ccatgtcctt ggttccgttg ggagcaaaat cagttggtga gtcagtcatg 21061 agggttagca ttgcagcgat tgcccaaggg gcagtggcaa agccaattgc agacaatggg 21121 ggtcagaatg agaatgagcg atcgcagttc cacaagaagt tgtcgtgaga tcacaatatg 21181 
ctgtggttag gaatgcacaa gttgtcatta gatcacaaca agctgttgtt acatcacaat 21241 aacttgtcgt tagatcacaa caagccgttg ttacatcaca ataacttgtc gttagatcac 21301 aacaagctgt tgttacatca caataacttg tcgttgggat acgataattt agaaaatatc 21361 cagtgggcga ttcaaggggg gtgtggtcaa aaccagttgg tggacatagg tgaaagtacc 21421 gcttctccca ctccccgcgt caccgcgtcg gtctcaacca actacttatg accacaccca 21481 ttcaagggga agtgaaatcg ctgtacgtag cacacaaaaa tcggttgtgc cgatttggct 21541 ttgaattagt tgagcaaaac gccgatgaaa acctcaagct ttaactcctg ggtaatttgt 21601 cccaagccaa atcctcaagc caaactacgg ctattttgct tccatcatgc tgggggtgga 21661 gctttaagct ttcgcccctg gttaaattat ttaccatctt ctgtggaagt ttgtctgatt 21721 gaactacctg gacgtggaac gcggatgaag gaggctttgt ttactcagtt tgaaccctta 21781 attcaggcgc tagaaaaagc cctcttgcca agcc // LOCUS NODE_1453_length_21803_cov_8.53292321803 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21803) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21803) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..21803 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 246..650 /locus_tag="DP116_13040" CDS 246..650 /locus_tag="DP116_13040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gamma-glutamylcyclotransferase" /protein_id="PRJNA477356:DP116_13040" /translation="MKIFVYGTLKPGEENYQIYCAGKVVNTTRAVAQGKLFALPMGYP AMTPGDSAVYGYLLSFENQDVLMALDELEDYHPDRDASENLYNRKQIETQDLQGNLLG YAWVYIMTEELAIQLQGIHQPNGWWSGFNKHK" gene complement(707..1345) /locus_tag="DP116_13045" CDS complement(707..1345) /locus_tag="DP116_13045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878043.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_13045" /translation="MTTDSQFDDRSPLQHHKDLPGGNAEHTNESTGAMDMVKRDRFEL LSAYLDGEVTAAERKQVEEWLAEDATVKRLYARLLKLRQGVRAIPVPEPQQSLEETVE QVMARLRRRPRLVWMAGGAAIAACVISALSGLLTGGESRIPQLAQNQPMQQTQSVMST PVVASPLRIAINNPVIPIPKTAEASPENLDDKKVEPQSQNIEQNFTVEYNVN" gene complement(1503..2159) /locus_tag="DP116_13050" CDS complement(1503..2159) /locus_tag="DP116_13050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949583.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase subunit sigma" /protein_id="PRJNA477356:DP116_13050" /translation="MSQSITVSWSTVDTRLPETSVQVDKLSNHDLILRCQAGLRPDRA AFAELLRRYQSQVDRVLYHLAPDWPDRADLAQEVWIRVYRNINRLQEPSKFRGWLSRI ATNLFYDELRKRKRVVSPLSLDAPRSVDDGEMDWEIAGDTPGPEEELTTREFYEQLRE AIADLPEVFRTTIVLREIEGLAYEEIAEITGVSLGTVKSRIARARARLQSQLQNYLDS " gene 3067..3738 /locus_tag="DP116_13055" CDS 3067..3738 /locus_tag="DP116_13055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458874.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="L,D-transpeptidase" /protein_id="PRJNA477356:DP116_13055" /translation="MARVRNDLLTRVVMLLCFGTALLSLAVRWHMTSSTTADTNLKTG GNVILSEKASAAEKPWQAIQGRQNSIASNISNFLSAKGSVQEKLSISLASAKTQLVVA LGDRRVYVYRGDVVIASYPIGVGKKGWETPTGTFQVIHKQLNPMWRHPITGRVFPSGE DSPLGDRWIGFWSDGRNQIGFHGTPDEEVVGSAISHGCLRMRNPDVRLLYHQVSLGTP VEVRQ" gene complement(3772..4305) /locus_tag="DP116_13060" CDS complement(3772..4305) /locus_tag="DP116_13060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997428.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="competence protein ComFB" /protein_id="PRJNA477356:DP116_13060" /translation="MSIEKIVEQALLDGYLTPAMEAEVGRICDSANELSREEYMALDH LMGALLTGEVVAVPRKQFINVMEELVLTEAISRVAEIEATSESSLDVGDIAAYALNRL PPLYATTEEGANYQRQRAKKELQELIAQRVGEAIIRNLDRPNDDRTPVGTKNTGNEVL RQVSTLLQAYAPHFEQK" gene complement(4405..5340) /locus_tag="DP116_13065" CDS complement(4405..5340) /locus_tag="DP116_13065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316276.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="M23 family peptidase" /protein_id="PRJNA477356:DP116_13065" /translation="MTVEYHTSNFNKKVLSLGVPAKKANFSAISVVLGIFAALPIALA SPAKALQVQVSPETPKLGDTISVVVNLDNPANGSNVTVTNGDETYPAYEIAPLQYRAF IPTTPLEKAGTRNVQVSFEGQVQNLSIQVRDRKFPLQRINLPPGKAGVEATEYELKRA AEFKAIRTPEKFWDAAFLAPNKGPITTIYGVRRYYNGKFANDYYHRGVDYAGAAGSPV VAPAAGRVALVGKVSQGFRVHGNVVGIDHGQGVASIFMHLSRINVKEGDIVKPGQVIG AVGSTGAATGPHLHWGLYVNGKSIDPVPWRDQAVK" gene complement(5512..6636) /locus_tag="DP116_13070" CDS complement(5512..6636) /locus_tag="DP116_13070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456242.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="site-2 protease family protein" /protein_id="PRJNA477356:DP116_13070" /translation="MNGTIRVGNLFGIPFYIHPSWFLILFLVTLSYSGGLAAQFPQLG GGLALPLGLLTALLLFSSVVAHELGHSFVAIRQGIDVKSITLFIFGGLASLERESKTP AEAFWVAIAGPLVSLVLFGILTAIGFVATPTGPLAAILGLLASVNLALALFNLIPGLP LDGGNILKALVWKITGNPYRGVVFASRVGQIFGGGAIASGLLPLLLFGSFANLWNLLV GFFLLQNAGNAAQFAKVQEQLTGFTAGDAVTVDSPIVSAHLTLREFADERILNGQNWD RFLVTDESGQLVGAISVQDLRTVSTALWSETQVREVMRPVQQVVTVQSDKPLLEVMQL LEKQNLSALPVICDNGVLVGLLEKASIISLLQRRMQANPA" gene complement(6907..8136) /locus_tag="DP116_13075" CDS complement(6907..8136) /locus_tag="DP116_13075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877971.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13075" /translation="MIIFDPITLSAMHHTINTISVISVNTQNQSTLSLSQKTQLELAN SPRKLSFLTRTTTPTLAVQTIISTPNSDRSKCGRGQSRDKYTQPFACDSIWNMPIGAN AKYVDAYIGSKGVGVDTDWFIITQESDPAVPTYMPGSWSSARCSGFQVQQQAQWHPEA GELLKVPKNLIIEDAKVNFTPNNSSAFLKPDGRTLVSFNVTTRCQEGAPLYGVWFGQQ DIYGDGIDGGHGGSGMSSIGGSIRKGELLNNKPIRHALKMVIWGKWLHYNFSSSTPGR RWPARLADANAAYQYQGSNPALVMGSLLAIPPNVTAQSLGVTSKAGKKIFQAMQDYGA YVVDDAGWDYNYLCIERSAEQEYEAVTGHQIDGDTALQADFAKIIAAVKVVDNNESNN IGGGGTPRQPLAPPIRN" gene 8668..9273 /locus_tag="DP116_13080" /pseudo CDS 8668..9273 /locus_tag="DP116_13080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862989.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="nitroreductase" gene complement(9535..10572) /locus_tag="DP116_13085" CDS complement(9535..10572) /locus_tag="DP116_13085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744536.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S1" /protein_id="PRJNA477356:DP116_13085" /translation="MVNQKLTAPEIGFTHDDFAALLDKYDYHFSPGDIVPGTVFSIEP RGALIDIGAKTAAYIPIQEMSINRVDSPEEVLQSNETREFFILTDENEDGQLTLSIRR IEYMRAWERVRQLQAEDATVRSGVFATNRGGALVRIEGLRGFIPGSHISTRKPKEELV GEELPLKFLEVDEERNRLVLSHRRALVERKMNRLEVGEVVIGTVRGIKPYGAFIDIGG VSGLLHISEISHEHIDTPHSVFNVNDEVKVMIIDLDAERGRISLSTKQLEPEPGDMIK NRDLVYDKAEEMAARYREQLLAKQQGATTVSAPSDEASDITEEIPPAVEPVEVEAFQA VEEEIPAAIEE" gene complement(10884..10982) /locus_tag="DP116_13090" CDS complement(10884..10982) /locus_tag="DP116_13090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316017.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein T" /protein_id="PRJNA477356:DP116_13090" /translation="MESVAYILILTLALGTLFFAIAFREPPRIQKK" gene complement(11154..12683) /gene="psbB" /locus_tag="DP116_13095" CDS complement(11154..12683) /gene="psbB" /locus_tag="DP116_13095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409026.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II chlorophyll-binding protein CP47" /protein_id="PRJNA477356:DP116_13095" /translation="MGLPWYRVHTVVLNDPGRLISVHLMHTALVAGWAGSMALYELAI YDPSDAVLNPMWRQGMFVLPFMARLGVTQSWGGWNVTGSPSTDPGFWSFEGVAAAHIV LSGLLFLAAVWHWVYWDLELFQDPRTGEPALDLPKMFGIHLFLSGLLCFGFGAFHVTG LFGPGIWVSDAYGLTGHVQGVAPEWGPDGFNPFNPGGIAAHHIAAGIVGIIAGLFHLA VRPPERLYKALRMGNIETVLSSSIAAVFFAAFVVAGTMWYGNAATPIELFGPTRYQWD QTYFKQEIDRRVQADLGQGASLSEAWSQIPEKLAFYDYVGNSPAKGGLFRTGPMNKGD GIAQSWQGHAVFTDADGRELTVRRLPNFFETFPVILTDSDGIIRADIPFRRAESKYSF EQTGVTASFYGGDLNGQTFTDPADVKKYARKAQGGEIFEFDRETLNSDGVFRTSPRGW FTFGHAVFALLFFFGHIWHGARTIYRDVFAGVEADLEEQVEWGLFQKVGDKTTRRAEP L" gene complement(13060..13392) /locus_tag="DP116_13100" CDS complement(13060..13392) /locus_tag="DP116_13100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994316.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13100" /translation="MKYFFLSEGWTVGRVWASDGLWQMTAWRRQPDIQRMNICLVEQN EVLWLYQVEDVVLTVEVKPTTQLQVSKSGQAIGQVVLKRLMSAEQVIERLGTASARCQ LQNIQSMV" gene 13972..15621 /locus_tag="DP116_13105" CDS 13972..15621 /locus_tag="DP116_13105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptide ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_13105" /translation="MTRFSLSARQWSRITKFFCLICFCLLLVASCSRSQQATTTSGAV NASTGDGRITIGTTLNPRTLDPADAYELQSLGLVYNMSDRLYTYEPGSTEVKPQLATA LPKVSQDGLTYTIPLRQGVVFHDGTPFDAKAMAFSLERFIKNGGKPSFLLADVVALVK ATSDYELTIQLKKPFAAFSSLLAFSGTCPVSPKAYEIGAEKFKPNIFIGTGPYKLARY GTDSLQFDVFDKYWGEKPANGGINLQILSSPANLYNSFRTGAVDVAYLSLEPDQIRSL EASAKKGDWQAITAQGSVVSYLVLNRNQKPLDKPEVRQAVASMINRPLMNQRVLLGQA SPLYSMVPKTLEVSQPLFKDKYGDGNVDKAKQLLTSAGFSKEKPVKLQVWYPSSSPIR SLAAQLIKAYADKYMDGMLQFEVNVVDGATFYRGIAKSFYPTALLDWYPDFIDADNYI QPFLGCDKGSNAKGCEKGGSQTQGSFYYSETMNNLITEQRQEQNPQARKKIFADIQAQ IATDVPYVPLWQNKDYVFARKGVSGVQLDPTQNLVYKGIKK" gene 15777..16802 /locus_tag="DP116_13110" CDS 15777..16802 /locus_tag="DP116_13110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194291.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_13110" /translation="MSRSKALQYYIFARLLLAPLMLLTVTTVVFLLLRATPGDPVDAI LGGRAPESAKEELRAKLGLNLPIWLQYLNYLGKLLTLDLGTSIASRGQSVWEVIGQYF PATVELAVCSMAIALIVGIGIGVISASRPGSYLDIGGRLFGIITYSLPMFWAGMILQL IFAVQLGWFPLGTRFPTTTPAPHGMTGLYTVDSLLTGNLNQFFTALYYLALPSITLGL LLSGIFERIVRVNLKQTMKADYVEAARARGIPERKILFSHALKNALIPVITVMGLTFA SLLGGAILTEVTFSWPGLANKLYDAINARDYPLIQGVLVFFAAIVVVASIVIDIINAY VDPRIKY" gene complement(17122..17757) /locus_tag="DP116_13115" CDS complement(17122..17757) /locus_tag="DP116_13115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874845.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="restriction endonuclease subunit R" /protein_id="PRJNA477356:DP116_13115" /translation="MTLLQAKNLSLADIHRLFDFQRQYDSSFTPILSLEPLTESEQQD LIQIRNDFDNYLIEGRVSEGLVKVLTIFPLLRLAGFYRYPIKISLEEKIADIDITDED TKITGRMDILAVNKAQQTTAKTYFWILVIESKNSSIAPSEGLPQLLTYAHDSLKHQKS VWGLITNGQHYQFVYILQGNNPTYHLMPFLNLMEPESAIQLLQVLKAICKL" gene 17860..18429 /locus_tag="DP116_13120" CDS 17860..18429 /locus_tag="DP116_13120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_13120" /translation="MNQTATEGVRWTTTDLDLLPENEGTRYEIIDGELFMTRAPHWKH QRACGRIFRELDSWSELSGLGEASITAGVLFSESDNVIPDVIWASNERLAVLLDEAGH LTGAPELVVEVLSAGVENERRDREAKLKLYASRGVQEYWIADWRLQQVEVYRRENATL KLMVTLLANDELSSPLLPGFGCSVGRFFV" gene complement(18572..19138) /locus_tag="DP116_13125" CDS complement(18572..19138) /locus_tag="DP116_13125" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13125" /translation="MKLSSENFIAPILVAFITAAATVGAAIIGQNNKTPSAVNPSATP SSIISKNSELECNKTVVTKPCIANVTMQINSDEPRQIKNNERVPLKARDTLRLANLRY CIPPEVTLNKVEVKAYLFPKGTENYKNGLLTSSSFPTYTGCHNIGNFEPTWKVESGQH QVTIPIVKYDGGSRIVDKSFYLNLDVGQ" gene 19688..19999 /locus_tag="DP116_13130" /pseudo CDS 19688..19999 /locus_tag="DP116_13130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316022.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 20073..20660 /locus_tag="DP116_13135" CDS 20073..20660 /locus_tag="DP116_13135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012598663.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS607 family transposase" /protein_id="PRJNA477356:DP116_13135" /translation="MYLTPAEAQKRYGYHPKTLTRWADEGKIQYIKSPGGHRRYLIES IEKLVDRVDQRPIILYARVSTTSQKDDLASQIEYLGKNYPNCKCINDFGSGLNFKRKK FISLMEQVSKQEIQSIVVAHKDRLCRFGFDFVEWFCNLNHCDIIVLNNTYKSPHQELM EDFMSIMHCFSSKLDFLRKYEKIIESYSEKSDTAL" gene 20685..21578 /locus_tag="DP116_13140" /pseudo CDS 20685..21578 /locus_tag="DP116_13140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008186388.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" BASE COUNT 6262 a 4538 c 4688 g 6315 t ORIGIN 1 gatgctcgca ttaaattgaa gcgactttat ccctcaattc aaacttgatg acctactaga 61 gtaccatgtg caaatataag aattgtcatc gtctacagat actcaagggg attcttgaaa 121 aattttgttt atatgtatta ttagtagttt ggaatatttc aagaaatata cgtggatatt 181 ttgtgtagtt taataaaatt taaataatta aatactcatt ttctaagtat taaggattct 241 tggtgataaa aatttttgtc tacgggactc ttaaaccagg tgaagaaaat tatcaaatat 301 actgtgctgg taaggtagtc aatactacta gagccgttgc acagggaaaa ttgtttgccc 361 taccgatggg ttatcccgcg atgacgccag gagatagcgc agtctacgga tatttactct 421 catttgagaa ccaagacgtg ttgatggctt tggacgagtt ggaagattac caccctgatc 481 gagacgcatc agaaaacctt tataacagaa agcaaataga aacccaggat ctacaaggaa 541 acttgcttgg ttatgcctgg gtctacataa tgacagaaga gctagctatt caattacaag 601 gtatccacca acccaatggt tggtggagtg gttttaataa acacaaatga tgaattataa 661 atttttttaa cttattttta acttaaaatt catcattgat aactttttaa tttacattgt 721 attcaactgt aaagttctgc tcaatatttt gcgattgagg ctctactttt ttatcatcta 781 
aattttctgg agaggcttct gctgtttttg gaattggaat cactggatta tttatagcaa 841 tcctaagcgg tgaggcgaca acaggagtag acattactga ttgtgtctgc tgcattggtt 901 ggttctgagc cagttgaggt atcctagact caccacctgt cagtaaacca gataaggcgc 961 taatgacgca agctgcaata gcagcgcctc cagccatcca tactagccta gggcgacggc 1021 gcaaacgtgc catcacttgc tcgactgttt cttctagtga ctgttgtggt tccggtactg 1081 gaatagcccg tacaccttgt cgcagcttta atagtcgcgc atacaaacgc ttaactgtcg 1141 catcctctgc aagccattcc tcgacttgct tgcgttcagc agccgtcacc tcaccatcga 1201 ggtaagcact taataactcg aagcgatcgc gcttcaccat atccatagca cccgttgatt 1261 cattggtatg ctcagcattc ccacctggca aatctttgtg atgttgcaag ggagaacggt 1321 catcaaactg agaatcagta gtcattttaa cattattacc aattcctaca cgggtaaaca 1381 cggcggatta aggatgagaa atgaatcaat tcttccccta ttggcagaag cactctgcaa 1441 tctctttaat aatactaacg aattatcata atctgccatt aaccaaagga cggaaactgt 1501 cattacgaat ctaaataatt ttgcaactga gattgcaatc tcgctcttgc tcttgcgatt 1561 ctcgacttaa cggttcctag agaaacacca gtgatttcgg caatttcttc atatgccaat 1621 ccttcgattt ctcgcagaac tattgtcgtg cggaatactt ctggcaaatc ggcgatcgcc 1681 tcacgtagtt gctcataaaa ttctctagtc gtcagttctt cctctggtcc tggagtatct 1741 cctgcaattt cccaatccat ctcgccatcg tctaccgaac ggggagcatc cagtgacaaa 1801 ggactgacaa cccgcttgcg cttacgcaac tcatcgtaaa acaagttggt ggcaatacgg 1861 cttaaccagc ctcggaattt ggatggttct tgtaatcggt taatattccg atacacacga 1921 atccaaacct cttgagccaa atcagctctg tcaggccaat caggagccag atggtataaa 1981 actctatcaa cctgggattg atatctacgc aacaactcag caaacgcagc acgatcaggg 2041 cgtaatcctg cttgacaacg caaaattagg tcgtgatttg agagtttatc aacttgtact 2101 gaggtttctg gaagccttgt atcaactgtt gaccaggata cagtaatcga ttgactcata 2161 gatcgcactg gctttaaatc atccctatct attagacctg ttaatgagtt ggaagttcct 2221 tgattgagtg acggattttt gactggaaca ctcttggtat actaaactgg gcagaactac 2281 atggaacaaa aaacgaccac caacgtcttt tacaaagaca ctcgccttgg tgcacctctt 2341 ttgttgaagg tacaagctat ggggcaatgt tacttatatt tgtgcgatct tgagaactga 2401 ccacctacgc ttgacaaatt tgaaccagaa acaaaccaga acgaaatcaa ctcagtcata 2461 ttcaacaatc 
ccaaactgaa gactgcattt tgtgttcttg agttctgaat tcaatgcaag 2521 tcaattgctt gttttagaaa acttttattg ccgtggcaac gttaccacga tatttggtgc 2581 tgtttcagct tcttgttggt aattatggat atttgtattc gtttgcttgg gtatgtgaac 2641 ctgcgaaaat agatttaaag aaaaagtaat cgtaactgct agcaatagtt tttccaacat 2701 aatttttctc tcacccacgc caataatgat tgctacattt agtgaagagt tccaccagag 2761 caaagttttt tacctgaact tgattcacct gaaagcagta acgcaatgtc cttaagacca 2821 gtttgcctgg gactattgac aaattcatga gggaactgat aagtattgat gaatttttgg 2881 taactcaatc atgaaaagat aaaaaaatat cagtataact attcctagac caagatcatt 2941 aaatgccagt ttcaaagagt tggcacaggg caagggtatg gtgggtgcag taggtcaaag 3001 gttagagtaa ggtagggtaa gctgatagct aaatggaaga actgggtctg ttaaagagaa 3061 tagacgatgg cgagagtaag aaacgactta ttaacacgtg tagtgatgtt gctctgtttt 3121 ggtacagcac tgctatctct ggctgtccgc tggcacatga cgagttctac gactgctgat 3181 acaaacttaa aaacaggtgg aaacgttatt ttatcggaaa aagcctctgc agcagaaaaa 3241 ccgtggcaag cgatacaggg aagacaaaat tcaattgcgt ccaacatctc aaatttccta 3301 tcagccaaag gttcggttca agaaaaatta tctatcagct tggcatctgc aaagactcaa 3361 ttggttgttg ctttgggcga tcgccgcgtt tacgtttatc gtggggatgt cgtaatcgca 3421 agctacccaa tcggtgtggg gaagaaaggc tgggaaaccc ccactggtac cttccaagtt 3481 atacacaagc aactaaatcc gatgtggcgt catccaatta ctggcagagt ctttccgtca 3541 ggtgaggata gtcctttagg agaccgatgg attggttttt ggtcagatgg acgcaatcaa 3601 attggctttc acggcacacc agatgaggag gttgtgggca gtgcaatatc tcacgggtgc 3661 ttgcggatgc gtaatcccga tgtgcgctta ctttaccacc aggtgagttt aggtacacca 3721 gtggaagtac gtcaatagtt gactgttaac agtcaactgt taacaatcac tttatttttg 3781 ctcaaaatgg ggtgcgtatg cttggagtaa ggtactaact tggcgcagaa cctcattacc 3841 agtattttta gttccaactg gcgtcctgtc atcattcggt cgatctaagt ttcgaatgat 3901 cgcctcgcca acccgctggg caatcaattc ctgcagttct tttttagcac gttggcgctg 3961 gtagttggca ccttcttctg tcgtggcata caaaggtggt aggcggttga gggcgtaagc 4021 ggcaatatcc ccaacatcca gagaactttc gctagttgcc tcaatttctg ctacgcggga 4081 tatcgcttct gtaagtacca actcttccat aacgttgata aattgtttgc gtggtaccgc 4141 caccacttcc ccagtcaata 
gcgcccccat gagatgatcc aacgccatat actcttctct 4201 tgagagttca ttagcgctgt cacagattcg cccgacttct gcttccattg cgggtgtaag 4261 ataaccatcc aggagagctt gttccacaat tttttcaata ctcataacgc tcatcatttt 4321 ttagccatcc aagcaagggg ggcacggtac caatcaatcc tatttatttt gcccacatat 4381 gagcaaataa tacgcaaata gttactattt tactgcttga tctctccaag gtacagggtc 4441 aatagatttt ccattgacat aaagtcccca atgtaagtgc ggacctgtgg cagcacctgt 4501 tgaacccact gcgccaatga cttgaccagg tttgacaata tcaccctctt tcacattaat 4561 acgacttaga tgcataaaaa tacttgccac tccttgcccg tggtcaatcc caacaacatt 4621 accatgaact cggaaccctt gagatacctt acctactaag gcaacccgtc cagcagcggg 4681 agcgacaaca ggcgaaccag cagccccggc ataatcaacg ccgcgatggt aataatcatt 4741 tgcaaattta ccattataat agcgacgtac accataaatt gttgtgatcg gccctttatt 4801 cggtgccaaa aaagcagcat cccaaaactt ttctggtgtt cgtattgcct taaactctgc 4861 tgcacgcttg agttcatact cggtcgcttc cactccagct tttcctgggg gcagattaat 4921 gcgctgtagg gggaatttgc gatcgcgtac ttgtatggat aaattctgca cttgaccttc 4981 aaaagaaacc tgaacatttc tggttccagc tttttccaaa ggcgtcgtgg ggataaaagc 5041 ccgatactgt agtggtgcta tttcatatgc tggataagtc tcatcaccat tagtgactgt 5101 gacattgcta ccattagctg gattatctaa attcacgaca actgatatag tatcgccaag 5161 tttaggagtc tctggactga cttgtacttg taacgctttt gcgggggaag ctaaagcaat 5221 gggcaaagca gcgaatattc ctaggacaac actgattgcg ctgaagttag cttttttagc 5281 tggaacgcct aaactcagca ctttcttatt aaagttactg gtgtgatatt caacagtcat 5341 aaagatggta caaaaatttt caatccttct gttgaacaac agactaccgt attgtttcag 5401 gtgtttccca attttgacta atactaaaat gactaaaacg aaaatacgac tttcctcccc 5461 acttctaaaa ggagtgggga ggaaagctga tgttgtttga acaactcatt attatgcagg 5521 attcgcttgc attctccttt gcaggagact gataattgaa gctttttcta gaagtccgac 5581 gagtacgccg ttgtcacaaa ttacgggaag tgcagataag ttctgttttt ctaaaagctg 5641 cattacttcc aacaagggtt tgtcagattg cacagtgacg acctgttgga ctggtcgcat 5701 aacttctcgc acttgagttt ctgaccaaag tgctgtagag acggttcgca aatcttgaac 5761 ggatattgct cctaccaatt gtccactttc atcagtcacc aagaagcgat cccagttttg 5821 accattcaaa atacgctcgt cagcaaactc 
tctcagggtt agatgagcag acacaattgg 5881 actatccacc gtgactgcat cgcctgctgt gaaaccagtc agctgttcct gtactttagc 5941 aaactgagct gcattcccag cattttgcag taagaagaaa ccgactaaca agttccacaa 6001 gttagcaaag ctaccaaata acagtagtgg gagtaaacca gaggctattg caccgccacc 6061 aaagatttgc ccaactcgac tggcaaaaac cacacctcta tatggattac cagtgatttt 6121 ccaaacaaga gctttcagta tatttccacc atctagaggc aagcccggaa taaggttaaa 6181 cagagctaat gccaagttaa cagaagccag cagtccaaga atcgctgcta atggtcctgt 6241 tggagtagcc acaaaaccaa tagctgtcag tatcccaaat agcactaggc taactaaagg 6301 acctgcaatt gcaacccaaa aagcttccgc tggggttttc gactctcttt ctagacttgc 6361 caatccacca aatatgaata gcgtgattga ttttacatct attccttgac gaatagcgac 6421 aaagctatgt cctaattcat gggcgacaac cgaggaaaat aacaatagcg ctgtcagcaa 6481 tcccagtggc aaagctaatc caccacctaa ttgaggaaat tgtgctgcta gtccaccgct 6541 gtagcttaaa gtcactagga acaggattaa aaaccaagac ggatggatat agaagggaat 6601 cccgaagaga ttgccaacgc gaatagtccc attcatacct ttcacctttg atttcaactt 6661 gcagaggttt tgtttcacct ctgttgatgt ttttatcgta acgaaatgtt aagcattttt 6721 aatctttata actgtcgtga tatccgtctg ttaagatgcg gtaatcgtac ctaatagttg 6781 gtgaaactat aggtaaatat tacatattct gagaaaccat ctggtgattt caccttggtg 6841 tcatagagtc agcttgcgaa gagggtatat gtttgccctc ctcgcttgct ctagcaatgc 6901 ttggttctaa tttctaattg gtggtgccaa aggttggcgt ggtgtgccac ctccaccaat 6961 gttgttcgat tcgttgttgt ccactacctt cacggcagca ataattttgg cgaagtcagc 7021 ttgaagcgct gtatctccat ctatctgatg tccggtgact gcctcgtact cttgctcagc 7081 acttcgctca atgcaaaggt aattgtagtc ccaacccgca tcatcaacga cgtaagcacc 7141 atagtcttgc atggcttgaa agattttttt gccagctttg gaggtcactc ctaggctttg 7201 tgctgtcaca ttaggtggaa ttgctaacaa cgaacccatg accagcgccg ggttggaacc 7261 ctggtattgg tatgctgcat tagcatcagc cagacgggct ggccaccggc gaccgggggt 7321 tgaggatgaa aaattgtagt gcagccactt gccccaaatc accatcttga gtgcgtgacg 7381 gataggcttg ttgttcaaca actcaccttt gcgaatgcta ccaccaatgc tcgacatccc 7441 tgagccacca tgtccaccgt caatgccatc gccgtaaatg tcttgctgtc caaaccaaac 7501 gccgtaaagt ggagcaccct cctgacaacg ggtggtcacg 
ttaaaagaca ccaatgttct 7561 gccatccggc ttcaagaatg cggaggagtt atttggagta aagtttacct ttgcatcctc 7621 aatgatcaaa tttttaggaa ccttcaacag ctcacccgct tcaggatgcc actgcgcttg 7681 ctgctgcacc tgaaagccac tgcagcgagc tgaactccat gagccaggca tataggttgg 7741 tacagccgga tcactttcct gggttatgat gaaccaatct gtgtcaactc caacgccctt 7801 tgagccaatg taagcatcaa catattttgc gttcgctcca attggcatat tccaaatgct 7861 gtcgcaggca aacggctgtg tgtacttgtc gcgtgattga ccacgaccgc atttgcttct 7921 atctgagttt ggtgttgaga tgatggtttg gactgcgaga gtaggggtag tcgttcttgt 7981 caagaatgaa agctttcttg ggctatttgc taactctaat tgagtctttt gactcagact 8041 caaggttgat tgattttgag tgttaacact gatgacagaa attgtattga tagtatggtg 8101 catcgcactc agggttatag ggtcgaaaat gatcattttt tcttttgttt aaaattggtt 8161 caatttcaca tatggtcagt ctttgtactc taaatagata gaggaaaaac cctgaaaatt 8221 tgttgaaatt tcgttatacc aattctgtat aaagatgcgc ccaactatat gaataacgta 8281 tctcctgcgg agacgctacg cgttcgccca agccgcgtgc cgcaggcata cgcgtagcgt 8341 gcgccttagc gcataccaca aagaaagaaa taaagaagaa gtcaagaaga agtaaagaaa 8401 atttggcact gcctcacaaa tcaatggtat tagaatagat ttcacgccta gagtgaagcc 8461 aagctttctt gacgcaaatt agatttgctc atacctttgt taaggtcatt taatgaacgc 8521 aaacttgtta actatatgtg agaaaatatt tgaatatatc aatctctata aacaaaaaat 8581 atgttctaaa atttcagcct tcaggtaaag gttctcactt aaatctcgcg gcttttgtcc 8641 tcataaaaca caatactagg cactgttatg gagaaacctg ctgatacaca atttccaatt 8701 gatgacttgc tgaaacgacg ctggagtccc ctggcatttt ccagtcaacc tgttgaaaaa 8761 gaaacgcttt gcagtttgct agaagctact cgttgggcag catcttcata taatgagcag 8821 ccttggtact ttatcgttgc aactcaggac aatctagagg aattcaacca tctgctgagt 8881 tgtttggcag atggaaatta agtgtgggca aaaaatgcct ccgttctgat gctttcagta 8941 gcaaagcttt actttgaaaa aaatggcaac gaaaatcgtc atgcatttca tgatgttggg 9001 gcggcgacga gtcttctagc cattcaagca acttctttgg gtctgtttat ccaccagatg 9061 gcaggatttg atgtgcccgg agcgagggag ttgtacaata tacctgaagg gtatgagcca 9121 gtagcagcga ttgctgttgg atatcctggc gatcccaaag cactaccaga acaataccaa 9181 cagcgccagt tttctccacg ccagcgcaag tctattgaga cgtttgtctt 
cactggaagc 9241 tggggacaaa cttcaccggt ggtcaagtat tagttattaa aaaagaggga caatccctct 9301 tcacccagtt tttttctaag ctttcttttt gacactcaat cgttagattg gttcacgcag 9361 catttcaact tagagatgct tttcacgtta cctacttact gttaggcggc ataccactcc 9421 ctgataagcc tgtcaagcaa cccttcatcc ctgccttaaa ggcacaggga tgaagggctg 9481 gcgtttataa gctttactaa atagccgctg tatccaacca caacggcaaa tgcattactc 9541 ttcaatagct gctggtattt cttcttcaac tgcttgaaat gcttcaactt caactggttc 9601 aactgctggt ggtatttctt cagtaatatc actggcttca tctgatggag cactcacagt 9661 agtagcacct tgctgtttgg ctagcaactg ttcgcgatac ctagcagcca tttcttctgc 9721 cttgtcatat accaaatcgc gatttttaat catgtcacct ggttctggtt ccagttgttt 9781 ggtagacaag gaaatacgac cacgttctgc atccaagtca atgatcatga ctttgacttc 9841 atcattgaca ttaaagacac tatgaggtgt atctatatgc tcgtgggaaa tttcggagat 9901 gtggagtaga ccgctcacac cgccaatatc gataaaggca ccgtagggct tgataccgcg 9961 tacagtacca atcacgacct caccaacttc taggcggttc atcttgcgct caaccagcgc 10021 acgacggtga gataaaacaa ggcggttacg ctcttcgtct acctctaaga acttcaatgg 10081 tagttcttca ccaaccaatt cttcttttgg tttgcgcgtg ctaatatgag aaccaggaat 10141 aaatccacgt aagccttcta tccgaaccaa agctccgcca cgattggtgg caaatacacc 10201 agaacgaaca gttgcatctt ctgcttgcaa ctgtcgtacg cgttcccaag cccgcatgta 10261 ttcaatacgg cgaatggata gcgtcagttg accatcttcg ttttcatcgg taaggatgaa 10321 aaattctctt gtttcgttgg attgtaatac ttcttccggg ctatctaccc ggttgataga 10381 catttcttgt ataggtatat atgctgctgt tttagcacct atgtcaatca gagcgccgcg 10441 cggctctatg ctgaatactg ttcctggtac tatatcacct gggctaaaat gataatcgta 10501 cttgtctagc agagcagcga aatcatcgtg agtaaatcca atttcaggag cggttaattt 10561 ctgattgacc atgctgattg tttcccggtt atttttctca gtaaaggttt tgcctcctag 10621 cgatgtaatg caagagccga ataggcattc cacacctaca tcatttttca tcctagcgca 10681 gaaaagtgga tctgaacaca ttaccttcca gatcgtacat catatcataa attatttact 10741 cttttatatt cgttaataat taattaccaa taatagttgc gcgcactctt ttttgtgcag 10801 aaagggcgca attgttgatg cgccccaaga tttctaccac tttacacaat tagcgtggat 10861 ggtaatgggt aacttatttc tcttcatttc ttttggatgc 
gaggaggttc gcggaaggcg 10921 atcgcaaaga agagagttcc gagtgccaag gtcaaaatta ggatgtacgc aacgctttcc 10981 atagctgtag ttcctgccta ttttcccacc attattagtg taccaagtta caagaaagaa 11041 tggtctaggg aattggggct tagggattgg ggatgaggaa gaattttttc cgttcccagt 11101 cgccagtcca tagtccctct cattagacca ttcttaactt ttcattacaa agtttagaga 11161 ggttccgcac ggcgggttgt tttgtcaccc actttctgga acagacccca ttcaacttgc 11221 tcttcgagat ccgcctcaac accagcaaag acatctcggt aaattgtcct cgcgccgtgc 11281 cagatatgac caaagaagaa caacagcgca aagacagcgt gtccaaaggt aaaccaacct 11341 ctgggactgg tgcggaatac accatcagag tttaaggttt ctctatcaaa ttcaaagatt 11401 tcgccgcctt gagccttacg ggcatatttc ttcacatcag ctgggtctgt gaaggtttga 11461 ccgtttaagt caccaccgta gaagctcgcg gtaacacctg tttgttcaaa gctatacttg 11521 gattctgctc gacggaaagg aatatcagca cggataatac catcactgtc ggtcaaaatc 11581 actgggaagg tttcaaagaa gttggggaga cgacgcacgg tcaactcacg accatcagca 11641 tctgtgaata ccgcgtgacc ttgccaagat tgggcaatac catcaccctt attcattgga 11701 cctgtacgga atagaccgcc ttttgcagga ctattgccga cgtaatcgta gaaggcgagt 11761 ttttccggaa tttgtgacca agcttcagaa aggcttgcac cttgaccaag gtcagcttgc 11821 acgcggcgat caatctcctg cttgaagtaa gtctgatccc actgatagcg ggtaggtcca 11881 aacagttcga ttggtgtggc ggcgtttccg taccacatgg tgccagcaac aacgaacgct 11941 gcaaagaaga cagcagcaat actgctagac agcactgttt cgatgttacc catcctgaga 12001 gctttgtaga gcctctcggg tggtctgact gccaggtgga acaaaccagc aataatacct 12061 acgatacccg ccgcaatgtg gtgagcagcg ataccaccag ggttaaacgg gttaaaacca 12121 tctggtcccc actctggtgc gactccttgg acatggcctg ttaagccgta ggcatcagag 12181 acccatattc caggtccaaa taatcctgtg acgtggaaag caccaaaacc aaagcagagt 12241 agaccagaca agaacaggtg aatgccaaac atctttggca ggtcaagggc gggttcacca 12301 gtgcgggggt cttggaacaa ctccaaatcc cagtaaaccc agtgccaaac agcagctaag 12361 aacagcaaac cggaaagaac gatgtgggca gccgcgaccc cttcaaatga ccagaaaccg 12421 gggtcagttg atggactgcc ggtaacgttc caaccgcccc aagattgggt gacgcccaaa 12481 cgtgccataa agggcagcac gaacatacct tgacgccaca tcgggttgag tactgcgtcg 12541 ctggggtcat aaatggctag 
ttcgtacaaa gccattgaac cggcccagcc tgccaccaag 12601 gctgtatgca tcaaatgtac agaaatcagg cgtcctgggt cattcagaac aactgtatgt 12661 actcggtacc agggtagtcc catcgactac gctcctcctc gacgatttat gtttacaaag 12721 caaacaaagc aatttttcag tttttgttat tgccacagcc ggagttatcc cacagctttg 12781 tcttatggat cagacagtgt ggttgatttg gcaagtgtta agcacttcgg gagttttcat 12841 cccaaagagt cgtgctcgcc aagtgaaaaa tgagaatgtt taaaaaagtg taacttttga 12901 tggagctgtt tgcaagcgtg tcaacataag catatacgta acttgagggt gtctcaaact 12961 acgtaacttt tgaggataaa ggcagcaggg gataagaaaa ttcacgcatt cgctcatttt 13021 catccccttg atctggtgta accttcgtga tccctgaaac taaaccatag attgaatatt 13081 ttgcagctga catctagccg aggctgttcc caagcgttca ataacctgtt cagcactcat 13141 caaccgcttg aggacaactt gacctatggc ttgaccagat ttactgactt gtagttgtgt 13201 tgtcggcttc acttccacag ttaaaacgac atcttcaacc tgataaagcc ataacacttc 13261 gttttgttct actaaacaga tattcatgcg ctgaatgtct ggttggcgtc gccatgcagt 13321 catctgccaa agtccatcag atgcccacac tctaccaacg gtccaaccct cagataggaa 13381 gaagtatttc acacttttta gtctcactat tagactctaa atctacttat tctcgattgg 13441 catttagggt gtcaatcggt ttccctgaag ctaagtattt ttttctgcat tgggatgcac 13501 gtagacaaac cctaatgtaa agcttttaag ctacgagaaa agcttaacaa atgtttgccc 13561 tggtatagtt taaatttaaa acaaatttat gcactattta tatgttgaat caaataaaat 13621 tttttctgtt attcaggaac atgaaattct ataagtaggt cggtaccaat aaagttaatt 13681 agttaaggcg gtcattagga ggcagtgcgt tgcagaggtt cccctgaggg ttcacctgcc 13741 attcattggt gagggtatta gaggatttta cctttcttaa tatagttctg tttattagtc 13801 taaatttttt gattttgaca gtaggactta aataaactta tttaagttga ctcgcaagaa 13861 gtcaaaactt agaagaagaa aaggtcttct cctaaatttt tatttttttt atgattgttt 13921 gccaaaaaaa tagtataatg cgtgaccacg aagcttagaa atattcaagt tatgaccagg 13981 ttttccttgt ccgcaagaca gtggagtcga attactaaat tcttttgttt aatttgtttc 14041 tgtttacttt tggtggctag ttgttcgcga agccaacagg cgacgacaac atctggtgca 14101 gttaatgcct caacaggaga tggtcgcatt actataggta cgacgcttaa tcctcgtact 14161 ctagatccag cagatgctta tgaattgcag tcgctgggat tagtttataa catgagcgat 14221 
cgcctctaca cctacgaacc aggaagcact gaagttaaac ctcagttagc caccgcttta 14281 cccaaagtga gtcaagatgg tttaacatac acaattcctc tacgtcaagg agtcgtcttt 14341 catgatggaa ctccttttga cgccaaagca atggcgttta gtcttgaacg ctttataaaa 14401 aacgggggta aaccttcttt cttactagca gatgtggtgg ctttagtaaa agcaactagt 14461 gattacgagt taacaataca actgaaaaaa ccgtttgctg cgttttcatc attattggcg 14521 ttttcaggca cttgtccagt ttctcccaaa gcttatgaaa ttggtgcaga aaaattcaag 14581 ccaaatatct ttattggtac cggtccatat aaattagcac ggtatggcac agattcctta 14641 caatttgatg tctttgataa atattgggga gaaaaaccag caaatggtgg cattaatttg 14701 caaattctaa gtagtccagc taatttatat aattcctttc gtacgggtgc agttgatgtt 14761 gcttacttat ctttagaacc agaccaaatt cgctctttag aagctagtgc taaaaaaggt 14821 gattggcaag cgattacagc tcaaggtagt gtagttagtt acttggtttt aaatcgcaat 14881 caaaaacctc tagataaacc agaagttcga caagctgtag cttcaatgat taaccgtcca 14941 ttgatgaatc agcgcgtctt gttaggtcaa gctagtccct tatacagcat ggtaccgaaa 15001 acgcttgagg tttcccagcc attatttaaa gataaatatg gcgatggcaa tgtagataaa 15061 gctaaacaat tgttaacttc tgcaggcttc tctaaagaaa aaccagtcaa gctacaagtt 15121 tggtatcctt ccagttcccc aattcgtagt ttagcagcac agctaatcaa agcttatgct 15181 gataaatata tggatggaat gctgcaattt gaagtcaacg ttgtagatgg tgctactttc 15241 tacagaggta tcgctaagag tttttatcca accgccttat tagattggta tccagacttt 15301 atagatgcag ataattacat tcaaccgttt ttgggttgtg ataaagggtc aaatgcgaag 15361 ggatgcgaaa aggggggaag tcaaactcag ggttcatttt actatagcga aactatgaac 15421 aatctgatta cagaacaacg ccaagaacaa aacccccaag cacgtaagaa aatttttgca 15481 gatattcaag cacaaatagc cactgatgtc ccttatgttc ccttatggca aaacaaagac 15541 tatgtatttg ctcgcaaagg agtcagtggt gtacaacttg atccaactca aaatttggtt 15601 tacaaaggaa tcaaaaaata gtcattagtc acccgcaagg aaataaattt ccttgctcat 15661 aacaccaagt ccattgaaat ggactaaatt atttcccagt cttctgaaga gaacttatgc 15721 tattcgcctg caaatttatt cgtgggtaag tcatcaatta ggtacaacat ataattatgt 15781 ctagatccaa agctttacaa tattacattt ttgcccgttt acttttagct ccgttgatgt 15841 tattgactgt cactacggtc gtatttcttc tactacgtgc aacaccagga 
gatccagtcg 15901 atgcaattct tggtggacgt gcgccagaaa gtgcgaagga agaattgcga gcaaaactgg 15961 gtttaaatct tcctatatgg ttacagtatt tgaattactt aggaaaattg ctaaccctgg 16021 acttgggaac ttctattgcg agtcgtggac aatctgtttg ggaagtgatt ggacaatatt 16081 ttcctgcaac tgtggaattg gcggtttgta gtatggcgat cgcactcata gttggtattg 16141 gtattggtgt tatttctgct tctcgtcctg gaagttattt agatattgga gggcgattat 16201 ttgggattat cacatattcc ctacccatgt tttgggctgg gatgattctg caattgattt 16261 ttgcagtcca gttgggttgg tttcctttag gaacgcgctt tccgacaaca acgccagccc 16321 ctcatggtat gactggttta tatacggttg atagtttatt gactggaaac ttaaatcaat 16381 ttttcactgc tttatattat cttgcgcttc ctagtatcac tttaggtctt ttgctgagtg 16441 gtatttttga gcgaattgtg cgagtgaatt taaaacagac gatgaaagca gattatgtgg 16501 aagctgcaag agcaagaggt attcctgaaa gaaagatttt gttttctcac gctttaaaaa 16561 atgcgctgat tcctgtgatt actgttatgg gtttgacgtt tgcttcacta ttaggtgggg 16621 cgattttaac tgaggtgaca ttttcttggc ctggattggc gaataagtta tatgatgcca 16681 ttaatgcgcg ggattatcct ttgatacagg gagtgttggt gttttttgcg gcaattgtgg 16741 ttgttgctag tattgtgatt gatattatca atgcttatgt agatcctagg attaagtatt 16801 agggattggt tatttttcga ttgaattttg atgtatgggt tttcggattg attccggaaa 16861 cttttttctg aaccgcagag gcgcagagga cgcagagaga ggaagaggag gagagtttgt 16921 tttcaagtaa gtaagtcggc gtaaaaattt caaggtatgt cattgcgagt ggaacgaagt 16981 gaaacgtaag cgcaaagcgc acgctaagag cgaacgcgca gcgtgtcccc ttgggactca 17041 gcaatcgcaa aggttttgaa cattttacat tttgttacat agttcggttt atttgtgccg 17101 acttacttat gatgagaatt tttacagctt gcaaatggct ttgagaactt gtagtaattg 17161 aattgcactt tctggttcca ttaagttgag aaatggcatt aaatgataag ttgggttatt 17221 tccctgtaag atatagacaa attgataatg ctgtccattg gttatcaaac cccaaactga 17281 tttttgatgt ttaagactat catgggcgta cgtaagtagt tgcggtaaac cttctgatgg 17341 agcaatgcta ctgtttttag attcaattac caaaatccag aaatatgttt tagctgttgt 17401 ctgttgagct ttattgacag ctaatatatc cattcgccct gtaattttag tgtcttcatc 17461 tgtgatgtca atgtctgcga ttttttcttc cagactgatt ttgataggat agcgataaaa 17521 tccggcgagt cgcagtaatg gaaagatggt 
taatacttta acgagtcctt cggaaactct 17581 tccttctatt aagtaattat caaagtcgtt gcgaatttga atcagatctt gttgttcaga 17641 ctcagttaat ggttcgagag ataagatagg agtgaatgaa ctgtcgtact gtctttggaa 17701 gtcgaaaaga cgatgaatgt ctgctagtga taagtttttg gcttggagaa gtgtcataat 17761 tgtatagttt taaaacttat ttattaattt tagtcctgtt gtatttatac aaaatactgg 17821 tttcagcaat tgatgaaacg accaaaaagg agttccgcta tgaatcaaac agcaactgag 17881 ggtgtgcgct ggactactac agatttagac ctcctaccag aaaacgaagg tacgcgctat 17941 gagataattg atggtgagtt atttatgacg agagcacctc actggaaaca tcaacgtgct 18001 tgtggtcgaa tctttcgaga acttgattct tggtctgagt taagtggttt gggagaagca 18061 agcattactg ctggtgttct attttcggaa tcggataatg tgattccaga tgtgatttgg 18121 gctagcaatg aacgcttagc agttttgtta gatgaagcag gacatttgac tggcgcacca 18181 gagttagttg ttgaggtatt atctgctggt gtcgaaaatg agcgtcgcga tcgcgaagcg 18241 aaactcaagc tttacgcatc acgcggtgtg caggaatatt ggattgctga ttggcgctta 18301 caacaagttg aagtttatcg ccgggaaaat gcgactttaa aattgatggt aacgctgctt 18361 gcaaatgatg aattgagttc acctctgtta cctggttttg gttgttctgt aggaagattt 18421 tttgtgtagg tgttgagtgc gatctgcgct ccgcgcagca gcgaagctat cgccccgaaa 18481 gaaagccgcc taagaggcga tcgccccaaa ccacaaaccc cagtcttaca cagcaaaaag 18541 ctatcctccc tgaactcagc aatcattcag tttactgtcc aacatccaga tttaagtaaa 18601 agctcttatc aacaattctg ctgccaccat catacttgac aataggtatg gtgacttgat 18661 gctgtcctga ttctaccttc caagttggtt caaagttgcc aatattgtga caacctgtat 18721 aagttggaaa actactggaa gtcaaaagac catttttgta attctctgtt ccttttggaa 18781 acagatatgc tttaacttct actttattga gtgtcacttc aggcggtatg cagtaacgta 18841 gatttgctaa tctcaatgta tctctagctt ttagaggaac acgttcgtta tttttaattt 18901 gccttggttc gtcactattg atctgcatag tgacattagc aatacagggt ttagtgacta 18961 cagttttatt gcactctaat tcagaattct tagagatgat actactagga gtagcagaag 19021 ggttgactgc tgaaggagtt ttgttatttt gtccaataat agcagcacct acagtagcag 19081 cggcagttat aaaggcaact aaaattggag caataaaatt ttcgctggaa agtttcataa 19141 atttgtctct tagtcttttg gtgtttggtt gatagcacca tgatgacttt agagacacat 19201 gcattaggta 
gacgaatgta gggagttaca ctgctgcggt ttttgtaggg taatgtagcc 19261 taatgttggt ttgtagggta aagtatgata ttaatggatc gtttttaatt ttactctgtc 19321 aattgaatta cgcctaatgg attcaaaaaa agcattggaa ttcattaatt atcttttcgc 19381 gcaagcaaac aagtcaatat taaatgactt agaaacaact ctctttttgg gtatatggga 19441 aggtaagact tacacagaaa ttggtgctaa ggcttattta agtgagcaat atatgagaga 19501 tgcaggcgca agtttatgta gaaagattac agaagattta gacatcacag tcactaaaag 19561 aaattttaaa aatccaataa aataccgata tcaacagcat tcagcaaaag ttacaccttc 19621 ccaagagcca cttgtaaacc agacaaatca accgaaacag aagacatcaa actcagatat 19681 tattaccaat cccttcgttc cccagcgtgg cagagtagaa gatacgcaat cgttttttga 19741 cagagagcga gaaattgggc gggtgtttga ggcgcttaac agtggaagta gtgttgcttt 19801 gataggagaa gaaggtattg gcaagtcctc attactatgg gcgatttgcc aacaggtggc 19861 gactcgccta cattcaccgc gtcagtcagt ttttctagat ttgaatgaag tcgataatga 19921 agatgagttc tacagtgaat tgtgctacaa ggttggtatt ccggaagata aagggactat 19981 gttaaaccgg aatttgcgcc gaataactag gcgaccgaat accggaattt aactgcatat 20041 aatggaacaa atcggacaaa gattccatta ttatgtactt gacaccagcc gaagcgcaaa 20101 aaagatatgg ttaccacccg aagaccttga ctagatgggc agatgaggga aagattcaat 20161 atatcaaatc accgggtgga cataggcggt atttgattga atctattgaa aagctggttg 20221 atagagttga ccagcgaccc attattttat atgcacgagt ttctactacc tcccagaaag 20281 atgacttggc gtcacaaatt gaatacttgg ggaagaatta cccgaactgc aaatgcataa 20341 atgattttgg gtcagggttg aattttaaac gcaagaaatt tatatcacta atggagcaag 20401 tctccaagca agagattcaa tctatcgtcg tagctcacaa agataggttg tgtcgatttg 20461 gctttgattt tgtggaatgg ttctgcaacc tcaaccattg tgacatcatc gttttgaaca 20521 atacttacaa gtccccacac caagagttaa tggaggactt catgtcgatt atgcactgtt 20581 ttagctctaa gctagatttt ttgagaaagt acgaaaaaat aattgaaagc tattcggaaa 20641 aatcagacac agcgttgtag tattaacaag aggaggaaca aacaatggca aaacgcaaac 20701 caaaaaatct agttcaaaaa ttaacaagtc aaattgtctt gactagcaat gtagatattg 20761 catcgaaaac ggaattacta gaaataatgg gacgtctggc tgttgttcgt tctgaatctt 20821 acaacaagct tggaagcatt cggcactggg gaatggactg gcacaaagcg tatccagaag 
20881 tgcgaacttt ccgctctccg gagtcgctag gcttgccttc aaaattgatg gagtggacag 20941 ttagcgatgt tgcaaaggct gttctcgctc aacaagctgc cactatcgct aatctccgaa 21001 agcatatctg gaaacggtac ggtggagaga ggaacgaaaa agctagaaaa gcttgttatg 21061 agaaacttaa aactactgct tttctagacg attcgttcct ccaccgtatt gttaggaaag 21121 aatttcaccg tggtcatacc ttcgtcaaga accaaattgt ttatcaaccg gatggctatt 21181 cttgcaaaca ggtcaatcga aatgtttacc ttttagaact agctggtttg accagaggta 21241 aacgcatcaa aatccaagtt cgcagcaaca gaaagatcaa tggacagatt cggatcattt 21301 tcaatggtgc aattagccag ttcgagattc atttcttggt taaccttgga gaatacacta 21361 aggattcaga agcaattaca actgaagttg gaattgacaa aggatacact gaagcgttca 21421 ttgattctcg tggtcaagag catggtcatg ggctaggtaa gttattaact caaaagtcaa 21481 accgcatcac tgccaagaat cggagccgtg gcaagttgtt tgctcttgct cggaacctga 21541 aatcgcatga cccagcgaag tctgcaagaa ttttaaagaa caatctcact agaaaaacag 21601 agaataagcg ctataccaaa gaccagtcag ccattgctaa tctgattggt aaagcttcca 21661 aatctttgtt cgaggaagaa tgcttaaaag tttatgctga ggatttgact gaacctatca 21721 aaaataaacg tcagtccaaa gcaatgtccc gcaagctgaa tagctgggtg aaaggctact 21781 tgcgggacag tttacaaaaa tgg // LOCUS NODE_1454_length_21784_cov_4.78204221784 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21784) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21784) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..21784 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 221..814 /locus_tag="DP116_13145" CDS 221..814 /locus_tag="DP116_13145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873787.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MarC family protein" /protein_id="PRJNA477356:DP116_13145" /translation="MDTSILLKTFIAVFVLADAVGNIPILLVLTKGMEPESRSRVIDK AIVVAIAVLLLFAFAGQFILSYLDVSLGSLRVAGGLLLLLIALRMLQGDLDTPVIDQE RDVAITPLALPLLAGPGTLTTVMLLMSESPNPHLSVIVGIVAAMFCTWLILRLGSGID KWIGVEGEVIITQLLGFLLAALAVQIGSTGIKELFFT" gene 1112..1402 /locus_tag="DP116_13150" CDS 1112..1402 /locus_tag="DP116_13150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007355429.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotidyltransferase" /protein_id="PRJNA477356:DP116_13150" /translation="MKRDEVLAIVAAHRQQLQAMGVKSLDLFGSVARDEAGPDSDVDF LVEFDRPVGLFDFSKVRLYLEDVLGCSVDMGTQDALREHLRKPVLKDVIRAL" gene 1392..1745 /locus_tag="DP116_13155" CDS 1392..1745 /locus_tag="DP116_13155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015202648.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13155" /translation="MPSRDWRLRIQDIVQSIAAIRQRTAGMTFEQFQGNETIAKAVLY DFIIIGEAAINVPSDIQSRYPDIPWRIMGDMRNVMAHEYFQVTLRIVWNTIENDLPSL MQQLEEVIEHEGIGE" gene 2026..2235 /locus_tag="DP116_13160" CDS 2026..2235 /locus_tag="DP116_13160" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13160" /translation="METQRTASFLDSNHLLLARDQDLFEVVGQRNLNRTAIATREIKQ GEVIHNILPYSSKTNADFLQLFSVN" gene 2559..3563 /locus_tag="DP116_13165" CDS 2559..3563 /locus_tag="DP116_13165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651328.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="EamA family transporter" /protein_id="PRJNA477356:DP116_13165" /translation="MTISLNNDSQRSPNTLHALGIAVLLVVTLIWGTSYPLLKGAVSS LSPAAIFATRFAVAALPFTPYLRFLNLPLLRDGIFLGLVIFSTLTLQTVGLETTSANR AAFIASFNVILVPLLGQLLGRQVFLKTFLTAGIAIIGVGVMCWESGQIVIGDLLMLGN AFLYSIYILMLESITSRHPILPLTAIQLWVITIVSLFWGASDLVRQHEAIATNFGVLL YLGLVDTAATIVLQAVAQRWVNAYETALMYTLEPIFAAVFSFLLLGEKLGVRGLIGAI LVLVAMVFGQSKSQDAEQYGKIQVNEPIVASLLAADTEPINVSVSLLNANLIESEFDS " gene 3582..4103 /locus_tag="DP116_13170" CDS 3582..4103 /locus_tag="DP116_13170" /inference="COORDINATES: protein motif:HMM:PF00856.26" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SET domain-containing protein-lysine N-methyltransferase" /protein_id="PRJNA477356:DP116_13170" /translation="MIDLDIFVKVTNKGKSVFASRYFRKGETVVVGRRVEILPERTNH SLQMDFDLHIELDEPGRLINHSCSPNTGVRNNKFGAYDFVALVDIPSGSEITWDYETT EFVSIAIPKCSCGSPECRLKTLGFKFLPVEIRKKYGEFIADYLKPFVDEPFCSNELND AMLDLVDKWHPSP" gene complement(4196..5788) /locus_tag="DP116_13175" CDS complement(4196..5788) /locus_tag="DP116_13175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875213.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_13175" /translation="MPYTIPNNSCVGCDNCRPQCPTGAIKFENNEYWIDPSLCNNCEG YHSEPQCVVTCPIHLPIPMQAKRGRCKVDIRDATSPDLFANTRNNPFASAIVIWEACN VLAQRTSLPWEIDEYGRACYRRQVNQGRGAIAFHITKPHNANELATELGVVETLDIRA TCIHLIFAAHATALDQPSEQVFTIDDRQIEKYLGLEKRKDLSKAAKLTLMKNLVQQTC SLITSLNWPQQGRVRGFSVEGTPLWHLVEIQHHFQEDNLGCKYLVGLTFKVRAGIWAQ YFLNKQGCKERTTFYQYGSLPKSLLTTVMSIWQQHEGAARLMLWLLFKTKMGKEQRIT VPTLLRVAYGEEKVTLACRKREERKRLLRTFEHDLEVLNHYGMKPAFDPVTYPPTIQP LWAKLVDIPEDPDDALEFWINDANGKTRLTDAGPRGKWNMLMNARILSFTLPPDWEQQ TSESEKKQRTAKNRIKSKTTAGYLLGEQILQARKNLNLSQRELAKMAGKSQSWIRDLE NGRLKAKSEDQVVLRKVLGIAG" gene 6939..7232 /gene="fdxB" /locus_tag="DP116_13180" CDS 6939..7232 /gene="fdxB" /locus_tag="DP116_13180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin III, nif-specific" /protein_id="PRJNA477356:DP116_13180" /translation="MATLTGLTFGGKTWTPKFVQEIDQEKCIGCGRCFKICGHNVLLL KAMNEEGEFVEDEDDDEIEKKVMTVANQENCVGCEACARICSKNCYTHSALDN" gene 7849..8862 /locus_tag="DP116_13185" CDS 7849..8862 /locus_tag="DP116_13185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407145.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome C oxidase subunit II" /protein_id="PRJNA477356:DP116_13185" /translation="MQQIPVSLWTLVAGIVVTILSIWVGQNHSLLPIQASQQAPLVDG FFNVMVSIATALFLVVEGTIVIFLVKYRHRPGDDTDGVHVEGNLPLEVFWTAIPSIIV LCLGIYSVDVFNRMGGFEVGGAHNMAHSHSPAHVAQMPGSAIAATLSDASQNESEMAA PAGTPIIGIGATPTEIGKPADLVVNVTGMQFAWLFDYPESGVNAGELHVPVGADVQLN ISATDVIHSFWVPQFRLKQDAIPGIPTQLRFVATKPGAYPVVCTELCGGYHGSMRSQV VVHTPEEFESWLSENRIAQKQDMQKAVAVNPADLSTSEFLTPYAQQMGVGSTTLESLV MSH" gene 8914..10608 /gene="ctaD" /locus_tag="DP116_13190" CDS 8914..10608 /gene="ctaD" /locus_tag="DP116_13190" /EC_number="1.9.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c oxidase subunit I" /protein_id="PRJNA477356:DP116_13190" /translation="MTQVELPRNIPPEDNQSGTQKVVHPHHPKAWKWYDYFTFNTDHK VIGIQYLVTAFLFYLIGGLMAVVMRAELATPDADLIDPNLYNAFMTNHGTIMIFLWIV PSAIGGFGNYLVPLMIGARDMAFPKLNAIAFWLNPPAGLLLAASFIFGGSQSGWTAYP PLSLVTANTAQSLWILAIVLVGTSSILGSLNFVITIWKMKVPSMKWDQLPLFCWAIMA TSVLALLSTPVLAAGLVLLLFDLNFGTSFFKPDAGGNVVIYQHLFWFYSHPAVYLMIL PIFGIMSEVIPVHARKSIFGYKAIAYSSLAICVVGLFVWVHHMFTSGTPGWMRMFFTI STLIVAVPTGVKIFGWVATLWGGKIRFTSAMLFALGLLSMFVMGGLSGVTMGTAPFDV HVHDTYYVVAHFHYVLFGGSVFGIYAGIYHWFPKMTGRKLNESLGRIHFILTFIGTNL TFLPMHELGLQGMPRRVAMYDPQFVSLNQICTIGAYILAASVIPFTINILWSWVSGAK AGDNPWNALTLEWTTSSPPLIENWEELPVVTHGPYDYGLNNHSTEIQSSVATEVGA" gene 10795..11388 /locus_tag="DP116_13195" CDS 10795..11388 /locus_tag="DP116_13195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313223.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="heme-copper oxidase subunit III" /protein_id="PRJNA477356:DP116_13195" /translation="MTIATSTSPAPRVEHHPDLRVLGLLVFLVSESLMFGGFFATYLF FRGSTEVWPPEGTEVELLLPAINTIILVSSSFVIHLGDTAIKKNDVQGMQKWYKITAI MGAIFLLGQVIEYVSLGYGMTTNVFANCFYLMTGFHGLHVLIGVLLILGVVWRSRRPG HYSATKHTGIAMAEIYWHFVDIIWIVLFTLLYILTRF" gene 11450..11935 /locus_tag="DP116_13200" CDS 11450..11935 /locus_tag="DP116_13200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13200" /translation="MKVDAERFSWFEVPDDIKQLLILAAEHWQNTEESEKYINQALAK TAESTEVLVAAYRYFFYKNNYHMALQTAIKVIEKIKFTEQLPDNWQQTKQILISRKEE ANIRLYLNAYAASGLVLAKLGDIEKAKKISAQVKEVDDKNEFGASIVYNILTRPVEED E" gene 11999..12610 /locus_tag="DP116_13205" CDS 11999..12610 /locus_tag="DP116_13205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875220.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cupin" /protein_id="PRJNA477356:DP116_13205" /translation="MKGRDWLVTEDGQYQTCQSVRAWDLLRDNYRFYRFLTEVEDALS DTEDETSHLPEIRMLVRRLIVNSYWVQSRYLEPSSKTGISVVLLYDELGFPFTVQTVT LAPGTSSKIHNHGTWGVVAVLKGEERNTFWQRTPDLNFHDKIERTGELTLFPGDIISY TPDAIHRVEAVGTQPTVTFNIYGETRMQERFEFDTVSHIARNF" gene 12695..13156 /locus_tag="DP116_13210" CDS 12695..13156 /locus_tag="DP116_13210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875221.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase-associated protein" /protein_id="PRJNA477356:DP116_13210" /translation="MATVIFYEKPGCQGGTKQKTLLTAAGHEVIAYNLLTQPWTAERL RSFFGDRPVSEWFNRAATRVKSGEIIPENLDAQTALMLMLKEPLLIRRPLIEVGDRRE VGFEVEKIDAWIGLKPKDATLKEITETLMNQDLQTCSHKHEHKHEPGSCKH" gene 13442..13720 /locus_tag="DP116_13215" CDS 13442..13720 /locus_tag="DP116_13215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743648.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13215" /translation="MEDQNQKDKDNFLYPRGRYYGQFKPENLVFNANLQQFAQKIGYI TSLETSGKISPLDAYNQIKALWKQLKRSKKELGIGNEPPSEPESPDVT" gene complement(13783..14235) /locus_tag="DP116_13220" CDS complement(13783..14235) /locus_tag="DP116_13220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319449.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SRPBCC family protein" /protein_id="PRJNA477356:DP116_13220" /translation="MSQVFEQSIQINATATTVERCITENTLMHRWLNPVLRCEPVGVW STNVGSKSRFVIQIPIIKPTLDSTVVERQPGLVVWEFQGFFKGRDAWECQPLDKGTRL VNRFEFQIPNPLVSWGFKTFAQNWTKEDMQSQLRRLKRVAEEIQQGFY" gene 14821..16473 /locus_tag="DP116_13225" CDS 14821..16473 /locus_tag="DP116_13225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747834.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13225" /translation="MNTTWQQITQVMEVDLSMRVQYLLAQLLAQSPTLPSERPDAVTY LQGTVQQVLNFMPRLLGAVLILLIGWLIAAIVSAVVRSLLKRTNIDNRIASGIAGGRD VPQVESIISGIVFWSILLLTIVAVLDTLQLRVASQPLNSFLNQIGDFLPRLVSAAVIL GVAWLVASLVKLITTRGLQALRVDERLNPPQDDGLNLSNLSVSETIGNALYWFIFLVF LVPLLENLGLNQALLPVQSLVTQIISIVPNILGATLIAVIGWFVANIVRRIVTNLLAT TGIDSLGSRFGFSGVSGTQSLSKIIGTIVYVLILIPVAIAALNQLQIEAISVPAISML QQILNALPSIFTAIAILIVAYFVGRFVAELVTNILTSIGFNNIFSVLGLPSPTRRVVI PQEPTAPGISNRTPSEIVGIIVLVGIMLFATLAAVNILNIPALTVLVTGIVIVLGRIL AGLVVFAIGLFLANLAFSIISSSGNRQARVLAQLTRIAIIAFVSAMALQQIGVASDIV NLAFGLLLGAIAVAIALSFGLGARDIARSQVQEWLDSFKGKN" gene 17179..17772 /locus_tag="DP116_13230" CDS 17179..17772 /locus_tag="DP116_13230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319451.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fasciclin domain-containing protein" /protein_id="PRJNA477356:DP116_13230" /translation="MLNQNRPNALITKSQLKKLACAIGIAGVSTLIAFPVLAKFYAPM YLFQPSAHRNYPYRNSDKTIADTLSQNSKFANLYHELKQAGLLKDLKQGNYTIFAPTN EAFNALPKNVFERYSQSQNRLRVLKYHLVASEIKAKDAKELNGKLITTVEGDQIKITV DPQDTVKLNDATGKHPSIKASNGVIIEVDKVLLPRGL" gene complement(18012..20909) /locus_tag="DP116_13235" CDS complement(18012..20909) /locus_tag="DP116_13235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009460330.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13235" /translation="MKQKLLPRKIQNFFKRIYKRRSLVLAALLFILSGVSPVVAAKVS SPTSIVQSSYDAEQLANKAVKLYQSGKFTEAAAAWKQTAQVFATIGDKLNQAMALSNL SLTYQQLGEWEQAKKAIEDSLALLKTASKGKEELKILAQSLDIQGSLQRELGQTADAL NAWQQASKIYSQTSEQEKLAQSKINQAQAMQDLGLSPRACSTLLEVLNQDIGVSNCQE LNDLDKPEFNKPKTEILKQRLQVVADKSPSLSRVVALRSLGELLRFVGQLEQSQMILE TSLKQAQKLNSPQEQAAVYLSLGNTARDLAEVNKILRSTRISYEQKALDSYSEVIKLS PSPTAQQQAQLNQLNLLLSQLKPKKSSETPEAENLSEAENLPILSQAEELWNRLNPQL NNLSASRTGVYLQINFAENVLKLAQQENFTLKANSKLPTFDEVDRILAKAAEQARSLG DKRAEANALGNRGGLYELTRPTRDLAKAEELTKQALNIAPSFSTPDIAYQFSWQLGRI RRDQGKTANAIAAYTAAYHALQSLRSELVAINPEVQFSFRDNVEPVYRQLVELDLKEA DSLKQAGDNKKSQEYIDQARTVIESLQLAEINNFFRESCVEAKPQEIDQIDKTAAVIY TIALQDRLEVILSLPNQPLTLHTAPLRPGELEQTVNDVRGSLIGADSKVEDFLPTYKQ LYDWMIQPVEAELAKSKVKTLAFVLDGDLRNIPMGVLYDGKQYLLEKYAIALTPGLQL VNPKPISKVGLRALTAGLSKIRPNFPAHEGFKPLGNVEEELKQIEKFGVSSQELLNDK FTSAEIQKQTVASRVPPIVHLATHGQFSSKVEDTFILAWDRRINVKELGGLLRDNTQY QKTPIELLVLSACETASGDRRAALGLAGVAVRSGARSTLATLWTVEDKSTAEVMGQFY RQLEQARKTNINKAQALQQAQLALKNQVDLKNREYTHPHFWAPFVLVGNWQ" gene complement(21328..>21784) /locus_tag="DP116_13240" CDS complement(21328..>21784) /locus_tag="DP116_13240" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=2 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_13240" /translation="PTNDITAISQNNPNLSGTINIITPDVDPTRGLFELSETVIDPAQ QIAQNPFTKGFGSSFTIIGRGGIPTDPKKILSSDNVRVDLVKPSTTTNSITVTVDQPS KSPTVKRIIPAQGWIFNEKGEVLLVGYDPTKTGPQRQQQTPASSCAAVR" BASE COUNT 5953 a 4547 c 4675 g 6609 t ORIGIN 1 ttgcaaatga tattgacaaa ctcctcctcc ttgtggaatg aaacagccct gccttgggaa 61 gggggtcgcc gcaggcgttc tgatcttaac cgaaccgtat gggagagggg atggggtcgg 121 tcaggtcatt gagccgcaat ttagtttcat ctcttaggtg caggaaagtt aaccacaatt 181 taggttagtt tttgagaaca actgatgaat actatctgcc gtggatacct caattttatt 241 aaaaaccttt attgctgtct ttgttcttgc agatgcagtg ggtaatatac caatactttt 301 ggttcttact aaaggtatgg aaccagaaag cagaagcaga gttatagata aggcaattgt 361 agtggcgatc gccgtacttt tgctctttgc ctttgctggt caatttattt taagttattt 421 ggacgtcagc ttggggtcgt tgcgagtcgc tgggggacta ctactgttgt taattgccct 481 acgaatgctt caaggagact tagatacacc cgttattgat caagagcgtg atgttgcaat 541 tactcctctt gccttgccac tgctggctgg accaggtact ctgacaacag tcatgctctt 601 gatgtcggaa tctcccaacc cacatcttag cgtcatcgtg ggtattgtcg cggcaatgtt 661 ctgtacttgg ttgattttgc gtttggggag tggtatcgac aagtggattg gcgttgaagg 721 agaagtcatt atcacccagc ttttaggttt tttgctagca gcacttgcag tgcaaattgg 781 cagtactgga atcaaggagt tgtttttcac ttaatcaact gccctagaat atacaaaact 841 tacacacgaa caaggtaact atagtcattg cgaccgtatc tcctgcggag acgctacgcg 901 aacgcgcagc gtgccgttag gcatagggaa gcataagccc taagtcctta cggacacgct 961 gcgcgaacgg gcacgctgag tcccaagggg acacgcgcaa gggacgcgaa cgcaacccgc 1021 tcttttaaag tagcgcgtgc gtaagtccta atacagccag tgctatcgta caagcaggaa 1081 atgctgattt agataattgg ggagaaaaac aatgaagcgc gatgaggttt tagcaattgt 1141 cgcagctcat cggcagcaat tgcaagcaat gggggtgaag tctttagatt tgtttggttc 1201 ggtagcacgg gatgaagctg gtccagatag tgatgtagac tttctggtag agtttgaccg 1261 tccagtggga ttatttgatt tcagtaaagt gcgactctac ttagaggatg ttttggggtg 1321 ttcggtagat atgggaacac aggatgcttt gcgggaacat ttaagaaaac ctgttctgaa 1381 ggatgtgatt cgtgccctct agagactggc ggcttcgtat tcaagatatt 
gtgcaatcca 1441 tcgctgcaat taggcaacgt acggcaggaa tgacgtttga gcaattccag ggaaatgaga 1501 caattgcaaa agctgtttta tacgacttta taattattgg tgaagcggct atcaacgttc 1561 cctctgatat tcagtctcgc taccccgaca ttccctggcg tattatgggt gacatgagga 1621 atgtaatggc tcatgaatac ttccaagtca cgttgagaat agtttggaac actatcgaaa 1681 atgaccttcc ctcactgatg cagcaattag aagaagtaat cgagcacgaa gggataggag 1741 aataggttag ttaggaggcg atatcgcatt tccaagttag gcgatgcgcg cagcgtcaag 1801 cctccggctt atcgcgttag gtgaggggaa agaaaacaaa ccgccttagt gcagcgtgtc 1861 cacaggacat agacgaggac acttgcgtac gcagagaagt cgagggtaat cttaaaccgg 1921 caagggagta gtttaattac cagcgcgtac aatgttcccg tttatgcacg gtggtgcttg 1981 aaaaagcggt agactaaaaa tgacggaacg ttgggataaa tcaacttgga aacacaaaga 2041 actgcttcgt ttttagacag caaccatttg ctactcgcac gcgatcaaga tctgtttgag 2101 gttgttggac aaagaaactt aaatagaaca gcgatcgcaa cacgtgaaat taagcagggt 2161 gaggtcatcc ataacatttt gccttactca tccaaaacaa atgcagattt tttacagcta 2221 ttttcagtta attgaaccac agactttagt aggagcacgc tgagaacagc ctctaccccg 2281 tggttgattt acttgaaaag cgttgttagt ggatgtttga gtgatttatt agacacacta 2341 gctttgattt ttacactctt gggatgtatc gagtttattc aaaaaccaaa taggattcct 2401 ataagcagat ggaaaaaagc aaacactgct atgtgaagat atgtaaagcc tgactttttt 2461 tttaaataac tcttttgtta tttttaacat tcctttacac agcttagttt acttatttcc 2521 acccacttag ttcaattgct aaggagttat gctgctcaat gaccatatca ttaaataatg 2581 actctcaaag atcaccaaac actctccatg ctttgggtat agcagtgttg cttgtcgtca 2641 ccctcatctg gggtacgagc tatccactcc tcaagggagc cgtcagtagt ctttctccgg 2701 ctgcaatatt tgcaacccgc ttcgcagtag cggcattgcc tttcactccc tatttgcgtt 2761 tccttaatct tcccctgcta cgggatggaa tcttcttagg acttgtaatc tttagtacct 2821 taaccctcca aaccgttggg cttgagacta cctctgcaaa tcgcgctgca ttcattgcca 2881 gtttcaatgt catcctagtt cctctgctgg ggcaactgct aggtcggcaa gtgttcctaa 2941 aaactttcct caccgcagga attgccatca ttggcgttgg agtgatgtgt tgggagagtg 3001 gacagattgt tatcggcgat ctcttgatgt taggcaacgc cttcctctac tcgatctaca 3061 ttctaatgct tgagtccatc acgtcgcgcc accccatttt gccactcaca gcgatccagc 
3121 tttgggtgat aacgatagtg tctttattct ggggagcttc ggatctggtt agacagcatg 3181 aggcaattgc tacaaacttt ggtgtactat tatatcttgg tttagttgat actgctgcaa 3241 ctattgtgct ccaagcagtg gctcaacgat gggttaacgc ttacgaaacg gcgctaatgt 3301 atacgcttga gccgattttc gcagcggttt tctcgtttct actactggga gagaaactgg 3361 gagtacgcgg tctgattggt gcaattcttg tgctggttgc aatggttttc ggtcagagta 3421 aatctcagga cgccgaacag tacggcaaaa tacaagtcaa cgaaccaatt gttgcatcgc 3481 tcttggctgc ggatactgag cctattaatg tttcagtgtc gcttctgaac gctaacctca 3541 ttgaatccga attcgattcg taatctgtac aaggaaacag gatgatagac ttggatatct 3601 tcgtaaaagt aacaaataaa ggaaaaagcg tttttgcaag ccgatacttc cgtaaaggtg 3661 agacggtggt ggttggacgc cgagtcgaaa tcttgcctga gagaactaat cactctttgc 3721 agatggattt tgacctgcat atagaattag atgaacctgg acgactaatc aatcattctt 3781 gtagcccaaa cacaggggtt cggaataata aattcggagc ctacgatttc gttgcattgg 3841 ttgatattcc ttccggaagc gagattacat gggactacga gacaacagag ttcgtttcca 3901 ttgccattcc aaaatgttct tgtggctccc ccgaatgtcg attaaagacc cttgggttca 3961 aatttctccc tgtcgaaatc aggaaaaagt atggtgagtt tatcgccgat tacctcaaac 4021 catttgttga tgaacctttc tgtagcaatg agctaaatga tgccatgctg gatttggtag 4081 ataagtggca cccctctcct tgataaggag aggggcaggg ggtgaggtaa tctcgcgtga 4141 tgaagttccc cgcatcgggt ttcttcccca ccctcaactt tgaaatcaca aatcgctatc 4201 cagcaattcc tagtactttc cttagcacta cttggtcttc tgatttggct ttgaggcgac 4261 cattttccaa atctcgaatc cagctttgac ttttacctgc cattttcgcc aattctcttt 4321 gggaaaggtt caggtttttc cttgcttgca aaatctgttc acctagcaga tatcctgctg 4381 tggtttttga ttttattcgg ttttttgctg tgcgttgctt tttttcagat tcactggttt 4441 gttgctccca gtctggaggg agtgtaaatg acagaattcg ggcattcatg agcatattcc 4501 acttgccacg gggacctgca tccgtcaggc gtgtttttcc gttggcgtcg ttaatccaaa 4561 actctaacgc atcatctgga tcttcaggaa tatcaaccaa tttcgcccac aagggttgga 4621 tcgtcggtgg gtatgtgact gggtcgaaag ctggtttcat gccatagtga ttgaggactt 4681 ccaaatcgtg ttcaaaagtt cgtagcagtc gcttgcgttc ttctcgtttt ctgcaagcaa 4741 gggtgacttt ctcttcaccg taagcaacac gcagcaaggt gggaactgta atccgctgtt 4801 
cttttcccat tttagtctta aacaacaacc acagcatcag ccgcgctgct ccttcgtgtt 4861 gctgccaaat actcatgact gtcgttaaca gggatttcgg caaactacca tattgataaa 4921 atgtggttcg ttctttgcat ccttgtttgt ttaggaaata ctgcgcccat atacctgctc 4981 ttaccttaaa agtgagtcca accaaatatt tgcaccccaa gttatcttct tggaagtgat 5041 gctgaatttc tactaagtgc cataaagggg ttccttccac tgaaaatccc ctaactcgac 5101 cttgttgggg ccaatttagc gaggtgatca gtgagcaagt ttgctgcacc aagtttttca 5161 ttaaagtcaa tttagcagct ttgctcagat ctttgcgttt ctctagcccc agatatttct 5221 caatttggcg atcgtcgatg gtgaagactt gctcactcgg ctgatctaaa gctgtcgcat 5281 gagctgcaaa aatcaaatgt atacaagttg ctctgatatc cagcgtctca acgactccta 5341 gttctgttgc gagttcgttt gcattgtgtg gtttggtgat gtgaaatgcg atcgcccctc 5401 gcccctggtt aacttgtcgg cgataacacg cccttccgta ttcatcaatt tcccaaggta 5461 gcgaggttcg ctgtgccagt acattgcaag cctcccagat gacaattgct gatgcaaatg 5521 gattatttct ggtgttcgca aacaaatctg ggctagtcgc atctctgata tctactttac 5581 atctccccct tttcgcctgc atagggatcg gcaaatgaat gggacaagtc actacacatt 5641 gaggttcgga atgatagcct tcgcagttgt tacatagact tgggtcgatc cagtactcat 5701 tgttctcgaa tttaattgca ccagtaggac attggggacg gcagttgtca catccaacgc 5761 aactgttgtt aggaattgta taaggcataa ggctttttcc tactttaatc gttgagatgt 5821 ctggctcaag tctcccctgg atgtgtcctc ttaattgact actcactact gactcaccac 5881 cacattattt ttccgacaat aagaaatttt acttgccgta ctacttaacc acatccttat 5941 tcagaccacg catcctattg gaagctattt aaggcttgta accagagcct aatttttaag 6001 atttcttact atctaatctt tcataaatct ttatataagt ttgttactaa tgagctaaac 6061 atgattttgt agatttcata tttaaaattt ccttaaagaa cacttttgat agttgtcaaa 6121 tcttttccat tgttattgaa taacaattcg ttgttgtttt tgctaattac taatattaat 6181 aatatttaca tgtcttcact atgtcatcta ctttactttt tgatcattca tttaaattta 6241 tttccttcaa gcaatcaaca atgagcatta aaaaaagtat aaaaatatgc acactttatc 6301 tttgacgctt aatatctaat atgctctaac ttataaaata taagcttggg ataagaagac 6361 aaaatttctc aaaatataat aaggtatatt gtcaaactat taatattgag taatttcaga 6421 ctttgtttgt tgatattaaa tctcattact tctcaacatt tcacggctat ttacaaagat 6481 aagagtcatt 
ttttgtgtct atcggaaaga aaaaatgata gttttctctt gatcattagg 6541 tcatactact caatagatgc gacgtgaaaa agcctatttc ttaataagag tgagttattc 6601 gagatgttac agggactcct agttttagag agagtatccc atttttatca atttgtagtg 6661 cgggaagaga gcaagatgct gccactttcc tttttttgca cttcttcaga tgctttcgtt 6721 tgagagacaa ctattttgtt atctagatac tctgaaaaac tattggcaaa gaatcgcatt 6781 tgattttttt agtgtgaata tttacagatt ttttgacaag tcatgaaaat ttaatgagcg 6841 tccaaattat attaaaaaat ccgaaaaatc gaatttacct tatgttagga tgagcgtaaa 6901 agtcgatgag acatgtaaag agagtagaag gtaaaatcat ggcaacacta acaggattaa 6961 catttggtgg taaaacttgg acacccaaat tcgttcagga gattgaccaa gaaaaatgca 7021 tcggctgcgg cagatgtttt aaaatttgcg gtcacaatgt attactgctg aaagcaatga 7081 acgaagaagg agaatttgtc gaggatgagg atgatgatga aattgaaaag aaagtgatga 7141 ctgttgctaa tcaggaaaac tgtgtgggtt gtgaagcttg tgccagaatt tgttccaaaa 7201 actgctatac tcactctgca ttagataact agagttgttg ggttgttcag aggtaggtaa 7261 aagaaggtgc tgtttgacaa aagtgttaag tttgggtatc taaagaacaa cttctcactt 7321 tagtagcaag gtaacgagag gggaatattt aaagttacaa gaacgtcaaa gcaaaaagtt 7381 cacagcaaac aaatccgtaa atcttaaagt gtaagctatt aacaaacaaa aacttgctgc 7441 ccagatagct atcatgaatc accaagaccc tgagcgacct cctctgcaac tagattctgt 7501 gaaagtgcaa agctctggta ctatgagttg agagatagtt ttccgttttt tgggggagag 7561 aagaattaaa agattttaaa attcccccac attttctaaa aaaaacgtag acaataaact 7621 tggttaacac aatcgcttat tgcgtcatta gtcattaatt actagtcttg aaaactaaag 7681 aaatgataaa tgacaaaaaa gacatgacga aagatgctac tagcagttag ccatgtaagc 7741 aagagacatc aaaagttact agttatcagt tactaatgaa gaataaattg ataactgttc 7801 actgataact gatgactgat tgaacgatgt tttaccgacc aagtatttat gcaacaaatt 7861 cctgtttcac tatggaccct agttgctggg atagtagtta caattctcag catttgggtt 7921 ggacaaaatc acagtctact accgatacaa gcatcgcaac aagcgccttt ggtagacggg 7981 ttttttaacg ttatggtttc gattgctacc gctctgtttt tagtggtaga aggaaccatt 8041 gtgatttttt tggttaagta tcgtcaccgt ccaggagatg acacggatgg cgtgcatgtt 8101 gaaggcaact taccactaga agttttttgg acagcaatcc catctattat cgtcctctgc 8161 ttaggaatct acagcgttga 
tgtttttaat cgaatgggag ggtttgaagt ggggggtgct 8221 cataacatgg ctcatagcca ttccccagct catgttgccc aaatgccagg aagtgccatt 8281 gctgctaccc tgagtgacgc ctcacagaat gagtcagaaa tggcagcacc agcaggtact 8341 cctataattg gtattggcgc aactcccaca gagataggta aacctgctga tttggttgtc 8401 aacgtcaccg gaatgcagtt tgcttggctc ttcgactacc ccgaaagtgg agtgaatgct 8461 ggagaattac acgttccagt tggtgctgac gtgcaactca acatttctgc aacggatgtg 8521 attcactcgt tttgggtacc acaattccgg ctgaagcaag acgctattcc tggaataccc 8581 acccaactgc gatttgtcgc gacaaaacca ggcgcatatc cagtagtttg tacagaactt 8641 tgtggcggct accacggttc aatgagatca caggttgttg ttcacacgcc tgaagagttt 8701 gaaagctggc tttctgaaaa ccgcattgcc caaaagcaag acatgcaaaa agccgttgca 8761 gttaatccag cagacttatc aacatcagag tttctcactc cctacgcgca acaaatggga 8821 gttggttcaa caactctgga gtcattagtc atgagtcatt agtcattggt gacaacaaag 8881 gacaaacaac aaaggacaaa tgacaaaaaa aatatgaccc aagtagaact tccacggaac 8941 attccacctg aagacaatca atctggtact cagaaggttg ttcatccaca ccatcccaag 9001 gcgtggaaat ggtacgatta cttcacattt aatactgacc acaaggttat tggtatccaa 9061 tacctggtta cggcgttctt gttttatctc atcggtggac tcatggctgt tgtcatgcgt 9121 gccgaattgg caacaccaga tgcagattta attgacccta acctgtataa cgccttcatg 9181 accaatcacg ggacgatcat gattttctta tggatcgtac ccagtgcaat tggtggattt 9241 ggtaactatc tagtgccatt gatgattggt gctagggata tggctttccc caagctgaat 9301 gcgatcgcct tctggttaaa cccacccgca ggcttgctgc tagcagctag cttcatcttc 9361 ggcggatcgc aatctggttg gacagcttac ccacctctaa gcttagtcac agctaacacc 9421 gcccaaagcc tgtggatact tgccattgtc ttggtgggaa catcctcaat tttgggttcg 9481 ttgaactttg tcatcaccat ctggaagatg aaagttccca gcatgaaatg ggatcaattg 9541 cccttgttct gctgggcaat tatggcaacc tccgtactag cacttctctc cacacctgtg 9601 ttagctgcgg gattggtgct gctgttgttt gacctcaact ttggcacatc gttctttaaa 9661 ccagatgcag gcggtaacgt tgttatttac cagcacttgt tctggttcta ttcccacccg 9721 gcagtttatc tgatgattct gcctatcttc ggcatcatgt ccgaggtgat tccggttcac 9781 gcgcgtaagt caatttttgg gtataaggcg atcgcctatt ccagtttggc aatctgcgtc 9841 gtgggtttat tcgtctgggt acaccacatg 
tttaccagcg gtacacccgg ttggatgcgg 9901 atgttcttca caatctccac cctgattgtt gctgttccta caggcgtcaa gattttcggt 9961 tgggttgcaa ccctgtgggg cggtaaaatt cgcttcacat cagccatgct ttttgccctt 10021 ggcttgttgt cgatgtttgt catgggtggc ttaagcggcg tgacaatggg aacagccccc 10081 tttgatgtcc acgtccacga cacctactat gtggtggcgc acttccacta cgtcttgttt 10141 ggtggttccg tgtttgggat ttacgccgga atctatcact ggttccccaa aatgacagga 10201 cggaagttga atgaatccct cggtcgcatt cactttatcc tcaccttcat tggcacgaat 10261 ctcaccttcc tacctatgca cgagttgggt ttgcaaggaa tgccccgacg agttgcgatg 10321 tatgatccgc aatttgtcag cctgaatcag atttgtacca ttggtgctta catcttggca 10381 gcatcggtga ttcccttcac catcaacatt ctctggagtt gggtatctgg agcaaaagca 10441 ggtgataatc cttggaatgc tctcacctta gaatggacaa ccagttcccc accactgatt 10501 gaaaactggg aagaattgcc cgtcgtcact cacggtccct acgactacgg cttgaacaat 10561 catagtactg aaatacagtc atctgttgcg actgaagttg gtgcttagtt agtcatttca 10621 ttcccgctcc caaccgccag gaaagaaatt tcctggcaga tcagctcttg ttaaatcagt 10681 caacagtgaa cagtgaacaa ggggtggacg agtccgtttc ctacctgata actggtaact 10741 gataactggt aactgataac tgattgttga acacttattc accacacaga cgatatgact 10801 atagcgacat cgacgagtcc agccccgcgc gttgaacatc atccagattt acgagtctta 10861 gggctattag tattcctcgt ctctgaatct ctgatgtttg gtggattttt tgccacttat 10921 ttattctttc gaggcagtac cgaagtttgg cctccagaag gaaccgaggt agagttattg 10981 ttgcctgcaa ttaacactat tattctggtt tccagcagtt ttgtgattca cttgggtgat 11041 acagcaatca aaaagaatga tgtccaaggt atgcagaagt ggtacaaaat caccgctatc 11101 atgggcgcaa ttttcttgct gggtcaagtt attgagtatg taagtttagg atatggcatg 11161 acaactaatg tctttgccaa ttgtttttac ttaatgactg gattccacgg tttgcacgtt 11221 cttatcggag tgttgttaat tttgggtgtg gtgtggcgtt cccgccgtcc cggtcactat 11281 tctgctacca aacacactgg cattgcaatg gcagaaattt attggcactt tgtagacatc 11341 atttggattg ttcttttcac cctgttgtac attctcacca ggttttaaat tgagagtcgg 11401 gaataggaaa tgagtaatac ctattcccca ttcctaaagg agaagtctta tgaaagttga 11461 tgccgaaaga ttttcatggt ttgaagttcc agacgacatc aaacagttat tgatattggc 11521 agcagaacat 
tggcaaaata ctgaagaatc agaaaaatat attaaccaag ctttagccaa 11581 aacggcagaa agcacagagg ttttagtagc cgcttataga tactttttct ataaaaataa 11641 ctatcacatg gcgttgcaaa ctgcaattaa agttatagaa aaaattaaat ttaccgaaca 11701 attaccagat aattggcagc aaactaagca aatattaatc agtcgtaaag aagaagcaaa 11761 cattagatta tatctaaatg cttatgcagc ttctggactg gtcttggcaa agctaggaga 11821 tatagaaaaa gctaaaaaaa ttagcgctca agtcaaggaa gtagatgata aaaatgagtt 11881 cggagcaagt attgtctaca atattttgac acgtccagtc gaagaagacg aatgaattct 11941 aagtgatcaa gtctcaagtt gatgctcaca ttatcattga ctatacggtt taaaacatat 12001 gaaaggaagg gattggcttg tcactgaaga cgggcagtat caaacctgtc aatctgtcag 12061 agcatgggat ttgctgagag ataattatcg cttttatcga tttctaactg aagtagaaga 12121 tgccctcagt gatactgaag atgaaactag ccatcttcca gaaattcgga tgctagtcag 12181 gcggttaatt gttaactcct actgggtaca aagtcgatat ttagagcctt cttctaaaac 12241 aggaatctca gttgtactcc tttatgatga gttaggtttt ccatttactg tacaaacggt 12301 aacattggca ccgggaacgt catcaaaaat tcacaatcat ggaacttggg gtgttgtcgc 12361 tgtgttaaaa ggagaagaga gaaatacttt ttggcagcgc acgcctgatt taaattttca 12421 tgacaaaatt gaaagaacag gagaattaac tttatttcca ggcgatatta ttagctatac 12481 ccccgatgca attcatcgtg tggaagcagt gggtactcaa ccgactgtga ctttcaatat 12541 ttatggcgaa actcgtatgc aagaacgttt tgagtttgat acagttagcc atattgctag 12601 aaacttttaa tcttctaatt tctctgcgtt tttctgtagt tccaaaaagc cccataccgg 12661 gtttttgtaa gaaattaacc aattaggaga tattatggca acagtcattt tctatgaaaa 12721 accaggctgt caaggtggta ctaagcaaaa aactctgctc acagccgcag gtcatgaagt 12781 gatagcctac aacttgctca cacaaccttg gactgctgaa cgtttacgct cattttttgg 12841 cgatcgccct gtatctgaat ggttcaatcg tgccgctaca cgggtaaaat ctggtgagat 12901 cattccagaa aatttggatg cccaaacagc cttgatgctg atgctaaaag agccattgtt 12961 gattcgtcgt cctttgattg aagtgggcga tcgccgtgaa gtcggctttg aggtagaaaa 13021 aatagatgct tggattggct taaagcccaa ggatgcaact cttaaagaaa ttactgaaac 13081 gctgatgaac caagatttgc aaacttgttc ccacaagcat gaacacaaac atgagccagg 13141 ttcgtgtaag cactagatga gtttattgac tgctgatata gcttgggtgc atcgctgaaa 
13201 cgcaatgcgc cctttttgtt cttgatcaaa ttatgtcttt agagagatta taccgcttct 13261 gatgttgcgt gcaatacaca gatgtagaga cgtaatatgt aacgtgctct acattattcg 13321 tggtgatgta tgtgattcaa atgagaatcg ctatatacat atattccaaa aattagtttg 13381 ttaaaaacaa tactctattt aagataaggt taaataccat gaaatcaata cgggtaaact 13441 gatggaagat caaaatcaaa aagacaagga caactttctt taccctcgtg gtcgctacta 13501 tggtcaattt aagccagaaa acttagtatt taatgccaat ctccaacaat tcgctcaaaa 13561 aataggctat atcacgtcct tagaaacgtc tgggaaaatc tctccgttag atgcttacaa 13621 ccagattaaa gcgctttgga aacagttaaa acgcagcaaa aaagagcttg gtattggcaa 13681 cgaaccacca agtgaaccag aatcaccaga tgttacgtga aatccgatga atcacataac 13741 catatcttaa tccaaaagat ctgactttaa tgtacgacaa gcttagtaaa acccttgttg 13801 tatttcctct gcaacccgct taagccgacg cagttgactt tgcatatctt cttttgtcca 13861 attttgggca aaggttttga aaccccaact taccagaggg ttgggaatct ggaactcaaa 13921 gcggttaacc agacgtgttc ctttgtctag gggttgacat tcccaagcgt cacgcccttt 13981 aaaaaaaccc tggaactccc acacaaccaa acccggctgt ctttccacga ctgtgctatc 14041 caatgtcggt tttatgattg gtatttgaat cacaaaccga cttttgctac caacattggt 14101 actccacaca ccaacaggtt cgcaacgcaa gacgggattc agccagcgat gcatgagagt 14161 attttcggta atgcagcgtt ccacggttgt ggctgtggca ttaatttgaa ttgattgttc 14221 aaagacttga gacattccac atcttaattt ggtatggctg catttagagt accgcgaaac 14281 tctgcactgc caacagattt gtaaattaag aactcggaat ttggagtatc tcctccacgc 14341 ctacgctgcg ctttgcgctt acgcttgcgt gcgctttctt tgcccatacg cgccccagga 14401 gtgctgcaca tactgaattt tattagtatt gaaatcaatc agtacagttg atgttaagcg 14461 ttaagagttc cctgttcaga gttccctatt ccctattaag agttccctat taagagttaa 14521 gcgttaagag ttccctgttc cctagcgaaa ctagtaagta tggctccgcc acgcaagcta 14581 acataaacca aaccggattc ctacatatgg tataaatact ttattgtttg ataaaacgaa 14641 aaaaaaaagt atcaaagtcc aaaaatcact gctaatattc atttttctgt agaactttgc 14701 tggtaaattt atagacaaaa cactgataac ttacggaatt gtaggattat cgggactttc 14761 gacatcagaa gcatctaagg tgttttctgc aataacactc aactttaatt ttgaacaacc 14821 atgaacacaa cttggcaaca aataacccag gtgatggagg 
ttgatttgtc aatgagggtg 14881 caatatttat tagcacaatt attagcacaa tcgccaacgt taccatcgga acgacccgat 14941 gcggtcactt acctgcaagg aactgtacaa caggtactta attttatgcc ccgtttgctg 15001 ggagcagtgc taattttatt aattggctgg ctgattgctg ctattgtatc tgctgtagta 15061 cgctccctcc tcaaacgcac caatatagac aaccgcattg cttcggggat agcgggtggt 15121 cgggatgttc ctcaagtgga gtcaattatc tccggcatag tcttctggag tatcctactt 15181 ttaacaatag ttgccgtttt agatacgtta caacttagag ttgcttctca gccgctcaac 15241 agttttctca atcaaattgg tgacttctta cccagacttg tgagtgcagc ggtgatctta 15301 ggagtcgcct ggttagtggc tagtctagtc aagctgataa ccacacgcgg actacaagcg 15361 ttgcgggtag acgaacgatt aaatccacca caagacgatg ggctaaatct cagtaacttg 15421 tctgtcagtg agacgattgg taatgctcta tattggttta tctttttagt ctttctcgtc 15481 ccgttacttg agaacttagg gttaaaccaa gcattactac cagtacaatc tcttgtcaca 15541 cagattatct caattgtgcc caatattttg ggtgcaacgt taattgctgt gattggctgg 15601 tttgtggcta acatcgtgcg tcggattgtc acaaacttac tggcgacaac gggaatcgac 15661 agtttgggaa gtcggtttgg attcagtgga gtttcaggaa cgcaatcctt atcgaagatt 15721 atcggtacaa ttgtctatgt tttaattttg attcctgtcg caattgcagc actcaatcaa 15781 ctgcaaattg aagcgatttc cgtaccggca atctcgatgc tgcaacagat tctcaatgca 15841 ctgccgagta tctttacagc aatagctatt ttgattgttg cctattttgt tgggcggttt 15901 gtagcagaac tagtgaccaa catcctcaca agtataggat ttaacaacat cttctctgtt 15961 ctcggtctgc catcacccac cagacgagtc gttattccac aagaaccaac agcacctggg 16021 atatcaaacc gcaccccgtc ggaaattgtc ggcattatcg tccttgtcgg cattatgctg 16081 tttgcaactt tggcagcggt taatatcctg aacattccag cgctgacagt gttggtgact 16141 ggtattgtga tagtgttggg gcggattttg gctggattgg tcgtattcgc tataggtttg 16201 tttttggcaa atcttgcttt cagcattatt agcagttctg gtaatcgcca agcacgggtt 16261 ttggcacagc tcacccggat tgctattatt gcctttgtat ctgcgatggc gctgcaacag 16321 attggtgttg ctagtgatat tgtgaatttg gcttttggac ttttactggg agcgatcgcc 16381 gttgccattg cattatcgtt tggtttaggt gctcgcgata ttgccagatc acaagtccaa 16441 gagtggcttg actccttcaa aggaaaaaat taactaactg tttgcactca agacggacgt 16501 tcacgcctta aagttgcata 
cttacctgtg aatcttgtca agagtcaaga gtgattctca 16561 attagagaac tcttggctct tgaatacttg aagagaaaac cctgcaattt caaatgtttc 16621 aaccgatttg tttaatgtgt aaatttttat gactaggaaa attagaaagc aagatttagg 16681 tgtaaaatta tagagcaaaa caacaaataa gggaaacaaa cagtcaaact cacagagatc 16741 gaagtaaaat aaagaagtgc aatttcacta atcgttggga ttattaaaaa aagttttgca 16801 gcagacaagc agctttttct tgactcagaa tcgggaatgc aaaaacagct gttatttttc 16861 agaaaacatt ttgagatttt cagaaaagat ttcgggaatt acccttgagt gggttgtagt 16921 tagcattgag actaagactt ggctgagcat gaattgtaag accatgagta ggagaggata 16981 gcaccgtggg cagacagcgc gttatagtac tggctccttc gtgactggcg ttgaggttac 17041 attcgttgta gcgactaaac gttaatctta cctcctatag cctcctatcc gtacaacctc 17101 aaagggtggt ttccggctaa gagtcttcga ctagaagttg gaaaggtttt caagtgttta 17161 ctcataactc atctacttat gcttaatcaa aatcgtccaa atgctctaat caccaagagc 17221 cagcttaaga aattggcttg tgctatagga attgctggtg tcagtaccct catcgctttt 17281 cctgtattag ccaaattcta tgctcccatg tatctttttc agccttctgc tcatcgtaac 17341 tatccctacc gcaactctga caagactatt gctgatacac ttagccaaaa tagtaaattt 17401 gccaatcttt atcatgagtt gaaacaagca ggtcttctca aagatttgaa gcaaggtaat 17461 tacacaatct ttgctcctac caatgaagcc tttaatgctt tacctaaaaa cgtctttgag 17521 cgatatagcc aaagccaaaa tcggctcaga gtgctgaagt atcatttggt tgctagtgaa 17581 attaaagcta aagacgcaaa agagctaaat ggtaaattaa tcacaactgt tgaaggtgac 17641 caaatcaaaa taactgttga tccacaagac acagtcaagc tgaatgatgc tactggtaag 17701 catccctcta tcaaagccag caatggtgtg attattgaag ttgacaaggt acttctaccc 17761 cgtggcttgt agaactccac ttttaggggt gggcgatgag gatcatcgct tgtgtaacac 17821 actgctcatg aaccttgtga tgtgttacgc caagttgcat tgaacgtaac ttggcatttt 17881 gcctcgttcc cagtctcaga ctgggaatac ctatttggag gcttagcctc ccttaatcaa 17941 actaaagttc acaccaatgg gcacagacag catgctgtct gtgcccaact ttgttttgtg 18001 gtgtgtgtgg tttattgcca attccccact agtacaaacg gtgcccagaa atgggggtga 18061 gtgtattccc gatttttaag atccacctga tttttgagag ccaattgtgc ttgttgcaag 18121 gcttgtgctt tatttatatt tgtcttcctg gcttgctcta actggcgata aaattgaccc 18181 
atgacttcag cagtactttt atcctctaca gtccagagtg tcgccagtgt actgcgcgcc 18241 ccagaacgca ccgcgactcc agccagtcct aaagctgctc gcctgtctcc acttgctgtc 18301 tcgcaagcac tgagaaccaa taattcgatt ggtgtcttct ggtattgggt gttgtctcgc 18361 agtaagccac ccaactcttt aacattgatg cggcgatccc aagcgaggat aaaggtatcc 18421 tcaaccttag aactaaactg accatgagtt gctagatgga ctatgggagg aacacgagaa 18481 gcaacagtct gtttttggat ctcagcgctt gtaaatttat cgttgaggag ttcctgagat 18541 gaaacgccaa atttttcgat ttgtttgagt tcctcttcta cgttacctaa tggtttaaaa 18601 ccctcatgtg cggggaaatt cgggcggatt ttgctgagtc ctgctgttaa ggctcttaaa 18661 ccaacctttg aaattggttt tggattcact agctgcaaac caggtgttaa agcaatggca 18721 tacttttcta gtagatactg cttgccatcg taaaggacac ccatagggat attccgcaag 18781 tctccatcga gtacaaaagc caaagtcttg actttacttt ttgccaactc tgcttccact 18841 ggttgaatca tccaatcata caattgtttg tatgtgggta aaaagtcctc gactttactg 18901 tcagcaccga tgagagaacc ccgcacatca ttgacagttt gctcaagttc tcctggacga 18961 agaggagctg tatgaagggt taaaggttga tttggtaggc tcaaaataac ctctaagcgg 19021 tcttgcaaag caattgtata gataactgct gctgttttat ctatttgatc aatctcttga 19081 ggttttgctt ctacacaaga ttctcggaaa aagttgttta tttcagctaa ttgcagagat 19141 tcgatcacgg tacgagcttg atctatgtat tcctgacttt tcttattgtc accagcttgt 19201 tttaatgaat ctgcctcttt taaatctaac tcaactaact gtcgataaac tggttcaaca 19261 ttgtcccgaa aagaaaactg tacttctggg ttgattgcta ctaattcgct gcgtaacgac 19321 tgaagggcat gatatgctgc ggtgtaagcg gcgatcgcat ttgcagtctt tccttgatct 19381 ctacgtattc gccccaactg ccaggaaaat tggtaagcta tgtctggtgt tgaaaagctg 19441 ggagctatat tcagagcctg ttttgtcaat tcttctgctt tggctaaatc tcgcgtcggt 19501 cgagtcagtt catacagtcc gccacgattt cccaaagcat tagcttcggc tcttttgtct 19561 cccaaacttc tggcttgttc tgcagctttc gccaatattc tatcaacttc gtcaaaggtt 19621 gggagtttgg aattagcctt gagtgtgaag ttttcttgtt gagctaattt gagtacattc 19681 tcagcaaaat tgatttgtag gtacactccc gtgcgactag cagacaagtt gttcagttga 19741 ggattgagcc tgttccataa ttcttctgct tgtgataaaa ttggcagatt ttctgcttct 19801 gacaagttct ccgcttctgg tgtttctgac gatttttttg gtttcaactg 
actcaacaac 19861 aggttcagtt gattcaattg tgcttgttgt tgtgcagttg gtgatggaga cagctttata 19921 acctcgctgt aagaatccag tgctttttgt tcataactta ttcgtgtgct gcgaagaatt 19981 ttatttactt cagctaaatc acgtgctgta ttgcctaaac tcagataaac agcagcttgc 20041 tcttggggag aattcaattt ttgagcctgc tttaaacttg tttccaaaat catttgagac 20101 tgctctaatt gacctacgaa tcgcagcagt tcccctagac ttcgcaatgc cacgactcta 20161 gataaagaag gagatttatc ggcaacaact tgcagccttt gctttaaaat ttctgtcttg 20221 ggtttattga attccggttt atccaagtca ttgagttcct gacaattact cactccaata 20281 tcttgattta aaacttccaa taaagtacta caagcacgag gagacagacc caaatcttgc 20341 attgcttgag cctgattaat cttgctttgt gctaatttct cctgttcact agtttgactg 20401 tatattttac tagcttgttg ccaagcgttg agagcatcgg ctgtttgacc caattctctt 20461 tgtaaagatc cctgaatatc caaactttga gccaggattt ttaattcttc ttttcctttt 20521 gatgctgttt tcaaaagtgc taagctgtct tcaatcgctt ttttcgcttg ctcccactct 20581 cccagttgtt gataagtcaa agagagattg cttaacgcca ttgcctggtt tagtttatct 20641 ccaatggtcg caaaaacttg tgctgtttgt ttccaagccg ccgccgcttc agtaaacttt 20701 ccactctgat acagcttgac agccttgttt gctagctgtt cagcatcgta tgaagattgg 20761 acaatcgagg ttggtgaaga aactttggca gcaacaacag gcgaaactcc tgagaggata 20821 aataatagag ctgcgagaac aagagaacgc ctcttatata ttcttttgaa aaaattttgt 20881 attttcctgg gtagaagttt ttgtttcatt atattttagt cctgatgtca aaatgttaag 20941 taactcggta caaataaggg aactcccaga tgtttatatt tatgctcgca ctaccttaat 21001 aaaattggga tgctccctac aaataaagtt ataagcctta aactcgtgcg caagaaacag 21061 ctaggaacat atgaaagaaa caccatcctc cttcttaaaa gttaagcgtt aagagttaag 21121 cgttaagagt tccctgttcc ctgttccctg ttccctgttc cctgttccct gttccctgtt 21181 ctctgttctc tgttctctgt tccctgttcc ctgttccctg ttccctgttc cctgtgtttc 21241 ttgaaagata gagagcaaat ggcagatgca ttttatgggt tttgagtatt ttacgtttaa 21301 tttggctgct ctacttgtga agaattttta tctcaccgca gcacaactac tcgcaggtgt 21361 ttgttgctga cgttgcggac ctgttttggt gggatcataa cctaccagca gcacctcacc 21421 tttttcgtta aatatccagc cttgggctgg tattatgcgt ttgacagttg gactcttaga 21481 tggctgatcc acagttacag ttattgaatt 
tgttgtcgtg ctaggcttga ccaaatccac 21541 acgcacattg tcactgctga ggattttctt aggatcagtt ggaattcccc cacgtccgat 21601 gatggtaaag ctgctaccaa aacctttggt aaaaggattt tgggctattt gctgtgcggg 21661 gtcaataaca gtttctgata attcaaataa cccacgggtg gggtcaacat ctggggtgat 21721 aatgttgata gtaccactca agtttggatt gttttgagaa atagctgtga tgtcattcgt 21781 aggt // LOCUS NODE_1462_length_21627_cov_10.14430721627 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21627) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21627) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..21627
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            74..907
                     /locus_tag="DP116_13245"
     CDS             74..907
                     /locus_tag="DP116_13245"
                     /inference="COORDINATES: protein motif:HMM:PF00534.18"
                     /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="glycoside hydrolase" /protein_id="PRJNA477356:DP116_13245" /translation="MFRVARRAGLITVGEQMIAPAAVEWAEAERQHEAWPGWEDKPDR RTYDRLRRFEEATWAAADRLGAPSEYVRHGLVGQGVSADRVALVPYPAEVGPFEYVDR RGRTGPLTVGFVGQVNLRKNAPTVFTVARRFRPTEVRFVMVGRVYLKAAAVAAHKGEV ELTGPLPRSEVPGRLAGFDLFLFPSTCEGSAGAVIEAMATGLPVLTTPNSGSPVRDGV DGFVLAPGDVDGFARRIAELAADPDCRHALGAAARERVAGLTLDRYGRELAAVFDGAQ A" gene 919..2112 /locus_tag="DP116_13250" CDS 919..2112 /locus_tag="DP116_13250" /inference="COORDINATES: protein motif:HMM:PF00534.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase family 1" /protein_id="PRJNA477356:DP116_13250" /translation="MLPDLGHYHHARIQAYAERGEADVTVVVVGGKGLFAEFAHRGDR PPAYRTVTLFPDRFFADLPAADLGRAVENALAAERCSVVLAQGWAAAYSLAALRWAVA RDVPRVVTSESQRDDFARSPVKEWVKRRLVGLCGAALCGGTRHAEYARDLGVPADRIF LGYDAVDNAHYAAGADAARTNPGARTRLGLPPAYLLASARFIPKKNLPGLLAGYAAYR AAVGPGAWELVVLGDGAGRPELEARRGELGLAGVVHLPGFRGYDALPAYYGLAEGFVH ASLVEQWGLVINEAAAAGVPVVASDRCGATADLVQPGRTGWAFAPTDSAALGAALVEL HRHPDRRSLGQAGRQLVAEWGPERFAAGLSAAVAAATPARRPGLVGRAVLRAVAARPP VTGGE" gene 2147..3082 /locus_tag="DP116_13255" CDS 2147..3082 /locus_tag="DP116_13255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020263471.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GDP-L-fucose synthase" /protein_id="PRJNA477356:DP116_13255" /translation="MSHLAGKRVVVTGGAGFLGSQVVRRLADFGPAHVATPRKAEYDL TEQGQVRKLLDDLKPQVIVHLAAVVGGIGANRENPGRYFYENAVMGILLMEEARKRGV QKMVTVGTICSYPKFTPVPFKEDALWDGYPEETNAPYGVAKKALLVQAQAYRQQYGFA GVTLLPVNLYGPGDNFDPASSHVIPALIKKVVDAREAGRKHIDVWGTGTASREFLFVR DAADGIATAADRYDHPDAVNLGSGREITIKALTELVCELCRFDGELRWDPTKPDGQPR RCLDTTRAAERFGWTASTDFRTGLMETIAWYEANR" gene 3181..4083 /locus_tag="DP116_13260" CDS 3181..4083 /locus_tag="DP116_13260" /inference="COORDINATES: protein motif:HMM:PF13472.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SGNH/GDSL hydrolase family protein" /protein_id="PRJNA477356:DP116_13260" /translation="MVASEAGLRWGLGLGRPALLQADPEIGYLFQPDQRLTRFGRRVV INGYHQRTGPVEPRPPAGTLRVLCVGDSVTFGGTLTDQAETYPELLAARLRETHRDGP VEVLNASAGSWGLGNEAAYLRRFGTFGSRWVVLQIGTHDLTQEPSTGAVVGVADTHPD RNPPAALIELFTRYVRPRLVGNVAEPALPPPPERPADASLADNLARLADMVRQTREAG GQPLVLHTPNRDEVTGAALPNPADESRRTTFLDRCRQLVVPVVNLRSEWATRPDAVSF YRDEVHLNPAGNRAVADRLALALP" gene 4080..5306 /locus_tag="DP116_13265" CDS 4080..5306 /locus_tag="DP116_13265" /inference="COORDINATES: protein motif:HMM:PF13692.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13265" /translation="MTDRPHAVLLYHYFQPDDVISARLFGELADGLVARGWDVTAVPC NRGCRDEVVTRPARERWHGVDIRRVWRPRFKQASNRGRLLNAGWMIAAWGLTAAALPS RKREVVVVGTDPVLGVLAALPWGLWRRRTAVAHWCHDLYPEAPVADGMVGERSPAVRG LKAVLAQAYRRCKLLADLGPCMRERLAAYGSPGRAVTLTPWALVEPPAPVEPDPATRR DLFGDAALGLLYSGNFGRAHSHAEFLDLARRLRDTPVRLCFAGRGNRMDELKAAVRPD DTNVSFAGFAPEGELERRLGACDLHLVSLRPEWAGTVVPSKFFGALATGRGVVYAGPP DSAIARWIEEHQVGWVLTPATAGAVADDLRALAADPGRLASLRRRCHAVYHHHFSRQA QLARWDAELRQLLPPG" gene complement(5371..6054) /locus_tag="DP116_13270" CDS complement(5371..6054) /locus_tag="DP116_13270" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13270" /translation="MGLLSGVKHDCAAVFELRESGSALTDAAGQVVHIELSAVYPLLK SSDLARGRLATRRRMLVTQTSLDADPEQLRGTAPAAWAYLESAADRLAARRSSIYRGR HRFAQFGVGPYTFIDWKVAVSGLYKQVQFQVIGPFHGQPVVFDDTCYFLPCPSEDTAS RVGHLLDSTPARGYLRAFMFSDAKRPITAALLSRLDLDAVARYLGEQPTGLQPSGWSA DTASISLLD" gene complement(6253..6849) /locus_tag="DP116_13275" CDS complement(6253..6849) /locus_tag="DP116_13275" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13275" /translation="MPTSSAEFGDYQTPDGLVTEVYRLLVTAGVGPKMVIEPTCGSGN LLAAAAGAFPNAACVSVADINSAYVDAAVAPLAADRRVSVRRADFFTEDWSRVIAAGP DPVLVVGTPLGDICRRWKVVRHQRAAEVKLRPQGGACGQDWRQQLRHRRVDAAVAPTA LSTVRVAGRSVQAIRCPQAARPVLGRRRSVRGGRPVSD" gene complement(6852..7505) /locus_tag="DP116_13280" CDS complement(6852..7505) /locus_tag="DP116_13280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006188454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="restriction endonuclease" /protein_id="PRJNA477356:DP116_13280" /translation="MNPPLTLDGLLAEATGFAAAYSQVAFPELYGVDNGKTVGTHLEQ TFIRQLLATYGFPPGNAAKGIDLPHLDVDFKTTSLKQPQSSCPFRSARQKVYGLGYSL LVFVYRKADDPVAKTARLTVAHAVFVERTYTADYQTTTGLLKLPANHANKDDLPAFFA ERMLPLDEIGASDLADEVLRTPPLVGYLTISNALQWRLQYSRVIEQAGTVPGLRRFV" gene complement(7601..9781) /locus_tag="DP116_13285" CDS complement(7601..9781) /locus_tag="DP116_13285" /inference="COORDINATES: protein motif:HMM:PF13519.4,HMM:TIGR02226" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13285" /translation="MHPILLAGVALVGLPVLLHLILKQEPKRLPFPALRFLQQKKKVS QRKLRLKQLLLLALRCLLVALFALALYQPRLSGSGLAGLADEQPVACVLLVDTSPSMG YQIAGVTRLADARKRALELLDELPAGSKVAVLDPADPAVQWEPTAGDARRRLESLLDP HGGGPPLTTGLTAAYQLLRTADEDQPEPLPKLVAVFTDRAVACWPADRTADLKSLAAQ LPPPGVSHLVFDVGTDGPANVSVLDLVAKPASGPAGQPVTLTATLKSVGGDVDAEAAL SLDDGPPQPQAVRLVAGQPTAVRFALTDLPKGLHQAKVTVRDDALAADNVRHVTFKVG DARTILTVCDDPAAANLWALAHEAKGEFNVVVTTPDEAKDFSRYEFVTLLGLADPGKP LADGKSLWDRLQPYLAAGGKLLVVLGGEGINPAGYDRPFLPGKLTRLVDSTRQPAPTP GERDRRLGAVWVLDEAAERQPLLAPFKDWERAGVGFIKDPRKTVKYWAADAPGENVVV RYDDADRTPAVLEKTLPGGGRVVLLTTRLDGLASDPADKWNDNWTLDGTDWPTVWPHR LAVHLAGASGEATFTYPTGQPVTLTLPKGGLPKGTKLTLEGPGVNGPDATQPVADGQT EWRLPPPMSLTAGSYRLRAGDWQDGFSLNPPADEFDLTKVPAERIEEATGPGSVVPVG KAVTLREVLDRKLGGELNLFPYLLIGVLLLFAGEGLLANRFYRR" gene 9776..10018 /locus_tag="DP116_13290" CDS 9776..10018 /locus_tag="DP116_13290" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13290" /translation="MHTGMLRGGVRRRSGGGAAELVQAVAGEDFGRRYGHDLAVPPFG HRPPHAGVFEQGVAGGEIVCPLGSRPSGDDGSQARA" gene complement(10026..10601) /locus_tag="DP116_13295" CDS complement(10026..10601) /locus_tag="DP116_13295" /inference="COORDINATES: protein motif:HMM:PF06414.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13295" /translation="MPRVAIVAGINGAGKTTASQRVLRDLLQMPCFVNADALARGLNG FNPESEAAKAGRLMLDHLHELATAGKDFSFETTLSGRAYAPWLRDLRANGYEVYLYYY WLRSPELAVERVANRVRSGGHHIPEPTIRQRYAKSIRNFFDLFRQQADYWEVCNNSNG RAVLFALGNPTEELVVDETLWAAFHRSGQHG" gene 10662..11975 /locus_tag="DP116_13300" CDS 10662..11975 /locus_tag="DP116_13300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010043049.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyrrolo-quinoline quinone" /protein_id="PRJNA477356:DP116_13300" /translation="MTRLLASALALTLAPSLRADDWPQWMGPQRDNVWRETGLVDKFP AGGPKVVWRTPVASGYAGPAVVGGKVFLADYASPKPLPEDGNFNRKPTDGTESFLAFD AATGKELWKQSFPVKYAISYPAGPRCTPLVAGSLVYFLGAEGHLLACDVNSGAIKWQV ELKDAYKTTSDLWGYAAHPLLDGDRLIVLAGGEGSHVVALNKDTGKEIWRSQTSKGQG YAPPLLTDAGGVRQMIVAGPAAVVGLDPATGKRLWTTPYAATSGSIIMTPVRVGDYLL VAGYDNKNLLLKLLADKPGVEVVWKDKLRMALSPVNVQPIADGTTVYGLHQSGELMAV AIPTGDRLWTTTAPLAAAEAPAPANGTAFVVRAGDHYILFNDLGELILCRLSPKGYEE IDRAKVIEPTGAAFGRKVVWSMPAFANKRAYIRNDKELICVELGK" gene complement(12020..12919) /locus_tag="DP116_13305" CDS complement(12020..12919) /locus_tag="DP116_13305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013627854.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF58 domain-containing protein" /protein_id="PRJNA477356:DP116_13305" /translation="MTHAEQYLRPEVIQQVSRLDLRAKFIVRGFLSGLHGSPFQGASV EFAEHRVYTPGDDVKDLDWNVYAKTGRHYVKRFKAETNMTGHLVLDLSGSMGYTYRQA LTKFDYGVCLAAALGYLMVHQQDPVGLVTFDTKIRTVLPPKAKRSQIGSLLSVLANSK PAGETDAAGALTQLAGLIRGRGLVMLFSDLLTEIDPLVKSLYRLRHAGHEVILFHILD EAEVHFPFQGRIEFEDVETPAKLEVDARGIRDDYLSGLGEYRAQLKRECGAADVDYVP MDTSVGFDVALLEYLHQRTRRFG" gene complement(12916..14076) /locus_tag="DP116_13310" CDS complement(12916..14076) /locus_tag="DP116_13310" /inference="COORDINATES: protein motif:HMM:PF00072.22,HMM:PF00512.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13310" /translation="MLQAQKMESLGLLAGGIAHDFNNILTGVMGYIDLARAELPADAP ARRLLTEAARNTERAADLTRQMLAYSGKGRFVVTAVDLTALTLAAKSLLEVSVSKKCR LGFDLQAGLPACQADATQLEQVLMNLVINGSEALGGAAGEVTVRTGAGWFEPAELRSA GVHDRLPAGEYVWLEVADTGGGMSAETAGKMFDPFFSTKFVGRGLGLAAVLGIVRGHR GAITVDTAPGRGTRVRVLLPAVAGPDLPTPAPAAAAGWRATGTVLVADDEPVVRQVAA GMLERLGFRVVLAADGREAVAAVRAGGVDLVLLDLLMPVMDGREALRDIRAAAPGLPV LLSSGYDEQQAADADGLAGFDGFVRKPYRLHHFVTELRRVYERRPAGGTIPT" gene 14077..14592 /locus_tag="DP116_13315" CDS 14077..14592 /locus_tag="DP116_13315" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13315" /translation="MPLQLPPLVLGLLAAGDLLPQLGVDGRQPRQGKRPRQRRDEGQH GRPGGDRGDELDHPGQAVSRVPEERRLDGVADAAAQHEGGEQPEHRQERHVPPAEDEV RQQARDGEVGGGDTAVGQDVEPAVEGGPEAARPAGHEPVGGEQAGGQVEHVSPTVGPP GGLSPAGHPMV" gene complement(14677..15117) /locus_tag="DP116_13320" CDS complement(14677..15117) /locus_tag="DP116_13320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018397430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1810 domain-containing protein" /protein_id="PRJNA477356:DP116_13320" /translation="MTADPHDLDRFVRAQADVYATALAEIRAGDKQSHWMWFVFPQFA GLGVSPMSQRYAIRSRAEATAYLAHPVLGPRLIECATAALGVADRSAEDIFGRTDALK LKSSATLFAAVSPPGSVFEQLLDHFFAGDHCDITRTRLARSDRQ" gene complement(15340..18552) /locus_tag="DP116_13325" CDS complement(15340..18552) /locus_tag="DP116_13325" /inference="COORDINATES: protein motif:HMM:PF13517.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13325" /translation="MELGNTVVAENTTAGKAEFADLRGWFFNTTAAARNTLRLTGYTA RPQPGNVVGVLDTGATLFNTSAAVTVAANGTSTTPLVPGLGALADHGGGLLTRVPQAG SVLIDITNTGGYTELQSLPGVATNLDERGSGFARRAGFTLDAGAVEVQNPGTFLVVPA LPGQFGPPQQVTAPNSLFPQPITVSAQDEYGLPVTTQKVELQFNPVGSGVGATFPLGS KLVINPDAPATAGSGSGFVYLAARPVVGTFTQTASFGGQQLGFRLYVRPGLGSSDVIR PTLPADGSVPAQFQAVVGTDYAPLSVTVRDSSGTPLAGATVTFTVVLPAGGKGSDNPF AVPAVSASGSFTAGGDPAVVSQLATVTSDASGVATITLKAGTKAGLVTVLVSATATNT NGLLQVAANDTVTLQNLPDVASRVSPALPDADGNEQSGNGQLVRILSAPAKPLVFLVS DQYLNPRPGVTVTLNDLDTGRLAGLANDGVTAVSDADGRATFSAATGNAAVANAQVEA APYTVRASAGELVADVQLTNVPGLPAGLAILDGNNQSAPVSDSRVAVGESNQFPSLLR VRVTDAGGNPVPAGTVISFTGVGIRFESARVPTGDDGVAEVRVAPSEQAGTFTATALA DTGASATFTLTATAGLPTSVAVIAGDAQTARAGATLSPVRVQVLDQYQNPVPGVAVTV AGGGGTFTSTLTGADGRSTVTVVAGAVAGQYTASASVGSLTAGFGFTVTGIPVDLDNP PPVVGLPSLTAVGLGDGIASRVTVYNPDGSVRTQFTPFDPGFLGGTRAAVARDPRGNR VVVVPGPGRFPDARVFSADDGTEVATFQPFEVGFTGGLFVASADIDRDGYEDYVFSAD VGGGPRVKITSGKTGATLQDFFGIEDTNFRGGARVAVGDVNGDGTPDLIVAAGVGGGP RVAVYDGTTVIGVRNTPRRLVGDFFAFEQALRNGAYVAAGDVDGDGKAEVVLGGGPGG GPRVRVLSGADLLNNRQTAVSDFFAGPQTDRGGVKVTVRDLSGQTGTTGAKSDLVAAS GSGDGSRVTVYLGSQLRPGQPPAARGFDDLSGFRGGVFVG" gene 18634..19860 /locus_tag="DP116_13330" CDS 18634..19860 /locus_tag="DP116_13330" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13330" /translation="MDAAGPARQGHVPRHRRVENPHVQPAAVHPDAAAGVGRVSVERA VDDAADRVAGVRQVVSASRLVQVGRGEQPATPAAGRVAAERAAADEEVVVPGEQPAAV PGRRLVVGELGLAEGVGDAVGHPERPAATARQGHRPAPLRGRVGVEIAVGELTLPGDG QGVGEGHRPAVLGGRAVQDPHVVERHRGVLHPHRPAGVGRPAVQRQVVDHHPAVPVGG AADVEEPEGRGAGHPGPAEGGPVAVQGDERGRHLHRGGRHQRPEGAAAHGGVVEHVGA AGEQPDGVGRVVRAGGGHIGPQLGVGGGGGDGRIDRADVHRRHVAGFQPEQVEAAGGQ AAVRRHKVCPGRPGCPAGECAVHPDCAPADPCQQGGRRHQSGEAAAGVAVPPAAPADR PTSPDDSPPAGVLPAG" gene 19918..20178 /locus_tag="DP116_13335" CDS 19918..20178 /locus_tag="DP116_13335" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13335" /translation="MTPEALNTFVKAVPFRPFRITLVNGTTYDIRHPEMLLLTRRDAF LSVRDNPDGEFADRVIAIGLSVVASATQLDTPAPAQPQDSAA" gene 20175..21569 /gene="rimO" /locus_tag="DP116_13340" CDS 20175..21569 /gene="rimO" /locus_tag="DP116_13340" /EC_number="2.8.4.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010050024.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S12 methylthiotransferase RimO" /protein_id="PRJNA477356:DP116_13340" /translation="MTRSLPLAADAPAKPAGSFSFVSLGCPKNTVDSERMLGKLAQDG YALQPDADGADVVVVNTCGFIEPARQESMAVIREMLELKKQGRIGGVIVAGCLAERNR EGLIAELPGVDQVLGVFGREEIAAAVAGVAAKRKTLTGLDLFPPAPARALPDTDRLRI TPRHFAYLKISEGCDRLCTYCAIPQMRGKHATKPLEQVLTEARELAADGVRELNLVAQ DSTYWGMDLYGKPRLADLLRELDRIDGLEWVRLLYAYPEHVTDELLAVMGSSKKLVPY IDIPLQHISDRVLRRMVRRVDRAATETILHRLRAAVPGIAVRTTFIVGFPGETDADFD ELLSFVRDFQFERAGVFPYSFEPTTSSAKLDGHLPEEVKLARRDALMEAQQAVAFAHA AGQVGKELAVLIDGPDADSPTQFAGRTTADAPDIDCAIRVKGKGLRAGDLVTAKVTAA DGYDLLGRAVGKPR" BASE COUNT 3095 a 7724 c 7717 g 3091 t ORIGIN 1 atcatctgct cgccgacggt gatgaggccg gcacggcggg ctgtacggct gcgtccgcaa 61 cgtccacccg acgttgttcc gggtcgcccg ccgtgccggc ctcatcaccg tcggcgagca 121 gatgattgcc ccggccgccg tcgagtgggc cgaggccgaa cggcaacacg aagcttggcc 181 gggctgggag gacaagccgg accgccgcac ctacgaccgg ctacggcggt tcgaggaggc 241 cacctgggcg gcggccgacc ggctgggggc gccgtcggag tacgtccggc acgggctggt 301 ggggcagggg gtgtcggccg accgggtggc cctggtgccg tacccggccg aggtcgggcc 361 gttcgagtat gtagaccggc ggggccggac cgggccgctc accgtcgggt tcgtcgggca 421 ggtgaacctc cgcaagaacg cgccgacggt gttcaccgtg gcccgccgct ttcggccgac 481 cgaagtgcgg ttcgtcatgg tcggccgggt gtatctgaag gccgccgccg tggccgccca 541 caaaggtgaa gtggagctaa ctggcccgct gccacggtcc gaggtgccgg gccggttggc 601 cgggttcgac ctgttcctgt tcccgagtac ctgcgagggt tcggccgggg cggtgatcga 661 ggcgatggcc accggactgc cggtcctcac caccccgaac agcggcagcc cggtgcggga 721 cggggtggac gggttcgtgc tggccccggg cgacgtggac gggtttgctc gccgcatcgc 781 cgagctggcc gccgacccgg actgccgaca cgccctcggg gcggcggccc gggagcgggt 841 ggccgggctg accctggacc ggtacggccg cgagctggcg gcggtgttcg acggggcgca 901 ggcgtgaggc tctgctgggt gctgccggac ctcggccact accaccacgc ccgcatccag 961 gcctacgccg agcggggcga ggccgacgtg acggtggtgg tggtcggcgg caaggggctg 1021 ttcgccgagt tcgcccaccg cggcgaccgg ccgccggcct accgcaccgt caccctgttc 1081 ccggaccggt tcttcgccga cctgccggcc gccgatttgg gtcgggcggt tgaaaacgct 
1141 ctcgccgccg agcggtgttc cgtggtgctg gcccaggggt gggcggcggc gtacagcctg 1201 gccgccctgc gttgggcggt cgcccgcgac gtgccccgag tggtgacgag cgagagccaa 1261 cgggacgact tcgcccggtc gccggtcaag gagtgggtga agcggcggct ggtcgggctg 1321 tgcggggcgg ccctgtgcgg cggcacccgg cacgccgagt acgcccgcga cctgggggtg 1381 ccggccgacc gcatcttcct cggctacgac gcggtggaca acgcccacta cgctgccggg 1441 gccgacgccg cccggaccaa ccccggggca cggacccggc tcgggctgcc gccggcctac 1501 ctgttggcgt cggcccggtt catcccgaaa aagaacctgc caggtctgct cgccgggtac 1561 gccgcctacc gcgccgccgt cggcccgggg gcgtgggagc tggtggtgct gggcgacgga 1621 gccggacggc cggagctaga ggcgcgtcgt ggcgagctgg gactggccgg cgtcgtccac 1681 ctgcccgggt tccgcgggta cgacgccctg ccggcgtact acggactggc cgaggggttc 1741 gtccacgcca gcctagtgga gcagtggggg ctggtgatca acgaggcggc ggcggccggg 1801 gtgccggtgg tggcgtccga ccgctgcggg gcgacggccg acttggtgca gcccgggcgg 1861 accggctggg cgttcgcccc gaccgactcg gccgcgctgg gcgcggccct ggtcgagctg 1921 caccgccacc cggaccgccg gagcttggga caggcaggcc gtcaactggt ggccgagtgg 1981 gggccggagc ggttcgccgc cggcctgtcg gcggcggtgg cggcggccac cccggcccgc 2041 cggccggggc tggtggggcg ggcggtgctg cgggcggtgg ccgcccgccc gcccgtcacc 2101 ggcggcgaat agaacagtcg agtcaacctc tccgagagtg gcagtcatga gtcatctggc 2161 cggcaagcgg gtcgtcgtca ccgggggggc cgggttcctg gggtcgcagg tggtgcggcg 2221 gctggccgac ttcggcccgg cccacgtcgc caccccccgc aaggccgagt acgacctgac 2281 cgaacagggt caggttcgca agctgctcga cgacctcaag ccgcaggtga tcgtccactt 2341 ggcggcggtg gtcggcggca tcggggccaa ccgcgagaac cccggccggt acttctacga 2401 gaacgccgtc atgggcatcc tgctcatgga ggaggcccgc aagcgggggg tgcagaagat 2461 ggtgacggtc ggcaccatct gctcgtaccc caagttcacc ccggtgccgt tcaaggagga 2521 cgccctctgg gacggctacc cggaggagac caacgccccc tacggcgttg ccaagaaggc 2581 gctgctggtg caggcccagg cgtaccgcca gcagtacggg ttcgccggcg tcaccctcct 2641 gccggtcaac ctgtacggcc ccggcgacaa cttcgacccg gcgtccagcc acgtcatccc 2701 ggcgctcatc aagaaggtgg tggacgcccg ggaggcgggc cgcaaacaca ttgacgtgtg 2761 gggcaccggc acggccagcc gggagttcct gttcgtccgg gacgcggccg acggcatcgc 2821 
caccgccgcc gaccggtacg accacccgga cgcggtcaac ctgggtagcg gccgggagat 2881 caccatcaag gcgctgacgg aactggtgtg cgagctgtgc cggttcgacg gcgagctgcg 2941 gtgggatccg accaagccgg acggccagcc ccgccgctgc ctggacacca ctcgcgccgc 3001 cgagcggttc ggctggacgg cgtccaccga cttccgcacc ggcctgatgg agaccatcgc 3061 ctggtacgag gccaaccggt gacggacacc ccgtacccgc cacggggcgg gttgtcgtcg 3121 gtcggtcgcc ggccgcgtcg gcggtggctg gtccgcgggc tgctagcggt agtggggctg 3181 atggtggcga gcgaggccgg gctgcggtgg gggctgggcc tcggccgccc ggcgctgctc 3241 caggccgacc cggagatcgg ctaccttttc cagccggacc agcggctcac gcggttcggc 3301 cggcgagtgg tgatcaacgg ctaccaccag cggaccgggc cggtcgagcc gcggccgccg 3361 gccggcaccc tgcgggtgct gtgcgtcggc gactcggtga cgttcggcgg caccctcacg 3421 gaccaagccg agacctaccc cgaactgctc gccgcccggc tgcgggagac ccaccgcgac 3481 ggcccggtgg aagttctcaa cgcctcggcc gggtcgtggg ggctgggcaa cgaggccgcc 3541 tacctccgcc ggttcggcac cttcggcagc cgctgggtgg tcctccagat cggcacccac 3601 gacctgaccc aggagccgtc caccggggcc gtcgtcggcg tggccgacac ccacccggac 3661 cgtaacccgc cggccgccct gatcgaactc ttcacccgct acgttcgccc gcggctggtc 3721 ggcaacgtgg ccgagccggc cctgccgccg ccgcccgagc gacccgccga cgcctccctg 3781 gccgacaacc ttgcccggct ggccgacatg gtgcggcaga cccgggaggc tggcggccag 3841 ccgctcgtcc tccacacccc gaaccgcgac gaagtgaccg gagcagccct gccgaacccg 3901 gccgacgagt cgcggcgaac cacctttctt gaccgctgcc ggcaactcgt ggtgccggtg 3961 gtcaacctgc ggtccgagtg ggccacccgc ccggacgccg tctcctttta ccgcgacgag 4021 gtccacctca acccggccgg caaccgggcg gtggccgacc gcctggccct cgccctccca 4081 tgaccgaccg cccgcacgcc gtcctcctgt accactactt ccagccggac gacgtgatca 4141 gcgcccggct gttcggcgag ctggccgacg ggctggtggc ccgcggctgg gacgtgaccg 4201 ccgtgccgtg caaccgcggc tgccgggacg aggtcgtcac ccgaccagcc cgcgaacgct 4261 ggcacggcgt cgacatccgg cgggtgtggc gaccgcggtt caagcaggcc agcaaccgcg 4321 gccggctgct caacgccggc tggatgatcg ccgcctgggg gctgaccgcc gccgccctcc 4381 cgagccgcaa gcgggaggtg gtggtggtcg gcaccgaccc ggtgctgggg gtgctggcgg 4441 ccctgccgtg ggggctgtgg cggcgccgca cggccgtcgc ccactggtgt cacgacctgt 4501 acccggaagc 
ccccgtcgcc gacggcatgg tgggcgagcg gtcaccggcg gtgcgggggc 4561 tgaaagcggt gctggcccag gcgtaccgcc ggtgcaagct gctcgccgac ctcggcccgt 4621 gcatgagaga gcggctggcc gcctacggct cgcccggccg ggccgtcacc ctgacgccgt 4681 gggcgctggt cgagccgccg gccccggtcg aacccgaccc ggccacccgc cgcgacctgt 4741 tcggcgacgc cgccctcggc ctcttgtaca gcggcaactt cggccgcgcc cacagccacg 4801 ccgagttcct cgacctggcc cgccggctgc gggacacgcc cgtccgcctg tgcttcgccg 4861 gccgcggcaa ccgtatggac gagttgaagg ccgcggtgcg gccggacgac accaacgtga 4921 gcttcgccgg gttcgccccg gagggcgaac tcgagaggcg gctcggggcg tgcgacctgc 4981 acctagtcag cctgcggccg gagtgggccg ggacggtggt gccgagcaag ttcttcgggg 5041 cgctggcgac tggccggggg gtggtgtacg ccggcccgcc ggacagcgcc atcgcccgct 5101 ggatcgagga gcatcaggtc ggctgggtgc tgaccccggc cacggccggg gcggtcgccg 5161 acgatttgcg ggcgctggcc gccgacccgg gacggctggc cagtctccgg cgtcgctgcc 5221 acgccgtcta ccaccaccac ttctcccgcc aggcccaact cgcccgctgg gacgccgaac 5281 tccggcaact gctgccgccg ggctgaccta atcttgtcac ttctcccatt ccatactgac 5341 ttggctaatt gcccctaacc atacggcgat tcagtcgagc aagcttatac ttgcggtgtc 5401 ggcagaccaa ccgctcggct ggaggccggt cggctgctcg cccaagtaac gcgcgaccgc 5461 atctaagtcg agtcgggaca gcagtgcagc cgtgatcggc cgcttcgcgt ccgagaacat 5521 gaaagcgcgc aagtacccgc gggccggtgt ggaatcgagt aggtggccga cgcgagaggc 5581 cgtgtcctcg gacggacacg gaaggaagta acaggtgtcg tcgaacacga ccggctgtcc 5641 gtgaaagggg ccgatgactt ggaactgtac ctgcttatac agccccgaga cggccacctt 5701 ccagtcgatg aacgtgtacg gccctacgcc gaactgcgcg aatcggtggc ggcctcggta 5761 gatggagcta cgtcgcgcgg ccaaccgatc ggctgccgat tcgaggtatg cccaagcggc 5821 aggggccgtc ccccgcaact gctccgggtc ggcgtctaga gaggtctggg tgactagcat 5881 ccggcgacgg gtcgccaacc gtccacgcgc caagtctgaa cttttgagca acggataaac 5941 ggccgacagt tcgatatgga ccacctgccc ggcagcgtcg gtcaaagcgc tacccgactc 6001 gcgcagctcg aacacagccg cgcagtcgtg tttaacaccc gacagcagcc ccacgtccgg 6061 ccggtgtcca agtggtgcca gcggtcgtaa gccgccgcgt cggccaccaa cttgccgtca 6121 cgccagccga aggcgccctg tgggctagtg gccaacctcg catgttcggt gcaactcatc 6181 gcccccgtcg ctggcccgca 
ccgtaccacc agtagcccgg cctcgacgga tgcgccgaac 6241 caccgcttgg cgtcaatcag atacaggtcg gccgcctcga accgaacgcc gtcggcccag 6301 aaccgggcga gcagcttgcg ggcaacggat cgcttgtaca gaacggccag ccacccggac 6361 cgttgagaga gcggtcggag caactgcagc atccactctg cgatgtcgaa gttgctgccg 6421 ccagtcttgg ccgcaagccc cgccttgcgg tcgaagtttg acttctgcgg cacgttggtg 6481 ccggaccact ttccaacgtc ggcagatgtc acccaggggg gtgccgacca ccaatactgg 6541 atcagggcca gccgcaatca cccgcgacca gtcttcggtg aagaagtcgg cacggcggac 6601 agacacccgt cgatcggctg ccagcggcgc aacggctgca tcgacatatg ccgagtttat 6661 gtctgccacg gacacacatg cggcgttcgg aaatgcgcca gcggcagccg cgagtaagtt 6721 cccgctgccg caggtcggct cgatcaccat tttcggccct actccggctg tgaccagcag 6781 ccgatagacc tcggtcacga gtccgtccgg cgtctggtag tcgccaaatt cggcggaact 6841 ggtcggcacg gtcagacaaa ccgtcggagg ccgggcacag tgcccgcttg ctcgatgacg 6901 cgggagtatt gcagccgcca ttggagcgcg ttcgagatgg tgagatagcc gacaagtggt 6961 ggcgtccgta gcacctcgtc cgccaagtcg gacgccccaa tctcgtcgag cggcagcatg 7021 cgctcggcga aaaaggccgg cagatcgtcc ttgttggcat gattcgccgg cagtttcaac 7081 aagccggttg ttgtttggta gtcggccgta taggtgcgct ccacgaagac cgcatgagcg 7141 acggtcaggc gcgccgtctt cgcaaccggg tcgtctgctt tccggtagac gaacaccaac 7201 aaggagtagc cgaggccgta gaccttctgt cgggccgacc ggaacgggca ggacgactgg 7261 ggttgcttca ggctggtcgt cttgaaatcg acatcgagat gcgggaggtc gatgcccttc 7321 gcggcattgc cgggcggaaa accgtaggtg gccagcagtt gccggatgaa cgtctgttcg 7381 aggtgggtgc caaccgtctt gccgttgtcc acgccgtaca gttcggggaa ggcaacttgg 7441 gagtatgctg cggcgaaacc ggtggcctca gcgagtaggc catccagagt cagcggcggg 7501 ttcacacacg ttctcaacgg tcaacataaa tgtatctctt cagtgcagtc gtgacgaggc 7561 ggagctgacc gcggcatcca accagaccca gtaatcctcg tcaccgccgg tagaagcggt 7621 tggccagcag cccctcgccg gcgaacagca gcagcacgcc gattagcaag tacgggaaca 7681 ggttcagctc gccccccagc ttgcggtcca gcacctcccg cagcgtcacc gccttgccca 7741 ccggcaccac gctccccggg ccggtcgcct cttcgatccg ctcggccggc actttggtca 7801 ggtcgaactc gtcggccggc gggttcagac tgaacccgtc ctgccagtcg ccggcccgca 7861 gccggtagct gccggcggtg agcgacatcg 
gcggcggcag ccgccactcg gtctggccgt 7921 cggccaccgg ctgcgtggcg tccgggccgt tcacccccgg cccctccagc gtcagcttgg 7981 tgcccttcgg cagcccgccc ttcggcagcg tcagcgtcac cggctgaccg gtcgggtagg 8041 tgaaggtcgc ctcgccgctg gccccggcca gatgcaccgc cagccggtgc ggccagacgg 8101 tcggccagtc ggtgccgtcc agcgtccagt tgtcgttcca cttgtcggcc gggtcgctgg 8161 ccaggccgtc gaggcgggtg gtgagcagca ccacccggcc gccgccgggc aacgtcttct 8221 ccagcaccgc cggggtgcgg tcggcgtcgt cgtaccgcac caccacgttc tcgcccgggg 8281 cgtcggcggc ccagtacttg accgtcttcc gcgggtcttt gatgaacccc acccccgccc 8341 gctcccagtc cttgaacggg gccaggagcg gctgccgctc ggccgcctcg tccagcaccc 8401 agacggctcc tagccggcgg tcccgctcgc ccggagttgg ggctggctgg cgggtgctgt 8461 ccaccagccg ggtgagcttg cccggcagga acgggcggtc gtagccggcc gggttgatgc 8521 cctcgccgcc gagaacgacc agcagtttgc cgccggcggc gaggtacggc tggaggcggt 8581 cccagaggct tttgccgtcg gcgagcggtt tgcccgggtc ggccagcccg agcagggtga 8641 cgaactcgta gcggctgaag tctttcgcct cgtccggcgt cgtcaccacc acgttgaact 8701 cgcccttcgc ctcgtgggcg agcgcccaga ggttggcggc ggccgggtcg tcgcagacgg 8761 tcaggatggt gcgggcgtcg cccaccttga acgtcacgtg gcggacgttg tcggcggcca 8821 gggcgtcgtc gcgaaccgtc actttggctt ggtgcagtcc cttcggtaag tcggtgaggg 8881 cgaaccgcac ggccgtcggc tggccggcga cgagccgcac cgcctgcggt tgcggcgggc 8941 cgtcgtccag gctcagggcg gcctcggcgt ccacgtcccc gccgacggat ttgagcgtgg 9001 cggtgagcgt caccggctgc ccggccggcc cgctcgccgg cttggcgacg aggtcgagga 9061 cggacacgtt ggccgggccg tcggtgccca cgtcgaacac cagatggctg accccgggcg 9121 gcggcagttg ggcggccagc gacttcaggt cggctgtgcg gtcagcgggc cagcaggcca 9181 ccgcccggtc ggtgaatacg gccaccagct tgggcagcgg ctcgggttga tcttcgtcgg 9241 cggtgcggag cagttggtag gcggcggtca ggccggtggt gagcggcggg ccgccgccgt 9301 gcgggtcgag caaactctcc agccggcggc gggcgtcgcc ggcggtcggc tcccactgca 9361 ccgccgggtc ggccgggtcg aggacggcca ccttgctgcc ggcggggagt tcgtccagca 9421 gttccagggc acgcttgcgg gcgtcggcca ggcgggtgac gccggcgatc tggtagccca 9481 tgctcgggct ggtgtcgacg agcagcacgc aggccaccgg ctgctcgtcg gccaggccgg 9541 cgaggccgga gccggagagc cggggctggt agagggcgag 
ggcgaacagg gcgacgagca 9601 ggcagcggag ggcgagcagc agcagctgct tgagccgcag cttccgctgg ctgaccttct 9661 tcttctgctg gaggaaccgg agggcgggga acggcagccg cttcggctcc tgcttgagga 9721 tgaggtgcag gaggaccggc aggccgacga gggccacgcc ggccaggagg atggggtgca 9781 taccggcatg ttacggggcg gcgtccgtcg caggagcggc gggggggcgg ccgaactcgt 9841 ccaggccgta gcgggcgaag atttcggccg gcgatacgga cacgatctgg ccgtcccgcc 9901 attcggacac cgaccgcccc atgcgggcgt gttcgagcag ggcgtcgcgg gtggcgagat 9961 agtgtgcccg ctcggtagcc ggccgtcggg cgatgatgga agccaggctc gggcttaagg 10021 cggcgtcagc catgttgtcc gctccggtgg aaggcggccc aaagggtttc gtcaacgacc 10081 aactcctcgg tcgggttgcc gagcgcgaac agcaccgccc ggccgttcga gttgttgcac 10141 acttcccagt agtcggcctg ttggcggaac aggtcgaaga agttgcggat gctcttggcg 10201 taccgctggc ggatcgtcgg ctccgggatg tgatgcccgc ccgaccgcac ccggttggcg 10261 acgcgctcga cggccagctc cggactgcgg agccagtagt agtaaagata cacttcgtag 10321 ccattagccc gcaagtcacg cagccacggg gcgtaggccc ggccggacag cgtcgtctcg 10381 aacgagaagt ccttgccggc ggtggccagc tcgtgcaggt ggtcaagcat cagccggcca 10441 gccttcgcgg cctcggactc ggggttgaac ccgttgagtc ctcgcgcgag ggcgtcggcg 10501 ttcacgaagc acggcatctg cagcaggtct cgcagcaccc gctgtgaggc ggtggtcttg 10561 ccggccccgt tgatcccggc cacaatcgct acccgcggca tgtcgcaccc ccgcgactaa 10621 cataccccat cgtccggccc gtttctcgct cgggatgtcc catgacccgc ctccttgcct 10681 ccgcgctcgc cctcacgctt gccccgtccc tccgggccga cgactggccg cagtggatgg 10741 ggccgcagcg ggacaacgtc tggcgggaaa ccggtctcgt ggataagttc ccggccggcg 10801 ggccgaaggt ggtgtggcga acccccgtcg ccagcgggta cgccgggccg gcggtggtcg 10861 gcggcaaggt gtttctggcc gactacgcca gccccaagcc gctgcccgaa gacggcaact 10921 tcaaccgcaa gccgaccgac ggcaccgagt cgttcctggc gttcgacgcg gccaccggca 10981 aggagctgtg gaagcagtcg ttcccggtca agtacgccat cagctacccg gccgggccgc 11041 gctgcacccc gctggtcgcc gggtcactcg tctacttcct cggggccgag gggcacttgc 11101 tggcctgcga cgtgaactcg ggcgcgatca agtggcaggt ggagctgaag gacgcctaca 11161 agaccaccag cgacctgtgg gggtacgccg cccacccgct gctggacggc gaccgactga 11221 ttgtgctggc cggcggcgag ggcagccacg 
tcgtcgcttt gaacaaagac accggcaagg 11281 aaatctggcg gagccagacc tcgaagggcc aggggtacgc cccgccgctg ctgaccgacg 11341 ccggcggggt gcggcagatg attgtggccg ggccggcggc ggtggttggc ctcgacccgg 11401 ccaccggcaa acggctgtgg accacccctt acgccgccac cagcgggtcg atcatcatga 11461 ccccggtgcg ggtcggtgac tacctgctgg tggccggcta cgacaacaag aacctgctcc 11521 tcaaactgct cgccgacaag ccgggggtcg aggtggtgtg gaaggacaaa ctgcggatgg 11581 ccctgtcgcc ggtgaacgtc cagccgatcg ccgacggcac caccgtgtac gggctgcacc 11641 agagcggcga gctgatggcc gtggccattc cgaccggcga ccggctgtgg accaccaccg 11701 ccccgctggc ggcggccgaa gccccggccc cggccaacgg caccgccttc gtggtccggg 11761 ccggcgacca ctacatcctg ttcaacgacc tcggggagct gattctgtgc aggctgtcgc 11821 cgaaggggta tgaggagatc gaccgggcga aggtgatcga gccgaccggg gcggcgttcg 11881 gccgtaaggt ggtgtggagt atgccggcgt tcgccaacaa gcgggcgtac atccgcaacg 11941 acaaggagct gatctgcgtg gagctaggga agtgatgggg gccaaagccc acccatgatc 12001 gtgggtgggc tttggggcgt tacccgaacc gccgcgtccg ctggtgtagg tactccagca 12061 gcgccacatc aaagccgacg ctggtgtcca tcggcacata gtccacgtcg gccgccccgc 12121 actcccgctt gagctgcgcc cggtactcgc ccaggccgct caggtagtcg tcgcggatgc 12181 cgcgggcgtc cacctccaac ttcgccggcg tctccacgtc ctcgaactcg atccgcccct 12241 ggaacgggaa gtggacctcg gcttcgtcga ggatgtggaa caggatgacc tcgtggccgg 12301 cgtgccgcag ccggtacaga ctcttcacca gcgggtcgat ctcggtgagc aggtcgctga 12361 acagcatcac cagcccccgg ccgcggatga ggccggcgag ctgcgtcagt gccccggcgg 12421 cgtccgtctc gccggccggc ttgctgttgg ccagcacgct caagagcgac ccgatctgcg 12481 accgcttggc cttcggcggc agcaccgtgc gaatcttggt gtcgaaggtg acgaggccga 12541 ccgggtcttg ctggtggacc atcaggtagc ccagcgctgc ggccaaacac acgccgtagt 12601 cgaacttggt gagcgcttgg cggtaggtgt accccatcga cccggacagg tcgaggacga 12661 ggtggccggt catgttggtc tcggccttga accgcttgac gtagtgccgg ccggtcttgg 12721 cgtacacgtt ccagtcgagg tctttgacgt cgtcgccggg ggtgtacacc cggtgctcgg 12781 cgaactcgac gctggccccc tggaacggcg acccgtgcag cccgctgagg aacccgcgga 12841 cgatgaactt ggcccgcagg tcgagccggc tcacctgctg gatcacttcc ggccggaggt 12901 actgctcggc 
gtgggtcatg tggggattgt accgccggcc ggccgccgct cgtagacccg 12961 ccgcagctcg gtcacgaagt ggtgcagccg gtacggcttc cgcacaaacc cgtcgaaccc 13021 ggccagcccg tcggcgtcgg ccgcctgctg ctcgtcgtac ccgctcgaca ggaggaccgg 13081 cagccccggg gccgccgccc ggatgtcccg cagcgcctcc cggccgtcca tcacaggcat 13141 gagcaggtcg agcagcacca gatcgacgcc gccggcccgg acggccgcca ccgcctcccg 13201 gccgtcggcc gccagcacca ctcggaaccc gagccgctcc aacatgccgg ccgccacctg 13261 ccgcaccacc ggctcgtcgt cggccaccag taccgtgccg gtcgcccgcc agccggccgc 13321 cgccgccggg gcgggggtcg gcaggtccgg ccccgccacc gccggcagta gcacccgcac 13381 ccgggtacct cggccggggg cggtgtccac ggtgatcgcc ccgcggtgcc cgcggacgat 13441 gcccagcacg gccgccagcc ccagaccccg gccgacgaac ttggtgctga agaacgggtc 13501 gaacatcttg cctgccgtct cggccgacat gccgccgccg gtgtcggcca cctccagcca 13561 gacgtactcg ccggccggca gccggtcgtg gaccccggcc gaccgcagct cggccggctc 13621 gaaccaccct gccccggtgc gaaccgtcac ctccccggcg gccccgccca gcgcctccga 13681 cccgttgatg accagattca tcagcacctg ttcgagctgg gtggcgtcgg cctgacacgc 13741 cggcaggcca gcttgtagat cgaacccgag ccgacacttc ttcgacaccg acacctccag 13801 cagcgacttg gccgccaggg tgagggcggt caggtccacg gcggtgacga cgaaccgccc 13861 cttgccggag taggcgagca tctggcgggt gaggtcggcc gcccgctcgg tgttgcgggc 13921 ggcctcggtc agcagccggc gggccggggc gtcggccggc aactcggccc gggccaagtc 13981 gatgtacccc atcaccccgg tcaggatgtt gttgaagtcg tgggcgatgc cgccggcgag 14041 cagccccagg ctctccatct tctgggcctg gagcatgtgc cgctccagct cccgccgctc 14101 gtactcggcc tgctcgcggc gggcgatctc ctcccgcagt tgggcgttga cggccgccag 14161 ccccggcagg gaaagcgccc gcggcagcga cgggacgagg gccagcacgg tcgcccagga 14221 ggcgaccgcg gtgacgagct tgaccacccc ggacaggcgg tatcacgggt gccagaagag 14281 cgtcgcctcg atggcgtggc cgacgccgca gctcagcacg aaggcggcga acagccagaa 14341 caccggcagg aacggcacgt cccgccggcg gaggacgaag tacgccagca ggcacgggat 14401 ggcgaagtag gcggcggcga taccgcagtc ggacaggacg tggagccagc cgtggaaggg 14461 ggtccagagg ccgcacgccc agcggggcac gaacccgtcg gtggcgaaca ggcgggcggg 14521 caggtcgagc atgtgtcacc gacggtcggg ccgcccggcg gtttgtcacc cgcgggacac 
14581 ccgatggtgt agccctgccc gccaccgggc ggtgggcaaa ccgttgcgca tcacacgaaa 14641 cactcacgac cggcgacggg agaggcgacg catccactac tggcgatccg atcgggccag 14701 ccgggtccgc gtgatgtcgc aatggtcgcc ggcgaagaag tggtcgagta gctgctcgaa 14761 caccgacccg ggcggcgaga cggccgcgaa cagcgtggcc gacgacttca acttgagggc 14821 gtcggtgcgg ccgaagatgt cctcggcgga gcggtcggcc accccgaggg cggccgtggc 14881 gcactcgatc agccgcggcc ccagcaccgg gtgggcgagg taggcggtcg cctccgcccg 14941 gctgcggatg gcgtaccgct gggacatcgg gctgacgccc agcccggcga actgcgggaa 15001 gacgaaccac atccagtgcg actgcttgtc gccggcccgt atttccgcga gggcggtggc 15061 gtacacgtcc gcctgggcgc ggacgaaccg atccaggtcg tgcgggtcgg cagtcatggc 15121 gttgctcccg gttcgagcgg ctgccgagtg ggtgtggcgt ctggtggaac ccggagcgtg 15181 agcgacgggg tggcccccgg ggtggggggc tgtcactaca ccccgggggc caccccgtcg 15241 ctcacgctcc gggttccatc agcagctaca gcggctcggg agccgctcta acagccccaa 15301 agcccacccg ttgcaacggg tgggctttcg cccgaatcgt caccccacga acacgccgcc 15361 gcggaacccg ctcaggtcgt cgaacccgcg ggccgccggc ggctggcccg gccgcagttg 15421 gctgcccagg taaaccgtca cccggctgcc gtcgccgctg ccgctggccg ccacgaggtc 15481 gctcttggcc ccggtcgtgc cggtctgccc gctcaggtcc cgcaccgtca ccttcacccc 15541 gccgcggtcg gtttgtgggc cggcgaagaa gtcgctgaca gccgtctgac ggttgttcag 15601 caggtcggcc ccgctcagca cccgcacccg cggaccgccg cccgggccgc cgcccagcac 15661 cacctcggcc ttgccgtctc cgtccacgtc gccggcggcc acatacgccc cgttccgcag 15721 agcctgctcg aaggcgaaga agtcgcccac cagccgccgc ggggtgttcc gcacgccgat 15781 caccgtcgtg ccgtcgtaca ccgccacccg cggcccgccg ccgacgccgg ccgccacgat 15841 caggtccggc gtgccgtccc cgttcacgtc gcccaccgcc acccgggcgc cgccgcggaa 15901 gttcgtgtcc tcgatgccga agaagtcttg cagggtggcc ccggtcttgc cgctggtgat 15961 cttcacccgc gggccgccgc ccacgtcggc gctgaacacg tagtcctcgt acccgtcccg 16021 gtcgatgtcg gccgaggcca cgaacaggcc gccggtgaag cccacctcga acggctggaa 16081 ggtggccacc tcggtgccgt cgtcggcgct gaacacccgg gcgtccggga accggccggg 16141 gccgggcacc accacgacgc ggttgccgcg ggggtcgcgg gcgacggccg cccgggtgcc 16201 gccgaggaag cccgggtcga acggggtgaa ctgcgtccgc 
acgctgccgt ccgggttgta 16261 caccgtcacc cggctggcga tgccgtcgcc caagccgacg gcggtgagcg acggcagccc 16321 gacgaccggc ggcgggttgt ccaggtccac cgggatgccg gtcacggtga agccgaaccc 16381 ggccgtcagc gacccgaccg aggcgctcgc cgtgtactgc ccggctaccg ccccggcgac 16441 gaccgtcacg gtgctgcggc cgtccgcccc ggtgagggtg gaggtgaagg tgccgccgcc 16501 gccggcgacg gtcacggcca cgcccggcac cgggttctgg tactggtcca gcacctgcac 16561 ccgcaccggg gagagagtcg ccccggcccg ggccgtctgg gcgtcgccgg cgatcacggc 16621 cacgctggtc ggcagcccgg cggtggccgt cagggtgaag gtggccgacg ccccggtgtc 16681 ggccagggcg gtcgccgtga acgtgccggc ctgctcgctc ggggccaccc gcacctcggc 16741 caccccgtcg tccccggtcg gcacccgggc cgactcgaac cggatgccca ccccggtgaa 16801 gctgatgacc gtgccggccg gcaccgggtt gccgccggcg tccgtcaccc gcacccgcag 16861 caggctcggg aactggttcg actcgcccac cgccacccgg ctgtcgctca ccggggccga 16921 ctggttgttc ccgtccagga tggcgaggcc ggccggcagc cccggcacgt tggtcagttg 16981 cacgtcggcc accagctcac cggccgacgc ccgcacggtg tacggggccg cctccacctg 17041 ggcgttggcg acggccgcgt tgcccgtcgc cgccgagaag gtggcccgcc cgtcggcgtc 17101 gctcaccgcc gtcaccccgt cattcgccag cccggccagc cgacccgtat cgaggtcgtt 17161 cagcgtgacc gtcacccccg gccgcgggtt caggtactgg tcgctcacca ggaacacgag 17221 cggcttggcc ggggccgaca ggatgcggac caactggccg ttgcccgact gctcgttgcc 17281 gtccgcgtcc ggcagggccg gcgacacccg gctggccacg tccggcaggt tctgcagcgt 17341 caccgtgtcg ttggccgcca cctgcagcag cccgttcgtg ttggtggccg tggccgacac 17401 cagcaccgtc accagcccgg ccttcgtccc ggccttcagg gtgatggtgg ccaccccgct 17461 cgcgtcgctc gtgacggtcg ccagttggct caccacggcc gggtcgccgc cggcggtgaa 17521 cgacccgctg gccgacaccg ccggcacggc gaacgggttg tcgctcccct tgccgccggc 17581 cggcagcacc accgtaaagg tgacggtcgc cccggccagc ggggtgccgg acgagtcccg 17641 gacggtcacg ctcagcgggg cgtagtcggt gcccaccacc gcctggaact gggccggcac 17701 cgacccgtcc gcgggcaggg tcgggcggat cacgtcgctc gaccccagcc ccggccggac 17761 gtacagccgg aacccgagct gctggccgcc gaagctggcc gtctgggtga acgtgccgac 17821 caccggccgg gccgccaggt acacgaaccc ggagccgctg ccggccgtcg ccggggcgtc 17881 cgggttgatc accagcttgc 
tgccgagcgg gaaggtggcc cccaccccgg acccgaccgg 17941 gttgaactgg agttccacct tctgcgtcgt caccggcagc ccgtactcgt cctgggcgga 18001 caccgtgatg ggctgcggga acaggctgtt cggggccgtc acctgctggg gcgggccgaa 18061 ctgcccgggc agggccggca ccaccaggaa ggtgccgggg ttctgcacct cgacggcccc 18121 ggcgtccagg gtgaacccgg cccggcgggc gaacccgctg ccccgctcgt ccaggttggt 18181 cgccaccccc ggcagggact gcagctcggt gtacccgccg gtgttggtga tgtcgatcag 18241 cacgctgccc gcctgcggca cccgggtgag cagcccgccg ccgtggtcgg ccagcgcccc 18301 cagcccgggc accagcggcg tcgtcgaggt gccgttggcg gcgaccgtca cggccgcgct 18361 ggtgttgaac agggtggccc cggtgtccag caccccgacc acgttgcccg gctgcggccg 18421 ggccgtgtac ccggtcagcc ggagcgtgtt gcgggcggcc gcggtggtgt tgaagaacca 18481 cccgcggagg tcggcgaact cggccttgcc ggccgtggtg ttctcggcca ccaccgtgtt 18541 gcccagttcc acccgggtga acttgctccc cggccgcacg tccagcgagg tgttgccgga 18601 cagctcgaag atgttggcgg tggactgctg ccagtggatg ccgccggccc cgctcgacag 18661 ggccacgttc cgcgtcaccg tcgagttgaa aatccgcacg tccagccagc cgccgtccac 18721 cccgatgccg ccgccggtgt tggccgtgtt tccgtcgaac gtgctgttga tgatgctgcc 18781 gaccgtgtcg ccggcgtccg acaggttgtt tccgccagcc gccttgtcca ggtcggccga 18841 ggagaacaac ccgccacccc ggccgccggc cgtgttgccg ctgaacgtgc tgccgctgac 18901 gaagaagtcg tcgtcccggg cgagcagccc gccgccgtcc ccggacgccg ccttgttgtt 18961 ggtgaactgg gactggctga aggcgtaggt gacgctgttg gccaccccga acgacccgct 19021 gccaccgctc gacagggcca ccgccccgcc cccctgcgtg gccgtgttgg tgtcgaaata 19081 gctgttggtg aactgactct gccaggtgac ggtcagggtg tcggcgaagg ccaccgcccc 19141 gccgtactcg gaggccgtgc tgttcaggat ccgcacgttg tcgaacgaca ccggggcgtt 19201 ctgcacccgc accgccccgc cggtgtcgga cgccccgccg tccaacgtca ggttgttgac 19261 caccacccgg ccgtacccgt cggtggtgcc gcggatgtcg aggaaccgga aggccggggt 19321 gccggccacc ccggaccggc ggagggtggc cccgttgccg tccagggtga tgagcggggt 19381 cgtcaccttc accgcggtgg ccggcaccag cggcccgagg gcgctgccgc ccacggcggc 19441 gtcgttgaac acgtaggtgc cgccggcgaa cagccggatg gtgtcggccg cgtcgtccgc 19501 gctggcggcg gccacatagg accgcagctc ggcgttggcg gcggcggggg agacggccgg 19561 
atcgaccgtg cagatgtaca ccgccggcac gtcgcgggtt tccagcccga gcaggtcgag 19621 gcggcgggcg gtcaggcggc ggtgcgacga cataaagtgt gccctggccg gccggggtgt 19681 ccggcggggg agtgtgcggt gcatcccgat tgtgcgcccg ctgacccgtg ccagcaaggg 19741 ggccggcggc accagtccgg ggaagcggcg gcgggcgtag cggttccgcc ggcggcgccg 19801 gctgaccggc ccaccagtcc tgacgattca ccccccgccg gggttttgcc ggccggttga 19861 cccgcccccg ccggccgtta ggatgggggt tgaagtgtca cccgcccgag gctcaccgtg 19921 acccccgaag cgctcaacac cttcgtcaaa gctgtgccgt tccgcccgtt ccgcatcacg 19981 ctggttaacg ggacgacgta tgacatccga caccccgaaa tgcttctgct cacccgccgg 20041 gacgccttcc tgtccgtccg ggacaacccg gacggcgagt tcgccgaccg ggtgattgcc 20101 atcgggctgt cggtcgtcgc aagcgccacc caactcgaca cccccgcccc ggcccagccc 20161 caggactccg ccgcgtgacc cgctcgctcc cgctcgccgc cgacgccccc gccaagccgg 20221 ccgggtcgtt ctcgttcgtc agcctcggct gcccgaaaaa caccgtcgac tccgagcgga 20281 tgctcggcaa gctggcccag gacgggtacg ccctccagcc ggacgccgac ggggccgacg 20341 tggtggtcgt caacacctgc gggttcatcg agccggcccg ccaggagtcg atggccgtca 20401 tccgcgagat gctcgaactc aagaaacagg gtcgcatcgg cggggtgatc gtcgccggct 20461 gcctggccga gcggaaccgc gaagggctga tcgccgagtt gccgggcgtg gatcaggtgc 20521 tgggggtgtt cggccgggag gagatcgccg ccgccgtcgc cggcgtcgcc gccaagcgga 20581 agacgctcac cggcctcgac ctattcccgc cggctccggc ccgggcactg ccggacaccg 20641 accgcctccg catcaccccg cggcacttcg cctacctgaa aatcagcgag ggctgcgacc 20701 gcctgtgtac ctactgcgcc atcccgcaga tgcggggcaa gcacgccacc aagccgctcg 20761 aacaggtgct gacggaggcc cgggagctgg ccgccgacgg cgtccgcgag ctgaacctgg 20821 tcgcccagga cagcacctac tggggcatgg acctgtacgg caaaccccgc ctcgccgacc 20881 tgctccgcga gctggaccgc atcgacggcc tcgagtgggt gcggctgctg tacgcctacc 20941 cggaacacgt caccgacgag ttgctggcgg tgatgggaag ctcgaagaag ctggtgccgt 21001 acatcgacat cccgctccag cacatcagcg accgggtgtt gcggcggatg gtccggcggg 21061 tggaccgggc ggccaccgaa accatcctgc accggctccg ggcggcggtg ccggggatcg 21121 ccgtgcggac caccttcatc gtcggcttcc cgggcgagac ggacgccgac ttcgacgagt 21181 tgctcagctt cgtgcgggac ttccagttcg agcgggccgg cgtcttcccg 
tactcgttcg 21241 agccgaccac gtcttccgcg aagctcgacg ggcacctgcc cgaagaggtg aagctggccc 21301 gccgggacgc gctgatggag gcccagcagg cggtggcctt cgcccacgcc gcggggcagg 21361 tcggcaagga gctggcggtg ctgatcgacg gcccggacgc ggacagcccg acgcagttcg 21421 ccgggcggac gacggccgac gccccggaca tcgactgtgc catccgggtg aagggcaagg 21481 ggctgcgggc cggcgacttg gtgacggcga aggtgacggc ggccgacggg tacgacctgc 21541 tcggccgggc ggtgggcaag ccccgctaac gacgcggcta accgcgtccg ctcactttct 21601 tcttgcgtcc ggccggacgc ctgtcta // LOCUS NODE_1487_length_21231_cov_4.63623921231 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21231) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21231) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..21231 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(179..2437) /gene="psaA" /locus_tag="DP116_13345" CDS complement(179..2437) /gene="psaA" /locus_tag="DP116_13345" /EC_number="1.97.1.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319188.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem I core protein PsaA" /protein_id="PRJNA477356:DP116_13345" /translation="MTISPPEREEKKARVIVDNDPVPTSFERWAKPGHFDRVLAKGPK TTTWIWNLHALAHDFDTHTSDLEDISRKIFAAHFGHLAVVTIWLSGMIFHGARFSNYE AWLSDPLNIKPSAQVVWPIVGQDILNGDVGGGFHGIQITSGLFQIWRGWGITNSFQLY VTAIGGLVLAGLFLFAGWFHYHKRAPKLEWFQNVESMLNHHLAVLLGCGSLGWAGHII HVSAPTNKLLDAGVAIKDVPLPHEFILNKDLLIELFPSFANGLAPFFTLNWGVYSDFL TFKGGLNPVTGGLWLTDNAHHHLAIAVLFIIAGHQYRTNWGIGHSIREILENHKGPFT GEGHKGLYENLTTSWHAQLGTNLAFLGSLTIIIAHHMYAMPPYPYLATDYATQLCIFT HHMWIGGFLIVGGAAHATIFMVRDYDPVVNRNNVLERVLRHRDAIISHLNWVCMFLGF HSFGLYVHNDTMRALGRPQDMFSDTAIQLQPVFAQWVQNLHTVAPGATAPNALEPVSY AFGGGILAVGGKVAMMPIALGTADFLIHHIHAFQIHVTVLILLKGFLFARSSRLIPDK MNLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNTISIAIFHFSWKMQSDVWGTVDA DGTVSHITGGNFAQSALTINGWLRDFLWAQAVQVINSYGSALSAYGLLFLGAHFVWAF SLMFLFSGRGYWQELIESIVWAHNKLKVAPSIQPRALSIIQGRAVGVAHYLLGGIATT WAFFHARIISLG" gene 3003..4571 /locus_tag="DP116_13350" CDS 3003..4571 /locus_tag="DP116_13350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319187.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13350" /translation="MNLWQKTATAILRLVMASVLVLSLTSCGEKAVSQEPSGSLQPKA QLSKQISEVSPPATIQELRSALEVYQPQVTIVTPKQDEILEDDTVTVRFQVKDLPIYK DPKLELGPHLQVILDNQPLIAVYNLNNPLVLPDLSPGTHTLRVFASRPWDESFKNEGA YAQTTFHVVTKTDDNNPDPAKPLLTYSHPKGSYGAEPILLDFYLTNAPLHLVATENPD DDIADWRIRCTINDESFVLDRWQAVYLKGFKPGKNWVKLEFLDEQGNPVKNVFNTTVR LITYEPNGKDTLSKIVRGELSVDQVRSIVDPNYTAKIPTAEPAPTPTSTPSVEETPQT QPKQETQPVPEPTPTPTLTPSVEKTPQTQLKEEKQPVPESGSVPQTQPEKPKSGDFFN RFERRTDKIPTPEATVEPTPGLSPTFPEVIETPQPEPELTPVPEKVTQDPEKKSRFSR YFNRQPDKKPTPEAPVETTPGLPPTLPEIIESPAPESMVPQPEITAEPKSEVTPTPEK QTEEIQKAHSNSEQ" gene complement(4632..5546) /locus_tag="DP116_13355" CDS complement(4632..5546) /locus_tag="DP116_13355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872753.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutaminase A" /protein_id="PRJNA477356:DP116_13355" /translation="MIGLKTLTTTQLSAWVQQAKLLTHQGQVAKRISQLALANPDLFA VYICCGTGGTFSQGDTDYIFPLMSAIKPFSLLYLLELVGAERVFQWLGFEPSDAPFNS LEQLVADHGRPRNPMINSGAITVADKLPGKDAIARTQQLSNWLNQLAGCHLKLDEVML ASVRSSNSQANQAITQYLAKAGSVKNPDIALDTYEQICCLSGQVEDLARLGLLLACEC EFIKPQHRRIVNSLMLTCGLYKASSQYALQIGLPMKSGISGALLAVVPGEGAVACYSP ALDSTGNSVGAIAFVEALSQGLQLSIFG" gene complement(5861..6601) /locus_tag="DP116_13360" CDS complement(5861..6601) /locus_tag="DP116_13360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017302686.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13360" /translation="MKPSEDRNYKLADYLSRSSDRYSLTKYKILRNWLPQAPKMRVLN AGCGSGEMNILLSQNSTWQVDALDVDTEAIRLSQKLKLENNIKNLNLYHTTIEDYTAP EKYDIIISNDVLEHIEDDQAVIKKFSDILKPDGLIGISVPALQWLFGYHDEMLAHYRR YHRKELIKKLSVYFNVSKCRYFGASLIPVALFYSRWLRKPYPVGELESESVKTKILEN LLGFESKVSFPLGISLIALATTKKLCLK" gene complement(6582..7355) /locus_tag="DP116_13365" CDS complement(6582..7355) /locus_tag="DP116_13365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015178533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polysaccharide deacetylase" /protein_id="PRJNA477356:DP116_13365" /translation="MNSLILLSFDLEEFDLPEEYGEKVEERVKFEVSYKGLTEIIQIL TILNIKATFFVTARFAIYYKALIQEISQKHEIASHGFDHCNFRLEDLKKSRQTLEEIT SQKVLGFRMPRLKKVDDTEIAKAGYKYNSSMNPTYIPGRYNNFFKPRTAYYSNNLVNI PVSVTPLLRFPLFWLSFKNFPLSLIKLASRLTLKNDHYISLYFHPWEFTDITQFHLPN YITKYSGKEMVERLEKYLIWLQTQGEFVYFSEFYETIRR" gene complement(7348..8112) /locus_tag="DP116_13370" CDS complement(7348..8112) /locus_tag="DP116_13370" /inference="COORDINATES: protein motif:HMM:PF00535.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_13370" /translation="MYLICEECLRIMNTISIIVPVYNEAQCIVSTFYSVLKFAVNKPW YDFIFVDDGSTDGTREILESQIKLLNNKQISLLSYEAHRGKGYAVKTGVLYADGDYIC FLDGDLAYSLDHLDLMVAKLAYFDIVIGCRNLTAEKNNGFKFLRRLAGRIFNFISRSL LNLKFTDMQAGIKGFEKYAAKDLFKTQIIPGFAFDVELLYLAKKKRYTIGEIPVVVSK KHLQKKSKVNLFKHSIEMLFNLLQIVYYDLILKKYE" gene complement(8382..9059) /locus_tag="DP116_13375" CDS complement(8382..9059) /locus_tag="DP116_13375" /inference="COORDINATES: protein motif:HMM:PF13489.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13375" /translation="MTDNIFRTLVALYDPLPLSDRLFVRTRLLLSNLYQLESYVPKEG RILDIGCGHGLLSNLLAITSPQRQVLGIDIDAKKIAAAQGTVGDRGNIQFQVGDAAVL PGTSFHAVTIADVMYLIPPDIQRAILTSIACALEPNGVLIWKTQSHRPQWKYTITYAQ EWLMTKLGPTKGAGIFFMDCEESIQAIRNAGLDPTVVPMPSRRPYSDILFLGCKSNKN SLSQNES" gene complement(9089..10336) /locus_tag="DP116_13380" CDS complement(9089..10336) /locus_tag="DP116_13380" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13380" /translation="MKLHAGWRLQVLIFITAFAILVSRRPDALFNAQFWAEDGKCWYA EAYNLGMIQSLVLPKGGYFQTISRLTAAFVQFFPLVWAPFIFNLTAIVIQILPVNLIA SVRFSQLIPNLQTRLFLGLLYLALPNSFETHANITNSHWHLAILACMVVLATPSRLLV WKVFDLAVILLSGLSGPFSVLLAPIAAIIWWLRRKRWSFILLLGLSAGAVVQGIAILL TGHSSRVHTPLGATPNLFAKILASQVFLAALIGQKGSKLISSISFGYSIIAILIAVAG SAAFVYCLLKAPLELRLFSIFAAGVFGMSLLSPVVSETVPQWWLLWRPGTGVRYWFIP MLAFVTILTWLLGATRPRQLQLIARIALVIMLAGIVVDWRYPTFVDLNFKNYSRQFVE APTGIMVTIPINPPGWFMELIKH" gene complement(10516..11481) /locus_tag="DP116_13385" CDS complement(10516..11481) /locus_tag="DP116_13385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213697.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_13385" /translation="MNHEPVERTTNTFLPKVSVVIPVYNGEADLPELLCCLSAQTYPK HQVEYLLVDNNSGDRTFALLQQASENSQITIRPKSENQIQSSYAARNAGIRAATGEIL AFTDADCRPEPEWLDSLIAPFVNQDIVLVAGEILALPGKTLLEQHAERQETLSQKHTL AHPFCPYGQTANLAIRRQALEQVGLFRPYLTSGGDADMCWRILQQKIGRLEFAPNAVV KHRHRATLKELESQWRRYGRSNRYLHELHGVDLMRDITLSECFYRLGRWLLKELPKDS VKALAGKASLVDLVSTPIGLYTARARAAGQREAKLPENAKIIEQW" gene complement(11606..12952) /locus_tag="DP116_13390" CDS complement(11606..12952) /locus_tag="DP116_13390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015196982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase S41" /protein_id="PRJNA477356:DP116_13390" /translation="MNRSAKRYSPLQVAFISGAIASTATLSAFGPVWCRSVRAALQDS PKAVVDNVWQLVNREYVDGSFNKQNWVAVRQSLLSKDYTNREDAYTAVRQALEKLGDP YTRFMDPKQYEALTNQTSGEVSGIGIRMELNEQTKRLTVIEAIENSPALKAGIKAGDW IVAIDGKPTSQMKVEDASKLIRGKAGTKLTLRLGRDGQNAFNVNLTRASIEVPTVRYT LKQEGDRRIGYIRLREFSAHASTQMRRAIRDLNGKQVDAFVLDLRANPGGLLQASIEI ARMWMDNGAIVKTVDRVGGSDEMKANRTAITKRPLAILVDGNSASASEILTGALKDNN RAVVVGGQTFGKALVQSVHELADGSGVAITIAHYYTPKGTDINHKGIAPDIKLDLTEA QQRQLAANPNLIGTKSDPQYARALAALSSNNFAQPTQQNRPLQQQPMSNSAVDLKL" gene 13177..13284 /locus_tag="DP116_13395" /pseudo CDS 13177..13284 /locus_tag="DP116_13395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949199.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="3-ketoacyl-ACP reductase" gene 13424..14140 /locus_tag="DP116_13400" CDS 13424..14140 /locus_tag="DP116_13400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316704.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YdcF family protein" /protein_id="PRJNA477356:DP116_13400" /translation="MSTEHKPKVTNNRPNSVSERFHRRRQFLQKLALGLCLVLLSWVI FNTLTIISASSKQIDAFFILGGSIRREIYVAELAQQYPQTPILISRGSPDPCILLVFQ RLAPQRMQNVWLEKCADSTFGNFYYSIPILRRWGVHKVKLISSQTHFPRAKWMAQILF GVHGIWVETDIVPEQGVPGNRESWSKTGLDVTRSLLWAGLSRIIQPKCVYVMRLTEVN MDFWQRQRFRCERQQDFNLR" gene 14268..15038 /locus_tag="DP116_13405" CDS 14268..15038 /locus_tag="DP116_13405" /EC_number="2.1.1.144" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019500066.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="trans-aconitate 2-methyltransferase" /protein_id="PRJNA477356:DP116_13405" /translation="MLTWDANLYLQFANERTQPSLDLVARIAVSHPQRIIDLGCGPGN STQILRRHWSKADIIGLDNSPEMIAAASKAYPEGKWVLADVATWTADARFDIVFSNAT LHWVPNHAALFPHLLEQVAPHGVLAVQMPVHFQSPVHQLMYEIADDPAWRQKMHRAKN ALVNEKPSFYYDVLQPKVSRIDIWETEYNHIMDSPDSIVQWISGTGLRPFLEALELEE QKLQFQEMLRAGVTRAYPRQKDGRILFPFRRLFIVAYR" gene complement(14997..15539) /locus_tag="DP116_13410" CDS complement(14997..15539) /locus_tag="DP116_13410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120791.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="PRJNA477356:DP116_13410" /translation="MEVGYAQLTIEGVAARAGVGKPTIYRRWSTKARLIIDAFLAATN PELSFPDTGSVKEDILQQMHKLVKVINSPRGQVIATLIGGGQTDPEMIEAFRANWLSP RRFDCSQVIKKGIERGELRSDVDMEAVIDALYSPLFYRLLLKHAPLTEDFVDQLIDVV MSGLNDRLAISNNEEPTKRK" gene 15598..15888 /locus_tag="DP116_13415" CDS 15598..15888 /locus_tag="DP116_13415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019507082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13415" /translation="MSGFRIEIVGSSQSRSGMPNPSNFEEALRTTGIGHFCFRVDDVD TALTELNQRGVQTFVEAADYPNVGVRVGFIKDNNGNVIEFSGPLKLIESKTY" gene 16062..16952 /locus_tag="DP116_13420" CDS 16062..16952 /locus_tag="DP116_13420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_13420" /translation="MTDNTVKIDPTFSVSFQIAGDLNVNRLGYGAMRLTGQPGNFGPY SDWEGGQKLLRRAVELGINFIDTAEAYGPGFNEELIASALHPYPEGVVIATKGGINKP APDDIRADGRPENLRWGCEASLQRLRVDQIDLYQLHRPDPKVPFAESVGMLATLKSEG KIRHVGLSNVTIAQIEQARRIVPIASVQNRFSITERDGEDVLDYCTKHAIAFIPYGSL GAHPLKQGAPLANAQGILASIANRHGVKPNQIALAWLLHRAPNIILIPGTTTIAHVEE NIAAASIKLNTDEIETLNQM" gene 16995..17267 /locus_tag="DP116_13425" CDS 16995..17267 /locus_tag="DP116_13425" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_924655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB family transcriptional regulator" /protein_id="PRJNA477356:DP116_13425" /translation="MEIVRLDNFGRVLIPKNVREQLGLTNATQFSLSIQDDKLVLSPL TQPSNVYHLGSTLVVESQPIGNLETAIDELREEQIRELICSSENPV" gene 17251..17685 /locus_tag="DP116_13430" CDS 17251..17685 /locus_tag="DP116_13430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012267791.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" /protein_id="PRJNA477356:DP116_13430" /translation="MKTLFDTSVLIAAFEVSHPRHSVCLPWLQQAQTQLLQGFIATHT LAELYSVLTRLPVKPSISPYLAQQLIVENLKNFEVISLDTHDYQMVIAQMVNLNLTGG GTYDALIAQAAIKARVDILLTLNPNHFTRLGEEIAQLVQTPL" gene complement(17804..20128) /gene="recJ" /locus_tag="DP116_13435" CDS complement(17804..20128) /gene="recJ" /locus_tag="DP116_13435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458635.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="single-stranded-DNA-specific exonuclease RecJ" /protein_id="PRJNA477356:DP116_13435" /translation="MPEQQWILATTEQPPESFIQAVKQYTPASGGHFAAQLLWQRGIR EKPQLDAFVNSQAYQAASPFEFGQEMHLAVERLQQARDAGEKVAIWGDFDADGITSTA VLWDGLGQFFAQNTQLVYYIPNRLTESHGLNCQGIDKLAKQGFQLIVTCDTGSTNISE IIYAKQLGIDVIVTDHHTLPPERPAVTAIINPRYLSETHQLFHLSGVAVAYKLVEAFY QTLPNIPQQPLEDLLDLVAVGLIADLVQLSGDCRYLAQLGIGRLQEDFKKTPELRRRP GVGRLLELCQKSGDRPTDISFGLGPRINAVSRIQGDASFCVELLTSRDQKRVHELAEI TELANTRRKSLQKDVAGQVAQKLSKMDLSTTSVIVLEDPQWAVGVLGLVAGQVAQDTG RPTILLSTEVAEEQESNLSPPLLARGSARSINSVDLYQLVKDQAHLLHRFGGHPYAAG LSLPVENIPLFTQAINQKLRQSLGGANLTPTVQADLTVTVADLGKDLFLELKLLEPCG MGNPVPKLLIQNCWFENAWHRNQQDSQGKKVQYIKAEFDIRDDSTRNPFPGVWWGHYK DELPPGRCDCIAELDYNTFKKRYEIRLIAVRSRTNSALASSSQTLIIDWRNIPTPDSR LPTPPFLLKECPTSWDDLRVYLRRSLHNQQPLALAWLKPDPKPPEQIWLTLVGIAKYL SRTNQPVTRIQLLEKLGINDQTLLLGLRGLRYLGFQISRQDRMLVFTQHSTIEPTLAE AAVEKFLAAVREEQFQRKYFAEVPLSTIEAIASP" gene complement(20209..20847) /locus_tag="DP116_13440" CDS complement(20209..20847) /locus_tag="DP116_13440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_13440" /translation="MHFDENDAVYVREIRIDDIAPVYHLGEQLFTSDSYPYLYRCWDE WEVIGLYNTDPEYCLVAEVDEQLAGFILGTIITKASWTYGYILWLGVSPKFQRRGVGD KLVDRVIARMIEDGARFMLVDTDPANSPAVKFFNRKGFGNVRQHVFLSMNLSKHDYYG RLIDYEHQKAERAGYRRSRPTMRPRKSDGVASEVVLNTLVSEPQISDQEAPV" gene complement(20877..>21231) /locus_tag="DP116_13445" CDS complement(20877..>21231) /locus_tag="DP116_13445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316707.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13445" /translation="DAARTQSTGTAKTATPSPQSVKFSYNTTRHQQTAKKLNQVLSRL TYWIEHYYHGELFPQEPEGSKACDYCQYTTRCNRQSRAQDINSLATPGQISSTNSTIV ETNLLNIASIQEVSL" BASE COUNT 6118 a 4661 c 4507 g 5945 t ORIGIN 1 cgtcgtcgga tcctgtgcta gatcctggct aaattttgga aattttgttg ccataggttt 61 tgataagtcc tctgaccttc agccatcagg ttcttagagt tttgagttgt gagttctgag 121 tcctgagttt ttattctgaa cccaaaactc aaaactcatc acttgtaaga cctgattttt 181 agcccagtga aatgattcgt gcgtggaaga atgcccaggt agtagcaatt cctcctaaga 241 ggtagtgagc cacaccaaca gcccgacctt gaataatgct aagagcgcgg ggctgaattg 301 atggtgctac tttcagttta ttatgtgccc aaacaatcga ctcaatcagt tcttgccagt 361 agccacggcc actgaacagg aacattaagc tgaatgccca gacaaagtgt gcgcccaaga 421 acagcagacc ataagctgac agcgcactgc cgtaggagtt aatcacttgt acagcttgtg 481 cccacaagaa gtcacgcaac cagccgttga ttgtgagagc gctttgggca aagttaccac 541 cagtgatatg agacacagta ccatctgcgt ccacagttcc ccagacatca gattgcatct 601 tccagctgaa gtggaaaatt gcaatagaaa tggtgttgta catccagaac aaaccaagga 661 atacgtggtc ccatccagat acctgacagg taccgccacg accaggacca tcgcagggga 721 accggaagcc caagttcatc ttgtctggaa tcagacgaga gctacgggca aacaggaatc 781 ctttcaacag aattagtact gtgacatgga tttggaaggc gtgaatatgg tggatcaaga 841 agtccgccgt acctagtgca atgggcatca ttgccacttt gccgcctaca gccagaatac 901 cgccgccaaa ggcgtagcta acaggttcta 
gtgcattggg tgctgttgca ccaggagcta 961 ccgtgtgcag gttttgcacc cactgagcaa atactggctg caactgaatt gccgtatccg 1021 agaacatatc ttggggacgt cccaaggcac gcattgtgtc gttgtgtacg tacagtccaa 1081 agctatggaa gcccaggaac atacataccc agttcaggtg agagataatt gcatctctgt 1141 gacgaagaac ccgctccagc acgttgttcc ggtttaccac tggatcgtaa tcccgcacca 1201 tgaagattgt tgcgtgagcc gctccaccaa caatcaggaa gccaccaatc cacatatggt 1261 gagtgaagat gcagagttgg gtagcgtaat ctgtcgccaa gtacggatac ggaggcatcg 1321 cgtacatatg gtgggcgatg atgatagtca acgaacccaa gaaagcaagg ttagttccta 1381 attgagcgtg ccaggaagtg gtcaagtttt cgtagagacc tttgtgacct tcacctgtga 1441 aaggtccttt gtggttttct agaatctctc taatgctgtg accaataccc cagttggtac 1501 ggtactggtg accagcgatg ataaacaata ccgcgatcgc caagtggtga tgggcattgt 1561 ctgtcagcca caacccacct gtgactgggt taagacctcc cttgaaggtg aggaagtctg 1621 aataaacacc ccagttcaaa gtgaagaaag gcgctagacc gttggcaaag ctgggaaaca 1681 gctcgatcaa caagtctttg ttcaaaatga actcgtgggg caacggtaca tctttaattg 1741 ccactccagc atccagcagc ttattcgttg gtgcagatac gtggatgatg tgtcctgccc 1801 agcccaatga accacaaccg agcaatactg ctaaatggtg attcagcatg gactccacat 1861 tttggaacca ttccaatttg ggagcgcgct tatggtagtg gaaccagccg gcaaacaaga 1921 acaagcctgc taataccaaa ccgccaatcg cagtgacgta aagctggaat gagttcgtga 1981 taccccaacc acgccatatt tggaataagc cggacgtgat ttgaataccg tgaaaaccgc 2041 caccgacatc tccattcaaa atgtcttgac caacgattgg ccaaacaact tgagcgcttg 2101 gcttgatgtt gagggggtcg ctcagccaag cttcatagtt tgaaaagcga gcgccgtgga 2161 atatcatccc gctcagccaa atggtcacta cggctaagtg accgaagtgt gctgcgaata 2221 tcttgcggga tatgtcttct aaatcgcttg tatgtgtatc aaagtcgtgg gcgagcgcgt 2281 gtaggttcca aatccatgtt gtcgtttttg gacctttggc tagaactctg tcgaaatgtc 2341 ctggttttgc ccatcgttca aatgaagtgg gtaccggatc gttatcgact attactcttg 2401 cctttttctc ctctcgctct ggaggactaa ttgtcattcg acctcctctc tttatcggga 2461 ttgaggaatc attgaacacc acctagtatc tttagggttt gagttcacct tagtcgcctc 2521 ttccagaaac agatgactgg agctgcgacg aagacctttt tgtggtgatt ggtttctgtt 2581 gaattataaa ggcttatctc gtacatagtt agagatcttt 
taacaataat tcaaaatcct 2641 cgaatctatt gcctaatcta ctttttacag gaaaactact tggcaaaaag tcaatagtct 2701 cttcatttat tttacaaatg gtaataattt tttccagaaa tgctcatacc acaaaaattt 2761 attttttcag tatttgctaa gtttacttaa tatcatttca cataatcaac gaagacaacg 2821 cttactataa aaaattaaaa tttcattaaa aagatttaaa acttaatatg gaaaaaatca 2881 cattactgat ttctggacac aatgtctacg aataaatctc cttttgggct atagtgtcca 2941 gtggacagtt tgagccaaaa ttatttagtc agttgtagta ggagtatcac tgggtgaaaa 3001 gcatgaactt gtggcaaaag accgcaactg ctatactcag attggtgatg gcatcggtgc 3061 tggtattaag tttgaccagt tgtggagaga aagccgttag ccaagaaccg tctggcagct 3121 tgcaaccaaa agctcaactt tcaaagcaaa tctcagaagt ttcccctcca gcaacaatcc 3181 aagaactgcg ctcagccttg gaagtttatc agccacaagt aacaatagtg actcctaaac 3241 aggatgaaat ccttgaagac gacacagtca ctgttcgctt tcaagttaaa gacttaccga 3301 tatataaaga cccaaaacta gagcttggtc ctcatttaca ggtcattctc gataatcaac 3361 cattaatagc cgtatataat ctgaataatc ccctggtttt gccagatttg tcaccaggta 3421 cccataccct acgcgtcttt gcctctcgtc cttgggacga aagctttaaa aatgaaggtg 3481 cttacgctca gacaacattt cacgtcgtca ccaaaactga cgacaacaac cccgatccag 3541 ctaagccact gttaacatac agtcatccca aaggcagcta tggtgcagaa ccaattctac 3601 tcgactttta ccttactaac gctccattgc atttagttgc cacagaaaac cctgatgacg 3661 acatagcaga ttggcgcatt cgctgcacaa ttaatgatga aagttttgtt ttagatcgct 3721 ggcaagccgt ttacctcaaa ggctttaagc caggtaaaaa ttgggtaaaa cttgagtttc 3781 tcgatgaaca gggaaaccct gtaaaaaatg tcttcaacac cacagtgaga cttattactt 3841 atgaacccaa tgggaaagac actttatcta agattgtcag aggagaactg tcagtagatc 3901 aagttcgcag cattgtagat ccgaattata ctgccaaaat tccaactgct gaacccgcac 3961 ccaccccaac ttcaacgcct agtgttgaag aaacacccca aacacaaccc aagcaagaaa 4021 cgcaacccgt ccctgaacct acacctaccc caaccttaac gcccagcgtt gaaaaaacac 4081 cccaaacaca actaaaggaa gaaaaacagc ctgttcctga atctggatca gtacctcaaa 4141 cacagccaga aaaacctaaa tcaggagact tttttaatcg gttcgagcgt cggacagata 4201 aaatacctac tccagaagca acagttgaac caactcctgg cttgtctccc actttcccag 4261 aggttattga gacaccccaa ccagaacctg agttaacgcc cgtaccagaa 
aaagttactc 4321 aagacccaga aaaaaaatcg agattcagtc ggtatttcaa tcgtcagcca gacaaaaaac 4381 caactccaga agcaccagtt gaaacaacac ctggcttacc tccaacctta ccggagatta 4441 ttgagtcccc tgcaccagaa tcaatggtgc cgcaaccaga aatcacagca gaacctaaat 4501 cagaggtaac gccgacaccc gaaaagcaaa ccgaggaaat ccaaaaggca cattcaaaca 4561 gtgaacagtg aacagggaag aaaacgataa ctgataactg gtaactgata actgataaaa 4621 gttttgggct gctagccaaa tatactcaac tgcaaccctt gagataaagc ttcaacaaag 4681 gcgatcgccc ctacagaatt acctgtacta tccaaggcag gactgtagca agcaacagcc 4741 ccttcacctg gtactaccgc caagagtgcg ccactgatac ccgatttcat tggcaaacca 4801 atctgtaaag catactgcga agaagcttta tacaacccac aagttaacat cagcgagttg 4861 acaatacggc gatgctgcgg tttgatgaat tcacactcac acgccaaaag caaacccaaa 4921 cgggctaaat cttccacttg accagataga cagcatattt gttcataagt gtcaagagct 4981 atgtctggat tctttacaga cccagctttt gcaagatatt gcgtaatggc ttgattagct 5041 tgggagttac ttgagcggac agaagcgagc ataacttcat ccaacttgag gtgacaacca 5101 gcaagttggt ttagccaatt agacagctgc tgagtgcgag cgatcgcatc ttttcctggt 5161 aacttatcgg caacggtaat tgccccacta ttaatcatag gattgcgcgg tcgtccatga 5221 tcagcgacca gttgctctaa agaattgaag ggagcatccg atggctcgaa cccaagccac 5281 tgaaaaactc tttctgctcc tacaagttca agtagataca aaagcgaaaa cggcttaatc 5341 gcgctcatga gggggaaaat ataatccgta tcgccttggc taaaagttcc cccagttcca 5401 cagcagatgt aaactgcaaa taaatcggga ttagccaaag ctaattggga aatacgctta 5461 gcaacttgtc cttgatgtgt taatagtttg gcttgttgca cccaagctga taactgagta 5521 gttgttagtg tttttaatcc aatcattact gctactgtct tatgcaattt ataaatactg 5581 taaaacagtc aattttgaat tttgcggaaa gttgccagga gggtttccct ccgtagcaaa 5641 ctttccaaga cgaattttga attttgaatt ttgaattaag cgaagggggt gagggtgtaa 5701 gggaaaaagg tgtttctttc attcgtagcg agtggtgcgt ctagcagcgt tgctagtggt 5761 ttctggcgct agttaaaggc ttagaatgcc cacatcatac tgcgattgtt ggtagttaag 5821 gtgctgctga aagcccacga atagaatttg gggggaatgg ttactttaga catagctttt 5881 ttgttgttgc taacgctatt agtgagatgc caagagggaa agagacttta gactcaaatc 5941 ccagtaaatt ttctaagatt tttgtcttta ctgactcgct ctctagttcg cctactgggt 
6001 aaggtttgcg taaccagcga ctataaaata aggcaacggg aataagactt gctccgaaat 6061 aacggcattt actgacattg aagtaaacag ataatttctt gattaattct ttgcgatgat 6121 agcggcgata atgagcaagc atttcatcgt gataaccaaa gagccattgt aaagccggta 6181 cagaaatacc tattaatcca tctggtttga gtatatctga aaactttttt atcactgcct 6241 ggtcatcttc aatatgttct aatacatcat tagaaatgat aatatcgtat ttttctggtg 6301 ctgtgtagtc ttcaatagtc gtgtgataaa ggttgagatt ttttatatta ttctctagtt 6361 ttaacttttg tgaaagcctg atagcttctg tgtctacatc caaagcgtct acttgccaag 6421 ttgagttttg cgatagcaag atgttcattt ctccagaacc acatcccgca tttaaaactc 6481 gcatttttgg tgcttgaggt aaccagttcc ttaaaatttt gtacttggtt aaagaatagc 6541 gatcgctaga acgagataaa taatcagcaa gcttgtaatt tctatcttct gatggtttca 6601 taaaactccg aaaagtaaac aaattcccct tgagtttgca gccatattaa atatttttcc 6661 aaacgctcaa ccatttcttt accggagtat ttagtgatat aattaggtag atgaaactgg 6721 gtaatatcag taaattccca cggatgaaaa taaaggctta tgtaatggtc atttttcaga 6781 gtcagtcgag aagctaattt gattaatgat aagggaaaat ttttaaaact caaccagaac 6841 aagggaaatc tgagcagcgg tgtcactgaa actggaatat taactaagtt gtttgagtaa 6901 taggctgttc taggtttgaa aaagttattg tatcttcctg ggatgtaagt aggattcatt 6961 gacgagttat atttgtatcc tgcttttgct atttctgtat catctacttt tttcagtcta 7021 ggcatcctga aaccaagaac tttctggctg gttatttctt ccaaggtttg ccgtgacttt 7081 tttaaatctt ctaatctaaa attgcaatgg tcgaaaccat gagaagcaat ttcatgtttt 7141 tgggaaattt cttggatgag cgctttgtaa tagatagcaa atctggctgt gacaaaaaaa 7201 gtggctttta tgttcaatat tgtcaaaatt tgaataattt ctgttaatcc tttgtaagaa 7261 acttcaaatt tgactctttc ttcaactttt tcaccgtatt cttctggaag atcaaattct 7321 tctaaatcga agcttaataa aatcagacta ttcatacttt tttaggatca aatcatagta 7381 gactatttgc aataaattaa atagcatttc tattgaatgc ttgaataaat ttacttttga 7441 ttttttctgt aaatgtttct tagatacgac aactggaatt tcgccaatag tatatctttt 7501 ctttttggct aaatatagca actctacatc aaaggcaaat ccagggatga tttgagtttt 7561 aaataaatct ttagctgcat atttttcaaa tcctttgatt ccggcttgca tatcggtaaa 7621 tttcaaattt aaaagacttc tcgatatgaa attaaatatt cgaccagcca gtcttctcag 7681 
aaatttgaac ccgttgtttt tttctgccgt gagatttcta cagccaatga caatatcaaa 7741 atatgctagt tttgctacca tcaaatcaag gtggtctaaa gagtaagcta aatctccatc 7801 taaaaagcaa atataatcgc catctgcata taaaactccg gttttgactg catatccttt 7861 gccacgatga gcttcataag aaagtaatga aatctgctta ttgttcaata gtttgatttg 7921 actttctaag atttctctag ttccatcagt tgagccatcg tcaacaaata taaaatcgta 7981 ccaaggttta ttgacagcga atttgagaac ggaataaaag gtagaaacga tacattgtgc 8041 ctcgttataa acgggcacaa ttatggaaat tgtgttcatt attctcaagc actcctcaca 8101 tattaaatac atccaactca gctgtcgggc atttaaatta cactttttgt gcgttggatt 8161 cctgacgcta gggcattcta gtgggaaaaa aatcaacagc tgactaacgc atggccatat 8221 ctatgtatgg ttatgcgact agaggatata tttattccta ctcagcatac caaggttatt 8281 tcgattagtt aataggttaa aaaatcagat tttggaaaaa gtgtcgtgac gactacaaaa 8341 tactcgtttt ttggatgtgc gactcataaa gaactcagaa atcaggattc attctggctt 8401 aaagaattct tattcgactt acaaccaagg aaaagaatat cactgtaagg acggcgtgaa 8461 ggcataggca cgacagttgg atctaaccca gcattacgaa tagcttggat cgattcttcg 8521 cagtccataa aaaatatacc agcacctttg gtaggaccaa gcttagtcat gagccattct 8581 tgggcgtagg ttatagtata tttccactgg gggcgatggc tttgggtttt ccaaatcagt 8641 acaccgtttg gttccaaagc acaagcaata cttgtcagaa tcgcacgttg gatgtcaggg 8701 ggaatcaggt acataacatc ggcaatagtg acagcgtgga aggaagtacc tggcaaaaca 8761 gcggcgtcac caacttgaaa ttggatgtta cctctgtcgc caacagtacc ttgtgcagcg 8821 gcaatttttt tcgcgtcaat gtcaatgcca agtacctgtc gctggggact ggtgatggct 8881 agtaagttac tcagtaaacc gtgaccacaa cctatatcaa gaattctacc ctcctttggt 8941 acgtaagact ctaactggta taaattgctc aaaagtaagc gagtgcggac aaaaaggcga 9001 tcgctcaatg gcagtggatc atacagtgca actagtgtgc ggaaaatatt atcagtcacg 9061 gcaagattcc ttgcttgtag tgatgtcgct aatgcttaat aagttccata aaccaacctg 9121 gtggattaat aggaattgtc accattattc ctgttggtgc ttcaacaaac tgccttgaat 9181 agttcttgaa gtttaagtca acaaatgtag gataacgcca gtctactaca ataccagcaa 9241 gcataatcac cagagcaatt ctagcgatta actgtagctg acgcggtctt gttgcaccca 9301 atagccaagt taatattgtc acaaaagcaa gcataggtat aaaccagtac cgaacaccag 9361 ttccaggacg 
ccacagaagc caccactgag gtacagtttc gctgactact ggagacaaaa 9421 gcgacatacc aaaaactcca gcagcaaaga tagagaaaag tcgaagttct aaaggggctt 9481 tcagaaggca gtaaacgaat gcggcgcttc cagcgacagc gattaaaatc gctattatgc 9541 tatacccaaa agaaatagag ctgatcaatt tactcccttt ttgtccaatc aaagcagcta 9601 agaagacttg actcgcaagg attttggcga ataggtttgg tgtagcccct aaaggagtat 9661 gaacacggct agaatgtcca gttagtagaa tagctatgcc ctgaaccaca gcacctgcac 9721 ttaaacctaa gagtaaaata aatgaccatc tcttacgacg taaccaccaa atgattgcag 9781 caatgggagc gagcaaaacc gagaaaggtc cacttaatcc tgacaacaag atgactgcga 9841 ggtcaaaaac tttccatact agaaggcggc tgggtgttgc taacacgacc atgcaggcta 9901 gtattgctaa gtgccagtgt gaattagtga tgttggcatg agtttcaaaa gagttgggta 9961 aagctaaata cagaagtccc aaaaacaacc tagtctgaag gttcggtatt aactgcgaga 10021 agcgcaccga agcgataaga ttaactggca gaatctgaat aacaattgca gttagattaa 10081 agatgaaagg tgcccatacc agtgggaaaa attgcacaaa agcagccgtc agcctggaaa 10141 tagtttggaa gtagccaccc ttaggcaaaa ccagggattg gatcatcccg aggttgtaag 10201 cctcagcata ccagcacttg ccatcttcag cccaaaattg tgcattgaaa agtgcatctg 10261 gtcttctgga aaccagaatt gcaaaagcag ttataaagat caaaacttga agacgccagc 10321 cagcatgcag cttcatttgt tgtttatcga ttcgttgtac tagactgttc gagattttac 10381 catttgggtt aggtattata tagcggttct ggtgttgcgt gcaatacaca actgtagaga 10441 cgttacatgt aacgtttcta catgattcgt ggtggtgtat ctgattcaaa tgagaaccgc 10501 gatatctccg atgccttacc actgctcaat tatcttggca ttctccggca acttcgcttc 10561 cctttgccca gccgcacgtg cgcgagctgt gtacaatcca ataggagtac tcactaaatc 10621 tacaaggctg gcttttcctg ctaaagcttt cacactgtct tttggtagtt ctttcaacaa 10681 ccagcgtccc aaacgataga agcattcgct gagtgtgata tcccgcatta aatccacgcc 10741 gtgaagttca tgcaaatagc gatttgagcg tccatagcgc cgccattgac tttctagttc 10801 cttaagtgtg gcgcggtggc ggtgcttgac aactgcattg ggggcaaatt ccaaacgtcc 10861 aattttttgt tgcaaaattc gccaacacat atcagcatca ccaccactgg ttaggtaagg 10921 acgaaacaaa cctacttgtt ccaaagcttg gcgtcgaatt gctaaatttg ctgtttgacc 10981 gtagggacag aagggatgag caagagtgtg cttttgtgag agggtttctt ggcgctctgc 11041 
gtgctgttct agcagggttt ttcctggtag cgccaaaatc tcaccagcga caaggacaat 11101 atcttggttg acaaaaggag caattaatga atctaaccat tctggttctg gacgacaatc 11161 tgcatcggta aaagcaagta tctcaccagt tgcggcacga atgccagcgt tacgggcggc 11221 ataggaactt tggatctggt tctcgctttt aggacgaatt gtgatttggc tattttcgct 11281 tgcttgttgg agtaaggcaa aagtgcgatc gccactgtta ttatctacca gcaaatactc 11341 tacctggtgt tttgggtaag tttgagccga tagacaacac agcaactctg gtaaatctgc 11401 ctcaccgtta taaacaggta taaccaccga tacctttggc aagaaggtat tagtggttct 11461 ttcaactggt tcatgattca tagttttgag attttggatt aagaattgag gactgtgggt 11521 tattttatta acagctaagc actcactact acttaccaac tagtccttta tccaatattt 11581 ccaatccaaa atcaccgaag tttagctaaa gttttagatc taccgcgcta ttactcattg 11641 gttgttgctg caacggtcga ttttgttgtg taggttgagc aaagttgtta ctagatagag 11701 ctgctagcgc acgcgcgtat tgggggtcac tcttagttcc gatcaaattt ggatttgctg 11761 cgagttgacg ctgttgtgcc tctgtcaggt ctaacttgat atctggcgca atccccttgt 11821 ggttgatatc tgtgcccttg ggagtgtaat aatgagctat ggtaatagct acgccagaac 11881 catctgcaag ttcatgaacc gactgtacta aggctttgcc aaaggtttga ccgccaacaa 11941 ctaccgctcg gttattatcc ttgagtgcgc ctgtgaggat ttcgctagca ctggctgaat 12001 tgccatccac tagtatcgcc agagggcgtt ttgtgatagc agtacgattt gctttcattt 12061 cgtcactacc tcccacacgg tctacagtct tgacgatcgc cccattatcc atccacatcc 12121 gggcaatttc gatactcgcc tgcaacaaac cgcctgggtt tgcacgtaga tccaatacaa 12181 aggcatcgac ttgcttgcca tttaaatcgc ggatggctcg tcgcatctga gttgaagcat 12241 gggcactgaa ctcgcgcagg cggatatagc caatgcggcg atcgccttct tgctttaagg 12301 tgtaacgcac tgtcggcact tcaatacttg cccgtgtcaa attcacatta aaagcatttt 12361 gcccatctct tcctagccgc aaagtcaact tagtccctgc tttcccgcga atcagctttg 12421 aggcatcttc tactttcatt tggcttgtgg gtttgccgtc aattgccaca atccagtccc 12481 cagctttaat acccgctttc agtgcagggg aattttctat ggcttcgata acggtgagcc 12541 ttttggtttg ctcgttcaat tccatccgaa tgccaattcc agagacttcc ccagatgttt 12601 ggttggttaa agcttcatac tgtttgggat ccataaatcg agtgtaagga tcacccaact 12661 tttctaatgc ttgacggacg gcagtgtatg catcctcacg gttcgtatag 
tctttgctca 12721 gcagactctg tctcactgct acccaatttt gtttattaaa tgagccatca acatattcgc 12781 gatttactaa ttgccagacg ttatcgacta ctgcttttgg gctatcttgc agagctgcac 12841 gcacagaacg acaccaaact gggccaaaag cggataacgt agcggttgag gcgattgctc 12901 cgctaataaa agctacttgg agcggcgagt aacgtttcgc agatcgattc atgtatcttg 12961 actagaataa ttgtattgtt gacagtgtag caatctgact ttgcaggaat ttataagttt 13021 ttcttcctaa ttatgctttg ttcatcactt ttccttgatg ttttttgaga aaactgcata 13081 ttgttcaaac agcttcttta ctttagctta cttctcccgc tagctaagtg ccaccatatg 13141 atagtttcat cagcagtgct atccaatccg gcaaaattgc ttaaaatagg acttatagca 13201 aagggaacag ggaacaggga acagggaaca gggaacaggg aacagggaac agggaacagt 13261 gaacagggaa cagtgaacag tgaagataaa ctctacgtag ctgagcaaaa atttttggag 13321 gagttttata gtacacccca tcattcatag agttaaccct tcttatgaaa atgtgtactt 13381 cgctgcttca actggtattg ctgattatac aaaaaagctt gtattgagta ccgagcacaa 13441 acctaaagtg acaaacaacc gacctaattc agtgtcagag cgattccaca gaaggcggca 13501 atttttacaa aaactcgctt tggggttgtg tctcgtattg cttagctggg ttatttttaa 13561 tactctaacc attatttctg catcttccaa acaaatagat gcctttttta tactcggtgg 13621 cagtattcgt cgggaaattt atgtcgccga gttagcacaa caatatccgc aaaccccgat 13681 cttaatttct cggggatctc cggatccgtg cattttgctg gtttttcaac gcctagcgcc 13741 ccagagaatg caaaacgttt ggttggaaaa gtgtgcggat tctacctttg gtaattttta 13801 ctattccatt cccatcctgc gccgttgggg agtccacaag gtgaagctga ttagttctca 13861 aacacacttc ccgcgtgcaa aatggatggc acaaattctt tttggtgttc atggtatttg 13921 ggtagaaaca gacattgttc cagagcaagg tgtgccaggt aatcgagaat cttggtcgaa 13981 aacaggacta gatgtgacgc gtagcttgtt atgggcgggg ttaagtcgga taattcagcc 14041 gaaatgcgtt tatgttatga gattaacaga ggtgaatatg gatttttggc agcgtcagcg 14101 ttttagatgt gaacgtcagc aggatttcaa cctgcgttag cataaattga ggtaattgat 14161 cattgactaa cttttcaaca gatgctcctg aacgtcttct agaattgaag tggcatcttg 14221 gaaaaatcaa tgctagaggt gcaaaaagaa acttctaagg gatacagatg ctaacttggg 14281 acgctaacct ttacttgcaa tttgccaatg aacgcacgca gccgtcatta gatttggttg 14341 cacgcatagc cgtttctcat cctcaacgga 
ttatcgattt gggatgcgga ccagggaaca 14401 gcacccaaat acttcgccga cactggtcaa aagcggacat catcggactt gataactcac 14461 cagagatgat tgccgctgca tcgaaagctt atccagaagg aaaatgggtt ttggcagatg 14521 ttgccacttg gacggctgac gctcgatttg acattgtttt ttctaatgct acgttacact 14581 gggtgccaaa tcacgccgcg ctgtttccac accttttgga acaggttgct cctcatggag 14641 ttttagcggt gcagatgccc gtgcattttc agtcacccgt tcatcaactc atgtatgaaa 14701 ttgcggatga tccggcatgg cgacaaaaga tgcacagagc aaaaaacgca ttggtgaatg 14761 aaaagccgtc tttctattac gacgtgctgc aacctaaggt ctcacgaatc gacatctggg 14821 aaacagaata taaccacatc atggacagtc cagattcaat tgtgcaatgg ataagtggta 14881 ctggtctgcg tccctttctt gaagcgttgg aattggagga gcaaaagctt caatttcaag 14941 aaatgctacg tgctggtgtg acgcgagcgt atcctcgaca aaaggatggt cgcatattat 15001 ttccttttcg tcggctcttc attgttgctt atcgctaaac gatcattcag tcctgacatg 15061 actacatcaa ttaactgatc aacaaaatct tcagttaacg gtgcgtgttt caacagcagg 15121 cgataaaaca gggggctgta aagcgcgtca atgactgctt ccatatcaac atcagagcgt 15181 agttccccgc gttctatacc ttttttaatg acctgactac aatcaaatcg acgtggtgat 15241 aaccaattag ctcgaaacgc ctcaatcatt tccggatcgg tttgtcctcc accaatcagc 15301 gttgcaataa cttgtcctcg tggactgttt attactttga ctaacttgtg catttgctgt 15361 agtatatcct cctttaccga acccgtatca ggaaaagaga gttctgggtt agttgcagcc 15421 aagaaagcat caataataag acgcgccttg gttgaccacc gtcgataaat ggttggttta 15481 ccaacacctg cgcgcgctgc aactccttca atcgtcaact gagcataccc gacttccatc 15541 aacagttcat aagcagcgtg caaaatattt tgatgtgctt cattagctta tctcaatgtg 15601 agtggatttc ggattgagat agttggcagt agccagtcac gttctggaat gcctaatccg 15661 tcaaattttg aagaagctct acgcacaaca ggtatcggtc atttttgttt tcgtgttgat 15721 gatgtagata cagctcttac tgaattaaat caacgcggtg tacagacatt tgttgaagcc 15781 gcagattatc caaacgtagg tgtcagagtt ggttttatta aagacaacaa cggaaatgtt 15841 atcgagtttt caggaccctt gaaattgatt gaatcaaaaa cgtactagcg ccattgtgcc 15901 tatatttgat taaccaaagg aattacaaag atgattaaaa atctcttcca ttccacttca 15961 ttggtaactt tagcgctgct ggtctttcgt caagttctat tcagaatttc aaaacaccta 16021 aaacaggtca 
aattcatcaa cacaaatagg ggtaaaagat tatgacggac aatactgtaa 16081 aaattgatcc tactttttca gtgtcttttc aaattgcagg tgacttaaac gttaatcgtc 16141 tcggttacgg tgcgatgcga ctaacaggac agcctggaaa ttttggtccc tattcggatt 16201 gggaaggcgg tcagaaactc ctgcgtcgtg cagtggaact aggtattaat tttatcgata 16261 cagccgaagc ttacggacca ggttttaatg aagaattaat tgcttctgca ctgcatccct 16321 acccagaagg tgtagtcatt gcaactaaag gtggtattaa caaacctgca ccagatgaca 16381 ttcgcgccga cggaagacca gaaaatttgc ggtggggttg tgaagcaagc ttgcaaagac 16441 tgcgagtaga ccaaatagac ctgtatcaac tacatcgtcc tgatccaaaa gttccatttg 16501 ctgagtcggt agggatgcta gcgactttga agtcagaagg taaaattcgt catgtaggac 16561 tatccaatgt cacgatcgcg caaattgaac aagcgcggcg gatagttcct attgcttctg 16621 tgcaaaatcg ctttagtatc acagaacgtg atggcgaaga tgtactagat tactgcacca 16681 aacacgcgat cgcatttatc ccctacggtt cacttggcgc acatcccctc aaacaaggcg 16741 cacctttagc aaatgcacaa gggattttgg cttcaattgc taatcgtcac ggtgtgaaac 16801 ctaaccaaat tgctttggca tggttgctac atcgtgcgcc caatatcatt ttaattccag 16861 gaacgacaac aattgctcac gttgaagaaa atattgcggc agcttcaatc aagttgaata 16921 cagatgaaat tgaaactttg aatcagatgt aatactcgtt acgatagact taggatatct 16981 tcaaatagct gtgtatggaa atagtcagac ttgataactt tggtcgcgtt cttattccta 17041 agaacgtacg agaacaactg ggactgacaa acgcaactca gtttagcttg agtattcagg 17101 atgataagct tgttttgtca cctttgactc aaccgtctaa tgtttatcat ttaggttcaa 17161 cattggttgt tgagtcacaa ccgataggaa atctggaaac tgcaattgat gaactgcggg 17221 aagagcaaat tagggaattg atttgcagta gtgaaaaccc tgtttgatac ctcggtattg 17281 atagcagctt ttgaagttag tcatccccga cactcagttt gtcttccgtg gctgcaacaa 17341 gcgcaaacac agcttctgca aggatttatt gcaactcata ctcttgcaga actctattca 17401 gttctgacgc gccttcctgt caaaccctct atctctcctt acttagctca acaactaata 17461 gtagaaaact tgaagaactt tgaggtcatt tctctcgaca cccatgacta tcagatggta 17521 attgcccaaa tggtaaatct taatctaaca ggtggcggaa cttatgatgc actcattgct 17581 caagctgcta tcaaagctag agtagatatc ttgctcactc tcaatcctaa tcattttact 17641 cgcttagggg aagaaatagc gcagctcgtg caaacaccac tgtgagattt tttgagctta 
17701 agcaactact tatctcagac ggtagactga gaatttgtaa ctactccaag actgtttgag 17761 atgcgatctg cgcttagcgc agcagcgttt gcgcagcgcc cccttagggg ctagctatcg 17821 cctcaattgt cgataacgga acttcagcaa aatacttacg ctgaaattgt tcttctcgga 17881 cagcggctaa aaatttttcc acagcagctt ctgctaaagt tggttcaatg gtagagtgct 17941 gagtgaaaac caacatacgg tcttgacgtg agatttggaa acccaagtat ctcaaacctc 18001 ttaaccctaa aagtaaagtt tggtcattga taccaagttt ttccaataac tgaatacggg 18061 tgacaggttg atttgtacga ctgagatatt tggcaattcc cacaagagta agccagattt 18121 gctcaggtgg tttggggtca ggttttaacc aagccagtgc taggggttgt tggttgtgaa 18181 gcgatcgcct caaatacaca cgtaaatcat cccaacttgt cggacactct ttgaggagaa 18241 aaggaggagt cgggagtcgg gagtcgggag ttgggatatt ccgccagtct ataataagtg 18301 tttgagatga cgatgctagt gctgagttcg tccgggaacg tacagctata agtcggattt 18361 cgtagcgttt tttaaaggtg ttgtagtcca attcagcaat gcagtcacac cttcctggag 18421 gtaattcatc tttgtagtgt ccccaccaaa cgccaggaaa aggatttctt gtcgagtcat 18481 cccgaatatc aaactccgct tttatgtact gtaccttttt cccttgcgaa tcttgctgat 18541 tgcgatgcca agcattttca aaccagcagt tttgaatcaa aagtttggga actgggtttc 18601 ccattccaca aggttctagt agcttcagtt ccaaaaataa atctttcccc aaatctgcta 18661 ccgttacagt taagtcagct tgtacagttg gtgttaggtt tgctcctccc aaggattgac 18721 gcaacttctg gttaatcgcc tgtgtaaata aaggtatatt ctctactgga agacttaacc 18781 ccgctgcata aggatgtcca ccaaatcgat gcaacaaatg tgcttggtct ttcactaatt 18841 gatataaatc aaccgaatta atcgaacgcg ccgaaccccg cgctagaaga ggaggagaaa 18901 gattgctctc ctgctcctct gctacttccg tacttaaaag aatcgtcggg cgtcctgtgt 18961 cctgtgcgac ttgtcctgcg actaatccca aaacgccgac agcccattgg gggtcttcca 19021 aaacaataac gctggtagtt gataagtcca ttttggaaag cttttgcgcg acttgcccag 19081 caacatcttt ctgcaaagac ttgcgacgag tgtttgctaa ttctgtgatt tctgcgagtt 19141 catggacacg cttttggtcg cgacttgtga gtaattccac gcaaaaagag gcatcgcctt 19201 gaatgcgact gacggcgtta atgcggggac ctaaaccaaa ggagatatct gtggggcgat 19261 cgccactttt ctggcataat tccaacaatc gcccgactcc cggacgccgt ctcaactctg 19321 gtgttttctt aaaatcctct tgcaatcgcc caattcccaa 
ctgtgctaag tagcgacaat 19381 ctccactcaa ctgcaccaaa tcagcaatga gtccaaccgc gactaaatcc aacaaatcct 19441 ctagcggttg ttgcgggata tttggtaagg tttgataaaa agcttccacc aacttgtaag 19501 ccaccgccac tccagaaaga tgaaacagct gatgtgtttc gctcaaataa cgagggttaa 19561 taattgccgt tacagcggga cgttccggtg gtaaagtgtg gtggtctgtc actataacat 19621 ctatacctaa ctgcttagca taaataattt cactaatatt tgtactgcca gtatcacaag 19681 tcacaatcaa ctgaaagccc tgttttgcaa gcttatcaat tccttgacaa ttcaagccat 19741 gagattcggt taagcgatta ggaatgtaat aaacaagctg cgtattttga gcgaaaaatt 19801 gtcccaaacc atcccacaat acagcagtag acgtaatacc atcagcgtca aaatctcccc 19861 aaatcgcaac tttttcccca gcatctcgtg cttgttgcaa ccgttccacc gccagatgca 19921 tttcttgccc aaactcaaat ggactcgcgg cttgataagc ttgtgagttg acaaaagcgt 19981 ctaattgtgg tttttctctg attcctcgtt gccataacaa ctgcgctgca aaatgcccac 20041 ccgatgcagg tgtatattgc ttcaccgctt ggataaacga ttctggcggt tgttcagttg 20101 tagcaagaat ccactgttgt tctggcattt gaattttccc cgccttgaaa aacggggatt 20161 atagactatt gttgctttgt gcggatcaaa gccacacact gataactgtt aaaccggcgc 20221 ttcctggtcc gagatctgag gttcactcac caaagtattc aatacgactt cacttgcaac 20281 accatcagac ttacgaggac gcatagtagg acgagaacgt ctgtaacctg ccctttcagc 20341 tttctggtgt tcgtaatcaa ttagtctgcc atagtaatca tgcttgctta aattcatcga 20401 caagaaaaca tgctggcgaa cattcccaaa acccttacgg ttgaaaaact tcacagcggg 20461 actatttgcc ggatcagtgt ctaccaacat gaaccgtgct ccatcttcaa tcatacgtgc 20521 tatgacccta tcaacgagct tgtctcctac gccgcgacgc tgaaatttag gactcactcc 20581 taaccacaga atgtatccat aagtccaact tgctttggta ataatagttc ctaaaatgaa 20641 tcctgctaat tgttcatcaa cttctgcaac cagacagtat tctggatcag tgttataaag 20701 tcctataact tcccactcat cccagcaacg atataagtag ggatatgaat cgctggtaaa 20761 taactgttca cccaagtgat aaacaggggc tatgtcatct attcgtattt cgcggacata 20821 aaccgcatca ttttcatcaa aatgcataaa ataataatca aaaagttatc tgtcacttac 20881 agactgactt cttgaatgct agctatattc agtaggttcg tctctactat tgtagaattt 20941 gtagaagata tctgcccagg agtggcgagt gagttaatgt cttgggcacg gctttgccga 21001 ttgcagcgcg ttgtatactg 
acagtagtca caggctttgc taccttctgg ttcttgcgga 21061 aatagttctc cgtggtagta atgttctatc caataggtta agcgacttaa aacctggtta 21121 agtttttttg ctgtttgttg gtgtcgcgtc gtgttgtaac taaatttaac actttgcggt 21181 gagggggtcg cagtcttggc ggttcccgtc gattgcgttc gcgcagcgtc t // LOCUS NODE_1494_length_21088_cov_5.903960 21088 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21088) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21088) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..21088 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(1..867) /locus_tag="DP116_13450" /pseudo CDS complement(1..867) /locus_tag="DP116_13450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_076611286.1" /note="frameshifted; incomplete; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="ISAzo13 family transposase" gene complement(1189..1797) /locus_tag="DP116_13455" /pseudo CDS complement(1189..1797) /locus_tag="DP116_13455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017803983.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(1875..2141) /locus_tag="DP116_13460" /pseudo CDS complement(1875..2141) /locus_tag="DP116_13460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950375.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(2170..2622) /locus_tag="DP116_13465" CDS complement(2170..2622) /locus_tag="DP116_13465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950374.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl-CoA thioesterase" /protein_id="PRJNA477356:DP116_13465" /translation="MQKISLELEVYSFHIDFIGHVNNTVYIQWMEIGRTKLLEAVGMP TQEIFQQGFAPVLVQTNITYKLPLHLGDRVQVQMWISELKNASAIMQFRFYKEQETLA AEGWQKGLFVDTQTMRPRRLRPEERCLFAPYVHSTVDTQLANSLVEIP" gene 2697..3308 /locus_tag="DP116_13470" CDS 2697..3308 /locus_tag="DP116_13470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742554.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_13470" /translation="MARDKEETKARILAAVGKLLAESGFKQLGVNAIAREAGVDKVLI YRYFENLPSLLQTFGKEGNYWTSVEELVGDETAVDAESLADWMVLLLTRFLHDLQKRP ITQEIMRWELLEGNELTHELATVRDRVAIESLEFLKQKCSFPPDKDIPAISAVLIAGI VYLVLRTKVSNTFLGIDFSSPTGWQRIEGAIASLIQPMVETEE" gene complement(3646..4689) /locus_tag="DP116_13475" CDS complement(3646..4689) /locus_tag="DP116_13475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651238.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="mechanosensitive ion channel family protein" /protein_id="PRJNA477356:DP116_13475" /translation="MIQHWILPLVFILAGLLGGIITEKIIFTRLHKFVTSRRIPGSAV IFRSLNRMPFFWCFLAGFYGATVSYRPEPDVAELLLKIITSSFLYTVTVVFARLTAGF VNVFFRRTDGFSASLLSNVAKVAVLVLGTLIVLQTLGVQITPILTTLGVGGLAVGLAL QDTLANLFSGFYLIISKQVRTGDYVKLDSGNHEGYVTDITWRNTTIKELSNNVIIVPN SKLASAIFTNYHLPVKEITLTMNVGVSYDSDLEKVERVTVKVAKEVMEEIAPELTANE PYIRFNEFADYSINFTLYMRVNEFFDQRLARHLLVKKLHKSYHKEGITIPFPARDVYL QSNGSKNGAMTLE" gene 4907..5824 /locus_tag="DP116_13480" CDS 4907..5824 /locus_tag="DP116_13480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208899.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histone deacetylase" /protein_id="PRJNA477356:DP116_13480" /translation="MDLPIVYHPDYVAPLPEGHRFPMPKFRQLYELLLADGVAQKEQF HLPSRPPQELIELVHTPEYVQAYCEGTLDAKAQRRIGLPWSSSLVNRTCVAVGGTILT AQLALKYGLACNTAGGTHHAFPSYGSGFCIFNDLAVATRVLQKLRLVQKILIVDLDVH QGDGTAFIFQEDDSVFTFSMHCEVNFPGTKQKSDLDVPLPVGMEDDAYLQTLAEYLPD LLSEVKPDLVLYDAGVDPHVGDKLGKLALTDTGLYRREMQVLSTCIAAGYKVACVIGG GYADDLKSLVYRHSLVHRAASEVFRQYRL" gene 6169..7728 /locus_tag="DP116_13485" CDS 6169..7728 /locus_tag="DP116_13485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="choline dehydrogenase" /protein_id="PRJNA477356:DP116_13485" /translation="MNQYDYIVIGAGSAGCVVANRLTEDGKTTVLLLEAGNPPNLPEH EVPLGWVKLWGTEVDWAYFTEEEPYLNGRKIYCPRGKVLGGTSSINAMIYIRGNRHDY DRWQELGNPGWSYQDVLPYFKKSENQQWGASEFHGVDGELSVSDPIAPAVTSQRFVEA AIALGYERNLDFNGAQQEGAGLYQLTIKDGKRHSTATAFLVPILNRPNLTVTTGALVT RLLFEGTRTVGVEYIHQGTIHQSFVQQEVILSAGAIDSPKLLMLSGIGNAEIVRSLDI PLVVNLPGVGQNLQDHLHVAVAHQATQDLQPAPTSNIAEAGLFLHTEGRLDTAPDLQL FSGPVLWTHPAYAREGPGFAATACVTNPQSRGSVSLRSASPNDSPIIRMNYLQSESDV QKLVAGIKIIRQLFNSSVFDEFRGEEAAPSADVTSDEALRAYIRETCDTVYHPVGTCK MGTDADSVVDPELRVHGVDGLRVVDASIMPSITTGNTNAPTIMIGEKAADLIKAAGRV LEQATSTTAIV" gene complement(7989..8959) /locus_tag="DP116_13490" /pseudo CDS complement(7989..8959) /locus_tag="DP116_13490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007306299.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS701 family transposase" gene 10199..11503 /locus_tag="DP116_13495" CDS 10199..11503 /locus_tag="DP116_13495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455333.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chloride channel protein" /protein_id="PRJNA477356:DP116_13495" /translation="MLPDKRPQREQTQRWTFSQLFGRARRHPFMISRWVLSWAIVGTF CGLFAALYWNILELIIHGLQRFQGSSLLLVMPLAGLVIGLVIHFLGNPGEIAVIVDNI HFRGGRLDARKNPSMILASLVSISAGGSAGPEAPLVQVTGSFGTWVAERLRLEGEDVR SMSLAAMAAGFTALFGSPLGGAMFALEILHHQHIVEYYEALMPAIVSSCASYVVFAAI THLGIAPTWHFPQYRLDSIDDFAVAIMFGIVGVVAGWIFMGIFRGCDRIFALIPGPIY LRTTIAGLGLGILATFLPLTRYFGHEELDKVVNQSFPVFFLFMLALAKMAAISITVTG GWRGGFIIPLFFTGACVGKAVVALIPGLNPTLAMICTMAAINASVTRTPISTTLLLSK LTNFSPLTPILFASLIGFFLAPKVPFIASQLKSEQGTLINHP" gene 11699..14281 /locus_tag="DP116_13500" CDS 11699..14281 /locus_tag="DP116_13500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995034.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA gyrase subunit A" /protein_id="PRJNA477356:DP116_13500" /translation="MTTSQERIIPTDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPV HRRILYAMHELGLSADRPFRKCARVVGEVLGKYHPHGDTAVYDALVRMAQDFSMRYPL IAGHGNFGSVDNDPPAAMRYTECRLQALTSDSLLQDIESETVDFIDNFDASQQEPTVL PARIPQLLLNGSSGIAVGMATNIPPHNLGELIDGLVALIQKPETNNTDLMQYIHGPDF PTGGQILGTASIKEAYTTGRGSITMRGVATIETIEQRGRPDRDAIIITELPYQTNKAA LIEKIAEMVNDKRIEGIADIRDESDRDGMRIVIELKRDAYPRVVLNNLYKQTPLQANF GANMLALVNGEPQILTLKEFLQVFLDFRVEAITRRTRYELRKAEERDHILQGLLIALG RLDEIINLIRHAPDAPTAKGELMTTYGLSEAQADAILQMQLRRLTALEADKIRQEHEQ LQAIITDLQDILARRERILEIIETEVKQLREKHATPRRTVISPLEGEIDERDLIANEK ALILLTEQGYIKRMPVNTFEAQSRATRGKAAAKMKEDDGVEHFLTCCDHDSVLFFSER GVVYCLKTYQIPISSRTSRGTPIVQMLPIPKEEKITSIVPVAEFTSDEYLVMLTKGGN IKKTELAAFSNIRANGLIAISLEEGDQLRWVRRARVEDSVIIGSRRGVAIHFRCDHEQ LRPLSRATRGVKSMKLKKGDELVGMDILPAAILATLNTETEAEIEEVETEEIENEEST EVPANGSTGPWVLVITMGGYGKRVPVAQFRLQNRAGQGLTATKFKNRKIKDQLATLHI VNSDDETMMVTSRGIIIRQAVNAISVQSRSATGVRVQRLDEDDVITGVAIVPPDSGDA AEAE" gene 14262..15908 /locus_tag="DP116_13505" CDS 14262..15908 /locus_tag="DP116_13505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206551.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="apolipoprotein N-acyltransferase" /protein_id="PRJNA477356:DP116_13505" /translation="MQQKQSKKQGKQGEKFTILFPYLIALSSGILMGLTVAPVGVWFL AWLSLAPLWVLVIRSAQRKNLSSSASSAPPASPALLPLSLAWGIGYHGIALSWITGVH PMTWMGVPWLASLAIALFCWIFITLWGAALVASWAIGMRVISRSIHNGFVRVLIGTAL WCGLEALWSAGPLWWSSLSYTQSPHNLIILHLGQLSGPSAVTAAIVAVNGLIAEALVN RRGAEYTEEKIKFFSALSASGRFVYLSTAAALFITLHLIGFSLYSRPLAQPPETALKV GIIQGNVPNEIKLYPEGFRRAIQGYTTGYLKLANQSVDAVLTPEGALPFFQDAIIRSS MVAAVREKGVVAWIGGFYKQGSSYTNSLFTVAGDGEIKSRYGKAKMVPIGEYVPFESI LGGIVSRLSPLDEHQVPGSSNQVFDTPFGRAIVGICYESAFSGQFRRQAAAGGQFILS PSNDAHYSAAMPSQHHAQDIMRAIETDRWAVRATNTGYSAFVNPHGKTVWISGHNTYE VHAETIYRRQTQTLYVRWGDWLTLLLLGAGTLAWLIPKAD" gene complement(15895..16098) /locus_tag="DP116_13510" CDS complement(15895..16098) /locus_tag="DP116_13510" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13510" /translation="MVKTIRQPTRKAITCTAKNLSLIISKVVNFLINHPWFISRPWTL GVVRKLLKIDLEIIRFVVKINQP" gene 16514..16849 /locus_tag="DP116_13515" CDS 16514..16849 /locus_tag="DP116_13515" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13515" /translation="MATLKLIELLCIDPEDTDMKDEAYIRIITTDRSLRIPTAGSFPI SKSQSVSLKSNTINFTGKAEIRLFDQDSLDADDFLGTNTVDSSKLGAGKLKFIKSGVN YELKYEVIR" gene 17215..17400 /locus_tag="DP116_13520" CDS 17215..17400 /locus_tag="DP116_13520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876260.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13520" /translation="MGTEGSNVALLPRWTRKGDKTRFSTFGLGAAYGVYTIAAAISLF FVLFFIKETKGIELENM" gene complement(17609..18955) /locus_tag="DP116_13525" CDS complement(17609..18955) /locus_tag="DP116_13525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410297.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TldD/PmbA family protein" /protein_id="PRJNA477356:DP116_13525" /translation="MKHEELSALEVSFNQLIETLLNKKEESEQFTVKLSSERSQFTRF NHAKVRQTGCVTDGSIELTLMQNQRSSFRQFPFTGHWEIDWQLAYEALQELREELPLL PLDPYLVLPSGTNTSREVHCGNLLAEEVVVPTVLEQLTELDFAGIYAGGVVIKAYADS FGQKHWFSTETFSLDYSIFTTEAQAVKGTFAGSEWNQEAYSEKISDAKRQLQLLSRPI KELPRGQYKTYFAPAAVADLVAMLSWGGVSEADLQQGGSALASLWRKEKQLSVAFHVK ENFQRGLVPRFNELGEMAPLELPVIEKGVLVNTLVNSRTAREYQKTANGANSYESLRA PEISPGNLSFEDILTHLNTGLYVSNLHYLNWSDRPTGRITGMTRYACFWVENGEIIAP IENLRFDESLYRFWGENLVDLTNFQEFVPEVGTYDGRQLGGSMVPGMLVEDFTYTL" gene complement(19276..20715) /locus_tag="DP116_13530" CDS complement(19276..20715) /locus_tag="DP116_13530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320419.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TldD/PmbA family protein" /protein_id="PRJNA477356:DP116_13530" /translation="MWSELTKAIAQINIPADWLGIRVVKETSFTCHVRDGIPQSNGKS STMGFMLEVMINGCIGYAATNSLRILDLQTAAQKAYTQALAASEWWIHPFGVENERPK VVGQYISPFLKPFDALSPGEINDLLVRVCHTLRVSDQIVQTLANMTTSQRETWFVSSN GSEVYQKFMFFATHYGATAQDGTVVQQRSNNGLQANCYQGGLELFQEADLWVRVQQIG EQAVELLTAEECPTTRTSLVLAPDQMMLQIHESVGHPVEIDRILGDERNYAGGSFVNK SDFGTLVYGSPQMNITFDPTVLGEFASYGFDDTGAVATREYLIKEGVLQRGLGSLESQ ARAGVPGVACARACSWNRPAIDRMANLNLEPGEGSFEEIIAGIEHGVYMESNRSWSID DRRYKFQFGCEYAKLIENGKLTKTLRNPNYRATTPEFWHSLVKVGNSSTWEMYGTPYC GKGEPNQAIWVGHGSPICVFANVEVFGGG" gene complement(20935..21081) /locus_tag="DP116_13535" /pseudo CDS complement(20935..21081) /locus_tag="DP116_13535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995265.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="four helix bundle protein" BASE COUNT 5866 a 4573 c 4531 g 6118 t ORIGIN 1 tcatgaactg cgttcgtctg gaattattct ctgacccatt atcttgatta agaataagtg 61 ttctgatttg gggaaatcgc caactctcac ttttccaaaa atcttctaga acatcaacaa 121 taaaatcact cgttatctga gactctgtaa agtataaaaa cagttcatct agctctggaa 181 gaaagatgcc ataaggagtt acagttgttt ttggattata atcatggtct tcagtctcgg 241 ttaaagctct gttagagcct cctctatcaa aagagccaat gtttacacga gctttggcat 301 caagactgag gcgtaaaatg ctctcatcat ttaaagctga ttgattaact attgcaagtt 361 gctcaaagat tgcatcagtt tgtgggattt tttttgaggt aaaacttttg ctacccgttt 421 aagtcgataa cctaaataat tcaatttgat gcgaattgtt tcttcggacg gtaactgttc 481 atcagtataa ttaaattttt caactaatag ctttctcact tggcttgcag ttaatcgcgt 541 atacagtctt tgacttttaa aactagggtc agtttgactc tgcgaatcca ccaaattttt 601 aatatcttct aataggttag tcaggtgttc ttctgtttta taacgccctc tggctgaata 661 attatcaaca caagcaatac cactaattaa ttctttaatt cctttacgga tagtatttct 721 atcccatccc aatttttttt gagctagtag ttgccccccg taacccaact ccatgacaat 781 
ttgtgcttga aaacgccgcc ttgctgcacc tttaagttgg cttgcggttt cctggaacaa 841 ttttatgact ggatcagtta attctatcga cacttcaaac acccaaaatt ttcctgacca 901 attgaattta ctacaaaaag gtaaggagaa gcagccaacg gctgcttctc cttccccacc 961 cccctaacaa tgattattat tattgacttg ggtgatttta ttccttggaa gtccctaaac 1021 aggtgcatct tggcagagaa ctgtgacaca gttccataaa ggtaatgccg ttacataaca 1081 cctacccaaa gctacccaga attggacttt tctacacttt gggtttcgtt tacttccttg 1141 cttcatcctg gctcgttcgg acgacaccag tccacaggct atcactgtct aactgacagc 1201 tttagacaaa ttgattgctg cgtttaaatc gcggtcaatg acaaaaccgc aatcaccgca 1261 ctcgaacact cgttcagaca aggtgagtgt ttctttcttt gtcccacagt tagaacatgt 1321 cttggaactt gggaaccatc ggtcaaccac aaccaacttt gaattataca actggcattt 1381 gtaagtcaac tgacggcgga actcgaaaaa gctcatgtca gcaattgatg ccgccagctt 1441 atgatttgcc atcatgcctg atacgttcaa atcctctatc acaactgttc cgtggttctt 1501 ggctagcagt gttgtgagtt tgtggagcgt atcttttcgg atattagcaa tctttctatg 1561 taacctagca atttgtagtt gtgctttttt ccagttagat gaaccgatga ttttatggcg 1621 atgcaaccac tgcattctac tgagcttggc ttcgtatttc ttgtaggact tagcgcctag 1681 tacaacttca ccagtagaaa gagtggcaag gttttttaca ccaaggtcaa cgcctacaac 1741 actcttggaa gtgcattctt gagtttctac ctcaaaccgg aaactgataa accattggca 1801 cgacaaaagc taccactaca ggagtaatta tgggtaactt tgtccataaa ttaagttaca 1861 gtttctaacc ctaggcttgc taaacctgtt aatgagtaga cgaacagaca gctataagca 1921 agccaaaact tttgatattt atacaaccaa tacccaatga ttccgactac agtaagaatg 1981 agccacgatt gataaattga aggggctgta atccagtctg gttggggata ttgcttgatg 2041 aacagataat tataattatc agtgaaatga atacctgtcg aaataatact taaaatgaga 2101 atacttatca gtaagttatg agacttaacc tcaagattca taatcaggtt tcctttaata 2161 atcatcgttt tacggtattt caaccagtga attagccagt tgagtatcca ccgttgaatg 2221 cacataaggt gcgaataagc aacgttcctc tgggcgtaac cgtcttggac gcatcgtttg 2281 tgtatccaca aacaatcctt tttgccatcc ctctgctgcg agtgtttcct gttctttgta 2341 aaagcgaaac tgcataatag cagatgcgtt tttcagttca gagatccaca tttggacctg 2401 cacgcgatcg cctaaatgta gcggcaattt ataggtaatg ttagtttgaa ccagcacagg 2461 agcaaaccct 
tgctgaaaaa tttcctgcgt tggcattcca acggcttcca gcagttttgt 2521 ccgtccaatt tccatccact gaatatagac agtgttgttc acgtgcccaa tgaaatcaat 2581 gtggaaggaa tacacctcta actcaaggga aatcttttgc atcctgaatc ctaagtcact 2641 gctcagtgac aatgtatcat agtcaccagt tggtgacaag tagatgagaa cgattcatgg 2701 ctcgggataa agaagaaacg aaggcacgaa ttctggcagc tgttggtaaa ctattggcag 2761 aatcgggctt taagcaactg ggggtgaatg cgatcgcccg tgaagccggc gtcgataaag 2821 tcttgattta tcgatatttt gagaacctcc cttctctgct gcaaaccttt ggcaaggagg 2881 ggaactactg gacgagtgtt gaagaattag ttggggatga aacagccgtc gatgcagaat 2941 cgctggctga ctggatggtt ttgttactga cgcgttttct acatgacttg caaaaacgac 3001 cgattacgca agaaattatg cgctgggagt tactcgaagg taatgagtta acgcatgaat 3061 tggcaacggt gcgcgatcgc gtggcgattg aaagcttgga atttctcaag caaaagtgtt 3121 ctttcccgcc agataaagac attcctgcta tcagtgcggt gttgattgct ggaattgttt 3181 atctagttct gcgaaccaaa gtcagcaata cattcttagg aattgatttt agttcaccaa 3241 ccggatggca aaggattgaa ggggcgatag catctttgat tcagccaatg gttgaaactg 3301 aggaataaag cagcttctca atatactgat atcttgcacc attacgagta aaccactaat 3361 atttatattg aatagtgttt aaatttactc aaaaataata tcgcgtccca gtccatttta 3421 atggacttcg cctatgcagc ccaggactta cagtcccgtg acgagatcag tgcgtcaaat 3481 aatatgtcat aactttgaca tttattttcc caggagcaag atgtaagtat accaatacgg 3541 ttggtgataa gccttttttc tcttcccctg ctcaagagct gcctacctca acagataaat 3601 tttcgagagc agcaactata ttgccctata tagggcatct acttatcatt ccaaagtcat 3661 tgctccattc ttacttccat tactttgcag ataaacatct ctggctggga aaggaattgt 3721 aattccttct ttatgatagc ttttgtgtaa ttttttcacc aataaatgtc tagccaagcg 3781 ctggtcgaaa aattcattta cccgcatata aagcgtaaag tttatactat aatctgcaaa 3841 ttcgttaaat cttatataag gttcatttgc tgttaactcg ggagcaattt cttccatgac 3901 ttctttagca acctttacag tcactctttc gactttctca aggtcactgt cgtagctcac 3961 tcccacgttc atcgttaatg taatttcctt aacggggaga tgatagttag taaaaatagc 4021 agaagctagc ttagaatttg gtactattat gacattatta gaaagctctt ttatcgtcgt 4081 gtttcgccaa gttatatctg taacgtatcc ctcgtgatta ccactatcta atttaacgta 4141 atcgcctgtt ctgacttgtt 
tcgatataat taaataaaaa ccagaaaata aatttgctag 4201 tgtatcttga agtgctagac caactgctaa accaccaaca cctaacgttg ttaatattgg 4261 tgtaatttga actcctagtg tttgcaatac aattaatgtt cctaaaacta aaacagcaac 4321 tttagcaaca ttagaaagta gtgaggctga aaaaccatct gttctccgaa aaaatacatt 4381 gacaaaacca gccgtgagtc tggcaaagac aacagttact gtgtataaaa aagagctagt 4441 aataattttt aacagtaatt ctgcaacatc tggctcagga cggtaactaa cagttgcgcc 4501 gtaaaaccca gcaaggaagc accaaaaaaa gggcatacgg tttagggaac gaaatataac 4561 tgcactcccc ggaattcgtc tactagtaac aaatttgtgc agtcttgtaa aaataatttt 4621 ttcggttatg attccaccaa gcaagccagc caagataaat acaagtggca aaatccaatg 4681 ttgtatcatg acaattttta attttcaatt ttggatttga gataaaaatt ctcaattatt 4741 tccttgtcca gtgtaaacaa aactggtgga aatctacgta gcataaacta tattagcaga 4801 caaaaaaata tcaccgattt cttagctttg tgtagaaaca gcaaaatcaa acgtgataat 4861 atctcttggt gttttggctg tgagaaattt ttacaacaat tacttcatgg acttgccgat 4921 agtctaccac cctgattacg ttgccccact acctgaaggt catcgctttc cgatgccaaa 4981 atttcggcaa ctctacgaac tattattagc agatggtgtg gcacaaaaag aacaatttca 5041 tctgccctca cgcccgccac aagagttaat agagttagtt cataccccag aatacgtcca 5101 agcgtattgt gagggcactc ttgatgccaa agcacagcga cgtattgggt taccttggag 5161 ttcctcactg gttaatcgta catgtgtagc cgttggtgga acaattctca cagcacaact 5221 ggcgctcaag tacggtttgg cttgtaatac tgctggtgga actcatcacg cctttcctag 5281 ttatggatct ggtttttgta ttttcaatga tttagcagtt gcaactcgcg tattgcaaaa 5341 actccgccta gttcagaaaa ttttgattgt agacttggac gtgcatcaag gagatggcac 5401 tgcttttatc tttcaagagg atgatagtgt tttcactttc tcaatgcact gcgaagttaa 5461 ttttcctggt actaagcaaa aaagtgattt ggatgttccc ctgcctgtgg ggatggaaga 5521 tgatgcttac ctacaaacat tagctgaata cttaccagat ttgttgtctg aggtcaagcc 5581 agatttagtg ttgtacgatg ctggtgttga ccctcatgtc ggggacaagc taggaaaatt 5641 agccttaact gatactggtt tataccgccg agaaatgcaa gttttgagta cttgtattgc 5701 ggctggctat aaagtagcct gcgtcattgg cggtggatat gctgatgatc tcaaatcgct 5761 ggtgtatcgt cattctttgg tgcatcgggc tgcaagtgaa gtttttcggc aatatagact 5821 ttgacaatcc caaagtgcat aagttgattt 
tgcttttatc tatctatacc agaacataaa 5881 aggaaagctt actcatgctt aacttccatt cctaaaatta agtaaggctt catgcacaca 5941 atcaaattta tatcaatgtg taagtccgaa tattctacca atgtatgaca aatgtaatgg 6001 tttgaaaatg acatttatta gatgtaaatc aaagcttatt gctaatatga tgaaagttcc 6061 aaagacagaa aacaaccaac aaagtgaact ttcatacaca aaatattaac aaattgagct 6121 tattgaaagt gccaaaatag ctgataccaa aacacaagtg aggatgttat gaatcaatac 6181 gattatattg tcattggtgc aggttcggcg ggctgtgtcg tcgccaatcg tttaacagaa 6241 gacggtaaaa caactgtgtt gttgctggaa gcaggcaatc cgcctaactt accagagcac 6301 gaagttcctc taggttgggt aaaactctgg ggtactgagg tggactgggc atattttaca 6361 gaagaagaac catacctcaa tggacgcaaa atctattgtc cacgtggtaa agtcctgggt 6421 ggcaccagtt cgatcaatgc catgatctac atccgaggca accgtcacga ttatgatcgc 6481 tggcaagagt taggtaatcc cggttggagt taccaagatg tattgccata cttcaagaag 6541 tctgaaaacc agcaatgggg agcatccgag tttcacggtg tcgatgggga gttgagcgtt 6601 agcgatccaa ttgccccggc ggtgacatca caaagatttg tagaagcagc gatcgcactt 6661 ggctacgaac gcaatctcga cttcaatggc gcacaacagg agggagcggg actttatcag 6721 ttgactatca aagatggtaa gcgacacagt acggcgacag cgtttttagt acctattttg 6781 aatcgcccta acttaacagt tactactggt gcgttagtga ctcggttatt atttgaggga 6841 acacgcacgg taggagtgga atacatacac cagggaacga tacaccaatc ctttgtgcaa 6901 caggaagtca ttctctcggc tggggcgatc gattcgccca aactgttgat gctttctgga 6961 attgggaatg cagaaattgt gcgatcgctc gacattcccc tagttgttaa tttacctggt 7021 gtcggtcaga atctccaaga tcaccttcac gtcgccgtgg cacatcaagc cacccaggat 7081 ctgcaacctg ccccaaccag caatatcgcg gaagccggat tgttcctaca tactgaaggt 7141 cgtttagata ctgcgccaga tttacagttg ttttctggtc ctgttttgtg gacacatcct 7201 gcttatgccc gtgaaggtcc gggattcgca gctacagcgt gtgtgactaa tccccaaagt 7261 cgcggcagtg tcagtctgcg ttctgcttcc cccaatgatt caccgataat ccgaatgaac 7321 tatctccaga gtgagtccga tgtgcaaaag ctagttgcag gaattaaaat tatccgtcag 7381 ttgtttaact caagtgtatt tgatgagttc cgaggcgagg aagctgcccc tagtgccgat 7441 gtaactagcg atgaagcact ccgagcttat atcagagaaa cctgcgacac tgtataccac 7501 cctgtcggca cttgcaaaat gggaactgac gccgattcag 
ttgtagaccc cgaactacgg 7561 gtacatggtg ttgatgggtt gcgtgttgta gatgcatcaa ttatgccatc tatcactacg 7621 ggaaatacga atgcacccac aattatgatt ggcgaaaagg cagctgattt gattaaagcc 7681 gcaggacgtg ttttagagca agcaacctca acaactgcga ttgtctaaag ggcactgcgc 7741 aagagtgcga tgatactcaa cacccgcctg agcgtaaagc gcacgctcgc ttgactatct 7801 aaatttatct ttgccacttg ctgaaaaagc agccgatttg attaaagcta caggttgtgt 7861 ttgacagcaa acgtccttac agcggttttc atatgaatag accacaaatc gtagggtggg 7921 cattgccacc aaaagcctat gtgataaaca tttgagatta ggacttacgc attgacaaaa 7981 aatgtcattc atgtgttggc agcacaagtc ttggtgagat tgtccacaat gtactcacgc 8041 acaacgaaag tgaataaatt cctttgcaat tcataccaat tagaaattat gttttcagag 8101 cgcattagct ctaaacgaac aaatgcttga agtgaacatg cggatgtgtg ttttaattgc 8161 atggctatct ctaaccatga atcgacaaat cccacatagt tgtttgatgg ctctatgaaa 8221 actttcaatt ccccaatgga tatcatgaat tgtgacgaag gtacttcttg ttatctgctt 8281 gagcttttca tcatctggta gatataaaat atagtgtcta gagtcttctt ttttaaagtc 8341 gcacctttac aacttaacaa atccaaattc tctcagatga gtcagcaatc cttcttcggg 8401 aatatccaaa gtgtttacct gacagtactt acctggttca tttgagacag ttctattctt 8461 ctcaatgccg aacagaaaac ccaacttctc gttttttaga aatttcaaat tttctactcc 8521 tgagtaccaa ctatctcctg ttactagcct tggctttaaa ccccaagcta gcacttcagt 8581 tagcatttcc tcaattgtcg ttctttgttt tcccctccct tttgtcatat attctgtagt 8641 tgagcggaac tgaattttca tgaatatcac tgtagtatag tgttattaaa ttcaatcctt 8701 taatcgtttt atgatatttg ccagaccaaa aataacttat taattctgca tatttaggat 8761 tgctatacag cttttctatt actgtgtcat caatacttaa aatccctcct ctcatattaa 8821 taatagtctt gaccatgtcg aacaaatctt ttggttcgta tctctccctc aacaaaaaac 8881 gattcacact atcatgtgaa acatctccca gtatctctgc taatctactg catcctccat 8941 gctttggctc cgatagcaaa aacagagtgt aatgttccag attgcattgc gctgtcgagg 9001 gtttgctgat ttctcgaatt gtccgatacc tgcttgtaga cgacatttgt ttttcaacgc 9061 attattctat cattgtgtca atgcataagt cctacttccg ataacgaaat tcgtttttct 9121 aaaccagttg tagcgagagc ggtgagcagc gcgttgcggg ggttcccccc gttgtagcga 9181 ctgcgaaccc gaagggtgac cgagcgctgg cactctcgtg ttttttagag 
gtatcgtgag 9241 cagcgctcgg tcaacaactg gtaacatcgc tcaatatggt cgtttttcaa aaccttacta 9301 cagcaacttt tcgggagctt ttatgctttc tgtttttaag agatgaaatg tggattggtc 9361 tcacctttct tttactgcac aacaccagat atttgtcaat gcgtaagttc tagagatcaa 9421 ccctttcaag gtgttaaatc tggaagagat gaaaaacgga aaaccgctat ttcaggcttt 9481 gatttttcag acttcgagaa acgaaaataa cttttcagag ttttggttgt gagccaaaac 9541 tctgaaatct agtaacttcg ttgcccacct tgaaagggct ggttctagag atattggtga 9601 gcaatgccca ccttacagaa aaccgctgta aaaactgcga tcgccttgat cgcctaacta 9661 tctgaattca cctttatcaa tgttttccgg acaaaagtga atgacgagta taaatagtac 9721 gcgattgtac cgggcgcaga cgtctctggc gtatcgctcc cacaaaaatt atgttttcaa 9781 ccacggtttt tgtcacattt ttaagtaaca ttacttactg acaaaagcgg gttgaagaag 9841 gcagaaggca ggaggcagag ggcagaaggt tctgttcaga aggggattca gacccctcct 9901 gaacagcagc caccaaattg aaaatttggt gggggtctta aacccttggc tccctttggt 9961 cgcccagagg gttcgggcat tcacgagaag ggattttcac ttatcacttc tgccctctgc 10021 cctgtgcctt tcttgataac ccgcaacttg ggttattaac tgtttgtttc attcgtagcg 10081 ggtgctgtgc cgccagtgcg cctgctcaat tatgtgaaaa gtcaggtgca agtgaggtat 10141 agcatagaaa aagaacaaat tttgtatctt tcgtttcact tacacaggga aataagcagt 10201 gctaccagac aagcgccctc agagagagca aactcaacgt tggacttttt cccagctttt 10261 tggacgagca agacgtcatc ctttcatgat ttcacgctgg gttttaagct gggctattgt 10321 aggaactttt tgtggtttat ttgctgcgct gtactggaat atcttggaac tgataattca 10381 tggactgcaa agatttcagg gttctagtct tttgttggtc atgccattag ccggtttagt 10441 cattggttta gtgattcatt tccttggtaa tcctggtgaa attgccgtta ttgtcgataa 10501 catccatttt cgcggcggac ggttggatgc tcgcaaaaat ccttcaatga ttcttgcttc 10561 cttagttagt atatcggctg ggggcagtgc tggaccagaa gcaccactgg tacaggtcac 10621 aggttctttt ggtacttggg ttgctgagcg tttaagactc gaaggtgaag acgttagatc 10681 tatgagttta gcagcgatgg ctgcaggctt tactgctttg tttggctcac cccttggcgg 10741 tgcaatgttc gctctagaaa ttttgcatca ccagcatatt gtcgagtatt acgaagcgct 10801 aatgccagcc attgtttcga gttgcgctag ttatgtggtc tttgcagcta tcacacattt 10861 gggaattgca ccgacttggc attttcctca ataccgctta 
gatagcatcg atgattttgc 10921 cgtcgctatc atgtttggga ttgtaggagt cgtagcagga tggattttta tgggcatttt 10981 tcgcggatgt gaccgaattt ttgccttgat tcctggacca atttatctac gcacaactat 11041 agcggggttg ggattgggca ttttagcaac tttcttacct ctgactcgtt attttggaca 11101 tgaggagtta gataaggttg ttaatcaaag ctttcctgtc tttttcttgt ttatgcttgc 11161 cttagctaaa atggctgcca tcagcataac tgtcactggt ggatggcgcg gtggatttat 11221 cattccctta tttttcactg gtgcttgtgt cggtaaagct gtagtagcct tgattcctgg 11281 actcaatccc acccttgcca tgatttgtac aatggcggct atcaatgcgt ctgtgacgcg 11341 cacacctatc agcacaacct tgctactgtc aaaattgaca aattttagtc ctttgactcc 11401 aattctgttt gccagtttga ttggattttt tcttgcccct aaagtcccct ttattgcttc 11461 tcaactgaag tctgaacaag gaacactgat taatcatccg tagtcacttg ccagctatca 11521 attaacagct taataaccgt ttaatattgg tgtttacgac tcacccacaa cccagtaaac 11581 agtaaaatat gctagaattg tatcatatta tggcagtaat tcagcaataa ttgggaacaa 11641 ttttgctctt ttctcagtat aaagcctaaa atttgcaatt tttggagttt tttgggttat 11701 gacaacctct caggagagga ttatccctac tgatctgcgg aatgaaatgt cccggtcata 11761 tttggaatat gcaatgagcg tcattgtagg tcgggcacta ccagacgcca gggatggtct 11821 gaaacctgtg catcgtcgca ttctctacgc aatgcatgaa ctcgggttga gtgctgatcg 11881 cccctttcgt aagtgcgctc gtgtggtagg ggaagtactg ggtaaatatc acccccacgg 11941 cgacacagca gtctatgacg ctttggtgcg gatggcgcag gatttttcca tgagatatcc 12001 cctgatcgca ggacatggaa acttcggttc agtagacaac gacccacctg cggcgatgcg 12061 atacacagaa tgccgcttgc aagctttaac cagcgattct cttctacaag atatcgaatc 12121 agaaacagtc gatttcatcg ataactttga tgcttcccag caagaaccaa cagttttacc 12181 agcacgtatc ccgcagttgc tgctcaatgg ctcttctggt attgccgtgg gtatggcaac 12241 caacatacca ccccataact tgggcgaatt aattgatggg ctggtagcac tgattcaaaa 12301 gccggaaaca aataatactg atttaatgca gtatatccac ggtcctgatt ttccgactgg 12361 aggtcagatt ctaggaactg catcaattaa agaagcttat accactgggc gtggttccat 12421 caccatgcgc ggcgtggcta caattgaaac aattgaacag cggggacgcc ctgacagaga 12481 cgcaattatc atcaccgaat tgccctacca aaccaacaaa gcggcgctga ttgaaaaaat 12541 cgccgagatg gtgaacgaca 
agaggatcga gggaattgca gatattcggg atgaaagcga 12601 tcgcgatggg atgcgaattg tgatcgaact caagcgcgat gcttatcccc gtgtcgtgct 12661 aaataacctc tacaagcaaa cgccactgca agccaacttt ggggcgaata tgctggcgct 12721 ggtgaatggc gaaccccaga tactcaccct gaaggagttt ttgcaagttt tcttggattt 12781 ccgcgttgaa gctattacca gacgcacccg ttacgaactg cggaaagccg aagaacgcga 12841 ccacatactg caaggcttgt tgattgccct agggcgttta gatgagatta ttaacttaat 12901 tcgccatgca cctgatgcac caactgccaa aggcgagttg atgacaacct acggactttc 12961 ggaagcgcaa gcagatgcaa tattgcaaat gcaactgcgg cgtttaacag ctttagaagc 13021 agacaaaatc cgccaagaac acgagcaatt gcaagcaata attactgact tgcaggatat 13081 attggcacgg cgagagcgga ttttagaaat tattgaaacc gaagttaagc aacttagaga 13141 aaaacatgcc acacctcgcc gcacggtcat ttcacccctt gaaggagaaa tagatgagcg 13201 tgacctcatt gccaatgaaa aagctctgat tctgctgact gagcaaggtt acatcaaacg 13261 aatgccagtg aatacctttg aagcgcaaag ccgtgcaacc agaggcaaag cagctgccaa 13321 gatgaaagag gatgatggcg ttgagcattt cttgacttgc tgcgaccatg acagcgttct 13381 attttttagc gaacgaggtg ttgtttactg cctgaaaacg tatcaaatcc ccattagttc 13441 ccgtaccagt cgcggcacac caattgtgca aatgctaccc attcccaaag aggagaaaat 13501 cacttcaatt gtccctgttg cagagtttac cagtgatgaa tatttggtca tgcttaccaa 13561 aggtggcaat atcaagaaaa ccgagttagc agcgtttagc aacattcgtg ctaacggttt 13621 gattgccatt tctttggaag aaggcgacca actccgttgg gtgcgacgcg ccagagtcga 13681 agacagcgtg atcatcggtt cccgtcgggg tgtagcaatt cactttagat gcgaccatga 13741 acaactgcgt cctttgagtc gggcaactcg tggggtaaaa tccatgaaac tcaagaaagg 13801 agatgaactg gtgggaatgg atattctccc cgctgccatt cttgccactt tgaatacaga 13861 gactgaagcc gaaattgagg aagtcgaaac agaagagatt gaaaatgaag agtctactga 13921 agtcccagcc aacggcagta cgggaccttg ggtgttagtg attacaatgg gaggatacgg 13981 caagcgtgta ccagttgccc agtttcgtct gcaaaatcgt gctggtcagg gtttaacagc 14041 aactaagttc aaaaatcgta aaatcaaaga ccagctggca actctgcata ttgtcaacag 14101 tgacgatgaa actatgatgg ttaccagtcg tggtatcatc atacgtcagg cggtaaatgc 14161 gatatccgtc caatcgcgat cagcaacagg agtgcgagtg cagcgcttag acgaagatga 14221 
tgtcatcacg ggagtggcaa tagttccacc cgatagtgga gatgcagcag aagcagagta 14281 aaaagcaggg gaagcaaggg gagaagttca cgatcttatt cccttaccta attgctctat 14341 ctagtggaat tttgatgggg ctgactgttg cacctgttgg ggtgtggttc ctagcatggt 14401 tatctttagc acctctgtgg gtcttagtga ttcgttcggc acaacggaaa aacttatcct 14461 cctctgcttc ctctgccccc cctgcttccc ctgctcttct ccccctctcc ctcgcctggg 14521 gtattggata tcacggaatt gccctatcct ggattaccgg ggttcacccc atgacttgga 14581 tgggggttcc ttggttggcg agtttggcga tcgccctttt ctgctggata ttcattaccc 14641 tttggggagc tgcattagtc gcgagttggg cgatagggat gcgagtcatt tcccgttcaa 14701 tccataacgg atttgttcgc gtcctcattg gcacagcatt atggtgtgga ttagaagcac 14761 tttggagcgc tggacctctt tggtggagtt ctctttctta cactcaaagt ccgcataatc 14821 tcattattct acacctcggt caactttctg gacctagcgc tgtgacagcg gctattgtcg 14881 cagtgaatgg cttaattgct gaagctttag taaaccgcag aggcgcagag tacacagagg 14941 agaaaataaa gtttttctct gcgctttctg cgtctgggcg gttcgtttat ttatcaaccg 15001 ccgcagcact attcatcact ctccacctca tcggatttag tttatacagc cgtcctcttg 15061 ctcaaccacc agaaacagca ctaaaagtag ggattattca gggaaatgtc cccaacgaaa 15121 tcaagcttta cccagaaggc tttcgtcgtg ctattcaagg ctacaccact ggatatttga 15181 aactagcaaa tcaaagtgta gatgcagtac taacgccaga aggcgctttg ccttttttcc 15241 aggatgccat catacgtagt tcaatggtcg cagcagtgcg ggaaaaaggc gtggttgcat 15301 ggattggcgg tttttacaaa cagggaagca gctatacaaa tagtttattt acagtcgctg 15361 gcgacggtga aataaaaagt cgctatggta aagcaaagat ggttcctatt ggggaatacg 15421 ttccgtttga atcaatttta ggtggtattg ttagccgttt atcgcctttg gatgaacatc 15481 aagttcctgg ttcgtcaaat caggtatttg acacaccttt tggtcgcgct attgttggta 15541 tttgttatga atctgcgttt tctgggcaat ttcgtcgtca agctgctgcg ggtggacaat 15601 ttatactgag tccttccaat gatgcccatt acagtgcagc gatgccatct cagcaccacg 15661 cacaggatat catgcgggcg atcgaaactg acagatgggc agtgcgagca accaatacag 15721 gatattcagc ttttgtcaat ccccacggca aaacagtgtg gatatctgga cataatacgt 15781 atgaagtgca tgcagaaacg atttatcgac gacaaactca gactttatac gtgcgttggg 15841 gtgattggtt aacactgtta ttgttgggag cgggaacgtt agcatggttg 
atacctaagg 15901 ctgattaatt tttaccacaa agcggatgat ttccaggtct atttttagca acttgcgcac 15961 gactcctaaa gtccaagggc gggaaatgaa ccaagggtgg ttgatgagaa aattgacaac 16021 cttactgata atcagtgata agtttttcgc agtgcaggtg attgctttgc gagtgggttg 16081 acgtatggtt ttgaccatga tcctcctggt agatggtgtg gggtgctttg aagttggtga 16141 ttactttaat aaaggaaggc tacaacagaa tagccgaatt ccttaagaag gttttaccac 16201 ttccaagctg gaatctggta cgcgactgaa atcaggaaaa aatgcttgtt taccaaggat 16261 ttcccaactt acagcggttt tcatatgaat agaccacaaa tcgtagggtg ggcattgcca 16321 ccaaaagcct atgtgataaa catttgagat attggtgagc aatgcccacc ttacagaaaa 16381 ccgctgtaaa tggattaatc gttaaagttc gggaatcctt acattaattg gagtgtggtg 16441 caacacaacc aaatctgtaa aatcgcaagt acggcttgta tctacaatta tcaaacttaa 16501 atcgaagaaa atcatggcaa cactaaaact gatcgagtta ttgtgcatag atcctgaaga 16561 cacagatatg aaggacgagg cttatatccg aatcatcact actgatcgtt cacttcgaat 16621 tcccactgca ggttctttcc ctatttccaa aagtcaatcg gtaagtctta aaagcaatac 16681 catcaacttt acggggaaag cagaaattag gctttttgac caagattctc tagacgctga 16741 tgacttttta ggcacaaata ccgtagattc atcaaagcta ggggcaggta aactcaagtt 16801 tatcaagagc ggtgtcaact acgaattaaa gtacgaggtt attcgttgac atcctcactc 16861 ccgcaaagcg ttcgactgag cgctcacgcc gaagtccacg gtgattccta agactcacga 16921 taatggtttc tgtttccttc ccgttcggct gacgccgtga gcgtcagccg aacgggtttt 16981 ttgcaactgc ccttcagtcg aatgactgtc gggtcttacg ttcgctccac agactgaaac 17041 ggcaagacaa acaccgccaa aatattttta gctgcataag gtagcatgag gataaccaac 17101 gatcgcccca taacggcacg ctgcgcgttc gcccgattcg tgcgctttgc gcttacggtc 17161 acgctgcgat agccttctgg cgtggctgcg cgtctacgtt aaatcaggta ttctttgggc 17221 accgaaggga gtaatgtcgc actactaccc cggtggacta gaaaaggaga taaaacgcgg 17281 ttttccactt tcgggctggg tgctgcttat ggtgtttata caattgcggc tgctatctcg 17341 ttattcttcg ttctcttttt tattaaggaa actaaaggaa ttgagttaga aaacatgtaa 17401 ttggcatcga actacagcta ctctccaaaa atttaaaggc agatgcagat ttgaattcca 17461 gacttttatg aaagattacg tgcatctgct ttgcacccaa tgtcccagaa taaagctatg 17521 gatagaattt ttacttgttc acactcatgc 
ctacattcac cgtcctcatg gaggtgggaa 17581 aaatgtgaca atgattaagg cgatttcttt agagggtata cgtaaaatcc tctaccaaca 17641 tacctggaac catactacct cccagctgac gaccatcgta tgttccaact tcgggcacaa 17701 attcttggaa attggtcaaa tctacgagat tctctcccca aaatcgataa agactttcat 17761 caaaacgcaa attttcgata ggggcaataa tctcaccatt ttctacccaa aaacaggcat 17821 aacgggtcat acctgtaatt cgaccggtgg ggcgatcgct ccaattcaag taatgtaaat 17881 tagacacata caaacccgta tttaaatgag taagaatatc ctcaaagctt aaatttcctg 17941 gactaatttc tggtgcacgt aaactctcat aactattagc gccattagca gttttttgat 18001 attccctagc ggtgcgagaa ttgactagcg tattgactaa aactcctttt tcaataacag 18061 gtaattctag cggagccatt tctcccaatt cattaaatcg cggtactaat cctcgttgaa 18121 agttttcttt cacatgaaat gcaacagata actgcttttc tttgcgccac agtgacgcta 18181 aagcactccc tccctgttgt aaatcagctt cactcacacc tccccaagaa agcatcgcca 18241 ctaaatcagc aactgcagct ggtgcaaagt aggttttgta ctgtcctcgt ggcaattctt 18301 ttattggacg agaaagcaat tgtagttgtc ttttcgcatc gctaattttc tcgctataag 18361 cttcttgatt ccactcgcta ccagcaaaag ttcctttgac agcttgtgct tcagttgtga 18421 atatagaata gtctaatgaa aaagtctctg tactaaacca gtgcttttga ccaaaagaat 18481 cagcataagc tttaataaca actcccccag catatattcc agcaaaatct aattcagtaa 18541 gctgttctag cactgttggt acaaccactt cttcagccaa tagatttcca caatgaactt 18601 ctcgactggt atttgttcct gatggtaaaa ccaaatatgg atctagaggt aacaagggaa 18661 gttcttctcg tagttcttgc aaagcttcgt atgctaactg ccaatctatt tcccaatgtc 18721 cggtaaaggg aaactgccga aaactactgc gttgattttg catcaaagtt agttcaatgg 18781 aaccatcagt aacacagcct gtttgccgca ctttcgcatg attgaaacga gtaaattgac 18841 tgcgttcact gctaagtttc acagtaaatt gttcactttc ttctttctta ttaagaagag 18901 tttcaatcag ttggttaaag ctgacttcta aggcagatag ttcttcgtgt ttcatgagca 18961 ttgtaaatcg agagaaagct ggggagctag gggagatgtt gaggaaattt ttgtatctcc 19021 tagggagatg ctacgtgtct gtgtaggaga tattttgaat gtgttacagc aatcctagat 19081 aggtcgtgaa caaaaaataa aaaaaacgaa ccgccaagac gagccagcgc tgcaggaggg 19141 tctccctccg taggcgtctg gcgtcgcaaa gagcgcaaag aaaagaaaga gagaggaaga 19201 aggtgtttac 
atcctattta ggaaggctgt agcagcgtgc cgcaggcagc gaattcattt 19261 gtccctgtgt cttcttcaac caccgccaaa gacttccacg ttagcaaata cacaaatggg 19321 tgaaccgtga cctacccaaa ttgcttggtt aggttctcct ttgccacaat aaggagttcc 19381 gtacatttcc caagtagagg agtttcctac ttttactaag ctgtgccaaa actctggagt 19441 tgtggctcgg tagttgggat tgcggagggt tttggtgagt ttgccgtttt caattaattt 19501 agcgtattcg caaccaaact gaaatttgta gcggcggtca tcaattgacc aagagcgatt 19561 ggattccatg tagacaccgt gttctatacc agcaataatt tcctcaaaag agccttcccc 19621 tggttctaaa ttcaagtttg ccatgcggtc aattgctggt cgattccaag agcaagcacg 19681 agcacaggcg acccccggta cacctgctct tgcttgactt tccaaactac ccaaacctcg 19741 ctggagaacg ccttctttga ttaaatactc ccgtgtcgcc acagcacctg tatcatcaaa 19801 gccatagctg gcaaattcac caagtacggt ggggtcaaag gtaatgttca tctggggaga 19861 accatacact aaggtaccga aatcactttt attgacaaag ctacctccag cgtaattgcg 19921 ttcatcgccc aaaattcggt caatttctac gggatgcccg acactttcgt ggatttgcag 19981 catcatttgg tctggggcta acactaatga agtgcgagtt gttggacatt cttctgctgt 20041 caacagttct actgcttgtt cgccaatttg ctgcacccga acccacaagt cagcttcttg 20101 gaaaagttcc agtccacctt ggtaacagtt tgcttgcaag ccgttgttgc tacgttgctg 20161 gacaaccgtc ccatcttgtg ctgtagctcc gtaatgggtg gcgaaaaaca taaatttttg 20221 atacacttct gagccgttgc tgctgacaaa ccaagtctct ctctggcttg ttgtcatgtt 20281 ggctagagtt tgtacaattt ggtcagaaac tcttagggta tggcagacac ggacaagtaa 20341 atcgttaatt tcccctggac tcaaagcatc aaagggtttc agaaatggcg agatatattg 20401 accaacaact ttagggcgtt cattctcaac tccaaacgga tgtatccacc actcacttgc 20461 tgcaagtgct tgtgtatatg ctttttgggc agcggtttgc aaatctagga tacgcagcga 20521 attggtagct gcatatccta tacagccgtt aatcataact tccagcataa accccatcgt 20581 agaagatttg ccgttacttt ggggtatgcc gtcacggaca tgacaggtaa atgaggtttc 20641 cttgacgact ctaataccaa gccagtcggc tggtatattt atttgggcga tcgcttttgt 20701 cagttcagac cacataacaa ttcaaagttc aaaattcaaa gttcaaaatg gaagatttca 20761 aaaaatattg ttaactgtta actgttaact gtttccgcat ttgctccaat tgggagcatc 20821 ccagatgtgt aaattgattt tattttgtag tgcgggccgg acagcccgct ttggtaataa 
20881 agtagggagc aagatgctcc cactaccgtg ttttttgcaa aattgggatg ctcccgcttc 20941 aattgatgtt ccagacttta tcaactgttt agaaagtgtt tctgccactc ccgaactttc 21001 atttagtact tgacatagtt taacaatgcg gatggcaaac tcaaacgtcc tgtcagtaat 21061 tgctttctga tgactactca ttttttat // LOCUS NODE_1495_length_21083_cov_4.58136821083 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 21083) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 21083) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..21083 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(86..409) /locus_tag="DP116_13540" /pseudo CDS complement(86..409) /locus_tag="DP116_13540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744260.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" gene complement(1046..3274) /locus_tag="DP116_13545" CDS complement(1046..3274) /locus_tag="DP116_13545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015219548.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NHLP family bacteriocin export ABC transporter peptidase/permease/ATPase subunit" /protein_id="PRJNA477356:DP116_13545" /translation="MRFAFIKSILSRPQTSRVRTPTLLQIEATECGAAALGIVLGYYS RIVPLAELRRECGVSRDGSNAFNVIKAARNYGLNAKGLKPSLENLKTLPPPYIVFWNF NHFLVVEGFFKNRVYLNDPATGRRTVSLEEFNRAYTGVVLAMQPGSDFQKGGKKNNVI SALVSRLQHSRSTILFCLIAGLLLTFPRLAVPAFTQVFVDEILVTERSEWIRPLVLGM VLTAVGRGFLAWLRLTYLRRLMLKLSVTMSGQFLWHILRLPVGFYAQRFAGEISSRVQ LNDKVANLLSGTLATTVIDAVMMVFYFLIMLQYDSVLTSVALSFAGINFFALQFLSRT RVDANMKLAQEYGKVNGVAIAGIQTIETIKASGLESDSFAKFAGNYAKALNAQQELAL QTQILTVLPTFLTALTTTSILFLGGYRVMNGNLSIGMLVAYQSLTASFLEPINSLINF GSMLQDLEADLNRLDDVLQNSVDAEAQRGVEGERERGREGERDNYQLPISFQLQGYVE LRNITFGYSRVDNPLIENFSLILEPGQRVAIVGGSGSGKSTIAKLVCGLYEPWSGEIY FDGVPRTQIPRSVLANSLAMVEQDIFLFAGTVRENLTLWDSTVSQTDLVRACSDAVIH DLILSLPGGYDTKLIEGGMNLSGGQRQRLEIARAMVRNPAVLVMDEATSALDAQTELM IDRNLRRRGCSCIVIAHRLSTIRDCDEIIVLEQGKVVQRGTHEELWDSGGKYASLLRT EE" gene complement(3401..4006) /locus_tag="DP116_13550" CDS complement(3401..4006) /locus_tag="DP116_13550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411947.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_13550" /translation="MTYTPPKLLTFEQFITEYGDNPRYELIDGELRDMEPTGPHEAVS GSIAGKIYAEIFRSNLNWLIPKNCLIKPLASEATALRPDVIVLDQAELKAEPLWQKEP IICNGRTIKLVAEVVSTNWQDDYARKVEEYAFLEIPEYWIVDFRGLGGKQFIGNPKQP TFTVCQLINGAYQQQQYRLQDTITSHLLANLQLKLDDIMPL" gene complement(4092..5018) /locus_tag="DP116_13555" CDS complement(4092..5018) /locus_tag="DP116_13555" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13555" /translation="MYSISIISSLVVGSVFLVTGIVKALAQETFIIHITKLQLFSQKV NLAAAIAFTILECVLGIALILRLFPQWLFPGTIVLLLVLSSLTYWSTSTKRTNDCGCY NGLINISPNQSLLLNILYIALIGLAWFYPVADILTVSQQVTALLISLAMSCLLTGLSY LYLWKKQKPLLDLSPLKVNRLWQPKWLEEYADNLTTGDKLVVFLMPGCQMCKSWVKVL KIVHKRPDLPDVVAGVARTPEEVQEFVQLHNINFPVVAMNPLVMSRLAKAFPTGVLLE NGVIREKWMGVMPLAFVQRVKPNLSVPEVAKS" gene complement(5123..5260) /locus_tag="DP116_13560" /pseudo CDS complement(5123..5260) /locus_tag="DP116_13560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003947234.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="chaplin" gene complement(5517..7124) /locus_tag="DP116_13565" CDS complement(5517..7124) /locus_tag="DP116_13565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874025.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NHLP bacteriocin system secretion protein" /protein_id="PRJNA477356:DP116_13565" /translation="MQDRKSRLFREEALEQLSSPEKLDQLMQVTSQQAWLPLLSMGFL VVIAGIWSVVGRIPLTVTGSGVLIRPRNVLQFQALSAGQLLTLNIKPGDVVRQGQVLA TIDQSNIKQQLQQEIAKLNQLLEQNLNTDRLQKQQTVLALRTLQQQQKDFSETLRRES VAPILHSQTLQALKQKRQSLEQSVSREQVAPVLYKQTLTALAEKRKSLEEQKQQISAL VKTLQQRVETRRSLFQQKIISQDTLLQSQQELLNAQQQISEISTQLKDLDVQKTSSER EYLQNLSKIDDIKNNIQELEVQRTNAERDYLQNLNKLDEIKTKIKDTETQKTKLVQQD LEKSINKLNQIQEVKRKIAQLKLQLTKSSQVISKYDGRILEVGVLPGQVVNSGTRIGT IEAEDRNTKLESLVYFADKDGKQIKPGMTVQVTPSVVKRDRFGGIVGMVTRVSPFPVT TGDIVAQVGNEDLAKSLANSNAARVQVFVQLQKDPNTVSGYKWSSSNGPALNISSGTT TQVRVKVGEVAPISYIIPILRSWTGVY" gene 8036..8647 /locus_tag="DP116_13570" CDS 8036..8647 /locus_tag="DP116_13570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215978.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13570" /translation="MNYRRIFKPGESYTFSRYFELSFTIDDILAELNCTLERKNLILP ESSKILDLQFLQRQLQRNLSHIDLVNETARREALIGPILFEVCDLTNQRLNIEYSIAV NDYLKGTLDYYIAAPQNLLVIEAKQSDLVRGFTQLAIELIALDEWTKSTSELLYGAVT TGEDWRFGVYNRALRKVIQDQKRYQVPEDLLILVKILIGIISG" gene complement(8801..11185) /locus_tag="DP116_13575" CDS complement(8801..11185) /locus_tag="DP116_13575" /inference="COORDINATES: protein motif:HMM:PF00072.22,HMM:TIGR00229" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_13575" /translation="MDTQEPVKILIIDDSAEDRFAYCRYLLQEVEYQYRILEQETGEA GLSLCLQEQPDVILLDFLLPDLDGLEFLSQLQHQIGKNHPPVIILTGQGDESVAVQVM KRGAQDYLIKGKTTSETLRLAVKSAIKNARLQQQLQQSQAALRESEERFRLLSAFAPI GIFQTDGAGRCLYTNPQWQAIAGLTLQESLGDGWARAIHPDDRQQVWAEWNRCTQEGS EFSMEFRFLTPQGGAHWVQANAAAIRSELGEIIGYVGTDQDITVRKQAGEALQNALQK LNYHVENSPLAVIEWDSDLRIIRWSQSAENIFGWTAQEILGKQMNEWQFIYPEDAAAV SEVVRRILSGDQAQTVCCNRNYAKDGSVVYCEWYNSTLTDKSGNLVSVLSLIMDVSDR VRLAFERDRILQLEQAARSEAERANRVKDEFLAVLSHELRSPLNPILGWSKILRTRKL DPDMATQALEIIERNAQAQAQLIEDLLDVSRILQGKLKLNVYSIDLTTVILAALETVR LAAQAKSIELQPIFEASVWQVSGDSTRLQQVFWNFLSNAVKFTPNGGRVEIRLQEVAS QAQITISDTGKGISPEFLPYVFESFRQADSTTTRQFGGLGLGLAIARNIVEMHGGTVM ANSLGEGQGATFTVQLPLIKSSKNNADKEDDGKNFSSPLNSSILTGLRVMVVDDEPDS LELIAFVLKQEGAEVTAVASAHEALRVLEKWQPVLLISDIGMPEMDGYMLIRQIRKLP QEQGGQISAIALTAYAGEADSQKALATGFGAHVSKPVDSTQLIAVVTKLLKERV" gene complement(11311..11757) /locus_tag="DP116_13580" CDS complement(11311..11757) /locus_tag="DP116_13580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312302.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_13580" /translation="MLNNSLLVVEDSDEDFEAFKRIVRKSSVYPSIYRCVDGEDALNY LLQVGDYANQTSASRPAMILLDLNLPGMDGRDVLMQIKQDATLKMIPVIIFTTSSNPK DIDVCYEHGVNGYIVKPIDLSKLKETIEVFIQYWFEITTLPSFMGR" gene complement(11747..14071) /locus_tag="DP116_13585" CDS complement(11747..14071) /locus_tag="DP116_13585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015189454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cyanobacterial phytochrome A" /protein_id="PRJNA477356:DP116_13585" /translation="MSENETITPDTLDLTNCDKEPIHLPGSIQPHGLLFVLTEPELTI IQISNNTAQFLGREPEDLLNTRLHDLLDFQQLNAIDKCLSKDFESVNPLKIVLHQQGK NLIFDGIVHRFDGILILELEPSQSQENVSFFGFYHLVKGAISKIQAASTTEEMCRVAV QQVQQLTEFDRVMVYQLDAQGAGHVIAETAQDELTPYLGLRYPASDIPTQANQLYRLN KLRLIPDTDYQPAALVPSLHPVTQRPTDLSLAVLRSVSPLHIEYLHNMGVSASMSISL VKNKQLWGLIACHHTSPKYLSYEVRTACEFIGQVMSLDLVTKQANDDYDYKMNLKSIL ARFIELIPQHENLVETLAQSEADLLGIVSATGVAICWNDDWTCIGQTPEPDELQKLLA WVGAKIDGDNLLYTDALPQIFPEAEKFKQVASGLIALAISKVKHNYILWFRPEVIQTV NWGGNPNKPVEVLQDHSVRLSPRKSFALWQETVQGRSLPWKASEIEAVLELRGAIVSI VLRKADELAKLNLELERSNTELDAFAYIASHDLKEPLRGIHNYASFLIEDYADVLEED GLEKLQTLLRLTNRMEDLIESLLQFSRVGRVELNLQLTDTNEVVQHATEVVKLSMKSK DVEFRIPRPLPIVRCDRIQLDQVFTNLISNGIKYNSSPQKWVEIGYLDEADTPNNCLL ENSHNGETPIVFYVRDNGIGIKKQYLETIFRIFKRLHGKNQYGGGTGAGLTIAKKIVE RHNGRIWVESTYGNGSTFYFALPRANQLYETSVE" gene 14406..15629 /locus_tag="DP116_13590" CDS 14406..15629 /locus_tag="DP116_13590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011208560.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase S8" /protein_id="PRJNA477356:DP116_13590" /translation="MESGQYIILRDTSRVDLSEPFSGRSFTLEASRTAEAPTPRIDIE SLDTRDLFDLRRDPQVVSIAPPMPVALIEPTASEEAQQTHDQLGTTTWGVSVTGAVDS PFTGQGVTVAVLDTGIDANHEAFKGVQLIEQDFTGEGNGDKNGHGTHCAGTIFGQVVN GYRFSMAPGVQRALIGKVLDGQGRGGTEQIYRAILWALDQGAHIISMSLGMNFPAQVD DLIKEKNFPAPLATSKALENYRANVRLFDKLADLVNARNALFQQGTVIVAAAGNESKR NIDPNYILSVAPPAAADGIISVGALETPGEPNNALKVAYFSNTGPNVAAPGVKIYSAK PGGGYRYLNGTSMATPHVAGIAALWAEKLLKQNTTVNRLELSSRLIGQAKKDRLAAGF NPLDVGAGLVQAPVN" gene 15819..16898 /locus_tag="DP116_13595" CDS 15819..16898 /locus_tag="DP116_13595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408095.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_13595" /translation="MDTPILYIAITNHGFGHATRAAAVAATIQKLCPEVLLIIVSTAP RWLLECYIEGDFIHRPRAMDLGVVQADSLTMDQAATLEKLLEIKKNQNSIIASEVNFI RQNRVNLILADVPFLACEIAKVAKIPCWMMSNFGFDFIYRDWGSGFTEIADWISDCYS KCDRLFRLPFHEPMQAFNNITDVGLTGGSPRYTPEEVRSTWGITAPKEKTILLTFGGL GLQQIPYENLKQFPDWQFITFDQSAPDLPNLVSATDQKYRPVDFMPICGRVVSKPGFS TFAEATRLDVPIVSVTRDDFAEAAFLLEGISNYNQHQILKPSEFFQGTWDFLNQPPNQ PKESQPIAKDGNETVAHAVLNYLQR" gene complement(17131..19143) /locus_tag="DP116_13600" CDS complement(17131..19143) /locus_tag="DP116_13600" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13600" /translation="MQVLQSITKSFFVIFYVLANWQFNFNQLLFITGREVMAAPLELL GQNNTLQNKIQAKFNVANQALADGLEQNQELLGNTQQTFSQANAAIRQSINPQQATSA QAQDFAQNNGGVGSLLSGIADGQMTYDDYAAQVNQAVEAGQITPQERRTQLALARKRL QVPSTLQDSTPHEAQKLVKHNLQLTDNILKIQQAKLVNEAKTQTLSNQSTGTFAWKFI KNGVGVASEYSGDIKEQQQIQLELNSLRQNIADINVNGVEWLVQNSKDESVQQALKIE GFNIPSQGLTKEQAQQYTGALQQLVLSYEEQATLLQMRINTIAEKYTPYRLNRPQAQL KKTFPLKLKAAEQETQSLTQQANLSQVGSTKHSSGGHLHWKIPKRNTFDGNDSVDPIE SFAGLNTTSSVARKAIGKTNTWQYNFNIIKIQEKNGENVPKMGNSSYMLNDNVGNQNQ SGYPDSTQQAFNFQVPGKKGYVPNVAYPEENIDNTHGYAYLNKNVAFRRAINTTANKL GIPGEWIADIIAVETSFNPREFHKTNSCGYGGLIGFGADDAQEFGTTLRHILSLPPER QMVYVEKYLSRPHIKRHLNKGVEYVAAAVFGGRGLLNKLIKNPSKALRTSDGAINLQT YMQRLGRDVGRRYDFTTDRQTKPTIFTLSRDLQNRVLFSQQMQQAN" gene 19382..19798 /locus_tag="DP116_13605" CDS 19382..19798 /locus_tag="DP116_13605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858815.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YjbQ family protein" /protein_id="PRJNA477356:DP116_13605" /translation="MTDYQKLLRISTTGKSLHNITRKVEDTVAESGVQTGLCVLFLRH TSASLLIQENADPDVLKDLANFMAKLVPESAQYIHDTEGPDDMPAHIRTVLTHTSEII PINKGHLVLGTWQGIYIWEHRQRNHARELVIHISGS" gene 19976..20776 /locus_tag="DP116_13610" CDS 19976..20776 /locus_tag="DP116_13610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457739.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Nif3-like dinuclear metal center hexameric protein" /protein_id="PRJNA477356:DP116_13610" /translation="MKIADFITWFEKWANPAWQESWDNCGWQIEPGVLQENARVLVCL TPTLAVMQEAIAVQNAGIPVNLIFAHHPLIFSPTKSLRRGDAIGEMARLAFTHNIGIY SAHTNFDQVEDGTADVLAQILELKEVSPVVPTQAGLGYGRVGNLDQSIALGELLTIIQ TRLTPSHLISSPVVDLKQAISRVAVLGGSGASFLSAVVKTGAQAYLTSDCKFHQFQES RDRGVILIDAGHYATERPACDRLVQKFRDLNLDWVQLSQTDEDFRLLY" BASE COUNT 5629 a 4735 c 4493 g 6226 t ORIGIN 1 caacttgtaa ggactacact tcattatgta cccattccgg aagttatgtc cagatcttta 61 cagattttaa gttacagttc ccaacccagc gtgtctcagt accaatagtt tcttgattca 121 aaacaatgat gtccggttcg tacccagatt tgtcgcggta cggttttaca atcgattctc 181 tgggtattgt ccaaatgtcg gttttaccca ttttatcaat ggttgtaagt aaccgtccac 241 cgagagaacc cgttaaattt gaatgcttcc ccctaggctt aggcatctcg acaattactc 301 catcatataa ttcgtaccga acttccgagt tttctggata ccattcaata aattcatcaa 361 aggtgtataa ttttggttcg gtttgtgctt gtgtttgggc ttgggtcata gttacaacct 421 ccagatatat ttgactttaa catcttgcga ttaaccccga ttccttaacc aaatcatttt 481 tattgctgtt actacctcaa aaatttggat ttttgggaga gtaaattacc cagatccatt 541 aaaatccgtc gatggggtcg ggtaattatt cctcatattt ttttattgtt taatcattct 601 tcagtacgca gcaagctggc atatttcccc cctgagtgcg tactccgctg caccactttc 661 ccctgctcca gcacgataat ttcatcacaa tgaaagttta gatttaactt gagaccaaac 721 agtagagtcg atttcattcg cgatcttcag ctgcactttc ctctgttagt tgacttcgct 781 gactgattga cttgagttca agatggttta gtagttgtgt cagttgctga taatcaattg 841 cttttttatc aatcttgatg atgatatttt 
gttcgtcaac actaaggttg taataaggat 901 tgttagacat aatttctaag tcaatacagt agcaaggtaa tcgtaatata tttcatatat 961 agcaatctat gcagacgaat ctatgtttat ctatcgatat tcagacactt cgttgagatc 1021 ctgccaaaaa gctggcttgg gtgaatcatt cttcagtacg cagcaagctg gcatatttcc 1081 cccctgagtc ccataactct tcatgcgtac cccgctgcac cactttacct tgctccaata 1141 caataatttc atcacaatct cgaatcgtgc tgagtctgtg cgctatcaca atacaagagc 1201 aaccgcgccg tcgcaaattt ctatcaatca tcaattctgt ttgagcatcc agcgcagaag 1261 tcgcttcatc catgaccaat actgcgggat tccgaaccat tgcgcgagca atttctaaac 1321 gctggcgctg tccaccactg aggttcattc ccccttcaat caacttggta tcatatcccc 1381 caggcagaga taaaattaaa tcgtgaatga ctgcatcaga gcaagcccgc actaaatcag 1441 tttgtgatac tgtggaatcc cataaggtaa gattttcccg aactgtccca gcaaagagaa 1501 aaatatcctg ttctaccata gccagagaat tcgccaaaac agagcgagga atctgagttc 1561 tgggtacacc atcaaagtaa atttccccag accaaggttc atataaaccg caaactagct 1621 tagcaattgt agacttacca gaaccactcc ctcctacgat cgccactcgt tgtcccggct 1681 ccaaaattaa gctaaagttc tcaatcagag gattgtccac acgactgtag ccaaaagtta 1741 tattccgcaa ctccacataa ccctgcaact gaaaggatat tggtaattgg taattatccc 1801 tctctccctc tctccctctc tccctctctc cctctactcc cctctgggct tctgcatcga 1861 ccgagttttg cagtacatca tcaaggcggt ttaagtcagc ctctaaatct tgcaacatac 1921 tgccaaaatt aatcaaactg ttgattggtt ctaggaagct ggctgttaaa ctttggtaag 1981 caacgagcat cccaatactc aggttaccgt tcatcacccg atagccgccg agaaataaaa 2041 tagaggtggt tgtcagtgct gtcaaaaatg tgggtaatac tgtcagaatt tgagtttgta 2101 gagctaattc ctgctgagcg ttgagtgctt tggcatagtt cccggcgaat ttagcaaacg 2161 agtcagactc aagcccggaa gctttgattg tttctatagt ttgaataccg gcgatcgcca 2221 ccccattgac cttaccatat tcctgcgcca gcttcatatt ggcatccacc ctcgttcgcg 2281 ataaaaactg tagggcgaaa aagtttatac cagcaaaact aagagcaaca ctcgtcagta 2341 ccgaatcgta ctggagcata atcagaaaat aaaacaccat catcaccgcg tcaataaccg 2401 tggtggctag cgtacctgaa agtagattag caactttgtc attcagttgc acacgagagc 2461 taatttcacc cgcaaagcgt tgagcataaa acccaactgg taagcgtaaa atatgccaga 2521 gaaactgccc cgacattgtc accgataact taagcatcag 
ccgccgcaag taggtcagcc 2581 gcaaccaagc aagaaagccc cttcctacag ctgtgagaac cattcctaac acgagcggtc 2641 gtatccactc agaacgctca gtgacgagaa tttcatcaac aaacacctga gtaaaagctg 2701 gcacagctaa cctgggaaaa gtcagtaata gtcctgctat gaggcagaat aagatagtgc 2761 tccgcgagtg ttgcaagcgg gagactaagg cagaaataac gttatttttt ttacctcctt 2821 tttgaaagtc tgatcctggt tgcatagcca ggacgactcc agtgtaggcg cggttaaact 2881 cttctaggga aactgttcgg cgtcctgtgg cgggatcgtt gaggtagaca cggtttttaa 2941 agaacccttc caccacaaga aagtggttga aattccaaaa gacaatgtag gggggaggaa 3001 gagttttgag gttttccaga gacggtttca aacctttggc atttaagcca tagttcctcg 3061 ctgctttgat gacattaaag gcgttgctac catctcgcga gacgccacac tcgcgccgca 3121 gttcagccaa ggggacaatg cgcgagtaat agcctagaac gattcctaaa gcagcagcgc 3181 cgcattcggt tgcttcgatt tgcagcagcg tcggggtacg gacgcgactg gtttgaggtc 3241 gggatagaat acttttaatg aaggcaaaac gcatgagata attttgcctt agttagccag 3301 tttgctgcaa gttcatcaaa cagtgaccag tgaacagtga ccagtgaaca gtgaacagtg 3361 accaaaacag tgataactgg taactgataa ctgttaactg ttaaagtggc ataatatcgt 3421 cgagtttgag ttgcaagttc gccaagaggt gagaggtaat agtatcttgt aagcgatatt 3481 gttgctgctg gtacgcgccg ttaatcagtt gacagacggt aaatgtgggt tgttttggat 3541 taccgatgaa ttgcttgcca cccaaaccgc gaaagtctac aatccaatac tctgggattt 3601 ctaaaaaagc atattcttct acttttcttg cataatcatc ttgccaatta gtactcacaa 3661 cttcggcaac tagcttaatt gtcctaccat tacagataat aggttctttt tgccaaagcg 3721 gttccgcttt aagttctgct tgatctagaa caatcacgtc tggacgtagt gctgtagctt 3781 cagaggctag tggtttgatt agacaatttt ttggaatcaa ccagtttaaa ttagaacgaa 3841 aaatttctgc atagatttta ccagcaatac ttcctgaaac tgcttcgtga ggaccagtgg 3901 gttccatgtc acgtagttct ccgtcaatga gttcatagcg gggattatcc ccatactcgg 3961 tgataaattg ctcgaaagtt aggagtttcg gaggggtgta ggtcataggt catattttga 4021 gaccaaacag ctttatttta tctaataaag cttttagcta ggtcttcatt tggtaaatga 4081 aaagtgctac ttcacgactt cgcgacttca gggacagata ggtttggttt gacgcgttga 4141 acaaaggcta atggcatcac tcccatccat ttttcccgaa ttaccccgtt ctccagcaat 4201 acgccagttg gaaaggcttt agctaagcga ctcatcacca acgggttcat 
cgccactacc 4261 ggaaagttga tattatgaag ttgaacaaat tcttgaactt cttccggtgt tcgggcaact 4321 cctgccacta catctggtaa atcaggtctt ttgtgaacta tttttagcac tttgacccag 4381 cttttgcaca tttgacaacc cggcattaaa aacacaacga gtttgtcacc tgttgttaag 4441 ttatcggcgt attcttctag ccatttgggt tgccaaaggc gattaacctt aagcggggat 4501 aagtctaaga gaggtttttg tttcttccac aaatacaaat acgaaagacc agttaacaga 4561 caactcattg ccaaagaaat gagtaatgct gtcacttgct gagaaactgt taaaatatcg 4621 gcgacgggat aaaaccaagc taacccaatc aaagcaatgt agaggatgtt gagcagtaaa 4681 ctttggttgg gactaatatt aatcagacca ttgtagcaac cacaatcatt tgtgcgtttg 4741 gtcgaggtac tccagtaagt taaactcgat aagaccaata acagaacaat tgtaccggga 4801 aaaagccact ggggaaacag ccgtaaaatc aaggcaattc caaggacaca ctctagaatt 4861 gtgaaggcga tcgccgcagc tagattcacc ttttgggaaa acaactgcaa tttagtgatg 4921 tggataataa acgtctcttg tgcgagtgct ttgacaatac cagtcaccag aaatacactg 4981 ccaactacta aggaactgat aatagaaatt gaatacatac tgaagagcga tcgccactta 5041 attgtcatag ctaattaaag cacaaatggt tacttccgcg cttcaatgaa aatgccagct 5101 acctcaattg ctcaaatctt ctttaagcat taacacacgt attcccgaaa gctggattaa 5161 gcaaaccaat tacactgaca gtattaccac aaacgttaac aggtacatga actggtactt 5221 gaagaacatt acccgaaagg acaccaggtg aaccaacggc gacattagca tcaaccccgc 5281 ctattactgt caataaatca tcatcatcaa gttcctcaaa ccattcccga tctttttctt 5341 gttgagtcag tataggagct tgtttggttg cttttttttc ctttgcctga aacattttaa 5401 acatttgagt tttcctcttg agttaacttt tctgagtata cctctctttt agagtatttt 5461 ttttaccagt aagtctacca ttctgactcc tgactactga ctcctgagtt cttttgttaa 5521 tacacaccag tccaagagcg taatattgga ataatgtaag aaataggtgc cacttctcca 5581 actttgaccc ggacttgcgt agtcgtacct gatgaaatat ttagcgctgg tccattagaa 5641 gaagaccatt tgtaaccgct gacagtgttt gggtcttttt gtagttggac gaaaacttgc 5701 acgcgagcgg cgttactatt tgcaaggctt ttagctaggt cttcattgcc aacttgcgct 5761 acaatatcgc cagttgtcac agggaaggga gaaactcttg ttaccatccc aacaattcca 5821 ccgaagcgat cgcgcttgac aacgctaggt gtaacctgta ccgtcatacc tggttttatt 5881 tgcttgccat ccttatcagc aaagtaaaca agactttcta gtttggtatt acgatcttca 
5941 gcttcgatag tcccaatacg agtcccagaa ttgactactt gaccaggcaa gacgccaact 6001 tctaaaatcc gaccatcata tttactaata acttggcttg atttcgttaa ttgcaatttt 6061 aactgggcaa ttttgcgttt aacttcctga atttgattta acttatttat tgacttttct 6121 aaatcttgtt gaaccagttt agttttttgg gtttcggtat ctttaatttt ggttttgatt 6181 tcatcaagtt tattaagatt ttgcagataa tcccgctctg catttgttct ctgaacttct 6241 agttcttgaa tattgttttt gatatcgtca attttactca agttttgtag atattcacgt 6301 tcgctactcg ttttttgaac atcaaggtct ttaagttgtg ttgaaatctc tgatatttgc 6361 tgctgagcgt ttaacaattc ctgttgagat tgcagtagtg tatcttggct aataatcttc 6421 tgttgaaaaa gactacgacg cgtttcaact cgttgttgca aggtttttac caatgcactg 6481 atttgttgct tttgctcttc aagacttttg cgtttttctg ctagtgcagt gagagtttgt 6541 ttgtacagca ctggcgcgac ttgttctctt gacacgcttt gctcaaggct ttgtcgtttt 6601 tgctttaacg cttgaagggt ttgtgagtgc agtatcggtg ctactgattc tcgacgcagg 6661 gtttctgaaa agtctttttg ctgctgttga agcgttctta gcgccagcac tgtttgctgc 6721 ttttgcagcc tatcggtgtt tagattctgc tctaaaagtt ggttcaactt tgctatttct 6781 tgttgtagtt gttgcttaat attggattgg tcaattgtgg cgagtacttg accttgccta 6841 acgacatctc ctggcttgat attcagagtt aataattgac cagcactaag tgcttgaaat 6901 tgcaagacat tacgaggtcg aatcaacacg ccggaacctg taacagtgag tggaattcgc 6961 ccaactacac tccaaatacc agctataacg accaaaaagc ccatagaaag caaaggtaac 7021 caagcttgct ggcttgtgac ctgcatcagt tggtcaagtt tttctggtga tgacagctgt 7081 tctagcgctt cttctcggaa gagtctgctt tttctatctt gcatcttatc tatttttatc 7141 aggacttacg caactgtcag gtataataca tatagcaggg aacagggaat aggcttgatg 7201 gtctggtggt gtccgtattt tctcattagt tcatgtgcta atctacctag caactactat 7261 agctttttat aactgctact tagctcacga aatcttaaag gcatctagaa cggggcatca 7321 caaaattact caaaattgca cgaacgtagt cgtcagcgcc cacgttttca catagagcta 7381 tatgtttcca ttttaactct agaacgagtc gcacttgtga ccaactcagt tatcagttat 7441 cagttatcaa ttgttcactg ttcactattc actgttcact gttcactgtt aataaaccca 7501 actcctttgt aagatttgtg taatcctagg agacatcaaa gccacgacga cactcgttct 7561 tttaagagtt tggtgacaac tgcaattaat tgagttggct ccactggttt agatacatgt 7621 
gcaccgaacc catttgccag agccttttga gaatcagctt cgctcttcta cgcgcttgag 7681 cgcgattgcc tatgtcctgc ggacacgctc cgctaacggc agagctagcg ccgaagggcg 7741 cgctgagtcc caaggggaca cgctacgcga acgcaaacgc ttaacgcatt atgatatact 7801 agcgcgttaa gcgtcctttg agtgcgcccc cttagaggct agctctgccg ttagcgatag 7861 gcgcagccgt gccgaacgga gacgctacgc gttcgccctt tgggcgtgcg ccttgcgcat 7921 acgacgcagc cgtggccgta cccactagga cataggcgat cgcgctcacc gtgctttttc 7981 aaatcgatag ctaatttata taaaataggt aggataaatt tggctcaggt tttttatgaa 8041 ttaccgtcgt atcttcaaac ctggtgaatc ttatactttc agccgatatt ttgaattgtc 8101 tttcactatt gatgatatat tagccgaatt aaattgcact cttgaaagga aaaatttaat 8161 attacctgaa tcttcaaaaa ttttagattt gcaattttta cagaggcagc ttcagcgtaa 8221 tttatctcac atcgatttag tcaatgaaac tgccagacgt gaagctttaa ttggtccgat 8281 tttgtttgag gtatgcgatt taactaacca gcgactcaac atagagtatt ctattgctgt 8341 caacgactat cttaaaggaa ctctcgatta ctatattgcg gcacctcaaa atttattagt 8401 catcgaagca aaacaatctg atttagttcg ggggtttact caactcgcta ttgaattaat 8461 agcacttgat gaatggacga aatccacatc tgagcttcta tatggggcag ttacaactgg 8521 cgaagactgg cgttttggag tctacaatcg cgccttacgt aaggtgattc aagaccaaaa 8581 acgctatcaa gttccagaag atttattgat acttgttaaa attcttatag gtattatttc 8641 tggttaacaa aatgctctac gatcaatgat gaaaattaag cgaacgtgcc ttctcagcgc 8701 ttgcaatgag cacattttgc attcgtttaa ggcaaagcga tcgcctctcc aatctccaca 8761 atagcctttg tcaatagcat ttgagaccca cttaccctgt ctaaactcgt tcttttaaga 8821 gtttggtgac aactgcaatt aattgagttg agtccactgg tttagataca tgtgcaccga 8881 acccagttgc cagagccttt tgggaatcag cttctccagc atacgctgta agcgcgattg 8941 cactaatctg tcctccttgc tcttgaggta acttccggat ttgacgtatt agcatatacc 9001 cgtccatttc tggcatccca atatcgctga tcaacagaac tggctgccat ttttctagaa 9061 ctcgtaacgc ctcgtgtgct gatgctactg ctgttacttc tgctccctcc tgcttcagta 9121 caaaggcaat caattccaat gaatcaggtt cgtcgtctac aaccatgacc cgcaagccag 9181 ttaggattga ggagttaaga ggcgaggaga aattcttgcc gtcgtcttcc ttgtcggcgt 9241 tgtttttgct acttttaatc agtggtagtt gcactgtaaa agtcgctccc tgtccttccc 9301 ctagactatt 
cgccattaca gtaccgccat gcatttctac gatattacgg gcgatcgcca 9361 gccccaatcc taagccaccg aattggcgcg tggtggtgct gtctgcttga cgaaaggatt 9421 caaacacgta aggcagaaac tctggactga tgcctttacc tgtgtcactg attgtgattt 9481 gagcttgaga ggcaacttcc tgcaagcgga tttctactcg accgccattg ggagtgaact 9541 tgacagcatt cgagagaaaa ttccaaaaaa cttgttgcag ccgagttgaa tcgccagaaa 9601 cttgccacac actagcttca aatattggtt gcagctcaat agattttgct tgtgctgcca 9661 agcggacagt ttctaaggct gcgagaataa cagttgttag gtctattgaa taaacattca 9721 gtttcagttt gccttgtaaa atccgcgata catccagcaa atcttcaatc agttgggctt 9781 gtgcttgggc attgcgttcg ataatttcta aagcttgtgt tgccatgtct gggtcaagct 9841 tgcgggttct caagatttta gaccagccta agattggatt cagcggcgat cgcaactcat 9901 gggagagaac cgccaaaaac tcatctttaa ctcgatttgc tcgttcggct tcgcttcggg 9961 ctgcttgttc gagttgtagg atgcgatcgc gttcaaatgc taagcggacg cgatcgctca 10021 cgtccataat cagagatagc acagacacca gattacccga tttgtctgtt aacgttgagt 10081 tgtaccactc acaataaacc accgagccat ctttggcgta gttgcgatta cagcaaacag 10141 tttgcgcttg atcgccactg aggatacgcc ggacaacttc actcacagcg gctgcatcct 10201 ctggataaat gaattgccat tcattcattt gcttgcccaa tatttcctgc gctgtccagc 10261 cgaaaatatt ctctgccgac tgcgaccaac ggataattcg caagtcgcta tcccattcaa 10321 tcactgctag cggagaattt tcgacatggt agtttagctt ttgcagagca ttttgcaagg 10381 cttcccctgc ctgtttgcga acagtaatgt cttgatcggt gccaacataa ccgatgattt 10441 cgcctaactc agaacggata gctgctgcat ttgcctgtac ccaatgtgcc ccgccttgag 10501 gcgttaagaa gcggaactcc attgaaaact cacttccctc ctgcgtgcaa cggttccact 10561 cagcccaaac ctgctggcga tcatcaggat gaattgccct tgcccagcca tcacccaaac 10621 tttcttgtaa agttaaacca gcgatcgctt gccattgagg gttagtatag agacaacgac 10681 ctgccccatc agtttgaaaa atcccaattg gagcgaaggc actcagcagg cgaaatcgtt 10741 cctcgctctc gcgtagggct gcttggcttt gttgcaattg ctgctgtaat cgcgcattct 10801 taatggcgct tttcactgcc aaccgtaatg tctctgaagt cgtcttaccc ttaatcagat 10861 agtcttgagc gccacgtttc atgacttgca cggcaactga ttcatcacct tgacctgtta 10921 agataatgac aggtgggtga ttcttgccaa tttgatgctg caactggctc aaaaactcca 10981 
atccgtctaa atcaggtaac aagaaatcca gcaagatcac atctggttgt tcttggagac 11041 acaaactcaa ccctgcctcc ccagtctctt gttccaatat cctatattga tactcgactt 11101 cttgcagcaa gtagcggcaa taggcaaacc tgtcctctgc actatcgtcg ataattaaga 11161 ttttcactgg ttcttgtgta tccatgcact gctgcaagaa gtaggagttc caatatttta 11221 gatattttag aaaaaattct caaaaaattc tttaatttta gtttgtttgg gacgggtttc 11281 tcgatttttc atctatagtt catggctgat ttatctaccc ataaaactag gaagcgtagt 11341 gatctcgaac caatattgga taaatacctc gatcgtttcc ttgagcttgc taagatctat 11401 cggcttgaca atataaccat ttaccccatg ttcataacac acgtcaatat ctttaggatt 11461 agaagaggtt gtaaaaataa tgacgggaat cattttaaga gttgcgtctt gcttgatctg 11521 catcaacaca tcccgtccgt ccatacctgg taaattcaga tcgagcaaaa tcatcgctgg 11581 gcgcgacgct gaggtttgat tcgcgtaatc accgacttga aggagatagt ttaatgcatc 11641 ttcgccatct acgcaacggt agatggaagg ataaactgag gatttgcgga ctatccgttt 11701 aaacgcttca aaatcttcgt cactgtcttc gacgactaat aggctattat tcaacactgg 11761 tttcataaag ttgatttgct cgcggtaagg caaagtagaa agtactccca ttcccatagg 11821 ttgactccac ccatattcta ccgttgtgac gttcaactat cttcttcgca atcgttaacc 11881 cagcccccgt accgcctcca tactgatttt tgccgtgcag tcgtttgaag atacggaaga 11941 ttgtttcaag atactgtttt ttaattccga tgccattgtc gcgcacgtaa aacacgattg 12001 gcgtctctcc attgtgagag ttttctaaca agcagttatt gggcgtgtca gcttcatcta 12061 aatagccaat ttcaacccat ttttggggac tgctgttgta cttgatccca ttgcttatca 12121 agttagtgaa aacctgatca agctgaatgc gatcgcacct cacaatcggt aaagggcgag 12181 gaatccgaaa ctccacgtcc tttgatttca tactcagttt caccacttca gttgcgtgtt 12241 gcaccacttc gtttgtgtct gtcagttgca aattcagttc cacccgtccc acgcgggaga 12301 attgcagcaa agactcgatc aagtcttcca tgcgattagt taagcgcaac agcgtttgta 12361 acttctccag tccatcttct tcgagaacat cagcgtagtc ttcgatcaaa aaactggcgt 12421 agttgtgaat accccgtagg ggttccttca agtcgtgaga ggcgatataa gcaaaagcat 12481 ctagttccgt attgcttcgt tctaactcca aattgagttt cgccaattca tctgccttgc 12541 gcaagacaat gctgacaatt gccccccgca gttctaaaac tgcctcaatt tcagaagcct 12601 tccaaggtaa agaccgtcct tgcaccgttt cttgccacaa agcaaaggat 
ttacgcggcg 12661 atagtcgcac gctatgatct tgcagcactt ccacaggttt attgggattc cctccccaat 12721 tcacagtttg aatcacttcg ggacgaaacc acaaaatgta gttgtgcttg acttttgaaa 12781 tggctaatgc tattaatcca ctagcaacct gtttgaattt ttccgcttct ggaaaaattt 12841 ggggtaaagc gtctgtatat aaaagattgt cgccgtcaat cttcgctccc acccaagcga 12901 gcaatttttg caactcgtca ggttcgggag tctgaccgat gcacgtccag tcgtcattcc 12961 agcaaatcgc cactccagtt gcgctgacta tacccagcaa atcagcttcg gactgggcta 13021 acgtctccac caaattttca tgttggggaa tcaattctat gaatctagcc agaatcgact 13081 ttaaattcat cttgtagtca tagtcgtcat tggcttgttt tgtcactaaa tccaaggaca 13141 taacttgacc aataaattca cacgctgtcc gcacttcgta actcagatat ttgggcgatg 13201 tgtggtgaca agcgatcaat ccccaaagtt gtttgttttt gaccagagag attgacatcg 13261 atgcacttac acccatattg tgcaagtact ctatgtggag cggcgacacg ctgcgtaaaa 13321 ccgctaggct taaatctgta gggcgttggg taacaggatg cagtgatggg actaaagcag 13381 caggttgata atcggtatca ggaatcaagc gcaacttatt caatcggtaa agttggtttg 13441 cctgtgtggg gatatctgag gcaggatagc gtaaaccgag atagggtgtc aactcatctt 13501 gtgcagtttc ggctatgacg tgacctgcac cctgtgcgtc taattgatag accatgacac 13561 gatcaaattc tgttagctgc tgcacctgtt gcactgcaac tcgacacatt tcctcagttg 13621 ttgatgctgc ttgaatttta ctaatcgcgc cttttaccaa atgataaaac ccgaagaaac 13681 tcacattttc ttggctttga ctcggttcta gttctaaaat aagtatgcca tcaaagcgat 13741 gtacaattcc atcgaaaatc aggtttttgc cctgttgatg cagaacaatt ttcaacggat 13801 tgacgctttc aaaatctttt gacaaacact tgtctattgc gttgagttgc tggaagtcga 13861 gcaagtcgtg caatcgcgta tttaacaaat cctctggttc gcgccccaga aattgagcag 13921 tattgttgct aatctgaatg atcgttagtt ccggttcagt gagaacgaac agaagaccat 13981 gaggctggat ggaacccggt aggtgaattg gttccttgtc gcaattggtt aagtcaagcg 14041 tatctggtgt gatcgtttcg ttctcgctca ttctaggcag caaatttttg ctctgctttc 14101 caattgtagt gaaattgtag tgaaaattcc aacagagtca gcagcgatta tgtattgctg 14161 gacaataccg caagtgcatg agtttcttgt gctaacacct attaaaaaac acaaggaaga 14221 aggaaaactt tccaagctta actgaccttt agcaaacaag tagggacatg catataatac 14281 aagtcatgtc aatgcataag tcctaaactt 
tcttgcgctt atcgagcaat tgattttttg 14341 ggtcattaat gttttggtag ttcttgggtt ttgcatcgga aactagacaa atcggagtca 14401 tcactatgga atcaggtcaa tatatcattc tccgtgacac cagtcgtgta gatttaagtg 14461 agcctttctc aggacgctct ttcactcttg aagcgagtcg cacagcggag gcacccactc 14521 cccgcattga cattgaatca ctcgatacgc gagatctctt cgatctgcgg cgcgatcctc 14581 aagttgtcag tattgctcca cctatgccag ttgctctgat tgagccaact gcgagtgagg 14641 aagcacaaca gacgcatgat cagcttggaa caacgacgtg gggagtgagt gtaaccggtg 14701 ctgtcgatag tccttttaca ggtcagggcg taaccgtagc ggttctggat actggcatcg 14761 acgctaacca cgaagccttt aaaggagttc aactgataga gcaagatttc acaggtgaag 14821 gcaacgggga taagaacgga catggtaccc attgtgctgg taccattttt ggtcaggtcg 14881 ttaacggata ccgcttcagc atggcaccag gcgtacaacg tgcactgatt ggtaaagtgc 14941 tggatgggca agggcgtgga ggaactgagc aaatttatcg agctattctg tgggctttag 15001 atcagggcgc acatatcatc tccatgtcac taggtatgaa ttttccagct caggttgatg 15061 atcttatcaa ggagaaaaat tttccagcac ctcttgctac ctctaaagcg ctagagaatt 15121 accgggcaaa tgttcgcttg tttgataagc tagctgatct agtcaatgct cggaatgcac 15181 tatttcaaca aggtactgtg attgtcgcag ccgccggaaa cgaaagcaag cgtaacatcg 15241 atccgaacta catattatcc gttgccccac cggcggcggc tgatggaatt atttcggtcg 15301 gagcgcttga aacgccaggt gaacctaata acgctttgaa ggttgcctat ttttccaaca 15361 ctggtccgaa cgttgcggct ccgggtgtta agatttactc agctaaacca ggaggcggct 15421 atcggtacct taacggtaca agcatggcaa ctcctcacgt tgcaggtatt gcagcgctct 15481 gggctgaaaa actgttgaaa cagaatacca ctgtcaatcg gctcgaactc agttctcgat 15541 taattggtca agccaagaaa gacaggttag ccgctggctt caatcctctg gatgtgggtg 15601 ctggtcttgt acaggcacca gtcaattgaa taggttgttt tcacttataa tttcgaatcc 15661 gggaagaaac aatcaaaaca ctgaccccct ccctatttcc gctacacgga aaatcggtta 15721 aggtcttgct ttgattgtcc gggcattcat ctcaacgcct aagcggaaca ggacgcgtga 15781 agcttctgcc cgaactagct aaatgaccaa tgacttcaat ggatacacca attttatata 15841 tagcaattac caatcatggc tttggtcatg ctacccgcgc agcagctgtc gccgcgacaa 15901 ttcaaaaatt atgtccagaa gtcctgctga tcatagtcag cactgcacca cgatggttgc 15961 tagagtgcta 
catagaaggc gattttattc atcgtccccg tgcgatggat ttaggtgtgg 16021 tgcaagcaga tagcttgaca atggatcagg ctgcaactct agaaaagtta cttgagatta 16081 aaaaaaatca aaattctatt attgcttctg aagtcaattt tatccgccaa aatcgcgtta 16141 accttatctt ggcagatgtc cccttcctcg cctgtgaaat tgccaaggta gctaaaatac 16201 cctgttggat gatgagtaac tttggctttg actttatcta ccgagattgg ggtagtggct 16261 ttacagagat tgctgattgg ataagtgatt gttactcgaa gtgcgatcgc ctattccgtc 16321 ttcctttcca cgaacccatg caagccttca acaatatcac agacgtcggc ttaacaggcg 16381 gttctcctcg ttatactccc gaagaagtac gatccacatg gggaatcaca gccccaaaag 16441 aaaaaactat cttactcacc tttggtggat tgggtttgca gcaaattccc tacgagaacc 16501 tgaagcaatt tccagattgg caatttatca catttgatca gtctgcccca gatttgccaa 16561 acttagtgtc agcgactgac caaaaatacc gtccagtgga ttttatgccc atttgtggac 16621 gagttgtatc taaacccggt ttcagtacct ttgctgaagc cacacgtttg gacgtgccga 16681 ttgtttcagt cactcgtgat gactttgcgg aagctgcttt tttattagaa ggtattagca 16741 attacaatca acatcaaatc ctcaaaccat ctgagttttt ccaaggaact tgggattttc 16801 tcaaccaacc acccaatcaa ccgaaggaat ctcaaccgat tgctaaggat ggaaatgaaa 16861 ctgtagccca tgctgtgctt aactatttac agcgttgaaa gcagaaagcc cctatgattc 16921 aaaaatccca cggcttttta tgttcgcata gcgtgccgta agcatagctg aaggtagcgt 16981 tcatgcgtag cttgcccgta gggcataagt ttcgtcgcag ccgtgggagt gtcaaaaagt 17041 ctttaagtat agcagttctc gttcagatga actacatact catctgcgtc gagagtgcac 17101 agcagttcca agacttgtgg tcataaacta ctagttagct tgctgcattt gttgagagaa 17161 cagcaccctg ttttgtaaat ctcgcgagag ggtgaaaata gtcggctttg tctgtctgtc 17221 tgttgtaaag tcataacgtc gccctacgtc acgccccaac ctttgcatat acgtttgtaa 17281 gttaatagcg ccgtcgcttg tgcgtaaggc tttcgatggg tttttaatga gtttgttgag 17341 tagtcctcta ccaccgaaaa cagccgccgc tacgtattca acacctttat ttaggtgacg 17401 tttgatgtgt ggtctgctta aatacttttc gacgtacacc atctgtcttt ctggtggtaa 17461 cgaaagtatg tgcctaagcg tcgtaccaaa ctcttgggcg tcatctgcgc cgaaacctat 17521 caaaccacca tacccacacg agttagtttt gtggaactct ctcgggttga aactagtttc 17581 aacggcgata atgtccgcta tccactcgcc aggaataccg agtttgttag cggtagtatt 
17641 tatcgctcgt cggaaggcaa cgtttttatt aaggtacgcg tagccgtgtg tattgtctat 17701 gttctcttcg gggtaagcga cgtttggaac ataccctttt ttcccaggca cttggaaatt 17761 aaacgcctgt tgtgttgaat ctggataacc cgattggttc tgatttccaa cattgtcgtt 17821 aagcatgtat gaactattgc ccatttttgg gacgttttca ccattttttt cttgtatttt 17881 tatgatgttg aagttgtatt gccaagtgtt tgtcttaccg atagcttttc gagcaacact 17941 actagttgtg tttaagccag caaaagactc aataggatct acactgtcgt ttccgtcgaa 18001 ggtgtttctt tttggtattt tccagtgaag atgcccacca gaggaatgtt tagtactgcc 18061 gacttggctt aagtttgctt gctgtgttag gctttgtgtt tcttgttccg cagcttttag 18121 tttcagaggg aatgttttct ttagttgcgc ttgaggtctg tttaagcggt agggagtgta 18181 tttctctgct atcgtgttga ttctcatctg aagaagcgtc gcctgttcct cgtaactcag 18241 aactaactgt tgcaaagccc cggtgtactg ttgcgcttgc tctttggtta gaccttgaga 18301 cggtatgttg aacccctcaa tctttagcgc ttgttgtaca ctttcgtcct ttgagttttg 18361 cacgagccat tcgacgccat tgacgttaat atcggcgata ttttggcgaa ggctgtttaa 18421 ctctagctgt atctgctgtt gctctttgat gtcgccggaa tactcggaag caacaccaac 18481 tccgtttttg atgaacttcc aagcaaaagt acctgtgctt tggttagaga gcgtttgtgt 18541 ttttgcctcg ttaactaact tggcttgttg tatcttaagg atgttgtcag taagctgtaa 18601 gttgtgcttt acaagttttt gcgcttcgtg aggtgtggag tcttgtagtg ttgacgggac 18661 ttgcaaccgt ttgcgcgcga gcgctaattg tgttcgtcgt tcttgcggtg ttatttgtcc 18721 agcttcgaca gcttggttaa cttgtgcagc atagtcatcg tatgtcatct gtccatctgc 18781 tatgccggat aacaaactac caacaccacc attattttgc gcgaagtctt gtgcttgggc 18841 actagtggct tgttgtggat taatagattg acgtatcgcc gcgttcgcct gactaaaggt 18901 ttgttgtgtg ttacctaaaa gttcctggtt ttgttctagt ccatcagcta acgcctggtt 18961 agccacgtta aacttagctt gaatcttatt ctgtagggtg ttgttttgcc ctagtaattc 19021 gagtggagcc gccattactt ctctccctgt aatgaaaagt agttggttga agttaaactg 19081 ccagttggcg agtacgtaga aaataacgaa aaacgatttg gtgatgcttt gaagtacctg 19141 catttggcgg cttacgctta tttttgtgta aataaacttt cgcggaaata ttatcggtaa 19201 ctattgtaaa ggattatacg gctttttcct atactaagta atccaccgtg ttcttcttgc 19261 caccacttct taaagtcaat gccctgagtc cttacggaca 
ccctcttgca acgagcaggg 19321 ggtaagcttg ttttaggtaa aatagggaat gattagggaa tgaataatca ttaatgcaaa 19381 aatgactgac taccaaaagc tactgaggat ttctacgacg ggcaaatctt tgcataacat 19441 tacccgaaaa gttgaagata cagtcgctga atcaggtgtt caaactggac tatgtgttct 19501 atttttgcgc cacacttcag ctagtttgct gattcaagaa aatgctgatc cagatgtctt 19561 gaaagattta gccaatttta tggcaaaact cgtaccagaa tcagcccagt acatccacga 19621 tactgaaggc cctgatgata tgccagcaca tatccgtaca gttttgaccc atacttcaga 19681 aatcattccc atcaataaag gtcatttagt actaggaact tggcaaggaa tttatatttg 19741 ggaacataga caaagaaatc acgcaagaga actggttatt catatctctg gatcgtgaaa 19801 acaaccatag ggaacaggga acagggaaca gggaacaggg aagataaatt gtacctagct 19861 tcgcaaaaat cagatagaaa tcctatagtt tcaaactaat gaccaatgac caatgactaa 19921 tgactaatga ctaatgacca atgaccaatg actaatgact aatgagcaat gactaatgaa 19981 aatagctgat ttcatcacct ggtttgaaaa atgggcaaat ccggcttggc aagaaagctg 20041 ggataattgt ggctggcaaa tagagccagg agttttgcaa gaaaatgcac gggttttggt 20101 ttgtttgaca ccaactttgg ctgtgatgca agaagcgatc gccgtccaaa atgctggtat 20161 cccggtgaat ctgatttttg cccaccatcc tttaattttc agtcccacca agtccttacg 20221 tcgtggcgac gccatcggcg aaatggcacg attagcattt acccataata ttggtattta 20281 cagtgctcat acaaatttcg accaagttga ggatggaact gctgacgttt tggctcaaat 20341 tctagaactc aaggaagtct ctcctgttgt acccactcaa gcaggattag gatatgggcg 20401 tgttggaaat ttagaccagt ccatagcttt aggggagtta ctcacaatca ttcaaactcg 20461 acttacgcct tctcatctga tttcctctcc agtggtagat ttaaagcagg cgatttcccg 20521 agttgctgtt ttgggtggtt caggagccag ttttctttca gcagttgtga aaacaggtgc 20581 tcaagcctac ttgacttctg actgtaagtt ccatcagttt caggaaagcc gcgatcgcgg 20641 tgtgattttg attgatgcgg gacattacgc tacagaacgt ccagcgtgcg atcgcctcgt 20701 gcaaaagttt cgggatttaa atctagattg ggtacagttg agtcaaacag atgaggattt 20761 tagattactt tattaatttt gataaaatag actgatggca gcaaaagtgt caataactag 20821 gggtgtaagg gggtaagggt ataagggtgt aggggatgat gtagttcgtg ggggtaccaa 20881 gccacccaac gggaaagcca ctcccccaaa ctcctgcaca gacgctactt tgaacgtgac 20941 gagaccgcca agacgggagg 
tagactcacc aaaacacgct catacaacca aattgaaatt 21001 ttggtggggg gtgtaatgct ggtgtgtaaa gcgataggaa tataaatgca ttttccttac 21061 acccctacac ccctacaccc cta // LOCUS NODE_1508_length_20979_cov_5.22944920979 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20979) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20979) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20979 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..926) /locus_tag="DP116_13615" CDS complement(<1..926) /locus_tag="DP116_13615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865253.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13615" /translation="MNFYYRLAPAVIGVSIAVLQTQVAVALSPAEVNKTAKEITVLIQ SKKPRYGSGVIIKKEGNTYTVLTAAHVVEGADNYEIITPDNRRYAVNYRTIKPLPGVD LAEVQFSSSQNYTVAKMGNSDASTEGTTAYVAGFPAPTFAINQSIYTFIDGRITANAS KPLRDGYALVYSNNTSDGMSGGAVLNEKAELIGIHGRADKDTKEIKTGFNLGIPINTF LIISAKAGENVGVSAPSTQVATKPKADDFFIQAGDKYKKRDFKGAIADLNQALRINPN YANAYFGRGVVRYDLGDKQAAIADFNQALRINP" gene complement(937..1488) /locus_tag="DP116_13620" CDS complement(937..1488) /locus_tag="DP116_13620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865252.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13620" /translation="MKLRTSTQVITLGVRVFSIATLTTFATTVTLNQRSYAESPTFYC GKSNGVPTTFVRTQDGKDLPVIRWFSKYFSGTGLTPQQRCLEVSRRFQRSYDHDTLRY IKADTYKGQPVMCAVAEKNAACTDTTLLFTLKPGSDPDATARQLFDRRALAAGNTVNQ TGGDKSNTRVNIDVEAYLYFTKN" gene complement(1845..2627) /locus_tag="DP116_13625" CDS complement(1845..2627) /locus_tag="DP116_13625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865686.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_13625" /translation="MNWRKSTQTVCVGGLLIALLILVANISFSDSQSKVSSKDTTLKQ LPTQLSVEEIQKQAQAIAVKVISKNFLGSGIILKKQKSVYTVVTNAHVLRADKPPYRI QTSDGRIWQAKTLSATSLQGNDLAILQFRPTNAVYAVASVGSFPKEGDEVFAAGFPFD EEKQEAKNFTFTTGKVSLVLPKALEGGYQIGYTNDIQKGMSGGPLLNRRGEVVGVNGM HAYPLWDAPSVFLDGEEADEKLHKMIVRLSWAVPIETVGRSP" gene complement(2868..3188) /locus_tag="DP116_13630" CDS complement(2868..3188) /locus_tag="DP116_13630" /EC_number="3.6.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995051.1" /note="catalyzes the hydrolysis of acylphosphate; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acylphosphatase" /protein_id="PRJNA477356:DP116_13630" /translation="MQNTTPHPKQVRAHVFISGRVQGVGYRYSTVDTATQLGLTGWVR NLPDGRVEAVFEGSQTVVGQMIRWCYQGPPAAMVKEVSIEYEEPEDLRGFEVRRFEKS ELGQ" gene 3269..3643 /locus_tag="DP116_13635" CDS 3269..3643 /locus_tag="DP116_13635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878990.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1823 domain-containing protein" /protein_id="PRJNA477356:DP116_13635" /translation="MSNLPPLTTETIWAILNEDIDDATVNQLVWHCLGYRHDSSTGEW DTHQVAPEWRDEYPQPPDFIDSRPPTVKLTRSIPQENKQLLKEKLGFKGYKIGEFGPR QTRRATAANWLLSYMQLNSIDF" gene complement(3861..4232) /locus_tag="DP116_13640" CDS complement(3861..4232) /locus_tag="DP116_13640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009460318.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13640" /translation="MSLKTFQLNLSNLRPWLTLLAITWLLASLGLGWLVNSLLIIIGL LLLAPILIFFGFRFWLQYNLVTDQCPVCRYEFTGLKHSQLQCPNCGEQLMVQQGHFQR LTPEGTIDVTAVEVPSKSLSD" gene complement(4349..5581) /locus_tag="DP116_13645" CDS complement(4349..5581) /locus_tag="DP116_13645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866131.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase group 1" /protein_id="PRJNA477356:DP116_13645" /translation="MDKFLTEKGIDFSQNSQTKVNQVFVFLEVFEREGGIQSHVKDIF RAYLALDEPFYAEVFLLRDSHNCSNPFESERLKFHYFKSQSPHLGRVRMTLALLSHLL RQRPAHVFCGHVNLAPLVSILCQPLGIPYTVMTHGKEVWQALPTLTKSSLQKAAHIWT VSRYTRQVACAVNNLDPNKVKLLPCAVNGNNFTPGTKSTALLERYGLVGAKVLMTVAR LWSGDIYKGVDVTIQALPQIAEVFPEVKYLVIGRGDDQPRLAKLAQDLGVSDAEAATF GDAHSGSRAKRDRVIFAGFVPTEELVEHYRLADAYVMPSAEGFGIVYLEAMACGVPVV SGDADGSAEPLQDGKLGWRVPHRDPNAVAQACIEILKGDDQRCDGVWLREQAIALFGI ETFQQRLKEQLLSGVKTS" gene complement(5651..6094) /locus_tag="DP116_13650" CDS complement(5651..6094) /locus_tag="DP116_13650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876624.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anti-sigma regulatory factor" /protein_id="PRJNA477356:DP116_13650" /translation="MLSIVQQDHLTVKSELKLLNQVQQWFERFCLQHLFQLGWSESQL YRLNLALAEGFTNAVRHAHHALPPETTIEIDVSLWMDRLEMRIWDYGKPFNPDALEEP EPGTLQVGGYGWFLLRRLADHVVYERGADGRNCLLIVKYGLQGQP" gene complement(6649..7131) /locus_tag="DP116_13655" CDS complement(6649..7131) /locus_tag="DP116_13655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653216.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome b6-f complex subunit IV" /protein_id="PRJNA477356:DP116_13655" /translation="MATHKKPDLSDPKLRAKLAKGMGHNYYGEPAWPNDLLYVFPIVI MGSFACIVALSVLDPVMTGEPANPFATPLEILPEWYLYPVFQILRSVPNKLLGVVLMG SVPLGLMLVPFIENVNKFQNPFRRPVATTVFLFGTLVTLWLGIGATFPIDKSFTFGLF " gene complement(7330..7977) /locus_tag="DP116_13660" CDS complement(7330..7977) /locus_tag="DP116_13660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490099.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome b6" /protein_id="PRJNA477356:DP116_13660" /translation="MANVYDWFEERLEIQALAEDVTSKYVPPHVNIFYCLGGITLTCF LIQFATGFAMTFYYKPTVTEAFSSVQYIMNEVNFGWLIRSIHRWSASMMVLMMILHVF RIYLTGGFKKPRELTWVSGVILAVITVSFGVTGYSLPWDQVGYWAVKIVSGVPEAIPV VGTLMADLLRGGSSVGQATLTRYYSAHTFVLPWLIAVFMLFHFLMIRKQGISGPL" gene 8364..9605 /locus_tag="DP116_13665" CDS 8364..9605 /locus_tag="DP116_13665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407120.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carboxyl-terminal protease" /protein_id="PRJNA477356:DP116_13665" /translation="MGFMQKKVFRAGLSLLLAFWVGFCGFCQPALALTDEQKLISQVW RIVNRTYLDETFNHQNWASVRQKVLAKPLKDQESAYSAIQKMLQSLDDPFTRFLNPEQ YRSLQVNTSGELTGVGLQIALNAETGLLEVLAPISGSPAEKAGIRPHDRILKIEGIST QKLTLDEAAAKMRGSIGTPVTLFMQRDTEEWQVQLVRDRIALNPVVAELRSSPQGKSI GYLRLTQFNANAPMELAHAISSLEKKGADAYILDLRNNPGGLLQAGVEIARLWLESGT IVYTVNRQGIQGSFEAFGPAMTRDPLVVLVNQGSASASEILAGALQDNGRGTLVGETT FGKGLIQSLFELSDGSGLAVTIAKYETPNHRDINKQGIKPDKLISSEPITREQIGTEA DVQYQAAIEFLAKNSVVAGAA" gene complement(9846..10583) /locus_tag="DP116_13670" CDS complement(9846..10583) /locus_tag="DP116_13670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_13670" /translation="MSSLPQESNHYTAKLPRLVFSNEVLISANPRSDFALSTIFAFPP NRDTLGGTAYFIVGNEGNILIDCPAWDQINQDFLRSHGGVRHLFLTHRGAIGKTAEIQ KTFSCEVLIQEQEAYLLPGVDITTFNIEFTLNSTAQLIWTPGHSPGESCLYYQQLGGV LFTGRHILPNQHSEPVPLRTAKTFHWFRQIKSVKLLLERFTPETLQYICPGANTGFLR GKRFIDNAYTRLASVDLTALRQVQPLL" gene 10778..12256 /locus_tag="DP116_13675" CDS 10778..12256 /locus_tag="DP116_13675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876629.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="site-2 protease family protein" /protein_id="PRJNA477356:DP116_13675" /translation="MNFWFLLLLGLITYLMVQRSVAQITRTPVWLLWLVLMIPALLWT TWTMVYGAKQPPPRALMIWPLIVCPLLYWVLFQWGRRSLKETRTEPQKVPQSQPDIHN IAEPVPVRPIEPTEETQLRNCFPWSVYYIQNIEYRPQAIICRGQLRTTASNAYERIKA NIEKEFGDRFLIIFQEGFSSKPFFVLVPNPQLAKANSRGQEKITRPGLALMLLAVTLV TTTLVGVRFAGVNSTMLASNPAELLKGLPYALALITILGTHELCHYLTARFYKIRSTL PYFIPMPLFLGTFGAFIQMRSPIPNRKALFDVSIAGPLGGFIMTLPLLVWGLAHSDIV PQPEKTGLLNTNALNPQYSILLALLSKLALGSQLTSNSAIHMHPVAVAGFLGLIVTAL NLMPVGQLDGGHIVHAMFGQRTAVVIGQISRLLLLLLSLIQPEFFPWAVILLFLPLID EPALNDVTELDNKRDIAGLLAMALLVIIVLPLPQVLARLLHI" gene complement(12468..13142) /locus_tag="DP116_13680" CDS complement(12468..13142) /locus_tag="DP116_13680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006516534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_13680" /translation="MVMISTQVSELELEDFQGKVTQNVVLPNVSWQTYKALLADMGDH RAARLTYDQGILQIKMPSKLHEIINRLLARIVTTLTEELELNVIDLGSTTLDREDLDK GAEPDTCFYIQNANQLQGLDPEIPKDLPPDLVIEVDITSPSTHRIGVYLALGIPEVWC YTKKQGLKIYHLQTDASHRDYVESEFSLAFPKVSAQALNQFLQQRQTQNENTVIRAVR DWIQSN" gene complement(13806..15332) /locus_tag="DP116_13685" CDS complement(13806..15332) /locus_tag="DP116_13685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hemolysin D" /protein_id="PRJNA477356:DP116_13685" /translation="MKLDHKRVVSGKILDTEIKVPVKSPQQTDSTPSVVLKQSPLFSR VILWGLMGMTTLVVIWANFAKIDEAVPAQGKLEPQGTVKEIQTPVNGVVKAVYVKDGQ KVHRGDVLLRLDPTTARSQLLSLQKVRNTLMQEIQFYRTQLVAKNALSNPQFIERTNL PPEIASLTKNRIALIAENRLYRAQLSENSQNHQLAPEEEARLQFSKAELNSRLADAKL ETEKLERQLSQAQIQFSSAKQVKDVSQTILNGIEPLAKQGGISRVQYLKQKQEVSQQQ AEMEQLRQEQARLQYAIAQSREKLRNTMAGAKKDLLTKIADNEKQIASLDRELNKVII ENEKRIAEIDGQISQAQVTLQYQELRAPSDGIVFDLKAKSPGFVASSSQPVLKIVPDE ALTAKVFLTNKDIGFVRKGMKVNIRVDSFPFSEFGDIKGELVWIGSDALPPEQVRPYD SFPARIRLDTQSLRVNNQELPLQSGMSITANIKVRNRTVMSIFSDLFTKQVDSLKSVR " gene complement(15489..18470) /locus_tag="DP116_13690" CDS complement(15489..18470) /locus_tag="DP116_13690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315679.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I secretion system permease/ATPase" /protein_id="PRJNA477356:DP116_13690" /translation="MTYTASIQEFISELFPFNQLPTRELAKLVPKFEMLRYRMGQSVL VREQMPAQISIIYEGQARLLGYDPNAQMPETLQLVQRGEILGWTSLIRGIPCETVIAS TETLCLTLNATEFLELLRHYPAIAICFENRCTAVEVFELLSTELAKRADSETVLASYG ATDIKELTLKVLDEAVVCTFLEETIPLDHLDPQLVWFVSGGILKDFPLGSRLHLNDVM TYLEVQTFGYVRLVGLPRATLSASASGSQTRDFSRIDSLLEEVPLAPEHPHRLSEPSD SQSRKYPHVHGRGPLKATLACFQMLSQHLNIPFRRDVINQILNNQVSRTQTLSLQLCG AVCELIGLNTQMINVSAKAITKLQAPAMIRWQDTFAILYEISEKKLVLGVPEVGIIQQ SPDHFVENWGQEGQVLLLKVAKYTPRRRFSLSWFVPSLVKYRKVLLEVLIASFFVQVF GLANPLIIQIIIDKVIIQNGFETLNVLGILLLIMAVFEGLLTSVRTYLFVDTTNRIDL TLGSEIIDRLFRLPLRYFERRPVGELATRASELENIRSFMTGTALTVVLDAVFSTIYI AVMLVYSWMLTLVALATVPLFAFLTIVFSSIMRRQLRFKAERYSETQSYLVEALSGIQ TVKAQNIELRSRWRWQEYYSNYVTAGFQSVVTSTAASVTSNFLNQVSSLLVLWVGAYL VLKGQFTLGQLIAFRIISGYVTAPLLRLTQLWQNFQQTALSLERLADILDTPSEVDED SENIPLPTIDGTVKYENVCFRFTPTSPLQLNNINLDFPSGKFVGIVGQSGSGKSTLMK LLPRLYDLESGRIFIDGYDISKIELYSLRRQIGMVLQDSLLFDGTIQENIALTNPDAT TDEIMTASKVAAAHDFIMSLPNGYNTRVGERGSSLSGGQRQRIAIARTVLQNPRLLIL DEATSALDYQSERQVCDNLAEAFRGKTVFFITHRLSTIKNADIVLLLEKGLVAEQGTH NELMAQKGRYYCLYQQQEAQL" gene complement(18507..18785) /locus_tag="DP116_13695" CDS complement(18507..18785) /locus_tag="DP116_13695" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13695" /translation="MEFPQSGTKGETSALGRGASAVLGSQCVGRLCRLVAPGVSPQVE HLAFSRQQATGVCLRHALWAKAVRRALCAYPEGSRVQVNSQKSWTFDF" gene complement(19424..19990) /locus_tag="DP116_13700" CDS complement(19424..19990) /locus_tag="DP116_13700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phycobiliprotein lyase" /protein_id="PRJNA477356:DP116_13700" /translation="MNIEEFFQLSAGKWFSHRTSHHLASKQSENGKSDTIIEILSSDD PEVVKLYQQYNVDPSRTCYGAKVTWNATMAGSEKKDTGSTVFVAVPDEDNPNEGKFLR PIDSAGKPLVAGRYKIGTDDALTLTTEYETMWYEERLWFASPNLRMRVSVVKRIDGFS TASFTSEIRMGGSQPTAKTSQATHSASS" gene complement(20244..20915) /locus_tag="DP116_13705" CDS complement(20244..20915) /locus_tag="DP116_13705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860610.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_13705" /translation="MAVPSLQEISTQLESSNLRDRMVALASLRHIAAEDALPLIKKVL DDESLQLRSMAIFALGVKQTEESYPILVRILETDPDYGIRADAAGALGYLGDIRAVEP LMRIFYEDTDWLVRFSAAVSLGNLKDSRAREILRKALDSEEVILQQAAIAALGEIKDI ESVDLILRFAQSDDWLVRQRLAEALGHLPSPKSISALKYMEKDSHNHVAEAARISLKR LEEAG" BASE COUNT 5928 a 4651 c 4432 g 5968 t ORIGIN 1 ggattaatcc gcagtgcttg gttgaaatca gctattgctg cttgcttgtc tcccaagtcg 61 tagcggacaa caccccgacc gaagtaggcg ttggcatagt tgggattaat ccgcagtgct 121 tggttgagat cagctattgc tcctttaaag tctcgttttt tatacttatc tccagcctga 181 ataaagaagt cgtcagcttt gggttttgtc gctacttgag tgctaggagc agaaactccc 241 acattttccc cagcttttgc tgaaatgatt aaaaacgtat tgatgggaat gcctaagtta 301 aagcctgttt tgatttcttt ggtatctttg tcagctctac catgaattcc tatcagttca 361 gctttttcat tcaagactgc accaccactc atgccgtctg acgtattatt actgtagacc 421 aaggcatagc catcacgcag cggtttcgag gcattagcag taatccgccc atcaatgaaa 481 gtgtaaattg actgattaat cgcgaatgtc ggtgcaggaa acccagctac ataagccgtt 541 gtgccttctg tgctggcgtc agagttaccc attttagcaa cagtgtaatt ttgactgctg 601 ctaaactgca cctcggctaa gtctactcca ggcagcggtt ttatagtcct gtagttgaca 661 gcatagcgcc gattatcagg agtgataatt tcataattat ctgctccttc taccacatga 721 gctgcagtga ggacggtgta agtattgcct tctttcttga taatcacccc agaaccgtac 781 ctaggctttt tactttgaat cagcactgtg atttccttgg cagttttatt cacctcagct 841 ggagatagtg 
ccacagccac ttgcgtctgc aaaacagcga ttgatacccc aattactgct 901 ggcgcgagcc gataataaaa gttcattatt caatattcaa tttttagtaa agtataaata 961 tgcttccaca tcaatattga cgcgagtgtt gcttttgtca ccaccagttt gattcacggt 1021 atttccagct gccaaggcac ggcgatcaaa taactgacgt gcagtagcgt caggatcgct 1081 acctggtttg agagtaaata acaaagtcgt atctgtgcaa gcagcgtttt tttcagcaac 1141 agcgcacatc acgggttgtc ccttgtaggt atctgccttg atatatctca gtgtatcgtg 1201 gtcatagctc ctctgaaatc tgcgagagac ttccagacag cgttgctgag gcgtcaatcc 1261 tgtaccgcta aaatacttgg aaaaccagcg aatcaccggc agatcttttc cgtcttgtgt 1321 acgaacaaat gttgtaggta caccattgct tttaccacag tagaaggttg gactttctgc 1381 gtagctgcgt tgatttagag ttacagttgt ggcaaatgtc gtcaacgtag ctatactgaa 1441 aactctcaca cccaatgtta ttacttgagt agatgttctc aatttcataa tgcaattcac 1501 taaaaaaggt taaagctcta ttttagaaga ataaattgaa ctacaaattt aaatagcact 1561 acttcaaact atgccataat attcagtctt aatggtactt ttacactcac aaattgtgaa 1621 gaaaagttat cagccgtttc ccttaaagca ttggtgttaa cttaagctga aaactttgta 1681 attaactaca tcatgccctt atacttgccc acaattgtcg cataacccct aaaatctttc 1741 cctctccgaa ctcgcaccag agggatgccc gacagggcag ggtgaggttc cgagggaatt 1801 ttgagtgatt cgatgacttg tgtgtacacc gtaggaatgc gcacctatgg acttctccca 1861 actgtctcaa ttggcacagc ccaactcaac cgcacaatca tcttatgcaa cttttcatca 1921 gcttcctcac catccaaaaa cactgaaggt gcatcccaca acggataggc gtgcatcccg 1981 ttcacaccca caacctcacc gcgacgattg agtaatggtc cgccactcat acctttttgg 2041 atatcattgg tgtaaccaat ctggtaacct ccctctaaag ctttgggtaa aaccaaagat 2101 acctttccag tggtgaaggt aaagtttttt gcttcttgtt tctcttcatc aaacggaaac 2161 ccagctgcaa acacctcatc cccttctttg ggaaaagaac caacagacgc gacagcataa 2221 acagcgttag tggggcgaaa ctgcaagatc gccaaatcat tgccttgtag agatgttgca 2281 cttaatgtct ttgcttgcca aatgcgacca tcagaagttt gaatacgata agggggcttg 2341 tctgctcgca gtacatgagc atttgtgaca actgtataaa cagatttttg tttttttagt 2401 ataataccgg agcctaagaa gttctttgat atgactttga cagcgatcgc ctgcgcctgc 2461 ttttgaattt cttctactga tagttgtgta ggcagttgtt tgagtgtcgt atccttgcta 2521 ctcactttac tctgtgagtc 
agaaaaactg atatttgcta ccaatatcaa aagcgcgatt 2581 aataaaccgc caacacaaac agtctgtgtt gatttacgcc aattcatcta aggaatatta 2641 ctagcgcttg ggctagattg agagtttgac tgaactgttg ggctgaactt ctgctttctt 2701 gaggtataaa tagacatcta caatgaatgc gctacatcct tgagctacaa taaactaggc 2761 tacaaagaaa tgtagcagta tttctgaatg atatccagat actggatatt tccctatctc 2821 agtcggttgt ctccttattg cctacatcga aagaacagcc gtgctggtca ttgacccagc 2881 tcagactttt caaaacgcct aacttcaaat ccccgtaaat cttcaggttc ctcgtattct 2941 atcgaaactt ctttaaccat cgcagcgggt ggtccttggt aacaccaacg aatcatttgt 3001 cctacaaccg tttgactacc ttcaaacact gcttctacgc gaccatctgg aagatttcgt 3061 acccaacctg tcaatccgag ttgggtagca gtatccacgg tggagtagcg ataccctact 3121 ccttgaactc ttccagaaat gaatacatga gcgcggactt gctttgggtg cggtgtggta 3181 ttctgcatta actggtcaag tctttactat tttcagttta ccttgtttat gatccaccaa 3241 agaattttga attccaacta acttgtgtat gtctaacttg ccaccactga caacagaaac 3301 tatttgggca atcctcaatg aggacattga tgatgctaca gtcaaccaat tggtatggca 3361 ttgcttgggg tatcgccatg actcctcaac aggagagtgg gacactcacc aagtcgcacc 3421 agaatggcgg gatgagtacc cacaaccacc agattttatc gatagtcgcc ccccaacagt 3481 caagctgact cgttccattc ctcaagagaa caaacagtta ttaaaagaaa agctgggttt 3541 taaaggttac aaaattggtg aatttggacc tcggcaaaca cggagggcaa cagcggcaaa 3601 ttggttgtta agttatatgc aactcaatag cattgacttt tgattgggct gtttgcgtag 3661 catgctgcag ggattttaac acgtgctaag gactccgcag tcgcctcaac gggggggaac 3721 cctccgaagc tttgagtctc ctccggagac gctgcgcgaa cggaggaaac ctccgctcaa 3781 acttctctcc gcacggcgct gctcacagag tcatactctg tgccctcgca aagctttgtg 3841 tttcacccga ttgaaaactg ctaatcacta agtgatttac tcggtacttc aactgctgtc 3901 acatcaattg tgccttctgg agtcaagcgc tgaaaatgtc cctgttgcac catcaactgt 3961 tccccacaat taggacactg caactgagag tgctttaaac cagtaaattc gtatctacaa 4021 acgggacatt ggtcggtgac caagttgtat tgtagccaga agcgaaaccc aaaaaaaatg 4081 agaatgggtg ccaacaacaa tagcccaata ataatcagca aggagttaac caaccagccc 4141 aagcctaatg acgccagcaa ccatgtaatt gctagcaagg tgagccaagg gcgtaaatta 4201 gacagattaa gctgaaacgt tttaaggctc 
attttttaaa tgtctcttgc tctccctcta 4261 ggataacgat tttactaaaa agtcaatagt tactcgttaa ttgttaatcg caatcagtca 4321 caaaactcaa caaaggtgca atgggtgact atgaagtttt gacgccagac aaaagctgct 4381 ctttcaagcg ctgctgaaaa gtctcgatac caaatagggc gatcgcctgt tctcgcaacc 4441 ataccccatc acaacgttga tcatcccctt tgagtatttc tatacaagct tgtgcgactg 4501 catttggatc gcggtgtgga acgcgccatc ccagtttacc atcctgcaac ggctcggctg 4561 agccatctgc atcaccagac accacgggaa caccacaagc cattgcttct aagtagacaa 4621 tgccaaaacc ttcagcagaa ggcattacgt aagcatcagc aagacggtag tgttctacta 4681 attcttctgt gggaacgaac ccagcaaaga tgacgcgatc gcgctttgcg cggctccccg 4741 agtgagcatc accgaaggtg gcggcttccg catcgctaac gcctaaatct tgagcaagtt 4801 tagctaatcg aggttgatca tcgccacgac caataactaa atattttacc tcagggaaaa 4861 cctcagctat ttgcggtaac gcttggattg tgacatctac acctttataa atatcaccag 4921 accacagtcg cgccacggtc atcagtacct ttgctcctac taaaccatag cgctctaaca 4981 acgctgtcga ttttgttcct ggagtaaagt tattaccatt cacagcacaa ggtaatagtt 5041 tgactttgtt agggtcaagg ttattgacag cacaggctac ttgacgagtg taacgactta 5101 ctgtccaaat gtgcgcggct ttttgcaaac tagacttcgt aagggttggt aacgcctgcc 5161 aaacttcttt accatgagtc ataaccgtgt aaggaattcc caaaggttga cacaagatac 5221 taactaaggg ggcaaggttt acatgaccac agaagacgtg cgccggtcgc tgacgcaaca 5281 gatgggaaag taatgctaga gtcattctga ctcgtcccag atgaggagac tgggatttga 5341 agtaatgaaa ttttaaacgt tctgactcaa aaggatttga gcagttatga ctatctctga 5401 gtaaaaatac ttctgcataa aaaggttcgt caagtgctag gtaggctcga aaaatatcct 5461 ttacatgtga ttgaattccg ccttctcgct caaaaacttc taggaacacg aagacttgat 5521 taacttttgt ttgggaattt tgactgaaat ctataccttt ttctgtgaga aatttatcca 5581 taattgacta ctttgcttcg agactagctt tctcaaacta ctattttgta gtcaatttaa 5641 ctgaaaactc ttacggctgc ccctgtaaac catacttgac gatgagcaaa caatttctgc 5701 catctgcacc acgttcatag acaacgtggt cagccagacg ccgcagaaga aaccatccat 5761 atcctcctac ttgaagtgta ccaggctctg gttcttcaag cgcatcagga ttaaaaggtt 5821 ttccataatc ccaaattctc atctccaacc gatccatcca taaagaaaca tcaatttcta 5881 tggttgtttc cggaggtaaa gcatgatggg cgtgacgtac 
ggcgttggta aagccttctg 5941 ctaatgctag gttgaggcga tagagttgac tttctgacca accaagttga aacaagtgtt 6001 gcagacaaaa tcgttcaaac cattgttgca cctgatttag aagctttagt tcgctcttca 6061 ccgtcagatg gtcttgctgc actatgctta gcattgacct tgacttaatg ataagtgagt 6121 gaaaaatgtt tgtagctttt gtatactaca aatttgaaat tttgcaattt catctccatg 6181 caaaattgaa atccaaaata caacctggtt aatcaaagtt ttgtactgac taataaagaa 6241 ctcacgagtc aggagtcagg agtcaggagg aacttgctct ttttattccg agttctgaat 6301 tccgaattct gtgttctttt tcagaccttg attagatcaa tttggtagca attaaacttg 6361 ccgtgtgatg cgacattccc tgataaaaaa cttaagggtt ttaaacttcc ctcttactcg 6421 ctaaaacttg ttaattcaac tgttgggcgt ctatgctgca atcaacgtgt aggttttcac 6481 cttttggctg acgcgtctat tttagcgtat tttttgacaa agcataagcc agcgccagtg 6541 gtttgcaaga gccagtacag acttaccatt tgaagaaaat ttgaggttcg acatagaacc 6601 tcaaacttgg tagttaatgc aatgcccgta tttctttagc taactaattt agaacagtcc 6661 aaaggtaaag gacttatcta ttgggaaagt tgcaccaatt cccaaccata aagttaccaa 6721 agtaccaaat aaaaacaccg tagttgctac tggacgacgg aaggggtttt ggaacttatt 6781 cacgttctca ataaagggaa caagcatcag tcctaggggt acggaaccca tcaataccac 6841 tcctaagagt ttgttaggaa ctgaacgcaa aatttgaaat acgggataga gataccactc 6901 tggtagaatt tccagcggtg tagcgaaggg attcgctggt tctccagtca tgacaggatc 6961 taatacagat aaagccacga tacaagcaaa agaccccata atcacaattg ggaagacgta 7021 cagcaggtca ttaggccaag ctggttcgcc atagtaattg tgacccatgc cttttgctag 7081 tttggctctt aattttggat cgctcaggtc aggttttttg tgtgtcgcca ttttttaatg 7141 tgctctcctg ctgaaatgaa tttcaagctt ttgttaatac cgtcataagt gtgggcatta 7201 actttttagc ttatgcgaag tatttgttct tgtacctgtt atgtctaatg tctcattttc 7261 cataagttta gaacaaacga ctcccgtgtt taaactaaaa gtttatgccc caccgctcac 7321 aacttgcgat tacaaaggac cagaaatccc ttgtttacga atcatcaaga agtggaatag 7381 catgaagacg gctattaacc aaggcagtac aaaagtgtgt gcgctgtagt aacgagttag 7441 tgtcgcttga ccaacgctgg agccgccacg tagtaagtca gccatcaagg taccgacgac 7501 gggaattgct tctggtacac cacttacaat tttcacagcc cagtaaccaa cttggtccca 7561 aggcaatgaa tagccagtca caccaaaaga aactgtgatc acagctaaaa 
tgacaccact 7621 tacccaagtc aattcgcggg gctttttaaa cccacccgtc aggtaaatcc ggaaaacgtg 7681 caaaatcatc atcagcacca tcatactggc agaccagcgg tgaatggaac gaatgagcca 7741 gccaaagttg acttcattca tgatgtactg cacggaagaa aaagcttcag tgacggttgg 7801 cttgtagtag aatgtcatcg caaatccagt cgcgaactgg ataagaaagc acgttagagt 7861 gattccaccc aagcagtaga agatattgac gtgaggaggg acatacttgc tggtgacgtc 7921 ctcagcaagc gcttgaatct ccaagcgctc ctcaaaccag tcgtaaacat tagccataca 7981 agctcaggtt cctagaaatc gattcgcggt tgataaactt cccaaattgt tgcagttttg 8041 ggattttagt taaacaggca tttgctcgtt atctttttga gcattcagag tgttaatagt 8101 ttaaaaaaaa ctttacactc tgaaaagagg gtatgtattg cgttctcttg tgccccaaac 8161 acacggtggt ggtggacaca aagtcgattt tatctttttg gaagtagaat gtgcgctctt 8221 taattttgat gttgactgaa caactagatg agagcaatgc accacttcta taaaaaaagt 8281 aacatagtcg agagacagat ttcatcaaaa agcaaggaag cttgggcaaa ctctggcaat 8341 tcgggatgaa agtggaagtg aaaatggggt tcatgcaaaa gaaagttttt cgggcgggat 8401 tatccctgtt attagcgttt tgggtaggtt tttgcggctt ttgtcaacct gccttggctt 8461 taacagacga gcagaaactt atatcacaag tctggaggat tgtcaatcgg acatatcttg 8521 atgaaacatt taatcatcaa aactgggcgt ccgtaaggca aaaggttctg gcaaagccac 8581 tcaaagacca ggaatcagct tattcggcaa ttcaaaagat gctccagagc cttgacgacc 8641 cttttacccg ttttttaaac ccagaacagt accgcagtct acaggtcaat acttctgggg 8701 aactcacagg agttggattg caaatcgctc tcaatgccga gacgggttta ttagaggttc 8761 tggctcccat atctggttca ccagcagaaa aagctggaat tagaccgcac gatcgcatct 8821 taaaaattga gggcatctcc acacaaaaac tgacccttga cgaagcagca gccaaaatgc 8881 ggggatcgat tggtactcct gtgactctgt tcatgcaacg agacacagaa gaatggcaag 8941 ttcaactggt gcgcgatcgc attgcactca acccagtcgt agcagaacta cgctcttccc 9001 ctcaaggaaa gtctatcggc tatcttcgcc tcacacaatt taacgccaat gctcctatgg 9061 agttagcaca tgctatttcc agtttagaaa aaaaaggcgc tgatgcctac attctcgatt 9121 tgcgaaataa cccaggtgga ctcttgcagg caggagttga aattgcccgc ttgtggttag 9181 aatctggtac cattgtttac acagtgaatc gtcaaggcat tcaggggagc tttgaagcat 9241 ttggtccagc aatgactcgt gaccccttag tggttttggt caatcaagga agcgccagtg 
9301 ctagcgagat tcttgctggt gcattacaag ataacggtcg tgggacttta gtaggagaaa 9361 ccacctttgg caagggttta atccagtcct tatttgaatt atctgatggt tccggcttag 9421 ctgtcactat tgctaagtat gaaaccccca accaccgcga tatcaataag caaggtataa 9481 agccagacaa attaatttct tctgaaccaa tcactcgcga acagattggc acagaagcag 9541 atgttcaata tcaagcagct atagaatttt tggcgaaaaa ctctgtagta gcaggcgcag 9601 cttagtgtgg gttaggggag atgagggagt gtgggagtga gacagcgcga atgacggtga 9661 gacagcgcga atgacggctc tccctcactt ggcgactgcg tatgcgcaaa gcgcacgccc 9721 aaagggctaa agcgcagcgt gaccgttcgg cgcagccgtg gcgttagcca taggtcatac 9781 ccgaagggtg tgagaaaata tcaagacctc ctactgttcc tctttccttc cctcccctac 9841 tttgctcaaa gtaggggttg tacttgtcgt aaagccgtta aatcaacaga agccagacgc 9901 gtataagcat tatcgataaa acgctttccc cggagaaaac ctgtgttggc tccaggacaa 9961 atgtactgga gtgtttctgg tgtgaaccgt tctagcaaca atttgacact tttgatttgt 10021 cggaaccagt gaaaagtttt agcggttcgt aggggtacag gttcactgtg ctggttgggg 10081 aggatatgac gtccggtaaa tagtacgcct cctagctgtt ggtagtatag acaagattcg 10141 ccaggagaat gaccaggtgt ccaaattaat tgcgctgttg aatttaaggt aaactctata 10201 ttaaaggtag ttatatctac tccgggtaat aggtaggctt cttgctcttg gatcaaaact 10261 tcacaactga aagttttttg aatttctgct gttttgccaa tagcgcctcg atgagtcaga 10321 aataggtggc gcacgcctcc atgcgatcgc aaaaaatctt gattaatttg atcccaagct 10381 ggacaatcta tcaaaatatt gccttcgttt cctacaataa aataagcagt cccccctaat 10441 gtgtctctgt ttggtggaaa tgcaaaaatt gtacttaacg cgaaatccga gcgggggttt 10501 gctgaaatca gaacttcgtt agagaagacc aatcgtggta gctttgccgt ataatgattt 10561 gactcttggg gcaaggaaga catgagaaaa actgttagat gaggagttgt caaaaatcag 10621 gtgacagtgt tggacaattg atgtaaaaac gcaaaatttc ccaggactgt tgactgttgt 10681 cagaaaaatt gaattttggg ggcatatgct tgttgtgggt gctgcacagc gcttgaggac 10741 aagcaagatt taacatctta aaacaactga aagtcacatg aatttttggt ttcttcttct 10801 actgggacta atcacttatt tgatggtgca gcgcagtgtt gctcaaatca ccagaacgcc 10861 tgtgtggctg ttatggctag tactcatgat accagcactg ctatggacaa cttggacaat 10921 ggtatacggg gcgaaacaac ctccgccaag agcgttgatg atttggccat 
taattgtctg 10981 tcctctgtta tactgggtat tgtttcagtg ggggcgtcgc tcgctaaagg aaactcgtac 11041 tgaaccacaa aaagtaccac aatcacaacc ggatatacat aatatcgcag aaccagtacc 11101 agtgcgtccc attgagccaa cggaagaaac tcagctaaga aattgttttc cctggtctgt 11161 atactacatt caaaacattg agtatcgacc ccaggcaatt atctgtcgag gtcaattgag 11221 gacgacagca agcaacgcct acgagcgaat taaggcaaat attgaaaagg aatttggcga 11281 tcgctttctc atcatctttc aagaaggttt cagtagtaaa cctttcttcg tgcttgttcc 11341 taatcctcaa ttggcaaaag ctaatagtcg tggtcaagaa aagataacac gaccaggatt 11401 agcgctcatg ctcctagcag tgacattggt aacgactacc ttggtgggtg tgcgatttgc 11461 tggtgtcaat tctacaatgc ttgcatccaa cccagcggaa cttctcaagg gattgcccta 11521 tgccttagcg ttgataacga ttctgggtac gcacgaactt tgtcactact taacagcacg 11581 attttacaaa attcgctcaa cgctgcctta ctttatccca atgcctttgt ttttgggaac 11641 tttcggtgca tttattcaaa tgcgtagtcc aattcccaac cgcaaagctt tattcgacgt 11701 cagtatcgcc ggaccccttg gtgggtttat catgacctta ccacttctgg tatggggttt 11761 ggctcattcc gacatagttc cccagccaga aaaaacagga ttgctcaaca caaatgccct 11821 caatccccaa tattccatcc tactggcgct actttcaaag ctagctttgg gaagtcagct 11881 aacatcaaac tcagcgattc atatgcatcc agtcgcagtc gctggttttt taggactgat 11941 tgtaacagca ctgaatttaa tgccagtagg acaactcgac ggaggtcaca tagtccatgc 12001 gatgtttgga caaagaacag cagttgtgat tggtcaaatt tctcgcttgt tactgctgct 12061 actttctttg atacagccag aattttttcc gtgggcggtt atcttattat tcttaccatt 12121 gattgatgaa cctgctttga atgatgtcac agaacttgat aataagcgtg acatagcggg 12181 gttgctagca atggctttgt tagtcatcat cgtactaccg ctgccgcagg tacttgctag 12241 attgttacat atttaacaac tgattgcgga gcaaacctgg agggctggcg agtcccaaaa 12301 gctactcaaa cagatctgga aacagcagtc ctcaccacta gtagtctgtc aaggcaactt 12361 tgattggtta agaacgaacc gccttagcgc agcgtgtccg caggacatag gcgccaagaa 12421 cgcaaaggaa gagaaagaag aatgagcaaa gttggtgtgg tggactacta gttactttga 12481 atccaatctc gaacagcacg aatcacagta ttttcgtttt gagtttggcg ctgttgcaaa 12541 aattgattga gcgcttgtgc agataccttg ggaaaagcaa ggctaaattc tgactcaaca 12601 tagtctcgat gagacgcatc tgtttgcagg 
tgataaattt ttaatccctg ttttttggtg 12661 tagcaccaga cttcaggaat gcctagggca aggtataccc caattctgtg ggtagaagga 12721 ctggtgatat ccacttcaat aaccagatcg ggcggcaaat ctttaggaat ctctggatcg 12781 agtccttgga gttgattggc gttctgaata tagaaacaag tatcgggctc tgcaccttta 12841 tcaagatcct ccctgtctag agttgttgat cccaaatcaa taacatttaa ttccagttct 12901 tctgtcaaag ttgtgacaat tcttgccaat agccgattaa tgatttcgtg gagtttggat 12961 ggcatcttaa tttggaggat tccctgatca taggtcagac gggcagcacg gtggtcaccc 13021 atgtcagcca atagggcttt gtaagtttgc caactcacat tggggaggac aacattttgg 13081 gtgactttgc cttgaaagtc ttccagttca agctcactca cttgtgtgga aatcatcacc 13141 atatgtggtg tctccctagt taagactata ttaccagagc gtttttctct cgggtcttta 13201 tctattatcc acgggttcaa aagtgcatga tgtcccattt caaggagcga ttgcgctctt 13261 gtgctgacgg aggttttatc ttcgtttgga attgcgccaa acaaaatgcc tgtggattga 13321 tcgcaagaat catatcaata ggacttacgc aaaaaccctt ttaaaccctc ttaactatgc 13381 gctaaggcgc acgctacgcg ttagcgaagc gtctccgcag gagatacgtg ccctttgcgt 13441 cctttgcggt ttattttttc attattttgc gtaagtcctg atcacatccg gtagcatctg 13501 aatcattaaa aaaccgcaga gagcgaattt gcggtgtgga tgttcctccc gcaaagttcg 13561 cggtgggcgc tccagaacac accagagaag agggagaaat aattgtgata caaacggatt 13621 taatatcaca aatactgcta aatagttcaa cctaaaaaaa cgtaaaatgt agggtgcgtt 13681 acactccgtt aacgcaccct acgtaaatct tatgtttaat ttggttgact tacttactag 13741 cttgccaaat aggggaattc cggtactcgc agccgcagta aatttgcata cattataatg 13801 ctctgttatc tcaccgactt aaggctatca acttgcttgg taaacaaatc gctgaaaatg 13861 ctcatcacag tgcgattacg taccttaata tttgcagtga ttgacatccc tgattgcaga 13921 ggtaattcct gattgttaac acgcaaagat tgtgtatcta agcgtattct agcaggaaaa 13981 ctgtcatatg gacgaacttg ttcaggtggc aaagcatcag aaccaatcca aactaactca 14041 cctttaatat cgccaaactc actaaaggga aaagagtcaa ctctaatatt gactttcatt 14101 ccttttctca caaaaccaat atctttattg gtgagaaaaa ctttagccgt gagagcttca 14161 tccggtacaa ttttgagaac tggttgacta gaactagcaa caaatccggg agacttagct 14221 ttaagatcga aaacaattcc atcagatgga gcacgcagtt cttgatattg taaagtgact 14281 tgagcttggc 
tgatctgacc gtcaatttca gcaatccgct tctcattttc tatgattact 14341 ttgttcaact ctctatcaag actcgcaatt tgtttctcgt tgtcagcaat ttttgtgagt 14401 aaatcctttt tcgctccagc cattgtattt ctgagctttt ctcttgattg ggcgatcgca 14461 tattgtaaac gcgcttgctc ctgccttaac tgttccattt cagcttgctg ctgactcact 14521 tcctgttttt gcttgaggta ttgtacgcga gaaattcctc cttgtttagc aagtggttca 14581 ataccattaa gaatggtctg actaacatcc ttaacttgtt tagcagagga gaactgaatt 14641 tgcgcttgac taagctgtct ttctaatttt tctgtttcca atttagcatc tgcgagtcga 14701 gaatttaatt ctgctttgct aaattgtaga cgagcttctt cctctggcgc taactgatga 14761 ttttgagagt tttcactcaa ctgagcacga tacaatcgat tttcggctat cagcgctatt 14821 cgattttttg tcagagaagc gatctctgga gggagattag ttcgctcaat gaattgcgga 14881 tttgataaag catttttcgc aacaagttgg gtacggtaaa attgaatttc ctgcattaat 14941 gtgttgcgga ctttttgtag agaaagtagt tgcgatcgcg ctgttgttgg atctagcctt 15001 aacagcacat caccacggtg taccttttgc ccatccttaa cataaacagc tttcacaacc 15061 ccgttcacag gtgtttgtat ttcttttaca gtaccttgcg gttcaagttt accctgagct 15121 ggaactgctt cgtcaatttt ggcaaaattt gcccaaatca ccaccaaagt tgtcattccc 15181 atcagtcccc aaagaatgac ccgtgagaat aaaggcgact gtttgagcac aacagatggt 15241 gtagaatccg tctgttgtgg tgactttaca ggcactttaa tctcagtgtc tagtattttt 15301 ccagagacaa cacgtttgtg atcaagtttc attgctttca attcacttat agccgtagtt 15361 tttatcagtt atcagttatc agttaccagt tatcagttat caggtaggaa acggactcgt 15421 ccaccccttg ttcactgttt actgttcact gtttactgtt cactgttcac tgttcactgt 15481 tcactgattt aaagttgtgc ttcttgttgt tgataaaggc agtagtaacg tcctttttgt 15541 gccatcaatt cattgtgggt tccctgttct gcaactaacc ctttttcgag tagcaaaacg 15601 atgtcagcat tcttgattgt acttaagcga tgcgtaataa aaaacactgt ttttcctcgg 15661 aatgcttctg ccaaattgtc acaaacttgt cgttccgatt ggtaatctag agcactggtt 15721 gcttcatcga gaatcaacaa gcgggggttt tgtaaaactg tacgagcaat agcaatacgc 15781 tgtcgctgtc ctccagataa ggatgagcct ctctctccga ctctggtgtt atagccattg 15841 ggtaaagaca taataaagtc atgggctgca gcaaccttgc ttgctgtcat aatttcatca 15901 gttgtggcgt cgggatttgt cagcgcaata ttttcctgta tggttccgtc aaacaaaagt 
15961 gagtcttgaa gcaccatacc aatctgacgg cgcagggaat aaagttctat tttggagatg 16021 tcgtagccat caatgaaaat tctgccagac tctaagtcat acagtcgtgg taaaagtttc 16081 atcaaagtgc ttttaccaga accactttgt ccgacaatac cgacaaattt tccgcttggg 16141 aaatccaggt taatattatt caactgaagt ggagaagttg gtgtgaagcg aaagcagacg 16201 ttttcatact tgacagtacc gtcaatggta ggtaagggaa tgttttcact gtcttcatca 16261 acttctgagg gagtatccaa gatatctgca aggcgttcta gagagagagc cgtttgttgg 16321 aagttctgcc acaattgggt taatcgtaat aaaggagcgg tgacgtagcc agaaataatc 16381 cgaaaggcta taagttgtcc taaggtgaat tgtcctttga ggactaagta cgctcctacc 16441 cataacacaa gtaaggaaga gacttggtta agaaaattgc tggtgacact agcagctgtg 16501 gaagtgacaa cactttggaa tccagcagtg acataattac tgtaatattc ttgccagcgc 16561 cagcgcgatc gcaattcaat attttgggct ttgactgttt gaatacctga cagcgcctcg 16621 actaaataag attgtgtttc agaataacgc tcagctttga aacgcagttg gcggcgcatg 16681 attgaggaaa aaactatcgt cagaaaagca aacaagggta cagtcgccaa agccacaaga 16741 gtcaacatcc aactgtagac cagcatcacc gcgatgtaga ttgtcgaaaa aactgcgtcc 16801 aacaccactg ttaaagcagt acccgtcata aaagagcgaa tgttttccag ttcactggcg 16861 cgagtggcta actccccaac aggacgtcgt tcaaaatagc gcaagggtaa acgaaacaaa 16921 cggtcaataa tttctgagcc taaagtcagg tcaatccggt tagtcgtatc aacaaaaaga 16981 taggtgcgga cactcgttaa aagtccttca aaaacagcca tgattaagag caaaatcccc 17041 aatacattca gggtttcaaa gccgttttga ataatcactt tgtcaataat gatctggatg 17101 attaagggat tagccagacc aaacacttgg acaaaaaagg aagcaatcag aacttccaac 17161 agaactttgc gatatttcac caaagaaggg acaaaccaac ttaaactaaa tcgccgccta 17221 ggagtatact tggcaacttt gaggagcaaa acctgacctt cttgacccca attttcaaca 17281 aaatggtcag gagattgctg gataatccca acttctggta cacccagtac tagctttttc 17341 tcgctaattt cataaaggat tgcgaaagta tcctgccagc gaatcatcgc tggggcttgc 17401 agttttgtaa ttgcttttgc tgacacgttt atcatttgag tatttaaccc aatgagttca 17461 caaacagcac cgcaaagctg tagtgaaaga gtttgagtgc gcgaaacctg gttgttcagg 17521 atttgattaa tcacatcccg gcgaaagggt atatttaaat gctggctcag catttggaag 17581 caagctaaag tcgcctttag aggacctcta ccatgaacat 
gaggatattt tcgcgattga 17641 ctatctgatg gttcgcttag cctatgagga tgttctggtg ctagtggtac ttcttctagt 17701 aaagagtcta tccttgagaa atctctggtt tgagacccag atgcagaggc agaaagggtt 17761 gcccgaggca aaccgactaa gcgaacataa ccaaaagttt gtacctctag atacgtcatg 17821 acatcgttta ggtgtaagcg acttcccaga ggaaaatctt taagtatccc accactgaca 17881 aaccacacca attgaggatc taggtgatcc aaaggtattg tctcttctag gaacgtgcag 17941 acaactgctt cgtctaaaac tttgagggtc agttctttga tatctgttgc accatatgaa 18001 gccaatacag tttcactatc agctcgcttt gctaactcag tactcagtag ttcaaaaact 18061 tcaacagcag tacagcgatt ttcaaagcaa atagcaatag caggataatg tctgagcaac 18121 tctaaaaact ctgtggcgtt tagagtcaga caaagggttt ctgtagaggc tatgactgtt 18181 tcgcatggaa tccccctgat taaactagtc caacccagga tttctcctct ttgaactaat 18241 tgcagcgttt caggcatctg agcattcgga tcataaccca gcaagcgggc ttgcccttca 18301 taaatgattg atatttgagc aggcatttgt tcccgcacca aaacgctctg acccatgcgg 18361 tagcgtaaca tctcaaactt tggaactaat tttgccagtt ctcgtgttgg cagttgattg 18421 aagggaaata attcagagat gaactcttgt atggaagcag tataagtcat gcgattttgg 18481 atgagttact agtccttaat cattcgtcag aaatcaaaag tccaagactt ttgactgttg 18541 acttgcaccc tgctaccctc cgggtatgcg caaagcgcac gccttacggc tttagcccag 18601 agggcgtgcc gtaggcatac gccagtcgcc tgttgtcggg aaaacgccag atgctccact 18661 tggggagaca cgccaggtgc tacaagtcgg cacagccgcc caacgcactg gctccccaag 18721 accgcactgg ctcccctacc aagagcgctg gtctcacctt ttgtcccact ttgtggaaac 18781 tccatcttgt gacttagatg aagaagagat atgacctgat aaagtgataa aagaagttaa 18841 aggtgataag agaaactaat tattagttct gctttttgag ccacatttca aacagttcat 18901 gtgagcgagc tcacacatca gttgatccaa ttgtgcaaag ataaattttt ccagtcgtat 18961 aagcacaaac caacctccca tgtgaataga acgtcaaagt tgacatggtt tacggatcga 19021 aagtatatgc accaatactg aacgcgcgaa agttcagtag ttttctttcc tgctgtgtta 19081 cttgctcttt taattggtgt gtggcagatt taaacgaccc tgactaattt ttggtaggca 19141 cttgtgacgt agactttact tctactttgt caggtatgaa aaccttttac aatatcttca 19201 ctatcttaag tgtgatttca ggcagattgc acggtaaaaa tactattcat ttctcaagga 19261 tttcatcaca tttccagtaa 
tattactgta tacacgaata tttgttacta taaatatcac 19321 atgaaaaatt ctgttaagac ttaaccaaaa ctatagcccc tattagcagg ggctagttaa 19381 atcagaactt tggtcaccta atggtgattg ttacttataa aacttaactg gatgctgaat 19441 gagttgcctg agaagttttc gctgtaggtt gacttccacc catacgaatt tcagaagtga 19501 atgaagccgt gctaaaacca tcaatacgtt tcaccacact cacccgcatc cgcaaattgg 19561 gactggcaaa ccacaggcgt tcctcatacc acatggtttc gtactctgtt gttaaggtca 19621 atgcgtcatc agtgcctatc ttgtagcgtc cagcaactag gggtttccca gcagaatcta 19681 ttgggcgcag gaatttgcct tcattgggat tatcttcatc tggaacagca acgaatactg 19741 tagaaccagt gtcttttttt tcactccctg ccatcgtggc attccaagtc actttggcac 19801 cataacaagt gcggctaggg tcaacattgt actgttgata cagtttcacc acttctggat 19861 catcactact cagtatttct atgattgtgt ctgatttgcc gttttctgat tgcttagaag 19921 ctaaatggtg actggtacga tgggaaaacc atttaccagc actcaactga aaaaactctt 19981 caatattcat caatgaaatt accttgcttc aaaatccttt tcatttcact tcttccaagg 20041 taacaggagg gagtcacccc ccaaaagaat tcaaaatcgc gtagcgttgc gagcgaagcg 20101 acgcaatctc aaaattcaaa aattttgacc aaagctaagg gtgtagagga ataaaagaat 20161 aaatgcattt tccctatccc ctggttaatg tatttccctt acacccttac acccttacac 20221 ccttataccc ctagtttatt tgcttaacct gcttcctcca agcgtttcag ggaaatcctc 20281 gccgcttcag caacgtggtt atggctatct ttttccatgt atttcaaagc tgaaatactt 20341 tttggactgg gaagatgacc taaggcttct gcaagacgct gacgcactaa ccaatcatct 20401 gattgggcaa agcgcaaaat caaatccaca gattcgatat ctttgatttc tcctaatgca 20461 gcgatcgccg cttgttgtaa aatgacttct tcactatcta atgcttttct gaggatttca 20521 cgggcacgac tgtctttgag attaccaagt gaaacagctg cgctaaaccg taccaaccaa 20581 tcagtgtctt cataaaatat tcgcattagg ggttcaacgg ctctaatatc acctaaatat 20641 cctaaagcac ccgcagcgtc cgcacgtata ccataatccg ggtcagtttc taggattcgc 20701 accaaaattg ggtaactttc ttctgtttgt ttgactccca aagcaaagat tgccattgac 20761 ctcagttgga gagactcgtc atccaaaact tttttaatca aagggagtgc gtcctctgcc 20821 gcaatatgcc ttaatgaggc gagggctacc atgcgatcgc gcaaatttga actttctaac 20881 tgagtggaaa tttcctgtaa gcttgggact gccatctttt taacttaaat gacttttact 20941 
atttttcttc atattaagta tcattacaaa ttctcagac // LOCUS NODE_1525_length_20851_cov_4.657723 20851 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20851) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20851) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20851 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..886) /locus_tag="DP116_13710" CDS complement(<1..886) /locus_tag="DP116_13710" /EC_number="2.7.9.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878705.1" /note="catalyzes the formation of phosphoenolpyruvate from pyruvate; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoenolpyruvate synthase" /protein_id="PRJNA477356:DP116_13710" /translation="MVTVAKDTLLESSSKERSLVLWFDQVGITDIPIVGGKNASLGEM IQQLTPKGINVPIGFATTAYAYRYFIEGAGLEEKLRELFSDLDVEDVKNLRERGKKAR SLLLHTPFPKELREAIVNAYLALCERYNPDTDVAVRSSATAEDLPDASFAGQQESYLN VTGIEGVLAACHKCFASLFTDRAISYRHIKGFDHFSVALAVGVQKMVRSDLASSGVMF SIETETGFKDAALISAAYGLGENVVQGSVNPDEYYVFKPTLKEDFRPIVDKRLGSKEL KMVYDDGSKFTKNVLVPES" gene 1815..2867 /locus_tag="DP116_13715" CDS 1815..2867 /locus_tag="DP116_13715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017803880.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13715" /translation="MGKYDKFFNSKKTLEESISPEEAVAAIAVVTAAADSSLEELDPD FLADILWGLEIFEEYSDDELLETLDKVVAIAEEAQIGALFNAAKNSLTDDLVLDAYAA GVSVLVDEEEVRIPKGKTTLLKKLQEALQINDEDAKEVIDEVIAAFEEIENEDFLEDE DETELEEDLNPNVYESPSGNFIVLIPVNSQQGGRVETQESSVSFSDDSGRLLRIDYFP ISSKQTEEMDSVGHQEYLRSFLLSKYVPQTIVANLPDAQVKHTEYLEDILEGAYFVLV DMPQGSTVTKTGNNGTATKLNAYRGVLAFSYGDLLYIVSSQQSFSDGEKPDSLEEEAE AIKQNVLSFVDTIEFL" gene complement(3028..3239) /locus_tag="DP116_13720" /pseudo CDS complement(3028..3239) /locus_tag="DP116_13720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007306958.1" /note="frameshifted; internal stop; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 3412..3888 /locus_tag="DP116_13725" /pseudo CDS 3412..3888 /locus_tag="DP116_13725" /inference="COORDINATES: protein motif:HMM:PF03235.12" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 3844..3853 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 3921..5108 /locus_tag="DP116_13730" CDS 3921..5108 /locus_tag="DP116_13730" /inference="COORDINATES: protein motif:HMM:PF03235.12,HMM:PF07510.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13730" /translation="MLLFKQISKRITEQGDFAEFIIEIEECIRDRLLSILIAVADEAN SYLIFETLNDRGLDLSVADLLKNYLFSRATDKLKDVQKKWEEINLLADRFELTKFIRH YWLSKYELVTEKNLYRKIAEKVRNSLQVFDFVSQLREAAEVYGAFENSQSHVWDSYDS GLKQDIERLSLFKVSQCYSVLIAAKESLPDELFPKVLRMIVILSFRYNVICSLNPNKL EAAYSKTSKYIQEQKPKSVKAIFEELKEFYPSDIHFKRAFAEKIIAASNAKLARYILS EINQHYMDSKELIANPNATELNLEHILPQTPSEKWLVEFPKTDYNQYIYRLGNMTLLD SSINRKVGNTSFQDKCATAFSTSQLEITKEIINYRVWSPKQVEERQSKMAEAACQIWR LGY" gene complement(5154..7681) /locus_tag="DP116_13735" /pseudo CDS complement(5154..7681) /locus_tag="DP116_13735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875046.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DNA helicase RecG" assembly_gap 7348..7357 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(7763..8782) /locus_tag="DP116_13740" CDS complement(7763..8782) /locus_tag="DP116_13740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875047.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="primosomal protein" /protein_id="PRJNA477356:DP116_13740" /translation="MAKAIEQIEREILALEEAIEAIAKELRNAYTNYLTALGQTVQRQ LILATYHLCTQGYPETFLSLSLNQRQQLQQAIRKLGQQTAQQLQDFIKTEQEQQKEAA EQHKEDEEIDEDEEDEETDKPSSPTPITLPTSPTSLTPLTTSPTPLTPSGPVTPSTLP PSSTLPPTLTSSTLFSLSSLQSVSSFPNTSNPVELVKWQQNIERVTQEKLKRVSREAN LRLQKAGILPKKLPEPILEAAAAAASEASGEVMPGPPNLLNLVIEIDNEQESEDSNLT QIMAINLRLGEIEFADPTLSSYRKQIHTVLLNLRRRGQEYQKQQRERSIAQAEAAWRA IWSED" gene complement(9084..10028) /locus_tag="DP116_13745" CDS complement(9084..10028) /locus_tag="DP116_13745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013193033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="elongation factor Ts" /protein_id="PRJNA477356:DP116_13745" /translation="MAEISAKIVQELRLKTGAGMMDCKKALKENDGDIEKAIEWLRQK GIAKADKGAGRIAAEGLVDTYIQPEGRVGVLIELNCQTDFVARNEDFKALVQNLAKQA TTAESVESLLAQPYIEDESVNVEDFIKQTSGKLGENIQLRRFTKFELAEDTKGVVDSY IHTGGRVGVLVELNNQNDSVANPSEYQTLGRNVAMQVAACPNVEYVNVNEIPAEVAQK EKDIEMGRDDLANKPQNIKEKIVQGRIEKRLQEMSLLDQPYIRDQSITVEELIKQNAS KLGDSIQVRRFVRYILGEGIEKQESNFAEEVAAQMGSK" gene complement(10425..11225) /gene="rpsB" /locus_tag="DP116_13750" CDS complement(10425..11225) /gene="rpsB" /locus_tag="DP116_13750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S2" /protein_id="PRJNA477356:DP116_13750" /translation="MAVVSLAQMMESGVHFGHQTRRWNPKMSPYIYTSRNGVHIIDLV QTAQLMEEAYTYMRSQAEAGKKFLFVGTKRQAAGIVAQEAARCGSHYINQRWLGGMLT NWATIKTRVDRLKDLERREENGALDLLPKKEASMLRREMTKLQKYLGGIKAMRKVPDV VVIVDQRREYNAVQECQKLSIPIVSMLDTNCDPDVVDIPIPANDDAIRSIKLIVGKLA DAIYEGRHGQLDVEDYEDYDGAEDDYDYEEGERDYSDLGIPNDEEEEQ" gene complement(11445..12533) /locus_tag="DP116_13755" CDS complement(11445..12533) /locus_tag="DP116_13755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_13755" /translation="MPRGHAARTVILNFELFSVVGETVFFSVVIPTYNRKPILEKCLR ALERQKLYDVETLHATSLQGDATSLQGDATSLQGHATSLEGYEIVLVDDGSTDGTLEW LEAHKDEFPHVRTFCQNHTGPAAARNLGVEKALGDTIIFIDSDLVVTENFLQAHESAL VQGKEKLGSDRFFTYGAVINTCNFDNPTSEPYKITDFSAAFFATGNVAIPKHWLEKAG LFDTRFQLYGWEDLELGVRLKNLGLKLIKCPDAVGYHWHPPFNLQQVPRLIDKEIQRG RMGVLFYQKHPTWEVRMMIQMTWLHRLLWGILSLNGILNEHTMAPFLQWLIDLGKPQL ALEAARIFLNWYNVKGVYQAYSQMQKAT" gene complement(12889..13533) /locus_tag="DP116_13760" CDS complement(12889..13533) /locus_tag="DP116_13760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319243.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13760" /translation="MPVALAFGGFGLNVQSASAQQTTYNTYEFTTNYKTSVEINPFLP EQNILRATITGENADAPYGLTKFTSNTYGQSEPRGANTFTRFNSDPTVFGIEGKVLGD IYYGDGSNKLFGLANDSAEINPIEGTIKGAGTITITSGTGIFQNVTGKIDFTEEDKLA PPGSPSIGNAILKFSLRTPRAVPEPTATPALVGLGILGAGFLLRKHRRKATFNR" gene 13838..14815 /locus_tag="DP116_13765" CDS 13838..14815 /locus_tag="DP116_13765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319242.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cation transporter" /protein_id="PRJNA477356:DP116_13765" /translation="MLLEQDCNCRCLDNSVKTNQNSRKTQLLSITVCLLAGFFVAEWS VGLWSGSLSLQADAEHILSDIAALGISLLASWLAQQPATARATFGHRRIEVMAALVNG LSLLVIAIFICWEAIHRFQSPQEISGLPMLAIAVLGLIVNLLNITLLHPHSHDDLNLR GALLHIIADTASSVGVMVAAVVIHLWNWLWADTAISLVVAGFMGLSALPLVRESLSIL LEYAPKSINPAEVEVFLKSFPKVLQVEKLYIWRITREQVMLCAHLSVDCATVEERDRL LGQLQTHLEQTFGVNQITLQLTKPKSLAAMPIHPLFKQDLISMLSLEKK" gene 14965..16170 /locus_tag="DP116_13770" CDS 14965..16170 /locus_tag="DP116_13770" /EC_number="6.3.4.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319241.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="argininosuccinate synthase" /protein_id="PRJNA477356:DP116_13770" /translation="MGRAKKVVLAYSGGVDTSVCIPYLKHEWGVEEVITLAADLGQGD ELEPVREKALKSGASESLVVDVKEDFVKDYAFRAIQANTLYENRYPLSTALARPLIAK ALVEAAEKYGADAIAHGCTGKGNDQVRFDVSVAALNPNLKILAPAREWGMSREETIAY GEQYGIPSPVKKKSPYSIDRNLLGRSIEAGPLEDPNVEPPEEIYLMTKAIADSPDKPE YVEIGFSRGIPTTLNGEFINPVDLIQQLNQVVGNHGVGRIDMVENRLVGIKSREIYET PALIVLIQAHRDLESLTLTADVTHYKRGIEETYSQLIYNGLWYSPLKAALDAFIQKTQ ERVSGTVRVKLFKGNATIVGRSSENSLYTPDLATYGAEDKFDHKAAEGFIYVWGLPTR IWSQHQNDR" gene 16400..18028 /locus_tag="DP116_13775" CDS 16400..18028 /locus_tag="DP116_13775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012406851.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dolichyl-phosphate-mannose--protein O-mannosyl transferase" /protein_id="PRJNA477356:DP116_13775" /translation="MNKKWFRIGIVGVFLLSLALRFWGLGRFNTFVFDEVYYAKFGNN YLTHTPFFDGHPPLGKYMIALGIWIGSHIPFWQDEVNGFTGSVMSPVAYRWMNAFSGS FIPIIVAAIAYQISYRRGFALLAGLFTACDGIFLVESRYALINQYIVIFGLLGQWFFL LALAKQRQQRRFWLILSGIAFGASAATKWNGLWFLLGTYLLWILAWGIRWWQSFSFVD NETSPQLQQEFISHPPQTRSGRKRSPKISSLSFLTLSNSGLFGTSTQKNFPENVSSES SNNNGQVNLTPLQKLTQLHLIHIIIYFGFLPLIVYSLIWIPHLLLNTSYGFLELHKQI LLFHERLGGNTPKVHPYCAAWYKWPLMTRPMAYFYQTAQSLKEPPPVMGPPLPAGAGK IIYDVHAMGNPFLWWFGVAALLFLVGTLVWRFLIPLVRQKRFSPPRTLSVDTWIALYI VINYIANLLPWVRVNRCVFIYHYMTGVVFAFLAIAWLVDQCLRSYHIPLRAVGVTITF IIVSAFIFWMPIYLGLPLSNFEYRPMRMWFNSWI" gene 18244..18573 /locus_tag="DP116_13780" CDS 18244..18573 /locus_tag="DP116_13780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="(2Fe-2S) ferredoxin domain-containing protein" /protein_id="PRJNA477356:DP116_13780" /translation="MENISCSSDSSVTSPVFPKCVRVCQNRTCRKQGAVKVLAAFEAL PTPEVTITGSTCLGQCGNGPMVLVLPDMVWYSGVHPSEVPLLVEQHLRGGQRVTKMLY HRFHPQG" gene 18645..18929 /locus_tag="DP116_13785" /pseudo CDS 18645..18929 /locus_tag="DP116_13785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310723.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(19704..20699) /locus_tag="DP116_13790" CDS complement(19704..20699) /locus_tag="DP116_13790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875105.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="folate-binding protein" /protein_id="PRJNA477356:DP116_13790" /translation="MSTSAIDTNNAAAIQAVQEGVAVCDRSFWGRIKVSDGDRLRFLH NQSTNDIQSLKPGQGCDTVFVNSTARTLDLVTAYILEDAVLLLVSPNRREFLMQWLDR YIFFADKVQLTDVTDETATISLIGSQSDAVVEKLGAAAIIGQAYGNHISIPSVSGGLI GVGSGLASPGYTLILPASSKQEVWSKILEAGAVPISDSNWQMLRILQGRPAPEQELTE EYNPLEAGLWQTISFNKGCYIGQETIARLNTYKGVKQHLYGIKLSGCAEPGSVITIGD EKVGKLTSYTKTLDGDFGLGYIRTKAGGAGLKVHVGDVEGEVVEIPFVSHDYPQF" BASE COUNT 5809 a 4428 c 4369 g 6225 t 20 others ORIGIN 1 cactttcagg aactaggacg tttttggtaa atttcgagcc gtcgtcataa accattttta 61 attctttact acccaatctt ttatcgacaa ttgggcgaaa atcttctttg agagttggtt 121 taaaaacgta gtattcgtct gggttcacgg aaccttgaac aacgttttct cctaaaccgt 181 atgctgcgct aatcagtgca gcatccttga aacctgtctc tgtttcaatg gagaacatca 241 ccccagaaga tgctaaatca gaacgtacca ttttttgcac gccaacagcg agggctacgc 301 tgaagtggtc aaatccctta atatggcggt aggaaatggc acggtctgta aacagggaag 361 caaaacattt gtgacaagcg gctaaaactc cttctatacc agttacattg aggtaacttt 421 cttgctgtcc ggcaaaactg gcgtcgggga gatcttcagc ggtagcactt gagcgcactg 
481 ctacatctgt atctggattg tatcgctcac ataaagcaag gtacgcgtta acaattgcct 541 ctcttaattc ttttggaaac ggtgtgtgta gcaggagcga tcgcgccttt ttccctcgtt 601 ctcgtaaatt tttgacatct tctacatcta agtcagaaaa gagttcgcgc agtttttctt 661 ctagcccagc accttcaatg aaatagcgat aagcgtaagc tgtggtagca aacccgatgg 721 gaacattaat gcctttgggt gtcaattgtt gaatcatctc acccaaagat gcattcttac 781 caccaactat gggtatgtcg gtaatgccaa cctgatcaaa ccacaggacg agcgatcgct 841 ctttggagga tgattctaat aacgtatctt ttgctactgt aaccataaat atttacctcc 901 ctttttgatg ccctggttgt tcgaggattg cttgcctcta cgtaaggtat accacattaa 961 attgtaaacg tttttatact ttcttaaggt tttccagaag gaattttaga ttttggactg 1021 acaaatcatc caaaatctaa gttctctcat gagtaagagt gcatttgcac tcttctagag 1081 taaaaatata gattttgctt ggttaaatcc acattggcga ttgatatata ctaacttgtt 1141 ttcatttgcc aacatgatag aaatgacact gtgtatgact gcaacaggat aagacatcta 1201 tgactactct ttaagattca acgattccgg aattttgtgt gttattgctg gagctatatt 1261 tgtaccgcac ggtgtttcat ttcaaatgaa aattctagta aatttgtcaa gctagctgtt 1321 aaattgattg gcaagaatgg cagacagact ctaattctat ctttactaaa ttcgtttata 1381 atccgtcgtt ttctgtaatc tgcttgtcac ttcagtcaag cacccgtaaa accgtgattt 1441 aaccatattt taagtcagaa ttgacacatt aaaccgtgtc aaatcgcctc ataaaaaggt 1501 ttgagaggag atgctttgag aatccttagg attattgaaa gttattctca taaattagca 1561 agcaaattac aaccaagtta tactgattaa acattctctg gctagctgta ttgcttacga 1621 tacactgaag cgcctctgta ctgtaaatca gccatgtttg agcattgttc tcctcgattc 1681 atacagcgtt tgagtacacg tcagaaaaaa cctttttgtg aagatagaac aacaggagac 1741 tcgctgattg ctaaactacc cattgatact agatttgtgt tatcactagc gagccatttt 1801 aggaaggagc agtcgtgggt aagtacgaca agttttttaa ctcaaaaaaa acattagaag 1861 aatcaatcag tccagaagaa gcagttgcgg cgatcgcagt tgtcacagct gctgctgatt 1921 cttcattaga agaactggat ccagattttt tagcggatat cctctgggga ctggagattt 1981 ttgaggaata ctcagatgat gaattattag aaacgctgga taaagtcgta gccattgcag 2041 aggaagctca gataggagca ctatttaatg ctgcaaaaaa ttctctgaca gacgacttag 2101 tactggatgc ttatgctgcg ggagtcagcg tacttgtaga cgaagaggag gtacgtattc 2161 ccaaaggaaa 
aacgactttg ctcaaaaagc tccaagaagc cttgcaaatt aacgatgaag 2221 atgctaaaga agtgatagac gaggtgatcg ccgcctttga ggagatagaa aatgaagact 2281 tcctagaaga cgaggatgag acagaacttg aggaagactt aaacccaaac gtgtatgaat 2341 caccctcagg taattttata gttctaattc ctgtgaactc tcaacaaggt ggtagagttg 2401 aaactcaaga aagctcagtc agcttttctg atgactctgg cagattgttg agaatcgatt 2461 atttccctat ctcctcaaaa cagactgagg aaatggattc tgtgggacac caagaatatc 2521 tgcgttcatt tctcctaagt aagtatgtgc ctcaaacaat agttgctaat ttaccagatg 2581 ctcaagttaa acatactgaa tatctggaag acatattaga aggggcttac ttcgtattag 2641 ttgatatgcc ccaaggttca acagttacga aaacaggaaa caacggtact gcaactaagt 2701 tgaatgcata ccgtggtgtt ctagcattta gctatggtga ccttttgtat atcgtcagca 2761 gtcagcagag cttttctgat ggagaaaaac cagattctct tgaagaagaa gctgaagcta 2821 tcaagcagaa tgttttaagt ttcgttgata ctatcgagtt tctttagggt taacgagctt 2881 ttaccaattt tcactacagg ggtagacgcg tgatggtttg ttatgagaca ccacacctgt 2941 tctatccctg ttgtttatca gttatcagtt accagttatc agttgttcat tatttactgt 3001 ttactgttca ctgttcactg ttcactaaat catttttgtc cgtgctggat caagtcgcaa 3061 agtcgcgatc tgcgcttcgc agcagcgctt cgctatcgca gccgcaaaca ttccgacttg 3121 actctagggc gttcttctgg agctatgttc attttagcca tgagcgcact ggctacggga 3181 ttttcttgtc gcaagaaatt ccgccaattt aaacgattta gctggagtac tacgtagttg 3241 agttaagaat aatgctagct taggttaagc aaagcgcaat tcaacaaaac ccaagatgaa 3301 cattgggtaa tgtccacagg acatactaca tgaatgttct tcaacttaac atacatagtt 3361 ttctattttt agctctaaca gaaccgtatt gaaatacaag gaaattttat aatgcctact 3421 accagtaata ttgatagtcg attgatgagt tttggtgaat tgcttgcagg taataattct 3481 tactctgtac ccagctttca aagggattat tcttggacag aagcagaagt tcaacagctt 3541 tgggatgata tcacacagac gttggaagag ggacgtacag aacattttat tggctcagtt 3601 gttgtgaaca actccaacaa gcctaagctg atgttgatag atggacaaca acgtttaaca 3661 acagtatcaa tattgatgtg tgtacttagg gatatagcca aggaaaaagg agataatcag 3721 ttagcccaaa ctatatcgca ggagtattta ggttcattaa atctgcgaac tcgtaaaaca 3781 gaatctaaat tagtcctcaa tgaaagaaat aatcagtttt atcaagaaaa tattgttgaa 3841 tcannnnnnn nnnagattta 
caaaatattt ctaaaaagag aaatctagaa aaatcgaata 3901 aattacttat tgatgcatac ttgttacttt tcaagcaaat tagtaagaga atcacagagc 3961 aaggagattt tgcagaattt atcatcgaaa ttgaagagtg tattagggat agattactgt 4021 caattttaat agcagttgca gatgaagcta actcctactt aatctttgaa actttaaatg 4081 atagaggtct ggatttgtca gttgctgacc tactgaaaaa ctatttattt tcacgagcaa 4141 ctgataaatt aaaagatgta caaaaaaagt gggaagaaat caatctttta gcagatagat 4201 ttgagttaac caagtttatc cgtcactact ggctatccaa atatgaactt gtgactgaaa 4261 agaatttata tcgtaagata gcagagaaag tgcgtaattc acttcaggtt tttgattttg 4321 ttagccagtt gcgggaagct gctgaagttt atggtgcgtt tgaaaactcc caaagccatg 4381 tttgggattc atatgattca ggcttgaaac aggatattga gcggcttagt cttttcaaag 4441 taagtcaatg ctattcagtt ttaattgcag caaaagaaag cttaccagat gagctttttc 4501 caaaagttct aaggatgata gttattcttt cgtttcggta taacgttatc tgtagtttga 4561 atcctaacaa gctagaagct gcatacagta agactagtaa atatattcaa gaacaaaagc 4621 caaaatctgt gaaagctatc ttcgaggaac taaaagaatt ttaccctagc gatatacatt 4681 tcaaaagagc attcgcagag aagatcatag cagcaagcaa tgcaaagcta gcacgataca 4741 ttttaagcga gattaatcag cattacatgg acagtaagga attgattgca aacccaaacg 4801 caacagagtt gaatttagag catatattac ctcaaactcc aagtgaaaaa tggcttgtag 4861 agtttcctaa aactgattac aatcaataca tttatagatt aggtaacatg actttgttag 4921 attcgtctat caatcgaaaa gtaggcaata cttcatttca agacaaatgt gctactgcct 4981 tttctacatc tcaattggaa attacaaaag aaattataaa ttaccgtgtt tggagtccaa 5041 aacaagttga agaaaggcaa agcaaaatgg ctgaggcggc ttgtcaaatt tggcgtttgg 5101 gctactagtt ttgcttagta aaaaagcctt tatgtatatt ttgttgagag tatttacgtc 5161 aaaatcgctc cacccatcaa ccgctcatac cgatacttca actcttcttt catcaaatac 5221 caacgttcca aagttgcatc tatctctatc accttttccg ctgcttgtcg cgccaaaatt 5281 aacacttcct catcttccac taaactcgcc aaggtaaaat ctggcacacc tgattgacga 5341 gtccccaaaa cttgcccagg accacgaaag cgcatatcca tttctgagat gaaaaagcca 5401 tcctgagatt gttccaaaac cctcagacgt tgctgcgcat ctgaactttt ggaactactc 5461 agtaaaagac aataggactt agctgcacct cgaccaacac gtccccggag ttggtgcaac 5521 tgagataagc caaatcgctc tgcattttca 
atgagcataa ctgttgcatt aggcacatct 5581 acacccacct ccacaacggt ggtggaaacc aaaatttgag tttcgttatc gcggaatttg 5641 gtaattgctt cgtccttttc cgctgaactc atacgaccat gaagcagccc cacctgaaac 5701 tctggaaaga tactttcctg taacttttta tgctcttcta ctgccgatcg caaatccaac 5761 ttttctgatt cttccaccaa tggcaaaact acataagctt gtcgtccttg ggcaatttcc 5821 cggcgaataa gctcataagc atggctacgt tgctgaccca tgagcacagt tgtctgaatc 5881 ttttgccgtc ctggtggtaa ctcatcaatc tgactcacat ccaaatcccc gtgtattgtc 5941 aatgccaaag ttcgaggaat cggagttgct gtcatagtta atacatgggg ttgctcgcct 6001 ttttgctgca aacgcgctcg ttgttccact ccaaaacggt gttgctcgtc aataacaacc 6061 aaacccaatc gctgaaagtt tacagggtct tgaattaagg catgagttcc caccaaaagc 6121 ggcaattcac ccgtttctaa ctgagcgtga atttgtcgtc tttttgcaag tttcgtagaa 6181 ccagtcagta attccactgg taaatgcagg aggttaaacc aactcactaa cttgcgataa 6241 tgctgttctg ccaaaacctc tgtgggagcc atcagtgctg cttgataccc agactgaaga 6301 gcagcaagga tagcaactac agcaacaaca gttttaccag aaccgacatc cccttgcacc 6361 agacgattca ttggtgcagg tttttccaag tcattgagaa tttcgttaat gactcgttgc 6421 tgtgcgttag tcagtttaaa cggcaaaatt tcggaaaatt tttccagtaa ttgacctttt 6481 ggggcaagaa tcgcactggt ttgagtttcc ttggctttct gttgacgttg gagtaaccca 6541 agttgcaggt agaaaaattc atcgaaaacg aggcggcgac gggcagcttg gaggttatcg 6601 ctgtctgggg gaaaatggat attatgaatt gcttccttca attccatcaa attatactta 6661 tctcgcagac cgctaggcag tgtatctttg agatgaacag tagaaggtaa agcagcaata 6721 actgcctgtc gtaccaaatc cgcccccact ccctctgtca gcggataaac agggactatc 6781 ctaccaatca ttaaagactc aattgtatct cctggatgtc ccaaaacctc cagttctggg 6841 tcttccaacg ttagaccgta tttactttct ttcaccaacc cgcatgctgc tacaatacta 6901 ccattcgggt agcgacgctt gaaactttcc tgccaagcac gactggtata gcgtgtacca 6961 gcaaaaaatc ggctgatttt gatctgaccc gtctgatcac gtagcagcac ttctaaaatc 7021 gtcaatttgt tgtttttggg agttgtaaag cagttacacc gcttcaccgt tgccactatt 7081 gtcaccgtct cccccgccgt caactcctga atatttacct gacgagcata gtcaatatgg 7141 tcacggggat aataggaaag taagtcacgc acagtgtata aaccaagacg tgctaaatta 7201 gcagattttc tgatgcctat ttctggtaag tcactcaact 
tttggtctag attgggagca 7261 aggttccgac tgacttcagc aacaattgga gattttgaga gtggagagta gccagaagaa 7321 ttttcttttg ctctccgact cccctgcnnn nnnnnnnccc ctgcttttct gcttcccctg 7381 cttccgaagc ttcccctccc tcttcaacca cttgctgaac ttggtagaga tacctacgag 7441 tctctgcgac taagtgttgt ctatcttcca gttccagatt gggataatta gcaaattcag 7501 ctgccaattc ttgccagcga cgacgttcaa taccaggtaa gcctatgggg aatttgccaa 7561 aagtcagggt aagaaactca ctgaagcggt attgtttacc cagcaaatca gcaaagcctt 7621 tttctgcttc tattgccaaa gctttctgta atcgtatcca atctggtttg tcattagtca 7681 ttagtcatta gtcattagtc attaatcatt agtcattagt cattagtcat tagtcatttg 7741 acttcggaca aagaacagat cactaatcct cagaccaaat agcacgccat gctgcttcag 7801 cttgagcaat tgaacgttcc cgctgttgtt tttgatactc ttgccctcgt ctacggagat 7861 tcagtaagac agtgtgaatc tgcttgcgat aagacgagag tgtcggatca gcaaactcaa 7921 tttccccaag ccgcagatta atagccatga tttgtgtcaa gttagagtct tctgattcct 7981 gctcattgtc aatttcaatc accaagttta ataggttggg gggtcctggc atcacttcac 8041 cagatgcttc tgatgcagcg gctgcagctg cttctaaaat cggttctggt aatttcttag 8101 gtaagatccc tgctttttgt aagcgaagat tagcctcacg agaaactctt ttgagcttct 8161 cttgtgtcac tctctctata ttctgttgcc attttactag ttctacaggg ttagaggtgt 8221 ttggaaaaga ggacacgctt tgcagcgagg ataaagagaa taaagttgag gaggtaagcg 8281 ttggtgggag agttgaggaa ggtgggagag ttgagggagt taccggacct gagggagtta 8341 gaggagtggg tgaagtcgtt aggggagtta gggaagtggg agaagtgggt agagttatgg 8401 gagtaggaga agagggttta tctgtttcct catcttcctc atcctcatca atttcctcgt 8461 cttccttgtg ttgttccgct gcttccttct gctgctcttg ttcagtcttg ataaaatctt 8521 gcaactgttg ggctgtttgt tgacccaact tacgaatggc ttgttgcaat tgttgccgct 8581 gattcaatga caaactcaaa aaggtttcag gatacccttg ggtacacagg tgataagtcg 8641 ccagaatcag ctgcctctgt acggtttgcc ccaaggcagt tagataattt gtgtaggcgt 8701 ttcggagttc ttttgcgatc gcctctatcg cttcctctaa tgctaaaatc tcccgttcaa 8761 tttgctcgat tgctttcgcc atatcttctt gctatcatgc atctggagct ttttatggta 8821 caaatgtatt tatggatttt caaataacca cccaaaaaat tttggaattt taggcttttg 8881 gatattgaat tttggataga gactttaatc tatagtaacg ttcgctcaat 
agtagtcatt 8941 tttcttgact ttaatcatcc agtagaaaac ctgaaattta gctcagtttg aatgaaaaac 9001 aggtcagcta agcttgcttg acctgtcttg gaatcatctt gtgtcccaag tctattgtcc 9061 agaactaatg acaaacgact caattacttg ctgcccattt gtgcagcgac ttcttcagca 9121 aagttacttt cttgcttttc aatgccctcg cccaggatat agcgaacgaa gcgacgaact 9181 tggatactat cgcccagttt tgaggcgttt tgcttaatca attcttctac tgtgatactc 9241 tgatcgcgaa tataaggttg atctagcaaa ctcatttctt gtagacgttt ttcaattcgt 9301 ccttgaacaa tcttttcttt gatgttctgg ggcttgtttg ccaagtcatc ccgccccatt 9361 tcaatatcct tttctttttg ggcaacctca gcgggaattt cgtttacgtt cacgtactca 9421 acgtttgggc aagccgcaac ttgcatggct acattccgtc ccagagtttg atactctgaa 9481 ggattagcca ctgaatcatt ctgattgttc agttcgacta acacaccaac tcgaccgcca 9541 gtgtggatgt agctgtctac tacccctttt gtgtcttctg ccagttcaaa tttggtaaag 9601 cgacgcagtt ggatattttc accaagttta ccactcgttt gcttgatgaa atcttctacg 9661 ttaacacttt catcttcaat gtaaggttga gccagcaaag attcaacact ttcagcagtg 9721 gttgcttgct tcgccagatt ttgaacgagg gctttaaaat cttcgttacg ggcaacaaaa 9781 tcagtttggc agttgagttc tatgagcaca ccaacccgac cttctggttg aatgtatgta 9841 tctaccagac cctctgccgc aatacgtcct gcgcccttat cggctttagc gatgcccttt 9901 tgtcgcagcc actcgatggc tttttctatg tcaccatcat tttcttttag tgcctttttg 9961 cagtccatca tgccggcacc tgttttcaga cgtagctctt ggacgatttt tgcagatatt 10021 tccgccatgt tgcctcaatt cctacttcac tgacagtttt aattgctctc aaataaaatt 10081 ttgagttctg agttttaaga aagtcctgag taatgagttc tgagtactaa gtcaaaggaa 10141 ctcaccctcc gggtatgcct gccttcgcct gacggcttta gccctctggg cgtgcgcttt 10201 gcgcatacgg cagtttgctt atgccgggga accctttcag cagtcttccc tcgtcgggaa 10261 accctaccaa gagcgctgct tcacccccgc actgcctcag catttagcac taacttagga 10321 ttttacctgc ggctagagac tcatgaatag aaagtaccac taagtaccga gtactactat 10381 cttactcggc actcggcact taacactcaa gactcagcag gcaattattg ttcttcttct 10441 tcgtcgttgg gaatccccaa gtcagagtaa tcgcgttcgc cttcctcgta gtcgtagtcg 10501 tcctctgcgc cgtcgtaatc ttcgtaatct tctacatcca attgaccgtg acgaccttca 10561 taaatggcgt ccgccaattt gcctactatc aacttaattg agcggatggc 
gtcatcattt 10621 gctgggatgg gaatatctac gacatctggg tcacagttag tatccagcat ggacacaatg 10681 ggaattgaca acttttggca ttcttgcact gcgttatact cccgccgttg gtctacaatc 10741 acgaccacat cgggaacttt ccgcatagct ttaatcccac ccaggtattt ctgaagcttc 10801 gtcatttccc gacgtaacat tgaggcttct ttttttggca gcaaatcgag agcgccattt 10861 tcttcccgac gttccaaatc tttgaggcga tctacgcggg ttttgatagt tgcccagtta 10921 gtgagcattc cacccaacca acgctggtta atgtagtgag aaccacaacg ggcggcttct 10981 tgggcaacga ttcctgctgc ttgccgtttg gtaccaacaa acaagaattt cttgcccgcc 11041 tcagcttgtg acctcatgta tgtataagct tcttccatca actgggcagt ttgcaccaag 11101 tcgatgatgt gtactccatt acgagatgtg taaatgtatg gagacatttt tggattccat 11161 ctacgggttt gatgcccaaa gtgaacccca gactccatca tttgagccaa tgaaacaaca 11221 gccatgtttt ttactcctat tcgggttaaa cctccattca ggcgtatttc caaaatcagg 11281 aaacacccga cgtactcctg aatgtgcgag attaatgaac catactaagg taacatactt 11341 gtggcagcta ttggggaatt aaaaatgttg agaaaataca acatttaaaa gaggaaatta 11401 ctacacctgt acaagcacaa agccgatgcc cccagagttt cttttcaggt tgccttttgc 11461 atttgggagt aagcttgata aacgcccttg acgttatacc agttaagaaa aattcgagca 11521 gcttctaaag ctaactgtgg ttttcctaaa tcaattagcc attgcaaaaa tggtgccatt 11581 gtatgttcgt tgaggatgcc attcaatgat agaattcccc aaagtaagcg atgcaaccaa 11641 gtcatttgaa tcatcattcg cacttcccaa gtaggatgct tttgataaaa caacactccc 11701 atacgtccac gctgaatttc tttgtcaatc aagcgaggta cttgttgtaa gttgaagggt 11761 ggatgccagt gatagccgac agcatctgga catttgataa gttttaaccc aagatttttg 11821 agtctgacac ctagttctaa atcttcccaa ccatagagtt gaaaacgagt gtcaaaaagt 11881 ccagctttct ctaaccaatg cttggggatg gctacatttc ctgtggcaaa aaaagctgcg 11941 gagaaatctg ttattttgta tggctcagaa gtggggttgt caaaattaca agtattaatg 12001 actgcaccat aggtaaagaa gcgatcgctc cccaattttt cctttccttg caccagcgct 12061 gactcatggg cttgcaggaa attctccgtc acaactaaat cactatcaat aaaaataata 12121 gtgtccccta atgccttttc tactcccaaa ttacgcgctg ctgctggtcc agtatggttt 12181 tgacaaaacg ttctcacatg gggaaactcg tctttgtggg cttctaacca ctccaatgtg 12241 ccatcagtgg aaccatcatc caccaagaca 
atctcgtaac cttctagaga cgttgcatgc 12301 ccctgtagag acgttgcatc cccttgtaga gacgttgcat ccccttgtag agacgttgca 12361 tgcaacgtct ctacatcata aagcttctgt ctctccaaag ctctgaggca cttttctaaa 12421 attggcttgc gattataagt tggaataaca acgctaaaaa acacagtctc acccacaaca 12481 ctaaacaatt caaaattcaa aatcaccgtt cgcgcagcgt gccctctggg catagatttg 12541 aatctccttc accagacgct gcgcgtaggg aacagcacga cgccttacgg ctaacgccca 12601 aaccgcgtgc gcttgcgctt acggaggaaa cctccctccg ggttcgccag tcccccaaac 12661 tcctgcaaag acgctacgcg aacgtgacgg aaaccgccaa gacgggggct ggactcaccg 12721 ctcaaacttc tctcaaaatt caaaattcaa aataaaaaaa cgagcgtgct agtctacctt 12781 caggatagga cactaaggac tacagcactg ttccctaaag cgaaaccacc ccagtccttt 12841 tcggtttaga ggtggttgaa ataagtgttg tggtacaaca agcagttttt atctgttgaa 12901 agtcgctttc cgacggtgtt tgcgtagcag aaaacctgcc ccaagtatac ccaagccaac 12961 taatgctgga gtggctgttg gttctggaac cgcgcgagga gtccgcaagg agaatttcag 13021 gatagcgtta cctatagaag gacttcctgg gggagcaagt ttgtcctcct cagtaaaatc 13081 tattttgcct gtgacgtttt ggaatatacc tgttccacta gtgattgtta tagtaccagc 13141 acccttaatt gtgccttcaa tcgggttaat ttcagcgctg tcatttgcta atccaaataa 13201 cttattggaa ccatcaccgt agtatatgtc gccaagtact ttgccttcta tgccaaatac 13261 ggttggatct gaattgaatc tggtaaatgt gttggctcca cgaggctcag actgtccata 13321 agtattacta gtaaatttag tcaatccgta aggagcatca gcattttctc ctgtgatagt 13381 tgccctgaga atgttctgtt ctggtaggaa cgggttaatc tcgactgatg tcttatagtt 13441 agtagtaaat tcataggtat tataggtagt ctgctgtgcg ctcgcgcttt gcacatttaa 13501 cccaaaaccg ccgaaagcta aagctactgg tataaaccat ttgatgcgtg aacggagcat 13561 aatttaccct atattctatg ttaagaagac actgagtata aacaaataat ttgcctaact 13621 taattttacg taataatgag gaaaaacgta ataaatttgt tataaagcag tcgtaaagtc 13681 attgtgaatt ttagttaaac gaataaaagt actaattaat acacagtagg aaacgtactg 13741 caatagaaaa cgtcagttgt ctagcaatgc ccaccgtaag ctaatttgtg ggcattattc 13801 atgctaaaaa tgtaatttac atagggttac ttggatcatg cttttagagc aagattgcaa 13861 ctgtaggtgt ttagataatt cggtcaaaac aaatcaaaac agtcgaaaaa ctcagctttt 13921 atcaatcact 
gtttgtttac ttgctgggtt ttttgttgct gagtggagtg tcggtttgtg 13981 gagcggaagt ttatctctac aggcagatgc ggaacacatt ctttctgata ttgcagcttt 14041 gggaatatcg ttgttagcta gttggttagc acagcaacct gctactgcga gggcaacatt 14101 tggacatcgc cggattgaag ttatggcggc tttggtaaat ggattaagtt tgcttgtgat 14161 tgctattttt atttgttggg aagcaattca ccgctttcaa agtccacaag aaatttcagg 14221 cttgcccatg ttagcgatcg ccgtgttagg tttaatcgtc aatttgctca acataacttt 14281 gctgcatccc cactcacatg acgacttaaa tctacgaggt gctttgcttc atatcatcgc 14341 tgatactgct agttctgtgg gtgtgatggt tgctgctgtt gtgattcacc tttggaattg 14401 gttgtgggca gatacagcta ttagcttagt agttgctggc tttatgggtt taagtgcttt 14461 gccgcttgtc cgagaaagtc tctcaattct tttggagtac gcaccaaaat caattaaccc 14521 tgctgaggtg gaagtttttc tgaaatcttt tccgaaggtt ttgcaggtgg aaaaactcta 14581 catttggaga ataaccagag aacaggtgat gctttgcgct catctgagtg tggattgtgc 14641 gactgttgaa gaacgcgatc gcctactcgg gcaattacaa actcatctgg agcaaacttt 14701 tggcgtcaat cagataactt tacaacttac taagcctaag tctttggcag caatgccaat 14761 tcatcctttg ttcaagcaag atttaatttc aatgctatcc ctagagaaga aataataaca 14821 agaaaaaatt tccaataatc ggtcatgtaa tcaaaaatgt tgagaatatg aacagattat 14881 gattaaatcc ttgagttgaa aaatacaccg aaataggtta gataatagtg actcattgtt 14941 gagttttgct aactggagag ataaatgggt cgcgcaaaga aagtagtttt ggcttattct 15001 ggtggtgtag atacctctgt gtgcattcct tacctcaagc atgaatgggg tgtggaagaa 15061 gtgatcactc tagcagcaga tttaggacag ggggatgaac tagaacctgt cagagaaaaa 15121 gctttgaaat caggtgccag tgaatcgctg gtagtggatg tcaaagagga ctttgtgaaa 15181 gactacgctt ttcgcgcaat tcaggcaaat actctttatg aaaatcgcta tcctctaagt 15241 actgcccttg ctcgtccgtt gattgccaaa gctttagtag aagcggcaga aaaatacggt 15301 gcggatgcga tcgctcacgg atgcactggt aaaggaaatg accaagtgcg ctttgatgtt 15361 tccgtggctg cgcttaaccc caatctcaaa atacttgccc cagcgcgaga atggggaatg 15421 agtcgtgagg aaaccatcgc ctacggggaa cagtatggca ttccctcgcc agtgaagaaa 15481 aagtctccct acagcattga tcgtaacttg cttggtcgga gtattgaagc aggtcctctg 15541 gaagatccca atgtggaacc accagaagaa atttatttga tgacgaaggc gatcgccgac 
15601 agtcccgaca aaccagagta cgttgaaatt ggattttcca gaggaattcc taccaccctt 15661 aatggcgaat tcatcaaccc agttgatttg attcaacaac tgaaccaggt ggttggaaat 15721 cacggtgttg ggcgcatcga catggtagaa aaccgcttgg taggtatcaa atcgcgggaa 15781 atttatgaaa ctcccgcttt gatcgtgtta attcaggcac accgcgattt agaaagcctg 15841 actttgacag cagatgtcac ccactataag cgtggaattg aggagactta cagtcaatta 15901 atttacaacg gtctgtggta tagtcccttg aaagctgcac tggatgcctt tattcaaaag 15961 acacaagaac gagtgtctgg aactgtgcgg gtgaaattat ttaaaggcaa cgctaccata 16021 gttgggcgta gttccgaaaa ttctctttac actcccgact tagcaaccta tggtgctgaa 16081 gataaatttg accataaggc agcagaaggt tttatttacg tttggggatt accaacgcgc 16141 atttggtcac aacaccaaaa cgatagataa attatggttt gtaggggttg tatgcccctt 16201 gattaaacct gtaaaaatcc gggcgagagt tcaaccctcg cccggatttt tctagatttg 16261 acccaactcc cttatggagt gcaactggcg tgcaagtctg gagttgattt atctgtgtta 16321 atctgtgtta atctgtggtt ccaaatattg gaatatcact tttgcaaaag gttgactatt 16381 gactaagaac tagtgaccaa tgaataaaaa gtggtttcgg attggcatcg tgggtgtatt 16441 cttgctgtca ctcgccttgc gattttgggg attgggaaga tttaacacct ttgtctttga 16501 cgaagtttat tacgctaaat ttggtaataa ttaccttacc catacaccat tttttgatgg 16561 tcatccacca ctaggtaagt atatgattgc actggggatt tggattggta gccatattcc 16621 cttttggcaa gatgaggtga atgggtttac tggttcagtc atgtcacccg tcgcttaccg 16681 ttggatgaat gccttttctg gttcatttat ccctataata gtcgccgcca ttgcctatca 16741 gataagttat cgtcgtggtt ttgctttact tgctggtttg ttcaccgctt gtgatggcat 16801 atttcttgta gaatctcgct atgccctcat caaccagtat attgtcatct ttggattgct 16861 gggacagtgg tttttcttat tggcattagc aaaacaaaga caacaacgca ggttttggtt 16921 aatactttct ggcatcgctt ttggtgcatc agctgctacc aagtggaatg gtttgtggtt 16981 tttgctaggg acttacctcc tctggatatt agcttgggga attcgctggt ggcaatcttt 17041 cagctttgtt gacaacgaaa cttcgcctca acttcaacaa gaatttattt ctcatcctcc 17101 ccagaccaga agtggaagaa aacgcagtcc aaaaatttct tcattgtctt ttttaaccct 17161 gtctaactct ggcttatttg ggacaagtac acagaaaaat tttccggaaa atgtcagttc 17221 ggaatcttcc aataataatg gacaggtcaa tctaacgcca 
ttgcaaaaat taactcaact 17281 tcatctaatt cacattatta tttatttcgg ttttctccct ttaatagttt acagtttaat 17341 ttggatacct catttactct taaatacaag ctatggattt ctagaactac acaagcaaat 17401 tttactgttt catgaacgtc ttggtggtaa tactcctaag gtgcatcctt actgtgctgc 17461 ttggtataag tggcccttga tgactcgacc aatggcgtat ttttaccaaa cggcacaaag 17521 tcttaaggaa ccaccacctg ttatgggacc tcctttgcca gcaggtgcgg ggaaaatcat 17581 ttacgatgtc catgcaatgg gcaatccgtt tttatggtgg tttggcgttg cggcgctttt 17641 atttttagtt ggaacgctgg tgtggcgatt cctcattcct ttggtcaggc aaaaacgctt 17701 ttctcctcca aggacattga gtgttgatac ttggattgcc ttatatatag tgatcaatta 17761 tattgctaat ttactgcctt gggtaagagt aaatcgttgc gtttttatct accactatat 17821 gacgggtgtg gtgtttgcat ttttagcgat cgcttggctt gtggatcagt gcttacgtag 17881 ttatcatatt ccactccgcg ctgttggtgt gactattacc tttattattg tgagtgcttt 17941 tattttttgg atgcctattt atttaggttt acctctttca aattttgaat atagaccaat 18001 gcggatgtgg tttaattctt ggatttagaa ttagaaattt aacaacaact cagtgaaagc 18061 cacagatccc cgactccttt aaaagttgtc ggtgatcaac tttgctcaca aatgatttaa 18121 gatcactccc caaaaagtgg acgctatagt gcggaattca acccgcgatc aagagtgttg 18181 accacatctt tttgatgcct acttgtatca acattaaagt gaaagggtat aagaaggtaa 18241 caaatggaaa atatatcttg ctcatctgac tcttcagtaa cgtcacctgt ttttcctaag 18301 tgcgtacgtg tttgccaaaa tcgcacttgt cgcaagcaag gtgcagtcaa ggtgttagcg 18361 gcttttgaag cgttaccaac tcctgaagta actatcactg gtagtacctg tttgggacaa 18421 tgcggtaatg gaccaatggt gttagtgcta cctgatatgg tttggtacag tggtgttcac 18481 ccaagtgaag tacctctact ggtagaacag cacttacgcg gtggtcaaag agtgacaaag 18541 atgctctatc atcggtttca tccgcaggga taatgttgat ttttaacaat ttatagcgat 18601 tatagagatc tgtacttttt ttgaccgaac caggacatct cattatgaat attcagcagc 18661 tacgccagtc tttgaaactt aagtggataa gttactatta caagaatcgt tcgtggttgg 18721 gaaaaatgcg agtttgggga acttacgatg ctcaacgtcg cccttcctct ggttttattt 18781 tggcagcttt gtctgcttta gagccacagc ttgaagaagt tttgcctttt atttgcgaac 18841 tcaacaacga tccagatcag attattgtag ctttgggtct taactttaat cctgaggagc 18901 agttacactt agtagaatcg 
gctgattctg tagcccaaag cgaggtgaac tgtgagtctg 18961 ttctccagac atttcccaac catcagtctg taactcccaa tatagttacc acagcaaaag 19021 aagaaaaaga aataaaagaa aataacagca aacctgtacc atctgttgca gtaactaccg 19081 cagttccttt tcaaacactc agcgagtgta aatatttgac ttttgctgca actgctaatc 19141 aggagaaaag tttcagtcaa tcattacaat ctcttgaagt cgctagcatt cagaaggaaa 19201 gcaactctat ggctttgata acggctgcta gcactttggt ggaaagcaaa agccagttta 19261 tacctttggt cgcattttca aaaaccgaaa ttgaaagcaa aagtcaatcc gcgccatctg 19321 ttgacgttcc tacacggatg gaaagtaaaa gcaaacccag atcattgctt gctattgctt 19381 ctgatggaga aagtaaaagt agacccaggt catcgactgc tgttgcttct gatggggaaa 19441 gtaaaagtaa atctgtgtca tcgactgctg tcattcctag ggaaatggaa agcaaagctg 19501 tttcatctgg tgtttcattt attgccctgg ctaacaaggt ggaaagtaaa agtaggcttg 19561 tgacaactcc acacaaaaat cttgctcatc aagttaaccc accacctgct aggactactc 19621 gtttagccaa ttggatagat gactcttgcc aaggtgtcgg atgggataga gattaaacgc 19681 gtttcgttcc atgaaatgac tgatcaaaac tgcgggtagt cgtgagaaac gaacgggatt 19741 tctacgactt cgccctcaac gtctcctaca tggactttta aacctgcacc accagctttg 19801 gtgcggatgt agccaagccc aaagtcacca tcaagggttt tggtgtaact cgtcagtttg 19861 ccaacttttt catcgccaat ggtaatcaca cttcctggtt cagcacaacc acttaattta 19921 atcccatata agtgctgttt aacaccttta taggtgttga gtcgggcgat ggtttcttgt 19981 ccgatataac atcccttatt aaaggaaata gtctgccata aacccgcttc caagggattg 20041 tattcttcgg tgagttcctg ctctggggct ggacgtcctt gtaatattcg taacatttgc 20101 caattagaat cgctgattgg aactgcgcca gcttccagaa ttttgctcca aacttcttgt 20161 ttactagaag caggcaaaat cagagtatag ccaggggaag ccaaaccact accaacgcct 20221 atcaaacccc cactaactga ggggatggaa atgtgattac cataagcttg accgataatt 20281 gctgcagccc ccagcttttc tacaacagca tcactttgtg atccaataag gctaatggtg 20341 gcagtttcgt cagtgacatc cgtgagttgt accttgtcag caaagaagat gtagcgatct 20401 agccattgca tgagaaactc gcgacggttt ggcgaaacca gcaacaacac ggcatcttcc 20461 aggatgtatg ctgttaccaa gtcaagagtc cgggcggtag aattcacaaa aaccgtgtca 20521 cagccttgtc ctggcttgag gctttggatg tcgtttgtgc tttggttgtg caagaagcgg 20581 
aggcgatcgc catcagaaac tttgatccgt ccccaaaaag agcgatcgca cacagcaact 20641 ccctcttgta ccgcttggat tgctgctgcg ttgttggtgt caatggcaga tgtagacata 20701 gttatatctt cacagtttgc cgctaggttg tgagcattga aaatcttagc aaatatgggt 20761 gagatactgt aacttaattt atagcgcttc tgacttcgtt gcaatacacc accgaacccc 20821 accccggctt tggctgacgc caaaacctcc c // LOCUS NODE_1530_length_20812_cov_4.82299920812 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20812) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20812) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..20812
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            157..345
                     /locus_tag="DP116_13795"
     CDS             157..345
                     /locus_tag="DP116_13795"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139525.1"
                     /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13795" /translation="MNECPCCSHQLLRHIRHQQVYWFCSSCRQEMPNFCNSVKASNLF STTLSQRLSGYSMKVNTI" gene 359..2815 /locus_tag="DP116_13800" CDS 359..2815 /locus_tag="DP116_13800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_13800" /translation="MKTLLNNEAARLEALLSYQILDTEPEEPFDALTRLAAYICQTPI ALISLIDANRQWFKSKIGLHATQTPRNIAFCNHAICAKRSSEAIAQSDVFIVTDALTD ERFATNPLVTCDPYIRFYAGVPLITSQGYALGTLCVIDYVPRELDSQQIEALQALASQ VMTQLEQRCTFAQLVYRTTTHRRNEERLQLALNAAQMACWDWDIQTNKVTWTKDHESL FGLALSSFGSTYEAFLQCVHPQDRKLLEQAMARSLEERYYEHEFRVVLPDNSIHWLVS KGEVFFNDIGKATRMLGVVWDITARKQQEQQIREEATLLNESQDAILVLDTNGHISFC SKSAEDLFGWSQSEAISPWLDATRITTKADELLFEQASSEFAQAQNITTEHGSWHGEL HLLTKSGKEIIVQSRWTLLLDEEQKPKSILIVNTDITDKKKLEAQLLRTQRLESIGRL SSGIAHDLNNILTPILLTAQLLQTQLYDQRSQRLLPILVNNAKRGANLVKQVLSFARG IEGKHTILQTRHLILEIQQILTEVFPKSIEFDTDILPDLWTVSGDATQLHQVLMNLCI NACDAMPEGGTLNISAENLVIDEHYALMNIDAKVGYYIVITVSDTGLGIPPEIIERIF EPFFTTKELGLGTGLGLSTVIGIVKNHAGFVNVYSEMGKGTEFKVYLPSSEKSEMQQH FQDLEGLAGDGELILVVDDEAAIREITKLSLEAYNYRVLTASDGVEALATYAQHQQEI SAVLLDMMMPNMDGLTTIRTLQKINSCIKIITMSGLVSNARIAKETGIGVKAFLSKPF NTKELLQTLNAVKSRNQRNQ" gene 2818..4842 /locus_tag="DP116_13805" CDS 2818..4842 /locus_tag="DP116_13805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019498572.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diguanylate cyclase" /protein_id="PRJNA477356:DP116_13805" /translation="MRILIVEDDELTAKALTMVLEDQNYAVEVASDGQAGWDLVQAFE YDLIMLDVMLPKLDGISLCQKLRSAESSSASLSHNFKIPILLLTGRDSSHDKAIGLDA GADDYVVKPFDCEEIVARVRALLRRGNSTSLPVLEWGNLRFDPSSLEVRYETRTVELT PKEYALLELFLRNPQRVFSCSAILDRIWSFDKTPTEEAVRTQIKGLRQKLKAAGAAAD FIETVYGIGYRLKPHKDVASFSSPGEQIQQQTLTALGGVWNRFQGKISQQVSVLEQAA THALQDALSQELHAQAEQQAHTLAGSLGTFGFSEGSQLARKIEHLLQAGQSVGKKQGK HLRKLVLELRQQIEQSPKVLVSATQTNQDERFDTNAEAVVMVVDDDPQILATLQTLLE PWGLEVITLNDPRHFWETLEACSPDLLILDIMMPHLSGVELCQVVRNDPRTCGLPILF LTAHTDAASVNQVFAVGADDFVSKPIVGPELVTRIINRLERIKLLRSLAEIDQLTTVF NRYRATQDLNKFLHLSGRHNQPLCLAILDLDNFKLINNAFGHATGDAVLRQFGQLLRQ SFRFEDVVARWGGEEFVVGMYGMTKSDGVSRLEEVLKTLYEQEFLSPECTKFRITFSA GVAQYPEDGTNLQSLYQVADAALYQAKAAGRNCVLPAASAPKSESSVTKH" gene 4868..9397 /locus_tag="DP116_13810" CDS 4868..9397 /locus_tag="DP116_13810" /inference="COORDINATES: protein motif:HMM:PF01590.24,HMM:TIGR00229" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13810" /translation="MTNSLPENEAQELRLALEATHMGIWDWDILTNKVTWAGEHEQLF GFAPGTFGGTYEAFDACVHPMDRQGVTQAVNCARQQRQNYHHEYRVVWPNNSIHWVEG KGQFFYDETGQAVRMIGTVMDVSERKQTEVALRERENRWRAIIDAEPECVKLVAADGT LLEMNAAGLAMIEVESADAVIGKSIYSLIAPEYKAAFQHLNDSVCTEGKKGTLEFEII GCRGTRRWMETHAVPLRNESNGTLIQLAITRDITLQKQAQESLNARLHQQAAVAKLGQ LALSSCDLFTLIDEAVALVAQCLKVEYSMVLELLGDGDALLLRAGVGWQAGLVGHVRV STGIDSQAGYTLLVNEPVITEDLPRETRFSGSPLLHQHQVISGVTVSIPGKNNPYGIL GAHTTRQRAFTQDDINFLQAIANALADAIERQLFEQILQASLKDLADIKFAIDESSIV AITDHKGTITYVNDKFCEISKYSRAELLGQNHRIINSGYHSKEFFQQMWASITRGQVW KGEIKNRAKDGTCYWVDTTIVPLLNSQGQPLQYVAIRSDITERKRAQEALRLSEERFR ALVEGVKDYAIYIVNPEGYIVSWNAGAERIKGYQAEEILGQHFSCFYTHEDIQLCKPE QKLRVAAVEGRYEDEGWRVRKDGSRFWANTIITALRDESGQLYGFSKVIRDISERKQT EEALRKAKDELEMRVAERTAELISVNAQLHSELDERQRTQSALRLSQARFAGILEIAD DAIISIDASQCITLFNQGAEKIFGYTAAEVIGQPLDLLLPARYTCVHRQHVAGFASSN GTARRMGERREIFARCKDGTEFPAEASISKLELGNEIIFTVILRDITVNQRAREVLER LSHQNELILNSVGEGLCGLDKLGKITFVNPAAAKLLGYQVTELIGQSIDIILPLSKLD GTPYTLTDSPIYESLKDASVDQVTNEVFRRKNNSSFPVEYASTPILEQGKVKGAVITF KDITERQLVERMKDEFISVVSHELRTPLTSIHGSLGMLSSGLLSPASERGKRLLEIAV DSTDRLVRLINDILDIERIESGKVTMAKEVCNAGDLMTSAADVMQAMAQRYGVNLSVS PICVDLWADRDRLIQTLTNLLSNAIKFSPSGGTVWLTAERQELQILFQVKDQGRGIPS DKLETIFERFQQVDSSDSRNHEGTGLGLAICRSIVQQHSGNIWAQSNLGEGSTFYFTL PIPKDAQQTTQETANHYRPLVLVCDHDLRARTVLQTMLEQQNYRVVTVASGEEVLQQA AALQPNAILLDLLIPGMNGWEIIAVLKQHPDTKNIPIMLCSVFLPTDSSIGEIDSYTT ALTIDGTPQKLTVDVLDDNENYYNTSFVDWLCQPVNEQSLLESLKQVVARPHKRVRIL LVEDDNDLAQLLIMLFERHEIEVFHAQTVKEAIRQSQQLHPDLLILDLILPDKDGFAV VEWLSQHNYLHNLPLVVYSAKDLDDSQRNRLKLGQTEFLMKSRVTVQEFEQRVLELLS GITHNRDKEGSSDDS" gene 9387..9767 /locus_tag="DP116_13815" CDS 9387..9767 /locus_tag="DP116_13815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310848.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_13815" /translation="MTAKRVLVIDNEEYIREVAQICLETVAGWEILTAADGRSGLVLA QSEQPDAILLDVMMPDMDGPTTFQHLQANAATAYIPVILLTAKVQASDRRRYASMGMK AAIAKPFDPLQLASLVAEALGWSL" gene 9911..10546 /locus_tag="DP116_13820" CDS 9911..10546 /locus_tag="DP116_13820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sigma-70 family RNA polymerase sigma factor" /protein_id="PRJNA477356:DP116_13820" /translation="MTTDELDSYLLQLACVAQQHPPHSQERQIALTKLIQSIVRFGNL WYPSKNQFFSNVQDIYNEARQELFLYICQNIDKYDPERGTVLVWVNVLLERRFFKDTL RKNLTHGSVTKMTLTDLDNLALPEESKDLTEIVKECIESDPEDIFKNEYIEKCPQATF QALAMRRFSGKSWKEISAEFEMKVPTVSSFYYRCIKKFSHSLKEYCENQVD" gene 10562..11521 /locus_tag="DP116_13825" CDS 10562..11521 /locus_tag="DP116_13825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13825" /translation="MRGNLSESLTFTVPISLEAHALAERFRKKHRNPQKAKQVYLNTL AVFTVQFYLRCMGIQTSWQESLSWNPLIQTLMDIADLEVIGFGKIECLPVLPNEQIIQ ILPEVCSDRIGYVAVQFEQSLEEATLLGFVKTVPDNGSLLLSQLGSLEDLLIHLNQPI EKVEQLIHLSQWFMNVVDAGWQTIESLLNPQQSELVFRFRGTEHTLDIHPENSTSSLQ KGKLLDLGRDSKSKIIALVVGLLPVSREEINIGVKVYPTAGQSHLPEELELLVLDSDG IAVMQATARNTKSIQLNFSSEIGERFSVKIALGDVSLTEFFIT" gene 11897..13240 /locus_tag="DP116_13830" CDS 11897..13240 /locus_tag="DP116_13830" /EC_number="6.3.4.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="adenylosuccinate synthase" /protein_id="PRJNA477356:DP116_13830" /translation="MANVIVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIV VKGQTFKLHLIPSGILYPNTECMIGCGTVIDPQVLIAELDQLEALGVSTRNLLISETA HVTMPYHRLIDQASEERRGSYKIGTTGRGIGPTYADKSERTGIRVLDLMNPEGLREQL EWTINYKNVILEKLYNLPLLDPEQVIDEYLGYAERLRPHVVDTSLKIYDAVQRRRNIL FEGAQGTLLDLDHGTYPYVTSSNPVAGGACVGTGLGPTMIDRVIGIAKAYTTRVGEGP FPTEIDGEMGELLGTRGAEFGTTTGRKRRCGWFDAVIGRYAVRINGMDCLAITKLDVL DELEEIQVCVAYEIDGERCEHFPSNARKFARCRPIYKTMPGWRTSTTDCRSLEELPKE ALDYLKFLAELTEVPIAIVSLGASRDQTIIVEDPIHGPKRGLLHQDGTPASLLSA" gene 13322..13954 /locus_tag="DP116_13835" CDS 13322..13954 /locus_tag="DP116_13835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L25" /protein_id="PRJNA477356:DP116_13835" /translation="MQLTVDCQKRPEGTKPNALRRSGKIPAHLYGHKGTEAISLVLDA KVVERLLIKASVNNTLIDLNITDVPWRGKTLLREVQSHPAKRTTYHLSFFAVAGHGDT DVEVPVNFVGEPVGVKLEDGVLDTQITVVSLRCAPENIPEAISVDVSNLHVGDSLYVD ALNLPANVTYLGDSEQAIVRILPRQVNAEAEAEAEAAAAAESEPATSEQA" gene 14049..15596 /locus_tag="DP116_13840" CDS 14049..15596 /locus_tag="DP116_13840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317512.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="adenylyl-sulfate kinase" /protein_id="PRJNA477356:DP116_13840" /translation="MTEVSIPPLIQQMLQPEFYPHQVKEPIQLVQTHISYVLLTGDYA YKVKKPMNFGFLDFSTLEKRGHFCQEELRLNQRGAAELYLEVFPVTQVGQKYQLGGTG EPVEYVLKMLQFPNEGLFSKMFEQNQLTEELLEELGRVVAEYHDTKTVTNDYIRSFGE VNQIRAAFDENYEQTEKYIGGPQTQKQFEETKQDTDKFFAERGELFKSRIQNNYIREC HGDLHLGNICLWKDKIWLFDCIEFNEPFRFVDVMFDVAYAVMDLEGQQRPDLSNAYLN TYLEVTGDWEGLQVLPIYLSRQSYVRAKVTSFLLDDPSVPSSVKEEVTKKAANYYKQA WEYTKPRQGRLILMSGLSGSGKSTTARYLARQLGAIHVRSDAVRKHLAGIPLMERGGD DLYTPAMTEKTYGRLLELGILLAKEGYVVILDAKYDRQQLREQAIAAAVEHQLPLQII SCTAPPSVLQQRLSNRTGDIADATADLLASQLQQAEPLTEKEKPLAKIVDTTQPIEAQ LENVIRQ" gene 15611..16108 /locus_tag="DP116_13845" CDS 15611..16108 /locus_tag="DP116_13845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311696.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13845" /translation="MAPPQLNFSPNFVTQILLGLFFTLVFLSTGYDPALSIFLGVLGG FALGWVTASTKSGPQSSTVATSEGVDAGLKYWLFFLLGFVFAGYQPPMSIFLGAIAGI GGGWTIAWWQSKEESRTQLPQEQLEQIDEAEVSGERSSRRQVRKTTRRFRRRPGSFNF KFWER" gene complement(16089..16373) /locus_tag="DP116_13850" CDS complement(16089..16373) /locus_tag="DP116_13850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318535.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13850" /translation="MPTNLIILIGALFVAYIVFRALLSLLQTAISTAIAILIIVVILM FFGFSPQDLMREINNLPQILNQFIKEVKKILGLSAIPTYSRELLTIFPKI" gene complement(16776..17225) /locus_tag="DP116_13855" CDS complement(16776..17225) /locus_tag="DP116_13855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Orange carotenoid protein" /protein_id="PRJNA477356:DP116_13855" /translation="MTYTVDESTRPALEAFQRFDVDTQLALLWFGYLDIKDQLNPAPP PSVETPAKAVFDLIQSLSKEEQLQAQRDLINGAATDISRGYDALSPNAKLDVWLLLAQ GMENGTIIGMPSDYQLPAETDKFTADIQKLDFEQRINFMLSAVQKMG" gene 17363..17557 /locus_tag="DP116_13860" /pseudo CDS 17363..17557 /locus_tag="DP116_13860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744222.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 17839..19686 /locus_tag="DP116_13865" CDS 17839..19686 /locus_tag="DP116_13865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860023.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13865" /translation="MIKTSYDFDQAILPAGSSLKTNILLRFRADIAESPRRNLNLSLV IDRSGSMAGGPLHHALKAAESVVDQLEPNDILSVVVYDDEVDSVVPPQPVTDKAALKN SIRKVRAGGITNLSGGWLKGCEHVKTQLDPQKINRVLLLTDGHANMGIQDPKILTATS GQKAEEGITTTTLGFAQGFNEDLLIGMARAASGNFYFIQSIDEATEVFSIELDSLRAV VGQNLKVTLELADGVSLVDTLSFAKVSQNEAGLAVITLGELYEGEDKLLGLSLLISSA QVGELPVMRLHYSADVVQDDLIQSVSGTADIVAKVGTVEEAALASTSRIILELSRLTI AKAKETALDLAEHGKHQEAEQILRALVQELRDKGLNENFEIAEEIDQLEYFAGRIAQK ALGNAGRKELRDQTYQTMTRNRSDLVARGVTAGDEVYAMPVVNEVGSGVELYCVREGG KLRVKVMSEGYDSTKNVQFPRAIRAEGARYVVEGLELSSDGTFYRVNGNITRFAQPGE ADIFVAHRQSRSASTGKASKGPASAADLPTTDTTKDGILVQCVKDGSKLRARVVSDGY EPDWNMRFPRSVREEGMLYVVDEIKTAPDGKSYIACGEIKRFVQPTISS" gene 19729..20283 /locus_tag="DP116_13870" /pseudo CDS 19729..20283 /locus_tag="DP116_13870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320651.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="phosphorylase" BASE COUNT 6033 a 4280 c 4840 g 5659 t ORIGIN 1 gacacttagt taatattccg gacgtgtcta ttgagaataa gttaaaggtg aagtgattca 61 cacaaagcta aagtttcttg tctcaattga aacaaaattt aatttgaatt tttactaact 121 tgcttggttc acctagaaat ttgaggtaaa aagatcatga atgaatgtcc ttgctgctct 181 catcaattat tacgccacat tcgccatcaa caagtttact ggttttgttc gtcctgtagg 241 caggaaatgc cgaatttctg caattcagtc aaagcgagta acttattctc cacgactcta 301 agccaacgtt tatcagggta cagcatgaaa gttaacacta tctaatacct gttgagttat 361 gaaaacatta cttaacaatg aagccgcgag gctggaagcc ctactcagtt atcagattct 421 tgacacagaa ccagaagagc cgttcgatgc actgactcgt ttagccgcat atatttgtca 481 aacccctatt gccttaatca gcttaatcga tgcaaaccgc caatggttca agtcgaagat 541 tggtttgcat gcaactcaaa caccccgcaa catcgcattt tgtaatcatg cgatctgcgc 601 aaagcgcagc agcgaagcta tcgcccaatc cgacgttttt attgtgactg atgcattaac 661 cgacgaacgg tttgcaacca atcctctcgt cacctgcgat ccctacattc ggttttacgc 721 tggtgtacct cttattacat cccaaggata tgcactagga acgctgtgtg tgattgacta 781 tgtgccacgg gaacttgatt ctcaacagat agaggcattg caagccttgg cgagccaagt 841 catgacacaa cttgagcaac ggtgtacatt cgcacaatta gtataccgca ctactacaca 901 cagacggaat gaggagcgtt tgcagttagc tctaaatgct gcccaaatgg cttgttggga 961 ctgggatatt caaacgaata aagtgacttg gaccaaggat catgaatcgc tgtttggtct 1021 agctcttagc agttttggct caacgtatga agccttcctt caatgtgttc atcctcaaga 1081 ccgtaaattg ctagagcaag cgatggctcg ttctctagaa gaacgctatt acgaacatga 1141 attccgcgtt gttttgcctg ataacagtat ccactggctc gtctcaaaag gagaagtttt 1201 tttcaacgac attggcaaag caacgcgtat gcttggagtt gtctgggata taaccgctcg 1261 caagcaacaa gaacaacaaa ttcgagagga agcgactctg cttaatgaat cacaagacgc 1321 tattctcgtt ttagacacga atggtcatat tagcttttgc agcaagagtg ctgaggattt 1381 atttggttgg agtcagagtg aggcgataag cccttggctt gacgctacgc gtatcaccac 1441 aaaagctgac gaacttttat ttgagcaagc ttcttctgaa tttgcacaag cccagaatat 1501 caccactgag catggttcat ggcatggtga gttgcatttg ttgacaaaat ctggcaaaga 1561 aatcatcgtt caaagccggt ggacactgct gctagatgag gagcaaaaac 
ctaaatcaat 1621 ccttattgtt aatacagata tcacagataa gaaaaaactc gaagcgcagt tactccgcac 1681 ccagcgcttg gaaagtatag gcagactctc cagtggtatt gctcacgatt taaacaatat 1741 actaaccccc attttgttaa ccgctcaact tttgcagaca caactttatg atcagcgaag 1801 tcaacgtctg ctaccaatac tagtcaacaa tgcgaagcgt ggagctaatt tggttaagca 1861 agtgttatcg tttgcacgag gaatcgaagg aaagcacacg attcttcaaa caagacacct 1921 gatcttggaa attcagcaga ttctcacaga agtctttcca aaatctattg agtttgacac 1981 tgacatatta ccagaccttt ggacagtttc gggagatgcg acacaactgc atcaggtact 2041 gatgaacttg tgtatcaatg cttgcgatgc aatgcctgag ggtggaacac tgaatatttc 2101 tgccgaaaat ttggtcatag atgaacacta tgctttgatg aatattgatg ctaaagttgg 2161 atactatatt gttattactg tttctgatac tggacttggc attccacctg aaattataga 2221 gagaattttt gagccatttt tcacgactaa agaattgggc ctcggtacag ggttaggtct 2281 ttccacagtg attggtattg ttaaaaacca cgctggtttt gtgaacgtat acagtgaaat 2341 gggaaaaggc acagaattta aggtgtattt gccttcttcg gaaaaaagcg aaatgcagca 2401 gcatttccaa gacttggaag gactcgcagg agatggcgaa ctaattttgg ttgtggatga 2461 cgaagctgca attcgtgaga ttaccaaatt atctttggaa gcgtataact atcgagtgct 2521 aactgccagt gatggtgttg aggcattagc aacatatgct cagcatcaac aggaaattag 2581 tgccgtgtta cttgatatga tgatgccaaa catggatgga ctcacgacca tccgtacgtt 2641 acaaaaaata aattcttgta tcaaaattat tactatgagt ggactggtgt caaacgctag 2701 aatagcaaaa gaaaccggaa ttggagtcaa agcgtttttg tcgaaaccct ttaatacgaa 2761 ggaattgtta caaactctta acgcggtcaa gagcaggaat caaaggaatc agtgataatg 2821 aggattctca tcgtagagga tgacgagtta actgctaaag cgttgacaat ggttcttgag 2881 gaccaaaatt atgccgttga agtcgccagc gatggtcaag caggttggga tttggtacaa 2941 gcatttgagt atgacttaat catgcttgat gttatgttgc ccaagctgga tggcattagc 3001 ctttgccaaa agctgcgatc tgctgaaagc agcagcgctt cgctatcgca caactttaaa 3061 atacccatcc tcttgctcac aggtcgagac agcagccatg ataaagcaat cgggcttgac 3121 gcaggtgcag atgattatgt tgtcaagcct tttgactgtg aggaaatagt cgctcgtgtt 3181 cgtgctctac tgcgtcgagg caattctacc tcactccctg tcttggagtg gggaaattta 3241 cgttttgacc ccagtagctt agaagtacgt tatgaaacac gcactgtaga attaactccc 
3301 aaagagtatg ctttgttgga gttatttctg cggaatcctc aacgagtgtt tagctgtagt 3361 gcaattcttg atcgcatttg gtcgtttgat aaaacaccca cagaagaagc agtcagaact 3421 cagatcaaag gtttgcgaca gaagttgaaa gcagcaggag cagctgctga tttcattgag 3481 actgtttatg gcattgggta tcgcctcaaa ccacataaag acgttgcatc cttcagttct 3541 cctggggaac aaattcaaca gcagacattg acagcattgg gtggagtttg gaaccgattt 3601 caaggaaaaa ttagtcagca agtgtccgtc ttggaacaag ctgccacaca tgcgttgcaa 3661 gatgcactca gtcaggaact acatgctcaa gcagaacagc aagctcacac actagcaggt 3721 tcgttaggca cttttggttt tagcgaaggc tcccaactgg cgcggaagat tgaacatttg 3781 ttgcaagctg gtcagtccgt tggaaagaaa cagggcaaac acttgcgtaa attagtgttg 3841 gaactacgtc aacaaatcga gcaatcaccc aaagtgttag tttcagcaac ccagactaac 3901 caagatgaaa gatttgacac gaatgcagaa gcagtggtga tggtcgtgga cgacgatccg 3961 caaattttgg ccactctgca aaccttacta gaaccttggg gacttgaggt aatcaccctc 4021 aacgatcccc gccacttctg ggaaacgcta gaggcatgtt caccggatct gctaatttta 4081 gacattatga tgccgcattt gagtggagtt gaactgtgtc aggtggtgcg aaacgatccg 4141 cgcacttgtg gtttaccaat acttttttta accgctcaca cagatgcagc cagtgtgaat 4201 caggtgtttg ctgtaggtgc agatgacttt gtgagcaagc cgattgtcgg accagaactg 4261 gtaactcgca ttatcaatcg cttggagcga atcaaactgc ttcggagttt agctgaaatc 4321 gatcaactga ctacagtatt taaccgttac agagcaaccc aagacctaaa taaattcttg 4381 cacttgagcg ggcgacacaa tcaaccactg tgcttagcta ttctagattt ggataacttc 4441 aagcttatca acaacgcttt tggtcacgca actggcgatg cggtattacg ccaatttgga 4501 caattgctaa ggcaatcatt tcgctttgaa gatgtagtcg cccgttgggg tggagaagaa 4561 tttgtcgtgg gtatgtacgg tatgaccaaa agtgatggcg tttctcggct tgaagaggtg 4621 ctaaaaactc tgtacgagca ggagttttta tcccccgagt gcaccaagtt tcggataact 4681 ttcagcgctg gcgtagctca gtatcctgag gatgggacta acttgcaatc attgtatcaa 4741 gttgcagatg cagcccttta tcaggcaaag gcagctggac gtaattgtgt attacctgct 4801 gcaagcgctc ccaaatctga aagcagtgtt acaaagcatt gactagaaat aactataaat 4861 aaattttatg actaattctt tacctgaaaa tgaagctcag gaactgaggt tagccttaga 4921 agcaacccat atgggtatct gggattggga tattttaact aacaaagtta cttgggctgg 4981 
cgaacatgaa caattgtttg gttttgcacc tggtactttt gggggaactt atgaagcctt 5041 tgatgcctgt gttcacccta tggatcgtca aggagttaca caggcagtaa actgtgcccg 5101 tcaacaacga cagaattacc atcatgaata tcgggttgtc tggcctaata atagcattca 5161 ttgggtcgag ggcaaaggac aattttttta tgatgaaaca ggtcaagcag tacgtatgat 5221 tggcactgtt atggatgtga gcgaacgcaa gcaaacggaa gtcgcactcc gagaaagaga 5281 aaatcgttgg cgtgccatca tcgatgctga acctgagtgc gtgaagctgg ttgcagcgga 5341 cggcacactc ctagagatga atgctgctgg tctagctatg attgaggtgg aaagcgccga 5401 tgcagtgatt ggtaagtcga tatattctct gattgcgcct gagtacaaag cggcgttcca 5461 gcatttaaac gacagtgttt gtaccgaagg caaaaagggg acgctggagt tcgagattat 5521 tggttgccga ggtactcgtc gctggatgga aactcatgcg gttccattac gcaacgaatc 5581 aaatggaact ttgatacagt tggcgattac acgtgatatt accttgcaaa agcaggctca 5641 agagagcttg aacgcacgac tccaccagca agcagccgtg gctaaattag gtcaattggc 5701 tttatcaagt tgtgatcttt ttaccttaat agacgaagct gttgccctcg ttgcccaatg 5761 cctgaaagta gagtatagca tggttttgga actgcttggt gatggtgacg ctttactttt 5821 acgagcaggt gtaggatggc aagccggact tgttggtcat gtaagagtga gtacaggtat 5881 tgattctcaa gctggctata ccttacttgt caatgagcca gtgattacag aggatttgcc 5941 cagagaaact agatttagtg gttcaccttt actgcatcaa catcaagtca tcagcggtgt 6001 aactgtttcc attccaggaa agaacaatcc ttatgggatt ttaggcgcac atacaacaag 6061 acaacgggct ttcactcaag atgacatcaa ttttcttcag gcaattgcca atgcactggc 6121 agacgcaatt gaacgtcagc tctttgagca aattctgcaa gcatcactca aagacctggc 6181 agacatcaag tttgctatag acgagtcttc catcgtagca atcactgacc acaaaggaac 6241 gattacctac gttaacgata aattctgtga aatctcaaaa tactctagag cagaactttt 6301 gggacaaaac caccgcatca ttaattctgg gtatcactcc aaagaattct ttcaacagat 6361 gtgggcgagt attaccaggg gacaagtttg gaaaggcgaa atcaaaaacc gtgccaaaga 6421 cggaacgtgt tactgggtag atacgaccat agtaccgctt ttaaatagtc aaggacaacc 6481 attacaatat gtagcaattc gtagcgacat caccgagcgc aaacgggccc aagaggcgct 6541 gcgactaagc gaagagcgct tccgcgcatt agtagaaggt gtaaaagact acgcaattta 6601 catagtcaat ccagaaggat atattgtcag ttggaatgct ggggccgagc gcattaaagg 6661 ctatcaagct 
gaggaaattt tgggtcagca tttctcttgc ttttacactc atgaagatat 6721 ccaactgtgt aagccagagc aaaaactgag ggtggctgcg gttgaaggtc gctatgaaga 6781 tgagggctgg cgcgtacgca aggacgggtc acgattttgg gcgaatacga tcataacagc 6841 cttacgtgat gagtctggac agctatatgg cttttccaaa gtgatccgcg acatcagcga 6901 gcgcaagcag actgaagaag cgctacgcaa agctaaggat gagctagaaa tgagagtcgc 6961 agagcggaca gccgaattga ttagcgtcaa cgcgcagttg cactcagaac tcgatgagcg 7021 ccagcggacg cagtcagctt tgcggctatc acaagctcgg tttgcgggaa ttctggaaat 7081 tgccgatgat gcaattatct cgatagatgc aagccaatgc attaccttgt ttaatcaggg 7141 agcagaaaag atttttggtt acacagctgc tgaggttatc ggtcaacccc ttgatttgct 7201 cctaccagcc agatacacct gtgttcatcg ccagcacgtc gctggttttg cctcctcgaa 7261 tggtacagcc cgtaggatgg gagagcgccg cgagattttt gcccgctgca aagatgggac 7321 agagttcccg gcagaagctt cgatttctaa gcttgagctt ggaaatgaaa taatatttac 7381 tgttatttta cgtgatatta cggtaaatca acgagccagg gaagttctcg aacggctcag 7441 tcatcaaaat gaactgattc taaattcagt gggagaaggc ttgtgtggat tggataagtt 7501 aggcaaaatc acctttgtca atccagccgc cgccaagttg ctgggatacc aagtgacaga 7561 attaattggt cagtctatag atattatctt gcccctctca aaattagacg ggacaccgta 7621 caccttgaca gattctccca tttatgaatc gctaaaggat gcatcagtgg atcaagtcac 7681 taacgaagta tttcggcgca aaaataattc gagttttcca gttgagtatg cgtcaacccc 7741 aattctagaa caaggtaaag tcaaaggagc tgttatcacc tttaaggaca ttaccgaacg 7801 ccaacttgta gaacgaatga aggatgagtt tatttctgtc gttagccacg aactccgcac 7861 acccttgact tcaattcatg gttctttggg gatgctctct agtgggttgc tttcaccggc 7921 ttccgaacga ggtaagcgcc tactggaaat tgctgtagat agcactgatc gcttggtgcg 7981 actcatcaac gacattttag atatcgagcg cattgagtcg ggtaaggtga caatggcgaa 8041 agaggtttgc aatgccggtg atttgatgac ttcagcagca gatgtgatgc aagctatggc 8101 acaacggtat ggggtgaatt tatcagtttc acccatttgt gttgatttgt gggctgatcg 8161 cgatcgcctg atccaaactc ttaccaatct gttaagtaac gccatcaaat tctcaccctc 8221 aggaggtaca gtttggttaa ctgccgaacg tcaagaattg cagatactat ttcaagtcaa 8281 ggatcaggga cgaggcattc ctagtgataa gctggaaacc atttttgaac gctttcaaca 8341 agtcgattcc tccgactcac 
gcaaccatga aggaactggc ttgggcttgg caatttgccg 8401 cagtattgtg caacagcact ctgggaacat ctgggcccag agtaacttgg gtgagggtag 8461 taccttttac tttaccttac caatccctaa agatgcacag caaaccacac aggaaactgc 8521 caatcattat cgtcccttag tattggtgtg tgatcatgac ttaagagcca gaactgtgtt 8581 gcaaacaatg ctggaacaac aaaattaccg ggttgtcacc gtggcttccg gtgaggaagt 8641 cctgcagcag gcagctgcct tgcagccaaa cgccattcta cttgatttac tcataccagg 8701 gatgaatggc tgggaaatta tagcagttct caagcagcac ccggatacta aaaatattcc 8761 gatcatgctt tgtagtgtct ttttaccaac ggattcttca atcggtgaga ttgattcata 8821 cacaacagcc ctcactatag atggaacacc acaaaagctt actgttgatg tgttggacga 8881 taatgaaaac tactacaaca ctagttttgt tgattggctg tgtcagcctg tgaatgaaca 8941 atcgctgttg gagtcactca agcaagttgt ggctagacct cataagcgag tccgtattct 9001 gctagtagaa gacgacaacg atttagcaca gctgctgatt atgctgtttg aacgtcatga 9061 aattgaggtt tttcatgctc aaactgtaaa ggaggcaatc cgccaaagcc agcaactgca 9121 cccggatttg ctcattctcg acttgatact acctgataaa gatggatttg cagttgtaga 9181 gtggctttct cagcataact atctgcacaa cttacctttg gtggtatact ctgctaagga 9241 tttggatgat tcccaacgca atcgcctcaa gctggggcag actgagttcc tgatgaagag 9301 tcgcgttaca gtgcaagaat ttgagcaaag ggtgttggaa ctcctttcag gaattaccca 9361 caaccgagac aaagagggca gcagcgatga cagctaaacg tgttttggtg attgataacg 9421 aggaatacat tcgagaagtt gctcagattt gcctagaaac tgtggcgggt tgggagattc 9481 taactgctgc tgatgggcgt tctgggttag ttttggctca aagtgaacag cctgatgcca 9541 tcctgctgga tgtaatgatg cctgatatgg atggtcctac aacctttcaa catttacaag 9601 ctaatgctgc aacggcatac attccggtga ttttgctgac agcgaaagtg caggcttcgg 9661 atcgtcgccg atatgcgtct atgggaatga aagctgcaat tgccaagccg tttgatccgc 9721 tacagttagc aagtcttgta gcagaagcct taggctggag tttgtaacag atttatgagt 9781 cttgagtgca actcaaaatg taaacaattg taaaaaatac agttgttttg cataaagtac 9841 catgctttga tcacgagaga gaaataggag ttcacttcgg ctcgcaaaac ttggtttgga 9901 gtctcaagtc atgaccacag atgaacttga ttcgtatctg ctacagcttg cttgtgttgc 9961 gcagcaacac ccgccgcatt cccaagaacg gcaaatcgct ttaacaaagc taattcaaag 10021 cattgtgcgc tttggcaatc tttggtatcc 
atccaaaaat caatttttca gcaatgttca 10081 agatatttac aatgaagcac gtcaggaact ttttctttat atttgtcaaa atattgacaa 10141 gtacgacccg gaacgcggta cggttttggt atgggttaat gttcttttgg aacgacgatt 10201 ttttaaagat actcttcgga aaaacctcac tcatggttct gtaacaaaaa tgacccttac 10261 tgacctagat aatcttgctt tacctgaaga atcaaaagac ctcacagaaa tcgttaagga 10321 atgtatagaa tcagacccag aagacatttt caaaaatgag tatatagaga aatgtcccca 10381 agcaacgttt caagcattag ctatgcgaag attctctgga aaatcatgga aagagatttc 10441 agcagaattt gaaatgaaag ttcctacagt cagcagcttc tattaccgtt gtattaaaaa 10501 gttttctcat tcgttaaaag aatattgtga gaatcaggta gattaaggaa taggtgtaca 10561 aatgagagga aatttatcag aatctttaac ctttacggta ccgataagct tagaagctca 10621 cgcactggca gaacgtttcc gaaaaaaaca taggaatcct cagaaagcca aacaggttta 10681 tctcaatact ttggcagtat ttactgtcca attttatttg cgttgtatgg gaattcaaac 10741 ttcttggcaa gaaagtttaa gttggaaccc tctcatacaa actctcatgg atattgctga 10801 tttagaagtt attggtttcg ggaaaataga gtgtctcccg gtgttaccta acgagcaaat 10861 tattcagatt ctaccagaag tttgttcaga tagaattggc tatgtagctg tacagtttga 10921 gcaatcactt gaagaagcaa cactcctggg atttgttaaa acagttcctg ataacggatc 10981 gttacttcta agccagttgg gttctcttga agatttactt atacacctga accaaccaat 11041 tgaaaaagtt gaacagctaa ttcacttgag ccagtggttt atgaatgttg ttgatgctgg 11101 ttggcagact atagaatcgc ttttgaatcc gcaacaatca gaattagttt ttagatttcg 11161 aggaactgaa cacactttag atattcaccc agaaaattca acttctagtc tacagaaagg 11221 taaactgctg gacttaggac gagattcaaa aagtaaaatc atcgctttag tggtaggact 11281 gctcccagtc tcaagagaag aaataaatat tggtgttaaa gtgtatccaa cggcaggtca 11341 aagtcactta ccagaagaac ttgaacttct agtattggat tcagatggaa tagcagtcat 11401 gcaagcaact gcgagaaata caaaaagtat tcaactaaac tttagtagcg aaattggaga 11461 acgttttagc gttaaaattg ctttaggtga tgtcagtttg acagagtttt ttataaccta 11521 aagctcacca gacaaaatga attgtgctct ttgcaatacc tatgtttaaa ttattaagga 11581 aacgaagaac ctcacccgcc tgcggcaccc tctcctaccc tgcaagaagc cctatgcctg 11641 cggcacggct gcgcaagagc gggtatgtcc agaggacacg ctacgctttg cgcttacgct 11701 tgcgtgcgct 
ttgcgcatac gccagttcct gtagcctgac cgaacccttt cagcagtctt 11761 ccctcgtcgg gaaagcctca ctcgatcgct gcttcaccac cggactggct ccagaagtct 11821 aatcgcaaat ttgacaaaag tcagaaaatt tgctataaat agttttttgt gcaaacttta 11881 gcattttaag aaacacttgg ctaacgtaat tgttataggt gcccagtggg gcgatgaagg 11941 aaaaggtaaa ataacagact tactcagccg ctccgcagat gttgttgtac gttaccaagg 12001 gggcgtcaat gctggacaca caattgtagt caagggtcaa accttcaagc tgcacttgat 12061 tccctctggt attttgtatc caaataccga gtgcatgatc ggctgtggaa cagtcataga 12121 tccacaggtt ttgatcgcag aacttgacca actagaagca cttggtgttt ccactcgcaa 12181 tctgctcatt tctgagacgg ctcatgttac gatgccttac caccgattga ttgaccaggc 12241 atcggaagag cgcaggggaa gctataaaat tggcacaact ggtcgtggga ttggtccaac 12301 ctatgctgat aaatcagagc ggactggtat cagggtatta gatttgatga accctgaggg 12361 gctgcgtgaa cagttggagt ggacaattaa ttacaaaaac gtcattttag aaaagcttta 12421 caacttacca ctattagacc ccgaacaagt gattgacgag tatttggggt atgcggaacg 12481 ccttcgaccg cacgttgttg atacgtcctt aaaaatatac gatgcggttc agaggcggcg 12541 caacatttta tttgaaggag cgcaaggtac tctcctcgac ttggatcatg ggacataccc 12601 gtatgtcacc tcctctaacc cagtggcggg tggggcttgt gttggtactg ggttaggacc 12661 aacaatgata gaccgagtga ttgggatagc caaggcttac accactcggg taggcgaagg 12721 acctttcccc acagaaatag acggggaaat gggagaattg ttgggtacgc gcggtgccga 12781 atttggtaca accactggac gcaaacgacg ttgcggctgg tttgatgcgg ttatcggtcg 12841 ctatgctgtc cgcatcaacg gtatggattg tctagcaatt accaaactcg atgtcctcga 12901 cgaattggag gaaattcaag tttgtgttgc ctatgaaatt gatggggaac gctgcgaaca 12961 ctttcctagc aatgcccgta aatttgcacg gtgtcgccct atctacaaaa ccatgccagg 13021 atggagaact tcaacaactg attgccggtc tttagaagaa ttgccaaagg aagcgctaga 13081 ctatttaaaa tttttggcag aattaacaga agtgccgatc gcaattgtct ccttaggagc 13141 gagccgcgac caaactatca ttgttgaaga ccccatccac ggtccaaaac gtggtttatt 13201 gcaccaggat ggtacacctg cttcgttgct gagtgcttag agcgaacaat atagcccaat 13261 gccttgtgcc cgtactaagt gatactatct aaatctcaaa tctcaaatct aaaatcagaa 13321 aatgcagcta acagtcgatt gtcagaagcg accagaaggc actaagccca atgctttgcg 
13381 ccgttctgga aaaatccccg cgcatttgta cggtcataaa ggtaccgaag caatttctct 13441 cgtccttgat gctaaggttg ttgaacgtct gctcatcaag gcttcggtaa ataatacttt 13501 gattgacctt aatattactg atgttccttg gcggggaaaa accttgctac gggaagttca 13561 gtctcatccc gcaaaacgta caacttacca cctgagcttt tttgccgttg caggtcacgg 13621 tgacacagat gtggaagtac ctgtgaattt tgttggagaa ccagtaggtg taaagctaga 13681 agacggtgta ttggacacac aaattacagt cgtgtcgctg cgttgtgcac cggaaaatat 13741 tccggaagcg atttcagttg atgtctccaa cctgcacgtg ggagatagtt tgtatgttga 13801 cgcactcaat ttgcctgcaa atgtgacata tctgggtgat tctgaacaag ctattgttag 13861 aattttacct cgacaagtta atgctgaggc tgaggcggaa gctgaggctg cggctgcggc 13921 tgagtcagaa ccagcgacat cagaacaagc ctaatatttg atatagttac gaagaacacc 13981 aggggtttac ctctggtttt tttgtatcct aattttaacc gtgaagacgc caaggacgcc 14041 aagagaaaat gacagaagtt tctattcctc ctttaattca gcagatgttg cagcctgagt 14101 tttatccgca tcaggtgaag gaacctattc aattggttca gacgcatatt tcctacgtgc 14161 tgttgactgg ggattatgca tacaaagtga aaaagccgat gaattttggc tttttggatt 14221 tttcaacttt ggagaagcgg ggacattttt gtcaggagga gttgcgttta aatcagcgag 14281 gtgctgcgga actttatttg gaggtgttcc cggtgactca agtcgggcaa aagtaccaac 14341 tgggaggaac gggagaacct gtggaatatg tgctgaaaat gcttcagttc cccaatgaag 14401 ggctgtttag caaaatgttt gagcaaaatc agttaactga ggaacttctc gaagagttgg 14461 gacgagtcgt cgctgaatat catgacacga aaactgtcac aaatgattac attcgctctt 14521 ttggtgaggt aaaccaaatt cgggctgcgt ttgacgagaa ttacgagcaa acggaaaaat 14581 atattggtgg tccccaaacg cagaagcagt ttgaggaaac caaacaagac acagataaat 14641 tttttgctga acgtggcgaa ctgtttaaga gcagaattca aaacaactat attcgagaat 14701 gtcacgggga tttgcactta ggcaatattt gtttgtggaa agataaaatt tggctgtttg 14761 actgcatcga gtttaatgag ccgtttcgct ttgtcgatgt gatgtttgat gttgcttatg 14821 ctgtcatgga tttggaaggt cagcaacgtc cagatttgag taatgcttat ttaaatactt 14881 accttgaggt gactggcgat tgggaagggt tgcaggtatt gcccatttat ttaagccgtc 14941 aatcttatgt tagggcaaaa gtgacttcat ttttattaga tgatccaagt gtgccttctt 15001 ctgttaagga agaagtgacc aaaaaagcag ctaattatta 
taagcaggct tgggaatata 15061 caaaaccccg tcaagggcga ctgattttga tgtcgggttt gtcgggttct ggtaaaagta 15121 cgacagcgcg atatttagct cgtcaattgg gggcgattca cgttcgttcg gatgcagtgc 15181 gaaaacatct ggcgggaatt ccgttgatgg aacgcggtgg tgatgatttg tacacacccg 15241 cgatgactga gaaaacttat ggacgactgt tggaattagg aattttgttg gcaaaagaag 15301 gctatgtggt gattttggat gctaagtatg accgacagca gttacgtgag caggcgatcg 15361 ccgctgctgt tgagcatcaa cttcccctcc aaattatctc ttgcacagca ccaccatccg 15421 ttttgcaaca acggctctct aaccgtactg gtgatattgc tgatgccacg gctgatttat 15481 tagcatctca acttcagcaa gctgaaccct taacagaaaa agaaaaacca ctggctaaaa 15541 ttgtggatac aactcaacca atagaggcac aattagagaa tgttattcgc caataggtag 15601 gctacatctt atggcaccac cgcaactcaa cttttcacca aacttcgtca ctcaaatctt 15661 attagggtta ttttttacac tagtcttttt gagtacggga tacgacccag ccctcagtat 15721 ttttctgggt gtattgggtg gttttgcctt gggctgggtg acagcatcaa ccaaaagtgg 15781 accccagtca tctacagtcg ctacctctga gggcgttgat gcgggcttga agtattggtt 15841 atttttctta cttggctttg tgttcgcagg gtatcaacca cctatgagta tttttttggg 15901 tgcgatcgcc ggtataggcg ggggctggac gattgcttgg tggcaaagca aggaagaatc 15961 aagaactcaa ctaccacaag aacaactaga acaaatcgac gaagctgaag tttctggtga 16021 aagatcaagc agacgccaag tgaggaaaac cactcgacgt tttcgtcgcc gtcctggaag 16081 ttttaatttc aaattttggg aaagatagtc agcaactccc gactataagt cgggattgct 16141 gacaagccca gtattttctt cacttctttg atgaattggt tcagaatttg aggtaagttg 16201 ttgatttctc gcatcagatc ttgaggactg aaaccaaaaa acatcaagat aaccacaatg 16261 ataagaatgg cgatcgccgt actaatcgct gtttgtaata aactgagcaa agcccggaag 16321 actatatacg cgacaaatag cgcaccaatg agaattatga ggtttgttgg catgattttg 16381 cctgaataaa tcacaatttt atagttgtga atctcctgat ttatagtcag gaaagcgtca 16441 attatatcgg attatgtata agtttcagta gaaaatttca ttctgctgat tttcatttgt 16501 atacatattt tgagctattt gtcaatagat gtatgaacaa ttcaacaggt ttctctgggt 16561 ccaaccctta cacccctata cccttacacc cctataccct taccccctta cacccttttt 16621 tttgacgttt agcaaggttt gaagcgataa accaaacgtt gaaaaccaag gctgaacaca 16681 ttaatcaaac tacaaatact 
caattgaatg cacaagtcta gcctgaatcc ctatccaggc 16741 tagacttgtg catcttcatt cagaaatggt atcagctaac ccattttttg cactgcactc 16801 agcatgaaat taatccgctg ctcaaagtct agcttttgaa tatctgctgt aaatttgtcc 16861 gtttcagcag gtagttgata gtcagacggc attcctatga tggttccgtt ctccattcct 16921 tgtgccagca gcagccaaac atctaatttg gcattgggac ttagagcatc ataaccacga 16981 ctgatatcgg tagctgcacc attaataagg tcgcgttgtg cttgcagttg ttcttcctta 17041 gacaaactct ggattaggtc aaacactgcc ttagctggtg tttctacgct tggaggtggt 17101 gctgggttta gttggtcttt aatatccagg taaccgaacc aaagtagggc taattgagta 17161 tccacatcaa agcgttgaaa agcttccaga gctggtcttg tgctttcgtc aactgtgtag 17221 gtcataaaaa actccaaaca agttaaagtt attctcaatt tcctgagtga ctaggaaaat 17281 ttgctcttag taactttaca caagttaacg aataaataaa gatagataac cctacttagg 17341 gtaggaatag gattcactac ataagataga ttaccgttgt ggcaaatggt ttagttgttt 17401 ccctcaatac cgtgaagtat caccaactga aagccataaa gtgattgcgc ttgagccagg 17461 aaattgcatt ttcttaactg gatttgatgg ggagaatgtc ttagaagttg gaaagggaga 17521 catcggaagt atacaacgat tatgtcagca ccttgaccga aaatcgaaag gcgcaaaagt 17581 tctttcaact tgctaaaaag aaactctgtg ttgcctgggt ttcatagtaa atgtgcatcc 17641 atactctcaa gagttcaaag tccagtcaga tattaattgg cagttaaatc tgtcagatac 17701 caaagctata aaagtttact ttaactgccc ctcatcacac tggacacact gatttcggag 17761 taaactaaag agagcactgg ttgtccctat ttcataaagc ttgcagctta tcacagacaa 17821 ctctgaggga tataaaaaat gatcaagaca agttacgact ttgaccaagc tattctacct 17881 gcaggatctt cattaaagac taatattctg ctgcgttttc gtgctgacat agctgaatct 17941 ccccgacgta accttaacct ttctcttgtc attgaccgat caggctctat ggcaggtggt 18001 ccgttgcatc atgcgctcaa agctgctgag tctgtggtgg atcaacttga gccaaacgac 18061 attctctcag tggttgttta cgacgatgaa gtggacagtg ttgtgccacc ccagcctgtg 18121 actgacaaag ccgcgctgaa aaattccata cgcaaggtga gagctggcgg tattaccaac 18181 ttatcggggg gatggctcaa aggctgcgaa catgtgaaga cgcaactcga tccgcaaaaa 18241 ataaaccgtg ttctgctgct taccgatggt cacgccaata tgggcattca agaccctaaa 18301 atacttacgg cgacatcagg gcaaaaagct gaggaaggca tcaccacaac taccttgggt 18361 
tttgctcagg gtttcaatga agacctcctg atcggtatgg caagggctgc cagtgggaac 18421 ttctacttca tccagagcat cgacgaagca acagaagtgt ttagcattga gctagattct 18481 ctgagggcag tggtgggaca gaacctcaag gtgacacttg agttggctga tggtgtcagc 18541 ctcgttgata ccttaagttt tgctaaagtc agccagaatg aggcgggtct tgctgtcatt 18601 acgttggggg aactttacga gggtgaagac aaacttctgg ggttgagtct gctgatatcc 18661 tctgctcaag ttggcgagtt acctgtgatg aggctgcatt acagcgctga tgttgtgcaa 18721 gacgacttga ttcaaagcgt gtcaggcaca gcagatatcg ttgccaaagt tggcaccgtt 18781 gaggaagcag ctcttgcctc tacaagccgt atcatccttg aactgagccg cctcaccatt 18841 gctaaagcca aagaaactgc cctcgatcta gctgaacatg gtaagcatca ggaagctgag 18901 caaatcctcc gtgcgcttgt gcaagaactg cgggataaag gattgaacga gaattttgaa 18961 attgctgaag agatagatca gctggagtat ttcgcgggtc ggatcgccca aaaagctctg 19021 ggcaatgccg gacgtaaaga actgcgggat cagacttacc agacaatgac gcgcaatcgc 19081 agcgatttgg tggctcgcgg tgtcactgct ggcgatgaag tgtacgcgat gccagttgtg 19141 aatgaagtcg gttctggtgt ggaactgtac tgcgttcgtg aagggggtaa gctgcgagta 19201 aaagtgatgt ctgagggcta cgattcaacc aagaacgtcc agtttcccag ggcaattcgt 19261 gccgaaggag cacgctatgt cgttgaagga ctggaactct caagcgatgg cactttctac 19321 cgtgtaaatg gcaacatcac ccgttttgct caacctggcg aagcagatat cttcgttgcc 19381 cataggcaat cgaggtcagc tagcacaggc aaagcctcca aaggtcctgc cagtgcagcc 19441 gacctcccca caactgacac tacgaaggac ggcattctag ttcagtgtgt caaagatggc 19501 agcaagctac gtgctagggt ggtttctgat ggatatgaac cggattggaa catgcgtttt 19561 ccccgctccg ttcgtgagga aggaatgctt tacgttgttg atgaaatcaa aacagcacct 19621 gatggtaagt cttatattgc gtgtggtgaa attaagcgat ttgtgcaacc tactattagc 19681 agctgatata atttccgcta ggaaaattat caaaagtaga gatgtaagat gtttcagccg 19741 gatcattcac caaaagtctt tattagctat agccacgact cagcagaaca catggatcgt 19801 gtacttgcat tggctgagcg tttacgatct gaaggtgttg actgcaatat agatcaatat 19861 gaaatagcac cttcagaagg ttggtaccaa tggttaagga ctcaattgga atggactgat 19921 tttgtagttg tagtttgtac cgaacagtac tatctccaat ttctccaatt tagaggtcga 19981 gaagagctag gaaaagggcg gaaagtaacc tgggaagggt caattattac 
tcaagagctt 20041 tattataatt ccgatctcaa aagtatcaaa tttattccag ttgtgttttc tgctcaggat 20101 ataaactaca taccgataat atttcgtgct ttaagcatgt actctttgga cacagaagaa 20161 ggatatacta cactgtatcg tcacttgaca aatcagccgt cttccccagt acctgtatta 20221 ggcgaaattc ctttagctgt taacaagcct ttagatgaca agcgtttaga agatgtattt 20281 aagcggtcag gcatacccac gcatacgttt gttaaaccag ttgagtatca acggcttttt 20341 gttgctttac gcacacctgg tcgaggatta gttgttgaag gaccatctgg tgtgggtaag 20401 actacttgtg ttttaaaagt attggagcaa ctttctctgc atgataggag aataaaaata 20461 ctaacggctc ggagacgaca agacacggaa gaaattgttc ttttaccaga aaaaaatagt 20521 tatgatataa ttattcttga tgactttcat gtattggaag ataaagcaaa acaattaatt 20581 gctgactata tgaaggttat tgctgatgag gaaaaagcta ttaaattagt ccttatcggt 20641 attaataaag ctggtaacag tcttgttaaa ttagcaaaag atttgaataa tagaattgaa 20701 acaatcagat ttggaatcaa tcctgaaaat aatgttatcg aacttgtaaa aaagggcgaa 20761 gaagctttaa agattaaatt taatgagact ttacttacaa atatagttaa ag // LOCUS NODE_1551_length_20671_cov_5.06679320671 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20671) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20671) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20671 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..961 /locus_tag="DP116_13875" CDS <1..961 /locus_tag="DP116_13875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="amine oxidase" /protein_id="PRJNA477356:DP116_13875" /translation="VEVVEFSSSSFFFGEGIDFILAHQPDFDVVWCRGTVGEQIFRPW VERIEKSGAKVLANHRVTDLIVDDNNQVTGVVCNDEVFDADAVIFAVGITGMRKILSS SPSLQKRDEFRNLSNLGAIDVLATRLWFDRKISIKRPSNACFGFDATTGWTFFDLNAL HDEYRNEPGTVVEADFYHANQFLNLENEEIIPIVHRYLANCVPEFRNAKVIDSSVIRL PQAVSHFAPGSYRYMLPAKTSFKNVFMSGDWIVNRHGSWSQEKAYVTGLEAANLVISY FGEGALAKILPVEADEAHIQLARTVNKSVRQIGKSILPEFWLP" gene 1325..2947 /locus_tag="DP116_13880" CDS 1325..2947 /locus_tag="DP116_13880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319919.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13880" /translation="MLQKRFDLIKNNFGLKIALTLAASFFILMLVFCLHRFYTFYASY DQGLFDQLFWNTIHGHIFQGSLSSGQSSAYTQDGQIQTVFYCHLGQHFVIDFLLWMPI YALFPTGATLVVLQVSLITAAGLVLYALCRHYLPPSIAILITASFYGANAVIGPTFAN FYEHCQIPLFVFSLFLALEKRKWWLFWLFLILILGIREEAGIITFGVGLYLFLSRRYP RLGLAVCLISFSYVTVVTNVIMPLFSNDNSRLYLTARFDQYVPGNNSPSTLQLLWGII THPKELIQSLFTPVDKRVKYFLGQWLPLAFIPAISPSAWIVAGPPLLVLLMQDGQSAL AMSIRYALTVVPGLFYGTILWWSQHQERFQPSFRRWWIRFIALSIFFTITSNPNRAFY FLVPESIHPWVYVPLTRQWEHVGHIRSLMQYIKSDASVSATTYLLPHLATRRGIIRLP AIQLRLDSGEVIDVDFALADLWQLQQYQPAFKSDRRQLQDFIAFIDKLLAQGKYGLVD VQDGVVLLQKKVVSQPQAITRWLEWKAEIQKI" gene complement(3177..3371) /locus_tag="DP116_13885" CDS complement(3177..3371) /locus_tag="DP116_13885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13885" /translation="MSSSKLNRTLNQGFGVFLGIAIAVWVLRGFGILTFLPGGVIWLL VLAAIAMAILSYVQKTWWRY" gene 3821..4375 /locus_tag="DP116_13890" CDS 3821..4375 /locus_tag="DP116_13890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315226.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyclase" /protein_id="PRJNA477356:DP116_13890" /translation="MSASFTSDSVPTNSSMPWTQDKQRLLMQGEILVQTRSHTAWGGA VTAWMYLPLVRSNIWQQITDYPRWVQYFPDLTKSEVLQSGEVKRLYQAAQKTFFFFTA QVEIYLDVVEEFGQHIQFRLQKGSFHDFTANLDLKDCGNGTLLSYTVQATPIIPIPSI FIQQAMNFELPANMRKMRQVLCKD" gene 4382..4957 /locus_tag="DP116_13895" CDS 4382..4957 /locus_tag="DP116_13895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF924 domain-containing protein" /protein_id="PRJNA477356:DP116_13895" /translation="MSQAKEILDFWFGSSGSPDYGKPKSFWFSKKPEFDEELRIRFLT DYQKAAGGYLDDWMDFPDSCLALILLLDQFPRSMFRDTPEAFATDWEALSVAQHAVAQ RYDQKLLPVQRWFVYLPFEHSENLEHQSQAVRLFQQLGDDPDSVSCIDYALRHMQVIE RFRRFPHRNKILGRVSTPEEKEFLKHKGSSF" gene complement(5056..6300) /locus_tag="DP116_13900" CDS complement(5056..6300) /locus_tag="DP116_13900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_13900" /translation="MLLTYQYKLNPTSEQVVTLETWGELLRRHYNYALGQRLDWLTRT RCQIDRCSLVCEPIGDIPDKTDYYTQASGLKQTKELFPEYKNIYTDCQQQNLMRLDKA WKRWLVPDKNGKRGGRPRFKKRGDICSFTFPRVNSPKAGAHLTGSILKLSKIGEVEVI LHRPIPEGFEIKQATILRKADGWYTSFSLEDKTVPNALPVNEIKTATGIDVGLEKFLT TADGQSIQVPQYYRKAQATLARQQRKLARKTKGSKNYQKQLSKVAKLHLHVARQRKEF HYQVAHWLVNSYDLIVFENLNIKGLARTRLAKSILDVAWGAFLQIMQAVAVRRGKHTR GVDPRGTSIDCSGCGERVEKTLAVRVHSCSCGLVIDRDWNSALNLLKHKSVGLPISGC GGLEVAQPVKQQVSEVNLRCAP" gene 6307..6732 /gene="tnpA" /locus_tag="DP116_13905" CDS 6307..6732 /gene="tnpA" /locus_tag="DP116_13905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136513.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS200/IS605 family transposase" /protein_id="PRJNA477356:DP116_13905" /translation="MIAFSAMSYNIGYRSVYSLNIHLVLVTKYRRKVINQAILKRLQE IFETTCLKWRSKVTEFNGESDHVHLVISYPPDVEVSKLVNNLKTVSSRLIRKEFYEHV SQFYNKPVFWTGAYFVASCGGVTLEQLKSYVEKQSSPAN" gene 7192..7515 /locus_tag="DP116_13910" CDS 7192..7515 /locus_tag="DP116_13910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015364311.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="PRJNA477356:DP116_13910" /translation="MVHLDNVRSSVAQILPTEKAQQIAEVFGVLADTNRLRLLSALAS QELCVCDLAALMKMTESAVCHQLKLLKAMRLVRYRREGRNVYYTLADSHIVNLYQSVE EHLQA" gene 7659..8885 /locus_tag="DP116_13915" CDS 7659..8885 /locus_tag="DP116_13915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410760.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aminoacetone oxidase family FAD-binding enzyme" /protein_id="PRJNA477356:DP116_13915" /translation="MKPLRIVVVGGGAAGFFGAIAATEANPHAQVTILEASRQLLAKV RISGGGRCNVTQACFDPSGLVQNYPRGGKALLGAFTRFQAKDTVAWFAAHGVPLKTEA DGRMFPITNSSETIVNCLINTAKAAGVEIRTGTFIVDVKQLSTPNFEISLKSGEILEC DRLLLATGSNPIGYKIAQKLGHTIEPPVPSLFTFNIKDEFVLKLAGVSVNPVRLRLSV QGFPQLEQTGPLLITHWGFSGPAVLKLSAWGARVLHYRNYQATLHINWLPNLQPEEVR QKLLAVKNEVAKKAIALHRGVDLPHRLWQYIISRAGITPEDRWAELSNKTLNQLVQEL TKGQYQINGKGVFKEEFVTCGGVNLKEVNFKTMESRLVPRLHFAGEILDIDGVTGGFN FQSAWTTGYLAGVGSS" gene 9312..10499 /locus_tag="DP116_13920" CDS 9312..10499 /locus_tag="DP116_13920" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13920" /translation="MNLSGEQRKKLENALVDAFRDKASLERMVQYELNKNLNEIAPDS SLQGIIYKLIQKAEAEAWVEKLIHAALESNPGNLKLQNIARELSSSSATIPSENMIGS RNRNEYIFPDTIVLCVDNLDNITKENLLERTRVLPRKPGSLKLFGLIDTPSVENENWR LKDLLKNLPQEINRKWLEELVKSVYSALKGELDPNRYERNPQEVSTFQSSEGKLFRPI LYKRKRSIDDRSIEFTVIFEEHISRGYVENAPNLAYATLVTAFILANRLQLEVCNKYL PMLDDWSQEVSEVIRTRLQEVRISFEYIEEDAERRRKGEAINKNNKDRLRDSFESKDE RTTIESNLSVQQRYKNILLQADTRHNIDEVRVALTELKRLNKIVLRMIIQRLSGFYDA DSP" gene 10632..11384 /locus_tag="DP116_13925" CDS 10632..11384 /locus_tag="DP116_13925" /inference="COORDINATES: protein motif:HMM:PF13489.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_13925" /translation="MEQEFDLIFRHKLWNCPESASGFGSTLKNTEEIRKQLPSLVKKL AAKTFLDLGCGDFNWMKEIDLEVDKYYGVDIVHSICKDNNQNHARYNKVFIRQDLTKD ALPKADIILCRDALVHLSFCDIGKAIQNIKKSGSRYLLATTYPNVEVNFEICTGGWRP LNLQKQPFNWPEPIFLIKDSEEIGLPDWGKHLGLWEVQKMMLPSRGTYLYSGSANVPG SYAKVLPALSCIETYLEPRLAFCEQPGRTHLN" gene complement(11605..12342) /locus_tag="DP116_13930" CDS complement(11605..12342) /locus_tag="DP116_13930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410865.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-phosphosulfolactate phosphatase family protein" /protein_id="PRJNA477356:DP116_13930" /translation="MKLFIYHTPELTPTDKVPECAIAVDVLRATTTMATVLAAGGEAV QVFSDLDQLLAVSEKWSGEKRLRAGERGGSKVPGFDLGNSPLDCTPELVQGRRLFIST TNGTRALQRIQDAKAVLAAAFVNRAAVVQYLLEKQPETVWIVGSGWEGSFSLEDTACA GAIAHSVAQQTKLSQDELAGNDEVISAIALYSQWQDNLLELLHLASHGKRLLRLNSDE DLKYCSQTDILDVLPIQQEPGVLMSRK" gene 12757..13176 /locus_tag="DP116_13935" CDS 12757..13176 /locus_tag="DP116_13935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996725.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopY family transcriptional regulator" /protein_id="PRJNA477356:DP116_13935" /translation="MAPLPNYRPKQLSVGPLEAEILNIIWELGSATVKDVHDRILTDP NRELAYTSVTTVLRRLTDKGWLVCIKKGRAYYWRPLVTKQQADVIRAHEQLQRFLAVG NPDVIAAFADSLDETASEQIQAIAKRIQAARQAREEQ" gene 13176..14015 /locus_tag="DP116_13940" CDS 13176..14015 /locus_tag="DP116_13940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="M56 family peptidase" /protein_id="PRJNA477356:DP116_13940" /translation="MHLIMIVASMVVVYWIRCQWTQPQGTWSERWQYALFLFLFPPLL IFMTAIAVICMGPQGKMSGLQTGWFSYVLALISLAFSVVLCIKLAWGGWQSVKSARKC SLVDLAGRQVRVLDTGVPFAGQIGFWQPELVLSQGLLHTLSPEQLETVLAHEQGHYHY RDTFWFFWLGWVRSCTAWLPNTEALWQELLALRELRADAHAASQVDPLLLAESLLLVV SSPPISSEICCAALSSSHVDRLEQRIEALLAQPEPVPEVQLQSWNSYLLAFLPLVTVI FHT" gene complement(14179..14358) /locus_tag="DP116_13945" CDS complement(14179..14358) /locus_tag="DP116_13945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13945" /translation="MTFKKNHKLGFTSDRPFDKDPVCFKVLPGVKQKLKAVPDWQERL REFVNELIKDVDNCQ" gene 14413..15534 /locus_tag="DP116_13950" CDS 14413..15534 /locus_tag="DP116_13950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008181581.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_13950" /translation="MFLSVKTKLKLSKAQKTVMSKHAGIARFTYNWGLATWNNLVKDG LKPNKYILKKFFNNHVKPEFDWIKEKGICQRITQYAFDNLGDAFSRFFSGKGGYPNFK KKDHHDSFTIDAGGKPIPVGGKSVKLPTIGWVRTYEGLPHVTCQSITISRTADSWFIA FAYEQEHEPTAKQYEVVGVDLGVKELATLSTGVVFPNPKHYKSALKKLRKLSRELSRK KKGSNNRHKAKIKLAKHHLRIANLRKDTLNKATTFLCKNHAKIVVEDLNVKGMLANHK LAQAIADCGFHEFKRQLEYKAKKFASEIIIADRWFASSKTCSACGHVQDMPLKMRTFN CENCGSSIDRDLNAAINLSHCSTRVSRLEACGVLRLAKA" gene 15646..16179 /gene="rfaE2" /locus_tag="DP116_13955" CDS 15646..16179 /gene="rfaE2" /locus_tag="DP116_13955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="D-glycero-beta-D-manno-heptose 1-phosphate adenylyltransferase" /protein_id="PRJNA477356:DP116_13955" /translation="MTPSLPHSLSPSPLPTSYKHIRDLDQLIALVASHRIAGRKIVFT NGCFDILHAGHVSYLQRAKVLGDILIIGVNSDDSIRRLKGSTRPINPLEDRMQVLAAL ACVDYLIPFEEDSPSHLISKLHPNIYVKGGDYTKETLPETPVVEEYGGVIEFLPFLEN RSTSKIIEQISQGKNHS" gene 16176..16719 /locus_tag="DP116_13960" /pseudo CDS 16176..16719 /locus_tag="DP116_13960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007921205.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="N-acetyltransferase" gene complement(17154..17486) /locus_tag="DP116_13965" CDS complement(17154..17486) /locus_tag="DP116_13965" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13965" /translation="MEPGVGESIGVVVPVVPGFLPPVVLAGFLPPVVLAGFLPPVVLA GFLPPVVPLGVAAPVGVGVAVVVGVVPVCAWTIDGKVATGVATAIPKTREAAMVLRSL LILTNHSS" gene 18199..19698 /gene="glpK" /locus_tag="DP116_13970" CDS 18199..19698 /gene="glpK" /locus_tag="DP116_13970" /EC_number="2.7.1.30" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747886.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycerol kinase" /protein_id="PRJNA477356:DP116_13970" /translation="MQKSGYILALDLGTTGNRAFVFNADGKIVGQAYQELTQYYPQPG WLEHDSLEIWQATCWVMQTAIKNAQIAPDEIIALGLTVQRETCLIWDKTTGQPLHRAI VWQDRRTAPLCHHLQEQGYAQEISDRTGLIVDAYFSATKLSWLLEHFPGVDLKNILAG TIDTWVLWNLTGGKVHATDHSNASRTMLMNLASCEWDETLLTLFKIPRHILPQIQPSL GTFGVTDASLLGVEIPITAILGDQQAALFGHGCHSPGLMKCTYGTGSFLVAHTGSQIV RSQHQLLSTLAWTQVNASGTLDVGYALEGSIFTSGACIKWLRDRVDLITTAAETEVLA NRVTDNGGVYFVPAFSGLGAPHWDMSARGAFFGITAMVQREHLVRAVLEALAYQVQEV VQAILASNTISIERLSVDGGACENNFLMQFQADVLGIPVERPKMREMTVQGIAFAAGL AAGFWDNDQLLVQQRRIERVFLPGVGRNHALENFTTWQKAVERAKHWAD" gene complement(19889..20647) /locus_tag="DP116_13975" CDS complement(19889..20647) /locus_tag="DP116_13975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_13975" /translation="MPQFKLDDLLGQLTKKLNEYGKELYWFSFMGQPSMKEPWGWQLD GHHLNINYFVLGDQVVMTPTFMGSEPVRATEGEHAGIRVFEPEESKGLATIRALTAEQ RDKAVLAKEIPTDVFTSAFRDNFEMRYEGISFAELMSGQQEILLDLIREYTSDIRNGH AEVKMEQVRQHLENTHFAWMGGFDNNSVFYYRIHSPVILIEFDHQRGQALGNKEPSRN HIHTVVRTPNGNDYGKDLLRQHYQRSNHSHVHSH" BASE COUNT 6004 a 4247 c 4391 g 6029 t ORIGIN 1 agtcgaggtt gtggaattta gctcttcatc cttttttttt ggcgaaggta ttgattttat 61 tttggcacat caaccagatt ttgatgtggt ttggtgccgg ggaactgtag gagaacaaat 121 ctttcgtcct tgggtagaac gtattgaaaa atctggggca aaagtgttag ccaaccatcg 181 agtcactgac ttaattgttg atgacaataa tcaagtcaca ggagttgttt gcaatgatga 241 agtctttgat gctgatgcag tcatttttgc tgttggtatc actggtatga gaaagatttt 301 atcaagtagc cctagtttac aaaaacgtga tgaatttcgc aatctcagca atttaggagc 361 aattgatgtt ttagcaactc ggctatggtt tgaccgtaag atttctatta aacgtccttc 421 taatgcttgc tttggttttg atgcaacaac gggttggaca ttttttgatt tgaatgcgtt 481 gcatgatgag taccgtaacg aacctggcac agttgttgaa gctgattttt atcatgctaa 541 tcaatttctt aatttagaaa atgaggaaat tattccgata 
gttcaccgtt atttagcaaa 601 ttgtgtgcca gaatttcgga atgccaaagt tattgatagc agcgtgattc gtctacccca 661 ggcagtatct cactttgctc ctggtagtta tcgttatatg ttaccagcga aaacgagttt 721 taaaaatgta tttatgagtg gtgattggat tgtaaaccgc cacggttctt ggtcacaaga 781 aaaggcttat gtcacaggtt tagaagcagc aaatttagtg atttcctatt ttggagaagg 841 tgcactagcc aagattctac ctgtggaagc agatgaagca catattcaat tggcaagaac 901 tgttaacaaa agtgtacgtc aaattggtaa atctattttg ccggagtttt ggttaccgta 961 aatgacactc acccgtcagg cagtgctgtt tcatgagact ttgtggcagt tgtggggaac 1021 tcttaacagg gaactcttaa caggacctcg taaaatctca tttttgcaag aggtctattt 1081 tctgaaaggc ttacattgtg aagtattaat gggacaggct gcgcgtaaat ccccttagga 1141 cttatgcgca gcatctgtgt aggaaatacg attaattttt acaactcaaa tagaattgct 1201 ataatccata atactccttt tttgtcaatt ctttctaccg tactgcctat ctagcttttc 1261 acaatgctta gcggcacaga catctacaca acagtatttt tagaactatt ggatttaaat 1321 ggtaatgctt caaaagcgct ttgacctaat aaaaaacaat tttggcttaa agattgcact 1381 gacattagca gcatcttttt tcatcttaat gctagttttt tgccttcatc gcttctacac 1441 tttctacgca tcctatgacc aaggtctttt cgaccagtta ttttggaaca ccatacatgg 1501 acatatcttt caaggttctc tctcttctgg tcaatccagt gcttataccc aagatggtca 1561 aatacagaca gtgttctact gccatttagg acaacacttt gtcattgatt tcttgctatg 1621 gatgcccata tatgcattat ttcccactgg cgcgactctt gttgttttac aagttagttt 1681 gataactgct gcggggttgg ttttatatgc tctatgtcgc cactatttac ctccttctat 1741 agctatttta atcactgcca gtttttatgg agctaatgct gtcattggac caactttcgc 1801 caatttttac gagcattgtc agattccttt atttgttttc tctttattct tggctttaga 1861 aaaacgtaaa tggtggcttt tttggctatt cctaattttg atattaggaa tacgagaaga 1921 agcaggaatc attacatttg gtgttggcct ttatctattt ttaagtcgtc gctatccgcg 1981 tttaggtcta gctgtatgtt taattagctt cagctatgtc acagtcgtga caaatgtaat 2041 tatgcccttg ttttctaatg ataattcgcg actttactta actgctcgtt ttgatcaata 2101 tgttccagga aataatagcc ccagtacttt acagcttctg tggggaatta ttacccatcc 2161 taaagaactg atccagagtc tgttcactcc ggttgacaag cgagtcaagt actttctcgg 2221 tcagtggcta ccgttagctt ttatacctgc tatctcacca tcagcttgga 
tagttgcagg 2281 tcctccatta ctagtgttgt tgatgcaaga tggtcaatct gctcttgcta tgagtattcg 2341 ctatgcctta actgtagttc caggtttatt ctacggcaca attttgtggt ggtctcaaca 2401 tcaagagcga tttcaaccaa gttttcgtcg ctggtggatc agatttattg ccctatctat 2461 tttcttcaca attacatcta atcctaaccg tgctttttac tttttagttc cagaatctat 2521 tcatccttgg gtttatgttc ctctaactcg tcagtgggaa catgtaggac atatccgctc 2581 attaatgcag tatatcaaga gcgatgctag cgtttctgca acgacttatc tgcttccaca 2641 tttggcaact cgtcggggaa ttattcgttt gccagctata cagttacgac tcgattcggg 2701 agaggttata gacgttgatt tcgccttagc agacttgtgg caactgcaac aatatcaacc 2761 agcctttaag agcgatcgcc gccaactgca agattttata gcttttattg acaaactttt 2821 ggctcaaggc aagtatggac ttgtggatgt gcaagatgga gttgttttgc tgcaaaagaa 2881 agttgtttct cagcctcaag cgataacacg ttggttagaa tggaaagcag aaatccaaaa 2941 aatttaggaa gattaagaga ttttttgatc gcaatcctcc gcagatttgt ctgaaagtgc 3001 ctacgcgtaa gcgcataaat cgcgcatcct ataaagaagt cgggaatctt tttgttcaca 3061 aatgatttag gactactcta tcaaaacagt attgggacgg atgtgtcctc attcgtcggt 3121 cacatcctca tgggctttct gctaccaaca cttttacaac gcgatggatt cttcttttag 3181 taacgccacc aggttttttg cacgtaactg agaatagcca tagcgatcgc cgcaagcacc 3241 aacagccaaa tcactcctcc tggaagaaaa gtgagaatac caaaacctcg cagaacccaa 3301 acagcgatcg caataccaag aaaaaccccg aaaccctggt ttaatgtacg atttaattta 3361 gacgaactca tcttatttgc cttaaactca acagtcactg cattttatta gtaacaatct 3421 tagtcactaa atctgaaagc cgcaaataaa aatcagacca gaggtagatg cagttaaagc 3481 gctgattcta tgggaagcag tcgtaagaaa atctactcta caatcagttc ttcatcttta 3541 ctctgacttc tgagtattat tactcaaatg ttaagatttg ctattttgtg aataaaattg 3601 tcactatatg ctgagagaga acacaatatt ttgatatgag tgataaacct gacatatgat 3661 aagcgtgaag ttttggtaaa aatgactaga gcatgactgt ttccggaaaa ctctcgtatt 3721 ttactggaat tagaattaat gacttggact gtataccaag ttagatattc acgagtttaa 3781 gcttatgacg ttgatgtgtt gaggagtgac tgaaaaaagt atgtctgcgt cctttacctc 3841 agattcggtt cctacaaatt caagtatgcc ttggactcaa gataaacaaa gattactgat 3901 gcaaggtgaa attctggtgc aaacgcgatc gcacacagct tggggtggtg ctgtcaccgc 
3961 ttggatgtat ttgcccttag tgcgatcaaa tatctggcaa cagataacag attacccgcg 4021 ttgggtacaa tattttccag atctgaccaa aagcgaagtt ttgcaatctg gtgaggtgaa 4081 acgtctgtat caagcggcac aaaaaacatt tttctttttc actgcccaag tagaaattta 4141 ccttgatgtg gtggaagagt ttgggcagca catacaattt aggctgcaaa aaggtagctt 4201 tcacgatttt acagccaact tagacctaaa agattgtggt aacgggactt tactgagcta 4261 tactgtgcaa gctacaccga tcattcccat accatcaatt tttattcaac aagccatgaa 4321 ctttgaattg cctgcgaata tgcgtaaaat gcgacaagtt ctttgtaagg attagtaaaa 4381 gatgtcacag gcaaaagaaa ttttggactt ttggtttggt agttctggtt caccagacta 4441 cggcaaacca aagtctttct ggttcagtaa aaaaccagag tttgatgaag aactgcgaat 4501 ccggtttctg acagattacc aaaaagcagc aggagggtac ctagacgact ggatggattt 4561 tcctgacagt tgtctagctt tgattttgct gttggatcaa tttcccagaa gtatgtttcg 4621 tgatactccc gaagcctttg cgactgactg ggaagcactc tcagtagctc agcacgctgt 4681 tgcacaacga tatgatcaga agttgctacc cgtacaacgc tggtttgttt acttgccttt 4741 tgaacatagc gaaaatttgg agcatcagag tcaagcagta cgactgtttc agcaattggg 4801 ggatgatcct gatagtgtga gttgcattga ttatgcacta cgccatatgc aagtgataga 4861 gcgttttagg cgctttcctc atcgcaataa gattttggga agagtttcaa ctccagaaga 4921 aaaggagttt ttgaagcaca aaggctcatc attttaacct acattccgca ccttgaactg 4981 tcatatcaac aataaacctg cttgaaaatc caattatcga ggaaggtgac aactctcccg 5041 ctaaccgcaa gcggtttaag gagcgcatct cagattcact tcggagactt gctgcttcac 5101 aggctgagca acctctaagc ctccacagcc agaaatcggt agtccaaccg acttgtgttt 5161 taacaaattc agtgcgctgt tccaatcgcg gtcaataact aacccgcaag aacaactatg 5221 aacacgaact gctagagttt tctctactct ttcaccacac cctgaacaat cgatacttgt 5281 ccctctagga tcaacacctc ttgtatgctt gccgcgtctg accgctactg cttgcatgat 5341 ttggagaaat gcgccccaag caacatcaag tattgactta gctaatcgtg ttctagccaa 5401 gcccttaatg ttcaagtttt caaaaacgat tagatcatag gaattaacca accaatgtgc 5461 gacttgataa tgaaattctt tgcgctgcct tgcaacgtgc aaatgaagct tagctacctt 5521 cgacaattgt ttttggtagt ttttagaccc tttggtttta cgtgcaagct tgcgttgttg 5581 acgggctagt gttgcctgcg ctttccgata atactgtggt acttgaatac tttgcccatc 5641 
agcagtcgtt aaaaactttt ccagcccaac atctatacca gtagccgttt taatttcgtt 5701 aacaggcaat gcattaggaa cggttttatc ttctagcgaa aaacttgtat accatccgtc 5761 agctttgcgt aaaatcgttg cttgcttaat ctcaaatccc tctggaattg ggcggtgcaa 5821 aataacttca acctcgccaa ttttagacaa cttgagaata ctacctgtta aatgtgcgcc 5881 tgcctttgga gaattaacac gcggaaacgt aaaagaacaa atatccccac gtttcttaaa 5941 acgtggtcgt cctcccctct tgccgttctt gtcaggaacc aaccagcgct tccacgcttt 6001 gtccagcctc atcaagtttt gttgttggca atcggtgtag atgtttttgt attcaggaaa 6061 cagttcttta gtttgcttaa gacctgatgc ttgtgtgtaa tagtcagtct tgtctggaat 6121 atcaccaatt ggctcacata ccaaactaca gcggtcaatc tggcatcttg tacgggttag 6181 ccagtccaat ctttgaccta gtgcatagtt ataatgacga cgcagcaact cgccccaggt 6241 ttcaagagtg acaacctgtt ctgatgtggg attgagcttg tactggtaag tgagtaacat 6301 atagtcatga tagcatttag tgccatgtct tacaatatag gatacaggtc agtttatagt 6361 cttaatattc atttggtgct tgtaacaaaa taccgcagaa aagttattaa tcaagcaata 6421 ctgaaacgat tgcaggaaat atttgagacc acttgcctca agtggagaag caaagtcaca 6481 gaatttaacg gagaaagcga tcatgttcat ttggtcatca gctatccgcc agacgttgaa 6541 gtgagtaaat tagtcaataa ccttaaaacc gtatctagtc gtttaatccg caaagagttt 6601 tatgaacatg tcagccagtt ttataacaag cctgtgtttt ggacaggtgc atatttcgtt 6661 gcatcgtgtg gcggagtcac tcttgaacaa cttaaatcct atgttgagaa acaaagtagt 6721 cctgctaatt aagggggcat cgaaccccct taattagccc gcaattcccc ttcgccgcca 6781 accgtcccgg tgtggcggga gtacccttgc ggaaaaccag atgggctaag aaaaagcaga 6841 aaagggattt gtccatgtga tcgtccaaga cggatgccac tgtagagaaa atctaaacag 6901 aggtcaaaga aggtttattt taaaacaaag caatgcatta gttagcaaga tgctgcagat 6961 gaattagatt ctttagtcac cttgtaaccg agatgttcga gacgcgaacg cacacgttta 7021 attagatgct cttgttggtg ttggctcaaa taattgacgt ccaagtctcg atagagccgg 7081 gctgtggaca acatatggta gacagtagtc aaaattcgcg ctcgccacac cgacaagagc 7141 tcttttccta cctcgtcgtg cagctaaact acgccaactt gtgacaccca tctggtacat 7201 ctagataatg tacgctcctc agtagcgcaa atcttgccaa cagagaaagc acagcagata 7261 gcagaagtct ttggggtgct ggcagatacg aaccgtctac gcctcctatc agctttggct 7321 tcccaggagt 
tgtgtgtttg cgatttagcc gcactgatga aaatgacgga atcagccgtt 7381 tgccatcaac taaaattatt gaaagctatg cgtctagttc gctatcggcg agaaggtcgc 7441 aacgtttatt acacattggc tgacagtcac attgttaacc tttatcagtc tgtagaagaa 7501 catttgcaag cttgatgaac tcgccttacc ggcggcatcc acaaaacaaa agtgcaaaat 7561 atgataagac actgatttta ttgttactat tcggttttgg tagtgggaac ttcaccatga 7621 accaccagca cgccaaagga tgacaggaga aaaaaaaagt gaaaccgtta cggatcgtag 7681 ttgtaggtgg tggcgcggct ggtttttttg gtgcaatcgc agcaaccgaa gcaaatcccc 7741 acgcgcaagt gaccatactc gaagccagtc gccaacttct ggcaaaagtc cgcatttctg 7801 gtggcggacg ttgcaatgtc actcaagctt gttttgaccc ttcaggattg gtacaaaatt 7861 atcccagagg tggaaaagct ttgctaggtg cgttcacgcg ctttcaagcg aaagatacag 7921 tcgcttggtt tgctgcccac ggagtacccc tcaaaaccga agctgatggc agaatgtttc 7981 ccatcactaa tagttccgaa actatagtga actgtctgat aaatacagca aaagcagctg 8041 gggtagaaat acgtacagga acatttattg ttgatgtcaa acaactcagc actcctaatt 8101 ttgaaatttc tctaaagtcg ggagagattt tggagtgcga tcgcctactc cttgccacag 8161 gcagcaatcc cataggctat aaaatagcac agaagttggg acatacaatt gaaccaccag 8221 tcccctcgtt atttaccttc aatatcaaag atgaatttgt tttgaagttg gctggtgtga 8281 gtgtcaaccc cgtgcggttg cgcttaagcg tacaaggatt tccccaacta gaacaaactg 8341 gacctttgct catcacccat tggggtttta gtggtcctgc ggtgctaaaa ctttctgcgt 8401 ggggtgcgag agttttgcac taccgcaact atcaagcgac tttacatatt aattggctac 8461 ccaatctcca gccagaagaa gtcaggcaga aattactagc agtgaaaaat gaagtggcaa 8521 aaaaggcgat cgccttacat cgcggcgtag acttacccca ccgtctctgg caatacatta 8581 tctcccgtgc aggtatcaca ccagaagacc gttgggcaga attatcaaac aaaacactca 8641 accagttggt gcaagaactg acgaaaggac aataccaaat taatggcaaa ggagttttta 8701 aggaagaatt tgtcacctgc ggcggtgtca accttaaaga agtcaacttt aaaacaatgg 8761 aaagtcgatt agttccaagg cttcactttg ctggagaaat tttggatatt gatggtgtca 8821 ctggtggctt taactttcaa agtgcttgga caaccggata tttagcgggg gttgggagca 8881 gttaactaaa aacttgtcta gaatagtaca aattttagat gagatcgtta gaacaccatg 8941 actttttgag tgattaaact caatcaattt ttcttaggat aagatgtaag ctaattcaga 9001 aaaaattaag taccaagcga 
tcgctccgga atcggattgt gtgcctttgt agctttgttc 9061 gactataatt agctattaat atccatcaaa aacgctgtta attatgcatg tcagtattgg 9121 gaacaacctc aagaaacgat ttcatgcgac tcacgccatc cgagagttga agatgagtta 9181 agttgtagta aagctaatag aggagtggct acaagtaaac gaggtctatt catttaaggt 9241 tcacgaaaat gtccttatta gtaacaaaac agctaagtag taaaaaggta gttagcctgt 9301 attgaaaaaa tatgaattta tcaggtgagc agcgtaaaaa actagaaaat gctttagtag 9361 atgcttttcg tgacaaagca tctttggagc gaatggtgca gtatgagtta aataaaaatc 9421 tcaacgaaat tgctcctgac agtagtttac aaggtattat ctacaaatta atacaaaaag 9481 cagaagctga ggcttgggta gaaaaactaa ttcatgctgc gcttgagtct aatccaggga 9541 atttaaagtt acaaaatatt gcccgcgaac tatcatcttc ttcagcaact ataccttctg 9601 aaaacatgat cggatccaga aacagaaacg aatacatttt tccagataca atcgttttgt 9661 gtgtagataa tcttgataat attacgaaag aaaatttgct tgaaagaact cgtgttttac 9721 ctagaaagcc aggttcattg aagctttttg gtctaattga tactccttca gtggaaaatg 9781 agaactggcg tttgaaagac ttactcaaga acttgccaca agaaataaat cgaaaatggt 9841 tagaagagct tgtaaaatct gtctattcag ctctcaaagg agaattagat ccaaatcgct 9901 acgagaggaa tccccaagaa gtttcgacgt tccagtcatc agagggtaaa ttatttagac 9961 ctatcttata taaaagaaaa cgctcgattg acgaccgttc aatagagttt actgttattt 10021 ttgaggaaca tatcagtagg ggatatgtgg aaaacgcgcc caatctagca tatgccaccc 10081 tagtaaccgc tttcatcctt gcgaatcgcc ttcaattgga agtgtgtaac aaatatctgc 10141 caatgcttga tgattggtcg caagaggtct cagaggtaat taggacaaga cttcaagaag 10201 ttaggatctc attcgagtat attgaagaag atgccgaacg tcgtagaaaa ggtgaagcta 10261 ttaacaaaaa taataaagac agacttcgag actcttttga atctaaggat gaaagaacaa 10321 caattgaatc taacttgtcg gttcaacaga ggtacaaaaa tattctactt caagctgata 10381 cacgtcacaa tatcgatgaa gtcagagttg cattaactga gcttaagcgt ttgaataaaa 10441 ttgttctgcg tatgataatc caacgattgt ctggatttta tgatgcagat tctccttgag 10501 tgcttagcat attgcttttt actataattg gattactccc atttttgtat aggcatttta 10561 ataggttttg cgggagatca tttctttagt gtaagctagc tggtttttgg tacctacttt 10621 agataaagac tatggaacaa gagtttgatc tcatttttcg ccataaacta tggaattgtc 10681 ctgagtcagc ttctggcttt 
ggttcaaccc tcaaaaacac agaggaaatt aggaaacaat 10741 tgcccagttt agttaaaaag ctagctgcaa aaacatttct ggatcttggc tgtggtgatt 10801 ttaattggat gaaagagatt gacctcgaag tcgataaata ttatggcgtt gatatagttc 10861 acagcatttg taaagacaat aatcaaaatc atgcacgata taataaagtt tttataagac 10921 aagacttgac aaaggacgca ctacccaaag ctgacatcat tttgtgccga gatgcattag 10981 tgcatttgtc cttttgcgat atcggtaaag ctattcaaaa cattaagaag agtggctctc 11041 gatatctgct ggcaacaact tatcctaacg ttgaagttaa ttttgagata tgcacagggg 11101 gttggagacc actaaatctg caaaaacaac cctttaattg gccagaacca atctttttaa 11161 taaaagattc tgaggaaatt ggtctaccgg attggggtaa gcacttaggg ctgtgggaag 11221 tccaaaagat gatgcttcct agtcgtggaa cttatttata ttctggttct gctaatgtcc 11281 caggctctta tgctaaagta ctccctgctc tgtcatgcat agaaacttat ttagaaccaa 11341 ggcttgcctt ctgtgagcaa ccagggcgaa cgcatcttaa ttaaacgaaa aattttaact 11401 tactaacatg ggcatcacta atactgctga ctcccgcaaa aataactcat cattttatga 11461 caaagcaaaa gtttcttaac agcctgagaa cttccataaa cggatgtgtt tgacttcaat 11521 gaaaaaagct ttaaaccccc ctgcttaatg tctagaaaag gggtaaaaag caaagatgaa 11581 aaaagaatga tttgattatt tcttttattt tcgactcatt aaaactcctg gttcttgctg 11641 tatgggcaag acatctaaaa tatcagtttg agaacaatat ttcagatctt catcagagtt 11701 caggcgcaac agacgtttgc cgtgacttgc aagatgaaga agttccaaca agttgtcttg 11761 ccattgagag taaagagcga tcgcgctaat cacttcatcg ttacccgcca gctcatcttg 11821 tgacaatttc gtctgctgtg caacactgtg agcaatcgca cctgcacaag ccgtatcttc 11881 tagagaaaaa ctgccttccc aacctgatcc gactatccaa actgtttctg gttgcttttc 11941 caacagatat tgcaccacag cagcacggtt gacaaaagca gctgcaagta ctgcttttgc 12001 atcctgtatt ctttgtaaag cacgggtgcc attagtggta ctaatgaaca agcgccgtcc 12061 ttgtacaagt tctggggtac aatcgagagg agaattaccc aaatcaaagc caggaacttt 12121 tgacccgcca cgttctcctg ctcgtaggcg tttttcaccg gaccatttct cgctaactgc 12181 taaaagttgg tccaaatcac tgaacacttg gacagcttca cctccagctg ccaaaacggt 12241 cgccattgtt gtcgtagctc gcagtacatc gactgcgatc gcgcattctg gcactttatc 12301 tgttggagtt aattccggag tgtggtaaat gaatagcttc acgccttgag tacacctgat 12361 
tgaatactac tgccgaaaat aatatttcta ctgaacaagc gtcaactacc gtagtgatag 12421 tttatgcaag aattggagca atggagttga ttataactac tgttatcaac acagcagtct 12481 taatttgttg gcaaatcagc ggtaatctga aataaagaac aaattttgta aaaactgtgt 12541 ttggcgttca atttgtcctc tactaccaaa ctgaggctat tcctgctgat atggtgacga 12601 gtatgtggtg atctggcgct attttgcctg atttcctacg ccctgggcaa gggctgagga 12661 gatacttgta gggttagtag tgctttttga aaacaactaa cccaacggga gagccactaa 12721 caatccccaa tccaaaatct aaaatcccaa acccatatgg cacctttacc caactatcgc 12781 ccgaaacaac tgtctgtagg tccgttagaa gcagaaattt tgaatatcat ctgggaactc 12841 ggttccgcta cagttaaaga tgtgcacgac cgaattctaa cagaccccaa ccgggaactc 12901 gcttatacat ctgtgaccac tgtactgcgt cgtctcaccg ataaagggtg gctagtctgt 12961 attaaaaaag gacgagcata ctattggcga ccattggtga caaagcagca agcagacgtt 13021 attagagcgc acgaacagtt acagcgattt ttggcggtgg gaaaccccga tgttatagct 13081 gcttttgcag atagtttgga tgaaactgct tcggagcaaa tacaggcgat cgccaaacgc 13141 attcaagctg cacgccaagc cagggaggaa caataatgca tctaataatg attgtggctt 13201 caatggtcgt tgtttattgg ataagatgcc aatggacaca accccaagga acttggagtg 13261 aacgatggca atatgcccta tttttatttc tctttccacc attgctcatt ttcatgacgg 13321 cgatcgccgt catttgcatg ggaccccaag gaaaaatgag cggcttacag acaggttggt 13381 tcagctatgt gctggcatta attagtcttg ctttttctgt cgttttatgc ataaaactgg 13441 catggggagg atggcagtct gtgaaatcag ctcgcaagtg ttccctggtt gatttagcgg 13501 gtagacaagt gagagttctg gatacagggg taccgtttgc cggtcaaatt ggattttggc 13561 aacccgaact cgtcctcagc caaggactgc tgcacactct ttcaccagaa caattggaaa 13621 ctgtcttagc ccacgaacaa ggacattacc attaccggga tacattctgg tttttctggc 13681 taggttgggt gcgttcctgc accgcgtggt taccaaacac agaagctttg tggcaagagt 13741 tgttagcctt gcgcgaacta cgcgccgatg cacatgctgc ttcacaagta gatcctttac 13801 tgcttgcaga atcacttttg ttggtagtta gcagtccgcc tatatcctca gaaatttgct 13861 gtgctgcatt aagttctagt cacgtagatc gtttagaaca gaggatagaa gccctgttag 13921 cacagccaga accagttcca gaagttcaat tacaatcttg gaatagttac ttgttggcat 13981 ttttgcctct ggtgactgta atatttcata cgtagttcat ttttttgatc 
tcaagtagag 14041 tgcgttcaca aagtgcgcac tctatttttt ttggtttttc gtttcaagtg tcaccgcaga 14101 cttgccacat cagtccagat attaatacag tgtctgagct tgtcttaagt tgccaataat 14161 atagtcaagc ctacactttt attgacagtt atcgacatcc ttaattaatt cattcacgaa 14221 ctctctaagc cgttcttgcc agtcagggac ggctttaagc ttttgtttga ctcccggcaa 14281 taccttgaag catacaggat ctttgtcaaa agggcgatcg cttgtaaacc ctaatttgtg 14341 atttttctta aaagtcattg catattcctg tttttatgag tatactaata ctagcagcgt 14401 taaaagcaat aaatgttttt atctgtcaaa acaaaattaa aactgagtaa agcccaaaaa 14461 acggtaatga gtaaacacgc gggaatagct agatttactt ataattgggg cttagcaact 14521 tggaataatt tggttaaaga tggattgaag cctaataaat atattcttaa gaagtttttt 14581 aacaatcacg taaaacctga gtttgattgg attaaagaaa aaggtatttg ccaaagaata 14641 actcagtatg cttttgataa tttaggcgat gctttctcta gatttttttc tggcaagggc 14701 ggttatccta atttcaaaaa gaaagatcat cacgattctt ttactattga tgcaggtggc 14761 aagcctatac ctgttggtgg gaaatcagta aaacttccaa caattggatg ggttagaact 14821 tatgaaggat taccccatgt tacgtgtcaa tcaattacaa tatcacgaac tgctgatagt 14881 tggtttatag cttttgctta tgaacaagaa catgagccaa ctgctaaaca gtatgaagtc 14941 gtaggggttg atttaggtgt caaagagttg gctacgttat caactggcgt tgtgtttcct 15001 aatcctaagc actataaatc cgccttaaag aagctccgaa aattgtctag ggaactttca 15061 agaaaaaaga aaggatctaa taacagacac aaagccaaaa tcaagcttgc taaacaccat 15121 ttaaggatag ctaacctcag aaaagatacg cttaataaag ccactacttt cttatgcaaa 15181 aaccacgcaa aaatagtagt agaggattta aatgttaagg gtatgttagc taaccataaa 15241 ttagctcagg cgatcgctga ttgtggcttt cacgagttta agcgccagct agaatataag 15301 gcgaaaaaat ttgctagtga aataataata gctgatcgct ggtttgcgtc aagcaaaacc 15361 tgctcagctt gtgggcatgt tcaagacatg cctcttaaga tgcggacttt caattgcgaa 15421 aactgcggat cttctatcga tagggatctc aatgcagcaa ttaacttgag ccactgctcg 15481 actagggttt cccggcttga agcatgtggc gtattgcgtt tggctaaagc gtgaaagctt 15541 accgagggat aagtgctccc atgctcccat tgaagtaaga aataaatgtc tggatttgtc 15601 tagagtttat gtagcagaat aacgaatcat tagaccaaag aaattatgac tccctccctc 15661 cctcactccc tctctccctc tccactcccc 
acctcctaca aacatatccg cgatttagac 15721 caactcatcg cccttgtcgc ctcccatcgc atcgcaggac gtaaaatagt cttcaccaat 15781 ggttgttttg atatactcca cgccgggcat gtctcctatc tccaacgcgc caaggtctta 15841 ggtgacattc tcattattgg ggttaattcc gacgacagca tccgccgtct caagggatca 15901 actcgtccta taaacccgtt agaagatagg atgcaggttc tggctgcact agcttgtgtt 15961 gactatctca ttccttttga agaagatagc cccagtcacc ttattagcaa gctacacccc 16021 aatatctacg tcaaaggcgg cgactacacg aaagaaactc tgccagaaac acccgtagta 16081 gaagaatacg gtggagtcat tgaatttttg ccctttttgg aaaatcgttc cacaagtaaa 16141 ataattgagc agatttccca aggtaaaaat cattcatgat aattgaaaca cctcggttag 16201 tgctgcgtcc atttgaagac aaggacacac taccattcat agcttatcgc tgtgatccgg 16261 aagtagcaaa gtatcagagt tgggatgcgc cctatcctga agcggaagca atcacgttta 16321 tagaagccat gaaacgtgca acaccaggtg taccaagaga gtggtatcag ttagcaattg 16381 aacttttagc gacaggcgaa actatcggtg attgtgcctt ttgtatttgc gccgatgatg 16441 aacgtcaagc agagattggc ttcacacttg cgccagccca cctttggcat tggctatgga 16501 accgaagctg taaaatgctt gctcaatcat ctgtttggtg agcgcaacct gcatcgcgta 16561 cgggcaaact gtgatcctgc caacatcgcg tctgtgaagc taatgcagcg gatcggtata 16621 cggtgtgaag gacattttgt taagagtttg tggttcaaag actcttgggt agatgaactt 16681 tggttcgcca ttttacgcga ggaatggaaa gtcacttaac tcaaaaagta tgtgaaaagt 16741 ttgctctata ggtttttttt tgactctaaa aatctctata aattggagaa aaatttgaga 16801 acttgttaaa attaaaagat ttctgatatc actatggtat tttctcaaaa atggtttata 16861 tggagtcaga gtttagtctg acaaagcgtc cctaaaagct ttttagtttt ttttgacgtt 16921 aaatcaactc tttctatact gatatatttc agttttgttc atttttttaa atacagccta 16981 atgcaagtgt ttgaagtagt tttagttctc aactggctga gcgtgcaaaa gaaaaaaaag 17041 agtgaatatc ccaatagcca tcagttaatt aaattacctt taaaagatat cagagcacta 17101 gttccttctg accacatcgg atataaaatc agggagaagt acttacttta atgttaagag 17161 gagtgattag ttaagatgag taagcttctt aaaaccattg cagcctccct agttttcggt 17221 atggcagttg ccacacctgt tgctaccttt ccatcaatag tccaagcaca aacaggtaca 17281 accccaacaa ctacagctac tcctacacca acaggtgcag ctactcctag cggcacaaca 17341 ggaggcaaaa 
aacctgccag caccacagga ggcaaaaaac ctgccagcac cacaggaggc 17401 aaaaaacctg ccagcacaac aggaggcaaa aaacctggca caacagggac aacaacacct 17461 atggactcgc ctactccagg ttccactaca ccttatacag gcgcgcctac acccactggt 17521 tcaccttcag ctacgccgag caaaaaacca taaggcaaaa agtcaatgta acaacagtca 17581 gggaatggaa ctgtaaccta aaagctattc aagcctgatc aaaggttgtc aattttgatc 17641 aggtatagcg aaaagatagt ataatagtcc tgatagtacg ggttaacaac cgtgcaccaa 17701 gaggatcaac ccctgacagc atagactggc gtttgagaaa cgtgccacag gcgaagtgca 17761 ggtgacctag acaagcgagt gcggtcttgg ggtttcccca agtgaagcaa ctcgcgtaga 17821 gcgtcgttag tagcgcctca aaagaagttt ttcataagaa cttcgacaca tatagcggtc 17881 agttagaccg tgttagtctc tggactggat agaaccgact cttccaggat gaaagaggaa 17941 gtaagcttca aagctaaccg ttgtgccaat tatctattat agaaaagggc atggttgaat 18001 agctttgagt aggtcttatg taacggaaga taggcgatgc atccctaggg gtatacgagc 18061 gatggcactc cgtgcctgcc cctttagggc atcgccattc aatactgata caaaagctcc 18121 ttgtgcatca tgccttggag ctttgttatt gcttgtgagt ttaagttatt atcacaagaa 18181 tcttctggtg aatattatat gcaaaaatcg ggctacatcc tggcattgga tttgggtaca 18241 acaggcaacc gtgcctttgt gtttaacgca gatggtaaga ttgttggtca ggcatatcag 18301 gaactgacac agtactatcc tcagcctgga tggttagagc atgattcttt agaaatttgg 18361 caggcgactt gctgggtgat gcaaactgcg atcaaaaatg cccagattgc tcctgatgaa 18421 attatcgctc ttgggctgac agtgcagcga gaaacttgct taatttggga taaaacgact 18481 ggtcaacctc tgcatagggc gatcgtctgg caagatcgcc gcacagctcc tttgtgccat 18541 catttacaag agcaaggata tgctcaggag atttctgatc gcacaggact cattgttgat 18601 gcctattttt cagccacaaa gctgtcatgg ctgttagagc attttccagg cgttgacctc 18661 aaaaatatct tagctggaac gattgatact tgggtgctgt ggaacctcac aggtggaaaa 18721 gttcacgcta ctgatcacag taatgccagc cgtacaatgc tgatgaatct agcaagctgt 18781 gagtgggatg aaacactgct gactctgttt aaaatacctc gtcatatcct gccccaaatt 18841 cagcctagtt tgggaacttt tggagtgact gatgctagct tgttaggcgt tgaaattccc 18901 attactgcta ttttgggaga ccaacaagct gctttgtttg gacatggctg tcattctccc 18961 ggattaatga aatgtactta cggtactggg agttttttag tcgctcacac tggctctcaa 
19021 atcgtgcgtt cccagcatca acttttgagt actttagcat ggactcaagt aaacgcaagc 19081 ggaactttgg acgtgggcta tgcattagaa ggtagcatat tcacgagtgg agcttgcatc 19141 aaatggctgc gcgatcgcgt tgatctgatc acaactgctg ctgaaactga ggtgctcgca 19201 aatcgagtca cagataatgg cggagtttac tttgtgcccg catttagtgg actgggtgcg 19261 cctcattggg atatgagtgc taggggagcg ttttttggaa ttactgcaat ggtacaacgt 19321 gagcatttgg tacgtgcagt actggaggca ttggcttacc aagttcagga agttgtacag 19381 gctattcttg catccaacac tatttccatt gaacggctga gtgtggatgg tggtgcctgt 19441 gagaacaact ttctcatgca gttccaagca gatgtgttag ggataccagt tgaacgtcct 19501 aagatgcgcg aaatgacagt gcaaggtatc gcgtttgcag caggtcttgc tgctggattt 19561 tgggataatg atcagctact cgtgcagcaa cgacgaattg agcgcgtgtt tttacctggt 19621 gtagggagaa atcacgcttt ggaaaacttt acaacttggc aaaaggcagt tgaacgcgct 19681 aagcactggg ctgattgatt ctttattcca cagaggtgtg gtcaaaagtg gttggtagag 19741 ggggtaggca tgggttatga aaccgtcttt gtttttgaca ctgccatatt tcctacactc 19801 aaactttgat ttgattaact attttaacat tgtttaaccg tgtaattcac ggaagctcgc 19861 gcttcgcttt tgtggtagtc tttagacctt agtgagaatg tacatgagag tgattagaac 19921 gctgataatg ttggcgtagc aaatctttac cataatcatt accgttgggg gttcggacga 19981 cagtatgaat atgattgcga gatggctctt tgtttcccaa cgcctgacct cgctgatggt 20041 caaattcaat taagattacg ggactgtgaa ttcgataata gaagacactg ttattgtcaa 20101 atcctcccat ccaagcaaag tgagtgtttt ccaaatgttg acggacttgt tccatcttaa 20161 cttctgcatg accattgcga atatcgcttg tatattccct aattaagtca agcagtattt 20221 cttgttgacc actcatcaac tccgcaaagg agattccttc ataacgcatt tcaaagttat 20281 ccctaaatgc tgaagtaaag acatctgtgg gaatttcttt ggcaaggact gctttgtctc 20341 tttgctcggc tgttagtgca cggattgtag caagtccctt actttcttct ggctcaaata 20401 cacgaattcc agcatgttca ccttctgttg ccctcacagg ttctgacccc atgaaggttg 20461 gtgtcatgac gacttgatct cctaaaacaa agtagttgat attaagatga tgcccatcaa 20521 gttgccaacc ccaaggctcc ttcatggaag gttgtcccat gaaactaaac caatatagtt 20581 cttttccata ctcgttcaat ttcttcgtaa gctgtcctag taggtcatca agtttgaatt 20641 gagggataaa gtcgcttcaa tttaatgcga g // LOCUS 
NODE_1557_length_20630_cov_4.760292 20630 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20630) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20630) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20630 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(235..2703) /locus_tag="DP116_13980" CDS complement(235..2703) /locus_tag="DP116_13980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease ATP-binding subunit ClpC" /protein_id="PRJNA477356:DP116_13980" /translation="MFERFTEKAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGTGV AAKVLKSMGVNLKDARIEVEKIIGRGSGFVAVEIPFTPRAKRVLELSLEEARQLGHNY IGTEHLLLGLIREGEGVAARVLENLGVDLSKVRTQVIRMLGETAEVSAGGQPRGNKTP TLDEFGSNLTQMAADGKLDPVVGRAKEIERVIQILGRRTKNNPVLIGEPGVGKTAIAE GLAQRIANKDVPDILEDKRVVTLDIGLLVAGTKYRGEFEERLKKIMDEIRQAGNVVLV IDEVHTLIGAGAAEGAIDAANILKPALARGELQCIGATTLDEYRKHIERDAALERRFQ PVMVGEPSVDETIEILYGLRERYEQHHKLKISDEALLAAAKLSDRYISDRYLPDKAID LIDEAGSRVRLINSQLPPAAKELDKELRQILKEKDDAVRAQDFDRAGELRDREMEIKA EIRAIAQSKTNSTRTEGDEPVVTEEDIAHIVASWTGVPVNKLTESESEKLLHMEDTLH QRLIGQEDAVKAVSRAIRRARVGLKNPNRPIASFIFSGPTGVGKTELAKALASYFFGS EEAMIRLDMSEYMERHTVSKLIGSPPGYVGYNEGGQLTEAVRRRPYTVVLFDEIEKAH PDVFNMLLQILEDGRLTDAKGRTVDFKNTLLILTSNIGSKVIEKSGSGFGFDVAEDQT EAQYNRIKSLVNEELKQYFRPEFLNRLDEIIVFRQLSKEEVSQIATIMLKEVFGRLTE KGITLEVTDKFNNRLIEEGYSPSYGARPLRRAIMRLLEDSLAEEILSGRIKDGDTAIV DVDETGNVIVRAEQRRELLTPVVE" gene complement(3096..3635) /gene="rimI" /locus_tag="DP116_13985" CDS complement(3096..3635) /gene="rimI" /locus_tag="DP116_13985" /EC_number="2.3.1.128" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribosomal-protein-alanine N-acetyltransferase" /protein_id="PRJNA477356:DP116_13985" /translation="MSKLELEIKHLTSADLSAVLELDQICFGGLWTLQGYQRELDSPN SDLLGLFSGGSVVRLLGISCFWSILDEAHITILAVHPQYHRQGFGQALLYSVLKTACK GGLERATLEVRASNSAAISLYQKFGFKIAGRRRRYYKDNDEDALILWLGDLQQPQFQK TLHDWDTIINDRLTKASWS" gene 3726..5267 /gene="lysA" /locus_tag="DP116_13990" CDS 3726..5267 /gene="lysA" /locus_tag="DP116_13990" /EC_number="4.1.1.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875604.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diaminopimelate decarboxylase" /protein_id="PRJNA477356:DP116_13990" /translation="MVSTHPAGVQHSGSQYLPQANPQNINLSPNQQLLPLTARVNNHD HLEIGGCDVTTLVEQFGSPLYILDEQTLRTACRQYQDSFKRYYKGESQVLYASKAWSC ISVCAIAHDAGLGIDVVSGGELYTALSAGVSPDKIYFHGNNKSHQELTFAIESGCIIV VDNWYELRTLVEIATEAGETALREGSQCGLGVSPSGASGVFPPQATALAQRDRRSYPK GGEVTSSSPIRIMLRLTPGIECHTHEYIRTGHLDSKFGFDPNQLDDLFTFVSQQSVIN CLGLHAHIGSQIFERQPHQDLAAVMVEWMNKAAGYGLSIKELNVGGGLGIKYTESDDP PSIEEWVKPICEVIQQACAANNLPLPKLLSEPGRSLIGTACVTAYSVGSSKVIPEIRT YVAVDGGMSDNPRPITYQSVYRAIVANRMSAPLTESVTIAGKHCESGDILIKDAQVPK IESGDILVVIATGAYNYSMASNYNRLPRPAAVIVANGEANLILRRETYQDLIRQDCLP QRLKS" gene 5570..6496 /locus_tag="DP116_13995" CDS 5570..6496 /locus_tag="DP116_13995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457558.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR00159 family protein" /protein_id="PRJNA477356:DP116_13995" /translation="MGDWWTQWLANLGRSQSLLIGTLDIGLVLALTYMILVIISERRT LWMVRGFIILMLASAISSRLHLQLLNFVLEKLVIGCAVAMAVALQSEFRRFLEQLGRG EFQQLFQPSRLAVPKYNSVIDEIVDAVRELSKNRIGALLILETTGPMDERDFSVPGVK LNAEVSKELIQTIFQPKTLLHDGATLIRGSRIVASGIILPLSGRTASRQLGTRHRAAM GITERVENCICVVVSEETGSISLAERGTLNRPLTITKLKESLETRFSPNVDREAGAPD LLSLGRQIRSKVLALFSRLLGLPSTASRRDKK" gene 6493..7242 /locus_tag="DP116_14000" CDS 6493..7242 /locus_tag="DP116_14000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320173.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="isoprenyl transferase" /protein_id="PRJNA477356:DP116_14000" /translation="MTAQHTELRDLPSDLKQELLPRHVAVIMDGNGRWAKHQGLPRIM GHKAGVDALKNLLRCCDDWGIGALTAYAFSTENWGRPTEEVDFLMTLFQRVLRQELRE MMKENVQIRFVGNLSALPLALQKEISHSMEATRNNRGIKFTVATNYGGRQEILQACRT IAHKAQQGLLQPDEIDEATFERHLYTEGVCDPDLLIRTSGEMRVSNFLLWQIAYAEIY ITEALWPDFNRNEFHHALCAYQQRDRRFGKV" gene complement(7408..7680) /locus_tag="DP116_14005" CDS complement(7408..7680) /locus_tag="DP116_14005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457562.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3143 domain-containing protein" /protein_id="PRJNA477356:DP116_14005" /translation="MLLPPPETPLYNHPLPQIEYWLREQGCERDEKQLHCWRLQQSTW QAELSLDIEQIIVRYLEAGDDGQDIQRAFKYSLSREDVERAVFSGP" gene complement(7724..8245) /locus_tag="DP116_14010" CDS complement(7724..8245) /locus_tag="DP116_14010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318707.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="J domain-containing protein" /protein_id="PRJNA477356:DP116_14010" /translation="MPQSSKPTYYSLLGLHPSASVIEIRRAYRELSKHYHPDTTKLPT AVATAKFQQLNEAYATLSNPERRFSYDLKIGYSRFGVIQAPLDLNHPVSDSYDWSKSA YLDASDRPLSAGEIFALFILGLTLLGCLLLAIAIGLTRGEGAFQTQRLHTPAIQQPNT SISQPTIPSEILN" gene complement(8413..9402) /locus_tag="DP116_14015" CDS complement(8413..9402) /locus_tag="DP116_14015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="J domain-containing protein" /protein_id="PRJNA477356:DP116_14015" /translation="MPNLQNFRDYYEILGVTKDASNEDLKKNYRRLARQYHPDLNPGN KAAEEKFKDIGEAYEVLSDTAKRAQYDQFSRFWKQKGFDKQAAARENGWNRPNGRTRN QEVDPGRYPDFDSFINQVIGVGVRKDTKNGTSTSGVESDPFSSTNRKVQYTVNSRPTR RDIEARLTLPLEKAYKGGTERIRLEDGRSLEVNMPPGMVTGQTIRLKNQGINDGDLYL KITVNPHYLFKIEGSNVFCQVPVTPTEAVLGGQIEAPTLDGPVKMSIPPGVRSGQRFR LANKGYPKENGERGDQLVEIQIVAQKNISEQERELYEKLRQIETFKPRADLVK" gene complement(9608..11809) /locus_tag="DP116_14020" CDS complement(9608..11809) /locus_tag="DP116_14020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140260.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molecular chaperone DnaK" /protein_id="PRJNA477356:DP116_14020" /translation="MGKVVGIDLGTTNSVVAVMEGGKPVVIANAEGMRTTPSVVGFTK EGERVVGQMARRQTLLNPQDTFFAVKRFIGRRYGELSPDSKRIPYTIRKDEVGNIKIS CPRLNKDFAPEEISAMVLKKLAEDASKYLGDPVTGAVITVPAYFNDSQRQATRDAGRI AGLEVLRILNEPTAASLAYGLDRGSTETILVFDLGGGTFDVSILDVGDGVFEVRSTSG DTQLGGNDFDKKIVDWLAEKFLETEGVDLRRDRSALQRLLEAAEKAKIELSSVSVTDL NLPFIAADQEGPKHLETRLTRSEFEGLSDDLLGRVRLPVKRAMKDAGLTPADIDEVVL VGGATRMPMIQQLVRAMIGKQPNQNVNPDEVVAVGAAIQAGILAGELKDVLLLDVTPL SLGLETVNGVMKKLIPRNTTIPVRRSDIFSTSENNQNTVEVHVVQGEREMAVDNKSLG RFKLYGIPPAPRGIPQIQVSFDIDANGILQVTALDRTTGREQSITVQGASTLSEGEIR QAIQDAERYAEIDRERKERVEKRTRAEALIIQAERQLREVALDFGMQLARSRRQRIDN ICRELRESLKENDDRGIDQAYADLQDALYELNREIRQYYDDDEEDDLFGTIREIFTGE KDRDPPGGSRPRGDDRDYPRDTYRDSFGGSRPRGDGAERPLRDAPEGSRARRDGAERP LRERDSYDRDYGKDYNRDYDRRGRSSDEGGYSRKPRPNYQDNWDDDDDNWL" gene 11927..12784 /locus_tag="DP116_14025" CDS 11927..12784 /locus_tag="DP116_14025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318704.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M28" /protein_id="PRJNA477356:DP116_14025" /translation="MNLKNQLENHLTWIARDRDPYLATAGHFFVQEYIRQQLAQWGSV EIHTFRVGSKTCQNLILNLPSQPEFQKRDLPPILIGAHYDAVPGTPGADDNATGIAVL LELARMFASVPTKYPLQLVAFDMEEYGLLGPSGASEYAAELRQQDQPLRLMISLEMLG YCDRTPGSQTYPSPLERFYPNRGDFIALIGNWRTIRDLISISRSIRKVGVPSHWLPVP NRGLILPLTRRSDHAHFWDQGYPAMMVTDTANLRNPNYHKPSDTIETLDLDFLTGVCQ GLESGIRHL" gene 12881..13444 /locus_tag="DP116_14030" CDS 12881..13444 /locus_tag="DP116_14030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181638.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_14030" /translation="MVNTSTIKKLTLEEFLALPEGDVNYEFVDSYAVPKVSPKFFHST LQLTLGLLLRTWCKGKGRIGSEWAIISKRQGQDWAPVPDLTYISYKRLPKVWKRNEAC PVPPELVIEIISPDQTMKEFEDKARDYFDAVVSRVWIVDPEAITIKVFVSAEESQVYS DNMPIMDTLLPGLELTVRQVFEEAELV" gene 13843..15204 /locus_tag="DP116_14035" CDS 13843..15204 /locus_tag="DP116_14035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319096.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribosome biogenesis GTPase Der" /protein_id="PRJNA477356:DP116_14035" /translation="MSLPIVAIIGRPNVGKSTLVNRLAEEQSAIVHDQPGVTRDRTYK PAYWRDREFTVVDTGGLVFNDDTEFLPLIRQQAMAALAEASVAIFVVDGRTGPTPADE EIAEWLRQQKVPVLLAVNKCESPEQGLIQAAEFWELGLSEPFPISAIHGSGTGDLLDV VINYLPATPDVPETNEIKVAIVGRPNVGKSSLLNTFVGETRAIVSPISGTTRDAIDTV VERNGQIYRLIDTAGIRKKKNVEYGPEFFSINRAFKAIRRADVVLLVIDALDGVTEQD QKLVGRITEEGRACIIVVNKWDAVEKDSYTIYDYEKHLQERLHFTEWAETIFVSALTG QRVEKILELVDKAAESHKRRVSTAVINEVLEEAMGWHSPTVSRAGRQGKIYYGTQVSS QPPTIALFVNDSKRFNDNYRRYIERQFREHLGFQGTPIRILWRSKKVREVEGTNANRA TRV" gene 15340..16266 /locus_tag="DP116_14040" CDS 15340..16266 /locus_tag="DP116_14040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865530.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14040" /translation="MDLLRSLPIGLYLEQPQTWLHKLDPRVKLIWLMSFLTTYILANN LWRVLLVVLLIIATLIARIPRKVWQQQMGLVLTASFFILVILPITPDGLGIKYQPRLP INQQVLTEQSASTPSTPAALSANKHEGYKYVLFDKGSIKVTRRSLDLAITASTMLFTL IYSSNLYLLTTATEEITAAIESLMQPLRRLKLPVTEITLTLTLSLRFIPLVLEEIQNL IRSVMTRAINWKKLGLKGAVKVWMLVAERLLENLLLRAEQMASAMTVRGFTSPSEHRV QWHDLRLRRGDWLAIAILILFWGVRLAIGTEV" gene complement(16840..18084) /locus_tag="DP116_14045" CDS complement(16840..18084) /locus_tag="DP116_14045" /inference="COORDINATES: protein motif:HMM:PF00534.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14045" /translation="MKILHVTQGYTPAIGGTELLIQRVSEELVRQFGDEVTVFTTNCF NGEGFFNPKLPRLQPGDEEINGVKVRRFPVNSRMSQMVRFPQKVAYRLRLPFNEHLRT IAGGPIIPGLKKAIQEFPADIIAASSFPLLHMYAALNGARNSGRPCVLHGGIHPQDNW AFQRSKIYSAIEQSTYYLSNTKYEADYVIQQGVSPERVVVIGVGVDPEPFEQISSTQA KKHFGFKEQPVVGFIGQFGGHKGVDTLVQAMTLVWQIFPDVQLLLAGAKTMFAEKVEN IVNQLPESYKKQVKFYYNFSNEEKPFLFSAVDVFAYPSGFESFGIAFLEAWAASKPVI GCRAGAIPWVIDEGVDGLLVDYKNQEMLAEAIIELLKNSDWAKTLGDAGREKVLSRYT WSKVAQKFREVYVEAMRRDNTK" gene complement(18097..19452) /locus_tag="DP116_14050" CDS complement(18097..19452) /locus_tag="DP116_14050" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14050" /translation="MPELENIKRPHLLIVSNDVVDTKMAGPGMRYLELARVLSQELNV TLAIPSETTLEVPNIKLVRYWDERPKSLQVLVENSDITLVSGYMVEKFPFLHQTQKRI VVDLYDPFILENLHYHLNKPLPAQESLNNRAVDVTNSLARLGDFFICGSDRQRDFWIG LLASNGRINPRNFAKDTSLRSLIDVVGIGFLNREPRPNPTLRGIHPGFPEDARIVLWG GGIWDWLDPLTLVKAWTGVIAEYPQARLVFLGTRHPNPQVAPHKMAEQVQVLAAEIGE KDRTIFFYEWIPYQEREALLCEADIGVTLHPPHIETRYSIRTRVLDYLWARLPVLVSE GDVTSEWVKEYGVGKAVPPLDVEAVRVAIAQILERPKSSWASAFEPLRNSLNWSQVVE PLRRYCLQPEYAPDRQTRKLVTPTVVKRGRLARVIEIWRTEGSRELLKRIRLKFRRRP V" gene complement(19452..>20630) /locus_tag="DP116_14055" CDS complement(19452..>20630) /locus_tag="DP116_14055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_14055" /translation="NNCSFNTNHRKLDSTYTNFSDDNPYPINIIHVNGDMINSFLNSV GIEYFKNKYNIGFWAWELPDFPKEWLTAFNLFDEIWTPSNYSVEAISEAYCQGQNAPL SPIPVLKVMHSLSLPQPTATKQSLGLPDNKFIFLFIFDFYSVFERKNPIAVIEAFQRA FGKDEQVLLVLKFSNAHKFPDKYKQLKDLAQDFKNIKFIEKYLLKDEINALIYHCDCY VSLHRSEGFGLTIAEAMFYGKPVIATAYSGNTDFMNVSNSFPVKYSLTTLTEDYAHYK KGSTWAEPDIDHAAYLMRHVYNNYEEAKQIGAKACEDIKSLLSPKLIGQKIKNRLEYI THNSSKLIESPQLQTELKHKNAEIERLKALVDYMEKSRFWQLRNQWFKLKGILQSKFN " BASE COUNT 5514 a 4490 c 4525 g 6101 t ORIGIN 1 catgtggcgt ccctgggcta aagccacagg gttttcatct cacccactat aaagccattt 61 tggatattag cttgaaattc tacagcaaac atagctagcc tctgagaaga actcatcttg 121 tcttattata gaactcgctt ccttctgaaa tgaaatacaa aaaaaggtag agctgttgga 181 tgctctacct ttcttttatc aaaagaataa agaaatatta agtcttcttt gtttctattc 241 aacgacaggg gtcaacaatt cccggcgctg ttcagctctg acaatcacat taccagtttc 301 atcgacatca acaatggctg tatcaccatc tttaatgcga ccagatagga tctcttcggc 361 taggctatct tctaacaagc gcataattgc ccgacgtaat ggtcttgcgc cgtagctggg 421 gctataacct tcttcgatga gacggttgtt gaatttatcg gtaacttcca gtgtgatgcc 481 tttttctgtc aggcgaccaa acacttcctt gagcatgatg gtagcgattt gagaaacctc 541 ttccttgctc aattgacgga agacgatgat ttcatccagt cggttgagga actcaggacg 601 gaagtattgc ttgagttctt cgttaaccaa ggatttaatc cggttgtact gtgcttctgt 661 ttggtcttca gcgacatcaa agccgaagcc gctaccactc ttctcgatca ccttggaacc 721 gatgttggaa gtcaaaatca gcaaggtgtt cttgaagtct acggtgcgac ctttagcatc 781 ggttaagcga ccgtcttcta aaatttgcaa tagcatatta aagacgtcgg ggtgtgcttt 841 ttcgatttcg tcgaatagca cgactgtgta aggacgacgg cgcacggctt cggtcagttg 901 accgccttcg ttatatccga cataaccagg aggcgaacca atcagcttgc tgacggtatg 961 gcgctccatg tattccgaca tatctaagcg aatcatcgct tcttcggaac cgaagaagta 1021 agaagccaaa gcttttgcta actcggtttt acctacacct gttggaccag agaaaatgaa 1081 gctagcaatg ggtcgattag gattcttcaa accaacacga gcgcgacgaa ttgcccgtga 1141 aactgctttc acagcatctt cttgaccaat caagcgttga tgcagggtgt cttccatgtg 1201 cagcagcttc 
tcggattcag attcggtgag tttgttcacc ggaactccag tccaggaagc 1261 gacaatgtga gcaatgtctt cttccgtaac cacaggttcg tcaccttcag tgcgggtaga 1321 attcgtcttg ctttgagcaa tggcgcgaat ttcggctttg atttccatct cgcgatcgcg 1381 caactcccca gccctatcga agtcttgagc gcggactgca tcgtcttttt ctttcaaaat 1441 ctgacgcagt tctttgtcca actccttagc tgcggggggc agttgggagt taatcaagcg 1501 cacacggctt cctgcttcgt caatcaagtc gattgcttta tctggaagat aacggtcgct 1561 gatatatcgg tcagatagct tcgccgctgc aagcaaagct tcatcggaga ttttcagctt 1621 gtggtgctgc tcgtagcgct cgcgcagtcc atagagaatc tcaatcgttt cgtctactga 1681 cggttcaccc accatgactg gctggaagcg acgctctagg gctgcatccc gctcgatgtg 1741 tttgcggtat tcatctagag ttgtggctcc aatacattgc agttcacctc gcgccaatgc 1801 tggtttgagg atatttgctg catcgattgc cccttctgcc gcacctgcac caattaaggt 1861 gtgtacctcg tcaataacga gaacaacatt gcccgcctga cggatttcat ccataatttt 1921 tttcaggcgt tcttcaaatt caccccggta tttggttccg gcgacaagta aaccgatatc 1981 cagggttacc acacgcttgt cttctaagat gtccggcaca tctttattag caatgcgttg 2041 tgctaaacct tcggcgatcg cggttttacc tacccctggt tccccaatca ggacgggatt 2101 attttttgtc cggcgaccca aaatctggat cacacgttca atttctttgg cgcgtcccac 2161 tactggatcg agcttgccat ctgctgccat ttgggtcaga tttgagccaa actcgtccaa 2221 tgttggggtt ttgttcccac gtggttgacc accagcgctc acctcagctg tttctcccag 2281 cattcggatc acttgggttc ttaccttaga aagatccacc cctaggtttt ctagaactct 2341 ggctgctaca ccttcccctt ctcggatcaa gcccaacagc agatgctcgg tgccaatgta 2401 gttatgccct aattggcgcg cttcttccaa ggatagttcc agaacgcgct ttgcccgtgg 2461 cgtaaacggt atttccacag cgacaaagcc agagcctcga cctatgattt tttctacttc 2521 aatccgagcg tctttgagat tgacacccat agacttcagc accttagccg ccactccggt 2581 accttctcca atcagaccca agaggatctg ctcggtacct acgaaattat gcccaagacg 2641 gcgagcttct tcttgggcta gcataattac ctttatggct ttttctgtga agcgttcaaa 2701 catatggcat tcatcccatc acctgcgtcg tgccggtacg ctgattttag cacagactaa 2761 aattgcggat gcttgtataa acaaatagac atccgacacg acttaaccca ggttgatggg 2821 gcaattgcta attttttatg acttttcttt aattttcttt acttttgtac ggtcagtgtg 2881 ctgaggagcg taatcttatt 
taccttttca ggcttgacca cggataatta gtggttggat 2941 tgaaaagttt tttattcaat cccaaataac accaaatatc atattctcgg tttcatatca 3001 agcgttattg gaacagtatg acaatttctt atataagtaa aaataaaacc caaatttcaa 3061 tcaaaaaagc taataagagg cgcatgatga ctgactcagg accaagacgc ctttgtcaag 3121 cgatcgttaa ttattgtatc ccagtcatgt aaagtttttt gaaattgtgg ctgttgcaaa 3181 tctccaagcc aaagtatcaa cgcgtcctca tcgttatctt tgtaatagcg ccgccgccga 3241 ccagctattt taaagccaaa tttttgatat aaagaaattg ccgctgaatt agaagctcgc 3301 acttcgaggg tcgctcgctc caaaccgcct ttgcaagcag ttttcagcac agaatacaaa 3361 agagcctgtc caaagccttg acggtggtat tgaggatgaa ccgccaagat agtgatgtga 3421 gcttcatcta aaattgacca aaaacaactt attcccagta gcctgacaac agagccaccg 3481 gagaacaaac cgagtaaatc actgttaggg ctatctaact cgcgttggta gccttggaga 3541 gtccataaac caccgaagca gatttggtcg agttctagca ctgcactcaa atctgctgaa 3601 gtcaggtgtt tgatttctaa ttctaatttg ctcacacttg ttcgtaatgg tacaagggct 3661 ttgtaaactg ggaagcggag gaattactca cccctataat ttagagaagg acaactctta 3721 tagttatggt atcgacacac cctgctgggg ttcaacattc tggtagtcag tatttaccac 3781 aagcaaaccc tcaaaacata aatctttcac caaatcagca acttctgccg ttaacagcca 3841 gagttaacaa tcatgatcat ctggaaattg gcggttgtga tgtcacaacg ctagttgaac 3901 aatttggttc accgctatat attttagatg agcaaacctt gcgaacagct tgccgtcagt 3961 atcaagacag cttcaagcgt tactacaagg gtgaatctca ggttctgtac gcttctaagg 4021 cgtggagttg tatttctgtt tgcgcgatcg cacacgatgc tggtttagga atagatgtcg 4081 tctctggggg tgaactctac acagccctga gtgcgggtgt cagtcccgat aaaatctatt 4141 tccacggcaa taataaatcc caccaagaac tgacttttgc cattgagtct ggctgcataa 4201 ttgtagtaga taactggtat gagttacgta ctttggtgga gatagcgacg gaagcaggtg 4261 agacagcgct gcgggagggg agccagtgcg gtcttggggt ctccccaagt ggagcatctg 4321 gcgttttccc tccgcaggcg actgcgttag cgcagcgtga ccgaaggtca tacccgaagg 4381 gaggagaagt gacctcctca tctcctatcc gcatcatgtt gcgattaaca ccaggtattg 4441 aatgtcacac acatgaatat attcgcacgg gacatttaga cagcaaattt ggctttgatc 4501 ccaatcaact agatgatttg tttacttttg tgagtcaaca atctgtcatt aactgtctgg 4561 gattacacgc tcacataggt tcccaaattt 
ttgaacgtca accgcatcaa gatttagctg 4621 cggtgatggt ggagtggatg aataaagctg ctgggtatgg tttatcgatc aaagagttga 4681 atgtgggagg cggcttaggg attaagtata cagaatcgga tgatccacct agcattgaag 4741 aatgggtgaa accgatttgt gaagttattc aacaagcttg tgctgcaaac aatttacctt 4801 tgcccaaatt attatctgaa ccagggcgtt cactgattgg aacagcttgt gtgacagcat 4861 actctgttgg ctcatccaaa gttattccag aaatacgtac ctacgtagca gttgatgggg 4921 gaatgtctga caatccacgc ccaatcacat accagtctgt gtatcgcgca atcgttgcta 4981 accgaatgtc tgctccactt acagaaagtg taacaatagc tggcaaacat tgtgagtctg 5041 gggatattct aataaaagat gcccaagtgc caaaaattga atctggggat attcttgtag 5101 ttattgccac tggtgcttac aattacagta tggcatctaa ctacaatcgt ctgccccgac 5161 cggcagctgt tatagtagcg aatggcgaag caaacttgat tttgcgacgc gaaacttatc 5221 aggacttaat tcgacaggac tgcctaccgc aaagacttaa aagctagtta ttaggagcca 5281 ctgagtcctt ggaggtttcc tgcaaagatc gctgtgcaat taggcagtgc gcccttacgg 5341 gtttcccgac agtcgcgcaa ctgccgtagg tgcaagcgtc ctttcgcgta ccgtgttagc 5401 tcctgccaga ggaggcacgt gaacacagaa cacagaagca atgcttactg gcattgcccg 5461 tgccagtgga ctgtagaagg cacgtaactg gcgtttatta attaatcacg cacttgagct 5521 gttaactgtt gacttacttt attagaggca aaaggttaaa tccagagtca tgggagattg 5581 gtggacgcaa tggctggcaa acctaggaag gtcacagtcc ttgctgattg ggactctgga 5641 tattgggtta gtactggcgc tgacgtatat gatacttgtt attattagtg agcgccggac 5701 attgtggatg gttcggggat ttattattct gatgctcgca tcagcaatca gtagcagatt 5761 gcatttacaa ctgctgaact ttgtactcga aaagttagta attggctgtg ctgtggcaat 5821 ggcggttgct ctccaatcag agttccgccg atttttggaa caattaggac gcggcgaatt 5881 ccagcagttg tttcaaccct cccgtctggc ggttcccaaa tacaatagtg tcattgatga 5941 aattgttgat gctgttagag aactgtcaaa aaatcgaatt ggagctttgc tcattttgga 6001 gaccactggt cctatggatg agcgggattt ttctgtgcca ggagtaaagc tgaatgctga 6061 agtgtcaaag gaacttatac agacaatttt tcagccgaaa actttattac acgatggggc 6121 gacgttaatt cgtggctcgc ggattgtggc atcaggtata attttaccgc tatcgggacg 6181 cacagcctca cgccagttgg gaacacgcca ccgagcggcg atgggaatta ctgagcgagt 6241 cgaaaattgc atttgtgtcg ttgtatcaga agaaacgggt 
tctatttcct tagcggaacg 6301 aggaacctta aatagaccgc tgaccatcac gaagctcaaa gagtctcttg agactcgctt 6361 ttccccaaat gtagatcggg aagcaggtgc tcctgatttg ttaagtttgg gtcgtcaaat 6421 tcgtagtaag gtactagcac tgttttcacg tttactcgga ttaccatcga ccgcttcgcg 6481 acgagataaa aaatgacagc acaacacact gaactgcgag atttgccctc tgacttgaaa 6541 caagaattat tgcccagaca cgttgcggtc attatggatg gcaatggtcg atgggctaag 6601 catcaaggtc taccccggat tatgggtcat aaagctggtg tagatgcgct gaaaaattta 6661 ctacgttgtt gtgatgattg gggaattggg gcgctcacag cttatgcttt ttccactgag 6721 aactggggaa gaccgaccga agaagtcgat tttttgatga ctctatttca gcgagttttg 6781 cgccaagaac tgcgcgaaat gatgaaggag aatgttcaaa ttagatttgt gggcaatttg 6841 agcgctttac ccctagcgct tcaaaaagaa atctctcatt caatggaagc aacaagaaac 6901 aatcgcggta taaaattcac agtcgcaacc aactacggag gtagacagga gattttacag 6961 gcatgccgta caattgctca taaagcacag caaggtttac ttcaaccaga tgaaattgat 7021 gaggcaacat ttgaacgtca cctctacaca gaaggcgttt gtgacccaga tttattaatc 7081 cgtacaagtg gggaaatgcg ggtctcgaat ttccttctct ggcaaatcgc ttatgcagaa 7141 atttatatca cagaagctct gtggcctgat tttaaccgca atgaatttca tcacgccctg 7201 tgcgcttatc agcaacgaga ccgtaggttt ggcaaagtgt gagaagaggg gattatgagg 7261 gaagcagaaa gttcttagca gggtgttggg gagacgaaga gctaatgtcc attaactttt 7321 cttcccccca gcgccccgtt tcctcttctt agctcttgtt acctcacttc cactctccca 7381 cactccctca tctggtctat ccttgttcta cggtcctgaa aaaacagctc gttctacgtc 7441 ttctcgactg agggaatact tgaaggcgcg ctggatatct tgtccatcat ctccagcttc 7501 gagatatcgc acgatgattt gctcaatatc tagcgagagt tctgcttgcc aagtagattg 7561 ctgtagccgc cagcaatgca gttgtttttc gtctcgttca cagccttgct ctctcaacca 7621 atattcaatt tgagggagag gatggttgta taagggtgtt tcagggggag gaagaagcat 7681 aagttaagtc aatagtcaat agtcaaaagt taaaaagtaa tttttaattt agaatttctg 7741 aagggatagt tggttgagaa atactggtat tgggttgttg tattgctggg gtatgtagtc 7801 gttgtgtttg aaaggcacct tcaccacggg ttaaaccgat ggcgatcgct aacaacaagc 7861 aacccaaaag tgtcagtcct agaataaaca aagcaaatat ttcaccagcg ctgagtgggc 7921 gatcgctggc gtcaaggtaa gcagatttag accaatcgta agaatcggaa 
actgggtgat 7981 tcaagtcgag gggcgcttgg atgactccaa agcgcgaata accaattttc aggtcgtagc 8041 taaaccgccg ttctgggttg cttagggtgg catacgcctc attaagttgc tgaaatttgg 8101 cagttgccac agcagtgggt aattttgtgg tatctggatg atagtgcttg ctcaattccc 8161 ggtaagcacg acggatttct attaccgatg ctgagggatg caatcctaga agggagtaat 8221 aggttggttt gctgctttgt ggcattgccc cattctgatt cacagccgtt taactcacaa 8281 gcgcgtaatg gttttttcta atattctaaa cctaattatt ccctcagcag tttaaagccg 8341 tcacgagtct gtgattggtt gttggttgtg agttgtcaaa aatgactacc aacaattaac 8401 gattcataat cgctacttta ccaaatccgc gcggggttta aaggtttcta tctgccgtaa 8461 cttttcgtaa agttcacgtt cttgctcact aatatttttc tgggcgacaa tctgaatttc 8521 taccaattgg tcaccacgtt caccgttttc cttggggtag cctttgttgg caagacggaa 8581 ccgttgacct gaacgtacac cagggggaat agacattttt actggtccgt ctagagttgg 8641 tgcttctatc tgtcctccta aaactgcttc agtgggagtg actggtacct gacaaaagac 8701 atttgagcct tctattttaa ataaataatg agggttgact gtaattttta aatataaatc 8761 gccatcattg ataccttggt ttttgaggcg gatggtttga cctgtgacca taccgggagg 8821 catattgact tcaagcgatc gcccatcttc caggcgaatc ctctcggttc cacctttgta 8881 tgctttttcc aaaggtaacg ttagtcttgc ttctatatct cggcgagtgg ggcgagaatt 8941 aactgtgtat tgaacttttc tgtttgtgga actaaaggga tcactctcca ctccacttgt 9001 ggaagttcca tttttggtgt ctttgcgaac acctacacca atgacttgat tgataaaact 9061 atcaaaatca ggatatctac caggatctac ctcttgattg cgtgtacgac cgttaggacg 9121 attccagccg ttctctcgtg ctgctgcttg tttatcaaaa cctttttgtt tccaaaagcg 9181 actaaattgg tcgtattgcg cccgtttagc tgtgtcagaa agtacttcat aagcctctcc 9241 gatatcttta aatttttctt ctgctgcttt gtttcctgga ttgaggtctg gatgatactg 9301 acgcgctaac cgtcgatagt tctttttgag atcttcattg gacgcatctt tcgttactcc 9361 taaaatctcg tagtaatccc gaaaattctg caaatttggc atagtttcag ttgtttatct 9421 ttaattttta aactttaact tttttactgg ctattcatta actcttgaac gactaactaa 9481 ttacctatta cctcttacga atgactaaat ccgaatttga agtcaacctt ttactcttga 9541 ttccatttgt gaggacttta tttgaggtac taggggcggg gtttcccgcc ttaatatcta 9601 tcagacgtta taaccaatta tcatcgtcat catcccagtt atcctggtaa ttaggacggg 
9661 gtttgcgtga ataaccgccc tcatcagagg aacgacctct cctgtcatag tctctgttgt 9721 agtccttgcc ataatccctg tcataggaat cgcgttcgcg cagcgggcgc tctgcgccat 9781 cgcgccttgc gcggctccct tcgggagcat cgcgaagcgg gcgctctgcg ccatcgcccc 9841 ttgggcggct tcctccaaac gaatcgcggt atgtatccct tgggtaatcg cgatcatcgc 9901 ctcttgggcg gcttcctccg ggaggatccc ggtctttttc gccagtgaag atttcacgga 9961 tagtaccaaa caagtcatct tcttcatcat catcgtagta ctggcggatc tcacggttta 10021 gctcatataa agcatcttgc aagtcggcgt aagcttggtc aatgcctctg tcatcatttt 10081 ctttcaaact ttcgcggagt tctcgacaga tattatcaat tctttgacgg cggctgcgag 10141 caagctgcat cccaaaatcc aacgccactt ctctgagttg tcgttctgct tgtataatca 10201 atgcttcagc acgagtgcgt ttttctaccc gttctttgcg ttccctgtca atttcagcgt 10261 atctctcagc atcctgaatt gcctgcctga tttccccttc gctcaaggta gaagcacctt 10321 gaactgtaat actctgttct ctacctgtgg ttctatccag agctgtcacc tgtaatatcc 10381 cgtttgcatc tatatcaaat gatacttgta tttggggtat gccacgtggt gctgggggaa 10441 taccatagag tttaaaccgt ccgagagact tattatctac tgccatttcc cgttcgcctt 10501 ggacgacgtg gacttccact gtattttgat tattttctga tgtggagaaa atatcagagc 10561 ggcgtactgg tatagtggtg ttgcggggaa tcagtttttt catcacgccg ttgacagttt 10621 ctaatcccaa agacagcggc gtgacatcca acagcaggac atccttgagt tcaccggcga 10681 gaatacccgc ttgtattgct gcacctaccg ccacaacttc atctggattg acgttttggt 10741 tgggttgttt gccaatcatt gcccgtacaa gctgctgtat cattggcatt cgcgtagcac 10801 caccaaccag cacaacttca tcaatgtctg cgggtgttaa accagcgtct ttcatggcgc 10861 gtttgactgg aaggcgtacg cgtccaagta agtcatcaga taaaccttca aactcagaac 10921 gagtcaaacg agtttcgaga tgtttgggac cttcttgatc agcagcgatg aaaggtaagt 10981 taagatcggt gacgctgaca gaggaaagtt cgattttggc tttttccgct gcttctaaca 11041 aacgttgtaa agctgagcga tcgcgtctta aatctacccc ctcggtttcc aaaaatttct 11101 ctgccaacca atcgactatt tttttatcaa aatcgtttcc cccgagttgc gtatctccac 11161 tcgtggatct gacttcaaac actccatccc caacgtcaag aatcgacaca tcaaacgtgc 11221 caccgcccaa gtcaaagact aatatagttt cagtgctgcc gcgatccaat ccgtaagcca 11281 aagaagcggc tgtcggttca ttgagaattc gcagtacttc caaacctgct 
attctaccag 11341 catctcgtgt tgcttgccgc tgagaatcat taaaataagc aggaacggtg ataactgccc 11401 ctgtgactgg atcaccaagg tacttactgg cgtcttctgc cagtttcttt agcaccattg 11461 cggaaatttc ttctggagca aaatctttat tgagacgagg acaagaaatt ttgatgttac 11521 caacctcatc tttgcgaata gtatacggga ttcgctttga atctgggctt agttcgccat 11581 acctgcgccc aatgaagcgt ttcacggcga aaaaggtatc ttgtgggttg aggagggttt 11641 gtcgtcgtgc catttgccca acaactcttt cgccttcttt ggtgaagcca actacggagg 11701 gagtcgttcg cattccttct gcattggcaa tcaccaccgg cttgccaccc tccatgacgg 11761 caactactga gttggttgta cccaagtcaa tgccgactac cttgcccatg cgtttttgtt 11821 ctcctatagt gtcttataaa ttaatttgtt tagatctaat gcttgattta ttgtatcttg 11881 ctgttgagaa ttacacggtc aaacagtgtt gcttgttagt taatttgtga atctaaagaa 11941 tcaactggaa aatcacctca cctggatagc ccgcgatcgc gatccctatc tagcgactgc 12001 tggacatttt tttgtccaag aatacattcg tcaacaactt gcacaatggg gaagtgtgga 12061 aatccacacc tttagagtcg ggagcaaaac ttgtcaaaac ctcattttga atttaccttc 12121 acaacctgag ttccaaaaaa gagatttgcc tcctatttta attggtgccc attatgatgc 12181 tgttcctgga acaccaggtg cagatgataa tgctacaggt atagcagttt tgctggaatt 12241 ggcaagaatg tttgcatccg ttccaacaaa atatcccctg caactggtcg cttttgacat 12301 ggaagaatac ggcttactag gtccttctgg tgcttctgag tatgcagccg agttgcgaca 12361 acaagaccag ccgctacgct taatgatttc tctagaaatg ttgggctatt gcgatcgtac 12421 ccctggttcg caaacttacc catctccttt agaacgcttt tacccaaatc gcggtgattt 12481 tattgcttta attggcaatt ggcgcacaat tcgtgactta atttctatta gccgcagtat 12541 tcgtaaagtt ggcgtaccca gccattggct accagtacct aacagaggtt taatacttcc 12601 actgactcga cgaagtgatc atgcacactt ttgggatcaa ggttatcctg caatgatggt 12661 gacagataca gccaatctgc gaaatccaaa ctatcataaa cccagcgata ctattgagac 12721 tttggattta gattttctca ctggtgtctg tcaaggtttg gaaagcggta ttcggcattt 12781 gtgaaaaaat agataacttt tcttaactga acagtattga actataagct ttggaaaaag 12841 agtttttaga atatatttat cagcgacagg taaccttgtt atggttaata caagtacaat 12901 aaaaaaactc accttagaag aatttctggc gcttccagaa ggagatgtca actatgagtt 12961 tgtagatagt tatgcagtgc ctaaagtgtc 
tccaaaattc tttcattcga ctttacaatt 13021 aactctggga ctcttgcttc gtacttggtg taaaggaaaa ggtagaatcg gctcagagtg 13081 ggcaattatt tcaaagcgtc aggggcaaga ttgggcacct gtaccggatt tgacttatat 13141 ttcttataaa cgtttaccta aagtgtggaa gcgcaatgaa gcttgtccgg ttcctccaga 13201 attggtaatc gaaattattt ctccagacca aacaatgaag gaatttgaag acaaagcacg 13261 agattatttt gatgcggtcg tgtcacgggt ttggattgtc gatccagaag caataactat 13321 caaagttttt gtatcggctg aagaaagtca agtttacagt gataatatgc ctattatgga 13381 tactctgctt cctggtttgg agttaactgt tagacaagtt tttgaagagg ctgaattggt 13441 ttgaatgttg gtggtaatca ctttttgaaa cgtttattta acagcaatta aacctaccca 13501 cctcggaatt cattccgagg ctaatagctc aagtcggcta taagccgact agaagatttt 13561 catccttatt agccactgga tccatgctga taactgcttt ctcattactg aagttcttgt 13621 cttatagccc aatacagttc agttcagaga atgataggtt gggtttcgtt ccagcgaaac 13681 ccaacaaacg ctcttatttg ttgggtttgg ttcctcaact ttccaaacat tctctaagtg 13741 agttattgtc tgtaaactaa gtttactgag ttttttctgc aacagacaat agtatcagct 13801 taaaccgcta gaatagagat ttggagcatt gtgcatccgc ttatgagtct tcctatagtt 13861 gctattattg gtcgcccaaa tgtgggcaaa tctaccttgg tcaatcgtct tgctgaggaa 13921 caatctgcta ttgttcatga tcaaccaggt gtgacgcgcg atcgcactta caaaccagca 13981 tactggcgcg atcgcgagtt tacagttgtt gatactggtg gtttggtctt taatgatgac 14041 accgaatttt taccactaat tcgccaacaa gcaatggcgg cgctagcaga agcaagtgtt 14101 gctatttttg tagtagatgg tcgcacagga ccaacacccg ctgatgaaga aattgctgag 14161 tggttacgac aacaaaaagt ccctgttctg ctggctgtta ataaatgcga atccccagaa 14221 caaggtttga ttcaagctgc tgaattttgg gaattgggat tgagcgaacc tttcccaatc 14281 tctgctattc atggaagtgg tacaggagac ttacttgacg tagtgattaa ctaccttcct 14341 gctacaccag atgtcccaga aaccaatgag attaaagtgg caattgtggg acgcccaaat 14401 gtgggtaaat ccagtttatt gaatactttt gtgggagaaa caagggcaat tgttagccca 14461 atttctggta caacccgtga tgctattgat actgtcgtgg aacgaaacgg gcaaatttac 14521 cgcttgattg atacggcagg aattcgcaaa aagaagaatg tagaatatgg tcccgaattc 14581 tttagcatca accgtgcttt taaagcaatt cgtcgcgctg atgtggtttt attggtgata 14641 gatgcccttg 
atggtgtcac agaacaagac caaaaattag ttgggcgaat tactgaagaa 14701 ggtcgagctt gtattattgt ggttaataag tgggatgctg tcgaaaaaga ctcttacacg 14761 atttatgatt acgaaaaaca tctgcaagaa cgactgcatt ttactgaatg ggcagaaacg 14821 atttttgtca gcgccttgac tggacaacgg gtagaaaaga ttttggaatt ggtcgataaa 14881 gcagccgagt cacacaaacg tcgtgtgagt acagcggtga tcaacgaagt tttggaagaa 14941 gcaatgggtt ggcactcgcc gacagtctct cgtgctggac gccaaggcaa aatttactat 15001 ggcacacaag ttagtagtca accgcctaca atagcattgt ttgtcaacga ttccaaacgc 15061 tttaacgata actaccgccg ctatatcgag cgacaatttc gcgaacattt gggctttcaa 15121 ggaactccta tccgtatact atggcggagt aaaaaagtcc gtgaagttga aggtactaat 15181 gccaatcgag ctacccgcgt gtagtgctca caagtcatca gtcatgagta tttggcaatt 15241 gacccttacg ggaacgcggg tcgcctctgt cgggaaagcc gtcattcgcg ctggctcact 15301 catgactcat gattaacgac tcatgattaa taattataaa tggatttact gcgatcgcta 15361 ccaattgggc tttacttaga acaacctcaa acttggttac ataaactcga cccacgagtc 15421 aagttaatct ggttgatgag ctttctgaca acttacattc ttgccaataa cttatggcgc 15481 gtgctgttgg tggtactgct gattattgct actttaatag cgcgaattcc ccgaaaagtt 15541 tggcagcagc aaatggggtt ggtgttgaca gcgtcctttt ttattttagt cattttacct 15601 atcactcctg atggacttgg tatcaaatat caaccgcgct taccaattaa ccaacaagtc 15661 ttaacagaac aatcagcttc gactccgtct actccagcag ctttgtcagc caataagcac 15721 gaaggttata agtatgtgtt gtttgacaaa ggttcaatca aagtgactcg ccgttcttta 15781 gatttggcaa taactgcgag tacaatgctg tttactctta tttacagtag caatttgtat 15841 ttgttgacaa ccgcaacaga ggaaatcacc gctgctatag aaagcttaat gcaacccttg 15901 cgacgcttga agttacctgt gactgagata actttgactt taactttatc cttgcggttt 15961 attccccttg ttttagaaga aatccagaat ttaattcgtt ctgtaatgac gagggctatt 16021 aattggaaaa agttaggatt gaaaggagct gttaaagttt ggatgttggt agcagaacgc 16081 ttgttggaaa atttgttact cagggcagag caaatggcaa gtgcaatgac ggtgcgaggg 16141 tttacaagtc ctagtgaaca tcgtgtgcaa tggcacgatt tacgattgag aagaggtgat 16201 tggctggcga tcgcaatttt aattttattc tggggagttc ggctggcaat aggaactgaa 16261 gtttaaaaaa tatgattatt aaaggcaaca acaattagtt tcttaacttt tagtttctta 
16321 actttaacgg ataggaacaa atccagcctg ctcaataatt ttctgacctt ctgttgatag 16381 caacatactg gcataggcta ttgcggcttt ttcatcggct gagcgataag cctctggctt 16441 tagcttgtgc gtatcgcgac gcaacactat aaaccaatac ggttcagata aggctaaaaa 16501 ctaatgctga cgtaggttgg gcccttggaa cgaaacccaa cacctacctc gtttttgttg 16561 ggttgcgctg cgcttaaccc aacctacaat tcttcttaac tgaaccgtat tgcactataa 16621 acaagcggcg agtcattggg taagtaccat caacaaacgc ttgttcatta acctgctttt 16681 gggctatagg ttggacatag ttaatctttt taacgaaccg cagaggcgca gaggacgcag 16741 agagaagaaa aagcagagag aagaaaaaaa tgcttcactg aaccgtatta gagtataagc 16801 cactcctaat ataaaggaat atgacaagac taactttgcc tacttggtat tatctcgtct 16861 catagcttca acataaactt ctctaaactt ctgagcaact ttactccaag tgtatcttga 16921 taaaactttt tcacgaccag catcacccaa agtttttgcc caatcagaat tttttaataa 16981 ctctataatt gcttctgcta acatttcctg atttttatag tctaccaaaa gtccatcaac 17041 tccctcgtct atcacccaag gtattgctcc agcacgacaa ccaatcacag gtttactcgc 17101 agcccaagct tctagaaatg caattccaaa agactcaaaa ccagaaggat aagcaaacac 17161 atctacggca gaaaataaga aaggtttttc ttcattagaa aagttatagt aaaacttgac 17221 ttgcttttta tatgactctg gtaattgatt aacaatattt tctactttct ctgcaaacat 17281 tgtctttgct cctgctagca gaagctgcac atcagggaaa atttgccaga caagtgtcat 17341 tgcttgtaca agcgtatcta ctcctttgtg tcctccaaat tgtccaataa aacctacgac 17401 tggttgctct ttaaaaccga aatgcttttt ggcttgagtt gaggaaattt gttcaaaggg 17461 ttctggatca actcctactc ctatgacaac aacacgttca ggtgaaactc cttgttggat 17521 aacgtaatct gcttcgtatt ttgtattaga taaataataa gtagattgct caatagcact 17581 gtaaatttta gagcgttgaa atgcccaatt atcctgtgga tgtatccctc catgaaggac 17641 acatggtctt ccagagtttc ttgcaccgtt aagcgctgcg tacatgtgta acaatggaaa 17701 agatgaagca gcgataatat cagcgggaaa ttcttggata gcttttttta agccaggaat 17761 gattggtcct cctgctattg ttcttaaatg ttcgttgaat ggtaagcgca gacgataagc 17821 aactttttga ggaaatcgaa ccatttgact cattcggcta ttgactggaa aacgccgcac 17881 tttgacacca tttatctcct cgtcacctgg ttgcaagcgt ggtagttttg ggttaaaaaa 17941 accctctcca ttgaagcaat ttgtggtgaa gacagtcact 
tcatcgccaa attgtcgaac 18001 taattcttca gaaactcgtt gaataagtaa ttctgtaccc ccaattgctg gggtataacc 18061 ttgagtaacg tgcagaattt tcatgtatag ttatggttag actggtcgtc gccgaaactt 18121 cagccttata cgtttgagaa gttcgcgact tccctcagta cgccaaattt caatgactcg 18181 cgctaatcta ccacgtttga ctacggttgg tgtgactagt ttacgtgttt gtcggtcagg 18241 tgcgtactct ggttgtaagc agtagcgccg cagaggttcg acgacttgtg accaatttaa 18301 ggagttgcgt aagggttcaa atgctgaagc ccaagatgat ttggggcgtt ctaatatttg 18361 tgcgatcgcc actcgcactg cttctacatc caatggtggg acagctttac caacaccata 18421 ctcttttacc cattcactgg tgacatctcc ttcactcaca agtactggta aacgcgccca 18481 gagataatct agtactcgtg tgcgaatgga ataacgagtt tcgatgtgtg ggggatgaag 18541 agtgactccg atatcagctt cacacaacag ggcttctctt tcttggtatg gtatccactc 18601 gtaaaagaaa atggtacggt ctttttcgcc aatttcagca gcaaggactt gaacttgttc 18661 cgccatttta tgtggagcaa cttgagggtt gggatgacgc gttcctaaaa agacgagtcg 18721 tgcttgtggg tattcagcaa tcacaccagt ccaagctttg accaaagtta ggggatcgag 18781 ccagtcccaa ataccgccac cccataatac aatacgagca tcttctggaa atccaggatg 18841 tataccgcgc aatgtaggat ttggacgggg ttctcggtta agaaagccga ttcctacgac 18901 atcaattaaa gagcgtaggc tggtatcttt ggcaaagttg cgggggttga tgcgtccatt 18961 ggaagcgagt aaaccgatcc agaaatctcg ttggcgatcg cttccacaaa taaaaaaatc 19021 gcctaaacga gctaggctgt tagtaacatc gactgcgcgg ttgttcaagc tttcctgtgc 19081 tgggagtggc ttgttgagat ggtagtgtaa attttctaga ataaaaggat cgtaaaggtc 19141 aacgacgatg cgtttttgtg tctggtgcaa gaaggggaat ttttctacca tataaccaga 19201 caccaaggta atgtcactgt tttctaccag gacttgtaaa cttttcgggc gttcatccca 19261 gtaacgtact agtttaatat tggggacttc gagagtcgtt tcagagggta tagcaagggt 19321 gacattcaat tcttgactaa gaacacgcgc taattccagg taacgcatcc ctggacctgc 19381 cattttagta tctaccacat cattactgac aatgagtaag tgtgggcgtt tgatgttttc 19441 taattcaggc attaattgaa ttttgattgt aatattcctt ttagcttaaa ccactgattt 19501 ctcagttgcc aaaatctact tttttccatg taatcaacta atgcttttaa tcgttcgatt 19561 tctgcatttt tgtgtttcaa ctctgtctgt aattgaggac tttctatcaa tttactactg 19621 ttatgtgtta tatattccaa 
tctatttttt atcttttgac caataagttt tgggcttaac 19681 aaagacttaa tgtcttcaca tgcttttgca cctatctgct tagcttcttc ataattatta 19741 tatacatgtc gcattaaata tgcagcatgg tcaatatctg gttctgccca tgtacttccc 19801 ttcttgtaat gtgcataatc ttctgttaag gtagttaaag aatacttgac aggaaaacta 19861 ttactaacat tcataaagtc agtgtttcct gagtaggcgg tggctatgac tggttttcca 19921 tagaacatag cttcagcaat agttaagcca aacccttctg agcgatgtag agatacatag 19981 caatcacaat ggtaaatgag ggcattaatc tcatctttga gcaaatattt ttcaataaat 20041 tttatgtttt taaaatcttg tgctaaatct tttaactgct tatatttatc aggaaacttg 20101 tgagcattag aaaatttcag aacgagtaag acttgctcat ctttaccaaa agctcgctga 20161 aaagcttcaa taactgctat gggatttttg cgttcaaaga cactgtaaaa atcaaagata 20221 aagagaaaaa taaacttatt atctggtagc ccaagtgatt gtttggttgc ggtcggttgt 20281 ggtagcgaga gactgtgcat gactttaagg actgggatag gtgatagtgg tgcgttctgc 20341 ccttggcagt aagcttcgct tatcgcctcc acactataat tgctgggagt ccatatttca 20401 tcaaaaagat tgaatgctgt tagccattct ttgggaaaat caggaagctc ccatgcccaa 20461 aagccaatat tgtacttatt tttaaaatat tcaataccta cagagttcag aaaagaattt 20521 atcatatccc cattcacatg gataatattg attggatagg gattatcgtc ggaaaaattt 20581 gtataagtag aatctaattt tctatgattt gtattaaaac tacagttatt // LOCUS NODE_1564_length_20531_cov_5.23383520531 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20531) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20531) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20531 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(123..1022) /locus_tag="DP116_14060" CDS complement(123..1022) /locus_tag="DP116_14060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878449.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_14060" /translation="MTTSASTPAFTSTKTWIWQGFPICYQTQGSTGPAVILVHGFGAS WWHWRKNIPVLAEHCRVYALDLIGFGASAKPKPNETIAYTIETWGQQVADFCREVVGE PAFFVANSIGCIVVMQAAVSDPEVALGVALLNCSLRLLHDRKRATLPWHRRFGAPILQ RVLSVKPIGDFFFNQVAKPKTVRKILLQAYANSEAVTEELVDILTSPARDPGAVAVFL AFTSYSSGPLPEDLLPQLLCPAIMLWGTADPWEPIELGRELANLPQVQKFFPLEGVGH CPQDEAPEVVNLIIQDWIWKLTR" gene 1508..2236 /locus_tag="DP116_14065" CDS 1508..2236 /locus_tag="DP116_14065" /EC_number="2.7.4.22" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006275695.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UMP kinase" /protein_id="PRJNA477356:DP116_14065" /translation="MGTHYRRVLLKVSGEALMGNMGYGIDPEVVKEIAEEVAEVVSTG VQTAIVVGGGNIFRGVKAASAGMDRATADYIGMIATVMNAMTLQDSLERIGVQTRVQT AISMQEVAEPYIRRRAIRHLEKGRVVIFGAGSGNPFFTTDTTAALRAAEIEAEVIFKA TKVDGVYDADPHIYPDAKRYNSLTYGHVLAKDLRVMDSTAIALCKENNIPILIFDLTV RGNICRAVMGESIGTLVGGSCEIS" gene 2223..2771 /locus_tag="DP116_14070" CDS 2223..2771 /locus_tag="DP116_14070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878445.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribosome recycling factor" /protein_id="PRJNA477356:DP116_14070" /translation="MKLAEAESTMQKTVESTQRAFNTIRTGRANASLLDKVTADYYGS PTPLKSLANISTPDSTTILIQPYDRNTLSLIEKAISMSDVGLTPSNDGSLIRLNIPPL TSDRRKELVKIAAKYAEEGRVAIRNIRRDAIDTIRKLEKSAEISEDESRDQQDKLQKL TNKYTVKIDELLAEKEKDITTV" gene 3711..4835 /locus_tag="DP116_14075" CDS 3711..4835 /locus_tag="DP116_14075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458385.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="PRJNA477356:DP116_14075" /translation="MYDCIIVGAGPAGGTAAYHLAKRGRSVLVLEKETLPRYKPCGGG VSPAIAQWFDFDFSPAISIKATTIRCTWNMGEPVEAELGTPEPVWMVRRDVFDHFLIQ QAQKQGAQLRDNTAVTGIEFKGDSWQVNTANGPVSGRYLIAADGAKGSMARLLGFKER KRRLAGALEAEALTKVENGHIAHFEFGLVKNGYLWNFPKADGYSIGIGTFVGGGEAQD FKSILSEYSSLFGVDLKICKQYGHALCLWNGNQKLHTENAVLAGEAACVVDPFTAEGI RPSIFSGMKAAVAIDQALGGDINALEKYTDTINEEWGSDMAWAQKLAGAFYRFPGIGY KVGVKRPSSVQIMGKLLCGELRYSEVTGRALKRLVPGFGG" gene 5042..5374 /locus_tag="DP116_14080" CDS 5042..5374 /locus_tag="DP116_14080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002745369.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14080" /translation="MQLEDYFEFFASDDIRVKGTRIGIEHILDEYIHSAKAPEEIAKE LHTVTLEQIYATILFYLHNQQTVEKYMADWLDYTLKAEAEYDKNLPSVVMRLRQLKEQ QKAEPVSN" gene 5374..5577 /locus_tag="DP116_14085" /pseudo CDS 5374..5577 /locus_tag="DP116_14085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007311559.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(5602..6768) /locus_tag="DP116_14090" CDS complement(5602..6768) /locus_tag="DP116_14090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317503.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14090" /translation="MAGIWLNIPKLPLLEAQRIALYWRKRLAIECSEQSVENRESIIS WLLGSDLHRFEVLSQNELDIAKQAMEYRYSLLRQRYLGFSRERAYRNLITMLGSLVTL RHKIQTWIALSRDRDRTVLDVLQEVIQELLQSDNYIQQQMIYISELTTDSRLKNALLF ASVEEYCLRPVRNQPLLAYRFVNYLRRTQRGGLTQVPTQDLVRLVSEEILTDDNENRV NLVDTQAVAEYQEAQAAEEQQAVRQTVKQEFEDYLLENLGQEAVEWLRLYIKGKSQDE IAKKLNKPIKEVYQLREKISYHAVRVFALKGKPELVDSWLSISLKEHNLGLTPKQWQQ LHEQLTPLQRSVLELRKAGYSIEDTASELKLKMHQAMGELTKVYLLAQALRSQE" gene complement(7431..8051) /locus_tag="DP116_14095" CDS complement(7431..8051) /locus_tag="DP116_14095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14095" /translation="MAYSDFTLAKAKNTFGLTLDETRNLFRNVPGVQPSDLLTRLLDE NLPLATAINSEKARSEFLIAPILSEVRRQLDYRISLFSGTEFNVEPAQGLSGFCDFLL SASGEQYFISAPVVTVVEAKNENIIAGLGQCAATMLAAQIFNQRAGNDIEVIYGVVTT GTSWKFLTLEQNIVCIDSIEYYIKEVDKILGIFLQPFQRFLSKTSV" gene complement(8602..9738) /locus_tag="DP116_14100" CDS complement(8602..9738) /locus_tag="DP116_14100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744434.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14100" /translation="MEASQEELPLLDLFTQLREAGLSLGINEYYLVLRALQEGFGTVD QEALAELCRTLWVKSQDEEHLFNYHFQQVIAKSTNTAPVSSPATTSSEKPVVSNSTTV DSGSAPTSELDKTTPASVPVSSELMQVEDEIQVAEAVQITTQRDEDITVNRFTQTDEY FPVTRRQMKQSWRHLRRLVRKGLPTELDVEATIERVVQQGVLLEPVLVPHRVNRTELL LLIDHKGSMVPFHALSQRLVETAQRGGRLGGAGTYYFHNCPTRYLYQDPTRQKAESVA DVLAQLRPERSAVLIFSDAGAARGGFSIERLELTQKFLDQLKQRVRYVTWLNPMAKER WFGTTAGEIARLVPMFEFSRQGLDGAIDVLRGRGSYAQLEYELF" gene complement(9741..10730) /locus_tag="DP116_14105" CDS complement(9741..10730) /locus_tag="DP116_14105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859419.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MoxR family ATPase" /protein_id="PRJNA477356:DP116_14105" /translation="MARALKGKRLKYTGKVQPKPGEQDPQTGQLLYPYLPSEKLVEAV NLAITLERPLLLKGEPGCGKTKLARAVAYELRLPYEAWYVKSTSRARDGLYTYDAVGR LRDAQLAASGIDDEAAVRAKNVDAYVEWGPLGRAFLNDLRTVVLIDEIDKADIDFPND LLLELDEKRFEVAEVKQSSSIKKIQAKVTPIVFITSNDEKDLPDAFLRRCLFHYVKFP ERQQLIEIVKAHFPVSPLNVVDTIIDRFVELREEMRRDKGEAGKKVSTSELMDWVRIL RHHEDDEILAKLKTELLYPGVLLKSWDDYRRYGEQGRSSSQEDDSSAPSSSGI" gene complement(10715..11725) /locus_tag="DP116_14110" CDS complement(10715..11725) /locus_tag="DP116_14110" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14110" /translation="MSYVNALRNKLKRLNEELDGLQQEWRRRHDKVKELRLALAIEVS PASKFQLKQQIQEEDTQLKSLDHRIQEVEKYIERVNNEQIHSALFRLNYVQQVQLFRE FIEEKRIGAFLVHGSPEHGQIFLLKRLLQAIPDSTITPPIQFYLSRRALRTDIAALWR EFGRQMGVQNFSSHEEIAKNVVSQLQTQHVILVFHDFDCIDEDYLHELMRDFWLPLVN SVQQTIYPTNEFFLLMFLVDHEGCVSNWKIQFANQLDPVWEPCIPIRLPIIDRLSDRV LASWMENAIDTLPTKVTRQIDSTVQVILEKSEGVPEHVFAQIFGLCGCNWQEEEVRWL EL" gene complement(11751..12791) /locus_tag="DP116_14115" CDS complement(11751..12791) /locus_tag="DP116_14115" /inference="COORDINATES: protein motif:HMM:PF00656.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="caspase family protein" /protein_id="PRJNA477356:DP116_14115" /translation="MTNQTFANGYALLIGIGADLPITVKDATALRDILVNPNQAAYPS NQVNLLTETSATRQEILKAFDQLIEQVNQNPDASVIIYYSGHGGRIERTNEYFLVPYG YDPSQRADTAISGLEFTQKIEAIKARKLVVLLDCCHAGGVPVLKEAGETFVKSPVPPD LLEILGTGSGRVIVASSREDEYSYTGQPYSAFTDCLLEALQGKAAKEKDGYARILDVM IYLFDQVPNRASGPQHPFVKKVLDLGDNFPLCYYAGGSKFLPGETPVAEVHMTTSSLT AGQKRRLVQRQDTLQAEWDLRSEKVKRMRDALAIETGTAVKFQLEKQLLNEEAQLARL GDELDEIEQALQ" gene complement(12979..13863) /locus_tag="DP116_14120" CDS complement(12979..13863) /locus_tag="DP116_14120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316027.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="CPBP family intramembrane metalloprotease domain-containing protein" /protein_id="PRJNA477356:DP116_14120" /translation="MQEVDCVFLPTFISLPKPSVNFLLSSLKDAPVFFVVMAFFVIWV SCWLPIAALTAFVLKWQPSQLLQPEKKLPLLVSLYLLAPFILWGTSWLTNTPFSDYGF IGNVSTLYSLVVGLALGILSITLVFLWQLWLGWCSFQWSNIKLVRPILLPILLVALFV GGIEELIFRGFVFTELGKNGFVWVAAFISSSIFALLHLVWEQKETIPQLPGLWFMGMV LVLARFVDGGSIGLAWGLHTGWIWAIATIDTAALIDYTGNVSDWVTGKNKKPLAGVAG VLCLLLTGGILWLFSRYF" gene complement(14143..14610) /locus_tag="DP116_14125" CDS complement(14143..14610) /locus_tag="DP116_14125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316028.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB family transcriptional regulator" /protein_id="PRJNA477356:DP116_14125" /translation="MTETATAPLTGKTLLAKVKELSNLPRRERAKQCGYYTVTKNNQV RVNLTDFYDALLSARGIPLSPEAPKDGRGREPTYRVSVHQNGQIVIGATYTKAMGLKP GDEFEIKLGYKHIHLIQVDSDKKLLQHDEDAEVYDEDLEDEEDLEDDDYEDEE" gene 15968..16978 /locus_tag="DP116_14130" CDS 15968..16978 /locus_tag="DP116_14130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876345.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="succinate dehydrogenase/fumarate reductase iron-sulfur subunit" /protein_id="PRJNA477356:DP116_14130" /translation="MVMEVIFKIIRQQQNSSPIVQTYSLDVEGGNTILDCLNLIKWEQ DGTLAFRKNCRNTICGSCAMRINGRSALACKENVGSEIAKLEQTVTTASHANGIPEII IAPLGNMPVIKDLVVDMSSFWNNLETVTPYVSTAGRNIPEREFLQTPQERSRLDETGN CIMCGACYSECNAREVDPNFVGPHALAKAYRMVADSRDDKTESRLEEYNEGTKGVWGC TRCLYCNSVCPMDVAPLDQITKIKQEILDRKQASDSRSIRHRKVLIELVKEGGWIDER QFGLQVVGNYFKDLKGLLSLAPLGLRMIVRGKFPLSFEPSEGTQQVRSLIEAVKQEES RV" gene complement(17149..17673) /locus_tag="DP116_14135" CDS complement(17149..17673) /locus_tag="DP116_14135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862063.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Tol biopolymer transporter periplasmic protein" /protein_id="PRJNA477356:DP116_14135" /translation="MKHSIFLPIFATASLLSGCAGYPRIVNYPYDPGGLSLNSSASEL DPQISRRYIVFASDRRGRQDIYMFDRVTGSLVDLPGLNSFDTVASHPGVSESGRYVVF AGNRQGRSAIFLYDTETRQLRNLTANLQAEVRHPTISADGSRIAFESSVNGQWDILVY NRSGQPLNIPQDPR" gene complement(17721..18281) /locus_tag="DP116_14140" CDS complement(17721..18281) /locus_tag="DP116_14140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14140" /translation="MSKITPSFWIQRQIPWGLSIALASFLVSCSYSDVPLGPTSLNSR YTEQQPALSGNSRFLAFVSNRNGSHQLLVYDLEQQQFIQTTGLNQRETIVESPSLSYT GRYIAYITSDQGRPVVALYDRATQQSQILTPTYRGWIRNPSISPNGRYVVFETASRGQ WDIEVLDRGPDVELDIANGASVGASP" gene complement(18335..19258) /locus_tag="DP116_14145" CDS complement(18335..19258) /locus_tag="DP116_14145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316034.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14145" /translation="MLMFGLNSASVLAQVNFGANSASVLGIFLAVAGAALYFLRTVRP ELSRDQDIFFAAVGLLCGFILIFQGWRLDPILQFGQLLLAGSTIFFAVESIRLRGIAT QQAKRNTPIVDDEREVSDNYRYNKKYKAQVDAELEPLPYDDDYDERPVRGRISGSRDN RSSRDDYYEDETPRRSERRNGSERPTSSDKTDKTRRRSSGRSVRPSERIEDEDWGGSN RTVDEWGSSTTEERRPPRRGSNGTARPETRDEDVPARPRKRRSPNESAPQRGREDDEV ISSEYVEYKPLDRSERSDQDRDSSSKFDDDL" gene complement(19897..20016) /locus_tag="DP116_14150" CDS complement(19897..20016) /locus_tag="DP116_14150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Photosystem II reaction center X protein" /protein_id="PRJNA477356:DP116_14150" /translation="MTPSLANLLWSLFWGALIVVVPATVGLIFVSQKDKIQRS" gene 20213..20503 /locus_tag="DP116_14155" CDS 20213..20503 /locus_tag="DP116_14155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740297.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YggT family protein" /protein_id="PRJNA477356:DP116_14155" /translation="MTGADLAIWILGPILGLMTFLFIFRIILTWYPQANLNRLPFNLV AWPTEPFLVPLRKIVPPMGGVDITPIIWVGIFSLLREILLGQQGLLTMLSRA" BASE COUNT 5935 a 4570 c 4273 g 5753 t ORIGIN 1 ggcatgtggc gtccctgggc taaagccaca gggttttcat ctcacccact ataaattaac 61 acttccccaa ttggatgaat ttggctttgt tcatttatcc agaaaatccg gggattgtga 121 cgttatcgtg ttagcttcca aatccaatcc tgtataatta gatttaccac ctcaggagct 181 tcatcctgag gacaatgccc aacgccttct aaaggaaaaa acttctgcac ttgcggaaga 241 ttggctaact ctctacctaa ctcgattggt tcccacgggt ctgctgttcc ccacaacata 301 atcgcaggac agagcaattg aggtaaaagg tcttctggta aaggacctga ggaataagag 361 gtaaacgcga ggaaaacagc tacagcccct ggatcgcgtg ctggcgaagt cagtatatct 421 actaactcct ctgtaactgc ttctgaatta gcataggctt gcaagagaat tttccgcact 481 gttttgggtt tagcgacttg attgaaaaag aagtcaccaa tcggtttgac agatagtaca 541 cgttggagta ttggtgctcc aaaacggcga tgccacggta aagttgcccg cttgcgatca 601 tgcaacagcc gtaaggaaca gttgagtaaa gcaactccca aggcgacttc cggatcactc 661 accgccgctt gcatcaccac aatacagccg atggagttag caacgaaaaa agctggttca 721 cccaccacct cacggcaaaa atctgcgact tgctgtcccc aagtttcaat tgtgtaggct 781 attgtctcat ttggttttgg tttagcagaa gcgccaaaac caatcaaatc aagggcatag 841 acacggcagt gttctgccag cacagggata tttttccgcc agtgccacca agaagcacca 901 aagccatgca cgaggataac agcaggtcca gtgcttcctt gggtttggta gcaaatggga 961 aaaccttgcc aaatccacgt ttttgttgaa gtaaatgctg gagtggaagc agaggtggtc 1021 atgggagatt aatgattggt acaaggagta gagatgcacc ggcataccgc aggtacgaca 1081 gtttttattg ttgcagtact taacaatgat tagccccagg tgttaagcgt tgggcatgaa 1141 aaatatgttc ataaccctgc caacacacgt caaggacaaa caagatgata actcgtaaaa 1201 cactgaaggt gcactcgcag aattatcaac aagcgtgaga tcgcccttag gactcatgca 1261 aagggagttt ggattgccgt agtcctactt tgatcgccct ttgaatggcg agaaccctag 1321 accatccccc aggcagtggg tactaaactt tgcccaatca tctgtttttt gtaagcttaa 1381 gaaattcatc ccaaagacac tgaatgaatt tatacttttt tggtaatcaa aaactagaga 1441 cagaatgcca ctgtattgct atcttaaatt ggaggtaatc 
acaaattttt gataagcgcg 1501 taacttcatg ggaacacatt accgaagggt tttacttaaa gtgagtggtg aagccctcat 1561 gggcaatatg ggttatggaa ttgatccaga agtggtcaag gaaatagccg aagaagtagc 1621 agaggtggtg tcaactggcg ttcagactgc catcgtcgtc ggaggcggaa atatttttcg 1681 tggcgtcaag gcggcgtcgg cgggaatgga tcgagcaacc gctgactaca tcgggatgat 1741 tgctacggta atgaatgcca tgacattaca agattcacta gaacgaatcg gcgtacagac 1801 gcgcgtgcaa accgcaatct ctatgcaaga agtcgctgaa ccctatattc gtcgtcgtgc 1861 catccgtcac ttagaaaaag ggcgggtagt tatttttggt gctggttctg gaaatccctt 1921 cttcaccacc gatactaccg cagcattgag agcggcagaa attgaggcgg aagtgatttt 1981 taaagccacc aaggtagatg gcgtttatga tgctgatcct catatatacc ccgacgccaa 2041 acgttacaat agcttgacat atggacacgt tttggcaaaa gatttgcgag taatggacag 2101 taccgcgata gccttgtgta aggaaaacaa tatccctatt cttatatttg acttaactgt 2161 gcgcggtaat atctgccgag cagtcatggg agaatcgata gggacgcttg tgggaggttc 2221 ttgtgaaatt agctgaagct gagagtacaa tgcaaaaaac cgtagagtca actcaacggg 2281 cttttaatac aattcgtact ggtcgcgcaa acgcgagttt attagataag gtgacagcgg 2341 attactacgg ttcgccaacg cccttaaagt cactggcaaa cattagtacg ccagattcca 2401 caacaattct gattcagcct tacgatcgca ataccctaag tctgattgaa aaggcaatat 2461 ctatgtcaga tgtgggttta acccctagca atgatggttc cctgattcgg ctaaacattc 2521 caccgttgac gagcgatcgc cgtaaagaat tagtcaaaat cgccgccaaa tatgccgaag 2581 aaggacgcgt tgctattcgc aatatccgcc gcgacgccat agatacaatt cgcaaactag 2641 aaaaaagcgc cgaaatttct gaggatgagt cacgtgatca gcaggacaaa ctgcaaaaat 2701 tgacaaacaa gtacactgtc aaaatagacg agttactggc agaaaaagaa aaagacatca 2761 caactgtcta gaaacaacaa aacaaaatgc tcatcaggga gacgcggttc tcaaagacaa 2821 gggagacatg aggatagagg ataccaggac taagggacaa accacgggga aattcttccg 2881 aaagattctc cacgtcacaa gctccagtta tcttcacttt atcaacaatg tcataacgtc 2941 tccttttctt cgtgtatacg cgtaaaacag aaaccgcgtc agcgcatctt acagtttttg 3001 cagcatataa acacattctc gtacactacc tcaaaactgt tgattgaggt ataaatctac 3061 tttttagtaa ctgttataaa tgactcttag aaatttatga ttttcctata aaatactata 3121 taatttattt tttgaaaatt gcaaaaatgg aaaaaaatca tcaaaaaaat 
tacaaaaact 3181 ttatcaagct ctgaaaaaca tggcacaatt ctttttgcca agtaaaatga atgaattgta 3241 ttttagtatc tggagaaaaa cagacgagtt acttgtaggc aatgaacatt tatagcagtc 3301 ctaaatcatt tgtgtaaaag tgcctgcgct ctgcgcttaa aaaccaagta cacctgtaca 3361 tccttctttc ctgttaagag ttaagagtta agccttccct cccttaacca gtaagtttac 3421 aactcctgca cggattgcta tacattatcc aagtttatgt ttattaataa aaaaataatg 3481 ataacaaacc tgtaatcaat caagaaatta aaagagtagt cccacagact taagatttcc 3541 ggtgaaaaat agtttagaaa aaataaagat ttcgaggagt cagaccaagc actgtacgat 3601 tttaaacagc aaactataaa aacagttaaa atctcttgag actgtctatt gacagagtca 3661 taaagaaaaa cagggaagct aacaagttca ggagcaaaga caaaaaccat atgtacgact 3721 gtattatcgt cggtgcagga ccagctggtg gaacagccgc ctatcattta gctaagcggg 3781 gacggtcagt cttagttcta gaaaaagaaa ccctgccaag atataaacct tgtgggggtg 3841 gggtatcacc agcgatcgcc caatggttcg actttgactt tagccctgca atttccatca 3901 aagcaaccac tattcgctgt acctggaaca tgggcgaacc tgtggaagcc gaactcggaa 3961 ctcctgaacc agtttggatg gttaggcgcg atgtctttga ccatttcctc attcagcaag 4021 ctcaaaagca aggggcccaa ctgagagata atacagcagt cacaggcatt gaatttaaag 4081 gtgactcttg gcaagtcaat acagcaaatg gacctgtcag tggacgttac ttaattgctg 4141 ctgatggcgc aaaaggatca atggcaagat tgctagggtt caaagagcgc aaacgccgtc 4201 tagcaggggc attagaagca gaagcactaa ccaaagtgga aaacggtcac attgctcact 4261 ttgaatttgg cttggtcaaa aatggctatc tctggaactt ccccaaagca gatggatatt 4321 caataggaat cggcacattt gttggcggcg gtgaagccca agatttcaaa agcattttaa 4381 gcgagtacag cagcttgttt ggtgtggatc tcaaaatctg taagcagtat ggacatgctt 4441 tgtgcttgtg gaatggcaat caaaagctgc acactgaaaa tgcggtttta gctggggaag 4501 cagcttgtgt cgttgatccc tttaccgcag aaggtattcg cccctcaatt tttagcggga 4561 tgaaagctgc agttgctatt gaccaagcct tgggtggtga tatcaacgcc ttggaaaaat 4621 acacggacac catcaacgaa gaatggggta gtgatatggc ttgggcgcag aaattagctg 4681 gggcgtttta ccgttttcct ggtattggtt acaaggtggg ggttaagcgt ccgtcatctg 4741 tccaaattat gggcaaactt ttgtgtggtg aactgcgtta cagcgaagtg actggtcggg 4801 cgctcaagcg gcttgttcct ggttttggtg ggtagttatt tgtcattgcg cgataagccg 
4861 gaggacttcg gcgtgagggc tacttcgaca aagctcagta caagtcagtc gaacgcttga 4921 cgctacgcgt attgccactt gggttatttc cccaatcatc caagcaaaca tctgtaaaaa 4981 ggcagggata ttggagaagt tcaaaacctg ccatgataga actaaaaagc gtgaatgtca 5041 tatgcaacta gaagattact ttgaattttt cgcttctgat gatattcggg ttaaaggaac 5101 gcggattgga attgagcata ttctggatga gtacattcat agtgctaaag ctccagaaga 5161 aattgctaag gaacttcata cagtgacttt ggagcaaatt tatgccacga ttctgttcta 5221 tttgcataat cagcaaactg tagaaaagta tatggcagat tggttagatt acacgcttaa 5281 agctgaagca gaatacgaca aaaatcttcc atccgtagtc atgagattac gacagttaaa 5341 ggaacagcag aaagcagaac cagtcagtaa ctaatgtcta tccgatactt actggacgaa 5401 aatttgccgc ccacttttcg agaacagttg ctgcgttatc aactcaattt aactgtttta 5461 atggttggcg atccggatgc gccaccaaaa ggaactctag atcctgaaat tttgtcttgg 5521 tgtgaggaac aacgttttct cttggtaacg agaaatcgcc gttctatgcc tgtacatcct 5581 accttaaagt ttatatagat atcactcttg acttctcaaa gcttgggcta acaggtaaac 5641 tttagtcaat tcacccattg cttggtgcat tttgagtttt aactcagaag ctgtgtcttc 5701 tatcgaataa ccagctttac gcaactctaa aacagagcgc tgcaaaggag tcaattgttc 5761 atgcaattgc tgccactgct ttggtgttag ccccaagttg tgttctttca aggaaatcga 5821 cagccaacta tccactagtt ctggtttacc tttgagggca aacacgcgta cagcatggta 5881 gctaattttt tctcgcagtt ggtagacttc cttaattggc ttatttaatt ttttggcgat 5941 ctcatcttgg gactttcctt ttatatagag tcgcagccac tctacagctt cttgtccaag 6001 attttctaat aaataatcct caaattcctg cttaactgtt tgacgcactg cttgttgttc 6061 ttccgctgct tgtgcctctt gatattcagc tactgcttga gtatcaacca agttgactcg 6121 attctcattg tcatcagtaa gaatttcctc agagacaagc ctaactaagt cttgagttgg 6181 cacttgggtt aagccaccac gttgagttct tctcaggtaa ttcacgaaac gatacgccag 6241 tagaggttga ttgcgtactg gtcgcaaaca atattcttct acactggcaa acagtaaggc 6301 gttcttgagc ctgctatcag ttgtcaattc tgaaatgtaa atcatttgtt gttgtatgta 6361 gttatcgctt tgcagtaatt cttggataac ttcttgcagc acatctagca ctgtccgatc 6421 gcgatcgcga ctcaaggcga tccaagtctg aattttatgg cgtaatgtca ccaaactccc 6481 caacatagtg atcaaattgc gataagctcg ttctctagaa aaacctaaat agcgttgacg 6541 
caaaagactg tagcggtact ccatcgcttg cttggcaata tccaattcat tttggctaag 6601 tacttcaaaa cgatgtaagt cacttcctaa aagccagcta attatacttt ccctattttc 6661 aacactttgt tctgaacact cgatagccag acgctttcgc caatatagtg ctatacgttg 6721 tgcttctaag aggggcaatt tgggtatatt taaccatatt ccagcaatct ctacctgttg 6781 tccctgttga gttaggttac tcaactgagc tacagcacct tccaagtcac cacgggtaaa 6841 gcgatccata ccacgggcat aaattagtag tggttgaaat tctgctaacg gctctgcaaa 6901 agtctctact agggaagtca tgcgaaccca ctctgctttt tcctccaatt tcaactttga 6961 aagagctaac gctattttat ttgctgcctc actcggttgc gtataagcta acgccgtcca 7021 cctctgaact tgagcaaaat cacgaatatc cggatcatgg ctgtcaagct gctgctgtac 7081 gtaggcaagt aaaaaatcag ataactcatt tacccgcttc tcaccaaagc gagaattcgc 7141 cttcaactcc ttcaacagcg cattccgcac tgcggtatcc atttcataca gttcatgccc 7201 cacctcatca caaaggctag aaagcagcaa atccgccacc gccatccaag ggatgtttag 7261 ggcttcaccg ttgatatcac gttgaaaatt agcccacatc cggtaaagca aatccggcgt 7321 cagggctaat ggaaaagcag catggtaagc cagatagagg tgtgcttctc caaaacgttc 7381 cccaaaagtt ttaatacggc gagtggcgac ttctggtttc attggcttct ctaaactgaa 7441 gtttttgaca aaaaacgctg gaagggttgc aagaaaattc ctaaaatctt atcaacttcc 7501 ttaatgtaat attcaataga atcgatacaa acaatatttt gttctaaagt caagaacttc 7561 cagctagttc cagttgtaac gactccgtaa ataacttcaa tatcatttcc tgccctttga 7621 ttaaatatct gagcagcaag cattgtagca gcgcattgcc ctagccctgc aataatattc 7681 tcattttttg cctcaaccac agttactact ggagcactaa taaagtactg ctcaccagac 7741 gcactgagga gaaaatcaca aaagcctgag agtccttgcg ctggttcaac attaaactcc 7801 gtgccagaaa ataagctgat tcgataatcc agttgtcgtc tgacttcaga caagatggga 7861 gcaatgagga actcagaacg ggctttttca ctattgatag cagttgctaa aggtaagttt 7921 tcatctagca agcgtgtcag caagtcagat ggctgcactc ctggtacatt ccgaaacaaa 7981 ttgcgagttt cgtccaaagt taagccgaat gtatttttag ctttagcaag agtaaaatca 8041 ctgtatgcca tctgtttcta gtgcttctca tttttagtta gttaaaactg atattttgca 8101 gcattgtcaa ttagcaacct cgcccgctca gaactgaagt tccgggctaa tagctcaagt 8161 ccattaaaat ggactgaaaa tcttggtaag taagcgttta gtcctcttga gaggacttag 8221 gctataagcc 
aggggtttct aacccttggc ggacgatgca agctggcgca agatatcggt 8281 taaaactgac tttagctgtt agcccaaact ttataccaaa ttatctgtaa acttgcactt 8341 tcactcttgg cgttcttggc gacgccagtc gcctacggag ggaaaccctc ctgcagcgct 8401 ggctcgtctt ggcggttcgt ttcataaata ttaagtagaa cggcacaaat aattcatagt 8461 atgtcattgc gagtggaacg aagtggaacg aagcaatcac aggggtttga acaactttac 8521 tttccgttac atagttaggt ttatttccgc cgacctataa atgcatcttc atggagaatt 8581 ggtcttagtc caaagctgtt actaaaacag ctcatattct aattgtgcat agcttccccg 8641 tccccgcaac acatcaattg ctccatccaa gccctgacgg ctaaactcaa acattggcac 8701 caaacgggct atctcacccg ctgttgtgcc aaaccaacgc tcttttgcca tcgggttgag 8761 ccatgttaca taacgcactc gttgtttaag ctggtcgaga aatttttggg tcaattccag 8821 ccgttctatg ctaaaaccgc cccgtgctgc ccctgcatca ctaaaaatta atacagcgga 8881 acgttccggg cgtagctgag ctaaaacatc tgccacagat tcagctttct ggcgtgtcgg 8941 gtcttggtag agataccgag tgggacaatt gtgaaagtag taagtaccag cccctcccaa 9001 gcgtcctccc cgttgggcag tttccaccaa ccgttgagat aaagcgtgaa acggtaccat 9061 tgatcccttg tgatcaatca gcaatagcaa ctcagttcga tttacccgat ggggaaccaa 9121 aacaggttct agcaacaccc cttgctgtac aactcgctca atagttgctt ccacatctaa 9181 ttctgtaggc aaacctttcc gcaccaaacg gcggagatgt cgccagcttt gtttcatctg 9241 ccgtcgggtg actgggaaat actcatctgt ttgtgtgaag cgattaacag taatatcctc 9301 atctcgttga gtagtaatct gtacagcctc agcgacttga atttcatctt ctacctgcat 9361 taattcagat gagacgggta cggatgctgg tgtagttttg tcaagttcag acgtgggagc 9421 agaaccggaa tcaaccgtag ttgagtttga gacaactggt ttttcagatg aggttgtcgc 9481 gggagatgaa actggggcag tgttggttga tttggcgatt acttgttgaa agtggtagtt 9541 aaacaggtgt tcctcatcct gagatttcac ccacaaggtg cggcacagtt ccgccaaagc 9601 ttcctggtca accgtgccaa agccttcctg taacgcacgc agaacaagat aatactcatt 9661 aatacctaaa gataagcctg cttctcgcag ctgagtgaag aggtcaagca gtggtagttc 9721 ttcttgggaa gcttccatgt tcatattccc gatgacgatg gcgcggagga gtcgtcctct 9781 tgagatgatg atcgcccttg ttctccatat cgccgataat catcccaact tttgagtaac 9841 acccctggat acaacagttc tgtcttcaat ttggcaagaa tttcgtcatc ctcatgatgc 9901 cgcagaatcc gcacccagtc 
cataagttcg ctggtgctga ctttcttgcc agcttcgcct 9961 ttatcccgcc gcatttcttc ccggagttca acaaagcggt caataattgt gtctaccaca 10021 ttcaaaggtg aaactgggaa gtgtgctttg acaatctcta tcaactgctg acgttcagga 10081 aatttgacat agtgaaacaa acaacggcgt agaaaagcat ctggtaagtc tttctcatcg 10141 ttactggtaa taaagacaat aggcgtcacc ttcgcttgaa ttttcttaat ggaactgctt 10201 tgttttactt ctgctacctc aaaccgtttc tcatcaagtt ccagtagcag atcattggga 10261 aagtcgatgt cagccttatc aatttcatca atcagcacca cagtgcgcaa gtcatttaga 10321 aaggcacgtc ctagcggtcc ccactctacg taagcatcca catttttcgc ccggactgcg 10381 gcttcatcat caattcccga cgcagcaagt tgagcatccc gcaaccttcc aaccgcatca 10441 taagtgtaca acccatcccg tgcccgactg gtagatttga cgtaccaagc ttcataaggc 10501 aaacgcaatt cgtaagccac tgcacgagct aattttgtct taccacaccc cggttctcct 10561 ttcagcaata aaggtctttc aagggttatg gctaaattaa ccgcttctac caacttctca 10621 ctcggcaggt agggatacag caattgcccc gtctgcggat cttgttctcc tggcttgggc 10681 tgcaccttgc ctgtgtattt cagccgcttt ccctttaaag ctctagccac ctaacttctt 10741 cctcctgcca gttacagcca cacaaaccaa aaatttgggc aaacacatgc tctggaaccc 10801 cttcactctt ttccaaaatc acctgaactg tagagtcaat ttgtctagtg accttggttg 10861 gtaatgtatc aattgcattt tccatccaac tagctaacac tctgtcagat aggcgatcta 10921 tgattggtaa cctgataggg atacagggtt cccaaaccgg atcgagctgg tttgcaaatt 10981 ggattttcca gttgctgaca caaccctcat gatcaactaa aaacatcagg agaaaaaact 11041 cgttagttgg gtagattgtt tgttgaactg agtttaccaa cggtagccag aagtcgcgca 11101 tcagttcatg tagatagtct tcgtcaatgc agtcaaagtc atgaaaaacc aggatgacgt 11161 gctgggtttg caactgagat actacattct tagcaatctc ttcatgagat gagaagtttt 11221 gtactcccat ttgacgcccg aactctcgcc acaaagcagc aatgtctgtt ctcaatgccc 11281 tgcgggagag gtaaaactgg attgggggag ttatggtact gtctggaatt gcctgtaaca 11341 agcgcttcag cagaaagatt tgaccgtgtt ctggcgagcc atgaaccaaa aaagctccta 11401 ttcgtttttc ttcaataaat tctcgaaaca gctgcacctg ctgtacataa ttcaggcgaa 11461 acagcgcact gtgtatttgc tcattattta cccgctcaat atatttctca acttcttgaa 11521 ttctgtgatc caaactttta agctgagtat cttcttcttg aatttgttgc tttagctgaa 11581 
atttgctggc tggactgacc tcaatggcta gagccaggcg taactcctta accttgtcat 11641 ggcgtcgcct ccactcttgc tgaagtccat caagttcttc attcagcctt ttaagtttat 11701 ttcttagagc gttgacgtat gacatctcta cttagcccca gctttctaaa ttactgcaac 11761 gcctgttcta tctcatccaa ctcatcacca agacgcgcta gctgagcttc ttcattcaaa 11821 agctgctttt ctaattgaaa cttaacagcc gtaccagtct caatagctag agcatcccgc 11881 atccgcttca ccttttcact acggaggtcc cattctgcct gtagtgtgtc ttgtctctgc 11941 actaatcgtc gcttttgtcc agccgtcaaa ctactggtag tcatgtgaac ttctgccact 12001 ggtgtttccc caggtaaaaa tttacttcca cctgcgtagt agcaaagtgg aaaattatca 12061 cccaaatcaa ggactttttt gacgaatgga tgctgtggtc ccgaagcccg gttgggtact 12121 tggtcgaaca aataaatcat aacgtcaaga atacgagcat agccatcttt ctctttggct 12181 gctttgcctt gtaaagcttc taacaagcaa tctgtaaaag cactgtaggg ttgaccggta 12241 taagaatatt catcttcccg cgatgaagca acgataactc gaccacttcc cgttcccaga 12301 atctctaaca aatcaggtgg tacaggtgac ttgacaaaag tttctcctgc ttctttaaga 12361 accggaacac ctcctgcatg acagcaatct agcaaaacga cgagcttacg tgctttaatc 12421 gcctctatct tttgggtaaa ctctaatcct gatatggcag tgtctgcacg ttgactggga 12481 tcatagccat aaggcactaa aaagtattca tttgtacgct caatccgtcc accatgacca 12541 gagtagtaaa taatgacact cgcatcggga ttttggttta cctgttcgat aagctggtca 12601 aaagccttga gaatttcttg ccgagttgca gaagtctcag tgagcaggtt gacttggttt 12661 gatgggtaag cagcttgatt gggattaacc aaaatatctc gtaaggcagt agcatcttta 12721 acagtgatgg gtaaatcagc accaataccg atgagtaatg cataaccatt agcaaaagtt 12781 tgatttgtca cgccttatcc tcttgactta ttgacagtgc ttggtgactc tcttttgagg 12841 ggagtatact aatctaccaa ctgaagtcaa agcatcccag atttttttca gcaacaattg 12901 agaacaaaga aactctgtta ttgtttgtct caccctgtca ctactgattt ttttacgcct 12961 cagcgtcaat gcacaaaatc aaaaatagcg agagaataac cagagaatcc ccccagtcag 13021 gagtaaacaa agaacacccg cgacaccagc aaggggtttt ttattcttac ctgtcaccca 13081 gtccgagacg ttacctgtgt aatcaatgag tgcggctgtg tctattgtgg cgatcgccca 13141 aatccatcct gtatgcagtc cccaagctaa gcctatactc cccccatcga caaatcgtgc 13201 tagtaccaac accattccca taaaccacag tccaggtaac tgtgggatgg 
tttctttttg 13261 ctcccaaacc aagtgtagca acgcaaaaat cgagctagaa ataaacgctg ctacccaaac 13321 gaagccattc tttccgagtt cagtaaacac aaacccgcga aaaattagct cctcaatacc 13381 acctacaaac aacgccacta ataaaattgg tagtaagata ggtcgcacaa gcttaatatt 13441 cgaccattga aaagagcacc atcctagcca caattgccac aaaaaaacta atgttatact 13501 taatattcct aaagctaaac ctaccaccaa agaatataga gttgaaacat ttccaataaa 13561 gccgtaatct gagaagggtg tattagttag ccaactcgtt ccccacaaaa tgaagggagc 13621 taataagtat agggatacca gtaaaggtaa ctttttttct ggctgcaaaa gttgagaagg 13681 ttgccatttg aggacaaacg ctgttaatgc tgctatgggt aaccagcaag acacccaaat 13741 aacaaagaaa gccatcacaa caaagaatac tggtgcatct tttaaggatg acagtaaaaa 13801 gttgactgat ggcttaggta acgatatgaa agtaggtaaa aaaacacaat cgacttcttg 13861 catagtagaa acattttatt ttggtaagag aaaagactta ttcatttaga gcagctttca 13921 agtagagata gctaggttgt gtttattttc ccactacctc aagtattgct catctaatcg 13981 ctcttctttt ttcttaacca aacctttacc aaaatgaatg aatcatgtat ccaagaagtt 14041 taattgtttt ttttgtttca aaaattttca aaggtttgac tcttgccgta aagacaagtt 14101 atacagcttt tgctttaaac ttgcctctac gatcacaaaa atctattcct cgtcttcgta 14161 gtcgtcgtct tctaaatctt cctcgtcttc aagatcttca tcatagactt ctgcgtcttc 14221 atcatgctgg agtagctttt tatcactatc aacttgaatt aagtgaatat gcttgtaacc 14281 cagcttaatt tcaaactcat cacccggctt taaacccatt gctttggtat aagttgcacc 14341 aataacaatc tgaccgtttt gatggacact aaccctgtaa gtaggttcac gcccacgacc 14401 atcttttggt gcttctggac ttaagggaat tcccctagcc gagagcaaag catcgtaaaa 14461 gtcggtgaga ttgacacgaa cttggttatt ttttgtaacg gtgtaataac cacactgctt 14521 tgctctttct cgccgtggta aattagaaag ctctttcact ttcgctagca gtgtttttcc 14581 agttaatggt gcagttgcgg tttcagtcat tatgcttgaa atatcctcac tctctccaaa 14641 aaaataaaca aatcatcatt tgccagtact tgactttttg acatctgcac aaagttgctt 14701 cagttctcaa tcgcaaactg gttattcctc catatggggg gagccacaag agtatctgtt 14761 acgattgact acaacagaag atggttgtag ttgtcacagg cattaatttt cgtaatttta 14821 cggcatttct aaccatcatt tgccatctga aacgaaataa ttctcttttt tggcactact 14881 agagtggagc aactttgaac aactacaaat 
tggggagaca aaaagcgagg acactgcgcc 14941 cttaggggtt cccaaggacg gtgcaaagtg tccgggtgag gattttcctc ccgatttctc 15001 gcgccttgtg ctctaccact ataaacaagt cgtagaccca ataaattgca ggcttttcac 15061 tttcctaagt ctgtcttaat tgaccttgag tcgtagggtt ttccccattt ttactcaaga 15121 aacatgattg acagcatagt attttcataa ggttgtgctg acatacaacc ttttaccctg 15181 ctagtagaga ccagaaattt cttgcctcca ctttattaca aagaaacaaa gcagtaacag 15241 tcgcacttac acggcgattg ctactggttt tgttgcttca taattttata ttaaatagta 15301 gcatggttta ataatagcaa aatagatgtt taactttgaa ttcaacttgc tgatacactt 15361 ttatctaaat ttttatgcct gggatgagta ttgctggtca ggatttataa aaattcattt 15421 aatttagaca agcagtttta acgagatgag tgagaaccaa gagtttcttg atcaactggt 15481 caagatgttc attaagagct ttaacaattc tatataacca aaaaaatgca gcttcacgac 15541 caattgagcg attttttcac tcagagcttg tttaactcgt cttgacatca agatatagtt 15601 ataatccaag gagcattaaa tttggaaatt ttcgcgcctt agaatgaact taagtctgac 15661 ttccagtgtc cctgtgtttt actctcgttc tccatcagcc caaatatata aattaattgt 15721 caagcagtga gcattttttc tggtaatctg gctattgaca catctggtgt taaacccatt 15781 gggtagtcat acaactaaaa ttttgatgat tatccttaaa ctataatgag tagctcttta 15841 atcatcagcc tattaacagc gagcttttaa ttaaaggcaa caagagtata cctcacttgt 15901 tggtgagcaa cccatgtgtg gaaattattg ccaccagatg ggtatagcaa tttaaaaact 15961 ttggttagtg gtcatggaag ttatttttaa gatcatacgg cagcaacaaa attcctcccc 16021 gattgtgcaa acttactctt tggatgttga aggggggaat acaatcctag attgtctcaa 16081 tcttattaag tgggagcaag atgggacatt ggcatttcgc aaaaattgtc gcaacacgat 16141 ttgtggtagc tgcgcgatgc gaatcaatgg gcgttcagct ttggcttgta aggaaaatgt 16201 tggcagtgaa attgccaaat tggaacaaac ggttacaaca gcaagtcatg caaacgggat 16261 tccagaaatc atcatcgcac cactaggcaa tatgcctgtg atcaaagatt tagtggtaga 16321 tatgagcagt ttttggaaca accttgagac ggttactcct tatgtgagta cagctgggcg 16381 taacatccca gaaagagagt ttttacaaac accccaagag cgatcgcgcc ttgatgaaac 16441 tggtaattgt atcatgtgtg gagcttgcta ctctgaatgc aatgctcgtg aagttgatcc 16501 aaattttgtt ggtccccacg cactggctaa agcttaccgc atggtagctg actctcgtga 16561 tgataaaacc 
gaaagtcgtt tagaagaata taacgaagga actaaaggag tttggggttg 16621 cacgcgttgt ttgtactgta attccgtatg tccaatggat gttgccccac tagatcaaat 16681 cacaaaaatc aagcaggaaa ttcttgatcg caaacaagca agcgacagcc gttcaattcg 16741 ccatcgcaaa gtattgatag agttagttaa agaaggcggc tggattgatg agcgtcaatt 16801 tggtttacaa gtcgtcggga actattttaa agacctcaaa ggattgctca gtcttgctcc 16861 ccttgggtta agaatgatag tgcgaggcaa gtttcctctt tcgtttgaac cttctgaagg 16921 aactcaacaa gtgcgatcgc ttattgaagc cgtcaagcaa gaagaaagta gagtgtagag 16981 atgagggaga tacggcgaga gtgatatagc ggttcccatt tgaatcacat acatcactac 17041 gaataatgta gagcacgtaa atactacgtc tctacagatc gtgtgtcttg cacgcaacac 17101 cagaagcatt attattgttc atttgcttcc ctatctccct catctggctc aacgtggatc 17161 ttgaggtatg ttcagcggtt gcccagaacg gttataaact aaaatatccc attgcccatt 17221 aacactagac tcaaaagcaa tcctactacc atctgcacta attgttggat gacgcacttc 17281 tgcttgcagg ttggcagtca aatttcttaa ttgacgtgtt tctgtatcat aaagaaaaat 17341 agctgagcga ccctgtctat tacctgcaaa cacgacataa cgaccacttt cagaaacacc 17401 aggatgagaa gcaacagtat caaaggaatt tagaccaggc aaatctacca aactgccagt 17461 gactctgtca aacatataaa tatcttgtct accccgtcga tcactggcaa aaacgatgta 17521 tcttcttgaa atttggggat ctaattccga agcagaacta ttaagactta gtccgcctgg 17581 atcgtaagga taattgacta tgcgtgggta accagcacag ccacttaaca aactagctgt 17641 agcaaatata ggtaaaaaaa tagaatgttt catagtcatg agtcatgagt cgaagactta 17701 agacttttac agcatccctc tcatggagat gcacccaccg atgcgccatt ggcaatatct 17761 aactccacat ccggtccccg gtctagaact tcaatatccc actgaccacg gctagcagtt 17821 tcaaagacaa cataacgccc atttgggctg atacttggat ttctgatcca gccacgatag 17881 gttggggtaa gaatttgcga ctgttgcgta gcgcgatcat aaagcgccac cactggtcta 17941 ccttggtcac tggtgatata agcaatgtaa cgtccggtgt agctgaggct aggactttca 18001 acaattgttt cccgttggtt taagcccgtt gtttgaataa attgttgctg ttccaagtca 18061 tataccagca gttggtgact accattgcga ttagagacaa atgctaaaaa acgactattt 18121 ccactcaacg ctggttgctg ttccgtgtaa cggctgttga gggaagttgg tccaagaggg 18181 acatcgctat aactacaaga tacaagaaaa cttgccaacg caatactgag accccaagga 
18241 atttgccttt ggatccaaaa actaggtgta attttgctca cctcaacttt tttcgttacg 18301 ctttcgcgtc ttgcacctaa tcttatttat agatttatag atcatcatca aatttcgatg 18361 agctatctcg atcttgatcc gaacgctcag aacggtctag tggtttgtat tctacatact 18421 cactagaaat gacttcatca tcctcacgcc ccctttgagg agcagattca ttgggcgaac 18481 ggcgttttct cggtcttgca ggaacatcct catcacgagt ttctggacgt gctgtaccat 18541 tactaccacg acgcggaggt cgtctttctt cggtcgttga actaccccac tcatcaactg 18601 ttctattgga accaccccaa tcttcatctt caatcctttc agaagggcgt acagaacgtc 18661 cagaactccg gcgacgggtt ttgtcagttt tgtcacttga tgtcggtctt tcgcttccat 18721 tgcgacgttc tgagcgacgg ggagtttcat cttcgtagta atcatcccga gatgaacgat 18781 tatccctgct acccgaaatt cgaccacgga cggggcgctc atcatagtca tcgtcataag 18841 gcagtggttc taattcagcg tctacctgtg ccttatactt tttgttgtac ctatagttgt 18901 cgctaacttc acgttcgtca tcaacaatag gagtgttacg cttagcttgc tgagtggcta 18961 taccccgcag gcgaatactt tccaccgcaa aaaatatagt tgagcctgcc aaaagtaact 19021 gaccaaattg taaaatcggg tcaagacgcc aaccttggaa tatgagaatg aagccgcaga 19081 gtaagcccac agcagcaaaa aagatatcct ggtcgcgtga gagctctgga cgcacagtgc 19141 ggagaaaata aagtgctgcc ccagccacag ccaagaaaat tcctaaaaca ctggctgagt 19201 ttgccccaaa attgacttga gccagaacac tggctgagtt cagcccaaac attagcatcg 19261 ttgtttacct aatatcaaaa tctatgttag cgagtcagaa ttgaaatctc tactgtaatt 19321 agacacaata tactgtgaag tagctttctg tatgcaacac attcgagtgg gtgatcaacc 19381 atctttatgg taacataaag tctagacttt gtgtagacac catgaaagcc aaaggattac 19441 atattcgcat atctgaacgt agactaaata agcttagact atatgcagag tggaaagaga 19501 aaaccatgac gcaattgatt gaagactgga ttgacagact accgacgccg gagactggta 19561 attcctcaat caccccactc cctctcaagg agtgaggatt gaatcgggga tggtgtttca 19621 taagtgtttg ctgtccatcc ccacccaaca tagcggttga gggtggggac ttatgcgaac 19681 tttttaagta cgtgaatttg ccacttgggc taagtgaagc aatacttacg cgtctattat 19741 gacccaaagc taacaagtct gtgtttgtct gctcagtaag cttagtataa gtattttttt 19801 ataatgaatt gctaagagtt agcaattgag cagtgaactg acggaaaatc agttttaaaa 19861 ctagaggcgt ggtatgccac gcccctacaa tagatcttat 
gaacgctgga ttttatcctt 19921 ctggctcaca aaaatcaatc ctactgtggc ggggacaaca acaattaatg caccccaaaa 19981 cagactccaa aggagatttg ctaaagatgg cgtcatagat ttcaaccttt ttttacatac 20041 tttataacaa ttctaatggt ctagtagacc gttgagaagc cactcatggc aaagggaaat 20101 gattctagtt gtgtaacagc ctttggttaa aattagagaa aggctttaca aaccttaaaa 20161 tctgctgatt ctgttctttc cctttattca caaaaaaagt caagcttaaa tcatgactgg 20221 tgctgacctt gccatttgga ttctcggtcc aatattagga ctgatgacat ttttgtttat 20281 tttccggatt attctgactt ggtatcccca agctaatttg aatcgtttgc cttttaattt 20341 ggtcgcttgg cctactgaac ctttcctagt accgttacga aagattgtgc cacctatggg 20401 tggggtggac attaccccaa tcatttgggt tggtatcttc agtcttttac gggaaattct 20461 tctgggtcag caaggactgc tgaccatgct gtctcgtgcc tagtaacaag gggtataggg 20521 gtataggggt g // LOCUS NODE_1572_length_20450_cov_5.07673420450 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20450 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 403..1008 /locus_tag="DP116_14160" CDS 403..1008 /locus_tag="DP116_14160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317903.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine acetyltransferase" /protein_id="PRJNA477356:DP116_14160" /translation="MNFSINYILQDWQVNQETSSKSRLILLMFRSTRILGNLPAPFSL FSSFYRALYQFVVEWILGVELPWDTQVGPNLKLLHCQGLVVNHETIIGMNCTLRHSTT IGNKILPDGTASGSPKIGNNVEIGSNVVIIGPITVGDNAVIGAGSVVVKDVPNRTVVV GNPARVIRTLNTPLSLMNNESQQASELASVSNHSNTDSLYR" gene 1050..2006 /locus_tag="DP116_14165" CDS 1050..2006 /locus_tag="DP116_14165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740285.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_14165" /translation="MVKVSIVIPAYNAMTYLPETVESVLRQTFSDFEVLIINDGSADN IVEWVSQLVDSRVKLISQTNQGVPKARNTGIANAQGEYIAFLDADDLWEPTMLEKQVR CLENNPAVGLVHTWMAVIDAQSQPTGRVMISNAEGDVWKQLVVQNTVPSSSVMVRRCC FDTVGGFDPNLRNIDDWDMWIRIAARYPFAVVKEPLMRYRMHLNNMTKNWQVVEEAFE IIIEKAFSCAPPELLYLKSRSYGHANMFLAWKAVQSGNRDYKKAIHFRQQAITHFPML RHSREYFRLSLAIAMLQWFGSNNYTKILSLLYTLRRRILSVT" gene 2377..3306 /locus_tag="DP116_14170" CDS 2377..3306 /locus_tag="DP116_14170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012165874.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14170" /translation="MPIGDRAVYRPANIQQAKYASADEEYFYKLKTSDPQRPVYSPGN FGPGRCTGTQSMGISLPVPDNLIVPDATSQPYYTPNNASAFLMPDGKTLEQFEPLARC TAGGSVHGWRNPWGGVDIYGDGIKGGHLGSGLSSIGGSIRKGELIKNQPIRHALKVVI WGEKYLHYSRTVPGYRWPAYIADSKAANQYHGTNPKLVQGTLLAIPHNVTAARLKLQT PAAKKLFQALQNYGAYVVDDAGWDAHYLCLEKGVLEEFRATYGYDFQTTNGKFYDDFM KLFKALYIVDNNAPKNIGGGGTPRVSLAPPIGN" gene 3546..4652 /gene="glf" /locus_tag="DP116_14175" CDS 3546..4652 /gene="glf" /locus_tag="DP116_14175" /EC_number="5.4.99.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006635084.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-galactopyranose mutase" /protein_id="PRJNA477356:DP116_14175" /translation="MFDYLIVGAGFAGSVLAERLASQSGKKVLIVDKRPHIGGNAYDH YDDSGLLVHKYGPHIFHTNSREVFDYLSFFTEWRPYEHRVLASVDGQVVPIPINLDTV NRLYGLNLTAFQVEEFFASVAEQKDYIRTSEDVVVSKVGRELYEKFFRGYTRKQWGLD PSELDRSVTARVPTRTNRDDRYFTDTYQAMPLHGYTRMFEKMLSHPNIKIMLNTDYRE IQEAIAYREMIYTGPIDEFFDYCYGKLPYRSLEFKHETLNKPVHQPAPVINYPNEHLY TRVTEFKYLTGQEHPKTSIVYEYPQAEGDPYYPIPRPENAELYKQYKALADATPSVHF VGRLATYKYYNMDQVVAQALTVYSQLVQRVLIKI" gene complement(4692..5093) /locus_tag="DP116_14180" CDS complement(4692..5093) /locus_tag="DP116_14180" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14180" /translation="MANNKFFPSYFPIAQSKRIYKFFATAILVFAMGYPSSAQAVELN DKDFNNAFGQQSLEPEINTVGEQPLVVAQRYNRRRPQVRRTIIIGPRRRYRVRPIRRS RVQRVYRPVRRYRVQPVRRYNRYGNQRVYVR" gene 5301..6935 /locus_tag="DP116_14185" CDS 5301..6935 /locus_tag="DP116_14185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-D-glucose phosphate-specific phosphoglucomutase" /protein_id="PRJNA477356:DP116_14185" /translation="MVIKTIQTQPFTDQKPGTSGLRKKVSVFQKPHYLENFVQSIFDS LEGYQGQTLVLGGDGRYYNRQAIQIILKMAAANGFGRVLVGQGGILSTPATSAIIRKY DTFGGIILSASHNPGGPDRDFGIKYNISNGGPAPEKVTEAISNRSKEINSYKIFEAPD VNLDTLGEIKIGDMLVEVIDSVADYAELMQSLFDFDRIRQLVTNGQFRMCVDSLHAVT GPYAHNIFEERLGAPPGTVTNGKPLEDFGGGHPDPNLVYAHDLVEILFGNNAPDFGAA SDGDGDRNMILGRQFFVTPSDSLAILAANAKLVPGYSSGLAGIARSMPTSQAPDRVAK QLGIECYETPTGWKFFGNLLDADRATLCGEESFGTGSNHIREKDGLWAVLFWLNILAV RQQSVEQIVTEHWQTYGRNYYSRHDYEQVDSDRANTLITSVRAMLPTLKGKQYGSYQV EYADDFSYTDPIDGSVSQKQGVRIGFTDGSRIVFRLSGTGTQGATLRLYLESYESDPA KQNHDPQEALAELITIADEIAQIHKLTGMDKPTVIT" gene 7323..9251 /locus_tag="DP116_14190" CDS 7323..9251 /locus_tag="DP116_14190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017804461.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="protein kinase" /protein_id="PRJNA477356:DP116_14190" /translation="MMFVGTVLRNCYKIVRLLGSGGFGDTYLAENLDLPGHPLCVVKH LKPRDPNPDVLQIARRLFESEALVLYKLGHENNQIPRLFAHFEENGEFYLVQEYIEGS DLSSEVTVGKRWSEQEVTQLLREILEVLTVVHKQNIIHRDIKPQNLMRRREDRKIILI DFGAVKQISTLVNIQGQTSASVAIGTPGYAPNEQAAGYPKLSSDVHAVGMLAIFALTG IKPHELPRDPTNGEVVWRNWANVSERFADILTKMVRYHFSERYQSAAEALSALPAPPK PKPQPKSQSTPIPAPLPTSQPTLQSTPMPRRQVIQMLGLIGTGVGLAIVGQQLLQGVF RRRDDVVETPTPSPSIRSDISSTSSPTPKQLATPRSVSLQTFNFETVSVDAQGNINNR SNRQAKYFAQDLGNGVTLEMVQIPGGTFLMGSPPGEKQRESNEGPQHQVTVPGFFMGK YEVTQAQYQAIMGNNPSNFKGEKRPVEKVSWNDAVELCQRLSQKTGRTYRLPSEAEWE YAARAGTTTPFYFGETITTDLANYNGTKTYASEPKGQYRRQTTNVGSFPPNAFGLYDM HGNVYEWCQDDWHDTYQSARTDGSAWLTQGSTNVLKLVRGGSWGLSLGLCRSAFRYRV YPGLRYYVIGFRVVCVVA" gene 9677..10897 /locus_tag="DP116_14195" CDS 9677..10897 /locus_tag="DP116_14195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acetate kinase" /protein_id="PRJNA477356:DP116_14195" /translation="MKILVLNAGSSSQKSCVYEITGNTLPEEPLKPLWEAKVDWTHQQ GFAELEVKTAKGEQLQEKIPADSRTQVMAHMLDTLCKGSTQVISQPSEIDVVGHRVVH GGQDYRESVVITEDVKQAIARLAELAPDHNPANLEGIEAIEQHLGTLTQIAAFDTAFH SHLPDAAAIYPGPYEWIEQGIRRYGFHGISHQYCAKRATRILGRDLASVRLITCHLGN GCSLAAIQNGRSIDTTMGFTPLDGLMMGSRSGAVDPGILIHLLRQSDNSVDELAKVLN KASGLRGISGISNDMREIREAISQGNSRAQLAWDIYVHRLRSYIGGMLASLGGLDALV FTGGVGENNPEIRAAACEAFAFLGLKLDHEKNAHKPVDVDIATPDSTVRVLVIHTEED WEIACECWRLLEKK" gene 10979..11449 /locus_tag="DP116_14200" CDS 10979..11449 /locus_tag="DP116_14200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309703.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14200" /translation="MFTLTSCDSLTTNQYEATALTTYTWQVKYADDLANEQVPRFETF ATTSLLNRNGLKPEGAVTGPDDKGLWWSSLPPRPSVDEIEQRKKSQEAGSPELLKSVK YELKYKVGEDQRKLPTNYQVYREVVKAYPSQTSLQLTLGLDNNSVEKAEPVGSK" gene complement(11552..12352) /locus_tag="DP116_14205" CDS complement(11552..12352) /locus_tag="DP116_14205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410157.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14205" /translation="MSDYLDDYELNNLDFYRKNHGLQLYYNWSGEIWWQEKPDLPREK LFSIIGMNATKVFIKTSPEYGEVGYRINRELGLFCDPVTQEILTYWKSPEANQALPVV HIANRIVQGSVKPKKFVIPKGTGYITSVMEIPLEYPHPLAGDSKYSDYCPGEKFQGVE YFTSNICRPDASDVPPAKWARDCPWMPWMKLGYGHPAKLRFETTISRVDSFEQLHPKL VNLVREKVPIYEFTPTESDEPNMTSIQYFKKHFESYLKGDIFPIEETS" gene 12550..13347 /locus_tag="DP116_14210" CDS 12550..13347 /locus_tag="DP116_14210" /inference="COORDINATES: protein motif:HMM:PF00805.20,HMM:PF13599.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14210" /translation="MHFIFINIHNSTVMVDVDYLTLLELGVEAWNHWRKNNPDIKPNL SYVELSATLDDINLSHTDLKYAKIHSAELQNANLTKANLRGADFSIVDLSGADLSYAD LSDGLFLGSVEFCGANLSGVNFTSADLGMCVINYGLFFRASLSGVNLSHANLTQASLR FADLNGANLCGVDFSGADLCAADLRGANLFGANLSKADLRGADFADANLTYANLSEAK LVGSPNFGKLGMYDFSYLNDRTLNLSNANLNGADFSRADLTDLKQTV" gene 14048..14848 /locus_tag="DP116_14215" CDS 14048..14848 /locus_tag="DP116_14215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867388.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4-amino-4-deoxychorismate lyase" /protein_id="PRJNA477356:DP116_14215" /translation="MFIYWFNGKLIESRTLELEIDDPGLLYGATVFTTLRVYNNSLDS RLTHWRCHCDRLKFSLQTFGWQMPDEERLRQGAEIIMTHFPVLRIAVFPDGREWITGR LLPKNLTQNQKNGLIATLAESEFARSLPSHKTGNYLSAWLAKANAQKLDAQEVILVDT VGNWLETSTGNLWGWRNGSWWTPPLTAGILPGVVRGQLVDWLLKKQQVVREEPWTPEL VKGFEAIAYSNSVVETVPIHTVIQPTGKLEYNPHHGCFQQLRMFFIAL" gene 14956..16827 /locus_tag="DP116_14220" CDS 14956..16827 /locus_tag="DP116_14220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015079544.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division protein FtsH" /protein_id="PRJNA477356:DP116_14220" /translation="MNNKRWRNAGLYALLFIVVIALGTAFFDNKQPQGRKTWRYSEFI QAVEKSKIKPNGENDTFAQVALSADRSMADVKLDDSRRFVVTLVNDPDLINTLTAKNV DISVLPQTDDTFWFKALSSLFFPVLLLVGLFFLLRRAQNGPGSQAMNFGKSKARVQME PQTQVTFNDVAGIDQAKLELNEVVDFLKNADRFTAVGAKIPKGVLLVGPPGTGKTLLA RAVAGEAGVPFFSISGSEFVEMFVGVGASRVRDLFEQAKTNAPCIVFIDEIDAVGRQR GAGLGGGNDEREQTLNQLLTEMDGFEGNTGIIIIAATNRPDVLDAALLRPGRFDRQVV VDRPDYAGRVEILKVHARGKTLAKDVDLERIARRTPGFTGADLSNLLNEAAILAARRN LTEISMDEINDAIDRVLAGPEKKDRVMSEKRKELVAYHEAGHALVGALMPDYDPVQKI SIIPRGRAGGLTWFTPSEDRMDTGLYSRAYLENQMAVALGGRLAEEIIFGEEEVTTGA AQDLQQVARVARQMVTRFGMSDRLGPVALGRQQGNMFLGRDIMAERDFSEETAAAIDE EVRTLVETAYQRSKEVLENNRHILDQIAQMLVEKETVDADELQELLANNDVKTATFA" gene 17310..17795 /locus_tag="DP116_14225" CDS 17310..17795 /locus_tag="DP116_14225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874069.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14225" /translation="MPRSKILTTSFHDQGVYPCPVCRVGKISHMPLMEAMSCDFCQQI FTANVEEQQIKMPSRQPPLIWHWNGFNWTEAQIEGVELGWGYVIAAVVFVLLPTVLIG IVAYHFPPHPETPLSWVPYVWTALTFLLHLAIIVWLLIEVYQIPVGAYWRAIQQRFLG R" gene 18300..19010 /locus_tag="DP116_14230" CDS 18300..19010 /locus_tag="DP116_14230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316802.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_14230" /translation="MSEFEQYYRVLELEPGATFEEVTQAYKDLAFVWHPDRLPKDNSR LQEKAQKKLQEINEAREQLRLSKMKYQRLHYSASHYSAPSAQKQPTEKTYQHSAPSAQ KQPTEKSYQPPHPNPDLSGKDYSRANLQNKDLSGRNMSYANLSGANLSDTFMHKVNLR GANLSEANLFRANLLLADLREANFRGANLVGADLSGADLRGANFMGARMKSGDRLLVK LVGANLAGAIMPDGKIYE" gene 19020..19844 /locus_tag="DP116_14235" CDS 19020..19844 /locus_tag="DP116_14235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877406.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SDR family NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_14235" /translation="MNVAIIGCGYVGCTIARYWQQKMTFVVTATTTTPERVPALQAVA QRVEVVKGNDPEGLKSVLKNQDVVLLSVGAKSADVYEETYLHTAQSLVSLLKQTPSIQ QLIYTGSYAVYGDRQGAWVDEESPPAPANQNGQIITDTEQVLLSASSANLRVCILRLG GIYGPGRELVKIFGRYAGATRPGNGKDTTNWIHLDDIVAAIEFARRTQLQGIYNLVDD AHLTTGELLEGVFETHNLPKVTWDSSQESKRPYNAKVSNKKMKDAGYKLIHPQILF" gene 19865..>20450 /locus_tag="DP116_14240" CDS 19865..>20450 /locus_tag="DP116_14240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_14240" /translation="MIYQEKTVSPTQFYNWKNYRCAYEVDNPSQSTPEGIPLLLIHPI GVGLSRQFWQRFSSEWYKQGRRNPIYNPDLLGCGESDMPHVAYTPSDWAEQLQQFLKT VVQKPVVLVAQGALFPVAIELVQKEPNLIAGLVLAGPPAWAVITKKTPEWQDKVIWNL LDSPFGNTFYRYARGEKFLRDFSTSQLFDSADAVD" BASE COUNT 6013 a 4299 c 4520 g 5618 t ORIGIN 1 gtaatatata tattatggct aagcttttac atcttaaatc tcactaacct atctctataa 61 aaacagttac ccacaagtag actgcttatc cttattgttt agtagagagt ccgacgcctc 121 aaaaatggct tacagtggat agccaaacct ccttcgatat aactcaaaaa aacgattgca 181 aatattttgc aattttgttt gcaagctcat aaagtttatc atcttagttt aaatttaata 241 gcctgagtcg aggtaaaaaa taataggtca actatttttt ttgtcaaaga actacctgaa 301 ttctgtaagc tggttgaatt ttgacttttg gaaaacgagg gagtggaact ataaacacac 361 gctgaaatta tttatttgtt gagtaaggag atgtcataaa gcatgaattt ttctattaat 421 tatatactgc aagattggca agttaatcag gaaacaagct caaaatcacg tttgatatta 481 ttgatgtttc gctctacaag aatacttggg aatctgcctg cgcctttttc tttgttttct 541 agtttttatc gagctttgta ccagtttgta gttgagtgga tattaggggt tgaattacct 601 tgggatacac aggtaggacc aaatcttaag ctacttcatt gtcaaggttt ggtcgtcaat 661 catgaaacga taataggtat gaattgtacc ttgaggcatt caaccactat tggaaataag 721 atactgccag atggtacagc cagtggatct ccaaaaattg ggaataacgt agaaataggt 781 tccaatgttg tgattatcgg accaattaca gttggggata atgctgtgat tggagctggt 841 tctgttgtgg tgaaagacgt tcctaaccgt accgtggttg taggaaatcc agcaagagtt 901 attcgcacac tcaatacccc tttgtctttg atgaacaacg aatcacagca agcttcagaa 961 ctggcatcag tatcaaatca ttctaatacc gatagtctgt atagataatt tactattctt 1021 gattagcaaa atgctgagga agaccgaaaa tggtcaaggt ttctattgtt attccagcct 1081 ataatgctat gacctacctc cctgaaacag tggagagcgt tctgaggcag acctttagtg 1141 attttgaagt attaattatt aatgatggta gcgcagacaa tattgtcgaa tgggtttctc 1201 agctagtaga ttcacgagtg aaactgattt cccaaacaaa tcagggcgta cccaaagcac 1261 gcaacacagg tattgctaat gctcaaggag agtatatagc attcttggac gctgatgatt 1321 tgtgggagcc gaccatgcta gaaaaacaag tgcgttgtct ggaaaataat cccgcagtag 1381 gcttggtgca 
tacttggatg gctgttattg atgcacaaag tcagcccaca ggtagagtca 1441 tgatctcaaa tgccgagggc gatgtatgga aacaactcgt ggtgcaaaat acagtacctt 1501 cctcttcagt catggttcgt cgctgttgtt ttgacacggt tgggggattt gacccaaatt 1561 tacgcaacat tgacgattgg gatatgtgga ttcgtattgc agctcgttat ccctttgcag 1621 tggtcaaaga acctttaatg cgctatcgaa tgcaccttaa taatatgact aaaaactggc 1681 aagtggtgga agaagctttt gagatcatta ttgagaaagc attcagctgt gcaccgcccg 1741 aactactgta tttaaaaagc cgaagctatg gtcatgccaa tatgttttta gcttggaaag 1801 ccgtgcaaag tggtaataga gactacaaaa aggcgattca ttttcggcaa caagccatta 1861 ctcacttccc tatgctgcgt cactcgcggg aatactttcg cttgagtttg gcgatcgcca 1921 tgctgcaatg gttcggctct aataattaca ctaaaatact gtcgctgctt tacactctgc 1981 gccgacgcat attaagtgtt acctaacaac atcgcacttc ttgcaccaag tgattacaga 2041 atatcgaaca ctggtcttta tagcttgacc tgcgtattat ctgacaaact atcctagttg 2101 ttgttaggag taaaggagca tactaacttg acgaccaaca aagctcgtaa atatacggta 2161 tttttacggc aactagggca gaagaagcgt cacttaagca atataacgtg tgttttctta 2221 acgatatttt tggttagctt gctctcgtat ctaagttatg cttctaccac cggcgttacc 2281 atgagccaga acaccaacca cattgccaaa tctaccacca ctcaaggcaa acgcgacaaa 2341 tggctatggc ctttcgcatc caattctata tggaatatgc cgattggcga tcgcgcagtt 2401 tatcgaccag caaatattca acaggcaaaa tatgcttcag cagatgagga atacttctac 2461 aaactgaaga ctagcgatcc ccaacgtccg gtttacagtc ctggcaattt tggaccagga 2521 cgctgtacag ggactcaatc tatgggaatt tcactacctg tgccagacaa cttaattgtt 2581 ccagatgcca caagtcaacc atattataca cccaacaatg cctcagcctt tctcatgccc 2641 gatggtaaaa ctcttgaaca gtttgaacca ctagcacgct gtacagcagg aggatctgtt 2701 catgggtggc gtaacccttg gggcggtgtt gatatttatg gcgatggtat caaaggcgga 2761 cacctcggtt ctggtctctc ttctattggt gggtctattc gcaaaggtga acttataaaa 2821 aatcaaccga tccgccatgc attaaaagtt gttatctggg gagaaaaata ccttcactac 2881 tctcgtactg ttccaggata tcgttggcct gcctacattg ccgacagcaa agccgccaat 2941 caatatcacg gtactaaccc caagttggtg caagggacac ttctggcaat tccccacaat 3001 gtcactgcag caagattaaa gctgcaaaca ccagctgcta aaaagctgtt tcaggcatta 3061 caaaattatg gtgcttacgt 
tgttgatgat gcaggttggg acgcacatta cttatgtcta 3121 gaaaaaggag ttcttgaaga attccgggct acttatggtt atgacttcca aactactaat 3181 ggaaaatttt acgacgactt tatgaagcta tttaaagcac tttacatcgt cgataacaac 3241 gcgccgaaaa atatcggtgg tggtgggact ccacgagttt ccctagcacc tccaatcggt 3301 aactgaccaa tcagttatca gttatcaatt atcagttatc agttatcaat tatcgatttg 3361 ataattgttc attgttcatt agacatctga gtaaaacaat gtagagacgt tccatggaac 3421 gtctctacaa gggttacaga taacgcacaa ttaatttctg gagatgtcta ttgatgttga 3481 cctgaacttg caaatcattg cttctgcttg tacatatcca aatatttaaa aggtggaata 3541 cttaaatgtt tgattactta attgttggcg caggatttgc tggaagcgtc cttgccgaaa 3601 gactagcaag tcagtctgga aaaaaagttc tgattgttga caaacgtcct catattggcg 3661 gtaacgccta cgatcactac gacgattcag gcttactcgt acacaaatac ggtccccaca 3721 tctttcacac caactcccgc gaagtcttcg attatctttc attcttcacc gagtggcgac 3781 cttacgaaca ccgcgttttg gctagcgtag atggtcaagt tgtgcccatc ccaatcaacc 3841 ttgacaccgt taatcggtta tacggattga atctcaccgc gttccaagta gaggagttct 3901 tcgcttcagt cgctgaacaa aaagactaca ttcgcacctc agaagatgtg gtggtaagca 3961 aagttggtcg agaactgtac gagaaatttt tccgtggcta cacccgtaag caatggggac 4021 tcgatccatc ggaactcgac agatcagtca ccgcccgtgt ccctacccgc acaaaccgcg 4081 acgatcgcta tttcacagac acttatcaag caatgcctct acatggttat acgcggatgt 4141 ttgagaagat gttgtctcac cccaacatca agattatgct taacaccgat tatcgggaga 4201 ttcaagaggc aatcgcctac cgcgagatga tctacactgg tcccatcgac gagttcttcg 4261 attactgcta tggaaaacta ccctaccgtt ccctagagtt caagcatgaa accctaaaca 4321 aacctgtgca tcagcctgca ccagtgataa attacccgaa cgaacatcta tacacccgcg 4381 tcaccgagtt caagtacctg acgggacagg aacaccctaa aactagtatt gtctacgagt 4441 acccgcaagc agaaggagat ccctactacc ccataccgcg tccagaaaac gctgaactct 4501 acaaacaata caaagcacta gccgacgcaa ccccaagcgt gcatttcgtg ggacggttgg 4561 caacctacaa gtattacaat atggatcaag ttgtggctca ggcactaaca gtctactccc 4621 agcttgttca acgggtactc atcaagattt aactcatccg ccggctataa aaacacgaag 4681 ccgtgggagt atcaacgtac atacacccgt tgatttccat atctgttgta tcgccgaact 4741 ggttgaacac gatatcgccg aactggtcga 
tacactcgtt gaacacgcga tcgccgaatt 4801 ggtcgaacac gatatcgccg acgaggtcct attatgattg ttctgcgaac ttggggtcgt 4861 cgtctattgt atctttgagc aacgactaag ggttgttctc caactgtgtt tatctctggt 4921 tctaaacttt gctgtccaaa agcattgtta aaatccttat catttaactc aactgcttga 4981 gcagatgacg gataacccat agcaaacacc aatatcgctg tggcgaagaa cttgtatatc 5041 cgttttgact gggcgatagg gaaataagat gggaaaaatt tattattagc cattgtaaat 5101 ttctctatta ggcgagatac ctaaacatat cgctagttga cactcccctg gaagacagcc 5161 taagtaagat tcaagctaca agagattcaa ccgtaagaag gaaattaaaa ttcaactaga 5221 ggttgttgca gcaacatctg cttaagtagt gttaggacta tgaagcaatt cttgttgtga 5281 actctgcctc aaatatcacg atggttatca aaactataca aacccagccg tttaccgacc 5341 aaaaaccagg tacatctgga ctcagaaaga aggtttctgt cttccagaag cctcattact 5401 tggaaaactt tgtgcagtcg atctttgaca gcctggaagg atatcaaggt cagactttgg 5461 ttttaggagg tgatggtcgt tactacaatc gtcaagcaat tcaaattatc ctgaaaatgg 5521 cagccgctaa tggctttggg cgagtgctcg tcggtcaagg cggtattctt tctaccccgg 5581 ctacctctgc cattattcgc aagtatgaca catttggtgg tataatcctc tctgccagtc 5641 acaatcctgg tgggccagat agagactttg gtataaagta caatattagc aatggtggac 5701 cggctccaga aaaagtcacg gaagccattt ctaatcgtag taaagagatt aatagctaca 5761 aaatttttga ggctcctgat gttaatcttg atactttagg agagatcaaa ataggcgata 5821 tgctggtgga ggtgattgac tctgtcgcag actacgcaga attgatgcag tcgctgtttg 5881 attttgaccg catccgccaa ctggtgacca atggacagtt ccgtatgtgt gttgactcct 5941 tacatgctgt gactggtccc tatgctcata atatttttga ggaacgtttg ggcgcaccgc 6001 cgggaactgt cacgaatggt aaacctttag aagattttgg cggtggacat cctgatccca 6061 accttgttta tgctcatgat ttggtggaaa ttctgtttgg gaacaatgcg cctgactttg 6121 gtgctgcttc cgatggagat ggcgatcgca acatgattct cggacgccaa ttctttgtca 6181 ctcctagcga tagcttagct attttagcag caaatgccaa attggttcct gggtatagct 6241 ctggtttagc aggtatcgcc cgctcaatgc ctacgagtca agcaccagac cgtgtggcaa 6301 aacaactcgg tattgaatgt tatgaaactc ccacaggctg gaagttcttt ggtaatttgt 6361 tagatgcaga tagagcaacc ctttgcggtg aggagagttt cggtacaggt tccaaccata 6421 tccgcgaaaa agatggactt tgggctgtgt tgttctggct 
gaatattctc gcagtgcggc 6481 aacaatcagt agagcagatt gtcacagaac attggcaaac ctacgggcgt aattattact 6541 cgcgtcatga ttatgaacaa gtcgatagcg atcgcgccaa cactctcata acaagtgtgc 6601 gtgctatgct gccaacactc aaaggaaagc agtatggttc ctaccaggtt gagtacgctg 6661 acgatttcag ttacaccgac ccgattgatg gtagcgttag tcaaaagcaa ggagttcgca 6721 ttggcttcac cgatggttcg cggattgtct tccggctatc aggtactggt acacaaggtg 6781 caacactaag gttatatctg gaaagctacg aaagcgatcc agcaaagcag aatcatgatc 6841 cacaagaagc actagcggaa cttatcacga ttgcagatga gattgcccag attcataagt 6901 tgactggaat ggataagcca actgtgatta cctgatatca aatcagtgaa cagttatcag 6961 tgaacagtta ccagctctgt cgttcccatg ctgagcatgg gaatgcactt ctctcgttcc 7021 catgctcagc atgggaatgc actatcggag gctctgcctc ccaattatta tattgaggca 7081 gagcctcaag ttatgcattc ccgaacagag gcaaggaacg agaaaatctt tccctctcct 7141 ttataaggag agggatgccc gacagggcag ggtgaggttc ccaagtctcc tttataagga 7201 gagagatgcc cgacagggca gggtgaggtt cccaagctcg gcaagaaaat gtacacttta 7261 attaatcgct agcttgttca gtaagcaatt tggcaaaata agatcatcgg atataaaaaa 7321 acctgatgtt tgtgggcaca gttctccgta attgctacaa aatcgtcaga ctgctaggaa 7381 gtggtggttt tggtgatact tatctggcag aaaacttgga tttacctgga catccattgt 7441 gtgttgtcaa acatcttaaa ccaagagacc cgaacccaga tgttttacaa atagctagac 7501 gactgtttga aagtgaagca ctggttttat ataaattagg tcatgaaaac aaccaaattc 7561 cgagactgtt tgctcacttt gaggaaaatg gtgaattcta tctggtgcag gaatatattg 7621 aaggaagtga tttaagtagt gaagtcacag ttggtaagag gtggagtgaa caggaagtca 7681 ctcaactttt gcgagagatt ttggaagttc taacagtcgt tcacaagcaa aatattatcc 7741 accgagatat caagccgcaa aatctcatgc gtcgtcgcga agatcgcaag ataatattga 7801 ttgactttgg cgcagtcaaa caaatcagca ccttagtaaa tattcaagga caaaccagcg 7861 cttcagttgc cattggtact ccgggttatg cgcctaacga acaagctgcc ggatatccaa 7921 aactatcgag tgatgtccat gcggtgggaa tgctggctat ttttgccctc acaggtataa 7981 agcctcacga attgccaaga gatcccacga atggcgaagt cgtttggcgg aattgggcaa 8041 atgttagcga gagattcgct gatattttaa cgaagatggt gcgctatcac ttcagtgaac 8101 gctatcagtc agcagcagaa gcgttgtcag cacttccagc accaccaaaa 
ccaaaaccac 8161 aaccaaaatc acaatcaacg ccaatacctg caccgctacc aacatcgcaa ccgacactgc 8221 aatcaacgcc aatgccgcga cggcaagtca ttcaaatgtt agggttgata ggaactggag 8281 ttggtttggc tattgtcgga caacagcttt tgcaaggtgt tttcagacgg agagacgatg 8341 ttgtcgagac accaacacct tctcccagca taagatctga tatttcttct acatcatcac 8401 caacaccgaa acaattagcc acaccacgaa gtgtatcctt acaaaccttt aattttgaaa 8461 ctgtcagtgt tgatgcacag ggaaatatta acaaccgcag caaccgtcag gcgaaatact 8521 tcgcgcaaga cttggggaat ggtgtcacct tggagatggt acagataccg ggtggtacgt 8581 ttctcatggg ttcaccacca ggggaaaaac aaagagagtc aaacgaagga ccacaacatc 8641 aagtcacagt tcctgggttt tttatgggca agtatgaagt gactcaggct cagtatcagg 8701 caattatggg taacaaccct tccaacttta aaggagagaa gcgtccagtg gaaaaagtta 8761 gttggaatga tgctgtggaa ttgtgccaac gcttaagtca gaagacggga cgcacctaca 8821 gactacccag cgaggcagaa tgggaatatg cagctcgtgc aggaacgact acacctttct 8881 actttggcga aacgattaca actgatttag caaattacaa cggaacgaaa acttacgcct 8941 ctgaaccaaa aggtcaatat cgccgacaga caacaaacgt ggggagtttt ccacccaacg 9001 cttttggttt atatgatatg cacggtaatg tttatgagtg gtgtcaggat gattggcatg 9061 acacttacca aagcgcacgc actgatggta gtgcatggtt aacgcaaggt agtacaaatg 9121 ttctaaagct ggttcgtggc ggttcgtggg gcctcagtct agggctttgc cgctcggcgt 9181 ttcgctatag ggtctacccc ggccttcgtt actacgttat cggttttcgg gtggtgtgcg 9241 tggttgcgtg aggacttcct tgccctttgc tcttttgctc ttttgctctt tacacttctt 9301 tactttttcc ttcttccgcc gcaggcgaat aaaaaatttt ttataaaaac aggacttacg 9361 cacgaacaat gtaactatag cggttctcgg ttgggtgcag tacgcttttt gacctcaccc 9421 cttacccctc tcgaaatttc ctcgaaacct caccctgccc ttcgggcatc cctctcctta 9481 gtaaggagag ggaaagattt tagcgaagct aaaagcgagg gtgaggtttt ggcgtcagac 9541 aaaaccgggg tggggttttt acaccaatgc ctaaaaaaat attcgcgaca cagcacatta 9601 gacagatgtt cattaagtaa aaccatctca taattattag cgccataacc actctaaaaa 9661 tcagtttctg ctgactatga aaatactggt actaaatgcc ggatcgagca gtcaaaagag 9721 ttgtgtgtac gaaattacag gtaacactct ccccgaagaa cctctcaaac ccctttggga 9781 agcaaaagta gactggactc accaacaggg ttttgcagaa ctcgaagtta aaacagccaa 
9841 gggtgaacaa ttacaagaga aaattcccgc agattctcgc actcaagtta tggctcatat 9901 gctggatacc ctgtgcaagg gttctacgca ggtgattagt cagccgtcag aaattgatgt 9961 agtgggtcat cgcgttgtac atggtggaca agattaccgt gagagtgtgg tgattactga 10021 agatgtgaaa caggcgatcg cccgtctggc agaactcgct cctgaccata accccgctaa 10081 tttggaagga atagaagcca ttgagcaaca tttaggaaca ctcacccaaa tagcagcatt 10141 tgatactgca tttcattctc atctccctga tgccgcagcc atctaccccg gtccctacga 10201 gtggatagaa caaggtatcc gtcgctacgg atttcatggc atcagtcatc agtactgtgc 10261 taagcgtgcg actcgcatcc tcggtcgaga tttagcatct gtgcggttaa tcacatgtca 10321 tttgggcaat ggttgctctt tagccgcgat tcaaaacggt cgcagcattg acacaacaat 10381 ggggttcaca ccactagatg gattgatgat gggcagccgt tccggtgctg ttgacccagg 10441 aattctcatt cacctgttgc ggcaatctga taactctgtt gacgagctag caaaagtgct 10501 aaataaggct tctggtttac gtggaatttc tggtatttct aatgatatgc gagaaattag 10561 ggaagcaatt tcccaaggca attctcgcgc tcaattggcg tgggatattt acgtacatcg 10621 cctgcggtct tatattggtg gaatgcttgc tagcttggga ggattagatg ccttagtgtt 10681 taccggtggt gtgggcgaaa ataatcctga gattcgcgcc gcagcttgtg aagcttttgc 10741 atttctggga ctaaaacttg accacgagaa aaatgcacac aagcctgttg atgtagatat 10801 tgccactcct gattctaccg tgcgggtatt ggtgattcat acggaagaag attgggaaat 10861 tgcttgtgaa tgttggcgac tgttggaaaa aaagtagatt ttacaaaagc agatggttga 10921 tactaaaaag ataatacgac agccaatctt tttcctcaca gcagtagctg tgttgacaat 10981 gttcacactg actagttgcg acagcttaac cactaaccag tatgaagcca cagccctcac 11041 cacttatacc tggcaagtta agtatgccga tgacctcgcc aatgaacaag taccacgctt 11101 tgaaactttt gccaccactt ccttacttaa tcgtaacggc ttgaagccag aaggagcagt 11161 gactggtcct gatgataagg gtttatggtg gtctagttta ccaccgcgac cctcagttga 11221 tgaaatcgaa caacgcaaaa aatcccaaga ggcaggtagt cctgaattgt tgaaatctgt 11281 gaagtacgaa ctcaagtata aagtgggaga ggatcaaaga aaactaccca cgaactatca 11341 ggtttatcga gaggtggtga aggcgtatcc ctcacagact tctttgcaat taactttggg 11401 attggataat aactcagttg agaaagctga acctgtgggt agcaaataaa attagcttat 11461 gaattaaatc agcgaccagt taccagttat cagttaccag 
ttaccagtta ccagttttca 11521 ctgttcactg ttcactgtta agcgttccct ctcaagaggt ttcctcaatt gggaaaatat 11581 cgccctttaa gtaggattca aaatgtttct taaagtactg aatactcgtc atattcggtt 11641 catcactttc tgtgggtgta aattcataaa ttggtacttt ttctctcacc aggttgacca 11701 atttaggatg aagttgctca aaagaatcga ctctagatat tgttgtttca aaccttaatt 11761 tagctggatg accatatcct aacttcatcc aaggcatcca aggacaatcc cttgcccact 11821 tcgcaggagg gacatcagaa gcatcgggtc ggcaaatatt tgatgtgaaa tattcaactc 11881 cctgaaactt ttccccagga caataatccg agtatttact atctcctgct aacggatgcg 11941 gatattccaa aggaatttcc attacagagg tgatatatcc agtacctttg ggtatgacaa 12001 atttttttgg tttaactgag ccttgtacaa ttcggttagc aatatgtacg actggtagtg 12061 cttgatttgc ttctggtgac ttccagtaag tcagaatctc ttgcgtcacg gggtcacaaa 12121 ataaacccaa ttctctattg atacgatacc caacttcgcc gtattcagga ctagtcttga 12181 taaagacttt cgttgcattc atgccaataa tagaaaacaa tttctccctt ggcaagtctg 12241 gtttttcttg ccaccaaatt tccccagacc agttgtaata aagctgtaac ccatgatttt 12301 ttctataaaa atcaaggtta ttaagttcgt agtcatccaa gtaatcactc atgtttgttc 12361 cattcatcct gaaagacttt tgctcgcaat tatcacctaa agcgcgttgg atctcattca 12421 aaattcacgc atcacatctt agaacacaga ttaaagcaat ggaaaattta ttgtaaaatt 12481 tggccaaact tctacgtaaa attcgggatt gcagatagtt gcatagaata gctatcttgt 12541 actcattata tgcattttat ctttataaat atccacaata gcaccgttat ggtagatgtc 12601 gattatttga cgctacttga gttaggagtt gaagcttgga accattggag gaaaaacaac 12661 cccgacatca agccgaacct aagttatgta gaattatctg caactttaga tgatataaac 12721 ctcagtcaca cagacctcaa atatgcaaaa attcactctg cagaactaca aaatgcaaat 12781 cttaccaagg caaacttaag aggagctgat tttagcattg tagatttaag tggcgcagac 12841 ctcagttatg cagatctcag tgatggacta tttttaggtt cagttgagtt ttgtggagcc 12901 aacttgagtg gagtaaactt tacaagtgca gacttaggca tgtgcgttat caattatggt 12961 cttttttttc gggctagtct cagtggagtg aacctctctc atgcaaatct gactcaggct 13021 agtttgagat ttgcagacct taacggcgca aacctctgtg gggtagactt ttctggagca 13081 gatctttgtg ctgcagattt gagaggagca aacctttttg gtgcaaacct cagcaaggca 13141 gatctcagag gcgcagattt 
cgctgatgct aatctcacat acgcaaacct aagcgaagcg 13201 aagttggtgg ggtcaccaaa ttttggaaaa ctgggaatgt atgatttcag ttatctgaat 13261 gatagaacgc tcaacttgag caatgccaac cttaatggtg ctgattttag tagagctgac 13321 ttgacagatc ttaaacagac agtgtgacga taatcacaag tttgcagtaa tacgtagcac 13381 ttgacaactg tgagatttaa caataaacaa cagtgactgc gacacttccc ttacatcttc 13441 agttcgggga tatttattga taacacactt atcaatggat cgtcaagacc gaacttacca 13501 aaccaattca caaagtaaca taagttgagg caacaattat gaaaaaacta ttaaacttca 13561 tttataaatt atttggttta gaaaccgaag aatcgcgttt atcagattct tacaaggatt 13621 aaattccgaa taagtcttgc aagcagtatc gttgctgtaa ctgaatacga cttggagttg 13681 aaatagtagt acaccattag ggacattgcc actcagttct ttcgtgagct ttgactataa 13741 ttgaggcttc gtacaattcc agttttttta taagaaggtt tacacgtcca gttgagaaaa 13801 atctggacgt ataagcatgt ttctatgagc taagcaaaac tgttttccaa aactttaact 13861 gaatttttta gagtttcgct gtatcggcga taaccaatta atcgaaaaaa ctaagatcag 13921 aatggagttt aatagggact cctaattctg acgggagtga caaaaatgag ctatcttcaa 13981 aagactgaag atagaatcaa cagttagtag ttagtcacaa aatactacta accactccct 14041 tggaggtatg tttatttact ggtttaacgg caaattaatt gagtctcgaa ctctagagtt 14101 agaaattgac gatcctgggt tactctacgg agcaactgtt tttacgacgc tgcgagttta 14161 caataactcg ctcgatagta gattaactca ctggcgctgc cactgcgatc gcctaaaatt 14221 tagtttacaa acctttggtt ggcaaatgcc agatgaggag cgattgcgtc aaggagcaga 14281 aattatcatg acgcacttcc ctgtcctcag aatcgccgtc tttcctgatg gacgtgagtg 14341 gataacggga agattgctgc caaaaaactt gacacaaaac cagaaaaatg gtttaatagc 14401 aactttagcc gaatctgagt ttgctcgcag tttaccctct cataaaacag gaaactacct 14461 cagcgcatgg ttagcaaaag ctaacgctca aaaattagat gcccaagaag tcattttggt 14521 agatactgtg ggaaattggc tagaaacaag tacgggtaac ctctgggggt ggcgaaatgg 14581 tagttggtgg acgccgccct taacagcagg aattttgccc ggagtcgtgc gcggacagct 14641 tgtagattgg cttttgaaaa agcagcaagt ggttcgagag gaaccttgga caccagagtt 14701 agtcaaggga tttgaggcaa ttgcctacag caatagtgta gtggaaaccg tcccgattca 14761 taccgttata cagcctacag gaaagctaga atataatccc catcatggct gttttcaaca 14821 
actgcggatg ttttttatag cattgtaggt gcaaaatttg gtatacgtta agataagtta 14881 acataattct taaaaagccc tctctttaca gagacgctac gggaaagcac aagagatcta 14941 ggaggattga cctgggtgaa taacaaacga tggagaaatg cggggctgta cgcactgctg 15001 tttattgttg tcattgcgct ggggacagcg ttttttgaca acaaacaacc acaaggtaga 15061 aaaacatggc gatacagtga gtttattcaa gcagttgaaa aaagcaaaat caaacctaat 15121 ggcgaaaacg acacatttgc acaagttgcg ctgagtgcag atcggtctat ggctgatgtc 15181 aagttggatg actcaagaag gtttgtagtg accttggtca acgacccaga cctgatcaac 15241 actctcactg caaaaaacgt agatattagt gttttgccgc aaaccgatga tacattttgg 15301 tttaaggcac taagtagctt atttttccct gtattacttc tggttggctt attcttcttg 15361 ctacgccgcg ctcaaaatgg tcctggtagc caagccatga actttggcaa gtccaaagcc 15421 agagtgcaaa tggaaccgca aacccaagtg acatttaatg atgttgctgg tattgaccaa 15481 gcgaagctag aactcaatga agtcgtagac tttttgaaaa acgctgaccg cttcacggct 15541 gttggagcaa aaattcccaa aggcgtactg ctggttggac ctccaggaac tggtaaaacc 15601 ctgctcgcgc gtgcagtcgc aggcgaagct ggtgttcctt tcttctccat ctctggttct 15661 gagtttgtgg aaatgttcgt gggtgtcggt gcatcccgtg tccgcgactt gttcgagcaa 15721 gcaaagacaa atgccccctg tatcgtcttc atcgatgaaa ttgacgccgt aggtcgtcag 15781 cggggtgctg gtttaggcgg tggtaacgat gaacgggaac aaaccctcaa ccagttactc 15841 acagaaatgg acggctttga aggtaacaca ggtatcatca tcattgctgc taccaaccgt 15901 cccgacgttc tagatgcagc gttgttgcgt cctggtcgct tcgaccgtca agttgttgta 15961 gaccgcccag attatgctgg acgtgtggaa attctgaaag ttcatgcccg tggtaagacc 16021 ttggcaaaag atgtggactt agaaagaatt gcccgtcgta ctcctgggtt cactggtgca 16081 gatctttcca acttgctgaa tgaagctgct attctcgctg cgcgtcgaaa tttaactgag 16141 atttcgatgg atgaaattaa cgatgcgatc gaccgcgtgt tagcaggtcc agagaagaaa 16201 gaccgagtta tgagcgaaaa gcgtaaagaa ctcgtagcat atcacgaagc aggtcacgcc 16261 cttgtcggtg ctttaatgcc agactatgac cctgtgcaga agattagcat tattcctcgt 16321 ggtagagctg gtggtttaac ttggtttacc cccagtgaag accggatgga cactggctta 16381 tacagccgcg cttatctgga aaatcagatg gctgtcgcat taggtggtcg tcttgctgaa 16441 gaaattatct ttggtgagga agaagtgacc acaggtgcgg ctcaagactt 
gcaacaagtt 16501 gctcgtgtcg cccgtcagat ggtgacacgg tttggaatga gcgatcgcct aggtccagtt 16561 gctcttggtc gtcagcaagg caatatgttc ctcggtcgag acatcatggc tgaacgcgac 16621 ttctctgaag aaactgctgc tgctattgat gaagaagtcc gcacactggt agaaacagct 16681 taccaacgct ccaaagaagt gttggaaaat aaccgccaca ttcttgacca aatcgcgcaa 16741 atgctggttg agaaggaaac agtggacgcc gacgaattgc aagaactgct ggcaaacaac 16801 gatgtgaaaa ctgcaacgtt tgcttaaaaa gtgaggagtt aagagtgagg aatgcttcaa 16861 acagtgaaca gtgaacagtg aacagtgaac agtgaacaga gaaagcgatt tattgacccc 16921 cctggtaact ggtaactggt aactggtaac tgataaaaca cttctcattc tgaactccta 16981 aattttgaga ttcagggtaa ttctggaatt ttccaaagtt gccctgattt tttgtgagaa 17041 atcttgctta gaggatcttt taaaagtggt ggctgaggta tccaaaattt tagatcctcc 17101 taaatcctcc ttaaaaagga ggactttgag aaggttattc cccccttgga aagggttccc 17161 gcgttgttgc aagagcgtag gcgtgcagga gatagcaact ggcgtgagtt agaggggttc 17221 tcaatataac ggtagacttt acaaacatcc tctaacacta gttttcttaa ttttgaattt 17281 ctcattttgc gaattttgaa ttgtttatga tgcccagatc caaaatccta accacatctt 17341 ttcatgatca aggagtttat ccctgcccag tctgtcgtgt aggtaagatt agtcatatgc 17401 cattaatgga ggctatgtct tgtgacttct gccaacaaat ttttactgcc aacgtagaag 17461 aacagcaaat taaaatgcca tccagacaac cacctctaat ttggcattgg aatggcttta 17521 attggacaga agcgcaaata gaaggtgtag aacttggatg gggctatgtg atagcagcag 17581 ttgtttttgt actcctccca actgttttaa ttgggatagt cgcttatcat tttccacccc 17641 atccagaaac tcccttatct tgggtaccat atgtctggac agcattaact tttttgttac 17701 atctagcaat tattgtttgg cttttgattg aagtttatca aattccagtt ggggcgtatt 17761 ggcgagctat acaacagcgt tttcttggtc ggtgatgatg agaaatatct actgttaagc 17821 cagatggaat cactatttgg atagcttata gcgtctgagt taacaacaga tgttggaaaa 17881 tcttaacgaa gcttgatttt caagtataga agttatataa gaatgggaaa aactataaca 17941 agcttataaa agttacgact taacaaaaaa aataacaaaa tgttaaggta tatatcattt 18001 ctcataaaat caagaaggaa ttcaaatttc aatgacgcaa agcatagggt ttgatacttg 18061 tttgtagaat gcttactata tttcctcatt cctcgaaaga ctgcgtctgt tagcatacag 18121 ttatgctgta gtaaaatgga gagtcacgat 
tgctcaagtg tggtttccag ctaagcaaat 18181 atactaataa atctcggctt ttagccaaaa tcagcaaaca gcaaaaaagt ttgacaaata 18241 atgcagccgc taaacggttg catcactcat aattaagata atttccaccg aaacgtctta 18301 tgagcgagtt cgagcagtac tatagagttt tggaattaga gcctggggca acatttgagg 18361 aagtgaccca agcttacaaa gatttggctt ttgtatggca tcctgatcgc cttccaaaag 18421 acaatagccg cttacaagag aaagcgcaaa aaaagctaca agaaattaac gaggctcgtg 18481 agcaattacg cttatcgaag atgaagtatc aaaggttaca ttattctgcc tcacactatt 18541 cagcaccatc tgcacagaaa cagccaactg agaaaaccta tcaacattca gcaccgtctg 18601 cacagaaaca gccaaccgag aaaagctatc aaccaccaca cccaaaccca gacttaagtg 18661 gaaaagacta tagtcgggca aacttacaaa acaaagattt atctggcaga aacatgagtt 18721 atgcaaactt gagtggtgca aatctcagtg atactttcat gcacaaagtg aatcttagag 18781 gtgcaaattt gtctgaggca aatttattta gagctaacct acttttagct gatctcaggg 18841 aagccaattt ccggggtgct aatttagttg gagcagattt gagtggagca gacttgcggg 18901 gagcgaactt catgggagcg cggatgaaat ctggtgacag acttcttgtt aaactggttg 18961 gagctaactt agctggggca atcatgcctg atggcaaaat ttatgaataa attaagataa 19021 tgaacgttgc gattattggt tgcggatatg ttggttgtac aattgctcgg tattggcaac 19081 aaaaaatgac ttttgtggtc accgctacca caaccactcc agaacgtgtc cctgcgctgc 19141 aagcagtagc ccaacgagtc gaagtcgtca aaggaaatga cccagaaggt ctaaaatcag 19201 tcttgaaaaa tcaagatgtt gtgcttttga gtgttggtgc aaagagtgct gatgtttacg 19261 aagaaaccta tttacatact gctcaaagtt tagtctctct cctcaaacag actcctagta 19321 tacagcaact gatatataca gggagctatg ctgtttatgg tgacagacag ggagcatggg 19381 tggatgaaga atcaccacca gcacctgcta accaaaatgg acaaattatt accgataccg 19441 agcaagtttt actatcagca tccagtgcaa acctccgtgt ttgtattttg cgattgggag 19501 gtatttacgg tccaggtcga gaattggtaa aaatatttgg tcgatatgct ggcgcaaccc 19561 gacctggcaa tggcaaggat acaacaaatt ggattcacct ggatgatatt gttgctgcta 19621 tagagtttgc tcgtaggact caactacaag gcatttataa cttagttgat gatgcccatc 19681 tcaccactgg agaattactt gaaggtgtgt ttgaaacaca caatctaccc aaagtcacat 19741 gggattcctc ccaagagagt aagcgcccat ataatgccaa ggtgtctaat aaaaagatga 19801 aagatgcagg 
atacaaattg attcatcccc agatactttt ctagctgttc ttaggtgcag 19861 tcccatgatt tatcaagaga aaacagtatc tcctacccag ttctacaatt ggaaaaatta 19921 tcgctgcgcc tatgaggttg ataacccaag tcaatcaact cctgagggta ttcccttatt 19981 gttaatacac ccgattggcg ttggattatc gcggcaattt tggcagcgtt tttccagcga 20041 atggtataaa caaggtcgtc gcaatcccat ttacaacccc gatttattgg gatgcggtga 20101 aagcgatatg cctcatgttg cttacactcc aagtgattgg gcagaacagt tgcaacagtt 20161 tttaaaaacg gtagtgcaaa aacctgtcgt tttggtcgca cagggtgctt tatttccagt 20221 tgcaattgaa ttagttcaaa aagaaccaaa tttaatcgcc ggacttgtat tagctggtcc 20281 tccagcgtgg gcagttatta cgaaaaaaac accagaatgg caagataaag tgatttggaa 20341 tctgttggat tcgccttttg gtaatacttt ttatcgttat gcccgaggtg aaaagttttt 20401 gcgtgatttc tcgactagcc aactgtttga ctcagccgat gctgttgatg // LOCUS NODE_1583_length_20347_cov_5.36787920347 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20347) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20347) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20347 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(147..1487) /locus_tag="DP116_14245" CDS complement(147..1487) /locus_tag="DP116_14245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317186.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tyrosinase family protein" /protein_id="PRJNA477356:DP116_14245" /translation="MMKIHKTIKAVIIASIVAVLVVFGTPALSHNVEHREHKQTLNKV LSDSSDTKLNAAYHNNAINSINSSNSDLAPPNRKFLISTKKTYGIRKNVVDLTQEEKQ AFVDAVRTLKHVVPEGSSISIYDQFVAVHVAAMGLMYDSAQGPAAGHDGAHESDLFLP WHREFIHRFEKALQSVNPNVTLPYWDWTDANALAVLFQDDFMGPNGQGVNLSIPGLGE VQGGPVVSGPFTKANGWVLNPNLHIKPSGEPFGDTLLRFLQVPPTNSYPIPKEDVEQI LAINDYETFRLALEGFIKLDSSSQQPTPGVFEHNYFHSFVGGATFDPAVGRPEALGTM ADLSSAINDPVFWLVHANVDRLWAEWQENGHKGSNYYPATGRHYGENLNDRLWPWDGG ESIPANWTPGDLFSLLPSFSPDDIVTPADTLNFRKYAYTYDTLKRAIVNLKSSS" gene complement(1685..2354) /locus_tag="DP116_14250" /pseudo CDS complement(1685..2354) /locus_tag="DP116_14250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206328.1" /note="frameshifted; internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="RluA family pseudouridine synthase" gene complement(2351..3046) /locus_tag="DP116_14255" CDS complement(2351..3046) /locus_tag="DP116_14255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_14255" /translation="MVTQAKSFKQKGIIYPESDGQPMAENTKQFELIVLIKKNLDLLF DNDPNVFVAGDLLWYPVEGDNLTRRAPDVMVIFGRPKGDRGSYLQWREDNIAPQVVFE ILSPSNSAKEMISKYKFYERYGVEEYYLYDPDTGELTGWLRSGNELAEIEQMIGWVSP RLNIRFELSDGELQIYHPDGQRFLNYVELAQKQEQAEARAKQAETRAEQAEAELQALR ALLQQRGINPDQV" gene complement(3257..4402) /locus_tag="DP116_14260" CDS complement(3257..4402) /locus_tag="DP116_14260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873577.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PepSY domain-containing protein" /protein_id="PRJNA477356:DP116_14260" /translation="MNYKQIRDFTFYIHRYLGLIVGLLLIIIGLTGTLLVFQREIDQY LVSQQFGQVIPQGQRVPIESVVKTVKTAYASQPELKLLSINTLPDAHLPYRVWLEAPG EKRTQVFVNPYTGVIMGSRQWERTLIGFTFKLHYELLAGKIGEVIVGIAAFFLFLLSI TGFILWPGWRKLILGFKIKWNAHPKRTNFDIHKVAGIIAAVFLGMISFTGVCWNFWDF SQPVIHAATFTPIPPTPVSQPIHGKSPLGLGEILKKADAALPGAVTTYISLPQTPEGV FRVGKKQPQEASEYGYSQVYLDQYTGKVVLLKNGLQPSRADRVFNSFSPMHYGTFGGL PTRILYIFVGLTPLILFITGFVMWWYRYKGKNSGRESIKMIEPSVKS" gene complement(4440..5051) /locus_tag="DP116_14265" CDS complement(4440..5051) /locus_tag="DP116_14265" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_926920.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14265" /translation="MHSYLLQSWATLPIVRSKDENRAKAESSAIDKFCLDVSNNSCLE SDGQSSSGIEFDIAGEILPGWKIIASYAYTDATITKENTFSVDNRLNNMPRHTASLWT TYTLQNGGLKGLGFGGGIFYIGDTHDCVCPLGNRAGDLANTFEVPSYTRVDAAVYYET GNFRAALNFKNLSNIRYFEGTQGRTEVQPGAPFTVQGTISWEF" gene complement(5324..6610) /locus_tag="DP116_14270" CDS complement(5324..6610) /locus_tag="DP116_14270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314308.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cysteine desulfurase-like protein" /protein_id="PRJNA477356:DP116_14270" /translation="MVDLDLGWIRNQFPALSQEINGQPVIFFDGPGGTQVPKSVIDAI AQYLVTSNANAHGAFATSQRTDALITSARTAMADLLGCSSDEVVFGANMTTLTYSLSR AIAHTGDSLRAIARELQSGDEIIVTKLDHYANVSPWFALSERGVVIREVEINPEDCTL DINHLKQQINERTRLVAVGYASNAVGTINDIATIVQLAHAVGAMVFVDAVHYVAHAPI DVRVLDCDFLACSAYKFFGPHVGILYAKREHLARLRPYKVQPAPDEIPSRWENGTQNY EGLAGLVAAINYLAELGHRVLPGVQNRREALLAAMIAIKQYERDLCQKLVTGLQQIPD ITLYGITDTTRFDWRTPTVGIRLAGRTPHAVAKALGEQGIFTWNGNFYALGLTKELGV ESSGGLLRIGLVHYNSVEEIHQLLKALIKIVPSHES" gene complement(6991..7668) /locus_tag="DP116_14275" CDS complement(6991..7668) /locus_tag="DP116_14275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746311.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14275" /translation="MLELQRKDFNCLTSYAIENISSINVDKLKPFFSNLPADPYLAGN YRFRRLSHFQISGRSIIKLPHRPLYQPKQYNPLLGDVVREYAELDDELIKLADFQRMI LEFFEFCELCSTFKEIAVHQIRTTASPEQIGKPAPEGIHRDGVDVIGIFSVNRERIEG GETHLYKSKNDSPVLNKILNPGEMLVFGDGEFFHFTSVINAISKIGGVRDVFVLTCPG LGSPNDK" gene complement(8210..8704) /locus_tag="DP116_14280" CDS complement(8210..8704) /locus_tag="DP116_14280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318849.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Ureidoglycolate hydrolase" /protein_id="PRJNA477356:DP116_14280" /translation="MTTSKTVQQLQAQWVTPENFRRYGQVIFPSEHGKSFDAEDARLV LDKGTPRFYIMRLHRRGRRFHTITRHVQCTQCLGSLEGKDWFIAVCPPNNNIDQPSLE DIAAFRIPGNCFIKLEVGTWHAGPYFEHEVVDFYNLELSDTNVVDHFTHNFLNSHSLE FEIL" gene complement(8789..9232) /locus_tag="DP116_14285" CDS complement(8789..9232) /locus_tag="DP116_14285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007356509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14285" /translation="MKYNLKETFRVILAVSMLIAGITHFTSADQYVRIVPPQLPYPLE IVYLSGFYEILGGIGLLVPPVSQATAWGLIALFIAVFPANINMAVNLIPIDNIPNSPW VHVIRLPFQAVLIAWAWLYTQPSDLEKQASIIPKSLIPKKLLKLE" gene 9754..10371 /locus_tag="DP116_14290" CDS 9754..10371 /locus_tag="DP116_14290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874450.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_14290" /translation="MDTDELLIRYAAGNRDFLQVNLQQANLSEFALAGINLSRSDLTG ANLSGASLRGANLSEAVVAEATLWRANLTEAVLIWANFSKACLIRATLTQVDLHKAVL TKADLRLANLRYADLSYTNLEGADLRYADLTGANLAGANLSKANLTGANLSQADLYNA NFTKAILSRTNLRYADLSSINLDGVDLRDAQVSGTSLQPLVNTHQ" gene 10513..11271 /locus_tag="DP116_14295" /pseudo CDS 10513..11271 /locus_tag="DP116_14295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874451.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 11695..12657 /locus_tag="DP116_14300" CDS 11695..12657 /locus_tag="DP116_14300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015217460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Red carotenoid-binding protein" /protein_id="PRJNA477356:DP116_14300" /translation="MVYSIESAQSIFSSTQVPSPIPATIALFEQLNVDDKLALLWFAY TEMGRSITPAAVGAARLQFAEGLLNQIKQMSQADQLQVMRDLANRADTPISRSYANFS VNTKLAFWYELGEFMKQGIVAPVPSGYEMSRGVQIVLDAIKQLDAGQQITVLRNTVVD MGFGDVVVSSSPQVDAEPLFPRTEPAPTKITVKGITEPTVLSYIEALNKDDFDAAIAL FTPDGALQPPFQKPIVGYEAIKRYMRAEAQGLNILPQEGISEELQNGSKQVKVTGIVQ TPWFGVNVGMNISWRFLLNSEGKIFFVAIDMLASPAELLNLRVR" gene 13115..13363 /locus_tag="DP116_14305" CDS 13115..13363 /locus_tag="DP116_14305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742572.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14305" /translation="MDSSTLRHLWSVIEETQTSILLNFSDTELVKQLIRLLENRKLLN DEEISIVSAYIRSRIPLIRDLAFARSPINGQWVMSMAG" gene complement(13376..14287) /locus_tag="DP116_14310" CDS complement(13376..14287) /locus_tag="DP116_14310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874276.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Clp protease" /protein_id="PRJNA477356:DP116_14310" /translation="MVNQETARYADMFAALGSEPRLEIMRLLFAAYPEGMTVGEIQEK LKIPNSTLSHHLEKLRNEELVKSRKDKQFLWYSANAETMEDLLTFLTTDRRRGEGDRS AAPMATPGLRRTEHRVSNEHTLERTNKTPIQEGFMFEKFFESIFHKLSGSFSDRFHLK GFERFTQKAVNAINLAQGESRRLGHNFVGTEQILLGLLGEGSGIGWQFLNSVGVNLEN AQIEVEKIIGRGKGDTAIDIPFTPRAKQVLELAVEDARCLNVNYVGTEHLLLGILHEG GGVAIRVLQSLGVDLISLEQRLRRALT" gene complement(14384..15373) /locus_tag="DP116_14315" CDS complement(14384..15373) /locus_tag="DP116_14315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aspartate carbamoyltransferase" /protein_id="PRJNA477356:DP116_14315" /translation="MSNNTWSRHHILSLADFTTDEYDTVLQTAASFQEVLSRRTKKVP TLQGQVVANLFFEPSTRTRSSFELAAKRLSADTLNFASGTSSMTKGETILDTAKTYLA MGTDIMVIRHREAGVPDAIAQEMDRLGVRVSVLNAGDGQHEHPSQGLLDLFTICSLLD PDLPRVELLKEKKIAIVGDILHSRVARSNIWSLTASGADVHLAAPPTLLPKLFADYCE NTPGKVFLHWELEPALQDADFVMTLRLQKERMTAHLLPSLREYHQMFGMTHSRLQLCK PNVKLLHPGPVNRGVEISSELMDDPEFSLIQAQVTSGVAVRMALLYLIGSGKA" gene 15677..15778 /locus_tag="DP116_14320" /pseudo CDS 15677..15778 /locus_tag="DP116_14320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874044.1" /note="internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 15950..16885 /locus_tag="DP116_14325" CDS 15950..16885 /locus_tag="DP116_14325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871735.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_14325" /translation="MLLDDLRLSEPAIETAVDYKPLTVTPETLLVDVITLMNQKRIHS CSLSRVNSPLNGLTVHPTGSSCVLVIKNTKVLGIFTERDIVRLTADEVDFRKVTIAQV MTQPVMTLEQATFQDVFAALFLFRRYRIRHLPIVGQKGELIGVISHESIRRILRPVHL LKFMRVADVMTTQVIRASMSASVLTLAQVMAEFRISCVVITEDQPPLDLGQFVHIPIG IVTEEDIVQFQAMEVNLSEVLAQTVMSTQLFVLNPEDSLWTALVEMQRQQTRRLVVSW DRGIGLGIVTQTSLLRVLDPVQMYGVIQTLQQTVK" gene complement(16889..17044) /locus_tag="DP116_14330" CDS complement(16889..17044) /locus_tag="DP116_14330" /inference="COORDINATES: protein motif:HMM:PF11208.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14330" /translation="MSTKAQQVLKLEFENLKRERNEISKAVREEEKEEKRLLARQKAK QRHRGKA" gene 17194..18063 /locus_tag="DP116_14335" CDS 17194..18063 /locus_tag="DP116_14335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858756.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CAP domain-containing protein" /protein_id="PRJNA477356:DP116_14335" /translation="MESNNATFEQQVFELTNQERAKNNLPPLKANAELNYAADKYAQE MSQRGILSHTSPDGSQAWDRAKVVGYSAQMMAENIAAGQTTPQQVVQDWMNSPGHRAN ILKPEYTEIGTGFSNNYWVQDFGSGDTNPMSYIPNSTSNSTIASTSTPTPQSASMPTA TLDSVFPSNSVLASVPASTPVAEPTVEAASNTDTGSGSGKVIDNPVNDSLLLGGFNNN TPDYASGHDTFMLGQVENFDWIGNLQFSQGIPIIENSRILEILPGSEINNNIPIGDRN IETLIAQGQNTFS" gene complement(18135..18326) /locus_tag="DP116_14340" CDS complement(18135..18326) /locus_tag="DP116_14340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859071.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14340" /translation="MQEDKLNNPETKTNNSADDEKSHIEAASKFIQETSIEKMHEIAE AARLKKLMEPPKWVYPIDD" gene complement(18397..18990) /locus_tag="DP116_14345" CDS complement(18397..18990) /locus_tag="DP116_14345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195969.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_14345" /translation="MEILTLNVSPAVGLTDEQFYQLCIANEPWQLELTQTGELIIIPP TGGESGIRNSDINMQVGLWNRQTKLGKVFDSSTEFKLPSGAYRCPDASWVKLERWEAL TKDEKRRFPPICPDFVIELRSESDALDKLRTKMREYQENGASLGWLIDPQTPLVEIYR PEQDVEVLNFPFDNPPQLSGEEVLPGFVLDLTIILNP" gene 19246..20169 /locus_tag="DP116_14350" CDS 19246..20169 /locus_tag="DP116_14350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="iron permease FTR1" /protein_id="PRJNA477356:DP116_14350" /translation="MNWSAALPTFFITLREGVEAALVVGIVFACLQQAEQQRLHRWVY LGVISGTLASFVVGLLLNLGIQGLQTSDLLYAPVFKQLFEVGLGVIAIAMLSWMLIWM TRQARFLKAEIEGSVKSALVEDNRAAWAIFSLIFIAVFREGLETALFIVAKFQEGWTP VLGAIGGLSMAVLIGMLLFKWSIRINLSKFFQVMGVFLLLIVSGLVISALRHLDAAAI ALSQINSSVADACGQGNTSCLLGPQVWDLSGILPDRQFPGILLKTLFGYTQKLYFVQA VGYLLFWVVVGSLYFRSLSQPTQLKPATKLN" BASE COUNT 5668 a 4404 c 4351 g 5924 t ORIGIN 1 aggcatgtgg cgtccctggg ctaaagccgc agggttttca tctcactctt tataacctat 61 attgcaccat tcaaaagtcc agaatcactc tggaccctag acgcttgaca acactcacaa 121 ttatgctgtg atttttacct cacagtttat gaacttgatt tcagattgac aattgctctt 181 ttaagagtgt cataggtgta ggcatacttt ctaaaattca gagtgtctgc aggtgtgaca 241 atatcatcag gtgagaaact aggtagtaaa gagaacaaat ctcctggagt ccaattcgct 301 ggaatggact ctcctccatc ccaaggccac aatcggtcgt tgagattctc tccataatgt 361 ctacctgtgg caggataata attgctaccc ttgtgcccat tttcctgcca ttcggcccaa 421 agacgatcca cattggcatg aaccagccaa aatacgggat cgttgatagc actggaaagg 481 tcagccattg tacctagagc ctcgggacga cctacagcag ggtcaaaagt ggctccacca 541 acgaagctat gaaaatagtt atgctcaaag actcctggcg ttggttgttg agaggaacta 601 tcaagtttga taaatccttc tagagctaag cgaaaagttt catagtcatt aatagcaagg 661 atttgctcca catcttcttt aggaataggg taactatttg tgggaggcac ttgcaaaaag 721 cgtaagagtg tgtcgccaaa tggttctcct gatggcttga tatgtaagtt tgggtttaga 781 acccatccat tggcttttgt aaaaggacca gacacaacag gtcctccttg aacttcaccc 841 aaacccggaa tgctgaggtt gactccttga ccatttggcc ccataaagtc atcttgaaag 901 agaactgcca acgcattagc gtctgtccaa tcccaataag gaagcgtgac attggggttt 961 actgattgca aagctttttc aaatcggtgt ataaactcac gatgccaggg aaggaataaa 1021 tcactttcat gggcaccatc atgtccagct gctggacctt gtgcactgtc gtacattaaa 1081 cccattgccg ctacatgcac tgcaacaaac tggtcgtaga tactgatact gctaccttct 1141 ggaactacgt gttttaatgt ccgcacagca tcaacgaatg cttgtttctc ttcttgggta 1201 aggtcaacaa cgttctttct aattccataa gttttctttg tgctaataag aaattttctg 1261 tttggaggag caagatcaga gttactgcta ttgattgaat 
taattgcatt attgtgataa 1321 gctgcgttca gcttggtgtc tgaactatcg ctgagcacct tatttaatgt ttgcttatgc 1381 tcacgatgct caacattatg agaaagtgct ggagttccga aaacaactaa aacagcaact 1441 atactagcaa taatgacagc tttaatagtt ttgtggattt tcatcatcta cgcttgttgt 1501 ggttcctaaa gatttcaaga ttttttgagg actctactga gagcaatgat tacttggatt 1561 agtttgtata taatttgaca gtaaaaagaa tcactcacaa tagaaaatct ttaggaaaat 1621 ccctctattc tttcgcaacc agcgcaaaag tcgtcatttt ttgcagtatt gctgtgtttc 1681 aacctagttt gacccgatca actggatgac gaccgatggg taaatttatc gtaccgcttt 1741 ctgttttggg tacaccgtag ataatgccga ggtattctcg ccgtgcagtt tttgctttga 1801 gttgagcttg tatatgttga tgggcgtgtt ctgttttggc tattgcgatc gcgaagcggc 1861 tcctccggag catcgcccct gttgtatcct tatctaatcg atggacaatt cccggacgtt 1921 gaactccctc aattcctggt aggttagggc agtgagctaa aatagcgttg actaaagtcc 1981 cgtcttgatg acctggtgca ggatggacaa ctaacaagag cgcttttgtt gatgataagt 2041 agagagtcgt cttcgtaaag aatgtcgaga ggaatatttt ctgcttggag ttctagaggt 2101 tcaggttctg gtattgttat ggaaatgcga tcgcctactt taagcgctgt tttctttgat 2161 gtgcaaacat tgccattgag ttggactaac ccttgttcta tgagttgctg aatgcgcgat 2221 aagccggagg cttgacgcta cgcgtatcgc gagaaatttg ataattcttg ggagaggtag 2281 cgatctaagc gttcacccga aacattggct tcgcgatcgc cgttttgttt gacttctaaa 2341 tgatattcgt tcacacttgg tctggattaa ttcctcgttg ttgaagtaaa gcacggagtg 2401 cttgtagttc tgcttcagct tgttctgccc tagtttcagc ttgttttgcc ctagcttcag 2461 cttgttcttg tttttgagcc agttctacat aattcagaaa ccgctgtcca tcaggatgat 2521 aaatctgcag ttcgccatca gacaattcaa agcgtatgtt tagacgggga ctaacccaac 2581 ctatcatttg ctcaatttct gctaattcat taccagaccg tagccagcca gtcaactcac 2641 ctgtgtctgg atcatacaag tagtattctt ccacaccata gcgctcataa aatttgtatt 2701 tactaatcat ctctttggcg ctattactgg gggaaaggat ttcaaacacc acttgtggag 2761 caatattgtc ttctcgccac tgtagataag aacctcggtc tccttttggt ctgccaaaga 2821 taaccataac atcaggtgct ctgcgagtga ggttatcgcc ctcaacggga taccacagta 2881 agtctccagc aacgaacaca ttggggtcat tgtcaaacag cagatccaag ttttttttaa 2941 tgagtacaat caattcaaat tgctttgtat tctctgccat tggctgtccg 
tcgctttcgg 3001 ggtaaatgat acctttttgc ttgaaagatt tagcttgagt aaccattgca atcagtccta 3061 tttttgcaaa gtggcattgt tcattttaac aattatcaaa actccataaa catcaaacca 3121 agagcgactt tataccattt caccaggatt gagtaggtct gactcctgac tcaggaattg 3181 ctgctgtatt ctacctactt cagcttttat aagtagtaga atgggcgatc gcgccaccac 3241 agatcgccgt cattctttaa cttttcactg aaggctcaat catttttatt gactcccgtc 3301 cggaattttt ccctttgtag cgataccacc acatcacaaa gccagtgata aataaaatca 3361 gcggtgtaag tccaacaaat atatagagaa tgcgggtagg taaaccacca aaagtgccat 3421 agtgcattgg tgaaaacgag ttgaaaactc gatcagctcg tgatggctgc aagccatttt 3481 ttagtagcac cactttgcct gtatactggt cgagataaac ctgagagtaa ccatactcgg 3541 atgcctcttg aggctgtttt ttgcccactc taaaaactcc ttctggggtt tgcggtaggc 3601 tgatgtaggt agtaacagca cctggtagag cagcatcagc ttttttaaga atttcgccta 3661 gtcccaaggg tgacttgccg tgtataggtt gagaaactgg tgtgggtggg atgggagtga 3721 aggttgcagc gtgaatgacg ggttgggaaa aatcccaaaa attccagcac acgcctgtga 3781 aggaaatcat gccaaggaaa acagctgcaa tgatacctgc cactttgtga atatcaaagt 3841 tagttcgctt gggatgtgca ttccacttga tcttgaagcc aagaatcagt ttgcgccaac 3901 cgggccataa aataaatcct gtaatgctaa gaaggaaaag gaaaaacgcg gcaattccaa 3961 cgatgacttc gccgattttg ccagcaagaa gttcatagtg gagtttgaag gtgaacccaa 4021 tcaatgtacg ctcccactga cgagaaccca taattacacc cgtgtaggga ttgacaaata 4081 cttgcgtccg tttttcacct ggggcttcca gccagacgcg ataggggagg tgggcatctg 4141 gaagtgtgtt gatgcttaat agtttcagct caggttggga ggcgtaggcg gttttgacgg 4201 tttttactac ggactcaata gggacgcgtt gtccttgagg aatgacttgc ccaaattgtt 4261 gactcactag gtattggtcg atttcccgct gaaagactag taaggtgcca gttaagccga 4321 tgatgattag cagtagtcct acgattaagc cgagatagcg gtgtatgtaa aatgtaaagt 4381 cgcggatttg tttataattc atggtggatg ctcctaaatt aaatgtcgtg tgattgcaat 4441 caaaactccc aagaaatcgt tccctgcaca gtaaaagggg caccaggctg aacctccgtg 4501 cgaccttgtg tcccttcaaa gtaccgaata ttggagagat tcttgaagtt gagcgcagct 4561 ctgaagtttc ctgtttcgta gtaaacggca gcatctacac gggtgtagct agggacttca 4621 aaagtgttgg ctaaatcccc agcgcgattg cccaaagggc agacgcaatc atgcgtatcg 
4681 ccaatgtaaa aaattccccc tccaaagccc aatcctttca acccaccatt ttgtagcgtg 4741 tacgttgtcc acaaacttgc agtgtgcctt ggcatattat tgagacgatt gtctacagaa 4801 aacgtgttct ctttggtgat agtagcgtcg gtgtaggcgt aagaagcgat aatcttccaa 4861 cccgggagaa tttcgcctgc aatgtcgaat tcaatgccgc ttgagctttg accatctgac 4921 tccaagcagg agttgttact aacatccaaa cagaacttgt caatagcaga gctttcagct 4981 tttgctcgat tctcatcttt gctccttaca atcggtagtg tcgcccagct ttggagtaga 5041 taactgtgca gctaattgca atttattggc atagctaaca gacttacttg agaaaactct 5101 acaggaggta gcgaatcaat tgcaaatatt tgtcatagtt tcgtaatgtt ttgtgtgcag 5161 caagtgctaa cacttttgtg tgagaggtga actgaaaaca actgcacaga caagcgtatg 5221 gattatttgt tgagaaatcg ctctgaccgt gatttgttac gcatgttagg ggcacagctg 5281 gctcgtgccc ctacggtgga tacaactgat gaattgctat atctcaggac tcatgacttg 5341 gcactatctt gataagtgct ttcaacaatt ggtgaatctc ttcaacgcta ttgtaatgta 5401 ccagcccaat tcgcaaaagt cctccgcttg actcaactcc taattctttt gtaagaccga 5461 gtgcatagaa gttaccgttc caggtaaaga taccttgctc acccaatgct ttggcaacag 5521 cgtgaggtgt tcgcccagca agtcgtatgc caacagttgg tgtccgccag tcaaagcggg 5581 ttgtatcagt aatgccatat aaagttatat caggaatttg ttgcaatccc gtgaccaatt 5641 tttggcacaa atctctttcg tattgcttaa ttgctatcat cgctgcgagt aaggcttctc 5701 ggcggttctg aacacccggt aagactcggt gacctagttc agcaagatag ttgattgctg 5761 ctactaaccc cgctagccct tcgtagttct gggttccatt ttcccaacgt gacggaattt 5821 catctggggc tggttgcacc ttataaggac gcaatcgagc aagatgctcg cgctttgcat 5881 ataatatccc aacatgcggt ccaaagaatt tataagcaga gcaagcgaga aagtcgcagt 5941 caaggacacg tacgtcgatg ggagcgtgag caacgtagtg aaccgcatcc acaaaaacca 6001 tcgccccaac agcatgagct aattgcacaa ttgttgcaat atcgttaatt gtgcccacag 6061 cgttggatgc ataccctacc gccactaatc gcgtccgttc atttatctgc tgcttgagat 6121 ggtttatgtc tagggtgcag tcttctggat tgatttcgac ctcacgaatg actacaccgc 6181 gttcagaaag cgcaaaccaa ggtgaaacgt tagcatagtg gtctagcttg gtgacaatga 6241 tttcatcacc gctttggagt tcgcgagcga tcgccctgag tgagtctccc gtatgggcga 6301 tcgcccgact caagctataa gtcagcgtag tcatattggc accaaagacc acttcgtcag 6361 
aactgcagcc caataaatca gccatagctg tacgagctga agttatgagc gcatctgtgc 6421 gctgacttgt ggcaaaagcc ccgtgtgcat tggcgtttga ggtgactagg tattgggcga 6481 tcgcatctat aactgacttt ggtacctgag tcccaccagg tccatcaaaa aaaatcacag 6541 gttgaccatt aatttcttgt gaaagtgctg gaaactgatt gcgtatccat cctaagtcaa 6601 ggtcaaccat aaaaaactcc tttatttgtg attttacttt cctgttgatt gagatcttgc 6661 accttaccgg tgagtacgcc ttgaacttaa gtaccctgcg ggaagccctc cgggttcggg 6721 ggtaactcct tcggagacgc tgcgcgttcg cccttggcgt gcgcttgcgc ttacgcaatc 6781 gacgggaacc gccaagactg cgaccccctc accgcctccg gcgtctacaa ggctaataga 6841 ttaagtccgt taaaacggac tgggtaaatc ttttagtccg ttttaacgga cttgaacttt 6901 gagccaagaa atttatttct tggcggacaa aaaagcagat ccaagatctc agttaagtca 6961 gttcaagctc tggatgctga atttggtttt ttacttatcg tttggcgaac ccaatcctgg 7021 acaagtcaac acaaacacat ctctcacccc acctatctta gagattgcat tgataacaga 7081 tgtaaaatga aaaaattccc cgtcgccaaa gactaacatt tctccggggt tgagaatttt 7141 gttcaaaact gggctgtcat tcttagattt gtataaatga gtttctcccc cttcaattct 7201 ttctctgttc acagaaaata taccaattac atctactcca tctctatgga ttccttcagg 7261 agcaggtttg ccaatttgct ctggggaagc ggtggttcta atttgatgaa ctgcaatttc 7321 tttaaaagtt gagcagagtt cacaaaactc gaaaaattct aaaatcattc tttgaaaatc 7381 tgccaatttt atcagttcat catctagttc tgcatactcc ctgacgacat cacccaacaa 7441 aggattatat tgtttgggtt gatataaagg acgatgtggc agcttgatta tactcctgcc 7501 agaaatttga aaatgagata aacgccggaa acgataattg cctgctaagt atggatctgc 7561 aggaaggttg ctaaagaatg gcttgagctt atctacattt atagaagaga tattttctat 7621 agcgtagctg gtaagacagt taaagtcttt tctttgtaat tccaacattt tcctattctc 7681 ctcagattga ccattactag ccaaatggac acatgaccaa atttgggttg gctatattgt 7741 caaacccaat atgaaaaatt tcagtcaatt tatacactct tcagctgtca ataaaaaact 7801 tccacacgaa aagcatcctt gagcatagct caatttttca tgacagctac gaggctcaaa 7861 ttattgatca tcatcaggca aagttaattt tttgtttttt gagacccgat aatttttaga 7921 ttttttaacc tagttatttg tcgttttatt tagattttgg tgcttttgtc ttacacctta 7981 tttctttaat tataagcaag gaaaaactga cctgcaatat tgtatcaata ttgtattgat 8041 ttagtacctg 
tactgctgta acaggaagtt gtgttcaaaa atccaagacg ggagccactg 8101 cgttggtgag cgcgagtgct cctccgtaag ggcaaagccc gcgctaggag taccacaatt 8161 tttcggattt ctctgacaaa cttcgcttgc ttgcaagcac ttggaagtac tacaaaatct 8221 caaactctaa actgtgactg ttaagaaaat tgtgagtgaa atggtctacc acattcgtat 8281 cactcaattc caaattataa aaatccacaa cttcatgctc aaaatacggt ccagcgtgcc 8341 aagttcctac ctctaatttg ataaagcaat ttcctggaat gcggaaagct gcgatatctt 8401 ctagagatgg ttgatcaata ttattgttag gaggacaaac cgcaatgaac cagtccttcc 8461 cttccaacga acccaaacac tgagtacatt gcacatggcg agtgatggtg tgaaatctgc 8521 gccctctgcg gtgcagtctc ataatgtaaa agcggggagt acctttatca agtactaatc 8581 tcgcatcttc tgcatcaaag gatttgccat gttcactggg aaaaatcact tgtccgtatc 8641 gacgaaaatt ttccggcgtt acccattggg cttgtagttg ttgcacggtc tttgatgtcg 8701 tcatatccaa tccgcttgta ccctgattat tttaactctg ttccctctcc agttaagaga 8761 gggcaaggga gaggtttcta taaatggctt actccaattt cagcagtttc ttgggaatga 8821 gtgacttagg aataatagaa gcttgtttct ctaaatctga aggttgtgtg tacaaccaag 8881 cccaagcaat taaaaccgcc tgaaaaggaa gtctgatcac atgtacccaa ggtgaattgg 8941 gtatattgtc aatcggaatc agattaaccg ccatgttgat atttgccgga aaaacagcaa 9001 taaaaagagc aataagcccc caagccgtag cttgactgac aggcggaact aataacccaa 9061 taccgcccaa gatttcataa aagccactga gataaacaat ttccagaggg taggggagtt 9121 gcggcggcac aattctgaca tactggtcag cagatgtaaa gtgtgttatc ccagcgatca 9181 gcatagatac cgcaaggatg acacgaaaag tttctttgag attatatttc atctgtttcc 9241 gtagttgcgt tcataccttc agtttgttta ccaaatctca ctccgtcgtc agccatgagt 9301 gggaattagg gagtggaaaa atgagggcac aaggggacaa tgaaccaata gtgatccagt 9361 cttaagcagt cttgatcagt tctagaacca aaaattctta gcagataatt ttcatgttat 9421 tgctttttgc tcattatttg ttacaattat ttatataaat ttacaaaaag acaaaaccat 9481 gcgttacacc gctgatgaca gaggcattct taacaactac gcggttgaat ctgctattta 9541 tttggcagag tatccaaccc ttgaacaaca gaagcgctaa gcttttggtg gagtgctggc 9601 gattttgcta gttttcgcgc ttaattgtta atgacctttg ctatcagcta aactgctcaa 9661 cccttaggtt ttatggttgc tcagttgtgt cacacatcac ttgtctgtca gtaataagaa 9721 attccggggt ttgagcctca 
actttacatg atcatggata ctgatgaact tcttatacgc 9781 tatgccgcag gtaacagaga ttttttgcag gtgaacttgc agcaagccaa cctcagcgaa 9841 ttcgctttag caggaatcaa cttgagtcgc tcagacttga ctggggcaaa tctatcaggc 9901 gcttctctac gtggagccaa tttgagtgaa gctgttgttg cggaagcaac cctgtggaga 9961 gccaatctca ctgaagctgt cctgatttgg gcaaatttca gtaaagcttg tttaatccgg 10021 gcaaccctga ctcaagtgga tttacacaag gctgtcctaa caaaggcaga cttgcgctta 10081 gccaatctgc gctacgctga cctcagctat acaaatttgg agggtgcgga tctgcgctat 10141 gctgacctca ctggtgctaa cttagcgggt gcgaatttga gcaaggcaaa cttgacaggt 10201 gcaaacttga gtcaggcaga tttgtataac gccaacttca ctaaagccat tttgtcaaga 10261 accaacctgc gctacgctga ccttagcagc atcaacctgg atggggttga tctgcgtgat 10321 gcacaggtga gtggtacttc tttacaacca ttagtaaata ctcatcagta aacagtgata 10381 actgaacaac tgataactct gattatggaa gattgtgttt ggaggagtga actgtttcta 10441 accctgttta cagctcctca atttcaactg gaaattagct gaaaagggga attgacagct 10501 tatcggtctt ctatgaaggg agaaacttgg tcgttggcaa atcatccgct gtttcaagct 10561 gctttgactg gtgatcgagc gatcgccatc gccgacacca cacaagactt aggcatacaa 10621 acagatcccg atttaagtgc tcaattccaa cgcgcttcaa ttcggtcgtg cctcttggtg 10681 ccaattcgct atcatcaaac ttggttagga atattagaat tgcatcactg tggtgactct 10741 gcttatgttt ggagtgactc tgagcgagtg ttggtagagg cgatcgccac ccaagtagca 10801 gccgccttaa tccaagctca agcttacacc aacttggaaa cccttaacca gcaactggca 10861 gctttagagc gtacacaaag caacctgatt gcgattgtag gacatgaatt gcggactcct 10921 ctatctacta tccaagtttg tttagagagt ttagccagtg aacctgaaat gccattagag 10981 tttcagcgct caatgctgga aacagcatta ggtgattccg aacggatgcg ttcatttttg 11041 ttaaacaaat agattcaagt tttgtatctc atatagcgct tctcgtttgg atcaaataca 11101 caacgatatg tagaactttc ggatgcaacg tctctaccgt attgcacgca attgaaaacc 11161 gctataaatc atcagtcggt ttttgcaata gttctgaaga cagtacatag cacaattgat 11221 gaacctttta tatataaaag tctgttaaat gactctacca aaaaagaata atcttatatc 11281 atgttaaaaa actttacatt agttaacttt tttgttacat taatttacgt tagagagaaa 11341 tctatgtgaa gaaagttttc gtgaatacac acagctacgt cattaatgac taaggtgggc 11401 
atcactgctt tcctctcatt aactgatgta cttgtaagaa aactcacggg ctagcttcac 11461 ctttgggtaa ggcagttgcg cgactgtccc aaaaccctag ctgagcactg ctcaccataa 11521 gacagaaaaa ttcaatgttc acaagcgcca agattggagt tgtatcgctc tagaagaaag 11581 ctggattcta actgaacaca gcttaatagg gactactgca agtctgtctt agcttaatat 11641 ccgaaaaagc aaattttgag taattaaata caaatagacc tggagattct gacgatggtt 11701 tattctatcg agtccgctca aagcatcttt tctagcactc aagttcctag tccgattcca 11761 gccacgatag cactgttcga gcaacttaac gttgacgata aactggcatt actctggttt 11821 gcctacactg aaatgggtcg ttccatcact cctgcggctg taggagcagc acgcctacag 11881 tttgcggaag gtttactaaa ccaaattaag cagatgtctc aagcagatca attgcaagtc 11941 atgcgcgatc ttgctaaccg tgctgacact cccattagtc gttcctatgc aaacttcagt 12001 gtcaacacta aactggcttt ctggtatgag ttaggagagt tcatgaaaca gggcattgtt 12061 gctcctgttc cctctggcta tgaaatgtct agaggtgtcc aaatcgtgct agacgcaatc 12121 aagcaactcg atgccggtca gcaaatcaca gtacttcgta atactgtggt agacatggga 12181 ttcggggatg ttgttgtttc tagtagtccc caagtagatg ctgaacccct gtttccacgt 12241 accgagccag ctcctactaa aattactgtt aagggcatca cagagcctac tgtattaagt 12301 tacatagaag ctctgaacaa agatgacttt gacgctgcta ttgctctatt tactcccgac 12361 ggtgccctgc aaccaccatt ccagaaaccg attgttggtt atgaggcgat caagagatat 12421 atgcgtgcag aagcacaggg actcaatatc ctgccgcagg aaggtatatc cgaagagcta 12481 caaaatggct ctaagcaagt taaagtcaca ggtatagtac aaactccttg gtttggtgtc 12541 aacgttggta tgaatatcag ttggcgcttc ttactcaatt ccgaaggcaa aattttcttc 12601 gtggcaattg atatgttggc atctcccgca gagctgctaa acctgcgtgt aagatagaac 12661 gcttcatcaa ttcgtacagt gccgtgtaga gacgttgcat gcgtagagac gttgcatgca 12721 acgtctctac attggctcac caagcgtatt gagatctaag attctcatct aattacagca 12781 gttgccaggt agattaggac atgaactaat gagaaaatac ggacgccacc agaccatcaa 12841 gcccgttaag agttcgctgt tccctgctat aatttgtgta aaatttaaga ttttctttag 12901 attttagatg tcaggtgggc attgcaatgc cagccctaca ccattcgtgt atttgactca 12961 aatgagaagc gctatatacg attagattgt catctaatta atttatgtaa aatttaagat 13021 ttactttaga ttttaggttt aagattgttt ttagattctt gataacacta 
atgtgtttga 13081 ctaagaacga gtctttagaa aggaggttta tccgatggac tcttcaacgt tacgtcatct 13141 atggtctgtg attgaagaaa cccagaccag tattcttcta aacttcagtg ataccgaact 13201 agtcaagcag ttaattagac tgcttgaaaa cagaaaacta cttaacgacg aagaaatcag 13261 tattgtgagt gcttatattc gttccagaat cccactcatc cgcgacctcg cttttgcgcg 13321 ttcaccaata aatggacaat gggttatgag catggctggc tgaaaacctc ttatgttagg 13381 tcaaagccct tcgcagtcgc tgttctaaag aaatcaaatc taccccaaga ctttgcaata 13441 cccgaattgc gactccccct ccttcatgga gaatgcctaa tagcaaatgc tcagtaccaa 13501 cgtagttaac attaagacat cgggcgtcct ctaccgctag ttctaacacc tgtttagctc 13561 tgggagtgaa gggaatatct atagctgtat cacctttccc tctgccgata attttttcga 13621 cttcaatctg tgcattttcc aaattcaccc ccacagaatt aagaaactgc cagcctatac 13681 cactcccttc accaagaagt cccagtaaga tttgttctgt cccaacaaaa ttatgcccca 13741 aacggcgcga ttcaccctga gccaaattaa tcgcgttaac tgccttttgg gtaaaccttt 13801 caaaaccttt gagatgaaat ctgtcagaaa aagaccctga cagtttgtga aaaatagact 13861 caaaaaattt ttcaaacata aacccctcct gaataggtgt cttgttagtg cgttcaaggg 13921 tgtgttcgtt ggagacgcga tgctccgttc gccgaaggcc tggcgtagcc ataggagccg 13981 ctgagcgatc gccttcacct ctccttctgt ccgtcgtcaa aaaagtaagc aaatcctcca 14041 tcgtctcagc atttgctgag taccacagaa attgcttatc tttgcggctc ttcactaact 14101 cttcgttccg cagcttttcc aaatggtgcg acagggttga gttcgggatt ttgagctttt 14161 cctgaatctc acccactgtc atcccttctg gataagccgc aaacagcagc cgcattattt 14221 caagtcgtgg ttccgacccg agtgcagcaa acatatctgc atatcgagcc gtttcttgat 14281 ttaccatatt tctagaataa tcgaaatata gaagatgtca atattaatat ttagtcatga 14341 cccgtgagtc atgagttagc aattgtctgc ttgtccctta ttcctatgcc ttaccactcc 14401 cgatcaaata cagcagcgcc atacgaacag caacaccact agttacttga gcttgaatga 14461 ggctaaattc cgggtcatcc attaactctg agctaatttc gacaccacgg ttcactgggc 14521 cagggtgcaa aagcttgaca tttggcttgc acagttgcag ccttgagtgt gtcataccaa 14581 acatctggtg atattctcgc aaagaaggca ataaatgagc agtcattcgt tctttttgca 14641 agcgtaaagt catcacaaaa tctgcatctt gcaaagctgg ttctaactcc cagtggagaa 14701 acactttacc tggtgtattt tcacagtaat 
ctgcaaataa tttaggtaag agggtgggtg 14761 gtgctgctaa atgcacatca gcaccacttg ctgtcagact ccaaatattt gagcgtgcta 14821 cccgcgaatg gagaatatcc ccaacaatcg caatcttttt ttctttcaaa agctcgactc 14881 ggggtagatc tggatcaagg agactacaaa tagtaaataa gtccaacaga ccttgggaag 14941 gatgttcgtg ttgaccatca ccagcattga ggacgctgac tcgtacaccg agtcgatcca 15001 tttcttgggc gatcgcatca ggaactccag cctcccgatg gcgaatcacc attatatcag 15061 ttcccattgc caagtaagtt ttcgccgtat caagaatagt ttctcctttt gtcattgaag 15121 aagttcctga ggcaaaattc agtgtatctg ccgatagccg cttagcagcg agttcaaaac 15181 tgctgcgagt gcgagtcgat ggctcaaaaa ataaattcgc cacaacctgt ccctgcaaag 15241 ttgggacttt cttcgtccgc cgagatagaa cctcttgaaa actagcagcc gtttgcaaaa 15301 cggtatcgta ctcatcagtg gtgaagtcag cgagggaaag aatgtggtga cgactccagg 15361 tgttattaga cataaatagc gaggatattt ggcttaaaga cacgtctaca gcgcactcca 15421 gggggaagct tcgatgcaca ctgatagcgc ctttggagtt tgggtaaaac ccgtgtaaaa 15481 attaggcaac atattgtttc agtggcgctt attatatcgc atttttacag ttttttaatt 15541 tatatttcag agaaaatcat catacacctt gcgggaattt ggaaaccatg cttggcgttg 15601 caaaacgaag gggaacctgc aaaggcgcac tgcctcctga atacgcttgc cccattttct 15661 tacgtgttac acctgaattt agctgaccca aaagaagcaa atccgagaac tgaataagtc 15721 tgaatttctg cttgtgcagg agttgtgccg tttcgctaag aacctttaca acatagggca 15781 acagcctcgt tcaaagcaac ttcttagaac tttcatgaca gtctccccag agagtgtcat 15841 aagctaatgt aacgaatagt gacaaaactt acaacagacg aaacaggtca ggaactctgc 15901 ccagtcttat aggtacatta tgaaacagtg ttttccagac aaatgagcca tgctacttga 15961 tgatctgcgg ctgagtgaac cagcaataga gacggcggta gactataaac cactgactgt 16021 cacaccagaa acattgctgg tagatgtcat tactttaatg aaccaaaagc gaattcacag 16081 ttgctcatta tctcgtgtta attcaccgtt aaatggattg accgtacatc cgacaggctc 16141 tagctgtgtt ttggtgataa agaatactaa agtattgggt atttttacag aaagagatat 16201 tgtacggcta accgcagatg aagttgactt tagaaaggtc acaatagcgc aagtgatgac 16261 gcaaccagtg atgacattgg agcaagcgac ttttcaggat gtttttgcgg ctttattttt 16321 atttcggcgg tatcgtattc gccatctgcc tatagtagga caaaagggcg aacttattgg 16381 tgttatttcc 
cacgagagta ttcgtaggat tttacgccct gttcatctcc tgaagttcat 16441 gcgtgtggca gatgtgatga caacccaagt cattcgtgct tccatgagtg cttcggtatt 16501 aactttagct caggtgatgg cagagtttcg gattagctgt gtcgtcataa ccgaggatca 16561 accaccgtta gatttgggac aatttgttca cattcccatc ggaattgtca ccgaggagga 16621 cattgtacaa tttcaggcaa tggaagtgaa tctatctgaa gttctagcgc aaactgtgat 16681 gagtacacag ctatttgtgc tcaatcctga ggattcttta tggactgctc ttgtggaaat 16741 gcaacggcaa cagacgcgaa gattggtggt atcttgggat cgaggaatag gattaggcat 16801 tgttactcaa actagcttac tgcgagtcct tgatcccgta caaatgtatg gagtcatcca 16861 aactttgcaa cagacagtta agtagcgtct acgcttttcc acgatgcctt tgcttggctt 16921 tttgacgcgc tagtagtcgt ttttcttctt tttcttcttc cctcaccgct ttagaaatct 16981 cgtttctctc acgtttcaaa ttttcaaact ctaatttgag cacttgctgt gcttttgtgg 17041 agatacctct tcgcctttat attttttgta cacaaaaatc gtttcagctg ctatattttt 17101 taattagccc tcttcaggtt tgcttctacc aaaactgaca gcagaattct catctgctga 17161 gcatgtgaac aaatctttgc gaggacagta ttgatggaat caaacaacgc gacattcgag 17221 cagcaagttt ttgaactaac taaccaagaa cgagctaaga ataatcttcc acctctgaaa 17281 gcaaatgctg aactgaacta tgctgctgac aaatatgctc aagagatgtc acaacgtggt 17341 attttgagtc atacatcacc agatgggtct caagcgtggg atcgagcgaa ggttgttggg 17401 tattcagctc aaatgatggc agagaacatc gcagcagggc aaacaacacc tcaacaggtt 17461 gttcaagatt ggatgaacag tcctggacac cgagctaata tcctcaagcc cgaatatact 17521 gagattggca ctggtttttc taacaactat tgggttcagg actttggtag tggtgataca 17581 aatcccatga gctacatacc aaactctaca tctaactcaa cgatcgcatc tacctcaacc 17641 cctaccccgc aatctgcctc catgcctaca gctacattgg acagtgtttt tccttcaaac 17701 tctgttctag cttctgttcc agcatctacc cctgttgctg aacctacagt tgaagctgct 17761 tctaacactg atacaggatc tggtagcggc aaagtcatag acaacccagt taatgactcg 17821 ctactcttgg gtggtttcaa caacaatacg cccgactacg cttctggaca tgatactttt 17881 atgcttggtc aggttgaaaa ctttgattgg attggcaatc ttcagtttag ccaaggtatt 17941 cctattatag aaaatagcag aatccttgag atattaccag ggtctgaaat aaataacaac 18001 atccccattg gggataggaa tatagagact ttaattgcac aaggacaaaa cactttctcc 
18061 tagagattgt gtaagttaca ctcctacact gtaatggagg gtgagtgtag gagtgtaaat 18121 aaaaggatct ttgattaatc gtcaatggga tatacccatt ttggaggttc catgagtttt 18181 tttagtcgcg ctgcttcagc tatttcatgc attttttcta tagaagtctc ttgaataaac 18241 tttgatgcag cctctatgtg cgatttttca tcgtccgcgc tgttgttggt tttagtttct 18301 ggattattta gtttatcttc ttgcatgcaa acctcctgat gggttatttt gaaaataagg 18361 gattggtgat aggaaaaatt atctatagtc tattcgctat ggatttaaaa taatagtcaa 18421 atctaaaaca aaaccaggta aaacttcttc accggaaagt tgaggagggt tatcaaaagg 18481 gaaattcaaa acttctacat cttgttctgg tcgataaatt tctaccaaag gcgtttgcgg 18541 atcaattaac caacctaaac tcgctccatt ctcttgatat tcccgcattt tggtgcgtaa 18601 tttatctaaa gcatcacttt ctgaacgtag ttcaattaca aaatcaggac aaataggtgg 18661 aaaacgtcgt ttttcatctt ttgttaaagc ttcccaacgt tctaacttca cccaagaagc 18721 atcaggacag cgataagcac cactgggtaa tttaaactca gtggaagagt caaagacttt 18781 tcctaacttg gtttgacggt tccataaacc gacttgcatg ttaatgtctg aatttctaat 18841 tccgctttct ccgcctgttg gagggataat gattaattct ccagtctggg taagttctaa 18901 ttgccaaggt tcgttagcaa tgcaaagttg ataaaattgc tcatcggtta aacctacggc 18961 tggagatacg tttaaggtga ggatttccat catggttttt tacggaaggg gttaattcta 19021 ttatcaaaca aagactgaag ccaaaaagga aaaaagatca caaacataag tctgttgagt 19081 ccaagaaatt aaagtcgcta aaagcaaagg taaaaagtct acaacaaatg gtagaagcat 19141 ctgccacaga aattatgtaa aactaaaatc acgctacaga tgtaaattga gaaagtcaga 19201 tttcatctgg gttacttaat tacgaattac gaattacgaa tgactatgaa ttggagtgct 19261 gcattaccaa cattttttat taccttaaga gaaggcgtcg aagctgcttt agttgtcgga 19321 atcgtcttcg cctgtttaca gcaagctgaa caacaaagat tgcatcgttg ggtatactta 19381 ggtgttatca gcggcacttt agctagtttt gttgtcggtt tactactaaa tctgggaatc 19441 caaggtttgc agacatctga tttgctgtat gcacctgtct tcaagcaact tttcgaggtg 19501 ggactgggcg ttattgcgat cgccatgctc agttggatgc taatctggat gacacgacaa 19561 gcacgatttc ttaaagcgga aatcgaaggc tcagttaaaa gtgctttagt tgaagacaac 19621 cgtgcagctt gggcaatttt cagcttaatt tttattgccg tatttcggga aggtttagaa 19681 acggctttgt ttatcgtcgc taagtttcaa gaaggttgga 
cacccgtact tggggcaata 19741 ggaggactaa gtatggctgt tttaatcggg atgctactgt ttaagtggag tatccgcatt 19801 aacttgagca aatttttcca ggtgatgggc gtgttcctgc tgctgattgt ctctggatta 19861 gtgatttctg cactccgcca tcttgatgcg gcggcgattg ctttgagtca aatcaattcc 19921 tcagttgctg atgcttgcgg acaaggaaat acttcttgtc ttttaggtcc ccaagtttgg 19981 gatctctcag gtattttacc tgatcgacag tttcctggaa ttctgctgaa aactttattt 20041 ggctatacgc aaaaactcta tttcgttcag gctgtaggat atctcctgtt ttgggtggtt 20101 gtgggtagcc tctatttccg cagtcttagc caacccacac aattaaagcc agcaaccaaa 20161 ctcaattagg acttatttat atcatgtccg tgggaaaacc tcaccctgcc ctatcgggca 20221 tccctctcct tataaaggag agggaaagat tttagcgcag cgaaaagcga gggtgaggtt 20281 ttgagcgagc cggtgtgtac accgttaaaa cctcaccctg ccctgtcggg catccctctc 20341 cttataa // LOCUS NODE_1597_length_20233_cov_5.18614320233 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 20233) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 20233) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..20233 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 172..615 /locus_tag="DP116_14355" CDS 172..615 /locus_tag="DP116_14355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949992.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14355" /translation="MTQLSTAEIEQYREVLPNDASFQSALTQIEQHNGDIDSYLDEIL LDKFGATRDYQKSLREVTLKRLRKELCGADDSFRTKVQEYKRNPASAPLLTGLIVSLL AMTGVPLDPTIATVIVLYIVHVGIDIFCEYTEPDADALGNQPPKN" gene complement(1019..4372) /locus_tag="DP116_14360" CDS complement(1019..4372) /locus_tag="DP116_14360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194104.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Tfp pilus assembly protein PilF" /protein_id="PRJNA477356:DP116_14360" /translation="MPVITIREEEKTDTGFEVTLRFEEEEYLVTIANPFTSKEETELE WYFEEWLEFPFSDTAIAQRAAASIKKYGQNLFNQVFTDRDAYSQYQRLGKGHNPLQIE IVGKTPDFQALHWEALWEPGYPQPLAVDCVMVRKSVKRAGKRADVPQSPVINLLVVIA RPNEEQDVGYRTISRSLIEAIENSQLAVNVELLRPGTYEALAKHLEEKGPGFYHIIHF DTHGALMKYNDIQKETANPYAYQLRWGRYDLQPYEGVKAFVFLEGETKGDADPVEATE LANLLMGKGIPVCLLNACQSGKQVKMEDAENDYRETSLGSQLMAAGMQMVVAMGYTVM VDAAKLMMQQVYAHLFDHKDISEAIRLGRLELFNDKKRKAYFNKHIDLEDWLLPVVYS NQAVKFNLREFTPQEEEEFWQTEGERYRFTSPEYGFFGRDLEILKIEKSLLRHNILLL QGMGGTGKTTLLKYLRQWWQRTCFAKDIFYFGYDHKAWTLTQILFEIGTRVYKRNELA NFQTMNQTVQVPKLVAKLRAESYILILDNLESVTGQPLAIQNTLPETERNQIKDFLTR LVGGRTRVILGSRSREDWLQATTFKNNIYELQGLDKEARSDLAQAILQSHVTAKRILA IRQDEDFQKLMQLLAGYPLAMEVVLANLKNQSPKEILSALQSADVKLDTGNEEKTKSI LKCVEYSHSNLSPEAQKLLLCLAPFSGFINRRDIPNYIKKLQTQEPFKDYHFEKFDEA IQEAINWGLLSPMNLTPSPSPTRRGGEEDLPFLAIQPVFPYFLKTKLETLDEETREAL QSGFKNHYEGGANFYNQLMESKEAQERQLGIDFCRLEYENLYNALQICLDKQENISIY FCLNKYFEFINDNQSNLKLAEMVCQRLQKYPQVFIEGELGYQIPFAIHRLGTCQLETK QYQQARKSYEKTLEFYDALGSEEERQKQRWKASTYHELGRVAQDLREFSEARRNYELA LQIKIDFGDRYSCASTYHNLGMVAQDLREFDEARRNYRQALQITIEFGDRYSCARTYH QLGWVAQIWREFSEARRNYELALQIFIDFGERYSCASTYYCLGLLAQAEENYPEARAN LQKALEIYVEYKNESSAEVVREDLERLPE" gene 4687..5619 /locus_tag="DP116_14365" /pseudo CDS 4687..5619 /locus_tag="DP116_14365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319609.1" /note="incomplete; partial in 
the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" gene 5842..6399 /locus_tag="DP116_14370" /pseudo CDS 5842..6399 /locus_tag="DP116_14370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744339.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" gene complement(6855..7055) /locus_tag="DP116_14375" CDS complement(6855..7055) /locus_tag="DP116_14375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455637.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NblA-related protein" /protein_id="PRJNA477356:DP116_14375" /translation="MNQPIELTLEQQFNICSFATQVQHMSHDQAKDFLVKLYEQMVLR EATYKELLKHQWGIDSGESWAA" gene complement(7654..8319) /locus_tag="DP116_14380" CDS complement(7654..8319) /locus_tag="DP116_14380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_14380" /translation="MSITPESVKELLESQDLGDRLRAVNQIRQLEPKLGFELVQNAIN DSNSRVRYSAVSQLDTLGKQDLDLSLNILRDRLLNDPEADVQAAAADCLGALGLREAF DDLQQLYQTTNEWIVQFSIIATLGELNDPRSFELLKTALSSDNDLVKTAAISSLGELG DSQAIPLLAPYATDPDWQVRYRLVQALSRLGGTDAKSILETLANDEAEAVATEAKKSL TEA" gene complement(8545..9006) /locus_tag="DP116_14385" CDS complement(8545..9006) /locus_tag="DP116_14385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127819.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribulokinase" /protein_id="PRJNA477356:DP116_14385" /translation="MPKIVADIMSRDPIVVEPETPLQEAIKILAERRISGLPVVNDAG KLVGIISETDLMWQQTGVTPPAYIMFLDSVIYLQNPADYERDLHKALGQTVGEVMSKN PITIAPDKTVTEAAKLMHDRNVHRLPVLDSESQVIGILTRGDIIRAMAAEQ" gene complement(9189..10232) /locus_tag="DP116_14390" CDS complement(9189..10232) /locus_tag="DP116_14390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114813.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="M42 family peptidase" /protein_id="PRJNA477356:DP116_14390" /translation="MCNYDRLFHTIEQLVMHHSPSGAETEINKFLIQQFAALGVEVWC DRADNIIAKIQGLDSTRQIAITAHKDEIGAIVKTVGEKGRVEVRKLGGSFPWVYGEGV VDLLGDNTTVSGILSFGSRHVSHESPQKVQQEDTPVKWENAWIETKRTTTELEAAGIR PGTRMVIGKHRKRPTRLNDYIASYTLDNKASIAILLALAEKVKQPAVDVYLVASAKEE VGAIGALFFTQNQRLDALIALEICPLSSEYPIEDGESPVILFQDGYGIYDETLNGQLR HCAKQVDMSVQLATISGFGSDASIAMKFGHVARGACLAFPTQNTHGYEIAHLGAIANC VDLLQVFCETEFE" gene 10424..11029 /locus_tag="DP116_14395" CDS 10424..11029 /locus_tag="DP116_14395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315263.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_14395" /translation="MYHQQRQNLLFTLSAILTGTTLLAGVGIYVVLKGVETPVAVPKQ IVSTQDITEAPVDTLTPPPEDTLVSSPEDTQVVTIQTQRNELGKNLGGVNWQGKDLRT MKLRNANLGGANLANTDLSGVDLSGSTLSGANLANANLSGVNLRSANLGGAILNNANL RNANLSHADFRGARLDNANLKGAILKGTLLEGVNLNNTIMP" gene 11469..12239 /locus_tag="DP116_14400" CDS 11469..12239 /locus_tag="DP116_14400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198024.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14400" /translation="MKSQILATAVFFTLISLSQTIQAANIDHVRQLLASKQCQNCDLA GAGLVMADLSGADLSKANLAGANLSRANLNGADLRGANLSGVGLFGANLTEANLSGAN LQNADLRNTYLVNAQLNGVNLNGINLQGAIGIPLQIATHEKFYAWGVTEAKKGNQQQA IDYFNQAIVMKPDYAGAYLARGVARYQIFDRQGAFQDAQVAEKLFRTQKNSSGIQTAQ AFIKELQTPYETKVKPKQPDFIDFVGSVGSLLLQFLPF" gene 12308..12931 /locus_tag="DP116_14405" CDS 12308..12931 /locus_tag="DP116_14405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315266.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TenA family protein" /protein_id="PRJNA477356:DP116_14405" /translation="MTISKELWQANQDIAQACLEHPFVQGIGDGTLEQKKFGYYVGQD AFFLEAFARAYSIAAAKAPDFSVFTTFHFLADGVLQELKLHEGYAAKWGVSLHSVEPG VTTRRYTDFLLATAWSGDVGLTAAAMSPCMRLYAFLGKQLAGDNIPNHQYADWIRTYS GSDFQPLTQQLEDLVDSYTSRTTLVESTYRYAMLCERDFFQAAWEYL" gene 13057..13477 /locus_tag="DP116_14410" /pseudo CDS 13057..13477 /locus_tag="DP116_14410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409463.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="fatty-acid oxidation protein subunit alpha" gene 13465..13800 /locus_tag="DP116_14415" CDS 13465..13800 /locus_tag="DP116_14415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409464.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_14415" /translation="MEKLVLYRQLVQQLLLEYGKQKPAYGDIEVEKIFDIKRDHYQIV HVGWEGDNWVHSCIVHIDIKGGKIWLQWNGTEDDIASDLVAAGVPKEDIVLGFQSPFM RQFTEYAVS" gene 14269..14688 /locus_tag="DP116_14420" CDS 14269..14688 /locus_tag="DP116_14420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872767.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_14420" /translation="MSVDDLLKRYHAGERNFKGVNLIGGYLNGVNLSGASLKGASLSE VDLSKANLVGVDLTEASLVRANLTGANLTGANLSGANLHEAKLTGAILDGAVLKMADL IDADFTAANLEGANLYGAHMMGTLLQGAIMPDGKINN" gene 14810..16496 /gene="ilvD" /locus_tag="DP116_14425" /pseudo CDS 14810..16496 /gene="ilvD" /locus_tag="DP116_14425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410139.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="dihydroxy-acid dehydratase" gene 16549..17049 /locus_tag="DP116_14430" CDS 16549..17049 /locus_tag="DP116_14430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004598334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14430" /translation="MIIVSNKRGVKVDDSGIRCVGGGLTGGGLTGGGLTGGGLTGGGL TGGGLTGGGLTGGGLTGGGVSPVVGGGVSPVVGGGVSPVVGGGVSPVEGGVTATVFEE LSSPPPKRKGRRIKGIPASAQYGNPPPPPRVTTSPSGIGAISWLITSPSGNIGSPPTT TSPGRP" gene complement(17972..18637) /locus_tag="DP116_14435" CDS complement(17972..18637) /locus_tag="DP116_14435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14435" /translation="MNVIIRLLSVFSLSVSALAASTNVARALETPTTPQQTTTQQNLC SSDTVENLLPTAVGKQSPSPLSYLAEAGFTQKPDGSWVCYVRDNTKQKRYYTLLKVQQ VNGALVASSFLENGTLTEGQDARSLELFMSLISNYTNTNQGDRQGIQRYLESFISLVK QGKVPASGRGYLFDLTSRGFVVYQPITQGQLQGTAITINITSPQNLDSSPVSQAKSLR GTV" gene 19417..20157 /locus_tag="DP116_14440" CDS 19417..20157 /locus_tag="DP116_14440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207261.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14440" /translation="MALLAELCKNISLGFQQTKDFLNQSVNSFTTSAQQIGESWQERA SQATHRTVDTVTTTLEQAKASVEQTLQSADKVKNTTSVAVQTAISSAMSHWFEQHPTL LRLVQILGWAANHPIISLVILVFALAVIWSLIKAIGRIVESASLSVLQVPLKLLQAVA KFSFVSFTKVGSLVFKQLTDTKTTDSIPTVLPTISPIHKDNQQRLVEISTRLEEIQQE QNQLLQEAAELLASEKIDREAQMQHYIK" BASE COUNT 5743 a 4281 c 4318 g 5891 t ORIGIN 1 aaatctttcc ctctccttaa taaggagagg gatgcccgac agggcagggt gaggtagaaa 61 tcatgaatgc aacgcgcgta tccgtttggc taactttttc cgtaatctca acctccgcca 121 atgtaatgta tatttgtttg tgcctacaat cttcaataac caaggttcag catgactcaa 181 ctttctaccg ccgaaatcga gcaatatcgc gaagttttgc caaatgatgc atctttccag 241 tcagcactca ctcagatcga acaacacaac ggcgacattg actcttattt agatgagatt 301 ttgttagata aattcggtgc gacgagagac tatcaaaaat ctctgcggga agtgacactc 361 aagcggttgc gcaaagaact ctgcggcgct gatgatagtt tccgtaccaa agtgcaggaa 421 tacaaaagaa atccagcaag tgcgccgtta ctcacagggt tgattgtctc attgctagct 481 atgactggag tgccactcga tccgactatt gctactgtta ttgtgttgta cattgtccat 541 gtcggcatcg atattttttg tgagtatacc gaaccagatg ctgatgcttt gggaaaccaa 601 ccaccaaaaa actaatcctg aagcacctta taagtggtta cgcacaatta tttatgtcat 661 tgcaagcgaa atgaaatgta gcgaagcaat ctcaagccct tgcgattgct accctatggc 721 tgacgccacg ccttacggct atcgggaagc cgctagcccc taagggggcg ctgcgcaaac 781 gcgtctacat tccgcttcgc ttcattcgca atgacatata ggaatcatat ttgatttttc 841 
tcgttccctg gttcaaccag ggaatgccta cgatgaggct ctgcctcatc aaacatcagt 901 acaattattg aattttgact agatctgtga gtggaggcag cagcccgact tgattacatt 961 cccatgccga gcatgggaac gagaaatctg aaatcgcatc atcaagcctg atttctaact 1021 actctggtaa tctctctaaa tcctcccgta ctacctcagc gctagattca ttcttatact 1081 caacatatat ctctaaagct ttttgcaaat tagctctcgc ctccgggtag ttttcctctg 1141 cttgtgcaag caatcctaaa cagtagtagg tgctcgcaca agagtagcgc tcgccaaaat 1201 cgataaaaat ttgcaaagct agttcgtaat tgcgccttgc ttctgaaaat tctcgccaaa 1261 tttgggctac ccatcccaac tggtggtagg tgcgagcaca agagtagcga tcgccaaatt 1321 cgattgtgat ttgcaaagct tgtcggtaat tgcgccttgc ctcatcaaat tctcgcaaat 1381 cttgggctac cattcccaag ttgtggtagg tgcttgcaca agaatagcga tcgccaaaat 1441 cgattttgat ttgcaaagct agttcgtaat tgcgccttgc ttctgaaaat tctcgcaaat 1501 cttgggctac ccttcccaac tcgtggtagg tgctagcttt ccaacgttgt ttttgtcttt 1561 cttcttcact gccaagtgca tcataaaatt ccaaagtctt ttcgtaagac tttcttgctt 1621 gttggtactg cttcgtttcc agttgacatg tgcccagtct gtgaattgca aatggtatct 1681 gataacctaa ttcaccctcg ataaatactt gtggatactt ttgtaaacgt tgacaaacca 1741 tttctgctaa cttcaggtta ctttggttat cgttaataaa ctcaaagtat ttgtttaaac 1801 agaaatatat actgatattc tcttgcttat ccaaacagat ttgcagcgca ttgtagagat 1861 tttcatactc caagcggcag aagtctatac ccaattgcct ctcttgcgct tccttggatt 1921 ccatcaactg attgtaaaaa tttgctcctc cctcatagtg atttttaaat cctgattgca 1981 aagcttcacg agtttcctca tcaagagtct ccagcttcgt tttcaaaaag taggggaata 2041 caggttgaat tgccagaaac ggtaaatcct cctccccccc tctccttgta ggagaggggc 2101 tgggggtgag gttcatcggt gaaagcaaac cccagttaat tgcttcttga atcgcctcat 2161 caaatttctc aaaatgataa tctttgaatg gttcctgtgt ttgcaatttc ttgatatagt 2221 ttggaatatc cctacggttg ataaacccac taaaaggagc caagcacaac agcagttttt 2281 gcgcttctgg tgaaagattg ctgtgggaat attccacaca tttgagaata cttttcgtct 2341 tctcctcatt tcccgtatcc aacttcacat cagccgactg caacgccgaa agaatctcct 2401 tgggcgattg gtttttcaaa ttcgccagca cgacttccat cgccaaagga taccccgcca 2461 acagttgcat caacttctga aaatcttcgt cttggcgaat agcgagaatg cgctttgctg 2521 tgacatggct 
ttggagaatt gcctgtgcta agtcagaacg tgcttctttg tctagccctt 2581 gcaattcata aatattgttt ttaaaagtcg ttgcttgcaa ccaatcttca cgactgcggg 2641 aacccagtat cactcgcgtt cttccgccta ccaagcgcgt caaaaaatct ttgatttggt 2701 tacgttctgt ttctgggaga gtattttgaa tcgccagcgg ttgcccagtc acggattcca 2761 aattatctag aatcaaaata taagattcag cgcggagttt tgctaccaat tttggcactt 2821 gcaccgtctg gttcatcgtt tgaaagttgg ctaactcatt acgtttatac actcgcgtac 2881 caatttcaaa caaaatttgt gttaatgtcc atgctttgtg atcataccca aaataaaaaa 2941 tatctttcgc aaaacacgtc ctctgccacc attggcgcaa atatttcagc agggttgttt 3001 tgcctgttcc ccccattccc tgcaacagca agatattgtg tcgcagtaat gacttttcaa 3061 ttttgagaat ttccaaatct cgcccgaaaa atccatattc tggcgacgtg aagcgatatc 3121 tttcaccttc agtttgccaa aattcctctt cttcttgtgg cgtgaattct cgcaggttga 3181 acttcacagc ttgattgctg taaaccacag gtagcaacca atcttccaaa tcaatgtgct 3241 tgttgaaata agctttgcgc tttttgtcat taaaaagttc cagcctgccc agccgtatcg 3301 cttcgctgat atccttgtga tcaaatagat gagcataaac ttgctgcatc atcagcttgg 3361 cagcatccac catgacggta taccccatcg ccactaccat ctgcatccct gcagccatca 3421 actgacttcc gagactggtt tctctgtagt cattctccgc atcttccatt ttcacctgct 3481 ttcccgactg acaagcattg agaagacaca caggaattcc cttacccatc agtagattcg 3541 ccaactccgt cgcctccacc ggatctgcat cgcctttggt ttccccttcc aaaaacacaa 3601 acgccttcac tccctcgtaa ggttgcaaat catagcgccc ccatctcaac tgataagcat 3661 aaggatttgc ggtttccttt tggatatcgt tatatttcat cagcgcaccg tgggtgtcaa 3721 agtggatgat gtgataaaac cctgggcctt tttcttccag atgttttgct agtgcttcat 3781 aagtaccagg gcgcagtaac tccacattaa ctgccaactg actattttca atcgcttcaa 3841 tcagcgatcg cgaaatggta cgataaccaa catcttgttc ttcattcggt cgtgctatca 3901 ccaccagcaa attaatcaca ggtgactgtg gaacatcagc ccgttttcct gcacgtttaa 3961 cactcttacg caccatcaca cagtctaccg ccaacggttg cggatagcca ggttcccaca 4021 aagcttccca gtggagtgct tgaaagtccg gagttttccc gacaatttct atctgtaggg 4081 gattatgccc tttacctagc cgctgatatt gactataagc atctctatcc gtaaaaactt 4141 ggttaaataa attttgccca tactttttaa tactggcggc tgccctctgg gctatagctg 4201 tgtcactaaa gggaaactcc 
aaccactctt caaaatacca ctccagttct gtttcttctt 4261 tagaagtgaa tgggttagcg attgtaacta aatattcttc ctcttcaaat ctgagagtta 4321 cctcaaagcc ggtgtcggtt ttctcttctt cccgaatcgt gatgactggc atggtgtgac 4381 acttttaaca caaaaaggat atctatttat aaactacaca ttcactgttt gcgtttccgt 4441 atagctgcat catctgcagg gcaaagcggt tggaaattct ctctacttac gacttccact 4501 gacgctgggc actatttgtt acatctaatt tgtggatgtt ttggttatta ttgagtataa 4561 tttaaaatta tttagtgcta tataacacta aataagtaag caaaaacact atatactttt 4621 gtctgtatca acttacaagt tgtcagcaaa gctgggttct ctactcttaa actttggaat 4681 cgatttatgt cagatcccaa cattggtcgt ttactcggca aacgctacga actccaggag 4741 ataattggta ctggagcaat gggtagagtc tatcgtgcta aggacatttt gttgggtggc 4801 gtacccgttg ctgttaagtt tcttgctcta tccacccaaa atcaaaatct gcgagtgcaa 4861 gaacgctttg agcaagaagc aaaaacctgt gccatccttg gacacaaaag catccacatt 4921 gtccgagtca tggactatgg cgtagacgag aataatactc cgttctatgt tatggaatac 4981 ctccaaggaa ccagcctgag caatctcatc cgccaccaac cactctcttt accaagattt 5041 ttgaatttgg tgcgtcaaat ttgcttggga ttacagtgcg ctcataatgg tatctttgta 5101 gaaggcaaaa tatgcccaat tattcaccgc gatattaagc ccagtaatat agtggtcatt 5161 caagaccata gtttcggcga attagtcaaa gttctagatt ttggtattgc caagttacta 5221 catgccgata gcagctatac tgactactat ttaggcactc ttgcttattc ttctcccgaa 5281 cagatagatg gtcaggaatt agataaccgt gctgacattt atagtttggg agtcatgatg 5341 tttgagatgc ttacaagcaa aattccccta gtcgcatcaa ccaactcttt tggagcatgg 5401 tacaaagtac accactatca acaaccacgt ttttttgctg aagttgcacc caatctagag 5461 ctaccacaga gtgtgaaaaa tttggttatg agttgtttag ccaaaacacc aagctcacgt 5521 ccccaaagta taaacgaaat tcttcaggtt ctgcgatctt tagaacaggg cgatcgcgca 5581 agtcaaactt tgccaaatat taatcaacct ccaactgtct gtgtctcagt caaaacagat 5641 gttgatacca ggttgcattc agaaatagca tttacaaatg ggggagcaac agagagtgag 5701 agaactagag agtatgggag tatgggagtg tgggagtgtg ggaaaatatc cttcccggac 5761 tcctcatccc acctactatc tttgaatcca actcagtatg aagacaaaag ccaagcagtg 5821 acgctatcat ctggatcgat agaggaaatt gtacagcaag cttcttggcc ctcgaataaa 5881 ccgatcgccg atatcgtctt tcctagtcct 
ctggatttga atggtaaggt aatacccgcc 5941 ttgtgggtga tgttaccgca tttggaaatt caaaagcgtt tggtttgtaa gcgctataat 6001 caatttcttt ttataacttc tcctcacccc atgctgctat ggattaccct catctacaac 6061 gctaaacatg gtgcaaaatg gctaacatac tacattgatc tcaaaacaac tttcgggcaa 6121 aaaatcactc gcctactcag acaaacaggt tactacccga tactgttctt tactcgggag 6181 acaccaaatc gttgtgttga tgtattactc tcaagtattg cttctacaca acgcctgtgg 6241 ttacatcact gggtaacgat gagcaacaag ctggtatctt ctgccgaacc acaatttagt 6301 aaaactttac ttaaaagtga gtatgaaaag attaagccct atattttggc aaagatagaa 6361 gcaactgata cagattcctc atttgacctt ttcgtctgaa aagaatatac tgtaaaacag 6421 tgtcattttt ataccatcat tccatccttg tgtaaagtct ttttaagaaa atatgtatac 6481 aatcatgaac aaagttataa gaaatgtaaa tacaaagaat atgaacagta aaatttagag 6541 taaactaatg actgggctta catgcgcgct tacgagtcaa acgctgtttt ttgaaaacac 6601 gtgcaatggc atttatcgtc aaaacctgca attaagcagc agtgtagtgt cataaatgca 6661 tacatatcgt catgtccaat tgttttgcca aacgttctgc gtcaccaagt gaaagtcact 6721 tgataagatg aattatcgtt gggaaaatct ctaatcgtag ttgcttatga ttagttgacg 6781 ccccggcatc cccagctcgt ttcctccttg gaactgagca agagaaggag gtcgcctact 6841 acggcgctaa cactctatgc tgcccagctc tcaccagagt ctattcccca ttggtgctta 6901 agaagctctt tataggtcgc ttcccgaagt accatctgtt catagagctt gactaaaaag 6961 tctttagctt gatcatggct catgtgctgg acttgagttg caaatgagca aatgttgaat 7021 tgctgttcga gcgtcagttc gattggttga ttcatgatga actcctaaaa agaaaaagta 7081 aaggtgatat ggctgtcagg taaagtgata aggtatgtcg caggagcttt atttgtccaa 7141 aacatttgta cctccgtatt acgaacgata acaatagttt gacaaattgc ccatacccta 7201 atgggtaatt tttggattgt gacaccttgt tcttaagcca actgatatag accccaagtc 7261 tgttgcagca gattccggga aaatataggt ttttgaaccc ctttttttgg caggaactcc 7321 cttagttttc cctagtttta gattggggtt tgttttaaga tgtagagatt tttacataac 7381 tacaccgaga cctgcacagg ttattgcgct cttgtcccac aaaccctgtt tctttagggt 7441 atttcaggta ggtaaaaatg aattatattc gccctacctc attgagtgca agatgtcaat 7501 tacacaaaat atgatgatta gcctcctgag ttgagaactg aggaggctat aagtatattt 7561 atcaaactgt aaacgggaaa aaagctcagt atgaacaaat 
ttttagtggt atggagcaat 7621 acggttcact gagtcatgct aaagccagat gcttcaagcc tcggttaaag atttcttggc 7681 ttccgttgca acagcttctg cttcatcatt cgccaaagtt tccaatatgg attttgcatc 7741 tgtaccgcct aaacgactca aggcttgcac cagccgatag cgcacttgcc aatctggatc 7801 tgttgcatag ggagccaaca agggaatagc ctgcgagtct cccaactcgc caagagaact 7861 aatggctgca gttttgacta agtcattatc tgaagaaagt gccgtcttga gcaactcaaa 7921 acttcttgga tcgttcaact ctcctagtgt ggcgatgatg ctgaattgta caatccactc 7981 gttggtggtt tggtaaagtt gttgtaaatc gtcaaaagcc tcacgcagcc caagagcacc 8041 taaacaatct gctgctgctg cttgaacatc ggcttcggga tcattgagca agcgatcgcg 8101 caatatgttt aaagataaat ctaaatcttg tttacccagc gtatccaact gactcacagc 8161 tgagtagcgc acacgggagt tgctatcatt aatcgcattt tgaaccaact caaaacctag 8221 ctttggctct aattgacgga tttggttaac tgctcgcaag cgatcgccta aatcttgaga 8281 ttccagcagt tccttaacag attcaggagt aatgctcatt taatgatcct aattttcaaa 8341 gatgttttta aaaagagtta accagttatc agttatcagt tatcagttat cagtcatcag 8401 ttatcagtca tcagttatca gtttgaagaa taattcagaa tacaggaatt cttgattcca 8461 aaggaagaaa taaagaattt agttttccga cttccgactt ctgacttctg actcctgact 8521 tctgactcct tcactgttca ctgttcactg ttctgcagcc attgcgcgaa tgatgtcacc 8581 acgggtgagg ataccaataa cttggctctc gctgtcaagt actggtaggc ggtggacgtt 8641 gcgatcgtgc atgagtttag cagcttctgt gacagttttg tcaggagcga tcgtgatcgg 8701 gtttttgctc atgacttcgc caacagtttg cccaagtgct ttgtgcaagt cacgttcata 8761 atctgcggga ttttgcaaat agatgacgct atcaagaaac ataatgtaag ccggaggggt 8821 tacaccagtt tgttgccaca ttaagtccgt ttctgagata ataccgacta acttcccagc 8881 gtcattcaca actggtagtc cactaatgcg tcgttctgcc aggattttaa tcgcttcctg 8941 tagaggagtt tctggttcga cgacgattgg atcacggctc atgatatcgg caactatctt 9001 aggcattggc ttactttcag aacaactcag aatatagaat acagaattgg tggggattca 9061 ggatggaaga attttcttta cgaattaatt cacctacctg ggttcattgt aaagacttgt 9121 gggtattaac tttatcacgt taatacattg ttacttgtag agaaatgagg gaaatagagg 9181 aagaataatt actcaaattc cgtctcacaa aaaacttgta acaaatcaac acagttagca 9241 attgctccta agtgggcgat ttcgtatccg tgagtgtttt gtgtcggaaa 
cgccaaacaa 9301 gcaccgcgag caacatgacc aaatttcatc gcaatcgaag catcactacc aaaaccactg 9361 attgttgcta gctgtactga catatctact tgcttggcgc agtgacgcag ttgtccattt 9421 agcgtttcat cgtaaattcc atatccatct tgaaaaagta tcacagggct ttccccgtcc 9481 tcaattggat attctgatga gagaggacaa atttctaaag caatcaaggc atctaagcgc 9541 tggttttgag taaagaaaag tgctccaatt gctcccactt cttcttttgc tgatgcgact 9601 agatacacat cgactgctgg ctgttttact ttttcggcaa gtgctagtaa aattgctatt 9661 gaagctttgt tatctaaggt gtaactggca atatagtcgt ttaacctcgt tggacgtttt 9721 cgatgcttgc caataaccat tctggtacct ggtcgaatac cagctgcttc caattcagta 9781 gtggtacgtt ttgtttctat ccaggcattt tcccatttga caggagtgtc ttcctgttgg 9841 actttttggg gagattcgtg ggagacgtga cgggaaccaa aactgagaat accgctcaca 9901 gttgtgttgt ctcctagtaa gtctaccact ccttcgccat aaacccaggg gaatgagccg 9961 cctagcttgc gaacttcgac tcgacctttt tcgccaactg ttttcacaat tgcgcctatt 10021 tcatctttgt gagcagtaat agcaatctgt ctagtggaat ctagcccttg aattttagca 10081 atgatattgt cagcgcggtc gcaccacact tccactccta gcgccgcaaa ttgctgtatc 10141 aaaaatttgt taatttctgt ttctgcacca ctaggagaat gatgcatgac taattgttca 10201 attgtgtgaa ataagcgatc gtaattgcac atcgttaatt ttaactgaac ttcagtgtac 10261 tttataggac ttacgcacga gttacgaaag aacaagactg tagttacgac ttacgtcgca 10321 atgacgcaac tacgttattt ttgcgtcagt cctgctttag tcaattgaga actgctataa 10381 atagtaggca aaagcccaaa tttataccac tcaagcagga caaatgtatc accaacaacg 10441 ccaaaatttg cttttcaccc tttccgcgat actgactgga acaactcttt tagctggagt 10501 tggaatctac gtagtgctca aaggcgtgga aacaccagtt gcagtaccca agcaaatagt 10561 aagcacacaa gatataacag aagcacctgt cgatacatta actccgcctc cagaagacac 10621 attagtttca tcaccagaag acacacaagt cgtaacaatt caaacacagc gaaacgaact 10681 ggggaaaaac cttggtggtg tgaattggca aggaaaggac ttacggacga tgaagctgcg 10741 taatgctaac ttaggcggtg cgaatcttgc aaacactgac ttgagtggtg ttgatttgag 10801 tggatctacc ttgagtggtg cgaatcttgc aaatgctaat ttaagtggtg tgaatttgag 10861 gagtgctaat ctcggtggtg caattcttaa caatgccaat cttcgcaatg cgaatctgag 10921 tcacgccgat ttcaggggtg ccaggcttga taatgctaac 
cttaagggag ctattttgaa 10981 aggaactctt ctagaaggtg taaacctgaa taacacaatt atgccttaga aagttcgttc 11041 taggtaagca actcacgcac atgatactac gtcagtgacg agcgtcactc aaataattga 11101 tattagtcac tttccccaaa tttgtttatc tcttagcttg ggagtgaaga caaatcatcc 11161 ttgtacttag actacaagta ctgatttctt acttggcagt tgtgaggaat aacttatgcg 11221 agaaaaacta ttgctagctg ttactctgac atttgccttg agcttgttta cagaactgaa 11281 ctggttttct tcagcccgta caaccacaga caagacaaat tctgattcat ccgctttcac 11341 tgtcactcaa cgtcataaag agaattatac gccatagcaa cagtgatata actaggaaaa 11401 taaaaaagat aaaaagatat gtctaaggaa acactgagtc ttcagggaaa gacattgcga 11461 cgtcaatgat gaaaagccaa attctagcca ccgctgtatt tttcacgctt atcagcctca 11521 gccagacaat tcaagcagca aatatcgatc acgtcagaca gttattagca agcaaacagt 11581 gtcaaaactg tgatttagcc ggggctggtt tggtgatggc tgatttatca ggagcggatt 11641 taagtaaagc caatcttgca ggtgctaact tgagccgcgc taacttgaac ggtgctgatt 11701 tgaggggtgc aaacttgagt ggtgttggtt tatttggtgc taacctcacc gaagcgaatc 11761 tcagtggggc taatttgcag aatgctgatt tgcgaaatac ttatttagtt aacgctcaat 11821 tgaatggtgt gaatcttaat gggattaatt tgcaaggagc aataggcata ccattgcaaa 11881 ttgctacaca tgagaaattt tatgcttggg gtgttaccga agcaaaaaaa ggcaatcaac 11941 agcaagcaat tgattatttc aatcaggcta ttgtcatgaa acctgattat gcaggtgctt 12001 atctcgctcg tggtgttgct cgttaccaaa tatttgaccg acaaggtgca tttcaagacg 12061 cccaagttgc tgaaaaattg tttagaaccc aaaaaaatag cagtggaata caaacagcac 12121 aagcttttat caaagaactg caaacacctt acgaaactaa ggtaaaacca aaacaacctg 12181 atttcattga ttttgtagga agtgttggct ctttactact ccagtttctg ccattttaga 12241 gaggaggatt agtgattagt cgtgagtgga tagtagtatt taattgacaa ctcatcgctt 12301 atgactcatg actatctcaa aagaattatg gcaagcaaat caagacatag cccaggcttg 12361 tcttgagcat ccctttgtcc aaggtatcgg tgacggtact cttgagcaga aaaaatttgg 12421 ctattatgtc ggacaagatg cttttttctt ggaagccttt gctcgtgctt acagcatagc 12481 cgcagctaaa gcaccagatt tttcagtatt tactacattt cactttttgg ctgatggagt 12541 tttgcaagaa ctgaaacttc atgaaggtta tgctgcaaag tggggagtca gtttgcattc 12601 tgtggaacct ggagtgacta 
cccgtcgcta tactgatttt ttattagcaa ctgcttggag 12661 tggtgatgtg ggtttaactg ctgcggctat gtctccctgt atgcgtttat atgctttttt 12721 agggaaacaa ttggctggtg ataacattcc caatcatcaa tatgcggatt ggattcgtac 12781 ttatagtggc tcagattttc aaccactaac ccaacaattg gaagatttgg ttgattctta 12841 tacgagtcgc acgactttgg tagagtcaac ttatcgttat gccatgttgt gtgagcgaga 12901 ttttttccaa gcagcatggg aatatttata aatcggcgct tgtgtttgct cctgatcaaa 12961 atcaggactg ataccagtcg ccagggcgcg ggaaaaccat ctgctgggcg atataatgag 13021 gtagcatgaa agcagaaaat atcattgtgg atacctatgt ctgccaaaga tgtctttcat 13081 gaagttgtca agaaagcttt acaaaaagat ggttggcaga taactcacga tccactttca 13141 tgagcgtgtg ggtggcgtga atatgtcaat tgatttggca gcagaaaaac tcattgcagc 13201 agaacgagaa ggagagaaaa ttgctgtcga gatcaagagc tttttggaaa agtcctctgc 13261 aatctcggaa tttcatacag cgttagggca atttattaat tataggggtg cgttaaggcg 13321 gcgagagcca gagcgtattt tatatttagc agtgcctatg acaatttaca acactttttt 13381 tcaacttgat ttccctaagg aaatggttca agaaaatcag gtcaagatga ttatttatga 13441 tgttaagcgt gaggtgattt cagaatggaa aaattagtgc tataccgtca acttgtacag 13501 cagttattat tagaatacgg caagcaaaaa ccagcttacg gggatattga ggttgagaaa 13561 atttttgata taaaacgtga tcactatcag attgttcacg taggttggga aggggataat 13621 tgggtgcata gctgtatcgt gcacatagac attaagggtg ggaaaatttg gcttcagtgg 13681 aacggtacag aagatgatat tgcctcagat ttggttgcag caggagttcc aaaggaagat 13741 atcgtgctgg ggtttcaatc tccttttatg cggcagttta ccgagtatgc cgtaagttag 13801 tcagttctat tgtctatcta acctgaacca gacacttgtg agtgaagatt aatatgcaaa 13861 ccgctctatt ttgtctcctc tgagagtaca tttaaaaatg tttgagcaac taccatcacg 13921 acttcaagta tactatcaca tctgatttcc tatctgaaga atcaagtcaa ctgcgtcgtg 13981 gttggaatta tatatgaagc gtaagagaac tacactcatt aagtcaaaaa acttgacaaa 14041 caatcataag tgcaaattgt ctaatataca actttaattg gtggtagtgt ttacgcccta 14101 acaaactatc atttgtgagt tgtttaccaa tgacttagtt tgaattgcca tcatgatatt 14161 ttgtcaatta atgagagaat ttgtcacaac ttgacttgat gacacaattc acactaaagt 14221 ctgttctttg aggcacactg catgcaagaa gtattgttaa atcgtgcaat gagcgtagat 14281 
gatttattga aacgctatca cgcaggagaa agaaatttca aaggtgtaaa cctgataggt 14341 ggttacctaa atggagtcaa tctcagcgga gcaagtttaa aaggagcttc cttaagtgaa 14401 gttgatttga gtaaagcaaa tttggttggc gttgacttga cagaagcatc tttagtaaga 14461 gctaacctga cgggggctaa cctcacagga gcgaatctga gtggagcaaa tcttcatgaa 14521 gcaaagctca caggagctat ccttgatgga gcagtgttga aaatggcaga tttgatagat 14581 gcagatttca ccgctgcaaa tctagaggga gctaatctgt acggggcaca catgatgggt 14641 actctgttgc agggtgcgat tatgccagat ggtaaaatca acaattagct acggtagcta 14701 tcatatttcg gcaaacgcgt gaacatcagc tctaaagtat ctgatatgtt gaaataatta 14761 caaaggaaat gtgccaagag aaagatgacg gcgcagggga attatcaaaa tgtcagagaa 14821 tttaagaagc caagttgtaa cgcaaggagt gcagcgatcg cccaaccgag caatgctgcg 14881 cgccgttggt tttcaagatg aggatttctc caaagctatt gttggtattg ccaatggcta 14941 tagtactatt atcccctgca atatggggat caacaaatta gcactaagag cagaaattgg 15001 cgtcagaaat gccaaggcaa tgccgcaaat gttcggtacg attaccatca gcgatggcat 15061 ttctatggga actgaaggga tgaaatattc cttggtatcg cgggaagtca tcgctgactc 15121 aatagaaacc gcttgcaatg ggcaaagtat ggatggtgtt cttgccattg gcggttgcga 15181 caagaatatg ccaggggctg ttattgctat tgctcgcatg aacatccctg ccatctttgt 15241 ttatggcggt accatcaaac ccggacatta caacggacgc gacttgactg tcgtcagctc 15301 ttttgaagca gtcggtcaat acagcgctga taaaatcgac gagaaggaat tattggaggt 15361 tgagagaagg gcttgtccgg gtgctggttc ctgtggtgga atgtacacgg caaatactat 15421 gtctagtgcc attgaagcga tggggatcag tttaccttac tcctccacga tggcagcaga 15481 agatgaagag aaagccgaaa gtactgaaaa atctgctttc ctcttagtgg aagccattcg 15541 taagcaaatc cttccctctg agattataac tcgtaaatcc atagaaaatg ctatttctgt 15601 cgtcatggcg gtgggtggtt ccacaaatgc agtgttgcat ctgctggcaa ttgcccacgc 15661 tgctggtgtc gaactgaccc taaacgactt tgaaactatc cgtgctcgtg tcccagtatt 15721 gtacgatttc aaactcattg gtcgatatgt agccacagat ttacacaaag tgggtggtat 15781 tcctcaagtc atgaaaatgt tgctcgtaca taacttgata cacggagact gcctaactat 15841 ttctggacaa acagtagccg aagtgctggc tgatgttcca gaagaaccac ctagtaatca 15901 agacgttatc catccttgga attatccaat gtataaaaaa agaacattta 
gccatcctca 15961 aaggtaactt ggcaacagag gggtctgtcg ctaaaattac tgggataaaa atacctaaaa 16021 ttactggacc agcacgggtt tttgaatctg aggaatcctg cttggatgcg attttagcaa 16081 agaaaattaa cccaggggat gtcatcgtca tccgctacga aggacccaaa ggtggtcctg 16141 gaatgcggga aatgctggct cccacctcgg ctattattgg tgctggtttg ggtgattcag 16201 tgggactcat aaccgatgga cgtttttctg gtggtaccta cggtatggtt gttggtcacg 16261 ttgcgccaga agcagcagtt gggggggcga tcgctctagt agaagaaggc gataccatca 16321 ccattgatgc ccatgctcgt tcactacatt tacatatatc tgatgaagaa ctagcccatc 16381 gtcgtgccaa ttggcagcca cttccacccc gttacaccag aggtgtgctg gcaaaatata 16441 ccaaattggt atcttccagc agtcttggtg cagtaacgga tttggatttg ttttgatttc 16501 tacatccgta caccgggctt gagccacttt ctgtagctga gtatagacat gatcatcgtc 16561 agcaacaaaa gaggcgttaa ggttgatgac tctggtatac gctgtgttgg tggtggactt 16621 accggaggtg gacttaccgg aggtggactt accggaggtg gactcaccgg aggtggactc 16681 accggaggtg gacttaccgg aggtggactt accggaggtg gacttaccgg aggtggagtt 16741 tcacccgtcg ttgggggtgg agtttcaccc gtcgttggag gtggagtttc gcccgtcgtt 16801 gggggtggag tttcgcccgt cgagggtgga gtgacagcta cggtctttga ggaattatca 16861 tccccaccgc caaaaagaaa aggtagaaga attaagggaa taccagcgag tgcccagtat 16921 gggaacccgc cacctccgcc tcgtgtgaca acatctcctt cagggatcgg agcaatctct 16981 tggctaatca cgtctccctc aggcaatatt gggagtcccc ctacaacgac atctcctgga 17041 agaccttgac caacaatctg gtcaaactct tgaggaatac cagcattcgg acggttaaag 17101 ggacttgcct ctggcgtgta tggtgcggag aggacatcaa gaagcttccc ttctagttgc 17161 tcctcattgc cactgttcaa ggcacgcaac ccttgagaga ccttactcac ataatactgc 17221 ttgacctctg gcggcaaatc ttgagattga atctgctgct tccaattggg aatttttgcg 17281 acttcttcca acagcaaacg cccataggag tatttcaaat tcagtaagcc actgcgaggg 17341 tcaagcgagt tcccagatcc ttcaggtgac cagtaaaagt tcttggttac ctgtcctaga 17401 cctgcttggg gtgtccgact aactgtttgc ccaagagtat accgatatcc atccataata 17461 taaggactgt cgttttgcca gaatgaggta tacggtatct tttcaagatt aggattattt 17521 ccatagaaag aggcaaaatt ttgaaagttt tgctgtcctc cagcggctcg gatcacgagg 17581 tcattgtaac taggtccgtt gttggattcc 
actaattgct gtagtactgc tccagacttg 17641 ttacaagcag caccaaatcc tacgctacat ccaggaatga ctcgcatttt actcaaatcc 17701 tggccggaga gttgataccg cagataattt gactccagtc ctggctggac accgtaatgc 17761 ccaatactga gctgagcaag tgcctgctga ggcgcactca gtattgccgc catataggat 17821 gaaaacgcaa taatactttt tgtaatacga gacttaaaca agatgaacct ccagtaaaca 17881 aggcagtagg ggattgaaga aaaagcaatt tcaaattcaa aaagattatt ttgagagagt 17941 tttttgaatt tcgttttgat gttgaatttt tctacactgt gcctcgaagg gatttcgcct 18001 gagagactgg agaagaatct aaattttgag gcgaggtaat gttaatcgtg attgctgtgc 18061 cctgaagttg tccttgtgta atgggctgat ataccacaaa tccacggctt gttaggtcaa 18121 aaagataccc tcgacctgag gcaggcactt ttccttgctt gaccagagaa ataaacgatt 18181 ctaaatacct ttggatacct tgacgatccc cttgatttgt attcgtatag tttgaaatca 18241 ggctcatgaa caactccaag ctgcgagcat cctgtccctc agtcagagtc ccattttcta 18301 agaaggaact agctacaagc gctccattga cttgttgcac tttgaggaga gtgtaatagc 18361 gcttctgttt tgtattatct ctgacataac aaacccagga accatctggc ttctgtgtaa 18421 aacctgcctc tgccaggtag gaaagtggag aagggctttg cttacctacg gctgttggca 18481 gcaaattttc cacggtatcg gaagaacaaa gattttgttg tgttgttgtt tgttggggag 18541 tcgttggagt ttccagcgct cttgcaacgt tggtagaggc tgctaatgca gacacagaca 18601 ggctgaatac actcagcaaa cggataatga cattcatggg acttgtgaca gtgtatagct 18661 caaaaaaagt agtgttcccg agtcaaataa tagggttata agactttttt ctcagatgtc 18721 atgctcgtaa catatgaaat gctagggtga actaggttgt aaatcttcac atggaacatg 18781 agcatctatt atttctgtat tatttgtgtg acagtgtctt tttattatgt gaataagggc 18841 aaagcttgct aatttgcaag aatgtgccaa aaataagcta attgattagc agagattgct 18901 tatcaggagg ttcatacatt gcactaaggc atataagttt gataaaagac tgggctatgt 18961 gtttcttctt ttttgacgtc aaagctaaga tcggtgttcc ctaataacct ttggataagt 19021 gcatgggtag tggatttaca agcgcttaac gcctgctatt tcaaatcatt tatgttttat 19081 ttgtaacaaa tagtacttgt ctatatcttc taacctagga aagattatta gttatcaatg 19141 aaggatgaat tgtaaatttt atgtctaagt agaaaaagaa attcgtttca agatcatcgt 19201 gcactaattt ttgagagtac tcaatcattt tacaagcctg tgacgaccat cttatgagta 19261 aagatgaatg 
gacaaataca gacagctatt atcaatgcac ttatatctag ggatagaata 19321 tatcatctaa gaaaattgat aatcaaccgg acaaattttt ggttacattg aatcaagttc 19381 ttcatcttgc tttcagtgtc cgttgtgaaa taccctatgg cacttctagc cgaactctgt 19441 aaaaatataa gtctaggttt tcagcagacg aaagattttc ttaaccagag tgttaactct 19501 ttcacgacat cagcacaaca gattggggag tcttggcagg aaagagcatc tcaagccact 19561 catcggactg ttgatacagt gacgacaacc cttgagcaag ctaaagcttc tgttgaacaa 19621 acgttgcaat cagcagacaa agtaaagaat acaacatcgg tggcggttca aacagccatt 19681 tcttctgcta tgagtcattg gtttgagcaa catcctacac ttttgcggct ggttcaaatt 19741 ctcggttggg cggctaacca cccaatcatc agtttagtga ttctggtttt tgcccttgct 19801 gttatttgga gcctcatcaa agcaattggt cgtatagttg agtcagccag tttatcagtt 19861 ttgcaagttc ctttgaaatt actccaagct gttgcaaagt tcagctttgt atcatttacc 19921 aaagttggta gcttggtctt taagcaactg accgatacga aaaccactga tagtatacca 19981 acggtactac ctacaatctc tcctattcac aaggataacc aacaaagatt ggtagaaatt 20041 tctactcgac tagaggaaat tcaacaagaa cagaatcagc ttttacaaga agctgcagag 20101 cttttggctt cagaaaaaat tgatagagaa gcacaaatgc agcactatat caaataaagg 20161 gtagcagata aggctgaacg actttgaaag tttatagcac tctcaacact tttatactta 20221 agtgggataa agt // LOCUS NODE_1634_length_19804_cov_5.10289119804 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 19804) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 19804) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage       :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..19804
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(<1..562)
                     /locus_tag="DP116_14445"
     CDS             complement(<1..562)
                     /locus_tag="DP116_14445"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877105.1"
                     /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="fused response regulator/thioredoxin-disulfide reductase" /protein_id="PRJNA477356:DP116_14445" /translation="MAKPVIITVDDDPEVLQAVARDLRQEYGDRFRIIRADSGASALE ALEQLKLRNQPVSLFLVDQRMPHMSGVEFLEQAMSMFPDAKRALLTAYADTDAAIRAI NNTKIDYYLMKPWDPPEERLYPVLDDLLDDWQATFHPPFEGIRVIGNRWSPHSHQVKD FLARNQVPYQWLDIELSEEAQKLVEYA" gene complement(608..1003) /locus_tag="DP116_14450" CDS complement(608..1003) /locus_tag="DP116_14450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877106.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bleomycin resistance protein" /protein_id="PRJNA477356:DP116_14450" /translation="MKVSKSYTRLLVKDWKACFLFYKDVIEFDIAVEDQEAGYAEFKA GDMRLAVSHRQEMAQLIHNAEKPAHAECQDTVVLIFTVHDLEEEYQRLRHKGVEFTAA PMNNPYYGIKTAYLRDPDGTLIGLYEFLV" gene complement(1233..1982) /locus_tag="DP116_14455" CDS complement(1233..1982) /locus_tag="DP116_14455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186584.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_14455" /translation="MAQKLNGKVALITGASSGIGEATALAVAAQGAKVALAARRQDRL EKLVKQITDNGGQAFSIQTDVTDETQVTEMVQKTKTHFGSVDILVNNAGVMLVAAVEG ADTSDWRRMIDINLLGLMYATHATLPLMKAQGGGHIVNISSVAGRVAIPDYAVYNATK FGVGAFSEALRKEVSKDKIRVTVIEPGGVATELANHITNLESKQKIEEILQSTTVLES EDIAAAIVYAVTQPPRVNVNEILIRPTEQEL" gene complement(2014..2727) /locus_tag="DP116_14460" CDS complement(2014..2727) /locus_tag="DP116_14460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015189419.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="short-chain dehydrogenase" /protein_id="PRJNA477356:DP116_14460" /translation="MNSQFQGKYALVTGGNKGIGFAICKGLLQSGFEVIVAARSLSAA KTAAEKLQSVNSKVRVVELDIADDQSIDQAVKHLSQEIPQLDVLVNNAGIYPDEGVNI LTISRELLNQTMNTNAFGPIRTSQAFLPLLEKAPQARIINVSSGYGRLNGLSFDVPSY CLSKLTVNGATIMLADALQAKGIAVNAIDPGWVKTDMGGTSAPRSPEQGADTAIWLAT EASVNLSGKLFRDRREISY" gene complement(2724..2951) /locus_tag="DP116_14465" CDS complement(2724..2951) /locus_tag="DP116_14465" /inference="COORDINATES: protein motif:HMM:PF13561.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14465" /translation="MELGERGITVNTVSLEPTETEMYANSGDEPKAAAAQSPFNRLGK TEDIADVVAFVMSDQCRWITGQTFQAGGGYV" gene complement(2971..3588) /locus_tag="DP116_14470" CDS complement(2971..3588) /locus_tag="DP116_14470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010993899.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-phosphoglycerate kinase" /protein_id="PRJNA477356:DP116_14470" /translation="MQNETRVILIGGSSHAGKSTLGRSLAAKLGWSYRCTDKLARHPG RPWVSANGKVFCEYVAEHYRTRSVDTLFLDVLSHYEKNVLPQIEAIVHSHAFDLSTEY LILEGSALWPEFVANLVGENGVKAIWLTASDQLLGNRIKRESNFYNVGEDEKHLIQKF LDLTLFYNKRMREKVERLGFICIDVESVSTTDELSNKCMELMEVS" gene complement(3876..4592) /locus_tag="DP116_14475" CDS complement(3876..4592) /locus_tag="DP116_14475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874970.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="short-chain dehydrogenase" /protein_id="PRJNA477356:DP116_14475" /translation="MSQSKVAVVLGVGPGLGSAVAHRFAREGFAVGLMARNSQQLSQI QSEIEQSGAKALSVTVDASDPASVKAAFEEVSSQLGAPEVFVYNAGAFQRAGILELTP EQFESCWKVNCFGAFLAVQQVLPAMVERRRGTILLTGATAAVRGSAKFAALAVGKFGL RALAQSLAREFCPQGIHVAHIIIDGMINTPRVRAMASDAEEDTLLAPEAIAQTYWQLY QQDATAWTLELDLRPAVEKF" gene complement(4607..5800) /locus_tag="DP116_14480" CDS complement(4607..5800) /locus_tag="DP116_14480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745972.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_14480" /translation="MQPTSTNKQPKSPKTLYMDTNFYIICAVSLIAVLGVASVSPAFP RLAHELGVNPTNIGLLITFYTFPSLVFGPIIGVLADRLGRKKVIIPSLFLFGIAGTAC AFARDFNLLLLLRFLQGIGAASLLSLSITLIGDLYTADRRTTAMGYNASITSIGTASY PLIGGTLATFGWYYPFMLPIIAIPIGLLVLFALKNPEPQGEYNLKEYLSNAKKVLKNR QILGLYIASAANFFLLYGAHVTYLPHLIKETFKAPPYTIGILLSTVSVAIMISASQLG RLTRTFKATTLIRASFILYALALFTVPLIQNIWLLLIPTTIFGLGLGIGFPSIQTLLT EISPKQYLATILSVNGTFYGLGQTIGPLVMGVVFGFAGMSGVFYVGVAFAILTLIVFR YCTCL" gene complement(5891..6544) /locus_tag="DP116_14485" CDS complement(5891..6544) /locus_tag="DP116_14485" /inference="COORDINATES: protein motif:HMM:PF07589.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14485" /translation="MILSSLKVWMVPLALTLVSVCSDAARATAQTIYPFTGYYRTTVN ITPIAGDVSQVFEVGVSDDAPYGLELYEGLTYSVLDANGNLTFNNNPEAFGIQGFPLG YIQFGSGTNKLFGTSDASAAVNFETLTAKGSGIVNITGGEGIFENATATLLFSEDDIV NLGQNITLNGLALVTGPIEVPQKVPEPTASTTLVGIGLMGAGFLLRQRRLGSANKKS" gene complement(6752..7858) /locus_tag="DP116_14490" CDS complement(6752..7858) /locus_tag="DP116_14490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314905.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alkene reductase" /protein_id="PRJNA477356:DP116_14490" /translation="MTSDINLFTPVQLGPYTLPNRMVMAPMTRLRAIDNIPNSLMATY YTQRATAGLIVTECTMVSPLSLGYINCPGIYSPEQVDGWRQVTDSVHEKGGRIFLQLW HSGRVSHPSLLGGQLPVAPSAIAASGQLHTPIGKVAMETPRALETHEIPEIVEQFRKG AQNARAAGFDGVELHGAFGYLIDQFLQDWTNQRTDEYGGSIENRARFLLEVVEAVASV WGANRVGIKLSPSNTFYGMGDSNPQEIFAYAINALNRFGLAYLHLMEPNEGDLATRDV MNPVTPYFRKIYKGTLITNGGYDHAKGDTILANGDADLVSFGKLFISNPDLPKRFELD ASLNQPDVKTFYAPDEKGYTDYPFLELQSSNLLH" gene complement(8026..8664) /locus_tag="DP116_14495" CDS complement(8026..8664) /locus_tag="DP116_14495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314906.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="antibiotic biosynthesis monooxygenase" /protein_id="PRJNA477356:DP116_14495" /translation="MPTIAKNNDVITVIIIFAIEPERQQELIDTIIEFLETTVKHQPG FVSSSIHKSIDGVRVMNYAQWKTLEDYQAFINNSEVQAKGAKLFQFQIHESHVYEVVV SKPDDTTLKISKGGLIHLAEFRVKPENQMRLVELEREYVGVGLQNPGLLSANFHRSLD GVHNVNYGQWRSFADFEELLKDPKYKPLNEYWQGLAENEFHLYEVVYTQPSN" gene complement(8925..12098) /locus_tag="DP116_14500" CDS complement(8925..12098) /locus_tag="DP116_14500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456972.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydrophobe/amphiphile efflux-1 family RND transporter" /protein_id="PRJNA477356:DP116_14500" /translation="MFVDFFIKRPVFTSVCAILILLVGAVSIPTLPTAQYPEISPVQV NVTANYVGASAEVVENTVTTVLERQINGVEGLKYMTSSSSNNGSSTIRVTFDASRNKD IAAVDVQNRVSLAEPQLPEPVQRTGVTVSKQSSNILLAIGLYSEKNEFNNVFLSNYAD LYIVDALKRISGVSEARIFGERRYAMRLWLDPNRLASRNLTAQDVINALNEQNIQVGA GQIGQQPAPKDQMYQIDLQALSRLKEPSEFEDMILKTDQNGTLIKLKDVGRAELGAEN YGSFLRFRAKEGVGIGIFPTPGSNALDVAKAVKLEIARLAQDFPPGMDYQVAFDTTLF VEASLSEVFKTLFEAIVLVVIVIFIFLQDWRTTLIPVIVIPLTLIGTFAFVKVFGFSI NTLTMFGLTLATGLVVDDAIIVVEDITRLMEDEEMSPRQAASAAMGELFGALIATSLV LMAVFVPVAFFPGSTGQIYRQFALTIAFSIAISTFLALTLTPSLSALLLRRGQKPGGV LGWVFTKFNNFINWTRRKYERTLYRLNRITAIIVLLFILSLGLTGWLYTRVPQAFLPE EDQGYFITIIQGPEGVSLNYTSKVMSQVEAEILKLPEVVGTFAIGGFGFSGSTANSGA IFTTLKPWDERHEASQSAQGIIKNLAGKFSTITEARILPVNPPAIQGLGSFGGFQFEL QDRKGNSGLNTMLQVMGQLLQRGNQTPGLQAVFSTFSANTPQLLIEVDRNKAKALQVN VGDIFNTLQSYLGSRYVNDFNYLQRTYRVYVQADTQFRSNPADIGLLYVRSANNQMIP LSNLVKVTSNTGAQTINHYNLFRSIEINGAAAPGFSSGQAIQAMQQVAKQVLPAGFGY EWSGVAAEEQQSGGQAPLIFGLGLVFVFLVLAAQYENYVDPLIIMLAVPLAILGALSA QSLRGLANDVYCQIGLVMLIGLASKNAILIVEFANQLHEQGLSITKAAIQASQERLRP ILMTSFAFILGIEPLVFPEGAGAASRKSLGTAVVGGMLVSTFLSLFVVPILYIVIGKI RDRLTRRDKSPRLQPVEDGRVPTETRR" gene complement(12242..13606) /locus_tag="DP116_14505" CDS complement(12242..13606) /locus_tag="DP116_14505" /inference="COORDINATES: similar to 
AA sequence:RefSeq:WP_011320534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="efflux transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_14505" /translation="MKSPEPQTHLEDSRQDAQLEEPPRKQRRWLWLLLALLLAGGGFA LWRWLAPQNKAPATANAQPPAARVKISTVQAGMIEDSQQYLATVESRRSVTLQPRVQG QVAQVFVKYGDTVIAGTPIIQVDARQQQAAVGSVDAAAEVAQSQLENTRANLRSLEAQ RLSKLADLKLNQQDYERYTSLASQGAVSRQTSEQYANKLATSLAALGQTEAQIKAQQA TINEAEKSLKQAQSNIKQQQVQLQYYKITAPFPGTVGNIPVKVGDFVNTSTQLVTITQ NQPLELNISVPVERGSQLRKGTPVEVMDAQGKSIGMSRVFFISPNASNNTQSLLVKAL YDNSKNELRTDQVAYARVIWSQRPGVLVPTTAVTNLAGETFIYVAVPAPSKPPQGEKP QAEKTQGNQQRTFQLVARQKRVKLGNITGNNYQVLQGLEPGDRIIVSGLLNLRDGVPI IPES" gene 14266..14691 /locus_tag="DP116_14510" CDS 14266..14691 /locus_tag="DP116_14510" /inference="COORDINATES: protein motif:HMM:PF01471.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14510" /translation="MAITGTPTPERQTAQGYRIPDYSKNNYKELMNILHGFGYVNPED QSNDVDKKALIAFQKYMKITADGVYGPKTQEKLAEAMRILHGNLNTVLKTNFSFNEPF YGPKTTDAVKQFQHKYNVGAPAGEANLVTRQKLAEQASK" gene 14881..15252 /locus_tag="DP116_14515" CDS 14881..15252 /locus_tag="DP116_14515" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14515" /translation="MQGKKRHFQAIGLSVSLFAASAIGLLASHSVVQAKKPVPRPTVA TVKSMVNGDLMCYVNLVDEKGKQYNSVGASFEICANEKRFLNKKVQLSYSQASVNDCQ SAEPCGKSRIETLITKMKIIR" gene complement(15522..>19804) /locus_tag="DP116_14520" CDS complement(15522..>19804) /locus_tag="DP116_14520" /inference="COORDINATES: protein motif:HMM:PF00501.26,HMM:PF00975.18,HMM:PF08659.8" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14520" /translation="IIYIQRDGNELFQSYEDLLTQAQRINAGLKKLGLQPQDKVILQI KNPQDFIPAFWGCILGGFVPVPLAIAPTYSQVNNAVNKLHNAWQMLEKPIILASIELV EAIASLSGLLGLENFQVESCDRLHEYDPDPSYHLSQPDDLAILLLTSGSTGMPKGVML SHWNILSSVAATSQVSQLTQEDISLNWLPLDHPGPLIRCCIRCVFLGCEQIHAPTDVV LHQPLTWFDLLERYQVTTTWSPNFAFTLVNEQSDEIKQRKWNLSAVRSLLNTAEPIVP QTAQRFWELFQPHQLSASAMHSSWGMAETSSGVTFSDKFLSENPTSIFAELGLPIPGV SLRIVSSENQVLEEGTVGYLQVKGESVTQGYYNNSQLNGEVFTTDGWFATGDLGFLQA GRLTITGRQKDVIIINGANYYSHEIEAVVEKIPDIEVSYTAAVSVERDKLAIFFSSPL ADNERLLREKLEEIQQQVVSQIGIKPDYLLPVDKTAIPKTSIGKIQRSQLCQQLETGE FNSILKKVDILLENAKTIPDWFYRQVWQTKQVRRLTVSTGPTLVFIDPLGLGDYIGEQ LTQENQSCIKVSVGSVFAKISDDSYSINPERAEDYQLLLESITAKQKYIEQIIHLWNY SKYKGEISSLAELETAQNTGVYSLLFLVQALAKVQGEQNPVKLLFISSHTQLTETTEE IAYEKSSVLGLLKTIPQEMPWLNCRHIDLYVDLVEVNGDYILAEMQDITGEREVAYRK GQRLVPRLLRVNWESEFKQPLPFKTGGSYLITGGLGGIGVEIAKYLLENYHARLLLVG RTPLLAEKLAAFEQLQQLNGTVVYETVDICDRPGMEVIISALGELDGIIHLAGVARES LIESETQTGLATALRSKVSGTWVLHQLIKDKPDSIFINFSSINGFFGGTTVGAYAAAN SFLDAFSDCQRTRSSLQSYCFAWSMWDETGMSRGYEMKQLSRDRGYYAIEKQQGIYAF LAGMSLEPTRLTIGIDGNKANIQRYQSECDSCEKLIAYFTGTVPAIPQTLCDRFGTPT TCQLQLVEEMPLTDSGEIDLHKLVGKIIPHKGERVAPRNDVEREIAKIWEDVLGVENV GIYDNFFQLGGHSLRALQVMYRLQNKFSVDLPLQNLFKTPNIAGLAATIGQQHQEQPF SCLVPIQPKGNRSPFFCVHPVGGNVLCYADLARHLGDEQPFYGLQSVGLNGEQEPMTC VEDMATHYIKAIQTIQPTGPYYIGGWSLGGAIAFEIAQQLSVSGQEVALLALLDSYTP KVIDQVSEQRKRILSQKSSVRVLDDATLASYYFAQDLGSLLGKELVVSLAQLQELELD 
AQLTYILEQAKTAIVLPPELELAQIRRLFQVFQANIQARLRYRPQPYTGSIVLFCASV QPEEMTADSSGGWDTLAIGGLEKHLIYGGHYQLVKSPKLAEKLNQYIT" BASE COUNT 5569 a 4393 c 4180 g 5662 t ORIGIN 1 cagcgtattc taccagtttt tgtgcttctt ctgataattc gatatccaac cactggtaag 61 gcacttgatt tcgtgcgagg aaatctttca cttgatggga atggggcgac caacggttac 121 cgatgacgcg aattccttca aacggtgggt ggaaggttgc ttgccagtca tcaagtaaat 181 cgtccagcac tggatacaag cgttcttctg gtggatccca cggcttcatc aagtagtaat 241 cgatttttgt gttgttgatg gcacggatgg cggcgtcggt atctgcgtag gcggtgagta 301 aggcacgttt tgcatccggg aacattgaca tcgcttgttc caaaaactcc acgccggaca 361 tgtggggcat ccgttgatcg acaaggaaca gcgatactgg ctgattgcgg agtttgagtt 421 gttccaaggc ttccagggca ctggcaccag agtcagcgcg gataattcgg aagcgatcgc 481 catattcctg ccgcagatct cgcgccacgg cttgcaaaac ttctggatca tcgtctaccg 541 ttatgatgac aggtttagcc atagcgcttc agtgacctcc cttgtttaat tcctggtttt 601 ttatatatca cacaagaaac tcatataaac caatcagagt tccatctgga tctcgaagat 661 aagctgtttt aattccgtag tagggattat tcataggcgc tgcggtaaat tctacccctt 721 tgtgtctgag tcgctgatat tcctcctcta aatcatgcac agtaaaaatc aatacaacag 781 tatcttgaca ttctgcatga gcgggctttt ctgcattgtg aatcaattgt gccatttctt 841 gccggtggga tacggcaagt ctcatatctc cggctttaaa ctcagcatat cccgcttctt 901 gatcttctac agcaatgtca aattcgataa catctttgta aaacaagaaa caagctttcc 961 aatcttttac aagcaatctt gtgtatgatt tactaacttt catagacttt gtacctcttt 1021 tctttgcgcc ctttgtacgc caggtgcttc aagtcggcgg agccgcccaa cgcactggct 1081 cgtctttgcg gttttattcc tttattaatt attgatgaca actggaatac ggtggttatt 1141 tattgggtga aagacaggtt tttgttgggg gacaacattg tacccagtga aggtgttgca 1201 tctagaggaa gttctccgat actcctacac tgctacaatt cctgttcggt tgggcggatt 1261 aatatttcat tgacattaac acgaggtggc tgagttacgg catagacaat cgcagcagca 1321 atatcctcgc tttcgagtac cgttgttgac tgaagaatct cttctatttt ctgcttagac 1381 tctagattag taatgtggtt agctaattcc gtagccaccc cacccggttc aatcactgtt 1441 acacggattt tgtctttaga aacctctttc cgtaaggctt cactaaacgc tcctactcca 1501 aattttgtgg cattatatac cgcgtaatcg ggtatagcta cgcgaccagc tacagaggaa 1561 atattcacaa 
tatgtcctcc gccctgagcc ttcattaagg gtagggtagc atgagtagca 1621 tacatcaaac ccaacaggtt gatgtcaatc atgcgccgcc aatctgaagt atctgcaccc 1681 tcaactgcag cgacaagcat cacgccagcg ttattgacca ggatgtctac actgccgaaa 1741 tgagttttgg ttttctgtac catctcagtc acttgggttt catcagtaac atctgtttgg 1801 atagaaaatg cttgtccacc attatcagta atttgcttga ctaacttttc tagtcggtct 1861 tggcggcgtg ctgctaaggc aacttttgct ccctgggctg ctactgctag ggcagttgct 1921 tctccaatgc cagatgaagc accagtaatt agcgcaactt ttccatttaa tttctgtgcc 1981 atttttttct cctacgtgga acttataaac gcttcagtag gaaatttccc ggcggtcgcg 2041 aaacaacttt ccacttaaat ttactgaagc ttcggttgca agccaaatag cagtgtcagc 2101 cccctgttca ggggaacggg gagcagaagt cccgcccata tctgttttta cccagccagg 2161 atctatggcg ttgactgcaa tgccttttgc ttgcaaagca tctgccagca taatggttgc 2221 accattgacg gtaagtttag acaagcaata actggggaca tcaaaagaaa gtccattcaa 2281 ccgtccgtag ccactggaaa cattgatgat acgggcttgg ggtgcctttt ctagtagggg 2341 taagaatgct tggcttgtgc gaataggtcc aaaagcatta gtgttcattg tctgatttag 2401 caactcacgg gaaatagtga gaatgttcac accttcgtca ggataaatgc ctgcgttatt 2461 taccagtaca tccagttgag ggatttcctg acttaagtgc ttgacagctt ggtcaatgct 2521 ttggtcgtct gcgatatcaa gttcgacgac gcgcacttta gagttgacag attgcaattt 2581 ctctgcagca gttttggctg cagagagcga tcgcgctgcc acaatcacct caaacccaga 2641 ctgcagcagt cctttgcaaa tagcaaaacc aatgccctta ttcccgccag tcacaagggc 2701 gtattttcct tgaaattgtg aattcatacg tatcctcctc ctgcttggaa agtctgtccg 2761 gtaatccatc ggcattggtc gctcataaca aaggcgacga catctgcaat atcttctgtt 2821 tttcccagac ggttaaaggg cgattgtgcc gctgcggctt ttggctcatc cccagagttg 2881 gcatacatct cggtttccgt tggttctagt gaaactgtgt tcactgtaat tccgcgctca 2941 ccaagttcca ttaaggagat tttttgttat ttacgaaacc tccatcaatt ccatacattt 3001 gtttgaaagt tcgtctgtcg ttgacacaga ttctacatca atacatataa atccaagacg 3061 ctcaaccttc tccctcatgc gtttattgta gaatagagta aggtctagaa acttttgaat 3121 taagtgcttt tcatcctcac ctacattata aaaattactt tcacgcttga ttcggtttcc 3181 caagagttgg tcgctagccg taagccaaat cgctttaaca ccattttcgc caactaaatt 3241 cgctacaaat tctggccata 
acgcagatcc ttcaagtatc aggtattccg ttgataaatc 3301 aaacgcatga gaatgaacaa tagcctcgat ctgcggtaac acatttttct cgtaatgcga 3361 caatacatct aaaaaaagag tgtcaacaga ccgagttcta taatgttctg ctacatactc 3421 acagaatacc tttccatttg cactcaccca aggacgaccg ggatgccgag cgagtttgtc 3481 cgtacaacga taactccaac caagcttcgc cgccaaagat cgtccaaggg ttgatttgcc 3541 tgcatgagaa gagccgccaa ttagaatcac tcgtgtttca ttctgcataa acgcgaatca 3601 aaacttcgtt tgctgctggt tccggttgtg gtgcatcttc ataaactagg gtttctgtct 3661 accgtgagta tgaatacgta ctgcttttat tgttgccatt ttatgttctc cctgtgaagt 3721 cagtttgcaa tggattcaag ttcatgactt cgtttccttt ttgataagaa agctgcaaat 3781 caaaccaaac cacataaaag agattggcaa atgtttcaag ctgctgtctt ttctgttctt 3841 gcgaaaaatg ttctcatctt gctaaaactt gaaaatcaga atttctctac agcaggtcgc 3901 aaatcaagtt ccaaagtcca tgctgttgca tcttgttgat aaagttgcca atatgtttgg 3961 gcgatcgcct ccggtgccag taatgtatct tcttcagcgt ccgatgccat tgctcgcact 4021 cttggagtat taatcatgcc atcaataata atgtgagcta cgtgaattcc ttgaggacaa 4081 aactcccgtg ccaacgactg cgccaatgct cgtagtccaa atttgcccac tgccaatgct 4141 gcaaatttag ctgaacctct cacagcagca gtggctcctg tcaacagaat agtaccgcga 4201 cgtcgctcaa ccattgctgg aagaacttgc tgtaccgcta aaaaagctcc aaaacagttc 4261 actttccaac acgactcaaa ctgctctggt gtcagttcca ggataccagc cctctgaaat 4321 gctccagcat tgtagacaaa aacttctggt gcacctagtt gtgaggacac ctcctcaaaa 4381 gcggctttta ctgatgcagg gtcacttgca tcaaccgtaa cagacaatgc tttcgcgcca 4441 gactgttcaa tttctgactg aatttgacta agctgctgtg agtttcttgc catcaatcct 4501 acagcaaagc cttctcgtgc aaaccgatga gcaactgcag atcccaatcc cggacctaca 4561 cctaaaacaa ctgcaacctt tgactgactc atcttatctc cgtaaattac aagcaagtgc 4621 agtatctaaa cacaataagg gtaagaattg caaacgccac acctacataa aacacgccgc 4681 tcattcctgc aaaaccaaag actacaccca taacgagtgg accaatagtt tgccctaaac 4741 cgtaaaatgt tccattgaca gatagaattg tagcaagata ttgttttggt gatatctctg 4801 tcaaaagagt ttggatacta ggaaaaccaa tccctagacc aagaccaaaa attgtcgttg 4861 gaattaatag taaccagata ttttgtatta aaggaacagt gaatagagct aaagcataca 4921 aaataaaaga tgctctaatt aaagttgtgg 
ctttgaatgt tctagttagt cttcccaact 4981 gagatgctga gatcataatt gccacagaaa cagtagaaag taagattcct attgtgtaag 5041 gtggtgcttt gaatgtctct ttaattaagt gtggtaaata agtgacatgc gctccataca 5101 atagaaagaa gttggcagca ctagcaatat aaagtcccaa aatctgacga ttttttagaa 5161 cttttttcgc attactcaaa tattccttga ggttatattc gccttgaggt tcaggatttt 5221 ttaatgcaaa taagactagt aacccgatag gaatcgctat tatcggtagc ataaaggggt 5281 aataccagcc aaatgttgcc agtgtaccgc caatgagggg ataacttgct gtgccaatac 5341 tggtgatact ggcattgtaa cccattgcag tagtacgtct atctgctgtg tacaaatcgc 5401 ctattaaagt gatactaaga gacagcaaag aagcagcacc aattccttgc agaaaacgca 5461 acaacagcag aaggttaaag tcgcgggcaa aggcacaagc agtgccagca attccaaata 5521 aaaacagtga aggaatgata actttctttc tgcccaatct atcagccagg acaccaataa 5581 taggaccgaa aaccagagat ggaaacgtat aaaatgttat caataaacca atatttgtcg 5641 ggttgacgcc cagttcatga gctagcctgg gaaaagccgg agacacgctg gcaactccca 5701 aaacagctat caaagaaact gcacaaataa tgtaaaagtt tgtatccatg tacagagtct 5761 ttggggattt cggctgctta ttagtgcttg ttggttgcat aaaatttatt tttttatatt 5821 aaatttatta tgaattcaat tgtgggcatg gatgaacaaa ctaccatgcc cgacaagacg 5881 ttatgatgtc ttatgatttt ttattagcag aaccaaggcg gcgttgacgc agcaagaaac 5941 cagctcccat taagcctatg ccaacaagcg tggtggaagc tgttggttcc ggaactttct 6001 ggggaacttc aataggacca gttactaaag ctagaccatt gagggtaata ttttgtccca 6061 gattgactat gtcatcttca gagaacaaga gtgtggctgt agcgttctcg aatatgccct 6121 caccgccagt gatatttaca ataccagaac ctttggctgt gagggtttca aagttaactg 6181 cagcgctagc atcactagtt ccaaacaact tgttcgtacc gctaccgaac tgaatgtagc 6241 ccaatggaaa accttgtatg ccaaatgcct ctggattgtt gttaaaagta agatttccat 6301 tggcgtcaag cacggagtat gttagaccct cgtaaagttc taaaccatac ggggcgtcat 6361 cactcacacc tacctcaaat acttgtgaaa cgtcgccagc aatgggcgta atgttaactg 6421 ttgtccgata atagccagta aatggataga tggtctgtgc tgttgctctt gctgcatctg 6481 agcaaacact aactaaagtg agggctagtg gaaccatcca caccttcaat gaactgagta 6541 tcattagcaa tcctcaaaga tttaaaaata ctgtgtttga cattgagttg aattttctct 6601 ttgccagata ttttcatagc ttgtcggcac tacagacaat 
tactgccgta atgtatcatt 6661 tttacaatat cacgtttaac aactgtaaaa atggcaagtt tccctgtcaa agatacgaga 6721 gagtaagtgt ggaatcgtgc agccttcaag actaatgcaa aagattacta gattgcagtt 6781 ccaaaaaagg gtaatcggtg tatccctttt cgtctggtgc gtaaaatgtt tttacatctg 6841 gctggtttaa agaagcatcc agttcaaagc gttttggtaa gtcgggatta gagataaaca 6901 acttaccaaa ggaaaccaaa tctgcatcac cattggcaag tatggtatct cccttggcat 6961 ggtcgtaacc gccattagta attaaagttc ctttgtaaat tttacggaag tagggtgtta 7021 ccgggttcat cacatcgcgg gttgccaaat caccctcatt cggttccatt aagtgaagat 7081 atgctaatcc aaagcgatta agagcattta ttgcataagc aaaaatttct tgtggattcg 7141 agtcgcccat cccgtagaat gtattactag gtgaaagttt gatcccaaca cgatttgcac 7201 cccacacact ggcaactgct tccacgactt ccagcaaaaa tcgtgctcga ttttctatcg 7261 aaccaccgta ttcatcagtg cgttggtttg tccaatcttg caaaaactgg tcaattaagt 7321 aaccaaatgc cccatgcaat tccacaccat cgaatcccgc agcacgagcg ttttgtgcac 7381 ctttgcgaaa ctgctcgaca atttcaggaa tttcatgtgt ttctaaagca cgaggtgttt 7441 ccattgctac tttacctatc ggagtgtgca gttgccctga agcagcaatt gcagaaggtg 7501 caactggtaa ttgtccaccc aataaagaag gatgagaaac cctgccagag tgccacagtt 7561 gtaagaaaat tcttccccct ttctcatgta cggaatctgt cacttgccgc catccatcta 7621 cttgttctgg tgagtatata cccgggcaat taatgtagcc taaactcaga ggtgaaacca 7681 ttgtgcattc agtaacgatg agtcctgctg ttgcccgttg cgtgtagtaa gtcgccatca 7741 atgagttggg gatgttgtca atagctcgca aacgggtcat tggtgccatc accatccggt 7801 ttggtagagt gtaaggacca agttgaacgg gagtgaaaag attgatatcg gaagtcatat 7861 aaattaccat tactccatta ttcttgttga cctcgttccc gggctggagc ttgggaacgc 7921 ataactgtgg gctgctgcct tagctgttaa catatagagg ctgagcctct aagatttcat 7981 tccttggctg agccaaggaa cgaggaaacg aggaaagcaa acaaattaat tactaggctg 8041 agtgtagacg acttcgtaaa gatgaaactc gttttcagct aaaccttgcc aatactcatt 8101 caagggctta tattttgggt ctttgagcag ttcttcaaaa tctgcaaaac tgcgccactg 8161 tccatagttc acattatgca cgccatcaag cgaacggtgg aaattagcag aaaggagtcc 8221 tggattttgc aaaccgactc ctacatattc cctttccaat tccactaagc gcatctgatt 8281 ttctggcttg actcggaatt ctgcaaggtg aatcaaaccg cctttagaaa 
ttttaagtgt 8341 agtgtcatct ggtttagaga caaccacctc gtaaacgtgc gactcgtgaa tctgaaactg 8401 gaaaagttta gctccttttg cttgcacttc ggaattgttg ataaatgctt ggtagtcttc 8461 cagcgttttc cattgggcat aattcatcac ccggacgcca tcaatacttt tgtggatgct 8521 agatgaaaca aaaccaggct gatgcttcac cgttgtttca aggaattcga tgatagtatc 8581 aataagttcc tgctgacgtt ccggttctat ggcgaagata atgatcaccg taatgacatc 8641 gttgtttttg gcaatcgtag gcatcgcttt ctctcctgat gtgttggtga atggcaatta 8701 attggaatca cgcaaaattg gaatttgaaa gcatcgtacc ttgtgtcatg cactactgct 8761 actagccatt tttctaggtt gttaatcaca aaagttatgc gaaacagaca tattgtctga 8821 cacaggcgtc gttgcttgtt atctgaatga ttggcacaca atgtttcatg atttcttgac 8881 acgaaaaaca ataggtgggg tacgtcccca cctactcagt taatttaccg acgagtttca 8941 gtaggaactc taccatcctc aacgggctgt agccgtggag atttatcgcg tcgtgtgagg 9001 cgatcgcgaa tttttccaat cactatgtac agaattggca caacaaacag acttaagaaa 9061 gtagagacaa gcatcccacc aacaacagca gtaccaaggg attttcgact agccgcccct 9121 gctccttctg gaaaaactaa tggctcaata cctagaataa aagcaaaaga agtcatgaga 9181 attggtcgta agcgttcttg tgaagcttgg attgctgctt tggtaatcga aagcccctgc 9241 tcgtgtagtt ggttggcgaa ctctacaatc aaaattgcgt tcttacttgc caaaccaatc 9301 agcatcacta aaccgatttg acaatagaca tcattagcta gaccccgcaa cgactgtgcc 9361 gacagtgctc ccaaaatagc cagagggact gcgagcataa taatcaaagg gtcaacatag 9421 ttctcatact gagccgccag cactaggaat acaaagacaa gtcccaagcc gaaaatcaga 9481 ggtgcttgac cgccggactg ttgctcttca gcagcaactc ccgaccattc ataaccaaag 9541 cctgctggta aaacctgctt cgccacttgc tgcatcgctt gaattgcttg ccctgagcta 9601 aaaccaggag cggctgcacc gttgatttca attgagcgga acaagttata gtgattaatt 9661 gtttgcgccc cggtatttga agtcactttt accagattgc tcaagggaat catttgattg 9721 ttagcagaac ggacatacag taaaccaata tccgcaggat ttgagcgaaa ctgagtatct 9781 gcttgaacat acactcggta agttctttgg agataattaa aatcgttgac gtagcgtgaa 9841 cctaagtaac tctgtagcgt attaaagata tcgccaacgt tgacttgcag cgccttagct 9901 ttgttgcggt ctacttcaat caaaagctga ggcgtatttg cggaaaaagt gctaaataca 9961 gcttgcaatc ctggtgtttg attgccccgt tggagtaact gacccatgac ttgcagcatg 
10021 gtatttaaac cactattgcc ttttctatct tgtagctcaa attggaaacc accaaaactg 10081 cctaaacctt gaattgctgg cggattaact ggcaaaattc ttgcttctgt aatcgttgag 10141 aacttgcctg ccaaattttt gatgattccc tgtgctgact ggctagcctc atggcgctcg 10201 tcccaaggtt tgagtgttgt aaaaattgca ccactgttgg cagtgctacc actaaaacca 10261 aagccaccaa tcgcgaaagt tcccacaact tcaggcaatt tgagaatttc tgcttctacc 10321 tgactcatga ctttgctggt gtaattgagc gaaaccccct caggtccctg aataatggtg 10381 atgaaatagc cttggtcttc ctcgggaaga aatgcctgag gcacgcgggt gtacagccag 10441 ccggttagcc ctaaagacaa gataaacagc aacactatga ttgctgtaat acgatttaag 10501 cggtagagag ttcgctcata ttttctgcgc gtccaattaa tgaagttatt aaactttgta 10561 aatacccagc cgagcacacc gcccggtttt tgtccgcgac gcagtagcag ggctgaaagg 10621 gaaggagtta gagtaagggc aagaaaagta gaaattgcaa tagaaaatgc tattgttaaa 10681 gcaaattggc gataaatttg ccctgtagag cctgggaaaa aggcaacagg cacaaaaact 10741 gccattagca ctaaggaagt ggcgatgagt gccccaaaaa gttcacccat tgctgcggag 10801 gctgcttgac gcggtgacat ttcctcatcc tccatcaggc gagtaatgtc ctcaactacg 10861 ataatcgcgt catcaaccac tagtcctgtg gctaaagtca aaccaaacat ggtcaacgtg 10921 ttgattgaaa acccaaaaac cttaacaaag gcaaaagtac caatcaatgt caggggaatg 10981 acaataacgg gaatgagagt ggtgcgccag tcttgtaaaa agataaaaat gactataact 11041 acaaggacaa ttgcctcaaa cagcgtcttg aatacttcgg aaagggatgc ttccacaaat 11101 aaggtcgtat caaacgcgac ctggtagtcc atcccagggg gaaagtcttg agcaagtcgc 11161 gcgatttcga gtttgactgc cttagcaaca tccaaagcat tacttcccgg agtcggaaat 11221 ataccgatac ctacaccctc tttggctcta aaccgtagga aggagccata gttttctgct 11281 cccagttctg cgcgacctac atctttgagc ttgatcagcg tgccattttg atctgttttg 11341 agaatcatgt cctcaaattc ggacggttcc ttgagtctgc tgagggcttg caggtctatt 11401 tgatacattt ggtcttttgg agctggttgt tgaccaattt gcccagcacc tacctgtatg 11461 ttttgttcat tgagggcatt aatcacatct tgggcggtga ggttgcgact agcaaggcgg 11521 ttgggatcaa gccacagacg catagcatag cggcgttcac caaaaatccg cgcctcactc 11581 acaccgctaa ttcttttgag agcatctact atgtaaaggt cagcgtaatt gcttaaaaat 11641 acgttgttga actcattttt ttcactgtac agcccgatcg 
ccaacaggat attagaagac 11701 tgcttgctaa cagtgacccc agtccgctgt accggttctg gtaactgcgg ttcagccagc 11761 gatacccgat tttggacatc gactgctgca atatccttat tgcgagacgc gtcaaatgtg 11821 actctaatcg tactggagcc attgttacta ctactcgaag tcatgtactt caagccctcg 11881 accccgttga tttgccgctc taagacagtc gtcaccgtat tttctacgac ttcagcactg 11941 gctccaacat agtttgcagt gacgttaact tgaactgggc tgatttctgg atactgcgcc 12001 gtgggtagtg tgggaatgct aactgctcca actagcaaaa taagaatagc gcagacactg 12061 gtgaagacag gtcgcttgat gaaaaagtca acgaacataa gttatggagg aatggtaagt 12121 tgcacaacag ggagtagggg aagtgggcag aaattcaaaa ttttgaattt tgaattttga 12181 attttgaatt ttgaattttg aattttgaat tttgaattga tttgtcccct tgttcctatt 12241 gctaagactc aggaataatg ggaacgccat ctctaagatt gagtagcccc gaaacaataa 12301 ttctgtctcc tggctctaat ccttgaagaa cttggtagtt gttacctgta atgttgccta 12361 gtttcacgcg tttttgccga gctactagtt ggaatgttcg ttgttgattt ccttgggttt 12421 tttctgcttg aggtttttct ccttgaggag gttttgatgg tgctggaact gctacataaa 12481 taaaagtttc tcctgccaaa ttagtcactg ctgttgttgg aactaatact cctgggcgct 12541 gactccagat cactcttgcg taagcaacct gatccgtacg taattcattt ttagaattgt 12601 catagagcgc tttcacaagt aaagattgag tattattact ggcattagga gatataaaaa 12661 atacacggct catccctata ctcttgcctt gtgcatccat cacctctact ggtgttcctt 12721 tacgtagttg agatcctcgc tcaacaggca cagaaatatt caattctaaa ggttggtttt 12781 gagtaatagt aactaattgt gtagaagtat tgacaaaatc accaactttc actggtatgt 12841 ttccaactgt gccaggaaag ggagcagtaa ttttgtagta ctgaagctgg acttgttgtt 12901 gtttgatatt ggattgtgct tgctttaaag atttttcagc ttcgttaatg gtagcttgct 12961 gtgctttgat ctgggcttct gtttgaccca gagcagctag tgatgttgcg agtttgttgg 13021 catactgttc actcgtctgt cgagagacag ctccttgaga cgccaaactg gtataccttt 13081 cgtagtcttg ctggttcaat ttcaaatcag ctagtttaga caaccgttga gcttccaagg 13141 atctgaggtt agcacgggta ttctccagtt gtgactgagc gacctcagca gcagcatcca 13201 cactaccgac agctgcttgt tgctgtctag cgtctacttg gataatcgga gttcccgcaa 13261 tcactgtatc tccgtatttt acaaatactt gtgccacttg accttgaact ctcggctgaa 13321 gagtcactga tcggcgcgat 
tctacggtag caaggtactg ttgactatcc tcaatcatcc 13381 cagcttgtac tgtagatatc ttgactcttg ctgctggcgg ttgagcgtta gcagttgcgg 13441 gtgctttgtt ttgaggagcc agccaacgcc aaagtgcaaa acctccgcct gctagtagta 13501 gggctaacaa taaccaaagc caccttcgct gtttgcgggg tggctcttct aattgggcat 13561 cttgacgact gtcttcaaga tgagtttgag gctcaggaga tttcatgact tacagagcac 13621 ctaagaagaa ataatcacac aatttaatca aaacaggagc cttgatagca aaattgctta 13681 gctagcatca ggcttttacg gaaaccactt agtagtccat ttcaatatac ttcttccgtt 13741 tagttgtcta atctatccaa aggagaatga tataaataag cttacatata actgttccat 13801 gttgtctttt ctgttcttgc gaaaaatgtt ttaatcttgc tattactcac caaaataatt 13861 ctaagtccct aaggagccac gctcgttgca gaaattcccc ttcctgcatg caaactgccc 13921 tcccttattc gcatatctcc ttaaaagacg ctctcgttaa tctccgaaac taagcacatt 13981 caagtcatga ttttggctga caccgcctga aaaactgcgt tttttataca taatctacgt 14041 aaattgatac gttctgcaat tacaatagta ccctttagtt acttgattga gaggactggg 14101 ggaatcgcgc tcagactgcg gtcgaattta tcttcctgca aacacttgtc ctggatttct 14161 cggtggtgat cgacgtggca aaagtcgtca aagctgattt caggaatatc aaatgaatgt 14221 gatgtaggag tagacagaca ataaattttt aaggctgaaa aatcaatggc aatcacagga 14281 actcctacac cagaacgtca gacagcacaa ggatatcgta ttcctgatta ttcaaagaat 14341 aattataaag aattgatgaa tatcttgcat ggttttggat atgtaaaccc agaagaccaa 14401 agcaatgatg ttgataaaaa agctctgata gcttttcaaa aatatatgaa aattacagcc 14461 gatggggtgt atggtccaaa aacccaagaa aaacttgctg aagctatgag aattctccac 14521 ggaaacttaa atacagttct caagacaaac ttttccttca atgaaccatt ttatggacca 14581 aaaacgactg atgcagttaa acagtttcag cacaagtaca acgtaggagc accagcagga 14641 gaagcaaatt tagtaactcg tcaaaaactt gctgaacaag caagtaaata actcgttgaa 14701 atctgctaag tagctcaaca aacgtaaaac gtaaaatgtc attacgagcc aaatcaaacc 14761 ctatgatcaa agggcaatca cggtgaattc aagtaattgc caacactcta ttttacgttt 14821 attcatgttg acctacttag ccagcactca cagtttaaga caacaaacga gtacaaaaca 14881 atgcaaggta aaaaaagaca tttccaagct ataggcttat ctgtatctct ctttgccgcc 14941 tctgctattg gcttgctggc tagccacagt gttgtacaag caaagaagcc agttccaaga 15001 
cccacagttg ccacagtaaa aagcatggta aatggtgacc tcatgtgtta cgtcaactta 15061 gtagatgaaa aaggaaaaca gtataacagt gtcggtgcat cgtttgagat ttgtgcaaat 15121 gaaaaacgtt ttttaaacaa gaaagtgcag ctatcttata gccaagcttc agtcaatgat 15181 tgccaaagtg ctgagccttg cggtaaatcc cgaatagaaa ctctcattac gaagatgaaa 15241 atcatccggt aattgtataa gcgaattggc aattattaag aatcacaatc aagttacaca 15301 ctatcaagct tatgaggaaa atcaaggatt agaggtaaat tggtatcgcg attaactttt 15361 ttacgaggtc tacccctacc agagtaagca gtctgaactc aactcgcttg gcatatctgc 15421 ggctacggtt ggggcaagtt gaatcagata gaagcagatc acgctcaggc aaaatcctat 15481 gctttgcaac aagtttgttt ttttgaactt cataagaggt tttaagtaat atattgattg 15541 agcttttctg ctaatttggg actttttacg agctggtaat gtcctccata aattaagtgt 15601 ttttctaaac ctccaattgc aagagtatcc caaccaccag acgagtctgc ggtcatttct 15661 tcaggttgaa cgcttgcgca aaacaagaca atcgatccag tatatggttg ggggcgatag 15721 cgaagccgcg cttggatgtt ggcttggaag acttgaaata agcgacgaat ttgtgctagt 15781 tctaactccg gtggtagaac tatagctgtc ttagcctgct ctaaaatata agtcaattgt 15841 gcgtctaatt ctagttcttg tagttgagcc agtgatacta ccaactcttt tcctaaaaga 15901 ctaccaaggt cttgggcaaa gtagtaggaa gctaaggtag catcatctag aacccgaacg 15961 cttgactttt gtgacaaaat gcgcttgcgt tgttccgata cctggtctat tacttttgga 16021 gtgtagctgt cgagcagtgc tagcaacgct acttcttgtc cggaaacgga taactgttgg 16081 gctatctcaa aggcgatcgc ccctcctaaa gaccaacctc ctatataata gggaccagtg 16141 ggctgaatgg tttgaatggc tttaatataa tgggttgcca tgtcttccac acaggtcatc 16201 ggttcttgtt ccccattgag tcctactgat tgcagtccat aaaatggctg ttcgtcaccg 16261 agatgacggg ctaaatcagc atagcaaagg acattaccac caacgggatg gacgcagaag 16321 aagggcgagc gattaccttt tggttgaatg ggaactaagc aggaaaatgg ttgctcttga 16381 tgctgctgtc caattgttgc ggctaaaccg gcaatattag gagttttgaa gagattttgc 16441 aagggtaagt ctacagaaaa tttgttctgc aaccgataca tcacttgcag cgctcgcaag 16501 gaatgaccac ccagttgaaa gaagttgtcg taaatcccga cattttcaac gcctagtaca 16561 tcttcccata tttttgctat ttcccgttct acatcgttcc gaggtgctac ccgttcgcct 16621 ttgtgaggta taattttgcc gacaagcttg tgaagatcga tttcaccgct 
atcggtaagg 16681 ggcatttcct ctactaactg aagctgacag gtggtaggag tgccgaagcg atcgcacagt 16741 gtttgtggta ttgcgggtac tgtgcctgta aaataggcga ttaatttttc gcaactatca 16801 cattcggact gatagcgttg tatatttgcc ttgttgccat ctataccgat ggtcaagcgt 16861 gtgggttcca aactcattcc tgccaaaaat gcataaattc cttgctgctt ctcaatcgca 16921 tagtaaccgc gatcgcgact gagttgtttc atttcatagc cccgactcat accagtttcg 16981 tcccacatac tccaagcaaa acaataactc tgcaaggaac tgcgagttct ttggcagtca 17041 gaaaaagcat caagaaaact gttggctgcg gcataagcac ccacagtcgt cccgccaaaa 17101 aaaccgttga tcgaagaaaa gttgataaaa atactatcag gtttatcttt aatcaattgg 17161 tgtagtaccc acgttccaga taccttagaa cgcaaggctg ttgccaaacc tgtctgcgtt 17221 tctgactcta ttaaactctc acgggctaca cctgccaagt gaataatccc atccaattcg 17281 cccaatgcag atatgatgac ttccataccc gggcgatcgc aaatatcgac tgtttcataa 17341 actacagtcc catttagctg ttgtaattgc tcaaaagctg cgagtttttc tgccagtagc 17401 ggtgtccgac caaccaataa taacctggca tgataatttt ccagcaggta tttcgcaatt 17461 tctacgccaa ttcctcccaa cccgccagta ataagatagc tccctcctgt tttaaaaggt 17521 aatggttgct taaattcact ctcccagttt actcgtaaaa gacgcggaac caaacgctgt 17581 cctttgcggt acgcaacttc tcgttctcct gttatgtctt gcatctccgc tagaatataa 17641 tccccgttga cttcaactag atcgacatat aaatcaatat gacgacagtt caaccaaggc 17701 atttcttgag ggatcgtttt cagcaaacca agaactgaag atttttcata agcaatctcc 17761 tcagtagttt ctgtcaactg ggtatggctg gaaatgaata acagttttac tgggttttgt 17821 tcgccttgaa cctttgccaa agcttgtacc aagaataata aactgtaaac tccggtattt 17881 tgagccgttt ccaattctgc cagactggaa atttctcctt tatatttgct gtaattccat 17941 aaatgtatga tttgctcgat atatttttgc ttggctgtga tgctttctag tagtaactga 18001 taatcttctg ctctttctgg atttatactg tagctatcgt cgctgatttt cgcaaaaacc 18061 gagcctaccg atactttaat acaagattga ttttcttgcg tcagttgctc gccgatataa 18121 tcccccaaac ctaatggatc aataaagact aatgtcggtc ctgttgacac cgtcaatcgg 18181 cgtacttgtt ttgtttgcca gacttggcgg taaaaccaat caggaattgt tttggcattc 18241 tccaacagga tatctacttt tttgagaata gagttgaatt cgcctgtttc taattgctga 18301 cacagttgcg atcgctgaat tttgccaata 
gaagttttgg gaatggcggt cttgtctaca 18361 ggcagtaaat aatcaggttt aatgccaatt tgactgacaa cttgctgctg aatttcttct 18421 aatttttccc gcaagagacg ttcgttatca gcgaggggac tagagaaaaa tatcgctaat 18481 ttatctcgtt ctacagatac cgcagccgtg tatgacactt caatatcagg aattttttcg 18541 acaacggctt caatttcatg gctgtaataa ttagccccat taataataat tacatctttt 18601 tgccgtcccg taattgttaa gcgtcccgct tggagaaatc caagatcgcc tgtagcgaac 18661 cagccatctg tagtaaagac ttctccattg agttgagaat tgttatagta accttgtgtg 18721 accgactcgc ccttgacttg taaatacccg actgtccctt cttctaaaac ctgattttca 18781 ctgctgacga ttctcaaaga cacacccgga atgggaagcc ctagttctgc aaaaatcgaa 18841 gttgggtttt ctgacaaaaa cttgtcagaa aaagtcacac ctgaagaagt ttctgccatt 18901 ccccaagagg aatgcatagc agaggcagat aattggtggg gttgaaataa ttcccaaaat 18961 ctttgggcag tttgcggtac aattggttcg gcagtattta gcaaagacct gacagcagac 19021 aaattccact tccgctgttt gatttcatca gattgctcgt taacaagagt aaaggcaaaa 19081 ttaggactcc aagtcgttgt cacctgatag cgttccaaca agtcaaacca tgtcagcggc 19141 tgatgtaaaa ctacatctgt gggtgcgtga atttgctcgc aaccgagaaa tacgcatcgg 19201 atacaacagc gaatcagagg tccgggatgg tctagcggta accaattgag agatatatct 19261 tcttgagtca attggctgac ttgggaagta gccgccacgc tacttaatat attccaatga 19321 cttagcatta ctcctttagg cattcccgta cttccggaag ttaagagtaa aattgctaaa 19381 tcatctggct gactgagatg ataactggga tcggggtcgt actcgtggag gcgatcgcaa 19441 ctttccactt gaaaattctc taaacccaga agccccgaca aagaagcgat cgcctccacc 19501 agttcaatgc ttgccagaat gatcggcttt tccagcattt gccaagcatt gtgcaattta 19561 ttcacagcat tattgacttg gctgtaggta ggggcgatcg ccagggggac aggtacaaaa 19621 cctcccagaa tacagcccca aaatgctgga ataaaatctt gcggattttt gatttgcagt 19681 ataactttat cttggggttg gagaccgagt tttttcaagc ctgcattgat gcgttgtgcc 19741 tgtgtcagca aatcctcata ggattgaaat aactcgttac catcgcgctg tatatatata 19801 atat // LOCUS NODE_1656_length_19649_cov_5.75053619649 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 19649) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 19649) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..19649 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(193..858) /locus_tag="DP116_14525" CDS complement(193..858) /locus_tag="DP116_14525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316145.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_14525" /translation="MLKLYHDPISPNSRRVWITLLEKELEFELVEVKLDGEQFQPEYL ALNPFHHIPVLVDDGFKIVESLAILDYLEAKYPKPAMLPTDAKDLAIARMAQLVTVNE LLPAISPLFPMMLGLGEGNPEKIEQAKQKASTVLKFFENLLDERPYFGSQNLTLAEVV AGTVVPWLPRGSVSLSDYPKLSAWCDRLMVRPAWQTTEATPEMIEAFKSRMAARMAQT PIS" gene 982..1755 /locus_tag="DP116_14530" CDS 982..1755 /locus_tag="DP116_14530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="16S rRNA (guanine(527)-N(7))-methyltransferase RsmG" /protein_id="PRJNA477356:DP116_14530" /translation="MNNLKLPSLPDMANIWQETLQWTPTVQQQVQFQQLYEFILQGNS QLNLTRITDPQEFWEKHLWDSLRGIAPLLRDGGDEDVICAPTPPTSLSFIDIGTGAGF PGIPVAIAVANSIVNLLDSTRKKIAFLEETKAALDLTNVKTLTGRAEEVGQQPQHRQN YDVALIRAIGTASVCAEYALPLLKQDGLAVIYRGNWTEEETKVLENAVDQLGGIIELV DKFTTPLSGSIRHCLYLRKVATTPVHFPRPVGVPTQKPL" gene complement(1827..2570) /locus_tag="DP116_14535" CDS complement(1827..2570) /locus_tag="DP116_14535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="superoxide dismutase" /protein_id="PRJNA477356:DP116_14535" /translation="MAIARRHFLVLLGASAGAFTLDACALAQNAPSQNKPSPGKTGAI QLPALPYAYEALEPHIDAATMRFHHDKHHATYVKNLNAALDKNPELKSRTVEQLLRDL NSVPEDIRKTVRNNGGGHVNHSMFWRIMKPKGGGEPRGPIADAIKQNFGNFAAFKKQF NEAGASQFGSGWVWLVRSKDGKLAVATTANQDSPLSEGNYPIMGNDVWEHAYYLKYQN RRADYLSAWWNVLNWEEINRRFVEGGKAA" gene complement(2770..3951) /locus_tag="DP116_14540" CDS complement(2770..3951) /locus_tag="DP116_14540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872403.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_14540" /translation="MLYRRFGRTELKMPVFSCGGMRYQYKWQDVPQSQIPADNQKNLE ATIRRAVELGINHIETARGYGSSEMQLGRILPTFPREELIVQTKIGPQEDPKEFQHDF ETSLRNLQLDYVDLLGIHGINNAEFLNYTIRPGGCLEVAKKLQAQGKVRFIGFSTHGS TDIIIQSIHTNQFDYVNLHWYYINQVNWAAIEAANSQDMGVFIISPSDKGGMLYKPPQ KLINLCTPLSPMVFNDLFCLSHSQVHTLSLGAAKPQDFDEHLKTLDLLENASEILSPI LTRLEEEAIATLGEDWVKTWHLNLPTFEETPDEINIPVILWLRNLAIAYDLLEYAKMR YNLLGNASHWFPGNKADEVHTLDLQQCLSYSPHADKIPRLLAQAHQMLAGEAVQRLSQ S" gene 4213..4581 /locus_tag="DP116_14545" CDS 4213..4581 /locus_tag="DP116_14545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745542.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14545" /translation="MKTLLVIAIGFLPSLLSLWLMRKTEARTRSRFRRAAATIYPVGR VQQNRVSNDSDCTERSAFMGDRYYLEGVGYLVGDISCKFNARSGYLRCAVNPTGPCEG CRLYQATETVSREKSSSELM" gene complement(4627..5667) /locus_tag="DP116_14550" CDS complement(4627..5667) /locus_tag="DP116_14550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318884.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1611 domain-containing protein" /protein_id="PRJNA477356:DP116_14550" /translation="MRLALNQRVAVLLHNELLGRHGKTGLSILRYSEAPIVAVIDREY AGKSLTELTGIKRDVPIVASVAAALEYQPQVLVIGIAPKGGVLPDDYWHELKNALSAG MSIVNGLHTPLASMPDLKALLKPGQVIWDVRKEPPNLDVASGLARNLACQRVLTVGTD MSVGKMSTSLELHWASRLRGLRSKFIATGQTGLMLEGDGVPLDAVRVDFAAGAVEQVV MRYGKNYDILHIEGQGSLLHPGSTATLPLIRGSQPTQMILVHRAGQTEVLNGVPIPPL SEVVKLYETVARAGGAFASVPVVGISLNTKDLDESQALDAVAKTETETGIPCTDPIRF GVDKLLNALMQG" gene complement(5651..6703) /locus_tag="DP116_14555" CDS complement(5651..6703) /locus_tag="DP116_14555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128498.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dipeptide epimerase" /protein_id="PRJNA477356:DP116_14555" /translation="MQIKVEIFTVNKRFPLTISRGTTAQTTNIWVRILHDGIEGWGEA SPFNVGTHPQSTEIIKDALLQVAPCFQAFSPLQRQEIEQILRKALLPSSAVAALDMAM HDWLGKRVGLPLWQIWGLNRDAIVPTSVTIGINSPEGARARARDWLAFTNVRVFKVKL GSPDGIDADRKMLIAVREEAAARDLYVDANGGWSLEDAVFMCNWLADIGVKYVEQPLP KGQEEHLAELKRLIPLPIFVDESCFNSSDIPKLASCVDGINIKLMKSGGLTEAMRMVH TARAYKLQVMFGCYSDSTLANTAAAQLAPLADYLDLDSHLNLTDDPFTGAYMHEGKIL PNDLPGLGVQNSASGA" gene complement(7427..7546) /locus_tag="DP116_14560" /pseudo CDS complement(7427..7546) /locus_tag="DP116_14560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314339.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="phosphatase" gene 7524..8024 /locus_tag="DP116_14565" CDS 7524..8024 /locus_tag="DP116_14565" /inference="COORDINATES: protein motif:HMM:PF07924.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nuclease" /protein_id="PRJNA477356:DP116_14565" /translation="MSSILLSIIFTDKMSLTDSFQPERFSEQQLLEKLKESTEGILWS SESDYPLEVVFWETSTISPEKLLQLTNKPPDTVVTVQEGDKFFAKLFRQYKQYISESS DEESEKEYKSCLVKCQKLADLLKANLTDIQYFDVGGNQSEIHLYLIGKSPSGNILGLS TIGIYT" gene complement(8054..8314) /locus_tag="DP116_14570" CDS complement(8054..8314) /locus_tag="DP116_14570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308989.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14570" /translation="MGRLNPYTLQMELTRMFENGQSFFATTKVHDWLKERHHNPADYD IIFHQKPAPPGSKSVMVVEVELRRKDGEPVDPWLQEQANLHA" gene complement(8386..9273) /locus_tag="DP116_14575" CDS complement(8386..9273) /locus_tag="DP116_14575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408955.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14575" /translation="MSKSISKLVDELPSSNLTVSALRALDFIAPGEWQNVVGFVNTIK TVTGETDERLIQQIGERAVYLYNDRSQGYQRAMWLYQTVERTDKALGAAALANKVGEK VPLLGFLNRLTPKADKAQTIDLCLKLVVELVAFCQINGIPGNSIGDFVKSLGEYSGES FIRMAALVCFDGLIPLGPDFINRALSRINQISPQELEQNSTFANIKAEIPGNNSSSKL NFIGESFNSVSGWMTSLITSKHLTPQKVVNHLQSFVDVADDKLDFVAALLDISTNYYE HTGTQTLARRLIERAAAEI" gene 9691..10065 /locus_tag="DP116_14580" CDS 9691..10065 /locus_tag="DP116_14580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743859.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4870 domain-containing protein" /protein_id="PRJNA477356:DP116_14580" /translation="MQVTYDSDKRKLLSSLCHGAIFFSTALFSIGVPIIINLISDDPV VKSNAKESINFHFNVWFWAILIGVPIGILSFLTFGLGGILFFPVVAFGFLLHWGLTIW ALFHCFTNPDEPFRYPFIFRVF" gene complement(10124..10456) /locus_tag="DP116_14585" CDS complement(10124..10456) /locus_tag="DP116_14585" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14585" /translation="MQRVTVDDIKRDPLKYLNQVQAGKSFVIVQADKAIAELKPITNT NKQLRPFGLCAGEFTVPDDFDEPLPEDSLRGYYVHTSCSKPHPALSGRVRFRCVVGLG RVVYASKK" gene 10614..10877 /locus_tag="DP116_14590" CDS 10614..10877 /locus_tag="DP116_14590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491503.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14590" /translation="MEPSPHELNQPKSKVESPTTLLKRFSAGAAVGLIIAALNWGSYA YFFDNPIPLTNGIIFCLGLTIISGLMTLKWGYNTLESLLQLLS" gene complement(10988..13627) /locus_tag="DP116_14595" CDS complement(10988..13627) /locus_tag="DP116_14595" /EC_number="6.1.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alanine--tRNA ligase" /protein_id="PRJNA477356:DP116_14595" /translation="MSFHPQYKSGNEIRALFLDFYAQRGHQILPSASLVPEDPTVLLT IAGMVPFKPIFLGQRTPEFKRATTSQKCIRTNDIENVGRTKRHHTFFEMLGNFSFGDY FKEQAIAWGWELSTKVFGLPPEQLVVSVFEEDDEAFGIWRDKIGVPEARIKRLREDNF WAMGSTGPCGPCSEIYYDFHPERGDDNIDLEDDTRFIEVYNLVFMQYNRDAEGNLTPL ANKNIDTGMGLERMAQVLQGVPNNYETDLVFPIIKTAAEIAGIDYTKADEKTKVSLKV IGDHIRAVVHMIADEIRASNVGRGYILRRLIRRVVRHGRLIGIGGEFTTQVAETAIAL SESAYPNVRQRENAIKAELQREESRFLKTLERGEELLAEIIQQVQSKGATQISGESAF TLYDTYGFPLELTQEIAAENNLTVDEAGFDVEMQKQVERAKEAHKTIDLTVQGSLDKL AEDIQATEFIGYTQPVATSKVEVLLIGGVSQEEAETGTDVQIVLDQTPFYAESGGQIG DRGYLSGDGVLVTIHDVKKESDFFVHFGRIERGTIRVGDAVTAQIDKACRRRLQANHT ATHLLQAALKKIVDDSISQAGSLVSFDRLRFDFNCPRPLTPDEIHQVEELVNSWIAQA HSAKVEILPLAEAKARGAVAMFGEKYGEEVRVIDFPGVSMELCGGTHVSNTAEIGVFK IISEAGVASGVRRIEAVSGAAILDYLNVRDKVVKDLSDRFKVRPEELPDRITTLQNEL RTSQKELETLKSQLAIAKSDSLLQTVESIGDYKILIAKMEDVDPESLKTVAERLLQKL GNGAVVLGSVPEEGKVSLVAAFSSEVNKKGLQAGKFVGAIAKICGGGGGGRPNLAQAG GRDASKLPEALEQARSELRSALG" gene 13941..14144 /locus_tag="DP116_14600" CDS 13941..14144 /locus_tag="DP116_14600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866590.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="copper chaperone" /protein_id="PRJNA477356:DP116_14600" /translation="MALKLTVPNISCQDCATTITESIHTMEPDAKVDVDVEAKTVTVE SKASEESIKQSIVAAGYHIEGYQ" gene complement(14425..15651) /locus_tag="DP116_14605" CDS complement(14425..15651) /locus_tag="DP116_14605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179696.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_14605" /translation="MRTAYQYKLRPTKQQARDIDKWLSMLCAQYNYLLADRFNWYEQN RCPVNACPLICYLPELRDNPDYYGQKRTLPELKKTHPHYKEVYSDVLQDVVGRVKKTF ERFLSGDSNGKRSGRPRFKSRERYRTFSYPRIKENCIVGGKITLPKLGAIKLILHRSI PAGFKVKTVSVTKKADGYYVTLSLEDLTVPTVKPDINPDSITGIDMGLKEFLTTSEGE PVAIPQHYRKAQKRLRVIQKRVSRRKKGSNRKRKAVKQLGKQHKKVADKRKDFHFKTA KQLLSKYDVVVHEDLNVKGLSRTRLAKSMHDAGWSSFLSILSNKAENAGLLVIAVNPK NTSQDCSSCGVKVPKKLHERWHSCQCGCSLDRDHNAAINIRNRALGHRVLKAQLMSDG IPGVAEKPTLYCTQSV" gene 15712..15915 /locus_tag="DP116_14610" CDS 15712..15915 /locus_tag="DP116_14610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313026.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14610" /translation="MKNKELKIRISERRLNKVRLYAAHADKTMTQVIEELIDSLKIPE IANQGTVRFTLPANSEIDSTDVP" gene complement(16002..16271) /locus_tag="DP116_14615" CDS complement(16002..16271) /locus_tag="DP116_14615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308351.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14615" /translation="MDKPILQKDGTTFELGMNIPWFVAVHFHPLAKEKYSYAIAIHNV LECNPFPIADFDSCLFGCYETAYQALNAGVEEAQRRASDFGENIK" gene complement(16348..16665) /locus_tag="DP116_14620" CDS complement(16348..16665) /locus_tag="DP116_14620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863967.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14620" /translation="MLDFNTLTAFSSTYCIGICAFLIPANLISTLSTIILTLLSRPAI QIWQSAGIASFFALLMILHVFAWFMMGVVMAPTYILLVLGSICLLANFMTILSHRRFA FRT" gene complement(16872..17177) /locus_tag="DP116_14625" CDS complement(16872..17177) /locus_tag="DP116_14625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131640.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14625" /translation="MTNLAFGVEPAVLITIIGETVLKDRIVKLLKSHSVSGYTISQVQ GEGGHGRRLADLAGYNTNIEIKTIVSPEVSDTIFWALKEEQGKHALIVFRYNVEAFY" gene complement(17339..17644) /locus_tag="DP116_14630" CDS complement(17339..17644) /locus_tag="DP116_14630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_14630" /translation="MSVYVGNLSYEVTEESLNAVFAEYGSVKRVQLPTDRETGRLRGF GFVEMGTDAEETAAIEALDGAEWMGRDLKVNKAKPREERGSFGGGNRGGNNSFRNRY" gene 18086..18580 /gene="infC" /locus_tag="DP116_14635" CDS 18086..18580 /gene="infC" /locus_tag="DP116_14635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488884.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="translation initiation factor IF-3" /protein_id="PRJNA477356:DP116_14635" /translation="MIIVIQKQLINSQIKSPQVFLIDHENNNRGLTDTHEALQLAQSL ELDLVLVSEGKDAPVAKILNYGKLQYQKKKRQGQSARPTVKEVRLRPNVGVADYNLRI EQASQWLSKGDSVKFVIRLRGRENQYREQAGEMLERIVTALSQVGKVQSLDKRSLVAQ VIPA" gene complement(19249..19602) /locus_tag="DP116_14640" /pseudo CDS complement(19249..19602) /locus_tag="DP116_14640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015135644.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 5515 a 4342 c 4038 g 5754 t ORIGIN 1 ttagtcttct agaaaactag acctcttgtc tatagcggtt ttcggcttga gtgtaagaca 61 aatcttgtgg agtgggcttc ttagcttccc ggatataaga caggcgctaa tcccatccca 121 caagatagga gaaccgtaat ttgattcaat taagaaccgt tatagtcatc gggtaacgaa 181 tgcaatttta tactagctta taggtgtctg cgccattctt gccgccatgc gagacttaaa 241 cgcttcgatc atctctggtg tagcttcagt agtttgccac gctggacgta ccatcaagcg 301 atcgcaccaa gcacttaatt ttggataatc actcaaggac acacttcctc tcggcaacca 361 aggtactaca gttccagcca caacctcggc taaagttagg ttttgactac caaagtaagg 421 gcgttcatct aataaattct caaaaaactt cagtaccgtt gacgctttct gttttgcctg 481 ctcaattttt tcgggatttc cttctcctaa gcctaacatc atcggaaata gtggagatat 541 agcaggcaat aactcattca cggtgacgag ttgtgccatt cgggcgatcg ccaaatcctt 601 cgcatcagtt ggcaacatcg caggcttagg gtacttcgcc tctaaataat ctaaaattgc 661 caaagattcc actatcttaa agccatcgtc taccaaaact ggaatatggt ggaaggggtt 721 aagcgccaga tattctggtt gaaactgttc gccatccagt ttcacctcca ccaattcaaa 781 ctcaagttct ttctccaaca gagtaatcca gacacggcgc gaatttggag aaatcggatc 841 gtgatacagc ttcaacatag aaaaaactta cagttttata aatagggtgg gcattgtcac 901 cctataccct acactcaatg gaatgactct aacaacaaac aaagaaaact tggaatacaa 961 ttcaaagcta tttgtgttac tgtgaacaac ttaaaattgc cttctttgcc ggatatggca 1021 aatatttggc aggaaactct ccaatggact 
cctactgtcc aacagcaagt gcaattccaa 1081 cagctttacg aattcatact acagggtaac agccagctta atctaactcg cattaccgat 1141 cctcaagagt tttgggaaaa gcatctttgg gattctctac gaggaatagc ccctctatta 1201 agagatgggg gagatgagga tgtaatttgt gcccctactc ctccaacttc tctatctttt 1261 atagatatcg gtacgggtgc gggttttccg ggaattccgg ttgcgatcgc tgtagctaat 1321 agtattgtaa atctacttga ttcaacaaga aaaaaaattg cttttcttga agaaacaaag 1381 gctgcgcttg atttgactaa cgttaaaact ttgactggta gagcagaaga agtcggtcaa 1441 caaccccagc accgacaaaa ttacgatgtc gccctcattc gcgctatagg gacagcttct 1501 gtgtgtgctg aatatgccct gccgttactt aaacaggatg gtttagctgt aatctaccgg 1561 ggaaattgga cagaagaaga aacaaaagtt ctagaaaatg ccgttgacca gttgggtggc 1621 attattgaat tagttgacaa atttacaact cccttaagtg gaagcattcg tcactgtctt 1681 tacctgcgga aggtagcaac tacaccagtc cactttcccc gtcctgttgg tgttcctacc 1741 caaaagccgc tttagaaaag tgagggacta aaggaaaggg agagtaaaga aaaattccct 1801 ctttctctcc ctctctccct tatcccctac gccgctttac ccccctctac aaaccgtcta 1861 ttaatttcct cccagttcaa gacattccac caagcactca agtaatcagc gcggcggttc 1921 tggtacttga ggtagtatgc gtgttcccac acgtcattgc ccataattgg gtagttacct 1981 tcactaaggg gactgtcctg attggctgtg gttgcgactg caagctttcc atctttgctg 2041 cggactagcc aaacccatcc actgccaaat tgacttgcac cagcttcgtt aaactgcttt 2101 ttaaaggcag caaaattgcc gaagttttgt ttgattgcgt ctgcgatcgg tcctctaggt 2161 tctccaccac ccttcggctt catgattctc caaaacatcg agtggttcac atgaccacca 2221 ccattattgc gtactgtttt acgaatatct tctggcacac tgttaaggtc acgcagcagc 2281 tgttcaacag ttctgctttt gagttctgga tttttgtcta atgctgcgtt taagtttttt 2341 acgtaagtcg catggtgttt atcatgatga aaccgcattg ttgcggcgtc tatgtgtggt 2401 tcaagcgcct cgtaggcata cggtaacgca ggtagctgga tagcccctgt tttgccggga 2461 cttggtttgt tttggcttgg cgcattttgt gctaaagcac aagcgtctaa agtaaaagca 2521 cccgcactag ccccgagtaa aactaagaaa tgacgacgag caatagccat aatagtgact 2581 gtggcagaaa aatgttatct tcagttgtta ttttctgaaa tttagactaa aaactatatc 2641 gattttcaat tttaaccata agtagaccca aacacgtgtc aattcttaat actgaatttt 2701 aatagtctta aaaaatgctt ttatcattat aaatactgct 
gcttgtgtcc aaaagtactt 2761 tttgcacctt taactttgcg ataaccgctg cacagcttca cctgctaaca tctgatgtgc 2821 ttgtgcaaga aggcgaggaa ttttgtcagc atgaggactg taactaagac attgttgcaa 2881 gtctagtgta tgtacctcgt ctgctttgtt accaggaaac caatgactcg cattgccaag 2941 taggttgtag cgcattttgg catattctag taagtcgtag gcgatcgcca aatttcttaa 3001 ccataaaatc actggaatat ttatctcatc aggtgtttcc tcaaaagtgg gtaaattgag 3061 gtgccaagtt ttcacccagt cttcccccaa ggtagcaata gcttcttcct ctaggcgtgt 3121 caaaattggt gacaaaattt cagatgcatt ctccaacaaa tctaaagttt tcaggtgttc 3181 gtcaaaatct tgtggttttg cagcccccag actcagagta tgtacctgag agtgacttag 3241 acaaaacaaa tcattaaaaa ccatcggact caaaggagtg caaagattaa ttaatttttg 3301 tggtggttta tatagcatcc ctcctttatc agatggacta ataataaata cacccatatc 3361 ttggctattt gctgcttcaa tagcagccca atttacttga ttgatgtaat accaatgcaa 3421 attgacgtaa tcaaattgat tagtatgaat gctttgaata atgatatctg ttgatccatg 3481 tgtggaaaag ccgataaacc tgacttttcc ttgtgcctga agtttttttg ctacctccaa 3541 acagccgcca ggacggatag tatagtttaa aaactcagca ttattgatgc catgtattcc 3601 aagcaagtca acataatcta actgaagatt tcgtagtgag gtttcaaaat catgctgaaa 3661 ttcttttgga tcttcctgag gaccaatttt cgtttgaaca attaactctt cgcggggaaa 3721 cgttggtaaa attctcccca actgcatttc tgaactacca taaccacggg cagtttcaat 3781 gtgattaatt cctaattcaa cagcccgtcg aatcgttgct tctagatttt tctggttatc 3841 agcaggtatt tgcgactgcg ggacatcctg ccatttatac tggtatctca tgccaccgca 3901 ggaaaaaaca ggcattttta attctgtgcg tccaaatcgt ctgtataaca tcagtggatt 3961 catgagcgaa atcaaaaaag aattacagct acggaaaatc cgaaggcgat atttaagttt 4021 aaacctactt taacctaagc ccacggtttg agttacaacg gttatgctgg taaaagtggc 4081 ggctaacaca acactttttt gagtgttagg tttcttccac acaaaatttt ccacgggttg 4141 tactctccct aatatgtgag gatataacac cagactataa gcgcacaaaa gagattaaaa 4201 gaaggtgttg aagtgaagac acttttggtt atcgccattg gtttcttacc gtccctatta 4261 tccctgtggt tgatgcgtaa aaccgaggct agaacacgtt cacggtttag acgagccgct 4321 gctactattt atcctgtagg gcgagtgcaa caaaataggg tcagtaatga tagtgattgt 4381 acagaacgta gtgcctttat gggcgatcgc tattatctcg aaggagtcgg 
ttaccttgtg 4441 ggcgacatca gctgcaaatt taatgctcgt tctggttact tgcgctgtgc tgtcaatccc 4501 accggaccat gtgaaggttg tcggttatat caagcaacgg aaactgtttc tagagaaaaa 4561 agctcatctg agttgatgta gtcagtggtg aagtatcaaa gacgggtttt cccgccctga 4621 tagacttcaa ccttgcatca gtgcatttaa caacttatca acaccaaatc gtattggatc 4681 tgtacagggt attccagttt ctgtttcagt tttcgcgaca gcatccaatg cttgtgattc 4741 atctaaatcc ttcgtgttca aagatatgcc aacaacaggt acagatgcaa atgcaccacc 4801 tgcacgggca acagtttcat aaagttttac gacttctgat aaaggcggaa tgggtacacc 4861 attaagcacc tcagtttgtc cagcacgatg gacgagtatc atctgcgttg gttgcgatcc 4921 tctgattaag ggtagcgttg cagtcgaacc ggggtgcagt aaggaacctt gtccttcaat 4981 atgcaaaatg tcgtagtttt taccatagcg catgaccacc tgttccacag caccagcggc 5041 aaagtcaacc ctgactgcat ccaatggcac accatcccct tctaacatca aaccagtttg 5101 acctgtggca ataaatttag aacgcaaacc ccgcaggcgt gacgcccaat gtaactccaa 5161 actggttgac attttgccca ctgacatgtc agtacctaca gtcaaaactc gctgacatgc 5221 taaattacgt gccaaaccgc tagcaacatc taagttaggt ggttctttcc ttacatccca 5281 aatcacttgc cctggtttca gcaatgcttt taagtctggc atacttgcca atggcgtgtg 5341 caaaccgttg actattgaca ttcccgctga taaagcgttc ttaagttcat gccaataatc 5401 atctggcaaa acacctcctt taggggcaat gccaataacc aaaacctgag gctgatattc 5461 tagcgctgct gcaactgatg ccactattgg cacatcacgc ttgatacccg ttaactcagt 5521 taaagatttg cctgcatatt cgcggtcaat cactgcgaca attggggctt cactgtagcg 5581 taaaattgac agcccagttt tgccatgacg tcccagaagt tcgttatgta gtaggacagc 5641 tactcgttga ttaagcgcca gacgcactat tttgtacccc caaccctggt aaatcgtttg 5701 gtaaaatttt cccctcgtgc atgtatgcac ctgtaaaagg gtcatctgtt aaatttaagt 5761 gactatctaa gtcgagatag tcagctaatg gcgcaagttg tgctgctgct gtattcgcta 5821 gcgtactgtc agaatagcaa ccgaacatca cttgcaactt atatgcccgt gccgtatgta 5881 ccattcgcat tgcctcagtt aaaccccctg atttcataag tttgatatta atcccatcta 5941 cacagcttgc caatttggga atatcggagc tgttaaagca actttcatca acaaaaatag 6001 gtaaaggaat tagccttttt agttcggcta aatgttcttc ctgccctttt ggcagtggct 6061 gctctacata ctttacaccg atatcagcaa gccaattgca cataaatact gcatcttcca 
6121 aactccaacc cccgtttgca tcaacgtaca agtctctagc tgctgcttct tctcgcactg 6181 ctattaacat ttttcggtct gcgtcaatac catccggact acctaacttt accttgaaaa 6241 cacggacatt cgtgaacgcg agccaatctc gcgcccttgc tcttgcgcct tctggcgaat 6301 tgataccaat tgtgactgaa gtcggcacta tggcatcgcg attgagtccc caaatttgcc 6361 acaagggtag ccccacacgc tttcccaacc agtcgtgcat tgccatatcc agcgctgcga 6421 cagcagatga aggtagtaaa gcttttctta atatttgctc aatttcttgt cgttgcaatg 6481 gactaaatgc ttggaaacag ggcgctactt gcagcaaagc atcttttata atctcagttg 6541 actgcggatg agtgcctaca ttgaacggcg atgcttcccc ccaaccttcg ataccatcgt 6601 gcaaaattct cacccatata ttcgtggttt gtgccgttgt accacgacta atggtcaaag 6661 gaaaacgctt attgacagta aatatttcta ctttgatttg cataattacg cttaatattg 6721 cggtagaacc gctgactttc aacaaatcaa agcttatact acagcgtttg tgccgttgtg 6781 ccacgactca tgcttaaaga aaagcgttac ttatattcag ctttatgtaa ataaatattt 6841 caattattaa aacgaacata aagaattagc tagactttga tcattcttta gattctctta 6901 acaaaatttt ctccgtactt ctaatcaagt ctgttgcggg tagtcatacc tgcaaccttt 6961 ttgtgagtta gaagcgagat tgatatagct ctaaaagaat acttttttga tttgctttga 7021 tatctctcat tttttagaaa ttcctatgac tatcgttcca ggtacagttg ttaaactccc 7081 caacggtcgt aacggtctgg ttattccttc gccctggtgg aaacccggta gtgtgttagt 7141 gaagttaccg cgtggtaaaa agcgatggtt cagggtagat gagtgtattc ccataccctc 7201 aggctggtga gtcgcgatca ctcaaacatt gaagaagaag tcagaattca aaattcaatc 7261 agtgggggat ttagacgaag gatctaaatt caaaaaagac aaggataaaa acgcgccgaa 7321 gactttatcg ctggggttgt tggggtagcg atctgcgctc cgcgcagcag cgaagctatc 7381 gccctcacaa aaaggtgact cttgagccta ttcctcttct aaaaaaaacg atttgatatc 7441 ctgtttgttg aattgctgaa aattcttgta ttgttggttg tccggaagtg gcaatcaaat 7501 ctgatatttt tatatagtta taaatgtctt caattttatt atccataata tttactgata 7561 agatgtcgct tactgatagt tttcaaccag agagattttc tgagcagcaa cttttggaaa 7621 aactaaaaga gagcacggag ggaattttgt ggtctagcga atcagactat ccattagaag 7681 tcgtcttttg ggaaactagc acaattagcc cggagaaact gttgcaattg acaaacaaac 7741 cccctgatac tgtagtcaca gtccaagaag gagataagtt ttttgccaag ttattcaggc 7801 
aatataagca atacatttcc gaatcgagtg atgaggaaag tgaaaaagaa tacaaaagct 7861 gcctagtaaa atgtcaaaaa ttggctgact tgctcaaagc caatctgact gatattcaat 7921 actttgatgt tggtggaaat caatcagaga tacatttata tctgattggc aagtctccat 7981 caggcaacat attgggattg tcaacaatcg gaatttatac gtaacgaaat catcagtaac 8041 gtattcttcg ttatcatgca tgaagattgg cttgttcttg caaccacggg tcaacaggtt 8101 ctccatcttt acgacgcaat tcaacttcaa caaccataac tgatttagaa ccaggaggtg 8161 ctggtttttg atgaaaaata atatcgtaat ctgcaggatt atgatgacgt tcttttaacc 8221 aatcatgcac ttttgttgtc gcgaaaaatg attgcccgtt ctcaaacatc cgggtaagtt 8281 ccatctgtag agtgtaagga ttgagtctgc ccattgttaa ctacctttgt taataatgat 8341 ttatgatcca acacttgctg gggctagcac ccagatttac ctggcttaaa tttcagccgc 8401 tgctcgctct atcaatcgac gtgctaaagt ttgtgtacct gtatgctcgt agtaattcgt 8461 tgatatatcc agcaacgcag caacgaagtc tagcttatca tcagcaacat cgacaaaact 8521 ttgcaagtgg ttgacgacct tttgtggtgt caaatgcttt gaagtaatta gactagtcat 8581 ccaaccactc actgaattga agctttcgcc aataaagttg agtttgctac tggaattgtt 8641 acctggaatt tctgctttga tattcgcaaa ggtcgagttt tgctctaact cctgaggact 8701 tatctgatta attctagata gtgcgcggtt aatgaagtct ggacctaggg gtatcaaacc 8761 atcaaagcaa actaatgcag ccatgcggat gaacgactca ccgctgtatt cacccaaaga 8821 cttgacaaaa tccccaatac tgtttccggg aataccgtta atttggcaga aagctactaa 8881 ctcaacaact agttttaagc acaaatctat ggtttgtgct ttatcagctt tgggagtgag 8941 tcggtttaaa aaacccaaaa ggggaacttt ctcacccact ttgtttgcta aagcagctgc 9001 accaagtgct ttatctgtgc gttcaacggt ctgataaagc cacatggctc gttgatatcc 9061 ttgagagcga tcgttgtaca gataaacggc tcgttcgcca atttgttgaa tcagcctttc 9121 gtcggtttcg ccagttacag ttttgatggt attaacaaag ccaaccacat tttgccactc 9181 accgggagca ataaaatcga gcgcgcgcaa tgcagaaacg gtcaaattgc tgcttggtag 9241 ttcatcaact aatttagaaa ttgatttgct cacaaataga cctcttaaat tgttttttgg 9301 ttactcagga tagagaatgt gtctgaacga taggcgcgcg gcgcggctaa ccaaaagtgc 9361 gatggcgatc gcctacggcg gatactcccc accatcccac cttgcgcgac tccagaaagg 9421 agaatcggct tcgccggaca ctccatgtca tagcacctcg gagtgacttt gaagtaatga 9481 gcgcatccac 
gcctgttaga tcttaacttt tcttctacga atgtaagggt tcggaaaacc 9541 tcataaccta tagttgatga gttgcactca aagttaagct gtagctaagg aaaagcgtaa 9601 aagctttcta agtacggtaa cctgtaatgc gaaatctatt aggtaatcgt taaaatcaaa 9661 acaagaaagc gaattgagga gccaccaacg atgcaagtta catacgattc tgataagcgc 9721 aaattgctat catctctgtg tcatggggcg atttttttta gtacagcatt attttcgatt 9781 ggtgttccaa ttataattaa cttaatttct gacgatccgg ttgttaaaag caacgctaaa 9841 gaatctatta attttcactt taatgtttgg ttttgggcaa tactaattgg agttccaatc 9901 gggattttat cttttctcac attcggtctt ggcggaattt tgttcttccc cgttgttgct 9961 tttggctttt tactgcactg gggactgaca atttgggcgc tgttccactg tttcaccaac 10021 cctgatgaac cgtttcgtta tccatttatt tttcgagttt tctaattata ttgacttaag 10081 ataaagtttt ttttggaaaa gggattgaga taaaaatccc tttttatttt ttggatgcgt 10141 agactactct ccccaagcct acgacacatc ggaacctcac cctgcccgac agggcagggt 10201 gaggttttga gcaagatgtg tgtacatagt agccccgcaa gctgtcttcg ggcaaaggtt 10261 catcaaaatc atcaggtact gtaaattctc cagcacacaa accaaatggt cgtaattgct 10321 tgttagtatt tgtaattggt tttagttcgg caattgcttt gtctgcttgg acaatgacaa 10381 agcttttacc tgcttgaact tgattaaggt actttaaggg atctcgtttg atatcgtcaa 10441 ctgtgactct ctgcatcctt tgtacttcgg cgtagtggtg tagagatagt acattcctta 10501 agccaatttt agtataaagt acctacccac aagggtgaag cacttactca gtaatttaga 10561 ctaaattggg aaatattgcc caacactgtt atttaaaaag tcaaaagttt cttatggaac 10621 catcaccaca cgaactaaac caacccaaat caaaagtaga aagtcctacc acgttgctta 10681 agcgctttag tgctggggcg gcagtcggac tgataattgc tgcattaaat tggggtagct 10741 atgcctattt ttttgacaat cccataccgc taaccaatgg cattatattc tgtttgggac 10801 tgacaatcat ctccgggtta atgacactca aatggggcta caacacattg gagtctttgt 10861 tgcagttatt gtcgtagtgt taacagacgt atacggggtt ttctgttcca atgcttacct 10921 aacccagcct acaaatttat tcaaagtttt taggcaggca agatgcctgc cctacaagaa 10981 aatacaatta acccaaagcc gatcgcaact cactccgcgc ttgttccaac gcctctggta 11041 atttactcgc atcgcgtccg cctgcttgtg caagattcgg acgtccgccg ccaccaccgc 11101 cacaaatttt ggcgatcgca cccacaaatt tccctgcttg caaccctttc ttgttcactt 11161 
ctgaactaaa agccgcaact aagctgactt tcccttcttc tggaacagaa cccagcacca 11221 ccgcaccatt gccaagtttt tgcagcaagc gttcggctac agtttttaag gactctggat 11281 caacgtcttc cattttggcg atgagaattt tataatcgcc gatggactcg actgtttgca 11341 gcaaactatc tgatttggcg atcgcaagct gtgacttaag agtttctaat tccttctggc 11401 tggttctaag ttcgttttgc agagttgtaa ttctgtcggg gagttcttcg ggtcttactt 11461 tgaagcgatc gctcaaatct ttcacgactt tatcccgtac attcaggtaa tctagtatcg 11521 cagcaccaga cacagcttca atccgccgga ctcctgacgc cacaccagct tcagagataa 11581 ttttgaaaac gcctatctcc gcagtattac ttacgtgcgt cccaccacac agttccatcg 11641 atacaccagg aaagtcaatc acgcgcactt cttcgccgta tttttccccg aacatagcga 11701 cagcacctct ggcttttgct tctgctagag gcaaaatttc cactttggca gaatgtgctt 11761 gggctatcca agagttcacc agttcctcaa cttggtgtat ttcatctggt gttaaaggac 11821 gcggacagtt gaagtcaaac cgcaacctgt caaatgaaac gagggaacct gcttgcgata 11881 tggagtcatc aacaattttt ttcaacgctg cttgcagtag gtgagttgcg gtgtggttcg 11941 cttggagacg acgacgacaa gctttatcaa tttgcgcggt cacagcatct cctactcgga 12001 ttgtaccgcg ttcgatgcgt ccgaagtgca caaagaaatc agattctttt ttcacatcgt 12061 gaattgtcac gaggacacca tcaccagaga gataaccgcg atcgccaatt tgtccccccg 12121 actctgcata aaatggagtt tggtcgagga caatttgcac gtctgttccg gtttctgctt 12181 cttcttggga gacgccacca atcaacaaca cttcgacttt ggatgtcgcg acaggttggg 12241 tgtatccgat aaattctgtc gcttggatat cttctgctag cttgtccaga gaaccttgca 12301 ccgttaagtc gattgttttg tgtgcttcct tggcgcgttc tacttgcttt tgcatttcga 12361 catcaaaacc cgcctcatca accgtgaggt tattttcagc agcaatttct tgagtcagtt 12421 ctagagggaa accgtaagta tcgtacaagg tgaaagcact ttccccacta atttgggttg 12481 cgcctttgct ttgcacctgt tggataattt ctgcaagcag ttcttcaccg cgttccaaag 12541 ttttcaggaa gcgagactct tctcgttgca gttctgcttt gatggcgttt tctctttgcc 12601 ggacgttggg gtaagctgat tcggagaggg cgatcgcagt ctcagcaact tgggtagtaa 12661 actctccacc aattccaatc aatcttccat gacggacaac ccgacgaatc aggcggcgca 12721 gaatataacc tcgtcccacg tttgaagcgc ggatttcatc ggctatcatg tgaacgactg 12781 cacgaatatg atctccaatc acttttaagg agactttggt tttctcgtca 
gctttggtgt 12841 aatctatccc agcaatttct gctgctgttt tgataatcgg gaaaaccaaa tcagtttcgt 12901 agttgttggg aactccttgg agaacttgcg ccatcctctc caaacccatc ccggtgtcga 12961 tgttcttgtt cgccagtggt gttaggttcc cttcagcatc tcggttgtat tgcatgaaga 13021 cgaggttata aacctcaata aatcgggtgt cgtcttctaa atctatgttg tcatcgccac 13081 gttcggggtg gaaatcatag tatatttctg agcagggacc acaaggacca gtagaaccca 13141 ttgcccaaaa gttgtcttcc ctcaaacgtt taattctggc ttccggaaca ccaattttgt 13201 cgcgccagat tccaaaagct tcgtcatctt cctcaaagac actaacaacg agttgttctg 13261 gtggtaagcc aaagactttt gttgacaact cccaacccca agctatggct tgttctttga 13321 aatagtcccc aaagctgaag ttccccaaca tctcaaagaa cgtgtgatgc cgtttggtgc 13381 gtccgacatt ttcgatatca ttggtgcgga tgcatttttg ggaagtcgtc gcccgcttga 13441 attctggtgt ccgctgtcct aggaagatgg gtttaaatgg taccatcccc gcgatcgtca 13501 gcagtacggt tggatcttct ggaacgaggg aagcactcgg gagaatttgg tgtccccgtt 13561 gggcgtagaa gtcaaggaat aaggcgcgaa tctcgtttcc gctcttgtat tggggatgaa 13621 aagacatggg tatttacaaa aatagtgtca actaataaag tttctgagtt ccgagtcttg 13681 agcaataaat tataagttct gagaaaatct tgatttattt tctcctaatt gtcttgattc 13741 tactcagcac tcattaagta cgtttatata tttttgcatt ttgtttcagt ttacgaccac 13801 taacctttgc caaaagcgca gcttatgtag gtattgtgtc gtgtcacagt cacctaactt 13861 tagtaggatg agcgatcgcg ctgacgatga ttacaatcaa agctgagaaa aaaatcaaaa 13921 ttccctaaaa gaggtaaatt atggcgttaa aactaactgt accaaatatt tcttgtcaag 13981 actgtgcaac aactattacc gagtccatcc atacaatgga acccgacgcc aaagtggatg 14041 tggatgtgga agcgaaaact gtgactgtag aatctaaagc ttctgaagag tctatcaaac 14101 agagtattgt tgcagccggt taccatattg aaggttatca ataactgctc tgacaaaaga 14161 atgcgacaga tatgtagtag cgcatccaga ctaaaatatt gggtaaacaa gcttgcaaaa 14221 actctcatcc tagctatcct caacccaaga ttttaagtta gacgtgctag taggttgggg 14281 aggaaacttc tgtctggaga gtacagtacc tgccgtgggg tctcccccta ctccttatct 14341 ccacagggaa cctcattcat ccccacggag tgggaatccc gaagcccctg cttacggtga 14401 tcagatttct tccgtgacat actcctacac tgactgagta cagtacagtg taggcttctc 14461 agcgactccc ggtatcccgt cggacattaa 
ctgagcttta agaacgcgat gccccagcgc 14521 tctatttctt atatttattg cagcattatg gtcgcggtca agactacaac cacattggca 14581 ggaatgccat ctttcatgca gcttcttagg aaccttgacg ccgcaactag aacaatcttg 14641 tgacgtattc ttaggattta cagctattac caacaaacca gcattttcgg ctttgtttga 14701 tagtattgac aggaaacttg accatcccgc gtcatgcata gattttgcta gccgcgtcct 14761 tgaaagtcct ttaacattta agtcttcatg aacgacaaca tcatattttg acaacagttg 14821 ttttgctgtt ttaaagtgga aatctttgcg tttatcagca actttcttgt gttgcttgcc 14881 caactgtttt actgctttac gtttacgatt acttcctttt ttacgccgcg ataccctttt 14941 ttggataact cgtagacgtt tctgtgcttt acggtaatgt tgggggatag cgacaggttc 15001 accttcagaa gttgtcaaaa attctttcaa ccccatgtca atccctgtta ttgaatcagg 15061 gttgatgtca ggtttaactg taggaactgt taggtcttcg aggcttaaag tcacatagta 15121 gccatctgct tttttggtaa cagaaacagt ttttaccttg aatccagcag ggatagaacg 15181 atgcaagatt agcttaatcg cgcctagttt tggcaaagtg atcttgccac caacaatgca 15241 attttccttg atcctgggat aggaaaaagt tcgataacgt tcacgagact taaacctagg 15301 tctaccacta cgtttaccgt tactgtcgcc gcttaagaag cgttcaaaag ttttcttaac 15361 tcttcctacg acatcttgca gaacatccga atacacttcc ttgtaatgag gatgagtctt 15421 ctttaattct ggaagcgttc ttttttgacc ataataatct ggattgtctc gtaactctgg 15481 gagataacaa ataagaggac aagcgttgac aggacaacga ttttgttcat accaattaaa 15541 cctgtccgcc agcagatagt tgtattgagc gcaaagcata gacaaccatt tgtctatatc 15601 ccttgcttgt tgttttgttg ggcgtagttt gtactgatat gcagtcctca tttgttcctc 15661 acctcctaac acatctatgc taacattaat gagggaacat tggtggaacg tatgaaaaac 15721 aaagagttaa aaatacggat tagtgaacgt aggctcaata aagtacgctt atatgcagct 15781 cacgctgata agacaatgac ccaagtgatt gaggagctta ttgactcgtt aaaaatcccg 15841 gagattgcta atcaaggcac tgtccgtttc actcttcccg caaacagtga aatcgactca 15901 acggacgtac cttgactcgc gccggacatt cctctcaccg ctcatctgag tacagataga 15961 gcgggaggct cctgtccgaa atagctgaag aatttcccta attatttgat gttttcgcca 16021 aagtcagatg ctctcctttg tgcttcctca actccggcat tcagtgcttg gtacgctgtt 16081 tcgtaacagc cgaacaggca tgagtcaaaa tcggcgatgg ggaatgggtt gcactctaat 16141 acgttatgga 
tagcaatcgc ataagaatat ttctctttag ctaaagggtg aaaatgtacc 16201 gccacgaacc acggtatatt cattcccaac tcaaatgtag tcccatcttt ttgcaggatt 16261 ggtttgtcca ttgtcgcttg aacccctcgt ggctaagtga tgccaggcgc tgacctaata 16321 tagaagcgaa cagggtatct ttcttaatta tgtgcggaaa gcaaagcgtc tgtgtgagag 16381 aatcgtcatg aaattggcca acaaacagat gcttcctagc accaataaaa tgtatgtggg 16441 agccataacc actcccatca taaaccaagc aaatacatgc agtatcatca atagagcaaa 16501 gaaactagca attccagcac tttgccatat ctgaatagca ggacgactca ataatgtgag 16561 aattattgtt gataaggtag aaatcaggtt ggctggaatg agaaaggcac aaatgccaat 16621 gcagtaagtg cttgagaatg ctgtcaacgt gttgaaatct aacattaatt ttcctgggtt 16681 aagttgccaa attttatata gatttttatg gatatttgta cttgtatcct aaagatttgt 16741 tggactttca aatatttgta agttgataac cgaaactact ctgtaaattc aacccaagtg 16801 aagatccact aaaaatttga accgttaaga agttggtggt tcagaaaacg aaactgatca 16861 aaaagggcta atcaatagaa ggcttctaca ttgtatcgga agacgattaa ggcgtgcttg 16921 ccctgctcct ctttcagtgc ccaaaagata gtgtcagata cttctggtga aacaatagtt 16981 ttgatttcaa tgttagtatt gtagcctgct aaatctgcca agcgtcttcc atgcccgccc 17041 tcaccttgta cttgactaat ggtgtaaccg ctgacactgt ggctttttag tagcttaaca 17101 atacggtctt tgagtactgt ctcaccaatg atagtgatca ggacggcagg ttcgacacca 17161 aaggctaagt ttgttatgga gaactccaaa caataggttt tggcttctaa agcttaagtt 17221 tttatccgca gtccaaatgt tgtatcaaaa aggctcaagc attcttgcct gagccactgc 17281 taaataaaat taatttgcta tattaatgag ttagctcaaa gagctaaatc tatcaagctt 17341 agtagcggtt acggaagcta ttatttcccc cccgattacc accaccaaat gaacctcttt 17401 cttctctagg cttagcctta ttaactttca ggtcacgtcc catccactca gcaccatcaa 17461 gagcttcaat ggcagctgtt tcttcagcat ctgtacccat ttccacaaaa ccaaagccgc 17521 gcaaacgacc tgtttcacgg tcagtaggta gctgaacccg ctttacagaa ccatattctg 17581 cgaaaaccgc attcagactt tcttctgtaa cttcgtaaga aagattgcct acataaactg 17641 acatagagtg tctccgaaat cataagtgtg tagagattta aatttcggag aaaagtttgt 17701 aaataccaaa aacaaaaaac ctgtcaatac taaaaacaaa cgctgttaac cgaattaact 17761 ctcacctccc atcatgacat agcagacaac ttttagggag gaagttggaa aaatcgttat 
17821 ataaagttac ataagcaatt gatagctagg tatgatatgg cacaccttgg aggcataatt 17881 tctagagttt aaatttgatg tagacactgg ttgagacgga aagctccaaa aacgaatcag 17941 acccttccta ttctttgttt tgcttggaag cttcagcctg caacgaagta agtccaaggg 18001 tacttcacca catcatcatt tatagttagt cttgtgttac attagcatta ctagcactcg 18061 caaagtattg aacaattgag aggaaatcat tatcgtcatt caaaagcaac tcattaactc 18121 gcaaatcaag tcgcctcaag tcttcttgat tgaccatgag aataataacc gtggtctgac 18181 cgatacacat gaagctctac agctagcaca gagcctagag ctcgacctag ttctagtctc 18241 ggagggcaaa gacgctccag ttgcgaagat tcttaactat ggtaagcttc aatatcaaaa 18301 gaaaaagcgt cagggacaga gtgctagacc tacggtaaag gaagttcggc tgcgtcctaa 18361 cgtgggtgtg gctgattaca acttacgcat tgagcaagca agtcagtggt tgagtaaagg 18421 cgattcagta aagtttgtta ttcgtttacg aggacgagaa aatcaatatc gtgaacaggc 18481 tggagaaatg ttagagcgga ttgtaactgc tctgagccaa gtgggtaaag tgcagtcgct 18541 tgataaacgc tcactagttg ctcaagtcat tcctgcctaa attaattttt taggaatatt 18601 tagtcaatcc aagtagccag aggaactgct tagatacgct ggctactgta atgctaccgt 18661 gtgctgtaac taaatgcacc aaacgcaagg ggcgtgggag aagctatatc acacgggttt 18721 tgctaatgca gatcagggat gggctttaga gacttagcaa tatattaacg cttcacataa 18781 aaccatagtt gcaagcttga aagcatgacc ctgaggaatg gaacaattga agatggcagg 18841 gatacttctt tgagaatcca agctttgagt acttgcagaa ctactgggag atggttcctg 18901 caagtgcaga tagttattaa gaagttgttc actaagtttc cgcactgagg gattacttgt 18961 gtgagtggag agctagttga ttggaatagg tagcgtttaa ggtgccacag attttgcaaa 19021 aatcttgctt gtctataatg ttgtccactt atcttgattc acagattgca ccaataaaaa 19081 ttgatgttat ctcatcactc agaattcaac aaaaagtcct ttattttagc gacaaaagcc 19141 aataaaaata atgtagctta cccttgtata tttttagttt gtacggtttt ctacggagag 19201 gtgccagata tgagttgaac cattggcgtt gcataattga agtatgaatt acgcgatcgc 19261 ctcaagcttg tgggcgctat gcgcccacag cgccacggaa ccgtgcttga gataatatat 19321 tactaatcga atagacagtt tcagcatttc ctcggactta gagtaacaaa aagttttacg 19381 atgtagtcga gccagataat gcctaaaacg gctattttcg ccttcgactc tcgtcatata 19441 agttttgctg acccttacgg gttcgccagt cgcttcaacg 
gggggaaccc ccgcaacgcg 19501 ctggctcaca atatggtctt cattactgat aaaacaaggg taaactgggt aaccatctgt 19561 cacataaaaa taagaatgcc aacaacttat aatccaatac agttcagtta agcctcaaag 19621 tcaggtaaag taagcattat tcgcaagca // LOCUS NODE_1675_length_19413_cov_5.283552 19413 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 19413) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 19413) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..19413 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 177..542 /locus_tag="DP116_14645" CDS 177..542 /locus_tag="DP116_14645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869072.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14645" /translation="MKPPKQSSGPPFGKENQDPEFEQELLKVERALVALKERYNQVQA DKASQKELQQRLGRLRHSKLPNVKAELKQIQQQLEELELNLESQLFSWDSVKEPFWQA VRFGGLGVIIGWLLKSVAG" gene 544..1935 /locus_tag="DP116_14650" CDS 544..1935 /locus_tag="DP116_14650" /EC_number="6.1.1.22" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997802.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="asparagine--tRNA ligase" /protein_id="PRJNA477356:DP116_14650" /translation="MKNRRIAEILRSGEPDEFVVVQGWVRTKRELKGFSFIEVNDGSS LASLQVVLNENLPDYDDILKKLNTGASVEVAGVLVGSLGKGQRIELKTDKIKVFGEAD PETYPLQKKRHSFEFLRTIGHLRSRTNSFGAVFRVRNACATAIHQFFQERGFLWVHTP VITANDCEGAGELFSVTSLDLKNVPRTQTQAIDYSKDFFGKPTYLTVSGQLEAEVMAM AFTNVYTFGPTFRAENSNTSRHLAEFWMVEPEMAFCDLKEDMNLAEAFLKHIFKYVLE TCPEDMEFFNQRIDNTVLATADNIINNEFERITYTEAIKLLEKADVQFEYPVNWGLDL QSEHERYLCEQLFKKPVIVTNYPAQIKAFYMRLDEDEKTVSAMDILAPKIGEIIGGSQ REERLDVLERRILAQGMKPEDLWWYLDLRRFGTVPHAGFGLGFERLVQFMTGMGNIRD VIPFPRTPESAEF" gene complement(2050..2538) /locus_tag="DP116_14655" CDS complement(2050..2538) /locus_tag="DP116_14655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747076.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_14655" /translation="MHNQQFLIRDANVSDVSTIMELIRLKAEFDGCLDFVEATPKKLE DTLFCENPLAFVLLVEIDENPIGFATYHQIYSTFLAQPGIWLDDLYIKAEYRRLGIGE ALIKYLCQITKKIGGGRIDWIVATHNDPAIQFYEKMGAQIIQRVRLCRFNREAIMHHV CM" gene complement(2567..3898) /locus_tag="DP116_14660" CDS complement(2567..3898) /locus_tag="DP116_14660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Na+/H+ antiporter NhaC family protein" /protein_id="PRJNA477356:DP116_14660" /translation="MNSLFPLVFCFVILLCSVIKGYFVAYPLLLSLVILLVTFYYQGF AVKSLIKMGFTSSQKAFPIIKILLLIGAVTAVWMAAGTIPALVYYGTQFINPRYFILS AFVLSCFISVLLGTSFGTVSTVGVALMIMAKEGDINPHLVAGAIIAGAYFGDRCSPMS SSAHLVASITKTKLHRNLRAMISTAWLPLMASSLIYLIFSISNPVEIRNSDFISEIPK LFNINPLVLLPAFTVIVLCLFKVEVKLTLLASLGIGFFVGIFSQGYSLLKMINFAWLG FNLEQRMDLSEVLTGGGIFSMLRVSLVVILSTSLSGIIVGTKTFASVENILKRASSRS RLFFGTTTVGLASAAFGCTQTLAILLTHQLVKEKYEQERLDNYQLATDIENTAVVLSP LIPWNIAGLVPATLLMTDSGFIPYAVYLYLIPVWNWVRFKLTESKMQDEFE" gene 4072..4941 /locus_tag="DP116_14665" CDS 4072..4941 /locus_tag="DP116_14665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009784149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LysR family transcriptional regulator" /protein_id="PRJNA477356:DP116_14665" /translation="MLKISQLRAFVAVADHGNFSLAALELGLSQSTVSHAIATLEEEL GVILFFRGRQGATLTPIGGEVIQEARQVLHLLEVIQEKANLEKGLQAGQVRVACVRSI ATHVLPEVIARFREKFPKISVVITECDRYAEVEQALRENHADVGFTLLPTTTEFDTRE LFRDEFVALLPPKSIAADESLSWEQLAGYPMIVNIRSPQHNKTVQTHFLKFGQTLKVD YEVREDSTILSMVKQGLGATVMARLAAEPIPEIIETRSLPVPLERIIGVAILGDALLP RGVFAFLDVLKVG" gene 5161..5469 /locus_tag="DP116_14670" CDS 5161..5469 /locus_tag="DP116_14670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315945.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NIL domain-containing protein" /protein_id="PRJNA477356:DP116_14670" /translation="MAIPNQQGKSTIDFRDNRRTTTRVTVHIPKELHKQPVISRLISY CGITVNIAAAQLNGHMPQPGNFDLELRGTVFQIASALTYLNELNLEICHPFLTEEEGW " gene complement(5926..6573) /locus_tag="DP116_14675" CDS complement(5926..6573) /locus_tag="DP116_14675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740397.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_14675" /translation="MKEPSMKDHKRLLLIDDDPNLILLVKDYLEFRGYDVSTAENGRE ALDILEQDIPDMIICDVMMPEMDGYTFVEQVRQTERTSWIPVLFLSAKGQSADRVKGL NKGADVYMVKPFEPEELVAQVESSLKQTIRWKEHQAKGGENGSRIQVPFDVQLTPTEL KVVQFVARGLANREIAEELNVSQRTVESHVSNMLGKTNLHNRTELARWAIENQMA" gene complement(6919..7405) /locus_tag="DP116_14680" /pseudo CDS complement(6919..7405) /locus_tag="DP116_14680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457501.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ferredoxin" gene 7652..7963 /locus_tag="DP116_14685" CDS 7652..7963 /locus_tag="DP116_14685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491741.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="co-chaperone GroES" /protein_id="PRJNA477356:DP116_14685" /translation="MAAVSLSVSTVKPLGDRVFVKVNAAEEKTAGGLYLPDNAKEKPQ VGEVVAVGEGKIKDDGARQALDVKVGDKVLYSKYAGTDIKLGTDEYVLLSEKDILAVV S" gene 8086..9726 /gene="groL" /locus_tag="DP116_14690" CDS 8086..9726 /gene="groL" /locus_tag="DP116_14690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chaperonin GroEL" /protein_id="PRJNA477356:DP116_14690" /translation="MAKRIIYNENARRALERGMDILAEAVAVTLGPKGRNVVLEKKFG APQIINDGVTIAKEIELEDHVENTGVALIRQAASKTNDAAGDGTTTATVLAHAVVKEG LRNVAAGANAISLKRGIDKATNFLVDKIKEHARPVEDSKAIAQVGSISAGNDEEVGQM IAEAMDKVGKEGVISLEEGKSMTTELEITEGMRFDKGYISPYFATDPERMEAIFDEPF ILLTDKKIALVQDLVPVLEQVARAGRPLVIIAEDIEKEALATLVVNRLRGVLNVAAVK APGFGDRRKSMLEDIGILTGGQVVTEDAGLKLDTTKLDSLGKARRITITKDSTTIVAE GNEAGVKARIDQIRRQIEETESSYDKEKLQERLAKLAGGVAVVKVGAATETEMKDKKL RLEDAINATKAAVEEGIVPGGGTTLAHLSPALEEWANSNLKAEELTGALIVARALAAP LKRIAENAGQNGAVIAERVKEKDFNIGYDAAKNEFVDLFEAGIVDPAKVTRSALQNAA SIAGMVLTTECIVVDKPEPKDNAPAGAGAGMGGGDFDY" gene complement(9723..10049) /locus_tag="DP116_14695" CDS complement(9723..10049) /locus_tag="DP116_14695" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14695" /translation="MFLTIKHLREKQNEYLEGSSKKRKFAGSLHSPVRVWNGSIAGDE SSAQKSRPSEQDRRLIFVTIFKLHSHALLYPHLSVRFLVEPENLCSDSGNPPCSAQAG GMSKEC" gene complement(10141..10260) /locus_tag="DP116_14700" CDS complement(10141..10260) /locus_tag="DP116_14700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013190555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein J" /protein_id="PRJNA477356:DP116_14700" /translation="MSGSGRIPLWIVATIAGLGVITVVGIFFYGSYAGLGAPT" gene complement(10336..10455) /locus_tag="DP116_14705" CDS complement(10336..10455) /locus_tag="DP116_14705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018399352.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II reaction center protein L" /protein_id="PRJNA477356:DP116_14705" /translation="MERTPNSNNQPVELNRTSLYLGLLLVFVLGILFSSYFFN" gene complement(10465..10602) /locus_tag="DP116_14710" CDS complement(10465..10602) /locus_tag="DP116_14710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997987.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome b559 subunit beta" /protein_id="PRJNA477356:DP116_14710" /translation="MTSGNNVNQPVTYPIFTVRWLAVHTLAVPTIFFLGAIAAMQFIQ R" gene complement(10612..10863) /gene="psbE" /locus_tag="DP116_14715" CDS complement(10612..10863) /gene="psbE" /locus_tag="DP116_14715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748156.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome b559 subunit alpha" /protein_id="PRJNA477356:DP116_14715" /translation="MSGTTGERPFSDIITSVRYWVIHSITIPALFIAGWLFVSTGLAY DAFGTPRPNEYYPSARQELPIVNNRFDAKQQVEQFSNTK" gene complement(10960..11982) /locus_tag="DP116_14720" CDS complement(10960..11982) /locus_tag="DP116_14720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314478.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosynthesis system II assembly factor Ycf48" /protein_id="PRJNA477356:DP116_14720" /translation="MLPIVKFCQRILALFVVILMCVGCSNVVSTSYNPWEVVAVPTNA KLLDIAFTDNPQHGFLVGSNATLLETKDGGATWQPLQLELDDQRYRFNTVSFAGKEGW IAGEPALLLHTTDEGRSWSRIQLSENLPGNPVSIVALGQNTLEIATDVGAIYRTTDGG QNWKAQVEQAVGVVRNIERSPDGKYVAVSAKGNFYSTWEPGLNAWVPHNRNSSRRVEN MGFAENGQVWMLARGGQVQFTDPANPEQWQSAQNPELATSWGLLDLAYRTPDEVWISG GSGNLLRSSDGGKTWEKDRDVEQVPANFYKIVFLNQEKGFVIGDRGVLLKYNPNIAAA TKSEAA" gene complement(12073..12408) /locus_tag="DP116_14725" CDS complement(12073..12408) /locus_tag="DP116_14725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rubredoxin" /protein_id="PRJNA477356:DP116_14725" /translation="MSEQAVETIALDRYECRSCGYVYEPEKGDDKHDIAPGTPFAEIP VTWRCPVCSAKKSAFTNIGPVGKASGFDENLKFGFGVNTLTPGQKNLLIFGALALAFL FFLSLYGLR" gene 12580..12822 /locus_tag="DP116_14730" CDS 12580..12822 /locus_tag="DP116_14730" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14730" /translation="MPIACPLSDLPLSGARSDRPLSAASSKEGTASGEVNAQGFAQGE NPSPELVAVRARNPFPALVWSKLLAPKAVKSHCVCP" gene 12809..13171 /locus_tag="DP116_14735" CDS 12809..13171 /locus_tag="DP116_14735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit 3" /protein_id="PRJNA477356:DP116_14735" /translation="MFVLSGYEYLLGFLIICSLVPALALSASKLLRPSGRNPERRTTY ESGMEPIGGAWIQFNIRYYMFALVFVIFDVETVFLYPWAVAFHRLGLLAFIEALIFIA ILVVALAYAWRKGALEWS" gene 13162..13920 /locus_tag="DP116_14740" CDS 13162..13920 /locus_tag="DP116_14740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748160.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit B" /protein_id="PRJNA477356:DP116_14740" /translation="MVLDTNIEQQKEQQKEQQKERILNPMSRTTVTQDLSENVILTTV DDLYNWVRLSSLWPMLFGTACCFIEFAALIGSRFDFDRFGLIPRSSPRQADLIITAGT ITMKMAPQLVRLYEQMPEPKYVIAMGACTITGGMFSVDSPSAVRGVDKLIPVDVYLPG CPPRPEAIIDAIIKLRKKIANDSMQERDRIKQTHRYYSTTHNLKPAEQIHTGKYLRTE SRFAPPKELTEAIGLPVPPALLTQKVQQEETKRG" gene 13913..14461 /locus_tag="DP116_14745" CDS 13913..14461 /locus_tag="DP116_14745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015154012.1" /note="catalyzes the transfer of electrons from NADH to quinones; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit C" /protein_id="PRJNA477356:DP116_14745" /translation="MAEESKEESKEESKPAPAEEKSLVKAGAVSQWLTENGFDHESLE ADHSGVEILKVEADFLLPLCTALYAYGFNSLQCQAGIDLGPGQQLVSVYHLIKISDNA DRPEEVRLKVFLPRENPRVPSVYWIWKTADWQERESYDMFGIVYEGHPDLKRILMPED WVGWPLRKDYVSPDFYELQDAY" gene 14682..16055 /locus_tag="DP116_14750" CDS 14682..16055 /locus_tag="DP116_14750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012733894.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2252 domain-containing protein" /protein_id="PRJNA477356:DP116_14750" /translation="MLKPVHAILIALTLTLSLTSALQAANPRQSWVENEIYQYNHQFA SQLPQELATKMQKMKASPFAFYRGTAHIFYRDMQTLASSGFVNSSTSAIWLEGDMHMQ NLGGMRDSNGNNVFDTTDFDEGYLGPYVWDLRRMAVSILLAAKENGLSSSDAQDIVRN FLDAYLNKMSDFKGTNDELSYRLESSNTSGVVKDLIQQAASKSRSNFLNKYTQINTSG NRVFQTTSELQPVSSSTYSAINTGMIGYIASIPSSKRYNNNYYILKDIRLKLGSGTGS LGKYRYFLLIEGSSLASDDDQILEMKQETSSAVAIAAPNLLPSSVYGNHEGQRVTTAT KAMLSNTDPLVGYTTVSGITFMLHEKSPYQEDFDYTLLTTKSKFLDAMAYAGKVVAKN HAISDKDYDTSIVPYSVDKEVTDVVSGNKAAFKNEIVNFALDYATQVEYDYASFVDAY NRNIPLY" gene 16322..17143 /locus_tag="DP116_14755" CDS 16322..17143 /locus_tag="DP116_14755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314459.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14755" /translation="MRTDKIFYSLFQAFPSIFFDIIGDTTINPNTYEFVSVELKETAF RIDGVFVPATETTKQPVYFVEVQFQLDSNFYRRFFAEIFLYLRQNTSVNFWRAVVIYP NQNFDPDDQQPYQLLLESPQVQRIYLDELGTAAENSLQLAVVQLIIEREETAVERGRE LILRARQQLADETIRQQIVELIETILLYKFTQLSREELAAMLGIDDEFKKTRMYQSIK DEALEEGRQEGLQEGKLQAKLEAVPRLFALGLSVEQVARALDLTLDQVQQVIQSP" gene complement(17272..17517) /locus_tag="DP116_14760" CDS complement(17272..17517) /locus_tag="DP116_14760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14760" /translation="MVQTAGNAQTQNPVSNLEYDFLTVLHNKAEAIKAYETYINDAQQ AGSQPCVELFEKLRQSDIQHAQEIRHHLQEVMQKGKM" gene complement(17704..18246) /locus_tag="DP116_14765" CDS complement(17704..18246) /locus_tag="DP116_14765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14765" /translation="MKKRLKSQFLFSERWLDRHWIVRNMEAFQDLIVIVLCLGLFAIM VMQLWVFFTTLEIPVDFKHVTAKILFILILVELFRLLMVYLEEHSIAVGVAVEVAIVS VLREVVVHGALEISGVQTASICGLLVVLGGLLFVCAKTPHMDYMSADDTHSSPVKQIE NEQQEKERELKYPRHYKDIV" gene complement(18333..19277) /locus_tag="DP116_14770" CDS complement(18333..19277) /locus_tag="DP116_14770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314456.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Tfp pilus assembly protein PilF" /protein_id="PRJNA477356:DP116_14770" /translation="MRVQLCQYRLPALVGLILVVQSLLSPSSSAFPPPAVPLSPHLLK LAQYIDSSAQDYLTQGLEAIQVGRVEDAIALFRKATELDRNLAAAHYNLGLALRQAGQ LQPAADAFYKATQSDPQFALAYANLGGSLLEGSNWQQANDYLQRAIQLDPKLGVAYYN LGLLREQQRDCDRAIASYKKAIELSQNAPEPAYHMGICYLQQGKADKAKDAFRKAVKI NPKYSEAYYNLGAILFNQKKLKDALEAFRKSAEANTNNANAYYSAGLVFIQLKQYPQA AEVLQYARDLYKAQKNFQWAKNAENLWQQLSNQNYPPR" BASE COUNT 5622 a 4088 c 4195 g 5508 t ORIGIN 1 gcgttcgggt ctcccgactt gaagcgtctg gcgtggtttc ccccatgagc gactgcgtat 61 gcgcaaagcg cacgcccaaa gggctaacgc gaagcgtctc cgttcggcac ggggccgtgc 121 ccgaagggct caggagatac ccggagggtg caagatctga gtctactaac ttgtacatga 181 agccgccaaa acaatcatct gggccaccat ttggtaaaga aaatcaagat ccagaatttg 241 aacaagaact tttgaaagtt gagcgtgcgc ttgttgcttt aaaggaacgc tacaatcaag 301 tgcaagcaga caaagctagt caaaaagaat tacaacaacg tctcggacga ttgcgtcaca 361 gtaaattacc aaatgtaaaa gccgagttaa agcaaattca acaacagttg gaagaacttg 421 aacttaactt agaaagtcaa ttattttcct gggatagcgt taaagaacct ttttggcaag 481 ccgtccgctt tggtggtttg ggcgttatca taggctggtt attaaagtct gttgctggat 541 aatatgaaaa atcgacggat tgcagaaata ttgcgaagtg gtgaacctga tgagtttgta 601 gtagttcaag gctgggttcg cacaaaacga gagttaaaag gattttcttt tatagaagta 661 aatgatggct catccttggc gagtttgcaa gtggtactca atgagaattt gccagactat 721 gatgatattt taaaaaagct caatactggt gcttctgtgg aagtcgctgg agtgctggtg 781 ggatcacttg gtaaaggaca gcggatcgag ttgaaaacag acaagataaa agtctttgga 841 gaagctgatc ccgaaacata tccgctgcaa aagaaacgcc attcctttga gtttttgcga 901 actattggac atttgcgatc gcgcaccaat tcctttggtg cagttttccg agtcagaaac 961 gcctgcgcta ctgctattca tcaatttttc caagaacgcg gctttttgtg ggtacacact 1021 cccgttatca ccgctaacga ctgcgaaggt gcaggtgaac ttttcagcgt caccagtttg 1081 gatctcaaaa atgttcctcg tacccaaacc caagccatag attatagtaa agactttttt 1141 ggcaaaccaa catacttaac agtcagcggt caactcgaag ccgaagttat ggcgatggcg 1201 tttacgaacg tctatacttt tggtcctacc ttccgtgcag aaaactctaa cacctcgcgc 1261 cacttagcag aattttggat 
ggtggaacca gagatggctt tttgtgatct caaggaagat 1321 atgaatttag ccgaggcgtt tctcaaacat attttcaaat atgtactaga aacttgccca 1381 gaagacatgg aatttttcaa tcagcgtatt gataatactg tattggcaac ggcggacaac 1441 attattaata atgaatttga gcgcataact tacacagaag ccatcaaact attagaaaaa 1501 gctgatgttc agtttgaata tcctgttaac tggggtttgg atttacagtc agaacacgaa 1561 cgctatcttt gcgaacaact cttcaagaag ccagttatcg ttaccaatta tccagcgcaa 1621 atcaaagcct tttatatgcg cttagatgag gatgaaaaaa ccgtttctgc gatggatatt 1681 cttgcaccca aaattggtga aattatcggc ggttctcaaa gagaagaacg attagatgtc 1741 ctcgaacgtc gcatacttgc tcaaggaatg aaaccagaag acttgtggtg gtatctcgat 1801 ttgcgccgct ttggaactgt cccccatgct ggttttggat taggttttga acgacttgtg 1861 caatttatga cggggatggg aaatatcaga gatgtgatcc cgtttccacg gactccggaa 1921 agtgctgagt tttaactgta tttaaaaacc cgatttatga tcgaatcggg tttttttagt 1981 tgggtattac tagttgtgtt aatttcggca ccagaacgta cgattatttt tgtctgtaaa 2041 aatgcacttc tacatacaga cgtgatgcat aatcgcctcc ctattgaaac gacataatct 2101 tactcgttgg ataatttgtg cgcccatttt ttcgtaaaac tgtattgcgg gatcattatg 2161 agtagcaact atccaatcta ttcttccacc tcctattttt ttggtaattt gacataaata 2221 ttttatcaac gcttcaccaa ttccaagtct gcgatattca gctttgatat ataaatcatc 2281 tagccagatt cccggttgag caagaaaggt ggagtaaatc tggtgatatg tggcaaatcc 2341 aattggattt tcatcaatct caactaacaa aacaaaagca agtggatttt cacaaaataa 2401 agtgtcttct agtttcttcg gtgtcgcctc tacaaaatca agacagccat caaactctgc 2461 cttcaatcta atgagttcca taatagtaga aacatcactg acgttggcgt ctcgaattaa 2521 gaactgttgg ttgtgcataa catcaattat ttagacataa gtaaagttat tcaaactcat 2581 cttgcatttt ggattctgtt aacttaaatc gaacccaatt ccaaactgga attagatata 2641 aataaacggc ataggggata aatccagaat cagtcattaa tagagttgct ggtactaatc 2701 cagcaatatt ccaaggaatg agaggtgata gtacaacagc agtattttca atatcagttg 2761 ctaattgata attatctaag cgctcttgct catacttttc ttttacaagc tggtgagtca 2821 ggagaattgc aagcgtttga gtacaaccaa aagctgctga tgctaaccca actgtagtgg 2881 ttccgaaaaa taaacgactt ctggaagaag ctctcttgag aatattctcc acacttgcga 2941 atgtctttgt tccgacgata attcctgata 
gagaagttga tagaatgacg actagcgaaa 3001 ctctaagcat cgaaaagatg cctcctcctg tcagaacttc actcaaatcc attctttgtt 3061 ctaagttgaa gccaagccaa gcaaaattta tcattttcaa caaagaataa ccttgggaaa 3121 atatccctac aaagaatcca attccaagac tagcgagtaa tgttaatttc acttctactt 3181 tgaaaagaca aagcactata acggtaaaag ccggaagtaa cacgagtggg ttaatattaa 3241 ataatttcgg aatttcggaa ataaaatcac tatttctaat ttcaacagga ttgctgattg 3301 aaaagattaa ataaatcaag ctagaagcca ttaaaggcaa ccaagcagtt gatatcatag 3361 ctcttaggtt tctatgtagc tttgttttgg taatactagc aacaagatgc gcactagaag 3421 acataggaga acacctatct ccaaaatagg ctcctgcaat aattgctcca gcaactaaat 3481 gagggttaat atctccttct tttgccataa tcatcagggc aactcctaca gtactaactg 3541 tgccaaagga ggtgcccaat agtacagaaa taaaacagga aaggacaaag gctgaaagaa 3601 taaaatatcg aggatttatg aattgagttc catagtaaac taaagccgga attgttcctg 3661 ctgccatcca cacggctgtc actgctccaa ttaataagag aattttgatg attggaaaag 3721 ctttttgact actagtaaaa cccattttta ttaaagattt tacggcaaag ccttggtaat 3781 aaaatgtaac aaggagtata accagagaaa gtaaaagtgg ataggcaaca aagtaacctt 3841 tgataacgct acacagcaag ataacaaagc aaaaaactag aggaaataat gaattcatct 3901 tcttgataat tccattctct tttctccaac actaaacagt ttagtcattc cgaaattaca 3961 aaacaaagta tttcttagat accttgtatc gataaaatcg atggatgaag ggtaaagact 4021 tttgcatcta tcaaaatgtg gctttcctgg atgaatgaag tcaaagacag tatgctaaaa 4081 atctctcagt tgcgggcttt tgtggctgtt gctgaccacg gtaacttcag cttggctgct 4141 ttagagttgg ggttatctca gtcaacagtt agtcatgcga tcgccacact tgaagaagaa 4201 ctaggagtta ttctcttttt tcggggtcgt cagggtgcaa ctttgacgcc aattggaggg 4261 gaggtgattc aggaggcgcg tcaagttttg catttattag aggtgattca agaaaaagcc 4321 aacctggaaa aaggattaca ggctggacag gtgcgggttg cttgtgttcg cagcattgca 4381 actcacgtgc taccagaggt gattgctcgt tttcgcgaga agtttccaaa gattagtgtg 4441 gtgataaccg aatgtgatcg ctatgcagaa gtagaacaag cactgcgaga aaatcacgcg 4501 gacgttggct tcacgttgtt accgacgaca acagagtttg acacacggga gttattccga 4561 gatgagtttg ttgcactgct acctcccaag tcaatagctg ctgatgagtc gctgagttgg 4621 gaacagttag cagggtatcc gatgattgtg aatatccgca 
gtcctcagca caacaagaca 4681 gtccagactc attttttaaa atttggtcaa actctcaagg tggactatga agtgagggaa 4741 gattcgacga ttctcagtat ggttaagcaa ggattgggag caactgtcat ggctcgatta 4801 gcagctgaac ctataccaga aatcatagaa acgaggagtt tgccagtacc gctagagaga 4861 attatcggag ttgctatcct tggagatgct ttattgccga gaggtgtctt tgcttttctg 4921 gatgttctca aagttggttg acgtatttgg catcactgct acaccctaaa ccagattgga 4981 acaagcagca atgatccaag acattttcat ccttgggcta gtttgcatag catgcctaag 5041 gcatagccac aagcctttga acctaccgcc ttataaaaag tagtgtcaag atttttgttg 5101 cgggagaaaa aatagttgta acataaaacc cgcatctaaa aattacaggg gaaagctatt 5161 atggccattc caaatcaaca gggtaaatct actatcgatt ttcgagacaa ccgacgcacc 5221 acaactcgcg ttacagttca tattcccaag gagttgcata aacaaccagt tatttcacgc 5281 ctcatatctt attgcggcat cacagtcaat attgcagcag cccaactgaa tggtcatatg 5341 ccgcaaccag gtaactttga tttagagcta cgaggaacag ttttccagat tgctagtgcc 5401 ttaacctatc ttaatgaact gaatttagaa atttgccacc catttcttac tgaagaagaa 5461 ggttggtaag ttgagtttct agcaataagc aacgagtgct gggctaggta gctactcagc 5521 actcgtcaat accaatcgtc gcccttatag aagcgctctt agcgcgtcta taagtcgctt 5581 aactatgcgc ggcagtcgcc tacagagatt gatacgccac ttgaactaaa gggggagggg 5641 aacccccaat cccaacgcca gttaccaccc ttacggaaag cagttgcggg tctacaacga 5701 aggaaacttc cgtaggcgga aactttctcc gtggaaacat tgaagctcac cgttgtttac 5761 ggatctccgt agatcgagca gtactgtaag gcttaggaga tagcaagtgt cggtggcgtt 5821 tacaacactg tactagacat ctcccgaaat taattatgcg ttacgtgaaa ctcttgtaga 5881 gactgtactg tacggtctct acattgtttt gcaacagagg tctacttacg ccatttgatt 5941 ttcaattgcc caccgcgcta gttcagtgcg attgtggaga ttggttttgc ccaacatatt 6001 ggacacatgg ctttctaccg tgcgttggct gacatttaat tcttcagcta tttcgcggtt 6061 agctaagccc ctagccacaa attgcacaac tttgagttct gttggggtta actgcacatc 6121 aaagggaact tggatgcggg aaccgttttc tcctcctttt gcttggtgtt ctttccagcg 6181 aatggtctgc ttcagagaag attctacttg tgctaccaat tcttctggtt caaaaggctt 6241 aaccatgtac acatcagcac ctttattcag accttttact cggtctgcgc tttgtccctt 6301 agctgaaagg aatagaacgg gaatccaact ggtgcgttcg gtctgccgga 
cttgttctac 6361 aaaagtataa ccgtccattt ctggcatcat cacatcgcaa atgatcatgt ctggaatatc 6421 ctgctctaaa atgtccagcg cttctcgacc attttcggcg gtgctgacat cataaccccg 6481 gaattccaaa taatccttca ccagcaagat gaggttaggg tcatcatcaa taagtagaag 6541 tcgtttgtga tctttcatgc tgggttcttt catagcagtg gcacttatcg cgcttgatcc 6601 atcaaggtct tgaatcaata tcctctctgg tggattaagg aggtgttgtg ctttttccta 6661 cttaacttgc gtttgcttta taaagtccta aaagtgggtc tcactctcaa gaaattctct 6721 acaaacagca aaagtccagt aaggatacgt agagcagtag tatttataat gcgttattat 6781 atatcgcttt gaaagctgcg ggttaaaaaa cactgattaa ataataatta ttcaggttaa 6841 caagtaagtg ttggactcaa gatgctccca aattacaatt aacccacact ttcatttgtt 6901 tactactgtg ccatatcctt agttggatgt taagcttcta taagttgttc acgacaacct 6961 agggaacttt ttggcaaagg atgggaaaac acaatgcata ttcttctacc accttgttgc 7021 ctaagaagtg ttcttgtata atccgctcaa tggtttccgg gattgttggg cgataccgca 7081 cagcatccgg gtaatccttt attatcagtc tagaacagca aactggcaag caattgattt 7141 tatttccaaa gatgcaactg gaattgccat cttgcctttt gtaaagactt agctgtttga 7201 gtcgtttttt taagctttcc caaaatggta aacgagcttg ttcacacaag cagttagaaa 7261 ctgtttggac aacacaaata aaaatgtccc tcggaatgtt cgacagttct aaagcttcta 7321 tagaaatact tagggcttta ttcgctggtg taagcgtggt ttttcgtagc gtatggtagg 7381 actggatcac tgtagtgtta ctcaaatggc ggctccctga taactgttct atatcaattt 7441 cccagctttg caaacaaagt tgcccatata tccatattcg tcatttaccg agtaaaacct 7501 gaaagctgag aaaagtagag tatatgtact tacgtgggtt tgggcaagca aagttcggga 7561 acccgtacaa gagaaaatgt tgccattgca ctttgccttt aggtacatta gcactcagga 7621 gttgagagtg ctaacaaggt gagaatttgg tatggcagct gtatctttaa gcgtttctac 7681 agttaaacct ctgggtgatc gcgtattcgt caaagtgaac gcggctgaag aaaagaccgc 7741 tggcggtctg tatttgcccg acaacgcaaa agaaaagccc caagtcggcg aagttgttgc 7801 cgtgggtgaa ggcaagatca aagatgatgg tgctcgtcaa gcgttggacg tgaaggtcgg 7861 agataaagtt ctctactcca aatacgctgg taccgacatt aaacttggaa ctgacgaata 7921 cgtcctgctc tctgaaaaag acattctggc agtcgtttca tagtctttta gtcatttgtc 7981 atcagtcatt tgtcatgagt tgttggcaaa ggacaaaaaa caaatagcat tctttgtagt 
8041 tttctgctgg tagttttttt tctactaaac attctttgga caattatggc aaagcgcatc 8101 atttacaacg aaaacgcacg tcgtgccctt gaaagaggca tggatatttt ggctgaggct 8161 gtcgctgtta ccctcggtcc caaaggtcgt aacgtagtcc tagagaaaaa gtttggcgca 8221 ccacaaatca tcaatgacgg tgtgaccatt gctaaagaaa ttgaattgga agaccacgta 8281 gaaaatactg gcgttgctct catccgtcaa gcggcttcca agacaaacga tgctgctggt 8341 gatggcacca caactgcgac tgttttggct catgcagtag tcaaagaagg cttacgcaac 8401 gtcgcagctg gcgcaaatgc tatttcgcta aaacgtggta ttgataaggc taccaacttc 8461 ttggtagaca aaatcaaaga acacgctcgt cccgtagaag attccaaagc aattgcccaa 8521 gttggttcaa tctctgctgg taacgacgaa gaagtcggtc agatgattgc cgaagcaatg 8581 gataaggtgg gtaaagaagg tgtgatttcc ttggaagaag gaaagtccat gaccaccgaa 8641 ttggaaatca ccgaagggat gcgctttgac aaaggctaca tttctcccta ctttgcgacc 8701 gatcccgagc ggatggaagc tattttcgat gagcctttca tcctgctaac cgataagaaa 8761 atcgccttag tacaagacct cgtacccgtt ttagagcaag ttgctcgtgc tggtcgtcct 8821 ttggtgatta tcgccgaaga tatcgagaaa gaagctttgg cgactttggt tgtcaaccgc 8881 ttgcgtggtg tgctgaacgt ggctgctgtg aaggctcctg ggtttggcga tcgccgcaaa 8941 tccatgctag aagacatcgg tatcctcact ggtggtcaag ttgttaccga agatgctggt 9001 ttgaagctgg acaccaccaa gctagattcc ttgggtaagg ctcgccgcat cactatcacc 9061 aaagacagca cgaccattgt tgctgaaggt aacgaagctg gtgtcaaggc tcggattgat 9121 caaattcgcc gtcaaatcga agaaaccgaa tcttcttacg acaaagaaaa gctgcaagag 9181 cgtcttgcta agctcgctgg tggtgtcgcc gtcgttaagg taggcgctgc gacagaaacc 9241 gaaatgaagg acaagaagct gcgtttagaa gacgctatca acgctaccaa agctgctgtg 9301 gaagaaggta tcgttcctgg tggtggtaca acactggctc acctctctcc tgcactggaa 9361 gaatgggcaa acagcaacct caaagctgaa gagttgactg gtgcgttgat tgtagctcgc 9421 gctttggctg ctcctctcaa gagaattgct gaaaacgctg gtcagaacgg tgctgttatt 9481 gctgagcgtg tgaaggaaaa agacttcaac attggctacg atgctgctaa gaacgagttc 9541 gttgatctgt ttgaagctgg tatcgttgat cctgccaaag tcactcgttc tgctctgcaa 9601 aacgctgcat ccattgctgg tatggtgtta accactgaat gcattgtggt tgacaagcct 9661 gagccaaagg ataacgctcc tgctggtgct ggcgctggta tgggcggcgg cgacttcgat 9721 
tactaacatt cttttgacat ccccccagct tgcgcgctgc atgggggatt tccagaatca 9781 ctacacaggt tttctggttc tactaaaaac ctcaccgaaa gatgagggta aagtaatgcg 9841 tgtgaatgca atttaaatat tgttacaaaa attaaccgcc tatcttgttc agatgggcgg 9901 cttttttgtg ctgatgattc gtcacctgct atacttccgt tccataccct aacgggtgag 9961 tgtaggcttc cagcaaattt tcttttttta gaacttcctt caaggtactc gttttgtttc 10021 tctctaaggt gttttatggt aaggaacata ccctgcctta ctaatcttta caaaaagaaa 10081 gacatattta cttgcgttga atatttcaca cacaagaggc agggacgggg tcttacaaga 10141 tcacgttgga gcaccaagtc cggcgtagga tccataaaag aaaataccta caacggtgat 10201 gacgcctaaa cctgcgattg tcgcaacaat ccacagagga attctgccac ttccagacac 10261 tgctttctct cctttctttg ctaaaaaaat taatgacaaa agttaaaaaa catgactcaa 10321 tagccgtcat tccaattagt taaagaagta actggaaaac aaaataccaa gaacaaaaac 10381 cagtaacagt cccaggtaaa gggaagttcg gttgagttca actggctgat tattagaatt 10441 gggggttctt tccatggttt tctcctaacg ttgtatgaat tgcattgcag cgatcgcgcc 10501 taaaaagaag atagttggca cagctagagt gtgaactgct agccatctga ctgtaaaaat 10561 tggatacgta actggttgat tgacgttatt tccgctagtc atgatttcaa actactttgt 10621 gttactaaac tgttctactt gttgttttgc atcaaaacgg ttattcacaa ttggcaattc 10681 ttgccgtgct gatgggtagt actcattagg gcgaggtgta ccaaacgcat cgtatgccaa 10741 gccagtgctg acaaataacc aacccgcaat aaacagtgca ggaatggtga tgctgtggat 10801 cacccagtaa cgaacgctcg taataatgtc cgaaaatgga cgctctccag tagttcccga 10861 catttatatc cctaccttca caaaaattgt tattgagatc attatcctac aaagttaaga 10921 aagagttaca cctcgtaact tctttctcag aaatcaattt caagcagcct ctgatttggt 10981 agcagcggca atattaggat tatatttgag taaaacaccg cgatcgccaa tcacgaatcc 11041 cttttcctgg ttaagaaaaa caattttgta aaaattagcg ggaacttgtt caacatcacg 11101 gtctttttcc caggtttttc caccatcaga actgcgcagc aaattaccgc taccaccact 11161 gatccaaact tcatctggtg tgcgataagc cagatctagt aaaccccaac tagtagcaag 11221 ctctggattt tgagctgact gccattgctc tgggttagct gggtcggtaa actgtacttg 11281 gccacctcgt gctaacatcc atacttgacc attttcagca aatcccatat tctctacccg 11341 tcgggaacta ttccggttgt ggggtaccca agcgtttaac cctggttccc 
aagttgagta 11401 gaaattacct tttgcagaaa cagccacata tttgccatca ggagaacgtt cgatgttgcg 11461 aaccacacca acagcttgtt ctacctgtgc tttccaattt tgaccgccat ctgtggttct 11521 gtatatcgct cctacatccg tcgcgatctc aagagtgttt tgtcccagtg cgacaatact 11581 aacaggatta cctggtaggt tttcactcag ttgaatgcgg gaccaagaac gaccttcatc 11641 agtggtgtgt aacaacaaag ctggttctcc agctatccat ccctctttac ctgcaaaact 11701 gaccgtatta aaacggtacc tttggtcatc taattctaat tgtagaggtt gccaagtcgc 11761 gccaccgtct tttgtttcca aaagagtagc attgctacct actaggaaac catgctgggg 11821 gttatctgta aaagcaatat ccagtagttt tgcatttgtt gggacagcaa ccacttccca 11881 aggattgtag ctagtcgaaa cgacattact acaaccaaca cacataagta taaccacgaa 11941 caaagcgagt attcgttggc aaaattttac aattggaagc atctgttatc tgtttgtttt 12001 tttttgtgaa tttgtgaatt tgtgtatggt tttgtgttat tagggtatta cttaccccgt 12061 aaagagcgtg ttttaccgta agccataaag acttagaaaa aacaaaaagg caagagccaa 12121 agctccgaaa atcagaagat ttttctgccc tggtgtgaga gtattcacac cgaaaccaaa 12181 tttcagattt tcgtcaaagc cagatgcttt gcctacagga ccaatgtttg taaaagcgct 12241 cttttttgca ctgcaaactg gacagcgcca agtgactggt atttccgcaa agggtgtccc 12301 tggggcgatg tcatgcttgt cgtccccctt ctcgggttcg taaacgtaac cgcaagaacg 12361 gcactcgtag cggtctagcg caattgtttc aacagcttgt tcgctcatgg ctcttggcct 12421 cggtgagaag caaatattaa atatacgtta aaaattatga cattaagtgt taggtttttc 12481 tatgttgacc aaaaacgaat caaatcatct acaactctgt ttgttgagat tcaagaaagt 12541 caaaaagata taatgtaaaa gaagtttaca cttcaccgga tgcctatcgc ttgccccttg 12601 agcgatctgc cgctatctgg agcacgaagc gatcgcccac tcagcgcagc ttcttcaaaa 12661 gaaggcacgg catccggtga agtgaacgcc caagggttcg cccaagggga gaacccaagc 12721 ccagagctag ttgccgtgag agcgagaaac cctttcccag cactggtctg gtcaaagtta 12781 ttagcaccaa aagcggtaaa aagtcattgt gtttgtcctt agcggttacg agtatcttct 12841 tggcttctta attatctgta gcctagtgcc tgccttggct ctgtcagcgt ccaagctcct 12901 gcgacccagc ggtcgcaacc cagaacggcg caccacatat gaatccggca tggaacccat 12961 tggtggagct tggattcaat tcaacattcg ctactacatg ttcgcgctgg tcttcgtcat 13021 ctttgatgtt gagactgtat ttttgtatcc 
ttgggcggtt gctttccacc gtcttgggct 13081 attggcattt attgaagcgc taatttttat tgcaattctt gtcgttgccc tagcgtacgc 13141 atggcgtaaa ggagctttgg aatggtctta gatactaaca tagaacagca aaaagaacag 13201 caaaaagaac agcaaaaaga gcgcattctc aacccaatgt cgcggactac agtaacccaa 13261 gatttgtcag aaaacgttat cctgacgacg gttgatgacc tctacaactg ggtgcggctt 13321 tctagccttt ggccgatgct gtttggtacc gcttgctgct ttattgaatt tgcagctttg 13381 attggttccc gatttgactt tgaccgcttt ggtttgattc cccgttctag ccctcgtcaa 13441 gctgacttaa ttattactgc aggcacaatt accatgaaga tggcacctca gcttgtgcgt 13501 ctttatgagc aaatgccaga acccaagtac gtgattgcta tgggtgcttg cacgattact 13561 ggcggtatgt tcagcgtgga ttctccttcg gctgtacgcg gtgtcgataa attgattcca 13621 gtggacgttt acttacccgg ttgtccccca cgtccagaag caattattga cgcaataatc 13681 aagctgcgca agaagatagc aaatgattcg atgcaggaac gcgatcgaat taagcaaacc 13741 caccgctatt acagcacaac tcacaacctc aaacccgcag agcaaatcca cacaggtaaa 13801 tatttgcgga ctgaatctcg cttcgcacca ccgaaggagt tgactgaggc gatcggttta 13861 cccgtcccac ctgctctttt gactcaaaag gtacaacagg aggaaactaa gcgtggctga 13921 agaatcaaaa gaagaatcga aagaagaatc gaaaccagca ccagcagagg aaaagtcact 13981 agtcaaagcg ggcgcagttt cccaatggtt aacagaaaat ggctttgacc atgagtctct 14041 ggaagctgat catagcggtg tagagattct caaagtcgag gcagattttc tgcttccact 14101 ttgcactgca ttgtatgctt acgggtttaa ttctctccag tgtcaagctg ggattgattt 14161 gggaccagga cagcagttgg ttagtgtgta tcacttgatt aaaatcagtg ataatgctga 14221 ccgtcctgag gaagtgcgtc ttaaagtctt ccttccacgg gaaaatccca gagttccctc 14281 agtttactgg atttggaaga cagcagactg gcaagaacgt gagtcctacg atatgttcgg 14341 cattgtctac gaaggacacc cagatctgaa gcggattttg atgccagaag attgggtagg 14401 ttggccctta cgtaaggatt acgtgtcgcc tgacttctat gagttgcagg acgcttatta 14461 aagcgaagtg cgttttcatt cttagcccct cttggcgatt tgtcgtcgaa gaggggtttg 14521 ttttttgttt cgcaacagag gcgagtcagc gctatgcaga agggcaagga acagtttctg 14581 tttaattttt catagacatg gacaatcttt aagttgtcac agatgttaca caatacaata 14641 attcatgtaa cctaaatttc tttacagctt aatcgactac catgctaaaa ccagtgcatg 14701 caatcttaat 
tgctttgacc ttgactttga gcttgacatc tgcactccaa gctgcaaacc 14761 cccgtcaaag ttgggtagag aatgaaattt accaatataa ccatcagttt gctagccagt 14821 tgcctcaaga actggcaaca aaaatgcaaa aaatgaaggc tagtcctttt gctttttatc 14881 gaggaacggc tcatatattc taccgggata tgcaaacctt agcaagttct ggatttgtca 14941 attcctctac gtcagccatc tggttagagg gagacatgca catgcagaat cttgggggga 15001 tgagggatag caatggtaac aacgtttttg acaccaccga ctttgacgaa ggctatcttg 15061 gtccttatgt ttgggactta cgacgcatgg cagtctcaat tcttctggca gctaaagaga 15121 acggtttaag ttccagcgat gcccaggata tagtccggaa ttttttagat gcctacctga 15181 acaaaatgag cgactttaag gggactaacg atgaattatc ctaccgtcta gaatctagca 15241 atactagcgg tgtggtcaaa gacttaattc aacaagcagc aagcaaaagc cgctctaatt 15301 ttttaaataa gtacacgcaa attaatacca gtggcaatcg agtttttcag acaacctctg 15361 aactccagcc tgtatctagc agcacttatt cagcgataaa tactggtatg attggttaca 15421 ttgcttcaat tcctagcagt aagcgctata acaataatta ctatatcctc aaggacattc 15481 gcctcaaatt aggttcaggt actggcagtc tcggcaagta tcgctatttc ttgctcattg 15541 aaggttcgag tttagcaagc gatgacgatc agattctaga aatgaaacag gaaactagta 15601 gcgcagttgc aatcgcagcc cctaatcttt taccaagttc ggtttatgga aatcatgagg 15661 gacaaagggt cacaacagca actaaagcaa tgctttccaa cacagacccc ttagttggtt 15721 ataccacggt tagcggtata accttcatgt tgcatgaaaa atctccctat caagaagact 15781 ttgactatac gttgctaacc accaagtcaa aatttttgga tgcaatggcg tatgcgggaa 15841 aggtcgttgc taagaaccat gcaatctctg ataaagatta tgatacttca attgttcctt 15901 atagcgttga caaagaagta acggacgttg tcagcggtaa caaggctgca ttcaaaaatg 15961 aaattgtcaa cttcgcccta gactacgcca ctcaggttga atatgattac gcaagttttg 16021 tagacgctta caatcgaaat attccgcttt attagaaata atcagttgga gatagctaca 16081 actttcttgg gattaagcaa tcgcaataag caaaagtgtg aaggtttggt agatgccaaa 16141 gcagttatta ctgaagtcta gactacggca agaagttacc aattgtccta acggtgaaaa 16201 aagactacat attttgcagg ctctatctgt agccagcttt tagagaattg ctataatttc 16261 agttactttg ggtttccttg catcgctttt agtctaaact tcactaatac agtttaataa 16321 cgtgcggact gacaaaatat tctacagttt atttcaagct tttcccagca tattttttga 
16381 catcattggc gacacaacca tcaatcccaa tacttatgaa ttcgtttccg ttgaactcaa 16441 agaaactgcc tttcggattg atggtgtctt tgtcccagca actgaaacca caaaacagcc 16501 agtttatttt gtcgaagttc agtttcaact agactcaaat ttttatagac gcttttttgc 16561 agaaatcttt ctgtacctgc gtcaaaatac atccgtcaat ttttggcgtg cagttgttat 16621 ctatcccaac caaaatttcg atccagacga tcaacagcca tatcagttgt tgctagagag 16681 tccacaagtc cagcgcattt atttagatga attaggtaca gcagcagaaa actcacttca 16741 acttgcggtt gtgcagttaa ttatagaacg ggaagagaca gctgttgagc gaggtagaga 16801 attaattttg cgggcaagac aacaactggc agatgaaacc atcagacagc aaattgtaga 16861 attaatagaa accatcctgt tgtacaagtt tactcaatta agccgagagg agttggcagc 16921 aatgttgggc atagatgacg aattcaaaaa aacaaggatg tatcaatcca ttaaggacga 16981 agctttagaa gaaggtaggc aggaaggctt acaagaaggt aagctgcaag cgaagttaga 17041 agcagtcccc cgattgttcg cattggggtt gagtgtggaa caagtagcaa gggctttgga 17101 tttgacgtta gatcaggtgc aacaagttat tcaaagtccc tagatctaca tcaatcgtgc 17161 gttagctagc gctaacacaa cgccaggtac tacaacgggg gggaaccccc gcaccgtact 17221 ggctccccta cactgctaca ctgcttcacg gcttagcgcg aaatttaagg tttacatctt 17281 acctttctgc atgacttcct gaaggtgatg acgaatttcc tgtgcgtgct ggatatcaga 17341 ctggcgtaat ttctcgaaca gttctacaca aggctgagaa cctgcttgct gtgcatcgtt 17401 gatgtaagtt tcataagctt tgattgcttc agctttgttg tgcaacacgg tgaggaaatc 17461 atattccaag ttgcttacgg gattttgagt ttgagcgttt cctgcagttt gaaccataaa 17521 tttaccctcc tttgggtttc ttatttactt ctactcaccc gaaaagaggt attcatcttc 17581 ctaaaagagt atgcgacgat acgtaattag tagcaaagca gtaacccaag aggttgcatt 17641 tttttaaaaa ttctcagtta tcactcttta ctgataactg ataactgata actgttcact 17701 gattcaaaca atatccttgt agtgccgtgg atacttgagt tcacgctctt tttcctgttg 17761 ttcgttctca atctgtttaa caggggaaga atgagtgtcg tcggcactca tataatccat 17821 atgtggtgtt ttggcacaca caaataatag tccacccaga acgactaata agccgcaaat 17881 tgatgcagtc tgaaccccgg aaatttccag cgcaccatga acaaccactt ctcgcaatac 17941 ggaaacaatt gctacttcca ctgctacccc aacagcaata ctatgctcct ctaaatacac 18001 cataagcagt cgaaataact cgaccaaaat taagatgaat 
agtattttgg cagtgacgtg 18061 tttaaaatct acgggtattt caagggtggt aaagaaaacc catagctgca tcaccataat 18121 agcgaacaaa cccaaacaca gaacaatgac aattaagtct tggaaagcct ccatgttacg 18181 gacaatccaa tgccgatcta gccagcgttc ggagaataag aattgactct tgaggcgctt 18241 tttcatgagg atttccgtta ctatcaagac aacttcattg cacaccagcc atttgggctt 18301 ataatttcat tgtctcgtaa ttaagccact atttagcgag gtggataatt ttgatttgat 18361 aattgttgcc atagattctc tgcattttta gcccactgga aattcttctg agctttgtat 18421 aaatctctgg catattgcaa cacttctgct gcttgtggat attgctttaa ctgtataaac 18481 actaacccgg cgctataata tgcattggca ttatttgtat tagcttctgc tgattttcta 18541 aaagcttcta atgcatcttt taactttttt tggttaaata aaatggcacc gagattgtaa 18601 taagcttcgg aatattttgg attgatttta actgctttgc gaaatgcatc ttttgcttta 18661 tcggctttgc cctgttgtaa ataacagatg cccatatgat aagcaggttc tggtgcgttt 18721 tggctcagtt ctattgcttt tttataggat gcgatcgctc tatcacaatc tcgttgttgc 18781 tcgcgcagca accctaagtt atagtaagcc actcccagtt taggatcaag ttgtattgct 18841 cgttgcaaat aatcgttcgc ttgctgccaa ttgcttcctt ccaaaagcga gccaccgaga 18901 ttagcatatg ccagagcaaa ctggggatca gattgtgtcg ctttgtaaaa agcatcagca 18961 gcaggttgta attgtcctgc ttgtcgcagt gctagtccta aattgtagtg agctgcagct 19021 aaattgcgat cgagttctgt ggctttacga aataaagcta tagcatcttc tactctcccc 19081 acctgaatgg cttctaatcc ttgtgtcaag taatcttgag cgctgctatc aatgtattgc 19141 gcgagtttga gaaggtgagg agataaagga acagctggag gtggaaaagc tgaggatgat 19201 ggcgaaagaa ggctttgaac cactaatatc aaacccacta aagcaggaag gcgatattga 19261 caaagttgga ctcgcattgt tttctggtat tggaacactt taggttacgt ttactttaca 19321 aattttattt aacatataaa cagattcata taagtttaat tttatatagc aagttttgta 19381 atagttaaat tttcttcagt taaatattac gag // LOCUS NODE_1687_length_19275_cov_5.28303919275 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 19275) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 19275) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..19275 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(19..1701) /locus_tag="DP116_14775" CDS complement(19..1701) /locus_tag="DP116_14775" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14775" /translation="MSRNEFAKKLINKLFRNKLEESLCRLCEILEEDFPQGDYSDDLR SIQKKLNRDPDRETLLNRVQKTKVQPFLEPKGLISNPEELNNKVIPLGLEERPTGTKV FDIFDNEMDDKRTLLILGEPGSGKTTILADLTNELIERAREDKNHPIPIILNLSSWKE KQTIDEWIVEQLKDFYQVPKEKGWNFVQQKQLLLLIDGFDELKSVSQKSCVRELNKFI TTYDSTRIVVCSRTDEDNENEEREELLQELTSQIVVRLKPLDLEKAVQYLNDNSVPDW LKELVNTNNQLQILIKLPLFLYLIIEVYKSDKNSQQKTDFLQEDSEDEIWNKIFYKYI EKQLEKWKSNPNYSEKVNKEEVNRYLKWLAKQMHKENQTDFLIEKMQPKWLSQPSNQV TFQETIYHIGVSLIVGLLCGLASGLFSEILNYNSNPQDQIPWIHYIFFGLISGLLSGF ISGLIFLLPEDLEYNPISKLKSALRKSTRRRSALIGYPLRLLSKLTYWLRANRNSKLT SRFRFVMIGLISGLIRWEVNLRLEEGKTQPLDAIIFGLISGFIVGEMKTLWL" assembly_gap 1820..1829 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(2007..2894) /locus_tag="DP116_14780" CDS complement(2007..2894) /locus_tag="DP116_14780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006102839.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14780" /translation="MGYDIDELKKSMVKIINPDTQSTAGSGFIIHSDGYFITCHHVIY RLNSLKVEYQGQVYNHVQWCEDLSNPDVDIAILKINLSGAKAVPIINLQDVSTSVTVY GFSLEKAFNFRDGSDFHAQSIHKSASVNTLSTYEYKDKTFTNSWNKLPNSTSTFEAYR INVLVDSGTSGGPVLANNLNAVVGVIQSKGSNESYVIRWDNITESLKKLGLEPGTGKV EKGVNDIDRRKLIELLINSERTTDLRRETLCEDIRIDIGNISYESGLSRNEFAKKLIN RLFRNKLEESLCRLCEILE" gene complement(3031..3396) /locus_tag="DP116_14785" CDS complement(3031..3396) /locus_tag="DP116_14785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14785" /translation="MAITTILAPDGSKIYIQYDVEESTQLRAVGAPDPIEDIARRTER FKKSLVTTINGYSQILLDSVQQGVNDLTKPNKVTLEFGLQMGGEAGIPLVAKGTSQAN VKVTIEWNLSNNRQNSSNS" gene complement(4021..4108) /locus_tag="DP116_14790" /pseudo CDS complement(4021..4108) /locus_tag="DP116_14790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877789.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" gene 4286..4714 /locus_tag="DP116_14795" CDS 4286..4714 /locus_tag="DP116_14795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015954333.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="putative toxin-antitoxin system toxin component, PIN family" /protein_id="PRJNA477356:DP116_14795" /translation="MKTYQIILDTNVLLAGLRSSRGASYKLLTMLNNNRWQLNISTAL VFEYEEILKREQTQLGLSLEDIDNIIEALCAIANKRKIFYLWRPMLNDPDDDFLVDLA VESQADYIITYNQKDLQPAEKFGIKVVTPKQFLQEMGEIA" gene 4711..4950 /locus_tag="DP116_14800" CDS 4711..4950 /locus_tag="DP116_14800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006617484.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14800" /translation="MTHLNVQIPQSLYKQIETLATRENISIEQLVAVALSAQVSAWMT KDYLEEKVQRGSWEKFQQVLNKVPDVEPEDYDKLD" gene 5293..6399 /locus_tag="DP116_14805" CDS 5293..6399 /locus_tag="DP116_14805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_14805" /translation="MWSYLLKIKQNILTRNERGLIYWLIISGFAIVIALEYLTPPEYV FGYLYTGTILLASSRLSRNAVLGVTLAATGLTLLNLFIPGVETVHPPTVANRLIAVIA LVVTGYLSDRNHRNKEAIAYAQTQLRSQQQLAQMREDFVSTLTHDLKTPLLGAIQTLK SFEEGQFGAVTPMQERIIETMTRSHRTTLQLVETLLDVYRIDTEGLKLQRSPVNLVTV AQEAIATLTEIARSRQICVCVNYGESDFCRSFWVNGDSLQLGRVFSNLLINGINHTPR GGKVEVVLETSAIDQIVKISDNGCGITQEELPYLFERFYQGQSDRLFIGSGLGLYLSR QIIEAHGGIIWAESRASKGAIFGFRLPVCPPPGD" gene 6446..7132 /locus_tag="DP116_14810" CDS 6446..7132 /locus_tag="DP116_14810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_14810" /translation="MKPGAVVKILLVEDDELFRLGLRMRLQQETGIEIVAEAEDGEQA VELANRYLLDLVLLDVGLPGIGGIEACRQIKQQHPDLPILVLTSRSEKPLIARLIEAG AQGYCLKGIPSESLILALRSVAAGASWWDHTATREIRAAFGGNNTAAPVQDKQPSENP LTKREQEILALVAAGKSNQEIAEILYIASGTVRVHVHAILHKLEVRDRTQAAILAIQK GLVAKELLQN" gene 7358..8062 /locus_tag="DP116_14815" CDS 7358..8062 /locus_tag="DP116_14815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_14815" /translation="MTNIPLFANRTQAGEQLAQAIDAILTQQIADQVTNPLQIVYALP RGGVPVAAAVARLLNCPLMIEVAKKIGHPENPELAIGAVTASGNVIWDQHNVFFRRTP KSGWREEALDTAISQAKSLQAQLSPACPQVNTQGAILILVDDGIATGMTIAVAATSLK ALSPAEVWLCSPVAPLELLPWLDQWGDRVVVLSTPKPFFSVSNFYVEFPQVDTKEAFE CLLQQNQDIINSQKEI" gene complement(8142..9125) /locus_tag="DP116_14820" CDS complement(8142..9125) /locus_tag="DP116_14820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997168.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="PRJNA477356:DP116_14820" /translation="MEFQVENIVYVESLRACLTEREHVSISDDFRDSHEEILVPHRQQ DHYEEANRLLRQGVQQQQAGDSLAAMRSLQESLALFQAVGDIEKQAQVLSCLAYIVYH LGDYKSAISHSKQCLLLTKDVANLQVIKMQALSHLGNAYRHLGEYNKAIEFLQKCLKI AQQLGDKRSQVAGLNNLGLVYKALGDLHQAIEFKQQSLEIVRELQDHWGEEQVLKNLG SAWYALGDFAKAIAYYEQCIKRAYSLNNHQTALQVLKNLGNACYAQGDYAKAIVYYEQ RLLLARAMKDKRSEEQSLGSLGVACEALGDYVKAINYYEETLELAKFLQDS" gene 9310..9962 /locus_tag="DP116_14825" /pseudo CDS 9310..9962 /locus_tag="DP116_14825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317461.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" gene complement(10214..11320) /locus_tag="DP116_14830" CDS complement(10214..11320) /locus_tag="DP116_14830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876189.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_14830" /translation="MFKILKSWLKNSPIALLLVTLFLGISTAAWTPSSSAGLPAGNAI TDGNALLRYSLPIDNKPVRQLQASLEDITTQLRANRRWGAVSQDLSKASRILDKPSKL LADVPEERQSEAEAVIAELKSGVNALQEVVKTKDKEQVREQRAKLLSLVGKLEESMVK EFPFEVPAKYSNLPQLKGRATVEVKTNKGDLTVIVDGYSAPVTAGNFVDLVQRGFYDG LEFTRSEESYVVQTGDPPGKDVGFIDPKTGKYRAIPLEFLVQGDKEPTYGFTLEEIGR YTDLPVLPFSAYGTVALARPESDNNGGSSQVFFFLFEPELTPAGRNLLDGRYGVFGYV TEGKDVLEKLKAGDKIESAKVIQGGENLVEPNVA" gene complement(11591..12307) /locus_tag="DP116_14835" CDS complement(11591..12307) /locus_tag="DP116_14835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197792.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_14835" /translation="MVQSEPRFTLPTTDELPCSDDTPVDNEDQNFLPNLLLFILESIW ANRNDWFFGIDMGIYHTTGVSHLVPVIPDGFLSLGVERRKAGKSRLSYAVWEEEEIVP KFVLEVVSKTPGDEYDKKLEIYAKLGVLYYVIYNPQYWRRDQHQPFELYKLVDGEYQQ QIQEPFWMPEVGLGIGRGSYTSGVVKREVLYWHDKEGKRYLTADEVAQSERQQRELAE QHQQRLAAKLRELGIDPNSI" gene complement(12595..13272) /locus_tag="DP116_14840" CDS complement(12595..13272) /locus_tag="DP116_14840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="rhomboid family intramembrane serine protease" /protein_id="PRJNA477356:DP116_14840" /translation="MFPLYDENPTRITPYITYGLIGMNVLVFLHEVSLSNAQIEQFFQ LYAVIPRELTNNFAGEWTTLFTSQFLHGGWWHLLSNMLYLWIFGNNIEDRLGHFKYLI FYLSCGALAALCQWIIGVNSGIPSLGASGAIAGILGAYIIRFPDTRVLSLIFLGFFFT TIRVPAVVVIGLFFVQNVISGLANLQAAANMSVETGGVAYWAHIGGFVFGIILGPLLG LFRRDDY" gene complement(13415..14134) /locus_tag="DP116_14845" CDS complement(13415..14134) /locus_tag="DP116_14845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rhomboid family intramembrane serine protease" /protein_id="PRJNA477356:DP116_14845" /translation="MVPIRDDNPTTITPYVTYGLIAVNILAFLYEASLPARQLDGFLH LAAVVPRELTASFAGISVNQPVPEWLTLITSQFLHGGLLHLAGNMLFLWIFGNNVEDK LGHVKYLFFYISCGVLASLAQWYFAQNSSIPSLGASGAIAGVLGAYILRFPQAEILGI VPLGIFFPTFRVPAYFFLGFWFIQQAFYGLASLETPTNVGMESGGIAYWAHAGGFVFG AILGPLLGLFSDKPSQESWYR" gene complement(14378..14845) /locus_tag="DP116_14850" CDS complement(14378..14845) /locus_tag="DP116_14850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SsrA-binding protein" /protein_id="PRJNA477356:DP116_14850" /translation="MSENSQGYKVVTDNRQARYLYEILETYEAGIELTGTEVKSIRAG KVNLQDGYGLIRDGQAWLLNAHISPYNASGQYFNHEPRRTRKLLLHREEIRKLIGKVE QQGLTLIPLKMYLKRGWVKITIALARGKKVHDKREDIKRRQDQRDMQRAMKNY" assembly_gap 15166..15175 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 15200..15502 /locus_tag="DP116_14855" CDS 15200..15502 /locus_tag="DP116_14855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862679.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14855" /translation="MKRQLTKVIGTSVIALSMALPFSVPAFAQTTTESGATTTAPNTT TDTRNTRTYNDNDFDWGWLGLLGLLGLAGLAGRKRHEEPTRYRDPNTTVGTTTYRD" gene complement(15651..17075) /gene="ictB" /locus_tag="DP116_14860" CDS complement(15651..17075) /gene="ictB" /locus_tag="DP116_14860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197164.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative bicarbonate transporter, IctB family" /protein_id="PRJNA477356:DP116_14860" /translation="MNLVWQRFTLSYLPLRQYLGTSYLHRSLVGLFHSWRQSSLLMQW GETIAAVLLSLVYTLAPFVSNDLLGLILVACAGFWLLLTLSDETTPNASSVTPIHLLV LLYWGVATVATALSPVKKAALTDLRNFTLYLLLFALCARVLRLPRFRQWLITLYLHIS LIVSVYGLRQWFFGAPPLATWVDPTSSMSKTTRVYSYLGNPNLLAGYLLPAVVLSIVA VFAWASGFKKALALTMCVVNGSCLILTFSRGGWIGLVVAVLILIGLLYYWWSVQMPPF WRTGLPWILLSSLTCVLVLAVVFVEPVRDRVFSIFADRQDSSNNFRRNVWTAVFKMIQ DYPITGIGPGHSAFNKVYPLYQLPRYSALSAYSIFLEVIVETGFVGFACFLWLLIVIF NTGLLQLQRLREIKSVDGFWIMGAIATLAGTLAHNLVDTVWFRPEVNTVWWLMVGLIA CYYKSIPQGRDREFNSHNTEPTAS" gene complement(17291..18793) /locus_tag="DP116_14865" CDS complement(17291..18793) /locus_tag="DP116_14865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874516.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_14865" /translation="MVESENKLFTPGDVWVSQEMGERQRLKVLSDLGLRQPETIPVFE EATQTAAHLLEAQICILGFVDQERHWFKSSLGLSKLRGGIMNHLAQSRELLRHESFCT KVVEDSQPLVINDTEKLTRADSSAYSKLIQDYGIRSYLGAPLFDACGHCLGTLAVMDI KPRDFTDKDIEFIQIIARWSMSEFERNRLLETTSSSSVPSSASNLSDKNVVNTTEIKI SPSIQKDLVITNQLKLELLGQLIQELRTPLTSVLGMASVLGREIYGPLTNKQREYVDI IQHSGRYLLSLVNEISQMGTMEENPTALNLTPIDIEMLCQQAISTLEEAANRREQDIR LSIEPERNRICSLDKDKVRQILYHLVFSVIQLSATGSTVRIHVSSKDDTLNITVWVSH PWLGEGITEIDPLFSHSRTLPLQELTRTHSGHLENQDQTGDLIVNMSSTSATKPHGYL SRESLGLLLSCHVAELHGGQIAIQGSSQSGYRYVLSLPLNLDTAEANSDV" BASE COUNT 5683 a 4023 c 3752 g 5797 t 20 others ORIGIN 1 gcatgtggcg tccctgggct aaagccacag ggttttcatc tcacccacta taaatcctga 61 aatgagtcca aatattattg catctaaagg ctgggtttta ccttcttcta gtcgcagatt 121 aacttcccaa cgtatcagcc cagatatcag cccaatcatt acaaacctga atctagacgt 181 taacttagag tttctatttg cacgcaacca atatgtcagt ttagatagta atctaagtgg 241 ataccctata agtgcagacc ttctacgtgt agactttcta agtgcagact ttaatttaga 301 tatcgggtta tattccagat cttcaggcaa taagaatatc agtccagata tgaatccaga 361 tagcagtcca gatattaatc caaaaaatat ataatggatc cagggaattt gatcctgggg 421 attggaatta taatttaata tctcagagaa tagtccgcta gctagtccac acagcagtcc 481 tacaatcaaa ctaactccaa tatgatatat tgtttcttga aaggttactt gattggatgg 541 ctgagacagc catttgggtt gcatcttttc aattaaaaaa tctgtttgat tctctttatg 601 catttgtttt gctaaccatt ttagatatct gttaacttct tctttattaa ctttttctga 661 gtaatttgga ttacttttcc atttctctaa ctgcttttca atatacttgt aaaatatttt 721 gttccatatt tcatcttcgc tatcttcttg caaaaaatct gttttctgtt gtgagttctt 781 atcagatttg taaacctcaa ttatcaaata gaggaataaa ggtaatttga ttaaaatctg 841 tagttgatta tttgtattta ccaattcttt caaccaatca gggacagaat tatcattcag 901 atattgaaca gctttttcta gatcgagagg ttttagtcgg actacaatct gagatgttaa 961 ctcttgaagt aattcttctc tttcttcatt ttcattatct tcgtctgtac gactacaaac 1021 aacaatccgt gttgaatcat atgtagtaat aaacttattt aattctctaa cacatgattt 1081 ttgagaaacg gactttaatt catcaaagcc atcaattagc aaaagcaatt 
gtttttgttg 1141 aacgaaattc catccctttt ctttaggaac ttgataaaaa tcttttaatt gctccacgat 1201 ccactcatca atggtctgtt tctccttcca agaagaaagg tttaatatta taggaatcgg 1261 atggttctta tcttcacgag cacgttcaat tagctcgttg gtcaggtctg ctagaattgt 1321 agtcttaccc gatccaggtt cacccaggat aagcagcgtt ctcttatcat ccatttcgtt 1381 gtcaaagata tcaaaaactt tagtacctgt aggtcgttct tctaaaccca gtggaatgac 1441 tttattgttc agttcctctg gattactaat cagtcctttt ggctcaagaa acggttgaac 1501 tttggttttc tggactctat tgagcaaagt ttcgcggtct ggatcacgat ttagtttctt 1561 ttgaatactt ctcaaatcat cactgtaatc tccttgggga aagtcttctt caaggatttc 1621 acagagacga caaagcgatt cttctaattt gttcctaaat aatttattaa tgagtttctt 1681 agcaaactca ttacgtgaca aaccgctttc ataagaaata ttgcctatat cgattctaat 1741 atcctcacat aacgtttccc gtctcaaatc tgttgtacgt tcgctattta ttaagagttc 1801 tatcagcttt cttctgtctn nnnnnnnnnc aaatctgttg tacgttcgct atttattaag 1861 agttctatca gctttcttct gtctgtttca ttatttagat cttttagaga catattttct 1921 cagaaaaact cctagttgta gtacttacag ttaagtttta tttttatcct tctcaaatca 1981 tcactgtaat ttccttgggg aaagtcttat tcaaggattt cacagagacg acaaagcgat 2041 tcttctaatt tgttcctaaa caatctatta atgagtttct tagcaaactc attacgtgac 2101 aaaccgcttt cataagaaat attgcctata tcgattctaa tatcctcaca taacgtttcc 2161 cgtctcaaat ctgttgtacg ttcgctattt attaagagtt ctatcagctt tcttctgtct 2221 atgtcgttta cccccttctc aacctttcct gtaccaggct ctaatcccag ttttttcaag 2281 gattctgtaa tgttatccca gcgaatgaca taactttcat tgctaccttt agattgaata 2341 acccccacta cagcatttaa gttattagcc agaacaggtc caccgctagt tcctgagtcc 2401 accaaaacat taattctata agcttcaaaa gtcgaagtag agtttggtag tttattccaa 2461 gaattagtga atgttttgtc cttatactca taagtagaga gagtattaac agatgcactt 2521 ttgtggatac tttgagcatg aaagtctgag ccatctcgaa aattgaaagc tttttctaga 2581 gaaaaaccat aaactgtaac cgatgttgat acatcttgta gattgataat cggaacagct 2641 tttgcaccag aaagatttat cttaagaatg gcaatatcaa cgtctggatt tgaaaggtct 2701 tcacaccact gtacgtgatt gtaaacttgc ccctgatact ctactttcag tgaatttaaa 2761 cggtagatga catgatggca ggtaataaaa tagccatcag agtggatgat aaatccagaa 
2821 ccagcagtac tctgtgtatc aggattgata atttttacca ttgatttttt gagttcatca 2881 atgtcataac ccatttttat agtctccttt cagattcata gcggttttca atgcaataca 2941 ttacattaca gttttgattt tcttgtgaga cagatgtatt gccttttcta tgctgatcac 3001 tccacctcac aagaactgta tttaagacaa ttaagaatta ctagaattct gtctgttatt 3061 gcttaagttc cattcaatag tgaccttaac attagcttgt gatgtccctt tcgccaccaa 3121 aggaatgccc gcttctcctc ccatctgtaa accaaattcc aatgtcactt tatttggttt 3181 tgtaagatca ttcacacctt gctgtacgct atcaagtaat atttgtgaat aaccgtttat 3241 cgtagtaact aatgattttt tgaatcgctc tgttcttctg gcaatatctt cgatggggtc 3301 tggagcacct acagctcgta gctgagtact ttcttctaca tcgtactgta tgtatatttt 3361 agaaccgtca ggagcgagta ttgttgtgat agccataagt ggaattctct ttgatgatgt 3421 ctttttagga aaaagttaac aaggtactca gacgcaactg gtaaaaccag taacgctact 3481 attttatctt atttagtagt agcctcttac ctcttagcct taatggttag cagtttctag 3541 ttccatgtgc tacctcttta aatagtgttt tgctttatta ataccacacg gtttctatta 3601 cagattccgc aattgaggtg attaaaattt ctagctgttg tttcgactta tgcagtcatg 3661 cagaattcta gagaaggaat tttttgttgt tttttataca aaattatctt ctcaaagttt 3721 aatttcgata gatagatagc taaatatcag atttgaagat gcccaaaata tatatatagg 3781 aaacagtaaa agtaacttat gcagtttttg cttaagcaaa cttgccccta aaatcagctt 3841 tagcgcgttg ttgtagatgc caattagagg aatgcagagt aggcattaga ctgacacact 3901 tgctgggcac ttagttttaa ctccggcaaa gctattgaaa taaaagattc ttattcttaa 3961 ttttgaactt tgaattttgt tcgcgcagcg tgcccgaagg gcatactttg aattgttttg 4021 gcgtacttgt tgatgatgct tatctctggc gctgtagtac gcaaacaaaa acccactgtc 4081 ttttgagtat gagcagatgg taagtcatga atgaatggcg ctctttaata tattgcgctt 4141 atgacaagta taacagagga gtagaagctc tatcgttaag gtatggattg ctgagcggct 4201 tgagtggcat acttaagtca tgcgtggcgc tctttttgtt tgggcattga tgacacaaca 4261 gcctaaccgc aagtgtagaa aaacaatgaa aacgtatcaa ataatactag ataccaatgt 4321 cctactcgcg ggtttacgct ccagccgtgg cgcatcctat aagctattaa caatgctcaa 4381 caataatcgc tggcaattaa atatttctac cgccctagta ttcgagtatg aagaaattct 4441 taaacgagaa caaacacaac taggtttaag tttggaagat attgataata ttatagaagc 4501 
tctttgtgcc attgccaaca aacgcaagat tttctatctg tggcgaccaa tgttaaatga 4561 tccagatgat gattttttgg tagatttagc tgtagaatct caagccgatt atatcatcac 4621 ttataaccag aaagatttac aaccagccga gaaatttggt atcaaagtcg taacccccaa 4681 acagttcttg caagaaatgg gagaaatcgc atgactcatt taaatgtaca aattcctcag 4741 tctttgtata aacaaataga aactttggca acaagagaaa atatatcaat tgaacaactt 4801 gtcgcggttg ctctctctgc acaagtttcg gcttggatga cgaaagatta tttagaagaa 4861 aaagttcaac gtgggagttg ggagaaattt caacaagttt taaataaagt ccctgatgta 4921 gaaccagaag attatgataa acttgattaa tttgattgta actcagaggt tgtttgaaaa 4981 gggtctttct atgtcatatt gaacgcagcg tagcggagtg aaatatctca gtatgtgccc 5041 gaaaccctag attcttcgct ccactttgtt ccgctcagaa tgactaaatt atactttctc 5101 ggactttaaa aacatcctct cagattatgc tgtgagtccg atgcctttcc ctgataaaaa 5161 tcggaagaaa taactagtac actaaggcgt aaataaacca cctcttccaa atcatagttg 5221 cagaagtcta tcgtcaacaa ctctggttgt acgaaaataa aaaacagagt attgacgcta 5281 taaaagctga cgatgtggtc ttatttgttg aaaattaaac aaaatatctt aaccaggaat 5341 gaacgaggac tgatttactg gttgatcatc agtggatttg ccatagtcat agcgcttgaa 5401 tatctaacgc cgcctgagta tgtattcggc tacctctaca caggaacgat tttgttggcg 5461 agttcccgat taagtcgtaa tgcagtgctt ggcgtgacgc tagcggctac cggattaacg 5521 ctgttgaatt tgtttatccc gggagtagaa acggttcatc cgccaacggt agcaaatcgg 5581 ttaattgctg taattgcgtt ggttgtaaca ggttacttaa gcgatcgcaa ccaccgcaat 5641 aaggaagcta ttgcttacgc gcaaacacag ttacgctctc aacaacagct agcgcagatg 5701 cgggaagatt ttgtttctac cttaactcat gacttaaaaa caccgctact aggagccatt 5761 caaacgttga aatcgtttga agaagggcaa tttggtgcag ttacaccgat gcaagagcgt 5821 atcatagaaa caatgactcg ttctcaccgt accacgttgc agcttgtaga aactttgttg 5881 gatgtatacc gcatcgacac tgaagggctg aaacttcagc gatcgccagt gaatttagtc 5941 acagtagcac aggaggcgat cgccacccta actgaaatcg caagatcacg tcagatttgt 6001 gtctgtgtta attatggaga atcagatttt tgccgctcat tctgggtgaa tggggattct 6061 ttgcaacttg ggcgagtttt ctctaatctt ttaatcaatg gcattaacca tactccccgt 6121 ggtggaaaag tggaagttgt gctggagact tctgctattg atcaaattgt gaagatttct 6181 gacaatggtt 
gtggaataac acaagaagaa ctaccttacc tatttgagag attttatcaa 6241 ggacagagcg atcgcctctt tataggttca gggctgggat tgtatttatc ccgccaaatt 6301 attgaagcac atggcggtat aatttgggca gagagtcgcg caagcaaagg ggcaatattt 6361 gggtttcgac tgcctgtgtg tccgccacca ggcgattagg atggcgaagt aaggaagatt 6421 ctgacttgtg aaccaataac taaaaatgaa accaggtgcg gttgtgaaaa tattactcgt 6481 tgaggatgat gaactgtttc gcttgggttt gcgaatgcgg ttacaacagg agacgggtat 6541 agaaatcgta gctgaggcgg aagatggtga acaagctgta gaactagcta atcgttatct 6601 gctggatttg gttttgctag acgttggctt accagggatt ggtggaatag aagcttgtcg 6661 tcaaatcaag cagcagcatc cagatttacc aattctagtt ttaacatctc gttcggaaaa 6721 acctttgatt gcgcggttaa ttgaagcagg ggcgcaaggt tattgtctca aaggaatccc 6781 atcagagtct ttaatattgg cattgcgttc ggtagcagca ggcgcttctt ggtgggatca 6841 cacagcaacc agagaaattc gagccgcttt tgggggaaac aacacagcag cgcccgtaca 6901 agataagcaa ccatcagaaa atccgttgac aaagcgcgag caagaaattc tggcgttagt 6961 agctgctggt aaaagcaatc aagaaattgc agaaattctc tatattgctt ctggtacagt 7021 aagggttcat gtccatgcga ttttgcataa gttagaagta cgcgatcgca cccaagccgc 7081 aatattggct atacagaaag gattagtagc aaaagaactg ctgcagaatt agtaacagtt 7141 atcagttatc agttaccagt taccagttat cagttatcag ttaccagtta ccagttacca 7201 gttaccagtt atcaggtagg aaacggactc gtccacccct tgttcactgt ttactgttta 7261 ctgttccctg ttaagcgttc cctgttccct aagcagttta cttgagccaa aatcagtcat 7321 cataaaagca taggcattga tctatgctgg gaaacctatg acaaatatcc cgctttttgc 7381 taatcgcact caagctgggg aacaactggc gcaagcaatt gacgctattt tgacccagca 7441 aattgctgat caagttacaa accctttaca aattgtctat gctttgccaa gaggaggagt 7501 accagtcgca gcagcagtcg cgcgtctcct caactgtccg ttgatgatag aggtggcgaa 7561 gaaaattggt catcccgaaa atcctgagtt ggctattggc gcggttactg cttccggaaa 7621 tgttatttgg gatcagcaca atgtgttttt tcgccgtaca cccaaatcag ggtggcggga 7681 agaagcactc gataccgcta ttagtcaagc taagtctctt caagctcaac taagtcctgc 7741 ttgtccgcag gtgaatactc aaggtgctat cctcatctta gtagatgatg gtatcgctac 7801 aggtatgaca atagcagtcg cggcaacatc tctcaaagca ctctcgccag cagaagtttg 7861 gttatgtagt ccagtagccc 
ctttggaatt gttaccctgg ttggatcagt ggggcgatcg 7921 cgtggttgtc ctatcaacac caaaaccttt cttcagtgtc agcaattttt acgtagaatt 7981 tcctcaggta gacacaaaag aagcttttga atgtctactg caacaaaacc aggacatcat 8041 aaattctcaa aaagaaattt gagaattcat agcgtgatca ttttttctct gtagccgacg 8101 tgtgtagtca accaaagctt ggtaacaatc tggttctcat tctaagaatc ttggagaaac 8161 ttcgctagct ctaaagtttc ttcatagtaa ttaattgctt taacataatc gcccaaggct 8221 tcacaagcaa ctcccaaact accgagtgat tgttcttcac tacgcttgtc tttcattgct 8281 ctggctaata ataaacgttg ctcataatag acaattgctt tcgcataatc cccctgggca 8341 taacaagcat tacccagatt ttttaacact tgaagtgcag tttggtgatt gttcagcgag 8401 taagctcttt tgatacactg ctcgtagtag gcgatcgctt tggcaaaatc ccctaaagca 8461 taccaagcgc tgcccaaatt ctttagaact tgctcttcgc cccaatgatc ttggagttcc 8521 cgcacaattt ctaggctttg ctgtttgaat tcaattgcct gatgaaggtc acccaaagct 8581 ttatagacca atcccaaatt attaagtccc gccacctgac ttcgcttatc tcctagttgc 8641 tgcgctattt tcaaacactt ttgtaggaac tcaattgctt tgttatactc acctaaatga 8701 cggtaagcat tgcctaaatg cgaaagcgcc tgcatcttga tgacttgcaa gtttgcgaca 8761 tcttttgtta aaagtaaaca ctgctttgag tgtgaaatcg cacttttgta gtctcctaag 8821 tggtaaacta tgtaagctaa acaagaaaga acttgcgctt gtttttcaat gtctccaact 8881 gcttgaaata acgcaaggga ttcttgtaaa gacctcatgg ctgcaagtga atctccagct 8941 tgctgctgtt gaactccttg tcgtaataac ctgtttgctt cttcatagtg gtcttgttgt 9001 ctgtggggaa ctaatatttc ctcatgtgaa tccctgaaat catcagagat actgacatgt 9061 tccctttcag ttaagcatgc tctaagtgat tccacgtaaa ctatgttttc gacctggaat 9121 tccattcgtt tttccacatc aacctacccc tgttaatagt ggtgatacat aatagtgttc 9181 ccaaaattag agtgcgtatc tcacgggaag atgaagaata taggaattca ggatcgtttt 9241 cgaggattag gggtcagtag gggttcccct tgttaaaaag attactgtgg agtcgtgaga 9301 gttatgacga tagaaagagt gttggaaccg gaagtcatgg atagtttaga agaagccatt 9361 gagtatgatg caatggactt catcaaagtc aatactgctt ttgctaaaga agcaagcact 9421 cttggaccaa aggaacaggg tctagtactc gatgctggta ctggttctgg tcgcattcca 9481 gttttactgt gtcaaatgcg tccccaatgg gaagtgatag ctattgattt agctcaaagt 9541 atgttgcaaa tagcatcaca acacattcag 
caggctggtt tacaacagca aattcgactg 9601 gaattagtag acacgaaaaa cttgccttat caagatgagc aatttgatat ggttgtgtca 9661 aatagtctcg tccaccattt gcccgatccg ttacctttct ttagcgaact taagcgtgtc 9721 ttaaaaccca atggtggtat tttcatccgc gacttatttc gaccagttga tgaaaccaca 9781 atcaatgctt tggttaatac tatcgcaaga gaatacaata ttcatcagaa aaagttattt 9841 cgtgattctc ttcacgccgc actcacacta gatgaagtga atcagttgat ttcacccctg 9901 gggttgcaaa gagtaaaggt ctaaacagtc attggactgc acaacgggct tggagtgatt 9961 gagttgggag ttgggagttg gaagtcgcaa gtcagaagcc ggaagtcgga atctcttagt 10021 gggggattgg gatctatatg tgctcacgat tctcgcttta ccccaccccg gttttgtcta 10081 acgccaaaac ctcccctccc cttattaagg ggaggggatt aaggggtggg gtcaaatcaa 10141 cgtgggtttc cccagttgtt gcaagagcgt aggcgtgcag gagatagtaa gtgtgctgga 10201 agctgatgac gacttacgcc acattcggct caactaaatt ttctccacct tgaatgactt 10261 tcgctgactc aattttgtca cccgccttga gtttttccaa aacatcttta ccttcagtga 10321 catagccaaa aacgccgtaa cgaccatcca acaagttgcg tcctgctgga gtcagttccg 10381 gttcaaacaa aaagaagaag acttgtgaag aaccaccatt gttatcagat tcgggacgtg 10441 ccaacgctac agtaccataa gcagagaaag gcagaacggg taaatcagtg taacgaccaa 10501 tctcctctaa agtgaaaccg taggtcggtt ctttatcacc ttgtacaaga aattctaggg 10561 gaatggcgcg gtatttacct gtttttggat caataaagcc gacatcttta cctggtgggt 10621 ctccagtttg tacaacgtag gattcttctg aacgggtgaa ttctaaaccg tcataaaaac 10681 cccgttgcac caagtccaca aaattcccag cagtgacagg ggcgctgtag ccatctacaa 10741 taactgtcag gtcgcctttg ttagttttca cttccacagt cgcacgacct ttgagttgag 10801 gtaaattgct gtatttggct ggcacttcaa agggaaattc cttcaccatt gactcttcta 10861 gtttgccaac aagactcagc aatttggcac gttgctctcg aacctgttct ttatcttttg 10921 ttttgacgac ttcttgcaaa gcatttacgc cggattttaa ttcagcaatc acagcttcag 10981 cttcgctttg acgttcttct ggaacatctg ccaaaagttt ggagggttta tcgagaattc 11041 gcgatgcttt gctgaggtct tgagaaacag caccccagcg tcgatttgct cgcaattgag 11101 tggtaatgtc ctctaaagaa gcttgcagtt gccgtacagg tttattatct ataggaagtg 11161 agtagcgcaa tagagcattg ccgtcagtaa tggcattccc cgctggtaac ccagcgctac 11221 tagaaggagt ccatgcggct 
gtacttattc ctaaaaatag agttaccagc agcagtgcta 11281 ttgggctgtt cttcagccag gatttcaata ttttgaacat aaaatggctc aaatgcagca 11341 tcaaattgtt gactgggcgt cacccaacaa tcatcttccc acagtacaag aagcactttc 11401 gctttcatat caagttacat ctttagaaaa cgtaaatcta gaaacccggt ttctcgaact 11461 cgactttagg cattatggca attttaaacg aataatcatg gcaaggtttc aacttttgag 11521 cgttcgcgaa cgcgagtgcg tgcgctttgc gctcagcggc tccctgctgg agcatcacct 11581 gagattattc tcaaatgcta tttggatcaa tacccaactc tctcaacttt gctgctagtc 11641 tttgttggtg ttgttcagca agttcccttt gttgtcgttc gctttgagca acttcatcag 11701 cagtcagata tcgcttacct tccttatcat gccaatacaa tacttctcgc ttcaccacac 11761 cagaagtata acttccacgt ccaattccca atcctacttc tggcatccaa aacggttctt 11821 gtatttgctg ttgatattct ccatctacca atttatacaa ttcaaagggt tgatgttggt 11881 cgcgtcgcca atactgagga ttataaataa cgtaatacaa aactcctagt tttgcataaa 11941 tttctagctt cttgtcatac tcatcccctg gtgttttgga aaccacttcc aaaacgaact 12001 tgggtactat ttcctcctct tcccagactg cgtagctcaa acgagacttt cctgctttgc 12061 gtcgttccac acccaagctg agaaatccat ctggtatgac tggtactaag tgactcactc 12121 ctgttgtgtg gtaaataccc atatcaatac caaaaaacca atcattccga tttgcccaaa 12181 tggattcgag gataaataat aacaggttgg ggagaaaatt ctggtcttcg ttatccacgg 12241 gtgtatcgtc tgaacatggt agttcatcgg tagttggtag ggtgaagcgt ggttcagatt 12301 gtaccatagg taggactttc aagtaattta accgatggat gtggtgcctg gtgaattcta 12361 ttctagaaaa tatcaggcaa tctgtttgta aaacctattg ttcttttgtg gtcagttaag 12421 taggtcaaca taaaaaaacg taaaatctag aaaccgagtt tctttaagaa acccggtttc 12481 tgacgccacg cactttatgt taaagacgat tcaaaccatt gtctactctt gcaaagatgt 12541 gcagaatagg aaattcatct gacgcggtta tcacaaccac gttacgtaca tttcctaata 12601 gtcgtcgcgt ctaaataacc ccaataaagg accaagaata ataccaaaca caaagccacc 12661 gatatgcgcc cagtaagcaa ctcctcccgt ctccacactc atattagcgg ctgcttgcag 12721 gttggcaaga ccagatatca cattctgaac aaagaaaagt cctatcacga caactgctgg 12781 aactctaatt gtcgtaaaga aaaaacctaa aaagattaat gatagaactc ttgtatcggg 12841 aaagcggata atgtatgcgc ctagaattcc agcaatagca ccacttgccc ccaaggatgg 12901 
aattccagaa ttgacaccta tgatccactg acataaagct gctaaagcac cgcaactcag 12961 ataaaaaatt aagtatttaa aatgacctaa gcgatcttca atattgttgc caaaaatcca 13021 tagatacagc atgttggaga gcaagtgcca ccaaccgccg tgtaagaatt gcgaggtaaa 13081 taaagtcgtc cactcaccag caaaattgtt tgttaactct cgtggtataa ccgcatacaa 13141 ctggaaaaat tgctctattt gtgcatttga tagactcacc tcatgaagaa aaactaagac 13201 gttcatgcca atcaacccat aggtaatata tggggtgatt cgtgttggat tttcgtcgta 13261 caggggaaac acaggtgttt ttcctaatca ctgattaact gcaatatatc ttgtaaaact 13321 ggcagttgct aagtagctgc aatggaaaat ccatatttgg caaacttaat cacaaatcac 13381 aagtgccaac tctattgagg gcacttgctt tgtactatct gtaccaagat tcttggcttg 13441 gtttgtcgct aaataaaccc agcaatggac cgagaattgc cccaaaaaca aaaccacctg 13501 catgtgccca gtaggcaata ccaccacttt ccatgccaac attagtcggt gtttctagac 13561 tggctaatcc gtaaaacgct tgttggataa accaaaatcc cagaaagaag tatgctggga 13621 cacgaaatgt ggggaagaaa attcctagag gaacgatgcc gagaatttct gcttggggga 13681 aacgcagaat gtatgcccct aacacgccgg cgatcgcacc acttgctccc aaggaaggaa 13741 ttgaagagtt ttgtgcaaag taccattggg ctagtgatgc caaaacaccg caacttatat 13801 agaaaaacaa atatttgaca tgacccaatt tgtcttcaac gttgttacca aaaatccata 13861 ggaacaacat attaccagct aagtgcagta aaccgccgtg caaaaactgc gaagtaatca 13921 aagttaacca ttctggcact ggttggttga ctgaaattcc cgcaaagctt gcagtcagtt 13981 ctcgcggaac aacagctgca aggtgtaaaa atccatctaa ttgtcgtgca ggtaaacttg 14041 cttcgtaaag aaaagctaag atattgacag caatcagtcc ataagtgaca tatggggtga 14101 ttgtcgtcgg attatcatct ctaataggaa ccacaggcgt ttttcccaat ttttgagcaa 14161 ccagcctcac attacataat tttctgtatt atttcttcta acctgtggca ggcttttagc 14221 acagtcaaca cttatcagtg aacagtaaag gagtcaggag tcaggagtta taagtcagaa 14281 ttctttattt cttccttctg aatcaagaat tgcaccaatt gctgaattct tcttcacact 14341 gataactgat aactggtaac tggtaactga tgactgctta ataattcttc atcgcccgtt 14401 gcatgtcacg ctgatcttga cgtcgtttga tgtcttcgcg cttatcgtga acttttttac 14461 ctctggcaag agctatagtt atcttcaccc aaccgcgctt gagatacatc ttcaaaggta 14521 ttaaagttaa accttgctgt tctactttgc caataagctt gcgaatttcc 
tcgcgatgca 14581 gcaacagctt gcgcgtgcgg cgcggttcgt ggttaaaata ctgcccactc gcattgtaag 14641 gcgagatatg ggcgttaagg agccatgctt gaccatcacg gattaagcca tatccatctt 14701 ggagattaac tttacccgca cgaattgatt tcacctcagt tcctgtcaat tcaattccag 14761 cttcataggt ttcaagaatt tcgtataaat aacgggcttg acgattgtcg gtaacaactt 14821 tgtaaccttg gctattttca ctcatttaga attctacgta tgtttaatgt gttagaaaag 14881 atagtgtatg gtcgtttgtg aatttatgtg gggatgttaa accgccatca atattcaact 14941 gaatatcagt gttatcctac taatttagct tttttgtact gttttaggtt gtttgtattg 15001 ttaattaaac ttttgcaaca gttaagcgcg tatttccggt ttatcatatc tttttattta 15061 gcttatttct actacgtaat aaaaaatcaa aaatgtttaa caaacacaaa ctcagtattt 15121 ttcttctgta atggttggga taattttgta cttcttgagt gtgagnnnnn nnnnnctcaa 15181 gggttgagga agttaaatca tgaaacgtca attaactaaa gtcatcggca ccagcgttat 15241 tgctttaagc atggcattgc cctttagtgt acctgctttt gctcagacta caactgaatc 15301 cggagctacc accaccgctc ctaacacaac aactgacaca agaaacacaa gaacttataa 15361 tgacaatgat tttgattggg gttggttagg attacttggt ttacttggtt tagctggttt 15421 agctggcaga aagcgccacg aagaaccaac ccgctatcgt gaccctaata ccactgttgg 15481 caccactacc tacagagatt aattgtagtg tctgtccagc gatcaagtac aaagaatgaa 15541 gcaacaaaaa aacatgaagc cttagatgag gtatctaagg cttatgtcaa gcttattgat 15601 caaggacagt tagggtgcaa aagactgtcc ttaactattg taaacaaaag ttagctagct 15661 gtaggctctg tgttatgtga attgaattct ctgtcccgac cttggggtat agatttgtaa 15721 tagcaagcaa ttaagccaac catcagccac catacggtat tgacttcagg acgaaaccag 15781 acggtatcaa ctaaattgtg ggcaagtgta ccggcgagag ttgcgatcgc ccccattatc 15841 caaaatccat ccacactttt gatttctcgt agccgttgca actgcaataa gccagtatta 15901 aatattacaa ttagcagcca aagaaaacaa gcgaaaccaa caaagccagt ttctacaatc 15961 acctccaaga aaatggaata ggcactcaaa gcgctgtaac gggggagttg atagagagga 16021 taaactttgt taaacgcaga atgaccaggt ccaataccag tgattggata atcctgaatc 16081 atcttgaaga cagcagtcca aacatttctg cgaaaattat tactactatc ttgcctatca 16141 gcaaaaatac tgaatactct atcgcgtact ggttcaacaa agaccacagc taacaccaag 16201 acgcaagtca agctactcaa aagaatccac 
ggcaaaccag tgcgccagaa aggaggcatt 16261 tgcacgctcc accaatagta cagcaaccct attaagatca gaactgcaac gactagccca 16321 atccagccac cacgactaaa agtcagaatt aggcatgaac cattaacaac acacatggtt 16381 aatgccaatg cttttttgaa gccgctagcc cacgcaaaaa ctgcgactat gcttaaaacc 16441 accgcaggca aaaggtatcc agccagcaag ttaggattgc ctaaataact ataaaccctt 16501 gtggtctttg acatactaga tgttggatca acccacgtcg ccagtggtgg tgctccaaaa 16561 aaccattgtc gtaacccgta tacacttaca atcagagaga tgtgcaagta cagcgtaatc 16621 agccattggc gaaagcgagg caatctcaga actctagcac agagggcaaa caggagcaaa 16681 tagagtgtaa aatttctcaa atcggtcaac gctgccttct tcacaggtga caatgctgta 16741 gctacagtag caactcccca ataaagcagt accagcaaat gaatcggggt aacacttgag 16801 gcatttggtg ttgtttcatc tgataaagtt aataacagcc aaaatccggc acaagctacc 16861 aatatcaatc ccaataggtc attggacaca aaaggtgcaa gagtatacac caggctgagt 16921 aaaacagctg ctattgtttc tccccactgc atcaatagac tactttgtcg ccaagaatgg 16981 aacaacccca ccaaagaccg gtgtaggtaa ctcgtgccaa gatattgtct tagaggcaga 17041 taggataaag tgaatcgttg ccaaactaaa ttcatagtat caagctgtga tgaatgacca 17101 ctcccgtgta gaacttcgac ttaatgatga taatctgggt attttaaaaa gttcataaca 17161 cagattgcaa gttattattc attcgttcct aactgccatc cgccattgtt gatcatttga 17221 acaatttgca cttcattgct cattgaacac agctagtcta gtttgacatc aacacaagca 17281 ttggcttcag tcaaacatcg ctgtttgctt ctgctgtgtc taaattcaat ggcaaactca 17341 gtacataacg atatcctgat tgtgatgaac cttgaatagc aatttgtcca ccatgtaact 17401 ctgccacatg acaactcagt aaaagtccta ggctttcacg agaaagataa ccgtgaggct 17461 tggttgcact agtgcttgac atattcacga ttaagtctcc tgtttggtct tggttctcca 17521 aatgaccact atgagtacgt gtcaattctt gcaatggtaa tgttctgcta tgagaaaata 17581 aagggtcaat ttcagttatc ccctcaccta accaaggatg tgacacccag acagtgatgt 17641 ttagtgtatc gtctttggaa gaaacatgaa tgcgaacagt actacctgta gctgaaagtt 17701 gaatcacact gaaaaccagg tgatatagga tttgtcgcac cttgtcttta tccaaggagc 17761 aaatgcggtt gcgttctggt tctatagaca gacgaatatc ttgctcacgg cggttagctg 17821 cttcttctaa agtactgata gcctgttgac acagcatttc aatatctata ggagttaaat 17881 ttaatgctgt 
tgggttttcc tccatagtcc ccatctgaga aatttcgttc accaatgaga 17941 gcaagtaccg accactatgt tgaataatat ccacatattc tctttgttta tttgtcaacg 18001 gaccataaat ctcacgtccc aaaacactag ccatacctag gacagatgtt aagggagtac 18061 gtaactcctg aatgagttgt cctaacagtt ccaatttcaa ttggttggta ataactaagt 18121 ctttttgtat agatgggctg attttgattt cagtagtatt aacaacattc ttatctgaca 18181 agttagaagc actactaggg acactgctac ttgaagttgt ttccaacaga cggttacgct 18241 caaactcact catactccaa cgggcaataa tttgtataaa ttcaatatct ttatctgtga 18301 aatcgcgggg ttttatgtcc attactgcta atgtacccag acaatgccca caagcatcaa 18361 acagtggcgc tcctaaataa gaacgaatac cataatcctg aatcaacttg ctgtaagcag 18421 aactgtctgc tctcgtcaat ttctccgtat cattaatgac aagaggttgt gaatcctcta 18481 ccaccttggt acaaaacgat tcgtgacgta acagttcacg gctttgtgcg agatgattca 18541 taattcctcc ccgcagctta gacaagccca gagatgattt gaaccagtgg cgttcttgat 18601 ccacaaatcc caaaatgcaa atttgtgctt ccaataagtg ggctgcagtt tgagtagctt 18661 cttcaaaaac tggaattgtc tctggttggc gcaaacctaa atctgataaa actttaaggc 18721 gttgtcgttc tcccatttcc tgggaaaccc agacatcccc tggggtaaac aatttgtttt 18781 cagactctac cattgtcatc gctaccttaa caatctctta ctagagtaat ttatataacc 18841 actgtttctg ggttatttcc tatccttaag attccctaat ttgctcctaa atgaacacaa 18901 ttagaggaaa aattttttat ttttttgttt atagctgtat aagtaaacac tcttaagtaa 18961 cttccttgga tcacaaggta atggagtctt cctgaatcca tatttataat gagctactgt 19021 taccatcaag ataactagta taataaacat aaaattttaa gtattaacac ttgcgtgctg 19081 acgatcaaca tatcacaaga aactacaacg cttctcatga atgaaaagaa atctattttt 19141 tatgttttta tactaaacca gggttcgttt agaattggat ttgggcaact attacatccc 19201 acattccgcc tctggaactt tagcatagta taatcactgt taaaatgatt gattagtttg 19261 tattaaataa taatt // LOCUS NODE_1708_length_19159_cov_5.266122 19159 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 19159)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 19159)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference
                                                 protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..19159
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..512 /locus_tag="DP116_14870" CDS <1..512 /locus_tag="DP116_14870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="thiazole synthase" /protein_id="PRJNA477356:DP116_14870" /translation="LGQEDNNFVKLEVIPDPKYLLPDPIGTLQAAEQLVKEGFAVLPY INADPMLAKHLEEAGCATVMPLASPIGSGQGLKTTANIQIIIENAKVPVVVDAGIGAP SEAAQAMEMGADALLINSAIALAQNSPAMAHAMNLATVAGRLAYLAGRMPLKTYAIAS SPLTGTITS" gene 690..869 /locus_tag="DP116_14875" CDS 690..869 /locus_tag="DP116_14875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321164.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ssl1498 family light-harvesting-like protein" /protein_id="PRJNA477356:DP116_14875" /translation="MPYTTEEGGRLNNFAREPKVYKAEAPSDGQKRNYVILGVTAAIL VMGLIFVAFSVSSVS" gene complement(978..1163) /locus_tag="DP116_14880" CDS complement(978..1163) /locus_tag="DP116_14880" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14880" /translation="MAAKEPKRSGYTLLGCMQAVCKSNHDYDFQLILALLRINLFFER SKKLSIQLHFKQRNYFK" gene 1162..3315 /locus_tag="DP116_14885" CDS 1162..3315 /locus_tag="DP116_14885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868141.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="PRJNA477356:DP116_14885" /translation="MSVNESTHSDNFAWNRQVYHRLKLALSLGLRRQIFLAVCDDLNL RNQVAARLHSTLAYPVGQVLYQPSNTQEASTPAYPRLVTLRLNLSDPNLIAQINQWLS NYPPPIVGASKDNPGRSLPVPAFQIVGVEQLTRQPVAVQRLFLHYLRLTEQQLSNFES SLLLWVPRPWLYAIQQSAPQFWRYRTGVFVFAGEPTPTTQNKTSPERFSGSRSLELGN VEQPILEESVIQDDFDFPTQTLVNSGQKPQEVPPSKSEKLTNKPLQEEPTDKLSTNDS SLPQQTSHISKELTELVLATINTTIAQDDEQNLQVKQILEEIEQLHTQQASNVKLAAA YHRLGNLYRLRIEQGQSTLENLMVAILTYQESITHDDDSPQVPDTLNDLGTLYWMLYR TPPNSEEGQAYIEQGIEFYHLALKLISPESHPETYARVQNNLGTAYGDLARFTNPAEN WQQAVLAYSEALSLRTDHIEPLKYAACQNNLGTAYWHLAQYNQPVEHLKKAIAAYKLA LSYYSPEEEPLKYGMIQNNIGTAYWNIAQYEQPAENLHFAIDAYREALKYRTPANVPP ACAATQNNLGTAYWHLANQSQTSKDERQKLLQQCISSYEEALSLAHSLSGMALNFDVL ATHNNLGLAHYQLATDQYFDGDKASRSKHLEAALDNHLQALNGMSKQPEAYQTTFGYV VKTIRAFHNELGIQGQNLALSKVPGQLLPEILPKL" gene complement(3356..3640) /locus_tag="DP116_14890" CDS complement(3356..3640) /locus_tag="DP116_14890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138295.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14890" /translation="MNKIQLALVVSYLLITCYFLMNWLIFCIRHPNSSPEGKFLNLVM LVITTSFWFVIIPISCLEILKTRKLEVSIVVPVIVALSAFSMYLYTFLTS" gene complement(3920..5683) /locus_tag="DP116_14895" CDS complement(3920..5683) /locus_tag="DP116_14895" /inference="COORDINATES: protein motif:HMM:PF01432.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oligoendopeptidase F" /protein_id="PRJNA477356:DP116_14895" /translation="MSGLYPDIESAEFTSSLQVSIKAIDEFEALCNQYHVGYPLPTIL DKNTVARLEKVLVHYNETLETAEPMHNYLRCLMAVDSFNQTVRKHWSTFQVVYARLSL LKTRFSAWLGSLDITQLIADSPLAQEYSFVLRQSQKQSQYQMSAQEESLAAELQITGS QAWLDLYNQLTRQRRVLVEIDGETRSLLPTEAQQLGRHHDREIRRRTYEAEIASWEQL AVPLAASLNSIKGETLTLVKRRGLASPLEIALNRDHINQTTLDAMLIAVREALPDFRR YLHIRAKVLGLPILAWYDRGAILNEQGEVWSWKRAVNLIIEQFTAYSPQLGQMAERAF RDKWIDALPRAGKDSNGFCLPQRGDESRILVNYTPVFKEVSTLAHELGHAYHYMKLAQ RPMLQRTPVPLTLAETASIFCETLVRQGALLNTNKGEQINILDAYLSSACNLVVDVYS VFTFEQRLFEQRKHTTLSVDALNQMMLTAQQEAYGDGLDSNMLFPYQWGRMLLYYIET FYNYPYTFGLLFGLGLYACYKAEPESFRTKYDDLLTLTGMDCAAELAARFNINICTPD FWRLGLNLIQQDIERFQVLVS" gene complement(5756..6682) /locus_tag="DP116_14900" CDS complement(5756..6682) /locus_tag="DP116_14900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318270.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14900" /translation="MDSKSDNSESGKSVIAREAVDWLLQRIFQLPKKWLIISIILILI STCEIGWSKKELFSFKFRVTNTTAIFLSLIWLPSILKVFALSGGALKTPLGEIGGSGM KTMLESLTGDSLGFLIEQTKRAEEVAPPKQQEEMRQIRREWQKVYASTVPVSDARQEM EGLAERYKQLRITMSSGPQRTFEMESVTGQMRALAPEVKYSVQEVKDLLQSPDQGKRL LGLSVVEWSGDAIYFDLVLHLISHSETAFEQTSALRSVGKMVSKLDTHQKKNLQAAII EQRNFNEKEQRWIKPNSNRWVLSDRILSALNE" gene complement(6723..7376) /locus_tag="DP116_14905" CDS complement(6723..7376) /locus_tag="DP116_14905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128662.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RES domain-containing protein" /protein_id="PRJNA477356:DP116_14905" /translation="MTFRFWGPLHRFDHHLGECPCVGVAQENNDTCGLWTEEASSPTE ETAIPQKHSVPTCYKPLERKACDDLERGIYYAAPVDLLSSCLVEVFGDAGCIEITDHQ IAIITTTSRIKLLDVRGRGAMRAGANDATLAKTDKRGISQAWSRYFYEQKEIYPDIQG IIYRNAHNDEEAIAIYESAKHLLTCLPENVFSLKHELFRGLILKAAEDNNLEVERYW" gene complement(7779..8396) /locus_tag="DP116_14910" CDS complement(7779..8396) /locus_tag="DP116_14910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652958.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14910" /translation="MTGISVTIDAPCDKDVMQQALGSIARGFQDYLNSLQPDEIKNLA QQMDSTPSKAKYSKAETEFAAKLGANKISDQERGKLEFAALTRNFKWRQELLKSSLTA PQVAQMLNTTRQTPHDRLKKNSLIAVQDNGVWKFPTWQFDPQGPDGVIAGLPDVLNAL NVPAMSKISWLTRHNKALHGLTPIEALKNGQKHEVIAEACCVGVY" gene 8599..10425 /locus_tag="DP116_14915" CDS 8599..10425 /locus_tag="DP116_14915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_14915" /translation="MSRRSLQASAEGMKKAQAALIRNSLTQQALAEELSISRQPVSKF FQGKPVDRYIFVTICEKLGLEWDEIASFCSLPDVDDAASTSQIESLVQTVREKIHDNI HRRCNIIRLLDKEQPVRLESIYTDINILERVTGRRRLSLAELHDNCDKNNIYQKRVSG LEAVRHSVGRVPRLEGSAVSASARPEGEREASPFGAAVPQAQEIPVRVERYNKHWIWG KPGSGKTTFLKWIATQCNLGQFLCNAVPIFITLKDFAQTRDQPSLLDYITAQFEECGV VEPQAVLTLLSRGRSLLLLDGLDEVRKTELNRVLQEIRNVSSRFCANYIVITCRIAAL EYTPEEFTEVEVADFDDQQIADFAAKWFQNQDTLKAEHFVQRLFANQPLGELATHPLL LTLLCLVFEETGDFPTHCSQLYQEGLDLLLKKWDAKRGIERNQVYKQLSRQHKEDILS QIAWTTFERGEYFFKQSAVEQGISHHIQTFPEVSTFFETLQVDSEDILKSIEAQHGLL VERARGIYSFSHIIFQKYFAARKIVASPPSHAEEAFQNLVYHMNEKRWREVFLLAVEM LPNPDYLLQLMKRQLDNLVAGDETQYDNAKKLLIDCLNNSSC" gene 10777..11694 /locus_tag="DP116_14920" CDS 10777..11694 /locus_tag="DP116_14920" /EC_number="2.5.1.130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-carboxy-1,4-naphthoquinone phytyltransferase" /protein_id="PRJNA477356:DP116_14920" /translation="MTTVIDKDNSSRRKLWRAAIKLPLYSVALIPLWVGTTVAIAETR NFNGTNFLIFLFASICIQVWVNVSNDVFDAETGVDVNKLHSLINLTGKETSNFWLWFG NSFLIVGILTTSLLAFWQKDVTLLILVLLACLLGYSYQGPPFRLGYKGIGEIICFITY GLLSVSAAYYNQNPTWSLTALAASVIVGLATSLILFCAHFHQVEDDLAGGKYSPVVRL GTKKSARLLSWWGYGIYFLIAVFVLLKIFPLLSLLSFVSLFYALKLFRHVNNYHDQPN QVSNCKFIAVSMYLSLGLLLGIGFLLPTV" gene 11937..12677 /locus_tag="DP116_14925" CDS 11937..12677 /locus_tag="DP116_14925" /inference="COORDINATES: protein motif:HMM:PF01209.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14925" /translation="MSQNTNYSSVTPAQATKLEGDDKRNYIRNLFDDIATRYDFLRTL VFLGHTSLWYRQALRDLELQPGEKILDVGCGTGESTRCLNRFYPGIQIEGMDLSPGML TVARSMDADSNYFEGDVCSIPRPDCTYDVVVTAFTFRNFPNREMSLAQMLRVLRPGGR LLILDHFYPEKPVLWRNIYTIWMSKIVPQIVRPFIADTTPYRYLAQSIINQLKMPDFI QLIESSGAKVIKTNTYTGGAAGRLIAVR" gene 13065..13673 /locus_tag="DP116_14930" CDS 13065..13673 /locus_tag="DP116_14930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009629927.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_14930" /translation="MLKVYGFNESGNCYKVKLLLKQLCRQFEWVNIDILKKENRTPDF LAKNPHGKVPLLETETGTFLWESNAILYYLSEGTDFLPKNRLERAQTLQWLFFEQYSH APNLGVARYITRYLGTPSEYQQTLTAKRELGYVALDIMEKHLTTRHFFLGDRYTIADI ALYAYTHVADEGGFDLTNYRFIKTWLELVRTQANHITMNSLE" gene complement(13884..14756) /locus_tag="DP116_14935" CDS complement(13884..14756) /locus_tag="DP116_14935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_14935" /translation="MKVAFLGTGLMGLPMAQRLLEANVQVIAYNRTPEKLEPLKAAGA EVVTRPYQAINAADCTILMLSNAGAIYSVLLSDRSWQTVAGRSIIQMGTITPTESKEI RDAVVAAGGEYLEAPVLGSIPEVKDGKLIVMVGAHQEQYQRHLELFKHFGPEPLLVGP VGSASALKLALNQLIASLTTAFGLSLGFAQHQGINVDLFMYVLRQSALYAPTFDKKLQ RMLEGNYANANFPTKHLMKDVDLFIAEAKSTSLNLSSIEGVRDIIDMAMKMSFSDGDY SSIFSAINASTDGA" gene complement(15210..16451) /locus_tag="DP116_14940" CDS complement(15210..16451) /locus_tag="DP116_14940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318275.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_14940" /translation="MPSNQELCKAIAHRITTSPQKRITFAEYMDMVLYHPETGYYSTK ALKLGKEGDFFTSVHLGADFGELLAVQFFQMWEVLGQPVPFSLVEMGAGQGLLALDIL KYIKQQYPDFFDVLDYVIVEKSSVLREEQQQHLQEFSVRWLNLEEIPSQSVTGCFFSN ELVDALPVHQFVLEEGRLQEIYVTTQEEGTGNREQGTGNREQSKMSLSTSTPALSFVE VADTPSTAKLEEYFELVEINISSYPDGYRSEVNLAALDWLSLVADRLQRGYVLTIDYG YSASRYYNPMRSQGTLQCYYNHQRHNDPYINIGRQDITAHVDFTTLERWGERCGLDKV GFTKQALFLMALGLGERIAALSYTDQPISQLLHRRDVLHQLLDPLGIGGFGVLVQAKG LTKQEAAQPLKGLSVPEQMQI" gene complement(16743..17927) /locus_tag="DP116_14945" CDS complement(16743..17927) /locus_tag="DP116_14945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADPH-quinone oxidoreductase" /protein_id="PRJNA477356:DP116_14945" /translation="MARLETRTEPMVLNMGPHHPSMHGVLRLIVTLDGEDVIDCEPVL GYLHRGMEKIAENRTNIMYVPYVSRWDYAAGMFNEAVTVNAPEKLANIPVPKRASYIR VIMLELNRIANHLLWFGPFLADVGAQTPFFYQFRERELIYDLWEAATGYRMVNNNYFR IGGVAVDLPYGWVDKCLDFCDYFVPKIDEYERLVTDNPIFRRRVEGIGTITREEAINW GLSGPMLRASGVKWDLRKVDHYECYDDFDWDVQWETVGDCFARYVVRMREMRESVKII RQAIKGLPGGPYENLEAKRLAAGKKSEWDAFDYQYISKKISPTFKIPKGEIYSRIESG KGELGIYLIGDDNVFPWRWKIRAADFNNLQILPHLLKGVKLADVVVILGSIDIIMGSV DR" gene 18124..19020 /locus_tag="DP116_14950" CDS 18124..19020 /locus_tag="DP116_14950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198475.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="16S rRNA (cytosine(1402)-N(4))-methyltransferase" /protein_id="PRJNA477356:DP116_14950" /translation="MEHDVQGMEVQDFFHVPVLSRQVIDGLAVRPGGHYLDATVGGGG HTRLILEAAPDVHVTAIDQDEDALAAAQKILGALGERVKFIHSNFANYEFPLIKFTGI IADLGVSSHHLDTSERGFSFRHQANLDMRMNRRQSLTAADVINDWDEAELANIFFKYG EERLSRRIARRIVQQRPFHTTTELAEAIASCVPPKYRYGRIHPATRVFQALRIVVNDE LKSLETFLSKAPQALIPGGRIAIISFHSLEDRIVKHSFRDSPLLQVLTKKPIEAQEEE IVNNPRARSAKLRIAQRVEEVR" BASE COUNT 5560 a 4096 c 3998 g 5505 t ORIGIN 1 ttttaggaca ggaagacaac aacttcgtta aattagaagt gatacctgac cccaagtatc 61 tgctaccaga ccccattggc actttacaag cagcagaaca actggtaaag gaaggttttg 121 ctgtgttgcc ttatatcaat gcagacccga tgctggcgaa gcacttggaa gaagctggct 181 gtgcaacggt gatgccttta gcctcaccca ttggttctgg acaggggtta aaaacgactg 241 caaatattca aatcatcatt gaaaatgcta aggtgccagt ggtggtagat gcgggcattg 301 gagcgccctc agaggcggcg caagcgatgg aaatgggcgc agatgctttg ttaattaata 361 gtgcgatcgc cctcgcacaa aactcacctg caatggctca tgccatgaat ttggcaacag 421 ttgctggtcg tttggcttat cttgctggca gaatgcctct caaaacatac gctattgcca 481 gttcaccttt gactgggacg attactagtt aatcataaat caacctcgca ggagtcaggg 541 aaattgtctc tgactccctc ctcttcttag tgctttgtta ctaatgatta agcactaaag 601 actaagataa agatttgtat catatgttga agagtgaatc taacgtttat gttaaattaa 661 atgaatgaac aggataagga attatactca tgccttacac aacagaagag ggtggtcgtc 721 taaataactt tgcccgtgaa cccaaagttt ataaagcaga agctcctagt gatggtcaaa 781 agcgaaacta cgttatcctg ggagtgactg ctgccatttt agtgatgggc ttgatttttg 841 ttgctttctc ggtttccagc gtcagttgat aaaaagtgat attaaaaact tagtaatttc 901 attttttaag ctttttcagg ctcacagttt gatcggagcc tggtttattg ttgtttataa 961 taaatattta taataagtta tttaaaataa tttctttgtt taaagtgtaa ttgtatactt 1021 agcttctttg atctttcaaa gaatagattt atcctgagca aagcaagaat gagctgaaaa 1081 tcatagtcat ggttagattt acagactgcc tgcatgcacc ccaagagggt atatccactg 1141 cgttttggct ctttagccgc catgagcgta aatgaaagta cacactcaga taattttgcg 1201 tggaatcggc aagtgtatca tcgcctgaaa cttgctctca gtctcggttt aagacgtcaa 1261 atctttttgg cggtgtgtga 
tgatttaaac ctgagaaatc aggtggcggc gcgtttgcat 1321 tcaactttgg cttatcctgt tgggcaagtg ctgtatcagc catcaaatac acaggaagct 1381 agcacaccag cctatccccg attagtcacc ttgcggctga atttaagcga tcctaatctc 1441 atcgctcaga taaatcaatg gttatctaat tatccaccac ctattgttgg agcatcaaaa 1501 gacaatccag ggcgatcatt accagtacca gcatttcaga ttgtaggtgt ggaacagctc 1561 actagacagc cagtagcagt acagcggtta tttttgcact atctgcgttt aactgagcaa 1621 caactgtcta actttgaatc cagtttactc ttgtgggtgc cgcgtccttg gttgtatgct 1681 atccagcaat cagcaccaca gttttggcgt taccgtaccg gtgtttttgt cttcgccgga 1741 gaacccacac cgacaactca aaacaaaacc tcaccagaac gtttttccgg ttctagaagt 1801 ttggagttag gcaatgttga gcaacccatc ttagaagaat cagtcattca agatgatttc 1861 gattttccaa cacaaacatt agtgaatagt ggacaaaaac cgcaggaggt tcctccgtca 1921 aaatcagaaa aattaacaaa caaacctctt caagaggaac ccactgataa actcagcacc 1981 aacgatagct cattacctca gcagacatct catattagca aagagttaac agagctagta 2041 ctcgcaacga ttaacacgac aatcgcccaa gacgatgagc agaacttaca agtgaagcaa 2101 attctggagg agatagaaca attacacacc caacaagcat caaatgtcaa gctggcagca 2161 gcttatcacc gcctaggaaa cttatatcgc cttcgtattg agcaaggaca gtctacctta 2221 gaaaacctga tggtggcaat tcttacgtat caagaatcca tcactcatga tgacgattca 2281 ccccaagttc ctgatacctt aaatgacttg ggtacactct actggatgct ttaccgcacg 2341 ccacccaatt ctgaggaagg acaagcttac atagaacagg gaattgaatt ttatcatttg 2401 gcgttgaagc tgatttcacc agaaagtcac cctgaaacat atgctcgtgt acaaaataat 2461 ttagggacag cttatggtga tttagctcgt tttactaacc ctgctgaaaa ctggcaacaa 2521 gcagttttag cttacagcga agcactctcc ctgcgtacag atcatataga accgttaaag 2581 tatgctgctt gccaaaacaa tttaggcaca gcttactggc atttagcaca atacaatcaa 2641 ccagttgagc atctcaagaa agccattgca gcttataaac tcgcgcttag ttattacagc 2701 cccgaggaag aaccgctgaa atatgggatg attcaaaaca atataggcac agcatattgg 2761 aatattgcac aatacgagca accagcagaa aatctgcact tcgctattga tgcttaccgc 2821 gaagcactga aatatcgaac tcccgctaat gttcctcccg cttgcgccgc cacccaaaat 2881 aaccttggaa ctgcctattg gcacctagca aatcaatcac aaacttccaa agatgagcgg 2941 caaaagttat tacaacagtg cattagttca 
tatgaagaag cgcttagtct tgctcattca 3001 cttagcggta tggctttgaa ttttgatgtg ttggctactc acaataatct cggacttgct 3061 cattatcaat tagccactga ccagtatttt gatggtgata aagcaagccg ttctaaacat 3121 ttagaagcag cattagacaa tcacttgcaa gccttgaatg gaatgagtaa acaacctgaa 3181 gcttatcaaa caacttttgg ttatgtggtc aagacgattc gtgctttcca taatgaatta 3241 ggcatacagg ggcaaaattt ggctttgtct aaagttcctg gtcaattgtt accagaaatt 3301 ctgcctaagt tgtaatgagt gagttgttag ttgttagtgg gtcaaagcaa ttttcttaac 3361 ttgtcaaaaa tgtatacaga tacatgctaa acgcagaaag cgcaacgata acaggtacaa 3421 caatgctaac ctctagtttc cgtgtcttca agatttctag acaagagata ggaataatca 3481 caaaccagaa gctagttgtg ataacaagca ttaccaaatt taaaaacttt ccttctggag 3541 aagagtttgg atgccgaata caaaaaatta accaattcat caaaaaatag caggttatta 3601 gaaggtagct aacaaccaaa gccagttgaa tcttgttcac ggtaggcact cctgaagcta 3661 actctatctg aaaaagaata cttaaagcgt taacttttcc gaaaaagtta cattttgtaa 3721 atggattttc aaatcaaaaa ctattttttg gccgagaaat tagcctaaat actagcaggt 3781 ttctggcttt attaaaaagt aagaaagtta ggttatttct cttaaaaaag taagaaaggt 3841 aggaaatttg agagtttgga aattaggact tatatatagc agtgacaact ttggcgaagt 3901 cggggatctg tgtctctcct tagctgacga gtacttgaaa gcgttcaata tcctgttgta 3961 tgagatttag tccaagacgc cagaaatcag gtgtacaaat gttgatatta aaacgtgctg 4021 ctagttcggc agcacagtcc attcctgtta aggtcagcag atcatcatat ttggttctaa 4081 atgactctgg ctctgcttta tagcaagcat acagtcccag tccaaacaaa agtccgaaag 4141 tataggggta gttgtagaaa gtttctatgt aataaagtag cattctgccc cattgataag 4201 gaaagagcat gtttgaatcg agtccatcac cgtaggcttc ttgttgcgct gttagcatca 4261 tctgattgag tgcatctaca gaaagtgtgg tgtgttttcg ttgttcaaac agccgctgct 4321 caaaggtaaa gacgctatat acatcaacaa ctaggttgca ggctgacgag aggtaggcat 4381 ccaaaatatt gatttgttca cctttattag tatttagtaa tgcaccctga cgtactaatg 4441 tctcacaaaa gatgcttgcc gtctcagcca aggttaaggg aacaggtgta cgttgaagca 4501 tgggtcgctg tgccagcttc atataatggt aagcatgacc aagttcatgg gctaatgtac 4561 ttacttcttt gaaaactggc gtgtagttga cgaggatgcg cgactcgtcg ccgcgttgag 4621 gtaagcagaa accattagaa tctttcccag cacgagggag 
agcatctatc cacttatcgc 4681 gaaatgctcg ctcagccatt tgaccaagct gaggcgaata ggctgtgaat tgttcaataa 4741 ttaaattgac agctcgcttc cacgaccaca cctccccctg ctcgttcagt atcgcccctc 4801 tgtcatacca agcaagaatt ggcaaaccaa gtaccttggc gcggatgtgc aagtaacgtc 4861 gaaagtcagg caacgcctcg cgcactgcaa tcagcattgc atcaagtgtc gtttggttga 4921 tatggtctct gttgagagct atttctagcg gtgaagctaa cccacgccgt ttaactaaag 4981 ttaatgtttc acctttgata ctattgagag atgctgccag aggcactgct agctgttccc 5041 atgaagctat ctctgcttcg taggttcggc ggcgtatttc gcggtcatgg tggcgtccca 5101 gttgctgtgc ttctgtgggt agtagtgaac gagtctcacc gtcaatttca accaataccc 5161 ttcgctgtct ggtcaattgg ttatacaagt ctagccatgc ttgactacct gtaatttgca 5221 gttcagctgc tagagactct tcctgtgccg acatctggta ttgggattgc ttttggcttt 5281 ggcgcagtac aaaagagtac tcttgtgcta aaggtgagtc agctatcagt tgtgtgatgt 5341 ctagtgaacc taaccaggca gaaaaccgcg ttttgagaag tgataagcgg gcatacacca 5401 cctggaaagt actccaatgc ttccgtactg tttggttaaa agaatctact gccatcaagc 5461 aacgaagata gttatgcatt ggttcggcag tttccaaagt ttcgttgtag tgaaccaaaa 5521 ctttttctaa gcgtgcaaca gtgtttttgt caaggatagt tggaagcgga taaccaacat 5581 gatactggtt acacagagcc tcaaattcat cgatagcttt gatggaaacc tggagtgagg 5641 atgtaaattc agctgattca atatcgggat acagtccact catatcccaa cacggcagtg 5701 tttgtgtgtt tatatgaatc accaatttgt ctcctcaaaa tgctgcaagt cacacttatt 5761 cattaagtgc tgataagatg cgatcgctca atacccaacg attagaatta ggcttaatcc 5821 aacgctgttc cttttcattg aagttacgct gttcaataat tgctgcttgc aaatttttct 5881 tctggtgagt atccagctta gacaccattt ttcccacaga tcgcaatgca ctcgtttgct 5941 caaacgctgt ttctgaatga ctaatcagat gtaaaaccaa atcaaaatat atcgcatcac 6001 cagaccactc gacaacagaa agtcctagca agcgtttccc ttgatccggt gattgcaaaa 6061 ggtctttaac ttcttgtaca ctatatttga cctctggagc taaagcacgc atctgtccag 6121 ttacagactc catttcaaat gttcgctgtg gtccggaaga cattgttatt ctcagttgct 6181 tgtagcgttc agctaaaccc tccatttcct gtctggcgtc tgacactgga actgttgagg 6241 cgtaaacttt ctgccattct cgacgaattt gacgcatttc ttcttgctgt ttcggtggag 6301 caacttcttc tgctcgctta gtttgctcga tgagaaatcc gagtgaatca 
ccagttaatg 6361 actctagcat agtcttcata cctgagccac caatttctcc caagggtgtc ttgagtgctc 6421 ccccactcaa agcaaaaact ttcaatatag atggtaacca aatcagcgac aaaaaaatag 6481 ctgttgtatt cgtcactcta aatttaaatg aaaataattc ttttttgctc cagcctattt 6541 cacaagtact tataagaata agaataatac taataattaa ccatttttta ggaagttgga 6601 agatgcgttg cagtagccaa tctactgctt ccctagcaat tacactttta ccagactcag 6661 aattatcgga tttactatcc atactcttgc ttctttgttt tgactttcgc agagcgtgag 6721 aatcaccaat aacgctctac ttcaagatta ttatcctctg cagcttttag aattaaaccc 6781 cggaacaatt catgttttag tgaaaaaacg ttttctggta gacaggtgag cagatgttta 6841 gcagactcat aaattgcgat cgcctcttca tcattatgag cattacggta aataatcccc 6901 tgaatatcag ggtatatttc tttttgctca taaaaatagc gcgaccaagc ttgactgata 6961 ccacgtttgt ctgtctttgc cagagttgcg tcattagcac cagctcgcat tgcaccacga 7021 ccgcgtacat ccaataattt tatgcgactt gtggtagtga tgatagcaat ctggtggtca 7081 gtaatttcta tacagcctgc gtccccaaaa acttctacta aacagctcga aagtaggtct 7141 acaggcgcag cgtagtatat cccacgttct agatcatcgc atgctttgcg ttccaatggt 7201 ttatagcaag taggaacgga atgtttttgt ggaatggctg tctcctctgt aggagaagac 7261 gcctcctcag tccacaatcc acaagtatca ttattctctt gtgcaacgcc aacacaggga 7321 cactctccca aatgatggtc aaagcgatga agtggacccc aaaaacgaaa ggtcagggca 7381 gcagtattat attcaggacg aaaaattctt aaaagttccg tacctttctt taaggtataa 7441 aattttggtt tcggcttgcc aaagcaagga ggcgggggtg ttatttttac cacatttttc 7501 actcctattc tggttgattg acagaattcg gagatttaac gttccatcag ccttgaaacg 7561 ttacaaaatc tgaagttaga gactctgtca actggaagat ccacaagcca tccatgtcat 7621 tccggaagtt ttttgtaagt tctagaaaaa tttttagtat aacttcagta attatttcaa 7681 aaatcattta tataaataaa aatacttaaa acccgggtca ctccaaacac ccgggtttct 7741 aagttagtac catttgatga atgaactcta atagatagct aataaactcc cacgcaacac 7801 gcctcagcaa taacctcatg tttttgaccg tttttcaagg cttctatagg cgtaagtccg 7861 tgtaatgctt tgttgtgtcg agtcaaccaa ctaattttcg acatagcagg aacattgagg 7921 gcattcaaca catctggtaa tccagcaatc acgccatctg gtccttgcgg atcaaattgc 7981 caagtgggaa atttccaaac tccattgtct tgaactgcaa ttaaactgtt ctttttcagg 
8041 cgatcgtgtg gtgtttgacg agttgtattg agcatttggg cgacttgagg ggcagttaac 8101 gaagacttaa gtaactcttg cctccactta aaattcctag tcagggcagc aaattctaat 8161 ttacctcgtt cttgatcact aattttattg gcaccaagtt tggcagcaaa ctcagtttca 8221 gccttggagt actttgcctt gctaggagtg ctgtccattt gttgggccaa gtttttaatt 8281 tcgtccggct gtagtgaatt cagataatcc tgaaaccccc ttgctatact tcctagagct 8341 tgttgcatta cgtctttgtc acaaggagca tctattgtca cgctaattcc tgtcataatt 8401 tcacctcgtt tgacattctc atcatctgtt gtgcgataaa ctcaaataat gtaagagatg 8461 tcaaataact acaattctag tattctatgt tcctgagtga gttagcgtat aagtaaagtt 8521 tgagaattgt atgacttagg catgacttat tgtgaaagca gttaatatac gtacaacaag 8581 tatttcttat ggggtataat gagcaggcga tcgcttcaag catctgctga gggtatgaaa 8641 aaagcgcaag ctgcgctcat ccgtaattca ctaacgcaac aggcgttagc agaggaactg 8701 agtattagcc gtcagccagt tagcaagttt tttcaaggta aaccagttga tcgctatatc 8761 tttgtcacaa tctgcgaaaa actaggattg gaatgggatg agattgcgag tttttgctca 8821 ctcccagatg tagatgatgc agcaagtact tcccagatag agagtctggt acagacagta 8881 cgagaaaaaa ttcacgacaa cattcacaga cgttgtaata taatccggct ccttgataaa 8941 gagcagccag tgaggttgga gagtatatac accgatatca atattttaga gcgagtcact 9001 ggacgcaggc ggctttccct tgccgaattg catgataatt gtgacaagaa caacatttac 9061 caaaaacggg tgtcaggact tgaagcagtg aggcactccg ttgggcgggt tccccgactt 9121 gaaggaagtg ccgtaagcgc aagcgcacgc ccagagggcg aacgcgaagc gtctccgttc 9181 ggcgcagccg tgccgcaggc tcaggagata cccgtaaggg tagaacgcta caacaagcat 9241 tggatttggg gaaaaccagg ttcgggtaaa acgacgtttt tgaaatggat cgccacccaa 9301 tgtaacttag gtcagttttt gtgcaatgct gtcccaattt ttatcaccct taaggacttt 9361 gcccagacca gagaccaacc aagcttgttg gactacatca cagcacaatt tgaagaatgt 9421 ggagttgttg aacctcaagc agttctcacg cttttgagtc gggggcgatc gctactttta 9481 ctagatggat tagatgaagt caggaaaact gagcttaacc gagtgctaca ggagattcgc 9541 aacgtttcat cgcggttttg cgccaattat atagttataa cttgtcggat tgcagctttg 9601 gaatatactc ctgaagagtt caccgaagtg gaagttgccg attttgatga ccaacaaatc 9661 gctgacttcg ccgccaaatg gtttcaaaat caagataccc tgaaagcaga acactttgtg 9721 
caaaggctgt tcgcgaatca gccgcttggt gaactcgcaa ctcatccctt gctactgact 9781 ctcttatgcc tggtgtttga agaaacagga gattttccaa ctcattgttc gcagctttat 9841 caagaaggac tggatttatt actgaaaaaa tgggatgcta agcgcggtat tgagcgaaat 9901 caggtttata aacaactttc ccgacagcac aaagaagata tcctcagtca aattgcttgg 9961 acaacttttg aacgtgggga gtacttcttc aagcaaagtg ccgttgagca agggatttcc 10021 catcatatcc aaacatttcc tgaggttagc actttttttg aaacacttca agttgacagc 10081 gaagatattt tgaaatctat agaagctcag catggtttac tggttgagcg agcaagaggt 10141 atttactcct tttcccacat catctttcag aaatactttg ctgccagaaa aattgttgct 10201 agtccaccat cacatgctga agaagctttt caaaaccttg tctaccatat gaatgagaaa 10261 cgctggcgag aagtcttctt gctagcagtg gagatgttgc caaacccaga ctacctgttg 10321 cagttgatga aacgccaact cgataatctt gtggcgggtg atgaaacgca gtatgacaat 10381 gcgaaaaagt tgttgattga ttgtcttaat aactcaagtt gctaattatg cacaaatgcg 10441 atatcctgcc ataagtttgc ccatccggca attgcccttt tgggagcatc gcccagtatg 10501 tgcccagaga gttcgttgga agcaaccctt aaaacatctt ttgaacttaa gttaattttc 10561 tttttgtaat ttaccatttt cacttatcat aagaaaaaat aacaactacc atcagaacaa 10621 ttctagaaat acgactctag ggataaatta ttgaaatatc tttaaaagtt cccaaaagtc 10681 tcatttcatt tacgagtgag caatttagct gaagtaattt accctggttg attccggttc 10741 acttagttaa gttccttttg aaaaaacttt ttgcaaatga caacagtaat agataaagat 10801 aattcatcta gaagaaaact ttggcgggct gcaattaagc ttcctcttta cagtgttgct 10861 ctcatacctc tgtgggtggg tacaactgta gcgatcgcag agactagaaa ctttaatggg 10921 acaaattttt tgatattttt attcgcatct atttgcattc aagtttgggt gaatgttagt 10981 aacgatgtgt ttgatgctga aacaggggta gatgtcaata aactacactc tcttattaat 11041 ttgacgggca aagaaacatc aaatttttgg ttgtggtttg gtaattcatt tttaattgta 11101 ggtatattaa caacatcgct attagctttt tggcaaaaag atgtaacgct tcttatctta 11161 gtattacttg catgtttgtt aggttacagt taccaagggc ctccctttcg attgggctac 11221 aaaggtatag gtgaaataat ttgcttcatc acctatggtc tattgtctgt gagtgcagct 11281 tattacaatc aaaatcctac ttggtctcta actgctttag ctgcttcagt cattgtagga 11341 ttagcaacca gcttaatttt gttctgcgcg cattttcacc aagttgaaga 
tgatttagcc 11401 ggagggaagt attcacctgt tgtacgtttg ggtaccaaaa aatctgcgcg gctattgtct 11461 tggtggggat atggaatata ttttttaatt gctgtcttcg ttttattaaa aatatttcca 11521 ctgctgagtt tgctaagttt cgtaagtctt ttttacgctc tcaaattatt tcgtcatgtc 11581 aataattatc atgaccagcc taatcaggtg agtaactgta agtttattgc tgtgtccatg 11641 tacttaagcc ttggactatt gctaggaata ggatttttac tcccgactgt ctaattacta 11701 ttaagtaagt ggacataaat aaacgtcact atagcaatcc tacatgattt gtgaaattgt 11761 tctttttgga accgcagagg acgcagagag cacagaggaa agaggaaaga gaaagcggag 11821 attctcatga tttaattagg attgctatat gtaattgcgc ttacgctttg ctcgcaacga 11881 cgcataatat ttttatccaa ctacttagca agcaactcat cagtgagaaa aataaaatgt 11941 cgcagaacac aaattactct tctgtcactc cagcgcaagc aaccaaatta gaaggagatg 12001 acaaacgcaa ttatattcgt aatctctttg acgatattgc cacccgttat gatttcttaa 12061 gaactcttgt ttttcttggg cacactagtt tatggtatcg acaagcttta cgcgatttag 12121 aattgcaacc aggtgaaaaa atactagatg ttggttgcgg tactggagag tctaccagat 12181 gtttgaatcg cttttatcct ggaatacaaa ttgagggaat ggatctttct cctggaatgc 12241 tgacggtagc tcgtagcatg gacgctgata gtaattattt tgaaggtgat gtgtgttcta 12301 ttcctcgccc tgattgcaca tatgacgtag tggtaacggc gtttactttc cgtaactttc 12361 caaatcgtga aatgtcactt gcacaaatgt taagagtcct acgtccagga ggacgtttac 12421 tcatactcga ccatttttat ccagaaaagc ctgtgctgtg gagaaatatt tatactatct 12481 ggatgagtaa gattgtaccc cagattgtac gtccttttat tgccgatacg actccatatc 12541 gctacttagc ccaaagcatt attaatcagt taaaaatgcc tgattttatc caattaatag 12601 aaagtagcgg tgcgaaagta ataaagacaa atacttatac cggaggtgca gcaggtagat 12661 taatagcggt acgttaatct taaagtatac tgcgtcaaaa gataggactc ttctttgatt 12721 tttgcgaagc taggtacacc tttatcagtt atcaagtacc cgcagtacca gttaccagtg 12781 atcaattgtt cactgtttac tgttcaccct tcgggttagc cagtcgccta cggcgggaaa 12841 cccgcctgca gcgctggact cactgttcac tgttaagcgt tccctattaa gcgtctgcgc 12901 gagcgattct ccgaatcaaa tcggattgcc atagcaggat agagtttttt gagacagaac 12961 ttcatacaag gagatgttta aacatttacc tgttttgaag tcgtaaacga ctgtctcatt 13021 gcttcaatct caatgacaat taatacttgc 
ttggagaaaa ttagatgctt aaggtttatg 13081 gctttaacga atcaggtaac tgctacaaag tcaagttatt actgaaacag ttatgtagac 13141 aatttgaatg ggtaaatatt gatattctga aaaaagagaa tcgcactcct gattttctcg 13201 ctaagaatcc tcatggaaaa gtccccttgt tagaaactga gacaggaaca tttctttggg 13261 agtcaaacgc tattctatat tatctcagtg aaggcactga ttttttacca aagaaccgac 13321 tggaacgtgc tcaaacatta caatggcttt tttttgagca gtacagccac gcgccaaatc 13381 ttggtgtagc tcgctacatc actcgctatc ttggaacacc atccgaatac cagcaaactt 13441 tgactgccaa gagagagtta ggttatgtag ctctcgatat catggagaaa catttgacaa 13501 ccaggcattt tttcctagga gaccgttaca ctatagcaga tatcgctctc tacgcctata 13561 ctcacgtagc tgatgagggc ggctttgatt tgactaacta tcggtttatt aaaacttggc 13621 ttgagttagt gagaacccag gctaatcata ttactatgaa tagtctggag taggcatttt 13681 tgtaactagt aacttgtgaa agaaaccaaa aactcaagcc aagtcgggag gaggctagct 13741 aatggaagta tagagattta tcaagtatta gatcaccata ataccattct ccggcaaccc 13801 catatttaaa ttctagctga acatttcaaa agttatctca attccgtagg agcctgaata 13861 agtagacaat tttcctgcga tgattatgca ccgtcagttg atgcattaat tgcagaaaag 13921 attgatgagt aatctccatc cgaaaacgac attttcattg ccatatctat gatgtcgcgt 13981 acaccctcaa tgctgctgag atttaaacta gtcgatttcg cctcagcgat aaacaaatct 14041 acgtctttca tcaagtgttt tgtcggaaaa ttcgcattag cataattgcc ctcaagcatc 14101 cgttgcaact ttttgtcaaa tgtcggtgcg taaagagcac tctggcgcaa aacgtacata 14161 aacaaatcca cattgatacc ttgatgctgg gcaaaaccaa gacttaaacc aaaagcagtt 14221 gtcagggaag caatgagttg atttaacgcc agtttcagcg ccgacgcgga tcctacagga 14281 ccgacaagta gaggttctgg accaaaatgt ttgaataact ccaggtggcg ttgatattgt 14341 tcttggtgtg cgcccaccat cacaattaat ttgccatctt tgacttctgg aatgctaccc 14401 aaaacaggtg cttctaaata ttcaccacca gcagcaacaa ctgcatctct gatttcttta 14461 ctttcggtag gagtaattgt ccccatttga ataatgctgc gtccggcgac agtttgccaa 14521 gacctatccg aaagcaaaac actgtatata gctccagcat tactcagcat gagaatggta 14581 caatcagcag cgttaattgc ttggtagggg cgtgtgacca cttcagcacc agctgctttg 14641 agtggttcta acttttctgg ggtgcggttg taggcgataa cctgtacgtt cgcctctaac 14701 aacctttgag 
ccatcggtag tcccatcaat ccagtcccca gaaatgctac cttcattttt 14761 tcaccttcct tggtaagcga tcgctttttc tcatcattca ttctcataat tatttcttgc 14821 aaaaatccga tttggtgggt atttggaacc acaaatctac gaattttgga cgcaaacaaa 14881 cacagagaat ttgtttgttt tgtgattata aaaattgact tttgcaagac gtctattctt 14941 tagggagtgg ggagttgggg tgagtgaagg aaatgtcttt ggttccctca ctctctcact 15001 cttattgtcc cctattctcc atcagccacc tgctgcaaaa gtcaccataa tcatcactga 15061 aagcaaaaga cctggactca gtagcagcac taacatcaaa agatcatgag aactcataat 15121 tcttactctc caaaagatat aattgcgaat gtaatcactg ctatagtaga ccaacaatca 15181 ccaacacact gttaatagag cttaattttt catatttgca tttgctctgg cacactcagt 15241 cctttcaatg gttgtgcagc ttcttgtttg gttagtcctt tcgcttgaac caagacacca 15301 aacccaccaa tacccaaagg atctaaaagc tgatgcaaaa catcccggcg atgcagcaac 15361 tgtgagatgg gttggtctgt gtaagagaga gcagcaattc gttctcctaa gcccaaagcc 15421 atcaaaaaca acgcttgctt cgtaaaacca actttatcta atccgcagcg ctcaccccat 15481 cgttccaaag tggtaaaatc aacatgagcc gtgatatctt gtctcccaat attaatatac 15541 gggtcattgt ggcgttgatg attgtagtaa cactgtaagg taccttgcga tcgcatagga 15601 ttataataac gactagcact gtaaccataa tcaatcgtca acacatatcc acgctgcaaa 15661 cggtctgcca ctaaactcaa ccaatccaat gcagctaagt taacctcact acgataacca 15721 tcagggtaag agctgatatt tatttctaca agctcaaaat attcctcaag tttggcggtt 15781 gaaggcgtat ctgcaacttc cacaaatgat agtgcaggag tagaagtaga tagggacatt 15841 tttgactgtt ccctattccc tgttccctgt tccctgttcc ctgttccctc ttcctgtgta 15901 gtgacataaa tctcctgtaa tcgtccttct tccaaaacaa attgatgcac aggtaaagca 15961 tccaccaact cgttagaaaa aaagcagcca gtaacagact gacttggaat ctcctccaaa 16021 tttaaccaac gcacagaaaa ttcttgcaag tgttgctgtt gttcttccct caaaactgaa 16081 gatttttcaa caatcacata atccagaaca tcaaaaaaat cagggtactg ctgcttgata 16141 tacttaagaa tatctaaagc caacaatcct tgacctgccc ccatttccac caaagaaaat 16201 ggcacaggtt gtcctaaaac ttcccacatt tggaaaaatt gcaccgccag taactcacca 16261 aaatcggcac caaggtgaac agaagtaaaa aaatcacctt cctttcccag ctttagcgcc 16321 tttgtggaat agtaaccagt ttccggatga tataacacca tatccatata ctcagcgaaa 
16381 gtgattcgct tttggggact tgtcgtgatg cggtgtgcga tcgctttaca tagttcttga 16441 ttggaaggca taagatttta agggaacagg gaacagggaa cagtgaacag ggaacaggga 16501 acagggaaca gtgaacaggg aacagtgaac agtgaacagg gaacagggaa caggaaacgc 16561 ttaactctta acagaacctc gtaaaatctc gcttttgcaa gagatctaat gataagaata 16621 agcgctgtaa aagtgactgg tgacgtattg caaacccaaa tacatacaga tagacgccga 16681 tagaacagaa aaacatctgc ctccatcggc gtccatctgc ggttcaaaat ttcaaatttt 16741 ccttatctat ctacagaacc cataatgatg tcaatactac cgagaatgac gacaacatcc 16801 gccagtttca cgcctttgag tagatgaggc agaatttgca ggttgttgaa atctgctgcg 16861 cgaatcttcc accgccaggg aaagacgtta tcatcaccaa tcagataaat tcccagttca 16921 cctttaccgc tttcgatacg ggagtaaatt tcacctttag gaattttgaa agtgggggaa 16981 attttcttgc tgatgtactg gtaatcaaag gcatcccact cggatttttt gcctgctgct 17041 aagcgcttag cttccaggtt ttcgtaggga ccaccgggaa gtcctttgat ggcttgtctg 17101 atgattttga cagattcacg catttcccgc atccgtacga cataacgggc aaagcagtca 17161 ccaacggttt cccactggac gtcccagtcg aaatcatcgt aacattcgta gtgatcaact 17221 ttccgtaagt cccacttcac accagaagca cgcaacattg gaccagaaag tccccagtta 17281 attgcttctt cgcgggtgat agtgccaata ccttcaacac gacgccggaa gatggggtta 17341 tcggtcacta agcgctcgta ctcatctatt ttaggcacga agtagtcgca gaagtcgaga 17401 catttgtcaa cccaaccgta gggtaagtcc accgcgacgc caccgatgcg gaagtagttg 17461 ttgttgacca tccggtaacc tgtggcggct tcccacagat cataaatcag ttcccgttca 17521 cggaattgat agaagaaggg agtttgagca ccgacgtcag ctaggaacgg accaaaccac 17581 agtaagtggt tcgcgatgcg gttcaactcc agcatgatga cgcggatgta gctggcgcgt 17641 ttgggaacgg gaatgtttgc cagtttttct ggggcgttta cagtgacggc ttcgttaaac 17701 atccccgctg cgtagtccca ccgactcacg taggggacgt acataatgtt ggtgcggttc 17761 tccgcaatct tttccattcc ccggtgcaag tagccgagta ctggttcaca atcaataaca 17821 tcctcgccat ccagagtaac aatcagtctc agcaccccgt gcattgaggg atggtgggga 17881 cccatgttca gcaccatggg ttcagtgcgg gtttctaatc ttgccataaa aagagtgttc 17941 tccggtttgg gaaaatttta aaatggaacc gcacaggcga ggcagcacgc ttcgtggggt 18001 ttccccgacg gtgcaaactg ccgtcgcaca gagtgcgaac 
taagcggtaa gttggagaga 18061 ggttggcagt tgagggtttt tgtttgatgt ttcgttttgt tgcacttatt caactattat 18121 tatatggagc atgacgtgca aggaatggag gtacaagatt tttttcatgt gcctgttctt 18181 agtcgccagg taattgacgg tttggctgtc cgtcccggtg ggcattattt ggatgcgacg 18241 gtaggcggtg gcggtcacac tcgcctgatt ttagaggctg cacccgatgt gcacgtcaca 18301 gctattgacc aggatgagga cgctttggca gcggcgcaaa agatattagg ggcgcttgga 18361 gaacgtgtaa aatttattca tagcaacttt gctaattacg agtttccact gatcaagttt 18421 accggaatca ttgctgattt gggcgttagc tctcatcatt tggatacctc ggaacggggt 18481 tttagttttc gccaccaagc aaatttggat atgcggatga acaggcgaca atctctgacg 18541 gctgctgatg tgattaatga ttgggatgag gcggaattgg caaatatttt ttttaaatac 18601 ggtgaagaaa ggttatcacg gcgcatagct agacggattg ttcaacaacg tccatttcac 18661 acgacaacag aattggcaga ggcgatcgcc tcctgtgtcc ccccaaaata ccgctacggc 18721 agaattcacc cagctacccg cgtttttcaa gcgttgcgaa ttgttgtcaa tgacgagtta 18781 aaatctttag aaactttttt aagcaaagca ccacaagccc ttattccagg tggcagaatt 18841 gcaattatta gctttcacag cttagaagac cgcatcgtca agcacagttt tagagattcc 18901 cctttattgc aggtcttgac gaaaaagccg atagaagcac aagaagaaga aattgtcaat 18961 aatccccgcg ctcgttctgc aaagttgaga atagcacaga gggttgagga agtcaggtga 19021 aaactcctga gtaatgacta atagacctct tgcaaaagtg agattttacg aggttctgtt 19081 aaacgttaag agttcagagt cccccgttcc ctgttccctg ttccctgttc cctgttccct 19141 gttccctgtt ccctgttcc // LOCUS NODE_1741_length_18846_cov_4.64328718846 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18846) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18846) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..18846
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            <1..1366
                     /locus_tag="DP116_14955"
     CDS             <1..1366
                     /locus_tag="DP116_14955"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315290.1"
                     /note="Derived by automated
computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14955" /translation="SPTLLRETADVFTGLPQGEQTALQLYRQIAPQFPNDKSLVVRQL VLETKLGQLSKSDLKRRLAAEVNPLPSDPVQLQQIALALSDVDAPDPELLPLYQTLSP RVNVPFLNFRIAQIALQQNDTNGARQALAAYTATPEGAKSLAPQLLAAEIERREGNLE ASAQRFQAVLTSRSGGDDITDGALRGLAGVRVQQKRFDEALAAYDQLIIRQPQNLTTQ LGRTSVAYQAKKISDQEAEAVLNNWLATQPATNAPPELFSLVGTLPAQPQREPLYNYL AQVDPSYLPVQLRLVQVIAKRNPAQAQARVKQLIARLPNNANTYQLQGDLARAIGDLN LAGKAYENILAQQPDNIDALAALGGIRFEQRRFDSARQIYSQVVAQKPQDKDARRALA GLSAIADQPLTALSQLEQLEIEQISQGTSDAEVSRQRQQIQEDFLQRRGFQPSWENYE RRGK" gene 1464..2735 /locus_tag="DP116_14960" CDS 1464..2735 /locus_tag="DP116_14960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129308.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl hydrolase" /protein_id="PRJNA477356:DP116_14960" /translation="MRHSVKLTVIVSMIVLNACNLGYSVIKTKVSDDRTPVETALPLN GSESENSSKYVAALPKSVPNRELLTQSWEVYRRRFIQGDGRVIDYEAGDRSTSEGQAY AMLRAVLIDDAATFAQTLNWGENNLQRQVDGKRTDNLWAWQWGRNADGKWGAIDSNFA SDGDIDAITALILASRRWNRPEYLNLAKAKLQDLWNLSTVLGHGGKLYLLPGPAAAFI PNASTLYLNPSYFAPYAFRLFAQVDPEHDWLSLVNSSYEVLEKSAPLSAVGLPSDWVA QDTKTGKYQPLPQTTSQLQTLYGFDAYRVWWRLSLDVAWFNSPQARRYLQTNSRYLQQ QWRERSRLPARIDLQGKGLVDYEATAQYAMLYAAWQFVEPQLAKELLEKKVLLQYKQG LWDDKSAYYTQNLAWLGLLSPSVVPPQLLKK" gene 2900..5344 /locus_tag="DP116_14965" CDS 2900..5344 /locus_tag="DP116_14965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198611.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cellulose biosynthesis cyclic di-GMP-binding regulatory protein BcsB" /protein_id="PRJNA477356:DP116_14965" /translation="MKPLFHPDKISTKAIIFTFCLLLFPTSLPSAQARNEEDLIHDSS QKSSYFYQQQRYSEKQQTILLAQVKKPKADSQDQEDTSEDTGEDTKDKKEAADAKNLI TYNLEFNRSPIVGNRLRLRGLNAEGRLGFNRPRGWKIGKLQALIKFQHSPSLYANRSN LTVFINDTAVGSISLNRKQSQVGQVLINNIDPKLLQDYNEIKFVAQQNSSQQCTDPRD PNLWTDILPDSKLIFGFQRQPVPLNFSRYPYPFFDQLGLETNSIVYVQPSQVNQYWLT TAARLQAAFGRFADFRPIKTSLVSDIKSVKPEQRLVIIGTPSEQPALTSLKLPLSITG TQILDINQKPVPEDTGVLIAATTKEKGGTPVLIATGNGAKGVAKAVQFLVQPDLRKMG TGSVVFVNQVKEVTAPEPRQWPRYIPEENNFRLSDIRTPLNNEGFGDVTVRGSATAPV TIDFRTLPDDRFLRGSSMNLVYSYGPQLNPRTSALEVLLDGRYIGGTRLTSDSGETRQ TLKVDLPASLLQSNSQLQVFFRMNPKEPFDKQKCLSAADQQLVGTLHADTSFELKREP SVQLPDLKLLQFGFPFAAPQDLSRTAIVLPQNPSNTDILTLLALSERLGRLSQAESIK FQVYTPESIPDSVRKNEHLVGIGTRDKFPFPEVFRSTGFNLSQAFSRVSAQGTIQTPQ DSQGLIKQIVSPWSNERIILALTAQTETGLERVRQVVEQDPLFYQLKQDTVLVGSDKN NPSPYEPDAYQLEFIRSAPSKTRVENTNLLSKTTRLLQENWLLLPIGIVGVSLLLYGI VQLYLKRLAGTERNEL" gene 5656..8253 /locus_tag="DP116_14970" CDS 5656..8253 /locus_tag="DP116_14970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129306.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cellulose synthase" /protein_id="PRJNA477356:DP116_14970" /translation="MSSSSNSPTRGRFNLSRWLIDITPRFFDRALEKVGMKQFKWLVL LLLVLSVPLIIVPLRIWQQAVIGLFLVILGQWVMQAEEQESSAEISQYYHLFMAWLSL VTTLRYLYYRTSYTLNLDGWLNSFVCLLLYAAELYAILTLALAYFQTLKIKERQAVDL SNIPQEEWFNVDIYIPTFNEDVEIVRKTALSAITCDYAPGKKTVYVLDDGRPERYQEN DPRREKFRARREQIRLMCEEIGCIHMTRDNNKHAKAGNINNAFNKTDGDLVLILDCDH IPSRQFLLNTVGFFYDPKVSFVQTPHWFYNPDPFERNLLTRGRIPVGNELFYKVLQKG NDFWNSAFFCGSAAIIRKSHALEVGGIAVETVTEDCHTALRLHSRGYKSVYYDKIMVA GLAPDTFSSYVGQQVRWARGMAQILRLENPLFNPWLKLKIPQRICYFSATSHFLYGYP RLVYAIVPTLFLLFGINPIRGLGLETLSYAVPHILLALFTNHIIYKNVRFSFWNEIFE FVMAFQAGWVTMLALINPKLGSFNVTDKGVNVTKRTFDWQSMRGLIIVTILVSASLFA VPYWLLLRPEDWQAVLVNTLWSGFNLILLIAALLVGFEQPQVRSSHRLQRRLPVVITN SNYTFRGETVNISETGALISLESWPNLPDEVEVEIMGDFTVRVSLTAQVVRVSPVSDS NTLLAIDFVSPSRAQMDNLILILYSDVREWYSQKRESVDQPVNSLGFLATSLTRSFRD LKPVTRNQVRKQVSAVGELYWDGHFFPGIATELGVTGLRLEMRSKKARGGNRLLGQED LHKMRNIKPLVGLLLSREEGNPSPSKLVAEISAVKEETDNKIVIDLNFPTEFKQRQGT KIKQLLQVL" gene 8785..9330 /locus_tag="DP116_14975" CDS 8785..9330 /locus_tag="DP116_14975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864940.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GIY-YIG nuclease family protein" /protein_id="PRJNA477356:DP116_14975" /translation="MTTETNIPSLANIEYIAYIDDNGQLPEQLQGKIGVYAIFDQEKV LQFVGYSRDVYLSLKQHLVRQPKNCYWVKAETIERPNRTILEKIEQTWIAENGTVPAG NRENKDKWTQPIDAKTAMTPEEQANYNNPANDEIAQTKIIKNVARRVEGEIIAVLESR GLQTQIRFHPKLKENGLLDLK" gene 9647..9961 /locus_tag="DP116_14980" CDS 9647..9961 /locus_tag="DP116_14980" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14980" /translation="MRNLTSNNVKFFHPLTFFLFFFFRDVNVNKKIQIIGEHHVNMDD ERGSGRFCLEMLTANQYFFAAISVLFTNHFGISALVAEFFQKMWVQTPLSDTSNMGLI SI" gene 10920..11117 /locus_tag="DP116_14985" CDS 10920..11117 /locus_tag="DP116_14985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14985" /translation="MLKRQTKTYRTALAPWAVVHWVSPTERIVMNRFRSRSDADGHLT ILQRLMPQADLRVIVDVDQDN" gene 11178..13727 /locus_tag="DP116_14990" CDS 11178..13727 /locus_tag="DP116_14990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315281.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bile acid beta-glucosidase" /protein_id="PRJNA477356:DP116_14990" /translation="MRKLPSSVIPSCTWSRSIGFGWDEPYTVRYASNIDDGSWHGMPL GGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPACQFSVFESFGSSSQACALCTEPPED GSLKTWQWYPASRKIGKEVGREENGEEGRETSVTSSSPSSPSSPQPNTGTYHALYPRS WFVYEGVFQAELTCEQFSPIWAANYQQTSYPVGVFVWTAHNPTHAPITLSIMLTWQNM VGWFTNSLKSPLVRVRDDGSPVYEYQPRLGESLDNFNQLVENNESIGLLLKRVNMSEP PQEGEGQWCIATRKQPNVEVQYHTRWNPVGTGEEIWQSFAQDGSLPNHVDDTPAAEGE QIGVAIAVRFTLQPGETLEIPFALTWDLPITEFAAGITYYRRYTDFFGISGNNAWTIA STALAEYHIWQQQIQFWQQPILERQDLPDWFKMALFNELYDLTSGGTLWSAATKRDPV GQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVIRAFARAIPSCDETPRVIGY YYTLGAESPIAVRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYR DFLLTGAEDVEFLAECWDAIVQTLDYLKRFDKDEDGIPENSGAPDQTFDDWRLVGVSA YCGGLWLAALEAAIAICDILLNSIHAEEDTSTQDSSRPWKAFVDTEKLVQQKSIYETW LAQSRPIYQEKLWNGQYYRLDSESGSDVVMADQLCGQFYARLLGLPDIVPVDCALSAL KTVYDACFLKFYDGKFGAANGVRPDGSPENPNATHPLEIWTGINFGLGAILVQMGMKD EAFRIAYAVVQQIYENGLQFRTPEAITVVGTFRASTYLRAMAIWAIYLLID" gene 15726..16595 /locus_tag="DP116_14995" CDS 15726..16595 /locus_tag="DP116_14995" 
/inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_14995" /translation="MNSIIRSNAVETLSKTDNTGRNFHKFYPYCDLVSQTAFIYGNHL TSIKQALPQAPIETRTATLTKPQIASKNTGYWSTSLQTLLDQLPLTVAHLVTLKNITF FTTVVVWAWTGKIQQISHVREKFVALDEPHKLQLQDLSKVLNTAVKEAQEFKTGQVVA ELDREPLLSEVEVQKQEIAADDQTHFNHIKDLMARTRILAQTHALAKAQQDKAGLEAS DKLEPQNAEMPIQPTERLKQRQIQQLVMKMTHLQNKKVQINLDALRKVDFSTRSKNAT SILSTAKPPATAE" gene 16739..17188 /locus_tag="DP116_15000" CDS 16739..17188 /locus_tag="DP116_15000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315277.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heterocyst differentiation protein" /protein_id="PRJNA477356:DP116_15000" /translation="MNQDMSYTRNLDKKLPQEQFDQVVEAILAGKYSWACVLMLRFAG YNPLHYIPYRTYNRLLKENSHVSRSNQQQNENLKVAKISSDKRSQSHITPASCLSKIK DRPLLEVVGKKKTEIRGGNLEQWLTTQVHEYESTNPEIAPEQVKILP" BASE COUNT 5518 a 4062 c 4020 g 5246 t ORIGIN 1 ctcgcctacc ttattgcgag aaaccgctga tgttttcact ggcttacccc aaggagagca 61 aacagcacta caactttatc gccaaatagc tccacagttc ccgaatgata aaagtttggt 121 ggtgcggcag ttagtcctag agactaaatt gggacaactc agcaaaagtg acttaaaaag 181 gcgtttggca gctgaagtta accccctacc ctcagatccg gtgcaattgc aacagatagc 241 tcttgctcta tcagacgttg atgccccaga tccagaactg ttgcctttat accaaactct 301 ttcaccaagg gtaaacgtac catttttaaa tttccggatc gctcagattg ctttacaaca 361 gaatgacaca aatggggcaa gacaagcttt ggcagcttat acagcaacac cagaaggagc 421 caaaagcctt gcaccccaac tacttgcagc agaaattgag cgacgggagg gcaacttaga 481 agccagtgct caacgtttcc aagctgtctt gacaagtagg tcaggaggtg atgatattac 541 cgatggagct ttacgtggac tggcaggtgt gagagtacaa caaaagcggt ttgatgaggc 601 tttggctgct tatgaccaat tgattatccg tcaacctcaa aatttgacga ctcaattggg 661 acgaaccagt gtcgcttatc aagccaagaa aatttctgat caagaggcgg aagctgtcct 721 
caataattgg ttagcgacac aacctgcaac gaatgcacct ccagaacttt ttagtttagt 781 cggaactttg ccagctcaac cacagcggga acctctgtat aattacttag ctcaagttga 841 cccaagctac ctgccagtgc aactgcgtct ggtgcaagtc atcgcaaaac gcaatcccgc 901 ccaagcgcaa gcgcgggtga aacagttgat tgctcgtctc ccgaacaacg cgaatacgta 961 tcagttgcag ggagatttgg cacgagcaat tggtgacttg aatttggctg gcaaggcgta 1021 cgaaaatatt ttggcacagc aaccagataa tatagatgcc cttgctgctt taggaggaat 1081 tcgttttgaa caacgacgct ttgattctgc acggcaaatt tactctcagg ttgtagcgca 1141 aaagccacaa gataaagacg cacgtcgcgc ccttgctggc ttaagtgcca ttgctgatca 1201 gcctctaaca gcgctgtcac agcttgaaca actcgaaata gagcaaattt cccaaggcac 1261 aagcgatgct gaagtttctc gtcagaggca gcaaatacaa gaagatttct tgcaacgacg 1321 tggctttcag ccctcttggg agaactatga gcgcagaggg aaataacaga gggtaagacg 1381 cacagatttg cccttatctg tatatccctt tgtttatctt gtgttcccta aatctccgtg 1441 tcccctttgt tctctgttaa agaatgcggc actcagtgaa attgacagtt attgttagca 1501 tgattgttct caatgcctgt aatttaggtt actcggtgat aaagacgaaa gtatcggatg 1561 acaggactcc ggtagaaaca gctctgccat tgaatggtag cgaaagcgaa aatagctcta 1621 agtatgtagc agctttaccg aaatcggttc ctaaccgtga gttactgact caaagctggg 1681 aagtctatcg ccggagattt attcagggtg atggtagagt gatcgattat gaagcgggcg 1741 atcgctctac aagtgaaggt caggcatatg ccatgctgcg ggcagtcctg atcgatgacg 1801 ctgcaacttt tgctcaaacc ttgaactggg gagaaaataa cctccaaaga caagtagacg 1861 gcaaacggac agacaacttg tgggcgtggc aatggggacg aaatgcagat ggaaaatggg 1921 gtgcgatcga tagtaacttt gccagtgatg gcgatattga tgccattacc gctttgattc 1981 ttgcctcaag gcgttggaat cgtcctgaat accttaattt ggcaaaagcg aaactgcaag 2041 atttgtggaa cctttccact gtactaggac acggaggaaa gctgtattta ttgcctggtc 2101 ctgcagcggc atttattccg aatgcctcaa ccctttacct caacccctct tacttcgctc 2161 cctatgcttt tcggctattt gcccaagtcg atccagagca tgattggttg agtttagtca 2221 atagcagtta tgaagttctg gaaaaatcag ccccactttc cgcagttggg ttgccgagtg 2281 attgggtggc tcaagatacc aaaacaggaa aataccaacc cttgccgcag acaacaagcc 2341 aacttcagac tttgtatggc tttgatgctt atcgagtttg gtggcgcttg tcgctggatg 2401 tggcgtggtt 
caattcaccg caagcgcgac gttatctcca gacaaatagt agatatctgc 2461 aacagcagtg gcgggagcga tcgcgtttac cagcacgcat cgatctacaa ggcaaaggat 2521 tggtggatta tgaagcgaca gcacaatacg cgatgcttta cgccgcttgg cagttcgtag 2581 aaccacaact agcaaaggag ttacttgaga aaaaagttct acttcaatac aaacaaggcc 2641 tttgggatga taaatccgct tactatacac agaatttggc ttggttaggt ctgttgtctc 2701 cttcagtagt tccgccgcaa ctactcaaga agtagcaaaa tatttcctta gttctttttt 2761 taatcattaa aaatagtgtt cccttttgag ttcaaaaaat aacaaatttt ccgttggaaa 2821 atgtcgaaga gcaaaactca cacaaatcac aaccacaaaa ccagaaattt tcctactatc 2881 tctaaagtga caccgtccta tgaagccgtt gtttcatcct gacaaaatca gtactaaagc 2941 aattattttt actttttgct tgttactctt tcctacttca ttgccaagcg ctcaagctcg 3001 taatgaagaa gatttgatac acgatagctc acaaaaatct agctattttt atcagcagca 3061 aaggtattct gagaaacagc aaactatctt gttggctcaa gtaaagaaac ccaaagccga 3121 cagccaagac caagaagaca caagcgaaga cacaggcgaa gacacaaaag acaaaaaaga 3181 ggctgctgat gccaagaatc tgataacgta caacctggaa ttcaaccgca gtccaattgt 3241 cggcaatcgt ctacgcctac gaggactgaa tgcggaaggt cgccttggct ttaaccgtcc 3301 tcgtggctgg aaaatcggaa agcttcaagc tttaattaag ttccagcact caccatcact 3361 ctatgccaac cgttctaacc tgacggtatt catcaatgac actgccgtag gcagcatatc 3421 gctcaaccgt aaacagtccc aagtcggtca ggtactcata aataatattg atcctaagct 3481 gcttcaggac tacaacgaaa tcaagtttgt tgctcagcaa aatagttcac aacaatgtac 3541 tgatcctcgt gatcctaatt tgtggacgga tattctgcca gattccaaat taatttttgg 3601 tttccaaagg caaccagttc ccctcaattt tagccgctat ccctatccgt tttttgatca 3661 actcggctta gaaactaaca gtattgttta cgtgcaacca agtcaggtca atcaatattg 3721 gctgacaaca gcagctcgtt tgcaagcggc atttggtaga tttgcagatt ttcgcccaat 3781 aaagacgagc ttggtatctg atataaagag tgtgaaaccg gaacagcggt tagtgatcat 3841 tgggactcct agcgaacagc cagctttaac ctctttgaaa ttacctcttt ctatcactgg 3901 tactcaaatt ctcgatatca atcaaaagcc cgtaccagaa gatacagggg tgctgatcgc 3961 agcgactact aaagaaaaag gtggtacacc tgtcttaatt gcgactggta atggagcaaa 4021 gggtgtggca aaggcagtac agtttttggt gcagccagac ttgcgaaaaa tgggaacagg 4081 ttcagtcgtt tttgtcaatc 
aagtcaaaga agtgacagca ccggaacctc gacaatggcc 4141 tcgttatatt ccagaagaaa ataattttag actcagcgac ataagaactc cactcaataa 4201 tgaaggtttt ggtgatgtga ctgtgcgtgg ttcagcaaca gcgcctgtta caattgattt 4261 ccgcacttta cctgatgaca gatttttgcg tggcagttcc atgaacctag tttacagcta 4321 tggtccacag ctcaacccga gaacctctgc gctcgaagtg ttactcgacg gcagatacat 4381 tggcgggaca cgtctgactt cagattctgg agaaacccgc caaaccttga aagttgattt 4441 accagcaagc ttgttgcagt ccaactctca actccaagtg ttctttcgga tgaatcccaa 4501 agaaccattt gacaagcaaa aatgtctgag tgccgcagat caacaattag taggtacact 4561 tcatgctgat acaagcttcg agttaaaacg ggaaccttct gtacaacttc cagatttaaa 4621 gttgttgcaa tttggttttc cgtttgctgc accgcaagat ttatctagaa cggcgatcgt 4681 cttgccacaa aacccatcta atacagatat actcactttg ctcgcgttga gcgaacgcct 4741 gggacggctg agtcaagcag agtctatcaa attccaagtt tatacgccag aatcaattcc 4801 tgactcagtt cgcaagaatg agcatcttgt gggaattggc acacgggaca agtttccttt 4861 tccggaagtc tttagatcca ctggctttaa cttgagtcaa gcattctctc gggtatcagc 4921 tcaaggtacg attcaaactc cacaggattc acaaggcttg attaagcaaa ttgtttcccc 4981 ctggagcaac gagcgtatca ttctggcttt aacagctcag acagaaactg gtttagagcg 5041 agtgcggcaa gtggttgagc aagacccgtt gttctaccaa ttaaaacaag atacagtgct 5101 agttggtagc gataaaaaca atccatctcc ctacgagcca gatgcctatc aactagagtt 5161 tatccgtagt gctccctcaa aaactcgtgt agaaaatacc aatttgctga gtaagacgac 5221 gcgcttgcta caagaaaatt ggctcctcct acctatcggt attgtaggtg tctcgctcct 5281 actttatgga attgttcagt tgtatctcaa gcgccttgct ggtactgaga ggaatgaatt 5341 gtaaccgctg atgcacgcag acgcacgcga acatagcaga caattatctg cacggcaggt 5401 gctaagaaag cgggaagccg cctccggcgt ctacaagtcg gggaacccga cggcagatgc 5461 tccacttggg gagccagtgc gttgcggggg ttccccccgt tgtagcacct ggcgtgaaac 5521 cccaagaccg cactgcctca acggacagat gcctcaagtc gggaaacccg cccacggcac 5581 tgtcctcccc aacgcactgc ctcgtttatc tgcggttcaa atcccaaatt taaaatctaa 5641 aatccaaaat ccaaaatgtc ttcttcatca aattccccta ctcgagggcg ttttaactta 5701 agtcgatggc tcattgatat cacgccccga ttttttgacc gtgctttaga aaaagtgggt 5761 atgaaacagt ttaagtggct ggttttgcta 
cttctggttc tctcagttcc acttattatt 5821 gtgccgctac gaatctggca gcaagcagtt attggcttgt ttctggtgat acttggtcaa 5881 tgggttatgc aagctgagga gcaagagtct tctgctgaaa ttagccagta ttatcactta 5941 tttatggcgt ggctgagttt ggtaacaacg ctgcgttatt tgtattaccg cacaagctac 6001 actcttaacc ttgatggttg gctcaatagc tttgtttgct tactgctgta cgctgctgag 6061 ctgtatgcta ttctcacctt ggcactggct tattttcaaa ctctaaaaat caaagagcgt 6121 caggcagttg acctttccaa cattcctcaa gaagaatggt tcaatgtcga tatttacatc 6181 cccaccttca acgaagatgt tgaaattgtt cgcaaaactg ctttatcagc aatcacttgc 6241 gattatgccc ctggtaaaaa gacggtttat gtcttagatg atggtcgtcc agaaagatac 6301 caagaaaacg acccgcgccg agagaagttc cgggcaagac gagaacagat acgactcatg 6361 tgtgaagaaa ttggttgtat ccacatgacg cgggacaaca acaagcatgc taaggcgggt 6421 aatatcaaca atgcctttaa caaaactgac ggtgacttag ttttgatttt agactgcgac 6481 catatcccct cgcgtcaatt tctgctgaat acagtaggct ttttctatga tccaaaggta 6541 tcgtttgtcc aaactccgca ctggttctat aatcccgacc ccttcgagcg caatttgctg 6601 actcgtggta gaatcccggt tgggaatgaa ctgttttata aggtgctgca aaaaggcaat 6661 gatttttgga attctgcctt tttctgcggt tcagcagcga taattcgcaa atcccatgcc 6721 ttggaagttg ggggaattgc agttgaaact gtgacagagg attgtcacac agcactacgg 6781 ttgcactctc gcggttacaa gtctgtttat tacgacaaaa tcatggtagc tgggttagcc 6841 ccagatacgt tctcttccta tgtcggtcaa caagtgcgct gggcaagggg tatggcgcag 6901 attctgcgat tggaaaaccc tctgtttaat ccctggctga agctgaagat tcctcaacgg 6961 atttgttatt ttagtgcgac ttcgcacttc ttgtatggat atccccgact ggtgtatgca 7021 attgttccta ccttgttttt attgtttggt atcaatccca tccgaggtct aggtttagaa 7081 actctgtcat acgccgtacc acacattctt ctggctttat tcaccaacca tatcatttac 7141 aaaaacgtcc gtttttcttt ttggaacgaa atttttgaat ttgttatggc tttccaagca 7201 gggtgggtga cgatgttggc gctgattaac cccaagcttg gttcgtttaa cgtcactgac 7261 aaaggagtga atgttaccaa acgtaccttt gactggcagt caatgcgtgg tttaataata 7321 gtaaccatac ttgttagcgc ttccctattt gccgtccctt actggttgct gcttcgtcca 7381 gaggattggc aagcagtttt agtcaacact ttgtggtcag gttttaattt gattctgctg 7441 attgcagcat tgctggttgg ctttgaacag ccgcaagtcc 
gttcttctca ccgtttgcag 7501 cgacgccttc ctgttgttat tactaatagt aactacacat ttaggggcga aacagtcaat 7561 atctcagaaa ctggggcgtt gatttcttta gaatcttggc ctaacttacc agatgaagtt 7621 gaggttgaaa tcatgggaga ttttactgtt cgcgtctccc tcacagcaca ggttgtgcga 7681 gtttctcctg taagtgattc aaacacgctt ttggcaattg actttgtcag tcctagccgc 7741 gcccaaatgg ataacttaat cctgattttg tattctgatg tgcgagagtg gtattctcag 7801 aagcgggagt ctgtagatca accagtgaat tctcttggct tccttgctac cagtttgact 7861 cgctcttttc gagacctcaa acccgtcacc cgtaaccagg tacgcaaaca ggtcagcgcc 7921 gttggtgaac tttactggga tggtcatttc ttccctggaa ttgccacaga actaggggta 7981 acaggtttac ggctggaaat gcggagtaaa aaagcgcggg gaggtaatag actgcttgga 8041 caagaagact tgcacaagat gcgaaatatc aaaccactcg ttggtttgct gttgagtcga 8101 gaggagggga acccctcacc tagtaagttg gttgctgaaa tttctgcggt gaaagaagag 8161 acagataaca agattgtcat tgatttaaat tttccgacag agttcaagca acgccaaggt 8221 actaaaatta aacagctttt gcaagtgttg tagaacaggt tattgtgaag tacctacact 8281 gcacacaaag tgtgcagtgt acgcttccgg tttcacctac cctccgggtg tgcgcggagc 8341 gcacgcctta cggcgaacgc cagtcgcctg gctgtcggga aaacgccagg tgctctactt 8401 ggggagacac gccaggtgct acaagtcggc acagccgacg gcacatgctt caagtcggga 8461 aactctagcc gagcagtgcc tccccaacgc actggctcaa cggacagttg cctcaagtcg 8521 ggaaacccgc ccacggcact gtcctcccca agaccgcact ggctccccta ccaagagcgc 8581 tggtctcacc gttgccacct ttaagtaatt ttgtggcaag aagggagtag aatatctcca 8641 atactccata ttttttcact tcttaacaaa tgattgcaaa gaaaatgttt ttgtacaatt 8701 tatagaaaag ttattattag ttatgagttt cgtcaacaac tatgactact aaacactaac 8761 taataaccgt cttgaacttt ctttatgaca acagaaacaa atattccttc tttagccaat 8821 atcgagtaca ttgcatatat tgacgataac ggtcaattac ctgaacaatt gcaaggtaaa 8881 attggagtat acgcaatttt tgaccaagaa aaagtgttgc aatttgtcgg atattctcgt 8941 gatgtttatc tcagcttaaa gcagcattta gttcgtcaac ccaaaaattg ttattgggtc 9001 aaagctgaaa caattgaacg tcctaatcgt acgattttag aaaagataga gcagacttgg 9061 attgctgaaa atggcactgt ccctgctggt aatcgagaga ataaagacaa atggacgcaa 9121 ccgatagatg ccaaaacagc catgacacct gaagaacagg caaattataa 
caatcctgca 9181 aatgatgaaa tagcgcaaac taaaatcatt aaaaatgtgg ctcgtcgggt agagggtgaa 9241 atcatagcag ttttagaatc acgtggttta caaacacaaa ttcgctttca tcctaagttg 9301 aaagaaaatg gcttgttgga tttgaaataa gaccgttgca tatttactgt aactgactgt 9361 agaggtatca ggtatagggg gtcattaccc acattttcta agatttctgg tctaatgttt 9421 ccctcgccta atttggcatt ttggcaacca tattgatttt gtgagcagta tcgtagctgt 9481 taataggtta ttgcaccaaa aagcaatcat aaagctcaaa ctcctctttt cggttaacga 9541 caagtctggt actgtacgat actaaatcag tactaaccgt tatgctttca tggtttgcat 9601 cctccccagg cttgtcacct ggggatttct ttggtcaatt taacttgtgc ggaatttaac 9661 tagtaataat gttaaatttt ttcatccact tacttttttc ctgttcttct tttttaggga 9721 tgtaaatgtg aacaagaaaa ttcagatcat tggggagcat catgtgaata tggacgatga 9781 acgcggtagc ggcagatttt gcctagagat gttgacagcg aatcagtact tctttgcagc 9841 aatttccgtc ctgtttacca atcatttcgg gatttcagca ttggttgcgg agtttttcca 9901 gaaaatgtgg gttcaaaccc ctctctctga tacgagcaat atgggtctaa tatcaattta 9961 gacagtttat tggagagttt gtttttgcag gtagttacaa aatctataat ctatcactag 10021 ccatgaagta tctttctggt tggtttcggg ttccgtttgg tttgatgcaa atatactagc 10081 ttgtcgcaag acactatctt gatagtgact cgtcggaaat gctcttaata tttagttgag 10141 aagctctaat aagggttttt aaaaaaactc ctaactcacg agccacatag ttgcgttttc 10201 aaatcgccgg acttttttcc agttagcaat agtcaataac aagaaaactt agattcaacc 10261 aatggctcta gtctttctaa actagattgc tgtttttgcc ttgagttttg aaagttgatt 10321 ctggttgctt gttcacgcac atattagcac gttcaaagac aaaattatca acacttcttt 10381 gtgtgagttg gaagataatt atcaggcaag cactaccaac tagcgctttg cataaatatt 10441 tgccttttcc ttaaatcagt accacccgtc agccacactc ggtgcatgcg tatggtgaga 10501 caagctgctg ctttgaggga aaccttaaaa gtaggcaagt ggcttagaac tgctaccaac 10561 gcataaaggt cgttacccaa gtgagaaaac tgataaccct tcgggtatgc gcaaagcgca 10621 cgccaagggc gaacgccagt cgccaagtga gggaaagccg tcattcgcgc tggctcactg 10681 ataactgtta aagtgacatg ctaatcgctt tgctcacgca agtatgaatt gcgtgaatag 10741 agggaagcct ttgtctagca ctgcgttttg ttagcgctga gtgcagaaat aatgcttgta 10801 aaaacacgag catctgctga ttacaacgac agtatctatc 
gattacaaaa gaagcagaca 10861 taaatcagtt gatgacaact gtatcaattt gtgtcttatg gatgtcatag gaggttttct 10921 tgttaaagcg acaaactaaa acataccgca cagcacttgc accgtgggct gttgttcact 10981 gggtttcacc aactgagcga attgtgatga atcgattccg cagccgcagt gatgctgatg 11041 gtcatctgac aattttgcag cggctgatgc cacaagcaga tttacgagtg attgttgacg 11101 tagaccaaga caattaaaaa atttcgacat cacttacgga tttgggaacg gatacctaaa 11161 cgctgtataa tccaatgatg agaaaactgc catcttccgt aattccctct tgcacctgga 11221 gtcgttccat cggttttggt tgggacgaac cttacacagt ccgctatgcc agcaatattg 11281 atgatggatc ttggcacggt atgcctttgg gcggctttgg tgcaggttgt attggtcgtt 11341 cgtcgcgggg agattttaat ttgtggcaca ttgacggcgg tgaacatgta ttcaaaaaca 11401 tccccgcgtg tcaattcagt gtctttgaat cttttggctc atcctcgcaa gcttgcgctt 11461 tgtgtactga acctcctgaa gacggaagcc tcaaaacatg gcagtggtat ccagcaagta 11521 ggaaaatagg gaaagaagta gggagagagg aaaacggtga ggaagggagg gaaacttctg 11581 tgacttcctc atctccctca tccccctcat cacctcaacc caacacggga acttatcacg 11641 ccctataccc tcgtagctgg tttgtttatg aaggcgtgtt tcaagcagag ttaacgtgcg 11701 agcagttctc tcctatttgg gcagcaaatt atcaacaaac aagctatcca gtgggggtgt 11761 ttgtctggac tgcacacaac cctactcatg cacctattac tctcagtatc atgctgacct 11821 ggcaaaatat ggttggctgg tttacaaatt ctctgaaatc tcctctggtt cgggtgcgcg 11881 atgatggaag cccagtttat gagtatcagc cacgtttagg cgaaagtctg gataacttta 11941 atcagctagt tgaaaacaac gaaagtattg gattgttgtt gaagcgggtt aatatgagtg 12001 aaccccctca ggagggagaa ggacagtggt gcattgctac tcgcaaacaa ccgaatgttg 12061 aagtccagta ccacacccgt tggaatccag ttgggacggg ggaagaaatc tggcaaagct 12121 tcgctcaaga tggctcttta cctaatcacg tggatgatac tccagcagca gaaggtgaac 12181 agataggagt ggcgatcgct gtacgtttca ctcttcaacc aggcgaaact ctcgaaattc 12241 ccttcgcgct gacttgggat ttgccgatca cagaatttgc agccgggatt acatattacc 12301 gcagatacac agattttttt ggtataagtg gaaacaatgc ctggacaata gcatccacgg 12361 ctctagcaga atatcacatt tggcaacaac agattcaatt ttggcaacag ccaattcttg 12421 agcgccaaga cttgccagac tggttcaaaa tggctctgtt taatgagctt tatgacctca 12481 caagcggtgg aactctctgg 
agtgcggcaa ccaagcgcga ccctgttggt cagtttgccg 12541 tgttagagtg cttagattac cgctggtatg aaagtttgga tgtgcggctt tatggttctt 12601 tcgccctgtt gatgctgttt ccagaactgg agaaggcggt gatacgtgcc tttgcacgag 12661 ctattcccag ttgtgatgaa acacctcgtg ttattggcta ctattacact cttggtgcgg 12721 aaagtccaat tgcagttcgt aaagcagcag gtgcaacacc tcacgactta ggtgcaccca 12781 acgaacacgt ctgggagaaa acaaattata caagttatca agattgcaat ctttggaaag 12841 atttaggttg tgattttgtg ttgcaagtgt accgagattt tctgctgaca ggtgctgagg 12901 atgtggaatt cctggcagaa tgttgggatg ctattgtgca aactctagac tacctcaaga 12961 ggtttgacaa agatgaagat ggtattccgg aaaattctgg tgcacctgat caaacttttg 13021 atgactggcg cttggtagga gtcagcgcgt attgtggtgg gttgtggttg gcggctttgg 13081 aggcggcgat cgccatttgc gatattttat taaactccat acacgctgaa gaagacacct 13141 ccacacagga ctcatcccgc ccttggaaag cctttgttga cacagaaaaa ttggttcaac 13201 aaaagtctat ctacgaaact tggcttgcac agtcacgccc tatttaccaa gaaaaactct 13261 ggaatgggca atattatcga ctggatagtg agagtggatc tgacgtagtc atggcagatc 13321 agttatgtgg gcaattctat gctcgtttgt tgggattgcc agatattgta cctgttgact 13381 gcgccctttc tgctttgaaa actgtttatg atgcttgctt cctcaagttc tacgatggta 13441 agttcggtgc tgctaacggt gttcgtcctg atggttcccc agagaacccc aacgcgactc 13501 atcctttgga gatttggacg ggaataaatt ttgggttggg ggcaattctt gtgcaaatgg 13561 gaatgaagga tgaagctttt aggattgcat acgctgtggt gcagcaaatt tatgagaatg 13621 ggctacaatt ccgcacacct gaagctatca ctgttgttgg tactttccgt gctagtactt 13681 atctccgggc tatggcaatt tgggcaattt atttgctcat tgattagcca tcggttacag 13741 attagagatt tttgcttgct ccctattttc tatgcaattt taacatttta tacctaacaa 13801 ttcaaatcgt ttttcatctg tcagacaatt attttttcac ctctttccgg acttgagtta 13861 ctaactccag aagtattatt taggactgct cttaaaatat gtcgattcac gcggagtttt 13921 aagggcttag gagtagggag ttactcttga acaatctaaa aactagtagc tttaaataca 13981 tgaaaaacta gtcagataga ggaatgttta cagatgacaa gtttccggct agggagtttc 14041 tttcaattca aaattgctgt aaatctttat tgaaaggctt ctcaatcttt tcgcttcaac 14101 acgaaacaac attgtccagt ttatccaaac acacagatga aacgatgaac atattttcgt 14161 
cgtaaacagt tcactgagaa cagaaattta taggtaaaca gcagttcttt ccacgcttaa 14221 gagttcaaaa gctgctgaaa ccatagcaaa tattcatttt taccaacaag caattcatct 14281 gcgcccttta atgatctagt tatggcgttg ctatcccttt attcaacaac agagttcatt 14341 agattgtggt gcagattatt tggcgatgat tagctgatca tagggtaaac attagtcgcg 14401 atcggtgctt ggtacgccac ttactacttt ttaggaagcc gccgcgagtg tgtgtttata 14461 agccctgtgg acagactacg ccaaagcgta gcgtgtggta aacagatctg aagggtttca 14521 aaaatagcaa atggtgtgtg attggtatga aatattttcc cttctacacg gctttcatac 14581 atcctactca gtactaagaa acaactgaac tgtgttactt aaaatttcaa ataaatacaa 14641 gtatttttga aatctactga gcaacaaagt gtacaagttt tcttttttga ctgaaaaatt 14701 caagaaatct acacagttac agtatgctac ttttgctttg tcaaagactt tagacagtgt 14761 ttgctggtaa ccaagacagc ctgtatcttc aaattttaaa aattatacca caaagatgtc 14821 agaaaagaag attggatcag tattgaatta ataattaatg tgccaaatcc ataacaaact 14881 catcgtagat ttgcatttgc ctctgaaaat caaaatctca ctttgggttg ctcttaagtg 14941 acataaccaa aacaagatgt ttcccggaaa ttgtaaaaaa tattctcacg aattctctct 15001 agtaatgatt gtctcaatgc tacttttgct gatatatttt taaagtgtta gaacatttac 15061 tcacatctca aatctagatt ctccttgcga agacaaaagc cttatatctg ttgtaaatga 15121 ggggtgtaac cccccaatat ctaacactta agacagaaaa cttaaagtag ataaaatttg 15181 agttattcta gaaatacaat cttcaaaatg ctgaaaatta ataattgagg agatgacatt 15241 tacggaaaaa aatagtataa atacgaatat ggggattaac gattagggat ggttattttt 15301 tttctaaaac cctaaaaagt atgcaacttt acttaaaccc tcctaattat ttggtagaaa 15361 actagaagac atttttaaca ggcatacaat gatttaattg ggagagagga aaatttgaaa 15421 aagataaaaa tgcatctact tagagccaac aggaaacttt ttcatatcag agatataaca 15481 tggagaaggt taatgaaatc ctatggcaaa ggtaagcaaa agttacagtc gataagtcac 15541 cagcaactga gtggtctgta aaaaaaaact tacaagttta ataaccctct tttgtatagt 15601 gtccgtctgt ataaattgct tttttacaaa ctagctgctg aatttcatag agacataaca 15661 tcattccggc ttagcagcaa aaagacttga atagaaagtc aggtttggga tcatcaggac 15721 agatcatgaa cagtatcatc aggtcaaatg cagtcgaaac cttaagtaaa acagataata 15781 cagggagaaa ttttcataag ttctatccct actgtgattt ggtttcgcaa 
acagcattca 15841 tttacggaaa tcatttaact tcaataaaac aggctttacc acaagcacct atagagacac 15901 gcaccgctac gcttacaaag cctcagatag caagcaagaa tacaggctat tggtctacat 15961 ccttacagac attgctcgat caactacctt taaccgttgc tcatttagtc acgttgaaga 16021 acataacttt ttttacaact gttgttgttt gggcgtggac aggaaagatt caacagatca 16081 gtcatgttcg agaaaagttt gtagcactag atgagcctca taaacttcag ctacaagatc 16141 tgagcaaagt ccttaatact gcagttaaag aagctcaaga gttcaaaaca ggtcaggttg 16201 ttgcagaatt agacagagaa cctttgcttt cggaagttga agtccaaaag caagaaatag 16261 ccgccgacga ccaaacacac ttcaaccaca tcaaagattt gatggcgaga actcgaatac 16321 ttgctcaaac tcatgcatta gccaaagcac aacaggacaa agctggctta gaagcatcag 16381 acaaactcga accacaaaac gcagaaatgc caatacagcc aacagaacga ctaaagcaaa 16441 ggcagattca gcaattggtc atgaaaatga ctcacttgca aaataaaaaa gtgcaaatca 16501 atcttgatgc acttagaaag gtcgatttca gtacccgttc aaaaaacgct acatctattc 16561 tctcaacagc aaaaccccca gccactgcag aatagttccc taccgccgta tcgctgatat 16621 tttccttaaa gcaatgcaat atctgcaaac aggcgagaac agttgatata ccttcaattg 16681 agcacaaata accgaagaaa ctgatcagct aattcacttc cacaacaaca caaacaccat 16741 gaaccaagat atgtcttaca ctcgcaactt agataaaaaa cttcctcaag aacaatttga 16801 ccaagtcgtt gaagcaattc ttgctggtaa gtattcctgg gcttgtgttc ttatgctacg 16861 ttttgcaggc tacaatcctc tacattacat tccctaccgt acctacaacc gactgctcaa 16921 agaaaattct catgtcagta ggtcaaatca acagcaaaac gaaaatctca aagttgctaa 16981 gatatcctct gacaaaaggt cccaaagtca tattacacca gctagctgcc tcagcaaaat 17041 taaagataga ccattacttg aggtcgttgg taagaaaaaa acagaaatcc gtggtggtaa 17101 tttggagcag tggttaacga cacaagtgca tgaatatgaa tccactaatc ctgaaatcgc 17161 tccagaacaa gtcaagattt taccttaaaa ttctgtgaat tcaattaccc aattgctgga 17221 attaataaat tctgaaagac actcgattga gttgtttagt atctgccatg ctgatttaat 17281 atctacagca tggcggtttt ttgagttgat aacccaagtt ggacgttatc tatgacaagt 17341 tgttaacaag tgggatacgc gttgcaaaat ggtgcctgca aaagcattaa gccagaggct 17401 tatcgcaccc ccgtcttggg cgatacgcgc aagggacgca aacaggcgta tggcagacca 17461 aaacaacatc ggttgcgcga cacgcaactg 
cattaataac ttactgagtt ttagactatg 17521 gataacttga tcagtcatta cacaatgatc gaacattgac agatttttat ttttaatttg 17581 ccattttggc agatttatac aacctatttt gtataaatta taaataacat caaaaaaatc 17641 gcttgatgct taaaattcat tctaaaggag aaaaaaatga tgataaacaa ttatttttga 17701 gtgtaaaata tgtcgaacct gttgctgaag taatcagcat tttctggagt atataactcc 17761 tacactcagg ccaacagatc caatcttgct aatggtggtt ctttttcaca aaaagtctca 17821 gcaaagagaa attggagtgg tctcttcccc caatccaaac atctacggct gctacctaat 17881 ctatgctagg ttgtagcaac gctgacagaa cttttagtgt gaaaaatgaa atagcaacac 17941 cgaactgttg tcaaggcttg taaaacttat agctttttcg cttctgtgtg aataactgtc 18001 atcctactcc tccctaaata ggactgtcac tgcgcaagtt tacttgcaat ttatgcagaa 18061 atgactggaa actgagctat tgtctcagct tccagttttt gtttgaatac agttcgattc 18121 aagctaaaaa cactgaaaca atgtagattg aggaggacac ttcagtgcga gttgtctgtt 18181 ggtccaccaa ccgtaataag ccctagagag ttctcctatg ccctatgcct ggcacaccga 18241 agcgcaccga aacaaggact gtagggctat ctcaccccaa aggggttcgg cagtcgctcc 18301 tgggggaaac cacgccaggt gcttcaagtc gggaaacccg cccaacgcac tggctcccca 18361 agaccgcgct gcctcaccgc tacgtacgta cgcatgcgtc tacaaaggaa cgaggctcaa 18421 aacttctcta aatcttcacc caaccctaga gggaagccct aaagccgcaa gatattttat 18481 aactctaaga gagtgacaaa tcggacttgt cagacaactt gttcaaaaat tcttcaaaaa 18541 ttcttcaaaa atttaaatac aggattcgta tttgattttt gttcagctag gtagagttta 18601 tgttccctgt tccctgttaa gagttctctg ttccctagct caactagtaa gtatggctac 18661 gccacgcaag caaacataaa ccaaaccgga ttcctataca tcaaatagta cttctaaatg 18721 gctgtattca tccgacttta caggaatttt aaggttaata atgtatacag gctagtaaat 18781 atattagttt ttttgtgata gatacttttc ttgtcaatac cctaaagtag taaattatat 18841 tcatac // LOCUS NODE_1756_length_18655_cov_13.44903218655 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18655) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18655) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18655 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..626) /locus_tag="DP116_15005" CDS complement(<1..626) /locus_tag="DP116_15005" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15005" /translation="MFAQQIQGIESQIQELNAKLTQYRSLEGEVETALAAIARIKESA QALGVENEVSQQIVEAVGVNQSVTQEVDVQPVVEAEQQVTQEVDVQPVVEEPPTEPPT EREELRQEHHDQVASEVQMSPEPLPQDHGQLDIFAAIESAKETTVDNRQLDSVIDSTS TEENAEEQWVNLVEDEPENGYETAEQISEALADTENLDTHLMAIASIL" gene complement(685..891) /locus_tag="DP116_15010" CDS complement(685..891) /locus_tag="DP116_15010" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15010" /translation="MTQQPKLQQLKNRAFQLAQSIGLHCTQTKHFKKRFSGLDLRSKT GWTALIGRLQVMSGGVIPVLLAAA" gene 1098..1343 /locus_tag="DP116_15015" CDS 1098..1343 /locus_tag="DP116_15015" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15015" /translation="MKLVSFKVTDEIALKLESLQGDEKSISLVAKRIVEESLENASRK IPVEEKLEAVEEKLDSKIDEVLDLLRSQLPGKSPKVV" gene 1476..2246 /locus_tag="DP116_15020" CDS 1476..2246 /locus_tag="DP116_15020" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15020" /translation="MTTIKPSDIDLTSLPWLPLEEKAAFPKRPAIYFAIDSFGTVQYI GKSVNVRHRWGSHHRYEKLKNIGNIRIAYLFVDLPELLPEIEQALIKHFHPQLNTVRF TETKLIRTERKYIGAGLHLKEKYLQDNKTRIDCNTPSFEQWLQDNISFRVEAGENSYR ARKESYTSNDYWYAVKKVDGKLHKKFIGKSDEVTCDRLKEVADVIRQPPVKTPPKAVV QPVDQISLAQKITALEAQVTAMQEQLTKLVEYQGKVLA" gene complement(2965..3372) /locus_tag="DP116_15025" CDS complement(2965..3372) /locus_tag="DP116_15025" /inference="COORDINATES: protein motif:HMM:PF05866.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15025" /translation="MLEFWIEGRAIGKARPRFGRNGVVHTARGYGAWKSDAVLQLIRL NLPEAPKPARIECYFVNFASSDVDNLVGAVLDALVQGGVLGNDSSSYITAASGTFART KGKRGEEKPIGVLVRILPAQIEYIDLDLKAFAA" gene complement(3766..5763) /locus_tag="DP116_15030" CDS complement(3766..5763) /locus_tag="DP116_15030" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15030" /translation="MQADFNEEDRERLERLLSHPESAEYLTEALSKAIVSDNKETPQP SVQVQVKEPAQNDTAINVTRHFMISGGLLAGVLLLAAAVATPLRDTFTVCFNPNPVPN TSKCLGVKSYSIDQDHIEKLVSHPALVVVPDKRGNPALAVGLALFGTAFAGGSGFIAK QALKEKTEAIPVNRTNRRTTWTATKMEADKVVKQRHQDLQHGFVVSDNTNQHALAASD TLNEFSQIQKLIAALEPQERELFFQHLALKEQKEMMQQQAQIQAQQQQSPFAALLGGV SGQQALPDSTPVNNIEQAKEMGQSIIKSMAGSIKSKVLVAPSRAGKTTVLYLYYNAML DRYPNAEVFVVADKREMIHPRIPVSNYAFCDGTSFKGVGLEMLNKVYGIYQERAKLPE SERDALAQSHPVRLALIDWLATWSTVKKDEPTADKVNSQLIGLITKGASCGVCIDIDT HSATVEALGIDASTRESLDFVAIAYCQQSDKGDLDRISDGLGLLPKVISKALIVPDKG DRTRLGTDFTKLRDALLKGEFNSSIIFTTTGGNRIGITPHFERKTLGDLTLDAEEEVE EEETIEVDEDEVTIESCLLDASRAIKLYLESREDKSASFKQIRDQRSLKKLWNRPTDL ARIAIDFLISQEILEHVQEVDPVLGTKVDVPEDRRVYRLMA" gene complement(5766..6404) /locus_tag="DP116_15035" CDS complement(5766..6404) /locus_tag="DP116_15035" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15035" /translation="MNNKDFDAFMAISGIEPPKTETKVQAPQPKASDNFFGAPSYRGE DQRDLTEDERRAESTDRSDVDDLTSKQNFSGGVSSIGTGLTITIGISSVAGTSGAGLG LGVGLVGCLIFGTEAATGKKASIVRLGAMTIGTGYAGCQLWNYYAQNSAVGEINHSVE LYMVKQPQSSISLGQIATWGIYAVVAIAIIRFLLSPKTRQQKTTSSNNPLNF" gene complement(6468..7262) /locus_tag="DP116_15040" CDS complement(6468..7262) /locus_tag="DP116_15040" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15040" /translation="MDFTEDSSQQFKAEDYKLDPTLYEIDTATPGDDPITRSFETTPA NWVSRTFFNAPGKEALVDLDRDAIKQGGDAAQRASQQAKFLKGYSKVNEKMVDDKVSA AKSILAIMQKRYDGAQSLMEVAKDLHLAGVEFKSSHQLIQAETQYASQTITAETQSKL ILARVQFLKDLAAIATTHSEKLKDKLKPEDENQYSLRAQIFRQMDAEYMKYLGALGKY GTPVQNALSAPAKPVYDTVAPKGGAVGQFNTVMRNAAGVFRRVLGL" gene complement(7315..7503) /locus_tag="DP116_15045" CDS complement(7315..7503) /locus_tag="DP116_15045" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15045" /translation="MIKYLAYLAECQLQEYYKYWEGYYPGTWQPKNLYANRCYDRHDR LIALILFLDPEYFRRDND" gene complement(7500..7991) /locus_tag="DP116_15050" CDS complement(7500..7991) /locus_tag="DP116_15050" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15050" /translation="MHPVTYFIEDSTINLLTQRFGCYLEQLDRSQIIQVRVLLTNFLM GQEMMGEHYTICDAWQDSSLHLLVQDEAVVEVADILDGLTIAQAEGLLSAFQEQCTVG NARLKTSVETTTDDLIKHGVPHELARAGAIILREVDGQRSRTPEEQEIINQVFQLTTK EAA" gene complement(8053..8274) /locus_tag="DP116_15055" CDS complement(8053..8274) /locus_tag="DP116_15055" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15055" /translation="MYALNRLSPNKKYLISNHPGEWHFLRVTGDYANFYKFAQNNTRV SIKLSLARVEQAVWEQVSLTSNLESFCGL" gene complement(8274..8744) /locus_tag="DP116_15060" CDS complement(8274..8744) /locus_tag="DP116_15060" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15060" /translation="MLTIYQNEKVAIALKLIKEVQEQVSDTDIHKSILLGDAAEACDE VLTEDLPEGYTPPKEAWYSPERIKESRSAPSMGDRIVGALTRVAASTVVCAGLSACLS LGCLGVAQFKIASKDFSQPDFTAQGRAYLGLTMIFAAATGASLVLIAVADREVN" gene complement(8914..10209) /locus_tag="DP116_15065" CDS complement(8914..10209) /locus_tag="DP116_15065" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15065" /translation="MPRKKVEQLTEDLQETSESQNADEATSDGTVEAQPPKKQSQKEP LRPFLLVADLGRSSCKSLEYFDGQEIEIDKLTSCVARLDSSPGNDYGGFTLELEEVHP TEVNEKGKPVKVKKQEHWVVGDRAQAYPNPIWMTDKSDAKVEYFHILLMGVISVTPNL DKLSTGKSAKQRTLTIDLATLSIAKPDELKKKLKSCKALTINGIKYRLNFTSNQQSFV EGHGGAIHGKNCFPNHNTLYVADLGAGTFQISEFSILNKLPSKKSKDSYHGGGGITSL KREVAKALSNGDSSNYLTRSQVSTILEKSEWTNGKVVAQDFTKSDVSNHITFAINQWL RESPAGFALEDLVNISRRSPVIFCGGGFAIASVKGIIQKQIIESGGDAKNLIFPEDLG LIAVRGVLDYLIQKPSNILQLPTIKEIATDDHDDQAATA" gene complement(10699..10986) /locus_tag="DP116_15070" CDS complement(10699..10986) /locus_tag="DP116_15070" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15070" /translation="MFITDETVTNLKQVTQFVPNLDYIDLASLLHAVADAIRQGQTLS KTLEELGVDIPGLTDGGYGLLPTFYNVLEEQELIALAGSCASHLMHTYYFR" gene complement(11056..11265) /locus_tag="DP116_15075" CDS complement(11056..11265) /locus_tag="DP116_15075" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15075" /translation="MINIANWFFEPTDNSSPVASADPGDEYSGSVISTGVNTVSQLAE KHGLETPHFVLDETGNRYNYTAKKK" gene complement(11262..11528) /locus_tag="DP116_15080" CDS complement(11262..11528) /locus_tag="DP116_15080" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15080" /translation="MFSKAQFEFLTAVQLVDDWHCSCFVDVLRQYHENDLNRYATSVR LKQEEFDELMTILSRADDEDIFQLQFALNALNLLPCSMFESATE" gene complement(11762..12853) /locus_tag="DP116_15085" CDS complement(11762..12853) /locus_tag="DP116_15085" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15085" /translation="MGSHLLYPYWFSKRYANRFKPTYQASQPKNNKVELIALMPVGRR TVKARPTKKYHERLIVVEVELEVERPKEPKEPPYCNYRQQWRRLTGTVPHASNVFKAM PDYKLPIYTDEDWEAVVKTYEFRRSLISLKPNSRRKPRVTLQSEIKPDPEAKVVITPP SHKEESIKNKTLRQLANKAERKNQSRKTEEAKAGWERVTGLPAGVSIVEELFPDFAKM SSDHAQYWRSAIIRWKQCKQPEPQTPSSPKPPLQQQPQPVISYPPKTQREMAVSDAIA LMLNGQREVHCPPVGRIDVLTDIEVIEVKDADDWKHAIGQVKVYGFHHQDKQQVIYLF GDNAQSVYSIAVDYCVRLGIELRIYQHSK" gene 13305..13466 /locus_tag="DP116_15090" CDS 13305..13466 /locus_tag="DP116_15090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015083723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ParG" /protein_id="PRJNA477356:DP116_15090" /translation="MEKEVFVRGRVPESVRARFKAACALKNKTMDSVMESLILEWLKE NENDPRPTK" gene 13444..13740 /locus_tag="DP116_15095" CDS 13444..13740 /locus_tag="DP116_15095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999657.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15095" /translation="MTQDQPSRLDRIEAILADIATQQAGNTQAIAQLAENTDRVLARS AILDDVLLELRENSEQHQREFEQHQRNFEEHQRTTNAALQSLEAILLQLIRRGS" gene 14324..14566 /locus_tag="DP116_15100" CDS 14324..14566 /locus_tag="DP116_15100" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15100" /translation="MKSIKKLPSVTKEQLKMAFSKAFERELSYSSHNSSVTRLSAIRS SLSAILKAPPPGELLLIKILINNGAIQQVLAVGGAA" gene 14563..14898 /locus_tag="DP116_15105" CDS 14563..14898 /locus_tag="DP116_15105" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15105" /translation="MSASSPFKPVNFTTRPDVLEFIIKYFDSTGCYQCANLGEADCYS IFYLAKDGGSECGWIYDKEEYIEVELGNLEELAALLQKFDHKLYCSLSSLNNRFLSIF HLSNGEVAA" gene 14895..18584 /locus_tag="DP116_15110" CDS 14895..18584 /locus_tag="DP116_15110" /inference="COORDINATES: protein motif:HMM:PF08707.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15110" /translation="MITTSRTRFRFAVNTSGKDKNWDYKLLTNRYRDKEGTLEDVQRH VANGHALLVGLLGGKSRSKANVIGSDVLLLDIDNSTPLLDADGNPVKGEDGKIIPTYS HELTLEDALAHPFIQKYCCLIYTTPSHRPDWHKFRLIFLLPEYLEGGDTVEVCIRFLM QHLPHDPACKDAGRVFYGNSKAEFPLVQPGVTLPMEWVEEARAIAQEERILHTERIKQ FEARKAEIRARNDAEGWDNDKLIQQALSFIPPRNPGSNTYDESLAVLMALVDYYGPAE AEAVAEQWSPSVRGTDWNVGRKIRSFKRGGTGIGSLFHIAKQYGFKFPAIQNYTAVST PQDIEAAEDWEEGLDRYLNAKKLEDFLGKTLKRVKKFVSQAPKHLGVKGFAEKAQPKS DDNGVISFAKGDRQKLITVLVEQAAREGRKLRILDTSETGAGKSHEWGEISNGLLGID KLFYCSPTHRNPSTSTVEKNFADLPVRHNGLLVDENKPQTALGNAHVRWAKKQIGEQS NITPNCKNAEQFHIYSTKGYHESVNSSAGINPICKKCKFALGCAGVKDEDGNYIMEPV EGYTFRLERQKVLKEYAKIRASLDSLPSITTMQEPTEEGQPGIRAGIIVDEFYAQYRS TNLTQVSLAEFDSTWVFLQSREAFILEHLAQTEELIEWKRQQLENAANEQEIKKLEQE ITDLAHNYQAVKDLVEKLPGALETVKPIIYSIRGVLSGEEIDVTQETFYGWNELQLRE VIGELPEGLQEAAKVLAAVKPRLETLLDGIEADSVSQEGIQGNSKDKKGSSPVLRYIR GQFNKEANQQKRDRIKNLPANCIVPILKTLAGERGSFRVKHGTLEIVTANDRHKEQLL AADLAVVLDATLTPQSLAQSLGIDPSEIVVICEERKSYSNLTIKHVRGLGKLTRNRSK 
ELQERLDALLTHLEEKHLGLGIIDHLATKRAGDGHWFNDSRGTNVYQHKEALITIGSP YQDIGSLAMVYATTTGDKNTTKGNPKFDAYVQELKEVEVIQAGGRLRAHRRPDERLIW YIVTDEDINYLKGAFPGATFEEVSAFSITPDAGSEGERTLGTVFAAIAKLAQEDGRET ITQAEVAHTAGLAQSTISKAINRDELVAKIGGWRGFRKLFQALLDAPRGWNNLSKKLQ EEFGEDALFAIETYLPLVLQDNDQNVENLASECTNLAQTVGWGAIQAFLHLMPVESRV NMLSFLVGLLSDDWQEEFLHLMPIA" BASE COUNT 4933 a 4160 c 4317 g 5245 t ORIGIN 1 tcgaggatgg aggcgatcgc cattagatga gtatccaaat tttctgtatc tgccaacgcc 61 tcagaaatct gctctgcggt ttcgtagccg ttttctggct catcctctac caggtttacc 121 cattgctctt cggcattctc ctccgtgctg gtagagtcaa taacagaatc taattgccga 181 ttatccactg tggtttcttt tgcggactca atggcagcga atatatctag ttgcccatga 241 tcttgtggca atggctccgg actcatttgc acttcggaag caacctggtc atgatgctcc 301 tggcgcagtt cttcccgctc agttggtggt tcagttggtg gttcctcaac aactggttga 361 acatccactt cttgggtaac ttgttgctca gcttcaacga ctggttgaac atcgacttct 421 tgggtaactg attggttaac ccctacagct tcgacaatct gctgagacac ttcattctcg 481 acgcctagcg cctgggctga ttctttaatg cgtgcgatcg ccgcaagtgc cgtttccacc 541 tcgccttcta agctgcggta ctgggtcaac ttagcgttga gttcttggat ttgagattcg 601 ataccttgga tctgttgtgc gaacatggtt ttatcctttt tacttaattg tgatccgcgt 661 ctttattggc gcggattttg tgttttacgc agccgccaaa agtacaggaa tgacgccacc 721 agacataacc tgcaaccgac caatcaaagc cgtccaacct gttttggagc gtaagtctag 781 tccggaaaac cgtttcttga agtgcttggt ctgggtacag tgcaacccga tggactgagc 841 tagttggaac gcgcggttct tgagttgttg tagttttggc tgttgggtca cgtcctcacc 901 tccggttgac ctggttaact aacttaatag tagaccaggt taacctaaac agtcaacctg 961 gtctactatt aaaaccatag gttttttgca ggattctgaa gataccatag aaaagcaggt 1021 tgactcggtt aactcaggta agttgacgtg gttaaccttc ttttgtaaga tgaagggtaa 1081 gttaaccagg tcaacccatg aaattagttt ctttcaaagt gactgatgaa atcgctctaa 1141 agcttgagag cttgcagggt gatgagaagt ccatttcttt agtcgctaaa cgtattgtcg 1201 aagaatcctt ggagaatgca tcacgaaaaa ttccagtgga agaaaagctg gaagctgttg 1261 aagaaaaatt ggattccaag atagatgaag tattggactt gttgaggtca caactgccgg 1321 gaaagtcccc caaggttgtt taaccatgtc tcagcttgat gtggcgatcg cgcaaattta 1381 taagcgatgt 
actccaacgc aaaagaagct tttagcgcgt tggactaatg aaataaaaga 1441 aactttgcta ttgagtcaat ttagccaatt ccgcaatgac aacgattaaa ccatcagaca 1501 tagacttaac aagtcttccc tggttaccac ttgaagaaaa agctgctttc ccaaaacgtc 1561 ctgcgatcta ctttgcgata gattcttttg gtactgttca atacattgga aagtctgtaa 1621 atgttcgcca tagatggggg agccatcaca ggtatgaaaa actaaaaaac attggtaaca 1681 tcaggatagc gtacttgttt gtagacttgc ctgagttact gccagagata gagcaagcgc 1741 taattaagca ctttcaccct caactgaaca cagtaagatt tacagaaaca aaactgattc 1801 gcaccgaaag aaaatatatt ggtgcgggtc ttcatcttaa agagaaatac ttacaagaca 1861 acaaaacccg catagattgc aacacaccga gttttgaaca gtggctgcag gataatattt 1921 ctttccgggt agaagcagga gaaaacagct accgagccag gaaagagagt tatacatcaa 1981 acgattattg gtatgccgta aagaaggttg atgggaagtt gcacaaaaag tttattggca 2041 agagtgatga agtgacatgc gatcgcctaa aagaagttgc agatgttatt aggcaaccgc 2101 ccgttaaaac gccacccaag gcggttgtac aaccagttga ccaaataagc ctcgcacaga 2161 aaataacagc tttagaagct caagtaaccg cgatgcaaga gcagttaact aagttggttg 2221 aataccaggg aaaagtacta gcctgattgc agaattaaaa gcagtcaggc ataaatacga 2281 acagacattt gtgattgtag aagaggagcg cgatcgccac ctcaaaaagt ttcgtcctgc 2341 caaaagcgcc gactcaaaag cactgtatga aaagcggctt ttagcttttg tacaggagct 2401 acttgatcgg ctgcgctcta tttgacatct ccctgaccta aaggtacagg gattctgaat 2461 ggatactacg cagcagggta aacccatgcc acttcgtatt attagctgtc acccagatta 2521 aagttctggg tgggacgtgt caaattcgcc cctattcact cgatcaagct aaaagcttaa 2581 cgttgcgtgt acttttcttc gttacgttca cgagtccgca cttgccagaa tggaacactc 2641 cttgacctgc cattgtttgc agtgataggc ttgctgcatt gcagtagtga cttatatgcc 2701 ggggtttcac cgtactaggt catttcgtcg tgtttgcggt tattcttact agataacatg 2761 cattagtata caacacgttt gtgatcaatt ccggaaagat ccacggcagg ctaaaccctg 2821 tctcgtgggt ttcatccctt gcctaaagta caaagatact ttcctgagtc aagtatcttt 2881 tcaagggttt tcacctcacc ccttataaga ttttttaggc ggtttcagga acccgttgta 2941 ccacgtgcag cgggttttct tcgttcaagc cgcaaaagct ttaaggtcta gatcaatata 3001 ttcaatttgc gctggcagta tccgaaccag aacaccaatt ggcttttctt ccccgcgctt 3061 ccccttagtt cgggcgaagg 
tcccgctcgc cgcagttata tatgatgatg agtcgtttcc 3121 caaaactcca ccttgtacca acgcatccag cacggcacca accaggttat ccacatcaga 3181 gctagcaaaa ttcacaaaat agcattcaat ccgcgccggc ttgggcgctt ctggcaagtt 3241 taaccggatc agttgtagaa cagcatcact cttccacgct ccataaccgc gtgcggtatg 3301 cacaacaccg tttctcccaa aacggggtct tgctttgcca attgcgcgac cctcaatcca 3361 aaattcgagc ataattccgc gtgcgagtag caaccgaacc ggaatgtacg ttaaggactg 3421 attcctgtga taacaataaa atttctacct gactcaatgc ctcattttta ccgtttattc 3481 aggcaatgag tgtatttttc ctgtgataac aatctttatc acaggaatag acacttttaa 3541 gggcaacggt tcaaagtcag tgacagtaag catcttgggc ttaacgtaca ttcctgttcc 3601 aatgctactc acgggcgaat tcaaagtaat tgacgttcaa taacctgttc aagcaagtgt 3661 acactctaca acaacaaaag cgcccacaac gtaagctgcg agcgcgttaa ggttgttttt 3721 tagatctgct tcactgtttg taaaaatttt tataaacagt gttttctaag ccatcaggcg 3781 gtaaactctt ctatcttctg gtacgtctac tttcgtccct agcactggat ctacctcttg 3841 tacgtgttct aaaatttctt gtgaaataag aaaatcaatt gcaattcgcg ccaggtctgt 3901 gggtctattc cacagtttct tcaaggaacg ttgatcgcga atctgtttaa agctagcgct 3961 tttatcttct ctggattcta ggtagagctt aatagccctg gaagcgtcca ggaggcagga 4021 ttcaatagta acctcatctt cgtcaacttc tatcgtttcc tcctcctcta cttcttcctc 4081 agcatcaaga gttaagtcgc ctagggtctt acgttcaaag tgaggtgtga tgccgatgcg 4141 gttgccgcca gtggtcgtga agatgatgga ggaattgaac tcacctttga gtagagcatc 4201 acgcaacttg gtaaaatcgg ttcctaatcg ggtgcgatcg cctttatcag gcacaatcaa 4261 agctttggat attactttgg gtaatagtcc cagtccatca gagatgcggt ctaagtcccc 4321 tttgtcggat tgctgacagt aggcgatggc aacgaagtcc agcgattcac gagtagaggc 4381 atcaatgccc aaagcttcta ctgtggctga gtgagtgtcg atgtcaatgc aaacgccgca 4441 gctcgcacct ttggtgataa gtccaatcaa ttgactattc actttatcgg cggtcggctc 4501 atcttttttg actgttgacc aggttgccaa ccaatcaatt aaggcaagtc gcaccggatg 4561 actttgagct agagcgtctc gttctgattc tggcagcttg gcgcgttctt ggtagatacc 4621 gtacaccttg ttgagcattt ctagccccac acccttaaag ctggttccgt cgcaaaaggc 4681 atagttagaa acaggaattc tggggtgaat catttcgcgc ttgtcagcaa ccacgaacac 4741 ctcagcattt gggtaacgat caagcatggc 
attgtagtac agatatagta ctgttgtctt 4801 accagcccgt gaaggagcaa ccaagacttt tgacttaata gatccagcca ttgatttgat 4861 gatagattgc cccatctctt tggcttgctc aatgttattt actggtgtgc tatccggtaa 4921 agcttgttgt ccagaaacac caccaagtaa tgcagcaaac gggctttgtt gctgttgtgc 4981 ttgaatctgg gcttgttgct gcatcatttc cttttgctct ttgagcgcca ggtgttgaaa 5041 gaataattca cgctcttgtg gttctaatgc tgcaatcagc ttttggatct gagaaaattc 5101 gttaagggtg tcgctggctg ctaaagcatg ttggttggtg ttatcagaaa caacaaaccc 5161 gtgctgcaag tcctggtgac gttgcttaac caccttatcc gcttccatct tggtagccgt 5221 ccatgttgtt cttcgattag tgcggtttac tggtattgcc tcagtttttt ctttcaatgc 5281 ctgtttggca atgaatccag atccaccagc aaatgctgta ccaaacagtg ctaaccctac 5341 agccaaagca ggattacctc ttttatcagg cacaaccacc aaagctgggt gagaaaccag 5401 cttttcaata tggtcttggt caatagagta actttttact cctaagcact ttgaggtgtt 5461 aggaactggg ttcgggttaa aacagacagt aaaagtatct ctcagaggag tggcaacagc 5521 agcagcgagc agtagtacac ctgctaacag cccaccacta atcataaagt gacgggtaac 5581 gttgatagct gtgtcattct gggctggttc ttttacttgt acttgaaccg agggctgtgg 5641 tgtttccttg ttatcagaaa cgattgcttt acttagtgct tcggttaagt actccgcgct 5701 ttcagggtga gacagtaagc gttctaatct ttcccggtct tcttcgttaa aatcagcttg 5761 cataattaaa agttgagtgg attgttggag gaagttgttt tttgttgcct ggttttggga 5821 gaaagaagaa atctgattat ggcgatcgca acaaccgcat aaattcccca agttgctatc 5881 tgaccaagtg aaatactgct ttgaggttgc ttcaccatgt acaactccac gctgtggtta 5941 atttcgccga ctgcactgtt ctgggcgtaa tagttccaaa gctggcaacc cgcgtaaccc 6001 gtaccaatgg tcatagctcc cagtcgcact atgcttgctt tctttccggt tgctgcttct 6061 gtaccaaaaa taaggcaacc gactaaacca actcctagcc ctaaaccagc accacttgta 6121 ccagcaacgg aactaattcc aattgtgatg gtaagtccag tcccgataga ggaaactcca 6181 ccagagaagt tttgtttact ggttaggtca tctacatcac tacgatcagt tgactctgcc 6241 cgtcgctcat cttccgttaa gtcccgctga tcctcgcctc ggtagctggg agcaccgaaa 6301 aagttatcac ttgcttttgg ctggggtgct tgtacttttg tttcggtttt gggtggttct 6361 atacccgaaa tcgccataaa tgcgtcaaaa tctttgttgt tcattgcttt tatctggtgg 6421 tgaatatcgg tgcagtagtg ggcaaaactg actgcaccga 
atcaaaatta aagtcctaat 6481 actctacgaa aaacaccagc tgcgttacgc ataacagtgt tgaactgacc gaccgcacct 6541 ccctttggtg cgaccgtgtc atacacgggc ttagctgggg cagaaagtgc gttttgtaca 6601 ggagtgccgt acttgcctag cgcacccaag tacttcatgt attcagcgtc catctgcctg 6661 aagatttgag cgcgtaagct gtactggttc tcgtcttcag gctttaactt gtccttgagt 6721 ttttcagaat gagtagtggc gatcgccgcc aggtctttga gaaactgaac ccgtgccaga 6781 atcaacttgc tttgagtttc ggcggtaatt gtctgagaag cgtattgtgt ttcagcttgg 6841 atcaactgat gggaggactt gaactcaacc cctgccaggt gcaagtcctt ggcaacttcc 6901 atcagggatt gagcgccgtc atagcgcttt tgcataatgg caaggatgga cttggcggct 6961 gacaccttat catccaccat cttctcgttt actttcgagt aacctttaag gaacttagct 7021 tgttggctgg cacgttgagc agcatcacca ccttgcttaa tggcgtcgcg gtctaagtcc 7081 accagggctt ctttgccggg agcattgaaa aaggtacgac ttacccaatt agctggtgta 7141 gtttcaaagc tacgagttat tgggtcatca ccaggggtcg cggtgtcaat ctcatacagt 7201 gtggggtcaa gcttgtaatc ttcagctttg aattgttggc ttgaatcttc ggtaaagtcc 7261 atgatttctc tccttttttg gtaactgggt tagtcacctc cccagcgttt ttgtttaatc 7321 gttgtcgcgt ctgaagtatt cagggtcaag gaaaagaatc aaagctatca ggcggtcgtg 7381 gcggtcataa caccgattgg cataaaggtt ttttggttgc caagtaccgg ggtagtagcc 7441 ttcccaatat ttgtaatact cctgtaactg gcattcggct aggtacgcaa gatatttaat 7501 catgcagcct ccttggtggt gagttgaaaa acttggttga tgatttcctg ttcctctggt 7561 gtgcgtgagc gttgaccgtc aacttctcga agaatgattg caccagccct ggctaattca 7621 tgtggcacac cgtgctttat caagtcgtcc gtcgttgtct ctaccgaagt tttcagcctc 7681 gcattgccta cggtgcactg ttcttgaaaa gcggacaata acccttctgc ctgggcgata 7741 gtcagcccat ccaaaatgtc agctacctct accaccgcct catcttggac aagaagatgt 7801 aacgaggaat cttgccaagc gtcacagatg gtgtagtgct cgcccatcat ctcttgaccc 7861 atcagaaaat tagtcagaag aacccgcact tggattattt gagagcggtc taactgttct 7921 agatagcagc caaaacgctg cgttaacagg ttaatggtcg aatcttcgat aaagtaagta 7981 actggatgca tattctctcc aaaaagttta gtacgttatg ttttaggggc tgtttttaca 8041 gcccccgttt ttttacaaac cacaaaagct ttcaaggttg gaggtcaaag acacttgttc 8101 ccagaccgcc tgttcgacac gtgcaaggga tagcttgata cttacccgcg 
tgttgttctg 8161 agcaaacttg taaaagtttg cataatcacc tgtgactcgt agaaagtgcc attcaccggg 8221 atggttgctg atcaagtatt ttttgttggg tgaaagtctg ttgagcgcgt acattagttt 8281 acctccctat cagcgaccgc aattaaaaca agactcgcgc cagtagcagc ggcgaaaatc 8341 atcgtcagtc cgagataagc gcgtccttgt gctgtgaaat ctggttgaga gaagtctttg 8401 ctggcaattt tgaactgagc cacgcctaaa catccaagac tcagacaagc acttaaaccc 8461 gcgcaaacca cagttgatgc cgctacgcga gtaagcgctc ctacaatgcg atcgcccatt 8521 gatggcgcgc ttcgcgactc ctttattctt tcagggctat accatgcttc ctttggtggc 8581 gtgtatcctt ctggcaaatc ttcagtgagc acctcatcac acgcttcggc tgcatcaccg 8641 agaagtattg atttgtggat gtcggtatca ctaacctgct cttgaacttc tttgatgagt 8701 ttgagcgcaa tggctacttt ttcgttttgg taaattgtta gcattacttc actgataatt 8761 gtttgtgagc cggattgtgt ccggctttcc tgtgtttaac gagattcgac aacagccttc 8821 tcaatcactt ttcttagcca gtagttaggc tcaattcctt gctggctgca aatattaaga 8881 atgtcgccca caagcgcttg tgggacattg tagttaagcc gtggcggctt ggtcgtcgtg 8941 gtcatctgtt gctatctctt taatggttgg taactgcaaa atgtttgacg gtttttgaat 9001 caagtaatca agcactcccc tcactgcgat aagtcccaaa tcttcaggga agatgaggtt 9061 tttagcatct cctcccgatt caatgatttg tttttggatg atgcctttga ctgaggcgat 9121 cgcaaaccct ccaccacaaa aaatcactgg tgaacgacgc gagatattga ccaggtcctc 9181 tagtgcaaaa cctgctggac tttctctaag ccactgattg atggcaaagg tgatatggtt 9241 gctcacatcg gattttgtga agtcttgggc gaccaccttg ccgttggtcc attcactctt 9301 ttctaggatg gtggacacct gggaacgggt gaggtagttg ctagaatctc cgttcgatag 9361 ggctttagca acttcgcgct tgagtgaagt gataccaccg ccgccgtgat agctatcctt 9421 ggatttctta ctgggtaact tattcaagat actgaattca gaaatctgga atgttcccgc 9481 acccaaatcc gcaacatata gggtgttgtg gtttgggaag cagtttttac cgtgtattgc 9541 accgccatga ccctccacaa aggattgttg gttactggtg aagttgaggc ggtatttgat 9601 gccattaatg gtgagcgctt tgcagctttt cagtttcttc ttgagttcat ccggcttagc 9661 aatgctcagt gttgctaggt caatggttag cgtgcgttgc ttcgctgatt tcccggtaga 9721 aagcttatcc aaatttgggg taaccgagat aacccccatc aggagaatgt ggaagtactc 9781 caccttcgca tcactcttgt cggtcatcca aataggattg ggataggctt gtgctctatc 
9841 tcctaccacc cagtgctctt gcttcttaac tttgactggc ttacccttct cgtttacctc 9901 ggtaggatga acctcctcca gttccagggt aaagccacca taatcattgc caggagagga 9961 gtctagtcga gccacgcaag aagtaagctt gtctatctct atttcctgtc catcaaagta 10021 ctccagggat ttacaagaac ttcgtcccaa gtcagcaacc agtaaaaagg gtcttagtgg 10081 ctccttctga ctctgctttt tgggcggttg agcttccaca gtgccgtcag atgtcgcctc 10141 atcggcgttt tgactttctg aggtttcttg caaatcttcg gtaagttgtt caaccttttt 10201 tctgggcatt tttagttctc cttaattaag ggtgaaaaat cgtcgtcgta actggcacac 10261 actaaaggta gtacgtgcgt gttgtattat ttgctttgag taattccgta caaaaacgct 10321 gtttgaacgg tgcttttttg tttgcgacaa tcgggcagtg ctcgcgacag gcttgcgaca 10381 tggcttgcga caatcgggcg atcctcgcga cactttttat gagcgctcag ggcagccgcg 10441 acaggcttgc gaccgttaaa caaaatcccg cgacaaaatc ccgcgacagc cgcgacaggc 10501 ttgcgacaaa ggctgcgaca atttacccct gttcgcgacg ccaaaagtgt cgcggggtaa 10561 aatgcccctt tttagtacct cctcgcgctg cggcagccca aaaacgggca acgggacaac 10621 tatttttatg cgtatacttt cgcagagtgt acggagatga taggtactgc aaccatatga 10681 cagttacgac atagttattt atcgaaaata gtatgtgtgc atcaagtgac ttgcgcaaga 10741 accagccaag gctattagtt cctgttcttc caatacgtta taaaacgtcg gtaacaatcc 10801 atatccgcca tccgtaagcc ccggtatatc cacacctagc tcttctaaag tcttggaaag 10861 agtttgacct tggcgaatag catcagctac tgcatgaagt agagatgcaa ggtcgatgta 10921 atcaaggttg ggtacaaatt gtgtcacttg tttaagattt gtgacggttt catcggttat 10981 gaacatggaa ctgtccttaa gggtacgaag tgttttgcgt ggttgtgttg tagcgtttta 11041 tagtgactga gatgtctact tctttttggc tgtgtagttg tagcgattcc cggtttcgtc 11101 caaaacgaag tgtggggttt ccaaaccgtg tttctctgca agttgcgaaa ccgtattaac 11161 tcctgtggag ataacgctac ctgagtactc atcgccgggg tcggcgctgg caactggaga 11221 ggaattgtcg gtaggttcaa aaaaccagtt tgctatgttt atcattcggt tgcgctctca 11281 aacatggagc agggcagtaa gttgagagca ttcaaagcaa actggagttg aaaaatatct 11341 tcatcatctg cgcgtgaaag tatggtcata agctcgtcga actcttcttg tttgagtcgc 11401 actgaggttg catatcggtt gagatcgttt tcgtggtact ggcgtagcac atctacgaag 11461 caagagcagt gccagtcatc gacaagctga acagccgtca 
aaaactcaaa ttgcgcctta 11521 ctgaacatga tttttactct ccatcttctt ggctagtgct tgcatgatca agctttgcag 11581 gtgaaacagg gcatcggctg aaagagtggc aagcgtttcc acgtccaaac tctcaaaagc 11641 gtcaacaccg aagaattctt ggagttcaat gaaattttgg tagtctttcg acataaaatc 11701 ctcactggtt ggtaactgcc gttggtggtg gcagatgcca accgaggggc tagctgaaaa 11761 actattttga gtgttggtaa attcttaact caatgcccag tctgacgcag tagtcaaccg 11821 caatactgta aactgattgt gcattgtcgc caaacaaata gataacttgc tgtttgtctt 11881 gatgatggaa cccatagact tttacctgtc cgatggcatg tttccagtca tcggcgtctt 11941 ttacttcaat cacttcaata tcagttagta cgtctatcct cccaactggg gggcagtgta 12001 cttcacgttg cccgttgagc attagtgcga tcgcatcgct caccgccatt tcgcgttgtg 12061 ttttgggcgg gtaacttatg actggctgtg gttgttgttg tagaggtggt tttggtgatg 12121 atggtgtctg gggttcgggt tgcttacatt gtttccatcg tatgattgca gacctccagt 12181 attgcgcgtg gtcagaactc atctttgcaa aatctggaaa aagttcttca actatagaaa 12241 cccccgcagg taatccagtt actctctccc agcctgcttt agcttcttca gttttacgag 12301 attgattttt tctttccgct ttattcgcta gttgtcgcag cgttttattt tttatggatt 12361 cttctttgtg ggacggtggt gtaatgacta ccttagcctc tggatctggt tttatttccg 12421 attgtaaagt tactcgtggc ttccgccgtg agtttggctt cagagatata agtgaacgac 12481 gaaattcata agttttcaca acggcttccc aatcttcatc tgtatatatt ggcagcttgt 12541 aatctggcat tgctttaaaa acattggaag catgaggtac tgtaccagtc agccgccgcc 12601 actgctgcct gtaattgcaa tatggtggct ctttcggttc ttttggtctt tctacttcaa 12661 gttctacttc taccactatt aacctttcgt gatatttttt tgttggtcgt gcttttactg 12721 ttctgcgacc aacaggcatc aatgctatta actctacttt gttattttta ggttggcttg 12781 cttggtatgt aggtttaaac ctatttgcat acctcttact gaaccagtaa ggataaagta 12841 agtgtgatcc cattatttat accttttaat tagctaatta gaaaacactt attactttcg 12901 tgttctctag ataacctcaa aaaggcaact ttgttttacg gagtcactaa caagattcct 12961 ctcgtattag cgcaataccg ccacttttaa aaaaagtgct gggggcgaac tatgcagagg 13021 tgcataatat tcccatgctt tacaaagctt gattgaaact cctgcaagcc gcagttcact 13081 ttggtgtacc cgttgcctgc tttgtttctt tcagagacat gcgaagctca cccacacagt 13141 gctggagttc tccccctggc 
tgcgtttggg cttgttttct gtctatgaat tcaatatagc 13201 tcacaatagt catagtgaac aagtgaacgt tatttaattc actaatacaa tgtgctgtac 13261 gatatagagg atactaatac acaagtaaac aaatgtactc ttaaatggaa aaagaagtat 13321 ttgttagggg tagagtccca gaatcagtac gcgcacgctt taaagcagct tgcgcactaa 13381 aaaataaaac tatggatagt gttatggaaa gtttaattct tgaatggcta aaggaaaatg 13441 aaaatgaccc aagaccaacc aagtagactt gaccggatag aagcgattct ggctgacatt 13501 gccactcagc aagctggcaa cactcaggcg atcgcccaat tagccgaaaa taccgaccgc 13561 gttttggctc gcagtgcaat tttggatgac gtgcttttgg aactgcgcga gaattccgag 13621 caacaccagc gggaatttga acaacatcag cgaaactttg aagaacatca aagaacgacc 13681 aatgctgctt tgcaaagctt agaagcaata ttgctgcaac ttatcaggcg aggtagttag 13741 tcgtatcatc agttctcctg atggatctct ggataacttg cgagagcaac tattcaagtt 13801 agggaaagga attatgagca aatgtactag cgtggattta caaaacttcg catcagagtg 13861 caggttagtc aatcaccgga aaaataagat gccactagtt aaaccagaca gaaaacttct 13921 tcaacaaata gcggcttaca ttggcagctt gctctcggca agggcgatac gcggagcgtc 13981 aagccaagga cttatcgcca acgtcaaagc cgcttaccta tttttttagg caagcctgag 14041 atggaggaaa acgctgtgta atcagcgttt ttcaaccgtt ttttgataga aatagattat 14101 ttttgtattc gcgctttcgg cgaaacttgt tacatactgg gacagcgtta aaaaaaaagg 14161 tttgcttttt aatgcgcaaa cactcttttc ctcccctttt atcccaagcg aaagatgccg 14221 tgtccgtaca ctcaaaagca acccaaaatt cttaaagttt tcaatttctc caatttacgg 14281 catttttagc aggcgcgcta ccgccttcgg tggcagaaaa cgagtgaaaa gcattaaaaa 14341 gctgccctct gtaaccaagg agcagctaaa aatggctttt tcaaaagctt tcgagagaga 14401 actttcatat tcttctcata attcaagtgt aacacggctg agcgctatta ggtcaagcct 14461 ttcggcaatt ctcaaagcac cgccgccagg ggaattgctg ttaattaaaa ttttaattaa 14521 caacggcgct atacaacagg ttttagcggt aggaggtgca gcatgagtgc atcttctcca 14581 ttcaagccag taaattttac gactcgtcct gatgttttag aatttatcat caaatacttt 14641 gacagcactg ggtgttacca gtgtgctaat cttggcgagg ctgactgcta cagcattttt 14701 taccttgcaa aagatggagg tagcgaatgt ggatggatat acgacaaaga ggaatatata 14761 gaggttgagc ttggaaacct agaggaatta gcagcactac ttcaaaaatt cgaccacaag 14821 
ctatactgct cgctttcttc attaaataat cgctttcttt ccatttttca tctaagcaat 14881 ggtgaggtgg cagcatgatt accacttcgc gaacacgttt tagatttgca gttaatacca 14941 gcgggaaaga caaaaattgg gattataagc tcttaaccaa tcgttaccgc gataaagaag 15001 ggacgctaga agatgtacag cgccatgtcg cgaatggtca cgctctactt gttgggttgt 15061 tgggtggtaa gtcgcgcagt aaagccaatg ttattggttc cgatgtactt ctactagata 15121 ttgataactc cacaccgttg ttagatgcgg atggcaaccc cgtaaaagga gaagatggga 15181 agattatccc aacatacagc cacgagttga cgcttgaaga tgcacttgct cacccattca 15241 ttcaaaaata ttgctgcctc atttacacca ctccgagtca cagacctgac tggcacaagt 15301 ttaggctgat atttttattg cctgaatatt tggaaggtgg ggatacggta gaggtgtgca 15361 tacgttttct gatgcagcac ctaccgcatg atccagcatg taaagacgct ggacgtgtgt 15421 tttatgggaa ctctaaagct gaattcccac tggtgcaacc tggtgttact ttgccgatgg 15481 aatgggtgga ggaagcacgg gcgatcgcgc aggaagaaag aattcttcat actgagcgca 15541 ttaagcaatt tgaggcacgt aaagcagaaa tccgggcaag gaatgatgcc gaaggatggg 15601 ataacgacaa gttaatccaa caggcactca gcttcattcc accgcgtaat cctggctcaa 15661 acacctatga tgaatccctc gcggtgctaa tggcactggt ggattactat ggaccagcgg 15721 aagcagaagc tgttgctgag cagtggtcgc cttctgtccg tgggacggat tggaatgtag 15781 gcaggaaaat taggagtttt aaacgcggtg gcactggtat tggatcactt ttccatattg 15841 cgaaacaata tggctttaaa tttccagcga ttcagaatta cacggcagta tccaccccgc 15901 aggatataga agccgctgag gattgggaag aaggcttaga ccgctactta aatgccaaga 15961 agttagaaga ctttttgggc aaaaccctaa agcgggttaa aaagtttgta tctcaagcgc 16021 ccaaacatct aggtgttaag gggtttgcag agaaggcaca gccaaaatct gatgataacg 16081 gtgttatcag ttttgcaaag ggcgatcgcc aaaagttaat cactgtgctt gtggagcagg 16141 cagcaaggga aggaagaaag ctaagaattc tcgatacctc cgaaaccgga gcaggaaaat 16201 ctcacgagtg gggtgagatt tccaacgggt tactagggat tgacaaatta ttttattgtt 16261 cgccgacgca ccgtaatccc agcacctcaa ctgttgagaa gaattttgcg gatctaccag 16321 taaggcacaa cgggctgttg gtagatgaaa ataaaccaca aaccgcattg ggtaacgctc 16381 acgtgcgctg ggcaaaaaaa cagataggcg agcaatcaaa tatcacccca aactgcaaaa 16441 atgccgaaca atttcatatt tattcaacca aaggctacca cgaaagtgta 
aattcaagtg 16501 ctggcattaa tccaatttgt aaaaaatgca aatttgctct tggttgtgcg ggggtcaaag 16561 atgaggatgg taattacatt atggagccag tggagggcta caccttccga ttggagcgcc 16621 aaaaggtatt aaaagaatat gctaaaattc gcgcttcttt ggattctctt ccaagcatca 16681 cgacaatgca ggagccaacc gaagaagggc aaccaggaat tagagcgggg attattgttg 16741 atgaatttta cgcccaatac agatccacaa acctcaccca agtgagtcta gctgaatttg 16801 atagtacttg ggttttcctc caatcgcgcg aggcttttat tctggagcac ttggcgcaaa 16861 cggaagaatt aatcgagtgg aagcggcagc aactcgaaaa tgcagccaat gagcaggaga 16921 ttaaaaaact ggagcaagaa atcaccgacc tcgcgcataa ctatcaagca gttaaggatc 16981 tggttgagaa actgcctggt gctctggaaa cagttaagcc cattatctac tccattcgtg 17041 gggttttgag cggggaagaa attgatgtta cccaagaaac cttctacggt tggaacgagt 17101 tgcagctgcg cgaggtgatt ggtgaattgc cagagggact gcaagaagca gcaaaagtac 17161 tagcggcagt aaaacccaga ctggaaacgc tgctggatgg catagaagcg gatagcgtaa 17221 gccaggaggg gattcaaggt aactccaagg ataagaaagg tagctcccca gttctgaggt 17281 acattcgtgg tcagttcaac aaagaagcga atcaacagaa gcgcgatcgc attaaaaacc 17341 tgccagctaa ctgcattgtc cccattttaa aaacactggc tggcgagcgc gggtccttcc 17401 gagtaaaaca tgggactttg gaaatcgtca ccgcaaatga ccgtcacaaa gaacaattgc 17461 ttgctgctga cttagccgta gtactagatg ccactctcac gccacaatca cttgcccaaa 17521 gcttagggat tgacccatcc gagattgtcg tcatctgcga agaacggaaa tcttactcca 17581 atttaaccat caaacatgtc cgggggttgg gcaaactaac gcgcaaccgc tctaaggaat 17641 tacaagaaag gttggacgcc ctgttgaccc atttggagga gaagcactta ggattaggaa 17701 taattgacca cttggcaact aaacgcgctg gcgatggtca ctggttcaac gatagccgtg 17761 gaaccaatgt ttaccaacac aaggaagcat taattaccat aggtagccct tatcaggaca 17821 ttggttcgtt agcgatggtc tacgctacca caacgggtga caagaatact acaaaaggta 17881 atcccaagtt tgacgcttat gtccaagagc ttaaggaagt agaagttatc caagctggtg 17941 gacgactacg cgctcatcgt cgtccggacg agcgactgat ctggtacatc gttaccgacg 18001 aggacatcaa ttatctaaaa ggtgcttttc cgggtgcgac ctttgaggag gtaagcgcgt 18061 tctcgatcac tcctgatgct ggcagtgaag gcgagagaac tttgggtact gtttttgctg 18121 cgatcgccaa gctcgcccaa gaagatggtc 
gcgaaactat cacccaagcc gaggtagctc 18181 acacagcggg acttgcccaa agcacaatca gcaaagccat taaccgcgat gaactagtag 18241 ccaagattgg tggttggcgc ggatttagaa aattattcca agctctatta gatgctccta 18301 ggggttggaa taatttatct aagaaattgc aagaagagtt cggtgaggat gccttgtttg 18361 cgattgaaac gtatctacct ctggttctcc aagataatga ccagaatgtg gaaaatcttg 18421 ctagtgagtg cactaatcta gctcaaactg ttggatgggg ggcaatacaa gcgtttttgc 18481 acctcatgcc agtggaaagc agagtaaata tgcttagctt cttggttggg ctgctcagtg 18541 atgattggca ggaagaattt ctacacctca tgccaatcgc ctaacccaat ccagaaaatt 18601 atcgcccaaa tgtcaactgg gctagtcgct acattgaaag ttttttctct ttatt // LOCUS NODE_1761_length_18616_cov_5.48214018616 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18616) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18616) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18616 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1178 /locus_tag="DP116_15115" CDS <1..1178 /locus_tag="DP116_15115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_15115" /translation="LCPPYQSLWFNYLKIAVRLPKHNFSYKVIYTVVELGILLLPVLP ALTNSHLRCAPLLGLIAVIRSCQMFNLQGRLIVAALVYISYVYTEFLRISRTPVFFKM PLGHGRAYMGPPMPPPNTVNFILNNAITFALTLTFVLLLVNAVLAERRSRQELAVAHE QLRQYALRIENQATLQERNRIAREIHDSLGHALTAQTIQLENALLLLPSNVDKAIEFL QQVKQLAYQALQEVSRSVATLRADPLRGKSLENAIHNLIRDFSSATTLTPECKISLTS PVTSEVGTAIYRILQEALTNISKHSGATQVSVQLQTQAGRINLLVEDNGKGFYPEQNT TGFGLQGMRERATALGGNFHIISKQKAGCRIQVSIPILDSSMINSEPATKNPKSKIL" gene 1175..1843 /locus_tag="DP116_15120" CDS 1175..1843 /locus_tag="DP116_15120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318570.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_15120" /translation="MIRLLLVDDQVIIRQGLKNLLESKPDLQVVGDAENGQLAIEALQ KLYGTPSQPDVVLMDVRMPVMDGVAATRLICQGFPEIHILVLTTFDDDEYVSQAMRYG AKGYLLKHTPLEDLAIAIRAVHQGYTHMGPGLFEKALNAPDISEPVQSAIPSELAELS PREKEVLRLIAMGLSNREIAHTLYISERTVRNHVTSILSQLHLRDRTQAALLASTFLP QLEI" gene complement(1893..2540) /locus_tag="DP116_15125" CDS complement(1893..2540) /locus_tag="DP116_15125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-3-methyladenine glycosylase 2 family protein" /protein_id="PRJNA477356:DP116_15125" /translation="MFISQSLTQETLARGLMELASRDSDFARVLETLGSPPMWERKAG FPTLVRIILEQQVSLASAKAIYERLCAIVVTLTPENFLTLDDVQLKAIGFSRQKTVYG RALANVIINGQLDLVKLETMDETTIRTELKRIKGIGDWTVDIYLLMALQRPDVFPKGD LALAIALQKLKKLSVRPTPEQLEAIAENWRPWRAIAARILWHYYLNIPKTLPSQT" gene 2611..3120 /locus_tag="DP116_15130" CDS 2611..3120 /locus_tag="DP116_15130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875367.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_15130" /translation="MVILRPFVDKQTILDFAKHYPELDIGALETCLAFLHTTADVYQA LDVHFARYGLSKGKFTLLMQLFLADEKGFTPSECAERGGVTKATITGLLDGLERDGLL KRFPDSEDRRMLRLQLTEQGRDLLSQMLPDHFCRTTNLMANLTDNEKKTLIKLLNKVR AGTSAMLDP" gene 3212..3748 /locus_tag="DP116_15135" CDS 3212..3748 /locus_tag="DP116_15135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875368.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15135" /translation="MNKLSVKQKNWLLTFHVAFGGIWFGTALCMIAIALSNRNTPSGD ELYAVNAVMKLLDDFVIIPSATLSLITGGLLCWLTIWGFFKHYWVIVKWIATVTLIVT GTIWLGPWTNAMTAISDAERLQALQNPLYVFDQKAVLVGAIIQTSCLLVIIGISVLKP WGRRNIKKQEKQPVASNS" gene complement(3745..4830) /locus_tag="DP116_15140" CDS complement(3745..4830) /locus_tag="DP116_15140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316267.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional DNA-binding transcriptional regulator/O6-methylguanine-DNA methyltransferase Ada" /protein_id="PRJNA477356:DP116_15140" /translation="MELQQIQSTEETFWQVVLNKDSSFDGKLFYGVRSTSIYCRPTCP SRRPNKNQVCFFKSSQEAEIAGFRPCKRCQPQYETMPNLTKAKILAVCRYIQEQVDRI PTLSELSSHFGISPSYLQRVFKQIMGVSPFEYADALRSERLKQLLQQGDEIAHALYDT GYGSSSRLYEKASKQLGMTPKTYRCNGKGIHITYSVVPCSLGYLLVATTEKGICAVKL GDQADELERLVLSEFNQAQIARDDDIHRDWVQTILDFVKGDIAHIDLPLNVRGTAFQK QVWQALQNIPYGETRTYADIAHEIGKPQAVRAVGSACGANPIALIVPCHRVLRTDGNL GGYHWGIERKQKLLAQEAQQKGKTPSC" gene 5119..5781 /locus_tag="DP116_15145" CDS 5119..5781 /locus_tag="DP116_15145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747960.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_15145" /translation="MVSSLKELIEESELVHVDDPEERFTISGVSWEIYEALLAKLEDN SHYRVTYLDGVLEIVSPSIRHEKVKKNLAMLLEHYMYIKRINCIPMGSTTFRNKAKKA GAEPDECYCIGEEKSIPDVAIEVNLTSGNLDKLETYRRLGVKEVWMWKTNGLYLYYLR EETPKQFIDTYGYERIDTSELLPELDISLLSRCALITNSLECIDEFEQGLKNKDNSTE LR" gene complement(5912..7084) /locus_tag="DP116_15150" CDS complement(5912..7084) /locus_tag="DP116_15150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein MoeB" /protein_id="PRJNA477356:DP116_15150" /translation="MLNPNLDEIQLTKDDYERYSRHLILPEVGLEGQKRLKVASVLCI GTGGLGSPLLLYLAAAGIGRIGIVDFDVVDTSNLQRQVIHGTSWVGKPKIESAKNRIL EINPYCQVDLYETRLTSENALDIIRPYDIVVDGTDNFPTRYLVNDACVLLNKPNVYGS IFRFEGQATVFNYEGGPNYRDLYPEPPPPGMVPSCAEGGVLGVLCGIIGTLQATETVK IIIGQGNTLSGRLLLYDALNMKFRELKLRPNPVRPVIEKLIDYEQFCGIPQAKAQEEK QQMEMSEITVQELKQLLDNGTNDFVLLDVRNPHEYEIAQISGSVLIPLSEIENGEGVT KVKELLNGHRLIAHCKSGMRSAKALGILKEAGVDGTNVKGGILAWSKEIDPSVPTY" gene complement(7224..7685) /locus_tag="DP116_15155" CDS complement(7224..7685) /locus_tag="DP116_15155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15155" /translation="MLKLLPQHIQTIYTHAETTYPEECCGIIFGNLTSEGKTVVEVMS TENAWNAETAADFPKDDTLDYSKKQRYAIAPQVMLQAQREARERNLNIVGIYHSHTDH PAIPSEFDRQCAWQEYSYIIVSVQNGKASDIKNWCLDDNHQFQQEVIENKI" gene complement(7795..8541) /locus_tag="DP116_15160" CDS complement(7795..8541) /locus_tag="DP116_15160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749272.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sterol desaturase family protein" /protein_id="PRJNA477356:DP116_15160" /translation="MTNHSFLFYWFVLFAVILARYFLIAGGAYLLFYSVLGKFLAKRS LRLKPPMAGSIQRDIKLSVLSAVVFALCAALISKYGIGVTLFYTDLHKYGLWYLGVSF VAVLILQDTYFYFMHRMFHHPLIFKWMHHGHHRSGEPTPWSSFAFDLPEAIIQALFFV GVIFTVPLHLITLVAVLMTMTVWAVLTHLGFEVFPSSSHHHWLGRWFIGSKHHLIHHR KYTVHYGLYFTFWDKLLGTQYPNYEDEFHM" gene complement(8601..9005) /locus_tag="DP116_15165" CDS complement(8601..9005) /locus_tag="DP116_15165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4112 domain-containing protein" /protein_id="PRJNA477356:DP116_15165" /translation="MDAAKRLATLNRIRKLSVLMDTSIRVPFFNFRIGLDPIIGLVPG AGDLISTTFSAYIIFLATRFGIPPKDLAQMIFNITLEAVVGTVPLVGDLFDAFYKSNI RNLAILEQHLTVVEPELEQVRSEIYDSKVSQV" gene complement(9134..9877) /locus_tag="DP116_15170" CDS complement(9134..9877) /locus_tag="DP116_15170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111375.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_15170" /translation="MSDSLTDSSALLEVKNVHAGYIKDVDILQGVNFRVERGELVTVI GPNGAGKSTLAKAIFGLLTPHTGSITFNGENIGGLKSNQIVEKGMCYVPQIANVFPSL NVEENLEMGAFVRNVPLKPLKDKIFQMFPRLGDRRQQRAGTLSGGERQMLAMGKALML EPSLLLLDEPSAALSPILVTQVFEQIKQINQTGTAIVLVEQNARKALEMASRGYVLES GRDAISGPGLQLLNDPKVGELYLGAGKAH" gene complement(10008..11219) /locus_tag="DP116_15175" CDS complement(10008..11219) /locus_tag="DP116_15175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995330.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_15175" /translation="MIVLEFKAKGRTTQYSAIDSAIKTAQFVRNKCLRFWMDNRGVGQ KELYRHNTALRAEYSFVKDLNSHACQAAVERAYSSIARFYDNCKKSIPGKKGYPQFKK NCRSVEYKTSGWSLSETRKQITFTDKKAIGKLKLKGTWDLNFYQLDQIKRVRLVKKAD GYYVQFLIRTESKIDTQPTGKTIGLDVGLKEFYTDSNGQTEPNPKFYRTGEKRLKFRQ RRVSRKNIGSANRKRAINSLGRVHLKISRQREEHAKRLARCVIQSNDLVAYEDLRIRN LVKNRCLAKSINDAGWYQFRRWLEYFGIKFGKVTVAVNPRLTSQECSNCGTMIKKSLS MRTHVCQCGFVLDRDYNAALNILSRALSTTGHVGTWILDPNASGDSTSTPIGAILSEQ VGSKSEESSPL" gene complement(11359..12093) /locus_tag="DP116_15180" CDS complement(11359..12093) /locus_tag="DP116_15180" /inference="COORDINATES: protein motif:HMM:TIGR02595" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PEP-CTERM sorting domain-containing protein" /protein_id="PRJNA477356:DP116_15180" /translation="MSVALISNLLRLSLATGVITLASVSSAIAGTLNVDFTKLEGLTG GNPGLTGVYRADLSNLGFDINSIAIADSNSAQGGQPGKFSGFDLDAIKISNVLVNNAT DAKNLPGLDVFDFTPSGTLFTPGIQRSSAEPTGLFGTTGGNIDNSVATLQNFDANASA DANAFGLVSLGDGGKVVFNLKNPISNTTPLYLYIGEAGDNGEVASGQITVSDKPLQVP EPTTLAAISLLGIYFFANRRKKTKAA" gene complement(12278..12502) /locus_tag="DP116_15185" CDS complement(12278..12502) /locus_tag="DP116_15185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015171149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_15185" /translation="MPHTIVTNVCEGVADCVDACPVACIHDGPGKNVKGTDWYWIDFA TCIDCGICLQVCPVEGAIVPEERPDLQKKP" gene complement(12605..13849) /locus_tag="DP116_15190" CDS complement(12605..13849) /locus_tag="DP116_15190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP phosphoribosyltransferase regulatory subunit" /protein_id="PRJNA477356:DP116_15190" /translation="MVYQSPVGARDLLPEYVAQKRWIEDRLQQVFHRWGYHRIITSTL ERMDTLMAGGAIDRSAVIQLQNGNDEDLGLRPELTASIARTAVTRMADVNYPQRLYYN ANVFQRTGELKHNRQQEFYQAGVELLGSGGGLADAEILLLMADSLQALGLNHWYLILG EAQITRSLIKAFPPHLQEKVRSAIAHLDRITIDTLPLSDELRERARIMLDLRGKPADV LQKLSSLGLDQSLQAIVDHLKTLVELLERSVNLSTTHINAREIEIILDLSLVQTFDYY TGIVFEVVSNTDGQTRVVGRGGRYDQLLKLYHPKGESIPGIGFALFLEDLQQILLSAR RLPQTTLASNWLIVPETPDAYTAAFAYATKLRDSTHLVRVEMDLGGRDADEIRQYARD RTISQIAWIKADGARTIESLVK" gene complement(13849..14787) /locus_tag="DP116_15195" CDS complement(13849..14787) /locus_tag="DP116_15195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873202.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_15195" /translation="MSFKFNRGLFKYDFIDYHAVLCVPIDADVKEIRKRYLKIARRLH PDSCKDETDADKELANELLSKLVNPAYETLSHEKQRMEHILILSQMGKRLVQQSASVE LKSDVTQQLASAAHFEHVYKSGLAQIAETQYDSLEKVQDTIALASELNLIYIMRSAGK IFAAASPAEIPTNAAAVKQAGVISPRQKKDSVVLQYIRRAQDLISQNQLTQARVELQD GLKLEPNNSRCHSLIGVVYLKQNLTTSAKVHFNRALQLDPKDEIALAGRLKIEQMTGQ KPSGAKRTASSHTGSQQPGKSGGGGLFGGLFGGKKK" gene complement(14918..15733) /locus_tag="DP116_15200" CDS complement(14918..15733) /locus_tag="DP116_15200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407906.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="inositol monophosphatase" /protein_id="PRJNA477356:DP116_15200" /translation="MSNLQIFLDIATEAALAAGAVLQGYLGKVEDAIIEKGRPGDLVT AADKASEAVILDVLRRHFPDHSILAEESGKLGNQDSQHLWAIDPLDGTTNFAHQYPFF AVSIGLLIQGVPQVGVIYDPFHDELFRAAQGLGATRNRRPIKVSETSELGKSLLVTGF AYDRRETSDNNYAEFCHLTHLTQGVRRSGSASLDLAHVACGRLDGYWERGLSPWDITA GIILLREAGGKVTAYDGSPLDIWSGRILATNGFIHDSLSQELQQVPPLSSWVG" gene 16488..16931 /locus_tag="DP116_15205" /pseudo CDS 16488..16931 /locus_tag="DP116_15205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311877.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transcriptional regulator" gene complement(17305..17589) /locus_tag="DP116_15210" /pseudo CDS complement(17305..17589) /locus_tag="DP116_15210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014277087.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(17613..18038) /locus_tag="DP116_15215" CDS complement(17613..18038) /locus_tag="DP116_15215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_15215" /translation="MSVYIPVELQRRIRNHFADCCAYCRTAESLTVSTFEFEHIIPRS AEGETIFENLCLSCPSCNRYKASRQTAIDTITQQEVPLFHPQQQLWTEHFTWSEDGTE IIGLTPVGRATILALKMNRPQLTRVRKMWVKMEEHPPNI" gene complement(18035..18379) /locus_tag="DP116_15220" CDS complement(18035..18379) /locus_tag="DP116_15220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748474.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15220" /translation="MSESVQYVTNQQGERVGVLLNLETYHQLTNALTSDAEILTGLSL DELQALAESMLSPKAQVQLDDLLARNAENQLSADETITLERLLEQVDQLNILKTRARY TLNKLEGTSGVA" BASE COUNT 5388 a 4111 c 3811 g 5306 t ORIGIN 1 ctctatgtcc accttaccag agtttgtggt tcaattacct gaaaattgct gtaagattac 61 caaagcacaa cttcagttat aaagttatct atacggtagt agaattgggg attcttttgc 121 taccagtgct accagctttg acaaacagtc atcttcgctg tgctcctcta cttgggttaa 181 ttgcggtgat tcgcagttgc caaatgttca acttacaagg tcgcttaatt gttgccgctc 241 tagtatacat atcatacgtc tacacagagt ttttacgaat tagtcgaacc cctgttttct 301 tcaaaatgcc cttgggacat ggaagagcgt atatgggacc accaatgccg ccacctaaca 361 cagtcaattt tatactgaat aatgcgatta cttttgcttt gacgttgact tttgtattgc 421 tactggttaa tgctgtgctg gcagaacgca gaagtcgaca agaactagca gttgctcatg 481 agcagctacg ccagtatgcc ttgcgaattg aaaatcaggc aactttacaa gagcgcaacc 541 gcattgcccg tgaaattcat gactccctag gacatgctct cactgcccag acaattcagc 601 tagaaaatgc tttgctatta ttaccttcca atgttgataa agccatagag tttctccaac 661 aagtaaaaca gctcgcttac caagcattgc aggaagtctc tcgttctgtc gcaacactgc 721 gagccgatcc attgcgcgga aaatctctag aaaatgcaat tcacaatctt atccgagact 781 ttagcagcgc aacaaccctc acaccagagt gcaaaattag cctgacatct cctgtgactt 841 ctgaagtggg tactgctatc tatcgcattt tacaagaagc actgaccaac atatccaaac 901 atagcggagc gactcaggtg agtgtgcagt tgcagacgca agctggaagg ataaacttgc 961 ttgtggaaga caacggcaaa ggcttttatc cagaacaaaa caccacaggt tttggacttc 1021 aaggaatgcg agaacgagca actgcattag gaggcaattt tcatattatc agcaaacaaa 1081 aagcaggatg ccgaattcaa gtaagcatac ccatactaga ctctagtatg atcaattctg 1141 aacctgcaac aaaaaatcca aaatccaaaa tcttatgatt cgactgttgc tggtagacga 1201 ccaagttatt attcgtcaag gacttaagaa cttactggaa tcaaaaccag atttacaggt 1261 ggttggtgat gctgaaaatg gtcaacttgc cattgaagca ttgcagaaac tttatggaac 1321 accatcgcaa ccagatgtcg tgctgatgga tgttagaatg cctgttatgg acggcgttgc 1381 tgcaactcgg ctgatttgtc agggatttcc agaaatacac attctggtgc tgacgacatt 1441 tgatgatgat gaatatgttt 
cacaagctat gcgctatgga gcaaaaggtt atctgttgaa 1501 gcatacaccg cttgaggatt tggcgatcgc cattcgagcc gtacatcaag gctacaccca 1561 tatgggacca ggactgtttg aaaaagccct aaatgcccct gacatctctg aaccagtaca 1621 gtcagccatc ccatcagaac tagcagaact gtcccctagg gaaaaagagg ttctgcgcct 1681 cattgcaatg gggttgagta accgcgagat tgcccataca ctttacatct cagaacggac 1741 agtcagaaat catgtcacga gtattttgag tcagttgcat ttgcgcgatc gcactcaagc 1801 agccctcttg gcgagtactt ttctcccaca gttggaaatt tgaggaacgc ttaacaggga 1861 acagggaacg cttaacaggg aacagggaaa aattacgtct gcgatggtag agttttaggg 1921 atgtttagat aatagtgcca taaaattctt gcagcgatcg cacgccaagg acgccagttc 1981 tctgctattg cttcgagttg ttcgggtgtt ggacgtactg acaatttttt gagtttttgt 2041 agggcgattg ccaatgctaa atcacctttg ggaaaaacat caggacgttg cagcgccatc 2101 agcaagtaaa tatctacagt ccaatctcca atacctttga tgcgctttaa ctcagttcgg 2161 atagttgttt cgtccattgt ctctagcttg acaaggtcaa gctgaccgtt tataataaca 2221 ttcgctaaag cacgaccata cacagtcttt tgccgactga agccaattgc ctttaactga 2281 acatcatcaa gtgtgagaaa attttctggc gtcagcgtta cgactatggc acacaagcgt 2341 tcatagatag cctttgcaga agctaaagaa acttgttgtt ctaaaatgat acgcacaagc 2401 gttgggaaac cagcttttct ttcccacatt ggtggagaac ccagagtttc gagaacgcga 2461 gcaaaatcgc tgtcgcgact ggcaagttcc atcaaaccac gggcaagagt ttcttgtgtg 2521 agtgattgtg agataaacat aaatacagtc atataaattt gactttaagg taccttagaa 2581 taagcgagcg ttccaaattg ctttttaatt gtggttatac tgcgtccatt cgttgataaa 2641 cagacaatac tggattttgc caagcactac ccggagttag acattggtgc actagaaact 2701 tgtttggctt ttctacacac cacagctgat gtttatcaag ccttagacgt tcattttgcc 2761 cgatatggtc tttcaaaagg gaaatttacc ctgttgatgc agttatttct ggctgacgag 2821 aaaggattca caccttctga atgtgctgaa cgagggggtg ttactaaagc aacaatcaca 2881 ggactccttg acggacttga acgagatgga ttactcaaac gcttccccga ctcagaagac 2941 aggcggatgc ttcgtttgca actgacagaa cagggacggg acttgctttc tcaaatgctt 3001 ccagaccatt tttgccgcac caccaacttg atggcaaatc tcactgacaa cgaaaaaaag 3061 acactcatca agcttttaaa taaagtacgc gctggaactt cagcaatgct tgacccctaa 3121 aacaaaaact tgtcaatggt tgttacctag 
tgtaaggttt taattaggag gctaactata 3181 tgccacctaa tcaagtttcc gaaaaactta tatgaacaag ctaagcgtca agcaaaaaaa 3241 ctggttgcta acgtttcatg tcgcctttgg aggaatttgg tttggtactg ctttatgcat 3301 gatcgctatt gctttaagca accgaaacac tcccagcggt gatgaattgt atgctgtcaa 3361 cgcagttatg aagctgctgg atgactttgt gattatcccc tctgctactt tgtcgctgat 3421 aaccggtgga ttactttgct ggctgacgat ttggggattc ttcaaacact attgggttat 3481 cgtcaagtgg atagcaaccg taacactcat tgttactggt acaatttggc ttggtccttg 3541 gacaaatgcg atgactgcaa tttctgacgc agaaaggttg caagcgctgc aaaatccgct 3601 gtacgtattt gaccaaaagg cagtacttgt gggtgctatt attcaaacct catgtctttt 3661 agtcattatt ggcatttcag ttctcaagcc ttggggaagg cgaaatatta aaaagcagga 3721 gaaacagcca gttgcttcta atagctagca gctgggcgtt ttgccctttt gttgtgcttc 3781 ttgtgcaagc aatttttgtt tacgctcaat cccccagtga tagccgccaa gattaccatc 3841 cgttcgcaga acacggtgac atggtacaat cagcgctatt ggattagcac cacaagcact 3901 acctacagcg cggactgcct gaggcttacc aatttcatga gcaatatcag cgtaggtgcg 3961 cgtttcacca taaggaatat tctgcaaagc ctgccagact tgtttttgaa aagctgtccc 4021 gcgtacatta aggggtaaat cgatgtgggc gatgtctcct ttgacaaaat caaggattgt 4081 ctgtacccag tctcggtgta tgtcgtcatc acgtgcaatt tgcgcttgat tgaattcact 4141 aagaacaaga cgttctagtt catctgcttg atcacctaat ttaacagcgc agattccttt 4201 ttctgttgtt gcgacaagca gatatcctag agaacatggc acgacgctat aggtaatgtg 4261 gattccttta ccattacacc gataagtttt gggtgtcata ccaagttgct tggacgcttt 4321 ttcatacagt cggctactcg atccatatcc tgtgtcgtaa agtgcatgag caatttcgtc 4381 tccttgttga agaagctgct tgaggcgttc actccgaagt gcatctgcat attcaaaagg 4441 ggatacgccc attatttgtt tgaagactcg ttgcaagtag ctaggactta tcccaaagtg 4501 agaactgagt tccgatagag ttgggatacg atcaacttgt tcttgaatgt atcgacagac 4561 tgccaaaatt ttagctttgg tcaaatttgg cattgtttca tattgcggct gacaccgctt 4621 acaaggtcga aaccctgcga tttcagcctc ttgtgaggat ttaaaaaaac aaacttggtt 4681 tttattcggt cttcgactag gacaagtcgg tcggcagtag atgcttgtag aacgaacacc 4741 gtagaatagt ttcccatcaa aactggagtc tttgttgaga actacttgcc aaaaagtttc 4801 ttctgtgctt tgaatctgtt gtaattccat tgatatctcg 
tttaagatac atcgcttatg 4861 attaaaggct ttaattgata ctagattagg cgtatataca aatcatgttc tacccgattc 4921 ttgcgatcaa tatttcagct tccatagcaa tccgtctagg agttgtcaga aaatttctcc 4981 ccgaactaag aaagcttcag caaagttggg attcatctgc agtgcttgac taaaatcttc 5041 aattgcttgc ttcctgtaag catttgccat gttactccat tgtgttacgg taatgatgta 5101 ccgcaaacag cgcataatat ggtaagcagc ttaaaagaac tcatagaaga atcagaactg 5161 gtacatgtag atgacccaga agaaagattt accattagcg gcgtaagttg ggaaatttat 5221 gaggcgctgt tagccaaact ggaggataat tcccattacc gtgttaccta cttggatggg 5281 gtattagaaa ttgtgtcgcc gtcaattaga catgaaaaag tgaaaaaaaa tctggctatg 5341 ttgctagagc attatatgta cataaagcgc attaactgta tacctatggg aagcactact 5401 tttagaaata aagctaaaaa agctggtgct gaaccagatg agtgttactg cataggagaa 5461 gaaaaaagca tcccagatgt tgctatagaa gtaaatctta caagcggtaa tcttgataag 5521 ttagagactt atcggcgact tggtgtaaaa gaagtttgga tgtggaaaac taatggactc 5581 tatctatact atttgcgaga agaaactccc aagcaattta tagataccta tgggtacgag 5641 cgcattgata caagtgaact tttgccagaa ttagatatat cattgctctc tcgatgcgct 5701 ttaattacaa attcacttga gtgtatcgac gaatttgagc aaggcttaaa aaacaaggat 5761 aattccaccg aactacgtta aaaacctgac aaagcagaaa attcctcaaa aaggtaaata 5821 aagcaactat ctaccgaagg aaaatctgct caatcaggct taacaaagat atcttgcgtt 5881 actcacaaat gaaattcatt tgcagttcat cttaatacgt tggcaccgaa gggtcaattt 5941 ccttactcca agcaagaata ccacctttca cattcgtgcc atcaactcct gcttccttga 6001 gaatgccaag ggctttggca gaacgcatac ctgatttaca atgagcaatc aggcgatgac 6061 cattgagcaa ttctttcacc ttagtcacac cttcgccatt ttcaatttct gacaaaggta 6121 tcaaaacaga accagaaatt tgagcaattt cgtactcatg tgggttgcgg acatccaaca 6181 gcacaaaatc atttgtgccg ttatccagca attgcttcaa ctcttggact gttatttctg 6241 acatttccat ctgctgtttc tcctcttgtg ctttggcttg tgggataccg cagaattgtt 6301 cgtagtctat cagcttttca atgactggac gaactggatt aggacgcagc tttaattctc 6361 ggaacttcat gttcaacgca tcgtaaagca gcagtcgccc gctcaaagtg ttaccctgtc 6421 caataatgat tttgacagtt tccgttgcct ggagagtgcc aataataccg cacagtacac 6481 ccagcacacc accttctgca caagaaggaa ccattcctgg tggtggtggt 
tcgggataaa 6541 ggtcacgata attcggtcca ccttcgtagt taaagactgt cgcttgtcct tcaaaacgga 6601 aaatggaacc gtagacattg ggcttgttca gcaacacgca agcatcgttg actagataac 6661 gagtggggaa gttatcggta ccatccacca caatatcata aggtctaata atgtcgaggg 6721 cattttccga agtcaggcga gtttcgtaca agtcaacctg acaatacgga ttaatctcta 6781 gaatgcgatt ctttgcagat tcaatcttgg gtttacctac ccaagatgta ccgtgaataa 6841 cttggcgttg caagttggaa gtatcgacaa catcgaaatc aacaataccg atgcgtccaa 6901 tacccgctgc agccaaatac aaaagtagag gtgaacccag tccacccgta ccgatacaca 6961 gcacactagc aactttcaga cgtttttgtc cctccaaccc gacttctggc aaaatcaggt 7021 ggcgggagta acgttcgtaa tcatctttcg tcagctggat ttcatccaga ttaggattga 7081 gcatagcact ttgatgagca agagcgaact gaacaattca aaattgaaaa tatgccctat 7141 gggcacgcta cgcctttcta tacacaagac agtttattga tcctatccaa aaacgaggtg 7201 tcttgagaaa ttgttaaata ctgttaaatt ttgttttcaa ttacttcttg ctgaaactgg 7261 tggttatcat ctaaacacca atttttgata tcactcgctt tgccattttg tacggaaaca 7321 ataatgtaag agtattcctg ccaagcacac tgacggtcaa attctgaggg aattgctgga 7381 tgatcagtat gggagtgata aatacctaca atatttaggt tacgctcacg tgcttccctt 7441 tgtgcttgca acatgacttg gggggcgatc gcgtaccttt gcttcttact gtaatctaat 7501 gtgtcatctt tagggaaatc tgcggctgtt tctgcattcc aggcattttc tgttgacatc 7561 acttctacca cagttttgcc ttcacttgtc agattaccaa atattatacc gcaacattct 7621 tctgggtaag tggtttcggc gtgggtgtag atggtttgta tgtgctgtgg caggagtttg 7681 agaatcatga aagattttta accgcagata gacgctgatg aattaaatgt attgccattt 7741 tgtagctgtg atcagaaatt cctgacgagc acttgaaatt ctaccttaag aaatttacat 7801 gtggaactca tcctcataat tagggtattg agtaccgagt agcttgtccc aaaacgtgaa 7861 gtacaatccg tagtgtactg tgtacttgcg atgatgtatc aagtgatgct ttgatcctat 7921 gaaccacctt ccaagccagt ggtggtggga tgacgaggga aatacctcaa atccaagatg 7981 ggtcaacact gcccatactg tcatggtcat gagcacggca accaaggtga ttaaatgaag 8041 gggaacggtg aagatgacac ctacaaaaaa gagtgcctgt atgattgcct ccggtaggtc 8101 gaaagcgaag gaactccacg gtgttggttc tcccgaacgg tgatgcccat gatgcatcca 8161 tttgaaaatt agagggtgat gaaacatccg atgcataaag taaaagtacg tatcctgaag 
8221 gatgagcacc gcaacaaaac taactcctaa ataccacagt ccatacttat gcaagtcagt 8281 gtagaagagt gttactccta taccgtactt tgatatgagc gctgcacaaa gagcaaaaac 8341 gactgcagag agaaccgata atttgatatc cctttgaatg gaaccagcca tcggcggttt 8401 cagacgcaaa ctccgttttg cgaggaactt ccctagaact gaatagaaga gcaagtatgc 8461 tcccccagct atgagaaagt atcgagcaag aataacagcg aacaagacaa accaatagaa 8521 caaaaatgag tggttcgtca atttaacgtt tcctccaagc agtcaggata atataactcc 8581 tcacaccgaa actctcactc ttaaacttgg ctaactttac tgtcgtaaat ttcactccta 8641 acttgttcga gttctggttc aactaccgtg agatgttgct ctaaaatcgc caaattacgg 8701 atattggact tatagaaagc atcaaacaaa tcacccacca atggtactgt accaacgact 8761 gcttccaaag taatgttgaa aatcatttgg gctaagtctt taggtggtat accaaagcga 8821 gtagctaaaa agatgatgta agctgaaaat gttgtactga ttaaatcacc agcacctgga 8881 actagaccaa taattgggtc tagtccgatg cgaaaattga aaaaaggaac ccgtatagac 8941 gtatccatta atacgctgag tttgcgaatg cgattgagag tagcaaggcg tttggcagcg 9001 tccataattt ccaagatttg cactactctt aaattgcatc agtttcggtt acgagtcttg 9061 taccctgaga aagaacactt attacatcag aagtagcttt gccttaagct tgctgtctga 9121 cttcctcatt ttctcagtgc gccttacctg ctcccagata cagttcgcct accttgggat 9181 cattcaataa ttgcaaacca ggacctgaga tagcatcgcg tccagattct aagacgtaac 9241 cccgtgaagc catttccaaa gctttacggg cgttttgctc tacgagtacg atcgctgtac 9301 cagtctggtt aatttgttta atttgctcaa acacctgcgt taccaggatg ggagacaaag 9361 ctgctgaagg ttcatctaaa agcaataagc tgggttctaa catcaaggct ttgcccattg 9421 ctagcatttg acgttcgcca cctgagagag taccagcacg ttgttgacgg cgatcgccaa 9481 gcctaggaaa catctgaaat atcttatctt tgagcggttt gaggggaaca ttgcgcacaa 9541 aagcgcccat ttctaaattt tcttctacat tcagcgaagg gaagacattg gcaatttgcg 9601 gtacgtagca cattcccttc tcaacgattt gattcgactt gagtccaccg atattctcgc 9661 cattgaaggt aattgagcct gtgtggggag tcaaaagccc aaaaatagct tttgctaacg 9721 ttgattttcc agcaccattg ggaccaatga ctgttaccaa ttctccccgt tccactcgaa 9781 aattgacacc ttgtaggatg tccacatcct tgatgtaccc agcgtggaca tttttcactt 9841 ctaggagggc agacgaatca gttagcgaat cagacataat gttctatatt agaaatacaa 9901 
aatatagtta ctttaaaata cagttatttg gcttaatttg aaacgatcca actatatccc 9961 aacacaaaca cagcgccaat aaaatgacat agttgacact ctccgcccta taagggcgaa 10021 gattcttcgc tctttgaccc aacttgctcg gacaggattg ctccaatcgg agtagaggtc 10081 gaatctcccg aagcgttcgg atctaagatc caagttccca catgccctgt ggtacttaag 10141 gctcgactta gaatattcaa tgcggcatta taatcgcggt ctaacacaaa cccacattga 10201 caaacatgag ttctcatgga caaagacttt ttgatcatcg tgccacaatt agagcattcc 10261 tgagaagtta atcgcggatt aacggcaacc gtaacctttc cgaacttaat cccaaaatac 10321 tctaaccatc ttctaaattg ataccaacca gcatcattaa tagatttagc gagacagcga 10381 ttttttacta gattcctaat tctcaaatct tcgtaggcga ccaaatcgtt agattggatt 10441 acgcaacgtg ccagtctctt ggcatgttct tcacgttgtc tacttatttt gagatgtact 10501 cgtcctaagc tattaatggc gcgcttacgg ttagcggagc caatattttt ccgagaaacc 10561 cgacgttgac gaaatttcaa ccgtttctca cctgttcgat aaaatttagg attaggttcg 10621 gtttgcccat tgctatcggt gtagaactct ttgagtccaa catccaatcc gatagttttt 10681 ccggtaggct gtgtgtctat tttgctttca gttctaatta aaaactgaac ataatagcca 10741 tcggcttttt tgaccaacct aactcgtttt atctgatcta actgatagaa gtttaaatcc 10801 cacgttcctt tcagttttag cttgccaata gcttttttat cggtaaatgt gatttgcttc 10861 ctggtttcag aaagcgacca tcccgatgtt ttgtattcta ctgagcgaca gttctttttg 10921 aactgaggat agcctttttt acctggaatc gactttttgc agttgtcgta gaatcgagca 10981 attgaactat aagctctttc tactgctgct tggcaagcat gcgagtttaa gtctttgaca 11041 aaggagtact ctgctcgaag tgctgtgtta tgacgataaa gttctttctg tcctacaccg 11101 cgattatcca tccaaaaacg gagacactta ttgcggacaa attgagccgt ttttattgcc 11161 gaatctatag cactatattg agttgtcctt cccttagcct taaactctaa aactatcatt 11221 tgacgtggac aaaccctacg tcaaacatcc tagcacgcaa aataaaaagt cgccctaaaa 11281 gggcgaggct tgtacccatc tttttcggtc aaaaacctat gtcctaacat gagtctaaaa 11341 gattattaaa ttttgaagtt aagcagcttt cgttttcttt ctacggttag caaaaaagta 11401 gattccgagt aaagatatag cagctaaagt agttggttct ggaacctgaa gaggtttgtc 11461 agaaactgta atttgaccac ttgctacctc accattatca ccagcttctc caatgtataa 11521 atacagtgga gtggtatttg aaatgggatt tttgaggtta aacacaactt 
ttccaccatc 11581 acctaagctt accaagccga aagcattggc atcagcactg gcgttggcat caaagttttg 11641 taatgttgca actgaattat caatgttacc acccgttgtg ccaaataaac ctgttggttc 11701 agcagaagag cgctggattc ctggggtgaa taaagttccg gagggagtaa aatcgaagac 11761 atctaaccct ggtaagtttt tggcgtctgt tgcattgtta accaagacgt tgctaatctt 11821 gattgcatct aaatcaaagc cgctaaattt cccaggttga ccaccttgtg cgctgttgct 11881 atcagcaata gcaatggaat tgatatcaaa gccaagatta gagagatctg cacggtaaac 11941 accagtcaaa ccagggttac cacctgttaa gccttctaac tttgtaaaat caacattcag 12001 tgttcctgct atagcactcg aaactgatgc tagggtgatt accccagtag ccaaacttaa 12061 cctcaacaaa ttgctaatca gagctacact catgagttgt cttacgtagg attactatac 12121 tcacctatac acaagtgcac tagcaatttc cgaacaataa atttttactg ttactttttg 12181 aaatctttat ctagtactac cagtgtaaaa acaggcggta tcatatatgt atgtaataga 12241 taatactgcc tgcttaaaaa gaatgtgatc gttgatacta cggttttttt tgtaaatctg 12301 gtcgctcttc tgggacaatc gccccttcta ctggacaaac ttgcagacat ataccacaat 12361 caatacaggt ggcgaagtca atccagtacc aatcagttcc cttgacgttt ttgccaggac 12421 catcatgaat gcaagctact ggacaagcgt ctacgcaatc agcgacgcct tcacagacat 12481 tagtcacaat tgtgtgcggc atagttcccc tctagttgga tcaaaattgc tatgtttagc 12541 cagctctata tagcctagca ataatactga attactagtc cagcagtcat ttgtccttgg 12601 ttttttactt gactaatgac tcaatcgtcc ttgcaccatc agctttaatc caagcaattt 12661 ggctgatagt gcgatcgcgc gcatattgtc gaatctcatc agcatctcgt cctcccaaat 12721 ccatttctac ccgtaccaag tgagtcgagt ctcgaagttt tgttgcgtag gcaaaggcgg 12781 cggtgtaggc atccggtgtt tctggtacaa tcagccaatt gctagctaag gtagtttgtg 12841 gtaaccgccg agccgacaag agaatttgtt gtaagtcttc caagaagagt gcaaaaccta 12901 ttcctggtat actttctcct ttaggatgat agagctttaa aagctggtca taacgaccac 12961 ctcgccctac aactcgcgtc tgtccatcgg tattactgac cacttcaaag acaatacctg 13021 tgtagtagtc aaaggtttgt actaaactca ggtcgaggat gatttctatc tctcttgcgt 13081 tgatgtgcgt tgtactcaaa ttgactgaac gctcaagtaa ttctactaat gttttcaggt 13141 gatctacgat cgcttgtaga ctctggtcta aacctagact gctgagtttt tggaggacat 13201 ccgcaggctt tccgcgcaaa tcaagcatga 
ttctagcgcg ttctcgtagt tcatcactta 13261 acggtaaagt gtctatggtt atgcggtcaa ggtgggcgat cgcactccgg actttttctt 13321 gtagatgagg gggaaaagct tttatgagcg atcgcgtgat ttgagcctca cctaaaatca 13381 aataccaatg gttcaaaccc aaagcctgta aagaatctgc catcaacagc agaatttctg 13441 catcggctaa gccaccacca ctgcccaaca actccacccc agcttgataa aattcttgct 13501 gacggttatg cttcaactcg cctgtccgtt ggaaaacatt agcattgtaa tacagacgtt 13561 gtggataatt cacatccgcc atgcgcgtta cagccgtgcg agcaatagaa gctgtcagtt 13621 ctggacgtaa gcccaaatct tcatcattac cattttgcag ttgaatcacc gcagagcgat 13681 caatagctcc ccccgccatc agcgtatcca tacgctcaag cgttgaggtg ataattctgt 13741 gatatcccca acgatgaaac acttgctgta gcctgtcttc tatccagcgt ttttgagcca 13801 catattcggg taataaatcc ctcgctccca ctggagattg atacaccatt attttttctt 13861 cccaccaaac aaaccaccaa acaaaccgcc acccccagat ttacctggtt gctgactccc 13921 cgtatgagac gaagctgtac gcttcgcacc actaggtttt tgccccgtca tctgttcaat 13981 tttaagtctc cccgccaacg ctatttcgtc tttggggtct aattgcaggg cgcggttaaa 14041 gtgaactttt gcactcgtgg taagattttg cttcaaatag accactccga tcaagctatg 14101 gcaacgacta ttatttggct ctagtttcag accatcttgt aattctaccc ttgcttgtgt 14161 caattgattt tgtgatatca aatcttgagc gcgacggatg tattgcagga caacagagtc 14221 tttcttttgt cgtggagaaa ttacaccggc ttgcttaact gctgctgcat tcgttggaat 14281 ttctgctggt gatgctgctg caaatatttt gccagcactc cgcattatgt atatcaaatt 14341 caactcgctg gctaaggcta tcgtgtcttg taccttctct aaagagtcat attgagtttc 14401 tgcgatttgg gcgagtccac ttttatagac atgctcaaaa tgagccgcac tagctaactg 14461 ctgagtcaca tcgctcttga gttccaccga agctgactgt tgcaccaaac gcttacccat 14521 ttgcgacaaa attaaaatat gttccattct ctgtttctca tgagagagtg tttcataagc 14581 cggattcacc agcttagaca acaattcatt agcgagttct ttatcagcat cggtttcatc 14641 cttacagcta tcaggatgca agcggcgggc gattttgaga tagcgtttgc gaatttcttt 14701 gacatcagca tcaattggca cacacaaaac cgcgtgataa tctataaagt catatttaaa 14761 caatccgcga ttgaacttga aagacatata acggtattgc accataatcc cgtggatttt 14821 tctcgtttac tgtactggtt ttatgttttt tttataacat ttattactca gtaaacagtg 14881 aacaaaatgg 
taactggtaa ctgataactg ttaactgtta accaacccat gaagacagtg 14941 gcggaacttg ttgtagttct tgactcaggc tatcgtgaat aaaaccattg gtagcaagaa 15001 ttctaccaga ccagatatct aagggacttc catcgtatgc agtcaccttt ccgccagctt 15061 ctcgtagtaa aatgatacca gcagtaatat cccaaggaga aagacctctt tcccagtaac 15121 catccaaacg accacaggcg acatgagcca aatctaggga tgctgaacca ctacgccgaa 15181 ctccttgagt gagatgagtc aagtgacaaa attccgcata gttgttgtca gatgtttcac 15241 gacggtcata ggcaaatccc gtaaccaaaa ggcttttacc aagttcagaa gtttctgaaa 15301 cctttatcgg gcgacggtta cgcgttgctc ccaaaccttg agccgcccgg aatagctcat 15361 catggaaagg gtcataaata acaccgactt gcgggacacc ctgaatcaac agcccaatcg 15421 aaacagcaaa aaagggatat tggtgagcga agttcgttgt tccatccaga ggatcaatcg 15481 cccacagatg ttgactatct tgatttccta gtttcccaga ttcctcagcc aagatagaat 15541 ggtcaggaaa gtgacgacgc aaaacatcta aaatcaccgc ttccgaagct ttatcagcag 15601 ccgttaccaa atcaccgggg cgtccttttt caataatcgc gtcttctacc ttacccaaat 15661 aaccttgcaa caccgcgcca gcagcgaggg ctgcttctgt agcaatatcc aaaaagattt 15721 gcagattact catgactcgt taataattcg taatattagc gtattacgaa ttatgcatta 15781 agcatagacg tgaagtttac ccttatccgc ctccagcacc tttcccctta gaaaggctac 15841 cgtgtataca ccggatcgct cgaaacctca ccctcgcttt tagctgcgct aaaatctttc 15901 cctctcctta gcaaggagag ggatgcccga tagggcaggg tgaggttcct ggttcccaga 15961 ggcagagcct cacaccaaat gcattcccag cctgaggctg ggaacgagcg aggacagacg 16021 cggagtttac tcctatccgc ctgcggcact tttcccctta gaaaggggaa agggaggaga 16081 atttccggaa tagtctactg tttagttgat atctacatac agtagaattg tcgcaagtgt 16141 atatattatt cataataaaa gctgtattat ggcagatacc tcaaacgtta aactaagcat 16201 tacattattc ttcaatcggc aaaaggatga tttggtgttg gtgtattttt ctggtcatgg 16261 tattaaaaag gaagtagaac ggtgcatccg aaatggtcag gtttgacctg taggacgccg 16321 gattctcaaa catcgccaga gtgaactgag tttatcacca gatgaggcaa aagaaattga 16381 gaacgaagtg cttgaacctt tccgtaagca ccagcaaaat ctataagaat ataaagaagc 16441 actagcgact attgcaagtg tgaaagaata acaaaaggaa cttgtaaatg accaaccaac 16501 cgcaatttga tgttttccta gctcacaata ctaaggataa gccaaaagtt agaataattg 
16561 ccaataaact caagcggttt ggtctcaagc cgtggttaga cgaagagcaa attgcaccag 16621 gacgaccatt tcaggatgtc atccaaaaag cgattcaaaa tgttaaatcc gctgctatct 16681 tcattggatc tacaggatta ggaaagtggc agttcttgga actgcgatcg cttctcgacc 16741 aacttgtgga agcagatatc cctgtcattc ccgtcttgct accaggagtg gaagaaattc 16801 ccagcaattt gcattttctc aagcaattca attgggtaaa gtttaccaat gggatagacg 16861 atactcaagc cttaaactac ttaaagtggg gtattaccca agaaaagccc ataacagaat 16921 ttcaagccga tggcgacagt gatgagtatg atatactaga tgattacgat ataattgcga 16981 ataacggcaa tataattgcg aataaagatc tgaaagatga atacgatccc aataaagata 17041 tactagatga ttacgatata attccgaatg attaattgga ataaatatac tatgaattca 17101 ttggggtgca accgattcac attgtgtggg tgcaaccaca ccgttttttg atatgtgttg 17161 ggtgcaacca cactgcacac cgttagcaca aaatcaaatc ggagaaaaaa ttctcttctc 17221 tcgcatcgag acttgtaaac tgtaacatct aagcgtttca gaatattgta aatatttatt 17281 gcagcttgct ttcgcacgct acaatcaaca tttccagatt tcaatacctg gcaccgaggg 17341 gaaatatcca tttgcgtgga taaatttttc tgctacttca tgcacctctg tacactcggt 17401 gattgtttgc ttatcccagg gtaaactcat ttatcctggc gactttttct gatttccttt 17461 actcttcatt gatggcaaaa agcgactacc ccaatggttt cttttttctg gctttgcttg 17521 tggtcgatat tttttgcaaa atcctcgata tttagcagcg cattcctcta agtttctccc 17581 taattgcaaa aatgcaggat gccacttact cctcaaatgt ttggtggatg ttcctccatc 17641 ttcacccaca tcttacgcac acgagttaat tgtggacggt tcatttttag agccaaaatt 17701 gttgctcgac caacaggcgt aagtcctata atttctgtac catcctcact ccaagtaaaa 17761 tgttctgtcc acaattgttg ctgtggatga aacaaaggga cttcctgttg agtgattgtg 17821 tcaattgctg tttgacggga tgctttatag cgattacaag atggacaaga caagcagaga 17881 ttttcaaaga tagtttcccc ttcagcggat cgaggaatga tatgctcgaa ttcaaaagta 17941 ctaacggtta aggattcagc agtacggcag taagcacaac aatctgcaaa atgattacga 18001 atgcgtcgtt gtaactcaac aggaatataa acgctcacgc tactcctgat gtcccttcaa 18061 gcttattcaa agtgtacctt gctctggtct taagaatatt tagttggtca acctgttcca 18121 acaaacgctc taaagtaatt gtttcatcag ccgaaagttg attctctgca tttctagcta 18181 gcaaatcgtc caattgaact tgggcttttg gagataacat 
actttctgcc aaggcttgca 18241 gttcatccag actcaagcca gttaatattt cagcatcaga tgttaatgcg ttggttaact 18301 gatgataggt ttccaaattt aagagtacac caactcgctc tccttgttgg ttggtaacgt 18361 actgaactga ttcagacatt ggtggttttg gaatatattt ggatttatcg taacctaaaa 18421 gcaacgtttg cttaaatcag tgaacagtta actctgaaca gtgaactctg aactctgaac 18481 agtgaaactg ataactgata actgataact gataactgat aactgaaata tcaagtttgt 18541 ctaattactc acaataaaaa gacctcaccc cgccctatgc ctaacggcac gctacgctat 18601 cgggcacccc tctcct // LOCUS NODE_1782_length_18399_cov_5.053260 18399 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18399) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18399) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18399 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(181..1260) /locus_tag="DP116_15225" CDS complement(181..1260) /locus_tag="DP116_15225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321944.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gfo/Idh/MocA family oxidoreductase" /protein_id="PRJNA477356:DP116_15225" /translation="MTVQNRSMSVGEQNGQLQRNHPRPIRIGVIGVGNMGQHHVRVLS SMKDVELVGVADINVERGLETASKYKVRFFEDYCDLLPHVQAVCVAVPTRLHYAVGIN CLLAGIHVLIEKPIAASISEAESLVNAAADSQCILQVGHIERFSPAFQELSKVLKTEE VLALEAHRTSPYSSRANDVSVVLDLMIHDIDLLLELAAAPVVKLTASGNRTLDSGYLD YVTATLGFANGIVATVTASKVTHCKIRRIVAHCKNSFTEADFLRNEILVHRHTNDYRQ AALYRQDGVIERVSTSNTDKLGAELEHFVNCVRGGNQPSVGGEQALKALRLASLIEQM ALEDRVWNPLDWQSEPKVQSLTPTV" gene 1853..2530 /gene="queC" /locus_tag="DP116_15230" CDS 1853..2530 /gene="queC" /locus_tag="DP116_15230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312206.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="7-cyano-7-deazaguanine synthase QueC" /protein_id="PRJNA477356:DP116_15230" /translation="MKAVILLSGGLDSSTVLYQAIADGYNCHTISFDYQQRHRRELDS ALAIAQKAGVVDQQVVKFDLRQWGGSALTDDAIELPQKRDLEEMSQSIPVTYVPARNT IFLSFALAYAETISAQCVYIGVNALDYSGYPDCRPDYIQAMQEVFRLGTKQGREGQSI SIKTPLIQLKKTEIIQLGNKLGVPWELTWSCYLGGDVACGVCDSCRLRLAAFGELGLK DPLAYAN" gene complement(2593..3969) /locus_tag="DP116_15235" CDS complement(2593..3969) /locus_tag="DP116_15235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15235" /translation="MGHQYYTATKVSASFGLSWTDVGLRLLSVLVLIAINAFFVTAEF SMVTVRRSRIHQLVEAGDIQAIAVEVLQHSMDRLLSTTQLGITLSSLALGWIGESTIV VLVKSCLLSLPLPIGMSIVVAHSLSIPIAFFFIAYLQIVLGELCPKSVSILYSEQLAK FLGPSIRATVRFFSPFIWILNQSTRWLLRFFGIEYTGQSWKPPVTPEELQLIISTERE STGLQPQERQLLKNVFEFGDETVQAVMIPRTSVIALPKIATFGTLLQEMIATGHSCYP IIGESLDEIRGIVYFKDLAKPLAVGKLTLETQIQPWMRPARFVPEHTPLKELLPTMQR EEPTMVMVVDEFGATVGLVTIQDIIAQIIGYPGEPDTTNDLLIKISDEQTFLVQAQIN LEDLNEILHLDLPLRKEYQTLGGFLLYQLDKVPTLSETFRYENLEFTVVSVTGPRLHQ IQVRRLEE" gene complement(4013..4717) /locus_tag="DP116_15240" CDS complement(4013..4717) /locus_tag="DP116_15240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319144.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15240" /translation="MKLNSTVLLTLILLVLMLGAGSVSAFWGFTLGSAALKGVTAPDA RPTSKYTSAKSAKEQHAGVAFLKEDQILKIVKSRIEGKTKAAKSRKKDDDDEQVNTSK AKKEEKSTPAAEQEKPQEGFPINAQSEGVSLSVQSARFSGGALLLKVQMQNKGKDSVR FLYSFLDVSDDKGRTLSASTEGLPSELPVNGSPVSGTVSIPTPLLDNVKRISLALTDY PAQRLRLEVPNIPVER" gene 4906..4978 /locus_tag="DP116_15245" tRNA 4906..4978 /locus_tag="DP116_15245" /product="tRNA-Met" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:4939..4941,aa:Met,seq:cat) gene 5112..5735 /locus_tag="DP116_15250" CDS 5112..5735 /locus_tag="DP116_15250" /EC_number="2.4.2.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196677.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="orotate phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_15250" /translation="MTYQAEIHAQPNISATTTDLVTVSQYLLDLLCQLAYKEGDFVLS SGQRSSYYINGKQVTLHPQGALAIGRILLSMLPLETQAVAGLTLGADPIVSAVSVVSA YENRPIPALIIRKEAKGHGTRAYIEGPSLPESAKVVVLEDVVTTGQSAMKAVERLREA GYTVNQVISLVDRQQGGAEFYQQAGLEFEAVYTIKEIQERYRQLANQ" gene complement(5888..7072) /locus_tag="DP116_15255" CDS complement(5888..7072) /locus_tag="DP116_15255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bacteriocin transporter" /protein_id="PRJNA477356:DP116_15255" /translation="MLSLTVGDPVPWFTLPSTSNPTFHFSSVGGYRVILFFFGSTKNN LIREILNQFCTRQNQLTSYQVPFFGVSIDPDDSFLAELVENKTYFKFLWDFKREVSIQ YGVCQPDNPAGSEFQYEPKIFILDENLRVLQVIDVLASAHPIEQVFQFIDGLPAIPPA SLATKLAPVLFIPNVLEPSFCQNLIDLYLADGGQDSGFMRQVDGKTVTVYDYSFKKRR DYFISNPQLLEQINQSIIRRVKPEIEKAFQFSITRFERHLVACYEATDQGFFNRHRDN TTKGTAHRRFAMSLNLNTGSYEGGYLRFPEYGSQLYCPNTGEALIFSCSLLHEATPVT SGRRFALLSFFYNDEDAKVRKTNSKYVVLDSTADTQTEYSSKQAEKTAQGFGKQPTKK KR" gene complement(7494..8870) /locus_tag="DP116_15260" CDS complement(7494..8870) /locus_tag="DP116_15260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317241.1" /note="catalyzes the formation of 4-amino-2-methyl-5-phosphomethylpyrimidine from 5-amino-1-(5-phospho-D-ribosyl)imidazole and S-adenosyl-L-methionine in thiamine biosynthesis; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thiamine biosynthesis protein ThiC" /protein_id="PRJNA477356:DP116_15260" /translation="MRTEWVAKRRGQSNVTQMHYARQGVITEEMHHVAHRENLPIELV RDEVARGRMIIPANINHTNLEPMCIGIASKCKVNANIGASPNSSNLEEEVAKLNLAVK YGADTVMDLSTGGGNLDEIRTAIIKASPVPIGTVPVYQALESVHGNIEKLTPDDFLHV IEKHAQQGVDYMTIHAGILIEHLPLVRSRLTGIVSRGGGILARWMLAHHKQNPLYTHL RDIIEIFKKYDVSFSFGDSLRPGCTHDASDDAQLAELKTLGQLTRKAWEDDVQVMVEG PGHVPMDQIEFNVKKQMEECSEAPFYVLGPLVTDIAPGYDHITSAIGAAMAGWYGTAM LCYVTPKEHLGLPNPEDVRNGLIAYKIAAHAADIARHRPGARDRDDELSRARYNFDWN RQFELSLDPERAREYHDETLPADIYKTAEFCSMCGPKFCPMQTKVDADALTELEKFLA KDKATVIQ" regulatory complement(8878..8970) /regulatory_class="riboswitch" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00059" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="TPP riboswitch; Derived by automated computational analysis using gene prediction method: cmsearch." /bound_moiety="thiamine pyrophosphate" /db_xref="RFAM:RF00059" gene 9123..9398 /locus_tag="DP116_15265" CDS 9123..9398 /locus_tag="DP116_15265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_15265" /translation="MENQTVRTTLSIPVELLEATDRAVREGKAKSRNDFVARALRHEL AVQKRAEIDAAFAGMGNDVECQAEAVMMSHEFAKSDWEAFQIGETQQ" gene 9395..9754 /locus_tag="DP116_15270" CDS 9395..9754 /locus_tag="DP116_15270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866613.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" /protein_id="PRJNA477356:DP116_15270" /translation="MRRGEVYDACLDPTEGSEQARTRPVIIVSRDAINSASQVVLVVP CTNYRLGRRIYPSQVLIHAPDGGLDRDSVAMAEQVRALAKTRFLYLRGMLSPLSLQQL DQALLIALDLPGQVEFE" gene 9769..10605 /locus_tag="DP116_15275" CDS 9769..10605 /locus_tag="DP116_15275" /inference="COORDINATES: protein motif:HMM:PF07924.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15275" /translation="MKPSNAEFVAQLQAATEGLLFGSENSYPFKTFVFEVVNQGDFTV ENLLQTAGFMKPVNLDDFLQFVSEIAPESSQNYQEIINFLELYTTSSQIYRISLEDES GEYEAFHILVGNTKDGDWIGISPRIDNEPSARRSEKFLMESSTLVKESTFKLKTKLEP LLAKLKFIVTEYYEKNQEKQGFVLEIADTRAVMMKKLLDSTGFVKTCAFKGFSENAQE NDYPDREYFEQFKPLDELLQSRLTNLREYVIGGMAVYYLYDIGQTPDGNWVGVWTIAI WT" gene 10652..11287 /locus_tag="DP116_15280" CDS 10652..11287 /locus_tag="DP116_15280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871465.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pseudouridine synthase" /protein_id="PRJNA477356:DP116_15280" /translation="MTNYRYILFYKPYGVLSQFTKDTPTRSTLKDYIDIPDVYPVGRL DWDSEGLVLLTNNGQLQHRLSHPQFGHERTYWVQVERIPDAAALTQLQQGVTIQDYRT RQVMVALLPTDPPLPERDPPIRFRKNVPTAWLEMTLTEGKNRQVRRMTAAVGFPTLRL VRISIAHLHLDDLQPGQWRDLSSSELKLLLDLAFAGSRKFKSNRTTLLDYN" gene 11598..13556 /locus_tag="DP116_15285" CDS 11598..13556 /locus_tag="DP116_15285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317704.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein phosphatase" /protein_id="PRJNA477356:DP116_15285" /translation="MLICPQCKFENPNANKFCQRCGASLTHKVCGECGTDLALNAREC DNCGAACGSVLWAIITKEESRRVGEDKEDFASSSFSSALSPGSYLDSQKRYQLLEPLP ALQEIAPNTQVSVRVLDCQPYQVSLLEVMLTNQQKGLVMASVGVDKIPSLAKPYIALR SWCHLGIPPIHDAWQQDDMQVVLIEDRSNWQELVDLWQDDSLSSLQIVHFFSQMTQLW SVLEKAHCRQSLLELSNLRVDEKLSLALQKLYVESPDKTATNFPEDTQATLPETTVVQ PLTIQALGQVWHALFRQSQRTQFGSVLHLIGELELGNLETLAQLQSSLSDIVSELQGN STSVSIPSTLFKSNAAPTVLQLDDQEDETEFKSDDKSIAMLPMQLISLENAGLTNVGR QRDHNEDYFGIDTKVYNLESPNTHTLQARGLYILCDGMGGHAGGEVASALAVKSLREY FQTHWASNQLPTEDNIRAAVRLANQAIFNVNQQDARSGVGRMGTTVVMILIQDTQVAV AHVGDSRLYRVTRKRGLEQVTMDHEVGQREISRGVERSVAYSRPDAYQLTQALGPRDE QSIIPDIQFFEINEDTLFLLASDGLSDNDLLTLHWQNYLAPLLSHSANLESGVQALID LANHYNGHDNITAVLIRARVRQSINQQQ" gene 13698..14921 /locus_tag="DP116_15290" CDS 13698..14921 /locus_tag="DP116_15290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317705.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_15290" /translation="MVTLTLLEPQQNIPLQQWHFDDESVIRVGRSADNDVVLSDSLVS RYHLELRQVDSDKNGGSWQVISQGTNGTFLNGVLITQTSLSDNSVLQLAQKGPILKFQ IHKQTTPPYSPTPALPYTSCTHEGNSPNNLFCIHCGQPMSIMQTIRQYQVLRILGQGG MGTTYLAWDGLGTIAGHPQLLVLKQMNADMAKIAKAQELFEREANTLKSLSHTGIPKY YDSFVEGGKKYLAMELIHGQDLEKLVYSKGPVVPNQAINWMIQTCDILEYLHSKEPPL IHRDIKPANLMVQNCNDRIVVLDFGAVKEIATTPGTRIGAEGYCAPEQERGQPVTQSD LYAIGPTLIFLLTGESPFKFYRQRGRSFRFEVANVPTITPQLREVIDRATEPLPCDRY QTAKDLVAALAVCKV" gene complement(14969..15190) /locus_tag="DP116_15295" CDS complement(14969..15190) /locus_tag="DP116_15295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196690.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4327 family protein" /protein_id="PRJNA477356:DP116_15295" /translation="MSKQQVIHPMVKLQRNVRSLVDSSIIKPTDNIWKIALLFGNEWQ HWKQELLDYGFSMQDPVSELLAVEAWDEE" gene complement(15923..16450) /locus_tag="DP116_15300" CDS complement(15923..16450) /locus_tag="DP116_15300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309469.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pesticin" /protein_id="PRJNA477356:DP116_15300" /translation="MGIERKVKLLQHVLKKKKKDEGFTLMELLVVIIIISILSAIALP SFLSQANKAKQSEAKTYVGSMNRAQQAYYLENGQFVIDTSKLGIGIKPETENYKYEIK VNTNQVANNGISRKNGLKSYAGVVILSEQEGATLAILCESDNIGVGSTQEPSSTTNGT SAQPKCPPNYTELKK" gene 16679..18052 /gene="trxB" /locus_tag="DP116_15305" CDS 16679..18052 /gene="trxB" /locus_tag="DP116_15305" /EC_number="1.8.1.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317714.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thioredoxin-disulfide reductase" /protein_id="PRJNA477356:DP116_15305" /translation="MTNPTVENVVIIGSGPAGYTAAIYAGRANLKPVVFEGFQAGGLP GGQLMTTTEVENFPGFPQGITGPELMDKMKAQAERWGAELYTEDVVEVDFSQRPFTVR SDERELKTHSIIIATGATARRLGLDSEHQFWSRGISACAICDGATPIFHGAELAVVGG GDSAAEEAIYLTKYGSKVNLLVRTEKMRASKAMQDRVLSNPKITVHWNTEAADVFGND KHMEGIKIRNTKTGEESKLHVKGLFYAIGHNPNTSLFKGQLELDDVGYIVTKPSSPET SVEGVFAAGDVQDHEYRQAITAAGSGCAAAMLAERWLSSSGLIQEFHQKPETPDNELE QLAEKKTEEQLAAEFDLNATRHQGGYALRKLFHDSDRLLIVKYVSPGCGPCHTLKPIL NKVVDEFDGKIHFVEIDIDKDRDIAENAGVTGTPTVQFFKDKDLLKEVKGVKQKSEYR QLISSNL" gene complement(18107..18289) /locus_tag="DP116_15310" CDS complement(18107..18289) /locus_tag="DP116_15310" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15310" /translation="MAYQERLKPWAVINSVSPSQKVVVSRHRSRVDAEEYLRLLRQQK PTSQHEVVFELAEDRV" BASE COUNT 5257 a 3858 c 4018 g 5266 t ORIGIN 1 gtcctaatct aaatactttt tagctaattt tttgaaaaaa tacactttct ataaatgaaa 61 aagggtacag gttacaagtt gtggatgata acgagattct attctctcac cccctacaat 121 ctatccccca ttcccttttt atacattgat gtaccgtgct gtaacctgtt tcctgttccc 181 ttaaacagtg ggtgtcaaag attgcacttt tggttcagat tgccaatcta atgggttcca 241 aacacggtct tccaaagcca tctgctcgat taagcttgct agtcttaaag ctttgagtgc 301 ttgttcacca ccaactgaag gttggttacc tccacgcacg cagttgacaa aatgctccaa 361 ctctgcacct agtttatcag tattacttgt agaaactctt tcaatcacgc catcctgtct 421 atataatgct gcttgtcgat agtcgttggt gtgtcggtga accaaaattt catttctgag 481 aaaatctgct tcagtaaagg aatttttaca atgggcaaca atacgacgaa ttttgcagtg 541 agtcacttta ctggcagtga cagtagcaac aataccattg gcaaatccta atgtcgcagt 601 tacgtaatct aaataaccag agtctaatgt acgattacca ctagctgtta acttcactac 661 aggagctgcg gctaattcca aaagcaggtc aatgtcatga atcattaaat ccaaaacaac 721 tgagacatcg tttgcccgac ttgaatatgg acttgtacga tgtgcctcta gcgctagcac 781 ttcctcagtt ttcaacacct tgctcaattc ttgaaatgct gggctgaagc gctctatatg 841 accgacttgt agaatacatt gagaatcggc ggcagcattt actaatgatt ctgcttcaga 901 tatacttgct gcgattggct tttcaatcag aacatgaatt cccgccaaaa gacagttgat 961 tccaacggcg taatgcaaac gcgtgggtac ggcaacacaa actgcctgta catgaggcag 1021 caggtcgcaa taatcttcaa aaaagcgcac cttgtatttg ctggcggttt ctaatcctcg 1081 ttcaacatta atatctgcta caccaaccag ttcaacgtcc ttcattgaac tcagtactcg 1141 aacgtgatgc tgtcccatat tacccacgcc gatcacgcct atgcggatcg gtcgtggatg 1201 gttgcgctgt agttgaccat tttgttctcc cactgacata cttctattct gcactgttat 1261 tctctcctca accaccacat ttagagacgt tgtagtcatt ggatttgata ctcaactgcc 1321 aatggctatg cgtctaaaac catccagatg gtaacataga ggttctattt atgaagaatt 1381 tcaaagttta ttgtgagttc ccgccatatg actcggtttt ataagaagtg tttaaattga 1441 tgcttttttt gcactttacg aaaattataa tgagcttttt attagcttta gaaactaact 1501 aagacgttag 
tcaagtgctc tttgtttcac aaaaaaaatg attaccgaaa acgaaatcta 1561 caataacgtg ggagcgtggt agccccgtat tttaaaatcg aagcgaataa tgtgaacatg 1621 ggcggttttt actaccgtta agtctttgtg tgataattag ttgtagggct ggtttcattt 1681 gttacgtgtt agaccatttc aggaaatgcc tgatacaaat gattttctac aattatttcc 1741 ccttgctccc aactccattg ctgttcattt gtatcaacct taaagtaaaa cggtattata 1801 cctcaaccca gatctcaagg tactgggttt cgacacctgc cgcaagttga ttatgaaagc 1861 tgtcatttta ttgtctggtg gtttagactc ttccacggtt ttataccaag cgatcgccga 1921 tggttataac tgccatacca tttcctttga ttaccaacag cgacatcgac gagagttaga 1981 ctcagctttg gcaattgctc aaaaagctgg cgtggtagac cagcaagtgg taaagtttga 2041 tttaagacaa tggggcggct cggctcttac agatgatgcg atagaattac cccagaaacg 2101 tgacttggag gaaatgtctc aaagtatccc tgtcacttat gttccagctc gcaataccat 2161 ctttttaagt tttgcccttg cttatgccga aacaatttct gctcagtgtg tttacatagg 2221 tgttaatgcc ctagattatt ctggatatcc tgattgtcgt cctgactaca tccaagcaat 2281 gcaggaagtg tttcgtttag gaacaaaaca aggacgcgaa gggcaaagca tttccatcaa 2341 gacaccctta attcagctaa aaaaaactga aattatccaa cttggcaata aattgggtgt 2401 tccttgggaa ctcacatggt cttgctatct aggaggcgat gttgcttgtg gtgtctgtga 2461 ttcttgccgt ttgcgtcttg cagcttttgg ggaactggga ttgaaagacc cattggctta 2521 tgcgaactag ggagatgagg aagccacaga ggaagagaca atttccttct ttgtgttctt 2581 ccctcttcac tcctactcct ccaaccgtcg cacctgaatc tgatgtaagc gtggtccagt 2641 gactgagacg acggtaaatt ctagattttc atagcgaaag gtttcgctaa gggtaggaac 2701 tttgtctaac tggtaaagta aaaagcctcc taaagtttgg tattcttttc tcaagggcaa 2761 atcgagatgc aaaatctcat tcaggtcttc caggtttatt tgagcctgga caagaaaagt 2821 ttgctcgtct gaaatcttga tgagcaaatc attagtggtg tctggttcgc cgggataacc 2881 gataatttga gcaatgatgt cttggatagt gaccaatcct acagtcgcac caaattcatc 2941 taccaccatc accatagttg gttcctctcg ctgcatagtt ggcaagagtt cctttaaggg 3001 agtatgttct ggcacaaatc ttgcagggcg catccagggc tgaatttgtg tttctaaagt 3061 cagctttccc acagctaagg gttttgccaa gtctttaaag taaactatgc cgcgaatttc 3121 atccaaagat tctccaataa tagggtagca agaatgacct gtagctatca tttcctggag 3181 aagagtcccg aaggtagcga 
tttttggcaa agcgataaca ctcgtacgag gaatcatgac 3241 tgcttgtact gtctcgtccc caaattcaaa gacgtttttg agtaattgtc gttcttgtgg 3301 ttgcaatcca gtagattcgc gttctgtgga aataatgagt tgtaattctt caggtgtaac 3361 aggcggcttc caactttgac ccgtgtattc aataccaaaa aatcgcaata gccagcgagt 3421 tgattggtta agaatccaga taaaagggct gaaaaacctg acagttgctc tgatcgaagg 3481 tcccaaaaat tttgctagtt gttctgaata gagtatggat actgatttgg gacacagttc 3541 tcctaaaact atttgcaaat aagctatgaa aaaaaaggca attggaattg atagagaatg 3601 agctacgact atactcattc ctataggtag aggtaaagat agtaagcatg attttactag 3661 cacaacgatg gtgctttccc caatccagcc tagtgccaaa ctagagagag taatacctaa 3721 ttgagttgtc gatagcaatc gatccatact gtgttgcaaa acttcaacgg cgatcgcctg 3781 aatatcgcca gcctccacca gttgatggat gcgcgatcgg cgcactgtca ccatagaaaa 3841 ttctgctgtt acaaaaaagg cattgatagc aatcagcact agcactgata acaatcgcag 3901 cccaacatct gtccaactta agccaaaaga agcactcacc ttcgttgccg tataatactg 3961 atgccccatg aatattttat tggtggttag ttgtgctact aaccactaac acctatcttt 4021 ctacgggaat atttggtact tctagccgca gtctttgagc aggataatct gtcagagcga 4081 gtgaaatacg cttgacatta tcaagtaaag gtgtaggaat gctcactgta ccagaaacag 4141 gagatccatt taccggtaat tcagaaggta aaccctctgt actcgcactt aaggttcgtc 4201 ctttatcatc agatacatcc aaaaaactat ataggaagcg tacagaatct ttgcctttgt 4261 tctgcatttg tacttttaga agcaaagcac caccagaaaa gcgagcagat tgtacggaaa 4321 ggctaacacc ttcgctttgg gcgttaatgg gaaatccttc ctgaggcttt tcttgttcag 4381 ctgcgggcgt tgatttttcc tctttcttgg ccttgctggt atttacctgc tcatcatcat 4441 catctttttt tctagatttg gcagctttag ttttgccctc aattcgcgat ttgacaattt 4501 tcaggatttg atcctccttg agaaaagcaa cccctgcgtg ttgctcctta gctgatttag 4561 cgctggtgta tttgctggta ggacgagcat ctggtgctgt aacacctttg agtgctgcac 4621 ttcctaaagt aaatccccaa aatgcactta cagagccagc ccccaacatc agaactagca 4681 aaatcaaagt cagaagtacc gtagaattta gtttcattgc cgacaagtca ttgcaaaaga 4741 atgctggtct atttaaaagt atcgaagtta tttacttcaa tcactagagg agcttaatca 4801 atattagtct ccactactgc cacgtcaaag tatatggtat gaaaatctgg tatgtcatat 4861 taacattgtc aaattctatt gacttgtgct 
ataatctaaa ctcgaccagg gttggccgag 4921 cggttgaggc agcgaactca taattcgcgc taggcaggtt caactcctgc accctggatt 4981 taaaaatggt gagaaaaaac tctttggatt tcccgtaaga gaaaaattac tcaggactga 5041 gcacttattg gtacgatgtc tcagaagtat atttttctca ttatgagtgg aacgcagtgg 5101 agcgtattgt gatgacatat caagccgaaa ttcatgccca accgaatatc agcgcaacta 5161 ctactgattt agtaaccgtt agccaatact tactcgattt actttgtcaa ctagcatata 5221 aagaaggtga ttttgtcctc tcctcaggac agcggagttc ttactatatc aacggcaagc 5281 aggtaacact tcacccccaa ggcgctttgg caattggtcg tattcttcta tctatgctgc 5341 cattagagac tcaagcagtt gctggtttaa cattaggggc tgatcctatt gtcagcgccg 5401 taagtgtggt ttctgcctat gaaaaccgac cgataccagc attaattatt cgtaaggaag 5461 ccaagggaca tggaacaagg gcatatattg aaggtcccag cttgccagaa agtgcaaaag 5521 tggtagtttt ggaagatgtg gttacgactg gacaatctgc gatgaaagcc gttgagcgac 5581 ttagagaggc aggttatacc gtgaatcaag tgatttcact agtagatcgt cagcaaggag 5641 gagccgaatt ctaccagcaa gcagggttgg agtttgaggc agtgtatacg ataaaggaaa 5701 ttcaagagcg atatcggcaa cttgcgaatc aatgactaag gaggagaaat aaactcaatt 5761 gtgtaaagaa atattaaggg aaaaatggta ggcgataagc ctctggcttg acgcactagg 5821 ggttgggcaa acccaaccca agtactgctc acaattgaat caaattgagt gagaccaaca 5881 aagaaaccta acgttttttc tttgttggct gcttcccaaa gccctgtgct gttttctctg 5941 cttgcttaga ggaatattca gtttgagtat cagcagtact gtcaagtaca acgtacttgc 6001 tatttgtttt tctaacttta gcatcttcgt cgttgtaaaa gaacgagaga agggcaaagc 6061 gccgaccact agtgacagga gtggcttcat gaagcagaga gcaagaaaaa attaacgctt 6121 ctcctgtgtt tggacaataa agctgggatc cgtactctgg aaaacgtaag tagccacctt 6181 cataagagcc agtgttgaga ttcagtgaca tggcaaacct tcggtgagct gtacctttgg 6241 ttgtgttatc gcggtggcga ttaaagaagc cttggtcagt tgcttcgtag caagcgacta 6301 ggtggcgttc aaagcgagta atgctgaact gaaatgcctt ctcaatctcc ggcttgactc 6361 tccgaatgat gctttgatta atctgctcta gtagttgagg atttgaaata aaatagtctc 6421 gacgtttttt aaaactgtag tcatagactg tcactgtttt accgtcaact tgtcgcataa 6481 agccagaatc ttgaccacca tctgctaagt acaagtcaat caaattctgg caaaagcttg 6541 gctctagtac attgggtata aaaagaacag gtgcaagttt 
tgttgctaga ctggctggcg 6601 gtatagcagg taagccgtca ataaactgga aaacttgctc tataggatga gcactcgcta 6661 ggacatcaat aacctgtagg actcgtaaat tttcatctaa aataaagatt tttggctcgt 6721 attggaattc actacctgcg ggattatctg gctgacaaac accgtattgg atgctgacct 6781 cccgcttaaa atcccacaag aacttgaaat aagttttatt ctcaactaac tctgccagaa 6841 aactatcatc tgggtctata ctgacaccaa agaatgggac ttggtacgaa gttaattggt 6901 tttggcgagt acagaattga ttgagaattt cacgaataag attattttta gtactcccga 6961 aaaaaaacaa aatgacacga tatccaccaa ctgaactgaa atggaaggtt gggtttgagg 7021 tggagggaag ggtaaaccaa ggaaccggat cgccaacagt aagagacagc ataaagtaaa 7081 ttcccgtact agtatttata ggacttacgc aagaactctg gtgaaactct tatttctatg 7141 cgctaaggcg cacgctacgc gttcgccctt ggcgtgcgct ttgcgcttac atgtcctttg 7201 cgtcctttgc ggtaccctgc gggaagccgc tccgcgtcta cgtttcttta taattttgcg 7261 taagtcctga tttacttgca tacctgaata aaaggtttgc caatactcag gataatcttg 7321 agtattgcta aatacttagt actataacgt gtacgaagca gatcgtactt atcacgataa 7381 gaatagaatt ctgaggctag acaaacaaag cccgcacagg cgggcttaaa cataaaagag 7441 tttgcgtctt tgtgacggat caagtcgaga aacccctcgt ctttgcgcga accttattgg 7501 ataactgttg ctttatcctt tgctaagaac ttctcaagtt ctgtgagtgc atcagcatca 7561 actttggttt gcatggggca gaacttggga ccacacattg aacaaaactc agcagttttg 7621 taaatgtctg ctggcagagt ttcgtcgtgg tactccctcg ctctttccgg atcgagtgat 7681 aattcaaact gacggttcca gtcgaagttg taacgggcgc gggagagttc atcgtctcta 7741 tctcttgcac ctgggcggtg tctggctata tctgcggcat gagctgctat cttataagca 7801 atcaaaccat tcctgacatc ttcaggattg ggcaagccca agtgttcttt tggtgtgaca 7861 tagcacaaca ttgcagtacc gtaccaacca gccattgctg ccccaattgc cgaggtgata 7921 tggtcgtagc cgggggcgat gtctgtcacc aatggtccca gcacgtagaa aggagcttca 7981 gaacactctt ccatctgctt tttgacgtta aactcaattt gatccattgg aacgtgtcca 8041 ggaccttcta ccatgacctg tacgtcatct tcccaagctt tacgagtcag ttgtccgaga 8101 gttttgagtt ccgctaattg cgcatcatca gaagcatcat gggtacaacc aggacgcaga 8161 gaatcaccaa aactaaaaga aacgtcatat ttcttaaaga tttcaatgat gtcgcgcaag 8221 tgcgtataaa gtggattttg cttgtgatga gcgagcatcc atcttgccaa 
aatacctcct 8281 ccacgagaca caatgccagt gaggcgactt ctgactaaag gcaaatgctc aatcaaaatc 8341 ccagcgtgga tagtcatgta atcgacaccc tgctgggcgt gtttctcaat cacatgcaga 8401 aagtcatctg gtgtcagctt ttcaatattg ccgtggacgc tttctaaagc ttgataaact 8461 ggtactgtcc caattggtac tggtgaagct ttgataatcg cagtgcgaat ttcatccaag 8521 tttccgccac cagtggacaa atccatgacg gtatcagcgc catacttcac cgctagattc 8581 agctttgcga cttcttcctc aagattggaa gagttcggag aagcgccaat attggcatta 8641 accttacact tggaggctat accaatacac attggttcca agttcgtgtg gttgatgtta 8701 gcagggataa tcatccgtcc ccgtgctact tcgtctctaa caagctcaat gggaaggttt 8761 tcccggtggg cgacgtgatg catttcttcg gtgatgacac cctgacgggc atagtgcatt 8821 tgagtaacat tactctgccc acgccgcttg gcgacccatt ctgtccgcat atttgctttc 8881 ctcgataaac agcttccctc cgccggtatt acccggattc aggtgttaag ggtgtaatct 8941 cagcctatgc aggcacccct agcatgaaaa tagattttac atcttgccta gggctggaag 9001 caaggaagtt gacaaatttt gaagtgagta cggtggtggg gttgggtaac tatcatatga 9061 tgtcataata gacatcatat gaatcatcaa actcttgatg gctgtatatt tgttatagct 9121 gtatggaaaa tcaaactgtt cgcacaacat tgagcatacc agttgagtta ttggaagcaa 9181 ctgatcgcgc tgtaagagaa ggaaaagcaa agagtcgcaa tgattttgtt gctcgtgcgt 9241 tacgtcatga actagcggtt caaaaacgag ctgagattga tgcagctttt gcaggaatgg 9301 gtaatgatgt tgagtgtcaa gcagaagcag taatgatgag tcatgaattt gctaaatctg 9361 attgggaagc gtttcagata ggtgaaactc agcagtgagg agaggagaag tttatgatgc 9421 ttgtctcgat ccaactgaag ggtctgaaca ggcacgaacc cgacctgtca ttattgttag 9481 ccgtgatgct atcaactctg ctagtcaagt cgtgctagtg gttccctgca caaattatcg 9541 ccttggaagg cgtatttatc caagtcaggt tctcattcac gcgcctgatg ggggtttaga 9601 tagggattca gtggcaatgg cggagcaagt tcgtgcttta gctaaaactc gttttttgta 9661 tttgcgtggg atgctttcac cgttatcttt gcagcaattg gatcaagctt tgctaatagc 9721 tttggattta cctgggcaag ttgagtttga ataatttggg gcgatgccat gaaacctagt 9781 aatgccgaat ttgttgctca acttcaagca gccacagaag gattattatt tggtagtgaa 9841 aattcttatc cttttaaaac ttttgtattt gaagttgtaa atcaaggaga ttttacagta 9901 gaaaatttgc tacaaaccgc tggttttatg aaacctgtaa atttagatga tttcttacag 
9961 tttgtttcag aaatagcgcc agaaagtagt caaaactatc aggaaataat aaattttctg 10021 gaattataca ctacatcatc acaaatatat cgaataagtc ttgaggatga aagtggagaa 10081 tacgaggctt ttcacatact tgtaggaaat accaaggacg gagattggat tggtatatct 10141 cctagaattg acaatgagcc tagtgcaaga cgttcagaga aatttttgat ggaaagtagt 10201 actttagtaa aagaaagcac tttcaagtta aaaactaaac ttgaaccatt actagcaaaa 10261 ttaaaattta ttgttaccga atactatgaa aaaaatcaag aaaagcaagg ctttgttttg 10321 gaaatagcag atactagggc agtgatgatg aagaaactgc tagattctac aggatttgtt 10381 aagacttgtg cctttaaagg atttagtgaa aacgcacaag aaaacgatta cccagatcgg 10441 gagtactttg aacaatttaa gcctttggat gagttacttc aatcacgtct aacaaactta 10501 agagagtatg taattggcgg tatggctgtt tattatcttt atgacattgg tcaaacacca 10561 gatggaaatt gggtaggtgt ttggacaata gctatctgga cataagattt gaaggcgatc 10621 gcaaggcgca agggcgctac actcatcact catgactaac tatcgataca ttctgtttta 10681 caaaccctat ggtgtcctca gccagtttac aaaagacact cccactcgta gcaccctcaa 10741 agactatatc gacattcccg atgtgtaccc tgtgggtcgt ttggactggg acagcgaggg 10801 gttagtccta ttgactaaca acgggcaatt gcaacatcgc ctttcccacc ctcagtttgg 10861 acacgaacga acttactggg tgcaggtaga acgaattcct gatgctgcgg ctttgacgca 10921 gctacaacaa ggtgtgacga ttcaagatta ccgtactcga caagtaatgg tggcactatt 10981 gccaacagac cctcctttac cagaacgcga tccgccgatt aggtttcgca aaaatgtgcc 11041 gacagcgtgg ttagaaatga ctttgacaga gggcaaaaat cgtcaggtca gaagaatgac 11101 ggcggctgtg ggatttccca ctttgcggct ggtcaggata agcatcgccc acttacattt 11161 agatgatttg caaccaggtc aatggcgcga cttaagctca agcgaactca agttgttact 11221 cgatttagct tttgcaggct caagaaagtt taagagcaat agaacaacgc ttttagacta 11281 taactaaagt atgacactag ttatcattgt agagcctatc atcccaaatc tagttacctt 11341 ctgagtttcc taaacctaaa gatatgtgtc ggaatatgcg acaatgccta gttttcatgg 11401 caattttttg tcaaaatata gcttgccaat atctcaaatc tatagttagt atgcgtctaa 11461 attaagactg ccactaaaat tcgccagatg aatcacaata tattgtagat ctactgtagg 11521 caggtgctaa caaatgtggc acttttggat tgtagtggct aaaagcacct tggaacggga 11581 attaaggaac ttccacaatg ctgatttgcc ctcagtgtaa 
atttgaaaac cccaatgcta 11641 acaaattctg ccaacgctgt ggcgcttcac tgactcataa ggtctgtgga gagtgcggca 11701 ctgatttggc tttaaatgca cgagaatgtg ataactgtgg cgcagcatgc ggaagcgttt 11761 tgtgggcaat tattacaaaa gaagagagtc ggcgagttgg ggaggataag gaagattttg 11821 cttcctcatc tttctcatct gccttatcac caggttctta cttagactca caaaagcgtt 11881 atcagctgct agaaccgcta ccagccttac aggaaattgc tcctaacact caagtctcgg 11941 tgagagttct agattgccaa ccataccaag tctcactcct tgaggttatg ctgacaaatc 12001 agcaaaaggg actagtcatg gcatcagttg gggttgataa gattcccagt ctagccaaac 12061 cttatattgc tttacgatca tggtgccatt tgggaatacc gcccattcat gatgcttggc 12121 agcaggacga catgcaggtg gtactcatcg aagaccgctc gaattggcag gaattagttg 12181 atttgtggca ggatgattca ctaagctcgt tacaaattgt acattttttt tcccagatga 12241 cccaactttg gtcagtgctg gaaaaagctc actgtcgtca aagtttgttg gaattgtcca 12301 atctacgagt tgatgaaaaa ctatcattag cactacaaaa attgtacgta gaatcaccag 12361 acaagacagc tactaacttc ccagaggata cacaagcaac actaccagag acgacagttg 12421 tgcagccttt gacaatccaa gctttagggc aggtttggca cgcacttttt agacagtctc 12481 aacgcactca atttggctct gttttacacc tgatcggaga gttagaacta ggaaatttag 12541 agaccctggc gcagctgcaa tcatctttgt cagacatcgt gtctgaattg caaggcaatt 12601 ctacaagcgt ttcaattccc tctacattat tcaaaagtaa tgctgcaccc actgtcttgc 12661 aattggatga ccaagaggat gagactgaat tcaaaagcga tgataagtcc atagctatgc 12721 tgcctatgca gttaataagc ctggaaaatg caggacttac caatgttggg cgtcaacgcg 12781 accataatga agactatttt ggtattgata caaaagtata caacctagaa tcacccaaca 12841 cccatacttt gcaggcgcgt ggtttgtata ttctgtgtga tggaatgggc ggacacgcgg 12901 gtggcgaggt tgctagtgct ttggctgtca aatctttgcg cgagtacttt caaacccact 12961 gggcttctaa tcaactgcca acagaagata atatccgcgc agcagtgcgg ctagccaatc 13021 aagcaatttt taatgttaat caacaagacg cccgttctgg tgttgggcgt atgggtacga 13081 ctgtagttat gattttaatc caagatactc aagttgcggt tgctcatgtg ggagacagtc 13141 gtctctaccg cgtcactcgc aaaagaggac tggaacaagt cacaatggac catgaagttg 13201 gtcaacgaga aatttctcgg ggtgtagaac gtagtgtggc atactcccgc cctgatgctt 13261 accaattgac tcaagccctt 
ggtcctcgcg atgaacagtc tatcattccc gatatccagt 13321 tttttgagat caatgaagat accctttttc ttcttgcttc ggatggttta tcagataatg 13381 atttgctgac tcttcattgg caaaactact tagcaccttt gttaagtcat tctgccaatc 13441 tggaaagtgg cgttcaagct ttaattgatt tggcaaacca ttacaatggt catgacaata 13501 ttactgctgt acttattcgg gcaagagtgc gccaaagtat aaaccaacag caataattgt 13561 aaattacaga caagggagtt ggaggagcaa cagagtccga gagtaacaga gtccgggaaa 13621 cttctgttgc tttctcactc cttctcctca actccccttt gccactttct actcccttct 13681 cctattaaaa attacctatg gtcacgctga ccctgttaga accgcaacaa aatatacctc 13741 tccagcagtg gcactttgac gacgaatctg tgattcgggt tggtcgttca gcagataatg 13801 acgtggtttt aagtgatagt ttggtgtcac gatatcattt agaactccga caagttgatt 13861 cggataaaaa tggcggttct tggcaggtca ttagtcaagg cacaaatggc acttttctca 13921 acggcgtttt aatcactcaa acttcattgt cagataactc tgtgctgcaa ctggcacaaa 13981 aaggtcctat tctaaaattc caaattcaca agcaaaccac tccaccatat tctccaactc 14041 ccgcgttacc ttatacaagt tgcactcacg aaggaaactc gcctaacaac ttgttttgca 14101 tccactgcgg tcaaccaatg tctatcatgc agacaattcg ccaatatcag gtattgcgaa 14161 ttctgggaca gggaggtatg ggtactactt atctagcttg ggatgggcta ggaacaatag 14221 ctggacaccc acaattgttg gtgttaaagc agatgaatgc tgatatggca aaaattgcta 14281 aggcacaaga attatttgaa cgagaggcaa atactcttaa atccctaagc catactggaa 14341 ttcctaaata ttatgattct tttgtcgaag gtgggaagaa atatttggca atggaattaa 14401 ttcatggaca ggatttagaa aaacttgttt attcaaaagg accagttgtc ccaaatcaag 14461 cgattaattg gatgattcaa acctgcgata ttctggaata tctccatagc aaagaaccac 14521 ccctgatcca ccgggatatt aaacccgcta acctcatggt gcaaaactgc aatgatcgta 14581 tagtggtgct ggattttggc gcggtcaaag agattgccac aacgcctggg actcgtattg 14641 gtgcagaggg ttattgtgcc cctgaacaag aacggggaca acccgtcacc caatccgatt 14701 tatatgccat tggaccgaca cttatttttt tgctgactgg cgaaagccct tttaagttct 14761 accgccagag agggcgaagt ttccggtttg aagtcgcaaa tgtgcctacg attactcccc 14821 agttacgaga agtgattgac cgcgccacgg aacctttgcc gtgcgatcgc tatcaaacgg 14881 ccaaggattt agtggcggct ttagcagtgt gtaaagtcta gggaaaaggt ataagaaaca 14941 
ggaaatagag actttattgc cctacttctt attcttcgtc ccaagcttcc acagctagca 15001 attcgcttac tgggtcttgc atactaaacc cgtagtctag caattcttgc ttccagtgct 15061 gccattcatt gccgaagagc aaagcgattt tccagatatt atctgtgggc ttgataatac 15121 tagaatctac gagtgaacgc acattacgct gcaacttcac cattgggtga ataacttgct 15181 gcttactcat aacctcgatt aaattcagat ttggttgggt aaatgcttaa tcaaaactgc 15241 tttccgtatt ggaattcttg ccgatgcgtg gagcttttta ctttggcaaa gcaagtctca 15301 actttctaac tatatcataa cttaactcaa gtgagtgact atttcattcg cttttgtacg 15361 gtaatcccca cctgcaggag ttacattttc ttgccagaaa gtcattactt ttgacagatt 15421 taaaacaata gccaccctac atcttgcaca aagagcacgt tttgggcacg ttttgaggat 15481 atatgcaaaa tgttattttt ttctttcacc atgatgtgtg caggttttaa aaaacttata 15541 taactggagg ttttttgaag gttttgacgg aacggtgaac tatgcagtta gctggagtta 15601 tgccgaatta ttcagttata tacaatatcg taaacatcta gcattatata gagattagaa 15661 aaatttgtag aaaaaattaa tattttgagc agaggcaaag actcggaaaa atacgctcca 15721 aaagaaagaa gacaaaaaga gcacacgtaa ataagatttt taaatggtat aaaattttta 15781 gaaaaaagtt gtcttgctag aattcaaatc aaaataaaag tgggtagttt ggactaccca 15841 cctttatcaa cggttctatt agagattaaa acagctaaat ttttttacgg ggactgattc 15901 ggtggaggta tcatttaggc tgctacttct tcagttccgt gtagttcggc gggcattttg 15961 gttgtgctga ggttccgttg gttgtgctac ttggctcctg cgtgcttcct acgccaatat 16021 tatcagactc acacaagatg gctaaagttg ctccttcttg ttctgataaa ataacaacac 16081 ctgcgtagga tttgaggcca tttttccgag aaataccatt attggcaacc tgattggtat 16141 ttactttaat ttcgtacttg tagttctcgg tttcgggttt aataccaatc cccaacttac 16201 ttgtatcaat tacgaactga ccattttcca agtaataagc ttgttgtgct cggttcattg 16261 aacccacata ggttttagcc tctgattgtt ttgctttgtt cgcttggcta aggaacgagg 16321 gcaaagcgat agcagataga atacttataa taataattac tactaacagt tccatgagtg 16381 tgaagccttc gtccttcttt tttttcttaa gaacgtgctg tagtaattta acttttcttt 16441 ctatacccat cagtattatt ccgtaactag tcactttttt ggttacaaac agaactttct 16501 cagttcttcc cagtttcaaa tcactcctca ccgaatgctt tggagagggg taatggggta 16561 cttggtgagg taatttcgcc gcaacttggt attagtattc ccagatcgga 
attttctata 16621 acgccctgtt tggtatgcta tactaagtca tagtcgttat gagtttatat ataaaagcat 16681 gactaacccg acagtagaaa acgtagtcat tatcggttct ggtccagcag ggtatacagc 16741 tgccatttac gcaggacgag ctaacctcaa accagttgtc tttgagggtt tccaagccgg 16801 ggggttacct ggtggtcagt taatgacaac gactgaagtg gagaactttc ctgggtttcc 16861 ccaaggaatt accggacccg aattgatgga taaaatgaaa gctcaagcag aacgctgggg 16921 ggctgagtta tatacagaag atgttgttga agttgatttt agtcagcgtc cttttacagt 16981 ccgttcagac gaacgggaat tgaaaaccca cagcattatc atcgccacgg gtgcgacagc 17041 aaggcgcttg ggtttagaca gtgaacatca attctggagt cgaggaattt ctgcttgtgc 17101 gatctgcgac ggtgctacgc ctattttcca cggtgcagaa ttagccgtgg ttggtggtgg 17161 cgactcagca gcggaagaag cgatttactt gaccaagtac gggtctaaag tgaatttgct 17221 ggtgcgaacc gagaagatgc gggcttccaa agccatgcaa gaccgggttt tgagtaaccc 17281 caaaatcacc gttcactgga acacagaagc agcggatgtg tttgggaacg acaagcacat 17341 ggaagggata aaaatccgca acaccaaaac tggcgaagag agtaaactgc acgttaaggg 17401 attattctac gccattggtc ataaccccaa cacatcctta ttcaagggac aactagaact 17461 tgatgacgtg ggttacattg tcaccaagcc tagttctcct gaaacaagcg tagagggcgt 17521 tttcgctgca ggcgatgtac aagatcatga ataccgtcaa gcaattaccg ctgcaggtag 17581 tggctgtgcg gcggcgatgt tagcagaacg ctggttgtct tccagtggct taattcagga 17641 attccatcaa aagccggaaa caccagataa cgaactcgaa cagttagccg agaagaaaac 17701 tgaagaacag ctagccgctg aatttgattt gaatgcaacc cgccatcagg ggggttatgc 17761 tttgcggaag ctgttccatg atagcgatcg cctactcatc gtcaaatacg tctctcctgg 17821 ctgcggtcct tgccataccc tcaagcccat cttaaacaaa gtcgtggatg aatttgacgg 17881 caaaattcac tttgtcgaaa ttgatatcga caaagaccga gatattgctg aaaatgctgg 17941 tgtcacagga acacccacag ttcagttctt caaagataag gatctgctca aggaagtcaa 18001 aggcgttaag caaaagagtg agtaccgcca attgatttca agcaatttgt aggtggtgag 18061 tcgaaaaaag aaaggagatg aagcaatttc atctccttac ttttccttac accctatcct 18121 ctgctagctc gaaaacgact tcatgttggc tagtaggctt ttgctgacgt aataaccgaa 18181 ggtattcttc tgcatcaacg cgactccggt ggcggctgac gactaccttt tgtgacggag 18241 acactgaatt gataaccgcc cagggtttga 
ggcgttcttg ataagccata atgagattat 18301 cttcttcaaa aatttaggag gcagagccag cgctgtcact ttcggtggtt aggtactggc 18361 ttctgctact ttaaatgata ttattgccat gttaaataa // LOCUS NODE_1789_length_18328_cov_4.600558 18328 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18328) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18328) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18328 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 404..616 /locus_tag="DP116_15315" CDS 404..616 /locus_tag="DP116_15315" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15315" /translation="MLNLTYKTSQNQKVEFIESGLVIEEFGRLNNYAIESKKYVMDEP RFSFTQYLEKRAQTFPSWLRLTPDQA" gene 730..910 /locus_tag="DP116_15320" /pseudo CDS 730..910 /locus_tag="DP116_15320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015196844.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 1051..1248 /locus_tag="DP116_15325" CDS 1051..1248 /locus_tag="DP116_15325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131225.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycobilisome degradation family protein" /protein_id="PRJNA477356:DP116_15325" /translation="MDQDIELTLEQEFSIRCFTDQVQQMSREQAQECLILHYKQMMIR EMMYQEILKQQWKLDMDFASL" gene 1320..1499 /locus_tag="DP116_15330" CDS 1320..1499 /locus_tag="DP116_15330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_15330" /translation="MSAKTNIISSVNNKRNAWVWGFTPQTELWNGRLAMIGFISAVLI EMFSGQGLLHFWSIL" gene 1837..3738 /locus_tag="DP116_15335" CDS 1837..3738 /locus_tag="DP116_15335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division protein FtsH" /protein_id="PRJNA477356:DP116_15335" /translation="MPIKEQPTPTPNRLIGNILLALAALFLITNFILPFVLGPQIPGV PYSLFIHQVQAGEVERVQVGQNQIQFQLKTVDDQAGSVFSTTPIFDLGLPKLLEEQGV EFAAAPPPKNGWFTSLLGWVIPPLIFVAIWQFFIRRGGGGPQGVLSIGKSKAKVYVEG ESAKTTFADVAGVEEAKTELVEIVDFLKTPGRYTQIGARIPKGVLLVGPPGTGKTLLA KAVAGEAGVPFFSISGSEFVELFVGVGSSRVRDLFEQAKKQAPCIVFIDELDAIGKSR SSGGFYGGNDEREQTLNQLLAEMDGFAAGDATVIVLAATNRPEVLDPALLRPGRFDRQ VLVDRPDLSDREAILKIHAQKVKLGEDVNLAAIATRTPGFAGADLANLVNEAALLAAR NQRSYVAQEDFAEAIERVVAGLEKKSRVLNDKEKKIVAYHEVGHALVGSLTSGNGRVE KISIVPRGMAALGYTLQLPTEDRFLMDEAELRGQIATLLGGRSAEEIVFGSVTTGAAN DLQRATDLAERMVTTYGMSKILGPLAYQQGQQPMFLGDGMPNPRRMMSEETAQAIDRE VKEIVETAHQQALDTLKLNRDLLEAIATQLLETEVIEGEKLHSLLRQVQTVNNLTVSD RPNLISGVR" gene 3738..4223 /locus_tag="DP116_15340" CDS 3738..4223 /locus_tag="DP116_15340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="allophycocyanin" /protein_id="PRJNA477356:DP116_15340" /translation="MSIITKMIVNADAEVRYPSLGELDQIKLFISSSERRLRLVQALT LSRDRIIKQAGNQLFQRRPSLVSPGGNAYGEEMTATCLRDMDYYLRLITYSVAAGDTT PIQEIGIVGVSQMYRSLGTPIDAVAESVRAMKNITTSMLSAEDASVVGTYFDYLISAL Y" gene complement(4380..6950) /locus_tag="DP116_15345" CDS complement(4380..6950) /locus_tag="DP116_15345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315456.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dynamin family protein" /protein_id="PRJNA477356:DP116_15345" /translation="MNTDMAIEKLRRYKEYGESVIQNLELVSHNPPPQESWVPSSLYE SLQRLQQSAQRTVQLASSPVKIGIMGEFSSGKTLLLGSLIGYADALPVSETPTTGNVT AIHLVQQSDFQTTKLGKFTVEYLSNEGVKECLRFMLEETQQRAKAAELSSAQLATLKS LHPTREVDINGILRWCEQAWKPQSLELRSLLKELVAFVRTYSVYKEDICGKSYQIDNT TAHKGLRLAEPPTNILELSFEDLSPAPKRWENLAHPSAQDLLNSFSLIRRIDVTVEVS KEIWDLSSLQGTNEFILLDFPGLGSADSGVRDAFLSLRELKDVQTILLLLNGRYPGGA TAAKIRSILERDKGEDLRDRIIVGVGRFNQLPLSSREEGMIDDLLDEMLLEEEAVFDS LSILKLAIASANNLTTEKKNIVLLSQLYGLTKLAELSSLVQVCSKEFLSELDKPNKLE EVQLREKWQKLSEMLPPSSTLHKQLSDYAEDGGIGRLRSLLKEHVALHGMKQLVEDTQ RAAEALRKEQNHLKNLLEEIPAYIPVEESSAFFTLREAIENVVTTYRRFQEDLEKQPI LKNRHSVAVSDVVKNELTNKIFFDWSEWTLLFDRTQNGTISLSKGESFFEEDEVDDSI PTKSDDFYAAFVNTIQEMQAFAHEQTTEAVTDLFNKLSADVQLERENLSTILLPEKEQ QIEQKFGKSQVRFFRTLQRAVDPAKKWQNLIIQHSGLASRATSINADTLFPLARKDDK HQNGQIFDWSPDKQFRVPPRPFNHQIAVLRLRDEITASAGLHLVQYVSQLTKQVKSSF SRALKEIVDNLQELLKSKHEPLLRHIASAQEQTSTPVPPWLEILSQIPTISYPQEMSN " gene 7313..8650 /locus_tag="DP116_15350" CDS 7313..8650 /locus_tag="DP116_15350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_15350" /translation="MQPVPTSFTNIAQRLAALSQARASGELIFSSGAKAWHLYFFLGR LLYATGGVHRVRRWHRALKQDCPNLKFDAHDLKENELWEYHLLNQSINKNQISLTSAK SIIEKIVEEVLFSLVSHSDLKSLWLPKKLYPIALIEVNQSLCTAFDLYKQWRNMGLGT LCPDMTPILKQPTSEQNLKFSKILFSLRQLIDGENTIWDIALHTRQSVITAASLVQNL VYEGVLELLTIPDLPIPVGATISAPSSEKQTQPTYDIVGNTTQTPSSQSLPNPLTKTD FHQTPELMEASKDNTRPQSTFDSAKIKLIQPKETTSIVAYIDDSQSDNLIMSEILKLA GYKYVNIQDPVTALPILLECKPNLIFLDLVMPIANGYEICTQIRRVSAFKETPVIIVT SSDGIIDRVRSKIVGSSGFLAKPITKEKVLKVLQKYLLTPVPVQSTHLQTLQV" gene 8804..9196 /locus_tag="DP116_15355" CDS 8804..9196 /locus_tag="DP116_15355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748137.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_15355" /translation="MTTVLLVEDSLTETEVFTRYLKQAGLTVVTAISSEEAQLKLQFQ TPDLVIIDVILPGQSGFELCRELKTNANTQQIPVVICSTKGTEADKLWGTMLGADAYI PKPVNQQHLVQTIQQLTSQLQPWRRQII" gene 9342..9872 /locus_tag="DP116_15360" CDS 9342..9872 /locus_tag="DP116_15360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315452.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chemotaxis protein CheW" /protein_id="PRJNA477356:DP116_15360" /translation="MQTEPKSVLAVTSFEPLQLNPLLPETNLSKLLRFPLGFADSGLL PLEQIAEIITVNLASILPVPEMPSCILGICNWRGEMLWLLDLNHLVGYPLLTRGVTPV AIVVKVNEQAIGLVVPQVDDIELHDLQQLQKAAAGLFPPKLLPFVLGALPNGSTVLDI TAITQYPLWQIHSTKN" gene 9949..14073 /locus_tag="DP116_15365" CDS 9949..14073 /locus_tag="DP116_15365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chemotaxis protein" /protein_id="PRJNA477356:DP116_15365" /translation="MVLQDFEQTKQIFDFQSQNGDVETAKFIGYSELHQASELVNAIK ADLQQIGDSLKPEAQQKVLQLEKCVQQLQTEYQAHLAVACEQHTKQTRQRLFTILAQM RQAASVDTLLRTTVTEVRKLLQVDRALIYRFQSEKHGIVIAESMVSGYTPSLGESLSA IAFGHENQQNYKEEQVLIQDDVYEKVKSAYQHQLLKHFQVQASLSLPMIIDGQVWGLL VMQQCSTSRQWQEAEVSLLYQIVTELRLNLQTIELLTQHKEEAKQEKILAQILEKIPP SSDTSVTLGNITQELRQFFKADRVVVYRFYPDWSGVFIAESVASGWVALMQEQEKDVS LKSKNKVDYDRCTVNMLGNLTSLDIDTYLRDTQARDLVKSKAVKRVDDIYTAGFSDCY ITTQEKYQSRAYIIAPIFEGGELWGLIGVYQNSGPRRWQDSEVTLLSLVSHRLGAILK QADAAAVLKLKSEQLAAERERIVADAIDRIRQSSDIHTIFKTTVYEVRNLLKAERVVI YRFNLDWSGEFVAESVSSEWKSIMQEQFDNPTLPQNVSECSAKILGGGKASITDTYLQ QTQGGRFSTTTNFRVVSDIYQAGFLPCYIDTLERLQTRAYVITPIVTGQRLWGLLAAY QNSSPRDWEEHEIKVMTQISNQLGIGIQQAEYLKQLQEQATVIAKAVERERGIAKVIE KIRQTSNIDTIFRTTTQEVRKLLGTERVTIYKFRPDYFGDFIMESESGGWPKLVGSGW EDAYLNEHQGGRFRNNEPYVVDDVYNANLSDCHIESLEGFGVKSCVVVSIFQGQKLWG LLSAFQHSGPRHWEDGEVKVLAQIGSQLGVALQQAEYIEELRVQSQKLAKSVDQGTLY SKLVYRLGLALIQENFSLDNLLKMAVTELRRLLKADRVVIFRFAPDWSGEFIVEDVGS DWVKLVGSELSLADDTCLRETNGGRFRRRETLSTDNLRASGYSECYIKLLEQWEAKAN MAAPIFKGDQLWGLLGAYQNDAPRHWEQIDVNLLAQVGVQIGLALQQAEYLEQLRTQA QQLTEASERERAAKEQLQREVIQLLSAVQPVLQGNLTIRVPVTEGEVGTIADVYNITL QSLRKIVMQVQTASRKLAQTSQANESSIVGLATQAQQQFGSLTHALEQIQTMVNSTKA VTTNAQQVEEAVQVANQTIQQGDAAMNRTVDGILDIRETVAETNKRLKRFSESSQKVS KVVNLISNFTTQTQLLALNAAIEATRAGQYGRGFAVVADEVRSLARQSAEAATEIEQL VQEIQKGTAEVSMAMENAIRQVATGTTQVHEARQNLNAIVAATTQISQLVEGITQATQ VQSQEIQSVTQTMTEVADIANNTKEDSMEISTSFKEILAMAQNLQASADQFKVD" gene 14197..17319 /locus_tag="DP116_15370" CDS 14197..17319 /locus_tag="DP116_15370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315450.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_15370" /translation="MTHDPTIQEQSHRYFLQEAPELLQVIEQELFTLREDFSVNKVYT LMRATHTLKGAAASICLETITTVAHTLEDVFRAICKPDLSIDQEVEALLVESYECLRL LLTAELTGGSIDDNDILDRIARIFAQLQEKLGDCFAEEAYLPDSQELGFDLTQSIFEV GVTQRLEELATTISSGHPEQVATTLRVQAEIFFGLAESLNLPGFKAIAQTALTALDNY PEQAMVVGYVALANFQEARKAVLNGDRSQGGEVSLALQKLGERLHDVPQVSKNAPVKA QESVRQRERGGFPKTNPGKVTQTTPDTESDTEESANDHLTTHQPDSFAPSAVEFGAKE SEEVTLTEEQEPVRTSPHLLISSPPQPAISSSTRTVRVNVGELEQLNYHVGELLTNQN RQFLQNEQQLTVVRVLLLKLQQHQQLLHQLQDLSQRQFSVPEQQWLFRNGQDKGYFDS LELTTSYTEFHHLVQSLLEDMVQLGETTDAIEMFTRQSQETLEKQRRLLTHTHDSLME ARMLPLGHLFERFPRILHQLEVIHKKQVTLNFRGSDTLVDKAVVEKLYDPLLHLLRNA FDHGIELNSIRQKRGKPEKGQIEISAYQQGKYLVIEVRDDGQGLDFETIRAKAVERQL VSPEQASNLNEAQLTEFIFQPGFSTASNINDLSGRGIGLDVVRIHLQTIKGSVEIYSQ FSQGTMFRLQVPLSLTIANLLLCQAGNQIYALFTNSIEEILIPKADQIRSWEEGKVLQ WSQDGLMKLIPCYQLTKVLNYFSSVTQPSVFSTKSNGISHKQEKPIILIRSQDKLFGL EVDQLIGDQELVIRPLGAMIVPPPYIYGSSVLPDGRLTLVLDGLALMEYLSKRQKQDD SDWGRNSALWGERHFVSPTPLMFTSRIEQPRLLSQASTARTEAPTLSHHKPRPKKTIL VVDDSITVRQTVALTLEKADYQVLQAKDGYEAIELLQGHTDIHLVFCDIEMPRMNGFE FLKNRQQDPALANIPVVMLTSRSSDKHRQLAFKLGANAYITKPYLEHILLTTLRDVFE KDTGVVARRE" gene 17329..17784 /locus_tag="DP116_15375" CDS 17329..17784 /locus_tag="DP116_15375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315449.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15375" /translation="MHNESQLDKFLVFKIADYLLALPITDVLKVINFSSANSKSLPTM GLVQIGQYIIKILDLHQHLGSDTFPHSSDHPFFLVVAHHSQQELCGILIDEPPDIVEL LPETIRFLPRSSNHSKPLIEMVSHVAVLSEQEVVKTIFLLDLKRIAEPV" gene complement(17946..18191) /locus_tag="DP116_15380" CDS complement(17946..18191) /locus_tag="DP116_15380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_15380" /translation="MSGQALGTVNMPRLSKNSQGEAVKILQLLLDNFHGYSIAVDGIF GSNTENAVKDFQNSRDFVNDPPGVVGYQTWEALANRQ" BASE COUNT 5314 a 3972 c 4030 g 5012 t ORIGIN 1 ggggtgtggg gtgtggggaa ggaaactatg gcgttccgcc acgctacgct atcagttctg 61 ggtttaaagc ccagtttaaa aaagaattgc tttgagtggc gcagcccgat aaagcaaggc 121 gtccttgttt caatcccacg ctttaaagcg tgagtcccta caccctctga aataggttga 181 ttttttcacg ccgacttact tacgtctcta caagggtttt gaaaaaaata aaatcatttt 241 catacattca atcagcaacg ccgcaaatac catctatggg acagcaatat aggtagttgc 301 tgccccacaa ctgtctggat gtattttata tggggtagag ccgaaaattt atatatcatt 361 atttatttta ttttgcaaga aaagttgcaa tttcgtaaca aaagtgttaa atttaactta 421 taaaacatca caaaaccaaa aagtagaatt tatagaaagc ggtttagtga ttgaagaatt 481 cggtaggctg aacaactacg ccatcgaatc caagaagtac gtgatggatg agcctcggtt 541 tagctttact cagtatctag aaaaacgagc ccagacgttt ccctcctggc tcaggctcac 601 gcctgaccag gcataattgg tctgttgact ggcttgtagg aaattgaact tcaccatata 661 ccctgaacac ttttgcaaaa tataaggata aaaccaatga gaactaatat tgccttaact 721 aatccctctg tgttcagtcc cactgatgtt gacttaacta gtccctctgt gttcagtccc 781 gttgtcttca aacccggttt caatcacttt gataaaatca acgcaactca agcctggtca 841 ctgtttttca ctcttcggca gagaagataa agcacttgga tttagcccta gagtcggtcg 901 tttggcatag gtttaagttg ggggtgaacg cccgttcgcc tctacagccg tccgggcggg 961 ctagctttac cccgccaagt tcacgaaaag gactactagt gcaacatcga actgatagtt 1021 
ttcaccctaa ttgagttaca ggagccaact atggatcagg atatcgaact aactttagag 1081 caagaattta gcatccgttg ctttaccgat caagtgcagc aaatgtctcg cgagcaagct 1141 caagaatgct tgatcttgca ctataagcag atgatgattc gggaaatgat gtatcaggag 1201 attctcaaac aacagtggaa attagatatg gatttcgcct ctctgtaaaa ccaccgaagc 1261 aatcctgact ttgcacaact gagtcattcg ttacagtcac aaaacttgga gtattaacca 1321 tgtcagctaa aacaaatatc atctcctcag tgaacaataa gcgtaatgct tgggtttggg 1381 gattcacacc tcagactgaa ctctggaacg gtcgtttagc gatgattggc ttcatatccg 1441 ccgtcctgat tgaaatgttt tctggtcaag gcttacttca cttctggagt attctgtagg 1501 tcagcatttc ctgctcaacc aatgtacgct gctgaagaag caacaggtta atctgaactt 1561 cgttttgagc ttcacagaaa aagtcaaacc ttggagttcg agatgcttgt cttaagtttt 1621 atgccgtact taatcaaata accctagaaa tgaaacagca ccctctcatg ctgtagggaa 1681 agaaatacta aattatcacc ttgtgctttg ttcatttctc tatggggtta tcgttttttt 1741 gcactcgtta catgtgaaat ttatcatcca aacctgagag gtgctgctgt cttcaacacc 1801 cacaaataaa acgtctcaaa ctatttggtg atagtgatgc cgattaaaga acaaccaact 1861 cccactccca accgcctgat cggtaacatt ttgctagcgc tggcagcttt gttcctgatt 1921 acaaatttta tattgccttt tgtcttaggt cctcagattc ccggtgttcc ctatagcctg 1981 ttcatccatc aggttcaggc aggggaagtt gagcgagtac aagttggtca aaaccagatt 2041 caatttcaat tgaaaacggt cgatgaccag gctggatctg tgttctccac tacacccatc 2101 tttgatttgg gattgcccaa gttgttagaa gagcagggtg ttgaatttgc cgctgctcct 2161 ccgcccaaga atggatggtt tactagtctt ctgggctggg tgattccacc tctgattttt 2221 gtggcaatct ggcaattttt tattagacgg ggtggtggtg gacctcaagg cgttctctcg 2281 attggtaaga gtaaagccaa agtttatgtg gagggggaat ctgccaaaac gacctttgcc 2341 gatgtggctg gggtagagga agccaagact gagttggttg agattgtgga tttcctcaag 2401 actccagggc gctataccca aattggggcg cgcattccca agggggtgct gctggtgggt 2461 ccgccgggga ctggtaaaac gcttctggct aaagcggtgg caggggaagc aggcgtgccg 2521 ttcttcagta tctctggctc tgagtttgtg gaactctttg tcggagttgg ttcctcgcgg 2581 gtgcgtgatc tgtttgagca ggcgaagaag caagctccct gcatcgtgtt tattgatgaa 2641 ctggatgcga tcggcaaatc tcgcagcagt ggtgggttct acggcggcaa tgatgaacgc 2701 gaacaaactc 
tcaaccagtt gttagcagaa atggatgggt ttgcggcagg agatgccacc 2761 gtgatcgtgc tggcagccac aaatcgccct gaagtattag accctgctct gttgcgtcct 2821 ggtcgatttg accgtcaagt tttggttgat cgccctgatc tgtcagatcg tgaggcaatt 2881 ctcaaaatcc atgctcagaa ggtgaagctg ggtgaggatg tgaatttagc ggcgatcgcc 2941 actcgcaccc ccggttttgc cggagcagac ctggcaaacc tggtgaatga agcggctctg 3001 ttagcagccc gtaatcaacg ctcctacgtg gctcaggaag actttgcgga agcgatcgag 3061 cgagtggtag caggtctgga aaagaaaagc cgtgtcctca acgacaagga aaagaaaatt 3121 gtggcttacc acgaagtcgg tcacgctctg gtcggttcct tgacttcggg aaatggtcgg 3181 gtcgagaaaa tctccatcgt cccccgtggt atggcagcgc taggttacac gctgcagctg 3241 ccaacggaag atcggttcct catggacgaa gcggaactcc gggggcagat tgccaccttg 3301 ctcggtggac gctcagctga agagattgtg tttggcagcg tcactacggg ggcagccaat 3361 gatttgcagc gggcgacgga tctggcagag cggatggtca ctacctacgg gatgagcaaa 3421 atcttaggcc ccctggctta ccagcaggga caacaaccaa tgttcctggg cgatgggatg 3481 cccaatcccc gccggatgat gagtgaggaa acagcacagg cgatcgaccg cgaggtgaag 3541 gaaattgtag aaacggctca tcagcaagct ttggatacac tcaagctcaa ccgagactta 3601 ctggaggcga tcgccactca actcttagag acagaagtga tcgagggtga gaaactgcac 3661 agtttgctaa ggcaggttca aactgtaaat aatttaaccg tttcagatag acctaattta 3721 atttcaggag ttcgttaatg agtattatca cgaaaatgat tgtgaatgca gatgccgagg 3781 ttcgctatcc cagccttgga gaactggacc agatcaagct ctttattagc agtagtgagc 3841 gccgtttacg ccttgtgcaa gctttaactc tctcgcgcga tcgcattatt aagcaagctg 3901 gaaatcaact gttccaaagg cgtccaagcc ttgtttctcc aggtggcaat gcctacggcg 3961 aagagatgac cgccacctgc ctgcgggata tggattacta cctgcgcttg atcacttaca 4021 gtgttgcggc tggagatact acgccaattc aagagatcgg gatcgttggt gttagccaaa 4081 tgtacagatc cctcggcacc ccaattgacg ccgttgctga aagtgtccgt gccatgaaga 4141 acatcactac ctcgatgttg tccgcagaag atgcaagcgt tgttggcact tacttcgact 4201 acctaattag tgctctgtat tagacaacag gaatcacctt cgggcgatct gcgccaatgc 4261 gcagcgcact cgtgcaaagc acacgctgcg cgtttggcgt tcgcgtatcc ctcaacgcaa 4321 ggatttggaa gaatcgataa attagacggc tggatatttt attttttgga tctcccttac 4381 taattactca tctcctgtgg 
ataagaaatt gtcggaattt gcgaaagaat ttctaaccac 4441 ggtggaacag gagtcgaggt ttgctcttgt gcagatgcaa tgtgtcttag caaaggttcg 4501 tgtttcgact ttaaaagttc ttgcaaatta tctacaattt ctttcaaagc tctagaaaac 4561 gaagatttca cctgtttcgt taactgactc acatactgca caagatgcaa tcctgcacta 4621 gcagttattt catctcgcag ccgcaaaact gcaatttggt gattaaatgg tctaggggga 4681 acacgaaact gtttatcagg actccagtca aagatttgtc cattttggtg cttgtcgtcc 4741 ttgcgtgcta aaggaaataa agtgtcagca ttgatagaag tcgcacgact agcgagtcca 4801 ctgtgttgaa tgatgagatt ttgccatttc ttagcaggat caacagcacg ctggagggtt 4861 cggaaaaaac ggacttgact ttttccaaat ttttgttcaa tttgctgttc cttctctggt 4921 aacaaaattg tgctgaggtt ttcgcgttct agttggacat cagcagataa tttattgaat 4981 aagtccgtga cagcttctgt ggtttgctca tgggcaaaag cttgcatttc ttgaatagtg 5041 ttaacaaagg ctgcataaaa atcatcgctt tttgttggaa tcgaatcatc aacttcatct 5101 tcttcaaaaa agctttcgcc tttgcttaga gatattgtgc cattttgggt tcgatcaaaa 5161 agcaaagtcc actcgctcca atcaaagaat attttattcg ttaattcatt tttcacaaca 5221 tcgctgactg caacactatg acggtttttt aaaatcggct gcttctctaa atcttcttga 5281 aatcgtctgt aggtcgtgac tacattttca attgcttcgc gcagagtgaa aaaagcagaa 5341 ctttcctcaa ctggtatgta agcggggatt tcctcaagca gatttttgag atgattctgt 5401 tctttgcgca aagcctcagc agctctttga gtgtcttcta caagttgctt cataccatga 5461 agagcaacat gttctttgag taatgaacgc aacctaccaa taccaccatc ttcagcatag 5521 tcactcaatt gtttgtgcaa ggtgctggag ggaggtagca tttcacttaa tttctgccat 5581 ttctcccgca gttgaacttc ttctagtttg ttgggtttgt ccagttcgct taagaattct 5641 tttgagcaaa cttgtacaag agaagaaagt tctgcgagtt tcgttaatcc ataaagttgc 5701 gacaacagaa caatattctt tttttctgtc gtcaaattgt ttgcgctggc gatcgccaat 5761 ttaagaatac tcaggctatc aaaaacagct tcttcttcaa gcagcatctc atccaaaaga 5821 tcatcaatca ttccttcctc cctactgctg aggggaagtt gattaaagcg tcccacaccc 5881 acaataatac ggtctcttaa gtcttctccc ttatcccgtt ctagtatact gcggattttc 5941 gctgcggttg caccaccagg atatctacca ttgagcagta ataagatagt ttgtacatct 6001 ttgagttccc gcaaagataa aaaagcatcc cgcacaccag agtcagcaga tcccaatcct 6061 ggaaagtcta aaagaataaa ttcattcgtc 
ccttgtaaag aagacaaatc ccaaatttct 6121 tttgatactt caactgtgac atctatacgg cgaatcagcg aaaagctatt gagcaagtct 6181 tgagctgagg ggtgtgctaa attctcccag cgtttgggag caggtgagag atcctcaaag 6241 ctgagttcta aaatatttgt aggcggttct gcaagcctta gtcccttgtg agcagtcgtg 6301 ttgtcaattt ggtaactttt cccgcagata tcttctttgt aaacgctgta ggtacggaca 6361 aaagcaacca gttccttaag cagagagcgt aattccaagc tttggggctt ccatgcttgc 6421 tcgcaccatc tcaggatacc attaatgtca acttctcgcg tggggtgtaa actcttgaga 6481 gtagcaagct gtgctgaaga aagttctgca gcttttgctc gttgttgagt ttcctctaac 6541 ataaagcgca gacactcttt taccccttca ttagaaagat attcaactgt aaacttgcca 6601 agtttcgttg tctgaaagtc gctttgttga accaggtgaa tcgcagtcac gttacctgtg 6661 gtgggtgttt cactgactgg taaagcatct gcgtagccaa tcaaactgcc taaaagtaaa 6721 gtttttccac tactgaattc tcccatgata ccgattttga caggcgagga tgcaagttgg 6781 actgttcttt gagcagactg ctgaaggcgc tggagacttt cataaagact gctaggaacc 6841 caactctctt gaggaggtgg gttgtgtgaa accaactcaa ggttctggat aacagattcc 6901 ccatactctt tatagcggcg taatttctct attgccatat ccgtgttcat ataatttctt 6961 tgttgattca gaaccaaaat gatttttctg tacttcggtc atttaaagtt taccaatttc 7021 aagaatttgt tcgttcctaa tagcagtact cctacgcaat atatgtattt tgttgaactt 7081 ttgtttagcc aagttaacac ttgtttcctt aatattttct tcacaattat ttaaaatttg 7141 attgtttttt aagtaaaatc gctgatatat cattaagtaa tactatgtct accgtgagta 7201 tttaccgcaa atattgctgt ttaacggtaa aagctgaaag ctaagatatt ggaaactact 7261 gggtgcaact ccaagagaaa ataagtaact tttcattaag gaaaattgcg taatgcaacc 7321 agtacctaca agctttacga atattgctca aaggttagca gcattaagcc aagccagagc 7381 cagtggagag ctaatttttt catccggggc taaagcgtgg catctgtatt tcttcttggg 7441 acgtttgcta tacgctacgg gaggagttca ccgtgttcgg cgttggcata gagcactaaa 7501 gcaagattgc cctaacttga aatttgatgc tcatgattta aaagagaatg aattgtggga 7561 atatcattta ctgaaccaga gtattaataa aaaccagata agtctgacga gtgcaaaaag 7621 tattattgaa aaaattgttg aagaggttct atttagctta gtaagtcatt cggacttgaa 7681 aagcctttgg ttgcccaaaa aactgtatcc tattgccctg atagaagtaa atcaatcgtt 7741 atgcacagcc tttgatttgt acaagcaatg gcgaaacatg 
ggtttaggaa cactttgccc 7801 tgacatgaca ccgattttaa aacaacctac ttctgagcaa aatttaaaat tttctaaaat 7861 tctcttcagt ctcaggcagc tcattgatgg tgaaaacacc atttgggata tcgccttaca 7921 cacgagacaa agtgtaataa ctgcggcaag tcttgtacag aatttggtct atgagggtgt 7981 tctagaactc ttaaccatcc cagacttacc tattccagtc ggagcaacga tatcagcccc 8041 ctcaagcgaa aaacaaactc agcctacgta tgatattgtt gggaacacaa cacagactcc 8101 gtcgtctcaa agcttaccca atcccttaac aaaaactgac ttccaccaaa ctccagagtt 8161 aatggaagcg agtaaggata acactcgccc tcagtcaacc tttgactctg caaaaataaa 8221 attgattcaa cctaaagaga ctacctcaat agtcgcctac atagatgaca gccaaagcga 8281 taatctcata atgagtgaga ttctaaagct agctggctat aagtatgtca atattcaaga 8341 tccagtcaca gcactgccaa tcttgctaga gtgcaaacca aatctcattt tcttggactt 8401 agtcatgccg atcgccaacg gatatgaaat ctgtactcaa attcgtcgcg tatcagcttt 8461 caaagaaaca cctgtcatta ttgtgactag cagtgacggt ataattgaca gagtgagatc 8521 aaaaatagtt ggctcctctg gctttttagc aaagccaatt accaaagaaa aagtactaaa 8581 agtcttacag aagtatctac taactcctgt gcctgtgcaa tctactcatc tacaaacact 8641 tcaagtttag tttctagatg cgtagatttc tatccaataa agttcactag tcattagtca 8701 ctagtcatta gtgagccagc gctatgcagt ccttgttttc cacgccctgg cgtagtcaat 8761 cgagtcatta gttattaatc attgtataca ggaagatttt tttatgacca cagtcctttt 8821 ggttgaagat agcttgacag aaactgaagt gttcacccgt tatctcaagc aagcaggctt 8881 aacagtcgtc actgctataa gcagtgaaga agcacaatta aaattgcaat tccagacgcc 8941 tgatttggtt attattgatg tcattttacc agggcaaagt gggtttgaat tatgccgtga 9001 actgaagacc aatgccaata cccaacaaat cccagtcgtg atatgttcaa ccaaaggaac 9061 tgaagctgat aaactttggg gtacgatgtt gggagcggat gcttatatac ctaaaccagt 9121 aaaccagcaa cacctagttc aaacaattca gcaattaact tcacagcttc aaccttggcg 9181 tcggcaaatc atttaagtaa gtcggtacaa ataaagttaa ctataggact catatttgat 9241 ttttgtgaaa ctaggtacat ttgatgttcc ctgttctctg ttccctgttc cctccggatt 9301 cctatagcaa ttatgaacga caatttataa ctcaaaaaat tatgcaaaca gagccgaaat 9361 ctgtacttgc agtgactagt tttgagccac tgcaactcaa tccactcctc ccagaaacta 9421 acctttctaa attgcttcgc tttccccttg ggtttgctga tagtgggcta 
ttacctctag 9481 aacaaattgc cgaaattatc acggttaatt tagcatccat tctacctgtt cctgaaatgc 9541 caagttgtat tttgggaatt tgtaactggc gaggagaaat gctgtggctt ttagacttaa 9601 accatcttgt tggatatcct ctgctgacga ggggggtaac tcctgtcgca atagttgtta 9661 aggtcaacga acaagctata ggtttagtag taccacaggt tgatgatatt gaattacacg 9721 acttgcaaca actgcaaaaa gcagcagctg gcttatttcc tcctaaactt ttaccttttg 9781 ttctgggggc tttgccaaat ggcagtactg ttttagatat tacagctatc actcagtatc 9841 ccctgtggca aatacactca acaaaaaact gataactgat aactggtaac tggtaactgg 9901 taactggtaa ctgataactg ataactgacc aatcccaagg atataaacat ggttttacag 9961 gattttgagc aaaccaagca aatatttgat ttccagtccc aaaatggtga tgtagaaaca 10021 gcaaagttca taggttattc tgagctacat caagcttctg aattagtgaa tgctattaaa 10081 gcagatcttc aacaaatagg agattcactc aaacctgaag ctcagcaaaa agttctccag 10141 ctggaaaagt gcgttcaaca attgcaaaca gaatatcaag ctcatctagc tgtcgcttgt 10201 gagcaacata cgaagcagac tcggcaaagg ttattcacca ttttggctca gatgcgccaa 10261 gcagcaagtg ttgacacctt gttgagaacg acagtgactg aggttcgtaa gcttttacag 10321 gtagatcggg cgctcatcta ccgttttcag tctgaaaagc atggcatcgt gatcgctgag 10381 tcaatggtat caggctatac tcctagccta ggggaatctc tgagtgcgat cgcctttggt 10441 catgagaatc aacagaacta caaggaagag caagtcctta tccaagacga tgtttacgaa 10501 aaggtcaaga gtgcttatca acatcaatta ctgaaacatt ttcaagtcca agccagtttg 10561 agtttgccca tgatcataga cggacaagta tggggcttgc ttgtgatgca gcaatgctca 10621 acttctcgtc aatggcaaga agctgaagtt agcctgcttt accaaattgt cactgaactg 10681 aggctaaatc tacaaacgat agaacttctt actcaacaca aagaggaggc taagcaagaa 10741 aagattttag ctcaaatcct ggaaaaaata ccaccatctt ctgatacaag cgttaccctt 10801 ggcaacatta cccaagaact gcgacagttt tttaaggctg accgagtcgt tgtttatcgc 10861 ttttaccctg attggagtgg cgtatttatt gctgaatctg tcgcctcggg ctgggttgct 10921 ttgatgcaag agcaagagaa agatgtcagt ttgaagtcaa aaaataaagt agattatgac 10981 cgctgcacag tgaatatgct aggcaatctt actagccttg acatcgacac ctacttgaga 11041 gatacacaag caagagattt ggtcaagagt aaggcagtta agcgtgtgga cgacatctat 11101 actgctggtt tttcggactg ctatatcaca actcaggaga 
aataccagag tagagcctac 11161 atcattgccc caatttttga aggaggagaa ctctggggtc taataggtgt gtatcaaaac 11221 tctggacctc ggcgttggca ggattctgaa gttactctgt tgtcactagt cagtcatcgt 11281 ctgggcgcta ttctcaagca ggctgatgct gcggctgtgc taaaactcaa atcagagcaa 11341 ctagcggctg aacgagagcg aatcgtggcg gatgcgatcg atagaattcg tcaatcttct 11401 gacattcaca ccattttcaa gacaacagtt tatgaagtcc gaaatctcct aaaagcagag 11461 cgtgttgtta tttatcgctt caatctggat tggagcggtg agtttgttgc tgagtctgta 11521 tccagtgaat ggaaatccat aatgcaggag caatttgaca accccaccct gccacaaaac 11581 gtcagcgagt gcagtgccaa aattcttggt ggaggaaaag ccagtattac agacacctat 11641 ctgcaacaaa ctcaaggggg acgattttcc acgacaacaa actttagggt tgtcagcgat 11701 atttaccaag caggattttt accttgttac attgacactt tggagcggtt gcaaacaaga 11761 gcttacgtca ttactccgat tgtcacagga caaaggcttt ggggtctatt agcggcatat 11821 caaaactcta gtccgcgtga ttgggaagag catgaaatta aggtaatgac ccaaattagc 11881 aatcaactgg ggataggcat acagcaagca gagtacctca aacaactaca ggaacaagcc 11941 actgtgatag ccaaagccgt agaacgggaa cgaggcatag ccaaggttat cgaaaagatt 12001 cgccaaacat cgaatattga cactattttc cggacaacaa cgcaagaagt tagaaaactt 12061 ctgggtacgg aacgagtgac gatttacaaa tttcgcccag attattttgg tgactttatc 12121 atggagtctg agtccggcgg atggcccaaa ttagtcggta gcggttggga agacgcatat 12181 ttgaacgaac atcagggcgg tcggttccgc aacaatgaac cttatgtcgt ggatgatgtt 12241 tacaacgcca atctcagcga ttgccatata gaatccttgg agggatttgg agtgaaatcc 12301 tgcgttgtcg tttccatatt ccaagggcag aaactctggg gtttattgtc agcgtttcaa 12361 catagtggac cacggcactg ggaagatggt gaagttaagg tactggcgca gattggctcg 12421 caactgggag ttgctttaca gcaagcagag tatattgaag aactgcgtgt gcagtcacag 12481 aagttagcca agtcagttga ccagggaaca ctttacagca aactggttta tagactcggg 12541 ttagccctaa ttcaagagaa tttttctctt gacaacctac tcaagatggc tgttacagaa 12601 ttacgcagac tgttgaaagc tgatcgagtc gtcatcttcc gctttgctcc tgactggagt 12661 ggtgaattta tcgttgagga tgttgggagc gattgggtca aacttgtggg aagtgagcta 12721 tctctagccg atgatacttg tcttcgagaa acaaatggcg gtcggtttcg ccgtagagaa 12781 actctcagca ccgataatct 
cagagcttct ggatatagtg aatgctacat caaacttcta 12841 gaacagtggg aagcaaaggc taatatggct gctcccatct tcaagggcga ccaactttgg 12901 ggattattgg gagcttatca aaatgatgca ccacgtcatt gggagcaaat agatgtcaac 12961 ttgctagctc aggtaggggt acaaattggt cttgccttac aacaagcaga atacttggag 13021 caattgagaa ctcaagcaca gcaattaaca gaagcaagcg aaagggaaag agcagccaaa 13081 gagcaactcc aacgggaggt gattcagttg ctgtcagcag tgcaaccagt attacaaggg 13141 aacctcacta ttcgggtacc tgtaaccgaa ggtgaagtgg gtaccatcgc cgacgtctat 13201 aacatcaccc tgcaaagtct gcgaaaaatc gtcatgcagg tgcagacagc atccaggaaa 13261 ctcgctcaaa cctctcaagc aaacgaatct tctatcgttg gtttagccac ccaagcacaa 13321 caacagtttg ggtcccttac tcatgctttg gagcaaatcc aaacgatggt caattccaca 13381 aaagccgtaa caaccaacgc ccaacaggtg gaagaagcag tgcaggtggc aaaccaaacc 13441 atccaacaag gagatgcggc gatgaaccgc actgtggatg gaattcttga tattcgggag 13501 acagtggcgg aaaccaacaa gcgcctcaag cgcttttcgg aatcttctca gaaggtttcc 13561 aaggttgtga atttgattag taactttaca actcaaactc aactgcttgc cctaaatgcg 13621 gcaattgaag caactagagc cggacagtac gggcgtggtt ttgccgttgt cgccgatgaa 13681 gtgcgttctt tggcgcgtca gtctgctgag gctgcgactg agatagagca gctagttcaa 13741 gaaatccaaa aaggcacagc cgaagtttct atggcaatgg aaaatgcgat tcggcaagtt 13801 gcaacaggaa caacgcaggt acacgaagct cgccaaaatc taaacgccat tgttgcagcc 13861 accactcaaa tcagccaact tgtggaaggt attacccaag caacacaggt acaaagtcag 13921 gaaatccaat cagtcacgca aaccatgaca gaagtggcag atattgccaa caatacaaag 13981 gaggattcga tggaaatttc cacatccttt aaggaaatcc tagcaatggc tcaaaatctc 14041 caagctagtg ctgaccagtt caaggttgat tgatgaatag tcattagtca tttgttcctt 14101 tgtttaagtg ctaatgacta atgactaata actaatactg gctaaaaaat caaaaaagct 14161 aataattaat aactcataaa taataaataa taactaatga ctcatgaccc aacaattcaa 14221 gagcaaagcc accgctactt tctccaagaa gcaccagaac tgctgcaagt catagaacaa 14281 gaattgttca ctctcagaga agacttcagt gtcaacaagg tttacacgtt gatgcgtgct 14341 actcacacac tcaaaggagc agcagccagc atctgtttgg aaacgattac aaccgtggct 14401 cacacgttgg aggatgtttt tagagcaatc tgcaaacctg acttatcgat tgaccaagaa 14461 
gttgaagctt tactcgtcga gagttatgag tgtttgcgct tactcctaac cgctgaattg 14521 acaggagggt caattgatga taatgatatt ctcgaccgaa tcgctcgtat ttttgcccaa 14581 ctgcaagaaa agttaggaga ctgttttgct gaggaagctt accttcccga ttcacaagaa 14641 ttgggctttg acttaacgca gtccattttt gaagttggag tcactcaacg cttagaagag 14701 ctggctacaa ccataagcag tggtcatccc gaacaagttg ctactacgct gagggtacaa 14761 gcagagattt tctttggttt agctgagtct ctaaatttac cagggtttaa agctatagct 14821 caaacggcgc tgactgcttt ggacaattat cctgagcaag cgatggttgt tggctacgtt 14881 gctttggcaa acttccaaga agcacgaaaa gccgtgctaa acggtgacag aagtcaaggc 14941 ggtgaagtat ccttagcatt acaaaaactg ggggaacgac tccacgatgt accgcaagtg 15001 tccaaaaatg caccagtcaa agctcaagag tctgtgaggc agcgcgagag agggggtttc 15061 cccaaaacga accccggaaa ggtaactcaa acgacacctg atactgaatc agatactgag 15121 gaatcagcca atgatcatct aacgactcat caaccagact cttttgcccc atctgctgtg 15181 gagtttggag caaaagaaag cgaggaggtg acgctaacgg aggaacaaga accagtgcgg 15241 acttctcccc accttttaat ctcctcacca ccccaacctg ctatctcctc ctcaactcgt 15301 actgtgcggg ttaatgttgg ggagttagaa caactgaatt accatgtcgg agaattgcta 15361 acaaaccaaa atcgccaatt tctccagaac gaacagcagt tgacagttgt tcgagtactt 15421 cttttaaaac tacaacaaca tcagcagttg cttcatcagt tacaagattt gtcgcaacgc 15481 cagttcagtg ttccagaaca acagtggtta tttcgcaatg ggcaagacaa gggatatttc 15541 gattctctgg aactgacgac gagttacaca gagtttcatc atctggttca atcacttcta 15601 gaagacatgg tgcaattagg agaaacgacc gatgcgattg aaatgtttac tcgtcagtcc 15661 caggagactt tggaaaaaca acgtcggtta ctaactcata cccatgattc tctcatggaa 15721 gcccggatgt tgcccttggg acacttgttt gaacgctttc cccgtatttt gcatcagcta 15781 gaagttatac acaaaaagca ggtaacactg aattttcgtg gaagtgacac cttagttgat 15841 aaagcagtgg ttgagaaact gtatgatccc ttgctgcatc tgcttcgcaa tgcttttgac 15901 cacggaattg aattgaactc aatccgacaa aaacggggca agccagagaa aggtcagatt 15961 gaaatctctg cctaccaaca gggtaaatac ttagtgattg aagttcggga tgatggtcaa 16021 ggattagatt ttgaaacgat tcgtgccaag gcggtagaac gtcaacttgt ctcgccagaa 16081 caagctagca atctaaatga agctcagcta acagagttca tttttcaacc 
aggcttttct 16141 acagcttcta atatcaatga cctctctgga cgaggaattg gtctggatgt tgttcgcatt 16201 catttgcaaa caattaaagg gtcagttgaa atatattctc aattcagcca gggtacaatg 16261 ttcagactgc aagttcctct gagcttaacg atcgccaacc tactgctttg tcaagctggc 16321 aaccaaatct acgccttgtt tactaattct atcgaagaaa ttcttattcc taaagcagac 16381 cagattcgct cctgggaaga aggtaaagtc ctccagtgga gtcaggatgg tttgatgaaa 16441 ttgattcctt gttaccaact cacgaaagtc ctgaattatt tctcatccgt cactcaaccg 16501 tcagtcttta gtaccaagtc gaatggtatt tctcacaagc aggagaagcc aattattctc 16561 attcgttctc aagataaact ttttggtttg gaagtagatc agctcattgg tgatcaggaa 16621 ttggtgatcc gtccattagg agcaatgatt gtccctcccc cttacatcta cgggagtagt 16681 gttctacctg atggtcggct gacattagta cttgatgggt tagcattgat ggaatatttg 16741 tctaaacggc aaaaacagga tgatagtgat tggggaagga acagcgccct ttggggcgaa 16801 cgccatttcg tcagtccaac gcccctcatg ttcacctcca ggatagaaca gccgcgatta 16861 ctatcccagg cgagtactgc ccgaacagaa gcaccaactc tgagtcacca caaaccaaga 16921 ccaaaaaaaa ctattctcgt agtggacgat tcaattaccg tccgccaaac tgtagctttg 16981 accctagaaa aagcagacta tcaagtgctt caggcaaaag acggttacga agcaattgag 17041 ctactccaag gtcacacaga tattcatcta gtattctgtg atatcgagat gccacggatg 17101 aatggctttg agtttctgaa aaaccgtcag caagacccag ctttggcaaa tattcctgtt 17161 gtgatgctaa cttccagaag tagtgataag caccgccaac ttgctttcaa gttgggagct 17221 aacgcttata ttaccaagcc ttatctagaa catatactgt tgaccacact gagagatgtc 17281 tttgagaaag atactggagt tgttgctagg agagagtaat taggtacaat gcataacgag 17341 tcacagttag acaagtttct tgtttttaaa attgcagatt atctcttagc gctcccgatt 17401 accgatgtgc tgaaggtaat aaatttttca tctgcaaata gcaaaagcct gccaacaatg 17461 gggctagtgc agataggtca gtacataatt aagattttgg atttacatca acacttaggc 17521 tcagatactt ttcctcactc atctgatcac ccattttttt tggtggttgc acaccattca 17581 caacaggaac tctgtggaat tttgatagat gaaccacccg atatagtgga attactgcca 17641 gaaacgattc gctttttacc aaggtctagc aatcatagta aacctctgat tgagatggtc 17701 agtcatgtcg ctgttttatc cgagcaagag gttgtaaaaa caatctttct gctcgatttg 17761 aagcgtatcg ctgagcctgt ttaactaata 
cataagtact aaaaaacgaa aaaattccca 17821 ctattctagt cggaatacaa aggtcaacga cctctagtta agtacaatcc ccatgcctat 17881 aaagcagggg cttgcagtta actgtagttg accagtggga tgatcccacc ttttgtaaac 17941 gttaattatt gtctattggc taacgcctcc caagtttggt aacccacaac tcctggagga 18001 tcattcacaa aatcacgaga attttgaaag tctttgacag cgttctctgt gtttgaacca 18061 aaaatgccgt caacagcaat agagtatcca tgaaagttgt ctaaaagcag ttgtaaaatc 18121 ttcacagctt caccctgact atttttgctt aatctgggca tgttcacagt acctagagct 18181 tgtcctgaca taagttttaa ctcctgtaaa aaaaggttat tccgtcaaag cgttctttct 18241 agttttaata ggtgttatac caattctcca agaagatgca ctttattttt aaaacgttag 18301 cgtagcgtgc gcccttggcg cataccgc // LOCUS NODE_1790_length_18320_cov_4.55937618320 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18320) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18320) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18320 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1272) /locus_tag="DP116_15385" CDS complement(<1..1272) /locus_tag="DP116_15385" /inference="COORDINATES: protein motif:HMM:PF13374.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15385" /translation="MNEQRQQAYFNLIQRLLNCRSNEIRKILAANQDLIDAGLVQKML EMASNLLRQGELDQANRLMNIAGQQLGVSSKLSSTATEKEYLDFLEQVLKVTADSRGN AQLVYPLLEKNTDKLDEVFAEILRQWGTKTLREAQADVAESIAADIFLFSDLIQQFPL GNKASNMEIAITGYKVALTVYTSEAFPQKWATIQNCLGLAYRERILGEKAENIELAIA AFSDALTVYTQQAFPQDWAGTQNCLGAVYVDRITGEKAENIELAIAAFFDALSVHTQQ DFPQNWATTQYNLGNAYCERILGEKAENIELAIAAYTAALSVHTQQDFPREWATTQYN LGLAYVQKILGEKAENIELAIAAFSDALRVHTQQDFPQEWATTQNCLGNAYLERILGE KAENIELAITAFSTALRVHTQQDFPQEWATTQ" gene complement(1529..1602) /locus_tag="DP116_15390" tRNA complement(1529..1602) /locus_tag="DP116_15390" /product="tRNA-Val" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(1566..1568),aa:Val,seq:gac) gene 1719..1898 /locus_tag="DP116_15395" CDS 1719..1898 /locus_tag="DP116_15395" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15395" /translation="MCFYSKLEHNRAVDFLVTGQFHNYSEVALPGNARADLEKNHQNI VETLTMPQRRCNETA" gene 2000..2785 /locus_tag="DP116_15400" CDS 2000..2785 /locus_tag="DP116_15400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317023.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_15400" /translation="MSTALITGASDGIGKAFAEELAAQNTNLVLVARSEAKLNQLAKQ LQEKYKVQVDTIVKDLTETDATHDVFDAVKSKGLTIDLLINNAGFADYGDFAETDQER QLKMVQLNILALVALTHKFLQGMRQRGSGSIINVSSLTAFQPMPYLSVYAASKAFIVS FSQALWAENRHYGIRVLVTCPGPVETNFFTEAKFPPALAAKTNKISTSEEVVRESLKA LERGDSTVVVGGFSTHFISKLSRFVPRQTLLSLLAKQFKAKTV" gene complement(2813..3592) /locus_tag="DP116_15405" CDS complement(2813..3592) /locus_tag="DP116_15405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015081534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II S4 domain protein" /protein_id="PRJNA477356:DP116_15405" /translation="MLPREELLKGVENRDSIARVIDQAEQAIKTWEVVFTDFLSPPEL AEIMVVFSRLTEVQLVVWGGYPQAERTRVAIARSEIPLDQSQVALTALDIAGNFLFDT ATHRDFLGAMLATGIVREKTGDIIVLGERGAQVIVVPELAEFLEMNLKQVRSVPVKTQ RIDFSELKVREPKKKELTTVEASLRLDAIASAGFGMSRSKMVELIDSGDVRVNWKEIT QASSQLKTGDLIAIRSKGRLEVGEIAVTKKDRYRVQLTRYV" gene complement(3703..4209) /locus_tag="DP116_15410" CDS complement(3703..4209) /locus_tag="DP116_15410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015081533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15410" /translation="MNYLVAVLADRIQAEAAYLALEKEAIKSTILGKGYKTADEFGLI DPNEQAKKQTQFMAVWLVPFGFFAGITFSLLTGLDTFAWAGEIGNHIIGGLLGAGAGA MGSVFVGGGVGLLVGSGDALPYRNRLNAGKYLIVVQGPEILTRQATRVLRQFEPENIQ GYAAPSEY" gene complement(4246..4623) /locus_tag="DP116_15415" CDS complement(4246..4623) /locus_tag="DP116_15415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15415" /translation="MTKVTAKEIAQFRSQVADDFTAMEALDIIEECDGDLEDAAITLA IRAGQEPEIANSEWLDALARKWRAAICQEEFRDDLVNGSVKGMMEHLKTMPTFPKILA TPVLIYVLKKSVNNFCEPLDLVQ" gene complement(4806..7034) /locus_tag="DP116_15420" CDS complement(4806..7034) /locus_tag="DP116_15420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TolC family protein" /protein_id="PRJNA477356:DP116_15420" /translation="MNRQQIFSSFLPGVTAAVLATQPAWANPAPATGVKLFASSDGLT STFEGTPTALDIKPQLPPTASSSFLAAAVPTVDVMGGGLISSITRGGVEVILTGKTGI PIPQVAHKVQSMLMSLKPTSNQTQLTNKIPPSGTSIQKQNNSIIIAEASKNTFSSYNE TSRPASTVEQKTLSSGQLPTEKKTSAQLPNKLQTPVRGTVEVAKLLDQGKLCPQQAKN GKTQVGRSASVLQKSSTCSQQNTMKNLVAQAGSSKPARTTPATTTPARITPAGSSKPA RTTPAPTTPAQTTPAPTTPASASSVQIPNYLKANPNPLQFPTKPEEVRIQGTQPITLA DSLEIARRNSQDLQISLLNVERSRASVRQAQAALLPTASLSAGLTRGGPAFLNQQQLN SQRTALEDVPSTTNFSSTAQVEYNLYTSGQTTARIRAAEEQLRFDELAVEVQSEEIRL SVTSQYYDLQQADEQVRIAQSAVRNGQASLRDAQALEAAGVSTRFDVLRAQVNLATYQ QQLTSGISQQQIARRRLAQTLSLPQSVDISSADPVRIAALWNVPLPETLVLAFKNRPE LQQNLAQRNIAEQQRRAALSQLGPQVGLVGSYQLSDRFDDQRNGTDNYSLGVQARLFL YDGGAARAAAAQQKANIRIAENQFAIARNQIRYNVESSYSQLQSNLQNVQTASTALEQ ARESLRLARLRFQAGVGTQTEVIISENALTQAEGARVTAILNYNRALATLQRAVTSRA AR" gene complement(7106..8665) /gene="kaiC" /locus_tag="DP116_15425" CDS complement(7106..8665) /gene="kaiC" /locus_tag="DP116_15425" /EC_number="2.7.11.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747666.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="circadian clock protein KaiC" /protein_id="PRJNA477356:DP116_15425" /translation="MSQKENLEQTKTPTRGVEKIRTMIEGFDDISHGGLPVGRTTLVS GTSGTGKTLFSLQFIYNGITYFDESGVFVTFEESPSDIIKNAGIFGWNLERLISEGKL FILDASPDPEGQDVVGNFDLSALIERLQYAIRKYRAKRVSIDSITAVFQQYEAVGVVR REIFRLVARLKQLSVTTIITTERTDEYGPVACFGVEEFVSDNVAIVRNVLEGERRRRT IEILKLRGTTHMKGEYPFTITNAGINIFPLGAMRLTQRSSNVRVSSGVKALDQMCGGG FFRDSIILATGATGTGKTLLVSKFLQDGCVHSERVILFAYEESRAQLSRNASSWGIDF EDLERQGLLKIICTYPESTGLEDHLQIIKSEIAEFKPSRIAIDSLSALARGVSNNAFR QFVIGVTGYAKQEEITGFFTNTTEQFMGSHSITDSHISTITDTILMLQYVEIRGEMSR AINVFKMRGSWHDKGIREYNITADGPEIKDSFRNYERIVSGAPTRVTIDEKAELSRIV KGFQDTQSSDP" gene complement(8747..9070) /locus_tag="DP116_15430" CDS complement(8747..9070) /locus_tag="DP116_15430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495861.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="circadian clock protein KaiB" /protein_id="PRJNA477356:DP116_15430" /translation="MNQPRKTYVLKLYVAGNTANSVRALKTLQTILEQEFQGVYALKV IDVLKNPQLAEEDKILATPTLSKILPPPVRRIIGDLSDRERVLIGLDLLYEELIEDER EPFEE" gene complement(9186..9494) /locus_tag="DP116_15435" CDS complement(9186..9494) /locus_tag="DP116_15435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459933.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="circadian clock protein KaiA" /protein_id="PRJNA477356:DP116_15435" /translation="MTRAEEQALLRQLKSDYRHILINYFTTTDQALKDCIDKFINSIF SANIPVPRIIEIHMEIIDEFSKQLKLEGRSDEALLDYRITLIDILAHLCELYRCSIYQ " gene 10914..13010 /locus_tag="DP116_15440" /pseudo CDS 10914..13010 /locus_tag="DP116_15440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317015.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" gene 13113..14750 /locus_tag="DP116_15445" CDS 13113..14750 /locus_tag="DP116_15445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409413.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15445" /translation="MSTGQAAPTRSVFVSPAETLREREASPKDIGDTGITYAFNTSLA PGQVPRQSKRQVPTRLQTQSPTQGKSLFDLSHRSTGVYGKSSWASREAALQASITHHL SSSKSLVLVVEAVARYIEDLTEQLTGLGYRVVIARSGCEAVEKARRLQPKAIFLNPLL PLLSGWDVLTLLKSDVATSHIPTIVTATGAEKDQAFANQADGFLSLPVQHQVLTPLIE SLCTPPEQKQQKLDDDDILHNNTPLKILKLVDPESESSTSHSLLQQHRVIEVDDLDQA ELLTRVWQFDVVLLDIEMPAAEALLKQLSDHPRLANLPLVTCNVSTTQAASQMPGLSV FPCLTPLATDKNHGGGKTDALLSVLQIASGTCYCPPSILVVDLGILPDLPDTSQATVR GCRTQKNFSLNSPIAGENTPCPWIISGGSEWFQALIQYLQTGGFKASMSRSWAELLQQ IRHQNVDVLLICLGESSINKEVYSALKALQQLPFDLPPVLLLDQRLNSDQVEDEQTEI PLLESIETVISTLASQILPRSISMENLLNEINKALET" gene complement(14857..16641) /locus_tag="DP116_15450" CDS complement(14857..16641) /locus_tag="DP116_15450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_15450" /translation="MINDYMGKRRVFTVTLSILWLVLIGGVAFFWHLGSIGLIDETEP LFAEASRQMFVTGDWITPFFNGETRFDKPALIYWCQAIAYSIFGVNEWAVRLPSALAA TGLISLAFYTIQWHLARQDYLERTTRPTRRWLTAGLGAAVMALSSQMIVWGRTGVSDM LLVGCMDSALLCFFLAYAQPSSQSEVKARWYLAFYVLIAGAILTKGPVGIVLPALIIG IFLLYLGNVKAVLREMRPLTGLLITLCLSVPWYLLVIWRNGENYINSFFGYHNLERFT SVVNRHSAPWYFYFFVVLVGFAPYSVYLPLAMARLKFWQPKYWRSQKRSSQLGLFAFF WFIGVFGFFTTAVTKLPSYVLPLMPAAAILVALLWSDLLKDEKMREQYPPDSLDRPFF WTGWVNVVFLLVLAVAMFYVPQLIRDPAAPNFSELFQQSGLSVLGGVIWLLCALILAA ILVRRYYQPMLMVNVLGFAAFLVFVLTPCLFLIDQERQLPLRELSAIAVQAQQPNEEL IMVGFKKPSVAFYTKKIIHYVKVSTDAEQYIQDKAAKKAQPPSVLVLAQLTKFPEMNL KPTDYENLGTSGAYQLIRVPFNKKELRR" gene complement(16748..17350) /locus_tag="DP116_15455" CDS complement(16748..17350) /locus_tag="DP116_15455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15455" /translation="MNPPVYCHAVLVGFSLGVLFLPMGCRQKSVSFESTELKASSLPS AQTHRDVSAASSSGSSVSAKSYQVESSRGGIKHTVMTYQQGSSGQAANQGTLRMSNQT NQPVRLALLSRRSLNKGSSSGQIKDAVPAHWDFAPQEGSEKGMILALPNGKLELETGD ILVAFAQDGSRRYWGPYVVGETSLPSWNSQKKEWQLILSP" gene 17956..18288 /locus_tag="DP116_15460" CDS 17956..18288 /locus_tag="DP116_15460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208719.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF565 domain-containing protein" /protein_id="PRJNA477356:DP116_15460" /translation="MQNTRLNNLFNAIALRLRLWFFNPWRRFSVLIISFLFGFFLGSA VATTAGQSAEWDVVVAGMLMVLTEIGSRIYYGRSIRERQALWVESLNILKIGLMYSLF LEAFKLGS" BASE COUNT 5286 a 4063 c 3789 g 5182 t ORIGIN 1 ttgcgttgtt gcccattctt ggggaaagtc ttgctgagta tggactctca aagcagtaga 61 aaaagcagta attgccagct cgatattttc tgccttctct cctagaattc tttcaaggta 121 ggcattaccg agacaatttt gcgttgttgc ccattcttgg ggaaagtctt gctgagtatg 181 gactctcaaa gcatcagaaa aagcagcaat tgccagctcg atattttctg ccttctctcc 241 tagaattttt tgaacgtagg caagaccgag attatattgc gttgttgccc attctcgggg 301 aaagtcttgc tgagtatgga cactcaaagc agcagtataa gcagcgatcg ccaattctat 361 attttctgct ttctctccta gaattctttc acagtaagca ttaccgagat tatattgcgt 421 tgttgcccaa ttttggggaa agtcttgctg agtatggaca ctcaaagcat caaaaaaagc 481 agcgatcgcc aattctatat tttctgcttt ctctcctgtg attctatcaa cgtagacagc 541 accgagacaa ttttgcgttc ctgcccaatc ttggggaaag gcttgctggg tgtagacagt 601 caaagcatca gaaaaagcag cgatcgccaa ttctatattt tctgctttct ctcctagaat 661 tctttcacgg taggcaagac cgagacaatt ttgtattgtt gcccattttt ggggaaaggc 721 ttcgctggtg tagactgtca gcgcgacttt atagccagtg atggcaattt ccatattgct 781 ggctttgtta cccaagggga attgctgaat tagatcgcta aaaaggaaaa tatctgcagc 841 gatggattcc gccacatctg cttgggcttc cctcagtgtc tttgttcccc actggcgcaa 901 tatttctgcg aacacttcgt caagtttgtc tgtattcttt tccagtaatg ggtaaactag 961 ctgtgcgtta cctctactat ctgctgttac tttcagcact tgttctaaga aatctaagta 1021 ttctttttct gtggcagtcg atgataattt actagaaacc ccaagctgct gtcctgctat 1081 attcattaaa cgattagctt ggtctaattc gccctgcctt agtagattac ttgccatttc 1141 cagcatcttt tgtactaagc cagcatcaat taaatcttgg tttgctgcca aaatctttcg 1201 gatttcgtta ctacggcaat tcaacaggcg ttgaatgagg ttgaaatagg cttgctgacg 1261 ctgttcgttc atagctaagg gcgtgagagt cgcacatatt ttcctatcag agaaattcaa 1321 caccagcgtc ctagattccg gaaacggatg gagtttcatc acaggtgatg cattccccac 1381 aaccagcgat catcgcccta aatctacatt caacaaaaaa acttgctact gaagctttta 1441 agtcactacc actaccacga 
caagattttg tcccaacgac attaatctgt caccgatgac 1501 agataattag tcactgtaca aaatttaatg gacgtaactg gactcgaacc agtgacctct 1561 acgatgtcaa cgtagcgctc taaccaactg agctatacgt ccgcaacttt attaaaatag 1621 catagattta gcagaattgt caagattaat ttgctaaagt tgcctgcgag tcggcaaatt 1681 ctgtatttat tttgctcata caagtgtata caatcaacat gtgcttctac tcaaagcttg 1741 aacacaacag ggctgttgac tttttagtta cgggtcaatt ccacaattat tctgaggttg 1801 ccttgccagg aaatgcaaga gcagatttgg aaaaaaatca tcagaatata gtagaaactt 1861 tgactatgcc ccaacgcaga tgcaatgaga cagcttgata tattagtttt aagttagttt 1921 aaataacaga gagtcctaaa taggattgtt acaatctgta taatagttaa attttgttaa 1981 ttttttcata gtacgtagaa tgtcaactgc tttaattact ggtgcctctg atggtattgg 2041 taaagccttt gccgaggaat tagctgcaca aaatacaaat cttgttttag ttgctcgttc 2101 agaagcaaaa ttaaaccagc tagccaaaca actgcaagaa aaatacaaag ttcaagtaga 2161 tactattgtt aaagatctga cagaaacaga tgctactcat gatgtgtttg atgctgtcaa 2221 atcaaaagga ttaacgattg acctgttaat caacaacgct ggttttgccg actatggtga 2281 ctttgctgag acggatcaag aacgacaact caaaatggtt caattgaata ttttggcatt 2341 agtagcttta actcataaat tcttgcaagg gatgcgacag cgtggttctg gaagcattat 2401 taacgtatct tccctcaccg catttcaacc aatgccttat ctttctgttt atgctgccag 2461 taaagcattt attgtcagtt ttagtcaggc actttgggca gaaaatcgtc actatggtat 2521 ccgcgtttta gtgacttgtc ctggaccagt tgagacaaat ttcttcacag aagcaaagtt 2581 tcctccagca cttgcagcta aaacaaataa aatatcgact tcagaagaag tggtacgcga 2641 atcattgaaa gctttggaga ggggagattc gaccgtcgtt gttggtggtt ttagcactca 2701 ctttattagc aaattatcca gatttgtccc acgccaaact ctgttgagtc ttttggcaaa 2761 acagtttaag gcaaaaacag tttaaggaaa gacgcactat ggtgcgtctg gattacacat 2821 accttgttaa ctgcacacgg tagcggtctt ttttagtaac agcaatttcc ccaacttcta 2881 aacgtccctt actgcgaatg gcgattaaat cgcctgtttt tagttgagaa cttgcttgag 2941 ttatttcctt ccagttgacg cgaacatctc cgctgtcaat taactcaacc attttgctac 3001 gggacattcc aaaaccagca gaggcgatcg catccaacct caaagaagct tccacagtgg 3061 ttagttcttt tttctttggt tcccgtacct tcaactcact gaaatctata cgctgagttt 3121 tcacgggaac tgatcgcacc tgcttgagat 
tcatttccaa aaactctgct aactccggta 3181 cgacgattac ctgtgcgccc cgttcgccta gtacaataat gtcccccgtc ttttcgcgga 3241 ctatcccagt ggctagcatt gcgcccaaaa agtcgcggtg agtggcggta tcaaacagga 3301 aatttccggc tatgtctaaa gcggtgagtg cgacttgaga ttgatctaaa gggatttcgg 3361 aacgggcgat cgccactctt gtgcgttcag cctgaggata tccaccccaa accaccaact 3421 gcacctctgt cagacggcta aataccacca tgatttctgc taactctgga ggagagagaa 3481 aatcagtgaa aaccacttcc caagttttga tagcttgttc cgcttggtca atcacacgag 3541 ctatactatc tcgattttca acacctttta aaagttcttc ccgtggcaac attttaaaca 3601 gttatcagtt atcagttatc agttatctag ttacaagtta gcagttattc agtcgttcag 3661 ttatgagtga aattccatat agctgtatta actgatgact gattagtatt cactgggcgc 3721 ggcgtaacct tgaatatttt ctggttcaaa ctgacgtaag acacgggttg cttgacgggt 3781 aagaatttca ggaccttgaa cgacaatcaa gtatttgcca gcattcaagc gatttcggta 3841 aggtaaagcg tcaccgcttc caaccaataa accgactcca ccaccgacaa acacactacc 3901 catagcgcct gcaccagcgc ccaacagtcc accgataatg tgattgccaa tttcacccgc 3961 ccaagcaaaa gtgtccaagc cagtgaggag gctaaaagta atacctgcga aaaatccaaa 4021 gggtaccagc caaacagcca tgaactgagt ttgcttcttt gcttgctcgt tggggtcaat 4081 caagccaaat tcatcagcag ttttataacc tttacccaaa attgtagatt ttatggcttc 4141 tttttctaaa gctagataag cagcttctgc ttggatgcgg tcagctaaaa cggcaacaag 4201 gtaattcatt gatgttaaaa ctcaagtcta ataaggatga gaatattact gtactaagtc 4261 aagaggctcg caaaaattat ttacactttt cttcaacaca tatattaaaa cgggtgttgc 4321 taagatctta ggaaacgtcg gcattgtttt gagatgttcc atcatccctt tgactgagcc 4381 attaaccaaa tcatcacgaa attcttcttg acaaatagcg gcacgccatt ttcgagctaa 4441 agcatccaac cattctgaat ttgctatttc tggttcttgt ccggctcgaa tcgctaaagt 4501 gattgcagca tcttctaaat ctccatcaca ttcttctata atatcgagtg cttccatagc 4561 tgtgaagtca tctgcaactt gagagcgaaa ctgtgcaatt tcttttgccg taactttcgt 4621 catcagattg tgttttcggt gtttgaagtt ttcctttatg ctaggatatg acaattttga 4681 tagctcgaat caacaattca tcaaaaaatg taaaggggca ggagaaaaag gggaaaggtt 4741 ttcctcttta acctttcccc tttaccttcc aagggcgcac tggctcctaa tgactattga 4801 gtaacctagc gtgctgctct agaggtgact gctcgttgta 
aagtagctag agcacgattg 4861 taattcaaaa ttgctgtgac tctagcacct tcagcttgtg tcaaggcatt ttctgagata 4921 ataacttcag tttgagtgcc gacacctgct tggaatcgca aacgtgctag acgtagagac 4981 tctctagctt gttcaagagc agtactagcc gtttgaacat tctgcaagtt agattgcaat 5041 tgagaataag aactttctac gttataacga atttggttac gcgcaatggc aaattgattt 5101 tcagcaatcc gaatgttagc tttttgttga gctgcggctg ctcttgctgc tccaccatcg 5161 tacaaaaaca atctagcttg gactcctaag gaataattat cggtaccgtt tctttgatcg 5221 tcaaaccgat cagagagctg gtagctacca accaaaccca cttgaggacc tagctgacta 5281 agcgctgccc gtcgctgttg ttcggcaata ttgcgttgcg ccaaattctg ttgcagttcc 5341 ggacggtttt taaaagctaa gacaagagtt tcaggcaggg gaacattcca cagagctgct 5401 attctgactg gatcagccga actaatatca actgactgcg gcaaacttaa tgtttgggct 5461 aatctacggc gagcaatttg ctgctgtgag ataccacttg tcagctgttg ttgataagtt 5521 gctaaattga cctgagcacg cagaacatcg aatcgagtac tcacaccagc ggcttccaaa 5581 gcttgtgcat cccgcaaact agcttgacca tttctcacag cagactgggc aatgcgtacc 5641 tgttcatctg cttgttgcaa gtcgtagtat tgagaggtga cactcaggcg gatttcctca 5701 gattgaactt caacagccaa ttcatcgaaa cgtagctgtt cctcagcagc tcgaatgcgg 5761 gctgttgttt gtccagaggt gtagagattg tactccactt gcgcagtact agagaaattt 5821 gtagtagagg gtacatcttc gagtgctgtg cgctgtgaat taagttgttg ttgattcaag 5881 aaagcaggtc caccacgggt gagaccagca ctcagactgg cagtaggtaa caaagcagct 5941 tgcgcctgcc ttacgctagc tcgactgcgc tctacattta gtaatgatat ctgtagatcc 6001 tgactgtttc gtcgtgctat ctccagagag tctgctaaag taatcggttg agttccctga 6061 atcctcactt cttctggttt ggtaggaaac tgtagaggat tcgggttggc tttgagataa 6121 ttagggatct gcaccgaact tgctgacgca ggagttgttg gtgcaggagt tgtttgcgct 6181 ggagttgttg gcgcaggagt tgttcgtgct ggcttagagg aacctgccgg agttattcgc 6241 gcaggcgttg ttgtcgcagg agttgttcgc gctggcttag aggaacctgc ctgagctacc 6301 agatttttca tggtattttg ttgcgagcag gtagacgact tctggaggac ggaagcagaa 6361 cgaccaacct gagttttacc atttttcgcc tgttgcgggc ataacttccc ctgatccaac 6421 agctttgcga cttctactgt cccacggact ggggtttgta acttgtttgg taactgagca 6481 gaagttttct tttctgtagg aagctggccc gaagaaagtg ttttttgttc 
aacagtggag 6541 gcgggtcttg aggtttcatt ataagaagaa aatgtatttt ttgacgcctc agcaataatt 6601 attgagttgt tttgtttttg tatgcttgtc ccagaaggag gaatcttatt agtcaactgg 6661 gtttgatttg aggtcggttt caagctcata agcatacttt gaaccttatg tgctacctgt 6721 ggtattggta taccagtttt tccagtcaat attacctcga caccgccacg agttatagaa 6781 gatattaaac cacctcccat aacatcaaca gttggtactg ctgcggctag aaaactgctg 6841 ctagcagtag gaggtaattg cggtttaata tcgagagcag ttggagttcc ctcgaaagta 6901 gaagtcaagc catcggaaga agcaaaaagc tttactccag tggctggtgc aggattagcc 6961 caagcaggct gagttgctaa tactgctgct gttacaccag gtaagaaact actaaatatt 7021 tgctgtctgt tcaccgcatc ccctcacaca gaaattaagc ggctgaaact gttttgtcgc 7081 cacagaatat atcacgatgc caagtttagg gatcggaact ctgtgtgtct tgaaatcctt 7141 tgacaatgcg agacagttcc gccttctcat cgatagtaac gcgcgtagga gcaccactga 7201 caattcgttc atagttgcgg aacgaatctt taatttcggg accgtcagcc gtgatattgt 7261 actcgcgaat acctttgtca tgccatgaac cacgcatttt aaagacgttg attgcccgcg 7321 acatttctcc ccgaatttcc acatactgta acatcaaaat tgtgtctgta atcgtagaaa 7381 tatgagagtc tgtaatagag tgcgaaccca taaattgctc agttgtgttg gtaaaaaagc 7441 ccgtaatttc ttcttgcttg gcataacctg taacaccaat gacaaactgc cggaacgcat 7501 tattacttac ccctcgtgct agtgccgaaa gagagtcaat ggcaatacga gatggtttaa 7561 attcagcaat ttctgactta ataatttgca agtggtcttc taaaccggtt gattcaggat 7621 acgtacaaat tatttttaac aaaccttgac gttctaaatc ctcaaaatca attccccatg 7681 aggaagcatt acgagagagt tgtgcgcgtg attcttcata agcaaataat atcacccgtt 7741 cactgtgaac acagccatct tgcagaaact tgctaaccaa cagagtttta ccagtaccag 7801 ttgctcctgt tgctagaata atcgaatccc taaagaaacc accgccacac atttgatcta 7861 aggctttaac accagaggat actctgacgt ttgaagaccg ttgagttaag cgcatcgctc 7921 ccaatgggaa aatattaatt cctgcattgg taattgtaaa aggatattca cccttcatat 7981 gtgttgtacc acgcagtttg agtatttcaa ttgtgcgacg gcgacgttct ccttctaaaa 8041 cattacgtac aatcgccaca ttatcggaaa caaattcttc tacgccaaaa caggcaacag 8101 gtccgtattc atccgtgcgt tctgtggtta taattgtagt cacacttaat tgtttcagac 8161 gagcaacgag acgaaatatt tctcgccgca caactcccac agcttcatac tgttgaaaaa 
8221 ctgctgtaat tgagtcaata gaaacgcgtt ttgccctata cttacgaata gcgtattgta 8281 gacgttcaat cagggcagag agatcgaagt tgccaacaac atcctgtccc tctggatcag 8341 gagaagcatc gagtataaat aattttcctt cactaataag acgttctaaa ttccagccaa 8401 aaataccagc atttttaata atatcactag gagattcttc aaatgtaaca aacactcctg 8461 attcatcaaa gtaggtaata ccgttataaa taaattgaag agaaaataaa gttttgccag 8521 ttccagaagt cccgctgact aaagtcgttc ttcctacagg taacccacca tgactaatgt 8581 cgtcaaagcc ctctatcatt gtccgaattt tttcaacacc tcgtgtcggt gtcttggttt 8641 gctctaaatt ttctttctga ctcatgttga cgtttacaaa taggaaggaa acccagttat 8701 tgtattaata agtagtagct actgctgatt aattttgtgg tctatattac tcttcaaacg 8761 gttctcgctc gtcttctatc aattcttcat aaagcaaatc caatccaatt aacactcttt 8821 ctctatccga aaggtctccg ataattctcc gaactggtgg aggtaaaatt ttggataatg 8881 tcggtgtggc taagatttta tcttcttccg ccagttgtgg gttcttcagc acgtcaataa 8941 cttttaaagc ataaacacct tgaaactctt gttccagaat tgtttggagt gtttttaatg 9001 cccgtactga gttcgcagtg ttccctgcca cataaagctt aagaacatag gtttttcgtg 9061 gttgattcat attaggtaca ggctagaagt acagatagaa ccacaatatc cttcgattga 9121 ctttggtgga attttaaggc tcgattttta ccaacaaaaa ctgttgaaca tttgatgaag 9181 ttattttatt gataaattga acaacggtat agttcacaca agtgagccag tatatcaatc 9241 aatgttatac ggtaatcaag taatgcttca tcactccttc cttctaattt tagctgtttg 9301 gaaaattcgt caatgatctc catgtgaatt tctatgattc gaggcacagg aatattagca 9361 gaaaagattg aattgataaa tttatcaatg cagtctttca gtgcttggtc tgtagtggta 9421 aagtagttta taaggatatg gcggtagtct gatttgagtt gtcgcaacaa tgcctgctct 9481 tcggctctcg tcattggttg ataaaaactg gtggaatttt agcttgtgcc atttaacagc 9541 atgggaataa ttcaagctta atattttatt tatattttgt cccagcaact tgagtaaaca 9601 ctgaagagag ctttctgtgg tacatctgac gaaaaattgg gcaagttgac caacaatcac 9661 gttagtaacg tcactcatct gcccaaagaa tccttgttca ctcgttgaat caactgtaga 9721 gacgttgcat acaacgtctg tttgagtcat ccaaaatgct ttgctatcta cgtggaaaat 9781 cattatgagg aataacatct gtgttgatac attatggttg ctcaccctac ggttaaaaag 9841 tttagagctt tttgtgagcc tctcacgctt ggctagaaaa ccgctagcaa catatgaaag 9901 
aagcaccatt caccctattc cctgttccct atcaaaaaaa acacttttgt cgtaaatatc 9961 aaagtacaaa gcaaatttgg taattgactt aaaggactga gtgtgttagc aatttctgag 10021 cgagaatgag cagtctgttg ctgctatgaa gttatagaac ttggtacatg gtagggtaag 10081 cggtgagtca aatgagttgg tacacatcta ccaccgtcac taagtattcg ccccggtctt 10141 aatttcgggg ctaccgcgct cccatgttgt ttacttgtgg tatatatcca ctcttgatat 10201 ctaccgaatt atagatttct taactttgct cattaataag atacttaaac aaagataaac 10261 gattgtcgca ttccaaatca ttataattcc taaaggttct atagcgaatc tttagagcca 10321 taattggtgt agctgtagct atggatgaca agtttcaaca cgcttgaaga taccagtgtg 10381 gacgcccaca ataccctaag ggctatgaga cagcgagcag ttcggataag gatttacctt 10441 tccaaacaac tgcgatgcta caggagggga gccactgctc aactaggatt tcctagattg 10501 ttaaggctgg ctcctcagta gtgtccttgg gacacgctac gtgtctctct agaagtgctt 10561 tgcgctgaca gaagagcaat tggtaaagca aacctgtgcc caaggcataa aaattcccca 10621 atttcttagg ggagcttttt ccacacattg gcggctgact ccagggtaga gtcgcctcca 10681 aactgccaag cggcaaaggt acatgccgat cgccgtgtga atcactctta ctggcaaaaa 10741 atgtcagtca ggaagaagct catctctaaa gcattcaatt tttgctcaga atgctagact 10801 cagtcttctt agaggaatag cttcaaactt gaacacccaa agcttctttg gcagttagct 10861 ggtggtgccg gaagcccaac ccctaacaag agcgacctca ttgccatatg tcaatgctac 10921 agtacccgct ttatgacttt atagcaaccg tgcctagctg cgtagaaaca gctactctgg 10981 caattgtgct ggaaattttt caacaagagg agtgcccccg cctagcagtt ttagataagc 11041 aaaaatgctt gttaggcttg gtttactctg cccgtttgct accacaatta ctgacaccag 11101 gtcaaggcaa aggtgattct aaatgtttag agttacagca acccctttcc acgctgggtc 11161 aagggttaat tgaacctata caaacaatac cagcatcttt gcgtatagag caattgagtt 11221 cgtttcttca ttcccagcaa attcaaacaa acacgaattt agattgggca ctggttgact 11281 caaatggcaa atttttgggg cttgtggata gcccacgctt gttgaaagtg ttgacaacac 11341 aaaaattgct cgcttgtact cataagggga ctaagcgcac aaattcaagg aaagcacctc 11401 aaaaggagac tgcggatact cttggtgatg acaccaaagt cgctggagtt aggggaacca 11461 ctgatagaac tcacacaggc gagcaaacac gagaatgtaa gccacttgtg cagttgttag 11521 aacgactgcc ttggccttta acgctacaaa caagcaacgg cgaagtcgtg 
acgcaaaatc 11581 cagcttggtg gcagcaattg ggagcattaa aagatccaga aggagtcagg cgacaagtag 11641 aaacgatttt ggcaaatgtc tcctccaaaa aattacaata cgcgactcaa acagcagcca 11701 aaatttcttc cattttctca tccacaaacg agtattcttg tcaagaaaaa tcttcgtcgc 11761 gattaggcga agtgatgcct caaaaagatg tgttaccttg ctcaacactg gcggcggatc 11821 ctcacactca gactcataac cttcaacagc agccaatagt tgaaaatccg gcaccaaatc 11881 gctgcttttt agatagccag caagggactt gtacctgcgt agtagaagtg caaaatggtc 11941 aggagcgggt ttggcagttt gccaaaatcc ctctagatag tcctgaattg aaagtcttga 12001 gtacaaattt aaaaacacct ctggctacca aaaactcagc actcagtagt gagttgtggc 12061 tgatgctagc cactgacgtt acagagcagc agcagctttc caaagaactc gtggcgaaaa 12121 atgccgattt aattcaactc aatcggttaa aagatgaatt tttagcttgt attagtcatg 12181 aactcaaaac tcctctaact gccgttttag gattatcgcg gttgctggtg gatcagcagt 12241 tgggagaatt aaacgagcgt caagcgcgtt atgcaggact gattcaccaa agcggacgcc 12301 acctcatgag tgtggttaat gatattttgg atttaacccg tatggaaacg ggacaaatgg 12361 agcttacgct caccaacgcc aacattcaga aagtgtgtga gcgtgctgtc tctgaagcaa 12421 aagctattca tacccaaagt aacaaagctt ccttaaactc ccgagatcaa agtacgtctc 12481 cacaagaaca ccaattcacc ctctcaattg aaccaggctt agaccaaatt gtggcggatg 12541 agttgcgctt gcgccagatg ctggtacacc tgctttccaa tgctttcaag tttactgaaa 12601 caggtggtga aattggactc ggggttagtc gttgggaagg ttggattgcc tttacagtct 12661 gggatacagg tattggtatt ccagaacatc agcagcattt aatctttcaa aaattccaac 12721 aactcgaaaa tcctctcacc cgtcagcatg aaggaacagg tttggggctt gtcttaacca 12781 gagcgttagc tcgtcttcac ggtggtgatg tgagtttctt gtcgcgtgag ggtaaaggta 12841 gccagtttac actacttctt ccacccagtc ctccgaaaga atcgggaaga ttggcagatg 12901 aggatatggg aacagcttct tttgcgccac gccatcctat cgccccatca acacaatcca 12961 accctactcg acaaaaagcc acccaacagg cgtctacgga aaatcccgtt gaaccttacg 13021 ggaaaacacc tggaagacaa catctacaac ctataaaccc aacacaaccc cacttcacga 13081 aaaaccctat acctggggga cgtcaaggac gtatgtccac aggacaggct gcgccaacac 13141 gcagcgtctt tgtatctcct gcggagacgc tacgcgaacg cgaagcgtct ccgaaggaca 13201 taggagatac gggtataacc tatgccttta 
acacatctct cgcgccaggg caggttcccc 13261 gtcagtccaa aagacaagtt ccaacgcgat tacaaacgca gtcgcccacg caaggaaagt 13321 cgttatttga tttgtctcac cgctccacgg gagtttacgg taagtcatct tgggcttcac 13381 gggaagccgc cctccaagcg tctataaccc atcacttgag ttcttctaaa tccctcgtgt 13441 tggttgtgga agcagttgct cgatatattg aggatttgac cgaacagctg acaggtttgg 13501 gctatcgagt cgtgattgct cgttcaggat gtgaagctgt agaaaaagct cgacgcttgc 13561 aacccaaagc catcttcctc aatccgttac ttcccttgct gtcaggttgg gatgtactga 13621 ctttacttaa gtccgatgtc gcaacatctc atattcctac cattgtgaca gcaacaggag 13681 ctgaaaaaga ccaagcattt gccaaccaag cagatggttt tttgagtttg ccagtgcagc 13741 atcaagtctt aacaccgctg atagaaagtt tatgtactcc accagaacaa aagcagcaaa 13801 aattagatga tgatgatatt cttcataaca atacaccact caaaattttg aagttggttg 13861 atcctgagag tgaatcttca acttcgcact cgttacttca acagcatcgg gttatagaag 13921 tggatgactt ggatcaagca gaattgctga ctcgggtttg gcagtttgat gtcgttttgc 13981 tggatataga aatgccagca gccgaagctt tgcttaaaca actctctgat cacccccgtt 14041 tagcgaatct accgcttgtc acctgtaatg tgtcaaccac ccaagctgct tctcaaatgc 14101 ctgggctttc tgtgtttcct tgcttaacac cattagcaac agataaaaat catggtggtg 14161 gcaaaacaga tgctctgttg tcagttctgc aaattgcatc tggtacatgc tactgcccac 14221 ccagcatctt agtcgtggat ttaggaatac tgccggattt accagataca agtcaagcca 14281 cagtcagggg ttgtcgcaca caaaaaaatt tctcattgaa tagccctata gctggagaaa 14341 atactccgtg tccttggatt atctctggtg gatctgagtg gtttcaagct ttgattcagt 14401 acctacaaac aggtggcttc aaagcctcaa tgagtcgatc ttgggcagaa ctgctccagc 14461 aaattcgcca ccaaaatgtt gacgtactcc tcatttgcct aggagagtct agcatcaaca 14521 aagaggtgta ctcagccctg aaagcgttgc aacagttgcc ttttgactta ccaccagttt 14581 tgctccttga tcaacgatta aattctgatc aggttgaaga tgagcaaact gagattcccc 14641 tattggaatc catagaaact gtcataagta ctcttgcctc ccagatttta cctcgctcta 14701 tatcaatgga gaacttacta aacgagatta ataaagcttt ggaaacttga gacgaaaaac 14761 acaaacttcc actcttccac ttccactctt ccacttccga ctagatactc cctgttaagc 14821 gttccctgtt ccctattccc cataactaaa ttctcgttac cttctgagtt ctttcttgtt 14881 aaaaggaact 
cgaatcagtt gataagcccc gcttgtaccc aagttttcgt aatcagttgg 14941 ctttaagttc atttcaggaa actttgtgag ttgagcaaga actaacactg agggaggctg 15001 tgctttcttg gcagctttgt cttgaatata ctgttcagca tctgtagata ctttgacgta 15061 atgaatgatc tttttagtat aaaaagcaac gcttggtttt ttgaaaccca ccatgataag 15121 ttcttcattg ggttgttgtg cttggactgc gatcgcagac aattcccgca aaggtagctg 15181 acgttcctga tctatcaaaa acaaacaagg cgtgaggaca aaaaccaaaa acgccgcaaa 15241 ccccaacaca ttcaccatta acatcggctg ataataacga cgcacaagta ttgctgccag 15301 tatgagagca caaagtagcc aaatgacacc tcccagtact gataagcctg actgttgaaa 15361 caactcggag aagttaggtg cagcaggatc ccttattaac tgaggaacgt aaaacattgc 15421 gactgctaac actaataaaa atacgacatt cacccaacct gtccagaaga agggacgatc 15481 aagagaatca gggggatatt gctccctcat tttctcatcc ttgagcagat cgctccacaa 15541 taaagctacc aaaatagctg ctgctggcat taagggcaac acgtagctgg ggagtttggt 15601 aacagcagtg gtgaagaagc caaaaacacc aataaaccag aaaaaggcaa ataaaccaag 15661 ttgactagag cgtttttgag agcgccagta ctttggttgc caaaatttca gccttgccat 15721 tgccaaaggc aaatacactg agtatggtgc aaaacctact aatacaacga aaaagtaaaa 15781 ataccaaggg gctgagtggc ggttaaccac acttgtaaaa cgttctaaat tatgataacc 15841 aaaaaacgaa ttaatataat tttcaccatt acgccaaata actagcaaat accagggtac 15901 tgataagcat aaagtaatga gtaaacctgt aaggggacgc atttcccgta aaacggcttt 15961 tacatttccc aaataaagca aaaatatgcc gataatcagt gcaggtaaga caattcccac 16021 tggtcctttg gttaaaattg cacctgcaat caggacataa aaagccaaat accaccgtgc 16081 tttcacttct gattgtgagg agggttgagc ataagcaaga aaaaagcaca acaacgctga 16141 gtccatgcag ccaacaagca acatatcaga aacacccgtt ctcccccaga caatcatttg 16201 tgaactaagt gccatgacag cagctcccaa accagctgtt aaccagcgtc gagttgggcg 16261 agttgttcgc tcaaggtagt cttgtcttgc taaatgccac tgtatagtgt aaaatgccaa 16321 actaattagt cccgtagcag caagcgctga aggcaaacgg actgcccatt cattgactcc 16381 aaaaattgag taggcgatcg cctgacacca gtaaattaat gccggtttat caaaacgagt 16441 ttcaccattg aaaaatggag taatccaatc acctgtaaca aacatttggc gggaggcttc 16501 agcaaacagt ggttctgtct catcaatcaa gccaatactg cccaaatgcc aaaagaatgc 
16561 taccccacca atcaaaacta accataaaat cgacagagta acagtgaaga ctcgacgttt 16621 gcccatatag tcattaatca ttagttataa gtaagtagtc aatagtctaa atcctaaatt 16681 aatcaacatg actttggctt agcttttcgg ttccaagtgt tcaagactaa agactaaaga 16741 ctaaggacta aggactaaga atcagttgcc attctttctt ttgagaattc cagcttggta 16801 aggaagtttc tccaacaaca tatggacccc agtagcgacg ggaaccatcc tgtgcaaaag 16861 caactaaaat atctcctgtc tccagttcta acttaccgtt tggcaaagct agaatcatcc 16921 ccttctcact accctcttgt ggagcaaaat cccaatgtgc tgggacagca tctttaatct 16981 gtccagagga agagcctttg tttagagaac gccgcgacag tagagcaagg cgcacaggtt 17041 gattagtttg gttactcatt cgtaaagttc cttgatttgc tgcttgccca cttgatccct 17101 gttgatatgt cataacggta tgcttgatcc cacccctgct actttcgact tgataacttt 17161 ttgcggatac tgatgaacca gatgatgatg ctgcgctcac atcccgatgt gtttgagcag 17221 atggtagaga agatgctttt aattctgtgg actcgaagga tacactcttc tgacggcatc 17281 ccattggtag aaacagcact cccaacgaaa aacctaccag aacagcgtga caatacactg 17341 gcgggttcat ggtaatatat tttacgaata agtatttttt gtgtgggctt gaccaagtaa 17401 cagcctaata gtagtaagag caataagaca gaaagttgcc cacccaaggc tatctgggaa 17461 ctagtgcgtg aaatatatca tgtatgcaac ttttaagcaa aacagcaaag tattgctcac 17521 aatgaagaaa agtatatatt ttgtatggtt atttaaaacc caacaaatgc ccacaatact 17581 atttaagcaa gcatatgcgt ttgccagata tcctccatag tgaacttgcc agcaataaat 17641 ccacaaccat tacccagcct ttgggctagc atcaacacta atttccgttt tctacaccca 17701 ttaacgaata accttaggac attttatttt ttggatctcc ctatagaaat atgtcataat 17761 ttcttgactt taccttgaaa gtagccctca taaactgtat acagcacaac acatgaaaat 17821 atttctccct ctctccctcc ctctctcctt attttcaagc taggtaaggg ggtttttgaa 17881 caggtttcaa aattcataca tacggatgca gtgcctactc tcccacactc ccagtccgca 17941 gcggaagttc tattcatgca aaatacacgc ctaaacaacc tatttaatgc cattgcatta 18001 cgcttgcggc tatggttttt caatccttgg cgacgatttt cggttttgat aattagtttt 18061 ttgttcggtt tttttctggg aagcgcagtt gctaccacag ctggacaaag tgctgaatgg 18121 gacgttgtcg ttgctggaat gttaatggta ttaacagaaa tcggcagccg aatatactac 18181 ggtcgtagca tacgagagcg acaggcgctt tgggtagaat 
ctctgaatat ccttaaaatc 18241 ggtttaatgt acagcttgtt tctggaagct tttaagcttg gttcgtgaac ggaaaaggaa 18301 gaaggaagaa gggagaaggg // LOCUS NODE_1799_length_18277_cov_5.177752 18277 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18277) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18277) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18277 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 159..434 /locus_tag="DP116_15465" CDS 159..434 /locus_tag="DP116_15465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318121.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_15465" /translation="MSDVDKYIARRKHTDPEFPEDFESGYSSFKIGVLLAQARIEAGM TQEELARRLNLHESIIIRIENDGLDVGISTLERYANALGKKLYVEIQ" gene complement(666..1697) /locus_tag="DP116_15470" CDS complement(666..1697) /locus_tag="DP116_15470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318120.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_15470" /translation="MTTSQDYEVFLCHNSKEKQQVERIRTQLKHQGILAWLDKYDFEP FRPWQDQLEEIIPQIKAVAVFIGSSGVGPWANIEMREFLVEFANRKLRMGLVILPDCP QELINSVPRFIRSFHWVDFRQQEPDPMEQLIWGITGQKPVPTVKVTPQSQSDNLSNEP KKEVPTTQELRNLEVIETQNTPQNQRDDSEDDLSSERGVNYTKLRNLLKAGQWKEADL ETVTFMLKATGREIESWLDVESIKNFPCTDLRTIDQLWVKYSSGRFGFSVQKRIWESV GKDYEKFGDRVGWRKGMPWKKEWLYYNKLTFSTEAPQGHLPVWGFRGVKKNEDLWPDL FSRVQTCKV" gene complement(1715..2212) /locus_tag="DP116_15475" CDS complement(1715..2212) /locus_tag="DP116_15475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15475" /translation="MPKIDPAKLKKVSVSLPFGIGSAEWEADPTERRAAWSLYVELVT RIAVQPLEVDEGLLREALTSLYSLFGITREVLKQAGPDVGASRESVGGIAIAVLNKGL RPFLAKWHPVLQTWEAQRPPHLSLKEHERNWSQEAKLRHELEALRRDLEQYANALAQI AGVDK" gene complement(3384..4478) /locus_tag="DP116_15480" CDS complement(3384..4478) /locus_tag="DP116_15480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317786.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_15480" /translation="MIGVAVIGTGFGQKVHIPGFLAHPSIEIVAVYNQDLNKAKAIAQ SYNIPHACNTITDIVGLQEVQAVSISTPPFLHYEMAKAALQAGKHVLIEKPTTLNAAE AKELYQLAQKAGVIATVDFEFRFVPGWQFFSELLSEGYVGSKRLIRIDWLGSSRADAG RPWNWYSRKDQGGGALGSLGSHTFDYIHWLFGPVRKLSAHFTTAITERLDSNTGELKP VETDDTCMLMLELADGTPCQVSISAVVHASRTHWIEVYGDRATLVLGSENQKDYIHGF RVSSSGPGKPLTEIEIPNRLLFPKNHSDGRISAFIRVVDEWVQGIESKKEIVPSLREG VYSQLLMDLSHESNNSSSWVDVPSLEEFLA" gene 4831..6300 /locus_tag="DP116_15485" CDS 4831..6300 /locus_tag="DP116_15485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655810.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydantoinase" /protein_id="PRJNA477356:DP116_15485" /translation="MSEAPVLDKIIKNVRVVRPHHDAVELLDLGIKDGKFATIAPDIS PDKGKDVLDGKNLLGFPGVVDAHMHIGIYQPLAKDAVTETKAAAMGGVTTSLNYIRTG QYYLNKGGSYRDFFPEVLALSAGHFFVDYGYHVAPIASQHIDEIPLLFKEHGVSSFKI FMFYGGYGLHGLSDQQNLFLMINKEERYDFAHFEFIMRRLSRLMEEHPEAQETISLSL HCEVAEILNAYTKIVEKDSSLSGLNAYSAARPPHSEGLAICIASYLAHETNCANINLL HLSSRKAMEAALTMQTAFPHINFRREVTVGHLLLDVDTPTATWAKVNPPIRPRADVEY LWQAVLNHQVDWIVSDHACCSAEQKRSTKDPNNIWLAKSGFGGTEYLLSGVFSEGRKR GMSYNHMAKLLSWNPSRRFGLLQKGDIAIGYDADLVLVDPNETFVVRAAESESQQGYT PFEGVELTGRVKSTFLRGNLIYNNGQVLGLPIGRYLKRC" gene 6545..6991 /locus_tag="DP116_15490" CDS 6545..6991 /locus_tag="DP116_15490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206193.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3598 domain-containing protein" /protein_id="PRJNA477356:DP116_15490" /translation="MSAIREEMPVLTRHEGDWVGTYTVVDTEGKIVDKYESHLTCQFP EDGSHSYYQINRYKWSDGKQEEYEFPGTYRDKALWFDTERIDGKAWEVDDATVILWFS YKTVPDMYLYEMIVISPCNNHRARTWHWFKNNQLFKRTLIQEERLR" gene complement(7083..8426) /locus_tag="DP116_15495" CDS complement(7083..8426) /locus_tag="DP116_15495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316572.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aromatic ring-hydroxylating dioxygenase subunit alpha" /protein_id="PRJNA477356:DP116_15495" /translation="MIVDSKIEEQARQTALQIEQNQEFNWRECWYPVCFVQDLPKNRS YSFSLYDEPFVLFRDLDGKLVCLVDRCPHRAAKLSDGQITDGKIECLYHGWQFGSDGQ CLHIPQLATDAKIPANACVQSFKIVERQGMVWMWAGVGEAAADDDIPTIEELDKPEFV TTDVMRDLPYDQFYFIENIMDPAHVHISHDGTLGQRENAKPSEMEVLENSSRGIRGRL RGMSKPNLPWSQLDFIAPNFVIYKFSVPQRGVAGGVAFYSIPLGKGRCRILVRNYNNF KTWKFKLTPRWLDHMLRNRVLEEDLPLIIGQKTQIERLGQSLKQVFLPLKTCDTFVVE YYKWLDKFGSSLPYYQGYSTSKNIDNEGNCLNPPSLDRFSQHTQLCSSCSGAYQVTNR VKQISVGVAIALAALAISANGWMQILAVSASLGAVGIAVAAQKLKTHFERPYTRH" gene 8578..9966 /locus_tag="DP116_15500" CDS 8578..9966 /locus_tag="DP116_15500" /EC_number="6.1.1.21" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411674.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine--tRNA ligase" /protein_id="PRJNA477356:DP116_15500" /translation="MAKSDKINFSTPSGFPEFLPGEKRLELYLLDTIRKVFENYGFTP IETPAVERLEVLQAKGNQGDNIIYGLNPILPPNRQAEKDKAGETGSEARALKFDQTVP LAAYIARHLNELTFPFARYQMDMVFRGERAKDGRFRQFRQCDIDVVGRRELSLLYDAQ MPAIITEIFDAVNIGDFLIRINNRKILTGFFKSVGVEEEKIKSCIGIIDTLDKVGENK VKQELVKEGVLADTTQKIIDFIHIDGTVDEVLDQLKYLATSTPEAEEFALGVTELETV ISGVRNLGVSENRFCIDLSIARGLNYYTGTVYETTLIGHEALGSICSGGRYEELVGMF LDEKMPGVGISIGLTRLISRLIKAGILSTFAATPAQVMVVNMQNDLMPVYLNVSQKLR QAKINVITNFEQRPLGKQFQLAEKQGIPFCVIIGSEEATAQKASLKDLRTREQMEVAL EDLAEEIKRRLA" gene 10132..10959 /locus_tag="DP116_15505" CDS 10132..10959 /locus_tag="DP116_15505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865438.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_15505" /translation="MNKAIKRAFLDTEDGQIHYRIGGEGEALLLLHMNPRSSDEYREL MPILAQKYRVIAMDLMGFGDSDKPPRLYSVADYAKTVIALLDELGIEKVNLLGNHTGA FVSGEVTAAYPERVNKLILGNVAGFGEAGKTDLMQRFDEGFVIKEDGSHLMERWLARS RYVGSAELNHRWVLDDLKCFGYPLYAVWTVGNYCMEAAERFSFIKCPTLILWGIHDVE EFERLGLALAKDRFFLSQAIPHAKVAEFPDGTICMMNQIPEEISHVVLEFLDETSVS" gene 10994..12001 /locus_tag="DP116_15510" CDS 10994..12001 /locus_tag="DP116_15510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198396.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_15510" /translation="MSSKTYKKLVAKQFAQNFKSAIEIIELPIPEPAPDEIVIRNKFA GINAGFDTLLCRGDVSYINLTPPFDLGVEAVGEVVAVGNHIKDFQVGDAVVTTIRGGG YREYQAINANLAIKVRQATPEVLTLIPTGVSAMVALEQVGEMKSQEVVLVTAAAGGTG HIAVQLAKLAGNHVIGTCGTDAKVELLQELGCDRIINYRTQNLNQVLKQEYPNGVNLV FECVGKQVFDTCVDNLAVRGRLVVVGFVSEYAKNLEQVTQPRIYHKLFWKAASVRGFL MPLYKEYMTEGRDRLFNLFYTNKLKVAVDSTPFHGIESITAAVEYLLSGQNCGKVVVR F" gene 12041..12430 /locus_tag="DP116_15515" CDS 12041..12430 /locus_tag="DP116_15515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876017.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nuclear transport factor 2 family protein" /protein_id="PRJNA477356:DP116_15515" /translation="MSQQSENTLKVAHQAFEHFQHGLATGEWNQFLDVLTEDFSFWFP IGKYHGLHQGKEKAREFFQYIAESLRGELILEHVTSNETTVVFEFRDEGTLFGELYKN RVAVSFDVRGDKICGYREYFGSDGKSN" gene complement(12453..12917) /locus_tag="DP116_15520" CDS complement(12453..12917) /locus_tag="DP116_15520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456322.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15520" /translation="MNKPLNEALSIETAQRVKSKAKTQFSNAYKAALLTKGAFYVQGF LAFAGKPHKPIEHGWMELEDCIVDPTLPYLNKNAQELWYFTAQRLTVKEVKAIIEESK EDYPEDDPLPVYGDPPYEYYGNVMLGDKSYLEAYQAAEAKCRELNQKNADKN" gene complement(12990..13361) /locus_tag="DP116_15525" CDS complement(12990..13361) /locus_tag="DP116_15525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15525" /translation="MITKAFVIAILKNSSNFLLSLLASLARQLKTAMIHLVKGFTLNS MHARELTTVKRPMCVKCVSAGATGELVRVLGTRGSAVLGSPQVERLPCTSALGGDVRP KSSGRKKSMKTEAIPNELGSD" gene 13456..13947 /locus_tag="DP116_15530" CDS 13456..13947 /locus_tag="DP116_15530" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15530" /translation="MNFSQNNQNFQELDKILRQSINDTLQKFLDENLKNHIETSVKEF LNKYVDEQKKIQRLYYTKTNAIIDNTTASQELYHKLEQTLEELDSLKNSVENMLMEQR QKNGELQRKINCWEQSAIDFFRLLERAVDYETDERRLLINRILYGFNDLVNNLGIERI IPQ" gene 13957..14094 /gene="grpE" /locus_tag="DP116_15535" CDS 13957..14094 /gene="grpE" /locus_tag="DP116_15535" /inference="COORDINATES: protein motif:HMM:PF01025.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotide exchange factor GrpE" /protein_id="PRJNA477356:DP116_15535" /translation="MHENFHEAIDEEESDIIPGNIVKCISWGYRIGDKVLEKAKVVVA K" gene 14291..16039 /locus_tag="DP116_15540" CDS 14291..16039 /locus_tag="DP116_15540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molecular chaperone DnaK" /protein_id="PRJNA477356:DP116_15540" /translation="MTKKYAIGIDLGTSTSEICVYRNNESLVIPDPVTKIAMIPSIVA INKKGELLVGENARSWVDVSERGVREVKRKMGTGETIKLLGKEYRPEEISALILRQLK ENAEEALGIEIREVVLSVPANFPDAARQATLNAGELAGLKIIRLINEPTAAALAFGIK NIDVEEQLVVFDFGGGTLDITVLEMVAGVLDVKCSFGNPQLGGKDFDEAMMTLLHRKF KTENPEAEISQKAHGALKEAAEKAKKVLCTQQSYDVRIPYFAANNGEFIDLEVEVTHQ EFEVAIAPLLQKARDCIRQALNAKNLHPSTINRVLLVGGTTYIPAVRQLVVEMFGKQG KALDVGADLAVGVGASIHAAFAQGLFCEDSGVILTDVAPFGLGIEVVSYVGGQYMLTY EPLIQPNTTIPYSTQKTYTLLKPDQKRLEIRLYQDNTGKAKLPLEAIDTGIEAEITDI PPAVDGIPYPVEVEFYYDINGIAKLKATIPNINKSVELSYGYSAKRMGNKDIADAASR LKELWKQNAKARLYEGLINKAERYMAGIPPQERSPLSDIVMELKKGLMNDNIQEIQKA GDRLVDFLFDLEKNME" gene 16046..17269 /locus_tag="DP116_15545" CDS 16046..17269 /locus_tag="DP116_15545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197847.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15545" /translation="MVYELYHILGISSQASADEIKRAYFQLVRKYSPEKDPERFQQIR IAYNTLFDSKERENYDAMQKYGDQVKDLILQAQNKMQVEEWTNAISLLKQVLVLAPRI DIARNLLGLCYIHTKNWDFAVKVYTALTKTNPDVAVYWSNLGYAYKLQAQCFNDEDIS QIQLYHNARESFQQVVKLESFNSAPYLDIAETYLDQKNYSEALAWAERAIGADGKADY HDFEALFLICRVHFYSGELQKIEVIAKRIISLLPKKSEIREYAATRFANMGIEIAKNA AISSNFHMWRSAFEFIKIAKEIEPNNLGIQQILTKLEEIVAAINQYENLNRDYLINQG FQRLAAFCLADYFNFYDSPQERKSFLNDILTEILVSPTSTIFASLERIKFYYPAVYKL NVELFHRIEQVACVP" gene 17958..>18277 /locus_tag="DP116_15550" CDS 17958..>18277 /locus_tag="DP116_15550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_15550" /translation="MSIYVGNLSYQVEQDDLRRVFEEYGTVKSVQLPVDRETGRVRGF AFIEMGTEAEETAAIEALDSAEWMGRNLKVNKAKPKENRGSSGGGGGGGGRGGWNNDR GGDRG" BASE COUNT 5432 a 3588 c 3928 g 5329 t ORIGIN 1 gtgagacagc gctgcaggag ggtttccctc cgtaggcgac tgcgaacccg aagggacttc 61 gtttgtatag ccccagactt ccagtctgtg ggatgaggaa aatttacaca tttgggatgc 121 tcccccaaaa agagattatt tcagcaggaa gcaaaatgat gagtgacgta gacaaataca 181 ttgccagacg caagcacact gatccggagt tccctgaaga ttttgaatct ggatattcca 241 gttttaaaat tggcgtattg ctggcgcaag caagaattga agcgggaatg acacaagaag 301 aattagcacg tcggttaaat ttgcatgaat ctataataat tagaatcgaa aacgatggct 361 tagatgttgg tatctctact ttagaaagat atgcaaatgc cttggggaag aaactttatg 421 tagaaattca ataagtgcgt agacgcaaaa tgactgggaa ccaacgaacc aagaaataca 481 agaagtagcg tgcaagttta tcttcgcgaa ttgttataac ccacaattcg ccagcataat 541 actacaagca caacaagaag atcctctttg gattcagtgc cagatcatga aaacgtagaa 601 accgcaatac ttcgtggaaa atcacttgtt tgcgaatctt tacaattctg tgtaagcctt 661 atatgttaga ctttacaagt ctgtacgcga gaaaagagat ctggccataa gtcttcattc 721 tttttcaccc ccctaaaacc ccaaacaggg agatgtcctt ggggggcttc tgtggagaag 781 gttaatttat tgtaatacag ccactctttc ttccaaggca ttcctttgcg ccaacccacg 841 cgatcgccaa acttttcata gtctttaccc acactttccc aaatgcgctt ttgcacacta 901 aagccaaagc gcccactact gtattttacc caaagttggt caattgtgcg gaggtcagtg 961 cagggaaagt ttttgatgga ttcgacatcc agccagcttt ctatttctct gccagttgcc 1021 ttgagcatga aagtaacagt ttccagatcc gcttctttcc actgtcctgc ttttagcaaa 1081 tttcgtaatt tggtgtagtt tacaccgcgt tcactgctga ggtcatcttc tgaatcatct 1141 cgttgatttt gcggggtgtt ttgagtttca ataacttcta ggtttctcag ttcttgggta 1201 gtggggactt ctttcttcgg ttcattactg aggttatcgc tttgactttg tggggtcacc 1261 ttcacagttg gcactggctt ctgcccagtt atgccccaaa ttaactgttc cattggatcg 1321 ggttcttgct ggcgaaaatc aacccaatga aaactcctta taaatctagg aacgctgttt 1381 ataagttctt gagggcaatc cggcaatata accaatccca tgcgaagctt gcggttagcg 1441 aattcaacta gaaactctcg catttcaata 
tttgcccacg gtcctactcc tgatgaacca 1501 atgaatacgg ctacagcttt tatctgagga ataatttctt ctagttgatc ctgccaaggt 1561 ctaaacggct caaaatcata cttatccagc caagctaaaa taccctggtg ctttagctga 1621 gtcctaattc tttctacttg ttgtttttct ttgctgttgt gacaaaggaa gacttcataa 1681 tcttgggatg tggtcatatt tcagcttttc cccactactt gtctacacca gcaatctgtg 1741 ctaaggcatt tgcatattgt tccaaatccc ttctcaacgc ttccaactca tgccgcaatt 1801 tcgcttcttg cgaccaattg cgctcatgct cttttaaact caggtgagga gggcgctgtg 1861 cttcccatgt ctgcaacact ggatgccatt tagcaagaaa aggacgcaaa cccttattca 1921 gcacggctat agcaatacct cccactgact cgcgggatgc accaacatct ggtccagctt 1981 gtttgaggac ttcccgtgtt atgccaaaca gactatataa agatgtcagc gcctcacgca 2041 acagtccttc gtcaacttcc agaggttgaa cagcaatccg ggtaacaagt tcaacataca 2101 acgaccatgc agcccgacgt tctgttggat cggcttccca ttcagcggaa ccgatgccaa 2161 agggaaggct gacagatact tttttcaatt tggcggggtc tattttaggc atgagtttgt 2221 tgtcaggagt atatagttgc ttgttttaag ttttaacaac tgtccgcttg ccatagttgg 2281 cgcggttgca ggaaattctg tttgataact taataatttt ataattacat acagcctgct 2341 aggtgtgaat tccgggtgag taggaacaaa gcacgacgct ttggggaata gaattcgcct 2401 tacagcattt ttcaggtgga tggaatacac ttgtccctaa ctctctgctg cgtgagggga 2461 gccagtgtgt tttaagggta ggacttacgc agaagagatc ccccaactcc cttaaaaagg 2521 gggcttttaa cccaatacgg ttcagttaag aagaattgta ggttgggtta agcgcagcgc 2581 aacccaacaa aaacgaggta ggtgttgggt ttcgttccaa gggcccaacc tacgtcagca 2641 ttagttttta gccttatctg aaccgtattg gcttttaacc ttcttttacc ccctttttaa 2701 gggggtcgcc gcaggcgggg ggatcttaac cgaaccgtat tgaaacctca ccctgcccta 2761 tcgggcatcc ctctccttag taaggagagg gaaagatttt tgcgtagcaa aaagcgaggg 2821 tgaggttttg agcgagccgg tgtgtatata gccttataaa aggaggaaga agaatgatgt 2881 tctctctatt gatgtctatt tacttactgt aaatacaaca gctatatcca aagaaaaacc 2941 ccaatcttat cggggtgtgc tgtgctttta tttatgcagc gtattccagt gttgtattgt 3001 attttcattt gaagaaaaca attccacacc atcccaaagg agtaaaactt aatatgtaac 3061 tgagtataca tattttttca agatgagagt ttaaggaaaa agccggaatt gtttactaaa 3121 aaccaaattc aaatttcaaa aatagcctga atgaccatta 
tcctatcaac tacaaaaaac 3181 cccgacgaag tcggggtgaa ggtctttgcc aaaacaggtt tcctgtgacg tttatattgt 3241 gatctacaaa taaaccttcg caccccatcc aaaagataga aagctgaaat tcgtgatgtg 3301 gtatacatct tgttagttta tgatttttct atttcatgag tcatgaaaca agaagcacaa 3361 tgaaacgtta acctacccta ctcttaagca agaaattctt ctaagcttgg tacatctacc 3421 caacttgaac tgttattaga ttcatgggat aaatccatta ataactgaga gtaaactcct 3481 tctcgtaacg atggaacaat ttcttttttg gattcaattc cttgcaccca ttcatctaca 3541 actctaataa acgcagaaat gcgaccatcg gagtgatttt tcggaaacaa taagcgattg 3601 ggaatttcaa tttcggttaa tggtttacct ggtccagaag aggaaacacg aaagccatgt 3661 atgtaatctt tttgattttc acttcctaat actaaagtcg cgcgatcgcc atacacttct 3721 atccaatgag tacgcgatgc atgaaccaca gcactgatag aaacttgaca aggtgtccca 3781 tctgccaact ctaacatcag catacaggtg tcatcggttt ctacgggctt taattctcca 3841 gtgttggagt ctagtcgttc agtgatggcg gtcgtgaaat gggcgcttaa cttgcgtaca 3901 ggaccaaaca gccaatgaat ataatcaaaa gtgtgagaac ccaaagaccc caatgcaccg 3961 cctccctggt ctttacgaga ataccagttc caaggacgtc cagcatcagc acgagaagaa 4021 cctagccaat caattctaat caaacgcttt gaacccacat aaccttctga caacagttcc 4081 gaaaagaatt gccatcctgg tacaaagcga aattcaaaat ctacagttgc aatcacacct 4141 gctttttggg ctaactggta aagttcttta gcttcggctg catttagagt tgtaggtttt 4201 tctattaata cgtgcttccc tgcttgcagc gctgcttttg ccatttcgta atgcaaaaat 4261 ggcggtgtag aaatgctcac tgcttggact tcttgcagcc caacgatatc tgtgattgtg 4321 ttgcaggcgt gaggaatatt gtaggattgt gcgatcgctt tggctttatt taaatcttgg 4381 ttatatacag caacaatttc tatgctagga tgagccagaa atccaggaat atgcactttc 4441 tgaccaaatc ccgtgccaat aacagcaacg ccaatcacag ttaaacgagt tcctttttat 4501 atttttctgg gttattatat caaattcggt ggcgtctgaa tcattaaaaa actgcagaga 4561 cgcagagggc gcagagagag agaggagaga gaaataattg tcatataaac ggacttacca 4621 tgattgatta atagagcctt cctaaatagg atatgaacat cctcctcttc ttctctttct 4681 tttctttgcg tcctttgcgc ctttgcggtt cgtttttttt tgttcatgac ttatctagga 4741 ttgctgtaca accgttgaat gaaataatat catctgtctt gatgttctta aacttttggg 4801 ttccatactc attttagtgt gaggtctact gtgtctgaag ctcccgtatt 
agataaaatt 4861 atcaaaaatg tgcgggtagt tcgtccccat catgatgctg tcgaactact tgatttagga 4921 attaaggatg gaaaatttgc tactattgct cctgatatta gcccagacaa aggtaaagac 4981 gtattggatg gcaaaaactt gctgggcttt cctggggttg tagatgccca tatgcacatc 5041 ggtatctatc aacccctcgc caaagatgct gtgactgaaa ccaaagcagc tgcaatgggg 5101 ggagtcacaa ctagtctgaa ttacattcgt acaggacaat attatctcaa caaaggcggt 5161 tcctaccgcg atttttttcc agaagtattg gcgttatctg caggtcattt ttttgttgat 5221 tatgggtatc acgtcgcacc tatagctagc cagcatatcg acgaaatacc tctactgttt 5281 aaagaacatg gtgtatcttc gtttaaaatc ttcatgtttt atggcggtta tgggttgcat 5341 ggtttgtcag atcagcaaaa cctctttttg atgattaata aagaggaacg ttacgacttc 5401 gcccattttg aatttattat gcgtcgtcta agtcgcttga tggaagaaca tccagaagca 5461 caagaaacta tcagcttaag tttacactgc gaagttgcag aaattctcaa cgcttatacc 5521 aaaatagttg aaaaggactc cagccttagc ggactaaacg cctacagtgc agcgcgtccc 5581 cctcattccg aaggattagc aatttgcatt gcttcgtatt tggcacatga gacgaactgt 5641 gcaaatatca atttgttgca cctgagttcg cgtaaggcta tggaagcagc tttgactatg 5701 caaactgctt ttccccatat caactttcga cgagaagtga ccgtcggaca tttgctatta 5761 gatgttgata cccccaccgc tacttgggca aaagtaaacc cccctattcg tccgcgtgcc 5821 gatgtagaat acttatggca agcagtactc aaccatcagg tagactggat agtaagtgac 5881 catgcttgct gttctgctga acaaaaaaga agtactaaag acccaaataa tatttggtta 5941 gcaaaatctg gttttggtgg tacagaatat ttactttcag gtgtctttag tgaaggtcgt 6001 aagcgcggaa tgtcgtacaa tcacatggct aagctgttat cgtggaatcc gtcacggcgc 6061 tttggtttgt tacaaaaagg ggatatcgcc attggctacg atgctgattt agtactggta 6121 gacccaaatg aaacctttgt ggtacgtgct gctgaatcag agtcacaaca aggttacaca 6181 ccctttgaag gagtggagtt aacgggacga gtgaaaagta cctttttacg tggaaatctt 6241 atctacaata atggacaggt tctaggttta cccattggac gttatctaaa aagatgttag 6301 agcaatctcg atttaaatct tgttgcttat agcgagttgc attcagagat agaattgact 6361 aaacgaaccg ccaagacgag ccagtgcgtt gcgggggttc cccccgttgt agcacctggc 6421 gcgccaagag cgcagaggaa gagaaaagag aaaagagtaa gttatctgta tgaatgcaac 6481 ttagtattac ttgttgctta aaatctttat taattgctat ctgcaaagac aaaggagttg 
6541 tgttatgtct gccattcgag aagaaatgcc tgtactgact cgccacgaag gagattgggt 6601 aggtacgtat acagtggttg atacagaagg aaaaatcgtt gataaatatg aatctcactt 6661 aacctgccaa tttccagaag acggttccca ttcctactac caaatcaatc gttacaagtg 6721 gtctgatggg aaacaggagg agtatgagtt tccgggaact tatcgagaca aagcgctgtg 6781 gtttgacacg gaacgtattg acggaaaagc ctgggaagtc gatgatgcaa cggttatttt 6841 gtggttttct tataagaccg tgccagatat gtacttgtat gaaatgattg tgattagtcc 6901 ttgcaataat catcgcgccc gcacttggca ctggtttaag aacaatcaac tcttcaagcg 6961 aaccctgatc caagaggaac ggctacgata agaagaaatt gagcgcttct agatgatttg 7021 taaagtagat cttcaatgcg tggggtgcgt taagcagtgt aacgcaccat tgtttcattc 7081 ctttaatggc gagtgtatgg acgctcaaag tgagttttga gtttctgggc agcaactgct 7141 ataccaactg cgccaagaga agccgaaact gccaagattt gcatccaccc gttggcagat 7201 attgctaaag ctgctagggc gatcgccact cccacactaa tttgtttgac tcgatttgtc 7261 acctgataag caccagagca agaactacac agttgggtat gttgcgagaa cctgtctagt 7321 gacggtgggt ttaaacaatt cccctcgttg tctatatttt tagaggtgga gtagccctga 7381 tagtagggta aagatgagcc aaatttatct agccacttgt agtactctac gacaaatgtg 7441 tcacaagttt tcaggggcaa aaacacttgt ttcaagcttt gtcccaaccg ctcaatttgt 7501 gtcttttgcc caataataag tggcaaatct tcttccaata ctctgttgcg aagcatatga 7561 tctagccacc ggggcgtcag cttgaatttc cacgtcttaa agttattata atttctgacc 7621 aaaatccggc atcgaccctt acccaaaggt atcgaataaa aagcgactcc cccagccaca 7681 cctcgctgcg gaacactaaa tttgtaaatc acgaagttgg gagcaatgaa atctagttgt 7741 gaccaaggta gattgggttt gctcattcct cgcaaccttc ctcgaattcc tctactagaa 7801 ttttccagca cttccatttc cgatggcttg gcattttctc gctgacctaa agttccatca 7861 tgactaatgt gaacatgagc tgggtccatg atgttctcaa taaaatagaa ttggtcataa 7921 ggcaggtcac gcatcacgtc tgtggtgaca aactctggct tatctaactc ctctatcgtt 7981 ggaatgtcat catcagcagc tgcttcacca actcctgccc acatccaaac cataccttgg 8041 cgttccacaa tcttaaatga ttgcacgcaa gcatttgcag gaattttggc atctgtagct 8101 aactggggaa tatgcagaca ttgaccatca ctgccaaatt gccagccgtg gtacaaacac 8161 tctattttcc cgtcagttat ctgtccatcc gagagtttag cagcacggtg aggacaacga 8221 
tctaccagac agacaagctt cccatccaag tctctaaata aaacgaacgg ttcgtcatat 8281 aaggaaaaac tataggaacg gtttttgggt aaatcttgca caaagcaaac aggataccag 8341 cattccctcc agttgaactc ttgattttgc tctatttgga gcgcagtttg tcttgcttgt 8401 tcttctatct tggaatctac tatcatggat gaatcctgat ttggctactt atatactctg 8461 tattatttat ctaatttaca tatagcaaaa gttaggtttt attggttgac ttatctaatt 8521 tctggaaagt cagtcaagct agaggaattt gcttctatat tcttataagg acaaggtatg 8581 gcaaaaagtg acaaaataaa tttctcgact cccagtggtt ttccagaatt tcttcctggc 8641 gaaaagcgct tggaattata tttactagat accatccgga aagtttttga aaactacgga 8701 tttacaccca tcgaaactcc tgcagtggaa cgcttggaag ttttgcaagc aaagggcaat 8761 caaggggaca acattatcta tggtcttaat cccattttgc caccaaatcg gcaagccgaa 8821 aaggataagg caggtgaaac aggttcggaa gcaagagctt taaaatttga tcaaacagtt 8881 cctttagcag cgtatattgc tcgtcaccta aatgagttaa cctttccctt tgctcgctac 8941 caaatggata tggtgtttcg tggggaaaga gcaaaagatg gtcgatttcg tcagtttcgt 9001 cagtgtgata ttgatgtcgt tggtcgtcgt gaactgagtt tgctgtatga tgctcagatg 9061 cctgctatta tcactgagat atttgacgca gttaatatcg gtgattttct gattcgcatc 9121 aacaatcgta aaattcttac tggtttcttt aaatcagtag gagttgagga agaaaaaatt 9181 aaatcttgta ttgggattat tgatactttg gataaagtcg gtgagaataa ggtaaagcag 9241 gagttagtaa aagagggagt gttagcagat accactcaaa aaatcatcga ttttattcat 9301 atagatggca ctgtagatga agttctagat caactcaaat acctggcaac atcaacacca 9361 gaagctgagg aatttgctct tggagtcacg gaattagaaa cagttatttc tggagttcgc 9421 aatcttggag tttcagaaaa ccgtttctgt attgacttat ccattgctcg cggtttgaat 9481 tattatacgg gtacagtgta cgaaacaact ctaataggac atgaagcttt aggtagcatt 9541 tgttctggtg gtcgatatga agaattagtg gggatgtttt tggatgaaaa aatgccaggt 9601 gtgggcattt ctattggctt aactcgctta attagtcggt tgatcaaagc tggtattctc 9661 agtaccttcg cggctactcc agcgcaagtt atggtagtca atatgcaaaa tgatttgatg 9721 cctgtttatt tgaatgtgtc gcaaaaactg cgtcaggcta aaattaatgt tatcacgaat 9781 tttgaacagc gacctttggg caagcaattt caattagctg aaaaacaagg aattccattt 9841 tgtgtgataa ttggttctga agaagctaca gcgcaaaagg cgtctctcaa agatttgaga 9901 acacgcgagc 
agatggaagt cgcgctggaa gatttggcgg aggaaattaa aagaagactg 9961 gcgtaattcc tcgtttttcg ttcccaggtt ccacctggga atgccctcag ggaggctcag 10021 cctcctaaaa tatactagag gcagagcctc tggacaaggt gttaccaggc tgagcctggt 10081 aacgagatga aaaggggttt gatattagtt ttaagttatc aggaggttca catgaataaa 10141 gctatcaagc gagcattttt agatactgaa gatggtcaaa ttcactaccg cattggtggt 10201 gaaggagaag cacttctact actgcatatg aacccccgta gtagtgacga gtatcgcgaa 10261 ttaatgccca tccttgcaca aaagtaccgc gtgatagcaa tggatttgat gggctttggt 10321 gattctgaca aaccccctag attgtactct gttgctgact acgccaaaac tgtcattgct 10381 ctcttggatg agttgggtat tgaaaaagta aaccttctgg gaaaccacac gggagcgttt 10441 gtttccggag aagtaacagc agcttaccca gaacgtgtta acaaactgat attgggcaat 10501 gttgctggtt ttggtgaagc tggaaaaact gatttaatgc agagatttga tgaaggtttc 10561 gtcattaaag aagatggctc tcacttgatg gaaagatggt tagctcgttc tagatatgta 10621 ggttctgcgg agttaaatca tcgttgggtt ttggatgatt taaaatgctt tggctatcct 10681 ttatatgcag tttggactgt gggtaattac tgcatggaag cggcagaaag gttcagtttc 10741 atcaaatgtc caacactcat tttatggggt attcatgatg tagaagaatt tgaaagattg 10801 ggtttagcgc tagcaaaaga tcgatttttt ctctctcaag cgattcccca cgctaaggtt 10861 gcagagtttc ctgacggcac aatttgtatg atgaaccaga tacctgagga aatctcacat 10921 gttgtgcttg aatttttgga tgagacaagc gtttcataag tgatacaaaa ggtgaaactc 10981 tcaagcaaaa aacatgagtt caaaaaccta caaaaaacta gttgcaaaac agtttgccca 11041 aaacttcaaa tcagctatcg aaatcataga actccccata cccgaacctg caccagatga 11101 aattgtcatt cgtaacaaat tcgctggaat caacgctgga tttgacactt tactttgtcg 11161 aggtgatgtg agttacatta acttaactcc tccctttgat ttgggtgtgg aagcagtggg 11221 agaagtcgtc gcagtaggaa atcatatcaa agatttccaa gttggtgatg ctgtagtcac 11281 taccatacgt ggcggagggt atcgcgagta ccaagcgata aatgccaatc ttgcaatcaa 11341 ggtacgccaa gcaacgccag aagtgctaac cctgatccct actggtgtat cagcaatggt 11401 agctttagaa caagtggggg aaatgaaaag ccaagaagtt gtgttggtga cagcagcggc 11461 gggtggaacc ggacacattg cggtgcaatt ggcaaagtta gcgggtaacc atgtgattgg 11521 tacttgcgga actgatgcaa aggtagagtt acttcaggag ttgggatgcg atcgcatcat 
11581 caactaccgc acacaaaacc tcaatcaagt cctcaagcaa gaatatccca atggcgttaa 11641 cctagttttt gaatgtgtcg gtaaacaagt ctttgatacc tgtgtggata atttagcagt 11701 tcgcggacgt ttagttgtgg ttggtttcgt ttccgaatat gcgaagaact tggaacaagt 11761 gacacaaccg cgaatttatc acaagttgtt ttggaaagca gcttcagtgc gggggtttct 11821 tatgcccttg tataaagaat atatgacaga gggacgcgat cgcctcttca atctgtttta 11881 cacaaacaag ctaaaagttg ctgttgactc aaccccattt cacggcatag aatccattac 11941 tgctgctgtc gaatacctcc tcagtggtca aaattgcggc aaagtcgtcg tcagatttta 12001 gaaaatagtc atatatttaa cagagggcga ggaagtcacg atgtcacaac agtcagaaaa 12061 cactttaaaa gttgctcatc aagcatttga acacttccag cacggtttgg caacaggcga 12121 gtggaaccag ttcttggatg tgctgacaga agactttagc ttttggtttc ctattggaaa 12181 atatcacggt ttacatcagg gaaaagaaaa agctagagaa tttttccaat atattgctga 12241 atcattaaga ggtgaactca ttttagagca cgttacaagt aatgagacaa cggttgtgtt 12301 tgagttccgc gatgagggaa cgttgtttgg agaactttac aaaaatcggg tggcagtttc 12361 ctttgatgtg cggggagaca aaatttgcgg ctatagagaa tattttggca gtgatggcaa 12421 atcgaattga agtcagacta cgttaatttt cgttaatttt tatcagcatt tttttgattc 12481 aattctctgc acttagcctc cgctgcttga taagcttcca agtaactctt gtcacccaac 12541 atgacatttc cgtaatactc gtaaggtgga tcaccataga ctggcaatgg atcatcttct 12601 ggataatctt cttttgattc ttctattatt gcttttactt cctttacggt taagcgttgt 12661 gctgtaaagt accaaagttc ttgagcattc ttgttcaggt agggtaatgt tggatcgaca 12721 atacaatctt ctagttccat ccaaccgtgt tcaatgggtt tgtgtggctt gccagcaaaa 12781 gctaaaaagc cttgtacgta gaatgctccc ttagtcagta atgcagcttt gtaagcatta 12841 ctaaactgag ttttagcttt gctttttaca cgttgggcag tctcaataga aagcgcttca 12901 ttcaatggtt tgttcatcga cagcgagcat aagccatgac tctaacatag tatcacttgt 12961 aaactaccta cactcccctg gcgggtgagt tagtcgcttc ccaattcatt ggggattgcc 13021 tcggtcttca tggatttttt acgtcccgaa gactttggtc ttacatcccc tccaagggca 13081 gaagtgcacg gcaggcgctc cacttgggga gaccccaaga ccgcgctgcc tctagttcct 13141 aacactctta cgagttcgcc agttgctcct gcggacacgc actttacgca cattggtcgc 13201 ttaaccgtcg tcaactcacg cgcatgcatg ctgtttaagg 
taaatccctt aaccaaatga 13261 atcatagcgg ttttcaattg ccgtgccaaa gacgccaata aagagagaag aaaatttgat 13321 gaattcttta agattgctat aacgaacgcc tttgttatca ttgtgataca tttgctaaac 13381 ttagcagtca agtatgccaa atttatcggc tctaattttt acactagtgt ctcatgttga 13441 attggtaaat tttagatgaa tttcagtcag aataatcaaa attttcaaga attagataaa 13501 atactacggc aatctataaa tgatactttg cagaagtttc tagatgaaaa cttgaaaaat 13561 catatagaaa cttctgttaa agaattttta aacaaatatg tggatgagca aaaaaaaata 13621 cagcggcttt attatacaaa gactaatgcg attatagaca acactactgc tagccaagaa 13681 ttgtaccata aacttgagca aacgttggaa gaacttgact ctttgaaaaa ctctgtagaa 13741 aatatgctta tggagcagcg tcagaaaaat ggggaactgc aacgaaaaat taactgttgg 13801 gaacagtcag caatagattt ttttcggtta ttagagaggg cagttgatta cgaaacagat 13861 gaacgtagac tgttaattaa tagaatatta tatggattta atgatcttgt taataattta 13921 gggatagagc gtattattcc acaataaaat gattatattc atgaaaattt tcatgaagca 13981 attgatgagg aagaatccga tatcatacca ggcaatatcg ttaaatgtat aagttggggt 14041 tacagaattg gtgacaaagt tcttgagaaa gcaaaggttg ttgtagcaaa atagctagcc 14101 cctgatcaga aaaattaacc aattcaaaat atgctctacg gatacgcttt acatgagcac 14161 ctatcactcc tttgcaggaa ttacaaaaag atgaaaatag cgtgctcaaa tatcctatct 14221 gttgactgta cgcacttatt tttaggtgac agcttatgaa ttttacactg agaaaggaat 14281 cagttaggca atgacaaaaa aatatgctat cggaattgat ttaggaacat cgacatctga 14341 aatttgtgtc taccgaaata atgaatcact tgtgattcct gatcctgtga ccaaaatagc 14401 gatgattccc tcaattgttg ctattaataa aaagggtgaa cttttagtag gagaaaacgc 14461 cagaagttgg gttgatgtct cagaacgtgg agttcgtgaa gtcaagcgta aaatgggaac 14521 tggagaaacc ataaaactac taggtaaaga gtatcgacct gaagaaattt ctgctttgat 14581 tcttcgtcaa ctcaaggaaa atgctgaaga agcactagga atagaaattc gagaagtagt 14641 cctttctgtt ccagctaact ttccagatgc agctcgacaa gctacactga atgcaggtga 14701 attggcagga ttaaaaatta ttcgcctgat aaatgaacca acagcagcag ctttagcttt 14761 tgggatcaaa aatattgatg ttgaagaaca gcttgttgtc tttgattttg gcggtggtac 14821 attagatatc actgtactgg aaatggttgc aggtgttctt gatgttaagt gtagttttgg 14881 taatcctcaa ttggggggta 
aagattttga tgaagcaatg atgacattac ttcatcgaaa 14941 atttaagaca gaaaatcctg aggcagaaat ttctcaaaaa gctcatggtg cactgaaaga 15001 agctgcagaa aaagctaaaa aagttctttg tacacaacaa tcctatgatg tacgaattcc 15061 gtattttgca gcgaacaatg gtgaatttat tgatttagag gtggaagtga cgcaccaaga 15121 gtttgaagtg gcgatcgcac ctctattaca aaaagcacga gactgcatcc gtcaagcact 15181 aaatgccaaa aatctccacc ccagtacaat caatcgagtg ttacttgtag gtggaacaac 15241 ttacattcct gcggttcgtc aattagtcgt agagatgttt ggtaaacaag ggaaagcact 15301 tgatgttggt gcagacttag ctgtgggtgt tggcgcatct attcatgctg cttttgctca 15361 aggtttattc tgtgaggatt ccggtgttat tctcaccgat gttgctcctt ttggattggg 15421 tattgaagta gtgagttacg ttggcggaca gtatatgcta acctatgaac ctttgattca 15481 acctaataca acgattcctt attctactca aaaaacttac actcttttga aaccagatca 15541 aaagcggttg gaaattcgtc tctatcaaga caacacaggg aaagcaaagc taccattaga 15601 agcgattgac acaggaatag aagcagaaat cacagatatt cctcctgctg ttgatggtat 15661 tccctatcca gtggaagtgg aattttatta tgacattaat gggatagcta aattgaaagc 15721 gactattcct aatattaata aaagtgtcga gctatcttat ggttattcag ccaagcgcat 15781 gggtaacaaa gacatagctg atgcggcttc tcgcctcaaa gaactgtgga agcaaaatgc 15841 caaggcaaga ctttacgaag gactcattaa taaagcagaa aggtatatgg ctggaatacc 15901 tcctcaagaa agatcgccgt tatctgatat tgttatggaa ctcaaaaaag gtctcatgaa 15961 tgacaacatt caagagattc aaaaagcagg cgatcgcctt gtagacttct tgtttgattt 16021 agaaaagaac atggaataat aacagatggt atatgaactc tatcatatcc tgggaatttc 16081 ctctcaggca tctgctgatg aaataaaacg agcttacttt cagttagttc gtaagtattc 16141 tccagagaaa gatccagaac gctttcaaca aattcggata gcctataaca cgctgtttga 16201 ctcaaaggaa cgagaaaatt atgacgccat gcagaagtat ggcgaccaag ttaaagacct 16261 gattttgcaa gctcaaaata aaatgcaagt agaagaatgg acaaacgcca tttccttact 16321 caagcaagtt cttgtactag caccaagaat tgatatagct cgtaatctac taggtctttg 16381 ttacattcat acgaaaaatt gggattttgc tgttaaagtc tacacagcac ttacgaaaac 16441 taacccagac gtagcagtct attggagtaa cttgggttat gcctacaaac tgcaagctca 16501 atgtttcaat gatgaagata ttagtcaaat ccagttatat cacaatgccc gtgaatcttt 16561 
tcagcaagta gtcaaattag aatccttcaa ttcagcaccc tatttagata tagcagaaac 16621 ttaccttgac caaaaaaatt actctgaagc acttgcttgg gcagaacgtg ctattggtgc 16681 tgatggtaaa gctgattatc atgactttga agcacttttt ttgatctgcc gagttcactt 16741 ttatagtggg gaattacaaa aaatagaagt catcgcaaaa agaattatat cattactacc 16801 taaaaagtca gaaattcgag aatatgctgc gactcgattt gctaacatgg gtattgaaat 16861 tgccaaaaat gccgcaattt ctagtaattt tcatatgtgg agatcagctt ttgaatttat 16921 caaaatagct aaggagatag aaccaaataa cttaggtatt caacaaattt tgacaaaact 16981 tgaggaaatc gttgcagcta ttaaccaata tgaaaatctc aatcgtgatt atctcatcaa 17041 ccaaggattc caaagactag ctgcattttg tttagctgat tattttaatt tttatgattc 17101 accgcaagaa agaaaaagtt ttttaaatga catactcaca gaaattttag tttctcccac 17161 cagcacaatt tttgcgtcac ttgagagaat taaattttac tatcctgcag tgtataagct 17221 gaatgtagaa ctctttcatc gaattgagca agtggcttgt gtaccttaac acccgttaca 17281 tagcagtcct tcgcacttaa agaaaattat caatcagcaa gtctgcgcga gagcaaaagc 17341 cttggggcgc taagaggacg acactcgcac agtcatggtt tcacaggctt agacgcgcag 17401 tggtaaaaca gcactacgca ccaatgttgg gggcgattgc gttcacttat ataagtgagc 17461 gtgtccgtat ctcctgtgca aaacttgggg gcaatagccg tgggcagaaa cacagagttt 17521 accgcaagtt aagaactgtt cgtcgccaag aaaacagaga agagaaaatt tccgaaattc 17581 tttaacattg gtgttaaaat ttaccattgt tttgattttg ttatgttttc ttaatgaaaa 17641 cagttatcgc cattagcggc gtatatgttt caatgtcgaa ccatgtcgcc taattcaggt 17701 catagtgctt tgaatagtcg ttttatgttt gctactattt atctaaactt tgttagcaaa 17761 ttgtttttat tgataaaggt ctgttattaa tcaatttgaa gatttttgaa ttttcctcat 17821 gtgggcaaca taattttgta cttttgtgct agcataacta acgaggaatt atattcggtt 17881 gtgtagtttg tttttatcta tttacaagcg tctctccgaa tctacctctc tacactattt 17941 gattctggag atgtttcatg tcgatttacg tcggtaattt atcttaccaa gttgagcaag 18001 atgacctcag acgggtgttt gaggaatatg gaaccgtaaa aagtgttcaa ttgcctgtag 18061 accgggaaac tggtcgtgta cgagggttcg cttttataga aatgggaaca gaagcagaag 18121 aaactgcagc catagaggct ctagatagtg ctgagtggat gggtcgtaat cttaaggtta 18181 ataaggctaa gcccaaagaa aacagaggct catctggtgg cggtggcggc 
ggcggcggca 18241 gaggcggttg gaacaatgat agaggaggag atagagg // LOCUS NODE_1804_length_18259_cov_5.391068 18259 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18259) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18259) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18259 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 10..402 /locus_tag="DP116_15555" CDS 10..402 /locus_tag="DP116_15555" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15555" /translation="MYGLATVIQLTTNRTTSQSESTRRQAAESLEKILPTDQMAKVVI ALYEFKPNEQRCKVIWHCTQSMSYPDFYQAWHHRSYMKLAMKFARYYWWQIALYLFLG LSIFALVRFTVSHNVNNPPHIQRQQQIR" gene 552..872 /locus_tag="DP116_15560" /pseudo CDS 552..872 /locus_tag="DP116_15560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745169.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DNA polymerase beta" gene 906..1220 /locus_tag="DP116_15565" /pseudo CDS 906..1220 /locus_tag="DP116_15565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015157310.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DUF86 domain-containing protein" gene 1573..1782 /locus_tag="DP116_15570" CDS 1573..1782 /locus_tag="DP116_15570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009768228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15570" /translation="MAEISLDENKLKELLKTAILEVIQERKEVFSDLFAEIIEDIALE KAIKEGENTESVSREAIFKILDRQG" gene 1779..2045 /locus_tag="DP116_15575" CDS 1779..2045 /locus_tag="DP116_15575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876366.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_15575" /translation="MNVEFRKSFEKDLGNIREDTLLQRIKAVIEEVEIAEKLGDVSNL KKLKADGDYYRIRIGDYRIGITLGEDVVIFVRVLHRKDVYRYFP" gene 2273..2509 /locus_tag="DP116_15580" CDS 2273..2509 /locus_tag="DP116_15580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741660.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15580" /translation="MTPVVIKPSSWLTTGIRVEKVNNLNLFKFTEELGSRMQELLDKK KADLLTPEEAAELEAIGELDMIFSYINAITASQS" gene 2506..2970 /locus_tag="DP116_15585" CDS 2506..2970 /locus_tag="DP116_15585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002772704.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_15585" /translation="MTIPNSIQEFIRQRADFRCEYCHYPEFLSTSPLTIDHIMPKSLG GSDDTDNLALACRRCNERHYNFIIGIDPQTQQEVSLFNPRQQNWSEHFIWTADGTKMI GITPTGRATCNRLDLNDERRADRFIQKSRRLWAQGGFHPPRQDPQQVSDINQ" gene complement(2960..3202) /locus_tag="DP116_15590" CDS complement(2960..3202) /locus_tag="DP116_15590" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15590" /translation="MHKRDPRLLAKVGDLGFRRYSQKSRALAGVRAACAFAHSHKRQF PIACDSVLPIPSLSAIGKMSLSPQISESISRFDIIG" gene complement(3205..3879) /locus_tag="DP116_15595" CDS complement(3205..3879) /locus_tag="DP116_15595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873537.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15595" /translation="MMTPLKNAIAIPDDKQAIAKSDFDQQYLEERATVLKSIVDDSIS FQMLADATETILRHAFWLAKQKQKRSVREYKRLLIDYGWKGEEKKYLKIAAAFQKFSP QELAQIEPSTVYQLAHNSNKYKQIIDKLLDLTAITQEAVRSLMREPRTPKEDKPEKPS IWRRTKNGGRYCQIPPIHETDERTGTTLQKMIDEEGLSAQHIVAEAVALRQAYKEGRL TVVEKN" gene 4358..4882 /locus_tag="DP116_15600" CDS 4358..4882 /locus_tag="DP116_15600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488989.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_15600" /translation="MVLSITVQSPTKKPTLQFGACGQIVKDMQKALNRRLAQLDIVSV SPLSVSTVGYFDHQTRDAVKYLQCLAFLTIDGIVGQQTWAYLSNGFAGLPILSFGSTG SVVKAVQEPLKVGGYYFGAIDGIFGAKTEAAVLAFQAEHCLESEGIIEALTWNALSKL DSHGSHCKINAFRG" gene 5203..5715 /locus_tag="DP116_15605" CDS 5203..5715 /locus_tag="DP116_15605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411931.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15605" /translation="MKNLAYGIVLSSCWLTSWVVSSLAVNNVVSSNEINTKTIAIEKA EFGVLRDDRDGKMSFLPTTKVPHQEGKRYGWRIQLKDNQNEVTWKEVLRLPKLPETWS TSSGENFVISTDGMEAVTKRTQSAKKGVIENFWTVASGDPTGKHTIAVYIDNRRIGFF EFELVSPKNK" gene 6214..7377 /locus_tag="DP116_15610" CDS 6214..7377 /locus_tag="DP116_15610" /EC_number="2.6.1.52" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoserine transaminase" /protein_id="PRJNA477356:DP116_15610" /translation="MSPHLTPPTTKPRVPNFSSGPCAKRPGWSVSKLENAFVGRSHRS EDGRSRIKEVIERSKTILGVPADYRLGIVPASDTGAVEMALWSLLGKYPLDILAWESF GLEWVKDVVDELKLPNLNVLKATYGSLPDLNQVDFSHDVVFLWNGTTSGVRVPNGDWI KDDRQGLTICDATSAVFAMEVPWQKLDVVTYSWQKVLGGEAQHGVIVLSPRAVERLES YQPTWPIPKLFRLAQKGKLIEGIFKGDTINTPSMLCVEDALDGLLWAESIGGLPGLIR RSEANLATIARWVEQSDWAAFLAQKPETRSCTSICLKIVDDWFTGLSPEKQAESAKKI AKLLQKQEVAYDIASYRSAPPGIRIWGGATVETTDIEALLPWLDWAYATVKQE" gene complement(7492..7947) /locus_tag="DP116_15615" CDS complement(7492..7947) /locus_tag="DP116_15615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127384.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldehyde-activating protein" /protein_id="PRJNA477356:DP116_15615" /translation="MNTTYTGGCQCGQIRYEIRAEPLTLYLCHCKECQKQSSSAFGMS LTVPRDAVVITQGQPKAWTRKADSGREVTCLFCDDCGTRLFHERTYNRETINIKAGTL DDTSWLRPVGNLWTSSAQPWVIISDQMLNYERQPADVRLLWEKWAQQHS" gene 8856..9155 /locus_tag="DP116_15620" CDS 8856..9155 /locus_tag="DP116_15620" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15620" /translation="MVNQIVDIVDSLSYPELAGLLTACGYAIREQTKLSRALLELSFE ISKLHDKKGFFNPEYQDVVNDIDLALLAGLCAVKLTQMATVIEEINRRYHEVFVD" gene complement(9910..11484) /gene="purH" /locus_tag="DP116_15625" CDS complement(9910..11484) /gene="purH" /locus_tag="DP116_15625" /EC_number="2.1.2.3" /EC_number="3.5.4.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196540.1" /note="involved in de novo purine biosynthesis; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/inosine monophosphate cyclohydrolase" /protein_id="PRJNA477356:DP116_15625" /translation="MARLALLSVSNKTGLIDLARSLVEEFDFEFISSGGTAKALKDAG LPVTKVADYTGSPEILGGRVKTLHPRIHGGILARRDVPEDVADLENNQIRPIDLVVVN LYPFEETIAKQGVTLAEAIEQIDIGGPAMLRAASKNFAHLTILCDPAQYEEYLQEMRI TAGEPSLEFRQKCALKGFLHTSSYDQAIAAYLSQNLSKEAELPQQYTLKGKQLQSLRY GENPHQSAAWYETGTTSTGWAAATKLQGKELSYNNLVDLEAARRIISEFTETPAATII KHTNPCGVALGHSIQEAYQKAFNADSVSAFGGIVALNRPIDAGTATELTKTFLECVVA PSCDAEAQEILAAKSKVRVLIFPDLKSGPKETVKVIAGGFLLQASDDAVANTSTWQIV TEKKPSDDELEELLFAWKVCKHVKSNAIVVSGDTPLGVCALRNRTTLGVGAGQMNRVG SVKIALEQAGEKAKGAILASDGFFPFDDSVKTAAAAGIKAIVQPGGSLRDQDSILAAN ELGLVMVFTGIRHFLH" gene 11658..12185 /locus_tag="DP116_15630" CDS 11658..12185 /locus_tag="DP116_15630" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15630" /translation="MKDTGLEQQKKITVLRDQSPSGALVKRDTGFDIEVYKEVLSDRF IEARTVDEYEKLLVLRQRVQELDIEVRRLDYAERSAEIQLQQAQQKALLQRGQQIVAI IISIAAGLYLLQTLPLAGLLFLILGLAKPLGYSLGEIGNFLDSLKGFPKDSDKLLSDG KEQRDQAEESRDARP" gene 12517..12783 /locus_tag="DP116_15635" CDS 12517..12783 /locus_tag="DP116_15635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011611495.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15635" /translation="MNYGYEKKQNLAEAAAEIQDLLQQLEKSNPTASEAEKVAYVDEE IEPDLKSRLVKALKTSGEVAIESSLDSRYIDIIRAIIKSWSSSE" gene 12960..13184 /locus_tag="DP116_15640" CDS 12960..13184 /locus_tag="DP116_15640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008179559.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15640" /translation="MDYQDIITIEPGKRSGKPCIRRMRITVYDILEYLAGGMTEAEIL EDFSELTLEDIKACLAFAADRERKLFVASL" gene 13181..13513 /locus_tag="DP116_15645" CDS 13181..13513 /locus_tag="DP116_15645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656019.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15645" /translation="MKLLLDENLSDRIIHRIVDLYPNSEHVKTLGLTNTDDTVIWEHA KADDFVIVSKDSDFHQRSLLYGHPPKFIYLRIGNSPTSKIIQILRDNFDTITQFKSSE TESILVLM" gene 13684..14016 /locus_tag="DP116_15650" CDS 13684..14016 /locus_tag="DP116_15650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739755.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15650" /translation="MTLAILVEPTPLEVDANGVVRVGETRVTLDTVVTAFLEGATAEE IGEQYTSLQLSDIYSVLGYYLRHKAEVDAYLLERQRQAAMIRQEAEQRFNPVGIRERL LARRSQHG" gene 14023..14376 /locus_tag="DP116_15655" CDS 14023..14376 /locus_tag="DP116_15655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182922.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15655" /translation="MVRFLADENFNNQIVRGVLRQSPSIDILRIQDVDLSGANDPTVL EWAAQHRRVVLTHDVATMITFAYERIQARLSMPGLFEVSRRVSVGLAIEEIILIGECS LEGEWEGQVRFLPLR" gene 14617..15249 /locus_tag="DP116_15660" CDS 14617..15249 /locus_tag="DP116_15660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318488.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_15660" /translation="MANTLEFISVPPKTGQPPKGLIVTLHGWGANAEDVASLSRFFNL PDYQFLFPNAPFPYLNSSVGRAWYDLRMENMYQGLVESRQLLTGWLQSLENNTGVPLS RTILSGFSQGGAMTLDVGLKLPLAGLVSLSGYLHQDVESVKTQNMVSLPPVLIMHGRQ DTVVPLQAAVSARKTLESLGAAVEYYEFDMGHEIRPEMLELLRNFVVVNA" gene 15429..15641 /locus_tag="DP116_15665" CDS 15429..15641 /locus_tag="DP116_15665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744883.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2555 domain-containing protein" /protein_id="PRJNA477356:DP116_15665" /translation="MKTLSISKREIATITPQEVEVLATRLEQDNYSNAFEGLNDWHLL RAIAFSRPELVESYIHLLDLEPYDEA" gene 15642..16844 /gene="coaBC" /locus_tag="DP116_15670" CDS 15642..16844 /gene="coaBC" /locus_tag="DP116_15670" /EC_number="4.1.1.36" /EC_number="6.3.2.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318486.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate--cysteine ligase CoaBC" /protein_id="PRJNA477356:DP116_15670" /translation="MFQKKRVLIAIGGGIAAYKICEVVSTLFKIGVEIKVILTNSAQK FITPLSLATLSRHPAYTDENFWQSTHFRPLHIDLGEWADLIVIAPLTANTLAKLAYGM ADNLLTNTVLASTCPILLAPAMNTDMWEQQAVQRNWQQVLTDGRYHGMCTGLGLLACD RVGAGRMAEPPEIITYVQSLLHTAGKRDLVGKKVLISAGGTREYLDPVRFIGNPSTGK MGLALAQAALHRGASVTLVHSPASWDVPLGVQAISVVSADQMRASMVEYLPNADMIIM SAAVADVKPREYSQQKLPKKLLPQALPLEPVPDIVAELARLKQPHQQLIGFAAQTGDI VTPALEKLHSKNLDAIIANPIDEPDSGFGSDNNRAIFLDKQGQRIEISPCSKLQMAHH IFDVLAKK" gene complement(16902..17897) /locus_tag="DP116_15675" CDS complement(16902..17897) /locus_tag="DP116_15675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318962.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15675" /translation="MTSERERVDNDSPWKEILEAYFPQAMEFFFPQTAALINWERPHE FLDKEFQQIARNAEQGRRYADKLVKVWQIQGEEIWLLIHVEVQAKPEDDFAERMFSYN LRIFDRFAKPAISLAILCDTDLTWRPNQYSYNYPDTSLHFKFGTIKLLDYQNRWTELE KSDNPFATVVMAHLKTQQTSKKPRERKTWKFSLIRRLYELGLAEKDIRNLYRFVDWVM ILPKALEAEFWQDFKEFEEQCVARVPRVVATAEQERTMSYITTGERIGYERGQQELVL RLLQRRVGELPQEVKKQIQALSLEELEALAEALLDFTAVGDLLNWLQAHLNETEN" gene complement(18078..>18259) /locus_tag="DP116_15680" CDS complement(18078..>18259) /locus_tag="DP116_15680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15680" /translation="IVWGVAFTPDGKTLASGSYDKTIKLWSLDLDDLLARGCNYLKEY LATRDELPKKLCPGK" BASE COUNT 5207 a 3684 c 4006 g 5362 t ORIGIN 1 ggttagctta tgtacggact ggctaccgtg attcagctaa caacgaatcg caccacctca 61 cagtctgaat ccacccgtag gcaagcggca gaaagcttag aaaagatttt gccaacagat 121 cagatggcga aagttgtcat tgccttgtat gagttcaaac caaacgaaca gcgttgcaaa 181 gtcatctggc attgcactca aagtatgtcc tacccagatt tttatcaagc ttggcatcat 241 cgctcttata tgaaattagc aatgaagttt gctcgatatt attggtggca aattgctttg 301 tacttgttct tagggctatc aatctttgct ttagttcgct tcacagtttc acataacgtg 361 aataatcctc ctcatatcca gcgacagcag caaattcgtt aactttgata aaccagcgct 421 caacatttac aattctattt catccattca acctaatttg ctagaacata aggagacttc 481 ttcatttcat ccccgtcacc aaaattgatc tgaacacaac ttttgataag tttggagtaa 541 aacttttgaa gatgaaaacg ttggaggaaa ttaagcaaat tctcagacag agtaaaccgc 601 tgttgcagga gcagtttcat atcacgcagg taggcatctt tggttcttac gctcgtggag 661 aacagacaca ggagagtgat gttgatgtac tcattgacta tgatcgagcg cctaccttat 721 ttaagctggt agagctacgc gattatctca gtagtgcgat ctgcgcgtag cgcagcagcg 781 aagctatcgg tatgaaggtt gatatagtaa cgcaaaacag tttaaagcca agaatccggg 841 agcgagtgtt atcggaagtc gtttatatat gactaaacga caactctggt 
gaatttctcc 901 aagacatatt ggatgctatt gcggacattg aagcgtttac agacgggatt gattttgaga 961 catttcgagt caaccgtgag aaaattttag ctgttgtgaa gtcaatcgaa atcgaatgag 1021 aggcagtcaa gcgaattccc gacgacatcc gcagccagca ccctcaaatt ccttggaagg 1081 cggtggctgg aatgcgagat gtgttagtgc atgaatattg gggaattgat gtaaatgttg 1141 tctgggcaac agttcaggaa ggattgccac ctttaaaagc agtgattgtt gaaatcacga 1201 gaaacttaca gaatacttag aatgctgagg ctttccgttg tttgagcaga taaactcgtt 1261 ttgccaatcg cgctcaatca acgtattggt aaactcagat cttgcaccat aattttcgtc 1321 cgtcaagaaa taaatttcgg aggctcaaag ttcaagtccg ttaaaactga ctgggtaagt 1381 cttttagtcc gttttaacgg acttggatta ttagccttga acttgagttc aaggcgtact 1441 atgggtgagg tgcaagatct gagtaaaaca gagtcagagg tgatgcggaa gcagccgcta 1501 ggcgatcgcc caaagggcgt gtgttagttg ggcgtgtgtt agtatcagtc gcaagagatg 1561 cttggagggt atatggctga aatttcccta gatgagaata aacttaagga actcttgaaa 1621 accgcgattc tggaggtaat tcaagaacga aaagaggtat tttctgattt attcgcagaa 1681 attatagaag acatagcctt agaaaaggca atcaaagaag gtgaaaatac tgagtcagtc 1741 agccgagaag caatttttaa aattctggat aggcaaggat gaacgtagag ttcagaaaaa 1801 gctttgagaa agatttagga aatattcggg aagacacatt acttcaaaga ataaaggcag 1861 tcattgaaga agtagagatt gccgaaaaac ttggagatgt cagtaatctg aaaaaactca 1921 aagctgacgg tgactattat cgcatcagaa taggagatta cagaattggc attacgctag 1981 gggaagatgt agtcattttt gtaagagtct tgcatcgaaa agatgtttac agatactttc 2041 cctaatttga gcagagaaac tcgttttgcc aatcgcgctc aatgaacgta ttggcaaagg 2101 agagttagag gcgatgcgga agcaccgctt atgcgagtgc gcgaaggcgc acgctgcgcg 2161 aacgccttac tggaatcctt cggcgtgact tctcgtccta attcgatagt taatcgccac 2221 ataacctaat tccacatatt ctatgatgga gcaaagcaat agctataacc ctatgactcc 2281 agtagtaatt aaaccctcat cctggctaac gactggtatt cgagtcgaga aagtcaacaa 2341 tctcaatctt ttcaaattta ctgaagaact cggttcacgt atgcaagaac ttctggataa 2401 gaaaaaagct gatttactga cgccagaaga agccgctgaa ctggaagcta ttggagaatt 2461 agacatgatt ttcagctata ttaatgctat aactgcatcc cagtcgtgac tattcctaat 2521 agtatccaag aatttatacg ccaacgcgct gattttcgat gcgaatactg ccattatcct 
2581 gaatttctca gtacatctcc cctgacaatc gaccatataa tgccgaagtc tttggggggt 2641 tccgacgata cagataattt ggctttagcc tgtcgtcgct gtaatgaacg gcactataac 2701 tttataattg gaatagaccc tcaaactcag caagaagtct ccttatttaa tccccgtcag 2761 caaaattggt ctgaacactt tatctggaca gcagacggta ctaagatgat cggcatcaca 2821 cctacaggtc gagctacttg taaccgactg gatttaaatg atgagcgtcg tgctgatcgc 2881 tttatccaaa aatctcggcg actttgggcg caaggtggtt ttcatcctcc ccgtcaagat 2941 ccgcagcaag taagcgacat taaccaataa tatcaaacct ggagatactt tcgctgattt 3001 gaggagataa actcattttg ccaatcgcgc tcaatgaagg tattggcaaa acagagtcac 3061 aggcgatggg aaactgccgc ttgtgcgagt gcgcgaaggc gcacgctgcg cgaacgcccg 3121 caagggctcg actcttctgg gagtatcgcc taaaaccaag atccccgact tttgccagaa 3181 gtcggggatc tcgtttgtgc atgactagtt tttttccaca actgtcagcc gcccttcttt 3241 ataggcttgt cgcagcgcta cagcttcagc cacaatatgc tgtgcgctca acccctcctc 3301 atctatcatc ttttgtaaag tcgtcccagt tcgctcatct gtttcgtgaa ttggtggaat 3361 ttggcagtac ctcccgccat tctttgtgcg ccgccaaata cttggttttt caggtttgtc 3421 ttctttagga gtgcgcggtt ctctcatcaa agaacgtaca gcttcttggg tgattgcagt 3481 taagtctaaa agcttatcta taatttgttt gtacttattg ctattatgag ctagttgata 3541 cacagtgctt ggttcaattt gtgctaattc ttggggcgaa aacttttgaa atgctgctgc 3601 aatcttgaga tatttttttt cttcaccctt ccaaccatag tcaataagta atcttttata 3661 ttctctcaca gagcgcttct gtttttgctt ggctaaccaa aaggcatgac gcaaaattgt 3721 ttcagtggca tcagcaagca tttggaaaga aatactatca tccacaatgc ttttgagcac 3781 agtagctctt tcttctaaat attgttggtc gaagtctgat tttgcaattg cttgtttgtc 3841 atctgggatt gcaattgcat tcttcagtgg tgtcatcatg gtactcaata atactggatt 3901 tttagaagtt cattgtgtcc tatggacata gagacaagag cacagtgcct tcacaggaga 3961 ttgtccatgt gaaagcacag cgcaactggc gattctctca taggagccgt ccttttggaa 4021 gcatcgccaa gaccaaaatt tcttactata gaacaattgt actatttttt ttgcaggagt 4081 aatcgccaaa catactcatc ttttggaaga tatgatgaaa attagataca tccaaaactc 4141 tgttgacaga ggggtagtca tgcatatctt ggaatactcc gcaatctgct agtagttaag 4201 tcgcataact gaagttggtc gatcctagca gaaaaatatt ggtctccact actggtgtca 4261 
aaaaaaagaa ttctttcttc tgacgtagtt atgttaggga actttaacga ttcttttatg 4321 agtccttaat atatgcttac atatttaaag agacattatg gttctatcta ttacagttca 4381 atcccctaca aagaaaccaa ccttacaatt tggtgcttgt ggtcaaattg ttaaagatat 4441 gcagaaagcc ttaaatcggc ggcttgctca actagatatt gtatcagtat ctcctttatc 4501 ggtttctacc gtaggttact ttgatcacca aaccagagat gctgtgaaat acttgcagtg 4561 tcttgctttc ttaactatag atgggattgt gggacaacaa acttgggcat atttgtccaa 4621 cggatttgct ggcttgccaa tactaagttt tggcagtact ggaagtgttg tgaaagctgt 4681 tcaagaacct ttaaaagttg gtggctacta ctttggtgcc attgatggta tttttggagc 4741 aaaaactgag gctgcagtcc tggcttttca agcagaacac tgtttagaaa gcgaaggtat 4801 cattgaagct ttgacttgga atgcattaag caagttagat agccatggct ctcattgcaa 4861 gattaacgca tttcgtgggt agtgaagtac taacgacctt ctgtttttgg aatttggaat 4921 tccaagaaaa atatttttta atttttaatt gataatttgg aatgagggcg cagcaagagt 4981 catgttatct ggtgcgctct tgttatcagg gaacagggaa cagggaacag ggaactctta 5041 acgcttaact tttaataaga aagaattcag tcttaactgt cttccaactt ccgacttcca 5101 ctcttccact tcttgacagt aaaaatgtca gcattgtatc tgaaaaaagg atgttgattc 5161 atatatgaat taacaagtcc ttttttagaa aagctgatgc aaatgaaaaa tttggcttac 5221 ggtatagtac tgtcatcttg ctggctaaca tcatgggttg tatcttcttt agctgtcaat 5281 aacgttgtat cttccaatga aatcaatact aaaacgattg ctattgaaaa agccgagttt 5341 ggggttttaa gagacgatcg cgacggtaaa atgtcctttc tgcccacaac aaaagtgcca 5401 catcaagagg gaaaacggta cggatggcgt attcagctta aggacaacca aaatgaagtg 5461 acatggaaag aagtccttcg attaccgaaa cttccagaaa cttggagtac aagtagtggt 5521 gaaaattttg tcatatcaac tgatgggatg gaagcagtga caaagcgcac acagtcggca 5581 aaaaaaggag tcattgagaa cttttggact gttgcttctg gcgatcctac tggtaaacat 5641 acgatcgcag tctacattga taaccgtcgc attggttttt ttgagtttga gcttgtttcc 5701 ccgaagaaca agtaacagtt atctgttatc agttaggttc ggactcagat gatttcttga 5761 atgagtgtat tctttttaca atacgcgcgt tgcatttata cgtatcactt caccccgccc 5821 tgatgggcac ccaggtttat gtaggcgatc gcaataatgt tcatacacca atgctggttt 5881 tagtcgtgtc aaccccccaa ggttgtattt acttgtcatt tccattgctg cccacttttc 5941 cattcaccct 
atccttggcg tgtgctctgt gtgccgcaag catacacgaa cgggcatgag 6001 gttatgatct gtctaaaact tgataagtct caaagaaaac ttctattgat atcaatcggt 6061 ctcaaaagaa actttagttg attgccagca agcatctata ttatgtgtag gagttgagaa 6121 attttgactg ttgacctgag tacaaagcaa ggcgtaaagt agtagcgagt tacgaccttc 6181 cagtaggtcg catctataac tgaagtacct tgtatgtcac cacatcttac tcctccaaca 6241 acaaagccac gagttcctaa tttctcatct ggtccttgtg cgaaacgtcc tggctggtct 6301 gtttccaagc tggaaaatgc tttcgtaggt cgttctcacc gttctgagga tggcagatct 6361 cgtataaaag aagtcattga gcgctccaaa acaattctcg gtgttcctgc tgattatcgc 6421 ttgggcattg ttccagcttc cgatacaggc gcagtggaaa tggcactatg gtcgctgtta 6481 ggaaagtacc cccttgatat cttggcgtgg gaaagctttg gtttggaatg ggttaaagat 6541 gttgtagacg aactgaagtt gcctaatctc aacgttctca aggcaactta tggcagtttg 6601 ccagacctca accaagttga ctttagtcat gacgtggtat ttttgtggaa tggcacgaca 6661 tcaggtgtta gagtccctaa tggcgattgg atcaaagatg accgtcaagg tctgaccatc 6721 tgcgatgcca cttctgccgt tttcgcgatg gaagtacctt ggcagaagtt ggatgttgtg 6781 acttactctt ggcaaaaagt gctaggagga gaagcgcagc atggagttat tgtgctttca 6841 ccccgcgccg ttgaacgcct ggaaagttat caacctactt ggcccattcc gaaactcttt 6901 cgcttggcac aaaaaggcaa gttgattgaa ggaattttca aaggggacac aattaacaca 6961 ccatcaatgc tgtgtgtgga agatgcgctt gatggattac tttgggcaga aagtattggt 7021 ggacttcctg gcttgattcg tcgcagtgag gcaaatctag caaccattgc ccgttgggtt 7081 gagcaaagtg attgggcggc tttcttggcc cagaagccag aaacccgttc ttgtacttcg 7141 atttgcctca agattgttga tgattggttt acaggcttga gtccagagaa gcaagcagaa 7201 tccgctaaaa aaatagcgaa acttctccaa aagcaagaag ttgcttatga tatcgcatcc 7261 taccgttctg caccgccagg aatcaggatt tggggtgggg cgacagtaga aacgacagat 7321 attgaagcgt tgctaccgtg gttggattgg gcatacgcga ctgttaaaca ggagtaagaa 7381 gaacacagaa cgcagaacac agaatatccc catcaattct cttgatttta ttctgagttc 7441 tgacaactga gttggtgagt tcttgttcgg taggaacagc ttttcatggt ttcatgaatg 7501 ctgttgggcc catttctccc acaaaagacg cacatctgct ggttgtctct cataattgag 7561 catctggtct gaaattatca cccacggctg agcgctactt gtccaaaggt tacctacagg 7621 gcgtaaccaa cttgtgtcat 
ctagcgttcc ggctttgatg ttgatagttt ctcgattata 7681 agtccgttca tggaacaatc gcgttccgca gtcgtcgcag aacaggcaag ttacctcacg 7741 tccactatcg gctttacgtg tccaagcctt cggttgtcct tgggtaatga caacagcatc 7801 tcgtggtaca gtgagggaca ttccaaaagc gctggaagat tgtttctgac attccttgca 7861 gtggcacaaa tagagagtca acggttcagc acggatttca tagcgaattt gtccacattg 7921 acagccgccg gtataggtcg tgttcacagc ttttctcctc cttgctattc actgctgcct 7981 gattgtaccc aaagctgctc ttatcttgaa gttaaaaaat ctatgcttat ttttccggaa 8041 ttggtattca tctggtggta gatattatta acctaaccac ttatatctga acttcttata 8101 ttgagccaaa ggggagataa ctccctgtct attattaaag cccagtcgtt atgactgggc 8161 ttttggctat ggactgatgc aatgattttc cctgaccaca actaggggca taggggagtg 8221 ggggtgtaag ggtgtagggg cttatcattt aggactgcca tagcgtgtag ctgcttctga 8281 aaataggctc attttttgag tttcaacgca aatacaagca cttcactgtt cgcccacgct 8341 ttattgctct ttctttttac gtagtattct gctaaaataa gttatataaa atacttcatt 8401 gtactgagga gggttccagt ttctagtgtc actttcttac atacatttat tcatgcatga 8461 atgatttatc ctgttagctg caagatatga gcctacaaaa gctcttcacc cccttgttct 8521 cacttgttga ggcttggtgc aagatgtgaa ctttggaacc cggaggatga ggcgactgat 8581 agaaccaaaa ctgattcagt ggcgattagg cgtatggaac agcggtacgc tttgcgctta 8641 gcaagcgtcc caagggcaat tgctaacact gaataaagag gtaaaaatat ccgaatagct 8701 tgtacagcaa tcctttggaa aaagatacaa tcaattaaca aaggaaagct taaagcattc 8761 aaggaatttg aaaaaaagaa gcaacggcag accgccaatc cgaacgttgc ctctctaatt 8821 tctgtaaaac ccattagaaa tggatttcca caactatggt aaaccaaatt gtagacattg 8881 tagatagtct gagttaccca gagctagccg gacttctcac ggcttgtggg tatgctattc 8941 gagagcaaac gaaactttct agggctttgc ttgaactcag ttttgaaatt tctaaactgc 9001 atgataaaaa aggctttttt aatcctgaat atcaggatgt ggtgaacgat atagacttag 9061 cacttttggc cggtttgtgt gctgtaaaat tgactcagat ggctacagtg attgaggaaa 9121 taaatcgaag atatcatgaa gtttttgtag attaacaaac gaataccccg tccttaagga 9181 cggggtttat ccttataagg atgaacattc gtatcaaatt cagtcaaaaa actgaggtac 9241 ggaaaagcta acattttgtc atcaaaacaa atctactaat tgatgccaag tttctagaaa 9301 cttatcaaac cgaaatgatt aaatcagtga 
acagctaccg tagctattca ctgatttaaa 9361 acgctgtcca agattgtcta atttatcaat gaacggcata gtaaaggata ttttgaattt 9421 taaattgctt agcactatcc tactgatctc aaagatgcag aagaagctca cccgtagcga 9481 cttgtaaaaa cgagagcttg atcacccaac acaactaaaa agataacaaa aagcactcag 9541 gagtgttacc gtgtttccac cctaagcgct ttttgtttta cacttgctcc tcacacacac 9601 tcctagaaaa taacattgac tttatcaaag agcaagtaac ctaggtgaca ctttgaaaac 9661 tgaaccttct gtgttccgtt cagcaaagca aaaagcgcct gaacgtctgg ccgtgtttcg 9721 acttcaagcg ctttctgttt tcaacttact cctcacacta cataaacaat atcacgagtt 9781 aaatgagatc acaaatctaa tgggtgacac ttcagcaact gctctactgt cttaagacat 9841 agagtctgaa tctttttcta tgtctcttgg gttacttcta agttgacatc cctcttagga 9901 aagagatttt taatgtagga agtgacggat acccgtgaac accatgacta aacccagttc 9961 attagcagcg agtatggaat cttgatctcg caaacttccc cctggttgca caatagcctt 10021 aattccggct gcggctgctg ttttgactga atcatcaaag gggaagaatc catcgctggc 10081 aagaattgct cctttggctt tttccccagc ttgttctaaa gcaattttaa ctgagccaac 10141 acggttcatt tgacctgcac ccacacctaa agttgtgcga ttgcgcaacg cgcagacgcc 10201 taaaggcgta tcgccactca caacaatcgc attagactta acgtgcttgc aaactttcca 10261 agcaaacagc aattcttcta actcatcatc acttggtttc ttttcggtga cgatttgcca 10321 tgtactggtg ttggctaccg cgtcatctga agcttgcagg agaaaaccac ctgcaatgac 10381 ttttacggtt tctttcggtc cactcttcaa gtctgggaaa attaatactc gcacttttga 10441 tttagcagcc aaaatttctt gtgcttcagc atcacaactt ggtgcaacca cgcattctaa 10501 aaacgtcttt gttaactcag tagcagttcc cgcatcaatc ggacggttta gtgcaacaat 10561 tccaccaaaa gcagaaacag agtcagcgtt gaatgctttt tggtaagctt cttgaatgct 10621 atgtcctaaa gcgacaccac agggattcgt atgtttgata attgttgctg ctggtgtctc 10681 ggtgaattca gaaataatcc gccgtgcggc ttctaagtca accaagttat tgtaactgag 10741 ttctttgcct tgaagtttag tagcagctgc ccatccagtt gaagttgtac cagtttcata 10801 ccaagctgca ctttgatgag gattttcgcc gtaacggaga gattgtagtt gttttccctt 10861 aagggtgtat tgttgaggta attctgcttc tttgctaagg ttctgactca ggtatgcggc 10921 gatcgcctga tcatagctag acgtatgcaa aaatcccttt aaggcacact tttgccgaaa 10981 ctccagagat ggttcgccag 
ctgtgatgcg catttcctgt aaatattctt catactgcgc 11041 tgggtcacat agaattgtga gatgggcaaa gttttttgaa gcagccctga gcatagcagg 11101 accaccgata tcaatttgct cgatcgcctc agctaaagtc acaccttgtt ttgcgatcgt 11161 ctcttcaaaa ggataaagat tcaccacgac taaatcaatc gggcgaattt ggttattttc 11221 caagtctgct acatcttcgg gtacatcgcg ccgtgctaag atcccgccat gtatccgagg 11281 atgcagcgtt ttgactcgac cacctaaaat ttctggagaa cctgtgtaat cagcaacttt 11341 tgtaactggt agccctgcat ctttcagtgc tttggctgtt cctccactgc tgataaattc 11401 aaagtcaaat tcttctacta agctacgggc aaggtcgatt aatcctgttt tattagatac 11461 actcagcagt gctagacgcg ccatattttc tgggttccct ttagtgtaaa tacagttagt 11521 gcagatacat attgtataaa ccgcaagggc acatagacgc aaagcggctg aagacaaagc 11581 attgcctgga atacaataac tgtctcaggt tgcatggtgc tatactgaac gttggcttac 11641 ctaaatagag gtaggagatg aaggatacag gactagagca acagaaaaag atcaccgtat 11701 tacgcgatca gtctccaagc ggagcactgg taaagcgtga tactggattt gatattgaag 11761 tatataaaga agttttaagt gatcgcttca tagaagcaag aacagtagac gaatatgaaa 11821 agctccttgt attgagacaa agagttcaag aattagacat agaagttaga cgattagact 11881 acgcagagag atcagcagaa attcagcttc aacaggctca gcagaaagct ctcctccaac 11941 gcgggcagca aattgtagct attataatct ctattgctgc tgggctttat ctgctacaaa 12001 ctcttccctt agcgggtcta ttattcttga ttctgggctt agcaaaaccg ttaggctatt 12061 ctttgggaga aataggtaat ttcttagata gtttaaaggg ttttccaaaa gactcagata 12121 agcttttgtc tgatggaaaa gaacaaagag atcaagctga ggagtctaga gatgcaagac 12181 cctaaattag acatgattaa cgtatttaag cttgagaaac ggcttcagtt acttgttgca 12241 atttatatct cactaattct gcttgtacta acttttgtct tggtcaggat tccaatgact 12301 agtgatttac agcaggcttt gataattatt ttaggcttga ttacaatagc cactattgat 12361 gcatttagaa aagtactggg caatacttca tcatcatctt tagtgaacga atttaagcaa 12421 attattgttc aagatggagg ttcttatgtt gatggaaatg ttgatggaat cgataagcgt 12481 atcaatgttg aaggcaatta cgtagcgagt ggtgctatta attatgggta tgaaaaaaaa 12541 caaaatcttg ctgaagctgc tgctgaaatt caggatttgc tccaacaact agagaaatct 12601 aatcctaccg ctagcgaagc tgaaaaagtt gcatatgttg atgaggaaat tgagcctgat 12661 
ctaaagtcgc gtttagtcaa agcattaaaa actagtggtg aagtcgctat cgagagttca 12721 ttggatagtc gctatatcga tatcatcaga gctattatta aaagctggtc atcttcagaa 12781 tagttcaaca gcaagcccac cataaaaggt ggtacggttc tggttgatga cattttgtaa 12841 ccttaggcaa aagtataagc gacgcaaaca agctttagcg gtgtacgatg ttcgtgattg 12901 ggaaacaaaa cttctcaata tgatggatgt aagcgaatat ttttaccaac tgtcacgtta 12961 tggattacca agacatcatt acaattgagc ctggaaagcg cagtggcaag ccgtgtattc 13021 ggcgaatgcg aatcaccgtg tacgacatct tggaatatct ggcaggtggg atgactgaag 13081 cagaaatttt ggaagatttt tctgaactca ccttagaaga tatcaaagct tgtcttgctt 13141 ttgcagctga tcgtgagaga aagttgtttg tggcatccct gtgaaactgc tattggatga 13201 aaacctatca gaccgaatca ttcacaggat tgtcgatttg tatcctaatt ctgagcatgt 13261 caaaacttta ggactgacaa atactgatga tacagttatc tgggaacatg caaaggcgga 13321 tgattttgtg attgtttcca aagattctga cttccatcag cgcagtttac tttatggtca 13381 tccacccaag tttatttatc ttcgtattgg taacagtcca acatcgaaga ttattcaaat 13441 attgagagat aattttgata cgatcactca atttaaaagt agcgaaacgg aaagtatttt 13501 ggtgttgatg tagtgtggaa ccgccgcata acaacctcct tgcacccgag cctcgaaggt 13561 tatttgtgag tgtctgaggt tacttgcgcc gggtgaaacg gaacattagg ctgcttcgtt 13621 tctgggttgg ggcagtattt caaattagtt tgtcagagtt caagaggaag gtattaattt 13681 gatatgacat tagctatttt agttgaacct acgcccctag aagttgatgc caatggtgtt 13741 gtaagagttg gagaaactcg tgtaactcta gacaccgttg taacagcttt tcttgaaggg 13801 gctacagcag aagaaattgg tgagcaatat acttcgctgc aactctcaga tatttactca 13861 gtccttggtt actacctgag acataaagct gaagttgatg catatctctt agaacgtcaa 13921 cgtcaagcag caatgatccg acaagaagct gaacaacgtt ttaacccagt tggaatacgt 13981 gagcgtttac tggctagacg aagtcagcac gggtaattca aaatggtacg attcctagct 14041 gatgaaaatt tcaataatca gattgttcgt ggtgttcttc ggcagagtcc tagtattgat 14101 attttgcgta ttcaagatgt tgacttatcg ggagccaatg atccgactgt tctagaatgg 14161 gcagcccaac acaggcgcgt tgttctaact catgatgttg ccacgatgat aaccttcgct 14221 tacgaaagga ttcaagcaag attatctatg cctggattat ttgaagtgag ccgtcgtgtc 14281 tcagtaggtc tagccatcga ggagattata ctgattggtg agtgtagtct 
tgagggagaa 14341 tgggaagggc aagtaaggtt tcttcctctt cggtaaaagt agcagcaaaa cgccacctaa 14401 taaccgtgtt gcagcggaca gaagagatat attggtagag atacaaaggt tgtctaccgc 14461 cgctcaattg tgccgttatg ccagtggagt ccttagtttt gctgatgcaa aaattttttt 14521 cttgtcattc catacttttt atgcttgact gtcctttgtc cttagtcatc tgtaatgact 14581 aaggactaat gaccaatgac taatgactca taactaatgg ctaacactct agaatttatc 14641 agtgtacctc caaaaacagg gcaaccacca aagggtttaa ttgttacttt acacggttgg 14701 ggtgccaatg ctgaagatgt ggcgtcttta tcgcggtttt ttaatttgcc agattatcag 14761 tttctgtttc cgaatgcacc ttttccttat cttaattctt ctgttggaag agcatggtat 14821 gaccttcgga tggaaaatat gtatcagggg ttagtagaaa gtaggcagct actaacaggt 14881 tggttgcaat ctttagaaaa taacactggt gtgcctttgt cgcggacaat tttgagcgga 14941 ttttctcaag gtggagctat gactttagat gtaggattaa agttacctct ggctggttta 15001 gtttctttaa gtggttattt acatcaagat gtagaaagtg tgaaaacaca aaatatggtg 15061 tctctaccac ccgttctcat tatgcatggc agacaagata cagttgtgcc attacaagct 15121 gctgtttcag cacggaaaac tcttgaatct cttggggcgg ctgtagaata ctatgagttt 15181 gacatgggtc atgaaatccg accagaaatg ctagagttgt tacgaaattt tgttgttgtc 15241 aatgcataaa tactcggaat ttgtctttgt ttagttgggg gtaccaaatt tttacgcatt 15301 ctcttagaaa tccgaaaaat tttcacgaaa ttatcgctcg caagtgagta atatatattg 15361 ggtggctgcg tcaagcgcaa ctcaacccat gctggatgca aatgctgcta atgggagggg 15421 caagcactat gaaaactcta agcatttcta agagagaaat tgctactatc actccacaag 15481 aggtggaagt gttagctaca cgtctggagc aggataatta cagtaatgct tttgagggtt 15541 tgaatgattg gcatttactg cgagcgatcg ccttttcgcg tccagagtta gttgaatcat 15601 acattcacct cttggattta gaaccctacg atgaggcgta gatgtttcag aagaaacggg 15661 ttctgattgc aataggcggc ggtatcgccg cctataaaat ctgtgaggtt gtttccactt 15721 tgtttaaaat tggagtggaa ataaaagtta ttctcactaa ttcagcacaa aagtttatta 15781 cgcctctgtc tctggcaacc ctttcccgcc atccagctta tacagatgaa aatttttggc 15841 aatcaactca ctttcgtcca ttacatattg atttaggtga gtgggcagat ttaattgtga 15901 ttgcacctct aacagcaaat acattagcaa agttggctta tggtatggct gacaatttgc 15961 tcacaaatac tgttttagct tccacttgtc 
ccatactgct agcaccagca atgaatactg 16021 atatgtggga acagcaggcg gtgcagcgca attggcaaca ggtattgaca gatggtcgat 16081 atcatggtat gtgtacaggg ttggggttat tggcgtgcga tcgcgtcggt gctggtagaa 16141 tggcagaacc cccagagatt attacttatg tccaatcctt gttacacacc gccgggaagc 16201 gagatttagt aggtaagaag gtgttgatta gtgctggggg aacacgagag tatcttgacc 16261 cagtaaggtt tattggcaat ccttccacag ggaaaatggg gttagcttta gcacaagcag 16321 cactgcaccg gggagcaagc gtgacattgg tacatagtcc ggcgagttgg gatgtaccat 16381 taggagtgca agcaatctct gttgtgagtg ctgaccaaat gcgagcaagt atggtggagt 16441 acttaccgaa tgcggatatg attatcatgt cggcggcagt ggcggatgtc aagccacggg 16501 aatacagtca acagaagttg ccaaaaaaat tgcttccgca agctttacct ttggaaccag 16561 taccggatat tgttgcagaa ttggcacgtc tcaaacaacc acatcaacaa ttaattgggt 16621 ttgcagcaca gacaggggat attgtcacac cagcgttaga gaaattgcac agtaaaaatc 16681 tggatgcgat tattgctaac cctatagatg aaccggatag cggttttgga agtgataata 16741 atcgagccat atttttggat aagcaaggac aaagaataga aatttctcct tgttccaaat 16801 tacaaatggc tcatcatatt tttgatgtgt tggcaaagaa ataacgaaag gaactgacaa 16861 gtcaccaaaa atgcgataac ttcgttaagc tgcgaagatc gctaattttc agtttcatta 16921 agatgtgctt gtaaccaatt tagcaagtca cccactgctg taaaatctaa taaggcttcg 16981 gcaagtgcct ctaactcttc cagtgacaga gcctgtattt gctttttaac ctcttgaggt 17041 aactctccta ctcgtctttg tagtagccgt agaacgagtt cttgctgtcc acgttcgtag 17101 ccaatgcgct cacctgtggt aatgtaactc atagtacgct cctgctccgc agttgctaca 17161 acgcggggaa cccgcgcaac gcactgctct tcaaactcct taaaatcttg ccaaaactct 17221 gcttctaatg cttttggtaa aatcataacc caatccacaa agcggtaaag gttacgaata 17281 tctttttctg ctaaacctag ttcatacaat cggcgaatta agctaaattt ccaagttttg 17341 cgttctcttg gctttttact ggtctgctgt gttttcaaat gtgccatgac aaccgttgca 17401 aaaggattat cgctcttctc taattccgtc caacggtttt gataatcgag aagtttgatg 17461 gttccaaatt taaaatgcaa actagtatcg ggataattat aactgtattg atttggtcgc 17521 catgtcaggt ctgtatcaca caaaatcgct aaactaatgg ctggtttggc aaatctgtca 17581 aaaattcgta ggttgtagga aaacattctt tctgcaaagt catcttccgg tttagcctgg 17641 acttcgacat 
ggattaacag ccaaatttct tctccttgaa tttgccaaac tttgaccaat 17701 ttgtctgcgt atcttcttcc ttgttcggca ttgcgggcta tttgctggaa ttccttatca 17761 agaaattcat ggggacgttc ccaattaatt aatgcagcag tttgagggaa gaagaattcc 17821 attgcttggg gaaagtaggc ttctaagatt tctttccacg gtgaatcatt atcgaccctt 17881 tcacgttcgg aggtcatgaa ttagcttggc tttatagtac tggtgtttca attatcaagc 17941 ttacggactt gatatcacaa tcaataaccc gaaaaatgcg ataagcctct ggctaagagt 18001 tgtgcgtatc gccgttgcag ctgctttgca gcatcgccta agcgccctac gccccttctc 18061 ccttctcccc tactccctca tttacccgga cacaacttct tgggcaactc atcacgggta 18121 gcgaggtatt cttttagata gttgcaaccc cgtgctagta aatcatctaa atctaaactc 18181 cacaatttga tggtcttgtc ataacttcca gaagctaaag ttttgccatc cggagtaaat 18241 gccacgcccc agacaatac // LOCUS NODE_1807_length_18240_cov_5.36497118240 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18240) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18240) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18240 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(57..146) /locus_tag="DP116_15685" /pseudo CDS complement(57..146) /locus_tag="DP116_15685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_076611791.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS1 family transposase" gene complement(200..475) /locus_tag="DP116_15690" /pseudo CDS complement(200..475) /locus_tag="DP116_15690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113683.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 1080..2033 /locus_tag="DP116_15695" CDS 1080..2033 /locus_tag="DP116_15695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010478156.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_15695" /translation="MQSNLRLDWKIRLLDWLLRLNKPLDQLSLDELRKLSETPIPFVV ERLMGGKRIRVLSVINQIVEGRHGEIPIRLYYPSSKQNLPLILFFHGGGWVFGNFQTY DLMCRRIAHSTSAIVIAVGYRLAPWFKYPTAVEDCYDILTWAVKNATNLGANNQQVIV MGDSAGGNLATSVCLMARDQGQRLIARQILVYPVTDGTLSQPSIEVYANAPVLTKDLM QCFVKYYARTEADRFEPYFSPMLAENLSYLPPALIITAEYDPLHDEGQKYAQRLHSAG NQVRLIDYSGMVHGFLSFPPFCPEALPAFAEIAAYVGALSN" gene 2121..2417 /locus_tag="DP116_15700" CDS 2121..2417 /locus_tag="DP116_15700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316132.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TatA/E family twin arginine-targeting protein translocase" /protein_id="PRJNA477356:DP116_15700" /translation="MNVFGIGLPEMAVIFVVALLIFGPKKLPEVGRSLGKAIRGFQQA SSEFQNEFQKEAVELQEAVKTTAELDTKQTAEPETKQIEAAKLEQDTVSSAQKS" gene 2430..3083 /locus_tag="DP116_15705" CDS 2430..3083 /locus_tag="DP116_15705" /EC_number="3.1.1.29" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875715.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aminoacyl-tRNA hydrolase" /protein_id="PRJNA477356:DP116_15705" /translation="MTEAVAKKTLVIPQLVVGLGNPEPKYDQTRHNIGFAAIDALSRS WKIPLAENRKFQGEYGEGIAPNGDKIRLLKPLTYMNLSGQAMQAVTSWYKLQPELVLV IYDDMDLPLGKTRLRLSGSAGGHNGMKSAIAHLGTQNFPRLRIGIGKPKNAASHDEHG TVSHVLGRFSAAENQMVSVVLQFVGECVELSLNSGVEKAMNVCNSRSFDSSQSNVYF" gene 3163..3648 /locus_tag="DP116_15710" CDS 3163..3648 /locus_tag="DP116_15710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002632428.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_15710" /translation="MSSEKVRSTHIKSAFPGLHRSLLDIVGAMNRPELDQAMLEMAGL SLEPALFTPLVLIAKLGPIGVVNLAGRVGRDYTTLSRQVARLEELGLVSRQISSADRR VREAVITRKGKIATDAIDEARERIALTLFRGWSRDDFDQLVRLMRMLADRLNETPGGN A" gene 3745..4155 /locus_tag="DP116_15715" CDS 3745..4155 /locus_tag="DP116_15715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013570650.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ketosteroid isomerase" /protein_id="PRJNA477356:DP116_15715" /translation="MMNTQALVAQAYSAFNRRDIDGTLALMSENVSWPKASEGGRVVG KQDIRAYWTRQWAEFDGYVEVLQVIDREAGKVDVKVRLLVKNLKGDVLSDTELWHLYT IANGLIERMDIKEEGESNSDLGPSAAFSGHNRAK" gene complement(5017..5196) /locus_tag="DP116_15720" CDS complement(5017..5196) /locus_tag="DP116_15720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655011.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S21" /protein_id="PRJNA477356:DP116_15720" /translation="MTQVVLGDNEGIDSALRRFKRQVSKAGILADVKFHRHFETPLEK RKRKAVAARRKRSMR" gene complement(5318..6178) /gene="rsmI" /locus_tag="DP116_15725" CDS complement(5318..6178) /gene="rsmI" /locus_tag="DP116_15725" /EC_number="2.1.1.198" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316129.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="16S rRNA (cytidine(1402)-2'-O)-methyltransferase" /protein_id="PRJNA477356:DP116_15725" /translation="MDIKSGTLYIVATPIGNLEDMTFRAVKVLQTVDIIAAEDTRHTG RLLQHFQVTTPQMSYHEHNRSSRIPELLEQLSNGKAIALVTDAGIPGISDPGYELIKV SVEAGITVVPIPGANAAMTALSAAGLPTDKFVFEGFLPVKSQQRRSHLESLKIEPRTL ILYESPHRLRETLEDLAEVFGNTRQIVIARELTKLYEEFLRGTIESAIVHYSQREPQG EYTLVVAGTPPTQPQLSEEELKAELQKIMSQGISRSQASRQLAKEISFPRRQLYQLAL SIKMPDQDSH" gene complement(6332..7492) /locus_tag="DP116_15730" CDS complement(6332..7492) /locus_tag="DP116_15730" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15730" /translation="MPIKRVVKNNATYLYLTEEVYDPEKKRGKTVVKKTLGVEEPAAP LNSMTEEFAVVWAENRTLGNAVPFSDRVTGQFPPENNGHGVILPCDIVECGKFRNGKL RWWCRTHQVHWGTKADIQQASESEEGAIRCSNATQPMNYVKNPLILNPDDYAGGIGIW AALPTAINTTVYPDLANVEVHVHVRPEPRGKKTIDANFPAIVIRSTDHTPLFANVSIK RVVIASPSALAYLEALINNLPLGTLYCNRCNHPHLDLGDFAKNPHKKHFCGNCGSDSN WSSEAIVSSPIKELADKLNGNLMFVRSDRKLDLRDYADCQFKIWASTPAILWTSELSQ EIGIHVHVYRSDKKIIDNTFGDVTWTDGTKLERENLLSQMLQKCRQQKAPSV" gene 7866..9227 /locus_tag="DP116_15735" CDS 7866..9227 /locus_tag="DP116_15735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017303760.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TrpB-like pyridoxal phosphate-dependent enzyme" /protein_id="PRJNA477356:DP116_15735" /translation="MDTIKYTLSENQMPQSWYNIQADLPTPMAPVLHPATHQPITAKD LEPLFPAALISQEVTTERWIEIPEEVQSIYRQWRPTPLYRARRLEQALDTPAKIYYKY EGVSPAGSHKPNTAIPQAYYNKQAGVKRLTTETGAGQWGSSLAVAGAFFGLEVVVYMV KVSYRQKPYRRAFMESFGARVIASPSDETQAGRKILQENPDSTGSLGIAISEAVEVAV QDEQTKYALGSVLNHVLHHQTVIGQEAVTQLEQAGDYPDIIVGCTGGGSNFAGIAFPF MGAKLRGEQSDIKFVAVEPAACPTLTKGKYTYDFGDTAHLTPLVKMHTLGSTFVPQGI HAGGLRYHGMAPLLSHVVNLGLIETRAYTQLDCFASGLTFARTEGILPAPEANHAVKG AIDEALRCKEEGVSKTILFNLCGHGHFDMQAYIDYKAGLLRDTEYSAEEIAMALSGLP VIR" gene 9233..10090 /locus_tag="DP116_15740" CDS 9233..10090 /locus_tag="DP116_15740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar kinase" /protein_id="PRJNA477356:DP116_15740" /translation="MEKYGLFVGLVTLDLIYLAQSPPLNNQKIVAADYTVAAGGPATN AAVTFSHLGNQATVLGVVGSHPMTQLIKGDLANYKVEITDLNPTTQNAPPVSSIIVTQ ATGERAVISINAVKTQATRESIEPEVLQNVDIVLIDGHQMTVGNEIAQIAKARNIPVV IDGGSWKNGFDKILPFVDYAICSANFHPPNCQTEDEVFAYLSGFGISHIAITHGQKPI RYLQDGKAGFIDVATVQAVDTLGAGDIFHGAFCHHILRESFTTALQQAAHVAALSCQF FGTRRWMHS" gene 10215..11051 /locus_tag="DP116_15745" CDS 10215..11051 /locus_tag="DP116_15745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866316.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3'(2'),5'-bisphosphate nucleotidase CysQ" /protein_id="PRJNA477356:DP116_15745" /translation="MKDLENILELARSVSWAAADILRSYYHQTDDKLEVEYKQNEPVT IADVNVNNYILENLQGVLGDKDFAYISEETYKGEHAKQEWVWIIDPLDGTRDFIEKTG EYAIHIALVQGTRPVLAVVAVPEAEKLYYATKGSGAFVETRDGKSLPLRVSSRERLED LTLVVSRSHRNERLNYLLQHLPCQNQKAVGSVGGKIAAIVEQQADIYISLSGKSAPKD WDIAAPELILTEAGGQFTHFDGTPLQYNTGDVNQWGGLLASNGQYHEVLCQEAERILA KF" gene 11986..12837 /locus_tag="DP116_15750" CDS 11986..12837 /locus_tag="DP116_15750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309180.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_15750" /translation="MTVAVHLENVHKSYNSIPVVNDLSFTINAGEMFGLLGPNGAGKS TTIRMLTTLTKPTQGQIEVFGYDVVSQPILAKQCLGVVLQAISVDGDLTVWENMELHG RLHHIGNPGRQRLIEQWLEYVELGTRRNSLVKTLSGGMKRRLQIARALLHQPQILFLD EPTVGLDPQTRRRLWEIIRDLNKQGMTMLLTTHYMDEVEYLCDRIGIMDNGKLISLGT LQELRSTHGEGLVMKQVGERWEYVFFPTLEDANLYLNQQQDKTGMMVRPSNLEDIFVE LTGRKLD" gene complement(12840..13460) /locus_tag="DP116_15755" CDS complement(12840..13460) /locus_tag="DP116_15755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15755" /translation="MPLVAQAQDALPQQSSEEIKGLLQEGRKLVDSGDYNGAIALYQR ASALEPKNATIYSGIGYLYALQGNFPSSLQAYRRAVALNPNNSDYQYALGYVSGNLGD NKAAKEAYRRAIQSNRSNVNAYIGLATVLLRLGEKENVKWAYEQAVSLDPKNPQVYEL RGNILMKQGKSKDAIAAFQQARDLYQKQGKQDSVVRIEAVLRTLRG" gene 13776..14111 /locus_tag="DP116_15760" CDS 13776..14111 /locus_tag="DP116_15760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458250.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15760" /translation="MIVTTTDVIQGAVIQSYLGIVTAEVVYGSNFLRDFFASIRDVIG GRTGSYERLFEEGQRKALEELERRALRLGADAVVGIEVDTGTINVDQSGVLMLITATG TAVKLRQQL" gene 14505..15140 /locus_tag="DP116_15765" CDS 14505..15140 /locus_tag="DP116_15765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746894.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15765" /translation="MSYTNRVGDEVITEPAVVGRVADYHDRVRWGPILSGLLIALATQ LVLSSIFAAIGAGSIEASGRPRTIASDVTGNVGIWSTIGLLISLFTGGWVMARACGPM NRNTALLNGAILWATTLAVGSWLLASGVSGAFGVAASNAGAVVNQVQQQGGVNIPQNT PNVSAQQAREIAANVRSGLWWFVFGSLLGLLASMIGAATGTRSPRTNNYVS" gene complement(15312..15542) /locus_tag="DP116_15770" CDS complement(15312..15542) /locus_tag="DP116_15770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006621764.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15770" /translation="MHRGVGNTVRNTARVLPKKQKCVGWVEERNPTFGTLCWVSLRST QPTNILNCGHSVNYLCCNTVGDKTKTVIRMAM" gene complement(15552..17876) /locus_tag="DP116_15775" CDS complement(15552..17876) /locus_tag="DP116_15775" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15775" /translation="MPESENPKDPGKVSNNDLRNTQFGGGLIDAQNVNAGRIGGDIRN TVNFSFGQRASNELLNPKTRNHIRKILLRQVSTEVESRISTSLHNRIYIVQDTDQNPS EIELPWASEIKVGSTPKIHLTNTEIIAIYDQPDIAGRLLILGNPGVGKTTMLLKLAEE LVKRAKNDSAHPIPVLFALSSWKNDSNSIKDWLVDQLKHKYGVRKDIGKQLVENQEIL PLLDGLDELAAERQEKCVVKINNFINAGWSNPLVICSRIQEYQRYKALLQLNNSLELC PFTQEQVNQYLQNTDNLQLCDSINQDQELSKLAKTPLLLNIIVLSAQELSIETWQRLK SSQERLSYLFETYISRMLKRKYTGKQPDPEKTKRWLNWLAQRLKDESATEFFIEGIQP SWLKKKIQKFVYNLIVWGLIGGLISGLIFGLIGDRIEKIKLINHLRSFRLTTIKFMIS GLIYGLLFGLIGWLIGWLTGGLNVLIPGLNGRIFGLIFGLISGLIFGLIYGLLFGLIG DEIQTVEIIKFSLKKSSMGLIYGLIYGLIYGLIFGLISGLISGLISGLISGLIFGLIY GLAGLIYRLISGLILGLILGLISGLIFGIDGRTIENKTIPNKGIRQSVINTVIISTVT CLLATLILLLGIKIYGGKLDLSSSLVSGLIIGLLIAIPKSGTPAIKHFVLRVIFWGNG YAPWNYAKFLDYCTNRLFLQRVGGGYRFMHDLLRQHFANYSVPSNLAPIQETTVASSV LIPDYISCTSCGHHNSTNCKFCTQCGMGLNKLDV" BASE COUNT 5076 a 3855 c 3950 g 5359 t ORIGIN 1 cactcttttt ttgtttcttc agggtaaccc tgctgttcat aagaatctat aaactaacga 61 ccacaatgaa cacatatgtg attttgttta ccatactttt tcccattctt gcgaatatga 121 gtggatgcac accttggaca tttcatagca atacgcctct ggcgtctgcg ctttgcgcaa 181 tcgcatgatt catacttcaa ttatgcaacg ccgacaaatt tattggattg gcgatcgcac 241 catacttaat ttcttctaat tctgtactca aatcagacga aagttcagca tagcgttcgt 301 tgaaatactc gacagctgac cccacaggat gaccttctag cagtcgcttt agtgtacttt 361 gaaaaactgc taactgttta ccagtttttt cccagacgaa agaacaaccc caagcgcgtt 421 ctacgtgacc gatgactgcc aaagcacctc cattggggcg actgaggagt cgtctgtttc 481 ttcaattgcc ataagtaaat tttcttgacc tcggtttaat ttctcttttg taaattttat 541 aacccttata tagcaaggct ttcataaagt gaagaaaaat gtgcttcttt gatttgttgc 601 tagcagagac tttcgccgct agtgagaggc tcatatcttg cacttaaccg tataactcag 661 caaaaattat actacaagag attacttctg tgagttctta cgactttatt ctcgaagctt 721 tatttagcgc agcctcatgc atatttagta tgttttttac aaagaatgtt cccagttctt 781 ttttggatta aaaccaagag tatttcctag taagttcatg ttaagtcatg gtgttctttc 841 ttcctgaata tagatgtggg tggtgttttg agaacttcag aaatttgaac tttagttgct 
901 ctgatctaaa cttaccgcct cacactccgc caccaaagtc ggatcttcaa gtcggatcaa 961 gtcggatttt gataagtaag tcggcacaat aaaataaaac tgtattgata aaagctggag 1021 tttagactag cactgaaaac ttatgcagtc agtgaccaaa agattctaga attgttaaaa 1081 tgcaaagcaa tctcagactt gattggaaaa ttcgtctact tgattggtta cttcgcttga 1141 ataaacctct tgatcaattg agtcttgatg agttacgtaa gctttctgaa acacccattc 1201 cttttgttgt agagcggctg atgggtggaa agcgcattcg ggtactaagc gtgataaatc 1261 aaatagttga ggggcgacat ggtgaaattc cgattcgact gtactatcca tcaagcaaac 1321 aaaatctgcc tctgattctg ttctttcatg gtggcggctg ggtttttggc aattttcaaa 1381 cctatgactt gatgtgccgt cggattgcac acagtacaag tgcgattgtg attgcagtag 1441 gctatcgact tgctccttgg ttcaaatatc ctactgcagt ggaagattgt tatgatatcc 1501 tgacttgggc ggtcaaaaac gcaacgaatt tgggagcgaa caaccaacaa gtcatcgtaa 1561 tgggtgatag tgctggagga aatttagcga catctgtgtg cctgatggca cgagatcaag 1621 gacaacgatt gatcgctcgg caaattttag tttaccctgt tacagatgga acgctgagtc 1681 agccttcaat agaagtctat gcaaatgcac ccgtcctgac aaaggacttg atgcagtgct 1741 ttgtgaaata ttacgcccgc actgaagctg atcgatttga accgtatttc tcaccaatgc 1801 tggcagagaa cttaagttat cttcctcctg cgttgattat cacggctgaa tatgatcccc 1861 tacacgacga ggggcagaag tacgcccaac gattgcactc agctggcaat caagttcgct 1921 taatcgatta ttctggtatg gtgcatggct ttttaagttt tcctcccttt tgccctgaag 1981 ctttgcctgc attcgcagag attgcagctt atgttggagc attaagtaac taaaattatg 2041 atcgcagcaa tgacggtaac tattgattaa actacatatg tgggtttcca atgcggtttt 2101 tgagttgact ggagaaaaac atgaatgtat ttggtatcgg tttgccggag atggctgtaa 2161 tctttgtggt agcactgtta atctttggtc cgaaaaagct accagaagtt ggtcgcagtc 2221 tgggaaaagc aattcgcggt tttcaacaag cttctagcga gtttcaaaac gagtttcaaa 2281 aagaagcagt tgagttgcaa gaagctgtaa aaacgactgc tgaactagac accaagcaaa 2341 ctgctgaacc agaaaccaag caaatagaag cagcaaagtt ggagcaagac accgttagct 2401 ctgcccaaaa aagctaaact gaagtcaaaa tgacagaagc tgttgccaaa aaaactttgg 2461 ttattcccca attagtcgtc gggttgggga acccagaacc caagtatgat caaacacgcc 2521 acaatatcgg ttttgctgct atagacgcgc tatctcgttc ttggaagatt cccttggcag 2581 
aaaatcgtaa gtttcaaggc gaatatggcg aaggaattgc accaaatgga gataaaatcc 2641 gtttgttgaa gccattgact tatatgaatc tttcaggaca agcaatgcaa gccgtgacaa 2701 gctggtataa gctgcaacct gagttggttt tagttattta tgatgatatg gatttgcctt 2761 tgggaaaaac tcgcttgcgc ttgtctggtt ctgctggagg acataatggt atgaaaagcg 2821 cgatcgcaca ccttgggaca caaaactttc ctcgtttgcg tatcggcatt ggtaaaccaa 2881 aaaatgcagc gagtcatgac gaacatggta ctgtttctca tgtcttagga cgattttctg 2941 ctgctgaaaa tcagatggtt tctgttgtac tacagtttgt gggcgaatgc gtagaactca 3001 gcctgaattc gggagtagaa aaggcgatga atgtttgtaa cagccgctct tttgatagtt 3061 cgcaatcgaa tgtgtatttt tgaccaacaa aggggtgtag gggaatgaac gcgcgggtgt 3121 acggtcttgc gggtataggg gtgtaggggg agaggcaatc ggatgtcatc agagaaggtg 3181 cgcagtacac acattaagtc tgctttccca gggctgcatc gttccctcct tgatatcgtc 3241 ggcgctatga atcggcccga acttgatcaa gcgatgcttg agatggcagg gctgagcctc 3301 gagccagccc tattcacgcc ccttgtgctg attgccaagt tgggtccgat cggcgtggtg 3361 aaccttgcgg ggcgcgtggg gcgtgactac acgaccctta gccggcaggt cgcacggttg 3421 gaagaactcg gtctggtaag ccgtcagatc agctctgcgg atcgacgagt gcgggaagcg 3481 gtgatcacgc ggaagggcaa gatcgcgaca gatgccatag atgaagcgcg cgaacggatc 3541 gctttgacgc tattccgggg ttggtcccgc gatgacttcg accaactcgt ccggttgatg 3601 cgaatgcttg ccgacaggct gaacgaaacc cccggcggca acgcgtaatc gaggtgcatg 3661 caaggtctga aatgatgaat cctggaactc gccagaggca tctcttaaat atgtgtataa 3721 tacacttacc ctactcaaag aagaatcatg aatacacaag cattggtagc tcaggcttac 3781 tctgcgttta atcggcgaga cattgatggc acgctcgcgc tcatgagtga gaacgtcagt 3841 tggccaaagg cttcggaagg cggtcgagta gtcgggaaac aagacatccg tgcttactgg 3901 acgcgtcagt gggctgagtt cgatggttac gtggaagtac ttcaggtcat cgaccgtgag 3961 gcgggcaaag tcgatgtcaa ggttcgtcta cttgttaaaa atctgaaagg cgatgtcctg 4021 tcagatacgg agttgtggca cctctatacc atcgcgaacg gactgatcga acgcatggac 4081 atcaaggagg aaggcgaaag caactcagat ctaggtccgt ccgcggcttt ttcggggcac 4141 aatcgcgcga agtgacgagc ttcgcggagc aaatgccacg ttcgcgaacg tttgttgaac 4201 gtcggtaaca agcaaagcac agggacgatc ccttcttgta gagcgccgga tctatccgac 4261 gttttacagc 
tttgcagcct cggcggcctc ggtttgcttc gcgttaagct agctgcgcct 4321 ccggcaatcg cccccaccaa attggacttt tttcacatga ttaccgaaaa agaactcaga 4381 actcagaact cagaatatta aacgcgcttt ttaagtatga ttaacagttc tggtcagccg 4441 tattggttta aagcccccgg ataaatcctt tggagaccaa caaagttttt ctctttctta 4501 aatcccccgg attccattcg tggggatatt ctgattactg cattctgtgt tctgtgttct 4561 tcttcataca tattttgtca gtccttagag gtattgacaa aatgaaaata accttaactt 4621 atcacaggtg cttacaccca gcctacggct cgctagcccc ggagggggcg ctgcgcaaac 4681 gctaacacac cgttcggctg agcgatcacc ttggcgaagc ccttacaccc ctaattattg 4741 acattcccac tactgaaata ggtgagctta gctcacctat ttcaaaaaga agtagaattg 4801 ttgggctgag tgcgactgtt cttcacttca aatattacac atcatacatg agtttctgac 4861 tcacctggcg tattctatag ttttgaaaac cgactagatc gtaccctggt aggggtacga 4921 ttcatcttgt ctaatttttg tatcaaggct accgaatcta tttaagcttt tctaacaagc 4981 gtgaacgtag tcaagcaacg tcagctccaa ttcggtttat ctcatacttc gcttgcgtct 5041 tgcagctaca gccttgcgtt tacgtttttc taatggtgtt tcaaagtgtc tgtgaaactt 5101 gacatcagct aagattccag ctttggaaac ttgccgttta aatcgacgca aagccgaatc 5161 aattccttca ttgtctccta aaactacctg ggtcattctt cttcctcatg agcaatactc 5221 tcattgtagt gcgactgcaa ccggcagcga aactcccctt gacaaaagta ggagaaagga 5281 aaaagtagag ggaatcacgg aagaagaaaa ttgatgacta atgactatct tgatcgggca 5341 ttttgataga aagagctaat tgatataatt gacgacgggg aaaggaaatt tcttttgcta 5401 actgacggct ggcttgcgat cgcgatattc cctgactcat gatcttttgt aactctgctt 5461 tgagttcttc ttctgaaagt tgcggttgag tgggtggtgt tcccgccaca actaaagtgt 5521 attcaccttg aggttcgcgt tggctgtaat gaacaattgc tgactcaatt gttccccgca 5581 aaaattcctc atacaactta gttaactccc gcgcgatgac aatttggcga gtgtttccaa 5641 agacttctgc taaatcttcc aaagtttcgc gtaggcggtg aggagactcg tataaaatga 5701 gtgtgcgagg ttctattttc agggattcta agtgcgatcg cctttgttga cttttaactg 5761 gtaaaaatcc ttcaaaaaca aatttatccg ttggtaatcc agctgcactt aatgcagtca 5821 ttgctgcatt tgcaccaggg attggcacaa ctgttatccc agcctcaaca gaaactttaa 5881 tgagttcata cccaggatca gaaattcctg gtatacctgc atcagttacg agagcgatcg 5941 ccttcccgtt actcagttgc 
tctaataact ctgggatacg gctgctacgg ttgtgttcat 6001 ggtaactcat ctgcggggtt gtcacttgaa aatgttgtag caatctacct gtgtggcgtg 6061 tatcttccgc cgcaatgata tccactgtct gcaaaacttt cactgcccgg aatgtcatat 6121 cttccaggtt accaataggc gtggcgacaa tgtaaagtgt tcctgatttt atatccatga 6181 ataaatatta tcttgtgtgt tgactggagc ttataacagt tttcaattgg gtgtaatata 6241 tagattattt ctaggggatt acatttttcc gcaaaatatt gtggataagg gtaccccgtc 6301 aaaggtgatg gggtagtgcg gataatttgt actaaactga gggagccttt tgctgtctac 6361 acttttgcag catttgagag agtagatttt cacgttctag tttcgtgccg tcagtccaag 6421 taacatcgcc aaaggtgtta tcaataattt tcttgtcgct tcggtaaacg tgtacgtgta 6481 ttcctatttc ttgtgacagt tcgctcgtcc acagtatcgc tggtgttgat gcccaaattt 6541 taaactgaca atctgcgtag tctcttaagt caagcttgcg atcgctcctt acaaacatca 6601 agttcccgtt aagcttgtcc gctaattctt tgataggaga gctaacaata gcctcgcttg 6661 accagttgga atctgagcca cagttgccac aaaaatgttt tttgtgcggg ttcttagcaa 6721 aatctccaag gtctagatgc ggatggttgc atctgttaca gtaaagcgta cccagtggca 6781 gattattgat taaggcttcc aaatacgcga gcgctgacgg cgaagcgata accacgcgct 6841 taattgaaac gttggcgaat aacggcgtgt gatcggtcga tctgattaca attgctggga 6901 agttagcatc tattgtcttt ttaccacgcg gttcaggtct gacgtgtacg tgaacttcta 6961 catttgccaa gtccggataa acagttgtgt taatcgcggt tggcagtgct gcccaaatcc 7021 ctataccacc agcataatca tcaggattca gtatcaacgg gtttttaaca taattcatgg 7081 gctgagtggc gtttgagcat ctgatagccc cttcctcact ctcggacgct tgctgaatat 7141 cggcttttgt tccccagtgt acttgatggg tgcgacacca ccaacgaagc ttaccatttc 7201 tgaacttgcc gcattctacg atgtcacaag gaagaatcac cccatgccca ttattctccg 7261 gtggaaactg tcctgttacg cgatcgctaa acggtacggc atttcctagc gttctgtttt 7321 ctgcccacac aacagcaaat tcctcggtca tgctgtttag cggtgctgcg ggttcctcaa 7381 ctcccagagt tttttttact acagttttac ctcgtttctt ctctgggtca tacacttcct 7441 cagtcaggta caagtaagtt gcattgtttt tgactactct tttaattggc attccttaaa 7501 ctcctagatt agtttccgta tacggaaacc gcgaacattg tacactcatt tctcgtcgtc 7561 atgcgaacat ttcttggata agctctactc aggctgtaga ggcacggcat agaacataca 7621 tgtcaagtta agatttaaac ccttgtcatg 
aggtcatttc ttgtttcttg ctttcggtca 7681 taaatgtata atttaaggca tttttcgggt gagtatcgga acgagagagc ttgacttttg 7741 attaagttga caccaatggg cataaaacaa cccctaccgt atgcgtgtat tctatccagc 7801 tgaaaatcgc tatagactaa aagcgtacga tagaaagaaa aattgattgc agtcactttt 7861 caccaatgga caccatcaaa tacacgctta gcgaaaacca aatgccacag tcctggtata 7921 atatccaggc tgatttaccc acgccgatgg caccagtgct acatcccgct acacatcagc 7981 caattacagc aaaagaccta gaaccccttt ttccagcagc cttgatttcc caagaggtta 8041 ccactgaacg ctggattgaa attcctgagg aagtacaatc aatttaccgt cagtggcgac 8101 caactccact ttaccgcgcc cgacgcctgg agcaagccct tgatacccct gccaaaattt 8161 actataagta cgagggtgta agtccagctg gtagccacaa acctaataca gcgattccac 8221 aggcttatta caacaagcaa gcaggggtaa agcgcttgac aacagaaact ggtgcaggac 8281 aatggggttc ttcgcttgca gttgctggtg ctttttttgg tttagaagtc gtggtttaca 8341 tggtaaaagt gagctatcgg caaaagccat atcgtcgcgc ttttatggaa tcttttggtg 8401 cgcgtgtcat agctagcccc agtgatgaaa cacaagcagg gcgaaagatt ctccaagaaa 8461 atcctgatag cacaggcagt ttgggcatcg ctattagtga agcagtagaa gtcgcagttc 8521 aagatgagca gacaaaatat gcattaggta gcgtgttaaa tcatgtgcta catcatcaaa 8581 ctgtcatcgg tcaagaagca gtgacacaac tagaacaagc aggtgactat cctgatatta 8641 tcgtgggctg tacaggtggc ggtagtaatt ttgctggtat tgcctttcct ttcatgggtg 8701 caaagctgcg cggtgagcaa agtgatatca aatttgtcgc agttgaaccg gctgcttgtc 8761 caaccttaac taagggcaaa tatacatatg actttggcga cactgcacat ctcacccctc 8821 ttgtcaagat gcacactttg ggtagcacct ttgttcccca aggcattcat gcaggcgggt 8881 tgcggtatca tggcatggca cctttgctca gtcatgttgt aaacttgggt ctaattgaaa 8941 caagagctta cacacaactt gattgttttg catctggttt gacttttgcc cgtactgaag 9001 ggattttacc tgctcctgaa gccaatcatg cagtcaaggg tgcgattgat gaagcactac 9061 gctgtaagga ggaaggagtc agtaagacca tcctattcaa tctctgcggt catggtcact 9121 ttgatatgca agcgtatata gattataagg caggattgtt acgcgatact gagtacagtg 9181 ctgaagaaat agcgatggcg ctgtcaggct tgcctgtgat tcgttgaact ttatggaaaa 9241 atatgggtta tttgtcggtt tagtcacctt agatttgatt taccttgccc aatctcctcc 9301 tctgaataac cagaaaatag tcgctgctga ctacactgtt 
gctgcaggtg gtccagcaac 9361 aaacgcggct gtgactttca gtcatttggg taatcaagcg acagtcttgg gtgtagtggg 9421 ttctcatccg atgacgcaat tgatcaaagg agatttagca aattataaag tcgaaatcac 9481 cgaccttaac cccacgacgc aaaacgcacc gccagtttct tctattattg tcacccaagc 9541 aactggtgaa cgagcagtca tttccatcaa cgctgtcaaa actcaagcaa cccgcgaatc 9601 tattgagcca gaagttttgc aaaatgttga tattgtgctg attgatggac atcaaatgac 9661 tgttgggaat gaaattgccc aaatagctaa agcgaggaat atcccagttg tcattgatgg 9721 tggtagttgg aaaaacgggt ttgacaaaat cttacctttt gtagactatg ctatttgttc 9781 cgctaatttt catcccccca actgccagac tgaagatgag gtttttgcct atctcagtgg 9841 atttggcatt tcccacatcg ccatcactca tggacaaaaa cccattcgat acctccaaga 9901 tggaaaagct ggctttatag atgtggcgac tgttcaagct gttgatacac tgggggctgg 9961 agatattttc cacggtgctt tttgtcatca catcctacgg gaaagtttta ccactgcatt 10021 gcagcaagca gctcatgttg ctgctctttc ttgtcaattt tttggcacgc gtcgttggat 10081 gcattcctag tagtcgattg ctgattgtga agtactactg taatcatggc gatcagcctc 10141 cggcttatcg catgtgcata tcgcccatga ctagcaatta aacacccaag ggagataagg 10201 agacaaggtg agaaatgaaa gacttagaaa acattttaga actagctcgt tccgtaagtt 10261 gggcagcagc agatatactg aggtcttatt accaccagac cgacgacaag ctagaagtag 10321 aatacaagca gaatgagcct gtgactattg cagatgtcaa tgtcaataat tacatcctgg 10381 agaatctgca aggagttttg ggtgataaag attttgctta cattagcgaa gaaacttata 10441 aaggggaaca cgctaagcaa gaatgggtat ggataattga ccctttagat ggtacacgag 10501 attttatcga aaaaactggg gagtatgcaa ttcatattgc cttagtgcag ggaacacgcc 10561 cagtgcttgc tgtggtagca gtacccgaag cagaaaagtt gtactatgcc acaaagggaa 10621 gcggtgcctt tgtggaaacc cgtgatggga aaagtttacc attacgagtg tcatcacggg 10681 aaagactgga agatttaact ttagtcgtta gtcgcagcca ccgtaatgaa agattaaatt 10741 acttgctgca acacctacca tgtcaaaatc agaaagcagt tggcagtgta ggcggcaaaa 10801 ttgctgcaat tgtggaacaa caagcagata tctatatttc tctttctggt aagtctgctc 10861 caaaggattg ggatatagcc gccccagaac tcattttaac agaagctggt ggtcagttta 10921 ctcattttga tggcacgccg ttgcaatata acactggtga tgtgaatcag tggggtggtt 10981 tacttgcaag caatgggcag tatcacgagg 
tgttgtgtca ggaagctgaa aggattttag 11041 caaaatttta gacttattta gaaccgcaga tgcacgcaga taaacataga taaattatct 11101 gcctccaatg ctcatgcatc tgcggtttca tattcaaaaa agaagttcaa tatagcagtt 11161 ctcgtttgca tgaagtacat ttttcaacct caccacgcca gatgctacaa cggagggaac 11221 ctccgcaacg cactggctcc ccaacccctc tccgaactcc ggagagggga gcgtttgcga 11281 tactcaaatg cggggtgagg tgacgactat attgcaccga agtgagaacc gctatatact 11341 cccttcaggt tcgctcaaca ggaaaaccag ttgcctacgt caggctcacc aatcgcgccc 11401 gagagcggaa gacgttggtg catagcgctg gcttattgat aactattaaa atattgctct 11461 gatctatata aactaaatct tgaattgcta tataatgaaa gtttccacta tttatcgtca 11521 ttcgggtgat ctacattagg agtggaacgt atgaaaagag aacatcgacc aacagttgtc 11581 aatgatcgac ttagagacat catagcccaa gtagagtcta catcggacac taattatctc 11641 aagttacttg aaccttctca gcctatacaa atattgccaa agactagtgg ttcttgtggt 11701 gaagatgaga gtgataatgt aaagaattgt aactaatttt ggctaaagca acataaagtg 11761 tggtaagtaa attctcgatt tcgttgagac agagagtgta gtaggttagt tttacatcgc 11821 tttgtcaaat gtttgtaatt tgttatcccc ttgccctcat agtatgaaag gcgggggtaa 11881 ttattttgtg tagcaaaaat atttagtgta gatcgaaaac tagagcgttc ccatctgcca 11941 caatggtcac aataggcgat gaaattctct gttggcaagc tacccatgac tgttgctgtt 12001 catttagaaa acgtccacaa aagttacaat agtattcctg tggtgaatga cctctcattc 12061 actatcaatg cgggagaaat gtttggttta cttggtccga atggtgcagg aaaatctact 12121 acaattcgga tgttaaccac actgacaaaa ccaacccagg gacaaataga ggtctttgga 12181 tatgatgtcg tcagccaacc catactagca aaacagtgtc tcggtgttgt gttgcaagca 12241 attagtgtag atggagattt aacagtatgg gaaaatatgg agctgcacgg aaggctacat 12301 cacataggta atcctgggcg acagcgcctc attgagcaat ggctggagta tgttgaactc 12361 ggaaccagac gtaatagcct agtaaaaact ctgtctggag gtatgaagcg gcggctacag 12421 atagctagag ctttgttgca tcaaccgcaa attctgtttc tagatgaacc aacagtggga 12481 ctagatcctc aaacaaggcg acgtctttgg gaaattattc gggatttgaa taagcaagga 12541 atgacgatgc tgctcacgac tcattatatg gatgaggtcg agtatttgtg tgaccgcatt 12601 ggcattatgg acaatggaaa attaatttct cttggcactc tacaagagtt acgctctact 12661 catggtgagg 
gtttagtcat gaagcaagta ggagaacgtt gggaatatgt ttttttccca 12721 actttggaag atgcaaactt gtaccttaat caacaacaag acaaaactgg catgatggtt 12781 cgtccttcta acttggaaga tatttttgtt gaattaaccg gacgcaagtt ggactaatct 12841 tatcctctca aagttcgcag cacagcttct atcctgacta cgctgtcctg cttcccttgc 12901 ttttggtaca aatcccgagc ttgttgaaat gctgcgatcg catcttttga tttgccctgc 12961 ttcattaaaa tgttaccccg caactcataa acttggggat ttttgggatc aaggctaaca 13021 gcttgttcat atgcccattt cacattctcc ttctctccca aacgcaacag gactgtagcc 13081 aaccctatat aggcgttgac gttactccgg ttactttgaa ttgcacgccg gtaggcttcc 13141 tttgctgcct tattgtctcc taaattgcca ctgacataac ccaaagcata ttgataatcg 13201 ctattgttcg gatttaaggc aacagcacga cggtaagctt gtagtgatga gggaaaatta 13261 ccttgtaggg cgtagaggta gccaatacca gaataaatcg tggcgttttt tggctctaga 13321 gcggatgctc tttgataaag agcgatcgcc ccattataat caccactatc taccagtttg 13381 cgcccttctt gtaaaagtcc ttttatttct tcactacttt gttgtggtaa tgcatcctga 13441 gcttgagcaa ccaaaggtat tgttgcgaga aagctcccta ttaaaagaac acttactatc 13501 aatgatgttc gtttgtacac agtcaattac ctaaaacttc ctgcaacttt ttttccacaa 13561 tctccgtggc tgattgtcta ttaaaacaga tttatttgat tttgcaagat ctcttacttt 13621 atatttacaa aatatttacg ataattttct tttttttttg agattacaat aaattaacag 13681 tttttttcat aaaatctgct ttccaagtat ctgctttgac gtttaaaaat ccaagcagga 13741 atatgagccg atagttacct ttttgaggaa aatctatgat agtaaccacg actgatgtca 13801 ttcaaggagc cgtcattcag tcatatttgg gcatcgtcac ggcggaagtt gtctacggta 13861 gtaatttctt acgagatttt tttgctagca tccgggatgt tattggcgga cgcacaggta 13921 gctacgaacg cctttttgag gaaggacaac gtaaggcttt agaagaatta gagcgacggg 13981 cattacgtct aggggcagat gcagttgttg gtattgaagt ggatactggc acaatcaatg 14041 ttgaccaatc aggtgttctc atgcttatta ctgctacagg tactgcggtg aaactgcgtc 14101 agcagttata agttatcagt catcagttgt tgattatcat ctgttcactc ataactgatg 14161 gctattaact gctcactgat aactcttttg aacttgatga aagacacaaa tttatcactc 14221 tcattaaatg actagagata gaatgaaaac ataatgaaat taagctgatt gctcttgtga 14281 aattttctta attttttcat ctagtttttc attttgtcaa cgtgtgttcg cttaataaaa 
14341 cttaaataaa tattcttgac aatgataata attgtatgac caaagataga gttatgaatt 14401 atttctgctg atagatatca aaaactctca taggcaattt aattataatc attagcataa 14461 atagctagaa aaaagtttac tggagtacta agggataaat tgctatgtca tacacaaatc 14521 gagtcggtga tgaggttatt actgagcctg cagttgtggg tcgagttgct gattatcatg 14581 atcgcgttcg ttggggacct atactttctg gtttattaat tgctttagca actcaattag 14641 ttttaagttc tatttttgct gcaataggag ctggtagcat tgaagcttca ggtagaccga 14701 gaacaatcgc ttcagatgtc acaggtaatg ttggtatttg gtctactatt ggtttattga 14761 tttcgctatt tactggtggt tgggtaatgg ctcgtgcttg tggtccgatg aaccgcaaca 14821 cagctcttct caacggtgca atactttggg caacaacatt agcagttggc tcttggttac 14881 tagcaagcgg agtatccggt gcttttggtg ttgctgcttc taatgctggt gctgtggtta 14941 accaggtgca acagcaaggt ggtgtgaata taccgcaaaa cacacctaac gtcagtgctc 15001 aacaggcaag agaaattgca gcaaatgtac gttccggttt atggtggttt gtctttggtt 15061 ccttgttagg tttactcgct tcaatgattg gggctgctac tggaactcgc agtccacgta 15121 ctaataatta cgtctcataa cttcttttta aaaacctgtt tagacaaatt agtgtctcag 15181 ttttctttac acagtacctt tattaaggtg cttttttttg ttaaataagt agaagagcat 15241 aaatcaatac ggttcagttg aggattgttg ctaacaattc cacacatgta gagacgttgc 15301 atgcaacgtc tctacattgc catcctgata acggttttgg tcttatcacc aactgtattg 15361 cagcataaat aattcacact atgtccgcaa ttaagaatat ttgtaggttg ggtagaacga 15421 agtgaaaccc aacaaagcgt cccaaatgtt gggtttcgtt cctcaaccca acctacacat 15481 ttttgttttt tgggcaaaac ccgagcagta ttgcgaaccg tattgcctac acccctgtgc 15541 atcttaaaag tttaaacatc tagtttattc aatcccattc cgcactgcgt acagaacttg 15601 caattagtag aattatgatg accgcaactg gtacatgaaa tataatctgg aatcaaaaca 15661 gaacttgcaa ctgtagtttc ttgaataggt gctagattgc ttggaacaga gtaattggca 15721 aagtgctggc gtaacaagtc atgcataaac cgatagccgc cccctactcg ttgtaggaat 15781 aaacgattgg tgcagtagtc aagaaactta gcataattcc aaggagcgta accattaccc 15841 cagaaaatta cgcgtaaaac aaaatgtttg atagctggtg ttccactttt aggtattgca 15901 ataagcagtc caataattaa tccagacact agactagaac ttaagtctag cttacctccg 15961 tatattttta ttcctaataa aagtattaat gtagccaata 
aacaagtgac cgttgatata 16021 ataacagtgt taataactga ttggcgaatt cctttattag gaatagtttt gttttcaatt 16081 gttcgcccat caattccaaa aatcagccct gaaatcagcc ctaaaatcag ccctaaaatc 16141 agccctgaaa tcagcctata aatcagccct gccagcccat aaatcagccc aaaaatcagc 16201 cctgaaatca gccctgaaat cagccctgaa atcagccctg aaatcagccc aaaaatcagc 16261 ccataaatca gcccataaat cagcccataa atcagcccca ttgaagactt ttttagagaa 16321 aactttataa tctcaactgt ttgaatttca tctcctatca gcccaaaaag cagcccataa 16381 atcagcccaa aaatcagccc tgaaatcagc ccaaaaatca gcccaaaaat ccgcccattc 16441 agccctggaa tcagcacatt cagccctcct gtcagccatc ctatcagcca tcctatcagc 16501 ccaaaaagca gcccataaat cagccctgaa atcatgaatt ttatagttgt taacctaaaa 16561 cttctgagat gattaattaa tttaatcttt tctattcgat ctccaatcag cccaaaaatc 16621 agccctgaaa tcagccctcc tataagtccc catacaatca aattataaac aaatttttga 16681 attttctttt tcaaccaact cggttgaatt ccttcaataa aaaactcagt tgcactttca 16741 tccttcaacc tttgcgccaa ccagtttaac cagcgtttag tcttttctgg atcaggttgt 16801 ttacctgtat acttacgttt cagcattcta ctgatgtaag tttcaaatag atagcttagg 16861 cgttcctgag aagactttaa ccgttgccac gtttctattg atagttcttg agccgaaagt 16921 acaataatat ttaacagaag tggtgtttta gctaatttgc ttaactcttg atcctgatta 16981 atactgtcac acaactgtaa attatctgtg ttttgtaaat attggttgac ttgctcttga 17041 gtaaaagggc atagttccaa agaattattt agttgaagta aagctttata acgttgatac 17101 tcttgtatcc gactgcaaat aactaaagga ttgctccatc ccgcgtttat aaaattatta 17161 attttgacaa cgcacttttc ttgacgttct gccgctagtt catccaatcc atctagaagg 17221 ggtagaattt cttgattttc tactaactgt ttaccaatat ccttccgtac cccatattta 17281 tgtttaagct ggtctactaa ccaatcttta atactgttgc tatcattttt ccaggaagat 17341 aaagcaaata acactggaat tggatgtgct gagtcatttt tagcacgctt gactagttcc 17401 tcagctaact taagaagcat ggtagttttt ccaacaccag ggttccccag aatcaataat 17461 ctacctgcga tatcaggttg atcatagatt gcgattattt cggtatttgt taaatgaatt 17521 tttggtgtgg aaccaacctt tatctcactt gcccaaggta attcaatctc agagggattt 17581 tggtctgtat cttggacaat ataaatccga ttatggagag aggtacttat ccggctttca 17641 acttcagtgc tgacttgtct 
taacagtatt ttccgtatat gatttctggt tttgggattc 17701 agtaattcat tactagctct ttgtccaaag gaaaaattca cagtattgcg gatatcgccg 17761 cctatacgtc cagcattaac gttctgggca tcgataagcc caccaccaaa ttgggtattg 17821 cgtaagtcat tattagagac ttttccaggg tctttggggt tttctgactc cggcatggct 17881 tgtttgggcg ttgttggttg tttgtattat atatgtctcc acctctatct gctttgattc 17941 cagatgaagg agaatattag tgaagtgact ttaattttgc gagctacttt tgtctgatta 18001 actgaattaa gtaggtggaa aaagttgtgt gtgtccgcag cgagtgtaac gtagtgaagc 18061 aatcgcaaga tatgagattg cttcactacg ctgtcgctcc gtatgcgcca agggcgcacg 18121 ctacgctaac gcaatgacaa ttcatcacct ggatttgata taacttacac atttgggatg 18181 ctccctgcaa aaaaacagta gtgggagcat cttgctccct agtttgtgac ctcagcgggc // LOCUS NODE_1816_length_18124_cov_4.55747418124 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18124) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18124) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18124 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 464..2476 /locus_tag="DP116_15780" CDS 464..2476 /locus_tag="DP116_15780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_15780" /translation="MNKPLSPSKRLDEVDLHKQYSVKLMPHQEEPVYQLVNKINKIIA NSSTPTRMLQDIAKVIGVAFQVDGCSLVTVSGEVSSEAITASWCAQENLELSGADEVF SMEQHLDVAVIECASERFTIEDIPTIQKCLAIGRQSLPPTIKAVLAIPTRFSGKNNGV ISLIKSQPYIWGESEKQLLKAVESCCAIAFSQVAQAQQIADQNQYLRTCVQHQSLIKR LTIISRTNLEINQMLRLALASTAEALEADRGLLILLKYTDPLFRNRRQQQIPQAKASV VAQWNGTTETPLINQLDISQISFSLKDCGLCQRAFVNSGNPLIINDDTDLKDTLTVNP VFAIEALPVVLLMPLEIQGKILGFLVLQQAVARDWQATELNIVQMVCAQLSNAIIQTQ TLRQVQTLVDERTAQLQRSLEVQARLYEVKRQQTEQLQKLNDLKDEFLSNISDRLRHP LTRISVAIRNLRQMGQLNERQTKYAHMIEQDCTDEIHLINDLLKLQELATHNERPQLE TTNLNARIRDLSATFDAKLADKGISLSLDLPDSPLTVQTERESFDRILQELLTNASKY SEHDTVVHLQVTHQVNQELDQVMIKVSNIGRGISEEEASYIFDRFRRGKGRWTPGAGL GLALVKSLVQHLNGTIAVESTPIEDSSFSTICFTLTLPQFSDKNDS" gene 2486..3070 /locus_tag="DP116_15785" CDS 2486..3070 /locus_tag="DP116_15785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195714.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyclase" /protein_id="PRJNA477356:DP116_15785" /translation="MTEEHNIQEELDLSAIGDDTDLERNGSADADDLQAVEVQIEKIA ERQRQITASLQISQPVEQVWKVLTDYEALADFIPNLTKSRLLEHPQGGIRLEQVGAQR FLRLNFCARVVLDLEEYFPKEINFRMVEGDFKDFSGSWLLEPYFFSEHMGTYLCYTVK IWPKRSMPVGIIERRISDDMRLNLLAIRQRVLQI" gene complement(3133..3205) /locus_tag="DP116_15790" tRNA complement(3133..3205) /locus_tag="DP116_15790" /product="tRNA-Glu" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(3169..3171),aa:Glu,seq:ttc) gene 3594..5516 /locus_tag="DP116_15795" CDS 3594..5516 /locus_tag="DP116_15795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876530.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sodium:proton antiporter" /protein_id="PRJNA477356:DP116_15795" /translation="MEASFEITLQIVIAVFAGISAQVLAASLRVPSIVFLLVFGILLG SDGIGLLHPAMLGTGLEVIVALATAIILFEGGLSLDLEELSKVSTSLQLLVTLGTMIT LVGGSMAVHWLGEFPWPIAFLYASLVVVTGPTVVSPLLKQINVDRQVATLLEGEGVLI DPVGAILAVVVLNTVLSNHTDFITAMSSLTLRLGVGGVIGAAGSWLMSLISKRANFLS FELKNLVVLAGLWGLFALSQMIRSESGLMTVVVAGVVFGASSVPEERLLRRFKGQLTI LSVSVLFILLAADLSIASVFALGWGGVFTVLVLMFVVRPINILFCTWNSEFNWRQKLF LSWVAPRGIVSASVASLFAILLTQRGINGGEAIKALVFLTIIMTVFCQGLTAGWVAKC LEITSKDAVGAVIVGCNPLSLLIARLFQEWGEPVVMIDTDAQRSEQALAQNLRVISSS ALDTGVLEEAGLGSMGTFLAMTSNGEVNFVLAQRAAEEFNPPRVLAVFPRDPQATTST NENKVNQALIPDLPIKTWNEYVNDGQVKLGTTTLNESSFDLQQEHLKALIRTGELIPL LVEREQHLQVMPAAQEWEVGDRIIYLLHDPRPQLLKRLSGASQSTRLALEKLPKVEEI PLAKLSQLSTSDARTP" gene complement(5523..6179) /locus_tag="DP116_15800" CDS complement(5523..6179) /locus_tag="DP116_15800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome bc complex cytochrome b subunit" /protein_id="PRJNA477356:DP116_15800" /translation="MHNTQSDVVLRRITTILSVVIITLTLVGATTGILLSFYYEPAAG RAYQSLKIIDTEVPYGWLFHKAHQIAGNAVVVIALIQIVLMFVSRQFSKSWLTAWISE IFFTLSAIGLGWTAMILSWDQEGFWRFNIELGTIEAIPFVGSILRDILTGGGAISTVT VQHLYTIHSYLISVTAIVLSVVHLLSVLWLELQLKKMYAEGTLPQTDKIQQQPANAQG " gene complement(6287..6982) /locus_tag="DP116_15805" CDS complement(6287..6982) /locus_tag="DP116_15805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipase" /protein_id="PRJNA477356:DP116_15805" /translation="MPLPTVILPGYLESGVAYRSLEQSLQQLGFPAVTVALRRRDWIP TLGGRSVTPILQQLDATVKQILQQCNTSQINLIGHSAGGWLSRIYLGEKLYSGRGEVT SSVWQAHPVVATLITLGTPHVSQERWTRWNLDFVNQHYPGAFYKNVRYVCVAGKTIFG QRRRGSWLAYSSYQLTCGNGNTWGDGITPIEAAHLEGAQNIVIEGVRHSPRSSPMWYG SPEPLKAWVQYLV" gene 7302..9281 /locus_tag="DP116_15810" CDS 7302..9281 /locus_tag="DP116_15810" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15810" /translation="MSYVSLLKNIPEFLSQPTGIAVLASLGIHSAIAFLLPIVPTNSN KPKQEPSLKSSVGLVELSQSEKNRLPQPGIPQLPLQPLQPPAPALLPQVPSPKFANQS IPSLPPLPPSKSSTALILPPLPKTNNLAVASLPKSQSLPILSKKDLQPASLSAKVKPL PRNVEPDPSLSAKVKRLPGYAEPDPSLSAKVRSLPRYPEKVELGEAKPLASSKIPYNV PPIQAANIAEEQELLNTSAPVSPDQMPPSGDVSTATQTTAQGSEVSQGANNQQLVTPV VQPPQVGDNSIALGRQNLPQLQQGLNVQPPELPPLATGRSLSTPSIASTPSTPSTPKT FAQRFTEVKQQYPNLETRQPIAETVDGKTGQQGNVEGDLVINREGQVESIDFHNNSVP SELKTSVRQYFREYFQKNPVQANGKPKFYPFNISFKPNSDISKTPASELSTSSRVNQT QPVTIAQRSLIQRLRSVPVTSLPSKEPQKIEMTQGLRPDRVNLQPSKEPQKVNLVQRS GSSSMKQSQSQTESAPQANLEQQTSRRRRVIVMQNTTPAPQANLEQQTSQQRVMIRQS ASSSQTDKEQQTSQQRVVVRQSASSSQTDKEQQTSPQRVVVKQTTSPQQANLEQTSQQ EVNSNQSSASGQNSQKLLQQLRQIRDTRQGNNQEK" gene complement(9371..9736) /locus_tag="DP116_15815" CDS complement(9371..9736) /locus_tag="DP116_15815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome C" /protein_id="PRJNA477356:DP116_15815" /translation="MALLAIALWAAPCGSIALFQFTFIDSALAAEISNGSKIFNANCS SCHIGGGNILISEKTLKKEALSKYLEDYDADSIQAIIYQVQNGKNAMPAFKNKLSEQE IIEVAAYIFQKAELGWQDS" gene complement(10115..10534) /locus_tag="DP116_15820" CDS complement(10115..10534) /locus_tag="DP116_15820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="plastocyanin" /protein_id="PRJNA477356:DP116_15820" /translation="MKIVASSLRRFGLALLTIFFFVSSFAVFAPSASAETYTVKLGSD KGMLVFQPAKLTIKPGDTIEWVNNKVPPHNVVFDPAKNPNQDKALAKDLSHKKLLMSP GQKTTTTFAADTPAGTYTFYCEPHRGAGMIGKVTVEG" gene complement(11294..11785) /locus_tag="DP116_15825" CDS complement(11294..11785) /locus_tag="DP116_15825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c-550" /protein_id="PRJNA477356:DP116_15825" /translation="MFRKLFGIFAATILLTFQFVVGSACAVQLDPATRTVTLNESGET TVVSLKQLKEGKRLFNNTCSQCHPGGITKTNQNIGLEPETLALATPNRNNIEGLVDYL KNPTTYDGEEEISELHPSTKSADIFTEMRNLTDDDLEAIAGYILVQPKVDPIRWGGGK IYY" gene complement(11930..12121) /locus_tag="DP116_15830" /pseudo CDS complement(11930..12121) /locus_tag="DP116_15830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315090.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(12280..12489) /locus_tag="DP116_15835" CDS complement(12280..12489) /locus_tag="DP116_15835" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15835" /translation="MTEERQRKQGQKAEGIKNFLEKVSPLFFALWFQAPKFIYGKGNK IYSEMHGMRNKASLLQAPNLWYGDS" gene complement(12490..13278) /locus_tag="DP116_15840" CDS complement(12490..13278) /locus_tag="DP116_15840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874578.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_15840" /translation="MINDSPILKVEGLTVYQDSYLAVRDVSFELLSGTNTAIVGPNGA GKSTLVQAVLDLIPRSAGKIEILGRPLARLGNLRCQLGYMPQNFIFDRSFPICVSELV GLGWVKQAKRKPSLATLTTGSFGEARQEKSAAVAEALRRTDAYHLRNQAIGTLSGGQL KRVLLAYCLVMARKLLVLDEAFAGVDVQGTADFYTLLNELKQEEGWTVLQVSHDIDMV SRYCDRVICLNQTIVCTGVPEVALSPQNLLTTYGPAFSRYQHHH" gene complement(13360..14331) /locus_tag="DP116_15845" CDS complement(13360..14331) /locus_tag="DP116_15845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455304.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_15845" /translation="MTPVIALLMLSLSAGCTESNTNQATTPEQSPPAQETASTPSPQS GKTKVVTTFLPMYMFTKAVTGNAADVEILVPPGTEVHEYQSTPNNVKAIATANVLVKN GLGLEEFLDNTIKNAQNPKLSVINASTGIQPLNEISPVEKTGKKEQEHDHEHAEGNPH VWLDPVLAKQQVANIRDGLIAADPANKVSYEANAAAYIKQLDSLNSEFQQTLQKNPNC TFITFHDAFPYLAKRYNLKQVAVVEIPEDQLAPADVKNAVNAVKKYKVKALFSEPGVD NKLLSSLSKDLNLTLRSLDSLETGNTDPQHYFKAMRDNLQTLETACK" gene 14697..15779 /locus_tag="DP116_15850" /pseudo CDS 14697..15779 /locus_tag="DP116_15850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115794.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(16104..17840) /locus_tag="DP116_15855" CDS complement(16104..17840) /locus_tag="DP116_15855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319296.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15855" /translation="MQTFDFTTLTAVCSDIRAHWLPSRIEQVYQRNSYTIAVALRTLK QRGWLEISWHPQAARLHIGDPPPRTPDTFTFSQQLLHQLGGLALVGLETIAPWERVLD LQFARRPGESALYHLYVEIMGKYSNVILTDAYNSIITVAHQVKQQQSSVRPILTGQPY ETPPTLTAATPSLSESQERWQERVSLVPGSIKRQLLKTHRGVSPSLIQSMLLVSDIDP ESSTDTLKVDDWKRLFHHWQEWLQKIESEKFEPGWTQEGYTVLGWGVHKRAKDIQELL NRYYTDQGNQQTFSQLRHQLSQKLNNLLEKLRVKAATFEQRLQQSEKADEYRQKADLL MAHLHEWQVGMTQIIIPDFETSEPVKIALQPDKNAVQNAQNLYKQHQKLKRARIAVEP LLAEVNAEIEYLEQVEAAISHIENYNTPDDLQALEEIREELIQQRYLEDSEYRRRSPT EAASINFYNYRTPSGFEVLIGRNNRQNDQLSFRVANDYDLWFHAQEIPGSHVLLRLEP GAVPEEADLQYVANLTAYYSRGRQSDQVPVVYTQPKYVYKPKGAKPGIAIYKQERIIW GQPQSLVLSAKS" BASE COUNT 5219 a 3933 c 3805 g 5167 t ORIGIN 1 aggtatttgg ttatttgtaa aagtaagcca cagagctttt ctgaaacttc gtggtgtaac 61 agattttctt ctcgctaaac gagcggggtt ccaacccatt caccaaatgc ataatgataa 121 ataaaaatac cttatttcca gatacgtttc agtgttaacg cggatacatt taacagagct 181 ttagcctcaa aaccagagaa attacggttt gtatatggag gctggcagga tattctaggt 241 ctaggttggc tagttattga agacgaaaat acccgaaatt ctttgacgac agcgtctgga 301 taaatttttt aatatttttt acacaaagag aaaatttctc acaaaggatt tgaccatctc 361 caacagaaag gggctataac aggtagcacg agctttgggt gaatcatgtt ccgagaggtt 421 gtttacgaag aaaatggggt caatattcaa gagtgtggca agaatgaaca aacctttatc 481 accgtccaag cgtttagatg aagtagatct acacaaacaa tactcggtta agctgatgcc 541 gcatcaggaa gaacctgttt accagttagt caataaaatt aataaaataa tcgccaatag 601 ctcaactccg acaaggatgc tgcaagatat tgccaaagtc ataggagttg cctttcaagt 661 cgatggttgt agcttagtga cagtatcagg tgaagtatcg agcgaggcaa taactgctag 721 ctggtgtgcc caagagaact tagagttgtc tggtgcagat gaagtgtttt 
caatggaaca 781 gcacttagac gtcgcggtga tcgaatgtgc ttctgaaaga ttcaccattg aggatattcc 841 aacaattcaa aaatgtttgg ctattggacg tcaatctttg ccaccgacaa tcaaggcggt 901 tttagcaata cccactcgat ttagtggcaa aaataacggc gtaattagtc tcatcaaatc 961 gcagccatac atttgggggg aatctgaaaa acagctatta aaagctgtcg agtcgtgttg 1021 cgcgatcgca ttttctcaag tcgcacaagc acagcagatt gctgatcaaa atcagtacct 1081 gcgaacttgt gtacagcatc aaagcttaat caagcgacta actattataa gtcgcaccaa 1141 tctggagatc aatcaaatgc ttcggctggc gctcgcttcc acagctgagg ctctcgaagc 1201 ggatcgtggt ctgcttatac tgctcaagta taccgatcca ttgtttagaa atcgacgaca 1261 acaacaaatt ccccaagcaa aagcctctgt tgtggcccag tggaatggaa caacagaaac 1321 tcccctaata aatcagttag atatttcaca gatttctttt tctctgaagg actgtggtct 1381 atgccagcgt gccttcgtga actctggaaa tccattaatc attaatgacg atacagattt 1441 aaaagatact ttaacagtta acccagtatt tgcaatagaa gcgctgcctg tagtgctatt 1501 aatgccatta gaaattcaag gtaaaatatt aggattttta gtgttacagc aagcagttgc 1561 tcgcgattgg caagcaactg aattaaatat agtacaaatg gtctgtgctc aactcagcaa 1621 cgccataatt caaacacaga cactacgcca agtacaaaca ttggtagatg aacgtacagc 1681 acaactgcaa cgtagtttgg aagtccaagc aagactttac gaagtcaagc gtcaacaaac 1741 tgaacagtta caaaagctta acgatttgaa ggatgaattt ttaagcaata ttagcgatcg 1801 cttgcgccat ccactgacaa ggattagtgt ggctatccgt aacttacgtc aaatggggca 1861 attaaacgag cgtcagacca aatacgccca tatgatcgag caagattgca cggatgagat 1921 tcatttgatt aatgacttat taaaactcca agaattagcg actcataatg aacgtccgca 1981 attagaaacg accaatttaa atgcaagaat tcgtgattta tcagccactt ttgacgctaa 2041 actagcagat aagggcataa gcctttccct agatttgcca gattcgccac ttacagtgca 2101 aaccgaaaga gagagttttg accgtattct gcaggaattg ttaacgaatg ctagcaaata 2161 ctcagaacat gacactgttg tccatttgca agtcactcat caagtcaatc aagagcttga 2221 ccaagttatg attaaagtga gcaacatcgg acgtggcatc tcagaagaag aagctagtta 2281 catatttgac aggtttcgtc gggggaaagg gagatggact ccaggggcag gcttgggact 2341 tgctctggtc aagtctttag ttcaacatct caatgggaca attgcagtag aaagtacgcc 2401 catagaggat tctagcttca gtacaatctg ctttaccctg acactgcccc aattttctga 2461 
taaaaacgac tcataacttg tgaaagtgac tgaagaacac aacatacaag aggaactgga 2521 tttatctgct atcggcgatg acaccgatct agaacgaaat ggatctgctg atgcagatga 2581 tttacaagct gtagaagtcc aaattgagaa aatagcagaa cgacagcgac aaatcacggc 2641 ttcgcttcaa atttcccaac cagtagaaca agtctggaaa gtactcacag attatgaagc 2701 tttagctgac ttcatcccca atctcacaaa aagtcgcctc cttgagcatc ctcaaggagg 2761 tattcgttta gaacaagtag gcgctcagcg ctttctacgc cttaactttt gtgcgcgtgt 2821 tgttctggac ttagaagaat atttccccaa ggaaattaat ttccgtatgg tagaaggaga 2881 tttcaaagac ttctctggta gctggctatt agagccttac ttctttagtg aacatatggg 2941 aacatacctt tgctacaccg tgaaaatttg gccgaaacgt agtatgccag tcggaatcat 3001 cgagcgtcgt atcagcgacg atatgcggtt gaatcttctt gcaattcgcc aacgggtgtt 3061 acaaatctag gtaagatgtt accaagagag actaggatta tacagttccc acaatctctc 3121 ttctttattt aataccccca ggggaattcg aatccccgtt acctccgtga aagggaggtg 3181 tcctaggcct ctagacgatg ggggcgctag gtcaacactt tgttaaagtc taactagttt 3241 tcttactgat gtcaacagct tttcaaaaaa actttctcgg aaaatgaaag atatggacat 3301 tttcgctgtg tctccgcaaa aatacttaat aataacaaga atggagaact cagagtatag 3361 aataactcca gactctatag tagaagttgg gaacaaggtt gcttcgcctg ataatcacag 3421 gcgctcatca ctcctagttc acaaaaaata aaagctaaag tattattagc aacttagtat 3481 gaaacagaaa cttgatttga tcaaaaactc tgggtgtgat gttgcacgca gtagaattaa 3541 gatttactat tctttacaaa agattttgaa atatctcggt caaaatctac aacatggaag 3601 catcttttga aatcaccctg caaattgtga ttgccgtctt tgcaggtatt agcgctcagg 3661 tgctggctgc atcccttcgg gtacccagta tcgtcttttt gcttgtgttt ggcatcctac 3721 ttggctccga tggtattggg ctattgcacc cagctatgct aggcactggg ctggaagtta 3781 tcgtcgcttt ggcaacagca attattttgt ttgaaggcgg attgagtctg gatctcgaag 3841 agttaagcaa agtttcgact agcctgcaat tgcttgtcac cttgggaacc atgatcacgc 3901 tagttggagg tagtatggct gtgcactggc tgggtgaatt cccttggcca attgcttttc 3961 tctacgcttc cttagttgtg gtgacaggac caactgttgt cagtcctctg ctcaaacaaa 4021 tcaatgtgga tcggcaggtt gcaacgcttt tggaagggga aggcgttctt attgacccag 4081 taggagctat cctcgccgtc gtggtgctca acacggtatt aagcaaccat actgacttca 4141 ttacggcaat 
gagcagtctc acgctgcgct tgggtgttgg tggagttatt ggtgcagcag 4201 gtagctggtt gatgtcctta atttccaaac gtgcaaattt tctctcattt gagctgaaaa 4261 acctggttgt tttagcagga ttatggggtt tatttgccct atcgcagatg attcgcagcg 4321 agtcgggatt aatgacagtt gtcgttgcag gagtcgtttt tggagcttcc tcagtgccag 4381 aagaacgatt gttgcggcgt tttaaaggtc aactgactat tcttagcgtt tcggtgttgt 4441 tcattttgct cgctgctgac ttatctattg cgagtgtgtt tgctttgggt tggggtggtg 4501 tgttcacggt gctggtcttg atgtttgtcg ttcgcccgat aaatatcctc ttttgtacct 4561 ggaacagtga gtttaactgg cgacagaaat tgtttttaag ctgggttgct ccacgcggaa 4621 tagtttctgc ctccgttgct tctttgtttg caattttact aactcagcgc gggattaacg 4681 gtggtgaagc gatcaaagct ttggtgtttc tcacaattat catgactgta ttctgtcaag 4741 ggttaacggc tggttgggtt gccaagtgtc tggaaatcac ctcaaaagac gccgtagggg 4801 cggtgattgt tggttgtaac cccttgagtt tattaattgc tcggttgttc caagaatggg 4861 gagaaccagt ggtgatgatt gacactgacg cacaacgtag cgaacaagcc ctagcacaaa 4921 atctgagagt tatctccagc agcgccttgg atactggtgt attggaagaa gcaggacttg 4981 gctcaatggg aactttctta gcaatgacca gtaatggtga ggttaatttt gtcttagcac 5041 aacgtgcggc agaagagttt aacccgccgc gtgtcttggc tgttttccct cgcgatccac 5101 aagcaacgac ttctactaat gagaataagg tcaaccaggc tttgattcca gatttgccaa 5161 tcaagacctg gaatgagtat gtgaatgacg gacaggtcaa gttggggaca acgacactca 5221 atgaatctag ttttgacctt caacaagagc atttgaaggc attaattaga actggcgagt 5281 tgataccgct attggtagaa cgagaacaac atcttcaagt tatgcccgca gcgcaagagt 5341 gggaagttgg cgatcgcatc atctacctgt tacacgatcc cagaccacaa ttattaaaac 5401 gcttgtcggg tgctagccag tccactcgcc tcgcccttga aaaattaccg aaggttgaag 5461 aaataccgtt ggcaaaactc tctcaacttt ccactagcga tgctcgcaca ccttaaaatt 5521 tcttatcctt gagcattggc tggttgctgc tgtatcttat cagtctgtgg gagcgttcct 5581 tctgcataca tttttttcaa ttgcaattct agccatagca cgctcaataa atgcacaaca 5641 gatagaacta tggcagtaac tgagatcaga taactgtgta tcgtgtaaag atgctgcaca 5701 gtaacggtac taatggctcc accaccagtc aaaatatctc gcaatataga accaacaaaa 5761 ggaatagctt ctatggttcc cagttcgatg ttaaaacgcc aaaatccttc ttgatcccaa 5821 gagaggatca tcgctgtcca 
acccaatcca atcgcactca gggtaaagaa aatttcactg 5881 atccaagcag tcagccaact cttgctaaat tgtctgctga caaacatcag cacaatttgg 5941 atcagagcaa tgacaacaac cgcgttacca gctatctggt gcgctttgtg gaacaaccaa 6001 ccgtatggaa cttctgtgtc tatgattttc aacgactggt aagctcgacc tgctgctggt 6061 tcatagtaaa acgacaacaa aattccagtc gtagcaccaa ccagggtcag agttatgatg 6121 actaccgata atattgttgt gattcgccgc agaaccacat ctgactgtgt gttgtgcatg 6181 gcttagcact ccttttctta aaaattcctt agttttatta cgagtgtaac taagaattgt 6241 taagaatagt tagtttctca taagaactaa aaataatgca ctgtctttat actaagtatt 6301 gaacccaagc ttttaatggt tctggtgaac cgtaccacat tggagagctt ctgggggaat 6361 gcctgactcc ttctataaca atgttttggg ctccctcaag atgagccgct tcaatgggtg 6421 taattccatc accccacgtg ttaccattac cgcaagttaa ctgatagcta ctgtaagcta 6481 accaactacc ccgccgtctt tgcccaaaga tcgtttttcc agcaacacaa acgtagcgaa 6541 catttttgta aaatgctccg ggataatgtt gattaacaaa atccaaattc cagcgtgtcc 6601 agcgttcttg gctgacatga ggtgtaccca atgttatcag agttgcaacc acaggatgag 6661 cttgccatac agatgatgtg acttcaccac gtcctgagta aagcttttct cccaagtaga 6721 tacgagaaag ccaacctccg gctgagtgac caatcaagtt aatttgagac gtattgcatt 6781 gctgtaatat ctgcttgacc gtagcatcga gttgctgcaa aataggcgtc acagatcttc 6841 cgccaagagt gggtatccag tcacgccgtc gcagtgctac agtaacagca ggaaaaccca 6901 attgctgcaa agattgttct agtgagcggt aagcgactcc gctttctagg tatcccggca 6961 aaataactgt cggtaaaggc atttttgatt ttcgatagag atgagagatg aaaaagcatc 7021 cggaattgga agcccaacat cagcgagtgc aacttttgag tagtatatcc gtcatttttg 7081 aaaaaaataa ccagtcaaaa aatcggtgtt gttcctgttt tcaattccca atagaaagct 7141 attatttgaa gaggggcgct tccgttctac tactatatgt tacctcaaag ataagaaaag 7201 taatagattg ctagataact gcaactgagt cgagtgaacg ataatagcaa agtaatagat 7261 tgctagataa ctgcaactga gtcgagtgaa cgataacagc tatgtcctac gtttccctcc 7321 tgaaaaatat acctgaattc ttaagccagc cgactgggat agcagtccta gcatctcttg 7381 gcattcacag tgctatcgcg tttcttctgc cgatagtgcc aacgaattct aataaaccta 7441 agcaagaacc atcattaaaa agcagtgttg gacttgttga attaagtcaa tctgaaaaaa 7501 accgtctgcc acaacctggc ataccacaac 
tccccttaca acccttacaa ccgccagcgc 7561 ccgcattact tccacaagtt ccttcgccaa agtttgctaa tcaatcaatt ccatcattac 7621 cgcctctacc accatcaaag tcttctactg ctctgatatt acccccgctg cctaaaacga 7681 acaacttagc cgttgcttcc ttgcctaaaa gtcagtcttt gccaattctt tccaaaaaag 7741 atttgcaacc tgcatcctta agtgcgaaag tcaaaccatt gcctcgtaat gtagaacctg 7801 acccatcgtt aagtgcaaaa gtcaaacgat tgcctggtta tgcagaacct gacccatcgt 7861 taagtgcgaa agtcagatca ttgcctcgtt atccagaaaa agtagaatta ggagaagcta 7921 agccactggc atcttctaag attccctaca atgtgcctcc aatacaagca gctaatatag 7981 ctgaggaaca ggaacttctt aacacctctg ctcctgtttc ccctgaccag atgcctccct 8041 ctggtgatgt aagcaccgct acacaaacca ctgctcaagg gagcgaggta tcacaaggtg 8101 cgaataacca acagctcgta acccctgttg tacaacctcc acaagttggg gataatagca 8161 ttgctttagg aaggcaaaac ctgccgcaat tgcagcaagg tttaaatgtt cagcctcctg 8221 agttaccccc actagcaact gggcggtcgt tgtcaacacc gtcaatagcc tcaacaccgt 8281 caacaccgtc aacacctaaa acttttgcac aacgtttcac tgaagttaaa cagcaatatc 8341 ctaacttaga aacgagacag cccatagctg aaacagtcga tggcaaaaca ggacaacaag 8401 gcaatgttga aggtgatttg gtaatcaatc gtgaaggtca agtagagtcg atagatttcc 8461 acaataattc tgttccatcg gagttgaaaa catctgttag gcaatatttc agagaatatt 8521 tccaaaaaaa tcctgtccag gcaaatggca agccaaagtt ttacccattt aacatttcgt 8581 tcaagcctaa cagcgatatt tctaaaactc ccgcatcaga actgtcaacc tcatctcgag 8641 tcaatcaaac acaaccagta actatagcac aacgcagctt aattcaacgt ttgcgctcag 8701 ttccagtcac ctcactacca tccaaagaac cacaaaaaat tgagatgact cagggtttgc 8761 gtcccgacag ggtcaattta caaccatcca aagaaccgca aaaagttaac ctggttcagc 8821 gttctggttc atcatcgatg aagcaatccc aaagccaaac tgagtctgca ccacaagcaa 8881 atcttgaaca acaaacatca cgacggcgac gggtgattgt gatgcagaat acgacgcctg 8941 caccacaagc aaatcttgaa caacaaacat cacagcaacg ggtgatgata agacagtctg 9001 catcttcttc acaaactgat aaagaacaac aaacatcaca gcaacgggtt gttgtcaggc 9061 agtctgcatc ttcttcacaa actgataaag aacaacaaac atcaccgcaa cgggttgttg 9121 tcaagcaaac tacgtcacca caacaagcga atcttgaaca aacttctcaa caagaagtca 9181 atagcaatca atcttctgct tctggtcaaa acagccaaaa 
attactacaa cagttgcgtc 9241 aaattagaga tacaagacaa ggcaataatc aggaaaaata aaaacagtca tcactcaact 9301 aaagttcctt atcttgcttg tcatcaataa aaaaccctcc ggaacaccag agggttttct 9361 ttttctattt ttagctatct tgccagccta gttctgcttt ttggaaaatg taagcagcta 9421 cctctataat ctcctgttca cttaacttat tcttgaatgc gggcatagca ttcttaccat 9481 tttgcacctg gtaaataatc gcttgaattg agtctgcatc ataatcctcc aggtactttg 9541 ataaagcttc ctttttcaaa gttttttcgc taataaggat gttaccacca cctatgtgac 9601 aagaagagca gttagcgtta aagattttgc taccgttgga tatttcggct gcaagcgctg 9661 aatcaataaa tgtaaattgg aacaaggcga tgctcccgca gggagctgcc caaagggcga 9721 tcgccaataa agctataaat aaaactattc tcaagagatt ctccttgcaa ctagcctcca 9781 gcaacccaaa aacagtctcc agacactatt gtaaggctac aagaagtcct aaaagaacta 9841 gtcttttata cagttgcgaa cttccaagtg cgactacgcc atcatgctct aaattgtttg 9901 tgagtaacga gataatttgt ttgtttagat acaaatttat tcaaaattag cggtgacgag 9961 tttatccctc ttggttgcta tcaacttttg gatcaggata aaacttgtca ccactgataa 10021 cccctcatgt aaacatccca acattcgggt ttgttatcta ggagcttatt tcgctttata 10081 gcaccataaa gtcttttctg tcaacagctt aaatctagcc ttcgacagtg actttgccaa 10141 tcatgcctgc gccacggtga ggttcgcagt agaaggtata ggtaccagca ggtgtgtctg 10201 ctgcgaaagt ggttgttgtt ttttgaccag gactcatcag caacttttta tgagacagat 10261 ctttggctag agctttatcc tgattgggat ttttggctgg atcaaataca acattatgag 10321 gaggaacttt gttattcacc cattcaattg tgtcgcctgg tttaatcgtc aacttcgccg 10381 gctgaaacac taacattcct ttatcactac ccagtttgac agtatatgtt tccgctgaag 10441 cactgggagc aaaaacagcg aagctactaa caaagaaaaa aattgttaac aatgctagac 10501 caaagcgccg taagcttgac gcgacgattt tcatggtttt ctcctaatgc atggttttat 10561 cttccttatc cattttaaat aaaattaagc cgaaatatga caatatgtcg tagatgcaaa 10621 gtttttttaa tattggcgaa atatactata gttcaatcga gtgtcatcag caatgacgac 10681 ttgactttaa tacacaaaaa tgtatatgat tcatcagtct taagtactga gttttcaaaa 10741 actcttttta tcctcaaatc cacctcactg actcgaacta taggaatcag gtttggttta 10801 tgaacttaac tagttgaaca gggaacacaa cgaccacgct cagtgcatcg cgcttaacac 10861 ttcgaccacg ctcagtgcat cgcgcttaac 
agggaactct taacaacaaa aaaggctctt 10921 taaggtgtac ctagctgagc aaaaatcaga tagcaatcct atatttttct aaattttgta 10981 actgttaaac atagtttgta gtcattagtt actggtaact aatcgttagt ttctatttat 11041 aggacttctt atgacccaac aggagagccc ttcgggtatc tccttcggag acgctgagtc 11101 ccaaggggac acgctgcgcg aacgcgaacg ccagtcgcct gcggagggaa aacgccaggt 11161 gctacaacgg ggggaacccc cgcaacgcac tggctcccct cccgcagcgc tggactcacc 11221 agagccggaa cgagggaaac ctcggtgcag cgctggctca ctaataacta ataactaaat 11281 caagccatta gttttagtaa taaattttac cgccgcccca tcgtataggg tcaaccttag 11341 gttgcacgag aatataacca gcgatcgcct ccaagtcatc atcggtcaga tttctcattt 11401 ctgtgaaaat atctgcgctc ttagtgctgg ggtgtaattc agaaatttcc tcttccccgt 11461 cgtaggtggt aggatttttc aggtagtcca ccaaaccttc gatgttgtta cggtttggtg 11521 tcgctaaggc gagtgtttca ggctcaagtc ctatgttttg gtttgtcttg gtaatacctc 11581 caggatgaca ctgagagcaa gtgttgttaa ataagcgttt gccttctttg agttgtttaa 11641 ggctaacgac agtagtctcg ccactctcat ttaatgtcac cgtgcgggtc gctgggtcta 11701 gttgcactgc acaggcgcta ccgacaacga attgaaaggt cagcaaaata gtagccgcaa 11761 aaatgccaaa cagttttcta aacatttttc cccttaacga ttttgatgct caacacagct 11821 aaaatgacaa tgctcaatct cattcgtgct catggctaat tgctgagtta atattctttg 11881 ctattaggca cgaactttta gcaactcaag tcacttcggt ttcatgaact cagttgctag 11941 tgccttctgg gtgtggttca aacagaattt ggcaaataat tgcctttgca tcctaagcaa 12001 gccaaagtgg tcttaggttt ttgtaggcaa ttaagagttc gtcacacata gacaacacct 12061 tttccacagg aacactgtag tccgtgggta tctaggcaat tgacgagtcc gcaaaagcca 12121 taacacttca ggtagagata gagctactgt aataccaata tcacaaaagg cgacggggaa 12181 actcatcagt ggtgggctga ctcagaaagc ctgccatgtc aaaacaattc aaaattcaaa 12241 attcaaagta tgccctacgg gcacgctgcg ctatcaaaat taagaatccc cataccataa 12301 gttgggggct tgaagtaagg acgccttatt cctcatgcca tgcatctcgg aataaatctt 12361 atttcctttt ccataaataa atttaggggc ttgaaaccaa agggcaaaaa acagaggact 12421 aactttttca aggaaattct taatcccttc tgccttctgc ccctgcttgc gctgcctttc 12481 ttcggtcaat tagtgatgat gttgatagcg actgaaggca ggaccatagg ttgttaaaag 12541 gttttggggt 
gaaagggcaa cttctggaac accagtgcaa acaatggttt ggttgaggca 12601 aatgacgcga tcgcaatagc gactcaccat atcgatgtca tgagaaacct gtaacaccgt 12661 ccatccttct tcctgcttga gttcattaag tagtgtataa aaatccgctg taccttgtac 12721 atctactcca gcaaacgcct catctagcac caatagcttt cgagccataa ccaaacagta 12781 tgccaacaac acccgcttaa gttgaccgcc actgagagtt ccaattgctt gatttcgcaa 12841 atggtaagca tcggttcgcc gtaacgcttc agcgactgct gctgattttt cctgacgggc 12901 ttccccaaaa ctccccgttg ttaaagtggc taaagatggt tttcttttcg cttgtttcac 12961 ccatcccaac cccaccaatt cactcacaca gatgggaaag ctacggtcaa agataaagtt 13021 ttgtggcatg tagcccaact gacaacgtaa atttcctaaa cgtgctaatg ggcgacccaa 13081 tatttcaatt ttgccagcac ttcgtggaat caaatccaac actgcttgta ctaaggtact 13141 tttacccgca ccattgggac caactatagc tgtatttgtt cctgataata attcaaaaga 13201 aacatctctg acagctaggt agctatcttg ataaacagtc agtccctcta cttttaaaat 13261 aggagaatca ttaatcattc gtcattcgtt atgagtcgtt tttcataagt cacttgttat 13321 ttgcactttc tctaaaggac aaatagtaaa tgaccaaatt tatttgcagg ctgtttctaa 13381 agtttgtaaa ttatctcgca ttgccttgaa ataatgctgc gggtctgtat tgccagtttc 13441 taaagaatcc aaggaacgca aagttaaatt caagtcctta gaaagacttg ataataattt 13501 gttatctact cctggttcgc taaataaagc tttcacttta tactttttca ccgcattcac 13561 tgcatttttg acatccgctg gcgctagttg atcttctgga atttccacga ctgccacttg 13621 tttgaggtta tagcgtttag ctaagtatgg aaaggcatca tggaaggtga taaaggtaca 13681 attagggttt ttttgtaaag tctgctgaaa ttcactattt aaactgtcta attgtttgat 13741 ataagctgct gcatttgcct cataactcac tttatttgca ggatcagcag caattaaccc 13801 atcccgaata ttagcaactt gctgttttgc taaaactgga tctaaccaaa catgagggtt 13861 accttcggcg tgttcgtggt cgtgttcttg ctctttttta cccgttttct caacaggtga 13921 aatttcattt aagggttgaa taccagtact tgcattaatc acagacaatt tgggattttg 13981 ggcatttttt atggtgttgt ctaaaaattc ctctaaacct aagccgtttt ttactaagac 14041 attggcagtg gcgatcgcct tcacattatt tggtgtagat tgatactcat gtacttctgt 14101 tcccggtggt actaaaattt ccacatctgc tgcattccca gtgactgcct tggtaaacat 14161 atacatcggt aaaaacgttg tcaccacttt ggtttttccc gactgtggtg atggcgtgga 
14221 tgcagtttct tgtgctggtg gtgactgttc cggagttgtt gcttgatttg tattagattc 14281 cgtacaccca gcactcaaag atagcataag cagagcaatc acaggtgtca tccctcttgt 14341 gtatctattt cttctgcgtc tggtgtcagt ccgattcact attttgtctc cttattgaaa 14401 gctgtttgct ttgctgtgtt taaatttttg ttgataatga gatttatttt atctcatata 14461 ttataataat tctcattctc acattgtgag cgataactga gcgttaatcg taaaaaattt 14521 tttttaggaa gtactgctga ctgaatgtat agatatagat ttattgagaa agaactctta 14581 tggtttgttc tgttggagtt ttactttgac atgtatacag atttcatgag atttttttga 14641 aattgattgt taaataatct gctaattact gataaattaa tgagactact tatgtaggag 14701 accggattcg cctcaacttt gatacaagct tcaatggcga agatttactc agaactcgac 14761 tacaggcaag caacctgaat gccttctcta gcacctctac gttcacacca gaaggtgact 14821 tccgatttgc aggcggcgct ttcgatgaca gtaacagtaa tgatgtaggg atagacgcac 14881 tgttgtatca gttccgaatt ggcgaaaaaa caaccgttgt tgttgaagcc aatgcaggtg 14941 cgattgatga ttttaccaat acagttaacc ctttcctgga tggagatggt ggtagcggtg 15001 ccctttctca ttttgggact cgtaacccaa tttactatca gcttaatggt gctggtttag 15061 gacttaggca tgagttcagc aagaacttgg aactcagctt agggtattta gcaaacgccg 15121 ctgctaatcc tgacagggga agtggtttgt tcaacggtcc ttatggcgcg atggcgcagc 15181 taactatcaa accaagtgac aaatttactc ttggtttgac ctatgtgaat tcttataaca 15241 atgatttcac tgctaacggt agtagtggca gcaaccgtgc taacctgcga tcagcattgg 15301 caaataatcc caatctacca gatactttgg cggccttctc agggattgag gtgccagtct 15361 ctagtaattc ctacggtgtg gaagcatcct tccagctaag ccctaagttt gttttgggtg 15421 gttgggctgg ttataccaat actcgcactt tatcgtctgt tggtggaaca attcctcgtg 15481 gggaatttga gatttggaat tatgctgtca ccctagcctt tccggattta ggtaaaaagg 15541 gtaacgtcgc aggtattatt gtgggtatgg agccgaaagt gactggtgtg agcgattctc 15601 ttaggagggc gatcggcaaa gacgaagaca cttcctatca cattgaagct ttctaccaat 15661 atcaggtgag cgaaaatatt agcataactc caggagtcat ttggctgaca gcgccagacc 15721 acaatagtaa taacgatgat attgtcattg gtgctgtcag aaccaccttc actttttaaa 15781 aagtacaaat taaacagtac aaaaagacca gcctaccaca gctggtctat ttaaaaatct 15841 tgtttgtgtt gtcttttcaa aagagttaag cccaaatcgc 
gcgttcccaa aacgcaacga 15901 gatacttttc ttaacaatat tgtaacagtc agtacaaatt ttttatgaca tttatcaaaa 15961 aaagatattc aggaaggaaa ttagcctcta tctttaagaa gaattccaaa tgtggtatat 16021 aggcggaaaa tatttaaatt tttaattgaa aattaaaagt tcaagaaaaa ttatgtattt 16081 tgaataacaa aaatactcgc aactcaggac ttagcactca gcactaacga ctgcggttgt 16141 ccccaaataa tacgctcctg cttgtagata gcaattcccg gttttgctcc cttgggtttg 16201 taaacatatt ttggctgagt gtaaacaact ggcacctggt cactctgacg accgcgactg 16261 tagtatgctg ttaagttagc aacatattgt aaatcagctt cctctgggac agcacctggt 16321 tctagacgca gtagcacatg gctgcctgga atctcttgag catggaacca taagtcataa 16381 tcgtttgcta cacgaaaact taattggtca ttctggcggt tattacgacc aattaaaact 16441 tcaaaaccac tgggggtgcg gtaattataa aagttaatac tagcggcttc agttggactg 16501 cgacggcgat attctgagtc ttctaaatac cgctgttgaa tgagttcttc acggatttct 16561 tcaagagctt gtaaatcgtc tggtgtgttg taattctcta tgtgagaaat cgctgcttcc 16621 acttgctcca aatactcaat ttcagcattg acttcagcca ataatggttc tacagcaata 16681 cgagcgcgtt tgagtttttg atgctgcttg taaaggtttt gggcattttg aacagcattt 16741 ttatctggct gtaaagcaat ttttactggt tcgctagttt caaaatcagg gataatgatt 16801 tgtgtcattc ccacttgcca ttcatggaga tgtgccatca acaaatcagc cttttgtcga 16861 tattcatctg ctttttctga ctgctgcaag cgttgctcaa aagtagcagc tttgactcgt 16921 aatttttcca aaagattatt cagtttctga ctcaattgat gccgcagttg ggaaaatgtt 16981 tgttgatttc cttggtcggt gtagtatcga ttgagtaact cttggatatc tttcgccctc 17041 ttgtgaacac cccaaccaag tacggtgtaa ccttcttgtg tccaaccagg ttcaaatttc 17101 tcgctttcga tcttttgcag ccattcttgc caatggtgaa acaaccgttt ccaatcatca 17161 acttttaaag tgtctgtgga actctctgga tctatgtctg ataccaacag cattgactgt 17221 atcagggaag gagaaacacc gcgatgggtt ttgagcaact gacgcttgat agatcctggt 17281 actaagctaa cccgttcttg ccaacgttct tgagattcac tcaagctggg tgtcgcagcg 17341 gtgagagtgg gtggtgtttc ataaggctgt cctgtcagaa ttggacggac actagattgt 17401 tgttgtttga cttgatgggc aacggtgata atgctgttat aggcatcggt gagaatgacg 17461 ttgctgtact tgcccatgat ttcgacgtac aggtgataca gggcgctttc tcctggacga 17521 cgggcaaatt gcaaatcaag 
gacacgttcc caaggggcga ttgtttcaag tcccactaag 17581 gccaaaccgc ccaactggtg cagtagttgt tggctaaagg taaaggtatc tggtgttctt 17641 ggtggtggat caccgatgtg aaggcgtgct gcttggggat gccaagaaat ttccagccaa 17701 ccccgctgct tgagggtgcg gagtgctacg gcaatggtat agctgttgcg ttgatagact 17761 tgctctatgc gcgatggtag ccagtgagcg cggatgtcac tacaaacagc ggtaagagtt 17821 gtaaagtcaa atgtttgcac aacggttagt cgctgttatt taggtactga ttctcagtat 17881 agcaatctta aatcattcgt gaacaacaag atagggaaca gggaacaggg ggtaggttgg 17941 ggaggacagt gcccttcggg ttcgcagtcg cctgcggagg gaaaccctcc cgcagcgctg 18001 tctcaccgtg gacgggtctg tcttaacgtt caaaccttta ttccacgttg atttcacccc 18061 acccctaacc cctcacgcca ggtgctacaa cggggggaac cccaacgcca gataccaagt 18121 gagg // LOCUS NODE_1826_length_18002_cov_5.876302 18002 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 18002) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 18002) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..18002 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..906) /locus_tag="DP116_15860" rRNA complement(<1..906) /locus_tag="DP116_15860" /product="16S ribosomal RNA" gene 1241..2119 /gene="glyQ" /locus_tag="DP116_15865" CDS 1241..2119 /gene="glyQ" /locus_tag="DP116_15865" /EC_number="6.1.1.14" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215550.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycine--tRNA ligase subunit alpha" /protein_id="PRJNA477356:DP116_15865" /translation="MNFQSVIAILHKFWSDRGCLIAQPYDIEKGAGTKNPHTFLRALG PEPWAVAYVEPCRRPTDGRYGENPNRFQHYYQYQVLIKPSPDNIQEIYLDSLRALGIR PEDHDIRFVEDNWEDATVGAWGTGWEVWLDGMEITQFTYFQQCGGIDCRPVSIEMTYG LERLTMYLQGVEAITKIEWMDNITYGDVHLQGEIEQCVYNFEASNPEMLLTLFNMYEQ EAQQLTERGLVLPSLDYVLKCSHTFNLLDARGVISVTERTRYIARIRHLARKIAQLYV EQREKLGFPLLKKSAT" gene 2422..4779 /locus_tag="DP116_15870" CDS 2422..4779 /locus_tag="DP116_15870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458154.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="competence protein" /protein_id="PRJNA477356:DP116_15870" /translation="MNQVSGVIICLGYILGLLFTAVPWGGFWVLGLGVVGAIFFGRKR LNIRKHLPKKENSQAKTKTAPQLSQTSPHARVWLIAGVIGLLASFYFQSRVPTPQAND ISKFVPSENGNNQEQLFIVRGKVLSKPRMTRSQRGQLWLEATQFDEVKNENVPDGRTK GVTGKLYVTMPLLQTTGLHPTQQIAVTGVLYKPKPPLNPGAFDFQKFLQQEGAFAGLS GRQVNILDEGKTWGWWKVRERIVRSQVRSLGVPEGPLVSAMVLGSKVVDLPYETQDRF VQVGLAHALAASGFQTSLILGVILGLTSKAKKGTQMILGSIALLLFLTLTGLQASVLR AVIMGFAALIGIGLRRKVKQLGSLLVAAVLLLLFNPLWIWDLGFQLSFLATLGLIVTV SPITKRFDWLPPIMTSSIAVPLAAAIWTLPLLLYVFSVVAIYTLPANIISTPLISVIS IGGMISALVSLISPELGSSLADLLYHPTHWLLKLVEFFGSLPGSTVAVGSISLGQMLA MYILIILAWVVRWWQQRWWFSAIIALGLVFIPVWHSANTLFRVTVLAAGGEPVLVIQD KGKVTLINSGDEGTGRFTILPFLQQQAVNKVDWAIASDFQHNGNNAWLEVLQRLPIGI FYDYSPRSDNDTTNQVIQKEVQNSKGIYQPLSVGQTISTGSVVAQLINNQLPILQLQM FGQNWLLVGQTKTTELLKLLNTGRLVRPQVLWCPGESLKELISVLQPQVAIATTTNVD QKILSELSQTQTKLFFTSRDGAIQWTPNGQFEIFIQAGENKTSIL" gene 4883..4967 /locus_tag="DP116_15875" tRNA 4883..4967 /locus_tag="DP116_15875" /product="tRNA-Ser" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:4917..4919,aa:Ser,seq:cga) gene 5052..5573 /locus_tag="DP116_15880" CDS 5052..5573 /locus_tag="DP116_15880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651694.1" /note="Derived by automated computational analysis using 
gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15880" /translation="MLTPAEFLKYTQWSGIATLVFAVLAIAGFVFKWGIRFRLVGSTG FMVVLTGGLFALSLAPLSRTVIPGAVRYSRVYDNGGTQVVITTSPQITPSELEATLRQ AASDLYSYGRFGTQAENQLTIRARTIIHPEPEISVPLYLGQVKRSLSSREDSQMTIEI YQDKFAQLPQTTS" gene 5640..6815 /locus_tag="DP116_15885" CDS 5640..6815 /locus_tag="DP116_15885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316197.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_15885" /translation="MSHQSTETLSTQTSSPLLTLSVAPARVLRGSHVLTQASDVISQL GSRPLIIGGEYTLSVIQQSLEQFLKQPRLHFTQAFYTPDCSEASLKALHKAAKEHKAD VIIGVGGGKALDTAKLVAYKLQLPVVTIPTSASTCAAWTALSNVYSDEGAFLYDVALA KCPDLLILDYELVKTAPQRTLVAGIGDAIAKWYEASVSSGHSEQTLIIAAVQQARVLR DILFQKSAAAVKEPGSEAWQQVVDATVLLAGVIGGLGGAQCRTVAAHAVHNGLTHISK SGSIHGEKVAYGILVQLRLEEIIQGNQLAAAARQQLLKFYAEIGLPQKLNDLGLGNIS LNELQKAAEIALASDSDIHRLPFKVALEQLMAAMVSTTAPVEGRNHTALTPIVNKDE" gene 6986..8170 /locus_tag="DP116_15890" CDS 6986..8170 /locus_tag="DP116_15890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198005.1" /note="produces methionine from 2-keto-4-methylthiobutyrate and glutamine in vitro; mutations do not affect methionine salvage in vivo however; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LL-diaminopimelate aminotransferase" /protein_id="PRJNA477356:DP116_15890" /translation="MISDWIAPAERIQKLPPYVFARLDELKAKAREQGLDLIDLGMGN PDGATPQPVIEAAIKALQNPANHGYPPFEGTANFRQAITNWYRRRYGVDLDPDSEVLP LLGSKEGLAHLAIAYINPGDLILVPSPAYPAHFRGPIIAGGKVHSLILKPENDWLIDL AAIPDSIAQQAKILYFNYPSNPTAATAPREFFEEIVAFARKYEILLVHDLCYAELAFD GYQPTSLLEIEGAKDIGVEFHTLSKTYNMAGWRVGFVVGNRHIIQGLRTLKTNLDYGI FAALQTAAETALQLSDDYLHEVQERYRTRRDFLIQGLAELGWNLSKTKATMYLWVPCP VGMSSTEFALKVLQQTGVVVTPGNAFGVAGEGYVRISLIAECDRLGEALHRLKQANIR FH" gene 8278..9336 /locus_tag="DP116_15895" CDS 8278..9336 /locus_tag="DP116_15895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316193.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycerol acyltransferase" /protein_id="PRJNA477356:DP116_15895" /translation="MINQHSDNLLNLQLKKTSVEDYKFGWFDWFCLWYPPGWLILFNR HWQHYHKDPDGWNWLEYGLFLIPCGFYLALFIRWLRLGCRSPRRQIGEFDPNYQKAFR DEIIAPIVKYYFRGELHKIENLPQTGSMIVTMNHAGMCFPWDFLTLGYLLSKARGWLV QPIAGVSLFDHHWIAWWLPPGWSKVLGGVRAELNDFKTVMQERKILLYAPEGLRGPRK GWVKRYQLERFDLSFLQLSQRYQISILPVVCIGNENLHPWTLNIRKLQRLFNLPFLPI SPLMPLFILFPSMGVWAMRSRLRYFIQPLCTTELDGEETKRAEGYRKAQQLREKLQSQ INQLLSLKIKSEQQVQKT" gene complement(9349..10050) /locus_tag="DP116_15900" CDS complement(9349..10050) /locus_tag="DP116_15900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741537.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_15900" /translation="MRILLVDDEVELTDPLSRMLTREGYSVDAAYDGTSGSQLAQSGS YDLLLLDWMLPGKTGLEICQELRRQGDTTPVLFLTAKDTLDDRVEGLDAGADDYLVKP FELRELLARVRALLRRSGSQSHSTTTRRLVVADLELDRENQVGYRQGRIIDLSEKESQ LLQYFMENTGQLLTHAQILQYLWTDDEPPSSNVIAALIRLLRRKVEQAGETPLIHTVY GKGYRFGASAPGTGE" gene 10118..10762 /locus_tag="DP116_15905" CDS 10118..10762 /locus_tag="DP116_15905" /EC_number="2.4.2.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206613.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_15905" /translation="MLTIALPKGELLKNSIRILQNVGLDFSAFLDSSNRQLQIPDCSN KAKGLLVRAQDVPVYVEYGQAQLGIVGYDVLREKKPQVAHLVDLQFGHCRMSVAVKQS SPYRSVLELPPHGRVASKYVNSAREYFHGLDLPVEIVPLYGSVELGPITGMSEAIVDL VSTGRTLRENGLVEIETLFESTARLIAHPLSYRLNTDDLLGLVEQLRSSGLATV" gene complement(11085..11642) /locus_tag="DP116_15910" CDS complement(11085..11642) /locus_tag="DP116_15910" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15910" /translation="MPHSSASDGLAAKKEVIDEALLLALEQVAEDMANQSRQLTADWL LNLTNELATYLGVQATSHKWKAYQTFLVDVLEAIEKNPDPQVMYPFLAANQDKLNDNL AYVLQIWAMGTLPYLEEISAQYTAAFIVDFSNLVQGFEQGNSASHMEIAIVGYQVAAT VFTRDHRRWRLPHRFPYEWATNDAK" gene complement(12252..13208) /locus_tag="DP116_15915" CDS complement(12252..13208) /locus_tag="DP116_15915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458605.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_15915" /translation="MQKRTLGTSNVKITPILIGTWQAGKKMWVGIEDADSIKAIRAGF EAGITTVDTAEVYGDGHSERIVAEALSDVRDQVEYATKVFANHLKYNQVIEACNHSLK NLRTDYIDLYQIHWPAGSFNSEVVPIEETMSALNHLKKEGKIRAIGVSNFSRAQLEEA SQYGRIDSLQPPYSLFWRYVEKDALPYCIEHKITIIAYSPLAQGLLTGKFEAGHKFDP QDNRAKNKLFQGENFERAQQALEKLRPIAERHNCTLAQLALAWLIAQPQSNAIAGARY PEQATANAQAASVQLSTEDLHQIDVIGRIVTDHLDDNPVMWA" gene 13299..13901 /locus_tag="DP116_15920" CDS 13299..13901 /locus_tag="DP116_15920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15920" /translation="MLMEELIMTNHALKEWAVAINALETGKTIMLLRKGGIHERNGRF EVNHKQILLYPTFEHQQPFLLKPESANLVIPVTPGWHPETVGINSWAEITDIFPVSEE SVVNALLPFHIWNDNFISDRLKWKPRQPLYILLLRTYKLPQEQEIPYHAKYGGCKSWI DLDQPISLQGSQPILSSSMYDQLVAQIRDIVSDKLYAPSI" gene 14172..>14380 /locus_tag="DP116_15925" CDS 14172..>14380 /locus_tag="DP116_15925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198776.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15925" /translation="MHRRMCWLSKFGDSEEKVLHLQTSPNEPWRPYTAFGQYAVPDYK IPGGSKGWATFQKLSKEGWTLIPTA" assembly_gap 14381..14390 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(14476..14631) /locus_tag="DP116_15930" /pseudo CDS complement(14476..14631) /locus_tag="DP116_15930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876899.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 15462..15704 /locus_tag="DP116_15935" CDS 15462..15704 /locus_tag="DP116_15935" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15935" /translation="MGGSVVNKTTYVLTVNVSSLNIMTLIGYTEEIKIECLEMYVNGS GFRAIERVKKVHHTTVINWVRQLGDTLPDIGVLFDF" gene complement(15690..16202) /locus_tag="DP116_15940" /pseudo CDS complement(15690..16202) /locus_tag="DP116_15940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_086558172.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene 16241..16426 /locus_tag="DP116_15945" CDS 16241..16426 /locus_tag="DP116_15945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15945" /translation="MPFSKKHHMGAKPLNDTPFDRTPVCFNVRVGVREKLKTVPDWKE RLREFFDQLISDLPKNE" gene complement(16471..17041) /locus_tag="DP116_15950" /pseudo CDS complement(16471..17041) /locus_tag="DP116_15950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879841.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 17254..17970 /locus_tag="DP116_15955" CDS 17254..17970 /locus_tag="DP116_15955" /EC_number="2.6.99.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111582.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyridoxine 5'-phosphate synthase" /protein_id="PRJNA477356:DP116_15955" /translation="MPTLGVNIDHIATIRQARRTVEPDPVAAAVLAELAGADGITAHL REDRRHIQERDVRLLRQTVRTHLNLEMAATDEMVAIALDIKPDYVTLVPEKREEVTTE GGLDIVGQIDRIGQVVDKLQSAGIPVSLFIDAEPAQIEASVKVQAKFLELHTGRYAEA KDETSREEELAFLSKGCEQAINAGLRVNAGHGLTYWNVYSVANLPGMEELNIGHTIVS RSALVGMERAVREMVRFVVS" BASE COUNT 5043 a 3746 c 3942 g 5261 t 10 others ORIGIN 1 ttaaaccaca tactccaccg cttgtgcggg cccccgtcaa ttcctttgag tttcacactt 61 gcgtgcgtac tccccaggcg ggatacttaa cgcgttggct acgacactgc ccgggtcgat 121 acgggcaacg cctagtatcc atcgtttacg gctaggacta ctggggtatc taatcccatt 181 cgctccccta gctttcgtcc ctcagtgtca gttgcggtcc agcagagcgc tttcgccacc 241 ggtgttcttc ctgatctcta cgcatttcac cgctacacca ggaattccct ctgcccctac 301 catactctag tctctcagtt tccactgcct ttatctggtt gagccagact ctttgacagc 361 agacttgaaa aaccacctgc ggacgcttta cgcccaatca ttccggataa cgcttgcatc 421 ctccgtatta ccgcggctgc tggcacggag ttagccgatg ctgattcctc aggtaccttc 481 agtacttatt ccctgagaaa agaggtttac aacccaagag ccttcttccc tcacgcggta 541 ttgctccgtc aggctttcgc ccattgcgga aaattcccca ctgctgcctc ccgtaggagt 601 ctgggccgtg tctcagtccc agtgtggctg gtcatcctct cagaccagct actgatcgtc 661 gccttggtgc gctcttacca caccaactag ctaatcagac gcgagctcat caaaaggcaa 721 ttaatctttc acccgaaggc acatccggta ttagcagccg tttccaactg ttgtcccgaa 781 ccttttgcca gattctcacg cgttactcac ccgtccgcca ctaagttccg aagaactccg 841 ttcgacttgc atgtgttaag cataccgcca gcgttcatcc tgagccagga tcaaactctc 901 cgttttgatt ctctgtgtag atggcgaacg aatcgcttct ctactgctca tctagctgat 961 ttttcttacc ttcagcctag gtttattctt tactgacgca aggcttgtag tgtatactag 1021 ctttcaaact ataggatttt caaggttcgt 
tgccctcggc gtcggctttg tttggcgtcc 1081 gctcttccgg cacttattca atatagcgaa cccccttttt cttgtcaact cttttttcaa 1141 agtttttttg tttttttttt gaaagtgctc aaaactcccc actatgacgg gtttaggcaa 1201 aaatgggaga tgcactttca gtcatgaagg aagagtcatt gtgaattttc agtcagttat 1261 agctatattg cataagttct ggagcgatcg cggatgcctt attgcccaac cctacgacat 1321 agaaaaagga gcaggcacta aaaaccccca cactttttta agagcgctgg gacctgaacc 1381 gtgggctgtc gcttacgttg aaccttgtcg ccgtcctact gatggacgtt acggcgaaaa 1441 ccctaatcgc ttccaacact attaccagta ccaagttctg attaaacctt caccagacaa 1501 tattcaagaa atttatcttg attccttaag agctttgggg attcgtcctg aagatcacga 1561 tatccggttt gtagaagata attgggaaga tgcaacggta ggagcttggg gaacgggctg 1621 ggaagtctgg ttagatggaa tggaaatcac ccaatttaca tactttcagc aatgcggagg 1681 cattgattgc cgtccagtct ctattgaaat gacttatggc ttggaaagat tgactatgta 1741 tctccaagga gtagaagcaa tcaccaagat tgaatggatg gacaacatca catatggaga 1801 tgttcacctc caaggagaaa ttgagcagtg tgtttacaat tttgaagcct caaatccgga 1861 aatgctgctc acgttattta acatgtacga gcaggaagcc cagcaactga ctgaacgagg 1921 attggtgtta cctagcctgg attatgtgtt gaaatgttca cacactttca acttactaga 1981 tgccagagga gtcatttctg taacagaacg aactcgctac attgcaagaa ttcgacattt 2041 agcgcgcaag attgcgcaat tatatgtgga acagcgagaa aagctgggtt ttccgctttt 2101 gaaaaaatca gctacttaag tgcatagtca ctgagaaagc taaggaagcg cagatgaaga 2161 taggtcaatc tgtacttcat ctgaagaaaa accaccatat tctcaaccca agatttcaaa 2221 ctaaagaaac tacaggtatg agtttttggt agatctattg ccaaaatagt gaagcgcttg 2281 cagactggaa gtctggggct atagaaacag agcttgtcta cgcaggctta acatatagag 2341 tttgcttgcg tagaaaattt tcgggcttct cgtctacata gtcacaaatt gtgtctgcca 2401 ggtgtttgtg caaaaactct gatgaatcag gtaagtggtg tcataatctg tcttggctac 2461 attctaggat tactgtttac agcagttccg tggggtggct tttgggtgtt gggtttgggc 2521 gtagtgggag ctatattttt tggaagaaaa cgccttaaca tacggaaaca tctgccaaaa 2581 aaggaaaatt ctcaagcaaa gactaagaca gcgccacaat tatcgcaaac cagtcctcac 2641 gctagagttt ggcttattgc tggtgtaata ggtttgttgg ctagttttta ctttcaatca 2701 cgagttccaa caccacaagc caacgatatc agtaaattcg 
tcccgtcaga aaatggcaat 2761 aatcaagaac aactttttat tgttcgcggt aaagtgctta gtaaaccccg catgactcgc 2821 agtcagcgtg gacagttgtg gttagaagca actcagttcg atgaggtaaa aaatgagaat 2881 gttccagatg gtagaacaaa aggagtcaca ggaaaattat atgtgactat gcctttactt 2941 caaaccactg gattgcatcc cactcaacaa attgctgtga ctggggtttt gtacaaacca 3001 aaaccaccat taaatcctgg tgcttttgat tttcaaaaat ttctccagca agaaggtgca 3061 tttgctggtt tgagtggacg gcaagtcaat attctggatg aaggaaaaac atggggatgg 3121 tggaaagttc gagaacgaat tgtgcgatca caagttcgtt cgttaggtgt tccagaaggc 3181 ccacttgtaa gtgcaatggt tttgggtagc aaagtcgttg atttacctta cgaaacccaa 3241 gaccgttttg tgcaggtagg acttgctcat gctttagccg cctcagggtt tcaaacatct 3301 ttaattttag gtgtcattct aggattgaca tcaaaagcga aaaagggaac ccaaatgata 3361 ctcgggagca tagctctgct tcttttctta actttaacag gtttacaagc atcggtactt 3421 cgagccgtca ttatgggctt tgccgcgctt attggaatcg gattaaggcg caaagtcaaa 3481 cagttgggat ctctgctggt tgcagcagtc ctattattgc tctttaatcc tctatggatt 3541 tgggatttgg gttttcaact cagttttctg gcaacactgg ggttaattgt aacagtatcg 3601 ccaataacaa aacgtttcga ttggttgcca cctatcatga catcttcaat tgcagtccca 3661 ctagccgctg caatttggac attacctctt ctattgtacg tctttagtgt agtagcaatt 3721 tacaccctac cagccaatat tatctccaca ccattaattt ctgtcatcag tatcggtgga 3781 atgataagcg ccctagtcag cttaatttca cctgagcttg gaagcagctt ggctgatttg 3841 ttatatcatc caacccattg gctgctaaag ttggtggaat tttttggcag cttaccagga 3901 agtacagttg ctgtaggtag catatccctt ggtcagatgc tggcaatgta catattgatt 3961 atcttggcct gggtggtgcg ttggtggcaa cagcggtggt ggttttctgc cataattgca 4021 ttgggtttgg tttttatccc tgtttggcat tctgcaaaca ctttgtttcg ggtaacagtg 4081 ttagcagctg gtggggaacc agttttagtg attcaagaca aagggaaggt cacactcata 4141 aatagtggag atgaaggtac gggacgcttc actatactgc cgtttttaca acagcaagct 4201 gtgaataaag tagattgggc gatcgcttct gattttcaac acaatggcaa taatgcttgg 4261 ttagaggtgt tgcaacgttt gccgattgga attttctatg actattctcc caggtctgac 4321 aatgacacaa ccaatcaagt tattcaaaag gaagtgcaaa acagtaaagg aatttaccaa 4381 cctttgtcag ttggtcaaac tattagtact ggttcggtag tcgcgcaatt 
aatcaacaac 4441 caattgccca tcttacagtt gcaaatgttc gggcaaaatt ggctgttagt gggtcagacc 4501 aaaacaactg aactgcttaa actactgaat actggacgtt tagtgcgtcc gcaagtctta 4561 tggtgtcctg gtgagtcttt aaaagaatta atttctgttt tgcaaccgca ggtggcgatc 4621 gcaaccacta ccaatgttga ccaaaaaatc ctatctgaac tgagccaaac tcagacaaaa 4681 ttattcttta caagtagaga tggtgctatt caatggacac ccaacggaca gtttgagata 4741 tttatccagg caggtgaaaa caaaacatct attttgtgac actcccacgg ctacggaaat 4801 atcgcactac gtgcttgctc attcgctgta caaaactcgc acatgcatga tatgttattt 4861 atcataggaa tgttatgcct ctggagaggt ggcagagtgg ttgaatgcgg cgcactcgaa 4921 atgcgtttta gggcaaccta acgggggttc gaatcccccc ttctccgttt aatagcaata 4981 atgtggacag aactgactaa aataggtgta gcaaaaacga ataaagaaaa cacgcctagt 5041 agctaacctt tatgctcaca cctgctgaat ttctcaaata cacccagtgg tcgggtattg 5101 caacactagt gtttgctgtc ttggcgattg cgggttttgt tttcaaatgg ggcattcgct 5161 ttcggcttgt gggttcgact gggtttatgg tagtactgac gggtggtcta tttgctttat 5221 cattggctcc tttgtctcgc actgtgattc caggtgcggt gcgatatagc cgggtttatg 5281 acaacggagg aactcaagtg gtcattacta cctcaccgca aattaccccg tcagaattag 5341 aagctaccct acgtcaagca gctagtgatt tgtattctta tggtcgcttt ggtacacagg 5401 cggaaaacca gttgaccatt cgagcacgca ctattatcca cccagaacca gaaatttctg 5461 tgccacttta cctaggtcaa gtgaagcgat cgctgtctag tcgtgaagat tctcaaatga 5521 ctatcgagat ctaccaagat aaatttgctc aattaccaca aaccacctct tgaaaagtga 5581 ggatgaagag gttctccaac acattgagaa gtggtagtgt taatttatta attagttcta 5641 tgtctcatca atctactgaa accttgtcta ctcaaacctc tagtccatta ctcactcttt 5701 ctgtagctcc agcaagggtt cttcgtggct cgcacgtctt gacacaggca agtgatgtca 5761 tttctcagtt gggaagtcgt cctctgatta taggaggtga atacactctg agtgttattc 5821 agcagagtct ggaacaattt ctcaaacaac cacgcttgca ttttacccaa gctttctata 5881 ctcctgattg tagtgaagcc agcttgaaag ctttacacaa agcggcaaaa gagcataaag 5941 ctgatgtcat aattggggtt ggtggcggta aagcactgga tacagctaag ctagttgctt 6001 acaagttgca gttaccagtc gtcacaattc ccacgtcagc gtctacgtgt gcggcttgga 6061 ctgccctctc gaatgtgtat tctgatgaag gggcatttct ctatgatgtg gcattggcaa 
6121 agtgccctga tttactcata cttgattacg agttagtaaa aactgcacca caacgaacac 6181 ttgtagcggg aattggggat gcgatcgcca agtggtacga agcttcagtc agtagcggac 6241 actccgaaca aaccctaatc atcgctgcag tacaacaagc gcgagtttta cgggatatcc 6301 ttttccaaaa atctgctgct gctgttaaag aacctggtag cgaagcttgg caacaagttg 6361 tagatgcaac agttcttctt gcaggagtca tcgggggatt aggaggcgca cagtgtcgta 6421 ctgttgctgc tcatgctgta cataacggtt taactcacat ttctaaaagt ggcagcattc 6481 atggtgaaaa agtcgcttat ggtattttgg tgcaactgcg tttagaagaa ataatacagg 6541 gtaatcagct agcagcagct gcacgacaac aattattaaa gttttatgca gagataggat 6601 taccgcaaaa attgaatgat ttaggattag gcaacattag cttaaacgaa ttacaaaaag 6661 ctgcagaaat tgccttagct tctgattctg acattcaccg acttccattt aaagttgcac 6721 tggaacaatt gatggcggcg atggtttcca ctactgcgcc agtagaagga aggaatcata 6781 cggctttgac accaattgtg aataaggatg agtgatgatg aaatcagtga acagtgaaca 6841 gttatcaaga taatcagctt tttttatctg ataactgata actgttgaat ggcatcaaat 6901 gcagaaggca aaagataaat tacgagttat tactctttgt ttttcagctt ttataattta 6961 taagttgtaa accataattc cttcgatgat atcagattgg attgctccag cggaacgcat 7021 acagaaatta ccaccctacg tgtttgcccg tctagatgaa ctcaaggcga aagcacgaga 7081 gcaagggcta gatttgattg atttgggcat gggaaaccca gatggtgcaa cgccgcaacc 7141 agtaatagaa gcggcgatta aagctttgca aaatcccgca aatcatggct accctccttt 7201 tgaaggcaca gcgaattttc gccaagctat cacaaactgg tatcgtcgtc gttatggagt 7261 cgatttagat ccagatagtg aggtattacc gcttcttggt tctaaagaag gattggctca 7321 tcttgccata gcctatatta atcctgggga cttgatttta gtaccttctc ctgcctatcc 7381 tgcccatttt cgtggtccaa taattgctgg aggcaaagtt cacagcttaa ttctcaaacc 7441 cgaaaatgac tggctgattg atttggctgc aattcctgat agtattgctc aacaagcaaa 7501 gattctctat tttaattatc ccagtaatcc gacagctgcc accgcacccc gtgaattttt 7561 tgaagaaatt gttgccttcg cccgtaaata tgagattctg ctcgtacatg acttgtgtta 7621 cgccgagtta gcttttgatg gctatcaacc gacgagtttg ttagaaattg agggtgctaa 7681 agacattggc gttgagtttc ataccctttc taaaacttat aatatggcag gttggcgcgt 7741 tggttttgtg gttggtaacc gccatattat tcaaggtttg cggacgctaa aaacgaattt 7801 
ggattatggg atttttgcag cattgcaaac cgcagcagaa actgctttgc aactaagcga 7861 tgattattta catgaggtac aagaacgtta ccgtactcgg cgtgattttc tcattcaagg 7921 cttagcagag ttaggttgga atctcagcaa aaccaaagcg acgatgtatt tgtgggttcc 7981 ttgtcctgtt ggtatgagtt ctacagaatt tgctctcaaa gtcttgcagc aaactggggt 8041 tgtggttacg ccaggtaatg ctttcggggt tgcgggtgaa gggtatgtgc gaattagctt 8101 aattgcggag tgcgatcgct tgggtgaagc tttgcaccgt ttaaaacaag ctaatatccg 8161 cttccattga cagactcatc atccaaacta tatgagtcca gagtaagatg gcaactaaaa 8221 gtttctgaag ctaagattat cgattaatcg aattctttcg gaagctactc agctactgtg 8281 attaatcaac actctgataa cctgttgaac cttcaattga aaaaaacatc agttgaggat 8341 tacaaatttg gctggttcga ttggttttgt ctgtggtatc ctcctggttg gctgatttta 8401 ttcaaccgac actggcagca ttatcacaaa gatccagatg gttggaattg gttagaatac 8461 ggattatttt tgattccctg cggattttac ttagcacttt ttattcgttg gttgcgactt 8521 ggctgtcgtt cacctcgtcg tcagataggt gaatttgacc caaattatca aaaagctttt 8581 cgagacgaaa ttattgctcc tattgttaaa tattattttc gaggcgagtt acacaaaatt 8641 gagaatttgc cacaaacagg atcaatgatt gtgacaatga atcatgcagg aatgtgtttt 8701 ccctgggact ttttaacgtt aggttaccta ttaagtaaag cacgaggatg gttggtgcag 8761 ccaatagctg gagtctcttt atttgatcat cattggatcg cttggtggtt accacctgga 8821 tggtcaaaag ttttaggtgg tgtaagagca gaattaaatg attttaagac tgtaatgcaa 8881 gaacgtaaga ttcttttgta tgcaccagaa ggtttacgcg gaccaagaaa aggttgggta 8941 aaacgctatc aactagaaag gtttgatttg agttttcttc aattaagcca acgttatcaa 9001 atttcgatct tacctgttgt ttgcattggt aatgagaatt tacatccttg gactttaaat 9061 attagaaagt tgcaaaggtt attcaattta ccatttttac caatatcacc tttgatgcca 9121 ttatttattc tctttccatc aatgggagtt tgggcgatga gaagtcgctt gcgttacttt 9181 attcagcctt tgtgtacaac tgaattagat ggtgaagaga cgaaaagagc agagggttat 9241 cgtaaagcac aacagttacg agaaaagttg caaagtcaaa ttaatcagtt gttaagtctt 9301 aaaattaaga gtgagcaaca agttcaaaaa acataacgca ctctactttt attctccagt 9361 tcctggtgca gaagcaccaa atcgatatcc tttgccataa acagtatgaa ttaaaggtgt 9421 ttcaccagct tgttcaacct tgcgccgtag taaacgaatt aatgcagcaa tgacattgct 9481 actaggtggt 
tcgtcatctg tccagagata ttggagaatc tgcgcgtgag tgagcagttg 9541 tccggtgttt tccataaaat attgtagaag ctgactttct ttttcggata agtcgatgat 9601 tctgccttga cgatagccga cttggttttc gcgatcaagt tctaagtcag cgacaacaag 9661 ccttcgggtt gtagtactgt ggctttgtga accggaacga cgcaataaag cccgaactcg 9721 tgctagcaac tcccgtaact caaaaggttt aaccaaatag tcgtccgcac ctgcatccaa 9781 accttcaact ctatcatcca gagtatcttt agcggtgaga aacaatacag gcgtagtatc 9841 cccttggcgt cgtaattctt gacaaatttc taaccctgtt tttcctggta gcatccaatc 9901 tagaagcagt aagtcataac taccgctttg tgcaagttgg ctaccacttg tcccatcata 9961 agccgcatca acgctatacc cctcacgagt taacatgcga ctcaaaggat cagttaattc 10021 aacttcatca tcaacaagca aaattcgcat ggtgagctat tgtacttagc tatcagatat 10081 tattattttg ctgattgctg agcgctgatt gctacttatg ctgactattg cattgccgaa 10141 aggggaactt cttaaaaata gcatccgtat cctacaaaat gtaggattag attttagtgc 10201 ttttttggat tcaagtaacc gccaacttca gattcctgac tgtagtaaca aagccaaagg 10261 gttactggtg cgggcgcagg atgtaccagt atatgtagaa tatggtcagg cacaacttgg 10321 tatcgttgga tatgacgtgt tgcgcgagaa aaagccacaa gtagcccact tagttgactt 10381 gcagtttggt cattgtcgaa tgtcggtggc ggtgaagcaa tcaagccctt accgttcggt 10441 gttagagtta ccacctcatg gtcgagttgc ttccaagtat gtcaatagtg cgcgagagta 10501 tttccacggt ttggatttac ctgtagaaat tgtaccgttg tatggttctg tggaactagg 10561 tccgattaca gggatgtcag aagcaattgt ggatttggtt tcgacaggac gaaccttacg 10621 cgaaaatggt ttggttgaaa ttgaaacttt gtttgaaagt acggcaagat tgattgctca 10681 tcctttgagt tatcgattga atacagatga tttgcttgga ttggttgagc aattacgctc 10741 cagtggtctg gctacagttt aactctagta cttcaccaac caatgctagc ccaaatttga 10801 ccgctattgg tgaccgttat tggttggtga agatgcaggg tggtttaata caaacagaga 10861 aaaatgacaa gactttacac aaggcgataa gcctctggct tgatgctacg catatcgcaa 10921 atggataacc tggaagtatt cccagataat tttgcgtctt tgcccagtca taaggaaagc 10981 gcgagcgtgt gtattgttgc aatgcttggt tgtaatagta tatagcttgt tccaagttat 11041 gagatttttt gccacggatg cgttcagaca aagcaagccc aagattattt tgcgtcgttc 11101 gttgcccatt catagggaaa gcgatgcgga agccgccacc tccggtgatc gcgtgtaaaa 11161 
actgtagcgg ctacttggta acctacaatc gcaatttcca tatgacttgc tgagttacct 11221 tgctcaaacc cctgcactag gttactaaaa tcaacaataa acgctgcagt atattgtgct 11281 gatatttcct ccaagtatgg cagggtaccc attgcccaaa tttgcaacac ataagccaaa 11341 ttatcattca gcttgtcctg atttgctgct aaaaacgggt acatcacttg cgggtcagga 11401 tttttttcaa ttgcttcgag tacatcaact aaaaaagttt gataagcttt ccatttatga 11461 gaagttgctt gtactcccaa ataagtagcc aactcatttg tgagatttaa taaccaatct 11521 gcagttaatt ggcgactttg gtttgccata tcctctgcga cttgttcgag ggctagtagc 11581 aaggcttcat ctatcacttc tttcttcgct gctaaaccat cacttgctga tgagtggggc 11641 aatttagcag ggctgcaatt gtatttaaat aagtctggat tattgtttcg tctatatgac 11701 ttgccagttg taactttagc acgtttactg atttacaatt ttattgtgca aaatttaact 11761 gggcgttgtg actgaagaaa gaactcaaaa ctcagaattc aggaaagtag gactgttgtg 11821 acgaatatca accaacgctg aaaacgcgaa aacctggtca ggagggacca ggttttatgt 11881 aaaaaatttc gtagtgatca aattgctatt atcgttaaac aaatacaaac gttctgctac 11941 tagggttgac aaaaacacta acaaactcag cataacgaac gtttttccgc tttgggattt 12001 ggcttcgcct ttcaagacag gcgctgtgcc acttcatcat cagacttatg ctgatatttg 12061 cttgggctga tgacggaagg acaccgagtc atttgccctt ttcagtttgg ttgtgccttg 12121 tttggtgtcc tatttataaa ttagcacctc aatatgaaat tgctagcagt tgtttactga 12181 agttaacaag ttgtaggact tctcaataac acaattgagg cgcaaagtat cgcgcctcta 12241 cttcaggatt atcaagccca cataactggg ttatcgtcta aatggtcggt gacaatccgc 12301 ccaataacat ctatttggtg cagatcttca gtggaaagtt gaacagaagc agcttgagcg 12361 tttgcggttg cttgttcggg ataacgcgca ccggctattg cattactttg tggttgagca 12421 attaaccatg ctagcgctaa ctgggcaagg gtacagttgt gacgttctgc aatcgggcgc 12481 aatttttcca aagcttgttg agctcgttca aaattttctc cctgaaatag cttattcttg 12541 gcgcggttat cttgtgggtc aaatttgtga ccagcttcaa attttcctgt caacaatcct 12601 tgagccagag gtgaataagc aatgatcgta atcttatgtt cgatacaata aggcagagca 12661 tctttttcta cataccgcca gaataaagaa tagggaggct gcaagctatc aatacgtccg 12721 tactgagatg cttcttccaa ctgagcacgt gaaaaattgg aaacaccaat tgcccgaatt 12781 ttcccttcct ttttgaggtg attaagagcg ctcattgtct cctcaattgg 
aacaacttca 12841 gaattgaacg atccagcagg ccaatgaatt tggtataagt ctatgtagtc agttctgagg 12901 tttttcaagg aatgattaca agcctcaatc acctggttgt acttgagatg gttagcaaaa 12961 acttttgtgg catactcgac ttgatctcga acatcagata aagcttcagc aacaattctt 13021 tctgagtgtc cgtcaccata aacctcagca gtatcaactg ttgtaatacc agcttcaaat 13081 cctgctcgta ttgctttgat cgagtcagcg tcctcaattc ccacccacat ttttttacca 13141 gcttgccaag ttcctatgag aataggcgta attttgacat tcgatgtacc cagggttcgc 13201 ttttgcataa tgattcctta tctatttctt aattcatttc tgtagacagt gacatagtgt 13261 atcggtcttg gcgtcaaatt aaatcagagt taaagtagat gcttatggaa gaactgatta 13321 tgacgaatca tgcactcaaa gaatgggcag ttgccatcaa tgccttagaa acaggcaaaa 13381 caattatgct cctgcgcaag ggtggtatcc atgaacgtaa tggacgcttt gaggttaacc 13441 acaagcagat tttgctttac ccaacgtttg aacatcaaca gcctttcttg ctcaaacccg 13501 agtctgctaa tttggtcatt ccggtgacac ctggttggca tccagaaaca gttggtatca 13561 acagttgggc tgagattaca gatatatttc cagttagtga ggagtcagtg gttaatgctc 13621 tacttccatt ccatatttgg aatgataact ttattagcga tcgcctcaaa tggaaaccgc 13681 gtcagccatt gtacattctg ctgctgcgga cttacaagct cccgcaagag caggaaattc 13741 cctatcacgc caaatatggt ggctgtaagt catggattga cttagaccaa ccaatttcgt 13801 tacaaggatc acaaccaatc ttgtcttctt ccatgtacga ccaattagtc gcccaaattc 13861 gcgacattgt cagtgataag ttatatgccc catccatata aaggataact atcaaaaaaa 13921 gtactcaaaa atacatcaat ttttgtgaac cataggatag cattggcaaa acgcaagctt 13981 gttgctgcgt ttttttccgt agactttcgt ctataacgcg tctaagtctt acattcactt 14041 tgagttctgt gacttgtatt gaggttaaca ggtgtgaaaa atacaaaaat taccccaaaa 14101 gggcgattta catattgaca gatgtctgta attatgctta caagctaact aacataaatt 14161 aggagtttgc catgcataga agaatgtgct ggttatcaaa gtttggcgac agtgaggaaa 14221 aagttttgca cctgcaaact tcaccaaatg aaccttggcg tccttacaca gcctttgggc 14281 aatatgcagt tccggattac aaaataccag gcggttctaa gggttgggcg actttccaaa 14341 aactctcaaa agaaggttgg actttgatac caactgcaag nnnnnnnnnn aactgcaaga 14401 gcaaacgagt ttttgtcctc caagacttcc gtggagtcat gactcaaaac agaagataga 14461 aaagtttttc actttttaca aacccctagg 
ggaaccttca ccacgcggat cagctgcccc 14521 ttctaaagtt ccatctggtg tgacgacaat cgagttaata ttaccccaag gctcagtttg 14581 ttttattttg tgtccccgac gctgcaactg ggcaaaagta agggcttcca aacccagagg 14641 ttcaaacaat aaggtactct gaacgtcatc ttatcaccgt cccattggca agtttgattc 14701 ccaaaacttt catctttaga accaacaatg aaaacaccat ctttaggaac aaacacctgt 14761 aaagctgagg ttttcaagtg ttgcaaacgt tgttctgttt gagaaatata acgcttctta 14821 ttatgtatct taaatttcag gtttcgccaa ttagtacctt gaaattgcag cgaacacgat 14881 aacggaaatt tgcaaccagt ttgagaatgt tgccagttct ttttacgata aaacttctga 14941 gcatctgtta atcttttctc agccgctttt acccaactca tagccgattt tcgtttagct 15001 tataactggt tgatgtgatt tgtacggcat ttgtgagcag aatctacttt cccaacagca 15061 tctgaaatca taccattggc atggcgttta ttaatgttgt aagtcttttg aagataggta 15121 ttccattcac ctttgttgaa ctttttcacg gtcagtaaat gattgactgt ctcacaagtt 15181 gctttgtgaa caaaagagcg aatacgttgc taaaaacatc tcgaaatctg taaaaccgag 15241 gttgtttagc tcgtcttctg gtgtcagtat tcctttgcaa tatgtaagtt tagtgatgtc 15301 atttattttt gttcgacctt tctccgtatt atatattcat atagcaatat ttttggcgtt 15361 gcagaataga aatatgaatt agggcgattg cgcagagcgc agacgccctt ggcgtatcgc 15421 tatgaaatgt ccaaggtgtg aatcaaccca tactcgtaaa aatgggcggc agcgtggtaa 15481 acaaaactac atatgtgttg actgtaaacg tcagttcatt gaatattatg actcttattg 15541 gatacacaga ggagattaaa attgagtgtc tagaaatgta cgttaatggt tctggttttc 15601 gcgcaataga aagagttaaa aaagtacatc atactactgt tataaattgg gtgaggcaat 15661 tgggtgatac tttaccagat ataggagtcc tatttgattt ttgaaaaaac tccgtacata 15721 gcaagagtgt caaaatcaaa ggtagtgtgt cggataaacc actcaaacag taatggctaa 15781 tgagaaaatg aaggaatcaa ggcacaaaaa cgtcgaaccc atgttttggc ttgcagccaa 15841 atatcctcta taggattttg ttctgggcaa ttaggggcaa agcgaacgca gtgtattttc 15901 cattggtcgg ttggcagacc ttgattaagc tcgtccaaaa aacctcgaac tgccttggag 15961 cggtggtaac tcgcaccatc ccaaaaaatt agcaatcgct ggttggggga ctgagttaac 16021 aaatatcgca ggtaatcaat agtattttct gaatttccgg catcgtaggg tttgagcagt 16081 aattctcgtt cgagatagtc cactgctccg taatatgtct gtttatctcg ctcattcacg 16141 actggaaccg 
cgatttccag gtcagttttt ccccatacat aaccacttaa atctccccac 16201 attttttagt tcagcaacgc catattttta gtgattgtgt atgccatttt caaaaaaaca 16261 ccacatgggt gcaaaaccat tgaatgacac accatttgac cgaacacctg tttgtttcaa 16321 tgtcagggtt ggtgtacggg aaaaattaaa aacagttcct gattggaaag aacggcttag 16381 ggaatttttt gaccaactta tatcagatct acccaaaaat gagtagtaat gggtagaggt 16441 tggtaggaat taagttacag ttcctcaccc ttattcacca aataccatct cagcgaaatc 16501 tggaacttgg ggctttctag cacaacagct acaagcccaa tcagcctgat atccgccttt 16561 ttttacccat ctgtgaaaac tggaatactc ccacaggtgc ggacaggata ctaacccatg 16621 tttaacagga ttgtaatgga tgtaatccaa atgccttgct aggtcagctt catcacgaat 16681 agtgtgttcc caaaaccgac gttgccaaac attactttct caatgtttgc gacgggaagc 16741 tgaaacattt tgaggtaatg actttttacc ccgtagcaat cttgtaaaca gcacttttaa 16801 ccgggagact cgttgcgagt aatcacaatc atcagatggt aaagtccaaa gaaaatgaat 16861 atgatcaggt aaaaatgact gcagcagtca tttcaaaggg tttttctgtc cgtgtcttgg 16921 cgacagcagt ccgcaagcga gagatgtttt ctggctccga aaatagaggt gtacggtagt 16981 aagtcaccag agttaggaaa acagttccac ctgggacata gcagcgacga tgttgggaca 17041 tatgggtaat atttaaggat tttggtgggc attgccctat cagattattg ttagaaagta 17101 tactttttga agggcaatgc ccaccctacc tctgtgcgag aaataattgt taagcgtcaa 17161 cagcaacagc caggttcatg agtcaggtaa ataaacaacc gatattccct ggtaaaatca 17221 aatcctgaag actgtgaaag ggagataatc aggttgccta cacttggtgt aaacattgac 17281 cacatcgcca cgattcgcca agcacggcgg acggtggaac cagatcctgt ggcggcggcg 17341 gtactagcag agttagcagg tgcagatggc attacggcgc atctgcgaga agataggcga 17401 catattcaag aacgggatgt gcgcctgttg cggcaaacgg tgagaacgca tcttaattta 17461 gaaatggcgg cgacagatga aatggttgcc attgctctcg atatcaaacc cgattacgtt 17521 actttggtac ccgaaaagcg agaagaagtc accacagaag gcgggctgga tatcgtcggt 17581 caaattgata gaataggaca ggtagttgat aaattgcaaa gcgctggtat tcctgtcagt 17641 ctgtttatcg atgccgaacc agcacagatt gaagcttctg tcaaggtgca agctaaattt 17701 cttgaactgc acactggacg atatgctgag gctaaggatg aaaccagccg tgaggaagaa 17761 ttagccttcc tatctaaagg gtgtgagcaa gcgataaatg caggactacg agtgaatgct 
17821 ggacatggac tcacttactg gaacgtctat tctgtggcta atctaccagg gatggaagaa
17881 ttaaatattg ggcacaccat cgttagtcgc tcagcactcg ttggtatgga aagagcagtc
17941 cgtgagatgg tgcgattcgt tgttagctga atcacggtag ccagtccgta cataagctaa
18001 cc
//
LOCUS       NODE_1848_length_17852_cov_4.594763 17852 bp DNA linear BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema.
REFERENCE   1 (bases 1 to 17852)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 17852)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17852 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1715) /locus_tag="DP116_15960" CDS complement(<1..1715) /locus_tag="DP116_15960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydantoinase/oxoprolinase family protein" /protein_id="PRJNA477356:DP116_15960" /translation="MLKVFADRGGTFTDIVAVTNNQAIIDGLSEHPKRFLIVPLPKGQ WVIVYKLLSENPEQYQDAVIQGIRDIMDLSGNAPIPTEAIEVVKMGTTVATNALLERN GDRVALLITKGFKDALLIGYQNRPDIFARHIILPTMLYEQVIEVSERYDANGKELTPV NIEQVKNDLQALLNTGIRSCAIVFMHSDRYPHHEQQVAQIAQEIGFTQISVSHQVSPL MKLVSRGDTTVVDAYLTPILRRYVNQVASHLPGVRLMFMKSDGGLVAAEQFQGKDSIL SGPAGGIVGAVETSKRAGFELVITFDMGGTSTDVAHFKGEYERQLDSEIAGVRMRVPV LAIHTIAAGGGSILFFDGSSYRVGPASAGSNPGPACYRRGGPLAVTDANVMLGKIHPQ YFPSVFGLDGNLPLDKDIVTQKFTQLAQEISTVTGNTRTPEQVAAGFIAIAVDNMANA IKKISLQRGYDVTEYVLCCFGGAGGQVACLIADTLGMKKIFLHPYAGVLSAYGMGLAD VRATRVGGVEKPLTQALIPQLVQLMEFLETQARSELPLPNPPTPLCPPLSRGDERGVP DRAGGV" gene 1885..2220 /locus_tag="DP116_15965" CDS 1885..2220 /locus_tag="DP116_15965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006618649.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" /protein_id="PRJNA477356:DP116_15965" /translation="MSFSLKANVKASSGNIFADLGLANPDELLVKAELARQISEIITK QDMTQIEAAELLGVDQPKISALMRGKLSGFSTERLFRFLNALGCDVQIVVKAKPESRK HAQIKVYSL" gene complement(2329..3297) /locus_tag="DP116_15970" CDS complement(2329..3297) /locus_tag="DP116_15970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949159.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="protochlorophyllide oxidoreductase" /protein_id="PRJNA477356:DP116_15970" /translation="MEQHQKPTVVITGASSGVGLQAARALAQKGWYVVMACRDLPKTE KAAQSLGMSPDSYTIIHLDLASLESVRQFVKNFRETGRSLDALVCNAAVYLPLLKEPL YSPDGYELSVATNHLGHFLLCNLMLEDLKNSGAKEPRLVILGTVTANPKELGGKIPIP APPDLGDLQGFEAGFKAPISMINNKKFKSGKAYKDSKLCNVLTMRELHRRYHESTGII FSSLYPGCVATTGLFRNHFPLFQKLFPLFQKNITGGFVSEELAGDRVAEVVADPEYNK SGSYWSWGNRQKPNRKSFEQEMSNEALDDKKAQKLWDLSTKLVGLA" gene complement(3476..3709) /locus_tag="DP116_15975" CDS complement(3476..3709) /locus_tag="DP116_15975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456369.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Nif11-like leader peptide family natural product precursor" /protein_id="PRJNA477356:DP116_15975" /translation="MTQTNAAQLFKAVKQDQVLKERLKAATNPEAFIKIAKERGYDFT VEELQTEISKLSEEELAGIVNPGVAPRSHIYPR" gene complement(4024..5037) /gene="cydB" /locus_tag="DP116_15980" CDS complement(4024..5037) /gene="cydB" /locus_tag="DP116_15980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315157.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome d ubiquinol oxidase subunit II" /protein_id="PRJNA477356:DP116_15980" /translation="METLTYFLPQVWFVVLALFLLLYVMLDGFDLGVGILSLTSKDEE RRGILMTSLSNIWDANETWLVLMGGGLFGAFPLAYGTILNALYIPIFVMIFGFIFRAV AFEFRELSNRKFFWNFAFGAGSFVAALGQGFALGAVLKGIAVDETGHFIGTSWDWLSW QSVLVALTLIQAYVLIGSTYLVWKTTGELQTTHYKTAKIAALTTLIGAIFITISTPIF YESARTRLFQQPLVYIFAVIPILGVLLIWQLLKSLNRQEERAPFLWTILLFVLTFLGL GLIVFPYIIPVKITIYEASADPSSLVIMIIFIGFLIPVMLFYNLYQYIVFRGKVTGGH YEG" gene complement(5161..6606) /locus_tag="DP116_15985" CDS complement(5161..6606) /locus_tag="DP116_15985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191499.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome ubiquinol oxidase subunit I" /protein_id="PRJNA477356:DP116_15985" /translation="MEFLSDSVVLSRMQFALTALFHMLWPVLTTGMGIYLVIVEGVWL KTRNPDYYLHARFWSKFYVLNFGIGVATGIPMEFQFGTNWSRFSEAAGNFFGSVIGFE ASWAFMLEAAFLGIMLFGWERVNPIIHYVSTILVAVGANLSTLWILTANSWMQTPAGG ELVNGKFIVHDYFAAIANPFMKNSVLHMFFATLETSLFVIGGISAWYILNRRHEAFFS KSLKIALAAAIAVAPLQIYIGHLSGEQVYHYQPSKLAAMEAQWETTPAGQSADWSLLA IPNDKAQKNDWEITVPNALGYILEFKQKLTYPVRGLSEWKPEDRPHMIGLIYYAFRIM IGIGFFFAGLMLLSVLQWLRGKLSAENIAQQRWLMRAWVFAAPLGYIAVDSGWIVRCV GRQPWIVYGQIRTVDGASNIPASNVLVSLTSFAVVYSILFVGVLYFGSRIIRRGPNLE LPVPGIEPDRPAVDTTPAEFVPDERPVEAQQ" gene 6706..7071 /locus_tag="DP116_15990" CDS 6706..7071 /locus_tag="DP116_15990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15990" /translation="MTRRSRLQHHQRDIFSRLAIAQGGAEAIAFTATYLLRFAAVGER ILSSLKFIFFRLSQINHLLTITTKGERPNTFGKLPNEFTAHVPHAVFILSGWVSFFSF SPPKNLFYGTPTVIHNLKF" gene complement(7060..8427) /locus_tag="DP116_15995" CDS complement(7060..8427) /locus_tag="DP116_15995" /inference="COORDINATES: protein motif:HMM:PF05419.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_15995" /translation="MNTYRLSQIFTFLKALVYSIQLLIWAFHPTFGSFIGLIITITCT RIFEILFLSFFRNRRTRKLKRVAANLGLDFYKTDKNKNIKPILEGLPLFEEIRPRAKN IWNDLLMILELLYRLIFSIRETKKMKNILISKQDNQGHFYAIFDFHSHNWNLNGSQDS TVSSNGLSISNQSQTMIIFASEDLKLPEFSVKVKAKSILRKIFERICKVFGHEKKETK QNIDVFDSKINDFLKAEKNLCMAAKGYRLVCYRDNILIKPKKIHSLLSTVFQASELLK APDSINVQKDFDYTKLRDLLKAGNWKEADRETTAILLKAIGTKIEYKNINIAVTLIDD IFLNPVLHNIDALWVEYSNGHFGFSVQKHIWLEVGGKVNYKTERLLADRVGWRVQGKW LCYSDLTFSLNAPKGHLPTTKLSELPFGWFYMGKRPLFKFIVSGLIALRRFIVLPGCF IASKF" gene complement(8488..11121) /locus_tag="DP116_16000" CDS complement(8488..11121) /locus_tag="DP116_16000" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="caspase" /protein_id="PRJNA477356:DP116_16000" /translation="MQKYLGKESEKPQLQFVVQSMFTKVRCSLLIFVFICLKGIFMSK YKFNRNFAILIGINNYKNGIPALETAAPDALKLAQIIQEQHQNLKQQYQAQNKYEVQL LLNQRVTLKKLKQLIEDFKKGQIFLDKEKVTVNKDDRFLFYFAGHGIALEALENQEGP VGYLIPQDATLGDSNTYLPMQELHDALNALPCRHMLAILDCCFAGAFRWASLKRDIVP KVTVYKERYDRFISDAAWQVITSAADDQKALDFLGQRGKVIDGNEIHSPFAKALFDAL RGGSDEGADFNKDGIITATELYSYLRNQVEILTEKHYKRQTPGLCPLKKHDKGEFIFL LPDFDRDKLEDAPPLNLNNNPYRGLESYDEKDSHLFFGRENLVEKLYQKAIDNKQPLT IVLGVSGTGKSSLVKAGLLPRLRNSNEFQFKILDPIRPGESPLKALAQICLLLATIVT PEELAKNEQALANIVERWSQTNPKTKLLLPVDQFEELITLCKSDKEREQFQKLIKNAI AKYPQNVHVVITLRLDFEAQFQNSVLKDFWNNATRFVVPPMTQDEFRTVIEKPASEKV VYFDPPSLVDELINEVVQMPGALPLLSFTLSQLYLKYLEQRRDNRALTKKDYEELGRV VGSLTQRANQEYENLVAKDPAYEQTIRQVMLRMVAVEGGESARRQVPDSELIYSSSEK NQRVAQVIEHLVQARLVVKGKEAGGEPYTEPAHDVLITGWKMLVKWIADERETLLLQR RLSSAEKRWQFQQSLTESGELITFKQGESFWNRQESLKTQTASNFLWHRDPYLEGLQQ VLNSDDNNWLNKAEEMFVRDSLKRRRRDHLLTIFQRIPYVILAFVFSLQLIISQQGFL SVLGLIIFVYSLLLLKSLFRY" gene complement(11352..14828) /locus_tag="DP116_16005" CDS complement(11352..14828) /locus_tag="DP116_16005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315153.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="caspase" /protein_id="PRJNA477356:DP116_16005" /translation="MTRHLYALLVGIDNYPAPIRCLQGCVNDITAIEEYINERFDKQE YQLHLQTLKDEQATRKAVIDGFRSHLSLAGQDDIVLFYYSGHGSQELAPKEFWQLEPD HFDETLVCYDSRTEGGWDLADKELAVLIAQVAQKNPHMTIIMDCCHSGSGTRDPMQET KERRLPTDKRERPLDSFIFTLDDLNRLLGTREVKPEDNPTGWNIPKGRHVLLAACQDY QTAKEYYGGDKHRGSFSYFLMDTLSKTNGKKLTYRDLFGRTNALVRSQIRDQSPQLEV NNPEDDNKFFLDGAIAELEPYFIVKNDKTDGWVIEGGAVHGVQPPRDGETTSLALFPF DANIDDLRDPSKSVGTAKVTKVLPTKSKIDIEGVQNLTAAGTSFKAVVTSLPLPPLGV YFEGDETGVTQARDALKTAGSNNNQPSPYIREEQELAKAQFRLLCRNEQYLIARPTDD RPLVEQIDGYTKDNADKAIKRLEHIARWTTIAELSNTAATQIKAGDVKMELIFKDEES SQSKQLRLQYKYRDGEWQKPEFQLKLTNTTNKSLYCALVNLSDNFAISAPFFEAGSVR LQPGEEAWALDGDPLVLSVPDEYWELSITEYKDIIKLIVSNNEFDARLLNQDELDAPR PPVSRDIDSSNQSSLERLMNRTQNRQIEAKNSARYDDWYAEEITITTVRPLDSIPVSQ EQEQQLDAGVKLLPHNSLVANARLTTTPQVSRDLGNKIVPPILREDPEVTRPFQFTSS RGTDPGLSVLELIDVADHKVVTPDAPLKLLVDVPLADNEYLLPVGYDGEFYLPLGRGT TTQDGKTEIILEQLPAPVSQGERSLKGSIRIFFQKVISQNFGREFKYPILAVAEVGDD KKVDYKRDMADVQQRVAQAKRIALYIHGITGDTESLVSTIKQPILQADGQKRSISELY DVVLTFDYENLNTSIEQNAQSLKRRLAEVGLGANHGKELHIIAHSMGGLVSRWFIERE GGNKVVQHLFMLGTPNAGSPWSVVEDWVKFTLAIGLNGLSLVAWPAKVVAMLMGALEK NIRVALTQMNPGSDFINSLAASDDPGIPYSIIAGNTSIIPAALEQQAGKKSSVLERLQ QSLFKKVVALPFFGVPNDIAVTVDSITSIPKGRSHPPKITEVACDHMSYFGTEFTEVG LEALVAALIQQK" gene 15277..15729 /locus_tag="DP116_16010" CDS 15277..15729 /locus_tag="DP116_16010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020163491.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_16010" /translation="MLHVTEARLDDIPQLCDLLTILFTQEADFQPDSAKQSQGLRQII EHPEVGRILVLHDGSTIIGMVNLLFTISTALGRRVAILEDMIVHPDWRGGGAGSTLLQ EAISFAQASGCSRITLLSDRVNSSAIRFYQRHGFTLSDMVPLRLLFPQ" gene 15830..16609 /locus_tag="DP116_16015" CDS 15830..16609 /locus_tag="DP116_16015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112766.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16015" /translation="MLILDAPQVEVIVGENAVHISQLPKVWQDIALGKAGVGLANPQS YVEMAQLFQYKLQQGDVDLFNERPELAHLKPSFKELFGLLARETLEFYGQDFKVERYP DFEAILREFESKGAEFSNEVKVARICLELFNEFDYELPASFYLVHLAPIYRDSVFEER ALRFDPRDTEHKRGWDAVLHAGKVFAVQMKIQSIASKYGLTYQHGCGCESHLSSIDMS LGAFDYQLNTEKRQRWIRSFIWTTWYEYAFFPIVPNTRYLV" gene complement(16692..17645) /gene="cysK" /locus_tag="DP116_16020" CDS complement(16692..17645) /gene="cysK" /locus_tag="DP116_16020" /EC_number="2.5.1.47" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407152.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cysteine synthase A" /protein_id="PRJNA477356:DP116_16020" /translation="MRIAHDITELIGRTPLVQLNKIPQAEGCIARIVVKLEGMNPAAS VKDRIAASMIKTAEDEGLIKPGKSILVEPTSGNTGIGLAMVAAARGYRLVLTMPETMS LERRAMLRAYGASLELTPGLEGMRGAIRKAEEIVASTPNAYMLQQFRNPANPKIHRET TAEEIWADTDGEVDIVIAGVGTGGTITGVAEVLKQRKPSFQAIAVEPANSPVLSGGQP GPHKIQGIGAGFVPEVLDKKLVDEVITVSDEQAIAYGRRLATEEGLLSGISSGAALYA AIQVAKRPGNAGRLIVMIQPSFGERYLSTPMFQDLALQTVR" BASE COUNT 5101 a 3878 c 3620 g 5253 t ORIGIN 1 accccacccg ccctatcggg cacccccctt tcatcccccc ttgacaaggg gggacagagg 61 ggggtaggag ggttagggag gggtagttca cttctagctt gggtttctaa aaactccatt 121 aactgcacca gttgaggaat taatgcttgg gttaaaggtt tttctactcc cccaactctt 181 gtcgcccgga catcagctaa tcccatccca taagcagaga gaactccagc ataagggtga 241 agaaatatct ttttcatccc caaagtatca gcaattaaac aagcaacttg tcctcctgct 301 cctccaaaac aacaaaggac atattcggta acatcatacc cgcgttgtag actaattttt 361 ttaatcgcat ttgccatatt atccacagcg atcgcaataa atccagctgc tacttgttcg 421 ggagtgcgag tgtttcctgt gacagttgaa atttcttgag ctaattgtgt aaatttttga 481 gtaacaatat ctttatctaa tggcaaattg ccatctaatc caaagacaga aggaaaatat 541 tgaggatgaa ttttacctaa catgacatta gcatcagtaa ccgccaaagg accgccacga 601 cgatagcaag cgggtcctgg atttgaacca gcagaagcag gtccgacgcg ataactagaa 661 ccatcaaaaa atagaattga accgcctcca gcagcgattg tatgaatagc taggacagga 721 actcgcattc tcaccccagc aatttccgaa tctaattgtc gttcatactc tcctttaaaa 781 tgggcaacat ctgtacttgt ccctcccata tcaaaagtaa taactaactc aaaacctgct 841 cttttgcttg tttctactgc accgacaatt cccccagcag gaccactcaa aatactatct 901 tttccttgaa attgttcggc tgcaactaaa cctccgtcag atttcatgaa cattaatctg 961 actccaggta gatgactagc tacttggttg acatagcgac gcagaatagg agttaaataa 1021 gcatcgacta ctgttgtatc cccccggcta actaacttca ttaaaggact tacttgatgg 1081 gatacagaga tttgagtaaa tccgatttct tgggcgattt gggcgacttg ttgttcgtgg 1141 tggggatagc gatcgctgtg cataaaaaca atcgcacaac tacgaattcc tgtgtttaac 1201 agtgcttgta agtcgttttt gacttgttca atatttacgg gagttaattc ttttccatta 1261 gcgtcatagc gttccgaaac ctcaatcacc 
tgctcataaa gcatggttgg taaaatgatg 1321 tgacgggcaa agatgtcagg acggttttgg taaccaatca agagagcatc tttaaatcct 1381 ttggtaatca gaagtgcaac tcggtctcca tttctttcta acagtgcatt tgttgcgact 1441 gttgtcccca tttttaccac ttctattgct tcagttggaa tgggtgcatt acctgaaaga 1501 tccatgatat ctcgaatgcc ttggatgact gcatcttgat attgttcggg attttctgag 1561 agtaatttat aaactattac ccattgcccc ttaggaagag ggacaattaa aaaacgtttg 1621 ggatgttctg agagtccatc tattattgcc tgattattag taacagcaac aatatctgtg 1681 aatgtaccac ccctgtcagc aaaaactttc aacatgactc aatttcctca atacctagag 1741 gtaggataac aaggggattt ggaacgtttc tacttcgggt gtgcaatatt ttacgacagg 1801 tgtggaattt gataagtctt aggactatta gaaacgccaa gatgtatcat tgacaaaata 1861 taccaaaact agcatactat aaagatgagt ttttcactaa aagcaaacgt caaagcaagt 1921 agtggtaata tttttgcaga tttaggtcta gctaatcctg atgagctact tgttaaagca 1981 gaacttgcac gtcaaattag cgaaatcatc accaaacagg atatgactca aattgaagct 2041 gcggaacttt taggagtgga tcaacctaag atttctgccc taatgagggg aaaactatca 2101 ggtttttcaa cagaacgact tttccgattt ttgaatgctt taggttgtga tgtgcagatt 2161 gtggtgaagg cgaaaccaga atctcgaaaa cacgctcaga taaaagttta tagtctgtag 2221 taggataagt tggcttacct gaagctgttt taggaaaggg tgaaggcgat cgccccaagg 2281 gcagacttct ggcggcgtat cgccacgaac gaaaacgcac cgtcttcgtt acgctaatcc 2341 aaccaacttg gtgcttaagt cccaaagttt ttgagctttt ttatcatcta acgcctcatt 2401 agacatctct tgctcaaaag acttgcgatt aggtttttgt cgattccccc aactccaata 2461 ggaaccagat ttgttgtatt caggatcagc gacgacttcg gcgacgcgat cgcccgccaa 2521 ctcctcagac acaaatcctc ctgtaatgtt cttctggaaa agtgggaaga gtttctggaa 2581 caggggaaag tggttgcgga acaagcctgt tgttgctaca catccaggat acagagaact 2641 gaaaataata cctgttgact catggtagcg ccgatgcaac tcccgcatcg tcaacacatt 2701 gcagagtttg ctatccttgt aagccttacc agatttaaat ttcttgttgt taatcattga 2761 aatcggcgct ttgaaacctg cttcaaaacc ttggagatcg cccaagtctg gaggtgctgg 2821 aatcggaatc ttacctccca actccttcgg attagctgtc acagtgccta aaatgacaag 2881 tctcggttct ttcgcgcctg agtttttcag atcctctagc ataaggttac acaagaggaa 2941 atgtccgagg tgattcgtag caacgctcaa ttcatatccg 
tctgggctgt acaaaggttc 3001 ttttaataaa ggcagataaa ctgcagcatt gcacaccaaa gcatccaggg atctaccggt 3061 ttccctaaag ttcttgacaa actggcggac gctctctaag ctagctagat ccagatgtat 3121 gattgtgtag ctgtcgggcg acattcctaa gctttgagcc gctttttctg tcttcgggag 3181 atcccgacaa gccatgacta cataccatcc cttttgagca agtgctctcg cggcttgcaa 3241 acctaccccc gatgatgcac ccgtgatcac aaccgttggc ttttgatgtt gttccatttt 3301 attcacactc cgttgtcttg actgtctact aggatctcac gccctggttg ttagatgtca 3361 attacttgga gtgaatggcg acaattattt cagaagtagt cagtcctcag tcctcaggac 3421 cctacgggcg gctgcagcca cccgctatga tacattcaat ctttcagttc agaaactacc 3481 taggataaat gtgtgatcta ggtgccaccc ctggattgac aattcctgcc aactcttctt 3541 cagacaattt actgatctca gtctgtagtt cttcgactgt gaagtcatag ccacgctctt 3601 tagcaatctt gatgaaggct tctggattag ttgctgcttt gagtctttcc tttaatacct 3661 gatcttgttt gacagctttg aaaagttggg cagcatttgt ctgtgtcata acagtctatc 3721 tcctgtttga aatatcagag tcgaatgtag taatttaact cctctcccga acgtcaactt 3781 caataaacga gccattaact aaggttatac acctcaatct tgaactcaag aggaaaatgt 3841 attattgatt ttgaaaaaaa aatataagtt catgttaggg tgaaatcgtc gaggatgaat 3901 gagatgagcc aaaggttcaa cgcgagtgac tgcgctggta taaaaaagcc tgccaaagca 3961 ggcttggtat atgtagtcac acccttgcag gtgacggcta cccgttaacc gaactgtatt 4021 gtgttaaccc tcgtaatgac cgccagtcac ttttccccgg aaaacaatgt actgataaag 4081 gttataaaac agcatcacgg ggataaggaa accaatgaag ataatcataa tgacaagcga 4141 actggggtca gcagatgctt catagatggt aatcttcacc ggaatgatat aggggaaaac 4201 aatcaacccc agtccgagaa atgtgagaac gaaaagaaga attgtccaga gaaaaggcgc 4261 tctttcttct tggcgattca agcttttaag aagttgccaa ataagcaaaa ctcccaatat 4321 tggaataaca gcaaatatgt aaacaagtgg ctgctgaaac aagcgagtcc ttgcactttc 4381 ataaaatatt ggtgttgaaa ttgtgatgaa aatggcacca atcaatgttg tcaaagccgc 4441 aattttagca gttttgtaat gagttgtttg caactcccct gtcgttttcc aaacaagata 4501 agttgagcca ataagaacat atgcttgaat taaagtcaga gccaccagta cagactgcca 4561 actcagccaa tcccaagatg tgccaataaa gtgaccagtt tcatcaacag caatcccttt 4621 tagcacagca ccaagggcga aaccttgacc gagggctgca acaaaactgc 
cagcaccaaa 4681 ggcaaaattc caaaagaatt ttcggtttga tagctcccga aactcaaacg ctacagcccg 4741 aaatataaac ccaaatatca taacaaaaat tgggatgtac agcgcgttga gaattgtgcc 4801 ataagcgaga ggaaatgctc caaaaagacc tcctcccata agaactagcc aagtttcatt 4861 agcatcccaa atgttgctca agcttgtcat taaaatgcca cggcgttctt catcttttga 4921 agttaaagat aagataccta cccctaagtc aaatccatct agcattacat agagcaacaa 4981 aaatagggct aaaacgacaa accatacctg gggcagaaaa tatgttaacg tttccataca 5041 acctcaagga aatgttgtgt attgttgaga gcatcctaaa tgtgttgatt ctcaaattca 5101 aaatttcaaa tcatcaattc aaaaattttt gaattttgaa ttttgaattt tgaattttat 5161 ttattgttgt gcctcaacag gacgttcatc tggtacaaac tctgctggag ttgtatccac 5221 agcaggtcta tcgggttcaa ttcctgggac agggagttct aaatttggac ctctacgaat 5281 aatgcggcta ccaaagtaca aaaccccaac aaacaaaata ctgtagacaa cagcaaagct 5341 agtgagtgag actaagacat tgctagcagg tatattggat gccccatcaa ccgtgcgaat 5401 ctgcccgtag acaatccacg gttgtcgtcc aacacaacgc acaatccagc cggagtctac 5461 agcaatgtat cctaaaggag cagcaaaaac ccacgcacgc atcagccaac gctgttgagc 5521 aatattctct gccgaaagtt taccacgtaa ccactgcaaa acactcaata gcatcaatcc 5581 tgcgaagaaa aatccaatcc caatcatgat gcggaaagcg tagtaaatta aaccaatcat 5641 gtgaggacgg tcttctggtt tccactcact caatccgcgt actggatatg tgagtttttg 5701 cttaaattcc aaaatatatc caagggcgtt gggaacggta atttcccagt catttttctg 5761 tgctttgtcg ttgggtatag cgagcaaact ccaatccgca gactgtcctg caggtgttgt 5821 ttcccactgc gcctccattg cagcaagttt tgaaggttga tagtgataaa cttgttcgcc 5881 actcaaatgc ccgatgtaaa tctgcaatgg tgcaacggcg atcgccgcag ccaaagcaat 5941 cttcaaagac ttggaaaaga aagcttcatg acgacgattg agaatatacc aagcactaat 6001 tccaccaatg acaaacaggg aagtctccaa tgtggcaaag aacatatgga ggacactgtt 6061 tttcatgaac gggttggcga tcgccgcaaa gtaatcatga acaataaact tgccattgac 6121 aagttcccca cctgctgggg tttgcatcca agaatttgct gttaaaatcc acagagttga 6181 taagtttgca ccaactgcaa ccaggatggt ggaaacatag tgaattatcg gattgacacg 6241 ttcccaacca aacagcataa tacctagaaa agcggcttct agcataaatg cccaagaagc 6301 ttcaaaccca atgacgctgc caaaaaagtt accagctgct tccgaaaaac gtgaccaatt 
6361 ggtaccgaat tgaaactcca ttgggatacc agttgcgaca ccaattccaa aatttagcac 6421 gtaaaactta gaccaaaagc gagcatgaag gtagtagtca ggattacgag tcttgagcca 6481 cactccctca acaataacta gataaattcc catacctgtc gttaagacgg gccagagcat 6541 atggaataat gcagtcaatg caaactgcat ccgtgataac acaacagaat cagataaaaa 6601 ttccacaagc tatcccccta tattcagcca aaatctacgg atagattccc ttgtatagta 6661 ttgcaacaat ccttaacagt gcgttttttg agaaaaattt aaggtttgac aaggcgtagt 6721 agacttcagc atcatcaacg cgatattttc tcaaggctgg cgatcgccca agggggagcg 6781 gaggcgatcg cttttactgc aacttacttg ttaagatttg cggcggttgg tgaaaggatt 6841 ttgtcatcat tgaaattcat tttcttcagg ctaagccaga tcaaccatct cctgaccatt 6901 acgactaagg gtgagagacc caacactttc gggaaattgc ccaatgagtt taccgcccat 6961 gtcccccacg cagtttttat acttagtggg tgggttagtt tctttagttt ttcgccaccc 7021 aaaaacctat tttatggaac ccccactgta atacacaatc taaaattttg atgcgataaa 7081 acaccctggt aatacaatga accgtctcaa ggctattagt ccagaaacta taaatttaaa 7141 taacgggcgc ttacccatat aaaaccaacc gaaaggcaat tctgataact ttgtagtcgg 7201 cagatgccct ttaggtgcat ttagggaaaa ggtcaggtca gagtaacata accacttgcc 7261 ttgtacacgc catccaaccc tatcagccaa caagcgctct gttttatagt ttactttgcc 7321 gccaacctct aaccagatat gtttctgcac actaaagcca aagtgcccgt tgctgtattc 7381 tacccagagt gcatcaatgt tgtgtaagac tggattgaga aagatgtcat caataagagt 7441 aacagcgata ttaatgtttt tgtattcaat ttttgtacct atcgctttta gcaagatagc 7501 agtagtttcc cgatcggctt ccttccaatt gcctgctttt agcaagtccc gtagcttagt 7561 atagtcaaag tctttttgaa cattgatact atcaggtgcc ttcaataact ctgaggcttg 7621 aaatactgta gataggagtg aatgaatctt tttcggttta attaaaatat tatcacgata 7681 gcaaactaat cggtatcctt tcgctgccat gcagagattt ttctcggctt ttaagaagtc 7741 attaatttta gaatcaaaaa cgtcaatatt ttgcttggtc tcttttttct catgtccaaa 7801 tactttacaa attctttcaa aaattttacg caaaattgac tttgctttta cttttactga 7861 gaattctggt aattttaaat cttccgaagc aaaaataatc atggtttgtg actgattgct 7921 tattgataga ccatttgagg agactgtgct atcttgagag ccattaaggt tccaattgtg 7981 actatgaaaa tcaaaaatag cataaaaatg gccttgattg tcttgcttgc ttatgagaat 8041 
atttttcatt ttttttgttt ctcttatcga gaaaatcagt ctatacaata gctctaaaat 8101 cattaatagg tcattccaaa tgtttttggc tcttggtctg atttcttcaa atagaggtaa 8161 gccttcaaga atgggtttaa tatttttatt tttatccgtt ttataaaaat ctaaacccag 8221 attagcggca actcttttaa gttttctggt tcgacggtta cgaaaaaatg aaagaaataa 8281 aatttcaaaa attctagtac aggttatagt aataattagc cctatgaaag acccgaaggt 8341 tgggtgaaaa gcccaaatta ataattggat agagtagact aaggctttaa ggaaagtaaa 8401 gatttgagaa agtcgataag tattcatatc agataccttg gagttttcag taataaagtt 8461 aagcgtgagc agctattttt tttgttttta gtacctaaac aaggacttta acaacaataa 8521 tgaatataca aaaattatta aacctaacac agagagaaat ccctgctgag aaataataag 8581 ttgcaaagaa aaaacaaaag ctaatataac ataaggaatt ctttggaata ttgttagcaa 8641 atggtctcta cgtcttcgct taaggctatc ccgcacaaac atctcctcgg ctttgttcag 8701 ccagttgttg tcatcagaat tcaacacctg ttgcaagccc tcaaggtatg ggtcacgatg 8761 ccaaagaaaa tttgatgcag tctgagtttt taaagattct tgacgattcc aaaaactttc 8821 accttgctta aaggtaatca attcaccaga ctctgttaaa gactgctgaa attgccaacg 8881 tttttcagca gaagagaggc ggcgttgcaa taacagagtt tctcgctcat ctgcgatcca 8941 tttcacaagc attttccatc ctgtgattaa aacatcgtgg gctggttctg tataaggttc 9001 accccctgct tcttttcctt taacaactag acgagcttgt accagatgtt cgatgacttg 9061 cgccacgcgc tggttttttt cagagcttga gtatatcaat tctgaatcag gtacttgtcg 9121 ccgtgctgac tctccaccct ctacagccac catccgcagc attacttggc gtatcgtctg 9181 ctcataagcg gggtcttttg caactaggtt ctcatattcc tggtttgcac gttgggttag 9241 agaaccaacg actcttccta actcttcgta atctttcttt gttaaggcgc ggttatccct 9301 tcgctgctct aaatatttca gatacaattg actgagggtg aaagaaagta agggtaaagc 9361 accaggcatt tgtacgactt cgttaatcag ttcgtctact aaactgggag ggtcgaagta 9421 taccactttt tccgatgctg gtttctcaat tactgttcgg aattcatctt gggtcatggg 9481 tgggacaaca aaccgagtcg cattattcca aaaatctttg agtacagagt tttggaactg 9541 tgcttcaaag tcgagccgca gggtaataac tacatggacg ttttggggat atttggcgat 9601 cgcattttta ataagtttct gaaattgctc tcgttccttg tcactcttac agagagttat 9661 caattcttca aactggtcaa ctggtagcaa cagcttcgtc ttaggattag tctgactcca 9721 acgttcaacg 
atgtttgcta aagcttgctc attttttgcc agttcctctg gtgtaactat 9781 agttgcgagc aacaaacaaa tctgtgctaa agctttcaaa ggactttccc ctggtcgtat 9841 ggggtctaaa atcttaaatt gaaactcatt tgaattacgt aaacgtggta ataacccagc 9901 tttcactaag ctagattttc ctgtgccaga aacacccagc acaattgtta atggctgctt 9961 attgtcaatt gctttctggt acagtttttc aacaaggttc tctcgaccaa agaataagtg 10021 actgtctttt tcatcataag actccagtcc tcgataggga ttattgttta ggttaagtgg 10081 tggtgcatct tccaatttat ctcggtcaaa gtcaggcaac aagaagatga attccccttt 10141 atcatgtttt ttcagcggac acaaaccagg agtttgtcgc ttgtagtgtt tttctgtaag 10201 gatttcaacc tgattccgca gatacgagta aagctcagtt gctgtgatga taccatcttt 10261 attaaagtca gcaccttcat cagatccccc acgtaaggca tcaaaaagag cttttgcaaa 10321 gggagagtga atctcgttcc cgtcgatgac tttccctcgc tgtcccaaaa aatccaaagc 10381 tttttgatca tcagccgctg aagtaatgac ttgccaagct gcatcactaa tgaaacggtc 10441 atagcgttct ttatagactg taactttggg cacgatatct cgcttgaggc ttgcccaacg 10501 gaaagcgcct gcaaagcagc aatctaaaat tgctaacata tgccgacacg gaagtgcgtt 10561 gagggcatcg tgtaactctt gcattggcaa gtaggtgttg ctatccccta atgtggcatc 10621 ttgaggaata agataaccca ctggtccttc ttggttttct aaagcttcca aggcgattcc 10681 atgcccagca aagtaaaaga gaaagcgatc gtccttgttc accgtcactt tttctttgtc 10741 aagaaatatt tgtccttttt taaagtcttc aattaattgc ttaagctttt tgagggtaac 10801 gcgttgattc aatagcaact ggacttcata cttattttgc gcttgatatt gctgtttgag 10861 gttttgatgt tgctcttgga taatttgagc gagtttgaga gcatctggag ctgctgtttc 10921 tagcgctgga atgccatttt tatagttatt aataccgata agaatagcga aattacggtt 10981 aaatttatat tttgacatga aaataccttt taagcatata aatacaaaga tcagaaggga 11041 gcatctcact tttgtaaaca tagactgaac tacaaactga agttgcggct tttcggattc 11101 ttttcctaga tatttttgca aaactcacat cttgcacctc accgtataag cccgtacaaa 11161 ccaaaatcat gcacaataaa gctacattat ttttattggc ttctctcgct aaattaaggt 11221 tcttttttta atactgagtg actaaaaaac ataaaatttt cttggtgcaa gatgtcagaa 11281 aactgagatg ctcccggtga gaaaggcact catttagaca agataatgcc taatgagtgc 11341 ctttttgctg tttacttctg ttgtatgaga gccgcaacca aagcctctaa accaacttct 
11401 gtgaattcag taccgaagta actcatgtgg tcacaagcga cttctgtgat cttgggagga 11461 tgcgatcgcc ctttaggaat actcgtgata ctatccacag tcaccgcaat atcattcggt 11521 acaccaaaga aaggcaaagc tacaactttc ttaaataaac tttgttgtag ccgttctagt 11581 acgctggatt ttttccctgc ttgttgctct aatgctgctg gaataataga ggtattgcca 11641 gcgataatcg agtagggaat accaggatcg tcactcgctg ccagagaatt gataaaatcg 11701 gaacctggat tcatctgagt caaggctact ctaatatttt tttctaaagc gcccatcagc 11761 atagctacca cttttgcagg ccaagcaact agagaaagac cgttgagtcc aatagccaga 11821 gtgaatttta cccaatcttc aacaactgac caaggagaac ctgcatttgg tgtacccaac 11881 ataaacaggt gttggacgac tttatttcct ccttctcgct caataaacca acgagatacc 11941 aaaccaccca ttgagtgagc gataatgtgc agttctttac cgtgatttgc tcccaaacca 12001 acctctgcta gtcgccgttt taaagactga gcattctgct caattgaagt attcaagttt 12061 tcgtagtcaa aggtgagaac tacatcgtaa agttcgctga tagaacgctt ttgtccatct 12121 gcttgcaata tgggttgttt tatggtgcta actaaacttt cagtatcgcc ggtaatcccg 12181 tggatgtaga gggcaattcg ttttgcttgc gccactcgct gctgaacatc agccatatcg 12241 cgtttatagt ctactttctt gtcgtctccc acctcagcca cggctaaaat gggatactta 12301 aattcccggc caaaattctg gctgatgact ttttgaaaga agatgcgaat agatcctttc 12361 agactgcgct ctccctgact aactggtgct ggtaattgtt ctaggataat ttctgttttg 12421 ccgtcttgag ttgttgtacc acgacctaag ggcaaataaa actcaccgtc atagccgaca 12481 ggaagaaggt attcattgtc tgctagggga acatcaacta ataatttcaa aggtgcatct 12541 ggagtgacta ctttgtggtc tgcaacgtcg atcagttcca gtacacttaa tccagggtca 12601 gtaccgcgac tggaagtaaa ttggaagggt cgagtcactt ctggatcttc ccgcaggatc 12661 ggaggcacaa ttttattccc caaatctcgg ctgacttgcg gtgtagttgt gaggcgagca 12721 ttcgctacca aactgttatg cggcagtaat ttcacccctg cgtctaattg ttgttcttgt 12781 tcttgagaaa ctggaataga atccaagggg cgaaccgtag tgattgtgat ttcttcagcg 12841 taccaatcgt catatcttgc tgagttttta gcttcgattt gacggttttg agttcgattc 12901 atcaagcgct caaggctgct ttgatttgag gaatcgatat ctcttgatac aggtggtcga 12961 ggagcatcaa gttcatcctg atttagcaat ctagcatcaa actcgttgtt actaacaatc 13021 aatttaataa tatctttgta ctctgtaatt gagagttccc 
agtactcgtc tggtacactc 13081 agtaccaaag gatctccatc caacgcccaa gcttcttctc ctggttgaag cctcacgcta 13141 cctgcttcaa aaaacggagc gctaatagcg aaattgtcag aaagattgac caaagcacag 13201 taaagtgatt tgttagttgt attggtgagt tttagttgaa attctggctt ttgccactca 13261 ccatcccgat acttatactg caaacgcagc tgctttgatt gagatgattc ctcatcctta 13321 aagatgagtt ccatcttcac gtcaccagct ttaatttgag ttgctgcagt atttgaaagt 13381 tcagcaattg ttgtccaacg ggcaatatgt tctaggcgtt taattgcttt gtctgcgttg 13441 tcttttgtat aaccatcaat ttgctctaca aggggtcggt catctgtagg tcttgctatc 13501 aaatactgct cattgcgaca caacagacga aactgggctt tggcaagttc ttgctcttcg 13561 cggatatagg gtgagggttg gttattatta gaccccgctg tttttagtgc gtcacgggct 13621 tgagtgactc ccgtttcatc tccctcaaaa tatactccta aagggggtag cggtaaactg 13681 gtaacaactg ctttaaaaga agtgccagca gcagtcaggt tttgcacgcc ctcaatatct 13741 atcttgctct tggttggtaa taccttagtg acttttgctg tacccacaga ctttgacgga 13801 tcacgcaaat cgtcgatatt agcgtcaaag ggaaatagtg caagtgaagt tgtttcgccg 13861 tctcgtggtg gttgaacccc atgaacagca ccgccttcaa tcacccagcc atcggttttg 13921 tcattcttaa caataaagta gggttcaagt tcggcgatcg ctccatctag aaaaaatttg 13981 ttatcatctt ctggattatt cacttccaac tgaggagact gatctctgat ttgactgcgg 14041 actaaggcat ttgttcgccc aaataaatcc cgataagtca gcttctttcc attggttttg 14101 gaaagtgtat ccatcaagaa ataagagaaa ctacctcggt gtttgtcacc tccgtaatat 14161 tccttggctg tttggtagtc ttgacaagct gctagcagaa cgtgacgacc tttgggtatg 14221 ttccagcctg tggggttatc ttcaggttta acctcacgag ttcctaaaag tcgatttaag 14281 tcatccagcg taaaaataaa actatcaagt gggcgttctc gcttgtctgt ggggagacga 14341 cgttccttgg tctcttgcat tgggtctctt gtgccggaac cagagtggca gcagtccata 14401 ataatggtca tatgcgggtt cttttgtgct acttgggcaa tcaatactgc caattcttta 14461 tctgccaaat cccagccgcc ttcggtacga ctgtcataac agactaaagt ttcatcaaaa 14521 tgatctggct ccagttgcca aaattccttt ggtgctaact cctgagaacc atgaccactg 14581 taataaaaca gaactatatc atcttgccct gcaagactta gatgcgaacg gaagccatca 14641 ataactgcct tacgggttgc ttgttcatcc ttgagtgttt gtaaatgcag ttgatattct 14701 tgcttatcaa agcgttcgtt 
tatatactct tctattgctg taatgtcatt aacgcagcct 14761 tgtagacagc gaattggagc aggataatta tcgataccaa ctaacaaagc atataaatga 14821 cgagtcatga ttatttttgt ctcctaaagc tatcatgtct aaattgatcg gctcagatga 14881 cggcgttttt tgacctcacc aacaggcgtc ttgaattaaa aatcaagtta cgcatgaaga 14941 ttctggctct ttctgcttta tttttaatag ttgtcagatg ctaggactta tgcactatag 15001 aatcaagttg gctctaagtc gctgataagt gacgcccacc ataatgaatt tcccgccagc 15061 ttttacttag tacatttgga ttatgcaaat attttaaata acgacaatgc cccacgttac 15121 acaagcaaca ttagatgaca tcccgcagat tgaaaatttt cactgtaggt tgggtttcgt 15181 tcctcaaccc aacctacgaa atttcaattc cgaacccttt ttacggtgac gtgtacttag 15241 tacatttgga ttatgcaaat attttaaata acgacaatgc tccacgttac agaagcaaga 15301 ttagatgaca tcccgcagct ttgtgatttg ctgactattt tgtttactca ggaagcagac 15361 tttcaaccag acagcgccaa gcaatcacag ggattacgcc aaatcatcga acacccagaa 15421 gtcggacgta tccttgttct ccacgatggc tcgactataa tcggtatggt taaccttctg 15481 ttcactatca gtacagctct tggaagacgt gttgcaatcc tggaggatat gatcgttcat 15541 ccagactggc gtggcggtgg tgctgggtct actctccttc aggaggcgat ctcgtttgca 15601 caagcatctg gttgttcacg tatcactttg ctaagcgatc gcgttaactc ctcagcgatt 15661 cgtttctacc agcgacatgg tttcacgctt tctgatatgg ttcctttgcg tctgctgttt 15721 ccccaatgag caactataga ctctaagtca ctgatgggtg tcacccacca tacagcaact 15781 gactcactca cggtaaactc aactgcggat attactctgg agtctgcata tgcttattct 15841 cgatgctcct caagttgaag tcatcgttgg agaaaatgcg gttcatattt ctcaactgcc 15901 gaaagtatgg caagatattg cactaggaaa ggctggtgtt ggactggcaa atccacagag 15961 ttatgttgag atggcgcagt tgtttcaata caagttgcag cagggagacg ttgacttatt 16021 caacgaacgc ccagaactcg ctcacctcaa accatcattt aaggagttgt ttggattgtt 16081 ggcgcgggag acacttgagt tctacgggca agactttaag gttgagagat atccagattt 16141 tgaagcaatt ttgcgtgaat ttgagtcaaa gggagcagag ttttctaacg aagtcaaggt 16201 tgctcgcatt tgtctggaac tgtttaatga gttcgattat gaacttcccg ccagctttta 16261 cctagtacat cttgctccta tttaccgaga tagcgttttt gaagaacggg ctttacgatt 16321 tgatccgcgt gatacagaac acaagcgcgg ttgggatgct gtacttcatg ctggaaaagt 16381 
gttcgctgtg cagatgaaga tacaaagtat tgcttctaaa tacggcttga catatcagca 16441 cggttgtggt tgtgaatccc acttatcttc tattgatatg tcactgggag cgtttgacta 16501 tcagttaaat actgaaaagc gtcagcgatg gattcgcagt tttatttgga caacgtggta 16561 tgagtatgca tttttcccga ttgtaccgaa tactagatat ttagtgtaga tgtccacaga 16621 ctagaagtct gacgctatac aaaaaaagcc tgcggaagca ggcttaaata attgtatttt 16681 tgaaacgact tttatcggac agtctgcaac gccaaatctt ggaacattgg ggtgctgagg 16741 taacgttcgc caaaagaagg ctgaatcatc acgattaaac gacctgcatt tcctgggcgt 16801 ttcgcgactt gaatggcagc atataaagca gcaccagagg atataccaga taacaaacct 16861 tcttctgttg ctaagcgtcg tccataggcg atcgcctgtt catcactaac ggtgatcact 16921 tcatcgacta actttttgtc gagaacttct gggacaaatc cagcaccaat accttgaatt 16981 ttatgtggtc ctggttgacc tccggaaagg acagggctgt tagctggttc gacggcgatc 17041 gcctgaaaac tgggtttacg ttgtttgaga acttctgcaa ccccagtgat tgttccgcca 17101 gtaccgacac ctgcaatcac aatatccact tccccatcgg tatctgccca aatctcctct 17161 gctgtggttt ctctgtgtat tttgggatta gctgggttgc ggaattgttg taacatataa 17221 gcattaggag tgctagcgac aatttcttcg gctttacgaa tagcgcctcg cattccctca 17281 agccccggtg tcaattccaa agaagcacca tatgctcgta gcatggcgcg tcgttccaaa 17341 ctcattgttt cgggcattgt caaaaccaaa cggtagccac gcgccgccgc caccatcgct 17401 agtccaattc ctgtattacc agacgtaggc tcaaccagaa tgctttttcc tggcttaatg 17461 agtccctcgt cctcagccgt ctttatcatg cttgcagcaa tacggtcttt caccgaagct 17521 gctgggttca ttccttctaa cttcacaaca atcctggcta tacatccctc agcttgaggg 17581 attttgttta gctgaactaa aggagttctt ccaatcagtt ctgtaatgtc atgagcaatc 17641 cgcatattat taactcctta caaagatacc ttttcatcag ggtgtgtaga aattaagctg 17701 taaccagtta gtacatgaag agaaatttga aaaagttact ctcattcatt ttaaatgatt 17761 tgaagatcat cacagctttt gtgttatttt tgaacaaaat tctaaaatat caaattttgg 17821 atgtttttta aaaaaactca actatatatt ta // LOCUS NODE_1855_length_17774_cov_4.83238317774 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 17774)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 17774)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..17774
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            191..445
                     /locus_tag="DP116_16025"
     CDS             191..445
                     /locus_tag="DP116_16025"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016873711.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="UPF0175 family protein"
                     /protein_id="PRJNA477356:DP116_16025"
                     /translation="MRTVPIQLPETVFSALRKNPEEFVQEMRIAAAVKWYELGEISQG
                     KAAEIAGLTRAEFINALSRYRVDFMQYTTEELAEEIGNVD"
     gene            435..926
                     /locus_tag="DP116_16030"
     CDS             435..926
                     /locus_tag="DP116_16030"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016879296.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DUF3368 domain-containing protein"
                     /protein_id="PRJNA477356:DP116_16030"
                     /translation="MLINRVIINSSPLIVLFKSQQAELLPQLFAEILVPEGVFEEVTI
                     AGEDDAASRQLPRVSWIQRVEITTIAPEVAAWDLGKGESQVLSLALKTLANSAAIVDD
                     RAARRCGQVLGITTIGTGGILIRAKRRGLIKSVSQGIEALRDAGLWLSDNVVNLLKQQ
                     AGE"
     gene            1281..1979
                     /locus_tag="DP116_16035"
     CDS             1281..1979
                     /locus_tag="DP116_16035"
                     /inference="COORDINATES: protein motif:HMM:PF13432.4"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16035" /translation="MKQTILATESLLTLAGEPSTGVSQPAEATTVEENSLAGKYIKDF HSSSSSVGNDFAGILVTVGIILALGSTVCVLHKSLHLGKPSSSRADSLHTAENKRQST QEDITSGNQEITPTVYIEKAYTSWRQGDVQKALAELNNGIRLYPHDAYLYTERANFRR KNLGDNQGALEDYTQAIDLHPDNALFYLWRSQLYHEIGDILKAMTDYNTAIRLAPEDT MYHVSPTNANSLRG" gene 2801..4465 /locus_tag="DP116_16040" CDS 2801..4465 /locus_tag="DP116_16040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114495.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ShlB/FhaC/HecB family hemolysin secretion/activation protein" /protein_id="PRJNA477356:DP116_16040" /translation="MKLPKLSLHQTYSGDHNKKIPRSHPPVIPPPENLFPSPPQTPSP PEQLINEFAGTIEVERFEVVGSTVFSRKQLDDATKNFVKRQITFAELLQASEEINKLY REKGYVTTGAFIPGDQTFKVKGSVVTIKVLEGRVESIQVRGLKRLNSNYVRSRLKIAT GEPLNVNKLQRALQLLTLNPLIQNISANLASGSTPGTNVLDVRVTEAKTFSAQISLDN YRNPSIGSFQRQIQLNQANLLGLGDGLSVGYSNTDGSNAVEARYTLPINAYNGTLEFA YNYTDSSVIEKPFDDLDINGTAQDFSLTLRQPIVETPTEVFALGLTANRRESDVGFLE SLIGRRVGFPQPGANNNGETRLSILRFFQEWTKRDSQQVLAARSQFSFGLDVFDATTN KRAPDGSFFSWRGQAQWARLLAADTLVLVRADTQISDRALVPLEQFGLGGQRTVRGYR QDLLLTDNAFLASAELRYPILRVPEVGVLQVTPFFDYGTAWNSSGYSNPDPSNLASLG LGLLWQSNNLSARFDYGIPLIDVKSRNDETLQEKGLYFSIVYTQRF" gene 4756..7485 /locus_tag="DP116_16045" CDS 4756..7485 /locus_tag="DP116_16045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114494.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16045" /translation="MSRLIFLAALAQIAVIASPLALAADTRNYPQTQQSQPTQLVQQT DARQLLQQGLERYQREQFAPAVQVLQQAAEKFQTQGDTLNQALALNYVALAYQQLGQL PQASIAIAQSFDLLQKNRINSQEYITVRAQALNTQGQIELAQGKSEKALASWEEASTL YVKNRDKEGEIGSKINQIQALQALGLYDRARITLIDVNKLLQAEPDSLLKAKGLLSLG NALRVVGILDQKDPKKIEDFGSQQALEQSLAIASKLNSAELVAEIYLSLGNTAQARQK TDEAMEYYKKAAASSPLPITRLVAQTNQLRLSLKPPQQKSPETAQQSSNQLLDLSLLP QIQSQLDTLAPSRKSLYARINYAQTLACLRQRTEGRRTVVHGNCPKQGIGAQNIATNS IPEWKAIAQITATAVDQAKSLEDKRAEAYALGTLGGLYEQTQQWTDAQKLTQQALALS ESITAPDIGYRWQWQLGRILKGQKDEKGAKASYSKAVENLKSLRGDLVAQRDVEFSFQ EEVEPIYRELVSLLLQPGNKEPSQDDLDKARDVIESLKLAELDNYFRTACINAKPVKV DQVDRTAGVIYPVILGDRLEVIYSLPSSSTNQKQGRNLRHYTKILSQKEVEDKLDELR QKLETRSTPEFKVPSQEVYEWIIKPIESELEKRQIKNLVFVLDGPLQNIPMAALYDGK SYLVQKYNIALSPGLELLNPQPFARSELRTVAAGLSEEVRDFPALPAVKRELDEIKSI VRDSDVLLDKKFTRSAIKEAVKSFNAPVVHLATHGQFSSKPEDTFILTFDGKVNVNDL SNLLKTRATDQRGAIELLVLSACSTAEGDNRAALGIAGIAFQAGARSTLASLWVVDDQ ATAEIMGEFYKQLSKSNTTKAEALKKAQLSLLKNPLYDHPYYWAPYVLVGNWL" gene complement(7604..9727) /locus_tag="DP116_16050" CDS complement(7604..9727) /locus_tag="DP116_16050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_16050" /translation="MVASIRNLLTQPAVFASAIVSVLLVGAQQLGWLEHPELIVFDQM MQSRADVEQDKRLLIVGFTENDIQNLKQSSPNGNVLTTVLSKLEQYQAKVIGLDFFRD VPVDPGHKKLLSHIKQSSRIVSICKLGDDKEPVVPPPQGVAPETVGFADFSEDGDGVI RRNLLIASPDPKSKCAAEGSLGFQLALQYLNIPPEITEKRIKLGNTVFQRLEPDSGNY RNLDNRGFQILLNYGSKKSIAQQVSFTDVLNNRINPSLVKNRIVLIGSTAPSSQDVRI TPYSSGNKQDNSGKMPGVVIHAQMVSQILDAVSNKRRLFWFLPEWGQVLWIWGWAVVG GVVASRIQHPLYLGLATTASLAILLLSCFAIFTHAGWVPVVSPILGFLLVQGGVLAYT SFQNKQQKEKVALQIQDQNETISLLQSLLRDGGNHQTQTQGGIHNGLQLQGILNHRYK IIELLGCGGFSYTYLAEDTYRPGSPLCVVKYLQPARNDDLFLDVARRLFKTEAEILET LGQHEQIPQLMAYFEENKQFYLVQEYIQGHSLHQELTPGKRFSDVQVVHFLKDVLQIL AFVHSHGVIHRDIKPSNLMHREKDQRTVLIDFGAVKQIQPQHPTENPTVAVGTIGYSP PEQFMGQPRLNSDIFALGMIAIQALTGTPAKYLERDSTTTELVWRHLAETSEDLAAVL DKMVCYDFRKRYQFVEEVLYSLRNF" gene complement(9914..11560) /locus_tag="DP116_16055" CDS complement(9914..11560) /locus_tag="DP116_16055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114500.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flavodoxin" /protein_id="PRJNA477356:DP116_16055" /translation="MLKIKLIDSNTSDQFKDVDLIPETKPNQECFIGRLANCDLLLEA AEVSRMHGKVSMKRGNYYFVDLASSGGSRINGEKAQINQDYLLNPGDKIQIGRFILMI LQIRTEEDETFMESQKRAQQILVRDSESQPPPPRVSLEKFMPLAILEPSQVQRWVKGE LTLTCIGVIDETHDVKTFRFVAQPPVLFTYNPGQFVTLKQEINGKQVSRSYTISSSPS RPHTLEITVKRVPHSLGESSLPEGLVSNWLHDHVTVGSRIHCNGPLGKFTCFTNPLPK MLFLSAGIGITPMMSMSRWLCDTASDCDIIFFHSVRTLRDFIFRQELELMSARHPNFR LVVSTTRKEPGQTWFGLTGRFDTAMLQVVAPDFRERSVYVCGPHGFMENVNQILQTFD YPMQNYHEESFGPPRKSSKFPISKQTVISTLDDPVTPVVDYRFKQSFIHLPRETVSDS NVGRDAARFSNKSSVIFSQSRIEAYSDGEESILHLAEQQGVRIRNSCRSGVCGSCKKL KLQGQIYTEGEPEALEESERQQGYVLTCISYPIGRVVIDA" gene complement(11675..12862) /locus_tag="DP116_16060" CDS complement(11675..12862) /locus_tag="DP116_16060" /inference="COORDINATES: protein motif:HMM:PF07282.9,HMM:PF12323.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_16060" /translation="MQLGFKTKLKTTLFQQQLFAQHAGFARWVYNWGLATMENFYQSG VKLSYRDARKFYTNVVKPEYPWMTQMSSRVYVYAFEQLKEAYKRFFNGIALKPTFKKK GKSKDSFTVDWNGKVKRTDGKSIKLPAPLGIVCTFEQLPSVDIKKATISHGADGWYIS FNYEIPDISVGQLDECNHVDDIVGVDLGINSLAVAVSLNTVKTFDNPKKYRKSKKQLA RLQRQLQRKTKGSKGHLKAKNRLARYHKHIADTRSHTINHMTTTICKNHAVVVIENLN VEGMMQNHKLAGAVADCGFGEIERQFQYKTQKFAHRLIQVDRFYPSSQLCPKCGAKQK MPLNLRTYKCDCGYERDRDENAAFNLCLYGWEHNGFQTGELPGSDRGGLKLPTAPVEA IKE" gene complement(13204..16500) /locus_tag="DP116_16065" CDS complement(13204..16500) /locus_tag="DP116_16065" /inference="COORDINATES: protein motif:HMM:PF00027.27,HMM:PF16697.3" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16065" /translation="MISQVSETKMHFIRWLLAIGWLMLIFSLFYAPISPWFTSSIHLD FNNYQNVSTCVKVQGVCLKEQPYTIGARIFWTVVIPSAIVILLVFGHEFWRRICPLYF FSQIPRALGIQRKRKTVSEETGSVRSELAGIEKESWLGRNYLLLQFGLLYLGLNIRIL FVNGDRIAMGFFLLFTIASAMTVGFLYKGKSWCQYFCPMAPVQMFFTGRRGLLGSEAH LQPPQTITQSTCRTVDSSGNEKSACVSCQSPCIDIDAERSYWEGITRPDQKLVFYGYF GLMLGYYVFFYLYSGNWEYYFSGGWLYEQDPLRTLFNPGFYIFGRPVPIPKIIAAPLT LAVFCAASYLVGEFLERAYRTYLKKKNKYHDEEQVLHVCFVLCTFIAFNVFFIFGIRP IFKPLPDWAIIVLNILSVLVSSLWLERSLAGSKERYARESLANNLRRQLNKLAVDWSK LLEGRSLQDLIPDEVYVLAKVLPGFRRVDQVRVYQGVLRDALEEGNMSYAESLHALKD LRNQFNITDEDHYSVLAELNVEDPTLLTPQKQPSRENKLRIESYRRALEMLIQKQSER GISPPDAIDRKQKQIQTLRQEYAINIDEHEQVLAQMLNQKGVLLRTAETLLAQLQELA VRDQILYNSVPNRQAPVYVLLRKAVQEKKKLIATQLLRILEFLKQSPEALNIARSTGV LAANVIVEILESNDEQLSWRKRLNPRVLTSLQQQDQLSHLVQISTQLDVGASTANESQ TKRYSLDVTNQLTNTRVPRSKAIDQILLELLHDLDPLVQAASLYALDQHNPPVAMQQA RQILNSKENKDWLVQETAQIILGQNQQRKQPADVPTLIAQVKEMERTERRTFQQPTIR VGRGHENDIVILDNRVSRQHAIFYLDQTGVSVKDLGSGNGLRIGKEHIHDQQKQLKQG DIIRFSSEDDLFLLVQWQMQPLQGDALSVALPQAIAQSTGTLEKLLWLYNSSFFQAPK ANVLVELARNATVREYQPQQEICRIGATAFELIILIDGEAMLLSGNTANNQTILPGQV IGELEVLTHSYYVATVVAVKQGTRALAIKAKDFEAALSHNPPLAINVLQVVSHRLQES LGHTAAVTQL" gene 16914..17651 /locus_tag="DP116_16070" CDS 16914..17651 /locus_tag="DP116_16070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111761.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16070" /translation="MKTFLRHGFMTAPLAAIIGFSVVIPLIPAEARITFKAPAALGVP GRRVAGASRTMQQCLLDNKPLTAIVPQSNIGLTTAANPVLLFYIPKTSAQAQLELVVQ TANEKNIVTKQSYKPSSKAGVVSIPLTNASLEVGKDYHWFFSIVCNPKARSKDHLVHG GIKRIQAEPPLTTQLKNASPQQVVNVYAQAGIWQDSVAKLAGLRYSRPNDAELKADWE GLLQSVEFKPDVVNAPLLQGGEAPQQQ" BASE COUNT 5232 a 3793 c 3692 g 5057 t ORIGIN 1 tccccccgtt gtagcacctg gcgtgaggga tgcccgacag ggcagggtga ggttctttgt 61 ttttttataa gtgttcatcc ggatataaaa taagccaact ggatgagcca atttttggtg 121 aggactttga gtaggattta tgtaacggaa tggagaatag tccaattagc aataacaaaa 181 ggagtcttct atgagaacag ttccaatcca attaccggaa accgtatttt ctgccctccg 241 caaaaatcct gaagaattcg tccaagaaat gcggattgcg gcagcagtaa aatggtatga 301 actaggcgaa atttctcaag gaaaagcggc agaaattgct ggactcactc gggcagagtt 361 tattaatgct ttgtctcgtt accgagtaga ttttatgcaa tatactactg aggaattagc 421 cgaggaaatc gggaatgttg attaaccgcg tcatcattaa ctcttcacct cttattgtcc 481 tttttaaaag tcagcaagca gaattactac cccaattgtt cgcggaaata ttggttccgg 541 aaggtgtatt tgaagaagtt acgatcgcag gtgaagatga tgctgcatca agacaattac 601 ctagagtctc ttggatacaa cgagtagaaa ttactacgat tgcccctgaa gttgcagctt 661 gggatttagg caagggagaa tcacaagtgt tgagtctagc tttgaaaact ttagccaata 721 gtgcagctat tgttgacgat agagcagcgc ggcgttgcgg tcaagtcttg ggtatcacta 781 ccattggtac gggaggcata ctcataagag ccaaacgacg cggactgatt aagagcgtat 841 cacaaggaat tgaagcttta cgcgatgcag gtttatggtt atctgataat gtggtgaatc 901 tgctcaaaca acaagccgga gagtaaacgg ttagtcgctg catgagatga cgttaacgca 961 ccctactcat atatgatgca ctttatcgag cacctcttaa gtttctataa tggtctatca 1021 gatactagac tgtgatacct gccctaaggt actagaacct atcttagata gaatcataat 1081 ttttcctccg gactagttcg atttacagaa gttaaataaa acagactttg gaaaaataag 1141 ataatataaa caaccattag tttgaagagt ttggtaaaga atcaagtctc ctctggctag 1201 agtctttttt tatcaaataa aatatttgta tttatgttac ttattcaaaa gcagtgccat 1261 gtagtggagt acaccagatt atgaagcaga caatccttgc taccgaatcg ctgctcactc 1321 tagcaggaga acctagtaca ggtgtttccc 
aaccagcaga agccactaca gttgaagaaa 1381 actctctagc cggaaaatat ataaaagact ttcatagctc tagtagcagt gtcggaaatg 1441 attttgctgg aatacttgtg acagtgggaa ttatattagc cctcggctcg actgtttgtg 1501 ttctacataa atctcttcat ttgggcaagc cttcttcaag cagggcagat tctttgcaca 1561 cagcagaaaa taagcgtcaa tctacccaag aagatatcac atctggcaat caagagatca 1621 ctcccacagt atacattgaa aaagcttata cttcctggcg acaaggagat gttcaaaaag 1681 cgcttgcgga attaaataat ggaattcgtc tttatcccca tgacgcttac ctttacactg 1741 aacgagctaa ctttcgtcgc aaaaatttag gagacaacca aggcgcactt gaggattata 1801 ctcaagctat tgatctccat cctgacaatg cccttttcta cctttggcgc agtcagcttt 1861 atcatgaaat cggcgatatc ttaaaagcaa tgacggatta taatacagcg attcgtcttg 1921 cccctgaaga tactatgtat cacgtttcgc caacaaatgc aaactcttta agaggttgac 1981 tgttttgcaa tacgcttgag taagcgcaaa aaacataaaa ctatgtaggt tgggttgagg 2041 aacgaaaccc aactcctggc ttggttttgt tgggttgcgc ttcgcttaac ccaacctaca 2101 attctgagtg ttttgcaaca gaattgcgtc aatccggttt tagctgtaat atactaccgt 2161 tagcagatta tcaaacaatt ttcatccgta ctaagcgata acttaatttt tgtaacaaaa 2221 gattagattg ttgtcgaatt aatggtaatc attctttcta tcattttttg atggaaatat 2281 gtttgctatt tgtacaacag gtaaaacgac tcgttcacca cacaaacttt gcagggtttg 2341 tagtgagcgc ttaatgggaa gacagtggtt attactcaga gcttgcacgc cgacccaaca 2401 accgcctagg gttagaaacc cagggctgca tagcgaaagt cctctggtga ggactaaatg 2461 ctgactcacc aaaattttca gtccatttta atggacttgg gctattagcc tggaacttga 2521 gttcaaggtg ggcgaagttg ctaattaaca atggtgcaaa atctcagtta ctacaaatta 2581 tagttacccc tcatttttgg cttagaagac tactactcat gtagtgtcat cgctcttact 2641 ctccagcatc tacccatgcc aaataaaagg tttttaagtc tctattatta ccagttttcg 2701 cctagtctac tcatactgag cctgatggta ttcaatagct tgcaccaaca gccgttgtca 2761 gcacaaacag tgaatgcatc tgggggtgtc gaattaaaaa ctgaaattgc ccaaactcag 2821 cctccatcag acatattccg gcgaccacaa caagaaaatc ccccgatcgc accccccagt 2881 tatcccacca ccagaaaatt tatttccctc tcctccccaa actccttccc ctccagaaca 2941 gttaatcaat gagtttgccg gaaccattga agttgagcgt tttgaggtgg ttggtagtac 3001 tgtgttcagc cgcaaacaac tcgatgatgc aaccaaaaat 
tttgttaaga gacagattac 3061 ttttgctgaa cttttacagg cttctgaaga aataaacaaa ctttatcggg aaaaaggtta 3121 cgttaccacc ggggctttca ttccaggcga ccaaaccttc aaagttaaag gaagtgttgt 3181 caccattaaa gtcttggaag gtcgcgtgga aagtatccag gttagaggtc tgaagcggct 3241 gaactctaac tatgttcgca gtcgtttgaa gatcgccact ggcgaacctc tcaacgtcaa 3301 caaattacaa agagcattac aactacttac actcaatcca ctgattcaaa atatttctgc 3361 gaaccttgca agtggatcaa cccctggtac taacgtgcta gacgtcaggg tgacagaagc 3421 aaaaacattt tctgctcaaa ttagcttaga caattatcga aaccccagta ttggtagctt 3481 tcagcgacaa atccaactta accaagctaa cttgttggga ctgggagatg gtttaagtgt 3541 gggctactcc aacactgatg gtagcaacgc tgtggaagct cgttatactt taccgataaa 3601 tgcttataat gggacgctgg aatttgccta taactataca gacagctctg tcattgaaaa 3661 accctttgat gatttagata tcaacggaac cgcacaagac ttttctctga cactgcgtca 3721 accaattgtg gaaacgccca cggaagtatt tgccctcgga ctgactgcaa atcgccgaga 3781 aagtgacgtt ggctttttgg aatctctcat tggtcgccga gtaggatttc cccaacctgg 3841 ggctaataac aatggagaaa ctcgcttatc aatactgcgg ttttttcaag aatggacaaa 3901 gcgtgatagc caacaagtgc tggcggcgcg atcgcaattt agttttggct tagatgtctt 3961 cgatgcaacc acgaacaaaa gagcacccga tggtagcttt ttttcttggc gaggacaggc 4021 gcagtgggcg cggcttttag ctgctgatac attagtgctc gtccgtgcag atactcaaat 4081 atccgacagg gcattagttc ctttagagca attcggtttg ggaggacaac gcacagtacg 4141 cggatatcgt caagacctgc tgttaacaga taatgctttt ttagcgtctg ccgaactcag 4201 gtatccaatt ttacgagtac cagaagtcgg cgtgttacaa gtgactccgt tttttgacta 4261 tggtacagcc tggaatagct ctggttacag taaccccgac cctagtaacc tggcttccct 4321 gggtttgggg ttgctatggc aaagcaataa tttaagtgcc agattcgact atggtattcc 4381 tcttattgac gtgaaatcac gcaatgacga aactttgcaa gaaaagggtt tatatttttc 4441 catcgtttat acccaacggt tttaacagtt atcagttacc agttatcagt agttttgtag 4501 gttgggttga gggacgaaac ccaacaaatc gatcaaatgt tcggttccac ttcgttccac 4561 ccaacctaca aagttcagtt ttgtaggttg ggttgaggga cgaaacccaa caaatcgatc 4621 aaatgttggg ttccacttcg ttccacccaa cctacaaagt cttaatattt actttataaa 4681 tcagttctta agctgtctga ctagttttct tgattttgaa ttttgaattt 
tgaattttga 4741 attccttaat agcttatgtc tagacttatt tttctcgctg cacttgcgca aattgcagtc 4801 atcgccagcc cacttgcttt agcggctgat acaagaaatt atccacagac tcaacaaagt 4861 cagcctaccc aacttgtaca gcaaactgat gcaagacaac tcttgcagca aggcttggaa 4921 cgctaccaaa gagaacagtt tgcaccagcc gtgcaagttt tacaacaagc tgcagaaaaa 4981 tttcaaactc aaggtgatac cttgaatcaa gctttagcat tgaactatgt tgcattggct 5041 tatcagcagc ttggacagtt acctcaagcg agcattgcaa ttgcacagag ttttgatctg 5101 ttacaaaaaa accgtataaa ttctcaagaa tacatcacag ttcgcgctca agcactgaac 5161 acccaaggac agatagaatt agcacaagga aaatctgaaa aagctcttgc tagttgggag 5221 gaggcaagca ctctctacgt gaaaaaccgg gataaagagg gggagattgg tagtaaaata 5281 aaccagattc aggcactgca agccttgggg ctttacgatc gcgctcgtat aactttaata 5341 gacgtaaata aactactgca ggcggaacct gactctctcc tcaaggcaaa gggactgtta 5401 agcttgggta atgctttacg agtcgtagga attttagacc agaaagaccc aaagaaaata 5461 gaggactttg gttcccaaca ggctttagaa caaagtttag caatagcttc taaactaaac 5521 tctgcggaac tggtagcaga aatttacctg agtttgggaa atacagccca agcgcggcaa 5581 aagactgatg aagcaatgga gtattataaa aaagcagcag catcttctcc tttaccgata 5641 actcggctgg tagcgcaaac aaatcaactg cgtctatccc tgaaacctcc acaacaaaag 5701 tcaccagaga cagcgcaaca atcatctaac cagcttttag atctgagttt gttacctcaa 5761 attcaatctc agttagatac cttagcacct agtcgtaaga gcctttatgc aagaattaac 5821 tatgcgcaaa ctttggcttg tctcaggcag agaactgagg gaagacgaac tgtagttcat 5881 ggaaattgtc ctaaacaagg gataggtgcg caaaatatag caactaacag tattccagaa 5941 tggaaggcga tcgcccaaat caccgcaaca gctgttgatc aagccaagag tttggaagac 6001 aagcgtgctg aagcctatgc tttggggact ttagggggac tatacgaaca gactcaacag 6061 tggactgatg cacaaaaact cacccagcaa gccttagcac tttccgaaag tataacagca 6121 ccagatattg gctatcgttg gcaatggcag ttaggtcgca tactcaaagg ccaaaaagat 6181 gagaaaggag caaaagcatc atatagcaaa gctgttgaga acctcaagtc tttacgcggc 6241 gatttagttg cccagcgcga cgtggaattt tcttttcaag aagaagttga gccaatctat 6301 cgcgaactgg ttagtttgtt gttgcagcca ggaaataagg aacccagtca agatgatctt 6361 gacaaagcta gagatgtgat agagtcactc aagctagcag aactagacaa ttacttccgc 
6421 acagcttgta taaatgcaaa acctgtcaag gttgaccaag ttgatcgcac agcaggagtg 6481 atttatccag tgattttggg cgatcgccta gaagtgatct attccctccc ttcctcttca 6541 accaaccaga aacaaggtcg aaacttacgt cattacacaa agattctatc ccaaaaagaa 6601 gtcgaagaca agcttgatga actgcggcaa aaattagaaa cccgctcgac tccagagttt 6661 aaggttccgt ctcaagaagt ttatgaatgg atcattaaac cgattgaatc tgaattggaa 6721 aaacgccaaa tcaaaaacct ggtgtttgtc ttggatggtc cactgcaaaa tattccaatg 6781 gcggcgttgt acgatggaaa aagttatctt gtccaaaaat ataacattgc tctgtctcca 6841 ggcttagaac tattaaaccc ccaacctttt gcacgcagcg aactgcgaac tgttgctgct 6901 ggactcagcg aggaagtcag agactttccg gcattacctg ctgtcaaacg cgaacttgat 6961 gaaattaagt caatagttcg cgacagtgac gtgctgcttg acaagaaatt taccagaagt 7021 gcgataaaag aggctgttaa gtcgtttaat gcgccagtgg ttcacctggc aactcatggt 7081 cagtttagtt ctaagccaga agataccttc attttaactt ttgatggtaa agtgaacgtc 7141 aacgatttaa gtaacttact caaaactagg gcaactgacc aaagaggtgc aattgagcta 7201 cttgtcctga gtgcttgttc aacggctgag ggagacaata gagctgcatt aggaattgct 7261 ggaatcgctt tccaagctgg ggcacgcagt acattggcat cgctttgggt tgtagatgat 7321 caggcaactg ctgagatcat gggtgagttt tacaaacagt taagcaaatc caacaccact 7381 aaagcagaag ctttaaagaa ggctcaattg tccttgttga aaaatccttt gtacgatcat 7441 ccgtattatt gggcacctta tgttttggta ggtaactggc tttaagaaag tagggtgggc 7501 attgcaatgc ccaccctaca gaatttgcta cttataaagg agagggaaaa gggatccggt 7561 gtgtagacgg taggagaggg gcagggggtg aggtaattgc ttgttaaaaa tttcgcagac 7621 tgtataaaac ttcttctaca aattggtatc gctttcggaa gtcgtagcac accattttgt 7681 ctaaaacagc agccaagtct tcgctagttt ctgcaagatg tcgccaaacg agttctgttg 7741 tagtagagtc tcgttccaga tatttagcag gagtcccagt taaagcttgg atagcaatca 7801 ttcccaaagc aaagatatcg ctattgagcc ttggctgtcc cataaactgc tcaggaggtg 7861 aatatcctat agtaccgact gctactgtcg ggttctctgt tgggtgttga ggctgaattt 7921 gtttgacagc gccaaagtca atcagtacag ttcgttgatc tttttctcgg tgcatcaagt 7981 tgctaggttt gatatctcga tggataacac cgtgactatg aacaaacgcc agtatctgca 8041 aaacatcttt caggaaatgc accacttgaa catcagaaaa acgtttaccg ggagtcagtt 8101 
cttgatgtaa ggaatgccct tggatatact cttgtaccag ataaaattgc ttattttctt 8161 caaaataagc catcaattgt ggaatctgct cgtgttgacc taaagtctct aaaatttctg 8221 cttccgtttt aaataaacgc ctagcaacat ctaaaaacag atcgtcatta cgagcaggtt 8281 gcaagtactt aacgacacac agagggctgc ctgggcgata agtatcttca gccagatagg 8341 tataactaaa tcccccacaa ccaagtaatt cgataatttt ataacggtga ttcaatattc 8401 cttgtagttg taagccatta tgaattcccc cctgtgtctg cgtttggtga tttccaccat 8461 cgcgcagaag cgattgcaat agggagatag tttcgttttg atcttgaatt tgaagtgcta 8521 ctttttcttt ctgttgcttg ttttggaatg atgtataagc aagaacaccc ccttggacga 8581 gaagaaaccc taaaatcgga gatactactg gtacccaccc tgcgtgagta aagatggcaa 8641 aacaactcaa aagcagtatc gctaaggaag cggttgttgc taaccctaaa taaagtggat 8701 gctgaatgcg cgacgccaca actccaccta ccacagccca gccccaaatc caaaggactt 8761 gtccccattc aggtaaaaac caaaacagtc gtcttttatt ggaaacagca tccagaattt 8821 gactcaccat ctgagcatgg atgacaactc caggcatttt gccagaatta tcctgtttgt 8881 taccactgct ataaggtgtg atgcgaacgt cttgggaact gggcgcagtt gagccaatca 8941 agacaatacg gtttttgacc aaactcggat tgatgcgatt gttgagcaca tctgtaaagc 9001 tgacttgttg agcaatgctt ttcttagaac cgtagtttag taatatttga aaaccccgat 9061 tatcaagatt cctgtaattc cctgaatcag gctctagacg ttgaaacacg gtattaccca 9121 gtttaattct cttttcagta atttctggtg gaatgtttaa gtactgcaga gctaattgaa 9181 accccaggga accttcggct gcacactttg acttgggatc gggggaggct atgagtagat 9241 tccgccgaat caccccatct ccatcttccg agaaatcagc gaatcctacg gtttccggtg 9301 ctacaccctg tggaggtggt acaacaggct ctttgtcatc accaagtttg cagatactca 9361 caatgcgcga actctgtttt atatgactca ggagcttttt atgccctgga tccactggta 9421 catcgcgaaa aaagtctaaa ccgataactt ttgcttgata ttgttctagt ttgctcaaga 9481 ctgtagttaa aacattgcca ttgggcgacg actgctttaa gttttggatg tcattttcgg 9541 taaatcccac aatcaggagg cgcttgtctt gttcgacatc tgcacggctt tgcatcatct 9601 ggtcgaaaac tatcagttcc ggatgctcta accatcctaa ctgttgagcg ccaaccagca 9661 acacactcac gattgcactc gcaaaaaccg caggctgtgt cagtagattt ctgatgctgg 9721 caaccataag caatgaccaa tttaatatat aagaaattat tatatttttt tactgttatg 9781 aatttttgca 
aaagtaggac tgcctagaga gtaacaagaa gatgaaattt ctgttttcca 9841 aggctcccct gtttctattg ttccatatca ttatggttct tcaaagaaaa tcaaataaat 9901 tgaaaattag ttattatgca tcaatcacaa cccgtccaat aggataggag atgcaagtaa 9961 gaacgtaacc ttgttgacgt tcactctctt ctaaggcttc cggttcccct tctgtgtaaa 10021 tttgtccctg taacttcaac ttcttacagc taccacacac tccagaacgg caactattgc 10081 ggattctcac cccctgttgt tcagccaaat gaagaatcga ttcttcgcca tcactgtagg 10141 cctctattct tgattgagag aaaatcacag aacttttgtt ggaaaacctt gctgcatctc 10201 tacccacatt tgagtcagaa acagtttctc ggggcaaatg tataaaactt tgtttgaacc 10261 tatagtcaac aacaggagtc acaggatcat ccaaagtgct gattacagtc tgcttagaga 10321 taggaaattt agaacttttg cgcgggggtc caaagctttc ctcatggtag ttctgcattg 10381 gatagtcaaa agtttgcagg atctggttga cgttttccat aaaaccgtgc ggaccacaga 10441 catacacact gcgctcccta aaatcgggtg caacaacttg tagcatagct gtatcgaacc 10501 tgccagttag accaaaccaa gtctgtccgg gttctttgcg agttgtggaa accaccaaac 10561 gaaaatttgg atgccgtgct gacatcagtt ctagttcttg ccggaaaatg aaatcacgca 10621 gggtgcgcac actatgaaag aatattatat cacaatcaga tgcagtatca catagccacc 10681 gggacataga catcataggt gtgataccaa tccctgcgct gaggaacaac atttttggga 10741 gggggttggt gaagcaggta aactttccca atggtccgtt acagtggatt ctactgccaa 10801 cggtcacatg atcgtgcaac cagttagaaa ccaaaccctc tggtaaactc gattcaccaa 10861 gtgaatgagg aacgcgtttg accgtgattt ctaaggtatg gggacgagag gggcttgatg 10921 aaatggtgta agaacgcgaa acctgtttgc cattgatttc ctgtttgagg gtgacaaatt 10981 gacctgggtt gtaggtgaac aacaccggag gctgtgctac aaagcgaaag gttttcacat 11041 catgtgtttc atcaatcaca cctatgcaag taagtgttaa ctctccttta acccagcgtt 11101 gcacttggct tggctctagg attgccagag gcataaattt ttctagagaa acacggggag 11161 gtggcggctg actttcggaa tcgcgaacga ggatttgttg tgcccttttt tgtgattcca 11221 tgaacgtttc gtcttcctct gtcctaattt gtaaaatcat caggataaac cgaccaattt 11281 gaatcttgtc acctgggttg aggagataat cctgattaat ttgagctttt tcgccgttaa 11341 ttcgggaacc accactgcta gctaaatcaa caaaataata atttcctctt ttcatggaaa 11401 cttttccatg catccggctg acttcagctg cctctaaaag taaatcacag ttcgcaaggc 
11461 gaccaataaa acactcctga tttggcttag tttctggaat taaatctacg tctttgaact 11521 ggtctgaagt attagagtct attaacttga ttttaagcat atctttttgt atgaaaaaac 11581 gttaataatg aatataaatt ttctttgtgt ttgcaatata ctcatattca gttataattt 11641 cattggataa accaggacaa accgggctag cctcttactc ttttatcgct tcaacaggtg 11701 ctgtaggcag ctttaaccct ccacgatcag aaccgggtaa ctctccggtt tgaaaaccat 11761 tatgttccca tccatataaa cataaattaa acgctgcatt ctcatccctg tcacgttcat 11821 aaccacaatc gcatttataa gtacgaaggt ttagcggcat tttctgtttt gcaccacatt 11881 taggacacag ttgagaggat gggtaaaatc tatcaacttg tattaaccga tgagcgaact 11941 tttgcgtttt gtactgaaac tgacgttcaa tctcaccaaa tccgcaatca gccacagcac 12001 cagctaattt gtgattttgc atcataccct caacgttgag attttcaatc actacaaccg 12061 cgtggttttt gcatatagtt gtagtcatat gattgatggt atgactgcga gtatcagcta 12121 tgtgtttgtg gtatcgagct aatctatttt tagcttttag atgtccttta ctgccctttg 12181 ttttacgttg taactgccgt tgcagtcttg ctagttgctt tttcgatttg cggtattttt 12241 tcggattatc aaaagtctta actgtattta aactgaccgc tacagccaaa ctatttatac 12301 ccaaatcgac accaacaata tcatcaacat gattacattc atctaattga cctacagata 12361 tatctggtat ttcgtagtta aaagaaatat accaaccatc tgcaccatga ctaatagtgg 12421 cttttttaat gtcaacagaa ggtagttgtt caaaagtaca aacaattcct aatggtgcag 12481 gtaatttaat actttttcca tctgttcgct tgactttacc attccaatca acagtaaaag 12541 aatctttgga cttccctttt tttttgaaag ttggtttgag agcaatacca ttaaaaaaac 12601 gcttgtatgc ctctttaagc tgctcaaatg cataaacata aaccctagat gacatctgtg 12661 tcatccaagg atattcgggc ttgactacat ttgtataaaa cttgcgagca tcacggtagc 12721 ttagtttgac accactttga taaaaatttt ccatcgtcgc aagtccccaa ttataaaccc 12781 aacgggcgaa acctgcatgt tgagcaaata actgttgctg aaaaagagtt gttttcagtt 12841 tagttttaaa acctagctgc atgttgctcg atagaaatgc aatatttatc ttgatacttt 12901 agcattagta gtatgtttct acttatgacc aatattgtct gatttgttga aagtggcgag 12961 tttgagcgct attttcgggt ttgtccaacg tttttattat ttagtgtttc accctaactc 13021 gtgtaactcg aaagtaaaaa gtacagccat ccaaggttca tactttcagc gtggactctg 13081 gatggcgacc acagcttcag aacttacgca aaataacgtt 
agcgtagcgt tgcaagagcg 13141 cgcatactac aaaggacaca taggaataag agttttcgag agttcttgcg taagtcctaa 13201 cctttaaagc tgagtcacag cagctgtatg ccctagactt tcttgaagac gatgactcac 13261 cacttgtaag acattgatcg ccagtggtgg attatgtgaa agcgccgctt caaaatcttt 13321 tgccttgata gctaaagccc gagtcccttg cttgactgct acgactgttg ctacatagta 13381 gctatgggtt agaacttcta attctcctat aacttgccca ggcaaaatcg tttgattatt 13441 cgctgtattg cctgaaagta gcattgcctc cccatcaatc aggattatga gttcaaaagc 13501 agttgctcct attctacaaa tttcctgttg gggttgatat tcgcgtacgg ttgcattgcg 13561 agcaagttca accaagacat ttgcctttgg tgcctggaag aaactgctgt tataaagcca 13621 tagcaacttt tctagggttc ctgttgactg ggcgattgcc tgcggcagag ctacgcttaa 13681 cgcatctccc tgtagaggtt gcatttgcca ttgtacgagg agaaacaaat catcttcgct 13741 gctaaaccga ataatatctc cctgtttcag ttgcttttgc tggtcgtgga tatgttcttt 13801 accaatacgc agaccattac cgcttcccaa atccttaacg ctgacacctg tttgatctag 13861 ataaaatatg gcgtgctgtc gggaaacacg gttatccaat atcacaatat cattctcgtg 13921 tcctcgtccg actcgaattg tcggctgttg gaaagttcgc ctttctgtgc gttccatttc 13981 tttgacctga gcaattaaag ttggtacatc agcaggttgc ttgcgttgct gattctgacc 14041 aagaatgatt tgggcggttt cttgtaccag ccaatcttta ttttccttgg aattcagtat 14101 ttgacgagct tgctgcatag caacaggagg attgtgctga tctagagcat aaagactggc 14161 ggcttgcacc aacgggtcta agtcgtgtag gagttcaagc aagatttgat caattgcttt 14221 tgaacgaggt acacgtgtat ttgttaattg attggtaaca tccagagaat agcgttttgt 14281 ttgtgactcg ttcgcagtgc tagcgccaac gtcaagttgt gttgatattt gcaccaggtg 14341 actgagttga tcttgttgtt gtaatgaagt caggactctt ggatttaaac gcttgcgcca 14401 gcttaactgc tcatcattag attccaaaat ttctacaatc acatttgccg ccagtacgcc 14461 tgttgaacga gcaatgttga gggcttcagg agattgtttt aaaaattcta agatgcgaag 14521 taactgagtg gcaatcagtt ttttcttctc ttgcactgct tttcttagca aaacataaac 14581 aggtgcttgg cgattgggta ctgagttgta gagtatctga tcgcgtactg ccaattcttg 14641 caattgagcc aaaagtgttt ctgctgtgcg tagcagtaca cctttttgat tcaacatctg 14701 ggccaagact tgttcgtgct catctatatt aatggcatac tcctgtctca atgtctggat 14761 ttgcttttgc ttgcgatcaa 
ttgcgtctgg cggagatatc cctctttctg actgtttttg 14821 gataagcatt tctaacgcac gccgataact ctcaatacgc aatttatttt cacgactggg 14881 ttgcttttgc ggagtgagta aggttggatc ttctacgtta agttctgcta agacagagta 14941 atgatcctca tcagtgatgt taaattggtt tcgcaaatcc ttgagagcat gtaaactctc 15001 tgcataactc atattcccct cttcgagggc atcacgcaaa accccttggt aaactcgtac 15061 ttggtctaca cgcctaaaac caggcaggac tttcgccaat acgtacactt catctggaat 15121 cagatcttgc aatgaacgtc cttcaagcaa tttagaccag tctactgcta gcttgttcaa 15181 ctgtcggcgc aaattatttg ccagactttc acgggcataa cgctctttgc tacccgccaa 15241 actgcgctcc aaccataagc tactcacgag aacgctgaga atattgagca cgatgatagc 15301 ccaatctggg agtggtttaa aaatggggcg aatgccaaag ataaaaaaga cattaaaggc 15361 tataaaagta cacaaaacaa aacagacgtg caggacttgt tcttcatcgt ggtatttgtt 15421 tttctttttc agataagttc tatacgctct ttctaaaaat tcccctacta aatagctcgc 15481 agcgcaaaaa acagcaagag tcaaaggagc tgctatgatc ttgggaatgg gaactgggcg 15541 accaaatatg taaaatcctg ggttgaacag tgtgcgtaat ggatcttgct cataaagcca 15601 tccaccagaa aaatagtatt cccaattacc agaatataag taaaagaaga catagtatcc 15661 caacatcaaa ccgaagtaac cgtaaaatac tagcttctga tctggtctgg taattccctc 15721 ccaataagaa cgctctgcgt caatatcgat acaaggagat tggcaactca cacaggcact 15781 cttctcgtta cctgaactat cgacagtcct acaagtagat tgggtgatag tctgtggcgg 15841 ttgaaggtga gcttcactcc ccagcaagcc ccgtcgtcct gtaaaaaaca tttgtacagg 15901 tgccattggg caaaaatact gacaccagct ttttcccttg taaagaaaac cgacagtcat 15961 ggctgaggcg atggtaaata ggaggaaaaa acccattgct atgcgatcgc cattcacaaa 16021 caaaatacgt atattcaacc ccaagtacaa caaaccaaat tgcagcagta ggtaattccg 16081 acctagccaa gactcttttt ctatacctgc caattccgaa cgaacactac ctgtttcttc 16141 agaaaccgtt ttacgcttac gttgaattcc cagtgctcgt ggaatttgag agaagaaata 16201 caatggacaa atccgccgcc aaaattcatg accgaacact aacaaaataa caattgcact 16261 cgggataact actgtccaaa agatgcgtgc cccgatagta tagggctgtt cttttaaaca 16321 cactccttga actttcacac aagttgatac attttggtag ttgttgaaat caagatgaat 16381 agaacttgta aaccaaggtg aaattggagc gtagaataaa gaaaaaatta acatcagcca 16441 
gccgattgca agcagccacc tgatgaagtg catctttgtt tctgaaactt gactaatcat 16501 tgttttggct tgtattcagt aatcgttggt actgatcacc catcaaaaat tcccttctgc 16561 ataggcaaat gttgtctagt acacatttgt tgagaaaggg ctttatgtgt gaaaatttga 16621 tattgtagat attatgatgg atgaaccgct gtacagtact cgcgtcaagt tctaagacac 16681 aatagtataa aatatttaat agttatttat aaatcattct ggatttttac aaatcgttgc 16741 tacaacccaa tagaagcaag caatattaac tttgagacat atcacaacaa aaatttaagg 16801 tgcatattct taaaagatta cctctacaat attgttagcg atacaaagta tggtgggaaa 16861 agctttttat tatgcatcta ccaccaaggt tttcaccata gcgagttatt cttatgaaga 16921 cctttttgcg acacggattt atgacagccc ccttggctgc cataataggc ttttctgtcg 16981 tcattccttt aattcctgca gaggctcgga taacatttaa agctccagcc gctttgggag 17041 tacctggtag gcgtgtagcg ggtgcctcgc gtacaatgca gcaatgctta ttagataata 17101 agcccctgac tgctatagtt cctcaatcta acataggatt aacaacagca gcaaaccctg 17161 tactgttatt ctatatccca aaaacatctg cgcaagcaca attagaactg gttgtacaaa 17221 ccgctaatga aaagaacatt gtaaccaagc agtcctataa accgagtagc aaagctggag 17281 ttgtcagtat tcccctgaca aacgcatcac tagaagtcgg taaggactat cactggtttt 17341 tttcaattgt ttgtaatccg aaagcacgtt ctaaagatca cttggtgcat ggagggatta 17401 aacgcattca agcggaacct ccattgacaa cgcagctaaa aaatgcaagt ccgcaacaag 17461 ttgtcaatgt ttatgcacaa gctggaattt ggcaagatag cgttgccaag ttggctggtt 17521 tgcgttactc tcgtcccaat gatgcagaat taaaagctga ttgggaagga ttgttacaga 17581 gtgtagaatt caaacctgat gtggtgaatg cccccttgct tcaaggaggg gaagcaccac 17641 aacagcagta gattttgttg aggcgggcta taccttgcaa caaaaaatgc ctgactaccc 17701 aacttagtgg tctatcgcat tacagcaatt ttcagataaa tagaccacag tgattgaacc 17761 aaccaaaggc atca // LOCUS NODE_1862_length_17705_cov_5.26787517705 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 17705) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17705) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17705 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 85..579 /locus_tag="DP116_16075" /pseudo CDS 85..579 /locus_tag="DP116_16075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867205.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="porphobilinogen synthase" gene complement(765..1136) /locus_tag="DP116_16080" CDS complement(765..1136) /locus_tag="DP116_16080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197609.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16080" /translation="MLNFKKYAALLALLPVATFWTQAAHAETLKTKNFNVTITRNCPE GNVTCNNVTYFGKDLRTGKSISLTGKTIHTTGADGVTPGRFLGYQFRNNEYVYRVTAD GILEVYQGKKLILQEKGALTF" gene complement(1311..1520) /locus_tag="DP116_16085" CDS complement(1311..1520) /locus_tag="DP116_16085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862633.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_16085" /translation="MQVSARNALKATVKEVVEGSVNTEVTLEVAPGVEVVAIITKSSA HKLQLEEGKQAYAIIKSSDVMVAVD" gene complement(1699..3540) /gene="modB" /locus_tag="DP116_16090" CDS complement(1699..3540) /gene="modB" /locus_tag="DP116_16090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdate ABC transporter permease subunit" /protein_id="PRJNA477356:DP116_16090" /translation="MPFDLSPLWISLKTSLLATFITFFLGIAAAYWMLGYRGKGKSLI EGIFVAPLILPPTVVGFLLLLLFGKNGPVGKLMEPLGFSIVFTWYGAAIAATVVAFPL MYKTALGAFEQIDSNLLRVARTLGAKESTIFWRISLPLAVSGILAATTLAFARALGEF GATLMLAGNIPGQTQTIPMAIYFAVEGGAIEEAWFWALAIMGISLSGIIAVNYWQETR GNRRGQINSKSRSVASIATQSQNSKIITPDSPILTQHSPGSGLFVDIEKQLSGFSLKV CFSADKQPLGLLGGSGAGKSMILRCIAGIETPSSGSIVLNGRVLFDSQQGINLPPRDR RVGFLVQNYALFPHMTVGQNIAFGLPKGLSATAIRQLVETQLIAVQLEGYSQRYPHQL SGGQQQRVALARALASQPEVLLLDEPFSALDTHLRSQLEQQMIGTLSSYEGVTLFVTH NMEEAYRVCPNLLVLEKGKAVHYGTKYDIFEHPATVGVAQLTGCKNFSSAVATTSGQV QASDWGCTLSVIEPIPEALSNVGIRAHQIIITDKPNQENTFPCWLARTSETPHRMTLF LKLHHRSTNNNDYHLQAEVFKEKWATLKDQPMPWYVRLDPLRLILMQ" gene 4015..4431 /locus_tag="DP116_16095" CDS 4015..4431 /locus_tag="DP116_16095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878758.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16095" /translation="MSQPIIWIHGDCLSPGNPALEEYPQAPAIWVWDDALIEEWQLSL KRLAFIYECLLELPVVIRRGDVAKEVLAFAKEHNANKVVTANSPSPRFDAICEEIERS VELEVFEVEPFFDYDGYIDLKRFSRYWKVAEKYVFD" gene complement(4615..4830) /locus_tag="DP116_16100" CDS complement(4615..4830) /locus_tag="DP116_16100" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16100" /translation="MFSKALSQKFVYLTTVSITIAAIVVIFNTENFHASSMCVITNTA SMDKGVDTSLRENTDKFLDVAIEQSIH" gene 5169..5666 /locus_tag="DP116_16105" CDS 5169..5666 /locus_tag="DP116_16105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015193100.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutathione S-transferase" /protein_id="PRJNA477356:DP116_16105" /translation="MIPISTLFIGLNGLIALLLTYIVVMERTRTRLWHGESKEDVAIQ RDPLINPNVVAATVEKLATKIIPDKVEDYGALQRKVRAHGNFAEYVPLGLLFVIALEL MHSPNWLIWLEGGVLTVARIAHAWGLITTYGPSIGRATGFYLTLFVYIIGSLACVYYG IQGVI" gene complement(5663..6121) /locus_tag="DP116_16110" CDS complement(5663..6121) /locus_tag="DP116_16110" /inference="COORDINATES: protein motif:HMM:PF00583.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16110" /translation="MVRIRLYEPKDLEDIVQLWYRTWHQTFPNIQHPQPYSAWKSRFL DEFAVQGEVWVAEVEHHIVGFVVVIKEEQYLSQIFVNTEYQNCGVGSALLNKAKEICP QGLMLQTLQQNIRACVFYEKHGFKAGKISVNKINGQPNIEYHWKPLINTH" gene complement(6346..7701) /locus_tag="DP116_16115" CDS complement(6346..7701) /locus_tag="DP116_16115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013189867.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MATE family efflux transporter" /protein_id="PRJNA477356:DP116_16115" /translation="MFDYVKANHDLFRRFYRLTVVNVLSNLTEPLAGLIGIAFLGHLT EIRHLAGVSLATVLFNYIYENLLFLRISTTAVTSQAVGQDDQEAILLAGLRNGFIALV LGVLIFVLQYPIGVLGFNLLNGSPEVESIGLDYFNARILGAPAVLLNFVIIGWFLGRE QNGKVLLLTAVGNAANIVLTYFSIMRWDLGSTGAGLSHAISEYLTLLVGMLLAFRSIQ WQELRTAVQKFWEWSAFQATFILNSDLLVRSLVYMSIWTIFFNLSATFGTDVLTENAL LQQVVFLLAYLIEGIGFTTETLTGNFKGQSADDQLLPLLQISLFTSLLVGVATSGACV LLPETVFGLLTNHAELIEPIKHYVPWLFFVLSFFSVAWILEGYFAGLTKGQSLRNAAL MAALLGFAPVAFWAWLAENNHLLWLATSAFMAIRVLLLGVQLPEMFESHSTTVSELPK L" gene complement(7686..7928) /locus_tag="DP116_16120" CDS complement(7686..7928) /locus_tag="DP116_16120" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16120" /translation="MRKKGKLLAISYNKAFEWSMIIDKQIMYTMLSNLFFNGLYGLKQ ITVDIFFPTIFSTRFYKQVNYREFIGIFRIVICLTM" gene complement(7995..8453) /locus_tag="DP116_16125" CDS complement(7995..8453) /locus_tag="DP116_16125" /inference="COORDINATES: protein motif:HMM:PF00583.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16125" /translation="MVRIRPYEPKDLEKIVQLWYRTWHQTFPNIQHPQPYSAWKSRFC DDLAVTGKVWVAEVEHHIVGFVVVIKEEQYLSQIFVNPEYQNRGLGSALLNKAKEICP QGLMLQTLQQNIRACVFYEKHGFKAGKLTVNKINGQPNIEYHWKPLINTY" gene complement(8580..10391) /locus_tag="DP116_16130" CDS complement(8580..10391) /locus_tag="DP116_16130" /EC_number="3.6.5.n1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654428.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="elongation factor 4" /protein_id="PRJNA477356:DP116_16130" /translation="MTDVPAARIRNFCIIAHIDHGKSTLADRLLQVTGTVDQREMKEQ FLDNMDLERERGITIKLQAARMNYQAKDGQEYVLNLIDTPGHVDFSYEVSRSLAACEG ALLVVDASQGVEAQTLANVYLALEHNLEIIPVLNKIDLPGAEPDRVIGEIEEIIGLDC SGAILASAKEGIGVGDILEAVVERIPAPRNTVSDRLRALIFDSYYDSYRGVIVYFRVM DGTVKKGDRVYLLASDKEYEIDELGVLSPTQKQVEQLHAGEVGYLAAAIKAVADARVG DTITLSNAKAAEPLPGYTEANPMVFCGMFPIDADQFEDLREALDKLRLNDAALHFEPE TSSAMGFGFRCGFLGLLHMEIVQERLEREYDLDLIITAPSVVYKVHTLKGEELYIDNP SHLPSPNEREKIEEPYVQVDMITPETYVGTLMELSQNRRGIFKDMKYLTQGRTTLTYE LPLAEVVTDFFDQMKSRSRGYASMEYHLIGYRENPLVKLDILINGDPVDSLAMIVHRD KAYNVGRSMAEKLKELIPRHQFKVPIQASIGSKVIASEHIPALRKDVLAKCYGGDISR KKKLLQKQAKGKKRMKSVGTVDVPQEAFMAVLRLDQG" gene 10890..11126 /locus_tag="DP116_16135" CDS 10890..11126 /locus_tag="DP116_16135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408258.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16135" /translation="MNMKEKTTDAEQRLADKVEIAIRLDSDLLEQIHHLTNDPSKVIE VAIRQWLRGESPRDDELTRTPVRKPLPSRGEWND" gene 11452..13209 /locus_tag="DP116_16140" CDS 11452..13209 /locus_tag="DP116_16140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878771.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_16140" /translation="MSITANSQRSNILSENLEDITSNTNGSWANLGAELVYTQDSTGR YLSFWWQHSECLGLNPVQILEDQSGKECAFTPVDQAAYVEKLQRILISLIPQRYQCWF SYGQQLFELELVMSPIMPTFASTPTTVLVMGRLLQTALSNEVNNLTSQTPVQLDSATR SRRHHKLISQITRNIRRTLDLDIIWQQTVDGLGKALQLERCIICPYQSPSTKVQVIAE YRQPSLNSMLGLEIDIASEPSFAHALATLQPILAENPQHIEFDQQKMLVVATSHQDQP NGLIAVTLGKKFCAVSVEELEIAKDVADQLGTAIAHATLYKELEDARQQAEQATRRIR EFLANVTHELRTPLNGIIGFLKLILEGMADDPEEQRQFLEEAHKSSLYLLDIINDILD IARIEADKMELELQSVKLDELFSDVENFMRPQAEGRNLSVQINMPPTSDEIIVYGDYQ RLKQVMLNLVSNAIKFTHEGGITVTTDVVRKKVTFQDQQFPGMVRVRVADTGIGVSLD KQEKLFQLFSQVDSSRTRQYGGTGLGLAISQKLVEAMGGQVNFYSLGEGLGSTVTFTV PLYQQPVMVSSSNSDSQEL" gene 13386..14036 /locus_tag="DP116_16145" CDS 13386..14036 /locus_tag="DP116_16145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3386 domain-containing protein" /protein_id="PRJNA477356:DP116_16145" /translation="MTHPTTARQIFQTAYESRYTWDENFPGYSADVQLVQGDEVHTGK IRINRDLSVEVTGVADEQVEEGISTQLRDIVTHRKRTSFEESHGNHEFSLGEQDPDGA IEILVNGDSMGSNYKVRGNDICQVSRVLGRMAFIINTHENLDTGSGYLGTRYDAVFRN LKTNEITSILKFEDSYEKIDGYYVMTKQVVQEYKDGTSTTTEFAYFNIKLLEAAVV" gene 14337..15146 /gene="cobM" /locus_tag="DP116_16150" CDS 14337..15146 /gene="cobM" /locus_tag="DP116_16150" /EC_number="2.1.1.133" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="precorrin-4 C(11)-methyltransferase" /protein_id="PRJNA477356:DP116_16150" /translation="MDESTTRVNDKNNYISLKSGVYIVGAGPGDPELLTVKAQKLLEL ADVILFADSLVPQQILELCREDAEILPSVNKTLEEILPIMIERVRSHKSVVRLHSGDP SLYSAIHEQMYLLAEAEIPFEVIPGISAFQAAAAKLKVELTVPGLVQTIILTRISGRT EVPETEELTTLAAHQASLCLYLSARHVEAAQAKLLQHYSPETQVAICYRLGWSDEKIV VVPLHEMADCTHKEKLIRTTLYVISPALSQASARSRLYHPEHSHLFRSSHS" gene 15306..15659 /locus_tag="DP116_16155" CDS 15306..15659 /locus_tag="DP116_16155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873647.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16155" /translation="MPLIKVQTSVSAPLGEEVEALLKSLSGKLAKHTGKPESYVMTAF EAGVPMTFGGTTDPVCYIEVKSVGTFKPDQTQAMSQDFCQEINKALKVPQNRIYIEFA DAKGAMWGWNGTTFG" gene 15772..16869 /locus_tag="DP116_16160" CDS 15772..16869 /locus_tag="DP116_16160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950020.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN" /protein_id="PRJNA477356:DP116_16160" /translation="MSAKSVSVLPSPKTSVVQQPNISIPPLVGASLEELTGWVQQQGQ PAYRGKQLHEWIYQKGVRSLSDISVFSKQWRAEVAEIPIGRSNIHYRSVASDGTVKYL LQLSDNQIIECVGIPAEKRLTVCVSTQVGCPMACDFCATGKEGYKRNLGRHEIVDQVL TVQEDFQQRVSHVVFMGMGEPLLNTDHVIGSVKSLNQDVGIGQRNLTVSTVGIRDRIR HLAQHQLQVTLAVSLHASNQALREQLIPSARSYPIEDLLTECREYVKITGRRVTFEYI LLAGVNDLPEHALELAQRLRGFQSHVNLIPYNPITEADYKRPNENRILAFVKVLKLQQ IAVSVRYSRGLEADAACGQLRAKKNPDINFP" gene complement(17038..17217) /locus_tag="DP116_16165" CDS complement(17038..17217) /locus_tag="DP116_16165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195899.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="replication restart DNA helicase PriA" /protein_id="PRJNA477356:DP116_16165" /translation="MQIVQKIRCSNCGSEGERHYLPDSQLTRTQCPSCDYLMICCTRT GKVIEAYAPGILAPR" BASE COUNT 5238 a 3783 c 3702 g 4982 t ORIGIN 1 tggtatgact atacccaaaa ttatagatgt gcggattttg gcatacgggg tgtagggggg 61 taggggtgta agggtgtagg ggaagatatg gtcgcacctt ctgacatgat ggatggtaga 121 gtaggggcta ttcgtaaagc actcgatgcg gaaggatata tagatgtgcg gattttggca 181 tactctgcaa aatacgcttc tgcatactat ggtccatttc gggacgcgtt ggattctgca 241 ccaaaatttg gggataaaaa gacttatcaa atggacgccg caaatgctag agaagctatt 301 aaggaagtag ccctagatat tgccgaaggt gcagacatcg tgatggtcaa gcctgctcta 361 gcatatttag acattatccg ccaagtcaaa gattacacgc aattacctgt agcagcttat 421 aacgttagtg gtgagtatgc catgattaaa gctgctgctc aaatgggctg gattgatgag 481 aaaaaggtca ttttagaaac cttgactagt atgaaacgag caggtgctga tttgattctg 541 acttactttg cctgcataag ttgcgttgat attggttaac agaatagggt ttttgttata 601 gcagtatagt aggttgggtt aagcagggag caacccaaca aaatcaagta tagtcggttg 661 ggttaagcag ggagcaaccc aacaaaacca aggcacaagt tgggtttcgt tcctcaaccc 721 aacctacaat tctgaaccat attgacttat aaacgagata atttttaaaa agtcaaagct 781 cctttctctt gaagtattag ctttttccct tgataaactt ccaaaatgcc gtctgctgtc 841 acccgataaa catattcatt attgcggaac tgataaccga ggaatcgacc tggtgtgaca 901 ccatctgcac cagttgtgtg gattgttttg ccagtcagac taatcgattt acctgttctt 961 aaatcctttc caaagtatgt aacattgttg caagttacat ttccttctgg acagtttcta 1021 gtaattgtga cgttgaaatt tttcgttttt aaagtttcag catgagctgc ttgagtccag 1081 aaggtagcaa ctggtagcaa agcaagcaat gcagcatatt ttttgaagtt taacatttta 1141 ggttttctgg aaattaatta atttagtagt tgtttgatag atacacattg attgagccaa 1201 ttccgaaaaa aatatgctct caagacaata aaaatgttaa aagcccagtc attgtgactg 1261 ggctttcagt ataggcttta gataattgaa aattctctca aaaaaacttg ctagtcaaca 1321 gcaaccataa cgtctgatga tttgataata gcgtatgcct gttttccttc ttccagttga 1381 agtttatggg ctgatgactt ggtaataatt gctacaactt cgactcctgg cgcaacctct 1441 agcgttacct cagtgttaac agaaccttcc acaacttctt tgacagtcgc tttgagcgcg 
1501 ttacgagcac tgacttgcat cttttaatat cccctgtatc gttttgactg actcatcata 1561 gcactatagc actatcatct caaatgaaaa ttttaagtat aaataaaata aatttttact 1621 taaatatcaa agcttataag tattatttcg gttgaacgat gacggcttag tgggtacctg 1681 aaaaactttt gttgtggttc attgcatcaa aatcagcctt aaaggatcta aacgaacata 1741 ccacggcatc ggttgatctt tgagagttgc ccatttctcc ttaaacacct cagcttgcaa 1801 gtggtaatca ttgttgttgg tagatcggtg atggagtttg agaaacagcg tcattcggtg 1861 tggagtttca ctcgttctcg ccagccagca aggaaaggta ttttcctgat tcggtttgtc 1921 tgtaataatg atttgatggg cacgaatccc gacattggac aaggcctctg ggataggttc 1981 aatcacactc agggtacaac cccaatcact tgcttgcact tgccctgatg tggtagcaac 2041 agcacttgag aagtttttac atcctgttaa ttgagcaaca ccaacagtcg cggggtgttc 2101 aaaaatatcg tatttagtgc cgtaatgaac tgcttttcct ttctcaagaa cgagcaaatt 2161 cggacaaacc cgatatgctt cttccatatt gtgggtgaca aataaagtca caccttcgta 2221 agaagataat gttcctatca tttgctgttc cagttggctg cgcaagtggg tatccagtgc 2281 agaaaatggc tcatccaaca atagtacttc tggttgactt gctaaggctc ttgctaatgc 2341 gactctttgc tgctgtcctc ccgaaagttg atgcgggtaa cgctgggaat atccttccaa 2401 ctgcactgcg atgagttgag tttcaacaag ttgtcgaatc gctgtagcag aaagtccttt 2461 gggtagacca aaggcaatat tctgtcctac agtcatatgc gggaacaaag cgtaattctg 2521 cactaaaaaa ccaacacggc gatcgcgcgg tggcaaatta attccctgtt gcgagtcaaa 2581 caacacccgt ccattaagca caatgctacc agaacttggt gtttctatcc cagcgatgca 2641 acgcaaaatc atgctcttac cagccccaga ccctcccaat aatcccaaag gttgtttgtc 2701 cgcactgaaa cacactttca agctgaaacc agaaagctgt ttttcaatgt caacaaataa 2761 tcctgagcct ggagaatgct gagtgagtat tggtgagtcg ggagtgataa tttttgaatt 2821 ttgagattgc gtcgcaatgc ttgcaacgct acgcgatttt gaattgattt gtccccttct 2881 gttcccccga gtttcctgcc aatagttaac agctataatc ccagacagag aaatacccat 2941 gatagctaaa gcccaaaacc aagcttcctc aattgctccc ccttccacag caaaataaat 3001 tgccattggg attgtctgag tttgccctgg gatatttcct gctaacatca aggttgcacc 3061 gaattcaccc aaagcacgag caaaagccaa ggtcgttgct gctaaaatcc ctgataccgc 3121 taaaggtaaa cttattcgcc aaaatattgt agattccttt gcacctaggg ttctggcgac 3181 
tcgcagaaga ttgctgtcga tttgctcaaa agcccctagg gcggttttat acattaatgg 3241 gaaggctacc actgtggcgg cgatcgccgc accataccaa gtaaaaacaa tactaaagcc 3301 caaaggctcc ataagtttcc ccacaggacc atttttgcca aacagcagta gcaacaaaaa 3361 gccaacgact gtcgggggta aaatcagagg agcaacgaag ataccctcaa tcaaagactt 3421 acctttccct cgatatccca gcatccaata ggcagcagca atcccaagaa agaaggtaat 3481 aaatgtcgca agtaaggaag tttttaggga tatccacaac ggagaaaggt caaagggcat 3541 aagtcacctg aagtcaacaa gtgtgaaatc taaaggataa aatccgcttc tctcggacac 3601 atcagcaagc tttgtctttg gcgaagttag aagtataaat tttttatcct ttattttttg 3661 tcattctcta ataccaactt aatagaaaaa gttggtattg aacaatatgt acaacaaaat 3721 agttttaagg gacaccctaa aagcaaacac atcaactatt gattgttaaa aagctttctg 3781 aaactggtta taataccaaa ttgcgttcaa cacctcaccc ctgcccctct ccttactaac 3841 tatcgtgtac acacatctgc ctgaaaacct caccctcgct ttttgctacg caaaaatctt 3901 tccctctcct ttctttctct ctcctttttt tccctctcct taataaggag agggatgccc 3961 gatagggcag ggtgaggtaa atacgtatga atccaactat aaattttaaa tccaatgagt 4021 caacccatta tttggataca cggagactgt ctgagtccgg gaaaccccgc actcgaagaa 4081 tatccccaag cacccgccat ctgggtttgg gacgatgctt taatagaaga atggcaactg 4141 agtttgaaac gcctcgcttt tatctacgaa tgcttgctag agttacccgt tgtcattcgt 4201 cgtggcgatg tcgcaaaaga agttttagct tttgccaaag agcataatgc caacaaagtc 4261 gtgacagcaa acagtcctag tccccggttt gatgctatct gcgaagaaat tgagcgttct 4321 gtggaactgg aagtctttga agtagaaccg ttttttgatt acgacggcta tattgacctc 4381 aagcgcttct cccgctactg gaaagttgca gaaaagtatg tatttgatta gacttgccag 4441 agagaatttg cagctttttt taaacaactt ctggagtttt ttgagttgag ttcaacagaa 4501 ggcaacttgg tattatttga ggcaatgaat catcttgaat tgggtgcaaa agcttgatca 4561 ttcttggtcg gtattttgat ttatctcaac ttgtaaacct catcaaccgc atctttaatg 4621 gatactctgc tcaatcgcca catctaaaaa tttgtccgta ttttcccgta aggaagtatc 4681 tactccttta tccatagatg cagtgttagt aatgacacac attgaacttg catggaagtt 4741 ttcagtatta aaaataacaa caattgcggc gatagttatg gatacagtag tcagatatac 4801 aaatttttgg ctcaaagctt tggaaaacat caagaatgcc ctttgaagag aagagtgtat 4861 atcgggaaaa 
ttatgtttta gctaatgtaa cattagctga aacatcgcct aagatggatc 4921 gtgctaaaca tacatcatcc tatgaaatga cccgattgga tgatttctga aaaattttga 4981 aaatacagag atgagagcaa cacaagtaac aaacagctat ttgtaaagta gctaatcaaa 5041 tttattgaag ctagagagaa aagtcatgat tttgctatga cttcagtcac attttgatga 5101 gaaaattgta ctatagtatc tttctttaga ctaaaaactg agtttaaata cttaaaaata 5161 acgaaaatat gattccaatt tcaactcttt ttatcgggct taatggtttg attgctcttt 5221 tgctcaccta cattgtggtg atggaacgaa caagaacaag gctgtggcat ggtgagtcaa 5281 aggaggatgt ggcgatacaa cgcgatccgc tgataaatcc aaatgttgtt gcagctacag 5341 ttgaaaagtt agcgacaaaa attattcctg ataaagttga agactacggc gctttacagc 5401 gtaaggttcg cgcccatggg aattttgcgg aatatgtacc cctaggattg ctttttgtca 5461 ttgctcttga gttaatgcat tcgccaaatt ggctcatttg gctagagggt ggcgttctca 5521 cagtagccag aattgctcat gcatggggtt taattacaac ttatggtcct tctattggta 5581 gggctactgg cttttatctt accttgtttg tctacataat tggtagccta gcttgtgttt 5641 attatggcat tcaaggtgtc atttagtgtg tatttatcaa aggcttccag tgatattcaa 5701 tattgggttg cccgttaatt ttattgactg agatcttacc agctttaaag ccatgcttct 5761 cgtaaaatac gcaagctcgt atattctgtt gtaatgtttg aagcattaat ccttgtggac 5821 aaatctcttt agctttgttg agtaaggctg aaccaacacc acagttttga tattcggtat 5881 tcacaaaaat ctgtgataaa tactgttctt cttttatcac taccacaaag cccacaatgt 5941 gatgttcaac ttcggcaacc caaacttctc cttgtacagc aaactcatca agaaagcgag 6001 atttccatgc agaatatggc tgtgggtgtt gaatattggg aaacgtctgg tgccaagttc 6061 gataccaaag ctgaacaatg tcttctaaat cttttggctc ataaagtcga attctcacca 6121 tgtgaagtca atattttgaa aataaaacca cagatgtaca cagataaatt atctgtgatt 6181 atttgcacgg caggtgcaac aagagcggca tgccgctttc tcctggtgct tgttcgattt 6241 ttttcctttg ttgcaagcta ctaactgttt ccatctgtga cgataaatca gcaaaagggc 6301 aatttgagcg cgatacgcac aagcttgaaa tactctggct tatcgctata gcttgggcaa 6361 ttcactaacg gtagttgaat gactttcaaa catctctggc aactggactc caagtagaag 6421 tactcttatt gccataaacg ctgatgtcgc caaccacaaa agatggttat tctcagccaa 6481 ccatgcccaa aacgccacag gtgcaaatcc caataaagct gccattaagg cggcgttacg 6541 cagggattgc ccttttgtta 
atcctgcaaa atacccctcc agaatccaag caactgaaaa 6601 aaaactcaag acgaagaata accaaggtac atagtgttta attggctcta tgagttcggc 6661 gtgattggtt aacaacccaa atacagtctc aggtaacaaa acgcacgctc ctgaagtggc 6721 aactcccacc agcaaactag tgaacagcga aatctgcaat aagggcagta actgatcatc 6781 agcggattga cctttaaaat ttccggttaa agtctccgta gtgaatccta ttccttcaat 6841 caagtacgcg agcagaaata ctacctgttg gagcaaggca ttttctgtta agacatctgt 6901 tccgaaagtg gcactgagat tgaaaaaaat tgtccaaatg gacatataaa ctaaagatct 6961 aaccaatagg tcactattga ggataaaagt agcttgaaaa gccgaccatt cccaaaactt 7021 ctggactgct gttcgcaatt cttgccactg gatagagcga aacgccaaga gcatacctac 7081 caataacgtc aaatactcgc taatcgcatg agacagtcca gcccctgtac ttcccaaatc 7141 ccaccgcata atcgagaaat aggtcagcac aatattagcg gcgttgccaa cagcggtcaa 7201 cagcaacact ttgccatttt gttcccgtcc cagaaaccag ccaatgatta caaaattgag 7261 caaaactgcg ggcgctccca aaatccgggc gttaaaataa tctagtccga tagactcaac 7321 ctctggagag ccgttcaata aattaaaccc tagcactcct atagggtact gcaacacaaa 7381 gatgagaaca cccagcacta aagctatgaa gccatttcgc agtcctgcta gtaatatcgc 7441 ctcttggtcg tcctgtccga cggcttgaga tgttactgcg gtagttgaga tccgtaaaaa 7501 gagcaagttc tcgtagatgt agttgaagag aactgtagcc aaactaactc ccgctaaatg 7561 acggatttcg gtgagatgac ctaagaatgc aataccgatc aaacctgcca aaggctctgt 7621 gagattggaa agaacattga caactgtcag tctgtaaaaa cgacggaaca ggtcatggtt 7681 cgcttttaca tagtcaaaca tataacaatc ctaaatattc ctataaattc tctataattt 7741 acctgcttat aaaagcgagt actaaatatt gttgggaaaa atatatcaac tgtaatttgc 7801 tttaaaccgt ataatccatt aaaaaacaaa tttgataaca ttgtatacat tatttgcttg 7861 tcaataatca tactccactc aaatgcttta ttgtagctaa tagccaggag ctttcctttt 7921 ttccgcaagc aacttcacct tttgtctgtc ttgtcaatgc gtaagtccta aacctatggc 7981 attcaaggtg tgatttagta tgtatttatc aaaggcttcc agtgatattc aatattgggt 8041 tgcccattaa ttttattgac tgtcagctta ccagctttaa agccatgctt ctcgtaaaat 8101 acgcaagctc gtatattctg ttgtaatgtt tgaagcatca atccttgtgg acaaatctct 8161 ttagctttgt tgagtagggc tgaaccaaga ccacggtttt gatattccgg atttacaaaa 8221 atctgtgata aatactgttc ttcttttatc 
actacgacaa agcctacaat gtgatgttca 8281 acttcggcaa cccaaacttt tcctgtgaca gcaagatcat cacaaaagcg agatttccat 8341 gcagaatatg gctgtgggtg ttgaatattg ggaaatgtct ggtgccaagt tcgataccaa 8401 agctgaacaa ttttttctaa atcttttggc tcatagggtc gaattctcac catgttcagt 8461 caatattttt gttctcgttc ccaagtagag tttgggaatg cataacggga ggcagagtct 8521 ccctgaaggc gttaccaggc taagcctagt aacgagaatc acagggaagg gggtagtggt 8581 tatccctgat ccaaacgcag taccgccata aaagcttcct gcggtacatc caccgtaccc 8641 acagatttca tccgcttttt accttttgct tgcttctgca agagtttctt cttccggcta 8701 atgtcaccgc cgtagcattt agcaagcacg tctttgcgca atgctggaat gtgttcactg 8761 gcaataactt tactaccaat cgatgcttga attggtactt tgaattgatg gcgaggaatt 8821 aactctttga gtttttctgc cattgagcgc ccaacgttgt atgctttatc tctgtggaca 8881 atcatcgcta aagaatccac tggatcgccg ttaattaaaa tatcgagctt gacgagggga 8941 ttttcccggt agccaatcag gtgatattcc atgctggcat atccccgaga acgcgacttc 9001 atttgatcaa agaagtccgt aacaacttct gccaagggca actcgtaagt gagtgtggta 9061 cgtccttggg tgagatactt catatctttg aagataccac gccgattttg cgacaactcc 9121 atcaaagtgc cgacgtaagt ttctggcgta atcatatcca cttggacata gggttcttcg 9181 attttttccc gttcgttggg agatggtagg tgactgggat tatcgatgta aagttcttcg 9241 cctttaagag tgtgcacctt gtaaactaca gaaggagcag taatgattaa gtctaaatcg 9301 tactctcgct ctaggcgttc ttggacaatt tccatgtgca acaagcctaa gaagccacag 9361 cggaaaccaa aacccatcgc gcttgaggtt tctggttcaa agtgcagcgc tgcatcgttg 9421 agtcttagct tatccaaggc ttcgcgcaag tcttcaaatt ggtcagcatc aatggggaac 9481 attccacaaa agaccattgg gttagcttct gtataaccag gcaaaggttc agcagctttg 9541 gcgttagata atgttattgt atctcccact cgtgcatcag ctacagcttt tattgctgct 9601 gctaaataac cgacttctcc agcgtgcagt tgttcaacct gcttttgagt gggagaaagg 9661 actcctaact cgtcaatttc gtattccttg tcagaagcca acagataaac gcgatcgccc 9721 ttcttcactg tgccatccat cacccggaaa taaactatca ctccccggta actgtcgtaa 9781 tagctatcaa aaatcaatgc ccgtaggcga tcgctcaccg tatttcgtgg cgctggtatg 9841 cgctcaacaa ctgcctctaa aatatcacca acaccaattc cctctttggc agaggcgaga 9901 attgcaccac tgcaatccaa accgataatt tcttcaattt 
ccccgatgac tcggtctggt 9961 tctgctcctg gtaagtcaat tttatttaaa accgggataa tttccaagtt atgctctaag 10021 gctaagtaaa catttgccaa ggtttgagct tctactccct gggaagcatc cactaccaac 10081 agcgcacctt cgcaagccgc aagactacga gatacttcat acgaaaaatc cacatgccca 10141 ggagtatcaa tcaagttaag tacatactcc tgaccatcct ttgcctgata gttcattcgg 10201 gcagcttgca gcttaattgt aatgccgcgc tcccgttcca aatccatgtt gtcgagaaac 10261 tgttctttca tctcccgctg atctacagtg cctgtcactt gcagcaaccg gtctgctagg 10321 gtagatttcc cgtgatcaat gtgagctata atacaaaaat tccgaatacg agctgcagga 10381 acatcagtca tatagttctt tggtttatca gcaacaaagg ataaaacagc aatatactta 10441 acgtatttta atgctttctg agctaagacg gtgatatgga gaggtgagga gatgggagtg 10501 aggcaagcag gaaagaagaa attttgacta ttgactaagg actaaggact attgactaat 10561 gactactgac aaatgaatat tgacttgtag acttcttaat gaagatatga aggtctatgc 10621 atagcttatc gtatttatca gttaggggcg tacaataaac aacaagaagt acaaaatacc 10681 gacttcatgg gcaacggcta tccgcaatag cccgctccga tcgcggcgag ggtgtcctct 10741 tctccctaag gggagggtgg aaagccgccc cgccgctcaa attcaagccc tacgctttca 10801 ggggatgggg tgctttacag gtgccctact gatgtgaaga aagcgacttt acaaagccaa 10861 ggaaaaacaa gttcagagta tatcatccta tgaatatgaa agaaaaaact accgatgcgg 10921 aacaaagact tgccgataag gtagagattg ctattcgcct cgactccgac ttactagaac 10981 aaattcacca tctcaccaat gatccaagca aagtcattga ggtggcgatc cggcagtggt 11041 tgaggggtga aagcccaaga gacgatgaac tgacgcgtac acctgtgcgt aaacccttac 11101 catcccgagg agagtggaac gattaaattc acaaaaagtt caaaaatgta aaaagtaaga 11161 cacatatgga ggattaaaaa atgtaaaatt attgatgcta gatataagcc ttccccaggc 11221 ggcgattgca gtcatagcaa cagacgcggg catgcgtatg gcagggaaat atgaaataaa 11281 cgcgatttct ccaaattctt gctctgtaag gatttgaaaa tgtgttttct agcagaacaa 11341 tcaaaaattg ccgctcaagt ttgctaaagc cataaaaatt tcttcccaac taagccagcg 11401 aatcgtaaag cttttttact gggttggctt ctatgttatc caactcgggt catgagtatt 11461 actgctaact cccaaaggtc gaacatattg tcagaaaatc tagaagatat taccagtaac 11521 actaatggct catgggcaaa tttaggagcc gagttggtgt atacgcaaga tagtacagga 11581 cgctatctaa gtttctggtg 
gcaacacagc gaatgcctag ggttaaatcc tgtgcaaatt 11641 cttgaagatc aaagcgggaa agaatgtgcc ttcaccccag tagatcaggc tgcatatgtg 11701 gaaaagttgc agcgaatttt gataagtttg ataccccaaa ggtatcagtg ctggtttagc 11761 tacggtcagc agttgtttga gttggagttg gtgatgagtc caataatgcc aacgtttgca 11821 agcactccaa caacagtttt ggtcatggga cgactgctgc aaacagcact cagcaatgaa 11881 gttaacaact tgacatctca aacacctgta cagctagatt cagctacacg ttcacggcgt 11941 caccataaac tcataagcca aattaccaga aatattcggc ggacattgga tctggatatt 12001 atttggcaac aaacggtgga tggtttgggg aaagcactgc aattggaacg ctgcatcatt 12061 tgtccctacc agtcccctag cacgaaagtg caggtgatag cagagtatcg ccagccatct 12121 ttaaattcta tgcttggctt ggaaatagat atagcttctg agccaagctt tgctcacgca 12181 ttggcaactc tacaacctat tttggcagaa aacccacaac atattgagtt cgaccagcag 12241 aaaatgttag tggttgcgac ttcccatcaa gaccaaccca atggattgat tgctgttact 12301 ttgggcaaga aattttgtgc cgtcagcgta gaagaacttg aaatagcaaa agatgtggca 12361 gatcagctag gaacagcgat cgcccacgca actttataca aagaactgga agacgcgcgt 12421 caacaagccg aacaagccac tcgccgcatc agagagtttc ttgccaatgt cacccatgag 12481 ctgagaacac cacttaacgg tattatcggt tttttgaagt taattttaga aggcatggct 12541 gatgatccag aagaacaaag acaatttctg gaagaagctc ataaatcatc actgtatctg 12601 cttgatatta tcaatgacat cttagacatt gccagaattg aagcagacaa aatggaactg 12661 gaattgcaat cagtcaaatt agatgagcta ttcagtgatg tagaaaattt tatgcgacct 12721 caagcagagg ggagaaacct cagcgttcaa attaatatgc ctcctacctc tgatgaaatt 12781 atcgtctacg gtgattacca acgtcttaag caagtgatgc tgaatctggt tagcaatgcg 12841 atcaaattca ctcatgaagg cggtatcact gtgaccaccg atgtcgttcg caaaaaagtg 12901 acatttcaag accaacaatt tcctggtatg gtgagagtgc gcgtggcaga cactggcatt 12961 ggtgtctctc ttgacaaaca ggagaaactg tttcaattat ttagtcaagt agatagctcc 13021 cgcactcgcc agtacggtgg tacaggtttg ggattggcaa tatcccaaaa gctggtagag 13081 gcgatgggag gtcaggttaa tttttatagt ttgggcgaag gcctaggatc aacagtaaca 13141 ttcactgtac cgctttatca gcaaccagtt atggtttcct cctcaaatag cgactcccaa 13201 gagttgtagg gaacagggaa caggctgctg ataactggta actggtaact gataactgat 13261 
aactgataac tgataactga taactggtaa ctgataactg ataactgatg actgatacct 13321 gatttcaggg atttactgtg ctcagtgaac taaactggag aagaaggtga agtcaaaatg 13381 gttatatgac acatccaaca acagcccgtc agatattcca aaccgcttac gaaagtcgtt 13441 acacttggga tgaaaacttt cctggctaca gtgcagatgt gcaactcgtt caaggagatg 13501 aagttcacac aggtaagatt cgcatcaacc gcgacttaag cgtagaagtt accggtgttg 13561 cagatgagca agtagaagaa ggaatttcta cccaattgcg agatatagtc acccaccgta 13621 aacgtacaag ttttgaggag tctcatggaa accacgagtt tagccttggt gaacaagacc 13681 ccgatggtgc aatagaaatc ttggtaaatg gcgactctat gggttcaaat tataaagtcc 13741 ggggcaatga tatttgtcag gttagtcgcg tcctcggtcg tatggctttt attattaata 13801 ctcacgaaaa tttagataca ggttctggtt accttggaac tcgctatgat gcggtttttc 13861 gtaacttgaa aactaatgaa atcaccagca ttctcaaatt tgaagattct tatgagaaaa 13921 tagacggtta ctatgtgatg actaagcaag ttgtgcaaga gtataaagat ggcactagca 13981 ccacaactga gttcgcttat ttcaatatta aattactaga agcagcggtt gtttaactag 14041 ctcatacccg aaaagcttta acagttatca gttatcagtt atcagcagcc tgttaagcgt 14101 ttcctgttca ctgcgatgca ctgagcgtgg tcgttgtgtt cactgttcac tgttccctat 14161 tccctgttaa gcgttccctg ttcactgttc actgaataaa aaatgcccca gagaattctc 14221 tagggcttaa ccagggtgca tctaccaata cactctacta gaaggagggg ttctatccgg 14281 aaagatactg aggaaatgcc aaaaaatttt ttggatgagt tgctgtatgt tgctatatgg 14341 acgaatctac tactagagtg aatgacaaaa acaattatat atctctcaaa tctggtgttt 14401 acatcgtcgg tgcaggtcct ggagatccag agttattgac cgttaaggcg caaaaacttc 14461 tagaacttgc tgatgtcatt ttatttgctg attctttagt accccaacag attttagagc 14521 tttgccgcga ggatgcagaa attcttccct ctgtcaataa aactttggaa gaaattttgc 14581 caattatgat cgaacgagtg cgatcgcaca aatctgtcgt tcgtctccat tctggcgacc 14641 ccagtcttta cagcgccatt catgagcaaa tgtatctttt ggcagaagcg gaaataccct 14701 ttgaagtgat accaggtatt agtgcttttc aagcagcagc cgctaaactc aaagtggaac 14761 tcacagttcc cggtttagtc caaacaatca ttctcacgcg cattagcgga cgcacagaag 14821 ttcctgaaac agaagaatta accactctcg cagcccatca agctagtcta tgtttatatt 14881 tgagtgcacg tcacgttgaa gcagctcaag ccaaactact ccaacactac 
tcacctgaaa 14941 cacaagtagc tatttgctat cgcttaggat ggtctgatga aaaaatagtg gttgttcctc 15001 tccatgaaat ggcagattgt actcacaaag aaaagctaat tcgcaccaca ctttacgtaa 15061 tcagccctgc actttcccaa gcatcagcac ggtctcgttt atatcatccc gaacatagtc 15121 acctgtttcg ctcgtctcac agttgaggag cacccgagca tccgagcagg gggagaaata 15181 actccctcat cgccctcatc tttgcgtcct ctgcgcctgg tgcggttcga taaaattaac 15241 tgggtaattg atactatcaa gattgataat tttgaatttt gaactttgaa ctttgaattg 15301 ttactatgcc tttaattaaa gtccaaactt ctgtatctgc tcctcttgga gaggaagtcg 15361 aagcattgct caaaagcctc tcaggtaagc tagccaagca taccggaaaa ccagaatcct 15421 acgtcatgac ggcttttgaa gcaggagttc cgatgacatt tggtggtacg actgacccag 15481 tatgttacat tgaagttaaa agtgtcggta ccttcaagcc agaccaaacc caagcgatga 15541 gtcaggactt ttgccaggag attaataaag cgctaaaagt acctcaaaat cgtatatata 15601 tagagtttgc tgacgcgaag ggtgcgatgt ggggctggaa cggcacaact tttggttagt 15661 ttgtcagata gattcttctg tcttccagtg ttgaaaacca caatcataaa ctcaaacaac 15721 aagagcagcg atcgcctaac atagagattc aaagcgcatt ccttaactac catgtctgct 15781 aaatctgtct ctgtacttcc ctccccaaaa actagtgttg ttcaacaacc aaacatctct 15841 attccaccgc ttgtaggagc aagtttagag gagttaacgg gttgggtaca gcaacaggga 15901 caaccagctt acagaggaaa gcaactgcac gagtggattt atcaaaaggg agtgcgatcg 15961 ctctcggata tttctgtctt ttccaaacaa tggcgtgctg aagttgcaga aattcccatc 16021 ggacgctcaa acatacatta ccgttctgtc gcctctgatg gtactgtcaa atatcttttg 16081 caactcagtg ataatcaaat tattgaatgt gtgggtattc ccgcagaaaa gcgcttaaca 16141 gtttgcgttt caactcaagt gggttgtccg atggcgtgcg atttttgcgc taccggtaaa 16201 gaaggttata agcgcaattt gggacgtcat gaaattgttg atcaggtgtt gacggttcaa 16261 gaagattttc aacaacgagt cagccatgtg gtatttatgg gcatgggtga accattgtta 16321 aatacagatc atgttatagg atctgtcaaa tctttaaatc aagatgtagg tattggacag 16381 cgtaacctga ctgtctctac tgttggaata cgcgatcgca ttcgccacct tgctcaacac 16441 cagttacaag tcactcttgc tgtcagtctc catgcttcca atcaagcact ccgggaacaa 16501 cttattccca gcgcccgttc ctatcctata gaagatttgc tgactgaatg tcgggagtat 16561 gtgaaaatca ccggacgccg tgtgactttt 
gaatacattc ttcttgctgg cgtgaacgat 16621 ttgccagaac acgcgttgga attagcacaa cgcctgcgag gattccaaag tcatgtgaat 16681 ttgattccat acaatcccat cacagaagca gattacaaac gccctaacga aaatcgaatt 16741 ctagcttttg tcaaagtcct caagctgcaa caaattgcgg ttagtgttcg ctactctcgt 16801 ggtttggaag ccgatgctgc ttgtggacaa ttgcgagcaa aaaaaaatcc tgatattaac 16861 ttcccctaat tacttacaac aaaagcacct caccccccgc ccctctcctt aataaggaga 16921 ggggtgcccg aagggcgggg tgaggttctt cgttaataag gagaagagtg cccgaagggc 16981 ggggtgaggt tcttcgtttt ttacaagtat ttatccggac atgatatcag cactcaacta 17041 acgcggtgca agaatacccg gggcataagc ttcaatcact ttgccagtac gtgtacaaca 17101 aatcattaag tagtcacaac ttggacattg cgtccgagtc agttgactat caggtaaata 17161 atgtctttca ccttcactac cacaatttga gcagcgaatt ttttgtacta tctgcatttt 17221 aaaatcctta actttcgtgt tcttctgaaa aaaaacgcaa attcaacctg taaatcttct 17281 cgatttacaa aattcaacaa atgagaaaaa tgagaaatta taccgcttac cttaacacaa 17341 aatgaactgt tctttctttt ttgtcttttt tttattatct tcatcttaaa ggtctgtgtt 17401 atccaactct ggatatataa atcgtttata ttttttcact atacttataa atttcatgat 17461 cccaatcact aagacgttga gacttcgttc tttgttgcca tatcaagttt ttcattttta 17521 ctatgagaac tagaactttt tgagaagtgg aattgacagt aaacagtgag tccagtgctt 17581 gatgagggtt tcccgacaga ggcatctggt gagtccagcg ctgcgggagg ggagccagtg 17641 cggtcttggg gagccagtgc gttggggagg cactgctcgg ctagagtttc ccgacttgaa 17701 gcatg // LOCUS NODE_1863_length_17698_cov_5.55563117698 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17698) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17698) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17698 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..95 /locus_tag="DP116_16170" rRNA <1..95 /locus_tag="DP116_16170" /product="23S ribosomal RNA" gene 215..332 /gene="rrf" /locus_tag="DP116_16175" rRNA 215..332 /gene="rrf" 
/locus_tag="DP116_16175" /product="5S ribosomal RNA" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00001" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="Derived by automated computational analysis using gene prediction method: cmsearch." /db_xref="RFAM:RF00001" gene 620..2503 /locus_tag="DP116_16180" CDS 620..2503 /locus_tag="DP116_16180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873679.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_16180" /translation="MMSTSKPRKIFGTGHRRRENDWRLFLRLLPYVRRRGRSLAFAML LLVPIAVANAVQPLLIGQVISLIRQEQSAYEFLRNRPLSQGLNIIEILLLITISVRLI FTGFQGYLVQKLGQQITAEIRRDLFDHVTSLAVRFFDRTPVGKLITRLTSDVEVLGDV FSTGAIGIVSDLFSMLVILGFMFSMQWQLAFLLLMVFIPITGIIVYLQKQYRKANYKT REELSGLNSQLQENIVGINVVQLFRREKFNAELFRVNNQRYVHEVDKTIFYDSAVSAT LEWIALISIAAVLWVGGYLLLQINLTFGVLSAFVLYAQRLFDPLREFAEKFTVIQAGF TAIERVSEVLDERIEIHDRGNPRFSIYDSRLGYIDEITDNPELPIENSQPEFGEIRFE HVWFAYKDNDYVIKDLDFTIHPGEKVALVGPTGAGKSSIIRLLCRLYEPNEGRIFVDG IDIREIPQAELRRYMAVILQEGFLFAGDVKSNITLGDSYSLEEIQQAAEKTNIAQFIE QLPQGYDTQLRERGTNLSSGQKQLLAFARAAIRDPQILVLDEATASLDVGTEALIQEA LDKLLLGRTAIIIAHRLSTIRNVDRILVLKRGELIEQGSHEELLQQGGMYATLHNLQM LAT" gene complement(2578..2748) /locus_tag="DP116_16185" /pseudo CDS complement(2578..2748) /locus_tag="DP116_16185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868725.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="sucrase ferredoxin" gene 2897..3169 /locus_tag="DP116_16190" /pseudo CDS 2897..3169 /locus_tag="DP116_16190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198592.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ROK family protein" gene complement(3163..3330) /locus_tag="DP116_16195" /pseudo CDS complement(3163..3330) /locus_tag="DP116_16195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873672.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ROK family protein" gene 3443..3748 /locus_tag="DP116_16200" CDS 3443..3748 /locus_tag="DP116_16200" /inference="COORDINATES: protein motif:HMM:PF13592.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16200" /translation="MPKGQTPRAYRLLEVKHAPGKVSIVSEEALKRLKQRLQEPQGFH SYGQIQQWLVAEFQLDIAYKTVYELVRYRIGAKLKVPRPQSTKQHPQSLSHFKKNFL" gene 3937..4314 /locus_tag="DP116_16205" CDS 3937..4314 /locus_tag="DP116_16205" /inference="COORDINATES: protein motif:HMM:PF13358.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16205" /translation="MDTANSAEFSHLDGDCFQQFLELLSVQLGDDVAVIQFDQGSFHR VKALDCPENIIPIFQPPHSAELNPIERFWEFLKSKLEWENCKTLNQLRQKLAQVLDTI TPEVIASLTSYDFILEALFSAAS" gene 4882..6123 /locus_tag="DP116_16210" CDS 4882..6123 /locus_tag="DP116_16210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012166529.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_16210" /translation="MPAKNLVSAHKLKPTVLDLFCGAGGMSLGFQNAGCKILAGIDHS PHAIRTHHKNFPNCKLKLKPQDITNIKPHDLNLKPGEVDIVVGGPPCQVYSLVGIGKM RSLGRKIENDPRNFLYQKFVEFLDFYQPLFFVMENVDSLVKRTIFPTILRELEFGLPR KRENYPGYRIHHNILIASDYGVPQIRKRLFIVGVRQDLEYEFEFPQPLKRNPVSVGEA ISDLIPLTPPYLPLKSKNSGLPQEDSKKFYLTHPQSSYQKKMRREITKMPEPDGVMNH ICRSHNPVDIICFAMLAQGGKYTDLPENMRRYRWDIFDDKYKRLPWNKPAWTLTAHMR KDCLAYIHPIQNRSISVREAARLQSFPDHFVFDAPMTRMFELVGNSVPPLLAEAIAKP IVKQVQNYYETNPKVEQLSLL" gene 6264..7766 /locus_tag="DP116_16215" CDS 6264..7766 /locus_tag="DP116_16215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007799350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_16215" /translation="MVRERANDEYDLVEPRAPAMLESLRAFGYNIQTAIADLIDNSIS AGAKNVWLQFYWDGSESYISILDDGKGMTEAELVNAMRPGSRNPLEEREPNDLGRFGL GLKTASFSQCRRLTVCAKAVNQNSVTRRWDLDYVSQTGEWRLLRSAASGSEERLTALE QMESGTVVLWESMDRVVGGTKTDDPKAHNRFLEMIEDVEKHLTMVFHRFLERKNKLQI WINQRLIEPWDPFLTNEKATQWLPEENLYFREDRVVIQPYVLPHHSKVDSQTYEKAAG PNGWNAQQGFYIYRNERMLVAGDWLGLGLQRDEHCKLARIQVDLPNSMDSDWNIDVKK SRARPPASLREDFKRIAKLTRARASDIYRYRGKVIARKYSDNYVFTWLKKLKHGKVFY VINPEHPLVKEALNIPTEYRQIIKALLRLIEETVPVQQIWLDSAVSSEQHSQPFEGVP AREVREVMMQIYQALIKDGLTTSEARSLLVKMEPFQHFEELIATLPESTY" gene 7779..10457 /locus_tag="DP116_16220" CDS 7779..10457 /locus_tag="DP116_16220" /inference="COORDINATES: protein motif:HMM:PF10593.7" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="endonuclease" /protein_id="PRJNA477356:DP116_16220" /translation="MNNSKEYQQALDFALAMLKHNDEVTDELIREKVNLALKMIQLQG SNENFDPDTLIWDLQSRFTVRMAQATILDGEERVDWLPKRRDSIEWRFWKRYERYLLE EKKLSPLVVRRIDELTDSIIERIEDPTKNGHWDRRGMVAGQVQSGKTGNYTGLICKAA DAGYRLIIVLAGIHNNLRSQTQIRLDQGVLGYNTRQNMTFDPNNRRTGVGKLRGEQLH PVHSLTNAEERGDFQITIAKHSNIDLRAVPTLLVVKKNGSILKNLINWATKRHGEKDP ATGRLIVHDIPLLLIDDECDNASINTREDDTNPTTINARIRELLNSFSQSAYVGYTAT PFANIFIDVNSNTHKHGDDLFPRDFIINLPAPDNYVGSSRVFGITADPDSGLEEQDGL PIVHTVKDYQDWMPDTHKTDHVPSELPYSLKKAVKSFIISCAARMARGQDKEHNSMLV HVTRWNEVQILVKNQVHNEIKNIQHRLRYGEGGYQKKISEEFRRLWEEDFVPTTKAID DPEKKLLTWQEVEKFLEPAAQKIQVKTINGKAKEVLDYEDNPDGISVIAIGGDKLSRG LTLEGLTVSYFLRASKMYDTLMQMGRWFGYRPGYLDLCRLYTSEELIYWYQHITLANE ELRQEFDYMAMLNKTPSNFGLRVRTHSDGLTISNVGKIRKGKVLRVTYAGAITETVAF DKNPAINNKNIDSFVKFLNLIGKPEKEPRINNIDAYVWTNLDGNDIVALLKEVKTHRD SIRANSQLLADYVSQQIMKNELRNWTVVLKSKKDAKKPLVGKYEVGLYKRTGSKASNS ERYSIQRLVSPDDEVIYLPGGSRQIKDKKTFRENRDPDRGLLLIYPLDPAEAGLEGNP ILGFALSFPGSQNASSVEYLVNNVYYAQEFGEENQE" gene 10454..11464 /locus_tag="DP116_16225" CDS 10454..11464 /locus_tag="DP116_16225" /inference="COORDINATES: protein 
motif:HMM:PF14390.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16225" /translation="MSIRELWTELEKSIKLGGSEYLVRRVRPDCACDLRIGVQEPTGN RMLLLKIRRSSASSIVDFPSSEGFEVRRILLPSDGENYVTLQLVLTQNRYADIFTSLV DDVIEGVAGKQKEKAALEEFIIRLRRWQSFFKQHSPDGLSKTQQQGLYGELWFLRQVI IPQLGSRQSIQYWTGPRGTQQDFQFPNCAVEVKTTVEKQHQKLSISSERQLDGTGTGT LILVHLSLDVRQGRGESLPDIVNSVRILVQNDPIAKEELETLLLEVGYLDIHTPRYEE IGYTQREVNYFKVEGDFPRIVEADLPNGVGDVRYSISVAECKRFSLPELDVISLIGCN YE" gene 11457..13541 /locus_tag="DP116_16230" CDS 11457..13541 /locus_tag="DP116_16230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017721083.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="abortive phage infection protein" /protein_id="PRJNA477356:DP116_16230" /translation="MNEQTKLIQFAAELIQEVINNSEAGQDNEDGEGDSFREDEFTRL MIEYLTDAGELDDGEVCYHRNRGIKVNGYSINQDFECLNLFISIYTQSIPPVTVTKQE VETAFRRLTNFLQKALKGYHLSIEEASSVFDMALQIHDLRTQLSQIRLYLFTDGRTTI DVKQHETIENITCSFHVWDIERTYRCLSSGKQRETIEIDFESQYGVAIPCLPMPRSNS DYTAYMVIIPGEILYKIYAEYGPRLLERNVRSFLQARGKVNKGIRQTILQEPHRFLAY NNGISATAEAVELVDLPGGGKGIKSARDLQIVNGGQTTVSIYQAAKKDKADVSNIYVQ AKLSVVAPEKANEIVPLISRYANNQNKVNEADFSANDPFHIQIEEFSRTIWAPAVDGT QRQTRWFYERTRGQYLDVKGREGTAAKKKIFTTTHPTSQKFTKTDLAKFENTWNQLPH LVSLGAEKNFREFTIQLAKRGKFQPDEDYFKRLIAKAILFRKTEKIVQVQQFGGYRAN IVTYTLAYLSNKTAQQIDLERIWREQGLSPALQEAIKIVSYQVHQVIINPPGGRNVTE WCKKEECWKQIQTIEIELPNEFWNELISVDTKPNQIDKGIESPDTEDLKVIVQINEVS DETWFQLAHWAKETDNLHSWQRSLAFSLGKLAAKRKSPSYKQANEGIKILHEAEQLGF KYTHDSNGKISV" gene complement(13683..14792) /gene="proB" /locus_tag="DP116_16235" CDS complement(13683..14792) /gene="proB" /locus_tag="DP116_16235" /EC_number="2.7.2.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127014.1" /note="Derived by automated computational analysis using gene prediction method: Protein 
Homology." /codon_start=1 /transl_table=11 /product="glutamate 5-kinase" /protein_id="PRJNA477356:DP116_16235" /translation="MTKTIVVKIGTSSLTQPETGQLALSTIATLAETLSHLRRQGYKV ILVSSGAVGVGCGRLGLTERPKAIALKQAVAAVGQGRLIRVYDDLFTTLQQPIAQVLL SRSDLVQRSRYLNVYNTFRELLELGVIPVVNENDTVAVEELKFGDNDTLSALVASLVE ADWLFLLTDVDRLYSADPRSVPDAQPIALVSNIKELAELQVQTGTQGSQWGTGGMVTK ISAARIAIAAGVRTVITQGRYPHNIEKILQGETIGTHFEPQPEPTSARKRWIAYGLIP AGKLYLDAGAVLAISGGGKSLLAAGITTVEGEFDTQEAVQLCDKNGHEVARGLVNYSS TELQKIRGRRSSEIAAILGYVGAETVVHRDNLVLT" gene complement(15036..15581) /locus_tag="DP116_16240" CDS complement(15036..15581) /locus_tag="DP116_16240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868579.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YqeG family HAD IIIA-type phosphatase" /protein_id="PRJNA477356:DP116_16240" /translation="MHWNNLIQPSLILEGSVLNLTPDMIQKNGLLGLVLDVDETLVPI RAASASVELRQWVEQMRPFVKLCLVSNNLSETRIGGIARSLNLPYFLGAAKPSRRKIR QALKAMNLPVHQVGMVGDRLFTDVLAGNRLGMFTILVEPIIHPDVALRSHPIRNFEVW LSEILGASITPKTRRVTKIDK" gene complement(15633..16161) /locus_tag="DP116_16245" /pseudo CDS complement(15633..16161) /locus_tag="DP116_16245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198444.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(16315..16575) /locus_tag="DP116_16250" CDS complement(16315..16575) /locus_tag="DP116_16250" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16250" /translation="MTRTGKHTRAPKNGVDRTRWNIDRQSKENVGRANKSEGLKLQLP SEQGNAKLQMFGRLKTVIKVCQFLKTVYEWDFGLCTQLPTGC" gene complement(17256..17651) /locus_tag="DP116_16255" CDS complement(17256..17651) /locus_tag="DP116_16255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019498352.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_16255" /translation="MRIGIRSRTWKVFNKEASLKRESTNKLIDKSFPAVPHSEHKHVN VTGNKSPYDGDIIYWSTRNSELYNGETSKALKRQNHTCGYCGLRFTSEERVHLHHKDG NHSNCKTKNLLAVHESCHDYIHMGKRDKP" BASE COUNT 5183 a 3619 c 3836 g 5060 t ORIGIN 1 taaggtcaca ggcagaacac ctgttaatag gcgggaggtg tacgtgcagt aatgtatgca 61 gccgacccgt gctaatagac cgagggcttg acttcaaact caattgcgat tcgcgtttct 121 gtgcagtctt cagggttttt ctaactctct catccaaaac agatcttcaa ataggaagtt 181 ttgatgaccg aagtttgaaa tcccccaggg ttttcctggt gtcaatggcg cggtggaacc 241 actctgattc tatcccgaac tcaggtgtga aacgctgcag cggcgacgat agttggtggg 301 tagctgcctg cgacaatagc tcgatgccag gatttgattt caactaatac aaaagggtgt 361 tgctagttta agcaacaccc ttttgttcgt ttttaccttt tttctctaaa tcaggactta 421 cgcaaaaatc gctaaaaagc ttaatttgtt gaaccggcaa gacgccaaga acgcgaagaa 481 ttcgtagagt gtgcgtaagt cctataaata accgctacat ttatttagtg agttgaggtc 541 gttcaatcaa aacggtctct aaagtattga gggtagaaat gggaagactt taggttgtaa 601 aaattaatat aatgctctta tgatgagcac ttccaaacct cggaaaattt ttggcacagg 661 tcaccgtcgg cgtgaaaacg actggcggtt atttttgcgt ctactacctt atgtccgccg 721 tcgtgggcga tcgctagcgt ttgctatgtt actgttggtg ccaatagctg ttgctaacgc 781 tgtgcaacca ctgcttattg gacaagttat ctccttaatt cgccaagaac aaagcgccta 841 tgagttttta agaaatcgtc cgttgtcaca agggctaaat atcatcgaaa tattgttgct 901 gataacaatc agtgtacgac tgatatttac aggttttcaa ggttatttag tacaaaagct 961 aggtcaacaa atcactgccg aaattcgtcg agacttgttt gatcatgtca catctttggc 1021 ggtgcgtttt tttgatagaa 
cacctgtagg aaaattaatc accaggctca ccagtgatgt 1081 ggaagtatta ggagatgtgt tttccactgg agctataggc attgtgtcag atttgttttc 1141 tatgctggtg attcttggtt ttatgttttc tatgcagtgg caactggctt ttttgctatt 1201 gatggttttc ataccaataa caggtattat agtctactta caaaagcagt accgcaaagc 1261 aaattacaag acgagggaag aactttctgg actgaattct cagttacaag aaaatatcgt 1321 tggcattaat gttgtgcaat tgttccggcg ggaaaaattc aatgccgagt tgtttcgtgt 1381 taacaatcaa cgttatgtcc acgaggtaga taagactatc ttttatgatt cagcagtttc 1441 ggcaacactt gaatggattg ccctaatttc aattgcggct gttctgtggg tgggcggtta 1501 tttactactg caaataaact tgacttttgg tgtattatct gcatttgtat tgtatgctca 1561 acgcctattc gatccattac gagagtttgc ggaaaaattc actgtcatcc aagctggttt 1621 caccgcgata gaacgcgtga gtgaagtttt ggatgaacgg atagagatac acgatcgcgg 1681 taatcccaga ttctcaattt acgattctcg gttaggttac atagacgaaa taactgataa 1741 tcccgaatta ccaatcgaaa attcccaacc tgaattcgga gaaatccgct ttgaacacgt 1801 ctggttcgcg tacaaagata atgactatgt catcaaagat ttggacttta ccattcatcc 1861 tggtgagaaa gtggcgttag taggtccgac aggtgcgggc aaaagttcta ttatccgtct 1921 tttgtgtcgc ctttacgaac ccaacgaagg acgcattttt gtagatggta tagatatccg 1981 cgaaatcccc caagcagaac tgcggcgtta catggcggtg attttgcaag aaggcttttt 2041 gtttgctggt gatgtcaaaa gcaacattac cttaggagac agctacagtc ttgaggaaat 2101 tcaacaagca gcagagaaaa ctaacattgc tcagtttatt gaacaactgc ctcaaggcta 2161 tgatactcaa cttagagaac ggggtacaaa cctttctagc ggtcaaaagc aacttttagc 2221 atttgctcgt gctgctattc gcgatccaca aattttggta ctggatgaag caactgctag 2281 tctggatgta ggaacggaag ctctcatcca agaagcttta gacaaactgt tactaggacg 2341 tactgccatt attattgctc accgcttgtc caccattcgt aacgtggatc gaattttggt 2401 actaaagcgc ggagaattaa tagaacaggg aagtcatgaa gagttgctgc aacaaggagg 2461 aatgtacgct actttgcaca atttacagat gttagcaact tgacatctct ggctatgtgt 2521 aaacacgcct tcgttatcca aagaacctga tatttctata gccagcgaag cctcatttta 2581 ttgagtacaa gtcgccagct tgttagatgc aagccagaaa ctatccacag catacttggt 2641 aaacaccgat tcttgtacag cattgcaaga agttttcagt ctcacagttt tcgtttcgtc 2701 tttcaccagc tttgcttgat aacagtataa 
agaggcatct ggcttttcaa cattctgagc 2761 gatagcgtgt cgctcagaat gacacaatac tacatttccc aatttggtat cacaattcag 2821 aatgatccca cggagaaatc tagaggattg actcaagtta gtatagggaa agagaaaaat 2881 ccccaaactt gttgctatgg cttgtgctta tttccgcttt tcacctccaa atgcaaatgc 2941 tgacactgat ttggaaatta tgcgttctgt gatcaactct ttactaaagg gagcgaaacc 3001 agacgctatt ggtgttagct ttggcggacc agtggacgca acaacaggga aggtaagact 3061 gtcccatcat gtgcttggat gggagaatgt tcctttgcgc gacttgctag aggaagagtt 3121 tggcgttcca gcttctgtag ataacgatgc taatgttgct gctttggggt tgattaaatt 3181 cgccacatta ccaataccca cacccagcgc ccaactagcc gtgtacaaaa tttcctgagc 3241 taaattatcc ccagccgccg cagcttcact caccaccttc cccgtcacca actccaaatt 3301 atctcccacc aactccctta acacgtctcc tcttcttcct tcgtgtcctt ggcgtctttg 3361 cggttcgctg gcagaactta taggacggga tcaggcaact attactcgcg cttgtgagaa 3421 aatataaaga tggaggacgc gattgcccaa agggcagacg ccaagggcgt atcgcttgct 3481 cgaagtcaaa catgcaccgg gtaaagtttc tattgttagc gaggaagcct tgaaacgtct 3541 gaaacagagg ttgcaggagc cgcaaggatt tcacagctat ggtcaaattc aacaatggct 3601 cgttgctgag tttcaactgg acatcgccta taaaacggtc tatgaactcg ttcgctatcg 3661 aatcggtgct aagctcaaag tccctcgccc ccaaagcacc aaacaacatc cacagagtct 3721 gtctcacttt aaaaaaaact tcctctagca ttcaaattct tgcaggagga atttggagag 3781 gggaagcgat tgaggtactt gtgcagagga gacaacccgc ttgggactta agacgatcgc 3841 gggacgttta attactgcgg cgggcggtta aacccctcgg actcagccag tggcaacgtg 3901 agaattttta tttatacgga gtcgtggagc cgttgagtgg atacagctaa ttctgccgag 3961 ttttctcacc ttgatggtga ctgttttcag cagtttttgg agttgctttc tgttcaacta 4021 ggagatgatg ttgcggttat ccagttcgac caagggtcat ttcatcgggt taaagctctc 4081 gattgtccag aaaatattat ccctattttt caaccgcctc actctgcgga acttaatcca 4141 attgagcggt tttgggaatt tctcaaatct aaactggaat gggaaaactg caaaactctc 4201 aaccaactgc gccaaaagtt agctcaagtc ctagacacaa ttacacctga ggtgattgct 4261 tctctcactt cttacgattt cattcttgaa gctttattta gcgcagcttc ataaagaatt 4321 ggtattagcc caaatgttgc tgctaaataa gggtctggtg atgaatctca ctcccattta 4381 gtgatgtgtt gtatgcctag taccgctgcg cggaagtcaa 
aagtcaaaag tcaaaagtca 4441 aaagtattat gaaatgggct tttgagcgat tgtaaatggt tgccctattt acgccgtgac 4501 gtactagttg ccttgtcacc gtatttaatg gcagttgaag ataactcagg tgtgcgcgag 4561 ccaacagttc gcttactgtt agaatgtctc aactttgggg cacttttcaa gattccagca 4621 ttgtgggtag atgggcttta ccagcggttt tttttgacgt ttatggaaat ggcaaaatgg 4681 taattttttt tgataaaagt tgtgtttttt aagttgaata atctttaaat aactagctat 4741 attttcatga atttaaaagt tccaatttaa aaaaactgtt ttttcacgat ttaatcttga 4801 cgagaaaacg ataaaagcag aaaatagtaa tgagttatgt tatctgttca ttttgtttaa 4861 tttaatatta tttgttgaac aatgccagcc aagaatttag tgagcgctca taaattaaaa 4921 cctactgtac ttgatttatt ttgtggtgca ggtggtatga gtttagggtt tcaaaatgct 4981 ggatgtaaaa ttttagcagg aatagatcat agtccccacg ctattagaac tcatcataag 5041 aatttcccga actgtaaact gaagcttaag cctcaggata ttactaatat aaagccacat 5101 gatttgaatt taaaacccgg tgaagtggat attgtggttg gtggaccccc ctgccaagta 5161 tactcattgg taggtattgg taaaatgcgg tcattaggca gaaaaattga aaatgaccct 5221 agaaattttc tgtaccagaa atttgtggag tttctagatt tttatcagcc attatttttt 5281 gtaatggaaa atgtggatag cctagtaaaa agaacaatat ttccgactat tcttagggaa 5341 ctagagtttg gtttaccacg aaaacgagaa aattatcctg gttatcgaat tcatcataat 5401 attctaatag cttcagatta tggtgttcct caaattagga aacgtctttt cattgtaggt 5461 gtacgtcaag atttagaata tgaatttgaa tttcctcaac cacttaaaag gaatccagtt 5521 tcagtagggg aggctattag cgatttgata ccacttaccc ctccatatct acctttgaag 5581 agcaaaaata gtggattacc tcaagaagat agcaaaaagt tttatctcac tcatccgcaa 5641 tcaagctatc agaaaaaaat gagaagagaa attactaaga tgccagagcc agacggagtt 5701 atgaatcata tatgtcgctc tcataaccca gtagatataa tttgttttgc catgcttgcc 5761 caaggtggaa aatatacaga tttacccgaa aatatgagac gttatcgttg ggatatattt 5821 gatgataagt ataaacgttt accttggaat aaaccagcgt ggactttgac tgctcatatg 5881 cgcaaagatt gtctggctta tattcatcct atacagaatc gtagtatttc agtaagagaa 5941 gcggctaggc tgcaaagttt cccggatcac ttcgtttttg atgcgcccat gaccagaatg 6001 tttgagttgg ttgggaattc tgtgccaccg cttttggcgg aagcgatcgc taaaccgatt 6061 gttaaacaag tacaaaatta ctatgagact aacccgaagg ttgagcaact 
cagcttactt 6121 tagtgtgttc agggagtgtg ccttaacgca tcactttgag tcatttaaat gaggaattgc 6181 aaaatgataa tattcgccgt acaggtgaaa aatatgtaat tataactact aagatgaatt 6241 gctctaccgg gagttacaac cggatggttc gtgaacgtgc taacgacgaa tatgacttgg 6301 ttgaaccccg tgcgcctgcg atgctggagt cattacgggc ttttggatac aacatccaga 6361 ctgcgatcgc cgatctgatt gacaacagta tctctgctgg ggcaaagaat gtatggctgc 6421 aattttactg ggatggatct gagtcttaca tatctatcct ggatgacggc aaaggcatga 6481 ctgaagcgga actagtgaat gccatgcgtc caggtagccg aaatcctttg gaagaaagag 6541 aaccaaatga tttgggaaga tttggtttag gactgaagac ggcatctttt tctcagtgta 6601 ggcggctcac tgtttgtgca aaggcagtta atcaaaactc tgtaactcgt cgctgggatt 6661 tagattatgt gagtcaaaca ggagagtggc ggttactccg ttctgcagca tcaggttcag 6721 aagaaagatt aaccgcctta gagcaaatgg aaagtggaac agtggtgctg tgggagagta 6781 tggatcgggt agttggtggg acaaaaactg acgatccaaa ggcacacaac cgctttttag 6841 aaatgattga agatgtggaa aagcatctga cgatggtatt ccatcggttt ttggagcgaa 6901 agaataagtt acaaatttgg attaatcagc gactaattga gccttgggac ccattcctca 6961 caaacgaaaa agcaacacag tggttgccag aagagaatct atacttccga gaagataggg 7021 tggttattca accttatgtg cttcctcatc actcaaaagt agattcgcaa acctatgaaa 7081 aagctgcggg tccaaatggc tggaatgctc aacaaggttt ttacatctac cgtaatgaaa 7141 ggatgcttgt tgctggtgac tggcttggtt taggtttaca gagggatgaa cattgtaagc 7201 tggcgcgaat ccaggttgat ttgccaaatt caatggatag cgactggaat atcgatgtta 7261 aaaagtctag agcgcgtcca cctgcatctt tacgggaaga cttcaaacgc atcgcaaaac 7321 taacgcgagc aagagcatct gatatatata gatatcgcgg aaaagttatt gccagaaaat 7381 attcagacaa ttatgtcttt acttggctga agaagcttaa acatggcaag gttttttatg 7441 ttattaaccc agagcatcca ttagttaaag aagcgctaaa tattccgaca gaatatcgtc 7501 aaattatcaa ggcattactt cgactaattg aagagactgt tcctgttcag caaatttggc 7561 ttgacagtgc tgtaagctca gagcaacaca gtcaaccatt tgagggagtc cctgcaagag 7621 aagtcagaga agtgatgatg caaatttatc aagcgttgat aaaagatggt ttaaccacct 7681 cagaagcacg gagcctgtta gtaaaaatgg aaccatttca gcactttgaa gaactgatag 7741 cgacattacc tgaatctact tattagtaaa gacaaagcat gaataattct aaggaatatc 
7801 aacaagcatt agattttgca ctagcaatgc tgaagcacaa tgacgaggtg acggacgagt 7861 taatccgtga aaaagttaac cttgctctta agatgataca gcttcaaggt tcaaatgaaa 7921 attttgaccc agatactttg atttgggatt tacagagcag atttacagtc agaatggcgc 7981 aagccaccat cctcgacgga gaagaacgtg tagattggct gcctaaacga cgggatagca 8041 ttgagtggcg cttctggaaa aggtatgaac gttacttgct agaggagaag aagttgtctc 8101 cactggttgt tcgtcgtata gacgaactta cagattcaat tattgagcgg atagaagatc 8161 cgactaagaa tggtcattgg gatcgtcgtg gcatggttgc agggcaagtt cagtcaggaa 8221 agactggtaa ctacactggt ttaatttgca aagcggcaga tgctggttat aggctaatca 8281 ttgtattagc aggtatacac aataatctcc gcagtcagac gcagattcgc ctagatcagg 8341 gtgttcttgg atataacact cgacaaaata tgacatttga cccaaacaat agacggactg 8401 gggttggtaa acttcgggga gagcagttac accctgttca ttcacttaca aatgctgaag 8461 aaagaggaga ttttcagata acaatcgcca aacacagtaa tattgactta cgagcggtgc 8521 caacattgct tgttgtcaag aaaaatggct caattctcaa aaatttaatc aattgggcaa 8581 ccaaaagaca tggtgaaaaa gacccagcta cagggcggct tattgtacat gatataccac 8641 ttttattaat tgatgatgaa tgtgacaatg catcaattaa tacaagagaa gacgatacaa 8701 atccaactac aatcaatgct cgtatccgtg aactactaaa tagtttttct caaagtgctt 8761 acgttggcta cacagctact cccttcgcca atatttttat tgatgttaat agtaataccc 8821 acaagcacgg agatgacctg tttccacgag actttattat caaccttcct gccccagaca 8881 attatgtcgg atcatctcgc gtctttggca ttactgccga tcctgattca ggactagaag 8941 aacaagatgg cttaccaatt gttcatacag tcaaagatta tcaagattgg atgcccgaca 9001 cccataaaac agatcatgtt ccgagtgaat tgccttactc actgaaaaaa gctgtgaaat 9061 cttttatcat ctcctgtgct gctaggatgg cacggggaca agataaagaa cataattcaa 9121 tgctggttca tgtaacgcga tggaatgaag ttcaaatctt agttaaaaac caagttcata 9181 atgaaataaa aaatatacag catcgattac gatatggaga aggtggttat caaaaaaaaa 9241 tttcagaaga atttagaagg ctttgggaag aagactttgt accgacaaca aaagctatag 9301 atgatcctga aaaaaaactg cttacatggc aagaagttga gaaattccta gaaccagcag 9361 cacaaaaaat tcaagtaaaa accattaacg gtaaagcaaa agaagtacta gattacgaag 9421 ataacccaga tggaatcagt gtgattgcaa ttggtggcga taaactttca cggggtctga 9481 
ctcttgaagg tttaactgtc agctacttcc tgagagcatc aaaaatgtat gatactttga 9541 tgcaaatggg aaggtggttt ggatatagac caggatacct tgatctctgc cgattgtaca 9601 catcagaaga attaatttat tggtatcaac acattacact agcgaacgag gaacttagac 9661 aggagtttga ctacatggcg atgctgaaca aaacgccatc taattttggt ctacgggtga 9721 gaacgcattc tgatggacta acaatttcca atgtaggaaa aattagaaaa ggaaaagttt 9781 tacgtgttac ttatgctggt gctattactg aaactgtagc tttcgataaa aacccagcca 9841 ttaataataa aaatattgat tcctttgtca aatttttgaa tttaattggt aaaccagaaa 9901 aggaacctag aatcaataac attgatgcct atgtatggac taatttggat ggaaatgaca 9961 ttgtagcttt acttaaagaa gttaagacac atcgggattc aattagggca aacagtcaac 10021 ttttggctga ttacgtcagt caacagatta tgaagaatga actcaggaat tggactgttg 10081 ttctgaaatc caaaaaagat gcaaaaaaac cgttagttgg taagtatgaa gttggtctat 10141 acaaacggac aggttcaaaa gccagcaact cagaaagata ctcaattcaa agacttgtta 10201 gccctgatga tgaggttatt taccttcctg gaggaagtcg gcaaattaag gacaaaaaga 10261 cttttagaga aaatagagat cctgatagag gtctgcttct aatttacccc cttgatccag 10321 cagaggcagg attggaagga aaccctattt tgggatttgc gcttagtttt ccaggtagtc 10381 aaaatgctag cagtgttgag tatctggtaa acaacgttta ctatgcacag gaatttggag 10441 aagaaaatca ggaatgagta ttcgtgaact ctggactgaa ctcgaaaaaa gtatcaaact 10501 aggtggttct gaatacctag ttcgcagagt cagaccagat tgtgcctgtg acttacgtat 10561 aggagttcaa gaaccaactg gaaacagaat gttacttcta aagatcagac gtagttctgc 10621 ctcatctatt gtagactttc caagttctga aggttttgaa gttcgacgaa ttttattacc 10681 tagtgatggg gaaaattatg ttactctaca actcgttctt acccaaaata ggtatgctga 10741 tatttttact agcttagtag atgatgttat agaaggagtc gctggaaaac aaaaagaaaa 10801 agcagcatta gaagaattta taatccggct aaggcgatgg caatcttttt tcaaacagca 10861 ttcaccggat ggactaagca aaactcaaca gcaaggatta tatggtgaac tctggtttct 10921 gcgtcaagtt ataattcccc aactgggttc tcgtcaaagt atccaatact ggactggtcc 10981 cagaggaacg caacaagatt tccaatttcc aaactgtgca gttgaggtta aaacaactgt 11041 agaaaagcag catcaaaaac tgagtatttc tagtgaacgg cagttagacg gtactggtac 11101 gggtacacta attcttgttc atttatccct tgatgtcaga caaggacgcg 
gtgagtcgtt 11161 acctgatatt gttaacagcg tcagaatctt agtccaaaat gaccccatag ccaaggaaga 11221 attagaaaca ctcctgttgg aagttggcta tctggatata catacccccc gttatgaaga 11281 aatcggctac acccaacgag aggttaacta tttcaaagtc gagggagatt ttcccagaat 11341 tgtcgaagcg gatttgccca atggcgttgg ggatgttcgc tacagcatta gcgttgctga 11401 atgtaaacgc ttctccctac cagaattaga tgtcatttct ctaatcgggt gcaactatga 11461 atgagcaaac caaattaatt cagtttgccg cagagcttat acaagaggta attaataatt 11521 ctgaagctgg acaagataat gaagatggtg aaggtgattc ttttcgcgaa gatgaattta 11581 cccgcctcat gattgaatat ctgactgatg ctggagaact agatgatgga gaagtttgct 11641 accatcgcaa tcgcggtatt aaagttaacg gctacagcat taatcaagat tttgaatgtc 11701 tgaatttatt tatttctatt tatactcaaa gtattccgcc tgtgactgtc accaagcaag 11761 aggttgaaac agcatttcgc aggcttacaa attttttaca gaaagcactc aagggatacc 11821 atctgtctat tgaggaagcc tccagtgttt tcgatatggc gcttcaaatt catgacttga 11881 gaacacaact tagccaaatt aggctgtact tatttacaga tggtcgtaca actattgatg 11941 tcaaacaaca tgaaacgatt gaaaatataa cttgctcgtt tcatgtttgg gatattgaaa 12001 gaacttaccg ttgtctgagt tcaggtaagc agcgggaaac cattgaaatt gactttgaat 12061 cccaatatgg ggtagcaatt ccttgtttgc caatgccaag gtctaattca gactacactg 12121 cttacatggt aatcattccg ggtgaaattc tttacaaaat ctacgctgaa tatggtcctc 12181 gtctgctaga acgcaatgtt cgttcttttc tgcaagcgag aggaaaagtt aacaagggaa 12241 ttcgacaaac aatcctgcaa gaaccccatc gttttttagc atataacaat ggcatttctg 12301 caactgcgga agctgtggag ttagttgatt taccgggagg cggtaaaggt atcaaatccg 12361 cacgggattt acaaattgtt aatggaggtc aaaccactgt atctatttat caagcagcca 12421 agaaagacaa agcagatgta tctaacattt atgtacaagc aaaactgtca gtagtagcgc 12481 ctgaaaaagc caatgaaatc gtccccctga tatcccgtta cgctaacaac caaaataaag 12541 tcaatgaagc agatttttca gcgaacgatc cgttccacat tcaaatcgag gaattttctc 12601 gcaccatctg ggcacctgca gtagatggaa cgcagcgaca gactcgttgg ttttatgaac 12661 ggacacgggg acagtacctt gatgtaaagg gacgtgaggg aactgcagca aaaaagaaaa 12721 tctttactac tacgcatccc acctcacaga aatttactaa gacagattta gccaagtttg 12781 agaatacttg gaatcagtta ccccatttgg 
ttagtcttgg tgctgagaaa aactttcgtg 12841 agtttacaat tcaattggct aaacgcggca aattccaacc agatgaagac tatttcaagc 12901 gactcattgc caaagctatt cttttcagaa aaacagaaaa gattgtgcaa gtccagcaat 12961 ttggtggcta tcgtgccaat attgtcacat atacacttgc ttacctaagt aacaaaacag 13021 cacaacaaat tgatttagaa cggatttgga gagaacaggg tctttcacca gcattgcagg 13081 aagcaattaa aatagtatcc tatcaagttc atcaagttat tatcaatcct cctggcggtc 13141 gtaatgtaac tgaatggtgt aagaaagaag aatgctggaa gcaaattcag actattgaga 13201 ttgagttacc aaatgaattt tggaatgaat taatttcagt tgatacaaag ccaaatcaaa 13261 ttgataaagg aatcgaaagt cctgatacag aagacttaaa agttattgta caaatcaacg 13321 aagtatcaga tgaaacttgg tttcagcttg ctcattgggc aaaagaaaca gataacttgc 13381 actcttggca acgaagccta gcgtttagtc ttggaaaact tgctgctaaa cgaaagagtc 13441 catcttataa acaagccaat gagggtataa agattttgca cgaagcagag caactagggt 13501 ttaaatatac acatgacagc aatggcaaga tttccgttta gtaacaaaaa tttctgccat 13561 aattgcagct gtcacctcac gcatgaaatg gaggcgttca gcgctccgcg ctgtagcgtt 13621 tgcgcagcgc ccccttaggg gctagctatc gcctgtatct cttttgatgt cgtatacgat 13681 ttctacgtca aaaccaaatt atcccgatga accacggttt ccgcaccaac ataacctaaa 13741 atagcagcaa tttcactaga acggcgtccg cgaatctttt gtaattcggt gctactgtag 13801 ttcaccaatc ctctggcaac ttcatgacca tttttatcac acaattgcac tgcttcttga 13861 gtgtcaaatt ccccttctac tgtggtaata ccagcagcta acaacgattt gccaccacca 13921 gaaattgcta gcactgctcc cgcatccaaa tagagtttcc cagcaggaat caaaccataa 13981 gctatccaac gtttacgggc ggaagttggt tctggttgtg gttcaaagtg agtaccaatg 14041 gtttcgccct gtaaaatttt ttcgatattg tgcggatatc gtccttgagt aatgacggtg 14101 cgaacaccag cagcgatcgc aattcgtgct gcagaaattt tggtcaccat accgccagta 14161 ccccattggg aaccttgtgt ccctgtttgc acctgtaatt ctgctaactc tttgatgttg 14221 ctcactaaag caatgggttg ggcatcaggt acagaacggg gatcagctga gtaaagccta 14281 tcaacatcgg tgagcaaaaa gagccaatct gcttccacta agctagcaac gagggcagaa 14341 agggtgtcat tatccccaaa tttcagttct tccactgcga ctgtatcatt ttcattcacc 14401 actggaatga ctcctagttc gagtaattct cggaacgtgt tgtagacgtt aaggtaacga 14461 ctgcgctgaa 
ctaagtcact acggctgagc aaaacctgag caattggctg ttgcagtgtc 14521 gtaaataaat cgtcgtacac tcgtattaat ctgccttgcc ctactgctgc aactgcctgt 14581 tttagggcga tcgccttagg acgttctgtt aaccctagcc gcccacaacc cactcccaca 14641 gcgccagagg ataccaaaat gaccttataa ccctgtcgtc gcagatgtga aagggtttct 14701 gctaaagtcg cgattgtgga aagtgccagt tgtccggttt ctggttgagt caggctggag 14761 gtgccaattt tgacgacaat tgttttagtc attagtcatt agtcattagt tattagtcat 14821 tcgttttatg acataggact agggacaatc aactaaattg tgaagcgcca gaccctctat 14881 aatcttgagg ctgacgcttc acttttttat attgagttgt tatcacttag ggtctttatt 14941 ttggctgacc cctttgttat ttatacttat tatgattgca gatttttgtt tgaaacaaag 15001 ttttttttat tttttcttta gatttacttt cctgactatt tatcaatttt tgtaaccctt 15061 cttgtcttag gagtaataga agcaccgaga atttcagata gccaaacttc aaagttgcgt 15121 atcggatggg agcgaagagc aacatctgga tgaataattg gttctaccag aatagtaaac 15181 attcctaagc gattacctgc caagacatca gtgaataatc tatctcccac catgcctacc 15241 tggtgtacag gtaaattcat cgctttgagt gcttgccgaa ttttacgtcg cgagggcttg 15301 gcagcaccta agaaataggg caggttgagc gatcgcgcaa ttccaccaat ccgagtttca 15361 ctcaggttat tactaaccaa acacaacttg acaaaggggc gcatttgttc tacccactgt 15421 cgcagttcca cagaagctga ggcggctcta attggtacta atgtttcatc tacatccaac 15481 accagcccca agagcccatt tttttggatc atgtctggtg tgaggttcaa cactgaacct 15541 tctaaaatca agctaggctg tatgagattg ttccagtgca tagtcattag tcattactca 15601 ttaatcattg gcattaattg acactcctcg ggctaaagcc gcgaggattc ttggttcgtt 15661 gactcgccac taagcagcag gtttgcacca actgcccagg agggcaaatc tcccacgcca 15721 ggtgcttcaa gtcggggaac ccgctgttag cacctggctc ccaagcgtaa gttcccgtgt 15781 gccccacggt actgagtcct cttttcagga tgttgattgc tgcattcacg tctctgtctt 15841 ctacgaaccc acaatgtgga cacacgtgag ttctagtgga tagggatttt tttactttct 15901 gaccacagtt agagcaattc tggcttgtat tatggggagg cacagcaact gtgaccttcc 15961 tatacttgtg accaaaatac tctaaccacg accggaaagt tgaccaacca gcgtcagata 16021 ttgatttagc cagatgacga ttacgtacca aacctttcac atttaagtct agcctgcggc 16081 aagccgcttc gcgtctacat aagctaccaa atcgttagat tggatgagga ggtgtgccaa 
16141 tctcttgcaa tactcttttc ttgcctaggt tgagtcggtg gggactatta tgtcccgcac 16201 ctcccctccg cgatccggac gtgcccattt ctgtgcagtt agagtcgatg tccgtattgc 16261 tacagcgcac ctctccttca gaaccggacg tgcgagtttc cccgcatccg gctcctagca 16321 acctgtgggt aactgggtac acaacccaaa atcccattca taaaccgttt tcaggaattg 16381 acagacctta atcaccgtct tgagccttcc aaacatttgg agttttgcat tcccctgttc 16441 gctgggcagt tgcagcttca gtccttccga cttgttcgca cgaccaacgt tttccttgga 16501 ttgtcggtct atgttccacc tagtccgatc tacgccgttt ttcggggcgc gtgtgtgctt 16561 gcccgtccgc gtcaatacgt cctgctacca atcccagtgg tattatctta cttagaatta 16621 agtcgattac agtccgcttg tctacgcatg gtctaatcca tttttagact tctcggttgc 16681 cgggggcaat ttcgttcccc catcctaccc gtctccttcc ggtaaggaat taatcacttc 16741 cctccaccca ggttgtcaac cctgagaagt tggagacgtt tttcctcgtt ccgttcccta 16801 gattgttttg ataccgaatg ggtcaccatt tcgcctacca ccaacccaaa ggttatgcag 16861 ctaatgccgg aactgctgcg ggtttatagg tgatgaagtc gagatttggc caggttctcg 16921 ctttcgggta aggcgctttt tcaagtgctt cgcatatagc tcaccctagt atctccccgt 16981 tagcgcaaag ctctgcgggg gaagcttccc ccgcaagact ttgcgccagt gttgttacct 17041 gcttgtttta caactgatta cgccctgttt ccagcttcgc tctggtaatt aacctcgacg 17101 tggactagac gcgggagtca cttctaaccc tgaaggggag gacttgcacc tccatccaag 17161 gaacagttat gagattttca aggttctacc ttgacgttct caccgcagct tatgtattag 17221 gctgcgaacg agtcgcactc cggctcccga tattcttagg gcttatccct tttacccatg 17281 tggatataat cgtgacaact ttcatgaact gctaacagat tttttgtttt gcagttgctg 17341 tggttgccgt ctttgtggtg caggtgtact cgttcttcac ttgtgaacct taagccacaa 17401 tatccgcatg tatggttctg tcttttgagg gctttggaag tttctccgtt atataattca 17461 ctattgcgag tgctccagta aattatgtct ccgtcatagg gagatttatt ccctgttacg 17521 ttgacgtgtt tatgttcgga gtgaggtact gctggaaacg atttgtcaat taatttattt 17581 gttgactctc gttttaaact ggcttccttg ttgaagactt tccacgttct gcttcttata 17641 ccaatccgta ttcacacaag gcaatactaa cgctgccatc tctcccaatg atggcgcg // LOCUS NODE_1870_length_17670_cov_5.12795917670 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole 
            genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 17670)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 17670)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference
                                                 protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers source 1..17670 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..407) /locus_tag="DP116_16260" CDS complement(<1..407) /locus_tag="DP116_16260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131002.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_16260" /translation="MTQALPKTKLVTFEEFVAWYPENSERRYELYDGVIVEMALPKGK HERVVGFLASQTTSEFLRLKLPYFIPKTVIIKPPLHESGYSPDVLILNNDNLVNEPVW EDESFITQTASISLAIEVVSQCVARVPRVEATD" gene complement(540..1969) /locus_tag="DP116_16265" /pseudo CDS complement(540..1969) /locus_tag="DP116_16265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872505.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" gene 2279..3670 /locus_tag="DP116_16270" CDS 2279..3670 /locus_tag="DP116_16270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998727.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD-dependent malic enzyme" /protein_id="PRJNA477356:DP116_16270" /translation="MADLTPNSSFSLTLRLEIPNRVGMLANVTKAIATSGGNFGQIDL IEQTRDISIREITVDAASSDHAEIIVQAVKAVPDIKVLNVYDRTFNLHRGGKISITSR IPLKSVSDLAMAYTPGVGRICTAIAQNPEEVYNLTIKRNTVAIVTDGTAVLGLGNLGP AAALPVMEGKAMLFKEFAGIDAFPICLDTQDTEKIIEAVKNIAPVFGGVNLEDIAAPR CFEIEARLRQELDIPVFHDDQHGTAIVTLAALYNALKVVQKSMGDIRIVINGAGAAGV AVARLLRKAGAEKIWMCDSKGILSTSRTDLTEEKREFAVKAQGTLAGALQGADVFIGL SAPGVLTPEMVRSMAKDPIVFAMANPIPEIQPELIKDDVAVMATGRSDYPNQINNVLA FPGVFRGALDCRAATITTTMYLEAASAIASLIKPSDLDKQHIVPSVFDERVVTAVAGA VQRAARQEGIARG" gene complement(4017..4763) /locus_tag="DP116_16275" CDS complement(4017..4763) /locus_tag="DP116_16275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315609.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="creatininase family protein" /protein_id="PRJNA477356:DP116_16275" /translation="MLLHLSTWQEVEAYLQQSGGIILPIGSTEQHGPTGLIGTDAICA EAIARGVGEATQAIVGPTINVGMALHHTAFPGSISLRPSTMILVLRDYITSLAKAGFT KFYFINGHGGNIATLKAAFSETYAHLEDIQIPNAQRVQCQVANWFMCGSVYKLAKELY GDQEGSHATPSEVAVTQFVYPEAIKQAPLSEEVGTGHRIYGAADFREKYPDGRMGSNP ALATPEHGKQFYELAVKELSNGYLEFLNAD" gene complement(4868..5143) /locus_tag="DP116_16280" CDS complement(4868..5143) /locus_tag="DP116_16280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950542.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cupin" /protein_id="PRJNA477356:DP116_16280" /translation="MEIKVEHQPSIEQLNELGVFKWGIWTKEVSKFPWTYDTQETCYF LEGDVIVTPDGGKSVQMGKGDLVTFPAGMSCTWEIRSDVKKHYCFDE" gene complement(5558..5923) /locus_tag="DP116_16285" CDS complement(5558..5923) /locus_tag="DP116_16285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878736.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase" /protein_id="PRJNA477356:DP116_16285" /translation="MLKTGNHGQRIFFPAFVHSGLEAHQTVNKNQIAQPKIDLLQPLR RKLDSLEIQNPKLAKFIAKVIPAQCPFERDILLFGRKVAHIPPMCKLNPLYDELVGLR FRALCYLADVCGEDIQAYC" gene complement(6605..6970) /locus_tag="DP116_16290" CDS complement(6605..6970) /locus_tag="DP116_16290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16290" /translation="MSEVQRLIIKDNGEEYELYIQSKTVAEVPEINDEQEVYRGIGDI LPTVNIQEFHAKLRGYTKLALGAFRNLPEAEEVTIKFGIKLGSKVGIPILVEGSSEGN FEIEVKCKFPENKKNSSSS" gene 7046..9877 /locus_tag="DP116_16295" CDS 7046..9877 /locus_tag="DP116_16295" /inference="COORDINATES: protein motif:HMM:PF00656.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16295" /translation="MARYALVIGIANYNNFRNLPKAVTDAEKIALVLREHGRFEVQPL PGKLIESENRWCVAPDKKLPGKELGSTLSKFLLEKAKNHEALIYFAGHGFEAATLTGK QKGYLATSDCTSDGQNAIAFDDFNDLIRESQLASLVVLLDCCYAGSFLEKSFFRSSFP VFHTKQDYCLITASREFERAREDVEGGIFTQAVLRGLSHDKADETTGEVNASDLFSFI SRELKQSGQEPIYMGGGRSIPLVWYPPTNPVVTGVVREECPYRGLEAFDKQHAQFFFG RKKVVEDILQKLTQAQFVPIIGASGSGKSSVVRAGLIPQLEKNGWRILDPMKPGIEPL AKLGAAFEPFFQRPREIQQLYDFIHNQQDGLHRVIERLPGSERFLLVVDQFEEVFTLC SKEEERRKFIDLLTQVVELSEATSLQSLRLAIVTTMRADFLEPCLSYPFLTQLIQNQA VYMPPLVGAELEQAIASPAALQGYRFEDGLLGEIIQDVGKEQGCLPLLQFALTELWEK RDSQKHQLTVEQYRAMGGVIGALDLHAENIYHSLTQQEQEWIKRIFLKLVRTGEGEKD TRQQQPKAKLLAIAGENEIGFVLDELIQERLLVSGQENLHREAWVDLAHEALIEGWQR FDEWRDKNRELRRLIDRVEDALQEWRKQPKNENLIMGGLLAQVREKWEEMEPDLDAVA KEFYQKSVAFEEQQRQLIQAASNAKSQFLANISHELRTPLNAIIGFSQLLRDDALDIS LSEEFIGDLESINIAGRHLLILINDILDLSKIAAGKMTVYPEAFQLATLINNVVLTVK PLVEKNANVLEVDFDEKLAIMYTDQTKLRQVLYNLLSNAAKFTTNGRVALTVKKETPD LNGNYAPEIITFTVEDTGIGMSYHQQQQLFQPFIQGDASTTKKYGGTGLGLAISRHFC HLMGGKILVRSETGVGSIFIVRLPLNVTE" gene complement(10034..10324) /locus_tag="DP116_16300" CDS complement(10034..10324) /locus_tag="DP116_16300" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16300" /translation="MAISYSRKNYSLPARRILNLTNRQDAKDAKKIKKKIGNLARLMG VSSEMEGKTLKIIIVETDELEAKKERFFSLLDKHSFALPANYQFDREELHGK" gene complement(10440..14012) /gene="mfd" /locus_tag="DP116_16305" CDS complement(10440..14012) /gene="mfd" /locus_tag="DP116_16305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcription-repair coupling factor" /protein_id="PRJNA477356:DP116_16305" /translation="MAFSSIVRALGRSGLTTELLSKLNRQQELLLSGIPRLPKGLVAS ALAQTQSKNLFVVCATLEEAGRWTTQLEAMGWQTVHFYPTSEASPYEPFDPETEMTWG QMQVLADLIQLARGRDEELGEAKSSSFPLPKMAVVATVAALQPHLPPAEAFKPFCFTL KRGMEFDLDTFSEEITKLGYERVPLVETEGQWSRRGDIVDVFPVSSELPVRLDWFGDE IEQIREFDPATQRSAALDKIDQLILTPTSFAPILMTALKDNAEFQALSAQLSDDSEVD IQDSTLVEGSRRFLGLAFDKPASLLDYLPENTLITIDEPEQCHAHSDRWVENAEEQWE LLGSRLAEEAGEAGGTGKISALLPRIHRPFDECLADTAKFPKLNLSELVEENTGINLA SRRVPVMPHQFAKIAETLRQERDRNFSIWLMSAQPSRSVSLLQEHDCPAQFIPNPRDY HAIDKQQVNHTPVALKYSGLAELEGFILPTFRLVVVTDREFYGQHSLATPSYIRKRRK AASKQVDPNKLRPGDFVVHKNHGVGKFLKLESLTLNNEIREYLVVQYADGLLRVAADQ VGVLSRLRTTNEKPPELNKMTGKAWENTKNRVRKAIKKLAVDLLKLYASRSQQKGITY PHDTPWQQELEDSFPYQATIDQLKATQDVKRDMESDRPMDRLVCGDVGFGKTEVAIRA IFKAVTSGGKQVALLAPTTILTQQHYHTLKERFAPYPINVGLLNRFRTAEERRDILKR LATGELDIVVGTHQLLGKSVTFRDLGLLVVDEEQRFGVNQKERIKSLRTQVDVLTLSA TPIPRTLYMSLSGIREMSLIATPPPSRRPIQTHLSPMNPDSIRTAIRQELDRGGQVFY VVPRVEGIEEIGTQLREMIPSARVAIAHGQMDESQLESTMLTFSNGEADILVCTTIIE SGLDIPRVNTILIEDAHRFGLAQLYQLRGRVGRAGIQAHAWLFYPKQRTLSDAARQRL RAIQEFTQLGSGYQLAMRDMEIRGVGNLLGAEQSGQMEAIGFDLYMEMLEEAIREIRG QEIPKVDDTQIDLNLTAFMPADYITDLDQKMSAYRAVAAAKSKEELSQIAAEWNDRYG AIPKPATQLLRVMELKQLAKKLGFSRIKPENKQHIILETQMEEPGWKNLAQNLSETLR SRFVYSPGKVTVRGLGVMKADQQLQTLIDALGKMQGAVVEEAVV" gene 14313..15638 /locus_tag="DP116_16310" CDS 14313..15638 /locus_tag="DP116_16310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16310" /translation="MSPTIEILIIVLLILANAVFVMSELAIFSVRKVRLQQLADRGDA RARVALELASSPNQFLGTVQIGITLLTIISGAYGEETIAKRLTPILSFIPLQGQYKQQ LAKGLAILVITYLTLILGELVPKRLALNHPEPIASVIAIPMRMLSKFTSPVVYLLSMS TETVLRLLGIKPSKEPLVTEEEIRVLIEQGTEEGTFEEAEQDMVERVFRLGDRPVSSF MTPRPDIVWLDLEDSTEENRQKIIDGGYSRYPVCQGGLDNVLGIIPVTDLLARSFCGE DLDLTVGLRQPVYVPESTRGLKVLELFKQTVTHMALVVDEYGVIQGLVTLNDVMIEIV GDVPSIDDQEDPQIVQREDGSWLLDGMLGVDDFFELFNVEELSSEHRGSYQTLGGFVM AHLGRIPSAADHFEWQGMRLEVMDMDGNRVDKVLVVPEQVQSGNNEKLD" gene complement(15718..16515) /locus_tag="DP116_16315" CDS complement(15718..16515) /locus_tag="DP116_16315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315615.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HAD family hydrolase" /protein_id="PRJNA477356:DP116_16315" /translation="MDRLLPLSEVSSTQCFSNVRLVATDMDGTLTKKGKFTTALLQSL QDLATAGIKVVIITGRSAGWVSGLAYYLPVVGAVAENGGLFFLGGSEKPVALTPIPDL VAHRQNLASAFQQLQTQFPQIQESADNRFRVTDWTFDVQSLSIDELKTLSHLCQQMGW GFTYSNVQCHIKPLGQDKANGLLQVLQEYFPEYTPEQVVSVGDSPNDESLFDHRYFPL SVGVANVLEYANQLQHHPVYMTSAAEGEGFCELAQMIIDAIPQAVSL" gene complement(16889..17605) /locus_tag="DP116_16320" CDS complement(16889..17605) /locus_tag="DP116_16320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315617.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_16320" /translation="MLNKEPVISIKHINHYYGKGILKRQILFDINLDIYPGEIVIMTG PSGSGKTTLLSLIGGLRSVQEGSLQFLGVELYRASQKKLVNIRRKIGYIFQAHNLLEF LTARQNVQMAVELNKYISQEQAIAKSEAILKAVGLGERINYYPENLSGGQKQRIAIAR ALVNSPPLVLADEPTAALDKQSGRDVVELMQHLAKEQGTSILLVTHDNRILDIADRIV EMEDGILVRDSQGKVKQLGG" BASE COUNT 4913 a 3740 c 3795 g 5222 t ORIGIN 1 tcgtcagttg cttcaacgcg gggaacccgc gcaacgcact gactcacaac ttcaatcgct 61 aaagatatcg atgcagtctg agtgatgaat gattcatctt cccatacagg ttcatttaca 121 agattgtcgt tatttagtat cagtacatct ggggagtatc cagattcatg aagaggtggt 181 ttgatgatta cagtttttgg tatgaagtac ggaagcttta gccgcaggaa ttcagatgtt 241 gtttgggaag ctaaaaaccc aacaactctt tcatgtttcc cttttggcaa agccatttct 301 acaatcactc catcatataa ttcataacga cgttcggaat tctcaggata ccaagctacg 361 aattcttcaa aggttactag cttggttttt ggtaaagctt gagtcatggt tattacctca 421 actaaggata gtacataagt aggtcaacat tgagaaacat aaaatagaat ctttgcgatt 481 gcttcgctcc actccgtttc actcgcaatg acatttcacg tttaattagg ttgagctact 541 tatctggacg gcgtggacgt acttttacga atctccaacg aattcatcat tgcttgtgca 601 gtttctaaaa aagcatcata ttgactttct tctgctgtgt aagtcacgat gtatgctttg 661 ttgtctttta ctgtccaaac tgccatcctc ttaacattaa attgctcttg ctttcctgtg 721 taaataactt catgtgctgg tagctttgcc agtttagttg gatgcgattg attgattcta 781 ggatttgtta aatatcgggt aatttgatta acttttaaat ttgtataatc aggtaaggaa 841 atagattttt tcaagtcttc catttctaca gatagttctg gtttaaaaga atttgaatca 901 ttattttgag gagaaaaaaa tttcgcgaca tcacccgtaa atctatcttc tatcttttga 961 attatccaat ctttaggata tttcatccta atcagaaaat agttagaact atcataagtc 1021 aaaaatgagg ttttagttaa cgatgaatca ttatgtgatg aagcgaatat tttctctcgt 1081 atttctggaa ccaaaacaac tatacaaggg acaagagcaa ccaacaccaa agctaaggca 1141 atctttctat tttgccactt ctttaatttc ctttttgtga ctaagcttaa aagcgcttgc 1201 gaaacttcat tcactgactg atagcgctta caagaatcat aacgcaccat tttgtctaaa 1261 atatctgcta gctgaggttt aacttgtacc aaatctcgcc aaacaacttc atgagtctct 1321 ggattttttg ggagttgttt 
aggcaaaatt cctgtcaggg cttggatagc agtgataccc 1381 aaagcataaa tgtcgctgtt aaattctgga ttgccaatag cttgttcgct tggcacataa 1441 ccgggagtgc caatagggac tgtctttcct tgagagttag gtctaaaggt attgattttt 1501 tctttgactg ccccaaagtc aatcagaact aacttgccat ctgaggtacg tctaataata 1561 tttgaaggtt taatatctcg gtgaataata ttctgctgat gaacaaactc taatattttt 1621 aaaatatctt gcaaaaaatt aataacttta tcttcactcc actgtctatc tggtataatc 1681 tcgttgctga gaatatcacc gtttataaat tcttgaacta aataaaattc tccattttcc 1741 tcaaaatgcg ctgaaagtcg cggaatttgt acaggacttg tgcttccgtt tcaaataaac 1801 gtgtggctgt ttgcaaaaca aacggatcgg aaagctgggg cttgagttgc ttaactatgc 1861 aatatggttt gccaggtaag tctttatcta gagccagata agtctcacta aaccctgttc 1921 ccaaaaattt tataatttcg tagcgccctc ggagcgttct ttccagcatt tcttttacct 1981 aaaggtgcac aagttcttca cctgatggtt gataccgatt gttgcacaga atctgattta 2041 gcagaagata tctacgccat gatacggaac ctcaccctgc cctgtcgggc atccctctcc 2101 ttagtaagga gagggaaagt taaactgtgt aggttgccca acctaccttg ttctatgttc 2161 tatatacgcc attcttagca ctggaagaag tcaaacaggt gggtggagat ggagagaata 2221 agattgatta caaatatttg gtctaaaatc taaatctaaa acccacaatc aacaaagcat 2281 ggcagatctg actcctaatt ctagttttag cttgacactc cgcttggaaa ttcctaaccg 2341 cgttggaatg ttagctaacg taaccaaggc tatagcaacc agtggcggta attttggtca 2401 aattgattta atcgaacaaa caagggatat ttccattcgc gaaatcaccg ttgatgcggc 2461 tagcagtgac cacgctgaga taattgtgca agcggtgaaa gctgtgccag atattaaggt 2521 gctcaatgtc tatgatcgca cctttaattt gcatcgtggc gggaaaatca gcattaccag 2581 cagaattccc ctaaaaagtg tgtctgattt agcgatggct tatacgccgg gagttggaag 2641 aatctgtact gcgatcgccc aaaatcccga agaagtttac aacctcacca tcaaacgcaa 2701 cactgtagcc attgttactg atggcaccgc cgttttagga ttgggaaatc ttggtcctgc 2761 agccgcctta ccagttatgg aaggtaaagc catgctgttt aaggaattcg ctggtattga 2821 tgcctttcct atctgccttg atacccaaga tactgagaag attatcgaag ctgtcaaaaa 2881 tattgctccg gtatttgggg gtgtcaattt agaggatatc gctgctcccc gctgttttga 2941 aattgaagca agactacggc aagaattaga tatccccgtt tttcacgatg accaacatgg 3001 tacggcaatt gtcactttag cagcgttgta 
taacgctctc aaggtagtac aaaagtcaat 3061 gggagacatc cgcattgtga ttaacggtgc tggggctgct ggcgtagctg tcgcccggtt 3121 actcagaaaa gcaggagcag aaaaaatttg gatgtgcgac tcgaaaggta ttctttctac 3181 cagtcgtact gacttgacag aagaaaagcg cgaatttgca gttaaagcac aaggaaccct 3241 agcaggtgct ttacaaggtg cagatgtgtt tattggtttg agcgcaccag gagttttaac 3301 accagaaatg gtgcgttcta tggcgaaaga tccaattgtg tttgcaatgg caaatcctat 3361 tcccgaaatt cagccagagt tgatcaaaga cgatgttgca gttatggcaa caggtcgcag 3421 tgattacccg aatcaaatta acaatgttct ggcatttcca ggtgttttcc gtggtgcttt 3481 ggattgtcgg gctgcaacaa ttactaccac gatgtacttg gaagcggcga gtgcgatcgc 3541 atccctaatt aaaccctcag accttgacaa acaacacatc gttccttctg tatttgatga 3601 gcgagtcgtg actgctgttg ctggggctgt gcaacgtgct gcgcgtcaag agggtattgc 3661 tcgcggttaa ttaactcatc aagactagca ctctgtgcga tcgccctttg cgcggacact 3721 ctgtaccatt acatccgttt ccattgaaat aatttctcca ggacttacgc aaaattatga 3781 aaaaacaaac cgcatagacg cggaggacac aaaggaataa gagttttaga gagttcttgc 3841 gtaagtccta ttctctcttc tcttcctctg cgttctctgc gcctctgcgg ttttttaatt 3901 attcagattc aaccagaaac gatataaaat cttcctcccc ccaagcaagc cttaaaaaga 3961 aaggaagata aaagccccct ttttaagggg gttgggggat caaatctttt catcattcaa 4021 tccgcgttca aaaactccaa atacccattg ctaagttctt tcaccgccaa ctcataaaac 4081 tgcttcccat gttcaggtgt cgccaaagct ggatttgaac ccattctccc gtctggatat 4141 ttctcgcgaa agtccgctgc accataaatc ctgtgtccag taccaacttc ttctgaaaga 4201 ggtgcttgct taatcgcttc tggatacaca aattgagtca ctgcaacttc gcttggtgtt 4261 gcatgcgaac cttcttgatc cccatataat tcttttgcta acttgtaaac ggaaccgcac 4321 ataaaccagt ttgcaacttg gcattgcact cgttgcgcgt tgggaatttg tatatcttcc 4381 aaatgggcgt atgtttcaga aaaagcagct ttgagggtag ctatgttacc gccgtgtccg 4441 ttgataaagt aaaattttgt aaaaccagct ttagctaaac tcgtgatgta gtctcgcagc 4501 actagaatca tcgtgctggg acgcagactt atactaccag gaaaagctgt gtggtgtaat 4561 gccatgccca cattaattgt gggaccgact attgcttgag tggcttcacc tacaccacgg 4621 gcgatcgcct ctgcacaaat cgcatctgta ccaattaatc ctgtaggtcc gtgttgttct 4681 gtcgaaccaa taggcaaaat aataccccca gactgctgga 
gataagcctc gacttcctgc 4741 caggtgctta aatgcaataa catttttatt ccgcgtcctt atcaaggttg caccttttta 4801 aggatagatt atcaaccctg aattaggaat tatggaaaca ctcttcattc aataattcct 4861 aaattcctta ctcatcaaaa cagtagtgtt tcttgacatc gcttctaatc tcccaagtgc 4921 aggacattcc agccggaaaa gtgaccaaat cacctttacc catctgcact gatttgccac 4981 catcaggtgt aactatgaca tcaccttcta aaaaatagca agtttcttga gtgtcataag 5041 tccaaggaaa ctttgagact tcttttgtcc aaatccccca tttgaacaca cccagttcgt 5101 taagctgctc aatgctcggt tgatgctcta ccttaatttc cattgtttat ctcctacaag 5161 gtgatcttaa aagaaggaat tataggtatt ttaaaatcaa agctgttcag taggtcggta 5221 caaatgaagt taacttgttc ggatcgtcag agtactgcgc ccttgctgat tctttgacag 5281 taggtgtcgt ctttcacaat cctttggagg tgagagataa ctcaactatt ggagatgatt 5341 tagtactgtt gtgctctatg gttggcaata atctcacgat tgcaactggg agacattagt 5401 cgtcgttggt gtgagtttat ctgatgttgt acaactcgcg tgagagtgca gtcattataa 5461 cagaggagta agctgacgct ttaaagtcaa tcatacatta agatgaaact tgcactgcga 5521 gccaagctat gaccttatga agtgatgttt tgggagttta acagtaagct tgaatatctt 5581 caccgcagac atcagccaaa taacacaaag cacgaaaacg taatcccaca agttcgtcgt 5641 acagcggatt taacttgcac attgggggaa tgtgggcaac tttgcgacca aacagaagaa 5701 tatctcgctc aaaaggacat tgagccggga tcacttttgc aataaatttc gctaattttg 5761 gattttgaat ttctaatgaa tcaagcttac gacgtaatgg ttgaagtaaa tcaattttcg 5821 gttgagctat ttgattttta ttgactgttt gatgagcttc caatccagag tgaacaaaag 5881 cagggaagaa aatacgctga ccgtgattac cagttttgag gatagtcata agtaagttaa 5941 ccttgaactt tgaatggttt ggttaggggt ttttggttgt ccataaaatc catttagcta 6001 atttttgaaa tagaatcatg gcaataccaa atatatgtat acacggaata tttacttttg 6061 taaacaccgt atattatttt tatttaaact gaccgtagtg tgaaaaactg tcctacctaa 6121 tacaatacta taaaaaaagt ctaaagaagt tttttatact tattgaaatc tgatatttgt 6181 aatatgcatt tcatatcttg aatgtaaacg acctcggcta aaaccacaca gtatcttgag 6241 gtgtagtttt tttgcctaga agagtcgttt agttcatacc taagctatat attgttttta 6301 tgtcacctat aaacacttct tattgtaact attggttcac cagtttggtg tgagataaat 6361 cacaaaagac caagactcac atctgacatc agctacttta ttgcaatttt 
tgtcgggtgg 6421 gcactgcaaa ccctgcgttc aaaccctttt ttggttgtgt gcagtgccca ctcgaagggt 6481 tttgagcaat tttgtcgagg gttagcctga cgataacaag gtatccacaa ccgcacacga 6541 ccgatataaa ctcgattttt tgagttttgt caagtgaacc gagatacgta ataaagtttg 6601 ctacttaact gctgctagaa tttttcttat tctcaggaaa cttacatttc acttcaattt 6661 caaaattgcc ttcactagag ccttccacca agataggaat accaactttg ctacctagct 6721 tgatgccaaa tttgattgtg acttcttcag cttcaggtaa gttcctgaat gccccgagag 6781 ccaacttagt ataaccacga agcttcgcgt gaaattcttg aatgttaact gtaggcagaa 6841 tatcgccaat accacggtaa acttcttgct cgtcgtttat ttctggcacc tcagcgacgg 6901 tttttgactg aatataaagc tcgtactctt caccgttatc tttaattatc aggcgttgta 6961 cttctgacat tggtagcctc tagttattgc agattactat gttagttaga atactctaaa 7021 atgctcatat gcttggtcgg acagtatggc tcgatatgcc ttggtgattg gcattgctaa 7081 ctataataat ttcagaaatc tgccaaaggc tgtcaccgat gcagaaaaaa tagcgctggt 7141 actccgtgaa cacggtcgct ttgaagttca gccgttacct ggtaagttaa tcgagagtga 7201 aaatcgctgg tgtgtcgcgc cagacaaaaa gctacctggt aaagaactgg gttcgacact 7261 gagcaagttt ctgttggaaa aggcgaaaaa tcacgaagca ttgatttatt ttgctggaca 7321 tggatttgaa gcagcaacct taacaggtaa gcaaaagggc tatctggcaa cttctgattg 7381 taccagtgat ggacagaatg cgatcgcctt tgacgacttc aacgatttaa tccgcgaatc 7441 tcagcttgca agtttagtgg tactgctaga ctgctgctat gctggttctt ttctagaaaa 7501 aagttttttc agatcaagtt tccccgtttt ccacactaaa caggactatt gtctaataac 7561 cgcctctcgt gagtttgaaa gagcgcggga agatgtcgaa ggaggcattt ttacccaagc 7621 agttttgaga ggattatctc acgataaagc tgatgaaact actggtgaag taaacgccag 7681 cgacttgttt agctttatat cgcgggaact taagcagagt ggacaagaac caatctacat 7741 gggcggagga agatcaattc ctctggtttg gtacccgcca acgaatccgg tagttactgg 7801 ggttgtgcgt gaagaatgtc cttatcgagg tttggaagct tttgacaagc agcacgcaca 7861 atttttcttt ggtcgtaaga aggttgttga ggatatttta caaaaactta ctcaggcgca 7921 gtttgtccca ataatcggcg cgtcgggaag tggtaagtct tccgtagtgc gtgcgggttt 7981 gattcctcag ttggagaaga atggctggcg aatattagac cccatgaaac cagggattga 8041 gccattggca aaattgggag cagcctttga accatttttt caacgtccaa gggagattca 
8101 gcagctttat gatttcatcc acaatcaaca ggatggcttg catcgcgtaa ttgaacgtct 8161 cccaggttca gagcgtttct tgttagttgt ggatcaattt gaagaagtgt ttactctttg 8221 ttccaaagaa gaagagagac gcaaatttat tgatttgtta acccaggttg tagaactttc 8281 agaagcaacg tctctacaaa gtttgcgctt ggcaattgtt accaccatgc gagcggattt 8341 tctcgaaccc tgtttgagct atccattcct gacacagttg attcaaaatc aggcagtgta 8401 tatgccgccg ttggtggggg cagaattgga acaggcgatc gcatccccag cagccctcca 8461 aggttatcgt tttgaggatg ggttgctagg agagattatc caagatgtgg gtaaagagca 8521 gggatgtttg ccactgttac agtttgccct aacagaactt tgggagaaac gagacagcca 8581 aaaacatcag ctaacagttg aacagtatcg ggcgatgggt ggtgtgattg gtgcactgga 8641 tcttcacgcc gaaaatattt atcacagttt gacacagcag gaacaggaat ggattaagcg 8701 gatatttttg aaactagtgc gaacgggtga aggggaaaag gatactaggc agcagcaacc 8761 taaagccaaa ttattagcca ttgctggtga aaacgaaatt ggttttgttt tagatgagtt 8821 gattcaagaa cgcttgttag taagtggaca agaaaatctg cacagagaag cctgggttga 8881 tttagcacat gaagctttga tcgaaggttg gcagcggttt gatgaatggc gtgataaaaa 8941 tcgagaacta cggcgattaa ttgacagagt ggaagatgcc ctacaggagt ggcgcaaaca 9001 accaaagaat gaaaatttaa tcatgggtgg attgctggct caagtccggg aaaaatggga 9061 agaaatggaa cctgatttag atgctgtagc gaaagagttt taccaaaaaa gtgttgcttt 9121 tgaagaacaa cagcggcaac tcattcaagc tgctagcaac gcaaaaagtc agtttttggc 9181 gaatataagc catgaattac gtaccccatt gaatgccatt attggtttta gtcaacttct 9241 acgagatgat gctttagata tcagcttatc agaagaattt attggggatc ttgaatctat 9301 caatattgca ggtaggcatt tactaatatt aatcaacgac attcttgact tgtcgaagat 9361 agcagcagga aaaatgactg tttacccgga ggcatttcag cttgcaacgc ttattaataa 9421 tgtcgttttg acagtgaagc ctttggtgga gaaaaatgcc aatgttttag aagtagactt 9481 tgatgaaaaa cttgccatca tgtacaccga tcaaacaaag ttacgacagg tactgtacaa 9541 cctattaagc aatgctgcca agtttactac taacggcaga gtggcactaa cagtcaagaa 9601 ggagacgcca gacttaaatg gaaactacgc tcctgaaatt attacgttta ctgttgaaga 9661 tacaggtatt ggtatgtcct atcaccaaca gcaacagcta tttcaacctt ttatccaagg 9721 agatgcttca acaacgaaaa agtatggtgg taccggactg gggttagcaa ttagccgtca 9781 
cttttgtcat ctgatgggtg gtaaaattct tgtcagaagc gaaactggag tagggtctat 9841 tttcatcgtt cgtctaccat tgaatgtgac tgaataagct aataagacat attgttcgcc 9901 tccgctacca tcccatcttc acaaatccac aaaccgaaga agcaaaagcc gcttgtgaaa 9961 cctatcgtaa acatatttgg ggcgcataag ggctaaaaat agatagatcc aaaggttggt 10021 gtctatcatc accttatttc ccatgcagtt cctctctatc gaattgatag ttggcgggca 10081 gagcaaatga atgtttgtct aaaagactga aaaatcgttc tttttttgct tcaagctcat 10141 ctgtctctac aataattatt ttgagagttt taccttccat ttcagaactt actcccatca 10201 agcgtgcaag attacctatc ttctttttga ttttcttggc gtccttggcg tcttggcggt 10261 tcgttaaatt aagtattctt ctggcgggaa gggagtagtt tttccttgag tatgagattg 10321 ccattttcac aagtagcttg cagaacttta tacataaaat cactgtatgc catttaagtt 10381 tgaattaaat attgggaatt ttggatattc aagacgcgat gcatcgcgtc tctacaattt 10441 taaacaaccg cctcctccac aacagcaccc tgcatcttac ccaaagcatc aatcaacgtt 10501 tgcaattgtt gatctgcttt catcactcct aaaccccgca ctgtcacttt accaggagag 10561 taaacaaagc gtgatctgag ggtttccgac aaattttgag ccaagttttt ccaaccaggt 10621 tcttccattt gcgtttctaa aatgatgtgc tgtttgtttt ctggtttaat gcggctaaat 10681 cctagtttct tcgccaattg tttgagttcc attacccgca acagttgagt cgctggttta 10741 ggaattgcac catatcgatc attccactca gcagcaatct gcgataattc ttctttagat 10801 tttgcagctg caaccgcacg gtaagcactc atcttttggt ccaaatcggt gatgtaatcg 10861 gctggcataa acgctgtcag gttgagatca atttgggtat catcaacttt aggaatttct 10921 tgccctctga tttcacggat agcttcttct agcatttcca tatataaatc aaaaccgatc 10981 gcctccattt gtcctgactg ttctgcacca agcaagttac ccacacccct aatttccata 11041 tcccgcattg ctaattgata gccggaaccg agttgcgtaa attcctggat tgctcgtaac 11101 cgctgacgtg cggcgtcgga taacgtccgc tgtttgggat aaaataacca tgcatgagct 11161 tgaattcctg cacgaccaac acgacctcgt agttgataca gttgagctaa tccaaagcgg 11221 tgagcatctt caattaaaat agtgttgact cgcggaatgt ccaaaccaga ttcaataatc 11281 gtagtacaaa caaggatgtc tgcttctcca ttgctgaaag ttagcattgt tgattctaat 11341 tggctttcat ccatttgacc gtgggcgatc gcaactctcg cactcggtat catctcccgc 11401 aactgtgttc ctatttcctc aattccctca actcgcggaa cgacgtaaaa 
cacctgtccc 11461 cctcggtcga gttcttgacg aattgcagtg cgtatgctat ctggattcat gggtgacaaa 11521 tgagtttgaa tcggtcgtct ggatggaggt ggtgtcgcaa tcaaactcat ttcccgaatc 11581 cccgataagg acatatataa agtacgggga atcggagttg cagaaagagt cagcacatca 11641 acctgagttc tcaagctttt gattctttct ttttggttca ccccaaaccg ctgttcttcg 11701 tcaactacca aaagtcctaa atcgcggaag gttacgcttt tacctaaaag ttggtgtgtg 11761 ccgacaacaa tatctaactc tcctgtcgcc agtcgtttca aaatatcgcg gcgttcttca 11821 gcagtccgga agcgattaag taacccgacg ttaattgggt agggcgcaaa gcgttctttt 11881 aaggtgtggt aatgttgctg agtcaggata gtagtgggtg caagtagtgc tacttgtttt 11941 ccaccagagg tgacagcttt gaaaatagcg cgaattgcga cttctgtttt tccaaaaccg 12001 acatctccac aaactaggcg atccattggg cgatcgcttt ccatatcccg tttcacatcc 12061 tgagtcgctt taagttgatc aattgtggct tggtaaggaa aagagtcttc taattcctgc 12121 tgccaaggtg tatcgtgtgg gtaagtaatg cctttttgtt gcgatcgcga tgcatacagc 12181 ttaagtaagt ccaccgccaa tttcttaatc gctttgcgga ctctattctt tgtattttcc 12241 caagccttac ccgtcatttt gttgagttct ggtggtttct cgtttgtcgt acgcaaccgc 12301 gacaaaacac caacttggtc agctgcgaca cgcaataagc cgtctgcata ctgcaccacc 12361 aaatactcac gaatttcgtt gtttagtgtg agactttcta gcttgaggaa tttacctaca 12421 ccgtggttct tgtgaaccac aaaatctccc ggacgcagct tattgggatc aacttgttta 12481 gaagcagctt ttctccgctt acggatgtag ctaggagtcg ctagggagtg ctgaccataa 12541 aattcacgat ctgtaacaac aacaagccgg aacgtaggta aaataaaacc ttcgagttca 12601 gcaagaccgg aatatttgag ggcgactggt gtgtgattaa cttgttgctt gtcaattgcg 12661 tggtagtcgc gggggttagg aataaactgg gcgggacagt cgtgttcttg caacagggat 12721 acagaacgcg aaggttgggc agacattagc caaatcgaga agttgcgatc gcgttcttgt 12781 cgcagtgttt ctgcaatttt agcgaattgg tgcggcataa ctggcaccct ccgactggca 12841 agattaatgc cggtgttttc ctcaaccagt tctgatagat ttagtttggg aaattttgct 12901 gtatcagcga gacattcgtc aaaagggcga tgaattcttg gtagcaaagc agaaattttc 12961 cctgttcccc ctgcttcccc agcttcctct gctaacctgc ttcccaaaag ttcccattgt 13021 tcttctgcat tttctaccca gcgatcgcta tgggcgtgac actgttctgg ttcatcaata 13081 gtaattaacg tattttcagg taaataatct 
aacagagaag caggtttatc aaaagctaat 13141 cctaaaaagc gacggctacc ctccacaagt gtagagtcct gaatatcaac ttctgagtca 13201 tcgcttaatt gagcactcag cgcttggaac tcagcattat ctttcagtgc tgtcatcaga 13261 atgggagcaa agcttgtggg agttaggatc aactggtcta ttttgtcaag ggcggcggaa 13321 cgttgggttg ctgggtcaaa ttctcgtatc tgctcaattt catcgccaaa ccaatctagt 13381 cgtacgggta actctgagga cactgggaaa acgtcaacaa tatcgccccg ccgactccac 13441 tgcccttctg tttccaccaa aggaaccctt tcgtacccca gtttcgtgat ttcttcactg 13501 aaggtatcca agtcaaattc cataccacgt ttcagggtga agcaaaaagg tttaaaagct 13561 tctgcgggtg gtagatgagg ttgtagggcg gcgacagtag cgacaaccgc cattttgggt 13621 aaggggaatg aggaagattt tgcttcccca agttcctcat ctcttcctct ggcaagttgt 13681 atcaaatccg caaggacttg catctgtccc caagtcattt cggtttccgg gtcaaaaggt 13741 tcgtatgggg acgcctcgga ggttgggtag aaatgtaccg tttgccaccc cattgcttct 13801 agttgtgttg tccagcgtcc ggcttcttcc agagtcgcac agaccacgaa caaattcttg 13861 ctctgggttt gagccaacgc cgaagcgacc aaacctttgg gcaagcgagg aataccactt 13921 aagagcaact cttgttgccg attgagctta gagaggagtt cagtggtgag cccagaccgg 13981 cccaaagcac gcacaatgga agaaaatgcc atagatcaca aaattatgaa gaagtccttc 14041 tcgcgggcaa gcataatagt aacgcttacc acaactattt taaaagtcta gacagagaaa 14101 gaagtatagc tatagagtta cctggtacaa tgaccactta ctaatccact gactcatggt 14161 taatatacga atcagtgtaa tcgttgctaa ctcatgactt atagttgatg acttttttgt 14221 atgcagagtg tttcgtaatc acacgttgca tactagatac taagaggtta aggttaagca 14281 cagtttgccg atggtatcgg cagtttttac agatgtctcc aacaatcgaa attctaatta 14341 ttgttctttt aattcttgcc aatgctgtgt ttgtcatgtc agaattggcg attttctcag 14401 tacgaaaggt gcgtctacaa caacttgctg accgaggcga tgcgagagca cgcgttgctt 14461 tagaactcgc atcctcgccg aatcagtttc tagggactgt tcagattggg atcacactcc 14521 tgaccatcat ctctggtgcc tatggtgaag aaactatcgc caagagacta actcctattc 14581 taagttttat ccctttgcag ggacaatata aacaacagtt ggctaaagga ttagcaattt 14641 tagttattac atatttgaca ctgattcttg gtgaactggt acctaagcgg ctcgctttaa 14701 accacccgga acccattgct tccgttatcg caataccgat gcgtatgttg tctaaattta 14761 catctccagt 
cgtttatttg ttgagtatgt ctacagaaac ggtgctgcgg ttgttgggta 14821 tcaaaccatc caaagagcca ctcgtgacag aagaagagat tagagtcttg attgaacaag 14881 gtactgagga gggaactttt gaggaagcag aacaagatat ggtcgagcga gtttttcgct 14941 tgggcgatcg ccccgtcagt tccttcatga caccccgacc ggatattgtt tggctggatt 15001 tggaggactc taccgaagaa aaccgccaga agattattga tggtggttat tcccggtatc 15061 cggtttgtca ggggggactt gacaacgtgc tgggtatcat cccagtcacg gacttgttag 15121 cccgaagttt ctgcggtgaa gacttggatt tgacagtagg attgcgacag cccgtatatg 15181 tgccagaaag cacccgaggc ttgaaagttt tggagttgtt caagcaaacc gtcacccata 15241 tggcgctagt cgtagatgaa tatggtgtga ttcagggatt agtcactctt aatgacgtca 15301 tgatagaaat agtcggtgat gttccttcta ttgatgacca ggaagaccct caaattgtgc 15361 aacgagagga cggttcctgg ttgttggatg gtatgttggg tgtagatgac ttttttgaac 15421 tatttaatgt tgaggagttg tcgtccgaac accgaggaag ttatcaaaca ttgggtggtt 15481 ttgtgatggc gcatctaggt cgcataccct cagcagcaga tcattttgaa tggcaaggta 15541 tgcgtttgga agtcatggat atggatggta accgtgttga taaagttctc gttgtgcctg 15601 agcaagtgca atctggtaat aatgagaaat tagattagta tcagacgtag gggagccagt 15661 gctgcaggag ggtttccctc cgtaggcatc tggcgttggg catgaacaat gcccacccta 15721 caaactaaca gcttgcggta ttgcatctat aatcatttgc gccaattcac aaaaaccttc 15781 tccttcagca gccgaagtca tatatacagg gtgatgctga agttgatttg catactccag 15841 cacatttgcc acacccacag acagtggaaa ataacggtga tcaaataaac tttcatcgtt 15901 gggactatcg ccaacactca caacttgttc tggagtgtac tcaggaaagt attcttgcaa 15961 cacctgtaat aacccatttg ccttgtcttg tcccagaggt tttatgtgac actgtacgtt 16021 actgtaggtg aaaccccaac ccatttgctg acatagatga ctcagggttt tgagttcatc 16081 tatgcttaaa gattgcacat caaatgtcca atcagtcacg cgaaaacgat tatccgcaga 16141 ttcttgaatc tggggaaatt gagtttgtaa ttgttgaaaa gctgatgcca agttttgacg 16201 atgtgctact aagtcaggaa tgggtgttaa agcaactggt ttctcacttc cacccaaaaa 16261 aaacaaaccg ccgttttcag ctacagcacc aacaacgggc agataatacg ctaaaccact 16321 cacccaacca gcactccgtc cggtaataat tacgaccttt ataccagcag ttgctaagtc 16381 ctgtaaactt tgtaatagtg cggtggtaaa ttttcctttc ttggtcaagg taccatccat 
16441 atctgtagca accagacgaa cattgctaaa gcattgagtc gatgaaacct cagacaaggg 16501 caggagtctg tccataaaag tttcgtaaaa agctgtaaaa acactaattt acagcaatac 16561 ggtactcaga attgatccca ctaattgttt ttttacaccc ttaccccctt atatcccaat 16621 actgcacggg gttttggatt tttgtaggtt gggttgaacg aagtgaaacc caacaaatgc 16681 ctgtaggtgt tgggttttgt tcctcaaacg ccagacttgc tgtgagagcg gaaagccgtc 16741 attcgcgctg gctcacgcca catgcctcaa cgcggggaac ccgcgcacgg cagtggctcc 16801 ccaacctaca caatttcatt ttttgggcaa aacccgagta gtattgcctt atatccttac 16861 tcccttacac ccttacaccc ttaacccctt acccccctag ttgtttgact ttgccttgtg 16921 aatcacgaac taggatacca tcttccattt ccacaatgcg atcagcgatg tccaaaatcc 16981 ggttgtcgtg ggtgactaac aatatagatg ttccttgttc tttagcaaga tgctgcatca 17041 attccacaac atcgcgccct gattgtttgt ctaaagctgc tgtgggttcg tctgctagta 17101 ctaatggggg actgttgacc aatgcgcggg cgatcgcaat cctttgtttt tgtcctccag 17161 aaagattttc tgggtagtaa ttaatccgtt ctcccaaacc cacagcctta agtatggctt 17221 ctgatttcgc gatggcttgt tcttgagaaa tatatttatt cagttctacc gccatctgca 17281 cattttgcct agcagttaaa aactccaaca agttatgagc ctgaaaaatg taaccaatct 17341 ttcgccgaat attaaccaat tttttctggc tagccctata tagttctacg cctaaaaatt 17401 gtaagcttcc ttcttgtaca gatcgcaaac caccaatcaa actcagtaat gttgtcttac 17461 ctgaaccaga tggcccagtc ataataacaa tttctcctgg ataaatgtct agattaatat 17521 caaagagaat ttgtcttttg agtatgcctt taccgtaata atggttgata tgtttaatgg 17581 aaataacagg ttctttattt agcataaata tcaattatag gaatccggtt tggtttatgt 17641 tagcttgcgt ggcggagcca tacgaactcg // LOCUS NODE_1871_length_17667_cov_4.71979317667 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17667) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17667) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17667 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(160..1131) /locus_tag="DP116_16325" CDS 
complement(160..1131) /locus_tag="DP116_16325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318950.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sodium-dependent bicarbonate transport family permease" /protein_id="PRJNA477356:DP116_16325" /translation="MNSSLILSNILNPPVLFFFVGMLAIFLKSDLEIPQPLPKLFSLY LLFAIGFKGGYELDESGINPQVALTLIAAIIMACVVPIYSFFILKTKLDAYNAAAIAA TYGSISAVTFITAQSFLKVLDITSDGYMVAALALMESPAIIVGIVLVRAFGQKKEGGE FSWSEVLREAFLNGSVFLLVGSLIVGILTGQKGWEKLQPFTQSIFYGVLAFFLLDMGM VAAKRILDIRKTGSFLIIFSVFMPVGNAILGIIIAKLIGIPQGNALLFAVLCASASYI AVPAAMRMTVPEANPSLYISMALALTFPFNIIVGIPLYMNIIKAMKI" gene complement(1929..3359) /locus_tag="DP116_16330" CDS complement(1929..3359) /locus_tag="DP116_16330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455001.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LytR family transcriptional regulator" /protein_id="PRJNA477356:DP116_16330" /translation="MVKQVASQKHQSTSSKLQQSPNGVTLTKQKNISRNFVSSGSVPS QLYHRLGLAMPRWLFWVLTIVVGMTLSGLLVSSLALWTPLWSDIDRTDEEVGLSGKDQ KTPLPGELWSNISQYRLTRPMNILVMGIEPVLGSVDGSPESFAGHSDTILLIRLNPSN KTIRVLSIPKDTMTAIPEKGLTKVSEANAKGGKVLAARVVSRTLSNAPIDRYIRISTS GLRRLVDQLGGVEVFVPKPMVNKDPGTGFSINLVNGWQTLNGEQAEQFVRFREPAMGD LERVQRQQALLVGLRDRLNTPTVLPKLPQIIRVMGRHFDTNLKLEEMMAIVNFALNIQ RDNYQMTILPGIFSRLSQDPNSYWLDLTGRVDLLQDYAGVTIGGMSSSVKPPTSLKIA IQNASRKPQLTQKVINNLKQQGFTKLYAIPDWSDNRSETKIIVQKGNRQAGEQLQKIL GFGQIEVSATGDLESDITIRIGKDWK" gene 3655..4719 /locus_tag="DP116_16335" CDS 3655..4719 /locus_tag="DP116_16335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873989.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="mannose-1-phosphate guanylyltransferase" /protein_id="PRJNA477356:DP116_16335" /translation="MTRALFPVILAGGKGERFWPLSRQNRPKQFLNLDGTDRSLLQAT ADRLLTLAGGWDNLWVITSGQIAQGVREQLPLLPSDNLLVELQGRDTAAAVAWTSLEI QRRYGDDAIIGFFPADHWIAEQKAFVCTIDAARELAATQAAIVTLGIKPTFPSTGYGY IEQGEKIGSFNELPAYHVNRFTEKPDRQTAEDFLSTGRFSWNSGMFVFRAGVVLKELH RHAPEIIEPIEKYGPDIYQDLPKKSIDYALMEKTDIAYVLPAEFGWDDLGDWNAIERL MKKEGIPNVEFATHVGLDTQGSIVYSTNEEDVIVTIGLEDVVIVRDRNVTLIVKKDRT QEIKQVLKTLQADPRFTNLL" gene 5333..6979 /locus_tag="DP116_16340" CDS 5333..6979 /locus_tag="DP116_16340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873988.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AarF/ABC1/UbiB kinase family protein" /protein_id="PRJNA477356:DP116_16340" /translation="MFLTQTVPRQREIIEVLLRNGWDYMRRVLTGGKADEPQLPTPAV LKNILVDLGPVYIKLGQLMSTRPDLLSAAYIEELSTLQDEVPPVPWADVEVLIRKQLK RPLEETFTTINAIPVAAGSIAQTHKATLVDGREVALKVQRPGIDITVPQDIALIQGIA DLVARTEFGQTYEIKAIAEEFTKALEAELDFIREASFTDQLRRNLSESRWFDPTQLVV AEIFWDLTTPKLLVMEWLNGVPLLSANLESEENGKDPAQERKEITTLLFRAFFQQLYI DGFFHADPHPGNLFYLKDGRVALLDCGMVGRLDPRTQSILIEMLLAIVDLDARRCAQL TLQLAESTQPVIMVKLENDYDRMLRKYYNLSVSQINFSQIIYELLQVARNNKIRLPSN MGLYAKTLANLEGVARGFNPELNFLDEIQPLLTDVFRQQLFGDSPVRSLLRTALDVKS LSLQSPRLLEFLLERITSESLQWNISLRGLDSLRRTTDDAANRLSFSILVGSLIMGAA IISSNARTSQLLFLSNVLFATASLLGLWLIISILRSGRLR" gene complement(7158..8012) /gene="fghA" /locus_tag="DP116_16345" CDS complement(7158..8012) /gene="fghA" /locus_tag="DP116_16345" /EC_number="3.1.2.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208775.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S-formylglutathione hydrolase" /protein_id="PRJNA477356:DP116_16345" /translation="MTNPNLISEYKSFGGKLGFYNHPSSTCNGEMRFAVYQPPQATQK PVPVLYFLSGLACTEENFMIKAGAQQYAAKYGLMLVAPDTSPRNTGIPGEDDDWDFGT GAGFYVDATVEPWASHYRMYSYVVQELPALIAEHFPVQPEKQGIFGHSMGGHGALVCA FRNPQQYKSVSAFAPIAAPMHCPWGEKAFSRYLGEDKESWRAYDASELVRQTRYHSTI LIDQGTADKFLSQQLLPEVFEQACAAVNQPLNLRYQEGYDHSYYFIASFIEDHIRHHA LSLGIKSF" gene 8217..8996 /locus_tag="DP116_16350" CDS 8217..8996 /locus_tag="DP116_16350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015173982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16350" /translation="MRALQKTSWYASFLKFLLKKLSFSPSTRKSWFQILVTLLITIFI VGGIAWLENEQKNDKKCEHKLDFPTILCFISDSKLLNQVQNISVISAAILFFCDTFDR KKQLERQAWQLIDGAQGSETSGARRQAIEELYKEGADITGLDADGADLRGINLSGANL ERASFKNAILEEANFEGANLTEANFEGANLKGANFKKAMLLHADFTRADLTAYDQKKT DLRDADLGRTIFNRAKVSEAIFGVIDSEPHLGNKTNLSGAI" gene complement(9306..10322) /locus_tag="DP116_16355" CDS complement(9306..10322) /locus_tag="DP116_16355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307402.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1016 domain-containing protein" /protein_id="PRJNA477356:DP116_16355" /translation="MSELNAGHYGELLIAVKQRIRAAQYEALKAVNKELIALYWDIGR LIVSRQQEETWGKSVVEQLAKDLQAEFPGISGFSARNIWNMRNFYLTYSQNEKLQPMV AEIGWTHNLVIMEKCKDDLEREFYIRMTRKFGWTKNVLINQIENQSYEKTLLNQTNFD QTVPKNIRQQALLAVKDEYTFDFLELADEHSERQLEQAILAKVEPFIQEMGGMFTFIG SQYRLEISDKEYFIDLLLFHRRLKCLVAIELKIGEFLPEYVGKMQFYLAALDDKVRLE DENPSVGIILCKLKDKTIVEYALRESNKPIGVATYRIVSTLPQELQDKLPAPEQVAKL LEGY" gene complement(10600..11712) /locus_tag="DP116_16360" CDS complement(10600..11712) /locus_tag="DP116_16360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861126.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_16360" /translation="MKVKAAVAYESGKPLSIETVQLEGPSAGEVMVEIKASGVCHTDA YTLSGTDPEGLFPAILGHEGAGIVVEVGEGVTSVKPGDHVIPLYTPECRQCEYCLSFK TNLCQAIRATQGRGVMPNGTSRFSIDKQMIHHYMGTSTFSNYTVLPEIAVAKIREDAP FDKVCYIGCGVTTGVGAVINTAKVEPGANVVVFGLGGIGLNVIQGARMVGANMIVGVD INPSKKALAEKFGMTHFVNPKEVEGDLVAYLVDLTKGGADYSFECIGNVNVMRQALEC CHKGWGVSVIIGVAGAGEEIRTRPFQLVTGRIWKGSAFGGARGRTDVPKIVDWYMQGK INIDDLITHVMPIEQINDAFELMHKGESIRSVVTFD" gene 12230..13513 /locus_tag="DP116_16365" CDS 12230..13513 /locus_tag="DP116_16365" /EC_number="2.6.1.66" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="valine--pyruvate transaminase" /protein_id="PRJNA477356:DP116_16365" /translation="MNPALTQFGVHMSNLTGVRAIMKDIIETLRANAGQQLINLSAGN PLILPEVEQLWRDCTAQLLASPEYGEVVCRYGSSQGYAPLVEAIVGDFNRRYGLDLTE RNILVTPGSQSLYFYAANAFGGYTTSGELKQIVLPLSPDYTGYGGVCLVPEALIAYKP ALDIDAAAHKFKYRPDFSKVSITEKTGCVIFSRPCNPTGNVLTDDEVKKIAALAAPYD LPVFVDSAYAPPFPAMNFTQMTPIFGKNIIHCMSLSKAGLPGERIGIAIGDEGVIQVL ESFQTNACLHASRYGQAIAARAINSGALAEISVQVIRPFYQNKFTVLESTLNEAMPKD LPWFLHRGEGAIFAWLWLQDLPITDWELYQELKRVGVIVVPGSTFFPGLEEEWEHKQQ CVRISLTGSDEEIATGMQRLAKVVQQVYQRTAVSA" gene 13519..14187 /locus_tag="DP116_16370" CDS 13519..14187 /locus_tag="DP116_16370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208772.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribosome maturation factor RimM" /protein_id="PRJNA477356:DP116_16370" /translation="MTNRQDAKDAERKMRNKGSRGGGKEIISSSPTPVPEGWLEIGTI VAPQGLDGQMRVYPDTDFPERFEVPGTRWLLRPHGTEPQAVELLSGRYVQGKNLYIIE LEGVEDRDQVEELRGCKLMVPESDRPQLGEDEYHVPDLIGLQVFMQESGELLGSVVDI LPAGHDLLEVELQSSKDEEQKTKDKRKKTVLIPFVKAIVPVVDLEARRIEITPPAGLL EINT" gene 14462..14905 /locus_tag="DP116_16375" CDS 14462..14905 /locus_tag="DP116_16375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16375" /translation="MTDPVTTLTAFAIAEFAFKKFFESSVGKLGEKFTQTALTKMDEL HQKIWDKLRRNPKATEALQEVEKGSKDHLNKVAFYLEDEMKDDTEFAAEIRAIAHEIN INRVQDNSSQTQNIYGGKGYQTKMGDNNTNQFGDTHNYYGTPPQS" gene 15017..17089 /locus_tag="DP116_16380" /pseudo CDS 15017..17089 /locus_tag="DP116_16380" /inference="COORDINATES: protein motif:HMM:PF00931.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002756904.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 17034..17043 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <17044..17514 /locus_tag="DP116_16385" CDS <17044..17514 /locus_tag="DP116_16385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16385" /translation="AIDYYQQSLAISLEIGDRNGETASLNNLGGTYCSLKQYQQAIDC YQQLLTIQPETGDRNGEAQSLQNLAQLYNLTGRIKEGYAAGIQAAQILQELGLPIEAW AIPKWQKSIAKFAQRGKLQLGLCFLAGLFAFPFALVFIVSLMLWRLVKSQLLRR" BASE COUNT 5104 a 3733 c 3754 g 5066 t 10 others ORIGIN 1 attatgccaa aacagggaaa tgaatgatct cttaagccaa ttccgaaccc gttttatggt 61 gacgtgtact aagaacgttc ccctacaccc cgtacgggcg caaggccgca gcgcccctac 121 acacccttac acccctagtt tttgactgct tcgactctgc tagattttca tagcttttat 181 gatattcata tatagaggaa ttcctacaat aatgttgaac ggaaaggtga gtgctaaagc 241 catagaaata tacaagctag gattagcttc aggaactgtc attctcatag ctgcaggaac 301 tgctatgtaa gaagcgctag cacaaagaac agcaaataaa agggcattac cttgtggtat 361 gccaatgagt ttagcaatga tgatccccaa aattgcgttg ccaacaggca taaatacaga 421 aaaaattatt agaaaagaac ctgtttttct tatatctagt attcttttag cagctaccat 481 tcccatatct agtagaaaaa atgctagaac tccgtagaag atgctttgag taaatggttg 541 tagtttttcc catcctttct gtcctgttaa aatgccaacg atcagactcc caactaagag 601 aaatacagaa ccatttagaa aagcctctcg caaaacttca ctccaagaaa attcaccccc 661 ttctttcttt tgaccaaatg ctcttactaa aacaataccc acaataattg ctggagattc 721 catcagtgcc aaagcagcga ccatgtatcc atcagaggta atatcaagca cttttagaaa 781 ggactgtgca gtaataaagg tgacggcact gatggatcca taagttgcag caatagccgc 841 agcattatac gcatcaagtt ttgtttttaa aataaaaaag gagtagatag gaaccacaca 901 agccatgatg atagctgcta tcaatgtcaa cgctacttgt ggattaattc cgctctcatc 961 aagttcatat cctcccttaa acccaattgc aaagagcaag taaagagaaa acagttttgg 1021 taaaggttgg ggaatttcta aatcggattt tagaaaaatt gccagcattc ctacaaagaa 1081 aaagagcact ggcggattta atatgttcga taagataagg ctggaattca tttctactaa 1141 ccctgatttt tttgaagaga agtgggaacg acaataggca tcaggcaatt tacttaagtc 1201 actgtcagca atgagcaact cttcagtgaa tatatttcaa ggctatgaat atggtatgaa 1261 cattgtgtgt caatgttatg acattaactt ttataaaggc agtgcactaa gctacgatcc 1321 ttgataaata aagataaaat aatttttata tactcttaat tggttttaga agattttgat 1381 aattttaact tgatgtttgc caactgcaag gatgttgttc 
aaagaacagg gagcagggga 1441 acgcttaact cttaacgctt aacataaact gtacctagct gagcaaaaat caaataggaa 1501 ttctatatat tttctttata aaaagacagt caaaatatgg taaaatttcg ctttttttat 1561 aattatgaaa cacttgccca tcaattaacc ataagctgga ctagcgtaaa gcagcaccca 1621 taaccaggca agaagccaaa cgacaatgaa tcccaagata cgaagtagga tttgtttcat 1681 ctttctttgc gcgagatact ctaagcgatt gcccaaagcg cagacgcaat catgcgtatc 1741 gccaagcgtt tgtaggcaag ggtgtgctgt gtcaagcttg acccttcgtt cacgaagtgt 1801 gcgctgagtc cgtcggacac gctacgcgtt agcccaagcc gcgtcgcgtg cccctgtttc 1861 ctacgcgcat tggggtaatc gcttgacaat ctctgttgtt cacagcatgc ttcgctgttg 1921 agtgatgact acttccaatc cttgccaatc cggattgtga tatcagattc tagatcgcca 1981 gtcgctgaca cttcaatctg accgaaacct aagatttttt gtagttgttc tcctgcttgt 2041 cggttacctt tttggacaat aatcttagtt tcgcttcggt tatcagacca atctggtata 2101 gcgtaaagtt tggtaaatcc ctgctgttta agattattga taactttttg agttagttga 2161 ggttttctag atgcattctg aatagcaatt ttcaggctag tgggcggttt gacacttgag 2221 gacatacccc ctattgtcac cccagcgtaa tcctgtagca agtcaactcg cccagtcaga 2281 tccaaccaat agctattggg atcttggctg agacggctga aaattccagg taatatggtc 2341 atctggtaat tatctcgctg tatgttcaag gcaaagttca ctattgccat catttcttcc 2401 agcttaaggt tagtgtcgaa atgccttccc ataacgcgga taatctgagg caatttgggt 2461 aaaacagtgg gagtgtttag gcgatcgcgc aaccccacca agagtgcttg ttgtcgctgt 2521 actctttcca aatcgcccat tgctggttcg cgaaaccgca caaactgttc tgcttgttca 2581 ccattcaatg tttgccagcc gttgactaag ttaatcgaaa atcctgttcc aggatctttg 2641 ttaaccattg gtttcggaac aaaaacctct actccaccca actgatccac taggcgtcgt 2701 aagccactgg tagaaatacg aatgtagcga tcaataggtg cattactgag agtacggctg 2761 acaactcgtg ctgctaaaac tttacctcct ttagcgttgg cttcagatac cttagttaat 2821 cctttttctg gaatagctgt catggtatct ttggggatag aaagtacccg aattgttttg 2881 ttgcttgggt tgagtcgtat cagcagtatc gtatcgctat gacctgcaaa actttctgga 2941 gaaccatcta cacttcccaa aactggctca atacccatga ccaagatatt catgggtcga 3001 gtcagacggt attgtgaaat gttactccac aactcccctg gtagaggtgt tttttgatct 3061 ttgccactca gtcctacttc ttcatctgtt cgatctatgt cactccagag 
tggagtccaa 3121 agtgccaaac tcgataccag caaccctgac aaggtcattc ctacaacaat tgtcagtacc 3181 caaaaaagcc atcgaggcat ggctaaccct agccgatgat aaagctggct gggaacagac 3241 cccgaagata cgaagttacg agaaatattt ttttgctttg tgagtgttac cccgtttggg 3301 gattgttgta acttacttga tgtgctctgg tgtttttgac ttgctacttg cttgaccaca 3361 atactctccc cactcatcac actccctaaa cgtaatgtta atgcaactta taaatcttgc 3421 ctgctggtga actaccttac tcgaagtttg agtaagagtg taaatttttc atcttacctc 3481 atcctagtta tcgtactgga agtccccctg aatccctgta tacctttact cttcgctttt 3541 ttttggtgta tgtctgtgtt atatcctcta attttgtcca agtatcttta tacttcatta 3601 cttttatttg tcacagtaca cttgtagtgt tctttagtaa gttactctga cccaatgact 3661 agagctttgt tccctgtaat ccttgctggt ggaaaaggtg agcgtttttg gcccctgagt 3721 cgtcaaaatc gacctaagca atttttgaat cttgatggta ctgacagaag tctcctacaa 3781 gcaactgccg accgactatt gacacttgca ggcggttggg ataacctgtg ggttattact 3841 tctggtcaga tagctcaagg agttagagaa caactaccat tactgccatc agataaccta 3901 cttgttgaat tgcagggaag ggatactgca gctgcagttg cttggacaag tttggaaatt 3961 caaaggcgtt atggagacga cgctattatc ggttttttcc cagctgacca ttggatagcc 4021 gagcaaaaag catttgtgtg tacaatagat gcagctagag agcttgcggc tacccaagca 4081 gcgattgtca cattggggat taagcctacg tttccatcaa ctggttacgg ctatattgaa 4141 caaggggaaa agattggtag ctttaatgag ttgccagctt atcacgtcaa ccgctttact 4201 gaaaagcctg accgtcaaac ggctgaagac ttcctatcta caggacgttt tagctggaat 4261 agcggtatgt ttgtttttcg agcaggtgtt gttctcaagg aactgcacag acacgctcca 4321 gaaattatcg aaccgataga aaaatacggt cctgatatct accaagatct ccctaagaaa 4381 agtatagact atgccttgat ggaaaagaca gatatcgctt atgtcttacc tgcagaattt 4441 ggctgggatg atttgggaga ttggaatgcg atcgagcgtt taatgaaaaa agaaggcatt 4501 cccaatgttg aatttgcgac tcatgtcgga ctggacacac agggttctat agtctattcc 4561 acaaatgagg aggatgtcat tgttaccatt gggttagagg atgtcgtgat tgtgcgcgat 4621 cgcaacgtta ccctcattgt caaaaaagac cgtacccagg aaattaagca agtcctcaaa 4681 accctacaag ctgatccccg atttacgaac ttgctttaaa acttaaaact cttggcagag 4741 gcacgctagg tgtatcaacc agtataactt ttctgtctgt tatttcattt gaacaattat 
4801 ggaaatcagt agttgtgaac ctctagccaa gagcatggca ataaactgca ttattgctta 4861 tcaaaagcct ttttctgtgt cccaaaaatt tttatttccc catcgtttca tcaacgatgg 4921 ggatttctcc ttaaataact tgacattccc acagcccgaa ggagtaggat tcttggactc 4981 agattgcccc tttcagttgc cctacttaag gtctaacatt ctgatgtgcc tagtcttaca 5041 actagcttct caagcgttag tttccatgtg ccccatggta ctgagtacga tgatatcaca 5101 gcaggtcttg gggcggcaga tttctataca cccccttgaa cggatgaact ttctctacaa 5161 tattttgtaa gattaccttt acaaatactt aacgattgcg cgacatcaag caaaatcccc 5221 aaaactaaaa gcaatatcaa cggcttggtg ctcatcatca tgtcctgcgg cttacttctc 5281 gttcctcgtc tattcctctt tctgttgcat ccttctgctt ttttataaaa ccatgttcct 5341 cacccaaact gttcctcgtc aaagagaaat cattgaagtc ctacttcgta atggttggga 5401 ctatatgcga cgagtcctca ctggaggaaa agctgatgaa ccccagctac ccacacctgc 5461 tgttttaaag aacattctag tagacttagg accagtctac atcaagctcg gtcagttaat 5521 gtctacccgt ccagacttgc tgagcgcagc atacatagag gaattatcaa cactacaaga 5581 tgaagtacca ccagttccct gggcagatgt agaagtcctg attcgcaaac agctcaaacg 5641 tcctttagaa gaaaccttca ccacaattaa tgctatccca gtagcagcag gctcaattgc 5701 ccagacacat aaagctacac ttgtagatgg tcgggaagtc gctctcaagg tacaacgtcc 5761 aggaatagat atcactgttc cccaggatat cgccttaatt caaggtattg ccgatttagt 5821 ggctcgtacc gaatttggtc agacttatga aatcaaagcg atcgccgaag aatttaccaa 5881 agccctagaa gcagaattag attttatacg ggaagcaagc ttcaccgacc agttacgacg 5941 caatttatcc gagagtcgct ggtttgatcc cacacaatta gtcgttgccg aaattttctg 6001 ggatttaacc acaccaaaat tactcgtgat ggagtggctg aacggagttc ccctgctatc 6061 ggcaaatctt gagagtgaag aaaacggcaa agatccagcg caagaacgta aagaaattac 6121 cacactactg tttcgagctt tcttccagca actgtatatt gatggtttct ttcacgctga 6181 tccccatcct ggaaacttat tttatcttaa agatggtcgc gttgcccttt tagattgtgg 6241 catggtggga cggcttgacc cccgcacgca aagcatatta atagaaatgt tgttggcaat 6301 tgtcgatttg gatgcgaggc ggtgtgctca actaactttg cagctggcgg agtctacaca 6361 gccagtgatt atggttaaac tggaaaatga ttatgaccga atgctgcgaa agtattacaa 6421 cttgagcgta tcgcaaatta atttcagtca aatcatctac gaacttttgc aagtcgctcg 6481 
taacaacaaa attcgcttgc ccagtaacat gggtttatat gctaaaaccc tggctaactt 6541 agagggggtg gcgcgaggat ttaacccaga gttgaacttt ctcgatgaaa ttcagccatt 6601 gctgacagat gtgtttcgtc aacagttgtt tggtgacagt cccgtgcgat cgctccttag 6661 aacagctttg gatgtcaaaa gtctctcttt acaatctcct cgactcctag agtttctgct 6721 agagcgaatt acctcggaaa gtttacagtg gaatatctca ctgcgtggtt tagatagctt 6781 acgccgaaca acagatgatg ctgcaaatcg tctttctttt agcatcctag tgggttcact 6841 gattatgggt gcagcaatta tctctagcaa cgcacggaca tctcagttgt tatttttgag 6901 caatgtatta tttgccactg ctagtttatt ggggttgtgg ttaattatca gtattttgcg 6961 ctcaggacgg ctgcgttaaa atctatacgt cacggctacg cctagacatg atgtaggttg 7021 ggtagggttg tactattcct ttcccgccaa aagattaccg attttaaaac cgcacgaggc 7081 aagagagaac gcagagaaaa tacgttttcc tgttagaggt aattttacag tgggaaggga 7141 gtaagttgtc ctagttttta aaaactttta attcctaatg aaagcgcatg atgacggata 7201 tgatcctcaa taaaactagc tatgaaataa taactgtggt cataaccttc ttgataacgt 7261 aagtttagcg gctggttaac tgctgcacaa gcttgctcaa acacctcagg tagcaactgc 7321 tgactgagaa atttatcagc tgtaccttgg tcgataagaa tagtactatg gtatcgtgtt 7381 tgcctgacca attcactagc atcgtaggca cgccaacttt ccttatcttc accaaggtag 7441 cggctaaacg ccttttcacc ccaaggacag tgcattggtg cagcgatggg tgcgaaagct 7501 gagactgatt tatattgctg ggggtttctg aaagcgcaaa caagcgcacc atgtcccccc 7561 attgaatgac cgaaaatgcc ttgtttctct ggttgtacag gaaaatgttc ggcaatcaaa 7621 gcaggcaatt cttgcacaac ataactatac attcggtagt gagatgccca aggttcaaca 7681 gttgcatcca cataaaagcc agcacctgtt ccaaagtccc agtcgtcatc ctcaccagga 7741 ataccagtgt tgcggggact agtatctggt gcaactagca ttaaaccgta ttttgccgca 7801 tactgctgtg caccagcttt aatcataaag ttttcttctg tgcaagccaa accggaaagg 7861 aaataaagaa ctggtactgg tttttgggtg gcttgcggtg gttggtagac agcaaagcgc 7921 atttctccgt tacaggtaga ggagggatga ttgtagaagc cgagtttacc accgaaggat 7981 ttatattctg aaatgaggtt ggggttagtc atgagtgttt ggttagtgtt caaatctcga 8041 taaattctaa ccccaagggt tcctaatagc agaaataaaa caatgtctta gcgatttgcc 8101 tgaaagtctg attgtacgta agtatttgaa gaaatagatt agcgaaaacc accattaagt 8161 gataaactct 
tgtttaattt aatgcgtgtg tcatttaaga cagggagcga gggttgatgc 8221 gtgcactgca aaagacttct tggtatgctt catttttaaa atttttactg aaaaagttat 8281 ccttctcccc tagtactaga aaaagctggt ttcaaatatt agtaacgctg ttaataacca 8341 ttttcattgt tggcggaata gcttggttag aaaatgagca gaaaaatgat aaaaagtgtg 8401 aacataaatt ggattttccc acaattcttt gttttatctc agattctaaa ttattaaacc 8461 aagttcaaaa tattagtgtt atttccgcag ctattttatt tttctgtgac actttcgaca 8521 gaaaaaaaca attagaacgt caagcttggc aattaattga tggtgcccaa ggttcagaaa 8581 caagtggtgc aagaagacaa gcaattgagg aattatataa ggaaggcgct gatatcacag 8641 gtcttgatgc agatggcgca gacttaagag gaataaactt aagtggcgct aatttagaaa 8701 gagcaagttt taaaaatgca atcttagaag aagctaattt tgaaggagca aatcttacgg 8761 aagccaattt tgagggcgca aatctcaagg gtgcaaactt taaaaaagct atgcttttgc 8821 atgctgattt cactagagca gatttaacag cttatgatca gaagaagact gatttacgtg 8881 atgcagacct cggacgtaca atatttaatc gagctaaagt tagcgaagct attttcggtg 8941 tgattgactc cgagcctcat ttaggtaata aaactaactt aagtggagca atataatgct 9001 tagacttaac aagaatgagt atttgtaagc tcaagaccac tgaaaagtct tgtagttggt 9061 tgtttttaac ttcgtgtact tgttattcag ttaagtagat aaacataatt aattacacaa 9121 tgtcattgcg aatggagcga agcgaaatga agcaatcgcg agggctggga ttgcttcgct 9181 tcgctcgcaa tgactgtaaa tatttttgtt catctactta tcagtaaaga gcgcatcgcg 9241 cgcttggagg aaacaccaga ccaattggtc gtttgtcgtc agacgtgccg taggcatagg 9301 cgttgttaat acccttctaa caattttgcg acctgttccg gtgcaggaag cttatcctgt 9361 aactcttgtg gtaacgttga aactatccga taagtcgcca cgccaatagg cttattcgac 9421 tccctcaacg catattccac aatagtttta tctttcaact tacagagaat aatacccacc 9481 gaaggatttt catcttctaa cctgactttg tcatctaacg ccgccaaata aaactgcatt 9541 ttacccacat attccggcaa aaattcaccg attttcaact caatagccac caaacacttc 9601 aagcggcgat gaaataataa taaatcaata aaatattctt tatcactaat ttctaaacgg 9661 tactgactac cgataaatgt aaacattccg cccatttctt gtataaatgg ttctaccttc 9721 gccaaaattg cttgttctaa ctgacgttca ctatgttcat ctgctagttc taaaaagtca 9781 aatgtatact catctttaac agctaaaaga gcctgttgac gaatattctt tggaacagtt 9841 tggtcaaaat tagtttgatt 
gagtaaagtt ttctcataac tttgattttc aatctggtta 9901 attaaaacat ttttcgtcca gccaaacttc cgcgtcatgc ggatataaaa ttctcgttct 9961 aagtcgtctt tacacttctc cataattacc aaattatgag tccagccaat ttctgcaacc 10021 attggttgca gtttttcgtt ctggctgtag gtaagataga agtttcgcat attccagata 10081 ttacgagccg aaaatccgct aatacctgga aactcagctt gtaaatcttt agccagttgt 10141 tctacaactg attttcccca ggtttcctcc tgttgacgac taacaattaa ccgtccaata 10201 tcccagtaga gagcaattaa ttctttgtta accgccttca atgcttcata ctgtgctgca 10261 cgaatacgct gtttaacagc aattagcaat tcaccataat gacctgcatt gagttcactc 10321 atagggaatt tttgagaaat gaattacaat tttgggattg aagtagccaa cgcaaaaatc 10381 tgcccgatac gacatcttca gcgtcttttg caaaatcggt agcagttgca aatctatgtt 10441 gtcaaagatg cgtggcataa tttgatagta tacaaatctc aacaggcaga gcctcatgga 10501 atgcattccc agccagagac tgggaacgag ggaacgaggg aagaggctgg gaacgagaaa 10561 aatgagggaa acgaggaaaa ctcctgtgca gttccttaac taatcaaacg tcaccacact 10621 ccgaattgac tcacccttgt gcatcaattc aaaagcgtca tttatctgct caattggcat 10681 cacatgggta atcaaatcat caatatttat cttcccctgc atataccaat caacaatttt 10741 tggtacatca gttcgtcctc ttgcaccacc gaacgctgaa cctttccaaa tgcgtccagt 10801 caccagctga aaaggacggg tgcgaatttc ctcgccagca ccagcaacgc caataatcac 10861 gctgactccc caacctttgt gacaacattc caaggcttgg cgcatgacat tcacattacc 10921 aatgcattca aaactgtaat ctgcaccgcc tttagttaaa tcaaccaagt aagcaactaa 10981 atctccttca acttccttgg ggttgacgaa gtgagtcatg ccaaattttt ctgctaaagc 11041 ttttttgctg ggattgatat ctacccccac aatcatattt gctcccacca tccgcgcccc 11101 ttggatgaca ttcaacccaa taccacccaa accgaaaacc actacatttg ctcctggttc 11161 cacttttgct gtattgatga ctgcaccaac tcctgttgtc acgccgcagc caatgtaaca 11221 aactttatcg aatggcgcat cctcgcgaat ttttgccacg gcgatttccg gcagcactgt 11281 atagttagaa aaagtggatg tccccatgta gtgatgaatc atctgcttat ctatggagaa 11341 ccgactggta ccattaggca tgacaccccg cccttgagtc gcgcgaattg cttgacagag 11401 attggttttg aaactcagac aatattcaca ctgacggcat tctggagtat ataagggaat 11461 aacatgatcc cccggcttga cgctggtaac tccttcgccc acctctacaa caatacccgc 11521 
gccttcatgt cctaaaattg ctgggaataa accttcagga tccgtaccag aaagagtata 11581 agcatcggtg tgacaaactc cgctagcttt aatctcgacc atcacttccc cagctgatgg 11641 tccttctagt tgaactgttt caatacttaa cggcttacca gactcgtaag ccactgctgc 11701 ttttactttc aaggtcagtt ctcctcatca ggaattcaaa agatgcccta cagggacgct 11761 acaggttagt ttaatgtaac tctccataat ctgtgatttt gtgagggaga atgtcaagtt 11821 ttttgctata atgttaatgg tttgcttcac ccacattggt gtctgataag cctctgctca 11881 atggcactgc gatcattgat aagcgtctcc gttgcacagc atggaagtaa gtcggcacat 11941 cttgattaat acaaattacc tatcggtttg tcccatagcg cgaatatgcc agtatctcat 12001 ccaggacagt aaaatttgta gggtacaagg taagaaattc attgtctccg gtggggcatc 12061 acggagacgt aaagccaaag ctcaggaaat gtccgacggg ataccgggag tcactgagaa 12121 gcctacactg tacagcttgc tctcagtgta ggacatgtca cttacattgg atataagcga 12181 tagcagcttt tttttccacc attagattac gtttattttt tcatcgccta tgaaccctgc 12241 cctgactcaa ttcggcgtcc atatgtccaa cctgactggc gttagagcca ttatgaagga 12301 cattattgaa actttacgag ctaatgcggg gcagcaattg ataaatttga gtgctggtaa 12361 tccgttgatt ttgcctgagg tagaacagtt atggcgcgat tgcactgcac aattgctagc 12421 tagcccagaa tatggtgaag tcgtttgtcg ctatggatca agtcagggtt atgcaccact 12481 agttgaagca attgttgggg attttaatcg gcgatatgga ttggatttga ctgaacgcaa 12541 tatccttgtc acccctggaa gtcaaagtct ctacttttac gctgctaatg cttttggggg 12601 atacaccacc agcggtgaac tcaagcaaat cgttctgccc ctcagcccgg attatacagg 12661 atacggtggt gtttgcctag ttccagaagc tttaattgct tacaaacctg cattggatat 12721 tgacgcagca gcccataagt ttaaatatcg ccctgacttt agcaaagtat cgattacaga 12781 gaaaactggt tgcgttatct tttctcgccc ctgcaatccc actggtaatg tccttacgga 12841 tgatgaggtg aagaaaattg ccgcccttgc tgcgccttac gatctacctg tgttcgttga 12901 ctcggcttat gctcccccat tcccagcgat gaactttacc caaatgacac caatctttgg 12961 taagaatatt atccactgca tgagtttatc caaagctgga ttaccaggag aacggattgg 13021 gattgctatt ggggatgaag gagtgattca agttctagag tctttccaaa caaatgcctg 13081 tctccacgct tcacgctacg gacaagcaat cgcagcgcgt gcgattaact ctggtgcgct 13141 ggcagaaatt tctgtacagg ttatccgccc attttatcag aataagttta 
cagttttgga 13201 aagcacgtta aacgaagcga tgcctaagga tttaccttgg ttcctccatc gcggtgaggg 13261 ggcaattttt gcttggttgt ggttgcagga cttgccaata actgactggg agttatacca 13321 ggaactcaag cgggtaggtg tcatagttgt ccctggtagt actttcttcc ctggcttaga 13381 ggaagagtgg gaacacaaac agcagtgtgt ccgtattagc cttactggaa gtgatgagga 13441 aatcgctacg ggtatgcagc gtttggcaaa agtggtacaa caggtttatc aacgtacggc 13501 tgttagtgct tagttagtat gacgaaccgc caagacgcca aggacgcaga gaggaagatg 13561 aggaataagg gaagcagggg aggagggaaa gaaataattt cctcatcccc cactcctgta 13621 ccagaaggct ggctggaaat tggcacaatt gtggcacctc aagggttgga tggacagatg 13681 cgggtttatc ctgatacgga tttcccagaa cgctttgagg tgccgggaac acgttggttg 13741 ttgcgtcctc atggaacgga accacaagcc gtagaattac tgtctgggcg ttatgttcag 13801 ggtaaaaatt tatatataat tgagttggag ggggtggaag atcgcgacca agtggaggag 13861 ttgcgcgggt gtaagttgat ggttccagag agcgatcgcc ctcaattagg agaagatgaa 13921 tatcacgttc cagatttgat tggcttacaa gttttcatgc aggaatctgg cgaacttctt 13981 ggttcagtgg tggatattct tcctgctggt catgatttgt tggaagtaga actacagtct 14041 tcaaaggacg aagaacaaaa aacaaaggac aaaagaaaaa agactgtttt aattccgttt 14101 gtcaaagcta tagtgccagt ggtagatttg gaagctcgtc gaattgaaat cacaccacca 14161 gctgggttat tggaaattaa tacataaatc gcatcaacca aagcgaccac caatgtattc 14221 tttaatggga gtaggctgtc aaagcgtatt gattcatcta tctagagact gagttagtta 14281 aaatttcagt ctgcgttctt tgtagcgtgt cgtagacaag gcggactttg ttcttgtagc 14341 cgcgacttta gtcgtcaggc tcaaaattta ttagcaataa tgcggaaagt cggaacattt 14401 atctgtataa aaaagtgtaa aactatataa aaatcaaccc gcgtctgagg gatttacagg 14461 tatgactgac cccgtaacaa ccttaacagc ttttgcgatc gctgaatttg ccttcaaaaa 14521 gtttttcgag tccagtgtgg gtaagctggg ggaaaagttc acccaaacag cgcttaccaa 14581 gatggatgaa ctgcaccaga agatttggga taagttacgc agaaacccca aagctacaga 14641 agcattgcag gaagtagaaa aaggctcaaa agaccatttg aataaagtgg cgttttattt 14701 agaagatgaa atgaaggatg ataccgaatt tgctgcggaa attcgagcca ttgctcatga 14761 aatcaatatt aaccgggttc aagataatag cagccaaact cagaatattt acggtggtaa 14821 gggttatcaa actaaaatgg gtgataacaa 
tactaatcag tttggggata cccataacta 14881 ctatggcact ccgccacaat cttgactgga gtcgtccgga ggggaacgga caaagagaaa 14941 tcccgttagt tctattatca acgaattacc tcgtgaaacc cagaattggc acgggcggat 15001 agaggaactt gcccaaatgc aagaatggtt gcaagcggat aacgtgcgtt taattggcat 15061 taccgggact ggtggttatg gcaaatcttc cctggttgca aaagtctttg cttctacaca 15121 aagctttgaa aagcaagttt gggcaacttt tagccaaaat tacccgtttg cggtttgggg 15181 acgctggtta ttggaaaagt taggcaaagc aacacctgaa aaagaagcag atttattaac 15241 tgcggtttgt aataatttgc gaacagggcg ctatttgctg gtattggata acttagaaac 15301 tctgctagaa gcaaacggag aatggcatga taaaacttac tatgacttct tgctgcggtg 15361 gttgagtagc cagactgaaa gtgttatttt agtgactagc cgcgaacaac ctcaattacc 15421 acccaatagc tggaattact gccgttggtt gccattgaaa ggactttcaa ccgatgctgg 15481 ggtagcactg ctagaagact tagatattca aggtactgat gcagaaatca gagaatttgt 15541 caagcaagcg gatggacatc ctttattaat caagctggtg gctggagtat tgcacgctga 15601 tgaaggggat gtggttgata tcagcgcttt gagacaaaat atctttgaga ttttggggtt 15661 acatcgatac gactcagaag cgagtattgg taagattctt gatgccagta ttgcccggtt 15721 aacgccaaag ttacaacaac tattatttaa tttaagcgta tatcgtcccg cttttaatac 15781 cacagccgca gccgcacttt taccagaaca agaagtgaca caagcagatt tgcggggatt 15841 ggttaaacgt tcgcttttgc aagaaaacaa aatagaaagc ggttgggtat ttgagtttca 15901 gccgttaatt ttggcttatc tgaaacagca agcgggtgac ttaaccgaag tacatgaaag 15961 agcaattacc tactatcact ctatcgctca agaatctgca tcgacaatag aagatatcag 16021 accacaacta gaaatttttc atcaccactg cgaactcaag cagtatcaac aagctaatga 16081 cattcttgat tcctgcaacg aatttttgga cttgcgagga tattacacta ccattattga 16141 actatctagc cgactggaaa aggaatggca acccagtaat caagataaaa acagtgaatt 16201 tgcaaatgtt ctcacatatt tgggcagtgc ttacgattcc ttgggacaag ataaaattgc 16261 gattaattat tatcagcagt cactggctat tgctcttgag ataggcgatc gcacaggtga 16321 aggtggctca ctatgcaatc taggaagtgc ttactgttcc ttgggacaat accaacaggc 16381 tattgattac tatcagcaag cattgatagt tttacgagaa actgatcatc atgattttcg 16441 agccaattct ctgattggtt tgggcaatgc ttactgttcc ttgggacagt accaaaaggc 16501 aattgattat 
catcagcagt ctctggctat ttttcgtgag ataggcgatc gcaatggtga 16561 agctgcttct ctaaataatt tgggcaatgc ttacaatttt ttgggacaat accaaattgc 16621 gattgattac cttcagcagt cacttgctat ttctcttgag ataggcggtc gcggtggtga 16681 agctaattct ctaaataatt tgggcaatgc ttacaattcg ttgggacaat accaaatggc 16741 gattgattac tatcagcagt cacttgctat ttctcttgag ataggcgatc gcattggtga 16801 agctggttct ctaaataatt taggcagtac ttacgattcc ttgagacaat accaaattgc 16861 gattgattac cttcagcagt cacttgctat ttctcttgag ataggcggtc gcggtggtga 16921 agctaattct ctaaataatt tgggcaatgc ttacaattcg ttgggacaat accaaatggc 16981 gattgattac tatcagcagt cacttgctat ttctcttgag ataggcgatc gcannnnnnn 17041 nnngcgattg attactatca gcagtcactt gctatttctc ttgagatagg cgatcgcaat 17101 ggtgaaactg cttctctaaa taatttgggc ggtacttact gttccttgaa acaatatcaa 17161 caagcaattg actgttatca gcagttatta acgatccaac cagagacagg cgatcgcaat 17221 ggtgaagccc agtcacttca aaatcttgcc cagctttata atttaacagg cagaattaag 17281 gaaggttatg cagcaggtat tcaagccgcc cagattctac aagaactagg acttcctatt 17341 gaggcttggg ctataccaaa gtggcagaag tctattgcta aatttgctca acgtggtaaa 17401 ttgcagttgg gtttatgttt tctggcgggg ttattcgctt tcccctttgc cctagttttc 17461 attgtgtcgt tgatgttgtg gcgcttggta aaatctcaat tactgcggcg ttgaaagttt 17521 tttccaaagg gcgtaactca aatcttgcac ttcaccggtg agtacgcctt gaactaaagt 17581 tcaaggctaa tagatcaagt ccgttaaaac ggaacggcac atgctacaag tcggcaaagc 17641 cgcccaacgc agtgcctcct caaagac // LOCUS NODE_1879_length_17594_cov_4.87975417594 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17594) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17594) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17594 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1..502 /locus_tag="DP116_16390" /pseudo CDS 1..502 /locus_tag="DP116_16390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456594.1" 
/note="frameshifted; too many ambiguous residues; incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=3 /transl_table=11 /product="peptide-binding protein" assembly_gap 211..220 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 667..1410 /locus_tag="DP116_16395" CDS 667..1410 /locus_tag="DP116_16395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="M50 family peptidase" /protein_id="PRJNA477356:DP116_16395" /translation="MTNFGKNFEPLLTREAPKTVDRMGLFWLLAAAIATIVLWQVPGG NYILYPFTILATWFHEMGHGLMALLLGGQFQQLQIFSNGSGVAFHSVPLYLGSIGRAL VAAAGPMGPPIAGAGLILASRSFKAAHLSLTILGGFLLISTLIWVRSPFGIVAIPLLG LIILGVALKAPRWMQGFGIQFLGVQACVSTYHQLDYLFSASAGPGLLSDTAQIQQQLL LPYWFWGGLMAIASLVILVQSLRLAYRSK" gene complement(1422..1640) /locus_tag="DP116_16400" CDS complement(1422..1640) /locus_tag="DP116_16400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007353649.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16400" /translation="MSISKTEAKQLLERLIFDDERPHDWVQDVWGLSPILGDSAAKLL EVFEALIECCPQDQLENLLQTFYQEDFE" gene 2252..4663 /locus_tag="DP116_16405" CDS 2252..4663 /locus_tag="DP116_16405" /EC_number="2.4.1.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sucrose synthase" /protein_id="PRJNA477356:DP116_16405" /translation="MHELVQTILNSEEKTALRHLIKTLSALDQRYFLRNEILQAFAEY CQNSEKPGYFFYSSSVGKLIHYTHEMIVEGESTWFLLRPRIGSQEVWRLGANMTSFEQ MTPEALLDARDRLVNRFQPQILEIDFSPYYHGYPRISDPRNIGQGLGSLNRHLCTQVL TDPDYWLEVLFDVFHRHSYDGIPLLINARIDSGKQLAKQVKQALNFLNERPSSEPYEK FRFDLQELGFEPGWGNTASRVRETLELLNRLIDTAEPAILEAFVSRVPTVFRVVLVSI HGWVSQENVLGRPETTGQVIYVLEQARHLENKLQEEIKLAGLDLLGIQPQVIILTRLI PNCEGTLCNLRLEKVEGTENAWILRVPFADSNPNVTQNWISKYEIWPYLERFALDAET ELLAQFRGSPDLIIGNYSDGNLVAFLLARRLKVTHCNIAHSLEKPKHLFSNLYWHDLE EKYHFSAQYTADIIGMNSADFIITSTYQEIVGTPDTLGQYESYKCFTLPQLYHVVDGI DLFSPKFNMIPPGVNEHIFFPYNQIQHRDITVSKRVQDLLFTREDSQILGHLENQSKR PIFAVGAITAIDNLAGLAECFGKSQELQQSCNLIIVTDKLHPHQAINSEEAEEIEKLH NIINEYNLHGHIRWVGIQFPLADLGEAYRIIADFQGIFVHFARFEAFGRTILEAMSSG LPTFATQFGGSLEIIEDGEDGFLLNPTDLEGTAKTILSFIDQCNAYLEHWYKISELVI QRVRNKYNWQLHTKQLLLLAKVYSFWNFVNQEIGEAKARYVETLFHLLYKPRAEKILE QHMKR" gene 4693..7569 /gene="pgmB" /locus_tag="DP116_16410" CDS 4693..7569 /gene="pgmB" /locus_tag="DP116_16410" /EC_number="5.4.2.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318844.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="beta-phosphoglucomutase" /protein_id="PRJNA477356:DP116_16410" /translation="MDTTQNSHDFIYTDWTLVETQLKPNQTQHRETVFTIGNGYLGTR GSFEEGSTGAVPATFIHGVYDNVPLVYTELANCPDWLPLIVTVDGDRFRTERGEILSY ERQLDLQRGVLRRKVRWRSPGGKTIDFCFERFASRADEHVLGLRCQLTPVDFDGLIEI QGSINGYPENQGFNHWELIDQGKTNRGAWLQLQTRNTRINLGLAVGMTVTGADASVQV SSPPGYPTLSTVFQASLGQTVTVDKFVTVFTSRDVDNPVKEANEKLAQLPDYEALVDA HAQAWAEAWDKSDILIEGDTKAQLAVRYNIFQLLISAPEHDEKVSIPAKTLSGFGYRG HVFWDTEIFILPFFIYTQPKLARNLLTYRYLTLNGARRKASHYGYKGAMYAWESADTG DEVTPRWLPPNDFYGEDIRIWCRDREIHISADIAYAVWYYWKATDDDEWMRDCGVEII LDTAVFWGSRVEYDTKDERYEIRGVIGADEYHEIADNNAFTNRMVQWHLEKALFVYDW LRHTYPDRFSTLVKKLQLTPGRLSRWQDIINNIWIPYDPSTGLVEQSEGFFKLEDIDL AEYEPRNRSIQTILSIEETNKRQVLKQPDVLMLLYLMRQSQEFPYTQETLEKNWDYYA PRTDITYGSSLGPAIHAILASDLGKSKEAYERFMQAALVDIEDVRGNAHEGIHGASAG GVWQAVILGFGGIQLAEHQPTATPQLPPGWKRLKFKLHWRGEWHEIDLRPTAQDTMTL PDIRGVIFDLDGVLTDTAEYHYLGWQKLADEEGLPFNRLANEDLRGVSRRESLLKIVG NKQYSEAQLQEMMDRKNRYYVDFIQTMRPGNVLPGTIALLDELKEAGIKIALGSASKN AQTVIEKLGIADRIDVVADGYSVQQPKPAPDLFLFAAQQLGLKPEQCVVVEDAAAGIE AALAAGMLAIGLGPTERVGAAHVVLPNLAGVHWTELREKLSSDQ" gene 7736..9469 /locus_tag="DP116_16415" CDS 7736..9469 /locus_tag="DP116_16415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871944.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfonate ABC transporter permease" /protein_id="PRJNA477356:DP116_16415" /translation="MLKRTFPSPEALRRFPFGLADIALIFGTLVLLGLIARVGAGTLV SFVPPDVVPDVSLNPLHLPYYAGRSTLRMFIALFCSTLFTLIYSYVAAKSRRAEQILI PLLDILQSVPVLGFLSITVTGFIALFPGSLLGLEAASIFAIFTSQVWNMTFSFYQSLR MVPSELDEAARLYRLSAWQRFTKLEVPSAMIGLIWNAMMSFGGGWFFVAASEAISVLN QKYTLPGLGSYVAAAVTAQDLPALGWAFLTIAVVILLVDQLFWRPLIAWADKFRLEQS SAAEAPNSWVFDLLKAARLPRLMRRAFTPVGETINRLLSSLTPQRPRVAINQKQKVVS DRLYNFALLLLIGGLLAALLHFILTTVGLGEVFKTFMLGLLTLGRVVVLLVVATLIWT PVGVAIGFNPRLSRLLQPVVQFLASFPANFIFPFATLFFIRAHISIDWGSIFLMSLGA QWYILFNSIAGAMSIPTDLREMARDLGLRGWRLWRKLIIPGIFTAWVTGGITASGGAW NASIVAEVVAWGQTTLTATGLGAYIAKATEVGDWPRITLGIGMMSLYVVGLNRVFWRR LYQLAETKYHL" gene 9484..10842 /locus_tag="DP116_16420" CDS 9484..10842 /locus_tag="DP116_16420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319596.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_16420" /translation="MTTTQATHEVLIAVEQVHKSFPLPEGKGEFTVLRNVNLTVSTGE VVALLGRSGSGKSTLLRIMAGLIPPSEGQVISSGKRLQQANQDVAMVFQSFALLPWLT VQENVELGLEAQGVNRDQRRKQALKAIDLVGLDGFESAYPKELSGGMRQRVGFARAFV LEPQVLFMDEPFSALDVLTSENLRGEIDDLWNAGTFPSKSILIVTHNIEEAVFLADRV IILGSNPGRVRGEVVIDLPRPHDRANVRFKALVDYIYTVMTNPEVEVTGEVAVAAPTT AQASKSPYAQSLPHVRVGGISGLLELIVEKPEGREDIFRLAERIQLEVDDLLPILDGA VMLGFADVIQGDVQLTEIGRDFATTTILRSKDLFRQQVLQRVPMLVSILQTLREKQNG SMGGDFFLDLLDEHFPHAEAERQFATAVDWGRYTELFEYDASEGRLYLPEPVPAESRE AS" gene complement(11196..12575) /locus_tag="DP116_16425" CDS complement(11196..12575) /locus_tag="DP116_16425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008312806.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2252 domain-containing protein" /protein_id="PRJNA477356:DP116_16425" /translation="MTTTPPIKSAELTPNNQLTVAQRLEAGKALRQVVSRSAHREWHP TDRPDPIEILEVSNQGRIPELIPMRYGRMLQSPFAFLRGSAIIMAADLATTPTTGIHV QACGDCHLLNFGGFATPERNLIFDLNDFDETLSAPWEWDVKRLVTSIIVAGKDIRLTD KHCYDTAEAAVRAYRLSIREYGQMGTLAVWYARLDANVLVEHAPDEETRQYWQQMASK AFTRTLQQTFVQMTEEVNGQRRFIDQPPLLYHLPLQEQYLEEVGVLFEQYRDTLQSDR QFLLDRYHLVDVAMKVVGVGSVGTHCGVALLLSNDNDPLLLQFKEARPSVLEPYAGKC PYSHNGQRIVNGQRLMQAASDIFLGWTSNSRGQDFYFRQLKDMKTSIKLKGMSARGLE DYAEICGSALARAHARSGDPVVISSYLGKSDAFDSAVADFAVTYAHQVEQDHQALVAA VKSGRIEAK" gene 13120..13821 /locus_tag="DP116_16430" CDS 13120..13821 /locus_tag="DP116_16430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011429571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MgtC/SapB family protein" /protein_id="PRJNA477356:DP116_16430" /translation="MTWLDFTIRLAVAFLLGSVIGVERQWRQRMAGLRTNTLVATGAA LFVMLGVMTPGGNPTQVEAYIVSGVGFLGGGVIFRGGASVQGLNTAATLWCVAAVGAL AGGGFFPQAFIGTVAVLVANIFLRPLGYRINQQPLKGTEIEVCYRCSIVCRSNDEAHV RALLLQAVSATGKMKLRSLHSEDLEETPERVEVEADLVTQDRNDPFLEQIVSRLSLES GVSAVSWKIIEQEYG" gene complement(14080..14160) /locus_tag="DP116_16435" CDS complement(14080..14160) /locus_tag="DP116_16435" /inference="COORDINATES: protein motif:HMM:PF13358.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16435" /translation="MQGCKLLYLPPYSPDLNLIEKCCKRG" gene 14364..14506 /locus_tag="DP116_16440" /pseudo CDS 14364..14506 /locus_tag="DP116_16440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861021.1" /note="frameshifted; internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS982 family transposase" gene 14609..14878 /locus_tag="DP116_16445" /pseudo CDS 14609..14878 /locus_tag="DP116_16445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002745639.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS982 family transposase" gene 15346..16299 /locus_tag="DP116_16450" CDS 15346..16299 /locus_tag="DP116_16450" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16450" /translation="MVQPNIKTINTPFIGIIGPSQYLGVKLTNFSFSQLFLVRDQNNA NEKESPLKQQITKAREKMYENIVNNLNGLISLNDKEEGVKTIYVFDKYEDFFLKTQFR PDRERMVIRITRITPRETRVSTICRIYTTGTHIYIALDSYLLGKINIFSLVIHTLLLL IFVPMFFTGLLGFLGALVLLLPTLFNPSNINALLAVFSGLLLPFIPGLYLYFSWFPVV KALLNKESFKAALKHRFHNRRFTNLFDEDDGLTYLKSVTPFIIDQITNALASYGIKDE TIFSKLNEIKEAILAQPTISLNNSGIMSNVLIGNNNFQSSK" gene 16335..16721 /locus_tag="DP116_16455" CDS 16335..16721 /locus_tag="DP116_16455" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16455" /translation="MPNDITVDNSGVITNSVFGNNNIQQNISQNTDEITKLISSLRDM SQGFPEAQREATIVHLDDLQEDITIPEKQKKERFKTRLAALLAITGTLGGVVANSVDF GNKVLDLSKKLGVPIEIVQPQPKQIP" gene complement(17026..17376) /locus_tag="DP116_16460" /pseudo CDS complement(17026..17376) /locus_tag="DP116_16460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866155.1" /note="internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="diguanylate cyclase" BASE COUNT 4966 a 3703 c 3966 g 4949 t 10 others ORIGIN 1 cacaggaact atcagcatca actccaccac cacccgcaca ggaactatca gcaccaccaa 61 ccccaccaag cgcaccaact ccaccaccac ccgcaccaac tccagcagcc caaactccac 121 cgccaactgt ggtctcgtct agttcttcta gaactcaatt gcaacaggag acagcaagac 181 tgatccacgt tcaaacagat aggataattg nnnnnnnnnn aggaatgcga tcattgggct 241 tgcccatatg ggcaagccca atgatcgcat tcctccagat gtggatgttt caggatttcc 301 caattcggaa attgtttcgc gaattcatgc tgatattcgt ttagagggag gcgctcatta 361 tatcgaagat gtaggaagtt ctaatggtac ttacattaat aacttgccct tattgccagg 421 aaatcggcat cgtttacgcc caggcgatcg catcagtttg ggtaaaggag atttagtaac 481 attcctgttt caagtttcct aaaataactt gtgtctggtg gttcaaacag ttaacagtta 541 acagttaaca gttaacagtt aacagtgaat aactgataac tggttaatta tgctactaga 601 cacaaacaac ggatagatac aactacaatc gaaatgatag gggtataacg ttgaggaaag 661 tgatcgatga ccaactttgg aaaaaatttt gaacccttgc taactagaga agccccaaaa 721 acagtcgacc ggatgggttt attttggctt cttgctgcag cgatcgccac tattgtgctg 781 tggcaagttc caggaggaaa ttacatttta tacccattca ccatcctggc aacttggttt 841 catgaaatgg gtcacggctt gatggcactt ttgttaggag gacagttcca gcaattgcag 901 attttttcca atggttcggg tgtcgccttt catagcgttc cgttgtactt gggatccatt 961 ggtcgtgctt tggttgctgc agcaggacct atgggtccac ctatcgctgg tgcgggttta 1021 attttggctt cgcgtagttt taaagcagcg catctgagtt tgaccatctt ggggggtttt 
1081 ttactgattt cgaccctgat ctgggtacgc tcgccatttg gaatcgttgc aattccccta 1141 cttggtctga ttatacttgg tgtcgcacta aaagctcctc gttggatgca gggatttggg 1201 attcaatttc tcggcgtgca agcttgtgtt agtacttacc atcaactaga ctacttattt 1261 agcgcctctg caggtcctgg tttgctctct gacactgcgc aaatccagca gcaattgctt 1321 ttaccttatt ggttttgggg tggattgatg gcgatcgcat ctctcgttat tttagtccaa 1381 agtctccgtc ttgcatatcg ctctaagtga tcatcacctc ttcactcaaa gtcctcttga 1441 taaaacgttt gcagcaaatt ttccaattga tcttgcggac agcattctat cagcgcttca 1501 aaaacctcta gcaattttgc tgcactatct cccaatattg gactcaaccc ccacacatcc 1561 tgcacccagt catgaggacg ttcatcatca aaaataagac gttctagcag ttgctttgct 1621 tctgttttgg agatggacat attattttta tttatataag taattttttt gagtttaatg 1681 gttgatctta agtcattacc agtataaagc ttaactgctc attctcaacc cgcgaaaaat 1741 ttgttccgct ttcaactaaa actcaatact atcctttgag tatgagagag tcactttcct 1801 aacagagttc tggttgttga ttatgcagaa aaatcaggac atttactcat atcatgttct 1861 gttaaagact tagcattaag actgcagcaa tgcttagggc gcaaggcgct tttttgagag 1921 gattttaagg tttttgcaca aaccctctga atcataactt atgaaccgga tttgttatca 1981 aaagtcaaat tgggggcgaa ttgttacata agctacaaaa catgaaaact tgctcctaaa 2041 aattactggg ctatacttgt ttttgatatt caacaaatac ctaaaacctt tctgtagaac 2101 ctgaaccttc aagcgagata tgcgctaacg tagggggtga atcattatct gtggttcttc 2161 ttttgaaaat cgttagactc caaggagagt tattttcagg acaagtgtag aaacatgagg 2221 tattggtcta gttaatgttt aggaatgtgc catgcatgaa ctggttcaaa ctatcttaaa 2281 tagtgaagaa aagactgctc tgcgtcattt aatcaaaact ttgagtgcct tggatcaaag 2341 gtactttctg agaaacgaaa ttttacaagc ttttgctgag tactgtcaaa attcagaaaa 2401 gccaggctac ttcttctact cttcttctgt agggaaactc atacactaca cgcatgaaat 2461 gattgtggaa ggggaaagta cctggtttct tctgcgacca aggattggta gccaagaggt 2521 ttggcggctt ggggcaaata tgacaagttt tgagcagatg acgccagagg cattattaga 2581 tgcgcgcgat cgcttagtca accgttttca accccaaatt ctagaaattg atttcagccc 2641 atattaccac ggttacccca gaattagcga cccaagaaac attggtcaag gtctcgggtc 2701 tctcaaccgt cacctatgca ctcaagtgtt gactgatcct gactactggc tagaggtttt 2761 
gtttgatgtt tttcatcgac actcgtatga tggtattccg ttgctgatta acgctcgtat 2821 tgactcaggt aaacagctcg ccaaacaagt caagcaagcc ctaaatttcc tcaacgaacg 2881 tccttcttct gaaccttacg aaaaatttcg ctttgacctt caagaactcg gttttgaacc 2941 aggttggggt aacacagcat cgcgagtgcg tgaaacccta gaacttctca accgactgat 3001 tgacactgca gaaccagcca ttctcgaagc cttcgtctcc cgtgttccga cagtttttcg 3061 tgttgtcctc gtttccatac atggctgggt ttcccaagaa aatgttctgg gaagacctga 3121 aacaacaggt caagttatct acgttcttga acaagcgcgc cacttagaaa ataaactgca 3181 agaagaaatc aaacttgcag gattagacct tcttggtatc caaccccaag ttattattct 3241 gactcgcctt atccccaact gcgaaggaac actgtgcaat ctacgcttag aaaaagttga 3301 gggaacagaa aatgcctgga tcttgcgcgt tccttttgct gattctaatc ctaatgtcac 3361 tcaaaactgg atttccaaat atgagatttg gccttatcta gaaagatttg cccttgatgc 3421 agaaacagaa ctgcttgccc aatttcgggg tagtccagat ctgatcattg gtaactacag 3481 cgatggtaac ttagttgctt ttctgttggc gcgccgtctg aaagtgactc actgcaacat 3541 tgctcactct ttggaaaaac ccaaacacct gttcagtaac ttgtactggc atgatttaga 3601 ggagaaatac catttttcag cacagtacac tgctgacatt atcggtatga actcagcaga 3661 cttcatcatt acatcaacct accaagaaat tgtagggaca cccgatacgc tgggccagta 3721 tgagtcttac aaatgtttta cgttgcccca actgtatcat gtggtagatg gtattgactt 3781 gttcagtccc aagttcaaca tgataccacc aggggtcaat gaacatatct tcttccctta 3841 taaccagata caacaccgag atattaccgt cagcaaaaga gttcaagatt tactgtttac 3901 ccgtgaagac tcccaaattc ttggacacct agagaaccaa agtaagcgac ccatttttgc 3961 tgttggtgcg atcactgcta ttgataacct tgcgggtttg gcagaatgct ttggtaaaag 4021 tcaggaatta caacagagtt gcaatttaat tattgtgact gacaagctgc atccccacca 4081 agcaatcaat tcagaagaag cagaggaaat cgaaaaactc cacaatatta tcaacgagta 4141 taatcttcac ggtcacattc gttgggtagg aatacagttc cctcttgctg acctgggaga 4201 agcataccgt attattgcgg attttcaagg aattttcgtc cactttgccc gatttgaagc 4261 ctttggacga accattctcg aagcgatgag ttccggatta ccaacttttg ccactcaatt 4321 tggcggttcg ttagaaatca tcgaagatgg agaagatggt tttctgctaa atccaacaga 4381 cctagaagga acagccaaga cgatattaag ctttattgac cagtgcaatg cttatctaga 4441 acactggtat 
aagatatcgg agttggtgat tcagcgtgtc cgtaacaaat ataattggca 4501 gttacacacc aagcagttgt tactgctagc taaagtctac agcttttgga actttgtgaa 4561 ccaggaaatt ggcgaagcca aagctcgcta tgtggaaaca ttgttccatc tgctctacaa 4621 acctagggct gaaaaaattt tggaacaaca tatgaaaaga taacagtctt atctcaaaaa 4681 ccccggtttg ccatggatac aacacaaaat tcccacgact ttatttatac agattggaca 4741 cttgtcgaaa cccagcttaa gccaaaccag actcagcaca gagaaactgt tttcactatt 4801 ggcaacggtt atctgggaac gcggggaagt tttgaggaag gttctactgg tgcagtgcca 4861 gcgaccttca ttcatggggt ttatgataat gttcctcttg tatacaccga acttgctaac 4921 tgtcctgact ggttaccatt gattgtcact gtcgatggcg atcgcttccg caccgaacgc 4981 ggcgagatac tgagctatga gcgacagctt gacctccagc gcggtgttct gaggcgtaaa 5041 gtacgttggc gcagtccagg cggaaagaca atagacttct gctttgaacg ctttgcgagt 5101 cgggcagatg agcatgtgtt aggactgcgc tgtcagttga cgccagtaga ttttgacggg 5161 ttgattgaaa ttcaaggtag cattaatggc tatcctgaaa atcaaggttt caaccactgg 5221 gaattgatag accagggcaa aaccaaccga ggagcctggt tgcaactcca gactcggaac 5281 acccgcatca acttgggcct ggctgttgga atgacggtaa caggagctga tgcgtcagta 5341 caagtcagta gtcctccagg ttatccaact ttgagcaccg tattccaagc ttctttagga 5401 cagacggtca ccgtggataa gtttgtgaca gtttttacgt cacgagatgt ggataacccg 5461 gtcaaagaag cgaatgaaaa gcttgctcaa ctcccagact atgaagcgtt ggtagatgcc 5521 catgcacaag catgggctga ggcttgggat aaaagcgaca tcttgattga aggagatacc 5581 aaagctcaac ttgccgttcg gtacaatatc tttcaattgc tgatcagcgc cccagagcat 5641 gatgagaagg tgagtatccc agcgaaaaca ctttcgggtt ttggctatcg cggtcatgtg 5701 ttttgggata cggaaatttt tatcctgccc tttttcattt acactcaacc aaaactcgct 5761 cgtaacttac tcacttaccg ctatctcacc ttaaatggtg ccagacgcaa ggcatctcat 5821 tacgggtata agggagcaat gtatgcttgg gaaagtgcgg atactgggga tgaagtgaca 5881 ccgcgttggt tgcctcctaa cgatttttat ggtgaagaca ttaggatttg gtgtcgcgat 5941 cgcgaaatcc acattagtgc tgatattgct tatgcggttt ggtactattg gaaagcgact 6001 gacgacgacg agtggatgcg ggactgcggt gtggaaatta ttctcgatac cgctgttttc 6061 tgggggagtc gcgttgagta tgacaccaag gacgaacggt atgaaattcg tggggtaatt 6121 ggagcggatg agtaccacga 
gattgcagac aacaatgcct ttacgaaccg gatggtgcaa 6181 tggcacctag agaaagcgct ctttgtgtat gactggttgc gtcatactta ccccgaccgc 6241 tttagcacac ttgtcaaaaa attgcaactt actcctggac gactttctcg ttggcaagac 6301 attatcaata atatatggat tccctacgat ccatcgacgg gacttgtcga gcagtccgag 6361 ggattcttta aattagaaga tatcgacttg gctgagtacg aaccacgtaa ccgctcaata 6421 caaacgattt tgagcattga ggaaacaaat aagcggcagg tgctcaaaca gccagatgtg 6481 ttgatgcttt tgtacttaat gcgccaatca caggaatttc cctacaccca agaaacgctg 6541 gagaaaaact gggactacta cgcaccccgt acagacatca cttatggttc gtctctcgga 6601 cccgccattc atgccatttt agcctcggat ttgggcaaat caaaagaggc ttatgaacgg 6661 tttatgcaag ccgcattggt ggatattgaa gatgttcgtg gcaatgctca cgaaggaatt 6721 catggtgcca gtgctggcgg tgtttggcaa gctgtgattt tggggtttgg gggaattcaa 6781 cttgcagaac atcaaccaac agcaacgcca caattgccac ctgggtggaa acgtctgaag 6841 ttcaagcttc attggcgtgg cgagtggcac gaaattgatc tacgtcctac agcacaagat 6901 actatgacac ttccagatat ccgaggagtc attttcgatt tggatggtgt tctcacagat 6961 acagcagaat accactactt aggctggcag aagctggcgg atgaagaggg attacccttt 7021 aatcgcctag caaacgaaga tttgcgaggt gtttctcgtc gcgagtcact gctcaagata 7081 gttggtaaca agcagtactc agaagcacaa ctccaggaga tgatggaccg caagaaccgt 7141 tactatgtgg actttatcca gacaatgagg ccgggaaatg tattgccagg gacaattgca 7201 ttgttggatg aattgaagga agctgggatt aagatagccc ttggttctgc tagcaaaaat 7261 gctcaaacgg tgattgagaa attgggtatt gccgatcgca ttgatgtggt tgctgacggt 7321 tacagcgtcc agcaacccaa gccagcacca gacttatttc tctttgctgc ccaacagcta 7381 ggactcaaac ctgagcaatg tgtcgttgtc gaagatgcag ccgcaggtat tgaggctgca 7441 cttgcggctg ggatgttggc tataggactt ggtcctactg aacgagtggg agcagcacac 7501 gttgtgttac ctaatctcgc gggcgtccac tggacggaac taagagagaa attgagcagc 7561 gatcagtaaa acgacgaaca aaagcaaatt atatacaagg cagaaggtag aagggaaaaa 7621 gcacaacttc tacttcctgc cttctgcttt atgggacgac aaaaaagtag acagaatgta 7681 taactttgat gcttaaataa tactgtctac catatgggca aaacttccat aacctatgct 7741 gaagcggacc ttcccatccc cagaagcgct tcgacgcttc ccgttcggtc tagctgatat 7801 tgccctgatt tttggcacat tggtgttact 
agggctaatc gcacgtgtgg gtgcaggaac 7861 tttagtgagt tttgtaccgc cggatgtggt gccagatgtt agcctcaacc cgcttcatct 7921 gccatactac gctggacgct caactctgcg gatgtttatc gcgctgtttt gctcaacatt 7981 gtttacttta atctatagct atgttgctgc caaaagccgt cgtgcagaac aaatcttaat 8041 cccactcctc gatattttac agtcagtacc ggtactgggc tttttgtcga ttacggtgac 8101 aggctttatt gctctattcc ctgggagttt actgggatta gaagcagcgt caatttttgc 8161 catttttaca agtcaggtct ggaacatgac cttttcgttc taccagtcgc tgagaatggt 8221 accaagcgaa ttagatgagg cagcaaggct ttatcggctt tcggcctggc agcggtttac 8281 aaagctggag gtgccaagcg cgatgattgg gctaatctgg aacgcaatga tgagttttgg 8341 gggcggctgg ttttttgtcg cagccagtga agcgattagt gtactcaacc agaagtatac 8401 gttacccgga cttggttctt acgtagcagc agcagtcacc gctcaggatt tgcctgcttt 8461 aggttgggca ttcctaacga tcgccgtggt tattttactg gtagaccaac tgttctggcg 8521 accgctgatt gcctgggctg ataagttccg cttagaacag agttcggcag cagaggctcc 8581 caactcctgg gtgttcgatt tgctcaaagc cgcgcggctt ccacgtttaa tgagacgggc 8641 gttcactccg gtgggtgaaa ctatcaaccg cctactatca tcactgaccc cacaacgtcc 8701 acgagttgcg atcaaccaga agcagaaagt ggtgagcgat cgcctttaca acttcgcttt 8761 gttactcctg attggcggat tgctggctgc ccttttgcac tttattctca caacagtggg 8821 actgggtgag gtgttcaaaa cttttatgct agggttactg accctgggac gcgtagtggt 8881 gctgctggtg gtagcaacgc tgatttggac accggttggt gtggcgatcg ggttcaatcc 8941 gcgactatca cgcctgttgc agcccgtggt acaattttta gcatccttcc cagcaaattt 9001 tattttcccc ttcgcaactc tcttcttcat tcgtgcccat atcagcatcg attggggaag 9061 tatcttcttg atgtccctgg gtgcccagtg gtacatcctc tttaactcta ttgctggggc 9121 gatgagcatt ccaactgacc tgcgcgagat ggcgagggat cttggtttgc gtggctggcg 9181 gttatggcgc aagttaatca tccctggcat tttcaccgct tgggttacag gtggtattac 9241 tgctagtggt ggggcgtgga acgccagtat tgttgctgaa gttgtcgctt gggggcaaac 9301 gacccttacc gcaactgggt taggagcata catcgccaag gcaactgaag tgggcgactg 9361 gccccgcatt acgttgggga ttggaatgat gagtctgtat gtggttggac tgaaccgcgt 9421 gttctggcga cgactctatc aactcgcgga aacaaaatat catctgtagg aaggggtgca 9481 agcatgacaa ctacccaagc aactcatgaa gtactgattg 
ccgtcgagca agtccataaa 9541 agttttcctc tgccagaagg taaaggagag tttacggttc tccgcaatgt caacctgaca 9601 gttagtacag gcgaagtcgt cgcattgctg ggacgcagtg gtagtggcaa aagcaccttg 9661 ttgcgaatca tggcaggctt gattccacca agtgaaggac aggtgattag cagtggtaaa 9721 cgtctacaac aagctaacca agatgtggca atggtctttc aaagcttcgc gctgctgcct 9781 tggttgacgg tgcaagaaaa tgtggagttg ggactggaag cgcagggagt taatcgagat 9841 cagcgacgta aacaagcact caaagccatt gacttagtcg gcttggacgg ttttgagagc 9901 gcttatccca aagaattgtc cggcggtatg aggcagcgag tgggctttgc acgggcattt 9961 gttctagaac cgcaagtgct gtttatggat gagccattta gtgcgctgga tgttctaaca 10021 tctgagaact tacggggtga aatcgacgac ttgtggaatg ctggcacctt tccgtccaaa 10081 agtattttaa ttgtcaccca taacattgag gaagctgtgt ttctggcgga tcgagtgatt 10141 atcctgggat caaatccagg gcgcgttcgt ggtgaagtcg ttattgattt gccgcgtccc 10201 catgatcgcg caaacgttcg cttcaaagcg ttggtagact atatctacac ggtgatgacc 10261 aacccagaag ttgaagtgac tggtgaagtc gcagtggcag cccccacaac cgcacaagca 10321 tctaaatcac cttatgctca gtcgttacca catgtacgag tgggtgggat cagtggtttg 10381 ttggaattga ttgtagagaa accagaaggt agggaggata tattcaggct ggcagaacga 10441 attcagttgg aagtggatga tctcctgcca attcttgatg gtgctgtgat gctgggcttt 10501 gcagatgtta tacaaggcga tgttcagctt accgaaattg ggcgcgactt tgccacgaca 10561 acgattttgc gaagtaaaga cttgttcagg cagcaagtcc tgcaacgtgt gccgatgtta 10621 gttagtatac tacagacgct gcgagaaaaa caaaatgggt caatgggagg agatttcttt 10681 ctggatttac tagatgaaca ttttccgcac gcagaagctg agagacaatt tgctacagca 10741 gttgactggg gacgctacac agaactattt gagtacgatg ctagcgaagg acggctttat 10801 ctaccagaac cagtacctgc agagagtaga gaagcatcct gaaaaacagg aaatattttc 10861 ttggcacaga cgtgcagtgg cttgtcattg atattggtac ttagggtgtt gggagatact 10921 cctattatta ggatgggcaa aaattagcca catcatacta aatttctgag caaagagtag 10981 agttcatttt gcacataaag attgactgaa attttgtttt gacttatact aagttttgag 11041 tcgcatgacc aagaattatt ggctgtcaac tcacaatttt ttcacaattt caataggata 11101 aacattaaat ggaggcttct ggtagtagta gccatcaggt aactactcaa gtgactatgg 11161 tttcaaaaga agcgcaaaat gtcaatgctg 
ccaacctact tcgcttcaat tcgaccagat 11221 ttgactgctg ctactagggc ttgatgatct tgctcaactt gatgggcgta ggtaacagca 11281 aagtctgcta cagcagagtc aaaagcatca cttttgccca ggtagctact gataacaaca 11341 ggatcgccag aacgggcatg ggcgcgggct aaggcagaac cgcaaatttc cgcgtaatcc 11401 tccaaacccc tggcagacat ccccttgagt ttaattgagg ttttcatgtc ttttaattgc 11461 cgaaaataga aatcttgtcc gcgactgtta cttgtccaac ccaagaaaat atcactagct 11521 gcctgcatga ggcgttgacc attgactatg cgctgtccgt tgtgggagta aggacattta 11581 cctgcgtagg gttctagtac tgagggacgg gcttctttaa attgcagcag caaaggatca 11641 ttatcgttac ttagcagcag tgcaactccg cagtgagtac caacactgcc aacgccaacc 11701 accttcatgg ctacatctac tagatggtag cgatctagca aaaactggcg atcgctttgc 11761 agagtgtcac gatattgttc aaataacacc cctacttctt ctaaatattg ctcctgtagt 11821 ggtagatgat acaacagggg tggttgatca ataaaccgcc gttgtccatt cacctcctct 11881 gtcatctgca caaatgtctg ctgtagagtg cgggtaaagg ctttgcttgc catttgctgc 11941 cagtattgac gagtttcctc atcaggtgca tgttccacca gcacattagc atccagtcgc 12001 gcataccaca ccgccaaagt ccccatctgc ccatactccc gaatggacaa gcgataggca 12061 cggactgctg cttcggcagt atcatagcag tgtttgtcag tcagacggat atctttcccc 12121 gcaacaataa tacttgtcac taagcgtttt acatcccatt cccaaggtgc gcttaaggtt 12181 tcgtcaaagt cattcaagtc aaagatgaga tttcgttctg gtgtggcaaa tccaccgaag 12241 ttaagtaagt gacaatctcc acatgcttgt acgtggattc ccgttgtggg agtagttgcc 12301 aagtcagccg ccataataat tgcacttccc cgcagaaatg cgaaggggga ctgcaacatc 12361 cgaccgtaac gcatggggat taactctggt atgcgtccct gatttgatac ttcgagtatt 12421 tcaattggat ctggacgatc tgtaggatgc cattcgcggt gggcacttcg agaaacaact 12481 tgacgtaatg ctttgcctgc ttccagtcgc tgtgctacgg tgagctgatt attgggagta 12541 agttcagctg atttaatagg tggtgtagta gtcatataga ggaaagtatt tttcagtgat 12601 taaagactac aagcactagg agatgaaaga atatttttcc ctctcttctc ctacatcatc 12661 acatcaaaat attaaaaaga tataggaatc cggtttgatt tgatgaacta acaaagtagg 12721 gggagggaac aggaaacagg cttcgccgtg aggcttcgac ggtgagcgct cacgccgaag 12781 tccgaaccgc gtcagcggct cgactgagcg ttcgactgag ctgacgccga agtcttcgcc 12841 gaacgtccga 
acgggaagaa ggaataaagg tgtacctagc tgaacaaaag tcaaatagga 12901 atcctatact ccaaaatgtt agcaatagtt ctagcaaaac atggagataa ccttttctta 12961 ataaaaaagc gagaaagctt aaccttaatg agcaaatttg tagtcaaaaa atcattgtaa 13021 agtacgtaca tcataaagaa taaaaagaga ttaggtattg actaaattct ttagttcctt 13081 attttctatt aaccataatt ttggaatttt aaatgactta tgacttggtt agattttaca 13141 attcgcttgg cggtagcatt cttactaggc tctgttattg gagtagaaag acagtggcgg 13201 caacgaatgg caggactgcg gactaataca ctagtcgcca ccggtgctgc cttatttgtg 13261 atgctgggag tcatgactcc aggtggaaat ccgacccaag ttgaggcgta tatagtctct 13321 ggcgtcggat ttttgggagg aggcgtaatc tttcggggag gtgccagtgt acagggattg 13381 aatacagcag caacattatg gtgtgtcgca gcagtgggcg ctttggcagg aggtggattt 13441 ttcccacagg cgtttatagg aacagtagca gttttagtgg caaacatctt tctgcgtcct 13501 ctaggttacc gaattaatca gcaacccctc aaaggtacag aaattgaggt atgctatcgc 13561 tgttctatag tttgtcgtag caacgatgag gctcatgtcc gtgccctgct gctacaggca 13621 gtcagtgcca ctggtaaaat gaagttgcgt tctttacata gcgaggatct cgaagaaact 13681 cccgaacgtg tggaagttga agctgactta gtcacgcagg atcgcaacga tcctttttta 13741 gaacaaattg tcagccgctt gagcttggag tctggagtaa gtgcggttag ttggaaaatt 13801 attgaacaag agtacggcta aagataagta agtcggcgta gaaaaaccaa actatgtaaa 13861 gataaataaa tacggagaat acatctacca aaatcactaa aggatacggg ggctcgttta 13921 agggtatttt tgactcgtga acaagacaaa actctgctaa acctaagaac tgtcgatgta 13981 taccaaccgc cttggcggtt aggacgctaa atgtaacaca tgcgagtggg gcatctcgta 14041 aacaattaaa ttcctccagt tttttgcgga ttcggctttt taacctcttt tgcagcattt 14101 ctcaatcaag ttaagatctg gagaataagg cggtaggtac agcagtttac acccctgcag 14161 cttaaatgcg gattagctta gaatatgcgc cgagataaca gcaaaaaggg acatattgta 14221 aataaatatt acagattctg ccttattgat ttactaattt ttccataaat gatataaagt 14281 taacgtgagt tcgacgggtt gaaaaaggtg caaaaaaagg ctgagactgt atttagacga 14341 aaaatactca taatgtctca gcaatggaaa ttattgtatc tcggttggat gtgacacaaa 14401 ttttttgtga tgtagatgat ttgtgccaac aataggagaa caagtaccgc agctaccatc 14461 aatgagcggt aaacgccgca gtacttctag gatgcatcta aaagaaccat cagttgcggt 
14521 ttaatactaa agattattca taatttcgct aagtctccga ttaactcagt cctgctgggt 14581 cgttttgcat agcttagtct aaactccagt aatcaatgag cgtggcgaat tgctggcttt 14641 taagctgaca agtgggaatg tggatgaccg cgagccagtt cccgatttga ctaaagactt 14701 gattggtaaa ctgtttggcg ataccctttg cgtcaagccg gaggcttatc gcggatatat 14761 ttcccaaaag ttgtttgagc aattatatga acctggcttg cagttaatta ctcgctccaa 14821 gaaaaacatg aaaaatcggt tggtcaaatt aattgataag attctttttc tcgaaagagt 14881 ttggtgagca aaatttttca ggacaagatt tcaagaggac acgctttagc cttgcaaatg 14941 ctcaaggagc taattttctt ctagtaagtc taaaatataa caaattttta tatgcctcaa 15001 ttgagaatac aagctcactc cccaatattg gtaagtatct cgctgaaagt acccttgagg 15061 gtaatattgt aggagcgtaa tttatgatac tctttcaaaa gaagtgaatt cttatagctt 15121 ttcgcaaaca tcctgactta atgtcaattg gagtttgtca caatatatgg tgctttatcc 15181 agaaactttt gcttgttatt taaaatttca ctacacagaa tctgaattat gtaaacttta 15241 taccatttgt tacctccact tcaactattt attccaaatt gcacttggag tattttgttt 15301 caattgttct atttatccat agttcatacc taacgaaaaa ttatcatggt acaaccaaac 15361 ataaagacta tcaatactcc cttcattggt ataatcggcc catcgcaata cttgggagtg 15421 aagctgacta atttttcatt ttcacaatta ttcttggtta gagaccaaaa caatgccaac 15481 gagaaagaaa gtcccttaaa acaacaaata acgaaagcta gggaaaaaat gtatgaaaat 15541 atagtaaaca atttaaatgg cttaatcagc cttaatgata aagaagaagg agtcaaaact 15601 atctatgttt ttgacaagta tgaagacttt ttcttgaaga cacagtttag acctgaccgt 15661 gaaagaatgg ttatccgaat cacacgaata actccacggg aaacaagagt atccaccatc 15721 tgtcgaatct ataccactgg cactcatata tatattgctt tggattctta tctccttggt 15781 aaaataaata ttttcagcct tgtgattcac acactgctac tgttgatttt tgttcctatg 15841 tttttcacag ggcttttagg atttttaggg gctttagtgc tattattgcc aactctgttt 15901 aatccatcca acataaatgc tttactggcc gttttttcag gtctgctttt accttttatt 15961 ccaggtctgt atctatattt cagttggttt cctgtagtta aagccttgct aaataaagaa 16021 agttttaaag cggctttgaa gcataggttt cacaatagaa gattcaccaa cttatttgat 16081 gaagatgacg gcttaactta cctaaagtct gtaacgcctt ttatcattga tcagataaca 16141 aacgcacttg caagctatgg gataaaagac gaaaccattt 
ttagtaaact taatgaaata 16201 aaggaagcaa ttttagcaca accaacaata agcctaaaca attcagggat aatgtctaat 16261 gtcctgattg gaaacaacaa ttttcaaagt tcaaaataat aactaagaac taataaagag 16321 aggtaaaaag aagtatgcca aatgatatta ctgttgataa tagtggtgtc attaccaact 16381 ccgtatttgg taataacaac attcaacaaa acataagtca aaatactgat gaaattacta 16441 aactaatttc ttctctgcgt gacatgtctc aaggatttcc tgaagctcaa cgtgaagcaa 16501 ccatagtgca tttagatgat ttacaggaag atataactat acctgagaaa caaaagaaag 16561 agagatttaa aactcgtctt gctgcattgc tggcaattac tggtacgctt ggaggtgttg 16621 tcgcaaactc tgttgacttt ggtaacaaag ttttagactt atctaaaaaa ctgggagtcc 16681 ctatcgagat tgtccagcct caaccgaagc aaatccctta aaatctgggg tattcgtacc 16741 aatggtacga ataccccggt ttgaaggatt tagtcgaaga aacatctaaa tataacaaag 16801 aatagtattt gattcgcgct aagggacttc caaaaaataa attaatcaaa caaactcaag 16861 agacagtctc ctcctcataa taattatcat tatgaggagg ggaggaagtt acctctgctt 16921 tttgttactg cgatccgaga actacgcgat cgcgtccttt tcgtttagct tgaattaggg 16981 ctttctcagt agcatccatc agagcctcag gctcagtgtc agattaatgt aaaatcttga 17041 atcaagcgat accgaaccat ccggaaattc tagggttata ggagtagctg gacagttaaa 17101 tttattaata aattgctgta gtatttcatt caagtcacca tattctcaag tctgacgaaa 17161 gttggattta ttgatcggtt ctttccaaac ttcactcaac aactgccaga aatcagatgc 17221 aactttttta actcgttctt ctccgtaatc ggcttcatcc gctatgttag tataagtttt 17281 ttcttcccag caagagcgca gtactatctc ttcgagtgga gtaagagcgt ttggaagaat 17341 tgcttttagt agcagtacaa tttcatctat actgattagt catttgaccc aacttttggg 17401 taatttatta aataattttc tttacggaag tttaagtgat aagataaccc taatggaata 17461 ctcgtagtaa aaataacaat ctttttttca aaaactgaca aaagaatgtt gacaattatt 17521 tgaatttctg aaaacttcat caaaactaga tagcagcttt catccgaaga gtttttacgt 17581 attataattt acta // LOCUS NODE_1886_length_17517_cov_5.33455517517 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17517) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17517) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17517 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(249..1631) /locus_tag="DP116_16465" CDS complement(249..1631) /locus_tag="DP116_16465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136400.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chromosomal replication initiator protein DnaA" /protein_id="PRJNA477356:DP116_16465" /translation="MEMPIENLWSQVLERLQLELSRPTFETWIKTASAERLENNCLVI FTPNPFARNWLQKYYIKTIANVVQDILGYPVDIYITVTQGDEVSHVNEQEVSWGFRTQ TSPSETLPQNRLKTTELNLKYVFSRFVVGANNRMAHAAALAVAEYPGREFNPLFLCGG VGLGKTHLMQAIGHYRWEISPDSRIFYVSTEQFTNDLIAAIRKDSMQSFREHYRAADV LLVDDIQFIEGKEYTQEEFFHTFNTLHEAGKQVVLASDRPPNQIPGLQQRLCSRFSMG LIADIQPPDLETRMAILQKKAEYENIRLPREVVEYIAFHYTSNIRELEGALIRALAYI SIWGLPMTVENIAPVLEPPIEKVEVTPKAILSVVAEVFDISIEDLKGNSRRREISWAR QIGMYLMRQHTGLSFPRIGEEFGGKDHTTVIYSCEKITQLRQTDQNLVKTLRQLSDRI NMASRPHKSS" gene 2019..3200 /locus_tag="DP116_16470" CDS 2019..3200 /locus_tag="DP116_16470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875842.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_16470" /translation="MNGILVMLLGSGAVMSIGGCSIPGSTLNATKTDAQAQTSVQKTT DNPSPSILPVVSSSNNPNFVVGVVQKVGPAVVRIDAARTVVAQVPEEFDDPVIRRFFS SQPRERIEQGSGSGFIINAGGQILTNSHVVNGAQSVTVKLKDGRSFKGRVMGEDPVTD VAVIKIEANNLPTVSIGNSELLQPGEAVIAIGNPLGLDYTVTSGIISATGRSSSDIGV TDKRVDYIQTDAAINPGNSGGPLLNARGDVIAMNTAIIRGAQGLGFAIPINTAQGIAQ QLIAKGKVDHPYLGIKMATLTPDVKEQVNSTLGINLATDKGILLLDVVPRSPAAAAGL KTGDVIQRISNQPVTKIEEVQKIVEKSQIGSPLELQVLRNGQTTQIAVRPEPLPVRRG S" gene 3477..4010 /gene="def" /locus_tag="DP116_16475" CDS 3477..4010 /gene="def" /locus_tag="DP116_16475" /EC_number="3.5.1.88" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194446.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptide deformylase" /protein_id="PRJNA477356:DP116_16475" /translation="MGELLSILQLGDPVLRQKASFVENINDEHIQKLIDHLVATVAKA NGVGIAAPQVAQSCRLFIVASRPNPRYPNAPEMEPTAMINPKIIAHSTEIVKGWEGCL CVPGIRGFVPRYQEIEVEYIDRNGKAQKQKLTDFVARIFQHEYDHLDGVVFLDRLEST LDIITEQEYQKQVINNT" gene 4105..5265 /locus_tag="DP116_16480" CDS 4105..5265 /locus_tag="DP116_16480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16480" /translation="MTKTLSSRERVSPQEKEPVDLFSSQQSSDESSETTKTSVAQSPM QPPIVELQVVHATTGRVRIRATDGSHNSIFETISQQLRKQDGVREVSVNEQTGSLVIN FDEKKLPLPQMLERLQQFNIHQLQASPEAKSKKDPFAAWKSPDFWKEQGISFIPLFTG LAVTGGLGISGLASIPVYYVTANATRRVIDELQSESKTSALPSSQKAKDNNKSSTKRN TTDHPSLSKSKVEHKSIEAAAQPAKIAYSVVHAIPGRIRLNVPRVARDRAYARRLERL LKTEPQVTNVRVNCDAASVAITYGSAEIPVSHWVGLMQLADETIPQTNLIKIKEQPLT QPVHQQSESTTSTALKEQAVVETDGLWSDFKSPALFTALSFMANFPLDPVPY" gene 5285..6679 /locus_tag="DP116_16485" CDS 5285..6679 /locus_tag="DP116_16485" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16485" /translation="MAGLTTAGLTTAGLTTQLAPHSDKKSQILEEQTSHSIAKPQSAI AQSARPEGAIAYSVVYALPGRVGFCVPQISLDPTYTQRLLTLLACDPRVLSQQVNEIA GSIVIDYKSGIMSDGEMRLYLARLIQSASSEVTTKAPEKAAQLSLQCDEKSQIVEQQT SHPTAKPHSPVVYSIVYATSGSVGFCVPQISLDPAYVQRLLTLLACDPRVISQQVNED AGSIVINYQPGIVPDVQMRLCLANTIEFAGASEEITPVTEKPVSLSSVSQIAACPSVP QEKENDYEPVGAKVLSDPGLVREEKENGSETVKANVLSDPEVVRMETSLDSKLETAKV SPSCELKGADKIPTSSNKCNHTPDKTDHKTKKPAKVAYSIAHAIPGRVRFRIPRIAKD SKYVQRLEALLKADALVTGKRVNSAAASIVITYKSGTIPNSKKRSLSLLEQVISYLSS LIQSAASDAVVSIS" gene 6821..9091 /gene="cadA" /locus_tag="DP116_16490" CDS 6821..9091 /gene="cadA" /locus_tag="DP116_16490" /EC_number="3.6.3.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875838.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cadmium-translocating P-type ATPase" /protein_id="PRJNA477356:DP116_16490" /translation="MAKLSVHLPPSGSHASQVAVAAKNGISRVESNGHSNGHPQVKSC EVPYSIVHTLPERVRLRVPRLLYDADYAQRLQVLLEADALVTSVRIKRAAASLTVTYK SSKVADTKIRSHLGYLIQAASEVVVLKPSKPKAASDANEEQSWPGMQLSALATALAVL GGPLGWSIPPVMVAGTIALATLPVIKRAWEGIRDERKLNIDFLDFMAIAITTVQRQFL TPALMLSLIEIGENIRDRTARSSAQQTLDLLSSLGQFVWVERDGEKVQIPIQDVQRGD TVIVYPGEQVPVDGTILRGKALLDEQKLTGESMPVLKRKGQTVYASTLMREGRIYISA ERVGNDTCAGQSIRLMQEAPVHDTRMENHVLKIAQKAVVPTLLLGGAVFAVTRNPARA ASVLTLDFATGIRVSVPTTVLAALTYAARRGVLIRSGRALEKLAEVDTVVFDKTGTLT KGEVAVVGVASLNEATSITRVLELAAAAEQRLSHPVAEAIVRYAQEQGVEAPSRSKWD YQLGLGVRAEIDGETVYVGSERYLRQEGVEMNLNGYQKQTTSAIYVASNGQLQGIIEY SDIPRPESREVITALLTVEGVEVHMLTGDNKRTASAVASQLGIPPTHTHAEAFPEQKA TVVRELHEQGKTVAFVGDGINDSPALAYADVSVSFANGSDIARETADVVLMQNDLHGL LEAIEIARNARQLIHQNTGLIAVPNIAALVTAVLFGLNPLAATMVNNGSTVVAGVNGL RPIFKSRKQKTLPSAR" gene 9162..9470 /locus_tag="DP116_16495" CDS 9162..9470 /locus_tag="DP116_16495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF5132 domain-containing protein" /protein_id="PRJNA477356:DP116_16495" /translation="MAPKITDFVEDAGAPGIIASIGAVLLAPVVIPVVAGIGKPIAKS LIKGGLVLFEKSKGAVAELGENWEDMVAEARAELAEGRQLPAVDVAGSPVDNTLDNGA " gene 9676..10254 /locus_tag="DP116_16500" CDS 9676..10254 /locus_tag="DP116_16500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308939.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16500" /translation="MSKNGYINLTDMPKVIAEKPIKKAFQPISTRIVSSTPGRLRLRI AQPHRQSGEMQRIANALQANPNINQVRTNIQNGSIIINHDGEHGSLDNVYATLRDLGI IFGDVALGKSDAAAEVSNAVVDLNKRVRQATNNAVDLRFLFPLGLGMFSIRQLVTRGL QLEIIPWYVLAWYAFDSFIKLHAISQVQSSKE" gene complement(10465..13239) /locus_tag="DP116_16505" CDS complement(10465..13239) /locus_tag="DP116_16505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877857.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-layer family protein" /protein_id="PRJNA477356:DP116_16505" /translation="MKHDWRSGSVSFSASPSIRKAHSLFLFTLPLCIIGSLTSILTSI NTVTAQQVTSDGTVSTTVITPDGKNFNINDGTTRGGNLFHSFKEFSIPTGGSASFNNA ANIQNIISRVTGGSISTIDGLIKANGAANLFLLNPAGIIFGPNARLQIGGSFLGSTAN SFVFDNGFEFSATDPQAPPLLTINVPIGLGFRDNPQNITTKSTPTQYPTLEVPEGKTI TLVGGNLSLDGPDLLAPGGRVELGGLSTPGTVGLNTDGSLNFPVGVQLGDVSLTNGAI VDVSAGGGGSIAVNARNLDISGGSALYAGINQGLGSVGSQVGDITLKASGTTTVANSF VFNYVDSEAVGNSGNLTIETQKLRVSDGARIGTLIFGQGNAGNLTIKADELVEVLGTE KLNQTTTSLQANLESGGIGKGGDLTIDTKKLVVRNAQVGASTFGKGDAGNLTVLATDS VELSGEIPGNEKGFPGGLLAQVDLKGEGRGGNLTIKTGRLSVSDGSKVQVATFGQGDA GNLFIKADDVDVFETPKYNFYSTGIFAGVQIAPQTVDQPKGNAGNLTIETDRLRIRDG GIVTTFTQGEGDAGTLQIRAKESVEVFGTSLNGRLRTSTISAGATSTSTGSGGSLRID TGKLIVRDSGTVTVSSENTKPAGQLEINARSISLDNQGSLTATSTSGNGGNIILGVQD FLLLRRNSNISTSAGLAGAGGDGGNITINSPLIVAFPGENSDITANAFNGSGGKVTIN TQGLFGITPLSRQELEQRLNTTDPAQLDPRNLPTNDITAISQNNPNLSGTVNIITPDV DPTRGLFELTETVIDPAQQIAQNPCIKGFGSSFTIVGRGGIPTDPKKILSSDNVRVDL VKPVASTVSSTSATQKQPSQKPPVKEIIPARGWIYNEKGEVLLVGYDPTKTGPQRSQP APASSCAAVR" gene complement(13530..14348) /locus_tag="DP116_16510" CDS complement(13530..14348) /locus_tag="DP116_16510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015176324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-ketoacyl-ACP reductase" /protein_id="PRJNA477356:DP116_16510" /translation="MNIQGKTALVTGASRGIGRAIAFELARQGIKRLLLVARDSACLA EVASEIKMLGVEAVILPLDLSEVVEVNIAIAQAWRDHGPIDLLVNCAGVAHQAPFLKS RLPNVQVEINVNLIGMYTMTRLVARRMVAQGSGTIVNVSSLMGKVAAPTMATYSATKF AIVGFTQALRGELSKHNIRVVALLPSLTDTDMVRELEWFRWVVPITPQKVAQALITGL QRDSPEILVGWQSHVAVWCNRIAPWLLEKVLLMAAPQERQPRYQRFRDARATSR" gene 14930..16003 /locus_tag="DP116_16515" CDS 14930..16003 /locus_tag="DP116_16515" /EC_number="2.7.1.121" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydroxyacetone kinase subunit DhaK" /protein_id="PRJNA477356:DP116_16515" /translation="MKKLINKPEDFVRESLQGMAAAHPDLIQVNYEPAFVYRADAPIQ GKVAIISGGGSGHEPMHAGFVGMGMLDAACPGEVFTSPTPDQMLSAAKRVDGGAGILY IVKNYSGDIMNFEMATELARSEGIRVLSILIDDDVAVKDSLYTQGRRGVGTTVLAEKI CGAAAQQGYDLRFISHLCRYVNLNGRSMGMALTSCTVPARMTPTFELGDREIEMGIGI HGEPGRKRMNLKSADEITEMLALSIIEDAPYTRTVREWDEEKDEWVDVELIDPVFQQG DKVLAFVNSMGGTPISELYIIYRKLAEICEKKGLQIVRNLIGPYITSLDMQGCSITLL KLDDELTRLWDAPVKTPSLRWGV" gene 16007..16657 /gene="dhaL" /locus_tag="DP116_16520" CDS 16007..16657 /gene="dhaL" /locus_tag="DP116_16520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316502.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dihydroxyacetone kinase subunit L" /protein_id="PRJNA477356:DP116_16520" /translation="MVTKEQILRWLQTFATQIEQNKDYLTELDAAIGDADHGINMERG FKKAIAQLPTVADKDIGSILKTVSMTLISSIGGASGPLYGTFFLRASTAVAGKEELTV EDMLGMFKAGLDGVLGRGKAQLGDKTMIDVLSPAVSAFQQAVTEGKGTLEAMQRAVAA AEQGVKDTTPMIAKKGRASYLGERSIGHQDPGATSCYWMLKSLLETLASSNDEAVG" gene complement(16688..16880) /locus_tag="DP116_16525" /pseudo CDS complement(16688..16880) /locus_tag="DP116_16525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873721.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(16924..17262) /locus_tag="DP116_16530" CDS complement(16924..17262) /locus_tag="DP116_16530" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16530" /translation="MSKLVKKGCIVLLAGTALSMIMGVQGAKADHDDKQYNYGTEQCN KIYGLTRELTREEFQACDLAIQKMQRNRKRPLQYEQRKREREDPNGRSVDRLRPQEEI IDTGTEPITP" BASE COUNT 4963 a 3783 c 3913 g 4858 t ORIGIN 1 ccccggctcc cctgcacacc ttacggtaat cttcgggcgg gaaaggagta gtcattggtc 61 attagtcatt ggtcattggt cattagtcat tggtaatagc tcgatttctt cgcgtttttt 121 tacttatttt gtttcttttt taggacttac gcattatctg ccagaaaacc ggttgattcg 181 ttcgtatttg gttgtgaaac agagattttt ggtagaaact gggtttattt gcgtaagtct 241 tgtttttgtc aagatgattt gtggggacga ctagccatgt taatgcgatc gctcaattgg 301 cgcagtgttt tcaccaagtt ttgatctgtt tgtcgcagtt gggtaatctt ttcacagctg 361 tagatcaccg ttgtatggtc ttttccacca aattcttctc caattctggg gaaactgaga 421 cccgtgtgct ggcgcatgag atacattcca atttgacgtg cccagctaat ctctcgtcgc 481 cgcgaattcc ctttaaggtc ttctattgat atatcaaaaa cctctgctac cactgataaa 541 attgcttttg gtgtaacttc tactttttca attggtggtt ctaaaacggg tgcaatattt 601 tctaccgtca tgggtaaacc ccaaatagaa 
atatacgcca atgcacgaat taatgctcct 661 tccaactctc ggatattaga agtatagtga aaagcaatat actcaacaac ctcccttgga 721 agacgaatat tttcgtactc agcttttttt tgtaaaattg ccattctcgt ttctaaatct 781 ggcggttgga tatcagcaat taaccccata gaaaatcgcg aacacagacg ttgttgcaac 841 ccaggaattt ggttgggagg acggtcagaa gctaagacga cctgtttacc agcttcatgt 901 aaagtattaa aagtatggaa aaattcttct tgagtatatt ctttaccttc aataaactga 961 atatcatcca ctaaaaggac atcagcagca cggtaatgct ctcgaaaact ttgcatactg 1021 tccttacgaa tcgcagcaat gagatcatta gtaaattgct cagtagaaac gtaaaatatt 1081 cttgaatctg gactaatttc ccatcgatag tgaccaatcg cctgcatcag gtgagttttg 1141 cctaaaccca caccaccaca caaaaataaa ggattaaact ctcttccagg atattcagca 1201 actgctagtg cagcagcatg agccatacga ttgttagcac caactacaaa tcgagagaaa 1261 acatacttga ggtttaattc tgtcgttttt agtcggtttt gagggagcgt ttctgaagga 1321 ctggtttgag tccgaaatcc ccaagaaact tcttgttcat tcacatgaga aacttcatca 1381 ccttgagtaa cagtgatata aatatctacg ggataaccaa gaatatcttg tacaacatta 1441 gcgatcgttt ttatgtaata tttctgtaac caattacgag caaacgggtt gggagtaaaa 1501 attactaagc aattattttc caatcgctca gcgctcgcag ttttgatcca agtttcaaag 1561 gtgggacggg atagctctag ttgtaagcgt tccagtacct gactccacag attttctatg 1621 ggcatttcca taatttacca ccaatgctcg gcaatggtca caataataat gaagatatga 1681 gcaagtacca atgactggaa acaacactct agaccgcaaa acttgccttt gagttgtcat 1741 cattcttggt ctccccccgg tagccaagat tttatcatcc atagattcat acggagaaac 1801 aaagacaaat gaaaactaca gactatggct ttgggccgag aaaagaacaa gcaaagcagg 1861 ggagcaaggg agcagaggag caggggagca ggggagtaaa tcaattcaaa atcgcgcagc 1921 gttgcgagcg aagcgaagca atctcaaaat tcaaaattag aagtcactcc ctccccgttt 1981 ccgtgtcccc aaaagggcgt ttgtcccaaa ggtggtcaat aaatggtatt ttggtgatgc 2041 tgcttggtag tggggcagtg atgtctattg ggggttgttc tatccctggt agtacgctta 2101 atgctacaaa aacagatgct caggctcaaa cgtcagtcca aaagacaaca gataacccct 2161 ctccaagtat tttacctgtt gtctcttcgt ccaataatcc taacttcgtt gttggagttg 2221 tacaaaaggt aggacctgca gttgttcgca ttgatgctgc aaggactgtc gttgctcaag 2281 ttcctgagga gtttgatgat ccagttattc ggcggttttt 
tagctcacag ccaagagaaa 2341 gaatcgaaca gggtagcggc tctggattca ttattaacgc tggcggtcaa atcttgacca 2401 attcccatgt ggttaacggt gctcaaagcg tgacagtaaa actgaaagat ggtcgctctt 2461 ttaagggacg cgtcatgggt gaagacccag taacagatgt tgctgtgatt aaaatagagg 2521 caaataatct gccaactgtt tctataggta actctgagtt attacaacca ggagaagcgg 2581 tgattgcgat cggtaatcct cttggtttag actatacagt gacatctggc attatcagcg 2641 ccacaggtcg ttctagtagt gacattggtg ttactgataa acgtgttgat tacattcaaa 2701 ctgatgctgc tattaaccct ggtaactctg gtggacctct gttaaatgct cgcggggacg 2761 tgattgcaat gaatacagct atcattcgcg gcgctcaagg gttaggattt gccattccta 2821 ttaacacggc gcaggggatt gctcagcaat taatcgcaaa aggtaaggtt gatcatcctt 2881 atttaggtat taaaatggcg actttgactc cagatgttaa agaacaagtg aattccacat 2941 taggtattaa tttggcaaca gataagggaa ttttactact tgatgttgtg ccccgttctc 3001 cagccgctgc tgctggacta aaaacaggag atgtgattca acgcattagt aaccagccag 3061 ttactaaaat agaagaagtg caaaagatag tcgaaaaaag tcaaattgga tctcctttag 3121 aattgcaagt gctacgtaat ggacaaacaa cacaaatagc tgtaagacca gaacctttac 3181 cagtacgacg tggaagctag tagtgcttgt gcggaattcg ctcattcgca atccggagaa 3241 atatgcccta cggggacgca ctttacaaaa aactcatctt ataagctttt cactgaatgt 3301 gtagacgcgg agcggcttcc cgcagggtat ggtatgttta tttatgcgcc cgccgtacta 3361 gagcaaaaat gatgaagtat gaattatgaa ttcaaaaatt tttcggattc atattttaga 3421 tttcataaat tttctcgatt aatttttttg acgaaaagtt attactaaga gttattatgg 3481 gtgaactgct atcaattctt caattaggcg atccggtact gcgtcaaaaa gcttccttcg 3541 ttgaaaatat caacgatgag catattcaaa aactcattga tcatttagtc gcaacagttg 3601 ctaaagctaa cggtgtcggt attgctgcgc ctcaagttgc ccaatcctgt cgcttattta 3661 ttgttgcctc ccgtcctaat cccagatatc ccaacgctcc ggaaatggaa cctactgcta 3721 tgattaatcc caaaatcata gcccattcca cggagatagt caaaggatgg gaaggttgtt 3781 tgtgtgttcc cggaattagg ggattcgttc ccagatatca agaaattgag gtagaataca 3841 tagaccgaaa tggcaaagcg caaaagcaaa aattgaccga ttttgtcgct cgtatctttc 3901 aacacgagta tgaccacctt gatggtgtcg tctttttaga cagactcgaa agtactttgg 3961 atataatcac agaacaggaa tatcagaaac aagtgattaa caatacttaa 
ctataaataa 4021 ttctaaagaa tgagaattca ccgagaatat taaggtacac ttaacctatt gttaaccatc 4081 tggcttgacg cagttatcgg gtaaatgacc aaaactctca gtagtcgcga gagagtaagt 4141 cctcaggaaa aagagcctgt ggatttattt tcgtctcaac aatcaagcga tgaatcaagt 4201 gaaactacca agacatctgt tgcacagtca ccgatgcagc cacctatagt tgaattgcaa 4261 gttgttcatg caacaacagg acgcgtccga atccgtgcta ctgacggtag tcataactcg 4321 atatttgaaa ccatctctca acagttacga aagcaagacg gggtgaggga agtatctgtt 4381 aatgagcaaa caggcagttt agtcattaac tttgatgaga aaaaactgcc cttgccccaa 4441 atgttggaac gactccagca atttaacata caccagttgc aagcttcgcc tgaggcaaag 4501 agtaaaaaag acccctttgc tgcatggaaa tctcctgatt tttggaaaga gcagggcatt 4561 tcgtttattc ccttatttac agggttagca gtcactggag gactcggaat tagcggttta 4621 gcatcgattc cagtttacta tgtgacggca aatgcgactc gtagggtgat tgacgaactc 4681 caatcagaat caaaaacaag tgcactccct tcttcccaaa aagcaaagga caataacaaa 4741 tcttctacaa aacgcaatac aactgaccac ccttccttat caaaatcaaa agtggagcac 4801 aaatctattg aagcagcagc acagcctgca aaaattgcct acagcgtagt tcatgccatt 4861 ccaggacgta tccggttgaa tgtgcctcgg gttgcacgcg atcgcgccta tgcgcgaaga 4921 ctcgaaaggt tactgaaaac agaaccccaa gtgacaaacg tacgcgtcaa ttgcgatgct 4981 gcatctgttg ctattaccta tggttcagct gagattccag tatctcattg ggttggtttg 5041 atgcaattgg cagacgaaac aatcccgcaa acaaacctca taaagataaa agagcaaccg 5101 ctgacacagc cagttcatca gcaaagcgaa tcaacaacat caacagcgct aaaagagcaa 5161 gccgtagtag aaacagatgg tctttggtct gattttaagt ctccagctct ttttacggct 5221 ctatccttca tggcgaactt tcccttagac ccggttccat actaaggaat actggaggac 5281 aaaaatggct ggtttaacaa cggctggttt aacaacggct ggtttaacaa cacaattggc 5341 accccactca gataaaaaat ctcagatcct agaggaacag actagtcatt caatagcaaa 5401 gccacagtct gcgattgcgc aaagcgcccg ccccgaagga gcgatcgctt acagcgttgt 5461 ctatgcgctt cctggcagag taggcttttg tgttcctcag atatccctag atcccacata 5521 tacacagcgc ttgctgactt tgcttgcttg tgatcctcgg gttctaagtc aacaagtcaa 5581 cgagattgca gggtctattg tcatagacta caaatctggg atcatgtcag atggtgaaat 5641 gcgtctgtat ttagctcgtc tgattcaatc tgctagcagc gaagtgacaa caaaagcacc 
5701 tgaaaaggca gcgcaattgt cactccagtg tgatgaaaaa tctcagatcg tagaacaaca 5761 gactagtcat ccaactgcaa agccacactc tccagtcgtt tacagcattg tctatgcaac 5821 ttctggtagc gtagggtttt gtgttcctca gatatcccta gatcccgcat atgtacagcg 5881 cttgctgact ttgcttgctt gtgatcctcg ggttataagt cagcaagtta acgaggacgc 5941 agggtctatt gtcataaatt accaacctgg gatcgttcca gatgtgcaga tgcgtctgtg 6001 tttggctaat accattgagt ttgcaggcgc atcagaagag ataacgccag taactgaaaa 6061 accagtcagt ttatcctccg tttctcaaat agctgcttgt ccaagtgttc cgcaagaaaa 6121 ggagaacgat tatgaacctg tgggggcaaa ggtcttatct gatcccggac ttgtcaggga 6181 agaaaaagag aatggttctg aaactgtaaa ggctaacgtc ttgtctgatc ccgaagttgt 6241 caggatggaa acatctttgg actctaaatt agagacagcg aaagtatccc caagctgcga 6301 attaaaagga gcagacaaga tacctacctc atcaaacaag tgcaatcata ctcctgataa 6361 gactgatcac aaaacaaaga aaccagcaaa agtggcatat agcattgctc atgcgattcc 6421 aggacgagta cgttttcgta tacctcgaat cgctaaggat tcaaaatacg tccaacgctt 6481 ggaagcgttg ctgaaagcag atgctttagt tacaggtaag cgcgttaata gcgccgcagc 6541 ctcaattgtc atcacctaca agtctggaac aatacccaat tccaaaaagc gttctttaag 6601 ccttttggaa caggtcatat catatttatc cagtctaatt caatctgctg ctagcgatgc 6661 tgtagtttcc atcagttaac cagaagacag gtaagtccag ttcaacaagc gccacgcttg 6721 aaggcagttg tctttgagct gccttgtgca tcctgtctca atctcgcgca gcaggctgtt 6781 gaaaatccaa tttgaaccaa ttttgtatga gagagaagaa atggcaaaac tcagtgtgca 6841 tctaccacct tccggaagcc atgcatccca agtagctgtt gcagcaaaaa atgggatttc 6901 tagagttgag agtaatggac actcaaatgg acacccacag gttaagtctt gcgaagttcc 6961 atatagtatt gtgcatacgc tcccggaaag agtgaggttg cgagtacctc gtttgcttta 7021 tgatgcagac tatgcacagc gcctgcaagt gttgttagaa gctgacgccc ttgtgacaag 7081 tgtgcgtata aaacgtgcag cagcgtcact aacagtgact tataaatcca gcaaagttgc 7141 agatactaag atacgttcgc atctgggtta tttaattcaa gcagctagtg aggtcgtcgt 7201 tctcaaaccc tccaagccaa aagctgcatc agatgcgaat gaagaacaat cttggcctgg 7261 aatgcaactt tcagctttag ctacagcttt agctgtgttg ggtggaccgc taggatggtc 7321 tattccgcct gtgatggttg caggaactat agcccttgcc acattacctg ttatcaaacg 7381 
agcttgggag gggatcaggg acgagcgaaa actgaatatt gactttctgg attttatggc 7441 aattgctatt actacagtcc agcgtcagtt tctcacgcct gcgctcatgc tgagtttgat 7501 tgaaattggc gaaaatatac gcgatcgcac agctcgttct tctgctcaac aaactctgga 7561 tctattaagt tccctcggac aattcgtctg ggttgaacgc gatggtgaaa aggtgcaaat 7621 tcctattcaa gacgtgcagc gaggcgatac agtcattgtt taccccggcg aacaagtccc 7681 tgttgatggt actatcctac gaggtaaagc tcttctcgat gagcagaaac ttactggtga 7741 gtctatgcca gttttgaaga gaaagggaca aaccgtttat gcctcaactc tcatgcgcga 7801 gggacgaatt tatatttcgg cagaacgcgt aggtaatgat acttgtgccg gacagagcat 7861 tcggttaatg caagaagctc ccgtccatga tacccgcatg gaaaaccatg tcttgaaaat 7921 tgctcaaaaa gcagtcgtgc caacattgct acttggtgga gccgtgtttg ccgtaactcg 7981 taacccagca agagcagcca gcgtcttaac tctagacttt gctactggta ttcgtgtatc 8041 agtgccgaca acggttttgg cagcacttac ttatgcagca cggcgtggtg ttctgattcg 8101 tagcggacga gcactagaaa aactcgcaga agttgacaca gttgtgtttg ataaaacagg 8161 cacactaact aagggtgaag tggcagttgt tggtgtcgca agtctcaatg aagcaacatc 8221 aatcacacga gtgctagaac ttgctgcagc tgctgagcag cgtctgagtc acccagtagc 8281 agaggcgatt gtacgctatg ctcaagaaca aggagtagaa gccccttccc gtagcaaatg 8341 ggactatcaa cttggtttgg gtgttcgcgc agagattgac ggggaaactg tttacgtggg 8401 tagtgagcgc tatctgcgtc aagaaggcgt tgagatgaat ctcaatgggt accaaaagca 8461 gacaacttca gcaatttatg ttgctagcaa tggtcaactt caaggtataa tagagtatag 8521 tgacataccc cgcccagaaa gccgagaagt tatcacagcg ctgttaacag tcgaaggtgt 8581 ggaagtccac atgctgactg gggataataa acgaactgct agcgctgtag cttctcaatt 8641 gggaattcct ccaacgcaca cccacgcaga agcttttcct gagcagaaag caactgttgt 8701 ccgtgaactg cacgaacaag gtaagacagt tgcgtttgtg ggcgatggaa tcaatgattc 8761 gccagcttta gcctatgctg atgtttccgt ttccttcgcc aacggttctg acatcgctcg 8821 cgaaacagca gacgtagtac taatgcagaa tgacttgcat ggtttgttgg aggcgattga 8881 aattgcccgc aatgctaggc aattgattca ccaaaacaca ggtctgattg ctgttcctaa 8941 catagcagca ttagtaacgg cagttttgtt tggtcttaac cccctagcag cgacgatggt 9001 taacaatggc tcaacagtcg ttgcgggagt taacggtttg cgcccaattt tcaaaagccg 9061 caaacaaaaa 
actctaccat cagcaagatg agatagctgt cctaacgaca gtacactgaa 9121 caactttgta aaatttttca gatcacagga gcaaaataat catggcacct aaaatcactg 9181 attttgttga agacgctggc gcacctggaa ttatagccag tattggagca gttctgctag 9241 cacctgtcgt cattccagtt gtcgcaggta ttggtaaacc cattgccaaa tcactcatca 9301 agggtggact tgttcttttt gaaaaaagca agggagccgt tgcagaactt ggcgaaaatt 9361 gggaggacat ggtagctgaa gcaagagcag aacttgctga aggaagacag ctaccagcag 9421 tagatgttgc tggttctcct gttgacaata cgctcgataa tggtgcatag ttttcggttc 9481 atttcgtgtt ccttggctga gccagggaac acccaaggaa attcgtagtt cttaacagtt 9541 atcagttatc agttatcagt taccagttat caggtatgaa acggactcgt ccacctcttg 9601 tttactgttc cctgttccct gttccctgtt ccctgttcta cgaattacga attcctaaat 9661 taggagggta attacgtgtc aaaaaatggt tatattaatc tcaccgatat gccaaaagtt 9721 attgctgaaa aacctataaa aaaagctttt caaccaatat ctacgcggat tgtcagctct 9781 accccaggaa ggctgcgttt gagaattgca caacctcacc gtcaatctgg agaaatgcaa 9841 cggattgcta atgcgctgca agcaaatcca aatattaatc aggtgcggac taacatccaa 9901 aatggcagta tcatcataaa tcacgatggt gagcatggaa gtcttgacaa tgtttacgcg 9961 acattgcgcg atttaggtat tatttttggt gatgttgcat taggaaaatc tgacgcagca 10021 gcagaagtat caaatgcagt tgttgactta aataagcgag ttagacaagc gacaaataat 10081 gccgttgatt tgcgctttct ttttccttta ggattaggta tgttttctat tcggcagtta 10141 gtgactaggg ggttgcaatt agaaattatt ccttggtatg tgttggcttg gtacgctttt 10201 gatagtttta taaaactgca cgctataagt caagtacagt caagtaagga gtgatatgat 10261 gtccggctaa ttgcttataa atcacggatg accccacccc ggttccctcg ttcctatgct 10321 ctgcatggga atgcctcaaa ggaggctgcg cctcccaatg attattatat tgaggcggag 10381 cctcaagtta tgcattcctt ggctctgcca aggaacgagg aaagtgagta gagtcgtcac 10441 cctacgttgg attcgcgttc attactatct gactgcagca caactactag ctggtgctgg 10501 ctgcgatcgc tgtggacctg ttttggtggg gtcataacct accagcaaca cctcaccttt 10561 ttcgttatat atccatcctc gggctggtat tatctctttg acaggtggct tttgagatgg 10621 ctgcttttgt gttgcacttg ttgaacttac cgtactggca acaggcttaa ctaaatcaac 10681 ccgcacattg tcactactga ggattttctt tggatcagtt ggaattcccc cacgtcctac 10741 
gatggtgaaa ctgctaccaa aacctttgat acagggattt tgggcaatct gctgtgcggg 10801 gtcgatgaca gtttctgtca attcaaataa tccacgggtg gggtcaacat ctggagtgat 10861 aatattaaca gtaccactca agtttggatt gttttgagaa atagctgtga tgtcattcgt 10921 aggtaagttt cttgggtcta gttgagcggg atctgtggta ttcaaacgtt gttcaagttc 10981 ttgtcggctt agaggagtta tgccaaaaag accttgagtg ttaattgtca ctttaccacc 11041 gctaccgttg aatgcattgg cagtgatgtc actattttca ccagggaaag cgacgattaa 11101 tggagagtta atcgtgatgt taccgccatc tccgccagca cctgctagac ctgcactggt 11161 ggagatatta ctgttacggc gtaagagtaa gaagtcttgg actcctagaa tgatgtttcc 11221 accgttacct gatgtgctgg tagctgtaag acttccttga ttatctaggc tgatagagcg 11281 tgcgttgatt tctagttgac ctgcgggctt tgtgttttca ctactcacgg tgactgtacc 11341 gctatccctg acaattaact tgccagtatc gattctcaag cttcccccac tacctgtact 11401 tgttgaggtc gccccagcac tgattgtact tgttctcaac ctaccgttta gtgatgttcc 11461 aaaaacttct acagattcct tcgcacgaat ttgcaatgtc cctgcatcgc cttcaccttg 11521 agtaaaagtt gttactatcc ctccatctct aatccgcaag cgatcggttt caatcgttaa 11581 attgcctgcg ttacctttgg gctggtctac agtttgagga gctatttgaa caccagcaaa 11641 tatgcccgta ctgtagaagt tatatttagg tgtctcaaac acgtccacgt catcggcttt 11701 gatgaataaa ttgccagcat cgccttgacc gaaggtagca acttgcactt tgctgccatc 11761 actgacactt aaacgtcctg ttttaatggt caggttaccg ccgcgaccct caccttttaa 11821 gtctacctga gcaagtaagc cgccaggaaa ccccttttcg ttgccaggaa tctctccgct 11881 cagttctacg gaatcggtag cgagaactgt gagatttccc gcatctcctt tgccaaaagt 11941 tgaagctcct acttgggcgt ttctgacaac caactttttc gtatcgatgg ttaaatctcc 12001 gcccttacct atgcctcctg attctaggtt agcctgcaga gaagttgtag tctggttaag 12061 tttttccgtg cctaaaactt ctactaattc atcagcttta atcgtcagat ttcctgcatt 12121 tccttgaccg aaaatcaaag tgccgattcg tgcgccatca ctgactctta atttttgagt 12181 ttcaatggtt aagtttcctg agttacctac tgcctcagag tctacgtaat tgaaaacaaa 12241 gctatttgca actgttgttg tcccgctagc tttcagtgtg atatctccca cctgactacc 12301 aactgacccc aaaccttggt ttattcccgc atacaaggca cttcctcccg aaatatctaa 12361 attccgggcg ttgactgcaa tactaccacc tccaccagca cttacatcaa 
caatcgcacc 12421 attagtaagc gatacatcac cgagttgtac tcccacagga aaattcaagc tcccatcagt 12481 atttagtccc accgttcccg gtgtagataa tcctcctagc tcaactcgtc cccctggtgc 12541 aagcagatca ggaccatcta agctgaggtt accgcctacc aatgtaatgg tctttccttc 12601 aggtacctca agagtaggat attgagtcgg cgtggatttt gtggtgatat tttgcggatt 12661 atcccgaaat cccaagccaa tcggaacatt tatagttaac aacggtggtg cttgagggtc 12721 agttgcacta aactcaaacc cattgtcaaa cacaaaacta ttcgccgtac tccccaaaaa 12781 tgaaccacca atttgcaaac gcgcatttgg tccaaaaata atcccggctg gattgagtaa 12841 aaataagtta gctgcgccat tggctttaat caagccatca atcgttgaga tagaaccacc 12901 cgtgactcga ctgataatgt tttgtatatt ggctgcattg ttaaagctag ccgaaccacc 12961 tgtgggaata gaaaattctt tgaagctgtg aaaaaggttt cctcctcttg ttgtcccatc 13021 attgatgttg aaatttttgc catcaggagt gataactgtg gtggatacag ttccatcaga 13081 agtcacttgt tgggctgtga ctgtatttat agaagtcaaa atagaagtta aactccctat 13141 gatgcatagg ggtaaggtga atagaaagag cgaatgcgct ttacggatgc taggcgatgc 13201 tgaaaacgag acgctacccg aacgccaatc atgtttcata aatttaaaat ttcaaatggc 13261 tgtttctctt ctcaaaaaag attctaagtg aaaggctgtt tcttaagaaa tcaaagttaa 13321 ttaatttaca taattagcag gcgtgaggat gattgcaggt tgggcatcga atgactcaaa 13381 actccctcga aacctcaccc tgccctatcg ggcatccctc tccttatcaa ggagagggaa 13441 agattttggc gtagctaaaa gcgaggggag gttttggcaa gagccaaagc cggggtgggg 13501 ttccgacgtg aaaagcgcta taaaaaacct caacgtgagg ttgctctagc gtctctgaat 13561 ctttgataac gcggttgtct ctcttgggga gccgccatca gtaaaacttt ttccaacagc 13621 caaggagcaa tgcggttgca ccacacggct acatgacttt gccatcctac taagatttct 13681 ggtgaatctc tttgtagtcc ggttataagt gcctgagcca ctttctgggg agttatgggt 13741 accacccacc gaaaccactc taactctcgc accatatcag tgtcggttag ggatggtaac 13801 aaggctacaa ctcggatgtt gtgtttgctt agctcaccgc gtagggcttg agtgaacccc 13861 acaattgcaa acttagtggc tgaataggtt gccatagttg gtgcagctac tttgcccatc 13921 aaactagaaa cgttaacgat tgttcctgag ccttgtgcta ccatgcgtcg ggcaactaga 13981 cgagtcatgg tatacattcc gattaaattg acgtttatct ctacctgcac atttggcagt 14041 cgagacttta agaatggtgc ttgatgtgca 
actccagcac agttgacaag tagatcaatt 14101 ggtccatgat ctcgccaagc ttgggcgatc gcaatattca cttccaccac ttcagacaaa 14161 tccaaaggca ggataacagc ttccacacca agcatcttta tctcagaggc tacttcagct 14221 aaacaagcac tgtctcgtgc taccaacaac aagcgtttta ttccttgcct tgctaattca 14281 aaggcgatcg cccgcccaat accacgtgaa gccccagtaa caagagccgt ctttccttga 14341 atattcatca caaaaacctc cgcgcgttaa atcagtcaac agtgaacagt tatcaagatt 14401 tcaaacgaca acaagtactg ataactggta actgataact ggttgaacta aaccgttgca 14461 gtaagaaagc ttgtcttacc gcattctttg agaaaaactt tgggtatgtc tttagtcaga 14521 tttcagtaaa gtttgagttg gactggcaat acgtaaagtc aagcaaatta agcaacctct 14581 gaagtcgtcc ttagtgtagc caaacaagca acaaagatat ttcttagctc ggaaagcggg 14641 aaaattggtg taaagcagtc agagaaatat cttttggaca atactagaat cagtgaatca 14701 gtgaaaagcc aagattgacg tgtagcatgg atggcacctc ttttagatac aagtgtctac 14761 cagaaccaac tttatatgca tacgctagca gggggatcaa ctacctgcaa catttattaa 14821 caagaatgct taagttttac ttaactttac aaagctctca gaaaccacga aagagcaaga 14881 gtgctttatc cttggtttta gtaacttctg aatgaggaac cacaagacta tgaaaaagct 14941 tattaataaa ccggaagact ttgtacgcga aagtttacag ggtatggctg cggctcaccc 15001 ggatttgatt caagtaaatt atgagcctgc ttttgtctat cgagccgatg cacctataca 15061 aggtaaagta gcaattattt ctggtggtgg gagtggtcac gaaccaatgc acgctggatt 15121 tgtggggatg ggaatgttgg atgctgcttg tccgggagag gttttcactt ctcctacccc 15181 agaccaaatg ttgtcagcgg caaagagagt agatggtggt gctggtattc tttatatcgt 15241 taagaactac agcggcgata tcatgaactt tgagatggca acggagttag ctcgtagtga 15301 gggtatccgg gtattaagta ttttaataga tgatgatgtg gcagtgaagg acagcctata 15361 tactcaggga cgccgtggtg tgggcacaac agtgctagca gaaaaaattt gtggtgcagc 15421 ggcacagcag gggtatgatt tgcgtttcat ttcacatttg tgtcgttatg tcaatttaaa 15481 tggtcggagt atggggatgg cgctgacttc ctgcacggta cctgcgcgca tgacgcctac 15541 ttttgaatta ggcgatcgcg aaatcgaaat gggtatcggt atccacggtg aaccaggacg 15601 caaacgcatg aacctgaaat cagccgatga gataactgaa atgctggcgc tatcaattat 15661 cgaggatgca ccttacaccc ggacagtgcg cgagtgggat gaagaaaaag acgagtgggt 15721 ggatgtggaa 
ttgattgatc cggtttttca gcaaggtgat aaagtattag cttttgttaa 15781 cagtatgggg ggtactccca tttccgaact ttatatcatc taccgcaaac tagctgaaat 15841 ctgcgaaaag aagggactgc aaattgtgcg aaacttgatt ggtccttaca tcacttcttt 15901 ggatatgcaa ggttgttcga ttacactgtt gaagttggat gatgagctta cccggttatg 15961 ggatgcacct gtaaagacac caagtctgcg gtggggagtg tgatgtatgg tgacaaaaga 16021 gcagatttta cgatggttgc aaacatttgc aacgcagata gagcaaaata aagattattt 16081 gacagaatta gatgcggcga taggtgatgc tgatcatggg atcaatatgg agcgtggttt 16141 caaaaaggcg atcgctcagt taccgactgt tgcagataaa gatattggta gtattttgaa 16201 aacggtgagc atgactctca tttcctcaat cggcggtgca agtggtcctc tttatgggac 16261 tttctttcta cgagcaagta cagctgtggc tggaaaggaa gaactcactg ttgaggatat 16321 gcttggaatg ttcaaagcag gattagatgg tgtgcttgga cgtggtaaag cacaacttgg 16381 cgacaaaacg atgatagatg tcctctctcc cgcagtgagt gcttttcaac aagctgtgac 16441 tgaggggaag ggtacattgg aagcaatgca acgcgctgta gcagcagcag aacagggggt 16501 gaaagatacg acaccgatga ttgccaaaaa gggacgggct agttatttgg gagaacgtag 16561 tataggacat caagatccag gggcgacttc ttgttattgg atgctgaaga gtttgttaga 16621 aacacttgca agttctaatg atgaggcggt aggctagttc aacgcttgta atgatgaaat 16681 tcatgttcta aatgctgccg aaggtgacac cctgttgttt aagccacttg cggtaagcta 16741 cagagtagcg tagacgctgc atcggcttgt cgcaagacat cgcttggcaa atggacttct 16801 gcttgcaagt ccaaaggcag gcgttttggt cgctgcgatt ctacaacggc aatgtcttga 16861 ctgataatct cgtcttgcat agctagaaat gcttgatctc aaagagcaag gcattattgt 16921 tggttaagga gttatgggtt ccgtccccgt gtctataatt tcctcttgcg gtcgtaacct 16981 atctacagac cttccgtttg gatcttctct ctctcttttc ctttgctcgt attgaagagg 17041 tctctttcta ttcctttgca tcttttgaat agcaagatca caggcttgaa actcttcacg 17101 agttaactcc cttgtcaatc catatatttt attacactgc tcagttccgt aattatactg 17161 tttatcatca tggtctgctt ttgctccttg cacacccata atcatagata aagctgttcc 17221 ggctaataga acaatgcaac cttttttcac taatttagac atatacaaat cctctgaaaa 17281 tacaccgcaa tatagagttt gaaattttaa ttaaatagcc acactgtata tatttaggtg 17341 caagtcactg taacttatgc aattattcag tccatctaca ataaaacgtt gaaaaatact 
17401 tttatatcgc cctaaagacg tttaaatgct taagaaccag tataatctac cacaaaacct 17461 ctaacccgcc ttgtagcgcc ttctaaacat cgctaataga attacactgt gttgcgt // LOCUS NODE_1889_length_17484_cov_4.82489017484 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17484) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17484) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..17484
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(437..1654)
                     /locus_tag="DP116_16535"
     CDS             complement(437..1654)
                     /locus_tag="DP116_16535"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127807.1"
                     /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16535" /translation="MKPNLPNLPTSTPSLNHYVSTEQLQACNSDALVQLLCNEMQPQV KTTPTSVQAIAKRIAKEVERICNKSSRIQTSGQIKSWLLNLARHRSQKCLRYYQLGSK KGRVELHSQLGAMVYRHIITSSSELGFEARYNLIEDFLQAFYLEAIKAFRRENELPED YTPRTQLELAEYMAFTEQYAKRRINLPGGNQQLVILRAQSFARRLPQETTVDIEQAVE SAKTEEAESYQRHSAVQQVRSQMTAQSQFDPAEDSERDRVISELVKYLEAHGQSDCID YLSLKLQDLSAPEIDQILGLTTRQRDYLQQRFKYHVEKFAKQHQWQLVHQWLGAGLEQ KLGLSSQQWEIFVSQLSEQQQQILQLKTAKHSDQAIARAIKCTPKQLQKRWTQMLELA WAIRNGSTGVQVG" gene complement(2031..2693) /locus_tag="DP116_16540" CDS complement(2031..2693) /locus_tag="DP116_16540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655299.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="threonylcarbamoyl-AMP synthase" /protein_id="PRJNA477356:DP116_16540" /translation="MAKIFQVHPDNPQVRRIEEIQAELQRGAVMLYPTDTVYAIGCDL NAKSAVERVRQIKQLANDKPLTFLCPSLSNVATYAYVSDTAYRMMKSLIPGPYTFLLP ATKLVPRLVQSPKRKTTGIRVPNHTMCIALLSALANPIISTSAHLPPDDDIDDEYNGK EPQAYLSRIELFDRLDRLVDVIVDTGEEPNYEVSTILDMTGERPMIVRQGLGWEKVAA WV" gene 2789..4117 /locus_tag="DP116_16545" CDS 2789..4117 /locus_tag="DP116_16545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216121.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nickel pincer cofactor biosynthesis protein LarC" /protein_id="PRJNA477356:DP116_16545" /translation="MTKIAYLQCPTGISGDMCLGSVVSLGVPLEYLTEKLNRLGIEHE YQLRAELVHRNTQQATKVHVDLLDQHHHHHHEHNHHHGRHLPEIEQMIQKAGLPSRAE AWSLAVFQQLAVAEGAVHGIAPEKVHFHEVGAVDAIVDIVGTCLGLDWLGIESNHQGL PLLFCSPLPTGGGIVRAAHGQMAVPVPAVLKLWEMRGCPVYSNGIEREMVTPTGAAIA TTLAVDFGSPPPMTIKQIGLGAGSSHLPIPNILRLWLGEATNVTDKLSVGFTDNSSAT KSIASDTSPALETVSVLETQIDDLSPQAIGYVFEALFTAGALDVFTQFVGMKKSRPGI LLSVICHPENLHSCEAVLFRETTTLGIRRSTQQRATLQREIQQVETEYGVVRVKVAWT GQANEKAITNVQPEYEDCAELARKHNIPWREIQRLGLQNWYAQTATAISD" gene complement(4395..4880) /locus_tag="DP116_16550" /pseudo CDS complement(4395..4880) /locus_tag="DP116_16550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875276.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(4988..5530) /locus_tag="DP116_16555" CDS complement(4988..5530) /locus_tag="DP116_16555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875277.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16555" /translation="MNSLTSFIKKLRLRQIVTVFLAGLLLIVSTACGNAANTQGANPD NPAVQAGGANNPYKSGGDKYVNEKMSKSGHDQASSQLNSQLLIASGVNTEGKLYPGAE TPEGRAYKEAELPIKTQKNIGQPEPGGLNQRQSDVGERIQNRLETVGEAIQEASGFLK DKADEASNRPELQRNPAVNK" gene 6112..7032 /locus_tag="DP116_16560" CDS 6112..7032 /locus_tag="DP116_16560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UPF0104 family protein" /protein_id="PRJNA477356:DP116_16560" /translation="MIKKILRWLILGGTLFFLVKAFKDNWQEVASIHIDAAGWAILAI ATGVTLLAHTWAGWVWTWVLRELNQPVHSFEFIQVYLKTNLAKYIPGNIWHYYGRIIA AKNANVSTGAATLSVLLEPLLMAAAALIVVLLSSQFLTEKTTIIILIVQLLGLLGVLC VLHPKFLNRATHLLQRIKVKKSASNTVQADPWSIERYPLRPLLGELGFLGLRSAGFML TLLSLSPLNFSQIPLLLGAFSFAWLLGLVVPGAPGGLGVFEATAIALLQQHFPTAVVL GATGLYRLVSILAETAGAGLSWLDERLFQS" gene 8035..8277 /locus_tag="DP116_16565" CDS 8035..8277 /locus_tag="DP116_16565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865740.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase" /protein_id="PRJNA477356:DP116_16565" /translation="MVFSTENQEIMVVALLYLILAGAYLLVLPAAVLFYLNLRWYVAS SLERAFMYFLVFFFFPGLLLLSPFVNLRPRPRQIEV" gene 8302..8610 /locus_tag="DP116_16570" CDS 8302..8610 /locus_tag="DP116_16570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194987.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3007 domain-containing protein" /protein_id="PRJNA477356:DP116_16570" /translation="MRRIDAIGIGFGIFVAGGLAYVLLKQVGIDSSKAGIWSQVLLFV GLIGWLFTYAFRAVGKKMTYHKQREQYEEAYLQKRLEELTPEELAKIQAEIEQEQSQV " gene 8651..8875 /locus_tag="DP116_16575" CDS 8651..8875 /locus_tag="DP116_16575" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16575" /translation="MAPNHNYLPPLNLLNISTASSAGICVCGDSSGVLVSQRVGKAHR PCSEAKTVGSAVESPPSKQNICLKAAMTTT" gene 9155..9796 /locus_tag="DP116_16580" CDS 9155..9796 /locus_tag="DP116_16580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454698.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heterocyst differentiation related protein" /protein_id="PRJNA477356:DP116_16580" /translation="MSESMAFIGGVAVAGLAALVLLKGTGGNSLPNYTVGSQLPAVVA PSAMMPPTATYPGQPYPNPVPISPNNEEMRVQTERLKLENEGLKNENNGLKTQVQQLQ SQIQQVYNYQVQLNQQNQQQNAAQLQHQSENRWWSSPVVWAVGGMTLTVGGGIVVAGV LALFSPKDRPTRTVQVIHPYNGPTPPLAPVRRAEFLPPRTERRVEAQEYDDMY" gene 10694..11242 /locus_tag="DP116_16585" CDS 10694..11242 /locus_tag="DP116_16585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749860.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16585" /translation="MQTTPLISSVLTGALLLSSLLSFGGKIAQAQRAPRPVSLLTTKC VNSGFGSVNQQDLDVSIGRAVYTSRFYLGPGNRSASITCNIKPEKSSKPGFETLNLGF GMRDNDTKSPSVEVKIYLDGNQAQTRTVSPTQQASLTLDVNNASNVAIEAVCASPNQY CDRVYFFNAALERPIPPPPTKK" gene complement(11396..13405) /locus_tag="DP116_16590" CDS complement(11396..13405) /locus_tag="DP116_16590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_16590" /translation="MIGKLLDHRYRVIRILATGGFGETYIAQDTKRPGNPICVVKHLK PANSEAKMFDTAKRLFQSEAETLEQLGNHDQVPRLLAYFDENQEFYLVQEFIEGHPLG DELVPNHRWSESQVIELLVEVLDILKFVHGQGVIHRDIKPDNIIRRASDKKLVLVDFG AVKQLRGSAGYAGRSPIFTAVHHSATVAIGTPGYMPTEQGQGKPRPNSDMYALGIIAI QALTGVAPVDFQEDPNTGETLWQHLVPVSDALEAVLSKMVRYHFKDRFQSASEALQAL QSLSSSYTPREYTNTTSSHQPIKSSSALSPLSRQKTIAVAPANPVLQLAPATPKSLAR SSSRPDLLQFVIIGILVGGAAAVTPAVVKNVQGFASNFAINDTNSAENCLAVVQENSN IRSEPTSINDDSIIKAVNKDTKFEVTGRRTKRGWVEIKLDPTQTAWANSEIIKNNEQW VSCLRDKGTAVKTVDDSDLIAARPAPKPKTESVVDAATSSSPESEQSETSKSLSASKS TPATLDKGGSKVVEQAKQKYESGDLQGAIALLKTMTANPTAVKQTTEMISQWQQDWSK AEALFKDVDTALGDGQWDKVLAYKDHPEKLPNIQYWRNKVEPLFQQAADHLAKQELPQ LGNSSNQNKAKLEHQNSSEDYGLDTIDDFDNIETSEPQKTPGNHL" gene 13569..14054 /locus_tag="DP116_16595" CDS 13569..14054 /locus_tag="DP116_16595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865416.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16595" /translation="MPPKINNSVAWDQAELLMQPAFIRVIDNIRKLLDVSSWTGTYQD VLIWPTGTTDETKAIVTQFLQDLEAATPEQALEIREKLSRLPIPHPGYHLSLQRQEQT VNIDLWELCYRVCFSNYISGDDTADIDTDLIDENGDVDWQNLDNKAKELVEQVFANLP E" gene 14303..14722 /locus_tag="DP116_16600" CDS 14303..14722 /locus_tag="DP116_16600" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16600" /translation="MEPVTLSAAAIIGFVFTKVSETLIGKATEAVVIPKINELRQKIV SKLEKINEAKVEIEKHDKGSEPNLEVLESFFKVAMLTDKQFKEEVSHLANEINQELEA EGKGSNVMNVYGGKAYQQNHNKGEFYNAETITIHKHP" gene 15205..17151 /locus_tag="DP116_16605" CDS 15205..17151 /locus_tag="DP116_16605" /inference="COORDINATES: protein motif:HMM:PF00931.20,HMM:PF13646.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16605" /translation="MNVSGGKEFVQTGNQGYMYNADNMEVHHHHAIDSVKPEKIDPID WRKVCDAMLEHEQESQRERRKVTEMEYELNVHVPLGLVERKQQSRRGVDENCQLNEVY GVEKEAIAQIYQHDEFLQQVIEQTPTGKNKHVAIIGEPGAGKTTLLGAIASFIKSQTE NFPIFISLASLEKRTLEEYLLNRWLPEAMRLSHPEIVVTSQIEPQIQQQLIKRFQQGG VWLLLDGVDEMGFDSPVRALDTIQKQLTSWLTQARVVLTCRMNVWDASVNNPLSGFDT YRTQDFEPEQIDEFIQNWFADAKNPQRGEQLQAKLKETGRERIRELVKNPLRLALLCQ IFYLDKQAELPKTKAGLYDRFRRYFYEWKHKEVKNPEELHFYLGKLAVAGINSPARFR LRESLAKEMDEKLFKLACDLGWLNLVDRDSQTDEAVYAFFHPTFQEYFAACAIQDWDF FLPREHKNKPVKDKYNPDKYKPYRIFELQWKEVILLWLGREDEKLSQQKEQFIEALIK FNDGCNNFFWYRAYFLAAAGIAEFKDYPQTDAILEEIVKWTISHDFAKKEAIAAIQQT DRTKAIKALVELIQNSQDEETRWGAAYSLGEIGKDNPIAIKALVELIQNSQDEETRRG AAYSLWPCAENMPYPKFYQLWHEG" BASE COUNT 4921 a 3617 c 3754 g 5192 t ORIGIN 1 ccttacggta atcttcgggc gggaaaggag tagaaccgac tagctcatac ccatgcccta 61 acaacgttcc ttctacagtt ctttccaaga cgttgccaga ctgatttgct ctgccacctt 121 gagtcattgc tctccaaatc ttgaaaagga aaagttgact ataaattata gcttttattt 181 taagaataat ttacttttgg ctgcaacatt tgagaattat tgaactctgc aactcttcct 241 ttgtgcccca ttaccctcct ctattcccca atagatggag agaagagaag ggtgacgtgg 301 gttattggga aagagaatag cagatttttg aattagaaga aggaatagat tatctatgtt 361 taacattaaa caaataggat aaactctaga aaattgaata acactacctt cttggtggat 421 acaattcact acaactttag cctacctgta ctcctgtact accgttgcgg atagcccatg 481 caagttctaa catttgagtc cagcgctttt gtagctgttt gggggtacat ttgattgctc 541 tggcgatcgc ttgatcactg tgttttgctg 
ttttgagctg caaaatttgc tgttgttgtt 601 cagaaagttg gctcacaaat atttcccact gttgggagga taaacccaat ttttgttcta 661 gccctgcgcc taaccattga tgtaccaact gccactggtg ttgcttggca aacttttcta 721 catgatactt aaaacgctgt tgtagataat cgcgctggcg ggtggttaaa ccgagaattt 781 ggtcaatttc tggtgctgag agatcctgta gttttaagga tagataatct atgcagtcag 841 attgaccgtg agcttctaaa tattttacta attctgagat gacgcgatcg cgctcagaat 901 cttcagcagg atcaaactga gattgtgctg tcatctgaga gcgaacttgt tgaacggcgg 961 agtgacgctg ataagattct gcctcttcgg ttttagcaga ctcaacagct tgttcaatat 1021 ctacagtggt ttcttgaggt aaacgacgag caaaactttg agcacgtaat ataaccaact 1081 gttgattacc tccaggcaga ttgatgcgac gtttggcata ttgttctgta aacgccatat 1141 attctgcgag ttctagttga gtacgcgggg tataatcctc aggaagttcg ttttcgcgcc 1201 ggaaagcttt gatagcttcc agataaaatg cttgtaggaa atcttcaatc aagttgtaac 1261 gagcttcaaa ccctaactca gaacttgatg tgataatgtg gcggtaaacc attgccccca 1321 attggctatg taattctacg cgaccttttt ttgaacccaa ttgatagtaa cgcaggcatt 1381 tttgcgaacg atgccttgcc aaatttaaca gccaagactt aatttgtcct gatgtttgga 1441 tgcgggaact tttattgcag atgcgttcca cttcttttgc tatgcgcttt gctatggctt 1501 gcaccgacgt aggtgttgtt ttcacctgag gttgcatttc gttgcacagc agctgcacga 1561 gagcatcact attacaagct tgcaattgtt ccgtcgaaac gtaatggttt aaagaaggtg 1621 tggatgtggg taggttggga agatttggtt tcatgacttt tctagaagtg gtattgcacc 1681 gtacgacaat gctcagactg cactaccaag gcacgcccca gtaggcaaat ctctttcatg 1741 attttcatct atgaagagtc gccgtcagca ttgctcacac aatatgactg taaaacaacc 1801 gaaaaagtta cagtgttagc ggtgtgtcgg ttttttcaca agccactcaa aagcgacaga 1861 tacatggata tccaccatga tccccgagta tcccaaaccc ttactgctcc aagatttttc 1921 acactttcaa tcatacctag ccccaatgtt gtaggtatga caatctaaaa actggagtca 1981 tttatactct gttttctgga attgacgact gggagcgtga agtttaatat ttatacccat 2041 gcagcgactt tttcccagcc taaaccctgc cgcacaatca ttggtctttc tcctgtcata 2101 tccaaaatgg tagaaacttc atagttaggt tcctccccag tgtcaacaat cacatctacc 2161 aatctgtcca aacgatcaaa taattctatt cgggataagt aagcttgtgg ttctttgcca 2221 ttatactcat catctatgtc atcatccggt ggtaaatgtg 
ctgaagtcga aataattggg 2281 tttgccagcg ctgacaataa agctatacac atagtatgat ttggcactct aattccagtt 2341 gttttccgct tgggactttg taccagtcgt ggcactaact tggttgcagg taacaaaaat 2401 gtatacggtc ctgggattag gcttttcatc atccgataag cagtatcgct tacataagca 2461 taagtcgcta cattagagag cgagggacat aaaaatgtca gtggtttgtc atttgccaac 2521 tgtttgattt gccgtactcg ctccactgct gatttagcat ttaaatcaca accaatagca 2581 taaactgtat ccgtgggata aagcatcact gcgccacgtt gcagttctgc ttgtatttcc 2641 tctatacgcc ggacttgggg attatcagga tgaacttgga aaatttttgc catagaaatg 2701 aaaaagaaag tcagtagttg gtagttagtg gtatttttag cacttaacaa ctcgcaacta 2761 acaaacaatt aataattaat gactcattat gacgaaaatc gcttatcttc aatgtccgac 2821 gggtatttcc ggtgatatgt gtctgggatc tgtggttagt ctgggtgttc ccttagagta 2881 tttaactgaa aaactcaatc ggttagggat tgagcatgag tatcaattga gagcagaact 2941 tgttcaccgt aacactcaac aggctaccaa agttcatgtg gatttactag accaacatca 3001 tcaccaccac catgaacaca atcaccatca cggacgccac ttgccagaaa ttgagcagat 3061 gattcaaaaa gctgggctac catcacgagc agaagcttgg agtttggcag tattccaaca 3121 gctagcagtc gcagaagggg cagtacacgg tattgcgcca gaaaaagttc attttcatga 3181 agtgggtgct gttgatgcca ttgtcgatat tgttggcact tgtttgggtt tagattggtt 3241 gggtatcgag agcaatcatc aaggattacc tttattgttc tgttcaccgc tacctactgg 3301 tgggggaata gtgcgggcgg cgcacggtca gatggctgta ccagtaccag cagttttgaa 3361 gttatgggaa atgcgcgggt gtccagtcta tagtaacggt atcgaacgag aaatggtaac 3421 accaaccgga gctgccattg caacaactct tgccgtagac tttggttctc cacccccaat 3481 gaccatcaaa caaataggat tgggtgctgg ttcaagtcat ctacctattc cgaatattct 3541 acgcctgtgg ctgggtgaag caacgaatgt cacagataaa ttaagtgtgg gtttcacaga 3601 taattcatcc gctacaaaat caattgccag tgatactagt ccagctttgg aaactgtctc 3661 agttttagaa actcaaattg atgacttgag tccacaagca ataggttatg tgtttgaggc 3721 attatttacc gctggtgctt tagatgtctt cacccagttt gtaggtatga aaaaatctcg 3781 tccaggaatt ttgctgagtg tgatttgtca tccagaaaat ctacacagct gtgaagccgt 3841 tttatttcgc gaaaccacca ctttgggaat tcgtcgttca actcaacaac gcgccactct 3901 ccaacgagaa attcaacaag tagaaactga atatggtgtt gtgcgcgtca 
aagtcgcatg 3961 gacaggacaa gcaaacgaaa aagcgataac taacgtccaa ccagaatacg aagactgcgc 4021 agaacttgcg cgaaaacaca atatcccttg gcgagaaatt cagcggctag ggctacagaa 4081 ttggtacgcg caaacagcga cagctattag cgactaatcc ctttggcgga ctaaataaca 4141 aaaaagcccg cctaggcggg cttagttcgt gaagccccag actcccagtc tgtgggcaca 4201 taacagcacg ctattgctag ttaattaaca gcatctttga tggcatcacc agccttttcc 4261 aacaagttct cagtgttttc agccgcctct tgacctttct cagcaacagt ggtcttagca 4321 tctgttaagt ttgtgctcac tgaattgcca gcatcttcag cagaacgctg aacgttattg 4381 actgcaccct gagcagcatc gctggtgttt gctttgatat tttcaattcc ttgcttggtt 4441 ccttgaacca aatcctcagc agtaccttga gctttttctt taatattttc agcaccttcc 4501 tgaagatttc tgcctacttc tcctctatct tcaaccacgc ggcgaatatt ttctgctggg 4561 ttactcgaac tgttctggat gtttcgttcc gcattttctt tgagagcttc agcctgagct 4621 tttactttag ctttgttggt tctagggtca acatcactaa agttattcat cccaccttca 4681 ttgggagaaa gcacattagt tccttttgga acgtatgttt cagaattagg aggtgcagat 4741 tgctctccta cagtttgacg aggcgttgtt gcagctacag aactacaagc ttgtgtaaaa 4801 aacaggatca ttcctgccgc aacaactgtt agaagtttga ttgggcgaat gtttttcagc 4861 aaagcaataa cttttttcac agtgcgactc cttttgtttg tatgacaaag tcaaacatta 4921 accaagaact aactcactac ttttatatct tggatttact ccaaatttgc aatcccaaac 4981 acggtttcta tttgtttact gctggattgc gttgcaattc aggtctatta ctagcttcat 5041 ctgccttgtc tttcaaaaag cctgaagctt cctgaattgc ttccccaaca gtttcaagtc 5101 tgttttgaat tcgctcgcct acatcagatt gacgctgatt taaaccacct ggttcaggct 5161 gccctatgtt tttctgagtt ttgatgggca attctgcttc cttgtaagct ctcccttctg 5221 gtgtttcagc acctgggtaa agtttccctt ctgtgttgac accagaggca attagtaatt 5281 gtgaattaag ctgtgagcta gcttggtcgt gtccagactt agacatcttc tcgttgacat 5341 atttatcccc accactttta tagggattat ttgcaccacc agcttgcaca gcaggattat 5401 ctggatttgc tccttgagta tttgctgcat taccacaagc cgtgcttact atcaacaaca 5461 gcccagctaa aaagactgtc acaatctgac gcagccgcag tttttttata aaagaagtca 5521 aactgttcac aaattcctcc tgtaaactac aaaaaactac aaacaagaac aatatcttct 5581 aaatatctga ccaacaattt ctaacactac aaaggcaaga acaaaggatg cctctagcaa 
5641 aggtcacatt ttgatttata cttatttcct cttactctct aactttcgag agatataagt 5701 tagtataatt agcttttcat gtgcttacgt taattcaggt tagttgaagt cgtgtatgaa 5761 tatttagtta tcaatctatg atcaaaaata tcaactagta aaaaatatta agcctgaaat 5821 tttaagagtt tctagtatat ttcttgaaac taatttgtcg aataaaaaat accggctgaa 5881 tcatacgctc gtgtagtatt tcagatattt acataaattc acacaagaga gagaaactca 5941 tcagtaaata taaatagttt cttgatttca tgatattttt tggcaaaatc gctacagaaa 6001 cttgactaat ctattaacaa tttttgcggg tcatcttggg atatatggct gatctaccat 6061 atgatttgat gtagtcagtc agtaggcaaa tctgtaaatt aaattaccaa aatgatcaag 6121 aaaattttac gctggctaat tttaggtgga acgttatttt ttctggtaaa agcttttaag 6181 gataattggc aagaagtcgc tagtatccat attgatgcag cgggatgggc aattttggcg 6241 atcgccacag gtgtcacatt actcgcacac acttgggcag gttgggtatg gacttgggtt 6301 cttcgagaat taaatcaacc tgtccactct ttcgagttca ttcaagttta cctcaaaaca 6361 aatctcgcta aatatatacc aggtaatatc tggcattact acggaagaat tatagcagca 6421 aaaaatgcca atgtttctac tggtgcagca accctgagtg tattgctaga acctctactg 6481 atggcagcag ctgctttaat tgttgtttta ctcagtagcc aatttctaac tgaaaaaact 6541 accatcatta tactcatcgt acaactacta ggtttattag gagtgctttg tgttttacat 6601 cccaagtttt tgaacagagc gactcaccta ttgcagcgta taaaggtaaa aaagtctgct 6661 tccaacaccg tacaagcaga tccttggagt atagaacgct atcccctacg acctttactc 6721 ggggaattgg gctttttggg actacgtagt gcagggttta tgttaacttt gcttagcttg 6781 agtcctttaa actttagtca aattccttta ttacttggtg cttttagttt cgcgtggtta 6841 ctggggttag tggttccagg agcacctggt gggttgggtg tgtttgaagc tacagcgatc 6901 gcactcttgc agcaacattt tccaactgcg gtagtcctgg gcgcaactgg tctgtatcgt 6961 ttggtaagca ttcttgctga aactgctggt gctggcttat cctggctcga cgaacgcctt 7021 ttccagtcct gagttttcag gagccagcgc cattttcttc ttgaattctg aatttcatac 7081 ctactccctc aggagctagg aactgtgcgt atcatgccct actgacatat aaggcatatt 7141 ctgttcgcgt agcctacccg tagggcatat tttgaattct gaattctgaa ttttgaattt 7201 tgaatttata aatgggggtg gagggacttg aacccacacg accgtttagg gtcaacggat 7261 tttcattctc ccgcagcttt cactactgcc tagtaggtta aataacccaa taaggctttg 7321 
agaattggac tctctcttta ccctcgactt tacgttaggg tagctcccgt cgagtctctg 7381 caccttccgc attgatcatt cgtcatttgt cgttagtcat ttgctagtaa caattaacta 7441 aggacgaagg acaaaagaca aacttggctt ggctcaggat tgccatatct atgatcagat 7501 ttaggtttcc ctgaatttga gagcagtcac ttgctggatt tctccatcaa ggctcagtta 7561 tctaagtccg tagcgtctac cattccgcca caccccctga catgtttggg agtcagttgt 7621 ggaggtaaaa aataacctcc tcatctattc cagcattaaa tgcgttattt gtccaatttt 7681 gcacttttac gctggtaaat ttgagattct tgagtcgaaa ttgaggtttt aggttttcct 7741 aacttgcctc ttacttcatt gccgcacctt ttgggtttgc tagtcgctga cgccggagtt 7801 atcctcaaag cagcttgctc acccttacag atgcttgctc aagcatttct cgtgtgtgtt 7861 gtacagcact ggttgtttac tatagcataa tttttagaaa atagatcaaa aaataaaagc 7921 aaaggcagaa ataattttct ctatgtttat tatttaatct tcgttcctct attagggcaa 7981 caatatttcc aacagccaag gcattcagcc tattatggaa aacagatgca ccacatggtc 8041 ttttctactg agaaccaaga aattatggtt gtagcgctac tgtatctgat tttggctgga 8101 gcttatcttt tggtcttacc agccgctgtc ctgttttact taaacttacg ctggtatgtg 8161 gctagctctc tagaacgtgc ctttatgtac tttttggtct tcttcttctt tccgggtttg 8221 ttgctcttgt cgccgtttgt aaacttgcga ccccgaccgc gacaaattga agtttaacga 8281 aattggtagg tcataactct catgcgacgc attgacgcta tcggaattgg ctttggcatt 8341 tttgttgcag gtggcttggc gtatgtctta ctgaaacagg taggcataga tagctcgaaa 8401 gctggtattt ggagccaagt cttattattt gttggtttga ttggctggtt gtttacctat 8461 gctttccgtg cggtgggaaa aaaaatgacc taccacaaac aacgggaaca atatgaagaa 8521 gcttatttgc agaagcgctt ggaagaactc actcccgaag aactcgcaaa aattcaagcc 8581 gagatagaac aagaacaatc ccaagtgtaa atttgttctt tgtcctttat taggagccac 8641 tgcgccggta gtggctccta atcacaacta tttgccacct ttgaaccttt taaacatcag 8701 taccgcaagt tctgcaggaa tttgcgtctg tggagattcc tctggcgtat tggtgagcca 8761 acgcgttggc aaagcacacc gcccttgcag cgaggcgaag acagttggta gtgccgtcga 8821 atctccccca agtaaacaaa atatttgtct caaagcagcg atgacgacaa cttgatatat 8881 gcttaataag ttaaacctga ttaactcaag cgaggtagat taaaaaagtt taacaagcga 8941 gttttgttac tttccttggt agacaatttc atagatatcg tcttgaatat ataaagcaga 9001 tatcgagtgg 
tgggatggta aaaaacgatt ctactccatg gctgataatt tgggtatgta 9061 tgatgtaacc agtagggcaa aagttttatg atgagtgtga gagagtgtca gatattaatc 9121 attgtagttt cgtcgcagtc taggggaaaa ggcgatgagt gagagcatgg catttatcgg 9181 cggagtcgct gtcgctggac ttgcggctct cgtgttacta aaaggtacag gaggaaactc 9241 tttacctaac tacacagttg gctcacaact accagctgtg gtagcacctt cagcaatgat 9301 gccgccaact gcgacttatc ctgggcagcc atatcctaat ccagtgccaa tcagtccaaa 9361 caacgaagaa atgcgcgtac agacggaacg gctgaagttg gaaaatgaag gactcaagaa 9421 tgaaaacaat ggtctaaaaa cccaagtcca acaactccag tcccaaatcc aacaagttta 9481 taactaccaa gtccaattaa atcagcaaaa ccaacaacaa aatgcagcgc agttacagca 9541 ccagtctgaa aatcgttggt ggtcttctcc ggttgtttgg gcagtaggag gtatgactct 9601 cacagtaggt ggtggtattg tggtcgctgg tgtcttggct ttgttctcac caaaagaccg 9661 tccaactcgt accgtacaag tgattcatcc ttataacggt cccactccac cgcttgctcc 9721 tgtacgtcgc gctgagttcc tccctcctcg taccgaaaga cgagttgaag cacaagaata 9781 cgatgatatg tattaaaagt ttcagtgggt aacagttaac actgagtcat aaaccctata 9841 ctcattaatc ataggctgtt agattagtct accaatttgt agtgctcaga tcgaaaacta 9901 tttttaccgc agcgttcttg ctttggagtt atcaaggtga catccgtcgt gagaatgctc 9961 tggatttgga agacaaataa ggccgcttaa ttttaggaat acgacgtctt tatgtttact 10021 atcgagcata gtcagtcgta ttcatacaaa acaaaaaggt caagactgtt gtataaccca 10081 cgttccgcac cttgaactgt cacacagcat aattatttag aataaatcat tttcagagtc 10141 attaattatg cctttatact gagatgaatg ttttttgaaa tacatttaat cagtacgttg 10201 agtattttag atagcatcct tgcatcttta aaaacagtaa aaacctttaa cagcccgttg 10261 agggtgcgga agatgggata tcgcactact gaagagaaag ccaactcaat ataaaaattt 10321 tcagatgttc tatatataaa aaaattgaaa gttaattgcg gcgttgcata tttgcgggat 10381 aattttattt tgtttgcgcc cttgaaaact tatttcttcg cgagggacag ttcgtaccct 10441 tgggaaaacc gtttgggtat ctcctgcaca agacgctgcg cttacactag tcgcctagta 10501 ggtcgggaaa caaggactgc atagcactga atttactgca cgcgcttacg agagccaggg 10561 cgcactgact ctgtatgggg gcgttgcctt cttttaggga ttcagacttc atcccgtctt 10621 tgtacgccta attagctatt attattttat tttgttttcg tcttttgcct gaaaactaga 10681 ttctcaaatc 
ccaatgcaaa caactccgct tatcagttcg gtgctcactg gtgccttact 10741 attatcctct ttattatcct ttggaggaaa aatagctcag gcacaacgag cgccgcgtcc 10801 tgtctctctg ctgacgacta aatgtgttaa cagtggattt ggcagtgtta atcagcagga 10861 tcttgatgtt tctataggca gagcagttta tactagcagg ttttatctag gacctggcaa 10921 ccgctctgcc tcaataactt gtaacattaa gccagaaaaa agttctaaac ctgggtttga 10981 aactctcaat ctgggttttg gaatgcgtga caatgataca aaaagtccaa gtgttgaggt 11041 taagatttac ttagatggca accaagcaca aacacgtact gttagtccga cacagcaggc 11101 ttcattgaca cttgatgtta ataatgccag caacgttgcc atagaggctg tttgtgctag 11161 tccaaaccaa tattgtgacc gagtttactt cttcaacgct gctctagagc ggccaattcc 11221 tcccccacca acgaaaaaat agcttggcaa ccggcaaggg atctacagat agattgagta 11281 ttaattgcta atctttagca gttagtagtt agttattagt tttttgctaa caactaacta 11341 ttaactgcta accaaagact atcagtaacg aagtgaagtt tattgatagc tttgattaca 11401 aatgatttcc cggagttttc tgtggttctg aagtttcgat attatcgaaa tcatctatcg 11461 tatcaagtcc ataatcctca ctggaatttt gatgttccag tttggctttg ttttggttgc 11521 tagaattgcc tagttgagga agctcttgtt ttgctaaatg gtcggcagct tgttgaaata 11581 atggctctac tttgtttcgc cagtactgaa tattgggtag tttttctgga tggtctttat 11641 aagctaatac cttatcccat tgtccatcac caagggctgt gtcaacgtct ttgaataaag 11701 cttccgcttt agaccaatct tgctgccatt gggatatcat ttctgttgtc tgtttgacag 11761 cagtaggatt tgcagtcatt gtttttaaca gggcgatcgc accctgtaaa tctcccgatt 11821 cgtacttctg ttttgcttgt tccacaacct ttgaaccacc tttatccaaa gtggcgggcg 11881 ttgatttgct tgctgataaa ctttttgatg tttctgattg ttccgactcg ggtgatgatg 11941 acgttgctgc atcaactaca gattctgttt ttggcttggg tgcagggcga gcagcaataa 12001 ggtcactatc atccactgtt ttgactgcag tccccttgtc tcgcaggcaa gaaacccatt 12061 gttcgttgtt tttgataatt tctgagtttg cccaagctgt ttgtgtaggg tcaagtttaa 12121 tctctaccca accgcgtttt gttcgtctgc ctgttacctc aaatttagta tctttattaa 12181 cagctttgat aatagaatca tcattaatag aagttggctc agaacggata ttagaatttt 12241 cctgaacaac agctaaacag ttttctgctg agttcgtgtc attaatagca aaattagaag 12301 caaaaccttg aacattttta accacagctg gggtcacagc agcagcacca ccaactaaaa 
12361 ttcctataat tacaaactgt aataaatctg gtctactaga acttctggcg agagatttgg 12421 gagttgctgg tgctagttgg agaactggat ttgccggcgc aacggcgatg gttttttgac 12481 gagacagtgg agataaggct gatgaggact taattggttg gtggctagaa gtcgtatttg 12541 tatattccct gggcgtgtag cttgaggaaa gtgactgtaa tgcttgtagt gcttctgatg 12601 cgctctggaa acggtctttg aaatgataac gcaccatctt gcttaacacg gcttccaagg 12661 cgtcgctgac aggaactaaa tgctgccaga gggtttctcc agtgttagga tcttcttgga 12721 aatctactgg tgcgactcct gtgagcgctt gaatagcgat gatgcccaga gcatacatat 12781 cactgttggg acggggtttg ccttgacctt gttccgtggg catatagcca ggagtcccaa 12841 tggctacagt tgcagagtga tgaactgcgg taaatattgg cgatcgccca gcatatcccg 12901 cagatcctcg caattgtttg acagccccaa aatctacgag aacgagtttt ttatctgagg 12961 cgcgacggat aatgttatct ggcttgatat cgcgatgaat gacgccttgt ccgtggacaa 13021 atttcaggat gtccagaact tccaccaaca attcaataac ttggctttca ctccaccggt 13081 gattgggaac caattcgtca cccagaggat gcccttctat aaactcttgt actaaataaa 13141 attcttgatt ttcgtcgaaa taagccaaaa gccgaggaac ttggtcatga ttgcccaatt 13201 gttctaaggt ttcagcttca ctctgaaaca gccgtttggc ggtgtcaaac atcttagctt 13261 ccgaattggc aggcttaagg tgcttgacaa cgcaaatcgg gttccctggt cgttttgtat 13321 cttgggcgat gtaagtttca ccaaatcctc ctgtagcaag gattctaata actcggtaac 13381 gatgatctag tagcttgcct atcatattca ctccccagcg ataacaccca atttaagaaa 13441 gggtaataaa tatctccaat aatttctcaa tgaaatccgc tatcttgaga aaagatttaa 13501 atgtaaaaca cagtttttta aaaaatacac attgttaatt tttatttcaa tagcccttgt 13561 caattccaat gccacctaaa attaataact cagtagcgtg ggatcaggcg gaactgctca 13621 tgcaacctgc tttcattcgc gttatcgata atattcgcaa gctgcttgat gtatcttcct 13681 ggacgggaac ttatcaagat gtcctgattt ggcctactgg cactactgat gagacgaaag 13741 caatagtgac ccagttcctg caagacttgg aagctgcaac accggagcaa gctttagaaa 13801 tcagagaaaa actctcccgt ctgcccatac cccatccagg atatcatttg tctctacagc 13861 gtcaagagca aactgtcaat attgatttat gggaattgtg ttatcgcgtg tgttttagta 13921 actacatttc cggggatgat acagctgaca ttgatactga tttaatagat gaaaatggtg 13981 atgtagactg gcagaacttg gataataaag cgaaggaatt 
agttgaacag gtgtttgcaa 14041 atttaccgga gtaactttgc taaaacagtt ctgaggactt tctttgccat ctcaaagaga 14101 agacggtaaa aaatgtcctc aggtttccaa gatctgaatt tacaaaaaaa ggtgtagggt 14161 gggcattgca atgcccaccc taatgatgcc ccgtgactag caggctaaga gaagaaggga 14221 aagaaaatta agcaaagctt ttgcggaatt acactggtta tcattgaagt cagatgtgtc 14281 taaattattt agaggcatat ttatggaacc tgtcaccttg tcagcggctg caattatcgg 14341 tttcgttttc acaaaagttt cagaaaccct gattgggaaa gcaactgaag cggtagtcat 14401 tcctaaaatt aatgaactac gccagaaaat cgtttctaaa ttagaaaaga ttaacgaggc 14461 gaaagtcgag atagagaaac atgataaagg ttctgaacca aatttggaag tacttgagag 14521 tttttttaag gttgcgatgc taacagataa gcaatttaaa gaagaggttt cacatctcgc 14581 caatgaaatc aatcaagaac ttgaagctga gggtaaaggt tccaatgtta tgaacgttta 14641 tggtggcaaa gcttatcagc aaaatcacaa caaaggagaa ttttataacg ctgaaacaat 14701 aaccattcac aagcatcctt aacagtcagt ttaggtttgt gggaagccaa agaatcagct 14761 tccctatttt gcaaaaacat gtttgtcatt gcgaacgaag cgacagcgta gttaagcaat 14821 cgcaagatgt gagcctctca cgggcgcgat taacactccc gcaaaatatg aaacaagcac 14881 atttttatga tgtttctcaa agccttactg tataacggtt ataaaaggtg caagttaaga 14941 gattgcttcg ctccactacg ttgcgctcgc aatgacaatt catcacctgg atttaatata 15001 acttacatat ttgggatgct ccgttctcgt ccgcctagga cttgaagtcc caggctaata 15061 gaggaagtcc atttttatgg actgaaaaaa ctagaactga tttttgagta acttgaaaca 15121 ccatttcata aaagccaagc gtggtttcct cgtcatcgtg caagctctca gttataaaaa 15181 ctcacttgaa taggaatata aaatatgaat gtttctggtg gtaaagagtt cgttcaaaca 15241 ggtaatcaag gttatatgta taatgccgac aacatggagg tgcatcatca ccacgccatc 15301 gattcggtaa agcctgaaaa aatagatccc attgactggc gcaaagtttg cgatgcgatg 15361 ctggaacatg agcaagaatc ccagagggaa agacggaaag tcactgagat ggaatatgag 15421 ctgaatgttc atgtaccact gggattagtg gaacgcaaac aacagtctcg ccggggtgta 15481 gatgaaaatt gtcagctaaa tgaggtgtac ggggtagaga aagaagcgat cgcccaaatt 15541 tatcaacatg acgagtttct ccaacaggtg attgagcaaa ctcccacagg aaaaaacaag 15601 cacgttgcga ttattggcga accaggagca ggaaaaacga ctttgttggg ggcgatagca 15661 tctttcatca agtcacaaac 
tgaaaacttc cccatcttta tttctctggc tagtctggaa 15721 aaaaggacgt tagaagaata tctgctcaac agatggctcc cagaagcgat gagattatct 15781 catcctgaaa ttgttgtcac ctcgcaaatt gagccgcaga ttcagcaaca gctgatcaaa 15841 cggttccagc agggtggggt gtggctgctg ttggatggtg tggatgagat gggttttgat 15901 tcacctgttc gggcgttaga tacaattcaa aaacaactca cttcctggct gactcaggcg 15961 cgggtggtgc tcacttgtcg gatgaatgtc tgggatgcta gcgttaacaa tccgctgagt 16021 gggtttgaca cgtatcgcac tcaggatttt gagccagagc aaattgacga atttattcag 16081 aactggtttg ctgatgccaa aaatccacag cgaggcgaac aactacaagc gaaattaaaa 16141 gaaaccggaa gagaacgcat ccgtgagtta gtcaaaaatc cgctgaggtt ggcgctattg 16201 tgtcagattt tttatctcga taagcaagca gaattgccaa aaaccaaagc aggattatac 16261 gacaggttcc gccgctattt ttatgagtgg aaacacaagg aagtcaagaa tccggaggaa 16321 ttgcactttt atctggggaa attagccgta gcaggaatca acagcccagc aagatttcgc 16381 ttgcgggaga gtcttgcaaa agaaatggac gaaaagctgt tcaaattagc ctgtgatttg 16441 ggttggctaa atttggtaga tcgagattcc caaactgacg aagccgttta tgctttcttc 16501 catcccacgt ttcaagaata ttttgcagcg tgtgcaattc aggattggga ctttttctta 16561 ccccgcgaac acaaaaataa acctgtaaaa gataaatata atccagacaa gtacaagcct 16621 taccgcattt ttgaactgca gtggaaagag gtgattttgc tgtggttggg gcgagaggat 16681 gagaagctaa gccagcagaa agagcagttt atcgaagcgt tgataaagtt taacgatggg 16741 tgcaacaatt ttttttggta tcgagcttac tttttagccg cagcggggat tgctgagttt 16801 aaggattatc cccaaacaga tgcaatactg gaagaaattg ttaagtggac gattagtcat 16861 gattttgcta aaaaagaagc aatagctgca atacaacaga cggatcgcac aaaagcaatc 16921 aaggctttag ttgagttaat ccaaaactct caagatgaag aaactcgttg gggagcggca 16981 tatagcttag gggaaattgg caaggataac ccaattgcaa tcaaggcttt agttgagtta 17041 atccaaaact ctcaagatga agaaactcgt aggggagcgg catatagctt atggccctgt 17101 gccgaaaata tgccctaccc taagttttat cagctttggc acgagggata gggtggtgtt 17161 ttcaagcggt tgctcaattt tcctaaacgc ttgctcaggc ggataggcaa tagtatctaa 17221 aataaagtta tatgaacaaa cctgaagaag aactggaact acatttgcga ttccgtgtag 17281 tagcgcttta ggggagattg cttcgtccca cttcgtttca ctcgcaatga caggtattac 17341 
ctttgattgc aacttggtat gagcgtgtgt tatagcagtt gtcatttgga tgcagtacgc 17401 tttttaaccc caccccttac ccctcacgcc aggtgctaca acggggggaa cccccgcaac 17461 gcactggctc cccttgctaa gggg // LOCUS NODE_1896_length_17437_cov_4.92837417437 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17437) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17437) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17437 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 138..527 /locus_tag="DP116_16610" CDS 138..527 /locus_tag="DP116_16610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878166.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16610" /translation="MSDPHTPRKNDRALEKALTNKIIDDYFDSSESWLRAILRMCIFS LGHLDGQSVFVVECPNQAVAKRLSRKTHPFRGIVYYLTDNLNAGDRSLFCYRDSSEAT WRCFDTRTNTWRTLSHRQTPTAPTDGL" gene complement(579..833) /locus_tag="DP116_16615" CDS complement(579..833) /locus_tag="DP116_16615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315602.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16615" /translation="MTKNRTLTCTFLALLSGFIGGYIGGQITLSLHSQKCQNQTWILK QTCNFGVTPGAVWQGSTTGLWTGTVLGAFVGGLATRQTRE" gene complement(1022..4075) /locus_tag="DP116_16620" CDS complement(1022..4075) /locus_tag="DP116_16620" /EC_number="4.1.1.31" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870745.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoenolpyruvate carboxylase" /protein_id="PRJNA477356:DP116_16620" /translation="MNYLLYSFTQAVNIYPASDLFLRHRLQVVEELWESVLRQECGQK MVDLLRQLRDLCSPEGQAINEQASSVFKLIEQLNINEAIRAARAFALYFQLINIIEQD YEQQQQLTRYEVETESTTQETLPDFICSSNQEEAENPVNSGLGADLLTKSWQANSNSK RTATFASLFPYLFKLNVPPQQIQRLIAQLDVQLVFTAHPTEIVRHTIRDKQRRVVQLL QQLDVMEKRSTSGGPSWEVEEVREQLLEEIRLWWRTDELHQFKPSVLDEVDYALHYFQ EVLFDTIPHLYKRFKHALSSTFPWLEPPNKNFCKFGSWVGADRDGNPSVTPEITWQTA CYQRNIVLEKYIKSVKQLINLLSLSLHWSDILPDLLESLELEQSQLSEVYEQLALRFR QEPYRLKLSYVLKRLENTRDRNLALYKREPLKNENIPIYRSGAEFIAELRLIERNLTE TGLSCRELEHLICQVEIFGFNLTHLDIRQESSRHSDALNEILEYLGVLKVPYDELSEE ERTAWLVEELQTRRPLIPAELPFSEKTNDVIQTLRIVRSLQQEFSNNVCQTYIISMCR QVSDVLEVLLLAKEAGLYDPGTAIGSIQVVPLFETVEDLRRSTSVMRELFALPLYRAL LAGGYQAGETGEVPSSTSQFRSSLIPNLQEVMLGYSDSNKDSGFLSSNWEIHKAQKSL QKIAEEYGLNLRIFHGRGGSVGRGGGPSYEAILAQPGHSINGRIKITEQGEVLASKYS LRDLALYNLETISSAVIQASLLRTGFDDIEPWNEIMEELAARSRVHYRNLIYEQPDFI DFFHQVTPIEEISQLQISSRPARRPSGKKDLSSLRAIPWVFSWTQTRFLLPSWYGVGT ALQEFLNEEPEEHLKLLRYFYIKWPFFKMVISKAEMTLAKVDLQMAQQYVQQLSNPED KSRFEQVFEQIASEYYLTRGLVLQITGHQRLLDGDPVLQRSVQLRNGTIVPLGFIQIS LLKRLREARTNVTSGVIHSRYSKGELLRGALLTINGIAAGMRNTG" gene complement(4185..4412) /locus_tag="DP116_16625" CDS complement(4185..4412) /locus_tag="DP116_16625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16625" /translation="MPPVTPGSSSDSEQNLAFFAIPQSLLLQVGTASILLLQIGEKAT TETIQAFGEATEELFRGDRLPILNFPDDHES" gene 4738..7755 /locus_tag="DP116_16630" CDS 4738..7755 /locus_tag="DP116_16630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455236.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1574 domain-containing protein" /protein_id="PRJNA477356:DP116_16630" /translation="MKTELPDRQKSLVQWVSQATGISTFGVKVRLQGNELHILCEGGE CPERWRTLSDLLRALQQTDLDVLTSIDQPAIYQVFVYGQKKGENRPRWCHKVYLNQLD RHLEQVDQALLEDEEKSKKPCRALIVSNETLARQGDPEAIARYLSETLSTFDIAVQVE VIKPKPTENNDKNESQLWIFCESSYSPDPSLIAEPVAQKLRHLKLYGYQDAVIASRVK GENRIDWRFLVDLTPSEVMVKEWARWGDVQALSRLLSEALLKSKVAVEPILKESTLHI FCTPVSETLEPAPVPDKILCLQAVKPLLEKIAPQGIIAATVYGQQKITDNEPAWVDWL YLPAKEHPALAISAQELATSGDEPAIVFLLERLLNPDVDTRLKTGGLRVLVQRIGDLL HIMCDAPVCPAREQIASQVTEFVHQLKILGIAGVRVYGRCAGNKEPNWDYSVDYKHRE LLIAEAPPKFAAISAYVPNLLPTSKTDEPVLRPNLSTEEIYIFVTEVTQDWSANVRKL FLGTQLFTDNDKSQEKTTNHDQKQGLRVALVWGALGLLLTLQTDWIFGKILARTTAPT STVTSVSPKSSSTIKTSYTYRADEKQRTAFFTNTSKEKSPKDESSVFNGLKSTQPDLE ASPLKPKARPTAIILAARSYKSGALLKQRPSFKPPQLDQQLTLYKQRLAKIGHPPDVL IIGSSRALRGIDPVAVSKFFATQGSHNIDVFNFGINGATAKVVDFVVRQLLQPSELPK IIIWADGARAFNSGREDITFNTIAASPGYQYVLQKAAEKTTSTTNSTEQETAIEEPEQ GNSTYQAVDNWLSKGFATLSASYQKRDNLKNLLKQPLKYLPDISNTNQAVTQKSHRMN LEEASQQAVDLDGFLALSTRFQPTTYYQKYSQVSGNYDNDYKSFRLQGDQDTALQALL KFTQSQKITVVFVNMPVTAYYLDSVRSKYEQEFQQYMRTIAGKPNFIYQDLSQLWPKA NDYFSDPSHLNRYGAYKISKKLANDPTIPWFSK" gene 7931..9430 /locus_tag="DP116_16635" CDS 7931..9430 /locus_tag="DP116_16635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213184.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MBOAT family protein" /protein_id="PRJNA477356:DP116_16635" /translation="MNFISILYGLFLLSVLGIYWFVAQQKLRLWTLLLASLVFYASLQ VHYIPLLVVLTFFNFRIAKEIDEKTIPHPHSSEWQLSEEEWYFAQIDWNHRRLKLLWL GIVSNVFLLLIFKYLLPLLRFFSPNLVILSDDSFKLITPLGISFFTFECIAYLVDVYR GAPATQEFLQFAAYKFFFAKLISGPITRFHSLASEFKNLRLLTPDIVAEGLWLIARGA VKKGILADHLGTFVDLCFANLQRAGSTDLWLATFAYGFQLYLDFSGYVDIARGSALLF GLVLPENFDFPYFSTSIADFWRRWHITLGDWLRNYLYFPLGGSRQGFDRTCFNLIIIM LIAGIWHGAALGFVVWGVFHGLALGVHRFTDAISNRFEDLENFWQQPLGITLAWLLTQ FMVFTSWVWFRLPNLEDSSFVFQHLWGFPGDEQFAQKVYVEALNITQYQLATLLVVLY IAMAGVYIFNRTLKLQFSWPVKLVFVPLCFYSVWLLAPEGSLPYIYFDF" assembly_gap 9511..9520 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 10082..11164 /gene="psbA" /locus_tag="DP116_16640" CDS 10082..11164 /gene="psbA" /locus_tag="DP116_16640" /EC_number="1.10.3.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006631074.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II q(b) protein" /protein_id="PRJNA477356:DP116_16640" /translation="MTTTIQRRESGNVWERFCQWITSTENRLYVGWFGVLMVPTLLSA TICFIIGFIAAPPVDIDGIREPVAGSLIYGNNIISGAVVPSSNAIGLHFYPIWEAASL DEWLYNGGPYQLVIFHFLIGIFCWLGRQWELSYRLGMRPWICVAYSAPVAAATSVFLI YPLGQGSFSDGMPLGIAGTFNFMLVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSL VTSSLVRETTETESQNYGYVFGQEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLA AWPVIGIWFTALGISTMAFNLNGFNFNQSLIDSQGRVVSSWADVLNRANLGMEVMHER NAHNFPLDLASTEVAPVALSAPAING" gene 11495..12109 /locus_tag="DP116_16645" CDS 11495..12109 /locus_tag="DP116_16645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872508.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16645" /translation="MKLNLFADILSPKVRKSTALYAVAFAISSTATLTQPSYAQNQKF FCGMSKGVPATFVNTSRGKIPMIRWVDAGFAPPWTPERRCEDISARFQRFYDNGTLNF LRAGKSERQPVLCVAGENGGPCLPEGILLTLKPGKDPEDILQQLLNGRGGANPGIVEL SGNTNRDVVSSEKDAAYLDVQKLLSKMEGRGSTSCPAGQPLWKC" gene 12132..12905 /locus_tag="DP116_16650" CDS 12132..12905 /locus_tag="DP116_16650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865686.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_16650" /translation="MQWRLLTLVACIGSLSISSVASAPNVGVCVEQPLTQRLGNQLQY KAKSITVKVLSKNFLGSGILIHKQDSVYTVLTNAHVLKSGKPPYQIQTPDGYVYLADV PSTKDSPPYFKNNDLAVLQFRSPKVSYAIASIGSASSLNVGNEVFAAGFPFDFDRNQD QGFVFKTGKVSLILKKALEGGYQIGYTNDLQKGMSGGPLLNRFGEVVAINGMHAEPLW GNPYVYQDGTQPEQPLREKMSKSSWGIPIETFRRLVSTP" gene 13104..15209 /locus_tag="DP116_16655" CDS 13104..15209 /locus_tag="DP116_16655" /inference="COORDINATES: protein motif:HMM:PF12895.5,HMM:PF13365.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16655" /translation="MNFFSRRLPAVLMGAAVVMVQPQLAVALSPIQVSDIAKEFTVLI DGDGIGSGIIFERKGDTYLVITNQHVVPKNDGKYEIQTPDGSRYPVSRSQVLPGLDIA ILQFTSNKNYRLAELGNSDQIREGSTIYAAGWADSLPGITNERTYQFTNGFIRSRVKQ ADRGYALVYNNEVIPGMSGGPMLDENGRVVGVNGRAYNTETILAVLRIGIPVNTVLTA RSRPTTASSTVAAAPQERTAEALINLGGVRANRKDYRGAIGDYNQALRINPNNPDAYF RRSFAYFYLGDFPAATVDLNKVLELNPKNAVAYAQRSVLRIQQKDLQGALADGEQAVR LAPNLSLSYLSRGSVRLLSQDYKGAIADIDRFIQQDRKFAHAYAVRGFARAMSKDKQG ANADFDKAIQLDPNFFTTYQFRGVSRQLIWGDKEGASADFQKVAVLCQQELSTPLCQQ VQQEIKQAQDPTLLYKQAIADANLAIQRNPQDANAYLRRAAGYYLQGDNNKALEDLNQ ATRINSKYSQAWVLRGDALFKLGQKEEAISSYERAIQVNSEWGGASPAETWFKRGTIL QKLGRKQEAISSYERAIQANSESDNVYPASAYNNIGLVKYEQGDVEGAIRQFQSAINN DSKKVEPQLALAVALYTKGEREKGLTMAESALRSENRYADVEFLKKNLWGEKLIADTQ KLLQTPKIREITSRPSRSQ" gene 15225..16931 /locus_tag="DP116_16660" CDS 15225..16931 /locus_tag="DP116_16660" /inference="COORDINATES: protein motif:HMM:PF13365.4,HMM:PF13432.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16660" /translation="MALENLNMNFFSRHLPPVLMGAAVVMVQPQLAVALSPTQISDIA KEFTVLITGEGIGSGVIFERKGDTYSVITNQHVVANDGRYEIQTPDGSRYPVYRSQEL PGLDVAILQFTSKKNYRLASLGNSDQIRQGMTVYVVGWGLVNDLSDKTNKSSYLTFAG IIESLSKNPQQGYGLAYNNQAISGMSGSPVLDENGRVVGINAARLDQNLTVQGTRLLA GWRLGIPINMVLTTRNRPVSSPVTAQQPGAIAFTTLRRTEALISSGGAKKNRKDYQGA IADYNQALRINPNNPDAYLQRGSAYYYLKKYQAAREDFNKVLQLSPKNANAYNNRGVL RYQSGDKQAALADFNSAIQLDPKLAGTYYNRGAIRDQSGDKQAALADYNSAIQLDPKN APAYIERAVLRYESGDKQAALADYNQAIQLDPKNAKVYTNRGFLRKESGDKQAALADF NQAIQLDPKDAFAYYNIGLVKYEQGDIEEAMRQFQTAINNDSKKVQPQLALAVALYSK GEQEWGFSTVEVVLRSDKRFADLKFLKKELLWGDKLIADTKKLLENPKIREITSRPSR SQ" gene complement(17156..>17437) /locus_tag="DP116_16665" CDS complement(17156..>17437) /locus_tag="DP116_16665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742887.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_16665" /translation="TDEPVRVSTNWRDDYLKKYADYEEMGIREYWIVDYAGLGGREFI GDPKQPTILVCSLDEGEYRVTKFRGDEHIVSPTFPELTLTAQQIFNAET" BASE COUNT 4929 a 3735 c 3794 g 4969 t 10 others ORIGIN 1 cccttacacc cctaccccct tacaccccta gttcttgact caccttgaac ttaatgagtg 61 tatgtttcga catacgcttg tcacttttag agcttttggt ttaatctcaa accaacggta 121 aagcattagg ttgcgttatg tcagaccctc acactcctcg taaaaacgac cgtgccctag 181 aaaaagctct cactaacaaa attattgatg attattttga tagctcagaa agctggctgc 241 gagccatttt gcgcatgtgc atcttctctt tgggtcattt agatggacaa tccgtgtttg 301 tcgtagaatg tcccaatcaa gcagtagcta agcggttgag tcgaaaaact catcctttta 361 gaggaattgt ttactattta actgataatc ttaacgctgg cgatcgctct ctgttttgct 421 accgagactc ttctgaagct acttggcgtt gttttgacac cagaaccaac acctggagaa 481 ctttgagtca tcgacaaaca ccaacagccc ctactgacgg gttataaacc atcttgtgca 541 ggggaactgg gggagaaata actctctccc tcactccctc actcccgagt ttggcgtgtc 601 gctaaacccc caacaaatgc acccaagact gttcctgtcc 
acagtcctgt tgtgctacct 661 tgccaaactg cccctggtgt taccccaaag ttacacgtct gttttaaaat ccaagtttgg 721 ttttgacact tctgactgtg gagacttaag gtgatttgcc ctccgatgta acctccaata 781 aatcctgaca ggagggctaa aaaggtgcaa gtgagagttc gatttttagt cattagacgt 841 tgcagaagta ggaacaggga acagggaaca gggaacaggg aacagggaat agggaacagg 901 gaactcttaa cagaacctcg taaaatctca cttttgcaag aggtctatta gtcattagtc 961 attagtcatt agtcattagt caatgtttca ggacttttga ctaatgactt tgtataacaa 1021 atcaaccagt atttctcatc cctgcagcaa taccgttgat ggttaataac gctcctcgca 1081 acaattcgcc tttactgtaa cgagagtgaa tcactccgga tgtcacattt gttcgagctt 1141 cgcgcaggcg cttgagcaag gaaatttgta tgaagcctag gggcacaatt gtaccattac 1201 gtaactgcac cgagcgctgc aatacggggt ctccatccaa aagccgttga tgaccggtga 1261 tttgtaaaac caaacccctt gtcaaatagt attcactagc gatttgttca aaaacttgct 1321 caaaacggga tttgtcttca gggttagaca gttgttgaac atattgctgt gccatttgca 1381 agtctacttt agccaaggtc atttctgcct tggaaatcac cattttgaag aagggccatt 1441 tgatataaaa ataacgcagc aacttcaaat gttcttctgg ttcttcgttt aagaattctt 1501 gcaacgctgt accaacaccg taccaagaag gtagtaagaa tcgggtttgc gtccagctga 1561 aaacccaagg aattgcccgt aagctgctca aatctttttt accagatgga cgacgggctg 1621 gacgagagct aatttgcagc tggctaattt cctcaatggg ggtgacttga tggaaaaagt 1681 caataaaatc aggttgttcg tagatgaggt tgcggtagtg aacacgcgat cgcgccgcta 1741 attcttccat aatttcattc caaggttcta tatcatcgaa ccccgtccgc agcaagctcg 1801 cttgaatgac agcagaactg atcgtttcca gattatacag cgctaagtcc cgcaaggagt 1861 atttagaagc taaaacctct ccttgttcgg taattttgat tcgtccgttg atactgtgac 1921 caggctgagc caaaatagct tcataagatg gaccaccacc gcgtccaaca gaaccacctc 1981 gtccatgaaa aatccgcaaa tttagaccat attcttctgc gattttctgt agtgattttt 2041 gagctttgtg gatttcccag ttactgctta agaaacctga atctttgttg ctgtcggaat 2101 atcccagcat cacttcttgc aagtttggga tcagggatga tctgaattgg gaggtagagg 2161 aaggaacttc ccccgtttcc cctgcttgat accctcctgc tagcaaagcg cggtacaagg 2221 gtaatgcaaa cagttcccgc atcacgcttg tggaacgtcg caagtcttct actgtttcaa 2281 acaggggaac aacttgaatg cttccaatag cagtacccgg atcataaagt 
ccagcttctt 2341 tagcgagtag caacacttcc aacacgtcgc tgacttggcg acacatgctg ataatatagg 2401 tttggcaaac attattacta aactcttgtt ggagcgatcg cacgatccgt aaagtttgaa 2461 tcacatcgtt cgttttttcc gaaaatggca gttctgctgg aattaacggg cgacgtgttt 2521 gcagttcttc aaccagccaa gcagttctct cctcttctga cagttcgtcg taggggactt 2581 ttaaaactcc caggtattcc aagatttcat ttaaagcatc agagtgacgg gatgattctt 2641 ggcggatgtc gagatgcgtt aagttaaacc caaaaatttc tacctgacag ataagatgtt 2701 ccaattctcg acaactcaaa cctgtttctg tcaagttgcg ttcaatcaag cgtagttctg 2761 ctataaattc tgctcctgag cggtaaattg ggatattctc attctttagg ggttctcgtt 2821 tgtacagagc gaggttgcga tcgcgagtat tttccagccg cttcaggaca taagatagtt 2881 tcagccgata tggttcttgt cgaaagcgca aagccagttg ctcgtatact tcactcagtt 2941 gagactgttc caactccaaa gattccagca agtctggtaa gatatcactc cagtgcagcg 3001 atagactcaa caagtttatc aactgcttca ccgacttgat gtatttctct agcacaatgt 3061 tgcgctgata gcaagctgtt tgccaggtaa tttctggtgt caccgatgga tttccatccc 3121 tgtctgctcc tacccaagag ccaaacttac aaaagttttt attgggaggt tccaaccagg 3181 gaaaagtgct agacaatgca tgtttgaagc gtttgtataa gtggggtata gtatcaaata 3241 acacttcttg gaagtaatgc aaggcatagt ctacttcatc gagtaccgaa ggtttaaact 3301 ggtgtagttc gtcggtacgc caccacaggc gaatttcttc gagcaattgt tctcgcactt 3361 cctcaacttc ccaggaagga ccaccgctag tagagcgctt ttccatcaca tctagctgtt 3421 gcaacagctg aaccactcgc cgttgcttat cacggattgt atgccgaaca atttctgttg 3481 ggtgagctgt aaagacaagt tgcacgtcca actgtgcaat cagacgttga atttgttgtg 3541 gtgggacgtt taatttgaat aaataaggga acaaactggc gaaagttgct gttcgtttac 3601 tattggagtt tgcttgccaa ctttttgtca acaagtctgc cccaagtcca ctgttaacag 3661 gattttctgc ttcttcttgg tttgatgaac agatgaaatc tggtagagtt tcttgagttg 3721 ttgactctgt ttctacctca tagcgagtta attgctgttg ttgttcgtag tcctgttcta 3781 tgatgttaat caactggaaa tacagagcaa aagcacgagc cgctcgaatt gcttcgttga 3841 tgttgagctg ttctatcaac ttgaagacag aggaagcttg ctcgttgatt gcttgtcctt 3901 ctggggaaca caaatcccgt agttgccgca gcaagtctac catcttttga ccacactctt 3961 gacggagaac cgactcccac aattcctcta cgacttggag gcgatggcgc aaaaataagt 
4021 cggaggcggg gtagatgttc actgcctgag taaaagagta taaaaggtaa ttcatatcct 4081 ttatgctggt tcagcaggtt aagattttta gattctatga actgagtcag ttatgatagt 4141 tggcatagaa tcattgacga ggcaaagtag attttttagc tttgttaaga ttcgtgatcg 4201 tcagggaagt taagaatggg caggcgatcg cctcgaaata attcttctgt tgcctcccca 4261 aacgcttgta tagtttctgt ggtggctttt tcccctattt gcaatagtag tatagacgca 4321 gtaccaactt gcaaaagcaa agactgggga atggcaaaga aagccagatt ttgttcagaa 4381 tcagatgaag aaccgggggt cacaggtggc atttgtattt cctgggggtt caataaggga 4441 gagggataaa atgtaaaacc aaacaaaagt cagtcttgtg actgtataca actatcttgg 4501 ccgattttac aaagttataa ggtaacaaaa gtgtaaaaaa aaacgacata gtgtgataat 4561 tctatggttc tagtttacaa ggaattggga ttggcgattc gggaataacc attgagaatc 4621 atctcacgaa gttcctagtc ggaacaggtt gacaaaatcc ccctgagcta gtaagcaatt 4681 tcttgctaaa cgaaagcact gttccctatc ataagcttcc tgtcaagtca agtttccatg 4741 aaaacagaat taccagatcg ccaaaaatct ctagtgcagt gggtcagcca agcaacaggg 4801 attagcactt tcggtgtgaa agtccggttg cagggaaacg aactacacat tttatgtgaa 4861 ggtggagaat gtccagagcg ttggcgaact ctgtctgact tgctgcgagc attacaacaa 4921 acagatttag atgtcctgac tagcattgac caacctgcaa tttatcaagt ttttgtctac 4981 ggacagaaaa aaggggaaaa tcgaccgaga tggtgtcata aagtttatct caatcaactt 5041 gatcggcatt tagagcaggt agatcaagcg ctgctagaag acgaggaaaa atccaaaaaa 5101 ccttgtcggg cgctgattgt ttctaacgaa actttggcac gccaaggaga tccagaggcg 5161 atcgctcgct atctgagtga aacactcagt acttttgata ttgctgtaca agttgaagtc 5221 atcaagccaa aacccacaga aaacaatgac aaaaatgaaa gtcagctgtg gatattttgt 5281 gaatcgagtt atagcccaga tccatcttta atagccgaac cagtcgccca gaaattacgc 5341 catcttaaac tttatggtta ccaagatgct gtgattgctt cgcgtgtcaa gggtgaaaac 5401 agaatagatt ggcggttttt ggtggatttg acgccatctg aggtcatggt taaggaatgg 5461 gcgcgttggg gagatgtgca agccctgtcg cgtttgttaa gtgaggcgtt gttaaagtca 5521 aaagtagcag tagaaccaat actcaaagaa tcaacactac atatcttttg taccccagtt 5581 tctgaaacat tggaacctgc tccagtacca gataagatac tgtgtttaca agctgtaaaa 5641 cccctgttag aaaaaatagc cccccaaggt attattgcag ccacagtgta tggacaacaa 5701 
aaaataacag acaacgaacc agcatgggtt gattggttat atttacctgc gaaggaacat 5761 ccagctcttg caatatcagc ccaagagttg gcgacttctg gggatgaacc tgctattgtt 5821 ttcttacttg aacgtttgct caaccctgat gtagatacgc gtctcaaaac gggaggtctt 5881 cgcgtccttg tgcaacgcat tggggattta ttgcacatca tgtgtgatgc acccgtttgt 5941 ccagcacgcg aacaaattgc ctcacaagta actgagtttg tgcatcagct caaaatcctt 6001 ggaattgcgg gtgtacgtgt ctatggtcgt tgtgctggta acaaggaacc aaattgggac 6061 tacagtgtcg attataaaca ccgcgaactc ttgatagcag aagcacctcc aaaatttgct 6121 gcaatttctg cttacgttcc caatcttctt ccaacttcta agactgacga acctgtgctg 6181 cgccctaacc tatctactga agaaatttac atctttgtca cagaagtgac tcaagattgg 6241 agcgcaaatg tcagaaagct ttttttagga acgcagctct ttacagataa cgacaaatca 6301 caagaaaaaa ctacaaatca tgatcaaaaa caaggactga gagttgcctt agtttggggt 6361 gcgttgggat tactgctgac tttacaaaca gattggattt ttgggaaaat tttagcccgc 6421 accacagccc caacatctac ggtgaccagc gtttcgccga aatcatcttc tactattaaa 6481 acatcgtata cttaccgggc tgacgagaaa cagagaactg cattttttac gaatacttct 6541 aaagaaaaat ctcctaaaga tgagagcagt gtcttcaatg gtttgaaatc tacgcaacca 6601 gatttggaag cgtcaccgtt gaagccaaag gcaagaccaa ctgctattat tcttgccgca 6661 cgttcttaca aatcaggagc attactcaag caaagaccaa gtttcaaacc tccgcaatta 6721 gatcagcaac ttacactata caaacagcgt ttagcaaaaa tagggcatcc accagatgta 6781 ttgattattg ggtcttcccg cgctctcaga ggaatagatc ctgtcgctgt ttctaaattt 6841 tttgcaactc aaggttctca taatattgat gtttttaact ttggcattaa tggtgccaca 6901 gcaaaagtcg tcgattttgt tgtgcgtcaa cttttgcagc catcggaact accaaaaatt 6961 attatttggg cagatggtgc tcgtgctttc aacagcggtc gagaggacat aacctttaac 7021 acaattgctg catcaccagg atatcaatat gtgttgcaaa aagcagcaga aaaaacaacg 7081 agcacaacga atagcacaga acaggaaacc gcaatagaag aacccgagca aggtaacagc 7141 acttatcaag cggtagataa ttggttaagt aaaggttttg ctactttatc tgctagctat 7201 caaaagcgtg acaatctcaa aaatcttttg aaacaaccgc tgaagtattt gcctgatatt 7261 agcaacacga accaagcagt tactcaaaaa tcgcacagga tgaatctaga agaggcttca 7321 cagcaagcag ttgacttgga tggatttctt gctctttcga ctcgcttcca accgactaca 7381 tactatcaaa 
aatattctca agtttctggg aactacgaca acgactacaa atcttttcga 7441 ctccaaggtg accaagatac tgctctacaa gcactactta aatttaccca gtctcagaaa 7501 ataaccgtag tgtttgtcaa tatgcctgtt acagcatatt atttagactc agtacgctca 7561 aaatatgagc aagaatttca gcagtatatg cgaactatag ctggcaaacc aaactttatt 7621 taccaagact taagtcaatt atggcccaaa gcaaatgact acttttccga ccccagccat 7681 ctcaaccgct acggtgctta caaaatatcg aaaaagcttg cgaatgatcc tacaattcct 7741 tggttcagta aataaatcat ggggacagta ggggtaagtt gttacgtcta caagtagaac 7801 tagggggagt gataaagagg gtttgaagct tttcttcaaa gttaaaaaga tgcattttta 7861 aaagcgtggc agcttataaa actaatgact aatgactaat gactaatgac taatgactaa 7921 tgactaaaac atgaacttta tatctattct atacgggtta ttcttgctga gtgtgctggg 7981 aatttattgg tttgtggcac aacaaaagtt gcggttatgg acgttgctgc ttgctagcct 8041 tgtgttttat gcatctttgc aagttcatta catcccgtta ctagtagtac tgactttttt 8101 taattttcgt attgcaaaag aaattgatga aaaaacgata ccacatcccc attcttcaga 8161 gtggcaactt tctgaagaag aatggtattt cgctcaaatt gattggaatc atcgccgtct 8221 caagcttttg tggctaggta tagtttctaa tgttttttta ctacttattt ttaaatactt 8281 actaccttta ttaaggtttt tttcccccaa tcttgtcatt ttatctgatg actcttttaa 8341 actgattacc cctttgggaa tttctttttt tacctttgag tgtattgcat atttagttga 8401 tgtctatcgt ggggcacctg ctactcagga gtttctccaa tttgccgcat acaagttttt 8461 ctttgctaaa ctgatttcag gtccaattac tcgtttccac agcttagcga gtgaattcaa 8521 aaatctccgg ttgctcactc ctgatattgt ggcagaggga ctatggctca ttgctagagg 8581 tgcagtcaaa aaaggtattt tagcagatca cttgggaact tttgttgatt tatgttttgc 8641 taacttgcaa agggcaggca gtacagattt gtggttggca acttttgctt atggtttcca 8701 gttgtattta gattttagtg gttacgtgga tattgcccgt ggtagtgctt tgttatttgg 8761 gttagtttta cctgaaaatt ttgactttcc ctacttcagc acgagtatcg ctgacttttg 8821 gcggcgctgg catataactc tgggagactg gctgcgtaat tacctgtact tccctttggg 8881 tggttctcgc caaggatttg atcgcacctg ctttaatcta atcattatca tgctgattgc 8941 aggtatctgg cacggtgcag cattgggttt tgtggtttgg ggagtctttc acgggttagc 9001 tttgggtgtt catcgtttca ctgatgcgat cagcaatcgc tttgaagatc tggaaaattt 9061 ttggcaacag ccattgggta 
taactttggc ttggctactc acgcagttca tggttttcac 9121 ctcttgggtt tggttccgcc tacccaacct cgaagattcc tcttttgtat ttcagcatct 9181 ttggggtttt cctggggatg agcaatttgc tcaaaaggtg tacgttgagg cgttaaacat 9241 aactcaatat caactagcaa ctttgctagt tgttctatat attgcaatgg ctggagtcta 9301 catttttaac cgaacactca agttacagtt cagttggcct gtaaagcttg ttttcgtacc 9361 tttgtgtttt tacagtgttt ggttactcgc tcctgaaggc agtttacctt atatatattt 9421 tgatttctag ttgatccgat ttgatttggt gaaaaaatct aagtatctgt agtatctgta 9481 ggggagccag tacggtgcgg gggttcccga nnnnnnnnnn acagcccgag ccactgcgtt 9541 gggcgggttt cccgacttga agcacgtggc gtcccgttgt agtatctggc gttgtgtgag 9601 agtgcataac gcaccaaaac ctgatgctgg tgcgttagcc accaggcata acgcacccta 9661 cgtacttaat ttttcttaat aattttgtct ggaagacttg cacaccgttg ttatttgtca 9721 ccaagattaa gctattgata ttatgtttta ttttttaatt ttgaatgagg gaattttaaa 9781 ttgttaaaac ctctatgttt acagatagtc tcagcttcag taggggttga aatcctcgat 9841 tgaaactgta tttcatttta aatattgaat tattaattgt gcttatttac ttaacacacc 9901 ataaaatttc tcgttaggtt tatacgaata actttacttt ttttaaacaa aatacaactt 9961 gtaacaaagt attaagtaat cacttcagtt caagtgaatt gatgtaaatt aaaacatgac 10021 aggtaattca aggagttacc tgataaataa cagtcaaaac atttatcgca cttataaaac 10081 aatgaccaca acaatacaac gtcgcgaaag cggcaacgta tgggagcggt tctgccagtg 10141 gatcacctcc accgaaaatc gcctatatgt aggttggttc ggtgtactga tggttcccac 10201 cctcctctcc gctaccatct gtttcatcat cggttttatc gcagcacctc ctgttgatat 10261 cgacggtatc cgtgagccag ttgcaggttc tttaatttac ggaaacaaca tcatctctgg 10321 tgctgttgtt ccttcctcca acgctatcgg tttacacttc tacccaatct gggaagctgc 10381 ttctttagat gagtggttgt acaacggtgg tccataccaa ttggtgatct tccacttcct 10441 tatcggtatt ttctgctggt taggtcgtca gtgggagtta tcctaccgct taggtatgcg 10501 tccctggatc tgcgtagctt actctgcacc tgttgcagca gcaacctccg tattcctgat 10561 ttaccccctc ggacaaggtt ccttctccga tggtatgcct ctgggtattg ctggaacttt 10621 caacttcatg ttggtgttcc aagcagagca caacatcttg atgcacccct tccacatgct 10681 gggcgttgct ggtgtcttcg gtggttcact gttcagtgca atgcacggtt ctttggtgac 10741 ctcctccttg 
gttcgtgaaa caaccgaaac tgaatctcag aactacggtt acgtcttcgg 10801 tcaagaagaa gaaacctaca acatcgttgc tgctcacggc tactttggtc gcttaatctt 10861 ccaatacgct tctttcaaca acagccgtag cttgcacttc ttcctggctg catggcctgt 10921 catcggtatc tggttcaccg cactgggtat cagcaccatg gcgttcaacc tcaacggttt 10981 caactttaac caatctctga ttgactctca aggtcgcgtg gttagctctt gggcagatgt 11041 gctcaaccgt gcgaacctgg gtatggaagt catgcacgag cgtaacgctc acaacttccc 11101 cctcgactta gctagcacag aagttgctcc tgtagcactt tctgctcctg ctatcaacgg 11161 ctaattctta atagcctaag gacgtgatgt ttcgcgtcct ccataagtga aaagcgcctt 11221 tcttgaaaga gagggcgctt tctgattttt atattttttt ataattgaaa taatatttga 11281 ctccgttctt gtccgcctag gactgtaagt cccaggctca tagtcgaagt ccattaaaat 11341 ggactgacac tggaattgat ttttgagtcg attgctaaca ggaggtttaa atgggtcaaa 11401 ataatgtgca tctatggcac tttacgggca aactatcaca atgactcacg aaaagtaaat 11461 attttaagat agtttcataa ggattgaaat aactatgaaa ctcaatttat ttgccgatat 11521 actcagtccg aaggttcgta aaagcactgc gctttacgcc gtcgccttcg ccatcagttc 11581 aactgcaact ctgactcagc ccagttatgc tcaaaaccaa aagtttttct gcggaatgag 11641 caagggtgtt ccagcaacat tcgttaatac ctcacgggga aaaatcccga tgattcgctg 11701 ggttgatgca ggatttgctc ctccttggac tcctgaaagg cgctgtgaag atatatctgc 11761 cagattccaa cgattctacg acaacggcac gctaaatttc ctccgcgctg gtaagtccga 11821 acgtcaacct gtattgtgcg ttgctggcga aaatggtggt ccttgtttgc ctgagggaat 11881 attgctcacc ctcaagcctg gtaaagatcc tgaggatatt ctacaacaac tgcttaatgg 11941 tcgcggtggg gctaatcctg gaatcgttga actcagtggt aacaccaaca gagatgttgt 12001 ttcctcagaa aaagacgcag cttaccttga tgtccaaaaa ctcctttcta aaatggaagg 12061 tagaggaagc acatcttgtc cagcaggaca acctctttgg aaatgttaag caaattgcaa 12121 actaaatcaa catgcaatgg cgcttactta cactggttgc ctgtattggt agtttatcaa 12181 tttcgtcggt agcatcagca ccgaatgtcg gggtttgtgt tgagcaaccg ttaacccaac 12241 gtttgggaaa ccaactgcaa tacaaagcta agtcgattac agtcaaagtc ttgtcaaaaa 12301 actttctggg gtcaggtatt ctgattcaca aacaagactc agtttataca gtactgacga 12361 atgctcatgt actcaagtca ggcaaacctc cctatcaaat tcaaaccccc gatggttatg 
12421 tgtatctggc tgacgtccct agcaccaaag actctccccc ctactttaaa aacaatgatt 12481 tggctgtttt gcagtttcgc agtcctaaag tcagctatgc gatcgcctct attggttctg 12541 catcaagttt aaatgtgggc aatgaagtgt ttgcagctgg atttcccttt gattttgaca 12601 gaaatcaaga ccaagggttt gtttttaaaa caggtaaagt ttctttaatt ctgaagaaag 12661 ctttagaagg tggataccaa attggatata ctaacgattt gcaaaaaggt atgagtggcg 12721 gaccactgct caatcgcttt ggtgaggtgg tagctattaa tggaatgcac gctgaacctc 12781 tttggggtaa tccatatgtc tatcaagatg gcactcaacc ggaacaacct ttgcgagaga 12841 agatgagtaa atctagctgg gggattccga ttgagacatt taggcggttg gtatcaaccc 12901 cctaagccct gcgggcacgc tgcgcgttag ccctctgggc gtgcgctctg cgcatacggg 12961 ggagttcaaa ataaagaaaa ttaaattgta gggtgtgtta tcgctttagc gtaacgcacc 13021 atcgacaatt taaggtgcgt tagccttcgg cataacacac cctacgtctg attttgatat 13081 aacaattggg gcaaaagttg aatatgaatt tcttttctcg tcgtcttcca gcagtgctga 13141 tgggtgcagc agtcgtgatg gtgcaacctc aattagctgt agcattatct ccaatacaag 13201 tcagcgacat tgctaaagaa tttactgttc tgattgatgg ggatggaatt ggttctggaa 13261 tcatttttga acgtaaaggt gacacctacc ttgtgatcac caatcagcac gtggtgccta 13321 agaatgatgg gaaatacgag attcagacac ctgatggaag tcgctaccca gtttcccgca 13381 gccaagtttt gcctgggtta gacattgcaa ttttgcagtt tactagtaat aagaattatc 13441 gtctagcaga gttgggaaac tctgaccaaa tacgggaagg ctcaacaatt tatgcagcag 13501 gttgggctga tagtttacca ggtatcacca acgaacgtac ctatcaattt acgaatggct 13561 ttattcgcag tcgcgtgaag caagctgatc gtggctatgc cttagtttat aacaatgagg 13621 tgataccagg gatgagtggt ggtccgatgt tggatgaaaa tggtcgtgta gtaggagtta 13681 atgggcgagc ctataatacg gaaacgatat tagcagtttt gaggatagga attccagtta 13741 acactgtttt aacagccaga agtcgcccaa caacagcttc ctctactgtt gcagccgctc 13801 cacaggaacg cacggctgaa gctttaatca acttaggagg cgtgagagcc aatagaaaag 13861 attaccgagg agcaattggt gattacaacc aagctttgcg aattaatcct aacaatcctg 13921 atgcttactt ccgacgaagt tttgcttact tttacttagg ggattttcca gccgctactg 13981 tggacttaaa taaggtattg gaactcaatc ccaaaaatgc tgttgcttac gcccaacgga 14041 gtgttcttcg cattcagcag aaagacttgc agggggcgct 
tgctgatggt gagcaagcag 14101 ttcgcctggc tcccaaccta agcttaagct acctttctcg tggtagtgtc cgcctcttat 14161 cacaagacta taaaggagcg atcgcggata tagacaggtt cattcagcaa gaccgtaaat 14221 ttgcccatgc ctacgctgta cgaggttttg cccgcgctat gtcaaaagac aagcaaggag 14281 caaacgctga tttcgataag gcaattcagc tagatcccaa ctttttcact acataccaat 14341 ttaggggtgt tagccgccaa ttaatatggg gagacaagga aggagcaagc gcagattttc 14401 aaaaagtcgc agttctttgt cagcaagaat tgagtacacc cctttgtcaa caagtacagc 14461 aagaaataaa gcaggcgcaa gacccaacat tgttatacaa gcaagcaatt gctgatgcaa 14521 atttggcgat tcaacgaaat cctcaagatg ccaatgctta tctcagacgg gctgctggtt 14581 actatctcca aggagataac aataaagcgt tggaagatct aaaccaagct actcgcatca 14641 attccaaata ttcccaagct tgggttctgc gaggagatgc tttattcaag ctaggacaaa 14701 aagaagaggc tatctcctct tatgaacgtg caattcaagt taacagtgaa tggggcggcg 14761 caagccctgc tgaaacttgg tttaagcgag gtaccatatt gcagaagcta ggacgaaaac 14821 aagaggctat ctcctcttat gagcgtgcaa ttcaagctaa cagtgaatcg gataacgtat 14881 accctgcttc agcttacaac aatattgggc tagtcaaata cgaacaggga gatgtagaag 14941 gagctatccg ccaatttcaa agtgctatta ataatgacag taaaaaggta gaacctcaat 15001 tagcccttgc agttgcactc tatacaaagg gcgagcggga aaaaggttta acaatggcag 15061 aatctgcttt acgttcggaa aataggtatg ctgatgtaga gtttcttaaa aaaaatcttt 15121 ggggcgagaa gctcatagca gatacacaaa aactcctaca aactcccaaa atccgagaaa 15181 tcacatctcg cccatctcgc agtcaatgat atttatataa aaaaatggcg ttggaaaatt 15241 taaatatgaa tttcttttct cgtcatctac caccggtgct gatgggtgca gcagtcgtaa 15301 tggtgcaacc tcaactcgct gtcgcattat ctccaacaca aatcagcgac attgcgaaag 15361 aatttactgt tctgattact ggtgaaggta ttggttctgg ggtgattttt gaacgtaaag 15421 gtgacactta ttctgtgatc accaatcagc acgtggtggc taacgatggg agatatgaaa 15481 ttcaaacacc tgatggcagt cgctacccag tttaccgcag ccaagaacta cctgggttgg 15541 acgttgcaat tttgcagttc acaagcaaga aaaattaccg ccttgcgagt ttaggtaact 15601 ctgaccaaat caggcaagga atgacggttt atgtggtagg ttggggttta gttaatgatt 15661 tatcagataa aaccaacaaa tccagctatc ttacctttgc tggaatcatt gaaagtctga 15721 gcaaaaaccc acagcagggt 
tatggtttgg catataacaa tcaggcgata tccggaatga 15781 gtggtagtcc ggtactagat gaaaatggtc gtgtggtggg gattaacgca gcaagacttg 15841 atcaaaatct cacggtacaa ggaacacgac tgttggcagg ttggaggttg ggaattccga 15901 ttaacatggt tttaacaact cgaaatcgtc cggtaagttc gccagtaacc gcacaacaac 15961 caggtgctat agcgtttaca acactaagga gaactgaagc tttaattagc tcagggggag 16021 ctaagaaaaa tagaaaagac taccaaggag cgatcgctga ttacaaccaa gctttgcgga 16081 ttaatcccaa taaccctgat gcttacttgc aacgaggtag cgcttactat tatctgaaga 16141 agtaccaggc agctcgtgag gattttaaca aggtactgca actcagtccc aaaaatgcaa 16201 atgcctacaa caacaggggt gttctccgct atcagtcggg agacaagcaa gcagcgctgg 16261 cagattttaa ctccgcaatt cagttagacc ccaaattggc aggtacctac tacaacaggg 16321 gtgctatccg cgatcagtcg ggagacaagc aagcagcgct ggcagattat aactccgcaa 16381 ttcagttaga ccccaaaaat gcacctgcct acattgaacg ggctgttctc cgctacgagt 16441 cgggagacaa gcaagcagcg ctggcagatt ataaccaggc aattcagtta gaccccaaaa 16501 atgcaaaagt ctacaccaac aggggttttc tccgtaagga gtcgggagac aagcaagcag 16561 cactggcaga ttttaaccag gcaattcagt tagaccccaa agatgcattt gcctactaca 16621 acattgggtt ggtcaaatac gaacaaggag atatagaaga agctatgcgt cagtttcaaa 16681 ctgctattaa taatgacagc aaaaaggtac aacctcaatt agctcttgca gtcgcactct 16741 atagtaaagg agagcaagaa tggggttttt caacggtaga agtggttttg cgttcggaca 16801 aacgttttgc tgatttaaag tttctcaaga aggaactact ctggggcgat aagctaatag 16861 cagatacgaa aaagctttta gaaaatccta agattaggga gataacatct cgcccatctc 16921 gcagtcagta aatttcttgt gtgaatacct ggcgaatgga attcgcggct atacagacaa 16981 agtccgccta cgcggactaa tttgtagcct gcggaggcag gctttgtttg tgtagcctca 17041 gacttccagt ctgaaggcag aattctgttt ttcttattta gacagcgtgg acgtgtttaa 17101 catctcgccc atctcgcagt cagtaaattt cttgcactca ttcgctattt tggattcatg 17161 tctcagcatt aaaaatctgt tgggcagtta aggtcagttc tgggaacgtg ggagatacaa 17221 tatgctcatc acctcgaaac ttggtaactc gatactcacc ttcatccaat gagcaaacca 17281 aaatagtagg ttgttttgga tctcctataa actctcttcc gcctaaacca gcataatcaa 17341 cgatccaata ctcacgaatc cccatttcct cataatcagc atattttttg agataatcat 17401 
cacgccaatt agtactaacc cttacgggtt cgtcagt // LOCUS NODE_1900_length_17395_cov_5.775490 17395 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17395) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17395) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17395 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(161..1336) /locus_tag="DP116_16670" CDS complement(161..1336) /locus_tag="DP116_16670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317126.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_16670" /translation="MIVKNEEATLSKCLGSVKNVVDEMVVLDTGSTDSTPQIAQKFGA KVHHFEWCNDFSAARNEALKYVTGDWILVLDADETLTQKIVPQLKQAIRREEYLLINL LRHEVEAEQSPYSLVSRLFRNHPDIRFSRPYHALVDDSVSEILTQEPGLQVGYLEGVA ISHTGYQKSAIAQQDKFTKAQAAMEGFLASHPNDPYVCSKLGALYVESGKLIEGIKLL AQGVANCEEEYETLYELYYHLGIAYSRLQNYKNAIAHYDAAVKLPIYPILKLGAYNNL GNLLKAAGDLNGAKTAYEATIKIDSSFVPGYYNLGMTLKEMSLFKDAILAYQKAIELS PKYADAYQNLGVLLLKLGYVKDSLVSFKKAIALHEEQQNPEEAKRLRQGLKEMGLLR" gene 1727..2533 /locus_tag="DP116_16675" CDS 1727..2533 /locus_tag="DP116_16675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357732.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_16675" /translation="MNNSSLIRPLAVVTGASNGIGYELAKQFAQNGFDLIITATGSSI NEAAQAFVGLGVKVETVQSDLATYDGVETLYNKIKAANRPVDAIAINAGVGVGGDFAR ETDLQDELNLINLNVVSTVHLAKRVVKDMVTRGKGRILFTSSIAALMPGPFEAVYAAS KAFVHSFSQGLRNELKDTGVTVTALMPGPTDTNFFQRAGMDDTNVGANQKDDAAEVAK QGFEALMAGKDEIIAGSLKTKILGTVSKILPDTVTAELHSKLSEPGSANK" gene 3035..4273 /locus_tag="DP116_16680" CDS 3035..4273 /locus_tag="DP116_16680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317127.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16680" /translation="MEQKSNTPRLPLPQPLVGAEAGTFTEFTVTQRMPNIARRVIAEN KFPANINASLEKLASELPSGYLPTLVDDTGSDFADWSKYLESYKEQRWIDIPWFFTET YFYRYLLQITNYFRAGESQGVDPFELQKRQGLETSLDSIVALCTQVNGWLNVSEQENQ LRQTALITLLYFGLWGNRVDLSLWSAFETDRSRFDIQNQQSHILVDDGLKVTELLVNS NSGRVDFVVDNAGFELVCDLCLVDYLLGSGVASVVRLHLKSHPTFVSDAMIKDVHQTT EFLLASSNPEVTSFPQRLQQYIASDQLVLCDDYFWTSPLAFWEIPESLKNELSHSNLI VIKGDANYRRLLGDRHWDFTTKISDIVCYLPVPMVILRTLKSEVVAGIQPEVLEEVEK SDSAWLTNGQWGVVQLVDNN" gene complement(4274..4744) /locus_tag="DP116_16685" CDS complement(4274..4744) /locus_tag="DP116_16685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3368 domain-containing protein" /protein_id="PRJNA477356:DP116_16685" /translation="MIVVSDTSPICYLLLIDHIRVLQELYHVVIIAQTVADELNAPES PSVIRDWIAKPPDWLQIQPVETLQNVEIEKLDPGERDAILLAEKLKADLVILDDKAAR RVALERGLTIIGLLGILKDAAKSDLLDLRTVFDDLREVGFWVAPSLLEQLLKEE" gene complement(4741..5007) /locus_tag="DP116_16690" CDS complement(4741..5007) /locus_tag="DP116_16690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009547641.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UPF0175 family protein" /protein_id="PRJNA477356:DP116_16690" /translation="MQITVEIPDEIAERLNQVWGSLSRRLLETVVADAYRCGEISTAE VGQILQLPSRLETHAFLKRMGVYLNYDEAELEQDLQTLKKFRAQ" gene complement(5093..6862) /locus_tag="DP116_16695" CDS complement(5093..6862) /locus_tag="DP116_16695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317969.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="metallophosphoesterase" /protein_id="PRJNA477356:DP116_16695" /translation="MHRRSGKLAMLRYLTSRKILAALFVLVICLTLPYGCFSGQKAFS SVPQLLTDPFLQLPTQSSVRVVWFTEFAGSKHNVAYGDNLQQTVTANTTKLSRTREDQ ESKVGNQKENGQVYKQPISRDIWRHEAEVVGLTGLKRVSYRVTSVREDGKSISSNSFT LAPTPTPGTPLKILLTSDHQLKPMVATNLQKVVETVGKVDAVLLAGDTVNIADRASEW FDDNGGGAFFPCLQGRAKFETDVNGIKTSFLGGEILQHAPMFTSIGNHEVMGRFGKGK SLGDEFNDAFPRAVAMKLYGEKSLKNNSYNTDTYEEIFTLPESQEGGESYYAVSFGDV RLVVLYATNMWRTPNMDAEARGKYRERDKDLQNPENWGYGQVIFEPIAKGSQQYNWLE KELNSPEFKQAKYKVVMFHHPPHSLGDNIVPAYTHPVEIIERDANRNVKTVRYKYPKK SDYLIRDVIPLLEAAKVQMVFYGHSHVWNRFRSQSGMHFLETSNVGNTYGAFLSGDQR SVPSGDKKDYAALGDPNGLEPVLPTIAPLPGKDGKPIPYIASNNITAFSIFDTGKGTV SSYRFDISQPDSKVVKFDEFQLR" gene complement(7189..7443) /locus_tag="DP116_16700" CDS complement(7189..7443) /locus_tag="DP116_16700" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16700" /translation="MGIQSQKVCKHEPGKKWEDGCCSSNPAKASTSLVPNSYEAKFAT LLDWCKIIRACLDAGQVEEAKFFLEQAMVEAKTVSKEQYS" gene complement(7637..10192) /locus_tag="DP116_16705" CDS complement(7637..10192) /locus_tag="DP116_16705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316461.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16705" /translation="MVAITVNQKVFERLKQAYAEKFGGSPKLLIDTLNHVYHETTNNS KDVISDKTIRNFFKNTEPMKMQEKNLNFLCGVLLECESYQEALRQQAALEQVDQTNPH VNEEWLDCYQEHIRRKWGTMKVLTMTQPVQLDSIYANVNVLEVIKAKKHKTIEELLDN HKTMDELWANIFSESISFSSLNYVVSQKNVAAFDAVKRYQKLLIWGRPGAGKTTFLKH LALHYVQELGEQFIPIFISLKVFAEEEEKSNLIDVIEREFLICVPEPAQLVQELLQQG RCLILLDGFDEIVETKRNRVYRIINDFVEQFSQNKFVLTCRLGASESTFEHFTEVEMA DFNEEQVYLFVRKWFASCSEQKLGSKFLEELKINRSIKDLSKNPLLLTMLCLVFEDSY DFPKNRDLLIDEAVNILIRKWDASRRVDHSSINKFNLPYRRKINLLGKIAYEAFNQEP QKYFWQQRELEEFIRNYIENIPEIPTETLALDSLVVLKAIETNHGLLLKQSNDFYSFS HLTFQEYFVASYIVENQNPEILKEVIKRYLTNRQWREVFLIIAGRLLNADDFFKLMFT QISKLVDSKPLQDMLVWLYNVTALHKVESSSWRGFYLLVDHWFELYTNCQTKIDYNLA QQLAIMLRDLNIEREEIVKASPLNRLAFDLVKTHAQVSAKFCGDEFKPQKVTPLLKKE LSITDNMLIAPQLRDRVETLDKEKGILTINKQGIQDIDKKIQDIAGLDDRFKDVKAHL EERNISDNLKEELIFLLESFPDDDNPKEDWKQWTNSLRAAMMLYLDIGFAWKFSEAEI QTLKDYFYANILLIECIRGGSYCSKDLRNQIVDHLLLPIKNIPETLGGCLQSI" gene 11439..11804 /locus_tag="DP116_16710" CDS 11439..11804 /locus_tag="DP116_16710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015201625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16710" /translation="MSSLPNSNPYNPYVRLLGIIFSLGVALGSGLGYGFSQAISSTKQ QKLPTQTELCYVSRIEYERLQPGMSLTDVQAILGSGGTEVDRTATTATFIWENPNGYK ITTVFNIGKLESKKQTGLR" gene complement(12093..12968) /locus_tag="DP116_16715" CDS complement(12093..12968) /locus_tag="DP116_16715" /EC_number="3.5.1.15" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872973.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aspartoacylase" /protein_id="PRJNA477356:DP116_16715" /translation="MNRINRVALVGATHGNEFTGAYLIKKYEQYPHLIGRDSFETLTL FANPKAFEVGRRYIDKDLNRCFKIQDLENPTLSSYEDIRAKDINQILGPKGKSQVDVI LDLHSTTANMGLTILLGNQHPFSLQFAAHLSLTYPEVKVCWAAPVQSTLLKSICEFGF VIEVGPVAQGVLNADLFQKTEKLVYTILDYLEAYNQGSISQTNSTLTLYEYVKDIDYP RNDLGEIQAMIHQKLQFRDYEPLNPGEPMFITFDGKEITYEGESTVYPIFINEAAYYE KGIAMCVTEKRQVSV" gene 13280..14641 /locus_tag="DP116_16720" CDS 13280..14641 /locus_tag="DP116_16720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)/FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_16720" /translation="MVNAVENQPPHHVVIVGGGFGGLYAAKALSCADVRVTLIDKRNF HLFQPLLYQVATGAISPADISSPLRSILSKSKNTKVLLGEVNDIDPQGQKVFMGGEAI HYDSLILATGAKHSYFGKDQWEEFAPGLKTVEDAIEMRHRIFMAFEAAEKETDPEKRR AWLTFVIVGGGPTGVELAGAIAELAYHTMKEDFRNIDTSEAQVLLLEGLDRVLPPFAP ELSKEAEASLTRLGVTVQPKTMVTNIEGDVVSLKQGDEVKQIHAKTVLWAAGVKASPL GKLLAESTGAECDRAGRVIVEPDLSLKEHSNIFVIGDLAHFAHQNGKPLPGVAPVAMQ EGQYVASLIKQRLQGKTLPQFRYFDWGSLAVIGQNSAVVDLGFFKFTGFLAWLFWLFI HIYFLIEFDNKLVVMIQWGWTYFTRKRGARLITGKEVLENAKAGGSNGYYTPENGRQA VNL" gene 15345..16862 /locus_tag="DP116_16725" CDS 15345..16862 /locus_tag="DP116_16725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317966.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16725" /translation="MSSNSPSQHKRNRGVILTPVGWRKLENAEKESGQHFTKEQIRNR TGLSIQTISRIRKRKVAVDQDSLECYIKAFGLEKLSDKDYTHVPQENQQQDWGDAPNV SVFYDRCEEMAQLQQWVLEEDCRLIALLGMGGIGKTALAVKFGQKFKTEFEIVVWRSL QNVPTLEELLGSVLQSIMQMLQKDSVVPTSLDGKLSKLMEYFRDKRCLLILDNAETIL STGGGAGHCMQGYEGYCQLFQRIGEVSHQSCLLVTSREKPKDIVALEGEQKKVRSLQL GGLKPEDGRKLFEHRGQFTGKDAEWIRLIEHYGGNPFALKMVAAGIQQLFDGSIAEVL EYIGQGVLVFNDIRDLLDRQFSRLSPVEQEVMLWLAINPEPVSVKELKQDLASVTSKQ ELPQALYSLLRRSLIEKTGKQFSLQPVVKEYVTEQLGKQVCQEIVSTRERAESTSPLA LLQTHALMKASAKDYIQETQRQLIVQPLLEQLLIELGSQQKLVQMLKDVLEQQRD" gene complement(17202..>17395) /locus_tag="DP116_16730" CDS complement(17202..>17395) /locus_tag="DP116_16730" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16730" /translation="QYTISELDKFIEIKTYIQEGDKFFKFTNKGEFKSLSNLPTDVEA ELKNNGSVTIEYSEDNNDT" BASE COUNT 5099 a 3604 c 3556 g 5136 t ORIGIN 1 atcctcccta gccctcctta aaaaggaggg aactaagccc cctttttaag ggggtcgccg 61 taggcggggg gatcttatcc gaaccatatt gattagtcat taggaaaaca cttctgacta 121 gtgaactgtt aactgttaac tgttaactgt taactgttga ctaacgcagt aaccccattt 181 ctttcaaacc ttggcgtagt cgtttcgctt cttcaggatt ttgctgttct tcgtgaaggg 241 cgatcgcttt tttaaaactc acaagactat cttttacata gccaagtttc agcagcaata 301 cccccaagtt ttggtacgca tcagcatatt taggacttaa ctcaattgct ttttgataag 361 ctaagatagc atctttgaat aagctcattt cctttagcgt catcccgaga ttgtaatatc 421 ctgggacgaa actggagtca atcttgattg ttgcttcgta agcagtctta gcaccattta 481 aatctccagc tgctttgagt aagttgccga gattgttgta tgctcctaat ttaagaatag 541 ggtaaatggg caatttcact gcagcatcat agtgggcgat cgcattttta taattttgca 601 aacgagagta ggcaatacca agatgatagt agagttcgta taaagtctca tactcctcct 661 cacaattcgc gactccctgc gccaacaact tgataccttc tatgagtttc ccactttcca 721 cataaagcgc ccccaactta ctacaaacat 
aaggatcatt aggatgagat gcaagaaaac 781 cttccatcgc tgcttgcgct ttggtaaatt tatcttgttg agctatggca cttttttggt 841 atcctgtgtg tgaaatagca accccttcca aatagccaac ttgcaaacct ggttcctgag 901 ttaaaatttc agacacgcta tcatccacta acgcatggta agggcgagaa aagcggatgt 961 ctggatgatt acggaagagg cgcgaaacca gcgaataagg agattgttct gcttctacct 1021 catggcgcaa aaggttgatg aggaggtatt cttcccttcg tatcgcctgc ttcaactgcg 1081 gtacaatttt ctgggtgaga gtttcatctg catctaatac aagaatccaa tcacctgtga 1141 cgtattttaa agcttcattg cgagcagcac taaagtcatt acaccattca aaatgatgca 1201 ctttcgcacc aaatttttga gcaatttgtg gagtgctgtc tgtagatcct gtatctaaca 1261 ctaccatttc atcgacgaca tttttcacac tgcctagaca tttagacagc gttgcttctt 1321 cgtttttgac aatcatgcac aggcttagtt tcataagtaa ctcgtctact tgtttaaata 1381 ttgtgtaaat atgagtaact ttaattgtgt gccgaaagct ttatcaaatg tgataagaac 1441 ttcttcaaca catggtgact tcaatgataa gtcaaacact aggtagacgc tatagccgaa 1501 tggcattaag accgtttcac tttaagattg atacgaagcg tgggatgagt gcccgattta 1561 ggtaaatgag ggttagaagt ataaacggaa ctctagaaaa gtaatcctaa agacacaaag 1621 ggcgatactt ccttggaaag acgtaatcac actactacgt tcttaggatt ttaagggtta 1681 aaagctcaca gtccacgcaa accatctggt atcgaaagac ttgcaaatga acaacagttc 1741 cttgattcga cctctggctg tcgtaacggg tgcctctaac ggtatcggct acgaacttgc 1801 caaacagttt gcccaaaacg gctttgattt gatcatcaca gctactggct caagcattaa 1861 cgaagccgct caagctttcg ttggactagg tgtcaaagtc gagacggtac agtccgatct 1921 tgccacctat gacggggttg agacgctcta caacaagatt aaggcagcga accgaccagt 1981 ggatgcgatc gctatcaacg caggtgttgg tgttggcggt gactttgccc gcgagaccga 2041 tctacaggac gaactcaatc tgatcaatct gaacgtcgta tcgactgtcc atctcgctaa 2101 gcgggtggtg aaggatatgg tgactcgcgg caagggtcgc atcctcttta cttcctcgat 2161 cgccgctttg atgcctggac cgtttgaggc agtctacgca gcctccaagg cgtttgtcca 2221 ctctttttcc cagggactgc gcaacgagtt gaaggacacg ggcgtcactg tcaccgcgct 2281 catgcccgga ccgaccgata ccaacttctt ccagcgcgcg ggtatggacg ataccaacgt 2341 gggcgcgaat caaaaggacg acgcagccga agtcgccaag cagggttttg aagctttgat 2401 ggcgggcaag gatgagatca ttgcaggctc actcaagact 
aagattctgg gcaccgtgag 2461 caaaatcttg cctgataccg tcactgccga actgcacagc aagcttagtg agccggggtc 2521 agctaacaag taagtagtca gacagaatta attacacaat gtcattgcga atgaagcgaa 2581 gcggaatgta gacgcgtttg cgcagcgccc ccttaggggc tagcggcttc ccgaagggta 2641 gcaatcgcaa gggctgggat tgcttcgctt cgtatgccct ccgggcacgc tacgctaacg 2701 caatgactgt aaatattttt gtccgactac ttaataccga ttcaaaaagc gatctgcgct 2761 ccgcgcagca gcgaagctat cgcttccact cccgacgcgc aattggtgct tgaaatgcat 2821 agcacgctcg ttaatcgatc tgcgctccgc gcagcagcga agctatcgct tccacacgga 2881 caccatatcc aaaagttttg cttctttaag aaagttgtag tcaagttaac gactgcatca 2941 atgatacatt tctaaactga atcttaaatc tataaggttt gttaacaaag taccgcaata 3001 ttttaaggtg gtgctttatc acaaacaaaa gcgagtggaa caaaaatcca atacccctag 3061 attacccctc ccacaaccac ttgtgggtgc agaagctggt acgtttaccg aatttacagt 3121 cactcaacgg atgcctaata ttgcccgtag agtcatcgct gaaaataagt ttccagccaa 3181 tattaatgcc agcttagaga aactagccag tgaactgcca tcgggatatt tgccaactct 3241 tgtagatgat actggttcag attttgcaga ttggtctaaa tatttagaat catacaaaga 3301 acagcgttgg atagatattc cgtggttttt tactgaaact tatttttata gatatcttct 3361 gcaaattacc aactactttc gtgctggtga atcgcaaggt gtagatccat tcgagttgca 3421 aaaacgtcaa ggtttagaaa catcccttga ctcaatcgtt gctttatgca ctcaagtgaa 3481 tggatggttg aatgtatcag agcaagaaaa tcaattaagg caaacagctt tgataacatt 3541 attatatttt ggtttgtggg gaaatcgagt tgatctcagt ttgtggtcag catttgagac 3601 tgaccgcagt cgttttgata ttcaaaatca acaatctcat atattagtag atgatggact 3661 caaagtcaca gaattgttag tgaatagcaa ttccggacgt gttgactttg ttgtagataa 3721 tgctggcttt gaacttgtct gtgatttgtg tttggtagat tatcttttag gtagtggtgt 3781 cgcgagcgtt gttagactac atttaaagtc tcacccaaca tttgtctctg atgccatgat 3841 aaaagatgtg catcaaacaa cagaattttt attagcctca agcaatccag aagtgacatc 3901 ctttcctcaa agactacaac aatatattgc atcagatcag ctagttttgt gtgacgatta 3961 tttttggaca tcacctttag ctttttggga aatacctgag tctctaaaaa atgagttatc 4021 tcattccaat ttgatagtta ttaaaggaga tgcaaattat cgaagattgt taggtgatag 4081 acattgggat tttacgacta aaatctcaga tatcgtatgt tacttacccg 
ttccaatggt 4141 aatcctacgc actttgaaat cggaagtagt agcaggaatt caaccggaag ttttggagga 4201 agtggaaaag tcagactctg cttggttaac gaatggacaa tggggagttg ttcagttggt 4261 ggataataat taattactcc tctttcagaa gctgttcaag taaacttggt gcaacccaaa 4321 agccaacttc tcgcaaatcg tcaaaaactg tcctcaaatc cagtaaatca gatttagcag 4381 cgtctttcag aatacccaag agtccaataa ttgtcaaacc acgctccaga gctacgcgtc 4441 ttgcggcttt gtcatccaag atcaccaagt cagccttcag tttctctgct aataaaattg 4501 cgtctcgttc accaggatcg agtttttcga tctctacgtt ttggagggtt tcaacgggtt 4561 gaatttgcag ccagtcagga ggtttcgcta tccaatctct gataacagat ggtgactcag 4621 gagcgtttaa ttcatcagcc acagtttgag caattatcac aacgtggtag agttcttgca 4681 ataccctgat atggtcgatc aacagcaagt aacaaattgg tgaggtatca gatacaacaa 4741 tcattgtgct ctgaacttct taagggtttg aagatcctgt tctaattcag cttcgtcgta 4801 attcaagtaa acacccatcc gtttcaaaaa agcatgggtt tctaaacgtg atggtaattg 4861 aagtatctgc cctacttcag cagtgctaat ctcaccacaa cgataagcat cagcgacaac 4921 cgtttctagg aggcgacgag aaaggcttcc ccatacttgg tttaaccgct cagcaatttc 4981 atcgggaatt tcaactgtaa tttgcatggc gcattcaggc tacagctata agcttagttc 5041 tatagtaagc aatgatcgtc tacccttagt ccgtagggct taaaccacta tcctatcgca 5101 actgaaactc atcaaacttc acaacttttg aatctggctg acttatatca aaacgataac 5161 tgctgactgt acccttaccc gtatcaaaaa tgctaaaagc tgtaatatta ttactggcaa 5221 tatacggtat gggtttgcca tctttaccgg gtaatggagc gatcgttggt aaaactggtt 5281 ctaacccatt aggatcccca agtgcagcat aatctttttt gtcacctgat gggactgatc 5341 gctgatcgcc actcaagaaa gcaccgtaag tattgccaac attcgatgtt tctagaaagt 5401 gcattcccga ctgactacga aaacggttcc acacgtgcga atgcccatag aataccattt 5461 gtactttagc tgcttcaagt aagggaatca catcgcgaat caggtaatct gattttttgg 5521 ggtatttgta acgtaccgtt ttaacattgc gattcgcatc acgttcaata atctccacgg 5581 gatgagtata agcaggaaca atattgtcgc ccaaagaatg aggcggatga tgaaacatca 5641 cgactttata ttttgcttgt ttaaactcag gactgttgag ttctttttct aaccagttgt 5701 actgctgact tcctttggca attggttcaa aaataacttg tccgtaaccc caattttcgg 5761 gattttgcaa atctttatct ctttcccgat acttccctct ggcttctgca tccatattgg 
5821 gagtccgcca catattcgtc gcatatagaa cgactagacg cacatcacca aaactgactg 5881 cataataact ttctccacct tcttgactct caggtaaagt gaaaatttct tcgtaagtat 5941 cagtattata agaattattt ttcagggatt tttccccata caatttcatc gcaactgcac 6001 gcggaaacgc atcattgaat tcatcgccta aacttttacc cttgccaaag cgtcccatca 6061 cttcatgatt accaatacta gtaaacattg gggcatgttg aagaatttct cctccaagaa 6121 aagatgtctt tattccattc acatctgttt caaacttggc acgaccttgc aaacaaggga 6181 agaaagcacc acctccatta tcatcaaacc attcagaggc acggtctgca atattcactg 6241 tatctcctgc taacaaaact gcatccactt taccaacagt ttccacaacc ttttgcagat 6301 ttgttgctac cattggtttc aattgatggt cagaagttag gagaattttc aatggtgttc 6361 cgggagttgg ggtaggtgca agcgtaaaac tattgctgct aatactctta ccatcttctc 6421 gtacactggt gacacgataa gaaactcgtt taagcccagt taaaccaacg acttccgcct 6481 catgtcgcca aatatcacgt gagattggtt gcttgtaaac ttgtccgttt tccttttggt 6541 ttcctacttt tgattcttga tcttctcgtg tacggctgag tttggtcgta tttgctgtaa 6601 ctgtttgctg gagattgtcg ccatatgcaa cattatgttt ggaaccagcg aactcagtaa 6661 accacactac tcgcactgaa gattgcgtgg ggagttgtag aaatggatct gtaagcaatt 6721 gcggtactga cgaaaaggct ttttgcccag aaaagcaacc gtatggtagg gtaaggcaga 6781 taactagtac aaataaggca gctagtattt tgcgacttgt tagatatctc agcattgcga 6841 gttttccaga acgtcggtgc atctcagttg tatctaaaaa aacacaacga gctacacaat 6901 aaagtcgcac tgttccggat tatagtttca atactcgcgt ttagatggac atagtaatcg 6961 cactcatcac agtgaaagat actacttggt tttagcgctc ttccgtcaat cgacaacgga 7021 tgaataaagc taacgatgtc attttagccg cgctgtagca agcgagaccg agaagccaag 7081 agaaataaac ttagctacag cgtcagtttt agaactaaca acaatattcc agaaatggtc 7141 gtttatctta gcggttctaa cctcaattgg cataatttag gcgaagattt aggaatattg 7201 ctcttttgat acagttttag cctcaaccat cgcttgctcc aagaagaatt ttgcctcctc 7261 aacctgtcca gcatccaagc aagctcgaat aatcttacac caatctagca acgttgcgaa 7321 tttggcttcg taactattgg gaaccaaaga tgttgaagct tttgctggat ttgagctaca 7381 acagccgtct tcccactttt tgccgggttc gtgcttacat actttctgac tctggatgcc 7441 catagagaat aaccattcct tctaaatgct atggcgtact tacctatcat agtaccaaaa 7501 
ggtattaacc ggcaataagc ggtaaatata ggtaagatta tccaggagta gcaatctaag 7561 ggatttccaa aaaatgaacc aaataataaa taatccaacc tagtagcgtg gtaatgtgac 7621 tgaaaggaat agcctttcat atactttgca gacatcctcc gagagtttcg ggaatatttt 7681 taattggcaa tagtaggtgg tcaacaattt ggttgcgtaa atctttagag cagtaactac 7741 ctcctcgaat gcattctatt agcagaatgt tggcataaaa ataatctttt aaagtttgaa 7801 tttccgcttc ggaaaatttc caagcaaagc caatatctaa atagagcatc atcgcggctc 7861 tcaaactatt tgtccactgc ttccagtctt cttttggatt atcatcatca ggaaaacttt 7921 cgagcaaaaa tattaattct tcttttaaat tgtcagatat attccgttct tccaggtgag 7981 cttttacatc tttaaatcta tcatccaaac cagcaatatc ttgaattttt ttgtcaatgt 8041 cttgaattcc ttgtttatta atggttaata tacccttttc tttatcaaga gtctctactc 8101 tatctcgtag ttgaggagcg ataagcatat tatcagtaat cgaaagttct ttttttagca 8161 aaggagttac tttctgaggc ttaaattcat ctccacaaaa tttagcagaa acttgagcat 8221 gagttttaac taaatcaaat gctaacctgt tcaatggtga ggctttaaca atttcctccc 8281 gttcaatatt taaatctctc aacattatgg caagttgttg agctaagtta taatcaattt 8341 ttgtctgaca attggtatat aattcaaacc aatgatctac taatagataa aatcctctcc 8401 aagaactaga ttctacttta tgtaaagcgg ttacattata taaccaaacc aacatatctt 8461 gtaagggttt actgtctact agcttgctga tttgggtaaa catgagtttg aaaaaatcat 8521 cagcatttaa aagtcgtcct gcaattatta gaaacacttc tcgccactga cggtttgtta 8581 ggtatcgctt gataacttct tttaaaattt ccggattttg gttctccact atgtaactag 8641 caacaaaata ttcttgaaaa gttagatgtg agaatgaata aaagtcatta gattgcttaa 8701 gaagtaatcc atgattggtt tcaattgctt tcaacacaac taggctatca agagcgagag 8761 tctcagtagg aatttcagga atgttttcaa tatagttcct gatgaactct tctagttccc 8821 tttgctgcca aaaatacttt tgtggttctt ggttgaaagc ttcataagct atctttccaa 8881 gcaaatttat ttttcgccga tatggcagat taaatttgtt aattgaacta tggtctactc 8941 gtctgctagc atcccattta cgtataagta tatttacagc ttcatcaatc agcaagtctc 9001 gatttttggg aaaatcataa ctatcctcaa aaactaaaca caacatagtc aatagcaacg 9061 gattttttga taaatctttg attgatctat tgatttttag ttcttctaaa aatttagatc 9121 ctaatttttg ttctgaacaa gacgcaaacc attttctaac aaacagataa acttgttctt 9181 cattaaaatc 
tgccatctcc acttctgtaa agtgttcaaa tgtagattct gaagctccca 9241 aacgacaggt gagcacaaac ttattttggg aaaattgttc tacaaaatca ttaataattc 9301 gatacactcg gttccttttt gtctctacaa tctcatcaaa tccatctaac aaaattaagc 9361 aacgaccttg ctgtagtaat tcctgaacaa gttgagccgg ttcaggaaca catataagaa 9421 attctcgttc gattacatca atgaggttcg atttctcttc ttcctctgca aagactttca 9481 atgaaataaa aatcgggata aattgctctc ctaattcttg aacataatgc agagctaggt 9541 gctttagaaa tgtagttttt cctgcacctg gtctacccca tataagaagc ttttgataac 9601 gtttgactgc atcgaaagca gcaacatttt tttgactcac cacatagtta agggaactaa 9661 agcttatact ttcgctaaag atattagccc acagctcatc cattgttttg tgattatcta 9721 aaagttcttc tatggtcttg tgcttttttg ctttaataac ttctaaaaca tttacgttgg 9781 cataaatact atctaactgc acaggttgag tcatagtgag cactttcatt gtgccccatt 9841 ttcttcttat atgttcctgg taacagtcaa gccactcttc atttacatga ggatttgttt 9901 gatctacttg ttctaaagct gcttgttgtc tcaaagcttc ttgataactt tcacattcca 9961 gcaatactcc gcacaagaag tttaggtttt tctcttgcat cttcatcggt tcagtgttct 10021 tgaagaagtt gcggatagtt ttatcggaaa tgacatcttt ggaattgttt gttgtctcat 10081 gataaacatg attgagagtg tcaattaaaa gctttggcga gccaccaaac ttttctgcat 10141 aagcttgttt taatctttca aacacctttt gattcactgt gattgctacc atttatgtcc 10201 tcttaactta ccttcaagat ttgagttctt tcaaagttta agcagattta tcaactcaag 10261 tttatgcgaa caataaccgc aaaattagat aaatccttgc tgcatcaact atagctttgc 10321 tatctgtaaa aagctgctta taatctttcc ttgaaacgta taatatgatg ttgaatatct 10381 ctgggagatt caccatgtga ggcgtctgct gaatttagct atggaaatca gaggtttgat 10441 aaattgccac ttaagtcgaa tgcggtcaat ttcgtagact tttagggtca tgagtagata 10501 cattgtaacg tataaatact catactacta gctaactctc acctaaagcc ttacttttaa 10561 actttattca taagtacctc aggagactga gttttactca tcaacattct aaaagcttca 10621 gtttgaggaa agcaaggttc aggagatgaa aaacaaaata tactcaagca cctataaatt 10681 tatgagtatt gaaaacattc ttttttaagt aattaaaagg ataactttgc cgaaacacag 10741 tggcttaaag gaaggcacgc attctacaac tacgcccaaa tactgagttt acaaggatta 10801 gagacattgg gagcatcagc ctgcaaaaac agcaatatta tctttccaat gacaataacc 10861 
tttacagcct gcgcatggtg tctgattgta aatttgctac ttttaaccct taagatagag 10921 acctcaagaa agaaagcttg ataaattgaa aatacctctt aatgccgttt tataccgaac 10981 aataccgtgt gatactaaaa aaataccgca aacagtgtcg gaagtaagtt tagatcatac 11041 caagttgcca gagatattat tctttgagtc aatcaaaaac tctgcataac ccaatggttg 11101 tcaaaaattt cattgactcc attgaatgta aaggaatgta acaatactct ctttggcaac 11161 ttggtataag tcaatagcct ttgactgatt ggcattcaat caaaggctat caaaacaaga 11221 cgatgctcct caaaaaggaa cgaatactcg gcgcaaatac ttcaaaatca cgctgagcag 11281 gatttcatgc tctcaatgat tgggaatttg catatgcggc agtttttaaa tacagggttc 11341 gtaactacta aacaaatgta gaagcgctga tgcggcaaca tcaacgcttc ttaactaaag 11401 actcactatc tgctgttaca gcagggaaag tacacactat gtctagttta cccaattcca 11461 acccttacaa tccttacgtc cgcctattgg gaataatttt ctccctagga gtagcacttg 11521 gctcaggttt ggggtatggc ttcagccaag cgatttccag taccaaacag caaaaattac 11581 ccactcagac tgaactttgt tatgttagtc gcatagagta tgaacgccta caaccgggaa 11641 tgtcattgac agatgtacag gcaattctag gtagcggtgg tactgaggtt gatagaaccg 11701 ctacaacagc aacctttatt tgggagaatc cgaatggcta caaaatcacc accgttttca 11761 acatcggcaa gcttgagagt aagaaacaga ctggattaag gtaaaaaatt aagttaggcg 11821 tgaacgaggg cgtgaattgc agcgaagatt aggcgataag ccggaggctt gacgctgagt 11881 cccaagggga cacgcgcaag ggacgcgtat cgcgcaaagc gcagtgccga acaaagagca 11941 ataaaccctc gtctttgcgt cagacaaaat aatatttatt ccatccataa ctgcgattgc 12001 acggagcgca gacacctgaa ggcgataccc taaatcatac gccttcaagc ctctcaaata 12061 caaagatttc accttaactt cgtgccatac ccctaaacac taacctgtcg cttttcggtt 12121 acacacatgg caattccttt ctcataataa gcagcttcat tgatgaatat tggataaaca 12181 gttgattctc cctcataagt aatctcttta ccatcaaagg ttataaacat tggctcgcca 12241 ggattaagag gttcatagtc ccgaaactga agcttttggt gaatcattgc ctgaatctct 12301 cccaaatcat ttctgggata gtcgatatcc ttcacatatt catacagtgt gagtgtgcta 12361 tttgtctgtg aaattgaacc ttgattatat gcttctaaat agtccaaaat tgtataaaca 12421 agtttttctg ttttctgaaa caagtctgca ttcaagactc cctgagctac aggaccaact 12481 tcaatgacaa aaccgaattc acagattgat ttgagcaagg tactttgaac 
aggtgctgcc 12541 caacaaacct tcacttctgg ataagttaaa ctgagatggg cagcgaattg cagactgaag 12601 ggatgctggt tccctagcaa gatggttagt cccatatttg cagttgtaga gtgcaaatcc 12661 aaaatgacat ctacctggga tttgcctttt ggtcccagta tttggttgat gtcttttgcc 12721 cggatatctt cataactaga cagcgtcgga ttttccaaat cttgaatttt gaaacaacga 12781 ttcaagtctt tatcaatgta acgtcttccg acttcaaaag ctttggggtt ggcgaataat 12841 gttaaagttt caaaactgtc tcgcccaatg aggtggggat attgctcata ctttttgatt 12901 aggtatgccc ctgtgaattc attcccgtgg gtagcaccaa ccagcgccac tctatttatt 12961 cgattcataa tagtgctcct tttagacaat aggaagtccg caacgggtca ttaatacgtt 13021 gaacttgcct attggttata gctgatttta attcctcttt acaagaatat gtagtcttca 13081 ctgaagctta tataagtagg gtctgcatca atcgcaaccc tgaaactaga gagtgtcatt 13141 tttactacat tttttagaga ttaatcaaat actgtttata tttcttaaag atgtaaaaaa 13201 tgagcagaac tactctcaca aaattgatta aaatttgtta agaaggatga caagcggtca 13261 ataaaaggaa acaaatctta tggtgaacgc agttgaaaat cagccacccc atcacgtagt 13321 gattgttggt ggtggatttg gaggattata tgcagccaaa gcactctcgt gtgctgatgt 13381 ccgtgttacc cttattgata aacgtaattt tcacttattt caaccgcttt tataccaagt 13441 tgctactggt gcgatatctc ccgctgatat ttcctctccc ctgcgttcca tactaagtaa 13501 gagcaagaat acgaaagtgc tgttgggaga agtgaatgat atcgatcccc aaggacaaaa 13561 agtcttcatg ggtggggaag caatacatta tgattctctc atcctggcga cgggtgcaaa 13621 gcattcctat tttggcaagg accagtggga agaatttgct cccggtttaa aaactgtgga 13681 agatgcgata gaaatgcgtc accgcatctt catggcgttt gaagccgcag aaaaggaaac 13741 tgatcctgaa aaacgtcgtg cttggttaac ttttgtgatt gttggtggtg gtcctactgg 13801 tgtggaattg gcaggtgcga tcgcagaact tgcataccac accatgaaag aggacttccg 13861 caatatcgac acttccgaag cgcaagtttt actcctggaa ggtttggatc gggttctccc 13921 accgtttgca ccagagttat caaaagaagc agaagcatca ctaacccgct tgggtgttac 13981 cgtacagcca aaaacaatgg ttacgaatat agaaggtgat gttgtcagcc tcaaacaagg 14041 tgatgaagtc aagcagattc atgccaaaac agtattatgg gcagcaggtg tgaaagcttc 14101 acctctagga aagttgctag cagaaagtac aggtgctgag tgcgatcgcg ctggacgagt 14161 tattgttgaa cctgatttga gtcttaaaga 
acactccaat atatttgtca ttggagactt 14221 agcacatttt gctcatcaaa acggcaaacc cctacctggt gttgcacctg tagcgatgca 14281 agaaggacag tacgttgcct cactgatcaa acaacggctt caaggtaaaa cattaccaca 14341 atttcgttat tttgattggg gtagtttagc ggttattgga caaaactctg ccgttgtaga 14401 cttagggttt ttcaaattta caggcttttt ggcgtggcta ttttggctgt ttattcacat 14461 ctacttctta attgagtttg acaacaagct ggtagtcatg attcagtggg gctggaccta 14521 tttcacccgc aagcgtggcg caagattgat tacaggtaag gaagttttgg aaaacgcgaa 14581 agctggaggt agcaacggtt attacacgcc tgagaatggt agacaagcgg ttaatctata 14641 agggctttta cctcaccccc atcccctctc cgaattcgcc cagaggggtg cggtgaaggg 14701 cggggtgagg tcattattat tctaagtaat tagacggtgc aacctgtcta catgtagctc 14761 aaacacttgg tgcaggagat agcatctgcc gtcccattgc ctgaagtaaa gatgtgtctt 14821 cttgagtcag ccacatcttc aagagttttc agccagccat tcttataata atccgaaaag 14881 cagcaacttc agtttgtagt ttagtctata tatatagcag ccctgtttga ttagtgaaat 14941 catcataagg gaagggttgt caatagtcac ttttgactgt tgattgttga gtcttgagtg 15001 ccgataattt cacaattcat tcagcattgc tgtatctaaa aaagtgagat gcatccgatt 15061 tcagaatgaa ttcgacaggg taaattaact aggacttacg cactttacaa ataaacgatt 15121 gtgtgcagaa cgcaaagagt gccaacttcc gtgctgttct gcaatagtca gatttggcaa 15181 gaatttccca tttttatctc actggtgcat aagttgattt tggttgcacc taattaaaaa 15241 aggaatggca gcaaataagt ctgaagcaca aaatcgattt ttgtcaatgc gtaagtccta 15301 ttaacgtgtc aatacccctt gacctgaatc cttacggtac atctatgtcc tccaactctc 15361 cctctcaaca caaacgcaat cggggtgtca tcctcacacc tgtggggtgg cgcaagctgg 15421 aaaacgctga gaaagaaagc ggtcagcact ttactaaaga gcaaattcgc aatcgcacag 15481 gtctatcaat acagaccatc tctcgcatcc gcaagcgcaa agttgcggtt gaccaggatt 15541 cactagagtg ctatatcaaa gcctttggtt tggagaagct atccgacaaa gattataccc 15601 atgtcccaca ggaaaaccaa cagcaagatt ggggggacgc accgaatgta tctgtgtttt 15661 atgatcgctg tgaggaaatg gcacagctac agcagtgggt attggaagaa gattgccgtt 15721 taatcgcact cctaggaatg gggggaattg gcaaaactgc gctagccgtg aagtttgggc 15781 agaaatttaa aactgagttt gagatagtag tttggcgatc gctccaaaat gtcccaacct 15841 tggaggagtt 
attgggaagc gtgttgcaat caataatgca gatgctccaa aaagactcag 15901 ttgtgcctac tagcctggat gggaaactgt ccaagctgat ggagtacttt cgggataagc 15961 gctgtctact gattttagac aatgctgaga caatcttaag cactggtggt ggggctggac 16021 attgtatgca gggttatgag ggatactgtc aactgttcca gcgcattgga gaggtatccc 16081 atcagagttg cttgctcgtg accagtcggg aaaagcccaa agatattgta gcgcttgaag 16141 gagagcagaa aaaagtgcga tcgctacaac ttggagggtt gaaaccagaa gatgggcgaa 16201 agctgttcga gcatagaggg caatttacag gtaaggacgc agaatggatc agactgatcg 16261 aacactacgg gggcaacccg ttcgcgctga agatggtagc agcaggaatt caacagttgt 16321 ttgacggctc tattgcggag gtgttggagt atataggaca aggagtacta gtctttaatg 16381 atatccgcga tctgcttgat cgccagttta gtcgcttgtc gccagtagaa caagaggtga 16441 tgttgtggtt ggcaattaat ccggaacccg tatctgtaaa ggaattaaaa caagatctag 16501 cgagtgtcac ctctaagcaa gaactgccgc aagctttgta ctcgctgttg cggcgatcgc 16561 tcattgaaaa aacaggaaag caattctcac tgcaacctgt agtcaaggaa tacgttacgg 16621 agcaattggg taagcaagtt tgtcaagaaa tagttagcac tagagagaga gcagaaagta 16681 cctcacccct cgctcttctc caaactcatg ccctgatgaa ggcgagtgcg aaagactaca 16741 ttcaggagac gcaacgacag ttgattgtgc aacccttgct tgagcaactg ctgatagagt 16801 tgggcagcca gcaaaagctt gtgcaaatgt tgaaggatgt gctggagcag caaagagatt 16861 aaacccctat actggcaagg tatgcgggag gcaatgtcct cgatttgtta gcatatttga 16921 gtgcattaaa accctgaaag tagatcgcct tgataaaggg atgaacatca ggggggtgac 16981 tggattagca gatatcagta ctattcgctt taggaactgt ggaaggtgta gggtaggttt 17041 tattgttggc gttaagcgta gttctccgtc cgagagtgcc gcccctttta ggggctagga 17101 atcgcttctt tgagacagtt aataaacaga gcgatcgcct gttttaaggc gatcactaga 17161 agtgcgtaga cgctattttt taattcgtaa ctgctcgttt attatgtatc attattatct 17221 tctgaatact ctattgtaac acttccgttg ttttttagct cagcttcaac atcagtaggt 17281 aggttgctta gactcttaaa ttctccttta tttgtaaatt taaaaaattt atctccttct 17341 tgtatataag tctttatttc aataaactta tcaagttcac taatagtgta ttgtg // LOCUS NODE_1906_length_17328_cov_5.61998517328 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun 
sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17328) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17328) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES 
Location/Qualifiers source 1..17328 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..365 /locus_tag="DP116_16735" CDS <1..365 /locus_tag="DP116_16735" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16735" /translation="VTQPTITVPPVTQPSVPVTQPTITVPPVTQPSVPVLTQPTSTVP PVTQPSVPVLTQPTTVPQPASPSPVPVLTQPTTVPQPASPSSVPTQPTTVPQPASPSP VPASTTSQPSETPVRPST" gene 499..3048 /gene="ptsP" /locus_tag="DP116_16740" CDS 499..3048 /gene="ptsP" /locus_tag="DP116_16740" /EC_number="2.7.3.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866269.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoenolpyruvate--protein phosphotransferase" /protein_id="PRJNA477356:DP116_16740" /translation="MVGIVIVSHSRQLAEGVRELAAQMVQGKVPLAVAAGIDDPENPL GTDAMQVYEAIASIYNDQGVIVLMDLGSALMSAEMALEFLSEEQREKVHLCEAPLVEG AIAAAVAAASGRNIQQVMAQARGALVAKATQLGVNVSHILGETTDVDTLSFVIEEQVT KEIRLTVRNPLGLHARPAAKFVATAAGFQSQIKVQNITKGTEAVRADSINQVATLGVR QKHELVITATGSDADEALAALQGLVENNFGEEDATLPPLPTTPADHPIFSSSHHFLQG IPASNGVAIAPAFLYHPTLLNIQQYHVENIEQEWQRLQVALQIAHEEIQALLSQASIQ IGDAEAAIFDAHLLFLEDPVILESVHQRIFEQHLNAEAAWQAVINELANNYRTIEDSY LRERVADVVDVGQRVLRVLSRFESHIIISDDSPTHLNLSEPGILITTDITPSDTARLD PTRVLGICTTSGSALSHSAIIARRLGIPAIFGLPPEILQVANHTIVALDGESGRVWTE PEPDIQTALETKRNAQQIAHQQAIATATSPAVTRDKTRQIKVYANIGGISDTEEALSL GAEGVGLFRTEFLYLDRTTPPSEEEQLAVYQRIAQLLHNRPLIIRTLDVGGDKPIPYL NFPQESNPFLGWRGIRFCLDNPDILKTQLRAILKPSLGHQIKIMFPMITTVQEIQAAK AILAEVQAELRQAGVPFDQKMEVGIMVEIPSAVVLAEELAAEVDFFSIGTNDLTQYVM AADRTHPQVATLVDAMHPAVLRMIQQTVQAAHKAGIWVGLCGELAADPLAAPILLGLG LDEVSLNAQGIPGFKQAIAQLTMVEAEAIAASALQQDSADKIRTLIRQMLT" gene complement(3179..4531) /locus_tag="DP116_16745" CDS complement(3179..4531) /locus_tag="DP116_16745" /inference="COORDINATES: protein motif:HMM:PF00120.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutamine synthetase" /protein_id="PRJNA477356:DP116_16745" /translation="MIVSDKTKPAFDSVTQFVENLTDQGVEYIRFELPDLHGVSRSKV VPIDKVERFTRKGLNFYGGCLALDTASMVVPGSGYHAERKYRDLLLIPDLDTLTPVPW IEKTAKVICDPVWSAKEPVEVAPRYILKQLLAEAAQLGFDVMMGHEFEFYLLNPETKE PLFDGLHIFNHIRNQYVPEISQLLEYLRASGIDVITHNCEYGPSQFEINYGPSTGIRG ADKAFTFKNAVKEIVHQLGYHATFMSKPFIDKSGCCCHFHISLIDRNTGDNAFVDKDD KYGLSTTAQAFIQGILDHAAAMMPLVSPTPNCYRRLKPHTFAPSNISWGIEDRSAMVR VKVTDDESTHIEMRAASGLSNPYLSAAATLAAGLLGIKQQRKLQPSVEGPSEDNPNLP KLPQTLEEALSGLAVDVDMQNMLSQEFVHLFTTVKRFEVARFHEHLTEWERHEYLDVY " gene 4799..5896 /locus_tag="DP116_16750" CDS 4799..5896 /locus_tag="DP116_16750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009633549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="APC family permease" /protein_id="PRJNA477356:DP116_16750" /translation="MLYHINIIMSIVKVCKYYYNSYCAGISWYIAYKDIQLSTIVMLA LEGISIGVILLLGLIVLGKHGFAIDTAQLTLQGTQPGSIVSGLVLAVFSYVGFESATT LGDEAQKPLRNIPRAVIMSTVICGLFFIVLSYIEVLGFQNHTTPLNKSEAPLNDLANL AGVGFFGLVISLGAMVSLFACALATINAGGRILFSMARHNIFHASLGRAHGKNQTPHV AVTLVALVAFLLAASTTLLGVKVLDNYAYFGTIATYGFLFAYILVAIASPVYLWREHQ LRAVDILYSVLAVVFMVIPAIGSVGIPGENSLFPVPAAPYNIFPYLFLLYLVVGGGWF IMLRLRRPEIIEQMENDLEAVHTRFGDMKKV" gene 5982..7310 /gene="glnT" /locus_tag="DP116_16755" CDS 5982..7310 /gene="glnT" /locus_tag="DP116_16755" /EC_number="6.3.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008315980.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type III glutamate--ammonia ligase" /protein_id="PRJNA477356:DP116_16755" /translation="MVGSLPENKTKSLVELAHDLKLDFFLVSFTDVLGGTRAKLIPAA KIATVESDGAFFAPFACHLGLGPDSHDIAAIPDPNSLIVLPWQRNVAWVASDVYLDGE LFSASPRVIFKKILQQCESLGYSYKTGVEAEFFLLKKNDQGYEIADAMDTAARPCYDQ LNLMRQFDLISTIVSYMEELGWEPYQCDHEDGNGQFELNWTYSDALTTADRHVFFKYM VKTLAEQRGLTATFMPKPFSHLTGNGGHIHMSLWGSGNAFLDKTDEMGLSAIAYEFLA GVLAHARGLSALCNPTVTSYKRLGASNTNSGSTWSPRYISYGGNNRTHMIRIPEAGRF ECRLVDGSANLYLAQAGILAAGLEGMAKHLSPGKRLDENMFVRGSEFPNLQKLPTSLF EALQCLEQDALLMTTLGELGAKTLLEFKYQEWDAYNSTVTPWELQQYINC" gene complement(7829..10189) /locus_tag="DP116_16760" CDS complement(7829..10189) /locus_tag="DP116_16760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454605.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="WD40 repeat domain-containing protein" /protein_id="PRJNA477356:DP116_16760" /translation="MDIGVRLLFQIVLELTPVIAALIQKRNEERPTLAKYLASQEIKE FLQSVSSASHSISHSGKLEQEKIQQQQLAFDFQKTQLKIATQQQETALKLPEVQKIFE NWPLRLLPSQILESHTKTQRTPLKIFLAPPKIKFDKFDNRGEDISDIEFMLAEGLREF INQHYSLHNPIRPTEFLAGAWDSKRFHSESSIKALFGMLKTEPILILESECDENYLNF RIGYWGIGQDNYFYKTISRLPYKEIVYESARSRALEWKTIRDELIALGENLEEINNLG GDNVINLAILEKAEKWKAKGIDISKLSLQYEVNRQDIEKLCQVLITCHILCAGWVADA YHLIHNDVPPRLPELLPSFMTNLDTKSLQAIATGYKQLYQTLEVKQHHWIPNLALELA RTLLHLSNDIWAKEQVDYSVNTWLQLRQVSQQQGSHPLQAMQSAVKIEDEEYIEKLKE YFTAVGDSHSMTYAEELLNAIANHKDQRQQESAYLSHTFTGHSDKITSVAICADGNTL VSGCADKTIKIWNLSTGKVIRTLTGNIGEISSVAISPDGNFLVVGSCEHPRNNVKVWH LATGKLLHTLLGHQKPVNCVVISPDGQILASASNKIKIWNLHTGDTPAGSPPRGERIS TLWHSSAVYAAAISPDGTILASGSNDHKIRLWNPITGEPLRTLSGHSGEVKAIAISPN GEFLISGSADKTIKIWHLDTSHVVYTLSGHSDEVKSVVVSPDGQTLFSASADKTIKIW SFETGELLQTLTGHSAAVNSVAISPDDRFIVSGSSDKTIKIWQRTD" gene complement(10707..11096) /locus_tag="DP116_16765" CDS complement(10707..11096) /locus_tag="DP116_16765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997863.1" /note="Derived by 
automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16765" /translation="MKTAVIAKTLLASLSLLALFAPQSAHAQIVPQPWVSVGSQDGDV TYSVGARALNLGAELGFGPDGSTGVDILKFISLPVISPYVGLGYYSADKGVAFSGGVQ VSATDKVFVGVGYNSVRGINGQLGIRF" gene 11408..11689 /locus_tag="DP116_16770" CDS 11408..11689 /locus_tag="DP116_16770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860680.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16770" /translation="MNYTEELKNYLDEQGRVKEWPSKRNKGKFQKLVLEYLASKFEVG TIYTEKEVNALLNGHHTFGDPAMLRRELFESGLIDRKRDGSAYWRNLQN" gene complement(11680..13158) /gene="mmsA" /locus_tag="DP116_16775" CDS complement(11680..13158) /gene="mmsA" /locus_tag="DP116_16775" /EC_number="1.2.1.27" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860514.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methylmalonate-semialdehyde dehydrogenase (CoA acylating)" /protein_id="PRJNA477356:DP116_16775" /translation="MEKVMTLPNYINGQWCTSSATEYLNVINPATAEILIKVPLSPAS EVNQAAQAAAEAFVSWRRTPPTERVQYLFKLKNLLEENLEDLARTITLECGKTLAESQ GEMQRAIENVEVACGIPMMMQGTNLEDIARGIDEMMIRQPLGVAAVIAPFNFPGMIPF WFMPYALACGNTYIVKPSEKVPLTMQKIFQLLEKTGLPKGVVNLVNGAKEAVDAILDH PKIRAISFVGSTPVAKYIYSRAAANGKRVQCQGGAKNPLIVLPDADLEMTTRIAADSA FGCAGQRCLAASIAVTVGQMRDTFTEAIAETAKKRVVGNGLESGVEMGPVITTQSKTR IEDLIQKGADQGARVLVDGREPNISGYENGNFIRPTILQNVDPAGEIASTEIFGPVLS LIHLESIEEAIALINSGQYGNMACLFTTSGAAARKFRYEAEAGNIGINIGVAAPMAFF PFSGWKESFFGDLHGQSNHAIEFFTQTKVVVERWPKDWSRQF" gene complement(13639..15399) /locus_tag="DP116_16780" CDS complement(13639..15399) /locus_tag="DP116_16780" /EC_number="6.1.1.19" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="arginine--tRNA ligase" /protein_id="PRJNA477356:DP116_16780" /translation="MKATQEQLKVKLQEALGAAFGIDYAGVDPILVSASNPRFGDYQA NAALALAKQLGQQPRAIAQQIVDKLDVSDICKPPEIAGPGFINLKLKTEYLEAQLKAI QTDPRLGVAPTKNPKRVIVDYPSPNIAKEMHVGHLRPAVIGDCLSRIVEFVGHEVERI SHVGDWGTPFGMLIAYLEEAYPEALTTTETLNLGDLSSFYRQAKTRFDADANFQEAAR QAVVKLQAGDEKTLLAWKIVCQLSSRAYQVIYDLLEIAPFVERGESFYNSLLPEVVEE LDKKGLLVENQGAKCVFLEGFTNREGEPLPLIVQKSDGGYNYAATDLAAIRYRVQVDK VQRVIYPVGTEQTNHFAQIFQVGTKAGWITDDVEFEHAPFGLVLGEDGQKLKTRSGEA VRLRDLLDGAIAHARGDIEKRIKEEGREETEEFIHNVAQIVGISSVKYADLSQNRTSN YIFSYDKMLALKGNTAPYMLYAYVRTQGISREGNIDFEKLGTDAPIVLREETELTLAK HLLQLDEVISEVEKDLLPNRLCEYLYQLSDKFNKFYENCPVLKAEEPVRTSRLVLCDL TARTLKLGLSLLGIRVLERM" gene complement(15511..16236) /locus_tag="DP116_16785" CDS complement(15511..16236) /locus_tag="DP116_16785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012627268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16785" /translation="MLREIDNNIWIAEQPLKYWGLEVGTRMTVIRLTTGELIVISPIQ SDKTTIHQLNEIGNVAYIIAPNLYHHLFVYDLKSIYREAQLWGVPGLVSKRPELSFDR VITNKEGSIKEQVDYLLFDGFKLLDLSGPSIVNEFVFFHQKSRTLILTDIAFHFDETF SFKTRLAAQFLGSYKVLSPSRLDKLATSDKEKVKDSVEKILRWDFNRVIMAHGSIIET NGKQKFKQGYEWVFENTSLSKKS" gene complement(16288..17148) /locus_tag="DP116_16790" CDS complement(16288..17148) /locus_tag="DP116_16790" /EC_number="2.4.2.19" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411530.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carboxylating nicotinate-nucleotide diphosphorylase" /protein_id="PRJNA477356:DP116_16790" /translation="MSHFGVLPPGLVLDTLLRNWLLEDIGRGDRTTQTLISEEVGQAK WIAKAPGVIAGLPVAARVFQLLDEKVNFVPVVAEGSSCQRLEVVAEMDGPLDALLTGE RVALNIVMRLSGIATLTRKYVEQIADLPAKLVDTRKTTPGLRLLEKYATTVGGAVNHR MGLDDAVMIKDNHIAAAGGIGKAISRIRQYIPYPLTIEVETESLDEVEEALQHNADII MLDNMPVDMMHEAVQMIRQHDSRIKIEASGNITLETIRAVAETRVNYISSSAPITQSS WLDLSMRIRL" BASE COUNT 4917 a 3766 c 3507 g 5138 t ORIGIN 1 cagtcacaca accgacaatt acagtaccac cagtcactca gccttctgtg ccagtcacac 61 aaccgacaat tacagtacca ccagtcactc agccttctgt gccagtgtta acacagccga 121 cctcaacagt accaccagtc actcagcctt ctgtgccagt gttaacacaa ccgacaacag 181 taccacaacc cgcaagcccg tctcctgtgc cagtgttaac acaaccgaca acagtaccac 241 aaccagcaag tccgtcttct gtgccaacac aaccgacaac agtaccacaa ccagcaagcc 301 cgtctcctgt gccagcatca acaactagtc aaccttcaga aacaccggta agaccaagta 361 cataaaatct atttcgtcat ggctatggga tgaaataagg tgcgttacac ttcgttacgc 421 attctacgct tccaagccac gctcagcata aagtaagttg gtaagtacat gcttatcaag 481 ggaattgatg atgtggcagt ggttggaatt gtcatagttt ctcacagtcg ccagcttgca 541 gaaggagttc gggaactcgc agcgcaaatg gttcagggca aagtccccct cgctgttgca 601 gcaggaattg acgatccaga aaaccctcta ggtacagatg ccatgcaagt ttatgaggcg 661 atcgcctcca tttacaatga tcaaggtgtg atcgtcttaa tggatttagg cagcgctttg 721 
atgagcgcag aaatggcact tgagtttctc tcagaagaac aacgcgaaaa agtccatttg 781 tgtgaagcgc cacttgttga aggtgcaatt gcagctgctg ttgctgcagc ttctggtcgc 841 aacattcaac aggtgatggc ccaagcaagg ggagcattag ttgctaaagc cactcaattg 901 ggtgtaaacg taagccatat ccttggggaa accacggacg tagatacgtt gtcatttgtc 961 attgaagaac aagtgacaaa agaaatacgt cttacagtcc gtaacccgtt gggattacac 1021 gctcgtcctg cagctaaatt tgtcgcaaca gcagccggtt ttcaatccca aatcaaagta 1081 cagaatatta ctaaaggtac tgaagcagta cgtgctgaca gtatcaacca agtagctacc 1141 ttaggcgtac gtcaaaaaca cgaattggtt attactgcta ctggttctga tgcagatgag 1201 gcgttggcag cattgcaagg attagtggaa aataactttg gcgaagaaga tgccactcta 1261 cctcctcttc ctaccacccc agcagaccac cctatttttt cgtcttccca tcactttctc 1321 caaggtattc ctgcttctaa tggagtggcg atcgcacccg cttttcttta tcatcccact 1381 ttgcttaata tccagcaata tcacgtagaa aatatagagc aagagtggca acgtttacaa 1441 gtagctcttc aaatcgctca cgaggaaatt caagctttac tctcacaagc atctattcaa 1501 ataggagacg cagaagccgc gatctttgat gctcatctgc tttttctaga agatcctgta 1561 atacttgagt cagttcacca acgcattttt gagcaacact tgaatgctga agcagcttgg 1621 caagcagtta tcaatgagtt ggcaaacaat tatcgtacta ttgaggattc ttatttacga 1681 gagcgagttg ctgatgtcgt agacgttgga cagcgagtgt tacgagtgct aagtagattt 1741 gaatcccata ttatcatttc agacgacagc cctacccatc tgaatttatc cgaaccaggt 1801 atcttaatta caactgatat caccccttct gacactgcta ggctagatcc aacaagagtc 1861 ttgggaattt gtaccacatc tggcagtgct ctttcacata gtgccataat agcaaggaga 1921 ttaggtattc ctgcgatttt tggtttacca ccagagatac tgcaagtagc aaatcatact 1981 atagttgcac tcgatggaga gagtggcaga gtttggacag aaccagaacc agatatccag 2041 actgcacttg agacaaagcg gaatgcacag cagattgctc atcagcaagc aatagcaacg 2101 gcgacaagtc cagcagtgac tcgtgacaag acgaggcaaa tcaaagtata cgctaatatt 2161 ggcggtatct ctgatactga agaagctttg agcttgggtg cagaaggagt gggactgttc 2221 cgcactgagt tcctttattt agacagaaca acaccgcctt cagaagaaga acaattggca 2281 gtttatcaaa gaattgcaca acttcttcat aatcgtccgt tgattattcg cacacttgat 2341 gtaggaggcg acaagccaat tccttacctg aatttcccac aagaaagtaa tccttttttg 2401 ggttggcgcg 
gaattcgttt ttgtctggat aatcctgata ttttgaaaac ccagttgcgg 2461 gcaattttaa aacccagtct cggacatcaa attaaaatca tgtttccgat gattacgact 2521 gtgcaagaaa tacaagctgc aaaggcaata ttagcggaag tacaagctga actacgtcaa 2581 gcaggtgtcc cttttgatca aaagatggaa gtggggataa tggtggagat accctcggca 2641 gttgttcttg ctgaagagtt agcagctgaa gtggactttt ttagtatagg aacgaatgac 2701 cttacccagt atgtcatggc agcagatcgt acgcatccgc aggttgcaac tttagttgat 2761 gcgatgcatc cggctgtgtt gcggatgatt cagcaaactg tccaagctgc acataaagca 2821 gggatttggg tagggttatg tggagaacta gcagcagatc ctctagcagc gccgatttta 2881 ctagggttag gattggatga ggtgagtttg aatgcacaag gtattccagg atttaagcag 2941 gcgatcgccc aactcacaat ggtagaagca gaggcgatcg ccgcatccgc attgcaacaa 3001 gattccgcag ataagataag aacgctgatt cgtcaaatgt taacgtaatt ttctcatttt 3061 gatagcgcag cgtggcacga agtgccatac gttgaatttt gattttgaac tttgaatgag 3121 accgccacgc tccgctcgcg gcgctacgcg attttgaatt ctccaaaggc ggttgactct 3181 agtagacatc caaatattca tggcgttccc actctgtaag gtgttcgtga aaacgagcca 3241 cctcaaaacg cttcaccgta gtaaagaggt ggacaaactc ttgagagagc atattttgca 3301 tatccacatc tacagccagt ccagataaag cttcctcaag ggtttgaggt aacttcggca 3361 gatttgggtt gtcctcgctg ggaccttcta ctgagggctg tagtttgcgc tgttgtttaa 3421 tacccagcaa acccgccgct agagtagctg ctgcacttag gtaaggattg ctaagaccag 3481 aagctgctcg catttctata tgggttgatt catcatcggt taccttcact cgcaccattg 3541 ctgaacgatc ttcaataccc caactgatat tggagggagc aaaggtgtga ggtttaaggc 3601 gacgataaca attaggcgta ggactcacta atggcatcat cgccgctgca tgatctaaaa 3661 tcccttgaat gaacgcttga gcagtagtag ataaaccata cttgtcgtcc ttgtcaacaa 3721 aggcattgtc tccagtgttg cggtcaatga ggctgatatg aaaatgacaa cagcaacccg 3781 atttatcaat aaagggttta gacatgaagg tagcgtgata accgagttga tgaacaatct 3841 ccttgactgc attcttgaag gtaaaagcct tgtcagcacc acgaatccct gtacttggtc 3901 catagttaat ctcgaattgg gagggaccgt actcgcagtt atgggtgatc acatctatgc 3961 cagaagcacg cagatattcc agtaactgac taatttccgg aacgtattgg ttgcggatat 4021 gattgaaaat gtgtaaccca tcaaaaagcg gttcctttgt ttctggattg agaagataga 4081 actcaaactc atgccccatc 
atgacatcga acccaagctg cgccgcttct gctagtaact 4141 gcttgagtat ataacgcggt gcaacctcca caggttcttt tgcactccat acggggtcac 4201 aaatgacctt agctgttttt tcaatccaag gtactggtgt taaagtatca aggtctggga 4261 ttaatagtaa gtcacggtat ttccgctctg cgtgatagcc agaaccgggt acaaccattg 4321 aagcagtatc taaagcaaga caaccaccat aaaagttaag acctttacga gtaaaccttt 4381 ctactttgtc aattggcaca acttttgagc gagagactcc gtgtaaatcc ggtagctcga 4441 agcggatgta ctcaacacct tggtctgtta gattttcaac aaattgggtg acgctatcaa 4501 aagcaggttt tgtcttgtct gatacaatca ttttcaagcg cgtaagtgat tgacatatgt 4561 taatttagct ttaaattgaa cctatgacca tcgttatact cagaaaaaac cctgatatga 4621 acgcttggtt tgattttgtc atattgttct ttgtagcgtg ctacttacat atgtagcatt 4681 ccttgcgctt gtcccctttt ttctggacta atttttctgg actaaatgtg acactggact 4741 ggctcaccgc accaagcgtc tacaggatga caattcatac tctgattcag taacgcctat 4801 gctatatcat ataaatataa ttatgtcaat agtaaaagtc tgtaaatatt actataactc 4861 atactgtgca gggatttctt ggtacatagc ttacaaagac attcagctat ctacaattgt 4921 catgctggcg cttgagggaa tttcaatcgg ggtgattctg cttttagggc tgattgttct 4981 gggtaagcat ggctttgcga ttgataccgc acaactgaca cttcaaggaa ctcagcccgg 5041 ttcgattgtt tcaggtttag ttctggctgt gtttagctat gtaggttttg aaagtgcaac 5101 aactttagga gatgaagcgc aaaaaccact acgaaatatt ccgcgtgcgg taattatgag 5161 cacggtgatt tgtggattat tctttattgt tctctcttat atcgaagtct tgggatttca 5221 aaaccataca actcctttga acaaaagtga agcacctctc aatgaccttg ctaatcttgc 5281 aggtgtcggc ttttttggat tagtgatttc cttgggagca atggtgagtt tgttcgcctg 5341 cgcccttgct accattaacg ctggtggacg cattttgttt tcaatggcgc gtcacaacat 5401 ttttcatgct tcccttgggc gtgcacacgg aaaaaatcaa actcctcacg ttgctgtaac 5461 gttagtggcg ctagtggcgt ttctcttagc tgcttctaca acactgttag gtgttaaagt 5521 tctagataat tacgcttact ttggtacgat cgctacctac ggcttcttat ttgcttatat 5581 cctggttgca atcgcttctc ctgtttatct ttggcgtgaa caccaattac gtgcggtcga 5641 tattttatat tctgtactag cagtcgtatt catggtgatt cccgcgattg gtagcgttgg 5701 tattcctggt gaaaacagtc tttttcccgt tcctgctgct ccatacaaca ttttccccta 5761 cttgttccta ctgtatctgg tagttggtgg 
tggttggttt attatgctgc gcttgcgccg 5821 tccagaaatt attgagcaaa tggaaaacga tctcgaagct gttcataccc gctttggtga 5881 tatgaaaaag gtttgatatt gcaacgtatt tgagttgtaa gatttcgttg tgaaacgaat 5941 catagagaac acacaggaca tacgaactgg gagaaattgc gatggttgga agtttgccgg 6001 aaaataaaac gaaatctctt gttgagttag ctcacgactt aaagctagat ttcttcctgg 6061 tatcatttac ggacgttttg gggggaactc gtgccaagct cattcctgct gctaagattg 6121 ctacagttga atctgatggt gctttctttg ctccttttgc ttgtcattta gggttaggac 6181 cagactctca cgatattgct gctattcctg acccaaattc tctgattgtt ctaccttggc 6241 aaagaaatgt tgcttgggta gctagtgatg tttatttaga tggtgaattg tttagcgcgt 6301 ctccacgtgt cattttcaaa aaaatactcc agcagtgcga aagcctgggt tatagctata 6361 aaactggggt agaagctgag tttttcctac ttaagaaaaa tgaccaaggt tatgaaattg 6421 ctgatgcaat ggatacagcc gcccgacctt gctatgatca gttgaactta atgcggcagt 6481 ttgatttgat ttccaccatt gtgagttaca tggaggagtt aggctgggaa ccttaccagt 6541 gtgatcacga agatggtaac ggtcaatttg aactcaactg gacttacagc gatgcactga 6601 caactgctga tcggcatgta ttctttaagt acatggtgaa aactctagcc gaacaacgag 6661 gtttgacagc gactttcatg cccaagccat tctcgcattt aactggaaat ggtggacaca 6721 tacatatgag cctttgggga agtgggaacg cctttttgga taaaacggat gagatgggat 6781 tgagtgcgat cgcctacgaa tttcttgctg gcgttctcgc ccatgcccgt ggattgtcag 6841 cactgtgcaa tccaacagtc acatcataca agcgattggg agccagcaat accaattctg 6901 gaagtacctg gagtcctcgc tacatatctt acggcggtaa caaccgcacg cacatgattc 6961 gcattccgga ggcgggaagg tttgaatgtc gtcttgtgga tggatctgct aatctttact 7021 tagctcaggc aggaattttg gcagcagggt tagaaggaat ggcgaagcat cttagccctg 7081 gaaagcgctt ggatgaaaat atgtttgtgc ggggttcaga gttccccaac ctccagaaac 7141 tgccaactag cttatttgaa gcactccaat gtttagaaca ggatgctttg ttgatgacta 7201 ctctgggaga attaggagca aagactttgc tggagttcaa gtatcaagaa tgggatgctt 7261 acaactccac agtgacacct tgggaattgc agcagtacat caattgctga tggctaattg 7321 cctttttgcc attggttatt aggaattgat ataggggtgg acgagtatga aggaaaaaac 7381 tgaattttat caaaaccaat acctaaagaa ttagctattg gtcttgatcg aattcagtta 7441 tcacagaaaa acaggtttgc aggggaattg caccaagcgc 
taatagtaat acgagtataa 7501 gcgatagaat ggcaatgctg aacgattgat atagtcatat tcctgcatct acaattgtgc 7561 aaatccaaca tatcaaactt gctttggtca aatgaaaaag gtttgataag taggtaggcg 7621 ggaaaattta cataaaagga agaagaaaca agtaagggta cactttaaaa ctttagttag 7681 gggtataaag ccatacgctt ggtgataagg cttttttctc tccttgcaac ccaagcgcct 7741 ctcctgctta agaactacct caatggataa atttcttaac agcaaccgta ttgtcttatc 7801 atctccccca ctacgctacg ttcgatacct aatcggttct ctgccaaatt ttgattgttt 7861 tatcagaact accacttaca ataaagcgat cgtctgggct aattgcaaca gaatttactg 7921 ctgctgagtg tccagtgaga gtttgtagca attctccagt ctcaaaactc caaatcttga 7981 tagttttatc agcactagcg ctaaaaagag tttgtccatc aggactgaca acaacagatt 8041 taacttcatc tgaatgtcca ctgagcgtat agactacatg acttgtgtcc agatgccaaa 8101 ttttaattgt tttgtctgcg ctaccactga ttaagaattc accattagga gaaatagcta 8161 ttgctttcac ctcacctgaa tgcccactaa gagtgcgtaa tggttctcct gtaattggat 8221 tccatagtct aattttatga tcattactac cactggctag gattgtacca tctggactaa 8281 tcgcagcagc ataaactgcg gatgaatgcc aaagagtgga aatgcgttcg ccccttgggg 8341 gactccctgc tggagtatcg cctgtatgca aattccaaat cttaattttg ttactagcgc 8401 tggcaagaat ttgtccatct gggctaatca ctacacagtt aacaggtttt tgatgcccta 8461 aaagagtatg caataattta ccagtcgcaa gatgccagac tttaacatta tttctggggt 8521 gttcgcagct accaacaacg aggaaatttc catctgggct aatagccaca gatgaaattt 8581 ctccgatgtt tccagttaaa gtgcgaataa cttttcctgt gctaagattc caaatcttaa 8641 tcgttttatc tgcacaccca ctgactaaag tgtttccatc tgcgcagata gctacagatg 8701 tgattttatc agagtgtcca gtgaaggtgt gactgagata agctgattct tgttggcgtt 8761 gatctttatg gttagcaata gcattcaaca gttcctcagc ataagtcata ctatgactat 8821 cgccaactgc tgtaaaatat tctttcaact tttcaatgta ttcctcatct tctatcttca 8881 cagctgattg cattgcctgt aaaggatggc taccttgctg ttgtgaaact tggcgtagtt 8941 gcaaccatgt gttaactgag taatctacct gttcctttgc ccatatgtca tttgataaat 9001 gtaataaagt ccgcgccaat tccaatgcta aattaggaat ccagtgatgc tgcttcacct 9061 ccagagtttg gtagagttgt ttatatcctg tggcgatggc ttgtagtgat tttgtatcaa 9121 gatttgtcat aaaacttggt agtaattcag gtagtcgtgg aggaacatca 
ttgtgaatca 9181 aatgatatgc atctgctacc caacctgcac agagtatgtg acatgtaatc aaaacctggc 9241 aaagtttttc aatatcctgg cgattgactt cgtattgtaa ggatagctta ctaatgtcaa 9301 tgccctttgc tttccatttt tctgcttttt ccaaaattgc taaattaatt acattgtctc 9361 cgccaagatt attaatttct tctagatttt ctcctaatgc tatgagttca tctctaatgg 9421 ttttccactc aagagcacga cttcttgccg actcataaac aatttcctta tatggtaagc 9481 gagaaatcgt tttataaaaa taattatctt gtccaattcc ccaataacca atacgaaaat 9541 ttaaataatt ttcatcacat tctgactcta gaattaaaat aggctctgtt ttcaacattc 9601 caaacagagc cttaatactg gattcactat gaaaacgctt actatcccat gcacctgcta 9661 aaaactctgt tggtctaatt ggattgtgta gagaataatg ctgattaata aattctcgta 9721 aaccttcagc taacatgaac tcgatatctg aaatatcttc tcctcgatta tcaaatttat 9781 caaactttat tttaggagga gcaagaaaaa ttttgagtgg agtgcgttga gtcttggtgt 9841 gagactctaa aatttgtgaa ggcaataatc gtaaaggcca attttcaaaa attttctgta 9901 cttctggtaa tttgagggct gtttcttgct gttgtgttgc tatctttagt tgtgtttttt 9961 gaaaatcaaa tgctaattgc tgttgctgaa ttttttcttg ctccagtttt cccgaatgac 10021 tgatactgtg actcgcactg ctaacagatt gaagaaattc tttaatctct tgggaagcta 10081 ggtattttgc gagtgtcgga cgttcttcat ttcttttttg tattaaagct gcaatgacag 10141 gtgtcaactc taatactatt tgaaacagta atcttactcc tatatccata aaaagatatc 10201 tgccaggtca atacagttaa tactgtttaa tgtttgatga tagccatatt ttgtgttaat 10261 tcctgctatc tacaccgaaa gatagatata gcctatccta ctactaactg gcgtttgatt 10321 ctttattcaa aaagaaataa tatgccagaa atcatcaaaa aataagttat acagatattt 10381 agtaataaaa aaataattat aaatagctga taattataac aattgatatg acaaccatta 10441 caatcgcttt tctgttcact caactattat ataacaattt taaatgagtt gtcagataat 10501 gcttttggct tcatctctca taaattgtgt tttttgcatt ttagtagttc atttgagagt 10561 tcaatcaggt aaaaaataga aaaaacatca attagcttta aaaaaggaga ttaatcattt 10621 atcttgctga ttgggaaaat gttgaggtaa ggtacgtgga accctatatt tcacgtacct 10681 tagaaaaaga cgtatagaga taaaaattaa aatcttattc ccagttgacc gttaattcct 10741 cgaacagagt tgtaaccaac accaacgaaa actttatcgg tagcgcttac ctgaacgccg 10801 ccagaaaaag caacaccttt atctgctgaa tagtatccca 
gtcctacata aggtgaaatc 10861 acaggcaaac tgataaattt taatatatct acacctgtag aaccgtcagg accaaatcct 10921 aactctgctc ccaaattcaa agcccttgcg cctacagaat aggtaacatc accatcttgg 10981 ctaccaactg atacccaagg ttgcggtact atttgagcat gagcactttg tggggcaaat 11041 aaagctaata aactgagtga tgcaagcaat gtttttgcaa taaccgctgt tttcacgttc 11101 tactccttca ctacctgcga cattgtcatc tttttggcag aaataccact gacaaaaaac 11161 tgagtactta agctaattgc aacagacttg actgctgcta aattcccaac gaacatctgc 11221 agcagttgac ttgtccggtg cttactgctg ttaattaagg tttgtctagt ataactgatg 11281 gtgacgaaat gactcttact taagtgccca ctttagcagt gggtagagtc gattttccaa 11341 aacattgata cgaaatcgtg attggtctta atgtttcttt ctttgtaaca cagcacgaaa 11401 atgacctatg aattacacag aagaattgaa aaactattta gatgaacagg gacgtgtgaa 11461 agagtggccc tctaaacgca ataagggaaa gtttcaaaag ttggtgttgg agtatttagc 11521 gtcaaaattt gaggttggta ccatttatac agaaaaagag gtgaatgcac tgcttaatgg 11581 gcaccacacc tttggcgatc ccgcgatgtt gagacgagag ttatttgaaa gcgggttaat 11641 tgacagaaag cgggatggtt ccgcttactg gcgcaatctt caaaattgac gcgaccaatc 11701 tttaggccaa cgttccacga cgactttggt ttgggtgaaa aattctatag catgattgct 11761 ttgaccgtgc aaatcaccaa aaaagctttc tttccaacca ctgaacggga aaaaagccat 11821 tggtgcggca actcctatgt tgatgccaat attaccagct tcagcttcat agcggaactt 11881 ccgggcagct gcgccactgg tggtgaaaag acaagccatg ttgccatatt gaccgctgtt 11941 gatgagggcg atcgcctcct caatactctc taaatgtatc aaactcagta ctggaccaaa 12001 aatctctgtg ctggcaattt cacctgcggg gtcaacgttt tgcaaaatag tcgggcgtat 12061 aaaattacca ttttcataac cagatatatt cggttctcgt ccatccacta acaccctcgc 12121 cccctgatct gccccctttt gaatcaaatc ctcaattcgt gtcttacttt gggttgtaat 12181 cactggtccc atttctacgc ctgattctaa accattaccg acaactcgct ttttagcagt 12241 ctcggctatt gcttctgtaa aggtatcacg catttgtccc acagttactg ctattgaagc 12301 agcaagacaa cgttgtcctg cacaaccaaa agcactgtca gcagcaatgc gtgttgtcat 12361 ctctaaatct gcatctggta aaacaatcag gggatttttt gccccgcctt ggcattggac 12421 acgtttacca tttgctgccg ccctactata tatatattta gcgacaggtg tagaaccaac 12481 gaagcttatt gctcgaattt 
tcggatgatc caaaattgca tctacagctt ctttggcacc 12541 gtttaccaag ttgacgacac ctttaggcaa tcctgttttt tctaacaact ggaatatttt 12601 ttgcattgtt agcggtacct tctccgacgg cttgacaatg taggtgttac cacaagctag 12661 ggcataaggc atgaaccaaa agggaatcat ccccggaaag ttaaatgggg caatgactgc 12721 agcaactcct aaaggttgcc gaatcatcat ttcatcgata cctctggcaa tatcttctag 12781 gttcgtaccc tgcatcatca tggggatacc acaggcaact tctacatttt caatcgcacg 12841 ctgcatctcg ccttgggact ctgccaaagt tttaccgcat tccaacgtaa ttgtacgagc 12901 caaatcctct aaattctctt ctagcagatt cttcagttta aataaatact gtactcgttc 12961 cgtgggtgga gtgcgtcgcc aactcacgaa agcctctgct gctgcttggg cagcttggtt 13021 tacctcagaa gcaggtgata aaggtacttt aatcagtatc tctgctgttg ctgggttgat 13081 aacattcaag tattctgtag cactagatgt acaccactga ccattaatgt aattaggtaa 13141 tgtcatcact ttttccatca gtcaaagatg aattctcaca aaagtaactg tacaccctag 13201 aagtaggtcg aggcagctag gactggttca aacagtttta gcccgaaaga gcaaaaccat 13261 gttttgcaat gctgctttag tttagcccat gaagaaacaa cagtttggaa tcctggtttt 13321 aaaaaaccag gagagtttaa gataggacgc aaaattctaa aacatttctg acattgcccg 13381 acaaaatgtc gcttctattc gtgctaaaat gtgttaaatg aactgttttg agcttgtgta 13441 accattggtt ccggcattat caaacagttc aaataagcgg tttaagtttt cttaaacctt 13501 tgtgggatac caccaaagtt gaaaaatcaa caatggtata cctggttttt tagcttgtct 13561 aaaatgtcca ggttgaaagt atcagttaaa gtatctttgt agtttttaaa aaaaaaccac 13621 aaggatacaa aaaaacacct acatcctttc gagaacccta attcccagca acgagagtcc 13681 cagctttaga gttctagccg tcaaatcaca cagtactagg cgcgatgtcc gcacaggttc 13741 ttcagctttg agaacagggc aattttcgta gaacttgtta aatttatcac tcaattgata 13801 caaatactcg cacaagcggt ttggtagtaa atctttctcg acttcactta tcacttcgtc 13861 aagctgtaac aagtgctttg ccagagttaa ttctgtttct tcccgcagga caattggagc 13921 atctgttccc agcttttcaa agtcaatatt accttcacga ctaatgccct gagtcctaac 13981 ataagcatag agcatatacg gtgcagtgtt gcctttgagc gctagcatct tgtcatagct 14041 aaagatgtag ttgctggtgc ggttttggct taagtcggcg tatttaactg aactaatacc 14101 aactatctgc gcaacattat gaataaactc ttctgtttct tcccgccctt cttcttttat 14161 
tctcttttct atgtctccac gggcgtgggc gatcgcacca tccaacaaat cccgcaatcg 14221 cacagcttcc ccagaacgag ttttcaattt ttgaccatct tcccccagca ccagaccaaa 14281 gggtgcatgc tcaaattcta catcgtctgt aatccagcct gcctttgttc ccacctgaaa 14341 aatctgagcg aagtgatttg tttgttctgt accaactggg taaattaccc gttgcacttt 14401 atctacttgc actcggtagc gaattgccgc taaatctgta gctgcgtagt tatatcctcc 14461 atctgatttt tgcacaatca agggtaaagg ttcaccttcc ctgttagtaa agccttccag 14521 gaagacgcac tttgcaccct gattttccac tagtagacct tttttatcta attcctctac 14581 gacttcaggg agtaaagaat tataaaacga ttcacctcgt tcaacaaaag gagcaatctc 14641 cagcaagtcg taaataactt ggtacgctcg actggagagt tgacagacta tcttccatgc 14701 gagtaaagtt ttttcatctc ctgcttgtag ttttaccaca gcctgacgag ctgcttcttg 14761 gaaatttgca tctgcatcaa atcgtgtctt tgcctgacgg taaaacgaag acaaatcacc 14821 taaatttaaa gtttcagtag tggttaaagc ttctgggtac gcttcttcca aataagcgat 14881 aagcattcca aatggagttc cccagtctcc tacatggcta atgcgctcga cttcatgacc 14941 gacaaattct acaattcgag acaggcaatc tccaattact gctggacgca aatgtcctac 15001 gtgcatttct ttggcgatgt ttggactggg gtaatctaca atgactcgct tgggattttt 15061 agttggtgcg actcctaacc ttggatctgt ttggattgcc ttgagttgtg cttccagata 15121 ttctgttttt agcttcaaat tgataaagcc aggaccagca atttctggtg gcttgcaaat 15181 atcagataca tcaagtttat caactatttg ttgtgcgatc gcccttggtt gctgtcccaa 15241 ttgctttgct agcgctaaag cagcattggc ttgataatca ccaaatctag gattgctagc 15301 agaaactaaa attggatcta ctccagcata gtctatgcca aaagctgcgc ccaaagcctc 15361 ttgcaattta acttttagtt gttcttgtgt agctttcatg tttattgttt ttttgtataa 15421 gcaacgttat tttatctacc cctttatcaa ccatcaactg ccaagatatc ggaatacata 15481 agatatcacg cctctgcaac aattgacata ttaacttttt ttgctcaaag aggtgttctc 15541 aaaaacccat tcataacctt gtttgaattt ttgctttcca ttcgtttcaa taatgctacc 15601 gtgagccatg atgactctat tgaagtccca acgaagtatc ttctcaactg aatctttcac 15661 cttttcctta tcgcttgtag ctaatttgtc taagcgtgat ggactcagta ccttgtatga 15721 ccccaggaac tgggctgcta accgcgtttt gaaagaaaag gtttcatcaa aatgaaaggc 15781 tatgtcggtc aagatcaaag tccggctttt ttgatggaag aacacaaact 
cattaactat 15841 tgatggacca cttaaatcaa gaagtttaaa gccgtcaaac aataaataat caacttgttc 15901 cttaatgcta ccttctttat tagttatcac tctatcaaaa gagagttcgg gtcttttaga 15961 aactaaacct ggtacacccc aaagctgcgc ttctcggtaa atggacttca agtcatatac 16021 aaaaagatgg tgatagagat ttggtgcaat aatataagca acgttaccaa tttcattgag 16081 ttgatgaata gtagttttgt cagactgaat tggggaaata actattaatt cgccagttgt 16141 aaggcgaata actgtcatcc gcgtaccaac ctcaagtccc caatacttca acggttgttc 16201 tgcgatccaa atattgttgt cgatttctct gagcatggtt tatcgctgac aagcagtgtg 16261 aatcaacttc agaatacttc aatattgtta aagtcgaatc ctcatactta aatccaacca 16321 agaggattga gtaattggcg cactactaga aatatagttt actcttgtct cggcaactgc 16381 acggatagtt tctaaagtaa tattgccaga ggcttcgatt ttaatgcgac tatcatgctg 16441 tcgaatcatc tgcaccgcct catgcatcat gtcaacaggc atattgtcca acataataat 16501 gtcagcgtta tgctgcaaag cttcctccac ctcgtctaaa ctttccgttt ctacttcaat 16561 cgttaaggga taaggaatat attggcgaat gcgggaaatt gctttgccaa ttcccccagc 16621 agctgcaata tgattgtctt taatcatgac tgcatcatcc agtcccattc ggtgatttac 16681 cgctcctccc acggtcgttg cgtacttttc caatagtctc agccctggtg tggttttgcg 16741 cgtatctact aattttgcgg gtaagtcagc aatttgctct acatatttcc tagtcagcgt 16801 ggcgatgcca cttaggcgca taactatatt aagagcaacc ctttctcccg ttagtagtgc 16861 atccagtgga ccatccattt cagctaccac ttctaaacgc tgacaagaac taccctcagc 16921 cacaacaggt acaaagttga ctttttcatc taaaagctga aatacccttg ccgcaactgg 16981 taaacctgct ataaccccag gcgctttcgc tatccacttt gcttgtccta cttcttcact 17041 tattagggtt tgcgttgtgc gatcgccccg accgatatcc tccaacaacc aattgcgcaa 17101 cagcgtatct aaaacaagcc cgggcggcaa aacaccaaaa tgactcacag cttcattatt 17161 cccttctcga cttccagaac tactatagcc tagagtgctt actatatctg gttttcacaa 17221 ctcaatcaaa aaaaactcaa aaaacttttc caaaagagtt gacaaccaag agtgggttcg 17281 ctatattgaa taagtgccgg aagagcggac gccaaacaaa gccgacgc // LOCUS NODE_1936_length_17130_cov_5.31051217130 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17130) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17130) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..17130 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..3758) /locus_tag="DP116_16795" CDS complement(<1..3758) /locus_tag="DP116_16795" /inference="COORDINATES: protein motif:HMM:PF00400.30" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="WD40 repeat domain-containing protein" /protein_id="PRJNA477356:DP116_16795" /translation="MTEGNIKQTINGSQYAAMSATGDVFMYVIHNYREDDRTVGIKPA DTSLEDLDRPCPYRGLSYFETQHAELFFGRNSQIDELKTATQHRNFIPILGASGSGKS SLVFAGLIPKLAEQGNWLFTYFRLGSHDDPFYAIAEALLPLYRSDDEKTTAMVLAQKL KDNKLEIAKILTRIQGKHPQYKLLLIADQFEQLYTSYKDEIQHLFLDLLLSIIQASND ESLSTVMVTTMRADFLDKALSYPPFAEALKQGDIKLGSMTPDKLEQVIEKPALKYGVT FQDGLVERILGDIYNKEDCLPLLAFALEELWNKRTERLKKAALQGKQTDRQLTHEDYT AIGQVKGALATYADDVYNNLTLEQKEQVPKIFIQLVNFSQFTKDRTDRRYVRRVAKKT ELGEKRWRLVQILAEKRLVVTNRNADNEDTVEIIHETLIKQWPLIETWMNENRDFGTW LERMRAAMSQWEKSDRDSGALLRGKPLADAEDWLQKHEEDLTNEKEYIHQSLQLREQE KAEKERQELEKLEAQVALDTATERNQILTDANQKAKKKIRYSNIYLAVSVFLGAVFLG SAAIAINKQLEAQKGTKLEQAGVNALRQMPSGEIDALLSAMQAGQELKQMIRDNTLLK DYPATSPLLALQKIEDSIHEQNRIDTGQEQIKSVSFSPDGKYIATAGKNDTVILWSPS GEKKWIKKGLQRVLADSVKTMNFVAFSPDGKKIAAGEGDGTITLWDLSGNQLTSFKAS TTNFKSLSFSPDGQKIATADEEKARLWDLWGKQLAEFVGHKGRVNSISFNPNGQQVAT AGYDGTVRLWEVSGKQLKQFTAHKGQQILSLSFSPDGKYLTTAATGDNTALLWNLSAQ DPVKLEGHQGSVLNVGFSRDGKIIATTSNDGTVRLWDLSGKPITTLQGHRGAVSSASF SPDGKYLVTGGVDNTARIWNLSKHNQPINKFQGHQKDVNSVSFSSTGQQIVTADHEGI VKLWNLSGQEQASWQADRRGPLWSVNFSPDGQLIATGGYDNTVAIWDLSGKLKTRLKG HKNLINNLSFSPNGQMIVTSGADKTARLWNLSGKLLTILEGHQDVVERASFSPDEKTI ATGGWDGNVIIWDLSGHKIKEWQTKQGKISGLSFSPDGKQLATADKSGVIKIWNLSGN QPLEFFSYQTGVSSLSFSPNGQYLASGGMDSTVRLWNLKGYQIAEFKTGKGAIWGISF SPDGKSIVAGGDNGVVQVWQIKPLDELLSQGCDWLENYRKNPAKEINI" gene complement(3885..4283) 
/locus_tag="DP116_16800" CDS complement(3885..4283) /locus_tag="DP116_16800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008273708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16800" /translation="MDPITSAIILGVAGNFATDAIKAAYNSLKDALTKKHGQDSDLVD AVNKLEKTPDRDDRKVTVETEVKIAKANDDAELVKLAQHLLAQLKEQPGGIPSINMTV SHVKFAATSATGSATISQINDNAPAEDWKR" gene complement(4723..4872) /locus_tag="DP116_16805" CDS complement(4723..4872) /locus_tag="DP116_16805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017293361.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_16805" /translation="MNKLTKRLNFRLTEEEYVLLEKYCSATVRSKNDVLRELIRTLKR KMSDD" gene 4928..6088 /locus_tag="DP116_16810" CDS 4928..6088 /locus_tag="DP116_16810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008184609.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_16810" /translation="MAYKAFRTKLKLSTQQKPVMAQHAGYSRWVYNWGLKLWEQAYKD GLKPSVGMLKKFFTNHVKPQYQWMNQLSSKVYQYAFINLGEAIARFFAKKGKSPRFKK KGKADRFTIDNSGAPIKVGGLRHKLPFIGWVRTYEALPECITKKVTISQQAGDWYLSF HIEIPEASPTPKSIDRVGVDLGVNALATLSTGAVYPNLKAYRKAKHKLAKLQRLASRK QKGSNNRHKANIKVARQHRRVASIRNDYLHKITTYLAKNHGEVVIEDLNVSGMLANHK LASAIADCGFYEFRRQLEYKCERYGSSLVVVDRFYPSSQICSNCGYRQKMPLKERVYI CPCCNVSRDRDLNAAINLSNWGRLDPGKPVEQVPPTVCDETGSKRFKQLCLF" gene 6116..8164 /gene="ligA" /locus_tag="DP116_16815" CDS 6116..8164 /gene="ligA" /locus_tag="DP116_16815" /EC_number="6.5.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872751.1" /note="this protein catalyzes the formation of phosphodiester linkages between 5'-phosphoryl and 3'-hydroxyl groups in double-stranded DNA using NAD as a coenzyme and as the energy source for the reaction; essential for DNA replication and repair of damaged DNA; similar to ligase LigB; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA ligase" /protein_id="PRJNA477356:DP116_16815" /translation="MQAINTEAKQRVQELRQLLQKASYAYYVLDSPIMEDAVYDQLYR ELQQLENQYPELVTPDSPTQRVGEKPATQFISVRHNVPLYSLENAFNIEELKAWEGRW RRQAPNVGQVEYVCELKIDGSALALTYENGILTRGATRGDGVTGEDITQNVRTIRSIP LRLNLETLRENSLPDRIEVRGEAFLPLEVFKQINEERQKAGESVFANPRNAAAGTLRQ LDSRIVAQRRLDFFAYTLHIPGRDDASIANTQWEALELLQKLGFRVNPNHKLCPSVDD VAQYYEYWDTERLNLPYMTDGVVVKLNSFKLQEQLGFTQKFPRWAVALKYAAEEAPTR VENIAVNVGRTGALTPLAQMRPVQLAGTTVSRATLHNADRVAQLDIRIGDTVIVRKAG EIIPEVVRVLKELRPDNTQSFVMPSHCPVCGQPVIREMGEAVTRCVNASCAAILKGAI EHWVSRDALDIRGMGEKLVHQLVDKGVVHSVADLYNLTEEHLCGLERMGKKLAQKLVE AIAQSKNQPWSRVLYGLGIRHVGSVNAQLLTQKFPTVEQLAEASQADIETVYGIGEEI AQSVYQWFHISANQTLISRLKDAGLQFASTEEETPLSQSHQKFTGKTFVITGTLPTLK RDEAKALIQKAGGKVTDSVSKKTDYLVVGEEAGSKLTKAQELGVRQLNEGELLELLED " gene complement(8375..9181) /gene="egtC" /locus_tag="DP116_16820" CDS complement(8375..9181) /gene="egtC" /locus_tag="DP116_16820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407049.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ergothioneine biosynthesis protein EgtC" /protein_id="PRJNA477356:DP116_16820" /translation="MCRLVGYLGNTIPLDELLYKQEHSLYNQSYNPVELKSGVVCADG SGVGWYDKEGKPFIYRNTIPIWNDPNLEELSHYVQSTCALGYVRLAGTGESLDISNCQ PFRSDKLLFVHNGEITNFQQTLARPIRDSLSDSTYRLIKGMTDSEHIFALLVEMWQLS PGSTLLSALRATLEKLTVLAKKYDTSFSANIIVSDGQALAATRYAYGTQAPTLYWSCD DAKPPTQVIVASEPLSNQNWTAFPDQSTLFVQSESLQPTTSLIMSELPYF" gene complement(9358..9858) /locus_tag="DP116_16825" CDS complement(9358..9858) /locus_tag="DP116_16825" /inference="COORDINATES: protein motif:HMM:PF01471.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_16825" /translation="MVENIYSFRQITGGHMVVSVLKEGSTGPEVIDLQFILKFRDGKN EFDPGATDGSFGSKTKAAVVKFQQSRKLTADGIVGGKTWEALRPRSDWPKEPGEFLRE GEKGEVVKQLQEGLKSEGVYTGAIDGIFGPNTKAAVIKIQKSDEIVSNTVGVVGPLTW GGIIGD" gene complement(10239..10940) /locus_tag="DP116_16830" CDS complement(10239..10940) /locus_tag="DP116_16830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316220.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipoyl(octanoyl) transferase" /protein_id="PRJNA477356:DP116_16830" /translation="MSLHNHQCLLYNQVVIPYSDALMWQRTLLAERIQNPSLEDVLIL LEHPPVYTLGQGASSEFLKFDHTKSTYEVHRVERGGEVTYHCPGQLVGYPILNLHHYR KDLHWYLRQLEEVLIRTLKIYGLQGERNPGFTGVWLEGRKVAAIGIKVSRWITMHGFA LNVCPDMSGFERIVPCGIADKPVGSLAQWIPDISLLEVRAHVAKCFAEVFEVELIAPD VNGSSIRCKWEDENP" gene 11024..11248 /locus_tag="DP116_16835" CDS 11024..11248 /locus_tag="DP116_16835" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16835" /translation="MQRILKESQGNWFSKIQGANFEYITSIELGGILCSNHHANKRVT SLLADDAETMVDIKYKIRSAKRPVYFRKND" gene 11850..12506 /gene="raiA" /locus_tag="DP116_16840" CDS 11850..12506 /gene="raiA" /locus_tag="DP116_16840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316221.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribosome-associated translation inhibitor RaiA" /protein_id="PRJNA477356:DP116_16840" /translation="MKLVIHGKNIEITDAIRDYVHHKIEKAASHYQNITNEVDVHLSV ARNPRINPKQSAEVTIYANGSVIRAEESSENLYASIDLVADKIARRLRKYKERRQEKK THAQTTIEGVVQEAVVTDLIGDRTPELPEEVVRCKYFAMPPMTMAEALEHLQLVGHDF YMFHNVETGEINVIYERNHGGYGVIQPRHTNNGHTNSKNGKTSNGYVAMPEKTFQNKV " gene complement(12603..13061) /locus_tag="DP116_16845" CDS complement(12603..13061) /locus_tag="DP116_16845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316222.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peroxiredoxin" /protein_id="PRJNA477356:DP116_16845" /translation="MAVKVGDTAPDFTLPSQNGTPVSLQDFRGKPLVLYFYPKDDTPG CTTQSCAFRDQYEVFKTAGAEVIGVSGDSPESHQKFAAKYQLPFTLLSDKGDQVRKQY GATTAFGLIPGRVTYVIDNQGVVQYVFDSMFNFKGHVEEALKTLQQLQTA" gene complement(13457..14923) /locus_tag="DP116_16850" CDS complement(13457..14923) /locus_tag="DP116_16850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004392206.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16850" /translation="MNSRLPERKASALSRLRESLPTIFDLSYIKHVIVPAFLTSVYQG EKLSLPMIDEERLTKENALPYYFWGLLYDNWEPNQEEDGLSVFIQGYEKRGDDNLRKR IYYSALTPDLYRPIYGDKVVKFFDQLFDEKNAGKPLMRQYLDNYFDLYWDLHLGVKGD AVPPEVREIGESFNTVLAYRDPTQEIVYENYMRVRARRKFLKEWIDQRVEDVVKRNVA DPEKTFVYYWIKNGEEGEDFRRKDVVFECFHNFVALSQWGNTIYNIMSKLSKNTGDAE VRAWFKKTMESDYDQANDSPFTPLDRFVMELFRTISPNTGSISAMGEVRKPPYDRYGY VISPHKATSEYPRHWEKPGEFDPDRYKDAPTSDQIDEAKSKEIGFAQCPFHKATFEVK DGRKAELTNSAFGTVYGIVDDKAFPVCDYAGYAPFGFGYRRCPGELFTIEVFKDFFTK VWNDKIEFEKLDLPDPKKIPVGPTTVVDDNIGFTKSML" gene 15565..16698 /locus_tag="DP116_16855" CDS 15565..16698 /locus_tag="DP116_16855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF5009 domain-containing protein" /protein_id="PRJNA477356:DP116_16855" /translation="MRLTSLDVFRGITIAAMILVNMAGVTEPPNVYPPLLHADWHGCT PTDLVFPFFLFIIGVAMTFSLSKYTGDNKPTASVYWRILRRATILFALGLLLNGFWNK GPWTFDLSTIRIMGVLQRISLTYLLASVAVLKLPRKGQWILVGVLLIGYWLAMMYVPV PGYGAGVLTREGNFGAYIDRLIIPQAHLYKGDGFKNLGDPEGLFSTIPAVVSVLAGYF AGQWIRAQPVKFRTSIGLVLFGLGCLIIGWAWGWTFPINKKLWTSSYVIFTSGWALLL LAACYELIEVRRVRAWSKPFEVLGLNAISLFVASVLLIKILVRTKVGSGENAPSTYDW IYQNVFTSWAGAVNGSLLFAIVTVLLWVAIAYVMYTRRWFFKV" gene complement(16709..>17130) /locus_tag="DP116_16860" CDS complement(16709..>17130) /locus_tag="DP116_16860" /inference="COORDINATES: protein motif:HMM:PF12796.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16860" /translation="CGVPGLKPQGFHLTHYKSLPQAIGQGDIKRARALVLTVANRYGF SKDVQALMKAGIDVNARDDRGQTLLISVAMWGENKQIDPDLVKTLILAGVDVNAKDNY GKNALFYAARASNAAVRKALLQPDTDGDLKTKKSTNQ" BASE COUNT 4719 a 3723 c 3673 g 5015 t ORIGIN 1 aaaatattta tttctttcgc tggatttttt cgataatttt caagccaatc acaaccttgg 61 gataacagtt catccaatgg cttaatttgc catacttgca caaccccatt atctccccct 121 gccacaattg attttccatc cggactaaaa cttatgcccc aaatagctcc tttcccggtc 181 ttgaattcag caatttgata gccttttaaa ttccatagtc taaccgtgct atccatgccc 241 ccgctagcca gatactgtcc gttgggacta aaactcaagc tgctaacccc tgtttgataa 301 ctaaaaaatt ccaatggttg attaccagat aaattccaaa ttttaattac accacttttg 361 tctgctgttg ccagttgttt tccatccgga ctaaaactca agccactaat tttcccttgc 421 tttgtctgcc attccttaat tttatgccca gacaagtccc aaattataac attgccgtcc 481 catccaccag tagcaattgt tttttcatct ggactaaaac tagctcgctc aactacatct 541 tgatgcccct ctaatatcgt caatagctta cccgacaagt tccataatct agctgttttg 601 tctgctcctg atgtgacaat catctgtcca ttaggactaa agctgaggtt attaattaaa 661 tttttatgac cttttaatcg agtttttaat ttaccggata aatcccaaat ggcaaccgta 721 ttgtcatatc cacctgtagc tattagctgt ccatcgggac taaaatttac actccacagt 781 ggacctctcc tatcagcttg ccatgaagcc tgctcctgtc ctgacaagtt ccacagttta 841 actataccct cgtgatctgc ggtcacaatt tgttgtccgg tagagctgaa acttacacta 901 ttgacatctt tttgatgacc ttgaaattta ttgattggct gattatgttt ggacaaattc 961 caaatacggg ctgtattatc aactcctcct gtaaccagat attttccatc agggctgaaa 1021 ctggcactag aaaccgcacc tcgatgacct tgcaaagtag ttatgggttt acctgacaaa 1081 tcccagagtc tgactgtacc atcattggat gttgtggcaa taatctttcc gtcacggctg 1141 aagccgacat ttaatactga accttgatgt ccttctaatt taactgggtc ttgtgcagat 1201 aagttccaca gcagtgctgt attatcacct gttgctgctg tggttaaata tttgccgtct 1261 gggctaaaac tcaaactcag aatttgttgt cccttgtggg ctgtaaattg ctttaattgc 1321 ttgcctgata cttcccacaa tcgaactgta ccatcgtatc ctgctgtagc aacctgctgt 1381 ccattagggt taaaactaat actgttaacc ctgcctttat gtccgacaaa ttcagcgagt 1441 
tgtttccccc ataaatccca cagccgtgct ttttcttcat cagctgtcgc aattttttga 1501 ccgtcagggc taaagctgag gcttttgaaa ttggttgtgg aagctttaaa tgaagtcagt 1561 tggtttccag ataaatccca cagtgtaatt gtgccgtctc cttctccagc agcaattttc 1621 ttgccatcag gactaaaggc aacaaagttc atggttttaa cgctgtcagc aagcacacgc 1681 tgaagcccct tttttatcca ttttttttcc cctgatggac tccaaaggat aacagtgtcg 1741 ttctttcctg ctgtagcaat gtactttcca tcagggctaa agcttacact tttgatttgt 1801 tcctgccctg tatcaattcg attttgttcg tgaatagaat cctcgatttt ctgtaatgcc 1861 aatagagggc tggtagctgg ataatcttta agtagggtat tatccctaat catttgcttt 1921 aattcttgcc ctgcttgcat tgcagagagt agagcgtcta tttcacccga tggcatttgt 1981 cgtaaagcat tgacacccgc ttgttctaat tttgttcctt tttgggcttc tagctgtttg 2041 ttaatggcta tagctgccga gcctaaaaat acagcaccta aaaacacact cactgctaaa 2101 taaatattgc tgtagcgaat tttcttttta gctttttggt tagcatcagt caatatctga 2161 ttacgctctg ttgctgtgtc caaagcaacc tgagcttcta atttttctaa ttcttggcgt 2221 tctttctcag ctttctcctg ctctctcaat tgcaaacttt ggtggatata ttctttctca 2281 tttgtcaagt cttcttcatg cttttgtaac caatcttcag catcagcgag aggtttgccc 2341 cgtaacaatg cacccgaatc gcgatcgctt ttctcccact gactcatagc cgctcgcatt 2401 ctttctagcc aagtgccgaa gtcgcgattt tcattcatcc aggtctcaat cagaggccat 2461 tgcttaatca gcgtctcatg tatgatttct accgtatcct cattatctgc attgcggtta 2521 gtaacgacta atcgcttttc ggctagaatt tgtaccaaac gccagcgttt ttctcctaat 2581 tcagtttttt ttgcaactct acgtacatat cgcctgtcgg ttctatcttt tgtgaattgg 2641 ctaaaattta caagttgaat aaatattttt ggtacttgct ctttctgttc taaagtcaaa 2701 ttattataaa cgtcatcagc gtatgttgct aaagcacctt ttacttgccc tattgctgta 2761 taatcttcat gggttaattg tcggtcagtt tgctttccct gtaaagcagc tttcttcagt 2821 cgttcagttc gcttattcca caactcctct aaagcaaatg ccaataaagg taaacaatct 2881 tccttattat atatatcacc taaaatgcgt tctaccaaac cgtcttgaaa tgtcactcca 2941 tatttcaagg caggcttttc aatcacctgc tctagcttgt cgggtgtcat cgaacctaat 3001 tttatatcac cttgctttaa cgcctctgca aacggtggat aagaaagagc tttatctaag 3061 aaatccgcac gcatagttgt caccatcacc gtagaaagcg actcgtcatt agatgcttga 3121 ataatactta 
acaacaaatc cagaaaaagg tgctgaattt cgtccttgta ggacgtgtat 3181 agttgctcaa actggtcggc aattaacagc agtttatact gaggatgttt tccttgaatc 3241 ctagtcaaaa ttttggcaat ttccagcttg ttgtctttga gtttttgcgc taaaaccata 3301 gccgtagttt tctcatcatc tgacctgtac aacggaagca gtgcttcagc aattgcatag 3361 aaagggtcat cgtgagaacc aagtcgaaaa taggtaaaaa gccaatttcc ttgttctgct 3421 aatttgggaa tcagtcccgc aaacactaaa gaagatttac cactaccgga tgcacccaaa 3481 atgggaataa aattacgatg ttgagtggca gtttttaatt catctatttg cgaattacgt 3541 ccaaaaaata attctgcgtg ttgggtttca aagtaagata aaccccgata cggacatgga 3601 cggtcaaggt cttcaagaga agtatcggca ggctttattc ccacagttct gtcatcttca 3661 cggtagttgt ggatgacgta cataaataca tcacctgttg ctgacatcgc agcatattgc 3721 gagccattaa ttgtttgttt gatattgcct tcagtcatgt ctacctcatc cctcaaataa 3781 tactatttct taaaagtcga gctacaggta aggacacggt aataccgtgt ccttatagag 3841 atcgcacgct atatctagct agagttttca gaagtgtgat aagattatct tttccagtct 3901 tctgctggtg cgttgtcatt gatttgactg atagtcgccg aaccagttgc agaggttgcc 3961 gcaaatttaa catgactaac agtcatatta atactgggta taccacctgg ttgctctttg 4021 agttgagcga gcaaatgttg agctaatttc acaagttctg catcatcgtt agctttggca 4081 attttaacct cagtttcaac tgtgactttg cgatcgtctc tgtctggtgt tttttccaat 4141 ttgttaaccg catcaaccaa gtcgctatct tgaccatgtt tcttggttaa agcatctttg 4201 aggctgttgt aagcagcttt aatagcgtct gttgcaaaat ttcctgcaac tcctaaaata 4261 atggctgagg tgatagggtc catagtttat tcttaattta cttaaccaac agcgtaacgc 4321 acggttcgcc catataaccg tatattctca gatagtaagc aataaaaacg gtatattaga 4381 gagatagtat ttggtaagcg tcacgacatt caatattggc aagtactgtc accagagact 4441 ttgcctcaaa attcaacttg gtttattatg accaaagtcc ttagtgcggc agattctcgc 4501 ttgatagaac gtgcttctac tgggaagtag aggacaagac ttgttaaatt taattaagct 4561 ataagctgta gctttgctgc tctttgggtg ttaggataaa tagccaacct actccatccc 4621 tcaaaaagca gaatgctgaa agctgcaaac tagtagcgat taaatgatga aggaattact 4681 caatagctaa aaacttgtat aaactgccca tttcttgata aatcaatcgt ccgacatctt 4741 gcgctttaaa gttctaatta gttctctcaa cacatcattc ttggatctaa ccgtggctga 4801 acaatacttc tctagcagta 
catactcttc ttcggttagt cgaaaattta accgcttcgt 4861 tagtttattc atgacatgac ggagccaatc tgatgtacaa tagaattgta cccaaaatgt 4921 ggcattaatg gcatacaaag catttagaac caaattaaag ttatctaccc agcaaaaacc 4981 tgtaatggca cagcacgcgg gatactcacg ctgggtgtat aactggggac taaagctttg 5041 ggaacaagca tataaagacg gactaaagcc aagcgtgggg atgctgaaaa agtttttcac 5101 caaccatgtc aagccacagt atcaatggat gaatcagtta tcatccaaag tctatcagta 5161 tgcatttata aatttgggag aggctatagc gcgttttttc gccaaaaaag gtaagtctcc 5221 aagatttaag aaaaaaggca aagctgatcg ttttacgatt gataactcgg gtgcgccaat 5281 taaggtaggt ggattgcgtc acaagctacc ttttatcgga tgggtgcgca cttatgaggc 5341 gttaccagag tgcattacaa agaaggtaac gattagccag caagctggtg attggtattt 5401 gagttttcat attgagatac ccgaagcctc gccaacgcca aagagcattg atcgagtggg 5461 ggttgacttg ggcgtaaatg ccttagcaac tcttagcact ggtgctgtgt acccaaattt 5521 aaaggcgtac cgtaaggcaa agcataaact agcaaagctg caacggttgg ctagtcgcaa 5581 gcaaaaggga tctaataacc gtcataaagc caacataaaa gttgctcgtc agcatagacg 5641 agtcgcgagc attcgcaacg attacttgca caaaatcaca acgtatttag ctaaaaacca 5701 cggtgaagtt gtgattgagg atttaaacgt gtcgggcatg ttggctaacc ataaactggc 5761 atctgcaatc gccgattgtg ggttttatga gttccgtcgc caacttgagt acaagtgtga 5821 gcgatacggt tcaagcttgg tggtagttga taggttctac cccagttctc aaatttgttc 5881 taattgtggt tatcgtcaga agatgccgct aaaagaacgt gtttatattt gtccttgctg 5941 caatgttagt cgggacagag atttgaacgc agcgataaac ctgtcgaact ggggtcggct 6001 tgaccctgga aagcctgtgg agcaagtgcc gccgacggtt tgcgacgaaa caggaagtaa 6061 acgctttaaa cagctttgtt tgttttgagt agattttatg gagcagatgg atgtagtgca 6121 agcgataaac acagaagcta aacagcgcgt tcaagaactg cgacaattac tacaaaaagc 6181 gagctacgcc tactacgtcc tcgattctcc catcatggag gacgcagttt atgaccagct 6241 gtatcgagaa ttgcaacagc tggaaaatca gtatccagaa ttagtgacac ctgatagccc 6301 aactcaacga gtgggagaga aaccagcaac ccaatttatc tcagttcgtc acaatgttcc 6361 actatatagt ttagaaaatg cctttaacat tgaagaatta aaggcatggg agggacgctg 6421 gcggcgacaa gcacccaatg taggacaggt agaatatgtc tgcgagttga aaatagatgg 6481 ttctgcctta gcgttgacat atgaaaatgg 
tattctcacc agaggtgcaa ccaggggtga 6541 tggagttaca ggtgaagata tcacccaaaa tgtgcggaca attcgctcaa ttcccttgcg 6601 gttaaatcta gagacgttac gtgaaaattc tctcccagat aggatagaag tgcgaggtga 6661 agcgttttta cccttggaag tgtttaaaca aatcaatgag gaaaggcaaa aagcaggtga 6721 atcagtcttt gcgaatcctc gaaacgctgc tgctggtact ctaagacaac tggactcccg 6781 aattgtagct caacggcggt tagatttctt tgcctacacg ctgcacattc ctggtagaga 6841 tgacgccagt atcgctaata cccagtggga agcgctggaa ttgttacaaa agttggggtt 6901 tcgcgtcaat cctaatcata agctttgtcc ctctgtagat gatgttgcac agtattacga 6961 atactgggac actgaaaggt tgaatttgcc ttatatgacg gatggggtgg tggtaaagct 7021 gaattctttt aagcttcagg aacaacttgg gtttacgcaa aagtttcctc gttgggctgt 7081 agcgctgaag tacgcagcag aagaagcacc cacccgtgta gaaaatattg ctgtgaacgt 7141 cggaagaacg ggggcgctga ctcctcttgc tcaaatgcgc cctgtacaac tcgcgggaac 7201 aacggtttct cgtgctaccc tacataatgc tgaccgtgtt gctcaattag atattcgcat 7261 tggcgataca gtcattgtcc gtaaagctgg ggagattatt ccagaagtcg tgcgggttct 7321 taaagaactc cgtcctgata atactcaaag ctttgttatg ccttcccatt gtccagtttg 7381 cggacaacct gtgattaggg aaatgggaga agcggtgact cgctgtgtta atgcttcttg 7441 tgcggctatc cttaaggggg cgattgaaca ttgggtgagt cgcgatgctt tggatatcag 7501 aggtatgggc gaaaaactgg tgcatcaact tgtggataaa ggagtggtgc attctgtagc 7561 tgatttgtat aacttgacag aagagcattt gtgtggttta gaacgaatgg ggaaaaagtt 7621 ggcacagaag ttggtggagg cgatcgccca atcaaaaaac caaccttggt cacgcgtgtt 7681 gtatggttta ggtattcgtc acgttggtag cgtgaatgct caattgttga cacagaagtt 7741 tcccacggtg gaacagttgg ctgaagcttc acaagctgat attgaaactg tttacggtat 7801 cggtgaggaa attgcccagt ctgtatacca gtggttccac atcagtgcga atcaaacttt 7861 aatttctcgc ttgaaagatg ctgggttgca atttgctagc acagaagaag aaacaccgct 7921 gagtcagagt catcaaaagt ttactgggaa aacttttgtg attactggta cgcttcctac 7981 cttgaagcgg gatgaagcaa aggcgttaat tcaaaaagct ggcggaaagg taacagattc 8041 tgtgagtaag aaaacagatt atttggttgt gggggaagag gcgggttcaa agttgacaaa 8101 agcacaagag ttgggtgttc gtcagttgaa tgaaggggag ttgttggagt tgttggagga 8161 ttgaaccgca ccaggtgcag ataagggagc gaacacaagt 
actaaccatc ttaacctgct 8221 tgtatttgcc agcacgcaaa ttcttaatct gaactccggc tttggaagga attaaaataa 8281 atttggtcca gatattatca aaacagcaaa tgactatatc taccccttcc aggattacct 8341 gtcaggggtt tgtttgttgt aagaaaagac ttgctcaaaa atacggaagc tcagacataa 8401 ttaaagaagt cgttggctgt agtgactccg attgaacaaa cagcgtactt tggtcaggaa 8461 aggctgtcca attttggttg gataaaggct cagaggcaac aataacttgg gttggcggct 8521 ttgcatcatc gcaagaccaa taaagagtag gggcttgtgt gccgtaggcg tagcgagttg 8581 ccgcaagtgc ttgaccatca ctgacaatta tattggcact aaagctggta tcatattttt 8641 tagccagaac cgtcaacttt tctagcgtgg cacgtaaggc tgacaacaga gtactacctg 8701 gagataactg ccacatttcc accagtaggg caaagatatg ttcagagtca gtcatgccct 8761 taattaggcg atatgtggaa tcagaaaggc tgtcacgtat cggtctggcg agcgtctgtt 8821 gaaaatttgt gatttcacca ttatgcacaa ataacagctt gtcactgcgg aatggctgac 8881 agttgcttat gtcgagcgat tcgcccgtcc cagcgagtcg gacatagccc agtgcacaag 8941 tagattgtac atagtggctc aactcttcca ggttcggatc gttccagatc gggatagtgt 9001 tccggtaaat aaatggttta ccttctttat cgtaccagcc taccccgctt ccatctgcac 9061 aaactactcc tgactttaat tcaacgggat tgtaactttg gttgtagagc gaatgctctt 9121 gcttatagag caactcatct aatgggatgg tattgccaag gtagccaact aatcgacaca 9181 taagaaaaac aactctactg agcagggtac actcactgca ataatgttcg tatataaaat 9241 tttagcacgc taattttttc tggtgatccc acacaggggg gatttgctgc taaggggtta 9301 agtaaacggc tggtaagtat ttttaccagc cggcacaagt tcacacagct tcttgtatta 9361 atcgccgata attccacccc acgtgagtgg accaacaacc cctaccgtat tagagactat 9421 ctcgtcagac ttttgtattt taatgacggc tgcttttgtg tttggaccaa atataccgtc 9481 tatggcaccg gtataaacac cttcagattt taaaccttcc tgaagctgtt tgacgacttc 9541 gcccttttcg ccttctctca aaaattctcc tggttctttg ggccaatcag agcgaggacg 9601 taaagcctcc cacgtttttc cccctacaat cccatcagca gtaagctttc tactctgctg 9661 aaactttacc actgcggctt tagttttaga accaaacgac ccgtctgttg cacctggatc 9721 aaactcgttc tttccatctc gaaatttcag tataaactgc aaatctatga cttctggtcc 9781 cgtagagcct tccttgagaa ctgatactac catgtgcccc ccagttattt ggcgaaagct 9841 gtaaatgttc tcaaccatta tacgtatatt cctatgtgtt taataattga 
gaaaagatta 9901 agaacagtag tggtcttcta aacctgttac gtatggagta gtagccctac ttttcaactc 9961 agcaatgaag tttaattagt agtgattgat gaggtctcag acgtcattga acagtacaat 10021 caacgggctg ctgataacct ctttgttacg cgactttgaa aagtttccgg tactcgcaaa 10081 gtttcataga agtcacagaa acctaagcaa caatgacatc tttactgaat cggtgttcta 10141 ttttaaatac attaagacca cccttttgcc tgaaaggaca gatgtgtgaa aacgcccgtg 10201 ctcaatacca catctacggc gttagccgcg cttccctatc aggggttctc atcctcccac 10261 ttgcacctga tagaacttcc attcacatca ggtgcaatca attcaacctc aaaaacttct 10321 gcaaagcact ttgccacatg ggcgcgtacc tccaacaaag aaatatcagg aatccattga 10381 gctaaactac ccacaggttt atcagcaata ccacagggta caatacgctc aaatccgctc 10441 atgtcaggac agacatttaa tgcaaagcca tgcatggtaa tccaacggct aactttaatc 10501 ccaatagcag caactttacg cccttctaac caaacacctg tgaaacctgg attgcgttct 10561 ccctgtaagc cataaatttt gagtgtccga attaagactt cttctagttg acgtaagtac 10621 cagtggaggt ctttacgata atggtgcaaa tttaaaattg gatatcctac cagttgaccg 10681 ggacaatgat atgtgacttc gccacctcgt tcaactcgat gcacttcata cgtactctta 10741 gtatggtcaa atttcagaaa ttctgaacta gctccttgtc ctaaagtgta gacaggcgga 10801 tgttctaaca agattagcac gtcttctaag ctagggtttt gaatgcgctc agctagaaga 10861 gtacgctgcc acatcagcgc atctgagtat ggtatcacta cttggttata tagcaaacat 10921 tgatgattgt gcaaagacat attacaaaca aagaaaaaac tcaaccagaa tacagagcaa 10981 aatgtatctc ttgtaaccaa taggggttca caaatgtcaa gctatgcaaa ggattctaaa 11041 ggaaagtcaa gggaattggt tttccaaaat acaaggtgct aattttgaat atataacctc 11101 aatcgagttg ggaggaattc tctgcagtaa ccatcacgcc aacaaaagag taacaagtct 11161 actggctgat gacgccgaaa caatggttga cataaaatac aaaataagaa gtgcaaaaag 11221 acctgtgtat tttaggaaaa acgattaagc ctcctgagtg agaaaaccaa agatgtctgc 11281 gtgaatatct aattgctttt caaccgatta gaatcacagc agaggttctt gaactaaata 11341 tgggcgaaac ctcacaaaaa ctcagattca gtgggcttag gctaggcgtc ttaccagcaa 11401 aaagtcataa gacgcgtagt ggtgagacag cgctacagtc cttgtttccc gacagaggcg 11461 actgcgtaaa cgcaagcgca cgccaagggc gttagccgta aggcgaaggc aggcatacgc 11521 gtagcgtctc cgcaggagat acccgaaggg 
cttcccgtta gggtagagcc gtttccactt 11581 agtctcttat gtttttacgg acacgctacg agtgtacgtt atgcttctgt gtaggagaca 11641 cgctaacaga actatgtcct tggggcactc actggcgaac ggtgagtcca gcgctgcggg 11701 agggtctccc tccgcaggtg actggcgaac ccggagggta gtcagtctaa atccgttcac 11761 gattgattct ggcggcagtc atagacctta ccgcatttcc ttgtcatccg aaacaaaatt 11821 cacaaaaaat attgagtggg agagtttaca tgaagcttgt catccacggc aaaaatattg 11881 aaatcactga tgcaattagg gattacgtgc atcacaaaat tgaaaaggcg gcaagtcatt 11941 atcaaaacat caccaacgaa gtggatgtcc atctgagcgt agctcgtaat ccccgaatta 12001 atcccaagca gtcggctgaa gttacgattt atgctaatgg gagtgtgatc cgtgctgagg 12061 agagtagcga gaacctctat gccagcatag atttggtggc agacaaaatt gcccgtcggc 12121 tgcgtaaata taaagagcgg cgtcaagaga agaaaacaca cgcccaaaca acaattgaag 12181 gagttgttca agaggcagta gttacagatt taattggcga tcgcactcca gaattgcccg 12241 aagaagtcgt ccgttgtaaa tattttgcca tgcctccgat gactatggca gaagctttag 12301 aacatctgca attggtagga cacgactttt atatgttcca caatgtggaa actggcgaaa 12361 tcaatgtcat ttacgaacgg aatcacggcg gttatggtgt gattcaaccg cgtcatacca 12421 ataatggtca taccaacagc aagaacggca agacaagcaa tggttacgtt gctatgccgg 12481 agaagacttt ccagaataag gtgtaaaaag aggtaaggtg agtggagcga tcgcgcaaag 12541 aaaggcgatc gttccactta ccctattcaa aaaactttat tcactcttta acatcgctgc 12601 tatcaggctg tttgcagttg ttgcaacgtt ttgagtgctt cctcaacgtg acctttgaag 12661 ttaaacattg agtcaaagac atactgtaca actccttggt tatcaataac ataagtcaca 12721 cgaccaggaa tcaaaccgaa ggctgtagtt gcgccgtatt gcttccgcac ttggtcgcct 12781 ttatcgctta aaagggtaaa aggcagttgg tacttagcgg caaatttctg gtgagactct 12841 ggtgagtcac cactcacacc aataacttct gcgcctgctg ttttaaaaac ttcatattga 12901 tcgcgaaagg cacacgattg agttgtacat cctggtgtgt cgtccttggg gtaaaagtac 12961 aggacgagtg gtttaccgcg aaaatcttgc aggctcacag gtgtaccgtt ctgggacggt 13021 aatgtgaaat cgggagctgt gtctcctact ttgactgcca taggtacttg tagtatttct 13081 taataaaatt ttattttaat ccgtaatcta gtgaatactt tccaccagcc agctttattt 13141 gccaagctag tggatatagt gtccggaatc gcatccatca tatattcaaa cggcacttgt 13201 gagttagtta 
agcgcttcgc attgcttgag cagtccttgc gtgttggagg agcatctgct 13261 gcagaaattt ggggcaagca acggtcttga atatcttttg tgcgatccta atgttgatgc 13321 agtaaaggtt cacggagctt gatcacagga gatgattttg tgaggttttg atgtcacaac 13381 tctcaacgct acctcacctg gttttaatac tgccatgagc aattagagat ctagggaagt 13441 ccgcaacaac cgcgctttac aacatcgact tagtaaaccc aatattgtca tcaacgactg 13501 ttgttggacc tacgggaatt tttttcgggt ctggtaggtc gagtttctcg aactcaattt 13561 tgtcgttcca cacctttgtg aagaaatcct taaacacctc aatcgtgaac agctcgccag 13621 gacagcgacg gtatccgaaa ccgaaaggcg cgtagcctgc atagtcacat acgggaaacg 13681 ccttgtcatc aacaataccg taaacagtgc caaatgcact attggttagt tctgcttttc 13741 tgccatcctt cacctcgaat gtcgctttgt ggaagggaca ttgggcaaat ccgatctctt 13801 tagatttggc ttcgtcaatt tggtcgctgg ttggcgcatc tttgtagcga tcaggatcga 13861 attcccctgg cttctcccag tgtcgtgggt attcactggt cgctttatgc gggcttataa 13921 catagccata tctatcgtat ggtggttttc tcacttctcc catagcagag atgcttccag 13981 tgtttggcga aattgtgcgg aataactcca tcacgaaccg atcaaggggg gtaaagggtg 14041 aatcgttcgc ttggtcataa tcactttcca tcgtcttttt aaaccatgcc cggacttcag 14101 catctcctgt gtttttgctc agtttagaca tgatgttgta gatagtattg ccccactgac 14161 tcaaggcaac gaaattatgg aagcactcga aaacaacatc tttgcgcctg aaatcctctc 14221 cctcttcacc attcttgatc cagtagtaca cgaatgtctt ctctggatct gctacatttc 14281 tctttaccac atcctcgact ctttgatcaa tccactcttt caaaaacttg cggcgcgcgc 14341 gcaccctcat gtaattctca tagacaatct cctgtgttgg atctcgatag gcaaggacag 14401 tgttgaaact ctcccctatc tcccgaacct ccggcggaac ggcatcgccc ttgacaccaa 14461 gatgtagatc ccagtacagg tcgaaatagt tatctaagta ctgacgcatc aagggtttac 14521 cagcattttt ttcgtcaaat agctggtcga aaaatttcac caccttgtcc ccatagatgg 14581 gacgatatag atcgggagta agagccgaat agtaaatcct ttttctaaga ttatcgtcgc 14641 cacgtttttc gtaaccttga ataaatactg agagtccatc ctcttcctga ttcggttccc 14701 aattgtcata taacagtccc caaaagtaat atggtagggc gttctccttg gtgagtcttt 14761 cctcatcgat catgggaagg gatagcttct ctccttgata aacacttgta agaaaagcag 14821 gcacaatcac atgcttgatg taagaaaggt caaaaatggt agggagtgat tcgcgaagac 
14881 gactgagggc tgaagctttt ctctcaggaa gccttgaatt tatatcttcc ttaacttcgt 14941 ccgcaagagt agaaaaaaat tctgaaacat caagaatagg aaaaaaagac atgatttgga 15001 ctccttgtgt ctggctgctg taaactcatg cagctacatc aaataaatag atgccagcat 15061 tttgatctga tgcagctgga gctagagggc ggtcaaaaag caggtaaact aagactcagc 15121 aagtaataga catcatggcc accatacctt gatgcctgct ccagcaaaac cgatgccgaa 15181 gatgactcca gatatgacaa agagtaggtt aagacctagt acaaaatacc cgataaaccc 15241 ttgacaaatc cagcttttcc ttaacttgta acaaatctat aggacttacg cattgacaaa 15301 aaataccaaa tatgggcttt gtatggattt caggctttca ggcttgattt ttcgacaatt 15361 tttaggcgat cgcgcagacc aacagacgcg acgagcgcgt atcgctactc atcaatgatt 15421 gtgaaaagcc tgaaacgacc gattttcgtt gatgtacatg ataatgaatt tgagagcgtg 15481 cgtaagtcct aatctactga gcattgtact tgattacctg gactaggcaa gctaacttag 15541 agtttctaaa acaaaattcc tcctatgcgt ctgacttcgc ttgatgtttt tcgcggcata 15601 actattgcag ccatgattct tgttaacatg gcgggtgtta ccgaacctcc taatgtctat 15661 cctccactac tccatgcaga ttggcacggc tgtaccccaa ctgatttagt ctttcctttc 15721 tttttgttta ttatcggtgt agcgatgact ttctcgttgt caaagtacac tggcgacaac 15781 aaaccaacag catctgttta ctggcgtatc ctgcgtcggg ctacgatttt atttgcttta 15841 gggttattac tcaatggctt ttggaataaa ggtccttgga cttttgattt gagtactatc 15901 cgcattatgg gagtgttgca acgcattagc ttgacatatc tccttgcttc tgttgccgtc 15961 cttaaactcc cacgtaaagg acaatggata ctggtaggag tgttactcat cggctactgg 16021 ttagccatga tgtatgtgcc tgttcccggt tatggcgctg gagtgctgac gcgagaggga 16081 aatttcggtg cttatataga ccgcttaatt attcctcaag cacatcttta caagggtgat 16141 ggttttaaga atttaggaga tccagaggga ctttttagca ctattcctgc tgttgtgagt 16201 gtattggctg gatatttcgc tggacaatgg atacgcgctc agccagtgaa atttcgtaca 16261 agcataggtt tagtcttatt tggtcttggt tgcttaatta ttggttgggc atggggctgg 16321 acgttcccga ttaacaaaaa gctatggacg agttcttatg taatttttac tagtggttgg 16381 gcattattat tacttgcagc ttgttatgaa cttattgaag tgcgacgggt gcgggcttgg 16441 agtaaacctt ttgaagtgct gggattaaat gcgatttccc ttttcgttgc ttctgtactg 16501 ctgattaaaa ttttggtgag aactaaagtc gggtctggag 
aaaatgctcc cagcacttat 16561 gattggattt atcagaatgt atttacgtct tgggctggcg cggtgaatgg ttctttattg 16621 ttcgcgattg tcactgtctt actgtgggtg gcgatcgcct atgtgatgta tacgcggcga 16681 tggtttttca aagtctaaag tcttgctttt attgatttgt tgatttctta gtcttcaaat 16741 caccatcggt atctggctgc agcagagcct ttcttactgc tgcgttagac gctctagctg 16801 catagaataa agcatttttg ccataattat ctttagcgtt cacatcaacc ccagcaagaa 16861 tcagggtttt caccagatct ggatctatct gtttgttttc tccccacatt gctacactta 16921 tcaacaatgt ttgtcccctg tcatcccttg catttacatc aatcccagct ttcatcaggg 16981 cttgaacatc cttcgagaag ccgtatctat ttgctactgt cagcaccagc gcacgggcac 17041 gtttgatatc tccctgtcca atcgcctgtg gtagagattt atagtgggtg agatgaaaac 17101 cctgtggctt tagcccaggg acgccacatg // LOCUS NODE_1944_length_17082_cov_4.71504117082 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17082) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17082) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17082 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 186..364 /locus_tag="DP116_16865" /pseudo CDS 186..364 /locus_tag="DP116_16865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006513592.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" gene complement(699..1028) /locus_tag="DP116_16870" /pseudo CDS complement(699..1028) /locus_tag="DP116_16870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998185.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DUF2834 domain-containing protein" gene complement(1275..2369) /locus_tag="DP116_16875" CDS complement(1275..2369) /locus_tag="DP116_16875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411997.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heat-inducible transcriptional repressor HrcA" /protein_id="PRJNA477356:DP116_16875" /translation="MQVQLTNRQQQILWATIRHYIATAEPVGSKALVEEYNLGVSSAT IRNVMGVLEKVGLLYQPHTSAGRVPSDSGYRIYVDQLITPSETLAREVELSLQKRLKW EDWSLEALLQGAAHILATLSGCITLITMPQTATAVLRHLQLVQVETGRVMLIVVTDGY ETHSALMDLAQASEDAQPDAEVIDRELQIVSNFLNTHLRGKSITELANLKWSELAQEF QRYGEFLKNSLADLARRTLTPSATQIMVRGVAEVLRQPEFSELQQVQMLIHLLEEEQD QLWRLIFEEQPEAEEMGKPRVTVRIGSENPLEPIRTCSLISSTYRRGSVSLGSVGVLG PTRLNYESAIAVVAAAADYLSEAFSCFNPQ" gene complement(2806..3162) /locus_tag="DP116_16880" CDS complement(2806..3162) /locus_tag="DP116_16880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314517.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="rhodanese-related sulfurtransferase" /protein_id="PRJNA477356:DP116_16880" /translation="MMGKPSEPSITQINVEELAQRLSSGDPIQLVDVREPQEVAIAYI DGFVNLPLSEFPDWADQVNTRLDPHAETLVLCHHGIRSAQMCQWLVVQGFTNVKNIMG GIDAYSTLVDSSIPQY" gene 3490..5175 /locus_tag="DP116_16885" CDS 3490..5175 /locus_tag="DP116_16885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3352 domain-containing protein" /protein_id="PRJNA477356:DP116_16885" /translation="MTQRSFLRVVAVGIMMLLLIGMTGCNGLSAKNSLTGIAPTGQSD AAIFVSKQAPVMVSMLVNPERLQAFERDGELSKLKTRLLANTGIDYQQEIKPWLGNEI TLAVTSLDIDHDSENGQQPGYLMALATTQPEKSREFVQLLFSKRALAGANLATEEYKG VKLISDNQIPSTSDHNEGVKNQNSLAGAVVGNSFVLFANHPKVLREAINNVQVSDLNL TSSGKYQKATKQLSKEAQAVAFLNLPVVAKWQNLKPDAQTYDNQIISLVSNPKGLLAE TAFLAKKETSPPAAQLSKPVGALDYIPASASLAVAGANLSGLGDSDLAQLWQQVTASL SASTEDVISKFLPVADVQKRSGINWREDIFNSVQGEYAIGLLPRAEQTNPDWIFVAEK SEGTLAAISRLDELASSERLSLNSFTLNDQKISAWAQLKTAINKSSEAKERELFTIQA NVLAARADIGNYEIFASSVEAINAALTAKENPLVKNRDFQDSIATIPEPNQGYVYIDW TKSQDILERQLPILKFVEVLGKPFFKNLRSLTLSSYGSDTGLLKAGIFFQFNR" gene complement(5762..6439) /locus_tag="DP116_16890" CDS complement(5762..6439) /locus_tag="DP116_16890" /inference="COORDINATES: protein motif:HMM:PF08332.8" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16890" /translation="MRLPAITFTTLLATTFAMTSVVYADEQADKQAIMKAYAAYNAAI ERKDVNQIFADYAPEFTIIRPNGKLTNLEQERQQTQNDFKNIRQIKAHDEIKQIQING QTATVIGIGYTSAIGSNPNNPQVPVPFSNVSQYQDIWKRTPGGWKLISTHVLQSNVNG QQASQVNENNLTPAQRQSLAEMKRRYMQGREREMQGIIEDMRMRNNMMNCMNGVGYGC GSSIIGN" gene 6782..6928 /locus_tag="DP116_16895" /pseudo CDS 6782..6928 /locus_tag="DP116_16895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313895.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="site-specific integrase" gene 7031..7822 /gene="cysE" /locus_tag="DP116_16900" CDS 7031..7822 /gene="cysE" /locus_tag="DP116_16900" /EC_number="2.3.1.30" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859127.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine O-acetyltransferase" /protein_id="PRJNA477356:DP116_16900" /translation="MLSIFLADFRIIFERDPAARNWLEVLFCYPGLQALLLYRLAHWL HIIHIPLIPRLISHIARFLTGVEIHPGATIGHSVFIDHGMGVVIGETAIVGDYTLIYQ GVTLGGTGKECGKRHPTVGENVVVGAGAKVLGNIEIGNNVRIGAGSVVLRDVPPDCTV VGVPGRILYRSGVRVNPLEHGSLPDAEAQVIRALVDRIEQLEQKIEQLQQPQQKVLVP FVNSSLSTPQTSNSHILKDEASSEMRTCCLLENKVIEEFLDGSGI" gene complement(7948..8523) /locus_tag="DP116_16905" CDS complement(7948..8523) /locus_tag="DP116_16905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009460142.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_16905" /translation="MIKLQRQLTLEEFLALPEGDITYELIEGEAVPKFKNDEISPKFF HSSITGALFILLSAWAEGKGRVVIEWAIKLTRNQQNWVPVADLTYISYNRLAADWLQD DACPVAPELVIEIISPGQTFGEMTEKATDYLKAKVQRVWIIDTRAKTITIFYPEHALP QTKRGTDSLEDSLLPGLQITPQQIFQQARIC" gene complement(8716..9309) /locus_tag="DP116_16910" CDS complement(8716..9309) /locus_tag="DP116_16910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314520.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyridoxamine 5'-phosphate oxidase" /protein_id="PRJNA477356:DP116_16910" /translation="MNFELLIAPWRSPLARALHRNRSQPYSRYFQLATVQTDGRPANR TVVFRGFLGDTNLLKMITDTRSEKFDQILHQPWTEVCWYFSVTREQFRIAGELSLIDA NHPDSGLQKARQQTWQDLSDNARVQFTWPHPGKPRAEQEAFSSSVADPAHPPQNFCLL LLDPVQVDHLELRGNPQNRWCYLRDNSHTWSTTAINP" gene 9546..10769 /locus_tag="DP116_16915" /pseudo CDS 9546..10769 /locus_tag="DP116_16915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869818.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ribonucleoside-triphosphate reductase, adenosylcobalamin-dependent" gene 11169..12239 /gene="nrdJ" /locus_tag="DP116_16920" /pseudo CDS 11169..12239 /gene="nrdJ" /locus_tag="DP116_16920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015117381.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="ribonucleoside-triphosphate reductase, adenosylcobalamin-dependent" gene 12659..12769 /locus_tag="DP116_16925" /pseudo CDS 12659..12769 /locus_tag="DP116_16925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018396235.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 12850..13077 /locus_tag="DP116_16930" CDS 12850..13077 /locus_tag="DP116_16930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876424.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16930" /translation="MFVDELKPIFQQFTHHPLSFLGGFVSGLLRLNLADDPVKSWLSQ QLDSTSNTTFTTQSAEGHNGKASGPQSISIE" gene 13573..13848 /locus_tag="DP116_16935" CDS 13573..13848 /locus_tag="DP116_16935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_16935" /translation="MTIYVGNLSYRATEADLKVVFAEYGEVKRVVLPTDRETGRLRGF AFVDMSEDAQEDAAITELDGAEWMGRQLRVNKSKPREEERRGSWVKR" gene 14428..15477 /locus_tag="DP116_16940" CDS 14428..15477 /locus_tag="DP116_16940" /EC_number="3.1.3.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213247.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="class 1 fructose-bisphosphatase" /protein_id="PRJNA477356:DP116_16940" /translation="MARAPESLELNTNEVADKALDRDCTTLSRHVLQQLQSFSPQAQD LSALMNRIALAGKLIARRLSRAGLMEGVLGFTGEVNVQGESVKKMDVYANDVFISVFK QSGLVCRLASEEMEKPYYIPENCPIGRYTLLYDPIDGSSNTDTNLSLGSIFSIRQQEG SDLNGEAADLLASGRQQIAAGYILYGPSTMLVYTIGRGVHSFTLDPSLGEFILTEENI RIPDHGSVYSVNEGNFWQWDEPIREYVRYVHRTEGYTARYSGAMVSDIHRILLQGGVF LYPGTLPKPEGKLRLLYESAPLAFVIEQAGGRATTGHMDILEVVAKKLHQRTPLIIGS PKDVAKVESFIQNGH" gene 15635..16780 /gene="tal" /locus_tag="DP116_16945" CDS 15635..16780 /gene="tal" /locus_tag="DP116_16945" /EC_number="2.2.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458796.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transaldolase" /protein_id="PRJNA477356:DP116_16945" /translation="MATNQLLEIKNYGQSIWMDNLSRDIIQSGELKDLVENKGICGIT SNPAIFEKAIAGNAIYDADIEAGIKAGLPTDKIYESLAFADIRNACDILRPVYDATNG LDGYVSIEVPPNIADDTQATIKEARRYYQEIGRENVMIKIPGTKAGLPAVEQVIADGI NVNITLLFSVDSYIKTAWAYISGLEKRAAEGKDISKIASVASFFLSRIDSNIDGKIDA KLKKGVDDITVEAKLKDVKGKVAIANAKIAYQEYKKIIQTDSWKALAAKGAKVQRLLW ASTSTKDPNYSDVMYVDELVGPDTVNTLPPATIDACADHCDVASRIETRIEEAYQLIE SLKDPDINIDINKVMDELLVEGIDKFVKPFESLMSSLESKVKQLSPV" BASE COUNT 5080 a 3476 c 3750 g 4776 t ORIGIN 1 gcaagttctg acgatgaatg tagaatcccc tcgcctttag gcaggggagt gtcaaataaa 61 ctgacagcca cactcacaaa ctagctaaag atgtgtgctg cataaagcta aagcctggct 121 gataaaaagc ttattactta attaattcac aaaagatttt gacaaattgt taggagacaa 181 aagatatgga tgctcaggaa cttttgagac gatacgtatc aggacagact ttagcaatgt 241 aaacctggtt catgtgtgct taactaatgc aaatctggtt ggggcacacc tgattggagc 301 acacttgatt cgcgcagact taaggggggt agacctgact gctgcacatc taagtcaggc 361 gaattgtcag acaatagctt atttgaccgc agcgatatca cccaggaagt cgtagaccgt 421 cggcaaattg aaaacttttg ctgtttttgc caaaccaatc acgttattct tgagaatttg 481 ctgatcaata ctatgaacac ctagaaaaag tccagcctga tggtcaatca gcaacatggc 541 ggcattatca ggtgtcagtg gatcgtaaaa 
tgcgtcgctg ttacctgtta tgttagtcat 601 tgacagttcc ttgagcggtt caaggctaat tattcaatga agctaacgat tccgcaagta 661 ggtacacgct tgtacgtaac gttcatattg aaaaaagcat ttttgctgcc aggaaacttt 721 tgatttggtt cccgtaaagc taagtaaggt agtaaggcaa aagccccaac tccaaaagaa 781 acagaagcaa aaggccaagc aggaattttt tgtcctttac catcaataaa tgccacgcag 841 ctgtaaatca taggccagat gcccatgatg ttgaagagtg ctacaactaa agggttaatt 901 ccttgccact gaccagtaga aagattttta atcaactcga atgtatcagg ttgattagga 961 ggggcaaaga aaaaggcata gacaatcagt cctagccata gcgctccaaa agtaattttt 1021 ctgaccatta tttaagctag ttttgacgaa ttcactaatt gttttgccat aactcagatc 1081 ttggatctgc tttttcatcc gccaataaat aaatttcttg gctcaatatt caagtccgtt 1141 aaaacggact ggataagttt ttgagtccgt tttaacggac ttgatctatt agccttgaac 1201 ttgagttcaa ggcgtactca ccggtgaagt ccaagatttg ccatatgctg tttttttata 1261 aatacctact ggctctactg gggattgaaa caactgaaag cttctgagag gtaatcagca 1321 gcagcagcga ctacggcgat cgcactttca taattcagcc gcgttggccc cagaactccc 1381 acgctaccta atgatacaga acctcggcga taggtcgaag aaatcaacga gcaagtgcgt 1441 ataggttcta gagggttttc tgaaccaata cgaacagtga ctcgcggttt acccatctcc 1501 tccgcctccg gttgttcttc aaatattaat cgccacagtt ggtcttgttc ttcttccaac 1561 aggtggatga gcatttgcac ttgctgtaat tccgaaaact ctggctgacg caaaacttca 1621 gcaacaccgc gaaccataat ttgtgttgca gatggtgtca gagtgcgacg agcaagatca 1681 gcaagtgaat ttttcaagaa ttcgccatat ctttggaact cttgggctaa ttcactccat 1741 ttcaagttag ctaattcagt tatgcttttc ccccgcaagt gagtattcaa aaaattagag 1801 acaatctgca actcccggtc gatgacttct gcatctggct gtgcatcttc cgaggcttgc 1861 gctaaatcca tcaatgctga atgtgtctca tatccatccg tcaccacaat caacatcacc 1921 cgtcccgttt ctacttgtac gagttgtaaa tgtctcaaca ctgctgttgc agtttgcggc 1981 attgtaatta aggtgatgca accacttaaa gttgctaaaa tgtgagcggc tccttgtagg 2041 agagcttcta agctccaatc ctcccacttg agccgcttct gcagtgatag ttctacctct 2101 cgtgctaaag tttcagaagg tgtaatcagc tggtcaacat aaatacggta gccagagtcg 2161 gaaggtacgc gtcccgcaga agtgtgaggc tggtacagta acccaacttt ttccaacacg 2221 cccatcacat tgcgaattgt tgctgagcta acaccaaggt 
tgtactcttc aacaagagct 2281 ttagaaccaa caggttctgc tgtggctata taatgacgta ttgttgccca aagtatctgc 2341 tgttgacgat ttgtaagctg gacttgcata aggagttttt tactgaagcc gaattttaag 2401 cgaactttga aaaagttttt agagatagct ctaaattaaa taaggattaa taatagtaaa 2461 gcaacatagc aaacactttg cttctacgag aataaggaaa aagaagtaaa taaaagtaaa 2521 tattcagtat tgtgttgata aagttactat taatacaagt ttactgctga aaaaagcttt 2581 tggattgaag taaatagccc ctacttatgg atacttgcat aagcttcttg aaatctctgt 2641 atttcaggtt acgaatttcc ggaacagcca aaaaaggcac gtaggtagaa tacaatattt 2701 tccactcaac acttaaagct atcctagctt gtacagagga tagcatagac aagttggaat 2761 cgtcagtctt aggagactac atagccactt cactcaaaac aggagctagt attggggaat 2821 tgaagagtca accaaagttg agtaagcatc aattccaccc ataatatttt tgacatttgt 2881 aaagccttga acaactaacc actgacacat ctgagcagaa cgaatgccat gatgacacag 2941 cacaagggtt tcagcgtgag gatctaagcg agtgttgact tgatctgccc agtcaggaaa 3001 ctcactcaaa ggtaaattga caaagccatc gatgtaggcg atcgccacct cttgtggttc 3061 acgcacgtcc acaagctgaa ttgggtctcc tgaagataag cgttgtgcca gttcctcaac 3121 gttaatttgg gtaatggatg gttcggaagg tttgcccatc atgaagtttt gataattttt 3181 ctatactata ttatattcac aaaaaatctt cgttttttgg ggttggtagg tgcgtaagtg 3241 ctgtttcatt caagaactga aaaattcggt tttaatgaaa cgggtatcag ccttaattgc 3301 agaattttct ctagagttcg gctggatgcg cgaaactgag catctaacaa ttagagataa 3361 catgattttt gatagacttg tgcacaagtt tcagtactaa actaatttct agtaacagaa 3421 ctaccattca taatttaatt tggtatttga ttattagtat ttgcacctct agcagtctgt 3481 ataaaaatta tgacacaacg ttcatttttg cgagttgtag cagtgggtat tatgatgctg 3541 ctattgattg gtatgactgg ttgtaacggg ttatccgcca aaaattccct aaccggtatc 3601 gctcctacag gacagtcaga tgctgccata ttcgtgtcta aacaagcacc agttatggtg 3661 tcgatgctcg tgaatccaga acgcttgcag gcgtttgagc gtgatggaga actgtcaaaa 3721 ttgaaaacaa gattactggc gaatacgggt atagattacc aacaggaaat taaaccttgg 3781 ttagggaacg aaataacatt agctgtcaca agcttagata ttgatcacga ctctgagaac 3841 ggacaacagc cagggtattt aatggcactt gcaaccactc aacccgagaa aagccgtgaa 3901 tttgttcagt tattgttttc taagcgggct ttggctgggg caaacttagc 
aactgaagaa 3961 tacaaaggcg tgaagctgat ttctgacaat cagattcctt caacgtctga tcacaacgaa 4021 ggagtcaaaa accaaaatag tcttgctggt gcagttgtag gtaatagctt tgtcttgttt 4081 gccaatcacc caaaagtgct gcgagaggca attaataatg tacaggtatc cgatttgaat 4141 ttaaccagtt ccggcaaata ccaaaaagca accaaacaac tgtccaaaga agcacaggct 4201 gttgcttttc tcaatctccc cgtagtagca aaatggcaga atctcaaacc cgatgctcaa 4261 acttatgaca accaaattat ttccttagta tcaaacccca aaggattgct cgctgaaact 4321 gcttttttgg ctaaaaaaga aacctcaccc ccagctgcac aactctctaa acctgtgggt 4381 gcgttggatt atatcccagc ttcagcaagt ttagcagtag caggagcaaa tttaagcggt 4441 ttgggtgaca gtgatttagc acaactttgg caacaagtca cagctagcct atctgcttcc 4501 acagaagatg tgatttctaa attcctcccg gtggcagatg ttcaaaaacg ttccggcata 4561 aactggagag aggatatttt caactcggtg caaggagaat acgctatagg attgttaccc 4621 cgtgcagaac aaacaaatcc cgattggatt tttgtagcgg aaaaatccga ggggacactc 4681 gcagcaattt cccgtttgga tgaactcgcc tcatcagaaa ggctttcgct gaattccttc 4741 accctaaatg accaaaaaat ctctgcttgg gcacagctaa aaactgctat caataaatct 4801 agtgaagcaa aagagcgaga attgtttaca attcaagcaa atgttctagc agcgcgtgct 4861 gacattggaa attacgagat ttttgcatct tctgttgagg caatcaatgc agctttgacg 4921 gctaaggaga accccttagt taagaatcgt gatttccaag acagtatcgc tactattcct 4981 gaaccgaacc aaggttatgt gtatatcgac tggacaaaga gtcaggatat tttagagcgt 5041 caactaccaa ttctaaagtt tgtggaggtg ctgggcaagc cgtttttcaa aaatttgcga 5101 tcgctcaccc tcagtagtta tggtagtgac acaggattgc tcaaagccgg catatttttt 5161 caatttaatc gttagtcaaa tcccgaaatc tctataaacc aagctgtagc tattcatggt 5221 tcatggagta cagagggtgc agggaaagac ttagtacatt tctagtcttt ccctgctttc 5281 ttatcattct gtactttcag gtttggtgta ggtattggta tttcaccaac ttataccaat 5341 ttcagaaaag aatgcgacag atgggtagat gggtagatag gtaggttggg gagccactgc 5401 gttgggcggc tttgccgact tgtagacgcc cagagggcgg cttcccgtag ggtagcaagt 5461 ggcgtttgaa cgcagtgaaa cccaacaaaa tcaaaatttt ggagttgggt ttcgttcctc 5521 aacccaacct acatgtatag ctacatgtgt taccaagggt ttaatagcag tcaccagatt 5581 tcatgtcatc taactcaaga atccctatga atttaaaata agtggttgga aactgaaaag 
5641 caaaactaaa acaaaaagca ctggataatt agccgtcgtt ccagtgcctc tgcaattttt 5701 acggttgact accctgtcta atgtttggtc aagtaacctc gaagtttttt cttaatacta 5761 cttaattacc aattatgctg cttccgcatc cgtaaccaac accattcatg caattcatca 5821 tattatttct cattcgcata tcttctatta tcccttgcat ctctctttct ctcccttgca 5881 tgtatcttct tttcatttca gcaagtgatt gccgttgtgc aggagttaaa ttattttcgt 5941 ttacttgaga tgcttgctgt ccgttaacat ttgactgtaa aacgtgggtg cttataagtt 6001 tccatccacc cggtgttcgt ttccagatat cttggtattg acttacattt gaaaaaggga 6061 caggaacttg aggattgttc gggtttgaac caattgcact tgtatatcca atacctatca 6121 ctgttgcagt ttgcccattg atttggattt gtttaatctc atcgtgtgcc ttaatttgac 6181 gtatattctt aaaatcgttt tgtgtttgtt ggcgctcctg ctccagattt gttaactttc 6241 cattaggtct aattatggtg aattcaggag catagtcagc aaatatttga ttaacatcct 6301 tacgttcaat agcagcgtta taagcagcat aggctttcat gattgcttgc ttatcagctt 6361 gttcatcagc ataaacaaca ctagtcattg caaacgttgt agctagcaga gtagtaaaag 6421 ttatagcagg tagtctcata aacaattatc cctaaaggta aaaagcaaag gattttagca 6481 gtataactac catattacaa tggccgtata aacacttaat tacaaaaata taaagtttcc 6541 gcaactaaaa agctgcaagc ccttagaata ctgagttagc agctttttaa ttatggtatc 6601 aatattctaa aaagtaatta gctagtttga atacgtctca cattcaaact agatacaact 6661 cattagtcaa tgagttttcg taattcctcc agtagtacca ttttgcaaat tctcagcctc 6721 cagtaccgca tctgggggct tttaattgta agctgtggac acagtagttt tttgtctgac 6781 gatgaacgtc aatcggtacg gacgcgccaa aattctcaca cagcaagaga tacagctagt 6841 ttttgcccaa gggtttgact cagagcgtga taaaacccta ttcggtgtat gcctgtttac 6901 ggctgcaaga attcgtgaag cgtgtaccaa tcatttttca aattggtatt acaaccaacg 6961 attttattca tggcacctag aacgctaaaa tagagtctga gttgattggc gaaactaggg 7021 tacaacaacc gtgctatcta tattccttgc tgacttccgc atcatctttg aacgcgaccc 7081 agctgctcgt aactggttgg aagtgttgtt ttgctacccc ggtttgcaag ccttactgtt 7141 atatcggttg gctcattggc tgcatatcat tcacattccc cttattcctc gcctgatttc 7201 acacatagcc cgatttttaa ccggagttga aatccaccct ggtgcaacga ttggtcatag 7261 tgtttttatt gaccacggaa tgggtgtggt gattggggag acggcaattg tgggagacta 7321 
taccttaatt tatcaaggtg tcacccttgg cggtactggt aaggaatgcg gtaagcgcca 7381 tccaactgta ggagaaaatg ttgttgttgg agccggagcc aaggtactcg gtaatatcga 7441 aattggcaac aatgtccgca ttggtgctgg atcagttgtc ttgcgcgacg tgccaccaga 7501 ttgtactgta gttggcgttc ctggtcgaat tctataccgt tctggcgttc gagtcaatcc 7561 cctggaacac ggaagtttac cagatgctga agcccaagtc atccgtgctt tagtagaccg 7621 catcgagcaa ttggaacaaa aaattgaaca gttgcaacaa ccgcagcaaa aagttttagt 7681 tcctttcgtt aactcatcat tgtcaacacc tcagacctca aattcccaca tattaaaaga 7741 tgaggcatct tcagaaatgc gcacttgttg tcttttagaa aataaggtga ttgaagagtt 7801 tttggatggt tctgggattt aacgaggaag ggagtcaata taagcttttc catgattctg 7861 tagacgtgca ccggctttcc ggagggtatg ggtgctttgt tttcgtcgtc ctgtactagg 7921 tttctgggtt tggattcgtt gcgtaagtta gcaaattctt gcttgctgga agatttgttg 7981 aggtgtaatt tgtagtccag gcaataagga atcttctaaa ctatcagtac cgcgcttggt 8041 ttgagggaga gcatgttcag gataaaaaat agtaatagtt tttgctctag tatcaataat 8101 ccaaactcgt tgaactttag cttttaaata atctgttgct ttttctgtca tttcaccaaa 8161 agtttgacca ggtgagatta tttcaataac tagttcgggt gcaacaggac aagcgtcatc 8221 ttgcaaccaa tcagcagcaa gacggttata agaaatataa gtcaaatctg ctacaggaac 8281 ccagttttgt tgatttcgtg ttagcttaat tgcccattca atgacaaccc gtccctttcc 8341 ttctgcccat gcagacaata gtataaataa ggctcctgtt atagaactat gaaaaaattt 8401 tggtgatatt tcatcgtttt taaatttagg aacagcttct ccctcgatta gttcgtaggt 8461 gatatctcct tcgggaagtg cgaggaattc ttcgagggtt agttgtcttt ggagtttaat 8521 catggttcat aatttttcag gaagatgact ctattttaga tttttaataa ttaccaagaa 8581 agaactcaga attatagcgg ttatcagttg ggtgcagtac atcaaaaaat tagcatatga 8641 accaatcact gcttgactgt cctagaagta gaagcaattc gacagcaata aaactttaac 8701 aaaagcagaa attatttacg gattaattgc agtcgtagac catgtgtgag aattatctct 8761 cagataacac caacgatttt ggggattgcc acgcaattct aggtgatcca cttgcacggg 8821 atcgagcaat aacaagcaaa agttttgtgg aggatgagcg ggatcagcaa cagaagatga 8881 gaaagcttct tgctcagctc taggttttcc aggatgaggc caagtaaatt gtacacgagc 8941 attgtctgaa aggtcttgcc acgtttgctg acgagctttt tgcagtcctg agtcaggatg 9001 attggcatcg 
atgagagaca attctccagc aatgcggaac tgttcgcgag tgacactaaa 9061 gtaccagcat acttctgtcc aaggttgatg gagtatttgg tcgaattttt cgctgcgtgt 9121 gtcagtgatc attttcagca aattagtatc gcctagaaag ccacgaaaca caacagtacg 9181 attagctggg cgtccgtcag tttggactgt cgcaagttga aagtagcgcg agtatggctg 9241 gctacggtta cgatgaaggg cacgggcaag aggagaacgc caaggggcga taagtaattc 9301 aaaattcaaa atttaaaatt caaaattaaa agttcaaaaa gacatagact agcgtcccgg 9361 caaagatctc acctgtggac tgtgtgcaaa ttaaaggcgt gacggcttag acataggaaa 9421 attatactgc aatgagtcta gccaagcacg cagcgcaaat ctcacaattt ttcaatcttt 9481 atcttctcat tgcacttggc gtatagtcgt aatctatgtc ggtgagcaac aaggcttgat 9541 tctctatggt tcgtgagctt gaaaaaaaac gccagggtgc aaaatttcca gaaactgcgc 9601 cagctgccaa tccagtcttt tttagaacct atagccgtcg tcaagaggct ggggtgaggg 9661 agacttggga tcaggtgtgc gatcgcacta tccaaggctt catcactctt gggaaattac 9721 ttccacacga agctgatatc ctacaacgga tgcagcgaaa cttgaaagca ttacccagcg 9781 gacgttggtt atgggttggc ggtacaaatt ggatcaaaga gcaaaaaaat ttttctgggg 9841 cttataactg tacgtctacc aacctgcaag actggagtgc tttcgggttg atgatggatt 9901 tggcaatgat gggctgcgga actggagccg ttttagaacc acaatatatc agcaagttgc 9961 cccctatccg taatcacctt catgtacaag tgcaaggtga aattggtatg actcctccta 10021 ggagacgtcg ccagcagaca gaagtcataa tagaaagcaa tctcgttact ctttacgttg 10081 gagatagccg tcaaggttgg gtgcaatctt atcaaagttt gttggaactt tcaaccgatg 10141 aacgattttc acaagatgtt caagttattg ttgatcttag cgacgtacgt caagcaggag 10201 aacctctcaa cggttttgga ggggttgcga atcctgtaaa attaccagaa ctgtatgagc 10261 attgtgcatc tatcctgaat aaagctgtag gacgacagtt aaattctgtt gaatgctgtt 10321 tattaattga tcgagcagct gtcacaattg ttgccggcaa tattaggcgc tcagctggaa 10381 tgcgccaagg ttatagtgag gataatttgt ttgcagatgc taaggcaaat ctttggcagc 10441 aagatgagaa tggcaactgg cgcatcgatc cagagcgtga ttctctgagg atggcaaacc 10501 atactagagt ttttcatcaa aagcctacat taaaagaatg tattgatgct gtccgcaaac 10561 aatactacag cggtgaaggt gcgattcaat gggctggtga agctgtcgct agagctaatt 10621 gtgacctttt gaaaaatcac gaacaaaaaa tagattttct caaagcttat gctctgggta 10681 aagccaaaga 
ttggttgcaa gagaattatc ctcaaattcc tgaaagtgaa ttagagcatc 10741 gtttggctcg ttacggttta aacccgtgtg gtagatagtt ttgcctcacg ttaaaaaccg 10801 gggaaattca aggaaggcta agcagaaaag gtgagttttg aattgttatt taaaattgaa 10861 aaaattatta ttaactaact ctgaaatgac tcattaatga ctgtatgcta atcttgagct 10921 aaccttatta tttaaaagtc taaggtagtg caacgcatag gagttgaacc ttttcagtgc 10981 atagttaagc agtgatcact gaagaactgg taactgaaga gaatataata cccccacgag 11041 tccccggcta ctaaatggta gaaaaggtat gctgaactga actggaattg accagttgta 11101 tcccagcaat gggtatgagg aaaacctcca gaactagagg ataacaagcc tttaggataa 11161 caaaattgga aattattggc tcaaattttc attgcaatct ctcagaaatt cacctcaacc 11221 aaattgaccc aaataactac aaagaacagg aagaagcttt cactgctgga gcactttctg 11281 tagcagcact tctgcatcac aaatttattg aaccccgcta ccaatacagc cgcgaattag 11341 acccaattgt cggtgtttct tttactgggt tatttgattt ctttgttcat gcttttgggg 11401 ttgattggtt acgttggtgg gaaaaaggaa gacctgcaac tcctgaagga ttggcgttta 11461 agcgtcaaga ggaggaatac ctaagttttt ggaaagagac tgtacatcgt gttgtttggg 11521 attactgcga tcgccacggc ttaaaacgtc caaaccgctg caccacagtt caaccaagcg 11581 gtacgaagtc tctgttgaca ggtgctagcc ccggatggca tccccccaaa gcacaaagat 11641 ttatacgtcg gattacctgc cgcaaaaatg atcccgtcgc tttggcttgt cttgagtatg 11701 ggtacaatat tataccctcc caatcagata aagatgacga cggtacattg ttgaatgacc 11761 cctttgatcc gcgagtcagc gaatggttgg tggaaatccc ggtcgctgta tcctgggctg 11821 atttaccagg tgctgataaa atagatatca gtcaatttag tgcgatcgcc caaatagatt 11881 tctatatgca ggtacagaga ttttatgtca cacataacac ctctgcgacc attgagttac 11941 gagaaaatga agttgaaact ttaggaactc ggatatacga agccatcaaa aatgatgagg 12001 gctacatcag tgctgcactt ctggcacggt ttgacgacct tcaaactttt cctcgcttac 12061 cttttgagcc gataacaaaa caacggtacg aacagctgat gaaagaagtg gaaatgcacc 12121 gcaaaacaga agattttcat gctgttttaa gccgttacga tttcggcgat ttaatcgaag 12181 caggaccagc aggttgcgac tctgataagt gcatgatgcc tgaacaaacc ttaatgtaaa 12241 taattaggca gtccgccttt gggggttccc cttcgttttg caaccagccg ttccgttgta 12301 ggaactaccg gtggcgcagc ctaggcgtgg agatacttat aacggtatcg acaacaaaag 
12361 cggtggttgt gagggaatct accatcaccg cccaatacgg ttttcatcag cttacagcct 12421 aaagccttgg ttacaagaac aaagcccaca taagtgggct actcctttcc cgcccgaaga 12481 ttaccgttta ggcagcaggg gagccgggga gcaggggagc cggggagcag aaatgacatc 12541 ggtaatctta caccaccgaa gggagtaata caaattgagg ctacccgacc aactagaagt 12601 taagagagta tgacttaatc gctaaatagt ttgctaaact tcctcatagt ttaccgctat 12661 gagaagattt ctgtggaaag ttgcagagtt tggggatctc attggggttt ccccctccac 12721 tttacgacga tgggaaaaag aagagaagtt gattccagaa cgaacactag ggaatcaacg 12781 catttacagt tcttaggtta cgatcgcctt aataacggtc atttttaact gagtcatagg 12841 agaaccctca tgttcgttga tgaattgaaa ccaatatttc aacaattcac ccaccatcca 12901 ctttcttttc tgggcggttt cgtttctggt ttgctacgac tcaaccttgc tgacgatcct 12961 gttaaaagct ggctctctca acaactcgac tcaactagca acacgacttt taccactcaa 13021 tctgctgaag gacataatgg caaagctagt ggtcctcagt cgattagcat tgaataaagt 13081 tcaaatctac caatatttag gtaaatatca ccaaaataag taaagatact aaggaaacct 13141 tagtatcttt gcaagaaaca gcttaaaaaa tcaaacaaaa ttttattgac tatgtagcgc 13201 ttggtggtgt ggcgtgcaat acacacggag ctagtgaaag tgcatgcact ttcacattat 13261 tcgtggtgat atatgtgatt caaatgagaa ttgctatata ttctacaaca acctataagt 13321 taagctaatt taaattgata tgtgattgta aaatctcata ctttgaaaat tgacaaaatc 13381 agttatcata tgtagcgtat gcgaaaattc agtccagccg gtggcgaagt cgcaattaag 13441 attcaacttc gttgtatctt tcctgcgctc tttgctattg gtgaatgagg cgaaacggaa 13501 gcagtttctg ctgtttgttt ccctatattc acgctaaatt cttcgctatt cgttgcgact 13561 tacgtgattc acatgactat ttacgttgga aatctctcct accgcgccac agaagcagat 13621 ttaaaagtag tatttgcaga atatggcgag gtgaaaagag ttgtcttacc tactgaccgc 13681 gaaactggcc gcttgcgcgg ttttgccttt gtagacatga gtgaagacgc ccaagaggat 13741 gcagctatta cagaattaga tggtgctgaa tggatgggtc gtcaactcag ggttaataag 13801 tctaaaccgc gcgaggagga gcgacgaggt agttgggtga aaagataata gtctcaatag 13861 cgatgaataa tcatgataac agattgtttc ctagttatgc aagctattgc tgaaatatct 13921 agaaacaaga aatcaaacaa ttcgatgaag tgaaaagtta tgcccgacga gttgtgactc 13981 taagatcata gtcataatga gtcggttttt ccaaccctat 
ggtgtactta gcaatcttaa 14041 agactaagga agtggaaaat ttgcttattt gtattttttc agtgactgta tctgtataag 14101 tgtacccaac gagtgggaac agtaagaaaa ctctggttgc gaatagatgt ttctttcttg 14161 tacaggtagg caagtaaatt tacagattat tctcacagga aacactattg tatctaaatt 14221 cttcgccaga ctgagtagag ttactcaagt atatcattcg agactggtga ggcagctttg 14281 catagaccca tagctgattc atttctttag tttatctaaa atcacattta agttgtcaaa 14341 ttacctttct aagatagagg cagcaaccac aagttgcaac ccacgggatg aaccacaaaa 14401 acacctagtc tctagggagg ttaaaaaatg gccagagcgc cggaatcttt ggagttaaat 14461 actaacgaag tggcagacaa ggcgttagat cgggattgta caacattatc ccgccatgtc 14521 ctgcagcaac ttcagagctt ttcacctcaa gcacaggatc tgagtgcgct catgaatcgg 14581 atcgccttag ctggcaaact gattgctcgt cgcctcagcc gtgcaggttt aatggaaggg 14641 gttctcggat ttacagggga agtcaatgtg cagggagaat ccgtcaaaaa gatggatgtc 14701 tatgccaatg atgtctttat ctcagttttc aagcaaagcg gtttagtttg tcgcctagct 14761 tccgaggaaa tggaaaaacc ctactacatt ccggaaaact gccccatagg tcgctacact 14821 cttttgtatg accccataga tggctcatcc aacaccgata caaatctcag tttgggttcc 14881 attttctcaa ttcggcaaca ggaaggaagt gacctcaatg gtgaagcggc tgatctgcta 14941 gcctctggac gccagcaaat tgcagcgggg tacatattat atggccccag cacaatgcta 15001 gtctatacta ttggtagggg agttcattcg tttacccttg acccaagttt aggggagttt 15061 atcctcacag aagaaaacat ccggattcct gaccacggtt ccgtatacag cgtcaacgaa 15121 ggaaactttt ggcagtggga tgaaccgatt cgggagtatg ttcgctatgt ccacagaaca 15181 gaaggttaca ccgctcgcta tagtggggca atggtaagtg acatccacag aattttgctt 15241 caaggcggtg tgtttctcta cccagggaca cttccgaaac cagaaggtaa actgcgctta 15301 ctttatgaat ccgctcccct agcctttgtg attgagcaag caggtggtcg cgctactacg 15361 ggacacatgg atatcttaga ggtagtagcc aaaaaactgc accagcggac acctttgatt 15421 attggaagcc caaaagatgt tgcaaaggta gagtctttca ttcagaacgg tcactagaag 15481 agcgtaaaaa gcgcagtttc aagtgtagtc agcacgtaag ctatgagtaa gcggttagtc 15541 gttagtcgtg atgagttagt agaacgacta acaactaaca acttgacaaa cgtcacttag 15601 atacattaga agcaggagtc aaacaaatac agctatggca actaatcaat tactggaaat 15661 taaaaactac ggtcaaagca 
tctggatgga taatttgagc cgtgatatta ttcaatctgg 15721 cgaactcaaa gacctggttg aaaataaagg aatctgtggg attacctcca acccagccat 15781 ctttgaaaaa gcgatcgccg gaaacgcgat ttatgatgct gatatagaag caggaatcaa 15841 agctggatta ccaacagaca aaatttatga atcgctagct tttgcagata tccgcaacgc 15901 ctgtgatatt ctacgccccg tgtatgatgc aaccaatgga ttggatggtt acgtcagcat 15961 cgaagttcca ccaaacattg ctgatgatac acaggcaaca atcaaagaag cccgtcgcta 16021 ttaccaagaa attggtcggg aaaatgtcat gattaaaatt cccggtacaa aagcgggttt 16081 acctgcagtt gaacaagtca tcgctgatgg cattaatgtc aacattacgc tgttgttctc 16141 tgttgacagc tacatcaaaa cagcttgggc ttacattagt ggtttagaaa aacgggcagc 16201 cgaaggtaag gatattagca aaattgcttc tgtcgctagc ttcttcctca gccgaattga 16261 cagcaatatt gatggtaaga ttgacgcgaa attgaaaaaa ggcgttgatg acattaccgt 16321 agaagccaag ctgaaagatg tcaaaggaaa agtggcgatc gccaacgcca aaattgctta 16381 ccaggaatac aaaaagataa ttcagacaga cagctggaaa gcattagcag ccaagggagc 16441 aaaagtacag cgcctcctgt gggcaagcac cagcaccaaa gaccccaact acagcgacgt 16501 gatgtatgtt gatgagttgg ttggtcccga cactgtcaac accttgccac ctgcgacaat 16561 tgatgcttgt gctgaccact gcgacgtcgc cagccgcatt gaaacacgga tagaagaagc 16621 ttaccaactg atagaaagcc tcaaagatcc agacatcaac atcgatatca acaaagtgat 16681 ggacgaactg cttgttgaag gtatcgataa gtttgtcaag ccctttgaat cgctgatgag 16741 ctccctggaa agcaaggtta aacagttgtc tcctgtgtag gaacttaaca gttaacagtt 16801 atcagttatc agtgaacagt gagaccagtg cgaatgacgg ctttccctca cttggcaact 16861 ggcgttagcc gtaaggcgtg cgctttgcgc atacccgaag ggttatcaat gacttaatag 16921 ctgataacta gttgataact caatacggtt cagttagagc caaaaacctt aaaattagta 16981 ggttggggag ccacttgcat gggcgggttt cccgacttga gcaaagtggc gtttgagcgg 17041 agcgaaaccc aacacaaacg ttaattgttg ggtttcgttc ct // LOCUS NODE_1948_length_17032_cov_5.41261717032 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 17032) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 17032) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17032 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(138..944) /locus_tag="DP116_16950" CDS complement(138..944) /locus_tag="DP116_16950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="inositol monophosphatase" /protein_id="PRJNA477356:DP116_16950" /translation="MTEFWTTILDFAQTTTARVGNQLMQDFGQVQALEKADGTLVTQS DKWADQEIRNAIASTFPDHGILSEEDDKVFGGTEWCWVIDPLDGTTNFTRGIPIWSIS LGLLYQGTPVFGYVAVPPLREAFHGYWGSIPELELPTGAFRNYHPIHTSHDAPSGNHF FNLCSRSTSVIQKDFPCKIRMLGVASYNFLTVAAGATLGGIEATPKVWDIAGAWVIVQ AAGGSWRSLKSEPFPLIPQQDYTTRSFPTLVVSRRELLPVFAPFVERVKI" gene complement(1010..2431) /locus_tag="DP116_16955" CDS complement(1010..2431) /locus_tag="DP116_16955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_16955" /translation="MASGDLFDSPTNSLAMPKVNVLTMFRLGLFQMGLSMMSILTLGV LNRVMIQEIAIPATLVGVVLAIQLFVSPSRVWFGQISDAKPIWGYHRTTYVWAGAAIF AVASFLAVQVLWQLDTVVNNVGGWAWTTQTIGWTALLALVFAFYGLAICASGTAFGAL LVDVSEENNRSKVVGVVWSMLMVGIIIGAIISSSLLKQSTPETLQASVNRLFLVVPAI VFGLAIVATFGVEKKYSQYTTRSTIISREDSITLGKAWQILTASPQTGLFFTFLVVMT LGLFMQDPVLEPYGGQVFRMPLAESTKLNIFYGIGVLIAYGVAGFFIVPRLGKRRTAR LGCVLVASCAILLGISGFSANPALLKLALLLFGLATGILTTAAVTLMLDLTAAETAGT FIGAWGLAQAMSRGLAVVAGGAILDLSRKFLPNLVLAYGLVFVLEAIILLLAISFLNR INVREFQTNAKQAIASILESELD" gene 2584..3846 /locus_tag="DP116_16960" CDS 2584..3846 /locus_tag="DP116_16960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015175346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_16960" /translation="MLSLTYEYKAMPTNEQIQQIEHTLTVCRKVWNFALRERKDWLNS RKCPINSCSIISEYITPADVPYPNYYEQANALTRAKAEFPELATVHSQVLQQVLRKLE TAFVDMSRKKMGFPRFKNKYRMRSFVYPQLGKGQVLKDNQIKLPQLGWMEYVKSREIP NGFKVKQVRVVRKASGYFLMLTLECDVNVPDTVASGHPRGIDLGLDKFAATSDGELIE RPRFLNTLHRKLKLLQRRLKNKQKGSNNRHKLNRKIARLHQRISDTRKNWHFKLAHKL CNDAGMMFVEDIDFRTWAKGMLGKHTLDAGFGQFVEILKWVCWKRGVYFDKVHKDYTS QVCPQCDTHTGKKELKDRIHSCQSCGYTTHRDVASAQVIRNRGVNALGRSVEENACGD GLAGTGNRLVKSQRSKKKGGEARLKPAS" gene 3853..4746 /locus_tag="DP116_16965" CDS 3853..4746 /locus_tag="DP116_16965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873589.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_16965" /translation="MRIPAHLCRGVRQGNNYGADQLQAAFTAALEVGITFFDTAEIYG FGLSEEFLGQFLKKTNQQVQIATKYGPFPWRFMGQSVADALTDSLKRLQLGQVPLYQV HWPFTFFMSQETLMNALADEVKRGRIEAVGVSNYSAQQMREAHQILAARGVPIAVNQV RYSLLTRQIETNGILKTARELGVTILAYSPLAQGLLTGKYTADSANNLKDARRIDSRF SQEGLRKIEPVISLLRQLGEKHGRTPAQVALNWLIAQGNVIPIAGAKTAEQVRQNAGA LGWRLDDDEIRLLEEVSRSYK" gene complement(5063..5827) /locus_tag="DP116_16970" CDS complement(5063..5827) /locus_tag="DP116_16970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861782.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="circadian clock protein KaiB" /protein_id="PRJNA477356:DP116_16970" /translation="MTSDKPLLPQLFKGIALFTPGGDLIYCIDPSKQGRWHLHLCITL QEILDLPEPPHFLVPCYTATIDHWLNPRTQQIQTFAEAYPSVMRHQAVLNAVFGTGEL TWQSAPWQEGLCDHLVLTTYRSTFPQLWEDHDLVMRLDLWEPTPSYYQPTTSAQQPQP KTKGYVLRLFVAGHSAATERILQNLHELLEKNLGNPYTLKVIDVLTNPEQAESNQVSA TPTLVKVWPHPVRRIVGDLDNVEKILQMLANQDNSM" gene 6636..7013 /locus_tag="DP116_16975" CDS 6636..7013 /locus_tag="DP116_16975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655157.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_16975" /translation="MALSQTSHFGKTHSLLMMKSFLIWTFTLAVCLLVVGFPLVVLMA TVGCLLSVVLQSVMPVSAVLLVAGGLIMFNVMAVVMAAAALTLKGVHPSEIKWLSWLH GETENIQTTAVYASCPLTCEIKP" gene 7171..8286 /locus_tag="DP116_16980" CDS 7171..8286 /locus_tag="DP116_16980" /EC_number="5.1.3.14" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748499.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-N-acetylglucosamine 2-epimerase (non-hydrolyzing)" /protein_id="PRJNA477356:DP116_16980" /translation="MTKKQIYIILGTRPEAIKLAPVIQVFQNSPSLNTSVILTGQHRE MVEQVMQLFNLKATHDLEIMQPKQSLSDITCRSLRGLEGLFQESKPDLVIVQGDTTTA FAGTLAAFYQKIPVGHVEAGLRTDDLFNPYPEEANRRLISQLTQLHFAPTPIAVENLK RSGVLGEIHLTGNTVIDALLTVAASVPGCDIPGLEWEKYRVLLATVHRRENWGEPLYD IAQGFLSLLDKFGDTALLLPLHRNPIVREPLQTLLGNHPRVFLTEPLDYAELVGAIMR SHLLLTDSGGLQEEAPSLGKPVLVLRETTERPEAVTAGTAKLVGTQTENIFATAAQLL SDSTAYETMANAINPFGDGHAAKRILQIVQNYLGLSP" gene complement(8319..9317) /locus_tag="DP116_16985" CDS complement(8319..9317) /locus_tag="DP116_16985" /inference="COORDINATES: protein motif:HMM:PF01471.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_16985" /translation="MDSTYAAVTADTEYHLPEFKLTSSGQKYFKSAWLTLALITALLG ILAQSQAATAAYYGPGRYSVSTNGSCLNVRTGPSTSYRSAKCDSNGSPLPRVVGYRSG FARLSTGYYVSANWINVRAGREYTLRRVPTQDTYYNDIDGSATLRKGSQGQAVAELQR ALGNVTVTGYYGSFTETAVKNFQQRNGLRPDGVAGSQTLSYLGFGNIGSRPSIDYPPY PNDGFGARGYSPYYNDNFSRNITLRRGSRGKAVVELQRALANVPVTGYYDASTQRAVK NFQASLGFRPDGVATPETQSYLGVGNVSIRRYSSGYPGNYSYSGYSGYSGISVGGP" gene complement(9575..10219) /locus_tag="DP116_16990" CDS complement(9575..10219) /locus_tag="DP116_16990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_16990" /translation="MEYLAYSLMDSAYAEATKDTEFSLPELKLPELKLPELKLKFNWN RNFKSAWLTFALIGALLGILAQAQTATAAYNGPGNNYYVKTKGSDLLVRKSPSSSSAA VASYRNGSRLPKVVGYANGFAKLSNGYYVGANWIGNKPGKGYTRGPGVGGPYTLSLGS QGSTVAKLQETLGLTPTAYYGSITADAVKNYQRRNGLLADGVAGPQTLSALGVY" gene complement(10583..11518) /locus_tag="DP116_16995" CDS complement(10583..11518) /locus_tag="DP116_16995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315235.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="M23 family peptidase" /protein_id="PRJNA477356:DP116_16995" /translation="MKKVTITQICTYSLIGLTSFIAVLSTNTQVLTQTAKSSNTPQKS FPSNLIWPTQGIVSQGFRKYQHEGIDIAAASGTPVVAAASGTVVKAGWNEWGLGNVVV VEHPDGSVTVYGHNSRLLVKQGQQVNQGQVIAEMGSTGNSTAPHLHFEVRKNHRFAVD PLTTLPSLIAGKIPQQQMTSPTTVASQANHEINQVSPAQTPQQVSSSQPIPVAVGSVM ADTKCNGTTVIEGETANIFVKVCQENGQLFYIGQLKQDPSQPVRLPARSVGSSQYRAD NGSFSYVVSSDKVEVWRNGQQVRSDTFSISKQFGK" gene 11605..11727 /locus_tag="DP116_17000" /pseudo CDS 11605..11727 /locus_tag="DP116_17000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741351.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="serine protease" gene 11735..14797 /locus_tag="DP116_17005" CDS 11735..14797 /locus_tag="DP116_17005" /inference="COORDINATES: protein motif:HMM:PF00931.20,HMM:PF13646.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17005" /translation="MAQVSGKIIQELLLILRPFMRDKQQRQAYLELALGTNSPVLNLL VWDTSADVFIPQMVNTLVVFGEITPGKPALCALLEVVRGNVGLDKQLEIDNLLQGIRE ELQRSPTNSSRVSPLFPHPEFQTYLENIAQKYQQWSNMYTLTDAEGKVFDVGLMVQTR QPKQREGMPGEVKQETERLPVLEAIRKYAINHVLLIGKPGSGKSTALQRLLWEEAQAA IQGEKRKIPVLVELRYWDTSVETLICKFLRKHKHRIDISKIEDLLIDEELLLLMDGLN ELPSDEARDKVARFRQDYPETPMIFTTRDLGVGGSLGIDKQLEMQPLTEAQMQEFVRK YLPEQGEQMLQQLGNRLRELGETPLILKMLCDVFFQKREIPKSRGDLFRQFDSTVNNL KEEKETVPVAEGLRLWKKDLLQHLAFVMMQPENLQANPTDFRLLISRRQAETILEDFL KGRVEYPAQKAKDWLEGLLKHCLVESKASESEQVLIQFHHQLFQEYYAAEYFLRLLPN LSDAKLKRDYLNYLKWTESIAIALALVEDEALAVQVVRLALDVDLMLGARLAGEVKEE FHEKTVGLVLELGVHQRLKIELLGMTRSEQARSPLQQARDNRDYDNVYSLAEALSCVG DEQLMSQLLEWEDKHIAEWNQAKFYEDYNNHFLCERIASELENIESSDVVVLNLVKLL NDKKLINKKGSDEYLKPRNYQAQAVLGEGLFNQAISFLLKSLKHEDFRVRYHAALALG NIGSDAEACALFKIVEDENYFVRSGAVEALGRFRSYTVITPLIKALNDEKAFVRSRSA EALGKFRDYKVINALIQALKDEKFFVRSNVVKALGKMDYKQVLEHLIKALNDENSDVR QSAVIALGELAEVNHNNQLITDALNYALNDEESSISSSATDVLKSIKKNRLIKLEKIK KNRLIDSVLDKESLVYSYPVQVEEIVVSKFLPQMQELLLVAIYEMKDLILQIQEIYKF YNHEIFHSPPIEETKSTSTSSTTIINSEIVQIIEKNEGDVIGKKTTET" gene 14933..15229 /locus_tag="DP116_17010" CDS 14933..15229 /locus_tag="DP116_17010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004163226.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17010" /translation="MEENQGTVIGKNVAEKTPAEAAKEIQDLLAQLQTNYPTTTEYEK QVFVNKFNDEVKTNSRVRDVILAGGIELIKILCPPLGIPIEMGKRWLETAQKQK" gene complement(15446..16894) /locus_tag="DP116_17015" CDS complement(15446..16894) /locus_tag="DP116_17015" /EC_number="2.4.1.21" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315236.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycogen synthase GlgA" /protein_id="PRJNA477356:DP116_17015" /translation="MRILFVAAEAAPIAKVGGMGDVVGALPKILRTMGHDVRIFLPYY GMLPDKMEIPTEPIWWGSAMFQSFAVYETVLPGTDVPLYLFGHPAFSPRRIYAGEDED WRFTFFANGAAEFCWNYWKPEIVHCHDWHTGMIPVWMHQSPDISTVFTIHNLAYQGPW RWFLEKITWCPWYMQGHNTMAAAVQFADKVNTVSPTYAEQIQTAAYGETLEGLLSFIS GKLSGIINGIDTEVYDPPTDKYIAQTFTADTLEKRKANKIALQEEVGLEVNSNAFFIG MVTRLVEQKGIDLTLNILDRFLAYTDAQFVLLGTGDRYYETQMWQLASRYPGRMATYL LYNDALARRIYAGTDAFLMPSRFEPCGISQMMALRYGSVPIVRRTGGLVDTVFHHDPT NHAGTGYCFDRYEPLDLFTCMIRAWEGFRYKDYWQQLQQRGMNQDFSWYRSAKQYVNL YRSIYGLPPEEEEQPSQQEEAQVDQPVASGKS" BASE COUNT 5064 a 3557 c 3711 g 4700 t ORIGIN 1 caagttctga cgatgaatgt agaatcccct cgcctttagg caggggagtg tcaagggaat 61 atattcttag ctgcgaagtt ctgacctatg tgttcttctt aatctgtacc tttagcttat 121 caagctacga ttctgtctta aatttttact ctttccacaa agggagcaaa aaccggaagc 181 agttctcttc gactgacaac caatgttggg aaagagcgag ttgtgtaatc ttgttgtggt 241 attaaaggaa atggttctga cttgagcgat cgccagcttc ccccagctgc ttgaacaatc 301 acccaagcac ctgctatatc ccaaactttt ggcgtcgctt cgataccacc caacgtcgca 361 ccagcagcaa ctgtcaagaa gttatagcta gcaactccca acatccgaat tttacaggga 421 aagtctttct gtataactga ggtgctgcgc gaacagaggt taaagaagtg atttccactg 481 ggggcgtcat gactggtatg gataggatga taattgcgaa aagccccagt aggtagttct 541 aactcaggta tacttcccca gtagccgtga aaagcttctc gcaatggcgg gacagcaacg 601 taaccaaaaa ctggtgtgcc ttgatacaac aaacccaaag aaatcgacca aattggaatt 661 ccacgagtaa agttagttgt accatccaag gggtcaatca cccagcacca ttcggtacca 721 ccgaaaacct tatcatcttc ttcgctcagg ataccgtgat cagggaaagt agatgctatg 781 gcgtttcgta tttcctgatc tgcccattta tctgattgcg ttaccaaagt gccatctgct 841 ttttccaaag cctgtacttg cccaaagtct tgcattaact ggtttcccac tctagcagtg 901 gtggtttggg caaagtcaag aattgttgtc caaaattcag tcatactttt attttcacgc 961 aaatacgcca aggcgcgaag attttttgca gtcttggcgc gagacaaaat taatctagtt 1021 cgctttctaa aatagaggcg atcgcctgct tagcatttgt ctgaaattct ctcacattta 1081 tccggtttag aaacgaaatc gccagcagca ggattattgc ttctagaaca aacaccagtc 1141 
cataagctag caccaggttg ggcagaaact tacgactcag atctaagatg gcaccacctg 1201 caaccacagc cagtcctctc gacatcgctt gcgctagtcc ccaagcacca ataaatgtgc 1261 ctgcggtttc agctgctgtg agatccaaca ttaaagtgac tgcagcagtt gttaggatac 1321 cagttgctaa accgaacaat aacaaagcta gcttgagcaa cgctgggtta gcggaaaatc 1381 ctgatattcc aagtaatatt gcacacgatg ccactaagac acagccaagg cgtgcagttc 1441 ttcgcttacc caaacgcggc acaatgaaaa agcccgcgac accgtaggca atcagtacac 1501 ctatcccata aaaaatattc agtttagtgc tttcagccaa aggcatccgg aaaacctgac 1561 ctccatacgg ttctaaaact ggatcttgca taaacaagcc caatgtcatc accactaaaa 1621 aggtgaaaaa taaacctgtt tgcggactag ctgttaatat ttgccaagct ttgccaaggg 1681 taatgctatc ttcccggctg ataattgtag aacgggtggt gtactgagaa tactttttct 1741 ccacgccaaa ggttgctact atcgccaacc caaagacaat tgccggaacg acgagaaaca 1801 gcctgttaac cgatgcctgt aatgtctcgg gagttgattg cttgagcaag ctagaactga 1861 taattgcccc aataataatc cccaccatca gcatcgacca aacgacaccg acaactttgg 1921 aacggttgtt ttcttcagag acatcaacca acaaagcgcc aaaggcagta ccactagcac 1981 aaattgctaa accgtagaaa gcgaaaacta gagccaaaag tgcagtccag ccaattgttt 2041 gggttgtcca tgcccagcca ccaacattat taactacagt gtccaattgc cacaatacct 2101 gcacggctaa aaatgaggcg acagcaaata ttgctgctcc tgcccaaaca taggttgtgc 2161 gatgataacc ccatattggc ttggcatcgg atatttgacc aaaccaaacg cgagaaggag 2221 aaacaaataa ctgtattgcc agcaccactc ccaccagcgt tgctggaatg gctatttcct 2281 gaatcataac tctgttgagc acccctagag tcaggataga catcatgctc agccccattt 2341 gaaataagcc aagccgaaac atagtcaata cattgacctt tggcatcgcc aaggaatttg 2401 ttggagaatc gaataaatcg ccgcttgcca tagctacttt ttcactggaa atttctaagg 2461 atggcaggct gaaaaccctt tgcctttagg caagggatga aagccgccga cgcaggattt 2521 atcctgcggt gtcacacctc ttacaagata gtgtataatt ttgtaagagg aggtgaaaga 2581 acaatgttga gtttaactta cgaatacaaa gcaatgccca caaatgagca aattcaacag 2641 attgaacaca ctttaacggt gtgtcgcaaa gtatggaatt ttgctctgcg tgaacgcaaa 2701 gactggctta attctcgtaa gtgccctatc aactcttgtt ctatcatttc agagtacatt 2761 acacctgcgg atgtacctta ccctaattac tacgaacagg ctaacgcatt gactcgtgca 2821 aaagctgaat 
ttcccgaact ggcaacagtt cactctcaag tcctccaaca agtgctgaga 2881 aaattagaaa cagcttttgt ggatatgagt cgtaaaaaga tgggttttcc ccgattcaaa 2941 aacaagtacc ggatgcggtc ttttgtgtac ccgcaattag ggaaaggtca ggttctcaag 3001 gataatcaaa ttaagctacc tcagcttggt tggatggagt atgtcaagtc tcgcgaaata 3061 cctaatggct tcaaagttaa gcaagtcaga gttgttcgta aagcatcagg atatttcctg 3121 atgcttacac tggaatgtga cgttaatgtt cctgacactg tagcaagcgg ccatcctagg 3181 ggaattgatt taggtctaga taaatttgcg gcgaccagtg atggtgaatt gattgaacga 3241 cctcgctttt tgaatacact gcatcgcaag ctgaaattgc tgcaacgcag gctcaaaaat 3301 aaacagaagg ggtcaaacaa tcgtcataag ctgaatcgca aaatagcccg actccatcaa 3361 cgtatttcag atactcgtaa aaattggcat ttcaagttag cccacaaact ttgtaatgac 3421 gcgggaatga tgtttgttga agacatcgat ttccggactt gggcaaaagg aatgttgggc 3481 aaacacactc tagatgctgg atttggacaa ttcgttgaaa tccttaaatg ggtgtgttgg 3541 aagcggggcg tatattttga caaggtacat aaagactaca cctcgcaggt gtgcccacaa 3601 tgtgacacac atactggaaa gaaagaattg aaagacagga ttcattcctg tcaatcatgc 3661 ggctacacga cacatcgtga tgttgcatct gcacaagtga tccggaatag aggggtcaac 3721 gcgctgggac gcagcgtaga agaaaatgct tgtggagacg gtctggcggg gacgggaaac 3781 cgtctagtta agagtcaaag aagcaagaag aagggtggag aagcaaggct taagcctgct 3841 tcgtaacatc ttttgagaat ccccgcgcat ttatgccggg gagtacgtca aggcaataac 3901 tacggtgcag atcaattgca agcagctttt acagcagctt tagaagttgg tatcaccttc 3961 tttgatacag ctgaaattta tggatttgga ctttcagagg aatttttggg acaattctta 4021 aagaaaacca accaacaagt acaaattgca accaaatacg gtccttttcc ctggcgattt 4081 atgggtcagt ctgttgctga tgctctcaca gatagtctca aacgtctaca actagggcaa 4141 gttcctctct atcaggttca ttggcctttt acgtttttta tgagtcaaga aaccttgatg 4201 aatgctttgg cagatgaggt gaagcggggc agaattgaag cagtcggtgt tagtaattac 4261 tcagcacagc aaatgcggga agcccaccag atattagctg cccgtggtgt accaatagcc 4321 gtgaatcaag tccgttactc tttgctgact cgccagattg aaaccaatgg cattctcaaa 4381 actgcccgtg agttaggtgt gacaatcttg gcttatagtc ctttggctca aggcttactg 4441 actggtaaat acactgctga tagtgccaat aatctcaaag atgccagaag gatagactcg 4501 cgttttagtc aagaaggctt 
gcggaaaatt gaacctgtga tatctttgct acgccagcta 4561 ggagaaaaac acgggcgtac tcctgcccaa gttgccttaa actggttaat cgctcaagga 4621 aacgtcattc ctattgctgg ggcgaaaaca gccgaacagg tacgacagaa tgcaggtgct 4681 ttgggttgga gattggacga tgatgagatc agactgttag aagaagtcag tcgttcttac 4741 aagtgaaaac aagcaagtaa aaggaaatta tttcttttta ctttttattt tcctcgttcc 4801 ctgcctctgt cacgggatac ataatatcaa gtccggatga acacttataa tacttgtagg 4861 ttggggagcc agcgcgaatg acggctttcc ctccgtaggc gtctggcgtt tgaggaacga 4921 aacccaacac caaattatcg actgcattgg gtttgactgc gttcaaacgc cactttgctc 4981 aagtcgagcc actgccttgc gggggttccc cccgttgtgg caagtggcgt gggaaacccg 5041 cccatgcaag tggctcccca acctacattg agttgtcttg attagctaac atctgtaaaa 5101 ttttctctac attatccaaa tccccaacaa tgcgtcgaac aggatgaggc caaactttaa 5161 cgagggtggg agtcgcagaa acttgattgg attctgcttg ttctggattc gttaaaacgt 5221 caattacttt gagtgtgtaa gggttgccaa gatttttttc tagcagttcg tgtaaatttt 5281 gtaaaatacg ttcagtagca gcactatgtc cggcaacaaa caggcggaga acataacctt 5341 ttgtttttgg ttggggttgt tgtgcggacg ttgttggttg atagtacgaa ggagttggct 5401 cccaaaggtc gaggcgcata actaagtcgt gatcttccca aagctgagga aatgtagagc 5461 gatatgttgt caatactaag tgatcgcaca acccctcctg ccaaggtgca gattgccatg 5521 ttaactcccc tgtcccaaaa acagcgttca ggacggcttg atgtcgcatc actgacggat 5581 aagcttcggc aaaagtttgt atctgttgag tgcgtggatt taaccaatgg tctatagtcg 5641 ccgtgtagca aggtactaga aaatgaggcg gttctggtaa atccagaatt tcctgcaacg 5701 taatgcacaa atgcaaatgc catcgacctt gcttactagg gtcgatacag taaattaaat 5761 ctcctccagg cgtaaacagc gcaatgcctt taaacagttg aggtaaaagt ggtttgtcgg 5821 atgtcaaggg cgtgtcactc ccttctcctt aggagacgct aaacttaagt ttctccggga 5881 cacgctgctt tgtatacctg cggtctgtgt gattccgcat tccgcaaagc gtgccgtgcc 5941 tcctacagag gagctacgaa ttgcgcgttc agcgtgccca caaagcatag gcatacgcat 6001 gattgcgtgc gctttgtgca tacgggggaa ttcaaaatat gcccgacctg cacattacgc 6061 gaccataatg atcaattcaa aattaacaat atatccttta gtgatattaa aaagtcttaa 6121 tatttataat tcataattct caacacttat actctctcac gaagaaattt tgccatctca 6181 ctgggagttg gtgacatttc tagaaagaaa 
ttttgacttt tgtgaacaaa ttcaaaacgc 6241 ttgtcatttg cagaaacgac aaaccatctg gtagggacac cactaacata taatttgttt 6301 aagaccattt gtgaattatg aggaacctcc cttactttca taagaaatgt taagcagacc 6361 agaagtaact gaatttaaat aaacctttaa aatgattgct ttttggttta tctgcctaaa 6421 gatagatgtc aatatcaagt tttctatcag acaaagaacc agacttaatt acataattgc 6481 aaaacattaa gtgtaaagtt tattaaataa acgctcattc tatgagttgg tttttatata 6541 cttaatttct aattgttgat cccgctcatg aatgagctag acatttctaa acctgaatct 6601 ttgttttgtt gatttactat ttgagggaaa aagtcatggc tttgtctcaa acatctcatt 6661 ttggtaagac acactctttg ttgatgatga aaagtttctt aatatggact tttacattgg 6721 cagtatgctt gctggttgtc ggttttcctt tagttgtctt gatggctacg gtcggatgtc 6781 tgttgtcagt tgttttacaa tcggtaatgc ctgtcagtgc ggttttgctt gttgcaggtg 6841 gtttaatcat gtttaatgtg atggcagttg taatggctgc tgcagctctg actcttaaag 6901 gagttcatcc aagcgaaatc aaatggttga gctggctgca tggagaaaca gagaatattc 6961 aaacgaccgc tgtttacgct tcttgcccat taacttgtga aattaaacca taactatcac 7021 tactcaattc gacaaacagt acatctgccc ggttcagccg ggttttttca tgcctttttg 7081 tcttttgtca tttccggatg agaactattg actaatgact attgactagt gactattgac 7141 taatgactat tgactagtga ctattgacta atgactaaaa agcaaattta cattatattg 7201 ggtactcgtc cggaagcaat caaactagct ccagtcattc aggttttcca aaattcccca 7261 agtctgaaca cttctgtgat tttgacagga cagcatcgcg agatggttga gcaagtgatg 7321 caactgttca acctcaaagc tactcatgat ttggagatta tgcaaccaaa gcaatctctt 7381 agtgatatta cctgtcgcag tttacggggt ttggaaggat tattccaaga aagtaagcca 7441 gatttagtca tagtgcaggg agatacaact acagcttttg ccgggacttt ggctgcattc 7501 tatcaaaaaa tccctgtagg acatgtagaa gctggattaa ggacagatga cttatttaat 7561 ccttatccgg aagaagctaa tcggcggctg atttctcaac taactcaatt gcactttgcg 7621 ccaacgccaa tagccgtgga aaatctaaaa cgttctggcg ttttgggtga aattcacctg 7681 acgggtaaca cagtgattga tgcgctgtta actgtggctg caagtgttcc tggctgtgat 7741 atccctggat tggaatggga aaaatatcgt gtcctgctgg caacagttca ccgccgtgaa 7801 aattggggag aaccactgta tgatattgct caaggatttt tatcgttact ggataagttc 7861 ggtgatacag ctttgctact gccattgcac cgtaatccaa 
tagtgcgaga accattgcaa 7921 acactattag gaaaccatcc ccgcgttttt ttaacagaac ctttagatta tgctgaatta 7981 gtgggagcga ttatgcgatc gcacctctta ctcaccgact ctggcggttt acaggaagaa 8041 gcacccagtc ttggaaaacc agttttggtt ttgagagaaa caacagaaag acctgaagca 8101 gtcactgctg gtacagctaa attagtggga actcaaaccg agaacatttt tgcgactgca 8161 gcccaattgc tctctgattc tactgcttat gaaacaatgg caaacgcaat taaccccttt 8221 ggagatggtc atgcagcaaa gcgaattttg caaattgtgc aaaattactt gggactttcc 8281 ccataaacat cagcctcaca tcaagagcct tttcgcaatt agggaccacc aacactaata 8341 ccagaatagc cagaatagcc agaatagcta taatttccag gatacccaga agaatatcgt 8401 cttatgctaa cattacctac ccccagataa ctctgagttt ctggtgtcgc tactccatct 8461 ggacgaaagc caagacttgc ctgaaagttt ttgactgctc tttgtgttga agcatcataa 8521 tatccagtaa ctggaacatt agccaaagct ctttgaagtt ctaccacagc tttacctctg 8581 gagcctcttc ttaaagtaat gttacggcta aagttgtcgt tataatatgg agagtaacct 8641 ctagcgccaa aaccatcatt aggatacgga ggataatcta tagatggtct gctgccaatg 8701 tttccgaacc caagataact aagagtttgg gatccagcca ctccatctgg acgcaagccg 8761 tttctttgct ggaagttctt aactgctgtt tcagtgaacg aaccataata tccagtgact 8821 gtaacattgc ccaaagccct ttgaagttct gccaccgctt gaccttggga gccttttcta 8881 agggtggcgc taccgtcaat atcattataa taagtatctt gtgtgggaac gcggcgaaga 8941 gtgtattccc tgccagcgcg cacattgatc caattagcag aaacgtaata acctgtagac 9001 aaccgagcaa atccactcct atatcctaca actcgtggca atggtgagcc attggagtca 9061 catttggcag aacgatagga cgtactagga cctgtacgta cattaagaca gctgccgttg 9121 gtactgacag agtacctacc cggaccgtag taggcggctg ttgctgcttg agattgggcg 9181 agaattccca ataaggcagt gatgagcgcc aaagttaacc atgcagattt gaaatatttt 9241 tgcccactag atgtgagttt aaattcagga agatggtact cggtatctgc agttaccgct 9301 gcatatgtac tatccataag ggaataggct aagtattcca caatcttctc ctttgtgctg 9361 aaagtcgttt ggtgagtgta gaagttttct tgcgctcttt cctagttttg tctaggtaga 9421 actttactgg tctatataaa ttgtgacaga gtctagaacg aaattctaaa ttcaagtgaa 9481 tttttcaaaa acaaaaatcc ctatccgctt gaatcaaaca gacagggaac attgaccaag 9541 aaaaatatgt gttgcatact agctgctttc ttaattaata cactcccaaa 
gcagatagcg 9601 tttgtggtcc agcgactcca tctgcaagta aaccatttct tcgctgatag tttttgaccg 9661 catctgctgt tattgaaccg tagtatgcgg taggtgtgag tcccagagtt tcttgtagtt 9721 ttgcaacagt agaaccttga gagcctaaac tcaaagtgta aggaccacca acaccaggac 9781 cacgggtgta gcctttgcca ggtttgttgc caatccaatt agcaccaacg taataaccat 9841 tggatagctt agcaaatcca tttgcgtatc ctacgacttt tggaagtcgt gagccattac 9901 ggtaagatgc aacagcggca cttgatgagc taggactttt gcggactagg agatcactgc 9961 cttttgtctt aacataatag ttgtttcctg gaccgttgta ggcggctgtt gctgtttgag 10021 cttgagctaa aatccctaac aacgcaccga ttagcgcgaa agttaaccac gcagatttga 10081 aatttctgtt ccaattgaat ttcagtttca attctggcag tttcaattct ggcagtttca 10141 attctggaag actgaattct gtatctttag tcgcctctgc gtaggcacta tccatcaaag 10201 aataagcaag atattccaca gtcagctcct taatactgga aaattcacaa atgcagagtt 10261 ttacttcgct ttctggtgag tttgtctaat gtaacttaga ctacgcttta tcgtaaagga 10321 tttttaccga gaacattatt aagaaaacat aatttttgta aaaaaagtcc ggaagaatac 10381 gtacccaaag cgtcaagcca gaggcttatc gcctctccac aggcagctcc aaaccagcat 10441 tgccccaaaa gagcaggcac tctgtgcgat acgctttgcg tcaagccaga ggcttatcgc 10501 ctctccacag gcagctccaa accagcatca ccccaaaaaa gcaaatgtat ctgtagtatt 10561 gccagaggta gagaatctaa ccttatttcc caaactgctt tgaaatggaa aaagtatcag 10621 aacgtacttg ctgaccattt cgccaaactt ccaccttgtc agaactaaca acataggaaa 10681 aactaccatt gtctgcccta tactgagaac tgccaacact cctagcaggg aggcgtacag 10741 gctgacttgg atcttgcttg agttgcccaa tataaaacaa ctgaccattt tcctgacaga 10801 ctttcacaaa aatatttgct gtttcgcctt caatcactgt agtcccattg catttagtat 10861 cagccatcac agagccaaca gccactggaa ttggttgtga agatgaaact tgctgtggtg 10921 tctgcgctgg agagacttgg ttgatttcgt gatttgcttg ggaagcaaca gttgtaggac 10981 ttgtcatctg ctgttgtggg atttttcctg caatcaaaga cggtaatgtc gtcaaagggt 11041 caacagcaaa acgatgattt ttacgcactt caaagtgtag gtgaggagct gtactattac 11101 ctgtcgatcc catttctgca atgacttgtc cttggtttac ctgttgaccc tgtttcacca 11161 aaaggcggct attatgaccg taaactgtaa cactcccatc gggatgctca acaactacga 11221 catttcctaa tccccattcg ttccaacctg ctttgactac 
tgtacctgat gcagcagcaa 11281 caacaggagt accagatgct gctgcaatat caattccttc atgttgatat ttgcgaaaac 11341 cctgagaaac tattccctga gtgggccaaa ttaagttaga gggaaaagat ttttgtggag 11401 tgttggaact ttttgctgtc tgagtcaaaa cctgcgtatt tgtacttagg acagcaatga 11461 aagacgttaa ccctatcaag ctatatgtac aaatttgagt aatagtaact tttttcatta 11521 gttagagctt gtaagaaaat ggtttaggat gtcttgagtt attttctgtt acagtcttaa 11581 atcaactaca tcaggagtgt tagtgggatg cttccggaat caagtacgca gcgaaggtat 11641 ttatgtaatg cagggacgag tgcagtagct cttttaaatg acttgaaaaa caacgcacca 11701 gaaatttatg ctcgtttggc gatatagtta ggcaatggca caagtcagtg ggaaaatcat 11761 tcaagagtta ttgcttatcc tcagaccatt tatgcgggat aagcaacaac gtcaggcgta 11821 tcttgaactg gctttaggaa ctaattcacc agtattaaac ctcttggtgt gggatacgtc 11881 tgcggatgtt tttattcccc aaatggttaa cacactggta gtctttggag aaattacccc 11941 tgggaaacca gcactttgtg cattattgga agtagtacgt ggaaatgttg gtttagacaa 12001 gcagctagaa attgataatt tgctgcaagg aattcgagag gaacttcagc gatcgccaac 12061 taactcctct agagtatcgc ctctatttcc tcatcctgag ttccagacat atctagaaaa 12121 tatcgcccag aagtatcagc aatggtcgaa tatgtatact ttgacggatg ctgagggtaa 12181 agtatttgac gtgggtttga tggtgcaaac acggcaaccg aaacaacgag aaggaatgcc 12241 aggtgaggta aaacaggaaa cagagcggtt gcccgtgttg gaggctatcc gtaaatatgc 12301 tataaaccat gtattattga taggaaaacc tggttctgga aagtccacag ctttacaacg 12361 gttgctgtgg gaagaagcac aagcagcaat ccagggagaa aagcgaaaaa ttcccgtttt 12421 ggtagaactc cgctactggg atacttcggt agaaactttg atttgcaagt ttttgcgaaa 12481 gcacaagcac cggattgata ttagtaaaat tgaagactta ctaattgacg aggaattatt 12541 gctgttgatg gatgggttga acgaattacc ctccgatgaa gcacgcgata aggtagcacg 12601 attccgtcaa gattaccccg aaacaccaat gatttttacc acgcgggatt taggtgtggg 12661 gggaagttta ggaattgaca agcaactgga aatgcaaccc ctgacagaag cacagatgca 12721 ggagtttgtg cgtaagtatc tcccagaaca gggtgagcaa atgttgcagc agttgggaaa 12781 tagactccgg gagttggggg aaactccgtt gatattgaag atgctgtgcg atgtattttt 12841 tcaaaaacgg gaaattccca aaagtcgggg agatttattt cgtcagtttg acagtacagt 12901 taacaacctc aaagaagaaa 
aagaaactgt tccggttgcg gaaggattgc gactctggaa 12961 aaaggattta ttgcagcatc tagcatttgt gatgatgcaa cccgaaaatc tccaagccaa 13021 ccccacagat tttcgactgt taatttctcg tcgccaagcg gaaacaattt tagaggattt 13081 tctcaaaggt agggtagagt atcccgcgca aaaagccaag gattggttag agggtttgct 13141 gaaacattgc ttagtggaga gtaaagcttc cgaatccgag caagttttga ttcaatttca 13201 ccatcagctt tttcaagaat attatgcggc agaatatttc ttgaggttgt tacctaactt 13261 gagtgatgca aagttaaagc gcgattattt gaattatctt aagtggacgg aaagtatagc 13321 gatcgcattg gctttggtgg aggatgaagc tttagctgtg caggtggtgc ggttagcgtt 13381 ggatgtggat ttgatgttgg gtgcgcggtt agcgggggag gtgaaggagg agtttcacga 13441 gaagacagtg gggttggttt tggagttggg ggttcaccaa agacttaaga ttgaactttt 13501 gggaatgact cgttctgaac aagcgagatc gccattgcag caagcacggg ataatcgaga 13561 ttatgacaat gtttatagtc tagctgaagc tttatcatgt gtcggtgacg agcagctaat 13621 gtctcaattg ctggaatggg aagacaaaca tattgcagag tggaatcaag ctaaatttta 13681 cgaagattat aataatcatt ttctttgtga aagaatagct tctgagttag aaaatataga 13741 atcatctgat gtggtagttt tgaatttagt gaaattactc aatgataaaa aattaataaa 13801 taagaaagga tctgatgaat atttaaagcc ccgaaattac caagcacagg cagtactagg 13861 agaaggacta tttaatcaag cgatctcatt cttgttaaaa tcattaaaac atgaagattt 13921 tcgggtacgt tatcacgctg ctttggcatt aggaaatata ggttcggatg cagaagcttg 13981 tgctttattc aaaattgtag aagatgaaaa ttattttgtg cgctctggtg cagttgaagc 14041 attaggaaga tttcgcagtt ataccgtaat tactccctta attaaagcac ttaatgatga 14101 aaaagctttt gtacgttcta gatctgcaga agctctaggt aaattccgtg attacaaggt 14161 aattaatgcg ttgatacaag cattaaagga tgaaaaattt tttgtgcgtt ctaatgtagt 14221 aaaagcatta ggaaaaatgg attataagca ggtgttagag catttaatta aagctttaaa 14281 tgatgaaaat tcagatgttc gtcagagtgc tgtaattgct ttaggagagc ttgctgaagt 14341 aaaccataac aatcaattaa taacagatgc tctgaattat gctttaaatg atgaagaatc 14401 atctatttct tcaagtgcta ctgatgtatt gaaatcaatc aagaaaaatc gcttaatcaa 14461 attggaaaaa atcaagaaaa atcgcttaat cgattctgtt ttagataaag aatcattagt 14521 ttatagttac cctgtacaag tagaagaaat tgtagtaagt aaatttctac ctcaaatgca 14581 
agaattatta ttagttgcaa tatatgaaat gaaagatttg attttacaaa ttcaagaaat 14641 atacaaattt tataaccatg aaattttcca ttcaccccca atagaagaaa caaaatcaac 14701 atctactagt tctacaacta taattaattc cgaaatagta caaattatag aaaaaaatga 14761 aggtgatgtc atcggtaaga aaacaactga aacctaacca atcttcagat tctcgacctc 14821 tttaagaagt cgagaatcta catccgtgtt gctatatttc gagaggtaaa aaatgtcaga 14881 acaatccaaa gcgtctaaat acaactttga aaaatctgaa gtagtgcaaa ttatcgaaga 14941 gaatcaaggt actgttatag gtaaaaatgt tgcagagaaa accccagccg aagcagcaaa 15001 ggaaattcaa gatttattag cgcagctgca aacaaattac ccgacaacaa cagaatacga 15061 aaaacaagtg tttgtcaata aatttaatga cgaagtgaaa actaattccc gcgttcggga 15121 tgtcatttta gctgggggaa ttgaattaat taaaattctt tgtccaccgt tgggtatccc 15181 gattgaaatg ggtaaaaggt ggttggaaac tgcccagaaa cagaaataat atcatgtcca 15241 cttgaatagt tgtccgcagc gttagcgtag cgaacgcaag tgcttgagat tgcttcgttt 15301 cacttcgttc cactcgcaat gacacatcgt aagtaattaa gcggacatga tataactagc 15361 ttggtggtta caaatccagg cagttgtaac cgatcgcatt aagtagcgat actatcgctt 15421 gttgcgtgac gtgaaagttg tccatttagg acttaccact cgctactggc tgatcaactt 15481 gcgcctcctc ctgctgtgac ggttgctctt cttcctcagg tggtaagcca taaatcgacc 15541 gatacaggtt cacgtactgc ttagcagatc gataccaact gaagtcttga ttcataccgc 15601 gttgctgtag ctgttgccag taatccttgt aacggaaacc ttcccaagcc cgtatcatac 15661 aggtaaatag atccaatggt tcataacggt cgaagcaata accagtacct gcatggttcg 15721 ttggatcatg atggaataca gtatcaacta atcccccagt acgacgcaca attggcactg 15781 aaccgtaacg caaagccatc atttggctaa taccgcaagg ttcaaaacga cttggcatga 15841 ggaaagcatc ggtacccgca tagatgcgac gagccaaggc atcgttataa agtaggtaag 15901 ttgccatgcg tccggggtag cgcgatgcga gttgccacat ttgggtttcg tagtagcgat 15961 cgcccgtccc caacaagaca aactgtgcat ctgtgtacgc caaaaagcga tccaaaatat 16021 tcaacgtcaa atcgatgcct ttttgttcca ccaaccgtgt caccatacca ataaaaaagg 16081 cattgctatt tacttctaac cccacttctt cttgcaaagc aattttattc gctttgcgtt 16141 tctctaaagt gtcagccgtg aaggtttgag caatgtactt atcggtaggt ggatcgtaaa 16201 cttctgtatc aatgccattg ataatacccg acaacttgcc actgatgaaa 
gacaataacc 16261 cttctaaagt ttcaccataa gcggctgtct ggatttgctc ggcataagtt ggggagacag 16321 tatttacctt atctgcaaac tggactgcag ctgccattgt gttgtgtcct tgcatatacc 16381 aaggacacca agtaattttt tctaaaaacc agcgccacgg accttgatat gccaggttgt 16441 gtatcgtaaa aacagtgctg atatcaggag attgatgcat ccacacagga atcattcctg 16501 tgtgccaatc gtgacagtgg acaatttccg gtttccagta attccaacaa aactccgctg 16561 ctccgttggc gaagaatgtg aaccgccaat cttcatcttc tccagcgtaa attcgccgag 16621 gcgaaaaagc cggatgtcca aataagtaca agggaacatc agtaccaggc agaacagttt 16681 cgtaaactgc aaagctttgg aacatggcag atccccacca aattggctcg gtaggaattt 16741 ccattttgtc tggtagcata ccgtagtaag gcaagaagat ccgcacatca tgccccattg 16801 tcctcaaaat tttggggagt gctccaacaa catcacccat tccaccaact ttcgcaatgg 16861 gagctgcttc tgctgcaaca aataaaatcc gcatgataat ctttgtcccc tcgattctgc 16921 ctagattaga gttgttttaa tatcaaatcc gcgcttactg actttatcta tacctcaccc 16981 cggctgcgcc acccctctcc ttattaagga gaggggtggc gcagccgggg tg // LOCUS NODE_1955_length_16986_cov_5.74862716986 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16986) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16986) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16986 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(406..2874) /locus_tag="DP116_17020" CDS complement(406..2874) /locus_tag="DP116_17020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749320.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17020" /translation="MRLGEILALQGWIKQETADFFAQKWSDLLNEKSKQPLGEYLKKA GLLDEYQVKTILCEQKQMGLKFGELAVHKGWLQPTTINFFLEYIVPLGQCSQKILQQA EHSVEARNERVSPSGELKIQNMSSGDAAETEFQIEDFTSIDDSLRQVETFLEEDKSSV VGHKVHTKPFSRSIIKLFNLNQKASRPDILLQEILSWTSGQPFLTQKLCQLLADSEAF VPVGEEAFTVQQLVQTRFIDHWETQVASEHFKAIRYGLLRNNKCNSFALLELYQQILQ EENISVKDSIRKTELLNLGLVVEQENTLKVSNRIYQSIFSLSWVNQELIRLEINCNRI KLFKLDEKASRPYVLLEEVLSWTSGQLFITQKLCQLLANSQDFIPVHEEAVRVQQLVQ TRIVENWEIQAASEHLEGIRNGILKNQQCNPLSLLRLYQQILQHKEVVVNNNPAQTEL LNLGLVVEQENTLKVSNRIYQSVFSLSWVNQELEKQPPPLSQITQNTQSQLPPSTLRL KNIKKTISNSTKVLPKGTWILLGSLGLVIVGFSVVRSSTMKSLEVQILFKQGNELFNQ RKYQQAIAKYNEILKIDQSYYQAWTNQGYALAGLQQYHKMLKSCSAATIVEQKAVYGW NCQGEALHNLKRYNEAIAAFDKAIAIDSKDPVFWINKTESLLALKQTEQALVTINKAI ELLEKGSEVDTKNTIKRDLSVAYSHKGKALSQTQKHKEALQAYNQSLAYDPNYFTAHR GRGIALGGMRRYHEAIAQFERMLKELKLVDAQKAETLYYLGFTLCKTSKVQEAFTAFD EALKLKPDYPAVQQAKMSCSPSSH" gene 3314..3655 /locus_tag="DP116_17025" CDS 3314..3655 /locus_tag="DP116_17025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994480.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17025" /translation="MSIITNLHDETNIHDETTILVEFAPSAGMKQVSLTSEDLAKKSS EALDKAMATIRQMAQKTMVTIDTLTNKPTEVQLEFGIKLNTEAGAIIAKTSGEASLKV KLTWERKEANE" gene 3652..6777 /locus_tag="DP116_17030" CDS 3652..6777 /locus_tag="DP116_17030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141661.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_17030" /translation="MNKPARKPQIINLCKFTVQIRDVNNNTVGTGFVVSESGQIVTCA HVVRDACATGEVAEGVEVDVYFSKAQNEEQKAQKARVAVCFHDYEDDVVLLQLDTPSL PDGIEVAILGTAEESAGNKFRSFGYRRLGKNQGCPAEGKIIDFAESPENSVLHGDPLM LSCQHIDSGMSGAAVLDTERDLVVGVIAQTWDSGQSEKDRDTSFAVDCKVLTFDPMRL PLAGMPITRLLATQDKMDLHTTGGQPVSEPGIVLNNAPALLPEWVGREEFLRTLNQDW VDSDCLITGLIGFGGEGKSSLTRRWLENLLQDSSLPCPQGVFWWGFDEKTSIDEFFEA ALTFLVKDIDPRKLTPAEKAKFIHAMLKSGRYLFILDGLEVLQHEDGDDYGELKNADL REFLRGFATGGHQSFCLINSRVPLLDLIDFTTYTHRDVDRLSAEEGRTLLRNVGVKGS NQDLDRIVADWDGYALVLSLLGAYLVDVYNGDVKRIRDISPPTADEPRYDRVKRVLRR YDKHLTQPEKEFLTVFSAFRLGVSQSAFAQVFQGVIPGYFPPRRQTSSFVDRFIWQFQ RLLNRLFPSRRRAETQLKKQLKAPLTELPGRTFDAMVRRLVNYRILRYYLEANYYAMH PLIRAHYLKQLDEDKRAQAREIHQRIADYYLRIAGPIPDHPILENLAFPIEVVHHLCC AENYDQAYDVFWERVLQSAQRVLVDQLNAWDTCLALVLEFFPNNDTSQEPQVSSLNCK AWILNEVGLCLKKLGQLSEAEQFYKRAIAIELNKENWKNAAINYQNLAGLYIELGKLT ASEQAAREALTLARRAGDKSQEANSLGYQAQIAHLQGNLQLASAAFQQAEALRQEIEA DTYDLYTLNGIYNADHLRRVSNLEYARRITQANLEICQRNHWLARISRCYRVFGDLDA DTEQHESARENYNEALRIARGISNRDVLIEALLARGRWAARRSEVEAARSDLDEALSY ALSGGYRIYEADIRVALAWAHLAEGNYSVAQVQAEKAQRMSAEMGYHWGQVDAAEVLA GLKQLSQSVLN" gene complement(6809..7009) /locus_tag="DP116_17035" CDS complement(6809..7009) /locus_tag="DP116_17035" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17035" /translation="MLGGIALFFVLAFVVLVNKNLYVGIVRVSVGFRYRSTQPTKDIL ISVFYTIISFNVVNLLKQQAGE" gene 6991..7533 /locus_tag="DP116_17040" CDS 6991..7533 /locus_tag="DP116_17040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009547453.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_17040" /translation="MQYRRAKVNGGSYFFTVVTHNRRKFLCEPDNISLLRNAFRYVMQ QHPFEIDAIIVLPDHIHSIWTLPDGEHDFSTRWRLIKSYFSRQCATEYQGEISKAREK KKEQAIWQPRFWEHQIRDDKDFAHHVEYIHYNPVKHGLVAAPKDWQYSSFHRYVRDGI YDMDWGADGEILFDASIGNE" gene 7992..11543 /gene="metH" /locus_tag="DP116_17045" CDS 7992..11543 /gene="metH" /locus_tag="DP116_17045" /EC_number="2.1.1.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455044.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methionine synthase" /protein_id="PRJNA477356:DP116_17045" /translation="MTSTFLERLHSPTRPVIVFDGAMGTNLQTQNLTAEDFGGPQYEG CNEYLVYTKPEAVAKVHRDFLAAGADVIETDTFGAASIVLAEYDLADKAYELNKTAAE LAKRVAAEFSTPEKPRFVAGSIGPTTKLPTLGHIDFDTMKSSYVEQVEGLFDGGVDLF IVETCQDVLQIKAALNGIEEVFAKKGDRRPLMVSVTMETMGTMLVGTEINAVLTILEP YPIDILGLNCATGPDLMKPHIKYLAEHSPFVVSCIPNAGLPENVGGQAHYRLTPTELR MSLMHFVEDLGVQVIGGCCGTRPGHIQQLAEIAKELTPKVRHPELEPAAASIYNIQPY DQDNSFLIIGERLNASGSKKCRELLNAEDWDGLVSMARAQVKEGAHILDVNVDYVGRD GVRDMHEVVSRLVNNVTLPLMLDSTEWEKMEAGLKVAGGKCLLNSTNYEDGEPRFLKV LELAKKYGAGVVIGTIDEDGMARTADKKFAIAQRAYRQAVEFGIPPTEIFFDTLALPI STGIEEDRANGKATIESIRRIRQELPGSHVVLGVSNISFGLSPASRIVLNSMFLHEAM TAGMDAAIVSASKILPLSRIEERHQEVCHQLIYDERKFEGNVCVYDPLTELTKLFEGV TTKRDKGVDENLPIEERLKRHIIDGERIGLEEQLTKALEKYPPLHIINTFLLDGMKVV GELFGSGQMQLPFVLQSAETMKAAVAYLEPFMEKSEAGNNAKGTFIIATVKGDVHDIG KNLVDIILSNNGYKVINLGIKQPVENIINAYEQHKADCIAMSGLLVKSTAFMKENLEV FNEKGITVPVILGGAALTPKFVDQDCQNTYKGKVVYGKDAFSDLHFMDKLMPAKAASQ WDDLRGFLNETAETAQVSGNGHKQSLAEAVEEKSPEPKEVDTCRSEAVAVDIERPTPP FWGTKVLQGEDIPLGEVFWYLDLQALIAGQWQFRKPKEQSKEEYQAFLDEKVYPILED LKQRIIQDKLLHPQVVYGYFPCQAEGNSLYIYDTNRRGAEDAEVREVRASFEFPRQRS LRRLCIADFFAPKESGVIDVFPMQAVTVGEIATEYAQKLFADNKYSEYLYFHGFAVQM AEALAEWTHARIRRELGFVADEPDNIRDILAQRYQGSRYSFGYPACPNIQDQYKQLEL LGAERINLHMDESEQLYPEQSTTAIITYHPVAKYFSA" gene complement(11795..13105) /locus_tag="DP116_17050" CDS complement(11795..13105) 
/locus_tag="DP116_17050" /inference="COORDINATES: protein motif:HMM:NF033590.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS4 family transposase" /protein_id="PRJNA477356:DP116_17050" /translation="MRIVEDLASQPSSSVPQACGNMAATCAAYDFWSSPYFEPDDIRK AHVRSSIERIKSHEIVLAIQDTTNIDLTNHPSTTGVGYLDHQKLSGLKVDSTLASTID GVPLGIIDQQVWTRPRENLGIAKKRRQRETQEKESQRWLDSLKTTQQLIPKENMVVTM GDSEADIFDLFSLKRPENSHLLIRGTHNRKVDHTAQYLHQAIRQTQPCGLLSVEIKRN PEQNPRIANLILRFATLEVCVPANHLARSQLKPVKLQVILAQEENPPDGVEAISWLLL TTIEICRFEQAARCVKWYTYRWLIERYHYTLKSGCGIEKLQLETGRRIEMALATYSIV AWRLLWITYQARLHPDESCDTVLEAHEWQSLSATINKHPIPPKNPPSLQQAVRMIASL GGFLGRKSDGEPGVKTIWRGLRRLHDIAATWKLTHSTFKANGCS" gene complement(13176..13376) /locus_tag="DP116_17055" CDS complement(13176..13376) /locus_tag="DP116_17055" /inference="COORDINATES: protein motif:HMM:PF14706.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17055" /translation="MQKWAELELQQADLGDARRNKRLTKNSGVRSQESEGRKNRTKNR RISCTNDSESRGLNPHPSTRCL" gene 13891..14244 /locus_tag="DP116_17060" CDS 13891..14244 /locus_tag="DP116_17060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748557.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17060" /translation="MTQKIQESVTESKDVRVSATSIIWGIAVGMLAICIPLSSATKSG SILPLATIAGAAISTVAVWRSDDKKSKYNSLPQQKVELLEQRIANLETIVTRDDFELR MRMKQVESRDRKSDN" gene 14532..14846 /locus_tag="DP116_17065" CDS 14532..14846 /locus_tag="DP116_17065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860927.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17065" /translation="MTSNIQKYRFVCTLTFGDIYGQIIVWLITITISLASALALMGAR KPVYALATVGLVVLLSLPFLLFAFVTTLLNHIEVSPVEPGTRMEPIPGNVSQQQPVEA TS" gene 15199..16926 /locus_tag="DP116_17070" CDS 15199..16926 /locus_tag="DP116_17070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409296.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="succinate dehydrogenase/fumarate reductase flavoprotein subunit" /protein_id="PRJNA477356:DP116_17070" /translation="MLEHDVIIVGGGLAGCRAAVEIARIDPSLNVAVVAKTHPIRSHS VAAQGGMAASLKNVDSADSWEAHAFDTVKGSDYLADQDAVEILTREAPDVVIDLEHMG VLFSRLNDGRIAQRAFGGHSHNRTCYAADKTGHAILHELVSNLRRYGVQIYQEWYVMR LILEEGQAKGVVMYRIEDGHIEVLRAKAVMFATGGYGRVYNTTSNDYASTGDGLAMTA LAGLPLQDMEFVQFHPTGLYPVGVLISEAVRGEGAYLINSEGERFMERYAPSRMELAP RDITSRAIAYEIRAGRGVHPDGSAGGSFVYLDLRHMGKEKIMSRVPFCWEEAHRLVGV DAVTQPMPVRPTIHYCMGGIPVNTDGQVRSSGDGLIDGFFAAGETACVSVHGANRLGS NSLLECVVYGRRTGASLAHFVQKRKLPTVDEQRYITEAQQEIQALLDQSGKYRINKVR QAFQDDMTQYCGVFRTEELMREGLNKVQELEQQYSQIYLDDKGSFWNTEIIEAFETRS LMVVGRMILESALNRQESRGAHFREDYPQRDDTNFLRHTMAYYSPAGIDIQYRPVAIS MFQPQERKY" BASE COUNT 4765 a 3409 c 3892 g 4920 t ORIGIN 1 actataaact agtaaatttt tcagcaaaaa tttttagaaa aattttaaat tttagagaaa 61 atacggagac attaagaatc aattggaacg agattcaaac ccctcccaat tgcacattgc 121 gtgttagcgt aagcttctcg cgaagccagc accttagcgt gccttccaga catagatttg 181 gtgggcgcgg tatataccgg taacttctga cggggcgaga aaaagggcgc ttgacgcaga 241 taaaataagc tccatagccc ttagcacgtc gttgataata cttgcgctta ttgcaacata 301 attttataaa atgcacaacc acctgacgcc aggactatta gtcgtattta ctctgagatt 361 aaacctcgta agtcttggtg gttcttctaa gagagaataa ttgacctagt gagatgatgg 421 gctgcaactc atttttgctt gctgtactgc cggataatct ggtttaagtt tgagggcttc 481 gtcaaaggcg gtaaaagctt cctggacttt agaggtttta cacagcgtga atcctagata 541 gtacaaagtt tctgcttttt gggcatctac caactttaac tcttttagca tccgttcaaa 601 ttgggcaatt gcctcatggt 
atcgcctcat gcccccaagt gctatacctc gaccccggtg 661 agcagtaaag tagttgggat cgtatgctaa tgattggtta taggcttgaa gagcttcttt 721 gtgtttttgc gtttgtgata atgctttacc tttatggcta taggcaactg ataaatctct 781 tttgatggta ttttttgtgt caacttctga acccttctca agcaactcga ttgctttgtt 841 aatcgttaca agagcttgtt ctgtttgctt gagtgctagc agtgattcag ttttgttaat 901 ccagaaaaca ggatctttcg agtcgattgc aatggctttg tcaaaggcgg cgatcgcctc 961 attatatcgc ttgaggttat gtagtgcctc cccttggcaa ttccagccat aaactgcctt 1021 ttgctcaacg atagttgctg cactacagga ttttagcatt ttatggtatt gttgtaaacc 1081 agctaatgcg tagccttggt tagtccaagc ttggtagtag ctttgatcga tttttaaaat 1141 ctcattgtat ttcgcgatcg cctgttgata ttttctttga ttaaatagct cattgccttg 1201 tttaaaaagg atttgcacct ctaaagactt catcgtacta gagcgaacga cactaaagcc 1261 aactatcacc aagcctaaac ttcccagtaa aatccaagtg cctttaggaa gcacttttgt 1321 cgaattggaa atagtttttt ttatattctt caaacgaagt gttgatggag gtaattggct 1381 ttgggtgttt tgagttattt gtgaaagtgg gggaggctgt ttctccaatt cctgattcac 1441 ccaactcaaa gaaaaaacag attggtaaat acggttagaa acttttagcg tgttttcttg 1501 ttctacaacc aatcccaaat tcagcagttc tgtttgtgca ggattgttat tcactacaac 1561 ttctttgtgc tgcaagattt gctgatacag ccgcagtagt gacaaaggat tgcactgctg 1621 atttttcaga attccattgc gaattccctc caaatgctca gacgccgctt gaatttccca 1681 gttttcaact atgcgggttt gtactagttg ctgtactctt acggcttctt catgaacagg 1741 aataaaatct tgtgagttag caagtaattg acaaagtttt tgagtgataa acagttgacc 1801 acttgtccat gacaacacct cttccagtaa gacatacgga cggctggctt tttcatctaa 1861 tttgaataat ttgattctat tgcagttgat ctctaaacgt attaattcct gattcaccca 1921 actcagagaa aaaatagatt ggtaaatgcg gttagaaact tttagggtgt tttcctgttc 1981 cacaaccaat cccaaattca gcaattctgt ttttcttata ctgtccttaa cagagatatt 2041 ttcctcctgt aatatttgct gatagagttc cagcagtgca aaagagttgc atttattatt 2101 tcgcaaaaga ccatagcgaa tcgccttgaa atgctccgac gctacttgag tttcccaatg 2161 atcaataaaa cgggtttgta caagttgttg gactgtgaat gcttcttcac caacgggaac 2221 aaaagcttct gaatcagcta gtaattgaca taacttttgg gtgagaaacg gttgaccact 2281 tgtccatgac aaaatctctt gcagcaaaat 
atccggacga ctagcttttt gatttaaatt 2341 gaataatttg attatgctac gactaaaagg ttttgtatgt actttgtgac caacaacaga 2401 tgacttgtct tcttcgagaa acgtttctac ttgtctgaga ctgtcatcaa tactcgtgaa 2461 gtcttcaatt tggaattctg tttccgcagc gtccccggag gacatatttt gaatttttaa 2521 ttccccggag gggctgactc tctcgttgcg ggcctcaact gagtgttctg cctgctgtaa 2581 gattttttga gagcactgtc caagaggaac aatatactct agaaaaaaat tgattgtagt 2641 tggctgaagc caacccttgt gaacagccaa ttccccaaat ttaagtccca tctgtttttg 2701 ttcacagaga atagtcttca cttgatactc atccaacagt cccgcttttt tgagatactc 2761 acccaagggt tgtttagact tttcgttcag caaatctgac cacttctggg cgaaaaaatc 2821 agccgtttct tgtttgatcc atccttgtaa ggctaaaatt tctcctagcc gcatattctt 2881 attttgggtt tgctgttgca aagcgaactc tatctgggca attgaaatta ggtcagcgag 2941 ctgtaaaatt tcacctagcg gtttgagaga aaagatttca cacataagca tgtcaaaata 3001 gggcagtaaa taattactta accagaagtt acaaaaaaat tcagccaact ttgtcaaaaa 3061 tttagcaatg aaattttcat tattttcttg acaaacagcg cttttcgtgc ataaaagaca 3121 ctctcccatt tgatgacatt tcagagaaca tttcatgctt aaaatcagca acgcctgtcc 3181 tcaactactc gttctgtgtc tccaatacga aatgtattaa gttatattgt ttacaatagt 3241 ccttgagaga gtgtattatt tacaaccaaa aaaatacatc tttctttcac agaaattcag 3301 taactattca aatatgagca taattactaa ccttcatgac gaaactaaca ttcatgacga 3361 aacgacgatt ctagttgaat ttgcgcctag cgctggcatg aagcaagtta gcctcacttc 3421 tgaagattta gccaaaaagt cttcggaagc attggataaa gcaatggcta ccattcgtca 3481 gatggcacag aagacaatgg tgactataga tacactgacg aacaagccaa cagaagtcca 3541 attagaattt ggtatcaaat tgaatacaga agcaggtgca attattgcca aaacttctgg 3601 agaggcgagt ttaaaagtga agttaacttg ggagcgcaaa gaggcaaatg aatgaacaag 3661 ccagcacgca aaccgcaaat tataaatttg tgtaaattca ccgttcaaat ccgtgatgtc 3721 aacaataaca ctgttggcac gggttttgtc gtttctgaaa gtgggcaaat tgtcacttgc 3781 gctcatgtgg tgcgggatgc gtgtgcaaca ggcgaagtcg ctgagggagt ggaagttgat 3841 gtttacttct ccaaagccca aaatgaggaa caaaaagctc agaaagcaag agttgcagtt 3901 tgtttccacg attatgaaga tgatgtggtt ttgttgcaac ttgacactcc ctcactaccg 3961 gatggaattg aggtggcaat tctagggacg gctgaagagt 
ctgctggtaa taagttcaga 4021 agttttggct atcggcgttt aggcaaaaac caaggatgcc cggctgaggg gaagattatc 4081 gattttgccg agtcaccgga gaattccgta ttgcatggcg atccacttat gttaagttgt 4141 caacatatcg acagtggtat gagtggagca gcagttctgg atacagagcg agatttggta 4201 gtaggcgtca tagcccaaac ttgggactct gggcagagtg aaaaagatcg tgacacaagt 4261 tttgctgtcg attgcaaagt tctcaccttt gatcctatgc gtctaccttt agcaggtatg 4321 ccaattactc ggctgctggc aactcaggat aaaatggatt tacatactac aggaggacaa 4381 ccagtgtctg agcctggaat tgtcctgaat aacgccccag cgcttttacc agaatgggtg 4441 ggaagagaag aatttctcag gacgctgaat caggactggg ttgattccga ttgcttgatt 4501 accggactta tcggctttgg tggcgagggc aaaagttctc tcacccgtcg ttggttggaa 4561 aatctgctgc aagactcatc cctaccatgt cctcaaggag ttttctggtg gggctttgat 4621 gaaaaaacta gtatagacga gttctttgaa gcagcgctga cgtttctggt gaaagatatt 4681 gatccgcgca agttaacacc agcagaaaag gcgaaattca ttcatgccat gctcaagagt 4741 gggcgctatc tgttcatttt agatgggcta gaggtgctac aacacgaaga tggggatgat 4801 tacggtgagc taaaaaatgc tgacttgcgg gagtttttgc ggggatttgc gacaggggga 4861 catcagtctt tttgcctgat taatagtcgt gtgccattgt tagacctaat tgactttacc 4921 acctacactc atcgggatgt agatcgcctc agtgcagaag agggacgtac tttgctgcgg 4981 aacgtaggtg ttaaaggtag caatcaagat ttagatcgaa ttgtggcaga ctgggatggt 5041 tacgccttgg ttctcagtct attgggggct tatctggtgg atgtgtataa cggcgatgtt 5101 aaacgcatcc gcgatatctc gccgccaacc gccgatgaac cgcgttatga tcgagtgaag 5161 cgggtgctgc gtcgctatga caaacatctg acacaacctg agaaagagtt tttaacagta 5221 tttagtgctt ttcgcttggg tgtttcccaa tcagcctttg cacaagtatt tcaaggcgtg 5281 attcctggtt actttcctcc cagaagacag acttcctcct ttgtggatcg tttcatctgg 5341 caattccaaa gattgctgaa tcgtttattt cccagtagac gcagggcaga gactcaacta 5401 aaaaaacagt taaaagcacc tctgactgaa ttacctggtc gcacttttga cgcaatggtc 5461 aggcgattag tgaattatcg catcttgcgt tattacctgg aagcaaatta ctatgcaatg 5521 catccgctga tccgcgctca ctacttaaag caactggatg aggacaaacg cgcccaagct 5581 agagaaattc atcaacgcat tgcagattac tacctcagaa ttgctggacc aataccggat 5641 catccaattt tagaaaactt agcatttcct attgaggtgg tacatcacct 
gtgctgtgct 5701 gagaattatg atcaggcata cgatgttttt tgggagcgtg tcttacagag tgcacagcgc 5761 gtgttggttg accaactgaa tgcttgggac acatgcctag ctttagtgct ggaattcttc 5821 cccaataatg atacttccca agaaccgcag gtaagcagcc ttaactgtaa agcttggata 5881 ctaaacgaag tcggtctttg cttgaaaaaa ttgggacaat tgagtgaagc agagcagttt 5941 tataaacgtg ccattgcgat tgaattgaac aaggaaaact ggaaaaatgc tgccataaac 6001 taccagaacc tggcaggact atacatcgaa ctcggcaaac ttaccgctag tgaacaggct 6061 gcccgtgaag cgctcactct tgcccgtcgt gcaggagata agtcgcaaga ggctaattcg 6121 ctgggttatc aagcgcaaat tgctcatttg cagggtaatt tacaactagc aagtgcagcc 6181 ttccagcaag cagaagcctt aaggcaggaa atcgaagcag atacatacga cttgtatacc 6241 ttgaacggga tctacaatgc tgatcaccta cgacgagtaa gtaatctaga atatgctcgt 6301 cgaatcacac aagcaaacct agaaatttgt cagcgcaatc actggcttgc tagaatcagc 6361 cgatgttacc gtgttttcgg cgatctagat gctgacactg aacagcacga gagcgcccgt 6421 gagaattaca acgaagcgct aagaattgca cggggtatat caaatcgaga tgtcttaatt 6481 gaagcattgc tggcgcgggg gcgttgggct gcacggcgca gtgaagtgga agcggcgcgt 6541 agcgatttag atgaagcttt gagttacgcg cttagcggtg gctatcgtat ttatgaggca 6601 gatattcggg tggcgctagc ttgggcgcat ttagcagagg ggaattactc agtcgcacag 6661 gtacaagcag aaaaggcaca gcgtatgagt gctgagatgg gttatcattg gggtcaggtg 6721 gatgcggctg aggttttggc aggattaaaa cagttgagtc agtctgtttt gaattagggt 6781 taaatcaact catgcagcga ctaaccgttt actctccggc ttgttgtttg agcaaattta 6841 ccacattaaa ggaaataata gtataaaaaa ctgaaattag aatatccttc gtaggttggg 6901 ttgagcgata gcgaaaccca acactaaccc ttacaatccc tacatataaa ttcttgttca 6961 ctaaaaccac aaaagccaaa acaaaaaaca atgcaatacc gccgagcaaa agtcaacgga 7021 ggtagctact tttttaccgt cgttactcat aacagacgca aatttttatg tgaacctgat 7081 aacatttccc tattaagaaa tgcttttcga tatgtaatgc agcaacatcc ctttgaaatt 7141 gatgccatta ttgtattacc tgaccatatc cactctattt ggacattacc tgatggagag 7201 catgacttct cgacacgctg gcgtttgatc aaaagttatt ttagtcggca atgtgcaact 7261 gagtatcagg gtgaaatatc aaaagccaga gaaaagaaaa aagaacaggc gatttggcaa 7321 cctcgctttt gggagcatca aatacgagat gacaaagatt ttgctcacca tgttgagtat 
7381 attcattaca acccagtcaa acatggatta gtggctgcac caaaggattg gcaatattct 7441 agttttcacc gttatgtccg tgacggtatt tatgatatgg attggggtgc agatggggaa 7501 attttatttg atgcgagtat aggaaatgag taatttcagt agggtaagta agtttggttg 7561 atgtagcata tgtgtcgctt gaccatagat aagcaaatat tgtcaaagca taagaatatt 7621 tgtaggttgg gtagaacgaa gtgaaaccca acaaagcgtc ggaaatgttg ggtttcgttc 7681 ctcaacccaa cctaccttat tcttaacagc aatcctattg tgctaaaaaa taaaaagtca 7741 aaataatctt ctagttgcat atttataacc ttccttcatt attgggcaaa aagtctcaat 7801 tcgcttgcta gcaactcact gcggctgttg attctggatg acttagggag ttgcggatat 7861 atcttttgtc ctcatgcacg caatctttat acactccaga cgcatttcat acttcataag 7921 cactcaaaat tctggaaaat agtatatgta gtcgaacatt tccagactaa cttaattttc 7981 tcttggagaa tatgacctct actttcttag aacgcctgca tagtcctaca cgcccagtca 8041 tcgtcttcga tggtgcaatg ggaactaacc tgcaaacgca aaacctgact gctgaagatt 8101 tcggtggtcc tcagtacgaa ggttgcaacg agtacctcgt ttacacgaag ccagaagcgg 8161 ttgcaaaagt tcatcgcgac tttctcgctg ctggtgcgga tgtgattgaa acggatactt 8221 ttggcgctgc gtcgattgtc ttggctgaat acgacttagc agataaggcg tatgaactca 8281 acaaaacagc agcagaactc gcaaagcgcg ttgctgcgga attttctact ccagaaaaac 8341 cccggtttgt tgcaggttct ataggaccaa ctaccaaact accaactttg ggacatatcg 8401 actttgacac catgaagtct tcctacgtcg aacaagtaga aggacttttc gatggtggag 8461 ttgatttatt catcgttgag acttgccaag atgtgctgca aatcaaagcg gcgctgaatg 8521 gaattgaaga agtttttgcg aagaaaggcg atcgccgtcc cctcatggtc tctgtcacaa 8581 tggaaacaat gggcacaatg ctggttggga cggaaatcaa cgctgtgcta acaattctgg 8641 aaccttaccc aatagacatt ctcggtctga attgtgccac aggtccagac ttgatgaaac 8701 cacatatcaa gtatcttgca gaacattcac ccttcgtggt ttcctgtatt cccaacgctg 8761 gtttaccaga gaacgttggt ggtcaagctc attatcgctt gacaccaacg gaattacgca 8821 tgtcattaat gcattttgtt gaagatttgg gtgtccaagt gatagggggt tgctgtggga 8881 cacgtccagg acacattcaa caattagcag aaattgccaa agagttgacg ccaaaagtta 8941 gacatcctga acttgaacca gcggcggcgt caatatacaa tatccaacct tacgaccaag 9001 acaattcatt cttaattatc ggcgaacgtc tcaacgccag tggttccaaa aaatgccgcg 9061 
agttactaaa tgcggaagat tgggatggac tggtgtcgat ggcgagggcg caagttaagg 9121 aaggcgcaca tatattagat gtcaacgttg actacgtggg acgtgacggt gtacgtgata 9181 tgcacgaagt tgtttcacgt ctggtcaata atgtgacact tccattgatg ctcgactcca 9241 cagaatggga aaagatggag gcgggattga aagttgctgg tggtaagtgc ttgttgaact 9301 ccaccaacta tgaagatgga gaaccgcgtt tcttgaaggt gttggaactg gcgaagaaat 9361 acggtgctgg tgttgtgatt ggtacaatcg atgaagatgg gatggcgcgg acggcagaca 9421 aaaagtttgc gatcgcccaa cgcgcctacc gtcaagctgt tgaattcgga attccaccta 9481 ccgaaatctt ttttgatacc ctagcactac ccatttccac agggattgaa gaagaccgcg 9541 ccaacggtaa agctacaatt gaatcaatcc gtcggattcg tcaagaatta cctggatctc 9601 atgtcgtctt gggtgtttcc aatatctcct ttggtcttag ccctgcatca cgtatcgtcc 9661 tcaactccat gtttttacat gaagcaatga ctgctggtat ggatgctgca attgtcagcg 9721 ctagtaaaat tttaccactg tcacgcattg aggaacgcca tcaagaagtc tgtcatcagt 9781 tgatttacga tgagcggaaa tttgagggaa atgtctgcgt ttatgacccc ttgacagaac 9841 tgacgaagtt atttgaaggg gtgacgacga aacgggacaa aggcgttgac gaaaatctac 9901 ccattgaaga acgtctcaag cgccacatca tcgacggcga acgcattggt ttagaagaac 9961 aactcaccaa agctttagaa aaatatcctc cactgcatat tatcaacacc ttcctgctgg 10021 atgggatgaa agtggttggt gagttgtttg gttctggaca aatgcagctt cccttcgtat 10081 tgcaatcagc ggaaaccatg aaagcagcgg tggcttatct ggaaccgttc atggaaaaat 10141 cagaagctgg taacaatgct aagggtacat tcataattgc tacggtgaaa ggcgatgttc 10201 acgacattgg taagaactta gttgatatca tcttgtcgaa caacggctac aaagtgatta 10261 atctgggaat taagcagcca gtggagaaca tcatcaatgc atacgaacag cacaaagctg 10321 attgtattgc gatgagtggt ttgttggtga aatccactgc tttcatgaaa gagaatttgg 10381 aggtgttcaa cgaaaaggga atcaccgtcc ctgtgatttt aggcggtgca gcgctgacac 10441 ccaagtttgt tgatcaagat tgccaaaata cctacaaagg taaggtggtt tacggcaaag 10501 atgcgttttc tgatttgcac ttcatggata agttaatgcc agcgaaagca gcaagtcaat 10561 gggatgattt gcggggattt ttgaatgaaa ccgctgaaac tgcccaagtg tcaggaaatg 10621 gtcacaaaca atctctggcg gaggcggttg aagaaaaatc tcctgaacca aaagaagtag 10681 atacgtgtcg ttctgaagct gtggcggtag atattgaacg tccaacgccg cctttctggg 
10741 gaacgaaggt actccagggg gaagatattc ctttggggga ggttttttgg tatttagatt 10801 tacaagcttt gattgcgggg caatggcagt tccgtaagcc taaggagcag tctaaggagg 10861 agtatcaggc gtttttggat gagaaggtgt atccaatttt ggaggatttg aagcagcgga 10921 ttatacagga taagttgttg catccgcagg tggtttatgg atattttcct tgtcaggcgg 10981 aggggaatag tttgtatatt tatgatacga accgcagagg cgcagaggac gcagaggtaa 11041 gagaggtaag agcaagtttt gagtttccga ggcaaaggtc gttaaggagg ttgtgtattg 11101 cagatttctt tgcgccgaag gagtcgggag ttattgatgt gttcccgatg caggcggtga 11161 ctgtaggtga gattgcgact gagtacgcgc aaaagctgtt tgcagataat aaatacagtg 11221 agtatctgta tttccacggt tttgcggtgc agatggcaga agcgctggcg gagtggacac 11281 acgcccgtat tcgtcgcgag ttgggttttg tggctgatga accggacaat attcgggata 11341 ttttggcaca aagatatcag ggttcgcggt atagttttgg gtatccggct tgtccgaata 11401 ttcaggatca atacaagcag ctggagttgt tgggagcaga gcgtattaat ttgcacatgg 11461 atgaaagtga acagctttat ccggaacagt ctacgactgc gattatcact tatcacccag 11521 tagcgaagta ctttagcgcg taacttattc ctattcccct ctccttaata aggagagggg 11581 tgcccgtagg gcggggtgag gtagctaacc agaggagtat agtatgatgt ttgttgaatt 11641 tcaccccctt cccctctcct ttataaggct atgcattagt cacatttttc ttaacatttg 11701 tagaagttag aaggagaaag gattacagaa atcaaagagg tgatttaggt tgacaaaatg 11761 taggaaatgt gagttaacaa aggtggatgt gtgattagct acacccatta gctttgaaag 11821 tagagtgagt gagcttccaa gtagcggcga tatcgtgtaa tcgccgcaaa ccgcgccaaa 11881 tagttttgac acctggttcg ccgtcacttt tgcgacccaa aaatcctccg agacttgcaa 11941 tcattcgtac agcttgttgt aaagaaggcg ggtttttagg tggaatagga tgcttgttga 12001 tagtggcaga taaagactgc cattcgtgag cttctaaaac ggtgtcacat gattcgtcag 12061 gatgaagtcg tgcttgataa gttatccaga gcaaacgcca agcgacaatg gaatatgttg 12121 ccaatgccat ctcgattcga cgccctgttt ccaattgtaa tttttcgata ccacaaccac 12181 tttttaatgt gtaatgataa cgttcaatta accagcgata tgtgtaccat ttgacgcaac 12241 gtgccgcctg ctcaaatcta caaatttcaa tagtcgtaag taataaccaa ctaatagcct 12301 caacaccatc aggcgggttt tcttcttgtg ccaaaatcac ttgcaatttc acaggtttta 12361 actgtgaacg cgctagatga tttgctggca cacaaacctc 
aagagtggca aatctcaata 12421 tcaggtttgc gattctagga ttttgttcgg gattgcgctt gatttccaca ctcagcagac 12481 cacatggctg agtttgacga atcgcttgat gtaagtattg cgcggtatga tctactttgc 12541 gattgtgggt tccacgaatc agtaaatgag aattttctgg tcttttgaga ctaaataaat 12601 cgaagatatc cgcttcggaa tcccccatag tcaccaccat attttctttt ggtatcaact 12661 gttgtgttgt ctttaaagaa tctaaccaac gttgactttc tttctcttgg gtttctcttt 12721 gccgacgttt tttcgctatc cctaaatttt ctcgtggtct tgtccatacc tgttggtcaa 12781 ttattcccaa tggcactcca tctattgtgc tggccaatgt tgaatctact tttaatcccg 12841 ataatttctg atggtcaagg tatcccaccc ccgttgttga tggatgattt gttaaatcta 12901 tgtttgtcgt gtcttgaatt gccagtacta tctcatgaga tttgattctt tctatgctac 12961 tcctgacatg ggctttgcga atatcatctg gctcgaagta tggtgaactc caaaaatcat 13021 aggcagcgca tgtagccgcc atgtttccac acgcttgtgg cacactactt gatggctgag 13081 atgctaagtc ttcaacaatc cgtattaaaa aaagaagtca gaagtcagaa ttcagaattc 13141 agaattaatc ggatggggat ttagacccca accgattata gacaccgtgt agacggatgg 13201 ggatttagac ccctcgactc ggagtcattc gtacaggaga ttcgtctgtt cttcgttcta 13261 ttttttcttc cttctgactc ctgactcctg actcctgagt tcttggttaa ccgcttattt 13321 ctgcgtgcat cccctaggtc tgcttgttgc aattctaatt ctgcccactt ttgcatcttc 13381 tcccctcacc cacaactcct ttattctccc aaaactatac ccatcaatcc tttctctatt 13441 gttttaagtc ttttttactt ttcttaagaa agatgtgact aatgcatagc cttataaagg 13501 agaggggttg gggaaccccc gcaccgtact ggctccccta ctcagatctt gcacctcgac 13561 gtgagtacac cttgaactga agttcaaggc tcataggcaa agtccgttaa aacggactga 13621 atatagatat ccagtgagct ttagcttact tgacctttga gccaagaaat ttatttcttg 13681 gcggacgaga atgatggtgc aagatatgag cctacagata tttagatttt ttcataaatt 13741 aaatcggatt tttatatcat agaagtcatt gtggaatgac acatttctaa cactcgtcat 13801 ccaacgagca aatctactta acccaatagt aggaagcttc tttgtagaag ttcatgtcaa 13861 ggttttagaa aatgactcta caagaaattt atgactcaaa aaattcaaga aagcgtgaca 13921 gaatctaaag atgtgcgagt tagtgcaacc tcaataatct ggggtatagc tgtaggaatg 13981 ttagcaattt gtatcccgct ttcatctgcc acaaaaagtg gttctatttt acctttagct 14041 actatcgctg gtgcagcaat 
aagtactgtt gctgtgtggc gttctgatga caaaaaatca 14101 aaatataact ctttacctca acaaaaagtc gaactgttag agcaaagaat tgctaactta 14161 gaaacaattg tgactcgtga tgattttgag ttgcggatga gaatgaaaca ggtagaatct 14221 cgcgatcgca aaagtgacaa ttaaataact ttcgtccaac tcaccaactt tgtgtaaggt 14281 atagttctac ttctggcaat tttcaagaat tgccttaatt tgttgttaat attgtataac 14341 taaattttaa ttttttcatt ataccataga tgttcaaaat ctgataatat ttctggcatt 14401 ttggcatctt tttataaaaa aaagtcctta attgtgaagc gatattacat ttgtttcaaa 14461 aatatttaaa aatcgtctga aaagcaaaca tcgttctaac gtagaaccta agaagatact 14521 agaggcaatt tatgacgagt aacattcaaa agtatcgatt tgtctgtacc ctgacctttg 14581 gcgacatcta tggtcaaatc attgtttggt tgattaccat tacaataagt ttggcgtcag 14641 ctttggcgtt gatgggtgcc agaaaacctg tttatgcttt agccactgtt ggactcgtag 14701 ttctgctatc tttgcctttc ttgctttttg ctttcgtgac tacattgcta aatcacattg 14761 aagtgtctcc tgtagaacca ggaacaagaa tggaaccaat tccaggtaat gtgtcgcagc 14821 aacaacctgt agaagcaact agctgatgga ttttagatag aggactgatg agtgatgagt 14881 tgttagcagc tagtggtaaa gaagaagctg atgcgtgtct aagttgtttg atttttttta 14941 tgactgtcat tagtcagcgc gtcctttgtt attacaaatg actaatgacc aaaagatacg 15001 gactccctga actggtcaat gtgtaactta acaagtgcaa cagcgtaagc ttacaactaa 15061 acaattctca ttcctgactt taaaattcat aactgacact acatccctgt ctgttggcgg 15121 ggaatttttt tgacacacgc gacgtatcag ctttttacaa tataacagcg atcgcttttt 15181 gataattggg gacttgttat gcttgaacac gatgtgatta ttgtcggagg tgggttggct 15241 gggtgtcgcg ctgcagtgga aattgctcgc attgacccca gtttaaacgt agcagtggtt 15301 gctaaaactc acccgattcg ttcccactca gtcgcagcac aaggtggtat ggctgcgtct 15361 ctgaaaaatg ttgattcagc agacagttgg gaagctcatg cttttgacac tgtgaaaggt 15421 tccgactatt tagcagacca agatgcggta gaaattctca ctcgtgaagc gccggatgtc 15481 gtgattgacc tagaacacat gggcgtttta ttctcccgtt taaacgatgg tcgcatagct 15541 caacgtgctt ttggcggaca ttctcacaat cgtacgtgtt acgctgctga taaaactggt 15601 cacgccattt tgcatgaact cgtgagcaac ctacggcgat atggtgtcca aatttatcaa 15661 gaatggtatg tgatgcgcct gattttagaa gaaggtcagg cgaaaggtgt ggtgatgtac 15721 
cgtattgaag atggtcacat agaggtgttg cgggcgaaag cggtgatgtt tgcgactggg 15781 ggatatggtc gtgtttacaa caccacgtct aatgattatg cttccacggg tgatggtctg 15841 gcaatgactg ctctggctgg tttacccttg caagatatgg aatttgtgca atttcatccc 15901 acggggttat atccagtagg agtgctgatt tcagaagcgg tgcgtggaga aggggcgtat 15961 cttatcaact ctgagggaga acgctttatg gaaagatacg ctcctagtcg catggaactt 16021 gctccgcgtg atattacttc acgggcgatc gcctacgaaa ttcgtgctgg tcgtggtgtt 16081 catcctgatg gaagtgcggg tggttccttt gtctatcttg acttacgaca catgggtaaa 16141 gaaaaaatta tgagtcgcgt tcccttttgt tgggaggaag cacaccgtct ggtgggtgtt 16201 gacgcagtca ctcaacctat gcctgtccgc cctaccattc attattgcat gggcggtatt 16261 cctgtaaata ccgatggtca agttcgtagt agtggcgatg gtctgattga tggctttttt 16321 gctgctgggg aaactgcttg tgtttccgta catggtgcaa atcgccttgg tagtaattct 16381 ttgttggaat gtgtggttta tggacgcaga acaggtgctt cgcttgcaca ttttgtgcaa 16441 aaacgcaagc ttccgacagt agatgagcaa cgctacatca ctgaagctca gcaagagatt 16501 caagctttgc tagatcagtc tggaaaatac cgcattaaca aagtccgtca agccttccaa 16561 gatgacatga ctcagtactg cggcgttttc cgcaccgagg aattaatgcg tgaaggttta 16621 aacaaagtgc aagaattaga acaacagtac tcgcagatat atttagacga caaaggcagt 16681 ttctggaata cagaaatcat agaagccttt gaaacgcgga gtttgatggt ggtagggcgt 16741 atgattttgg aatcagcttt aaatcgtcag gaaagtcgcg gtgctcactt ccgcgaagat 16801 tatccccaac gggatgacac caacttttta aggcacacaa tggcttatta ttcaccagca 16861 gggattgata tccaatatcg cccagtggca ataagcatgt ttcaaccaca ggagcggaag 16921 tattaggaaa gctcagatct tgcacctccg gctgagtacg cccagaactc aagttctggg 16981 ctaata // LOCUS NODE_1959_length_16948_cov_8.014148 16948 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16948) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16948) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16948 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..206 
/locus_tag="DP116_17075" CDS <1..206 /locus_tag="DP116_17075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="IS607 family transposase" /protein_id="PRJNA477356:DP116_17075" /translation="LVFAICEEFETEVVIINKSNEEVPFEQELVQDMIELITVFSARL YGSRSKKNKKLIDGMTQVVKEVQ" gene 206..1297 /locus_tag="DP116_17080" CDS 206..1297 /locus_tag="DP116_17080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015954174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_17080" /translation="MLLGFKTELKLNNQQRKAFAQHCGVARHAWNWGLGLTKQILDHN KANPDSKIKFPSAIDLHKWLVALVKSEHEWYYEVSKSTPQQALMALRESWKRCFNKTA GVPKFKKKGRRDSFTLEGTVKILGNNKIQVPVIGVLKTYERLPQLKPKSVTISREATR WFISFRHEVEAQATEHTDVVGVDLGVKTLATLSTGEVVPGAKSYKKYEAKLSRMQWLN RHKIIGSTKWKKAQIQIARLHRKIANIRKDTLHKLTTLLAKNHGTVVIEDLNVSGMLA NHKLAKAIADMSFFEFRRQLTYKCELYGSKLVVVDRWFPSSKTCSNCGTKKETLTLSE RVFECDHCGFVIDRDLNAAINLSLYVAAS" gene 1434..1628 /locus_tag="DP116_17085" CDS 1434..1628 /locus_tag="DP116_17085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199340.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17085" /translation="MSVVFSPDGQTIATGSEDKTVILWNMDLDDLLRGGCAWASDYLN NNLNVKDDDRHLCDDILKRK" gene complement(1683..1754) /locus_tag="DP116_17090" tRNA complement(1683..1754) /locus_tag="DP116_17090" /product="tRNA-Lys" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(1720..1722),aa:Lys,seq:ctt) gene complement(2049..3152) /locus_tag="DP116_17095" CDS complement(2049..3152) /locus_tag="DP116_17095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318015.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AI-2E family transporter" /protein_id="PRJNA477356:DP116_17095" /translation="MNLGQWIGLIALVLSLYILWQIREVLLLMFAAVVLATTLNRLAR RLQRFGIKRGIAVFLSVAIFLGLIVLFFFVVVPPFAVEFQALTKQVPQGLARFNTWLD YLKDRIPAQLTAYIPDLNSLIQQAQPFINRVLGNSLALVSGSLEVLLKTLLVLVLTGM FLADPAAYRKVFVRLFPSFYRRRVDGILDKCEVSLEGWVTGAFIAMSVVGLMSVIGLS ILRVRSALALGVLAGFLNLIPNLGPTMSVVPAMAIALLDAPWKSVAVFILYFIIQQVE SNFLTPIVMAHQVSLLPAVTLIAQLFFVTFFGFLGLFLALPLTVVAKIWVQEVLIKDV LDQWGDKSHRETEFVLVSDEPQTEKSAENSVDE" gene complement(3243..3431) /locus_tag="DP116_17100" CDS complement(3243..3431) /locus_tag="DP116_17100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17100" /translation="MQNQLGFVLKVFLLSTGLSVLIKYILPNLYIPATATNALIIVFL PTVILASVFFWRLQRQQN" gene 3549..4217 /locus_tag="DP116_17105" CDS 3549..4217 /locus_tag="DP116_17105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320335.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17105" /translation="MIYLPVSLLLFLVLLLLLPFFWFVVAVDVVEIAVAKLGFSPSIA TLLFTLVILTSTINIPVYRTESSVTMANDLASLWVREYWGIPLTKVQRSTVIALNVGG GLIPVLLALYQFTQGNALAILLVTAIVTLVSYYAARVVPGIGIQMNPLLAPLTAALSA MLLAANHAAPVAFAGGVLGTLIGADLLHLKDIQAMSSGVLSIGGAGVFDGIALCGLFA LLLT" gene 4295..6235 /locus_tag="DP116_17110" CDS 4295..6235 /locus_tag="DP116_17110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874966.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cell division protein FtsH" /protein_id="PRJNA477356:DP116_17110" /translation="MPVETNNKKRQIKPPRIRQFGGSLLILLTLLFLLNAIVPSFFGP KLPQVPYSDFIAQVQTGKVDRAIVGGDRIEYALKTQTPDGQATEQVFATTPVAIDLDL PKILRDNNVEFAAPPPDQNGWISVLLNWVVPPLIFFGIWGFLLNRGGGGPAALTVGKS KARIYSEGSTGVKFLDVAGVDEAKAELEEIVDFLKNADKYTKLGAKIPKGVLLVGPPG TGKTLLAKAIAGEAGVPFFSISGSEFIELFVGVGAARVRDLFEQAKQQAPCIVFIDEL DALGKSRGGAGGFVGGNDEREQTLNQLLTEMDGFDANTGVIIIAATNRPEVLDPALRR PGRFDRQVVVDRPDKIGREAILKVHARNVKLADDVNLGTIAIRTPGFAGADLANLVNE AALLAARQNREAVIMADFNEAIERVVAGLEKRSRVLNETEKKTVAYHEVGHAIIGALM PGSGKVEKISVVPRGVGALGYTIQMPEEDRFLMIEDEIRGRIAILLGGRSAEETVFGK VSTGASDDIQKATDLAERAVTVYGMSDKLGPIAFEKVQQQFIEGYGNPRRSISPEVAK EIDREVKQIVDNAHHIALSILQENRDLLEETAQELLQKEILEGAKLREHLNQAKAPDE LAEWLRTGKLSEDKPLMQTLLV" gene complement(6443..6667) /locus_tag="DP116_17115" CDS complement(6443..6667) /locus_tag="DP116_17115" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17115" /translation="MKLNKNQLDLLGTVLGVVAGVSTVLTTQGVIDQKVGGSVGGIAT VLLGVVVQRPTDAEPTTQQVEQEEVKQTKV" gene complement(7041..7631) /locus_tag="DP116_17120" CDS complement(7041..7631) /locus_tag="DP116_17120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_17120" /translation="MTIASKSKLTLEEFLKLPDTKPASEYINGEIIQKQMPQGEHSLI QTSFCELINGVGKKQKIAIAFPELRCTYPAGSRSSSVYGGQSIVPDVTVFRWERIPLK PSGRIANRFEIHPDWAIEILSPDQRQTKVLGNLLYCSRCGTELAWLIDPEEESVLAVF PNQRVEVYEGSAQLPILNNIELELTVEQIFSWLTLS" gene complement(8031..9802) /locus_tag="DP116_17125" /pseudo CDS complement(8031..9802) /locus_tag="DP116_17125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874964.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="WD40 repeat domain-containing protein" assembly_gap 9675..9684 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 10476..10796 /locus_tag="DP116_17130" CDS 10476..10796 /locus_tag="DP116_17130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17130" /translation="MNNVKLETWEEKVLRNYLDGIHLINIPASRKKRLVILKWLVRKF EQEVTYTERQVNEIIVRHHSDYATLRRELIGYQLMERENGFYWRLPAAQCKSETEIMR QISL" gene 10959..11279 /locus_tag="DP116_17135" CDS 10959..11279 /locus_tag="DP116_17135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010073336.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="QacE family quaternary ammonium compound efflux SMR transporter" /protein_id="PRJNA477356:DP116_17135" /translation="MAWIYLFIAGLFEVGWAISLKYAQGFTKFGSSVATVTLMILSFT FLSKALRTLSVGTAYTVWTGIGAVGTVLLSIILFKEPFEARRLTCIGLIVMGVIGLRL VSPH" gene 11504..12409 /locus_tag="DP116_17140" CDS 11504..12409 /locus_tag="DP116_17140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_17140" /translation="MFLPPGFGEKYVMTTLGRMVYYTAVGKPWSDTEIEQSSQKTLVF LHAFGGGSSAYEWSKVYPAFAADYRIVAPDLIGWGRSDHPARSYNVNDYIQTIIEFIE RTCNGSINAIASSLTAAFTIRAAILRPDLFKSLILTTPAGLAEFGQDYSKSLSAQIVN IPVVDRLLYMTGVSSSFGIRSFLEERQFARPERVYPEIVEAYLQSAQQFNGEYAALAF VRGDLSFDLSQYITQLTVPTAIIWGQKSEFTGPEVGRRLAEMNPQAIRIFYRLEDVGL TPQLELPAVTIGLIRKFLPLLESPF" gene complement(13242..15200) /locus_tag="DP116_17145" CDS complement(13242..15200) /locus_tag="DP116_17145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194949.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-galactopyranose mutase" /protein_id="PRJNA477356:DP116_17145" /translation="MSQVFNSPESSAISRRTLLKLFGVGATTGVLGYSRLTKPKPTVF QQDTLSLPLLLNQPKSVVVVGGGLAGLACAYELSQRGFAVTLLEKSPQLGGKIASWQI EVNGDSFKMEHGFHGFFPQYYNLNNLAAELGISENFKSLKSYSVVYRDTKYQPEVFRP SRSAFPWNIVDLAIASPNRFQWGINLTKLKHLQVFQAIGGFEREKNYRRFDNISVANW VEEEFPKGLYDLYFLPFAKSSLNAPDMMSVAELLQFFHFYFFGNPEGLAFNGTKDDMG TSLVQPIAQAIQSKGGKIITEAMVSEIQVLKTKVDSLSYQIGNNTNNVPFGVKRNNTI ETRQGTSLQYFGAADEVFALPDNSQEAISLTCTHQGCTVKIAEDGKFHCPCHGAVFAA DGKVLKGPAQRDLSKFQVVQRQDDGLQLIAANLDSPSSQTIQADYYVFATDVPGVQQL FRQINGDVDGVVGAMRTQIKKLNVADPFAVCRFWFDRDFEWNHSYFTSLSGYQLTDSI TLYHRIQEQFIEWAKRTGGSVVELHAYCYKEKQFPTQFALLTTFEQELYEIVPELKQA NMLHRELVNQKNFSGYPPNSYAERPETSTNISNLVFAGDWVKMPFPSGLMERAVSSGF LAANEILHREGLQRRTLLSVNPEGLLQI" gene 15340..15612 /locus_tag="DP116_17150" CDS 15340..15612 /locus_tag="DP116_17150" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17150" /translation="MIPDSCILDSSDVTQWLSQQGILIELLQHKHSLIFWSLSQRIDF EQSAKFSKVYSVENAASSTLVADECLADGTRKLEDLGKKTEHYLTV" gene 15658..16257 /locus_tag="DP116_17155" CDS 15658..16257 /locus_tag="DP116_17155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_17155" /translation="MPDDSFSPKEISHSGRSRILAQAEQLFRTRGYNAVTMRDIAGEV GIRQASLYYHFPSKEQLFVAVTERMFERHRTGLQQAIDDAGDELRSQLHAVGGWFLSQ PPIHFLSLFHNDMPSLGEDNIKKLAICSEQCIFEPLRQTFIKAQQRGEIRHTRPESLA GFFLSVMESIPFVITGSDAVSGEIIVDEMISVLLDGLKP" gene 16711..>16948 /locus_tag="DP116_17160" CDS 16711..>16948 /locus_tag="DP116_17160" /inference="COORDINATES: protein motif:HMM:PF09912.7" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2141 domain-containing protein" /protein_id="PRJNA477356:DP116_17160" /translation="MTSLSSAKAVSNSSLTVTINGLKNQRGQVCLSLFSSGRGFPTSS DRAVAARCVKLENAPLTVKFENLKAGNYAIAAYHD" BASE COUNT 4852 a 3522 c 3648 g 4916 t 10 others ORIGIN 1 agcttgtttt tgcaatatgt gaggagtttg agactgaagt tgtcattatc aataaatcca 61 acgaagaagt accttttgag caagaactgg tacaagacat gattgaactt atcactgtgt 121 ttagcgctcg cctttatggc tctagaagca agaagaacaa gaaattgatt gatggtatga 181 cccaagttgt taaagaggtg cagtaatgct gttaggtttc aagactgaat tgaaactgaa 241 taatcaacaa cgcaaagcat ttgctcaaca ttgtggagtt gctcgtcacg cttggaattg 301 gggactggga ttgactaagc agatactaga tcacaataaa gctaatcctg actccaagat 361 taagttccct agtgctattg acttgcataa atggttagtg gcactggtga agtctgaaca 421 tgaatggtac tacgaagtca gcaagtctac tccacaacaa gcgttaatgg cactgcgtga 481 atcttggaag cgctgtttta acaagacggc tggtgttccc aaattcaaaa agaaaggtag 541 acgcgactct ttcacattag aaggtacagt gaaaattctt ggaaacaaca aaatccaagt 601 acctgtaatt ggtgtactca aaacctatga acgtctacca caactaaaac cgaagtcagt 661 tactattagc cgcgaggcaa cgagatggtt tatcagcttc cgtcatgagg ttgaggctca 721 agctaccgaa cacaccgacg ttgtaggtgt tgacctgggt gtcaaaacat tggcgacatt 781 atcgactggc gaagtagtac ctggtgctaa gtcctacaag aaatatgaag ctaagttatc 841 tagaatgcaa tggttgaatc gtcataaaat tatcggttca actaagtgga agaaagccca 901 gatacaaata gcgagactcc acagaaagat agccaacatc cgaaaagata cgttgcacaa 961 gctcacaaca ctgcttgcca 
agaaccacgg cacagtagta attgaagact taaatgtatc 1021 tggaatgttg gcaaatcaca aactcgctaa agcaattgct gacatgtcat tctttgagtt 1081 tcgtcgccag ctaacttaca agtgtgagtt gtatggttca aagctggtag tcgttgacag 1141 atggttccca tccagtaaga cttgctctaa ctgtggaacc aaaaaagaaa cgctcacatt 1201 gagtgagcga gtgtttgaat gcgatcattg tggttttgtt atagaccgtg acttgaacgc 1261 agcgataaac ctcagtttgt acgtagccgc cagttaggtg gtggagaagc ccgtggactg 1321 gtttcgtccg aacgaaccag gatgaagcga ggaattaatc gaagcccaag gtagatattt 1381 ccagtaatgg acagctttgg gtaggtatta tataacggca taacggtcca gtcatgagtg 1441 tggtgttcag cccggatggt cagacgattg ccactggaag tgaggacaag acagtaatac 1501 tatggaatat ggatttggat gatttactca gaggtggttg tgcttgggct tctgattatc 1561 tgaacaataa ccttaatgtt aaggatgatg accgtcacct ttgtgatgat attcttaaaa 1621 gaaagtaatg ccggattcaa atctccagcg atttagttaa tgtacaacct ggattcaaaa 1681 agtgggtgac gagggatttg aacccgcaac caatggatta agagtccact gctctaccgt 1741 tgagctagtc accctatagc gagattagga atttacatga tatcacttca aagctaatct 1801 tgtccagcaa agtgaactgt caacaggtcg cggcgtttgg tacaaatcct tattctactc 1861 gtccataatt gtcgcgctta cctcaccccg gttttgtctt gcgccaaaac cgcccctctc 1921 cttagtaagg agaggggacg tgaagcgtag ctttacgggg gtgaggtcaa accaacgtgg 1981 aatcaaggct gaacgttaag ttgacaccaa tgcaagcaca gtccccacta ctcttaaatt 2041 agctgctcct attcatctac ggaattttca gcagatttct ctgtttgtgg ttcatcagaa 2101 acaagcacaa actcagtctc tctatgagat ttatcacccc attgatccaa aacatctttg 2161 atcaacactt cttgcaccca aatcttagcg acaactgtca ggggtagtgc aagaaacaag 2221 cctaagaaac caaagaatgt gacaaaaaac aactgggcaa ttaaggtaac agctggtaac 2281 agcgacactt gatgcgccat aacaatgggc gtgaggaagt tactctcaac ctgctgaatg 2341 ataaagtaga gaataaatac agcaacagat ttccagggag catccaaaag ggcgatcgcc 2401 attgctggaa ccacgctcat cgtaggacca agattaggaa tcaagttcaa aaatcctgct 2461 aaaactccca aagccagtgc tgaacgcaca cgcaaaattg ataaaccaat cacgctcatc 2521 agtcccacaa cactcatagc aataaaagcg ccggttaccc atccttccaa tgacacctcg 2581 catttatcta aaatcccatc cactcgccgt cgataaaacg agggaaacag ccgcacaaag 2641 acctttcggt aagccgcagg atcggcgagg 
aacattcctg tcaacaccag caccagtaaa 2701 gtcttgagga gaacttctaa ggaaccagag actaaggcta aggaatttcc tagcacacga 2761 ttgataaaag gctgtgcttg ttgaatcaag ctattcaagt ccggtatata agcagtgagt 2821 tgggcaggaa tgcggtcttt taagtaatcc agccaagtat taaaccgcgc caacccttga 2881 gggacttgtt ttgtgagtgc ttgaaactca actgcaaaag gcggtacaac cacgaagaag 2941 aaaagtacaa tcaagcctag aaagattgca actgacaaga aaaccgcaat tccacgcttg 3001 attccaaagc gctgcaacct tctagccagt cgatttaagg tagttgctaa aacaactgca 3061 gcaaacatca gtaacaggac ttcccgaatc tgccacagaa tgtacaaaga aagaactaaa 3121 gcgattaacc ctatccattg acctagattc actacctgac tcctggctat tgacgaactg 3181 ctacttcttg tcgctgcagc taggctagct aattttagca gttctcgcct agacgttttt 3241 tgttaatttt gctgtcgttg caatcgccag aagaaaacac ttgccagaat gacggttggc 3301 aaaaaaacga ttatgagggc gttggttgct gtcgctggaa tatataaatt cggaagaata 3361 tacttaatta aaactgagag tccagtcgag agaagaaaca cttttaaaac aaatccaagc 3421 tgattttgca taacaagtac ggcaattcac atttatctgg gggggagaca cgaaatagaa 3481 aaatatgtct cattctttaa aataaattca agcagagcca cagacaacac tctgcattat 3541 aaaccgccat gatttacttg ccagtgtcgc tgctgctatt tctggtgtta ttactgctcc 3601 tacctttttt ttggtttgtt gtggcagtag atgtggtgga aattgcagtt gcaaaattag 3661 ggttttcccc atcgatcgca actttattat ttacgctagt gatattgacc agtaccatca 3721 atatccctgt gtaccgtacc gaatcctctg taacaatggc aaatgacctt gcttctttat 3781 gggtgaggga atattggggt atacccctaa caaaagtaca gcgttctact gtgatagctt 3841 tgaatgtggg cggaggcttg atccctgtct tgttagcact ttaccaattt acacaaggaa 3901 acgctctcgc tattttgtta gtgacagcta ttgtcacact ggttagctat tatgcagcac 3961 gtgttgtccc tggaattggt atacagatga atccgttgct ggctcctttg accgctgctt 4021 tgtctgcaat gttacttgcg gcaaatcatg cggctcctgt tgcctttgcg ggtggtgttc 4081 ttggaacctt gattggtgct gacttactgc atctaaagga cattcaagct atgagttcag 4141 gagtcctgag tattggtggt gctggggtat ttgatggtat cgctttgtgt ggtttatttg 4201 ctcttttatt aacgtgaggt agttatttgc tatttgtaga aactaaaaaa ataacgataa 4261 atagaaaata aattattgga ggggaaatca agaaatgcca gttgaaacta ataataaaaa 4321 acgccaaatt aaaccaccaa gaatacgtca gtttggtggt 
agcttgctca ttctattgac 4381 ccttcttttt ctcctgaacg cgattgttcc tagctttttc ggtcctaaat taccgcaagt 4441 tccttatagc gattttatag ctcaggtaca aacaggtaaa gtagatcggg cgattgtggg 4501 gggcgatcgc attgagtatg ccctcaaaac tcaaacccca gatggtcaag ccacagaaca 4561 agttttcgca acaacaccag tggcgatcga cctagattta cctaaaattc tgcgtgacaa 4621 taacgtagag tttgccgcac caccaccaga ccaaaatggt tggattagcg ttcttttaaa 4681 ctgggttgta ccaccattaa ttttctttgg tatttggggc tttttgctca atcggggcgg 4741 cggtggtccc gcagcactga cagtaggtaa aagtaaggct cgtatctact ctgaaggtag 4801 cactggtgta aaatttcttg atgttgctgg tgtagatgaa gcgaaagccg aactggaaga 4861 gattgttgac tttctcaaaa atgctgataa atacaccaaa ttaggagcga aaatacctaa 4921 aggtgtgttg ttggtaggac ctccaggaac gggtaaaaca cttctcgcga aagcaattgc 4981 tggtgaagct ggtgtccctt tcttcagtat ttctggttct gaatttatcg aactctttgt 5041 tggtgtcggt gctgcacgag ttcgcgactt attcgagcaa gcaaaacaac aagcgccttg 5101 tattgtcttt attgatgaat tggacgcact cggtaagtct cgcggtggtg ctggtgggtt 5161 tgtaggtggt aacgatgaac gggaacaaac cctgaatcag ttactaactg aaatggacgg 5221 ctttgatgct aatactggag ttatcatcat cgccgctacc aaccgtcccg aagttcttga 5281 tccagcactg cgtcgtcctg gtcgctttga ccgtcaagtg gtcgtggatc gtccagacaa 5341 aattggtcgt gaagcgattc tcaaagttca tgctagaaat gtcaaattgg ctgatgatgt 5401 taacttggga accatcgcta tcagaacgcc tggatttgct ggagcagatt tagccaatct 5461 tgtcaacgaa gctgcactcc ttgctgcacg ccaaaatcgg gaagcagtga tcatggcaga 5521 ttttaatgaa gctattgagc gcgttgttgc tggtttggaa aaacgctctc gcgtcctcaa 5581 tgaaaccgag aaaaagactg ttgcttatca cgaagttggt cacgccatta tcggtgcttt 5641 gatgccagga tctggtaaag tcgaaaaaat ctctgttgtt cctcgtggtg ttggtgcttt 5701 gggttacacc attcaaatgc cagaagaaga ccgctttttg atgatagaag acgaaattcg 5761 tggtcgcatc gctatcttat tgggtggacg ttctgcagaa gaaaccgtct ttggcaaagt 5821 gtccacgggt gcgagtgacg atattcaaaa agcgactgac cttgcagaac gcgctgtgac 5881 tgtctacggt atgagcgata aacttggtcc tatcgcattt gaaaaagttc agcagcagtt 5941 tattgaagga tatggtaacc cgcgtcgttc aattagtccg gaagtggcaa aagaaattga 6001 ccgtgaggta aaacaaattg tagataatgc tcatcacatt gctttgagta 
ttttgcaaga 6061 aaaccgcgac ttactggagg aaactgcaca ggaactgttg cagaaggaaa ttctcgaagg 6121 cgcaaaactg agggaacacc tcaaccaagc caaagcgcca gatgaactgg cagaatggtt 6181 gcggacgggt aagttatcgg aagataagcc tttgatgcaa acacttcttg tgtaagtaag 6241 atttttgaaa gaattgaaaa gcgatcgctc taaggtgtgg cgatcgcttt ttttagccta 6301 aatttcctca cctgcaactt gcctcaaatc acagttaagt agctacaaat gtcttcaact 6361 ggttgattga tataaagtta gtaatagttg attgcccaat agcccgttag ggcattgtgc 6421 ttcgttgcag tcgcaacatt cgttaaactt ttgtttgttt gacttcttcc tgttctactt 6481 gttgagtagt tggttctgca tccgttggtc tttggacaac aacccctagc aaaacagtag 6541 caatgccacc aacgctaccg ccaacttttt gatcaataac gccttgggta gtgaggacag 6601 tggaaacacc tgcaacaaca cctaagacag tgccaagtaa atcaagttga tttttgttga 6661 gtttcatagt tagtaatcaa ggttagttag agtgtgccga caaagtatgc tacagtttcg 6721 attctttcta ttccggagag attggcaact ctgatttttc acgggatgtg gaaagtcagg 6781 ggcatgaaac tcccctgctc ctctctcccc cctatcccca gaggggaccg gtgagtcccc 6841 cctgctcctc tgctgtctaa atgtgcaaag gtgggctaat tacttacgat atgtccgcgc 6901 cccagtggaa cgtagaggcc ggcgtcgcgg cactctccca aaacctactt gtttttttat 6961 acttgttaga ttgaatttgt agcgatcgca atttcaagag tgcgatcgct ttttttatcc 7021 aaaatttgct cacctgcacc tcaactcaaa gtcaaccagc taaaaatctg ttcaaccgtc 7081 agttctaact caatattatt aagtatgggc aattgagccg aaccttcata gacttccact 7141 cgttgattag ggaacactgc caaaacactt tcttcttcag ggtctattaa ccacgctaac 7201 tcagtgccac atcgtgagca atacaacaag ttaccaagaa cttttgtttg tctttggtct 7261 ggggaaagaa tttcaattgc ccagtcgggg tgaatttcaa agcggttagc aatcctacca 7321 gatggtttta atgggattct ttcccatctg aacacagtaa catcaggaac gattgattga 7381 cctccgtaga cgctcgaaga gcggcttccc gcagggtagg tgcagcgaag ttctggaaaa 7441 gctatggcaa ttttctgttt tttaccgact ccattgatga gttcgcaaaa actagtttga 7501 atgaggctat gttcaccttg aggcatttgt ttttgaataa tttctccatt aatatattct 7561 gatgctggct tagtgtctgg aagttttagg aactcttcta aggtcagctt agatttacta 7621 gctatagtca taagtcaatt cctttttaat ttttagtcaa gcctaaataa actcttcaag 7681 gtgtgctatt gtctcaccta tgttctctat caattccgag gaaaggttct tgctagattc 
7741 ttgtccttca agccagtcaa acaagcagca cagccataag ctttcactac ctttaagatt 7801 aatatgtgta agaaaatctc cttgtttatt tcttacagga tttggaacga ctatactatt 7861 ttctgaggat aggaaacgca gccaagttaa ttcggaataa acttgaactt catcaaagtt 7921 atctggcgaa tatattcttc aaatatcatt tgtcatatga gaattaccta ttgtagagac 7981 gtgagaaact ttgcgcttca cgtctctaca atttagcatc gaaaatcctg ttaactccgc 8041 tgccaaattt taatcgtctt atctaaactc ccactcacta atagttcccc tgaagcagta 8101 aaagccaatg ctgtcaccgt gtgtgtatga cctgtaaatg taccaagcag ttctcccgtt 8161 tgtaggtgcc acaacttaat cgttttatct gcactaccgc tggcgataat ttgtccatcc 8221 ggactcaacg caatggcgta gactccatct ctgtgtccgt tcaatgtcct aagtaattcc 8281 ccattttcta agtgccaaat tttaatcgtt ttatcccgac taccactgac caaaattttg 8341 ccatccgtac tgacagccaa ggaacgaaca atatgggaat gacctgtaaa cgagtgcagc 8401 aattcgggaa ccaaaggtat ctcagcttcc tgttgggaca tgcgccagac tttaattttg 8461 cggtaactcc ctgtgatgag agtttgtcca tcaggactca aagcaagtga atgggctgct 8521 gtatcttcta gagataaggc agtagcaacc tgacgctgca tcaagtccca aaacataatt 8581 gttctatcat cccccccggt tgccaacatt cgtccatctg gggtaaaggc tacacaccgc 8641 accattccat tatgcttgtg caaaatatca atcaagtctt gtgcccctac gtgccaaagt 8701 ttaattgtgg agtctgcacc agcactgact aatgtctgcc catctggact gaaagctagg 8761 gaattcactt catcaaccaa cccagataaa atccaaggat attctgataa tgtccctact 8821 aactcacctt tggtcaaatc ccatagcttt gtctccccac gactaccact tgctacaaga 8881 ggagatgtgt tgccgttgtt acccctgtga ctgaaggcta agcaattaat tcccctggta 8941 tgtcctttta aggtctgcca gcattcccat tcacctacta ttgtcttgag tggatggtct 9001 gatggtagag tctgagtcgc ttctgtgatc acaatttcat tcaagactct tagcttgtct 9061 ttttgagtac tcagtgattc taaataccaa ctcagccctc ccgcgtacaa caactggctt 9121 ggggtaatat acgattttgg ctcactaaat tcaagttgat ttgtgggtaa aaaacctgct 9181 atgatcgcct gcttctcata acctctttga ccagtaaatg ggtaaaacaa acatagaaaa 9241 attaagactt gatggttttt taaatcttct tgtgtgatcg accatctaac tttgtctttc 9301 ttgatactgt taaaactttc tccatcagca gcataaactt gaagactgac tttgggattt 9361 aagctaagca cgtaggaaaa tttgctttca ccacgtaaat ttccctcatc atctagaact 9421 
atattgttca aggtatcttg tgccactttt ttcacctgct tgcctaggcg atcgctcatg 9481 actcggtcaa cgatttgctc tcgcaaaaga taatcttccg cttggtgctg gaagtatctt 9541 ggaaaaggaa ttgcccagtc aggaggatca gggtattcag gcggtgtggg ggggggattt 9601 ttggcaataa tttctgcttg ttggcaagca atctccagca gttttgccaa agattcgccc 9661 caaaatgcca taatnnnnnn nnnntttcgc tatgacaacc tttgacttga ctttctagca 9721 aaaagagatc gtaagtctta ggttttttga ttcgttgaag gaaatcagct tgttgagcct 9781 ttagtaagct aatccaatcc atttgcattc ctgcggcgca gtatttgcat acctacgtga 9841 agttaagtca aggcaattgc tcaagttagt gtatcaactt cagttattta agaagtcggt 9901 catttcagta gttaatgtaa cacaaccttt tgacgtggat agacaccaaa tgaccatagt 9961 caagtggtga cgtagccagt ttgccttgga aacattgaga tgatagatga ttttacctaa 10021 cctggagcat gagatactgt ttcgctagac gcagtttttg acaaaactgt gctcaaattt 10081 gaacaaatct taggactggt tgcataaaag ggcaatactc gcttttcaaa catagacact 10141 tacagcaatt ttcagcttat tagactacgt ttgagacagg gagtgaggga acgtgtgaaa 10201 gtgcctgaca aaactgtagc ttacaataca tttatttgat tcactgaaaa ttctgagtgc 10261 cacaagcctt tgatataacc cgatgtaacc cgatataaaa ttataaggag gaagcacaat 10321 aactaactga ctttctctaa catattattg ttgagttctg ctttctgttt tctgtattct 10381 tttcatattt tttaatgttc cagggctaac acaacgccag agtttttttg tgcttagtta 10441 ttaatattaa tattttggag tctaggatgc aagaaatgaa caatgtcaaa ttagaaacat 10501 gggaagaaaa agtactcaga aattatcttg atggtataca cctgataaat attcctgctt 10561 ctcgcaaaaa acgcttagtc atactcaagt ggttagtcag aaaatttgag caagaagtga 10621 cttacacgga acgtcaagtg aacgaaatta ttgtgcgtca tcactctgac tatgcgacat 10681 tacgacggga actcatcggt tatcaactta tggaacggga aaatggtttt tactggcgat 10741 tgccagcagc tcaatgtaag tccgaaactg aaattatgag acagatttcg ttatagacaa 10801 ctctaacaac tatgttacac tagataccac gcacacgggc ttagttgcga gatagtgttc 10861 cctgttataa ctcagataga aactcctcgt atttggttgc atctactaaa aaaatcaaat 10921 cacagttcta agttattgca acctataagg acgagcacat ggcgtggatc tatcttttta 10981 ttgctggttt atttgaggta gggtgggcga ttagtttgaa gtatgcacaa ggatttacta 11041 agtttggttc tagtgttgct actgttaccc tgatgatact cagtttcacc 
tttttgtcta 11101 aagccctgcg tacactatca gttggaactg cttacactgt ctggacaggt ataggagctg 11161 ttggtacggt tctgttgagt ataattttat ttaaagaacc ttttgaagcg cgccgcctca 11221 cttgcattgg cttaattgtc atgggtgtga taggactgag gctagtttct ccacattaac 11281 cgactctctg acccggcaat gttgtctggg tagactacta atctttggaa gtaagctatt 11341 gctcacacct gggaaagatg aacattggct tgataaagtt taatacatag aagtaaaaac 11401 ttaatttaga cgacagcata ttaaatttaa ctttagaatg atgtcgtttc ctagctgctg 11461 tatccacgat ggaaatggat gatgcagcac ctgaggtatt tctatgtttt taccccctgg 11521 gtttggtgag aaatatgtga tgaccacgct aggaaggatg gtgtactata ctgctgttgg 11581 gaaaccgtgg tctgacacag aaattgagca atcaagccag aaaacattag ttttcctaca 11641 cgcgtttggt ggtggttctt ctgcttacga gtggtctaaa gtttatccag cttttgctgc 11701 ggattatcga atcgtcgcgc cggatttgat tgggtgggga cgctctgacc atccggctcg 11761 aagttataat gttaatgact acattcaaac tattattgag ttcatagaaa ggacttgcaa 11821 tggttcgatt aatgcgatcg cctcttcact cactgctgct ttcacgattc gcgctgcgat 11881 tcttcgcccc gatttattca agtctttaat tttgaccaca ccagcaggtt tggctgaatt 11941 tggacaagat tacagcaaga gtttatctgc tcaaattgta aatattccag ttgtagatcg 12001 attgctttac atgactgggg tttctagcag ttttggcatt cgcagctttt tggaagaacg 12061 tcaatttgct cgtccagaac gagtctatcc tgaaattgtc gaagcatatt tgcaatcggc 12121 tcaacagttt aatggagagt atgctgctct tgctttcgta cgtggtgatt tatcttttga 12181 tttatctcaa tatatcactc aattgaccgt tcctactgcc ataatttggg ggcaaaagtc 12241 agaatttaca ggacctgagg tcggtcgccg acttgcagaa atgaaccctc aggcaattcg 12301 tatcttttat cgtttggagg acgtgggttt gactccccaa ttggaacttc ctgctgtcac 12361 gattgggctg attaggaaat ttttaccttt actagagtca cctttttgaa cagcttttct 12421 tgtgatctga tttgcgcttg cgtcggagtg atttaacagt accagccgca gtcaacagta 12481 ccagccgcag tgaacagtta tcaagcaatt gatgactgat gactgataac tgataactga 12541 taactgtttt aatgattgaa aaaacaagaa ccccgacttt ttagaaaagt cggggttttg 12601 aatacaaatg tgcacgacta ttgcgataat tcaaatgcta acgaatttgc tatgaaatac 12661 acatattgtg taggggagcc agtgcgaatg acggtgagac cagtgctgcg gtccttgttt 12721 cccgacagcc aggcatctgg cgttagccgt 
caggcctgcc gtaggcatac ccgaagggct 12781 ttccctcact tggcatctgg tgagaccagc gctgcgggag ggtctccctt cgtaggcgac 12841 tggcgttagc cgtaaggcgt gcgctttacg catacccgaa gggcgttggg gattgcaatc 12901 cccaccctac ttaattctag ttaacctcag gaattagagt tggtatcatt gctaccgata 12961 attgcttgga agatgttcga tactagaaag aaaaaggctg tgaataacaa cagaagcatc 13021 aggcttgcaa gagtagcaaa gggagaaatg ataaataata gtaagaatgc aaacatgaga 13081 gttatcatca acgttttatt catctttttt ccttttcata atttttaatt taacttaact 13141 agctaattct ataactgtgc cacttctatt agaaagattt tgtcttcttt cttaaggatg 13201 aagcccaagc gtcaaacctg ggcttttttc ttcgtccagg cttaaatctg caataatcct 13261 tctggattga ctgacaaaag tgttcgcctc tgcaaaccct cccgatgtaa aatttcattg 13321 gctgctagga agccactact caccgcccgt tccatcaagc cggaaggaaa tggcattttt 13381 acccagtcgc ctgcaaacac caagttagag atattggtac tagtttctgg gcgttcggcg 13441 taactatttg gcggatatcc agaaaagttc ttttgattca ccaattccct gtgcagcata 13501 tttgcttgct ttaactcagg gacaatttcg tagagttctt gctcaaaagt tgttaataat 13561 gcgaattggg tggggaattg tttttctttg taacagtaag cgtgtaattc taccacacta 13621 ccaccagtgc gttttgccca ctcaatgaat tgttcctgaa tgcggtgata gagggtgatg 13681 ctgtctgtga gctgatagcc tgacaaagaa gtaaaataac tgtggttcca ctcaaaatca 13741 cgatcaaacc aaaaacgaca gacagcgaat ggatcggcga cgtttagttt tttgatttgc 13801 gttcgcatag cgcccacaac tccatcaaca tcaccattga tttgccgaaa cagctgttgt 13861 acacctggta catcggtagc aaagacataa taatctgctt gaattgtctg tgatgatggt 13921 gaatcgagat ttgctgctat caactgcaaa ccatcatcct gacgttggac aacttgaaat 13981 ttagataaat cacgttgggc tggacctttc aacactttac catcagccgc gaacaccgct 14041 ccatgacaag ggcaatgaaa cttaccatct tctgctattt tcacggtaca accttggtga 14101 gtgcaggtaa gagaaattgc ttcttgacta ttatctggta gggcaaagac ttcatcagcc 14161 gcaccaaaat attgtagaga tgtcccttgg cgtgtttcta tagtattatt acgcttaacc 14221 ccaaaaggaa cattgtttgt attattacca atctgatagc tgagtgagtc tactttggtt 14281 tttaaaactt gaatttcact caccattgct tctgtaatta ttttgccacc cttactttga 14341 atagcttgag cgatgggttg caccaaactt gttcccatat catctttggt accattaaaa 14401 gcaagtcctt 
ctggattacc aaaaaaataa aagtggaaga actgtaaaag ttcagcaaca 14461 ctcatcatgt ctggtgcatt caaactagat ttagcaaaag gcagaaaata caagtcgtat 14521 aaacccttgg gaaattcttc ctctacccaa ttggcgactg aaatattatc aaaacgtcga 14581 taattctttt ctctctcaaa accaccaatt gcttggaaaa cttgtaaatg tttcaatttt 14641 gtgagattga taccccattg aaagcgattg ggagaggcta tagctaaatc cacaatgttc 14701 caagggaaag ccgaacgact gggacgaaat acctctggtt gatacttggt gtcacgataa 14761 acaacagagt aagattttaa tgacttgaaa ttttccgaaa tccctagttc tgcagctaaa 14821 ttattcagat tataatactg aggaaaaaac ccatgaaagc catgctccat cttgaagctg 14881 tcaccattaa cttcaatttg ccaactggca attttgccac caagttgggg agatttttcc 14941 aataatgtga ctgcaaatcc tctctgactc agttcataag cacaagctaa accagctagc 15001 ccacctccaa cgacgacaac actttttggt tgattcaata acagtggcaa actcagggta 15061 tcttgttgaa aaactgttgg ttttggctta gtcaaacgag agtatcccaa aactcctgta 15121 gtagcaccta caccaaataa ttttagtaat gtgcgacggg agatagcaga tgattctggg 15181 gaattgaata cttgactcat gcgttactca attaactcct caccatgaag tactaaccga 15241 agaagaattc accaactcag aacttagaac tcagaaagaa tttgaattct gggggttgaa 15301 ttggagttac aagttcctag atgaatctat aggattattt tgatccctga ctcctgtatt 15361 ctggattctt ctgatgtgac acaatggcta tcacaacagg gcattttaat agaattgttg 15421 caacataaac acagcctcat attttggagt ctttctcaac gaattgattt tgagcaatcg 15481 gcaaaattct caaaagtata ttccgttgag aatgctgcat cctcaacttt agtagctgac 15541 gagtgtctag ctgatggaac aagaaaactg gaagatttag gtaaaaaaac agaacattat 15601 ttgacagtgt aatataatct ttaatagttg actactaaac gttaggtagg ttccagattg 15661 ccagacgatt ctttctctcc aaaagaaatc tctcatagtg gacgatcacg cattctcgct 15721 caggcagaac agctatttcg cacgcgaggc tacaatgcag taaccatgcg ggatattgct 15781 ggtgaggtgg gaattcgtca ggcatcgttg tactatcatt ttcccagcaa ggagcaactt 15841 tttgtggcag ttactgagcg aatgtttgag cgccatcgaa caggtttaca gcaggcgatt 15901 gacgatgcag gagatgagtt gcgatcgcaa cttcatgctg taggtgggtg gtttctttct 15961 cagcctccca tccatttttt gagtctgttc cacaacgata tgccttcctt gggtgaagat 16021 aatattaaaa agctggcaat ttgtagcgaa caatgtattt ttgagcctct gcgacaaacc 
16081 tttatcaaag cgcaacaacg aggtgaaatc cgccataccc gtcccgaatc gctcgctggg 16141 ttttttttgt cggtgatgga aagtattccc ttcgtcatca ctggatctga tgcagtttct 16201 ggggaaatca tcgtagatga gatgatttcc gttttgttag atggactcaa gccttaggta 16261 gaaagggact agtaccgcta gcgttcgtca aaattcaaaa ttcaaaattc aaacaggtat 16321 acagcgtaag cgtttcattg atttggaatg ggtagtttat ttacgcgccc ggtgtactag 16381 taccgcgctc gtcccaaggg gacgagaaat tgattgaaag gatcgcccca taattgaata 16441 tcgcaagatg caagcaactt ttgagcaaca aatgtcaccg tattgcaatc aaatgaatgt 16501 agagaattca atatgtttcc atcattagac aaacctggcg ctctgtcgct ggttctcaat 16561 gttcttctag cagttggtgc atctgtgttg agtggattgg gctggcttta acaaaagcat 16621 tgcgtaagtt ttttatatgt tttaaaacat taagtcatga cgatgcaacg ttgtttctgg 16681 agctttttac ttttaacaga tgtcatattt atgacatctt tatccagtgc aaaagcagtg 16741 tctaacagta gtctcaccgt caccattaac ggcttgaaaa accagcgtgg gcaagtttgt 16801 ctgagcctgt tttctagtgg gcgaggattt ccgaccagta gcgatcgcgc agtggctgcc 16861 cgttgcgtca aactagaaaa cgctccgctg acagttaaat ttgaaaactt aaaagcagga 16921 aattatgcga tcgccgccta tcatgatg // LOCUS NODE_1961_length_16925_cov_6.30569116925 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16925) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16925) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16925 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1..125 /locus_tag="DP116_17165" /pseudo CDS 1..125 /locus_tag="DP116_17165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874444.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=3 /transl_table=11 /product="16S rRNA (uracil(1498)-N(3))-methyltransferase" gene complement(161..583) /locus_tag="DP116_17170" CDS complement(161..583) /locus_tag="DP116_17170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318761.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17170" /translation="MSNPARYSFIATDSSLGEASFKPFLPLTLNYREKSQEVIGLLDT GAMVNVLPYQVGVELGAVWEEQTTMLQLSGNLAQFEARVLILSATVGQFPSVRLVFAW TQATQIPLLLGQANFFMEFNVCFYRSQKILEVTPKQSS" gene complement(576..791) /locus_tag="DP116_17175" CDS complement(576..791) /locus_tag="DP116_17175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318760.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17175" /translation="MVSTELVEQLRKLNRVDKLMVIQLLAAELANEETNLIKSGASYP VWSPYDAVEAANIMLEALNAEASSNHE" gene 1187..1504 /locus_tag="DP116_17180" /pseudo CDS 1187..1504 /locus_tag="DP116_17180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874444.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="16S rRNA (uracil(1498)-N(3))-methyltransferase" gene complement(1704..2843) /locus_tag="DP116_17185" CDS complement(1704..2843) /locus_tag="DP116_17185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="PRJNA477356:DP116_17185" /translation="MQTTKKIIIIGGGIGGAATALALHRAGFEPVVYERTKELREVGA GIALWANATHILKNLGLLEEALCVGYLTTNYQFNSQSGNSLVNIPVDTFELPVIGIHR AELHQLLWRNVPHEKFVLGQTFLGFEQEGEKVRADFSSGLTVEGDALIGADGLRSQVR AALLGDQPPIYRNFKTWRGLTDYVPKEYRPGYIQEFLGRGKGFGFMMLGKGRMYWYAA ATAPPEQPDAPIGRKKKLEIMFQDWFASVPELITTTDEANILTTDLYDRVPTQSWSKQ NITLLGDAAHPMLPTMGQGACTALEDAFVVAKCLKEQANPTAAFQQYESERFPRTKLI VEQSLRAGKMGELDNPFGVALRNTFMKLMGSTISNNFKSLHAYRV" gene complement(2887..3732) /locus_tag="DP116_17190" CDS complement(2887..3732) /locus_tag="DP116_17190" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17190" /translation="MDSLNGDSKSTFLEDLQSGLDNTEHDNILALSVLSNTIEILEKI VIKISQFGQTGAQLKNSKHKDSQVIESLDLNSGQRPEVEDRVELLCEQVETLKQLLYD KELELQEIKQELRDTNEDLWATLNSPWLALDEAKELVKEILVSKKPIAETLATLITTI YNSTVKPLELGHKEKSNSIKLLISAPGNFILTNNEAYQMKSTALIKQAREIRAKSKIL REESREVQAKFREVEVQFMKLESKFVRQASFMLLTPNFRHRQKPKAIELADVSPPAQL LDFGV" gene complement(4353..5486) /locus_tag="DP116_17195" CDS complement(4353..5486) /locus_tag="DP116_17195" /inference="COORDINATES: protein motif:HMM:PF03417.14" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17195" /translation="MSPQPDISKDSTVELLTAQGLKIVPPNLIEEHPTAKLPLVRLTG SPEEIGAEHGTKLCDRIEKAFELYRSKLFSQWTDASLKATSLAFFERIREFSHPYAIE LEAIASHAGRKLWQVVMLMSRTEILRSSTPNECTSVYFKQSRILGQNWDWVEEFENLA VIFDVTRQDGFRFVALGEPGFVKIGLNAAGVGVCLNILKCQAPTGGIPVHILLRKVLD STSLTEAYHAITAAERATMSNILIADDCGRYVNLELAGNQLFNLNSDEVEVNNVVVHT NGFLTSKRENIHFPEESESSAARIIRAKSLTSTQDGRHEADMLKILLDQEGNLPICRQ SERNIFDGLTYGTVSTVVMSLKERKLIFSQGNPRNKTFYYICI" gene complement(5556..5704) /locus_tag="DP116_17200" /pseudo CDS complement(5556..5704) /locus_tag="DP116_17200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013335127.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS5/IS1182 family transposase" gene 5802..6024 /locus_tag="DP116_17205" /pseudo CDS 5802..6024 /locus_tag="DP116_17205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195677.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="16S rRNA (uracil(1498)-N(3))-methyltransferase" gene complement(6213..7748) /locus_tag="DP116_17210" CDS complement(6213..7748) /locus_tag="DP116_17210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869996.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="VanZ family protein" /protein_id="PRJNA477356:DP116_17210" /translation="MKRRKNVNSISKNLGISGDVILIILSIVSVLIATLYPFHFHFPD SFSLPALVASFDNSSFFKDQVNNILLFTPLGFGFASLLQRMRMKPTSQFIIVIFLSAG LSFTVEALQVFLPSRTPTPADILNNTIGGFVGLICFSLWNSQSFLYTLMRRENSRSNN SIKKLTLCFLGYIIISFLISVLWQNTTNLSNWSLNYPLLIGNERTGDRPWQGQVSDVY IADRAISKNEVSQVFHQKNYSDIFGKSLLASYQLTDTKSYQDSTGQLPELLSQGQLPD IEDEKGVVLSSNHWLKTTEPVTFLSKRIRETSQFTIITTVATANTAQTGPARVISLSG DSLHRNFTLGQQGTDLDLRIRTPMTGANGADTKLTIPDIFADTNPHHIVITYSGATIQ VYVDKSQHSYSLNLLEWFPKEQKIFYYALTFIPLGLCLALLTTLAKRKLTFNRLLLPS GILLPSVILESILVTDGGKSLSLKGLLLGILLTAGTTLTLRWRASMVLKTVRAAVNST SRS" gene 8194..8688 /locus_tag="DP116_17215" CDS 8194..8688 /locus_tag="DP116_17215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876403.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17215" /translation="MRLPIIIGAGLVMSLSLHAPVISKSPILVAQSSARGQSDSQQLT ANQRLWNRQNISNYRYTLSRSCFCTPEARGPVIIEVRNGRTTSVTSVATGQPVNPEFF QKYDTVPRLFDLIRDAIKRKADSLDVKYNSTLGYPTQINIDYKSQIADEEEYLTIENL QQIN" gene complement(8894..9670) /locus_tag="DP116_17220" CDS complement(8894..9670) /locus_tag="DP116_17220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015144523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4198 domain-containing protein" /protein_id="PRJNA477356:DP116_17220" /translation="MLIRNLKKLLLAGAILPLILQPASAHVVWFDYQNGEYNLLYGHP EEGPQPYSPAKLKEATAYDAKRQIVPFTINQKQDGLSLTPDGNIAALTAFFDNGYYAR ISENQSRNISEAEISQYQNVSHNLKYTKALYDWSDTLAQPFNQPLEIIPLENPFAVQE GDNLEVQVYYQGQPLSDVTVEYLGQEVSKNNNGIFSVPIGIGGLQQIEASYGFLSDGN LRISYESSFTAQKISVFEPSALLGIGVVGLLALRKKKNLA" gene complement(10167..11189) /locus_tag="DP116_17225" CDS complement(10167..11189) /locus_tag="DP116_17225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315741.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="saccharopine dehydrogenase-like oxidoreductase" /protein_id="PRJNA477356:DP116_17225" /translation="MNAEQVTGSTHLPKAMRVGVLGFGGLGQAAAKVLAPKREMLLVA AADNQGYAYAADGLNAQECIATYQSQGSVGYLEPIGTLTNHSVEDLIEKSYPVDGYFL ALPNLPNDFIASVARQFIKSGWRGVLVDAIKRTSAVEQLLAMKEELQAAGITYMTGCG ATPGLLTAAAALAAQSYAEVHRVEITFGVGIANWEAYRATIREDIAHMPGYTVETARA MTDQEVEALLDKTNGVLTLENMEHADDVMLEIAGIVGRDRVTVGGIVDTRNPKKPIST NVKITGRTFEGKISTHTFTLGDETSMAANVCGPAFGYLKAGIGLHQRGIYGLFTAAEI MPQFVR" gene 11623..11832 /locus_tag="DP116_17230" CDS 11623..11832 /locus_tag="DP116_17230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17230" /translation="METLEFVIYPDGRVQEKVTGIIGASCAEVTAAIEAQLGQVLTHE PSSEYFATKVQQSSVVNTQTTFSDW" gene 11872..12258 /locus_tag="DP116_17235" CDS 11872..12258 /locus_tag="DP116_17235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF1257 domain-containing protein" /protein_id="PRJNA477356:DP116_17235" /translation="MSHFSQIKTQIRNVDSLKDALSDLGVDWKHGPREVRGYRGQTHN AEVTIEQENGYDIGFKWNGKEYELVADLQYWQQNLSVEGFLRQVTQRYAYHTVVKETA RVGFQVAEQQKNEDGSIRLLVQRWSA" gene 12258..12725 /locus_tag="DP116_17240" CDS 12258..12725 /locus_tag="DP116_17240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315745.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_17240" /translation="MSDFLPSPEQSEDERSGLEPELGGFLRDAPERSGFEPELGGVLR QKGVYVDEITCIGCKHCAHVARNTFYIEPDYGRSRVIRQDGDPDEVIQEAIDTCPVDC IHWVNYTELKNLEQERKYQVIPLIGYPVDAAVVATERRRKKQRLTRKKPLREI" gene complement(12752..13975) /locus_tag="DP116_17245" CDS complement(12752..13975) /locus_tag="DP116_17245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311363.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_17245" /translation="MQLVEKHIISRQHKFWKECDYLALQSKHLYNCANYVQRQYFFET KKYYNSIDIYHQTKNLESYRYLPTKVSKQIVRRVSEAWKGWLAALKDWSKHPEKYQGM PRMPGYKHKERGRNVVIYPIDAISKPALTKGIVKLSQTNIEFSTNANSVDQVRIVPKL DHYVIEIIYTVAEPSKSNGEYVAGVDLGLNNLMAITSNHPGVRPLLINGRPLKSINQF FNKQVAKAQSIEAWRQIKELNSKRDRRIDNYLHTSTRRVIDWCQLNDIGQLVIGNNQR WKQDINIGKKNNQDFTKIPHAKLINLLTYKAQLAGIEVTLTEESYTSKASALDGDTLP TFNSKSDIKPVFSGKRVKRGLYKTSTGRTINADTNGSMNIARKVIPNFMDGIVGLPFI PVVLGLWIKITNGFV" gene 14033..14221 /locus_tag="DP116_17250" CDS 14033..14221 /locus_tag="DP116_17250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17250" /translation="MPFQKNHKYRWESNQDKTLDKTPICFKGWEGQKEKLKAVPDWQE RLRDFVDRLIVENLPKND" gene 14335..15456 /locus_tag="DP116_17255" CDS 14335..15456 /locus_tag="DP116_17255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012241581.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_17255" /translation="MRVLFSVVGTRGDVQPVIALALEVRDRGHEVHLCVPPNFIEWAH RLGFGFTPVGIEMRAPRGTAVSDTTTTKPMPDLITDQFDAIGATANGCDIIVGANAHQ YAARSIAELHGIPYINAIYAPTALPTDDTIRIWNERSGDRVNVNRTQLGLSPIDDVLG HIVTDQPWLATDPTLAPSPLVPSMSILQTGAWFLEDSTPLPSDVEVFLEAGDPPVYFG FGSMPIAGDTSLTLIEAARAVGRRAIVSQGWADLKLIDQAPDCIAIGDVNHQALFPRV AMVVHHGGAGTTHTAARAGAPQVLVPMFSDQPFWANRVRELGIGTLIPIADLTANQLI SALHDASDPAIADRADAIAERIILDGVKVAAQHLIDRVK" gene 15471..15689 /locus_tag="DP116_17260" /pseudo CDS 15471..15689 /locus_tag="DP116_17260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015954539.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(15718..16086) /locus_tag="DP116_17265" CDS complement(15718..16086) /locus_tag="DP116_17265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_17265" /translation="MSKCVLIVDDEEDVRAIAQMGLEMASDWNVLCASSGEEALAIAQ INRPDVILLDLMMPDMDGRATLQQLKANPTTKHIPVILVTAKAQSSDKNSFSELDVAA VFAKPFRPLKLAEQISAVLK" gene complement(16079..16858) /locus_tag="DP116_17270" CDS complement(16079..16858) /locus_tag="DP116_17270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017286884.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17270" /translation="MRVVLLSAELIDLFGVQCILNIINDITERKRLENEFISLVSHEL RTPLTSLMGSLDLLGTGQLGTLSAKGQQVLNIATTNTERLIRLINDILDLERMKSGKI PMQPVKCNVADLIDQAVAAMQAMAERAQITLLTEVVAAELIADPDRILQTLTNLLSNA IKFSEPGGTVWLRVRRSHEVQIEVQDCGRGIPANKLQTIFERFQQVDVSDSRKKGGTG LGLAICRKIVEQHHGKIWVESVLGQGSTFHVILPSSEYGCE" BASE COUNT 4843 a 3562 c 3516 g 5004 t ORIGIN 1 aaattttgct cacacctcaa caacaacatt atctggggcg tgttttgcgt ttgcgcgagg 61 gcgatcgctt tatagcaatg gatggaaaag ggaaatggtg gctggcgcag ctacaaggag 121 aaaaaattca ttacaaccgt tgatacagtt ccatgtcctt ttaggatgat tgttttggag 181 tcacctctaa tatcttttga gagcgataaa aacaaacatt aaattccatg aaaaagttag 241 cttgtcctaa caggaggggt atctgagtag cttgcgtcca agcaaatacc aatctaacac 301 tcggaaattg cccaacagta gcagacaaaa ttaacactcg tgcttcaaac tgtgctaagt 361 ttcctgataa ttgaagcatt gttgtttgct cttcccaaac tgctcctaat tcaactccaa 421 cttgataggg aagaacattt accatcgctc ccgtatccaa aagtccaata acttcctgtg 481 atttttctct gtaattcaac gtcagtggca aaaacggctt gaagcttgct tctcccaaac 541 tgctatcagt tgcaatgaaa gaataccttg ctggattact catggtttga ggaagcttca 601 gcattcaacg cctctaacat gatatttgca gcttccactg catcgtaagg agaccaaact 661 ggataggatg caccagattt tatcaagtta gtttcttcat ttgccagttc ggcagccaga 721 agttgaatta ccatgagctt atctacgcga ttgagtttgc gtaattgttc cactaactca 781 gtagatacca ttgttgacct taatgtttac tgatttaata tcaattttca gcataacctg 841 taataagtag ctcaaccaga ttaaacgtaa aatgtcattg 
cgagcgaaac gtagtgaagc 901 gttcgcgcag cgtgtcccct tgggactcag caatcgcaag aatcgtattt tatgtttttt 961 aatgttgacc tacttatcat tattttgctt cgcttgtaag gctcaaggta ttctaaaagc 1021 tcatcagaat tgccttttct ccggggaaat caatcttttc cttttttgaa gattgatatc 1081 aggatgaggt gacactcgtc tcccactaca ttagttttta caacaggact gagaactctg 1141 ctcagaaaaa agcaagcgcc tttattactt acctttatca aatccaatgt cacaactgca 1201 acgaatcgca ataacctcct cccaactcca acaagagcaa attttgctca cacctcaaca 1261 acaacattat ctggggcgtg ttttgcgttt gcgagaaggc gatcgcttca tagcaatgga 1321 tggcaaaggg aaatggtggc tggcgcagct acaaggagaa aaagcgcaaa tattagaacc 1381 actgacagta gaaaccgaat tgcctgtatc aattacgctg atagtcgctt tgcctaaagg 1441 aaatggattt gatgatgtag tacgggcttg tactgagttg ggagtcgcag ttattgctcc 1501 ggtggggaag cgatcgcact ttttaaaaac ctctgatcac gagtatcgta tccgcgttgg 1561 tgattataga gttcgttatg agattgatga tgaaagtcaa ctcgtgcaac ttttacagtg 1621 caagcatcgg aaagatgttt atagaaaata atttgtaaat ttgggtggtt aatgagtgct 1681 ttttgaatgt catctgtttc tcattaaact ctgtaagcat gaagcgactt gaagttgttg 1741 ctgatagttg accccatgag cttcataaat gtgtttcgta atgccacacc aaaaggatta 1801 tccaattcac ccattttccc tgcccgcaag gattgctcaa cgattaactt cgtgcgagga 1861 aaccgttctg actcatactg ttggaaagca gctgtgggat ttgcttgctc tttgagacac 1921 ttagcgacta caaaagcatc ctctaatgct gtacatgctc cctgtcccat agttggcagc 1981 attggatgtg cagcatcacc tagcagtgta atattctgct tgctccaaga ttgagtcggg 2041 acacgatcat acaaatctgt tgtcagaata ttggcttcat ccgtcgttgt aattaactca 2101 ggaaccgatg caaaccaatc ttgaaacata atttcaagct tttttttacg accaattggg 2161 gcatccggct gttctggggg agcagttgct gctgcgtacc aatacatccg tcccttgccc 2221 agcatcatga aaccgaagcc tttgccacgc cctaaaaatt cctgaatgta gccgggacga 2281 tattctttag gaacgtaatc tgttaaacca cgccaagtct taaaattgcg atagatgggt 2341 ggttgatcac cgagaagggc agctcgtacc tgtgatcgca aaccatcagc cccaatcaaa 2401 gcatcccctt cgactgttaa gccagaactg aagtcagcgc gaaccttctc tccttcttgc 2461 tcaaatccaa ggaaagtttg tcctaaaaca aatttttcat gtggtacatt acgccacaat 2521 agctgatgta attcagccct atgaatgcca ataacaggca actcaaaggt 
atcaacgggt 2581 atattaacca atgagttgcc gctttgggaa ttgaattgat agtttgttgt gagatagcca 2641 acgcacagcg cctcctccaa caagcctaaa tttttcaaga tgtgtgtcgc atttgcccaa 2701 agtgcaatac cagcccctac ctcccgcaac tcctttgttc gctcatagac aactggttca 2761 aaaccagctc tatggagagc aagtgcagtc gcagcaccgc caattccacc gccgataatg 2821 atgatcttct tagttgtttg catttttctt gtgtaatcga ccgacttgtc agttgattaa 2881 ttcctatcag actccaaaat cgagcaattg agctggcgga ctaacgtccg ccagctcaat 2941 tgctttgggc ttttgtctgt gcctaaaatt cggcgtaagg agcatgaatg aagcttgacg 3001 aacaaattta ctttccagtt tcatgaactg aacttcaacc tctctaaatt tcgcctgaac 3061 ttcacgagat tcctctctaa gtattttaga tttcgcccga atttcacgag cttgctttat 3121 cagcgccgtg ctcttcattt gatacgcctc attattagta agtataaaat tgcctggagc 3181 gcttatcaaa agtttaatag aatttgattt ttctttgtgc cctaactcta aaggtttaac 3241 tgtcgaatta tagatagtgg taatgagtgt tgctaaagtt tcagctatag gctttttact 3301 gaccaaaatt tctttaacta attcttttgc ttcatcaagt gctagccacg gtgaattaag 3361 tgtagcccac aaatcctcat ttgtatcacg caactcttgc tttatctctt gcaattctaa 3421 ctccttgtcg tacagcagtt gcttgagagt ttctacttgc tcgcacaaaa gctctactct 3481 atcttcaact tctggtcttt gacctgagtt aaggtcaagg gattctataa cttgactatc 3541 tttatgttta ctatttttta gttgagcacc tgtctgacca aattgactaa ttttgatgac 3601 aattttttcg agaatttcta tagtattgga taatacgcta agagccagaa tgttgtcatg 3661 ttcggtgtta tccagccctg actgtaaatc ttctaggaat gttgacttgg aatcgccgtt 3721 taaagagtcc atgcactacg ctcgcaacct tcttaagtta ttctagccac aaaagttgta 3781 atcagagtca agtctggaga gttattttgc agctacgaat atccttaacc atgagacgtg 3841 aggcttctgg cggattaagc taaaagctca tcagaattgc cttgggtgtt ttggcatgca 3901 ttgggatgct ctcaaattga gacataacaa aacgttccat tacaaggatt accttgtgag 3961 aaaatcagtt ttcttgcctt caaattcatc acaaccgtct agaagtctca gcactatcta 4021 gtccacgaaa gaattgagta agaaaacaac cacgataaat aaataagtca ggtagttaag 4081 tttattgcat tggtttttta gcctccttaa gaggttttga aatgatttgt ttgaccctag 4141 ccgcttgttt tattcgtaaa tcttcaacaa ttgcgtgtaa gtcgtagtta aaagacttag 4201 cgtgttattc tctatatttc tgaatttctt ctaacatttc atatttccac atatttcatc 
4261 tcccattagt tcagacggtg tgcaaaatac tcaaggtatt ctaaaagctc atcagaattg 4321 ccttagaaat tgtggcatgt attgggatac tcctaaatac agatataata aaacgtttta 4381 ttacgaggat taccttgtga aaaaatgagt tttctttctt tcaaactcat cacaactgtt 4441 gatacagttc cataggtgag tccatcaaag atatttctct ctgactggcg acatatcggt 4501 aagtttcctt cctgatcaag aagaattttt aacatatcag cttcatgtcg tccgtcttga 4561 gtggacgtaa gagactttgc tcgaataatt cgcgccgctg aactttcgct ttcttcgggg 4621 aaatgaatat tctctctttt tgaagtaaga aatccattcg tgtgaaccac aacattattc 4681 acttcaactt catctgaatt gagattaaat aattggtttc cagcaagttc gagattcaca 4741 tacctaccgc aatcatccgc gataagaata ttactcatcg ttgctcgttc agcagcagta 4801 attgcatgat acgcttcagt cagtgaggta gaatctaaaa cctttcgcag aagtatatgc 4861 acaggaattc ctcctgtagg ggcttgacat ttaagaatat tgagacaaac cccaacacct 4921 gctgcgttta agccgatttt gacaaaacct ggttctccta aagcaacaaa gcgaaaacca 4981 tcttgtcgtg tcacgtcaaa aatgacagct aagttttcaa actcttccac ccagtcccaa 5041 ttttgaccga gaatacgact ttgtttgaag taaacagagg tacattcatt aggtgtacta 5101 gagcgaagta tttcggttcg ggacatgagc atgacgactt gccaaagttt tcgtccggcg 5161 tgagatgcta tcgcctctaa ttcaattgca tacgggtgtg aaaactctcg aatccgctca 5221 aaaaaagcta acgaagtcgc cttgagcgaa gcatcagtcc attgggaaaa aagttttgac 5281 ctgtaaagtt caaaagcttt ttcaatgcga tcgcacagct tagtaccatg ttctgctccg 5341 atctcttctg gagaacctgt taaacgaact aagggtaatt ttgctgttgg atgttcttca 5401 ataagatttg ggggaacaat ttttaagcct tgagcggtaa gtaattcaac agttgaatct 5461 ttacttatat caggttgagg tgacacaata atcgtctcct agtgaactac ctacaccgac 5521 ctgagtacag gtacggtgtc gggcgtgttt tcaaactaca accacaacaa aatgcacgcg 5581 attataatca tagctgtata atttgtagcc aatttctcat atgggtggcg atgcgacgaa 5641 attgcttgag tcggttgata gatcgttcaa caacgttgcg ttgacgatag atttcacgac 5701 taaacggaaa tggtggctgg cgcagttata gcaacggaca tattggttag gacatcatat 5761 tttgatctca agggaacagg gaacagagaa atgtcctaac aattgtggcg actgctatac 5821 aaggagaaaa ggcgcaaatt ttagagtcac tcacggtgga aactgagtta cccgtatcca 5881 taacactgat ggtagctttg cccaaagcaa atggatttga tgatgtggta agggtttgca 5941 
ctgagttggg agtcgcggtt attgctccgg tggggaagcg atcgccatcg cactttactt 6001 catcccagtc cccaaaaact cgaaccctga gcagtaagta attcaaccgt taaattttta 6061 cttgcatcag ggtgaggtga cacaacagtc gtctcccact acattagttt gtacaacagg 6121 acagagttat tatctattta agcttgcttc tttaacagtt atcagttatc agttaccagt 6181 tatcaaagag aaatacggat tcgtttcttt tgttaactgc ggctggtact gttaactgcg 6241 gctcgtactg ttttaagcac catcgaagcc cgccatctta atgtcagtgt tgtcccggct 6301 gttaacaata taccaagcaa cagacctttc aggctgagac ttttaccgcc atcagttacc 6361 aaaatacttt ctagtatcac agacggtaat aatatcccac tcggaagcaa taacctatta 6421 aaagttaact ttcttttcgc tagagtcgtc aaaagcgcta agcaaagccc tagaggaata 6481 aaagtcagtg catagtaaaa aattttctgc tctttcggaa accattccag aagattcaaa 6541 gaataagaat gttgtgattt atctacataa acttgaatag tcgctccaga ataagtgatg 6601 actatatgat ggggattcgt atctgcaaaa atatcaggaa tagttaattt tgtatctgcc 6661 ccgtttgctc cagtcatcgg tgttcgtatt cgcaagtcta agtcggttcc ttgttgtcct 6721 agtgtaaaat tgcgatgaag agaatcaccg gaaagtgaga tgacgcgtgc aggtccagtc 6781 tgagcggtat tagcagtagc aacggtagta attattgtaa attgagaagt ttcacgtatc 6841 cttttactca agaaagtgac aggttctgtt gtttttagcc aatggttgga actcaaaaca 6901 acaccttttt catcttcaat gtctggtaac tgcccttgcg ataatagttc ggggagttga 6961 ccagtactat cctgataact ttttgtatca gtcaattgat aggaagctag caaagacttc 7021 ccaaaaatat cactgtaatt tttctgatga aaaacttgtg aaacttcatt tttagaaatt 7081 gctctatcag caatataaac atctgagacc tgtccctgcc aaggtctatc acctgttcgt 7141 tcattgccaa tcaagagtgg ataattcaaa gaccaattgc tcaaattcgt tgtattttgc 7201 cagagaactg agataagaaa tgatatgatt atgtatccca aaaaacatag tgttagtttt 7261 ttaatagaat tgttggatct actattttct ctacgcatca aggtatagag aaagctttgt 7321 gagttccata gagaaaagca gattagaccc acaaatccac ctatggtatt gttcaaaata 7381 tcggcaggag taggtgttct tgaaggcaaa aacacttgca gtgcttcaac tgtaaatgat 7441 aagcctgcac tcagaaatat cactattata aattgacttg ttggcttcat cctcattctc 7501 tgtaagagac tggcgaaacc gaaacctaaa ggcgtaaata ataaaatatt gttgacctgg 7561 tctttaaaaa aactactgtt atcaaaactg gcaacaagtg ctggtagcga aaaactatct 7621 ggaaagtgaa 
aatgaaacgg ataaagtgta gctattaaaa ctgaaacaat gctcaaaata 7681 atcagtatga cgtctccgga aattcccaag ttcttagaga tagaattaac atttttgcgt 7741 ctcttcataa tgctccacgt tttttgtctt tctagtaatg gataattgaa ctgccgatga 7801 ctggtagacg cttaaataag agtcattaac aaacatgtaa atgaatgctt ttcttcatgt 7861 tatttatttg tcaatttggc tgtgtaaaga gttgtatttt ttttataaaa acgtcataaa 7921 gataggtttt tgtgaaatac atatagtaac tacattagga gatcaggttt ttccagattt 7981 caatgtattc cgcaccaatc atcgattcag agcaaccatt tatactttaa aagtttgcgc 8041 aaaggtgatt taaccactta cctagcagtg aacgcgcact tgacaaagaa attgttatac 8101 cctaagttgt aagaatttcg caaaatgaag cgaaaacgaa aattcccagg agtttgccaa 8161 atgcttagaa ccttgaaaat gaaatcgatg gctatacgct tgccgattat tattggtgca 8221 ggattagtaa tgtctctaag tttgcatgca ccagtcatat cgaaatcccc catactggta 8281 gcacaatcat cagcgagggg tcaatcagat tcacagcaat taacagctaa tcagcgtttg 8341 tggaatcggc aaaatatttc caactatcgg tatacactta gccggagttg cttctgcaca 8401 cccgaggcta gaggaccagt catcattgaa gtgcgtaacg gtagaacaac ttctgtcact 8461 tccgtcgcta ctggtcaacc agttaatcca gaattcttcc aaaaatacga tacagttccc 8521 aggctttttg atttaatcag ggatgcgatc aaaaggaaag cagatagctt ggatgtcaag 8581 tataattcta cactcggcta tccaacccaa attaacattg attacaagag tcagatagcc 8641 gatgaagaag aatatctcac aattgagaat ctacaacaga ttaactaata gtctttattc 8701 cgccttgccg ccagcctcaa agatgagatg tcctttgata ggtaaaataa aagcgcttgc 8761 tttttgctgg gcagaattct aagttgtggc tttagtgtgt gtaaagcaat aaataagctg 8821 ggttaaaaaa cgaaacccag ctgacaaaat ataaataatt taatcgtaag aatgattcta 8881 aattctttct tgatcaagct aaattctttt tcttgcgtag tgctagcaat ccaactacac 8941 caataccaag taatgccgat ggttcaaaaa cagagatttt ttgtgctgta aagctacttt 9001 cgtaagaaat tctcaaattt ccatctgaca aaaagccgta gctagcttca atttgttgca 9061 accctccaat accaatggga acggaaaaga tgccgttgtt atttttagag acttcttgac 9121 ccaagtattc tactgtcaca tctgaaagtg gttgtccttg atagtatact tgaacttcca 9181 ggttgtctcc ttcctgaact gcaaagggat tttccaaagg tataatttcc agcggctggt 9241 tgaacggttg cgctagagtg tcagaccaat cataaagagc tttggtgtac ttcagattat 9301 gactaacatt ttggtattga 
ctaatttcag cttctgaaat attgcgagat tgattttcag 9361 atattctggc ataatagcca ttatcaaaaa atgccgttaa tgccgcaatg ttgccgtcag 9421 gagtcagaga aagtccatct tgtttctggt tgatagtaaa tggcactatc tgtctttttg 9481 catcatatgc tgtggcttcc ttaagttttg ccggagaata gggttgtggt ccttcttctg 9541 gatgaccata caatagattg tattctccgt tttgatagtc aaaccagaca acgtgtgctg 9601 aagctggttg caaaatcaat ggtagaattg cccccgctaa tagcaatttt ttcaaattgc 9661 ggataagcat ctcaatttcc ttaattgagt tattacatca ttattaaacg cagatttttt 9721 gataccgaac aatcccccta tcagggatat ttttgttaat aaaaatttct atttttctct 9781 aatgtttaat ttttggtttt gtcagagaat aatcataaaa agtatgatat tttctagaat 9841 ttatccatga aaaaatacaa aatgtataag ttgtggcttg agtgtgttaa aggcaataaa 9901 taagctgggt ttcgtttttc aacccagcag acaaaatata aataatttag taagaactca 9961 gcaccaatga ggtcagaagt cagaatttat aaggagtcta ttattattgg tctgttcatt 10021 ggatcaaatg ttctcctgat tctttaacta ttctgactcc tgttcgcgta gcgtgcccgt 10081 tcggcctcaa gccgtgccgc aggcataggg catattctgg ctcctgaatt cttacataat 10141 ttaatactca agctgatagc taataactac ctaacaaact gaggcataat ttcagcagcg 10201 gtgaataatc cgtaaatgcc tcgctggtgc aatccaatac cagcctttaa ataaccaaag 10261 gctggtccac aaacgttggc tgccatactg gtttcatctc ctaatgtgaa ggtatgggtg 10321 gaaattttac cttcaaaggt gcgaccagtg attttaacgt tggtgctaat gggctttttg 10381 ggattgcggg tatcaactat accaccaacg gtcacgcgat cgcgccccac aattccagct 10441 atctctaaca tcacatcatc agcgtgttcc atgttttcta aagtcagcac gccattggtt 10501 ttatctagga gtgcttctac ttcctgatcc gtcattgccc tagcagtttc cacagtgtaa 10561 cccggcatat gggcaatatc ttcgcggatg gtggcgcggt aagcctccca gttcgctatt 10621 cctaccccaa aggtaatttc cactcgatga acttcggcgt aactttgagc agctaaagct 10681 gctgctgctg ttaacagtcc tggtgtcgcg ccacatcctg tcatgtaggt aattcctgct 10741 gcttgcagtt cttctttcat cgccagcagt tgttccacag cactggtgcg tttaattgca 10801 tccaccagta ccccgcgcca tccagatttg ataaactgcc tagctacaga ggcaataaaa 10861 tcattgggga gattgggtaa cgccagaaaa tatccgtcta caggataaga tttttctatt 10921 aaatcctcaa cactgtgatt tgttaaagtt ccaatgggtt ctagataacc caccgaacct 10981 tgagattgat 
aagttgcaat gcattcttga gcatttaaac catcagcggc gtaggcgtag 11041 ccttggttat ctgctgctgc tactaaaagc atttcacgtt ttggggcgag taccttggca 11101 gctgcttgcc ccagtccgcc aaagcccagt actcctacgc gcatcgcttt cgggagatga 11161 gtagaacctg tcacttgctc tgcattcatg atgaactttc tacaattaga taacagctta 11221 tcctaccgtt taacagttag cagtcttggt ttggctagtt tcattcctga gaattggatg 11281 attcagagaa taatgataaa ctttttcaac aaatctcact gtttttacac ctcttcctta 11341 gaataatgct tgtttttagg cgtttcccca ctgaaatgaa aagttgaatt tttagcttga 11401 ctaaattgat atggttaagg gaaacttaca aataacagat gtaacaacta ataagagtaa 11461 taagttttgt taagtttctg agatatatat tttgagagat ggaacattag ttaacaaatg 11521 gcaaatttgt agctaaggaa aaatgctgtt tcagagggct gcatttgtag tcagttgcta 11581 cagcagtact gtccaagcct agtattttgg ttggcaaaga gcatggagac attagagttc 11641 gtaatttatc cagacggtag ggtacaagaa aaagtcactg gcatcatagg tgcttcctgc 11701 gctgaagtta cagcagctat agaggcacag ctaggacagg tactaactca cgagccaagc 11761 tcagaatatt ttgctaccaa ggtgcagcaa tccagtgtgg tgaatacgca aaccacgttt 11821 agcgattggt aagttttcat tcattgttta gttcattcat tcacaaccgc catgtcacac 11881 tttagccaga ttaagaccca aatccgtaac gttgattcct tgaaggatgc actgagcgat 11941 ttaggtgtag attggaagca cggtccacgt gaggtacgtg gttatcgcgg tcaaactcat 12001 aatgccgaag ttaccattga gcaggaaaat ggttacgata tcggctttaa atggaatggc 12061 aaagaatacg agctggttgc tgacttacaa tattggcagc aaaatttatc agtagaaggt 12121 ttcttgcggc aagtaacaca gcgatacgca taccatacag tcgtcaaaga aactgctcgt 12181 gttggatttc aagttgctga gcaacaaaaa aatgaagatg gttccattcg cttacttgta 12241 cagcgctgga gtgcgtaatg tctgattttt tgccgtcgcc ggaacaaagc gaagatgagc 12301 gttcaggttt ggaaccagaa ttgggcggtt ttttgcgaga cgccccagaa cgctctggtt 12361 ttgagccgga attagggggt gtgttgcgcc aaaaaggtgt ttatgttgac gagattacct 12421 gtattggctg caaacactgt gctcatgttg cccgtaatac gttttatatt gaaccagatt 12481 acgggcgatc gcgtgtgata cgtcaagatg gcgatccaga tgaggtcatt caagaggcaa 12541 ttgacacctg tccagtggat tgcatccact gggtgaatta taccgaattg aaaaacttag 12601 aacaagagcg caaatatcag gtcatacctt taattggata cccagtggac gcggcagttg 
12661 tggctactga acggcgacgc aaaaagcaaa gactaacccg taaaaaaccg ctcagagaaa 12721 tctaaacaaa gctacccaaa gatagtcaat gctagacaaa cccgtttgta atcttaatcc 12781 aaagtcctaa aactacaggg ataaacggca atcccactat cccatccatg aagtttggga 12841 ttactttacg agctatattc atgcttccat tggtatcagc attgattgtt ctaccagtgg 12901 aagttttgta taaaccacgc ttaacacgtt taccactaaa aacaggcttg atgtcagatt 12961 tactgttgaa agttggtagt gtatcaccat ctaaagcact tgctttactg gtgtaacttt 13021 cctcagtcaa agtcacttcg atacctgcta attgtgcctt ataggttaac aaattaatca 13081 atttagcgtg aggtatcttg gtaaaatctt gattattctt cttaccaata ttgatgtctt 13141 gcttccatct ttgattgttg ccaataacta actgaccaat atcatttaac tgacaccaat 13201 caataaccct gcgggtacta gtatgcaagt agttatcaat tcgtctatca cgcttgctat 13261 tgagttcttt aatctgtctc caagcttcga ttgattgtgc ttttgctact tgtttattga 13321 agaattggtt aatacttttc aacggtctac cattaatcaa aagcggtcta acaccgggat 13381 gattggatgt tatagccatt aaattgttta atccaaggtc tacacctgca acgtattcac 13441 catttgactt agatggttct gcaacggtgt agataatttc aataacatag tggtcaagct 13501 taggaacaat ccgtacttga tctacgctgt tagcattagt tgagaactca atattagttt 13561 gagaaagttt gacaattcct ttagtcaaag caggttttga tattgcatct atcggataaa 13621 tgacaacatt tctgccacgt tctttatgct tgtatccagg cattctaggc atgccttgat 13681 acttttctgg atgttttgac caatctttca atgctgctaa ccaacctttc caagcttctg 13741 agactctacg aacaatctgc ttactaacct tggtaggcag atacctataa gattctagat 13801 tcttggtctg atgataaatg tcaatggagt tgtaatactt cttggtttca aagaaatatt 13861 gacgttgaac atagttagca caattgtaaa gatgtttgga ctgcaatgct aaataatcac 13921 actctttcca aaacttatgc tgccgactga taatgtgttt ctcaactaat tgcatcgttc 13981 gacctccgat gtgctatgca tagtagtata ccaatgccta gatgatttgt gtatgccatt 14041 ccagaaaaat cataaatatc gctgggagtc taatcaggac aagactttag ataaaacacc 14101 tatctgcttt aaagggtggg agggacaaaa ggaaaaactt aaagccgtcc ctgactggca 14161 agaacggctt agagattttg ttgaccgatt gattgttgaa aacctaccca aaaatgacta 14221 gtaatgagta gagatgggta gggattaagt tacagttcct tacccgaaat attggtgaag 14281 taaattgggc atatagaaac catttgacag cttttagatt 
gccttggaga gaatatgcga 14341 gtgttgtttt cggtagtcgg tacacgcgga gatgtgcaac cagtgatcgc gttggcgttg 14401 gaggtgcgcg atcgcgggca cgaggtgcat ctgtgtgtgc cgccgaactt tatcgagtgg 14461 gcgcataggc ttgggttcgg gttcacacca gtggggatcg agatgcgagc gccgcgcggg 14521 actgcggtga gcgacaccac aacgactaag ccaatgcccg acctcatcac cgaccagttc 14581 gatgcgatcg gtgcgaccgc aaacggctgc gacatcatcg tgggagcgaa cgcgcaccag 14641 tatgccgcgc ggtccattgc cgagcttcac ggcatccctt acatcaacgc gatctacgca 14701 cctacggcgc tgccgaccga cgacaccatt cgtatttgga atgagcgatc aggcgatcgc 14761 gtcaacgtca atcggacgca gcttgggtta tcgccgattg acgatgtgct cggtcatatt 14821 gttactgacc agccgtggct tgcgaccgat ccaacgctcg ccccttcgcc ccttgtaccg 14881 tccatgtcca ttttgcagac cggcgcgtgg tttcttgagg attcgactcc gctcccgtct 14941 gatgtcgaag tgttcctcga agctggcgat ccgcccgtct acttcggttt cggcagcatg 15001 ccgatcgccg gggatacgag ccttaccctc atcgaggctg cgcgtgcggt cgggcgacga 15061 gcgatcgtgt cgcaaggatg ggccgacctc aagctgatcg accaagcccc ggattgcatc 15121 gcaatcggtg atgtcaacca ccaagcgttg ttccccagag tcgcgatggt cgtgcatcat 15181 ggtggtgccg gaacgacgca caccgccgca cgtgctggcg caccacaggt actcgttccg 15241 atgtttagcg atcagccgtt ttgggcgaac cgcgtgcggg aactcggcat cggcacgttg 15301 ataccgattg cagatctgac cgccaaccaa ctaatatcgg cactgcacga cgcgagtgac 15361 ccagccatcg cagaccgggc tgatgcgatc gccgaacgga tcatcctcga tggtgttaag 15421 gtcgcagcgc agcacctgat tgaccgggtt aagtagaggc tagggcgtgt tttcaaaagt 15481 taccacctga tgttcaagga ctcgcggatc agtattatga gtttctcaag caagatcctc 15541 gttatctttc attgcattta aaaaaagtgg gtcaattttg gtcagtgcga gttggtttgc 15601 actatcgtgc gttggctgta gagcaagatg gggattttgc ttggttttgg atcggatcgc 15661 acgcagaata cgacaaatta ttgggttaac ggtttgacac caggtaaata tagcaaccta 15721 ttttaatacg gcgctgattt gttcagctag tttcaaagga cgaaaaggtt tagcaaatac 15781 agccgcgaca tccaactcac taaagctatt tttatctgat gattgcgcct ttgctgttac 15841 caaaataaca gggatatgct ttgttgttgg attcgctttg agttgttgca atgtagcgcg 15901 accatccatg tcgggcatca tcaaatcgag taaaataaca tcaggtcggt tgatttgggc 15961 gatcgctaaa gcttcttcac 
cagaactagc gcacagtacg ttccagtcag atgccatttc 16021 taaacccatt tgcgcgatcg ctcgcacatc ttcttcatca tcaacaatca atacacactt 16081 actcacaacc gtactccgaa gaaggtaaaa tgacatgaaa ggtgctgcct tgaccaagga 16141 cgctttcaac ccaaatcttg ccatgatgtt gctcaacaat ttttcggcaa attgcaagcc 16201 ctaaacctgt acctcctttt ttacgagaat cagacacatc tacctgctga aaacgttcaa 16261 agatggtttg tagtttattt gctgggattc ctcttccgca atcttgcacc tcaatttgca 16321 cttcatgcga tcgacgcaca cgcaaccaaa cagtaccacc gggttcggaa aacttaatcg 16381 cgttactcaa aaggttagta agcgtttgca atatccgatc tggatctgct atcaattccg 16441 cagcgacaac ttctgtaagt aaggtaattt gagcgcgttc tgccattgct tgcatcgctg 16501 ctactgcttg gtcgattaag tctgcaacgt tacacttgac gggttgcata ggtattttac 16561 ctgatttcat ccgctctaag tcaaggatgt cgttgattag gcgaatcaag cgttcggtgt 16621 tggtagtagc gatatttagg acttgttgac ctttggcgga aagtgttcct aattgtcccg 16681 tgcctaaaag gtctaacgaa cccatcaaag aagtcagcgg agtacgcagt tcgtgactaa 16741 ctaaagaaat gaattcgttt tctaaacgct tgcgctcggt gatatcgtta ataatattta 16801 agatacactg gacaccaaac aaatctatca attcggcaga tagcaaaaca actctcacct 16861 caccggactt atactaagtt gcgttcagag gtagtatcca ccagatgagg gaaaaaagtt 16921 aaaga // LOCUS NODE_1968_length_16872_cov_5.99589716872 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16872) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16872) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16872 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(131..718) /locus_tag="DP116_17275" CDS complement(131..718) /locus_tag="DP116_17275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006514324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="shikimate kinase" /protein_id="PRJNA477356:DP116_17275" /translation="MKSDIILIGPIGTGKTTIGALLAHRLGLPQYSMDERRWDYYKAI GYDEELAKHKRETEGFWGVYQYWKPFEAYAVERLLSEHNQCVIDFGGGHSVYEDAGLF QRVQQALAPYPNVVLLLPSPNEYESVQILNQRNKYVPDDKPNINEHFVRHTSNYELAK FTVYTKGKTPEETCSEILNLIEGNSLYGWNHSTSC" gene 860..3463 /gene="leuS" /locus_tag="DP116_17280" CDS 860..3463 /gene="leuS" /locus_tag="DP116_17280" /EC_number="6.1.1.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321592.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="leucine--tRNA ligase" /protein_id="PRJNA477356:DP116_17280" /translation="MESRYNPAAIEEKWQKTWTEQGLDKTVTDSSKPKYYALSMFPYP SGSLHMGHVRNYTITDVIARLKRMQGYRVLHPMGWDAFGLPAENAAIDRGIPPAKWTY ENMAQMQQQLKRLGLSIDWDCELATCSPDYYKWTQWIFLQFLTAGLAYQKEAAVNWDP IDQTVVANEQVDSEGRSWRSGAKVERKLLRQWFLKITDYAEELLNDLNKLPGWPERVK LMQANWIGKSTGAYLEFPIVGMEEKIGVYTTRPDTVFGVSYVVLAPEHPLTKRVTTQE REAEVAAFIQEVSNQSELERTAEDKPKRGIPTGGKAINPFTGEEIPIWIADYVLYEYG TGAVMGVPAHDARDFKFAKEQNLPIKVVIVPPDNVETLDAGNVETLDATSLHQAYTEP GIVINSSQFDGMASVDAKQAIVAYAEQQGFGKARVQYRLRDWLISRQRYWGAPIPVIH CPNCGIVPVPEEDLPVQLPENVEFSGRGPSPLAKIEDWVNVPCPTCGTPAKRETDTMD TFIDSSWYFLRFTDAKNDQQVFDSAKTNDWMPVDQYVGGIEHAILHLLYSRFFTKVLR DRGLLNFDEPFQRLLTQGMVQGLTYLNPNKSGKDKWIPSYLVSNPDDPRDPQTGEPLQ RLYATMSKSKGNGVAPEEVISKYGVDTARMFILFKAPPEKDLEWDEADVEGQFRFLNR VWRLVTDFTAQPREVHNQQTQLSKAEKDLRRAIHIAIKEATEDVEGEYQFNTAVSELM KLSNALADASCKDSPVYAEGIETLIVLLAPFAPHIAEELWQQLGNTESVHKQAWLKYD ESALVADEITLVIQVNGKKRADLQVPAQANKAELEKYARESEIVQRFIEGKEIRKVIV VPGKLVNFVVG" gene 3841..5625 /locus_tag="DP116_17285" CDS 3841..5625 /locus_tag="DP116_17285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652045.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_17285" /translation="MKKFSVFTAIRHRRWLILLALVTAFALITFSSVNIKTKTVPVSG IVTSFSTDPKTFNPALIQEPPYTSDYTHEGLVSENGRGEIEPALAESWKMSEDKKRII FTLRKGLKWSDGKPLTADDVVFTYNDIFLNPAISNDAKDLWKIGKNRVFPTVQKIDNL QIEFIIPEPFVPFLRIAKLAILPAHVLREAVNIKDKQGQSKFISTWGTNTPPKEIIAN GPYTIEAYTPGQRVIFRKNPYYWRKDAQGNVQPYIERVMWQFVDNTDTSLIQFRSRGL DYIKVFPQYFSLLKREEKRGQFTIYNGGSSTEVTYISFNLNKGSRNGKPLVNPIKSRW FNTVEFRQAVAYGVDRQRLLNNIYRGLGELANSYIPKQSPYYLSPKEGLKDYEYNPIK AKELLVKAGFQYNSQGQLLDSQGNLVQFTLMMSSSNKIIETIAVQIKQDLSKIGISVD LNTVSYSLFIDKILNSFDWECRLGTMNFSMEPNDVASLFLPEGSFHVYNQEPQKGQTP ITGREVADWERKIGDLYLQAVKEFDEAKRKAIYAEVQRISQEYLPQIYLTQPLTMGAV RNHIQGIKYSALTTVFWNIYELKLSKQK" gene complement(5926..6150) /locus_tag="DP116_17290" /pseudo CDS complement(5926..6150) /locus_tag="DP116_17290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311856.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DUF4231 domain-containing protein" gene complement(6326..7060) /locus_tag="DP116_17295" CDS complement(6326..7060) /locus_tag="DP116_17295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872777.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutathione peroxidase" /protein_id="PRJNA477356:DP116_17295" /translation="MLSNREGQKVPNVTFRARKDNNWVSMTTVDLFAGKTVVVFSLPG AFTPTCSSTHVPGYNHLAKVFKENGVNDIVCISVNDTFVMNEWAKDQQAENITFIPDG NGEFTEGMGMLVDKSDLGFGKRSWRYSMLVKDGVVEKMFVEPEEPGDPFKVSDAETML RYINPQAAKPELVSLFTRVGCPYCARAKSMLLERGIDYEEIVLGKDVTPRTLQAVTGA STVPQVFVNGKLIGGSEALEAYLTAR" gene complement(7182..7874) /locus_tag="DP116_17300" CDS complement(7182..7874) /locus_tag="DP116_17300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015162923.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aquaporin" /protein_id="PRJNA477356:DP116_17300" /translation="MNPKALIAEFIGTFALIFIGISSLATNHITKIGTLSPVDLVAIA LAHGFTIAVMVSATAAISGGHLNPAVTFAALLTRKIDAKNAVGYIISQCLGGIFAASM VKLAIPLQALQAVGMGTPSLGKNITPFMGLVMEFIMTFFLVFVIFGTAIDKRAPKMGG LFIGLTVALDILAGGAITGAAMNPARYLGPALIAGRLQDFWLYWVGPLAGGAVAALVY HYQLEEKSSHST" gene complement(8091..9593) /locus_tag="DP116_17305" CDS complement(8091..9593) /locus_tag="DP116_17305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl hydrolase family 57" /protein_id="PRJNA477356:DP116_17305" /translation="MTTTKSSANLPILEQFRTGLPNICGWEAEIHSVVQQNHPVFLNT TNLRLENITAGFACALHMHQPSIPAGDHGEIISNLQYMFEHPHQGDNHNAEPFAWCYS RMGDFIPQLISEGCNPRIMLDYSGNLLWGLQQMGREDIINNLKRLTCDSQYQPYVEWL GTMWSHAVIPSTPIPDIKLHIQAWQHYFAAVFGYDALQRVKGFSPPEMHLPNHPDTLF EYIKALKECGYRWLMVQEHSVERLDGSGLSHDEKYIPNRLIARNSKGETISITALIKT QGSDTKLVAQMQPYFEAKGRTRQQIGHVTIPSLVTQIADGENGGVMMNEYPRDLLRVY HEIRDSGNNHSGIVALNGTEYLELIEAAGVNPDEYPTCQAVQQHKIWQRVYPEHSTPE AVEKAIAELKETDHQFHMDGASWTNSLSWVKGYENVLDPMKKLSALFHEKYDSLVEQD PSVTQSSDYQKALLYNLLLQTSCFRYWGQGTWTDYARELYRRGEALLAVS" gene complement(9610..9840) /locus_tag="DP116_17310" CDS complement(9610..9840) /locus_tag="DP116_17310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874482.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17310" /translation="MARQVLQVGGAAQRTGSSWRFVKLGILLAGVIVCNTCFPKGCKM REPQIKIDPGTLVLIVSVLLLLPLLLAGFVFQ" gene complement(9918..10226) /locus_tag="DP116_17315" CDS complement(9918..10226) /locus_tag="DP116_17315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206322.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17315" /translation="MEEIDALLTKEERLNRLAEINVLVQVKNLYQTSIMRKALHEEKA PVVHGWVLNIRTGLIKDLQVSTKQWELRPQALVVESILPLPDFKFMDEDLGCFSQEAY " gene 10482..10997 /locus_tag="DP116_17320" CDS 10482..10997 /locus_tag="DP116_17320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459627.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17320" /translation="MTQDYSAKWWVAFICIIFIEAVLLILLISRQATPLLYVAIIAIA FLLFLIPQLDEVIALTFDRGKLDSKINSIGKKISTTKARTDKLVLLSMSKSMFETLKK LAAGSFGVYEMSDALERELYHLKDIGYVEVGRIRDIPYKDNNLCDYVKLTDFGKQYIE LRRSIEEDNKG" gene 11243..11701 /locus_tag="DP116_17325" CDS 11243..11701 /locus_tag="DP116_17325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318601.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17325" /translation="MFKSKITYVLFTLTLIIVLSWTTISRLPNVLTCQESNVIRGTGE RFYTHPHKIIVEPWRGEHHVYAIFMIPGGHLNDKLFTVTIKDTGTFCGSLAFAGTTVA DGVYAKPGYYLMKALFHTRAAVWLISQGKKDELKQPLNWKVGYAKVQEPG" gene 11933..12214 /locus_tag="DP116_17330" CDS 11933..12214 /locus_tag="DP116_17330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318600.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase" /protein_id="PRJNA477356:DP116_17330" /translation="MLRSFRKYHRQIAIILCLPLFLTVLTGMAYTILNEWFHQPELAV FLIKIHSLEVLNLQGIYPLLNGLGLIGLLITGLSMTGLFGQRTNRNTLG" gene complement(12322..13638) /locus_tag="DP116_17335" /pseudo CDS complement(12322..13638) /locus_tag="DP116_17335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456342.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="amidohydrolase" gene 13913..14692 /locus_tag="DP116_17340" CDS 13913..14692 /locus_tag="DP116_17340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408988.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="isochorismatase" /protein_id="PRJNA477356:DP116_17340" /translation="MNIPVRNLGIAPNAWAVNHTFADITRPPQIPQPVILSTETKTLR LDLAKTAILVIDMQNDFCHPDGWLAHIGVDVTPAQKPVQPLQILLPELRNKNVPVIWV NWGNRPDLLNISANVLHVYNPTGEGVGLGHPLPTNGAKVLMAGSWAAAVVDELQQLPQ DIRVDKYRMSGFWDTPLDSILRNLGRTTLLFAGVNADQCVMATLQDANFLGYDCLLVK DCTATTSPEYCWLATIYNVNQCFGFVSDSQAILSALQSAEC" assembly_gap 15051..15060 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 15256..15717 /locus_tag="DP116_17345" CDS 15256..15717 /locus_tag="DP116_17345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195970.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cupin domain-containing protein" /protein_id="PRJNA477356:DP116_17345" /translation="MYATRCVIPVVKSPKDYQTYRITPQDSNRLAIIFDTASANTSLT CCVEIFDVGGKTPPNRHQWAVEMFFVLKGEGIASCDGKRVRIKAGDSLLVPPTGTHLI ENIGYGRLYTLTIMVPNEDFAELIRSGTPVELDEEDMAVLGRVDSLMPCKV" gene complement(15902..16648) /locus_tag="DP116_17350" /pseudo CDS complement(15902..16648) /locus_tag="DP116_17350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017803983.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="transposase" BASE COUNT 4982 a 3467 c 3607 g 4806 t 10 others ORIGIN 1 tccttcttct tcacattaca tcttttttat tcctcaaaac aaacttgata acctactagt 61 cgctatgtaa agctctaccg cagcaaagca gtacaccagt tagtgcaatt caaagaaaac 121 aggcaaaaac ttagcaactg gttgagtggt tccaaccata caaggagtta ccctctatca 181 agttcagaat ctcgctgcaa gtttcttctg gtgtcttgcc tttggtgtat actgtgaact 241 tcgcaagctc atagtttgag gtgtgtctga cgaagtgttc gttgatgttt ggtttatcat 301 ctggcacata cttattgcgc tgattgagga tctgcactga ctcatattcg ttgggtgatg 361 gaagtaaaag aacaacatta ggatacggtg ctagagcttg ctgaactcgt tggaatagac 421 cagcgtcctc atacacggaa tgaccacccc caaaatcaat gacgcattgg ttgtgttccg 481 atagcagcct ttcaacagca taggcttcaa atggcttcca atactgatag actccccaga 541 acccctcagt ttctcgctta tgctttgcta attcctcgtc atagccgatc gctttgtagt 601 agtcccaccg tcgctcatcc atcgagtatt gtggaagacc gagcctgtga gctagaagcg 661 ccccaatcgt agttttgcca gtaccaatcg gaccaatgag aatgatgtct gatttcatgt 721 tcttctattg tcagccctag cactgctccc agcatcaccg ctttaatatg cgcgaatcgc 781 agtgccgtca cggtaactct agagtatgcc acaatagcga aacagaagcg actattaaat 841 tcagttagga gttctcgttg tggagtcccg atataaccca gcagcaattg aggaaaaatg 901 gcaaaaaaca tggactgaac aaggcttaga taaaactgtt acagatagca gtaaaccaaa 961 atactatgcc ttgtccatgt tcccttatcc ttcgggcagc ctgcacatgg gtcacgtccg 1021 taattatacg attactgatg tgattgcccg cctcaaacgg atgcaaggtt atcgggtact 1081 acatcctatg ggttgggacg cttttggctt accagcagaa aacgccgcga ttgacagagg 1141 aataccgccg gcgaagtgga cgtatgaaaa tatggctcag atgcagcaac aattaaagcg 1201 tcttggcttg tccattgatt gggattgcga acttgctact tgttcgccag actattacaa 1261 gtggacgcaa tggattttct tgcaattttt aacagcgggg ttggcttacc aaaaagaagc 1321 tgctgtaaac tgggatccaa ttgaccaaac tgtcgtggca aatgagcaag ttgatagcga 1381 aggacgttcc tggcgcagtg gtgctaaagt tgaacgtaaa ctcttgcggc agtggttttt 1441 gaagattact gactacgctg aagaattgtt gaatgatctc aacaaattac caggttggcc 1501 tgaacgcgtc aagttgatgc aggcaaactg gattggtaag tcaacagggg cgtacttgga 1561 attccccatt gtcgggatgg aagagaaaat tggtgtgtac 
accactcgcc ctgatacagt 1621 ttttggcgtt agctacgttg tgttagcgcc agaacatcct ttgacaaagc gtgtgacgac 1681 acaagaacgg gaagcagaag tagcagcttt tattcaagag gtttccaatc aaagcgagtt 1741 ggaacgcacc gctgaagata aaccaaagcg tggaattccg actgggggta aagcaattaa 1801 cccgtttact ggggaagaaa ttcctatctg gattgctgat tacgtgttgt acgagtacgg 1861 tacgggcgca gtgatgggcg ttcccgcaca cgacgcacgg gattttaagt ttgccaagga 1921 acagaatcta cccatcaaag tggtgattgt gccacccgat aatgtagaga ctttggatgc 1981 cggtaatgta gagactttgg atgcaacctc tctacatcag gcatatacag aaccaggaat 2041 tgtgattaat tccagtcaat ttgatggaat ggcttctgtt gatgccaagc aagcgatcgt 2101 cgcatacgct gaacaacaag gttttggtaa agcgcgggta cagtatcgct tacgcgattg 2161 gttgatttcg cggcagaggt actggggcgc acccattcca gtgattcact gccctaactg 2221 tggtatagta ccagtccctg aagaagattt gccggtgcag ttgccagaaa atgttgaatt 2281 ctctggacgc ggaccctcac ctttggctaa aatagaagat tgggtgaatg ttccttgccc 2341 aacttgcggc actccggcaa agcgggaaac cgacacgatg gacaccttta ttgattcctc 2401 gtggtatttc ttgcgcttta ccgatgctaa gaatgatcaa caggtttttg attccgcgaa 2461 gacaaatgac tggatgccag tcgatcaata tgtaggtggt attgaacacg cgattttgca 2521 tttgttgtat tcgcggttct ttactaaagt cttgcgagac agagggttgt tgaattttga 2581 tgaacctttc caacgcctgt tgactcaagg catggtgcag ggtttaactt atttgaatcc 2641 taacaagtct ggaaaagata aatggattcc ttcctatctt gtcagcaatc cagatgatcc 2701 tcgcgatccg caaacaggtg aaccattgca gcgcctttac gctaccatgt ctaagtctaa 2761 gggcaacggt gtcgcaccag aagaggtcat cagcaaatat ggtgtagata ctgcccggat 2821 gttcattttg ttcaaagcgc ccccagaaaa agacttggaa tgggatgaag ctgatgtgga 2881 aggacaattc cgcttcttga acagggtttg gcgattggta actgatttta ccgcccagcc 2941 aagagaggtt cacaatcagc aaactcaatt gagtaaagca gaaaaagact tacggcgggc 3001 aattcacatt gctatcaaag aagctacgga agatgtagaa ggtgaatatc aattcaacac 3061 ggctgtctca gaattgatga agttgagtaa cgctttggct gacgctagct gcaaagattc 3121 accagtttat gcagaaggta ttgagacatt gatagtgttg ctggcacctt ttgcacctca 3181 cattgctgaa gaattatggc agcaattggg taatactgaa tctgtccata aacaagcttg 3241 gttgaagtat gatgaatctg ctttggttgc tgatgaaatc actttggtga 
ttcaagtcaa 3301 cggcaaaaag cgtgcggatc ttcaagttcc ggctcaagca aataaagcag agttggaaaa 3361 gtatgctcgt gagtcagaaa ttgttcaacg ttttattgag ggtaaggaaa ttagaaaggt 3421 gattgtggtg ccaggaaagt tagtaaattt tgtcgttggt taattttctc aacgacgaca 3481 tgaacaaaag gaagaaggta atatcttgcc ttcttctttt tcatagacct ctgaagaaac 3541 actcttgagg gagggagtga gggaaggagg gagtgaggga agtgtttctt ccaaaatctc 3601 tactcaatga caatacccat agtccgaaac cctcattctg cggacattct aaaaaattgg 3661 cgaaggtatt gcaatgagag tcggtttttt acatttcttt acagaatgat acagataatc 3721 cgacggcgag tatcatcgat agtcaatcgg tgaaaacgct gaaaaaaggg ggaagtatac 3781 ggttacgatc gtggaaaaag ggttaaaaga tgcagtcgcc ccatcaggtg aactagagtg 3841 atgaaaaaat ttagtgtgtt tactgctatt cgtcatcgcc gatggctaat tttactagct 3901 ttagtaacag cgtttgcact aattacattt agctcagtta atataaaaac caaaacagta 3961 ccagtctctg ggatagttac cagcttttct acagatccta agacctttaa tcccgctctg 4021 atccaggaac ctccctatac ttcagactac actcacgagg ggctagttag cgaaaatggt 4081 cgaggagaaa ttgaacctgc ccttgccgaa tcttggaaaa tgtctgaaga taaaaaacgg 4141 attattttta ctcttagaaa agggctgaaa tggtcagatg gcaagccttt aactgcagat 4201 gatgtggttt ttacttataa cgacattttc ttgaatccag ccatttctaa tgatgccaag 4261 gatctatgga aaattggtaa aaatcgcgtt ttccccactg tgcaaaaaat cgataatctg 4321 caaattgaat ttattatacc tgaaccattt gtcccattcc ttcggatagc aaaactagca 4381 atattaccag cccatgtatt gcgagaagca gtcaacataa aagataaaca aggtcagtct 4441 aaatttatat caacctgggg tactaatact ccacccaaag aaattattgc caatggtccc 4501 tatacgatag aagcttatac tccaggtcag cgggtcattt tccgaaagaa tccttattac 4561 tggcgaaaag atgctcaagg caatgtgcag ccttatatcg agcgtgtgat gtggcaattt 4621 gttgacaata ctgatacctc tttaatacag tttcgctcta gaggattgga ttacatcaaa 4681 gtctttcctc aatatttttc cttgttgaag cgcgaagaaa agcgagggca attcaccatc 4741 tacaacggtg gttcttctac ggaagtcacg tatattagtt ttaatctaaa caaaggaagc 4801 agaaatggca aacctttggt caatccaatc aaatctcgct ggtttaatac agtggaattt 4861 agacaagctg ttgcctatgg agttgaccgc caaagattgc tcaataacat atatagagga 4921 ttgggtgaat tggcaaattc atatattcct aaacaaagcc cttattatct ttctcctaaa 
4981 gaaggtttaa aagattatga atataatcct ataaaagcga aagaattact tgtaaaagcg 5041 ggctttcaat acaatagtca aggacagtta ttagattctc agggaaattt ggtgcagttt 5101 accctaatga tgagttctag taataaaatc atagagacaa tcgcagtgca gattaaacaa 5161 gatttaagta aaattggcat atctgtggat ttgaataccg taagttatag cctttttata 5221 gacaaaattc ttaactcttt tgattgggaa tgccgtttgg ggactatgaa ttttagtatg 5281 gaaccaaatg acgttgctag tttattcttg cctgaaggta gtttccatgt atataatcag 5341 gaacctcaga aaggtcaaac tccgataaca ggacgagagg tggcagactg ggaacggaaa 5401 attggcgacc tttatcttca ggcagtaaaa gaatttgatg aagcaaagcg caaagctatc 5461 tatgcagaag ttcaacgcat tagtcaggag tatctgccac agatttattt aactcagccc 5521 ttgacaatgg gagcagtgcg aaatcatatc caaggtatta agtactccgc cttaacaaca 5581 gtattttgga acatttacga actgaaactg agtaagcaga aataaagttg tgttgcaaaa 5641 acgaaagtga ggcaattttg ttgaggtgct tggggttcat ggaggggtta ctctggcgat 5701 cgcaagtcct gaataaaagc ttccgaaaga tggatagaga caaatacgag cgcaagatgg 5761 gaacacaact ctccaaatca ggcgattgca cagattagga gggtcaaaac catgaaaaat 5821 catggttttg agtcttcatt ggcatagttt gacgatagag ccacagtgtt ttactaacac 5881 cattgaaaag attatgagat catatcagag cgactcgata tgcaatcagc ttcgttcact 5941 ctgctcctgc ttttgctcgt cagatggagt tgattctgtc ttttctggtt gcttctcagt 6001 tgtttgttgt tcttgtagtt ggttgagatg aatgttattc acttgaataa taaagtattc 6061 taatgcctgg cttgcctttt gttgctgttg ggtttcgtca gacacatcgt agcaacgata 6121 aggcggcgtt actcccaata taaatttctc ttaacgcctt gttaataaaa tcgagtggcg 6181 gaagcgatgc tgcaaaccgc gacgcacgcg aacgcccaac gtgacgtaga cgaaaaattc 6241 cacagcgttt tcattgccaa caaaaacatt agtgatgaaa tgtttgttag gcaggctata 6301 cgcctgccaa tcaaaaccta caacgctacc tagcagttag gtaagcttct aatgcttcgg 6361 aaccaccaat gagtttgcca ttaacaaaca cttggggaac agttgatgca cctgtgacag 6421 cttgtaatgt gcgaggtgtt acatctttac ccaaaacaat ttcttcgtaa tcaattccac 6481 gttctaatag catagatttg gcacgagcgc aatagggaca accaactctt gtaaacagag 6541 aaaccagttc aggtttcgcg gcttgagggt tgatatatcg gagcattgtc tctgcatcag 6601 acaccttaaa ggggtcgcct ggttcttcag gctcaacaaa catcttttca accacgccgt 6661 
ctttcaccaa catggaatag cgccaagacc gtttgccaaa ccctaggtct gatttatcca 6721 caagcattcc catgccttct gtgaattcac cattgccatc aggtataaaa gtgatatttt 6781 ctgcttgttg gtctttcgcc cattcattca tcacaaaagt atcattgaca gagatacaga 6841 caatatcatt tacgccattt tctttgaaaa cttttgctaa atggttgtac ccaggaacat 6901 gggttgatga gcaagtggga gtaaatgctc ctggtaggga aaagactacc acagttttac 6961 ctgcaaatag gtcaaccgtg gtcatactta cccaattatt gtccttgcgg gcgcgaaagg 7021 taacattggg aactttttgc ccttctcgat tagacagcat gagtttgcct ccttagtgaa 7081 tctactcata aatcatattc ataatgaata cgacttacaa gtagaaaagt ataaatagaa 7141 tatgaaattc cctaaattaa tgatctgatg taggttggta atcaggtact gtggctgctt 7201 ttttcttcca gttgataatg atacaccaaa gccgcaactg cgccaccagc caaaggacca 7261 acccaataca gccaaaaatc ctggagtcta cctgctatta aagcaggacc aagataccgc 7321 gctggattca ttgccgcacc agtaattgca ccaccagcca ggatatctaa agccactgtt 7381 aaaccaataa acaagcctcc cattttgggt gcgcgtttat caatggctgt accaaagatc 7441 acgaatacga gaaagaaagt catgataaat tccatgacta aacccatgaa aggagtaata 7501 tttttaccga gagatggcgt acccattccc acagcttgga gtgcttgcag gggaatagcg 7561 agcttgacca tacttgcggc gaagattccg cccaaacatt gggatataat gtatcctaca 7621 gcatttttgg catcaatctt gcgagttagt aaagctgcaa aagtcacggc tgggttaagg 7681 tgaccaccac tgatagcagc tgttgcactg accatcacag caattgtaaa accgtgggca 7741 agggcaattg ctactaaatc gactggtgag agagttccaa tttttgtaat gtgatttgta 7801 gcgagagaac taatgccaat gaaaattaag gcgaaagtcc ctataaattc tgcaattaaa 7861 gcctttggat tcatagccct tggttatctc cacaatttcc aaatcaacta gactttaaac 7921 taaaaaccga taagccggag gcttgacgct acgcgtatcg caaaaagtta agcaggctat 7981 gagttagttg tatcaaaata aaaataaagg tatctatgtg tgacacagat acctcgtaac 8041 aggaacttta gtttttctaa aagtagtttt tgtgagctat tttaattagc ctagctaact 8101 gctaagagcg cttcacctcg tcgataaagt tcgcgagcat agtcagtcca agttccttgt 8161 ccccagtatc ggaaacaact tgtttgtaat aataagttat acagtaacgc tttttgatag 8221 tcagaactct gtgtcactga agggtcttgc tcaactaatg aatcatattt ttcatgaaat 8281 aaagcactga gtttcttcat tgggtctaag acattttcat agcctttaac ccaacttaaa 8341 gaatttgtcc 
aagaagcacc atccatgtga aattgatggt ctgtttcttt taattctgct 8401 attgcttttt ctactgcttc tggtgtggaa tgttctggat atactcgctg ccaaattttg 8461 tgttgctgta cggcttgaca ggttggatat tcatctggat tcacaccagc agcttcaatt 8521 aattccaaat attcagtacc gttgagtgca acaatccctg aatgattgtt tcctgaatcg 8581 cgaatttcat gatatactct taacaaatcg cggggatatt cgttcatcat gacgccgcca 8641 ttttccccat cagcaatttg agtgactaaa gaaggaatcg taacatgacc aatttgctgc 8701 cgagttcttc cttttgcctc aaaataaggc tgcatctgag ccactaattt tgtatcggaa 8761 ccttgagttt tgattaatgc cgtgatgcta atggtttcac ccttagaatt gcgggcaatc 8821 aatcggtttg gaatgtattt ctcatcatga cttaagcccg aaccatccag acgttctacc 8881 gaatgttctt gcaccatcag ccaacgatat ccacattctt taagagcttt gatatattca 8941 aacaaagtat ctggatggtt tggtaagtgc atttctgggg gagaaaagcc cttgacgcgt 9001 tgtagagcat catacccaaa aacggctgca aagtaatgtt gccaagcttg aatatggagt 9061 ttaatatcag gaattggtgt ggaaggaata accgcgtgac tccacattgt tcccaaccat 9121 tcgacgtaag gctgatattg ggagtcgcaa gtcaaacgct tgaggttatt gataatatcc 9181 tctcgtccca tttgttgtaa cccccacaac aaattacccg aataatcgag cataattcga 9241 ggattgcaac cttcggatat gagttgagga ataaaatcgc ccatgcggct atagcaccaa 9301 gcaaaaggtt ctgcgttgtg gttatctcct tgatggggat gttcaaacat atactggaga 9361 ttgctgatta tttccccgtg atcacccgca gggatgctgg gttgatgcat atgaagagca 9421 caagcaaacc cagccgtgat attttctaag cggagattgg tggtgttgag aaatacaggg 9481 tgattttgtt gaaccacaga atgaatttct gcctcccagc cgcaaatatt gggtaatcct 9541 gtccggaatt gttccaaaat cggtaggttc gcagatgatt ttgttgtagt catagtcatc 9601 ccaaatagct cattgaaaaa cgaaaccagc cagcaaaaga ggtaagagta gaaggacaga 9661 gacaattaag acgagagttc ccggatcaat cttaatctgt ggttcgcgca ttttacaccc 9721 ttttggaaaa caggtattgc aaacgattac tcccgccaga agaataccta atttaacgaa 9781 ccgccaagac gagccagtgc gttgggcggc tccgccgact tgaagcacct ggcgtgccaa 9841 ggacgccaag aaaatcaaaa agaagatagg taatcttgca cgcagtcagg ggagtagtta 9901 ctcaagaaca ccatacctca atatgcctct tgagagaaac aacccaaatc ctcatccata 9961 aatttaaaat caggcagagg gagtatggat tcaacaacta gggcttgtgg gcgtaattcc 10021 cattgtttcg ttgagacttg 
caaatctttg attaatcctg ttcggatatt cagtacccaa 10081 ccatgaacta ctggtgcttt ttcctcatga agtgctttac gcataataga agtttggtac 10141 agatttttca cctgtaccag aacattgatt tctgcaagac gatttagacg ttcttctttt 10201 gtgagcaaag catcaatttc ctctatagga tcataaaaac tgtttatatt gaacaattac 10261 aagataccaa aaatacagaa ttcaaaattt atacaacaag gcactttttt atttttttgt 10321 catctagcgc tcacatttta gcagagaaac aaaatattat atgaaatgag caagcaaatt 10381 cagatctcat gtgaatagga gagaaacatt tcagaaagaa caatgaaaat taaaaaaatt 10441 aaataaactc atctaaattg agagatttca ggagaccaat tgtgactcaa gattattcag 10501 caaaatggtg ggtggcgttc atctgtatta tatttatcga agcagttttg ctgattttgt 10561 taatttctag acaagcaact ccacttcttt atgtagcaat cattgctatt gcttttctac 10621 tgtttcttat tcctcagctt gatgaggtta ttgcattgac ttttgataga ggaaagctag 10681 atagtaaaat caattcaatt gggaaaaaaa tctctacgac taaggcaaga actgacaagc 10741 tagttttgtt atctatgtcc aaatctatgt ttgaaactct caagaaacta gctgcgggta 10801 gctttggtgt ttatgaaatg agtgatgctc tagaaagaga actttatcac ctgaaagata 10861 taggatatgt agaagttggt agaattcgtg acattcctta taaagataat aatctttgcg 10921 attacgtgaa actcactgat tttggtaagc agtatatcga gcttcgtaga agtatagaag 10981 aggataacaa aggttagttt taacttgatt tcatttagcc agggaactct acactttcat 11041 gctagaatca cctctcttga atgaagttca ggaagttaat ttcccatgct agcgatcgca 11101 tacgaataat cgcatgcaag gtaaagcgtc tcggtttatc ctggatatcc gtcaattgca 11161 atgaattata tatcaagcac aggtgccatc gccacacaat acagaatctg ttaccttata 11221 acacgtagtt ctaaaagtac gaatgttcaa aagtaaaatt acttacgttt tatttaccct 11281 gactttaatc attgtcctta gctggactac tatttcaagg ctaccaaacg ttctcacttg 11341 ccaggaatca aatgttattc gaggaacagg tgaaagattc tacacacatc cccacaaaat 11401 catagttgag ccttggcggg gtgaacatca cgtttacgca attttcatga ttcctggtgg 11461 acatctcaat gataagctgt tcacagtcac tataaaagat actggcactt tctgcggatc 11521 acttgctttt gctggcacca ctgttgctga tggcgtttat gctaagccag gatattacct 11581 aatgaaggca ttattccata ctcgggctgc tgtatggttg atttcccaag gtaagaaaga 11641 tgagttaaag cagcctctta attggaaagt gggttacgca aaggtacaag aacccggatg 11701 
aaggctaaaa aaataactgt acttaattct ttcagtcttc ttaatttttg taaaatatat 11761 tcagtctttt caagtttagc tctaagttac agttattaag ctaagtacac atcatcataa 11821 aaggggttca aaattggctt gagggagaat gtcaattctc cgttttggca taattttgat 11881 ttcttacccc tattaatgat gagttgtatt caggattatt tcagtaagtg ttatgctacg 11941 tagtttccga aaataccatc gccagattgc cattattttg tgcctaccat tatttctgac 12001 tgtgctcact ggtatggcgt acactattct caatgaatgg ttccatcagc ctgagttagc 12061 tgtatttctc atcaaaattc atagcctaga agtcttgaat ctacagggaa tttatcctct 12121 tttaaatggt ttaggattaa ttggtttatt gattactggt ttaagtatga caggtttgtt 12181 tggtcaacgc actaatagaa acacgctggg ataatttcac attaagaggg aacagagaat 12241 agggaactct taacaggaat tcaaatttcc ctatttcctc caaggagcga aaaccagtag 12301 agcgaaacgc cgcaccgatc actacaataa atccatgact gtacgataat gagcttcaag 12361 ttgagcaaca gtctcagact tgcgctttgt atcccactga ctgtgatcaa ataattgttg 12421 acgcaattca tccacgttaa tagtagtcac tttaccatca gcgactacct gcttaccatt 12481 tacccagaca ctttctacag cattggtggg acgaccaaga attaataagc caatcgggtc 12541 tgtacgcggt agtagtgata aactggtgag gtcatacatc accaaatcgg cttttttgcc 12601 gacggttagg gaaccgagtt gatcagccat gttcagtcct tttgcaccac ccaaagatgc 12661 catctccact gattgacgag gtgtaatcca gtgttggtaa tcgaaatctg tgatattgtg 12721 caatattgaa cctatcttga tggcttctag taagtcttgg gaatcgttgc tagatgcacc 12781 atcacaacca aaagtgacgt tgactccagc ttgacgatat ttcagaatgg gggcgatacc 12841 gctacctaga cgcaagttac ttaagggatt gtggacaact gtggatttgg tttcggcgag 12901 tattgcaata tcggtgtcac tcaagtgaat gcaatgggca agggatgtgc gatcgcctaa 12961 atacccaatc cgccccagat gttcaaccgc actacagccg tatttttctt gtgcgagttt 13021 ctcttgcgct ttggtttcca gtagatgcga gtgacggcaa agattgtact tatcgcttaa 13081 ctcaatacat ccttggaaca aagcatcaga acataattgt attcctgtgg gcgcaactaa 13141 aatactcata ccctcatctg ggcgatgaaa ctgcctgaca gcttcttcta taagttccag 13201 tgttgcctga gttgagcgaa aataaggctt gtgagttagt gcggtggtta ctccagatgg 13261 tatcccagca ctcagggatt catcttgaat cagaggagag acaaaagcac gaatgccaac 13321 ttccttgtaa gcgcgaacgg cggttgcaat tgtttccaac tctttgcctg 
gaatgaggat 13381 gagatgatcc accacactcg ttccaccgga aagtaaagtt tccactgcgg ttcccaaagc 13441 actgaggtaa actttttcta tatcaagagg cgcaaagtcg tagagttgtg ctaaccataa 13501 ttctagagga aacggtggaa tgatacctcg ttgccacatt tctgaggagt ggctgtgggc 13561 gttgaaaaat ccgggcaaca ggagtttgtt tttaccgtta attgccgtac cgataacctc 13621 tagagtgggt gcaatagcgg cgatggcttc gcctgcccta tgccccaaga tgccttcagc 13681 atcttgccct gagcgaagct cagggagagg gttttcctcc agaaaaccct ctgggcatat 13741 acgggcgatc gccccatcta caacttgcac atccacagtt gcataatcat caacagtggc 13801 aattaaaaca ttttggattg taaagttcac gggttaaacc ttaattaagt aatagaacgt 13861 ttaggagatt ttcaagttac aaaataaagg atggtgttgg gtgtagcaat gtatgaatat 13921 acctgttcgg aatttaggaa ttgcaccaaa cgcatgggct gtcaaccata cgttcgcaga 13981 tatcactcgt cctccccaga tcccacaacc cgttatttta tcaacagaaa ccaaaaccct 14041 gcgcctggac ttggcaaaaa ctgctatcct cgtcattgat atgcaaaacg acttctgtca 14101 ccctgatggc tggttggcgc atattggtgt agatgtcacc ccagcacaga aacctgttca 14161 acctttacaa atcttattac ctgaactccg aaacaagaat gtccccgtga tctgggtgaa 14221 ttggggaaat cgtcccgact tactcaatat tagtgccaat gttcttcacg tctacaaccc 14281 cacaggtgaa ggcgtgggat taggtcatcc gttacctact aacggtgcta aggtactcat 14341 ggcaggtagt tgggctgcgg cggtggtaga tgaactccaa caattacccc aggatattcg 14401 tgtggacaaa taccgtatga gtggtttttg ggatacccct ttagatagta tcctgcggaa 14461 cttgggcaga acaacactac tttttgcagg agtgaatgct gatcaatgtg tgatggctac 14521 cttacaagat gccaacttct taggatacga ctgcctatta gttaaagatt gtactgcgac 14581 aacttctcct gagtattgtt ggctagcgac aatatacaac gtcaaccagt gctttggctt 14641 tgtgagtgat tcgcaagcaa tcttaagcgc acttcaaagt gctgagtgtt gaatagtaat 14701 tgggctgtta tgctctatgg ttatataacc atagagatat ggctagaatc tacttccaag 14761 cttgaaactt cttacaccaa taactataga ggaaacaaat tatgtacgcg actcgttgtg 14821 tcattcccgt tgtcaaatct cccaaagatt accaagcatg tctcaacttc aacaaagcca 14881 tgagtgttga atagtcattg ggctgttctg ctctatggtt atataaccat agagatatgg 14941 tcggcacttc caaactcacg actctttaca ccaataacta taagaggaaa caaattatgt 15001 acgcgactcg ttgtgtcatt cccgttgtca 
aatctcccaa agattaccaa nnnnnnnnnn 15061 tatgtacgcg actcgttgtg tcattcccgt tgtcaaatct cccaaagatt accaaacaga 15121 tcgcaacttg aacaaacccc tgagtgttga atagtaattg tggcttgcct ctgtatgacc 15181 atataaccat agagagagaa atacacttca aaactcacga ctccttacac cgataacaat 15241 aagagaaaac gaattatgta cgcgactcgt tgtgtcattc ccgttgtcaa atctcctaaa 15301 gattaccaaa catatcgcat cactccccaa gactcaaatc gcttagcaat tatctttgac 15361 acagcaagtg ctaacacttc cttgacttgt tgcgtggaaa tttttgatgt tggtgggaaa 15421 acaccaccca atcgtcatca atgggcggtg gaaatgtttt ttgtcctcaa aggagaagga 15481 atcgccagtt gtgatggcaa aagagtccga attaaagcag gagatagttt gttagtccct 15541 cccactggca ctcatttgat tgaaaatatc ggttatggtc gtttatatac cctgactatt 15601 atggttccga atgaagattt tgcggaattg attcgtagtg gtacgccagt ggagttggat 15661 gaggaagata tggcagtgtt ggggagagtg gatagtttga tgccatgtaa agtgtgattt 15721 aaatagaggc agagcctaga acactgatgg tgaaccgtta cataaaacct actcaaagct 15781 actcaacaat actcttttct atacaagata actagaacta tttttgattg ctttgagttt 15841 tgtttatttc ctgcttcatc ctggagatgt cggcatcaac cagtccacag gctatcactg 15901 tctaactgac agctttactc aaattgattg ctgcgttcaa atcacggtca acaacaaaac 15961 cgcagtgacc gcagttgaat actcgctcat caagcccaag tgtttctttt ttggttccgc 16021 aattagaaca ggtcttgctg gatgggaacc atcggtcaac tacaaccaat tttgagccat 16081 aaagctcaca cttgtaggtt aactgtctac ggaactcgaa aaagctcata tcagctattg 16141 ctttagccag tttatgattt gccaacattc cagatacatt caagtcttca attactactg 16201 tgccgtggtt cttggctagc aatgtcgtaa gtttgtgcaa cgtatctttg cggatgttgg 16261 caatctttct gtgcagtctg gctatctgta attgtgcctt cttccagtta actgaaccaa 16321 taattttatg gcggtttaac cattgcattc tagataactt agcttcgtac tttttataag 16381 acttagcacc tgtaattact ttaccagttg ataatgtggc aaggttttta actccaaggt 16441 caacgcctac aacgtttgta ttacctagat tttgtgtttc tatctcaaac cgaaagctaa 16501 gaaaccacct atcagcttgg cggctaattg ttgcagattt agttaatact tgaggcagtc 16561 gttcataagt tttaagtacg ccaatcacag gtacttgaat cttgttactg cctaagattt 16621 tgactgtacc ttcaagcgta aaagagtcac gctaaaacaa gcttagatct gttcctaatt 16681 ggcaagaacg 
gttaagagaa ttcgttgatc gtctgattgg agaagaacct actcaaaaag 16741 aagagagatg agtataggtg attaggaatt aaatgacagt tccttgcccc acccccgtaa 16801 agctacgcta tggctaacgc cacgctagtc cctaagaggg acgctgcgca aacgctatca 16861 cgtcccctcc cc // LOCUS NODE_1974_length_16846_cov_4.615032 16846 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16846) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16846) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16846 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 305..490 /locus_tag="DP116_17355" CDS 305..490 /locus_tag="DP116_17355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748416.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17355" /translation="MPFQKNNKLGANRRLKRPLDKETISLRGYEGQKEKLKAVPDWQE RLREFVDQLTSDLPKNE" gene complement(699..2177) /locus_tag="DP116_17360" CDS complement(699..2177) /locus_tag="DP116_17360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318388.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_17360" /translation="MLGQQEDDLLVETSPLPPVPIQEISEDRAITQTVLPSTPKSPVK ISKPEIRTSLRALTFESVFATVFYSMIGGALLTNFLLELGAGPVEIGLLASIPQLVNL LQPLGAYLVDRSPSFHWYSMFIFVPSRLLWVILLPAIWLLTSSDISFDITGHQVLLLT LGIILVTNIIEALGRAPFLGWTAVLVPQRLRGRYFGFRNSLVSLTNLIGVPLLGLAVS KWPYGTLQGYGVVLVLGVVFGLSSLVSQFWLTDVNPQLLKVAGSETSQAQSGGIDLSF LKDANFLKFVFYIAIWCFAVNVSAPFFNLYMLDNLEIDISVVTIYNGIATGANMLLLL LWGKLADRIGNRPLLVFVGVLVGVTPLLWLGTGADSISLWVWFPLLHVLTGGTWAAID LCTNNLMMAVAPLGNQSKYFAITGAVAGVSGAIGITCGSFLATQAGAGGLLGLFVLSG VLRLFALLPLLFVQEERSVPLDKLWQVLFPVRQQKVLIESKE" gene 2828..3061 /locus_tag="DP116_17365" CDS 2828..3061 /locus_tag="DP116_17365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318387.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17365" /translation="MTQEPRKLINLSVHESADPSVINPTNPDKEASGEINDLNDSVHD EENVDVPIPSLFDDDSDDNPVDPQIGIVGRSAG" gene 3711..6146 /gene="glf" /locus_tag="DP116_17370" CDS 3711..6146 /gene="glf" /locus_tag="DP116_17370" /EC_number="5.4.99.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-galactopyranose mutase" /protein_id="PRJNA477356:DP116_17370" /translation="MSGEQSQIKNNGISNGKTKMLKAKLSTLTEPLSSGASSQNLSIS NKIYKEASTDTPDIICLSHLRWNFVYQRPQHLLVRCAQGRRVFFIEEPIFTSEPLGRL DVSLDKNGVVVVVPHLPQGLSEEAVNADLKVLIDGLFAEHNIRKYICWYYTPMAIAFT RHLQPEAVVYDCMDELSAFKNAPPALKNNETELFHRADLVFTGGQSLYESKVNQHPNV YAFPSSVDVAHFAQGRTLKEEPADQVNIPHPRLGFFGVIDERMDIELLAGIADARPDW HLVMIGPVVKIDPASLPQRENIHYLGGKDYQDLPAYLAGWDLAMLPFARNESTRFISP TKTPEYLAAGKPVVSTSIRDVVRPYGNLKLVRIADTVSEFVAAAEMAMQEDTAVSGWL SRVDAFLEQISWDRTWGSMMQLIESAIAAQNDENKISSNQIVTGKQAPNIITREFVFD YLIVGAGFSGSVIAERLASQSGKKVLVVDKRSHIGGNAYDHYDDHGILVHKYGPHIFH TNSREVFEYLSQFTAWRAYEHRVLASVDGQLVPIPINLDTINKLYGMNLTSFQVEEFF KSVAEPKDYIRTSEDVVVSKVGQELYEKFFRNYTRKQWGLDPSELDKSVIARIPTRTN RDDRYFTDSYQAMPLHGFTRMFETMLAHPNIKVMLNTDYHEIETSIPCREIVYTGPVD EFFDYRYGKLPYRSLDFKHETHNKSVFQQAPVINYPNEHLYTRVTEFKYLTGQEHHKT SIVYEFPKAEGDPYYPVPRPENQEIYKQYKELADETPGVYFVGRLATYKYYNMDQCVA QALSVYKQIAVKA" gene 6156..6344 /locus_tag="DP116_17375" CDS 6156..6344 /locus_tag="DP116_17375" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17375" /translation="MSASKRYAQSARNHANAHATRTQLIIKRTAKTSQRRAGVPPVEA TGVAKNAKEEKENIKVEF" gene 6347..8596 /locus_tag="DP116_17380" CDS 6347..8596 /locus_tag="DP116_17380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318385.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dTDP-4-dehydrorhamnose reductase" /protein_id="PRJNA477356:DP116_17380" /translation="MVSFNSTLNTHNSQLPLEVWAGVECTVNRVGDEYFDQLERNGHA TRLDDLDLFAELGIKAIRYPILWERTAPDGLENADWSWASERLGRLRELGIRPIVGLV HHGSGPRHTSLIDPEFPEKLALYARAVAERYPWVTHYTPVNEPLTTARFSGMYGHWYP HGGDELSFARALLGQCRAIALSMKAIREVNPNAQLVQTEDLGKIYSTAKLAYQAQFEN ERRWLSFDLLCGRVTPTHRMWGHLRHCGINEAELEWFLENPCPPDIIGINHYLTSDRF LDERKERYPVCSHGGNGRDEYADVEAVRVCAEGAADPRNLLLEAWERYKLPIAITEAH VSCTREEQLRWLYEVWSAAGQLRDQGVDVRAVTAWSLLGSYDWNSLVTRSVGYYESGV FDLRSRSVSSGESPHPRPTAIAKMVRDLAAGRKPNHPLLETPGWWHRQERLLYPAVSC LKESSGQSGVGGEMTSSSSPSPLVIVGATGTLGRAFARLCELRGISYRLLSRKEMDIA DCASVNTVLTELKPWAVVNAAGYVRVDDAEREPHVCLRVNATGPAILAAACAQHNVAL LTFSSDLVFDGAVFNPYVESDAVAPLNVYGCSKALAEKLVLKLHPASLVIRTSSFFGP WDDYNFVTIALRQLSAGNTFVAAEDAIVSPTYVPDLVHTSLDLLIDGECGLWHLANQG AIAWADLARLAAKTAGFNPSNVIALPTRELGLTAKRPTYSVLGSNRGDIMSCLDSAMS RYFDECQRF" gene 8876..9862 /gene="galE" /locus_tag="DP116_17385" CDS 8876..9862 /gene="galE" /locus_tag="DP116_17385" /EC_number="5.1.3.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-glucose 4-epimerase GalE" /protein_id="PRJNA477356:DP116_17385" /translation="MTTILVTGGAGYIGSHAVLALKNAGYEVVVLDNLSNGHRELVEE VLQVKLIVGDTSDRPLLDTIFSTHNIAAVMHFAAYIAVGESVTDPAKYYHNNVAATLT LLEAMLAASINKFIFSSTCALYGVPKFVPLTEDHPQDPISPYATSKWMVERILSDFDT AYNLKSVRFRYFNAAGADPNGLLGEDHEPETHLIPLVLMAALGKRESISIFGTDYPTR DGTCIRDYIHVTDLAQAHILGLEYLLKGGDSEVFNLGNGSGFSVREVIESAKEVTGGD IKIEERDRRAGDPPILVGSSDKASKVLGWRPQYPNLQEIISHAWQWHQQRHG" gene complement(9951..10430) /locus_tag="DP116_17390" CDS complement(9951..10430) /locus_tag="DP116_17390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863606.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4383 domain-containing protein" /protein_id="PRJNA477356:DP116_17390" /translation="MDNVNKADIIERYCALIIGILFLVLGVAGFIPGLVSLPGTTASY VPIDATKSAYAMGFGYVFGLFPTNFLHNIVHCAVGLLGIASYTSTSSARIFNRSFAVA YTLLSIMGLLPLAKTTFGLMPLFGNNVWLNALTATVAAYYGSIIQAKVRGTTVLHEL" gene complement(10578..10727) /locus_tag="DP116_17395" CDS complement(10578..10727) /locus_tag="DP116_17395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195831.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PsaJ protein" /protein_id="PRJNA477356:DP116_17395" /translation="MQKQNEQVKYFLQYLSLIPVIAVISISVAFSTWAVFNYFFPDLL FHPMP" gene complement(11372..11764) /locus_tag="DP116_17400" CDS complement(11372..11764) /locus_tag="DP116_17400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407029.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="signal peptidase I" /protein_id="PRJNA477356:DP116_17400" /translation="MQAKVVFGNAIVEGAKRWGWFIGHFITPSEDPRSTEDLEVKWAV HKAGDSRTQWAVNNEAATLSILIHGRFRLQFEDGDIVLSQEGDYVLWCSGVPHCWVAE SDCTIVTVRWPSKSGDSVGMPRQFEVTE" gene complement(11754..12854) /gene="galK" /locus_tag="DP116_17405" /pseudo CDS complement(11754..12854) /gene="galK" /locus_tag="DP116_17405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407028.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="galactokinase" assembly_gap 11826..11835 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 13224..13463 /locus_tag="DP116_17410" CDS 13224..13463 /locus_tag="DP116_17410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867725.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17410" /translation="MELQETQTKKTENNVLSAENIHDYINPEKIRESEAKTQAQTDNA VTSKEALDPRLRYGFTLILAIFLFIAAIYYGIINP" gene 13675..14220 /locus_tag="DP116_17415" CDS 13675..14220 /locus_tag="DP116_17415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876766.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA 2'-phosphotransferase" /protein_id="PRJNA477356:DP116_17415" /translation="MSYSRLVQISKYLSKYLRHTPGAIGIKLAPGGWVSVDELLTACA KNKFPLTRQELQVVVELNEKKRFSFNSTGTLIRANQGHSTEVDLQLEPVVPPDVLYHG TGHKSVESIMQTGLCKMSRHHVHLSKDIATAQIVGARHGKPVVLLVDTAAMYQAGYKF YCSDNGVWLVDSVPPEYLQKI" gene complement(14561..15685) /locus_tag="DP116_17420" CDS complement(14561..15685) /locus_tag="DP116_17420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M24" /protein_id="PRJNA477356:DP116_17420" /translation="MNPLNQEVSTKLELIRQTLIETEMQGLRLRGTDWFAWATAGASN TVLLTAETGVAEVLVTAQDAWVLTDEIEAQRLQDEELPANFKLHINPWADAARREAFV RDATNGGKVLSDSFAAALRADRPIPHVEQQLPPSLQHHKRVMMSSELERYRQVGRKAS VAMTEVLKAAKPTWTEYQLAGAGAEALWARGLHPALTLVAGERRLPLYRHATATGEQI GREAMLVFCARGYGLYANLTRFVCFGALWDEQTELHRHVREIEAQALNLCKEGTSLNS VYHTLAQAYQQHGFNHAIREHHQGGTTGYLAREIVANPATTDTLAEGIAVAWNPSLPG AKVEDTFVILQDGQLENLTFDPNFPSTEVEGRLRPVVLEI" gene 15934..16695 /gene="ubiG" /locus_tag="DP116_17425" CDS 15934..16695 /gene="ubiG" /locus_tag="DP116_17425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198896.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-demethylubiquinone-9 3-O-methyltransferase" /protein_id="PRJNA477356:DP116_17425" /translation="MKRNNLEYYDLNADKWRKESEPLHLSNHLNKSRFEFFSSYVPDW KGVKVLDIGCGGGLACEFLASLEANVSGIDLSLNSIKAAQEYAQINHLNIDYQWGAAE NLPYDEKIFDVVLCYDVLEHVADWQKVLSEAYRVLKNNGLFFFDTINKTFKSKLIMIW LLEDILKQLPRGLHDWNKFIQPQDMLDIMKSIGFADVVIKGFDLTAGTSLKTLRDIVF EGLNNQNKGKEIKLFEIHINDDTSVWYIGKAVKLI" BASE COUNT 4867 a 3699 c 3746 g 4524 t 10 others ORIGIN 1 agttggattt gggggaagcc ccatcttcag ccgaaggcaa gatggggtac ttcactcctc 61 tggttgaatc accacgagtt gcagccggac tcttaacagg acttgcgcat gcgctcatta 121 cttcgtcccc gcagcgtaag gtaaatcgcc gacaccgatg agtaatgaac tcattaaagg 181 atatgaatgg gaagcctaag tcgttctagt acgttcatca gtactgacag ggattgccta 241 cgggacgaga gtcctatggt aatggcgtgg tctttccatg cttgggcagt caaatcaatt 301 gtcaatgcca tttcagaaaa acaacaaact tggtgcaaac agaaggctta aacgaccgct 361 ggataaggaa acaatctctt tgagaggata tgaaggtcaa aaagaaaaac ttaaagctgt 421 ccctgactgg caagaacgcc taagagagtt tgttgaccaa ctcacatcag atctacccaa 481 aaatgagtag ggatgggtag aagtgggtag gaattaagtg acagtttctt acccacatcc 541 gcagagcgaa ccatctttcg cgggaaaaac gaccaggtag ggagcgatcg ctaattgccg 601 gaacggctag ttcggtagaa atttttgata tccaatgttc gacattattc ccctctaacc 661 
taatggtaca gaggggaata ctataaatgt ttgacaactt attcctttga ttcaatcagc 721 actttttgct gcctaacagg aaacaggact tgccaaagct tatccagagg cacagagcgc 781 tcctcctgaa caaaaagcaa gggcaggagg gcaaacagcc gtaggacgcc tgagagaaca 841 aacagcccaa gcaagccacc ggcaccagcc tgagtcgcca gaaagctacc acaggtgatt 901 cctattgccc cactcacacc agcaaccgcc cccgtgatcg cgaagtattt ggactgatta 961 cccagcggtg ccactgccat cataagattg ttggtacaca ggtcaattgc cgcccatgtc 1021 ccaccagtta gcacgtgtaa caagggaaac caaacccaaa gggaaattga atcagctcca 1081 gttcctagcc acaacagagg tgtcaccccc accaagactc ccacgaatac cagaagtggg 1141 cgattcccaa ttcggtcagc cagtttgccc cacaaaagca gcagcagcat gttagcacca 1201 gttgctatgc cgttataaat tgttactacg ctgatatcta tctccaggtt atccagcatg 1261 tagaggttaa agaagggagc gctgacgtta acggcaaaac accatatggc aatgtaaaac 1321 acaaacttca aaaaattggc gtctttgagg aagctaagat ctattccccc tgactgcgct 1381 tgggatgtct ctgaaccggc gacttttaaa agctgcgggt tcacatcggt caaccagaac 1441 tgactgacca gactacttag cccaaacaca actcctagaa ccaagaccac accgtagcct 1501 tggagtgttc catagggcca tttcgatacc gctagaccca gcagcggcac accgatgaga 1561 ttcgtcaagc tcacaagact attgcgaaag ccaaaatacc gcccccgcaa ccgctgtggg 1621 actaacacag ccgtccagcc caaaaaggga gcacggccca aagcttcgat gatattagtc 1681 accaatataa ttcccaatgt caacagcagc acttggtgcc cagtgatatc aaatgagata 1741 tcagatgagg tgagcaacca aatcgctggc aagagaatca cccacagtag ccgcgacggg 1801 acgaaaatga acatcgaata ccagtggaag ctgggacttc ggtctactag atacgctccc 1861 agcggttgga gcagattcac caactgagga atcgaggcta aaagaccaat ttccactgga 1921 cctgcaccca gctccagcaa gaaattagtg agcaacgcgc cgccgatcat actgtaaaaa 1981 accgtagcga agacactctc aaaagttaac gccctgaggc ttgtccgaat ttccggctta 2041 gaaattttaa caggcgattt tggagttgaa ggaagtaccg tctgtgttat tgccctatcc 2101 tctgagattt cctgaatagg gaccgggggg agagggcttg tttctactaa taagtcgtct 2161 tcttgttgcc caagcatatt atttagccaa tatttaaaga gttaacggga acttaattgc 2221 tgctttgatt tgagtgcatt tccgaagaaa aatctcctaa aaaatgcttt gttgaaatcc 2281 tgtcatggca atttccccta tcagggttaa taagtaaacg ctatagaggg cttcatgaat 2341 ctaaccaaaa 
ctgtcacaaa acagccatta atgttgttta aaacacattt gtaaccattc 2401 ttattatttg agcgatcaat ttatcaaata ataattttct aacttataat aatataaatt 2461 tattttttta aatctccttc tcaggagtta ttctttttgt ataatcaatt tttaattttc 2521 aataaaaatg tttgctgaaa ttattttagg gttaatcatt agatgtcata tctaaagaag 2581 aatatactgc tccaatggga ttaatcactt gacacgaccc aaagcaatgt aacgcaagat 2641 ttgaagcggt acgttaacag agtgtaacgc actctactac tgtagtactt gacccgaaag 2701 cgacctttgg gaggcgatcg ctacaattgt catccaaaag agtacgtccg ttgctagagg 2761 gaacgatacc cgcgaatcct taacttatgt agatagttag tctagcaact tcaacaaaga 2821 ggcaaaaatg actcaggaac cacgtaagct gattaattta tcagtgcatg aaagcgctga 2881 cccaagtgta attaacccta ctaatccaga taaagaagcc tctggagaga taaatgactt 2941 gaacgactct gttcatgatg aagaaaatgt tgatgttccc attccgagtc tttttgatga 3001 cgatagcgac gataaccctg tagatcctca aatcggcatt gtaggtcgaa gtgctggata 3061 agctgacttg atggaatgta gctatgatta caggcagctt taacattaca cctgcctgtc 3121 gatagtaaaa aacctgttaa tattcagcaa cactaaaaat tgcacaacaa catccttttg 3181 tccctgttag ttacgcagtt attagcaggg gtttttttat ctaaaaccta atgcaccgta 3241 aatttatttg ctgcgttcgt tacataggta aattttaaga gggaagaaag tagttaggat 3301 caagcgattc attttttaca ttaatacacg tactggttta tggagtcccc gtccttaagg 3361 acggggtctt tactgaagtt ctatctttca gcaaaattgt attcataatc acaaatgctc 3421 ttaacattat ataaaattct tgactagaga caaatatgtc gattgataga aggtagcata 3481 aacagttagt agtatgtttt taaattagtg atagacattg ttgtcaaagt aaatagcctg 3541 aatactttct aggcagtcaa taaaatagca aaagcgtctt taccagataa ttatcactgc 3601 ccttgtacta ataaaaagtt agcaaaaagg gtattttttg tatgcaattc tatctgggaa 3661 agcatttgta aacacttcaa ttacctacaa acaaaaaatt gagataacct atgtctggcg 3721 aacaaagtca aataaaaaat aacggtatca gtaatggtaa gacaaagatg cttaaagcta 3781 agctatcgac attgactgaa ccgctatcat caggtgcgtc atcgcagaat ttatctatat 3841 ccaacaaaat ctacaaagaa gcctctacag atacgcctga tataatttgc ttgtctcatt 3901 tacgttggaa tttcgtctat caaagaccgc aacatctttt ggttcgttgc gctcaaggac 3961 ggcgggtttt cttcattgag gagccgattt ttacctccga accgttgggg cgattggatg 4021 taagccttga caagaatggg 
gtagtggttg ttgttccaca cttaccacaa ggtctgagtg 4081 aggaagcggt aaacgcggat ctaaaagtgc taattgatgg tttgtttgca gagcataata 4141 tccgcaagta catctgttgg tactacacac cgatggcgat cgcatttaca cgccacttgc 4201 aaccagaagc agtcgtatat gattgcatgg atgagttatc tgcattcaag aatgcgccac 4261 ctgctttaaa gaacaacgaa accgaacttt tccaccgtgc agatttggtg tttacaggtg 4321 gacaaagcct ttacgaaagc aaggtgaacc agcaccccaa cgtctacgca tttcctagta 4381 gtgtagatgt cgcgcatttt gcacaaggaa gaactcttaa agaagaacca gcagatcaag 4441 tcaatattcc ccatccgcgc cttgggttct ttggggtgat tgacgagcgg atggatattg 4501 aactgctagc tggtattgcc gatgcgcgtc ctgactggca tttggtgatg attggaccgg 4561 ttgtgaaaat cgatcccgca agtttgccac agcgggaaaa tatccattat cttggtggta 4621 aagattatca agatttacct gcatatttag cggggtggga cttggcgatg ctgccgtttg 4681 cccgtaacga gtcaactcgc tttattagcc ctactaaaac tccagagtat cttgccgcag 4741 gtaagcctgt ggtgtccacc tcaattcgcg atgtcgtacg tccctacgga aatttaaagc 4801 tggtgcgaat agcagacacg gtttctgaat tcgtcgccgc cgcagaaatg gcaatgcaag 4861 aggacaccgc agtttcggga tggttgagcc gggtagatgc gtttctggag cagatttctt 4921 gggatcggac ttggggatca atgatgcaat tgatagagtc tgccattgct gcccaaaatg 4981 atgagaataa aatcagctca aatcagattg ttactggtaa acaagcacca aacatcatta 5041 ccagagagtt tgtcttcgat tacttgattg ttggtgcggg gttctcaggg agtgtgattg 5101 ccgaacggtt ggcaagtcag tctgggaaga aagtgctggt tgtggacaag cgatcgcaca 5161 ttggcggcaa cgcttacgat cattacgacg atcatggcat cctcgtacac aaatatggtc 5221 ctcacatctt tcacaccaac tcccgcgaag tctttgaata cctctcgcag ttcactgcgt 5281 ggcgggctta cgaacatcgc gtcctcgcca gcgtagacgg gcaacttgtt cccatcccca 5341 tcaacctcga caccatcaac aaactctatg gaatgaacct gacttcattt caggtggagg 5401 agttcttcaa gtcggttgct gaaccgaaag attacatccg aacctcagag gatgtggtgg 5461 taagcaaagt cggtcaggaa ctgtatgaaa agttcttccg gaactacact cgcaaacaat 5521 ggggactcga cccatcggaa cttgacaaat cagtcattgc ccgcatcccc acccgcacca 5581 accgcgacga ccgatatttc acggattctt accaggcgat gccgctgcac ggctttaccc 5641 ggatgttcga gacgatgttg gcacatccca acatcaaagt gatgctcaac accgattacc 5701 atgaaatcga aacaagcata ccttgccgcg 
aaatagttta cactggacct gttgatgagt 5761 tctttgatta tcgctacggc aaactaccgt atcgttcgct tgatttcaag catgagacgc 5821 acaacaagtc ggtgtttcag caagcgccag tcatcaacta tccaaacgaa cacctttata 5881 cccgcgttac agagtttaaa tacctgacgg gacaggaaca ccacaagact agtattgttt 5941 acgagtttcc caaggcagag ggagaccctt attaccctgt accgcgtccg gaaaatcagg 6001 aaatttacaa gcaatacaag gaactggctg atgagacgcc aggtgtgtat tttgtaggaa 6061 ggctggcaac ctacaagtat tacaatatgg atcagtgtgt tgctcaggct ctttctgttt 6121 acaaacaaat tgcggttaag gcttgatact acttcgtgtc tgcgtctaag cgctatgcgc 6181 aaagcgcacg caatcatgcg aacgcgcacg ctacgcgaac gcagttaata ataaaacgaa 6241 ccgccaagac gagccagcgc cgtgcggggg ttccccccgt tgaggcgact ggcgtcgcaa 6301 agaacgccaa ggaagaaaaa gagaatatta aagttgagtt ttaagtatgg tttccttcaa 6361 ctcaacactc aacacccata actcacaact tcctttggaa gtgtgggctg gtgtggagtg 6421 tacagttaat cgtgtgggtg atgagtattt cgaccagttg gaacgcaacg gtcatgcaac 6481 gcgcttggat gacctagact tattcgccga actagggata aaggctatcc gctacccgat 6541 tctgtgggag cgaaccgcgc ctgacgggtt ggagaatgct gactggtcgt gggcttcgga 6601 gcgactgggg cgattgcgcg aactgggcat ccgtccgatt gtgggcttag tgcatcatgg 6661 tagtggacca cgtcacacca gcttgataga tccagaattt ccagagaaac tagctttgta 6721 tgcccgtgcg gttgcagaac gttatccttg ggtaacgcat tacacacctg taaacgagcc 6781 actgacaacg gcacgattca gtggaatgta cggacactgg tatcctcacg gaggtgatga 6841 gttaagtttt gcacgtgctt tgttggggca gtgtcgtgcg atcgcccttt cgatgaaggc 6901 gatccgagaa gttaacccta acgcccaact tgtgcaaacc gaggatttgg gtaagattta 6961 cagtacggca aagctggcgt atcaagctca atttgagaac gagcgccgct ggttgagctt 7021 tgatttatta tgcggtcgag tcaccccaac tcatcggatg tggggtcacc tgcgtcactg 7081 tggcattaat gaggctgaac ttgaatggtt tctggaaaat ccctgtccgc cagatattat 7141 cggaattaac cactacctga cgagcgatcg ctttttggac gagcgcaaag aacgctatcc 7201 ggtttgttcg catgggggta acgggcggga cgagtacgca gatgtagagg cagtgcgggt 7261 ttgtgctgag ggtgcagcag atccgcgcaa cttgctacta gaagcatggg aacgctacaa 7321 actgccgatt gctattaccg aagctcacgt cagctgtacc cgtgaggagc agctgcgctg 7381 gctttatgag gtctggagtg cggcgggaca attacgagat 
cagggtgtag atgtccgcgc 7441 tgtcactgct tggtcgctcc ttggtagtta cgattggaat agcttagtga ctcgttcggt 7501 tggttactac gagtcaggcg tgtttgactt gcgttcgcgt agcgtctcct ctggagaatc 7561 gccacaccca cgaccgacag caattgccaa gatggtgcgg gatctagctg ctgggcgcaa 7621 accgaatcac ccactacttg aaacacccgg atggtggcat cggcaagaac ggttattata 7681 cccagcggta agttgtttaa aagaaagcag tgggcagtcg ggagtagggg gagaaatgac 7741 ttcctcatct tccccatctc ctctagtcat tgtgggtgca acaggaacct taggaagggc 7801 ttttgctcgt ttgtgtgaac tgcggggtat ttcgtaccgc ttgctgtcac gcaaagaaat 7861 ggacattgct gattgtgctt ctgtcaatac ggttctgact gagttaaagc cgtgggcggt 7921 tgtgaacgct gcgggatacg tgcgggtgga cgatgcggaa cgcgaacccc atgtttgcct 7981 gcgggtgaac gccaccggac cagcgatttt agctgcggct tgcgctcagc ataacgtggc 8041 actgctgact ttctcgtcag acctcgtatt tgacggtgct gtgttcaacc cttatgttga 8101 aagtgatgcc gttgctcccc tcaatgtgta tggctgcagc aaagctttgg cagaaaagtt 8161 ggtattgaag cttcatcccg catcgctggt cattcgcacc agttcatttt tcggtccttg 8221 ggatgattac aattttgtaa caattgcact acgtcagcta agcgctggga ataccttcgt 8281 cgctgctgag gatgcaattg tttcgcctac gtacgtgccg gatcttgtcc acaccagtct 8341 agatttgttg attgacggcg agtgtggttt gtggcatctg gctaatcaag gtgcgatcgc 8401 ctgggctgac ttggcacggt tagcggcgaa aacagcaggc tttaatccta gcaatgtgat 8461 tgccctgcca acgcgagaac ttggtttaac cgctaagcgc ccgacttaca gcgttcttgg 8521 tagcaatagg ggtgatatca tgtcctgcct tgacagtgcg atgtctcgct attttgatga 8581 gtgtcaacga ttttagattt ttgtcatttg tcaatgattt ctaactctat acgggcgttg 8641 ctttcgacca gaaatatatc gagttatcag aaatattttt gcttcgtagc ttcgccccta 8701 cctaattgat tgtttaatcc aaaatctaaa atagacttgt tgcagaagta gggaacgctt 8761 aacgcttaac aggggactct taacgcttaa ctcttaacgc ttaagtctta acagaacctc 8821 gtaaaatctc acttttgcaa tcttgtctaa tctaaaacct aaaattagta cagttatgac 8881 aaccatttta gtcacagggg gagcaggata tattggctcc catgcagtat tagctctaaa 8941 aaacgcaggt tatgaggtcg ttgttcttga taatctgtcg aatgggcatc gagaacttgt 9001 ggaagaagtt ttgcaggtaa agttgattgt tggtgatacg agcgatcgcc ctcttttaga 9061 taccatattc tcaacccaca atatagctgc agtgatgcat tttgctgcct 
acattgccgt 9121 gggtgaatct gtcactgacc cagccaaata ttaccacaac aacgtcgcag ccaccctgac 9181 gcttttagaa gcaatgcttg ctgcttccat caacaagttc atcttttctt ctacttgcgc 9241 tctttatggt gtgcctaagt ttgtgccgct gactgaagac catcctcaag accccatcag 9301 tccttatgca actagcaaat ggatggtaga gcgaattttg tctgatttcg atacagctta 9361 caatctcaag tctgtccgtt tccgctactt taacgccgca ggtgctgacc caaatgggtt 9421 attgggtgaa gaccacgaac cagagactca cctcatacca ttggtgctaa tggctgcttt 9481 aggtaagcgc gaatctattt ctatttttgg cactgattac cctacccgtg acggtacttg 9541 cattcgagat tacattcacg taactgactt ggcacaagca catatcttgg gtttggagta 9601 tctcctaaaa ggaggagata gcgaagtctt taatttaggc aatggcagtg gattttcagt 9661 tagagaagtc atcgaaagtg caaaggaagt cacaggaggt gacatcaaaa tagaggaacg 9721 cgacaggaga gctggtgatc cgcctatctt agttggcagt agcgacaaag caagtaaagt 9781 tttgggttgg cgtccccaat acccaaacct acaggaaatt atctcccacg cttggcagtg 9841 gcatcaacag cggcatgggt agacataatg aaaatcaagt tttgagttta aatatcccgc 9901 tctagtcact ggataattat ccagtgacta gagttggtgt atggagtacc tcaaagctcg 9961 tgcaatacag tagtaccccg aaccttggct tgtatgatac tcccatagta ggcggcgaca 10021 gttgccgtta gagcgttcaa ccaaacattg ttaccaaaca gtggcattaa gccaaacgtt 10081 gttttagcca agggtagcaa tcccatgatc gagagcaagg tataggcaac tgcaaaacta 10141 cgattgaata tgcgtgcact gctggtgcta gtataggagg caatccccaa tagaccgacg 10201 gcgcagtgta cgatgttatg caagaaattg gtgggaaaca gcccgaatac atagccaaat 10261 cccatagcat aggcgctctt agttgcatca attgggacat aagatgcagt cgtcccaggc 10321 aatgaaacca aacctggtat aaatccagct acacctaaaa ccaaaaagag aattccgata 10381 atcagggcac agtaacgctc tatgatgtcc gccttgttta cgttgtccat attattcatg 10441 acctcaagtt ttgtgcaaat cttcactcga atgttcgcta tatgcaaaga tagggtttca 10501 accgagaaat acagaagcga atcgctgacc actctcgggg ttctaattaa ctgaaatccc 10561 taatatagat tcaaacctta cggcatcgga tggaaaagaa gatcgggaaa aaagtagtta 10621 aacacagccc aagtcgagaa ggcaactgat atcgaaataa cagcaataac tggtatgagg 10681 gaaagatact gaaggaaata tttgacttgc tcgttttgct tctgcatgac tgctttccaa 10741 aaaaagttaa tgacttaagc tgaactcgta tggcgaaaca 
aaaattgatt ctagtgatat 10801 acaaatcact aaaaaatagc tcagtttgct tttggtgtaa aaactttcaa actgaggact 10861 tgaagttgac tgtacaatca ctaaattaat tacgaaaatt ttctttttct tctaactaaa 10921 gatagggcta atattaagtt aatttgtcat tgataacaca aaaaatatat tgcttcaggg 10981 ttggaatagc tgtgtgaagt gagaaatctg aaatctaaaa aatgacagaa gcaatatatt 11041 gatggaatat taatggatac taaaaaagcc ccgatggaat ttcaggggct aagcgaggtt 11101 gaattagagg aatcgaagaa cacagactgc tttgttgttc aacttaatac tatctattta 11161 ataactgtaa atatccctcc taaagtcgga aaatcagtta caaattactg tgtttatgaa 11221 tacaaccagc cccacttcaa atattccttt ttgatagttc atactttcaa gaaaaaaagc 11281 ctcctgctaa atagacgggt gagacagcag gagacaactg gaatgttgat attttcacta 11341 gaaagttttt ataaagtagc catgcttagc atcattcagt cacctcaaat tgtctcggca 11401 ttcccacgct atcgccggac ttggagggcc atctaacagt aacaatagtg cagtcagact 11461 cagcaaccca acaatgcggc acacctgaac accaaagaac gtagtcacct tcttgagata 11521 gtacaatatc tccgtcctca aactgaaggc gaaatcttcc gtggatgagg atggaaaggg 11581 tggcggcttc gttattgact gcccattggg ttctactgtc tcctgctttg tgcacagccc 11641 atttcacttc taaatcttcg gttgagcgcg gatcttcact cggggtgata aagtgaccaa 11701 taaaccagcc ccagcgcttc gcgccttcaa caatagcgtt accaaaaaca actttagcct 11761 gcatctgtca aatcaaaaga aggaactaaa attcgtccag tgttacctgt tttgttatat 11821 tcttcnnnnn nnnnntcttc aaggacattg gttgcaatcc ctctcccctc acccgctaca 11881 actaaagcta cgcttgctcc cccaaaacca ccacccgtca agcgtgcgcc gaagactcct 11941 ggggttttct gcaatatcgc tactaaggta tctactgctg ggaccgaaac ttcgtaatcg 12001 ttgcgctgac tggcgtggga agcattcatc aattcaccaa agcgtttcgc tgacacccca 12061 tgtacagcct caagcactcg gttgtcctcg gtgatgacgt gtctcgcgcg acggcgcaac 12121 ggttcgggta atgactctgc ggcttgtgga tcggtgatgt ctctgagcgc tttgactccc 12181 aagcgccgcg ccgcttcctc agactcagcc cgacgctggt tatatccgct acctgcagcc 12241 agtgcatgat ggacaccact atcgattacc aagatttctg ctccccctgg gaaaggtatg 12301 actcgacgct caagggtgcg ggtgtccaaa aacagcatcg agtcagtgcc agccaaactc 12361 gatgccattt gatccatgat gccgcagttt aagccggcat actgaatctc cgcttgttgt 12421 ccaagttggg cgatttccac 
atcattaatg ggaagattga gcagttcgcg caatccccta 12481 agcgttgcaa cttccaaagc ggcactgcta gataagccta cacctatcgg gactgtcgat 12541 ttgacgtaca aagaaagcgg cggtatcgta tatccttgtt tttgcaaaag ttgaatacac 12601 ccaaagatat aacttgcaaa tccagatggc gtatgattga tttctaaaat gttgacttgc 12661 tcgtctaaat cttcagaata aaagtggtga tgtccgtcgg tactaaaacc tagttgtacc 12721 gtggtacgtt gaggaatcgc agttgggaga acaaagccat cgttgtagtc agtatgttca 12781 ccgagtaggt ttacccttcc tggcgcactg gcttctgttt caggtgattt accaaatatt 12841 tgttggaagt tcataattat ttcatcaaag cataagtatc aaatttcacc ctctaacgag 12901 agatagaaaa taggaaattt ccaattgcat ctttgaacaa ttttcaattt tcaaatctaa 12961 gagctaattt ctaaaataaa tcttaaatca ttatgatgct tcaaggtaaa ataacccact 13021 cttcatgagc agtacgttgc agtgcttaga gataagtagt tgctaaaaac ctaaacatca 13081 gattttatca gagcgcttag agaattgtat tcacagttac atataaattt catattagct 13141 agttctcaag aatgatgtag taattagtga atcattgaaa atataacagt agaggaagca 13201 caaagacata acaaaagaaa tctatggaat tacaagaaac tcaaacaaaa aaaacagaga 13261 ataatgttct gtcggcagag aacatacatg attatatcaa tcccgaaaaa ataagggaat 13321 ctgaggctaa aactcaggca caaacagata atgcagtcac tagtaaagag gcacttgacc 13381 ctcgattacg ctatggcttt acactgatat tagcgatttt cttatttatt gctgccattt 13441 actacggaat catcaacccc taggtgatta acctaagctt gggggacgtc tctcaggctt 13501 agttaaatca tgttctgttg caaaattgtg ccttcctgcc taatctgtgg acaagtgggc 13561 cagcgggtca gcgtaaaata cagtgccttt gtgggaattg tgattattcc ttcttgggac 13621 tcatgttata ttattaatat atatatgtag tatcaataat gttactgcaa tcatatgagt 13681 tattctcgcc tcgtccaaat cagcaaatat cttagcaaat atttgcgaca tacaccgggt 13741 gcaattggaa ttaaacttgc ccccggtggt tgggttagtg ttgatgaact gcttaccgct 13801 tgcgctaaaa acaaatttcc actcacccgt caggaattac aggtggtggt tgaactcaac 13861 gagaaaaaac gcttttcttt taactccaca ggcactctta ttcgtgctaa ccaaggtcac 13921 tcaacagaag ttgatttaca attagaacct gttgttcctc cagacgtgct ttatcacggt 13981 acgggacaca aatctgtaga gtcaatcatg caaacaggac tctgcaaaat gtcgcgacat 14041 catgtccatt tatcaaagga tattgctaca gcacaaattg taggtgcaag acatggaaaa 14101 
ccagtagttt tgctcgtaga tactgcggct atgtatcaag ctggttataa attctactgt 14161 tccgataacg gagtttggtt agtagatagt gtgccacctg agtatctaca aaaaatttga 14221 attttcatag gttgcagaga ttcatttaaa aatcataaaa ttttagattt attgttataa 14281 aagtgtcata aatagaactc aagatttact aggacttacg cactgtacaa attaaccata 14341 atatgggcgt tgctgaacca aaatatgaat catgcgctct gcgcaatcgc tcaaattcat 14401 atttctattc tgcaacgcca taatataaat taaggttttt ggtcgtttca ggcacaaggg 14461 gcttcccgta gacgacgaag ggcggctttc tttgcccaag gtaagctagg ctaggacacc 14521 aaggaaagac aagtgttgca ttatttttgt gttatttgtt ttaaatctcc aaaaccaccg 14581 ggcgcaatct tccctctacc tctgtgctgg gaaaatttgg atcaaaagtc agattttcca 14641 actgtccatc ctggagaatg acaaaagtat cttcaacttt tgcccctggt aaacttggat 14701 tccaagcaac agctatacct tctgctaaag tgtcagtggt agcgggattt gctactattt 14761 ctcgtgctaa atatcccgta gttccacctt gatgatgttc gcggattgca tgattgaatc 14821 cgtgctgttg ataagcttga gctaaagtgt gataaaccga attaagagat gttccttctt 14881 tacataaatt caaagcttgg gcttcaattt cgcggacatg acgatgcaat tcggtttgtt 14941 catcccaaag cgcaccaaaa cagacaaatc gcgttagatt cgcatacaaa ccatatcccc 15001 tagcacagaa caccagcatc gcttctcgtc caatctgttc cccagtggct gtagcgtggc 15061 ggtatagggg taaacgcctt tcgccagcca ccagcgtcag cgctggatgc aatccccttg 15121 cccacaatgc ctctgcacct gcacctgcta attgatattc tgtccaggta ggcttggcgg 15181 cttttagtac ctctgtcatc gcaacgctag cttttcgccc cacttgacga tatcgctcta 15241 gttcgcttga catcatgact cttttatggt gctgtagaga cgggggtaat tgctgctcta 15301 catgagggat agggcgatct gcgcgcagcg cagcagcgaa gctatcgctc aaaaccttcc 15361 caccattagt agcatcccgg acaaaagctt cacgacgagc agcatcagcc caagggttga 15421 tgtgcagctt aaaattagct ggtagttctt catcctgtaa acgctgggct tcaatttcgt 15481 ctgttaatac ccaagcatct tgcgccgtta ccaatacttc tgctactccg gtttcagcag 15541 tcagtagtac ggtgttagaa gcgccagcag tcgcccaagc aaaccagtct gttccgcgta 15601 accgtaaccc ctgcatctcg gtttctatga gagtttggcg gattaactcc agcttagtag 15661 agacttcttg attcaaagga ttcatgttca agggtgcgat cgcataattc ataatttaca 15721 ccaacaccaa ataatttatc tctgctgtca ttgttacctc ttcaatcgcg 
agcatggttt 15781 taggttaagt ttgaatgtta ccaggaatac aagcaaattt atggaaaatc tcctcccaat 15841 aagcatatct gcaccctacc ttatgcagat actaaatcag tagtgaaaag acacgataaa 15901 aagagttttc tggaatggca ttgaggagaa aaaatgaaaa ggaataactt agaatactac 15961 gatttgaatg cagataaatg gagaaaagaa agtgaaccat tacatttgtc taaccatctg 16021 aataaatcaa ggtttgagtt tttctcaagt tatgttcctg attggaaagg ggtcaaagtt 16081 ttagatattg gttgtggagg tggattagct tgtgaatttt tagcttctct tgaggctaat 16141 gtatcaggaa tagatttatc tttaaactca attaaagcag cacaagaata tgctcaaatc 16201 aatcacctaa atattgacta tcagtgggga gcagctgaaa atttacctta cgatgaaaaa 16261 atctttgacg tggttttgtg ttatgacgtt ttagagcatg ttgctgattg gcaaaaagtt 16321 ctttcagaag cttacagagt tttaaagaac aatggattgt ttttctttga tacaattaac 16381 aaaactttca aatctaaatt aataatgatt tggcttttag aggatattct aaagcagcta 16441 ccacgtgggc ttcatgattg gaataagttt attcaacccc aagacatgct tgatattatg 16501 aagagcattg gctttgcaga tgttgtcatc aaagggttcg atttgacggc tggtactagc 16561 ttgaaaacac ttagagatat tgtatttgag ggtttaaata atcaaaacaa aggtaaagag 16621 ataaaattat ttgaaatcca cattaatgat gatacttccg tatggtatat tggcaaagct 16681 gttaagctaa tttgatattt cggctgttgc ataaataggt aagctttggg acgaaattta 16741 cgaattcgcc gctcactacg tccacagcaa acaaaaattc tcctctctac cattagacat 16801 gtccgggaat ttgcaaagca ctccacatct ggctttgcac agcatt // LOCUS NODE_1982_length_16790_cov_4.86029316790 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16790) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16790) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16790 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..877) /locus_tag="DP116_17430" CDS complement(<1..877) /locus_tag="DP116_17430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496175.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GMC family oxidoreductase" /protein_id="PRJNA477356:DP116_17430" /translation="MSRVLKRRQFLQGSLAAAASVGVSAVRASAVRTKEDYVEAIVIG SGFGGAVASLRLGQAGIETIVLERGRRWQITDAGDTFSTYQQPDGRSTWLSPTTVVFD QVPIDVYTGVLDVKRGDNIRAYRGAGVGGGSLVYNGVTYQPTQELFYQVFPRTINYEE LDRVYYPRVRSILKPSQIPDDILQTNYYLSSRIFLEQAAKAGLKARKLDMAVDWDIVR QEIAGQKVPSAITGQVYYGINSGAKNSLDRNYLSMAEATGKVEIRPLHVVTTIEESVD TPLPKGEGILHSSSEL" gene complement(1095..2681) /locus_tag="DP116_17435" CDS complement(1095..2681) /locus_tag="DP116_17435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320812.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alkaline phosphatase" /protein_id="PRJNA477356:DP116_17435" /translation="MVDYQNFERFLYSRIKRRNLILGAGVLSGLAIASQFNQRAIAQV PFPSYPFTLGVASGEPYPTSVVLWTRLAPNPLQGGGMPSVNVPIRWEVATDPNLKNVV AKGTEIAIPELAHSVRVVVNKLKPNTWYWYRFTSGTEESPIGRTRTAPQPGSDVSQFA FAFVSCQHYEQGYYTAYKYLAQEDISLVVHTGDYIYEGGIASNGVRQHNSSEIFTLDD YRNRHALYKTDPNLQATHAAFPWIVTWDDHEVENNYANATSEVDNEPDQDPQVFLQRR AAAYQAYYEHMPLRPFSKPEGPDMQIFRRLSFGNLATFHVLDTRQYRTDQPCGDGTKE RCPENFDPNATITGKRQENWLYQGLDRSTARWNILAQQVIVAQRDLTPGEGATFSMDK WDGYLASRDRLMSFLEQRKPSNPVVLTGDVHSNWAMDLKADFNKPESATVGSEFVCTS ISTGGDGADSSPTVEAYLPDNPHIKFYNGQRGYVRCVLTPATWQTDYLVLSNVTTQSG TISNRASFVVENGRPGIQKV" gene 3164..3868 /locus_tag="DP116_17440" CDS 3164..3868 /locus_tag="DP116_17440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015174651.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_17440" /translation="MTAVVPTPSKCGIIYPSSDKEPVAETYDHLYAILTTLEVLRQYL LNRQATVLANQFLYYVEGFPKLRVAPDVMVIYDVEPGGRDNYKIWSEKQVPKVIFEIT SKSTQDEDKSSKKNLYEALEVQEYWLFDPKGEWISQKLQGYRLKGDSYESITDNQSEP LQLRLAVEEKVIGFYRLDNGQKLLVPDELAQALKEETLKRLEAERQAEQERQRAVKLE SLLARYKERFGELPEE" gene complement(3902..4237) /locus_tag="DP116_17445" CDS complement(3902..4237) /locus_tag="DP116_17445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4870 domain-containing protein" /protein_id="PRJNA477356:DP116_17445" /translation="MQQVNQRRLLSAICHGAIFFSSTIVSVGIPIAIMLTTKDSVVKA NAKESLNFHINLYIYAIVFALLVLVAIGIPLLAVLTIVSFIMPILAILHILDDPNLPY RYPFIFRVL" gene 4580..5152 /locus_tag="DP116_17450" CDS 4580..5152 /locus_tag="DP116_17450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407758.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="group 1 truncated hemoglobin" /protein_id="PRJNA477356:DP116_17450" /translation="MKIQSLFTKAFWLALACITVVVVSVFKLSPSFARSTTTPPAQLS TAQVAVAQYDSRLAKAPYDSRSGEKSLYKRLGGYSAIAAVIDDTAQIVFNDPLIGKYF IGLSTNSKQRLRQLLIDQFTQAAGGPAVYTGRSMKLSHSGIGGGLTNAEYDAFVNGIA QALDKNNVNQPEKDEVLAFANSFRDEIVER" gene complement(5420..5677) /locus_tag="DP116_17455" CDS complement(5420..5677) /locus_tag="DP116_17455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877198.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17455" /translation="MSSRTRYAIAFFQTPKSISATTATTPPLFLLAEAKKAALYVARF NPDNTGTWIALNLETPTNPIAPSVISSAALSHHLRGLTENQ" gene complement(5667..6068) /locus_tag="DP116_17460" CDS complement(5667..6068) /locus_tag="DP116_17460" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17460" /translation="MSKLSRRQVLIFFAGAAGAVLADQVLGGTVDAREAKFAPLSFTP VRLPHPLPIYKQQKNYLPREIGQGKTLDASPDVKLVSYNVVDDVVVPPEYERYVIVGW GDRPYTAKPVLSEQASSNLRFEAKSLRDFEQ" gene 6709..7266 /locus_tag="DP116_17465" CDS 6709..7266 /locus_tag="DP116_17465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314410.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17465" /translation="MSLKSKQEVRYQLKYLFTHVLASVPLLLACAVFLSVISSQSSTL LLCLILGTIAVALTQQIGQTIGLPKPWLILLQTLVFSAILSFFWIDYFSDPAAAQFFG KAETFFKNNLTQGSQQTGAGAAVSLVFNVLRALYLLYIAVALIGVINAVRKDEDWQVV ARTPLLVVIAVTVADVLTSFIVGSQ" gene 7285..7662 /locus_tag="DP116_17470" CDS 7285..7662 /locus_tag="DP116_17470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994646.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17470" /translation="MTDERDKEFRPVNQILGTQPSLGPIPADQIFPWTIIALVSYMIV NGIFGGVFSDEWQKWLWTVLIAGWGIATWWILTGGRSWRFLSKFIGVPTWTRGTARYK SFLEFHYERKNRKTKRRHRRSRK" gene 7610..10357 /locus_tag="DP116_17475" CDS 7610..10357 /locus_tag="DP116_17475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17475" /translation="MKEKIGKQSVGIAGQENRLTPFEDALNLATMLRIAVDGRDVGAY ILTKGSQRDRFCFVFGFECRGIHTTLRTEQIDTICNNIEAGLKDLPPEERVTFHLGSF STDKKRQQELASLVQKTSSPSIQYLLTSERARTQQLTRAGIRKPKFLRVYVTYTVEPD ASAADDWIEKFLARAESWWLSFKGEAAERENQKLESLISNAYTEGFRRWEQILSNKWG LDIKALTAKDLWQEAWRRFNSTEPIDIPQLLVLDEKGLHEEIYSDLSSCKLLLENIHS TTLLMESDVPVADRKWVNLKNRYVGVMTFLEKPGGWQNKASQLRYLWELVARDAIVDT EIFCELTAANPAIVKTTLQRVLKQSNVTSKVAQEKGNTIDVGAQLKLRKSVAAQEQIY EGALPIYTGIAILVHRQSPEKLDEACRYIENCFQRPAQVIRETEYAWKIWLQTLPIVW EMLLAKPFNRRQLYLTNEVWGLIPLVTTRAGDRKGFELIADEGGTPLHLDLFNQHKNL ALFATTRAGKSVLVSGILTQALAHNMPVVALDFPKPDGTSTFTDYTEFVGEKGAYFDI SKQSNNLFEQPDLRFLSEEEQRERFQDYAAFLESALMTMVLGSSTGNQLLAQTVRSLL NLALNTFFADEGIQKRYYAAILEGFGTEAWQKTPTLRDFIIFCSREHLNLHGIGGRID DALEVINLRLRFWVESRVGQAVSAPSSFPTDAQLLVFALRNLSDNEDAALLSLSAYSA ALRRALSSPASIFFIDEAPILFEFDQISDLVGRLCANGAKAGVRVILSGQDPDTIAKS KAASKILQNLSTRLIGRIQPVAVDSFVQILKYPREIIARNASEGFFPRKEDIYSQWLL DDNGVYTYCRYYPGFEQLAVVANNPDEQAARNQAMQRYGDKFEAVSHFARQLIGSIRG Y" gene 10720..11520 /locus_tag="DP116_17480" /pseudo CDS 10720..11520 /locus_tag="DP116_17480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319633.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 11840..12898 /locus_tag="DP116_17485" CDS 11840..12898 /locus_tag="DP116_17485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879624.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17485" /translation="MWIYNVKIFLAQLQLDTSNVLTNGVVTAQSIAEGWDKQWIDLLQ NNTNNNLYGALTNLGIFFAVGTLLFFMAQWIKDVLDNEYSRPLSALIWPFIVVLLLAN PGNGTALSNLTLGLRDFLNTINQQVVEAADVNQTYQQALNMSVGEEVVGGLLRPCQSL TGQQQTNCFIKAKEKIDVLLGQYRNTYGIQPWIDRLEIKVNQIVISTGNVSEFGFNSL VGSTTQTIIKNLLVSLQSAFQNLIEVTMLLIAALGPLAVGGSLLPVAGKPIFAWLTGL FSIGIAKISFNIIAVISAAVIVNGPAQNLDADPDLMWFMILLGVLAPIISLGLAAAGG FAVFNAISNTSVWIQQRV" gene 13240..14064 /locus_tag="DP116_17490" CDS 13240..14064 /locus_tag="DP116_17490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17490" /translation="MTKLLQEKKSSVNLLTLFTIFTFSLHFLAAIFLLFEGLRIYGLI HKKPLTFVQLVDGKRVSQIDTLEREPEVIRQFVAKTMAAMFNWSGTLPPASVEDATNP KPDPGIPINTLQNLTKKVSTSSWVGSFALSEDFRQGFLAQIAEMTPPEIFSKNNNQAL TGQLVIQRVYPPEKIAPGRWRVGMVANIVQIRRSDNKKLLIPFNKDFFVRSVDSFGHP LSNSLTPLQKAVYSVRAQNLEIYEVSDFCLTNGYDSSPKSQSQRCGDIPNSGSFTR" gene 14078..14755 /locus_tag="DP116_17495" CDS 14078..14755 /locus_tag="DP116_17495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197375.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17495" /translation="MLFNQKKNNTSILPVFVVATFVLNVLTILLLMYHQSMLKRLSGQ LPQSLVQLVDGRAITIDSQENLERNPETIRRFVGETMTMMFTWSDKQPQQIVWQATSE LLSGDVRRKFEVETTQGIPKGVLANPGGNAESLLLIRRISQPEKIADGQWQVEIVANR LIFAGYNNKMGEAIPFNKKILIRALETQAISIPNVENPLYSAIYRLNEARLEIANICD IKQKKCP" gene 14926..16620 /locus_tag="DP116_17500" CDS 14926..16620 /locus_tag="DP116_17500" /inference="COORDINATES: protein motif:HMM:PF03743.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17500" /translation="MANLVGLEEQIPPSRPESSLSETSSANPDEAPPSHPSSQKTKQA LSSNPFAKVGVVGAATLSVALVAGAFLTQLMSGTNKQAPKNFPQITRNENESNKIEQL KPEEEIEILKTKLALAEQAKAVKLAQLQLKSVRPTTQPKPTPAQPRIKPQTVTRVVVQ RVPTPAQTVYVPRVVERIVRVPQRVVVQQPKPISPVPPQPTVRPVPPQPTVPPQQTPQ PTAKPSISPSFELSIPGLNPSELAQIPFALSIPTPTPTPTPTATPTPTPTPSLPRVAN SPPSTLGSELNSRNRDNAQILPVPNRRNTAANPQQTPAEATTNETTQYTGKSVAVGTN AKAVLATAVFGEANRIGSNNSNSNSNNNNNNNKNDSQFVVRLREPLKSVDGAIALPAN TELLAQLDQVSETGALNLTVVSVVSQNKSNLTETRLRQSAMKVRAPGGRPLLAKKYPD KSGKISAMDTFIFGLGGAGQIGRTINLPETKTRNTCDGLNDAQRSVQQYCGYFSETKQ PRNIAGAVLEGGMNALVPQLNQRNQQAINEMITKSNIWYLPAGTEVEVVANQITRF" BASE COUNT 5118 a 3429 c 3546 g 4697 t ORIGIN 1 caagttctga cgatgaatgt agaatcccct cgcctttagg caggggagtg tcaacagatt 61 cctctatcgt agtgaccaca tgtaggggtc gaatttcgac ttttccagtc gcctcagcca 121 tgctcaagta gttgcggtct aagctgtttt tcgcaccact gttgataccg tagtacactt 181 gacctgtgat ggcggaagga accttttgac ctgcgatttc ctggcgcacg atgtcccaat 241 caacagccat gtcaagcttg cgagctttta gacctgcctt tgctgcttgc tccagaaaaa 301 tgcggcttga caaatagtag ttagtctgta ggatgtcgtc tggtatttga gaaggcttga 361 ggatagaacg cactcgcggg taatagaccc gatcaagttc ttcgtagtta atggtgcgtg 421 gaaagacttg atagaaaagt tcttgagtcg gctgataagt caccccattg tagacaagcg 481 aaccgcctcc aacgcctgct ccccgatagg ctcttatatt atctcctctc ttgacatcaa 541 gcacgccagt atacacgtca atgggaacct ggtcaaatac 
taccgtagtg ggactcagcc 601 aggtagaacg accatccggt tgctggtatg tggagaaagt atcaccagcg tcggtgattt 661 gccatcgtcg cccccgctcc agtacaattg tttcaattcc tgcttgacca aggcgcaatg 721 atgcgactgc tccaccaaaa ccgctaccaa taactattgc ctcaacataa tcttccttag 781 tacgaacagc ggaagcacga acagcagaca cacccacact cgccgcagct gcaagcgaac 841 cctgaaggaa ttgacgacgc ttcagcaccc gagacatact taatctccta caattttcaa 901 aaataaattt ttttatggca tgaaatttag attacctatt caaattaggt atattcagtt 961 tttacactcg ttacaatcag ttggttgcca aattgccaag attatgacga aaaaggtgta 1021 gaatgctaaa aagccagtac tacatctcct taggatgtaa tactggcttt tttcaacgaa 1081 aagcaactat ttttctaaac tttctgtatt cctggacgac cattttcaac tacaaatgaa 1141 gcacggttac tgattgtgcc agattgggtt gtcacgtttg acaagactag atagtctgtc 1201 tgccaagttg cgggagtcag cacacaccgg acatatcctc gttgaccgtt ataaaattta 1261 atgtgcggat tgtctggtaa atacgcttca actgtaggac ttgaatctgc cccatctcca 1321 ccggtgctaa ttgaagtaca aacaaattca ctcccaactg tagcagattc tggtttatta 1381 aagtcagctt ttaaatccat tgcccagttg gaatgcacat cgcctgtcaa gacgacggga 1441 ttggagggtt tgcgttgttc taagaaactc attaggcgat cgcgcgaagc caaatatcca 1501 tcccacttat ccatgctgaa agttgcgcct tcccctggtg tcaagtctct ctgagcaaca 1561 ataacctgct gtgccagtat attccaacga gccgtagagc gatctagacc ttggtatagc 1621 caattttctt gacgtttacc agtaatcgtt gcattgggat caaaattttc tggacaacgt 1681 tctttagtcc catctccaca gggttgatct gtgcgatatt gacgggtatc tagcacatga 1741 aaggttgcca aatttccgaa ggaaagccga cggaaaattt gcatatctgg tccttctggc 1801 tttgagaagg gtcgtaacgg catatgctca taataagcct gataagcagc cgcccgacgt 1861 tgtaagaaga cttgggggtc ttgatccggt tcattatcga cttctgaggt agcattggcg 1921 tagttgtttt ccacctcgtg atcatcccaa gtgacaatcc agggaaatgc tgcatgagtt 1981 gcttgcagat ttggatcggt tttgtaaagg gcgtgacggt tgcgataatc gtctagagtg 2041 aaaatttcag agctattgtg ctgtctgaca ccatttgacg cgattcctcc ctcatagata 2101 tagtcacccg tatgcactac caagcttata tcttcctgcg ccaagtattt atatgctgtg 2161 tagtacccct gctcgtagtg ctgacaggag acaaatgcaa aggcaaattg actcacgtcg 2221 ctacctggtt gaggagctgt acgagtgcga ccaattggac tttcttctgt 
acctgaagta 2281 aagcgatacc agtaccaggt attgggtttg agcttattca caacaactct aactgagtga 2341 gctaattctg gaatggcaat ttcagtgcct tttgctacta cattttttag gttggggtca 2401 gttgccactt cccagcgaat tggcacatta acagatggca ttccgccgcc ttgtaaagga 2461 ttgggagcaa gacgagtcca cagtaccacg ctggttggat agggttcacc tgacgctaca 2521 cccagcgtga agggataact gggaaatggg acttgggcga tggctcgttg attgaattgg 2581 ctggcgatcg ccaaaccaga caatacccct gccccaagaa ttaagttccg tcgcttgatt 2641 cggctgtaga gaaaccgctc aaaattctgg tagtctacca ttgttctccc cctgactaac 2701 tcacaaatga acgcagagga acaggcaaca caagccaagc aggatgagaa ttgacaaacc 2761 actaaaccaa tatcttaatc gccgaactca cactttgaat cacgtcaggt tacatcaatc 2821 actaaacctt gtcattcaat accagagcta ggcaacgcct gttatagttt tatgaagtat 2881 gtctatcctt ttatttaaag aacaccaaag tattaaattt tcctgctgct gcaaagtgtc 2941 aactgttgct ttgccgaatt taacttacca ctcagacgtt aatttctagc aaagaaacag 3001 ttaaaacagg gaacagtgaa cagggaacag ggaacaggga acaaggaaca aggggtggac 3061 gagtccgttt cctgcctggt aactgataac tgataactgt ttgattcacg ttgttgactc 3121 taaggtttct aaaatagaaa aacaacactg aggaattgca acaatgactg ctgttgtccc 3181 taccccctct aaatgtggta ttatttaccc aagcagcgat aaagaacctg tggcagaaac 3241 atacgaccat ctttacgcca tattaactac tctagaagtt ctcagacagt atctactcaa 3301 tcgtcaagct acagttctag caaaccaatt cttgtattat gtggaaggtt ttccaaaatt 3361 gcgcgttgct cctgatgtga tggtcattta tgatgtagag ccaggaggac gagacaatta 3421 taaaatttgg tcagaaaaac aagtaccaaa ggtgattttt gaaatcacat caaaaagtac 3481 tcaagacgaa gataaaagta gcaagaaaaa tctgtacgaa gcattagaag tacaggaata 3541 ttggttattc gacccgaagg gagagtggat ttcacaaaag ttgcagggat atcgactaaa 3601 aggagatagt tatgagtcca tcaccgataa tcaaagcgaa cctttgcaac tacgtctagc 3661 agtggaagaa aaagtgattg gtttttaccg tttggataat ggacagaaat tgttggttcc 3721 tgatgagtta gcccaggcgc tcaaagagga aacgctgaaa cgtctggaag cagagagaca 3781 agcagaacaa gaacgccaac gcgctgtcaa actcgaatcc ttgttagctc gttataaaga 3841 gcgatttggg gaattaccag aagaataatc aaagaaaaaa ttctgtatct ctacttctca 3901 actacaacac acgaaaaata aacggatagc ggtagggtaa attaggatca tcaagaatgt 
3961 gaagaatagc aagaattggc attataaaac tcacaattgt taatacagcc agcagtggaa 4021 taccaattgc tacaagaacg agtaatgcaa aaacaattgc atagatgtag agatttatgt 4081 ggaaattaag cgattctttg gcatttgctt tgacaactga gtccttggta gtcaacataa 4141 ttgcaatggg tatgccaaca gagacaattg tcgagctaaa aaagattgct ccgtgacata 4201 tggctgataa aagcctgcgt tggttcacct gttgcatatc atagccccct gttatgtaga 4261 cgtgatgaat cgggtctaat cttgacgata agtcaattct aattttcttg tgccacaggt 4321 gtcatctgct gtcttgtaga ttagggtgag caagtaccca gcctacaacg gttataaaaa 4381 gtaggaaaat taggatacaa acaaaaaatc acgaaatttt tctaagagtt cattcaatct 4441 ttaaaaatta cccctagtaa gtacacggca tatgtcgttt aatttatagc aagtaatgtt 4501 tcagcaatta cactcaagaa tttttcttgt gtaaggagaa aaattactaa gtaacaaact 4561 taagagagga gttttcataa tgaaaataca aagcctattc acaaaggctt tctggttggc 4621 attggcttgt atcacagtcg tggttgtgag tgttttcaaa ttaagcccta gttttgctcg 4681 ttcgacgaca acgccgccag cacaactatc aacagcgcaa gtggctgtag cccaatatga 4741 ttcacgttta gcaaaagctc cctatgattc acgttcagga gaaaagtctc tctataagag 4801 gctaggaggt tacagcgcga tcgccgccgt catcgacgat actgcgcaaa tcgtattcaa 4861 tgacccactg attggtaaat actttattgg cttgagcacc aactcaaaac agcggctacg 4921 tcagttgctg atagatcaat tcacccaagc ggctggtggt cctgctgttt atactgggcg 4981 gagtatgaag ctttcccata gtggaattgg tggaggtctg acgaacgctg aatatgatgc 5041 ttttgttaat ggaatagcac aagcccttga taaaaacaac gtcaatcaac cagagaagga 5101 cgaagtactg gcgtttgcca atagtttcag agacgagatt gtcgaaagat aataaacgag 5161 cgatagtaca ctcttaggca cttacaagca ataaaaaatc caggcgtgtc ctacagctgg 5221 atttttattt tttggatttc cctgatccat tctgtttggt tatctttgat gagcatggga 5281 atcatcttcc atgctcaggt gaacggttta ctaaggctac attaacttag atgatgtttg 5341 agaagtctca tttcatactt tacttggcga ctagaagtcg cagcaacgag cgtcaaaact 5401 tgcctgcgca ggttcaaaat tattgatttt ctgttagtcc acgtaggtgg tgagacagcg 5461 ctgcagaaga tatcactgat ggcgcaatag gattggtggg agtctctaag tttaaagcga 5521 tccacgtccc agtattatct ggattgaaac gagcaacgta gagagctgct ttctttgcct 5581 ctgccaacag gaacaaaggc ggtgtagtcg cagttgtagc cgaaatactc ttcggggttt 5641 
ggaaaaacgc gatcgcgtag cgcgtcctac tgctcaaagt cgcgaagcga ctttgcctcg 5701 aatcggagat tcgaggaagc ctgttctgaa agaacaggct tagcagtata cggacgatcg 5761 ccccaaccaa caataacgta ccgttcatac tctggaggta caacgacatc atcaacgacg 5821 ttgtagctaa ctaacttcac atcaggtgaa gcatctagag tttttccctg tccaatttct 5881 ctgggtaaat agttcttctg ttgcttgtaa attggcagag gatgtggtaa acgtacaggt 5941 gtaaaactca gaggtgcaaa ttttgcttct cttgcatcaa cagtaccacc taaaacttga 6001 tctgccagta cagcgccagc agcgccagca aaaaagataa gtacttgtct gcgactcaac 6061 ttagacataa aattttcctt ttcagtgcag tctccacttc tttaaggagt tgtcttaaaa 6121 gacaaaccaa tgacaaaaaa gccctcgtcg tttccaacca aaccctgggc aatgatttta 6181 tttccttcgc ctagacaatt aaaaaatagt tgctgtcaac aggaaattag ctctaaaaag 6241 gcttacagcg gttcccatcc aacatgagca tatataatgt atgacgtgta agttaagaat 6301 aggtaatata atagttaata gtcgtttttt tcttattttt aattaataaa gttaatagtt 6361 atttctgatt gttcctaact ccaataatgt tatgaattca caggctgatt cataacattt 6421 ttggctagtt ttatgcatga tgtcttgcaa ctatatgtat tactttgtat tagcttcaaa 6481 agcgaaactc gtatttatca acactacatt cttcttaggt atgttatatt atccattgat 6541 cattcgcttg ttgtgtagta taatataaca ccaaaaaacg cagggtgcaa tgaataatgg 6601 caagactaag ttatttttga gattgtaatg atacattatc aacaaagagt tatagatgaa 6661 ttactatgtg taagtcgttt ttgttaaaga agttaggaac gttagtccat gtctctaaaa 6721 tctaagcagg aagtacggta tcaactaaaa tacctattca ctcatgtgtt ggcatctgtc 6781 ccactgttgc ttgcttgtgc tgtcttttta agtgtgatta gctcccaaag ctctactctg 6841 ctactttgct tgattctggg taccatagca gtagcactta ctcagcaaat aggacaaaca 6901 ataggtctac cgaaaccttg gctcatatta ctgcaaactc tcgtattttc agcaatactc 6961 agtttctttt ggatagacta ctttagcgat ccagcagcag ctcaattttt tggaaaagca 7021 gaaacctttt tcaaaaacaa tttaacccaa ggttcacaac aaaccggtgc aggtgctgca 7081 gtgagtttgg tgtttaatgt tttaagagca ctttacttgc tttacatcgc tgttgctttg 7141 attggtgtga ttaacgccgt tcgcaaagac gaagattggc aagtcgtcgc cagaacaccg 7201 ttgttagttg tcattgctgt gactgttgct gatgtgttga ccagctttat cgttggtagt 7261 caatgattaa gtcaggaatt tgttatgact gatgagcgag ataaagaatt tcgacctgtc 7321 aaccaaattt 
taggaaccca accttcctta ggaccaattc ctgctgacca aatttttcct 7381 tggacgatta tagctctcgt ttcctacatg attgtgaatg gcatctttgg aggtgtcttc 7441 tcagatgaat ggcaaaaatg gttgtggaca gtcttaattg ctggctgggg aatagctact 7501 tggtggatac taacaggtgg tagaagttgg cgatttttga gcaaatttat cggtgttccg 7561 acctggacaa gaggtactgc tcgttataaa agttttctag aattccacta tgaaagaaaa 7621 aatcggaaaa caaagcgtag gcatcgcagg tcaagaaaat agactaacac cttttgaaga 7681 tgctttgaat cttgccacca tgctgcgtat tgcagttgat ggcagagatg ttggtgctta 7741 cattttgacc aaagggagcc aaagagatag attttgtttt gtttttggat ttgaatgcag 7801 aggcattcat accactttaa gaacagagca aatagataca atttgtaaca atatagaagc 7861 tggtttaaaa gaccttccac cagaggaaag agtcactttt catttggggt ctttttctac 7921 agataaaaaa cggcaacaag aacttgcatc tcttgtacag aaaacctcgt cgcctagtat 7981 acagtacttg ctcacctctg aaagagccag aacacaacaa ctcacccgcg cgggtatccg 8041 aaaaccaaaa tttctgcggg tgtatgtgac atatactgta gaaccagatg cttcggctgc 8101 tgatgactgg attgagaagt ttttggcaag ggctgagtct tggtggttat catttaaagg 8161 tgaagcagca gaaagggaaa accaaaaact agaaagtctg atttctaatg cttatactga 8221 aggatttcga cgttgggaac aaattttatc taataaatgg ggtttggata ttaaagcttt 8281 gactgcaaaa gatttatggc aggaagcttg gcgacgattt aatagtactg aaccgataga 8341 tattccccaa ttattggttt tagatgaaaa gggactacac gaagaaattt attctgattt 8401 atctagctgc aaattactat tagaaaatat ccatagcacc acattattaa tggagtcaga 8461 tgtacctgtt gctgaccgga aatgggtgaa tctaaaaaat cgctatgttg gcgtgatgac 8521 atttctagaa aaaccaggag gatggcagaa taaagcatcg caactgcgtt atttatggga 8581 actcgttgct agggatgcaa ttgtagatac agagattttt tgtgagttaa ctgcagcgaa 8641 tcctgctata gtcaaaacga cattacaacg ggtgctgaag cagtcgaatg taacgtcaaa 8701 agttgcacaa gaaaaaggga atacgattga tgtcggcgca caattaaaat tgaggaagtc 8761 tgtggcggcg caagaacaaa tttatgaagg tgcgttgcca atctatacag gaattgcgat 8821 tctcgttcat cgccaaagcc cagaaaaatt ggatgaggcg tgcaggtata tagaaaactg 8881 ttttcaacgt ccggcacaag tgattaggga aacggaatat gcctggaaga tttggttgca 8941 gactctgcca attgtttggg aaatgttgtt agccaaaccc tttaaccgtc gtcaattgta 9001 tctcacgaat gaagtgtggg 
gattgatacc attggtgaca acaagggcgg gcgatcgcaa 9061 aggctttgaa ttaattgctg acgaaggcgg aacaccactt catctagatt tatttaacca 9121 gcacaaaaat ttagcgttgt ttgcgacaac tcgcgcgggg aaatcagttt tggtgtcagg 9181 aattttaacc caggctttgg cacataatat gcctgtcgta gccttggact ttccgaaacc 9241 agatggtacc tctaccttta ccgactatac agaatttgtt ggagagaaag gagcttattt 9301 tgatatttcc aaacaatcca ataatttgtt tgaacagcca gacttgcgat ttttgagtga 9361 agaagaacaa cgagaacgct ttcaagacta cgccgcgttt cttgagtcgg cgttgatgac 9421 gatggtgctg ggttcatcta ctgggaatca attgctggcg caaacggtgc gatcgcttct 9481 caacttagcc ctaaacacct tcttcgccga cgaaggaatt caaaagcgat actatgcagc 9541 gatattggaa ggatttggta ctgaagcttg gcaaaaaact ccaactttga gggactttat 9601 catcttctgt tcgcgggaac atcttaacct gcatggtata ggtggcagaa ttgatgatgc 9661 actcgaagtt atcaatctcc gtttgcgctt ctgggtggaa agtcgagtcg gacaagctgt 9721 ttctgcacct tcgagttttc caactgatgc tcaactttta gtttttgcat tgagaaactt 9781 atcagataac gaagatgcag cattgctgtc tttaagtgcg tattcagcag cactgcgacg 9841 tgctttgagt agtcccgcct cgatattctt tatcgatgaa gcaccaattc tgtttgaatt 9901 cgaccaaatt tctgatttgg taggtagact ttgcgcgaac ggagcaaaag caggggttag 9961 agttatttta tcaggacaag accctgacac aatcgccaaa tctaaagctg cttccaagat 10021 actgcaaaac ttatcaacaa gattaattgg tcgcattcaa cccgttgcgg ttgatagttt 10081 cgtacagatt ttaaaatatc ctcgcgaaat tattgctcgc aatgcgtcag aaggtttttt 10141 ccctcgtaaa gaagacattt atagtcagtg gctgcttgat gataatggcg tctatactta 10201 ctgtcgttat tatccagggt ttgaacagtt agcagtcgtg gcgaataatc ctgatgaaca 10261 agctgcacgt aatcaagcaa tgcaacgata tggagataaa tttgaggcgg tttctcattt 10321 tgcgcgtcaa ttgatcggat caattcgtgg ttattgaggt gagggggtag aatttaaaat 10381 aattttagac aaaggaataa tgtaactgaa atcttatacc attctcaaga acgcaacctc 10441 gcccgcccag aactgaagtt cacaggcttt tagcccaagt ccactcaagt ggactgaata 10501 ctttggtgaa tcagcattta gtcctcttaa gaggactttg actataagcc tagggtttct 10561 aaccctaggc ggttgttgga actggtgcaa gatctcagtt ttaacagacg agcgtctaca 10621 gaataacaaa aatatacttc tcaaacatcc tccaaagcaa tcgcaagtca ttaaacctaa 10681 cctgatatta tgctcaaaat 
tggcaaatct caaatcatca tgacttcttt ttttgctgtg 10741 gcatttttag gaaccttacc agccatagca gaattgggaa ctgtttggac tgattttcaa 10801 ttgtacacaa ctgattttag aaattatatt acaaacaata tttctgagac tttaaaccct 10861 gttgaattgc agacacaatc agcaatcaca agttcatcag gtgatctcaa tcttcctaat 10921 cctaattatg ctagtcagta tactcgtgac cagattactc agtactcaat ttcggataag 10981 tttgagaata atccagcagt atatggtaaa caagtgagta gtgaaattaa tcgatatatt 11041 acccgtagtg cagttgaagg tgtttttggg agagatggac aaactcgttt aaaaggtaaa 11101 ttacagaata ctgagcaagc tgttaataag agcaatagag ctgctaatac agctcagcag 11161 aagtactcgg aaatacaaag ccaaagccaa agccaaggag caatgtgctc gacacctggt 11221 gttggcaata acaatttagc aaatgtatgc gatcaagcgc aattcctgat gatgcagggg 11281 ttagtagagt tagaattaca aaacatcaat attcaaagag aacaaaccaa gatcacagga 11341 gaaaccctag gaaatacaat ccaattgcgt aatgacatcc aatattccaa cttgaattta 11401 gcagatatct cccagcaaat gaatgaggtg aatcgtgcta gaagagtggg tacatcagcc 11461 gaagtcgcac gacttttgcg agtcacttct caaactgatt tgtttaggaa aagcagtaac 11521 cctcttcctt tacaaacgaa tcctccttcc tcagaaaata atccttctcc tacagaaacg 11581 aatcctcctt cctcagagaa tatcccttct cccttggaaa caccatagga ggaatcaaga 11641 taaagtgatg gtaagatgtg gctttctaat ttctcattct cgttcccatg ctctgcgtgg 11701 gaacgcattc tcaaaggctc tgccttttgt gttgaggcag agcctccatt gtagcattcc 11761 ctggctgagc cagggaacga ggaagaaagc tgagcgaggg aacgaggaag aaagctgagc 11821 gagggaacga agaagaaaaa tgtggattta taatgtaaaa atttttttgg ctcaacttca 11881 acttgacacc agtaatgttt tgacaaatgg tgtcgtgact gcacaaagta ttgctgaagg 11941 ttgggataag cagtggattg atttattaca aaataacaca aacaataact tatatggagc 12001 gctgacaaac ctgggtattt tctttgcagt cggaacttta ctttttttca tggcacagtg 12061 gataaaagat gtgctggata atgaatattc tcgtccctta tctgctttga tttggccctt 12121 catagtcgta ttgttgttag ctaatccggg caatggaact gcactctcta atttgacact 12181 gggattaaga gattttctca acacaatcaa tcagcaagtt gtagaagccg ccgatgtcaa 12241 tcaaacttat cagcaagcac tgaatatgag tgttggtgaa gaagttgttg gtggtttatt 12301 gcgtccttgt cagtctctca caggtcaaca acaaactaat tgtttcatta aagcaaaaga 12361 
aaaaatagat gtcctcttgg gacagtacag aaatacatac ggtatccaac cttggataga 12421 cagacttgaa attaaagtta atcagatagt gatcagcact ggtaatgtct cagaatttgg 12481 ttttaactct ctggtgggtt ccacaactca aacgattatc aaaaatcttt tagtttcctt 12541 acagtctgct tttcaaaact taatagaagt gacgatgtta ctcatagcag ctttaggacc 12601 cctagcagta ggagggtctt tgctacctgt ggcgggtaaa cctattttcg catggctcac 12661 gggattattt tctattggta ttgccaagat ttctttcaat attattgctg tgatatccgc 12721 tgcagtgatt gtgaatggtc cagcgcaaaa tctcgatgca gatccagact tgatgtggtt 12781 tatgattctg ttaggagttc tggcaccaat tatatcttta ggtttggctg ctgctggggg 12841 atttgccgtt ttcaacgcca tcagcaacac ttctgtgtgg atacagcaaa gagtttagaa 12901 aaattaggag tctaaaatat aagagggaac gcttaacagg gaacagctag ttagtaggtt 12961 gggttgagga acgaaaccca acggttatct tgggttgtgt tgggttgcgc tttgcttaac 13021 ccaacctaca attattctaa tatcttgcac cacttgcaag actcgcccgc caagggttaa 13081 aaacccctgg ctaatagcaa aagtcctctt aagaggactc aatactaacc agtcagaata 13141 ttagtccact tgagtggact tgaattatta gcccggaaat tcatttccgg gtggactatg 13201 aggctagaag aatgaaataa ccctgaggtg aatggcatta tgaccaagtt attacaagaa 13261 aaaaagtcct ctgtcaatct tttaacgctc tttactattt ttaccttcag tctacatttc 13321 ttggcagcca ttttcttatt atttgaaggt ttacgtatct acggtctcat tcataaaaaa 13381 cctctcactt ttgtccaact tgttgatggt aagagagtct ctcaaattga cactcttgaa 13441 cgagaaccag aagtcattcg ccaatttgtt gctaaaacaa tggctgctat gttcaactgg 13501 tctggaacac tcccaccagc cagcgttgaa gacgcaacta accctaaacc tgatccaggg 13561 atacctatta atactctaca aaatttgacg aaaaaagttt ctactagtag ttgggtaggg 13621 agtttcgcac tttcagaaga ttttcgccaa ggttttttgg cacaaattgc ggagatgaca 13681 ccgccagaga ttttttctaa aaacaataac caagcgttga caggacagtt agttatccaa 13741 cgagtttatc ctcctgaaaa aatcgcgcct ggtcgatggc gcgttggtat ggttgctaat 13801 attgtgcaga ttagacgtag tgacaataaa aagctattaa ttccttttaa taaggatttt 13861 ttcgtacgtt cagtagattc ttttggacat cccctatcga acagtctgac tccattgcag 13921 aaagctgtct atagtgtccg cgcgcaaaac ttagaaattt acgaagtgag tgatttctgt 13981 ctcacaaatg gctatgattc ttcgccaaaa agtcagtcgc aacgctgtgg 
agatattcct 14041 aattctggta gctttacacg ataggtagaa aaagctcatg ctatttaacc aaaaaaagaa 14101 taatactagt attcttcctg ttttcgttgt cgcaaccttt gtgttgaatg tgttaactat 14161 attattgctg atgtatcacc agtctatgct caaaaggctg agtggtcaat taccacaaag 14221 tttagtgcaa cttgttgatg gtcgtgccat aacaatagat tcccaagaaa atttagaacg 14281 caatccagaa acgattcggc gttttgttgg tgaaacaatg accatgatgt ttacttggtc 14341 agacaaacaa ccacagcaaa tagtttggca agcaacttcc gaacttttat ctggtgatgt 14401 gaggcgaaaa tttgaggtgg aaacgacaca gggaattcct aaaggtgtgt tagccaatcc 14461 cggaggaaat gcagaaagct tattgttaat ccgcagaatc tctcaacccg aaaaaatagc 14521 tgatggtcaa tggcaagtag aaatcgtggc caatcgcttg attttcgcag gttataataa 14581 caaaatgggg gaagcaatac cttttaataa aaaaatattg attcgggcgt tagagacaca 14641 agcaatttct atcccaaatg tagaaaatcc tttatactca gcaatatacc gccttaatga 14701 agcgaggtta gaaattgcga atatttgtga cattaaacaa aaaaaatgcc cctagctaaa 14761 ggaaagacac tcatgaacaa cctaccaaac tctcataatt caaataattc taacaatcat 14821 aacaatcatt ctaataataa caatcataac aatcataata attcacactc ctcatctaac 14881 aatcatcatt cgcatgaagt acaaccatca gattggcatc aacggatggc aaatttagtt 14941 ggtttagaag aacaaattcc cccatctcgt ccagaaagca gtctttcaga gacttctagt 15001 gctaatcctg acgaagcacc tccttctcat ccatcctcac agaaaacaaa acaagcattg 15061 tcatctaacc cttttgccaa ggtaggtgtg gtgggtgctg ctaccctgag tgtggctttg 15121 gtagcaggtg cgtttctgac tcaacttatg agtggaacaa ataaacaagc gccgaagaat 15181 tttccacaga taactcgtaa cgagaatgaa tcgaataaaa tagagcaatt aaaaccagaa 15241 gaagaaattg agattttaaa aaccaaacta gcattagctg aacaagcaaa agctgtgaaa 15301 ttagcacagc ttcagttaaa aagtgtcaga ccgacaactc agccaaaacc aactccagca 15361 caaccaagaa ttaaaccaca aactgtcacg cgggtggtag ttcaaagagt acccacacca 15421 gcacagactg tttacgtacc tcgtgttgtt gaacgaattg tcagagttcc tcaacgtgtt 15481 gtagtgcagc agccaaaacc aatttctcct gtgcctcctc aacctactgt gcgtcctgtg 15541 cctcctcaac ctactgtgcc tcctcagcag actcctcaac caacagctaa accttcaata 15601 tctccatcct ttgagttatc aatcccagga ttaaatccat cggagttagc acaaattccc 15661 ttcgctttat caatacccac tcccactccc 
actccaactc caacagccac tcctactccc 15721 actccaacac caagtttacc aagagtcgca aattcccctc catctacttt gggttctgaa 15781 ttaaattcca ggaatagaga caacgcacaa atcttacctg ttccaaatag gagaaataca 15841 gcagcgaatc ctcaacagac accagcagaa gcaacaacaa acgaaacaac tcagtatact 15901 ggtaaatctg ttgcagtagg aactaatgcc aaagctgtgt tagcaactgc tgtttttgga 15961 gaagcaaata ggatcgggag taataatagt aatagtaata gcaataacaa taacaataat 16021 aataagaatg acagtcaatt tgttgtgcgt ctgcgggaac cattgaagtc tgtagatggg 16081 gcgatcgcac tacctgcaaa taccgaatta ttagctcaac ttgaccaagt ttctgaaacc 16141 ggagcgttaa acctaacagt cgtttcagtt gtttctcaaa ataaaagtaa tctcacagaa 16201 actcgcctac gtcaaagtgc aatgaaagtt cgcgcacctg gcggaagacc tttacttgct 16261 aagaaatatc ctgacaaatc tggaaagatc tcagctatgg acacattcat cttcggcttg 16321 gggggggctg gtcaaatagg aaggacaatt aaccttcctg agacaaaaac tagaaataca 16381 tgtgatggtc ttaatgacgc gcagcgtagt gttcagcaat attgtggcta ttttagcgaa 16441 accaaacaac caagaaacat cgctggtgca gttttagaag gcggtatgaa tgcccttgta 16501 ccccaactga accagcgtaa ccaacaagca atcaacgaga tgattacaaa aagtaacatt 16561 tggtatttgc cagcgggtac agaggttgaa gtggttgcta atcaaataac gcggttttag 16621 gttgttgctt acttgttgtc aattgactta gatacttaag tatacgattt aggttgtata 16681 tttcatatca tgttcggcga atgagttatg atttccacgt tgactgcacc ccaccccggt 16741 aaagctgcgc tttacctccc ctccccgcaa gcggggaggg gattaagggg // LOCUS NODE_1990_length_16721_cov_5.26623116721 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16721) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16721) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16721 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(88..2793) /locus_tag="DP116_17505" CDS complement(88..2793) /locus_tag="DP116_17505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316340.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PAS domain S-box protein" /protein_id="PRJNA477356:DP116_17505" /translation="MLQADVAMSSTYNPFLVALSIVIAVLASYTAVDLAGQITVAKAK ARLAWLIGAAVVMGIGIWSMHFVAMLALSLPISMGYDALTVVLSVLPAIVASGGALFL ASRPVLNTQQFQAGGVLMGIGIASMHYIGMAAMRMEAATRYDPLLFMLSVAIAIGASM IALWIAFQLRLQTGKSGRRRKILSAFVMAIAISGMHYTGMAAACFKPTRVTGTVATMQ VSLPALAVSIGVSTLIILSFTLLTSFVERRMVSQTLLLEQQEAQRSQLFMDITLRIWR SLKLEDVLNTAVCEISKALNTDRVIIYRFNADWGGTIIAESVAKGWIKTLGRTVFAPF GKDDIEMYKQKYKDGQVRAINNISEGNFTDSYREILERFQIKAILVAPLLSGHRLLGL LCAHQCSESRNWQQLEIDLFGQLAIQVSLALEQANLLHELNAAQEVLRVRDRAIAAAS NAIVITDPHQEDNPIIFCNPAFETITGYSPQEVLGRNCRFLQGSDTNPQTIEQLRNAL RQEQECHVVIKNYRKDGTPFWCELSIAPARDVTGQVINFIGVQTDITSRKQAEEELRH SKEFLQRQLMELTDDVKEVAKGDLRVRAQMTTGEIGIVANFFNTIIESLQQLVLQVKQ AAIQVNVSVGENSDAIRHLADEALQQAEEISCTLELVNQMNISIQEVANNASQAAQVA RTSANTALSAGEAMEHTASSIFNLEKTITETAKRIKHFGESSQEISKVVTLINEIALQ TNLLAINAGLEATRAGEQYQGFIVMAHEVGRLAIQSAEATQEIEQIAENIQFETNAVI QAIEQGTTQVVESAELVKDVKQSMETIVKVSHQIDDLVQSISQTTVSQADTSQAVALF MKEITKTSERTANSSGVVSTSLQQTVEVAQQLQASVSLFKTGVGR" gene complement(3153..3785) /locus_tag="DP116_17510" CDS complement(3153..3785) /locus_tag="DP116_17510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="OmpA family protein" /protein_id="PRJNA477356:DP116_17510" /translation="MHSFRFTTYLAPIALTIVANNFVYCATAKTESPQTEFLKAQLLA LEFSKIQTPIVQFSKDTYPEVNLPEIISEQIIVQENQYLTIITLPADILFDSNKDTIR PDAEKMLRQVSQAINNHYPHTWLQILGHTDSKGSKDDNLKLSEQWVAAVQKWLSEKGG IDISLISKEGYGEAQPIAPNQKSDSSDNPAGRQRNRRIEIVIQKLVNHQV" gene complement(3914..4696) /locus_tag="DP116_17515" CDS complement(3914..4696) /locus_tag="DP116_17515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408861.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ion transporter" /protein_id="PRJNA477356:DP116_17515" /translation="MLLSREKTAFYLKDLETPIGKFVNLTIAGLVLLSSVIFVAQTYN LSDNLRNFLDSIDTVVLLIFSIEYLLRVWSEENKIKYIFSFYSLIDLIAIVPYFLGGV DISFVRLLRWFRILRLIRFIDNKFFWGVSTEDSLVSTRILFTLFAIIFIYSGLIYQVE HPVNSENFATFLDAFYFSIVTMTTVGFGDVTPISELGRLLTVLMILTGIALIPWQVGD LIKRLVKTANQVETICSGCGLSFHDTDAKFCKVCGTNLRNHS" gene 4875..5318 /locus_tag="DP116_17520" CDS 4875..5318 /locus_tag="DP116_17520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216165.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleoside deaminase" /protein_id="PRJNA477356:DP116_17520" /translation="MNQEDFMRLALEEAKKGDAPYGAVIVKDNQVVAQAYNTVKRDND PSAHAEINVIRSLTTQLQNPSLEGYTIYTTGEPCPMCASACVWTGLSEIIYGASIEDL ISVNQSQINISSEEVIVKSFRKIKVTRGVLREECIKLFHKKSTFN" gene complement(5311..5973) /locus_tag="DP116_17525" CDS complement(5311..5973) /locus_tag="DP116_17525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316337.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_17525" /translation="MIKVLLVDDQSLIRQGLKALLELEQDLEIVGEAENGEIAIHFIE EFYPDVVLMDIRMPIMDGVAATREIQKRFPKTKVLVLTTFDDDEYVKTALQNGAMGYL LKDTPSEELAVAIRAVNKGYTQLGPGIVKKLFSQFSSVTPTKSPSPPESLGELTPREK EVLRLIATGASNREIAQQLYISEGTVKNHVTNILNRLDLRDRTQAAIFANSFLPYFND PS" gene complement(6008..7219) /locus_tag="DP116_17530" CDS complement(6008..7219) /locus_tag="DP116_17530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197501.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_17530" /translation="MSRSIQFHNHPFRFLLYLEWILLAFTAFTAALPSHRYRVQTGLT ELTICSLVLFGLMGLRLPTRNHISKVLYTTLEIFIIVLVGLFGGKGDRVFPFIDLILV TRSCLIFQLPGRLVVTGLSFLLFLLTLRRRFERMPLSLLAQERFWFFNLNFAIVFILA LLFVLLLMNAVLSERQSRDKLAMANEKLRQYALRIENQATLEERNRIAREIHDSLGHS LTALNLQLETGLKLWNSNPTKAQTFLGRAKELGSKALQDVRQSVSAMRSHPLQEQSLE QAIAGLAENVQRSTGVTPICQIDLSHPIPVEVSTAVYRIVQESFTNICKYAQATEVKL EITTTKTSLQLKVEDNGIGFDLTQNTTGFGLQSMRDRTLALNGHFDMNSAPGSGCTMT AYIPLSKVTTN" gene 7648..8091 /locus_tag="DP116_17535" CDS 7648..8091 /locus_tag="DP116_17535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein" /protein_id="PRJNA477356:DP116_17535" /translation="MKLKNLSLICGAIALSLTTASFAVKAEANSSLPLVVAQSQEKEG SFQRLGLTSDQKAKIKEIRTNTRTEVDKILTEQQREQLKTARQNRQGKGGFAALNLSD DQKNQLKQVMQSQKTQIEAVLTPEQKQQLQKYRQEKGARRQQPNM" gene 8759..9280 /locus_tag="DP116_17540" CDS 8759..9280 /locus_tag="DP116_17540" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17540" /translation="MPRPVTRYSTYHRPKVFKLLILALTTLGVSFLTPSAAEAQSNST SSPGVGNILNYPYGTRVHQNGVINTPDGSTISPATTINNGNGSTTYYYQNGTRVNINT NRVTPNGAVLTPGSLNGGLNRVPENPNRGLLLTPANPNGELNRGLENPNRGFLLTPAN PTRLWQKPSFETR" gene 9299..10102 /locus_tag="DP116_17545" CDS 9299..10102 /locus_tag="DP116_17545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA methyltransferase" /protein_id="PRJNA477356:DP116_17545" /translation="MALNQVRVILVEPAGPINVGSIARVMKNFGLNHLVLVNPQCKPL ATEALQMAVHARDILESAVSVTTLPEALQGCTRAIATTARVRHWDSPLENPSTALPWL LDQPQQPAAIIFGREDRGLSNEELNYAQRFIRIPTSSNYPSLNLATAVAICCYELAKS DTENQEDTLRENTAITSKKLSAPFAVPIPASVHESAPLNILEEYYQQLESLLLKIGYL YPHTAASRMETFRQMYNRAQLQTKEVAMLRGILRQVEWALENRNNHQGF" gene 10365..11957 /locus_tag="DP116_17550" CDS 10365..11957 /locus_tag="DP116_17550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878283.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine hydrolase" /protein_id="PRJNA477356:DP116_17550" /translation="MPTRTVIKTVSRRPKTSRRPVKSKVQKQGQNKVEATKQQQAPYS RTMPTRIKSIPPLIPVTSPIKRSGVPPTPAKPTAAAKGRIPPYNPKKVQSKNVQMRKQ PLPRKIGASRQKKRLKPMAKTFLYALRLLIVGVGVGAIVGTALSVLDPATRIATSSGR SSDTTIGQAQPQFTQNPSEAASGLFLTQEISSLKTIVQNLAAANPNLTPGIFLVDLDN GNYVDVNASSSFSAASTIKIPILIAFFQDVDAGKISLDETLTMTKRMVVGGSGDMQYK PAGTQFRIMEVATKMITVSDNTATNMLIARLGGIETLNQRFRNWGLTTTTISNPLPDL QGTNTTSPKELGKLMGMVNKGNLVSVASRDRILDIMRRTVRNQLLPSGLGPGATIAHK TGDIGTTLADAGLIDMPTGKRYVLAVMVQRPNNDPGAEKLISSISRAAYQQFSPTAPI PPSSGSTIPTTGYQSPVMSQPLPNGMGSTMPTTGYQPPVMSQPLPNGMGSTIPPNGYQ PPVQLPVQPPVMNPQYYYPYQR" gene 11971..12342 /gene="psb28" /locus_tag="DP116_17555" CDS 11971..12342 /gene="psb28" /locus_tag="DP116_17555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein Psb28" /protein_id="PRJNA477356:DP116_17555" /translation="MTSITPSIQFFSGIREELSNVSLRRNLTSGKRIIVMIFARIKAL EGFNSFTKQPLNSMLLTDEEGEISVTPSSTQFIFGGAEGDELQRVECKFEVEQQDHWE RFMRFMNRYAEANDMVYGESQ" gene complement(12531..13652) /gene="dprA" /locus_tag="DP116_17560" CDS complement(12531..13652) /gene="dprA" /locus_tag="DP116_17560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-protecting protein DprA" /protein_id="PRJNA477356:DP116_17560" /translation="MGQERAYWLAWSQISGVGPVLLRRLQQHFGTLAAAWDAKPAQLK EVEGFGFQTLQKVVQQRSRLHPEQFLQQHQEQNPSFWTPADADYPRLLLEIPSPPPIV YYRGEIDLQENLGQKQLVAIVGTRQPSEYGMRWTRQISTALAKNGFTVVSGLAEGIDT ESHAATVKAGGRTIAVLGTGVDVVYPSKNVELYKQILTAGLVVSEYPAKTPPDRTHFP RRNRIIAGLSRAVLVMEAPIKSGALITASYANDFGRDIYVLPGRVDDHPSQGCLKLLS QGATPILKELDELLKMLGAIPQLDSVEASPSPQQLTLPDLPAELQRVMDAIASEALPF DFIVQQTGMATGEVSSALLQLELMGLVSQLPGMRYQRCL" gene 13849..14121 /locus_tag="DP116_17565" CDS 13849..14121 /locus_tag="DP116_17565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016514663.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="BrnT family toxin" /protein_id="PRJNA477356:DP116_17565" /translation="MKFEWDDNKAAKNLSKHGVSFEEAKTVFDDPLYVDFYDPDHSDE ENRYLIVGQSNRGRLLIVSYTQRGDSIRLISAREVTRAEREAYEEG" gene 14108..14374 /locus_tag="DP116_17570" CDS 14108..14374 /locus_tag="DP116_17570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006508356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17570" /translation="MKKGKPEMEDELRSEYDLKSLRVRRLGSGRKSFGPITVRLEPDV AEMFPNADAVNEALRFLIRVMQEKQSPASRLELNTSLEQTDERP" gene complement(14664..>16721) /locus_tag="DP116_17575" CDS complement(14664..>16721) /locus_tag="DP116_17575" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_17575" /translation="GGGDIVVNANNFTATNGGRLTAGTEGVGNAGDITVNVNNFNISG VGQRGNAAGVSNQTVDGASGNAGNIFINSKSFNGSSGAGVSNQVLAESQGDGGNINIT SGSFSLSDTASIDASTYGEGDAGNVLVRASGSVELVNAKIFSNVERGGVGNGGNIDIK AATLSLKDSAQLQAIVRGESETQLAGRGDAGDVTVDVTGPVTIVGVKDGSRSAIFSSV GTRATGNGGNITISSGSFSLSDGAELSASTFGQGNAGNVSVRASDLVSLVNADIFSNV EAGGVGKGGNIDIKAATLSLTDSAQLQTLVREASGNQLAGNGNAGNVTVDVTGSVTIA GVKNEFRSGIRSRVNTGATGNGGNITITSGSFSLTDGAQLNASTLGQGNAGNVSVRAS DLVSLVNADIFSNVEAGGVGKGGNIDIKAATLSLTDSAQLQTLVREASGNQLAGNGNA GNVTVDVTGSVTIAGVKNEFRSGIRSRVNTGATGNGGNITITSGSFSLTDGAQLTAST FGRGDAGNVSVRTSGSVELVNGDIFSTLGSTGVGKGGNMDINAASVSLRDGAELSAST FGQGNAGNVSVRANDSVELANNGYIFSTVGSTAVGNGGNININAPIVSLKDGAQLATV TFGQGDAGDVIINANQRVILKGSTQSPDGVFIPTAVLAGVPPVEASGVVSPMRNCRKG SPA" BASE COUNT 5038 a 3700 c 3332 g 4651 t ORIGIN 1 ctgataactg ataactgata actggtaact gataactgat aactgataac tggtaactgg 61 caactgctat aattctgaaa aatcacgtca acgacctaca ccagttttaa acagactgac 121 tgatgcttgt aattgctgcg ctacttctac agtttgttgt aaagacgttg acacaacacc 181 agatgaatta gcagtgcgtt ctgaagtttt ggtaatttcc ttcataaaaa gagcaacagc 241 ctgcgaagtg tcagcttgag acactgttgt ttgagaaatt gattgaacta agtcgtcaat 301 ttgatgagat acttttacaa tcgtctccat gctttgctta acgtctttga ctagctcagc 361 gctttctacc acctgagtcg ttccttgttc tatcgcctgg ataactgcat tagtttcaaa 421 ttggatattt tctgcaattt gttcaatttc ttgagttgct tcggctgact gtattgctaa 481 gcgaccaact tcgtgagcca taacaataaa gccctgatat tgttcacctg cacgcgtcgc 541 ttcaagacca gcattgatag cgagtaaatt ggtttgtaat gcaatttcgt taatcaaagt 601 cacaacttta gaaatttctt gggatgactc gccaaaatgt tttatccttt tggctgtttc 661 agtaattgtt ttttccaaat taaagatgct agatgcggta tgctccattg cttcacctgc 721 actcaaagca gtattggcag atgtacgagc cacttgtgct gcttgagaag cgttattagc 781 gacttcttga atggagatat tcatttgatt taccaactca agggtgcagc taatttcttc 841 agcctgttgg agtgcttcat ctgctaaatg acgaatcgca tcagagtttt cccccacaga 901 aacattgacc tgtatagccg cttgtttcac ttgaaggact aattgttgca agctttcgat 961 aatggtattg 
aaaaagttgg ctactattcc aatttcacca gttgtcattt gagctcgaac 1021 tctcaggtca cctttggcta cctctttaac gtcatcagta agttccataa gctgacgttg 1081 gaggaattct ttgctgtgcc tcaattcttc ctccgcctgc ttgcgtgagg taatgtcagt 1141 ttgcacccca ataaaattta tcacttgtcc cgtgacgtct cgtgctggag caatacttaa 1201 ttcacaccag aatggggtac cgtccttacg gtaattctta atcacaacat gacattcttg 1261 ctcttgtcgc aatgcgttgc gtaattgttc aatagtctgt ggattagtgt cagatccttg 1321 caaaaatcgg cagttgcgtc ccagaacttc ttgtggtgaa tatccggtga tggtttcaaa 1381 tgccggattg caaaagatga taggattatc ttcctgatgg gggtctgtta taacaatggc 1441 attgctagca gcggcgatcg cacggtcacg tacccgcagc acttcctgtg cagcattgag 1501 ttcatgcaaa agatttgctt gttctaaagc aaggcttact tgaatggcta attgcccaaa 1561 taaatcaatt tctaactgct gccaattccg agattcagaa cattggtgag cacacaataa 1621 acccaaaagc cgatgcccac ttaacagagg tgcaaccaaa atcgccttga tttgaaagcg 1681 ttcaagaatt tctcggtaag agtctgtaaa atttccctca gaaatgttgt ttatggctcg 1741 gacttgacca tctttatact tttgtttgta catttcaatg tcatctttgc caaacggagc 1801 aaagacagtt cttcctaaag tttttatcca accctttgct actgactcag caatgatggt 1861 accaccccag tcagcattga agcgatagat aataacgcgg tctgtgttca gtgctttgct 1921 gatttcacag accgctgtgt taaggacatc ctctaacttg agagaacgcc aaatacgcaa 1981 ggtaatgtcc atgaatagct gcgaacgttg agcttcttgc tgttcgagta atagtgtttg 2041 ggaaaccatt cgccgttcca caaaggaagt tagcagtgtg aaacttagga taatgagagt 2101 actaacgcca atacttacag ctaacgcagg gagggatact tgcatggttg cgactgtccc 2161 agtcactctg gttggtttaa aacaggcagc tgccatccct gtgtaatgca tcccggaaat 2221 tgcaattgcc atgacgaacg cactcaaaat cttgcgtcgc ctaccacttt tccccgtttg 2281 caaacgcagt tggaaagcaa tccacagcgc gatcatcgat gcaccgatgg cgatcgccac 2341 agaaagcata aacagtagtg gatcataccg agtcgctgct tccatccgca ttgctgccat 2401 gccaatgtag tgcatagacg caataccaat gcccattaac acaccaccag cctggaattg 2461 ctgcgtattt aagactgggc gactggcaag gaaaagcgca cctccagagg caacaatcgc 2521 aggtagcaca gaaagcacca cagtcaacgc gtcatagccc atcgatattg gtaaactgag 2581 ggcaagcatg gcaacaaagt gcattgacca gataccaatt cccatgacaa ctgcagcgcc 2641 aattaaccaa gctagtcttg 
cttttgcttt cgctactgtt atttgcccag ccaaatcaac 2701 agcagtgtac gaagcaagga cagctatgac aattgaaagg gcaacaagaa atggattgta 2761 agtactactc atcgccacat ctgcctgcaa catcttgtta acctcacttt ggctctttgt 2821 aaaatagcaa acaaaacatg agtattgtta tcactttcat caagtgcagc acactcatga 2881 attggaaaaa ttttgaccac taagttcggt agatgcaatc actcttttga aaatagcaga 2941 tgcaaatagt atctgtctcg cttgtgagac actactacga gttttccact acagtaatgt 3001 ttattttata attttatttt taataagaaa atattaaacc tgatttaact gtcaaataat 3061 ggtcgttgtt attacaacat acaacttgaa agaatttctg tctatagtag tgtttataac 3121 cagccaatag ctgcttgctt tctaaaagtc atttacacct gatgatttac caacttttga 3181 ataacaattt caatccgacg gttcctttgt cgtcctgcgg gattatccga actatcagat 3241 ttttgatttg gtgctatagg ttgggcttct ccataaccct cttttgatat caaagaaatg 3301 tctatgccac ctttttcact cagccacttt tgcactgctg ctacccattg ctctgacaat 3361 tttaggttat catctttaga ccccttagag tcagtatgtc ctaaaatttg taaccaagtg 3421 tgaggataat gattattgat tgcctgactg acttgacgca acattttctc tgcatctggg 3481 cgaatcgtat ctttattaga atcaaataag atatctgctg gcagggtgat aatagttaag 3541 tactgatttt cctgaactat aatttgctca gatataatct caggtaagtt gacttccgga 3601 tatgtatcct tagaaaactg aacaatggga gtttggattt ttgagaactc tagtgcaagt 3661 aattgtgctt ttaaaaactc tgtttgaggg gattcagttt ttgcggtagc acaatacaca 3721 aaattgttag cgacgattgt cagggcaata ggcgctaggt aagtggtgaa acggaaggaa 3781 tgcatttgtt cttttcataa ttttaccatt ttgggaaagc atacgtcaaa taaaaaacaa 3841 atgcttcttt cttaagttat aggcgctttt tgttactaag gatacttttt attctggaag 3901 aaataataac gaattacgaa tgatttctca gattagtacc acaaactttg cagaatttag 3961 catctgtatc atggaatgat aaaccacagc cagaacaaat tgtttctact tgattcgcag 4021 ttttgactaa tcgcttgatc aaatcaccta cttgccaagg aatgagagca atacctgtta 4081 aaatcattaa tactgtcagc aaacgaccta gttcagaaat tggagtgaca tcgccaaaac 4141 caacagtggt catagtaact atagaaaaat aaaaagcatc caaaaaagtc gcaaaatttt 4201 ctgagttaac aggatgctct acttgataaa ttaaacctga gtagataaaa ataatcgcaa 4261 ataacgtaaa taagattcgt gtagaaacta aactgtcttc tgtgctgaca ccccagaaaa 4321 atttattatc tataaaccga attaaacgta 
aaattctgaa ccatcgtaat agtcgaacaa 4381 agctgatatc aacgccacct agaaaatagg gtacaattgc tattaagtca atcaaagaat 4441 aaaaactaaa aatatactta atcttatttt cctcactcca aacacggagt aaatactcaa 4501 tcgaaaaaat aagtagtact accgtatcta ttgaatctaa aaagtttcgt aaattatcag 4561 atagattata ggtttgtgca acaaaaataa ctgatgatag taaaaccaga ccagcaatag 4621 ttaaattcac aaatttacct attggtgtct ctaagtcttt caagtaaaaa gctgtttttt 4681 ctctgctaag taacataatt tccagaagtc ccgtcgaaca aattcagcaa gaggaagaat 4741 ctacaattga ttgtgaatct aaagactagc ttagcaatag aaaaagtttt attcaggtct 4801 gttgactttt gaacgcgtta ccgtaggata tgctatattc tggaaaattc taattttaga 4861 ctcccaaggt aattatgaat caagaagatt ttatgcgttt ggcgttggaa gaagcaaaga 4921 aaggagacgc cccatatggt gctgtgattg tcaaagataa ccaagtggtt gcccaagctt 4981 ataatactgt gaagcgagac aatgaccctt ctgctcatgc agaaattaat gtgattcgca 5041 gtttaaccac tcaactacaa aacccttctt tagaaggtta tacgatatat actactggtg 5101 aaccttgtcc gatgtgtgca tctgcttgcg tttggactgg tttatcagaa attatatacg 5161 gtgcttctat tgaagattta atatcagtga atcaatctca aattaacata tcatctgaag 5221 aggtgattgt taagagtttt agaaaaatca aagtcacaag aggtgtttta agagaagagt 5281 gtatcaaatt atttcataaa aaatcaactt ttaactaggg tcattgaaat agggtaaaaa 5341 agaattcgca aaaattgcag cttgggtgcg atcgcgcaaa tctaaacgat tcaagatatt 5401 tgtgacatga ttcttcactg tcccttcaga aatataaagt tgttgtgcaa tttctcggtt 5461 actagcacct gtagcaatca atcgcaaaac ttctttttct ctaggagtta attcacctaa 5521 gctctctggt ggggatggtg atttggttgg tgttacacta gaaaattggc tgaaaagttt 5581 tttaactatc cctggtccta attgagtata tcctttgtta acagcacgaa tagcaacagc 5641 caactcttct gaaggtgtat cttttagtaa ataacccatt gctccatttt gtaaagctgt 5701 ttttacatac tcatcatcat caaaagttgt cagtactaaa actttagttt tgggaaaacg 5761 cttttgaatt tcccgagttg ctgcaacacc gtccataata ggcattctaa tatccatgag 5821 tacgacatct ggatagaatt cttcaataaa atgaattgca atttctccat tttctgcctc 5881 tccgacgatt tctaaatctt gttctaattc caataatgct tttaatcctt gacgaattaa 5941 actttgatca tctacaagta gaactttaat cagattagtc attagttatt gattgtcagt 6001 aagaaaatta atttgtggta actttagata agggaatata 
agctgtcatt gtgcaaccag 6061 aaccaggagc actattcata tcaaaatgac cattcagtgc caaagtgcga tcgcgcatac 6121 tttgaagtcc aaaaccagtg gtattttgtg ttaaatcaaa tcctatgcca ttgtcctcaa 6181 ccttcaactg caaactagtt tttgttgtgg ttatttctag tttaacttct gtagcttgtg 6241 catacttaca gatatttgtg aatgattctt ggacaatgcg gtaaacagct gtgctaactt 6301 caactggaat cgggtgagat aagtcaattt gacaaattgg tgtgacacca gttgaacgtt 6361 gaacattttc tgcgagtccg gcgatcgcct gttccaaaga ttgctcttgc aaaggatgag 6421 aacgcatcgc agacactgat tgacgcacat cttgtaatgc tttagaacct aactcttttg 6481 cccttcctaa aaaagtttgt gcttttgttg gattagaatt ccaaagcttc aacccagttt 6541 ctaattgtaa attcaaagct gtgagagagt gtcctaagga atcatgtatt tcacgagcaa 6601 tgcgattgcg ttcttcaaga gtggcttgat tttcaattcg caaagcatat tgacgcagtt 6661 tttcattagc catcgctagt ttatctcgac tttgtcgctc agataaaact gcattcatta 6721 atagtaaaac aaacaataaa gctaagataa aaactatagc aaagtttaaa ttaaaaaacc 6781 aaaaacgctc ttgtgctaat agtgaaagtg gcatccgttc aaaacgacgc cttagtgtta 6841 gcaaaaataa aagaaatgat aaacctgtga cgactaagcg acctggtaac tgaaaaatta 6901 aacaactgcg agttactaaa atcaggtcaa taaaaggaaa tactctatca cctttacctc 6961 caaaaagtcc aactaataca attataaata tttccagggt ggtgtaaagg actttactta 7021 tatggttacg agttggtaac ctcaaaccca ttaacccaaa aagcaccaga ctacaaattg 7081 tcagttcagt caagccagtt tgaactcgat atcggtgaga tggtagagct gctgtaaaag 7141 cagtaaacgc cagcaatatc cactccaaat atagtagaaa ccgaaaagga tgattgtgaa 7201 attgaatcga acggctcaca aaaatacact caagctcaac tagtttttat agtaaaacta 7261 taaatcaaca ctgttccgtt aaggtttttt gatgaaaaat ttaggtttgt agaaatgcga 7321 agttatacca aattctcaaa cctaaatttt tgagtattaa ttaatactag cgaaaagtgc 7381 aaaaaacata aaactatgta caagcgaaga gctacagttc attctgcatt atgacttttc 7441 ttaagtattg aaagcttgat acacatacat tttctgtagt tacggtgata tttttaggac 7501 acaatgcttc atgatttttt gatgaattta ctgattgtag accatgacta aagtcatggt 7561 ctattcatga ctttttcctc atgtgattct caaaattgaa ctcttatcat agttgtatca 7621 actaggagaa aaacgaaaca gacttaaatg aagcttaaga acttatcact catttgtgga 7681 gcgatcgccc tcagtttaac aacagcctcc ttcgccgtta aagcagaagc 
aaactcctct 7741 ttgcccttag ttgttgcaca atctcaggaa aaagaaggat catttcaacg tttaggacta 7801 acgagtgacc aaaaagccaa aataaaagaa atccgtacaa atacccgcac tgaagttgat 7861 aaaattctca ccgaacaaca acgagaacag ttaaaaaccg ctaggcaaaa ccgccaggga 7921 aaaggtgggt ttgcagcttt aaatctttct gatgaccaga aaaaccaact aaaacaagtg 7981 atgcagtcac agaaaacaca aattgaagcc gttctaaccc cagagcaaaa gcagcaactc 8041 caaaaatacc gccaggaaaa gggtgctcgt cgtcagcaac ccaatatgta gtttatcact 8101 ggtgttgttg atcacagcta tgtgagtttg cactagctcg gtagtctaaa aattaagtcc 8161 tattgagatc cccgactcct ttaaaagttg tcggggatct gattttttca agagtctcgc 8221 ctgcttaaga tttaaaatta agtaggtggt ttgaattaaa tataaaatgt agtgccaaga 8281 gttgcgcgtt gcggtgaatc cagcgctgca ggagggtctc ccgacctaag cgactggtga 8341 acccggaggg agccagtact gcagaagggt ttccctctgt aggtatctgg cgtcgggtta 8401 agcgcgttgt agcgacttcg gggcaggaca gcccgcgttg gtaataaagt agggagcatc 8461 ttgctcccac taccgctatc tgcgaattaa ttacgcctac ctacttagac tgataatctc 8521 gcgtgaaaac acaaagacgc tatattttcg agtgaagttg cttaacagtt ttctatcata 8581 ggtatagaag ccccactaag ttcttacttg gcaagcttcc ataatgcaac actcataaaa 8641 aaaagctttg aacacaagaa agatgataat ggtattttcg ataaataagc tgtcctctaa 8701 gttgttcagt aaagtctcaa ctgtagaggt aggtgaaata tctagggcta attgtgttgt 8761 gccgcgtcct gtaaccagat attctacata tcatcgccca aaggtcttca agctcttgat 8821 attagccctg acaacattag gagtcagttt tttaacacca tctgcggctg aggcacaaag 8881 taattccact tctagtcctg gtgtaggcaa cattctcaat tatccttatg ggacgcgcgt 8941 tcatcaaaat ggtgttatta acacaccaga tggcagtaca atttctcctg ccacaacaat 9001 caataatggc aatggctcta ccacttacta ttatcagaat gggacgcgcg ttaacatcaa 9061 caccaataga gttaccccta acggggctgt actcacaccc ggaagcttga atggaggatt 9121 gaaccgtgta ccagaaaacc caaacagagg acttttgctc acgccagcaa acccgaatgg 9181 agaattgaac cgtggactag aaaacccgaa cagaggattt ttgcttacgc cagcaaaccc 9241 aacaagatta tggcagaagc cttcttttga aaccaggtaa ataacacagg ggcagtaaat 9301 ggcattaaac caggtgagag ttatcctagt agaaccagca ggaccaatta atgtcgggtc 9361 gatcgcacgg gtgatgaaaa attttgggtt aaatcatcta gtactggtta atccccaatg 
9421 taaaccgctt gcaacagaag cgcttcaaat ggcggttcat gctagggata ttttagagtc 9481 agcagtatca gtgacgacgc taccagaagc actgcaagga tgtacgcggg cgatcgccac 9541 cacagcccgt gttcgtcact gggattcccc cctggaaaat ccctccacag cactaccttg 9601 gttactggat caaccacaac aaccagccgc gatcattttt ggtagggaag atcgaggact 9661 gagtaatgaa gaattaaatt atgctcagcg gtttattcgt attcctacca gttctaatta 9721 tccatcgttg aatttggcga ctgctgtggc tatttgctgt tatgagttag caaaaagtga 9781 cacagagaat caagaggaca cgcttagaga aaacacggcg attactagta aaaaattatc 9841 tgcacctttt gctgtgccca tccccgcatc tgttcacgag tctgcacctt tgaacatctt 9901 ggaagaatac taccaacagt tagaatcact actactcaag attggatatc tttatcctca 9961 tacagcagct agccgtatgg aaacatttcg gcaaatgtat aatcgtgctc aattacaaac 10021 taaagaagtt gcgatgctgc gaggtatttt acgacaggta gaatgggcgc tagagaaccg 10081 gaacaatcat caaggctttt aactcatcct aacttgtgta ctaactcatc gtcccaaaac 10141 ttgtcataat aacttaagct caattcaaat aactaactat tataagcaaa aattaatgta 10201 aataataagg tgccttagag gtaaattgaa agagtggcta caaagtccgc acacaaaaga 10261 catttcaatt cccatcaggg attgctcgaa agcgacttta gccaataaag taaaaccttt 10321 ctagatgtgt tctttatttt acacttttat agctacccta tcttatgcca acgagaaccg 10381 ttataaaaac agtctcacgg cgtccaaaaa ccagccgccg tccggttaaa agcaaagttc 10441 aaaagcaggg gcaaaataaa gtcgaagcga caaagcagca gcaagccccc tacagccgaa 10501 caatgcctac acgtataaaa tcgattcccc ccttaatccc tgtgacttcg ccgataaagc 10561 gatcaggagt accaccaaca ccagccaaac caacagctgc tgccaaagga aggattccac 10621 catacaatcc aaagaaagta cagtcaaaaa atgtgcaaat gcggaagcag ccgttaccaa 10681 gaaaaatcgg tgcatctcga caaaagaagc gcttaaagcc gatggcaaaa acatttttgt 10741 atgccttacg gttgttaatt gtaggagttg gtgtcggtgc aatcgtaggt acggcgttat 10801 cagtgttaga tccagcaact cgcatcgcca catccagtgg aagatcatct gatacaacta 10861 tcgggcaggc acagccacag tttacccaaa atccctcaga agctgcttca gggttattcc 10921 tgactcagga aatttcttct ttaaaaacta tagtacaaaa tttggcagcc gcaaacccta 10981 atctcacacc aggaattttc ttggtagatt tagacaatgg caattatgta gatgtgaatg 11041 cctcctccag tttttctgct gctagcacga ttaagattcc gattttgatt 
gcctttttcc 11101 aagatgtaga tgctggcaaa attagcctgg atgaaacact caccatgacc aagcgaatgg 11161 ttgttggtgg ttctggggat atgcagtata aaccagccgg aactcagttc agaatcatgg 11221 aagtggcgac taagatgata acagtcagcg acaacacagc aacaaacatg ctgattgctc 11281 gcttaggtgg tatcgagacg ctgaatcagc gtttccgcaa ttggggtttg acaacaacta 11341 caattagtaa tcccctccct gatttgcaag ggacaaacac cacaagtccc aaggaattag 11401 ggaagctgat ggggatggtg aacaagggaa atttggtgag tgtggcatcg cgcgatcgca 11461 tactagatat tatgcgtcgc actgtgagaa atcagctcct gcctagtggt ttaggaccag 11521 gcgcaacaat tgcccataaa acgggtgata ttgggacaac ccttgcagat gcaggtttaa 11581 ttgatatgcc cactggcaaa cgttacgtac tcgctgttat ggtacaacgt cctaataacg 11641 atcccggcgc cgaaaaactc attagctcaa tttctcgggc cgcttatcaa caatttagcc 11701 caactgctcc catacctcct agttcaggaa gtacaatccc tacaactggt tatcaatccc 11761 cagtcatgag tcagcctcta cccaacggta tgggaagcac aatgcccaca actggttatc 11821 aacccccagt catgagtcag cctctaccca acggcatggg aagcactata ccccccaatg 11881 gttatcagcc tcccgttcag cttcccgttc agcctccggt gatgaatccg cagtattatt 11941 atccttacca gcgataaatt ttcctttttc atgacatcta tcacaccctc aatccaattt 12001 ttttctggca ttcgtgaaga actcagcaat gttagcttgc gacgtaatct cacttctggc 12061 aagcgtatta ttgtaatgat ttttgcgcga atcaaagcgt tggaaggatt taatagcttt 12121 acaaaacaac ctttaaattc catgctttta acagacgaag aaggtgaaat cagcgtcact 12181 ccatcttcca cacaattcat ttttggtggt gcagaaggtg atgagttgca gcgcgtagaa 12241 tgtaaatttg aagtagagca acaggaccac tgggaacgat tcatgagatt tatgaaccgt 12301 tatgctgaag cgaatgatat ggtatacgga gaatcgcaat agttgtgatg gcagggaggg 12361 aacagggaac ggggaacagg gaacagggaa cagggaacag ggaacaggct tgaaagtctc 12421 ctggtgtccg tagtatgaga ttacgtatag taattgactc ctctctctgt gttctctgcg 12481 cctctgcggt taaaaaatag gtattcttca ccacaccgaa aggagtaatt ttataaacat 12541 cgctgatacc gcatccccgg taattgcgaa accaaaccca tcaactccaa ttgcaacaaa 12601 gcactcgaaa cctcacccgt agccatgccc gtttgttgaa caataaaatc aaagggtaaa 12661 gcttctgaag caattgcatc cataactcgt tggagttctg ctggtaaatc tggcagagtt 12721 aactgctgcg gtgatggaga tgcttcaaca 
gaatcaagtt gtggtattgc tcccagcatt 12781 tttaagagtt cgtctaattc cttaagaatg ggagtcgccc cttggctgag tagctttaaa 12841 cacccttggg atgggtgatc atccactctt cctgggagga catagatatc tcgcccaaaa 12901 tcatttgcgt agcttgcagt aatcaaagca cctgatttta tcggtgcttc catcaccagc 12961 acagcacgac ttaaacctgc gataattctg ttgcgacggg gaaagtgagt gcgatctggt 13021 ggtgtctttg ctggatactc actcacaacc aaaccagcag tcaaaatctg cttgtacagt 13081 tccacatttt tagatggata gacaacatct acgcctgtac ctaaaactgc gatcgtgcgt 13141 cctccagctt tcacagtggc ggcgtgactt tcagtgtcaa ttccttctgc caaaccagaa 13201 acaacagtaa acccattttt cgccaaagct gtactaattt gacgagtcca tcgcatacca 13261 tactctgagg gttggcgtgt ccctacaatc gcgacgagtt gtttttgtcc cagattttct 13321 tgcaaatcta tttcaccacg ataatacaca atgggtggtg gactgggtat ttccagtagt 13381 aaccgaggat agtctgcatc tgcaggtgtc caaaaactcg ggttttgctc ttggtgttgc 13441 tgcaaaaatt gttcaggatg taaacgagaa cgttgttgca ccaccttttg cagcgtttga 13501 aaaccaaaac cttctacttc ttttaactgt gcaggtttcg cgtcccaagc tgctgccagt 13561 gtaccaaaat gctgctgcaa ccgtcgtaat aatactggac caactccaga aatttgcgac 13621 caagcgagcc aatatgcacg ttcttgtccc aattgcccat cctcaagtct actgggttga 13681 ttattcccag attggagaag caaagataca tgtacgcgat aaggtggagg cttgacgctt 13741 tgcggcatcg ccacactgtc atacacttat tgatatattt tattgttatg actcgccatg 13801 acaattacca gtttttaacc cttaaatgca ccatgctaca attactttat gaagtttgag 13861 tgggacgaca acaaagcggc aaaaaatctg tcaaagcacg gagtttcctt tgaagaagcc 13921 aaaacagttt tcgatgatcc actttatgtt gacttctacg atccagacca ctcagacgaa 13981 gaaaaccgct accttattgt tggacagtca aaccgaggac gcttgctaat tgtgtcttat 14041 acgcagagag gagattcaat tcgtctcatt agtgccaggg aagtaacacg agctgagcga 14101 gaagcctatg aagaagggta agccagaaat ggaagacgag cttcggtcag aatatgactt 14161 aaagagttta agggtcagaa ggctaggttc cggacgaaag agctttggcc caataactgt 14221 tcgtttagaa cctgatgtcg cagaaatgtt tcccaatgct gatgcggtca atgaagcttt 14281 acggtttttg atcagagtaa tgcaggaaaa acaatctcct gcatccagac tggagcttaa 14341 cacttcgttg gagcagacgg atgaacgccc ctagagcgcc agttcagtaa aaatacttga 14401 cagaaggtac 
agtttttgca tagcttgaag aaatgagaac gatgagttaa gttttcttca 14461 gcctttccaa tttaccgaaa aattgggttt aaagccccgt ccttctagga cggcttttaa 14521 attcgcgaac aaaaacgccc ccgttatgcg ataataggat atagcggtaa agcgcacata 14581 aaaaagacga agtaggtcga gcggacaact cctgatccgg aaccctggta gtataaatcc 14641 tagatcttgc accgccactt ggtttatgcc ggggaaccct ttcggcagtt cctcatgggg 14701 gaaaccacgc cagatgcttc aacgggggga acccccgcga ggacagcagt ggggatgaag 14761 acaccatcgg gactttgagt gctgcccttg aggataacac gttggttagc attaatgatg 14821 acatcacctg catccccttg cccaaatgtg acggtggcca gttgcgcgcc atctttgagg 14881 gatactattg gagcgttgat gttgatgttg ccaccattcc ccactgctgt tgatcccaca 14941 gtgctgaaaa tgtacccatt attggcaagc tcaacggaat cgttagctcg cactgacaca 15001 ttgcctgcat tcccttgtcc aaacgtgctg gcactcagtt cagcaccatc cctaagggaa 15061 acagaagcag cgttaatgtc catattgcca cctttgccca ctcctgttga tcccagagtg 15121 ctgaagatat ccccgttgac aagctcaact gaaccagacg ttcgcaccga cacattgcct 15181 gcatcccctc gtccaaatgt gctggcagtc agttgagcac catcggttaa agagaaggaa 15241 ccacttgtaa tagtaatgtt ccccccattg cctgtcgccc ccgtgttgac tcgactgcga 15301 atcccactgc gaaattcatt cttcacccca gcaatcgtca cagaacctgt gacatcaaca 15361 gtgacattgc ctgcattccc gtttccagct aactggttgc cagatgcttc acgaaccaag 15421 gtttgcaatt gagcgctatc agtgagggac agtgttgccg ccttgatatc gatattgcct 15481 cctttgccca cacctcctgc ttccacattg ctgaagatgt ccgcattaac aagagaaact 15541 aaatcagagg ctcgcaccga cacattgcct gcatttcctt gtcctaaggt actggcattt 15601 agttgagcgc catcggttaa agagaaggaa ccacttgtaa tagtaatgtt ccccccattg 15661 cctgtcgccc ccgtgttgac tcgactgcga atcccactgc gaaattcatt cttcacccca 15721 gcaatcgtca cagaacctgt gacatcaaca gtgacattgc ctgcattccc gtttccagct 15781 aactggttgc cagatgcttc acgaaccaag gtttgcaatt gagcgctatc agtgagggac 15841 agtgttgccg ccttgatatc gatattgcct cctttgccca cacctcctgc ttccacattg 15901 ctgaagatgt ccgcattgac aagtgaaact aaatccgaag cccgcactga cacattacct 15961 gcatttcctt gcccaaatgt gctggcactc aattcagcac catcgcttaa agagaacgaa 16021 ccagaggaaa tcgtaatatt tcccccattg cccgtcgctc tcgttcccac cgagctgaaa 
16081 atcgcactac gcgatccatc cttgacccca acaattgtta ctggtccagt gacatcaaca 16141 gtgacatcgc ctgcatctcc ccttccggct agctgagtct cagattctcc acgaactata 16201 gcttgcagtt gagcactatc tttaagcgac agagttgctg ccttaatgtc gatattgcca 16261 ccattgccca cacctcctct ttccacattg ctgaagattt tcgcattgac aagctcaact 16321 gaaccagacg cacgcaccaa tacattacct gcatcccctt ctccataagt gctagcatct 16381 attgaagcag tatcactaag tgagaaagag ccactcgtaa tattgatatt gccaccatca 16441 ccttgtgatt ctgctagtac ttgattgctc acacctgcac ccgatgaacc attaaaggat 16501 ttgctgttaa taaaaatatt acctgcatta ccagaagcac catcaacagt ctggttgctc 16561 acaccagcag cattacctct ctgcccaaca ccagaaatat taaaattatt tacattaaca 16621 gtaatatcac ctgcatttcc cactccttca gttcctgctg tcaaccgtcc accattggtt 16681 gcagtaaaat tatttgcatt aacgacaata tctcctcctc c // LOCUS NODE_2011_length_16612_cov_4.73612416612 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16612) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16612) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16612 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(150..539) /locus_tag="DP116_17580" CDS complement(150..539) /locus_tag="DP116_17580" /inference="COORDINATES: protein motif:HMM:PF11218.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17580" /translation="MSVRFPIIAATLMATTVSLGTLLNASPASAQDTITCESRGNERN TCQVDRRSEVRFVRQLSDASCRGNWGYNRNRIWVRNGCRAEFAVSNRTDDRYDRNDRN DRYDRNDRYDRNDRYDRNDRYYDGYRR" gene complement(816..1172) /locus_tag="DP116_17585" CDS complement(816..1172) /locus_tag="DP116_17585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007353007.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_17585" /translation="MATVDEYRQHIQRLLSEHATLVWDSRIRAELIFDQERVGAASPK ETRYQLVYVGWRDSQRVYGVVLHIDIIDGKIWVQQDGTEVGIANKLVEVGVPKHDIVL GIDPPKMRQYTEFAVG" gene complement(1160..1333) /locus_tag="DP116_17590" CDS complement(1160..1333) /locus_tag="DP116_17590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012595929.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17590" /translation="MEGVKIIIGDRLLYLAVPNNVYEQFFATSFIQSLVEQHQLYLLI YDIDQEVIKRWQP" gene 1348..1551 /locus_tag="DP116_17595" CDS 1348..1551 /locus_tag="DP116_17595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF433 domain-containing protein" /protein_id="PRJNA477356:DP116_17595" /translation="MTDQELLSRITVNPKVMVGKPVIRGTRLTVEYILNLLAHGATIT EILEEYEGLVETDIRACFLFAKR" gene 1622..1978 /locus_tag="DP116_17600" CDS 1622..1978 /locus_tag="DP116_17600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017711429.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17600" /translation="MRFLVDESTGVIVARWLREQGYEVFSVYEEARGINDDDIIQKAF AENWILITNDKDFGEKVYREQRPHRGVILLRLDNEKATNKIATLQQLLEMYPPEQLFN NFIVVTERQIRFSRSQ" gene complement(2006..2524) /locus_tag="DP116_17605" CDS complement(2006..2524) /locus_tag="DP116_17605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455849.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thermonuclease family protein" /protein_id="PRJNA477356:DP116_17605" /translation="MQKILILLCLLLLVACQPQNKPEESTQVQVKVTQVVSGQTLKVM GLGNQPSLISQVRLLGIDAPDLQQRPWGDAAKERLEAVLLEHPVILEFDVQVKDQFGR SLAYAWKDGVLLNEQLVKEGYALFVGRSPNHKYDQRLERAQQWARLMGLGIWDPEKPM RLTPAEFRRQYR" gene complement(2600..2968) /locus_tag="DP116_17610" CDS complement(2600..2968) /locus_tag="DP116_17610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746304.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_17610" /translation="MSEKYTVKVRDRSKGMTYTLQVPDDRYILHTGEKQGVELPFSCR NGACTTCAVRVLSGEIYQPEAIGLSPELQKKGYALLCVSYARSDLEVETQDEDEVYEL QFGRFFARGKIRKGLPLDED" gene 3057..3143 /locus_tag="DP116_17615" tRNA 3057..3143 /locus_tag="DP116_17615" /product="tRNA-Ser" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:3091..3093,aa:Ser,seq:gga) gene complement(3685..3984) /locus_tag="DP116_17620" CDS complement(3685..3984) /locus_tag="DP116_17620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315118.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division topological specificity factor MinE" /protein_id="PRJNA477356:DP116_17620" /translation="MILEILEKFFVRSVDNSRTQVKRRLQLVIAHDRADISPDVLEKM RQEILEIVCRYVEVETDGLEFSLESNQRTTALIANMPIRRVKENQEETSELDNSN" gene complement(4012..4818) /gene="minD" /locus_tag="DP116_17625" CDS complement(4012..4818) /gene="minD" /locus_tag="DP116_17625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997606.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="septum site-determining protein MinD" /protein_id="PRJNA477356:DP116_17625" /translation="MTRIIVTTSGKGGVGKTTITANLGMALAKMGRQVALVDADFGLR NLDLLLGLENRIVYTAVEVLARECRLEQALVKDKRQPNLVLLPAAQNRTKDAVTPDQM KLLVNALAQKYQYVLVDSPAGIEMGFKNAIAPAKEALIITTPEIASVRAADRVVGLLE AQGIKRIHLIINRIRPAMVRANDMMSVQDVQELLAIPLIGVVPDDERVIVSTNRGEPL VLAENPSLAATAFDNIVRRLEGETVEFLELDSTQDNIFSRLRKLFWTKSI" gene complement(4871..5899) /locus_tag="DP116_17630" CDS complement(4871..5899) /locus_tag="DP116_17630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878070.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="septum site-determining protein MinC" /protein_id="PRJNA477356:DP116_17630" /translation="MESNSFEPNIEQTNSSDSNVESNSFEPNIEQTNSSDSNVEANSL KPDGESHPISVTQSNPTQSNVESNSVIPDVEFNPALINLELTEEFPYSTATVNPNIQV QIKSQEGKLLLILPTESQLPASEYTWTEIWQQMKLRLLACERSFSPNTAVNLIAQDRL LDNRQLQELAESLNQFKIQLKSVSTSRRQTAIAACTMGYSVEQLQRQTKLGAESKPDT PPLAEPLYLEKTVRSGEEIRHPGHVILLGDLNPGGIVVANGDILVWGRLRGIAHAGAL GNRDCLIMSLQMEPTQLRIADAVARAPEKSPLQFYPEVAYITSQGIRIARVSDFSRIL LSRMNQET" gene complement(6250..7632) /locus_tag="DP116_17635" CDS complement(6250..7632) /locus_tag="DP116_17635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="four-carbon acid sugar kinase family protein" /protein_id="PRJNA477356:DP116_17635" /translation="MTNKPKIIVLDDDPTGSQTVHSCLLLMRWDVETLCLGLQDDSPI FFVLTNTRALPPEEAASITREVCHNLKQAISRVRENAENAGVKSNITSQTDKTEGFIV VSRSDSTLRGHYPIETDVIVSELGPFDAHFLVPAFFEGGRITRDSVHYLITEGVPTPV HETEFARDSVFGYHHSYLPKYVEEKTQGGISAESVERFLLADIRAGSLERLMQLTNNQ CAVVDGETQADLNQFAQDVLAAVSQGKRFLFRSAASILTAIAGLPPQPIAPENMSQYV RGGKPGIVIVGSHVKKTTQQLEVLLQQEETMGIEVDVARLVDDGANESATLLTEVLHH VRAAYDSLKTAVVYTSRKELTFKDVLTRLEFGTKVSSLLMDIVQGLPSDMGFLISKGG ITSNDVLSTGLGLTSARLLGQILPGCSMVTTPSDHPQFPNLPVVLFPGNVGDADALAI VAQRLSKNTG" gene complement(7717..8379) /locus_tag="DP116_17640" CDS complement(7717..8379) /locus_tag="DP116_17640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131159.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17640" /translation="MQPDLLSLEITKGELRRLIGLNPDDVLRPSIMRNSEKRFRFLIN EIVVALLLTLIIVGFVYAFLILPTIGSSMILGIVLLTSMPIAIIVGRWFWRRFTYPRT LRVLLDEVDKYHTLLIAIDIHDQQITSGNTESRIADREKVVAAMQLIREDFVRALKTE RILRDNKKLFTNNQESLVNNLTNLQALQASSQASEYAQLLNQSLQIAMSVQTEIRKLR EI" gene complement(8621..8800) /locus_tag="DP116_17645" CDS complement(8621..8800) /locus_tag="DP116_17645" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17645" /translation="MKFFENLTKLVFEPEIFVLPRRQGSSDTAMQSKFFGAPRRLLSH WKKKKKIGHLRLGRE" gene 8908..9093 /locus_tag="DP116_17650" CDS 8908..9093 /locus_tag="DP116_17650" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17650" /translation="MLYIALVFQLKPPPKEEKNQKTSHMDNSELDFSSQNQAKTKKPE YLQKIQENYYLERRFIL" gene complement(9194..9949) /locus_tag="DP116_17655" CDS complement(9194..9949) /locus_tag="DP116_17655" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17655" /translation="MSPALREGNPGAGDWRTRKGLGGFPHERLAYLLRRRCANAFGVR SAHTRRVQDLSLNKYTTMQPDLQSLYITQDDLKQIAGLSRRDLKADETLKYPRKARLL LLGAYTEQILFFGWGLIPIAYLIKWRFFRERQRNILRQIDDYNAVLKAIDINDQLEAA GNQGVSLTDREKVIDVLKTTRSNLICALKTERILRKNKEFIARNMELLESNVTAMQGI KVNHEAREYARRVDEALKIAQDVQVEMRKLEEE" gene complement(10085..11560) /locus_tag="DP116_17660" CDS complement(10085..11560) /locus_tag="DP116_17660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GTP-binding protein" /protein_id="PRJNA477356:DP116_17660" /translation="MTSTLPDPHRNDSPSTDENSLNWDEQLDSAIFTFEDIQAELNYK QARTALRNLVDKLDLTQEEKDGLEMEIGDLETMLLKLERMVVQIAAFGMVGRGKSSLL NALVGQPVFETGPLHGVTRNSQTANWTITEEAIGETERALRVTLPGSGQSQVELIDTP GLDEVDGETRAVLAQQIAKQADLILFVVAGDMTKVEHDALSQLREVGKPILLVFNKVD QFPEADRMAIYEKIRNERVRELLSPDEIVMAAASPLVRRMVHRPDGTRGVQLSAGKAQ VEELKLKILEILHREGKALVALNTMLYADNVNEQLVQRKLKIRETSANQLIWKATMTK AMAIALNPVMLVDVLAGAVIDIILILGLSKLYGIPMTETGAVKLLQRIALSMGGISAS ELLANLGLSSLKTLLGLSAPATGGASLGAYLSVALTQAGVAGVSCYGLGYVTKAYLAN GANWGPDGPKAVISKILSTLDEDSILNRIKDELRLKIIAKT" gene 11769..12494 /locus_tag="DP116_17665" CDS 11769..12494 /locus_tag="DP116_17665" /EC_number="5.3.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113309.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="triose-phosphate isomerase" /protein_id="PRJNA477356:DP116_17665" /translation="MRKIVIAGNWKMFKTQAESLEFLKGFLPSLDETPQDREVVLCVP FTDLNVLSKSLHGTRVQLGAQNIHWEESGAYTGEISGPMLQEIGMRYVIVGHSERRQY FGETDYTVNLRLKAAQRFGLTPILCVGETKQQRDALETESLIISQLEKDLVDIDQENL VIAYEPIWAIGTGDTCEAKEANRVIGLIRSQLSNPDVPIQYGGSVKPNNIDEIMAQSE IDGVLVGGASLEPASFARLVNFK" gene 12740..13606 /gene="folP" /locus_tag="DP116_17670" CDS 12740..13606 /gene="folP" /locus_tag="DP116_17670" /EC_number="2.5.1.15" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydropteroate synthase" /protein_id="PRJNA477356:DP116_17670" /translation="MTANLIIRERCFEWGQRTYLMGILNVTPDSFSDGGEFNTVAAAL AQAQAMVAAGADIIDVGGQSTRPGAEQITLAEELDRVLPVLHVLRKEIPVPISVDTTT AAVAKAAVEAGADIVNDISGATLDPEMLPTVARMNVPIILMHIRGNPQTMQQFTDYQD LMGEIYSFLAKQIAAATGVGIDERKIIIDPGIGFAKNYEQNLEILRHLPQLRQLKCPI LVGASRKSFIGRILNQPDPKARVWGTAAACCAAIFNGADILRVHDVQQMRDVSLVADA IFRQSSQVLPSD" gene complement(13603..14442) /locus_tag="DP116_17675" CDS complement(13603..14442) /locus_tag="DP116_17675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862086.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SPFH/Band 7/PHB domain protein" /protein_id="PRJNA477356:DP116_17675" /translation="MEPIIAIVLVLIGYALGSAKLINQGNEALVERLGQYHRKLKPGL NFIVPLLDQIVMEDTTREQVLDIKPQNVITRDNIYLEVDGVVYWRVRDIEKSFYEIDD LQQALTNLTTTTLREIIAQNTLEETNAARASMNSALLDQLNQTTAQWGVEITRVDIQS ITPPESVRKSMEEQRAAEINSRAAILEAEGQREAAIKKAQGTKTSMEIISNALRSNPE SKEILRYLVAQDYINASYRLGESQNAKVVFVDPGKGGEMMDLISEMTYQEGHSNNGEK ASN" assembly_gap 15041..15050 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(15298..15744) /locus_tag="DP116_17680" CDS complement(15298..15744) /locus_tag="DP116_17680" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17680" /translation="MSEDPITRDLEITPLEVTTLQLSLYQVIQKSFQILQAVQKALDI EPASHHNQVNELEPGNNLIELESEQEIISPQTDPQLYCEKHKTYTVEELAEEKSYLAN FQAENFKKRAENYNKFLESQSKFAKSKAIFETNQAKFSSIRQSKSF" gene complement(16017..16589) /locus_tag="DP116_17685" CDS complement(16017..16589) /locus_tag="DP116_17685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012900448.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17685" /translation="MPQQNYQQPAYPAQDASQTQAQQGYGQQPAFGNISQPIQPNQVQ GFPVSQQAPGQLPPQNNMQYGQAQASAPATQPGWTADYPNPNEANNMAAQVPMDTAHK KGGGVGSMLGKALGGVAKMAPTVAGAGMAGMTNMAVGNMMMNGGYGMPYGGGYGMPYG MGGMGGGYGMPYGGMGMGGMGMGMGGMGMM" BASE COUNT 4580 a 3701 c 3390 g 4931 t 10 others ORIGIN 1 tttccctccg tagggacctg gcgttggggt tccccccgtt gtagcacctg gcgtgagggg 61 atagggataa ggttcttggt tttttataag tgttcatccg gacatgatat gatgtttcag 121 ttgagtctca aaaatgcctg tagattcact tatctgcggt atccgtcata atatctgtca 181 tttctatcgt atctgtcatt tctatcgtat ctgtcatttc tatcgtatct gtcatttcta 241 tcatttctat cgtatctgtc gtctgtgcga ttgctaactg cgaactctgc tcgacaccca 301 tttcttaccc agatacggtt tctgttatag ccccaatttc ctctacaact agcatcagac 361 aattgcctga caaaccttac ttcgcttctt ctatctacct gacaagtatt cctttcatta 421 ccccggcttt cacaagttat tgtgtcttga gcagaagctg gtgaagcgtt aagaagcgta 481 cctaaactaa cggtggtagc cattaaggta gcagcgatta ttggaaaacg aacgctcata 541 ttaaatggtg aatctcttaa ataactaact atactaattg ttccaaattt gtaaaaagac 601 ttgcctgtat ctcaagtcat aaaatttagc gtcctaaaca gtttcagcaa atcagtgaac 661 agtaaacagc tacctacgca gtgaacagtg agtccagtgc tgcaggaggg tttcccgacc 721 taggcatctg gcgaacccga agggtgaaca aggggtggac gagtccgttt cctacctgat 781 aactgataac tggtaactga taactgattg ttgactcaac ctacagcaaa ttctgtgtat 841 tgacgcattt tgggtgggtc aatacctaag acaatatcgt gttttggcac tcctacttca 901 actagcttat tggcaatacc gacttctgtt ccatcctgct gtacccagat ttttccatct 961 atgatgtcga tatgaagcac aactccatag acgcgctgtg aatcacgcca gccgacgtat 1021 actaattgat agcgagtctc cttcggagag gctgcgccaa cacgctcttg atcaaaaatt 1081 agctctgctc gaatgcggct gtcccaaact aacgtagcgt gttcactcaa caatctttga 1141 atgtgctgac gatattcatc tacggttgcc atcgcttaat tacctcctga tcaatatcgt 1201 agatcagcaa atacagttga tgttgctcaa ctagtgattg aataaatgat gtggcgaaaa 1261 actgctcata cacattatta ggcactgcta aatacaatag gcgatcgcct atgataattt 1321 taactccctc aatttcacgc ttgaactatg acagaccaag aactactgag ccgtatcact 1381 gttaatccga 
aagtcatggt aggcaaacca gttattaggg gaacccgcct aactgttgag 1441 tatattttga atcttcttgc acatggtgca acaattacag aaattcttga agagtatgaa 1501 ggcttagtcg aaacagacat tcgagcctgt tttttatttg ccaagcgata gcttcgctgc 1561 tgccttcggc agatcgctag aaagtactag ctttatgcca cttgctgcgg agagggcata 1621 gatgcgcttt ttagtggacg aaagcacagg tgtaattgta gcacgctggt tacgcgagca 1681 aggttacgag gtgttttcgg tatatgagga agcgcgtgga ataaatgatg acgatattat 1741 tcagaaggct tttgcagaga actggatttt aatcaccaat gacaaggact ttggagaaaa 1801 ggtctaccga gaacagcgtc cccacagagg tgttattctt ctacgtcttg acaatgaaaa 1861 agccaccaac aaaattgcga ctttgcaaca gctgttggaa atgtatcctc ctgaacaact 1921 atttaataac tttatagtag taactgaaag gcagatccgc ttttcgagat cacagtaagc 1981 tgcctgctca gtctactacg aattactaac gatactgacg gcgaaattca gcaggtgtca 2041 gacgcatagg tttttctgga tcccaaattc ctaaccccat gagtctagcc cattgctggg 2101 cacgttctaa gcgttggtca tatttgtggt tgggcgatcg ccccacaaac aaagcgtacc 2161 cttctttcac caactgttca ttcaacaaaa ctccatcttt ccacgcataa gccaaactcc 2221 gcccaaattg gtcttttact tgcacatcaa actccaggat cacagggtgt tccagaagca 2281 ctgcttctag gcgttccttt gctgcatctc cccaaggacg ttgctgcaaa tcgggtgcat 2341 cgattcccag caagcgtact tgagaaatca aacttggttg attccccaaa cccattacct 2401 tcaaagtttg tccactgacg acttgtgtga ctttgacttg cacctgtgtg ctttcttcag 2461 gtttgttttg aggttgacac gctacaagta gtagtagaca aagcagaatg aggatttttt 2521 gcacctttgc accaatgctt ttgaaacaaa atttaactaa ttcggcggaa ttcccgcaac 2581 caatgtggtg gggagaatgt caatcttcat ctaagggaag acctttgcga attttccccc 2641 tagcaaaaaa gcgtccaaac tggagttcat agacttcatc ttcgtcttgt gtctccacct 2701 ccaaatcaga acgagcataa ctaacacaca atagggcgta accttttttc tgcaactctg 2761 gtgacagtcc aattgcttcc ggttggtaaa tttctcctga gagtacccgg acagcgcaag 2821 tggtgcaagc cccattccga caagaaaacg gcagttctac cccttgtttt tcacctgtat 2881 gcaggatgta gcggtcatca gggacttgca aggtgtatgt catgcctttg gagcgatcgc 2941 gaactttaac ggtgtatttc tcggacattt gaacttttga tttcaatttg ggactttaaa 3001 ctaaatttag ttgcaatttg atttatttca ttatataatt agaactcgtg acgcctggag 3061 agatggccga gtggttgaag 
gcgcagcact ggaaatgctg tttgggggca acctcaacga 3121 gggttcgaat ccctctctct ccgtttcata atatctctaa aattcagtcc agtcatgaag 3181 atgtgcaacg gcaattgaat caatttattc ttagtatgtt tgtttacgtg cctgctgtac 3241 taataaaaaa atattgctaa tttcatctgt tggtatgcag aaaatttttg cgggaaaaca 3301 gcacttttag ttattatgcc tgaggttgct tttgatggtt tgagttaata ttaataggta 3361 tcaccagagg aatctcttgg ggattcttaa acaagaatac agaattcatc atccagaatt 3421 cgcaagcgtg gaaatcagat gaggattcaa aggatcagct gattctagac tttgcttaag 3481 ggttcatcca acgggaaaaa ccccaagaca gatatagtta agggaacacg caaaacgttg 3541 gtgcagtagt gattcactct acagccgaca ctacggaaac tcgtagaacc gcactgctta 3601 accagccgca cagaatcgat tctggcacag ctaggacaga attctttgtt gatgatgaca 3661 caaaaagcgt caaacgactc agacttaatt gctattgtct aattcagatg tttcttcttg 3721 attttctttg actcgacgaa taggcatatt agcaattaaa gctgttgtcc gttgattgct 3781 ttccagggaa aactctaagc cgtccgtttc aacttctacg taacgacaaa cgatttctaa 3841 gatttcttgc cgcatctttt ctagtacatc agggctgatg tcagcacggt catgagctat 3901 caccaattgc aggcggcgtt taacttgagt gcgactgttg tcaacactac gaacaaaaaa 3961 cttttctaat atttcaagaa tcattggagc taagtcagca tacggaacag attaaatact 4021 ttttgtccaa aacaactttc tcaaacgaga aaatatgttg tcctgagttg agtcaagctc 4081 aagaaattca accgtttctc cttctaatct ccgaacaatg ttgtcgaaag ctgtagcagc 4141 taaagaaggg ttttccgcta atactaaagg ttcaccacga ttggtagata caataactcg 4201 ctcatcatca ggaacaactc caatcaaggg aattgcaaga agttcctgaa catcttgaac 4261 tgacatcata tcatttgccc gtaccattgc gggtctgatg cggttgataa ttaagtgaat 4321 acgtttgatg ccttgtgctt ctagtaaccc aacgacgcga tcagcagcac gcactgaggc 4381 aatttctgga gtggtgataa tcagggcttc ttttgcaggg gcgatcgcgt ttttgaaccc 4441 catttcaatt cctgctggac tatcaaccaa aacatattga tacttctgtg ccagtgcatt 4501 caccaataac ttcatctggt caggagtcac ggcatctttg gtgcgatttt gtgctgcagg 4561 taacagaacg aggttaggtt gtcgcttatc cttcaccaaa gcttgttcta agcgacactc 4621 tcttgcgaga acttcaaccg ctgtataaac tatccggttt tctagcccta gcagcaaatc 4681 caaatttctc agaccaaaat ccgcatcgac caaagccact tgacgcccca ttttggctaa 4741 agccattccc agatttgccg taatcgtggt 
ttttcccacc cctcctttgc cggaggttgt 4801 aacaataatg cgagtcatga tagaaacgag ctcgattagt gattagaaaa gatacaacaa 4861 taaaaaagat ttatgtttct tgattcatcc tacttaatag gattctagaa aaatcgctca 4921 ccctagcaat acggattcct tgggacgtaa tatacgccac ctcaggataa aactgcaatg 4981 gcgatttttc tggtgctcta gcaacagcat ctgcaattcg cagctgggtt ggttccatct 5041 gcaaactcat aatcagacaa tcacggtttc caagggcacc cgcatgagca attccacgta 5101 gacgacccca aacaagaata tctccatttg caactacaat accaccagga ttcaagtccc 5161 ctaagaggat aacatgaccg gggtgacgga tttcttctcc agaacgcacc gtcttttcta 5221 aatagagagg ttctgcaagg ggaggtgtat caggctttga ctcagcgcca agttttgttt 5281 gtcgttgtag ttgttctaca gagtaaccca ttgtacaggc ggcgatcgca gtttgccgac 5341 gactagtaga aactgatttt agctgaattt taaattgatt caaactttcc gccagttctt 5401 ggagttgtct gttatccaac aagcgatctt gcgcgatcag attcacagct gtattaggtg 5461 aaaaagaacg ctcgcaagct aaaagtcgca gcttcatttg ttgccaaatc tcagtccaag 5521 tgtattctga ggctggcaat tgtgactctg ttggtagaat taataaaagt tttccctcct 5581 gactttttat ttggacttga atattaggat tcaccgtagc tgtcgaatac gggaattctt 5641 cagtcaactc taggttgatt aaggctggat taaattctac atcaggaatg acggaatttg 5701 actcaacatt ggattgagtt ggatttgact gtgtaaccga aataggatgt gactcaccat 5761 caggtttgag agaatttgcc tccacattgg aatcagaaga atttgtctgt tctatattag 5821 gctcgaaaga atttgactcc acgttggaat cggaagaatt tgtctgttct atattaggct 5881 caaaagaatt tgactccacg ttggaatcag aagaatttgt ctgttctata ttcagctcga 5941 aagaatttga ctccacgttg gaatgattgg aatttgactc gatattataa tctggaatat 6001 gtgattttaa gtaaagtaga acagaattta cttctgcttc ggggaggact gggttttcct 6061 cagcttgcga aatcgactct atatcagaaa gagcagagat ttcctctact tcacgaaagg 6121 ctttttctga cttcttatta gggatagtag agtcagaagt catgtaatgg ttgccagaat 6181 acgggataga gcgatcatcc atactaaata gttacaattt gcaacagtgt tattaaaaat 6241 taacactcct caccctgtat ttttacttag tctttgagcg actattgcca atgcatcagc 6301 atcgccgaca tttccaggaa acagcacgac tggcaaatta ggaaactgag gatggtctga 6361 tggagttgtc accattgaac aaccaggtaa aatttgacca agtaaccgcg ccgaagtcaa 6421 ccctaaacct gtacttaaga catcgtttga agtaatgcca 
cctttactga ttaaaaatcc 6481 catatccgat ggtaaaccct gcacgatatc catcaataag cttgaaactt ttgtaccaaa 6541 ctctaacctt gttaaaacat ccttaaaagt cagttcctta cggctggtgt aaaccactgc 6601 tgttttaaga gaatcatatg ctgcacgcac atgatgtaaa acctcggtta gcagtgtagc 6661 agattcattt gccccatcat caactaaccg cgccacgtcc acttcaattc ccatcgtttc 6721 ctcttgttgc aacagcacct ctaactgttg agttgtcttt ttcacatggg aaccaacaat 6781 gactatacct ggtttacctc ctcgcacata ctgtgacata ttttctggag caatgggttg 6841 gggcggtaat ccggctattg ccgttaagat actagcagcg ctacgaaaca gaaagcgttt 6901 cccctgacta actgctgcta gcacatcttg tgcaaactgg ttgagatctg cttgagtttc 6961 accatctaca acagcacatt ggttattagt gagttgcatc agtcgttcta agctaccagc 7021 gcgaatatcc gccaataaaa atctttctac cgactcagca ctgatacctc cttgagtttt 7081 ttcttctaca tacttgggta agtagctatg atgataccca aagactgaat cacgagcaaa 7141 ttcagtttca tggacagggg tgggaacacc ttcagttatc aaataatgta cactgtcgcg 7201 ggtaatacgt ccgccttcaa aaaacgctgg tacgagaaaa tgagcatcaa atggaccaag 7261 ttcagaaaca ataacatcag tttcaatggg gtaatgtccc cgtaaagtcg aatcagaacg 7321 actgacaacg ataaatcctt ctgttttatc tgtttgtgat gtgatgttgg attttactcc 7381 cgcgttttcc gcattttccc tgactctaga tattgcctgt ttcaggttat ggcagacttc 7441 tctggtaata gatgcggctt cttctggggg aagcgctctt gtatttgtta acacaaagaa 7501 aattggtgaa tcatcttgca accccaagca taatgtttcc acatcccaac gcatgagtag 7561 caaacaactg tggactgttt gagaacctgt cgggtcatca tctaagacaa taatttttgg 7621 tttgttagtc atgtttaagt caacggacaa ctgtttgttg gaaagttcaa agcaagcgtt 7681 ggttctttaa atggagactc ctgtgataac cgagattcag atttcccgca attttctgat 7741 ctctgtttgc acactcattg caatctgcaa tgattgattt agaagttgag catactcgct 7801 tgcctgactg cttgcttgta aggcttgcaa atttgtcaaa ttattgacaa gtgattcttg 7861 attattggta aataattttt tattatctcg taaaattcgt tctgttttca aagcgcgaac 7921 gaaatcttct ctaataagtt gcatagcagc aacgactttt tctctatcag ctatacgact 7981 ttctgtgttc ccagaggtta tctgttggtc atggatgtcg atcgcgatca gcagcgtatg 8041 atatttatcg acttcatcta aaagtactct gagtgttctg ggataggtaa aacgccgcca 8101 aaaccaacgc ccaactataa tagcaatggg catgcttgtg agcaaaacaa 
ttcctagtat 8161 catcgaagaa ccgattgtcg gcagaatcag aaacgcataa acaaacccga caatgatgag 8221 agtcagcagc agcgctacaa caatttcgtt aatcaaaaaa cgaaatcgct tctcgctatt 8281 tctcataata gaaggtcgca agacatcgtc tggatttaac ccaattaagc gtctcagttc 8341 tcccttagtt atctccaggc tcaataagtc cggctgcaca gatggatact cctatcaaag 8401 agcgagtggc gtatttgttg tatctgagat cttgcatcac tttcaattag cccattttga 8461 cgccttggaa tgaattcctc agctcacagc ccaagtctac taaagtaaac ttcaaggctt 8521 atgcagtcgt ctttagacga cttttgctat gagactggga tttgaatccc aggcggacga 8581 gaatcccagg cggacgagaa tgcaagatct cagttgtatc ctactccctt cccagtctaa 8641 gatgacctat cttttttttc ttcttccaat gggagagcag tctcctaggc gcaccgaaaa 8701 acttggactg catcgctgta tcactactcc cttgtctacg gggtaggaca aagatttcag 8761 gctcaaagac aagcttagtc aggttttcaa agaatttcaa acattttttt ccgaaattcc 8821 ttgaaactat ttgtagtttt tgtgtagtat aaaagtcgtc aagcgtaaga caaaaacttc 8881 taaacaaaga gagtcactta aacagttatg ttgtacatcg ccctagtttt ccaactcaaa 8941 cctcctccga aagaggaaaa aaatcagaag acatctcaca tggataactc tgaattagat 9001 ttctcttctc aaaaccaagc caaaacaaaa aagcctgaat atctccaaaa aattcaagaa 9061 aactattact tagaacgacg gttcattttg taagcattta tcagttatca gttatcagtt 9121 atcagttatc agggggtcaa gaaatcgctt cctctgttcc ctgtttactg ttcactgttc 9181 actgttcact ggttcactcc tcttctagct ttctcatttc cacttgcacg tcttgtgcaa 9241 ttttcaaggc ttcatccact cgtcgtgcat attctctagc ctcatggttg acttttatgc 9301 cttgcatagc agtgacattg ctctccaaaa gttccatatt tcgagcaatg aattccttat 9361 ttttcctcaa aatacgctct gtttttaatg cacaaattaa attggatctt gtggttttta 9421 aaacgtcaat gactttttct ctatcagtta aactcactcc ttggtttcca gcagcttcta 9481 gctggtcatt aatatctatt gcttttaata ctgcattata atcatcaatt tgcctgagaa 9541 tatttctctg acgttctctg aaaaatctcc acttaattaa gtaagcaata ggaattaacc 9601 cccagccaaa aaacaatatt tgctctgtat aagcgcctaa aagtaataaa cgagccttac 9661 gtgggtattt taatgtttca tcagctttca aatctctacg actaagacca gctatttgtt 9721 tcaagtcgtc ttgtgtaata tataagcttt gtaaatcagg ttgcatagtt gtgtatttat 9781 ttaagctgag atcttgcacc ctccgggtat gcgcggagcg cacgccaaag gcgttcgcgc 
9841 agcgtctccg gaggagatac gccagccgct catgggggaa acccccaaga cccttacggg 9901 ttcgccagtc acctgcgcca gggttaccct cccgcagtgc tggactcacc gcgctggctc 9961 accattctca ataaccgtag gtgggcactg cccacaatag taaaaataag ggtttgagca 10021 aacggtaagc agtgcccacc ctacaaaaat tgctggctta acctgttaag cgttcccaat 10081 ctacttacgt tttagctatt atcttcaatc gcaattcatc tttaatacgg ttgagaattg 10141 aatcttcatc aagggttgac aaaattttac taatcacagc tttaggacca tctggtcccc 10201 aatttgctcc attggctaag tatgctttgg tcacataccc aagaccatag caagaaacgc 10261 ctgccacgcc agcttgagtc agtgctaccg aaagataagc acccaaggaa gcgcccccag 10321 tcgcaggtgc agatagacca agtaatgttt tgagtgaact caagcccaag tttgctaata 10381 gttcactagc actgatacca cccatactca aggcaattct ttgtaataat tttacggctc 10441 cggtttcggt catgggaatg ccataaagtt ttgataagcc tagaatcagg ataatatcta 10501 tcacagcacc agcgagaaca tctactagca tgacgggatt gagggcgatc gccattgcct 10561 tagtcatcgt cgccttccaa atcaactgat tggcgcttgt ctctcgaatt ttgagttttc 10621 gctgcactaa ttgctcattc acattgtcag cataaagcat agtgttgagg gcgaccaagg 10681 ctttgccttc acgatgtaga atttccaaaa tcttcagctt cagttcctca acttgggctt 10741 ttcctgcact caactgcaca cccctggtac cgtcggggcg atgaaccatt ctcctcacta 10801 acggcgatgc tgcagccatg acaatttcat caggtgaaag taattctcgt accctttcat 10861 tccgtatttt ttcgtagatt gccatacgat ctgcttcagg aaactggtct actttgttaa 10921 acaccagcaa aatcggttta ccaacttccc gcaattggga aagggcatca tgttcaactt 10981 tcgtcatgtc gccagcaacg acaaacagaa tcaaatccgc ttgttttgct atctgttgtg 11041 cgagtacagc gcgggtttca ccatcaactt cgtctaaccc tggggtatca atcaattcca 11101 cttgagattg accacttccc ggtagagtca ctcgcaaagc gcgttctgtt tccccaattg 11161 cttcttctgt gatcgtccaa tttgctgttt gagaattgcg ggtgacaccg tgcaagggac 11221 cagtttcaaa tactggttgt cctaccaaag cattgagtag agatgatttg cctcgtccca 11281 ccataccaaa ggcggctatc tgaacgacca tgcgttctaa tttcaacagc attgtttcca 11341 aatcgccaat ttccatctcc agtccgtctt tttcctcttg agtgaggtca agcttatcta 11401 ccaaatttcg cagtgctgtt cgcgcttgtt tatagttgag ttctgcctga atatcttcaa 11461 aagtaaaaat ggcactatct agctgttcgt cccaattgag 
agaattttcg tcagtgctgg 11521 gcgaatcatt acgatgtgga tcgggtaatg tcgaagtcat atcaattttg aatttgaaat 11581 tttgtatgta tatctcgtct actatcatga ctccgtcagg tgacgacgac catcacctta 11641 tggggtggtc aaataactca tgactctcga ctaataataa atacctaatg attaagacaa 11701 attgacccct taaagtcata aactgaaaaa tgcatgtagt agcgttatag ttcttgacga 11761 aatcacctgt gcgaaaaata gttattgccg gtaactggaa aatgttcaaa acccaggcag 11821 aatctctgga gtttttaaaa ggatttctgc ccagcttgga cgaaacccct caagaccgag 11881 aagtggtatt atgcgtcccc ttcactgact taaacgtttt gtccaagagt ttgcatggaa 11941 cccgtgtaca attgggggcg caaaatatcc attgggaaga gagtggggcg tatactggtg 12001 aaatttccgg accaatgttg caggaaattg gcatgcgtta tgttattgtc ggtcatagcg 12061 aacgccggca atattttggt gaaacggact ataccgtcaa cctgcgcctc aaagcggctc 12121 aaaggtttgg tcttactcct attctatgtg taggtgaaac gaagcaacaa cgagatgcgc 12181 ttgaaacaga atcactgatt atcagccaac tcgaaaaaga tttagtggat attgatcagg 12241 agaatttggt cattgcttat gaacctattt gggcgatcgg tactggcgac acttgtgaag 12301 ccaaggaagc caatcgtgtc atcggtttaa ttcgcagcca attgagcaat cctgatgtgc 12361 caattcaata cggtggttcg gtaaagccga ataatataga cgaaatcatg gctcaaagcg 12421 aaattgacgg cgttctggtg ggaggagcaa gtttggaacc cgcaagtttc gctagacttg 12481 tgaactttaa gtgaagtact tacacttacc cgaagggaag tgtaagcttc ccgtctcatc 12541 cgccgtagcc tctttaaacc cgttctctcg ttccctggct tctctcgttc ctctcgttcc 12601 ctggctcagc cagggaatgc atacagtgag actctgtctc acgtagaaat cgcaaaagtc 12661 attttcaaat ctggaggtcg agcctctagt aaagcgttcc caggctgtag cctgggaacg 12721 agtgtggagt gcaaatctta tgacagccaa tttaatcatt cgagaacgct gttttgagtg 12781 gggacagcga acatacctca tggggattct caatgtcaca ccagatagtt ttagtgatgg 12841 tggcgaattc aataccgtcg ctgctgcttt agcacaagca caagcaatgg tagcagctgg 12901 tgcagatatt atcgatgtcg gtggtcaatc aactcgacca ggggcagagc aaatcactct 12961 tgcagaagaa cttgaccgag ttttgccagt attacacgta ctgcgaaaag agataccagt 13021 gccaatttct gtagacacaa ctacagcagc tgttgccaaa gctgctgtag aagcaggagc 13081 agatatagtt aatgacattt caggggctac cttagaccca gaaatgttgc cgacagtagc 13141 aagaatgaat gtgcctatta 
tattaatgca catccgggga aacccgcaaa caatgcaaca 13201 attcactgat tatcaggatt tgatgggaga gatttatagt tttttggcaa agcaaatcgc 13261 tgcagctact ggtgtaggta ttgacgaaag aaaaattatc atcgatccag ggattggctt 13321 tgccaagaac tatgagcaaa atttagaaat tttgcgccac ttaccccaat tgcgtcaact 13381 taagtgtcct attttagtag gagcatctcg taaaagtttc attggtcgta ttttaaatca 13441 gccagacccg aaagcacgag tctggggaac ggcagcggcg tgttgtgctg ctatcttcaa 13501 tggcgctgat atcctccgag ttcacgatgt ccaacaaatg cgcgatgtat ccttggttgc 13561 tgatgcaatt ttccgacaat cttcgcaagt cctgccgtca gactagtttg aagctttttc 13621 accattgttg ctatgacctt cttgatatgt catctcggaa atcaaatcca tcatctcacc 13681 acctttgcct ggatctacaa acacaacttt ggcgttttgg ctttcaccca gtctgtagct 13741 ggcattgatg taatcttgag ccacaagata ccgcagaatt tccttacttt caggattaga 13801 acgtaaagca ttagaaatga tttccattga ggttttcgtt ccttgtgctt tcttaattgc 13861 ggcttcccgt tgcccttctg cttctaaaat cgcagcgcgg ctattaattt ccgctgctcg 13921 ctgctcttcc attgatttcc gcacgctttc aggtggtgta atactctgaa tatctacccg 13981 ggtaatttca actccccact gcgccgttgt ctgattcaac tggtctagca aggcactgtt 14041 catgcttgct ctggcggcgt tggtttcttc caaagtattc tgggcaatga tttccctaag 14101 cgtagttgta gtcaggtttg ttagtgcctg ctgcagatcg tcaatttcgt aaaagctttt 14161 ctctatatct ctgacgcgcc agtaaacaac cccatccact tccaagtaga tattatctct 14221 ggtaatgaca ttctgaggct tgatgtctaa aacttgctct cgagtcgtgt cctccatgac 14281 aatttgatcg agcaagggaa caataaagtt gagtcctggt ttcagtttgc gatgatactg 14341 ccccaaacgt tcaaccaaag cttcatttcc ctgattaata agttttgcgg atcctaatgc 14401 ataccctata aggactaaga ctatggcaat aattggttcc atgtattctc ctatttagca 14461 tttggcggaa ttagccgtaa ggacatagca ctaataccat gtcttatctt cagtctaata 14521 tccagaatgt gtaatgttat tatcacaagt tgcggaacta agcaagactc tggagaaaat 14581 aacatgaaat tagcaatgac ctagctttag acatggctat taaatcacca agagagttgc 14641 gtgtgcaagc atttattttg tcattaactt tactaaaaaa aaagtatgtc accaaaaata 14701 ctttatcctt gctcgcagta tcgaaacatt gtattgataa gtaggtcaac attgagaaac 14761 gtaaaatagg ccttctttag gtatctcctc tggacacgct tgaaagaacg ccagttgcct 14821 
agcctttgcg atgatcctca aagcataact gatctgactg gcttaagtat gcctgcggca 14881 cgctacgagc gaatgctacg gacattttac gtttcatgtt tgttgaaatc tttatactga 14941 atccgccttt gtaaccccat tttaaccctg gcacaggatt tggtatgcct agacgttgcc 15001 tatgcaatgg atttcaaata gttcatactc aaaacaaaat nnnnnnnnnn attttgtttt 15061 gagtatgaac tatttgaaat ccaaacaaaa tagtcccctg cgaattagaa gcataagaca 15121 gcaggagaca atttaaatgg ttgatgtatt cctgttttta gttttaatga aattgctacg 15181 attgtttact acctgtagac agacagaata ttaagttttt ttgtaaacat aactactatg 15241 atgaactaaa agttaagcta acgcgcaaga cttggttgtt tttacctaaa aagaatgtca 15301 aaaactctta gattgcctta tagagctaaa tttagcttga ttggtttcaa atatagcttt 15361 actcttcgca aacttagatt gactctcaag aaatttgttg taattttcag ccctcttctt 15421 aaaattttcc gcctgaaaat ttgctaaata gctcttttct tcagctagtt cttcaacagt 15481 gtaagtttta tgtttttcac aatagagttg aggatctgtt tgaggtgaaa taatttcctg 15541 ctcagattct aactcaatta agttgttccc aggttctaac tcattaactt gattatgatg 15601 gcttgctggt tcaatatcta aagctttttg tactgcctgt aaaatctgaa aagatttttg 15661 tatgacttga tacagactta gttgcaaagt ggtcacttct aaaggcgtta tttccagatc 15721 tcgggttatt gggtcttcac tcataaccag ttaattgatg tacactcttg cccatatatt 15781 gacaaatttt tagatttcat tgaaaaaaga ttatatttag ttagaaatgt ttatatttag 15841 caatagttat tattagcaat agctactgta ctgtcgtact aaataaaggt agattgtctt 15901 ttaagactgg cagataggtg gatgagcctt gagcttcgag cctgtgcaag ttagtgttaa 15961 aaaaagtaag gtgagcactg ctgcccacct tggcggatga tatttggtca gtaggactac 16021 atcatgccca taccgcccat gcccattccc attccgccca tgcccatccc gccgtatggc 16081 atgccgtaac cgccgcccat accgcccatc ccgtacggca ttccgtagcc gcctccatat 16141 ggcatgccgt agccgccgtt catcatcatg ttgccgacag ccatgtttgt cattccagcc 16201 atgccggctc cagcgactgt tggtgccatt ttcgcaacgc ccccgagagc cttgccgagc 16261 atgctaccaa ccccgccgcc cttcttgtga gcggtatcca tcggcacttg tgccgccatg 16321 ttgtttgctt cattaggatt tggataatca gctgtccaac caggttgcgt cgctggtgct 16381 gaagcttgtg cttgtccgta ctgcatatta ttttgcggag gcaactgacc aggcgcttgc 16441 tgactcacag gaaatccttg cacttgatta ggctgtatcg gctgcgaaat 
atttccgaac 16501 gcaggctgct gcccgtagcc ttgctgcgcc tgagtctggc ttgcgtcttg ggcaggatat 16561 gcaggttgtt ggtaattttg ttgcggcatc ggcgatgaag ttgccgaatt tt // LOCUS NODE_2036_length_16481_cov_5.106051 16481 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16481) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16481) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16481 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(710..1369) /locus_tag="DP116_17690" CDS complement(710..1369) /locus_tag="DP116_17690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741903.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="PRJNA477356:DP116_17690" /translation="MKEKPRVIFLDAVGTLFGVKGSVGEIYSQIAQEFGVEVSADTLN KTFIQSFKAAPPPVFPDAEEQDIPQREFDWWLDIGRKSFEQAGVFQKFSDFSTFFSEL YIHFGTANPWFIYPDVLPALVSWRRMGIELGILSNFDSRIYSVLQSLELREFFQSITI CTQAGAAKPDSKIFAIALEKHHCSSDAAWHIGDSLTEDYHGARGAGLRGIWINRQIVD K" gene complement(1449..2645) /locus_tag="DP116_17695" CDS complement(1449..2645) /locus_tag="DP116_17695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651979.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)/FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_17695" /translation="MTQQPAIICILGGGFGGLYTALRLSQLPWEPLQKPEIVLVDHSD RFVFSPLLYELLTGELQTWEIAPPYQELLSNTGIRFCQGFVSEIDIDQRRVHLQDGPE ISYDQLVLALGGETPLDIVPGATSYAYSFRTIADAYRLEERLRVLEESDADKIRVAIV GAGYSGVELACKLADRLGERGRFRLIEISDQILRTSPDFNRQTANKAIDARGVFLDLE TKVESIAQDSISLEYKNQVDTIPVDLVIWTVGTRVSPVVRNLPVKQNQRGQISTASTL QVHDHPEIFALGDLADCLDAEGKQVPATAQAAFQQADYAAWNIWASLTNRPLLPFRYQ FLGEMMALGIDSATLTGLGIKLEGSLAYVARRVAYLYRLPTLDHKLKVGFNWLTRPIV ETLYRK" gene complement(2685..2906) /locus_tag="DP116_17700" CDS complement(2685..2906) /locus_tag="DP116_17700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17700" /translation="MGKKVVSLAVDYTFEQLQAVKVVIDPQVWNTRAIRCYEKCGFVK VKILPEHELHEGKYWDCWLMATNHKKSVH" gene complement(3109..3396) /locus_tag="DP116_17705" CDS complement(3109..3396) /locus_tag="DP116_17705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cation transport regulator ChaB" /protein_id="PRJNA477356:DP116_17705" /translation="MNTNIRNRSGAMAVNNIDELSQELKDQLQELPQEGKQIFVAAFN AAQSDGISEQGAREVAWNSVKNQYEKGSDGKWHARGEVTAQHNKAITSGGN" gene complement(3499..4152) /locus_tag="DP116_17710" CDS complement(3499..4152) /locus_tag="DP116_17710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015163198.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1345 domain-containing protein" /protein_id="PRJNA477356:DP116_17710" /translation="MFKNSDSRHRLIICVGFAALVSVLLPSWLHFPTRILCAWNLGAD CFLGLTWWIMFRATPQKMRRFAQLEYQGRVAIFTLIIAAACASVLAIGFLLSGNTKKL STILLTLHVTLAVMTIISSWLLVHTIFAMQYAHTYYQVSSDTQQIAAGLDFPNDEEPD YWDFLYFSFVIGMTSQVSDVQTISRSMRRLTLLHGVLSFFFNTSILAMSINIIAALI" gene complement(4318..4497) /locus_tag="DP116_17715" CDS complement(4318..4497) /locus_tag="DP116_17715" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17715" /translation="MRGNREQGERGTGNGKIIVEKSFLSGLIVKWYKEDLPVVDALWA VRQRYALTFSTFWSL" gene 4624..5355 /locus_tag="DP116_17720" CDS 4624..5355 /locus_tag="DP116_17720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17720" /translation="MLDQIFDYLHFHFSIEACIVLLVLIFLEAILSADNAIALAAIAQ GLENKDLEGKALNIGLVVAYVLRITLLLTATWVQQFWQFELLGGVYLLWLVFQHFTSE EGDDNQHHGPRFTSLWQVIPVLAFTDLAFSLDSVTTAIAVSNETWLVITGTTIGVVTL RFMAGLFIRWLDEYVYLEDAGYITVALVGLRLLLKVVNDSLVPPQWGMITAIALILAW GFSKRTHSEEIEEQKKTELVEGSRE" gene complement(5442..5954) /locus_tag="DP116_17725" CDS complement(5442..5954) /locus_tag="DP116_17725" /EC_number="2.7.4.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313430.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleoside-diphosphate kinase" /protein_id="PRJNA477356:DP116_17725" /translation="MTNDRGQRTKHEFNKFEEIALAERTFLAIKPDGVQRGLVGEIIR RFEDKGFTLVGLKFLKVSRELAEQHYDVHRERPFFAGLVDFITSGPVVALVWEGEGVI ASARKIIGATNPLSAEPGTIRGDFGVNIGRNLIHGSDAIETAQQEVSLWFKEEELVSW QPTITPWLHE" gene 6111..8126 /locus_tag="DP116_17730" CDS 6111..8126 /locus_tag="DP116_17730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="arginine decarboxylase" /protein_id="PRJNA477356:DP116_17730" /translation="MHVESTETLEEVVKLPSDGQKAQLKNNKQRKLLPPATSLDAPRL WTIEQSEELYRIEGWGQPYFSINAAGHITVSPKGDRGGSLDLYELVNALKQRSLGLPL LIRFSDILEDRIERLNACFAKAIARYNYPGVYRGVFPVKCNQQRHLIEDLVRFGKPHQ FGLEAGSKPELMIALALLDTPGALLICNGYKDREYIETAMLAQRLGQKPIIVLEQVEE VDLVIEVSQQLGIEPIVGVRAKLSTQGMGRWGTSTGDRAKFGLTIPEIIQAVDKLREA NLLGSLQLLHFHIGSQISAINVIKDAIQEASRIYVELAALGAKMKYLDVGGGLGVDYD GSQTNFYASKNYNMQNYANDIVAELKDTCAERKIAVPTLISESGRAIASHQSMLIFDV LSTSVVPLDPPESPKEGESPIITYLWETYQSVNEENYQELYHDATQFKEEAISRFNLG ILSLTERAKAERLYWACCQKILEITRKQEYVPDEMEDLEQIMASIYYVNLSVFQSAPD CWAIDQLFPIMPIHRLDEEPTRRGILADLTCDSDGKIDRFIDLRDVKSVLELHPLKPG EPYYLGMFLNGAYQEIMGNLHNLFGDTNAVHIQLTPKGYQIEHVVKGDTMSEVVSYVQ YDSEDMVESIRQRCEHALEENHITLAEAQRLLQTYEQSLRRYTYLNS" gene complement(8188..9354) /locus_tag="DP116_17735" CDS complement(8188..9354) /locus_tag="DP116_17735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NDP-sugar synthase" /protein_id="PRJNA477356:DP116_17735" /translation="MKAMILAAGKGTRVRPITYTTPKPMMPILQKPVMEFLLELLRQH GFDQIMVNVSHLAEEIESYFRDGQRFGVQIAYSFEGRIVEGSLVGEAVGSAGGMRKIQ DFHPFFDDTFVVLCGDALIDLDLTAAVKWHKSKGSLATIIMKSVPKEEVSSYGVVVTD VDGRVKAFQEKPKVEEALSTNINTGIYIFEPEVFNYIPSGVEYDIGSQLFPKLVEIGA PFYAIPMDFEWVDIGKVPDYWRAIRGVLLGDIKNVQIPGQQVAPGIYTGMNVAVNWDK VDITGPVYIGGMTKIEDGAKIVGPTMIGPNCWVCSGATVENSVIFEWSRLGPGVRLVD KLVFGRHCVDKTGATIDVQAAALDWLITDARQDPPSHTPVERQAIAELLGNNAS" gene complement(9621..9935) /locus_tag="DP116_17740" CDS complement(9621..9935) /locus_tag="DP116_17740" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17740" /translation="MALRQRSLEDFVRLGELHLTLAQFLSSRVARACLLGIRSVRALS DGARVCKCGEQALWGIACRKGSGHRCTGGIFIFLPPDGSLLAQASTGIELLVKFKGLL AG" gene complement(10079..10921) /locus_tag="DP116_17745" CDS complement(10079..10921) /locus_tag="DP116_17745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319374.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="segregation/condensation protein A" /protein_id="PRJNA477356:DP116_17745" /translation="MDASELLEKITLLIHQAELGEIDPWDVKVISVIDHYLELMGSEA TTKGYEADLSKSGQAFLSASKLVLFKANTLMQLQSSAQEQEAAENDALLESEDGIIHQ TQRLPLERHLRRRPTAMPPSKRRVTLQELISQLQIMAQQLKLVEKANKPARQKRQPSL QSMRAALELAHQENLTEVAFELEQLLQSVATELSLQNTWLNLEQLVELWTQKKQPQQN KAHTSQHSHVVVSVFWALLLLCAQSKVELFQEEFYQEIKIRLLTDSSNHESIETPLNS EFAT" gene complement(11036..11392) /locus_tag="DP116_17750" CDS complement(11036..11392) /locus_tag="DP116_17750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LapA family protein" /protein_id="PRJNA477356:DP116_17750" /translation="MRQINFVIIFIFCLALALFALENTQPGTINVVPEVQVEAPIAIE LLLASGIGAVLAWLYSIWTRFQRLLVSGPQVRQKNLQIKELESKVEQYQAEVQSLKLA LPPVNDSVAKEAQITT" gene complement(11897..12952) /locus_tag="DP116_17755" CDS complement(11897..12952) /locus_tag="DP116_17755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319372.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AI-2E family transporter" /protein_id="PRJNA477356:DP116_17755" /translation="MYRSASVQRLLIYGLSGPIIALNLWLLYVIFRLFQHPITIVSIA AILAFLLNYPVKFFERIALTRAQAVIIVLLLTLTLLVILGVTLVPVVIDQTIQLLNKI PDWLAASQANLEHIEAFAKKRRLPLDLRVVSNQINANIQSLVQQLASVAVGFAGTLLS GLVDLILVVVLAFYMLLYGDRVWSGLFNFLPPHIRFPLTTSLRLNFHYFFLSQILLAL FMVISLTPIFLVLKVPFALLFAIFIGISQLIPFIGATFGIGLVTFLVLLQSWWLAVQV AVAAIVMQQIKDNLLGPKLLGDFIGINPIWIFVAILMGFEIAGLLGTLVAIPIAGTIK VTFDAIKGGNIRKGVTE" gene 13097..13744 /gene="pdxH" /locus_tag="DP116_17760" CDS 13097..13744 /gene="pdxH" /locus_tag="DP116_17760" /EC_number="1.4.3.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740664.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyridoxamine 5'-phosphate oxidase" /protein_id="PRJNA477356:DP116_17760" /translation="MDKTISDLRKDYTLQSLSEKDVDSNPFIQFKQWFDQALAAQLPE PNAMTVATATLDGKPSARIVLLKGFDQRGFVFYTNYNSQKGQELAENPQGSLVFWWAE LERQVRICGSVEKVSEKESDEYFYSRPLNSRLGAWASDQSQVIESREMLEQRMQELQI QYQNQDVKRPPHWGGLRVIPTEIEFWQGRSNRLHDRLLYTRLLDDGSWKIQRLSP" gene 13744..14664 /locus_tag="DP116_17765" CDS 13744..14664 /locus_tag="DP116_17765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651739.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cyclase family protein" /protein_id="PRJNA477356:DP116_17765" /translation="MKKFALLCLVIFLSFVVCLSINAAQPRAVPPLWQVYQQSLKTAK YVDLTHTIAPAIPVWSGFGPSKFEPTVNPNTGKPYTYQKDGFEATHYDLSTDQLGTQL DPPAHWNPDYPAIDELPATFAVRPLVVIPIQNKVAGDPNYHLTVKDIQDWETRHGKIP EGSVVFVRSDWSKEWPNPELAKRKKFPGVSLQALQFLHLQRKILFHGHEPLDTDSTPT LEGEAWLLKNGYTQAEGVANLDQVPETGALVTIGYPKFQGGLGGYARYIAICPPDWKY GVSVGQISESPLQKANKPLRWDQQLNLRVR" gene 14875..15192 /locus_tag="DP116_17770" CDS 14875..15192 /locus_tag="DP116_17770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872315.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17770" /translation="MNHPFGLDVLDLEAIELNFEDDLNDEEAAQVVGGLTKATTEAVG EEGGTVTTLAVGEEGGIQCISAPCPGSEGGEKPPKEPPKATTLALGEEGGYTKARFEN GGY" gene 15408..16313 /locus_tag="DP116_17775" CDS 15408..16313 /locus_tag="DP116_17775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17775" /translation="MNILILGNSLDAHAAHLKNALTEAGATVDYFDTHLFPTQLRMSW RPDTKVGSLALSEENQLNFQDIHSIFWRNFSGVHVPQLKDSNQQYVAFNDSMSTLRSL IQACPCHWVNSWQAYQFHKEKPLQLSKAKEIGVTIPATLISNHPREITEFVHTHEKVI FKPVYGGAHTQFLTQEHLEPKRLNLALSLSPVTLQEYIPGTNIRSYVIGESVYSAEIR SHAVDFREDLDAELIPIELPESIQQQCLAIAKAFMLEWTAIDWRCKPNGEYVFLEANP SPMFIHFENQTGFPITEKLVNLLMN" BASE COUNT 4842 a 3638 c 3447 g 4554 t ORIGIN 1 gcgaccccct caccgcctcc ggcgtctaca agtcgggaaa cccgctgtta gcacctgcct 61 caccgccaat ttttttaggt tgcctaactt gtgcagcttt aactgcaatg ttgtgctgct 121 gggatcgaat cataaactcc acctgtagtg gttcgccaac atcataagaa agaccgcaat 181 acttcaactc agcagttaat taacaccacg atatgtgagt gttatctgct tcaccttttt 241 gaataacaat gaggcgcatt ccttgaggat atctcccttt acaagcgcta ttactgaaaa 301 aagaattatt ggcttccttt accaaaaaaa tatttttatt tatgtactaa ttgtaacttt 361 attatgaaag tctcagcttt tttagtaaca attaatatac agagtgtgag tttggctaat 421 gtcacaaaac tgaatgttaa aaagttgtga atcaactgat gcgggtatga aaaagggtgt 481 actttgtttg tgtagtcgag ccagtgcgtt gcggtgaggc aggtgctaac agcaggtttc 541 ccgacttgta gacgccggag gcggcttccc gtagggtagc aactgccgtt agcgtagcgt 601 gtccggagga catacccgaa gggtgagtta gctcaaactc atctcaccaa tgtccgtcat 661 aatttagaac atcgtctact ggtagtcaga gaaaaagggg atcaacagct catttatcta 721 ctatttgacg atttatccaa atgcctctta gtccagctcc cctcgctccg tgataatctt 781 ctgtgagact gtcaccaatg tgccatgcag catctgatga acagtgatgt ttttccaaag 841 ctatggcaaa gattttgcta tcaggtttgg ctgcacctgc ctgagtgcaa atagtgatag 901 actggaaaaa ttctctgagt tccaaacttt gtaacactga gtaaatacgg gaatcaaaat 961 ttgaaagtat tcccagttca attcccatcc gccgccagct aaccaaagct ggcaagacat 1021 cgggatagat aaaccacgga ttagctgtac caaagtgaat gtagagttca ctaaaaaaag 1081 tcgaaaaatc agaaaacttc tggaaaacac ctgcttgttc aaaagacttt cgacctatgt 1141 caagccacca atcaaactcg cgttggggaa tgtcttgttc ctctgcatct ggaaatacag 1201 gtggtggcgc agctttaaaa ctctgtatga atgttttgtt taaagtatca gcggaaactt 1261 cgactccaaa ctcctgcgct atctgactgt agatttctcc 
cacactgcct ttcaccccaa 1321 agagtgtgcc aacagcatct aaaaagataa ctcgcggttt ttccttcata aactatcact 1381 tactcgctat tagctatcta cctatctcca ctgactcaaa cagtcctgta tgccaagtat 1441 gtaatttttt attttctata aagtgtctct acaatcggac gagtcagcca gttaaaacca 1501 actttgagtt tgtgatctaa agttggtagc cgataaagat aggcaacacg acgcgcgacg 1561 tatgctaagg aaccttctag tttgatgcct aaaccagtaa gggtggcact gtctatcccc 1621 aatgccatca tctcacctaa gaactgatag cggaagggaa gcaaagggcg atttgtcaaa 1681 cttgcccaaa tgttccaagc tgcataatct gcttgttgga aagcagcctg tgctgttgca 1741 gggacttgct taccttcagc atcaaggcaa tctgctaaat ctcccaaggc aaaaatctcg 1801 ggatgatcgt ggacttggag agttgatgcg gtgctaattt gaccgcgctg gttttgtttc 1861 acagggaggt ttcgtaccac aggcgatacc cgagttccca ctgtccaaat gaccaaatcc 1921 acaggaattg tgtctacttg attcttgtac tctaatgaga tgctgtcttg agcaattgat 1981 tctaccttcg tttctaaatc gagaaataca ccccgtgcat ctattgcttt gtttgctgtt 2041 tgtctgttga agtctggcga ggttcgcaaa atttggtcag atatttcaat cagccgaaat 2101 cgtcctcttt ccccaagtct atctgctagt ttgcaagcta actctacacc actgtaacca 2161 gcaccgacta tagctaccct gattttatcc gcatccgatt cctctaaaac tcgcaggcgt 2221 tcttccaaac gatatgcatc tgcaatagtc cgaaacgaat aggcgtaaga tgttgcacca 2281 gggacaatat ccaacggtgt ttcaccacct agcgccaaca ccaattggtc ataggaaatt 2341 tctggtccat cctgtaaatg tacccgtcgc tggtctatgt caatttctga tacaaagcct 2401 tgacaaaaac gtatacctgt gttgctcaaa agttcttgat agggtggggc aatttcccag 2461 gtttgcaatt ccccagtcag gagttcgtag agaagagggg agaaaacaaa gcgatcgctg 2521 tgatccacca aaactatttc aggtttttgc aaaggttccc aaggaagctg gctcaagcgt 2581 agagcagtgt agagaccacc aaagcctcca ccaaggatac agattatagc aggttgttga 2641 gtcatcggtc tagaggaagg tcaatccccg gggggtaatt tctttcagtg tactgatttc 2701 ttgtggtttg ttgccatcaa ccagcaatcc cagtactttc cctcgtgcag ttcgtgttcg 2761 ggcaatatct ttaccttgac aaagccgcat ttttcataac agcggatagc acgagtgttc 2821 cacacctgtg gatcgatgac aactttaacg gcttgcagtt gttcaaaagt atagtctaca 2881 gccaatgaaa ctactttttt gccaattccc tggttccaat attgagtttc tctcgcatac 2941 gacgaataga gatttcgtct tttttaagat gcatctctct aatattttgg 
caatttgtat 3001 tatagttata tgattttttt ggcaatcctt tttcaacctg gaaataagca aaagacggtt 3061 tgtctttaac acaaaccgcc gcaatcttaa aacctaggta aaaagaactt agttaccgcc 3121 agaagtgata gctttgttgt gttgagcggt aacttcgccc ctagcatgcc atttgccatc 3181 tgaacctttc tcgtactggt ttttaacgct attccaagca acttcacggg caccctgttc 3241 actgataccg tcactttgag ctgcattgaa tgctgcaaca aaaatctgct ttccttcttg 3301 aggaagttct tgcagttggt cttttagttc ttgagataat tcatctatgt tgttaacagc 3361 catagctcca ctccgattcc ttatgtttgt gttcatccta agaactcaaa ggcaaggttt 3421 tcctctttct agaagtagag atagaaaatt ctcatatctt tgatcctata cttttgacat 3481 aggcttagtg aacctgattc aaatcagcgc ggcaattata ttaatgctca tcgccaaaat 3541 actggtattg aaaaagaagg ataatactcc gtgtagcaga gtcaaacgcc tcattgaacg 3601 tgatattgtt tgcacatctg agacttggct agtcatgcca ataacgaaag aaaaatatag 3661 aaagtcccaa taatctggtt cttcgtcatt aggaaaatct agaccagcag ctatttgctg 3721 tgtatcactg ctaacttgat aataagtatg cgcgtattgc atggcgaata tggtatgtac 3781 tagtaaccaa gaactgataa tcgtcatcac agcaagcgtg acgtgtaggg ttagtagaat 3841 cgttgacagt ttttttgtat taccactgag taagaaccca atcgctaaca cacttgcaca 3901 agcagcagcg ataattaagg tgaagatagc tacacgacct tgatattcaa gttgcgcaaa 3961 acggcgcatc ttttgtggag tcgccctgaa cattatccac caagtcaagc ctaagaaaca 4021 gtcagcgcct aagttccaag cacagagaat gcgcgtaggg aagtgcagcc aagacggtaa 4081 tagtactgag actaatgcag caaagccaac acaaataatc agtcggtgcc gagaatcaga 4141 gtttttgaac aacttaaatg caacccaaaa atgattattg gtaattgaat gctaccaagt 4201 cagttgactc aaaaaagtac ttgagttaga aaacgtttga aaagtctgat ttcagacttt 4261 tcttggtgac taaaagtctc acggcggttg ctactgtgcc gtaaggcata ccgcttacta 4321 cagtgaccaa aacgtggaaa aggtcagtgc atagcgctgt ctcaccgccc agagggcgtc 4381 tacaacgggg agatcctcct tataccattt cacgatcaag cctgataaaa atgatttttc 4441 tacaattatt ttcccgttcc ccgttccccg ttccccctgt tccctgttcc ccctcatcct 4501 gattcgcttt gagaattcgc catccacacg ctaagctgaa taaaagtagc attttattgc 4561 gccttgtata taaaatttgc ggtttgtgct aatttttaat ttttgctgtt taagaaaact 4621 ggaatgctag atcagatatt cgattacctt cacttccatt tcagcattga agcttgtata 
4681 gtgctgctgg tgctcatttt tttagaggca atattgtctg ctgacaacgc gatcgctctc 4741 gctgcgatcg cccaaggact agaaaacaaa gatctcgaag gtaaggcgct gaacattggt 4801 ttggtagttg cttatgtcct gcgaatcact ttacttctaa cagctacctg ggtgcaacag 4861 ttctggcagt ttgagttact aggtggagtt taccttttgt ggctagtatt ccaacatttt 4921 acctctgaag aaggcgatga caatcaacat cacggtcccc gttttacctc cctgtggcaa 4981 gtcatacctg tccttgcctt cacagattta gcattttctc ttgatagtgt taccactgcg 5041 atcgctgttt ctaatgaaac gtggcttgtc attaccggca caaccattgg cgttgtgact 5101 ctgcgcttta tggcaggttt atttattcgt tggttagatg aatatgtcta cttggaggat 5161 gcaggctata tcactgtagc gttggtaggc ttgcgcttac tcctcaaagt cgtgaacgat 5221 tctttagtac caccacagtg ggggatgata actgcgatcg ccctgatctt ggcatgggga 5281 ttttctaagc gaacccactc agaagaaata gaagaacaaa agaaaaccga gcttgtagaa 5341 ggcagtagag agtaagaaaa agtgttggaa tgaggaaggg aggaagtgag ggaaaaaata 5401 tccttcgctc cttcccttat aatctccttt gacttcctaa attactcgtg taaccaaggt 5461 gtgatggttg gttgccaaga gactaattcc tcttctttaa accacagaga gacttcctgt 5521 tgtgctgttt cgatagcatc ggaaccgtgg atgaggttgc gaccaatgtt gacgccaaaa 5581 tcacctcgaa ttgtgcctgg ttctgccgag agtgggtttg ttgccccaat gatttttcta 5641 gcagatgcaa tcacgccttc gccttcccaa accagcgcta ccactggacc ggaagtgata 5701 aaatctacta acccagcaaa gaaaggtctt tcccggtgaa cgtcgtagtg ctgttcagct 5761 aattcccggc taactttgag aaacttcaaa ccaacaaggg taaagccttt atcttcaaag 5821 cgacgaataa tttcacctac taatccgcgc tgtacgccat caggcttaat tgctaagaat 5881 gtgcgttctg ccaaagctat ctcctcaaat ttattgaatt cgtgttttgt cctttgtcct 5941 ctgtcatttg tcatttgtca agaaatcatg acaaatgact tgtcatttat gaccaataac 6001 aaatgaccaa tctgaggata tctcagaaag tatctctaag tgcatgtgcg cgtacagagg 6061 aatacgctaa attgtatttt cgtgggtcta acatagaggt cagtgaagaa atgcatgttg 6121 agtcaactga gacattagaa gaggtggtga aactgccgtc cgatggacag aaagcgcaat 6181 tgaaaaataa taaacaaaga aagctgctac caccagccac atcactagat gcacctcgcc 6241 tatggacaat agaacagagt gaagaacttt accgaataga aggttgggga cagccttact 6301 tttctataaa cgcagcaggt catatcactg tttctcctaa gggcgatcgc ggcggttctt 6361 
tggacttata tgaacttgtc aacgccctga agcagcgtag cctaggactg ccgctactga 6421 ttcgtttctc ggatattttg gaagatagga ttgagcggtt aaacgcttgt tttgccaaag 6481 cgatcgcccg ctacaactac ccaggtgttt accgtggtgt ttttcctgtc aagtgtaatc 6541 agcaaagaca cttaatagag gacttggtga ggttcggcaa acctcatcaa tttggcttag 6601 aagctggttc taagccagaa ttaatgattg ccctagcttt attagataca ccaggggcgc 6661 tgctcatttg caatggctac aaagaccgag agtacatcga aacagcaatg ctggcacaaa 6721 gactaggtca aaagccgatt atcgtcctag aacaagtcga agaagttgat ttggtgatcg 6781 aggtcagcca acaattgggg attgagccaa ttgtgggtgt gagggctaaa ctaagtaccc 6841 aaggtatggg acggtgggga acttctacag gcgatcgcgc taaatttggt ctcaccatcc 6901 ctgaaattat tcaggcagtt gacaagttac gcgaagctaa cctattgggt tcgttacagc 6961 tattacactt ccacatcggc tcgcaaatct cagcaatcaa tgtgattaaa gatgccatcc 7021 aagaagccag tcgtatttat gtggagctgg cagcattggg ggcaaagatg aagtatctcg 7081 atgttggtgg tggcttgggt gtcgattatg acggttcgca aacgaacttc tatgcctcga 7141 aaaactacaa tatgcagaac tatgccaacg atatcgtggc agagttaaaa gatacctgtg 7201 ctgaacgaaa gattgccgta ccaacactga taagcgaaag cggacgggcg atcgcttccc 7261 atcaatcaat gctgattttt gacgttctca gtaccagcgt tgtccctctt gatccaccag 7321 agtcaccaaa agagggtgaa tccccgatta ttacttacct gtgggaaacc taccaatctg 7381 ttaacgagga gaattaccaa gaactctacc acgacgctac ccaatttaaa gaagaagcca 7441 tcagtcgctt caacttaggg attttaagtc ttacggaacg cgctaaagct gaaaggcttt 7501 actgggcttg ttgtcaaaaa attcttgaaa taaccagaaa gcaggaatac gtaccggacg 7561 agatggaaga cttggaacaa atcatggctt ctatctacta cgtcaatctt tctgtgtttc 7621 aatctgcacc tgactgttgg gcgattgacc agctttttcc gatcatgcca attcaccgtt 7681 tggatgaaga accaacacgg cgaggaattt tggcagattt aacatgcgat agtgatggca 7741 aaatcgacag gtttattgac ctgcgggatg tgaagtcagt tttggaactg caccccctca 7801 aaccaggaga accctattat ctcggaatgt tcctcaacgg agcttaccaa gaaattatgg 7861 gcaatttgca caatctcttt ggcgacacca acgcagttca catccaactg actccaaaag 7921 gttatcaaat tgaacacgtc gttaaaggtg ataccatgag cgaagtggtg agctatgtgc 7981 agtatgactc tgaagatatg gtggaaagca ttcgccagcg ttgtgagcat gctttagaag 8041 aaaatcacat 
cactttagcg gaagctcaaa gactgctaca aacctatgaa caaagtttgc 8101 gacggtacac ttacctgaat agttaaaagt taaaagtcat aagtcatcaa ctaatgactt 8161 atgactcaat gactaaatta ctaatgacta actagcatta tttcccaaca attcagcaat 8221 ggcttgccgt tctactggcg tatgggatgg tggatcttga cgagcatcgg tgatgagcca 8281 gtctaaagca gcagcttgga catcaatcgt tgctccagtt ttgtctacac aatgacgacc 8341 aaataccaac ttgtccacaa gccgtactcc cggtcccagt cgtgaccact caaaaatcac 8401 actgttttct accgtggcac cactgcatac ccagcaattt ggacctatca tggtaggacc 8461 aacaattttg gctccgtctt caatcttggt catgccgccg atgtaaactg gacctgtaat 8521 atccactttg tcccaattga cagcaacgtt catcccagtg tagataccag gtgcgacttg 8581 ttgtccgggg atttgcacgt tcttaatgtc ccctaagagg acaccacgaa ttgcccgcca 8641 gtagtctgga acttttccaa tatccaccca ttcaaagtcc attgggatag cgtagaaagg 8701 cgcaccgatt tctaccagtt tggggaaaag ctggctgccg atgtcatact ctacgccaga 8761 agggatataa ttaaatacct ctggttcaaa aatataaatg cctgtgttga tattagtgct 8821 cagagcttcc tcaactttgg gtttttcctg gaaagctttc acacgcccat caacgtccgt 8881 gacgactaca ccatagctag aaacttcttc cttgggcacg gatttcatga taatggtggc 8941 aagagatcct ttagatttat gccacttcac agctgctgtc aaatctaggt caatcagggc 9001 atcgccacac aacaccacaa aggtatcatc aaagaatggg tgaaagtctt ggatcttccg 9061 cattccccct gcagatccaa cggcttcccc aacaaggcta ccctcaacaa tgcgaccttc 9121 aaaagagtag gcaatttgca caccaaaccg ctgaccatca cggaaataac tctctatttc 9181 ctctgctaaa tggctaacat tgaccataat ctggtcaaac ccatgctgac gtagaagttc 9241 cagtaaaaat tccatcactg gcttctggag aatgggcatc atcggtttgg gggttgtgta 9301 ggtaatagga cgtacgcgag tacctttacc agctgcgaga atcatcgcct tcatatatat 9361 ttattcctca accacaagcc agtttactgt taagaagtaa tatttcatcg tgagtttatt 9421 tctgatttca gtcacccctg aaagcacggt tcacagacat caagagactt tgacgcaatt 9481 attggcataa cctaactaat agcgtcaagt ataaagcatg aagaaaatag aaaatatgaa 9541 gtatgcagta taaaattttc atacccagct ttcagcatcc acacttcatg actttatatc 9601 attcttgctc ctactctagc ttaccctgcg agaagcccct taaactttac gagaagctct 9661 atgcctgtgg aggcttgcgc caacaaagaa ccgtctggag gaagaaatat gaatatgcct 9721 ccagtacacc tatgtcctga 
tccctttcga cacgctatac cccagagtgc ttgttcaccg 9781 cacttacaga cacgcgcccc gtcagaaagc gcacgcacgc ttcgtatgcc cagaaggcac 9841 gctcttgcaa cgcgagagct taggaattgc gctaacgtta gatgcagttc ccctaggcgc 9901 acaaaatcct ccaaagagcg ctgtctgagc gccatccaga acaagtaact tgtacacaaa 9961 ggtttccctg atttggcaaa atgtctgttg gtgtctacat cttttcattc gttatcttta 10021 tcaatttaat cggattttgg aatttactga agccaaactc tgataatctt gaatattcct 10081 atgttgcaaa ctctgaattc aggggagtct ctatggattc atgattggat gaatcagtaa 10141 gtagcctgat tttaatttcc tgataaaact cctcctgaaa tagctctact tttgattgag 10201 cacaaagcag taggagcgcc cagaaaacac taacaactac gtggctatgt tgcgatgtgt 10261 gtgctttatt ctgttgtggt tgctttttct gcgtccacaa ctctacaagc tgttcaagat 10321 tcagccaagt gttctgtaaa ctcaattctg ttgccacact ttgcaatagc tgctctagtt 10381 caaaagccac ctctgtaaga ttttcctggt gagccaactc taatgctgcc cgcatacttt 10441 gcaaactagg ttgccgtttc tgacgggcag gtttattggc tttttctacc agtttcaatt 10501 gttgagccat gatttgcaat tgcgaaatca gttcttgcaa agtcacgcga cgctttgacg 10561 gtggcattgc tgttggacga cgacgcaagt gccgctctaa tggtaagcgt tgagtttgat 10621 ggatgatccc gtcttcactc tctagcaatg catcattttc cgccgcttct tgctcttgtg 10681 ctgacgattg caattgcatc aaagtatttg ctttaaataa cacaagtttg gatgccgaca 10741 aaaaagcctg tcctgatttc gacaagtcag cttcatagcc cttggtggtt gcctccgatc 10801 ccattagttc caaataatgg tcaatcaccg aaatcacctt aacatcccaa gggtctattt 10861 ccccaagttc cgcctgatga atcagaagtg taattttttc caataattcg gaagcatcca 10921 ttaattcttt ggatatagga tttcaggttt tgtatgagga attgctctgt gttgtgttag 10981 tagtcgttag tggaacaact aacgactact aactcctcac taaccaattc ttttattaag 11041 tggtgatttg tgcttctttt gctacagaat cattcactgg tggtagagca agtttcaaag 11101 actgaacttc tgcttgatat tgttcaacct tgctttctag ttccttaatt tggagatttt 11161 tctgtcgcac ttgtggacca gaaactagta gtctctgaaa acgtgtccag atactataca 11221 accaagccaa aacagctcct atcccacttg ccaaaagcaa ctcaattgca attggtgctt 11281 ccacctgtac ttctggaaca acatttattg ttccaggttg ggtgttttcg agggcaaata 11341 aagccaaggc taaacaaaag ataaaaatta ttacgaagtt gatttgtctc attggaaact 11401 
aaagactaga taattgtttt tgcccaatgc ttcctgaaaa ttgtgcctaa aaccattttg 11461 aattgtgctt aaagatagag caacttgcta tccaaaagat tttaaaaact taaagttgca 11521 catctacagc agttttgatc ctatttgagt tgtgaaacaa ctgttgaggt gatccggtac 11581 tcttggagcc taagctccag cagggttttt aacttcagta tccacaggcg ggaaacccgt 11641 ctatggcact ccagtcattt gtactaaaaa ataaaggaca ctcgacaaat gacaaaggac 11701 aacgctcagg aaaaatttca caaactattt aggattgcaa ttaaacaaaa tggtacgaga 11761 atgaatctaa cttgtaaagt ataacatatg atctttttat acacgcgctt gtgtcaaaaa 11821 aatggcgata aaaaatcata ataatcttga cattaacctc tatatattaa atccttatca 11881 atcattccaa agactcctac tcagttacac cttttcttat gttaccgcct ttgatcgcat 11941 cgaaggtaac ttttatggta ccagcaatag gaatagcaac caatgtccct aataaacctg 12001 caatctcaaa tcccatgaga atagccacaa aaatccagat gggattaata ccaataaaat 12061 cgccaagtaa ctttggacct aacagattat ctttgatttg ctgcatgaca atggcagcca 12121 cagcgacttg aactgccaac caccaacttt gaagcaacac taaaaaagta accagaccaa 12181 taccaaaagt cgccccgatg aaggggatga gttgggatat acctataaat atggcaaata 12241 acaaggcaaa aggtactttg agaactaaga aaatcggagt caggctaatt accatgaaca 12301 atgctagcaa aatctggctg aggaaaaagt agtggaaatt aagtcgtaaa gatgtggtta 12361 aggggaatcg aatatgaggt ggtaagaagt tgaataagcc agaccatacg cgatcgccat 12421 acaacagcat ataaaatgcc agcaccacta ccagtatcaa atcaaccaaa cccgataaca 12481 gtgttcctgc aaatcctaca gcaaccgaag ccaactgttg caccaaactc tgaatgtttg 12541 cattgatttg attgctgacg accctcaaat ctaggggtaa acgccgcttt ttagcgaaag 12601 cttcaatatg ctccaaattt gcttgactgg cggctagcca atctggaatt ttatttaaaa 12661 gttgtattgt ttggtcaata actactggta cgagcgtgac gccaagaatg accaaaagcg 12721 ttaaagttaa cagcaataca atgataactg cttgagcacg ggtcaaggca atgcgttcaa 12781 agaacttgac tgggtagttc agtaaaaaag ccagaattgc tgcaatgctc acaatggtga 12841 tggggtgctg gaataagcga aaaatcacat acagcaacca gagattgaga gcgataatcg 12901 gaccgctcag gccgtatatt aacagacgtt gaactgaggc tgaacggtac atcttgtgtt 12961 atctgtgtta aaacgccttg acacattagg tacaacttgg tgaaagctga tgaccatgta 13021 aattagatag atatttttgc aaatacacac ataagcgtac tgaaatacgt 
acttaataat 13081 aggaaagtaa acaaaaatgg acaaaacgat ctccgacctt cgcaaagact acaccttgca 13141 aagtttaagc gaaaaggatg tagattctaa cccttttata cagtttaaac aatggtttga 13201 ccaagcatta gcagcccaac tcccggaacc gaacgcgatg actgtagcca ccgctacact 13261 agacggtaag ccctcggcaa gaatcgtgct gctaaaaggt tttgaccaac ggggctttgt 13321 cttctacacc aactacaaca gtcagaaagg acaagagtta gcagaaaatc ctcaaggttc 13381 gttagttttc tggtgggcgg aactagaacg ccaagtccgt atttgtggga gcgtagaaaa 13441 agtttccgag aaagaatcag atgagtattt ttatagtcgt cctttaaaca gtcgtttagg 13501 tgcatgggcg tctgatcaaa gtcaggtgat agaaagccga gaaatgctgg aacaacggat 13561 gcaggagttg caaattcaat atcaaaacca agatgttaag cgaccaccac actggggagg 13621 cttgcgcgtg atcccaacag aaatagaatt ttggcaagga cgttccaatc gcttacatga 13681 tcgcttgctt tatactcgct tattagatga tggcagttgg aaaattcagc gtttgtcccc 13741 ataatgaaaa aattcgcact tttgtgttta gtcatattct tgagctttgt tgtttgcctg 13801 tctatcaacg cagcacagcc gcgcgctgta cctcccctat ggcaggttta tcagcagtca 13861 ctcaaaacag ccaagtacgt tgatctcacc catactatcg ctcctgcaat tcctgtatgg 13921 tcagggtttg gtccatcaaa gtttgaacca actgttaatc caaacacagg aaagccatac 13981 acctaccaaa aggatggttt tgaagcaacc cattacgatc tatctactga tcagcttgga 14041 actcagttag atccaccggc tcattggaac cctgattatc cagctattga tgagttacct 14101 gcaacctttg ctgttcgtcc gttagtagtc attcccatcc aaaacaaagt cgctggtgat 14161 cccaactatc acctcacagt taaagatatt caggattggg aaactcgtca tggcaagatt 14221 cctgaaggtt cagttgtgtt tgtccgctct gactggtcta aggaatggcc aaatcctgaa 14281 cttgctaaaa gaaagaagtt tcctggggtg tcactgcaag ctctccagtt cctgcacttg 14341 cagcgcaaga ttctgttcca cggacacgaa cccctggata cagatagtac cccgactttg 14401 gaaggagaag cttggctact gaaaaatgga tacacccaag cagaaggggt cgcgaattta 14461 gatcaagtgc ctgagacagg cgcactggtg acaattgggt atcccaaatt tcaaggtggc 14521 ttgggaggtt acgcacgcta tattgctatc tgtccgccag actggaagta tggcgtgtcg 14581 gtcggtcaga tatcagaatc tcctctgcaa aaagcaaata aaccgctgcg ttgggatcaa 14641 cagctcaatc tacgagtcag atagcagata acaggacagg aataacgagg aaaagagagt 14701 atttgcttga cttattttca aattaatagg 
ccttaaaatt ttttacaaaa aatttgtaat 14761 aagagccaca aaaatctggt aaatatataa ccaagtacag gaataaatac cttttcctaa 14821 cacttaggaa accagtacga aacccaaaaa tttaacaaac ccaggagaac tgttatgaac 14881 catccctttg gtctagatgt tttagattta gaagcaatag aactcaattt tgaagacgat 14941 ctcaatgatg aagaggctgc ccaagtcgta ggtggactga ctaaagccac cactgaagct 15001 gtaggtgaag aaggagggac agtcaccact ctagctgtag gtgaagaagg agggatacaa 15061 tgtatttctg ccccttgtcc cggaagtgaa ggaggagaga agcctcccaa ggagcctccg 15121 aaggctacca ctttagctct aggtgaagag ggtggttaca ccaaagctcg gtttgaaaat 15181 ggaggttatt agtaacctcg ttaatgtgaa gtttaaaggc acgactggtt gaaaactccg 15241 cgcgataagc gcacgccgga gacaaaagcc actttagtaa ttgactcaaa aacgatttca 15301 gttcgagttt acataccata tagtttcagc actttactct aaagtgctga aatttcttgt 15361 tgtgaaagtg atttgatttt aagacttctt aaaaccgcaa atttaaaatg aacattttaa 15421 ttttgggtaa ttctttagat gctcatgctg cccatctcaa gaatgctctc accgaagctg 15481 gtgcaacggt agattatttc gatactcacc tatttccaac acaattaagg atgtcttgga 15541 gacctgatac caaggtggga tctttagctc tatctgaaga aaatcaattg aatttccaag 15601 atattcacag tatattttgg cgtaacttct ctggcgttca cgttccacag ttaaaagact 15661 caaaccaaca gtatgttgca tttaatgatt caatgagtac actgcgctca ttaatccagg 15721 cttgtccatg tcactgggtt aattcttggc aagcatacca gtttcataaa gaaaaacctc 15781 tgcaacttag caaagccaaa gaaataggag tgacaattcc agccacttta attagtaatc 15841 atccaagaga aattacagaa tttgttcaca cacatgaaaa agtcattttt aagccagttt 15901 atggtggtgc tcatacccag tttttgacac aagagcattt agaaccaaaa agattgaact 15961 tggctttgag tctttctcca gtcacactac aggagtatat tcctgggaca aacattcgca 16021 gctatgttat tggagaatcg gtttattctg ctgaaattcg cagtcatgct gtagattttc 16081 gcgaagattt ggatgctgag ttaattccga tagaattgcc agaatcaatt caacaacaat 16141 gtttagcaat agcaaaggca tttatgctag agtggactgc tattgactgg cgttgcaaac 16201 caaacggtga gtatgtattt ttagaggcaa accccagtcc gatgtttata cattttgaaa 16261 atcagactgg ttttcctatt acagaaaaat tagtcaatct cttgatgaat taaaagcaag 16321 cagtcgcaga actatcatat caagtttgcc taattactta caataaaaaa cctcaccccc 16381 cgcccctctc 
cgaactcgcg gagaggggtg cccggagggc ggggtgaggt gagaccagcg 16441 ctgcgggagg gtttccctcc gcaggcgact ggcgaacccg g // LOCUS NODE_2041_length_16448_cov_4.904715 16448 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16448) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16448) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16448 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 129..1931 /locus_tag="DP116_17780" /pseudo CDS 129..1931 /locus_tag="DP116_17780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744955.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" assembly_gap 307..316 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 1928..3736 /locus_tag="DP116_17785" CDS 1928..3736 /locus_tag="DP116_17785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744954.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_17785" /translation="MKLWWLRLARYARVRWRGLAFVLLLMLINVGLNVLKPWPLKLIV DCVLGNQSLPNTLVWLKTLAGDADIQLLGPLAGATIALFLASEGVRLLQDYVGAGVGS QMGYDLGAALFHRLQHLSLEFHNQQRSGDLVRRVMTDSICIRNLAMGVFLPVLTSVVN LVVMFVVMWQLDHFLSLLSLVVAPLIVLLIWVFNKPMIERTYQHQQFEGEIMALGEQT LTALPIVQAFGREAHEDERFRYLSQKALQACLRALLAQMQFKIGVSGVTAVGTAAIML FGGFQVLDGSLSIGSLLVFLSYLASLYVPMETLAYMSSGFAAAAASAMRVLEVLDAKE EVREIAGATPLLLQPGKASGYVCLEKVTFGYQDGKPILQDISLEAQPGETIALVGATG VGKSTLVSLIPRFFDPWQGRVLFDGVDIRDVQLQSLREQIALVLQEPFLLPLTIAENI AYGCVGASREAIIAAAQAANADSFIQRLPQGYDTVIGERGATLSGGEKQRLAIARALL KDAPVLILDEPTSALDAQTEFLLLEALERLMAGRTTFIIAHRLSTVQRADRIVVLEQG RVAQIGTHQALLNARGLYYRLHQTQSSRAIIHSESV" gene 3921..4493 /locus_tag="DP116_17790" CDS 3921..4493 /locus_tag="DP116_17790" /inference="COORDINATES: protein motif:HMM:PF14602.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_17790" /translation="MTYLANQQSEYRVITDFPENVQIGSNTVIMGNLAFKRFHSRKKQ GLIIGDHCTMDGVHFDIGEKGQVEIGDYCYFTNAVLLCELEVRIGSYVVIGWNTTLAD TDFHPIAPAERIADAIACSPLGKTLPRPEIVKRPVVIEDGVWIGPNATILKGVHIGAG AIVEPGAMVNRDVPPRTRVMGNPAQIIGEV" gene 4499..5089 /locus_tag="DP116_17795" CDS 4499..5089 /locus_tag="DP116_17795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017824203.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_17795" /translation="MSTRTLSWDWYPGTIPENVVLDETAYVETTFSFHLYRSEAPVGV EYGRGASTYLGTMFDVGPRGQVSLGKFALVHGARIICDAQIEIGDYALISWNVVLMDT YRLPFDPTQRRRELEQIPFRSPRRIDGTVPAQPILIGSNVWIGFDACVLPGVTIGEGA IVGARSVVTQDVPPYTIVAGNPARVVRRLDAGGIEK" gene 5086..6230 /locus_tag="DP116_17800" /pseudo CDS 5086..6230 /locus_tag="DP116_17800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744951.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 5497..5506 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 6233..7408 /locus_tag="DP116_17805" CDS 6233..7408 /locus_tag="DP116_17805" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17805" /translation="MARIVVCGYMIRHPLAGNLFAFFHYVLGLHLLGHEVLYLEESGW SGSCYNPINRSYSDDPSFGIHAVETLINTYGVNATVCYVNRDTGTVYGADWQELKRML KTADLLLNIGGVCWLREFLVCKRRILIDMDPFFTQTGTFAAEGRNDYHAYFSYGVNIG KPDCTIPSDGIEWLPTVPPVVPEIWHQVLAPEDCEKKWVDIPLTTVANWSAYGGIIYQ GEHYGQKNEEFMRLLELPSYCAQKLELALSGKDTEIAEITKSLQTAGWLVRDARVLSA NVSTYINYLTSSRGEFSVAKNAYVKTRSGWFSDRSVCYLAAGRPVILQDTGFSDWLPT GDGVLAFSSLESAVDCIERVNADYQMHCLAAQELAEQNFSYKVVLPRLLEPGRVKVA" gene 7405..8544 /locus_tag="DP116_17810" CDS 7405..8544 /locus_tag="DP116_17810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744949.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_17810" /translation="MIIIFASAIGRFPIGGNAWSDLQYLLGLRSLGHDVFYLEECGLE SWVYNWETEQLTTELDYPTNYVRNCLEGLGFENQWIYRAGERSVGMDIDKFKQICHEA DLMIVRGSPISLWREEYNWPQRRIYIDADPGFTQINIASGHSELVNTVEHCDRLFTIG QRIGAADCLIPTIGRDWLLTLPPVALPYWSVTEDDDATHFSSIMQWHSYREVVYEGVT YGNKDKEFLKFIDIPQLTKQPFRIALSGGFPDELSQYGWEVIPGWIASFTPESYQTFV QESRAEFGVAKHGYVATKGGWFSDRSVCYLASGRPVLVQDTGLSDWLPVGEGILIFRD QKEAVNGVEAINADYKRHRYAARQLAQEYFNSDKVLSSLLEAAMS" gene 8913..10085 /locus_tag="DP116_17815" CDS 8913..10085 /locus_tag="DP116_17815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008181224.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_17815" /translation="MSSTITIGFIPREQFSLAAESLQRIFDYTHIPFNLIVVDCNTPK VYWQQIEQVLDGRSHVEVIHKNHYLTPNQCKNLVIQQAKDDFVCFIESDVIVEEGWLS QLMAACEEHPADVAIPRIIEGRLGETKLHWDPNLGHIRSVQTTDGVKYEILPFTDEQQ LDKGSHRRTIELSGEAHCQLYRRSVFDQVAPFDEEVVYLDWIDSSLALYNAKIPVVFE PKSVVHFWHPFPPRRDDLDYFFMRWDLERAQQDLDRIPKKWNLVQVTADLEFAMERNR IGQLHASMEELKALIPPQKPFILVDEDWLNSNEIIEGFRTIPFTEHNGQYWGAPTDDD TAIREFERLHQTGASAIVFLMHTFWWLEYYTRFHDYLRQKFPCVLQNERLIVFDLR" gene 10130..11263 /locus_tag="DP116_17820" CDS 10130..11263 /locus_tag="DP116_17820" /inference="COORDINATES: protein motif:HMM:PF00535.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17820" /translation="MEKPTASIIITNYNYGRFLREAIASALNQTYQPTEVIVVDDGST DNSQQIIADYGKRIIPVLKENGGQGSAFNAGFAVSCGEVVCFLDADDVLLPSAVEKAV SLLHEPNVVKVHWPLSAIDVHGKPLDKLFPEKPLPEGDLLDAQLTGGVDGHVFSPTSG NAWARSYLKRVLPIPEIHYRINADSYLAILAPLFGSIKRIVEYQALYRIHGDNGTSKT TYRWQLNQYHYEMTVLRKFLQEQDIQIKDAFECAQSSGYKHIQHMVELGQELEPLIPP RQTYILVDMDEWGWNGQLLENSQSIPFLEKDGMYWGAPSDDVIAIEEFERLRREGASF IVFGSPAFWWFDYYAEFARHLRTKFRCVLDSERLVVFDLRISA" gene 11309..12523 /locus_tag="DP116_17825" CDS 11309..12523 /locus_tag="DP116_17825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744946.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_17825" /translation="MIPETKVTIAIPTYNRSKLLKTSLKSALAQDYSDFQVLVLDNAS SDDTEAVVRSFSDSRITYVRNDANIGIFGNWQRTIEINSSPYLSILSDDDILLPNFIR ESVLGLDNHPNAGLSAALAEFIDTNGVLLQVKGTEFSDNLPQGLIEGLEFIHQIVDGR KWILRTSAVMFRASALKAVGGFDITHSKYLLDLNLYLRMATQFDFFFIAKQLAQVRFH VEQDSQVSFHSLSGTGALAVMAERTDAIAYLLQSPRAEDASYRRWLAERLLHISIRRS EFTSKLVPKLNLGWSERLEIAIREIAAVIPAGKHFILVDENQWGFDILPQFHPLPFLE HEGQYWGPPSDDQTAIRELERMRDCGASFMVIGWPAFWWLDYYSKLRNYLSSNFRCVL HNSRLVVFDLQP" gene 12606..13868 /locus_tag="DP116_17830" /pseudo CDS 12606..13868 /locus_tag="DP116_17830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181590.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="efflux transporter periplasmic adaptor subunit" assembly_gap 12660..12669 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 14573..16234 /locus_tag="DP116_17835" CDS 14573..16234 /locus_tag="DP116_17835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197463.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17835" /translation="MKLTNPQKRCFTKFLAAVALCAPLLVTLPGNAAKPEHVKQLRDT KKCRKCDLSGANLSGVNLSGADLSYSNLSGANLSGANLSYANLSGVNLRGVNLRGVNL RGVSLSGVSLSGVDLSGVDLSGVSLSGVSLSGVDLSGVSLSGVNLSGVNLSGVKLSRG NLSGVNLSSVNLSGVNLSGASLNGFNLRGVELKNANLSDADLENADLSKANLRDANLK NANLKNANLKGAKLIGVNLNGASLKNADLRGANLDVETLPNDNILAEAADYSRWGDNR YNKSDYRSAIAYYNKAIEIDAKYKEAYANRGLAQTQLKDYQAALADYNNALSIDPNYA KAYNNRGMTRTAQQDYQAALADFDKAISIDSKYAEAYNGRATVRLIQKDYPAVITDAT EAIRLDPKLAAAYNNRGLARFAKQEYQMAIKDYDKAIDHSEGWAWAYFNRGVARYANK EYKDATEDYDKAIDIDENYVDAYYQRSIARFARQKYEDAIKDCDRVIARDPNYAQAYE NKGNAFLALKKKVEAKQAFEQAAKIYFQKQDNTNLQRVQQTITGI" BASE COUNT 4284 a 3695 c 4137 g 4302 t 30 others ORIGIN 1 aactgataac tgataactga taactgataa ctgataactg ataactgata actgagtcgg 61 tcataacgtt tgttttggtg ctcttctata ctagtgagca aatggctaga cgagaggtca 121 atttgtaaat gaggcaaggg ttaaaagcaa tgatgcaccg atacgggcgc ttgctccaat 181 atccgctgcg gcagtggcgg accataatag caatcctcgg tctcacggcg ggaacttcag 241 ccaccgcaac acttcaacct tggccaatga aaattctggt ggattatgcc ctgtctcaag 301 cagcccnnnn nnnnnngtcc ctgacgccgc cactgctggt tcttgttgct gctttaggta 361 gcctcggact ttatgctttg aacgcggcgc ttgacacgag cttaagttgg gcttggagtg 421 ctgctgggca aggtatggtc tacgaactgg ctcaaaacca gttctctcgt cttcaacgcc 481 tttcactgca gttccatagc cagcgtactg tcggcgattc tttgagccgc ttgacaggag 541 attcctattg tgtatacacc ttggtcggtg ctgtgctgat ttcacctgtg cagcatctgc 601 ttatgctctt gactattagc atcgtcgcct ggaacctgaa ttcttcactg acgctgattt 661 cgctattcgt tgcaccaatg 
atggctggtt ttgccctcgt ctttggttcc cgactcaagc 721 gccggacaaa gcttaatcgc gaggcgcagt ctcgcctcac cagttttgtc caccaaacgc 781 tcacagccat cccaatgatt caggcattta gcagggagag ttacaatacc gagcagttta 841 agcacctgtc tcaagatgca gttacgattt cgcaacgcga aaccctgctc aaaagtacct 901 acggtcttgc caatggttct gtgacaacgg ttggtaatgc gatcgtcctt tatgtgggtg 961 ggctgcaggt gttatcggga gccatgagtg tcggtagcct gctggttttc ttggcatatc 1021 tacaatcaat acaaggagca tttcgcggct tgtttggcat ctacgggagt ttgaagtcag 1081 tggaagccaa tatggataga gtgctagaga ttttggatgc taaagatggg gtgcaggatg 1141 ccccaggagc caaaccattg cctgttcgtg cagtgggatc tcggggacat gtatgtctag 1201 agggagttac tttcggttac gatgttgact atcctatcct gcaagatatt accctggaag 1261 cacgaccggg agaaactatt gctttggtcg gtgcgactgg tgctggtaag agtacgttgg 1321 tctctctcat tcctcgtttc tttgatccct ggcaggggcg agtcttattt gatggggtgg 1381 acgtacgagg cgtgcaactc aatagcctgc gcgagcagat tgccctagtg ctacaggagc 1441 cgtttatcct gccactctcg gtggcacaga acattgctta cggtcgtccg ggagcaagct 1501 ttgaggagat tgtcgcagcc gcgaaagcag ccagggcgga cgaattcatc cggcagctgc 1561 cccaagggta tgacaccgtt ctgagtgaaa gaggagctat cctctctggg ggacagaaac 1621 agcggctggc gatcgcacgc gcgctactta aggatgcacc ggtactgatt ctcgacgaac 1681 ccacttctgc tctcgatgcc cacactgaaa gcttactgct ggcagcttta gagcgtttga 1741 tgcaggggcg gacagtattt atcatagccc atcgcctctc gactattctc cgggcggatc 1801 ggatcgtggt gctggaacag ggtaggattg tggagatggg aacgcaccag gagttgctga 1861 cagattctgg tctttacaag cgtttgcact ccttgcagtt tcccgatccg ccgcaggagg 1921 tcgtgttatg aaactgtggt ggctacgact agcacgttac gcccgagtcc gttggcgcgg 1981 actcgcgttt gtgctgctgc tgatgctaat taatgtcggg ctgaatgtgc tcaagccctg 2041 gcctttgaag ttgattgttg attgcgtcct gggtaaccaa tccctgccaa atactcttgt 2101 ttggctcaaa accttagctg gtgatgccga tattcagcta ctcggcccct tagcgggtgc 2161 tacaatcgct ctatttctag ccagtgaagg agttcgtctg ctccaagatt atgttggtgc 2221 tggcgttggc agtcaaatgg gttatgactt gggagcagca ctgtttcatc gcctacagca 2281 tctttctctg gagtttcaca accaacagcg atctggcgat ctcgtccggc gagtcatgac 2341 tgacagcatc tgtattcgga atctggcgat 
gggggttttt ctgcccgtgc tcacttccgt 2401 ggtgaatctg gtagtcatgt tcgttgtcat gtggcaactt gatcacttcc tttcactgct 2461 ctcgctggtt gttgctcctc tgattgtgtt actgatttgg gtctttaaca agcccatgat 2521 tgagcgaacc tatcagcatc aacaatttga aggcgagatt atggcgcttg gcgagcagac 2581 tttaacagcg ctaccaattg ttcaagcatt cggtcgcgaa gcacacgaag atgagcgctt 2641 tcgctacctg tcccaaaaag cgctccaggc ttgtttgcgc gctctccttg cccagatgca 2701 attcaaaatt ggggtgagcg gggtcaccgc agtcggaaca gcagcaatta tgctctttgg 2761 cggctttcag gttctggatg ggtcactttc aattgggtct ttactggtct tcctttctta 2821 tctcgcctcc ttgtatgtgc cgatggagac tttggcttac atgtcgtctg gttttgcagc 2881 agcggcagct agtgctatga gagtgctaga ggtgttagat gccaaggagg aggtacggga 2941 aattgctggt gctacacctc tgcttttaca acctggtaaa gcgagtggat acgtatgtct 3001 agagaaagtt accttcggct accaggatgg gaaaccgata ctccaggaca tttccttgga 3061 agcacagcca ggggaaacga tcgccctagt gggagcgaca ggagtcggca agagtacgct 3121 ggtttccctg attccccgtt ttttcgatcc ttggcaggga cgagtcctct ttgatggggt 3181 agatatccga gacgtgcaac tgcaaagcct gcgcgagcag attgccttgg tgctgcagga 3241 gccgtttcta ctaccgctaa ccatcgccga gaacattgcc tatggctgtg ttggtgctag 3301 ccgcgaagca atcatcgcag ccgcccaagc tgccaatgct gacagtttta ttcaacgatt 3361 accccaaggt tatgacacgg ttattggtga gcgcggtgct acgctttcag ggggagagaa 3421 gcaacggttg gcgatcgccc gtgcgctgct caaggatgct ccagtgttga ttctggatga 3481 acccacctct gctttggatg ctcaaaccga gtttctgctg ctggaggctt tagagcgact 3541 gatggcagga cgaactacgt ttattattgc ccatcgcctc tcgacagtac aacgagcaga 3601 ccggatcgtg gtgttggaac aaggacgagt tgcccagata ggaacccatc aagcattgct 3661 aaacgctcgt gggctgtact atcgcctgca ccaaacccag tccagccgag cgatcatcca 3721 ttcggaatcg gtataaagca atacgtttaa tgctaaggct aaaacttttt gtaaaaatca 3781 atttttttta accaaccgca gaggcgcaga ggagccagtg cgttgcgggg gttccccccg 3841 ttgtagcacc tggtgtgaca cagagagaag aaaaaaatgc ttaactgaac tggattgcag 3901 tataaaccag gggagataag gtgacatatt tagctaatca gcagtctgag tacagggtaa 3961 ttacggattt tccagagaat gtccaaattg gttcaaacac agtcatcatg ggcaatctcg 4021 cgtttaagcg attccatagc cgcaagaaac aggggctaat 
catcggtgac cactgtacta 4081 tggacggcgt tcacttcgat attggtgaga aggggcaggt ggagattggc gactactgct 4141 acttcacaaa cgctgtactt ttgtgcgaac ttgaagttcg cattggcagt tacgttgtga 4201 ttggatggaa tacaacactc gctgataccg actttcatcc gatcgcacct gctgaacgca 4261 ttgctgacgc gatcgcctgt tcacctctgg gcaagacctt accgcgaccg gaaatcgtaa 4321 agcgacccgt ggttattgag gacggcgtct ggattggacc gaacgccaca atcctcaagg 4381 gagtccatat tggagctggg gcaattgtgg agccaggtgc aatggtgaac cgagatgtac 4441 cgccacgcac gcgagtcatg ggtaatccag cacagataat tggggaggtt taagggcgat 4501 gtctacacgc actctctcct gggattggta tccaggaacg attccagaaa acgtggttct 4561 tgatgaaacc gcctacgttg aaacgacatt cagttttcac ttataccgca gcgaagcacc 4621 agtgggggta gaatacggtc gcggtgcttc cacctatttg gggacgatgt ttgatgttgg 4681 tccgcgtgga caagtcagtt tgggtaagtt tgcgcttgtt cacggtgcca ggattatctg 4741 tgacgcccag atcgaaatcg gtgattatgc cctcatctct tggaatgttg ttttgatgga 4801 tacctaccga ctaccgttcg acccgacaca gcggcggcgt gaactagaac agataccatt 4861 tcgttcgccc cgacgtattg atggtacggt accagcgcaa ccaattctta tcggttctaa 4921 cgtttggatt ggttttgatg cctgtgtact accaggcgtc actatcggag agggggcgat 4981 cgtcggtgcc cggtctgttg tcacccagga tgtcccaccg tacactatcg tggctggcaa 5041 tcctgcccgt gttgttcgtc gcttggatgc aggagggatt gaaaagtgag aactgaacgc 5101 cctagtaagg ggaaaattat agtttttggt attctatttt ggtatccctt agcaggggtg 5161 acttaccagt tccttcacta cctgctggga cttcgtcgct taggttatga cgcttattat 5221 atagaagatt catggcgttg gatttacaat ccccgcatca acgacctctc tccagatgtg 5281 actgagaata ttcagcgaat tgccccaatt ttagaggagt atggcttcaa agaccgatgg 5341 ggatttcgcg attatctcgg gggtgaatgt tatgggatga ctgaagccca aattttgcaa 5401 ttatatcagg aagcagacgc cttcttgaac gtaacgggtg accaggaaat ccgcgacgag 5461 catctggctt gcccccgccg catctatgtc gaatcannnn nnnnnncatt gctcacctgt 5521 ccgcccacga cacccacttc agttttgggg aaaatctagg ggcaccagat tgtggtgttc 5581 ccgtcggtgg ctttcactgg ttacctaccc gccaaccagt agtgctggat ttgtgggact 5641 caaacttcac accagggatt gcttacaaca ccatcgccac ttggaataat aagggaaagg 5701 acattactta tcagggcaaa acttactact ggcgcaaggc gcgtgagttg 
gaaaaatatc 5761 tggatcttcc caaagcgcgt ccgttgcaat ttgagatggc gactaatgtg agtgaggagg 5821 aaagagagaa tgtgcgatcg ctgcttcaaa agcatggttg gagccaggta gacgcggtgg 5881 aactctctca agatatgaag aattaccgcg cttatatcca ggaatcgcgc ggagagttca 5941 cggtggcaaa agagcaatac acgcgtctgt tgagtggctg gtttagcgat cgctctgctt 6001 gctatctagc tgctggtcgt cctgtgatta ctcaggaaac aggattcagc aaatttctgc 6061 ccacgggtaa aggacttttt gccttcaaca ctatggaaga tattctagca gcactcgatg 6121 cgatcgaaag cgactacaaa ggcaattgcc aagcagcccg cgaaattgca tcagaatact 6181 ttgccgcaga aaaggtgatt ggcagtctta tggagcgagc ggggttataa gtatggcacg 6241 cattgttgtt tgcggatata tgatccgtca tccattagct ggtaatcttt ttgccttctt 6301 ccactatgtg ttaggattac atcttctcgg acatgaagtc ctatatttag aggaaagcgg 6361 ctggtctggg tcctgctata acccgataaa ccgcagttac agcgatgacc ccagctttgg 6421 tattcacgca gtggaaacac ttataaacac ctatggcgtg aatgccactg tgtgctatgt 6481 gaatcgggac acgggaaccg tctatggtgc ggattggcaa gagctaaagc ggatgctcaa 6541 gacggcggat ctgctattga atatcggtgg agtttgctgg ctgagagaat ttcttgtgtg 6601 caagcgccgg atattaatcg atatggaccc attctttact caaactggga cattcgccgc 6661 tgagggtcgc aacgactatc acgcctactt tagttatggt gtgaatatcg gaaaacccga 6721 ttgcacgatt ccgagtgatg gtattgaatg gctccccacc gtaccgcctg ttgtgccaga 6781 gatttggcac caagtacttg ctccagaaga ttgtgaaaaa aaatgggtag atataccctt 6841 gacaaccgtg gctaattgga gtgcatatgg tgggattatc taccagggcg aacactatgg 6901 acagaagaat gaggaattca tgcgcctgct ggaactccca agctactgtg cacagaaact 6961 tgaacttgcg ctttcgggca aggatacaga aatagcagag attaccaagt ctctacaaac 7021 agcaggctgg ttagttcgag atgctagggt attgagcgct aacgtatcaa cttacatcaa 7081 ttacctgact agctcgcggg gagaatttag cgttgccaaa aacgcttatg tcaaaacccg 7141 tagcggttgg tttagcgatc gcagcgtttg ctatctcgct gctggtcgcc ccgtcatctt 7201 acaagatacc ggatttagcg attggttacc gacgggtgac ggcgtgctgg cgttttcctc 7261 cttggagtcc gcagtagact gcatagagcg cgtcaacgca gattatcaga tgcattgctt 7321 ggcagcacag gagttagccg aacaaaactt tagttacaaa gtagtactgc ctcgactact 7381 tgagccaggg agagtgaaag tagcatgatc atcatttttg cgagtgcaat cggtcgtttt 
7441 cctataggag gcaatgcctg gtctgacctg cagtaccttt tgggtttgcg atcgcttggg 7501 cacgatgttt tctatttgga agaatgtgga ctagagtcgt gggtttacaa ctgggaaacc 7561 gagcaactca caactgagtt ggactatcct acaaattatg tgagaaactg tctggaaggg 7621 cttggttttg aaaatcaatg gatttaccga gctggcgagc gttcggttgg gatggatatt 7681 gacaaattta aacaaatatg tcatgaagct gatttgatga ttgtccgtgg ctcaccaatt 7741 tctctgtgga gagaggaata taactggccg caacgccgca tttatattga tgcagacccc 7801 ggtttcactc agattaacat tgccagcggt cattcagaat tggtaaacac ggttgaacat 7861 tgcgatcgcc tgttcacgat tggtcagcgc attggtgccg cagactgtct catccccaca 7921 atcggtaggg attggctgtt gacattacct ccagtagcac tgccttactg gtcagtgact 7981 gaggacgacg acgccaccca cttcagttct attatgcagt ggcacagtta tcgggaggtg 8041 gtctacgaag gagtcaccta tggcaacaag gataaagaat ttctcaagtt cattgatata 8101 ccgcaactga caaaacaacc gtttcggatt gcacttagcg gcggctttcc tgatgagcta 8161 tcccagtacg gctgggaggt aatccccgga tggattgcgt cctttacacc agaatcttac 8221 cagacattcg tccaagaatc ccgtgctgag tttggagtcg cgaaacacgg ttacgttgcc 8281 acaaagggag gctggtttag cgatcgcagc gtctgttacc tagcttccgg cagacctgtt 8341 ctagttcaag acacaggctt aagcgattgg ctaccagtag gagaaggaat tttaattttc 8401 cgcgaccaaa aggaggcagt aaacggtgta gaagctatca acgctgacta taagcgacat 8461 cggtatgcag cacggcagtt agcacaggag tattttaact cagataaagt tctttcatct 8521 cttttagagg cagctatgag ttaacgatac attgagttta taaaatatca ccatagcaat 8581 actaaatcat aagcttgaag tacagctttg cataagttcg taactttttt aaacgcagag 8641 gaacactaag gtagcgcaaa ggaacgcaga gtcttgctta atttaatggg ctacgaatta 8701 atgaaatgct gtactaagcc tacggcatgc ttcctaaagc gcgttagctc ctcctaacag 8761 aggcacgggc acgctacttt gaacgtaaaa ttctcttttc tctcttttct tggcgctctt 8821 ggcgacgcca gacgcctacg gagggagacc ctcctgcagc gctggctcgt cttggcggtt 8881 aataattttt acaacttaaa taagactgct atatgtcatc aacaataacc atcggattca 8941 ttcctcgcga gcaattctcc ttagctgctg aatctttgca gcggattttc gattacaccc 9001 acatcccatt caacctgatt gtagtggact gcaacacccc aaaggtgtat tggcagcaga 9061 ttgaacaggt gctagacgga cgtagtcatg tggaagttat ccacaagaac cattacctaa 9121 
cgcctaacca gtgcaaaaat ctggtgattc agcaagccaa agatgatttt gtgtgcttca 9181 tagagagtga cgttattgtt gaggagggct ggctatctca acttatggca gcatgcgaag 9241 aacatccggc tgatgtggca ataccacgca ttatcgaggg gcgtttggga gagacaaaac 9301 ttcactggga cccaaacctg ggtcatatcc gttcagtgca aacaaccgac ggagtcaaat 9361 acgaaattct tccgtttaca gacgagcaac aacttgataa aggttcccat cgccggacga 9421 tagaattgtc tggagaggct cactgtcagc tctatcgccg aagcgttttt gaccaagtcg 9481 ctccttttga tgaagaggtt gtttatttag actggatcga ttctagtttg gctttatata 9541 atgccaagat tccagttgtg tttgaaccaa agtctgttgt tcatttctgg cacccttttc 9601 ctccccgtcg tgatgacctc gactatttct ttatgagatg ggatctcgaa cgagctcaac 9661 aggatcttga tcgtattcca aaaaagtgga atctggttca ggtgacagca gatctggagt 9721 ttgcgatgga gcgaaatcgt atcggtcagc ttcatgcgag catggaggaa ctcaaagctc 9781 tgataccgcc gcagaaaccc tttatcttgg tggatgaaga ttggctaaat agcaatgaaa 9841 tcattgaagg cttccggact atacccttta ctgaacacaa tggacagtat tggggagctc 9901 caacagatga cgacaccgct attcgagaat ttgagcgcct acaccaaact ggtgccagcg 9961 cgattgtttt tctgatgcac actttttggt ggctggagta ttataccaga tttcatgact 10021 atttgcgtca aaagtttccc tgtgtattac aaaatgagcg tctgattgtg tttgacttac 10081 gctagcccaa acacagcaag aatttgagat tagttttcaa aaatgtctaa tggaaaaacc 10141 tacagcgagt attataataa ccaactacaa ttacgggcgt tttctgcgtg aggcgatcgc 10201 aagtgctttg aatcaaacct atcagcctac ggaagtcatc gtcgtggatg atggttcaac 10261 agataattca cagcaaatca tcgctgatta cggaaaacga attattcccg ttttgaaaga 10321 gaatggtggg caggggtcag catttaacgc cggttttgcc gtcagctgtg gtgaggttgt 10381 ctgcttctta gatgcagatg atgttttgct acccagtgct gttgaaaaag cagtctcact 10441 gctgcacgaa ccaaacgtgg tcaaggtaca ttggcctttg tctgctattg atgttcatgg 10501 gaaaccgttg gacaagttat ttccggaaaa acctttgcca gaaggagatt tgcttgatgc 10561 acagctcacg ggtggtgtag atggtcatgt cttctccccg accagcggca atgcttgggc 10621 acgtagttac ttaaaaaggg ttttgccgat accggagatt cactatcgaa tcaatgccga 10681 ttcctaccta gcaattcttg ctcccttgtt tggcagcatc aagcgcattg ttgagtatca 10741 ggctttgtat cgcattcatg gtgataatgg cactagcaag acgacttatc gctggcaact 
10801 taaccaatat cactatgaaa tgacagtcct gcgtaagttt ttacaggaac aggacattca 10861 aattaaagat gcttttgagt gcgctcaaag ttctggctat aagcatatcc agcatatggt 10921 cgagttgggg caggaactgg aaccgttaat tccaccaaga caaacttata ttttagttga 10981 tatggatgaa tggggatgga atgggcagct acttgaaaat agccaatcaa tccccttttt 11041 agaaaaagac ggtatgtatt ggggagcacc cagtgatgat gttattgcaa tagaggaatt 11101 tgagcgtttg cgccgtgaag gggcaagttt tattgtgttt ggatcgccag ctttctggtg 11161 gtttgactac tatgctgaat tcgctcgcca tttgcgtacc aagtttcgct gtgttttaga 11221 ttcggagcgc ttggttgtct ttgatttgcg catatcagct tgatactgtg taggccttat 11281 tcccaaacag agggtgagca gaaacgatat gatacctgag accaaagtta cgatcgcaat 11341 cccaacctat aatcgctcga agttactaaa aactagcctt aagagcgcac tggcacagga 11401 ctattcagat ttccaagttc ttgtattaga taatgcttct agcgatgata cagaagcagt 11461 tgtacgctca ttctcagatt cgcgaatcac ttatgtacgc aatgatgcta atataggaat 11521 atttggcaat tggcagcgaa ccattgagat aaattctagc ccctatctga gcatcttgtc 11581 ggacgatgac atactgctgc ccaacttcat ccgcgagtct gtcttaggat tggacaacca 11641 tccgaatgct ggtctctcgg ctgccctagc tgaatttatc gatactaacg gtgttctact 11701 gcaggtcaaa ggcacagaat tttcagacaa cttgccacaa gggctgatcg agggtctaga 11761 gttcatccac caaattgttg atggacggaa atggattttg cgcactagtg cggtgatgtt 11821 tcgcgcttct gccctcaagg cggttggggg attcgacata acgcactcta agtacttgct 11881 agacttgaac ttatacttgc gcatggcaac gcaattcgat ttctttttta ttgccaaaca 11941 actggctcaa gtccggtttc atgttgaaca ggattcccag gttagctttc actcgcttag 12001 tggaacagga gcgcttgctg tgatggcaga acgtactgat gcgatcgctt atttgctgca 12061 gtcgccgcgt gcagaagatg catcctatcg tcggtggctt gcagagcgcc tgttgcatat 12121 aagtatacgc cgcagtgagt ttacgtctaa gcttgtacca aaactcaatc tgggctggtc 12181 agaacggctg gaaattgcca ttcgggagat cgcagctgtg ataccagcag gaaagcattt 12241 tattttggtc gatgaaaatc aatggggttt tgatatttta ccacaattcc atccccttcc 12301 cttcctcgaa catgaaggtc agtactgggg acctccatct gatgaccaaa ccgccattcg 12361 ggaacttgaa cgaatgcgcg actgcggagc aagctttatg gtcattggct ggccagcttt 12421 ttggtggctt gattattact ctaaactaag aaattatctc 
agttcaaatt tccgctgcgt 12481 tttgcacaac agtcgccttg ttgtgttcga tctgcaacca taaggttctg cgacattaat 12541 atggggtgat agcctttcgc tgcactcgct tgtagtgaac tgcaaactga cggggtgaag 12601 tgttcatgat ctggcataag gacaaaaaga aattcaaaac cggagtgcaa tggctagccn 12661 nnnnnnnnng cttgttaact gtggagcgag gcaatgttga aaccaccatc acagagggcg 12721 gtactgttga actgcgcgaa cagaggattg tcaagtcccc aacagagggt gcagtggatc 12781 gggtactggt aaagccaggg gagaaagtca gctctggtca agtgctactc accctgcgtt 12841 accctgagcg aaaaattgcc cttgccaaac atgagttgca gattcgagaa caggaattca 12901 ctttagcacg cgatcgcgag aaaattgtcg aagcccaaca gcagctcatg gctgaggaac 12961 gggaactgcg caagctttca tccctggcga aagtgggagc tgttgctggg caacaagtcc 13021 gaaaacagga agacacagtg cgtgcagatc aggctaaagt gcgagatagt caggcagagg 13081 cgcgcatcac cgccctcaaa cttcaaagcc tgcaactaga gcgccaaggc attgagcagc 13141 aactccaaaa taccactgtg agtgcaccgt ttaatagcgt cgttttgggt atttacgtta 13201 aggacggtga cggagttcag tttcgcacca acctgctcac cttaggcgac cctaagcaag 13261 tactcgtgaa gctgcagctt tccaccctca acgcagcccg agttcgggtt aaccaagttg 13321 cccgtgtcag tgcgatcgga ccatcagcac agaagttcac cggacgtgtg caaagtttgt 13381 atcctcaagc actgtcacct gaagaaactc agaaagaggg cgggaaccaa aaccaatcga 13441 ctcaagcaac ggtacctgct acagtgctac ttgatacctc tactagcaaa ttgattcctg 13501 gtagccgggt gaatgttgag attgttttag aacaacggca aaacgtagtc gttttgagta 13561 ctgaagctat tcaacgttcc caagcgcatc cctttgtctg ggttcgggat agtcaaaata 13621 aggcacagaa acgaaccatc aacctgggat tggaggggtt agtaaccgta gaagtgacct 13681 ctggtttgcg tgcaggcgaa caggttatag tgccgccccc ccagtcgcaa ctcaagccag 13741 gaataccagt caccccctcc cagggaattc aagattcaaa atcagaaaaa acctcgcggg 13801 agtttggaaa tccagcagct ttggctcttg tggacaagac aagtgctcgt agtaacccaa 13861 gtttctagca ccgctgagtc ccaagggaac tcttaacagg gaacaggaag accaattaca 13921 gtacaaaaag tattaaaaaa aactaaatta ttctcatagt tgcgaaataa ccgattgtaa 13981 cagtacaaag tgcggaatgt gggatataag cttccgagtg ctgttgcgat caggtgcgta 14041 ctgatttgag gtgttgaaat ttccggaaac tatatttgag gtgttgtagt tttaacagac 14101 gatacctgag taattatgca 
actaactcat ccaattttga aaatcgactt ggcgcaaaca 14161 acgagtgttt tccctagtac accttggcgt aagtcctgta cccatgtcgc tacccaattt 14221 ttagcatcag tttgaatcgt aaaaaatggg tactctattt acacctgttc tacgttctac 14281 tagtctactt ttttactgta ttaattaagc agacaaaaca agcttctgcc ccactaacga 14341 caaataaact aggactttta tagtacatat acactataaa ttttggctat ttcagccctt 14401 gttcatgatt tagggcactc aactatgcaa ataggcaagc aaattagggc tgtgcacaag 14461 cgccagaaac aacgaaaaat cctgatgcat atgattgaca atttgtgagc gtgtgtaact 14521 cctataaact acacaaactc taaggagatg gatatcaaca gagacaaaaa agatgaaact 14581 cacaaacccc cagaaaagat gttttacaaa gtttttagct gctgtcgctt tatgtgcccc 14641 attgttagta acgcttccag gtaacgcggc aaaacctgaa cacgttaaac agttgcggga 14701 cacgaagaaa tgtcgcaaat gtgatttaag tggtgccaac ttaagtggtg tcaacttaag 14761 tggtgcggac ttgagttata gcaatttaag tggtgccaac ttaagtggtg ccaatttaag 14821 ttatgccaac ttaagtggtg tcaaccttag aggagtcaac cttagaggtg tcaaccttag 14881 aggtgtcagc ctcagtggtg tcagcctcag cggtgtcgat ctcagtggtg tcgatctcag 14941 tggtgtcagc ctcagtggtg tcagcctcag cggtgtcgat ctcagtggtg tcagcctcag 15001 tggtgttaac ctcagtggtg ttaacctcag tggtgtcaag ctgagtcgtg gtaacctcag 15061 cggtgtcaac ctcagtagtg tcaacctcag tggtgtcaac ctcagtgggg cttccttgaa 15121 tggttttaac ttaaggggtg tggaactgaa gaatgctaat ctcagtgatg ctgatctgga 15181 aaatgctgac ctgagtaaag ccaatctgcg tgatgctaac ttgaagaatg ctaatctcaa 15241 aaatgctaac ttgaaaggtg caaaactgat tggagttaat ctaaatggcg cttctttgaa 15301 gaatgctgac ttaagaggcg ctaacctaga tgtagagacg ctaccgaatg ataacatcct 15361 tgccgaagct gctgattata gcagatgggg agataatcgc tacaacaaaa gtgactacag 15421 aagcgcgatc gcatattaca ataaagcaat tgagatcgat gcaaagtata aagaagcgta 15481 cgcgaatcgg ggtcttgctc aaacccaatt aaaagattat caagcagcat tagcagatta 15541 caacaacgcg ctctcaatcg acccaaacta tgccaaagca tacaacaatc ggggcatgac 15601 tcgcacagcg caacaagact atcaagctgc attagctgat ttcgacaagg ctattagcat 15661 tgactctaag tatgccgaag cttacaatgg acgagcaaca gtccgtctta tacagaaaga 15721 ctatccagcg gtaattactg atgcaacaga agcgattcgc ctcgatccta aattagctgc 15781 
cgcatataat aacagaggtt tagcccgatt tgccaagcaa gagtaccaga tggctattaa 15841 agattatgac aaagccattg atcactcgga gggctgggct tgggcatact ttaaccgggg 15901 agtcgcccgc tatgcgaaca aggaatataa agatgccact gaagattatg ataaggcaat 15961 tgatatagat gaaaactacg ttgatgctta ctatcagcga agtatagccc gctttgcccg 16021 ccagaaatat gaggatgcga ttaaagattg cgatcgcgtg attgcgcgcg atcctaacta 16081 tgcccaagca tacgaaaata aaggtaatgc tttcttggct ttgaaaaaga aagtagaagc 16141 gaagcaggct tttgagcaag ctgccaaaat ctattttcaa aaacaggata acaccaattt 16201 gcaacgagtg cagcaaacca ttactggaat ttaagcgttg cagaatagaa atatgaatta 16261 gggagatcgc gcaaagcaag agacgcgacg agtgcgtatc gcgcttgctt tgtccccatc 16321 gttatagaag tagataccat cccagttata aaaggtaggt tgaaagcccc tgatgtaatt 16381 ttgattaggc aatcttatac taagttgcgt tcagaggtag tatccaccag atgagggaaa 16441 aaagttaa // LOCUS NODE_2061_length_16271_cov_5.352121 16271 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16271) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16271) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16271 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1556) /locus_tag="DP116_17840" CDS complement(<1..1556) /locus_tag="DP116_17840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459231.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-mannosidase" /protein_id="PRJNA477356:DP116_17840" /translation="MTSPVSQANTKFISEAIEKLRSYVRVNNLSSWQCLEADLSVADV STCDFSLWNLAQLNAKGHIAWTGGQKVIWLVQKLVVPQNLQGYPLKGLSLRLSLVWWA DSAQVYVNGKLVLEGDLFDCSLRVLLSSRVTPGNEFIIALRLVSPSHCDGAVVKSLLI YESTDENHPDPGFVADELAVMQRFLETFEPESLEDLAGTVAGIDWKETNRLSAACPQD IGAFGGSRLEEKDAKEEKKEFENLLFALRQRLLESKIQNINSKIYLLGHAHLDLAWLW PVSETWKAAQSTFESVLKLQQDFPELIFCHSSPVLYAWIEEHRPDLFDAIQEAVKAGR WEVVGGFWVEPELNLIAGESIVRQLLYGQRYVLEKFGKLSSVVWVPDSFGFCATLPQF FASAGVEYFVTQKLRWNDTTKFDYGAFWWRSPDGSQIFSYMSALVGEGIDPVKMASYA CEWQTQTGLSDALWLPGVGDHGGGPTRDMLETAQRWQKSPLFPKLEFTTAENYLQQIK NRQENNSPPAS" gene 1761..3458 /locus_tag="DP116_17845" CDS 1761..3458 /locus_tag="DP116_17845" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17845" /translation="MPPATPGRYQSRLFNFFHQQSRRWGQQFERTIRHVQVAANWSLE ALLTSVYLLIRQATESAGKQLHTNEQQPRFQLQENQTEIVETVSVDTPIQRVLEAAVT LQIPEAGDQGSTGTNNSHYLWIPHFPLFPTVSSPLTPSLSDSLTPSSPPIVRGIASDL GNRNLVLVTIENEILDILTPQQQEILQNRILDEVATYWHSCELTQSEDQTKVLSEIDR LLNKLTGSKKSIPALPQATGTEFKNQYKKLPNLSQKLPLLDTAIAQIESRAVVTISRT SGQLLQAVQNQLNIFIYGKEQQLTTEQRSLDGNWEHQASKIQALIWGAINYFFGERNT NKLEQKTPTNSIDALSIGFKNQPKTVNLPQRPSSSVLPESPDLPSENIEDPWLTMDDL FGDLQEVTEVVNEQQLLVTSSESPKSALPASPSESPKSALPASPSVKIRRQINFKSFS VLSQAKELIQNSKYSLRSSYENKIWKFSSSSDSPISQFESDSNLIKDNKGEILYQQQK TTQVEAKPDWIETQAQTIGYAKHPLEQVLEWLDRIMLWLEEIFVKIGVFLRKVLRIK" gene complement(3399..4940) /locus_tag="DP116_17850" /pseudo CDS complement(3399..4940) /locus_tag="DP116_17850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998869.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="PLP-dependent aminotransferase family protein" gene 5049..5528 /locus_tag="DP116_17855" CDS 5049..5528 /locus_tag="DP116_17855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740740.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_17855" /translation="MLIEGQINIREATPQEDSLIAKHFYQMWQDIGVPDDAINPNWLE ITLQFIEQARRDLFYKAFVAEVNGAVVVGSASCQLYSGLNPNVFIPEYRKYGYIWCVY VEPAYRRQGIAKQLTSTTVNYLKAVGCTRVVLNASPSGKPVYEQLGFSSSNAMHLDL" gene 5581..6276 /locus_tag="DP116_17860" CDS 5581..6276 /locus_tag="DP116_17860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015116690.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyridoxamine 5'-phosphate oxidase family protein" /protein_id="PRJNA477356:DP116_17860" /translation="MSTLNNQINQEQFTPTQRTSIKRVSQRGHYERQLIYEILDEGLI CHVGFVVDNQPFVIPTAYGRVEDKLYIHGSPASRMLRSLLTGIEVCVTVTLLDGLVLA RSAFHHSMNYRSVVIFGTATLVQGADEKLEALRAFTEHIVPERWAEVRPPNRQELQGT LVLSLPITEASAKVRTGPPLDDEEDYSLSVWAGVLPLQVVAGDAIADPRLHTGITQPD YIQNYTRLYVESE" gene complement(6322..6453) /locus_tag="DP116_17865" CDS complement(6322..6453) /locus_tag="DP116_17865" /inference="COORDINATES: protein motif:HMM:PF12600.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17865" /translation="MSRPFFDYTGFRVSYSQGLKRGSRLFLYTDDTLSVVRLDMDDW" gene complement(6463..6762) /gene="higA" /locus_tag="DP116_17870" CDS complement(6463..6762) /gene="higA" /locus_tag="DP116_17870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006964762.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="addiction module antidote protein, HigA family" /protein_id="PRJNA477356:DP116_17870" /translation="MITKRKPRHPGGLIKRQYLEPLNMTITELAEILDVSRKTVSEIV NEQASITPNMALRLARAFQTTPELWLNLQQKYDLWCAANESEAWKEISPINLQTC" gene complement(6774..7052) /locus_tag="DP116_17875" CDS complement(6774..7052) /locus_tag="DP116_17875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007349587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase" /protein_id="PRJNA477356:DP116_17875" /translation="MIKTIKHKGLKKLFEDDDRSGINPSFADKLLDILDRLDAASEIQ DMRYPGSGLHQLQGDRKGEWSVTVSKNWRVTFTFQDGDAYDVNYEDYH" gene complement(7184..9685) /locus_tag="DP116_17880" CDS complement(7184..9685) /locus_tag="DP116_17880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3769 domain-containing protein" /protein_id="PRJNA477356:DP116_17880" /translation="MPHPVPPPEPPPILESSQSANPMSVVTTDTKALNKEKSARQQET PPLQTAEQKPLIYSAKTLAAPNEPESLPPEFSPITASKSAASLGEVSVGYPLQQPETS HPSNGDSTNAPTKTFVAGEPVTQTLTKLENLSHNQTRRDLKTAQTPNQGNVFTEKSGT EQPVQIEFKTRNQTTEPSTPTPNSQNQSAPTGQPDSTIEPKSQPSGTTVPSKTPTGRE RIVEVTSDRQEYDEQRRIVTAEGNVVVRFDGAVVDADRLQVSLDNLIAVGEGNVALTR GGQILRGERFTYNLVQDNGELTNGRGDIFLPTAGQDLSFLPTDITAGGTPARPLSDRI IENQPLQASSPGGLNINVGGRRGANNLAVPKQGGQVRRVRFEAGRVDFNPRGWEAKDV RLTNDPFSPPELVIRADKVTLTRETPLRDRIKTQGQRLVLDQRTSLPIPKDEQVIDRN KRDVTPAIASIGFDGDKRGGLFIERSFQLINTEGTRFSIAPEFFAQKAVTGNIGNVVS LFGFKSRLNSTLGPRTTVTGSALLTSLDLGDIENNLRANLQLRQLLGDQNPYKATFEY SYRDRFYNGSLGFQTVQSSIGGILTSPVIPLGNSGVTLSYQGSAQYIDAETDRQDLLK PKRDNDRISLSRLQASAALNKGFVLWRGKGLPPTPTEGLRYTPNPVVPFVQTFAGLTG TTSYYSSGDTQNTFSGTVGLEGQFGHFSRPLFDYTAFRVSYSQGLNSGLSPFKFDRSV DNRVFSGEIVQQIYGPFRVGFQTTINLDTGARTSTDYIVEYSRRTYGITLRYNPVQEL GGISFRISDFNWSGGTDPFSDPSEVKPVVNGVRRD" gene 
complement(9946..10062) /locus_tag="DP116_17885" CDS complement(9946..10062) /locus_tag="DP116_17885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209277.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein I" /protein_id="PRJNA477356:DP116_17885" /translation="MLTLKIVVYIVVTFFVSLFVFGFLSNDPARNPGRQDSE" gene complement(10652..12559) /locus_tag="DP116_17890" CDS complement(10652..12559) /locus_tag="DP116_17890" /EC_number="2.2.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose-5-phosphate synthase" /protein_id="PRJNA477356:DP116_17890" /translation="MHLSEITHPNQLHGLSVRQLQQIAHQIRDKHLQTVAATGGHLGP GLGVVELTLGLYQTLDLDRDKVIWDVGHQAYPHKLITGRYDRFHTLRQKDGIAGYLKR GESKFDHFGAGHASTSISAALGMALARDIKGEKFKVAAVIGDGALTGGMALEAINHAG HMPKTNLLVVLNDNEMSISPNVGAIPRYLNKMRLSPPMQFLTDNFEEQFKQIPFVGES LSPELARIKEGMKRLAVPKVGAVFEELGFTYMGPVDGHNLEELIATFQQAHQIPGPVL VHVATVKGKGYEIAELDKVGYHAQNPFNLSTGKAAPSSKPKPPAYSKVFAHTLVKLAE QNPKIIGITAAMATGTGLDKLQAKLPNQYIDVGIAEQHAVTLAAGLATDGIRPVVAIY STFLQRAYDQIIHDVCIQNLPVFFCLDRAGIVGADGPTHQGMYDIAYLRCIPNMVLMA PKDEAELQRMIVTGVNHTTGPIAMRFPRGNGYGVPLMEEGWEALEIGKAEILRNGDDV LMLGYGTMVNTALQAAEILSEHGIEATVINARFAKPLDTELIFPLAEKIGRVVTLEEG CVMGGFGSAVAEALLDADIVVPVKRIGVPDDLVEHAEPNQSKAEISLTSPQIAQTVLQ AFFKREFSAVG" gene complement(12760..13440) /locus_tag="DP116_17895" CDS complement(12760..13440) /locus_tag="DP116_17895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316139.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S-layer homology domain-containing protein" /protein_id="PRJNA477356:DP116_17895" /translation="MRQLLGTLSLLALLQLVPNIVSAQEKPSTKKTSNSIARVVAAKV MTNYPDGQFYPERLLSRAELASILVKAFHLEKRQAVTKENVKVTDVPPSNPAFNDIQI VLKTDIMKGYRGNLFFPNQRITRAEALAIFAQAYGVFQFPDQTVNEILSQYPDAASIP GWAKKAIATAATEGFINTDTQGNLSPSQPMTRGEMAHILSKYLQRQQPQAETPEVPGG NNNPESSP" gene complement(13945..14886) /locus_tag="DP116_17900" /pseudo CDS complement(13945..14886) /locus_tag="DP116_17900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316959.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="AEC family transporter" gene 14987..15832 /locus_tag="DP116_17905" CDS 14987..15832 /locus_tag="DP116_17905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316960.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-alanyl-D-alanine carboxypeptidase" /protein_id="PRJNA477356:DP116_17905" /translation="MDNAGFSEKPQNKPSSSGEDIPEALRDTPDAAAKKVSIRPLFLI IGGVVGVVLIAVVSGFLFFVVTPKKTKDSQSPPASSTPATPSKSGNSATSKDNTVLGH FVYSEAPESELQPISNDRRIKMRKAAAQKFLEMAAAARSAGVVLVPVSGFRSIKEQEQ LFFAVGAQRNQTPAERAAVSAPPGHSEHHTGYAVDVGDGAAPATNLTTNFEKTKAYQW LQANAARFSFEISFPKDNAQGVSYEPWHWRFVGDRDSLETFYKARNLKPAQISQQEES RSGAR" gene complement(16003..>16271) /locus_tag="DP116_17910" CDS complement(16003..>16271) /locus_tag="DP116_17910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006616249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_17910" /translation="MLANHKLAKAIADMSFFEFRRQLTYKCELYGSKLVVVDRWFPSS KTCSNCGTKKETLSLSQRVFECGHCAFVIDRDLNAAINLKNAVS" BASE COUNT 4629 a 3643 c 3466 g 4533 t ORIGIN 1 gaagcagggg gagaattatt ttcttgtcta ttcttgattt gctgtagata attttcggca 61 gtcgtaaatt ctagcttcgg gaacaaagga gatttttgcc aacgctgagc agtttctaac 121 atatcacgag tgggaccgcc accgtggtca cctacgccag gaagccaaag agcgtcagaa 181 agaccagttt gggtttgcca ttcacaagcg taggacgcca ttttaacagg gtcgatacct 241 tcaccaacaa gggcagacat atagctaaag atttgactac cgtcaggcga tcgccaccaa 301 aaagcaccat aatcaaactt cgtcgtatca ttccagcgca acttctgagt cacgaaatac 361 tcaacaccag cactcgcgaa aaactgcggc aaagttgcac aaaaaccaaa actatctggt 421 acccacacaa cagaagaaag cttgccaaac ttctccaaca cataacgctg accatacaac 481 aactgacgaa caatagattc accagcaatt aaattcaact ccggttcaac ccaaaaacct 541 cccacaactt cccaacgccc agctttcacc gcttcttgaa tcgcatcaaa caaatccgga 601 cgatgttctt caatccaagc ataaagcacc ggcgaagaat gacaaaaaat taactccgga 661 aaatcttgct ggagtttaag aaccgactca aaagtacttt gtgcagcctt ccaagtttca 721 ctcacaggcc aaagccaagc taaatcaaga tgagcatgac ccaataagta aatttttgaa 781 ttaatatttt gtatcttcga ttccaacaac ctctgacgaa gagcaaacag taaattctca 841 aactctttct tctcttcctt ggcgtccttc tcttcgagac ggcttccgcc gaacgcgcct 901 atgtcctgcg gacacgctgc gctaaggcgg ttcgtttcct tccaatcaat ccccgccaca 961 gtccccgcca aatcctccaa actctcaggc tcaaaagttt ccaaaaaccg ctgcatcaca 1021 gccaactcat cagccacaaa accgggatca ggatgattct catccgtaga ctcataaatc 1081 aggagcgact ttacaacagc accatcacaa tgactcggac tcaccaaacg caaagctatg 1141 ataaactcat ttcctggtgt caccctagaa ctcagtagca ctctcagtga acaatcaaac 1201 aaatctccct caagtaccaa cttcccattg acataaactt gagcagagtc cgcccaccaa 1261 accaacgata gccgcaaaga caagcccttt agcgggtaac cctgtaaatt ctggggaaca 1321 accaattttt gtactaacca tatcactttt tgccccccag tccaagcaat gtgtcctttg 1381 gcgtttaact gggcaagatt ccaaagcgaa aagtcgcagg tgctaacatc agccacagat 1441 aagtcagctt ccaagcattg ccaggatgac aaattattca ctcttacata tgatcgcaac 
1501 ttctcaattg cttccgagat aaatttagta ttggcttgag atactggaga ggtcataatt 1561 tcgctaagat gaggaagctc ataatcatca acagtctttg atttattgtt tggtacgagt 1621 ccaacagctc agcaacactc ttaataaaag ttcacctagt cgttacgcat ttaacttgta 1681 cacagctagg gactagacca attgctagca ctggttgttt tcttaatttt gaactttgaa 1741 ttttgaactt tgaattactt atgcctcctg ctacccctgg tcgttatcaa agcagacttt 1801 ttaacttttt ccaccagcaa tctcggcgct ggggtcaaca gtttgagcgt accatacggc 1861 atgtacaagt cgcagctaat tggtcattag aggctctact tacgagcgtg tatctgctga 1921 ttcgccaagc aacagagtca gctggtaaac aactgcatac aaatgagcaa caacccaggt 1981 tccagttaca agaaaatcag actgagattg tcgaaactgt ttctgtagat actcccattc 2041 agcgagttct agaagcagcg gtgactctgc aaattccaga agcaggggat cagggaagca 2101 cggggacaaa taactcccat tatctgtgga tccctcattt tcctctgttc ccgactgtct 2161 catctcccct gactccttca ctctctgact ccctgactcc ctcatctccc ccaatagtgc 2221 ggggaattgc ttcagattta ggaaatcgca atttggtact cgttacgatt gaaaatgaaa 2281 ttctcgatat tttaacgccc cagcagcagg aaatattgca aaaccggatt cttgacgaag 2341 ttgcaacata ttggcattct tgcgagttaa ctcaaagcga agaccaaaca aaagtattat 2401 ctgaaattga ccgtctgttg aacaaattaa cgggtagtaa gaaaagcata cctgccctac 2461 ctcaagcaac gggaacagaa tttaaaaatc agtataaaaa gttacctaac ctttcccaaa 2521 aactaccatt actagacaca gcgatcgccc aaatagaatc tcgtgctgta gtcacaatat 2581 ctcgtacgag tggacaattg ctccaagctg ttcaaaatca gctaaatata tttatctacg 2641 gtaaagagca gcaactgaca actgaacaaa gatcattaga tggtaattgg gaacatcaag 2701 cctcaaaaat ccaagctcta atttgggggg caattaacta tttttttggc gaacgcaaca 2761 ctaataaact agagcaaaaa actccaacaa atagtattga cgcgttatca ataggtttca 2821 agaatcaacc aaaaactgta aatttaccac agcgtccttc ttcctcagtt ttgccggaaa 2881 gccctgattt accaagcgag aatatagaag atccttggtt aactatggat gatttgtttg 2941 gggatttgca agaggttaca gaggttgtta acgagcaaca attattagtg acatcatctg 3001 agtctccaaa atctgctctc cccgcaagtc cttctgagtc tccaaagtct gctctccccg 3061 cgagtccttc tgtgaagata aggagacaaa tcaatttcaa atcgttcagc gtcctgagtc 3121 aagccaagga gctaattcaa aattccaaat attccctacg gtcaagctac gaaaataaaa 3181 
tttggaagtt ctcctcatct tccgattccc ctatttctca atttgaaagt gatagcaacc 3241 taattaagga taacaaggga gaaattcttt atcagcagca gaaaacgact caagttgagg 3301 caaagccaga ttggattgaa acacaagctc aaacgattgg gtatgcaaaa catcctctgg 3361 agcaagttct tgaatggcta gaccgcatca tgctttggct agaagaaatt ttcgtgaaaa 3421 ttggtgtatt tttgcggaag gttttgcgaa tcaaataaca tttgagctat catagaaata 3481 ctatccatca gctgctgctc tgtcaactca ccgtaaccaa agataaattc accttgagag 3541 tgcggtccca gataatgagg tgcagcagac atcatgctga taccatgcaa gacagcacgt 3601 tggataatct cttcatcact gaaatcagtg tgtagtcgca ccatgacatg aattcctgcc 3661 ttttccccta aaatcgttgc tttttcccca aaatgaacat ttaatgcctt cacaagcgct 3721 tgacgacgct tatcgtaaag cgatcgcatt tttctaatat ggcgttctaa atgcccttcg 3781 ttgataaagt ctgtaagaac ttgttgttct agtattggta agtggcgtag acgcgaagcg 3841 gcttgtcgtc agacatcgct caaccattta ccacgagcta aagcagaaac caaatttttt 3901 ggcagtacca aataaccaat ccgcagcgaa ggaaacagca ctttggaaaa cgtaccaata 3961 taaagtacag aatcactgcg atccaatcct tgcaaagctg gaataggtct gtcaccataa 4021 cgatattcac tatcatagtc atcttcgatg atcaaagctc ccgtttttcg tgcccaagtg 4081 agtagttcca agcgtcgggg tagcgaaagt attgcaccag tgggaaattg gtgagaaggt 4141 gtcacataaa caagccggat ttgttcgcta gaatggtgag ctaagttttt aaccaccaag 4201 ccagactcat ccacagcaat aggtaagagt ttagcaccat gagtctgaaa gatgagccgt 4261 gcgcttaagt agcctgggtc ttctaaaccg ataacatcat caggttcaat gaacaagcgg 4321 acaattaaat caagtgcttg ctgcgtacca ttgacaatca gcacttgttc tggtaaaaag 4381 ttcacagcgc gagaacgaga gagatatcga gaaatagctt cccgcaaagg tttgtatcct 4441 agaatatccc ttgagtaatc gagccattgc aaatcagagc aacagtgatg agaaagtagc 4501 ctgcgccaca gcttgatagg aaactgctct aaagctggtc gtccgtagcg gaagttaatc 4561 gccatttcag gttcaggaat tctggggaca tcttctgtct taattaaata atcaccatac 4621 ttagataatt tgactggtga acgagtcatc tttccagttg atggaatagg cgctgaacac 4681 agtaaatcat caggaagttg agtgcaaaca aaagtaccag aaccgacaac agtttgtatg 4741 tagccttcac tcaaaagttg atcgtaggtt tgagtcacag tcgtgcgaga aattcctaaa 4801 gatttggcaa gctgacgtgt ggaaggaatg cgccttcctg gtaataacct tccgccaaga 4861 atggcttgac 
gtagttcttc gtaaagctgt tgatgaagtg gtagaggaga attactatcc 4921 agcgtaatgg caaaatccat acttgctgag ctttcaaagt ggacttatat caaatcataa 4981 aagtggctct tgtaaaagac cactatggag aattacgctg ttattgttgg aattcaaagt 5041 tcctaaaaat gctgatagaa ggacaaatta atatcagaga agcaactcca caagaagact 5101 cactgattgc aaagcacttt taccaaatgt ggcaagatat tggcgttcct gatgacgcca 5161 tcaaccctaa ctggcttgag ataacactcc agtttataga acaggcgcgt cgggatttgt 5221 tttacaaagc gtttgttgca gaagttaatg gtgcagttgt tgtaggttct gcaagttgtc 5281 aactctactc aggtttaaac ccaaatgttt ttatcccaga ataccgcaaa tatggataca 5341 tttggtgcgt ttacgttgag ccagcttatc gcagacaagg tattgctaag caattaacca 5401 gcacaacagt taactatttg aaagcagtag gttgtacgcg agtggttctt aacgcctctc 5461 catcaggtaa accagtgtat gagcaacttg gtttttccag tagtaacgcc atgcatttag 5521 atttgtaaag ttacaaaact caatacttac agcaattttg tgcaattgga gtccaaaaag 5581 atgagtactt taaataatca aatcaaccag gaacagttca ctcccacaca acgcacctca 5641 attaaacgag tttctcagcg cggtcattat gagcgtcagc ttatctatga gattttagat 5701 gaagggttga tttgtcatgt aggatttgtt gtagataacc agccgtttgt cattccaact 5761 gcttacggtc gtgtggaaga caaactctat attcacggtt cacctgcaag tcggatgttg 5821 cgttctctgc ttactggtat cgaagtttgc gtaacagtca ctttgctaga tggtttagta 5881 ctggcgcgtt cggcgtttca ccactctatg aactaccgtt ctgtggttat atttggtaca 5941 gctaccctcg tgcaaggtgc tgacgaaaag ctagaagcac tgcgagcttt tactgagcat 6001 attgtaccag agcgatgggc agaggttcgc ccacccaatc gtcaagaatt acagggaact 6061 ttagtgcttt cccttccgat tacagaagca tctgctaaag tgcggacagg tccaccactg 6121 gatgatgagg aagattatag tttatctgta tgggcaggtg ttttgccttt gcaagtggtt 6181 gctggtgatg cgatcgccga tccacgcttg catacaggaa ttacccaacc agattacata 6241 caaaattata cacgtcttta tgtggagtct gagtagtagt caatactgta ggcgaaaata 6301 aaacttgttt tgcttgattt ttcaccaatc gtccatgtcc aatcgcacaa cagacaaggt 6361 atcatctgtg taaaggaaga gccttgatcc cctttttaga ccctgagagt aactaactct 6421 aaagccggtg taatcaaaaa acggtcgaga gatttgtgtt tttcaacaag tttgtagatt 6481 gataggcgat atttctttcc aagcttctga ttcatttgca gcacaccaaa ggtcatattt 6541 ctgctgaaga tttaaccaaa 
gttcaggagt ggtttgaaat gctctagcca atcgaagcgc 6601 catattcggt gtaatgcttg cctgttcatt aactatctca gatacagttt tacgagatac 6661 atcaagaatt tcggcaagtt cagtgattgt catgttaaga ggttcaaggt attgacgctt 6721 gattaatcct cccggatgtc tgggttttct tttagtaatc atgatttcaa attttagtgg 6781 taatcctcgt agtttacgtc gtaagcatcc ccatcctgaa atgtaaaggt gacacgccaa 6841 tttttggaaa cagtaactga ccattctcct tttctgtctc cttgtaattg atgaagtcca 6901 gaaccagggt atcgcatgtc ctgaatttca gaagctgcat caagtctgtc aagtatgtct 6961 aaaagcttat ccgcaaaaga tgggttgata ccgcttctat catcatcttc aaaaagcttt 7021 tttagtcctt tatgctttat agttttaatc acatccctaa ttgtaaccct tagggttaca 7081 tctgtcaaac gccaaaattc attgtgaggc gagaaaaaat agggaagcaa ggactcacgt 7141 aaatctcatc atattcactg aataaatctg acggaatcaa tcactaatcc cgcctgacac 7201 cattgacaac tggcttaact tcagaaggat cagaaaatgg atcggttccg ccactccaat 7261 tgaagtcgct aattcggaaa cttataccac ctaattcttg cactggattg taacgcaagg 7321 taattccata agtgcggcga ctatactcta cgatgtagtc ggtgctggta cgtgccccag 7381 tgtccaagtt gattgttgtt tgaaagccaa cccggaatgg accgtaaatt tgctgtacaa 7441 tttcaccact aaatactctg ttatcaacag aacggtcaaa tttgaaaggc gataagccac 7501 tatttagacc ttgggagtaa ctaactctaa aggcggtata atcaaataag ggtcgagaga 7561 aatgaccaaa ctgcccttct agaccaaccg taccactaaa ggtgttttga gtatcaccac 7621 tgctgtaata actagtagta ccagtcagcc cagcaaaagt ttgaacaaag ggaacgactg 7681 ggtttggtgt atatcgtaat ccctcagtgg gtgtcggtgg taatcctttt ccccgccaga 7741 gtacaaagcc tttattgaga gctgcactcg cttgcagacg actcagtgaa atgcgatcat 7801 tatctcgctt aggtttgagc aagtcttggc ggtctgtttc agcatcaata tattgagcac 7861 tgccttgata gctcaaagtg acaccgctgt tacctaaagg aatcacagga gaagtaagaa 7921 taccgccaat actactctgg acagtttgaa agccgaggct accattataa aagcgatcgc 7981 gataactata ttcaaatgtc gccttgtagg gattttgatc acctaataac tggcgcagct 8041 gcaaattcgc ccgcaaattg ttttctatat cccctaagtc aagactcgtc aacaatgcag 8101 aacccgttac agtggttcgt ggacccaaag tagagtttaa tcttgacttg aagccaaaca 8161 gcgatacaac attaccaatg ttaccagtga ctgctttttg agcaaaaaat tctggtgcaa 8221 tgctaaatct tgtcccttct gtattgatga 
gttggaagct acgctcaata aacaagccgc 8281 ctcgcttatc tccatcaaag ccaatagatg ctattgcagg ggtaacgtct cgcttgttac 8341 ggtcaatcac ctgctcatct ttgggaatgg gtagagacgt tcgttgatcc aagactaaac 8401 gttgtccctg tgtcttaatc cggtctcgta aaggtgtctc tcgcgtcaga gtcactttgt 8461 ctgccctgat gactaactct ggaggtgaaa aaggatcgtt cgtcaggcgg acatcttttg 8521 cttcccaacc tcggggatta aaatcaactc gtccagcttc aaaacgtact cgtctgactt 8581 gaccgccttg ttttggcaca gctaggttat tcgcacccct tctaccacct acgttaatgt 8641 tgagtccacc aggactgctt gcttgcaaag gttgattttc tataatgcga tcgctcagag 8701 gacgcgctgg tgttcctccc gctgtgatgt ccgtgggtag gaaagataaa tcttgccctg 8761 ctgttggtag gaaaatatca cctctgccgt ttgtcagttc tccattatct tggacaaggt 8821 tataagtaaa gcgctcacca cgtaaaattt gaccgccccg tgttaaggca acatttcctt 8881 ctccgactgc aatcaaattg tctaaactga cttgcaggcg atcggcatct acaactgctc 8941 catcaaaccg cacaaccaca ttaccttcag cggtgacaat ccgccgctgc tcgtcatact 9001 cctgtctatc agaagtcact tccacaattc tttctctgcc tgtcggcgtt ttggatggta 9061 cagtggttcc tgatggttga ctttttggct caattgtgct gtccggttgt ccggtcggtg 9121 ctgactgatt ttgggaattt ggggtgggag tggacggttc agttgtttgg ttgcgagtct 9181 tgaattctat ttgtactggt tgctcagtcc ctgatttttc tgtgaaaacg ttaccctgat 9241 ttggtgtttg ggctgtctta aggtcgcgtc gcgtctggtt gtgagataag ttctccagtt 9301 tggtgagtgt ttgagtaact ggttctccag caacgaatgt tttagttggt gcatttgtac 9361 tgtcaccatt ggatggatga gaggtttccg gctgctgtag cggataaccg actgaaacct 9421 cacccaatga agctgcactt ttagaagcgg tgatgggaga aaattcaggc ggtaaacttt 9481 ccggctcatt tggtgcggct aaagtcttag cactataaat taatggcttt tgctcagcag 9541 tttggagtgg aggcgtttcc tgttgtcttg ctgatttctc tttatttagt gcttttgtgt 9601 cagtcgtaac gactgacatg ggatttgcag attgtgaaga ttcgagaata ggaggcggtt 9661 cgggcggcgg aactggatga ggcataatct agcaggaatc aagctaacaa acagccaata 9721 aatagccgga aggactttat acattgtgcc ttccagctgt tgccttggat ctagattaag 9781 attaaggcaa gccgccccgg aggaggacac gcttttgtat gagaatagct ctgggaggta 9841 ttcctatgca ggcgactggc aaaagaattt cccaatttga gctaagtgtc ctgtggcgta 9901 taatgtcgtc cgccttccgg tagtcgaaga aaatttgtcg 
cgcttttact cggaatcctg 9961 acgaccaggg ttacgagcag ggtcgttcga caaaaatcca aagacgaaaa gactgacaaa 10021 gaaggtaaca actatataaa caacgatttt taaagtcagc attagatata tctccttcgg 10081 gtctcttgaa gctttcagtg tctgcgatgc accaaaagat agctccttta tcttacccaa 10141 atattgttct cttttgagag aacactgata gtagatgtct aaacagttgc cttgcgccaa 10201 gtcggttgag agtcttttgc tttgtttcca agaaaacttg gcaaatgatg ttagcgcgca 10261 acgagtgtcc cagttgaggg ttatctgacg ttttaatctg ttctatattt actggacatc 10321 tttggacatc ggtcaatatt tcttgcttca acctggggag tcggctcctt cgggtccaca 10381 agcgttaagt tcggggcgaa ccgaccctac tccttaattt taagcggtga aaccgttaat 10441 gtctctagtt atggttagac gtttacttga gacagatgtt tcctacagtc tcaacttttg 10501 acatcttggt aggtttcaca atgccctgaa ataaattttg ggcttgtagc taaagtcagt 10561 tttaacttac taaaacactt ataaacatag gtttctaccc gttttaacaa gtttcagcta 10621 tgagcgagaa atttatttca aggcaagtca tttaaccgac agcagaaaac tctcgcttga 10681 agaaagcttg taatactgtt tgggctattt gaggactcgt caaactgatt tctgccttag 10741 attgatttgg ttcggcatgt tccactaaat catctggcac acctatgcgt ttaacgggaa 10801 caacgatgtc agcatctaac agtgcttctg caactgctga accgaagccg cccataacac 10861 aaccttcttc caaggtgact acgcgcccaa ttttctcagc caatgggaaa attaattcgg 10921 tatctagagg tttagcaaaa cgggcattaa tcacagtggc ttcgatgcca tgttcactga 10981 gaatttccgc agcttgcagt gccgtattca ccattgtgcc gtaacctagc atcaacacat 11041 catcaccatt acggagaatt tctgctttgc cgatttctag ggcttcccaa ccttcttcca 11101 tcaggggaac gccgtaaccg ttaccgcgag ggaagcgcat ggcaattggt cctgtggtat 11161 gattcacacc agtgactatc attcgttgca gttctgcctc atctttgggt gccatcagta 11221 ccatgttggg aatgcaacgc aggtaggcaa tgtcgtacat gccttggtga gtcggaccat 11281 cagcaccgac gattcccgcc ctgtctagac agaagaacac tggcagattt tggatacaaa 11341 catcgtggat gatttgatcg taagcgcgtt gcaagaaggt ggagtagata gcgacgacag 11401 gacgtattcc gtcagttgct agtcctgcag caagggtgac ggcgtgttgt tcggcaatac 11461 cgacatcaat atattgattg ggtaatttgg cttgcagttt gtctaaccct gtccctgttg 11521 ccattgcagc ggtaatacca ataattttgg ggttttgttc ggcaagtttg actagggtat 11581 gggcaaagac tttggagtaa 
gcagggggtt taggtttact ggaaggagcg gcttttccag 11641 tggagaggtt gaaggggttt tgggcatggt agccaacttt gtctagttcg gcaatttcat 11701 agcctttgcc cttgactgtt gcgacgtgta ccaaaactgg tcctgggatt tgatgtgctt 11761 gttggaaagt cgcaatcaat tcctctaaat tatgcccgtc cacaggtccc atgtaggtaa 11821 agccgagttc ttcaaaaact gcccctactt tgggaacagc taagcgcttc atcccttctt 11881 tgatgcgcgc cagttccgga gaaagggatt cacccacgaa aggaatttgc ttgaactgtt 11941 cctcaaagtt atctgtgaga aactgcattg gaggactgag gcgcattttg tttaagtagc 12001 ggggtattgc gccgacgttg ggagatatag acatctcgtt gtcgttgagg acaacaagca 12061 agttggtttt gggcatgtgt ccagcatggt tgatggcttc taatgccata ccaccagtca 12121 gtgcaccatc accgatgaca gcagcaactt taaatttttc gcctttgata tcccgcgcta 12181 aagccatacc caaagcggca gaaatactgg tagaagcgtg tcctgcacca aagtggtcaa 12241 acttgctttc accgcgcttg agataaccag ctattccatc tttttgccgt aaagtatgga 12301 agcgatcata gcgccctgta atcagtttgt gaggatatgc ttggtgtcct acatcccaaa 12361 tgactttatc acgatctaag tctagcgttt ggtaaagccc caacgtcaac tctacaacac 12421 ccaatcccgg tcccaggtgt cccccagtcg ctgctacagt ttggagatgt ttgtctcgaa 12481 tctgatgggc aatctgttgc agttgtctaa ctgataaacc atgcaactgg ttaggatggg 12541 taatctcact caagtgcata ttgtagtgtt ttcctctcta ctttcttcgt atttttgatt 12601 ttcccacgct caggtgaagc agtacctcat acagtctcgc atttatttac caagactgaa 12661 attaaggtac ggcaacagtt ggaacggaaa tcaacacctt tgcatttaca aagttttacg 12721 taaatgccca gagtccttaa aacaatgtat atgttcactt cagggagaag attctgggtt 12781 attattacct cctggaactt ctggtgtttc ggcttgtggc tgttgtcttt gcaaatattt 12841 acttaatata tgagccatct ccccacgagt catgggttgt gagggtgaaa gattgccttg 12901 tgtatctgta ttgatgaatc cctcagttgc tgctgtggcg atcgcctttt ttgcccaacc 12961 cggaatagac gctgcatcag gatattggga aagtatttca tttactgttt gatctggaaa 13021 ctgaaagact ccatatgctt gagcaaaaat cgctaaagct tcagccctgg tgattctttg 13081 attgggaaaa aaaagattcc cgcgatagcc tttcataata tcggttttta agactatctg 13141 aatgtcatta aacgctggat tggaaggagg gacatctgtc acttttacat tctccttggt 13201 gacggcttgt cgtttctcca gatgaaaagc tttcaccaga atagaagcta attctgcccg 13261 
actgagcaaa cgttctggat agaactgtcc atctggatag ttcgtcatca ctttggcagc 13321 aactactcgt gcaattgagt tggatgtttt tttagtactg ggtttttctt gggcagaaac 13381 tatgttagga actaattgca atagtgcgag caatgaaaga gtacctagaa gctgacgcat 13441 gatgacaacc gctttgaact gctactgtga atgcaagaat taccaacttt ggtttacctg 13501 tttgcggaaa acacagcaga taattagacg agcttctgtt tggtaatgtt gcctataccg 13561 ttctacaaca ctaggacaaa actagacaaa gttagacttt gttagacact gcccagattg 13621 cttcctgttt catcctggag atgtcggcat caaaccagtc cacaggctta aatttggata 13681 gaacttatcc tatttattgt gttttctgag aaactttgct actctacccc attattttcg 13741 atcctattaa ggtttctggt tttttcaacc gccttgttac aggtaggatt actcttaaga 13801 gttttgttca ggttctcaag ccttaaattt ttcgctcttg tttcgcgtta acactattgt 13861 agcacgtgga caaatagtct aataaagtct aactttgtcc agcaattaag ttacagttca 13921 aaaccctaga aaaattctct caaatcaaaa caaccacaac caaattggca gagtcaccag 13981 caaaaccata gaacccatag ccaatgcagt gacagcaagg tcgcggtcaa gatgaaaagt 14041 ttccgcaatc accagtgtag caaaagctgg aggcatagcc atttgcagga caatcgcccg 14101 tgcggttgtc ccagttaaac caaaaagtgg taaaatacta cctaaaatca gtggcacaat 14161 tagcattttg attcctatgc tgatgcttgc ccttggtaaa ctgtgccacg aagtcagttt 14221 acttagtcgc attccaataa gtaccagaga taaagcaaca caactccaag caactaactc 14281 tagccaaaat tccactggtg cagggatttt tacctctcga aatagtaagc caaacccaaa 14341 actccataga tctggattaa taataattgc cttaataacc tgcaaatggc tatgaacacc 14401 accaccaaaa cgagctgcca aggcaacacc gagtccataa gcccccaatg ttgtccccag 14461 taaatcataa aacaacgccc aagcaaagta ttgtgtacca actattgaca gggtaacggg 14521 atagccaata tatcctgtgt tacccaccat tgccgctaag ataaaactgc cttgagtcga 14581 tttgtgggga gcggtatttg tgagataagc ctgtccttta attgctgcaa gagcgaaaag 14641 tgctcctaat aaaatggctg agtgggcgat tgcctacggc agagcttcgc ttaacgctgg 14701 cgcaatccaa atttgtgctg ataagtcagc cttgcgtaag aagatgataa tacctatcgg 14761 cactcctacc caaaaaagga actgtcccaa acggatagga actgtcgtag acagcctgcg 14821 tcctaaaagg tatcctacta ggactaggaa cactaacttg agatagagtt ctaggaggtt 14881 tgtcaaaatt aaaccgtaag atgtagatgc aaaattttac gtattgcctt 
cccttttctc 14941 cagtttacaa tttgttatga tattgagcaa acgcaggagt tgagctttgg ataatgctgg 15001 gttttccgag aaaccgcaaa acaaaccatc tagttccggc gaagatattc cggaagcttt 15061 gcgtgatact cctgatgcag cagccaagaa ggtcagcatt cgcccgttgt ttttaatcat 15121 tgggggagtg gtgggagttg tgttgatagc cgttgttagc ggttttttgt ttttcgttgt 15181 cacacctaaa aagacgaagg attctcagtc cccccctgct agttcaactc ccgcaactcc 15241 atcaaaatcg ggtaattcag ccacctcaaa agataataca gttttaggtc attttgttta 15301 ctcagaagcg cctgagtctg aactacaacc gatttctaat gatagacgta taaaaatgcg 15361 aaaagcagcc gcccaaaagt ttctagaaat ggcagcagca gcgcggagtg caggtgttgt 15421 attagtaccc gtttccggtt ttcgctcgat aaaagagcaa gagcagttgt tttttgccgt 15481 tggtgcccag cggaatcaga ctccagcgga acgggctgct gttagcgctc cccctggtca 15541 tagcgaacat cacacaggtt atgctgtgga tgttggagat ggggcagcac cagcaacaaa 15601 tctcacgacc aactttgaaa aaaccaaggc ttatcaatgg ctacaagcta atgcagcacg 15661 tttcagcttt gagatctcct ttcctaaaga caatgctcaa ggtgtgagtt atgaaccttg 15721 gcattggcgt tttgtgggcg atcgcgatag cttggaaact ttctacaaag ccagaaattt 15781 gaaacctgct cagatctcac aacaggaaga gtcgagaagt ggggccagat gagggagtga 15841 ggaagtgagg gaaccaccag acaatttcgt agcttccgtt atataaatcc tgctcaaacc 15901 taatcaacac ctatcaaagc tattcaattg ggtttgagtt tcgtttactt cctgcttcaa 15961 tctggagaag tcggcttcaa ccagtccaca ggcttatacg gtctaactga ccgcgttctt 16021 caaattgatt gccgcgttta aatcacggtc aatcacgaaa gcacaatgac cacactcaaa 16081 cactctttga gatagtgaga gtgtttcttt tttggttcca cagttagaac aagtcttaga 16141 actaggaaac cacctatcaa ccacaaccag ctttgaacca tacagttcgc acttgtaagt 16201 tagctggcga cgaaattcaa agaaactcat atcagctatc gctttggcta atttgtgatt 16261 agccaacatt c // LOCUS NODE_2072_length_16177_cov_4.89771716177 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 16177) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16177) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16177 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 370..1326 /locus_tag="DP116_17915" CDS 370..1326 /locus_tag="DP116_17915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128224.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor, RpoD/SigA family" /protein_id="PRJNA477356:DP116_17915" /translation="MKTAQTATDLVRTYLREIGRVPLLSHEEEIHYGKQVQRATILQE VRESLAIHLSRQPTLEEWAKATELEPKELNQAIAEGEIAKRKMVEANLRLVVSVAKKY IKRNVDLLDLIQEGSIGMQRGVEKFDPTKGYRFSTYAYWWIRQAITRAIAEKGRTIRL PIHITEKLNKIKKAQRQLSQNLGRAPTALELAQELELTPRQVREYMEKARVPLSLDLR LGDNYDTELGEMLEDPGVSPEEFVAQSSLSFDLERLMGELTPQQREVISLRFGLNDGQ AHTLASIGQLLSISRERVRQIEREALTKLRKVKAEINEYLAS" gene 1589..1939 /locus_tag="DP116_17920" CDS 1589..1939 /locus_tag="DP116_17920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210374.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17920" /translation="MSNPSNRVSESEFFHSEPETGDLLWQYVKSLSPETVTQLSKPSS PEVFQVMERNIVGLLGNLPSEHFGVSITTSRESLGRLLASAMISGYFLRNAEQRMNFE VALQGSETNNSDAG" gene 2076..2483 /gene="mutT" /locus_tag="DP116_17925" CDS 2076..2483 /gene="mutT" /locus_tag="DP116_17925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114780.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="8-oxo-dGTP diphosphatase MutT" /protein_id="PRJNA477356:DP116_17925" /translation="MSETAVLPHKIIGVAVIWNDQGQILIDRRRSEGLMGGFWEFPGG KVERSESIQECIRREISEELAIQIEVREHLITIDHTYTHLHVTLIVHHCRYVAGVPQP IECEEIRWVSLDELESYTFPEANSQIIAVLQSP" gene complement(2475..3230) /locus_tag="DP116_17930" CDS complement(2475..3230) /locus_tag="DP116_17930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871986.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17930" /translation="MCSSGQDANQLWALVQASVAINLPLSLEELSVSDNGQGISTEFL PYIFDSFRQADSSNTRKQTGLGLGLAIVRNLVELHGGTIYATSLGLGKGSTFTLKLPF LKVKGEGEMAKKTCSSSNSSSLDGIQVLVVDDETDARDLLTIVLEGVGASVTAVGSVS EALNIIELFPPDVIVSDIGMPEENGYSLVQKLRNLETKIGKHIPTAAVTAYARAEDRR QALLAGFEIYLPKPVEPAELIAVVGNLAGRTLR" gene complement(3544..4728) /locus_tag="DP116_17935" CDS complement(3544..4728) /locus_tag="DP116_17935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_17935" /translation="MQVSQDQPISSDAPLQLLLFVDGRPKSKQQVQRIRSYLKELEAD YDFELQIVDVGQQPYLAEHFKLVATPALIKVHPEPREILAGSNITGQLKAWWPRWQAA MDAYLQLESDLHEHDENGRVKELKSSIRSVAVSAELMRLSDEVFRLKQEKEKLQEQIQ FKDRVIAMLAHDLRNPLTAASIAIDTLQSNYNIEKGEFERLTPKMTVHLFKQARSQTK IIDRMITDLLRVGHGKDREFFIQPQKVDIGKLCLEVLEELRDRYTAKSHKVNKDIPKD LPNVYADPERIRQVLINLLDNAIKYTPEGGDISVCGLHRTSQKIQFSISDTGPGIPED NRDRIFENHYRLERDQGKEGYGIGLCLCQRIVLAHYGQIWVETAPHGGAWFHFTLPVY PN" gene 5737..7113 /locus_tag="DP116_17940" CDS 5737..7113 /locus_tag="DP116_17940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319111.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S-layer homology domain-containing protein" /protein_id="PRJNA477356:DP116_17940" /translation="MAVLLTCLTACANSPSAKNIEQSLGADPKLKNNPVTFGAQEVRS SQQLQTPQATVQLPTDFPKDIPLYPNATLQEVTPSSSENPSVSSRWLSSDPSNVITSY YRQQFQANNWQIVQGASTDEPKSSFEVRRNDIQLKVSIQPKTVTNAAPNQPQTSTEIQ ISYSPLSTTTAQANPTPTPQATDKTVQSGESQFVGPVPSQDSTAQPNATLESKNPTPN ASSLPKSQELSDLNKVPQEFRQYIQDLAALGVFNVESNALKGNGTTTNQFEPAKVVTH RQFAHWLVAANNAMYANKQALQIRLASESSQPTFRDVPKTDPDFRAIQGLAEAGLIPS SLSGDSTAVLFQPDSPLTREQLILWKIPLDIRQALPSASIDAVKQTWGFQDAGKIDPK ALRAVLADFQSGEKSNIRRVFGYTTLFQPKKPVTRTEAAAVLWYFGTQGEGISAQEAL KLAGSPSQ" gene complement(7305..7901) /locus_tag="DP116_17945" CDS complement(7305..7901) /locus_tag="DP116_17945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319110.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glyoxalase-like domain protein" /protein_id="PRJNA477356:DP116_17945" /translation="MVIAVSMSQFQIEPFILGSFLPSLPLDSLFSTQGIMVMLLAAYA GAMWMFLTSAPKVHTVMVSDMEVARQLYEGLLDLPAADVPLHYYYNYEQTLGATGVDP LYLSSSPSFSGSRMSNASEGLWYQLKKNTQLHIIMGASLGSKNQQRHVCFDRDCLEMI LMRVETRGLKFKIRSEKPLNFLVKDYEGRIIEMAEVAS" gene 8476..9114 /locus_tag="DP116_17950" CDS 8476..9114 /locus_tag="DP116_17950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864920.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sigma-70 family RNA polymerase sigma factor" /protein_id="PRJNA477356:DP116_17950" /translation="MQIPYFAEAHHPLIKSLFYHSDTELLTLVQQNPDSGRYFTAIFC RYNPIVYTLIRHSARSPVQADYLFALTWRHIYNELCGLDLNSSQFTKDGLNVQNWLIN MTAYCINEIQLPPTEAIHYSLKTTSPPLLCYVEQALDQLPPILRLMVLMSQTFRWSET RIAAYLQAEGETINPNEIVHFLQEGYRMMEEKLPTDIRAIYLGENLIQPGAA" gene complement(9205..10089) /locus_tag="DP116_17955" CDS complement(9205..10089) /locus_tag="DP116_17955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319107.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17955" /translation="MNQQYEKLSRSAKFLGVICGGLLIGLPAIPQAMAQQSVLQQPRQ QTNSKTNPCPSIFYQEPHNTRVLVPQGCPPNAITRQLEQQGRLTPGSVSNEPTPTQEQ IRQGVGGETPYNNRSDNSSESYSSQTQSTTMSSGQDMNSSSSEVRTYSSQGPTGNYTA RTYSQSQSNPSSQQRQNSVIVPPLPEQNQAPIALVTPANGKVSVRLKNNTNARINYEA IGYTGQRTLSGGQEIVLQNLPLPVSISTIRQDNGFVKVTPTSTESGMIELSLSEQRNA NDKQGVVRIREDGRVFLN" gene complement(10386..10901) /locus_tag="DP116_17960" CDS complement(10386..10901) /locus_tag="DP116_17960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2267 domain-containing protein" /protein_id="PRJNA477356:DP116_17960" /translation="MTNTPKSLDEKQSSPIDSKDIPFLEKVKANGKLGDIYDARDITE VVFRVMRDLMTTEAADRVAEELENKPAEITDEKALQNDIVELWKDTNPIVGFLSRIRP PWQGPGIFKIDSDRFLFRVANEGGLQPNVDREEVVKAVFSATKEELSPERIQEIASWL PDKVRQLWEEA" gene complement(11197..12663) /locus_tag="DP116_17965" CDS complement(11197..12663) /locus_tag="DP116_17965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458370.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_17965" /translation="MFQATRRRLALWYTAVTAVLLLLFASGVYLYVRSTLIERIDDTL NHVVEVVERSLVIEPVKSDNGQFRVNVEASFRDNTDTAEDDRIDLEWFSPTGELLWST FSQPLNIPLHFNRMGETVKVVRKENMGDKGDKATRRQGDKEEFTFPTSQLLLRQVTER VQVGRQVLGYLRVSHPWFEVTKPSRQLIFDLALGIGLMVFSVGASGWFLSGKAMEPVH ESYQRLKQFTADASHELRSPITLIQTNVQVALTDLELAEVDNSISSHYRQQLKVVERL TQRLGKLVNDLLFLARQDSGISKEIFSPCPVDALLMEVVEEQQLLATEKNITFSLDLI DPPACETDPELLDNWFTVVGNWDQLVRLFTNLIGNALQYTPAGGCVNVELARNEATNR VSGLRYNSAYLLFKVKDTGIGIPPEALPRLFDRFYRVDPARTHTAKGTATQNATGSGL GLAIAQAIVESHQGQIQVESTQGIGTTVSVTLPVTFDF" gene 12780..13775 /locus_tag="DP116_17970" CDS 12780..13775 /locus_tag="DP116_17970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17970" /translation="MQRKLEKLERQQYQCLQKALLSAFPHRTKLEQMVRFGLDENLEE IATGENYGDVVFKLIEWAETNGNLENFLIAARNKDCEGNPGNLQLKRICKELLQAQTT TKQSHRLMNPCKFDLGELIRSCLNILEDKQGLVGLAVPYNQDPFLIYFCERLKERIGK SHTDNKQPLTLDNYRTSVDTAVMTMKRYKRLLQKGDVICPIRVAVSDPNSSHEFWKKV SAEFQEPQNFSEHRLIIIMVSSECKFFPQGVTQITPPQFTKADAHEWILEVTDNLGWR EEDRNKWKRYMIDECFESECLNTRLVYEHLEYAIKLLQQNHTAETFLQELKQANY" gene 13781..14755 /locus_tag="DP116_17975" CDS 13781..14755 /locus_tag="DP116_17975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179962.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MoxR family ATPase" /protein_id="PRJNA477356:DP116_17975" /translation="MSKRSVDTSTKYSTIAYESLPDSLEVEDSEVDPVSEKDPPKKKI YKEPYLPDKKLAEAVDLAIALGRPLLLQGEPGCGKTRLAYAVAYALGLPLEVSYIKST SRAQDLLYTYDAVNRLYDAQLGADGPCKNGIPLSRDIGNYIRLGPLGRAIARAQYERR SVVLIDEIDKADLDFPNDLLWELDRLEFRVTEAPDIYYAVGDNPALRPIVFVTHNEEK ALPTAFLRRCIFHYVEFPQTEELLQQVLATHEISNQQLSEKAIKVLLKLRGLDLSKRP GLSELLDWVGYLEAVKTPVEELDKLPYLGTLLKQESDRQRAITEYPKQ" assembly_gap 14812..14821 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 14877..16070 /locus_tag="DP116_17980" CDS 14877..16070 /locus_tag="DP116_17980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179961.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_17980" /translation="MFPLTPDDYETKQPKLPAFLWELFQKLRRRGFPLTPDDYETLRQ SLQAGFGWTSQEALRDLCNSLWAKSRQEQEILTALFNQLAPKNEDWQLSSVQVEKDFD ATHSSNKEQHQNVPEHQEHDEIVTESCSGLPPISLKDVQLSERRFIFVPQFPLTYREV AQTWRRLRRPVRVGPATELDVELTIARRCQQGVTASVVLKPRHRNVARLLLLVDRQGS MTPFHRFCEEVCTAIQLAGRLEETAIYYFHNVPAEGADEQVLEPLGKELFPVLDSILP EITPLKTGYLYEDSDLLSPIALEEVLQKHASDAFVVMMSDAGAVRKYYNVVRLLDTIS FIKALRAYTLNYVWLNPLPKSYWKDNTAAQIARHVPMFPLNREGIQQAVNVLRGQQYI IEKPL" BASE COUNT 4667 a 3532 c 3435 g 4533 t 10 others ORIGIN 1 aaataagttt attaaaaata tctaaaacta cattaggctc aaatcacatc caacaggcat 61 aaaatcagtg cctgacagct agtcaaatat tcctcataaa tgttacaatt tttaacaaaa 121 gaccaataag aataatgttc aagtggttac cttctatcta accgtagata cttgagaaaa 181 gggggaaaaa cctgtaactt ctagcaataa ggaataaaat ttccagacca ttaaccaccc 241 ttgtttatcg agcgcaaaag tcgtgaaata cagcatgggt tcccagtcta actgggtgga 301 catccaaaaa aggcatctat cagcccgctt accgttctgt caaacccgtc gataccgaag 361 agtagtgcca tgaagaccgc tcagacagcc acagacctcg tgcggactta cctgcgtgag 421 attggccgtg tgccactttt atcccatgag gaggaaatac attatggcaa acaggtgcaa 481 cgtgcaacca tactgcagga agtaagggag tctcttgcca 
ttcacttgag tcgtcaaccg 541 actctagaag agtgggcaaa agcgacagag ttagaaccaa aagagttgaa tcaagcaatt 601 gcagagggtg aaattgccaa gcgcaagatg gtagaagcca atttgcgact ggttgtgtct 661 gttgctaaaa agtacatcaa gcgcaacgta gatttactcg acttgattca agaaggtagc 721 atagggatgc agcggggtgt agaaaagttt gacccgacca aaggatacag gttttcaacc 781 tatgcgtatt ggtggattcg tcaagctatt actcgggcga tcgccgaaaa aggtcgcacc 841 atccgcttgc cgatccacat aacagaaaaa ttaaataaaa tcaagaaagc acagcgacaa 901 ctctctcaga acctaggaag agcccccaca gctttggagt tagctcaaga attagaattg 961 actcccagac aagtccgaga atatatggaa aaagcgcgtg taccgctatc tttggatttg 1021 cggttgggag ataactacga tacagaacta ggggaaatgt tggaggatcc aggagtttct 1081 ccagaagaat ttgttgccca atcttcccta tcatttgatt tagagcgtct catgggagaa 1141 ctgactccac aacaaaggga agtcatatct ctgcgctttg gtttaaatga tgggcaagcc 1201 catactctgg cgagtatcgg tcaactcctc agcattagcc gtgaacgagt gcggcaaatt 1261 gagcgggaag ccttaactaa actgcgtaaa gtcaaagccg agataaatga atatctagct 1321 agttagttag tcatttgtca tttgtcattt gtcattcacc aatgacaaat gacaaaaaac 1381 agcttctgct tgctgtaatg tacgacaaat aacaattagg tgaaacaatc cgttgctttt 1441 ggtacggtta cactcactcc cccattgtaa ctaagctgtt acattagtcc gtaggaatac 1501 aaaaaagcta aagtgttgtc ttacagacga aagatattta cggcgcaagt catatcgacc 1561 gcagttatag tagtcaagga gatataacgt gagtaaccca tccaatcgag tttcagaatc 1621 agaatttttt catagtgagc cagaaactgg tgacttgctg tggcagtatg taaaatcact 1681 aagtccagaa acagtcaccc agttatctaa acccagttcc cccgaagtgt ttcaagtcat 1741 ggagcgcaat attgtagggc ttttgggtaa cctaccttct gaacactttg gtgtcagcat 1801 caccaccagc agagaaagtt tgggtcgtct tcttgcttct gctatgatca gcggttattt 1861 cctacgtaat gctgaacaga gaatgaactt tgaagtagca ctgcagggtt cggaaaccaa 1921 caatagtgat gctggttagt ggtttgtgct gaaaaacaaa aaaatattgc aattcactat 1981 acaatacaga ctcggcaaac attaagatag tttgccgagt tttgttatga gttattagtt 2041 tttttcccac ctctgccact ctcttaaaaa caaacatgag tgaaactgct gttctacctc 2101 ataaaatcat tggcgttgct gtcatttgga acgatcaggg acaaattctc attgatcgcc 2161 gtcgttcgga aggtttgatg ggtggtttct gggaatttcc tggaggtaaa 
gttgaaagga 2221 gtgaaagtat ccaagagtgt atcagacggg aaatttctga agaactagca atacaaattg 2281 aagtgcgaga gcatttaatc actatcgacc acacctatac gcacttgcac gtaaccctca 2341 tagtacatca ttgccgctat gtggcaggtg ttcctcaacc aattgaatgt gaagaaattc 2401 gttgggttag tttggatgaa ctagaaagtt atacttttcc tgaggcaaat agtcaaatta 2461 ttgctgtttt acaatcaccg taaagttctt ccagcaagat tcccaacgac tgcaatcaat 2521 tcagcaggct caacaggttt tggtaggtat atctcgaaac cagctaaaag tgcttgtcta 2581 cgatcctctg ctcgggcata agcagtaaca gcagcagttg gaatgtgttt gccaattttg 2641 gtttctaagt ttctcaactt ctgaacaagc gagtaaccat tttcttctgg catcccgata 2701 tcactgacta taacatctgg tgggaaaagt tcaatgatat tgagtgcctc gctcactgaa 2761 ccaactgcgg ttacactagc tccaaccccc tccaacacga tggttagcaa atcacgcgca 2821 tctgtctcat catcaacgac tagcacttgt atgccatcca aggaagagga gtttgaggag 2881 gagcaagttt tctttgccat ttctccttct cctttgactt tcaaaaacgg cagtttgagc 2941 gtaaacgtgc taccttttcc taatcctaaa cttgtcgcat agatagtacc gccgtgaagt 3001 tcaacgagat tacggacaat tgcgagtcct agtccaagtc cagtttgctt gcgagtattg 3061 ctactatccg cttgacggaa gctgtcaaat atgtatggca gaaattctgt gctgatgcct 3121 tgaccgttat cacttacgct tagctcttct agcgaaagtg gcaagttaat tgccacagat 3181 gcttgcacca gtgcccacaa ttgattcgca tcttgcccag aactacacat tgctttccaa 3241 cctcaactaa cttgaccaca ttggtttatt gctaggtaag ttcaacataa acccgacctg 3301 agacggtaaa gcggttatac tgcttaacta ccaagcgcat tcagtcagta aatatattta 3361 atttgttcac ataactttgc aaaatgtcaa taattattaa agaaatctta agtagaatga 3421 ctcaatatac aatgtctttg aatttgtcta gttacacatc tgccgtgagg atgaggctga 3481 gggggaaggt agagcagtaa ggtgatcgag agttcagggg agtaggaatg tctgaaatat 3541 tagttaattc ggatacactg gtaacgtaaa atggaaccat gctccaccat gaggggctgt 3601 ctctacccag atttgaccgt agtgcgccag gacaatgcgc tggcataaac acagaccaat 3661 tccgtagcct tctttacctt gatcacgctc taggcggtag tgattttcaa agatgcgatc 3721 gcggttgtct tcaggtatac caggaccagt atcgctaata ctaaattgaa ttttttggct 3781 agtacggtga agtccacaaa cactgatgtc cccaccttcg ggtgtgtatt tgatagcatt 3841 atccagcaag tttatcagca cctgccgtat gcgttctgga tctgcataaa cattgggtaa 
3901 gtctttggga atgtctttgt ttaccttgtg agatttggcg gtgtagcgat cgcgcaattc 3961 ctctagaacc tcaagacaaa gtttgccgat gtctaccttt tgcggttgaa taaaaaattc 4021 cctgtcttta ccatgaccta cccgtaaaag gtcagtaatc atacggtcta ttatttttgt 4081 ttggctacgg gcttgcttaa acaaatgcac cgtcattttt ggtgtaagac gctcaaattc 4141 acctttttct atgttgtaat tagattgtaa agtgtctatg gcgatagaag ctgcagttag 4201 cggattgcgg aggtcatgag ctaacatagc tatcacccgg tctttaaact ggatctgctc 4261 ctggagtttc tctttttcct gtttgaggcg aaagacttca tctgagagtc tcatgagttc 4321 ggctgaaaca gcaacagaac ggatagagga cttgagttct ttgacacgac cgttctcatc 4381 atgttcgtgt aaatcactct ctaattgtaa gtaggcatcc atagcggctt gccaccgggg 4441 ccaccaagct ttcagttgac cagtgatgtt actaccagcc aaaatttccc ttggctctgg 4501 atgaactttg attaaagctg gcgttgctac cagtttaaaa tgttccgcca aatagggttg 4561 ttgaccaaca tcaacaattt gaagttcaaa atcgtagtca gcttccaact cttttaagta 4621 agagcgaatt cgctgtactt gttgtttgga ctttggtcgt ccatcgacaa aaagtaacag 4681 ctgtagtgga gcatcagaag aaataggctg atcctgggaa acttgcatgt aatcgtgttt 4741 cagcactggt aacaaaccga cgacgcttta cagaaaatac aaattgactg gcggttgtcg 4801 atggaatagt cgttctgttt tttttattta agatctattt tagattttcc cgactcctaa 4861 cctatatagg tttttaaaat cagatcaatt gttttttgcg ttttggttcc tttggagatg 4921 aaaattcctc caggaaaggg acttccaact gaagctgggt tgcaatcttg gtgttcataa 4981 atcattagtc aaatccaaat gaaagttttt ctaggactgc tgcatgacga ccgaaaattc 5041 tttcaagcac cagtacaaca gtccggaagc ttcccgtcat ttatcacctt ctgcaaaaag 5101 tttcttgtcg ataaaccatt agttcggtgc aactgaatga tgcaccacca acaaaagctt 5161 aggcaactca tacccaaagc tacaacagca ttccctttgt gaaacgaaat ctcaagacct 5221 gaaggaaacc gcagaggcgt tacactactt tatctacaac ctcgaccctt aagaacagcg 5281 ttgcttatgc caactctctt caaagtccaa cactgaagtt ttccaagtgt tgccatattc 5341 agcttatatt tattcaatag agacgatcca aaattggctg cctaggaaag tttgtaagcc 5401 gtctaggacg gacactttac ccaatcgcct ttttggaggg ctgccgtttg cgctattttg 5461 attctgctct tgagtgaggt aaataagaga tttactttca gcttaacact aatggtagtt 5521 gatgataacc accatgactt gcctgataaa gttctccaag gttcttgaat agccacaagg 5581 
tgactcagtg cgtgcaattc tcagacattg ttcgttgcaa cttttatgag tcctaagttg 5641 gtgccttctt cccatagtct tttgctagtg taggaattga cactcgaatt gtaattgctt 5701 gtggtgcgct ctaaacatcc agttgtatta gtgagtttgg ctgttttact cacctgtttg 5761 acagcctgtg ccaacagtcc atctgccaaa aacattgagc agtctttggg ggctgacccc 5821 aaactaaaga ataacccagt cacctttggc gctcaagagg tacgcagtag ccaacaacta 5881 caaactcctc aggcgacagt tcaattacca actgattttc ccaaagacat cccgttgtat 5941 cctaatgcca ccctacaaga ggtgacacca tcaagcagtg agaatcccag tgtatccagt 6001 cgctggctaa gttctgaccc tagcaatgtt attactagct attatcgcca acagtttcag 6061 gcaaacaact ggcagattgt gcaaggagct tccacagatg agccgaaaag ttcttttgag 6121 gtgcggcgca atgatataca attaaaagtc tctattcaac cgaaaacagt taccaacgct 6181 gcacccaacc aaccacaaac atccactgaa atacaaattt catactcacc attatccact 6241 actacagcac aagctaaccc aactccaact cctcaagcta ctgacaaaac tgtgcaatct 6301 ggtgagtcac agtttgttgg cccggtgcca tcacaagact cgacagcaca accaaatgct 6361 acattagaga gtaaaaaccc tacacccaat gccagttccc tgcccaaatc tcaagaattg 6421 agcgatttga acaaagtacc tcaagaattt cgacaataca ttcaagattt ggcagcgtta 6481 ggagttttca atgtagaatc aaatgcgctt aagggcaatg gtaccacaac caaccagttt 6541 gaaccagcta aagtggtgac tcatcggcaa tttgcccatt ggttagttgc tgcgaacaat 6601 gcgatgtatg ccaataagca agcattgcag attcgcttgg catcagaaag ttctcaacca 6661 acatttcggg atgtaccaaa aactgatcct gattttcggg caattcaggg attagccgaa 6721 gccggattaa ttcccagttc tctatctgga gattccaccg ctgttttgtt tcaacctgat 6781 tcacctttaa cacgggagca gttgattctt tggaaaattc cccttgacat acgtcaagct 6841 ttaccctctg cttcaataga tgccgttaaa caaacttggg gttttcaaga tgctgggaaa 6901 attgacccga aagcattacg agcagttcta gcagatttcc aaagtggtga aaaatcaaat 6961 attcgtcgag tttttggcta tacaacgctg tttcaaccca agaaaccagt aacccgaact 7021 gaggcggctg ctgttttatg gtatttcggt actcaaggcg aaggaatatc cgctcaggaa 7081 gcattgaagt tagcaggttc gccaagccag tagctttttt gtgcaatgaa gttaatttat 7141 agtcagattc ccgacttcgc agaagttgtc gggaattttg cagcttggtt ggtgtatcat 7201 aaaaagcctg tagagacgtt gcatgcaacg tctctacaac ccaccagaat taatgcgaca 7261 aagcactagg 
tacaaaatgt ggaatacgat tgtagactag cccactaact cgctacttca 7321 gccatttcaa taatgcgccc ttcatagtcc ttaaccagaa aattcaatgg cttctcgctg 7381 cgaatcttga atttcaaacc ccgtgtttcc actcgcatta aaatcatttc taggcaatcg 7441 cggtcaaagc aaacgtggcg ttgctgattt ttactgccta aactcgctcc cataataatg 7501 tgcagctgag tatttttctt tagctgatac caaagcccct cgctagcatt gctcattctg 7561 cttccggaaa aggagggact gctagatagg taaagcggat caactcccgt tgcacctaaa 7621 gtttgctcgt agttgtaata gtagtgtaaa ggcacatcag ccgctggcaa atctagcagt 7681 ccttcataca actgtcgtgc tacctccata tctgacacca tgacagtatg tacttttggg 7741 gcactggtaa gaaacatcca catggcacca gcgtaggctg ctagtagcat caccatgatg 7801 ccttgagtgg agaaaagact atctaacggc agggatggca aaaaagaacc taggatgaaa 7861 ggctcgatct ggaactgtga catgctaact gctataacca tgaataatca gctaatgtat 7921 aaacagattc gctcaatagc ttttaacttt aagtctatgt ttctagtgta tcaagaccca 7981 taaaatatga ctcctaaatc tcaaattcta ccagccagca actctgatga gtctagtaat 8041 atgggaagca taacagaagg ttcccctaaa atctacctgt agcagtcaca cagccagtac 8101 cccatcaata cgtatgatgt aaaaatataa acaattctat gcctgcggca cgccctccgg 8161 gcgagttccc ttcgggaacg ctgagtccca aggggagcca ctgcggtctt ggggtttcac 8221 gccaggtgct ttaagcgggg gaaccccgcc aacgcactgg ctccccaagt ggagcaagtg 8281 gcgtgacacg ctgcgcgtta gccctctggg cgtgcgcaaa gcgcatacgc gatcgcaacc 8341 agaccattta gcagagaatt gaatcattga tgaacacgca aattttctgc tcaaagcgcg 8401 tttgcaccca gttacccatt gttccagttt tattcaaaaa acgttcctat cactcattaa 8461 ttcgtcattc ggatggtgca aattccttat ttcgcggaag ctcatcaccc attaataaag 8521 tcactcttct atcacagtga cactgaactg ctgactcttg ttcagcaaaa cccggattca 8581 ggtagatact tcacagcgat tttttgtcgc tataacccga tagtctacac cttgattcgg 8641 cattcagcgc gatcgcctgt gcaagctgat tatctttttg cactgacttg gcgacatatt 8701 tacaacgaac tctgtggact tgatttaaac agcagtcagt ttactaaaga cggtctgaat 8761 gtgcaaaact ggctgattaa tatgacagct tactgtatta acgagattca actccctcca 8821 acagaagcaa ttcattattc tctcaagacg acttctccac cattgttgtg ttatgtagaa 8881 caagcattag atcaattgcc gcccattttg cgattgatgg ttttgatgtc tcaaaccttc 8941 cgttggagcg aaactagaat 
tgctgcttac ctgcaagctg aaggagaaac tatcaaccca 9001 aacgaaatag tccattttct tcaagaaggt tatcgtatga tggaagagaa attaccaaca 9061 gatatccgtg ccatctactt aggtgaaaat cttattcaac ctggggctgc gtagaaacgc 9121 ggtttaatat gttttagaaa agcagatggc agccatttat ctacttagca aatggctgcc 9181 taattaaatg aaaattgcaa aagtctaatt caaaaagacc ctaccatctt ctcgaattct 9241 aactacaccc tgcttatcat tagcattcct ttgttcactc aaagaaagct ctatcattcc 9301 tgattcggta gatgttggag tgacttttac gaaaccatta tcttgacgaa tggtgctgat 9361 actcacaggg agtggcaggt tttgcagtac aatttcttga ccaccagaaa gagtccgctg 9421 ccctgtgtac ccgattgctt cataattgat tcgagcattt gtattatttt tgagtcttac 9481 agagactttg ccgttggctg gagtaaccaa agcaattggg gcttggtttt gttctggtaa 9541 aggaggtact atgactgaat tttgtctttg ttgactgctg gggttacttt gagattgaga 9601 gtatgtccta gcggtatagt tacctgtcgg tccttgactg ctgtaagtcc gaacctccga 9661 tgatgagctg ttcatgtctt gtccggaact catcgtcgtg gactgtgttt ggctgctgta 9721 actttcggag ctattatcag aacggttgtt gtatggtgtt tcaccaccta caccctgcct 9781 tatttgttct tgtgttggtg ttggttcatt agaaacactc cctggagtca gacgtccttg 9841 ctgttctagt tgtcgagtga tcgcgttggg tggacatcct tggggaacca agactctagt 9901 gttgtgaggt tcttggtaga aaatactggg acaggggtta gttttggagt tagtttgttg 9961 gcgaggttgc tgaagtacag attgctgagc cattgcttga ggaattgctg gcaagccaat 10021 cagtaatcca ccacaaatga ctcctaaaaa tttagcagag cgactgagtt tttcgtattg 10081 ctggttcata gtgacctcct atgtaaatgc ttgaaatctt tgaataacat caaaactttt 10141 gcaacaagca gattctttga ggcaacatag aagacggttt acttgttgct atgcttgctt 10201 tttaaaggtt tttaagattt aataataaga atataacaat tactaactaa ttcgtactct 10261 aactataggc atgaaaattt gtagcaggcg agatagactt ttctgccacc agtattagag 10321 aaataggtat gttgagacag agtgtcaagt tatgcatccc ctggcagagc cagggaacga 10381 ggaagttatg cctcttccca aagttggcga accttatcag gtaaccaact agcaatttct 10441 tgaattcttt ctggagacaa ttcttctttc gttgcagaga atacagcttt cacaacttct 10501 tctctgtcta cattgggttg caatccacct tcattcgcaa cccggaataa aaagcggtcg 10561 gaatcaatct taaaaattcc agggccttgc caaggtggac gaatacgact caagaaaccc 10621 acaattggat ttgtatcttt 
ccaaagctct acaatatcat tttgcagcgc tttctcatca 10681 gtgatttcag ctggcttatt ttctagttct tcggcgactc ggtctgcagc ttcggtggtc 10741 atgaggtcac gcatcacacg gaaaacgact tcggtaatgt ccctagcgtc gtagatatcc 10801 cctagcttac cattggcttt taccttctct aaaaaaggta tatcttttga gtcgatagga 10861 gatgattgtt tttcatctaa agattttgga gtgtttgtca ttcatcctcc tcaaagcatt 10921 ggttttttat aattgctaat gctagaagcg aatttttcaa aatgaaatgt agcaatttgt 10981 agaactactt agaaatacct tagaagtagt catgtcaaat tgctagaact tttttttagt 11041 tcacttcatc aagcattgcc attaacttat taggtaaaga tacttcccta catctgtcac 11101 aggaaacacc cctacatctg tcataagaaa catttggtga agtacccctt tttaggggac 11161 aatgcgtcgg cggatgggga atttcgcgga aatgcgttaa aaatcaaaag taacgggtaa 11221 agtaacgctg acagtagttc cgatgccttg agtgctttct acttgaattt gaccttggtg 11281 gctttcgact atggcttggg cgatcgccag tcccaatcct gaaccagtcg cattttgtgt 11341 tgctgtacct ttcgccgtgt gagtacgcgc tggatctacc cgataaaaac ggtcaaataa 11401 acggggtagc gcctcaggag ggataccaat tccagtgtct ttcaccttaa acagcaagta 11461 ggcagagttg taacgtagtc cagaaacacg atttgttgcc tcattacgtg ctagttccac 11521 atttacgcat cctccagcag gagtatattg caaggcattg ccaatcaaat ttgtgaacag 11581 ccgtaccagt tgatcccagt tgccaacaac tgtaaaccaa ttgtccagta attcaggatc 11641 ggtttcgcaa gcaggaggat caatcaaatc tagagagaaa gtaatatttt tttctgtagc 11701 taacagttgt tgttcttcaa ctacttccat taacaaagcg tcaacaggac aaggagaaaa 11761 aatttctttg ctgataccac tatcttgtct ggcgagaaat aataaatcgt tgactaactt 11821 gcctaaacgt tgcgtgaggc gttctaccac ttttaattgt tgtcgatagt gcgatgaaat 11881 agaattgtct acctcagcta attccaaatc agttagggcg acttgcacat tcgtttgaat 11941 caaggtaatt ggacttctta attcgtgaga agcatcagca gtaaactgtt tgagacgttg 12001 ataggattca tgtactggtt ccattgcttt acctgaaaga aaccagccac ttgctccgac 12061 ggaaaatacc atcaatccaa tacccagtgc taaatcaaaa atcaactgac gactgggttt 12121 tgtgacttca aaccacggat gactaacacg taaatatcct aatacctgcc gaccaacttg 12181 tacccgttct gtcacttgtc tcagcaagag ttgagaagtg ggaaaagtaa attcctcctt 12241 gtcgccttgt cgccttgtcg ccttgtcccc cttgtccccc atgttctctt tcctcacaac 12301 
tttgacagtc tcacccattc gattgaagtg cagtggaata ttgaggggtt gtgagaaagt 12361 agaccaaagt aattcaccag tggggctaaa ccattcaaga tcgatgcggt catcttctgc 12421 ggtgtcagta ttgtcgcgaa aactggcttc tacgtttaca cggaattgac cattatctga 12481 cttaacaggt tcgatgacga gcgatcgctc taccacttcc accacatgat tcaaagtatc 12541 atcaatccgc tcaatcaatg tactacgcac atacaaatac accccacttg caaacagcag 12601 cagtaacaca gctgtcacag cagtgtacca cagggcaaga cggcgacgag tagcttgaaa 12661 catatctgat aaatgactga ctcttgcata agatcctcgc tcaaaattta ccgaaaatat 12721 gcaatttgaa tgtgttttag tttatgaaag tgtatattga ctaacctgac tcagccacaa 12781 tgcagcgaaa gctagaaaaa ctagagcgtc aacagtacca atgtttacag aaagcactac 12841 tcagtgcgtt tcctcatcga acaaaactag agcagatggt tcgcttcgga ttagatgaga 12901 atctggagga gatcgcaaca ggtgaaaatt atggggatgt cgtcttcaaa ctgattgaat 12961 gggcagaaac taatggaaat ctggaaaact tcctaattgc tgcacgtaat aaagactgtg 13021 agggtaatcc cggtaaccta cagttgaaaa gaatttgtaa agaactattg caagcgcaga 13081 ctacaacaaa acaatcccac agattgatga atccatgtaa atttgacttg ggcgaattaa 13141 ttagaagctg tttgaacata ttagaagaca agcagggact tgttggacta gctgttccct 13201 ataatcaaga tccttttttg atatacttct gtgaacgtct gaaggaaaga attgggaaaa 13261 gtcacactga taacaaacag cctttgacat tggataatta tcgcacttct gtggacacgg 13321 cagtaatgac aatgaaacga tacaaacgac ttctacaaaa gggggatgtc atttgtccaa 13381 ttcgagttgc tgtgtctgac cccaattcaa gtcatgaatt ttggaaaaaa gtttcggctg 13441 aatttcaaga gcctcaaaac ttttctgaac accgcttgat cataattatg gtcagcagtg 13501 aatgcaaatt ttttccacaa ggtgtcactc agataacacc tccgcagttc acgaaagcag 13561 atgctcatga atggattctt gaggtgacag acaatttggg atggagagaa gaagatagaa 13621 ataagtggaa gcgatatatg attgatgaat gttttgagag tgaatgctta aatactaggt 13681 tagtttatga gcatttagag tacgctatta aacttttgca acaaaaccat acagcagaaa 13741 cttttctaca agaacttaag caagcaaatt attgaggctg atgtcaaagc gttctgttga 13801 tacctctacc aagtactcca ccatagctta cgaatccttg ccagactctc ttgaagtcga 13861 agattctgaa gtagatcccg tatctgagaa agaccctcca aaaaagaaga tatacaagga 13921 accctactta cctgacaaaa aactggcgga agcggttgat ttggcgatcg 
ccctaggtcg 13981 ccccttactg ttacaaggcg aacctggttg tgggaaaaca cgtttagcgt atgctgtcgc 14041 ttatgctttg ggcttacctt tggaagtcag ttatatcaag tctactagtc gtgcccagga 14101 tttactttac acttacgatg ctgtcaatcg cctttatgat gctcagctag gggctgatgg 14161 accttgcaaa aatggtatac ctctaagtcg agacattggt aactatattc gtttaggtcc 14221 tttgggaaga gcgatcgccc gcgctcaata tgagcgtcgt tcagtcgtgc tgatagacga 14281 aattgacaaa gctgatctcg actttcccaa tgatttgtta tgggagttgg atcggttgga 14341 gtttcgagtc actgaagctc cagatatata ttacgctgtt ggcgacaacc cagcattacg 14401 cccaatcgtg tttgtcacgc acaatgaaga aaaagcatta ccaacagcgt ttttgcgccg 14461 ctgcattttt cactacgtgg aatttcccca aacagaagaa cttttgcaac aggttttagc 14521 aacccacgag atttctaatc aacaattgag cgaaaaagcg atcaaagttt tgttaaagct 14581 tcgcggactc gacttaagca aacgaccggg tttgagtgaa ctgctggatt gggtgggtta 14641 tttggaagcg gtaaaaactc cggtagagga acttgacaaa ttgccgtatt taggaacatt 14701 gctcaaacaa gagagtgatc gccaacgcgc aatcacggag tatcccaaac agtgaagcaa 14761 cccaagttac ccgcgtttct ctgggagtta tttcaaaagc tacgccgtcg tnnnnnnnnn 14821 ngcaacccaa gttacccgcg tttctctggg agttatttca aaagctacgc cgtcgtttgt 14881 ttccactcac accagatgat tatgaaacga agcaacccaa gttacccgcg tttctctggg 14941 agttatttca aaagctacgc cgtcgtgggt ttccactcac acctgatgat tatgaaacgc 15001 tgcgacagtc cttgcaagct gggttcggtt ggacatcaca agaagcactg cgggatttgt 15061 gcaattccct ctgggcgaag tcgcgccaag agcaggaaat tctcactgcg ctgtttaacc 15121 aacttgcacc aaagaacgaa gattggcaat tgtcttctgt gcaagtggaa aaagattttg 15181 atgccacaca ctcatcaaac aaagagcagc atcaaaacgt tccggagcat caagagcatg 15241 acgagattgt aaccgaatct tgtagcggct tacctcccat ttctttgaaa gatgtgcaac 15301 tttcggaacg tcggtttatc tttgtaccgc agttcccatt aacctatcga gaagtcgctc 15361 aaacttggcg gcggttacgg cgtcccgtgc gggtgggacc agcaacagaa ctggatgttg 15421 aactcacaat tgcgcgtcgt tgtcaacagg gagttacagc ttctgttgtg ttaaaaccga 15481 gacaccgcaa cgttgcgcgg ttacttttgc ttgtggatcg tcagggttcc atgactcctt 15541 tccatcgctt ttgtgaagaa gtttgcacag caattcaact agcagggaga ttggaagaga 15601 cagcaattta ttactttcac aacgttcctg 
ctgaaggagc ggatgagcaa gtgctagaac 15661 ctctgggcaa agaactcttt ccagttcttg attccatttt gcctgagata actcccctca 15721 aaacaggtta tctctacgaa gattctgatt tgctctctcc tattgcgttg gaggaagttc 15781 tccagaaaca tgccagtgat gcattcgtgg tgatgatgag tgatgctggt gctgtgcgta 15841 aatactacaa tgtcgtgcgt ctgttagata ccatctcttt tatcaaagca ctccgcgctt 15901 atactttaaa ctatgtttgg ctcaatcccc tgcctaagtc atattggaaa gataacaccg 15961 ccgcccagat tgcacgccat gtgccgatgt ttcctttgaa tcgggagggc atccagcagg 16021 cggtgaatgt gctacgtgga cagcaatata taattgagaa acctctttaa gtaggagaat 16081 ataattgaac ataactatgt cattgcgagt ggaacgcagt ggaacgaagc aatcccaaac 16141 ccttgtgata tcatgtccgg tggattagtt gtgattc // LOCUS NODE_2081_length_16107_cov_5.22919316107 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16107) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16107) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16107 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(7..495) /locus_tag="DP116_17985" CDS complement(7..495) /locus_tag="DP116_17985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997105.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NUDIX hydrolase" /protein_id="PRJNA477356:DP116_17985" /translation="MPLGRELPQLLRQRLYYKGRKFDFEVNRLRLPNKAEGEWECIRH PGGALAIPVTPEGKMILLRQYRFAVQGRILEFPAGTLEPNEDPLETIQREIEEETGYR AQKWQKLGEFFLAPGYSDEIIYAYLAQDLEKLEKAPAQDEDEDLETCDSLVTESRYSA RT" gene 990..3494 /locus_tag="DP116_17990" CDS 990..3494 /locus_tag="DP116_17990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749128.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="penicillin-binding protein" /protein_id="PRJNA477356:DP116_17990" /translation="MVKFTSWFKERPEKSSDGDKDQPSSRPNDQNEENEGSTNEKTTS TKPVKQNQLLKQILSKLPGSHKPLYRRYWFWAGLGISSGIIAIAYGIRAIDQGLPDKA ELNAIVRERTLTIKAADGSIIQQQGEATREQLNIEEIPDKLKKAFIASEDRRFKEHNG VDPQGIVRAVLNNMRSQNVVEGGSTITQQVSRILFLKQEKTFWRKLKEARLAQKIEGQ LSKDEILERYLNLVYLGSGAYGVADAAWVYFSKSVDQLTLDEMAIIAALPPAPNRFSP QVNKQEAKQRRDLVLQRMLEDGFITAAEKQTATAEPIHLKPSSPKRWQEEAPYFISYI QKELPKYVSPEALKAGGFTVETSLNLNWQKAAEAAVKKTLRNEGRWENFKQAALVAID PRNGEIKAMVGGKDFGKNQFNRVTQAQRQPGSTFKGFVYAAAIASGMDPYDAYLDAPL VVDGYEPKNFDEGYRGMLSMRDALTKSVNIVAVRILMKVGFQPTIQLAHRMGIQSELK PMYSLALGSSEVNLLELTSAYGSFATKGLHVDPHGITRIIDRQGKVIWSADFQPKRAL DAESAAIMTWMLRNVVQNGTGRAAQLGRPVAGKTGTTDDARDLWFIGYIPQMAAGVWL GNDNNKPTDGSSSSAANTWHEFMEKAVKEIPVEKFPERPKLEGRKATIKAIPVKARRI INRSITSNDQQSDEENTSRSYRSRRRNQQVNSDDNSGEERSSRRRRYRRRDYQEEQQQ QQQQETSTPRRRYSRRYRSEESSSSSESSSQPSRSRRRYRTENSDYSAPRRRRREYTP PANNPRPSTSTSSPPTRSWRERLRPTTPSSETSPEG" gene 3939..5036 /locus_tag="DP116_17995" CDS 3939..5036 /locus_tag="DP116_17995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997097.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AI-2E family transporter" /protein_id="PRJNA477356:DP116_17995" /translation="MNLSLSKLLPLLILTLLFPLVFLNGWLAFRVLQYFQPLITTLFL ASLLAFILNYPVSILQQRGVKRNYAVALVFITTVIIIFALGLTLLPIVLQQSHEMVTT LPQWIDSSESQFKNINDWLLSHGFKVNFNQIFSKIVNRLPNELEYLVDKIFSIIIDTI DSISKAVITVVLTFYLLLDGERIWDSFFKKIPLSFGEQLKQSIQQNFQNYLIGQVALA FLMGISLTIVFLVLQVQFALLFGLGVGILSLIPFGDVVSLAVVTLILASHDFWLAARV LAVSVVIDQLIDQAIAPRLLGSFTGLRPIWVLISLLVGTYIGGVLGLLIAVPIAAVIK DALDSWQLPSKPDYSDTVVESKELSEILTNE" gene 5142..6647 /locus_tag="DP116_18000" CDS 5142..6647 /locus_tag="DP116_18000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744021.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rhomboid family intramembrane serine protease" /protein_id="PRJNA477356:DP116_18000" /translation="MMSASLSGTNNNRGWILVSGFILVIIAVLSYLTPSLGGLIGGCL WGILVILPSLGHKKVDQLIDQQRFGQASRVASLIWWLHPLDGWRDQPKLLYALDLGQR GARASAVAILDRYQTTTTPTGRSAAVSLYQMDARWEELLVWIQDSLSEAVLRKDFDML VCYLRTLGETGDLNGMLQAWERYQPSIEKILNPRTQNLARMYVLAFCGKTEYVARLLS GSLADYSNTIKLFWLATVDQAAGRETIAHEQFLSIADSNNVCIRNAVARRLTSPVVVA NTVLTEKSEEILSRISTEIEYETRYSGRDSLKPRFAIATYFIITLNVLAFALEVKLGG STNLNNLYRLGALVPQEVVKGDWWRLLTAAFLHFGFLHLFLNMLGLYLFGRLMEFAFG TPQFFLLYFASAIGSMLAVTYMSVLGYSQSDFVVGASGCVMGLVGGFTAVLLHEWLRK RTRVAARNLRGILAVIVLQSIFDLTTPQISFVGHASGLIVGFVVGMILKAL" gene 6848..7990 /locus_tag="DP116_18005" CDS 6848..7990 /locus_tag="DP116_18005" /inference="COORDINATES: protein motif:HMM:PF11949.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18005" /translation="MKILSKAAAGTFLFTVVMGSGINTVSAASLYSITDLGSLVGSDY SYATDINNFGQVIFDSGKGTSNGSPDRAFLYTNGQVTEIKPLSGDTDIAVTSINNFGQ VVGNSVNENNFTGNNPLLYSQGRTQSLVGLNDAIPYAINDKGEIVGGAQKIGPFLYKN GTVVNFSTEGTVAYDINNQSQVVGILNTNKAFLYENGTTTALGTLPGDNYSSAEGIND KGQVVGVSAPTSISNGRAFLYSSSTNLINLGRLFPTDLYSVAFDINNNGQVVGFSGSN PNFYSNSGIGIRAFLYSDGILQDLNNLISRDSGFTITQARAINDQGQIVGAATFNGQL RAIVLTPESVSTPVPEPSTSAGLGLFLTGLGCSKFLQEKLKKKIAA" gene 8349..9368 /locus_tag="DP116_18010" CDS 8349..9368 /locus_tag="DP116_18010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317080.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lysozyme" /protein_id="PRJNA477356:DP116_18010" /translation="MAPAVFIENFKKRYGMVEAVKDVSFKVEPGEIFGLLGPNGAGKT TTLRTLCTLTTPDAGKIEVSGISVVDNPRAARRRLGYVAQEVALDKVLTGRELLQLQA ALYHLPGAVAKQRVNTVLQLLGLQEYADKKTGTYSGGLRKRLDLAAGLLHAPDVLVLD EPTVGLDIESRFVVWDFLRKLREAGTTVLITSHYLEEVDALADKVAIIDRGVVIATGT PSELKDKVGGDRITLRIREFSPDEETHKAKDLLKSLSFVQEVIINSAQGNSLNLVVTP QNDALISIQQTLNSAGLPIFSIAQSRPSLDDVYLAATGRTLMDAELAAAGNRDPKAER KQSMR" gene 9478..10377 /locus_tag="DP116_18015" CDS 9478..10377 /locus_tag="DP116_18015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_18015" /translation="MSGTVLPPKSDINWQQVASPQAYEVDATPNVFGEFVQETLALTR RLFIQLQRRPSTLIAGIIQPVMWLVLFGALFQNAPKGIFGNTTNYGQFLGAGVIVFTA FAGALNAGLPVMFDREFGFLNRLLVAPLVSRFSIVLASAIFIISQSLLQAAVIVTAAA FLGAGLPNAAGLGAIVLIVFLLALGVTAISLGLAFTLPGHIELIAVIFVSNLPLLFAS TALAPLSFMPQWLQVVATLNPLSYAIEPIRYLYLHKDWGLNSVVMHAFWGDVTFGGAM LVLLGFAVVALLSIQPQLRRTLA" gene 10562..10954 /locus_tag="DP116_18020" CDS 10562..10954 /locus_tag="DP116_18020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18020" /translation="MKIPFLPFKQLFVLTVAGLSFASLLLPQPSLAEPSSRNILQDLN SQQNNDPLSPRSDEVNNMGMFGLMHRLQQGNATWNPNEQNQQLNDAAAAFKQKQQQLF QQNQTRQQPTQPSFQVNTPGVIKPKSGQ" gene complement(11048..11485) /locus_tag="DP116_18025" CDS complement(11048..11485) /locus_tag="DP116_18025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peroxiredoxin" /protein_id="PRJNA477356:DP116_18025" /translation="MPLAVGSDAPAFTVKDTNGNTVSLSDFKGKTVVLYFYPKDDTPG CTKQACSFRDAVDDYKRNDVVILGVSADDEASHQAFTQKYNLNFPLLADTDHSLIKAY DVDGGGYAKRVTYVIDGNGKITKVDSSVNTSTHASDVLAALGL" gene 12085..12870 /locus_tag="DP116_18030" CDS 12085..12870 /locus_tag="DP116_18030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870495.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_18030" /translation="MRNRYLKVLTLFWSAAIEAELEYRINFLLATLSSLGNLAGSLFG LFLFYGNGYTFAGWSWEAALVVLGIFTLMQGFSATFLAPNLNSIVRHVQEGTLDFVLL KPIRSQFWLSTRSVSPWGLPDIVFGSIIIGYAGKKLGLGINDYLISTIPLCFGLVILY SLWFMLGATSIWFVKIYNATEVLRGLLEAGRYPMVAYPTAYRFFFTYVVPVAFLTTIP AEVMLGRSQITWIVGAGVLALALFFVSTRFWRFALRFYTSASS" gene 12992..13360 /locus_tag="DP116_18035" CDS 12992..13360 /locus_tag="DP116_18035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313967.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18035" /translation="MIGESTVKLTIALKDSGLDDEELDRLTQNLLQEIKDLDELEQVN RVAVAETPQGAKSLGSFLLGMLQVEVSVANIKKLLGFVGDSLGNKPIEFEVEANGKKL KLKAYSQQELQAGQQFVSST" gene 13369..14850 /locus_tag="DP116_18040" CDS 13369..14850 /locus_tag="DP116_18040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181641.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18040" /translation="MAKKVALLIGVSEYGEGLTPLLGAAKDVKVMEQVLQHPEIGNFN EVMLLTNPAPEAMMEAIESLFSGRTKDDLVLLFFSGHGMKDENLKLYFATSRTRKNPQ GELVKATAVPASFVHDMMNNCRSKRQVVILDCCFSGAFAQGLSAKDDGSVNIKAILGG EGGAVLTSSTSTQLSFEQQGSDLSVYTRYLVEGIETGAADLDNDGVVSVDELHEYAKQ KVQEAAPAMKPEIHTTKEGYKIRLAQAPTDNPKLRYRREIERYKSQGEISFVGRRFLD ELRDILGLSPDDAALIETEVLKPYREYKEKLQRYESVLSEAIQRENPLSDNSRHDLKR LQQVLGLRKEDVEPIEERIFDNTTAEQTIINHSPIQQTSTSQPKTALSEPDAITVPSV QMSGAQQGTQTNNLRRAPFFNRRLVMIGVGIITTVAIAGAVIWVSLTSNPSKTEESPD KKKCEEYQRKYNEKDSEVQAKVESSDGFIRKRCKNVWGVVIDR" gene 15172..15534 /locus_tag="DP116_18045" CDS 15172..15534 /locus_tag="DP116_18045" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18045" /translation="MNSQHSVLIIMLNIFAFTGVSLIASSPPTTRTCTINEKGGATPV ILYLKPGAKEEEVKIKPVKVEPGHKVHPTNKPLQKAVYANETVDWINVKVDIEVEGWV RKDVVTEPCYKNTATSKV" gene complement(15760..15948) /locus_tag="DP116_18050" CDS complement(15760..15948) /locus_tag="DP116_18050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019486786.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18050" /translation="MTETILTLAQKLGVDVTAEGIETAEQLAQLRNLKCRYGQRYFFS QALSSGAALAFILANPQW" BASE COUNT 4613 a 3257 c 3601 g 4636 t ORIGIN 1 ggtaacttat gtgcgggctg agtaccgtga ttcagttacc aacgaatcgc acgtttccaa 61 atcctcatct tcgtcttgtg ctggtgcttt ctccagcttt tctaaatctt gtgctaaata 121 agcataaatt atttcgtcag aatagcccgg agcgagaaaa aattctccta gtttttgcca 181 cttttgggca cggtagccgg tttcttcttc tatttcacgc tgaattgtct ctaggggatc 241 ttcgtttggt tccaatgtac cagcaggaaa ctctaagatt cttccctgaa ctgcaaaacg 301 atactggcgc aaaagtatca ttttaccttc tggagttacg ggtatagcta aagcgccacc 361 tggatgacga atgcattccc attctccttc tgctttgtta ggtaatcgta agcgattaac 421 ctcaaaatca aatttgcgcc ctttgtagta caagcgctgt ctcagcagct ggggtaattc 481 tctacctaat ggcatagttc aatgatttgg tttaaaaata cacagactaa aactttcgga 541 agatgtcaaa atcagcgttg tctgtatggt tgttggtaaa attagttgcc caccaacaag 601 tgtacttcag aacagcctac ctctttaaca gttatcagtt atcagggagg ttaagaaatc 661 gcttcctctg ttaagagttc cctgttccct gttaagagtt ccctgttaag agttccctgt 721 tcactgttca ctgatttaat cttcattaaa taacgtattt tgctttcctg ttttgttttt 781 gtgttttata ttttttcaac agtactattg cttcatgtaa tgattaaaga taaagtttgt 841 ataaataaga acgcacaacc gtaacaaaat tttcccaaaa agaatatatc caagcataga 901 tggggtgata tactatcacg aatgtagtgc aatgatctaa aaactttgga tggtggttag 961 attgcaactg cttgaagcga ggaatgaacg tggttaagtt tacctcctgg ttcaaagaac 1021 gaccagaaaa gtcgagtgat ggtgataagg atcaaccaag ctcgcggcct 
aacgatcaaa 1081 atgaggagaa cgagggatct acaaatgaaa agacaacatc aaccaagcca gtaaagcaaa 1141 accaattact gaagcagata ctttccaaac ttcctggcag ccacaaaccc ctttatcgtc 1201 gttactggtt ttgggcaggc ttgggtataa gtagtggaat tattgcaatt gcctacggta 1261 ttcgagcaat agaccaaggt ttaccagata aagcagaact caatgccatt gttcgagaac 1321 gtacactgac cattaaagct gctgacggta gcattataca acaacaaggt gaagcaacca 1381 gagaacagtt aaatatagag gaaataccag ataaacttaa aaaagctttt attgcttcag 1441 aagacagaag gttcaaggaa cacaacggag ttgaccccca aggaattgtg agagcagttt 1501 taaataacat gcgatcgcaa aacgtggtcg aaggtggtag tacgatcaca caacaggtct 1561 caagaattct ctttctcaaa caagagaaaa ctttctggcg caagctcaag gaagcccgat 1621 tagcacaaaa aatagaaggg caattgtcta aggacgaaat tctggaacgt tacctgaatt 1681 tggtttactt gggttctggt gcttacggtg ttgcagatgc agcatgggtt tactttagta 1741 aatctgtgga tcaactcacc ttggatgaaa tggcgatcat tgcggcatta cctcctgccc 1801 caaatcgctt ttcaccacaa gttaataaac aagaggcaaa acaacggcgg gatttggtac 1861 tacagcggat gctggaagat ggatttatta cagcagccga aaaacaaaca gcaaccgcag 1921 aaccaattca cctgaaaccc agttcaccta agcgatggca agaagaggct ccctatttta 1981 taagctacat tcaaaaagaa ttgcctaagt atgtttcccc cgaggcactc aaagcaggtg 2041 gtttcacagt ggaaacgagc ctgaacttga attggcagaa agcggcggaa gcagctgtga 2101 aaaaaacgtt gcgaaatgag gggcgctggg aaaacttcaa gcaggcggct ttggttgcga 2161 ttgatccccg caatggtgaa attaaggcaa tggttggggg aaaagacttt ggtaagaacc 2221 aatttaatcg cgtgactcag gcacagcgtc aaccagggtc aacattcaaa gggtttgttt 2281 atgctgctgc tatagcaagc gggatggatc cctacgacgc ctacctcgat gcacccctag 2341 ttgtagatgg ttatgaaccg aaaaactttg atgagggtta ccgaggcatg ctaagcatga 2401 gagatgccct caccaaatcg gttaacattg ttgcggtgag aatcctgatg aaagtgggtt 2461 ttcaaccaac tatccaactt gcccacagga tggggattca atcggaattg aaaccgatgt 2521 attccttggc gctgggttct tctgaagtga atttgctaga gttgaccagt gcttatggtt 2581 cctttgccac gaagggctta cacgtagatc ctcatggtat tacacgtatt attgaccgtc 2641 aaggtaaggt tatttggtct gccgacttcc agccaaagcg ggcacttgac gctgaaagtg 2701 ctgctatcat gacctggatg ctccgcaatg ttgtgcaaaa tggcactggt cgtgcggctc 
2761 aattaggtag acccgttgct ggcaaaactg gcactaccga tgatgctcgc gacttatggt 2821 ttattgggta tattccccaa atggccgcag gggtttggtt gggtaacgac aacaacaaac 2881 ccaccgacgg tagcagcagt agtgctgcta acacttggca tgaatttatg gaaaaagcgg 2941 tcaaggaaat acctgtagaa aagtttcctg aaagacccaa gttagaaggt cgaaaagcca 3001 cgattaaagc aatacccgtc aaggctaggc gaattatcaa tcgctctatt acttctaatg 3061 accagcaatc cgatgaagaa aatactagca gatcatatag aagtagaaga cgaaatcaac 3121 aagtaaattc tgacgataat agtggtgaag agcgttcatc tagaagacgc agatatagaa 3181 gacgcgatta tcaggaggag caacaacaac aacaacaaca agaaacctca acaccaagaa 3241 ggcgctatag tcgtcgttat cgtagtgaag aatcaagttc tagtagtgag tcatcctcac 3301 aaccatctcg ttcacggcgg cgctatcgaa cagaaaactc tgattattca gctccaagaa 3361 ggcgtagacg agagtataca ccgcctgcaa acaaccctcg cccctcaact tcaacttctt 3421 ctcctccaac acgttcgtgg cgggaaagat tgagacctac tacgccatct tcagaaacaa 3481 gtcctgaggg ttagggctaa tcaacggatt atatccagcc tgccctttgg gagcgtagca 3541 gagggcgggc tacgcgacgg gcgttaagcg aacgcgagtg cgtgcgcctc tggcgcttag 3601 ctctgccgta ggcaatcgcc agttaagtcc aaaataaacc ttgtatggtt gatgacagtc 3661 ttaacggggt gatcgccgaa ggcggcggca acagcctcgc atgacatggc atccattggc 3721 agcatcccca cgtactaagc ttgatattac gggcaattga tataacttga tacaactgtg 3781 caagagatta tcagcaactg aatcaacaac agtagttagt tatcgacagt aagagtatca 3841 aaaccactgt ctgagtgttt ttattgggtg ctttgttatt aattagattt aatatgataa 3901 atgggcattg cccaaatcac aattttcgac tatttaatat gaatctttca ctgagcaaac 3961 tactgccatt gttaattttg acactactct ttccattagt ctttctcaat ggctggctag 4021 cgtttcgagt tcttcaatat tttcaacccc ttataacaac tcttttttta gcaagtttac 4081 ttgcctttat tctcaattat cctgtttcga ttcttcagca acggggagtg aaaagaaatt 4141 atgcagtggc attagttttt atcacaactg taataattat atttgcttta ggtttgactc 4201 tgttgcccat tgttttacag cagtctcatg aaatggtgac aacacttcct caatggattg 4261 attctagtga gtctcaattc aaaaatatta atgattggtt actaagtcac ggtttcaaag 4321 taaattttaa tcaaatattt tctaaaatag taaatcgcct acctaatgag ttagaatatc 4381 tggtagataa aatatttagt attatcatag acactattga tagtatttcc aaagctgtaa 4441 
ttacagtcgt gctaactttc tatttactat tagatggtga gagaatttgg gatagtttct 4501 tcaaaaaaat ccccttaagc tttggtgagc agttaaagca gtctattcaa cagaactttc 4561 aaaattactt aattggtcag gtagctttag ctttcctgat ggggatttca ctaaccatag 4621 tgtttctcgt tcttcaagtc cagtttgctt tactctttgg tttgggagta gggattttga 4681 gcttaattcc cttcggtgat gttgttagtc ttgctgtagt tactctcatt ctagcttcac 4741 atgacttttg gctagcagca agggttttag ctgtatccgt tgtcattgac caattaattg 4801 accaagctat tgctccaaga cttttaggta gttttacagg acttagacca atatgggttt 4861 tgatttcttt gctagtagga acttacattg gtggagtttt gggattactt attgctgtac 4921 ctatagcagc tgtcattaaa gatgcactcg atagttggca acttccttct aaacctgatt 4981 attccgacac tgttgttgag agtaaagaat tgtcagaaat attaaccaat gagtagtgat 5041 tgggttatta ctgtacaacc aggtacttcc tgattattgt aaatgaaatg gatcttaatc 5101 acatattaac ttggatggtc tgtttatcat gcatctcgac catgatgagc gcatctttat 5161 ctggaactaa taataatcgt ggctggattt tagtatcagg attcatcttg gtaatcatag 5221 cagttttgtc ttatctcact ccatccttgg gtgggttaat tggtggatgc ttgtggggga 5281 ttttagttat tttgcctagt cttggtcaca agaaagttga ccaacttatt gatcaacaac 5341 gttttggtca agcaagcaga gtggcgagtc tgatttggtg gctccatccg ttggatggtt 5401 ggcgtgatca accaaaattg ttatacgctt tggatttggg tcaacgtgga gcaagagctt 5461 ccgctgttgc aattctagac cgttatcaaa ccaccacaac accaactggt cgttccgcag 5521 ctgtcagcct ttaccaaatg gatgctcgtt gggaagaatt gctggtgtgg atacaggata 5581 gcctaagcga agcagtcttg cgcaaagatt tcgatatgct ggtttgctac ctcagaaccc 5641 ttggagaaac tggcgatttg aatggtatgc tccaagcttg ggagcgctat cagccaagca 5701 tcgaaaaaat acttaaccca agaacacaaa atctagcgcg tatgtatgtg ctagccttct 5761 gcggtaaaac agaatatgtc gcgagattat tgagtggttc actagcggac tattccaaca 5821 ccatcaaact attttggcta gcaacagttg accaagcagc tggaagagaa accatagctc 5881 atgagcaatt tttgagtatt gctgatagta acaatgtttg cattcgcaac gcagtagcaa 5941 ggcgtttaac cagtcctgtg gttgtggcta acacagtact tactgaaaaa tcagaagaga 6001 ttctatctag aattagtaca gaaatagagt atgaaacaag atacagcggt agagatagct 6061 taaagccacg ttttgcgatc gctacatatt ttattatcac tttgaatgta cttgcttttg 6121 ctctagaggt 
aaaactggga ggtagtacca acctgaataa cttatatcgt ttaggtgcat 6181 tagtaccaca agaagttgtt aaaggagatt ggtggcgctt gttaacagcc gcgtttcttc 6241 acttcgggtt cttgcaccta ttcctaaata tgttaggtct ttatttattt ggtcgcttaa 6301 tggaatttgc cttcggaaca ccgcaatttt ttctgttgta ttttgctagt gcaattggct 6361 ctatgctggc agtgacttat atgtcagttc tgggatattc tcaatctgat tttgtagttg 6421 gtgcatcagg atgtgtgatg ggtcttgttg gtggttttac tgctgtgctg ttacacgagt 6481 ggctaaggaa aagaacacgc gttgctgcta gaaatttacg agggatttta gcagttattg 6541 tgttgcaaag catctttgac ctgacaactc cacagattag ctttgttggt catgcttctg 6601 ggttgattgt tggttttgtg gtggggatga ttttaaaggc tttatgaacg aggtttcaaa 6661 ttgttaattg gcttgtagag taaaaatttt gaaacattgg acaaaatggg tcttttgttc 6721 atcaaagtgc aaacttcatg gtagattcaa gcatatccac agaaaacaaa gaaactatca 6781 atcaagatag cgacatcagg ctgcatttac ttaaaccatt ttgaaattaa ggttaaggat 6841 ttgtctgatg aaaattttat cgaaagcagc cgctggtact tttttattca ctgtagtcat 6901 gggttcaggc ataaatacag tatcggcagc atctttatac tcaataactg atctaggctc 6961 cttggtaggt tcagattaca gttacgctac tgacatcaac aattttggtc aagttatctt 7021 tgattcaggt aaaggcacca gcaacgggag tccagatcgt gcttttttat atacgaatgg 7081 tcaggtgact gaaatcaaac ctctctctgg tgacactgat atcgccgtta caagtatcaa 7141 caactttggt caggtggtag gtaattcggt taatgaaaac aacttcactg ggaataaccc 7201 cttactgtat agccaaggca gaacacaaag cctcgtcggt cttaatgatg ctatccctta 7261 tgccatcaac gataaaggtg agatagtggg tggagcgcaa aaaattggtc cgtttttgta 7321 taagaacgga acggtagtta acttcagcac tgaaggtact gtcgcatatg atatcaacaa 7381 ccagagtcag gtagtcggca ttttaaacac taacaaagct ttcctgtatg aaaatggcac 7441 aacgactgcc ttgggcactc tccctggtga caattactcc tcagctgagg gcatcaatga 7501 taaaggtcaa gtcgtcggag tttcagcccc cacaagcata agtaatggtc gggcttttct 7561 ctacagtagt agtacaaatc tgattaacct cggtagacta tttcctactg acctttacag 7621 cgttgctttt gacatcaaca acaacggtca ggttgttggc ttttcgggca gcaatcctaa 7681 cttttattca aacagtggga ttggaattcg tgcttttctt tacagtgacg gcattttaca 7741 agaccttaac aacctgattt ctcgtgattc tggttttacc atcactcagg caagagctat 7801 taacgaccag ggacaaatcg 
tgggggctgc tactttcaac ggtcaacttc gtgctattgt 7861 gttgacacct gagtctgttt ccacacctgt gccagaaccc tctacaagcg caggcttagg 7921 gttgtttcta acaggtctag gttgctctaa atttcttcaa gagaaattga aaaagaaaat 7981 agctgcataa ccgcatagat tgttgatgtt gttgaccgag cgctagagct agcttcaagc 8041 cacgttttgc gatcgcccaa ggggcggctc cctttgagag catcgctact attttatgat 8101 cattttaaaa agtgctcacg aagatgaaat gtattcagca atggattggt tcttaaaaag 8161 gcaaccaaaa caggacttac gcatttcaac gtaactccgt ccgcagcgat cgcctttggg 8221 tgagacaaaa ccgcaaccca actcataaga taaaggaagc cgtacccact taatcacccc 8281 accgaacatc actcaactac aataggaata gaatttgttg agatttgtat agtttaggat 8341 aaactatcat ggctcccgcc gttttcattg aaaatttcaa aaaacgctac ggcatggttg 8401 aagccgtgaa ggatgtttcc tttaaggtgg aaccagggga aatctttggc ttactcggtc 8461 ctaatggagc aggcaaaaca accacattga gaactttgtg tactctcacg acaccagatg 8521 caggtaaaat agaggtatct ggcatttctg tggtggacaa tccaagagcg gcaaggagaa 8581 ggctaggcta cgtagctcag gaagttgcct tagataaggt gttgactgga cgcgaactac 8641 tacaactgca agccgcactg tatcacttac ctggtgcagt ggcaaaacaa cgagtgaaca 8701 ccgtgctgca gttactcggt ttgcaagaat atgcagataa aaagactggc acttactctg 8761 gtgggttacg caagcgccta gacttagcag caggattact ccatgcacca gatgtcctgg 8821 ttttagatga gccaacagta ggacttgaca tagaaagccg ttttgtggtg tgggatttcc 8881 tgcgtaagtt gcgagaagca ggaacaacgg tactgattac cagccattat ttagaagaag 8941 ttgacgcctt ggctgataaa gtggcaatta ttgaccgtgg agttgtgatt gctacaggaa 9001 cgccttcaga gttgaaagat aaagttgggg gcgatcgcat taccttacga atccgcgaat 9061 tttcacccga cgaagagacg cacaaagcta aagatttgct gaaatctttg tcgtttgttc 9121 aagaagtgat cattaatagc gctcaaggga actccctgaa cttagtcgtg acaccgcaaa 9181 acgatgcttt gatcagcatc caacaaacgc tcaattcggc tggattacca attttcagta 9241 ttgcccaatc tcgaccgagt ttggatgatg tttatcttgc agccacagga cgaactttga 9301 tggatgcgga actcgcagca gctggaaatc gcgatccaaa ggctgaacgc aagcagagta 9361 tgcgttaggc tttgtgaatt ggtaatgacc aagactatcc tactattttg atagcgcagc 9421 gtgcccggag ggcatacttt gaattttgaa ctttgaattt tgaatgtgtg aatttttatg 9481 agcggtactg ttttacctcc aaaatctgat 
ataaattggc agcaagtggc atcacctcaa 9541 gcttacgaag tagatgcgac tcccaatgtc tttggtgaat ttgtacaaga gacacttgct 9601 ttgacgcgtc gcttgtttat tcagttgcag cggcgtccct caacattaat tgctggaatt 9661 attcagccag tcatgtggtt ggtgctgttc ggtgctttgt ttcaaaatgc accaaaaggt 9721 atctttggca ataccacaaa ttatggacaa tttctgggcg ctggcgtcat tgtttttact 9781 gcatttgctg gggcgttgaa tgctggttta ccagtcatgt ttgaccgcga attcggcttt 9841 ttgaaccgtt tgcttgtggc tcctcttgtt tcacggtttt cgatagttct ggcttcggct 9901 atctttatta tcagccaaag tttactgcaa gctgcggtga ttgtgacagc agcggcattt 9961 ttgggggctg ggttaccaaa tgcagcgggt ttaggagcaa tagttttgat tgtcttcttg 10021 cttgcattag gtgtgactgc gatcagtctt ggtttggctt ttaccttacc gggacatatt 10081 gaacttattg cagtcatttt tgtcagtaac ttaccattgt tgtttgctag tactgcatta 10141 gctccattat cctttatgcc tcaatggcta caggttgtcg ctaccctcaa tcccctcagc 10201 tatgcgatcg aaccaattcg ctatttgtat cttcacaaag attggggatt aaatagcgta 10261 gtcatgcatg ctttttgggg tgatgtaacg tttggtggcg caatgcttgt attgcttggc 10321 tttgctgttg tagcattact tagcattcaa cctcaactgc gacggactct tgcttaatat 10381 aaaagcatta ttggatttaa gtgattttgg catgaaaaga actcaccaat tacctacata 10441 ccacaaaaaa gaattattct gagttagtga gttggtgtgt tctgtatgat tcttaatcaa 10501 aatcctttaa ctctcccacc ctgagaaggc gtgggtttcc aaactttctt cggagatttt 10561 tatgaaaata ccatttttac cgtttaaaca actatttgta cttactgtgg caggactcag 10621 ctttgcttct ttgctattgc ctcaacctag tttagctgaa cctagctcaa gaaacattct 10681 gcaagacctc aattcacaac agaataacga tccattatct cctcgcagcg atgaggtgaa 10741 taatatgggc atgtttggtt taatgcatcg ccttcaacaa ggaaatgcaa cttggaatcc 10801 aaatgaacag aaccaacaac tcaatgatgc agccgctgca ttcaagcaaa agcaacaaca 10861 attgtttcag caaaatcaaa ctcgacagca gccaactcaa ccaagctttc aggtaaatac 10921 acctggggtg attaaaccta aatccggtca gtagagcaac aaaaaaagaa cacagaacgc 10981 agtaccaaaa acttttgatt ctgagattct gtgttctgag ttcttctgaa tgattgctaa 11041 tcggttttta caatcccaat gctgctaaaa catcgctagc atgagtcgat gtgttgacac 11101 tagagtcaac cttggtgatt ttgccgttac catcaatgac gtaagtaacg cgtttagcat 11161 atccaccacc atcaacatca 
tatgccttga ttaagctgtg atcagtgtca gccagcagcg 11221 gaaaattgag attatatttt tgggtgaatg cttggtggga agcttcatca tctgcgctga 11281 ctcccaggat gactacatca tttctcttgt agtcgtcaac agcatcccga aaactacagg 11341 cttgtttggt acagcctgga gtgtcgtctt tggggtagaa atacaaaacc actgtcttac 11401 ctttgaaatc agacaacgaa acagtgttgc cgttggtgtc tttgacggta aatgcaggtg 11461 catcgctacc aactgctaag ggcatagttt tagtatctcc actatacagg gtttggttca 11521 gggcagtttg tggtcagtaa cacgagttcg acctttcaag ttttgggatt ttgtttcatg 11581 aaagttacga atacatgtca ctgttttccc tgatggcggt attgtatcaa acttcatata 11641 tttttgtgaa tttatggctt ctttaaggtt tgggggaaaa aaagtataat taaacacttt 11701 ttttcctaaa acacgtaata ttgatgaaac gaaccgccaa gacgctaagc acgccaagag 11761 aattaagtag gcctcaagcc gtgcccgaag ggctcaggag atacccgaag ggctttccct 11821 cacttggcat ctggtgagac cagcgcgaat gacggctctc cctcacttgg cgactggcgt 11881 tagccgtaag gcgtgcgctt tgcgcatacc cgaagggcgt tgggcattgc aatgcccacc 11941 ctacacaata tgtgtatttc actcaaataa gaagcgctat tatattgctt tgttctggag 12001 aatgggtgaa cgcgagcaaa aagcttgatg caaatgagat aatgaggtgt attgcacata 12061 cttttgccct tatagttatt caccttgaga aacagatacc taaaagtact aacattattt 12121 tggagcgctg ccatagaagc tgagttggag tatcgtatca acttcctctt agcaaccctc 12181 agcagcttgg gcaatcttgc aggtagtctt tttggattat tcttgtttta cggtaacggc 12241 tacacttttg ctgggtggtc atgggaagca gctttggtcg tcttgggaat tttcacgctg 12301 atgcaaggct tttctgcgac tttccttgct ccaaatttga atagcattgt ccgtcacgtg 12361 caggaaggta cattggactt tgtcttactc aaacccattc gtagccagtt ttggctttct 12421 acccgtagtg tatcaccttg gggacttcca gatatagttt ttggtagcat catcattggc 12481 tatgcaggta aaaaacttgg tttgggaata aacgattacc tcatcagtac aattcccttg 12541 tgttttgggt tagtcattct ttacagttta tggttcatgc taggagccac tagcatctgg 12601 tttgtcaaaa tatacaacgc caccgaagtg ctgcggggtc tgttggaagc tggaagatat 12661 ccgatggtgg cttatcctac agcataccgc tttttcttca cgtatgttgt tccagtagct 12721 tttttaacga ctatacctgc ggaagttatg ctgggtcgaa gtcaaatcac ttggatagta 12781 ggcgcgggag tgttggcgtt ggcgctgttt tttgtttcta ctaggttttg gcggtttgcg 12841 
ttacggtttt atacgagtgc ttctagttag agatgggaag taggatgcaa tatctttgca 12901 tacattaacc gattgtctac tagtgtcaag cgcaaaaatg cgacatagta taattattac 12961 gatacaaata gacaaagttc aaagggtcag gatgataggg gaatctaccg ttaaacttac 13021 aattgccctg aaagattcag gcttagatga tgaagagcta gataggctta cgcaaaatct 13081 gcttcaagaa atcaaagacc tggatgaact tgaacaagta aatcgtgtag cagttgcaga 13141 aacgccacag ggagcgaagt cattgggaag ttttctgctg ggaatgttgc aggttgaggt 13201 tagcgttgca aatattaaga aattgctcgg atttgtgggc gatagcctgg gtaacaaacc 13261 cattgagttt gaagtcgaag ccaacggcaa aaaactcaag ctcaaagcct atagtcagca 13321 agagctacaa gcaggacagc aatttgtatc atcgacatag aagtcatcat ggctaagaag 13381 gttgcactac tgattggtgt tagtgagtac ggagagggtt taaccccact gcttggggct 13441 gcaaaagatg tgaaggtaat ggagcaagtt ttgcagcatc cagagatagg taattttaat 13501 gaggtgatgc tattaacaaa ccccgcccca gaagcgatga tggaggcgat tgagagttta 13561 ttttctggtc gaactaaaga tgacctagtg ctgctatttt tctcgggtca tggaatgaag 13621 gatgagaacc tcaagcttta ctttgcaact agccgcaccc gcaaaaatcc tcaaggagaa 13681 cttgttaaag caacggcagt tccagctagt ttcgtacatg acatgatgaa caattgtcgc 13741 tcaaagcgac aagtagtgat tctggactgt tgcttcagtg gagcttttgc ccaaggttta 13801 tcagctaaag atgacggttc tgtaaacatc aaagctatcc ttggtggcga gggtggagcc 13861 gtcctgacat cttcaacttc tacccaactt tcttttgagc agcaagggtc agacctttca 13921 gtttacacgc gttacttagt tgagggtatt gaaacaggtg cagccgatct agataatgat 13981 ggtgtggttt ccgttgatga gttgcatgaa tacgccaaac aaaaagttca agaagcagct 14041 ccagcaatga aaccagagat acataccacc aaagaaggtt ataaaatccg gctggctcaa 14101 gcacctactg acaatccaaa gctaaggtat cgtagagaaa ttgagcgtta taaaagccaa 14161 ggtgagatat cctttgttgg tcgccgtttc ttagacgagc ttcgagatat tttgggactg 14221 tcacccgatg atgctgctct catcgagact gaagttctta aaccttatcg agagtacaaa 14281 gaaaaattac agcgatacga gtctgtgcta tctgaggcga ttcagcgcga gaatcccctc 14341 agtgataata gccgtcacga tttaaaacgt ttgcagcaag ttttgggtct gagaaaggaa 14401 gacgtagaac caattgaaga acgaattttt gacaacacaa cagcagagca aacgataatt 14461 aatcactcac ccatacaaca gacttccact tcgcagccta aaactgcact 
ttcagagcca 14521 gatgctatca cagtcccttc cgtacagatg tcaggagcgc agcaaggaac acaaacaaac 14581 aatcttcgtc gcgctccctt ttttaataga cgtttggtaa tgataggggt aggcattatt 14641 acaactgtgg caattgcggg tgctgttata tgggtaagct taacatctaa tccttcaaaa 14701 acagaagaat cacctgacaa aaaaaaatgt gaagagtacc agcgtaaata caacgagaag 14761 gattcggagg tgcaagctaa agttgaatca agcgacggtt ttattaggaa gagatgcaag 14821 aacgtttggg gggtagtgat tgatcgctga gtattagggc taagctgtta cgcattgaaa 14881 ttgcatagtt actcataagg gctgtgaagg taacaagtca acagtctaga ataacccttg 14941 accggaggcg gagcgtcaga ggctccgctc ttgaccttgg actcttgact tgccttaacc 15001 aaaagaagtg caacttgtag gcgcattagc ttatatacag attgatagat tgcatttagc 15061 tatccgcctt gcaaagtttg tggtcaaaat tcaacttttt cctgatggtg ttacagatat 15121 tacgtaaacg ttaagtggtt ttgaatatac actacaaaaa ggaggctcag tatgaacagt 15181 caacactctg tgctgattat tatgctaaat atttttgctt ttaccggtgt aagtctaatt 15241 gctagctccc cacctaccac acgtacctgc actattaatg aaaaaggtgg ggcaacacct 15301 gtaatattat atcttaaacc tggtgcaaag gaggaggagg ttaaaattaa accggtaaaa 15361 gtggaaccag gacataaagt gcatcctact aacaaaccac tccaaaaggc agtatatgct 15421 aatgagacag tggattggat aaacgttaag gttgatatag aagtagaagg ttgggtgcgg 15481 aaagatgtag taacagaacc ctgttataaa aatacggcaa ctagtaaagt gtaattccca 15541 tattaccaat caagctcctg tttggttaca cattccttaa caggagtacc cattccttca 15601 aagaaatcct ttcgcatctg gtggaacaag tttgattcta attaatctgc ccgttcaaag 15661 tcaagtatca ctgacgataa attcgtgaat tttctcagcc tgtcttttat gcatcgggtc 15721 ttactggtaa gtgccgataa gaaattcaac ccacgaacct taccactgag gatttgccaa 15781 gatgaacgca agagccgccc cactagacaa tgcctgagaa aaaaagtatc tctgcccata 15841 tctacatttg agattcctga gttgtgcgag ttgctctgcc gtttctattc cttctgctgt 15901 cacatccaca ccaagttttt gagcaagcgt caggattgtc tcagtaatct ctaaatttca 15961 cctgtcgcac tcagggcgat gccgatactc gccgtggtga acacctcttg tccgcctaga 16021 aggagatagt ctcagatctt gcaccatact tttcgtccgc caagaaataa atttcttggc 16081 tcaaagtcca agtccgttaa aacggac // LOCUS NODE_2093_length_16008_cov_5.537140 16008 bp DNA linear BCT 25-JUN-2018
DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 16008) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 16008) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16008 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..2528 /locus_tag="DP116_18055" CDS <1..2528 /locus_tag="DP116_18055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317046.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="PBS lyase" /protein_id="PRJNA477356:DP116_18055" /translation="SLVRRGKDFSADSSEGEVKSEICVHGSLQGEGVIDDVIFPLFIR LSDLDEQATEVIDTIRLIIQRDYPKTAPLVKHLLEEKLKNGKCLLLLDALDEVPKEHR NHLKDKLNRFARYYSCPIICTSRIVGYGGAFVESAKEVEIVPFSQKQTEQYIETWFTN AAGYIEDDSVSAGKLIEELRDKPQITGLAQNPLLLSLLCSLYQEKGLTLPARRTQVYA KAVDYMLSKWRSDNHRQSSSDGWAIAKIQLLESLAYQFSCEGKEIFSLRELREKIEKF LRGESCSDFRNAKAADLMKELCEEDGIIQKLARQGEQYLFLHRTFQEYLTASYLNNAS DDIALAREHFWEYDWHETLTLLAGLMENPIPLLEAITKEKDDIFKTLLLLAGRAAAEC KQNNHPLIAKIIDRIYQFWQFYPDASFITSTVVTLGQVNSQMCQKLQEALNGNAVEAL AKIGNSQAVDTLIAVLNDSNSSMRGNAIEALAKIGNSQAVNTLIAVLNDSNSSMRGNA IEALAKIGNSQAVNTLIAVLNDSNWYVRRYAAQALGKISNSQAVDTLIAVLNNSDWYV RSNAAQALGKIGSSQAVNTLIAALNDSTSSVRSNAAFALGKIGSSQAVDALIATLNDS DWKVRSNAAQALGKIGSSQAVDTLIAAFNDLDLNVRSNAASALGEIGSFQAVDTLITA LNDSDSNVRSNAAYALGEIGNSQAVDALIAALNDLDLNVRRYAAQALGKISNSQAVDA LIAALNDSDSYVRSSAAYALAYIGNPETLAKLIQLPEINIYDRDIFISARTLAVRFSK QGLLSKAGKPLIPVYPELVKFIPIWAFVKRHIRFLILSFRFLI" gene complement(2591..2791) /locus_tag="DP116_18060" CDS complement(2591..2791) /locus_tag="DP116_18060" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18060" /translation="MHHQSLSHEADTQVNKNKRHYVIANGAKRNEAIARVWDCFASLR YARNDILGLIRLSYLSCPAKYL" gene complement(2828..3409) /locus_tag="DP116_18065" CDS complement(2828..3409) /locus_tag="DP116_18065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205426.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_18065" /translation="MATFAAVTPKRFTIADYHRLIELGFLTENDRVELIRGELMQMVA KGTAHTVCNTRLVTELIILLQGQAIVRGQEPITLATNSEPEPDLVIARYRPDDYLAAH PQEADILLVAEVADATLKYDQEVKLSLYAESGISNYWIFNLVASCLEVYTQPYQDLQG NFGYASKQIFLPHAVVTLPGFPDLSVHLSKVFP" gene complement(3574..3825) /locus_tag="DP116_18070" CDS complement(3574..3825) /locus_tag="DP116_18070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131017.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicA family toxin" /protein_id="PRJNA477356:DP116_18070" /translation="MHRDIIFNELEKYLLKLGFTALRTSGSHKVFQHPSSEALVILPA YEQQAYVHPVHLLAVRRILIENELIDRNAFDSFLEKVAS" gene complement(4004..5026) /gene="hpnH" /locus_tag="DP116_18075" CDS complement(4004..5026) /gene="hpnH" /locus_tag="DP116_18075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878349.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenosyl-hopene transferase HpnH" /protein_id="PRJNA477356:DP116_18075" /translation="MGIHLQQAIEIGKYIVTQRLLGRKRFPLVLMLEPLFRCNLACPG CGKIQHPKEILKQHLTPEQCFTAVEECGAPVVSIPGGEPLLHPQIDEIIRGLVARKRF IVLCTNGLLLEKSLHKFEPSPYLTFSVHLDGMRELHDQCVDRKGVFDIAISAIRAAKS RGFRVATNTTVFDGTDPKELQELFDFLSTLGVDGMTISPGYSYEWAPDQDHFLKREQT RALFRQIFAPYKAGKKNWDFINSPLFLDFLMGEKDYDCTPWGSPSYSVLGWQKPCYLL NEGHYKTFQELLDKTDWSQYGHASKNPKCADCMVHCGYEPTAAMDAMQPTNIGRSVKA LLGMGN" gene complement(5120..6139) /locus_tag="DP116_18080" CDS complement(5120..6139) /locus_tag="DP116_18080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dihydroflavonol 4-reductase" /protein_id="PRJNA477356:DP116_18080" /translation="MRAFVTGATGFIGANLARLLLEEGYTVRVLVRPNSPLNNLQNLD VEIIKGDLNDPDLYRKIQGAQVLFHVAAHYSLWQADQDVLYHNNVLGTRNVLAAARQA GIERTVYTSSVAAIGVGELGKVVDETHQSPLEELIGQYKKSKYLAEQEAKQAVTQGQD IVIVNPSTPIGPWDIKPTPTGDIILRFLRRQMPFYLNTGLNFIDVRDVARGHLLALEK GKTGERYILGNQNLTLKELLDQLAEITGLSAPQKSVPAWLPLSLAWIDERILAPLGKP PSIPLDGVRMAHQPMYYDASKAVQQLGLPQTPIRTALQDAVNWFVAEKYVEVAYNKRL LGRLP" gene 7526..8701 /locus_tag="DP116_18085" CDS 7526..8701 /locus_tag="DP116_18085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745911.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18085" /translation="MLLPTNISKTPSVEPLINIWAERYTVDVSSLSKNPKFYGELIKA AWPEARALTAAKLLNRVLVRTTNQATIRAKSLYEYIPEIIDSYSEQRITQFACKVYQR LLEVYQQQSGILVIPTSRQTTTSDDQQTTLLLWTIPNIEKLVNEMQQLLLTYQEQHIM ARDQRVVGFLTTLFNFTNQSLISQLTSAEKVLLCPYFKFIEEYVAIPWVRVCAAAAKY QLGSPALTLVEQMLPMASEISSMVYCRLLELLPNHRSLRGELGHPQVTHSCLRDLDMF QGYLWLCVLEESLKPVKQELVPLCVMVMPSVGVKWEMTDKWKRFLADEIESRVQLEHK PLLLHYTQGMEEAFFAARKQLGYQGDVVEVISEFAGNLAVHLNSENAYETMWSRQQR" gene complement(8948..10594) /locus_tag="DP116_18090" CDS complement(8948..10594) /locus_tag="DP116_18090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869665.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="B12-binding domain-containing radical SAM protein" /protein_id="PRJNA477356:DP116_18090" /translation="MSLEKPLFEKTTRNTSQIPRNHRRILCIFPKYSRSFGTFHHAYP LRGGVRAFMPPQGILVVASYLPKEWEVRFIDENVNSATRADFQWADVVIVSGMHIQRP QMNQINQLAHQEGKITVVGGPSVSGCPEYYPEFDILHLGELGDATDQMIEYLDLHSNR PPQQIRFETKERLPLNEFPIPAYHLLNLNDYFLGNIQFSSGCPYHCEFCDIPELYGNN PRLKTPEQVSAELDAMLEYGNPGAVYFVDDNFVGNRRAVMQLLPHLIDWQKRNGYPIQ FACEATLNLAQSPKLLEMMREAYFCTVFCGIETPEPEALNAISKTHNLSMPILEAIKV LNSYGMEVVSGIIIGFDTDTPATADRIIEFIRLSQIPMLTINLLHALPRTPLWRRLEK EGRLIFDENRESNIEFLMPYEQVVEMWRRCITTAYEPEFLYQRYAYNMEHTYSNRIEV PNSPARTSWANIKTGLTILTNILLRVGILGNYRNTFWKMALPAFKAGNIEQLIHVGLV GHHLIQFTQECAKNEESASFYSQKIRNTVHGKTVSQPRFF" gene complement(10667..11713) /locus_tag="DP116_18095" CDS complement(10667..11713) /locus_tag="DP116_18095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749291.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18095" /translation="MFVPVVDPNNHPMMPTTPSRAKRWIKSGKATPFFKKGVFCVRLN QEPSNRNTQPVAVGVDPGSKREGYTVKSQAHTYLNIQTHAIDWVKDHVEVRRNMRRAR RFRNTPCRQNRKNRLVNKQKLPPSTKARWQWKLRICKWLALMFPISTFVVEDIKAKTW KGSRKWNTMFSPLEVGKKWFYSELKRIASLETRTGNDTYEMRQSLGLKKSKNKLSNKF DAHCVDSWVLANWFVNGHLKPENTRLIEIIPLEFHRRQLHRLQHSVGHIRTRYGGTVS AGFKRGSVIKHPKFGFCYVGGWQESPTKKDPDRKTISLHSLETGKRLTQSAIPIDCRF KSYGSWRTTAVKTA" gene complement(12038..13219) /locus_tag="DP116_18100" CDS complement(12038..13219) /locus_tag="DP116_18100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495648.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="PRJNA477356:DP116_18100" /translation="MLLTTTLSLTIIHFLLLILCISAVLFYCYGIYAAFAFFHHPHPL QLNFHPPVTILKPICGKDDETYNNLVSFCQQKYPNYQIIFCVRDPIDCGIPVVKQIIH EFPELDIELVVCEDIIGTNPKVSNLANAVTKAKHEILVIADSDIRVGIEYLQRVIQPL EDKSVGVVTCLYRSLAKGWASILEAVGTATDFHAGVLISNQIEGIKFAFGSTIVIRKQ VLDEIGGFGAIADYLADDFQLGYLPTQAGSMVVLSDYIVDHVLASSTIADSIQRQIRW ARCIRVSRPWGYLGLIFTYGTVTSLLLLIATGGSTIGCAVFAITWVMRLVMGWVVGVI YLQDFGVQKFFWIVPVRDVIGFLIWCYSLFGSTIEWRGRRFRLIKGGKLVEITNNIMF G" gene complement(13281..13511) /locus_tag="DP116_18105" CDS complement(13281..13511) /locus_tag="DP116_18105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949362.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18105" /translation="MTGNREQGTGNREQGTGNREQGTGNREKGVCFIHNWWRAASGTF LGKSMTLHIKTFSSIPQKSYFFDSKVIFEVIH" gene 13510..15138 /locus_tag="DP116_18110" CDS 13510..15138 /locus_tag="DP116_18110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871894.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_18110" /translation="MNLIYFLLRSSWGMVAIAVVTGFLSGGSSASLIALISTAASRSA GERLTGMAWGFLGLALVALITSVISQVMLIRLSQRAVFQLRMGLSRQILSSGLSHLEQ IGSPRLLATLTEDVQTVANAVHQLPYLCIDIAIVASCLLYITWLSWFVLLMVIGLAVV AIGSSQWLLIRGEKFLTLARDDQDVLFKHFRTITEGVKELKLHHRRRQVFLSQNLQST AAQFSRHNIQGLTLFTSTTSWGRLLFFFAIGFVLFALPHLFTISRQTLSGYILTFTYL MMPMNNLMENLPVISKASIALQKIESLGLSLANQAEQSTVPPESKSSWHDLQFVDVTH TYRTDQEDSNFIIGSINITFYPQQLVFIVGGNGSGKSTLAKLITGLYIPEAGEILFDG ELITEENREWYRQHFSVVFSDFYLFEELLGLDNINLDTQAQEYLKLLQIDHKVKVRNG KFSTTNLSQGQRKRLGLLTAYLEDRQIYLFDEWAADQDPVFKEIFYTQLLPKLRDKGK TVLAITHDDRYFHVADRIIKLDYGKVEFDKTRSH" gene complement(15086..15376) /locus_tag="DP116_18115" CDS complement(15086..15376) /locus_tag="DP116_18115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874486.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18115" /translation="MKLLLDQGLPRSAGVLLCNVGIETIHVSEIGLSVAEDAVIATHV GWVEARNPTPTIPVNVGFRSALPNLRICRILFSNRLSDSLSYQTQLYRNPVL" gene complement(15360..15602) /locus_tag="DP116_18120" CDS complement(15360..15602) /locus_tag="DP116_18120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009554001.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18120" /translation="MKLDRITSNPNQMNGQPCIRNLRLTVRRVIELLAIYPEREELRQ EFPELEEEDIRQALIFASSYLDDRIIELPTTYETVA" gene 15761..>16008 /locus_tag="DP116_18125" CDS 15761..>16008 /locus_tag="DP116_18125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18125" /translation="MRHPFFFFALPLATVFSFSYLTIARAQITPDNSLGAESSVVTPN VQIKDIPSDRIDGGAIRGGNLFHSFQEFNINAGRGAYFS" BASE COUNT 4647 a 3364 c 3415 g 4582 t ORIGIN 1 tctccttagt aaggagaggg aaagatttta gcgccgattc aagcgagggt gaggttaaaa 61 gcgagatctg tgtacacggt agcctacaag gagagggagt tatagatgat gtgatatttc 121 cgctgtttat caggctatct gatttagatg aacaagcgac tgaagttata gacacaatta 181 gactcattat acaaagagac tacccgaaaa ccgcgccact ggtcaagcat ttattagaag 241 agaaattaaa aaatggcaaa tgtctgctgc ttttagacgc tttagatgaa gtgccaaaag 301 aacatcgcaa tcacttaaaa gacaagctga accgatttgc tagatattat tcctgtccga 361 taatttgcac ttcccgaatt gtgggttacg gtggcgcttt tgtggagagt gctaaggagg 421 tggaaatcgt tccttttagc caaaagcaaa cagaacagta tattgaaacc tggttcacca 481 atgctgcggg ttacattgag gatgattccg tttcagcagg gaaactcatt gaagaattgc 541 gggataaacc ccaaattaca ggattagcac aaaatccctt acttttatcc ttgctgtgta 601 gtttgtatca agaaaaaggg ctgacgcttc ctgcacggcg gactcaggtt tatgcaaaag 661 ccgtggatta tatgttgagc aaatggcgga gtgacaatca tagacaatca tcgtctgatg 721 gttgggcgat cgccaaaatt caattattgg aatcactggc atatcaattt agctgcgaag 781 gcaaagaaat tttttcgcta cgggaactgc gggaaaaaat tgaaaaattt ctgcgaggtg 841 agagttgcag tgatttcaga aatgccaaag ccgccgattt aatgaaagaa ctgtgtgagg 901 aagatggtat tatccaaaaa ttggcaaggc aaggagagca atacctgttc cttcatcgga 961 cttttcagga gtatttgacg gcttcttatt tgaacaatgc cagtgatgat attgcattgg 1021 cgagagaaca tttttgggaa tacgactggc atgaaacttt gactttactc gcagggttga 1081 tggaaaatcc aattcccttg ctggaagcta ttaccaaaga aaaagatgat atttttaaaa 1141 cactgttgtt attggctggt cgtgcagctg ctgaatgcaa acaaaataat catcctttga 1201 ttgctaagat aatcgacaga atttaccagt tttggcagtt ttacccagat gctagtttca 1261 tcacatcaac cgtcgtgacg cttggtcaag tcaattcgca gatgtgccag aagctgcaag 1321 aagccctcaa cggcaacgcg gtagaagctt tggctaagat tggcaactcc caagctgtgg 1381 atactttaat tgctgtcctc aacgactcaa actccagcat gagaggcaac gcgatagaag 1441 ctttggctaa gattggtaac tcccaagctg tgaatacttt aattgctgtc 
ctcaacgact 1501 caaactccag catgagaggc aacgcgatag aagctttggc taagattggc aactcccaag 1561 ctgtgaatac tttaattgct gtcctcaacg actcaaactg gtacgtgaga aggtacgcgg 1621 cacaagcttt gggtaagatt agcaactccc aagctgtgga tactttaatt gctgtcctca 1681 acaactcaga ctggtatgtg agaagcaacg cggcacaagc tttgggtaag attggttcct 1741 ctcaagcagt gaatacttta attgctgccc tcaacgactc aacctccagc gtgagaagca 1801 acgcggcatt tgctttgggt aagattggct cctctcaagc cgtggatgct ttaattgcta 1861 ccctcaacga ctcagactgg aaggtgagaa gcaacgcagc acaagctttg ggcaagattg 1921 gctcctccca agcagtggat actttaattg ctgcctttaa cgacttagac ttgaacgtga 1981 gaagcaacgc ggcatctgct ttgggtgaga ttggttcttt ccaagcagtg gatactttaa 2041 ttactgccct caacgactca gactcgaacg tgagaagcaa cgcggcatat gctttgggtg 2101 agattggtaa ctcccaagca gtggatgctt taattgctgc ccttaacgac ttagacttga 2161 acgtgagaag gtacgcggca caagctttgg gtaagattag caactcccaa gcagtggatg 2221 ctttaattgc tgccctcaac gactcagact cgtacgtgag aagtagtgcg gcatatgctt 2281 tggcttatat tggcaatcca gaaactttgg caaagctgat ccaactccct gaaataaaca 2341 tttatgatcg tgacatattt atctcagcaa ggacattggc agttcgattc agcaagcaag 2401 gactactcag caaagcagga aagcctttga ttcctgtcta tcctgagtta gtgaaattca 2461 taccgatatg ggcatttgtg aagcgccaca tccgattttt gattttgtct tttcgctttt 2521 tgatttagat tgatttaatg cctgtcattg cgagcgtaac gcagtgaagc gaagcaatcg 2581 caatgacata ttacaagtat tttgcaggac atgataagta gctcaaccta atcaaaccta 2641 aaatgtcatt gcgagcgtaa cgcagtgaag cgaagcaatc ccaaaccctt gcgattgctt 2701 cattccgctt cgctccattc gcaatgacat agtgacgttt atttttgttt acttgcgtat 2761 cagcttcatg agacaagctt tggtgatgca tcttaacggg cgatcgcaca aataacttat 2821 cacaacttca gggaaacacc ttagataaat gaacagacaa atccgggaaa ccagggagag 2881 tcacaacagc atgaggcaaa aaaatctgtt tgctcgcata gccaaagttg ccttgcaagt 2941 cttgataggg ctgagtgtaa acttccaaac aacttgctac caaattgaat atccagtaat 3001 ttgaaattcc agattctgcg tagagtgata acttcacttc ctgatcatac ttgagcgtag 3061 catctgcaac ttcagccact agcaaaatat ctgcttcttg aggatgtgct gctaggtaat 3121 cgtcaggacg ataacgcgct atcaccaagt ccggttctgg ttcactattt gtagctagag 
3181 taatcggttc ctgtcctcgg actattgctt gtccttgtag gagtattatc aattccgtaa 3241 ccaaacgtgt attgcaaacg gtgtgggcag ttcctttggc taccatttgc atgagttctc 3301 cgcgaattaa ctctactcgg tcattttctg taagaaaccc caactcaatt aagcggtgat 3361 agtcagctat tgtaaaacgt ttgggagtaa ccgctgcgaa agtagccata cacagacaac 3421 cacttagaaa taggaatttt gtttttaaaa actagtctag cgtcttacct cgatgagaca 3481 agctttggtg atgcatctgt tagtccattt taatggactt agtctatgag cctgggactt 3541 atagtcctag gcggacgaca acactgctgc tttttagctt gccacctttt ccaagaaact 3601 gtcgaaagca ttcctatcga ttaactcgtt ttctatgagt attcggcgta cagctagcaa 3661 atgtactgga tgaacatatg cttgctgttc gtaagcaggt aaaatgacta gtgcttccga 3721 tgaaggatgc tgaaaaactt tgtgagagcc agatgttcgt aaagctgtga agcctaactt 3781 caacaggtat ttttctaatt cattaaaaat aatatctcta tgcattgtga cgaatgactc 3841 cctgtatctg acgcagaaca tcaaatatat aaaatgtatt caatttttta acttttctga 3901 tatcagacta ccagccaaaa tccacaaatg acgaccatgc cagaaaacac cgtgtccaac 3961 agagcaatca tttgtggtgg ctgatgtaag atctatcctc agactagttc cccataccca 4021 aaagcgcctt caccgatcgc ccgatatttg ttggttgcat cgcatccatt gcagctgtgg 4081 gttcataacc gcaatgtacc atgcaatctg cacacttggg atttttacta gcgtgaccgt 4141 attgactcca gtcagtttta tctaacagtt cttggaaagt tttgtagtga ccttcattta 4201 gaagataaca aggcttctgc caaccaagaa cactataact gggactaccc caaggagtac 4261 aatcgtagtc tttctcaccc atgagaaaat ctaaaaacag tggactatta ataaaatccc 4321 agtttttctt gcctgctttg tagggagcaa aaatttgccg gaagagtgcc cgtgtttgtt 4381 cgcgtttgag aaaatgatct tgatcgggtg cccattcgta actatagcca ggggaaatcg 4441 tcataccatc aacacccagg gtgctaagaa aatcaaacaa ctcctgcaat tctttcgggt 4501 cagtaccgtc aaaaaccgta gtgtttgtag cgacacgaaa tcctctcgat tttgccgcac 4561 gaatcgcact aattgcaata tcaaagactc ctttgcggtc tacacactga tcgtgtaact 4621 ctcgcattcc gtctaaatgt acactgaagg tcaggtatgg ggaaggttca aacttgtgca 4681 ggcttttttc tagcaacaag ccatttgtac acaagacaat aaaccttttg cgtgcaacca 4741 aacctcggat aatctcatca atttgaggat gcagcagggg ttcgcctcca ggaatggaga 4801 caactggtgc gccacactct tccactgcag taaagcattg ttcaggagtt aagtgttgct 4861 
tgagtatttc tttaggatgt tggattttgc cacaacctgg acaagccaaa ttacaccgga 4921 ataaaggttc caacatcaga accaggggga agcgtttgcg acccaacaga cgttgagtca 4981 ctatatactt acctatttca atagcttgtt gtaaatgaat tcccataatc tcctctacac 5041 caccatttat atcgactctc gccagcagca gtctgcactc aacctattga gcaaaatgtc 5101 cgatgacaat tttgcatggt tatggtaatc tgccgagaag tctcttattg tatgctacct 5161 caacgtattt ctccgccaca aaccaattta cggcgtcttg taaagctgtt ctaattggcg 5221 tttgaggtaa acctaattgt tggacggctt tggaagcatc gtaatacatc ggttgatgcg 5281 ccatgcgaac accatctaaa ggaatcgaag gtggtttgcc taaaggcgcg agaattcgtt 5341 catcaatcca agctaaactg aggggtagcc acgccgggac agatttttga ggagcactca 5401 aacctgtgat ttcggcaagt tggtctagta gttctttgag ggtaaggttt tggttcccta 5461 agatataacg ctctcctgtt tttccttttt ccaaagctaa taggtgtccc ctcgccacat 5521 cgcgcacatc gataaaattc aaacccgtat tcaaataaaa aggcatttgc cttcgcagaa 5581 accggagaat aatatcccca gtgggagttg gcttgatgtc ccaaggacca attggagtgc 5641 ttggattgac tatcacgata tcttgaccct gtgttactgc ttgtttcgct tcctgttcag 5701 ccaaatactt agacttctta tactgaccaa ttagttcctc aaggggactt tgatgagttt 5761 catccactac cttgcctaat tcacctaccc caatagcagc aacggaactc gtgtaaacgg 5821 tgcgttcaat acctgcttga cgagcagccg ccaacacatt gcgtgttccc aaaacgttat 5881 tgtggtacag tacatcttgg tcagcttgcc atagggaata gtgagccgca acgtgaaaga 5941 gtacctgagc gccctgtatt ttccggtata aatccgggtc gttcaagtcg ccttttataa 6001 tctctacgtc taaattttgt aaattgttca aggggctgtt aggacgaact aaaacccgga 6061 ctgtatagcc ttcttcaagg agtaaccgtg ccaagttagc accaatgaag ccagtagcac 6121 cagtcacaaa cgcgcgaatt gtcattgatc gtgtaccttg ctataaggtt gttaaataaa 6181 atacgctgga tagaaaaaat ataaatcaca atagaaaaac attgctgaag aatgaaaggt 6241 tttcttagtt tttttatgga aatagcttcc tgaaagtttt tagaatagta aatagaaatt 6301 tataataacg attagggatt ttctaatgcc caaaagcatt tttaagaaaa tagttgtatc 6361 tggcttactt tgttccttgc tgctcatgac aaccgcttgc ggcgaggatg aaagttatag 6421 tgacacgaac tcttcagaag ttaatcaaac aacagaaatc aggaatggtg aagttaaaat 6481 gaattgttcc tccagttcac cagataatga ggttactaca acagcaactg ttaatggaag 6541 agagtacaag 
tgtgagaatg gacggacggt tagggtgaga taaagctgct taggtcaaag 6601 gtcttgattg acaacagaag cctgtaggac gaaacagacg gttgatatca gcctctagca 6661 caatggaaaa acttggtaac tgtgcattta acagtactcg taactgttcc ataagttgcg 6721 agtaaaagca caatcacaaa aaatggtcga attgagtgca atttttactt agtaccaaca 6781 atgtcttgct gaatacaata tttatttatg aattaattct taattaattc ataaagttac 6841 gggctagaag cgcaacatcc acagatgtct tgacactccc ctgccagcgt gaatatcgca 6901 gtcatctcta ttcaatgtat tgtaaaatac attcaaatgt aacattgtat ctatcttagg 6961 gaagaaatag tacgattgat ttgtcggtaa tatgggtatt atatttatcc cattacggaa 7021 gtacacaaag cttctgtcta cttttctagt ggcatatgat taatagacct cttgcaaaag 7081 tgagatttta cgaggttctg ttaagagtta agcgttcaga attaagagtt ccctgttaag 7141 agttccctgt taagagttcc ctacaactgc cacgaagtct aatgaaatga caaataaaga 7201 agatattttc agggctgaaa tagatggttt agctgttgag gtgatggcac taggggcaga 7261 agttgtggat ttgaaagttg aaatgacagc acttaaggca gcacttgacg acaagattca 7321 aagcgctatc tgacgcgctc aaagtaaata tatgaaattt cagtctcaaa aagaattttc 7381 tgaaactttc gatgagtgag gttaaatttt atcttatgta gctactttga caaaaagttt 7441 taacataaag ataagtttgc ggatactaga acacccgtag acttgtctgc tcttgctcat 7501 gctgtcttga ctcaaaaatt aatttatgct attacctact aacatttcta aaactccctc 7561 ggttgagcct ttgataaaca tttgggcgga gcgttataca gtggatgtgt cttctctatc 7621 caaaaatcct aaattctatg gagaattgat taaagcagct tggccggaag ctagagcgtt 7681 gactgcagct aaattgttga acagagtatt agttcgtaca actaatcaag caactatacg 7741 cgcaaagtct ctgtacgaat atattcctga gattattgat tcttactcag agcagcgaat 7801 cactcaattt gcttgcaaag tttatcaaag attactggaa gtttaccaac agcagtctgg 7861 tatccttgtc attcctacaa gcagacaaac aacaacaagt gatgatcagc agacaactct 7921 attgctatgg accataccaa acatcgaaaa gctggtcaat gagatgcaac agttgttatt 7981 gacataccag gaacaacaca taatggcaag agatcagcgt gtggttggct ttctgactac 8041 gctgttcaat ttcactaacc aatcattaat aagtcaattg acatcagctg agaaagtgct 8101 gctgtgtccc tatttcaagt tcattgaaga atatgttgct atcccctggg tgcgagtctg 8161 tgcagctgct gctaagtatc agctaggttc accagccttg actctggttg agcaaatgtt 8221 gcctatggct tctgagatta 
gttcaatggt ttactgtcgg cttttagaat tactgcccaa 8281 tcatcgtagc ttaagaggcg aattgggtca tccacaggta acgcactctt gtcttcgtga 8341 cttagatatg ttccagggtt acttatggct ttgcgtgtta gaggaaagtc taaaacctgt 8401 gaaacaggaa ctggtgccat tatgtgtgat ggttatgccg agtgtgggag tgaaatggga 8461 aatgacggac aaatggaaac gattcttagc agatgaaata gaaagccgtg tgcaactaga 8521 acacaaacct cttttgcttc attacactca aggtatggaa gaagcattct ttgcagcacg 8581 taagcagcta ggttaccagg gtgacgtagt agaagttatc tctgagtttg caggaaactt 8641 ggctgttcac ctcaactctg aaaatgctta tgagactatg tggtcaaggc aacagagatg 8701 acagagtgac aagttatcta taagctttat ataacaggga ttttaaccgt tttaccggta 8761 aaaagatgaa gccgcaaatc gaataggctt agcgctaaat tttcaactag gttatcaggt 8821 ttgagacagg agttgcttct tttcgttgtc aataagcctg tcaataagca tctatcttgc 8881 ctgagtattg accaatgaat aagacatcaa aggtgacatc aaaggtagtc ctagattttt 8941 tccagttcta gaagaaacgg ggctgactca ccgtcttccc gtgaacagta ttgcgtattt 9001 tttgggaata aaaagaagct gattcttcgt ttttggcaca ctcctgtgtg aactgaatta 9061 gatggtgccc aaccagtccc acgtgaatca gttgctcgat attaccagct ttgaatgctg 9121 gtagtgccat tttccaaaaa gtgttgcgat agttacccaa tataccaacc cgcagtagaa 9181 tattggtcaa aattgttaaa cctgtcttaa tatttgccca agaagtgcga gcaggactat 9241 tgggaacctc tatccgatta gagtaagtat gctccatgtt gtaggcatac ctctgataga 9301 gaaattccgg ctcatatgcc gttgtaatgc agcgacgcca catctcaacg acttgttcat 9361 aaggcattaa aaactcgata ttcgactcgc gattttcatc aaatattaat cgcccttctt 9421 tttctaacct gcgccacaat ggagttctcg gcaatgcatg aagcaggtta attgtcaaca 9481 tgggaatttg agacaaacga ataaactcga taattcggtc tgcagttgct ggtgtatctg 9541 tgtcaaaccc gatgatgatt ccagagacaa cttccatccc gtagctattt aaaaccttga 9601 ttgcttccag aattggcata ctgaggttgt gagttttgga aatcgcgttg agagcttctg 9661 gttctggtgt ttcaataccg caaaagactg tacagaaata tgcttcacgc atcatttcca 9721 aaagtttggg actttgcgct aaattcaatg tggcttcaca agcaaattga atgggatagc 9781 cattgcgctt ttgccagtct atgagatgag gaagcaactg catgacggca cggcgattac 9841 ccacaaagtt atcatcaaca aaatacactg cccccggatt cccatattcc aacattgcat 9901 ctagttcagc gctaacctgt tctggagttt 
ttaggcgggg gttgttgcca taaagttcag 9961 gtatatcaca aaactcacaa tgataaggac aaccactgga gaactggata ttgccaagga 10021 aataatcatt caaatttagc aggtgataag ctggaatcgg aaattcattt aagggcaatc 10081 gttctttggt ttcaaagcgg atttgctgtg gggggcgatt gctatgtaaa tccaaatact 10141 caatcatttg gtcagttgca tcccccaact cacccaaatg caaaatatca aactctgggt 10201 aatattctgg acaaccagac accgaaggcc cacccacaac tgtgatttta ccttcttgat 10261 gggcaagctg attaatttga ttcatctgtg gtcgctggat atgcatccca ctgacaatca 10321 ctacatcagc ccactggaag tcagcccttg ttgctgaatt cacattttca tcaataaagc 10381 ggacttccca ctctttgggt aaataggatg cgacaaccaa aatgccttgt ggtggcataa 10441 aagcgcggac accacctctt agaggataag catggtgaaa ggttccaaat gagcggctgt 10501 acttgggaaa tatacagaga atgcgtctat gattgcgcgg aatttgcgaa gtgttccgag 10561 tcgtcttttc aaaaagaggt ttttctaaag acattaactc accattcata aaacctcgtg 10621 ggatgtgtgg aaactcagag gctttagccc tgagaggaaa cacgactcag gcagttttaa 10681 ctgccgtggt tctccatgag ccgtaagatt tgaatctgca atctattggt attgcacttt 10741 gagttaatct tttaccagtc tctaaactgt gtaaactaat cgtttttctg tccgggtctt 10801 ttttagttgg tgattcctgc caaccaccga cataacaaaa accaaattta ggatgcttga 10861 ttactgaacc gcgcttgaaa cccgcgctta ctgtgcctcc gtatctagtg cggatatgtc 10921 cgacagaatg ctgcaagcga tgaagttggc gacgatgaaa ctctagcgga ataatctcaa 10981 ttaatcgggt gttttccggt ttgagatgcc cattgacgaa ccagttggca agtacccagg 11041 aatcaacaca gtgagcgtca aacttgttag ataatttgtt tttagatttt ttcaatccta 11101 agctttgacg catttcgtaa gtgtcgttac ccgtcctagt ttctaaactg gctattctct 11161 ttagttcaga ataaaaccac tttttaccaa cctctaacgg gctgaacatg gtattccatt 11221 ttcttgaacc tttccaggtc ttagctttaa tatcttcaac aacaaaggtc gaaataggga 11281 acatcaaagc taaccacttg caaatgcgta atttccattg ccacctagct ttagtagacg 11341 gtggtaattt ttgtttgttt acaagtctat ttttgcggtt ttggcggcat ggagtattac 11401 gaaatcttct ggcgcgtcgc atatttctac gaacttctac atggtctttt acccaatcaa 11461 tggcatgggt ctggatgttt aggtaagtat gcgcctgtga tttaactgtg tagccttctc 11521 gcttacttcc gggatcaaca ccaactgcta ctggttgcgt gtttctatta gatggctctt 11581 gatttaatcg 
aacacaaaat actccttttt tgaaaaaagg tgtagcctta cctgatttaa 11641 tccaacgttt tgccctacta ggggtagttg gcatcatggg atggttattt gggtcaacta 11701 ctgggacaaa catttacgat aagtcctatt actaggtatt tatatccctt cgctactgac 11761 taccacagag gttaaaaact agggaagtat tcttaacgtg tttcgagtta ccacactcag 11821 gccattcagt ttaattagct agacacttgt ttaacttgat ttggggaggt ggtgaattcc 11881 cctcttgcaa gcccagtggc tttagcccta ggttagtgac actggattca ctccacaaat 11941 ttgaatagct aaaaaaatct tatccccatc ccctctcctg aacaaggaga ggggtgcccg 12001 aagggcgggg tgaggttctt cgttttttta taagtgttca tccgaacata atattatttg 12061 ttatttccac tagtttaccg ccttttatga gtcgaaatct tcgcccgcgc cattcaattg 12121 tgctgccaaa taagctgtag caccaaatga gaaagccaat gacgtcgcgt acaggaacaa 12181 tccagaaaaa cttttggaca ccaaagtctt gaaggtagat aacaccaaca acccagccca 12241 tgactaatcg catcacccaa gtgatagcga acacagcaca gcctattgtt gatccacccg 12301 tagcaatcag tagtaacaag cttgtgacag taccataagt aaaaatcagt cccagataac 12361 cccaaggacg ggaaactcgt atacaacgcg cccagcgaat ttgacgctgg atggaatctg 12421 ctatagtgct agatgccaat acgtggtcaa ctatgtagtc ggaaaggaca accattgagc 12481 ctgcttgagt gggtaagtaa ccgagttgaa agtcgtctgc gagataatcg gcgatcgccc 12541 caaatccccc aatttcatct agcacttgtt tgcgaatcac aatcgttgaa ccaaaggcga 12601 acttgatccc ttctatttga ttgctgatta agacacctgc atggaaatca gtagcagttc 12661 caacagcttc caaaatgctt gcccatcctt tggctagaga acggtataga caagtgacaa 12721 caccaacact cttgtcttct agtggctgga taactctttg cagatattct atcccaactc 12781 ggatatcgct atcagcaatc accaagattt catgtttagc tttggtgacg gcgttagcca 12841 aattactcac cttggggtta gttccgatga tgtcttcgca gacgactaac tcaatatcta 12901 attctggaaa ctcgtgaata atttgcttga caactggtat gccacagtct attggatcgc 12961 gaacacaaaa gataatttga tagtttgggt acttctgttg gcaaaaagaa actaaattgt 13021 tgtatgtttc gtcatcctta ccacaaatag gcttgaggat agtaacgggt gggtggaaat 13081 ttagctgtag gggatgggga tgatggaaga aggcgaacgc tgcataaatt ccataacagt 13141 aaaacaagac agccgatata cagaggatta ataataggaa atgaatgata gtcagactca 13201 aagtagttgt caaaagcata gaaagataca tttgttgtct cagatgcgac agtgtgggcg 
13261 tattatgaat gacctgcaca ttagtgtatg acttcaaata tgactttgga gtcaaaaaaa 13321 tacgactttt gagggatact agaaaaagtt tttatatgta atgtcatgct ttttcctaag 13381 aaggttccac tggcggcgcg ccaccagtta tgaatgaaac aaacaccttt ttccctgttc 13441 cctgttccct gttccctatt ccctgttccc tgttccctgt tccctgttcc ctgttccctg 13501 ttccctgtca tgaatctgat ttactttctc ctgcgttctt cttggggaat ggtggcgatc 13561 gcagttgtca ccggatttct cagtggcggt agcagcgcca gcctcatagc cctgatcagc 13621 actgcagcaa gtcgcagcgc tggtgaacgc ctcacaggta tggcttgggg ttttctcgga 13681 ctggcactcg tagcacttat tacgagtgtt atttctcagg tgatgctgat tcgcttatct 13741 cagcgtgctg tgtttcaact acggatgggc ttgagtcgcc agattctctc ttccgggttg 13801 agtcatttgg aacagatagg aagtcctcga ctcttggcaa ccctcacaga agatgtacag 13861 acagttgcta atgctgtaca ccaactgcct tacctttgta ttgatattgc tattgtagcg 13921 agttgcctgc tgtatattac ttggctatcc tggtttgtac ttctgatggt tataggactt 13981 gcagtggtag caattggtag ttcgcagtgg ctgttaataa gaggagagaa attcctcact 14041 cttgcacgag atgatcaaga tgtcttgttt aagcatttcc gcactattac cgaaggagtc 14101 aaggaactta aattacacca cagacggcgt caagtcttcc tttctcagaa tctgcaatca 14161 acagcagccc aattcagccg tcacaacatt caaggtttaa ccctgtttac atcaacaacg 14221 agttggggaa gacttttgtt ttttttcgcg ataggttttg tgctgtttgc acttccccac 14281 ctgttcacca tcagtcgcca aactctctca ggctacatct tgacatttac atacttgatg 14341 atgccgatga ataaccttat ggagaacctc cccgtaatca gtaaagctag catagccttg 14401 cagaagatag agtcactggg tttatctctg gcaaaccaag ctgaacaatc aacggttcca 14461 ccagaaagca agtcttcgtg gcacgactta caatttgtgg atgttaccca tacttatcgc 14521 acagatcaag aagacagcaa ttttattatc ggttctatta acataacgtt ttatcctcaa 14581 caactggtgt ttattgttgg aggaaatggc agcggtaaat ccactctagc caaacttatt 14641 acaggactct acattccgga agctggagaa attttgtttg atggagagtt aattaccgaa 14701 gaaaatcgag agtggtatcg ccaacatttt tctgtggtgt tctctgactt ctatttattt 14761 gaagaacttt tgggattaga taatattaac ttagatactc aagctcaaga gtacttaaaa 14821 ctactccaaa ttgaccataa agtcaaagtg agaaatggta aattttccac gaccaatctt 14881 tcacaagggc agcgcaaacg actgggcttg ctaacagcat 
atttagaaga tagacaaatt 14941 tatctctttg atgaatgggc ggctgaccaa gatccagtat ttaaagaaat tttctacact 15001 caacttttac caaaactgag agataagggt aaaactgtac tggcaattac tcacgatgac 15061 cgctattttc atgtagcaga caggattata aaactggatt acggtaaagt tgagtttgat 15121 aagacaagga gtcactaaga cgatttgaga aaagaatgcg acagatgcgt aggttgggta 15181 gagcggagcg aaacccaaca tttacaggta ttgtaggcgt tgggtttcgt gcctcaaccc 15241 aacctacatg tgtagctatt actgcgtctt cggctacgga tagcccaatt tcgcttacgt 15301 gaattgtttc aatacctaca ttacatagca aaactcctgc agagcgtggt agtccctgat 15361 caagcaacag tttcatacgt ggtgggtagc tcaataatgc ggtcatctaa gtaggaagaa 15421 gcaaaaatta aggcttgccg aatgtcttcc tcttcgagtt ctgggaactc ttgacgcagt 15481 tcttctcgtt cagggtatat agccaataac tcaatcactc gacgaactgt aagacgaaga 15541 ttacgaatgc agggctgtcc attcatctga ttgggattgc tggtgatacg gtctaatttc 15601 atctattctg gctccagaat tctgacccca tattgtcaca aaatcttaca aataagcact 15661 ctttgcgtac aaacctgata ttagtactct cagtcacgct aattacagta tccttcccaa 15721 cctcatattc aaaactcata aaatactgag gacaacacag gtgcgtcacc cattcttttt 15781 tttcgcctta ccccttgcaa ccgtattcag ctttagctac ctcaccattg caagagcaca 15841 aatcactccc gataacagtc tcggtgcaga aagttcagta gtcacgccca atgtccaaat 15901 aaaagacatt cctagcgaca ggattgatgg aggcgcgatt cgtggaggga acctgtttca 15961 cagttttcag gaatttaata taaatgcagg tagaggagct tatttttc // LOCUS NODE_2101_length_15946_cov_5.540180 15946 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15946) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15946) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15946 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(166..1668) /locus_tag="DP116_18130" CDS complement(166..1668) /locus_tag="DP116_18130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198040.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18130" /translation="MTTSKIFFQPLDRIAIGLILILSFLIGLLMLQGDAVKPSVRHFS WQNQQIAADDTSFTLTFSRPMDSKSVEDNIKIDPPLAGKVSWAGRRMVYTLLTPAPYG TTYKVQLQGARDKFSQKEGKNRQIQPFTGSFRTRDRIILYIGADQQDKGRLVLYNLSQ ERKMVLTPKDLVVMDFKPFPNGDKILFSARKVGNQDLLSTQLYTVTTGISAKSEKPAQ ALGKVDLLLDNKEYQNLKFDLSPDGKTLVIQRGKKDEPGDFGLWFMPLNNESSQEKLT PKRLKSQPGGDFMITADSQEVAIAQGQGIGLLPLQADASKPRDFFPEFGLVEAFSQDG LQAVMVKFNPDSTRELFLLRNQGVQKPLLRTKGSILSCQFDTASPTLYCLLTQLLPGE LYQEQPYLVAIDLKTMQQKPLLVLPPDHRNVQMSLAPDGLGLLLDQVVPQTTPTDSSQ ANMLTTDEGQPIATSSLWLIPLLPISDPSAVNEIKPQQLPLVGFHPRWLP" gene complement(1896..2717) /locus_tag="DP116_18135" CDS complement(1896..2717) /locus_tag="DP116_18135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209915.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR03943 family protein" /protein_id="PRJNA477356:DP116_18135" /translation="MTNDFNSKKKSQNRYQKLLPWLDVLAIAAWGLLILKYWLTNKLN LLIHPDYFWLAIVGAIGLITIAFFKTQQLSQRRRQIAPNVQHLTLFPPGWSSALVLTA AILGLIITPHVFASQTALQRGVTDLMGATRAKPQTTPDQLQGKRAQPQPFRASSPLEE RTLVGWVRTLNVYPEPDAYTGQKVKVQGFVIHPPDLGKEHLFLARFVITCCAADAYPV GLPVKLKESRDNYPADTWLEVEGQMVTENLANKRQLTIDATSLKKIPQPKDPYTY" gene complement(2791..3849) /locus_tag="DP116_18140" CDS complement(2791..3849) /locus_tag="DP116_18140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997512.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="permease" /protein_id="PRJNA477356:DP116_18140" /translation="MNQLNNGFTLFLSLLVEAIPFLLLGVLFSSLLMFFVDEGKLVEK MPKNPFLGALVGSLVGFLFPVCECGNVPVARRLLMQGVPIPVAIGFLLAAPTVNPIVI WATWTAFRDQPEIVVLRVVLSLLIAVIVAFIFGFQKDLAPVVQPAIARYLKFNPPAKP QPKRSTRSQFIQQEEATGSTLLQSGTYLLGGQAGQSIRMDGDILQANMPASKPSKPLP DKLRLLVDNCVQELRELGAVLVIGSAIAAAIQVLTPREFIISLGAGPISSITAMMILA VVVSICSTVDSFFALSFASTFTSGSLLAFLVFGPMIDIKGIGLLLSILKPKAIIYLFF LAAQLTFLFTLYLNLHVI" gene complement(4519..4995) /locus_tag="DP116_18145" /pseudo CDS complement(4519..4995) /locus_tag="DP116_18145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015955013.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 5205..9763 /locus_tag="DP116_18150" /pseudo CDS 5205..9763 /locus_tag="DP116_18150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317026.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 8132..8141 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(10295..14494) /locus_tag="DP116_18155" CDS complement(10295..14494) /locus_tag="DP116_18155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009783688.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18155" /translation="MSNTTGVKTILILAANPASTARLRLDAEVRSIEEALQHAPKGGQ FRLVQKGAVRSRDFYLAILEHQPQIVHFCGHGTGVNGIVLEDDTGQPTLVDQETLSQL FKLFAVKGVECVVLNACYSSVQAEAISQYIQYVVGMNQTIGDKAAIAFAVAFYDALGA GETVEFAFNLGRTELIRLKEDQIPVLKTTSIHPADIQFEAGDIPPNPYLGLSAFGEKD TAFFFGREKFTDELFRMTNQQPMVAVIGASGSGKSSVVFAGLIPKLREKGIWLIESLR PKSQPFDELALALVRQLEPNLDGVDKVIKVSKLAESLKKGEVKLHQVASQILENKSNK RFLLVVDQFEELYTQCQDKQEQQRFIDTLLATVSQKSITLVFTLRADFYGYVLSYLPF CEALQQFKHTPLGLMRREELQAAIEQPAQKLNVKLQTYLAERILDDIGNEPGNLPLLE FALTQLWDNQKNGEMTHRAYDEIGGLKQALVKHAEQVYSRLSQSQQQQTQRVFLALVR LGEATKDTRRVATHQEIGSGNWELVTHLASSEARLVVTGRNDKSGEETVEVVHEVLIR EWKRLRKWISINRDKLIQHRKIEAAATEWRDKGKSKDYLLAGKQLNEAKAFQKEQASL FALSALASELIQKSIEYRWNNRLRLSSFGLLPLLALTVFSGFATIGQRNAQIEQIKAF EQASDAEWRSNQDFEAVIDALRAGKALDKLLPFGLFKPDAELARVRGTLQKVVYTERV KEFNRLEEDYDLLASVVFSPDGQTIAVGSTNKSVTIWSLEGIKLQTLTGHDAEVKSVA FSPDSQTIATASDDKTVKLWKRNNTGQFNTQPEQTLTRHSGGVKSVAFSPDGQTIATA SDDKTVKLWKRNSTGQFNTQPEQTLTGHDAGVKSVAFSPDGQTIATASEDSTVKLWSI EGQELQTLTGHDGEVTSVAFSPDGLMLVASANLNGTIKLWKRNSTGQFETQPDDITPG VYIRSIRSVAFSPVRVASPQGFGQMLALATEDKTVILVNLQGQVLQTLTGHTGWVSSV AFSPDKKTLTLASSTDRTVKLWRFEGKKLPTLSNVHERAIWRIAISPNGEMLASASID GTVKLWKWDSTGQFETQPYKTLIGHRNWVWSVAFSPDGKTIATASDDKTVKLWSIEGE ELQTLTGHDGGVKSVAFSPDSKTIATASDDKTVKLWKRDDSTGEFVIQPYKTLTGHNG KVFSVAFSPDGQTIATASDDNTVKLWNLDRSQVLRTLTGHKGDISSVVFSPNGQVLAS ASFDGSIKLWKRGFTGQFETQAYKTLTGHTTWVTSVVFSPDGQMLASASEDTTVKLWT LEGKELQTLYGHGDRNWIRSIVFSLNGKKIVSAGAGKVILWNLEDFSLDKLMQPACNW VQNYLKNNPNVSQSDRRLCDN" gene complement(14558..15898) /locus_tag="DP116_18160" CDS complement(14558..15898) /locus_tag="DP116_18160" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18160" /translation="MSIKRTNALVRSKVTAQSPQIEATVVGGLDQPFLGGAIAEHTPY FTVCDHKDHGWVIDGGAVHGIPQPVGDETTLLALFPFDSSFEQLRQLSGAIAQAKVLQ VLPQVSKIQISGIENLNPDTTFKAVVTSLPLPPKGVLITGEEAGVQLARTALLQAGPQ SQPSLYIREVTTLEETEFQLVARDGKYFITRPADDRPLVAEIQGYTTATALQVIQRLE HIARWTNIVELSSPASSRIQPDAVQMVIEQNAQELQDVQLRLEYRQENNKWKSPSFRV KLKNTSHEPLYCSLLDLTDRYAVSADLLDGGGLWLQPGQEAWALGGNSISATVPEKLW REQGITEFKDILKLIVSTAEFDATLLQQSELDLPSRSVPAPRRGQGTLNRLMNRIPSR DIRAKPEEEELCDDWVSSQMTITTVRPQQTTLVPTPDASISLADDIHKVRSRGA" BASE COUNT 4535 a 3237 c 3405 g 4759 t 10 others ORIGIN 1 tgctccactt ggggagaccc caagaccgca ctggctcccc tcccgcagcg ctggtttcac 61 tgctagcaac atatgaaaga aacaccatcc tccttcttaa gagttccctt ttccctgtta 121 agagttccct gttccctgta tttcttaaaa ggttgtattg tgtttttacg gcagccaacg 181 gggatgaaat cctactaagg gtagctgttg tggcttgatc tcgttgactg cactaggatc 241 tgagatcggc aataagggta ttaaccacaa actactggtc gcgattggct gtccttcatc 301 agttgtcaac atattcgctt gtgatgagtc agtgggggtt gtctgcggta caacttggtc 361 aagtaacaac cccaaaccat caggtgctaa actcatctgt acatttcgat ggtcgggagg 421 caaaactagc agtggtttct gttgcatggt tttgaggtca attgccacta aatatggttg 481 ttcttgatag agttcccctg gtaaaagctg tgttaacaaa cagtaaaggg tgggtgaagc 541 agtatcaaac tggcaactga gaattgaacc tttggtgcgt aacagtggtt tttggacgcc 601 ttggttcctc agcaaaaaca attctcgcgt ggaatctgga ttaaacttga ccatgaccgc 661 ttgcaaccca tcctgagaga aagcctccac taaaccaaac tccggaaaaa aatctcgggg 721 cttgctggcg tctgcttgga gaggtaatag ccctattccc tgaccttggg ctattgcgac 781 ctcttgacta tccgccgtaa tcataaagtc tcctcctggt tgacttttca gacgtttagg 841 ggttagtttt tcctgagaac tttcgttgtt taggggcata aaccacagcc caaagtcacc 901 tggttcatct ttttttcctc gttggataac aagagttttt ccatctggcg ataagtcaaa 961 tttcagattt tgatattctt tgttatccaa aagcaaatca actttaccca gtgcttgtgc 1021 tggtttttca gattttgcag aaatccctgt tgtcacagtg taaagctgag ttgacagtaa 1081 gtcttggttt cctactttgc gagcagaaaa taaaatttta tctccatttg gaaagggctt 1141 aaaatccatg acaactaagt ctttgggagt 
gagtaccatt tttcgttctt ggcttaggtt 1201 gtaaagaact aatcgcccct tgtcttgttg gtctgccccg atgtaaagaa taatgcgatc 1261 gcgtgtgcgg aaacttcctg taaaaggctg tatctgtctg tttttacctt ctttttggga 1321 aaatttatct cttgctccct gcaactgtac tttgtacgtt gttccatagg gggctggtgt 1381 cagaagtgtg taaaccatcc gccgtcctgc ccaactcact ttacctgcta gaggcggatc 1441 aattttgatg ttatcctcta cgctttttga gtccattggg cgactgaagg tgagggtaaa 1501 ggatgtgtca tctgctgcaa tttgttgatt ttgccagcta aaatgacgca cactcggctt 1561 aactgcatca ccctgtaaca tcaatagccc aatcagaaaa ctgaggatga gtatcagtcc 1621 tatagcaata cgatctaaag gttggaaaaa aattttactc gtggtcatta ttatcagtta 1681 tcaatgatca gttatcagtt atcagttaat atctacgaat caataggttc gatttaaaca 1741 atagcaatcc tatatgagtt gtgagaatta tcgattagca acaagacgcc aaggacgcca 1801 agaaagagaa gagaaaatct tacaaatgat ttaggactgc taaaggattt taacaattaa 1861 cagaaaattg atcactgtta actgttcact gataactagt aagtatatgg atctttgggt 1921 tgaggaattt tcttcaagga agttgcatcg atggtcagtt gacgtttatt tgctaggttt 1981 tccgtcacca tttgtccttc gacttctaac caggtatcgg ctgggtaatt atcacgactc 2041 tctttgagtt ttacaggtaa tcctacggga taggcatctg cagcacaaca agtgatcaca 2101 aatcgtgcca agaacaaatg ttcttttcct aaatctggtg gatggatgac aaatccctgt 2161 actttgactt tttgtcctgt atatgcgtct ggttctgggt agacattcag cgtacgtacc 2221 caacctacca gtgttcgttc ttctagaggg ctggaagcgc gaaatggttg gggttgggcg 2281 cgttttcctt ggagttgatc gggtgttgtt tggggtttag cgcgtgttgc tcccattaaa 2341 tctgtcacac ctcgttgaag cgcagtttgg ctggcaaaaa cgtggggtgt gattattaat 2401 cctaagattg ctgctgttaa cactaaagca ctgctccaac cgggtggaaa taaagtcagg 2461 tgctgaacat ttggggcgat ctgacgacgc cgttgcgaaa gttgctgtgt cttgaaaaaa 2521 gcaatagtta tcaagccaat agcacctaca attgctaacc aaaagtagtc ggggtgaatt 2581 agcaagttta gcttgttagt tagccagtat tttaggatta aaagacccca agctgcaatt 2641 gctaagacat ccagccaagg cagtaatttt tgataacgat tttgagattt ttttttggaa 2701 ttgaagtcat tggtcattag ttattagtca attgctaaat gacgtgcaaa ttgatgaaga 2761 gggtgaaact gtgaagttat tagtcgattt ttaaatgaca tgcaagttga ggtagagggt 2821 gaataaaaat gttaactgtg ccgctaaaaa aaacaagtaa 
atgattgctt ttggtttcaa 2881 aattgataac aataaaccaa tgccttttat gtcaatcatt ggtccaaata ccaaaaaagc 2941 taataaggaa ccactggtaa aggttgaagc aaaagaaagg gcaaagaatg aatcaactgt 3001 agaacaaatt gacaccacta cagctaagat catcatggca gtgatagagc taattggacc 3061 agcccccaaa ctgatgataa attcacgggg cgttagtact tgaatagcag cagcaatagc 3121 gcttcctata actaacactg ctcctagttc acgcaattct tgcacacaat tatcgactaa 3181 caaccgcagt ttatctggta ggggtttact gggtttagag gctggcatat ttgcttgcaa 3241 aatatcccca tccatccgta tgctttgtcc tgcttgtcct cctagcaaat atgttccaga 3301 ctgtaataaa gtagatccag tcgcttcctc ctgctgaatg aattgacttc tggtgctgcg 3361 tttgggttgt ggttttgcag gtggattaaa ttttagataa cgagcgatcg caggctgtac 3421 cacaggagct aaatcttttt gaaaaccgaa gataaaggcg acgataactg cgatcaataa 3481 agaaagtacg actcgtaata ccacgatttc tggctgatcg cgaaatgctg tccaagttgc 3541 ccaaattacg atggggttaa ctgttggtgc tgctagcaga aagccaattg ctactggtat 3601 ggggactcct tgcatcagca accgtcgtgc tactggtaca tttccgcatt cacaaaccgg 3661 aaataaaaag cccacgagac taccgactaa tgcacccaga aacggatttt tgggcatttt 3721 ttccaccagt ttgccttcat cgacaaaaaa cattagcaaa ctagagaaca aaaccccaag 3781 caacaagaaa ggtatcgcct cgactagcaa actaagaaag agagtaaaac cattgttcag 3841 ttgattcatg tgtgtcgctt atgaaagcgg tgttgagttt ttagattttg ccaagtcatt 3901 ttatgatgga aacaagtatt actcatagat tgcgattatt caagaacact cgagtaacaa 3961 tattttaggt tgctaattac gattttttat ctagccaatc ttccgttaag catgagtcat 4021 gagcaattag agtcctacat agctgtatat gcttccctgt gtgctcttgg gtgagccgta 4081 atttctagat tgtcagactt caaaattagt tccaatttgg gagagtccgc atctatttgc 4141 acaattgtaa ttacgacatt ttcttagtca aggtaatttt gcttgatagt atgttattct 4201 tcgcccattg tcataagggg ttcacagttt ttaaacattt ggggatcgct atggtaacga 4261 ctcattttgg gatatgccta ttttaggtaa tatctcatgg gggtgatgga taaataatta 4321 ctacaaaatt tttaaaactg caagaccaaa atggaagaaa tatggtaagt cgcattcttg 4381 tagacaggta gcagtgagac tgcgcgcagg gagggttttt cttgtgcagc ctctgcgcaa 4441 ttcgcagcca tgcctacggc ataggattat ttgtagataa gtgctgacat aattacttta 4501 cttggttcaa aatatcttct accagcctat cttagtaact tcgtcgcctt 
tgtaaccaat 4561 cgacttcagt gcctgattga accgctctaa accaactttc ttaatatcat ctaccatttc 4621 accagagcat gatgtaacca agcttagagc cttcatcatc taattgttga gcttccatgt 4681 aagagcgtgc taattttcgg gcttcagtca tcgtttcagc gtctttagct gatactctat 4741 ttttcgcgac accttgagaa ataaggtact cagaggtttt ttcccgatgc cataaataat 4801 aaggcgtatg cgttcattgg gcgtgatttt ttcaaagaaa ccaccatctt tagccatttg 4861 actaggctca ccatcttttt cgcgtaatcc gtattgcttg agcagattac cgaatttaac 4921 ggctgatact ccaaactctt gacctagttc agttaaagtt ttccaaactt ttcggaagtt 4981 gtttttcttt accacgatat aatccttttg ggtaatagtg ttcacctcga cacttataat 5041 gaaccaccta tgtataaccg atgattaaga tgtgggtaaa tattagcgaa tagttttgca 5101 tcctactcta ggcgcggctg ttgctaacct tatgctatga aagaagatat ggactaatgt 5161 gaaaagctat attttttcta gtcgtataat attttaatct tcaaatgagc aagtcagttg 5221 tcatcaatct gggacacggt aacttgtgtc atggatttcc aaaagtgact atccaattgt 5281 ggacatttaa tcaaccgctt ccagagcaat tcatcggctc gttaccacct gcaccaaagt 5341 tgattgaatt gtatcataaa tggcagctaa cctatcatgg tttatgtaaa tccgtgtatc 5401 tacgttctcg gttggcaggg gaaagggaag acgatgaact ggaaattgat gaagccgcta 5461 taaccaacat ctctaatgtt gagtttcagg atttatgtca acagttacaa gaaagtatga 5521 atgcttggct caaatccgag ggaattctca atactagccg acaactacgc accttactaa 5581 acccaacaga agaaatacga gtcattcttg aaactgataa tgaaatgatg cggcgaatac 5641 cctggtatcg ctgcgattta tttaatgatt atccacgtgc agaaatagct ttatctcaat 5701 cagaatataa acgccgtgaa ttgttacaac cacaagttaa tcgaaaaata gttagaatct 5761 tggcaatttt ggggaacagt gaaggtatta gtttacaagt agaaagtgaa tttctcaaaa 5821 atttgccagg tgcagaacct gtttttcttg tcaatccaac acgtcaagaa ttcaacacac 5881 agctttggga ttctgctggc tgggatatcc tattttttgc tggtcatagt cacagcgaag 5941 gtgaaacagg tcgaatttat attaacgaaa acaagacaaa taatagtctg acaattgaac 6001 agttagaaga agctctgaaa gcagccattg acaatggttt acgcttggca attttcaact 6061 cttgtgatgg tttaggatta gcaaatgcgc tacaaaaatt gaatattccc acagtgattg 6121 taatgcgaga gccagtgcca aacctcgtag cacaagaatt ttttaagtat tttttgcaag 6181 cttttgcact agaacaactg cctctacacc tagcagtgca gcaagcacgc agaaagttac 
6241 atggtttaga agatgacttt cctggtgctt cttggttacc tgttatttgc attaatcctg 6301 cagcagaatc accaacttgg ttacaactga gtgataaaac ccttcatttc cattttaatc 6361 aaatccattt caatcaaaat acgaaaacaa gagccaccag caaacatcag gactggggag 6421 aggcgattga tgcctcagtg ttttacggac gtactgaaga acttaccact ttaaggcaat 6481 ggattattaa ggaagattgc cgactcatta ctttaattgg tatgggtggg attggcaaaa 6541 caactttgtc tgtgaagtta gcacagcttt tgcaagatga ttttgagtat ttgatttggc 6601 gaagtctccg taatgcgcca tcaattcatg agcttttgag cgacttaatt aacttttttt 6661 ccaatcaaca ggaaaccgtt ttaccagaaa ctttagacgg taaaatctct tgcttaataa 6721 aatacctgcg cgactcacgc tgtttgttag tgctagacaa tggtgagtca attttatgta 6781 gtgaaaaacg tgccggagca tatcgacaag gatatgaagg atatgagcaa cttttcaaat 6841 gtttgggaga aagtaaacat cagagttgcc tggtgttaac aagtcgagag aaacctagag 6901 gaattagcgt aaaagaaggt atcaactctc ctattcgttc actgagattg tttggcttaa 6961 cacaagcaga aggtcaggca attctggcag aaaaaggttt ttatgtatca gaagaacaat 7021 gtcgatcgct tgtggagtat tatgcaggca atcctttagc cttaaagatt gtcgcaacaa 7081 ctattgcgga attatttgac ggcgatgctg ctcagttctt acaacaaggc acaattgtat 7141 ttggggatat ttcagattta ctagaacagc agtttaatcg gttgtcaatt ctagaacaac 7201 aggtgatgta ctggctgacg attaatcgtg aatggatatc ttttaaagaa ttacaacaag 7261 acatgattcc tgctgtttca ctacgagatt tgttagaaac attagaatct ctacaatcgc 7321 gttcattaat tgagaagaac tcaggtaact ttacccaaca accggtggtg atggagtatg 7381 taattgaccg attcgtcgaa gagatttgcc aagaaattga aactgaagaa atagcattat 7441 ttaatcgatg tgcattcatc aaatctcaag ccaaagatta tgtgctcaat tcacaaatta 7501 ggctgctcct taagcctgta gcagataaac tattgaccct ttttggtcat cgagaaaata 7561 ttcaaattta cttgaatcag cttttgtcaa aactgagaac gtctacctta caaaaaccag 7621 gatatgctgc tggtaatatt ctcaatttac tctggcaact tcatgtagac ctcaatggtt 7681 atgatttttc taacttaact gtttggcaag cttacttaca gggcatgaat ttgcatcgag 7741 tcaacttcgc taactcagat ttaagcaagt ctacttttac tcgaacatta ggagggattt 7801 tatcagcaac ttttagtcca gacggaaagt ttttggcaac agctattgat gatgaaatta 7861 ttttgtggga agttgcaaac attaaacaaa ttatgaccta cagtggtcat acttgttggg 7921 
tgcaacctct tgcctttagt ccagacggac aaattttagc aagtggtagc aacgaccaaa 7981 caattcggtt atggaatatt cacactggac aatgtcttaa aacactgcga ggtcatacaa 8041 gttggctaca atctcttgct tttagtccag acggacaaat tttagcaagt ggtagcaacg 8101 accaaacaat tcggttatgg aatattcaca cnnnnnnnnn naaattttag caagtggtag 8161 caacgaccaa acaattcggt tatggaatat tcacaccgga caatgcttaa aaattttgcc 8221 gggacatacc agtcgagtca tgtttgctac cttcagtcct aatgggcaaa cattaatcac 8281 tggtagtgaa gaccaaaccg tgagagtttg ggatgtgaac acgggtgagt gtctacaaat 8341 cctagaaact catatcaatt gggtgctatc tattgctgtg agtcctgata ggcaaacact 8401 ggtcactgca agtgatggca caacagtaaa attttgggat ttagccagtg gtgagtgcat 8461 aagaatatta ccagattaca acagttatgt gtgggcagtt gccttcagcc cagacggcaa 8521 aacattggca accgggagtg aagataaaac agtcaagata tgggatactt taacaggaga 8581 gtgtttacaa actttgcatg agcatagcga acgcgtttgg ttggttgctt ttaatccaga 8641 tggacaaact ctaatcagtg ccagtgaaaa ccaaacgatg aagctgtggg atgttctgac 8701 aggacaatgc ttgagaacag tggatggata cagcaattgg gtgttatctg tcgcttttag 8761 ttcagatggt caaatgctgg caagtagtag cgaagaccaa agggtaagat tgtgggatgt 8821 tgtgacaggc gaatgtctac aaactttaca aggacatact aacttggttt cgtcagtcac 8881 ttttgcacca caaaatataa atgttcgcac aggcaaattc atcacttcag atgttgaaac 8941 gaagcaaaga agtcaaattt tagcaagtag tagtgatgac acaaccataa agctttggga 9001 tgcaagtacg ggtgagtgtg tgaaaacact ttggggacac agtagttggg taaatgcagt 9061 cagtttcagc gatgatggac aaattttagc aagtgctagt cgtgaccaaa cgctaaagct 9121 ttgggattgg cgcacgggtg aatgtttgca cactctagaa ggacatactc atcgcgtcaa 9181 aacagttgct tttaattctc aaagttcaat actggcaagt ggtagcgacg ataacaccgt 9241 caagctttgg gatgtgagta caggaatttg tttacaaacg ttccaaggac acagtgactg 9301 ggttttatct gttgtgttta gtccctgtgc aggcattctt gcaagtgcta gtggagacca 9361 aacaattaag ctgtgggatg tttctacagg gcaatgttta caaacatttc aaggacatac 9421 atatcgagtc aggacaatcg cctttagtcc agatggcaaa actttagcga gtgggagtga 9481 cgaccaaaaa gttaaactgt gggatgtgag tacaggtgag aatttaaaaa catttgcagg 9541 acatcataaa gcagtccggt cagttgcgtt tagtcccaat tctcccttat tagtcagttg 9601 cagcgaagac 
gaaaccatca agctttggaa tattgaaaca ggcgaatgcg tgaaaacgat 9661 gagaatcgat agaccctatg aaggtatgaa tattaaaaat gccattggtt taacaacttc 9721 tcagaaaaac acattgaaag ctctgggggc agtagagaga taaattagag gtatgatttt 9781 agtgttctcg tccgcctagg actgtaagtc ccaggcttat aggcaaagtc cattaaaatg 9841 gacttgcact gtaacttagt tttgagtaaa tgtcaacact atttcataga agtattggtg 9901 gtttactcgt aatggtgcaa gatatcaggt aagctactct tacaaaattt ggtaaaaaat 9961 ttgggtgcaa tatcgaatat ggaaaatcaa gaatcgaaaa aagatcctga aagtcaagta 10021 acatctgaaa cagatgatgt taagaaagat attgatctaa aaaagaaaga attaaaaaca 10081 actactgaaa atgaagttgt agatgatacg gaaggtaatt cgcgttctcc gggaaaaaca 10141 gacagaggtt tatagcctaa taccgtttag tatgtagaga cgttgcatac aatgtctcta 10201 cttctcattt gttgaggagc gcgatctgcc tcacgctcat gcgctcgcgc tttgcgcgtg 10261 aggcaagcga agcgatcgct cttaaaagta agaactagtt gtcgcagagg cggcgatcgc 10321 tttgactcac attagggtta ttttttagat agttctgcac ccaattgcaa gcaggctgca 10381 ttagcttatc cagcgaaaaa tcttctaaat tccacaaaat cacttttcct gcaccagcag 10441 acacaatttt cttgccgttg aggctgaaaa caatgctcct aatccaattg cgatctccat 10501 gtccataaag agtttgcagt tcttttccct ctagagtcca aagtttgaca gttgtgtctt 10561 cacttgctga ggcgagcatc tgaccgtcag ggctgaatac aacgctagtg acccaagttg 10621 tatgcccagt tagggtttta taagcttgag tttcaaattg acctgtaaag cctcgcttcc 10681 acagtttgat gcttccgtcg aaacttgccg aggcgagcac ttgaccgtta gggctaaata 10741 caacactgct aatgtcacct ttatgtccgg tgagggttcg tagtacctga ctccgatcaa 10801 gattccagag tttgactgta ttgtcatcac tagcagtagc aattgtttga ccatcagggc 10861 taaaagctac gctgaaaact ttaccgttat gcccggtcag ggttttataa ggttgaatta 10921 caaattcacc tgtgctgtcg tcccgcttcc agagtttaac agttttgtcg tcactagcag 10981 tagcaattgt tttactatcg gggctgaaag cgacgctttt gaccccacca tcatgcccgg 11041 taagagtttg caattcttca ccctcaatgc tccagagttt aacagttttg tcgtcactag 11101 cagtagcaat tgttttacca tcggggctga aagcaacact ccagacccaa ttgcgatgcc 11161 caataagggt tttgtaaggt tgagtttcaa attgacctgt gctgtcccat ttccacagtt 11221 taacggttcc atcgatactt gccgaagcga gcatttcacc attagggctg atagcaattc 11281 
tccagattgc ccgctcatgg acattactaa gggttggtag ttttttgccc tcaaatctcc 11341 aaagcttgac tgtacggtca gttgaggagg ctaaagttag ggtctttttg tcagggctaa 11401 aagcgacgct ggaaacccaa cctgtatgcc cagtcagagt ttgcagtacc tgaccttgaa 11461 gattcacaag tataactgtt ttgtcctcag ttgctaaagc aagcatctgc ccaaacccct 11521 gcggggacgc tacgcgaacg gggctaaaag caacgctacg aattgaacgg atatatacgc 11581 cgggggttat gtcatcaggt tgagtttcaa attgacctgt gctatttcgc ttccagagtt 11641 tgatagttcc gtttaaattt gccgaagcca ccagcattag accatcgggg ctgaaagcta 11701 cgcttgttac ctcgccatcg tgtccggtaa gagtttgcag ttcttgaccc tcaatgctcc 11761 agagtttgac ggtgctgtct tcactagcag tggcaattgt ctgaccatca ggactgaaag 11821 ctacgctttt gacaccagca tcatgtccgg taagagtttg ctcaggttga gtgttaaatt 11881 gacctgtgct atttcgtttc cagagtttga cggttttgtc atcactcgca gtggcgattg 11941 tttgaccatc aggactgaaa gctacgcttt tgacaccacc actgtgtctg gtaagagttt 12001 gctcaggttg agtgttaaat tgacctgtgt tatttcgttt ccagagtttg acggttttgt 12061 catcactcgc agtggcgatt gtctgactat cgggactgaa agctacgctt ttgacctcag 12121 catcatgtcc agtgagggtt tgtaatttta taccctcaag gctccagatt gttacagatt 12181 tgttagtact acctacggca attgtttgac catcagggct gaacaccacg cttgctaaaa 12241 gatcgtaatc ttcttccaag cggttgaact ctttcactct ctctgtataa accacctttt 12301 gcagcgttcc tctcactcgt gctaattctg catctggctt gaaaagtcca aagggtaaaa 12361 gtttatcgag agcctttcct gctcgcaaag catcaattac cgcctcgaaa tcttgattag 12421 aacgccactc tgcgtctgag gcttgctcaa aagccttaat ttgttctatc tgcgcgtttc 12481 tctgaccaat cgtagcaaat cccgagaaaa ctgttaaagc tagcagagga agtaagccaa 12541 aactacttaa tcttaatcga ttattccatc tatactcaat acttttttga atcaactcac 12601 tagccaaagc tgatagtgca aagagtgaag cttgttcctt ttgaaatgct ttagcttcgt 12661 ttaactgttt acccgcaagc agataatctt ttgactttcc cttatctctc cattcagtag 12721 cggctgcttc aattttgcgg tgttgtatca atttatctcg attgatcgag atccattttc 12781 gtaagcgctt ccattcacga attaatactt cgtgaacaac ttctactgtt tcttctccag 12841 atttatcatt acgtcctgtt actactaaac gtgcttcaga actcgctaaa tgagtaacta 12901 attcccaatt tccactccca atttcctgat gagtagcaac acgcctagta 
tcttttgttg 12961 cttcacctaa tcgcactaat gccagaaaaa ctcgttgtgt ctgttgttgt tgggactgac 13021 ttagtctgga ataaacttgt tcagcgtgtt taactagtgc ttgtttgaga ccaccaattt 13081 cgtcatatgc tctatgagtc atttcaccat ttttctggtt atcccacaac tgtgttaaag 13141 caaattctag taatggtaaa ttaccgggtt cattcccaat atcatctagt attctttcag 13201 ctaaatacgt ttgtaatttt acatttaact tttgagcagg ttgttcaata gctgcctgta 13261 attcttctcg acgcattaag cccaatggtg tgtgtttgaa ttgctgcaat gcctcacaaa 13321 aaggaagata agagaggaca tagccgtaga aatcggctcg taatgtaaaa actaatgtta 13381 tgcttttctg cgaaacagta gccaataaag tatcaatgaa gcgctgctgt tcttgcttat 13441 cttgacattg agtgtaaagt tcttcaaact gatctacaac taacaaaaag cgtttattag 13501 atttattttc tagaatttga gacgctacct gatgcagctt aacttcgcct ttttttagac 13561 tttctgccaa cttgctaacc ttgatgactt tgtctacacc atcaagattt ggttctaatt 13621 gacgtactaa agcaagagca agttcatcaa atggttggct tttgggacgc aatgattcaa 13681 ttaaccaaat acctttttct cgcaattttg gaattaaccc agcaaacaca actgaagatt 13741 ttccacttcc acttgcacca atgactgcaa ccattggttg ttggttagtc atcctaaaca 13801 attcatcagt aaatttctct cgtccaaaaa agaacgcagt atctttctca ccgaaagcag 13861 ataaaccaag gtatggattg ggaggaatat cacctgcttc aaactgaata tcagcagggt 13921 gaattgatgt tgttttcaat acgggaattt ggtcttcttt taatcttatc agctctgtac 13981 ggcctaaatt aaaagcaaac tctactgttt ctccagcacc taaagcatcg tagaatgcca 14041 cagcaaaagc gatcgcagct ttatccccaa tagtctgatt catgccaacg acatactgaa 14101 tgtactggct gatagcttct gcttgtacgg atgagtagca ggcatttaaa acgacgcact 14161 caaccccctt aactgcaaat agcttaaaca gttgtgaaag tgtttcctga tctacaagtg 14221 ttggttgccc cgtatcatcc tctagtacaa ttccattaac tcctgtacca tgcccacaga 14281 aatgcactat ttgcggctgg tgttctagaa ttgccaggta aaagtctcgc gagcgcactg 14341 cccctttctg tactaaccta aactgtccac cctttggagc atgttgtaat gcttcttcaa 14401 ttgagcgcac ttctgcatct agtcttagtc ttgcggtact cgcaggattt gctgccagaa 14461 tcaaaattgt tttgacacct gtggtgttac tcatggcagc cttcaaattg cataagtagg 14521 atttctacat attctaatag gtgcaaccaa tattagctta tgcaccacgc gatctgactt 14581 tatgaatgtc gtcagccaag gaaatactcg 
catctggtgt gggcaccaaa gttgtctgtt 14641 gagggcgcac agttgtgatg gtcatttgac tgcttaccca gtcgtcacat aattcttctt 14701 cctctggttt ggctctgatg tcacgagatg gaatccggtt catcaaacga ttcaaggttc 14761 cctgaccacg acggggtgct ggtactgagc gagaaggtaa atcaagttcg ctttgttgaa 14821 gcaaggtagc atcaaattcg gcggtactga caatcagttt gagaatatct ttaaattcgg 14881 taattccttg ttcccgccat agtttttctg gaactgtggc agaaatcgag ttacctccta 14941 atgcccaagc ttcttgtcct ggttgtagcc agagtcctcc tccgtcaagt aaatcagcac 15001 tgacggcgta gcggtctgtt aaatccaaca gagagcagta cagtggttca tgactggtgt 15061 ttttcagctt aactctaaaa gagggcgact tccacttgtt attttcctgc cgatattcca 15121 agcgcagttg cacatcttgg agttcttgtg cgttttgttc aatgaccatt tggactgcat 15181 caggttgaat gcgactgctg gcgggactgg agagttcaac gatatttgtc caacgagcaa 15241 tatgttctaa tcgctgaatg acttgtaaag ctgttgcggt tgtgtaaccc tgaatttcag 15301 caactaacgg acggtcatca gcgggtcgtg ttatgaaata tttaccatca cgagcaacca 15361 gttggaactc ggtttcttca agggtggtga cttcgcgaat atataaggaa ggttgacttt 15421 gaggaccagc ttgcaagagt gcagtgcgag caagttgtac ccccgcttct tctcctgtga 15481 tcaacactcc tttgggtggt aagggtaaac tggtgacaac tgctttaaaa gttgtgtcag 15541 ggtttaagtt ttctataccg ctgatctgaa ttttactaac ttgtggtagg acttgcagga 15601 cttttgcttg agcgatcgcc cctgacaatt gacgcagttg ttcaaaagaa ctatcaaagg 15661 gaaacaacgc tagtaaggtt gtctcatctc caacaggttg aggaattccg tgaactgcac 15721 cgccatcaat tacccaaccg tggtctttgt gatcacaaac agtgaagtag ggagtatgtt 15781 cggcgatcgc acccccaaga aaaggttggt ctaagccccc cacaactgtt gcttcgattt 15841 gcggagattg tgctgtcact ttactgcgaa ccaaagcatt ggtacgtttg attgacactc 15901 ccctgcctaa aggcgagggg attctacatt catcgtcaga acttgc // LOCUS NODE_2116_length_15839_cov_30.236062 15839 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 15839) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15839) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15839 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 40..186 /locus_tag="DP116_18165" CDS 40..186 /locus_tag="DP116_18165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015224069.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_18165" /translation="MSPTITFEATDEEKETLKAYCEQEGRTQTDILRSYIRSLKRKIR ADGT" gene complement(619..864) /locus_tag="DP116_18170" CDS complement(619..864) /locus_tag="DP116_18170" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18170" /translation="MTLQLLEFQVINVFINTGKFFFKYMARATETERINVNIYIDKAV AHRLHEFARQQAMWKGRLVEAAIIEYLNKIEPRAGTT" gene 888..1241 /locus_tag="DP116_18175" CDS 888..1241 /locus_tag="DP116_18175" /inference="COORDINATES: protein motif:HMM:PF01381.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" /protein_id="PRJNA477356:DP116_18175" /translation="MIMDGDARNKLAQAIRGARGERSQRRFAKDLGVSYVTIQLWERG EVIPDLGNLEAIATSRGQTLEQLLAEIRGQAPEVTHKPKVAEDVIPTARQLSKKECVR LIKLLVDEVSGGFQS" gene complement(1232..1474) /locus_tag="DP116_18180" CDS complement(1232..1474) /locus_tag="DP116_18180" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18180" /translation="MQIKNSQGAEIYNPFQELFSSERRRTLRECVFVFVDTAFTAGYT QKEVLEAFTDWFFQNDEDKFEEVVKCLEQIVQSSYD" gene 2542..2964 /locus_tag="DP116_18185" CDS 2542..2964 /locus_tag="DP116_18185" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18185" /translation="MKQTIYIVTGNTEIGDSSPTPLSQRVVVVIGRGGGLTRKNLAAL MQRVPVCTDESTNSYVARFLGDNASGGTLYRPKIHLITSLRVGEKLLFVLAKSVASAV ISSNNCSFVLFNCENFSLIVRVTRFKSTNLLNTYILPT" gene 3572..5101 /locus_tag="DP116_18190" CDS 3572..5101 /locus_tag="DP116_18190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740855.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="monoamine oxidase" /protein_id="PRJNA477356:DP116_18190" /translation="MDTKILLEQCQLVNFQKGKKITILGAGIAGLVAAYELERLGHEV EILEGSPRIGGRVWTHRFGNSPDAPYAELGAMRIPSEHEMTLHYVHEMGLSDKLCKFM TVFEESNAMMNINGQVLQMKDAPRVLQQTEGGIFSDTRYSEKTRLFAAWLKTIINTIA PGNLRSEFERDLQSHLMDELERLDLNPYFSEDGETIDLNSFLTQNPSFRAKCSQGLDI FLGDIITETSHDLLQLKGGMDQLIQRLAASITGEIKCNSEVVALRVQFDHVQITYKEN GQLHTRRCDYVLCTIPFSVLRKMELSGFDDDKLDSIHNTVYCPGTKVAFHARESFWEK NGIKGGASFSGEGVRQTYYPSVKFNPERGSVMLASYTIGDDAQRMGMMSEQERFDYVQ NTVSKIHPELNEPGMILDKASIAWGNYKWSAGGCTIHWDEADGSASYLKAQRPQNTLF FAGEHCSRFPAWLQGSIESAVEAVYDIVKHKPALQSTAIPVAVTVSGKNRQLAAVGSA W" gene 5593..6675 /gene="psbA" /locus_tag="DP116_18195" CDS 5593..6675 /gene="psbA" /locus_tag="DP116_18195" /EC_number="1.10.3.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997721.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II q(b) protein" /protein_id="PRJNA477356:DP116_18195" /translation="MTTALQRRESANVWERFCEWITSTENRLYIGWFGVLMIPTLLSA ITCFIIAFIAAPPVDIDGIREPVAGSLMYGNNIISGAVVPSSNAIGLHFYPIWEAASL DEWLYNGGPYQLVVFHFLIGVFCYLGREWELSYRLGMRPWICVAFSAPVAAATAVFLI YPLGQGSFSDGMPLGISGTFNFMLVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSL VTSSLVRETTETESQNYGYKFGQEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLA AWPVIGIWFTSLGISTMAFNLNGFNFNQSLIDSQGRVIGSWADVLNRANLGMEVMHER NAHNFPLDLASAEAAPVALSAPAING" gene complement(6932..7585) /locus_tag="DP116_18200" CDS complement(6932..7585) /locus_tag="DP116_18200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316387.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18200" /translation="MESLSDLFHYMQLNSGIRSPSYSATEQTIEQTIEQTIETRSPSN VEQSTILRFVEEQNHSCSSTQPLTHEDTLSAPASEPVEKPPQEDVREICNQLRQIPCA TAFRLNQEIIAVINKFWRNVPGALAYLKEALRTWKRVDSPEAVFVAACKNGRKPENWG KPLPSYPQPSDEDLAQLAEAKSTRRIKDYYCQPDGLWVVDTGSECVNLCDFLSTAGF" gene 7590..7841 /locus_tag="DP116_18205" /pseudo CDS 7590..7841 /locus_tag="DP116_18205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008187560.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS607 family transposase" gene complement(8336..11518) /locus_tag="DP116_18210" CDS complement(8336..11518) /locus_tag="DP116_18210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861360.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional DNA primase/helicase" /protein_id="PRJNA477356:DP116_18210" /translation="MNSCIFPENQEQLQDQHQLTERHQQEWVVDSAVSPALTALNVRS LTGTVVYEYLVYALPQTARRNDGRLRDKYLYQYAHACHGGWWVSGLDPQNNWEPMEWG RFKPDYPRWGWDKITQKQTDKQVKYESPINTPNRVTYLRVPVEIWEMVAHRYNVPMPE EIVTTVDGEAIGFWAWVVNHPQIPIILTEGEKKAACLLSLGFVAISLPGIWNGRVGKR DFNEKLHPDLMPVAKPGRKFIVLFDYETKPKTRWAVFQATIRTAQAIEAVGCFCEVAL LPGPEKGVDDFVVSIGNALAQENESNLEELDLPDFSSHSPSERANTLLTGIIEDAKAF RDYQRSFHCRSRRLSKKYKPHVDVNVKYLSEAVRLPQSGFVVLSSGMGTGKTEIMRRW RDEHPHEQFLNNGHRVNLLKNLAQRLNTQMYSDLRYGELTKATALSITIDSLHKLNTQ ALIYGCVFIDEACQYLTHLLHSKTCKQHRATILEVLEYIVYNAQLVVIADAHMDDVTV DFFRAMRPKDEEPFIIKNQWKNGSRLIYWYEGDDSSALVAQISAALMVGQKIMVASDS KRFIKKLEKSLNVSVRVDDGSAQDSYRQLKVWSIHSDNSGSEENVAFIKDITNAVKNV DALLTSPSLGTGVDLPDYHFDVVFGAFHGISQTATECAQHLHRYRPKVPMHVWVAPRP PFGYAETNATKIKEQLLQTNEMTAFLLRIDRETGKRGAEKDWALQAYCDIQAQRNESM NRLRADLLDLLTEMGNKIIPMGAERNELAQKRLKDAAVALDTAYYSAVAGAKDISASE YRKRMRKDYLKPEEIYECEKFRIQEAYGMEVTPSLVEKDNGGKLIRAISNLEAILAES DGVIEDTGTGRIYPAPPEFVAEKDRQERNKLPLCMDWGNYSARWVLLSSLGLPEILKR LIDGEEVTAHSPELLRMVEIAKQCAAHVKAILGFTIPSKCQPIWLLALLLDQLGLKLT SRKQGARGQQVKFYFLSQEELEFAKVVIDYRERKPQPEVKYKNNPTGSFEATRGAKCI FPYPYDTKSRNLNKPQFSSLTETAPDPEEL" gene complement(12799..13794) /locus_tag="DP116_18215" CDS complement(12799..13794) /locus_tag="DP116_18215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="integrase" /protein_id="PRJNA477356:DP116_18215" /translation="MSDSPTTTSALKNPLALNAPPPLTEHPTAVYLSGLAPGSRPAMR QALDTIAGVLTNGSCDAMTLDWTALRYKHTALVRTILMEKYAPATANKMLCAMRRVLK EALRLELIDAKDFARAVDIKSVQVCSELQGRALASKEIADLMQVCFDDPTPGGFRDAA LIAILRGSGLRRREVVNLNLNDFDKSTGAIKVWGGKGGKNRTVYLPNAAIEVVQDWLG IRGEESGSLLCHVNKAGCVVLRRLTPQAVLFILQKRGEQAGVGHFSAHDFRRTFISEL LDSGVDISTVQRLAGHASPDLTARYDRRGEQTLRRAVQTLSIPGSRTKTERDSKN" gene complement(14106..14288) /locus_tag="DP116_18220" CDS complement(14106..14288) /locus_tag="DP116_18220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006104704.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18220" /translation="MPFEKNNPHRYTRKLKRPLGKMIGFRGYEGQSEQLKTVPNWQER LRQFVDQLICDLPKNE" gene complement(14500..14676) /locus_tag="DP116_18225" CDS complement(14500..14676) /locus_tag="DP116_18225" /inference="COORDINATES: protein motif:HMM:PF01797.14" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18225" /translation="MTSLKTRPSRYLRKEFPDRVNKFYHKDVLWNGSYFIASCGGVTV EMLKKYVESQNKPD" gene 14711..15516 /locus_tag="DP116_18230" /pseudo CDS 14711..15516 /locus_tag="DP116_18230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_076611868.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS5 family transposase" gene 15536..15748 /locus_tag="DP116_18235" /pseudo CDS 15536..15748 /locus_tag="DP116_18235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015170172.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" BASE COUNT 4173 a 3747 c 3537 g 4382 t ORIGIN 1 aggcatgtgg cgtccctggg ctaaagccac agggttttca tctcacccac tataacattt 61 gaggcaacgg acgaggaaaa agaaaccctc aaagcttatt gcgaacagga gggtagaacg 121 caaactgaca tacttagaag ctatatccgg agcctcaagc gaaagattag ggctgacggc 181 acttaaaagt gcatcgcatt cctgagggca gttccgtggg cgggtttccc gacttgagga 241 aactgcccgt ccccacaccg cgtattatat gggctttctg ctccaatact gtaagggacg 301 cggaaagtgc gcgttcagat aaagtgctcc tgcgcctttg ctattgcacc ctctgatttt 361 tggtcacgca ttgtccactt cgtactgaac cacacgagtg caatgtttat ctggaagacg 421 caagtcgtgt attcgttgta atgtgcaagt tcaattaaag aagtgtaaga ccccgtcaat 481 gctgtgagga ggtgccttct tgttcgcgcc gctgtacttt gaatccgtca gtggggaaca 541 acaaacactc actagttaag gttttgcaca atcgctcaaa aatcatttgg ttgtgactcc 601 ttgtgtgaca aatagcagtc atgtcgtgcc tgctctaggc tcgattttgt tgagatattc 661 aataattgct gcttcgacaa gccgaccttt ccacattgcc tgttgccgag cgaactcgtg 721 taacctgtgg gcgacagctt tgtctatgta tatgtttacg ttgattcttt ctgtttccgt 781 cgctctagcc atatacttga aaaaaaactt accagtattg ataaatacat ttataacttg 841 aaattctagt aactgcaacg tcatgataga aatttgttac tcagtagatg attatggatg 901 gtgacgcccg aaacaagtta gcacaagcta ttaggggagc taggggagag agaagccaga 961 ggagatttgc aaaggactta ggtgtaagct atgtgactat acaactctgg gagcgcggag 1021 aagttatccc agatttgggt aatttggagg cgattgccac ctctcgcgga caaactcttg 1081 agcagttgct tgccgaaatt agaggacaag cacctgaagt aacccataaa ccaaaagtgg 1141 cggaggatgt tatccctacc gccagacagt tatccaaaaa ggagtgtgtt cgtttgatta 1201 agcttttggt tgatgaggta agtggagggt ttcagtcgta gctactttgt acaatttgct 1261 caagacattt 
gacgacttct tcaaatttat cctcatcatt ctgaaaaaac caatctgtga 1321 acgcttccaa cacttctttt tgtgtgtagc ctgctgtaaa agcagtatct acaaacacga 1381 aaacgcactc cctaagagtg cgcctgcgct cgctggaaaa taattcctga aaaggattgt 1441 agatttcggc accctgagaa tttttgattt gcatcgtctc ttagtctgct gtactgttga 1501 cactcccacg gctatgtgaa tatcgcgcta cgcgctcatt cccacggcta gccgtgggat 1561 tcttggttca acgagcaacc ttgacgctcc cacggctacc acatttttgc tccgagacaa 1621 tgtaacataa agccgtggga ttctggcata agcgtcccca atccccgtgg attagttttt 1681 catccagttt ggggctgcgc caaacagccc attctttgaa tcggacaaag ctattgcagc 1741 acataagttc tttaccacta aaaaacttgc caaacccgtt tttgggagct actaccgcaa 1801 ggttgttctg tcctctatca cagccaatcc tgttagttgc atcaacttga ggaacttcct 1861 cggttacagc aaactaagac ataccactta ttccgatgtt tgatgatttt ggcactgcct 1921 ctttcaatcg taccgtcaaa aattccttcc aaaactggtt gccagtgctt agacgcaact 1981 tcaacaggaa cacgttttgt tcctttgatg gtcggaaaac taacgccctc tcgtagttcc 2041 cactttgtgc aatgcccagt tctgattatt tacctctggc ggcagtcgct tgaaggattt 2101 cgcctttttc ccagcatcag atgtggtgtg tcgaatgact tgattagaca gcgcagacat 2161 gagacgagtt tgaaccttag aagttgttaa ctttcgcctc tggtggtaga ggaatagtta 2221 acaaccaatt aggtgacata ctcccgcact gacagcaagc tgtacagtgc aggcttctca 2281 gccaatccag ctattgctta ggaggcgctg agctttgttt tgagtccacg gaatgcccta 2341 ccgcagactt tgaatttctt accttgtacc ctacaagttt tactgccctg gacgagatgc 2401 tggcttattt gcgctatcgg acaaaccaat cggtaacttg tattaatcaa gatgtgccta 2461 cttactttaa tgctgtgcaa cggagacgct taccaataat cgcaatgcta ctggacagag 2521 gcttatcaga caccaatttg ggtgaagcaa accatctaca ttgtaacagg aaacacagag 2581 attggcgatt cgtcgcccac cccactctct caacgagtag tagtcgttat tggtcggggc 2641 gggggtctaa cacgaaaaaa tctggcagcc ttaatgcaac gagtacccgt atgtaccgac 2701 gaatcaacaa acagctatgt tgccagattt ttaggggaca atgcgtcggg tgggactctt 2761 tatcgcccaa aaattcatct catcacctcc cttcgggtag gtgagaagct tctttttgtt 2821 ttagctaagt ctgttgcctc agcagtcatt tcttcaaaca actgttcctt tgttttattc 2881 aattgcgaaa actttagctt aattgttcgc gttactcgtt tcaagtccac gaacctcctt 2941 aatacttata ttttacccac 
atgagtaaga caagatatca agttaacgca agctttcgtt 3001 aagaaattga cgggctagcg catttgccac gcattcatcc cacccctaaa agaggatggg 3061 ctttctgctc tcgacactgt aaaattggaa ttatttattt gtcaaggttc ggttattttt 3121 taactaatgg gcgagtacac tctcacgttg caagctaatt cttgtaagcc tacacgctcg 3181 ccctcactac tgtttcggta gttgaccgaa acataaggta gagggcagcg tggactgaaa 3241 gaagatgtag tcgtcgcacg tttacgaacg cgagttcgtg cgctttacgc tcactcgccc 3301 gccccactgt ttgccaatag cattcggaaa aaacgctttg cgatatccgt aaggattagt 3361 gggcacttca cccaaaagca aataactcac tataagcggt tgtttcgtaa tgggtatgag 3421 gtgtcacacc cttcaccgaa agctgtgaaa tttttacagc cggggaatga gttaacagct 3481 attgagtgca taaaaatctc tagtccccct tatgaaatat cagaagcggt tgaaacccgc 3541 tcaacacaaa cgactaaggg aagctggaac aatggacacc aaaattttac ttgagcaatg 3601 ccaactggta aactttcaaa aaggtaaaaa aatcacaatt ctcggagcag gtatcgcagg 3661 tttagtagca gcatatgaac tcgaacgcct aggtcatgaa gttgaaattc tagaaggtag 3721 cccacgcatc ggtggtcgag tatggacaca ccgcttcggt aattccccag atgcacccta 3781 cgcagaactc ggagcaatgc gtatccccag cgaacatgaa atgacactgc attacgtaca 3841 tgaaatggga ctgtctgaca aactgtgcaa gttcatgaca gtgttcgagg aaagcaacgc 3901 catgatgaac atcaacgggc aagtgcttca gatgaaagac gcaccccgtg tcctgcaaca 3961 aacagaaggt ggtatctttt ctgacacacg ctacagtgaa aaaacccgct tgtttgctgc 4021 ttggctcaaa accatcatca atactatcgc tcctggtaat ctccgttctg aatttgaacg 4081 cgacttgcaa tctcacttaa tggatgaatt agagcgctta gatttaaatc cctacttcag 4141 cgaagatggc gaaacgattg atttgaactc cttcctaact caaaacccaa gcttccgggc 4201 aaagtgctct caaggattgg atattttcct gggtgacatc atcacagaaa ccagccacga 4261 cttgttgcaa ctcaaaggtg ggatggatca actcatccaa cgtttagcgg cttcaataac 4321 tggtgaaatc aagtgtaact ctgaagttgt ggcgctgcgc gtgcaatttg accacgtcca 4381 aatcacctac aaagaaaacg gtcagttaca tacacgtcgt tgtgactatg tattatgcac 4441 catccccttc agtgtactcc gtaagatgga gttgagcggc tttgatgatg ataagctcga 4501 ttccattcac aacaccgtct actgtcccgg taccaaggtg gcgtttcacg ctcgcgagtc 4561 cttctgggaa aagaatggca tcaaaggtgg tgcttccttc agtggtgagg gcgtgcgtca 4621 aacatactac ccaagcgtga agttcaaccc 
cgaacgtggt agcgtgatgt tggcgagcta 4681 caccatcggt gacgatgctc aacggatggg catgatgtcc gaacaagagc gctttgacta 4741 cgtccaaaac actgtcagca agattcaccc cgaactgaat gaacctggta tgattttaga 4801 caaagcgtcc atcgcctggg gcaactacaa gtggagtgct ggcggatgca cgattcactg 4861 ggatgaggct gatggatctg cgagttacct caaagcccaa agaccccaaa acaccctgtt 4921 ctttgctggt gaacactgct ctcgcttccc cgcatggtta caaggttcga ttgagtctgc 4981 tgttgaagcg gtctacgata ttgttaagca taagcctgct ctacaatcca ctgctattcc 5041 tgttgctgtg actgtctctg gtaaaaatcg tcaattggca gcagttggaa gtgcttggta 5101 gtcgagcacg agcacgtcaa tattaaaaaa tttaacaatt aactgttcat ttttaattca 5161 gctaaaccct ttttgttatg cctcgcgcgt gcgggctttt tttattgagt tccaaggtcc 5221 ttaaaggggg aaacaaaaat ccccacacat ctctacattc ggatgcgtga gggaatgggg 5281 aacaagaggg atagaaaaag tgacttttga atctcagatt ttagttttcc catcgggaat 5341 tttttcgttg accatgaaac gcaggatgcc acaaaaaacc cccaagtccc tatcgacaaa 5401 cgcaggatta gatgatgggg cgtggggggg gaaatttttc aactcccaac atccataagc 5461 aaagctgata gctgataaaa gaaaacgttc aggaaaaccc cttgattttg taaacgaatg 5521 taaactatta tgaaaataag caaacaaaca tttgcttaca tacatactta aacaactgca 5581 atcataagaa ccatgacaac tgcattacaa cgtcgcgaaa gcgccaatgt atgggagcgg 5641 ttttgcgagt ggatcacctc caccgaaaat cgtctctaca tcggttggtt cggcgtcctg 5701 atgattccta ccctgctatc cgcaataacc tgtttcatca tcgccttcat cgcagccccc 5761 cccgtagaca tcgacggaat ccgcgaacca gtagcaggtt ccttaatgta cggtaacaac 5821 atcatctctg gtgctgttgt tccttcctcc aacgccatcg gcttacactt ctaccccatc 5881 tgggaagcag cttccttaga tgagtggttg tacaacggtg gtccatacca attagttgtt 5941 ttccacttcc tgattggtgt attctgctac ctgggtcgtg agtgggaatt atcctaccgc 6001 ttaggaatgc gtccttggat ttgcgttgca ttcagtgcac ctgttgcagc agcaaccgca 6061 gtcttcttga tttaccccct cggacaaggt tccttctctg atggtatgcc cttgggcatc 6121 tctggaacat tcaacttcat gttagtgttc caagcagagc acaacatcct aatgcacccc 6181 ttccacatgc tgggtgttgc tggtgtcttc ggtggttcac tgttcagtgc aatgcacggt 6241 tctttggtca cttcttcctt ggttcgtgaa acaaccgaaa ccgaatctca aaactacggt 6301 tacaagttcg gtcaagaaga agaaacctac aacatcgttg 
ctgctcacgg ctactttgga 6361 agacttattt tccaatatgc ttctttcaac aacagccgca gcttgcactt cttcctggct 6421 gcatggcctg tcatcggtat ctggttcacc tcactgggta tcagcaccat ggcgttcaac 6481 ctcaacggtt tcaacttcaa ccaatcgttg attgattctc aaggtcgcgt gattggttct 6541 tgggcagatg tgctcaaccg tgcgaacctg ggtatggaag taatgcacga acgtaatgct 6601 cacaacttcc ctcttgattt ggcatcagct gaagctgctc ctgtagcact ctctgctcct 6661 gctatcaacg gataatatct aagatttaga tcagtaaaaa gcgctctcct gaaaaggggg 6721 gcgctttttg catgctttca acttggtatc catgtacctc tcctagtggt tttttacata 6781 caattcgggg tggggatgaa tacggcggtt ctcatttgaa tcacaccacc acaaataatg 6841 tagagcacgt aaatgctacg tctcagagcg cgagtgtgta ttgcacccaa caccagaagc 6901 ggaaacgagc gtcagtttca accgacttaa gctagaaacc agccgtagat aaaaaatcac 6961 acaagttaac acattcactc cccgtgtcca ccacccatag cccatccggc tgacaataat 7021 agtccttaat tcgcctagtt gatttcgctt ctgccaattg tgccaagtcc tcatcacttg 7081 gttgaggata actcggtaat ggtttccccc aattctctgg ttttctgccg ttcttgcacg 7141 ccgctacaaa tactgcttcc ggggaatcaa ctcgcttcca cgtacgtaaa gcttctttaa 7201 ggtaggctaa agctcctggt acatttcgcc aaaacttgtt gataaccgct attatctcct 7261 gattgaggcg aaaggctgtt gcacatggga tttgacgcag ttggttacaa atttctctga 7321 catcttcttg aggaggtttt tctacaggtt ccgaggcggg ggcggaaagt gtgtcctcat 7381 gagtcaatgg ctgtgttgat gagcagctat gattttgttc ttcaacgaag cgcagaatcg 7441 tggattgttc tacattgctt ggcgaacgag tctctatcgt ttgttcaatc gtttgttcaa 7501 tcgtttgttc agtcgccgaa taactaggcg accgaatacc ggaatttaac tgcatataat 7561 ggaacaaatc ggacaaagat tccattatta tgtacttgac accagccgaa gcgcaaaaaa 7621 gatatggtta ccacccgaag accttgacta gatgggcaga tgagggaaag attcaatata 7681 tcaaatcacc gggtggacat aggcggtatt tgattgaatc tattgaaaag ctggttgata 7741 gagttgacca gcgacccatt attttatatg cacgagtttc tactacctcc cagaaagatg 7801 acttggcgtc acaaattgaa tacttgggga agaattaccc gtcacatcct tgggtaatgt 7861 ctccgaattt tcttccacaa aatctttatc cgcagcagca tgttgttgtg tagagaattc 7921 ttctgtagag aagtctttga tatgatttgc tcgatctgag cagatcgatg tgtcagaatt 7981 gctcacatcg atttgctgat tttgacaaat cctgtcccac aaggctttca 
cgttttcaag 8041 gttgattgtg taccagtttg cctgatacca ggtgtgttga ctatggcgat gaacattaat 8101 gagttctagt tgtctgagtt taccaatggc tctcctgata gtagacatct taaagaaggg 8161 gagtttttct gcccaacctt ctaaggtgag gtaaaaccag cgttgtccat ctctgaggat 8221 gtgtttggaa ttctgagaaa agtaatgaat ttgttgcagg atgattgctg cttctagtcc 8281 aatttctctg gcgactagag gtggaatgag taaaggtttt tcaggtgtga tgagtttaca 8341 attcttctgg gtctggggct gtttctgtga gcgaactgaa ttgaggtttg ttgagatttc 8401 gggatttggt atcatatggg tatggaaaaa tgcattttgc ccctctagta gcttcaaaac 8461 ttccagtggg gttattttta tactttacct ccggttgagg cttgcgctcc cgataatcaa 8521 tcactacttt tgcaaattcc aattcctcct gggaaagaaa ataaaacttc acttgttgac 8581 cacgtgcacc ttgtttccta gaagtcaact ttaatcccaa ttggtctagc agaagtgcca 8641 acagccatat cggctgacac ttagaaggaa tagtaaaacc taaaatcgcc ttaacgtgcg 8701 ccgcacactg tttggcaatt tcgaccattc tcagcaactc aggcgagtga gcagtcactt 8761 cctccccatc tattaggcgc ttgagaatct caggtagtcc caaactggac aacaacaccc 8821 accgcgccga gtaattcccc cagtccatgc acaacggcag cttatttctc tcctggcggt 8881 ctttttctgc cacaaattct ggtggtgctg ggtatattct tccggtacct gtatcttcta 8941 ttacaccatc tgattctgct agaattgctt ctaaattgga aatcgcccga atcaatttcc 9001 caccattatc tttctcaacc agtgacggag tgacttccat accatacgct tcctgaatgc 9061 ggaatttttc gcattcataa atttcctctg gttttaggta gtctttacgc atacgcttgc 9121 ggtactcgct agctgaaata tctttcgccc ctgcaactgc tgagtaataa gcagtgtcca 9181 atgcaacagc ggcatctttc agacgtttct gagcaagctc atttctttct gcgcccatgg 9241 ggataatttt gttacccatt tcggtcaaca agtcaagcaa atcagctcgc aacctgttca 9301 ttgattcgtt tcgctgagcc tgtatatcac agtaagcttg caacgcccaa tctttttctg 9361 caccccgctt tcctgtttct ctgtcaatcc gcagcaggaa agctgtcatt tcattggttt 9421 gcagcaactg ctctttaatt ttcgtggcgt tagtttcagc atatccaaag ggagggcgcg 9481 gcgcgaccca aacgtgcatc ggtacttttg gacgatagcg gtgcagatgt tgagcgcatt 9541 cggtcgccgt ctgcgaaatc ccgtgaaatg ccccaaaaac cacgtcaaaa tggtaatcag 9601 gtaaatctac ccctgttcct aggctaggcg aggttaaaag ggcatctaca ttcttgacag 9661 cgttcgttat gtctttgatg aaagcaacgt tttcttcgct tccagaatta tcagagtgaa 
9721 tagaccacac ctttagttgt cggtagctgt cttgagctga accgtcatcc acccgcaccg 9781 acacatttaa cgatttttca agtttcttaa taaatctttt cgagtcagaa gccaccataa 9841 ttttctgccc caccatcagc gccgccgaaa tttgggcaac taaggcagaa gaatcatcac 9901 cctcatacca ataaatcagg cgtgaaccat tcttccactg atttttgata atgaacggtt 9961 cttcatcttt gggacgcatt gcccggaaaa agtccaccgt tacgtcgtcc atatgcgcgt 10021 cagcaataac gaccaattgc gcattataca cgatatactc cagcacttcc aaaatggtag 10081 cccgatgctg cttacaagtt ttgctgtgca gcaggtgagt caggtactgg caagcttcat 10141 caataaacac gcagccatag attagggctt gggtgttgag cttgtgtaag ctgtcaatag 10201 ttatgctgag ggcagtcgct ttcgttaact ctccataacg caaatctgag tacatttgcg 10261 tgttgaggcg ttgagccaaa tttttcagca aattcacccg atgcccgtta ttgaggaatt 10321 gctcatgggg atgctcatcg cgccaacgtc gcatgatttc ggttttaccc gtgcccatgc 10381 cactgcttaa tactacaaaa ccagactgcg gtagccgaac tgcttctgat aaatacttga 10441 catttacatc aacatgtggt ttgtatttct tgctcagccg tcggctacga cagtggaatg 10501 agcgctggta atcacgaaat gccttggcgt cttctatgat cccagtcagt agagtattcg 10561 ctcgctcact gggggaatga ctagaaaaat cgggtaaatc gagttcctca aggttcgact 10621 cgttttcttg tgccaatgcg ttgccaatcg aaaccacaaa gtcatcaaca cctttttctg 10681 gtcctggcag taatgcgacc tcacaaaagc agcccactgc ctcaattgct tgggcagtac 10741 gaatggtcgc ttggaacacc gcccagcggg ttttgggttt ggtttcgtag tcgaaaagga 10801 caatgaattt gcgccccggt ttcgccactg gcataaggtc gggatgcagc ttttcgttga 10861 aatccctttt gccaacgcgt ccgttccata tccccggtag ggagatcgcc acaaaaccca 10921 aacttaaaag acacgccgct tttttctcgc cttctgtgag gatgatgggg atttgggggt 10981 gattcaccac ccaagcccaa aaaccaatcg cttctccatc aacggttgtg acaatttcct 11041 ctggcattgg gacgttgtag cggtgagcaa ccatttccca aatctctact ggaacccgca 11101 ggtaagtcac acggttgggc gtgtttatcg gtgactcgta tttcacctgc ttgtcggtct 11161 gcttttgggt gattttgtcc catccccagc gtggataatc cggtttaaac cgtccccact 11221 ccattggttc ccaattgttt tggggatcaa gtcccgaaac ccaccatccc ccgtgacagg 11281 cgtgggcgta ctgatataag tacttgtccc gtaaccgccc gtcgttacgt ctggcggtct 11341 ggggtagggc gtagaccaaa tactcgtata cgactgtacc agtaagtgaa 
cgaacattca 11401 gggcggtgag tgctggtgaa acggctgaat caacgaccca ttcttgctga tggcgttctg 11461 ttagttgatg ctgatcttgg agttgttctt ggttttctgg aaaaatacaa ctgttcatga 11521 tacttccttt ggtggtgaag catctgactc tagattttct tccgatagtt aaagttgtta 11581 agcaagaaag tcgatctact ttgccgaaca ggagtgggat gaattgcatt tcccccaaaa 11641 gggaagtgca aaagctgtta actaagatgg cactgcgtaa atatacgtaa gtaagccaag 11701 attggtaact cgccctcaag attgtttccc aacctcccaa cggggcacac gcgaactact 11761 gcttaacaat ttgtctacaa gacgcagttt ctagccctac ggctttgtca caccaaaatt 11821 gacgcttaaa ctcttcctta actcaagact tccatctgct tccggaactt ttgtagccac 11881 aaatgtatca caatagatat agcgcgttac aacgcgaaac cattgtagga gtgccttccg 11941 cttccatgcc cagagaaccg ataacgacac tggcttctgg gaaaaggcaa aataaactga 12001 ttttttagac ccatccgagg ggggacgcct cttagatgta aaatcaaaat gcctgcggag 12061 acggtctggc ggggagtaag tcatggggct gctctagtta agagtctagg aaacaggaag 12121 aaaggtgtaa tagcacggca atagcgcgcg ctgtggaaca acttttaaga atcctttcgg 12181 ctttagccaa gggagtacgt caaggtggaa gaaaatgaag agccaaatgc ctttcgcaat 12241 ttcttaaaaa aaggcatatg attcttccac ccttgtgtgc taacatgtag tcaacaaggg 12301 tgggttgggt tgttaacaaa gccgtgatat taagcgagca accgtttggt actgtcacct 12361 ctcacagttc gagtaccgaa attggttaga tctatgtgtt atagaaactc acctaatact 12421 tctgacaatg tcataaccgc ttctcttcaa aacccagaca atcattgatg tcggttttga 12481 ttgacgtttc aacgggagca tgggagcagg tatccgctct acaacacttg gactaaagtt 12541 agacagaatg gaacactgtt agaccgtagc ccggattgct tcttgcttca tcctggcagt 12601 gtcggcactc accagtccac aagcttaaat tcggatagag cttatcctat ttcaaattgt 12661 atgttccgtt ttttattgac atgttgtcgt tagattaaca tttttctggt atttttcaaa 12721 gagaaattgc tgagaagttt tgctctctcg tttccatgct ccgctggcga gtaaacattg 12781 cttcagttgc atatcgcatc aattctttga gtcgcgctcc gtttttgttc tagaaccagg 12841 gatactcaag gtttggacag cacgacgtaa ggtttgttcg ccgcgtcggt catatcttgc 12901 agtcaaatct ggcgatgcat gaccagccag cctctggact gtagagatgt ctacaccaga 12961 atcgagcagt tcgcttataa aagttctgcg gaaatcatga gccgaaaaat gacctacccc 13021 cgcttgttca ccgcgttttt gcaaaatgaa 
cagcaccgcc tggggcgtta gccgtcgcag 13081 caccacgcag ccggctttgt tgacatggca cagcagggaa ccggactcct cacctcggat 13141 cccaagccag tcttgtacga cctcaattgc tgcattcggt aaatacaccg tgcggttttt 13201 ccctccttta ccaccccaga cttttatcgc accagtgctc ttgtcaaagt cattcaagtt 13261 taaattcacg acttcgcgcc gtctcaaccc cgacccgcgc agaatcgcta tcaacgccgc 13321 atccctaaaa cctccaggag tcgggtcatc aaagcacact tgcatcaaat cggcgatttc 13381 ttttgacgct aaagcgcgtc cctgtaactc actacatacc tgaacacttt tgatatcgac 13441 agcacgggca aaatctttgg cgtctatcaa ttccagcctt aaagcttctt ttaaaactct 13501 cctcatcgca cacagcattt tattcgctgt cgcaggtgca tacttttcca ttaaaatagt 13561 acgcacaagt gctgtatgct tataacgtag tgccgtccaa tccagagtca tggcatcaca 13621 actcccattg gttaacaccc cagcaattgt atctaaggct tgccgcatcg ccggacgcga 13681 ccccggtgcc agtccggata aatacacggc tgtgggatgt tcggttaaag gtggtggcgc 13741 attgagtgcc aaggggtttt tgagtgcact cgttgtcgtg ggtgaatcgc tcatctactt 13801 acttgaaacg aaattcaggg caggtaaagt ctggtaagcc atgaaatcat tctctcaccc 13861 tttggtgtat cacgatgatt cacgcgggag tatcaattag ttggtgctga gtaagctgtc 13921 cccgaggttc aagagccaat atattgggga agtccttttc caatgtcgtt ggatttctta 13981 gggctactga aaatttccgc tctagaagtt tgtggcgaaa gcctctggct agaagccaga 14041 ggtaagctta ccgaaaaaaa agggcttgaa actgtaactt aatttctgga caaagctacc 14101 cattactact catttttggg tagatcacag atgagttggt caacaaattg tcttagccgt 14161 tcttgccaat tggggacggt ttttaattgc tcactttgac cttcatagcc tctgaagcca 14221 atcatcttcc ctagaggtcg tttgagcttt cttgtatatc gatggggatt attcttttca 14281 aatggcattg acacgagggt tttatgcttg ttatcatgca taataacaca ttttagcttc 14341 ttcgccgtaa gccccgcacc ataactgtac tcagtttggt gacggcatat aaggtgggca 14401 agacgaattg acatccgacc tatgttgacc gtaggtcaac gcgggagtga tgtcgatgag 14461 gagttgccta tctccgtatt gaaaactagt aaccgaagtt taatccggtt tgttttgtga 14521 ctccacatat ttcttcaaca tttcaacagt tacgccgcca caggaagcaa taaagtacga 14581 tccgttccaa agtacgtcct tgtggtagaa cttattaact ctgtcaggaa attctttgcg 14641 tagataacga ctagggcgtg ttttcaaact cgttatatcc taagatatag gcgatcgcta 14701 tgttgcaata 
atgaaccgag gaaacttaag taacgaacag tgggaaaagt taaaattctt 14761 acttcctccc caaaaaccca gaacgggtaa gccaagtaaa gaccaccgga tgattattaa 14821 tggcatccta tggatactga ggactggggc accatggcga gacttgccag aacgttatgg 14881 accgtgggaa agtgtcgcta cacggtttta tcgatggcaa aaagccggaa tctggaacca 14941 aattctagaa cagctgcaag tgatggcaga ccaagaagaa aaattggatt ggcaggttca 15001 ctacgtagat ggaacagtaa tacgcgccca tcagcacgca gctggtggaa aaaaaggggt 15061 gaagaagaag agaaactagg tcgttctcgc gggggtttca gtacaaaaat acacattcgg 15121 tgtgagggga aaggtaaacc cataactttt attcttagtc caggtcagag aaacgagtca 15181 atttttctag aacaattgat ggaacaaggc tcagttaagc gttctgggag aggtcgtcca 15241 cgtttgcgcc ccttacgatt agttggagat aaaggttaca caggtcgtag aatacgtaat 15301 taccttcgcc gccgtggtat ccgcctcact atcccacgac tctcaaatga accacgacga 15361 ggtccgttta accgtgaaat ctaccgtcaa cgcaacgttg ttgagcgcgc tatcaaccga 15421 atcaaacaat ttcgtcgcat cgccacccga tatgaaaaac ttgctgccaa ttatacagcc 15481 atgataatga tcgcctccat tattctgtgg ttatagtttg aaaacacgcc ctagggtcat 15541 cgcagttaat cctgcataca catctcaatt gcttgcatat cgtgacgaaa tagtctttac 15601 tgactgcgat ctgcgttcat actgggacgc tgaacattcg cttaatgttg accgcgatat 15661 taacgcggga atcaacatta agcgcgttgg gctgggactt ttcccaacgt taaaacgccg 15721 taaagggaat ccggtagtga gtgactaaga gcaaactcct taaacctcgt caattcgagt 15781 gtaggggaga ctcgtaaaac agagtaaaac tccataaaac ggtgtaattt attgttgtc // LOCUS NODE_2125_length_15772_cov_5.11070815772 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15772) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15772) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15772 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..3693 /locus_tag="DP116_18240" CDS <1..3693 /locus_tag="DP116_18240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859144.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_18240" /translation="EGSFEKGFPVRVKIGEEGRPHYDDFSGRLPPASAVQVNYENWQT IYRNLPANWLIILPESQITNVSTPGDCNQAAQIFISSFNEWLNQPSVRKLERQFLRKV DDWENVRFILQTQDSLLQRLPWHLWDVFQENDHQSEIVVSPEYELFKKKLKTRQLNTP VKILAVLGYGQDINITQDLSALEKNLLGATIEALREPSSQELRKKLWEQSWDILFFAG HSCSKQGDSWGEIQINASESLSLENLRHSLRHAVQKGLKLAIFNSCDGLGLARNLADA RIPYTIVMREPVPDIVAQHFLEYFLTAFAAGESLYASVQQARARLQEQWENQYPCASW LPVIFQNPAAAELKYPQQHNWKKIALQTAIVISAVVGFGVISWRIIHEFQSRARFSDG DKILVKTFTTPYKQEGVQAFRQENYNLAISKFEQSLQQYRNDPETLIYLNNAKIGNQP ALIIGVPVPIGTNPNVAQEILRGVAQAQQEINNQGGINGKLLKVEIANDDNNPDIAVQ VADRFVQNQDILAVVGHNSSDASLPASEKYQAGKLVMISPTSNSIRLTDRIDHNNGNY IYRTVISFTTIADSLTEYAKTTGKTKILICNDSKGADQSSEQAFVRTMENKHLQQINN IQCDFAAKNFQPETIIKNAKEKGVDAILLNPQVDRIDRAIALAKANQGKITLLGNPSL QTLGTLDAGNALNGMVMAVPWHAGVSADKNFVQNANNLWREPDSITWRTATAFDATKA IAAALKQKGGTRSGVQQVLSGDFSLQGATGTIRFLYWGDRAGDRVGNAVLVEVKRNPK ASTGYSFEPKDSMQSRISLGDKILVQDNPSDEKQLGVQAFAAGNYDQAIAHFQASVQK MPNDPEARIYLQNADAARSGKILKIAVSVPIGSNLNVAKEILQGVAQAQDEINQKGGI RGNLLQVEIASDDNNPNIAEKLANSLVADQEILAVIAHNSSEASVAACPIYQQGKLVN ISPTLFSFKFLGCGSYIFRTAPNIRSIAEALSTYAIKNLNQRNLAICVDEKAIDNQSF RDEFSYAINKDGGKLINITCDFSAPHFNPNQVIVDAIKSGANGLVLAPHVDRINKALD LAAANKARLKLFGSPTLYTSQTLQQGRSDVNGLELVVPWHPEANFENNFAKNAQQLWR SPVTWRSATSYDAAVAIINGLQQSTTREELQKVLHNPNFYADGATGKIKFLQSGDRNI KNDVVLVKIKPTSASPTGYKFFLNSP" gene complement(3788..4984) /locus_tag="DP116_18245" CDS complement(3788..4984) /locus_tag="DP116_18245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="site-2 protease family protein" /protein_id="PRJNA477356:DP116_18245" /translation="MQTNWKIGSLFGIPLFLDPLWFVILGLATLNFGVAYQAWGPILA WSAGVVMALLLFGSVLLHELGHSLVARSQGIKVNSITLFLFGGIASIEEESKTPGKAF QVAIAGPFVSVVLFFFLRLLTYILPENSPASLMVGDLARINLVLALFNLIPGLPLDGG QVLKAALWKATGNRFQAVRLAAKAGQILGYGAIALGLAVDYFTGELVTGLWIALLGWF GIRNATTYDRITTLQETLLNLIAANAMTREFRVVDANQTLRSFADSYLLDISAFEVYF AESDGRYRGIVSIDDLRLVERSEWETLTVQSIVHPLTEIPTVAESTPIVEVINKLENK QLPRITVLSPAGAVAGVIDRGDIVRELAQKLSLRITEAEIKRIKEEGTYPPGLQLEVI AKSLQS" gene complement(5296..5574) /gene="psaK" /locus_tag="DP116_18250" CDS complement(5296..5574) /gene="psaK" /locus_tag="DP116_18250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457433.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit PsaK" /protein_id="PRJNA477356:DP116_18250" /translation="MISSLLLAVQATVANTGAEFSLNKFIIITASCILALLIIPRVIR YPHVGPKMPLPFPSVFNNPSVGAFLAAISTGHLVGVGAVLGLTNLGII" gene complement(5788..6045) /gene="psaK" /locus_tag="DP116_18255" CDS complement(5788..6045) /gene="psaK" /locus_tag="DP116_18255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215426.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit PsaK" /protein_id="PRJNA477356:DP116_18255" /translation="MISSILLAAVATTVPATPEWNPTVGIIISVSCLVALLLTSFIKS PKVGPKLPILPVTLPAFIGAMCFGHLIGVGIVLGLTNIGGL" gene 6226..6858 /locus_tag="DP116_18260" CDS 6226..6858 /locus_tag="DP116_18260" /EC_number="5.3.1.24" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribosylanthranilate isomerase" /protein_id="PRJNA477356:DP116_18260" /translation="MRVKICGITQPQQGKAIASLGATALGFICVPTSPRYINVEQIRA VVEQLPEEIDKIGVFANATASEITQTVVNSGLTGVQLHGDESLEFCQQLRQLLPDVEI IKALRIRSFEDTEKAETYTSNADTLLVDAYHPQQLGGTGTTLDWRMFSQFSPSCPWFL AGGLTPENIIEALTQITPSGIDLSSGVERAPGDKNLDKVAKLFEKLRSKC" gene complement(6961..7671) /gene="folE" /locus_tag="DP116_18265" CDS complement(6961..7671) /gene="folE" /locus_tag="DP116_18265" /EC_number="3.5.4.16" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408396.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GTP cyclohydrolase I FolE" /protein_id="PRJNA477356:DP116_18265" /translation="MTIARSNGTNCSQQSPLVPDLTEAITPRPDRNTHNGRQADLHPQ TEEQMEQMTDAVRTLLVGVGENPEREGLLKTPKRVAEAMRFLTSGYNQSLEEIVNEAI FDEGHNEMVLVRDINVFSLCEHHMLPFMGKAHVAYIPNQKVVGLSKLARIVEMYSRRL QVQERLTRQIAEAVQTILEPQGVAVVMEATHMCMVMRGVQKPGSWTVTSAMLGVFQEE HKTREEFFNLIRHQSSFF" gene complement(8799..9782) /locus_tag="DP116_18270" CDS complement(8799..9782) /locus_tag="DP116_18270" /EC_number="6.4.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318199.1" /note="catalyzes the carboxylation of acetyl-CoA to malonyl-CoA; forms a tetramer composed of two alpha (AccA) and two beta (AccD) subunits; one of the two catalytic subunits that can form the acetyl CoA carboxylase enzyme together with a carrier protein; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acetyl-CoA carboxylase carboxyl transferase subunit alpha" /protein_id="PRJNA477356:DP116_18270" /translation="MATTERKPLLLDFEKPLAELATRIEQIRQLAEENGVDVSGQIRQ LEARAMQLREEIFSSLSPSQRLQVARHPRRPSTLDYIQAISDEWMELHGDRCGSDDPA LIGGVGRLGGQPVVMLGNQKGRDTKDNIARNFGMASPGGYRKAMRLMEHANKFGMPIL TFIDTPGALPTVVAEQQGAGEAIAYNLREMFSLDVPIICAVIGEAFSGGALGISIGDR LLMFEHAVYTVITPEACAAILWKDASKAPQAAAVLKMTAQDLRSLGIIDQILPEPIGG AHSDPLGAVTTLKQALLNNLEELNHFTLSERREMRYEKFRKIGVFTEVTHS" gene complement(10180..11205) /locus_tag="DP116_18275" CDS complement(10180..11205) /locus_tag="DP116_18275" /EC_number="1.2.1.80" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194203.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="long-chain acyl-[acyl-carrier-protein] reductase" /protein_id="PRJNA477356:DP116_18275" /translation="MFGLIGHLTTLEHAQAAAKELGFPEYANEGLDFWCSAPPFIADN ITVTSVTGQKIEGQYIESCFLPEMLATRRIKAATRKILNAMAHAQKHGINITALGGFS SIIFENFNLEQFQHIRNIKLEFERFTTGNTHTAYIICQQVEQASKQVGIELSKATVAV CGATGDIGSAVCRWLDAKTDVKELLLIARNQERLQQLQDELGRGKILPIEEALPQADI VVWVASMPKGMEIDAKVFKQPSLLIDGGYPKNLETQIQHPGVHVLNGGIVEHSLDIDW KIMNIVNMDVPARQLFACFAESILLEFEKLYTNFSWGRNRITVEKMEQIGQVSRKHGF RPLLVES" gene complement(11577..12272) /locus_tag="DP116_18280" CDS complement(11577..12272) /locus_tag="DP116_18280" /EC_number="4.1.99.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866504.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldehyde oxygenase (deformylating)" /protein_id="PRJNA477356:DP116_18280" /translation="MQQLVDQPEIDFKSETYKDAYSRINAIVIEGEQEAHANYLKLAE LLSEHQDDLIRLSKMESRHKKGFEACGRNLQVTPDMQFAQEFFAQLHQNFQNAAAEGK VVTCLLIQSLIIECFAIAAYNIYIPVADEFARKITEGVVKDEYSHLNFGEVWLKEHFE ESKAELEEANRQNLPIVWKMLNSVEDDAHTLAMEKDALVEAFMIQYGEALSNIGFSTR DIMRLSAYGLKAA" gene 12467..12724 /locus_tag="DP116_18285" CDS 12467..12724 /locus_tag="DP116_18285" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18285" /translation="MINPKSEKSSALNARSPETDIRKGSIAAKLKISWSLGIQLLGAV IVLLPLVVELLTSFVPSGAVPLCFMGKWLVFSPLSRCLGAR" gene complement(12909..13940) /locus_tag="DP116_18290" CDS complement(12909..13940) /locus_tag="DP116_18290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999403.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18290" /translation="MQYNESIDELAALLQQPADFEFELPDPEDEEIPEPEFQKQLDVA WQVCDRFDLQTDIWRGRILRAVRDREKIGGEGRGAGFLKWLKEREIGKSQAYALIQLA NSADTLLEEGRLEPSAINNFSKRAFVETAKAAPEVQQMIGEAAQKGDRITRREVRQLT DEWTAMSSELLPEPVKIKAAENALPPRYIAPLVKEMEKLPESHQNAIQKEIAQSPDVD TIKQVTTDARNLAKYLKAAAQVQALNAQEVDIETALIEAQRVGCLSIAADLVNQASQL EQMIAKLYMAWKKISNLSDRLYVDTGASTPRLRELLECLEPLGGEVMELQLSGATERT IRLQIQETN" gene 14077..14736 /locus_tag="DP116_18295" CDS 14077..14736 /locus_tag="DP116_18295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315486.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gamma-glutamylcyclotransferase" /protein_id="PRJNA477356:DP116_18295" /translation="MSLTRADLESSRLQQTILQSGRAVNVLSETQLQASMHETLGQQK PNSDVWLFAYGSLVWNPIFKFAEQRIGTIYGWHRRFCLWVPQGRGTPDNPGLVLGLDR GGSCRGIAYQIAASDVHSELQLLWRREMVVGCYIPRWVRVFDGTQKLQAITFVINHQH RAYSGKISLETTVNSIATACGELGSCADYLMHTVNSLMSVGIKDQQLLRLREYVMARQ D" gene 14975..>15772 /locus_tag="DP116_18300" CDS 14975..>15772 /locus_tag="DP116_18300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997102.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent helicase" /protein_id="PRJNA477356:DP116_18300" /translation="MARTPTLTFDRGTLILHPPPRGRAWMDFATWDDRVEKFRIPAIQ YRALVEAMQAEQTSFIDEAKAFYPIELVSSLEMEPYPHQSEALAAWKLAGRQGVVVLP TAAGKTYLAQMAMQATPRTTLIVVPTLDLMHQWYAHLKAAFPDAEVGLLGGGSRDRTP VLVATYDSAAIHAESLGNLYALLIFDECHHLPTDFSRVIAEYAISPYRLGLSATPERT DGKHADLNFLIGREVYRKRAEDLAGKALAEHEVVQIKVKLSQNEREKY" BASE COUNT 4514 a 3389 c 3405 g 4464 t ORIGIN 1 gaaggcagtt ttgagaaagg gtttcctgtc agagttaaaa tcggggaaga aggtagacct 61 cattatgacg atttttctgg caggttaccg cccgcatcgg cagttcaggt aaactacgaa 121 aattggcaaa caatttatcg taacttacca gccaattggt taataatcct tcctgaaagt 181 caaataacaa atgtatctac tccaggagat tgcaatcaag cagctcaaat ttttatatcc 241 agtttcaacg aatggttaaa tcaaccatca gtacgaaaat tagagcgaca atttttacga 301 aaagttgatg attgggaaaa tgtccgcttt attttacaaa cgcaagattc tctgctacaa 361 cgacttcctt ggcatttatg ggatgttttc caggagaacg atcatcagtc agaaattgta 421 gtaagtccag agtacgaact attcaaaaaa aagttaaaaa caaggcagtt aaacactcct 481 gttaaaatcc tcgctgttct tgggtacggt caagatatca atattacaca agatttgtca 541 gccctagaaa aaaatttact aggcgcaaca attgaagcgt tgagagaacc atcaagccaa 601 gaattgagaa aaaaattatg ggaacaatct tgggatattt tgttttttgc aggacatagt 661 tgtagcaaac aaggtgatag ttggggagag attcaaatta atgccagcga gagtttatcg 721 ttggagaatc tgcgccatag tctcagacat gctgtccaaa aaggattaaa actagcgatt 781 tttaattctt gcgatggact tggactagca cgcaatttag 
ctgatgcgcg aattccctac 841 accattgtca tgcgcgagcc tgttcccgat atagtagcac aacatttttt ggaatatttt 901 ctcacagcat ttgcggctgg tgaatcttta tatgcatctg tacaacaagc tcgcgcacgc 961 ttgcaagaac aatgggaaaa tcagtatcct tgtgcttcgt ggctacctgt tatttttcaa 1021 aatccagcag cagcagaact gaaatatccc caacagcata attggaaaaa aattgctttg 1081 caaacagcga ttgttatcag tgcagtagta ggttttggtg ttatttcatg gcgtattatt 1141 catgagtttc aatctcgcgc tcggtttagt gatggtgata aaatattagt gaaaacattc 1201 acaactccct ataaacaaga gggtgtacag gcatttaggc aggaaaatta caatctagca 1261 atcagtaagt ttgaacagtc tttgcagcag tatcgtaacg atccagaaac gttgatatat 1321 cttaataatg ccaagattgg taatcaacca gcactgataa ttggtgttcc cgtacccatt 1381 ggtactaacc ccaatgttgc ccaagaaatt ctccgaggag tggcgcaagc tcaacaagag 1441 ataaacaatc aaggtgggat taatgggaag ttattgaaag ttgaaattgc caacgatgat 1501 aacaatcctg atattgctgt acaggttgct gatagatttg tgcaaaatca agatatctta 1561 gcagtcgttg ggcataacag cagcgatgct agccttccag cctctgagaa atatcaggcg 1621 ggaaaattgg tgatgatttc accaacgagc aactcaataa gattaacaga ccgcatagat 1681 cataataatg gtaactatat ctataggaca gttattagct ttactacaat tgcagactct 1741 ctcactgaat acgccaaaac aactggcaaa accaagattc ttatctgtaa tgattccaaa 1801 ggagcagatc agtcctcgga acaagctttt gttaggacaa tggaaaataa acacctgcaa 1861 cagattaata acattcagtg tgattttgcc gctaagaatt tccagccaga aaccattatt 1921 aaaaatgcaa aagaaaaagg tgtagatgcc atacttttga atcctcaagt agacagaata 1981 gatagagcga ttgcactcgc caaagcaaat caaggcaaaa ttacattatt aggaaacccc 2041 agtttacaaa ccttaggcac tctcgatgct ggcaatgccc tcaatggtat ggttatggca 2101 gttccctggc acgccggcgt ctcagccgac aaaaattttg tccaaaatgc caacaatctt 2161 tggcgcgaac cagactcaat cacttggcgt acagcaaccg cctttgatgc taccaaggca 2221 attgccgcag cacttaaaca aaagggtggt accagaagtg gagttcaaca agtcttatcg 2281 ggtgattttt ccctacaggg ggctacaggc acaattcgat ttttgtattg gggcgatcgc 2341 gcaggcgatc gcgttggtaa tgctgtgtta gtagaagtca aacgcaatcc caaagcttcc 2401 actggctaca gctttgaacc aaaagactct atgcaaagcc gcatcagtct tggggacaaa 2461 attttagttc aagataaccc cagtgacgag aaacaattgg gcgtgcaagc 
tttcgcagct 2521 ggaaattatg atcaggctat agcacatttt caagcatctg tgcaaaagat gcccaacgac 2581 cccgaggcgc ggatatactt acaaaatgct gacgctgctc gtagtggtaa aatcttgaaa 2641 attgctgtga gtgttccaat tggcagcaat cttaatgttg ccaaagaaat acttcagggt 2701 gttgctcaag ctcaagatga aatcaatcaa aaaggtggca ttcgaggaaa tttattacaa 2761 gtagaaatcg cttctgacga taataatccc aacattgctg aaaaacttgc caattcctta 2821 gtggcagacc aggaaatttt agcagttatt gctcataata gctctgaagc ttctgttgct 2881 gcttgcccca tttaccagca gggcaagtta gtaaatattt cccccactct tttttctttc 2941 aaattcttag gatgtggctc ctatatattt cgtactgctc ctaatattcg ttctattgct 3001 gaggctttat ctacctacgc tatcaagaat ctcaatcaaa gaaatttagc aatttgcgta 3061 gatgaaaaag ccatagataa tcaatctttt agagacgaat ttagttatgc catcaacaaa 3121 gatggaggaa agcttataaa tatcacctgc gatttctcag caccacattt caacccaaat 3181 caagtcattg ttgatgctat taagagcggt gcaaatggct tagtcttagc tcctcatgta 3241 gatagaatta acaaagcatt agatttagcc gcagccaata aagcaaggct gaaactattt 3301 ggcagtccta ccctttatac atcccaaaca ctacaacaag gacgctcaga tgtcaatggt 3361 ttggagttag tcgtaccttg gcatccagaa gcgaattttg aaaataactt tgccaaaaat 3421 gcccagcaac tttggcgtag tcccgtgact tggcgttctg ccacaagtta cgatgcagct 3481 gtcgcgatta ttaacggttt gcagcaaagt acaactcgtg aagaattaca aaaagtcttg 3541 cataatccca acttttatgc tgatggtgca acggggaaaa ttaaattttt acaatcaggg 3601 gatcgtaata tcaaaaatga tgtcgtctta gttaaaatta aaccaactag cgcatctcca 3661 actggttata aattcttttt gaactctcct taaatccagc gaatgatgca actagattgt 3721 ttttaaaaag catacttgag gtgctttact ttgttgaatg gtgcactcca agtatgctaa 3781 caactgttta cgattgcagc gattttgcaa tcacttccaa ttgcaaacca ggcggataag 3841 tgccttcctc ttttatccgc ttaatttcag cttcagtgat tcgcaaactt aacttttgtg 3901 ctaactcccg cacaatatcc cctcgatcaa tcacaccagc gacagcacca gcaggagaaa 3961 gcacggtgat acggggtaat tgtttatttt ccagtttgtt aatcacctct actatgggag 4021 ttgattcagc aacagtagga atttctgtca ggggatgcac tatactttgc accgtcaggg 4081 tttcccattc acttctttcc acaaggcgca agtcgtctat ggaaactatc ccccggtaac 4141 gtccatcaga ctcggcaaaa taaacttcaa aggcgctgat atctaaaaga taagagtcag 
4201 caaaagaacg caaagtctga ttagcatcga cgactcgaaa ctcacgagtc atggcattgg 4261 cagcaattaa attgagcaag gtttcttgta atgttgttat gcggtcatag gtggtggcat 4321 tgcggatacc aaaccaacct aacaatgcta tccacaaacc agtgacgagt tctccagtaa 4381 aataatctac tgctaggcct agggcgatcg ccccataacc caaaatctgc cccgcttttg 4441 ctgctaaccg taccgcttga aaacggttcc ctgttgcttt ccaaagtgct gcttttaaaa 4501 cttgtccgcc atctaagggc aaaccaggaa tcaagttgaa cagtgccaga actaagttga 4561 ttctggctaa atctccaacc ataaggctgg caggactgtt ctcgggcaaa atatatgtta 4621 gcagtctgag gaagaaaaac aagacaacac tcacgaatgg tccagcaatt gccacttgaa 4681 aggcttttcc tggagttttg gactcttctt cgatagaagc aatgccacca aacagaaata 4741 gagtaatcga attaactttt atgccttgcg atcgcgcaac taagctgtga cccaattcgt 4801 gcaataacac tgaaccaaaa agtagcaatg ccatcaccac tccagcactc caagctagga 4861 taggtcccca cgcttggtaa gcgaccccaa agttaagggt tgccaaccct aaaatcacaa 4921 accacagagg gtctaaaaac agtgggattc cgaataaaga cccgattttc caatttgttt 4981 gcattacatt ttcctaaaac tagatatttg taatcgtcca gaaatacgtt ttatatgttc 5041 ttttttacac atatatcatt ttgccccatt aacttttcct tgctgaaatt tggaaaatca 5101 gcaatggctg gttatataga ctccaatttc tagaatagac aattcttgct agcgagcaaa 5161 actcaaacta ggggcgcagt catttaagaa ttaaaaacag aagcggcaag tcttctctgc 5221 ttcaaagtgt gccttgaagg agaggaaacc tgccgcctct tgtgcttcag atgtagtatt 5281 tgtgtaagat aaaatctaga taatgccgag attggtcaag cccaaaactg caccaacacc 5341 aacaaggtga ccagtactga tcgcggcgag aaatgcacca acgctaggat tattaaatac 5401 tgaaggaaaa ggtagcggca ttttaggacc aacgtggggg tagcgaatca cccgaggaat 5461 aatcaaaaga gctaatatgc agctggcagt gataattata aatttgttca agctaaattc 5521 agcgccagta ttggcaacgg tagcttgaac tgctaataac agtgatgaaa tcacggttct 5581 tcctcctcgc atgaataaat tcatttatct gtgtagagac gcgagattgg tgcgtctcta 5641 cagccatgtt tatcctctgc taatgaatta gcagaatcct ctttcttagg gaggtattaa 5701 tctctcccaa gagtcataac tagatagagg gagaaagaat gtgatcgggg atacagagat 5761 cttgtcatgg aacatctctc ttaaaatcta gagaccaccg atatttgtta gccctagcac 5821 gatgccaacg ccaataagat gcccaaaaca catagcaccg atgaatgctg gaagagttac 5881 
gggaagtata ggcagtttgg gacctacctt gggagattta ataaacgatg tcagcaaaag 5941 agctactaaa cagctgacgc tgatgatgat tcctacagta ggattccact caggtgttgc 6001 gggaacagtg gtggcgacag cggctagtaa tatagatgaa atcaagcttt ttctcctaac 6061 tagaaaaatc gacaactggt caataatcag catttagcca gctaaatgtt gatgactgat 6121 tgtttcaacc ataaaaatta tagaattata aaggaaagaa cttgaaattg tttatacaac 6181 ttcgtttttc tgtaattatt actactggta ttatcaaaaa gatttatgcg ggtaaagatt 6241 tgcggaatta ctcaaccaca acaggggaag gcgatcgcct cccttggtgc aacagcatta 6301 ggatttattt gtgtccctac ctcaccgcgc tatatcaatg tagagcaaat tcgggcggtt 6361 gtggaacaac tgcccgaaga aatcgacaaa atcggagttt ttgccaatgc taccgcttca 6421 gaaattaccc agactgtggt taattctggt ttaactggcg ttcaactgca cggtgatgaa 6481 tccctggaat tttgccagca gttgcgtcaa ttgctaccag atgtagaaat tattaaagct 6541 ttgagaatcc gtagttttga ggatactgaa aaagcagaaa cttatacttc aaacgcagat 6601 acgctgttag ttgatgctta ccatccacag caactaggtg gtacaggtac aactctagat 6661 tggaggatgt tttcgcaatt cagccctagc tgtccttggt ttttagctgg gggactcact 6721 ccagagaata ttatagaagc tttgactcag attaccccca gtggcattga cctatcgagt 6781 ggtgtagaac gtgcccctgg agataagaat ttagacaagg tagccaagtt gtttgagaag 6841 ctgcgttcaa agtgctgaat gtgattaaca gttatcagtt atcagttacc agttatcagt 6901 tatcactgtt cattgttcac tgttcactgt tcactgttca ctgttcttcg tactcagcac 6961 ttaaaagaac gatgattggt gacgaattaa gttgaagaat tcttcacgag tcttatgctc 7021 ttcttgaaac acgcccagca ttgcgcttgt cacagtccag gaaccaggtt tttgtacacc 7081 tcgcataacc atgcacatat gtgtagcttc catcacaaca gcaacaccct gcggttctag 7141 aattgtctga actgcttcag caatttggcg agtgagtctc tcttgcactt gcaagcgtcg 7201 ggaatacatc tcgacaatgc gggcaagctt gctcaatccc acgacttttt gattaggaat 7261 ataggcaaca tgagccttgc ccataaacgg caacatatgg tgttcgcaca agctaaaaac 7321 gttgatgtcc cgtactaata ccatctcatt atgaccttca tcaaagatgg cttcgttgac 7381 gatttcttct aaagattggt tatagccact ggtgagaaac ctcattgcct ctgctacccg 7441 cttgggtgtt ttcagcaatc cctcgcgttc ggggttttct ccaacaccca ctagaagagt 7501 ccgcacggcg tccgtcattt gctccatctg ctcctctgtt tgcgggtgca agtcagcttg 7561 ccgcccattg 
tgagtgttcc ggtcaggtct tggggtgatg gcttccgtca agtctggaac 7621 cagaggagat tgttgagagc aattggtacc gttggaacga gcaatagtca tgatcgagtc 7681 tttgttaagg tttgattagg agcaacgagt cagcagtcat gagtcttgtg ttattagcaa 7741 aaaaccaagg acaaaagaca aatgacaagt tagagtgtgc caatgccagg cacgatagtc 7801 aattcatcaa taactgcctg ttgtggtagc aaaggaacgt caagaattga ctttcaaatt 7861 gtttaacggt ttgataacta tgtcactacc ctgttatata agtctcgtag tgtcgataga 7921 ttaaacaaac ctaaaaatcc gattaatttt cttcgacaat tgcgtgagtt gcattgcaaa 7981 gaagatatcc tccccatctc gtatattcta tagggttaga aactgttgat gacaaaagtt 8041 gagttgtata aactcctaaa atctatacag tatcgccgct cgtactggta taggtaagcc 8101 gaaataattt tggctacgaa ctgccatttg cgctcctact cctgttgaaa ttcatatcat 8161 aattgaatgg gatatagaaa gcttacgcag tttaacgagt acgcggttta tacctcctac 8221 caagaactac aagtttatat aactttggtt tctaggataa ctgtatattt tgcctcagaa 8281 atcttgcact accttgaatg aaaattaacg agtcatgagt acatggcata cagtttagtg 8341 tccattgtca caacatccct aactcatgat tcaataaccc tcgctgctta cacattcgct 8401 tatttgctgg ttcaaatagt gccttgcgga cacacaccgc ctcggaaaca gcagcacgct 8461 gcgctaacac aaagtgtgcc tgcactgacg aaaaaatttt gaattgatca tgcgtgagtt 8521 ttgaattaag ggaatagggt gcgtgcccgt aagcgcggct tatccagagg actcagcatg 8581 cgctctgcgt tcagccgtgc cgaacggaga cgccgcgcgt tcgccctttg ggcgtgcgcc 8641 ttgcgcatac ggcgtgagcc atagggacaa gacatagggc ttagcagttg actcatgact 8701 caaaatacaa atgcctatgg atgaaaagat tgttaaaaat ctttaagttc cataggcaat 8761 tatagcattg ccgctatttt aatcaaatag gtagagtttc atgaatgggt aacttcagtg 8821 aaaacaccta ttttgcggaa tttttcgtag cgcatttcgc gtcgttcact aagggtaaag 8881 tggttgagtt cctccaaatt gttcaagaga gcttgcttaa gagtagttac tgctcccaaa 8941 ggatcagaat gagcaccgcc aatgggttca ggtaaaatct ggtcgataat tcccaagctt 9001 ctcaaatctt gtgctgtcat tttcaacaca gcagctgctt ggggagcctt gctagcatct 9061 ttccacaaaa tagcagcaca ggcttcagga gtaataacag tgtaaacggc gtgttcaaac 9121 atcagtaggc gatcgccaat actaataccc aatgcaccgc cagaaaaggc ttcaccaata 9181 acagcacaga tgatcggcac atccaaggaa aacatctcac gtaggttgta tgcgatcgct 9241 tctcctgcac cttgttgttc 
agcgacgact gtgggtaaag ctcctggcgt gtcgataaaa 9301 gttaaaatag gcatgccaaa cttgttggcg tgttccatca agcgcatcgc cttacgatag 9361 ccaccagggg acgccatacc gaagttgcgg gcaatattgt cttttgtgtc gcgccctttt 9421 tgattaccca acataaccac gggttgtccg cctaaacgac cgacaccacc aattaaagca 9481 gggtcgtcag aaccacagcg atcgccatgt aattccatcc attcatcact gatagcttga 9541 atataatcaa gggtactggg gcggcgggga tgacgggcga cttgcagtcg ttgagacggc 9601 gatagactac tgaaaatttc ttcacgcagt tgcatggcgc gtgcttctag ctgacgaatt 9661 tgaccagaaa catcgacgcc attttcttct gcaagttgcc gaatttgctc tattctagtt 9721 gccagttctg ctagtggctt ttcaaaatcc aacagtagcg gtttacgctc ggtagtagcc 9781 attttttgat tgaaaagtga agagtaaaaa ttgaggtgtc ggatgtgggc aataggaatt 9841 tctatagtca tgagtcatat gcataagcaa gggagcagtg gactgggaga acaagcggga 9901 aattttctca aaatattgac cgggcgatgt ttgagatgac tgacggcgtt ggtcattctt 9961 tctgccttac tccctgcttc ctttcctcct gcctatgcat gtggtgagac cagcgctgca 10021 cagagaagtg ccttgcgcgg gttccccgcg ttgtggcaac ttcggagagg gtctccgtcc 10081 gtaggcgact ggcgtatgcg caaggcgcac gcccagaggg ctaaagcgca gcgtgaccgg 10141 aggtcatacc cgaagggtca ctagttagtc actcatgact catgactcaa ccaaaagtgg 10201 tctaaatccg tgctttcttg agacctgacc aatttgctcc attttttcta cagtaatccg 10261 attgcgcccc caagagaagt tagtgtataa cttctcaaat tccagtagta ttgattctgc 10321 aaaacaagca aataactggc gtgctggaac atccatattg acgatgttca taattttcca 10381 atcaatatcc agggaatgtt ctacaatgcc accatttaag acgtgtacac caggatgctg 10441 aatttgagtt tctaagtttt ttggataacc accatcaatg agcaaagaag gttgcttaaa 10501 aactttggcg tcgatttcca ttcctttggg catactagca acccaaacga caatatcagc 10561 ttggggaagt gcttcttcta taggcaggat ttttccacgc cccagttcgt cttgcagctg 10621 ttgtaggcgt tcttggttac gggctatcag caaaagttcc ttaacatctg ttttagcatc 10681 taaccaacga caaacagcgc taccaatgtc cccagttgcc ccacacacag caacagttgc 10741 ttttgacagt tcaatgccta cttgttttga cgcttgttct acttgttggc aaatgatgta 10801 tgccgtatga gtattacctg tagtaaagcg ttcaaactct aatttaatat tgcggatgtg 10861 ttggaattgc tctagattaa agttttcaaa aattatcgaa gaaaatccgc ctaaagctgt 10921 aatatttatc 
ccgtgcttct gagcatgagc catagcgttg aggattttgc gtgttgcagc 10981 tttgatgcga cgagttgcaa gcatttctgg caaaaaacat gactctatat attgcccttc 11041 aattttttgc ccggtaacac tggtaactgt aatgttatcg gcaatgaaag gcggagcgct 11101 gcaccaaaaa tctagccctt cattggcata ttctgggaag cctaattctt tggctgccgc 11161 ttgcgcgtgt tctaaagtag tgagatgtcc gattagacca aacatgaacg attttgttaa 11221 gcgtgtcctt cgttagaagg gtggaatagg cgggttaaca tataatgaac ccttcaaaat 11281 actacacaaa aactcacaac caaatcagag tcagctctgg cttttgagaa actcaaaaaa 11341 agcaagctcc actcgcggcg ctgaaattta aaattaccta agttctgatc tgaggaaaac 11401 ccagctcaga attctctcaa aattcaaagt atgccctacg ggcacgctgc gctatcaaag 11461 ttcaaagttc aaaattattt ttttgaactt tgaataaata attttaaatt ccccccaagg 11521 gtagattcct tatccccatt tctttccaaa gtggggattt tgagtttcct attcttttat 11581 gctgctttga gtccgtaagc tgacagacgc atgatatcgc gggtggaaaa accgatgtta 11641 ctaagagctt ctccgtattg aatcatgaaa gcttctacta aagcgtcctt ttccatagcc 11701 aaagtgtggg catcgtcctc aacagagttg agcattttcc agactatagg aaggttttgg 11761 cgatttgctt cttctaattc cgcttttgat tcttcaaagt gttctttcaa ccatacttct 11821 ccaaagttga gatggctata ctcatctttc actacccctt cagtaatttt gcgagcaaac 11881 tcatcggcaa ctgggatgta aatgttgtat gcggcgatcg caaaacattc aataatcaaa 11941 gactgaatca gcaagcaagt aacaacctta ccttcagccg ccgcattttg aaaattttga 12001 tgcaattggg caaaaaactc ctgagcaaat tgcatatcgg gtgttacctg aagattgcgt 12061 ccacaagctt caaatccttt cttgtgacga ctttccatct tagagaggcg aatgagatca 12121 tcttgatgtt cgctcagcag ttcagcgagt ttaagataat ttgcgtgggc ttcttgttct 12181 ccctctatca caatcgcatt gatgcgactg taggcatctt tgtatgtttc gcttttgaag 12241 tcaatttcag gttgatcgac aagctgctgc atggtacact cactcccgta atgtgaatta 12301 tcttatacag aaatctactt aaccccatga aatgagggca acggtaactg tgttttaatt 12361 ttacttaaca agtctctatg ttatagatta gcttttgctg ggggcattag ttcaagctaa 12421 ttcacaggaa aaccaggaaa taagacagac acgggactaa taatggatga tcaatccaaa 12481 atccgaaaag tcaagcgctc tcaacgcacg atctccagaa acagacatcc gcaagggtag 12541 catagctgct aaattgaaaa tttcctggag cctaggaatt caactgctag gtgctgttat 
12601 agtattatta cctttagttg tcgagttgtt gacatccttt gttccttctg gagcagttcc 12661 cttgtgtttt atgggaaaat ggctggtctt tagtccatta tcgcgatgcc tgggagctag 12721 gtgaattttt acttgctttt acttcattct accttggtgc gcgatcaccc gtatatgccc 12781 ggagggtaac cttggcgaaa ccctcccctc gaattttcga ttcgaggcaa caaagctgtg 12841 ctttgttggg gcatacggca ggcaaagcca tcgctattac cgtctttgaa attatgactc 12901 cctattccct aattagtttc ctgaatttgt aagcggatag tgcgttcagt cgcaccactc 12961 agttgtaatt ccataacttc accaccaagc ggttctaagc attcaaggag ttctcgcaag 13021 cgcggcgtac ttgcacctgt atccacgtac aaacgatccg atagattact aattttcttc 13081 caagccatgt acaatttggc aatcatttgc tctaactgag atgcttgatt cactaagtct 13141 gctgctatgc tcagacagcc aactctttga gcttctatga gggctgtttc gatgtcaact 13201 tcttgagcgt ttagcgcttg gacttgagct gctgctttga gatattttgc cagattacga 13261 gcatctgtgg taacttgctt gatggtatct acatcgggac tttgagctat ttccttttga 13321 atcgcgtttt ggtgagactc tggtaatttc tccatctctt ttactaatgg cgcgatgtaa 13381 cggggaggaa gagcattttc tgctgctttt atcttgactg gttcgggtag caactcggaa 13441 gacatggctg tccattcatc agtgagttga cgcacttctc gcctggtgat gcgatcgcct 13501 ttttgcgctg cttcacctat catctgttga acttctgggg ctgctttagc tgtttctaca 13561 aacgcccgtt tgctaaagtt attgatggct gatggttcta atcttccctc ttctaaaaga 13621 gtatcagcac tgttcgcgag ttgaatcaga gcatatgctt gacttttgcc aatttctcgt 13681 tctttgagcc atttgagaaa accagcacct cgtccctcac cgcctatttt ctctctgtcg 13741 cgaacagcgc gtaaaattcg tcctcgccaa atgtcggttt gcaaatcaaa gcgatcgcac 13801 acctgccaag ccacatctag ctgcttttga aattccggtt ccggaatttc ctcatcttcg 13861 ggatcaggga gttcaaattc aaaatcagcg ggctgttgta gaagtgcagc caactcatca 13921 atagattcgt tatattgcac gattttatag tttattaatt tattactgta cgcgcggttt 13981 attatcacat attccgcgta agctagattg tatgcctgag ttatttgttc ctaacggcag 14041 ctcattctgg cttgagtaca gcaaaggggc tatatcttgt cactaacacg agccgacctc 14101 gaatccagcc gcttgcagca aaccatctta caatctggac gtgcggtaaa tgtcctgagt 14161 gaaacccagt tgcaggcgtc aatgcatgaa actcttgggc aacaaaaacc aaattctgat 14221 gtttggttgt ttgcctatgg ttctctggtt tggaacccca 
tcttcaaatt tgcagaacag 14281 cgcatcggca cgatttacgg ttggcatcgc cgcttttgtt tatgggttcc ccaggggcgt 14341 ggtactccag ataatccggg gttggtactc ggtttggata gaggcggtag ttgtcgcggt 14401 atcgcctacc aaatcgctgc tagtgatgta cactccgaac tacagctact ttggcgacga 14461 gaaatggtag ttggttgtta cattcctcgt tgggtgagag tgtttgacgg tacgcaaaaa 14521 ttgcaggcga ttacttttgt catcaatcac caacatcgag cctacagtgg taagatttcc 14581 cttgaaacta cagttaatag cattgccaca gcttgcggtg agcttggttc ttgtgctgac 14641 tacctcatgc acaccgtcaa ctccttgatg agtgttggaa ttaaagatca acaattgctt 14701 cggctgcgcg agtacgttat ggcgcgacaa gattagggat gtgttctcaa agtgttgcga 14761 ttatcaagat gcaggcaaag actacgattg ttttggcttt tattaagatt tagaaattag 14821 ttttgacaaa atgcgttttt ctggtttcgc taggtttgca cccttcaatt cacgatcacg 14881 aatgttagat ggttcagcga ggttagcaac aacagctttt aggttattaa tagagacact 14941 tgtgatcttt taacaaagtc gggaaaagcg aaaaatggct cgcacgccca cactaacctt 15001 tgatcgtggc acattaattt tgcatccacc accacgcggt agggcttgga tggattttgc 15061 tacatgggat gatagagtag aaaaattccg cattccggct attcaatacc gtgctttggt 15121 ggaagcaatg caagcggaac agacgagttt tatcgatgaa gctaaggcgt tttatccgat 15181 agagttagtt tccagtctgg aaatggaacc ttatccccac cagagtgagg cgttagctgc 15241 ttggaaactc gcgggaagac aaggagtcgt tgtgcttcct accgcagcag gaaagacgta 15301 tctggcgcaa atggcgatgc aagcgacgcc acgcacaacg ctgattgttg ttccaacgct 15361 ggatttaatg catcagtggt atgctcattt gaaggcggcg tttcctgatg ctgaggttgg 15421 gttacttggc ggtggttcgc gagatagaac acctgtactt gttgcgactt atgatagtgc 15481 ggctatccat gcggaaagtt tggggaatct gtatgctttg ttaatttttg atgaatgtca 15541 tcatttgcca acagatttta gtcgggtgat tgcagaatat gcgatctcac cctaccgttt 15601 gggattatct gcgacacccg aacgcactga tggtaaacac gctgacttga attttttaat 15661 agggagagaa gtttaccgta aaagggctga ggatttagca gggaaggcgt tagcagaaca 15721 tgaagttgtg caaattaagg tgaaattatc acaaaatgag cgggaaaagt ac // LOCUS NODE_2139_length_15686_cov_4.97748115686 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15686) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15686) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..15686 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 209..1435 /locus_tag="DP116_18305" CDS 209..1435 /locus_tag="DP116_18305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744652.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chromate transporter" /protein_id="PRJNA477356:DP116_18305" /translation="MSQNAEDTQLAAQSIPYAELDPQQQKQRLQELALVFLKLGAIAF GGPAAHIAMMDSEVVTRRQWLSREKLLDLLGITNLIPGPNSTELAIHIGYERAGWLGL LVAGSCFILPAMIIVWTLAAIYARYQTIPQVEWLLYGIKPVIIAIVVQAVWLLGKKAI KDIPTTLAAIAVIVAFFLKVDELLLLLLAGLGVMFLKNLWQRKNRTSAAWLLPISLVM GQTGGAAVATPVSWLRVFLLFLKIGSVLYGSGYVLLAFLQKELVEQNHWLTSQQLLDA IAIGQLTPGPVFTTATFIGYLLAGHAGAIAGTIGIFLPAFVLVWIVNPWVPKLRQSSW VSSFLDGVNAASLGLMAVVTYTLGRAAIVDWLTVVLTLLSLIAVFRFKINSAWLVLGG GFVGFVARFLNGAILQ" gene complement(1705..2595) /locus_tag="DP116_18310" CDS complement(1705..2595) /locus_tag="DP116_18310" /inference="COORDINATES: protein motif:HMM:PF04072.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18310" /translation="MKTTSPDHSKISPTAKLVAYFRQFSDIPFSRDVATLIHAEDVLK NFSQGTNLTPEFLKWAALAAEIRYKSIVSAIKKEGITQVLELASGLSFRGLAMTEDPE YIYVETDLPELMQEKQQILSRIISNHGLKERKNLFFDAVNILSFPEIESAIRHFKPNH PVAVIHEGLYHYLSMEEKERAARNIHSVLSRFGGAWITPDFLTNAEHEGRLQTHSELQ NIAQGVQGTTQRDVHKTGFDNQQQIVDFFSHLGFRSRISPQIDGSYQLTAMQNLDISE DEFKNYKSLNFKIWILTVEE" gene complement(3059..3763) /locus_tag="DP116_18315" CDS complement(3059..3763) /locus_tag="DP116_18315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320083.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_18315" /translation="MRKLAMIDKIIQEQIAYYRARANEYDEWFYRLRRYDRGEEINQR WFNEVDVVKQALQKVGQNDKILELASGTGIWTQELLNVGKKITAIDASEEVIEINRSK LNSPKVEYRQIDLFAWEPDTEYDLVFFSFWLSHVPPKLLKSFLTKVYKSVRVGGQVFI VDSRFEPTSTANNHILNDDGSIYQSRKLNDGQEYQIVKIFYQPDELQNKLTEVGFKAD VKVTENYLIYANGRKF" gene complement(4075..5379) /gene="nifK" /locus_tag="DP116_18320" CDS complement(4075..5379) /gene="nifK" /locus_tag="DP116_18320" /EC_number="1.18.6.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015122177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase molybdenum-iron protein subunit beta" /protein_id="PRJNA477356:DP116_18320" /translation="MPQNPEKIQDHVELFHQPEYQQLFQNKKEFENGHDPEEVKRVAE WTKSWEYREKNFAREALTVNPAKGCQPLGAIFAAVGFEGTLPFVQGSQGCVAYFRTHL TRHYKEPFSGVSSSMTEDAAVFGGLQNMIDGLANSYQLYKPKMIAVCTTCMAEVIGDD LQAFIGNAKNAGSVPQDFPVPFAHTPSFVGSHITGYDNMMKGILSNLTAGKKKETSNG KINFIPGFDTYVGNNREIKRMCNLMGIDYTILADNSDYLDSPNTGEFDMYPGGTPLEE AADSINAKATVALQAHSTPKTRDYIAKEWKQEVTVSRPWGIKGTDEFLMKLSELTGKP IPEELEIERGRAVDAMTDSHAWVHGKRFAIYGDPDLVYSVVGFMLEMGAEPVHILVHN SNEEFAKELQELLDSSPNGKSATLWAGKDMWHMRSLMCLLSG" gene complement(5552..5749) /locus_tag="DP116_18325" CDS complement(5552..5749) /locus_tag="DP116_18325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006278577.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18325" /translation="MLGINIRHLKLLNHKKDYSGPYHGYDGFAIFARDMDLALNSPTW GLIGAPWSQKAKAKAEAKAVA" gene 5937..7238 /locus_tag="DP116_18330" CDS 5937..7238 /locus_tag="DP116_18330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194552.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="site-specific integrase" /protein_id="PRJNA477356:DP116_18330" /translation="MRTKVLQELEKVNQRLKSAKTKVTIRESNGSLQLRATLPIKPGD KDTNGTGRKQYNISLNIPANLDGLKTAEEESYELGKLIARKTFEWNDKYLGNEAIKKD FQTIGELLEQFENEYFKNHKRTTKSEHTFFYYFSRTKRFTNPKDLASPENLINSIEKI DKEWAKYNATRAISAFCITFKIEIDLSRYSKMPENNSRNIPTDVEISAGIIKFADYLN NRGNQVNQDVKDSWQLWRWTYGMLAVFGLRPRELFINPDIDWWLSEENADMTWKVHKD CKTGERQALPLHKQWIEDFDLRNPKYLEMLATAISKKDNTNHAEITALTQRVSWWFRK IGLDFKPYDLRHAWAIRAHILGIPIKAAADNLGHSVQVHTQTYQRWFSLDMRKLAINQ ALSKRNEIELIKEENTKLRMENEKLKLEIEKLKMELVYKRS" gene 7449..8489 /locus_tag="DP116_18335" CDS 7449..8489 /locus_tag="DP116_18335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011613961.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_18335" /translation="MAGSISTVDLFCGAGGLTYGFEQGGLPVRAGYDIDPACQFPYEH NTKAEFILEDVERINGSDLAKHFSGSSVKVLAGCAPCQPFSSYSRRYTDKESRWKLLQ DFARLVQECEPEIVSMENVLQLKYHSVFYEFIQQLEDLSYSFETYEVNCSDYGIPQTR KRLVLLASKFGKIALIKPTHNTEKYGTVRKTIGHLEPLFAGQASKTDRLHQCSKLSPL NLQRIRASKPGGTWRDWSKDLIAKCHTKISGKTYPGVYGRMEWDRPSPTITTQCFGFG NGRFGHPEQDRAISLREAALLQTFPADYEFVAPNEPVVFERVGRLIGNAVPVKLGQVI AQSILQHIHEVI" gene complement(8486..9181) /locus_tag="DP116_18340" CDS complement(8486..9181) /locus_tag="DP116_18340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002783751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18340" /translation="MQTVLLDFNARVQEINEYFLFLEGLINETIKLAVSEDGGGQKIR AIDPELAKTLKANGFLLLYNLIESSMRNAIEAIFDELKGKKVSFNSVRIEIRKVVLQN FKNRSPEDIHTRITDISLDIITAGFKSRELFSGNIDRDEITKTARKYGFSCDTDYSKT KHGENLYIIMRNRNDLAHGNKSFSEVGKDISIGDLLKVKEEVIEYIRQILKNIEKYLN AKEYLDSSLVGTP" gene complement(9184..10272) /locus_tag="DP116_18345" CDS complement(9184..10272) /locus_tag="DP116_18345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF262 domain-containing protein" /protein_id="PRJNA477356:DP116_18345" /translation="MPKAPNIPPDPQITDEQREAAEEEIREKQKIVDYDTKEYPVEVL VQKYREDLEEDISELYIPDYQRELIWEDSRQSKFVESIFLGLPIPYIFVADLRPEKDD LGRLEIVDGTQRIRALDRFLNNELKLCELKKLTRLNGFRFSDLPLARQRRFNRATIRM IVLTEKADEEVRRDLFERINTGSIALNDMEKRRGISPGPFVDLLEELAKEPKFVKLCP LSEASRRSREPEEFVLRFFAYLDNYKNFERQVNVFLNEYLEAHNNSKIDKDAFRNTFH TMLDFVEKYFPNGFTKAKGHVRTPRIRFEAISVGVALALREKSDLEPSSIDWLDSPEF KEYTTSDASNSRPKVIRRIEYVRDQLLN" gene complement(10560..11153) /locus_tag="DP116_18350" CDS complement(10560..11153) /locus_tag="DP116_18350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870133.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_18350" /translation="MVQIPAKLVTLEEFLKLPETEPTSEYIDGRIIQKPMPQGEHSVI QTELAPAINLVVKSKQIARAFCELRCTYPAGSRSSSVYGGRSIVPDISVFLWGRIPRK ENGGVANIFSIAPDWTIEILSPDQSQTKVTKNILHCLKHGTQMGWLIDPEEQSVFVYP PDQSPTFYDEPGTRLPMPEFAKDFNLTVEGLFGWLLE" gene 11417..12011 /locus_tag="DP116_18355" /pseudo CDS 11417..12011 /locus_tag="DP116_18355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314867.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 11621..11630 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 12131..12420 /locus_tag="DP116_18360" /pseudo CDS 12131..12420 /locus_tag="DP116_18360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012597134.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HigB family toxin" gene 12420..12653 /locus_tag="DP116_18365" CDS 12420..12653 /locus_tag="DP116_18365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015081302.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18365" /translation="MQSFNLDQTITAWSSIAENVFVPHTEEEYDRLVEMLDRLIDQVG EDESHPLASLMEVIGVLIENYETQHIPELEDIA" gene complement(12712..13314) /locus_tag="DP116_18370" CDS complement(12712..13314) /locus_tag="DP116_18370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015081303.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18370" /translation="MPYSQFTNISKVKETFGLKTQEGGRFIPPTEPIEASATLRAYLE ESLPLVSSASEKARSEGIIYPVLLEVRRILNRQISLFSGEDFTVDEAVGLNGMCDFLL SRSPEVLEIEAPAIIVVEAKKADLRTGFGQCIAEMVAAQRFNAAKNRPISVIYGSISN GTQWRFLKLEDNTVTIDLMDYPLPPVEQILGILVWMVQNG" gene complement(13634..15040) /gene="nifD" /locus_tag="DP116_18375" CDS complement(13634..15040) /gene="nifD" /locus_tag="DP116_18375" /EC_number="1.18.6.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase molybdenum-iron protein alpha chain" /protein_id="PRJNA477356:DP116_18375" /translation="MTYTDDKKSSEQDPKELVEQRKELIKEVLDAYPEKAKKKREKHI NVYEEGKSDCGVKSNIKSLPGVMTARGCAYAGSKGVVWGPIKDMIHISHGPVGCGYWS WSGRRNYYIGTTGVDTFGTMHFTSDFQERDIVFGGDKKLLKLIEELDELFPLNRGVSI QSECPVGLIGDDIEAVARKSSKEIDKPVVPVRCEGFRGVSQSLGHHIANDMVRDWVFT RSDKERKEGTLKFESTPYDVAIIGDYNIGGDAWASRILLEELGLRVVAQWSGDGTINE MMQTPNVKMNLIHCYRSMNYISRHMEEAYGIPWMEYNFFGPTKIAASLREIASKFDEK IQENAEKIIAKYQPVMDEIIAKYRPGLEGKTVAMMVGGLRPRHVVPAFQDLGMRMIGT GYEFAHNDDYKRTTDYIENGTIVFDDVTAYEFEEFIKALKPDLVASGVKEKYVFQKMG LPFRQMHSWDYSGLSNGA" gene complement(15166..>15686) /gene="nifH" /locus_tag="DP116_18380" CDS complement(15166..>15686) /gene="nifH" /locus_tag="DP116_18380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320752.1" /note="nitrogenase iron protein; nitrogenase component 2; with component 1, a molybdenum-iron protein, catalyzes the fixation of nitrogen to ammonia; nitrogen reductase provides electrons to the nitrogenase complex; in R. etli there are three essentially identical copies of nifH which are actively expressed during symbiosis; Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=3 /transl_table=11 /product="nitrogenase reductase" /protein_id="PRJNA477356:DP116_18380" /translation="VSYDVLGDVVCGGFAMPIREGKAQEIYIVTSGEMMAMYAANNIA RGILKYAHSGGVRLAGLICNSRKVDREIELIETLAERLNTQMIHFVPRDNIVQHAELR RMTVNEYAPDSNQGNEYRALAKKIVENTKLTIPTPLEMDELEALLVEFGILDDDTKHA EIIGKPAEATAK" BASE COUNT 4484 a 3180 c 3297 g 4715 t 10 others ORIGIN 1 aagttctgac gatgaatgta gaatcccctc gcctttaggc aggggagtgt caattttagt 61 ttccgatctt gcgtcttcaa tcactggtgc agagattctt gttgatggtg gacaaactcc 121 tggagtatga tttgcagtcg aatctagcta tatttagcaa cgagggagat atgaacgatc 181 catttcatca gaaagaaggt tggaaccaat gtctcaaaat gctgaagata ctcagctggc 241 tgcacagtca atcccttatg ctgaattaga cccacaacaa caaaaacaac gactccaaga 301 actagcactt gtttttttaa aattgggcgc gatcgccttt ggaggtccag cagctcacat 361 cgcaatgatg gattcagagg tggtgactcg tcgccaatgg ttgagccgcg agaagctttt 421 agatttgcta ggtataacca acctgatacc tggtcccaat tctaccgagt tggcaattca 481 tattgggtat gaaagagctg gatggcttgg actgctcgtt gctggttcat gcttcatttt 541 acctgccatg attattgttt ggactctggc agccatttat gctcgctacc aaaccattcc 601 tcaagttgaa tggttactct atggtattaa acccgtaatt attgccattg tcgtacaagc 661 agtgtggctg ttgggtaaaa aagctattaa ggatattcca acaactcttg ctgcaatagc 721 cgtaattgtc gcctttttcc tcaaggtgga tgaactgctg ttgctactgc tggctgggct 781 aggggtaatg ttcttaaaaa atctgtggca gaggaaaaat agaacatcag cagcttggct 841 actaccgatt tctctagtta tgggacagac aggaggtgca gctgtcgcga ctcctgtgag 901 ttggctccgg gtgtttctgt tgtttttaaa aatcggctcg gtgttgtacg gtagtggcta 961 tgtattgtta gctttcttac aaaaagaact cgtagaacag aaccattggc tgacttctca 1021 acagctttta gatgcgatcg ctattggtca gttgacacca ggtccagtat ttaccacagc 1081 tacttttatt ggctacttac tggcaggtca tgcaggagcg atcgccggaa caattggtat 1141 ctttttgcct gcctttgttt tggtatggat tgttaaccct tgggttccta agctacgtca 1201 atcttcttgg gtgagtagtt ttttagatgg agtaaatgca gcttctttgg gactgatggc 1261 agtagttacc tataccttgg gacgcgctgc aattgtagat tggttgactg tggtattgac 1321 actcctgagt ttaattgctg tttttcgctt caaaatcaac tcagcttggt tagtccttgg 1381 gggtggattc gtaggatttg 
tcgcacgatt tttgaacggt gcgatcttgc agtagaaagc 1441 tgccttttgg gcgatcgcaa aataaaatat cggctcgctt tcggtctcac acactcactc 1501 taaaggatat ttccctgatt tgtacccagt aaaccggaaa cgaaaatctt taaaaccctt 1561 gctatgcatt gcatatgact tgtgtcaaag ggggaagttc agattagatg attagtggga 1621 tttttgagaa ttgcattggg tcgaataatg ttgagattag cctcaaatat tgggctgagt 1681 ttggctacca agttggtcga tccctcactc ttccactgtt aatatccaaa tcttaaaatt 1741 tagggattta tagtttttga actcgtcttc agatatgtct aaattttgca ttgcagttag 1801 ctgataactt ccgtctattt gtggagagat gcggctccta aaacctaggt gtgaaaaaaa 1861 gtctacaatc tgttgttgat tatcaaagcc tgttttgtgc acatcacgct gcgttgtgcc 1921 ttggactcct tgcgcgatat tctgcaattc tgagtgagtc tgtaaacgtc cttcatgctc 1981 agcgtttgtt agaaaatcag gagtgatcca cgcgccaccg aagcgggaaa gaacggaatg 2041 gatgttgcgt gctgccctct ccttttcttc cattgaaagg taatgataaa gtccttcgtg 2101 gatgacggct actggatgat ttggcttaaa gtgacggatg gctgattcaa tttcaggaaa 2161 cgaaagaatg tttacggcat caaagaataa attcttgcgc tcttttaggc cgtgattaga 2221 tataattctt gacaggattt gttgcttttc ttgcattagc tcaggaagat ctgtctccac 2281 ataaatatac tctgggtctt cagtcatagc tagcccacga aatgatagcc cactggctaa 2341 ctccaaaacc tgcgtaatgc cctctttctt gatcgcgctg acaatacttt tataacgtat 2401 ctcagcagca agagctgccc atttcaaaaa ttctggtgtg aggttcgttc cttgtgagaa 2461 attttttaaa acatcttctg catgtatcag tgtagcaaca tccctgctaa agggaatatc 2521 tgaaaattgc cggaaataag ccaccaactt ggcagttggg ctaatcttac tatgatcagg 2581 actagtcgtt ttcactgatt tactcctatt gttgttagat cagaggatat tgtctatggt 2641 aaagagattg tactctaaag actaacctta ttgtctttca taactagcga attggcttat 2701 aggaatccta tctgattctt gaacataatt ttaaagaaaa tccttatata ggaatccggt 2761 ttgatttggt gagaccagtg ctgcaggagg gaaaccctcc gtaggcatct ggcgaacccg 2821 aagggtgaac taacaaagta ggggggaggg aactctgaac agggaactct gaacagggaa 2881 gaagcaataa aggtgtacct agcttcgtca aaaatcaaat aggaatccta taaaacgggc 2941 tttgcagtct cagtatgttt tcaaaaatca aaccagagtc ctatatttat aaatagtgtg 3001 agagaaacag actcagaatt caggaactaa gctacaatat ttgttcttta attttattct 3061 aaaacttcct cccatttgca tagatcaaat 
agttttctgt aacctttaca tcagctttaa 3121 aaccaacttc cgtcagcttg ttttgcagtt catctggctg ataaaaaatt ttgacaatct 3181 gatattcttg accatcattt aacttacggc tttgatatat actgccgtca tcattcagga 3241 tatggttatt agctgtggat gtaggttcaa agcgcgagtc aacaataaat acttgtccac 3301 caacacgaac agatttatag acttttgtta aaaatgattt aagtaatttt ggtggaacat 3361 gagataacca gaaggagaaa aatactaaat catattcagt atcgggttcc caggcaaata 3421 aatcaatttg gcgatattcc acctttggtg aatttaactt gctgcgatta atttcaatga 3481 cttcttcaga agcgtcaatt gcagtaattt tcttaccaac atttaaaagt tcctgcgtcc 3541 agattcccgt tccactggct aactctaaga ttttatcgtt ttgtccaact ttttgtaaag 3601 cctgcttgac aacatcaact tcgttgaacc agcgctgatt tatttcctca ccgcgatcat 3661 aacgacgaag gcgatagaac cattcatcat attcgtttgc tctagcacga tagtaggcta 3721 tttgttcttg aatgatttta tctatcattg ccagcttcct catccgcttg aatcaaagtc 3781 cttaaagtta cttttctttg acgctgattg cagcttacct cactttttca cgttgctctt 3841 gaagctttaa cttgtgaatt attcatttta tatgtaaact gataagttta ccgtaatata 3901 atctcaataa ttgcatagct tgctctaaca agcaatagtt tcctgaatgt tgaggcgatg 3961 gcgcaaagcg cccacctgaa ggctttgcgc aggcacttgc gttcggtgat cccagggaca 4021 gtttgttgtg ttgtgagagc gtcaataatt tgtcacgaat agttctatct ttaattaacc 4081 tgatagcaaa cacattagag aacgcatatg ccacatgtct ttaccagccc agagggttgc 4141 gcttttgcca ttagggctag aatccagcaa ctcttggagt tctttcgcaa actcttcgtt 4201 ggagttgtgg acaaggatat ggactggttc agcacccatt tccagcatga agcccaccac 4261 actgtacacg agatctggat cgccgtagat agcgaagcgc ttgccgtgaa cccatgcatg 4321 ggagtcagtc atcgcatcga ctgcccgacc gcgttcaatt tccagttctt cgggaatggg 4381 tttaccagtc agttcactga gtttcatcaa gaactcatca gtacccttaa taccccaagg 4441 acgagaaacg gtaacctctt gcttccattc tttggcgatg taatcgcgag tcttgggagt 4501 agagtgtgct tgtagagcaa ctgtagcttt agcattaatt gaatctgctg cttcttccag 4561 cggagtaccg cctgggtaca tatcaaactc acctgtgttg ggtgaatcca gatagtcgct 4621 gttatctgcc agtatcgtgt aatcgatacc catcagatta cacatccgct tgatttcccg 4681 gttgttacca acataggtgt caaaaccagg gatgaagttg attttgccat tgctggtttc 4741 tttcttttta cctgcagtta ggttagaaag aatacccttc 
atcatgttgt cgtagcctgt 4801 gatgtgagaa ccaacaaagc taggagtgtg agcaaaaggt actggaaaat cttgaggaac 4861 tgaacctgca ttcttagcgt tgcctatgaa agcctgcaag tcatcaccaa tgacctctgc 4921 catacaggtg gtgcagacag caatcatctt gggcttgtag agttggtagg agtttgccaa 4981 gccgtcaatc atgttttgca gtccaccaaa taccgctgcg tcttctgtca tggaagaaga 5041 tacgccggaa aatggttctt tgtagtgacg ggttaagtgg gtgcggaagt aagcaacgca 5101 accttgggaa ccttgaacaa aaggtagagt gccttcaaaa ccaacagcag caaagattgc 5161 gcccaatggt tggcaacctt tagcagggtt aacggtcaat gcttcacggg cgaagttctt 5221 ttcacgatac tcccaactct tcgtccattc tgcaaccctt tttacttctt cagggtcgtg 5281 tccgttttca aactctttct tgttttgaaa taactgttgg tactctggct gatgaaacag 5341 ttctacgtgg tcttgaattt tctccggatt ctgaggcatt tctctatctc caagcgagct 5401 agtggttggt taattgtggt tttctcgttg tctcgttccc aggctcagcc tgggaatgct 5461 tagcttgagg ctccgcctca ccaatcttgt ttgctggagg cagaacctcc ccgaatgcgt 5521 taccaggctc agcctggtaa cgaggtattt cttaggcgac agccttagct tcagccttag 5581 ctttagcttt ctgactccaa ggagcgccga ttaatcccca ggtggggctg ttgagtgcta 5641 aatccatgtc gcgagcgaag atagcgaagc catcataacc gtgataagga ccggagtaat 5701 cctttttgtg gttcaagagc tttaaatgcc taatgtttat accaagcaat gtaataatta 5761 gacattgttc gttagtaaca catttatttt taggctaatt ttaggctaaa aaaaccgcaa 5821 tggaaaaaca gagtaaagac aagtatcagc aagcctttga agacttggag ccagtttcat 5881 ctacagatgg aagtttcctt ggctccagtc agcaagccca acagcaaaga gagcatatga 5941 gaacaaaagt actacaagaa ttagagaaag ttaatcaacg tttgaaatct gcaaagacaa 6001 aagtgacaat tagggaatca aatggaagtt tgcagttacg tgctacgtta ccaattaaac 6061 cgggagataa agacacaaat ggcactggaa gaaaacaata caatattagc ttgaatattc 6121 ctgctaattt ggatggatta aaaacggcgg aggaagaatc ttatgaatta ggaaagttaa 6181 ttgctcggaa aacctttgaa tggaatgata aatatttagg taatgaagca attaaaaaag 6241 actttcaaac gataggagag ttacttgaac aatttgaaaa tgagtatttc aaaaatcata 6301 aacgcaccac aaaaagcgaa catacttttt tttattactt ttctcgaaca aagcgattca 6361 ccaaccccaa agatttggct agtccagaaa atctgataaa ttcaattgaa aaaatcgata 6421 aagaatgggc taaatataac gcgacaagag ctatatctgc attttgcata 
acattcaaaa 6481 tcgaaattga tttatctcga tattccaaaa tgccggagaa taattcccgg aacataccaa 6541 ccgatgtgga aatatccgca gggattatca agtttgcaga ttacctaaac aacagaggta 6601 atcaagttaa ccaagatgtt aaagatagtt ggcagctttg gcgctggact tatggaatgt 6661 tagcagtttt tggtttacga ccacgagagc tttttatcaa tcctgatatc gattggtggt 6721 taagcgaaga gaacgcagac atgacatgga aagttcataa agattgcaaa actggagaaa 6781 gacaagcatt accattacat aaacaatgga ttgaagactt tgatttaaga aatcctaaat 6841 atttagagat gctggcaaca gcaattagta aaaaagataa tactaatcat gcagagataa 6901 cagcattaac acaacgagtg agttggtggt tccggaaaat cggattggat tttaagccgt 6961 atgatttacg tcacgcttgg gcaattcggg cgcatattct aggaattcca atcaaagcag 7021 cggcggataa tttgggacat agtgtgcaag ttcacaccca aacttatcag cgttggtttt 7081 cgcttgatat gcggaagtta gcgattaatc aggctttgag taagaggaat gaaattgagt 7141 taattaagga ggagaataca aaattgagga tggagaatga aaagttgaag ctggagattg 7201 aaaagttgaa gatggagttg gtttataagc gtagttgaat tgtgttcgtt aattaaggac 7261 tttcagtaaa aaatatgttg ccatagttcc attcttttcg tttttctcat gaaccttaag 7321 ttcatgtata gggctgtact tagaagtaaa tctgctaact cgccctcttg taatccgtac 7381 gcaaatacgc tagactagct gtttaattcg catatactaa taaatatagt aataataaga 7441 gggaataaat ggctggcagc atctctacag ttgatctgtt ttgtggtgct ggaggactga 7501 cttatgggtt tgagcaagga ggtcttccag ttagagctgg atacgatatt gatccagcgt 7561 gtcaatttcc atatgaacac aacaccaaag cagaattcat actagaagat gtggagcgta 7621 ttaacggttc tgatttagca aagcactttt ctggcagtag tgttaaggtc ttggcaggtt 7681 gtgctccttg tcaacctttt tcaagctact caagacgtta cacggataaa gaatcaagat 7741 ggaagcttct gcaagatttc gctcgtcttg tacaggagtg tgagcctgaa attgtttcga 7801 tggaaaacgt acttcagcta aagtaccact cagtttttta tgaatttatc cagcaattag 7861 aggatttaag ctattcattt gaaacttacg aagttaattg ttcagattat ggaattcctc 7921 aaactcgaaa acgtttagtt cttctcgcct cgaagtttgg taaaattgcc ttgattaaac 7981 ccacgcataa tacagaaaaa tacggaacag tacgtaaaac gatcggacat ctggaacctc 8041 tttttgcggg tcaagcatct aaaactgata gacttcatca atgcagtaaa ttatctcctc 8101 taaatcttca gcgtatccgt gcttccaaac ctggtggaac ttggcgtgat tggtctaaag 
8161 acttaatagc taaatgtcat accaaaatca gtggtaaaac ttatccagga gtgtacggtc 8221 ggatggaatg ggatcgacct agcccaacca tcaccacaca gtgctttggt ttcggtaatg 8281 gacgctttgg acatcctgaa caagatcggg ccatatctct tagagaagca gcattattac 8341 aaacttttcc agcagactat gaatttgtag cccctaatga acctgttgtg tttgagcgtg 8401 taggaagatt aattggaaac gctgttcctg tcaaactagg tcaagttatt gctcaaagca 8461 ttctgcaaca tatccatgaa gttatttaag gagtgcctac aagagaagaa tccaaatatt 8521 ctttcgcatt caaatacttt tcaatatttt tcaatatctg tcgaatatat tctattacct 8581 cttccttaac ttttagaagg tctccgatgc ttatatcttt accaacttca gagaatgatt 8641 tgttcccatg agctagatcg ttacggttcc gcataataat atacaagttc tcaccatgct 8701 ttgttttaga gtaatcggta tcacaggaaa aaccgtattt tctagctgtt tttgtaattt 8761 catctcgatc aatgtttcct gagaataatt ctctactttt gaatccagca gtaataatat 8821 caagggaaat gtctgttatc ctagtatgaa tatcctcagg agaacggttt ttgaagtttt 8881 gaagtacaac ttttctaatt tcaattctaa cagaattaaa cgaaactttc ttacctttta 8941 actcatcaaa aatagcttca atggcattcc tcatacttga ttctataaga ttatagagga 9001 gtagaaaacc gttggctttt agggtcttag ccaactcagg atcaatagct ctaatctttt 9061 gcccaccacc atcctctgat acagccaact taattgtctc attaattagt ccttctagga 9121 ataaaaaata ttcattaatt tcttggacac gagcattaaa atctaacaga actgtttgca 9181 tacctaattt agaagttgat cacgcacata ctcaatacgc cgaatgactt tgggtctaga 9241 gttgcttgca tcagaggttg tgtactcctt gaactctggt gaatcgagcc aatctattga 9301 gcttggttcc aaatcgcttt tctctcgaag tgcaagcgcg acacctacag agatagcttc 9361 aaagcgaatt ctaggtgttc taacatgacc ttttgcttta gtaaatccgt tagggaaata 9421 tttttcaaca aaatccaaca ttgtatgaaa tgtattgcga aaagcatcct tatcaatttt 9481 ggaattatta tgtgcttcta gatattcatt taaaaataca ttaacttgtc tttcaaaatt 9541 tttgtaatta tctaagtacg caaagaagcg tagtacaaat tcttcgggtt ctcggctgcg 9601 acgcgatgct tctgataaag ggcatagttt tacaaacttt ggttcttttg caagttcttc 9661 caataaatca acaaatggac caggagatat tcctctccgc ttctccatat cattaagtgc 9721 aatactgcct gtgtttatac gctcaaacaa atctctacgt acttcttcat cagctttttc 9781 agtaagcaca atcatgcgaa tagtagcgcg attaaagcgt ctctgtcgag caagcggtaa 9841 
atcactaaat cgaaaaccat taagccttgt tagtttcttt agttcgcata attttaattc 9901 attattaagg aacctatcta aagcacgaat acgttgagtt ccatctacaa tctccaatcg 9961 acctaaatca tctttttccg gtcgtagatc agcaacgaaa atatatggaa taggtaatcc 10021 caaaaaaata gattcgacaa acttcgattg gcgagaatct tcccaaataa gttctctttg 10081 gtaatcgggt atatataact cactaatgtc ttcctctaag tcctctctat atttctgaac 10141 aaggacttct actggatact cttttgtatc gtagtcaact attttttgtt tttcacggat 10201 ttcttcttct gctgcttctc tttgctcatc agtaatctgt ggatctggtg gtatgtttgg 10261 ggctttcggc attttgatct ccagttctgg cagaatgcgt atattacatg tacaggatga 10321 tgaaacctcg atcagtatat ctacgatttt aacggtttac tgataaattg acctagggca 10381 aggagttcag cagtttgagt tttgtagttg aataaggaga agtaatgttg tgatgttgca 10441 taaagttaaa aaagagtgtc caatctgctt taagttctat gggaagagcg atcgcccaca 10501 actcaaacta ggtgatcgct ataattgtcc tttagatttt aggtcgctca aagcataact 10561 tactcaagca accaaccaaa caaaccttct acagtaaggt tgaaatcttt agcaaattct 10621 ggcatgggca aacgtgttcc tggttcatcg taaaaagtgg ggctttgatc tggtgggtaa 10681 acaaacacag attgttcttc tggatcaatg agccatccca tctgagttcc atgcttgaga 10741 caatgcagaa tatttttggt aactttagtt tgactttgat caggagatag gatttcgatt 10801 gtccaatccg gggcaattga gaagatgtta gcaaccccac cattttcctt gcgtggaatt 10861 cttccccata aaaacactga aatgtcaggt acaattgagc gtccaccata gacgctcgaa 10921 gagcggcttc ccgcagggta ggtacagcga agttcacaga aagctcgcgc tatctgtttg 10981 gattttacga ccaagttgat ggcaggtgct aactcagttt gaatgacgct atgttctcct 11041 tgtggcattg gtttttggat aatacgccca tcaatatatt cactggtagg ttccgtctcc 11101 ggcagcttta agaattcttc taaagttacg agtttagcag ggatttgtac catttaaggt 11161 ttcctgaaga ggggtaagtt gcactgcttt tatttgttat tctacgagat ggatcaatga 11221 gttagcacac gcgaacaaat agaacgaacc ctaccaggga tttgatacca ttaacaaaat 11281 aggcgatagc gaagcgctgc tgctttcagc agatcgtgca agggttcggc tctttctggg 11341 agcatcgccc tttgggcagc tccataggag cgtcaccttc gatgattgca atcgtcagca 11401 aatttgcaaa aacactatgg caaagcgtaa aaaaagcaat cttcagtgga ttaaagaaac 11461 gcttgaacta aaacccgatc atcattggga atctccacca ggctacaaaa 
tttttgtagc 11521 agataggggg gctgttcgct tcaatgttcc ccaaaattgg gtttttgagc cacaagaaaa 11581 atcgttcaag ttcctcgata gaaaatcccc caacgatgat nnnnnnnnnn gatgattgct 11641 gtttggaagt atctttcaat cgcctaccac ccaatgactg gagccagttt ccgttaaaat 11701 ccaccttgaa gaaagtgata aaagacgaca gtcgtaacgt cattgaatcg ggagaaatct 11761 ttaccatcaa gcgccaaacc gctaggattg tgtggacaga aattaagttt atcgacaccc 11821 aagcagagcc acgcgaagct ttttcgcgga cttgcattgg tttggggtcg aatgttcagt 11881 gcttgattac atttgattat tgggcagatc aagcagagca gttaacaccc gtttgggatg 11941 aagttatgcg tagtctcaca ctagggctat atatccgtga tccaatgact ggtgtagctt 12001 tcccagactg aaccacaaaa ttaaaaagtc aaaagataaa agtaaaaaga taaatgatta 12061 aaatacaaga ttagcgatgg aaactgagtt agatggatga cagaattatt aagacatcag 12121 gggttttcac atgcatgtga ttactcgtaa acgactcaat gaatttgcca aactccatcc 12181 agatacaacc aacgctctgg ctcagtggta tcaattagtg aagcaaaatc aatttgcctc 12241 atttgtggaa ctccgtgaaa tatttccact tactcagatc aagtcggtaa attgactgtg 12301 tttaacattg gcggcaacaa agttcgactc attgccgcaa ttcactacaa cggccaaaaa 12361 gtctatatcc gcgctgtgtt aacgcattca gaatacgacg aaggaaagtg gaaagaataa 12421 tgcaaagttt caaccttgat caaaccatca ctgcttggtc gtccattgct gagaacgtct 12481 ttgttcctca caccgaagag gaatatgatc gcttggtcga gatgcttgat cgccttatcg 12541 atcaagttgg tgaagatgaa agccatcccc tcgcgtctct aatggaagtc attggtgtct 12601 taattgaaaa ctacgaaact caacacattc ctgagctaga ggatatcgct tgaccaaata 12661 agtagttttg atgttcgagt caggagaatt ctttaatggc gatagcatct atcaaccgtt 12721 ttgaaccatc cacactaaaa ttcctagaat ttgttctaca ggtgggaggg gataatccat 12781 caaatcgatg gtgacagtat tatcttccag cttcaaaaat cgccattgtg taccattact 12841 aatcgaacca tatattacgg aaattgggcg attttttgca gcgttaaatc tttgtgcggc 12901 aaccatttct gcaatacatt gtccaaaccc agttcttaaa tcagcttttt ttgcttctac 12961 aactataatt gcaggtgctt caatttctaa cacttcaggt gaacgactta aaaggaaatc 13021 acacatccca ttaagtccaa ctgcttcgtc aactgtaaaa tcttctccgg agaataagct 13081 aatttgccga tttaatatcc gtctcacttc taacagtact gggtaaataa ttccttcgga 13141 acgcgctttt tcactcgcag aagaaacaag 
tggcagactt tcttcaagat atgccctcag 13201 tgtagcagat gcttcgatgg gttcagtggg aggaataaat cgcccaccct cctgtgtttt 13261 taacccgaac gtttctttaa ctttactgat atttgtaaac tgactataag gcataattgt 13321 taattcttcc acgggtctac acttatatta aacacgcaga gtttgtttgt gcagttgagg 13381 cgatctgcgc gcagcgcagc agcgaggcta tgctctatat ttaatctaag aaaatgttat 13441 tttaaatagt cttttgtttt acaattttaa tgtaagcaaa attcaagcaa atctttaaaa 13501 ccagactcaa taagagttat attgtctgaa gcatcaaaaa aagattgttc ctagataacc 13561 ctacagctcg catttaggct aaaaatagat tgttttttct attaacgcta aaaaaccttg 13621 ctttatctga cttttatgcg ccattgctaa gtccggagta atcccaagag tgcatttgac 13681 ggaaaggaag acccatcttt tggaagacgt acttctcttt cacaccagaa gcgacgaggt 13741 caggcttgag tgctttgata aactcctcga attcgtaagc agtaacgtca tcgaaaacga 13801 tggtaccgtt ttcaatgtag tcagtggtac gtttgtagtc gtcgttatga gcaaactcat 13861 aacctgtacc aatcattctc attcccaaat cttggaaagc gggaacgacg tgacgaggac 13921 gcaaaccacc aaccatcatg gcaacagtct tgccttccaa gcctggacga tacttggcaa 13981 tgatttcatc catcactggc tgatacttcg cgatgatctt ctcagcgttt tcttggatct 14041 tctcgtcaaa cttggaagca atttcccgta aggatgcagc aatcttggta ggaccaaaga 14101 agttgtattc catccagggt ataccgtaag cttcttccat gtgacggctg atgtagttca 14161 tcgaccggta gcagtgaatc aggttcatct tcacgtttgg tgtctgcatc atctcgttga 14221 tggtgccatc acctgaccac tgggcgacta cgcgcaagcc gagttcttct aacaggatgc 14281 ggctagccca agcatcacca ccgatgttgt agtcaccaat gattgcgaca tcgtaaggag 14341 tagactcgaa tttcagtgtg ccttcttttc tctctttgtc ggatctggta aacacccagt 14401 cacgaaccat gtcgttcgca atgtggtgac cgagggattg agaaacaccc cggaagcctt 14461 cgcaacgtac gggtacaaca ggcttgtcaa tctctttgga tgattttctg gcgacggctt 14521 cgatgtcatc cccaatcaga ccgacaggac attcagattg aattgagaca ccacggttga 14581 gggggaagag ttcgtcgagt tcttcgatga gttttaagag ttttttgtca ccgccgaaga 14641 cgatatctct ttcttggaag tcggaagtga agtgcatggt gccaaaggtg tcaacacctg 14701 tggtaccgat gtagtagtta cgacgaccag accaagacca gtaaccgcaa ccaacaggac 14761 cgtggctgat gtggatcatg tccttaattg gaccccagac cacaccttta gaaccagcgt 14821 aagcacaacc 
acgagcagtc atcacaccag gaagagattt gatgttggac ttaacgccgc 14881 agtcggactt gccttcttcg tatacgttta tatgtttttc ccgctttttc ttcgctttct 14941 cggggtaagc gtctagaact tctttaataa gttcttttct ttgttctaca agctcttttg 15001 gatcttgctc agaagatttt ttgtcatctg tatatgtcat agtgttcctt gctggtgaat 15061 tactggtgag aaggaaggat gaagagtgaa gagtgaacca tttatatttc attctttatt 15121 cttctgtctt caaaggtggg ctgcttgccc accctagggc attgcttact tggcagtagc 15181 ttctgctggt ttgccaataa tttctgcgtg cttggtatcg tcgtcgagaa taccgaactc 15241 tacgagtagg gcttctaact cgtccatttc caaaggtgta ggaatggtga gcttagtgtt 15301 ttcgacgatc ttcttagcta atgcgcggta ttcgttacct tggttgctgt caggtgcgta 15361 ctcgttgact gtcatccggc gcaattctgc gtgttgaacg atgttgtcac gaggtacgaa 15421 gtgaatcatt tgggtgttca accgttctgc cagagtttcg atgagttcga tttctcggtc 15481 aactttacgg ctgttacaaa tcagaccagc caagcgcaca ccaccagagt gagcgtattt 15541 gagaatacca cgagcgatgt tgttagcagc atacatcgcc atcatttcac cagaagtcac 15601 gatgtagatt tcttgtgctt taccttcacg aataggcata gcgaaaccac cgcagacaac 15661 gtcacccaac acgtcgtagc taacga // LOCUS NODE_2142_length_15664_cov_5.715613 15664 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15664) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15664) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013).
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15664 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..130) /locus_tag="DP116_18385" CDS complement(<1..130) /locus_tag="DP116_18385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome B6" /protein_id="PRJNA477356:DP116_18385" /translation="MEIGQKVKVVRLRDRVSPPIVKRLGQVGIIQGYKMTDSSGVGI" gene 310..1038 /locus_tag="DP116_18390" CDS 310..1038 /locus_tag="DP116_18390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_18390" /translation="MFVTAQQLEQQMPDASRLLSDEPEMESSLHYMQLLLLVSCLEWL WRDRDDFFIGANLTIYFSRQQLRNRDFRGPDFFLVKDTEKRPRNSWVLWEEDGRYPDL IIELLSESTGKVDRTLKKDLYQNRFRTPEYFWFSPENLECVGFKLVGNEYQEIAPDSR GWRWSQVLGLYLGVNAGKLRYFTSEGDLVLTPEETARVTHQQASEAQQRASQAELLLE GERERSQLLAQKLRSFGIEPESLI" gene 1071..1856 /locus_tag="DP116_18395" CDS 1071..1856 /locus_tag="DP116_18395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319084.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18395" /translation="MYQTDPPRPPQEVLPTMYDLPSELVAESGLPDEFHIFQPRLLSE TCQPSNYPPEEILIATDLNLYYDPRHPLWYKRPDWYMVLGVSRAQQQKDLRLSYVIWQ EGVTPFLVVELLSPGTEQEDLGQTLREVNKPATKWQVYEQILRIPYYVVFDRYSNQLR GFRLEGTRYQELSLPDQRLWLEEIQLGLGVWQGSYENTTGLWLRWYDINHQWIPTPTQ QIQRERQRAEQERQRAEQERQRAQRLAEYLRTQGIDPDNLPQM" gene 2176..3036 /locus_tag="DP116_18400" /pseudo CDS 2176..3036 /locus_tag="DP116_18400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860446.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="ABC transporter permease" assembly_gap 2953..2962 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 3159..4469 /locus_tag="DP116_18405" CDS 3159..4469 /locus_tag="DP116_18405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494939.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_18405" /translation="MGEEIAISLKNVSKCYKRYTRPVDRLKEILLPGKSRSQEFWALR DINLEISKGETLGIIGQNGSGKSTILQIIAKTLTPTTGEVQVDGRVSALLELGSGFNP EFTGRQNVFFNGRLLGLSQKEIEDKFDEIARFADIGDFIDQPVKTYSSGMFVRLAFAV AVSVNPDILIVDEALAVGDIYFQQKCFQQIRQLRDSGTTLLFVSHDPVAVYKLCDRAI LLESGQLVLDGKPRQVIDLYEAKLLKKNDVAPEKIEIQMSSNANGKKSQENTSDLASK EESDEIVINLPEVSIKFIKFFDEKDKEIESVISDQSMQLSIGLLFLKSFEDPHIGFKI RERTGEVVFETNTACMGEKVGRVNCETLLEIRFQFEIPIRPGEYTITVGVADGYLGEG LFRQTLLYAHNFAVLKVLRNQEAILWSGIVNLYPTISILTSNHV" gene 4462..6300 /locus_tag="DP116_18410" CDS 4462..6300 /locus_tag="DP116_18410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494940.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="PRJNA477356:DP116_18410" /translation="MFDLIIQKAKETEYDFRKNFYPNEQLEFGFNHWIDYYKLKWSIA HVLKPSSILEIGVSFGYCAAAFLHGHPAAHYVGLLDIESYKSLNGVIDWAKKITTQFA TEFMITDTQAMKRLPGDIYDLIHIDRQRNEDAFFHNFKLAIHQGRYVLLDGYLDTQQS FLAISCFLFRYANILDWYGVIPGYSGQLLIKVSEDYLRQVKEEQQSNINSSLEIRQTY TSEYYTQDCGGYDAYEKNQGKKLEDPRLKAIATIASLKQSGRVLDLGCGRGELSFYFA HRGFTVTAVDYSPNAIELAKKCFDGEDQLKEKVEFICHDVCHVPLSGKYDLVLASDVI EHLSFEEVDTLYQKMARHLQPDGLFVVHTFPNLWYYKYDYQRKRKIAASVGAYLPAEP RSRYELLMHINEQSPRILKKQLSKYFKNVLLWFGSPENPGGSLVRNFSIRELSAAPSL FAIASHKHINQEQLKNSLQMCPLPAIPSGQIKIIVMDSPREANVSSEFEIQLAIENNS DFILNSFGSYPVHIAYHWMNAQASQYIVFEGERTGLVPPLQRVQNTVLQSLLGGQRTK EIYTAKVKALPEKGDYILRVTLVQENIRWFDNVPTQLMKDIFITLL" gene 6371..7990 /locus_tag="DP116_18415" CDS 6371..7990 /locus_tag="DP116_18415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311304.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_18415" /translation="MMNANNPEINIDELMEKIRAEVAKRHSQSQSVETTEQSESTKTT YKLELPYIAPTFNYNFIHLETLLRNAESRAIARTKWPDNLSKFPYNLSKPIQVIALKI LNFLFKDQREVNFNVIRALKESVALNRQLIEQIKDLRAQIECLGAVNTRLQGMEERLS AVDSYLEVMKESLGVVCHRVPEINEDLNHFSSWIGVIQERLDTVNSQEHPINEHLKTV DSRIQGLNEHFGRVDSRIQGIDEHLGRVDSRIQGIDEHLGTVNGNVKNLHEQHLRNDS FVKNDLMQQKRLITMFLEEVRQRSPEPINKEHLETFVKEEQHFLDAFYVAFENQFRGT REDIHNRLKVYLPLLEEAKVGTPDSFILDVGCGRGEWLELLRESGYTAKGIDINRVML EQCRTRGLDVIESDVLAYLQSLPDASLGAVTGFHIIEHLPFETLMKLFAETVRVLKPE GLVIFETPNPDNILVGSSGFYTDPTHRNPLPSPTIKFIAESFGLCKVKIMNLHPSENQ KLDVDNSVLAERFNQYFYGSQDYSVIGYKHG" gene 7983..9008 /locus_tag="DP116_18420" CDS 7983..9008 /locus_tag="DP116_18420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311305.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18420" /translation="MGSYRIAVVVPRMASGEIGGAERFHEGLANSLNSLDTHADIVKV VIDESNFETIEESYLRCYDLDVSAYDAVISTKSPTYLVRHPNHVCYLQHTIRVFYDMF DREFPYADETLKKQRELIHKLDTGSLRSPRTRKVFSQGHEIRNRLLKWNGIDSEVLYP GIVLNCTQPKNYEYIFMPGRLHRWKRVDLVIEAMRYVKYPVHLKISGTGEDEQQLRSL AGTEKRIEFLGRVSDEELINLYANALVVPFVPIQEDYGYVTLEAFAHAKPVITCEDSG EPLQFVKNSINGFVVPPQAEEIAKAINDLFENPEQAKIMGNRGKLDTSYITWSNVSQT LLNFLRG" gene 9020..11071 /locus_tag="DP116_18425" CDS 9020..11071 /locus_tag="DP116_18425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860442.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18425" /translation="MTQHSSFQVTVLDMQPIDPPVGGGRLRLLGLYHGIGENLPTTYI GTYDWEGEKYRKHRLSNTLEEIDIPLSEKHFSVCAEWRARVGGKTIIDSCFNLLAHHS PEFVETALSKVAESDIVIFSHPWVYPLVKDKLRVRPQLVVYDSHNVEGFLRTSLLDDG AFGTEIAKNVVSIEYELCQNSDLILACSHEDRELFHRLYNLPFSKIAVVPNGVFTDKI YKSDSLKQAARKKLGVGNSPLAIFLGSSYPPNVEAASFIIEKLAPALPHVKFAICGGV GGNFNQREIIERNIHNVIITGFLQEEEKLTYLAAADLALNPMFSGSGTNIKMFDYMAA SLPVISTPIGVRGIFQGSEPSFLICTQEKFVNSIESLLKDKKSAEAYGAAARDVVERK YSWQLISKNLGLLLYRNRLKLDKQRPFFSVIIPTYERHSKLKELLDCLQKQVFKNFEI IIVDQSKTSCKPVQEYSELDILYIHTDIKGAVTARNTAAFYARGEVLAFTDDDCLPQL DWLNNTIKYFENKYVVGVEGLIVSDKLEDSNYRPVTNVGFEGIGFMTANLLLRREVFM AVDGFDECFENPHFREDTDLGWRICHYGKIPFAHDVCVFHPPHLRTDIRESHEQRNRF FEKDALLMKKHPERYRELFLREAQYNRTPGFCENLLRGAIKYNVEIDNFYLCYLKEGD Q" gene 11068..12003 /locus_tag="DP116_18430" CDS 11068..12003 /locus_tag="DP116_18430" /inference="COORDINATES: protein motif:HMM:PF13489.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18430" /translation="MIKIKTFSDALKRIWKSQDKWSSSVEIKQEEIQKEEIQSEISFA QQILTNSNMSSLLSTDIPPIIRTHLDQAQKLLNEVEKLVASNSHGDYFKSSTPRYLHY LAAAMTLPSQSKILDVGSAPGHVGIGLHLLGMDVVGVNLNEAWRSTYSSPEWLEKLGV IEHDIEKADLPYTQNSFDAVYFTEVLEHIAIRNPLEVLSDLRRVLKPDGLMVLSTPNI CNISNIYALMNEVNIFWQPEIFYGGLDRHNREYTPKEVYNVVEKAGFTNIQMYGINSY CNWRYGTGDYAYKVVSALGDHHPLLRNTIILLAKK" gene 12085..12780 /locus_tag="DP116_18435" CDS 12085..12780 /locus_tag="DP116_18435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017304601.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_18435" /translation="MKLSVVIPCFNELGTIGQVIEAVKASPVKDCEIIIVDDCSTDGT RQLLKSRIESQVAQVIYHQKNLGKGAALRTGFAAVTGNIVIVQDADLEYDPQEYPIMI QPILENKADVVFGSRFQSGRPHRVVYYWHRVGNGFLTMLSNMLTNINLTDMETCYKAF RREVIQAIQIQENRFGFEPEITAKVAKMECRIYEVGISYYGRTYKEGKKIGWKDGFRA IWCILKYNLLTIS" gene complement(12763..13008) /locus_tag="DP116_18440" CDS complement(12763..13008) /locus_tag="DP116_18440" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18440" /translation="MDVNDYFSVDSRETLKKHREQGTLNRERLTGNRVEGGWCFLRLL LVVSKRASDGFPDSQATRVPGRFLAKRERLIFLTYRQ" gene 13011..15068 /locus_tag="DP116_18445" CDS 13011..15068 /locus_tag="DP116_18445" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18445" /translation="MKFFIRPQFLLYVVFCIVGVFNAFRPTIMSGFAYMQTDPGDTRL NHYFLEHLFQVIFNKNYTGELFSPAFFYPYKNVLTFSDNLFGSAPIYFILRAFFSLEL SYQLWMIVVCVLCFVSFAVLMRYYKVGHVPAAIGAFLFAFGMPRVVKIGHQQLLPQFF TPLAFLFLWNFLRSPRNKPLAYSLLLIYFQVLAGIYLGWFLMFSLAIFTAITCLLDKS VWQRLTIYFKQNYKPAILITAIWLLLMLGLLGPYIKAKGILGSPSYTQVDSMLPRLSS WFLPAPDSLWWSLLSENSKHLPMAHEHHIFLGFLTILLTVLSIYTLLYRKNILNDERT LLIKICLLVALTIFIITLHVSNSWSIWRIVYGIVPGASVIRGVTRIWTMFYFYILVAV ILCLDSILRTMLNQRLRMTAVSLLCIGCVLEQIVTNSPSFALAPLTKEVAQIQELMQK DCDVAYVTLKAEVPPWSSQLSAMWAGIKANIPVVNGYSGNVPPNYGRMEDSMSTPQLI NWLGEDSRGQLCIISQKSLKNDDKLVSMYSVKENLSSSGNWTSYHLQLPISKIFSQKI EVYEIPKTVKIASAIKVPVVVKNISNFLWSTKGKHYTSFSYRWLDSEGKLAVFEGDGD RIPLPFDLSPGESAAINAVIKTPTKPGQYSLILTMLQEHVAWFNDKQAESPKFEVSVT SKS" gene 15150..15488 /locus_tag="DP116_18450" CDS 15150..15488 /locus_tag="DP116_18450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="EamA-like transporter family protein" /protein_id="PRJNA477356:DP116_18450" /translation="MSVMASVGGQFFLKVGALKLGSLNPGNTIGQILSIATTPELVVG LSCYGLGAIAYILLLTKVNLSIAGPSVSLVYVFSVLMGYFIFREPIPMMRLIGLSFIV SGVILVIWQK" gene 15509..>15664 /locus_tag="DP116_18455" CDS 15509..>15664 /locus_tag="DP116_18455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_18455" /translation="MHSEAEISSFNTSIAFGVNTSGYVNSEFGLGEGVRSTLRALEAV NIPFVINN" BASE COUNT 4823 a 2702 c 3288 g 4841 t 10 others ORIGIN 1 ctataccaac gccgctacta tcagtcatct tgtagccttg aatgataccg acttgtccta 61 gtcttttgac gataggggga gatacgcgat cgcgcaaacg aacgacttta actttttgtc 121 cgatttccat gccaacttac aacacaacaa accaagactc agtgtagccg aatctgggca 181 cgagagggag taggaaaaga tccagtacgc ccaggttccg gtttgtgcat gcaacctgtc 241 atggggatga ggggaataac taataacgac tgactaaaat gaaatcaata agtcagcagg 301 tgtaagcaga tgtttgtcac agcacaacag ctagaacaac agatgcccga tgcaagtcgg 361 ttgttaagtg atgagccaga gatggaaagt tctttacatt atatgcagct actgctgcta 421 gtgagttgtt tggagtggtt atggcgtgat agggatgatt tctttatcgg tgccaatctt 481 actatctatt ttagccgtca acaattgcga aatcgggatt ttagagggcc agactttttc 541 ttagtaaagg acactgagaa aagacctcgt aattcttggg tactttggga ggaagatggt 601 cgttatccag acttgattat tgaattactt tctgaaagca caggcaaagt tgaccggact 661 ttgaagaaag acttgtatca aaaccgattt cgcactccag aatatttttg gttttcccca 721 gaaaatttgg aatgtgtggg ttttaagtta gtagggaatg aatatcaaga aattgcacca 781 gattcacggg gatggcgctg gagtcaagtg ctaggtctgt atttaggtgt aaatgcgggt 841 aagctgcgat actttacgtc tgaaggagat ttagtgctga caccagaaga gacagccaga 901 gttacgcacc agcaggcatc tgaggctcaa caacgagctt ctcaggcaga attgctcttg 961 gaaggagaac gagagcgatc gcagttgtta gcacaaaagt tgcgatcgtt tggtatcgag 1021 ccagagagtt tgatttagcc cgagtacagt cgaagtcaga ggtagcactt atgtatcaaa 1081 ctgatccacc ccgtccacca caggaagttt taccaacgat gtatgatttg cctagtgaat 1141 tagtggcaga atccggattg ccagatgaat ttcatatttt tcaacctaga ctacttagcg 1201 aaacctgtca accttctaac tatccccctg aggaaatttt aattgctact gacttaaatc 1261 tctactacga tcctcgtcat ccattgtggt acaagcgacc tgattggtat atggttttag 1321 gggtatctcg tgcccaacaa caaaaagact tgcgcttgag ttatgtgatt tggcaagagg 1381 gagttacacc atttcttgtc gttgagttgc tttcacctgg tacagaacag gaagacttag 1441 gacaaaccct cagggaagtc aacaagcctg caaccaagtg gcaggtttat gagcaaattt 
1501 tacgtattcc ctattatgtt gtgtttgacc gatacagcaa tcaattgcga ggttttcgct 1561 tagaaggaac tcgttaccaa gaattgtctt tgcctgacca acgtttgtgg ttagaagaga 1621 tacaactagg tttaggagtt tggcaaggtt cctacgaaaa taccacaggt ctatggctgc 1681 gttggtatga tatcaatcat caatggatac caacacccac acagcaaatt caacgagaac 1741 gccaacgggc tgaacaagaa cgccaacggg ctgaacaaga acgccaacgg gcccagagac 1801 tggcggagta tttgcgtact cagggaattg atccggataa cttgcctcaa atgtgaaaaa 1861 ataacttgtt ggatattcag catcgagtgg taacaagcag ctatttgttt aacaacagtt 1921 ggaccgaaca gttccgatga ttaagaagcg atccacaaga gtttcactga gttctctata 1981 gctgaaatca gagataaaat ctgctgcaaa atgtccaatt aagcgtataa tttacaaggt 2041 aatttaagcg cacttttaca gaaaaaactt attcttccat atagaagtac accgggaact 2101 ttagtagctg ttgggcttca ttgagagaaa tttggtgtgt aggtatcaag tgaggagcag 2161 caattgttgc gaacaatgcg aggagttatc cgaaaggctg gtgcattaaa gcgtatcttg 2221 ccaattttag agccgtggtt agcaaagttt gatttgttga gagctttggt acgacgggat 2281 ttggaagcac gctacaaagg ttctgtttta ggaaatttat ggcctttatt aaatcagcta 2341 tcccagttac taatttatac ttatgtgttc tcgattgtcc taagggtaaa gctgagcctt 2401 aaaggtttac cagagaataa ttttacgttt ggtttatggc tatttgcagg gttactgcct 2461 tggattgctt ttagtggtgg cttgatgcag gcgtctgctt cggtgatagt acagccgaat 2521 ttagtcaaga aggtagtgtt tcccttgtct ttgttacctc tagtaccaat tttatcaaca 2581 tttgttgaaa gttcctttgg tttaatggcg ttgatttttt ttgtggcggt acaaagtcat 2641 actttacata cgactttggc gctattaccg ttggtttggt tcacccagtt attgctgaca 2701 gcaggattag gttatttgac ggcaggacta acggtatttc tgcgagatat accgcagact 2761 ttaggagtta ttttacagct ttggttgtat ctaacaccaa ttgtttatcc agcatcctcc 2821 ataccaccag aatttcggaa ttgggtattt tggttaaatc ccctgacggc tatttcggaa 2881 gtttatcgtg acttaatttt agtgggagag gtgaaacatt ggggcgagtg gggggttgct 2941 tctgtgactt ctnnnnnnnn nncttctgcg gttgtatttt gttgcggttt ttgggtgtat 3001 aagcggttgc gcccagcctt tgctgacgtg ttatagcaaa atgcgaccac ttaacgctat 3061 gattcaccag agtttgctaa gttttatagc ttaattgatg atttctgatt caaaatctta 3121 taagagtttc agatgtccta ggaatatgcg tgtgtgctat gggtgaggaa attgcaatat 3181 
ctctaaaaaa tgtctcaaaa tgttacaagc ggtatactcg tccggtagat aggctcaagg 3241 aaattttgct accaggaaag agtagatctc aagagttttg ggcattgcgg gatattaacc 3301 tagagatttc taagggagaa actttgggaa ttattggtca aaatggctct gggaaaagta 3361 caatactaca aattattgcc aaaacgctga cacctacaac aggggaagtc caggttgatg 3421 ggcgggtttc agcattgtta gagcttggta gtggctttaa tcctgagttt acagggcggc 3481 aaaatgtgtt ttttaatgga cggctattgg gattaagcca aaaggaaatt gaagacaagt 3541 ttgatgaaat tgctaggttt gcagatattg gagattttat tgaccaacct gtcaaaacat 3601 attctagtgg tatgtttgtc cggttagcgt ttgctgttgc agtgagtgta aatcctgaca 3661 ttctcattgt agatgaggcc ttagcagtag gtgatattta ttttcagcaa aaatgttttc 3721 agcaaatcag acaactgaga gattcaggaa caacactttt atttgtttcc catgatccgg 3781 tggctgtata taaactttgt gatcgagcta ttttgctgga atcagggcaa ttagttttgg 3841 atggtaagcc aagacaagtt attgatttgt atgaagctaa acttctgaaa aagaatgacg 3901 tagcaccgga aaaaattgaa attcaaatgt catctaatgc taatggtaaa aaatcacagg 3961 aaaacacaag tgatttagct agtaaagaag aatcagatga aatagttatt aatttaccag 4021 aagtcagtat aaagtttatt aagtttttcg atgaaaaaga caaagaaatt gaatctgtca 4081 ttagtgacca gagtatgcag ttatctatag gattgctgtt cttaaagtct tttgaagatc 4141 cacatattgg cttcaaaatt agggagagga ctggagaagt tgtctttgaa acaaacacag 4201 cctgtatggg agaaaaagta ggtagagtga attgtgaaac cttattggaa attcgttttc 4261 aatttgaaat acctatcaga ccaggcgaat atacaatcac agtaggtgta gctgatggtt 4321 accttggaga aggtttattt agacaaacat tactttatgc tcataatttt gctgttttaa 4381 aagtactaag aaatcaagaa gcgatacttt ggtcaggcat agtcaacctt tatcctacta 4441 tatctatttt gacaagcaat catgtttgac ttgataattc aaaaagcaaa agaaacagaa 4501 tatgatttta gaaaaaattt ctatccaaac gagcaactgg aatttggttt caatcattgg 4561 attgattact acaagttaaa gtggtcaatt gctcatgtct tgaaaccttc ttcaatccta 4621 gaaataggag tcagttttgg ttattgcgct gcagcgtttc tacatgggca tcctgctgct 4681 cactatgttg gtcttttaga tatagaatcc tataaaagtt tgaatggagt cattgattgg 4741 gcgaaaaaaa ttacgactca atttgctact gaatttatga ttactgatac acaagcaatg 4801 aaacgcttgc ccggtgatat ctacgatctc attcacattg atagacaacg gaatgaagat 4861 gctttcttcc 
ataatttcaa gcttgctatt caccaaggac gctatgtgct acttgatggt 4921 tacttggata cccagcaaag ttttctggct attagctgtt tcctatttcg ctatgcaaat 4981 atcttagatt ggtatggagt gataccaggt tattctggac aactgcttat aaaagtttct 5041 gaggattatt taaggcaagt taaggaagaa caacaatcta acattaactc tagtttagaa 5101 attcgtcaaa cttacacatc tgagtattac acccaagact gtggtggtta tgatgcttac 5161 gaaaaaaatc aagggaaaaa gttagaagac ccaaggctaa aagctattgc tactattgct 5221 agtttaaaac aatcaggacg tgtactcgac cttggatgtg gtcgtggtga actgagtttc 5281 tattttgctc atcgaggctt cacagtgaca gctgttgatt attcacccaa tgcgattgag 5341 ttggcaaaga aatgttttga tggtgaagac caactcaagg aaaaggtaga gttcatttgt 5401 catgatgttt gtcatgtgcc tttgtcaggt aagtatgatt tagtattagc atctgatgtt 5461 attgaacact tatcctttga ggaagtggat acgctttatc agaaaatggc acgacatctt 5521 cagccagatg gattatttgt tgtgcatacg tttccgaatc tttggtatta caagtacgac 5581 tatcaacgca aaaggaaaat agctgcttct gttggagctt atttaccagc agaaccacgt 5641 tccagatacg aactgctgat gcatattaat gaacaatctc cacggatttt gaaaaagcag 5701 ttaagtaaat actttaagaa tgtattgtta tggtttggtt ctccagaaaa tcctggtgga 5761 agtctagtga gaaatttttc aataagagaa ctatctgcag ccccaagcct gtttgcgatc 5821 gcctcacata aacacataaa tcaggaacag ctaaaaaata gtttacaaat gtgtccttta 5881 cccgcgattc ctagtggaca aatcaagata atagtgatgg attctccaag agaagcaaat 5941 gttagtagtg aatttgaaat tcagctagca atagaaaata atagtgactt tatccttaat 6001 agttttggtt cttaccctgt tcatatcgct tatcattgga tgaatgctca agcgagtcag 6061 tatattgttt ttgaaggaga gagaacaggt ttagttcctc cgttgcagag agttcaaaat 6121 actgttttgc aatcattatt ggggggtcaa agaacgaaag aaatatacac tgctaaagta 6181 aaggcacttc ctgaaaaagg cgactatatt ttaagagtaa cgctggtaca ggaaaatata 6241 cgctggtttg ataatgtacc aactcagttg atgaaagata ttttcatcac tttactctag 6301 agattgtata gtatgatatt tgacttaacg aataaccagt agagaagagt tttataggta 6361 gagtgtaaaa atgatgaatg ctaataatcc ggaaatcaat attgacgaat taatggagaa 6421 aatacgggct gaggtggcta agcgtcacag tcaatctcag tcagtagaaa cgacagagca 6481 gtcagagtca acgaagacaa catacaagtt agaattgccc tacatagcac caacatttaa 6541 ttataatttc atccacctag 
aaactttgtt aagaaatgcg gaatctagag cgatcgcccg 6601 tacaaaatgg ccagataatc ttagtaagtt tccttataac ttaagtaaac ctatacaagt 6661 catagcttta aaaatactaa attttctatt caaagaccaa cgggaagtca attttaatgt 6721 cattcgtgca ttaaaagaat ctgtcgctct taatcgacaa ctaatagaac aaataaaaga 6781 tttgagagcg caaatagaat gcttgggtgc tgtaaatact cgcctgcaag gaatggaaga 6841 gcgcctaagt gctgtggata gttacctgga agtcatgaaa gagagtttgg gtgttgtatg 6901 tcatcgtgtt ccagagataa atgaggactt gaatcatttc agtagttgga ttggtgtcat 6961 acaagagcgc ttggatactg ttaatagtca agagcatccc ataaatgaac atctaaagac 7021 tgttgattct cgtattcaag gattaaatga acattttggt cgtgtagatt ctcgtattca 7081 aggaatagat gaacatttgg gtcgtgtaga ttctcgtatt caaggaatag atgaacattt 7141 gggtactgtg aatggtaacg taaaaaactt gcacgagcaa catctgagaa acgacagttt 7201 cgttaaaaac gacttaatgc agcaaaagcg cttgataacc atgtttttgg aagaagtgcg 7261 ccagcgatcg ccagaaccca tcaacaaaga acatttggaa acttttgtca aggaagaaca 7321 acatttctta gatgccttct acgttgcttt tgaaaatcaa tttcggggta cccgtgaaga 7381 tattcataac aggttaaaag tttatctacc tttacttgag gaagccaagg ttggtacacc 7441 cgattctttt attctggatg tgggttgtgg acgtggcgaa tggctggaac tactgcggga 7501 gtctggctac acagcaaaag gtatagacat aaatagagtc atgctagaac agtgtcgcac 7561 aaggggacta gatgtgattg aatcagatgt tcttgcatat ttgcagtctt tacccgatgc 7621 aagtcttggt gcagtcactg gctttcatat tatcgaacat ttgccatttg agacgctgat 7681 gaagttgttt gctgaaacag ttagggttct taaacctgaa ggattagtca tttttgaaac 7741 cccaaaccca gataatatat tagttggcag tagtggcttt tacacagatc caactcatcg 7801 caatccctta ccaagcccca caatcaaatt tattgctgaa tcttttggtt tatgcaaagt 7861 caaaatcatg aaccttcatc cttcagaaaa tcagaaatta gatgtagata attctgttct 7921 agctgaacgc tttaatcaat atttttatgg ttctcaagac tattctgtga ttgggtacaa 7981 acatgggtag ctacagaatt gcagtcgtag tgccaagaat ggcaagtggt gaaatcggtg 8041 gggctgaacg ctttcatgaa ggactagcaa actcattgaa ctctttagat acccatgcag 8101 atattgtaaa agttgtcatt gatgaatcaa attttgaaac gatagaggag tcgtacctcc 8161 gttgctatga cttagatgtg tctgcatatg atgcagtcat ttcaacaaaa tcgcctacct 8221 atcttgtgcg gcatcctaat catgtatgct 
atctacagca tactattaga gttttctatg 8281 acatgttcga cagagaattt ccctatgccg atgaaacttt aaaaaaacag agagaactga 8341 tacacaaatt agatacgggg tctttaagat ctccaagaac tagaaaagtt ttttcccaag 8401 gacacgaaat ccgaaataga cttctcaaat ggaatggtat agacagtgaa gttttgtacc 8461 ctggcatagt tttgaattgt actcaaccaa aaaattatga gtatatattt atgccaggaa 8521 gactgcatcg ttggaaacgg gtagatttag tcattgaagc gatgcgctat gttaaatatc 8581 ccgttcatct taaaatttcg ggtacaggtg aagacgaaca acaattacgt agtttggctg 8641 gtactgaaaa acgtatagaa tttttaggtc gagtttccga tgaggaactc attaatttat 8701 atgcaaatgc tttagtcgta ccgtttgtac ccattcaaga agattatggc tatgtgactt 8761 tagaagcttt tgctcatgca aaaccagtga ttacctgtga agattctggt gaacccttgc 8821 aatttgtgaa aaacagtatc aacggttttg ttgttccgcc tcaagcagaa gaaattgcta 8881 aagccataaa cgacttgttt gaaaatccag aacaagcaaa aatcatgggt aacagaggca 8941 agcttgatac aagttatatt acctggtcta atgtatctca aacgcttttg aattttttac 9001 gagggtaaat ggaggtcacg tgactcagca ttcatcattc caagttactg tcttagatat 9061 gcagcctatc gatccgcctg ttggtggtgg aaggctgagg ctactgggtt tgtatcatgg 9121 aatcggtgaa aaccttccaa caacctacat aggaacctat gattgggaag gtgaaaagta 9181 tcgaaagcac cgtttgagca atactttaga agaaatagat attcctctga gtgaaaaaca 9241 cttttctgtt tgtgctgaat ggcgagcacg ggtgggtggt aaaactatca tcgattcttg 9301 ctttaatttg ctagctcacc attcacctga atttgtggaa acagctctta gtaaagtcgc 9361 tgaatcagat attgtcatct tttctcatcc ttgggtatat cctctggtta aggataagtt 9421 aagagtacgt ccacagcttg tggtttatga ttcacataat gtagaaggct ttttgagaac 9481 cagtttatta gatgatggtg cttttggcac tgaaattgcg aaaaatgtcg tcagtataga 9541 gtatgaactt tgtcaaaatt cagatttaat tcttgcctgt tcgcatgagg acagagaatt 9601 atttcatagg ctgtataatc ttccttttag caaaatagct gttgttccta atggcgtgtt 9661 tactgataaa atttataaga gtgacagtct caaacaagca gccagaaaaa aactaggcgt 9721 aggaaacagt ccactggcta tttttcttgg tagttcctat ccaccaaatg ttgaagccgc 9781 aagttttatt attgaaaagt tagctcctgc tttgcctcat gtcaaattcg ctatctgtgg 9841 aggcgtcggt ggtaacttca accaaagaga aattattgag agaaatattc ataacgttat 9901 tatcacaggt tttttacaag aagaagaaaa actgacttat 
ctcgcagcag ctgatcttgc 9961 tcttaatcca atgttctctg gttcaggcac gaatatcaaa atgtttgatt acatggcggc 10021 tagtctacct gttatttcaa cacctatcgg agtacgaggt atatttcaag gttcagaacc 10081 atcatttctg atttgtactc aagaaaagtt cgtcaacagc attgagtctt tgcttaagga 10141 taagaagtct gctgaagctt atggcgctgc tgctagagat gttgtagaaa gaaaatattc 10201 ttggcaatta atttccaaaa atttagggct tctgttgtac agaaatagat tgaagctgga 10261 caaacagcgc ccatttttta gtgtcattat tcccacatat gaaagacaca gcaagttaaa 10321 agaattgctt gattgtttac aaaaacaagt ttttaaaaat tttgagatta ttatagttga 10381 ccaaagtaag acttcatgca aaccggttca agaatactct gagctagata ttctttatat 10441 ccacactgac attaaaggag ctgttacagc gcgaaataca gccgcctttt atgcaagggg 10501 cgaagtttta gcttttactg acgatgattg tttaccacag cttgattggc taaataatac 10561 tataaagtat tttgagaata aatatgttgt tggtgtagaa gggctgattg tttcagataa 10621 actcgaagac agtaattacc gtcctgtaac caatgttgga tttgaaggaa ttggttttat 10681 gactgccaat ttactgctta gacgggaagt gtttatggcg gtggatggat ttgatgaatg 10741 ttttgagaat cctcactttc gtgaagacac cgatttgggt tggagaattt gccattatgg 10801 caaaattcca ttcgcacacg atgtatgtgt ctttcatcct cctcatcttc ggactgatat 10861 ccgtgaaagt catgaacaaa gaaatcgttt ttttgaaaaa gatgcattgt taatgaaaaa 10921 gcatccagaa cgttatcggg aacttttttt acgagaggct caatacaaca gaacaccagg 10981 attttgtgaa aatttattaa gaggagcaat aaaatacaac gtggaaattg ataattttta 11041 cttgtgttat ctaaaagagg gagaccaatg attaaaatta aaacttttag cgatgctcta 11101 aaaaggattt ggaaaagcca agataaatgg tcatcatcag tagagattaa gcaagaagaa 11161 atccagaaag aagaaatcca gagtgaaata tcttttgcac agcaaattct gacaaattct 11221 aacatgagtt ctctgctgtc aacagatata ccaccaataa ttcgtacaca tttggatcaa 11281 gcacaaaaat tattgaacga agtagaaaaa ttagttgcta gcaatagtca cggagattac 11341 tttaagtcat caacgcctag atacctccat tacttggctg ctgcaatgac tttgccaagt 11401 caatcaaaaa tattagatgt cggttctgca ccaggacatg ttggtatagg cttgcatttg 11461 cttggtatgg atgttgttgg tgtcaatttg aatgaagctt ggcgtagcac atattcttca 11521 cctgagtggt tggagaaatt aggcgtaata gaacatgata ttgagaaagc agatcttcct 11581 tatactcaga acagtttcga 
tgcagtctac tttactgaag tgttagaaca tatagcaatt 11641 agaaatccgc ttgaagttct atccgattta agaagggtat taaaaccaga tggattgatg 11701 gtgctatcta caccgaacat atgtaatatc tcaaatatat atgctttgat gaacgaggtt 11761 aatatttttt ggcagccgga aatattttat ggcggcttag ataggcataa tcgagagtac 11821 acacccaaag aagtttacaa tgttgtagaa aaagcaggtt ttacaaatat acaaatgtat 11881 ggaattaaca gttactgtaa ttggcgctat gggactggtg actatgctta taaagttgtt 11941 tctgcgcttg gtgatcatca tccattacta cgaaatacaa tcatactatt ggcaaaaaag 12001 taatttgtat tcatcaaaat agactttcat ctaggtggta agatcataga tttttctatg 12061 ttgaagagga gttaataagt gaatatgaaa ctatctgtag ttattccctg ctttaatgaa 12121 ctaggaacta tcggtcaagt tattgaagcc gttaaagcat ctccagttaa agactgcgaa 12181 attattatag ttgatgactg ctccacagac gggacgcgtc aactactcaa atctaggata 12241 gagtcacaag tcgctcaagt tatttatcat caaaaaaacc tgggtaaagg tgcagctttg 12301 cgtactggct ttgctgctgt tactggtaat attgtcattg ttcaagatgc tgatttggag 12361 tatgaccctc aagagtatcc aatcatgatt caacctattt tggaaaacaa agctgatgta 12421 gtcttcggct ctcgttttca aagtggtaga cctcatagag ttgtctatta ttggcacaga 12481 gtagggaatg gatttttaac aatgttatct aatatgttga caaatattaa tttgacagat 12541 atggaaacat gctacaaggc atttcgacgg gaagtcattc aagctattca gatacaagaa 12601 aatcgatttg gatttgaacc ggaaataact gcaaaagttg caaaaatgga atgtcgcatt 12661 tatgaagtag gtatatcata ttacggtcgt acttataagg aaggtaaaaa aataggttgg 12721 aaagatggat tcagagctat atggtgtatt cttaaataca atttattgac gataagttaa 12781 aaatatgagc ctctcacgct tcgcaagaaa ccttccggga acgcgggtcg cctggctgtc 12841 gggaaacccg tcactcgcgc gcttactcac cactagcaac agccttaaga aacaccatcc 12901 cccttctacc ctgttccctg ttaagcgttc cctgttaagc gttccctgtt ccctgtgttt 12961 cttgagagtt tctctggaat caactgaaaa ataatcatta acatccatgt atgaagtttt 13021 ttattagacc tcaatttctg ttgtatgtcg ttttttgtat tgtaggtgtg tttaacgcat 13081 ttcgtcctac aataatgtct ggatttgcat atatgcaaac agacccagga gatacacgat 13141 taaatcatta ctttttagaa cacctgtttc aggtaatttt taataagaac tacactggtg 13201 aattattctc accagctttt ttttatccat ataaaaatgt tctgactttt tcagataact 13261 
tgtttggctc agctcctatc tactttatac tgagggcatt tttctcacta gaattgtcat 13321 atcagttgtg gatgattgtg gtttgtgtac tatgctttgt tagctttgct gtattgatgc 13381 ggtattacaa agtaggtcat gtaccagctg caattggtgc tttccttttt gcgttcggaa 13441 tgccaagagt tgttaaaata ggtcatcaac agctacttcc tcagtttttc acaccactag 13501 ctttcttgtt tctatggaat ttcctgagat ccccaaggaa taaaccactc gcatattcac 13561 tactacttat ttattttcaa gtactagcag gcatttactt gggctggttt ttaatgttct 13621 ctttagctat ttttacagca ataacttgcc tattagataa aagtgtttgg cagcgtttaa 13681 ctatctattt taagcagaat tataaaccag ctattttgat aactgctatc tggctgttac 13741 taatgcttgg cttactagga ccttatataa aggctaaggg aatacttggt tctccctctt 13801 acacacaagt agactctatg ctaccaaggc tttcctcatg gtttttacca gcccctgata 13861 gtctttggtg gtctttgtta tcagagaatt ccaagcattt accaatggct catgaacacc 13921 atatatttct aggatttttg actatattac tgactgtact atccatatat actctactgt 13981 atcgcaaaaa tatattgaat gatgaaagaa ctctgctaat aaaaatttgc ttacttgtag 14041 ctctgactat ttttattatt accctccatg tgtcaaatag ttggtcaata tggagaattg 14101 tttatggaat tgtacctggc gcttcggtaa ttagaggagt cactcgtata tggacaatgt 14161 tttatttcta catcctagtt gcggtgatcc tttgtctaga ctctatactg cgtactatgc 14221 ttaatcagcg attgcgtatg acagcagtca gtctactttg tattgggtgt gtcttagaac 14281 aaattgtcac taactcacct agttttgcac ttgcaccttt aacaaaggag gttgcacaaa 14341 ttcaggaact aatgcaaaaa gattgtgatg ttgcatatgt gactctaaaa gctgaagtac 14401 caccttggtc ttcacagcta tcggctatgt gggctggtat taaagctaat atacctgttg 14461 tcaatggtta ttcagggaat gtacccccta actatggtcg catggaagac tcaatgagta 14521 cgcctcaact catcaattgg ctaggagaag acagtagagg acaactttgc ataatatccc 14581 agaagtctct gaaaaatgat gataaattag tctccatgta ctcggttaaa gaaaatttaa 14641 gttcttcagg taattggact tcttatcatc tacaactgcc tattagcaaa atcttctcac 14701 aaaaaataga agtatatgaa attcctaaaa ctgtaaaaat agcttcagca attaaagtcc 14761 cagttgttgt taaaaatatc agcaacttct tatggtctac taaaggtaag cattatacat 14821 ccttcagtta tcgttggcta gactctgaag gaaagttagc agttttcgag ggggatggcg 14881 atcgcatacc tcttcctttt gatttatctc ctggagagtc agcagcaatt 
aatgcagtta 14941 ttaaaactcc tacaaaacca ggacaatata gcttgatttt aacaatgctt caagagcatg 15001 ttgcttggtt taatgacaaa caagcagagt ctccaaaatt tgaagtatct gtcacttcaa 15061 aatcttaata agtaatgaaa gtggctaaaa acaaaagtca atagttgtga tgaaacttta 15121 tgaacataca agaatttttt ttactgataa tgtctgtcat ggctagtgta ggagggcaat 15181 tttttttaaa ggttggtgca ctgaaattag gaagcttaaa tccaggaaac acaattggtc 15241 aaattcttag cattgctaca acacctgaac tagtagttgg gctaagctgc tatggtctag 15301 gcgctatagc ttacattcta cttttaacta aagtaaatct gagcattgca ggtccatctg 15361 tgtccctcgt ttatgttttc tcagttttaa tgggttattt catatttaga gaacctattc 15421 ctatgatgcg tctaataggc ttgagcttca ttgtcagtgg agtgatatta gtgatttggc 15481 aaaagtgaat gaaataacag gaaaaaatat gcattcagaa gcggaaattt catctttcaa 15541 cacgagtatt gcttttggcg tcaatacctc aggttacgtt aacagcgaat ttggactggg 15601 tgaaggtgtc agatcaactc ttagagcact tgaagcggtt aatattcctt tcgttattaa 15661 taac // LOCUS NODE_2155_length_15593_cov_5.04788315593 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15593) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15593) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15593 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..233) /locus_tag="DP116_18460" CDS complement(<1..233) /locus_tag="DP116_18460" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18460" /translation="MFWKFVISALLLPCCLGLEVATASVQKGIQTQLNYEKLGTQVAQ IEDNSLTQADWETDTDTKLQSNVVGYKKGLRKQT" gene 861..1613 /locus_tag="DP116_18465" CDS 861..1613 /locus_tag="DP116_18465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653030.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phycobilisome rod-core linker polypeptide CpcG" /protein_id="PRJNA477356:DP116_18465" /translation="MAIPLLEYAPLSQNNRVAGYEVPGDEQPRIFSTDNLLSATDLDN LIEAAYRQIFFHAFESDRERFLESQLRSGQITVREFIRGLALSNTFTGSFYNLNSNYR FVEHCVQRILGRDVYSEREKIAWSIVVATKGRAGFINDLLNSDEYLENFGDTIVPYQR RRVLPSGASELPFNIKSPRYDEYYRAKLGFPQIIWQTIVRRYTPPDKQPKAGDPALFA SMAQSINPTGNPPQQISPYNIDYEKAVPYRRR" gene 2618..3289 /gene="rnc" /locus_tag="DP116_18470" CDS 2618..3289 /gene="rnc" /locus_tag="DP116_18470" /EC_number="3.1.26.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115586.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease III" /protein_id="PRJNA477356:DP116_18470" /translation="MIKQQSFQNISLLRRALTHRSYVHENPQEGEHNERLEFLGDALL TFLSGEYLYRRYPQKGEDELTRRRSALVDEKQLAKFAIEVGLNSRMLLGKGATLERGY QNPNLLSSAFEAVIAAYYIDNNYDIEAVRAVVEPLFDSVPESIVEFRSNVDSKNRFQE WVQRNITQIPPKYITVQVGGSSHAPEFIAKVFVGDKEYGEGKGRNKKDAEKAAAEDAL ARLKQ" gene complement(3308..4501) /locus_tag="DP116_18475" CDS complement(3308..4501) /locus_tag="DP116_18475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316850.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_18475" /translation="MKDFSQLLRDKTQKIIKQWVEAVRRDKKISSTNNLTRTAIKNHV DHVLLALATVLSQYQDDEVQPLVQASLHHGTLRAEQGFDAAEIAREYRLLRNTIFVNL EPEWLKASAQEVMRAVHLIDMVLDEAIAYCFQSYTQERLTELEQLQNQLTLNNQELTR LVRANQDNLSYLAHELKNPLTSIIGYSDLFLRLQRQKSEEKDSFTHLEHIDRVLRSGR QLLHLINDALEISRYDAGQMKLHSEPINVHELIRNVYEMLEPLASHKNLQIIINCNRA PNEVVTDALRLQQIVTNLVSNAIRYTESGTIKIKCKTLDIDKWSVTVSDTGIGIEPEN QVQIFEPYFRIGSASKSFLPGSTGLGLAIVSRLVKLLQGEISLVSQMGVGSTFTVTLP LKVEV" gene 5047..5898 /gene="ntrB" /locus_tag="DP116_18480" CDS 5047..5898 /gene="ntrB" /locus_tag="DP116_18480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316851.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrate ABC transporter, permease protein" /protein_id="PRJNA477356:DP116_18480" /translation="MILQLNVAAILAVASRTAWKRAKPVIVRDTVLLPLLGFLGVIVV WWIIALANHELMPTPPEALVANLDYILNPFFQRGPGNLGIGWLLIASIRRVLLGFALG ALVAIPVGFLIGMSRTAMMILNPIIQIFKPVSPLAWLPIALAIFNLADPSAIFVIFIT SLWPTIINTALGVSSVSKDYIDVARVLEMPRWRRITKIIWPASLPYIFTGLRISLGIA WLVIVAVEMLTGGIGIGFFVWDEWSRLNLNSVFLAVLVIGLTGLFLDYAIGRIQAFVT RRPITSN" gene 6229..7641 /locus_tag="DP116_18485" CDS 6229..7641 /locus_tag="DP116_18485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006632464.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="twin-arginine translocation pathway signal protein" /protein_id="PRJNA477356:DP116_18485" /translation="MSHNNTNNWTRRDFITGVGATAFATGLSSCAINANRAPKELSKA ALATEPVVDPKTLEKPNITVGYVPVNDCAPFAIAWEKGFFRKYGLNVTLSREASWGTS RDGIIFGRLDASPVVSGAVTNARTGAEGARHAPLCAAMTIHRHGNAMTMNKAMWESGL RPWRDYNGNLEEFGRDFRNYFEKLPSEKRVWAVVLSSAIYEYFIRYLAAAAGVAPDKE FRIIIIPPPQMVVNMRIGAMQGYMVAEPWNSRAISGNDGIGFTFAQGREIWQGHPDRL LGVMESFIKENPKTYRSLVKAMIEACRYCSDTKNREEVAQILTKKSFTGAKLKLTKPA IVGDYNYGGFDGQQRVWKAPETTIFFDKPANLVKAPNDHSTFLWQSQSIWLMTQSARW GQIKEIPKNAQELARKAWRTDLYREIAAEMGIECPKEDYKVEQAELFIDKKAFDPSDP VGYLKSFEIRANSPKSFFMS" gene 7752..8588 /locus_tag="DP116_18490" CDS 7752..8588 /locus_tag="DP116_18490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_18490" /translation="MKSTSFSKDAYDTTSPSRGFLEIENLHKSYPTPDGNQFVVLDNV NLTVGEDEYISVIGHSGCGKSTLLKIVAGLEKATSGLVRLDGKEIRKPGAERMMVFQH YSLLPWLTVRENIRLAVDEVLKDANRTDKISIVNEHLAMVNLTAAADKYPDEISGGMK QRVGIARALAIRPKMLLMDEPFGALDALTRGKLQRQVLDIWENHRQAVMMVTHDVDEA IYMSDRIVLMTNGPAANIGEILEVPFPHPRDRNAMRNSKEYYELRNYALNFLDRYFTQ DE" gene 8706..9377 /locus_tag="DP116_18495" CDS 8706..9377 /locus_tag="DP116_18495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017322573.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="universal stress protein" /protein_id="PRJNA477356:DP116_18495" /translation="MLVRLQNALGRDDLIEQMVLITVPEKPLSVQDQSAKSVNLIVGY NSSPNSHTALDIALLMAHQTRLATKAQVTVQVVYVIEEHQRSHRGDVLQREKFVSQRV TEQNPPHCSTSFSLVGFDTGVTTQLKMQDNAACSQEMLIDKFAQAECILRQASCLAAE WKSSFKAHLRFGCIAKELRKVVKSEAANLLLLGCNSVDHPIVQQLGSNFPCSVLGIPN FVPFG" gene complement(10189..10401) /locus_tag="DP116_18500" CDS complement(10189..10401) /locus_tag="DP116_18500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_18500" /translation="MQTPETRPSTDLPPVAKAYNGVDRNAFVFGLNPQAELWNGRLAA IGFLAYLLWDLAGYSVLRDVLHFIGY" gene complement(10472..>10642) /locus_tag="DP116_18505" CDS complement(10472..>10642) /locus_tag="DP116_18505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_18505" /translation="VAKAYNGVDRNAFVFGLNPQAELWNGRLAAIGFLAYLLWDLAGY SVLRDVLHLIGY" assembly_gap 10643..10652 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(<10653..10749) /locus_tag="DP116_18510" CDS complement(<10653..10749) /locus_tag="DP116_18510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_18510" /translation="MQTPETRPSSDLPPVAKAYNGVDRNAFVFGLN" gene complement(10812..11024) /locus_tag="DP116_18515" CDS complement(10812..11024) /locus_tag="DP116_18515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196702.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_18515" /translation="MATPETRPSSDLPPVAKEYNGVDRNAFLFGFTPQAELWNGRLAA IGFLAYLLWDLAGYSVLRDVLHLVGY" gene complement(11086..11298) /locus_tag="DP116_18520" CDS complement(11086..11298) /locus_tag="DP116_18520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196702.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_18520" /translation="MATPETRPSSDLPPVAKEYNGVDRNAFLFGFTPQAELWNGRLAA IGFLAYLLWDLAGYSVLRDVLHLVGY" gene 12141..15281 /locus_tag="DP116_18525" CDS 12141..15281 /locus_tag="DP116_18525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_18525" /translation="MFQSTHITLALYKSLFWVAQMPIDNQPTNANQTYAFSGFGWNSF FALISGVILAFALQLVLTNLSVAAGISYLGRSSDSNGGDGEVGSFGGTIRKIGTAVGL WTLITVTIALAIACFLAVRLSLFGNWSPVQGAILGLVIWGAYFLLLVWVSSTTVGSLV GSLVNTATSGFQAILGTATAALGAKAINNQVVATAEAAAAAVRREIGSAVDPATLRDK VEDYLEMVRPPELDISKIRGEFEKLLNDPQLKAIASSPDLRNIDRQKFLDLISSRTDL SKREVNRIADTLYGVWQQVVGQQQPTPDRLGELVNYLKSLPPGQTKTDELNAKLDQLM AQMRSGKQGDQNRTQEATPGPVQQTIQQAVSALTGIVLGRTDLSDLDVEKILGAVTNA KDKVTEQADKLGLPTPKQAYSPIRTDVENYLLNTYAWQLSSEKAAQEFRDVLYDPAAD PGTVRRELERLSRSDFVNILKTRGLLTQAEIQRIANQLELTRKDVLIAVIAEEEKEIV QDLQRRVESYLLVTPKSDLSAEGIHRDFKPLLEDSDADYETLSRRLAQFDRQEMREIL LERNDIYPEEADTILDELEKQRDQVLVESQGIAEQARYQAESLWLNLESYLGNTGKDE LNPDAIRAELRQLLDDPQAGFAAIRARLSRFDRDTLVQLLSQRQDLSEDQANQILDNV EENWSNIRHAPKIVADKAKEQYDSVTTTIADYLRNTGKQELNPEGIQRDLNRLFQNPR EGVVALRRRLSQLDRDTLVKLLSQREDLSEEQVNQIIDSMQTSIRDIVRTPRRLATRT QQRVQNFQTYLEEYLRRSGKDELNPEGIKRDLSLLLHDPQVGIESLGDRLSQFDRSTI ITLLKLREDLSDEEAARIADTMISVREQFVEQVRNIQRRIQDVVDGVFGRIRNYLNSL ERPELNYDSIKRDVRTLFDDPQAGFDALRDRLSSFNRETLVAVMSSREDISEEDANRI IDQVERARNNVLQRAERIQQEAQRRLEEVKIQAQRQAEETRKAAASAAWWLFATAVVS GVFSAIGGAVAVLFLV" BASE COUNT 4647 a 3146 c 3354 g 4436 t 10 others ORIGIN 1 gtctgttttc gtaatccctt tttataccct accacatttg actgcaactt agtatcagta 61 tcagtctccc aatctgcttg agttaacgaa ttatcttcaa tctgagcaac ttgtgttcct 121 aatttttcat agtttaactg tgtctgaatt cctttttgga ctgatgctgt ggctacctct 181 aatccaagac aacagggcaa cagtaaagca ctaattacga atttccaaaa cattgtttca 241 cgttcctcat ttctcgttcc ctaagaaaac tagggaatgt ctactgtgag actttggttg 301 aaaaaaggat ttacgcagtg aaacgaaaaa ttaaggctga ttctgctgaa ggataacaaa 361 gcctctataa atgcaatacc atctacacca gggtgaaggg agttcttggt gagcgctatt 421 ttgtttttgg ataatttctt tttaagaaat gcagggaact cttaacaggg aacaggagca 481 aaatttcgta tctcagtgtt gcaaactcgg gcgaaccccc gttacagcag caaggctgat 541 tcattccaac ctaattcttt gtttcatgta ccaatttgtg ccattctgta tttagaagct 601 acaagtgggt tcttcccagt gatacgcttt ggaatagtga acagcttgcc agcatctttg 
661 atactcatca aaaccgagac acatctatac aaaacgtgcc aaagtgatgt catctccact 721 agatgaaagc ttgaatttat aaggcttttc tctaaatgtt aaggagtgta atgttttatc 781 agattcctga aaactttaga ccattttaac cttatgatga ggaaaacttg taaagtttag 841 taagaaggaa gttattaaac gtggctattc ctctgctaga gtatgcgcct ttaagtcaaa 901 ataatcgtgt tgctggctac gaagttccgg gtgatgaaca acccagaatc ttctctactg 961 acaacttgct ttctgctact gacctcgata acttaatcga ggcagcctac cgtcagatat 1021 ttttccatgc ttttgagtct gatcgggagc gttttttgga gtcgcagctc cgcagtggac 1081 aaattaccgt ccgtgagttt attcgtggat tggctttgtc caacaccttc actggtagct 1141 tctataacct caacagcaac taccgctttg ttgagcattg tgttcagcgg attttgggac 1201 gcgatgttta cagcgaacgg gaaaaaattg cttggtcaat cgtggtagca accaaaggtc 1261 gggctggctt catcaatgat ctgctcaaca gcgacgaata cctagagaac tttggtgaca 1321 ccattgttcc ctaccagcgt cgtcgggttc tgccttcagg agccagcgaa ttgcccttca 1381 acatcaagtc tccgcgctac gatgaatact atcgtgcgaa actcggcttc ccgcaaatta 1441 tctggcaaac cattgtacgt cgctacactc cacccgacaa gcagcctaag gcaggtgatc 1501 cagctctgtt tgcatcaatg gctcagagta ttaatccaac aggtaatcct ccgcaacaga 1561 tctcgcctta caacattgat tacgaaaaag cagtacctta tcgccgccgg taattagggc 1621 tgcttattta gcaaacacga tacagcgtga gtgtagtctt ccaagataga ggagtcagaa 1681 ttcaggtgct agggttcaga aaaggttttt ctgaatcttg agttctgaaa tctggctcct 1741 tatttgctca agttgatttt tcggcttact aacgttattt atcactctgt gaagaaacac 1801 agtaggtggt ctgagatttc gcaacgaatt caaagcctaa gatttgtgtt tttgctgtcg 1861 cacaagcatc tagagtcttg gtttcaaagc aagctaaaca ctttttaatt tgccagtcta 1921 tggcaacaat ctataattta gtaaacctga tagataccag gtgcaacttt gtggattatt 1981 cttggcaatc gaattctttt tttttgaaga ggtggcaata ttatcaatat attcctagtg 2041 ataactagga agattacata aaaaattatc acaagtaaag aaaatctata gaataaatat 2101 ttagtaaaat actactttca atgaatagat ggctgtgagc aaccacgaac aagttggcag 2161 aagtttaact cttttaaatc aagctttgta cccttatata aaaagggaga tgcaaaaagt 2221 ctatggtgag gtttggctca caattgccct atcatgtatg tcttgttatt cagtgctaaa 2281 agataaccta aaagataact tagaagacat tctacgtgat gatgtttctg ctttattaga 2341 agttatagta 
gggcagtggg ataacgtctt cagcaaaaaa ttaggtaaca ctgagcgtgc 2401 ttttgttagt gaactcattg aaaccaccaa gacttggaag catcaatctc gcttttcagt 2461 tgatgacact tatcgcactc tcgacacaat aactcgactt ctcaaagcta tctctatatt 2521 agaggcaaat gttgcagaac aacacaagca aaaaatacta gaaaaacttc tttctccgcc 2581 aggtgaaaaa gcggctatta cagaaacatt agctgaaatt attaaacaac aaagcttcca 2641 aaatatttct cttttgcgcc gtgcgctgac acaccgttct tatgtccatg aaaaccccca 2701 agaaggagaa cacaacgaac gtctagagtt tctcggtgat gctttgttga cttttttaag 2761 tggtgaatac ctttatcgtc gttatccaca aaagggagaa gatgagttaa ctcgtcggcg 2821 ttctgcgctg gttgatgaaa agcaactggc aaagtttgca attgaggttg gcttaaactc 2881 cagaatgctt ttaggtaaag gtgcaacttt agaacgaggt taccaaaatc ccaatttact 2941 cagtagcgcc tttgaagcag tgatcgccgc ttactacata gacaacaatt atgatattga 3001 agcagtgcgt gctgttgtag aaccgctgtt tgattctgtc cctgaaagta ttgtggagtt 3061 tcgctcaaat gtagactcta aaaatcggtt tcaggaatgg gtgcaacgca acatcactca 3121 aataccgcct aaatatatca cagttcaagt aggtggttct tctcacgctc cagaatttat 3181 agctaaagta tttgtgggag ataaagagta tggagaaggc aaaggtcgca acaagaaaga 3241 tgcggagaag gctgcggctg aggatgcgct ggctaggcta aaacaatgaa ggtgagtttg 3301 aggtatctca cacttctact tttaacggca aagtcacagt aaaagtagaa ccaactccca 3361 tctgcgagac taagcttatt tcaccctgca acagtttcac cagccgtgaa actattgcca 3421 agcccaagcc agtacttccg ggaaggaagg atttactagc agaaccaatg cgaaagtagg 3481 gttcaaaaat ttgtacttgg ttttctggct caattccaat tccagtatcg gaaactgtaa 3541 cactccactt gtcaatgtct aaagttttac acttaatttt tatcgttcct gactctgtgt 3601 agcgaatcgc attactaaca agattcgtta caatttgctg taatcgtaat gcgtctgtta 3661 caacttcgtt aggagcacgg ttgcaattaa taataatttg taaattttta tgactagcca 3721 aaggctccag catttcgtaa acatttctga ttaattcatg cacattgatt ggctctgagt 3781 gtagtttcat ttgcccggca tcatagcgag aaatctctag tgcatcgtta atcaggtgaa 3841 gtaattgtct cccgctgcgt aacacgcggt caatgtgttc tagatgggta aaggaatctt 3901 tttcttctga tttttggcgt tgtaagcgca aaaacaaatc tgagtaacca ataattgaag 3961 ttaatgggtt tttgagttcg tgcgctagat acgataaatt atcttgattt gctcgcacta 4021 agcgagttag ttcctgatta 
ttaagggtta actgattttg cagctgctct agttctgtta 4081 atcgctcttg tgtgtaactt tggaaacaat aggctattgc ttcatccaac accatatcaa 4141 tcaaatgcac ggctcgcatc acttcttgtg ctgatgcttt cagccattct ggttctaaat 4201 taacaaatat agtattacgc aaaagccgat actctcgtgc aatttctgct gcatcgaaac 4261 cttgttcagc cctaagagtt ccatgatgca aacttgcctg aactaagggt tggacttcat 4321 catcttgata ttgagaaagc acagttgcaa gcgccaagag gacatggtca acatgatttt 4381 taatggctgt acgagttaga ttattggtgc tggaaatctt tttatcccga cgaacagctt 4441 ccacccactg ttttataatc ttctgagttt tatcccgcag caattgacta aaatctttca 4501 tcactcatgg gtggataaac atacaaatag aataaatcat tctattattt agatcttaag 4561 cataggagta ctacattgta tttactaaaa acatgtaata attagaaata aaagtacata 4621 atatacgggt gatgtttatc catctaaaga tcaatgaatt ttctttctaa caagacttgt 4681 tttttagcca aaaaaagtat atcctagggt tgcttgacgc tattatttac tggtgttaca 4741 tttatcataa ataagagtaa gtcagcctaa tccaatgaat tgcattgcat aagttggagc 4801 ggaggaacca acgtatgggg cgtatcttag ctagaaggaa caaagagtag atgtatgttt 4861 tcgcttcttg ttttttgctt cttctaatag aagggtatct ctcaacccta gcccgtcagc 4921 taacttcgta ggcaatgaga ggagactgaa gagacagcat gattgtaatg ctctgatctc 4981 atcagtatcc ttggctggta ctgctcggat taattgtacc cgcacttgaa tctgttgagg 5041 tcaccaatga tattgcaact gaatgtagct gccattttgg cggttgctag tcgaacagct 5101 tggaaacgcg ctaaacccgt tattgtacgg gatactgtct tactacctct gcttggtttt 5161 ttgggcgtta ttgttgtctg gtggattatt gcccttgcaa accatgagtt aatgcctact 5221 ccacctgagg cgttagtagc taatttagat tatatcctta acccattttt ccaaagagga 5281 ccaggtaacc ttggtatcgg ctggctgtta atagcaagta tccgacgggt cttactaggt 5341 tttgctctag gtgctttagt agcaattcca gttggttttc tcatcgggat gtcaagaacg 5401 gcaatgatga ttcttaatcc cattatccaa atcttcaaac ccgtatcacc cttggcatgg 5461 cttccgattg ctcttgcgat ttttaatttg gcagatccat cagcaatttt cgtgattttt 5521 atcacttctt tgtggccaac aattattaac actgctctgg gagtttctag tgtttccaag 5581 gattatatag atgtggcacg agttctagaa atgccccgtt ggcgaagaat tacaaaaatt 5641 atttggcctg caagtttacc ctacattttt acaggtttgc gaattagttt aggaattgct 5701 tggttggtta tcgttgctgt agaaatgctg 
acaggcggta ttggaattgg cttttttgtc 5761 tgggatgagt ggagtcgctt aaatctcaat tccgtttttc ttgctgtgct agtcattggt 5821 ttaacaggtt tgttcctgga ttacgctatt ggcagaatac aagcttttgt tactcgtcgc 5881 ccaataactt caaattagtt atttattagc ggtcagtcat tgataaacat aatagatttc 5941 actgcgttat cgcaccctca tctgactgcc tctgataaag gtgcgttaca gttgggacat 6001 ctgtttcatc gatgcaaatt tgacatgcat gaaatctttt taaatagctg atgaattgta 6061 tgggttaagg gagataagta ttcaaaatta aaagaaaaaa ctcactgttt aacaagcagc 6121 ctacctgctg gaactctaca gaaatgaaat ctgtgacagc cgaaacaaga acttttgtac 6181 cttcgtagtt aatattctga gtcatttcgc cgctgcagga ggttttttat gagtcacaat 6241 aacacaaata actggacgcg acgagacttt atcacaggag tgggagcaac agcatttgcg 6301 actggacttt cttcttgtgc cattaatgct aaccgtgccc ctaaagaatt gtcaaaagcg 6361 gcgttggcga ccgaaccagt agtagacccg aagacactgg aaaaacctaa cattacagta 6421 gggtatgtgc cggtgaatga ctgcgctccc tttgctatag cctgggagaa aggatttttc 6481 cgcaagtatg gtttaaatgt caccctcagc cgtgaagcca gctggggtac gtctcgtgac 6541 ggcattattt ttggacgcct tgatgcttcg ccagtggtga gtggtgcggt gacaaatgcg 6601 agaaccggtg cagaaggcgc acgtcatgct cccttatgtg cagctatgac aattcaccgt 6661 cacggtaatg cgatgacgat gaataaagca atgtgggagt ctgggttgcg tccttggcga 6721 gactataacg gcaatttaga agagttcgga cgagattttc gcaactactt tgaaaaatta 6781 ccatctgaga aacgagtttg ggcagtggtg ctgagttcag caatttatga atactttatc 6841 cgctacttgg cggcggctgc tggagttgct cctgataaag aatttcgcat tatcattatc 6901 ccaccacccc agatggtagt aaatatgaga attggggcga tgcaaggata tatggtggcg 6961 gaaccttgga attcacgggc aatttctggc aatgacggaa ttggcttcac cttcgctcaa 7021 ggtagagaaa tttggcaagg acacccagac agacttttag gtgtgatgga gtctttcatc 7081 aaagaaaatc ctaaaactta tcgttctttg gtgaaggcaa tgatagaagc ttgtcgctat 7141 tgtagcgaca caaaaaaccg ggaagaagtc gctcaaattc tcacgaaaaa gtcatttaca 7201 ggagcaaagc tcaaattaac taaaccagct attgttggtg actacaatta tggtggtttt 7261 gatggtcaac aacgagtttg gaaagcaccg gaaacgacaa tattctttga taaacctgcg 7321 aaccttgtca aagcaccaaa tgaccattcc acttttctct ggcaatctca aagtatttgg 7381 ctaatgactc agtcggctcg ttggggacaa attaaagaaa 
taccaaaaaa tgctcaagaa 7441 ttggcacgta aagcttggcg aactgacttg tatcgagaaa ttgctgctga aatgggaatt 7501 gaatgtccaa aagaagatta caaggtagaa caagcggaac tctttatcga taaaaaagct 7561 tttgatccca gtgatccagt gggatatctt aagagttttg aaattagagc taacagtcct 7621 aaatctttct ttatgtctta agatgagcaa attaaacttg attgtgtaga ctgttctttg 7681 tcattcatga atgacttgat ttagaaacta ataataaatc actaataaat agactgttgt 7741 aggagtgtca aatgaaatct acctctttct caaaagatgc ctatgatacg acttcaccta 7801 gtcgtggatt tctagagatt gaaaacttac acaaatcata tccaacacct gatggcaatc 7861 aatttgttgt tttagataac gttaatttga cagtagggga agatgaatat atttctgtta 7921 ttggtcactc tggttgcggt aaatccacac ttttgaagat tgtagcagga ttagaaaaag 7981 cgacttctgg cttggtgcgg ttagatggca aagaaattcg taaaccaggg gctgaacgca 8041 tgatggtgtt tcaacactat tcgcttttac cttggttaac tgtgcgggaa aatatccggc 8101 ttgctgtaga cgaagtgctc aaagatgcca atcgcactga caaaattagc attgtgaacg 8161 aacacctagc aatggtaaat ttaacagctg cagcagataa atatcctgat gaaatttctg 8221 gtggtatgaa gcagcgggtg ggtattgcca gagcattggc aattcgccca aaaatgttgc 8281 tgatggatga accttttgga gcgttagatg cactaactcg cggaaaattg cagcggcaag 8341 tattggatat ttgggaaaat caccgacagg cggtcatgat ggtgacccat gatgtggatg 8401 aggctattta tatgtcagat cgcattgttc ttatgactaa tggacctgcg gctaatattg 8461 gggagatact ggaagtaccg tttcctcatc cacgcgatcg caatgccatg aggaactcaa 8521 aagaatacta cgaactccgc aactacgcgc tcaacttcct ggatcggtat ttcacccaag 8581 acgagtaact gaaaaaattc gtcattcgtc attttcaagt caattacggt ttataaacga 8641 caaacgagtt attgtttttg taaatttttc ttagtggtgg taataaagtt atgaatctta 8701 aacctatctt ggtgcgtctg cagaatgcgc ttggaagaga cgatttaatt gagcaaatgg 8761 tactcatcac cgtgccagaa aaacctttgt ctgtacaaga tcaatccgca aaatcagtca 8821 atttaatcgt tggttataat agctctccca acagtcatac cgcgttagat attgccttat 8881 taatggctca tcaaacacgt ttagccacaa aggcgcaagt gacagttcaa gttgtctatg 8941 tgatagagga acatcagaga agtcatcgtg gagacgtttt gcaaagagag aaatttgtca 9001 gtcagcgtgt tacggaacaa aatccaccac actgttcaac cagcttctcc ttagttggct 9061 ttgatacagg tgtgacaact caactaaaaa tgcaggataa cgcagcctgt 
tcccaagaaa 9121 tgttgataga taaatttgca caagcagaat gtattcttcg ccaagcaagt tgtctagctg 9181 cagaatggaa aagttctttt aaagctcatc ttcgctttgg ttgtattgcc aaggaactga 9241 ggaaagttgt taaatcagaa gctgctaatt tactcttact cggctgtaac tctgttgatc 9301 atccaatagt tcaacagctt ggttctaact tcccttgttc agtactgggt ataccaaatt 9361 ttgtcccttt tggatgagtg atattatgtc cgattaaatc accgtcccca gcaaccgaag 9421 cgaatttgag tgcgtgtccg taataccatt tcttggtaaa gctctaatat acaacgcact 9481 caaattcata aagtaaaacc gctgtacaca agacttgtgt acagcggtta tggtttgttg 9541 cctgttagct tgtatgtttt tgggctaagt cagaggagga aaattgttta ctttctggaa 9601 ctgttagctg gttggtactc gtctattgac ctcggctaga aaagttattg acatatattt 9661 ctaactgtct tgataactat actaggaaat ttagtaataa gcaactactc atttataaag 9721 aaatattatt aactcgtgtg aggatataga ttttgagtat caacaggaga aattgcctga 9781 aagctaaacg gttgaggctt tagcaagtac tcactcgtct cagaaaattg tgatcagact 9841 agtactccac caaggcaact ttgaggggca aaggaatgaa ccgcagaggc gcagagagcg 9901 cagagtgaag agaaaagagg atgagtctag aaatgagttt acttggcaaa gttgctgtgg 9961 cagactacta gctaataatt ctgaattaga tgtaaaatat tatcaataac ttccttatgg 10021 ctcaccaaaa caagtgcaaa ctgcttctaa aaagcagatg caatttctgg cgtgggggat 10081 agggtatcta tcgcaatccg ttgaggagtt gtgagaattg aaaaacaaaa tccccgacct 10141 ctggtgagaa gtcggggact ttgtttctcg cgtctacaaa atttctgact agtagccgat 10201 gaagtgcaga acatcacgca agacgctgta gccagccaaa tcccaaagca agtaagccag 10261 aaaaccaatc gctgccaagc gaccattcca tagttcagct tgggggttca aaccaaacac 10321 aaaagcgtta cgatctacac cgttgtaagc tttagcaaca ggtggtaaat cagtagaggg 10381 gcgagtttca ggagtttgca ttgttttttc ctctaaagtt cagttgatat ttgataaaca 10441 gacgtagcat gcactttcac aaaatttctg actagtagcc gatcaagtgc agaacatcgc 10501 gcaggacgct gtagccagcc aaatcccaaa gtaagtaagc tagaaaacca atcgctgcca 10561 agcgaccatt ccatagttca gcttgggggt tcaaaccaaa cacaaaagcg ttacgatcta 10621 caccgttgta agctttagca acnnnnnnnn nnggttcaaa ccaaacacaa aagcgttacg 10681 atctacaccg ttgtaagctt tagcaacggg tggtaaatca ctagaaggac gcgtttcagg 10741 agtttgcatt gttttttcct ccaaaattag attgatattt 
gatcaggttt tgattaggga 10801 ctaattaata attagtaacc aactaagtgc agaacatcac gcaggacgct gtaaccagcc 10861 aaatcccaaa gcaagtaagc cagaaaacca atcgctgcta agcgaccatt ccacagttca 10921 gcttgaggag tgaagccaaa aagaaaagca ttgcggtcta caccgttgta ttctttagca 10981 acgggcggta aatcactgct aggacgggtt tcaggagttg ccattgtttt ttctccaaaa 11041 ttagattgat atttgatcag gttttgatta gggactaatt aataattagt aaccaactaa 11101 gtgcagaaca tcacgcagga cgctgtaacc agccaaatcc caaagcaagt aagccagaaa 11161 accaatcgct gctaagcgac cattccacag ttcagcttga ggagtgaagc caaaaagaaa 11221 agcattgcgg tctacaccgt tgtattcttt agcaacgggt ggtaaatcac tgctaggacg 11281 ggtttcagga gttgccattg ttttctctcg ttcagttcaa tttgtttgtt gactctaatg 11341 tagctattca caactgtgct gctttctgtc aatgggtaca aactatctgc acctcttgtg 11401 gacgattatc agaacagttt ttccttggat aaatccacct aaagagggtg ctaaaaaatg 11461 tttttgtaaa aaaatgtaac aatggtaacg atgagctgga ttatacctca tcaatttcta 11521 ttagtaggcg actacaaaat caaaaaaaat tttcttggta aactctcctc tcataaacca 11581 atacagttca gataaggcta aggcggctaa aaactaatac tggcgttggt tgaacccttg 11641 tcccgtagac gcccacaaga tacggcgaac cgcaggctaa cccaataaat aagagcgttt 11701 gttgggtttc gctcttttgg taactcaaca aacgcatttt cttaactgaa ccgtattttc 11761 ttatagactt aatgctttga acaataactt cacaagcttt tgttgtgagt tagtgagttc 11821 tatagtttga agacacttat gaattttggt caaaaatgaa atgcataaca cagaaaaaag 11881 aattttgaag aatcaagtgt gagatttctt atttttcact tcacaaaaaa tacttcacac 11941 ttgagtgaat tgaataatta ggtagaaaaa cttctatccc acgctgtagc aagttttcaa 12001 actttggaga gatgccaaaa gcaccgtaat tctagcattc ttaagctcat gggttaagta 12061 gtataacaaa cagatacaat atttactctc acataacttt agcatcccca ccgacttaag 12121 ctacatagga aggattttag atgtttcaga gtacgcacat cacgctggca ttgtacaagt 12181 cactcttttg ggtggcacaa atgccaattg ataaccagcc aacaaatgct aaccagacat 12241 atgccttttc tggatttgga tggaactctt ttttcgcttt aatctcaggt gtgatcctgg 12301 cttttgctct gcaattagtt ctcaccaacc tctccgttgc tgcgggtatt tcctacttgg 12361 gtcgttcatc tgattcaaat ggaggtgatg gagaagttgg aagttttggc ggaaccattc 12421 gcaaaatagg cacagcagta 
ggactgtgga cattaatcac tgtaactatt gcgcttgcca 12481 ttgcttgttt tttagcggta agacttagtc tttttggaaa ttggtctccg gtgcagggag 12541 caattctcgg gttggtgatt tggggagcgt acttcttact gctagtgtgg gtgagttcaa 12601 ccacggtggg ttccttagtt ggctcgttgg ttaacacagc aacttcaggc ttccaggcaa 12661 ttttggggac agcgaccgct gcgttggggg caaaagctat taataaccaa gtagtagcaa 12721 cagcagaagc agcagccgcc gctgtacgtc gggaaattgg cagtgctgta gaccccgcga 12781 cactccgaga caaagtagaa gattacttag aaatggtgcg tccaccagaa ctggatatat 12841 ccaagattcg gggtgaattt gaaaagttac tgaatgatcc gcaactgaag gcgatcgcca 12901 gtagtccaga tctacgcaac atagaccgcc agaagtttct tgatttaatc agcagtcgca 12961 ctgacctttc aaagcgagaa gttaaccgta ttgctgacac actgtatggt gtttggcaac 13021 aggtggtggg tcaacaacaa ccaaccccag accgcttggg agaattggtt aattatctca 13081 aatcattgcc acccggacaa actaagacag atgaactcaa cgccaagctg gatcaactga 13141 tggctcagat gcgttctgga aagcaaggtg accaaaaccg aacccaagaa gcaactccag 13201 gtccagttca gcagacaatt cagcaagcag tatccgcgtt gactggcatt gtgttggggc 13261 gaactgattt gtcagacttg gatgtcgaaa aaatccttgg tgctgtcaca aacgctaaag 13321 ataaagtcac cgaacaagca gataagttgg gtcttcccac accaaaacag gcttacagcc 13381 ccatccggac ggatgtagaa aactacttac tcaacacata tgcttggcaa ttgagttcag 13441 aaaaagctgc ccaagaattt cgtgacgttc tttacgatcc agccgctgat cctggaacag 13501 tacggcgaga attagagcga ctttctcgga gtgattttgt caatattctt aaaactaggg 13561 gtctgctgac tcaagcagaa attcagcgta ttgcaaatca gttggaactg acccgcaaag 13621 acgtgctaat tgcagtgatt gcagaggaag aaaaagagat agtacaagac ttgcaacgcc 13681 gagtcgaaag ttatctactc gttactccca agtcagactt atctgcagaa ggcattcacc 13741 gggatttcaa acccctgttg gaagattcag acgcagatta cgaaaccctt tctcggcgac 13801 ttgcccagtt tgaccgccaa gaaatgcggg aaatcttgct agaacgcaat gatatttatc 13861 ctgaggaagc agacacaatc cttgatgagt tggaaaaaca gcgtgatcaa gtcttagtcg 13921 aatcccaagg aattgcagag caagcaagat atcaagctga atcgctgtgg ctgaatttag 13981 aatcttatct ggggaacaca ggcaaagatg aactcaaccc tgacgctatc cgcgctgagt 14041 tgagacaact tttagatgac ccgcaagcag gatttgcagc aattagggca cgcctgtctc 14101 
gttttgaccg cgatacttta gtacaattac tgagtcaacg gcaagacttg agcgaagacc 14161 aagcgaatca aatcctcgac aatgttgagg aaaactggag taatattcgc cacgcaccaa 14221 aaatcgtagc agataaagcg aaagagcagt acgattctgt tacgacaaca atagcagact 14281 acctgcgaaa cacaggtaaa caagaactca accctgaagg aattcagcgg gatttgaatc 14341 gactgttcca aaacccaaga gaaggtgttg tcgcactgcg ccgtcggttg tcacaacttg 14401 atagagatac cttagtgaag ctactcagtc aacgtgagga cttgagtgaa gagcaagtca 14461 atcaaatcat cgattcgatg caaacttcga ttcgtgacat cgtgcgtaca ccccgtcgcc 14521 tcgccacccg aactcagcaa agagtacaaa atttccaaac atatttggaa gagtatctac 14581 gcaggagtgg caaagacgaa ctcaacccag aaggtatcaa acgcgacctt tctctgttgc 14641 tgcatgatcc acaagtggga attgaaagtt taggcgatcg cctttcccaa tttgaccgct 14701 ccaccatcat cactttgctg aaactgcggg aagacttgag tgatgaggaa gccgcacgaa 14761 ttgcagatac aatgatatca gtgcgtgagc agtttgtgga acaagtgcga aatatccagc 14821 ggcgcattca agatgtggtt gatggagttt ttggtcgcat tcgcaactac ctcaactctt 14881 tagaacgccc agaacttaac tacgatagca tcaagcgcga tgttcgcaca ttgtttgatg 14941 atccacaagc tgggtttgat gcattgcgcg atcgcctatc ttctttcaac cgcgagacct 15001 tagtagcagt tatgagttct cgtgaggata tctcagaaga agacgccaac cgcattatcg 15061 accaagttga acgagcacga aacaacgtat tgcaacgggc agaacgcatc cagcaggaag 15121 cacaacgacg cttagaagaa gtaaaaattc aagcacaacg ccaagccgaa gaaacacgca 15181 aagccgctgc atcagccgct tggtggctat ttgctacagc agttgtttca ggagttttct 15241 ctgcgattgg aggagcagtt gctgtactct tcctagtgta gtttaatact ttttattgac 15301 tgctctgttt tagcctctcg tatgagaggc ttttttgttc ataggcagca gttatgacag 15361 ctatagtcaa aagtgacaaa gtataggtaa actttgtaac atgacaaatg actgaaatca 15421 aagcaaagtt gcaactatgt agtatcaata cggttcggtt agcgcaaaaa acataaaatt 15481 atttaggtta ggttgaggaa cgaaacccaa cacctgcctc gcttttgttg ggttgccctc 15541 cgcttaaatt aacttacaat ttttcttaag tgaactgtat tgtagtataa atc // LOCUS NODE_2162_length_15519_cov_5.25426815519 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15519) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15519) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..15519 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(77..2833) /locus_tag="DP116_18530" CDS complement(77..2833) /locus_tag="DP116_18530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875615.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18530" /translation="MSNQEINPELENLLEYIKRNRGFDFSGYKRTSLSRRILRRMQII GIENYTEYLDYLEVHPDEFVELFNTILINVTAFFREGQAWEYIANEIIPQILASKHLS KPIRVWSAGCASGEETYTLAILLAEALGMEQYTTRVKVFATDVDVEALNTARQATYNP KDLQSVPADLQEKYFDRVNGRYIVQKELRRGVIFGRHDLVQDAPISRIDLLVCRNTLM YFNTETQAKILDRFHFALNESGFLFLGKAEMLFTRNHSFTPVDLRRRVFTKIPNGNMR DMLLNMAHSSGHQPVPEMVDQMRTHEAAFEIDPVAQLVVDLNSIVMLANAEARNLFNL HPRDLGRPLQDLELSYRPVELRSRIDHVRSNRRPITLKDIEWPGTERDIKYMDVQIIP LVDDSTDELLGVKIIFTDVTRFKNLQQELVHANQELETAYEELQSTNEELETTNEELQ STVEELETTNEELQSTNEELETMNEELQSTNEELQTMNEELRQRGQFLQHRVTQSPTL QEDLVLQAIEELSTALEELRVAEEELHQQNEQLHIANQQVALERQRYQELFDFAPDGY LVTNTEGKILEVNQAAAQLLNISKNFLVGKALINFIPEEERRAFRNQLLQLSQMQRIQ EWEIRLQPRKGNIFDASLSVATVLASQDKLQGWRWLVRDITSRKQAEEKLRLIQQENL ELQETAAIKTQMMSVLSHELRTPLNSILGFSQLLLHRYYNLFPPELRDMIERITRSGK HLLGLIENMLDFSKLERDRLELNIQEFNLVELVTATTEEVRCLAEQKNLTLVLHANIE NPRVVNDSVRLRQILVNLVSNAIKFTDTGGVFVEVQQGNQDQVVLMVKDTGIGIPESE LAHIFQEFWQVDKSTTRKYGGIGLGLAIADKLVRLMKGTITVESNLGEGSTFRVLLPR NVSH" gene complement(2943..3191) /locus_tag="DP116_18535" CDS complement(2943..3191) /locus_tag="DP116_18535" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18535" /translation="MYDAVGQFTSDVTRTHSRFQAPAWECYSRGKPLVVGQEAEPPLL HSLPEIRNETITDLLFFVSNKQKILKYQSVLRFFPRTG" gene complement(3200..3919) /locus_tag="DP116_18540" CDS complement(3200..3919) /locus_tag="DP116_18540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872496.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Crp/Fnr family transcriptional regulator" /protein_id="PRJNA477356:DP116_18540" /translation="MSVSPSFRNPNENRLLAILPTEEYKRLLPHMESIFLPLKQILYQ MNEPIEYVYFPKNGIVSLVTIMEDGATAEIATVGNEGMIGLPVFLGTDQIPGQAFSQI PGESMRMKAEEFKTFVTPDSPLYKLLQRYTQTLFNQVAQSAACNCLHCIEKRFCRWLL MTHDRVQSDEFLLTQEFLAQMLGVRRASVSQVASIFQKAGIISYSRGQMRILDRTGLQ AASCECYAKVKQEFERLLGNN" gene 4115..4423 /locus_tag="DP116_18545" CDS 4115..4423 /locus_tag="DP116_18545" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18545" /translation="MRVARNTISKAEVDAYIPRSDLNIIQSSILIAEVTSKPASHSNT IPKILVIMTQSAVIRTRSAAIRSTAREVRAKSILSRTNSIAARAKSAELIAEGATFLT " gene 4426..4911 /locus_tag="DP116_18550" CDS 4426..4911 /locus_tag="DP116_18550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748770.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_18550" /translation="MKRNDSQGGLAGVSIIVLKSFDNQRPEDSTNVKTLNNLRVLVVD DNIDTLILITVILEDYGAKVMTATSAREAFEVIRDFELDFLIIDIVMPQEDGYSLICK IRTLDNTQKKQIPAIALTAIDTDEARQLAFKSGFQNYLTKPFDNGELVIEIAKFLVNC N" gene 4923..5333 /locus_tag="DP116_18555" CDS 4923..5333 /locus_tag="DP116_18555" /inference="COORDINATES: protein motif:HMM:PF00072.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_18555" /translation="MGEKSKTILLVEDNPDVGFLIQSLFHDANLPVSFQVIKNGQEAV DYLSGKEPYVNRENYPLPVIILTNINMPHMSGFELLAWVKQHPQLKNLPVVLMSTYDD PKHLIQAASLGAYSYFIKTSSFDDLVDIAAKFVS" gene complement(5576..6625) /locus_tag="DP116_18560" CDS complement(5576..6625) /locus_tag="DP116_18560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318283.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chemotaxis protein CheB" /protein_id="PRJNA477356:DP116_18560" /translation="MPGHDIIVVGASAGGVEALSYLVKNLPPDLNAAVMIVLHVPSHG TSVLPHILTRAGKLPASHAKDGEVIQLRRIYIAPPNYHLLVKPGHIHLARGPRENGHR PAIDPLFRTAARAYGRRVVAVVLTGVLDDGTAGLKAVKMRNGVAVVQNPEDAMYAGMP RSAIENVNDIDHILPLSDIPDILVSIANTQVEGEEDPVPEEIEVESDLVELDMNVLNS EQRPGKPSTFGCPDCGGTLWDLSDGNLLRFKCRTGHAYSAETLLAKQSDALEDALWVA LRALEEKASLSRRMAERMRDRNQLLSAQRLEEEVQDSQKRAGVIRDVLLKGNTTTADG NNSKAPEEEEPTNVS" gene complement(6987..7412) /locus_tag="DP116_18565" CDS complement(6987..7412) /locus_tag="DP116_18565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318243.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18565" /translation="MDNTNKVDSTANNSAENTTDTEANIISQPSSNEAKQIPKTINVD GRRPIDPSNIQVQETFDIDGQRPIAKSDFQDHDMLAVDGKRPIDPSDIEVSYTLDIDG QRPIVKSNFQVSSTLEVDGSRPITSNDIQKPEITSDYVD" gene 7689..7928 /locus_tag="DP116_18570" CDS 7689..7928 /locus_tag="DP116_18570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006633975.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18570" /translation="MAREVTDADGITWSCVEAYAGLNDEAHNRAAAQVNEERDTYWVV CTPSGGAKSLRLELPGDWEDSYSDEALLGEIKAHQ" gene complement(7950..8132) /locus_tag="DP116_18575" CDS complement(7950..8132) /locus_tag="DP116_18575" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18575" /translation="MGNSAESVKDFLWFLCSAWERFLEAPPLNNYIEAEPQVMHSLPE TRNEEILYLLLLNAFI" gene 8279..9499 /locus_tag="DP116_18580" CDS 8279..9499 /locus_tag="DP116_18580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873441.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_18580" /translation="MLHKAIQVRLYPNQDQQIQLSQSFGCSRWWWNYALNKSIETYKE TGKGLGQVALNALLPKLKKEKDTEWLADCYSQVLQATTLNLTTAYKNFFEGRARFPRF KSKHGKQSIQYPQNVKIVEGNVKLPGNIGVIKAKIHRPIEGKIKTVTVSKTPSGKYFA SILTELEGENSTISEGKIYGIDLGLKHFAVITDGEKVSKYDNPKHIAKHEKNLKRKQQ KLARKQKGSNSRNKYRKVVAKVYERVSNSRQDFLHKLSYKLVSDSQAVIVENLHVKGM VRNHNLAKAISDCGWGTFTNFLAYKLERKGAKLLEIDRWFPSSKLCSNCFYQVNEMSL DVREWTCPHCGTHHDRDGNAAINIRTEGIRMLKAEGSAVSAVGGEVRPKMGRKSYLRH SPMSTEAPSAYAVG" gene complement(9501..11387) /locus_tag="DP116_18585" CDS complement(9501..11387) /locus_tag="DP116_18585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872557.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_18585" /translation="MRETILDVRNLQVEFSGESKSVKAVDGISFELHRGETLGIVGES GSGKSVTSLAVMGLLQSPGRISGGEIWFHPQENGAPINLAQLPNEQIQLHRGGDIAMI FQEPMSSLNPVYTIGFQITEAILRHQNVSASEARRIAIAGLQEVKLLPSDEALKQQYL ETWKETSFGSSTPDEQKIAQLVKQHKEAILERYPHQLSGGQLQRVMIAMAISCNPLLL IADEPTTALDVTVQATILELLRELQQRREMAMIFITHDLGLISEIADKVAVMYRGKIV EYNSAGQIFSNPQHPYTKGLVACRPTLNRRPQKLLTVSDYMNVEETPTGDLIIQEKQP QQPVEVTSEEMNQRLQSLEQQQSLLQVRDLKVGFPIRGAFGGTKRYFMAVNKVSFDVK KGETLGLVGESGCGKTTVGRTLLRLIEPMSGQIIFEGQDITTLKGKPLQNLRREMQIV FQNPFSSLDPRFKVGEAVMEPLVIHSIGKTKQERRERVAYLLERVGLSADAINRYPHQ FSGGQRQRICIARSLALNPKFIICDESVSALDVSVQAQVLNLLKELQDEFGLTYIFIS HDLSVVKFMSDRILVMNRGEIVEQGTAESIYREPKEEYTQKLIASIPTGSAERVRQRQ VRAS" gene 11920..12720 /locus_tag="DP116_18590" CDS 11920..12720 /locus_tag="DP116_18590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876735.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18590" /translation="MLSQPVKFLFLSAVSLSFVLGHAAAFAQVRGGGDIVVPTQPAGG GSTTTRTRTIETDGSSSTTRTTTSSPISSSNRFFCQSYNGQYTVMYQPESQPGQYFPW ATPRTLGGGWDAQQRCQAIAERLETYRPDGLVELKTAIENRQNILCVTTETNPYCRIV LTVPPEKDPYVVRNSVFQNLASADSGQQTFGVNTYTSGNDDLSNLGRNIFGGGKKPST SSKDPINLKPFLDRADRGTGEKLRNGVSLNRQQSQPQTGNRLDPKKFR" gene complement(12939..13562) /locus_tag="DP116_18595" CDS complement(12939..13562) /locus_tag="DP116_18595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128844.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heme-copper oxidase subunit III" /protein_id="PRJNA477356:DP116_18595" /translation="MQSQTIDPAKTALNHHHTATAEADHEEHPDHRLFGLVMFLVAEG MIFMGLFGAYLAMRSTVPVWPPEGTPELELLLPGVNTINLIASSFVIHNADTAIKKND VRGMQIWFGITAAMGILFLVGQVYEYTHLEFGLTTNLFASAFYVLTGFHGLHVTLGVV AILAVLWRSLSKGHYSNEHHFGIEAAEIYWHFVDVIWIILFGLLYLL" gene complement(13642..15384) /gene="ctaD" /locus_tag="DP116_18600" CDS complement(13642..15384) /gene="ctaD" /locus_tag="DP116_18600" /EC_number="1.9.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872876.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome c oxidase subunit I" /protein_id="PRJNA477356:DP116_18600" /translation="MTQAQVQEKANIPALIEEPGIRPWRDYFTFNTDHKVIAIQYLVT TFIFYCIGGVMADLVRTELRTPDVDFVTPEVYNSLFTLHATIMIFLWIVPAGAGFANF LIPLMIGAKDMAFPRLNAVAFWMIPPAGLLLIASLVVGDAPDAGWTSYPPLSLVTGQV GEGIWILSVLLLGTSSILGAINFLVTMLKMRTPGMGFFQLPLFCWAMLATSALTLVST PVLAAGLILLSFDLLAGTTFFNPTGGGDPVVYQHMFWFYSHPAVYIMILPFFGAISEV IPVHSRKPIFGYKAIAYSSLAISFLGLIVWAHHMFTSGIPGWLRMFFMITTMIIAVPT GIKIFSWLATMWGGKIRLNSPMLFAMGFVGTFVIGGISGVMLAAVPFDIHVHDTYFVV AHLHYVLFGGSVLGIYAAIYHWFPKMTGRMLNEFWGKVHFTLTIVGLNMTFLPMHKLG MMGMNRRIAQYDPKFTFLNEICTYGAYILAVSTFPFIINAIWSWMYGPKAGNNPWDAL TLEWMTTSPPAIENFDKPPVLATGPYDYGLENTAKGVPLSDPDPVLSAGVNSVLRAEP DEPSPAITAEKEER" BASE COUNT 4413 a 3412 c 3354 g 4340 t ORIGIN 1 ccctgttccc tgttccctgt tccctgttcc ctgttccctg ttccctgttc cctacttcta 61 ccacgaagtc taatgactaa tgactaacat tgcggggtaa tagtacacgg aaagtggaac 121 cttcgcctag gttgctttca actgtgatag ttcccttcat taagcgaacc aatttatcgg 181 cgatcgccag tcccaaacca atccctccat acttgcgtgt cgtagactta tcgacttgcc 241 aaaattcttg aaaaatatgt gctaattccg attctggaat gccaatgcct gtatccttaa 301 ccatcaacac gacttggtct tgattgcctt gctgtacttc cacaaacacg ccgcctgtat 361 ctgtaaattt gatggcgttg gaaactaagt taaccaatat ttgtcgtaaa cgaacactat 421 cgttgacaac tctaggattt tcgatattgg cgtgtaaaac taaggttagg tttttctgtt 481 cagcaagaca acgcacttcc tctgtggttg ctgtgactag ctccactaag ttaaattctt 541 gtatatttag ttctagccta tctctctcca atttggagaa gtcaagcata ttctcaatca 601 gtcctagtaa atgtttgcca cttctagtaa tccgttctat catgtctctt aattcggggg 661 ggaacagatt atagtagcgg tgcaatagca gctgggaaaa ccccaaaatt gaattcaaag 721 gtgtacgcag ttcgtgggag agaactgaca tcatttgagt tttgatggct gcagtttctt 781 gtaactctaa attctcttgc tgaatcaacc ggagtttttc ctctgcttgc ttgcgggagg 841 tgatatcgcg cacaagccaa cgccagcctt ggagtttgtc ttgggaagcc aacacagtag 901 caacactcaa gctagcatca aaaatattac cttttctcgg ttgcaagcgg atttcccact 961 cttggattcg ctgcatctgg gatagctgta agagttggtt acgaaaggcg cgacgttctt 1021 cctcaggaat aaagttaatc agcgcttttc ctaccaaaaa 
atttttcgag atgtttaaca 1081 gttgagccgc cgcctggttg acttccaaaa ttttcccttc ggtatttgtt accaagtaac 1141 catcgggtgc gaaatcaaat aattcctggt agcgttgacg ttctagtgcg acttgttggt 1201 tagcaatatg caattgctca ttctgttgat gcaattcttc ctcagccaca cgcaattctt 1261 ccaacgcagt gctaagttcc tcgatagctt gcagcactaa atcttcctgg agtgtaggtg 1321 attgggtaac acgatgttgt aagaattgac cgcgctggcg tagttcttca ttcatagtct 1381 gcaattcctc gtttgtggat tgcagttctt cgttcatagt ctctaattct tcgtttgtgg 1441 attgcagttc ttcgttagtt gtttctagct cctctacagt tgattgaagt tcttcattag 1501 tcgtttctaa ttcttcattt gttgactgga gttcctcata agccgtttct aattcttgat 1561 tggcgtggac tagttcctgt tgcagatttt taaagcgagt tacgtctgtg aagatgattt 1621 ttacgcccaa gagttcatcg gtactgtcgt ccaccaaggg tattatctgc acgtccatat 1681 attttatgtc gcgctctgta ccaggccact caatatcttt taaggtgatg gggcgacgat 1741 ttgagcgaac gtggtcgatg cgcgatcgca actccaccgg tcgataagaa agctctaaat 1801 cttgcaacgg gcgacccaaa tctctaggat ggagattaaa caaattgcga gcttcagcat 1861 tagccaacat aacaatgctg ttgagatcca cgactaattg agcgacggga tcaatctcaa 1921 aagctgcttc atgagtgcgc atttgatcaa ccatctctgg gacgggttgg tgtcctgaag 1981 agtgagccat gttcaatagc atatctcgca tattgccatt aggaattttg gtaaataccc 2041 gtcgccgcaa atctactggg gtgaacgagt ggttgcgggt aaacagcatt tctgctttcc 2101 ccaaaaacaa aaaaccactc tcattgagag cgaagtgaaa gcgatcaaga attttcgctt 2161 gagtttcggt attaaaatac atcaaagtgt tacgacacac cagtagatca attctggaga 2221 ttggtgcatc ctgtaccaag tcgtggcgac caaatataac cccgcgacgc agttctttct 2281 gaactatata gcgaccgtta actcggtcaa agtatttttc ctgtagatct gcgggaacgc 2341 tttgaagatc tttgggatta taagttgctt ggcgagcggt attaagagct tccacgtcta 2401 catctgtggc aaacactttc acccgtgttg tatactgttc catacccaat gcttcagcaa 2461 gcaaaatagc taaggtgtag gtttcttccc cagaagcgca tcccgcactc catacccgga 2521 ttggtttgct caaatgtttg ctagcaagta tttgaggaat aatttcattt gctatgtact 2581 cccaagcttg tccctcacgg aagaaagctg tgacgttgat taatattgtg ttgaataact 2641 caacaaactc gtcggggtgt acttctaagt agtccaaata ttcagtataa ttttcaattc 2701 caataatttg catacgtcga agaattcgac ggcttaagct tgtgcgctta 
taaccgctaa 2761 aatcaaagcc tcggttgcgt ttaatgtatt ctaataaatt ttctagttcc ggattgattt 2821 cttggttact catataagtt tatactgaat aaattcactt gcgtttgtgc ttgttgtcaa 2881 actacagcct tccaggcata attagaggtt caacaagaag tattgctcct aagcagttat 2941 tcttaaccag tacgtggaaa aaatctcaaa acagattggt attttaatat cttctgctta 3001 ttagaaacga aaaataataa atctgttatt gtctcgttcc ttatctctgg caaggaatgc 3061 agcaatggag gctctgcctc ctgacctact actagaggct tgcctctaga atagcattcc 3121 caggctggag cctggaaacg agagtgagtg cgggttacat cgctcgtgaa ttgaccaaca 3181 gcatcataca ttacagtcct caattattgc caagcaagcg ctcaaactcc tgtttaactt 3241 ttgcataaca ttcacacgaa gccgcttgta agccagttcg gtcaagaatt ctcatttgcc 3301 cacggctgta gctaattatt ccagctttct gaaaaatgct tgccacctga cttacactag 3361 cacgacgaac ccccagcatt tgagcaagaa actcctgagt tagcaagaac tcgtcagatt 3421 gtactcgatc atgagtcatt aaaagccaac gacagaatcg tttttcgata cagtgcaagc 3481 agttacaagc agctgattga gcaacttggt taaataatgt ttgcgtatag cgttgtaaca 3541 gcttgtaaag cggactatct ggagtgacaa aagttttgaa ctcctctgct ttcatccgca 3601 tcgactcacc aggaatttgt gaaaaagcct gcccaggaat ttgatctgtt cccagaaata 3661 ccggcagacc gatcattcct tcgttaccca ctgtagcgat ttcggctgtc gcgccgtctt 3721 ccataatagt caccagagag acaataccat ttttggggaa atagacgtac tcaatgggtt 3781 cgttcatctg gtagagaatt tgcttcaaag gcaaaaagat gctctccata tgggggagga 3841 gacgtttgta ttcttctgtt ggcaagatag ccagaagccg attttcattt ggattacgaa 3901 aacttggtga cacggatatc actagtttta ttaaactagt cgtaaattaa ctatatagtt 3961 agctgcattg tcaatgactt ttagcttgac acaaaattat tgcaatgagt ccattattac 4021 gccattttga tattagttat gtacggtatc ggatagacaa acaactagta aataggtaga 4081 ttctgagttt gtagtatata gctaattgtt tgaaatgcgt gtagctagaa acacgataag 4141 caaagctgaa gtagatgcat atatccctag aagtgactta aacattatcc aaagtagtat 4201 tttgattgcc gaggtcactt caaaacctgc cagccatagc aacaccatcc caaaaatcct 4261 agtgataatg actcaaagcg ctgtcatcag aacaaggagt gctgctatca gaagcacagc 4321 aagggaagtt agagccaaaa gcattttgtc tagaaccaac agcatagccg cgagggcaaa 4381 gagtgctgaa cttatcgccg aaggtgcaac ctttttaaca tagctatgaa acgcaatgac 
4441 agtcaggggg gtttggcggg ggtcagtatt atcgtgttaa aaagttttga taatcaacgt 4501 cctgaagaca gtacaaacgt caagactctc aataatttgc gagttcttgt cgtagatgat 4561 aatattgata cccttatttt aattactgtt attcttgaag attacggcgc taaagtcatg 4621 acggcaacat cagcaaggga agcttttgag gtcatcagag attttgaact agacttttta 4681 attatcgata ttgttatgcc gcaagaagac ggttattcat taatttgtaa aatcagaacg 4741 ctggacaata cacaaaaaaa gcaaatacct gcaatagccc taacagctat agatacagat 4801 gaagcacgcc aacttgcttt caagtctggg tttcaaaatt atctgactaa gccttttgat 4861 aatggagagt tagtcataga aatagcaaaa tttttagtga attgcaatta aatatagtac 4921 taatgggaga gaaaagcaaa accattttgt tagtcgaaga taaccctgat gtagggtttc 4981 tgattcagtc tttatttcat gatgctaatt tgccagtctc ttttcaagtt atcaaaaatg 5041 gacaagaagc tgtggattac ctgtcgggca aggaacctta tgttaacaga gagaactacc 5101 cactgccagt aatcatattg acaaacataa atatgcccca tatgtcaggt tttgagttac 5161 ttgcatgggt caagcagcat ccccaactga agaatttacc agttgtgcta atgagtacct 5221 acgacgatcc aaagcatttg attcaagctg ctagcttagg cgcttactcg tacttcatta 5281 agacatcatc ttttgatgac ttggtagaca tagcggcaaa attcgtgtcg tagcgttggc 5341 tttggtatgg atgaattggt gtcaatttaa cgttcaaacc tttgaactca cgttgatttg 5401 acccctcccc gaagttcggg ctacggtgta cacagatctc actcaaaacc tcaccctcgc 5461 tttttgctac gcaaaaatct ttccctctcc gaactcacgg agagggatgt ccgtgaggac 5521 agggtgaggt ttcgactgta tgacaacgaa gtagaagagc tatattccta aagattcacg 5581 agacattagt tggttcttcc tcttctggcg ctttgctgtt gttcccgtct gctgttgtag 5641 tgttaccctt gagcaggaca tcccgaataa cgccagcacg tttttgagaa tcctgcactt 5701 cctcttccag tcgttgtgct gataggagtt ggttgcgatc gcgcatccgt tcagccatcc 5761 gacgtgataa agatgctttc tcttccaatg ccctcaaagc aacccacagc gcatcttcca 5821 aagcatcaga ttgttttgcc agcagagttt ctgctgaata agcatgacct gtacggcatt 5881 taaaccgcaa taaatttcca tctgaaagat cccacaaagt accaccacag tctggacacc 5941 cgaaagtaga gggtttacct ggtctttgtt cactgttaag cacgttcata tccaactcca 6001 ccaagtcaga ttcaacttca atctcctcag gcacagggtc ttcttcgcct tccacctgcg 6061 tgttagctat acttaccaaa atatctggaa tatccgacag aggcaggata tggtcaatat 6121 
catttacgtt ctcgatagcg ctgcgcggca tcccagcata cattgcatct tcggggtttt 6181 gaacaaccgc aacaccattc cgcattttca ctgcctttag tcctgctgta ccgtcgtcaa 6241 gaacacctgt taacaccaca gcaaccactc gtcgcccata agctcgcgca gccgtgcgaa 6301 acagtgggtc aatagctgga cgatgaccgt tttctctcgg tccccgtgcc aggtgtatat 6361 gtcctggttt caccagtagg tgatagtttg gcggggcaat ataaatcctt ctcaattgaa 6421 tgacttcgcc atcttttgca tgagatgccg gcaattttcc agcacgggta agaatgtgtg 6481 gcagaacact tgtgccgtgg cttggtacat gaaggacaat catgacagca gcattcaggt 6541 ctggcggtaa attcttgact aaataagaaa gtgcttcaac tccgcctgcc gatgctccaa 6601 caacgatgat gtcgtgtccg ggcatttttt cttcttcctc gtaattgcag cggagctaga 6661 catatcgcac tgcctatatc acgctagcag aattaagtaa cacatttacc taactttgag 6721 cggatttata aggacggcta gctgaaaacc catgaggaac aggcacgtag tgccgtattc 6781 cgtactttag gcaggggatt gaaagctgcc tgaaggcgtt tacgccaaaa ccggggtggg 6841 gttatgcgac aattacgaac gagtaaaata aggcatgata tgacgtaatg acaagggttt 6901 caccttaagt tgacacgtat gggcagcctt tgcccctacg actgagacta cttcacccgg 6961 ttgaaaatgg ctatatgtag aaaatattag tctacataat cgcttgttat ttctggcttt 7021 tgaatgtcat ttgaagtgat tggacggcta ccatctacct caagagtgct agaaacctga 7081 aaattactct taacaatcgg acgttgaccg tctatgtcga gagtatagct aacctctata 7141 tcacttgggt caatcggacg cttgccatcc acagctagca tatcatgatc ctgaaaatcg 7201 ctcttggcaa tcggacgctg accatctatg tcaaatgttt cctgaacttg aatattactt 7261 gggtcaattg gacggcgacc atctacgtta attgttttag gaatttgttt cgcttcgtta 7321 gatgatggtt gagaaataat gttcgcttca gtatctgttg tattctcagc agaattattt 7381 gcagtgctat caactttgtt tgtgttatcc atgctgcggc ttgttcttcc ttcttatggc 7441 tatttgctcg tttacactat aatcccaaaa gtttcttatc catcattgga ccataacata 7501 tgatttttct aatgatttgt ttactgataa acatacttag gtagaggcaa aaaatcctct 7561 accacagcac agaatttgcc tctaacttaa gagcgatttt tgacacttgg gtgcggcgta 7621 gagcttctcg cctataacgc acatggcaca atgggtaggt taagagcgat aagcaaagaa 7681 ggtatacaat ggcgcgagaa gttacggatg ctgacggaat tacttggagt tgtgttgagg 7741 cgtatgcagg tcttaatgat gaagctcaca accgcgctgc tgctcaggta aacgaggaac 7801 gtgacacata 
ctgggttgtc tgcactccaa gtggtggtgc gaaatcattg cggctcgaac 7861 taccgggtga ctgggaagat tcttactcgg atgaggcgtt acttggcgaa attaaagcac 7921 atcagtagcg ctgtaaacaa gtaaagtcat caaataaatg catttaggag gaggagatac 7981 agaatttcct cgttccttgt ctctggcaag gaatgcataa cttgaggctc cgcctcaata 8041 taattattga gaggcggagc ctcaaggaag cgttcccatg cagagcatag gaaccagaga 8101 aaatctttaa ctgattcggc ggaattcccc atccgcctat gcggtggggt gcatcacgcc 8161 gggaaattgt aaggtacatc ctttatggat gtccgagcaa ttcccatatt tgtccggaat 8221 gactttactg gtaagtgtag ataatagtaa gatataaaaa ctaaggaggt gatattgagt 8281 gctacataaa gcaatacaag ttcgtttata cccgaaccaa gaccaacaaa tacaattatc 8341 tcaaagcttt gggtgttccc gatggtggtg gaattatgca ttgaataaat caattgagac 8401 atacaaagag acgggtaagg ggctgggaca agtagcactc aatgcactac tgcctaagct 8461 caaaaaggaa aaagatacag aatggttagc tgattgttat agtcaagttt tgcaagctac 8521 aacacttaat ctaaccacgg cgtacaaaaa cttttttgaa ggtagagcaa ggtttccacg 8581 attcaaatct aaacacggta aacagtctat ccagtatcct caaaacgtca aaattgtaga 8641 aggcaatgtc aaacttccgg gcaatattgg agtaatcaaa gccaaaatac atagacctat 8701 tgaggggaaa atcaagactg tcactgttag taaaactcca tcaggcaaat actttgcatc 8761 tatcttgact gaattagaag gtgaaaattc aactatttca gaaggtaaaa tttatggcat 8821 tgacttagga ttgaaacact ttgctgttat caccgatggc gaaaaagtgt ctaagtacga 8881 taatcctaaa cacattgcca aacatgagaa aaacctgaaa cgcaaacaac aaaaactagc 8941 acgtaaacaa aaaggaagta attcaagaaa caagtatcgt aaagtcgttg ccaaagtgta 9001 cgaacgggtt agcaattcgc ggcaagattt tctgcataaa cttagctaca agttggtcag 9061 cgatagccaa gctgtcatag tagagaatct tcatgtcaag ggcatggtac gtaatcacaa 9121 tttggcgaaa gcaatatctg attgtggatg gggaactttc actaacttct tagcctacaa 9181 gctagaacgc aaaggtgcaa agttgcttga aattgataga tggttcccca gttccaagct 9241 ctgctctaat tgtttctatc aagtcaatga gatgtcgcta gatgtgaggg aatggacttg 9301 tcctcactgc ggcactcatc atgatagaga tggtaatgca gcgataaata ttagaacaga 9361 aggaatcaga atgctaaagg cggaaggttc agccgtctct gctgtaggag gggaagtaag 9421 accaaagatg ggacgaaagt cttatctgcg gcattcgcct atgagtacag aagccccatc 9481 cgcctatgcg gtggggtagt 
tcacgatgcc cgtacttgtc gttgtcgcac ccgttcagca 9541 cttccagtag gaattgaggc tatgagtttt tgggtgtatt cttctttggg ttcgcggtag 9601 atactttctg ctgttccttg ttcgacgatt tcaccgcgat tcatgactag gatgcgatcg 9661 ctcataaatt tcaccacact caagtcgtga gaaataaaga tataagtcaa cccaaactca 9721 tcttgcaatt ctttcagcag attcagcacc tgtgcttgca ctgatacatc cagcgctgaa 9781 accgattcat cacatataat aaacttggga tttaatgcca aagaacgggc aatacaaatc 9841 cgctgacgtt gaccaccaga aaattgatga ggatagcggt tgatagcatc tgcacttaaa 9901 cccacccgtt ctaagaggta agcaacacgt tctcgccttt cttgctttgt cttacctatc 9961 gagtgaatta ccaaaggttc cataactgct tccccaacct tgaagcgtgg atcgagggag 10021 ctaaaggggt tttgaaaaac aatctgcatt tctcgccgca agttctgcaa cggtttccct 10081 ttgagggttg tgatatcttg tccttcaaaa ataatttgac cactcattgg ttcaattaat 10141 cgcagcaaag ttctaccaac agtggtttta ccgcaaccag attctcccac caatcccagg 10201 gtttctcctt ttttcacatc aaaggaaact ttattgactg ccataaaata gcgttttgta 10261 ccgccaaacg ctccccgtat ggggaaacca actttcaaat cacggacttg caaaagagat 10321 tgctgttgtt ctaagctttg caacctttga ttcatctctt cacttgtgac ttccacaggt 10381 tgttgaggtt gtttctcttg aatgatcaaa tctcctgttg gcgtttcttc cacattcatg 10441 tagtcggaaa ctgtcagcag tttttgggga cgacggttga gtgtggggcg acaggctacc 10501 aagcctttgg tgtatggatg ctggggattt gaaaaaattt gtcctgccga gttgtattct 10561 actattttgc ctctgtacat cacggcgact ttatcagcga tttctgaaat cagtcctaag 10621 tcatgggtga tgaaaatcat tgccatttca cggcgctgct gcaattctcg tagcagctca 10681 aggatcgttg cttgtactgt cacatccaat gctgtggttg gttcatctgc aatcagcagc 10741 aatgggttgc acgaaattgc cattgcgatc atcacccgtt gcaactgtcc gccagaaagt 10801 tgatgcgggt aacgttctag gatagcttct ttgtgctgtt tcaccaactg tgctatcttt 10861 tgctcgtctg gagttgatga accaaaagag gtttctttcc aagtttcgag atactgttgc 10921 ttgagggctt catcgctagg gaggagtttt acttcttgta gaccggcgat cgcaattcgt 10981 cgtgcttcag atgccgaaac attctggtgt cgcaaaatag cttctgtaat ctgaaaccca 11041 atcgtgtaaa ccggattgag agaactcatc ggttcttgga aaatcatcgc aatgtcgccg 11101 cctctgtgga gctgtatttg ctcgttaggc aattgggcta aattgatcgg tgcgccattc 11161 tcttgagggt 
gaaaccaaat ttcaccgcca ctaattctac caggactctg aagcaacccc 11221 ataaccgcta gggatgtgac tgattttccg cttcccgact ctcctactat tcctagagtt 11281 tctcctcgat gtagctcaaa agaaatccca tctactgctt tgacactttt actctcaccg 11341 gaaaattcaa cttgtaaatt gcgaacgtct aggatagttt ctctcataag agtcgggtac 11401 ttaaattgta acttataaat tatttggatt ttaacagcga ctttgatttt ttttattgat 11461 ctgaatctta ggtacgacga tatttttctg atttttacag atttcctatg ggatttggca 11521 ttaaagatta caaaaaataa cttaagctgt cgcggatata gcggatttta attgagtgcg 11581 atattgggaa aatggtaaag gcacagcatt cctgcgcccc tagacgtggt gtacgtaact 11641 gagaacagga aaactagtga aacccacacg taatatatcc taggacttgc gcacgagtta 11701 cgaaagaaca agactgtgag attgcttcct gacgtcgcaa tcacgcaatt acgttatttt 11761 tgcgtaagtc ctgatatccg ttacattcgt gagccgattt tttggaattc aactctttgc 11821 acaaaaggaa gataatcgtt gaatcaaaat cacaagattc ttcgtcactg gaattatcaa 11881 ctcccaacat gaaaattcac cagcttttta gatgtgtcta tgctatctca acctgtaaaa 11941 tttctttttt tgagtgctgt tagtttatcc tttgttctag gtcatgcagc tgcatttgcg 12001 caagttcgtg gaggtggtga tattgttgta ccaacacaac cagcaggcgg tggctcaaca 12061 acaactagaa caaggacaat agaaacagat ggatcttcca gcacaacgag aacgaccact 12121 tcatcaccta ttagtagcag taatcgattt ttctgtcagt cttacaacgg tcagtacact 12181 gttatgtacc agccagaaag tcaacctggt caatacttcc cttgggcgac tcctaggact 12241 ttgggtggcg gctgggatgc acaacagcgt tgccaagcaa ttgccgagcg cttggaaaca 12301 tatcgcccag atggcttagt ggaactgaag acagcgatag agaacagaca aaatattctc 12361 tgcgttacca cagaaactaa tccttactgt cgcattgtgc tgacagtacc tcccgaaaaa 12421 gacccttatg ttgtccgtaa tagcgttttc caaaacttag catctgctga tagcggacag 12481 caaacttttg gcgttaacac ttatacaagt ggtaatgacg atctctccaa tttggggcga 12541 aatatttttg gtggtggcaa aaaaccctcc acttcttcta aagacccaat caatctcaaa 12601 cctttcttag atcgtgctga tcgcggtact ggagaaaaac tccgtaatgg tgtatcgctt 12661 aatcgtcagc agtctcaacc tcaaactggc aatcgtctag atcctaaaaa atttcgctga 12721 ttgattcaaa aaaaatgaat ttcgcgccaa gacgctcaga tgcaagatgg caacggaaga 12781 gtagccgtta ctcgtttggg ctttttttat gtcatccctc ttaaggtaga aatctagatt 
12841 accctaaaaa ataaaacccc ctataggggg ctagaaactg tataggtttt tcggagcggt 12901 tggttaggta attgagtaga aaaactcatc accaatactc aaagcagata cagcaaaccg 12961 aagaggataa tccaaatcac gtcaacgaag tgccagtaga tttcggcggc ttctatacca 13021 aagtgatgtt cattgctgta gtgacctttg ctgagcgatc gccacaatac cgcaagaatt 13081 gccacaactc cgagagtcac gtgcaaaccg tggaagccag tcaaaacata gaatgcgctg 13141 gcaaataagt tagtggtcaa gccaaattct agatgggtat attcatacac ctgccccacc 13201 aagaaaagaa tacccattgc agcggtaatc ccgaaccaga tttgcattcc ccggacatca 13261 ttctttttga tggcggtatc agcattgtgg ataacaaaac tgctagcaat tagattgatt 13321 gtgttgactc cgggtaacaa tagttctaac tctggggtac cttctggagg ccaaacaggt 13381 accgtagaac gcatagccag atacgctccg aacaacccca tgaaaatcat tccttcagcg 13441 accaggaaca taaccagtcc aaagaggcga tggtctggat gttcttcgtg atcagcttct 13501 gcggtagcag tgtgatgatg gttaagggct gtctttgctg ggtcaattgt ttgactttgc 13561 atgaattttc cgtaaattat gaattacaaa ttatgaatga tgaataaaaa aaatctttta 13621 gcattcatca ttcataatct atcagcgttc ttctttttca gcagtgatag cgggagatgg 13681 ctcgtcaggt tcggcgcgta acactgagtt aacaccggca gacaagactg gatcgggatc 13741 agataaaggt acaccctttg cagtgttctc caaaccataa tcgtatggtc cagtcgctaa 13801 aactggtggt ttatcaaaat tctcgatcgc tggtggtgag gttgtcatcc actctagggt 13861 aagtgcatcc cagggattat tacctgcttt gggtccgtac atccaactcc aaatcgcatt 13921 gatgatgaag gggaatgtcg aaacagcgag tatataagcc ccataggtac aaatttcatt 13981 caaaaacgta aatttggggt cgtactgagc aatgcgacgg ttcataccca tcatccccag 14041 cttgtgcatg ggtaagaagg tcatgtttaa accgacgatt gtcaaggtaa aatgaacctt 14101 accccaaaat tcgttcaaca tccgccctgt cattttgggg aaccagtggt agattgccgc 14161 ataaatgccc agaacactac caccaaacag gacatagtgc aagtgtgcca cgacaaaata 14221 tgtgtcgtga acgtgaatat cgaacggtac cgccgccaac atcacgccac tgataccacc 14281 aatgacgaag gtgccaacaa aacccatagc aaatagcatt ggactgttga ggcgaatttt 14341 tccaccccac attgttgcca accagctgaa gattttgatc cccgttggta cagcgatgat 14401 catggtagtg atcatgaaga acatccgcaa ccaaccaggg ataccgctgg taaacatatg 14461 gtgcgcccag acgatgagtc ctaggaagct gattgccaaa 
cttgagtagg cgatcgcctt 14521 atatccaaaa attggcttac gcgaatgaac cggaatcacc tcagaaatcg ccccaaagaa 14581 gggcaaaatc atgatgtaaa ccgctgggtg ggaataaaac cagaacatat gctggtacac 14641 aaccggatcg ccaccgccag tcggattaaa aaatgttgtg cctgcaagta agtcaaacga 14701 gagcagaatt aaaccggctg ctagcacagg cgtagagacc aaagtcagcg ccgaggttgc 14761 caacattgcc cagcaaaaca aaggcaattg aaagaaaccc atgcctggag tacgcatttt 14821 aagcattgtc accaggaaat taattgcccc cagaatcgaa gacgtaccta gcaaaagaac 14881 gctcagaatc cagatcccct cacctacttg acctgtgacc aagcttaagg gagggtagga 14941 agtccaacct gcatctggtg catcacccac cactaaacta gcgatgagca acaaaccagc 15001 aggaggaatc atccaaaagg caacagcatt caggcgtgga aatgccatat cctttgcccc 15061 aatcatcaag gggatcagga agttagcaaa tcctgcgcct gcgggcacaa tccacaaaaa 15121 aatcatgatt gtggcgtgca gtgtaaacaa actgttgtac acctcagggg tgacaaaatc 15181 gacgtctggg gttcgcagtt ccgtgcgaac caagtcagcc atcacaccgc caatacagta 15241 gaaaatgaac gtcgtgacca ggtattgaat tgcaatgacc ttgtggtcgg tgttgaaggt 15301 aaagtagtct cgccaaggtc ttatccctgg ctcttctatc agagcaggga tattggcttt 15361 ttcttgtacc tgtgcttgtg tcataagaat tagttgtcag aagtcaggag tcaggagtca 15421 ggcaatacgg ttcggctagc ttgtttatct caagggagat ccccccaacc cacgccagac 15481 gctaccctgc gggaagccgc ccggagggcg tctacaagt // LOCUS NODE_2173_length_15435_cov_5.26150815435 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15435) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15435) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage       :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..15435
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            403..1482
                     /locus_tag="DP116_18605"
     CDS             403..1482
                     /locus_tag="DP116_18605"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017318361.1"
                     /note="Derived by automated
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="radical SAM protein" /protein_id="PRJNA477356:DP116_18605" /translation="MIADQVLTPNTPLRQSKLWMPERVLFTPAALSEPWGKQILARVE SLNLPVEELARNRLTGLRGESERDTYDIAKRTLAVVTAPPSSMKLSPIPPSADWQFHL AEGCPAHCQYCYLAGSLQGPPVIRVFANLPQILENLAAYEQPGKSTSFEVSCYTDPLG IEHLTGSLAQCIRYFGTRDDAHLRWVSKFDAVDDLLNLPHNGHTRCRISVNAAPVSGR FEGGTASVSSRLMALRRLALPQEQGGGGYPVGLVIAPIMHIDDWQIHYGRLFDQISEA LDFDCDLTFELISHRFTPGSKEVLQTWYPHSKLDMDEDKRSVKRNKFGGTKYVYDTDT MKAMKRFFESEIGRRFPNAKILYWT" gene complement(1517..2359) /locus_tag="DP116_18610" CDS complement(1517..2359) /locus_tag="DP116_18610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015189155.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid aminotransferase" /protein_id="PRJNA477356:DP116_18610" /translation="MTFAKNLIYYVNGKYISSDQASLPLNDLGIVRGYGVFDYLRTYN GIPFKLQEHVQRLQKSAELIGLSLPCSTEELEAITQETLRHNNLPESNIRIVVTGGSS ADFITPPEQPSLVVIVTPVTQYSAQYYEQGVKVITVQMERFIPQAKTLNYISAIMALQ QAKRANAIDALYVNQQSHVLEGTTTNFFIFRDSQLITSQENVLHGITRKVVLELAIKK LKVVERPISYSELKDCDEAFITSSNKEIMPVVQIDDLQISHGKPGENTQLLMHLFQDY TRGL" gene complement(2619..3398) /locus_tag="DP116_18615" CDS complement(2619..3398) /locus_tag="DP116_18615" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18615" /translation="MVALKQHGHQGTIHVVSRHGLLPHRHKPDVTHDRFHPQTAPQTR ALVRKIREEVQALAAQNQDWRTVIDSLRPVTQHLWQTLSVDEQRRFLRHLRSYWDIHR HRIAPEIADVVDELRNSSKLVVHAGRIASYHEVTDGVDVTIHKRHTKDSVVLRVSRVL NCTGPTSNYEKLQHPLVDNLWQQRLLCPHTLGFGIKTAENGALLDHKDAPSKWLYTLG PPRIGDLWETMGVPMIRVQANALAQEFLEQLETEGFKLLDI" gene complement(3597..4892) /locus_tag="DP116_18620" CDS complement(3597..4892) /locus_tag="DP116_18620" /EC_number="4.3.2.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenylosuccinate lyase" /protein_id="PRJNA477356:DP116_18620" /translation="MIERYTLPEMGNLWTEAYKLKTWLQVEIAVCEAQAELGYIPTEA VEEIKAKANFDPKRVLEIEAEVRHDVVAFLTNVNEYVGDVGRYIHLGLTSSDVLDTAL ALQLVASLDVLMQRLEDLIEVIRQKAKEHRTTVMIGRSHGIHAEPITFGFKLAGWLAE VLRHQERLKILRETIAVGKMSGAVGTYANIEPRVEAIACQKLGLKPDAASTQVISRDR HADFVQQLALLAASIERFAVEIRNLQRTDVLEVEEFFSKGQKGSSAMPHKRNPIRSER LTGMARLVRSHAGAALENIALWHERDISHSSVERVIFPDSCILTHFMLKEITELVKNL LVYPENMERNLYCYGGVVFSQKVLLALVDKGMSREEAYAIVQQNAHTVWNKADGNFHD LIVKDSRVTQKLSPAEIETCFDPQQHLRHLEQVYQRLGI" gene complement(5764..6726) /locus_tag="DP116_18625" CDS complement(5764..6726) /locus_tag="DP116_18625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318352.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YihY/virulence factor BrkB family protein" /protein_id="PRJNA477356:DP116_18625" /translation="MNLKAVVELFQETFQQWSKDKASRLAAALSYYTIFSIAPLLIIV IAIAGAVFGEAAAQGAIVGQLQGLVGKPSAQVIQTAIQNASQPKAGTIASIISVIVLL FGATGLFTELQDALNTIWEVQPKPGRVMKNMVRQRVTSFAMVLAIGFLLLVSLVISGV LAALVGYFKNIVPGVDFIWQFVNFIVGFAITTLLFGLIFKVLPDVKITWSDVLTGAAL TAFLFSIGRYLLGQYLGNGSFGSAYGAAGSVVIILAWVNYAAQILFFGAEFTQVYARK YGSRIVPDKHAVPLTENARLNQGMKPNNRNERTQKRKHSGDDSN" gene complement(7087..7791) /locus_tag="DP116_18630" CDS complement(7087..7791) /locus_tag="DP116_18630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320144.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_18630" /translation="MILIESKINRTYVQERLLTTLLGGIPNIALGGLLRNLVYRSLFA RLGKSVNIQHCVELLGSSCIEIGDQVRLAKDVQINASGDPKNRVSLANRVKLQRGVDI RSLHNTRITIDEDTYIGPYVCIAGPGDIKIGKACLIAPHSGIFANNHIFADPTQRIGD QGVTRKGIVISDDCWLGHNVTVLDGVTIGKGSIIGAGSVVSKDIPPYSIAVGAPARVI KSRLEESSYGTVQEAA" gene complement(7788..10493) /locus_tag="DP116_18635" CDS complement(7788..10493) /locus_tag="DP116_18635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316340.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PAS domain S-box protein" /protein_id="PRJNA477356:DP116_18635" /translation="MLQEDLVISSSHDSRLVTLSIVIAVLASYTALDLAGRVTAAKTS ARMAWLIGGGIVMGIGIWSMHFVAMLAFSLPIPMYYDMWTVVVSIVPAIIASLGALFL ASRRVLSIWQLLIGGTLMGIGIASMHYIGMYAMRMEASTEYNPPLFVLSVVIAIGASI IALWIAFQLRMQTSTTVGWTKLGSALVMGGAIAGMHYTGMAAANFQATNLQAFTNSQA ITNSLTWLAIGIGVATVVILGFALLTSFVDQRLAASAKLLEQQEVETIRSQQFIEITL GIRRSLHIDDVLNTTVNEIRQALTTDRVVIYRFNSDWSGTIIAESVAEGLVKTLGQKV NDPFRQDYIELYKSGKVRATNNIYKAGYTDCHKKILENFQIKANLVAPILKNYQLTGL LCAHQCSEPRKWQNSEVDLFGQLAIQVGIALEQASLFDELQQAQKVLRLRDRAIAAAS NAIFITDSHQSDNPIIFCNPAFETITGYSQEDVLGCNYRFLLGTDTDQTIVEQIRDAM RDSSECQVTLKSYRKDSTPFWCELTVSPVRDAFGRVTNFIGVLSDITLRKQAEEGFRH TKEVLQRQLLELMTDVKQATHGDLTVRAKISVGEIGVVAEFLNTIIDSFQQIVTQAKT AADQVNVSIGQNSSAIQQLTDQALTQVDEISHTLEQIDNMTVSIQTVAESARQATEVI DETYNKAQTTAKAMDFTLDTIWNLQQIVVETANRVKCVSENSQKISSLVSLMKQNDMQ ANLLAINAGVEAAWMSHANRMFITVAEEIAQLVAKSAEVSTEISQIFENIQSETREVV KAIEQGTTQMVGGVKVVENAKLDLNEIIEVSRQINSLIELIFTETVSQTKISQTVVSS VKEVAEACERSADSSGIVSHSLQQTQEVALELQDSVSVFKTGLSA" gene 10998..12443 /locus_tag="DP116_18640" CDS 10998..12443 /locus_tag="DP116_18640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875464.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_18640" /translation="MPPLLSERYHIISVLGSGGFCDTFLAEDTQMPSARRCVIKQLKP VNDNPLIEEVVQKRFRREATILEDLGAASHQIPTLYAYFQKFGQFYLVQEWIDGQTLF QKVQENGCLSESEVVSILISLLDVLEYVHDKGLIHRDIKPDNIIMRSSDRKPVLIDFG AVRETMGTVMNPEGTVSSSIIVGTPGFMPNEQAAGRPVFASDLYSLGLTAIYLLTGQL PQQMTTDLYTGESIWQRDGVSPSLAAVLDKAIRNNVRERYPSARAMIYALQSIASSLP SYVAKRTQPPAQTVPTTLRSAAQNRRHNSIFLGSIFMIGTLFGTSIILALLLTNFRQP TVYNKELSSGLVTKPQGLDAPSTLSPSVSASSQIVQTNKQQNINSGALPSSFHFIADS SFPHLQNAVKQTKTLQAAGYSQTGVFWIPDYPNLVDKHLFVVYVTTFSDRSSCLNFLR DYGKVNPNAYCAFASKDPKAPTARLSFREIE" gene 13360..15375 /locus_tag="DP116_18645" CDS 13360..15375 /locus_tag="DP116_18645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_18645" /translation="MKRSLSILTLFSFLVPYAVVAAPPRTPDKTVNCDILVVGGGLSG VATAYEAILAGQTVCLTEITDWLGGQISSQGTAALDERPTQRRKLFYSRGYLELRKRI ENKYRGNINPGNCWVSDSCFLPRDGHEILTSMLKDAEKKGKGKLQWFPNTVIKELEYS SDGKLISSAIAIQHQPVQGAPPLNTFPLSQTIEDAYTYQNSSRFAKTIVRLVPKQTKS KAGSNAPNWYVVDTSETGEIIALADVPYRLGIDARSYLEPSSSSTQNDSYCTQGFTYT FAMEATDNPQTQTMPAFYPQYAPYYSYELKRLASFPLVFTYRRIWSPTKGQPMEFGGV KFTGPTPGDISMQNWTWGNDYRPGTSADNLIYNRQQLQSAGQLQPGGWMGGLRKEALR KAEEISLGYYYWLTAGNTDSQLGNGVKQPQPNNRFLSGLDSPMGTAHGLSKHPYMREG RRIIGRPSWGQPEGFSIWEIDISRRNYDDEYYRKTLPPDTYRRLKAALGGLEAASVLS GQVSPDKVARRTRSTIFPDAVGIGHYAIDFHPCMTKSPPETPGNTDRAGERRGAGQAY PFQIALRAMIPQKIDNLLVGGKSIGTSHIAAAAYRVHSFEWSAGAAAGTTAAFALKNG VAPYQLVEKLPLPEPRLQLLKQLLERNGNPTAFPDTSIFNQNWDDWK" BASE COUNT 4394 a 3372 c 3214 g 4455 t ORIGIN 1 agctccttct tttcgtgtcc tcaataacgt tttcagcctt aactgaaccg tattgggtta 61 taattgtgtt aaaaacgtca tcccaagttg gttgaagctg aaacaattca gactcagcaa 121 taggtgaata gtaaataaac actatgttat aaaatagaat caatagaaga gctattcttc 181 acggtgccat caccttggca ggaacattag ttagagtcag caacaattgc 
aacagaaaat 241 tactacctct acttgggcag aaactactcc atacagtggg tacggagtta tttttcaaga 301 ttctacaata gaattatacg acgtaaagaa ttaatgcact tcacctgctc ttgataacgc 361 tcaagcctta catttattaa gaagaattag caagataaat aaatgattgc agaccaagtt 421 ctgacaccaa atacaccttt gcgacaatca aaattatgga tgccggagcg ggtgttgttt 481 acaccagctg ccctatctga accgtgggga aagcagatat tagcacgcgt ggagtctctc 541 aacttacctg ttgaagagtt agcgcgaaat cgcttgacag gtttgcgtgg agaatctgaa 601 cgcgacactt atgatattgc caaacgcact ctcgcagttg tgacggctcc acccagttcc 661 atgaaactta gccctatccc tccatcggct gactggcagt ttcatcttgc cgaaggctgt 721 ccagcacact gtcaatattg ctacctagca gggagcttgc aaggcccacc agtcattcgc 781 gtgtttgcca atttaccgca aatcttggag aacttggcag cttacgagca accagggaag 841 tcaacaagtt ttgaggtaag ttgttacaca gaccctttag gtatcgagca tttgacagga 901 agccttgccc aatgtatccg ctactttggt actcgtgatg acgcgcatct acgatgggtg 961 tcgaagtttg atgctgtgga tgacttactc aatttgccac acaatggtca tacacgctgc 1021 cgaataagtg ttaacgctgc gcctgtttct ggtcgctttg aaggtggtac agcgtctgta 1081 tcatcacggc tgatggcgtt acggcggtta gcgcttccac aagaacaagg tggtggcggg 1141 tatccggtag gtttggttat tgcgcctatt atgcatatag atgattggca gatacattac 1201 ggtcgtctgt ttgaccagat tagcgaagcg ctagattttg attgtgattt gacttttgaa 1261 ctcatatcgc accgctttac acctggatca aaagaggtat tgcaaacgtg gtatcctcat 1321 tccaaactag acatggacga agacaaacgt agtgttaagc gaaataagtt tggcggtacg 1381 aagtatgtct atgatactga cacgatgaag gctatgaagc gcttttttga gagtgagatt 1441 ggacggcgct ttcccaatgc gaagattttg tactggactt agtttgtagt cagcgctgtc 1501 aacgctttag cgttgactac aaacctctag tataatcttg aaacaaatgc atgagaagtt 1561 gagtattttc tcctggtttt ccatgagata tctgtaggtc gtcaatttga actacaggca 1621 taatctcttt atttgaagat gtgataaaag cttcatcaca atccttcaat tcgctgtaag 1681 aaataggtcg ctcaactacc tttaatttct ttattgcaag ttccaagaca acttttcttg 1741 tgataccatg aagaacattt tcttgtgatg taatcaattg agagtctcga aaaataaaga 1801 aatttgtcgt cgttccttcc aagacatgac tctgctgatt aacatatagt gcatcaatag 1861 cattggcacg ttttgcttgt tgtagtgcca taattgcaga aatataattc agagtttttg 1921 
cttgaggaat aaatcgctcc atttgcacag tgataacttt aactccttgt tcataatatt 1981 gggctgagta ttgggtaaca ggagtgacaa taaccactaa gctaggttgt tcaggaggag 2041 taataaagtc agctgaagaa ccaccagtta cgacaatgcg aatgttagat tctgggagat 2101 tattatgtct aagtgtttct tgagtaattg cttctagctc ttcagttgag cacggcaaac 2161 ttagaccgat taattctgct gatttttgca gcctttgaac gtgttcttgt agtttaaatg 2221 gtattccgtt gtaagtgcgt agataatcaa agactccata tcctcggaca attcctaaat 2281 cattcagtgg tagggaagct tggtctgatg agatatattt tccattaacg taatagatga 2341 gatttttggc aaaagtcata acgatatttg aaatttcaaa ccacagatgc acacagatta 2401 acacagatga ttctctcgtt cctatgctct gcatgggaat gcatcaaagg aggctctggc 2461 tcccaatgat tatattgaag tagagtctca agttatgcat tccttgcctg agggaaggaa 2521 cgagaaaact gtattaagct aatgtacttt taatttgcat aatcagggtg gacaagatgc 2581 ccactccaca agggttttac ttttagacaa gatgcaactt aaatgtccaa cagcttaaaa 2641 ccctcagttt ctagttgctc tagaaactct tgagctaagg cattcgcctg tactcgaatc 2701 ataggtactc ccatagtctc ccaaaggtcg ccaatacgag gtggaccaag cgtgtagagc 2761 catttggaag gtgcgtcttt gtgatctaaa agtgctccat tctcagccgt tttgatgcca 2821 aatcccaacg tgtgggggca aaggagtcgt tgttgccata gattgtctac cagaggatgt 2881 tgcaatttct catagttgga agttggtcca gtacagttga gcacccgact cactcgtaac 2941 acaacactgt ctttggtatg ccgtttgtgg attgtcacat cgacaccatc agtcacttca 3001 tgataggacg caatacgacc agcatgtacg actagtttgc tggagttccg taattcatcc 3061 actacatctg ctatctctgg cgcaattcga tgacgatgga tatcccaata ggaacgaagg 3121 tgacgtaaaa atcgacgctg ttcatcaaca gacagcgttt gccataaatg ttgagtgact 3181 ggacgcagtg agtctatgac tgtccgccag tcctgatttt gagctgctag tgcttgaact 3241 tcctcacgga ttttgcgaac taaagcacga gtttgaggcg ctgtttgtgg atggaagcga 3301 tcatgagtca catctggttt atgacggtgg ggaagtaacc catgacggga aacgacatga 3361 attgtccctt gatgtccatg ctgctttagg gcaaccacag aaattatcaa ctttcctacc 3421 aggaaatctt agttaaggct caacgtacca cccaaatctt gtatatactc ggtttgatac 3481 tgatgagtgg atttttagtt tcagttttga ggttagttga aatctaacac taagccagtg 3541 ttttgttaca tcagaagtag gtgcgttagc aaagcgtaac gcaccccgta ctaagactaa 3601 attcccagtc 
tttggtaaac ttgctctaaa tgtcttaaat gctgctgtgg atcaaaacac 3661 gtctcaattt ctgctggcga caacttttgg gtgacgcgag agtctttgac aatcaagtcg 3721 tggaaattgc cgtctgcttt gttccaaacg gtgtgagcat tttgttgcac aattgcgtaa 3781 gcttcttcac ggctcattcc tttgtctacg agtgctaata gtactttctg gctaaagacg 3841 acgccaccgt aacagtacaa attccgctcc atgttttcgg gataaaccag caggtttttc 3901 accagttcag ttatttcctt caacatgaag tgagtcaaaa tacaactgtc tggaaaaatc 3961 actcgttcta cagaactatg agaaatgtct ctctcatgcc atagtgcaat attttccaaa 4021 gcagcaccag catgagagcg aactaatctc gccattcctg tgagtctttc tgaacgaatg 4081 gggttgcgtt tgtggggcat tgctgaggag cctttttgcc ctttggaaaa gaattcttcg 4141 acttctagaa cgtctgttct ttgaaggttg cgaatttcga cggcgaagcg ttctatggat 4201 gcggctagta aggctaattg ttggacaaag tcggcgtggc gatcgcgcga aatcacctgt 4261 gttgatgctg catcgggttt gagtccaagt ttttggcagg cgatcgcctc cacgcgaggt 4321 tcaatattcg cataagttcc caccgcacca gacatcttac ccacagcaat tgtttcacgc 4381 aaaattttca aacgttcttg gtgtcgcaac acttctgcta accacccagc cagcttaaaa 4441 ccaaaagtga taggttcagc gtgaataccg tgggaacgtc caatcatcac cgtcgtgcga 4501 tgttcttttg ccttttgacg aatcacttca atcaaatctt ctaggcgttg catcaacaca 4561 tccagactcg caacaagttg cagtgctaaa gccgtatcca gcacatcaga actggttaaa 4621 cccaagtgga tgtaacgccc cacatcacct acatattcat tgacatttgt caagaaagca 4681 acgacatcgt ggcggacttc agcttcaatt tccaatactc gctttggatc aaaattcgcc 4741 tttgccttaa tttcttcaac tgcttcagtt ggaatataac ccaactcagc ctgcgcttca 4801 caaacagcaa tttctacttg cagccaagtt tttagtttat atgcttctgt ccagagattg 4861 cccatctcgg gcaaggtata acgctcaatc acagtccgcc acagggtaca accgtcatat 4921 tgtacaaaga acgcgggaac tcataaagag tatgtatcag acttttatca aaagtttgct 4981 cttactctta acaaacacag ggaacacttc gacaagctca gtgcatcgca gtgaaaactg 5041 gtaactgata actgacgact gataactgat aactgataac tggtaactga taggggatgg 5101 tgtttctttc atatgttcct agctgtttct tgcgctagtt aaaggctcat aagattgact 5161 tgtttgatgt gtacactgtc acatatataa aatatatatc tttaggcagt agtactattg 5221 tttaccctgg taaaaataca agctggtatt tgtagagtaa tctctgtgta gagtataact 5281 ccaacaattc agggttttct 
tatttataag tataaaaatt atctttttaa attctattcc 5341 ttctggatgt gggtatatag ctaactatca ggcaaaataa aattgctatg cttctccagc 5401 atttctcttc tcaattacga ctaaaccctc tagagaaaac tagaggtttt ttttgaagtt 5461 tttttctgat ttattggaac gatttcattt ataagtagaa taaagacttt gctccacatt 5521 cacattgata agtaggtaaa cattgaaaaa cgtaaaacag taagagctaa aagccataca 5581 taaagttctt tttactttca aattccgact tccaaattcc ttcttccaat taacttgaag 5641 aatttgtgaa ccaaggttat tcaactcttg gctcaactca ctgagcgagg tagttacttc 5701 cctgactgac tacgactagc gggttaaatg cagtatctgt cacaaacatt tttggaattg 5761 ctattagtta gagtcatcac cagagtgttt tctcttttgt gttcgttcat tcctattgtt 5821 tggtttcatc ccttgattaa gacgagcatt ttcagtcaga ggaacggcat gtttatcagg 5881 aacaatgcgg gaaccatact ttctcgcata aacctgagta aattcagcac caaaaaagag 5941 aatctgggca gcataattaa cccaagccag gataattacc acagagccag cagcaccgta 6001 ggctgatcca aaactgccat tacccaaata ctgtcccaaa agatacctgc caatagaaaa 6061 caaaaatgcg gtgagggcag ctccggtcaa aacatcactc caagtaattt taacatctgg 6121 caggactttg aaaataagtc cgaatagcaa tgtagtgatg gcaaaaccaa caatgaagtt 6181 gacaaactgc caaataaaat cgacaccagg tacgatattt ttaaagtaac caactagcgc 6241 tgctaaaacc ccactaatca caagtgacac gagcaataaa aaaccaatgg ctagcaccat 6301 cgcaaacgag gtaacgcgtt ggcgaaccat gtttttcata acgcgtccgg gttttggctg 6361 cacttcccaa atcgtattta gggcatcttg caactcggta aataaaccag tcgcaccaaa 6421 cagcaggact attacactga tgatggaagc gatagttccc gctttcggct ggctggcatt 6481 ttgaatggct gtctggataa cttgtgcgct aggtttgccg actaaacctt gaagttgccc 6541 tacaattgcg ccctgtgccg ctgcttctcc aaagaccgca ccggcgatcg caattacaat 6601 aatcagtaat ggggcaatag aaaagattgt gtaataagat agtgctgcgg ctaaacgcga 6661 cgctttatcc ttactccatt gttggaatgt ttcttgaaac agctctacaa ctgcctttaa 6721 attcatcaaa atatctcctt ttgcggtaat gtgcttacca gcacaagcga gcggtactaa 6781 aaatattata agtattgcat ctttctcaag agttgatgta aagttcttta tgcttaccca 6841 caattctcgc ttcaccccac cccaattttg tctaacgcca aaacctcccc ttatgaaggg 6901 gagccagtgc gttgcggtga gtccagcgct gcgggagggt ttcccgacag ccaggcgact 6961 ggcgaacccg gaggggggtt ccccccgttg 
tagcacctgg cgtgagggga ttaaggtgtg 7021 gggtcaaatc aacgtaagat aaaggtttga acgttaagtt gacaccaatg gctctcctgt 7081 ttgagcttag gctgcttcct gtaccgtccc atagctactt tcttctaaac gacttttgat 7141 gactcgtgct ggtgcaccca ccgcaataga atagggggga atatctttgc tcacaactga 7201 gcctgcacca ataatgcttc ctttaccaat ggtgactcca tctaagactg tcacgttatg 7261 ccccagccaa cagtcatcgg aaattacaat tcctttgcga gtcacccctt gatctccaat 7321 tcgttgagtg ggatcggcaa aaatatgatt gttagcaaat attcctgagt gaggtgcaat 7381 taaacaagcc ttaccaattt taatgtctcc aggacctgca atacagacgt aaggacctat 7441 gtatgtatcc tcatcaattg tgatacgtgt attgtggagt gagcgaatat caactccacg 7501 ttggagtttc acccgatttg ccaaagacac tctatttttt ggatctcccg atgcattgat 7561 ctgaacatct tttgctaaac gtacttgatc gccaatttct atacaagaac tgcccagaag 7621 ttcaacacaa tgctggatgt taactgattt acctaaccga gcgaaaagac tccgatatac 7681 caaatttctt aacagtcctc ctaaggcaat attaggaatt cctcctaata aagtcgttag 7741 tagacgttct tgcacatatg ttcgattaat tttagattca atgagtatca tgcgcttaac 7801 cctgttttga agacactgac tgaatcttgt aattctagcg ctacttcttg tgtttgctgc 7861 agagaatgag acactatacc agaggaatcg gcactacgtt cacaagcttc agcaacctct 7921 tttacggacg acacaactgt ttgagagatt tttgtctgag aaactgtttc tgtaaagatt 7981 aactctatca aagaattaat ctggcgagac acctctataa tttcatttag gtcaagcttg 8041 gcattttcaa caacttttac tccgcccacc atctgagttg ttccttgttc tatggctttg 8101 acaacttccc tggtttcaga ttggatgttc tcgaaaattt gtgaaatttc tgtgctcact 8161 tcagcagact tcgcaaccaa ttgagcaatt tcttccgcta ctgtgataaa cattcgattt 8221 gcatgactca tccacgcagc ctcaacacca gcattgatgg ctaacaaatt ggcttgcata 8281 tcattttgct tcatcaagga tacaagggag gaaatttttt gagaattttc gctcacacac 8341 ttgactctgt tggctgtttc cacaacgatt tgttgtaaat tccaaatagt gtcgagcgta 8401 aaatccattg cttttgcagt cgtctgtgct ttattataag tctcgtcaat aacttctgtt 8461 gcttggcgag cactctctgc gactgtttgg atcgacacag tcatgttatc tatttgttca 8521 agagtgtggc taatctcgtc aacctgtgtg agtgcttgat ctgttaattg ttggatagca 8581 ctagagtttt gcccgataga aacattcact tggtcagcag ctgttttggc ttgagttaca 8641 atctgctgga aactgtcgat aatagtattt aaaaactcag 
caactacacc aatttcacca 8701 actgaaattt tagctcgaac ggttaaatca ccatgagttg cttgtttgac atcagtcata 8761 agttctaaaa gctgcctttg gagaacttct ttagtgtgcc gaaatccttc ttccgcctgc 8821 ttgcgtaaag taatgtcaga tagcacgcca ataaaatttg tgactcgtcc aaatgcgtct 8881 cgcactggag aaactgttaa ttcacaccaa aatggggtac tatctttgcg gtaactcttg 8941 agagtaacct ggcattcact cgaatcacgc atagcgtcgc gtatttgttc aacaatcgtt 9001 tgatctgtgt cggttcccag taaaaagcgg tagttgcacc caagcacatc ttcttgtgaa 9061 taacctgtga ttgtttcaaa agcaggattg caaaaaatga tcggattgtc gctttggtgc 9121 gaatcggtaa taaagatggc gttactagct gctgcaattg cccgatcacg aagtcgcagc 9181 actttttgcg cctgttgaag ttcatcaaaa agactggctt gttctagggc gattccgact 9241 tgaattgcta attgtccaaa taaatcaact tcagaatttt gccattttcg aggttcggag 9301 cattgatggg cacataataa ccctgtgagc tgataattct ttaaaatagg tgcaactaaa 9361 tttgccttga tttgaaaatt ttctagaatc tttttatgac aatctgtata acccgcttta 9421 taaatgttgt tagtggctcg aactttacca cttttgtaca attcaatgta atcttgccga 9481 aaaggatcat tgactttttg tcctaaagtt ttcaccaaac cttctgcgac tgactcagcg 9541 atgatagtac cactccagtc agaattgaag cgataaatca caacacgatc tgttgttaat 9601 gcttgacgaa tttcattgac ggttgtattt agaacatcat ctatatgaag cgatcgccga 9661 atacctaggg taatttctat aaattgctgg gaacgtatgg tttcaacttc ttgctgttcc 9721 aaaagcttcg ctgaggctgc caaacgctga tcaacaaacg aagtcagcaa cgcaaaaccc 9781 aagataacaa cagtggcgac accaatacca atagccagcc aggtgaggga gttagttatt 9841 gcttgggaat tcgtgaatgc ttgtagattt gtagcttgaa aattagccgc agccatccct 9901 gtatagtgca tcccagcaat tgcccctccc atgacgagtg cactgccaag ctttgtccac 9961 cccacagttg tactcgtctg catgcgtaat tgaaatgcaa tccatagtgc tataatcgac 10021 gcaccaatgg cgatcaccac agaaagcaca aacagcggtg ggttatactc ggtgcttgct 10081 tccattcgca tcgcgtacat tccaatgtag tgcatcgacg caataccaat acccataagc 10141 gtaccgccaa taagcaattg ccagatgctc aacactcggc gactggcaag aaaaagtgcg 10201 ccaagcgagg cgatgatggc aggcactatt gacaccacca cagtccacat atcgtaatac 10261 atcggtatcg gcaaactgaa ggcaagcatg gcgacaaagt gcatcgacca gataccgatt 10321 cccatcacaa tcccgccgcc aattagccaa gccattctcg 
ccgatgtctt ggctgccgtg 10381 actcgcccag ctaaatcaag agcagtgtat gatgcaagaa ctgcaatgac aattgaaagc 10441 gttacaagcc gtgagtcatg gctactactg ataactaaat cttcctgaag catcagctta 10501 acctcaagct atatgatttc cgttctttta ctttttcgtg gtttgatgca catgtttatt 10561 gagctttcaa tcccgctaaa agcatcacta ttttggcact ttttttagat aaattttgct 10621 aattcataaa gacgcattat aattgagtga aataactctc aaaacaaata atttagatag 10681 ggatatatag atttttattg ttttcctttt ttcttcctac ttttcctact ttatttttct 10741 ttatcaaaat tttaaaaata cttattccca atttaaaatt tgaattttag atgaaattta 10801 acagtttatc aaaataggaa tttcttgtga tattctgcaa gtggcaactt tatgtgaaaa 10861 gtaaattatc ttcagtaata acttgacaaa agtcatagat tgtacattat aaaaaactcg 10921 atttgtagaa attacgattc tgtcttgtca atcatttaga aaacacaaaa attaatatga 10981 atcaagaaaa aagaacgata ccaccattac tcagcgagcg ttatcatatc atcagcgtac 11041 tcgggtcagg tgggttttgc gacacattct tagcagaaga cacccaaatg ccctcggcac 11101 gccgttgtgt gattaaacaa ctcaaaccag tgaacgacaa tcctctgatt gaggaagtgg 11161 tgcaaaagcg atttcgacgg gaagctacca ttttagagga tctaggagca gctagtcatc 11221 aaattcccac tctgtacgct tactttcaaa agtttggaca attctattta gtacaggaat 11281 ggatcgacgg acaaaccctc tttcagaaag tccaagaaaa tggatgttta agcgaaagtg 11341 aggttgtctc gattttgatc agcttgttgg atgtgctgga atatgtccat gacaaaggtc 11401 tcattcatcg cgatataaaa ccggataaca ttattatgcg ctcatcagat cgcaaaccag 11461 tgttgattga ttttggtgcc gtgcgagaaa cgatggggac agtgatgaat cctgaaggaa 11521 ctgtcagcag ttcaattatt gttggtacac cagggtttat gcctaatgaa caagcagcag 11581 ggcgtccagt ttttgccagt gatctatata gtttaggact aacagcgatt tatttgctca 11641 caggacaatt gccgcaacag atgacaacag acttatacac tggagagagt atctggcagc 11701 gggatggagt tagtccaagt ttagcagcag tcttagataa ggcaattcgg aataatgtta 11761 gggaacgcta ccccagcgcc agagcaatga tatatgctct acaaagtatt gcaagttctc 11821 ttccatcata tgtggctaag cgtactcaac cacctgccca gactgttcct acaactctaa 11881 gatctgcggc tcaaaacaga agacacaaca gtatatttct tggcagcatt tttatgatag 11941 gtacattatt cggtacatcc ataattcttg ctctattgtt aacaaatttt cgtcaaccaa 12001 cagtgtataa taaagaactc 
tcatctgggt tagtcacaaa accacaaggg ttagatgccc 12061 cttctacatt atctccttct gtatctgcca gttctcaaat agtacaaact aataaacaac 12121 aaaatataaa cagtggtgct ttgcccagtt catttcattt catcgcagac tctagcttcc 12181 cacatttaca aaatgctgtt aagcaaacga aaactttgca ggcggcaggt tattctcaaa 12241 ctggtgtgtt ttggatacca gattatccaa atctggttga caagcattta tttgttgttt 12301 atgtaactac ttttagcgat cgctctagtt gtttgaactt cctcagggat tatggaaaag 12361 tgaatccaaa cgcatattgc gcttttgcaa gtaaagatcc aaaagcacca acagctcgac 12421 tatcttttag agaaattgag tagttgttat gaagcccatt gccttggtgc atgatgcctg 12481 agtcaggagc gtgatatcat ctgctgatga ttacctattt gcgcataagt actgcacaac 12541 tcatcgcaat gatgggtatg cctatccttc taatatcaag tccactaact tacatcttgc 12601 accaggtcaa aaatagtcaa atttgttaca ttttgtttca ctccgttctc gttcgcctag 12661 gactgtaagt cccgccgctt acaggcaaag tccattaaaa tggactagaa attatctata 12721 agtatattta aacactattg actcgtcatg atgcaagaac tcagctgaat gacttctcgt 12781 tgctgcgtta ccctccgggt atgcctgcgg cacgccctcc ggtctaacgc cagttgccta 12841 cggaggagcc agtgcggtct tggggtctcc ccaagtagag catctggcgt tggagagccg 12901 tcattcgcac tggtctcacc tcaccccaac cctgtgcgaa cctacgaaga gggtagcgca 12961 agcggttgag gttcttggtt ttttataagt gttcatctgg acatgatatt acaggaatgc 13021 gcgacatcgc aataaaaaaa agcccgcgaa aagcgggcac aacaacacaa cacaaatggt 13081 ctacctcaga gggtggtgaa gcgcctactc tgtgataaat ttattctttt gcattgttca 13141 agtaaaattt tggtgttcat tattccagtt tccctttttt gtttgtgtgg cggaagtgat 13201 tgaaaccgct tctgatattt catatgggtg actcttgatt tttatgcact taaggcaaat 13261 gcatttgttt gattcttctt gtcgcagaaa aaagtcgcca ggaaatttgg gaaaaatgct 13321 ctaaaaggtg atttaggagt tgaacggatc ttgttaagaa tgaagcgatc gctatctata 13381 cttactcttt tctcttttct cgtaccatac gctgtcgttg ctgcaccacc tagaacacca 13441 gacaaaacag tcaactgtga cattctggtt gtgggtggag gactttctgg tgtcgccaca 13501 gcttatgagg caatactggc agggcaaaca gtgtgcttga ctgaaattac tgattggctg 13561 ggaggacaaa tctcttcgca aggaactgct gcacttgatg aacgaccaac ccaacgtcgc 13621 aaactctttt actctcgtgg ttacttagaa ctgcgaaagc gtattgagaa caaataccgt 13681 
ggtaatatta accctggtaa ctgctgggtc agtgactcgt gttttcttcc gcgcgatggt 13741 catgaaattt tgacttcgat gctcaaagat gccgaaaaaa aaggcaaagg aaagttgcaa 13801 tggttcccaa acacggtcat taaggagttg gaatatagta gcgatgggaa gcttattagt 13861 agtgcgatcg ccattcaaca tcaaccagtc caaggcgcac cacccctcaa cacttttcct 13921 ttatctcaaa ccatcgaaga cgcttatacc taccaaaact cgtctcggtt tgccaaaact 13981 attgttcgcc tcgtccccaa gcaaaccaaa agtaaagctg gcagtaatgc ccctaactgg 14041 tatgttgtag acacttcaga aacaggggaa attatcgccc ttgcagatgt tccctatcga 14101 ctgggcattg atgctcgttc ttacttagaa ccttcttctt ccagtactca aaacgattcc 14161 tattgtactc agggctttac ttacaccttt gcaatggagg cgactgataa cccgcaaaca 14221 caaacaatgc ccgcatttta tccacaatat gctccatatt atagctatga attgaagcgg 14281 ctagcaagct ttcccttggt tttcacctac cgtcgtattt ggagtcccac gaaaggacaa 14341 ccaatggaat ttggtggtgt caagtttaca ggtcccactc caggggacat ctcaatgcaa 14401 aactggactt ggggcaatga ttaccgtccg ggaacttctg ccgataacct catttacaat 14461 cgtcaacagt tacaatctgc tgggcaattg caaccaggag gctggatggg tgggctgcgc 14521 aaagaagcct tacgcaaagc tgaggaaatc tcgttgggat actattattg gttaaccgcc 14581 gggaatacag attctcaatt gggcaatggt gtgaagcagc cacaaccaaa taaccgcttt 14641 ttatcagggt tagattcccc aatggggaca gcgcatggct tgtcgaaaca tccatatatg 14701 cgggaaggac gacgtattat tggacgccca agctggggac aacccgaagg cttttcgatt 14761 tgggaaattg atatctctcg tcgcaactac gatgatgagt actaccgcaa aacactaccg 14821 ccagatacgt atcgccgcct caaagctgca ttaggaggtt tagaagcagc atcagtcctt 14881 tcagggcagg tcagtccaga caaggtagcg cggcggactc gttccactat tttccctgat 14941 gctgtgggta tcggtcacta tgccatagat ttccatcctt gcatgaccaa aagccctcca 15001 gaaacgcctg gaaatacaga tcgtgcaggc gaaagacgtg gtgctgggca agcttatcct 15061 ttccaaattg cactcagggc gatgattccc caaaaaatcg acaatttact cgtaggtggt 15121 aaaagcatcg gaaccagtca catcgccgct gcagcataca gggttcactc ttttgaatgg 15181 tctgcgggtg cagcggcggg aacgacagca gcttttgctc tgaaaaatgg cgttgcacct 15241 taccaacttg tggaaaaatt gcctttacca gaaccgcgat tacaactcct caaacagttg 15301 ttggagagaa atgggaaccc cactgccttc cccgacacct cgattttcaa 
ccaaaattgg 15361 gacgattgga aataattagt cattagtcaa aaactaaggg tgtaagggtg taagggtgta 15421 ggggtgtagg ggtgt // LOCUS NODE_2193_length_15241_cov_4.641841 15241 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15241) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15241) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15241 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 174..2459 /locus_tag="DP116_18650" CDS 174..2459 /locus_tag="DP116_18650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208521.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amylo-alpha-1,6-glucosidase" /protein_id="PRJNA477356:DP116_18650" /translation="MTPDTLMTPEKLSLDGKTFVPADQIPIPEWPRIISERPQPTLTV KDDDLFLVTDTLGNISGCSLNDGNPSMGLFCCDTRFLSRLELQIDEHSPVLLSSTADK GFSLSVLCTNPKIEDRLKADTVGIRRELVLNGALFEELEVSNYSTSSVTFELSISFDA DFVDLFEVRGYHRDKRGRLLRLVEPTPEGGTSNADGVSFQSGPPAQKEQSLSLAYQGL DNLVMESRVQFQHRQPDYFKGYTAVWRLELASHETQKLGYRVNLLTNNKPSSIVSAAF TLVQAKAGEIMEEQQWVQQITQIRSDKGTFNRIIERAEQDMYLLRQSFGKHKTVSAGV PWFSSLFGRDSLIAAFQTLMLNPQIAKETLQILAFYQGKTDDDWREEEPGKMLHELRF GELARCQEIPHTPYYGTVDATPLWLMLYAEHYAWTHDLETLEQLWPNALLAMDWIDRN MRETGYLSYYRKCKQGLDNQGWKDSGDSIVNRKGELATGAIALCEVQAYVYAVKLRLA EIARLKKRIDLSDRWTEEARNLKLRFNRDFWMEDQDFCALALDGEGKHVDSITSNPGQ CLNLGIFTPEKAYSVAERLRAPDMFNGWGIRTLNSLSPAYNPMGYHIGSVWPHDNALI AMGLRSLGLVDQALELFQGLFDMTEQQPYHRPPELLCGYERNGDNAPVQYPVACTPQA WATGSIFQLLQMLVNLVPDAPNNCLRIIDPTLPESISRLSLHNLKVGSTILDLEFERS GTTTACRVANKRGNLRVVIEA" gene complement(2532..2864) /locus_tag="DP116_18655" CDS complement(2532..2864) /locus_tag="DP116_18655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456096.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2973 domain-containing protein" /protein_id="PRJNA477356:DP116_18655" /translation="MLHLLYILAFTILAFIAVANLIRNLIMFSFDTQRIYPPRSGGST NQGRYPYNSSTQQFRPHPELLDATGNLIKEPLLVMRSINVDDARQKLDELYEASPGHR SDNQQEEG" gene complement(3096..3434) /locus_tag="DP116_18660" CDS complement(3096..3434) /locus_tag="DP116_18660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF2605 domain-containing protein" /protein_id="PRJNA477356:DP116_18660" /translation="MRDSNLPEPELLKTVLQPLLEDFQYWFARSRDFLETEELSFMSQ HEQSDLLTRVKKAQEEVNTAKMLFTATGGQVGIDMATLMPWHQLVTQCWNVAMRFRSQ QENWQHKDGV" gene 4841..6682 /locus_tag="DP116_18665" CDS 4841..6682 /locus_tag="DP116_18665" /EC_number="6.1.1.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319540.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="threonine--tRNA ligase" /protein_id="PRJNA477356:DP116_18665" /translation="MVQQQISPNSSSQEEKPEKVYLPRTSESEILKKIRHTTSHVMAM AVQKLFPKAQVTIGPWIENGFYYDFDNPEPFTDKDLKAIYKEMVKIINQKLPVVREQV SREEAERRIKEIKEPYKLEILSDIKEEPITVYHLGDKWWDLCAGPHMENTGELNPKAI DLESVAGAYWRGDETKAQLQRIYGTAWETPEQLAEYKRRKEEALRRDHRKLGKELGLF IFSDLVGPGLPLWTPKGTLLRSLLEDFLKQEQLKRGYLPVVTPHIARVDLFKVSGHWQ KYKEDLFPLMAEDEEAAAHEQGFVLKAMNCPFHVQIYKSELRSYRELPIRLAEFGTVY RYEQSGELGGLTRVRGFTQDDAHIFVTPEQLDSEFLNVVDLILSVIKSLRLENFKARL SFRDPTSDKYIGSDEAWNKAESAIRRAVETLGMDHFEGIGEAAFYGPKLDFIVRDALD REWQLGTVQVDYNLPERFDLEYVAEDGSRKRPVMIHRAPFGSLERLIGILIEEYAGDF PLWLAPVQARLLPVGDAQLDYAAYVVAQMRDLGIRAEVDVSGDRLGKLIRNAEKEKIP VMAVVGAKEVETNSLSIRTRTSGELGTVAVSEVLDKMKQAIANYDNL" gene 6759..6950 /locus_tag="DP116_18670" CDS 6759..6950 /locus_tag="DP116_18670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007311441.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18670" /translation="MTTTKEKVQSLLSKLPDDCSVEDVQYHLYVLEKVRQGLVVTDHR ETLISQEEAEALLRKWLIE" gene 6935..7240 /locus_tag="DP116_18675" CDS 6935..7240 /locus_tag="DP116_18675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309657.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_18675" /translation="MAYRVVWSPKALEDVDAIAAYIFRDSASFSATVVRKILDSSDKL SASPYSGSIVPEFNEDTIRELFAYTYRIIYQIQENTVTIGAVIHGKRLLAQRLDIDR" gene 8383..8592 /locus_tag="DP116_18680" CDS 8383..8592 /locus_tag="DP116_18680" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18680" /translation="MLIKERYRLLKQIGQGGFSKTFLATDEGKSPAVACVVQQFWLQN QTPETFVQKAQILKELGKHLVKFCS" gene complement(8595..9074) /locus_tag="DP116_18685" CDS complement(8595..9074) /locus_tag="DP116_18685" /EC_number="1.11.1.15" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113150.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thioredoxin-dependent thiol peroxidase" /protein_id="PRJNA477356:DP116_18685" /translation="MNNVPQPGQKAPDFSTTDQDDNQVSLGDFSLQWVVLYFYPKDDT PGCTTEAKDFTELYQDFSFLGAKILGVSTDSQKSHCKFINKHNLSITLLTDPEHQVAE AYKAWRLKKFMGKEYMGVERSTFLIAPDQTIAYTWAKVKAKGHATAVLSQLRELIDP" gene complement(9156..9662) /locus_tag="DP116_18690" CDS complement(9156..9662) /locus_tag="DP116_18690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874190.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="general stress protein" /protein_id="PRJNA477356:DP116_18690" /translation="MTDSQNRNEQIKKLRELIKDIDIGMLTTVDEDGTLRSRPMSTNS EVEFDGDLWFFTYASSHKVTEIEQQEQVNVSFSDPHKQNYVSVSGSAQLVRDRNKLQQ LWKPQLKAWFPKELDEPDIALLKVSVQKAEYWDAPSSFVAHTIGLVKAIATGDKPSVG ENEKVTLK" gene complement(9842..10303) /locus_tag="DP116_18695" CDS complement(9842..10303) /locus_tag="DP116_18695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458071.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_18695" /translation="MKQNHVSLPPECTLRSATSEDIWSIRLLVLGAKLDPTQIRWQQF WVIECNGQLVACGQLRNFSGAQELGSLVVLPAWRGRGLGTFLTQHLIHQATQPLYLEC LGERLAQYYTRFGFVTISFEELPPSVKRKFGLSQLGKRLIKVPVVFMKYQE" regulatory 10355..10499 /regulatory_class="riboswitch" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00174" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="cobalamin riboswitch; Derived by automated computational analysis using gene prediction method: cmsearch." /bound_moiety="adenosylcobalamin" /db_xref="RFAM:RF00174" gene 10849..11241 /locus_tag="DP116_18700" CDS 10849..11241 /locus_tag="DP116_18700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316453.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FeS-binding protein" /protein_id="PRJNA477356:DP116_18700" /translation="MINKQHNLFVCTTCASTWQDGKRVGESGGEQLLHRLQELAQNWE LQNNFPIQGVECMSACSHSCVIAFAAEEKLTYLFGNLPVDASAEAIVQCASQYYTKPD GSLPWSERPEPLKKGILAKIPPLNKWAK" gene 11464..11664 /locus_tag="DP116_18705" /pseudo CDS 11464..11664 /locus_tag="DP116_18705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019503053.1" /note="internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(11899..12327) /locus_tag="DP116_18710" CDS complement(11899..12327) /locus_tag="DP116_18710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745821.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18710" /translation="MTQDAIFSPFFATVFLTLLVWVYMYIRRISFITSLKTRQQDLAV PGTLAQISPPNVSNPSDNLKNLFEIPVLFYALVLYLFITKQVDAVYVNAAWVFVVFRT LHSAVHCTFNLIMLRFYLYLFATLAVWFIAIRAALIHFSA" gene complement(12337..12765) /locus_tag="DP116_18715" CDS complement(12337..12765) /locus_tag="DP116_18715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015174928.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18715" /translation="MTINSEIDRDRAPIRSMQIGFYATSVVFNLCLIAQLLTVGVAYF VNPTWWNIHVWLVRGYSGLSLLLLGWSFITPFSPQIQRLTASLPVLLGLQFCSIHLRS PLHLEVLHPLIGFALLYVSSSLVHRVWRSLSPNHQQNEQV" gene 12857..13567 /locus_tag="DP116_18720" CDS 12857..13567 /locus_tag="DP116_18720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017324747.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="PRJNA477356:DP116_18720" /translation="MTSSLGFLFEAINQAHSEHDLRLQIVPKIGEYFAAKRCGIFFFD QLPLTDRNLQKILKIALSIEHNPVARYLVERHAPVHEALVTSPKAWKLICPRPDHWHV MAGPIINRGQLVGVVGCTREKSMPAFDAQNLVDLSAICLHLSVWTATVRSQSVSAGKS LPPSFRTNRLTPRELQIAELVALGRTNAEIGTELWITENSVKQALKRMFRKLEVSSRA QMVAQLLATRHVATQGKL" regulatory 13653..13797 /regulatory_class="riboswitch" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00174" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="cobalamin riboswitch; Derived by automated computational analysis using gene prediction method: cmsearch." /bound_moiety="adenosylcobalamin" /db_xref="RFAM:RF00174" gene 14135..15127 /locus_tag="DP116_18725" CDS 14135..15127 /locus_tag="DP116_18725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006515954.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_18725" /translation="MNIIWLLLTVALGLGLIYIFSARRYQSSDSVANAYDQWTQDGIL EFYWGEHIHLGHYGSPPHPKDFRAAKVDFVHEMVRWGGLDRLAAGTTLLDVGCGIGGS SRILARDYGFRVTGITISPEQVKRAQELTPPEISAQFQQDDAMNLSFPDASFDVVWCI EAGPHMPDKAVFAKELLRVLKPGGILVVADWNQRDARKQPLVWWESWVMRQLLDQWAH PEFASIEGFAELLQATALTEGAVITADWTHQTLPSWLDSIWQGIIRPKGLVSFGIPGF IKSLREVPTLLLMRLAFGVGLCRFGMFRAVRCEQILSSSSSFQNEKSPSTLPAN" BASE COUNT 4425 a 3132 c 3384 g 4300 t ORIGIN 1 aagtaaatat ttgtaaagat tttatgttca aggcttgaca catgttctgt tactgactgt 61 atgtgattat gatttttcct caatggctca aatctagatc taacagcgtc tttctttcct 121 gttgacgctg ttgcttcgat gaaaaactag acctgtacac ggaattttgg ctgatgacac 181 cggataccct gatgacaccg gagaaacttt ctctagacgg aaaaactttt gttcctgctg 241 atcaaatccc gatcccagag tggcctcgca ttatcagcga aagaccacaa ccgacgctaa 301 cggttaaaga tgatgattta tttttggtaa cagacacttt aggcaacata tctggctgtt 361 cacttaacga tggcaatccc agtatgggat tgttctgctg tgacacacgt tttctcagtc 421 gccttgagtt gcaaatcgac gaacattcac cagtgctact cagcagcact gctgataaag 481 gattttctct gtcagtttta tgtaccaatc ccaaaattga agaccgttta aaagctgaca 541 cagtagggat tcgtcgcgaa ctcgtactca atggcgcact atttgaagaa ttagaagttt 601 caaattatag cacaagttcc gttacctttg aacttagtat tagctttgat gccgattttg 661 tggatttatt tgaagtccgg ggctatcacc gagataaacg gggtagactt ttacgccttg 721 tggaaccgac acctgaagga ggaacatcaa atgctgatgg tgtttctttt caatctgggc 781 cacctgcaca aaaagagcaa tctttgagtc ttgcttatca agggctagat aacttggtga 841 tggaatctcg ggttcaattc cagcatcgac aaccagatta ttttaaaggt tacacagccg 901 tttggcggtt ggagttggct tctcacgaaa cccaaaagtt gggctaccgg gtgaatttgt 961 tgacaaacaa taaacctagt tccattgtta gcgccgcttt caccctagtg caggcgaaag 1021 ctggtgaaat catggaagag caacagtggg tgcaacaaat tacacaaatt cgttcagata 1081 aaggaacttt caatcgaatt attgaaaggg ctgagcaaga tatgtacttg ttgcgtcagt 1141 cctttggcaa gcataagaca gtttcggcag gagtaccgtg gttttcttcg ctgtttgggc 1201 gggattcgct gatcgcagct tttcaaaccc taatgctaaa ccctcaaatt gctaaagaaa 1261 ctctgcaaat 
tctcgcgttc tatcaaggca aaacagatga tgactggcgc gaagaagaac 1321 caggtaagat gttgcacgag ttgcgatttg gtgagttggc tcgttgtcag gaaattcctc 1381 acactcctta ctacggtaca gtcgatgcga ctcccttatg gctgatgctg tatgcagaac 1441 attatgcttg gactcatgat ctagaaacct tagagcaact ttggcccaat gctctgttgg 1501 caatggattg gatagaccgt aacatgagag aaactggcta cctcagctac taccgcaaat 1561 gcaaacaggg tcttgacaat caggggtgga aagactctgg tgattctatt gtgaatcgta 1621 agggagagtt agctacagga gcgatcgccc tttgtgaagt ccaagcttat gtctatgctg 1681 tcaaattacg cttagcagaa attgctagac tgaaaaagcg gattgactta tcagaccggt 1741 ggacagaaga agcaagaaac ctcaagcttc gtttcaatag agacttttgg atggaagacc 1801 aggatttttg tgccttggct ttggatggag aaggtaagca cgtagatagt attacatcca 1861 atcccggtca gtgcttgaat ttgggcattt ttacaccaga aaaagcttac agtgttgctg 1921 aacgtctgcg agcaccagat atgtttaatg ggtggggtat tcgcacgcta aatagcttgt 1981 caccagctta caatccaatg ggttatcaca ttggttctgt ttggcctcac gataacgcac 2041 tcattgcaat ggggttgcgt tctctaggtt tagtcgatca agccttggaa ctttttcaag 2101 gtttattcga catgactgaa caacagcctt atcatcgtcc tccagaattg ttgtgtggct 2161 acgagcgcaa tggtgataac gcgcctgtac aatatcctgt agcttgtaca cctcaagcat 2221 gggctactgg tagtatcttc cagttactgc aaatgctcgt aaacttggta cccgacgctc 2281 ctaacaactg cttacgaatt atcgacccca ctttgccaga gtcgataagt cgcttgtcac 2341 tgcataatct caaggttggt tctaccatac ttgatttgga gtttgagcgt tctggtacca 2401 caactgcttg tcgcgttgct aacaaacgcg gtaacctcag agtggttatt gaagcgtaaa 2461 caacagtgtt gagttatatg tcctgtagac atcctgaggg tgtctacaag actcggttct 2521 cgtggacttt tttatccttc ttcttgttga ttatcacttc tgtgtcctgg agaagcttca 2581 taaagctcat ccaatttttg tcgcgcatcg tcaacgttga tcgaacgcat gactaaaagt 2641 ggttctttaa ttaaattgcc agtcgcatct aataattctg gatggggtct aaactgctga 2701 gttgacgaat tataaggata tctgccttgg tttgttgaac ctccagatct tggtggataa 2761 atccgctgtg tatcaaagct aaacataatc aggttacgaa ttaagttagc gacagctata 2821 aaagctagga tggtaaaagc aagaatgtaa agcagatgta acattgttct ttcctccaga 2881 gttcgcaaat tctaaaacta tatgttgtaa aaaattttcc ttagctgatg cctggtgatg 2941 ccgttaatga tggtgatgct 
aaatggttaa cgcaattttt gtgcgctcag tctctttaga 3001 ttaagaggga tatttttaac ttcaggggga gtcctttacc ataaactgca tcattttcta 3061 tctcagatga tgttaaaaat tttttactat ctcacctata caccgtcttt gtgttgccaa 3121 ttctcctgtt gcgaacggaa tcgcattgcc acattccaac attgtgtgac taattgatgc 3181 caaggcatca atgttgccat atcaatacca acttgtccgc cagttgctgt aaacagcatc 3241 ttcgctgtgt tcacttcctc ttgtgctttc ttgactcgcg ttaataagtc agattgttca 3301 tgttgactca tgaatgatag ctcttccgtt tccagaaaat cgcgcgatcg cgcaaaccag 3361 tactgaaaat cttctaacag tggttgcaaa actgttttca gcagttcagg ttctggtaaa 3421 ttcgagtctc gcataaatga aaagcatatt tctaactttc ttactaatat taactctatt 3481 taatattctt aacattcttt tcgtctctgc cataagtctg aaaagtaaac ctaattgtaa 3541 tattggttac acttgaatcc tcagagagat tcgccgtctt tcttacatat acaagtcggg 3601 gatctcaaaa aaaccgtctg tactggaggt aaagacattg tcatacgcat atacttgagt 3661 atgaaaaaag actttaaatc tcacataaaa tactaatatg atgactgaat gtgtacagaa 3721 cttctgtaaa aaacctgcat agttttctca taaaaggtgc cctaaaagtg aattaacaaa 3781 tctatgtata tgttttgggg ctgacgtctt gcaatttctg ttgctgtaaa gattaattca 3841 ctttttggtt gaaacataga acctccgcca atgccacagc acttgcaccg actgtcggga 3901 aagcgcagtc ggcacgcagt gactccctac ccccctttca tcccacgcca cttctcaagg 3961 gcggggggaa cccccgcacg agagtggctc cccttgataa ggggggacac aggggggtgt 4021 cgtagacggt ggggtgtact tgtatcagac ctttcgtgaa atgatgtttt ttcggcacta 4081 agaaattagg gggcttcaaa ccccaagcca gaactttcaa accttaaatt cagtcaacct 4141 gaattcattc tgagttctgg attagttatt tctttgttgg ttagattgag ttaacataat 4201 actatctcac aattaaagtt aaaattcact ctcacatagc gtaaaattta gatttgaaag 4261 aggaaactta catgcttggc tttgcccaca aacaaaatgt tcttaagtat atagaaattg 4321 cccttagcta gaaatcagaa cttggtttta tttctgtagt agaaattaga tgcttgacaa 4381 ttgcaagata tcagccaggt caagaaaaat tccaaatgtc ctttttaagg aaaaggcata 4441 tgcatcgaaa tatgtggcac acagatagac gagcagacac ttggaggtgg atgtgagtgc 4501 tcactaacac tatatgtgag tttactgtca ggtttgctac taggcactca ttgtcttatc 4561 accaatagta actagctatg agtattcttt atatgaaggt tatttttgat ttataagtag 4621 atactaaatg acatgatgga tgatcaacaa 
tagattttct gagggtaaat catcagaaca 4681 taattatatc cttaaatgaa tcgggttcat cttgctaaag agagaaattg tcagaaaatc 4741 aaaagtctga cgaaactgta aaatagccct agcttagatt tgggtatgat ttgtcataac 4801 ctcaactaaa taaatcatca ttcttacaat ctcttcgcca atggtacagc agcaaatatc 4861 gccaaattca tctagtcagg aagaaaaacc ggaaaaagtg tatttaccac ggacttcaga 4921 atcagagatc ttaaagaaga ttcgccatac cacttctcat gtgatggcga tggcagtaca 4981 aaaactgttt cccaaggcgc aagtcacaat tggaccttgg attgaaaatg gcttttacta 5041 tgactttgat aatccggaac catttactga caaggattta aaagccattt ataaagaaat 5101 ggtgaagatt atcaatcaga aattgccagt cgtaagagaa caagtcagtc gcgaagaagc 5161 tgaacgccgt attaaagaaa ttaaggaacc ttataagcta gaaatcctat cagacatcaa 5221 agaggaacca atcacagttt accacttagg tgataagtgg tgggacttgt gcgctggacc 5281 tcatatggaa aatactggcg aactcaaccc gaaagcgatt gatttagaaa gcgttgctgg 5341 cgcatattgg cgtggggatg aaaccaaagc gcagttgcaa cgcatctacg gtactgcttg 5401 ggaaacacca gaacaactcg ctgagtataa gcgacgtaag gaagaagcac tgcgaagaga 5461 ccaccgaaaa ctgggtaagg aattaggatt atttatattt tctgacctag tgggaccggg 5521 gttgccattg tggacaccga aaggaacttt gttgaggagt cttttggaag actttctcaa 5581 acaagaacaa ctcaaacgcg gatatttacc agttgtcact ccccacattg ccagagtgga 5641 cttatttaaa gtttccggac actggcagaa atataaagaa gatttgttcc cccttatggc 5701 agaggatgaa gaagccgcag cgcatgaaca aggcttcgtc ctcaaagcga tgaattgtcc 5761 cttccacgtc cagatatata aaagcgagtt gcggtcctac cgagaattac cgatccgctt 5821 ggcagaattt ggcactgttt accgctacga acaatccggg gaattgggcg gcttaacgcg 5881 tgtacggggt ttcactcagg atgatgccca catatttgtt accccagagc agctagacag 5941 tgaattcctc aacgtggtag atctaatact gtcagtgatt aagagtctgc gattagagaa 6001 ctttaaagca cggcttagtt tccgcgatcc aaccagtgac aagtacattg gttctgatga 6061 agcatggaac aaagcagaaa gtgcgatccg tcgagcagtc gaaaccttag gtatggatca 6121 ctttgaaggt attggggaag cagcgttcta tggtcccaaa ctagatttta ttgtccgtga 6181 tgcccttgat cgggaatggc aattaggaac cgtacaggtc gattacaatc tgccagaaag 6241 gtttgatttg gagtacgtcg ctgaagatgg ttctcgcaaa cgtccagtga tgattcaccg 6301 tgcgcctttc ggttccttgg aacgactcat cggtatttta 
attgaagaat atgcaggaga 6361 tttcccctta tggttagccc cagtccaagc aagattgctg ccagtgggtg acgcacaact 6421 ggattatgct gcatacgtgg tggcgcaaat gagagacctt ggtatccgcg ctgaagttga 6481 tgtcagtggc gatcgcctag gtaaactcat tcgcaatgcc gagaaagaaa aaattcccgt 6541 aatggctgtg gtgggagcga aggaggtgga aaccaactcc ctaagtattc gcacccgcac 6601 ctctggagag ttaggaactg tggctgtatc tgaggtctta gacaagatga aacaagccat 6661 tgctaactac gacaaccttt aacagaaggt agagataagg gcgagctgtt tgcttgccct 6721 taaaaattac ttcatataaa tattctcaaa aaaatgctat gactactaca aaagaaaaag 6781 tccaatctct gttgagcaaa ttaccagacg attgttctgt ggaagatgtt caatatcatc 6841 tgtacgtact tgaaaaagtt cgtcagggat tggtagtcac tgaccatcga gaaactctca 6901 tctctcagga agaagccgag gcgctgttaa gaaaatggct tatcgagtag tttggtctcc 6961 caaagccctc gaagatgtag acgcgatcgc ggcatatata tttcgtgact ccgcgtcttt 7021 ttctgccaca gtagttcgga agatacttga ctcatctgat aagttgagcg ccagtcccta 7081 ttcaggttct atcgttccag aatttaacga ggataccatc agggaactat ttgcttacac 7141 ttatcgaatc atttatcaaa ttcaagaaaa caccgtaact attggggcag tcattcatgg 7201 taaaaggctc ttggctcaac gtctagatat agatagataa gccaagaaac ctttctccag 7261 acaaaaagca acaataaaca agggtgggca atgctcaccc ttctacaatt tttgatagta 7321 caaaacacaa aatcgatttt tcaactgaag tttttcgttc cttacctctc tttaggaatg 7381 cataactgtg ggctgctacc tcgattacaa acttaatact taattatgaa gttttataaa 7441 ctattaggtc aagtttaaga aattaatcac agccacaggg aagtattgta gaacggctac 7501 tctccatgat tttatttaac taagcctgtt ggagaatctc aagaatggga agaggaactt 7561 tcaaatgcag aacttttggc tgtagttgga ggagttcaac aattggttgt gactaaaaaa 7621 acacagatct gaggaattcc gcttgaaaat gatggttagg ggaaaaccgc aattctaaca 7681 accgattcca gaaaaataat cactaaaaag cactagcagt ctacaactgc tagtgcctca 7741 acatcctaga gtttattatt tatgtagaac ttttcctaaa gtcagaaaaa agatgataga 7801 tgcatccctt acaatcatga accaaaaaaa cgtatgaaat aaatactcaa agaagctagt 7861 aaacgcaccg gatttttgtc aaatatatga aactacacat agtttaaagt cgtaagataa 7921 tgcaaacgtc ttcctattga aatggatatt agaataaaaa aataagtcta atagaggaag 7981 acgttttttg tgttaagtct tttgacgatt aatgatgtca gtcactgaaa 
cgcatgatcg 8041 cgcagagcgc agacgcaatc atgcgtatcg catctgatgt ctttgcaaag cgtaaagata 8101 taggattcct ccgcagattt ttgctcagct agatacagtt tctgttaagc gttccttacc 8161 tcaactagta agtatggcta tggctgacgc cacggctagc gcgccgtatg cgctcttgcg 8221 cacgcaagcg taagcgcaaa gcgcagcgta ggcgtgtagg agatacgcca cgcaagcgaa 8281 cataaaccaa accggattcc tatagataat tgttttcgct tctacatcat tggcgacata 8341 ctgtgaacga gtttaaccgt aactaaaatt ctgttataaa ctttgctgat aaaagaacgt 8401 taccgtttac taaaacaaat tggtcaaggg ggatttagca aaaccttcct cgcaacagat 8461 gagggaaaat cccccgcagt ggcttgtgtc gttcaacaat tttggctaca aaatcaaaca 8521 cctgaaactt ttgtgcaaaa agcacagatt ctaaaggaat taggtaaaca tcttgtcaag 8581 ttttgctcat aaatttaggg gtctatcaat tctcgcaatt gactcagcac agcagttgca 8641 tgacctttgg ctttcacttt tgcccaagta taagcgattg tttggtctgg tgcgatcagg 8701 aaagttgaac gctctacacc catgtactcc ttacccataa actttttcaa tcgccatgct 8761 ttgtaagctt cagcgacctg atgctctggg tcagttaaca aagttattga taaattatgt 8821 ttgttgataa atttacaatg agatttttgt gaatctgtac taacacccaa aattttcgct 8881 cctaagaagc tgaagtcctg atataactcg gtaaaatctt tagcttcagt cgtacaacca 8941 ggagtgtcat ctttggggta gaaataaaga acaacccact gaagcgagaa atcacccaga 9001 ctgacttggt tatcatcttg atccgtcgta gagaaatcag gggctttttg ccctggttgg 9061 ggtacgttgt tcaccataaa attgtctagg taataccaag aactcgggtt atgaaagaac 9121 ccgagttttt ctgttgctat tatcttactt tttgattatt tcaaagtaac tttttcattt 9181 tcgccgacac tgggtttgtc acctgtggcg atcgccttca ccagtccaat tgtgtgtgct 9241 acgaaactcg aaggagcatc ccaatactca gccttttgta cactaacttt aagcaaagca 9301 atatcgggtt catctagttc cttggggaac caagctttta gttgtggttt ccacagctgc 9361 tgtagtttat tgcgatcgcg taccaattga gccgaacccg acactgaaac gtagttttgc 9421 ttatgtggat cagagaaact gacgttgact tgttcttgct gctcaatttc tgtcacctta 9481 tgggaactgg cgtaggtaaa gaaccaaaga tcgccatcaa attccacctc tgagttagtt 9541 gacatcggac gactgcgcaa agtcccgtct tcatcaactg tagtcagcat tccaatgtca 9601 atatctttga tcagttcacg cagcttctta atttgttcgt tacggttttg tgaatctgtc 9661 atttttctta ctcttgtctg tgctatctag tttgaatatg agcgctctac tgcaagtctt 
9721 ttcttcgcac actaaagctc tcttatagtg ttagcgctta catacaacag tgacttgagt 9781 ctaaaggtgg agtttagctg ctagaagagg tacgtaaatg gaggtaaatg aacaaaaaaa 9841 gttattcttg atacttcata aagacaacag gaacctttat caacctcttt cctaattgag 9901 acaatccaaa cttgcgttta acagatggcg gtaactcctc aaaggaaatg gtcacaaagc 9961 caaagcgagt gtaatactgc gccagtcgct cacccaaaca ctctaaataa agtggttgag 10021 tcgcctggtg aatcaaatgc tgtgttagaa aagtccccaa acctcgacct ctccaagctg 10081 gtaagacaac caaactaccg agttcttgtg caccagagaa gttgcgtagt tgtccgcaag 10141 ccaccaattg cccattacat tctattaccc aaaattgctg ccatcgtatt tgggtagggt 10201 caagttttgc ccccagtact aataaccgaa tagaccagat atcctcagat gttgcactgc 10261 gaagggtaca ctcaggtggc aaggacacat gattctgttt catagttttt cttgattaat 10321 atcatgtccg gtatatatat atggaagtat ggtaaaatat tgttaactac tcggttctaa 10381 tgggggtcag ccattagagg taacggggaa agtccggtgt aaatccggcg ctgtcccgca 10441 actgtaaagg agataagtgt ctctcagcca ggatgcccgc cgaagttaac ttgtcagttc 10501 tgcacgtctg cgaggtacag atgaattatg ccatgaatgt ttctacaaat ttatacaacc 10561 tgccgacttc acgttcttat agtggcagag atacgctcaa gattgtggct tgctacaatc 10621 cgcactgata tacgattaat tcacggtaaa aattacgact cgaaattacg actctaggta 10681 tagctctcat tcgtgctagt ttttctaggc gtaagtatct gtcctgttta tttactattg 10741 agaatattta taatagtgaa tttacacaag tctaaaggca gaatttcggc gttggcaaca 10801 ctgtgtgata tctgcggttt agatttcctt ctacacactt caaaatcaat gatcaacaaa 10861 caacataatt tatttgtttg cacgacttgt gctagtacct ggcaagacgg caagcgagtt 10921 ggtgaaagtg gtggtgaaca actactacac cggcttcagg aacttgcaca aaactgggag 10981 ttgcaaaata atttcccaat tcagggagtt gaatgcatga gtgcttgtag ccattcttgc 11041 gtcattgcct ttgctgcgga agaaaaatta acctatctct ttggcaattt acctgttgat 11101 gctagcgctg aagctattgt gcaatgtgcc agtcaatatt atactaaacc cgatggatca 11161 ctaccttggt cagaacgacc tgaaccactg aaaaagggta ttctggcaaa gattccgcca 11221 cttaataagt gggcgaaatg aaacataaat tgttgtcttt gaggactgag taagttgccc 11281 cagactaagg tcccagagtg ctttaagact tttgcaaaag gcttattatt ggacttaaca 11341 ataaaccctc attatctaca ctattctgtg cgttctccgc gtttgtgcgg 
tggtttttgt 11401 tttggagttt aaaaaagagg aaatcaatat tgagattgag cctagtggat aatagaggtt 11461 tgcatgagta taaaataaaa cttttatgct ctgactacga ttgaacttcg agcctatgtt 11521 ttagaacatc gcagtgatga ggaagcctta cacgcctacc tcgacaagct tcatgctgag 11581 aatccgagtt cacacgtata tagacccgaa gacaacgtgt ccgaagcagt tgcagagtat 11641 ttaaagaaca aaagaactct aagtggcgat cgcttgttcg ctccaagtcc acattatatg 11701 acttatacca attctccaag aagatgcact ttatttttaa aacgaaccgc caagaacgcc 11761 aagaacgcca agaaaagagc aagaagagag atttaattat ttagtgcaag tttatagaga 11821 atcggtatta cgcctgatgt gaatgaaaaa cgtctccggt tcagcgattt gttataccgc 11881 gtcggcatga cagcatactt acgcgctgaa gtggataagg gctgcgcgga tcgcgatgaa 11941 ccacaccgcg agtgtggcaa ataggtagag gtagaaccga agcatgatca gattgaatgt 12001 gcaatgcacg gcgctatgca acgtgcgaaa tacaacgaag acccatgcgg cgttcacata 12061 cactgcatcc acttgctttg tgatgaacag gtatagcacg agcgcataga aaagtaccgg 12121 aatctcgaac aggtttttta ggttatctga tggattggat acgtttggtg gcgagatctg 12181 cgccagtgtg ccaggtacag caaggtcttg ctggcgagtc tttaggcttg tgatgaaact 12241 gatccggcgg atatacatat agacccagac taagagtgtc agaaacactg ttgcgaagaa 12301 ggggctgaag attgcatctt gcgtcatttc tacctcttaa acttgctcat tttgctgatg 12361 gttgggcgat agactgcgcc atacacgatg tacaagactt gaagaaacgt agagtaacgc 12421 aaatccaatt aaaggatgta gtacctctag gtgtaaagga cttcttaaat gaatgctgca 12481 aaattgtagt ccaagcagca ctggtagact tgcggtgaga cgttgtattt ggggtgagaa 12541 tggggtgatg aacgaccatc cgagtaatag taatgacagt ccactatatc ctcggactag 12601 ccaaacatga atattccacc atgtaggatt aacaaagtaa gcgactccaa ccgttaataa 12661 ttgagcaatc aagcagaggt taaagacgac tgaagtcgca taaaagccga tttgcatcga 12721 acggatggga gcgcgatcgc gatcaatttc cgaattgatg gtcatatcaa gtgattgtta 12781 agattaaagc taatataggc tgagtacaga tgatgcttct agacctcgat cgggtacact 12841 tgagaccact tgacttatga ctagttcttt gggatttctg tttgaagcca tcaaccaagc 12901 tcatagcgaa catgatctgc gattgcaaat cgtgccaaaa attggtgagt attttgcagc 12961 gaaacgatgt gggatttttt tcttcgacca actgccctta acagatcgca atcttcagaa 13021 aatattgaaa attgcactct caatcgaaca 
taatcctgtg gcgcgttatt tggtggagcg 13081 tcatgctcct gtccacgaag cattagtgac atcacccaag gcttggaaat taatctgtcc 13141 ccgtccggat cattggcatg tgatggcagg accaatcata aatcgcggtc aattagtggg 13201 tgtagtaggc tgcacacgtg aaaagtcaat gcctgccttt gatgcccaaa atctagtcga 13261 tttgagtgcc atctgcttgc acttatctgt ttggactgcg acagtgcgtt cacaaagtgt 13321 ttctgcagga aaatcgcttc ccccctcctt tagaaccaat cgcttaacgc ctcgtgaatt 13381 acaaattgca gagttggttg ctttggggcg aactaacgca gaaattggga ctgaactttg 13441 gatcactgaa aattctgtca agcaagcctt aaagcgaatg ttccgtaagc ttgaggtttc 13501 gtcgcgtgca cagatggttg cacaactttt agctaccaga catgtagcaa ctcagggcaa 13561 gctttgataa aggatgaaac ttgccccgat taatgacgaa tgccagtata gatcgaccat 13621 tgattttcat gcaagcaaac cacagttgtg ttaatattca tataaatctc ggttctagtg 13681 ggagaacagc cactagaggt aacggggaaa gtccggtgtg aatccggcgc tgtcccgcag 13741 ctgtaatcaa ggcccgcctt gtgagtcaga acgcccgccg aagctaagtt gtaaatttta 13801 tacatctgcg aggcacagaa cagtattttt atgaacattc ctacaaagac taggctttac 13861 attctcatag gcccacatat caaaacgcca cagattacaa ctttctatga gacggtaggc 13921 ggtgaattga cggtataaat tgcgtcttta gaggcagcga ttgccggagg cgcagctagc 13981 ttaacgctta cgctaatttt ctttggcgtg actcttcaca aaaaaatgga tacgctgatt 14041 tgtttggcta aagttcacgc taaataaatt taacgttttg tttttcatta cagttataaa 14101 tagatgcggt ttccacagtc aagggagttt ttagatgaac attatctggc tgttgttgac 14161 ggtggcgtta gggctggggt taatctatat ctttagtgcc cgccgctacc aatcgagtga 14221 ttctgttgcc aatgcctacg atcaatggac tcaagacggc atccttgagt tttattgggg 14281 agagcatatt caccttggtc actacggttc cccccctcat cccaaagatt tccgtgccgc 14341 taaggtggac tttgtgcatg agatggtgcg ttggggtggg ttggatcgcc tggctgctgg 14401 cactacactt ttggatgtgg gttgtggcat tggcggcagt agtcgtatcc tagcgcggga 14461 ctatggcttt cgcgttacag gcatcactat cagcccggaa caggtgaaac gggcacagga 14521 attgacccca ccagaaattt cagcccagtt tcaacaagat gatgcgatga atctgtcttt 14581 cccagatgcc agttttgatg tagtgtggtg tatcgaggcg ggaccacata tgccggataa 14641 ggcggttttt gctaaagaac tgttgcgggt actcaagcca gggggaatcc ttgtcgtcgc 14701 ggattggaac 
cagcgagatg ccagaaagca acctctggtt tggtgggaaa gctgggtgat 14761 gcggcaattg ctagatcaat gggcacatcc tgaattcgcc agtattgaag ggtttgccga 14821 acttttgcaa gcaacagctt tgaccgaagg ggcagtgatt actgccgact ggacacacca 14881 aaccctacca tcttggctcg attccatttg gcaagggatc attcgtccca aaggtctagt 14941 aagttttggc atacccggtt tcatcaaatc gctgcgagaa gttcctacgc tgctgttgat 15001 gcgtttggcc tttggcgttg gtctgtgccg ctttggtatg tttcgggcag tacggtgtga 15061 gcagatcctc tccagttcgt ctagttttca aaatgaaaaa tccccttcta cattaccagc 15121 aaattagtca attttcgcac agtcagcaag gttggacttt aatgctggta atactacgtg 15181 aacagtgaac agggaacagg gaacagggaa cagggaacag ggaacaggga acagggaaca 15241 g // LOCUS NODE_2198_length_15217_cov_4.88807515217 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15217) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15217) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..15217 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(75..737) /locus_tag="DP116_18730" CDS complement(75..737) /locus_tag="DP116_18730" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18730" /translation="MSKATMENSQSVQQQSSQSKAPIPESGKKVPGRTLSKNPLINLL VHHTWLLPAGLLLIFLGSSAAALYSLGYVGRMQPEEEEIAEAEIIKPIKAPLNAINPT PLWLVAAIALSCGSGTLILFLLLNRPAQRQEVRNQINRYQKRLAKRSPGLEPRPNKSV PAFVPSKPKTPVVAMPVQTKPVVTVLPPEQNVSQGQGKESLANMMDMRKHTPLSTIIR KD" gene complement(900..2012) /locus_tag="DP116_18735" CDS complement(900..2012) /locus_tag="DP116_18735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-alanine--D-alanine ligase" /protein_id="PRJNA477356:DP116_18735" /translation="MTKLRVGLLFGGRSGEHEVSISSARAIARALTTEQNTGKYEILP FYIQKDGRWLAGDVPQQVLESGTPLQLPTTSNEQLSVAKTQILSRWQSPSQVAEVDVW FPVLHGPNGEDGTIQGLLTLMQVPFVGSGVLGSAVGMDKIAMKTAFAQAGLPQVKYKA LNRAQVWSNPCVFPKLCDEIEATLGYPCFVKPANLGSSVGIAKVRSPQELEAALDNAA SYDRRIIVEAGVVARELECAVLGNDSPKASIVGEITYASDFYDYETKYTERKADLLIP APISPAVTRQIQEMALQAFIAVDAAGLARVDFFYVEATGEIFINEINTLPGFTATSMY PLLWAHSGVSFPELVDRLIQLALERHSAFSTTQKEN" gene complement(2455..3021) /locus_tag="DP116_18740" CDS complement(2455..3021) /locus_tag="DP116_18740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311854.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_18740" /translation="MTPKPSNAYERILLTASKLFYQKGIQHVGINEVIAAADVAKRTF YKHFPSKDQLILEVMLYREKQWLQWFEESVEQRGKTAKEKLLATFDVLGEWYAQPDFR GCPFINAVLELANANHPVHHVSARLREAIRTHIKKLAAEAGVRDPETFSQQYLLLIGG ASLMATIEGTPAGATHARQALSVLIDGS" gene 3182..3745 /locus_tag="DP116_18745" CDS 3182..3745 /locus_tag="DP116_18745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012166566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carboxymuconolactone decarboxylase family protein" /protein_id="PRJNA477356:DP116_18745" /translation="MDFTIYTVETAPENSKEALIKAKEVFGFIPNLEGICAEAPALLK AGMALWDLFSTTSFSPIEQQVIYLAANYENECHYCMAAHSGLAKMVGMSSEDIQALRN GTLLRDPKLQALRHFTGRMVQARGWVEDYEIESFMAAGYGKQQVLEVILGIAVKVIHN YTNHIAKTPLDKVFKANTWSKLKPITP" gene complement(3801..5165) /locus_tag="DP116_18750" CDS complement(3801..5165) /locus_tag="DP116_18750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA (N6-isopentenyl adenosine(37)-C2)-methylthiotransferase MiaB" /protein_id="PRJNA477356:DP116_18750" /translation="MTTSSRRYHITTFGCQMNKADSERMAGILEDMGFEWSEDPNQAD LILYNTCSIRDNAEQKVYSYLGRQAKRKHEQPDLTIVVSGCVAQQEGEALLRRVPELD LVMGPQHTNRLQDLLQQVFDGNQVVATEPVHIMEDITKPRRDSTVTAWVNVIYGCNER CTYCVVPNVRGVEQSRTPQAIRTEIEEIGRLGFKEVTLLGQNIDAYGRDLPGVTPEGR HLHTLTDLLYYVHDVEGIERIRFATSHPRYFTERLIKACAELPKVCEHFHIPFQSGDN EVLKRMSRGYTHEKYRRIIDTIRQYMPDASISADAIVGFPGETEEQFENTLKLVEDIG FDLLNTAAYSPRPGTPAAIWDEQLSEETKSDRLQRLNHLVAIKAAERSQRYMGRVEEV LVEDQNSKDKTQVMGRTRGNRLTFFAGDINELKGKLVKVRITEVRAFSLTGEPVEVRE ALPV" gene complement(5320..6579) /locus_tag="DP116_18755" CDS complement(5320..6579) /locus_tag="DP116_18755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015176467.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_18755" /translation="MITLTYRYRIYPDITQEQTLIEWMEICRTAYNYALREIKDWCDS RKCLIDRCSLEKEYILSPELKFPGEIYQLNNLPKAKKEFPKLSEVPSQVLQQAIKQLH KGWEYFQQRGFGFPRFKKHGQFKSLLFPQFKENPVTNLHIKLPKIGAIPINLHRPIPS RFVVKQVRILRKADKWYTSISVQCDVNIPDPIPYGHVIGVDVGLEKFLATSDGVLVKP PKFFKQLQSKLKLLQRRLARLQRRSKNYEKQRLTVARLHHKIDNTRKDFHFKVAHALC DAGDMVFMEDLDYRTSAKGMFGKHMLDAAFGQFRAIVKYVCWKRGKFFSEVDARGTSQ QCPECGGQVKKDHYVRVHSCPDCGYIIDRDVAAGQNIRNRGIKLISTVGQTGTQTACA VDLPGTDENQSRQVAKSRKRTTRKSSK" gene 7066..8130 /locus_tag="DP116_18760" CDS 7066..8130 /locus_tag="DP116_18760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318248.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_18760" /translation="MSQQTLSPAADKQTKTKKGKKSLPPKLIIKLGKFVWTTIWHLMM SKLAPRNKSGEYIRPSSEFRNSVGIEQGNVYQAATGRYNLIAGLGCPWAHRTLVVRSL KKLEQAISLTIVSPSPIEGGWVFNQEYEGCRTLAELYELAQPGYGGRFTVPVLWDSQT KTIVNNESSEIIVMLNSQFNEFANNPTLDLYPQELKEKIDQWNERIYTSVNNGVYRCG FAQTQEAYEQVYHELFTTLDEIDTVLNTSRYLCGDTVTLADVRLFTTLFRFDTVYYAL FKCNRKRIQDYQNLGPYLRDLYQITGIADTCDLESVKQDYYGNLFPLNPGGIIPSGPD PMFLQEPHHRDKMSKQANPV" gene complement(8139..8930) /locus_tag="DP116_18765" CDS complement(8139..8930) /locus_tag="DP116_18765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879241.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3891 domain-containing protein" /protein_id="PRJNA477356:DP116_18765" /translation="MLHRLTKQGLICITQPNHAWVAGQLAQVWGNERFGEFAPKKEVC LAAQLHDIGWLFWEQAPTLNPQTGYPNNFMELSTPEHINIWSGARQLALPWGRYVALL ISLHGTSLYERFTSWQNSPTSSQIVQNFLEREYAFQEQVIAFLKNDEYYAPYTKPEVV ERNRKLVAIWDALSIILCQELTNDAYLSGIPTIDGETTVKLTLKDVKDNHYQVAVSPC PFQVSEVELVYEGRLLQETFSDEKAMREALMGDCAVTLSTTLQPE" gene 9173..9871 /locus_tag="DP116_18770" CDS 9173..9871 /locus_tag="DP116_18770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18770" /translation="MFKRLFLVLGCTLAGTGAIVFFFWQQATQLPVWYSNPPTTSSLP PQTDQNKTQIQLSQQQVLSKIYDNLKGANAKGEVQLDANEVNTLIVSGIAQTTDKSRL AQAVVRTNTQIQDGKISAGAVIDFRTIPLNELPSQEQVAISKLLSTVPILKYRPVYIE VEGKPKVHNKQISLDETTRVKFGNVSLTLSDMYQRFGLSEKHLNQQVANELKKLPVEV KDVEVMGDRLIVRG" gene complement(9918..12284) /locus_tag="DP116_18775" CDS complement(9918..12284) /locus_tag="DP116_18775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015195964.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_18775" /translation="MSRCVVIDISERKQAQEQLQQAHQDVQLVNDRLTGIIEGTHDLI AALDLEFRFIAFNSAYKQEFQQIFGKTIAVGMSLIDALAHLPEEQAKAVAIWGRALRG EEFTVIEEFGDERLERNYYEITYSSIRNANHQQLGASHIGRDISDRKRSENELRDSEA RYRLLFESNPNPMWVFELETLAFLAVNQAAIAHYGYSKEEFLSMTLADIIPPAYITSL HQSLSNFTPGQNDLGVWKHRKKDGSLIDVKALAHIFSFAGKWTSLVLIDDISDRLQAE QKIREQAALLDVATDAICVRDLEHHILFWNKGAERLYGWKTAEVLGKNAIKLLYRPGE TLPEFEAIQATLIREGNWQGEIQKVTKDGKTIVVESRWTLVRDGLGNPKSILTVSTDI TEKKQLEAQFLRAQRLESLGTLASGIAHDFNNILTPILAVAQLLPLKFPNLDENTQQL LSILEGSAKRGADLVKQILSFTRRGVEGSRTIIQARHLLLDVAQVAQRTFPKSIETET NIAPDLWTVCADATQLHQVLMNLCVNARDAMPDGGTLTISAENQCMDESYARMYVDAK VGLYVVMTITDTGTGIGPEIMDRIFDPFFTTKEVGKGTGLGLSTVMGIVKSHGGFVNV YSEMGKGSTFEVYLPSSQVTETQVATDVELPRGNGELILAVDDEATICEITKTALESH NYRVLTASDGIEALALYAQYKNDISVVLIDMMMPGIDGSTTILTLQRMNPQVQIIAMS GLMSNSTTTQNRSLGIQYFLPKPFTVQALLSTLREVLTIQNSKFKIQN" gene complement(12618..13040) /locus_tag="DP116_18780" /pseudo CDS complement(12618..13040) /locus_tag="DP116_18780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015195964.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" gene complement(13054..13497) /locus_tag="DP116_18785" CDS complement(13054..13497) /locus_tag="DP116_18785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017291473.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_18785" /translation="MSANNKTILLVEDNPDDEALAIRALKRNHISNEIVVVHDGVEAL DYLFGTGVYAGRDISLKPTVILLDLKLPRIDGIEVLRRLREDERTKLLPVVILTTSSE EQDMLNSYSLGCNSYVRKPVNFIEFTEAVRQLGMYWLLMNELPQI" gene complement(13494..14915) /locus_tag="DP116_18790" CDS complement(13494..14915) /locus_tag="DP116_18790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496981.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_18790" /translation="MNPEILVKLQNLCRDQAAFETLKQILAEEVQHLEQERNQAYFQM EQQKALFRVITRLREPLDLETIFKATATEVRQLLKADRVGMFRFYPNSGWDDGEFVSE DVDHRFPSAMGQKIHDHCFGEQFAVHYQQGRIQAVADIYNYGLSDCHIQVLSQFQVRA NLAVPLLQGEKLWGLLCIHQCSAQREWLATEIEFVSQIANHLGVALQHAELLADLRAE VLERLQVQQAVQSLNQGLQRAIIELQAVNKELEAFCYSVSHDLRAPLRGIDGFSQALL EDYFDLLDLTGQDYLQRIRGATHRMGQLIDDLLNLSRVTRSEMHPESVDLSLLASGIC TELQHSQPERQVKFAIQTGLVAQGDTRLLRVLLVNLLNNAWKFTSKHPQAEISFGVSK SESGVNVYFVRDDGAGFDMAYANKLFAPFQRLHGMNEFPGNGIGLATVQRIVHRHGGR VWAEGAVEQGATFYFTLSEEKGA" BASE COUNT 4090 a 3545 c 3253 g 4329 t ORIGIN 1 tcccaactcc caactcccaa ctcccaactc ccaactccca actcccaact cccaattatg 61 ttaattcttg cgtattaatc tttacgtata attgttgaca atggagtatg tttgcgcata 121 tccatcatat tcgccagaga ttctttaccc tgaccctgtg agacattttg ctctggtggt 181 aaaactgtga ctacaggttt tgtttgaact ggcattgcta caacaggcgt ctttggtttt 241 gatggcacaa acgctggtac actcttgttg ggacggggtt ctaatcctgg actacgtttt 301 gctaagcgtt tctgatagcg gttaatctga tttctgactt cttgacgctg tgctggacgg 361 tttagcagta ggaataatat taaggtacca ctaccacaac tcagagcaat agcggccact 421 agccataaag gtgtaggatt gattgcgttg agtggtgctt tgattggttt gataatttca 481 gcttcggcta tttcttcttc ctctggttgc atacgcccca catagcccaa gctgtacaac 541 gcagcagcac tactacctag aaaaatcaat aacaatcctg ctggcaatag ccaagtatga 601 tgtacgagca gatttatcag gggatttttg ctcagagttc gtcctggaac tttcttcccc 661 gactcaggta taggtgcttt tgattgcgat 
gactgctgct gtacactctg gctattttcc 721 atagttgctt tgctcatttt taccctccag ttataaattg tactgaaggg ttaaccagga 781 tgtatccgta gaaacctatg ttatttaagt atagtttttg gaaacactgg tgacttaatc 841 tgatgttttt ttctagaaaa ttgtactttt tgatacttta ttccaaaaac gacgagaaat 901 cagttttcct tttgcgttgt tgagaaagca gaatgtcttt ccagagccag ctggattaat 961 ctatcaacta attccgggaa agagacaccg ctatgcgccc agagtagagg gtacatactt 1021 gttgctgtaa agcctggtaa tgtgttgatt tcgttaatga aaatttctcc tgtcgcttcc 1081 acgtagaaaa aatctaccct tgccaatcca gcagcgtcaa cggcaataaa ggcttgcaaa 1141 gccatttcct gaatttgacg agtcacagct ggcgatatag gtgctggaat cagtaaatct 1201 gcttttcttt ctgtatattt agtttcataa tcataaaagt cgcttgcgta agtaatttca 1261 ccaacaattg aagctttcgg actatcattt cctaaaaccg cacactctaa ttctcgtgcg 1321 acaactccag cttcaacgat gatccttcgg tcatagctgg cggcgttatc taatgctgct 1381 tctaattctt ggggcgatcg cactttagca ataccaaccg atgaacctaa attagcaggt 1441 ttgacaaagc acggataacc caaagttgct tcaatttcat cacacagttt cgggaaaaca 1501 caaggattcg accaaacttg cgctctattt aacgctttgt attttacctg cggcaatcct 1561 gcttgggcaa aggctgtttt catggcaatt ttatccattc ccaccgctga acccaacact 1621 ccagaaccaa caaaggggac ttgcattaag gtgagtaatc cctgtattgt cccatcttct 1681 ccgttaggac cgtggagaac aggaaaccaa acatcgactt ctgcaacttg ggaaggagat 1741 tgccaacggc tgagaatttg agttttggct actgataatt gctcattaga tgtagttggt 1801 aattgcaacg gtgttccaga ttccaaaact tgttgcggga catctcctgc tagccaacgt 1861 ccatctttct gaatgtagaa aggcaaaatt tcgtatttac cagtattttg ctctgtcgtc 1921 aaggctctgg cgatcgcccg tgctgaactt atagaaactt catgttctcc cgaacgacca 1981 ccaaagagta accccactcg cagcttcgtc atctcaaata cctctctcac cctcaaattc 2041 agatagcgta gcacatctga gtaaagttcc tctatttttt tctgttgctt tttttgattt 2101 agttatacgg ctcccttgat tatgacgaca gctgtcgaaa ctatgaaaat catacaaatc 2161 cctgaaagta ttgtcaagtc gtcatttatt gatgctattc tcaacaacta taaaaacttt 2221 acaaacttca taaagatttg taattgcaaa ttttaaaagc gaattccttg ttgaattagt 2281 tctgtttata taagactctt aggtttgttg taatattctc gaagactgag aaagaacctt 2341 tgcgatacgc tttgcgtcaa gccagaggct tatcgcctag 
gacacctcag caaagttgct 2401 gtatttttcc atgtagaaat tttttgggag ttcaggaaac aacgccagag tttgctaact 2461 tccatcaatg agcacagata gagcttgacg agcatgagtc gcccccgccg gggttccttc 2521 tattgttgcc attaagctag cgcctccaat caaaagcaag tattgttggg agaatgtctc 2581 tggatcgcgc acccccgctt ctgctgctaa cttcttgata tgagtgcgaa ttgcctcacg 2641 tagccttgct gaaacgtggt ggactggatg attggcattc gctaattcaa ggactgcgtt 2701 aataaatggg catccccgaa agtctggttg ggcataccat tcccccagga catcaaatgt 2761 tgctagtaac ttttcctttg cagtttttcc cctttgctcg acagactcct cgaaccattg 2821 taaccattgc ttttctcgat aaagcatgac ttccaagatg agctggtctt tagaggggaa 2881 atgcttatag aaagtccgtt tagcaacatc tgcagcagca attacctcat tgataccaac 2941 gtgttgaatc cccttttgat aaaacagctt tgaggcagtc aaaagaattc tttcgtaagc 3001 gttgctaggt ttcggtgtca taaatcaggt tttggtagac aattttgtct acttttgata 3061 agctgctaga cgtagacaga tttgtctacc tttttaacat gctactcgca ggggtggata 3121 tgagtcactc acccccatta tgtaatgccg tgttttggac atataaccta ggagaaaagg 3181 aatggacttt accatataca ctgtggaaac agcgcctgaa aattccaagg aagccctaat 3241 taaagcaaag gaagtctttg gtttcattcc gaatctagag gggatttgtg ctgaggcacc 3301 tgcgctcttg aaagctggta tggctttatg ggatctcttc agcacgacaa gctttagccc 3361 aattgagcag caggtgattt atctggcagc aaactacgag aacgaatgcc attactgcat 3421 ggcagcacac tctggtttag ccaagatggt tgggatgtca tctgaagata tacaagcgct 3481 gcgaaatggc actctacttc gagatccgaa gttgcaggct ttgcgtcact ttactgggcg 3541 tatggtgcaa gcccgtggtt gggttgaaga ctacgaaatt gaatctttca tggctgctgg 3601 ttatggcaag cagcaggttc ttgaggtgat tcttggcatc gctgtcaaag tcattcacaa 3661 ctacacgaat cacatagcta aaaccccact tgacaaagtg ttcaaggcaa atacttggtc 3721 aaaactaaaa cccataactc cttgagcata accagttagt gaagaggtca atggcgtggg 3781 gcttaagcgt gatttacctt ttacactggc aatgcttccc gcacttccac aggttcaccc 3841 gttaagctaa aggcgcgaac ttcggtaatt cttaccttca ccaacttccc tttaagttca 3901 ttgatgtcac cagcgaagaa agtgagacgg tttccacggg tgcgtcccat cacttgagtt 3961 ttgtctttgg agttttggtc ttccactagt acttcttcaa cgcgtcccat gtaacgttgc 4021 gatcgctcgg ctgctttgat cgcaacgaga tgattgagtc gttgcaggcg 
atcgctctta 4081 gtctcttcac tcaattgctc atcccaaatt gctgctggtg ttcctggacg tggagaatac 4141 gctgctgtat tcagcagatc aaagccaata tcttctacta gtttgagagt attttcaaac 4201 tgttcctctg tttcacctgg aaaaccgaca atcgcatcag cactaatcga cgcatctggc 4261 atatactgtc gaattgtgtc gattatccgg cgatatttct catgagtgta accccgactc 4321 atgcgtttta gaacttcgtt atctccagat tgaaagggaa tgtggaagtg ttcgcacacc 4381 ttgggtaatt ccgcgcaagc tttgatgagg cgttcggtga aataacgggg gtgagaagtc 4441 gcaaaccgaa tccgctcaat tccctccacg tcatgaacgt agtaaagtaa atctgtcaaa 4501 gtgtgcagat ggcgaccttc tggcgtgact cctggtaaat ctcgcccgta agcatcaata 4561 ttttgaccga gtaaggtaac ttctttgaaa ccaagccgcc cgatttcttc aatttccgtc 4621 cgaattgcct gaggtgtacg ggactgttcc acaccgcgta cattaggaac cacacagtaa 4681 gtgcagcgtt cattacagcc gtaaatcaca tttacccaag cggttacggt gctatcccgt 4741 cgcggcttgg tgatatcttc cataatatga acgggttcag ttgcgacaac ttggttgccg 4801 tcaaacactt gttgtagtaa atcttgcaga cggttagtgt gttgcggtcc catcaccaag 4861 tctaactctg gtacacgtcg caacagtgct tcgccttcct gttgggcaac acaacctgaa 4921 acaacgatag ttaagtctgg ctgctcatgc ttacgctttg cttgtcttcc aagatatgaa 4981 tatacctttt gctcagcatt atcacgaatg gaacaggtgt tgtagagaat taaatctgct 5041 tgattcgggt cttctgacca ctcaaagccc atgtcttcta ggatgccagc catacgttct 5101 gaatcggctt tgttcatttg gcaaccgaag gtggtgatgt gataacgacg ggatgaagtg 5161 gtcatggtaa cttacaaagg gaatgagata gcgaattgca ctactgaaaa ttgtgacatc 5221 ctcccacacc caacgggaga gtcagtcacc tacggcggga aacccgcctg cgccgtgctg 5281 actcaccaaa tcagagatta tggtgtgggc ttccccaaat cactttgaag atttcctggt 5341 tgttctcttg cgagatttcg ccacttgcct agactgattt tcatcagtcc ccggtagatc 5401 gactgcgcag gctgtttggg ttcccgtctg cccaacggta ctaatgagtt tgattcctct 5461 atttcttatg ttttgacctg ctgccacatc tctatctatt atgtatccgc aatcaggaca 5521 gctatggact ctaacgtaat ggtctttttt gacttggcca ccgcattcag gacactgttg 5581 agaagtcccc ctagcatcca cttcactgaa gaactttcct cttttccaac acacatactt 5641 gacgatagct ctaaactgac caaacgcggc atcaagcata tgcttaccaa acatcccttt 5701 ggcactggtg cggtaatcca gatcttccat gaagaccatg tcacccgcat cacaaagcgc 
5761 atgagccact ttgaagtgaa agtccttcct cgtgttgtcg attttgtgat gaagtcgagc 5821 aaccgttagg cgctgtttct cgtagttttt cgaccgcctt tgtaatctcg ccaacctgcg 5881 ttgcagcaat ttcagcttgc tttgcaattg tttaaagaac ttggggggtt ttacgagaac 5941 gccgtcacta gttgccaaaa acttctccaa cccaacgtca accccaatca catgaccata 6001 tggaattggg tctggaatat taacatcaca ttgaacagag atagatgtat accacttgtc 6061 agcttttctc aagattctta cttgcttgac cacaaatcta gaagggatgg gtctgtgcag 6121 attaattggg attgctccaa tcttgggcaa cttaatatgc aagttcgtga ctggattctc 6181 cttgaattgg ggaaaaagca acgatttaaa ctgcccgtgt tttttgaatc gaggaaaacc 6241 aaaacccctc tgctgaaagt attcccaacc tttatgcaat tgcttaatag cttgttggag 6301 aacttgagaa ggaacttcgc tcaacttagg aaattctttc tttgccttag gcaagttgtt 6361 aagctgatag atttcgccgg gaaacttcaa ttcaggagag aggatatact ctttttcaag 6421 agagcatcta tcaatcaaac acttacgact gtcacaccaa tcctttatct ctcgaagtgc 6481 atagttatag gctgttcgac agatttccat ccactcgatg agtgtttgct cctgagtgat 6541 gtctggatag attctgtaac ggtaggtaag tgttatcatt ccagtattat agaagattta 6601 ctagacagca ggcaagcttt aaatgttaat tttagggaca tcgagtcccc caagttaacc 6661 tggggagcga gtgtggtctt gggggtttcc cccatgaaca tctcgcgtgg agtccatgta 6721 tcccacgccc tcgcttaatt caaagttcaa aattcgctcg ttcaaaataa tagatctttt 6781 tttgaatttt gcattttgaa ttttgaatgt ttgagaagcg agattttggc gtgggtctta 6841 tagctccccc aaaccccacg taagataaag cacttttaag actattactc caccaagaca 6901 agtttgagga gtaaaggaat gaactgcaga ggcgctccag agcgcagagt gaagagaaaa 6961 gaggatgagt ctaaaaatga gtttacttag caaagttgct gtggcgcact actatggcat 7021 cgctaccatt aaatttatgg actgtaaaac acgcagctta atattatgtc acaacaaacc 7081 ctttcaccag cagcagacaa acaaaccaaa acaaaaaagg gtaagaagtc acttccacca 7141 aagctcatca ttaagctagg aaagtttgtc tggacaacta tttggcattt gatgatgtcg 7201 aaattagctc cccgtaacaa gtcaggcgag tatattcgac caagcagcga atttagaaac 7261 tctgttggga tagagcaagg gaatgtgtac caagcagcaa cagggcgtta caatctgatc 7321 gcagggctgg gttgtccgtg ggcacatcgt actcttgttg tgcgatcgct caaaaaactt 7381 gaacaagcaa tatcgctcac aattgtgtca ccctccccaa ttgaaggagg ttgggtgttt 7441 
aaccaagaat atgaaggttg tcgcacgctt gccgaacttt atgaattagc acaacctggc 7501 tacggtggac gctttacagt tccagtgttg tgggactccc aaacaaagac gattgtgaac 7561 aacgaaagtt cagagattat tgtgatgctg aactcacagt tcaacgagtt cgcaaacaat 7621 cccacactag acctctaccc acaggaactg aaagaaaaga ttgaccaatg gaatgaaagg 7681 atttacacaa gcgtaaacaa cggcgtgtat cgttgcggct ttgcccaaac acaggaagcc 7741 tatgagcaag tttatcatga attgttcacc actcttgatg aaattgacac agtcctaaat 7801 accagtcgat acctttgtgg agatactgtc acactggcag acgtccgttt gtttacaaca 7861 ttattccgct ttgacactgt atactatgcg ctttttaagt gtaaccgcaa aagaattcag 7921 gactatcaga acctgggacc ttaccttcgt gacttatatc aaatcacagg tattgctgac 7981 acctgcgact tagagagtgt aaagcaggat tactacggaa acttgttccc actcaaccca 8041 ggtggtatta tcccctctgg tcctgatcca atgtttcttc aagaaccaca tcatcgcgac 8101 aagatgagca agcaagcaaa tcctgtataa aaagcttcct actctggttg caaagtcgtg 8161 ctcaaagtca ccgcacaatc acccattaaa gcctctcgca ttgctttttc atcgctaaaa 8221 gtctcctgta gaagtcgtcc ctcataaacg agttctacct cactgacctg aaaagggcaa 8281 ggagatactg cgacttgata atggttatct ttgacatctt tcaaagtcag tttgaccgta 8341 gtttcaccgt caattgttgg tattccagaa agatatgcat cattagttaa ttcttgacat 8401 aagatgatag atagtgcatc ccatatcgct actaactttc tattacgctc aacaacttcc 8461 ggttttgtgt atggtgcata atattcgtca tttttgagaa aagcaatcac ttgttcttgg 8521 aaagcatact ccctctctaa aaagttctgc acaatttgag aagaagttgg tgagttctgc 8581 caacttgtaa atctttcata caaacttgtt ccatgaagtg agataaggag tgctacatat 8641 ctaccccaag gaagtgcaag ttgtctagcg ccagaccaaa tatttatatg ttctggcgtt 8701 gaaagttcca taaaattatt gggatagcct gtttgcgggt tgagtgtagg tgcttgttcc 8761 cagaaaagcc aaccaatatc atgtagttga gcagctagac aaacttcttt tttgggagca 8821 aactcaccaa aacgttcatt tccccaaact tgtgctaact gacctgcaac ccaagcgtga 8881 tttggttgag tgatgcaaat aagtccttgt tttgttaaac gatgcagcat aaatagaaag 8941 caaaaaatat tgttctactc tttattctga actaatcgct aaacaaaaaa cttatgctaa 9001 ataatgtaaa aagagctatg tagcagtcct gtttggaacg aaccgccaag tcgccaagaa 9061 cgccaaggta tgaaagaaga agagaagaac agaacaagat aatttcacta ctcctcaacg 9121 gattgctata 
taaaaagata tgaaacttaa aaactcagct tggaatttgg tcatgttcaa 9181 gcgacttttc ttagtattag gctgtactct cgctggtact ggtgctattg tattcttctt 9241 ttggcaacaa gcgactcagc tacctgtttg gtactcaaat ccaccgacaa catcaagttt 9301 acctcctcaa acagaccaaa acaaaaccca gattcaacta tcacaacagc aagtcttgag 9361 taaaatttat gacaatttaa aaggggcaaa tgctaaaggt gaagtgcaac tagatgcaaa 9421 tgaagtcaat acgctgattg tttcaggaat tgctcaaacg actgataaaa gtcggcttgc 9481 tcaagcagtt gtgaggacaa atactcaaat tcaagatggc aaaatctccg caggtgctgt 9541 gatagatttc agaacaattc ccttaaacga gttgccatcc caggaacaag ttgcaatttc 9601 taaactgcta tcgactgtgc ccatcttaaa atatcgcccc gtgtacatag aggttgaagg 9661 aaaacccaag gtacacaata aacaaatcag cttggatgag acaactcgcg ttaagtttgg 9721 taacgttagc ttgactctct cggatatgta tcaacgtttt gggctttcag aaaaacacct 9781 caaccaacaa gttgcaaatg aattgaagaa acttccagtg gaggtgaaag acgttgaagt 9841 tatgggcgat cgcctcatcg ttcgtggtta gttatcgtaa attaaagggg ggtttaaatg 9901 cccaacgaaa actgtcttta attttgaatt ttgaattttg aattttgaat tgttaacacc 9961 tcccgtaggg tactcagtaa tgcctgaact gtgaagggtt ttggcaagaa gtattggatg 10021 ccaagacttc tgttttgagt cgtcgtcgaa ttgctcatca gtccgctcat tgcaatgatt 10081 tgcacctggg gattcatgcg ttgcaacgtg agaatcgtgg tagaaccatc tatccctggc 10141 atcatcatat ctatcaacac cacactgatg tcattcttgt actgagcgta gagtgcaagt 10201 gcctcaattc catcgctggc ggttaaaacc ctataattat ggctttccag ggcagtttta 10261 gtgatctcgc agattgtggc ttcgtcatct accgccaaaa tcaattcgcc attgcccctt 10321 ggtagttcta catcggttgc tacctgggtt tcagtgactt ggctactcgg caaatatacc 10381 tcaaaggtgc tgcccttgcc catctcgcta taaacattca caaaaccgcc gtgactttta 10441 acaatgccca tgacagtaga aagtcctaaa ccagttccct tacccacttc tttagttgta 10501 aaaaaggggt caaaaatgcg atccataatt tccggcccaa tgcccgttcc ggtatcagta 10561 atagtcatca caacatagag acccacttta gcatccacat acattcgagc atagctttca 10621 tccatacatt ggttttcggc gctaatagtt agggtaccgc catcgggcat ggcgtcgcga 10681 gcattgacac agaggttcat tagcacctga tgaagttgag ttgcatctgc acaaactgtc 10741 caaagatcgg gtgcgatatt agtttcggtt tcgatagatt ttgggaaggt tctttgagcg 10801 
acctgtgcca catctaaaag caaatgccta gcctggatga tggtacggct gccttccact 10861 cctcgacgag taaacgacag aatttgcttg accaaatcag caccacgctt ggcactgcct 10921 tccagtatgc tcagcagttg ctgagtgttc tcatcaaggt tggggaattt aagggggagc 10981 agttgagcaa ctgccaaaat gggagtgagg atattgttga agtcgtgggc aatgcccgaa 11041 gcgagagtgc ctaagctttc tagccgttgg gcgcggagaa actgggcttc gagttgtttt 11101 ttctcagtaa tatcagtact tactgttaga atggatttgg gattgcctaa gccatcgcgc 11161 accagtgtcc aacgactttc aacaacgatg gtcttgccgt ctttggtgac tttctgaatc 11221 tctccctgcc agttgccctc ccgaatcagt gttgcctgaa tcgcttcaaa ttccggcaat 11281 gtttccccag gtcggtacaa cagtttaatg gcatttttgc ccaagacttc tgctgttttc 11341 catccgtaca agcgctccgc acctttgttc cagaagagga tatggtgttc gagatctcga 11401 acacaaattg cgtcagtagc gacatcgagc aaggctgctt gttcgcggat tttctgttct 11461 gcttgcaggc gatcgctgat gtcgtcaatc agaactaggc ttgtccattt acccgcaaat 11521 gagaaaatgt gagccaatgc cttgacatcg atcagactgc catccttctt gcgatgtttc 11581 caaaccccaa ggtcgttctg tcccggtgtg aagttagata aactctggtg cagtgaggtg 11641 atgtaggcgg gaggaataat gtcggcgagc gtcatgctca aaaattcttc tttggaatag 11701 ccgtaatggg cgatcgccgc ctgattcacc gcaagaaaag ccagggtttc cagttcaaag 11761 acccacatgg ggttggggtt actttcaaac agcaagcgat atcgtgcttc cgagtcccgt 11821 aattcattct ctgagcgctt gcgatcacta atgtctctgc caatatgcga tgccccaagc 11881 tgctgatgat ttgcattgcg aatcgaacta taggtaatct cgtaataatt ccgctctaat 11941 cgttcatccc caaattcctc aatcaccgtg aactcttctc ccctgagtgc ccgcccccaa 12001 attgctacgg ctttggcttg ttcttctggc aaatgcgcga gtgcgtctat caaactcatg 12061 cccactgcaa tagtcttgcc gaatatctgt tgaaactcct gtttgtatgc actgttgaat 12121 gcaataaacc ggaactctag atctagggct gcgatgaggt catgggttcc ttcaataatc 12181 ccggtgaggc gatcgttgac caattgcaca tcctggtgag cttgctgaag ttgttcctgt 12241 gcctgtttac gttcgctgat atcaataacc acacagcgac tcatcaggtc gttgcctgta 12301 ctatccgtca tcgcagtggc gctcaagctc acgggcagga tagtgccatc cttgcgaatc 12361 atttggaact ctagatctcg tactttgcct cgctgctgtc caagcgggag attttcttgg 12421 aagctctcta agctctcggc agtcagtaag tcggaaaact ttttcttgcc 
aattatctca 12481 tcccgcgtgt accccaacat gttaagttct gtgtcgttga tgcggacgaa gacgccattt 12541 ttgtctaggg agtgatagcc acagggggca tggttgtaaa gttcttctaa ctcatctgta 12601 tactttcgca atacctcgtc tgcttcctgt aacgcctgtt ctgcacgata gcgtttctgg 12661 cgttcctctg cttcccgcaa ctctcgttcc actgcgggta ccaagcgggc gagattacct 12721 tttatgatat aatcatgtgc cccagctttc atggcagcaa cggcagtatc ttcaccaata 12781 gtgccggaga tgataataaa aggtaaatcc agtttccggc tttggaggag tttcagggct 12841 tccaaagcat tgaaagcagg cagcgtgtaa tctgcaatca caatgtccca cgattgttga 12901 tcgagaactg cttgcatggc atctcgggta tccacccgaa catagtccac agtatagcca 12961 ccccgacgta actccctgac gcttaagaga gtatcatctt cagaatcctc aacaatcaga 13021 acacgtaggt aagagctcat cgtgactgct ttcctaaatt tgcggtagtt cgttcataag 13081 gagccagtac atccccaatt gccggactgc ttcggtaaac tcaataaaat tcacgggttt 13141 acggacataa ctgttgcatc ccaagctgta actattaagc atatcttgtt cttcactaga 13201 agtggtcaga atcaccactg gcagaagctt cgtgcgttcg tcttctcgca agcggcgcaa 13261 tacttcaatg ccatcaatgc ggggtagttt cagatccagt aaaatcaccg taggcttgag 13321 gctgatatct cgtccggcat aaactccggt tccaaacagg taatctaggg cttcaacacc 13381 gtcatgaact actacgatct cattgctgat gtgattccgc ttcaaagctc ggattgccaa 13441 agcttcatca tcgggattat cctccactag aagaatagtt ttgttgttcg cgctcatgcc 13501 cctttctcct ccgataatgt aaagtaaaaa gtagctccct gttccacagc cccttctgcc 13561 catacccgac caccatgccg atgcactatc cgctgcacag tggcaagtcc aatcccattt 13621 cccggaaatt cattcatacc atgcaaacgc tggaagggag caaataactt gttcgcgtat 13681 gccatatcaa aacccgcgcc atcatctcga acaaagtaga cgttgacgcc actttcgctc 13741 ttgctcactc caaacgaaat ctctgcctgg gggtgcttgg atgtaaattt ccaggcatta 13801 ttcagtaaat tcaccaacaa cacccgcagc agacgagtat ctccttgagc taccagccct 13861 gtttggatcg caaactttac ctgccgttcg ggctgactgt gttgtaactc agtacagatt 13921 ccactagcta acagactcag atccacagat tcggggtgca tttcgctgcg cgtcacccga 13981 gataggttga gcaggtcatc gatcaattgc cccatccggt gagtcgctcc tcgaattcgc 14041 tgtaagtagt cctgtccggt tagatccagc aggtcaaagt agtcttccag cagcgcttga 14101 ctgaagccgt caataccgcg taaaggagct 
cgcaaatcat gggagacaga gtagcaaaaa 14161 gcttctagtt ccttattcac tgcttgtagt tcaatgattg ctcgttgcag tccctgattt 14221 aacgattgta ctgcttgctg aacctgtagg cgttctagta cttctgcccg taagtcagcc 14281 aaaagttctg cgtgctggag agctaccccc aagtgattgg caatttggct gacaaattcg 14341 atttctgtcg ctagccactc tcgctgagcg ctacattggt gaatacagag caatccccag 14401 agtttttcac cttgcaacag cggtaccgcc aggttagccc gtacctgaaa ttgagagaga 14461 acttggatat gacagtcgct caacccataa ttatagatat ctgcaaccgc ctgaattcgc 14521 ccctgctgat aatggactgc aaactgttcc ccgaagcaat gatcgtggat tttttgcccc 14581 atagctgaag gaaatcggtg atctacatcc tcagaaacaa attctccatc atcccagcct 14641 gagtttgggt agaagcgaaa catacccact cggtctgctt tcaagagttg gcgaacctca 14701 gtcgctgtgg ctttaaaaat tgtttccaaa tctagaggtt cccgcaggcg agtaatcaca 14761 cggaacaatg ctttctgctg ttccatttgg aaatatgcct gatttctttc ctgttctaag 14821 tgttggactt cttctgccag gatctgctta agcgtctcaa atgccgcctg atctcgacac 14881 aagttttgca actttaccaa tatctctggg ttcattcgtc tggcggtagt taagttgggg 14941 aatttacaaa aaataagcta aacacttgcg tcagccaggt cagacagatt gagaagtgtt 15001 gcacaatttt aagggcttag cacaaagggt gcaatagttt cttagaaaca tacataatat 15061 aacttgatat aacttggata ggttagactt ttccaagttt tatagagaga gggaacgctt 15121 aacagggaac aaggaacagt gaacagtgaa cagtgaacag taaacagcta cctacgcagc 15181 gatttcttga ccccccttaa cttggtaact gataact // LOCUS NODE_2207_length_15161_cov_5.36349815161 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 15161) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 15161) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method                   :: SPAdes v. 3.11.1
            Genome Representation             :: Full
            Expected Final Version            :: No
            Genome Coverage                   :: 17x
            Sequencing Technology             :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference
                                                 protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..15161
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            127..507
                     /locus_tag="DP116_18795"
     CDS             127..507
                     /locus_tag="DP116_18795"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015131014.1"
                     /note="Derived by automated
                     computational analysis using gene prediction method:
                     Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_18795"
                     /translation="MNSFSSLASQPQNQSAHDILINVIIKQSSDGKIIATVPGLPELQ
                     VEASNKITALALLQQRLEAHLEGAEIVPLPVKLPSREQKNPWLEMAGIFKDDPQFEQM
                     LAAIESYRQELDQNIENHSSQELG"
     gene            529..951
                     /locus_tag="DP116_18800"
     CDS             529..951
                     /locus_tag="DP116_18800"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015131013.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="type II toxin-antitoxin system VapC family
                     toxin"
                     /protein_id="PRJNA477356:DP116_18800"
                     /translation="MTVWVLDTDHISLLQRGHPVVIRRIAAVNPAEIAVTIVTIVEQM
                     YGRLDVIKRAKSKQELVTAYALLKETFSRLYQGNILDFSEAAFDIYTQLLAGKIRIGT
                     QDLRIAAITLSVGATLVTRNRKDFEKVPGLQIIDWSIP"
     gene            1057..1410
                     /locus_tag="DP116_18805"
     CDS             1057..1410
                     /locus_tag="DP116_18805"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_18805"
                     /translation="MKLNEVLACATKSFAFIGVDGMECDAQAPTSGDSFAAAFGRSRF
                     LGLTGWNAMRKPPPRAIAFIGVMWTRKYSRLSGLRHIFLFFVKPYLDAFLLSIPCCSS
                     YVLFKLLPCKFLALF"
     gene            complement(1373..2740)
                     /locus_tag="DP116_18810"
     CDS             complement(1373..2740)
                     /locus_tag="DP116_18810"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006196441.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="NAD(P)/FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_18810" /translation="MVVVHENNALHRVVIIGGGFGGLYTAKTLAKANVNVTLIDKRNF HLFQPLLYQVATGTLSPADISSPLRAVFRKSKNTKVLLGEVNDIDPEAKEVILRDRII PYDTLIVATGANHSYYGHDNWRPLAPGLKIVEDAIEIRRRIFSAFEAAEKETDPELRR AWLTFVIVGAGPTGVELAGAISELAYKTLNDDFRNIDTSETKILLLQGGDRVLPHMAP QLSKVAKESLQKLGVDTQTNTRVTNIENDIVTFKQDDKIQEIAAKTILWAAGVKGSAM GQVLANRTGVECDRSGRVIVEPDLTIKGYKNIFVIGDLGNFSHQDGKPLPGVAPVAKQ EGEYVAKLIKKRLKGQTLPQFRYNDVGSLAMIGKNLAVVDLSFIKLQGFLAWIFWLVV HIYFLIEFDTKLLVVFQWAWNYLTRSRRSRLITGREAFEETKTVTNDVPSTTKKVQET YKVGV" gene 3360..4062 /locus_tag="DP116_18815" /pseudo CDS 3360..4062 /locus_tag="DP116_18815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748221.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="helix-turn-helix domain-containing protein" gene 3986..6727 /locus_tag="DP116_18820" CDS 3986..6727 /locus_tag="DP116_18820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012597826.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase C39" /protein_id="PRJNA477356:DP116_18820" /translation="MVHNPTQAHLLVLKQLNNALGYSLSEEEFQRCFQAAKIFNPKVG KFWQGNHVEPGIYIVVAGKVRLLNDAEELIASLEVGASFGEFTLFPDSDFQPYKARAA LNLQVCFISKEVLLPLMAKHPQIREHLQNQAQTRNSRLLGRNDSQTTFVNVDKPYPKI NVSSTSAVQPKQGKRISKAYFPNPTQRVGHFWQRMIRRYPFFAQQSGSDCGAACLVMI SRYWGKRFSINRVRDIANIDRSGASLRGLSAAAESLGFGTRPVKASVDQLAKQKLPAI VHWFGKHYIVLYEINKRNVIVADPAIGQRTLSHAEFKAKWTGYTLLLEPTALLKDAKE TTTPFWQFFELIKPHSVVMLEVLVASIFIQIFGLVTPLFTQLILDRVVVQRSELTLTA VGLGLLIFSLFRVAIMGLRQYLLFHTANKLDVALIVGFIRHTLRLPLSFFESRYVGDI ISRVQENRKIQRFLSGEALSILLDFLTVFIYLGLMFWYSWKMALLALVIIPPFFLLAL IATPFLQKISREIFHAVTNESSYLIEIMTGVRTVKSTAVEQTVRWHWEELLHKEVKTN FSGQVISNGLQIFSNTIQAVATTLLLWFGASLVIQNQLTIGQLVAFNMLVGQIISPFQ RLTVLWNQLQEVVIATERINDVLDAEPEEDFQNQTRQFLPDIHGHICFKNVTFRYHKE SDINILQNLNFEIQPGQMVALVGRSGSGKTTISKLVLGLYPPTDGQILIDGLDITSIS LRSLRSSVGVVDQDTFLFGSTIRENISLGHPGATLEEVIEAANLAGADEFIKKLPMGY ETQIGEGGGLLSGGQRQRIAIARALLGNPRLLIFDEATSHLDTESERIIQRNLNTILK GRTALVIAHRLSTIRNADLILVLDKGVLIESGTHEALMAKREHYFYLNQQQQLNMAA" gene 6917..8423 /locus_tag="DP116_18825" /pseudo CDS 6917..8423 /locus_tag="DP116_18825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748219.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="HlyD family secretion protein" assembly_gap 7408..7417 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 8468..9205 /locus_tag="DP116_18830" CDS 8468..9205 /locus_tag="DP116_18830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207755.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_18830" /translation="MLNPVTITSEDVLQQVKLSCKIPEIIQEIVTRKIVAVAAEKIGI EVEDEVLQKTADTFRLINNLGSAEETWLWLQKHHLSIEDFEQIAYTSSIYGELVKHLF ADKIEPYFFENQLDYIGAVMYEVVLDDEDLAIELYYTIKEGEMSFYDVAHTYVQDTEL RRKCGYRGTVYRKDLKPEISAAVFAAKPPQLLKPIVTSSGINLIFVEEIVQPQLDDKL RNKITTDFFSEWLKQQIDQMKIIPDLS" gene complement(9406..9627) /locus_tag="DP116_18835" CDS complement(9406..9627) /locus_tag="DP116_18835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18835" /translation="MAKITIAQLPTSDSYINELSNTELDATKGGDGNNGNTIILGGGN SGNNINIGDGKAGDNYGGYGYYPYYRYYY" gene complement(10025..10960) /locus_tag="DP116_18840" CDS complement(10025..10960) /locus_tag="DP116_18840" /inference="COORDINATES: protein motif:HMM:PF00089.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase S1" /protein_id="PRJNA477356:DP116_18840" /translation="MKRQFILPFLAGLVGTMLLGVWATLSVQAQSKPVPPKFTTVTNL ADFNLVIDKKPVSPSDVKEIDEPPDARQAIIGADDRIPMTSRKKPWSAVGKIEGIDAD GQDYSCTGTLIADDLVLTNSHCVVNPDTRKVSRAIAFKPNLINGQVRDKNDIAYATTA KAGTDFKSGTLADYVDDWAILKLDRPLGKKYGVIPLKSLPSFDLVGDTQKFALVGYSA DFPNPKKKEYQEFTAGERMTAGVHLGCSILRQKDNLLYHNCDTKGGASGGAIIGNIGG NYYVLALHSGWNTVNGLKLNRAVEISRIQPALRGN" gene complement(11039..14695) /locus_tag="DP116_18845" CDS complement(11039..14695) /locus_tag="DP116_18845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18845" /translation="MNRSAKLRCIFCCSFSKFSRYSLTLLLTAVLLSDSVVATPRNAN LQIAQQSNTKPENTLTPEQKKLEHEGLKLLNEGIKLANEGTLESKQQAIQKDEAALKF ARQLPDKRLEVVVLQNIGAVYSLLVEYNKALEYEKLALAISREQKLTSQEADTLWGLG VTYSNMDDNQKALENFNLALSMYRSEKQPEKEADTLKFIAKIYEEQFDKHQEAIQAYK QALALQQNDPPSQASTYWFMGTTYWKWGENKNALDSSDKALEIYRRINDISGQVTVLE FRNSVYTTLGENQKALEQLQQAQRLLQQVPQDRLSQASILVNFASTYRSLGEYQKALD YLQQARSLSKKVGARQREISALRQISSLYQVFIGEYGKALDALEEALTVARAINNRVE EAEILNNQADIYASQGEYQKALDTFNQALTIQRQLKIRGGQADTLSNMAKLYRSLGDY QQSINTSQQALDLYRQLGDRKNEVFSLSSIGDAYHQMKDYPQAIEYYNKALSLSQQIG GLVQPSLLFGDLGRTYLSLKEYDKALNNASKSLSMVRQQKDKHLESAGLALQGKIYRE KGDYQQALLLFEQSKSLIQQLGNKYTEAGALRQMGKTYNSLKQYQTAIDNHNQELAIR KTLGNKAEEAATLYQIAVNERDRGNLQAALTYIKQTTEIIEGIRTKVTSQDLRSSYFA TVQDYYQFYIDLLMRLHKKDPSKGYDAEALHISERSRARSLIELLTEAHANIRKGADP KLLAEERRLQFLLEARQKRLASLFESKIKVSEQQIATLNTEIANLLNQYRELETKIRT NTKYANLKYPNPLTLPQIQQQLDKDTLLLQYSLGEERSYLWAVTPNSVQSYELAGRKE IEQKVEDLRKLLSDSGMNKVSPEQTAKAADQLSQLILAPVAKDLGQKRLLIVADGALQ YIPFTVLTVPQSSVSAQNYQPLLLNHEIVSLPSATTIDILRQELKGRQKAPKTLAILA DPVFSNTDKRVTGVVKNPALNKNDQPTTQTSQTAIELDKSALMRATRDIKIGNFLRIE GTRKQAEEIMKLVSQPQRLHAFDFDANYTWATNPQLSQYRYLLFATHGILNEINPELS GIVLSQVDKNGNQQQKSFLQLPDLFNLDYPAELLVLSACETGLGKEVKGEGLLGLTRG LMYAGAARVVVSLWKVDYEATSKLMSEFYKEILQQGKTPAAAMRAAQLEMWQQEEWRN PYSWAAFTLQGEWR" BASE COUNT 4262 a 3030 c 3208 g 4651 t 10 others ORIGIN 1 gagacagagg cgtagccgtg ccgcaggcat agggcgcgca ctctcggacg cttatcgctg 61 cgacaatagt aattacaaag caatttgatt taaaatcaaa ctaactacca taaacgaaca 121 gtttttatga attcattttc atctttagca tcacaaccac aaaatcaaag cgctcacgac 181 atattaatta acgttattat caaacaaagt agtgacggaa aaataattgc aacggttcca 241 ggtttgcccg aactgcaagt agaagctagc aataaaatca cagcattagc cctacttcaa 301 caacgcttag aagctcattt agagggagca gaaattgttc ctttaccagt aaaattacca 361 tcacgcgaac agaaaaaccc gtggttggaa atggcgggaa tttttaaaga tgacccacaa 421 tttgaacaaa tgctcgctgc aattgaaagc tacaggcaag aattagatca aaatattgaa 481 aatcattctt cgcaagagtt 
aggataatag tctcaggtat aaaatataat gactgtatgg 541 gtgcttgata ctgaccatat ttctctgctt caaagaggtc atccagttgt tattcgtaga 601 atcgctgcgg ttaatcctgc ggaaatagca gtcacaatcg taacaattgt cgaacagatg 661 tatggtcgtt tagatgttat taaaagggca aagtcgaaac aagaattagt cacagcttac 721 gctttattaa aagaaacatt cagtcgtcta tatcaaggaa atatccttga ttttagtgaa 781 gctgcgttcg atatctacac ccaattactt gcaggcaaaa ttcgtattgg cactcaagat 841 ttgagaattg cagcgattac actgtctgtc ggtgcaacat tagtaacgcg caaccgcaaa 901 gattttgaaa aagttccagg tttgcaaatt atagattggt cgattcctta agaatttaat 961 aatttggcga tatctaacga taaacatctc agcactcatt ttcatccacc tgccgacact 1021 ctgcgcattt attagaattg aggggaaagt gcgttagtga agctgaacga agttctagct 1081 tgcgcgacaa agtcgttcgc atttattggg gttgatggga tggagtgtga tgcgcaagcc 1141 cccacctcgg gcgatagctt cgctgctgcc ttcggcagat cgcgtttctt ggggttgaca 1201 ggatggaatg cgatgcgcaa gcccccacct cgggcgatcg cttttattgg ggtgatgtgg 1261 acaaggaagt atagccgcct ttctggtctt agacatatat ttcttttttt tgtcaagccc 1321 tatttagatg cgtttctctt gtccattcca tgttgtagct cttacgtact tttcaaactc 1381 ctaccttgta agtttcttgc acttttttag tggtcgaagg aacatcgttg gtaacagttt 1441 ttgtttcttc aaaagcttct cgacctgtaa tcaatctaga gcgacgacta cgagtgagat 1501 aattccatgc ccactgaaat actactagta atttagtgtc gaactcgatt aagaagtaga 1561 tatgaacgac taaccaaaat atccaagcaa ggaaaccttg gagtttgatg aagcttaaat 1621 ctacaacagc taaatttttg ccaatcatcg ccaaactacc tacgtcattg taacgaaatt 1681 gtggtagtgt ttgaccttta agccgttttt taatgagttt agctacatac tctccttctt 1741 gtttggctac gggtgcaaca ccaggtaagg gttttccatc ttggtgagag aagttgccta 1801 aatctccgat gacaaagatg tttttataac ccttaatcgt caagtctggt tctacaatta 1861 cacgtccgga gcgatcgcac tctacacctg tgcggtttgc taaaacttgc cccatagcgg 1921 aacctttcac acctgctgcc cataatatag tttttgcggc aatttcttga attttatcat 1981 cttgcttgaa agtaacgata tcattttcaa tatttgtgac tctggtatta gtttgggtat 2041 ccacacccaa cttttgcaaa gattcttttg cgacttttga taactgtggt gccatatgag 2101 ggaggacgcg atcgccacct tgcaataata aaatcttagt ttctgaggtg tcgatgttgc 2161 ggaaatcgtc gttgagagtt ttgtatgcca 
attctgagat cgcacctgct aactctacac 2221 cagtgggacc cgctccgaca atcacaaaag tcaaccaagc acggcgtagt tcgggatcag 2281 tttctttttc tgctgcttca aatgccgaaa atatccggcg acgtatttct atggcgtctt 2341 caacaatttt caagccagga gccaatggtc tccagttatc gtgaccataa taggaatggt 2401 tagcacctgt ggcaacaatt aatgtatcgt agggtattat tctatcacgc agaataactt 2461 ctttggcttc tggatcaata tcatttactt ctcccaacaa cacttttgta ttcttgcttt 2521 tcctgaatac agctcgtaat ggtgaagaga tatcagcagg tgatagcgta cctgtcgcaa 2581 cttgatataa aagcggttga aataaatgaa agttacgttt atcaataaga gtcacattga 2641 cattcgcttt ggcaagagtc tttgctgtat acagtccacc aaagcctcca ccaatgatta 2701 caacgcgatg tagtgcatta ttctcatgta caactaccat tagaactatt tccttttgtt 2761 aagagggttg taaagtttct taacaaatat ctaacaaaat tagaataaag gctgccgttt 2821 attcgtagca tttgaagaaa tactaaaagt agtctaaatt acagtcatat cgcttttagt 2881 ctaaactcca gttcataacc caaaccgcgt aaattctctt ttattatcgc ttctaaccct 2941 gccgactcct ggaattgcta ctctaacttc tttgtcaaat gcagcatttt ctcctcaaaa 3001 gcctgatcat catcctcaac ttcctctgct ctacatatgt gcggctaaat atagtagtca 3061 tttttacttt caatgcttag attgtaaatt tttgacatat atttgagtaa cgttattaaa 3121 ctttttgcat ggaaagtgtc aatcttaaat ctttcatgaa tagatattaa agaaaataac 3181 aacaacagta ttattcagtg ttattactga gttgagttat gagaaaagtt ttcttaaata 3241 agtatataca gctattaact gctcatgagc acttatatga ctttttgtac aaaagttagc 3301 aaaatcacgt gtgtccctaa cgtaaaccct ttcacaataa acagctagat tttacagata 3361 tgtcacacta tcaaggtaaa tttgatacta gtaacccgaa agtggctcat ggtaagtttt 3421 tgacagtgtt tcaatgcaaa cttttgcaaa aaagtctaca agaagattta cctgaatcat 3481 accgccagcg tatccagatt atgttgttgg tagatgaggg gaaatcccaa acggaaattt 3541 gtcgaacttt agggtgctct ccagcaacag caaggcattg gacgcatata gcccgtactg 3601 gtatggcgca ccaatggcag gattgtccaa ttggtcgtcc aatggctgtt aatgacgaat 3661 atttgcagcg tttgcagcaa ctagtcaaca atagtcctcg tgattacggc tattcttttc 3721 aacgttggac aggaaactgg ctgagaaaac atttggcaaa ggaatttgga gttgaggtga 3781 gcgatcgcca tatccttcgc ctactcaaac agatgggatt atctactaaa ccacaaccaa 3841 aaaatgctga caaagacacg aataagactg acttggctaa 
gagttcaaaa atcttaattc 3901 gtgacctcaa ttcggctaat ccgccggatt gcactgaatt attgcctctt aatctcacat 3961 ttaggaggaa ctgattcaga tatctatggt gcacaatcca acacaagcgc acttgcttgt 4021 cttaaaacag ttgaacaatg ctctaggata ttctctttct gaggaagagt ttcaacgttg 4081 ctttcaggcg gctaaaatct ttaatccaaa agtcggaaaa ttctggcaag gaaaccacgt 4141 tgaacccggg atctacattg ttgttgctgg gaaggtaagg ttgctaaacg acgcagaaga 4201 attgatagcc tctttggagg tgggtgcgtc gtttggtgaa tttaccttat ttcccgattc 4261 tgattttcag ccttacaaag ctagggctgc gttgaattta caggtgtgct ttatctcaaa 4321 agaggtgctg ctaccactta tggctaaaca cccacaaatt cgggaacact tgcagaatca 4381 agcgcaaacg cgcaactccc gacttttggg gagaaatgac tctcaaacaa catttgtcaa 4441 tgtagacaaa ccatatccta agataaatgt ttcctcaaca tcagcagtac aaccaaagca 4501 agggaaaagg ataagcaaag cttactttcc aaatcctacg caacgagttg ggcatttttg 4561 gcaacggatg attcgacgct atccgttttt tgcccaacaa agtggatctg actgcggtgc 4621 tgcttgtttg gtgatgatat ctcgttattg ggggaaacgc ttcagtatca accgcgtccg 4681 ggatattgcc aatattgacc gcagtggcgc gtcgttgcgc gggttatcgg cggctgcaga 4741 aagtcttggg tttggtacgc gacctgtaaa agcgagtgtt gaccagttgg cgaagcagaa 4801 attacctgcc attgtccact ggttcgggaa gcactacatc gttctctatg aaattaacaa 4861 aagaaatgtc atagttgcag accccgccat tggtcaacgc accctcagcc atgcggaatt 4921 taaagcaaaa tggactggct acacactgct tctagaaccc acagccttgt taaaggatgc 4981 caaagaaaca acaactccct tttggcaatt ctttgaattg atcaagcccc attctgttgt 5041 catgctagaa gtgcttgtcg cttctatatt tatccagata tttggacttg ttactccctt 5101 atttacccag ttaattttag accgagtggt ggtgcagcgt tcggaactca cgttaacggc 5161 ggtggggttg gggttgctga tttttagcct gtttcgcgta gcgatcatgg gtttgcgaca 5221 atatctccta tttcacacgg caaataagct ggatgtggca ttaattgtgg ggtttattcg 5281 ccacactttg cgacttcctc tctcgttttt tgaatctcgt tatgttggag atattatctc 5341 tcgcgtacaa gaaaaccgca aaatccaacg cttcctttct ggtgaggctt tgtctatcct 5401 gctggacttc ctcacggttt ttatctatct aggattgatg ttttggtata gctggaaaat 5461 ggcattgctg gcgttggtga ttataccgcc ttttttcttg ctggcgttga ttgcgacacc 5521 ttttttacaa aagatttcca gagaaatctt tcatgctgtg accaatgaaa 
gtagttacct 5581 gattgaaatc atgactggtg tgcggacggt aaaatccacg gcggtagaac aaacagtgcg 5641 ttggcattgg gaggagttat tacataagga ggtaaaaact aacttctccg gacaagttat 5701 cagcaatggt ctgcaaatat ttagcaatac tattcaagcc gtagcaacta ccctcttgct 5761 atggtttgga gcatctttgg tgattcaaaa tcaattaacc attgggcaat tggtagcatt 5821 taatatgctg gtaggtcaga ttatttcacc cttccaacga ttaaccgtgt tgtggaatca 5881 attgcaggaa gtggtgattg caaccgaacg cattaatgat gtgttagatg cagaaccaga 5941 agaagatttt cagaatcaaa cacggcaatt tttaccagac attcatggac acatctgctt 6001 taaaaatgtc acctttcgtt atcacaaaga aagcgacatt aatattctac aaaatctcaa 6061 ctttgaaata caaccagggc aaatggtggc gctggtggga cgtagtggat cagggaaaac 6121 gacaatttct aagttggttt tagggttgta ccctccaaca gatggtcaga tattgattga 6181 tggactagat attacgagta tttccctgcg ttccttacgt tcgtctgttg gagttgttga 6241 tcaggacact tttttgtttg gcagcacgat tcgagaaaat atcagtttag gacatccagg 6301 ggcaacttta gaagaagtca ttgaagcggc aaatttagca ggtgctgatg agtttattaa 6361 aaagttgcct atgggttatg aaacccaaat tggtgaaggt gggggtttgt tgtctggtgg 6421 acaacgtcag cggattgcga tcgccagagc attattaggt aatccccgct tattaatttt 6481 tgacgaggcg acttcccatc tagatacaga gtcagaacgt attattcaaa ggaacttaaa 6541 cacaatcctc aaaggacgaa ctgctttagt cattgctcat cgcctctcaa cgatacgaaa 6601 tgcagatttg attttggttt tagacaaagg tgtgttgatt gagagtggaa ctcacgaagc 6661 attaatggca aagcgagaac attatttcta tctcaatcaa caacagcagc tgaacatggc 6721 ggcgtgagag gacaaggggg aattgactgc ccattccctc ataccttgct taggacttac 6781 gcagaagaga tcccccaacc cccttggaaa gggggctttt aagattcccc ccttgggcta 6841 ggggggatcc aggtttcggg ttttcagtgc gtaagtcctg ttgctgccaa aatccaaaac 6901 tttgtaagga tacatcatga caaatacgtt aaatggaaag gttcacactc acatccaaga 6961 gagtaaaaac tatcaggaaa ttctcaactc tcaaatctca gttaagccaa gtgataagcc 7021 aaaggatgat tggtctgagg tcactcaaga tttacttgat agcttacctc acgtttggac 7081 gaaaggatta ctatattttc tgttgagttt ttcggctatt gttttgcctt gggcgatgtt 7141 gtttaaagtt gatgaaacag gtactgcaag aggaagactt gagcctaaag gtaagacagt 7201 taacttagat actcttgtta ccggaactgt tgccaaaatt ccggtgaaag aaggtgaatt 
7261 agtcaaacct ggggagcctg tgttgatatt ggattcggaa ttagttaaaa ccgaattaca 7321 tcagataaaa gagaaattgg aagggcagtt aaatcgcctg tcgcagttga atgttttgaa 7381 aaaacagtta gttgtcgctt tgacaacnnn nnnnnnnggc tcaaattgag caagcgcagc 7441 agaattttag tgctctcaaa aattcttatg aactacaaaa agaagaaaaa ctgacacaag 7501 tcaaccaggc aagagaaact gttgagtcta gtcaaacagc aagtaagttt gtagagagtg 7561 gtttagcaag tgcccagcag gaggtgaaac gctatagaca gttaagtcaa gaagggattg 7621 ttgcagaagt taatgttgta caaaagcaag acatggcaaa cgaaaggcaa aagtcgtttg 7681 cacaaagtca atcagagatt tcgcaagcga aactacgctt agcagaacaa caaagtcatt 7741 atcagcagac tattcgcaaa gccaaagctg agattgaaca ggcttacttg cgcctgaagg 7801 aacagcaaag aagttatcaa actttgactc attctgggaa gctagctgtc ttaaaaagtg 7861 aagaacaact gaagaatctc gaaacagaaa tgactaccct caagtcagag attgctcaaa 7921 gtaaaagtca aattcaaagt ttacagctac agttagggca aagagtctta aaatcaccgg 7981 ttgctggtag ggtgtttcag ttaccaatac aaagggctgg ggctgtcgtg cagtcaggaa 8041 caatgattgc agaaattgct ccagaaggtg cgcctttaat tattcgggca cagataacga 8101 cagccgagag tggttcattg cagaaaaaaa tgccagtcaa gctaaagttt gatgcttatc 8161 cattccagga ttatggagtt gtagaaggag aattggtgga gatttctcct actacaacag 8221 aggtggtaac agctaatgga aaagtggcag cttataactt agaaattgcc ctgaaacaga 8281 attgtattcc caaggcagat aaatgtattt ccttgcgtcc tggagataca gcgacagctg 8341 aggtgattgt acgacaacgt cggattattg attttctgct cgatccgttt cagcagttgc 8401 agaaaggcaa ttttaaatta tagaaagtgc taggtttaag ttcgtttact taattggaat 8461 aaatatgatg ttaaatcctg ttactattac cagtgaagat gttcttcaac aagtcaagct 8521 atcttgtaaa attccggaaa ttatccaaga gattgtaacc cgtaaaattg ttgcagttgc 8581 tgctgagaag attggtattg aagtagaaga tgaagtactc cagaaaacag cagatacttt 8641 ccgattaata aacaaccttg gaagtgctga agagacatgg ctatggctcc aaaaacatca 8701 cctttccatt gaagactttg aacaaatcgc atacaccagc agcatttatg gagagttggt 8761 caaacatctg tttgcagata aaattgaacc ctatttcttt gaaaaccaac tagactatat 8821 tggcgcagtc atgtacgaag ttgtcttaga tgatgaagat ttggcaatag agctttatta 8881 cactataaaa gaaggtgaaa tgagctttta tgatgttgct cacacatatg tccaggacac 8941 
agagttacgc cgaaaatgcg gatatcgggg gacagtgtat cgtaaagatt taaagccaga 9001 aatttctgct gctgtgtttg ctgctaagcc gcctcagctt ctcaagccaa ttgtgacgtc 9061 ttcaggaatc aatttaattt ttgtggaaga aatcgttcaa ccacaattgg atgataaact 9121 acgcaataaa ataactacag atttcttctc agagtggctc aaacaacaaa ttgaccagat 9181 gaaaatcatt ccggacttat cataagatca actagttctt ttatgtccag attaataaca 9241 gagtgtttac tttaagaaaa aacctgcatc ttacttgagt aagatgcagg ataagactga 9301 gcaataaatt cagaagtaaa gcgagtgaat gtcattcttt atgaagaaga ttatcactca 9361 agctttattt ctcaaattga tagaaacgat tagtgatcta actttttagt agtagtaccg 9421 gtagtaaggg tagtatccat accccccata gttatctcca gctttaccgt cgccaatatt 9481 aatgttattg ccgctgtttc cgccgccaag aataatggta ttgccgttgt ttccgtctcc 9541 accttttgtt gcatcaagct cagtatttga tagctcattg atatagctgt cagaggttgg 9601 cagttgagca attgtaatct tagccatttg aaatgtattc ctggttcaag ttctttgttt 9661 taggattgga ctttgtcttg tcttttcctg ttaactatct tctgctattt tctttaggta 9721 ttcaaggttt gaatgactag tgaatagaca aaaaggatgc agttatatct cgtataactc 9781 tcataaatta tgctagatgg atcattgttt ttataaacat ctatcgtgtg gcaaaaagaa 9841 ggtttttatg acaacataat gaaaacattt atgtctgttt tactagataa aatttgtctg 9901 gatatttagt tgctttgctg tatttataaa gtttctttca ttcgtaacgg gtggtgatac 9961 gcccaaaagg agtgcgatcg ctccccacca tcgcactcct ttcaaatccc ctcattcttc 10021 ctctctaatt tcctcgcaac gctggttgta tacgagatat ttcaactgca cggtttaatt 10081 tcagtccgtt aacagtgttc cagccagaat gcaaagcgag gacataataa ttaccgccaa 10141 tattgccgat aattgcgccg ccagaagccc cgccttttgt atcgcagttg tggtacagta 10201 agttatcttt ttgccggaga atgctgcatc ccaagtgaac gccagcagtc atcctttctc 10261 cagctgtaaa ttcttggtat tcttttttct tggggttagg aaaatcagca gaataaccaa 10321 ctagggcaaa tttttgtgta tcccctacaa ggtcgaagga tggtagagat ttcaaaggaa 10381 taacaccgta ttttttccca agaggtctgt caagtttgag aatagcccag tcgtctacat 10441 aatctgccaa tgtcccgctt ttgaagtctg ttccagcctt agcagtggtt gcataggcga 10501 tatcattttt atcgcgtact tgaccgttaa ttaaattcgg tttaaaggcg atcgcccgac 10561 tgactttacg agtatcagga ttaactacgc agtgagaatt agtcaacacc aaatcatcag 10621 
ctatcagcgt acccgtgcaa ctgtagtctt gaccgtcagc atctattcct tcaattttac 10681 caaccgctga ccaaggtttt tttctactag tcatcgggat acggtcatct gcgccaataa 10741 tagcctgtct tgcgtcgggt ggttcatcaa tctctttgac atcacttggc gagacgggtt 10801 ttttgtcaat gactaaatta aaatctgcaa gatttgtaac tgtagtaaac ttcggcggta 10861 ctggtttaga ctgtgcttgc acagatagcg ttgcccaaac acctaacagc attgtaccca 10921 caagacctgc taaaaaagga agaataaatt gccgtttcat ttgtagttcc tcctaactcc 10981 gaaattttga atgagcgaat tttgaatttt gaactttgaa tgagcgaatt ttgaattgtt 11041 accgccattc cccttgcagg gtaaaagccg cccaagaata agggttgcgc cactcctcct 11101 gctgccacat ttctagctgt gctgctctca ttgctgctgc tggagttttg ccctgttgca 11161 atatttcttt gtagaattcg ctcatcaact ttgatgttgc ttcatagtca accttccaca 11221 aagacaccac aactcgcgcc gctcctgcat acatcaatcc tcttgtcaag cctaataatc 11281 cttctccttt gacttccttg cccagtccag tttcacaggc actcagtacc aacaattctg 11341 ctgggtagtc gagattgaat aaatcaggca gttgcaaaaa gcttttttgt tgttggttgc 11401 catttttatc tacctgggat agcactattc ctgataattc tgggttaatc tcgttgagta 11461 tgccgtgggt tgcaaaaagc aggtagcggt attgactaag ttgtgggtta gttgcccagg 11521 tataattagc atcaaaatca aaagcatgta acctttgtgg ctgggaaact agtttcataa 11581 tttcctcagc ctgttttcgc gttccttcga tccgcaaaaa attgcctatt tttatgtctc 11641 tagttgccct cattaaggca gacttatcta gctcaattgc agtctgcgat gtttgagttg 11701 ttggttggtc atttttgttg agcgcagggt ttttgacaac tcctgtcaca cgtttatctg 11761 tgttgctaaa cactggatca gcaagaatgg ctagcgtctt tggtgctttt tggcgtcctt 11821 tgagttcttg ccggagaatg tcaatggttg tagcggaagg taagctaact atttcgtggt 11881 tgagtagcaa tggttgataa ttttgtgcgc taacagatga ttgtggcaca gtcagcacag 11941 tgaaaggaat gtattgcaat gcaccatcgg cgacaattag caagcgtttt tgacccaaat 12001 cttttgcaac aggagcaagg atgagttgac taagttgatc ggcggctttt gctgtttgtt 12061 ctggggaaac tttattcatc ccagagtcac ttaagagttt gcgtaaatcc tctacttttt 12121 gctctatttc tttgcgtcct gctagttcgt agctttgcac tgaattagga gtcaccgccc 12181 aaaggtagct tcgttcttca ccgagggaat attgcaatag caacgtgtct ttatctagct 12241 gttgctgaat ttgaggcaaa gtcagggggt tgggatactt taaatttgca 
tatttcgtat 12301 tagtcctaat ttttgtttct aattcacggt attggttcag aagatttgca atttctgtgt 12361 tcagggttgc gatttgttgt tcgctgacct tgatcttaga ttcaaatagt gatgctaacc 12421 gtttctgtcg cgcttccaaa agaaactgta agcggcgttc ttctgccaaa agttttggat 12481 cagcgccttt gcggatatta gcatgagctt cggttaatag ttctattaaa ctgcgggcgc 12541 gagagcgttc gctgatgtgc agcgcttctg catcgtatcc tttggacggg tcttttttgt 12601 gcaaccgcat caacaggtcg atgtagaatt ggtaataatc ctggactgtg gcaaagtagg 12661 aactacgcaa gtcttgactc gtgactttgg tacgtatccc ttcaataatt tcagttgttt 12721 gtttaatgta ggtgagagct gcttgtaaat tacctctgtc gcgttcattc acagcaattt 12781 ggtaaagtgt tgctgcttcc tctgctttgt tacctaaagt ttttctgatg gcgagttctt 12841 gattgtggtt atcaatcgct gtttgatatt gttttaaaga gttgtatgtt ttacccatct 12901 gtctgagagc accggcttct gtatacttat ttcctaattg ttggatcaaa gattttgact 12961 gctcaaataa taacagcgct tgttgataat ctcccttttc ccggtatatt ttcccttgta 13021 aagctaagcc agcgctttct aagtgtttgt ccttctgctg acgcaccata gaaagagact 13081 tgctggcatt atttagcgct ttgtcatact cctttaatga caggtaagtt cttcccaaat 13141 caccgaatag aagtgatggt tgaactagac ctccgatctg ttgactcaga gacagcgctt 13201 tgttgtaata ttctatagct tgaggataat ctttcatttg atgataagca tcaccaatgg 13261 agctcagact gaaaacttca tttttacgat cgcctaattg ccgatataaa tccagcgcct 13321 gttgactagt gttaatactc tgctgatagt cgcctaatga cctgtagagc ttcgccatat 13381 tgctgagagt gtcggcttgt ccaccgcgta tttttagttg acgctgaata gtgagtgctt 13441 gattgaaggt atcgagtgct ttctgatact caccttgtga tgcatagata tctgcttgat 13501 tgttgagtat ttcagcttct tctacacggt tgttaatagc acgggcaacg gtaagcgctt 13561 cctctaaggc atccagtgct ttcccatact ccccaatgaa cacctggtac aaactagata 13621 tttgcctcag agccgagatt tctcgttgac gagctcctac ttttttagac agcgagcgcg 13681 cttgttgcaa gtaatcaagc gctttctgat attcgcccaa tgatcggtaa gtagatgcaa 13741 aattcaccag aatacttgct tgagatagtc gatcttgagg aacttgttgt aatagtctct 13801 gtgcttgctg tagctgttcc agcgctttct gattctcacc taatgtagtg taaacactgt 13861 tcctaaactc aaggactgta acctgtccag agatatcgtt aattcggcgg tagatttcta 13921 gagccttatc cgacgagtca agagcattct 
tattctcacc ccacttccaa tatgttgtac 13981 ccataaacca ataagtactg gcttggcttg gaggatcatt ttgttgaaga gctaaagctt 14041 gtttgtatgc ttgaattgct tcttggtgct tatcaaactg ttcttcatag attttagcga 14101 taaacttcaa agtatcagcc tctttctctg gttgtttttc agagcgatac atggaaagcg 14161 caaggttgaa attttccaaa gctttttggt tatcatccat gttcgagtat gttactccca 14221 gaccccagag ggtgtcagct tcttgagatg tcaacttctg ttcacgacta atagctagtg 14281 ctagtttttc atattccaac gctttgttgt attcgactaa aagcgagtag acagcaccaa 14341 tattctgtag cacgactact tccaatcttt tatcgggtaa ttgtcgcgcg aattttaatg 14401 ctgcttcatc tttttgaatt gcttgttgct ttgattctag agtcccctca ttagccagtt 14461 ttatcccttc attcaacagt ttaagtcctt cgtgttccag ttttttttgc tctggggtca 14521 aggtattttc tggttttgtg tttgactgct gtgctatctg caaatttgca tttcttggtg 14581 ttgctactac agagtcagat aacaagactg ctgtgagtaa taatgttaaa ctgtaacgag 14641 aaaacttaga aaatgagcag caaaaaatac accgaagctt tgcactcctg ttcatattta 14701 agcctcaata atttatcaat gacagcgata ttattgcgag cgcgaatcct gatattgatt 14761 gtttatttaa ttgtttgaga gattctctgg agaatgtaat gttgattgta taacgttcat 14821 ttacgcaagc caatatgcgt ttctgttttc gtaacaaaat tactgaaaat cttcagaatc 14881 attgggaggc agagcctcta gatgtgcatt cccatgctga gcatgggaac gagagattga 14941 tttattttct tttgagtaaa taaattaaat gggatttgcc cacgattctc gctttacctc 15001 accccggttt tgtcttgcgc caaaaccgcc cctctcctta ctaaggagag gggacgtgat 15061 agcgcagcgt ggcgttagcc atagcgtagc tttacggtga atccagcgct gcaggagggt 15121 ttccctccgc aggcgactgg tgaacccgaa ggggggtgag g // LOCUS NODE_2234_length_14981_cov_5.38262114981 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14981) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14981) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14981 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..213) /locus_tag="DP116_18850" CDS 
complement(<1..213) /locus_tag="DP116_18850" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="IS4/IS5 family transposase" /protein_id="PRJNA477356:DP116_18850" /translation="MLGAIFERFEKQSPISVMVRGLMERVFAPETIDRIFEENASSQY TRELLFSSLVELMSLVVCGIHPSVNAA" gene 309..449 /locus_tag="DP116_18855" CDS 309..449 /locus_tag="DP116_18855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002794059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18855" /translation="MTGLVLRPKVQGTWVLHQLLQDHPNSLFISFSSLADLYQVRMNT YN" gene complement(446..1536) /locus_tag="DP116_18860" /pseudo CDS complement(446..1536) /locus_tag="DP116_18860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006668373.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene 1537..1722 /locus_tag="DP116_18865" CDS 1537..1722 /locus_tag="DP116_18865" /inference="COORDINATES: protein motif:HMM:PF13592.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18865" /translation="MTYIDALQLFWLYYNYSSELDISRHRGWEYLKQMTFRLRVPRPE HRSSDPIEQENWKKNSI" gene 1701..2126 /locus_tag="DP116_18870" CDS 1701..2126 /locus_tag="DP116_18870" /inference="COORDINATES: protein motif:HMM:NF033545.0" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_18870" /translation="MEKKLDLRLKFLQARYPNAEIEIWSMDEHRIGLHPILRRIWVSE DEQAIPSVRKRYKWMWLYGFVHPESGETYWWILPTVNTEIFNRVLADFAREYGLGTDK RILLVVDQAGWHTSNDLDLPSGVDLIYLPTYSPELQLFS" gene complement(2659..4815) /locus_tag="DP116_18875" CDS complement(2659..4815) /locus_tag="DP116_18875" /EC_number="2.7.7.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409992.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polyribonucleotide nucleotidyltransferase" /protein_id="PRJNA477356:DP116_18875" /translation="MGEFEKSISFDGRDIRLKVGHLAPQAGGSVLIESGDTTVLVTAT RSQAREGIDFLPLTVDYEERLYAAGRIPGGIMRREGRPPERVTLTSRLIDRPLRPLFP SWLRDDLQIVALTLSMDELVPPDVLAVTGASIATLMAKIPFNGPMAAVRVGLVGDDFI INPTYAEIEAGDLDLIVAGSPEGVIMVEAGANQLSERDIIEAIEFGYEAVQDLIRAQQ DLIAEIGLEIPHEEPPEVDSTLENYIRDRAVVEIKKILSQFDFDKTQRDTALDAVKES IKAAITELAEEDSVRVAALANSKALDNTFKDLTKKLMRRQIIEDNVRVDGRKLDEVRP VSCGVSILPKRVHGSGLFNRGLTQVLSACTLGTPGDAQSLADDLQQDQHKRYLHHYNF PPFSVGETKPLRAPGRREIGHGALAERALLPVLPPKEQFPYVIRVVSEVLSSNGSTSM GSVCGSTLALMDAGVPISKPVSGAAMGLIKEGDEVRVLTDIQGIEDFLGDMDFKVAGT DKGITALQMDMKISGLSLNVISQAINQAKAARLHILEKMLQTIEQPRSEMSPYAPRLL TIRIDPDMIGLVIGPGGKTIKGITEETGAKIDIEDDGTVTISAVDETKAKKARIIIQG MTRKLNEGDVYVGRVTRIIPIGAFVEFLPGKEGMIHISQLADYRVGKVEDEVTVGDEV IVKVREIDNKGRINLTRLGIHPEQAAAAREAAAVNR" gene 5490..8792 /locus_tag="DP116_18880" CDS 5490..8792 /locus_tag="DP116_18880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456843.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-dependent helicase" /protein_id="PRJNA477356:DP116_18880" /translation="MAILHGTWLTQNSGCLFVWGETWRTLGTNCSKSPLNDVSKHPLA MTPLELIEWLHSRKISIVKIPHSKHVETSQTKSLKNGKTSKADSAADVMPTHSQIVAL PSYILENTYEGTVEIFPAHSATLDLPNKTPQYLQPWIVEGFCLNPEEAIKFLTSLPLS VPNGEDSFLGGDLHFWVQVARWSLDLISRGKFLPTIQRQSDHSTVAKWQALLDSAQDG TRLEKFSQLMPLACRTYQESLEDGPDEEIELAKSSSSVQINLPPEPQELLLGFLNSTI DAQVRAMVGSQPLLETRVMASLPATVRQWLHSLTTASNTCSADPFGVQRLEAVLKAWT MPLQYQMAGNNQFRTCFVLRSPESGGETHWTLAYFLQAADDSNFLVDATTIWNHPLEK FVYQNRTIEQPQETFLRGLGLASRLYPPIAASLETPYPQFCHLNPIQAYEFIKSVAWR FEDSGLGVILPPSLENREGWANRLGLKITAQTSRKKQERLGLESLLNFKWELAIGGQT ISKAEFDRLVALNSPLVEINGEWVELRPQDIKTAQTFFASRKEQMALSLEDALRISTG DTQTIEKLPVVSFEASGALQELISALTNNKAVEPLPTPASFQGKLRPYQERGMAWLSF LERWGLGACLADDMGLGKTIQFIAFLLHLKEQETLEKPTLLVCPTSVLGNWEREVKKF APTLKVMQYHGDKRPKGKTFVEAVNKHDIVITSYPLIHRDLKSLQSVSWQIIVLDEAQ NVKNSDAKQSQAVRQIESTFRIALTGTPVENRLQELWSILDFLNPGYLGNKQFFQRRF AIPIEKYGDAASLNQLRSLVQPFILRRLKTDRSIIQDLPEKQEMTVFCGLSAQQAQLY QKAVEESLAEIEEAEGLQRRGMILALLVKLKQICNHPSHYLKQDSLEQYHSGKLQRLQ EMLEIVVASGDAPAGSRPRGDRALIFTQFAEWGKLLKPYLEKQLGREILFLYGSTQKK QREEMVDRFQHDPQGPPIMILSLKAGGVGLNLTRANHVFHFDRWWNPAVENQATDRVF RIGQTRNVQVHKFVSTGTLEEKIHDMIESKKQLAEQVVGAGENWLTELDTDQLRNLLI LDRSAVIEEETE" gene 8789..9625 /locus_tag="DP116_18885" CDS 8789..9625 /locus_tag="DP116_18885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195384.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18885" /translation="MTNDTLQASREWWSQRWLDLLDSYRFKKRLERGRIYARQGNVLS IEFQNAKVLAKVQGSDPEPYKVSLSLDPFSEEQWGYVVETMSQKSMFAAKLLAGEMPQ NMEDVFTANGLSLFPFTLSDVHSKCSCPDKANPCKHVAAVYYQLGDRFSEDPFVLFQL RGSTKEKIISDLRHLRTKTVKTSETETPEVQESTQEKRFSVKIESFWQYNEPLESSLV VIVPSTGETVLDVLGSIPLAKEEESLTNLTAADVVMKYLETVYKDVSQKAVLAAMNVG GG" gene 9889..10188 /locus_tag="DP116_18890" CDS 9889..10188 /locus_tag="DP116_18890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315191.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18890" /translation="MELQTKIITVELTDGTSVKVEATQIGDRKINFQSRPFEEVTTAI ESLTKEIVEALHKVKPDRASVKFGVDIAIESGKLTALLVKGSSTANIEITLEWGQ" gene complement(10372..11253) /locus_tag="DP116_18895" CDS complement(10372..11253) /locus_tag="DP116_18895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphodiester glycosidase family protein" /protein_id="PRJNA477356:DP116_18895" /translation="MRKIWILAGVFVSATVLLLFFRTATSKETNLITKTIKYEQRNLP NSIVHILNIPSGSQFVVTPALSSQLNTVEEFAKQHQAVAIINAGFFDPVNQKTTSIIF QQGKLIANPKDNERLVNNPDLKPYLNKILNRAEFRRYLCEQTVRYSITLHSEPLQVGC QLVDALGGGPQLLPELTSVQEGFVDNANRRDALGSTKANARTAIGITGDGTIVLVMVA QKPDVAANSGMSLPELARFMQTLAVEKAMNLDGGSSSSLYYDGKTFYGKVDSKGNLVK RPVKSVLIVQETLGDFK" gene 11833..13053 /locus_tag="DP116_18900" CDS 11833..13053 /locus_tag="DP116_18900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase family 1" /protein_id="PRJNA477356:DP116_18900" /translation="MRLLIVQYGGDYREAFQRLSKNGTETYHSQKYVINSIVEISQQI EEATLLCCQTKESYNEVLQDGLRAIGAGLDPYKHKREILKLIEEQNPTHLLIHAPIPG IFNWAIQNKVRTIGLLADSFLTTSFRQKFKNYLLARLLNNKQIEWIGNHGINACLSLQ EIGVNPEKIIPYDWVHAITPEYYSPKKFRQNVNTWNLVYVGAVAEVKGVGDVIEAVAK LKTRNISVNLQVVGGGEIDYYTQRVRQLNIEDCVKFLGLMGNQTVMNLMREADIVVVP SRHEYPEACPFTIYEAFCVHTPIVASNHPVFKGNLQDGINAMIFPAGDSTALAASVEK LISNPEIYERLSEASSKSWEQLQIPVKWAEFINCWLHNSSNNQDWLFKHRLTSGMYNS WFSRDNQGYYSKPI" gene 13064..13258 /locus_tag="DP116_18905" CDS 13064..13258 /locus_tag="DP116_18905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18905" /translation="MKHTEQFLTLAQLRQKLTEVLETITPEVIIFITSYDFIRQALFS ASMALYGQAIIKNWYYKLGF" gene 13353..>14981 /locus_tag="DP116_18910" CDS 13353..>14981 /locus_tag="DP116_18910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113512.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_18910" /translation="MTCCLNPGCHNPPNPDGTMFCSNCGTGLVVLRNRYRPIKSLGSG GFGKTYLAEDIDKLKEKCVIKQFAPQVQGTGAFQKAKELFEQEAIRLQQLGDHPQIPT LLAYFEQDHRLYLVQQFIDGQDLSDEFKQQGYCNEQQIRELLVDLLNILKIVHQHKVI HRDIKPGNIIRRSSDGKLVLIDFGASKQLTATIMSEQGTTIGSFGYAPLEQMQGGEAY PASDLFSLGVTCFHLLSGIQPRGLFNKQGYGWVSSWRQHLQQSVSQEFGRILDKLLQE DYQQRYQSAQQVLQDLNPPPPPPAVPPTVHPPQSSPRSPAPSRKQEGKLKNPLLLGGV ILLLGLVGTQIYGYFRYEVFPFNPKFLITSPPNRFFLKSTLTKHIEPVAAVGISPDGK TVVSGSLDNTIKIWNLQTGELTSTLPGHNGWVVSVAISPDGNTLVSGSYDKTIKIWNL QTLELKTTLPGHTEPVLSVAISADGKTLVSGSNDKTIKIWNLQTGKLKTTLPGHTHGV VSVAISPDGKTLVSGSRDKTIKIWNLQTLELKTTL" BASE COUNT 4367 a 3104 c 3186 g 4324 t ORIGIN 1 ggcagcgtta actgatgggt gaatcccgca caccactaaa ctcattagtt ccacgaggct 61 agaaaaaagt aattctcgcg tgtactggga tgaagcgttt tcttcaaaga tacggtcaat 121 tgtttctggc gcaaataccc tttccatcaa ccctcgaacc attacactaa ttggactctg 181 tttttcaaaa cgctcaaata ttgcacccag cattgcccat atcctccgct actttttagt 241 ttgcgatacg ccccatgcgt ctgcgcttcg cgcaatcgca tggcattatt atctgataat 301 ttgtcacctt gacagggcta gtgctgcgtc ctaaggtaca aggaacctgg gtattgcatc 361 aactgctgca agaccatccc aacagcttgt ttattagctt ctcttcttta gccgatttat 421 atcaagttcg gatgaacact tataattaag ttgctctagt ttttggccac cagtggtagc 481 aagttagacc agaaatgaaa tcatgttgtt tcatcagttt ttgacaacga agcactaaca 541 gagcttctag ttcatctaaa gaatgtgggg aggtattagc aacaatctcg ttagtcagag 601 gccacaaacg ttctgcgggc tgtagttcag gtgaataagc aggtaaatag attaaatcga 661 taccttgtgg tagaaccaaa tcatgactga catgccaacc cgcttgatca atagctagaa 721 gcactcgttt gttacgccct aaaccgtatt cactggcaaa atcagccaag acgcgattaa 781 atatctgtgt gtttacttgc ggaagaatcc accaataagt ttcacctgat tctgggtgta 841 caaaaccata taaccacatc catttatact tttggctcac tgaggcgacg ggttgttcac 901 cttctggtac ccaaactcga cggaggatcg gatgaagtcc cagacgatgt tcatccatgc 961 tccaaacctc aatttcggcg tttggatatc gtgcttgcag aaatttgagc ttcaagtcga 1021 gttttttttc cagttttctt gctccacaag ctcactcatc cgatgttctg ggcgagggac 1081 
acgcagccga tatctcattt gttgtagata ctcccatccc cgatatcgac ttatacgttt 1141 gcctgtcagt tcacttaacc aatctgccac cttgcgacca ttccataacc caccatctgg 1201 tgctttacct tctagtactt gccacagttg tgcttgttgt acatctgtta agtttgattc 1261 tttccctggg ttatgatgcc gttgatcccc caacgcgctt ttcccctgtt ggttgtagcg 1321 ttttactaac tgataaatcc agattcttgt atatcctgtt agttgcgcca cctcagtcac 1381 cgttttgcct gtcgctaata accaaattat ttggtaatgg ctgcgttctg ttgcttgtgt 1441 tgtctgacga tagcaaaccc acaattcctc cgtactcatg tggttaacga tacgcggagc 1501 gtcaagcctc cggcttatcc caatccgttt gggcatatga cctacataga tgcactacaa 1561 cttttctggc tttattataa ctattcatcc gaacttgata taagtcggca tcggggatgg 1621 gagtacctca aacaaatgac atttcgtcta cgtgttccca ggccggaaca tcggtcaagt 1681 gacccaatag agcaagaaaa ttggaaaaaa aactcgattt gaggctgaaa tttctccaag 1741 ctagatatcc aaacgccgaa atagagattt ggagcatgga cgagcatcgt attggacttc 1801 atccaatcct acgtcgaatt tgggtttcag aagatgaaca agcaattccc tcagtaagaa 1861 aaaggtataa atggatgtgg ctgtatgggt tcgtacaccc agaatcaggt gaaacttatt 1921 ggtggattct tccgactgta aacactgaga tatttaatcg agttttagct gattttgcaa 1981 gggaatacgg tcttggaact gacaaaagga ttctcttagt tgttgaccaa gctggttggc 2041 atacaagtaa tgatttagac ttaccgtcag gggtcgattt aatctactta cctacctact 2101 cacctgaatt acaactcttc agttaggttg tggcctctaa ctaacgagat tgtagcaaat 2161 tattccccaa attctttaaa tgaactcgaa gaacttatgg tgcttcgttg ccaagaaatt 2221 atgaaacagc accacttaat tcaaggacta acttgctacc actggtggcc aaaaacaagg 2281 gcggcttaat tataagtaat catccgaact tgatatcagc gctagcgcta ctacagtacc 2341 tgctaggata tctgctttag tattggagaa ccattctcgc tttaaaacgg ttgtgttcaa 2401 gttttcctct gctatttagg tgtggaatat taagatatac agactgtcag ctacatcttt 2461 tacttagtga agtagcttga tgattatttg agacctcagc gttgtacctc tattacttgc 2521 aatgcgctct ataaaagaaa cgccattcat cctgttcctt gttaagagtt aagcgttccc 2581 tatttcctca atagacacca aagtggattt tagatttgcg attaatccta aatctacaat 2641 ctaaaatcca aaatctagtt accgattgac tgcagctgct tctcgcgctg cggctgcttg 2701 ttctgggtga ataccaaggc gggtaagatt aattctgcct ttgttatcaa tttctcgcac 2761 tttaacaatc 
acttcatcac caactgtgac ttcgtcctca actttgccaa cacggtagtc 2821 agccagttgc gaaatgtgga tcatcccttc cttgccagga agaaattcca caaaagcacc 2881 tattggtata attcgtgtta ctctgcctac gtaaacatca ccttcattga gctttcttgt 2941 catgccttgg ataatgatcc gtgctttttt cgctttggtt tcatccacag cagaaatcgt 3001 cactgtgcca tcatcttcaa tgtcaatttt agctccagtt tcctcagtga tacccttaat 3061 cgtcttgcct cctggtccaa tgaccagacc aatcatgtct ggatcaatcc ggatagtcaa 3121 cagacgtggg gcataaggtg acatttcact gcgcggttgt tcgattgtct gaagcatttt 3181 ctccagaatg tgcaaccgtg ctgctttggc ttggttgatg gcttgggaaa taacatttaa 3241 ggacagaccg gaaattttca tatccatttg tagggcggtt atgcctttgt ctgtcccggc 3301 aactttaaag tccatgtcgc ccaaaaagtc ttctataccc tgaatatcag tcaggactcg 3361 gacttcgtcg ccttccttaa tcaaacccat tgcagcacca ctgacgggtt tagaaatggg 3421 tacgcctgca tccatcaatg ccagtgtcga accgcacact gaacccattg aggtggaacc 3481 gttggaagaa agcacttccg ataccactcg aatcacgtaa gggaattgtt cttttggcgg 3541 aagcacaggt aatagcgctc tttctgccag cgcgccgtga ccgatttccc gacgtcctgg 3601 tgcacgtaag ggttttgttt ccccaacgga gaacggtggg aaattgtaat ggtgtaggta 3661 gcgcttatgt tgatcctgtt gcaagtcatc agccagtgat tgagcatctc ctggtgtacc 3721 gagagtgcaa gcagatagca cctgagttaa tccccggtta aacagaccgc taccgtggac 3781 tcgcttgggc aaaatgctaa caccacaaga tacgggacgt acttcatcaa gcttacgacc 3841 gtcaacacga acattatctt cgatgatttg acggcgcatg agctttttgg taaggtcttt 3901 gaaggtatta tcaagagcct tactatttgc caatgcggca acgcgaacag aatcttcttc 3961 tgcaagttct gtgatcgcag ctttaatcga ttccttgact gcatctaaag ctgtatcgcg 4021 ctgtgttttg tcgaaatcaa attgtgacag aattttctta atttcaacaa cagcgcgatc 4081 gcgtatatag ttttccagcg ttgagtctac ctctggtggc tcttcgtgtg gaatttccaa 4141 accaatttca gcaatcaaat cttgctgcgc cctaatcaga tcctgtacag cttcgtagcc 4201 aaattcaatt gcttcgataa tgtctctctc tgagagttga tttgctcctg cttccaccat 4261 gatgacacct tctggtgaac cagcaactat cagatccagg tctccagctt caatttctgc 4321 ataggtgggg ttaatgataa aatcatctcc tactaacccg acacgcactg ccgccattgg 4381 tccgttaaat ggtatttttg ccattaaagt ggcgatcgaa gcacctgtga ctgctagcac 4441 atcaggtggt accaactcat 
ccattgagag tgtcagcgca acaatttgca aatcatctcg 4501 caaccaagac gggaacaaag gacgtagtgg acggtcaatt aaacgactgg tgagagttac 4561 tctttctggc ggacgacctt cacgccgcat aattcctcca ggaattctac ctgcggcata 4621 cagtctttct tcgtaatcta ctgtcaaggg aagaaaatca atgccctctc tggcttgtga 4681 tcgcgtagcc gtcaccaaaa ctgttgtgtc cccagattct atcaacaccg agccaccagc 4741 ctggggagct aaatgaccaa ccttcagtcg aatatcccgt ccgtcaaagg atattgactt 4801 ctcaaattct cccattcaat tttttttcct tctatgcacg ctattctctc tctgtggcaa 4861 tcctaacatt tatgccctct ggctggcgtc agttacacac taaagatcaa caactttaga 4921 cttgagcacc cagtatcaag tctgttgaga gtggtggtaa gagtgttcat gagcgattgc 4981 tctcccaaat gttaagaaag atccaggtaa cgaataaccg gagcttgtgg ggtgtattat 5041 attgcatttg caatccgcta taatcagcaa tcaggcagaa tactactagc tctttgatta 5101 ggcaattgtt ttgaggaatg tattttatct tttaaaatag caagatatgg tcattttttg 5161 accttttgtt cttctactct atgaaatctt ctacgaatac caagttgcat tgtgaaaggg 5221 taatttgtcc gactcaagcg gaatgtagtg tagcttggca atctcatgcc tcaaaatgac 5281 accaccagac ggcaaattgg tataaaatag tataaaaaaa tgtgtagttt gtttgctaaa 5341 attaatgatt acttttgcaa aatcatcatt caattgatta gccttgaaaa gatttagtca 5401 taatatgaga agtcaaagag tgatgtaatt ttcgacaaaa tagaacagac tattgacctt 5461 agtttagcga acatctgagt aattgttaaa tggcaatttt acacggtact tggttaacac 5521 aaaacagtgg ttgtttattt gtttgggggg agacttggcg aactcttggg acaaattgta 5581 gtaagagtcc gttaaatgat gtatcaaaac atcctctggc aatgacacca ttagagttaa 5641 tcgagtggct gcattcacgt aagatttcaa ttgttaaaat accgcactcc aaacatgtag 5701 agacgtcaca gacaaaatct ctaaaaaatg ggaaaacgtc caaagcagac agtgcagccg 5761 atgtgatgcc aacacactcc caaattgttg ccctaccaag ttatatctta gaaaatactt 5821 atgagggaac agttgaaatc tttccagcgc attctgccac attagattta ccaaataaaa 5881 ctccgcaata cttacaaccg tggattgttg aaggtttttg cctcaacccc gaagaggcaa 5941 taaaatttct gacctcgtta cctctaagtg tacctaacgg ggaagatagt tttttgggag 6001 gagatttaca cttttgggtg caggtagccc ggtggagttt agatctcatt tcgcgtggta 6061 agtttttgcc cacaattcaa cgccaaagcg atcattctac agtcgcgaaa tggcaagcac 6121 ttttagatag cgcccaagat ggaactcgtt 
tagaaaaatt ttctcaattg atgccattgg 6181 cttgtcgaac ttatcaggaa agtctggaag atgggccaga tgaggaaatt gagttggcaa 6241 aatcctcctc atctgttcag ataaacttgc cacccgaacc tcaagaatta ctgctgggat 6301 ttcttaatag tacgatagat gctcaagtac gagcgatggt gggttcccaa cctctgctag 6361 aaaccagagt gatggcgtct ttaccagcga cggtgcgaca gtggttacat tctctgacaa 6421 ctgcgtctaa cacctgcagt gctgatccct ttggagtgca acgactagag gcggtgctga 6481 aagcttggac tatgcccttg caataccaaa tggcggggaa caaccagttt cgtacctgtt 6541 ttgtgttgcg ttctccagag tcagggggag agactcattg gactttagct tatttcctgc 6601 aagctgctga tgattccaat tttcttgtgg atgcgacaac gatttggaac caccctttag 6661 aaaaatttgt ttaccaaaat cgaacaattg aacaaccgca agagacattt ttacgcggtt 6721 tgggattggc ttcacgattg tatccaccaa ttgcagccag cttagaaact ccgtatcctc 6781 aattttgcca cctcaaccca attcaggctt atgagtttat caagtctgta gcatggcggt 6841 ttgaagatag tggtttgggt gtgattttac cgcctagttt agagaaccgc gaaggatggg 6901 caaatcgttt gggtttgaaa atcactgccc aaacatctag gaaaaagcag gaacgtttgg 6961 gtttagagag tctgttgaat ttcaagtggg aattggcgat tggtggacag acgatttcca 7021 aagcagaatt tgataggttg gtggcgctta atagtccatt ggtggaaatt aatggggaat 7081 gggtagaatt gcgtccccaa gatatcaaaa cagcacaaac ctttttcgct tctcgtaaag 7141 aacaaatggc gctgtctttg gaagatgctt tgcgtatcag tacaggagac actcagacga 7201 ttgaaaaatt accagtcgtt agctttgagg cgtctggggc attacaggaa ttaatatctg 7261 ccttgacgaa taacaaagca gttgaacctt tgcccacacc cgccagcttc caaggaaaat 7321 tgcgacctta tcaagaacgt gggatggcgt ggctttcgtt tttggaacgt tggggcttgg 7381 gtgcatgtct ggcggacgat atgggattgg gaaaaacgat tcaattcatc gccttcctct 7441 tacacttaaa agaacaggaa acactggaaa aaccaacact gcttgtttgc ccaacttcag 7501 ttttaggaaa ctgggaaaga gaagttaaga aatttgctcc aacgctaaaa gttatgcaat 7561 atcacggaga taagcgtcct aaaggtaaga cgtttgtaga agcagtcaac aagcatgata 7621 tagtcatcac tagttatccg ctcattcacc gcgatttaaa atcattgcag agcgtttctt 7681 ggcaaatcat cgttttagat gaagcacaaa atgtcaaaaa ttcagatgca aaacagtcac 7741 aggcggtacg acaaatagaa tctacttttc ggattgctct tacgggaaca ccagtagaaa 7801 atagactgca agaactctgg tcgattttgg atttcctaaa 
tcctggttat ttgggaaata 7861 agcaattttt tcagcggcgt tttgcgatac caattgagaa gtatggtgat gctgcttcgt 7921 tgaatcaatt gcgttcatta gttcaacctt ttattctgcg tcgtctgaaa actgaccgca 7981 gcattattca agatttacca gagaagcagg aaatgactgt attttgtggt ctgagtgccc 8041 agcaagctca actatatcaa aaagctgtgg aagagtctct ggctgagatt gaagaagctg 8101 agggtttgca acgcagaggc atgattttag ctttacttgt taaactaaaa caaatctgca 8161 atcatccatc gcattatttg aagcaagact cattggaaca atatcactct ggtaaactcc 8221 agcgattgca ggaaatgttg gagatagttg tcgcaagtgg cgatgctcca gcagggagcc 8281 gcccaagggg cgatcgcgct ttaattttca ctcagttcgc agagtggggt aagttgctaa 8341 aaccatatct ggaaaaacag ctaggacgag aaatcttgtt tttatacggt agtactcaga 8401 aaaaacaacg tgaggaaatg gtggatcgtt tccaacacga tccccaagga ccaccaatta 8461 tgattttgtc gctgaaagcc ggtggtgtag gattaaattt aacacgagca aatcatgttt 8521 tccactttga cagatggtgg aacccagcag tcgaaaatca ggctacagat agagtctttc 8581 gtattggtca aactcgcaat gtccaagtgc ataaatttgt ttccacgggg actttagaag 8641 aaaaaattca tgacatgatt gaaagtaaaa aacaacttgc agaacaagtt gtcggtgcag 8701 gtgaaaattg gctgacagaa ctggatacag accaactccg caacttgctc atacttgacc 8761 ggagtgcagt aatagaagag gaaacagaat gacaaatgat acccttcaag caagtcgaga 8821 atggtggtca caacggtggc ttgatttgtt agattcctat cgttttaaaa aacgtttgga 8881 acgtggaaga atttatgcgc gtcagggaaa tgttcttagc attgagtttc aaaatgcaaa 8941 ggtattagct aaggtgcaag gttctgaccc agaaccatac aaagtttctt tgtcccttga 9001 cccctttagt gaagaacagt ggggttatgt ggttgaaact atgtctcaaa aatcaatgtt 9061 tgctgctaag ctactagcag gagaaatgcc gcaaaatatg gaagacgttt tcactgctaa 9121 tggtctttcc ctatttcctt ttaccttgtc tgatgtccac agtaaatgct cttgtcctga 9181 taaagctaat ccctgcaaac acgtcgctgc ggtttactat cagttaggcg atcgctttag 9241 tgaagatcca tttgttcttt ttcaattacg cggtagcaca aaagagaaaa ttatcagtga 9301 tttacgtcac ttacgcacca aaactgttaa aacttcggag acggaaaccc ctgaggttca 9361 agagtcaaca caagagaaac gattttctgt aaaaatcgaa tctttttggc aatacaatga 9421 gccattggag tcttctttag ttgttattgt accatcaaca ggtgaaacag ttttagatgt 9481 gttggggtca attcctttgg caaaggaaga agaaagttta acaaatttaa 
ccgctgctga 9541 tgtggtgatg aaatatttag agacagtgta caaagatgtt agccagaagg ctgttttggc 9601 tgcaatgaat gttggaggag gttaagagaa caaggacgag ggagatgaaa acagttatca 9661 gttatcagtt atcagttatc aggtacaaga tgggaatttc aaccccaaag tacaccaaca 9721 cttatagaag tggtgaactc cgagtaccca gttaaaaact ggtcaggtga ttgttttgag 9781 catttgtaca gccaatttta gacttcgctt tggtatcttt ttagtattcc tgataatttt 9841 gtccaatgat ggaatgctat aaaagcagag taatccagtt gaacacacat ggaacttcaa 9901 accaagatca ttacagtgga acttactgat ggtacaagtg ttaaagttga agcgacacag 9961 ataggcgatc gcaaaataaa ttttcaatcc cgaccttttg aagaagtcac caccgcaatt 10021 gaatcgctta ccaaagaaat tgtggaagca ttacacaagg ttaagccaga tcgagcaagt 10081 gtaaagtttg gtgtggatat tgcgattgaa tctggtaaac ttaccgcttt actggtaaag 10141 ggttctagta cagcgaatat agagattacc ttagaatggg gtcaatagtt tctcacgaag 10201 ttatgtaaaa catttataat ttggtcagca atatcacata aatcaaaagt ataaaattcc 10261 tattttattt ttgaacggaa gttagtacag tttctgttaa gccttaagag ttccctgttc 10321 cctgttaaga gttccctgtt cgctgttcgc tgttcgctgt tcccaaatta tttatttaaa 10381 gtctcccaaa gtttcctgaa cgattaaaac agactttact ggtcgtttta ccaaatttcc 10441 ctttgaatca actttaccat aaaaggtttt gccgtcgtaa taaagcgaag acgaacttcc 10501 accatccaga ttcatcgctt tctcaacagc aagggtttgc ataaaacgag ccagttctgg 10561 caaggacatc ccggagttag cagcaacatc tggcttttga gcaaccatca ccaaaacaat 10621 ggtaccatca ccagtgatac caatagcagt tcgagcattg gctttggtgc taccaagagc 10681 atctcgtctg ttagcattat ccacaaaacc ttcttgcact gaggttaatt ctggcaatag 10741 ttgcggacca ccacccaaag catcaactaa ttgacaacct acttgtagag gttcactgtg 10801 gagagtgata gagtagcgga cagtttgctc gcacaggtat cgccgaaatt ctgcacgatt 10861 gagaatcttg ttgaggtaag gtttgaggtc aggattattt accaatcgtt cattgtcttt 10921 aggattagct attagttttc cttgttgaaa aataatcgac gttgtttttt ggttgactgg 10981 gtcaaaaaag ccagcattga tgatagcaac tgcttgatgc tgcttggcaa attcctctac 11041 tgtattcagc tgtgacgaca atgcaggagt aaccacaaat tggctaccag atggaatatt 11101 taagatatgg acaatgctgt ttggtaaatt gcgttgctca tatttaatag ttttggtgat 11161 taagtttgtt tcttttgatg tcgcagtacg aaaaaacaaa 
agcagcacag tcgcgctgac 11221 aaacactcct gctagtatcc atatttttct catgcattat cgcctcaact tagacgaatg 11281 gattatatcc tttgaggttg ggcgatcgct ttagcgataa cgctttgcta caatgaaatc 11341 gatagactag aaccttattc ctaaatcact atatatttat gaagacaaca ggaacacccg 11401 aaactctgtt aaaatcattt ccagaccata agcaggctaa gcgcatctaa attattctgg 11461 aaaaaaatat tagttaatga ttaaaccaaa aaatcaggaa ttcttctttg attgattttc 11521 cgagggttaa agcgattacc tcatcattgt caataagcaa ggattaatcc aatgattttg 11581 gactttatta acactcaggt gttttttttt gacaaaatgc gcttatcctg gaaaccatag 11641 gcagttgcca gaatcggatg gtacatttgt gagccactca cctgtagggg tttcccaaca 11701 atggttcagt tgatgaactt aacggtcttt aagatcacat tattaatgta atcttaaaca 11761 tctactgata aattagcagc agctacagaa tatttcaaaa aagctggact taaaaaatcg 11821 gggtagagtc aaatgcgctt attaatagta cagtatggtg gggattatcg tgaagctttt 11881 caacgattgt caaaaaatgg aactgaaacc tatcattccc aaaaatatgt cattaattct 11941 attgttgaaa ttagtcagca aattgaagaa gctaccttac tatgctgcca gacaaaagag 12001 tcctacaacg aggttcttca agatgggttg cgtgcaattg gggcaggact tgatccatat 12061 aaacataaac gagaaatttt aaaattaatt gaagagcaaa atccaacaca tttattgatt 12121 catgcaccaa ttccaggcat ctttaattgg gcaattcaaa acaaagtccg aacaataggc 12181 ttgctagcgg attctttttt aacaaccagt ttccgccaaa aatttaaaaa ttatttgttg 12241 gctcgtctct taaacaacaa gcaaattgaa tggattggta atcatgggat taatgcttgt 12301 ctatcacttc aagaaatagg agtaaatcct gagaaaatta ttccttatga ttgggttcat 12361 gctatcacac ctgaatatta ttcaccgaag aaatttagac aaaatgtaaa tacatggaat 12421 ttagtctatg ttggtgcagt tgccgaagtt aaaggtgttg gcgacgtcat agaagccgta 12481 gccaaattaa aaactagaaa tatatcagta aatttgcagg tagttggcgg cggtgaaatt 12541 gactattata ctcagcgagt cagacaatta aatattgaag actgtgtgaa attcctggga 12601 ttaatgggga atcaaacagt catgaatttg atgagggagg cagatattgt tgtagtacct 12661 agcagacatg aatatccaga ggcttgtcct tttacaattt atgaggcttt ttgcgttcat 12721 actccaatag ttgcctctaa tcaccccgta tttaaaggta atttacaaga cggtattaac 12781 gccatgattt ttcctgctgg cgactcaaca gcgttagcag catctgttga aaaacttatt 12841 tccaacccag aaatttatga 
aaggctttcc gaagcttcat caaaatcttg ggagcaattg 12901 caaatacctg taaagtgggc ggaattcatt aattgttggc tacataattc atcaaataat 12961 caggattggc tgttcaaaca taggctgact tctggaatgt acaattcttg gtttagccga 13021 gacaatcagg ggtactatag caagcctatt tgatttctga aaattgaaac acacagaaca 13081 gtttctcacc cttgctcaac tacgtcaaaa gttaactgag gttttggaaa ctattacacc 13141 tgaggtgatt atttttatca cctcttacga cttcattcgt caagctttat ttagcgcatc 13201 tatggcactt tacgggcaag ctatcataaa gaattggtat tataaactcg gtttttgaga 13261 aagactcaaa tttatttact attgtgcggt ttctactacg aatagcttgt agctaggtga 13321 taataaataa gtctgccaat aaaagcacag ttatgacatg ctgtctcaat ccaggttgcc 13381 ataatccgcc taatcctgat ggcacaatgt tttgttccaa ctgcggaaca ggactggtag 13441 tgctgagaaa ccgctaccgc ccgataaaat cattaggtag cgggggattt ggcaaaactt 13501 atctggcaga ggatattgac aaactgaagg aaaagtgcgt catcaagcaa ttcgcaccac 13561 aagtacaagg aactggcgcg tttcaaaagg caaaggaact atttgagcaa gaagcaatac 13621 gtttgcaaca gttaggggat catccacaaa ttcccacgct actagcatat tttgagcaag 13681 atcatcgcct gtatttggtg cagcagttta ttgatgggca agatttatca gatgaattca 13741 aacagcaggg ttattgcaac gagcagcaaa ttcgggaatt attggttgat ttgttaaaca 13801 ttcttaaaat agttcatcaa cacaaagtta ttcaccgcga tatcaagcca gggaatatta 13861 ttcgtcgcag cagtgatggc aaattggtgc tgatagattt tggtgcttcc aagcaactga 13921 cagcaacgat catgtctgaa cagggaacaa ccattggcag ctttggttat gcaccattgg 13981 aacaaatgca aggtggcgaa gcgtatccag caagtgattt attcagtttg ggcgtaactt 14041 gctttcattt gctgagtggt attcagccta ggggactgtt taataaacaa ggttatgggt 14101 gggtgtcctc ttggcgacag catttgcagc aaagcgtgag tcaggaattc ggacgtatcc 14161 tggataagtt gctgcaagaa gactaccagc agcgttatca gtcagcgcag caagttttac 14221 aagatttaaa tccaccgcca ccgccgcctg ctgtacctcc gacagtacac ccgcctcaat 14281 cgtcaccaag atcgcctgca ccttcacgca aacaggaagg caagttaaaa aacccgttgc 14341 tgctgggtgg tgtcatcctg ttgctggggt tggtaggaac tcaaatctat gggtattttc 14401 gatatgaagt atttccgttt aatccgaaat ttctcattac cagtccacca aatcgtttct 14461 tcctgaaatc taccctgact aagcatatcg agccagttgc tgccgtcggc ataagcccgg 14521 
atggcaagac tgtggtaagt gggagtctcg ataacactat caagatttgg aatctgcaaa 14581 caggcgaatt gacatctact ctgcctgggc ataacggctg ggttgtgtcc gtcgccataa 14641 gtccggatgg caatactttg gtgagtggga gttacgacaa gactatcaag atttggaatc 14701 tgcaaactct tgaattgaaa actacacttc ccgggcatac cgaaccggtt ctgtccgtcg 14761 ccataagcgc ggatggcaag actttagtga gtgggagtaa cgacaagact atcaagattt 14821 ggaatctgca aacgggtaaa ttgaaaacta cacttcccgg gcatacccac ggggttgtgt 14881 ccgtcgccat cagcccggat ggtaagactt tggtgagtgg gagtcgcgac aagactatca 14941 agatttggaa tctgcaaact cttgaattga aaactacact t // LOCUS NODE_2235_length_14977_cov_5.01829514977 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14977) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14977) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14977 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 314..1468 /locus_tag="DP116_18915" CDS 314..1468 /locus_tag="DP116_18915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867810.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_18915" /translation="MSNSLHLIRLGANGIERMQTYFVKALQAVQLSWQGISRNSLHGR VQRFVCLFLVGSVLSVAIAACSGSSLANKADVKLRLVSFSVTKAAHDQIIPKFVQKWK KEHNQNVTFEQSYGGSGAQAAAVIAGSQEADIVHLALPLDVNKIQQAGLIKSNWEIKA PRNGIVSRSVAAIVTREGNPKGIKTWADLAKDGVQVIAANPKTSGIAIWEFLAFWGSV TQTGGDEATALDYVTKVYKNIPVLTKDAREASDLFFQQKQGDVLVNYENEVILAEQTG PKLPYIVPQVNISIDNPVTVVDKNVDKHGTREVAQAFVDFLYSTEAQREFAKLRYRSV NPTVSQEVKSQYPPIETLFTSQDLGGWEIIQKKFFADGATFDKIQAAKKA" gene 1559..2557 /locus_tag="DP116_18920" CDS 1559..2557 /locus_tag="DP116_18920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411069.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate-binding protein" /protein_id="PRJNA477356:DP116_18920" /translation="MSFCRKLKDRFLLAFLMLSTYSVTACSSQGNKTQVSVDGAAVGF PISLAVAEEYGKVKPEAQVSVASSGTGGGISKFCAGDIDIVGASRTIKDEEIARCKSK KIEFIELPIALDGIAVIVNRQNNFAKCLTIKELDKMWNSKADGKVLTWNQVNPKFPKQ NLKLYAPASDTGTFDYFSQAVSKKAKNSRTDYTPSHNQNLLVQGVSGEASALGYVGIS YYIQNQDKLNLVAVKSPTGECIKPVPVDNVVKNVYTPLSRPLFIYVSKKSLDTKPAVK EFVDFYLENSWKWVDSVGYVALPDEAYLKVKRKLATGETGTKFKKAKPGQPITNFI" gene 2668..3612 /gene="pstC" /locus_tag="DP116_18925" CDS 2668..3612 /gene="pstC" /locus_tag="DP116_18925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011316944.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease subunit PstC" /protein_id="PRJNA477356:DP116_18925" /translation="MQSRNYSDNGFDPNSRNSLEKKASEDIQDKIVAAILFTCALVSV LTTFGIVVIIFQVAFEFFQEVSFADFFLDTKWTPLFATKHFGIWPLINGTFLTTAIAM AVAIPLGLSSAIYLSEYAQPKVAAILRPAVELLAGIPTVVYGYFALLFVTPLLRNFLP LEIFNALSAGLMMGIMITPTVGSISLDAIRAVPRSLREGAYALGITKLETIFKVVLPA ALSGITASIILGISRAVGETMTVLIAAGQQPKLTINVAESVETMTTYMAQISGGDSPR GSLNFNTLYAVGAVLFLLTLALNIGSYFISNRFKEKYD" gene 3618..4511 /gene="pstA" /locus_tag="DP116_18930" CDS 3618..4511 /gene="pstA" /locus_tag="DP116_18930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411067.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease PstA" /protein_id="PRJNA477356:DP116_18930" /translation="MTTTYQRDNSFDSAAEFTDNIESREKTGKVFEILFLIGLLIGLF VLGLLLFDVLRDGLGRFLTPGFLTETPSRFPDQGGIRPAIISSILLGIIVIFVTVPIG VGSALYLEEYAPKAWWTAIIEINISNLAGVPSIVYGLLGLGVFNYLLGFGPALISGAL TLSLLSLPVIIVTAREAIRAVPDSLRHASYGLGVTKWQTITNHILPYAVPGILTGVII SVSRAIGDAASLIVVGAVGFLTFNPGLFNRFMALPIQIYSYITRPEPGFANAAAATII VLILLVLALNGIAIYIRQRFS" gene 4794..5624 /locus_tag="DP116_18935" CDS 4794..5624 /locus_tag="DP116_18935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411066.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_18935" /translation="MTYNNSKNQLNNVTLNPEDNAVFNVEGVKVYYGSSLALVDVYMK IPEKQIIAFIGPSGCGKSTLLRCFNRMNDLITGARVEGRLIYRDRNVYDPNINSVKLR RQVGMVFQKPNPFPKSIYENIAFGPRANGYKGNLDELVENSLRRAAIWDEVKDKLKQK GTALSGGQQQRLCIARAIAMKPDVLLMDEPCSALDPISTRQVEELCLELKEQYTIIMV THNMQQATRVADWTAFFNTETDQHSKRRGKLVEFSPTEQIFNSPQTKEAAEYISGRFG " gene complement(5877..7235) /locus_tag="DP116_18940" CDS complement(5877..7235) /locus_tag="DP116_18940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PrsW family intramembrane metalloprotease" /protein_id="PRJNA477356:DP116_18940" /translation="MSHAFLRLVSRGGETSYSLLTTSEVIIGRDPSCQIILNSNDFGV VSRRHATIRPSTTPEGTTSYLLCDLNSANGTYLNGQILQGCQELHAGDRLTFGTSGPE FVFEYQHNIQSLSKPQPPVTQTPDAVDANSTKFTSTDTDASWSQMLPILFRPKNLTRK AYLIPGIITVIFVVLLFFVQGFLYQILLGAYLAGAALYFVYQLCGKHKPWWVLIASAV FTILILSSPVLSLFIFVFRNILPGKVPDNTSTIAFTELFIRYFFGAGLLEEFLKALPV FGFYLLGRAFASPMRERIGVGEPLDGILIASASATGFTLFETLGQYVPGTIAEVAQKM GADAGWRAGLELLIPRILGDVAGHMAWSGWFGYCIGLSVLKPRQGWQILALGYLSAAG LHGLWNSCTGLSNTLGIFVVLLLIVVGGISYALLGAAIVKARALSPTRSQNFATGFHQ KN" gene 7611..7973 /gene="rplS" /locus_tag="DP116_18945" CDS 7611..7973 /gene="rplS" /locus_tag="DP116_18945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653816.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L19" /protein_id="PRJNA477356:DP116_18945" /translation="MNTQEIIRSIEAEQLKSNLPEIFVGDTVKVGVKIKEGEKYRVQP YEGVVIAKRNGGINETITVRRVFQGVGVERVFLIHSPRIDNIKILRRGKVRRAKLYYL RKRVGKATRIKQRFDRAL" gene 8122..8194 /locus_tag="DP116_18950" tRNA 8122..8194 /locus_tag="DP116_18950" /product="tRNA-Trp" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:8155..8157,aa:Trp,seq:cca) gene 8388..8609 /locus_tag="DP116_18955" CDS 8388..8609 /locus_tag="DP116_18955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999422.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecE" /protein_id="PRJNA477356:DP116_18955" /translation="MTKKNEAEITQASNGSGITNFFNGLKEEFDKVVWPSRKQLVSES AAVLLMVTLSASLIFLVDRFFSWAAQQVF" gene 8609..9250 /locus_tag="DP116_18960" CDS 8609..9250 /locus_tag="DP116_18960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcription termination/antitermination protein NusG" /protein_id="PRJNA477356:DP116_18960" /translation="MSFATDGDYNATLQSEDAADTASDASHEARWYAVQVASGCEKRV KTNLEQRIQTFDVAEKILQVEIPHTPAVKIRKDGSRQHTEEKVFPGYVLVRMMMDDDS WQVVRNTTHVINFVGAEQKRGTGKGRGHVKPMPLSHTEVERIFKQTSEQEPVVKIDMA TGDKIVVLSGPFKDFEGEVIEVSPERSKLKALLSIFGRDTPVELEFNQVQKQS" gene 9257..9682 /gene="rplK" /locus_tag="DP116_18965" CDS 9257..9682 /gene="rplK" /locus_tag="DP116_18965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457687.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L11" /protein_id="PRJNA477356:DP116_18965" /translation="MAKKIVAVIKLALNAGKANPAPPVGPALGQHGVNIMMFCKEYNA KTSDQVGTVIPVEISVYEDRSFTFVLKTPPASVLITKAAKIDKGSSEPNKRKVGSITR EQLRQIAQTKLPDLNANDVEAAMNIIEGTAKNMGVTVTD" gene 9780..10496 /locus_tag="DP116_18970" CDS 9780..10496 /locus_tag="DP116_18970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L1" /protein_id="PRJNA477356:DP116_18970" /translation="MTKKLSRRIQALLEKVEDRDYTPIEALSLLKETATAKFPEAAEA HIRLGIDPKYTDQQLRTTVVLPKGTGQTVRVAVIARGEKVTEATNAGADIVGSEELID EIQKGRMDFDKLIATPDIMPQVAKLGKLLGPRGLMPSPKGGTVTFDLGSAIAEFKAGK LEFRADRTGIVHVMFGKVSFSPEDLLVNLKALQETIDRNRPSGAKGRYWRTVYVSATM GPSIRVDINGLRDLKLTEAA" gene 10839..11390 /locus_tag="DP116_18975" CDS 10839..11390 /locus_tag="DP116_18975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740589.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L10" /protein_id="PRJNA477356:DP116_18975" /translation="MPRTIEDKKAIVTDLKETLSQSQLALVIDYQGLTVAEMTDLRRR LRPSGTVCKVTKNTLMGIAIQDDEKWQVLSELLKGSSAFLLVKDDFSAAIKAYQDFQK ASKKTELRGGVMEGRLLQEPDVKALGDLPSKEQLMAQIAGAINALATKIAVGINEVPS SLARALQAVADQEKGGESEESAS" gene 11455..11847 /locus_tag="DP116_18980" CDS 11455..11847 /locus_tag="DP116_18980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197123.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L7/L12" /protein_id="PRJNA477356:DP116_18980" /translation="MSAATDEILDKLKTLSLLEAAELVKQIEEAFGVSAAAQVGAVMA VPGGAAPAAAEPVEEQTEFDVILESVPADKKIAVLKVVRELTGLGLKEAKDLVEAAPK PVKEAIAKEAAEQAKKQLEDAGGKVTIK" gene 12621..13802 /locus_tag="DP116_18985" CDS 12621..13802 /locus_tag="DP116_18985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744944.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_18985" /translation="MGLSPLDLLDITFRSLGKNPLRSGLTALGVFMGVAAVSATLAVG NISRAVIAQQLAKRGAPQASVYPKWDSGRRTTSLKLEDMEFLQQRLVGLQAISAFNWA GSMPTIFQDKELTPPMSPVSQGFLLTSGKTLVSGRFFTVEDFARYRPVAVIDQLLAEQ LFGGQKALGKMIYAGDRPYVVVGVVTTTLDENAPPNGQLYIPISVYNALTGSRDIGSI QMRPYKLEDVENLSKQAEELLKQRFPGQKFKSWNNVSDILEQQKTLEMASQGLAVVGV IALLVGGVGIANIMIASVSERTAEIGLRRAIGATQQEIMLQFILEATLLSLIGGTVAL GVVHGLTIVVTNTFNLPYEFDGSIATLALGSALLVGVGASLTPALRASQIDPVKALRS E" gene complement(13811..14089) /locus_tag="DP116_18990" CDS complement(13811..14089) /locus_tag="DP116_18990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18990" /translation="MSTRKKTANQKSFLTEILHPPGMRKAHAKGVSEASPFGAAVALA IGDTPVAHGGNYATCYKSAQPPNAVAPQDRAGSPFSTRGTASPVPKAK" gene 14499..14813 /locus_tag="DP116_18995" CDS 14499..14813 /locus_tag="DP116_18995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017321722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_18995" /translation="MKHYRNEWIEEWCQENGWTDLFEERCNNYWAFPPGAVMPEPIPT HTLRLIKTQKGLTSQEKFWSVSAVFGTIVAVVSTYLFKCPIPLVLAFAFDAVTAAQLE VE" BASE COUNT 4431 a 3028 c 3376 g 4142 t ORIGIN 1 gacgagagcc atatgtagag aacctgcctg ccctgtgcca gttgcgtaag tcctgcgatc 61 gtgaattatt caatacgtat gttcttgtaa gcctagagga aatcaaaggc aatactcttg 121 catagagtcg tgcaaagcgt tatgcaacgc cagcgctcgc tctgttgcgc ggcaatcatt 181 tcgatcatga attattcaat acgtaacaaa acgtaaaggc aaagaaacat cactgttcta 241 ttaccctata attctgatca cggtaatatc gctcaaacta ccgtacattc agggggaaat 301 gttgatgaga ttgatgagta actcgctaca tttaataaga ttaggggcta acggaattga 361 gcgaatgcag acttatttcg taaaagcttt gcaggctgta caactatcgt ggcaaggaat 421 aagcagaaat tcgttgcatg gacgtgtgca aagatttgtg tgtctatttt tagtaggcag 481 tgttttgagt gtggcgatcg ccgcctgttc tggaagcagt ttagcgaaca aggcagatgt 541 caaactcaga ctcgtttcct tctctgttac caaagctgct catgaccaaa tcattcctaa 601 atttgtacaa aagtggaaga aagaacataa ccaaaacgtc acctttgagc agagttacgg 661 gggttccggt gctcaagcag cagcagtcat tgcaggttcc caagaagcag atatcgtaca 721 cctggcactt cccctagatg taaacaaaat tcagcaagca ggtttgatta aatcaaattg 781 ggaaatcaaa gctcctagaa atggtattgt tagtagatct gtagctgcga ttgtaactcg 841 cgaaggcaat cccaaaggaa ttaagacttg ggcagacttg gcaaaagatg gcgtgcaagt 901 gattgcggct aacccgaaaa cttctggtat tgctatctgg gaattcctag ctttttgggg 961 ttcggtgact caaacaggag gtgacgaagc gacagcgcta gattatgtca ccaaagttta 1021 taaaaatatt cctgtactga caaaagatgc tcgtgaagct agcgatttat ttttccaaca 1081 aaagcaggga gatgttttag ttaactacga aaacgaggtg attttggcag aacaaactgg 1141 accaaaactg ccttatatag tcccacaagt aaatatttcc attgataacc ctgtaaccgt 1201 agttgataaa aacgttgaca aacatggtac aagagaagta gcacaagcgt ttgttgattt 1261 tctttactca acagaagctc aacgggaatt tgcaaaatta agatatcgtt ctgttaatcc 1321 cactgttagt caagaagtaa aatcgcagta tcccccaatt gaaactttat tcacatctca 1381 agatttaggg ggttgggaga ttatccagaa aaagtttttt gcagatgggg caacttttga 1441 caagattcaa gctgccaaga aagcatgagt 
gccaagaaag catgagtgtt agtacggtag 1501 aaagattttc ctaccaattt tgtcttcaat cttcttagaa atacaagata tttggcttat 1561 gagcttttgt cgcaaattga aggatagatt tttattagca ttcttgatgc tgagtaccta 1621 cagtgttacc gcttgtagta gccagggtaa caaaactcaa gtgagtgttg atggtgctgc 1681 tgtcggtttc cctatttctc ttgcagttgc agaagaatac ggaaaggtga aaccggaagc 1741 tcaagttagt gttgcttcaa gtggtactgg tggtggaatc agtaagtttt gtgctggcga 1801 tattgatatt gttggtgctt ctcgtactat taaagatgaa gagattgcca ggtgtaaaag 1861 taagaagatt gaatttattg agttgcctat agctttagac ggaattgctg ttatcgtcaa 1921 tcgtcaaaat aacttcgcca aatgtttaac gattaaagaa ctcgacaaaa tgtggaattc 1981 caaagcagat ggcaaagtat tgacttggaa tcaagttaat cccaagtttc ctaagcaaaa 2041 tctgaaactc tatgctcccg catctgatac tggaacgttt gattatttta gtcaagctgt 2101 gagtaagaaa gccaaaaata gccgtacaga ctacactcct agccacaatc aaaatcttct 2161 tgttcaaggt gtttcaggtg aggcatcagc tttaggttat gtagggatat cttactatat 2221 tcaaaatcaa gacaagctta atctagttgc tgtaaaaagt cccacaggag aatgtataaa 2281 accagtacca gtagacaatg tggtgaaaaa tgtctacaca ccattgtctc gccctctgtt 2341 tatttatgtc agtaaaaaat ctttagatac caaaccagca gtaaaagaat ttgtcgattt 2401 ttacctcgaa aattcttgga agtgggtaga tagtgttggt tatgtcgcat tacctgatga 2461 agcttatctc aaggtaaaac gaaaattggc tactggtgaa actggtacga aattcaaaaa 2521 agcaaaacca ggtcagccaa tcacaaattt tatctagtct accatccttg agatttcgtt 2581 ataaatcggt ggacataatg aagtagcagg aaacaatatt tttaccctta gtgtaggtgt 2641 cttctactta gtcaaaaatc actgtctatg caaagcagaa attattccga taacggtttt 2701 gatccaaact ccaggaactc actggagaag aaagcatctg aagatatcca agacaagatt 2761 gttgcagcaa ttttatttac ttgtgcttta gtttctgtac taactacctt tggtattgtt 2821 gtcatcatct ttcaggtagc atttgagttt ttccaagaag tatcgtttgc tgacttcttt 2881 cttgatacga agtggacacc tttatttgca acaaagcatt ttggtatttg gcctttaatt 2941 aatggcactt ttttgacgac agctattgct atggcagttg ctattccttt aggtttatcc 3001 tctgctatat atttaagcga atatgctcaa cccaaagtag ccgcgatttt acgtcctgcg 3061 gtggaattgc tggcgggaat acccacggta gtctatgggt actttgcgct gttgtttgtc 3121 acaccattgc tgcggaattt tctccctcta gaaatcttca 
atgctttgag cgcggggtta 3181 atgatgggga tcatgattac ccctactgtt ggttccatca gcttagatgc tattcgagca 3241 gttccacgtt ctttacgaga aggagcttat gctttaggca taactaaact ggaaaccatt 3301 tttaaagtcg ttctcccagc agcgctttct ggaattacag cctcgattat tctgggtatt 3361 tcgagagctg ttggtgaaac catgactgtt ctcatcgccg ccggacaaca gccaaaactg 3421 actattaacg ttgcggagtc agtagaaacg atgacaactt atatggcgca aatttctggt 3481 ggagatagtc cccgtggtag tcttaatttc aacactttat atgctgtcgg cgctgttttg 3541 tttttactaa cgctggcttt gaatattggg agttatttca tttctaatcg ctttaaagaa 3601 aaatacgatt aataggcatg actacaactt atcaacgaga caattctttt gattctgcgg 3661 cagaatttac tgacaatatt gagagtagag agaagacagg gaaagtattt gaaatacttt 3721 ttttgatagg gttgctgata ggtttatttg tcctagggtt gctacttttt gatgtcttac 3781 gagacggatt aggcagattt ctcacacctg gtttcttgac ggaaacccct tctcgttttc 3841 ctgatcaagg tggtatccgt cctgctatta tcagcagtat tcttttggga attatcgtta 3901 tttttgtgac tgtaccaatt ggtgtagggt ctgctttata tctggaagaa tatgcaccaa 3961 aagcctggtg gacagcgatt attgagatta atatcagtaa tcttgcagga gtcccctcta 4021 ttgtctatgg attgctgggt ttaggagttt tcaactactt acttgggttt ggtccagctt 4081 tgatttccgg agctttgact ttatctttgt tgtctttacc agtcattatt gtgacagcta 4141 gagaagccat tcgcgcagtc ccagattccc taagacatgc ttcctacggc ttaggtgtga 4201 ctaaatggca aactatcacg aaccacatct taccctatgc tgttcccggt attcttacag 4261 gagtgattat ttccgtatct cgcgccattg gtgatgcagc atctctgatt gtggtaggcg 4321 ctgtgggttt cctcaccttt aaccctggtt tgttcaacag atttatggca ttacccattc 4381 aaatttacag ttacatcact cgtccagaac cgggttttgc taatgcagca gcagcaacaa 4441 ttattgtgtt gatactctta gttttagctt taaatggtat agcaatttat atcagacaac 4501 gcttttcata agtagcattt acgagttgcg tacgctccca agtattttgc atttctccag 4561 attgcgaatt cctagtaagg cagctttata cacgcattgc attcataaac aggatttttc 4621 ctctcttttt ctttgtgttc tctgcgcctg ctgggaactt tgcacgagga acatgcacat 4681 cgtaaattcg ctctggtgcg gttttttgtt ctcaggtatt agcagcaatc atttttggga 4741 atgattactc actcgaaaaa ttttgccatc ataaatatta ggagacaggc aatatgacct 4801 acaacaacag taaaaatcaa ctaaataatg tcacactcaa cccagaagat 
aatgccgtat 4861 ttaatgttga aggtgtgaaa gtctactatg gcagttctct ggctcttgtt gatgtttaca 4921 tgaaaatccc tgaaaaacaa attattgctt ttatcggacc ttcaggatgt gggaaaagca 4981 ctctactacg ttgctttaac cgaatgaatg atttaatcac tggagctagg gtagaaggta 5041 ggctgattta ccgagatcgc aatgtttatg atcccaacat caattctgtc aaattacgac 5101 gacaagtagg aatggttttt caaaaaccaa atccttttcc caagtcaata tatgaaaata 5161 ttgcctttgg accgcgtgct aatggttata aaggtaatct tgatgaatta gtagaaaatt 5221 ccctcagacg tgctgcaatt tgggatgaag tcaaggacaa actgaaacaa aagggtactg 5281 cattgtctgg gggacaacag caacgtcttt gcattgcacg tgcgatcgcc atgaagccag 5341 acgtcttatt aatggatgaa ccttgctccg ccctcgaccc aatttctacc cgtcaagttg 5401 aagaactctg cttagaactt aaggagcaat ataccattat tatggtgact cacaatatgc 5461 aacaagcaac aagagtcgca gattggacag cgtttttcaa cacagaaact gatcagcaca 5521 gcaaacgtcg cggaaaatta gttgagttca gtcctacaga acaaatattc aattctcctc 5581 aaactaaaga agctgcggag tacatcagtg gacgttttgg ttgatgataa atcacagttc 5641 actcaaaaga gtttaggcaa aaacagggca ggaactaggg tataggggta taggggaaag 5701 aaaaatcaaa gaaaggagtt ttttgatgag aggggaaacc aattcacgtg aaaagcgtat 5761 tcttccccta cacccttaca acgccaggtg ctaactcctg cggagacgct gcgcgaacaa 5821 gtcgggaaac ccgcccaacg cactggctcc ccttacacgc ttacaccctt acacccctag 5881 tttttctgat gaaaccccgt tgcaaagttt tgcgatcgcg tcggtgacaa agcacgcgcc 5941 ttaacaattg ctgccccaag caaagcgtaa gatatacccc caaccactat caacaacaat 6001 acgacaaaga ttcccaaagt atttgataat cccgtacaac tattccacaa gccgtggagt 6061 ccagccgcac tcaagtatcc caaagccaaa atttgccaac cttgacgcgg ttttagcaca 6121 ctcaagccaa tacaatagcc aaaccatcca ctccaagcca tgtgacctgc gacatcacct 6181 aaaattcgtg gaatcagaag ttctaaccct gctcgccaac cagcatctgc tcccattttt 6241 tgtgcaactt ctgcaatagt accaggtacg tactgaccta gagtttcaaa cagggtgaag 6301 ccagtcgcag aagcgcttgc gatcaaaatt ccatctagag gttctccaac gccaatccgt 6361 tcacgcatgg gagatgcaaa tgctcttcct aaaagataaa atccaaaaac tggtaaagcc 6421 ttgagaaatt cttccaataa cccagcgcca aaaaagtacc ggataaacaa ctcggtaaaa 6481 gctatagtac tagtattatc aggcactttg cctggaagaa tgttacgaaa tacgaaaata 
6541 aatagactta atacaggact acttaaaatc aatatagtaa aaacagcaga tgcaataagt 6601 acccaccaag gtttatgctt gccacacaac tggtaaacaa aatataaagc agccccagcc 6661 aagtaagcac ctaatagtat ttgatataaa aatccttgta caaaaaacag gagtaccacg 6721 aatatcactg tgataattcc tggtattaag taagccttgc gagttaaatt ttttggtcga 6781 aaaagaatag gaagcatctg cgaccaactc gcatcagtgt ctgtgcttgt aaatttcgtg 6841 gaatttgcat ctaccgcgtc cggagtttgg gtgactggtg gttgtggttt actcaatgat 6901 tgaatgttgt gttgatactc gaagacaaat tctggaccac tagtgccaaa cgtcaggcga 6961 tcgcctgcgt gcaattcctg acacccctgc aaaatctgtc catttaaata tgtcccattg 7021 gcactattca aatcacacag caggtagctt gttgtgcctt ctggcgttgt ggatgggcga 7081 atagtcgcat gacgccgcga aaccactcca aaatcatttg aattcaagat aatttggcaa 7141 ctaggatcgc gtccaataat aacctcactc gtcgtgagca gtgagtaaga agtttctcca 7201 cctctggaga ctaaccgcag aaatgcatga ctcattcaga caacttccct cgcccttttt 7261 atagtaatta atagtacaaa aactattctt tattgagatc aaaatatgta gactgttgct 7321 gtatgacctt tgttgacttc atggcgatgg taaatcagaa gtggtggatt tactggtaac 7381 ccaactggag atccctatgc ctgcggtacg gctacgccga gaggatatct cattgacaaa 7441 aactatgttg tatgcatcag gaacgctttg cgctgacgcc agtcaccaac gtgtcccaca 7501 cgttggtgca tagggctggc ttactgataa ctgagtaatt ggtttaaaat gactagactg 7561 taagatagtt atcgatttat tattgttgat tgtgagtttt ttcctggact atgaataccc 7621 aagagatcat ccgctccatt gaagcggaac aactaaaatc taatttgcct gaaatctttg 7681 tgggcgacac agtgaaagtt ggagtcaaaa ttaaggaagg tgagaaatac cgcgtacaac 7741 cctatgaggg agttgtgatt gctaagcgca atggtggtat aaatgaaact attactgttc 7801 gtcgtgtttt ccaaggtgtg ggagtggaac gagtgttcct catacattct cctcggatag 7861 acaacatcaa aatattgcgt cgtggtaagg ttaggcgtgc taaactttac tatttacgta 7921 aacgtgttgg taaagctacc cgaatcaagc agcgttttga ccgcgcctta taattcaaca 7981 gagcaagatg tgggagaagg gtttaacaaa aagcggctcc cgccgctcat ttgttccctg 8041 aatcaaggtc aaagaaaaaa ttgccaactt gtgttaaaat aggttagatt aagtgtaata 8101 actgaataaa ggtcagcctg tgcgctctta gttcagttgg tagaacgcag gtctccaaaa 8161 cctgatgtcg ggggttcaag tcctccaggg cgcgctgtta ccaaaaataa agcccgaaaa 8221 
ttatgcaact tatgctaaaa tagcagcaat gttgcaaatt tcgggtaaaa tttttgtgtt 8281 tgcagcttat tgctttggac aagttaagtt tgagtactaa atcttaacaa aaggtagaaa 8341 ttaacggcca acagttgttt gtatcaggaa tgagggagag aggcgacgtg actaaaaaaa 8401 acgaagcaga aataacacaa gcaagcaatg ggtctggcat aacgaacttt ttcaatggat 8461 taaaggaaga gttcgataaa gtcgtctggc ccagtcggaa gcagctagtg agtgaatcag 8521 cggctgtact gctaatggta actctctccg catctttgat atttttggta gatagattct 8581 tttcttgggc agcacaacag gtgttctgat gagttttgca acagacggag attacaatgc 8641 gacgctgcag tcagaggatg ccgcagatac agcgtcagat gcttctcacg aagcccgctg 8701 gtatgcagtg caagtagcct ctggctgtga gaagcgtgtc aagacaaact tagagcagcg 8761 cattcagacc tttgatgtag ctgagaaaat cctccaggtg gagattccac atacaccagc 8821 cgtaaaaatc cgtaaagacg gtagtcgcca gcatacagaa gagaaagtct tccctggcta 8881 tgtgctagtc aggatgatga tggatgacga cagctggcaa gtagtaagaa acaccactca 8941 tgtgattaat ttcgtggggg cagagcaaaa acgcggcact ggcaagggtc gcggtcacgt 9001 gaaacctatg cccctgagtc atacagaagt agaaagaatc ttcaaacaga ccagcgaaca 9061 ggagccagta gtcaaaattg acatggctac aggtgataag atagtcgtgc tttctggtcc 9121 gtttaaagat tttgaaggtg aggtgattga agtcagtcca gaacggagta aacttaaagc 9181 cctactttcg atttttggac gggatacacc agtagaattg gaatttaatc aggttcagaa 9241 acagagctaa ataagaatgg cgaagaaaat cgtagcggtc attaaattgg ccctaaatgc 9301 tgggaaagcc aacccagcac cgccagtggg tcccgctttg ggtcagcacg gcgttaacat 9361 catgatgttc tgcaaggagt acaacgccaa gacatctgac caagttggaa cggtgattcc 9421 tgtagaaatt tcggtctatg aagaccggag ttttacattt gtcctcaaga ctcctccggc 9481 atcagtcctc attaccaagg cagctaaaat tgacaaaggc tccagtgaac ctaacaaaag 9541 aaaagttggg tcaattacta gagagcaatt gaggcaaatt gcccaaacta aattgcctga 9601 ccttaacgcc aacgatgtag aagcggcgat gaacatcatc gaaggcaccg cgaaaaacat 9661 gggcgtaaca gtcacagatt agtcatgagt cattagtaaa agacaaatga caaaggacga 9721 ataacaaaat tattcggggg agaggtaaaa cctcgtcatt gaccccagga gtgaaaaaaa 9781 tgacgaaaaa actatcgcgc cgaatccagg cgctattaga aaaagttgaa gacagggatt 9841 atacacccat agaggcgtta tcccttctca aagaaacagc aacagcaaag tttcccgaag 9901 ccgcagaagc 
gcatatccgc ttaggcattg atccgaaata tactgatcaa cagctgcgaa 9961 caacagtcgt actgcccaaa ggaacaggac aaacagtacg agtagcagtg attgcaagag 10021 gggaaaaggt cacagaagca accaatgcgg gtgctgatat cgttggttca gaagaactaa 10081 ttgacgaaat tcaaaaaggt agaatggact ttgacaagct gattgccaca cctgatatca 10141 tgcctcaggt ggcgaagctt ggtaagttac ttggtccccg tggtttgatg ccgtcaccaa 10201 aaggtgggac agtgacattt gacttaggaa gtgcgatcgc agaattcaaa gctggtaaat 10261 tagagttccg agctgatcga actggcattg tccatgttat gtttggtaag gtgtcgttct 10321 caccagaaga tttattagta aacttgaagg cgttgcaaga aacaatcgac cgtaaccgtc 10381 cttcaggagc taaaggtcgt tactggcgta cagtgtatgt gtctgccacg atgggaccat 10441 caattagggt agacattaac ggcctacgcg atttgaaact cacagaagca gcataatatc 10501 gttatgtcat gagtgatgag cacatgacaa atggtaaaca actgaaggca actaacaaaa 10561 ttaaataggc aacagccgga gacagcaggt gccattggct taatgtcctg ccgaggtttt 10621 cgccccaatt gctaccagta ccacgcatga tgagcgtgat tagtggggca taaggatgat 10681 aaggtataga cgcactgtac agcgcgtttt taccactaaa ccccggctgc gagagctggg 10741 gtttattgtt ttcaagcagt cactgtcatg gtggtcagca gttggcaaaa acctaacagc 10801 caatacctga aagccgattt gttaaggagg tgagatggat gcctagaacg atagaagaca 10861 aaaaagctat agttactgac ctcaaagaaa ctttgagcca gtctcaacta gcactagtca 10921 ttgactacca aggactaaca gttgctgaaa tgacagacct gcggcgacgt ctgcgtccct 10981 ctggtactgt ttgcaaggtg actaagaata ctctgatggg cattgccatt caagacgacg 11041 agaaatggca agtgttgtca gaattgctca aaggctcttc tgcctttttg ctggttaaag 11101 acgatttctc cgcagcaatt aaggcttacc aagatttcca aaaagccagc aagaaaacag 11161 aacttcgcgg cggcgttatg gaaggtcgcc tgctgcaaga gcctgatgtc aaggctttgg 11221 gagatttgcc atccaaagaa caactcatgg cacaaattgc tggagcgatc aacgccttgg 11281 ctaccaagat tgctgtgggt atcaacgagg ttcccagttc gctggctcgt gctttgcagg 11341 ctgtcgctga tcaagaaaaa ggtggcgaat ccgaagaaag tgcttcctag ttagtcgtta 11401 gtgctaggta gaacaactaa caaataattt atcaacatta caggagttat atcaatgtct 11461 gctgcaactg atgaaatttt ggataaacta aaaaccttga gcttgctaga agcagctgag 11521 ttggtgaagc aaattgaaga agcctttggc gtaagtgctg cagcacaagt tggagctgtt 
11581 atggcagttc ctggtggtgc tgcacctgct gctgctgaac cagtagaaga gcaaaccgag 11641 tttgacgtca ttctggaatc agtcccagct gataagaaga ttgctgtact caaggttgta 11701 cgggaattga caggtttggg tctcaaggaa gcgaaagact tggtagaagc tgcgccgaag 11761 ccagttaagg aagcgatcgc caaggaagct gctgaacaag ctaagaagca gttggaagat 11821 gctggcggta aggtaactat caaataattc ataactcgta attaaaaatt acgaaaaggc 11881 agcagtccag taaccagttt ggtgtgggct gctgcctttt tttgattgag aagaaatcaa 11941 cacggtatta tgggtgctat gtccgtattc cgtgctacaa cacttggaca aagctagaca 12001 aagttaaact ttgttagaca atacccggat tgcttcctgt ttcatcctgg agatgtcggc 12061 atcaaaccag tccacaggct taaattcgga tagaacttat cctatacgtg aattatcttt 12121 tgacattcaa atgtgagaaa ctttgctact ctatcccatt attttcaatc ctattgaggt 12181 tttctggact tctcaaaccg ccttattaca ggtaggatta ctcttaagag ttttgtacag 12241 ggtctcaagc cttaattttt tcgcttggtt ttcgcgttag cacaattata gcatatggac 12301 aaatagtcta ttaatgtcta gctttgtcta gaaatttagt tacagttttt aacccccggg 12361 aatgttcagg cttgaagtat ctgatgactt ggacaaacaa aaaccaacag acaaaaaatc 12421 tttgcttaaa cgagttgaga aacttgacta tcaagtttca ctctccgaat ttttttgaat 12481 tttgagagaa gttggagcgg agagcagcgc cttgcggagc cagtgcgttg cgggggttcc 12541 ccccgttgaa gcacctggcg tgcgcttgcg cttacggtga ttttgaattt taaattttaa 12601 atttttgcaa agcgactgtt atgggtcttt cacctttaga cttactggat attacctttc 12661 gctccttggg taaaaaccct ttgcgttctg ggctaacagc tttaggagtg tttatgggtg 12721 tggctgctgt cagtgctacg cttgctgtcg gtaacattag tcgggcagtt attgctcagc 12781 aactagcaaa acgaggcgca cctcaagcct cagtctaccc aaagtgggat tctggtcgtc 12841 gaaccacctc acttaaattg gaagatatgg aatttttaca acaacgcttg gtgggcttgc 12901 aggcaatcag tgcttttaat tgggctggtt ctatgccaac tatatttcag gacaaagaat 12961 taactccacc catgtcacct gtttcccaag gttttttgct gacttcagga aaaacactag 13021 tctcgggacg attctttacg gttgaagatt ttgccaggta ccgaccagta gccgttatcg 13081 accaactgtt ggcagaacaa cttttcggtg gacagaaagc tttgggtaag atgatttatg 13141 ctggggacag accttatgtg gtcgtggggg tggtgacgac tactctggat gaaaatgcac 13201 ctcctaatgg tcaactttat ataccaatct ctgtttataa 
cgccctgact ggtagccgcg 13261 acattggtag catccaaatg cgtccctaca aactcgaaga cgtggaaaac ttgagcaagc 13321 aagctgagga actgctaaag cagcgattcc caggtcaaaa atttaagtct tggaacaatg 13381 tctcagatat cctagagcaa cagaaaactc ttgagatggc gtcccaagga ctagcagtgg 13441 tgggagtgat cgcgctcttg gtcggcggtg tggggattgc gaatatcatg attgcctcgg 13501 tgtcagaacg aaccgctgaa attggtttga gacgggcaat aggagcgacc caacaagaaa 13561 tcatgctgca atttattctg gaggctacgc ttttgagtct cattggagga actgtcgctc 13621 ttggtgtggt gcatggattg acaattgtag tcacaaatac ctttaacttg ccctacgagt 13681 ttgacggttc tattgcaaca ctagcgttag gctcagcatt actggttgga gtgggagcca 13741 gtttgacccc cgccctacga gcgagccaaa tcgacccagt caaagcctta cgctcagaat 13801 aatttcactg tcatttagct tttggtactg gggaggctgt acctcttgtt gagaatggtg 13861 agccagcgcg gtcttgggga gccactgcgt tgggcggctg tgccgacttg tagcatgtgg 13921 cgtagtttcc cccatgagcg actggcgtat ctcctatggc taacgccacg gctgcgccga 13981 acggagacgc ttcgctaacg cctttggcgt gcgctttgcg catacccgga gggtgcaaga 14041 tttcagttaa gaaggatttt tgattagcag ttttctttcg ggtggacatt gccactaatg 14101 tctcaaatgt ccacctagct taaaaaagca ggggggtagg cgtattgatt tttaaggtat 14161 ttatgcgttg aacctgcgta acacaacttt acatatagac tgaataaata tctgttttta 14221 aaaccaaaaa tcacgattat gttgagaatt gttaaaagtc aaacttccaa gtaaagttta 14281 ttttcttttc atgcaagaac aagctaaata ctgttagcct gtgaattaag aaaagcatca 14341 atttaactct aattacaggt gtttttagat tcttacgtaa gccaaatagt acttgaataa 14401 tctttagttt tataccaatt ggtagacttc tgagaaataa ccaggaaatt tatatttagt 14461 ttctaaacgt catgctgatt ccacttccca gcccaaacat gaaacattat cgtaatgaat 14521 ggattgaaga atggtgccaa gaaaatgggt ggacagattt atttgaagag cggtgtaata 14581 actactgggc atttcctcca ggggcagtta tgcctgagcc aattcctacc catacattga 14641 gattgattaa aactcaaaag ggactgacat ctcaggaaaa attttggtca gtctcagcag 14701 tatttggcac aattgtagcc gtcgtttcta cttacttatt caaatgtcca atacctctgg 14761 ttttagcttt tgcttttgat gcagtgacgg cagcgcaact tgaggtagag tgatctcaag 14821 aatgagttat gatcagtagt actgagtgct gtagacgcca cagaacaatt ggcacatagt 14881 actttgcttg ggggctattt 
catgatttgc tggggttgct ttactcactt cttgcaccta 14941 accgaataaa tccgtaaaag gtgaataaag agcgcaa // LOCUS NODE_2264_length_14813_cov_5.46198714813 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14813) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14813) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14813 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 66..2903 /locus_tag="DP116_19000" CDS 66..2903 /locus_tag="DP116_19000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181509.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19000" /translation="MHLSLFLRFGVIALILCLLKTSVDQGQQLVPGRSQTVSHSFRTL LQTQKPKNLKIEADRLSNQGVDQYESGQFQKALEIYKKALVIYQEIGDKENISNTLNS LGAVSRELGQYPQALKFYQQALTISRKVDVGKVPDTEEQNNTGLILNNIGLVYQSMGQ YSQALEYYQQSLSLMQKIEDKLGVGTAFNSIGGVYYERGQYSLALKFFQQALVNVQKA KDPIEEANNLNKIGQVYSQVGQYSQALKFYRQALEISKKNNDKLGEGTILNNIGFVYN VMKKYSQARDYYQQALAVFKKTDAQPNIGTTLNNIGFVYQQLGQYSQAVESIEQALTI LQQVGDRAVVGRTLDSMGSAYKGMGQYSQALVSYQQALAVSREIGDRTAQRITLGNIG DLLAQQNKPQLAIIFYKQSVNVTEAIRAQLRSLPREQQQSYTTTVADIYRRLADLLLQ QDRVLEAQQVLDLLKIQELDDYLHDVRGNEKTSKGVESPSPEQSINQGLKVIADKQIQ ISRQLGELQKIPPTNRKPDQIAKIVELDAQLITEFNQFIDSPEVNAWLGQLSPKAAQQ IIPLENLNSLRDNLQRLNQNAVLLYPLILENRLELVLTTPNTPPIHRSVPIKKEELNR AIADFRSALENSESNATIPAQKLYEWLIKPIEKDLANADAKTIIYAPDGALRYIPLAA LYDGKQWLAQRFRTNNITALSLTEIDTKPLPQIKVLAGATTQRYIVQLESTLLPFKAL EYAGAEVQNLATMMPGTKTLLDNEFNRQAMISYLNVYTIIHMATHAFFVNGKPEDSFI LMGDGGLVTLPDIQKLSLPNVDLVVLSACETAVSDQIGKGEEILGFGYQVQRTGARAA IASLWSVSDGGTQALMNAFYAFLSQGKMNKAEALRQAQVAMITGDYSGVTENKDRGIL KSTRQNLPAKVANRLSHPYYWAPFILIGNGL" gene 3066..3881 /locus_tag="DP116_19005" CDS 3066..3881 /locus_tag="DP116_19005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008049245.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty acid hydroxylase family protein" /protein_id="PRJNA477356:DP116_19005" /translation="MEIEFYLKIAISSTIIQQIFYWPLHNIELNFLTQVLLYWLVGSV SFYSIGLFIEKVIKKNDTLREKLTARVKKVKKQPFPSFTAKGIIIGEIRSLIAALIIL YLAPDVNRGNSLLLNLGWFLMRIIAADFCFYVTHWLFHRKFLRKIHLKHHEFADSSSF VAGHKSLTEYIIVTITDLLPIFIFGYDITQLCAWTIIGNAYNLEGHSSLSIFFVPSDF HDLHHTCFKGNYGIQGFWDRVFNTLNPPTKKSGIMFPVASLENITMKSSSNLD" gene 3927..4127 /locus_tag="DP116_19010" CDS 3927..4127 /locus_tag="DP116_19010" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19010" /translation="MVIQIFVGWAMRTLGSNSEFQVIGTAHPTVIENGEPARSWGATA LGSQYLMRVSLRRYLAFGSADL" gene complement(4256..5260) /gene="egtD" /locus_tag="DP116_19015" CDS complement(4256..5260) /gene="egtD" /locus_tag="DP116_19015" /EC_number="2.1.1.44" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213117.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="L-histidine N(alpha)-methyltransferase" /protein_id="PRJNA477356:DP116_19015" /translation="MLTQPLIFLDNQYHELNNDGEDVIQGLTQTPKSLPPKYFYDERG SQLFEQICQLPEYYPTRTEAWILSQYADEIAQMTGSCELVELGSGSSTKTRLLLDSYQ KIADDCRYLPIDISGGILKTSVLQLQQQYPDFSIQGLLGTYEQALAHLESNSLRYSLR PASGLSRMIFFLGSSMGNFTPQESDLFLSQIAHALKPGDYFLLGIDLQKPKEILEAAY NDSQGVTAAFNLNMLSHLNWRFQGNFELNFFTHQAIYNQADAQIEMYLHCQENHWVSL DILNLKVSFQAGESILTEISRKFDLAIIQKQLAAQGLKTLKTWTDPQQWFGLILCQAQ " gene complement(5504..6865) /gene="ovoA" /locus_tag="DP116_19020" CDS complement(5504..6865) /gene="ovoA" /locus_tag="DP116_19020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207950.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="5-histidylcysteine sulfoxide synthase" /protein_id="PRJNA477356:DP116_19020" /translation="MGMVPTMKKLASPQVPTLNDCSSQSLLNYFENSWELEEILMKSL VGEETFYLNPDHLRNCLIFYLGHSAVFYINKLICVGLIKHRINSKYETLFEIGVDPET PTELDAALQGVNWPDVEKVWQYRDKAREVITEVIQNTCLDLPIHQQHPFWALLMGIEH SRIHFETSSMLLRQLPVDRLKRPQGWNYAPSNNEIPNNEMRLIPGGVVKLGKSKDDFT YGWDSEYGDRTVEVKPFLASKYLITNGEFLEFVHEDGYNNPDYWNAESWNWKQLYNVQ HPKFWIPLHDSYRYRAIFDEIDLPLDWAVEVNYYEAIAFCRWKGSEIRLMSEAEWNQA LLTSEANRLSTNYNLNLQFISPSPVGMFKLANSACGLYDLRGNVWEWLGDTFNPLPGF QPHPLYEDQAAPFFDGKHQMMLGGSWATNGSMALPTYRNWFRPYFYQHAGFRIAQDLK AVS" gene complement(7208..8089) /locus_tag="DP116_19025" CDS complement(7208..8089) /locus_tag="DP116_19025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007354409.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19025" /translation="MMQLVQWFTAPLQNEFMVKAILVSALVGMVCSVLSCYMTLKGWA LMGDAVSHAVMPGVVIAYILKIPFAVGAFVFGVGSVIAIGFIKAKTRIKEDTVIGLVF TGFFALGLVLVSKTPSSVDLTHILFGNVLGISQPDIIQTVIISVITLVAIAILRKDLL LFCFDPTHARSIGLNIGVLYYILLSLLSLTAVAGLQTVGIILIVAMLVTPGATAYLLT DNFDHMMLIAMASGVFSSVMGTYISYYIDGATGGCIVVLQTLLFVVAMIFAPKHGLLV RGKKQKDVSIVGIGNRE" gene complement(8086..8865) /locus_tag="DP116_19030" CDS complement(8086..8865) /locus_tag="DP116_19030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007354408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="manganese transporter" /protein_id="PRJNA477356:DP116_19030" /translation="MMNSISIDVENVTVAYHGKVALHSASLQLKASSICGLVGMNGSG KSTLFKAIMGFVKPRTGRVLINGLPITMVQKNNLVAYVPQSEEVDWNFPVSVHDVVMM GCYGYMNILRIPSVKDKRVVRESLERVQMWEMRDRQIGELSGGQKKRAFFARALAQQG TVLLLDEPFTGVDIKTEKAMIDLLLELRDAGNTILVSTHDLASITTFCDQVVLINRTI LAYGNTNEVFTQENLSRTFGGSLSDLPFSKSRIGRENMEGV" gene complement(8945..10000) /locus_tag="DP116_19035" /pseudo CDS complement(8945..10000) /locus_tag="DP116_19035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951062.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="metal ABC transporter substrate-binding protein" assembly_gap 9703..9712 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 10383..10619 /locus_tag="DP116_19040" CDS 10383..10619 /locus_tag="DP116_19040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19040" /translation="MNAHKIETVLTEDGMTLQGLPFHAGDTVEVIILQAKTPQPQNAV NPKSEKNRYSLRGKVIRYDDPTEPVALEDWEFLQ" gene 11057..13399 /gene="pcrA" /locus_tag="DP116_19045" CDS 11057..13399 /gene="pcrA" /locus_tag="DP116_19045" /EC_number="3.6.4.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874333.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA helicase PcrA" /protein_id="PRJNA477356:DP116_19045" /translation="MTTTTDFLSHLNPSQRRAVEHHCGPLLVVAGAGSGKTRALTYRI ANLILQHRVDPENILAVTFTNKAAREMKERIQKLFADRLAITEYNERFDLLPEYDQTK LKSKVWKNYIKEMWCGTFHSLLSRVLRFDIEKYKDEKGRQWTRNFSIFDESDAQSLVK EIVTKQLNLDDKKFEPRSVRYAISNAKNQGLSPKEFEIEQPNYRGRVIAEVYNHYQSR LAQNNALDFDDLILVPVKLFQQNEQVLGYWHNKFRHILVDEYQDTNRTQYELIRLLTT NGETKKSDWDWTNRSTFVVGDADQSIYSFRMADFTILLDFQQDFGDGLPDEDTRTMVK LEENYRSCENILQAANELIENNTQRIDKILKATRGAGEEIFSHKADDEIAEADFVINQ IRSLENQHPELNWGSFAILYRTNAQSRPFEELLVRLGIPYTVVGGIKFYDRKEIKDVL GYLRAIANPSDTLSLLRVINTPRRGIGKATIDGLVNAAQELGTTLWEILIDETSVNTL AGRSAKAVNAFAQMIRHLQEQIETVPVSELVQRVLEDSGYIKDLETQSTDEAEDRLQN VQELFNAALQFEEENEDLNLQAFLSSTALSSDLDNLKEGQSAVSLLTLHASKGLEFPV VFLVGMEQGLFPNYRSMNDPASLEEERRLCYVGITRAQERLYLSHARERRLYGSREPA LRSQFLDELPEELLMTRQKSSVAYTKGTGGSTAQRTTKRNQDTTASWQVGEKVLHKTF GIGEITHIFGSGNKVSLAIKFDSLGQKIIDPRVAQLQRME" gene 13603..14586 /locus_tag="DP116_19050" CDS 13603..14586 /locus_tag="DP116_19050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749287.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytosolic protein" /protein_id="PRJNA477356:DP116_19050" /translation="MTNPQTEFDSPWKDILQLYFEEFMLFFFPQAHEEIDWTRKPEFL DKELQQVIRDAELGKRLVDKLVKIYRLGGEESWILVHVEVQAQEESDFSRRMYSYNYR IFDRYNRSVASIAVLGDEGINWRPNQFGYDLFGCRVDFQFPIVKLLDYKQRQAELLAS RNPFATVVMAHLAALETRNNRLERKQQKLALTRRLYEQGFERENIINLFQFIDWMLTL PSELEKEFWQEFREYEETIRMRYVTSVERIGIEKGIEQGIEQGIEQGIKQGLIKGISL GLKLKFGESGQSLLPEIESIGDVDLLSTILDAIETVATVEQLRQVYLPVSE" BASE COUNT 4551 a 3040 c 2994 g 4218 t 10 others ORIGIN 1 ctgataactg cggctggtac ttgataactg ataactgata actgataact gttaaaattt 61 gattcatgca cttatctctt tttcttcgct ttggcgtaat tgcacttatt ttgtgtttat 121 taaagacttc agttgatcaa ggacaacaac ttgtacctgg tcgttctcaa actgtgagtc 181 attcttttag aacactctta caaacacaaa aacctaagaa cctgaagata gaagcagacc 241 gactatctaa ccaaggggta gaccaatatg agagcggtca gtttcaaaaa gccttagaaa 301 tctacaaaaa agcactagtc atttatcaag aaataggtga caaagaaaat attagtaata 361 cacttaatag cctaggagca gtttctagag aattaggtca atatccgcaa gcactgaaat 421 tttatcagca agcattaacg ataagtcgaa aagtagatgt tggcaaagtt cctgacacag 481 aagaacagaa caacacagga ctcatcctca ataatattgg gttagtttat caatctatgg 541 gtcagtattc tcaagcacta gaatactatc agcaaagtct ttccttaatg caaaaaatcg 601 aagataaact aggcgttgga acagccttta atagtattgg tggagtttat tacgaaagag 661 gacaatattc cctggcgctc aagttttttc agcaagcttt agttaatgtt caaaaagcga 721 aagaccctat agaagaagcg aataacctta ataagattgg acaagtttac agccaagtgg 781 gtcaatattc ccaagcgctc aagttttata ggcaagcttt agaaatttcc aaaaaaaata 841 atgataagct aggcgaaggc acaatactta ataatattgg tttcgtctac aatgtcatga 901 aaaaatattc tcaagcacga gattattatc agcaagcttt agcagttttt aaaaaaactg 961 atgctcagcc aaatattggc actactctca acaatatagg gtttgtctat caacaattgg 1021 gacagtattc acaagcagta gagtctattg agcaagcgtt aaccattctc caacaagtgg 1081 gtgatcgtgc tgtcgttggg cgtacacttg atagtatggg aagcgcttac aaaggtatgg 1141 gtcagtattc ccaagcatta gtgtcatatc agcaagcatt agcagtgagt cgagaaattg 1201 gtgatagaac tgcgcaaagg attaccctcg gaaatattgg tgatttactg gcgcagcaaa 1261 acaaaccaca 
attagcaatt attttctaca agcaatctgt gaatgtgact gaagctatac 1321 gcgcacaatt gcgatcgctt ccacgagaac aacagcaatc ttacacaaca accgttgctg 1381 atatctatcg acgtttagct gacttacttc tgcaacaaga tcgagtgcta gaagcacagc 1441 aagttttaga tttacttaaa atccaagaac tagacgatta cctgcatgat gttcggggta 1501 atgaaaaaac ttccaaagga gttgagtcac cgtctccaga acaatccatt aatcaagggt 1561 tgaaagttat agccgataaa caaattcaaa tcagcagaca acttggcgaa ctgcaaaaaa 1621 ttccaccaac taacaggaaa cctgaccaaa tagcaaagat tgtcgaatta gatgctcagc 1681 tgatcaccga atttaaccaa tttatcgaca gtccagaagt gaatgcttgg ttaggacagc 1741 ttagcccaaa agcggcacaa cagattatac cactggagaa tctcaacagc ttgcgcgata 1801 atttgcagcg tcttaaccaa aatgctgtct tattgtatcc gttgatttta gaaaaccgtc 1861 tagaactggt actaacaacc ccaaatacac caccaattca tcgtagtgta cctattaaaa 1921 aagaagaact caatcgggcg atcgcagatt ttcgtagcgc cttagaaaat tccgaatcca 1981 acgccacaat tcctgcacaa aagctgtatg aatggttaat taagccgata gaaaaagact 2041 tagcaaatgc cgatgccaaa acaattattt atgcaccgga tggagcatta cgttatattc 2101 ctctagctgc tttatacgat ggaaaacaat ggttagcgca gcgcttccgt actaacaata 2161 ttactgcttt gagtttaaca gaaattgata ccaaaccttt accacaaatt aaagttttgg 2221 ctggggcgac aacacagcgt tacatcgtac aattagagtc tactttatta ccatttaaag 2281 cactagaata tgcaggagca gaggttcaaa atcttgctac tatgatgcca ggaactaaaa 2341 cacttttaga taatgaattt aatcgccaag caatgatttc ttatttaaat gtgtatacaa 2401 ttattcatat ggcgactcac gccttttttg tcaacggtaa accggaagac tcttttattt 2461 taatgggtga tggtggttta gttaccctac ctgatattca aaagttatct ttgccaaatg 2521 tagatttggt tgtgttgagt gcttgtgaaa cagcagtcag tgaccaaata ggcaagggag 2581 aagaaatttt aggttttggc tatcaagtac agcgtacagg agcaagagct gcgatcgcat 2641 ctttatggtc tgtgagtgat ggcggaacac aagctttaat gaatgctttt tacgccttct 2701 taagccaagg gaaaatgaat aaggctgaag ctttacgtca agcgcaagtt gcaatgatta 2761 caggcgatta ttcaggggta actgagaaca aagacagagg aattctcaaa tctacacgcc 2821 agaatctacc agcaaaagtc gctaatcgtc tgagtcatcc ctattattgg gcacctttca 2881 ttttaattgg taatggtttg taataccaat tatccatgaa aatgcactta taatcctatc 2941 caggaagata aaaataatat 
atgcttgaat gtatgcttgt aaaaaacttt gattccatga 3001 aaacattgaa tctgatttaa taattactac aaggacacaa agatgaattt ttttttaatt 3061 tctgtttgga gattgaattt tatctaaaaa tcgctattag ttccacaatt attcagcaaa 3121 tcttttactg gccattgcac aatatcgaac tcaactttct cactcaggtg ctattatact 3181 ggcttgttgg ttctgtctca ttttatagca ttggtctttt catcgaaaaa gtcattaaaa 3241 agaatgatac tttgagagag aaactgactg ccagggttaa gaaagtcaaa aaacaaccat 3301 ttccttcttt tactgcaaaa ggcatcatta ttggggaaat cagaagttta atagcagctt 3361 taattatcct ctatctagcg ccagacgtaa atagaggaaa tagcttgctc ctaaatcttg 3421 gatggttctt gatgagaata attgcagctg atttctgttt ttacgtcacc cattggctat 3481 ttcacagaaa attcttgcgg aaaatacatc ttaaacatca tgagtttgcc gactcctcaa 3541 gttttgttgc tggacataag agtttgactg aatatattat tgttactatt acagaccttt 3601 tgcctatctt tatatttggg tatgatatca cccagctatg tgcctggact attataggca 3661 atgcttacaa cctagaaggt catagttcct tatcaatctt tttcgttcca tcagattttc 3721 acgatcttca ccacacttgt ttcaagggaa actatgggat tcaaggattt tgggacagag 3781 tattcaacac gctgaatcct cctacaaaga agtcaggaat tatgttccct gtcgcttctt 3841 tggagaatat caccatgaaa tcgtctagta atttggattg agttacgaat tctagagata 3901 gcaatcttct caacggttgt gagataatgg ttattcagat ttttgtaggg tgggcaatgc 3961 gtacccttgg ctcaaactct gaatttcaag ttattggcac tgcccaccct acagttattg 4021 agaatggtga gccagcgcgg tcttggggag ccactgcgtt ggggagccag tacttgatga 4081 gggtttccct ccgtaggtat ctggcgttcg gctctgccga cttgtagcat gtggcgtggt 4141 ttcccccatg agcgactggc gtatgcgcaa agcgcacgcc caaagggcta aagcgcagcg 4201 tgaccgaagg tcatacccgg agggtgcaag atatcagttc acatttttag aaaaattatt 4261 gagcttggca aagaattaac ccaaaccact gctgaggatc tgtccaagtt ttcaaagttt 4321 taagtccttg cgctgcaagt tgtttttgga tgattgccaa atcgaatttg cgagaaattt 4381 cggtgagaat gctttctcca gcttgaaacg aaacttttaa attgaggata tctagagaca 4441 cccaatgatt ttcttggcaa tggagataca tctctatctg agcatcagct tgattataaa 4501 ttgcttgatg agtgaagaaa ttgagctcga aattgccttg aaaacgccaa tttaaatggg 4561 agagcatatt taaattaaaa gcagcagtta ctccttgact gtcgttatag gctgcttcta 4621 aaatttcttt aggtttttgt aaatcaatcc 
cgaggagaaa gtaatctcct ggttttaaag 4681 cgtgagcaat ttgactcaaa aaaaggtcag attcctgtgg ggtaaaattc cccatagaac 4741 ttcccaggaa aaaaatcatc ctcgataagc cggaggctgg acgcagagag tatcgcaaag 4801 agttcgattc cagatgcgct aaagcttgtt cgtaagttcc tagtaatcct tgaatggaaa 4861 aatcaggata ttgttgttgt agctgtagca cgctggtttt gagaattccc ccgctgatat 4921 caatgggtag atatctacag tcatctgcaa ttttttgata actatctaac aaaaggcgag 4981 ttttagtaga actaccgcta cctaattcta ctagttcaca actgcctgtc atttgagcaa 5041 tttcatcagc gtattgactc aatatccagg cttctgttcg tgtcggataa tattcgggta 5101 actgacaaat ttgttcaaag agttgagaac cacgttcatc ataaaaatat ttgggtggta 5161 aactttttgg ggtttgggtt aatccttgaa tgacatcttc accatcattg tttaactcat 5221 gatactgatt gtcaagaaat attaaaggct gtgttagcat tttctaattt cttctatttt 5281 ctgtttggtg gtataaaatc ctggtgtcgc tgaattaaac aatgcttaag aggatgtttg 5341 aaaagtctaa ttgagtacac aaaattctcc tatatccccc ctctcccccc ttaaaaaggg 5401 gggtaaaagt cccccttaaa aaggagattt agggggttct cgaagatcca cgtatttcaa 5461 acaatactta taaaacatcc tctaaggaaa acgtatctaa atttcatgac acagctttca 5521 aatcttgagc aattctaaaa ccagcgtgtt gataaaagta gggacgaaac caattgcgat 5581 aagttggcaa tgccatggaa ccattagtag cccaagaacc acccagcatc atctgatgtt 5641 taccatcaaa aaagggtgct gcttggtctt cgtaaagcgg gtgaggttga aatcctggta 5701 gggggttgaa ggtgtctccc aaccattccc aaacatttcc tctgaggtca taaagaccac 5761 aagcactatt ggctagtttg aacattccca ctggactggg ggagatgaat tgtaagttga 5821 gattatagtt agttgataag cggttagcct cagaggtgag taaggcttga ttccattcgg 5881 cttcactcat caagcgtatt tctgaacctt tccaacgaca aaatgcgatc gcctcgtaat 5941 agttgacttc cacagcccaa tccaggggta agtctatttc gtcaaatatg gctcgatagc 6001 gatagctatc gtgtaagggt atccagaatt tgggatgttg tacattgtag agttgtttcc 6061 aattccaaga ttcagcattc cagtagtctg gattgttata gccgtcttcg tgaacaaact 6121 ctagaaattc tccgttagtg atgagatact tactcgctaa aaacggttta acttcaactg 6181 tgcgatcgcc atattcgcta tcccaaccgt aggtaaaatc atctttggat tttcccagtt 6241 tcactacacc acctggaatt aagcgcattt cattgttggg aatctcatta ttgctaggtg 6301 cataattcca gccttgggga cgttttaggc gatcgactgg 
taattgacgc agcagcatcg 6361 aagaggtttc aaagtgaatg cgactatgtt ctattcccat cagcaaagcc caaaagggat 6421 gctgttgatg aataggcaaa tctaggcaag tgttttggat aacctctgtg attacctccc 6481 gtgctttatc ccgatattgc caaacttttt ctacatcggg ccagttaacg ccttgcaaag 6541 ctgcatcgag ttctgtcggc gtttctggat caactccaat ttcaaatagg gtttcgtatt 6601 ttgaattaat ccgatgtttt attaaaccaa cgcaaattaa cttgttgatg taaaaaactg 6661 ctgaatgacc aagataaaaa atcagacagt ttcttaaatg atcggggttg agataaaaag 6721 tttcttcccc aaccaaactt ttcatgagta tttcttcaag ttcccaagaa ttttcaaagt 6781 agttaagcag gctttgagaa ctgcaatcat ttagtgtggg aacttgaggt gatgccagtt 6841 tcttcatcgt tggaaccata cccatgacat ttttgataga acattccagg tacacttaca 6901 tatcatacaa aatactacaa aaaaatgtgt tgtttattac aatttttttg atgtcgttat 6961 catcggacaa tttgctaaat ttgtaaacag agaaaaaaag tattgaaact tacaaaatat 7021 acgaattgac aaattgatat aaataatgcg gtggcgttga gcaagtatag gcgaggttaa 7081 atcagtgaca ctccccttgc ctaaacccaa ggagagtgtc aagggcttcc gccctttgat 7141 tttcggtgaa actataggac tccggagcag atttttgcga agcttgctac acctttttcc 7201 ctgttcccta ttccctattc cctattccca cgatagaaac atctttctgc ttctttcccc 7261 taaccaacaa accatgcttg ggtgcaaata tcatcgccac gacaaacagc agagtttgca 7321 acaccacaat gcaaccccca gttgcaccat caatgtaata gctaatgtac gtccccataa 7381 cactcgaaaa cactccagaa gccatcgcaa taagcatcat gtggtcgaag ttatcagtta 7441 ataaatatgc cgttgcacct ggagtgacta acatagcaac aatcagaata attcccactg 7501 tctgaagtcc agcgacggca gttaaggaaa gtaacgatag caaaatatag taaagtactc 7561 ctatatttaa gccaatggaa cgcgcatggg tggggtcaaa acaaaataat agtaggtctt 7621 tgcgtaggat ggcgatcgcc accaaagtaa taacgctaat aatcaccgtc tgaataatat 7681 ctggttgaga aatacccaga acattgccaa acaggatgtg tgtcaaatct acgctgcttg 7741 gtgttttaga aactaacacc aaccccaagg cgaagaaccc cgtaaacacc agtccaatta 7801 ccgtatcttc cttaattctt gtctttgcct tgataaaacc aatagcaata actgaaccca 7861 cgccaaacac aaacgcacca acagcaaagg gtattttcaa aatataagca atcaccaccc 7921 caggcataac cgcatgagaa actgcatctc ccatcaatgc ccaacctttc agggtcatgt 7981 aacaagatag cactgaacag accataccaa ctaaggcact gacgagaatt 
gccttgacca 8041 tgaattcgtt ttgcaaaggc gcagtaaacc attgtaccaa ttgcatcaca ctccctccat 8101 attttcacga cctatccgac ttttgctaaa tggtaaatca cttagagaac caccaaaagt 8161 gcgagagaga ttctcctgtg tgaagacttc attcgtattt ccgtaggcta agatagtccg 8221 attgatcagg acaacttggt cacagaaggt agtgattgat gccaaatcat gggtagaaac 8281 caaaattgtg ttacctgcat ctcgtaattc cagtaacagg tcaatcatgg ctttttctgt 8341 tttgatatct acccctgtga atggttcatc tagcagtaag acagttccct gttgtgccaa 8401 agcacgggca aaaaaggcac gttttttttg tccaccagag agttctccaa tttggcgatc 8461 gcgcatttcc cacatctgga ccctctccaa actctccctc acaacccgtt tatcttttac 8521 ggagggaatc ctcagtatat tcatatatcc gtagcacccc atcatcacca catcatggac 8581 actcaccggg aaattccagt ccacctcttc tgactgtggc acatacgcca ctaggttatt 8641 tttttgtacc attgtgatcg gcaagccgtt aatcaacact ctccccgttc tcggcttcac 8701 aaatcccata attgctttaa ataaagttga ttttccgcta ccattcatcc ccaccaaccc 8761 acaaattgaa ctggctttga gttgtaaaga agcactatgc aaagctacct taccgtggta 8821 agcgactgtc acattttcaa catcaatact gattgagttc ataattattt gttggtagtt 8881 cacctgatat ccccatcttt tgactaatat caaattcgct taactactta tcattcagat 8941 ctcttcacgt ccgcgcatca gcctttaatg ataaatattc aacagattgt gatatgtttc 9001 cctgcaaccc cttaatcagc gtggtcacgt tatattcaag taacttgaga taagtcgatg 9061 ctggaccatc tgacggagag agagaatcta cgtagaacac accgccaaac tttgcaccag 9121 tagcattagc aacctccttt tgcgctttat cgcttaccgt actttcacaa aaaacagcag 9181 gtattttatt tgccttaaca gtgttaatca ccttttccac ttgcttagga gtggcttgtt 9241 gttccgaatt caccgcccac agataaactt ctttcaaacc atagtcgcgg gtgatgtagg 9301 aaaacgcccc ctcacaactg accatatagc gtttattttg gggaagcact aacacctctt 9361 ttagcagatt ttggtcaatc tccttaatct tctgactgta tgctttcgca ttagcattgt 9421 aagtgtctgc gttcgctggg tctaaattta ccagagatga acgaatattt tctacataaa 9481 tcacagcgtt ttgtggtgac atccaagcat gaggattagg tttacctttg taagcatcct 9541 ccgcaatttc cacagatttt attccctgac tcagagtgat atggggaacc ttgggaatgc 9601 tgttgtaaaa tttttctgcc cagcgttcta aacccaaacc gttatccaga atgaggtcag 9661 ctgatttcgc cctcactaaa tcgctgggtg tcggttcata acnnnnnnnn nnaatttccg 
9721 aacccggctt gacaattgat tccacaactg ccttatcacc cgccacgttt cgcgccatat 9781 ccgcaatcac tgtgaaggtc gttaaaatta cctttttgtc ttttttcgcc ggattcactc 9841 cattggttgt ctctggcgta gcatttacct gcgactgttg atcagaaggc gtgggactac 9901 atccactcat ccaaagtccc aatagcagcc cagatgcaac aaccaacgag ggtacatgct 9961 gcagcagtcg taccttgaaa ttgttgatat tttttatcat ttttttttca taatcttatc 10021 atttttaatg aactttacca tagggtgact aaaaatgaaa caaaaatgaa aaaacacata 10081 tcatattttg gttttcattc cagcaaatca aggaaattca aggtttatcc atagacttct 10141 tctgagaaaa atatgaactt ttgttatttc tgaaaataat atgagataaa aatgaaagtt 10201 atgctaaatg aggacaccaa aaggcaaggt tagtccagtg cgtcacaagt attgcaaatt 10261 aattcatcgc tgcggatggg tatatgtatg tgttttccac gaatgtccaa tgaatcaacg 10321 attaggggaa agcgatcgca gtttctcgac gaattacccg aacaattatt aatgactcga 10381 ctatgaacgc tcacaaaata gaaacagttt taactgaaga tggaatgacg ctacagggtt 10441 taccttttca tgcgggagat actgtggaag taattatcct acaagcaaaa actccacaac 10501 ctcaaaatgc agtgaatcca aaatcagaga aaaatcgtta ttcattgcgc ggtaaagtta 10561 tccgatacga tgatccaaca gaaccagtcg ctttagaaga ttgggaattt ttgcaatgat 10621 tgtacttgac caaagctaag ggtgtagggg tgtgagggtg taataccatt tcacgaaaag 10681 cctgatacaa ataaagcttc taaaataaag ttccattcag cgatgacgtg aggcttaata 10741 tagcaatcct aaatcatttg tgtaagtcct gcggacaggc tccgccaacg cgaagcgtgc 10801 gccttagcgc tcaattctct gttctctctt ttcttggcgt tcttggcacg ccaggtgctt 10861 caagtcggcg gagcgtttcc cttcgcactg gctcgtctat gtcctgcgga cacgcttcgc 10921 taaggcggtt aataattttc agaaatcaga taggactgct atatattgac tctcgttcgc 10981 taagatagtg tagccttagc aatagctact atctatataa tgtcacttat tcgatacctg 11041 ctgctccctc acacccatga caacaaccac agacttcctc agtcatctca accccagtca 11101 acgtcgcgct gttgaacatc actgcggtcc cttgctcgtt gttgctggtg cgggttccgg 11161 caaaacacga gcgctgactt atcgcattgc taatcttatt ttgcagcatc gtgtcgatcc 11221 agaaaatatc ctagcggtaa ctttcaccaa caaagccgca cgggaaatga aagaacgtat 11281 tcaaaagtta tttgccgatc gcctagcaat cacagaatac aatgagcgct ttgatttgtt 11341 gccagaatac gaccaaacta agctcaagtc gaaagtctgg aaaaattaca 
taaaagaaat 11401 gtggtgcggt actttccaca gtctcctttc tcgcgttctc cgctttgata tcgaaaaata 11461 caaagacgaa aaaggacgcc aatggacgcg gaatttttcc atctttgatg agtccgacgc 11521 ccaaagtctt gtgaaagaaa tcgtcaccaa acagcttaat ctggacgata aaaaatttga 11581 accccgttct gtgcgctacg caattagtaa cgccaaaaac caaggcttat cacccaaaga 11641 atttgagata gaacaaccca attatcgcgg acgggtgata gcagaagttt acaatcacta 11701 tcaaagccga cttgcacaga acaacgccct tgactttgat gatcttatcc tcgttcctgt 11761 aaaattgttt cagcagaatg agcaagtctt aggttattgg cataataaat ttcgccatat 11821 cctggtagat gaatatcagg atacgaaccg cacccaatac gaactcatcc gcctgttgac 11881 gacgaatggc gaaaccaaaa agagtgattg ggactggaca aatcgttcta ctttcgtggt 11941 tggtgatgca gaccaatcga tttactcatt tagaatggca gacttcacca tcttgttaga 12001 ctttcagcaa gattttggcg acggcttacc agacgaagac acgcgcacga tggtgaagtt 12061 ggaggagaat tatcgctctt gtgaaaatat tctacaagca gcaaacgaac tgattgaaaa 12121 taacacccaa cgtattgata aaattcttaa agcaacaaga ggtgcaggag aagagatttt 12181 ttctcacaaa gcagatgatg aaattgcaga agcagatttt gtcattaacc aaattcgctc 12241 tttggagaat cagcatccag aattgaattg gggaagtttt gccatacttt atcgtaccaa 12301 cgctcaatct cgaccgtttg aagaattatt ggtgcgatta ggaattccct acacagttgt 12361 gggagggata aagttttacg atcgcaaaga aattaaagat gtcctgggtt acttgcgggc 12421 gatcgccaac ccatctgata cactcagttt attgcgagtg atcaatactc cccggcgcgg 12481 aattggtaaa gccaccatcg acggactggt aaacgccgct caagaattag gcacaaccct 12541 gtgggaaata ctgattgatg agacatcagt aaatacatta gccggacgtt ctgccaaagc 12601 tgttaatgct tttgcccaga tgattcgtca tttacaagaa caaatagaga cggttccagt 12661 ttccgaactt gtgcaaaggg tgctggaaga ctctggatac atcaaagact tggaaacgca 12721 aagtacagat gaagcagaag acaggttgca aaacgtccaa gaactattca acgctgcgct 12781 gcaatttgaa gaagagaacg aagatctaaa cctgcaagcc tttctttcca gtaccgccct 12841 cagttccgat ttggataact taaaagaagg acaatcagca gtctcgctgt taactttaca 12901 cgcctccaag gggctggagt ttcctgttgt gttcttggtg gggatggaac aaggactatt 12961 tcctaactac cgttcaatga acgatccagc atctttggaa gaagaacgtc ggttgtgcta 13021 cgtggggatt actcgcgccc aagaacggct 
gtatttaagc cacgcccgcg aacgccgcct 13081 ttatggttct cgggaacctg ctttgcgatc gcaatttctc gacgaattac ccgaagaatt 13141 attaatgact cgacaaaaaa gcagtgttgc ttatacaaaa ggtactggtg gtagcacagc 13201 ccaacgtaca accaaacgaa accaagatac tactgctagt tggcaagtcg gcgaaaaagt 13261 tctgcacaaa acctttggta ttggagagat tactcacatt tttggttcgg gaaataaagt 13321 ttctttggcg attaaatttg atagtttggg gcaaaaaatt atcgatccaa gagttgcaca 13381 gttgcaacgg atggagtgat gaccaaagct aagggtgtag gggtataggg gaaagaatta 13441 ggggtgtaac tctgtgaggg tgtaggggaa taaaagaata agaaaatcaa cgcatttttc 13501 ctacccgtgg ttagtgcatt tcccttacag ccttattatt aagcgatcgc tacaggaagc 13561 tcccagattc aatcagcgaa gcaatccaaa atctgaaatc agatgaccaa ccctcaaaca 13621 gaatttgatt caccttggaa agacatttta caactatatt ttgaagaatt catgttgttc 13681 ttttttcctc aagcacatga ggaaatagac tggacacgaa aaccggaatt tttagataaa 13741 gagttacagc aagtcattcg agatgcagaa ctagggaagc gattggttga taaattggta 13801 aaaatttatc gccttggtgg cgaggaatct tggatacttg tgcatgttga ggtacaagcc 13861 caagaagaat ctgatttttc tcgacggatg tatagttata actatcgcat atttgatcga 13921 tataatcgct cggtagcatc aatcgcagta ttgggtgatg agggtattaa ctggcgacca 13981 aatcagtttg gttatgattt gtttggttgt cgggttgatt ttcagtttcc aattgtcaag 14041 ttgttagact ataagcaacg acaagctgag ttattagcaa gtcgtaaccc atttgcgacg 14101 gtggtcatgg ctcatctagc tgctttagaa actcgcaata atcggttaga gcgtaagcag 14161 caaaagttgg ctttgactag aaggttgtat gaacagggtt ttgaaaggga aaatattatt 14221 aatttatttc agtttatcga ctggatgttg acgttaccat ctgagctaga gaaggaattt 14281 tggcaagagt ttcgcgaata tgaggagact attcgtatgc gttatgttac cagtgttgag 14341 cgtattggaa ttgaaaaagg tattgagcaa ggtattgagc aaggtattga gcaaggtatt 14401 aagcaagggt taataaaagg tatttctttg ggattaaaac tcaaatttgg agagtctggt 14461 caaagtttgt taccggaaat tgaatctatt ggggatgttg atttgttgtc aactattttg 14521 gatgctatag aaactgtggc tacagtagaa caattgcgac aagtttatct accagtaagt 14581 gagtagtttg gaggatggcg atacgcccgc ttgctctgct ttcccttgca atcgctcaaa 14641 cacgaagtca tgctgtgaag cgatctgcaa aggagcgcag cagcgaagct atcgctctta 14701 ttttcaataa 
ttagcatgaa acgtagggtt tataaacacc cctttgatga agctgcgcta 14761 agctgttgtg cattaaaaat gcacaacagc aaggcagaag gcagacggca gaa // LOCUS NODE_2278_length_14761_cov_5.377193 14761 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14761) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14761) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14761 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1332) /locus_tag="DP116_19055" CDS complement(<1..1332) /locus_tag="DP116_19055" /inference="COORDINATES: protein motif:HMM:PF00072.22,HMM:PF00512.23,HMM:PF02518.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_19055" /translation="MLRILLIDDNFDVHLSITRQLKQEFSPIEVTSIIHVKSFEEALA YGDFDIAITDYSLHWTNGLEVLKTIKARYPNCPVIMYTDSGNEEIAVLGMKSGLSDYV LKGRLELLVIAIRESLEKQTSLHEYAVAIERLRVSEERLELAMEAAHLGTWDWDIPTN QVIWSKNHEQLFGLPSGGFLGSYEAFLSCVHPEDREQISEAITSAINTKTDYNKEFRV IWSDGSVHWILGKGNFFYDDTGEPVRMIGVVLDITERKHREEELERANRLKDEFLAIV SHELRTPLNAILGWAQLLRSRNFDEATRNHSLEIIERSALQQNQLIDDILDTSRLMRG QMQLSISPINLVSVIENALNTLQLSAEGKSITLETVLECSVAVVMGDENRLYQIVWNL LSNAIKFTPVGGRVEVRLSISAESNSCDSQLKTQQRQSYDTYDNGGLRASHG" gene 1764..2762 /locus_tag="DP116_19060" CDS 1764..2762 /locus_tag="DP116_19060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017661970.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cysteine synthase A" /protein_id="PRJNA477356:DP116_19060" /translation="MDIRNGFVDTVGHTPLIRLNSFSDETGCEILGKAEFLNPGGSVK DRAALYIIQDAEEKGLLKPGGTVVEGTAGNTGIGLAHICNAKGYKCLIIIPDTQSQEK MDALRALGAEVRPVPAVPYKDPNNYVKLSGRVAGEMENAIWANQFDNLANRRAHYETT GPEIWAQTDGKVDAWVASTGTGGTFAGVAMYLKEKNPAIKCVVADPKGSGLYSYIKTG EINIEGNSITEGIGNSRITANMEGAPSDDAVQIDDREAIKVVYQLLRKDGLFMGGSTG INVAAAVALAKQMGPGHTIVTILCDSGSRYQSRIFNHEWLESKGLSPDESAPGKEF" gene complement(3301..3951) /locus_tag="DP116_19065" CDS complement(3301..3951) /locus_tag="DP116_19065" /EC_number="2.1.1.33" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA (guanosine(46)-N7)-methyltransferase TrmB" /protein_id="PRJNA477356:DP116_19065" /translation="MAIVRVRQHVNPLAQKYQTLIDPLDWEKVYAKPKAPLHLDIGAA RGRFLLSMAKIEPDWNFLGLEIREPLVVEANKWRDELGLTNLHYVFCNVNNSLRSLFS SLPKGSLQRVTIQFPDPWFKNRHAKRRVVQPELVAELAEFLVPGGIVFLQSDIEFVAE EMCDRFTNHPAFQRLGTGEWLAENPLPVPTEREITTTNKGEPVYRALFERVSSSIA" gene complement(4000..5211) /locus_tag="DP116_19070" CDS complement(4000..5211) /locus_tag="DP116_19070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318448.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_19070" /translation="MIVLEFKAKGRTTQYSAIDEAIKTAQFVRNKCIRFWMDNRGVGQ KDLYRHNTALRAEYPFVKDLNSHACQSAVERAYSSIARFYDNCRKSIPGKKGYPQFKK NCRSVEYKTSGWSLSETRKQITFTDKKGIGKLKLKGTWDLNFYQLDQIKRVRLVKKSD GYYVQFLVRSENKVDTQPTGRTIGLDVGLKEFYTDSNGHSEPNPKFYRTGEKRLRFRQ RRVSRKKKGSANRLSAINKLGRVHLKISRQREEHAKRVARCVIQSNDLVAYEDLRIKN LVKNHCLAKSINDAGWYQFRKWLEYLGVKFGRVTVAVNPAYTSQECSKCGTHVKKSLS MRTHVCQCGFVLDRDYNAALNILNRALSTTGHVGTWILDPNASGDLASTVLGANLSQQ VESVNEESPHL" gene complement(5323..6003) /locus_tag="DP116_19075" CDS complement(5323..6003) /locus_tag="DP116_19075" /inference="COORDINATES: protein motif:HMM:PF03551.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19075" /translation="MFDNYDISRYYKSMLNKRENYPMHDRIRAHHPEGGHHAHRHGHH PHRGEGGRRGGPPRRGEGKVRRGEARYLLLDALRDEPKHGYEIIKALEERSSGQYAPS PGTVYPTLQYLEDMGLVRADQEAARRVYHLTETGRTELDAHAEEVNAFWARLKEPDTS AAIQAEIGFLEDELEHLMRTVWGGLRNALNRDDQKTIRRVREVIEHSQNEVRRILTEP DSFRDNQE" gene 6147..6461 /locus_tag="DP116_19080" CDS 6147..6461 /locus_tag="DP116_19080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002763718.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_19080" /translation="MDQAKKERLESKGWKIGTVSDFLKLTPEETIFVEIKLALSRSLK ERRQQLMTQAELASKISSSQSRIAKAENGDASVSIELLIRAILATGATPQDIGQVIAN VK" gene 6703..7803 /locus_tag="DP116_19085" CDS 6703..7803 /locus_tag="DP116_19085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878696.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metallophosphoesterase" /protein_id="PRJNA477356:DP116_19085" /translation="MISNFRFAVVSDLHLALSHTIWNHPSRFHLVEVSIPAFESVLEH LTQLNLDFLLLPGDLTQHGEPENHAWLQERLAQLPFPSYVVPGNHDVPVVKANEQSIA VSDFPHYYRKFGYEDTDQLYYTCQLLPGVRLIGLNSNCFDDQGQQVGRLDTQQLRWLE EVLAGAADDFVLVMVHHNVVEHLPNQSRHPMANRYMLGNAPELLQILKRYGVRLVFTG HLHIQDVADSDGVYDITTGSLVSYPHPYRILEFHQDNYGNQWLQILSYRVTSVPDFPN LQQTSKKWMGDRSFSFLVKFLTLPPLNLPMSQATELAPSLREFWADMANGDAVFDYPN FPPELRRYFEKYGAIAPSGSPSFIDNNTTLLL" gene complement(7821..9047) /locus_tag="DP116_19090" CDS complement(7821..9047) /locus_tag="DP116_19090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207782.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19090" /translation="MADQIQWANALSTRPSLEAAVADVVQQAVSSLTVPADLGLVFIS SAFTSEYSRLLPLLAERLSVPVLIGCSGGGVIGTTGRGQTQELEAEPALSLTLAHLPQ VNIKGFHVLPEELPDLDSPPDDWIDLIGMPSTPAPQFILLSGSFSSGINDLLQGIDFA YPGSVTVGGQASGGGLGGRIALFYNDKVYNEGTIGVALSGNIVLETIVAQGCRPIGKP LQVTSSDRNIILELDERIPLLVLRDLIANLSEQDRILAQHSLFVGLAMDGFKQDLHQG DFLIRSILGVDPTAGAIAIADYIRPGQRLQFHLRDAQTSAEDLEFLLESYYRKQVAQP SAVGALMFSCMGRGEGLYRKPNFDSELFRRYLKDIPLTGFFCGGEIGPVGGSTLLHNY TSVFGICRAINDSGIS" gene 9413..9637 /locus_tag="DP116_19095" CDS 9413..9637 /locus_tag="DP116_19095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319223.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19095" /translation="MSNNIQDAIQQELEQARATCDTSGSNSPECAAAWDAVEELQAEA SHQKQSKPKNSLEVYCDANPDADECRVYED" gene 9917..10513 /locus_tag="DP116_19100" CDS 9917..10513 /locus_tag="DP116_19100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454250.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3177 domain-containing protein" /protein_id="PRJNA477356:DP116_19100" /translation="MNNEVWFRPFVWMDYRLAVLFAVIIPIILLIWAYVEKAEAIQRL LTIYWRVASLLAITIYLMIAGFGVSFISGLMGLILIPISLWFWVDLNDEIEYQSNGPL KLLFTSWRWATTVYCILNAIAFIPFVGCGFSEGAIATPYCRVWLEAPLLFKEYFHPNS KPGFLGFLGIVGLVIYVLYLSYFVVVKLGKQGRSATPQ" gene 10686..10985 /locus_tag="DP116_19105" CDS 10686..10985 /locus_tag="DP116_19105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312305.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19105" /translation="MNNSISKRLEQYTVKRPQEVLLVSVEIAGESDEIAIFKGFSSSL TRPTAFDPDVPVLQDEAKIIKIDRVASPYNPEAPRYIQQGLSWDNMQVLLSQMEV" gene 11236..12858 /locus_tag="DP116_19110" CDS 11236..12858 /locus_tag="DP116_19110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207787.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GMP synthase (glutamine-hydrolyzing)" /protein_id="PRJNA477356:DP116_19110" /translation="MNTAVTLPTEQAPQKVESLGQLNRQMIVILDFGSQYSELIARRI RETQVYSEVLSYRTTAEQLRQLNPKGIIFSGGPNSVYDTGAPRCDPEIWNMGLPILGV CYGMQLMVQQLGGEVAKADRGEYGKASLYIDDPTDLFTNVEDGTTMWMSHGDSVIQMP EQFELLAHTENTPCAAIADHDKKLYGVQFHPEVVHSVGGIALIRNFVYHICECEPTWT TAAFVEQAIREIRARVGDKRVLLALSGGVDSSTLAFLLHKAIGDQLTCVFIDQGFMRK YEPERLLKLFQEQFHIPVEYVNARERFISSLSGITDPEEKRRIIGREFIIAFEETSRR LGPFDYLAQGTLYPDVIESANTNVDPQTGERVAVKIKSHHNVGGLPKDLRFKLVEPLR KLFKDEVRKVGRSIGLPEEIVQRHPFPGPGLAIRIIGEITAERLNILRDADLIVRQEI NQRGLYHDYWQAFAVLLPVRSVGVMGDQRTYAYPIVLRIITSEDGMTADWARVPYDVL EVISNRIVNEVKGVNRVVYDITSKPPGTIEWE" gene complement(13610..14056) /locus_tag="DP116_19115" /pseudo CDS complement(13610..14056) /locus_tag="DP116_19115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009341836.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="cation-efflux pump" gene 14129..14761 /locus_tag="DP116_19120" /pseudo CDS 14129..14761 /locus_tag="DP116_19120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_086558172.1" /note="frameshifted; incomplete; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" BASE COUNT 4219 a 3271 c 3099 g 4172 t ORIGIN 1 tccgtgggaa gcccgaagtc ccccattgtc gtaagtgtcg taggattggc gttgttgagt 61 tttgagttgt gagtcacaag agtttgactc agcgctaatt gacagcctca cttccactcg 121 tcctcctaca ggggtgaact taattgcatt ggaaagtaaa ttccaaacaa tttgatacaa 181 gcgattttca tctcccatca ctaccgcgac tgagcactct agtacggttt ccaaagtaat 241 tgattttcct tctgctgata actgtagagt attaagcgcg ttctcaatca cactgactaa 301 attgatagga gatatagaca gctgcatttg ccctcgcatt aagcgagagg tatcaagaat 361 atcatcaatt aactggtttt gttgaagggc actgcgttca ataatttcta aactatgatt 421 tctagttgct tcatcaaagt tgcggctacg tagtaactgt gcccaaccta aaattgcgtt 481 taaaggagtg cgaagttcgt gggagacaat cgcaagaaat tcatctttta agcgatttgc 541 tcgctccaat tcttcttccc ggtgcttgcg ttcagtgatg tctaacacca ctccaatcat 601 gcgcacaggc tcacctgtat cgtcataaaa aaaattacct ttacccaaga tccaatgaac 661 gcttccatca gaccaaataa cgcgaaattc cttgttgtag tcagtttttg tatttatggc 721 agaggtaata gcttcagaaa tttgttctct gtcttctgga tgaacgcagg aaagaaatgc 781 ttcataactt cctagaaagc ctcctgatgg taaaccaaat aattgttcat gatttttaga 841 ccaaatcact tgatttgttg gtatgtccca gtcccaggtg cctagatgag cagcttccat 901 tgccagttct aaacgctctt cactgactcg caatcgctca atggcaacag catactcgtg 961 aagtgatgtc tgcttttcta aactttccct aattgcaatg acaagcagct ctagccttcc 1021 ctttaagaca taatcactta gcccagactt catccctaaa acagcaattt cttcgttgcc 1081 gctatcagtg tacataatga caggacagtt tggatatcta gctttaatgg tttttaaaac 1141 ctctaagcca ttagtccagt gtagagagta gtcagtaatt gcaatatcaa aatcaccata 1201 cgctagcgct tcctcaaaac tcttaacatg aataattgag gtaacttcta ttggggaaaa 1261 ttcttgcttg agttgccgtg ttatcgacag gtgcacatca aagttgtcat caattaggag 1321 tatgcgaagc atgatatttg ctatttacaa gacgtaggct aaaaaattaa gtgtaaattt 1381 tagttcaata gtgaagtatc acaccttttc tatcatctcc tgtgaaagat ctagaaatct 1441 gggctaatat tgaagatata aacaagattt tttttcctca gtaattacac ttgtttacgt 1501 tcatatcttg tttcctaaat tgtgaaatca taggatagga actctaagaa gaatttttag 1561 aaaaatttaa gcattctact gctgctgcac tttagccata 
gaatttagca ctacactgca 1621 cttggtggtt gacgtttgaa aatttttgag tgtgtatggg aaaatcttat ccttagaact 1681 aagacctctc taaaatcagt atagggtcaa taaggcctca cgcctgatga ccaaaagcat 1741 agtctccacc aagaagttta aacatggata tcagaaacgg ctttgtagat actgtaggtc 1801 atacaccact tattcgttta aacagcttta gtgacgaaac tgggtgtgaa attcttggta 1861 aagcagaatt tctcaaccct ggtggttctg ttaaagatcg ggcagcactt tatattattc 1921 aagatgccga agaaaaaggt ttactcaaac ctggtggtac agtcgtagaa ggaacagctg 1981 gtaatactgg cattgggttg gcacatatat gcaacgccaa aggttataaa tgcctgatta 2041 taattcctga cactcaatct caagagaaaa tggatgcttt gagggcttta ggtgcagaag 2101 ttcgccctgt ccctgctgta ccctataaag accccaataa ttatgtcaaa ctctctggta 2161 gagtagctgg cgagatggaa aatgctattt gggcaaatca gtttgataac ttagcaaacc 2221 gccgcgccca ttacgaaacc acaggaccag aaatttgggc acagacagat ggtaaagttg 2281 acgcttgggt cgcttcaaca gggactggtg gaacttttgc tggtgtagca atgtatttga 2341 aagaaaaaaa tccagcaatt aagtgtgttg tagctgaccc aaaaggtagc ggactttaca 2401 gctacattaa aacaggcgaa atcaatattg aaggtaattc tattactgaa ggcattggca 2461 atagtcgtat taccgccaat atggaaggcg cgcctagtga tgatgctgtc cagattgatg 2521 acagggaagc cataaaggtt gtttatcagt tactacggaa ggatggctta tttatgggtg 2581 gttctacagg tataaatgta gctgcggccg ttgctttagc aaaacaaatg ggaccagggc 2641 acaccatagt tactatcttg tgtgacagcg gttcccgtta ccagtcgcgc atcttcaacc 2701 atgaatggtt agaatctaaa ggtctttctc cagatgagtc agcccctggg aaggaattct 2761 aaattcaaaa tgataaatta aaaattaaat acaatttcac acccaagttg gggatgatca 2821 aattctgaac ctttaactag cgctcttaac cctccgggaa cgcgggtcgc ctgttgtcgg 2881 gaaaacgcca catgcttcaa gccgggaaac ccgtccaacg cagtggctcc cctcccgcag 2941 cgctggtctc accgctagca acatatcaaa gaaacaccat cccccttgtt aagagttccc 3001 tgttccttgt ctttcttaac agtaaaaacc aaccactgat tcaggcgata cgcacaagct 3061 aaagcctccg gcttatcgcc cactccgaaa ttgcgtggcg caacacacca ccgcaatttc 3121 gacttcatca aaagtagaaa agctaaaaat tctatctgtt ttcatcactt ttcaacaaag 3181 tgcacaaatg ctccaaaaat cgcttctcta gttcttcaac acttgagaca acatacaaat 3241 attttgctcg atctgagaat cattcaattg agtataacca ataatgaatt 
ctctcactgt 3301 tcatgctatt gaagaactca ctctttcaaa taaagcacga taaacaggtt cacccttgtt 3361 tgttgtcgtt atctcccgtt ctgtgggaac tggtagcgga ttttctgcta gccattctcc 3421 tgtaccaagt ctttgaaaag cgggatgatt tgtaaagcga tcgcacattt cctccgccac 3481 aaactcaata tccgattgca aaaatacaat accccctggt acgagaaatt ctgccagttc 3541 tgcaactaat tctggttgga cgactcgccg ttttgcatgg cggtttttaa accaggggtc 3601 gggaaattga attgtgacac gttgtagact tcctttagga agggaagaga aaagcgatcg 3661 caatgagtta ttcacattgc aaaacacata gtggagattt gtcagtccca actcatcccg 3721 ccacttgttc gcctccacca ccaacggttc ccgaatttcc aaacccagaa aattccagtc 3781 tggttcaatt tttgccatgc ttaacaaaaa gcgtcctcgc gcggccccaa tatccagatg 3841 taggggcgct tttggcttgg cgtagacttt ttcccagtcc aggggatcga tcagtgtttg 3901 atacttttgt gcaagcgggt taacgtgttg acggactcta acaattgcca aaatggactc 3961 tccttttcac caaacccaat atctttgaca ctctgcgccc tataagtgcg gagattcttc 4021 gttcacagat tcaacttgct gagataggtt tgcacctaaa acagtagagg ccaagtctcc 4081 cgaagcgttc ggatctaaga tccaagttcc cacatgccct gtggtactta aggctcgatt 4141 cagaatattt agagcagcat tataatcccg atccaacaca aatccacact gacaaacgtg 4201 ggttctcatg gacagagact ttttaacatg agtgccacat ttagagcatt cttgtgatgt 4261 ataagcaggg ttgacagcaa ccgtaactct gccaaactta actccaagat actctaacca 4321 tttcctaaac tgataccaac ctgcatcatt aatagacttg gcgagacagt gatttttgac 4381 caagttctta atcctcaagt cttcgtaggc aaccaaatcg ttagattgga ttacgcaacg 4441 cgccactctc ttggcatgtt cttcacgttg cctacttatt ttaaggtgta ctcgccctag 4501 cttattaatg gcgctcaagc ggttagcaga gcctttcttt ttacgagaaa cacgacgttg 4561 acgaaatctc aatcgtttct cgccagttcg ataaaactta gggttaggtt cactatgtcc 4621 gttgctatca gtgtagaact ccttaagtcc cacatccaat ccaatggttc tgccagtagg 4681 ttgtgtatct accttatttt cagatctgac taaaaactga acgtagtacc catcagattt 4741 cttgactaac ctaacccgtt ttatctgatc taattgatag aagtttaaat cccacgttcc 4801 tttgagctta agcttgccaa tccccttttt atcggtgaat gtgatttgct tcctagtttc 4861 agaaagtgac caacctgaag ttttgtactc tactgaacga caattctttt tgaattgagg 4921 ataacctttt ttacccggga tagactttct acagttatcg taaaatcgag caatagaact 
4981 ataagctctt tccaccgcag attggcaagc atgagagttc aagtctttaa caaaaggata 5041 ttctgctctt aatgctgtat tgtgacgata cagatctttc tgtcctacac ctcggttatc 5101 catccaaaaa cgaatacact tgttgcgaac aaattgagcc gttttaatcg cctcatctat 5161 agcactatat tgagttgtcc ttcccttagc cttgaactct aaaactatca tttgacgtgg 5221 acaatcccta cgtcaatcat gttaacacaa aaaagacgtc ctagaaggac ggggcttgta 5281 cccattaatt tcggtcaacc agaaacaatc aaaaaaatac ggttattctt gattatctct 5341 aaagctgtcg ggttctgtca ggatgcgccg cacctcattc tgggagtgtt caatcacctc 5401 ccggacacga cggattgttt tttgatcatc tcggtttagg gcatttcgta atccgcccca 5461 tactgtccgc atcaagtgtt ccaattcgtc ctccagaaaa ccgatttcgg cttgaatcgc 5521 ggcagatgta tcaggttctt ttaatcgcgc ccaaaaagca ttcacctcct ccgcgtgggc 5581 gtccaactcg gttcgtcccg tctccgtcag atgatagacg cgccgcgctg cttcttgatc 5641 ggcacgcacc agtcccatgt cttctagata ttgcagggtt ggatagactg tgccagggct 5701 aggggcgtac tgtccagacg accgctcctc cagcgccttg ataatctcat agccgtgttt 5761 gggttcatcg cgaagcgcgt caagaagcag gtaacgcgcc tcaccacgcc ggactttgcc 5821 ctcccctctt ctgggaggtc ctcctctgcg tccaccttct cctctatgag gatggtgtcc 5881 gtgtctgtgg gcatggtgtc cgccctctgg atgatgtgct cttattcggt catgcatagg 5941 ataattttct cttttgttca acatactttt atagtatcgc gatatatcgt aattgtcaaa 6001 catgttcgtt ggttgcaacg gattttttag acgacagagg cacagcaata tatcagatat 6061 gatataaaga tggcatagaa caaccaacct tacctttggt ttggctgcac ggtgaggtaa 6121 aaactccacc taacaggagt catgtgatgg atcaagcaaa gaaagaacgc ttagaatcta 6181 aaggctggaa gattgggaca gtctcagatt ttttgaagtt aacaccggag gaaactatct 6241 ttgttgaaat taagttagct cttagtcgaa gcttgaagga acgtcggcaa caactgatga 6301 cccaagctga acttgcctcc aaaattagct ctagccaatc ccggattgca aaagctgaaa 6361 atggagatgc ttcagtttca attgagctat taattcgggc aatcctcgca acaggtgcaa 6421 cgcctcaaga cattggacag gtgattgcta atgtcaagta agaaaaagga attgctctaa 6481 aaagtgcttt gtatattttt tccttggata ctcgcaacac accagaaaaa gcccccaata 6541 tttcgttcaa attctacgtt cttcttgatc aacgagaaaa actgacagat tttatgtaca 6601 aatgttaagc gagatcactg gcaagcagcg tagcgaagca agggttatac tctttctaaa 6661 
aagagttttt tacaccctac ttttgagatt gtcataaaat caatgatttc aaattttcgc 6721 tttgctgtgg tcagcgactt gcaccttgca ctttcccata caatctggaa tcatcccagt 6781 cgttttcatt tggtggaggt tagcatcccc gcgtttgaaa gtgtactaga acatttaaca 6841 caacttaatc tagattttct tttgcttcca ggagatttaa ctcagcacgg tgaaccagag 6901 aatcatgctt ggttgcaaga acgtttagca cagcttccct ttcctagtta tgttgttcct 6961 ggtaatcatg atgttcccgt tgtgaaggcg aatgagcaat ccattgctgt ctctgatttt 7021 ccgcactatt accgcaagtt tggctacgag gatactgacc aactttacta cacttgtcag 7081 ttactgccag gtgttaggct catcggtcta aattctaact gttttgacga tcaaggacag 7141 caggtaggac gcttagatac ccaacagcta cggtggttag aagaagtttt agcaggggct 7201 gctgatgact ttgtgttagt catggtgcat cacaatgttg ttgaacacct gcctaatcaa 7261 tcacgccatc caatggcaaa tcgctatatg ttaggaaatg caccagaact attgcagatc 7321 ctaaagcggt acggcgttcg gctggtgttc acaggacact tgcacattca agatgttgcc 7381 gattcagacg gtgtatacga tataacaaca ggttctttag tcagttaccc tcatccttac 7441 cgaatcttag agtttcatca ggataattac ggtaaccaat ggttgcaaat tttatcctat 7501 cgcgtgacat cagtgcctga tttcccgaac ttgcaacaga catctaagaa gtggatgggc 7561 gatcgcagtt tttcctttct tgttaagttc ctaactctgc ctcccttaaa cctgccaatg 7621 tcgcaggcaa cagaattagc tcccagttta cgcgagtttt gggcagatat ggctaatgga 7681 gatgctgtgt ttgattatcc taactttcca ccagaactgc gtcgctactt tgagaagtat 7741 ggtgcgatcg cccctagcgg aagtccatcc ttcattgata acaacactac attgttgctt 7801 tagcagacaa ggaaaaatct ttaacttatt cctgagtcgt tgatcgcacg gcaaattcca 7861 aagacagagg tataattgtg tagcaaagta ctgccaccca caggaccgat ttcaccgcca 7921 caaaagaagc ctgttaaagg gatatctttg aggtagcgcc taaaaagctc agaatcaaaa 7981 ttaggttttc ggtagagtcc ttcaccacgt cccatacagg aaaacatcag cgcgccaact 8041 gcagacggtt gtgcgacttg ttttctataa taactttcta aaaggaattc caagtcttca 8101 gcagaagttt gagcatcacg caggtgaaat tgtaggcgtt gtccaggtcg aatataatct 8161 gcaatagcaa ttgctccggc tgttggatcg actccgagga tgctacgaat caaaaagtct 8221 ccctggtgta aatcttgctt gaatccatcc atcgccaacc caacaaacag agaatgttgt 8281 gccaggattc ggtcttgttc actcaaattg gcaatcagat ctcgcaagac aagcagtggt 8341 attcgctcat 
cgagttccag gatgatattg cgatcgcttg aggtgacttg cagcggttta 8401 ccaatcggtc ggcatccttg tgccacaatc gtttctaaga caatattacc actcaaagcg 8461 acgccaattg tcccctcatt atacactttg tcattgtaaa ataaggcgat acgacctccc 8521 aaaccgccgc cacttgcctg tcctcccacc gtcaccgatc caggataagc aaagtctatc 8581 ccttggagta aatcgttaat tccagatgag aacgagccag acagcaatat gaattgtggc 8641 gctggtgttg atggcatacc tatcaaatca atccaatcat ctggtggact atctagatca 8701 ggtaattctt caggaagaac atgaaaacct ttaatattca cttgcggcag atgcgctaaa 8761 gtcaaactga gggcaggttc tgcttctaac tcttgggttt gtccgcgccc agttgtccca 8821 atcacaccac caccgctaca accaatcagc acaggtacag aaagtcgttc agcaagtaaa 8881 ggtaaaagcc gggaatactc acttgtaaaa gcagacgaaa tgaataccag ccctaaatca 8941 gcaggtactg ttagcgatga gacagcttgt tgtacgacat ctgcaacagc tgcttccaaa 9001 gaaggacggg ttgatagggc gtttgcccat tgtatttggt ctgccatgag ttttccactc 9061 ttattttcga ggctaatggg tttttatttt tatcctatca agaaccgacg ataggactcc 9121 tgccacgctg atagcggagc ttcccagcag ccataggtta ccgacgtggt gaattggcgg 9181 tcaaaataaa ttgtataatt agtgtttgag ctaaaattct gtaaaatctt aaaagataaa 9241 gacgaataca aaattattag tacggtttac tgtaatagtt atggagtaca ttccctcact 9301 tactggtgag acgagttttg gtcaatcaca gtaaaatggg aactatagtg caaaaagtgt 9361 catttttgaa ttttttctat acttgtattg aaaacacaac aagactagga agatgagcaa 9421 caacatacaa gacgcaatcc aacaagaact agagcaagct cgtgccacct gcgatacctc 9481 aggtagcaac tctcctgagt gcgcagcagc ttgggatgca gtagaagaac tgcaagccga 9541 agcctcccac cagaagcaat ctaaacccaa aaactctctt gaggtgtact gtgatgctaa 9601 tccagatgca gatgagtgca gggtttacga agattaaaaa gtgattcgtc cagcacttta 9661 ttcatcgtag gtttgcaagt aaccagaatc tgtttactgg ctatattgca tcttaaatac 9721 tccattgtac aaaatctcac tccaagctca ttccgttttc tgtagaaaac tggaaatgtg 9781 tataggggtg agattttctt atcatccttc attgagttga ctgtttcttt ttaaagacaa 9841 gagtacaaga ttaatgagta atatcatata ctaattgact attcactata attgttgatt 9901 tctcataaaa aagattatga ataatgaagt ctggtttcgt ccctttgtct ggatggacta 9961 ccgattagca gtattattcg cagtgattat ccccatcatt ctgctaattt gggcatatgt 10021 agaaaaagcc gaagcgatac 
aacgcttgct cacgatttac tggcgagtag caagtttatt 10081 ggctatcacc atctacttga tgattgcggg ctttggagtg agttttatct cagggctgat 10141 gggtctgatt ctgattccca tttctctgtg gttttgggtg gatctcaacg atgaaattga 10201 atatcagtca aatggacctc taaagttgct tttcacctcc tggcgctggg ctacgacggt 10261 gtattgtatt ttgaacgcaa ttgcttttat accttttgtg ggttgtggtt tttctgaagg 10321 cgcgatcgca actccctact gtcgcgtctg gcttgaagcc ccattactgt ttaaagaata 10381 tttccatccc aactccaaac ctggatttct cggctttctg ggtatcgttg gtttagttat 10441 ctatgtgctt tacttaagtt acttcgtcgt cgttaaacta ggtaagcagg gacgctcagc 10501 aacaccgcag tagctttcaa gagacacgct taacagggaa cgcttaacag ggaacaggga 10561 gcgcttaaca gggagaaaaa taactccctc actccttcac tttctcactc cttcactccc 10621 tacttcttag gggtgtaaaa gaaacattaa ttacacgtcc taaattgaaa atcaagaatt 10681 ttaaaatgaa caattctatc agcaagcgct tggaacaata tactgtcaaa cgtcctcaag 10741 aagtcctact tgtcagtgtg gaaattgcgg gtgagtcaga cgaaattgcc atttttaaag 10801 gcttttccag ttctttgacg cgcccaaccg cctttgatcc cgatgttcct gtactgcaag 10861 atgaagcaaa aattatcaaa attgaccgcg tagccagtcc ttacaatcct gaagcacccc 10921 gctacatcca acaaggactt tcttgggaca atatgcaagt cttattgtca caaatggaag 10981 tttgataata gcaaaattat gttaagttat tataaataat cttaagattc accaatcaag 11041 ctataatatc ttaccaatct agggtaagtt tttcttctta ataccattta cacataacta 11101 tcggctaagc agacgcattc aaagtcgcgt gtgcagaggg tttatactac catcacagcc 11161 tgacaacgca gagaaatggt attgttctga acctgacgag cgacaatttc ttcttacact 11221 catacctccc ctgccatgaa cactgcggtg actctaccaa cagaacaagc gcctcaaaaa 11281 gttgaatctt tagggcagct caatcgtcaa atgattgtaa ttctcgattt cggttctcag 11341 tattctgaac tgattgcccg ccgaattcgt gaaactcaag tatactctga agttctttct 11401 tatcgcacca cagctgaaca attacgtcaa ctcaatccta aagggattat cttttccgga 11461 ggtcccaatt cagtttacga cactggtgct ccccggtgtg atccagaaat ctggaatatg 11521 ggacttccga ttttaggtgt atgctacggt atgcaactga tggtgcaaca gctgggcggg 11581 gaagtggcaa aagctgaccg aggtgagtat ggtaaagcat cactctacat agatgatccc 11641 acagatttgt tcactaatgt tgaagatggg acgacgatgt ggatgagcca cggagactct 11701 
gtgattcaaa tgccagaaca atttgaatta ctggcacata cagaaaatac cccttgtgca 11761 gctattgctg atcacgacaa gaaactttac ggcgtccaat tccatccaga agtggtgcat 11821 tctgttggtg gtatagcttt aattcgtaat tttgtatacc atatctgtga gtgtgaaccg 11881 acctggacaa ctgctgcttt tgtcgaacaa gccattcgag aaattcgcgc gagagtcggc 11941 gataaacgag tgctgttggc gctttctgga ggagtcgatt cttcaacctt ggctttttta 12001 ttacataaag caattgggga tcagttgact tgtgtgttta tcgaccaagg ctttatgcga 12061 aagtatgagc cagaacgatt gctcaaactg ttccaagaac agtttcatat tccggtagaa 12121 tatgtcaacg ctagggaacg gttcatttct tcactctctg gaattactga tcctgaagaa 12181 aaacgtcgta ttattgggcg cgagtttatc attgcttttg aggaaacatc cagacgcctc 12241 ggaccttttg attatctcgc tcaaggcacg ctttatccag acgttattga atctgctaat 12301 accaatgttg atccccaaac tggagaacgg gtggcagtga aaatcaaaag ccatcacaac 12361 gttggtggat tgcccaaaga cttgcgattc aaattggtgg aaccattacg gaaactgttt 12421 aaggatgaag tccgcaaagt cgggcgttct attggcttac cagaagaaat tgtccaacgg 12481 catcctttcc ctggaccggg tttagcgatt cggattatcg gcgaaattac tgccgaaagg 12541 ttaaatattt tacgcgatgc cgacttaatt gtgcgtcaag aaattaacca acgcggttta 12601 taccacgatt actggcaagc atttgctgtc ttactaccag ttcgtagtgt tggtgttatg 12661 ggtgaccaac gtacttatgc ttaccccatc gttttgcgga ttatcacaag tgaagatggt 12721 atgactgctg actgggcacg tgtaccttat gatgtcttgg aagtgatttc taatcggatt 12781 gtgaatgaag tcaaaggtgt taaccgagtg gtttatgata tcacctccaa gccacctgga 12841 actattgagt gggagtaatt gtgaagtacc ccaccgcatg gcggatgggg ctagctccga 12901 agggagcgct acgcaaacgc gtctcattcg cagttgccat gttgggtata ttcattgctg 12961 aatatccgat actaaaatcg gtgatggtct tacactgcgt ctgtgggtat tgctaccttt 13021 atcccttgcg ggaacccgcg cagccgtccc acttcctggg acggtttatt gttttaccgt 13081 acaacatgac atggattgtt ttttatagca gggcgtaacc ttgctaacca gcggtcatgt 13141 tcttttccag gtttgatgtt tctgcggtaa gcatcttact gctacttcgg attaaccttt 13201 gtgttccata cagcctggtg gtcataaacc agtactatca tagcaccaat acagtactaa 13261 acgtcaacta tgaaagcgct aaatgttcgt ttttccgatc aagaacacaa agaattgcaa 13321 aacgctagtc aactacttga gagatctatc aatgatttaa ttcgtgaagc 
agttagagat 13381 cacttgctaa agttgagggc atcgaaccct caactttagc ccggcaattt cgcccaccgc 13441 actcagggat ggggaatttc gcggaactgc gttaaagcta ggtgggcatt gcccacctga 13501 tttattttct gcatcagcaa gtaggcaagt gtaaaacctc accctgccct gtcgggcatc 13561 cctctccgtg gcttcaccag agggaaaaat ttttacgcat cagtaagggc tagaaagaaa 13621 catggttcga ctggtaggca ggtggttcga catgaatcaa aattctaact gggctgaagc 13681 gttcttctag tcgcttttct acttcctcgg tgatacggtg ggcagtttcg acgtcaggtg 13741 catcgactat taaatgcatt tcgatgaaga cttgacgacc aacaacacca cgagaagcaa 13801 tatcatgaca gttaacgaca ccaggaacag aaagggcgat tccatgaatg acttctgggg 13861 cgattgccta cggcagagct acgcttaacg ccatcctatc aacaagccaa ggtaaatttt 13921 cttttaaaac agtccagcca ctccaaaata ccaacaaagc cactggaaaa gccaaaatca 13981 aatctaacca ctgaacaccc agccacacac ctatcaaacc agcaattaca gaaattgtta 14041 cccagatatc gctcatatag gattcctatt tgatttttga aaaaaaagcg agaaaatacg 14101 gaggggtgta caagtgtaag agcagacaat gaaaaatcag catctaaaaa ccgcagtact 14161 acctgcgcaa gtggcaacag cgatgcgatc gcaccaaacc actcaaaggg cgatcaccga 14221 actacaagaa tttattgctt tgcgtccaaa tgcgcgtgag gtaaggaaag cactagtagt 14281 caagctggtt tatcaaggct acttgtatga agaaattcag acaattctag atgtgtcact 14341 gggttcaata acaggttgga aacaagccta cgagcgagat ggaatagatg gactgcggtt 14401 gaatcataag ggaaggaaga gcgcgctttc gtagcgaaca acgagaaaag gtgttgagtt 14461 ggctgcaaac aaaggattat tgggagcttg gggaactgga gtataaacta gctttcgagt 14521 acgacgtggt ttacgagtca aaacaaagtt actacgactt gtttgaagca gcaggaataa 14581 gcgcagaagt taaccacaaa attaaacccc aaagcagacc cgaatgctgt tgcagcaaaa 14641 aaaaagagat tgaaacactc ttggcaaatc accgcagtga aattgaaaca ggaaaattga 14701 gagtgctgct aattgatcaa tgtcatttaa tgtggggaga tttaagtggt tatgtatggg 14761 g // LOCUS NODE_2286_length_14728_cov_5.06924314728 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14728) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14728) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14728 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 147..1031 /locus_tag="DP116_19125" CDS 147..1031 /locus_tag="DP116_19125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317806.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbohydrate ABC transporter permease" /protein_id="PRJNA477356:DP116_19125" /translation="MTVHQENLTTNPKPKTIVKKSWKNILLWIVVTLVVVFCLAPAMW QLLTSFKVNQDIAKIPTVYFPSRITLNHYIEIFARRPFWRYILNSAFVSILSTVLSLA LGAPAAYALARLRPWGGRVILAGILIVTLFPGILLFLGLLEIIQGLHLGNNYLALIIP YTAINLPLTILVLRSFFEQLPKDLEDSARVDGYNTLQMLIQILLPMTLPALVTTGILT FIFAWNEFIFALTFITREEMKTIPVAAAQLGGATVFEIPYGPIAAATVIGTLPLVLLV LFFQRKIIQGLTAGAVKG" gene 1070..2149 /locus_tag="DP116_19130" CDS 1070..2149 /locus_tag="DP116_19130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952057.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_19130" /translation="MAKLELQNLNKTYTPKVIPVKDVNLTVDDNEFLTLLGPSGCGKS TTLRMIAGLEEPTRGRIFLGDEDITFKRPGDRNMAMVFQSYALYPHMSVYENLASGLK LKKTPRTEIEQRVTEVSKLLGLEELLQRKPGQLSGGQRQRVAVGRALVRRAQVYLLDE PLSNLDALLRERVRADIKQIFAAQKAPVVYVTHDQTEAMTLSTKVAVLNDGLIQQLDP PERIYNQPANLFVAGFVGSPQMNLLTLPCKERSAILGDANIFLPDIPTIPQEIILGIR PEHVRIAQADDTQIIQGQVYLVENLGMHNLVSVRVATSETEPLTIRALLPPDQKWNNE EIRLALPPQNIHWFDINSGDSLWLH" gene 2174..3922 /locus_tag="DP116_19135" CDS 2174..3922 /locus_tag="DP116_19135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408530.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="asparagine synthase" /protein_id="PRJNA477356:DP116_19135" /translation="MLFEVFKNHKSTKIPFIKTNSTWSIAWGTVDASYEGIAWRDEKV AVILPPTASEVLTEKLAISCGEQFVVVGDVWLTNQAQLLQKLGIEPNSFALSPLQLVA NLWERWGFECLNQLVGMFAFVVWDREKQVLQLVRDRVGARTLYYTTTGSVRWIAPKLR TLAPHRSSDLDLVALRDYLCCAFVPGERTLWQQVRELRPGTVLQFNDHKVQAYWQLQE KITAIDKPLAWHGARLRELLNQVVQEYLPPENEPVGVFLSGGLDSSSITALAAKFHNS PVHTFSIHFGYESPNELEFSSLVASHCQTQHHILEITFRDMWERLPETMAYLDDPIGD PLTVPNLMLGRLARESVQVVLNGEGGDPCFGGPKNQPMLINSLYGSVTNQDSLQAYLI SFQKCAADLPQLLKPEVWTAVQTTPWVFEEDFYSQASYLNCLMAMNIKFKGADQILTK VNNLTQAAHIHGRSPLFDQRIVDFSMEIPPDYKLSGVEEKAVLKGAIVDILPDTIINR PKSGMMVPVQLGFRKYWQREARNLLLSRNAAIAPYLNQSLIRDWLNFQGDTWSRYGVK LWLLVSLEIWLRVNQK" gene complement(3927..5258) /locus_tag="DP116_19140" CDS complement(3927..5258) /locus_tag="DP116_19140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408524.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19140" /translation="MKCIQCGTDNKLKDRTSNQGRCIRCQHPFVFEPTNKDDEKITDS MFAKAIADISTNNTLFFTPKQFLYFLDNRVKRKRISGWGWLGLYLFFNVWATGFIGGF SSIFLAPILASFKLPASTTFIIANLGTQIWYIYKIYQDTKSSLINQPRRKENAQVLRL IGLIILIGGIYTSLFIFHSFILFVIYVLLGMLSIYLGFKQLNQSEIPQESLIRQEQVQ SWLNRWRQINGSITKILPPPREEIAPAAVNPDVTAYSFDRLVVCDSAAIAQMLIANNF HFEKNCAILSISGYPQNIFDTTMLMLRRNPDLIVYALHDCSPRGVSLVNHLRTSSTWF LNSNVTMIDIGLLPRQIIAASRGMFIQSQSESAEAAKQLAPEIRSALSTDELEWLESG NFVELESFSPQKLIQILNHGIVNSQTLDSDDSSLILVDDTGNSMYVAESFG" gene complement(5379..6794) /locus_tag="DP116_19145" CDS complement(5379..6794) /locus_tag="DP116_19145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017717891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="decarboxylase" /protein_id="PRJNA477356:DP116_19145" /translation="MASVEKLDVELSDATSLQELNSSSAHLPFSQKLAQELLNTYGSP LYVYQGDILRQTIQHITQAFSYPRTQFRFASVTNGNISVLQIFRDQGWGLHANTPGDI YLGLQAGFAPEQIVYSGSNLNRAEMEQVLNWGVKTLNLDSVSQLQLCCEVYHSFCRER HIAQRGEPAHASGSVASESSTTPRLGLRLNLPEITGDSRIGVRLEEFPDAICALRSSE AIALTHQAGLKISGLHFYRGTGTNATEAFTNVIDKVITTAQLLPDWEYLDFGGGFGYP YHHDGAAFNWELFGTELSDRMSRLGREIELVIEPGRAAIAGCATLLAKVVSVKWQSEK QIVGVDTTVANLCVPSVHGGYREIVTWKEVAQMGRVETLHATSVQSKIENSKSKIYFT DVCGNTTYSRDFLGRNCQLPALEIGDIVGILDVGAYGYAMSSHFLHRPKPAEVLLEND TYRLIRRREDYSVLLANQVFY" gene complement(6782..8677) /locus_tag="DP116_19150" CDS complement(6782..8677) /locus_tag="DP116_19150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006103894.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="asparagine synthase" /protein_id="PRJNA477356:DP116_19150" /translation="MGSIPQGKETTQPHQFIGYWGYAQRRELEARLSLVKTPHTKYHK FSHVDSRVEDNSYPIWNVACIGFQQESIPPIHFQTDHTERIAAISASGILTDSHVIGS FLPDAWVNLQGSDGQSPAGGDAPKGSRPKGERLILGREPFGRVPLYWTQQGEVIWFAS QLQLLLKIVEKPEVSIPGLYGYSCFSYVPNPLTPTTNVFAVPAGTELVWQSQPNSKTL SAPIYKRLWEWSEASEQLKDETTAVKQLQILLQQVIERQISDLKDEPVGVFLSGGLDS SVVAALLVQAGVKVIGYTLDFGDAGIPESPYAEIVAQHLKIPLVKVDASPRQIQKAII PTVQALDLPFGDGVSVPLFLLAQRASQETKVIFNGEGGDQLFAGWVNKPLIAAGVYQT ENPSGQETFIQQYLRTFHRLWGYEAQIYQPHIYEQIQNLHPEEWIAEALDPAYCKAIL HRLRRAALMLKGAQNIHPRATALGFAHGLFVRSPLCDLPLAEWTFQVSGELCLQGACE KYILKRAVENWLPPEIVWRQKRGMGVPLTSWCLNEFWRQIGKWLNPGRLRVENRFSPH LAAQIAASQLGATLQDRRIGEILWLLIMWELWRVHVFGEKPGKQSFDHPFWLPQQLWR FQKKWQA" gene complement(8679..9302) /locus_tag="DP116_19155" CDS complement(8679..9302) /locus_tag="DP116_19155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408527.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3859 domain-containing protein" /protein_id="PRJNA477356:DP116_19155" /translation="MTQRLTTEQLTQIIAEVERLQARREAEIEPEQVNEILQELGLSP DLLDEALVQVRRQQALEAQQKRDRIIAVGIIAALVVVIASTFFFIQQHNSAIARVVAQ TNRITLTQDNGDNLKTISRENSPEIFYRVTLKDAPLDQRLDLLCDWIDPSGQIVKQNR YQTREIDKPIWDTYCRNTIGSASASGKWKVQMSVEGRPLSQAEFEVR" gene complement(9766..10218) /locus_tag="DP116_19160" CDS complement(9766..10218) /locus_tag="DP116_19160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Hsp20/alpha crystallin family protein" /protein_id="PRJNA477356:DP116_19160" /translation="MTLVRWNNWQQMNSLHRQMNRLFDDMLVPSTFVERNFPRVPAAE LQETEDAIHLKLELPGIEAKDLDVQVTQKAVSIKGERKSETKTEEKGRTVTEFHYGKF QRVIPLPSQIQNTNVTAEYKDGILNLTLPKSQEEKNKVVKVNLDQSAA" gene complement(10540..11460) /locus_tag="DP116_19165" CDS complement(10540..11460) /locus_tag="DP116_19165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654508.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L11 methyltransferase" /protein_id="PRJNA477356:DP116_19165" /translation="MANTWWELQILCESDLEDSIFWRLENSGWRGTASQRKGNNFCVK SYLPQFQALPQDLDDLSHLLRQDALSMGLSAPVVQWEVIDEEDWASSWKQHWQPQEIG DRLLINPAWLPLPENSDRLILLLDPGVAFGTGAHATTQLCLESLEMRLSNEPQSFVGK EKESDGVVIADIGCGSGILSIVSIMLGAKKAYAVDVDPLAVKSTLENCKLNGVSPEQL VVAEGSLEVLTKLLEQPVDGIVCNILAHVIISLVPDLSAIAKHSTWGIFSGILFEQSK AVIDTLEKHGWIVATVWRRNEWCCINVRRS" gene complement(11608..13188) /locus_tag="DP116_19170" CDS complement(11608..13188) /locus_tag="DP116_19170" /EC_number="1.1.1.95" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878709.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoglycerate dehydrogenase" /protein_id="PRJNA477356:DP116_19170" /translation="MSKVLVSDPIDQAGIDILSQVAAVDVKTGLKPEELVEIIGEYDA LMIRSGTRVTQEIIEAGTQLKIIGRAGVGVDNVDVPAATRKGIVVVNSPEGNTIAAAE HALAMMLSLSRYIPDANASVKRGEWDRKSFIGAEVYKKTLGIVGLGKIGSHVAAVARA MGMKLLAFDPFISTERAEQIGCQLVDLEVLIQQADYITLHIPKTPETTHLINTERLAK MKPNARIINCARGGIIDEEALAVALREGKIAGAALDVYEAEPLGESSLKSLGQQAILT PHLGASTTEAQVNVAIDVAEQIRDVLLGLPARSAVNIPGLSPNVLEELKPYMQLAETL GKLVGQLAGGRVELLNVRLQGELATNKSQPLVVAALKGLLYQALRERVNYVNASIEAK ERGIRVIETRDASIKDYAGSLHLEATGSLGTHSVTGALLGGGEIRLTNLDDFPINVPP NQHMLFTLHRDMPGIIGKLGSLLGSFNVNIASMQVGRKIIRGDAVMVLSLDDPLPDGI LPEIIKVSGIRDAYTVTL" gene complement(13564..13635) /locus_tag="DP116_19175" /pseudo CDS complement(13564..13635) /locus_tag="DP116_19175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017721746.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" gene complement(13646..14074) /locus_tag="DP116_19180" CDS complement(13646..14074) /locus_tag="DP116_19180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018009980.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" /protein_id="PRJNA477356:DP116_19180" /translation="MIILDTNVLSELIKPQGSVVVRNWASRQPVTGLFTTTITQAEIL YGITILPEGKRKYELYQAATLMFAEDFIGRVLPFDESAAIAFANISAQRRRNGTPISQ ADAQIAAICYSRNAAIATRNVADFAGCGIFIINPWEEKSP" gene complement(14074..14319) /locus_tag="DP116_19185" CDS complement(14074..14319) /locus_tag="DP116_19185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744360.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="plasmid stability protein" /protein_id="PRJNA477356:DP116_19185" /translation="MTNITIFNIDDNIKNLLQQQASKNGRSLEEEVKEILRFALIENQ KPPVNLVNMIEKRFAHLGDFELGEVIREPMRPAPTFE" gene complement(14422..>14728) /locus_tag="DP116_19190" CDS complement(14422..>14728) /locus_tag="DP116_19190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017715215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_19190" /translation="TLVSGSTDKTIKIWNLQTLELKFTLTEHTDFLNYLAISPDSKTL VSVSYDKTIKIWNLQTGELKTTLTGHNDRIICVAISPDGKTLVSGSFDKTIKIWRMP" BASE COUNT 4199 a 3199 c 3068 g 4262 t ORIGIN 1 ctacggaggg aaaccctcct gcaggactgg ctccgcaacg cactggctct ccttaataag 61 gagaggggtg ccgtaggcgg ggtgaggtaa aaacgtgaat gataaaatca acctgattta 121 cataacatct caaattaaaa atctctatga ctgttcatca ggaaaacttg acgacaaatc 181 ccaaaccaaa aacaatagtc aaaaaatctt ggaaaaatat cttgctttgg atagtagtta 241 ccttagtagt ggttttctgc ttagcaccag caatgtggca attactaact tcattcaaag 301 tcaatcagga tattgctaaa attcctaccg tttattttcc ctctcgaatc actctcaatc 361 actacattga aatattcgcc cgccgtccgt tttggcgcta catattaaat agtgcttttg 421 tctcgattct ctctacggtt ttatctttag ctttaggtgc gcctgctgct tacgctttag 481 cacggttgcg tccttggggt ggcagagtta tccttgcagg tatccttatt gtaactttgt 541 tccctggaat tttattgttc ttgggacttt tggaaattat ccaagggtta catctaggca 601 acaactattt agccctgatt attccctaca ctgctatcaa tttaccatta acaattttag 661 tgttacgtag cttttttgag cagttaccaa aagacttgga agattctgca agagtggatg 721 gctacaacac tttgcaaatg ttaatacaaa tattactacc aatgacactt cctgctttgg 781 taacgacggg aattctcaca tttatttttg cctggaatga gtttatcttt gctctcacat 841 ttataacccg tgaagagatg aaaacaattc ctgtggctgc agcacaatta ggtggtgcaa 901 cagtttttga aattccttat ggtccgattg ctgcagcaac cgtgattgga acattgccct 961 tagttttact cgttttgttc ttccagcgca agattattca aggtttgact gctggtgctg 1021 ttaagggata atcaacagat aatcaaaaga taatcaacaa 
ataaaaacta tggcgaaact 1081 ggaactacaa aacttaaata agacttatac tcccaaagtc attccagtta aagatgtgaa 1141 tttaactgta gatgacaatg aatttctcac tttacttggt ccgagtggct gtggtaagtc 1201 tacgacactg cgaatgattg cagggttaga agaaccgact cgcggtcgga tatttcttgg 1261 ggatgaggat attacgttta agcgaccagg cgatcgcaac atggcgatgg tgtttcaaag 1321 ctatgcgctt tatccccata tgtccgtgta cgaaaatcta gcctctggac tgaagctaaa 1381 aaaaactcct cgcactgaaa ttgaacagcg agtcacagaa gtttcaaaac ttctgggatt 1441 agaagaatta ttacagcgta aacctggtca attatcgggc ggacaaagac agcgagtcgc 1501 cgttggtcgg gcgttagtgc gtcgcgccca agtttacttg ctggatgaac cactcagtaa 1561 cctagatgca ctcttacggg aacgagtccg cgcagatatc aaacaaatat ttgccgctca 1621 aaaagctcca gtagtctacg tgactcacga tcaaacggaa gcaatgacgc tttccacaaa 1681 agtagctgtc ctcaatgatg gtcttatcca gcaactcgat ccacctgagc gcatttataa 1741 ccaaccagct aatttatttg tagctggatt tgttggcagt ccgcaaatga atttgctcac 1801 gctaccttgt aaggaacgtt ctgcaatctt aggtgatgca aacatatttt tgccagatat 1861 cccaacaata ccacaagaaa ttatcttagg tattcgtcca gaacacgtcc gcattgcaca 1921 agccgatgat acacagatta tccaaggaca agtatatctt gtggaaaact tgggtatgca 1981 taacttggtc agtgtgcgtg ttgcaacttc tgaaacagaa ccgttgacaa tacgtgcatt 2041 gttaccacca gaccaaaaat ggaataatga agaaattaga ttagctttac cgccccagaa 2101 tatccactgg tttgatatta attcaggtga tagtctttgg ttgcattaga gtcaaggatg 2161 ggtgaaaatt ttcatgctat ttgaggtttt taaaaatcat aaatccacta aaatcccatt 2221 catcaaaaca aactcaacct ggtcaatagc ttggggaaca gttgatgcaa gttatgaagg 2281 tatagcttgg cgagacgaaa aagttgctgt gattctaccg cctacagctt ctgaagtact 2341 gacagagaaa ctagcaatca gttgtggaga acaatttgtt gttgttggtg atgtgtggtt 2401 aactaaccaa gcacagttgc tgcaaaaatt gggaattgaa ccgaatagct ttgcgctaag 2461 tcctctgcaa ctggttgcta atctttggga acgatggggt tttgaatgtc tcaaccaact 2521 tgtggggatg tttgcgtttg tggtttggga tagggaaaaa caggtgttgc agctggtacg 2581 cgatcgcgtt ggtgctcgta ctctctacta cacaaccact ggttcggttc gttggattgc 2641 gcctaaattg agaactttag caccccatcg ttcatccgat ttagatttag tcgctttgcg 2701 agattatctt tgttgcgcct ttgttcctgg ggagcgaaca ctttggcaac 
aggtgcgaga 2761 actgcgccct ggaactgttt tacaattcaa cgaccacaag gttcaagctt attggcagct 2821 tcaagaaaag attacagcaa tagataaacc tttagcatgg catggcgcgc gcctacgaga 2881 actgctaaac caagttgttc aagaatattt accaccagaa aacgaacccg ttggcgtttt 2941 tctttctggt ggtttggact ccagcagtat caccgcttta gcagcaaaat tccacaattc 3001 cccagttcat accttctcga ttcattttgg ttatgaatct cccaatgagt tagagttttc 3061 cagtctcgtt gcttctcatt gtcaaacgca acaccacatc ctagaaatta cctttcggga 3121 tatgtgggaa cgcctaccgg aaacaatggc gtatttagat gatcccatcg gcgatccgct 3181 gactgttccc aacctcatgt tgggacgatt ggcgcgagaa agcgtgcagg tggtgttaaa 3241 tggtgagggt ggcgatcctt gttttggtgg tccaaaaaat cagccaatgc tcattaatag 3301 tttatatggc tccgtcacca atcaagattc attgcaagct tatttaattt ctttccagaa 3361 gtgcgcggct gacttaccac aacttttaaa accagaagtt tggacagcag tacaaacaac 3421 accttgggtt tttgaagaag atttctattc tcaagccagc tatctcaatt gtttaatggc 3481 aatgaacatc aaatttaaag gcgctgacca aattcttact aaagttaata acttaactca 3541 agctgctcat atacacggtc gttctcctct ttttgaccag cggatagtag acttcagcat 3601 ggaaattcct ccagattata agctttctgg agtggaagaa aaagcagttc ttaaaggggc 3661 gattgtagat attttgccag atacaattat taatcgtccc aaaagtggaa tgatggttcc 3721 ggtacagtta ggatttcgta aatattggca acgagaagca agaaatctat tgctaagtcg 3781 caatgcggca attgcccctt atttaaacca gtcgctaata cgcgattggc taaactttca 3841 aggagacact tggagtcgtt atggagtaaa gctttggttg cttgttagtt tagaaatttg 3901 gttgcgagtg aatcaaaagt gaaaaattaa ccaaagcttt cagcaacata catagagttt 3961 cccgtatcat caacgagaat caagctgcta tcatcgctat ctaaagtttg actattaact 4021 atgccgtgat taagaatttg aatcaatttt tgaggactaa atgattccaa ttccacaaag 4081 ttaccagatt ctaaccattc caattcatct gttgataatg ccgagcgaat ttcgggtgct 4141 aattgtttag cagcctcagc cgattctgac tgagattgaa tgaacatacc gcgacttgca 4201 gcaataattt ggcgtggcag tagtcctata tcaatcatcg tgacattact attgagaaac 4261 caagtggaac tagtgcggag atgattaact aaactcacac ctctggggct acaatcatgc 4321 aatgcataaa ctattaaatc aggattacga cgtagcatta gcatggtagt atcaaaaatg 4381 ttttgcggat agccactgat gctgaggatg gcacagtttt tttcaaaatg aaagttattg 
4441 gcgattaaca tttgggcaat tgctgcacta tcacaaacaa ctaacctatc aaaactgtaa 4501 gcagtcacat caggattgac agcagcgggt gcaatttctt cacgcggagg aggtagaatt 4561 ttcgtaattg aaccattgat ttgccgccaa cggtttaacc aactttgaac ctgttcttgc 4621 ctaattaatg attcctgtgg aatttcggat tggtttagct gtttaaatcc taagtatatg 4681 gacagcattc ccaaaagaac ataaatgaca aataaaatga atgaatggaa gataaataga 4741 ctggtataaa tacctccaat aagaatgatc aaacctatca aacgcaaaac ttgagcgttt 4801 tcctttcgtc taggctggtt aattaaactt gacttagtat cctggtagat tttgtaaata 4861 taccatattt gagttcctaa attagcaatt ataaatgtag tcgatgcagg taatttaaat 4921 gatgctaata ttggagctaa gaatatagat gaaaatcccc caataaaacc tgtagcccaa 4981 acattaaaaa atagatacaa accgagccat ccccatccac ttatcctctt ccttttgaca 5041 cgattatcca agaaatagag aaattgctta ggtgtgaaaa atagagtgtt gttagtagaa 5101 atatcagcta tagccttagc aaacattgag tcagttatct tttcatcatc tttgttagtt 5161 ggttcaaaca caaaaggatg ttggcatctg atacaccgac cttgattgct tgtccggtct 5221 ttgagtttat tgtcagtacc acattgaatg catttcatac tattttttag ttatcaggta 5281 agagtgattg attctgtttt gaatgagata gtttcagtca attaaccttt gtataatttt 5341 tgttacatta ttgactacga aaaaagtacg aaatagtctt aataaaaaac ctgatttgcc 5401 agcaaaacgc tgtagtcttc acgcctgcga attaaacggt acgtgtcatt ttctagcagt 5461 acttcggctg gtttaggacg gtgtaaaaag tgtgaggaca tcgcataacc ataagcacct 5521 acgtccaaaa tgccgacaat gtcaccaatt tctaatgctg gtagttgaca gtttcttcct 5581 aaaaaatctc gtgaatatgt ggtgtttcca cacacatctg tgaaataaat cttagatttg 5641 gaattttcga ttttggattg tacagatgtt gcgtgcaacg tctctactct gcccatctgt 5701 gcaacttctt tccaggtaac aatctcccgg tagccgccgt gtactgaagg gacacagaga 5761 ttggcaactg tggtatctac tccaacaatt tgcttttctg actgccattt cacggaaacg 5821 actttagcaa gaagagtagc acatccggca attgcagcgc gtccgggttc aatgactaat 5881 tcaatttctc ttcccaaacg actcattctg tcacttaact ctgtaccaaa caattcccag 5941 ttaaaagctg cgccgtcatg atgatagggg tagccaaaac caccgccaaa atcgagatat 6001 tcccaatctg gtaaaagttg tgctgtggtt ataaccttat caatgacgtt ggtaaaagct 6061 tctgttgcat tggttccagt acctcggtaa aaatgcaaac cactgatttt gagtccagct 6121 
tgatgggtaa gagcgatagc ttcgctgctg cgcaaagcgc agatcgcatc aggaaattcc 6181 tctagacgca caccaatgcg gctatcacca gtgatttcag gtaggttgag acgtaagcca 6241 aggcgcggtg ttgtagaact ctcggatgca acggagccac tcgcgtgcgc gggttccccg 6301 cgttgagcga tgtggcgttc tctacagaag gaatgataga cttcgcagca tagttgcaac 6361 tgggagacac tatccaaatt gagagttttt acaccccaat tcaggacttg ttccatctcg 6421 gcgcggttca aattactacc actgtagaca atttgttctg gcgcaaaacc cgcctgtagt 6481 ccgagataaa tatctccagg tgtattggcg tgtagtcccc aaccttgatc acgaaaaatt 6541 tgcagtacag aaatattccc attagtgacg ctggcaaagc gaaattgggt gcgaggatag 6601 gaaaaagctt gagtgatatg ttggatagtt tggcgtaaga tgtcaccttg gtaaacgtag 6661 agtggtgaac cgtaggtatt cagcaactct tgagccagtt tttgggagaa aggcagatga 6721 gcggatgagg agtttaattc ctgtagagag gttgcatccg agagttctac atcaagtttt 6781 tctacgcttg ccattttttc tgaaatctcc ataactgttg tggtagccaa aatggatgat 6841 caaaagattg tttgcctggt ttttctccaa aaacgtgtac tcgccaaagc tcccacatga 6901 taagcaacca aagaatctca ccaatacgac gatcttgaag agtggctccc aactgactag 6961 cagcaatttg tgctgccaga tgcggagaaa agcggttttc gacacgaagt ctgccaggat 7021 ttaaccactt gccaatctga cgccagaact cgtttaaaca ccaggaggtt aagggaactc 7081 ccataccgcg cttttgtcgc caaacaattt ctggcggtaa ccaattttct acagctcgct 7141 tgaggatata tttctcgcag gctccctgta agcagagttc tccagaaacc tggaatgtcc 7201 actctgccag tggtaagtcg cacaaaggcg atcgcacaaa caacccgtga gcaaaaccca 7261 aagcagtcgc acgcggatgt atattttgtg ctcctttcaa catcagggca gcacggcgga 7321 gccgatggag aattgctttg caatacgcag gatcgagcgc ttctgcaatc cattcttctg 7381 gatgtaaatt ttgtatctgt tcataaatat gtggctggta gatttgagct tcgtaacccc 7441 aaagacggtg aaaagtgcgg aggtattgct gtataaatgt ttcttgtcct gatggatttt 7501 ctgtttggta aacgcctgct gctattaaag gtttattcac ccaaccagca aacaactggt 7561 cgccaccttc gccattaaaa atcaccttag tttcctgact agctctttgt gctaaaagaa 7621 acaaaggaac actcacacca tctccaaaag gcaaatcgag cgcctgtaca gtaggtatga 7681 tagctttttg gatttgacgc gggctagcat cgacttttac caaaggaatt ttaagatgct 7741 gggcgacaat ttcggcatag ggagattctg gaatgcctgc gtcaccaaaa tctagagtat 7801 aaccaatcac 
ctttacccct gcttgtacca gcaacgccgc cacaacagag gaatctaatc 7861 ctccagaaag aaaaactccg acaggctcat cttttaagtc ggaaatttga cgctcaataa 7921 cttgttgaag caaaatttgc agttgtttga cagccgttgt ttcatctttt agttgttcgc 7981 ttgcttcact ccattcccac aaacgtttgt aaatgggtgc tgatagcgtt tttgaattcg 8041 gctgactttg ccagactaac tcagtccctg caggaactgc aaacacgttc gtcgtaggcg 8101 tcagcggatt gggaacataa gaaaaacaac tataaccgta caaacccggt atactgactt 8161 ctggcttttc aactattttt agcagcagtt gtagctggga tgcaaaccaa atgacttctc 8221 cttgttgagt ccagtacaga gggactcgcc cgaaaggttc tcgtcctaaa attaggcgtt 8281 cgccctttgg gcggctcccc ttgggagcat cgcctccggc ggggctttgc ccatcgcttc 8341 cctgtagatt tacccaagca tcaggtaaaa atgagcctat gacgtgcgag tcagtcaaga 8401 ttcctgatgc tgagatagca gctattcgtt ccgtgtgatc agtctgaaaa tgaattggtg 8461 gtattgactc ttgttgaaaa ccaatgcaag caacattcca gataggataa gaattatcct 8521 caactctcga atcgacatga gagaacttgt gatacttcgt atgtggtgtt ttgacaagac 8581 ttagccgcgc ttctaattcg cgtctttgag cataacccca atatccaata aattgatggg 8641 gctgagttgt ctccttaccc tgtggaatac tccccatact accttacttc aaattctgct 8701 tggctaagtg gacgaccttc aacagacatc tgcactttcc atttacctga agcagatgca 8761 gaacctattg tgttgcggca gtatgtatcc cagatgggtt tgtcaatctc gcgggtttga 8821 tagcgatttt gcttgacaat ttgaccacta ggatcaatcc aatcgcacaa caaatccagt 8881 ctttgatcca aaggcgcatc ctttaaagtc acacggtaaa agatttctgg actattttca 8941 cgtgagatcg ttttcaagtt atcgccatta tcttgtgtca gggtgatgcg gtttgtttgg 9001 gcgacaacgc gggcaatggc agaattatgt tgttggataa aaaagaacgt actagcaata 9061 acaaccacca aagcagcaat tatccccaca gcaattatgc gatcgcgttt ttgctgtgct 9121 tccaaagctt gctgacgccg cacctgaaca agcgcttcat ctagcaaatc tggcgataaa 9181 cctaattcct gtaaaatttc attgacttgt tcgggttcaa tttctgcttc ccgacgtgct 9241 tgcagccttt ctacctcagc gattatctgt gttagttgct ctgtagtcaa tcgttgcgtc 9301 atagttattg caactcctgt gaaaaaagca tttagttttg tacaaagata gctctttatc 9361 cctttggaaa ggttgtgtat tggacgagaa actccttcac agtggaactg gtagggtttt 9421 tgtagacgta gaattaagtt aagctacagt tgctagctat caatttccgc ttacttggaa 9481 agaaaaatta tggtgttgca 
gaatataacc atgatccaat ttccagaaag taaatcaacc 9541 cacgtcgtca agaacgcacc aatggcattg tccgacagca gacacggcac tggtatcgac 9601 ggacacgatt agcgatttta gataataatt taggttacca gttgttagtc aataaaaaaa 9661 cccagtcata tgaaactgac tgggcgtttt tgttaggtta acctactgtg cattcaggct 9721 atcttttcct aacaattctt aaaattaata acttcttgca atgaattaag cagcagattg 9781 atccagatta actttgacga ctttgttctt ctcttcttga gattttggta gtgtcaagtt 9841 caaaataccg tctttgtatt ctgctgtaac attggtattt tgaatttgcg aaggtaaagg 9901 aatcacgcgc tggaatttac cgtagtggaa ttcagtcaca gtacgaccct tttcttcagt 9961 cttagtttct gacttgcgct ctcctttgat ggaaacagcc ttttgtgtga cttgcacatc 10021 caaatcttta gcttcaattc ctggtagttc tagcttgaga tggatagcat cttctgtttc 10081 ttgcaattca gcagctggaa ctcttggaaa gtttctttca acaaatgttg agggtacgag 10141 catatcatcg aataagcggt tcatttgacg gtgtaaggag ttcatttgtt gccaattatt 10201 ccaacgaact aatgtcatct ttttctctct ccaatcaata ggattttgtt taatgaattc 10261 agcttttcac ttcctttgat tcttatagta tttaaaaact gaggcagtga ttatacggtt 10321 tttatcaccg aaaaaattca tggatcccga acttccatga aaggagcata aaaattcaga 10381 agaatagaca aatcatcaaa gatactctca aaaaagacgg tgaggaatcc ccgatttttc 10441 tggaacgggc tacgatactc ccttaaagtc gtcaactgtt actacaggga aaataagtaa 10501 aaaaaaccca acaccaaaac aatcaaattt gaggactgtt caagaacgcc gcacattaat 10561 acagcaccat tcgttgcgcc gccacacggt tgcaacaatc cagccgtgtt tctctagagt 10621 atcaatgacg gctttagatt gctcaaataa tatgccacta aagatacccc aagtgctgtg 10681 tttggcgatc gcactcaaat ccggaaccaa acttataatg acatgagcca aaatattgca 10741 gacaatccca tctacaggtt gttccagcag ttttgtcaaa acctctaaac taccttccgc 10801 cacaaccaac tgttctggac ttacaccgtt gagtttacag ttttctaggg ttgacttgac 10861 tgccaaggga tcaacatcta ctgcataggc tttcttcgcg cccagcataa ttgacactat 10921 ggaaaggata ccagaaccac atccaatatc cgcaatcacc acaccatcac tttctttttc 10981 tttacccaca aaagactgag gttcgttgct caaacgcatt tccagggatt ctaagcacaa 11041 ttgagttgtg gcatgagcac ctgtgccaaa tgctacgcca gggtctaaaa gaaggatgag 11101 gcggtcagaa ttttctggga gtggtagcca tgcggggttg atgaggaggc gatcgcctat 11161 ttcctggggt 
tgccaatgtt gtttccagct actcgcccaa tcctcctcat caatcacctc 11221 ccactgcaca acaggtgcag aaagtcccat agatagagca tcttgacgca acaggtgcga 11281 taaatcatcc aaatcttgcg gtagtgcttg aaattgcggt aagtaactct tgacgcaaaa 11341 attatttcct tttctttgac tggctgttcc gcgccagcca gaattttcca gccgccaaaa 11401 gatggaatct tctaaatctg actcacaaag aatttgtagt tcccaccaag tgtttgccat 11461 aaaaaaactc agaataacgc cttcagcgac gctgcgctga cacaaaatgt gcccattcgg 11521 cgcagccgtg ctgtaggcat aggacttaga gaagtcagga gacaggagaa atgatttctt 11581 ccgacttctg acttcgactt ctaattccta tagtgttacc gtatacgcat cacgaattcc 11641 agacacctta ataatctcag gtaaaatccc atcgggtaag ggatcatcaa gactgagtac 11701 catcaccgca tcaccacgaa tgattttgcg accgacttgc atactggcaa tattgacatt 11761 aaagctgcca agtagggaac cgagtttgcc aataatccct ggcatatcac ggtgtaaggt 11821 aaagagcata tgttgattag ggggaacgtt gattgggaaa tcgtccaaat tggtaaggcg 11881 aatttctcca ccgcccaaca aagcacctgt aacagaatga gtccccaaag aacctgtcgc 11941 ttctagatgc aaagaaccag catagtcttt aatcgaagca tcccgcgttt caatcacgcg 12001 aattccccgc tcttttgcct caatgctggc attaacgtag ttaactcgtt cccgcaaagc 12061 ttggtaaagt agtcctttca aggctgcaac caccaagggc tgactcttgt ttgttgccaa 12121 ttcgccttgt agccggacgt tgagtaactc cactcgtccg ccagccagct gtcctaccaa 12181 cttacccaaa gtttctgcta gctgcatgta gggtttgagt tcttccagta cgttaggact 12241 caatccggga atattcaccg ctgaacgcgc gggaagtccc aataacacat cccgaatttg 12301 ttctgcaacg tcaattgcca cattaacttg tgcttctgtt gtcgaagcgc ccaaatgtgg 12361 ggtgaggata gcttgttgcc cgagtgactt taatgaagac tcgcccaatg gttctgcctc 12421 atacacatcc agtgctgcac cagcaatttt accttcccgg agagcaactg ctaatgcttc 12481 ttcgtcaata atcccaccgc gagcgcaatt aataatgcgg gcgttgggtt tcatttttgc 12541 caatctttcg gtgttaatta agtgggtggt ttctggagtt ttgggaatat gcagcgtgat 12601 gtaatctgct tgctgtatca atacttccaa atccactaat tgacagccga tttgttctgc 12661 cctttcagtg gaaatgaagg gatcaaaagc caacaatttc attcccattg ctctagcaac 12721 agcagcaaca tgggagccaa ttttacctaa gccgacaata cccagagttt ttttgtaaac 12781 ttcggcacca atgaagcttt tacgatccca ctcaccgcgt ttgactgaag cattggcatc 
12841 agggatgtag cgagacaaag atagcatcat cgccagcgcg tgttctgcag cagcaattgt 12901 gttcccttca ggagaattaa cgaccacaat tcctttgcgg gtagcagcgg gaacatccac 12961 attatcgaca cctacaccag cgcgaccgat aatttttaat tgcgtcccag cttcaataat 13021 ttcttgagta acgcgggtac cagagcgaat cattagcgcg tcgtactcac caatgatttc 13081 taccagttcc tctggtttta atcctgtttt cacatctaca gcagcaactt gggaaagaat 13141 gtcaatccca gcctggtcaa ttggatcgga gacaagaacc ttagacatga ttgcttattt 13201 taaagctaga ggtattccgg aacaaggatt ttcagtttag actgtaacgt ttgcaattta 13261 gcagaaatga taacttctta tatgccagta agtgacactc acgtttgttg atgatggtac 13321 tagagggtta gttgttcaca attatgcagc gcttcctccc atatctaatg ccacctactc 13381 ggtaaatata aaccattcac agtgtcaaaa cctatatcca aaacctatat aaattatgaa 13441 attttcttag attcgagcga ctgtgaactt tttgcgatcg ccgtaagtcc tgcggacagg 13501 ctgcgccaac gcggcgcgtg cactttgtgc tcaggcgggg cgaaagctca tcgcgttcct 13561 tgcgcgtgtc cgtcctggtg atattggttc gatcgctcca cgcttgcctt ggctaaaatc 13621 gtattcatct tccattggtt attcctcatg gagatttctc ttcccaagga ttaataataa 13681 aaattccgca gcctgcaaaa tcagcaacat tacgagttgc gatcgctgcg ttacgagagt 13741 aacaaatagc agcaatttgg gcatcagctt gagagatggg agtcccattt cttcgtcttt 13801 gagctgaaat attagcaaaa gctatagcgg cagattcatc aaatggaaga acacgcccta 13861 taaagtcttc tgcaaacatt aaagtagcgg cttggtagag ttcatactta cgctttcctt 13921 ctggtaaaat ggtgatgcca tatagtattt cagcttgtgt aattgttgta gtaaataaac 13981 ctgtaacagg ttgtcgagat gcccaattgc gaactactac agaaccttgg ggtttgatta 14041 attctgacaa tacattagta tcaagaataa tcatcattca aaagtgggtg caggacgcat 14101 aggttctctg ataacttctc ctagttcaaa atctcccaag tgggcaaagc gtttttctat 14161 catatttaca agatttacag gcggtttttg attttctatt aaagcaaaac ggagaatctc 14221 ttttacttct tcttctaggg aacgaccatt tttggaagct tgttgttgca agaggttttt 14281 gatattgtcg tcaatgttaa aaatggtaat atttgtcatg tgagtgaatt taggttgtga 14341 tgtattgaga tatgtaataa ttatattttt cttaattagg caacgaagtg cagcaaccac 14401 aaatctaaaa tcctcaaatc cctacggcat tcgccaaatc ttgatagtct tgtcgaaact 14461 cccactcacc aaagttttgc catccgggct gatggcgacg 
caaataatcc tgtcgttatg 14521 cccagtcagg gtagttttca attcgcctgt ttgcagattc caaatcttga tagttttgtc 14581 gtaacttaca ctcaccagag tcttgctatc cgggctgatg gcgaggtaat taaggaagtc 14641 ggtatgctca gtcagggtaa atttcaattc aagagtttgc agattccaaa tcttgatagt 14701 cttatccgta ctcccactca ccagagtc // LOCUS NODE_2290_length_14718_cov_5.61590414718 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14718) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14718) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14718 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..655) /locus_tag="DP116_19195" CDS complement(<1..655) /locus_tag="DP116_19195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858784.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Cys/Met metabolism pyridoxal-phosphate-dependent enzyme" /protein_id="PRJNA477356:DP116_19195" /translation="MVQSLNYSVPAFDNKQQYTIIEAEVSAAVLEVLASGRYIGGPLV AGFEQQFAAYIGVTECVACNSGTDALFLALRALNIGAGDEVITTPFTFIATAEVISAV GAKPVFVDIDETTFNLDVNQVAAAITRHTKAVIPVHLFGQPVDMAALMNVAKAHNLVV IEDCAQSTGANWAGQKVGSIGHIGCFSFYPTKNLGACGDGGAITTNDPEIAARLRFLK " gene 982..1755 /locus_tag="DP116_19200" CDS 982..1755 /locus_tag="DP116_19200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858785.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="iron export ABC transporter permease subunit FetB" /protein_id="PRJNA477356:DP116_19200" /translation="MELIKLDFVDLAFAVGLMGAAIGLSAWERIGLEFNLALATGRTF LQVAILGYVLEFIFALDNPWAVLAILAVMLTISAIVARNRITQKIPQMLPLVWGSILL STTITLVYTNILIIQPDRWFEPQYVISLGGIVLGNAMNAAALAGERLVSIMNASQLEI ETHLSLGATPGQAIAQYRKDAVRAGLIPTLNQMMVIAMVTLPGIITGQLLSGINAREA ASYQILIMFMIAFANLLTVLLVTRGISRQFFNSTAQLVR" gene 1969..2226 /locus_tag="DP116_19205" CDS 1969..2226 /locus_tag="DP116_19205" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19205" /translation="MQPLIVEKKIDSQPSSHNHLYVAKFSVILTVVLALLLQVITQEI SSGITEWLAKELAVSSLLVQQYSIATLRKILLFGVDLFLAI" gene complement(2324..2581) /locus_tag="DP116_19210" CDS complement(2324..2581) /locus_tag="DP116_19210" /inference="COORDINATES: protein motif:HMM:PF07366.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19210" /translation="MHTGFPDLQFTITDAIAEGDKVAISWTAQGTHKGEIKVHHLPAT GKSVSWTGIIIYRIVEGKITEERGQEHALGLFQQLGLIPKL" gene 3391..4626 /locus_tag="DP116_19215" CDS 3391..4626 /locus_tag="DP116_19215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874952.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="succinate--CoA ligase subunit beta" /protein_id="PRJNA477356:DP116_19215" /translation="MDLLEYQVKEWFANIGIPVLPSQRIDHPTDLKRLKIDYPIVLKS QVNAAERAKVGGVRFVETTIDAIAAARSIFNLPILGQLPEVLLAETKYETEQEFYLAV VLDTAVCRPILLGCTEAMDIDWDSADEKMHYVVVEQEFSPFYARQLALKMGLQGALMQ SVSNVVEKMYQLFVEKDLDLVEINPLGVSSSGQVMALNGNVSINERAIGRHPNMADIA KAMVNPYTSSNTIRNLGDWDVVEMHGKIAILGNGAGLVMATLDLVVNSGGKPGVCLNL RHASVTETSPTTFKSRLEQALKNLATDKSIQVILINFLGSISQGSEVSEVIDDFVESS MTEAQLFMAKSNGGKNRRENSFPRLVVRLAGSEFIRARIDIAALKTPTDTSITVVENL DEGVAEAIRLAKSTAYRRF" gene 4641..5522 /locus_tag="DP116_19220" CDS 4641..5522 /locus_tag="DP116_19220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195695.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CoA-binding protein" /protein_id="PRJNA477356:DP116_19220" /translation="MNLTPYSKVLIQGFSEYITATHIAQMKANGTNLVAGVNPGCGGQ QMYTLPVFDLVEEVVEKCGVIDTTIICVHPYQVLDAALEALACDIRQIIIISAGVPPL DMVQLLRKAEAKETLIVGPNSPGIIIPGKILLGTQPSEFYTAGSVGILSRSTTLTYEI ARELTDAGLGQSMSVSIGSDAIVGSSFLQWLQILDEDEATKAIVLVGQLGGDREEAAA KYIAEAIDKPVVAYIAGTQAPSARHWRQTGTLAAVIGRDPDFGTAKNKLAAFSFARVP VAERPSQIPELVKKAMR" gene complement(5528..7186) /locus_tag="DP116_19225" CDS complement(5528..7186) /locus_tag="DP116_19225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015116051.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19225" /translation="MKLNWNPQLLRELKGRLKPRNILLTIALSLVGQFILVLISFQTL LARVTLDIFDKYCAPTFGEKLDKRYQLKCLPDEYWQSWWRDIFVGLSLIGIVILLVAG TYMLINDLATEERRGTFNLTRLTPQSEASIFIGKLLGVPILIYLFTVLSLPLHLWSGL AAKIPLSLILSFYAVVAVASVFFYSAAMLFGLVGSWLGSFQAWLGSGAVLIFLMLTKG ISTTSYFNSTTWLKLLNPFCLIPNLSTTSFFEGFVTGLERFKWFHFVLSDNVLTIVGF VLLNYGILTWFIWQSLRRCFRDQSATMLSKQQSYLLTVCFTVLTIGCANYAAPANRSP YALPLTENLFALSLLNFLLFLYIIAAITPHRQALHDWARYRHKRVSSSKKLGHSSLVQ DLIWGEKSPAVVAIAINLIIVIILLAFFILLSPAKVDENKTQAFFGLAFNGSLMMILI YASVAQLLLFMRTQQRTSWTIGTLGAAILLPSIISRLVGIYAFYPFAVLIFPYLPILS NTVTAVWVVVVAQLLILVLLNLQLTRQLRRAGESTSKALFTPGK" gene complement(7282..9213) /locus_tag="DP116_19230" CDS complement(7282..9213) /locus_tag="DP116_19230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137008.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_19230" /translation="MIINLIDKLGDWNPQLFREIKGRFKGFNVASAVVISLVGQVVLF LCQLANLPGDKYSLIGEYCRLRPAYEYKTNQLSEQSNNLQRQLDSYLNTQPRLPDKIQ ELKAEIAQIQTKITESNDYLSKNFCPTDQLDMQMWWREHWGYIFFSLSIIFIFTLLVA GTYLLINNLATEERRGTLNFLRLSPQSETSILIGKILGVPILIYLLVFTAVPLHFIAG HGANIATSHILSFWVVLAGSCIFFYSAALLFGLCCRWLNGFQAWFGSGAVLLFLMLTM QLASSESYSLNSSFAWFRLLSPFDMTRHLFPNIFNNGYNWGFMQKFQFFYLPVGTNIG SVMALHLLNYGLWTYWIWQALKRCFRNPSSTIFSKEQSYLFVACFQFVFWGFALQYRE GYCHSACEYESPSNVSCCIYDVNSQIQQSLFWLVFFNLVLLFGLIAILSPHRQTIQDW ARYRHQNLSSHQNVWRKSLLLDLILGEKSPSFVAIAINLVIVTVPLLVWILLAPVLNV HNTSNIDWQINHVGRLKTVLGVAMFITLMIIYATIAQRMLLMKTTKRSFWAVGTIGAV IFLPPVILGMLSVDSSKYPTVWLFSTFPWASLEYASTPTIFLAWLGELSVLVLLNLHL IKQVRLAGESATKALLAGR" gene complement(9337..10287) /locus_tag="DP116_19235" CDS complement(9337..10287) /locus_tag="DP116_19235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310226.1" /note="Derived by automated computational analysis using gene prediction method: Protein 
Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_19235" /translation="MTKELAIRTCELTKQFDRHVAVNDIDLEIESGEVYGLIGPNGAG KTTLIRMLAAAEEPTTGEIYINGDRLLRDKSNPTLKRRLGYLPDDYPLYEELTVWDYL DYFARLYQLREPRRTQRLYEVIELIQLGNKRNSLISTLSRGMKQRLCLARTIIHEPIV LLLDEPVSGLDPIARMQFREIIKVLQEAGMTILISSHVLSDLAELCTSVGIMELGFLV ESSSLQQLYQRLARQQIVISTLGKLDELLGELKNNPYVQEWEIMPTKNSVRVNFSGKQ EDCANLLRSLVTASIPLTDFHCTQEDLETIFLNLGHKQAS" gene 10629..11789 /locus_tag="DP116_19240" CDS 10629..11789 /locus_tag="DP116_19240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_19240" /translation="MLKAYKYRIYPTSEQSILLAKSMGCARWFYNYALNLTSETYKTT GKGLSRNEIINLLPSLKKEHEWLTEPPSQCLQQVALDLSSAFLNFFEKRGLYPNFKKK GQKQSIRFPQEIKLDGSYLTLPKLGKVYCKVSRKPDGKLKSVTVSLTSSGEYYAACLY DDGKDIPVSSSEGKAVGIDMGITHYAITSDGTKHGNPKYYRKYETKLAQKQKLLSRKH KGSNNRNKARIKVAIVHTKITRCREDFLHKLSRKLVDENQVIVVENLAVKNMVRNHKL AKSISDAGWGQFCTMLKYKAEWKRKTYIEVDRFFPSSKTCSNCLHQVDHLSLDIRSWQ CPRCQTLHDRDVNAAINIRDEGLRILAGGHLATASGQRVRPSKGTAFRGYVG" gene 12226..13314 /gene="leuB" /locus_tag="DP116_19245" CDS 12226..13314 /gene="leuB" /locus_tag="DP116_19245" /EC_number="1.1.1.85" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950858.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-isopropylmalate dehydrogenase" /protein_id="PRJNA477356:DP116_19245" /translation="MTQNYRITLLPGDGIGSEIIAVAVNVLKVVGKKYNIQFEFTEAL IGGAAIDATGEPLPADSLDICRNSDAVLLAAIGGYKWDSLPSHLRPEAGLLGLRAGLG LFANLRPAKILPQLIDASSLKREVVEGVDILVVRELTGGIYFGKPKGIFETETGEKRG VNTMAYTESEIERIGRVGFEAARKRGGKLCSVDKANVLEVSQLWRDRITKLAQEYPDV ELSHMYVDNAAMQLLRAPKQFDTIVTGNLFGDILSDAAAMLTGSIGMLPSASLGASGP GVFEPVHGSAPDIAGQDKANPLAQVLSAAMMLRYGLNEPTPADEIEQGVLEVLQKGDR TGDIMSPGMNLLGCRAMGKALIQVLEAK" gene 13406..13666 /locus_tag="DP116_19250" CDS 13406..13666 /locus_tag="DP116_19250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874122.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19250" /translation="MYALKKGQENYTNQKSQSALIDPTVIRAAGQIYHTYCEVHPEMT GQASGVAINRSNHRGKVIFTHQPILLPEECFVPLNQIESYMY" gene 13797..14642 /locus_tag="DP116_19255" CDS 13797..14642 /locus_tag="DP116_19255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456988.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="prepilin peptidase" /protein_id="PRJNA477356:DP116_19255" /translation="MFILIIVQTSFIVFALGTSIGSFINVIVYRLPIGLSIIFPPSHC PHCLNQLKPYDNVPLLGWLWLRGRCRYCKRKISIRYPVVEGVTGFLFLFVFWMLKFSP YTVGYWAFCSWLLALSLIDLDTMTLPNSLTKSGLVVGIIFHMVCGFLPEASWVGLVHH LKMAIGGGVLGLWLFDAIAIIGSIVFGKTAMGTGDTKLAAMMGVWLGWKYLLLASFLA CLVGVVVSGGKMILSQHSSRIPLSQKWGEKIPFGPFLACGAVISLFGGEAILSFYLQL FFPIS" BASE COUNT 4495 a 2936 c 3102 g 4185 t ORIGIN 1 cttttaagaa tcgtaaccgg gctgcaattt cgggatcgtt agttgttatt gctccgccgt 61 caccgcacgc accgagattt ttggtagggt aaaaactaaa gcaaccgatg tgtccaatac 121 ttcctacttt ttgccctgcc cagtttgctc ctgtagactg agcacaatct tctatcacga 181 ctaaattgtg tgcttttgct acattcatca atgcagccat atccacaggt tgaccaaaca 241 agtgaactgg gataactgct ttggtatggc gcgttattgc agctgcgact tggtttacat 301 ctagattaaa cgtagtctca tcgatatcaa caaacactgg cttggcaccc acagcactga 361 taacctcagc agtggcgata aaagtaaatg gtgtcgtaat cacttcatcg cctgcaccta 421 tgtttagagc tctaagtgct aggaaaagag catcagtacc agagttacat gctacacatt 481 cagtgacacc aatatatgcg gcgaactgtt gctcaaagcc tgctactaaa ggaccgccaa 541 tataacgacc agaagcaaga acctctaaga cggctgcgct gacttctgct tctataatgg 601 tgtattgttg cttattatca aaagctggta cagaataatt tagactttga accatgagat 661 tttatcttaa cttcaagata gatactgacg ttagatttgg tgcccctgat cgttttgaat 721 caaaaagctc gtgaaagtct tcttcctcaa attcacatta ttcaagagtt ttcacggagg 781 ttttttttca aaaaggctac tatcttctag ggtgcaagga attgtatatc caccaagaac 841 caataccgaa tatgaaagtt cccgatgagt acttttagag aaacagggaa tagaggacag 901 agcactcata gtgcaaccta ttacctgtaa cctgtaatct ataccttata acctatatgt 961 acaagaggta gaaaacccag aatggagctg atcaagctag atttcgtcga tttggctttt 1021 gctgttggat taatgggcgc agccataggt ttatccgcgt gggaacgaat aggattagag 1081 ttcaatttag ccctcgcaac agggagaacc ttcctacaag tagccatatt gggatatgtt 1141 ttggagttca tatttgcttt agataatcct tgggcagttt tggcgatatt agcagtgatg 1201 ctgacgattt cagcaattgt cgcacgaaat cgcattactc aaaaaatacc gcagatgttg 1261 cccttagtct ggggatcaat tttgctcagt acaaccataa cactggttta tactaatata 1321 
ttgatcattc aaccagaccg atggtttgag ccacagtatg tgatttctct gggaggaata 1381 gtattaggta atgcaatgaa tgccgccgcg ctggctggag aacgtcttgt tagtattatg 1441 aatgcgagtc agttagaaat agaaactcat ttaagcttgg gcgcaactcc aggacaagcg 1501 atcgcacaat accgtaaaga tgctgttcgc gctggattga ttcctactct caatcaaatg 1561 atggttatcg ctatggtgac gttaccagga attatcactg gtcaattgct cagtggcatt 1621 aatgctcgtg aagctgcatc ttaccaaatt ttgattatgt ttatgattgc cttcgccaat 1681 ttgctgacag tactgttagt tactcgggga atatctcgtc aattttttaa ttccaccgcc 1741 cagctggtga gataaaggcg taagctgttt gatatcaaag cattataact tataccaaat 1801 tgcagtttgg gcagtacttt cgagcttagt tgacttactt atgtgtatcc agcgatgtat 1861 gttctagtca gtcataagaa agagaccgtt atttattttt tctcggtagg ataaatttaa 1921 tcctgtattc acacaccttg gctcatctac atataatcgg taaaagcaat gcaacctctg 1981 attgttgaaa aaaagattga ttctcaaccc tcttctcata atcacctgta tgtcgctaag 2041 ttttccgtca tattaacggt agtactagcc ttattattgc aagtaataac gcaagaaatt 2101 agttcaggaa tcaccgaatg gctagccaag gaacttgcgg tatcctcttt actcgtgcaa 2161 caatactcaa tagctacttt gcgcaagatt ttattattcg gagtagattt atttctagct 2221 atttagctta tcgaatgcat actattcaaa ataacaaaaa gccccctcta ggatgtccta 2281 gacaagatct ctttttgtct tcaagtgtat aagataacca aggctacagc tttggaatta 2341 aaccgagttg ctgaaaaaga cccaaagcat gttcttgtcc tcgctcttca gtgattttgc 2401 cttcgacaat gcggtaaatt ataatccctg tccaggatac tgactttcct gttgcaggga 2461 ggtgatgaac ctttatctct cccttatgag tgccttgggc tgtccagctg atcgcaacct 2521 tgtctccttc tgcaatcgca tctgtaattg taaactgtag atcggggaaa cctgtgtgaa 2581 tgtcagcgac ccatgattta aaaccctcac aatcaagtgc tttgtgaaag tatagggtag 2641 taaaccttta ggtacaggct agcaagttcg tctactactg ccaaatttcc ctttccccat 2701 gtctgttcac agaattgtcg cgcgatttct ttgttttctt gtgctgacat aacaagttta 2761 ttctttagta tgattgattg ttgtaagttt gttttcgggc ttggactaat tttatcgctg 2821 agaatgatat cagcaacaga accgattttt ctgcttgcat aaactttacc tttttcgatt 2881 ccgtttcatt caatgactga aaaatcaagc taccctggct ggtggttggc tagtgcgcga 2941 tttgtacttc tcaaaaagca aaatcatcta aaaattaaga tcatcttcga tgtacatttg 3001 tatcaatgta 
ttaagaaaaa tatttgttat tgtacgtact atagcaatcc taaatgataa 3061 gccctgcggg cacgctgcgg gaacgtaaat cacaaaatta cggacttctc tagaaagtca 3121 ggaaaataaa tgtttatttt tttacaaaaa aacaatagtc atatcatatt actactaata 3181 tcattaagca ccttatagtt aatcgaaaaa attaatttat tcccaaaata aataaatata 3241 tcttatcaat ttaataaatc aaataaaatt gataaaaaaa gagaaatatt aatagccgaa 3301 aaaagcttta tcctgcaagt agggggcagc gttaatgttt tcgagagcac cgcagcaacc 3361 gttataacgt ggcaaaaaag gtgtgtgtca atggatttgt tagaatacca agtgaaagaa 3421 tggtttgcga acataggcat tcctgtgttg ccttcccaac gaattgacca tcccacggat 3481 ttaaagcgtt taaaaattga ctacccaatt gttctgaaat ctcaggtgaa tgcggcggaa 3541 cgagcaaaag taggtggagt cagatttgta gaaacgacta tcgatgcgat cgccgctgca 3601 cgaagtatct ttaatttgcc aattttagga caattgccag aagtattact ggcagaaact 3661 aaatacgaaa cggaacaaga attttatctt gcggtagtct tggatacagc tgtctgtcgt 3721 cctatacttt tagggtgtac ggaagctatg gacatagatt gggactcagc agacgaaaaa 3781 atgcactacg ttgtcgttga acaagagttt tccccatttt atgctcgaca actggcgttg 3841 aaaatgggct tgcaaggtgc cttaatgcaa tcagtgagca atgttgtgga gaagatgtac 3901 cagttatttg tagaaaaaga cttggactta gtggaaatca atcctttagg tgtgagttcc 3961 tctggtcaag ttatggcact taatggtaac gtcagcatta acgaacgggc aattggtcgc 4021 catccaaata tggctgacat agcaaaagca atggtcaacc cttatactag tagtaataca 4081 atccggaact tgggcgactg ggatgtcgta gaaatgcacg gtaaaattgc catattgggc 4141 aatggtgctg gattggtaat ggcaacttta gacttggtgg ttaatagtgg tggaaagccc 4201 ggggtgtgtc taaacctgcg ccatgcctct gttaccgaaa cctcaccaac cacttttaag 4261 agtcggttag aacaagcttt aaaaaacctg gctactgata aaagtattca ggtgatactc 4321 attaatttcc tcggtagcat ttcccaaggt agtgaagttt ctgaagtcat tgacgacttt 4381 gtagaaagta gtatgacaga agctcaacta tttatggcta aatctaatgg tggtaaaaac 4441 cgtcgagaga acagttttcc gcgcttagtc gtccgtcttg ctggttctga gttcataaga 4501 gcaagaattg atatagcagc actgaaaact ccaactgata cgtcaataac ggtagtggaa 4561 aatttagatg agggcgtagc agaagcaatc cgtttggcta agtcaacagc atatagaagg 4621 ttttaactaa ttcacctatt atgaatttaa caccatatag caaagtgtta atccaaggct 4681 tttctgagta tattacagca 
actcatattg ctcaaatgaa agccaatggt acaaatttgg 4741 tcgctggggt taatcctgga tgtggcggac aacagatgta cactctgcca gtcttcgatc 4801 tcgtagagga agtcgtagaa aaatgtggag ttattgatac tacaattatc tgcgtacacc 4861 cttaccaagt cctagatgct gcactagaag cgctggcatg tgatatccgc caaatcatca 4921 tcatctccgc tggtgtacca cccttggata tggtacaact acttcgtaaa gccgaagcaa 4981 aagaaacctt gatagtagga ccaaatagtc ctgggatcat cataccggga aaaatcctct 5041 taggaactca accaagtgaa ttttatacgg caggttcagt aggaatcttg agtcggagta 5101 cgactctcac ctacgaaatt gccagagaat taactgatgc tgggttggga cagtcgatga 5161 gtgtcagcat tggaagtgat gcgatcgttg gttcgtcttt tctgcaatgg ctgcaaatat 5221 tagatgaaga tgaagccaca aaggcaattg tcttagttgg acaactaggt ggcgatcgcg 5281 aagaagccgc agcaaaatat attgcagaag caattgataa accagttgtt gcttatattg 5341 caggtacaca agcaccatcc gcaagacatt ggcgtcaaac tggaacattg gcggctgtca 5401 tcggacgcga tcctgatttt ggtacagcaa aaaacaaatt agctgctttt tcctttgcaa 5461 gagttccagt tgcagaacgt ccttctcaga taccagaatt ggtgaagaag gcgatgagat 5521 agcaaaatta tttacctggt gtaaacagtg ctttggaagt agactcgcct gctcgccgca 5581 attgtcgtgt tagttgcagg ttcagcaata ccagtatgag tagctgagcc actactacca 5641 cccaaacggc tgttaccgtg ttgcttagta ttggaaggta gggaaaaata agcaccgcaa 5701 aggggtagaa agcgtaaatc cccactaacc tagaaatgat cgacggtagt agaattgctg 5761 cgcccagcgt accaattgtc caacttgtgc gctgctgagt tcgcatgaac aataacagtt 5821 gggctacgct tgcataaatc aaaatcatca ttaagctacc attgaaagct agaccgaaaa 5881 acgcctgggt tttgttttca tctactttgg caggcgagag taaaataaag aaagctagaa 5941 gaattatgac gattatcaga ttaatagcga tcgctaccac tgccggactt ttttcacccc 6001 aaattaaatc ctgtactaac gagctatgac cgagtttctt gctactagaa actcttttgt 6061 gtcggtagcg cgcccaatcg tgtagcgctt gacggtgagg cgtgatagca gcaattatat 6121 ataaaaacag caaaaaattc aataacgata gcgcaaagag gttttcagtc agcggcaaag 6181 cgtatgggct acgatttgct ggtgctgcgt agttggcgca tcctatagtt aaaactgtaa 6241 agcaaacagt cagcaaataa ctctgctgct tacttaacat agtggcactc tgatcacgaa 6301 agcaacgccg caaagattgc caaataaacc aggttaaaat gccataattc aacaaaacaa 6361 aaccgacgat agttaagaca ttatcactta 
gtacaaagtg aaaccactta aaacgttcta 6421 accctgttac aaagccttca aaaaaagaag ttgtagacaa atttggaatt aggcaaaaag 6481 gattaagcag cttcagccaa gtagttgaat taaaataact agtcgtgctt attcctttag 6541 ttaacatcaa gaaaattaaa actgcgccac tacccaacca agcttgaaaa ctgccaagcc 6601 aagaaccaac taagccaaat agcatagcag cactgtagaa aaaaacagaa gcaacagcaa 6661 caactgcgta gaagctcaaa attagactta gaggaatttt agcagcaagc ccagaccaca 6721 aatgcaaagg aagtgataat acggtaaaca aataaatcag aatcggcact cctagcaact 6781 tgccgatgaa aatacttgct tctgattggg gggtgaggcg agtcaaatta aatgtgccgc 6841 gacgttcttc agttgccaaa tcgttaatta gcatataagt tccggcaact aacaggatga 6901 caatgccaat taaacttaaa cctacgaata tgtctcgcca ccaggattgc cagtattcat 6961 ccggcaggca tttcaactga tatctcttgt caagtttctc accgaaagtt ggtgcacaat 7021 atttgtcgaa tatatcaaga gttacacgag ccaatagggt ttggaaagat atcaatacta 7081 gaatgaattg acccaccaag gaaagagcaa ttgtaagtag tatgttgcgg ggtttgaggc 7141 gtcctttgag ttctcgtaac agctggggat tccagttaag tttcataaaa gaagtctttt 7201 aaattcgcag ttagttatag aacagacgca ccctaaacca taaacgcatt tgttggtctt 7261 tcagcttccc aaaattcttc cctagcgtcc agccaacaat gcttttgtcg cagactctcc 7321 tgcaagtctc acctgtttta tcagatgcaa gtttaacaat accaaaacac tcaactctcc 7381 caaccaagcc agaaaaattg ttggtgtcga ggcgtattcc aaactagccc aagggaaagt 7441 cgaaaacagc cacacagtag gatatttaga agaatcaaca gaaagcattc ctagaatgac 7501 tggtggtaag aaaatcactg cgccaatagt gccaactgcc cagaaagaac gcttagtcgt 7561 tttcatcaac agcatcctct gcgctatggt ggcgtaaatt atcatcaacg tgataaacat 7621 ggctacacct aagacagtct ttaacctacc aacatggtta atttgccaat ctatattaga 7681 ggtgttgtga acattcaaaa caggtgccag taaaatccaa acaagtaatg ggactgtcac 7741 tatcacaaga ttaattgcta ttgccacaaa tgaagggctt ttttcaccta aaatcaaatc 7801 cagcaacaaa gattttctcc atacattttg atgactagaa agattttggt gtcgatacct 7861 tgcccaatct tgtatcgttt gacggtgagg tgagagaatt gcaattaaac cgaacaacag 7921 cactaagttg aagaaaacta accaaaataa actctgttgt atttgagaat tcacatcgta 7981 aatgcaacaa gaaacattac ttggagattc atactcacaa gcagagtgac aatagccttc 8041 tctgtattgt aaagcaaatc cccaaaaaac aaattggaag 
caagcaacaa ataagtaact 8101 ttgctccttg ctaaaaatag tagagcttgg attacggaaa caacgtttga gtgcttgcca 8161 aatccagtaa gtccacaagc cataattcaa caagtgcaaa gccattacac tgccaatatt 8221 tgttcctaca ggaagataga aaaactggaa tttttgcatg aatccccagt tgtatccgtt 8281 attaaagata tttgggaata aatgccgggt catgtcaaag gggcttaaca gcctaaacca 8341 agcaaatgaa gaatttaagg agtatgattc agacgacgcc aattgcatcg taagcatcaa 8401 aaatagtaga acagcaccac taccgaacca agcttgaaaa ccattcaacc aacgacaaca 8461 taaaccaaaa agtagtgctg cactgtaaaa gaaaatgcag gaaccagcta gaacgaccca 8521 aaaacttaat atgtgacttg tggcaatatt ggcaccgtgt ccagcaatga aatgtagagg 8581 gactgctgtg aagacaagca aatagattaa aattggaaca cccagtattt tgccaattaa 8641 aatactagtt tctgattgag gactaaggcg aagaaaattg agtgtgccac gccgttcttc 8701 ggtcgctaaa ttattgatga gtaaataggt gccagcaact aatagtgtga aaataaaaat 8761 gatactcagc gagaagaata tgtatcccca gtgttcacgc caccacattt gcatgtcaag 8821 ttgatcagtg gggcagaaat tcttggataa atagtcattg gactcagtta tttttgtctg 8881 aatttgtgca atttctgctt taagttcttg gattttatct ggtaatcttg gttgagtatt 8941 tagataacta tctaattgtc tttgaaggtt attagattgt tcagacaact ggttagtctt 9001 gtactcataa gctggacgga gacgacagta ctctccaatt agtgaatatt tatcgccagg 9061 taagtttgcc aattgacaaa gaaatagtac cacttgacca actagcgata taacaactgc 9121 gctcgctaca ttaaaccctt taaaccgtcc tttaatttct cggaatagtt ggggattcca 9181 atcacctagt ttgtctatca agttgattat catttaacaa aactccagaa ttaagttaac 9241 tactttgtag agtccaagtc cgtcattgag tgactaatga acgccagatg ctctacttgg 9301 ggagacccca agaccgcact ggctcctaat cactaatcaa gatgcttgct tatgacctaa 9361 gttgaggaaa attgtttcta ggtcttcttg agtgcagtga aaatcagtca gaggaatact 9421 agccgtaaca agcgatcgca acaaatttgc acagtcttcc tgttttccag aaaaattgac 9481 tcgcacgctg ttttttgtag gcatgatctc ccactcttgt acgtaaggat tattttttag 9541 ttctcctaaa agttcgtcta atttgcctaa agtcgatatg acaatttgct ggcgggcaag 9601 acgttggtag agttgttgta gcgatgaact ttctaccaaa aagccaagtt ccataattcc 9661 cacactcgta cacagttctg ctaagtcgct gagaacgtga gaggaaatca atattgtcat 9721 ccctgcttct tgcaacactt tgatgatttc gcgaaactgc atcctcgcaa 
ttggatctaa 9781 tcctgaaact ggttcatcta gcaacaagac aataggttcg tggataattg ttcgggctaa 9841 acacaaacgc tgtttcattc ctcgtgataa ggtggaaatc aagctattgc gtttattgcc 9901 tagttgtatg agttctataa cttcatacag gcgttgagtg cggcgtggtt cccgcaactg 9961 atacaaacgc gcaaaataat ctaggtagtc ccagactgtc agttcctcgt acagtggata 10021 gtcgtcaggt aaatagccta agcgacgctt gagggtgggg ttgcttttgt cgcgcagtag 10081 gcgatcgcca ttaatataaa tctcacccgt cgttggttcc tcagcagcag ccaacatccg 10141 gatgagagtc gttttccctg caccattagg accaatcagt ccgtaaactt ctcctgattc 10201 tatttctaaa tcaatatcat ttacagcaac gtgtctgtca aattgcttag tcagttcaca 10261 agtccggatt gccagttctt ttgtcatatt tactgaagtg cggcttgaaa ttatactaaa 10321 ttgcattcag ctagagacgc acataacact gttacatgga taactatctg aatctagcgc 10381 gtttggtaat ccacccacta tcgtttagct gcgattgttt aatacacaac agccagccta 10441 tcttttgact acagtatttt ttagctttgt ttcggaaatt gaaactactt gttattaaaa 10501 cttaaaattt caattcaaaa gattattgga acttatcgaa aaattgggtt taaaaccccg 10561 tccttctagg acggctttac ttttttaaaa attttcctta ccatagtaat gcgagtttgg 10621 taagcacagt gttaaaagcc tacaagtaca gaatctatcc gactagtgag caatcaatat 10681 tgcttgccaa gtctatgggg tgtgcgcgtt ggttttacaa ctatgctctt aacttaacga 10741 gcgaaactta caagacaacg ggtaaaggat taagtcgcaa tgaaatcatc aacttgctgc 10801 cttctttaaa gaaagagcat gagtggctaa cagagccacc atcacagtgt ttgcagcaag 10861 ttgcattaga cctttccagt gcttttttga atttttttga aaagcgtggt ttatatccaa 10921 actttaagaa gaagggacaa aaacaatcta ttcgctttcc ccaggaaata aaactagatg 10981 gcagttactt aactcttcct aaattaggta aggtttattg taaagtgtct cgtaaaccgg 11041 acggtaaact taaatctgtt acggtatcac tgacttcatc tggtgaatac tatgctgcct 11101 gtctatatga tgatggtaag gatattcctg tttcatcttc agaaggaaaa gccgttggga 11161 tagatatggg gataacccat tacgctatta cgtctgatgg cactaaacat ggtaatccca 11221 aatattatcg caaatatgaa acaaaattag ctcaaaagca aaaactactt agccgtaagc 11281 acaaagggtc taacaaccgt aataaagctc gtattaaagt ggctattgtc cacactaaaa 11341 ttactcgatg tcgtgaagat tttctacaca aactaagtcg taagttagtt gacgaaaacc 11401 aagtcatagt tgtagaaaac ttggcagtta 
aaaatatggt cagaaaccac aaactcgcaa 11461 aatcaattag tgatgctggg tggggtcaat tttgcaccat gttgaagtat aaggcagaat 11521 ggaagcgaaa aacctatatt gaggtagatc ggttcttccc tagttcaaag acctgtagta 11581 attgcctaca tcaagtagat catctaagct tggatattcg tagctggcag tgtcctagat 11641 gtcaaacgtt acacgatagg gatgttaacg ctgctataaa tatcagagat gaaggtttac 11701 gaattttggc gggagggcat ctcgctaccg cttctggaca acgtgtaaga ccatctaaag 11761 gcactgcttt tagaggctat gttggatgaa agaagaatcc tcgcggcttt agcccgagga 11821 gttgtcaaga cacatcattt gtctaggcgc atacatcaat acttaaacat taacccccgt 11881 gggtttgtac ttgggacgct ttcaggggtt tttggtggtg caataggaat ttggcataag 11941 ttgttacctg cagcagcgct acctcagcga acacaacaga atacatcgag tcaatctgat 12001 attagggttc gggaactctc ttgccagggc gcaagaattg gtcgaaaatg caattgtcgt 12061 tgcccgtcgc ctcttttggc acgctcacgg agtatcgtta tagtgccgga cttacgacga 12121 aaaaattaaa acagtttgaa aatttaatag tacactgacc aggtatgata ttttatctgc 12181 aagatattct gcacgattaa acatttacct acctgaaaac gaactatgac ccagaactac 12241 cgcattaccc tactccctgg cgatggcatt ggatctgaaa ttatagcagt agcggtaaac 12301 gtgctgaaag tcgtagggaa aaagtataac attcaatttg aattcactga agcactcatc 12361 ggtggtgcag ctattgatgc tacaggcgaa cccctaccag ccgacagctt agatatctgc 12421 cgcaatagcg atgctgtgtt actagcagcc attgggggtt acaagtggga ctccctacca 12481 tcccatttac gtccagaagc aggtttgtta ggactgcgtg cagggttggg attatttgcc 12541 aatttgcgcc cggcgaaaat attgcctcag ttaattgatg cttcttcctt gaaacgggag 12601 gttgtcgaag gcgttgatat tttggtggtg cgcgaactca ccggaggaat ttactttggc 12661 aagcccaaag ggatttttga aacagaaact ggtgaaaagc ggggtgtgaa tacgatggct 12721 tacactgaat cagaaattga acgcattggg cgggtggggt ttgaagctgc acgaaaacgc 12781 ggagggaaac tctgttcggt ggataaggca aatgtgttgg aagtttctca actgtggcgc 12841 gatcgcatca ccaaacttgc ccaagagtat ccggatgtag aactctctca tatgtatgta 12901 gataatgcag ccatgcagtt gttacgtgct cccaagcagt tcgatactat tgtcacaggc 12961 aacttgtttg gtgatattct ttcagatgca gctgctatgt tgactggcag tattggaatg 13021 ttaccctctg caagtttggg tgcttctggt ccaggagtgt ttgaacccgt ccacggttca 13081 gccccagata 
tcgctggtca agataaggca aatcctttag cacaggtttt aagtgctgct 13141 atgatgcttc gttatggttt gaatgaacca accccagcag atgagattga acaaggagtg 13201 ttagaagtat tacaaaaagg cgatcgcaca ggagatataa tgtctccggg tatgaacctt 13261 ttgggttgtc gcgcaatggg caaagcactt atccaagttc tcgaagcaaa atgagtgcat 13321 atgctcaaag tggaaatttt ggcaacttta aagaaaattt tcggataaac tattaacaca 13381 aacctacaac aatacttcgc gagtggtgta cgctttaaaa aaaggacaag aaaactatac 13441 caatcagaag agccagtcag ctttgattga tcctaccgtc atcagagctg ctgggcaaat 13501 ctaccacact tactgtgagg tacatcccga aatgactggg caagcttcag gagtcgcaat 13561 taatcggtct aatcaccgag gtaaggtgat ttttactcac caaccaattc tcttgcctga 13621 agaatgcttc gtgcctttga atcagattga atcgtatatg tattaagcta taagtgactg 13681 gtcatgagtc atgagttaag gttataactt ttgagcaaat ggcaaaaaat cctctgggtt 13741 agccaattgt tcatgggggg aacccccaat accccattaa ctcacaattg acttgaatgt 13801 tcatcttgat aatagtccag acaagtttca tcgtctttgc tttaggcaca tctattggca 13861 gctttattaa tgtaatagtc tatcgactac caattggact gtcgattatt tttccaccgt 13921 ctcattgtcc ccattgctta aaccaactca aaccctacga taatgtacca ctgttgggat 13981 ggctatggtt aagaggacgc tgtcgttatt gcaaaagaaa aatttctatc cgttaccctg 14041 tggtagaggg ggtgacgggt tttctttttt tgttcgtttt ttggatgttg aaattttcgc 14101 cctatacagt aggctactgg gcgttttgca gttggttatt agcgctatcg ctgattgatt 14161 tagatacaat gacactaccg aactcactga ctaagtcggg tttggtagtg ggaattattt 14221 ttcacatggt ttgtggtttt ctaccagaag caagttgggt cggattggta catcatctga 14281 aaatggcgat aggaggagga gtgctgggct tatggctatt tgatgcgatc gccataattg 14341 gttcaattgt ctttggcaaa actgctatgg gtacaggaga caccaaatta gcagccatga 14401 tgggagtatg gctaggatgg aagtatttac tgcttgctag ttttcttgct tgtctagtag 14461 gagtggtagt cagtggtggg aaaatgatac tatcacagca cagcagccgc ataccactgt 14521 ctcaaaaatg gggagaaaag ataccttttg gtccttttct tgcttgcggg gcagtcattt 14581 ctctatttgg tggtgaggcg attttgtctt tttatttaca attatttttt cccatcagtt 14641 gaacaacaga agataacaaa acagatgacg cccttacagt gtgtatctgc tatacaaaac 14701 ttgactaaac tttttcaa // LOCUS 
NODE_2308_length_14598_cov_4.91789914598 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14598) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14598) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14598 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 807..2045 /locus_tag="DP116_19260" CDS 807..2045 /locus_tag="DP116_19260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318441.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyl transferase family 1" /protein_id="PRJNA477356:DP116_19260" /translation="MENISRIRATIRDKTVSYPDILVISRSFQPKEGGIEEYIYNRCL QDPERVIVLTASCSGHKLFDNAQKFPVYRWPISRHWHSSFVEGMLQPFLNSVCSFVLA VKLYFRYHYRYIEWGHGYHFLSLLLLSYLLPIRFFIYLHGKDILGPSDNPILRSLFEC TLKRAQGIVCNSSYTQDYLRTHFQVATPTHVINPAVRVEKFGVSSYQENLDDLRVRVR NGYSIPETAVVILSVGRVLKRKGFDRVIENLTLLLTFGMDVHYILCGQGPYESALRNL ARRLRVHKRVHFAGYVSDQELTGYYAASDILAMLTQGDDKTPRVGGFGIVCLEAGYFG KPVIASSLGSVVDTVHHEENGILVNPNSGYEVFHAFKRLCQDQKLREQLGRKGKQLAQ RKTLHRSIYIPESRYSCLPT" gene complement(2042..2254) /gene="thiS" /locus_tag="DP116_19265" CDS complement(2042..2254) /gene="thiS" /locus_tag="DP116_19265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874415.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thiamine biosynthesis protein ThiS" /protein_id="PRJNA477356:DP116_19265" /translation="MSDNITLLVNGETRSCLPQTPLSDLLQQLGFNPRLIAVEYNGEI LHRQFWSDTKVQQGDRLEIVTIVGGG" gene complement(2247..3368) /locus_tag="DP116_19270" CDS complement(2247..3368) /locus_tag="DP116_19270" /EC_number="2.5.1.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318443.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thiamine phosphate synthase" /protein_id="PRJNA477356:DP116_19270" /translation="MKEADCYDGSTNGVVVMVEPYSQTEQIQQVVYRILDANLDRARE GLRIIEEWCRFGLNNAQLAAECKKERQEIAHWHTGELRAARNTPGDPGTDLTHPQEEQ RTSIKSLLQANFCRVQEALRVLEEYSKLYNPNMGKACKQMRYRIYTLESSLMGRHRQQ LLLRSRLYLVTSPGDKLIETVEAALKGGLTLVQYRDKTADDTVRLQQAKKLRQLCHDY GAIFLVNDRLDLALAVDADGVHLGQQDLPIAIARELLGPHRLIGRSTTKVAELQAAIA EGADYVGVGPVYETPTKEGKAATGLEYVRYAAKNCSIPWFAIGGIDPNNVNEVINAGA NRVAVVRSLMQAEQPTLVTQFFLSQLISRMKPELGMSYV" gene 3685..5619 /locus_tag="DP116_19275" CDS 3685..5619 /locus_tag="DP116_19275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318444.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19275" /translation="MAMLGVSLDVAIAQTPRTTDGKQLDVKTISQVNALFVNPSVGDD KVNNGSEGTPFKTISQALRIAGPNTVIILSSGTYSTETGENFPLILKPNISIQGDSRS KGRGIVIKGGDTYLSRTFGGKNVAIVGANQAKLTGVTVTNPNPRGYGLWIENSNPVIV ENTFTGSTQDGIAVTGNSTPNIRNNFFYQNGANGITVSGNSQAEVRENVFQQTGFGIN ITQNAQALVVSNSIQDNRSGVVVQANARPILRNNLIQGSKEDGLVAISQAIPNLGTAS EPGGNQFRNNARYDINASAAKEIFPAFGNSLANNRINGKVDLTGTTAVADAGIGRTQT RENYPANLSASPRVPLPTYSNNASSGLNNQLQPLRPANSPLSATAINQKQSYLQNAGL PTPNNLTRYGGQSPTRMLPPRQVSPTRSIQPLANTSNTSRQANYVRVSPGSVEFTAPQ TASNTVDAGIQGTLNREQGREDARNLQQMTVRSPLSPSGRQVPQLNATFSNPVTPAPV QRVQSQGQSALPTLQAAPVGEAALLPVPNSNIPLGNTSNMRKVPVPQRSSTTAYVGNS SPSPAQDTQTNLRYRVVVEVQNEKEQELVKFLAPSAFPTVWRGKGVMQVGVFNTRNNA DSIIKILNNNGLKGVVEPVN" gene complement(6092..6364) /locus_tag="DP116_19280" CDS complement(6092..6364) /locus_tag="DP116_19280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113908.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19280" /translation="MNRPSILQPGTNYTFSKYFELPLAPADILAEFDCTYERKRLDLP RYEGSIKCLDFLKRILQDIKLYRVPEELEELLRILVGIISSSGDYL" gene complement(6393..7172) /locus_tag="DP116_19285" CDS complement(6393..7172) /locus_tag="DP116_19285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739730.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19285" /translation="MFTSNPDTSADLAIALLIHYSFDLGGYSASELVDLWQKQYPGNW LHMAVIEALYQGRYKAVSVQQILTCWQRRGQAIFHFNMEFERLICSKFPQSLTTVPRI SSAYVETTTAQEDTSVQTFPPPVPSTVDTGTPKWLPLRRVSVNNQTSEHGDTEKSPPH PISPQQQSSQRREPEQAKTSSMETPSPASGNQNDQDNFLSGATNNPPIEQFTPQTSAG SESFTSKLKAISNEKLHPSEKLIARRPQGFLSQETGNDYSY" gene complement(7303..9393) /locus_tag="DP116_19290" CDS complement(7303..9393) /locus_tag="DP116_19290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456341.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19290" /translation="MVNSTVVATLTIYVNPTIGNDANAGTRLNPYKTITRALKVTTTP KVIQLACGTYNTASGEIFPLVIPVGVMLIGHEATKGQGIVISGSGKYESESFGIQNIT LVLLNYAQLMGITVVNPTKKGTGVWIESTSPTLANNTFIKCGREAVFVSGSAKPLIQD NVFLQNGVSGLVMAGDSKGEVLDNLFQKNLLGMAISDTAEPLVANNTFLENRNAIALS RNARPILRNNLIENNTQTGLLVNGNAAPDLGSPQDPGGNIFRQNGQFDIQNTTSLHLS SVGNNLNSTQIKGMIEFINQKENNQNSVLINTTFSDMAGHWAMAFVEALHKKNLMNGF PDGTFRPDAPITRAEYADIIARSFQLPSGKKIRKFTDVKRGFWASSAIERAASMEFIS AFPDGTFRPMQNLTRVQAIVSLVNGLKLSGGNPNILNVFGDSAQIPSAATNAVAVATK NLLIVNYPEIEQLEPLRDITRAEVAACIYQGLVASGIELPIASPYIVNPNVEIASDTE VMTHWAAGFIQALLKMGLTHGSAHETFEPDKPITRAQYAALVAVCFNPTPKHPATEFT DVQKDFWAYNAIKIAAQGGFVGGFRDRTFRPEQNVLRLQVIVSLVNGLGLKAADNNSL LGLSDHNIIPNYARSAVATATQNKIVVNYPDPKQFNPNREATRGEVAAMVYQALVAVR QTPTINSPYIFSHS" gene 10923..11936 /locus_tag="DP116_19295" CDS 10923..11936 /locus_tag="DP116_19295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651168.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_19295" /translation="MISVEHLSKTYGSTSAITDVTFDVEPGEILGFLGPNGAGKTTTM RILAGYLPATSGTARIAGFDVHDNSLLVRQRIGYLPETPPLYPEMTVEGFLYFVARIK GVSAGDRTTKVTAAIERCNLQEKRHVIIRKLSKGYRQRVGIAQAIVHDPPAIILDEPT VGLDPRQIIEVRNLIKSLAGSHTIIISTHILPEVSMTCSRVAIINGGKVVATDTPDNL MNQLTKGSGYEIEIEGEAGLAKQVLQNVAGVSFVESISAVGMHSHTSLKENRTYLRVI SQPGTEPGKDIAATLIGTGFGLLEMRRVNATLEDVFLQLTTEEKTLETETEIAEAKEG EAA" gene 11937..12749 /locus_tag="DP116_19300" CDS 11937..12749 /locus_tag="DP116_19300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138536.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_19300" /translation="MGVVLGNIIAIYRRELQSYFVSPLAYAIAGVFWLLSGLFFVLIL MGPEGILQTVTALDLQGQQFGVPVPPIDVPYELVRAFLDRMGWLLLFVLPILSMGLYA EERKRGTLELLATSPVTNWAVAVGKLLGVLTFFITMVVPIMGLETIALSASSPAMPPT IPLLGHLALILLAAAILSLGMFISSLTDSTILSALFTFALMLLLLFVDLIAKNIGGSI GEALGHLSLLKHYNTLVQGIFDTSSLLLFASYIILGIFLTAQSIDALRFQRS" gene 12835..13197 /locus_tag="DP116_19305" /pseudo CDS 12835..13197 /locus_tag="DP116_19305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009787356.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="four helix bundle protein" gene 13276..>14598 /locus_tag="DP116_19310" CDS 13276..>14598 /locus_tag="DP116_19310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318464.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_19310" /translation="MKLIAQKKPLKILFWFGPFLLAAGLTSGFASDNWGPIQLALIIL GTVIIVLWLIWQNKQNNWWGQRSTQVSTNAVIATLAVLAILGLINFLGTRYHTRLDFT ETKLFTLAPQSRELVRSLQVPAKIWLFDVNQDPVDRDLLENYRRQTSKFSFEYVDPQA RPGLARKFGVKDYGEVYLEFGDKRQLVQVVGPQERLSEVKLTNRLQQISSISSAKVYI LQGHGEHELSGKEEGVISQAVKALNDKGYTTSALNLAEKLSVPQDANVVVVAGPKRSL FESEVKALQDYLNRGGNLLLMIDPDTDPKLESLLDEWGVKLDNRLAVDVSGSVGLGPA APLVTQYGKHPITKDFGNGISFYPLARPIDTTSVPGIQATPLLLTKAYPNSWAESDQQ SENLKFNPESDRKGPLTLGVALTKKLSAKSEATSNSTLTPIIAATPAKP" BASE COUNT 4179 a 3103 c 3139 g 4177 t ORIGIN 1 caacaaggtt tgatgagtag agatcaaaag tcatttgaaa cttttttaaa caaagataat 61 taagaaaagt agctttttgg cattttggca taacattttg agtggtcttc agatattcta 121 ctaaaagtag ctagatttca aagtcttaat aggagcgatt tgacgtatga aagtttaagt 181 atcaattctg agctatttga taaaatagta taaaaaaata taagtagaat agcaagcatc 241 aaacgtaaaa tgtcattacg taaaagcaaa gcgtgaacga gccgtgccat aggcataggt 301 catatccgga gggcaaagac cctattttac gtttttcaat gttgacgtgc cattcagcta 361 tagaacttat ttcatcttca tccttactct cacttgaata tttaacttgt gtctatattt 421 tttttgtaaa ttcgtatata caagaaacag tatgtatgat aaaaataaaa tattcatcaa 481 ctcaattcaa ctatcagaac agaaattatg gcatcttagt tttgccagaa aaaccttata 541 gatattacaa aaaagctcag aatcaaagaa aatgcaatag tcattgatca tttttcattc 601 aaagtgattt tttcaatgaa gaatgataaa taactattta tctgctctac catgcaatca 661 aaatgtcagg tatttgttag ctcaaataaa cgaggaacaa aatcatctga ttttcttcag 721 tttagttaaa ggtatgaatc acaaagattg caaaaaatca tgtaggtttt ttctgcgatt 781 ttaacgttta gtgatcaaat gttctgatgg aaaacatttc acgcataaga gcaacaatcc 841 gagacaaaac tgtgtcatac ccggacatcc tagtcatatc tcgtagcttc caaccaaaag 901 agggtggaat tgaagagtat atctataatc gttgcttaca agatccagaa cgagtcattg 961 ttttgacggc tagttgttcg ggacataagc tatttgacaa tgcgcaaaag tttccagtat 1021 accgctggcc catttctcga cactggcata gtagttttgt ggagggtatg ctgcaacctt 1081 tcctcaattc tgtttgctca tttgtgctcg ccgtcaaact ttattttcgc tatcattacc 1141 gctatattga gtggggtcac ggctaccatt ttctttctct tctgctgtta 
agttatctcc 1201 tacctatccg gttttttatc tacttacacg gaaaggatat tcttgggccc tcagacaacc 1261 ccatactgcg ttctctgttt gaatgcacgt taaaacgagc ccaagggatt gtttgtaaca 1321 gttcctatac tcaagattac ttgaggaccc atttccaggt tgcaacacca acccatgtta 1381 tcaaccccgc tgtgagagta gaaaaatttg gtgtttcatc ttatcaagag aatctggatg 1441 atttacgtgt cagggtgcgg aatggataca gcattccgga aacagcagtt gtgattctct 1501 cagtcggacg cgtgctgaag cgtaaaggtt ttgatcgcgt gattgagaac ttaacactct 1561 tactgacttt tgggatggat gtccactaca tactctgtgg acaaggtcct tatgagtctg 1621 cactgagaaa tttagcccgt cgtttgcggg tgcacaagcg agttcatttt gctggatatg 1681 tgtctgacca agagttaacc ggatactatg cagcatccga catacttgca atgctgactc 1741 agggagatga taaaaccccc agagttgggg gttttggtat cgtctgttta gaagcaggtt 1801 attttggtaa acctgtgatt gcttctagct tggggagtgt agtggataca gttcatcacg 1861 aggaaaatgg tatactagtg aatcccaatt ccggttatga agtttttcat gctttcaagc 1921 ggttgtgtca agaccaaaag ctgcgtgaac aactcggtcg taaagggaaa caattagctc 1981 aacgcaaaac cttacaccgc tcaatttaca ttcctgagtc tcgctactca tgtttgccaa 2041 cttaaccacc accaactatg gtgactattt ctaggcgatc gccctgctgt acttttgtgt 2101 ctgaccaaaa ctgacggtgt aaaatttcgc cgttatactc tactgctatt aagcgaggat 2161 tgaaacccag ttgctggagt aagtcggata aaggtgtttg cggtaagcag ctacgagttt 2221 ccccattgac aagtagtgta atattatcag acataagaca taccaagttc tggtttcatg 2281 cgacttatta gctgtgagag gaaaaattgt gtcactaagg taggttgttc ggcttgcatg 2341 agactacgca ccacggcgac acggtttgca cctgcattga tcacctcatt cacattattt 2401 ggatctattc ccccaatggc aaaccaaggt atggaacaat ttttagccgc atagcggacg 2461 tattctaagc cagttgctgc tttcccctct tttgttggag tttcgtaaac tggccccaca 2521 ccaacataat ctgcaccttc agcaattgct gcttgcagtt ctgccacttt tgtggtcgaa 2581 cgacctatca aacggtgagg accaagtaat tctcgggcta tagcaatagg caaatcttgt 2641 tgtcctaaat gtaccccatc agcatctacc gctagagcta aatctaagcg atcattgaca 2701 agaaaaattg caccgtaatc gtggcatagt tgtcgcagct ttttcgcttg ttgtaagcgc 2761 acagtatcat cggctgtttt gtcgcgatac tgtaccagtg ttaatcctcc tttgagggca 2821 gcttcgacag tttctattaa cttatcacca ggggaggtga caagatacag acgcgatcgc 
2881 aacagcagtt gctgtcgatg acgtcccatc agagaacttt ccaaagtata aatgcgataa 2941 cgcatctgct tacaagcttt ccccatgttt gggttgtaaa gcttactgta ttcttccaac 3001 acccgcaggg cttcttgcac tcggcagaag ttagcttgca acagagattt aatactcgta 3061 cgttgctctt cttgaggatg agtgagatca gtcccaggat ccccaggcgt atttcgcgcc 3121 gcccgcagtt caccggtatg ccaatgagct atttcttgtc gctctttttt gcattctgca 3181 gctaattggg cgttgttcaa cccaaagcga caccattcct caatgattcg caaaccttca 3241 cgagcgcgat ctaaatttgc atctaaaatg cggtaaacaa cttgctgtat ttgctctgtt 3301 tggctgtatg gctcgaccat tacaacaacc ccgttagtgc ttccatcgta acagtcagcc 3361 tctttcatat tgacactgta cttggtatct ctattgattc ttaactaact taaatttgtc 3421 tcaaaaagat cgaagtagtc aatatgtgag taactttatc gactttattt acgacatcta 3481 gaaaagtatc aaaagttttt tggttgaagg agtgccgtaa aagtgattca cccccacgtt 3541 ttttctaaat cttatatcgt aaacaaccat gaccgaaagc gagttgccaa ggatgacaaa 3601 ccatattttt tcctcaatga aataatatta tcagtgtttc gttttttagt tttgtgtttc 3661 acaactggca tgggagtggc atacatggca atgcttggcg ttagccttga tgtcgccatt 3721 gctcaaacac ccagaacaac agacgggaaa cagctggatg tcaaaacaat ctctcaggtt 3781 aacgcccttt ttgtcaaccc aagtgttgga gatgacaagg taaacaatgg cagtgaaggt 3841 actcctttta aaaccatcag ccaagcgcta cgaattgctg gtcccaacac agtcattatc 3901 ctctcaagcg gtacttacag tactgaaact ggagagaatt ttcctttgat actcaaacca 3961 aacatttcta ttcaagggga ctctcgcagt aaaggtcgtg gtattgtgat caaaggagga 4021 gatacttacc ttagtcgaac ctttggtggt aaaaacgtcg ctattgtcgg cgcgaaccaa 4081 gccaagttga ctggtgtaac agtaactaat cctaaccctc gtggttacgg tttatggatt 4141 gaaaacagta atccagttat tgtggaaaat acctttactg gtagtaccca agatggaatt 4201 gcggttactg gtaacagtac ccccaacatt cgcaataatt tcttttatca aaatggagcg 4261 aatggaatca ccgtctctgg aaattcccaa gccgaggtgc gggaaaatgt gtttcaacag 4321 acgggctttg gcattaatat tacccaaaat gctcaagcct tggtagttag taattctatt 4381 caagacaaca gaagtggcgt tgtagtacaa gccaatgctc gcccaatact acggaacaac 4441 cttattcaag gtagcaaaga agatgggtta gtagccattt ctcaagcaat ccccaattta 4501 ggcaccgcct ctgaaccagg tggtaatcag tttcgcaaca atgcccgcta cgacattaac 4561 
gccagcgccg ccaaggaaat ttttcctgct tttggcaaca gccttgccaa caaccgcatc 4621 aatggcaagg tagacttgac tgggacaaca gcagttgcag acgcaggaat tggcagaaca 4681 cagacgcgtg aaaattatcc agcaaacctg agtgcgtcac cccgtgttcc tctgcccaca 4741 tattctaaca atgcttctag tggattgaac aatcaactac aaccattaag acctgctaat 4801 tctccacttt ctgcaactgc aattaatcaa aaacagtctt accttcaaaa tgctggcttg 4861 ccaactccta ataacttaac aaggtatggc ggacaatctc caactaggat gctaccaccc 4921 cggcaagttt ccccaacaag atcaattcaa ccattagcaa atacatctaa tacatcacga 4981 caagcaaatt atgtacgcgt ttctcctggg agtgtagaat tcactgcacc ccaaacagca 5041 agtaacactg ttgatgcagg gatacaggga acgcttaaca gggaacaggg aagagaggat 5101 gcgaggaatt tacaacagat gactgtgcgt tctcccctct ctccatctgg gcgtcaggta 5161 ccacaactca atgcaacttt ctctaaccca gtcacaccag cacccgtaca aagagtgcag 5221 agtcagggac agtctgcatt accaacacta caagctgcgc ctgtcggcga agcagctctt 5281 ttgcccgttc ccaattccaa tatccctttg ggcaatacta gtaatatgcg aaaagtgcca 5341 gtacctcaaa gatcttcaac aacagcgtat gttgggaatt catccccatc tcctgcacaa 5401 gacactcaaa ctaatttacg ttaccgagtt gtggtggaag tccaaaatga aaaggagcaa 5461 gagttagtca aattccttgc tcctagtgct tttcccactg tctggcgcgg taagggagtg 5521 atgcaagttg gtgtcttcaa cacccgcaat aacgcagata gcatcatcaa aatacttaat 5581 aacaatggtt tgaaaggtgt ggttgagccg gtgaattgaa tcagtgaaca gttatcagtg 5641 aacagtgaac agtgatatca ggttcggttg attacttatc attcctgaag aaccccaccc 5701 cggttttgtc tgacgccaaa accgcgcctc cccgaattcg cggggagggg attaagggga 5761 gccagcgccg tgtggggtgc agtgaacagt gaacagtgaa cagtgaacag ttaactgata 5821 actgttgaac attttgcttc tgggttgatg gaggaagttt ctctctccat catgccattt 5881 ttattgtgtt tgttattgtg tttggattgg gagattgaat aatatcaggt tcggttaaac 5941 acttataata tctgtaggtt ggggaggaca cttctgtgcg ggggttcccc ccgttgagga 6001 atgtgtccgt ttgagggacg aaacccaaca cttgatcctt gttcatgttg ggtttcactg 6061 ccgttcaacc caacctacat taatgatttt attataagta atcacccgaa cttgatataa 6121 tacctacaag aatccgcagc aactcttcta attcttcagg aacacgatag agtttaatat 6181 cttgcaaaat tcgttttaaa aaatctagac atttaatact tccctcatac cttggcaaat 6241 ctaatcgctt 
acgttcataa gtacaatcaa attctgcgag aatgtctgct ggcgctaatg 6301 gtaattcaaa atatttgcta aacgtatagt tagtaccagg ctgaagaata gatggacgat 6361 tcatataatt ttaaatatac gcgggagtgt tactaatagc tataatcatt tcctgtttct 6421 tgtgacagaa aaccctgagg acgtctagcg attagtttct cagaagggtg cagcttttcg 6481 ttagaaattg ctttgagttt tgatgtaaat gactctgaac ctgcactggt ttgcggggtg 6541 aattgctcaa ttgggggatt gttggtagcc ccagagagaa agttatcctg atcattttgg 6601 ttcccagatg cggggctagg ggtttccata gaggaggttt tcgcctgttc cggttctctg 6661 cgttgagatg actgctgttg gggcgagatg gggtgtggtg gtgatttttc tgtatccccg 6721 tgttctgatg tctggttgtt cacggacact ctcctcaagg gaagccattt gggtgttccg 6781 gtgtccacag ttgaggggac tggcggggga aaggtttgca cggaagtgtc ctcttgagct 6841 gttgttgtct ccacataagc tgaggaaatt cggggaactg tggttaggct ttgtggaaat 6901 ttgctgcaaa tcaaacgctc aaattccatg ttaaaatgaa aaattgcttg tccacgtcgc 6961 tgccaacacg ttagaatttg ctgcaccgaa acagctttgt agcgaccctg ataaagtgct 7021 tcaatgactg ccatgtgtag ccagttccct gggtattgct tttgccaaag atccactagc 7081 tcactggcgc tataaccacc gaggtcgaaa ctataatgaa ttagcagggc tattgccaag 7141 tcagcagacg tgtccgggtt tgatgtgaac atctcactct cggcttcagg caggtacaca 7201 tcttgttccc atcccataca gcctatgttg ttttgatcta tagttttttt ggtaagtgct 7261 tccaagtata gaggggaagt ggtgttcatt ttacttttca aattaagaat gggaaaagat 7321 atagggtgaa ttaatggttg gagtttgcct caccgccacc aatgcttgat aaaccatagc 7381 agccacttcc ccccgtgtcg cctctcgatt tgggttgaat tgtttcggat ccggataatt 7441 cacaacaatt ttgttttggg tggcagtagc aactgccgaa cgagcatagt tgggaattat 7501 gttatgatcg ctcaagccaa gtaagctgtt attatcagcg gctttgagac caagtccatt 7561 taccagagag acaattacct gtaatcgaag gacattttgc tcaggacgga aggtgcgatc 7621 gcgaaatcca ccaacaaagc ccccctgcgc tgcaattttt atggcattat aagcccaaaa 7681 atccttttgc acatccgtaa actcagttgc aggatgttta ggagtcgggt taaaacaaac 7741 cgcaactaaa gccgcatact gggcgcgagt gattggttta tctggctcaa aagtttcatg 7801 ggcagaaccg tgagtcaaac ccatcttgag caatgcttgg ataaatcctg ctgcccagtg 7861 tgtcataacc tccgtgtcgg aggcaatctc cacgttagga ttgacaatgt agggagaagc 7921 gattggcaat tctatcccac 
tagccactaa tccctgatag atgcaagctg ctacctcagc 7981 gcgggtgatg tccctgagtg gttctaattg ttcaatttca ggataattca ctatcaataa 8041 gtttttagtt gcaactgcta ccgcatttgt ggcggcacta ggaatttggg cgctatcgcc 8101 aaacacattt aaaatatttg gattaccccc actcagtttg agtccattca ccagagagac 8161 tattgcttga acccttgtta aattttgcat tggtcgaaac gtcccatccg gaaatgcact 8221 gataaactcc atactagcag cacgttcaat agctgacgat gcccagaaac ctcgtttgac 8281 atctgtgaat ttgcgtattt ttttccctga aggtagctga aagctcctag caatgatatc 8341 ggcatactca gcacgagtga taggcgcatc aggtcgaaaa gtgccatccg gaaagccatt 8401 catcaaattc tttttatgta aggcttccac aaaagccatt gcccaatgcc cagccatgtc 8461 tgaaaaggtt gtgttaatta atactgaatt ttgattattc tctttttgat taataaactc 8521 tatcattccc ttaatttgag tagagttcaa attgttacct acagaactta agtggagaga 8581 ggtggtattt tggatatcaa attgcccgtt ctgacggaaa atattaccac cggggtcttg 8641 aggactgccc aaatcgggag cggcattgcc gttcaccaac aatccagttt gagtattgtt 8701 ttcgatcaga ttatttcgca gaataggacg agcatttcgg gaaagggcga tcgcatttct 8761 attttcgaga aaggtattat tcgccacaag cggctcagca gtgtcactga tagccatccc 8821 caaaagattt ttttggaaaa gattgtccag cacttctcct ttgctatcgc ctgccataac 8881 caacccactg acaccgtttt gcagaaatac attgtcctga atcagtggtt tagcactacc 8941 gctgacaaac acagcttccc gaccgcattt gataaaagta ttattagcca aagttgggga 9001 agtggattca atccagacac cagtcccttt ttttgtagga tttacaacag taatgcccat 9061 caattgagca taatttagta ataccagcgt aatattttgt atcccaaaac tttcgctttc 9121 atactttcca ctgcccgaaa tcacaatccc ttgacctttg gttgcttcat gaccaatcag 9181 catcacaccc actggaatca ccagtggaaa tatttcacca ctagcagtgt tgtaagttcc 9241 acaggctagc tgaatgactt tgggtgttgt ggttaccttc agggcacggg taattgtttt 9301 atagggattt aaccgtgtcc cagcattggc atcattgcca attgtagggt tgacataaat 9361 tgtgagtgtg gcaacgacag tagagttgac catttagtga ctaataaaca aattttgaat 9421 ctcacaatca taaataactc tacaactgcg tttacacaag cacacctatg taactttttg 9481 acgctgattc cggttttgat ccccgttggg ggaagaggtt ataccaattc aaaattcaaa 9541 atcgcgttcg cgcagcgtgc ccgaagggct cagcgttgcg agcgaagcga agcaatctca 9601 aaatgaagaa agcctaagat aacaagggtt 
tgggaatgtg tatcagtcgc attctttttt 9661 caaattggta ttatcaaaaa cgctgagatc ttgcaccatt ctcaagaaca caacctcgtc 9721 cgcctgggaa tcaattccca ggctcacagt tcaagtctac taaagtagac tcaaaagctt 9781 atgcagtcgt ctttagacga cttttactat gagactaggg tttaaaccct aggcggttgt 9841 tggcacaagt gcaagatctc agtttgagca gtgagtttta acaatcgaat tagcggcata 9901 ggtcgttttc atcccctgac ggggaagagg tgttgtaaag caagtgccag taatgccaaa 9961 accgcaccgt ggtagataaa tgtttccatc ccctgacggg ggagaggtgt tgtaaaggaa 10021 attttggaaa catttgaata cccgtctgat gatgtttcca tcaccttacg gggaataagt 10081 attgtaaagc acacgtcaga ttgtcgaaga aatgggtgct cttacgtttc catccctttg 10141 cagggaaagg gttctgtaaa gaattccaaa ctttggcttt tcatccgctg aagttgtttc 10201 catccctttg cagggaaagg gttctgtaaa gacaaccagt cagacagtag caccagctgc 10261 gacatcgacg tttccatccc tttgcaggga aagggttctg taaagagttc acttgtagaa 10321 cccttactgg gagagggttt gagaccccca aatcgacacc acttttttga ttgtcaataa 10381 tcgccagatt attctcaata attaggtcat cttgtaagct ggaaaccttg ctatacaagc 10441 aatcgacacc actcaacgaa gttatgcggt tttcaaggat cgggggagtg gtgtcgatga 10501 agttcaacac actcttctaa aaatagaatg tctcacctat aattgtcaag tttttgacca 10561 actccacact caaaagtgca gtggttcctc tgtactcaac actcagcaac tggtttgttg 10621 ttttatcatg gtttgttaaa acagtgaaca gtgaacagtg aacagtaaac agtgaaggag 10681 tcaggagtca ggagtcagga gtcagtaggg gcgcaaggca ttgcgcccgt acaggagtcg 10741 caattcttta tttcttcctt ctgaatactg aattcaccaa ttgctgaatt cttcttcaaa 10801 ctgataactg gtaacccttc gggttcgcag tcgcctacgg agggagaccc tcctgcagcg 10861 ctgtctcact gataactggt aactgtaaaa gatgaattac catttgcacc tagttaaaac 10921 ggatgatttc agttgaacat ctgagtaaaa catacggctc tacctcagca attactgatg 10981 tcactttcga cgtcgaacca ggagagattt tggggttttt gggacctaat ggtgctggca 11041 aaactacaac catgcgaatt ttggctggtt atttgcctgc gacgagtggg actgcgcgga 11101 ttgctggctt tgatgtccat gacaattctc tgttggtgcg tcaacggatt ggttacttac 11161 cagagacgcc gccgttgtat ccagagatga cggtggaggg atttttgtat tttgtggcgc 11221 ggattaaggg agtttcggcg ggcgatcgca ccaccaaagt gacagcagca atcgaacgct 11281 gcaatttaca agaaaagcgt 
cacgttatta ttcgcaaact ttctaaagga tatcgtcaga 11341 gagtcggtat tgctcaagca attgtccacg atccaccagc catcatttta gatgaaccca 11401 cagtcggact cgacccccgg caaatcatcg aggtgcgaaa tttaattaaa agtcttgcgg 11461 gaagccacac aatcattatc tctactcaca ttttgccaga agtgagcatg acttgtagcc 11521 gcgtggcaat catcaacggt gggaaagttg tcgcaacaga tacaccagac aatctcatga 11581 accaattgac aaaaggttca ggatatgaga tagaaattga gggagaagcg ggtctcgcca 11641 aacaagtcct gcaaaatgtc gcaggggtaa gctttgtgga atcaatttct gcggtaggaa 11701 tgcacagtca tacctcctta aaggaaaacc gaacatacct gcgggtgata tcacaaccag 11761 gaactgaacc aggaaaagat attgcagcaa cgttgatcgg aacaggattc ggtttactag 11821 aaatgcggcg tgttaacgct actctagaag atgtattttt gcaattaaca acagaagaaa 11881 aaactttgga gactgaaaca gaaatcgcag aggcaaagga aggagaagca gcctaaatgg 11941 gtgtagtact gggtaatatt attgccattt atcgccgaga gttacaaagc tattttgtat 12001 cacctttggc gtatgcaatt gctggtgttt tttggcttct atctggatta ttcttcgtgc 12061 tgattttgat gggaccagaa ggtatcctgc aaacagtgac tgcattagat ttacaaggac 12121 aacaatttgg agtcccagtt ccaccaatag atgttcctta cgaacttgtc agggcatttt 12181 tggatcgaat gggatggcta ttattatttg tcttgccaat tctttctatg ggactttatg 12241 ccgaagaacg caagcgcgga accttagaac ttctcgccac atcaccagta acaaactggg 12301 cagtagctgt cggtaaatta ttaggagttt tgacattttt catcacaatg gtagtgccta 12361 tcatgggatt agaaaccatt gccttgagtg cgtcaagtcc agcaatgcca ccaacaattc 12421 ccttactagg gcatttagca ctcatcttac tagcagcagc tattttatct ttaggaatgt 12481 tcatttcttc tttgacagac agtacaattc tgtctgcact cttcacattt gcattaatgt 12541 tattactctt gttcgttgat ttaattgcca aaaatattgg tggttctata ggagaagcgc 12601 taggacatct atcattgctg aaacattaca acacattagt acaaggtatt ttcgatacga 12661 gcagcttgct tttatttgct agttacatta ttctcgggat atttctcaca gctcaatcaa 12721 ttgatgcatt gcgttttcaa cgttcgtaga ggaggaaaga gggaacaggg aacagggaac 12781 agggaacagg gaatagggaa cagggaacgg aggagtaagt taagaggttt ttaaatgcca 12841 gagattaatg attttaaaga cttaaaaatt tggcaacaag gtatggagat agccgaaaag 12901 tgttattttt tgactcaact atttcctaaa gatgagttat atggtatggt gcaacaaatc 12961 
aggagatctg cggtatctat tccagctaac atagcagagg gatacggaag aagaacaaca 13021 cgtgagtatg tcagatttct gaatatcgcc caaggctcaa ttaacgaatt agaaacacat 13081 attattttat ctctaagggt aggcttatct aagcaaaaag atatagaata aattattttt 13141 ttacttcgag aggagagtag aatgattatt gctcttatta aaaagctaga atcatgactt 13201 ttgttcccta ttccctgttt cctcttccct gttccctgtt ccctgttccc tgttccctct 13261 ttcctctttc ctcaaatgaa acttatcgct caaaagaaac ctttaaaaat cttattttgg 13321 tttggtccct tcctccttgc agcaggctta acatctggat tcgcatcgga taattgggga 13381 ccaattcaac tcgcattgat aattttagga acagtcatca ttgtattgtg gctgatatgg 13441 caaaacaagc agaataactg gtggggacaa cgttctactc aagttagtac taacgctgtg 13501 attgcgactt tagcagtttt agcgatttta ggattgatta actttttagg aactcgctac 13561 catacacgac ttgatttcac agaaactaag ttatttactc ttgctcccca gtcacgggaa 13621 ctggtacgct ctttacaagt acctgcaaaa atatggttgt ttgacgttaa tcaagaccct 13681 gtagatagag acttactaga aaattatcgt cggcaaacct ctaagtttag ttttgagtat 13741 gtagatccac aagcaagacc aggattagct cgtaagtttg gtgtcaaaga ctatggagaa 13801 gtttacttgg aatttggcga taaacgacaa ttagttcaag tcgttggtcc ccaagaacgt 13861 ttatcagaag taaaattaac caatcgcctg caacaaatca gcagtataag ctctgctaaa 13921 gtttacatcc tccaaggtca cggcgaacac gaactttctg gtaaagagga aggagttata 13981 tcgcaagcgg ttaaagcatt aaatgacaaa ggttacacca cttcagccct gaatctggca 14041 gaaaaattga gtgttcctca agatgctaat gttgtggtag ttgcaggacc gaagcgatcg 14101 ctctttgaaa gcgaagtcaa agcactacaa gactacctca atcgaggtgg aaatttactg 14161 ctgatgattg acccagatac agaccccaaa ctcgaaagct tgcttgacga gtggggtgta 14221 aaattagata atcgtctggc agttgatgtt tctggaagcg ttggacttgg tcctgctgct 14281 cctttggtaa ctcaatacgg aaaacacccg attaccaaag attttggcaa cggtatctct 14341 ttttatccct tagcacgacc gattgacaca acttcagtac ctggtattca ggcgactccc 14401 ttgttactca ccaaagctta tcctaatagc tgggcagaaa gtgatcagca aagcgaaaac 14461 ttgaaattta atcctgagag cgatcgcaaa ggtccactca cattaggcgt agcattaaca 14521 aagaaactat cagcgaaatc tgaagctaca tctaactcta ccctcacacc gataatagct 14581 gcaaccccag ccaaaccc // LOCUS 
NODE_2339_length_14393_cov_5.229181 14393 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14393) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14393) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14393 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1485) /gene="dndD" /locus_tag="DP116_19315" CDS complement(<1..1485) /gene="dndD" /locus_tag="DP116_19315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315873.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA sulfur modification protein DndD" /protein_id="PRJNA477356:DP116_19315" /translation="MIFIELVLQNFGPYNGRQVINLNPQENDNSRPILLLGGMNGGGK TTLMDAIRLALYGPRAQCSTRGNLSYGDFLTQCVNSHTPAIEKTRIELLFEHIENDHP VKYRIVRTWEKNPKDGKDNLGILELDIAKQDDWLREELVNTWDDYIENLLPLGISNLF LFDGEQVKELAEQEIPPPTVVDAIRGLLGLELAERLGVDLEIVVNRKRKEIADTKDLV NLEEIEKRLKQQQAEYEEKTKQLEKLTTELQKSEKQKQEAFDTFVYEGGKIAAERNQL ELQKKQKTAEVEQARQGMCQLAANVLPLALIEPLLTQAQRQGEKEFRIQQAQVARDIL FERDQRLLNWMTQVGISEEQFEKIKVFLEKDEETLRARLIQPEESWLLADAETLSQLG NVFYYLQNDKKIAKQQIGILKNKEEDIVTLERQIQTAAEPEAYKQLVDALEAAQNKVS QIQAASVVTKRRCDELEAEIKNIKKDLQEYSKQNIDRKNYEHIIT" gene complement(1752..3368) /gene="dndC" /locus_tag="DP116_19320" CDS complement(1752..3368) /gene="dndC" /locus_tag="DP116_19320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859404.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA phosphorothioation system sulfurtransferase DndC" /protein_id="PRJNA477356:DP116_19320" /translation="MTTAQQQENKGQAQRTVSELVEDIENLTIEIQELYCLDAIPWVV GYSGGKDSTATLQLVWNAIAQLPPKKRTKAIHVITTDTLVENPYVSAWVRNSLKQMKL AALEQELPFEPHLLQPEVKETFWVGLIGKGYPAPRGKFRWCTERLKINPSNRFIRDVI RTNGEAILVLGTRKAESTKRAGRMKKWEAKRVRDRLSPNIHLPNSLVYSPIEDWRNDE VWLYLMQWENPWGYSNKDLFVMYRGASADNECPLVVDTSTPSCGSSRFGCWVCTLVNQ DKSLTAMIQNDEEKEWLQPLLDFRVELDVEENRNRRDFRRRNGDVQLYERNLDGEISV EPIPGPYLKEAREDWLRKLLTIQRQIRRTAPENMRDITLITTEELSEIRRIWLEERHE FDDSLPHIYKEVTGEPFIDPRPGAGNSLLGSDEWAVLEEICEEDAMHLELMAKLLDTE RQYRKMSRRVGIYDALAKCFETSSRSPDEAIKNAHLKRDLKAAASEADIEKVKQLTLG DVTVSETAKAQNWANIKFKNKDSVDENLGE" gene 3716..5314 /locus_tag="DP116_19325" CDS 3716..5314 /locus_tag="DP116_19325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455568.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19325" /translation="MSNIPADSTTDITSRYIEQDNKDKNLLACVLDKYLGRNDQILVQ KTQMGGIQAYVGSVTLEWFASRVHFASCLPLLQKKYNPQTDNIEIDADSIDEIQQRPL DWSRQAALVQYLATRQHHKFPPVLVVINQPWVDNPKAPEWNSQGRATKSTTEFTPLDK DGQFGLLNLSQENVNIYALDGQHRLMGVQGLMELLKTGKLSRYRKDKTPSNTFMTVDE LVEQYHISPAYLQSLRKEKIGIEFICAIAAGETLEEARLRVRSIFVHVNLMAVPLTKG QLAQLNEDDGFSIVARKIAVTHPLLEQREDRKPRVNWNSATVAAKSTVLTTLQALKEM SEKYLGQKFLHWKPVDKGLIPMRPEGEELYEGIKDFRVLFDYLATLPSYVILEYEDTP ALRRFSFEKDGGEGNMLFRPVGQVALTQALGILVFKKGFSLEDIFKKLCQFDREGGFS GMEHPKSLWYGVLYDPNKKRVQVAGRDLAARLLIYILGGIQDSIERAELRKDLAKART VENKTIGFDANFVEPKQVGLPSVL" gene 5552..6727 /locus_tag="DP116_19330" CDS 5552..6727 /locus_tag="DP116_19330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188357.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19330" /translation="MVKTNDPDTNNISSEVKAKFNGFIEPFFSEHYRESCYPGLIFHQ GKRKMLQINVPAKDLPTLLQAKPSKDNDPDSGKNRPEVKGHAEEIKDYVVERAKADKP WVLGTITANVDQQHIKIQELGRGICIVVIPNGVKLDITDGQHRKSAIHELIFSDESHL ISHDDFAITLILEGDKRQCQTDFRDMAQTRQLDKSLLLSFGKFEGRVGITKNLIEQVR MFKEKTEKIKASPAKQLIYTTNYIARFVSYVFADDPNNQLQDIDVEQSSEALGECLNQ FFSKCRDTHDISESKEKPTINQVAAFKEYSILGMSVGLEVLGRLLHCTYDKDRKYFDV DKVSQLAQLDWSRKNSLWENNIVRKAINSDKKVYRVSNSPSAVKDAVVAVKTTLGWI" gene complement(6754..7227) /locus_tag="DP116_19335" CDS complement(6754..7227) /locus_tag="DP116_19335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA phosphorothioation-associated protein 4" /protein_id="PRJNA477356:DP116_19335" /translation="MAETGRIRVAKDKAELVKALTSVDGATGPFQTYADVIVFAAALG AKHKRRVPLGEISKREPSPIRLEYFATMGHDWVIKLLGMTETKNLKILSPNEEEYEHK RNQIFEEYANGGLEILQKELWGAVDYCERVLLMLSAERFNQEQQDEEFDLSKFLS" gene complement(7402..7977) /locus_tag="DP116_19340" CDS complement(7402..7977) /locus_tag="DP116_19340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315879.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_19340" /translation="MVASSNRQYMTPQEYLEWEERQDIKYEYINGEVFAMTGGTIPHT TIALNLASTLKSHLRGRGCRAFMADAKVGVSENGPFHYPDVVVSCDQRDKQAMKFLLY PCLIVEVLSPSTEGYDRGGKFYQYRRIQTLREYVLIDAEKISVECFRLNEKSIWELHP YEEGDEVHLTCVDFHFPISLVYEDVQFLNEG" gene complement(8245..8436) /locus_tag="DP116_19345" CDS complement(8245..8436) /locus_tag="DP116_19345" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19345" /translation="MKEIQQIHSQVVLELPEYWIMVFKNSSLAVLIAPVVTLSTYSAL TFSIFDNLWAIDCLNEMSP" gene complement(8572..10650) /locus_tag="DP116_19350" CDS complement(8572..10650) /locus_tag="DP116_19350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115419.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_19350" /translation="MKLASIKLCNFRSFYGKTPEIILAGGDTRNTTIIHGNNGSGKTS LLNAFTWVLYEKFSAAFASTEQLVNKRAIAEAKPGQAVECWVEVEWEHDGKRYNVKRL CRVYKNETDFNITKTELRMQVAGDDGRWYFPPQQPEDVIGQILPMSLYQYFFFDGERI EEIVRSDKKAEIAEATKIFLGVEVINRSINHLKEAKKSLENELKAIGDSGIQQLLKQQ DKIEQEIEIILKRQTEIQQELEYQDTFKKETSNRLRELSAAKELEERRQELEKQKASS QENLRESREAIKKAISGRGYTILLSQNTAQFREIIDDLKQRGELTSGISREFVNELLQ SQRCICGADLEEGSHSHENVRKLLDKASSSVVEETAIRMSAQVDEIDKQAVSFWEEVD REQVRINQLRQTISKIEGELDNIQERLRKDANEEISSLQKRLDEIEDKIRDLILEQGA NQQQIANLKTELEGLRKQIAKQKLNEDRQALAQRRISATQDAIERLTEVRNRQEKQFR WQLEKRVQEIFSEISFTPYIPKISDKYELTLVENTSGIEMPVAASTGENQILSLSFIA SIIDRVREWSEKKKILLVPDSSTFPIVMDSPFGSLDEISRRQIAKTIPKLANQLIVLV TKTQWRGEVEEEMAGKIGREYVLTYYSSKPDCEQDYIELAGERYPLVRQSPNEFEYTE IIEVMRERSF" gene 10947..11465 /locus_tag="DP116_19355" CDS 10947..11465 /locus_tag="DP116_19355" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19355" /translation="MISHETWLNNVIVSDPKLFQIPICVASLHFVPMLPATQHKWKVP EISPETLVLSFANKSFPTIESWFMALLRLSALDDLLTGVGFLYSREQGTLNRLDGLVV SSLSPKSDLSCFKRCCKKLYLSGRRRLLSLLPNRSKITYNNLLLSLFLPLCASSLASC GSKINIFLADQE" gene 12222..>13535 /locus_tag="DP116_19360" CDS 12222..>13535 /locus_tag="DP116_19360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19360" /translation="MQPTQIQTLDRHNRFGIATQQNQSSSVTYRNLVFIDTAVADYQT LMSGVEKGTQVILLHPEWDGVEQITTALSQQADDLTTVHIVSHGSPGCLYLGNSCLNL KTLEFYASLLERWFPKGSTPSLLLYGCNIAATDIGIEFLAKLRQKTKAQIAASTTPTG HPALGGNWKLEVTTEKMTISLAFPVATQVAYTGVLNPNRVSVGASGSQTNDNSLRPAI SASGRYIAFRSDASNLVANDTNNFSDIFVYDTDTGITNRVSVGPSGVEGNNAANGGPA ISASGRYVAFESYASNLVADDTNNFSDIFVYDTQTRTTSRVSVDLQGNQGNSVSSSPT ISGDGRYVAFESYASNLVADDTNNFNDIFVYDTQTRTTSRASVNSQSNQGNNASFSPA ISADGRYVAFDSFASNLVPEDTNNTRDIFVYDTQTRTTSRASVNSQ" assembly_gap 13536..13545 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 13658..>14393 /locus_tag="DP116_19365" CDS 13658..>14393 /locus_tag="DP116_19365" /inference="COORDINATES: protein motif:HMM:PF07676.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19365" /translation="MAFESYANNLVPGDTNDTGDIFLYDTQTRTTSRVSVDSQGNQGN NESFSPSISADGRYVAFASYANNLVPRDTNDTADIFVYDTQTRSTKRVSIDSQRNQGN DLSYNSVISASGRYVAYESYASNLVVGDTNNSRDIFLYDTITNNAPTNLALSATSVDE KVPVATEIGAFTTTDPDTNDQHTYSLVTGTGDADNAVFSINGNKLLINNSPNASKSSY NIRARTTDKGGLYFDKQFTINVNLSVN" BASE COUNT 4013 a 3167 c 2724 g 4479 t 10 others ORIGIN 1 ggtaataata tgttcgtagt tttttcggtc aatattttgc ttactatatt cttgtaaatc 61 cttcttgata ttttttattt cagcttctag ttcatcacag cggcgttttg ttacaacgct 121 tgctgcttga atttgggaaa ctttattttg tgctgcttct agtgcatcaa ctaattgttt 181 ataagcttct ggttctgctg ctgtttgtat ttgtctttcc agagtgacaa tgtcttcttc 241 cttatttttg agaataccaa tttgctgctt ggctatcttc ttatcatttt gcaagtagta 301 gaaaacatta cccaattgac ttaaagtttc agcgtcagct aataaccaag attcttccgg 361 ttgtatcaac cttgctcgca atgtctcttc atctttttct aaaaatactt tgattttttc 421 aaattgttct tcggaaatac ctacttgagt catccaattg agtaaacgtt gatctcgctc 481 aaataatata tcccttgcta cttgtgcttg ttgaatgcga aattcttttt ctccttggcg 541 ttgtgcttga gtcagtagag gttcaattaa 
cgctagtggt agaacattag ctgctaattg 601 acacattccc tgacgtgctt gttctacctc agcagttttt tgcttctttt gtagctctaa 661 ctgatttcgt tccgccgcaa ttttaccacc ttcataaaca aatgtatcaa atgcttcttg 721 cttttgtttt tctgactttt gtaattctgt tgttaatttt tctagctgtt tcgttttttc 781 ttcgtattct gcttgttgct gttttagtct tttttcaatt tcctctagat ttaccaaatc 841 tttcgtatcg gctatttctt tacgtttgcg attgacgaca atttctagat caactcctaa 901 acgttctgcc aattctagcc ctaaaagtcc gcgaattgca tcaacgacag taggtggcgg 961 tatttcctgt tctgcaagtt ccttaacctg ttccccatca aagagaaaca agttagaaat 1021 tcctaaaggt agtaaatttt caatataatc gtcccaagtg ttaactaatt cttctctgag 1081 ccaatcatct tgtttagcaa tatctaattc taaaatacct aaattgtctt taccatcttt 1141 cggatttttt tcccaagttc gcactattcg gtatttgact gggtgatcat tttcaatatg 1201 ttcaaacagt aattcaatcc gtgttttttc aatcgctgga gtatgactgt taacgcattg 1261 ggtgagaaaa tcaccatagc ttaaattacc acgggtagaa cattgagcac gaggtccata 1321 aagcgcgaga cgaatcgcgt ccatcaaagt tgttttacct ccaccgttca ttccacctaa 1381 taagagaata gggcgagaat tatcattttc ttgtgggtta aggttaataa cttggcgacc 1441 attgtaggga ccaaagtttt gtaatacgag ttcaataaat atcatgtggg tggatacttt 1501 gcgagaaaag atactaatgg gactttctaa aaaaatcggg tcaaaatgcc tgcacagatt 1561 gagtaagggt tttacatcga ttaattactt ttgtttgatt taactgactt tgagccgttt 1621 ttacgtgacc gacgtatttt ctttgtctag gatgggtttt agatactttt tctgcccgaa 1681 atttctaaag tcccactaag taaaatacac taggcgaata gaattcgcct ggtttctagt 1741 ttatggcaat tttattcacc taaattttca tcaaccgaat ctttattttt aaatttgatg 1801 tttgcccaat tttgcgcttt tgctgtttca gaaactgtca catcacccaa agttaactgc 1861 ttgacctttt caatatcggc ttcacttgct gctgctttca aatctcgttt caaatgggcg 1921 ttcttaattg cttcatctgg agaacgggaa cttgtttcaa aacatttggc tagcgcatcg 1981 taaattccca cacgacgcga cattttgcga tactggcgtt ctgtgtctaa aagcttagcc 2041 atgagttcta aatgcattgc atcttcttca caaatttctt ccagcaccgc ccattcatca 2101 ctacccagga gactgttacc agcaccagga cgcggatcta taaaaggttc gcctgtcact 2161 tctttataaa tgtggggtaa actatcatca aattcgtgtc tctcttctag ccatatacga 2221 cgaatttcgc tgagttcttc tgtagtaatc agggtgatat 
cacgcatatt ttctggggct 2281 gtgcgacgga tttgtctttg aatagtcagg agttttctca gccagtcttc ccgcgcttct 2341 ttaagatagg gaccaggaat aggctcaacg gatatttctc catctaagtt gcgctcatac 2401 agttggacat caccgtttct acgtcgaaag tctcgtcgat tgcggttctc ttcaacatcc 2461 aattccacgc ggaaatcgag aagaggttgt aaccattctt tctcttcgtc gttttgaatc 2521 attgctgtca atgatttatc ctgattcact aaagtacaaa cccagcaacc aaaacgagaa 2581 ctaccacagc taggagtcga tgtatcaaca actaaaggac attcattatc agcgctagcg 2641 cctctataca ttacgaataa atctttgttg ctgtaccccc aaggattttc ccactgcatt 2701 agataaagcc aaacttcatc gtttcgccag tcttcaattg ggctgtaaac tagggagttg 2761 ggtaagtgga tgttggggct gaggcgatcg cgcacccgct tggcttccca tttcttcatt 2821 ctcccagcac gttttgtgct ttcagctttg cgagtaccca aaacaagaat agcttcaccg 2881 ttagttcgga tcacatcacg aataaagcgg ttagatggat ttattttcag gcgttctgta 2941 caccagcgaa attttcctcg tggggctggg taaccttttc ctatcaaacc tacccaaaat 3001 gtctctttaa cttctggctg tagcagatgg ggttcaaatg gtagttcttg ctcaagagcc 3061 gccagtttca tctgttttag ggaattgcgt acccaagcag aaacataggg attttcaacc 3121 agcgtgtctg ttgtgataac gtgtattgct ttagttcgtt ttttgggtgg aagttgtgcg 3181 atcgcattcc aaaccagttg caaagtcgcc gtagaatctt tcccgccaga gtatcccact 3241 acccagggaa ttgcatccaa gcagtataac tcttggattt caatcgtgag attctcgatg 3301 tcttccacta actctgagac agtccgctga gcttgacctt tgttttcttg ctgttgtgct 3361 gtagtcattt tgatatccct gttcaaccgc ctgataaccc tcagactgaa aatctgtggt 3421 gacacagact aaacccctcc gggtttgcag tcgcctctgt cgggaaaacg ccaggtgctt 3481 caacgggggg aacccccgca acgcactggc tcccctcctg cagcgctgtc tcaccaccta 3541 cgcgggtttt caaacccttg tcgtttggtg tgcttatcaa ggcaagtttt ctgcaagggc 3601 acacctagta ttacataaat ttgcacaaaa gacggttttt aaaaacttgt ttttttaact 3661 tcctatttct catatattta accaaagcta cgattttaaa tttttaaaaa ctcaaatgag 3721 caacattcca gctgactcaa caacggacat caccagtcgg tacatcgaac aagacaacaa 3781 agataaaaac ttacttgctt gtgtgctaga taagtatctt ggcagaaacg accagattct 3841 ggttcagaaa actcagatgg gtggtatcca ggcgtatgtt ggttctgtca ccctggaatg 3901 gtttgcaagt cgggttcatt ttgcgtcttg cttacccctg ctccagaaaa 
agtataaccc 3961 tcagactgat aacattgaga ttgacgcgga tagtattgat gaaattcagc agcgtcccct 4021 tgattggtca cgtcaagcag ctttagtaca gtatttggca actcgtcaac atcataagtt 4081 tccaccagtt ctagtagtta ttaaccagcc gtgggtagat aatcccaaag cgcctgagtg 4141 gaatagtcag ggacgagcta caaagtctac cacggaattt acaccactgg ataaagatgg 4201 tcaattcggt ctactcaacc tttcccagga gaatgtgaac atttacgctt tggatggtca 4261 acatcggctg atgggggtac agggtttgat ggagttactc aaaactggca aactaagccg 4321 atacagaaag gataaaactc cttccaatac tttcatgaca gtggatgagt tggtagaaca 4381 gtaccatata tcgccagctt acctgcaaag cttgcgcaaa gaaaaaattg gtattgagtt 4441 tatttgtgcg atcgcagctg gtgaaactct cgaagaagca agactacggg tgagatccat 4501 ttttgttcat gtcaacttga tggctgtccc tttaaccaaa ggtcagttag cacagctcaa 4561 tgaggatgat ggtttttcta ttgttgcgag aaagattgct gtgactcatc cgcttttaga 4621 acagcgtgaa gataggaaac cccgcgttaa ttggaatagt gcgacagttg cagccaagtc 4681 aacagttttg acaacactac aagcactcaa agaaatgtct gagaaatact tgggacaaaa 4741 gttcttgcat tggaaacctg tggacaaagg tctcattccc atgcgaccag aaggtgagga 4801 actttatgag ggaataaaag attttcgagt actctttgat tatctagcta ctctaccaag 4861 ctatgtgatt ttggaatacg aggacacgcc tgctttgcga cggttcagct ttgagaagga 4921 tggcggcgaa gggaatatgt tattccgtcc tgttggtcaa gtggcgttaa ctcaagctct 4981 cggtattttg gtttttaaaa aagggttctc cttggaagac atctttaaaa agctttgcca 5041 gttcgaccgg gaaggtggtt ttagtggaat ggaacatcca aaatctcttt ggtatggagt 5101 tttgtatgat ccaaacaaaa agcgggtaca agttgctgga cgagatttag cagcaagatt 5161 attaatatat attttgggtg gtattcagga tagtattgag cgtgctgaac ttcgcaagga 5221 tttggctaaa gctagaactg ttgaaaataa aacaataggt tttgatgcta actttgttga 5281 acccaagcaa gtaggacttc catctgtctt ataatttgca gagttagcgg ggatattcgt 5341 atatttgcaa gtagtctaaa aaccaaagaa tttagagaaa gcactaaatc ttttaagttt 5401 cagtggagag agcttagatt ttgtccacga gttctgccaa gctatattaa gcctggtctt 5461 ttgactgctg cattattgta cttcctaata accaagcata ttacctgagt ctataattat 5521 aaaaaattat acagttttca agcgtactat tatggttaaa acaaatgacc ctgataccaa 5581 taacatctca tctgaggtga aagctaaatt taatggcttc attgagcctt tcttttcgga 
5641 gcattatcgg gagtcatgct atccagggtt gatttttcac cagggaaagc ggaaaatgct 5701 gcaaatcaat gtaccagcta aagacttacc tactcttctc caagctaaac cctccaaaga 5761 caatgatcct gattcaggta agaatcgccc agaggtcaaa ggtcatgcgg aggaaataaa 5821 agactatgtt gttgagcgtg ctaaagcaga caaaccctgg gttctaggga caattacagc 5881 caatgttgac cagcaacata ttaaaataca agaattgggt agaggaattt gtatagtcgt 5941 tattcctaac ggagttaaat tagatattac ggatggacag catcgtaaga gtgcaattca 6001 cgaattaata tttagtgatg aaagtcattt aattagtcat gatgattttg caattacgct 6061 gattttagag ggagacaagc gccagtgtca aactgacttc cgagacatgg ctcaaacaag 6121 acaactagat aaatcgttgt tgttgtcttt tggtaaattt gaaggtcgtg ttggcattac 6181 taaaaacttg atagaacaag tgcgaatgtt taaggagaaa actgaaaaaa ttaaagcgtc 6241 tcccgcaaag cagttgattt acacaacgaa ttacatagct aggttcgtaa gttacgtttt 6301 tgctgatgac ccaaataatc agcttcaaga tattgatgtt gagcaatcat ctgaagcctt 6361 gggtgagtgc ttgaatcagt ttttctcaaa atgtagagac acacacgata tttctgaaag 6421 caaggaaaaa ccgacgatta atcaggttgc tgcattcaag gaatattcta tactggggat 6481 gagtgttgga ctcgaagttt tggggcgatt gctgcactgc acttatgaca aagatagaaa 6541 atatttcgat gtagataaag tttcacaact agcacagcta gactggtcac gaaaaaacag 6601 tctgtgggag aataatatag tcaggaaagc aataaattct gataagaaag tctacagagt 6661 atctaacagc ccaagtgctg taaaggatgc agtggttgcg gtgaaaacca cactgggatg 6721 gatataagtc attattcttg agcgtacctt caatcaagac aggaatttgc tcaaatcaaa 6781 ttcctcgtct tgctgctctt ggttaaatct ttcagcgcta agcatcaaca aaactcgctc 6841 gcaataatct accgctcccc ataactcctt ctgtaaaatt tccaatccac cattagcgta 6901 ctcttcaaaa atttggttac gtttgtgttc gtactcttct tcattaggcg ataatatttt 6961 aagattttta gtttcagtca ttcccagtaa tttgatgacc caatcatgtc ccattgtggc 7021 aaagtattct aatctgatgg gggatggttc tcttttagaa atctccccca gagggacacg 7081 ccttttatgc tttgcaccta aagcagcagc aaacacaatc acatcagcat aggtttgaaa 7141 aggaccagtt gcaccatcaa cagatgttaa agcttttacc aattcagcct tatctttagc 7201 aaccctgatt ctaccagttt cagccatttt acttaaatat agtttgtgac aatcttaact 7261 cagagatgga gcgatatgcg cagcgtcaag cctccgactt atcgccttct cttccctctg 7321 
cgtcacgcca ggtgcttcaa gtcgggaaac ccgcccaacg cactggcttc tctgcgcctc 7381 tgcggttcat ttctcttaaa gctacccctc attcaaaaac tgaacatctt cataaaccag 7441 agatatagga aagtgaaaat caacacaggt taagtgaact tcatccccct cctcgtaagg 7501 atgtaattcc cagatacttt tttcattcag ccgaaagcat tctacactga ttttttcagc 7561 gtcaataaga acgtattctc tcaaagtctg aatgcgacgg tattggtaaa atttaccgcc 7621 tctgtcataa ccttctgtac taggcgaaag gacttccaca atcagacaag gatagagaag 7681 gaatttcatc gcctgtttat ctcgttgatc acagctaaca accacatctg ggtagtgaaa 7741 tggtccgttt tcggatacgc ctactttcgc atccgccata aaagcgcgac aaccacgacc 7801 tctgagatgg ctttttaatg tcgaagccag gtttagagca atggtagtat ggggaatagt 7861 accgccagtc atggcaaaaa cttcgccgtt aatgtactcg tacttgatgt cttggcgttc 7921 ttcccattcc aagtactctt ggggtgtcat gtactgacgg tttgaacttg caaccataac 7981 ccaatgtttt gctaaagtgt tttcacgata ctattgcttc aactttaact ttttgcaatt 8041 tctgttgatt aaccatacag cagatactta actcgcactt tctccctttt gcatcctcca 8101 cgtctttgct gtttttccct ccaaaagcag tttcacaaca cttcacaaaa ctttatgaca 8161 aagcgtccaa catcaacacc acaaactgcc tacgcgtgat agtaagatag aaatagaaac 8221 ttccacatag tagttctcca ggcattaggg tgacatctca ttaagacaat ctatagccca 8281 aagattgtca aagatagaaa aagtaagggc gctgtaagtt gatagagtca caacaggggc 8341 gatgagaaca gcgagagaag agtttttaaa aaccattatc caatattcag gtagttcaag 8401 aactacctga gagtgaattt gttgtatttc cttcatcgga agcaacaata ctctcttttc 8461 tactctctgt attctctacg cttctggggt ttatgacttc tatcaactta ggcaaaagca 8521 atgtagagac gcgaggtttt gtgtctctac attgtttttg tgtgtgaaag ctcaaaaact 8581 tctttcccgc atcacctcaa taatctctgt gtactcaaac tcattcggac tttgtctcac 8641 caaaggatac cgttccccag ccaactcgat gtaatcttgt tcacaatcag gcttagagga 8701 atagtatgtc agcacatatt ctctaccaat tttacctgcc atttcttctt ctacctcacc 8761 ccgccactgt gtcttagtca ctaagacaat caattgattt gctaatttgg gaattgtctt 8821 tgcaatttgt cgtcgagaaa tttcatccaa actcccaaag ggcgaatcca tgacaatcgg 8881 gaaagtgcta ctatcaggaa ccagcaggat ttttttcttt tcactccatt cccgtactct 8941 atcgataatg ctagcaataa aagataaact gagaatttga ttttctcctg tggaagctgc 9001 aactggcatt 
tcgatacctg acgtattctc caccagcgtc agttcatatt tatcgctgat 9061 tttaggaata taaggtgtaa acgaaatttc gctgaagatt tcttgtaccc gcttttctaa 9121 ttgccaacga aactgctttt cttgacggtt tctgacttct gttaatcgtt caatagcatc 9181 ttgagttgcg ctgatacgtc gttgcgccaa tgcttgtcta tcttcattta gtttttgctt 9241 ggcgatttgt tttctcaaac cttctaactc tgttttcaag ttcgctattt gttgctgatt 9301 tgctccttgc tctaatatca aatctctaat tttatcttct atctcatcta agcgcttttg 9361 taaactgcta atttcttcat ttgcatcttt ccgcaaccgt tcttgaatat tatctaactc 9421 accttcaatt tttgaaatgg tttgtcttaa ctgattaatc ctcacttgct ctctgtcaac 9481 ttcttcccaa aagctgactg cttgcttatc aatttcatcc acttgagcac tcatacggat 9541 ggctgtttct tccacgacag aagaactcgc tttatccaac aactttctca cattctcgtg 9601 tgagtgactt ccttcctcta agtctgcacc acaaatacag cgttgagatt ggagtaattc 9661 attcacaaat tcccgcgaaa ttccagaggt taactcacct cgctgcttca aatcatctat 9721 gatttctcta aattgtgctg tgttttgtga cagtagtata gtataaccgc gtccagaaat 9781 tgctttttta atagcttccc tactttctct gagattttcc tgactagatg ctttttgttt 9841 ttctaactct tggcgtcttt cttccaattc cttggcagca ctgagttctc gtaagcggtt 9901 acttgtctct tttttaaaag tatcttgata ttccaactct tgctgaattt ctgtttgccg 9961 tttcagaata atctctattt cttgttctat cttatcctgc tgtttcaaaa gctgttgaat 10021 tcccgaatcc ccaatagctt ttaactcatt ttcgagactt tttttagctt ccttgagatg 10081 attgatggaa cggttaatga cttccacgcc taaaaaaatc ttcgtggctt cagcgatttc 10141 agctttcttg tcagaacgaa ctatctcttc aattcgttca ccgtcaaaga agaaatattg 10201 atataaactc atgggtaaaa tttgcccaat cacgtcttct ggttgctgag gtggaaaata 10261 ccagcgtcca tcatccccag caacctgcat acgcaattct gttttagtaa tgttgaagtc 10321 ggtttcattt ttataaaccc gacacaggcg tttcacgttg tagcgtttgc cgtcatgttc 10381 ccactctacc tctacccaac attctacagc ttgtccgggt ttggcttcgg cgatcgcacg 10441 cttattcact aattgttcag tcgatgcaaa agctgcacta aatttctcat acaacaccca 10501 agtaaacgca ttcaaaagac tcgtttttcc tgagccatta ttaccgtgaa ttatcgttgt 10561 gttacgagta tctcctcctg caagaattat ttctggtgtc ttaccataaa aagagcgaaa 10621 gttacaaagc ttaatcgaag ctagcttcat cgcactgctt cctttacaat cgccagaata 10681 tcatcattaa 
tatgggtgtt aattttctta tctagcctat ttttctccag ctgccatacg 10741 cgatcaataa tttgccttac ttctggtggt gcactactaa ttggttcttg aacataatca 10801 atattagttt cgtttgtcat cgggcttttc agatctaatt tttagacatt atcatattat 10861 atggtttcac aactatcagt aaataaacca gcaacaatat tttatcaaaa agttaaaatc 10921 tcaaattctc atgaactaat aatcagatga tttcacatga aacctggttg aataatgtca 10981 ttgtaagtga ccccaagcta ttccaaatac ctatctgtgt tgcttcactt catttcgtac 11041 cgatgctccc tgcaactcag cataagtgga aagttccaga aatatcgcct gaaacccttg 11101 ttctttcgtt tgctaacaaa agttttccaa cgattgagag ttggtttatg gcactgttga 11161 ggctaagcgc ccttgacgat ctcctcacag gtgtaggttt tttatatagc agggaacagg 11221 gaactcttaa taggcttgat ggtcttgtgg tgtcctcctt gtctccaaaa tctgacttgt 11281 cctgtttcaa gcgttgttgt aaaaagctat acttgtcagg caggagacgg ctcctcagtc 11341 tactcccgaa ccgctctaag attacctata ataacttact cctctctctc tttctccctc 11401 tgtgtgcgtc ctctcttgcc tcgtgcggtt caaaaataaa tatttttctg gcggatcagg 11461 agtaaaacgc tattgcgtct gtgcgcgact ttccatgtcg cactttggtt tgaccaccag 11521 taccagttgt tgttggtcta agatatttgc caaacttcat tatatataaa tacaaaaaag 11581 ctaaaaattt acattgctct tcattttatt gatgctaatc ttgtccgttg cattcgcaaa 11641 ataagattga cagagattat tcttctgaac aaataaataa caaataattt taacctgaaa 11701 tattcatgat tttgacacat gtttaattgt caaatttaca ttttttttga taagaataat 11761 atgtcaagtc aacatcagtc ataatgctca ggaaaaaact tttatctttt gataaggaaa 11821 agtagttact gtttagcctg acaactgttt tcaagactca gaatcagtca acaaagggat 11881 ttccttaaaa gtaaaatcac actctcctct ggactctaat taattttatt tgacagaaga 11941 gtaattgttg ctgcttcgta tttttttctg tgtcaacaca accgtaggag gaaacattct 12001 atcaaaaatg cacaaatagc ttgtgtcttt ttctaaaaat ttacaagctc aaagagtatt 12061 acaacaactt gcggttgtag atgcacaagt cgcttgaact tacgccgaac tagaagaacg 12121 tgctaactaa ctttgccgtc caagctaaag cgtgtgagct gatctcaatt caatatatcc 12181 gtgacaggtt aaaacagtgt gacaatttat gacaaatact catgcagcca actcaaatac 12241 aaacccttga tagacacaat cgttttggta ttgcaacaca gcagaatcag tccagcagtg 12301 tgacttatag aaacttagta ttcatagaca cagccgtagc ggattatcaa accctaatgt 
12361 ctggagtgga gaagggcaca caggtgatac tcctgcaccc agaatgggat ggtgtcgagc 12421 aaatcaccac tgcattgtct cagcaagcag acgatcttac cacagtccat atagtctcac 12481 acggttcacc aggatgcttg tatctgggca atagctgctt gaacttaaaa accctagaat 12541 tctacgccag cctcttagag cgatggtttc caaagggttc tactccctca ttattgctct 12601 atggctgcaa cattgcagct acagacattg gcatcgaatt tctagccaaa ttgcgccaaa 12661 aaacaaaagc acaaatagct gcttctacaa cccccacagg tcatccagct ttaggcggta 12721 actggaaact ggaagtcact acagagaaga tgaccatatc cttggcattt cctgttgcaa 12781 ctcaagtcgc atatactgga gtcttaaacc ctaaccgcgt ctccgtaggt gcaagcggaa 12841 gccagacaaa tgacaattcc ttgcgtccag ccatctccgc ttcagggcgt tacatagcat 12901 ttcggtctga tgccagcaac ttagtagcga atgacacaaa caacttcagt gacatttttg 12961 tctacgatac tgacacaggc attaccaacc gcgtttccgt aggtccatct ggcgtcgagg 13021 gaaataacgc cgctaatgga ggtcccgcta tctccgcttc aggacgttat gtggcgtttg 13081 agtcatatgc tagcaattta gtcgcagatg acacgaacaa ctttagtgac atcttcgtct 13141 acgacactca aacacgcacc actagccgtg tttccgtaga tttacaggga aatcagggga 13201 atagcgtatc ttcatccccc accatttcgg gagatggacg ttatgtggcg tttgagtcat 13261 atgccagcaa tttagtagca gatgacacga ataactttaa tgacatcttc gtctacgata 13321 ctcaaacacg caccactagc cgtgcttccg taaattcaca gagtaatcaa gggaataacg 13381 catctttctc tcccgccatt tcggcagatg gacgttatgt ggcgtttgat tcgtttgcta 13441 gtaatttagt tcctgaagat acaaacaaca ctcgtgacat cttcgtctac gatactcaaa 13501 cacgcaccac tagccgtgct tccgtaaatt cacagnnnnn nnnnncttcg tctacgatac 13561 tcaaacacgc accactagcc gtgcttccgt aaattcacag ggcaatcaag gaaaccaaat 13621 atccttctca cctgctatct cgggagatgg acgttatgtg gcatttgagt catatgctaa 13681 caacttggtg ccaggagaca ctaacgacac tggtgacatc ttcctttatg acacccaaac 13741 acgcaccacc agccgcgttt ctgtagattc acagggcaat caggggaata acgaatcctt 13801 ttctcccagc atctcagcag atggacgtta tgtggcattt gcttcatatg ctaacaactt 13861 ggtgccacga gacaccaatg acactgccga tatctttgtc tacgacactc aaacacgtag 13921 caccaaacgc gtttctatag attcacaacg caatcaggga aatgacttat cctacaactc 13981 agtcatctcc gcttcaggac gttatgtagc gtatgagtca 
tatgccagca acctagtggt 14041 aggagacacc aataacagcc gtgatatctt cctctacgac actattacca acaacgcccc 14101 aacaaacttg gcgttgagtg ctactagcgt agatgaaaag gtgcctgtgg caacagaaat 14161 tggtgctttc accaccacag atccagacac aaacgaccag cacacctaca gcttagtgac 14221 ggggacgggc gatgctgata atgccgtctt tagtattaat ggcaacaaac tactcattaa 14281 taattccccc aatgcaagca aatctagtta caacatccgc gcccgtacta ctgacaaagg 14341 tggactctac tttgacaaac agttcactat caatgtcaat ctgagcgtta atc // LOCUS NODE_2345_length_14369_cov_5.03039014369 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14369) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14369) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14369 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 169..1262 /locus_tag="DP116_19370" /pseudo CDS 169..1262 /locus_tag="DP116_19370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006668373.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene complement(1436..1618) /locus_tag="DP116_19375" CDS complement(1436..1618) /locus_tag="DP116_19375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315683.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19375" /translation="MNPIERFWEFLKSKLRSENCKTLAQLREKLAEALETITPEVIVS LTSYDFILEALFSAAS" gene complement(1620..1928) /locus_tag="DP116_19380" CDS complement(1620..1928) /locus_tag="DP116_19380" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19380" /translation="MGLKTITGRLITAPGVKPIGLSQWQRDNFYLYRVVEPLSGYSFF YEFSHLDSDCFQRFLELLSAELGEDVAVIQFDQGSFHTVKTLDCPENIIPIFQPPHSR " gene 2870..3622 /locus_tag="DP116_19385" CDS 2870..3622 /locus_tag="DP116_19385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2993 domain-containing protein" /protein_id="PRJNA477356:DP116_19385" /translation="MSQEQRIEEQMLSHEAEKQVSQQVDKVEKVDVDVQTDLLKIFQG QADGVSFEAQGLVKQDIRVQEIKLQTDSIDINPLSVLFGQIELNQPVNTTARIVLIEA DINHALTSKFVRSKMQNFELNVDGEIVGLQPQEIQIHLLDGGKMAFTGKVLLKEKGNT RSISFTAQVCPRTQDKPIMLENFNCTHGGEGISLEVVVALMQKVKELVNLPYYEYEKT VFRVRNMDVEKGNMTLLVDARLKQIPSLDDLS" gene 3974..5265 /locus_tag="DP116_19390" /pseudo CDS 3974..5265 /locus_tag="DP116_19390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015331328.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="MFS transporter" gene 5500..7194 /locus_tag="DP116_19395" CDS 5500..7194 /locus_tag="DP116_19395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310913.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)/FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_19395" /translation="MQEYDVVIIGAGHNGLVCAAYLLKAGYSVLLLEKRSVPGGAATT EESLPQEAPGFKFNLCAIDHEFIHLGPVVQELELEKYGLKYLECDPVVFCPHPDGKYF LGHKSVEKTCAGIARYNERDAKKYAEFTDYWQRAIGAMIPMFNAPPKSVLDILGNYDI AKLKDLFSVIGSPNKTLDFIRNMLTSAEDILNEWFDSEFLKAPLARLASELGAPPSQK TLAIGAIMMAMRHNPGMARPRGGTGALVQALVNLVTSKGGVILTDQQVKKVLVDNGHA VGVQVANGVEYRAKHAVISNIDAKRLFLQFIDNSDVDAADPNLRERLERRIVNNNETI LKIDLALNEPLRFEHHEHKDEYLIGSVLIADSVTHVEQAHSKCTLGEIPDSDPSMYVV VPTMLDPSMAPPGKHTAWIEFFSPYQVAGAEGTGLNGTGWTDELKHKVADKVIDKLAD YAPNVKNAIIARAVESPAELGERLGAYKGNYYHVDMTLDQMVFFRPLPEIANYKTPIE GLFLTGAGTHPGGSISGMPGRNCARVFLQHKHPIAQTLKDARDSIKSTVESVFKIN" gene complement(7327..8838) /gene="ilvA" /locus_tag="DP116_19400" CDS complement(7327..8838) /gene="ilvA" /locus_tag="DP116_19400" /EC_number="4.3.1.19" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006104709.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="threonine ammonia-lyase, biosynthetic" /protein_id="PRJNA477356:DP116_19400" /translation="MLCDYLVQILTARVYDVAQETPLDYAPNLSARLNNKLLLKREDM QSVFSFKLRGAYNKMAQLPPDLLEQGVIAASAGNHAQGVALAARHIGTRAIIVMPVTT PQVKIDAVRARGGEVVLHGDTYDDAYALARQLEAEKGMTFIHPFDDPYVIAGQGTIGM EILRQYQQPIHAIFVAIGGGGLISGIAAYVKRLRPEIKIIGVEPVDADAMHQSLKAGR RVRLPQVGLFADGVAVREVGEETFHLCQEYVDDIILVDTDDTCAAIKDVFEDTRSIME PAGALAIAGAKAYVEREQIQGQTLIAVACGANMNFDRLRFVAERAELGERREAIFAVA IPEERGSLRKFCECIGKRNLTEFSYRIAGEKEAHIFVGVQIQNRADAAKMVETFEAHG FKTIDLTDDELTKLHLRHMVGGHSHLANNELLYRFEFPERPGALMKFVSSMSPDWNIS LFHYRNNGADYGRIVVGMQVPPHEMEQWQAFLDTLGYRYWDESQNPAYKLFLG" gene 9529..9969 /locus_tag="DP116_19405" CDS 9529..9969 /locus_tag="DP116_19405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19405" /translation="MEPATLTAAAIATLAFSKAIEKTAETLTASVLNKLNNLREKIFH RFKDTQKLKDTLAKAQKEGSKADVDLIAAYLQVAMDTDDKFAQDIQQLAQEINQEINI GNIEGRNVQNVYGGEAFQSNDANAPTFQGGSGHNITFNYNNPNS" gene 10138..11975 /locus_tag="DP116_19410" /pseudo CDS 10138..11975 /locus_tag="DP116_19410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002768422.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" assembly_gap 11443..11452 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(12178..12363) /locus_tag="DP116_19415" CDS complement(12178..12363) /locus_tag="DP116_19415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006619050.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicA family toxin" /protein_id="PRJNA477356:DP116_19415" /translation="MRLLGFEGPFSGAKHQFMTYGQHRLTIPSNDEYSVPQLRMMVRE IEMILEREITLEEWTSL" gene complement(12386..12658) /locus_tag="DP116_19420" CDS complement(12386..12658) /locus_tag="DP116_19420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_19420" /translation="MAVKFILSDYVEQATAQAVYDKLEDGTFAGKIPACKGVIAFGST LRECEDELRSTLEDWILLGLKLGHSLPVINNIDLNKEPTLESMDTL" gene 12788..12970 /locus_tag="DP116_19425" CDS 12788..12970 /locus_tag="DP116_19425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015225751.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_19425" /translation="MKIKVILEPSDEGGYTVSVPLLPGCISEGETIEEALDNIQEAIK LYLEPLEDELSYTLVR" gene complement(13090..14037) /locus_tag="DP116_19430" CDS complement(13090..14037) /locus_tag="DP116_19430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749700.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_19430" /translation="MQGTTAPSTTPIPGKYWQWRGQRIYYVKAGEPKIQRPPLLLVHG FGASTDHWRKNISGLCNDFEVFAIDLLGFGRSAKPKLEYSGDLWRDQLNDFITEVIGQ KAVLAGNSLGGYASLCVAAQRPDAAAGLVLLNSAGPFSENQPSPEPEALQTEIEPIPA NEKLQKLLGEIAKWILRQPLAQFLLFQYIKQPWVIRQTLEKVYLDKSAITDQLVEEIY RPACDSGAMDVFLSVFSTPQGEKVDTLLKQLTCPLLQLWGEADPWISARERSKKFRQY YPELTEYFLRAGHCPHDEVPDEVNRLLSEWVLSTVVSSS" BASE COUNT 4131 a 3143 c 3165 g 3920 t 10 others ORIGIN 1 ttgttaataa ttctaattct aaattttcag atattttttg gcattcgtct aaataatctc 61 ttatttgttg tagagaaatt atttcatttc tttcatgaga aggaagtttt aagtaatatg 121 aaatccggat gtatacttac tgtatagctg ggttgcaatt atctgctcat gccaaaacga 181 atttcaattg tccaacatca tgaattggac gagttagaaa cacgctatcg ccaatcaaaa 241 gattctgttg aacgaagcca gtatcaaatt gtatggttac taggaagcgg caaaacaacg 301 tcggaagttt ctgcggtaac aggatattct ttaaggtgga tcagagtaat agcgaaacgg 361 tacaacgaat tggcagaggc ggggatcgga gataggcgtc atcgaaacgc aggaacagag 421 ccattattgg atgaagtatt 
acaagcacag ctattgcaag caatggaaac accagtcagc 481 gatggcggaa tttggaacgg accgagagtg gctgggtgga tgagtcaagt acttgagcgt 541 cgagttcatc ctcaaagagg gtgggagttt ctcaagcgtt gggagcatcg tttacgtgtg 601 ccaagaccgg aacattactg ctctgaccgg atagaacaac ttgagtggaa aaaaaactga 661 atttgagggt aggcgaatta caccaacaat atccagatgc gaaaattgag gtctgggcaa 721 tggatgagca aatgcctgcg gcaacgcttc gcgaacgtgt tggactgaag ccagttttac 781 gacgtatttg ggtaccgtgg tgggaggtac caacagcaca ggtgcattgg cgctttcagt 841 gggtttgggt ctatggtttt gttcatccag aatctggaga aacatattgg tgggttttgc 901 cgcgtgtcaa tactgaattg tttaaccaag ttttaggaga ttttgctcgc gagtttggta 961 ttggcgatga gaaacacgtt ctgttaacgg ttgaccgtgc cggatggcac gtcagtcatg 1021 acgttcaaat tccacatggt ttacacttgg agtttttacc accatattca cctgaattac 1081 aaccagcaga aagactgtgg acacttacga atgaaccaat tgccaatcag tacttttctt 1141 cgattgagga acttgaagac gcgatagtcg ctcgttgcca agttctgctt caaaagcttt 1201 gctttattag tggattaacc tgctaccact ggtggcctag aaccgccgca tacgataact 1261 aatcacccgg atttcatatc acctcgtaag ccagcataac ttgccaaccg cctgcatggg 1321 gcgatcgctc cttgaacaaa ggcaagcaat ctcaaagaca ccagcaaatt gcgtcgcctc 1381 aacgagtaaa ttgacaaccc ctcaacggac tactatagac aagttctctt tttctttatg 1441 aagctgcgct aaataaagct tcaagaatga agtcgtaaga ggtgagagaa acaatcactt 1501 ctggtgtaat agtttctagc gcctcagcta atttctcacg cagttgagca agggttttac 1561 aattctccga tcgtagtttg ctcttgagaa actcccagaa cctctcaatt gggttcaatt 1621 caccgagagt ggggaggttg gaaaatcggt ataatatttt ctggacaatc aagggtttta 1681 actgtgtgaa acgagccttg gtcaaattgg ataactgcaa cgtcctcgcc aagttctgct 1741 gacagcaact ctaaaaacct ctgaaaacaa tcactatcaa gatgggaaaa ctcgtagaaa 1801 aaactgtatc cgcttaaagg ttcaacgaca cgatacaagt aaaaattatc acgttgccac 1861 tgactcaaac cgatgggttt gacaccggga gcggtaatta aacgtcccgt aatagtcttt 1921 agtcccaagc gggttatctc ctctgaacat atatctgagt cgctgacctt ctccaaattc 1981 ttcttgcaag aatttgagtg ctaaagggag ttttttttaa agtcagactg gctgtgcgga 2041 tgctgtttcc ggcttttggg acgaggaact tttagtttgg cacccggcat aatagcgaac 2101 aagttgataa accgttttgt aagcaatctt 
tagtcccagt tcgttctaag cgccattgtt 2161 gaatctgact gtagctgtga aatccttgtg gtgaggacaa tcgctgtttc aaccgttcca 2221 agttttctcc actaactata gattctttac caggcgcatg tttcacacat agcaacccac 2281 ttcgtccctc gtctttatac cttcgtgaac cttcgagtta cggttgcttg atctggtccg 2341 aaatgtttgg cgatacgcct ttaggcgtct gcccttcggg caatcgcttg ccgacttgtg 2401 acttgaccac ttttaagcca gtagagcatt tgtaatcgct ctttgctact tgccgttgtc 2461 gcatatttaa cagctttttc caattcttca aggctttctt taatctcgac ttgcaatctc 2521 aaacccatca cgtttaaacc aaacgcaccc ttatttttaa tttagcgcag cttcatacag 2581 aaatggtatt acaagttctt gttgttgggt gtgggcttgg aatggaattt attcgaccat 2641 aagtcgtttg ctagaaacta atgcaaaata agtataacct catctttcta agaggaggtc 2701 ttcggacagc aaatatgatt cattggtttg atgtcagctg acaaacacaa ttcctgagct 2761 aaattcagca acaccagatt tttctatctt ctatatcctg atgtagatac tctcctctgt 2821 tgtcggcgtc acaattaaca gtaacacgat gccaggagag aattaattca tgtcgcaaga 2881 gcaacgcata gaagaacaaa tgctttcgca tgaagccgaa aagcaagttt ctcaacaggt 2941 agacaaagta gaaaaagtag acgtagatgt acaaaccgac cttctgaaaa tatttcaggg 3001 acaggcagat ggagtttctt ttgaagctca aggactagtc aagcaagaca tccgtgtgca 3061 ggaaataaaa ctacagacag atagcattga catcaatcca ttgagtgttc tttttggtca 3121 aatagaactg aatcaaccag tcaataccac tgctcgtatt gtacttatag aagcagatat 3181 taaccacgct ttgacttcaa agtttgttcg cagcaagatg caaaactttg agttgaatgt 3241 agatggtgaa attgtcggtt tacagccaca ggaaatacaa attcacctat tagatggtgg 3301 caaaatggca tttacgggaa aagtactgct caaagaaaag gggaatactc gctcaataag 3361 tttcacagca caggtttgcc cacgtactca agacaaaccc ataatgctag aaaattttaa 3421 ctgcactcac ggcggagaag gtatttcact agaagtcgtt gttgctttaa tgcagaaggt 3481 aaaagaacta gtcaacttac catattatga atatgagaaa acggtgttcc gcgtcagaaa 3541 tatggatgtt gaaaaaggta atatgacact tttggtagat gcacgtctca agcaaattcc 3601 ctcattggat gatttatctt gaatctttag tcataaaagg aatgtgagat gcaaaaaata 3661 gatagttttt caaaagtaaa tacgtttgga tggactatac tcacttgact gctgtacata 3721 aactatagca gtcctaaatc attagtgaaa ctcttttttc ttctcctctg cgccacgcca 3781 ggtgctacaa cggggggaac ccccgcaacg cactggctcc 
tctggttctc tgcggttaat 3841 tcaatcaaaa tctttttcac aaatcaaata gaattgctat aattgcccaa ataaaattgt 3901 actcaaagag atatagacgc atcttaattt caatgatgaa atggcattgg ttcaactcaa 3961 attgtaaaga tttatgtacg aaaagagtaa aaaaccaagc cagaaaagtt tatgggcact 4021 cgattactta aacctttttc tagctgatgt acgtgatgga gtaggaccat atctagctat 4081 ctacttgaaa gcttcagaaa attggaatcc agccaatatc gggattgcaa tgtctgcttc 4141 aactattgca acagtgattg cccaaacacc aacaggtgcg ttagtagacc gattgcgtca 4201 aaaacgaatg ttgattgtgg tggctgctgc aattgtgtct attggttgca tcgcaatagc 4261 cctgttcccg agttttccaa tagttattgg tggtcaaatt ttaattggtt tagcagcagc 4321 agtgtttcca ggtgcaatcg ccgctattac cctaggacta gtcgggcatg atcatttaga 4381 ccgtcgaatt ggtcgcaatg agtcgtttaa tcatgcaggt aacgtacttg cagcgatttt 4441 agctggttta gtgggttctt ttattaccag caaaggcatc ttttttctgg ttgcagctat 4501 ggcggtggct agtgcaatcg ccgttttgag aattcgcgaa aaagagattg accacgaatt 4561 agcgcgtggt gcaaaagatg aggatgagga cgtttcagaa gaacatccac atcatctctc 4621 tgggttatcg cagttgttca gcgatcgccg catccttttg ttctgtcttg cagtggttct 4681 gtttcacttt gccaatgccg ctatgttacc acttgtcggt caaagactct cagaaggcaa 4741 agctgcagga gctacacttt atatgtcagc ttgcatcatt gttgcccagt tggtgatgat 4801 tccttctgct aacttggctg gtcgttttgc ccatgctgag cgaaaaccta tctttttgtt 4861 tggctttgct gttttgccaa ttcgtggtat actttacact ctcaccaata atccttattt 4921 tttggtttct gtacagattt tagatggagt tgcgggcggg atttttggcg tgctttcagt 4981 actcatggtt gctgatttaa ctaagggcac gggtcggttt aatgtcacac aaggagcgct 5041 aaatactgct gttggtatag gtgcatgttt gagtaatctg ctagctgggt ttgtggtgca 5101 aaaggctggc tataatgttg cttttgttgg gttggctgcg atacgccttt ggcgtctgcg 5161 ctttgcgcaa tcgctctcgt agcaactatc attttctgga tgttcgtccc agaaaccaaa 5221 gcttcacata aagcacgggc tttcgtttca aaccactatt catgaaaagt ctatgaataa 5281 cgactgattg caaacgccca aggtgtatta cataagcgac taattgctca ctgtcttcct 5341 tctattacaa ttagtcagta gccagtcttc tcttagccaa tttttcagcc taggtcttat 5401 cctaaaccag gatttctata actgccgaaa gatggaatta ttagtaaaaa tttgtaacct 5461 tgaaaacaaa gagatagata taagcacgaa gtttcatcta tgcaagagta 
tgatgttgtc 5521 attattggag caggacataa cggattagtc tgtgctgctt atttgctgaa agctggctat 5581 agcgtcctgc tcttggaaaa gcgttctgtt cccggtggtg cagcaacaac agaagaatct 5641 ttaccacagg aagcgcctgg atttaagttt aacttgtgtg caattgacca cgagtttatt 5701 cacttaggac cagtcgtaca agaattagaa ctggaaaaat acggcttaaa atatctggag 5761 tgtgatccag ttgttttctg tcctcatcct gatggaaaat acttcttagg tcataaatca 5821 gtagaaaaaa cttgtgcagg aatcgcccgt tataatgaac gagatgccaa aaaatatgca 5881 gaatttacag actattggca gagggcaatt ggtgcaatga ttcccatgtt taatgcaccg 5941 cctaaatctg ttttagatat tcttggcaac tacgacattg caaagctgaa agatttattt 6001 tcagtcattg gttctcctaa taagacgctg gactttattc gcaatatgct aaccagcgct 6061 gaggatattc ttaacgagtg gtttgattca gaatttctga aagcgcctct agcaagactt 6121 gcatcagaac ttggtgcgcc tccctcccaa aaaacccttg ctattggtgc aattatgatg 6181 gcaatgcgcc ataatccagg catggctaga ccccgtggcg gtactggtgc attggtacaa 6241 gccttggtga acttggtgac aagtaaaggt ggcgttatcc tgacagacca gcaggttaag 6301 aaagttttgg ttgataatgg tcatgctgtt ggtgtgcagg tggcaaatgg cgtagaatac 6361 cgtgctaagc acgcggtgat ctcaaacatt gatgccaagc gactattttt gcaattcata 6421 gataacagcg atgtagatgc tgctgatcca aacttacggg aaagattaga acgtcggatt 6481 gtcaacaata acgaaactat cctcaagata gacttggctt taaacgaacc actgcgcttt 6541 gaacatcacg agcacaagga cgaatacctg attggatctg tgttgatagc agattctgtt 6601 actcatgtag aacaggctca tagtaaatgt actttgggag aaattccaga ttccgacccc 6661 tcaatgtacg tagttgtgcc aacaatgtta gatccgtcga tggcacctcc tggcaagcat 6721 accgcatgga ttgagttttt ctctccgtat caagttgctg gtgcagaagg tactggttta 6781 aatggtacag ggtggacgga cgaattaaag cacaaagtcg cagataaggt gattgataag 6841 ttagcagact acgcaccaaa tgtgaaaaat gcaatcattg ctcgtgctgt agaaagtcca 6901 gcggaattag gagaaagatt aggcgcgtat aaaggaaatt actaccatgt tgatatgacc 6961 ttggatcaga tggtattttt ccgtccctta ccagagatag cgaactacaa aaccccaatt 7021 gaaggtttgt tccttacagg tgcgggaact catccaggtg gttcgatttc aggaatgcca 7081 ggacgcaact gtgcgcgagt ctttttgcag cataagcacc ctatagcaca gacgcttaag 7141 gatgcacggg attcgattaa atcaacagtt gaatccgtgt ttaagattaa ctaatacgcg 
7201 cgttgcattc atacgtatta cctcaccctg ccctgtcggg catccctctc cgaattccgg 7261 agagggaaaa attttagggg ttttgctcct caactcagcc ctactctttt aagcgtgagc 7321 attaccctat cctaaaaaca gcttatatgc tggattctga ctttcatccc aatagcggta 7381 acccagcgta tccagaaatg cttgccattg ctccatctca tggggaggta cctgcatccc 7441 cacgacaatc cgcccgtagt ctgcgccatt gttgcggtag tgaaacaggc tgatattcca 7501 atcaggactc atggaactga caaacttcat caatgcacca ggacgttcag gaaactcaaa 7561 acggtaaagc aactcattat tagcaaggtg agagtgtcca ccaaccatat gccgcaaatg 7621 caattttgtt agttcgtcat cggttaagtc aatggttttg aacccatgag cttcaaaggt 7681 ttcaaccatc tttgctgcat cagcacggtt ttgaatttgc acccccacga aaatatgtgc 7741 ctctttttca ccagcaatgc gataactaaa ctcggtcaga ttccgtttgc caatacattc 7801 acaaaacttg cgaagactac cccgttcctc aggaatcgcc acagcaaaaa tggcttcgcg 7861 gcgttcaccc aactctgctc gttcagcaac aaagcggagg cgatcaaagt tcatgttagc 7921 accgcaagca acggcaatta acgtttgtcc ctggatttgt tctcgttcga cgtaagcctt 7981 tgcaccggcg atcgccaatg cacccgctgg ttccataatc gagcgcgtat cctcaaacac 8041 gtctttaatc gcagcacaag tgtcatctgt atcaactaaa atgatgtcat ccacatattc 8101 ctgacacaga tggaaagttt cttctccgac ttcccgcacc gctaccccat cagcaaataa 8161 gcctacttga ggtaagcgca cccgtctccc tgctttgagc gattgatgca tagcatcagc 8221 atccactggt tcaacgccaa taattttaat ttcaggacgt aagcgtttta catatgctgc 8281 aatcccagaa atcaatccac caccgccaat cgccacaaaa atagcgtgga taggttgctg 8341 atattgtcgc agaatttcca tgccgattgt tccttgtccg gcaatcacat acggatcatc 8401 aaagggatga ataaaagtca tacctttttc tgcctctagt tgacgggcta aggcataggc 8461 atcatcgtaa gtatccccat gcaaaactac ctccccccct ctggctctga ctgcatctat 8521 cttcacctga ggtgtcgtta ccggcataac aataattgct cgtgttccga tatgacgggc 8581 agcaagagca acgccctggg catggtttcc agcagacgca gcaatgacac cctgttccag 8641 caaatccggc ggtagttgcg ccatcttgtt ataagcaccc cgcagcttga aggaaaagac 8701 tgactgcata tcttcccgct tgagtaggag tttattattc agtcgtgcgg atagattggg 8761 agcataatcc aagggcgttt cctgagcaac atcgtacacg cgggcagtca ggatttgtac 8821 aaggtagtcg caaagcatgg gcgtcaacag gttggtaaat gccggatgga tgataatttt 8881 
accgccaaac ttgtcagcat gacatacagg acttaggcaa cgagatccag cagttgtggg 8941 agggtgtagg ggggtaaggg ggtaagggtg taggggtgta ataccaaatc cgtttttata 9001 acccctcgtt acccctgggg attggaaatc gcggctatac aaacgaagtc cgcctgcgcg 9061 gactaaatta taaatggggt ataagacgcg gatttagtat aaggggaaaa cgcgcgggtg 9121 tatagaagtt aaaagctaaa aggcaaaaga aaaatttctt tgtccttact ttatggtttc 9181 tactttttaa aggtctaaaa aaagaagatt gccatatcag gaactgtact tatccacggt 9241 gtagttactt gtggtttgga aaaacttgca ctaagacctt cctattaaac aaaatctgct 9301 ttttggacgg ttcatcgtac cacgaaaagc agattttttt ttaaaatatt cagaattatg 9361 aacctcaccc tcgcttgaat cggcgctaaa atctttccct ctccttaata aggagaggga 9421 tgcccgacag ggcagggtga ggttccgaac ttactagtaa ttttccgaaa tctttcaaga 9481 taaagtggta gatataacta aatgtcatct tggtttcggg gctagtttat ggaacctgca 9541 acactcacag cagcggcgat cgccacccta gcatttagca aagctatcga aaaaaccgca 9601 gaaacactga ctgcaagcgt attaaataaa ttaaacaatc tgcgcgagaa aatttttcat 9661 aggttcaagg atactcagaa attaaaggat acgctagcaa aagcccagaa agaaggctca 9721 aaggcagatg ttgatttaat tgctgcttat ttgcaggtag caatggacac agatgataaa 9781 tttgctcaag atattcaaca gcttgctcag gaaattaacc aagaaattaa tattggtaat 9841 attgaagggc ggaatgttca gaatgtttat ggtggcgaag cttttcagtc taatgatgcg 9901 aatgctccca cttttcaagg cggcagtggt cacaacataa cctttaacta caataatccc 9961 aattcttgac tgcagtcgtc cgaatgggaa cgggcaaagt taaatccccc tactggaata 10021 ccagagaatt tgcccaagaa tagggcgacg aaatttgtcg ggcgagaaga ggaactgcaa 10081 cagctacacc aacttttgca ggaaaatgac cgcgtggcaa tagccgccat ttctggtatg 10141 ggtggagtgg ggaaaacaga actggcgctg caatatgcaa atactcaccg cgaaacttac 10201 caaggtggaa tttgctggtt actcccgaaa gctgcggatg tgggattgca gttagtgcag 10261 tttgcgcgta ttcaccttaa cttaaaccca ccagaaatat cgccggattt tgatttacac 10321 gcgcaactcg catactgctg gcggcgttgg cgtgagggtg aggtgctgct gattttagat 10381 gatgtcgcga aatacaaaga aattaaaccg tatctcccaa cctcatcttc ccggtttaag 10441 gtgttgatga caacaagagc gcactttggg caaattccga aattatcatt aggtgtgctg 10501 caaccagaag cggcgctgga gttattacga acactcattg gtgcagaacg agttgatgaa 10561 
gaaattgcac aagcagaatc tttgtgtgca tggctgggat atttaccttt ggggttagaa 10621 ttagtcgggc ggtatcttgc acgcaaagaa gatttgtctc tagaagaaat gttgcggagg 10681 ttggagaaaa agagattaga acaacctgcc cttgtgaagt cagaagacga catgacagca 10741 caactgggtg tgcaagcagc gtttgacttg agttggcagg aattaaacaa cacggataag 10801 caactggcgt gtttgctgag tttatttgca cctgcaccaa taccttggca gttggtagaa 10861 ctgtgcttac ccgatgtaga tgcagaagag ttagaggaaa gtcgggatga taagctgctg 10921 aatttgcatt tgctacagcg taaggaaaag ggaatttatc aactgcatcc actgctgagg 10981 gaattttttc aagcgaagtt gacagagtta gatcccccca acccccctta tcaagggggg 11041 gatctcaaac aagcttttgc cacagcaatg atagctgttg ctaagaatat tcctgaaaac 11101 gccacgcgtg aacaagttac cgttatttct cccgcaatac ctcatttggc agaagtcgcc 11161 aacaatctta ttcaatgtgt cagggatgag gatttaattt gggctttcgt tggtaatgcc 11221 ttattctaca atagtcaggg attgtatgat caagcagaac cttggtataa gcaatgtcta 11281 gaagttacta aaaaacgcct gggagatgaa catcccgatg tcgcaactag cctcaacaac 11341 ctggctaatc tctacgactc ccaaggaaga tacgcagatg ctgaaccact gtacctacaa 11401 gcattggaac taaggcgacg cctgctggga gatgaacatc ccnnnnnnnn nnactgtacc 11461 tacaagcatt ggaactaagg cgacgcctgc tgggagatga acatccctct gtcgcagcta 11521 gcctcaacaa cctggcgtta ctctacaact cccaaggaag atacccagaa gctgaacctt 11581 tgttccaaca agcattggaa ctcagccgac gcctgctggg agaagaacat cccaatgtcg 11641 cactcagcct caacaacctg gcgttactct acaactccca aggaagatac ccagaagctg 11701 aacctttgtt ccaacaagca ttggaactca gccgacgcct gctgggagaa gaacatccca 11761 atgtcgcact cagcctcaac aacctggggt cactctactc ttaccaagga agatacgcag 11821 aagctgaacc tttgtactta caagcattgg agatttgtga acaacgctta gaggtaaatc 11881 atcccaacac tgtcactgtt cgtgaaagtt tggcaagtct tcgcgctcaa ttggcttcag 11941 atcaggaaaa ctcaagttct gagcctcaac gctgaagttc tcaactcgga ctttgcactt 12001 ttgaagctga agtttcgagt tccaaatcag ttaccagtta tcagttatca gttatcagtt 12061 atcagttact tgttcactgt tcactgttaa acacgcgatt cgccacctaa tatccatcaa 12121 atcatgaaag gcgatgctcc tttggagccg cttcgcgatc gcctatcacc gaatgtatta 12181 aaggctagtc cactcttcca gtgtgatttc tcgctccaaa atcatctcaa 
tttcccgaac 12241 catcatccgc agttgcggta cagaatactc atcattggag ggaatggtta ggcgatgctg 12301 cccgtaagtc ataaactgat gcttagctcc agaaaaaggt ccctcaaaac ctaacagtcg 12361 caatttatgg acaaaatccc gacgcttaca aggtatccat cgactcaaga gttggctcct 12421 tgttaaggtc aatattatta atgactggca gagaatgtcc caatttcaac ccaagcaaaa 12481 tccagtcttc cagtgttgag cgtaactcat cttcacactc acgcaaggtt gaaccaaagg 12541 ctataactcc cttacaagca ggaattttac ctgcaaatgt accatcttct agtttgtcgt 12601 aaaccgcctg tgcagttgct tgctcaacat agtcacttaa aataaacttt acagccatta 12661 agttttccgc tttgctacaa cttagttgtc attttctact ctattgaaac acaaaaatat 12721 agagagcgat tgccttattg cttacgcgac acacattata tttaaaagaa agtcagaaag 12781 ccaaattatg aaaataaaag tcatattaga acccagtgat gaaggcggat acaccgtctc 12841 tgtacctttg cttccaggct gtatcagtga aggggaaact atcgaagaag cattagacaa 12901 tatccaagaa gcaatcaaac tttatcttga accattggaa gacgagttaa gttatacatt 12961 ggtgcgctag gcgcttttct atttttcgtt acaaatcata ttgaaatatg cgcctaacac 13021 aacgccaggt gctacaacgg agggaacctc cgcaacgcac tggctcccct acagtcactc 13081 acagtaaact cacgacgaac tcacaactgt agacaaaacc cactcactca acaaacggtt 13141 cacctcatct ggtacctcat catgaggaca atgacccgcc cgcaagaagt attctgttaa 13201 ttcaggatag tattgccgaa acttctttga acgttccctt gcacttatcc aaggatcagc 13261 ttctccccac aactgtaaca gaggacaagt taattgcttg agcagcgtat caactttttc 13321 cccttgagga gtgctaaaca cagacaaaaa tacatccata gccccagagt cgcacgcagg 13381 acgataaatt tcttctacta gttggtctgt aatggcactt ttgtcaagat aaactttctc 13441 tagagtttgg cgaattaccc aaggttgttt tatgtattga aataagagaa actgggctaa 13501 aggctgccgt aaaatccact tggcaatttc acccagtagt ttttgtagct tttcgtttgc 13561 tggtatcggc tcaatttcag tttgcaaagc ttctggttct ggtgaaggtt ggttttcgct 13621 aaaaggacca gcactgttga gtaaaaccaa acccgctgct gcatcaggac gttgtgctgc 13681 aacacacaag cttgcatacc ctcccagaga attacctgct aacactgctt tttgaccaat 13741 cacttcagtg ataaaatcgt tgagttggtc gcgccacaag tcaccactgt attctaattt 13801 aggtttcgca gaacgcccaa atcccaaaag gtcaatggca aagacttcaa aatcgttaca 13861 caatccgctg atgttcttgc gccagtggtc 
ggtagatgca ccaaaaccgt gtaccaataa 13921 taagggaggg cgttgaattt ttggttctcc cgctttgacg taataaatcc tctgccctcg 13981 ccactgccaa tatttaccag gaattggggt tgtagagggg gctgttgttc cctgcatgat 14041 ccaaagaaat gttaagtacc tgtaaatgat tctaacgaag tcatgagacc tcacccccat 14101 cccctctcct tattaaggag aggggtgccc gtaagggcgg gggtgaggtg acgacaacct 14161 gctgcaggag ggtttccctc cgcaggcgac tggcgtatct cctccggaga cgctttgcta 14221 acgccgtaag gcgtgcgctc tgcgcacacc cggagggttt aagtcatgaa tgcaacgcgg 14281 tataacgaag tcatgaaagt caataatgac taatattgac actcccctgc ctaaaggcga 14341 ggggattcta cattcatcgt cagaacttg // LOCUS NODE_2351_length_14352_cov_4.98356314352 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14352) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14352) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14352 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 115..354 /locus_tag="DP116_19435" CDS 115..354 /locus_tag="DP116_19435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874661.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19435" /translation="MKTEREREQLIKDINVLLNQAYDCTLDEILALLQNVEDEEDEED LKSVKEAREEIRLHGTFSWEEIKKEIAEERKQDVA" gene 358..615 /locus_tag="DP116_19440" CDS 358..615 /locus_tag="DP116_19440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_19440" /translation="MTYTVEFSPSARKMFKKLPQDLQDRIQPKIDALATEPRPSGVKK LKGEENTYRIRVGSYRVVYEIEDDVLLVTVIRVGGRGEVYN" gene complement(622..2769) /locus_tag="DP116_19445" CDS complement(622..2769) /locus_tag="DP116_19445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_19445" /translation="MKNFFTLSTVFSSVIVTGLLFAVQRLGVLEPLEIRLFDQMMQMR VDSGSDSRLLIVAVTEDDIKKWKSPTSDRLSGEVLDNLLGKLEQYQPRAIGLDIYRDL PIEPGHNKLLKRLQQSDIIIPICKHNDKNERGVSPPEGIKSWEVGFIDVVEDSDSTIR RNLLLSDPAANDACAAQYSLSLQLALKYLELEGIRLQSTPNQELKLDKTVFKRLESNS GGYQNIDTGGYQIILNYRSSQVAKKVTLTNVLEDKVGSDLVKNRIILIGSTAPSLKDI FNTPLSTGKSDTSGRMAGVEIHAQSVSQILSAVLNNKQTLFWFLPEWGEVVWILVWSL TGGIIASRIQHPLYLAIVGGTGLVVLFAGNFFIFTQAGWIPVVSPALGLVLAAGSVLG YTLYQSQQEKEKIAQQVQQQEEAIIQLQAFISQSNNSSGLTTPPQIPSQMHLLKRRYK IIEPLGHGGFSETYLAQDTQRPSHPQCVVKQLRPAHQEESFLRVARRLFNTEAEILEV LGQHEQIPQLLAYFEENQEFYLIQEFIKGNSLEKEITPNKKFAEADVVSLLKEVLLIL VFVHGYNVIHRDIKPSNLIRRESDGRIILIDFGAVKQIQTHQPNNTVAIGTPGYVSPE QTNGQPRLNSDIYALGILAIQALTGRHPKTFQRDFNTLRVVISRQDGTLQNWHDLTEI SDKFAAVLNRMVNQNCNLRYQSATEVLNSLESL" gene complement(3354..3614) /locus_tag="DP116_19450" /pseudo CDS complement(3354..3614) /locus_tag="DP116_19450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877766.1" 
/note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(3636..4946) /locus_tag="DP116_19455" CDS complement(3636..4946) /locus_tag="DP116_19455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="arginine biosynthesis bifunctional protein ArgJ" /protein_id="PRJNA477356:DP116_19455" /translation="MADWQEISGGITAPRGYRSAGIAAGLKPSGLPDLALIVSDVEAI AAGVFTISHVKAACVDYCRQSLQAKHSARAILCNAGQANAATGHQGWLDAIESAMAVA QALNIPSESVLLASTGVIGQRIRMDALKAGIPKVVAALSETGSDAAAGAIITTDLVRK SIALETMMGDRPVRIGGIAKGSGMIHPNMATMLAFVTCDAAVSPTLWQQMLSRAADRS FNSITVDGDTSTNDSLIALANGQSRTPAITEMGAEAEKLEAMLTAVCQHLAKAIARDG EGATCLIEVQVTGAQDEQAARQVAKTIAGSSLVKSAIFGRDPNWGRIAAAAGRAGVPF EQENLKIKLGDILLMENGQPLPFDKKAASEYLKQAAADASVPKDLVATNMSNDLSVDR STIKQRIDNPVIISVSIGNGSSSGKAWGCDLSYDYVKINAEYTT" gene complement(5385..5585) /locus_tag="DP116_19460" CDS complement(5385..5585) /locus_tag="DP116_19460" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19460" /translation="MPLRDSHFPKCNTEIWMRWEYQVALVISDPFSTIIGLYYQHYEN LIFLDFIFRNKHLLEDTQYGDF" gene 5892..7466 /locus_tag="DP116_19465" CDS 5892..7466 /locus_tag="DP116_19465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871219.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkaline phosphatase" /protein_id="PRJNA477356:DP116_19465" /translation="MERFQFERLLATKHKRRRFLIGTLGVSASVIASQWTHRVVAQPS FSGYPFSLGIASGEPLPDGIVLWTRLAPEPLNGGGMPSVNVPVQWQVALDENMRNVVR RGTAIATPEFAHSVHVEVGGLQPDRWYWYQFKAGNEVSSIGRTRTAPARDTRVAQFRF GFVNCQDWQNGYYTAYQGLAQEELDLVVHLGDYIYEYGPQPGGPRQHNSPEIVTLADY RNRHALYKTDANLQAAHAAFPWIVTWDDHEVENNYASFIPEENQSQQEFVTRRANAYQ AYYEHMPLRRLSLPRGPYLQLYRRFTFGDIAEFNVLDTRQYRSDQPCDDGLKPRCSEA FDPNATMTGTKQEQWLFRGLSKSQARWNVIAQQTMMAQYNFDARPGQEVFNLDQWDGY VAARDRLLKFLQQQQPSNPVVITGDIHSSWVHDLKTDFNNPNSPTVGTEFVGTSISSD FPEAFIPPTVAALSANPHTKFFDGKNRGYVRCHLTQNTWQSDYRVVSTIREPNATIST LASFVVPNGRAGAQQT" gene 7757..9199 /locus_tag="DP116_19470" CDS 7757..9199 /locus_tag="DP116_19470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860328.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="recombinase RecA" /protein_id="PRJNA477356:DP116_19470" /translation="MSDNRLSSGISGLDEVLYGGYVPGRAYLIRGGPGAGKTTLGMHF LTTGAARGEQVLFITLAETVTQLRRTSEGLGFDLENITFLDLSPTPEFFTQVQTYDIF SPAEVEREPTTRRIVQQVEALKPQRIFIDSMTQFRYLATDAFQFRKQVLSFLRFLVEQ DITVLFTSESSEEAPDDDLQFMSDGVLNLNFSQNERTLCISKFRGSDFQNGNHAIRLT STGMQLFPRLIPQIYGQAFTTEVISSGIPEIDELLHGGIERSTITIISGPSGVGKSTL GLQFMKEAAGRGEHSVIYTFEERKETLLHRAEGINIAVHAMQERGTLSVVQVEPLYYT SDEFANLVRQEVEQKQARIVMIDSVSGYRLSVRGQDLTTHIHALCKYLQNMGVAVLLI NEVETITGEFRVTEIGISYLADTIIFLRYLEMQGELRRAIGVLKKRMTDFEKTLREFK ISRYGIKVGDPLTHLRGILTGVPELLEDKP" gene 9271..10356 /locus_tag="DP116_19475" CDS 9271..10356 /locus_tag="DP116_19475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017289559.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_19475" /translation="MSIILVFVEQSENRRLLAEWLGMSYKVVVPDLVVQSGKAVPLLD EPFDLCILDGPALDYMWEWVQARKHKEQPVFLPFLLITVRSDVKLLTRNLWQSIDELI TKPIEKLELHARIEMLLRSRRLSLQLETALKQERELKEQKSRFISMVSHEFRNPLNTI AGFTRLLEQDKLSQEKRADFFQRIQAAVRRMVALLDDVLILSKSEANHLTYNPIRLAI EPFCRKLIEEIKFSISTGHTIDFNCEDECFTVYIDEAVVRHVLTNLLSNAIKYSPPDS TVGLKLQCQSETVIFQVQDQGVGIAPADQQRLFESFFRASNVGNIPGTGLGLAIVKQV VERYGGTITVKSEMNVGTTFTVALPIQ" gene complement(10350..11720) /locus_tag="DP116_19480" CDS complement(10350..11720) /locus_tag="DP116_19480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015117900.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_19480" /translation="MQIRLFRQTRTWLALWYAVVMGLILTLCGFVVYEVIVDAYLVSI KRELESVTGTLHNVIEPNLKQPGRIEPIFQQVLLNTCVIESGCPTQTIFGHKQRILAE DDIISTIHRDKQYYIRFIDSSGRLIAVVGFLPDKLPPTVQTKVWQTVKDPQGNRYNQK SLPLHTQDNQVWGYIQVGRSLKELDNRLAALKLVLALGLPITVLLVGGSSWWLAGLAM RPIDRSYKQMQQFTSDASHELRTPLAAINATVETVLDTEHLSLTEARDTLASIQRQNY RLAELVGDLLLLSRLDQQELTTQWEPCCLNILINDLIEEFSALASAASLQLTSSVLCH QPLYVMGDEDQLLRLLSNLIANAIKYTRAGGYVTVILKRNNGHAVIEVQDTGIGIAPG EQKRIFNRFYRVNSDRSRTTGGSGLGLAIATAIVQAHGGSLCVQSEVGKGSTFIVQLP LKTFRY" gene complement(11710..12411) /locus_tag="DP116_19485" CDS complement(11710..12411) /locus_tag="DP116_19485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010993934.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_19485" /translation="MKVLLVEDEPDLGASIQRKLSKEKYIVDWILDGTEAWICLENQW TEYTLAIFDWLLPGISGLELCKRLRVHGNPLPVLMLTAKDSMADKVAGLDAGADDYLV KPFGMAELLARLRALQRRSPQFQPQQLQVGSLILDYGSGTACYQQPNGNSQVISLTKK EFHLLEYFMKRPNQIVSREQLLSHLYTLNAERISNVVAAQIRLLRRKLSELGCDGFIE TIPSMGYRFNSSDAN" gene complement(12688..14211) /locus_tag="DP116_19490" CDS complement(12688..14211) /locus_tag="DP116_19490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863467.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SulP family inorganic anion transporter" /protein_id="PRJNA477356:DP116_19490" /translation="MKKLKTNKLKRSLSKDIVASFVVFLVALPLSMGIAIASGVPPAR GLVTGIIGGIVVGTISGSALQVSGPAAGLAVIVAELVQNNGIEMLGPVILLAGLIQLL AGVFKFGKIFRAISPAVIYGMLAGIGVLIFASQFHVMLKEKPRATGIENLISIPISLY HTFFDYDNISHLMAAAIGVTTLIVLLLWDKFKPRSLKLIPGALIAVVVASTLANLFHL PIPYVDLPANLTEVIQVPTPANLIRLINPPLLISAAAIAFIASAESLLSAAAVDRLHQ GSKTDFDRELAAQGFGNMVCGALGVLPMTGVIVRSSVNVKAGAKTRLSAILHGVWILV LVVAAPGLLKMIPMSSLAAILVVTGYKLVEVENIRKLKQYGRVPVVIFFATFIGIVAA DLLTGVLIGIVLTTIALIYKISYLNIYLERDEKNRRIDIHIEGTATFIRLPKMASVLE QMPADTELYVHLENLAYIDHSCLDWLSMWAEQQQQRGNTVTIHWDRLEKRFRRPFQS" BASE COUNT 4059 a 3176 c 3112 g 4005 t ORIGIN 1 cacgtccgga tttaactgag aggtgcgcgg gataataccc gccatcgact caaccttatt 61 aggtgaaact agaggcggcg acagaaaaag taaaaatatt gaggatgagt aaaaatgaaa 121 actgaacgtg aacgcgaaca attaattaaa gatatcaacg tactgctaaa tcaagcttat 181 gactgtacct tagatgaaat actggcactc ttacaaaatg ttgaagatga agaggatgaa 241 gaggatttaa aatctgttaa agaagccaga gaagagatac gcttacacgg tacattctct 301 tgggaagaaa ttaaaaaaga aattgctgaa gagagaaaac aagacgtggc ttaattgatg 361 acttacacag ttgaattctc tcctagtgca agaaagatgt ttaagaaatt acctcaagac 421 ttgcaagacc gtatacaacc taaaatagat gctttagcca cagaaccccg tcctagtgga 481 gttaaaaagt taaaaggcga agagaatact tatcgaatta gagttggcag ttatcgagta 541 gtttatgaaa tagaagatga 
tgtattgttg gttactgtga ttagagtggg tggtcgtggg 601 gaggtttata actaataact gttaaagact ttctaaacta tttaatacct ctgtagccga 661 ctgatatctt aaattgcagt tttgattcac cattctattt aataccgcag caaatttgtc 721 actgatttct gttaaatcgt gccaattctg caaagtacca tcttgccgag atataaccac 781 ccttaaagta ttaaagtctc tttggaacgt tttaggatgt ctccctgtta aggcttgaat 841 agcgagtatt cccaacgcat aaatatcgct gtttagtctt ggttgtccgt tcgtttgttc 901 gggagacaca taaccaggag taccaattgc gactgtatta tttggctggt gagtttgaat 961 ctgcttgacc gcaccaaaat caattaaaat gattcgtcca tcactttctc gcctgattaa 1021 attactaggt ttgatatctc gatggatgac attataaccg tgaacaaaaa caagtatcag 1081 caaaacttct ttaagtagag atacaacatc agcttctgca aactttttat taggagtaat 1141 ctctttttct agggaattcc ctttgataaa ttcttgaatt aaataaaatt cttgattttc 1201 ttcaaaataa gccaataatt gaggaatctg ctcatgctga cccagaactt ctaaaatttc 1261 tgcttcagtg ttgaaaagtc gtctagcaac ccttaaaaaa ctttcttctt gatgagcggg 1321 tcgtaattgc ttgaccacac attgaggatg actaggacgc tgagtatctt gagctaaata 1381 agtttcacta aatcctccat gacctagagg ttcgatgatt ttatagcgtc ttttcagcag 1441 gtgcatctgg gagggaattt gaggtggtgt tgttagacct gaggaattat tgctctggct 1501 tataaatgct tgcagttgga taatggcttc ttcttgctgc tgaacctgtt gtgctatttt 1561 ttctttctct tgctgactct gatatagcgt gtaaccaaga acgctaccag ctgctaacac 1621 caaccccagt gctggcgaga cgactggaat ccatcctgct tgggtaaaaa taaaaaagtt 1681 accagcaaat aatactacaa gacctgttcc accaacaatt gctagatata aaggatgctg 1741 aatgcgtgat gcaattatac ctccagttaa actccatacc aaaatccaaa ccacttcacc 1801 ccattcaggc aaaaaccaaa atagagtttg tttgttattg agaacagcac taagaatttg 1861 actgacactt tgggcatgaa tctccactcc cgccattcta ccggatgtgt ctgacttgcc 1921 agtgcttagt ggtgtattga aaatatcctt taaactgggt gcggttgaac caatcaagat 1981 gatgcggttt ttcactaaat cagaaccaac tttatcctca agcacatttg ttagagtaac 2041 ttttttagct acctgagatg aacggtagtt gaggataatt tggtaaccac cagtatctat 2101 attttgataa ccaccagaat tactttctag acgtttaaat acagtcttgt caagtttcaa 2161 ctcttggttg ggagtagatt gcagtcggat gccttcaagt tccaagtatt ttagtgctag 2221 ttgtaaactt aaggaatatt gtgctgcaca 
agcgtcattt gctgctgggt cagacaacag 2281 taaattgcga cgtattgtac tatcagaatc ttctacaaca tcaataaatc cgacttccca 2341 tgacttaatt ccctcaggag gagatactcc acgctcattt ttatcattat gtttgcaaat 2401 tggaataata atatcgctct gttgcaaacg tttcaataac ttgttgtgac caggttcaat 2461 gggcaagtca cgataaatgt caagaccaat tgctcgcggt tgatattgct caagtttacc 2521 caagagattg tcgagaactt cccccgataa acggtctgaa gtgggcgatt tccatttttt 2581 gatgtcatct tcagtgactg caacaatcaa caggcgggaa tctgaaccag aatctactcg 2641 catttgcatc atctggtcaa aaagtctgat ttctaatggt tctaacactc ctagccgttg 2701 aacggcaaac agtaaacctg tgacaatgac gctagaaaaa actgttgata atgtgaagaa 2761 atttttcata acggttgctg tcaagcatat aattatagca gttttcactt gagtgaaata 2821 cactatctta gatcttgcac ctatccatat atagcggttc tctgtagggt gggcattgct 2881 cgctgatatc ttaaatgtcc attacataag ctattatggg caatgcccac cttacgattt 2941 gtggtctaat catctgaaaa gcactgtaag cctgggctta gtcaaaaaca gcgcaataaa 3001 gttatatgat tttttattaa cttttctcaa gaaaataagg gctttaggct tcgtattgag 3061 tgtgaaaata atatcaggtt cgcttaaaca cttataatat ctgtaggttg gggaggacag 3121 tgccgggcgt agtcgccgag gtcttggggg tttccacgcc aggtgctaca acggggggaa 3181 ccccaacgcc agataccaag tgagggaaac cctcctgcag tactggctcc gcaacgcact 3241 ggctccccat gaggaactgc cgaaagggtt tcccgacagt cgcgcacctg tccgttgagc 3301 cagcgcgaat gacgctctcc ctcacttggc gactggcgtt tgagggacga aacctcagcc 3361 agtgctgcat aagctcgcat ccgcaaaggt agacgggtgt caaaacgcaa ttgtagctcg 3421 ttcaacacca gaaactcccc gatttgcgga tggtaaactc ttacgacgac atcgctttca 3481 cggctaatcc actgaaactc acaggcaaga atttctcgtg ctaaaacatc gggggactgc 3541 gttacccatt tcacccaacc atctggtgct agactaatca aacgcttccc accgatgtct 3601 gttgctttag ccatgttttt ccattcttcg tacccttaag tggtatactc cgcgttaatc 3661 ttcacgtagt cataactcaa gtcacagccc caagccttac ctgaactaga accgttgcca 3721 atactgacgg aaataatcac tggattatca attctttgct tgatggtact acgatcgact 3781 gagagatcgt tgctcatgtt tgtagctacc aagtcttttg gcacagaagc atctgctgca 3841 gcttgtttca aatattcact tgcagctttc ttatcaaatg gtaatggttg accattctcc 3901 atcagcaaga tatctcctaa ctttattttc aggttttctt 
gctcaaaagg aactcccgca 3961 cgtccggcgg ctgcggcgat acgtccccag ttgggatcgc gtccaaagat ggcagacttg 4021 acaagagaag aacctgcaat ggttttggca acttgacggg ctgcttgttc atcctgtgcc 4081 cctgtgactt gcacttctat aagacaagtt gcgccttcac cgtcacgggc gatcgccttc 4141 gccaaatgct ggcacactgc tgttaacatt gcctctaatt tttcggcttc tgcccccatt 4201 tcggtaattg ctggggtgcg ggattgaccg ttggcaagag cgattaaact gtcattagta 4261 ctcgtatccc catcaacagt gatggaattg aaacttctat cagccgcgcg actcaacatt 4321 tgttgccaga gagtgggaga cacagctgca tcacaagtta caaatgctag cattgttgcc 4381 atgttgggat gaatcatccc agaacctttg gcgatacctc caattcgtac tgggcgatcg 4441 cccatcattg tctccaaagc aatagatttt ctcaccaagt ctgtggtaat tatcgcacca 4501 gctgctgcat ctgaccctgt ttccgaaagt gctgcaacga ctttgggtat tcctgctttg 4561 agtgcatcca tacgaatgcg ttgcccaatc acacccgtgg aagcgagtag caccgattct 4621 gagggaatgt tgagtgcttg cgctactgcc attgctgatt ctatcgcgtc taaccagcct 4681 tgatgaccag ttgcagcgtt ggcttgtcca gcgttgcaaa ggatagcacg ggcgctatgc 4741 tttgcttgca agctttgacg gcaataatcc acacaggcag ctttaacatg actgatggtg 4801 aagacacctg cggcgatcgc ctccacatct gagactatca aagccaaatc gggcaatcct 4861 gatggtttca accctgctgc aattcctgct gagcgatacc ctctaggtgc tgtgatacca 4921 ccactaattt cctgccagtc tgccattgtt ttcccccgtt tgatcaaaac gccagatttt 4981 tgtttatgac gttcagcatt ctgagttctt gtcaatggtg attataccaa gcttttggtt 5041 ctactttccg ttttgaaact cttgacaaag tagcagccat tcatcgctaa taattattaa 5101 acaaataaaa agagagccac ttgggcagct ctccagatca tcagggtgca tctttgtata 5161 tcatattaat acataagggg ggtgattggc tagacccctc atttattttt tttttcagaa 5221 attcttcaga ggacaccaaa cccaaaaagg ggataaaaaa aatagagagc cacttgggta 5281 gctctccaga tcatcagggt gcatctacat aacatagtat agcactttct gactgagggg 5341 caagacccct cacccccttt tttcaaaatt ttttttgaga caattcaaaa atcaccatac 5401 tgagtatctt ccagcagatg cttattgcga aaaatgaagt ctagaaatat tagattttcg 5461 taatgttggt aatacaaccc gatgattgtt gagaatggat cagagattac gagagcaact 5521 tgatactccc acctcatcca aatttcggtg ttgcacttgg ggaaatgtga atcacgcaat 5581 ggcatagagt gcccgcccgt ttggggcgaa tgatttttgc accttcctga 
tgtcagcttt 5641 aggaactgaa cacttatctc aaaattaata agctcttaaa acctagttaa tactcagttg 5701 ataagttaat cttgccacaa aagtcgtaag tcacaagtaa aaaggctttt ctgtgtggct 5761 ttttgacgtc agccttgtac ttcaagcacg aacaatgtgc tgtctcaaac aagatcaagt 5821 tagcaatgag tccgttttta tccagctttt agttttccct gtatttcaac aatctccaat 5881 taaggataca tatggaacgt tttcagtttg agcgcttact agctactaaa cacaagcgac 5941 gacgctttct cattggtact cttggtgtaa gtgcaagtgt cattgcgagt caatggactc 6001 acagagttgt tgcacagcct agtttttctg gttatccgtt cagtctcggg attgcttctg 6061 gtgaaccctt accagatggt attgtgctat ggacgcggct tgctccagaa ccgttaaatg 6121 gtggtggaat gccgtctgtg aatgtgcctg tacagtggca agttgctttg gatgaaaaca 6181 tgagaaatgt tgtgcgacgg ggtacggcga tcgccactcc cgaatttgca cactctgtgc 6241 atgtagaagt cggtgggttg cagcctgatc gttggtattg gtatcagttc aaagcaggta 6301 atgaagtcag tagcatcgga cgtactcgta cagcgccagc acgcgataca cgtgttgctc 6361 agttccgctt tggctttgtt aactgtcagg attggcaaaa cggttactac acggcttacc 6421 agggcttggc tcaagaggaa ctagatttgg tcgttcactt gggtgactat atatatgagt 6481 atggaccaca accaggaggt ccacgtcaac ataatagtcc tgaaatcgtc actcttgctg 6541 attatcgaaa ccgtcatgct ctttacaaaa ctgacgccaa cctgcaagca gctcatgctg 6601 cttttccttg gatagttact tgggacgatc atgaagtaga aaataactac gccagcttta 6661 ttccagaaga aaatcaaagt caacaggagt ttgtcacacg tcgtgccaac gcttaccaag 6721 cttactacga acatatgccc ctacgtcggt tatcgctgcc tcgtggtccg tatctacagc 6781 tgtatcggcg ttttactttc ggggacatag ctgagtttaa cgtgctagat acccgtcagt 6841 accgcagcga tcaaccttgc gatgacggct taaaaccccg ttgttctgaa gcttttgatc 6901 caaatgcgac tatgactggt acaaagcaag aacagtggct atttcgaggt ttgagcaagt 6961 cccaagctcg ctggaatgtt attgctcagc aaactatgat ggcacaatat aactttgatg 7021 cgcgtccagg gcaagaagtt ttcaatttgg atcagtggga tggttatgtg gctgcgcgcg 7081 atcgcctctt gaaatttcta caacaacagc aacccagcaa tccagttgtc attactggcg 7141 acattcattc gagttgggta cacgacttaa aaacggattt caacaatcca aactccccca 7201 ctgtcggaac tgagttcgta ggtacctcaa tttcttccga ttttcctgaa gcttttattc 7261 ccccaacagt agcagcttta agcgccaatc ctcataccaa attctttgac ggcaaaaacc 
7321 ggggctatgt gcgttgtcat ctgacacaaa atacttggca aagtgactac cgggttgttt 7381 ccacgattcg tgaaccgaat gcaactatca gcactctcgc ttcgtttgtc gttccaaacg 7441 ggcgagctgg tgcccaacaa acctagtacg tatgcgcaaa gggtgggaac gcagttgcct 7501 acggaggagc caatgcgctt ttgaagtctg actctgtaca agtacccacc tatgcaggtg 7561 attgcccaag tggagcagaa tagacgcttt tgtttcctaa agcgtctatt ctccatcaaa 7621 aacggtatac tttacatcaa aataatataa gcctgaactt tttgattcct aattggtagt 7681 aaaaacccat gcagttgaaa gagcacgcta tgctggaatt gcgctaagcc tatagggaaa 7741 ggaagactgc ttacaaatgt cggacaatcg tttgtcttcg ggaatttcgg gtttagacga 7801 agttctctac ggtggttatg ttccaggtcg cgcctactta attagaggtg gacctggggc 7861 tggcaaaaca acgctgggaa tgcatttttt aacaaccggg gcagcaagag gcgaacaggt 7921 tttgttcatt accctggcag aaaccgtcac acaactgagg cgaacatccg aagggctggg 7981 atttgaccta gaaaacatca cctttcttga cctcagcccc acacctgaat ttttcaccca 8041 agttcagact tacgatattt tctcaccggc tgaggtagaa cgtgaaccga caacccgtcg 8101 aattgtacag caggtagaag ccctcaagcc gcagcgcatt tttattgatt cgatgacaca 8161 gtttcgctac cttgcaaccg atgcgtttca gtttcgtaag caagtgctgt cgtttctgag 8221 atttttggtc gagcaagaca tcactgttct attcacctca gaaagcagtg aagaagcgcc 8281 cgatgatgat ttacagttta tgagtgatgg agtacttaat ctaaacttca gccagaatga 8341 gcgcacgctg tgtatctcca aatttcgggg aagtgacttt caaaacggca atcatgcaat 8401 tcgcctaacc agcacaggaa tgcagctctt cccccggtta ataccacaaa tttacggaca 8461 agcttttact actgaagtga tctcctctgg gattccggaa atcgacgagt tgctacacgg 8521 tgggattgag cgcagcacca ttaccattat cagcggtccc agtggcgtgg gtaaaagtac 8581 gttgggactc cagtttatga aagaggctgc cggacggggg gaacattcgg tcatttatac 8641 tttcgaggaa agaaaggaaa cgctgctgca ccgtgcagaa gggattaaca ttgcagttca 8701 tgcaatgcag gagcgcggga cactctcagt cgtacaagtg gagccgctgt attatacatc 8761 cgatgaattt gctaaccttg tgcgtcagga agtagagcaa aaacaggcgc ggattgtcat 8821 gatcgatagt gtgtccggct atcggctttc ggtgcgcggg caagacttga caacccatat 8881 tcatgcgcta tgcaagtatt tgcaaaacat gggtgttgct gtactgctga ttaacgaggt 8941 tgaaacgatt acaggggaat ttcgagttac agaaattggc attagctatc tggcagacac 9001 
gattatattt ttgcgttact tagaaatgca aggcgaactg cgacgggcga tcggcgttct 9061 caagaagcgg atgaccgact ttgagaaaac cctgcgcgaa tttaaaatta gtcggtacgg 9121 aattaaagtc ggcgatccac tcacacacct ccggggtata ttgactgggg tgcccgaatt 9181 acttgaggat aaaccgtgat taatagcttg tgtcgtttag ccataacaaa cctttgcagc 9241 agcattgcca gttagcaagg tgtaagagcg atgagtataa ttttagtttt tgtagagcaa 9301 tctgagaatc gccgcttgct ggcagagtgg ctgggaatgt cttacaaggt tgtggttcca 9361 gacttagtgg tacagtcagg aaaagctgta ccacttttag atgagccatt tgacttatgc 9421 attcttgatg gtccagcact ggactacatg tgggagtggg tgcaagcgag aaaacacaaa 9481 gagcaacccg tttttctgcc gtttttgctg attacagtcc gctctgatgt caaactattg 9541 acacggaatt tgtggcaaag tattgatgag ctgattacaa aaccaattga aaagctagag 9601 ttgcatgcac gaattgaaat gctgttgcga tcgcggcggc tttcactaca gcttgagact 9661 gcactcaagc aagaacgcga acttaaagaa caaaaatcgc gctttatctc aatggtttcc 9721 catgaatttc gcaacccact aaataccatt gctggtttca ctcgtttgct agaacaagac 9781 aaactatctc aagaaaagag agcggacttt tttcaacgta tccaagctgc tgttcgccgc 9841 atggttgcct tgctagatga tgtcttgatc ctaagcaaat ctgaagcgaa tcatctaacg 9901 tataatccca ttaggttagc gattgagcct ttctgccgca agctgatcga agaaatcaaa 9961 ttcagcatat ccactggtca cactattgat ttcaattgcg aggatgaatg tttcacagtt 10021 tatatagacg aagccgtagt gcggcacgtt ttgaccaatc tattatctaa tgccattaaa 10081 tactcaccac ccgacagcac agttgggctt aaattgcaat gtcaatcgga aacagtgata 10141 tttcaagtac aggatcaggg cgtcggtatt gcgccagcag atcaacaacg actgtttgaa 10201 tccttttttc gtgctagtaa cgttggcaat attcctggaa ctggattagg actggcgatt 10261 gttaaacagg tagttgagcg atatggcgga acaattacgg taaagagcga aatgaatgtt 10321 gggacaacat tcaccgtagc cctgcccatt cagtagcgaa atgtcttgag tggcaattga 10381 acgatgaagg tgctgccttt accaacttcg ctttgcacac agaggctacc tccatgtgct 10441 tgaacaatag cagtcgcgat cgccagtcct aatcctgatc caccagtggt acgcgagcga 10501 tcgctattca cccgataaaa gcgattaaaa atccgctttt gctcaccagg tgcaatgcca 10561 atccctgtat cttgaacttc aattacagca tgaccgttgt tgcgtttgag aatgacagtg 10621 acataaccac cggctctagt gtattttatg gcattagcaa ttaaattaga aagcagacgt 10681 
aaaagttggt cttcatcccc catcacatac aacggttgat ggcatagaac tgaagatgtt 10741 aattgtaaag aagctgcact tgctaacgcg gaaaactctt ctataaggtc attaatcaaa 10801 atgttgagac agcaaggttc ccattgtgtt gtcagctctt gctgatccaa tcgagacagt 10861 agcagtaaat caccaaccag ttcagcgagt cggtaatttt gacgttgaat agatgccagg 10921 gtatctcgtg cttctgtaag agacaaatgc tccgtatcaa gtacagtttc tactgttgca 10981 ttaattgctg ctagaggagt acgcaactca tgggaagcat cagacgtaaa ctgttgcatt 11041 tgtttgtatg acctgtcaat tggtcgcatc gccaaccccg ctagccacca actagaacca 11101 ccaacgagga gtactgtaat gggcaatccc agcgccaaaa ccaattttaa agcagcgaga 11161 cgattatcaa gctctttgag agagcgcccc acctgtatat agccccaaac ctgattatct 11221 tgagtatgca acggcagaga cttttgatta taacgattgc cttgcgggtc tttgactgtt 11281 tgccacactt ttgtttgcac agttggtggt aatttgtcag gcaggaaacc aaccacagca 11341 attaaccgtc ccgaactatc gataaaacgt atgtaatatt gcttatctct gtggatagta 11401 ctaataatgt catcttcagc aaggatgcgt tgtttgtgcc caaagattgt ttgggtagga 11461 caaccggact caattacaca agtattgagt aaaacttgct gaaagatcgg ctctatgcga 11521 ccgggttgtt tcaaattcgg ttcaatgaca ttatgcagtg tgcctgtcac agactctagt 11581 tcccgtttga tggaaactaa gtaagcatca acaattacct cataaacgac gaaaccgcat 11641 agagtcaaaa tcagacccat aacgactgca taccacagag cgagccaggt gcgagtctga 11701 cgaaacagtc taatttgcat cgctggaatt gaaacgatag cccatactgg ggatagtttc 11761 gataaaacca tcacaaccaa gttctgacaa tttacgtcgc aacagtcgga tttgagctgc 11821 caccacatta ctaatacgtt ctgcattcaa cgtataaaga tggctcaaaa gttgctcgcg 11881 gctaacaatt tggttgggac gtttcatgaa gtattccaac agatgaaatt ccttcttcgt 11941 taaagaaatg acctgactat taccattggg ctgttgataa caagccgtac cactaccgta 12001 atctaaaata agactaccaa cttgaagttg ctgaggttga aactggggcg atcgcctttg 12061 caatgctcgc agccgcgcca gcagttccgc catcccaaat ggtttgacca aataatcatc 12121 agcccctgca tccagtccag ccaccttatc tgccatactg tctttagctg taagcatcag 12181 cacgggcaag gggttgccat gaacccgtaa tcgcttgcag agttctaaac ccgatattcc 12241 tggcagtaac caatcaaaga ttgccagtgt atattccgtc cattggtttt ccaggcaaat 12301 ccaagcttct gtgccgtcta aaatccagtc aacaatatat ttttctttgc 
tcaattttcg 12361 ttgaatcgat gcgcctagat ctggctcatc ttcaactagc agaactttca taacgagcag 12421 atataccttt gaagtttgtt catcaataaa ctttaagcgt agctacacca atagatagct 12481 ctgtattaat gctaacaata agagcggaaa ttttgtatag cttatcagca gcagttactg 12541 ctgtttttta cgttattcat tttaagagtc ggcgtagttc cagaagtaac tggaatgaca 12601 ctactgttta atagtttgac agactgaaat gaaattcgga tgaaatttta tactgttgaa 12661 gcagattttt gcaaaaatgg cggactacta gctttgaaaa ggtctgcgga agcgcttctc 12721 cagtctatcc cagtgtatag tcactgtatt tcctctctgt tgttgttgct ctgcccacat 12781 tgataaccag tcaaggcaag agtggtcaat ataagccaaa ttctcaaggt gaacatacaa 12841 ctcggtatct gctggcattt gctctagaac agaagccatc tttggcagcc gaataaatgt 12901 tgcagttcct tctatgtgga tatcaattcg gcgatttttt tcgtccctct caaggtatat 12961 atttaggtag gatattttgt aaattagcgc tattgtggtc agcacaatgc caatcagcac 13021 acccgtaagc agatcggcag ctacaattcc aataaaggtt gcaaagaaaa tgactacagg 13081 cacacgcccg tactgtttga gtttgcggat gttctctact tccaccaact tgtagcctgt 13141 aacaaccaga attgcagcga ggctggacat gggaatcatc tttagtaaac caggtgcggc 13201 aactaccaag acaagtatcc atacaccatg aagtattgct gacagtcttg ttttagcccc 13261 tgctttgacg ttcacagagc tacggacaat cactcctgtc attggtaaga ctcctagcgc 13321 cccgcaaacc atgttaccaa aaccttgtgc agctaattcg cggtcaaaat ctgtttttga 13381 gccttggtgc agccgatcca ctgctgctgc tgagagtagg ctttccgcac tggcaatgaa 13441 ggcaattgcc gcagctgata tcagtagcgg tggattgatc aggcgaatca agttggcagg 13501 tgttgggact tgaatgactt cggttagatt tgcaggtaga tcaacataag gaattggcag 13561 atggaacagg ttggctagag tgctggcaac taccacagca attaatgctc caggtatcag 13621 ctttaagctt cgcggcttga atttatccca cagcagaagc acgattaagg tggtgacacc 13681 aatagcagca gccatcaaat ggctaatatt gtcatagtca aaaaaagtat ggtaaaggga 13741 tatgggaatc gaaattagat tctcaattcc agtagcacgc ggcttttctt taagcatcac 13801 atggaattgg gaagcgaaaa ttaacacgcc aatacctgca agcatcccat aaataactgc 13861 tggagatatg gcgcggaata ttttaccaaa cttaaagact cccgccagca actgtatcaa 13921 acctgccagg agtataaccg gaccgagcat ttcgattcca ttattctgaa ctagttctgc 13981 aacaataacc gctagcccag ctgcaggtcc 
actgacttgc aatgctgaac cagaaatagt 14041 cccgacaact attccaccta ttattcctgt caccaaacca cgagccggag gaactcctga 14101 agcgatcgca atccccatcg acagtggaag agcaacgaga aacacaacga aagacgctac 14161 tatatctttg ctcaaagaac gtttaagttt attagtttta agctttttca tgacaaaatt 14221 cccgaaaaac tacttttctg cttcagtatt tgcattaatc caagtaatgc tatattcgct 14281 tttatcaatg gactaaatga gcatttttac tactcatttt ggaagctatt tttttgattt 14341 ttgacagtga at // LOCUS NODE_2361_length_14307_cov_4.89117314307 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14307) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14307) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14307 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 599..1222 /locus_tag="DP116_19495" CDS 599..1222 /locus_tag="DP116_19495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19495" /translation="MRQEIEELPDISTLPPEKQSEQNDLPSEKGVDYTRLRDLLAGGK WKEANQETLAVMLKASGREKDRYLDVESIENFPCTDLRTIDQLWVKYSNERFGFSVQK RIWKSVGGKPDANYETWEKFGDRVGWRKGMFTKQWQDYEKLTFSTNAPWGHLPFTPCD WVGAVREFEISRGGMRVLFSRVTTCSIEVKYYFSNLNLGIESPNLTP" gene complement(1566..1850) /locus_tag="DP116_19500" CDS complement(1566..1850) /locus_tag="DP116_19500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system mRNA interferase toxin, RelE/StbE family" /protein_id="PRJNA477356:DP116_19500" /translation="MRVLIWDNSFKRAFKRVVRKNPRLEETIFEVLELLTTDPFAPAL KSHKLKGDLDGLWACWVEYDCRIIYTFEPNPDADEEMIVLIDIGSHDEVY" gene complement(1847..2068) /locus_tag="DP116_19505" CDS complement(1847..2068) /locus_tag="DP116_19505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198690.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19505" /translation="MSKPASLQTVIEYVEALSTEEQDLLLELIHKRRVEKRRQEIASN AAQTLEAIKTGRAKRGTLADLRADLLSDE" gene 2364..3023 /locus_tag="DP116_19510" CDS 2364..3023 /locus_tag="DP116_19510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012596863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19510" /translation="MRAIANQLKKHDIKIWLDEEQIPPGRSFQDEIQKAIPLVKSAAI FIGLKGLGKWQRMEVRSLTTKCVEKDIPLIPVLLPGVTELPETLVFLKEYTWVEFSKS TDDPQALHNLVWGITATQSSPQQKPTELWFNGDRLQQFHKALLSAFPTTAKLKQMVRF KLDDNLDARAGGANHSEVVSNLIVWAKAEGRLEELLTAARKENPGNQNLRRFDEQIRG G" gene 3224..5386 /gene="ppk1" /locus_tag="DP116_19515" CDS 3224..5386 /gene="ppk1" /locus_tag="DP116_19515" /EC_number="2.7.4.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319105.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polyphosphate kinase 1" /protein_id="PRJNA477356:DP116_19515" /translation="MPKSKKAAHQINLNDPQYYLNRELSWLEFNKRVLHEACDSRTPL LERLKFLAIWSSNLDEFFMVRVAALKQQVEAKVSLLASDCRTPEQQLDEISSMLRPLV AKEHEQFEKVLRPQLAEHGINILDYIDLTQKQRNYLDKYYEEQIFPVVTPLAVDPSHP FPYISNLSLNLAVVVKNPETQEELFARVKVPTVLPRFLALPPDLGIQDNGKPAVWTGV PLEQAIAHNLESLFPGMNIQEYHTFRITRDGDLALKEDDADDLLLAIEQELRKRRVGG DALRLEIHSQTPESIRKRLLEDLELEENDVYEVDGILGLKDMMYFMSLPVPELKDQPW QAVVHPRLQRIKEPNLNPDAREIEEGKDFFTVIREKDLFVHHPYQSFSSSVVNFIAHA AHDRNVLAIKMTLYRTSFESPIVNALIAAAENGKQVSVLVELKARFDEENNIYWAKRL ESVGVHVVYGLVGLKTHCKIVMVVRREQDRISRYVHIGTGNYNHKTARLYTDLGLFSC NEELGADVTDVFNFLTGYSRQKSYRKLLVAPVTMRDRFLSLIQREIENVHNGLTGRIV AKMNSLVDPEIICHLYEASRAGVQIDLIVRGICCLRPGLKDISENIRVISIVGRFLEH SRIFYFHNNGQEEIYIGSADWMRRNLDRRVEVITPILDADIAKDLQEILGIMLADNRQ AWELQPDGSYIQRRPGEEVPEASSQKILMSMALNSNAN" gene complement(5440..6021) /locus_tag="DP116_19520" /pseudo CDS complement(5440..6021) /locus_tag="DP116_19520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015158902.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="N-acetyltransferase" assembly_gap 5619..5628 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(6113..6199) /locus_tag="DP116_19525" /pseudo CDS complement(6113..6199) /locus_tag="DP116_19525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412538.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="peptide ABC transporter ATP-binding protein" gene complement(6207..7691) /locus_tag="DP116_19530" CDS complement(6207..7691) /locus_tag="DP116_19530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412537.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polar amino acid ABC transporter permease" /protein_id="PRJNA477356:DP116_19530" /translation="MKKRKVWLSTILFAPILVMSLITGCSHSLNAASSLGKDTLTMIT SPDYPPYDFYDTKKGERQIVGFDIDIAKTIAKELGFKLQIMQSDFNGLIPALQANRAD FAMAGMSPTPERKKNIDFSIIYYQAKDTIVAPKNSNLKQPQDLAFKKVGVQLGTIQEQ NAKKIAQKVTGIQLKQLNKVAETIQEIKSGRIDAAIIEDTVARGFVQANPELGFNVIP SEQKSGSAIAFPKGSSFVEPFNKVLQQMKDKGELEKLVTKWFSQTTATVSSPSAKGGL NLDFTRIIPEIPFILKGIPLTLLFTLLSVFLGLIWGTVLSLCKITSINPLVWVANAYT SVFRGTPLLLQLALVYYATPQLTGYNISALEAGVLTFTLNSGAYMSETIRGGIQAVDK GQAEAAMSMAIPYWLMMWDIILPQALKNILPALVNETIGLLKDSALVSTIGVVEILRS AQIVGANKYIYFEPLLFAGLIYYLLVMGMTRSASVLERRLRQSQ" gene 8130..8648 /locus_tag="DP116_19535" CDS 8130..8648 /locus_tag="DP116_19535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein" /protein_id="PRJNA477356:DP116_19535" /translation="MTHIPHPDSPGITVTCAIVTVSDTRSKQTDKSGQLIQELLRNAN HVIEAYTIVKDEPVQIEDQMERLSQYPNLNVVIFNGGTGIAPRDTTYDTIIKLLEKTL PGFGELFRFLSYQEIGSRAMASRSVAGVYKQKLIFSLPGSSNAVRLAMEKLILPELVH LVGQLHNSNSKQ" gene complement(8587..8814) /locus_tag="DP116_19540" CDS complement(8587..8814) /locus_tag="DP116_19540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002784362.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19540" /translation="MVTYDDCINYLNEILYPPRLEGVIFYQLSVISYQLAVFVLHNLV HCSLFTVYCSLFTVYCLNCATVQLNEQVLAK" gene complement(8801..9904) /locus_tag="DP116_19545" CDS complement(8801..9904) /locus_tag="DP116_19545" /EC_number="2.7.8.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315380.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phospho-N-acetylmuramoyl-pentapeptide- transferase" /protein_id="PRJNA477356:DP116_19545" /translation="MDAKLSPNQGLKIINGIGLVSLLGVGLGVSALVLDGIANRLPWQ GVSLTLPFLFCALASAAVGFWVVPLLQALKTGQIIREDGPQAHLKKAGTPTMGGIFFV PVAVITACVWSHFAIEVLAVSALTLSYGLIGWIDDWQILRRKSNKGISPRMKLALQIG FALAFCLWLIFTQPFDITNIALPLGFSLPLGLLFWPLAGFVLVAESNATNLTDGIDGL AAGTVAIALLALGALSAPSSIGLMVFCACMSGSCIGFLAHNRNPARVFMGDTGSLALG GALAAVGLLTNSLVALFILSGIFFVETLSVMAQVSYYKATKGSDGKGKRLFKMAPLHH HLELSGWSELQVVAVFYIVSAILAVISLTLGHL" gene complement(9987..10226) /locus_tag="DP116_19550" CDS complement(9987..10226) /locus_tag="DP116_19550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998455.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19550" /translation="MLNSPLREVPRNQRASVIPLKQESSLLDWLKSGGRLIARDVHEP DFLDDEEEITEFLSTEDGIGEYDFDDDDDSAPDEE" gene 10453..11037 /locus_tag="DP116_19555" CDS 10453..11037 /locus_tag="DP116_19555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fibrillin" /protein_id="PRJNA477356:DP116_19555" /translation="MLRKASLLEAIAGKNRGLLATEQDKQAILVAIANLEDVNPTPSP IEATDLLNGNWRLIYTTSIALLNIDNLPLYKLGSIYQYIRVETNSIYNIAETYGLPFF EGIVSVAAKFEPVSYRRVNVKFERSIIGLQRLIGYTSPESLVEQIEVGKKLTAIDFPL NSDKQPGWLDITYIDNNLRIGRGNEGSVFVLTRA" gene 11419..11631 /locus_tag="DP116_19560" CDS 11419..11631 /locus_tag="DP116_19560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140455.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit IV" /protein_id="PRJNA477356:DP116_19560" /translation="MVQRGSKVRILRRESYWYQDVGTVASIDQSGIKYPVIVRFEKVN YSGINTNNFAQAELLEVEAPKAKAKK" gene 11875..12726 /locus_tag="DP116_19565" CDS 11875..12726 /locus_tag="DP116_19565" /EC_number="3.2.2.23" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878434.1" /note="Involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. Acts as DNA glycosylase that recognizes and removes damaged bases; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-formamidopyrimidine glycosylase" /protein_id="PRJNA477356:DP116_19565" /translation="MPELPEVETVRRGLNQLTLNQEITGVQVLLHRTLAHPFSVEELF IGIKESFIVTWHRRGKYLLAELSFSPSALSAGWLGVHLRMTGQLLWLHRDEPLHKHTR VRFFFGDERELRFVDQRTFGQIWWVPPGVAPESIITGLGKLAVDPFSPEFTVEYLALK LRNRRRAIKTALLDQSVVAGLGNIYADEALFLSGVLPETLCTNLQPEQIERLHSCIIQ VLKASIEAGGTTFSNFLNVKGVNGNYGGVAWVYNRAGEPCRVCGTPIQRIRIAGRSSH YCVQCQR" gene 13123..13338 /locus_tag="DP116_19570" CDS 13123..13338 /locus_tag="DP116_19570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410284.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase" /protein_id="PRJNA477356:DP116_19570" /translation="MAVKRGNMVRAVREKLENSLEAQASDSRFPSYLFETKGEVVDIK GDYALVKFGKVPTPNIWLRLDQLEEFK" gene 13358..>14307 /gene="mdh" /locus_tag="DP116_19575" CDS 13358..>14307 /gene="mdh" /locus_tag="DP116_19575" /EC_number="1.1.1.37" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878436.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="malate dehydrogenase" /protein_id="PRJNA477356:DP116_19575" /translation="MSYSPFSPIKRHSPRVTVIGAGKVGSTLAQRIAEKNLADVVLLD VVEGMPRGLALDLMEARGIELHNRRIIGTTDYADTSGSDIVVITAGFPRKPGMTRDDL LLTNAKIVVEAANKAMTYSPDAIYIIVTNPLDVMTYLAWQATGLPGDRIMGMAGVLDS ARFETFIAMELGVCSLDVKAMVLGGHGDLMVPLPRYATVNGIPITELLDAATIERLVE RTRNGGAEIVELMQTGGAFYTPASSTCVMVESILLNQSRLLPVAAYLQGEYGLNDIFI GVPSRLGSSGIEEVVELKLSDVERDALHTSAQEVRKNITRA" BASE COUNT 4247 a 3025 c 3055 g 3970 t 10 others ORIGIN 1 ggggttgtgc gggatttata agcaattaac cggacatgat atgattgctt cgcttcgctc 61 gcaatgacac cacaaactct tcatttacct aactgccacc atgtctagca aacttgtaga 121 gcggcgtttg aatgacttta agcgtagatg tggtaacgat ggcgatgcag ccttgcaact 181 ggcgtatcac gctgctatgc ccgttgcctt aaatcctgag ttgctgcact ttctgcgaat 241 aaacttcttt gttgatccac ctgagcaact tccttataca gtcgagtttg agtttctcag 301 ttctgggctg tgtcgtgaaa ttgacgcaga actgtatgaa attgagccag aaattcgcaa 361 tgagttgttg cagaagttga tgaaaaggga aaatgcaaga caacgaattc gagatgttgc 421 tacactgctt tggcagtatg ttgaatatca ttcgccttgg gcagatcgag ttgaattaga 481 acgggcgcaa caactgactg cactgaattt tctagatcct gcaaaagcac aagagtggtt 541 ggatgaagcg gaagctaatg gtagcttagg gagaggggag cgggaatggt tcatagcaat 601 gcgtcaggaa attgaggagc ttcctgacat ctctacccta ccgccagaga agcaaagtga 661 gcaaaatgat cttccttcag aaaaaggcgt agattacaca cggctgcgag atttgctagc 721 cggtggaaaa tggaaagagg cgaatcagga aactctagca gttatgctca aagcatctgg 781 tagagaaaaa gatcgctatc ttgatgtcga atctatcgaa aattttccct gcactgactt 841 acgcacaatt gaccaacttt gggttaaata cagcaatgag cgcttcggct ttagtgtgca 901 aaagcgcatt tggaaaagtg tcggtggcaa accagacgct aattatgaaa cctgggagaa 961 gtttggcgat cgcgtgggtt ggcgcaaagg aatgtttaca aaacaatggc aggactacga 1021 gaaattaacc ttctccacaa atgccccctg gggacatctc cctttcactc cctgtgattg 1081 ggtcggtgcg gtaagggagt ttgagatttc gcgggggggc atgcgggttc tcttctctcg 1141 cgtaacgaca tgttcgattg aagtaaaata ttatttttct aatttaaatt taggcattga 1201 gtcgcctaat ttgacgcctt gaagccctct tttgattggc ctcgacttgt aaactgtaac 1261 atataaggct tacacagaat 
tgtaaagatt cgcaaacaag tgattttcca cgaagtattg 1321 cggtttctac gttttcgtta cctggcactg catccaaaga gagtcttctt gttgcgcttg 1381 tagtacgatg ctggcaaatt gtggattata acaattcgcg aaaataaact tgtacgctac 1441 ttcttctatt tcttggttcg ttgcttccca gtcgtcccaa aatgagacct aaattcacga 1501 ttattgcaat aggtggtgat gtgatagcag aatctagctt taaaagtttc gcggagaacg 1561 aagcatcaat acacctcatc atgagagcca atatcgatta aaacaatcat ctcttcatcc 1621 gcatcaggat ttggctcaaa tgtatagata atgcggcaat catactcaac ccaacacgcc 1681 cacaaaccat ctaaatcacc tttcaactta tgtgatttta aagcaggtgc gaatggatcg 1741 gtcgtaagta attccaaaac ctcgaaaatt gtttcttcca aacgaggatt tttacgaaca 1801 actcgcttaa aagcacgctt gaagctatta tcccaaatta gtactctcat tcgtcactca 1861 gcaaatcagc ccgtaggtca gcaagagtac cacgttttgc cctacctgtt ttaattgctt 1921 ccagtgtttg tgctgcattg ctggcgattt cttgacgccg tttttcaact cgccgtttat 1981 gaattagttc cagtagtaaa tcttgctctt cggttgaaag agcctctaca tattcaatga 2041 ctgtctgtaa ggatgcaggc ttgctcatca gcaatgttta taggttttca tgattttatg 2101 tttctttatt ttatagcagt tctcaactgg gtgtaataca gattttttgt agggcaaagg 2161 caagaacttt gtaaatacgt tatattatgc ccttatagcc atccgtaact cttgtggaaa 2221 cgttccccaa cgcaaattgt agcaaaaaaa agtaaagttt tgaattatta gttactatac 2281 caaaaatatc agacacccag tcacaaccta aaagcgaata attcgatgtc ttcctcgcac 2341 acaacagtgc agacaaaccc gaagtgagag ccattgcgaa tcaacttaag aaacacgata 2401 ttaagatatg gctcgatgaa gaacaaattc caccaggaag atcatttcaa gacgaaatcc 2461 aaaaagcaat tccactcgtt aaatctgccg ccatctttat tggtttgaaa ggattaggaa 2521 aatggcagag aatggaagtg cgctcattaa caacaaagtg tgttgagaaa gatatccctt 2581 tgattcctgt tctccttcct ggcgtgactg aacttccaga aacattagtc tttttaaaag 2641 agtatacatg ggtagaattt tctaaaagta ctgatgatcc tcaagcatta cataacttgg 2701 tgtggggaat tacggcaact caatcatcac ctcaacaaaa gccaactgaa ctgtggttca 2761 atggcgatcg cttgcagcaa ttccacaaag cactcctgag tgcatttcct accacagcaa 2821 aactcaagca gatggttcgc ttcaaattag atgataattt agatgcgcgt gcaggaggtg 2881 caaaccactc agaagtcgta tccaacctga tagtatgggc taaagctgaa gggcgactcg 2941 aagagctatt aactgctgct cgtaaagaaa 
atcctggtaa ccaaaatttg cggagatttg 3001 acgaacaaat acgcggtggc tagcaatcac ctaatttttc aattcagttt tatagaagaa 3061 ttcgactaca gggggcttct aaccctcttt tacccccttt ttaaggggtc gccgcaggcg 3121 ttctgatctt aaccgaaccg tattggaagt gatggctttt ttcctctact atgtttagta 3181 tgctatccga gttgtaagct acatctccct ttctgtggtc atcatgccaa aatcaaagaa 3241 ggccgctcat caaatcaatc tcaacgatcc tcaatactat ctcaaccgag agttaagttg 3301 gctagagttt aataaaaggg tgttacatga agcctgcgac tcacgaacac cccttctaga 3361 acggcttaag tttttggcaa tatggagttc taacctggat gaattcttta tggtgcgcgt 3421 tgctgcactt aagcaacaag tagaagcaaa ggttagcttg ctcgcttctg attgtcgcac 3481 gccagaacaa cagctagacg aaattagctc tatgctgcgt ccgttagtcg ccaaagagca 3541 tgaacaattt gagaaagtct tacgacctca acttgccgaa catggtataa atattttaga 3601 ttacatagat ttaacccaaa agcagagaaa ttatttagac aaatactacg aagagcaaat 3661 ctttcccgtc gtgactcctc ttgctgttga ccccagccac ccctttcctt acatttccaa 3721 tctcagcttg aatttggctg ttgttgtcaa aaatccagaa acacaagaag aattgtttgc 3781 cagagtcaaa gttcccacag ttttgccacg atttctagct ttaccgccag atttgggaat 3841 tcaagataat ggcaaaccag cggtgtggac cggggttcct ttggaacagg cgatcgccca 3901 taacctagag tccctctttc cgggaatgaa tatacaagaa taccatacct tccgtataac 3961 ccgtgatggt gacctggcat taaaagaaga cgacgccgac gacttgctgt tagctattga 4021 acaggaactg cgaaaacggc gcgttggtgg agatgctttg cggttagaaa tacattctca 4081 aactcctgaa tccatcagaa aacgactgtt ggaggattta gaattagaag aaaatgatgt 4141 ttacgaagta gacggtattc tcggactgaa ggatatgatg tacttcatgt ctttacccgt 4201 cccagaactt aaagatcaac cttggcaagc tgttgtacat cctcgtttac aaaggattaa 4261 agagccaaat ctaaacccag atgcacgaga gatagaagaa ggaaaagatt tttttacagt 4321 cattcgggaa aaggatttat ttgtacacca tccctatcaa tccttttcaa gttctgtggt 4381 gaactttatt gcccatgctg cccatgatcg gaatgtgcta gcaattaaga tgactcttta 4441 tcggacttct tttgagtcac cgatagtcaa tgccttaatt gcggctgctg aaaatggtaa 4501 gcaggtttct gtactggtgg aactcaaggc gcggtttgat gaagaaaata atatttactg 4561 ggcgaagcgg ttggaaagcg ttggagttca cgtagtttat ggtctagtgg gtttaaagac 4621 tcactgtaaa attgttatgg ttgtacggcg cgaacaagac 
cgcatttccc gctatgtaca 4681 tattgggact ggtaattata accataaaac agcgcgattg tacacagatt tgggattgtt 4741 cagttgcaat gaagagttgg gagcggatgt tacagatgtt ttcaatttct tgacaggata 4801 ctcgcgacaa aaatcatacc gaaaactttt ggttgcacct gtgacgatgc gtgatcgctt 4861 cctttctctg attcagcgag aaattgaaaa tgttcataat ggtttgactg ggcggattgt 4921 tgccaaaatg aattcattgg tagacccgga aattatctgc catttatacg aagcttcccg 4981 cgctggcgtg caaattgact tgattgtgcg cggaatttgc tgtttacgcc ctggactaaa 5041 agatattagt gaaaatattc gcgtgattag tatcgttggt cgctttttgg aacactcccg 5101 cattttttat tttcataaca atggacagga ggaaatttac attggtagcg ccgactggat 5161 gcgacgtaat ttagaccgtc gggtagaagt tataacccca atcttagatg cagatattgc 5221 taaagatttg caagaaatct taggaattat gttggcagat aaccgccaag cttgggagtt 5281 acagccagat ggtagttata ttcaaagacg ccctggtgaa gaagttccag aagctagttc 5341 acaaaaaatt ctcatgtcaa tggctttaaa ctcaaatgcg aattaattaa gtatcttgtg 5401 gctgactctg cactccctgc agagttctcc caagcaaagt tagtgttcta gagtccgtcg 5461 gctaccctcc cacattttcc gcgtaagttg ccattcggtt tgactccatc cccgcgtctg 5521 catccaagct gaaccttggc gcgtcccaac tgctataaac ccatatcggc gcgccaagcg 5581 ctccacccgt gaatttgcgc taacagaaat tcctcgaann nnnnnnnntc ctcgaatctc 5641 tttcaatccc agatcgcgga acccaaactc gataagcgcc ctagaaacct caattgcgta 5701 gccgtaacga ccccagcact gtggtgctag ctcgatccct agttccgctt tgtcagggcc 5761 atatccttca cggcgcagac cgcagcagcc aaatatctct tgcgggttct ggaggtgagc 5821 gatcgcaaac tgataattgc tacgtggttg ttcgactgcc cattgaccga aaagccccag 5881 aagcgatcgt gcatactcag gtgttacctc ttccggcgca caaaactccg cataccgtgg 5941 atcggcatgg taggcaataa acgcaggctc atcttcctca ataaactcgc gcagcaaaaa 6001 tcttttggta atgatctcca tgcttagcca aacatactgt agccgtctaa cttgtcatta 6061 tatggaaatg ttcaggataa tactttttta tcggggtgga tacaaataaa aaaacgactt 6121 caccaaggtg aatatccgta aaaatatctt tgagtaggtc gagtttgcca aaggatttgc 6181 acaagaattc tgtgcaaatt acgactttac tgactttgtc ttaaccttct ttctaaaacg 6241 gatgcacttc tagtcatacc cataactaaa agatagtaaa ttaaccccgc aaatagcaga 6301 ggctcaaagt aaatatactt gtttgcacca acaatttggg cactgcgtaa 
tatttctact 6361 acaccaattg ttgataccaa agccgagtct ttcaatagtc caatagtttc atttactaat 6421 gctggtagaa tattcttcaa tgcttgcggc aaaattatgt cccacatcat caaccagtaa 6481 ggaatagcca tagacatcgc cgcctcagct tgtcctttat ctactgcttg aattccaccc 6541 cggatagttt ccgacatata agccccagag tttagggtaa aagttagcac ccctgcttct 6601 aaagccgaaa tattatagcc agtcagctgc ggtgtcgcat agtaaaccaa agccaactgt 6661 aacagcagag gtgtgcctcg aaataccgag gtataggcgt tagcaaccca gacaaggggg 6721 ttgatactag taattttgca cagagagaga actgtacccc aaattaatcc taaaaatact 6781 gataacagcg taaataacaa cgtcagaggg atgcctttga gaataaaagg aatctctggg 6841 ataattctgg taaagtctag attcagtccg cctttagcag aaggtgaaga tacggtagcg 6901 gttgtttggg aaaaccattt ggttactaat ttttctagtt ctcctttgtc tttcatttgt 6961 tgcaggactt tattaaaagg ttctacaaag gaggaacctt taggaaaagc gatcgccgat 7021 ccacttttct gttctgaggg aataacatta aaacctaatt ctggattagc ttggacgaat 7081 cctctcgcaa cagtatcctc aattattgct gcatcaattc gccctgattt aatttcttga 7141 atggtttccg ctactttatt gagctgcttt agctgaattc ctgtgacttt ttgggcaatt 7201 ttttttgcat tttgctcttg gatagtccct agttgtaccc caactttttt aaaagctaaa 7261 tcttggggct gttttaggtt gctatttttg ggagcgacaa ttgtatcttt tgcttggtaa 7321 taaataattg aaaaatcaat atttttctta cgttctggag tgggagacat cccagccata 7381 gcaaaatcag cccgatttgc ttggagtgca ggaattaatc cattaaaatc ggattgcatg 7441 atctggagtt taaatcctag ttctttagca atagtcttgg caatatctat atcaaagcca 7501 acaatttgcc tttcaccctt ctttgtatca taaaaatcat aaggcggata atctggagaa 7561 gttatcatcg tgagtgtgtc tttgcctaaa gatgaggctg cgtttagaga atggctacaa 7621 cctgtgatca gactcatcac taatattggt gcaaacagga tggttgataa ccatactttt 7681 ctttttttca tgaatttgat aattattgag gaaaacagtt tataactctc caagattatc 7741 attagcccaa ttgaaatcag tgaacagtaa acagtgaaca gggaacaggg aacaggaaac 7801 agggaacaga gtcagcgatt tcttgacccc ccttaacttg gtaactgata actgataact 7861 ggtaactgcg gctggtactt gttgaagaat tcatgtcaaa tctaaaactc tatattaatc 7921 aggtgtttgc ggcgatgaga caaactcttc aaactgaaca attcttcttg aagaaatact 7981 agaaattatt atgtgaccaa atctacatga aattatgaat gcgcttctac agctaaaact 
8041 tatgataatc tctgtagtgg actacaagtt agtggttagt aaaaaaacca ataactaaca 8101 actaacaatt aacacaacat aatgcacaga tgacgcacat ccctcaccca gattccccag 8161 gcataacggt aacttgcgct atcgtaactg tcagcgacac acgctcaaaa cagacggata 8221 aaagtggtca gttgattcaa gaattactcc gcaatgccaa tcatgtcata gaagcttaca 8281 cgattgtcaa ggatgaacct gtgcagattg aagaccaaat ggaacgactg agtcagtatc 8341 caaatttgaa tgtcgtgatt ttcaatggtg gtacaggtat tgcaccgaga gataccacat 8401 atgatacaat tattaaattg ctggaaaaaa ccctccctgg gtttggtgag ttgttccgct 8461 ttttaagtta ccaagaaatt ggttcacgag cgatggcatc tcggtctgta gctggtgtct 8521 acaagcaaaa actcatcttt tcccttccag gctccagcaa tgcagtacga ctggcaatgg 8581 aaaaacttat tttgccagaa cttgttcatt tagttggaca gttgcacaat tcaaacagta 8641 aacagtaaac agtgaacagt aaacagtgaa cagtgaacag tgaacaaggt tatggagaac 8701 aaatactgct aactgataac tgataactga taactgataa aaaataaccc cctctagtcg 8761 aggaggatat aatatttcat tcaggtaatt tatgcaatca tcataggtga ccaagggtca 8821 agctgatgac agccaaaatt gcactgacta tgtagaagac cgcaacaact tgcagttctg 8881 accaaccaga gagttccagg tgatggtgta atggtgccat cttgaacaaa cgcttgcctt 8941 tgccatcaga acctttggta gctttgtagt agctaacctg cgccattact gaaagggttt 9001 ccacaaagaa gattccgctg agaatgaaca gtgcgactaa actgttagtc agcaagccca 9061 ctgcggctaa agcgcctcct aaagccaagg aaccagtgtc tcccataaaa acacgggctg 9121 ggttacggtt atgagccaag aaccctatgc aactgccact catacaagca cagaaaacca 9181 tcaatccaat tgaggagggg gcgctgaggg cacctaatgc tagtaatgcg atcgccaccg 9241 ttcctgcagc caagccatca ataccatcag tcaggttagt ggcgttactt tctgccacca 9301 gtacaaagcc cgctaagggc caaaatagta atcccagggg tagagaaaaa cccaaaggca 9361 aagcaatatt cgtaatgtca aaaggttgag taaaaattag ccacagacag aacgctagtg 9421 caaaaccgat ttgcaaagct agtttcatcc ggggagatat acctttatta gatttacggc 9481 gcagaatttg ccagtcatct atccagccaa tcaatccata gcttaaggtc aaagcagaaa 9541 ctgcaagtac ctctattgca aagtgagacc acacacaggc agttatcact gccacaggaa 9601 caaagaatat gccccccatc gttggagtac ctgctttttt tagatgagcc tggggaccat 9661 cctcacggat gatttgcccc gttttcagtg cttgcagtag tggtactacc caaaaaccga 9721 
ctgcggctga agcgagtgca cagaacaaaa atggcaaggt taacgacaca ccttgccaag 9781 gcaatctatt cgctattcca tctaacacta gtgctgatac acccagacct acacccagta 9841 gagagaccag acctatgcca ttaatgatct ttaacccttg gttaggagat aattttgcgt 9901 ccacggaaac ttcccttcac tccacacttg caaaactaaa caatagggac gagttgtgcc 9961 agtaatatac cttcagccca gattatttac tcctcatctg gagcagaatc atcgtcgtca 10021 tcaaagtcat attcgccaat tccgtcctca gtactcaaaa actctgttat ctcttcttcg 10081 tcgtctagaa aatcaggctc gtgaacatca cgggctatga gacgaccacc tgatttcaac 10141 cagtcaagca gcgaggactc ttgctttaat ggaataacag acgcccgctg gtttcggggt 10201 acttcacgca atggagaatt tagcatatca acgaagacaa tataagggga aggtagctag 10261 acactaagag tctatcactt gatacagcac tatttacttg ttaattcaac tcttagctta 10321 aaaatccgac aaaaaatttt ttatttatga ttgatacagg aaaacgtgaa accctaatag 10381 ttatcggtat ggcactatac atagttaatc aatatttttg gaactgaatg atgttttttg 10441 aggtgaaaaa caatgctaag aaaagcgtct cttttagaag caattgctgg taaaaatcgg 10501 ggactacttg caactgagca agataaacag gctatcttgg tagctattgc aaatttggaa 10561 gatgtaaatc ctacaccatc tccaattgaa gcaacagatt tgctcaatgg caattggcga 10621 ttaatatata ccacaagcat tgctctatta aacattgaca atttaccctt gtacaagctc 10681 ggttcaattt atcagtatat tcgcgtagaa actaacagta tttacaatat agctgaaact 10741 tatggcttac ctttttttga aggcatagtc agcgttgctg caaaatttga gccagtttca 10801 tatcggcgcg tcaacgtgaa atttgagcgg tctattatcg gtttacaacg cttaatagga 10861 tacacctcac cagagagttt ggttgaacaa attgaagttg gcaaaaaact caccgcaatt 10921 gattttcctt tgaacagcga taaacagcca ggttggctgg atattaccta catagataac 10981 aatttgcgta ttggtagagg taacgagggc agcgtgtttg ttttgactag agcataagaa 11041 cctcctacac aagtgcttgc gaccaatgaa attatgcgat ctcaagaact ttgtcaagga 11101 ggtattcata cttttttgaa aaattataga ataaggaata ggtcacaagt ggtaggtgag 11161 agtctaaaat ctttatcagc tatcctctgt aaccattccc tgtaacctat ttcttgcacc 11221 cgacaactta ttccctattt accttagagg acgaaaatta atcgtacaag ttcctcctag 11281 agtgaccact gtaatgtctt agggagccct tagcaagata aataacttaa caaagcttct 11341 tggtgaggcc tcctggttgt agagtgaaat caattagcct tgtgtattgg 
caatttataa 11401 cttaaaggga agaaactcat ggttcaacgt ggttctaaag tgcgtattct ccgccgggaa 11461 tcctactggt atcaagatgt cggaaccgtg gcgtctattg accagagcgg tattaaatac 11521 ccagtcattg tccgttttga gaaagtaaac tactctggca tcaacaccaa taactttgct 11581 caggcagaat tattggaagt tgaagctcca aaagcaaagg cgaagaagta aggtagaaaa 11641 caaagggaaa cagggagaaa gtgacaaagg gaggaaactt ctgttgcttc cccactcacc 11701 ccatctgttt gactcaacag gagtagcacg ccagttgcta tctcctcttg ccttcgcaag 11761 tgaacaaagg agagccgaag cgctttgttc gcaatggctc cccttctata gctttgcctc 11821 atcagtcatg ctgtcctccc cctattccct tgtattctgt gcctttcaaa caagatgcct 11881 gaactccctg aagttgaaac agtccggcgg ggtctgaatc aattgaccct taaccaagaa 11941 attacgggtg tccaggtgtt gcttcatcgc acgcttgccc acccgttttc tgtagaagag 12001 ttgttcattg gaattaaaga gagtttcatc gtaacttggc atcggcgcgg caaatatctc 12061 ctagcggaac tctccttttc tccctctgct ttgtcagctg gctggctggg ggttcatctg 12121 cgaatgacgg gtcaacttct gtggctgcat cgagacgaac cgttacacaa gcacacacga 12181 gtcagattct tttttggaga tgaacgcgag ttacgctttg tagatcagcg tacctttggt 12241 caaatctggt gggtgcctcc aggagttgca ccggaaagta ttatcacagg tttgggaaaa 12301 ctagcagttg accccttttc accggaattt actgttgagt atttagcgct taagctgcga 12361 aaccgccgcc gtgcaataaa gacagcactt ctggatcagt cagtagtggc gggtttaggt 12421 aatatctatg ctgatgaagc gctgtttctg agtggagttt tgccagaaac tttatgtaca 12481 aatttgcaac cagagcaaat tgagcgtttg cactcttgca tcattcaagt tctaaaagcc 12541 agtatcgagg ctggtggtac tacgttcagt aacttcctaa atgtcaaggg agtcaacggt 12601 aactacggtg gtgttgcctg ggtttacaac cgtgctggag aaccctgtcg agtttgtggt 12661 acaccaattc aacggattcg gatagctggg cgttccagcc actattgtgt tcagtgccaa 12721 cgataagcaa tataaaagta aaagtgaaaa aagttacaac taaatataga aaatactaat 12781 gtataaaatt aatgtattaa tatgatacaa ttttgatttt agattctgaa ttttgaataa 12841 acgtagatga gggcagacaa tgatctagaa aattatctag aaactatctg cgtacatccg 12901 cgtgcatctg cagttgcaat tttggattct gaatcaacta ctaccagtag gaagtttcag 12961 gaatctaaac atagcagcca taaaaccaac tgaatatcta cgcaaaaaat gtacaagaca 13021 tatcccaaca caaataggaa ggagataact 
cctggctcta caaagacttc ctgcgagcta 13081 caatcctgaa gaaaacaatt tacacactca cgagggaaac tcatggctgt aaaaagagga 13141 aatatggttc gtgctgtccg cgagaagctg gaaaacagtc tagaagcaca agctagtgat 13201 tctcgttttc cttcctattt gtttgaaacc aagggtgaag tcgtagatat caaaggtgac 13261 tatgcccttg tgaagttcgg gaaagtgcca actccaaata tttggttacg tctggatcaa 13321 cttgaagaat ttaaataata ctcaaccatt agcagatatg agctattccc ctttctcccc 13381 aattaagcgc cattcacccc gtgtcaccgt tatcggtgct ggtaaagttg gtagtacctt 13441 agcccaacgc atagctgaaa aaaatctggc agatgtcgtg ttgctagatg ttgttgaggg 13501 tatgccccga ggactggcac ttgatttgat ggaggcaagg ggaattgaac tgcacaatcg 13561 tcggattatc ggcacaaccg actacgctga tacatctggt tctgatatcg tggtcattac 13621 agcaggtttt cctcgcaaac cgggtatgac tcgggatgat ttgcttttga cgaatgcaaa 13681 gattgtcgtg gaagctgcaa acaaagcaat gacttattct ccagacgcta tatacataat 13741 tgtcacaaat cctttggatg tgatgacgta tttggcttgg caagcaactg gactgccagg 13801 cgatcgcatt atgggtatgg ctggtgtgtt agactcagca cgctttgaaa cgtttattgc 13861 gatggaatta ggagtttgtt cccttgatgt taaagcaatg gtgctgggcg gtcacggaga 13921 tttaatggtg cccttgccgc gttatgctac tgttaacggt attccgatta cagaactact 13981 ggatgcagcc acaattgagc gattggtaga acgtacccgt aacggtggtg cagaaattgt 14041 agaactgatg cagacaggag gtgcttttta tactcccgcc tcttctacct gcgtgatggt 14101 ggaatcaata ttgctgaatc agtcacgtct gttgccagta gcggcgtatc ttcagggtga 14161 atacggttta aacgatattt ttatcggtgt tccctctcgt ttaggatcta gcggaattga 14221 ggaagtcgtg gaattaaaac taagcgatgt agaaagggac gctttacata cttctgcgca 14281 agaagtacgt aagaatatta cccgggc // LOCUS NODE_2364_length_14287_cov_5.17327214287 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14287) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 14287)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..14287
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(44..952)
/locus_tag="DP116_19580" CDS complement(44..952) /locus_tag="DP116_19580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867267.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="site-specific DNA-methyltransferase" /protein_id="PRJNA477356:DP116_19580" /translation="MTQEQQTPQNTSDFTPYYSQKNGAIYLGDSLKLLTFLEDSSINL ILTSPPFALTRKKEYGNESAEKYIEWFLPFAYEFKRVLADNGSFVLDLGGAYLPGSPV RSIYQYELLVRLCKEVGFFLAQEFYHYNPARLPTPAEWVTIRRIRVKDSVNVVWWLSK TPYPKADNRKILKPYSQSMKQLLKNGYKAKIRPSGHDISDKFQKDNKGAIPPNLLEIA NTESNSVYLRRCKAAGIKPHPARFPQSFAEFFIKFLTDEGDLVLDSFAGSNTTGFVAE ILQRRWISFEINEDYIIGSRYRFEDL" gene complement(1075..1515) /gene="cynS" /locus_tag="DP116_19585" CDS complement(1075..1515) /gene="cynS" /locus_tag="DP116_19585" /EC_number="4.2.1.104" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454577.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyanase" /protein_id="PRJNA477356:DP116_19585" /translation="MSIPEMTQTLLAAKKEKGLTFADLEKILGRDEVWIASVFYRQAS ASEEEAKLLVEALGLETIYVRELTEYPVKGLGPAVPTDPLIYRFYEIMQVYGMPIKEV IHEKFGDGIMSAIDFTLNIEKEEDSKGDRVKVVMSGKFLPYKKW" gene complement(1753..3036) /gene="glyA" /locus_tag="DP116_19590" CDS complement(1753..3036) /gene="glyA" /locus_tag="DP116_19590" /EC_number="2.1.2.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454495.1" /note="catalyzes the reaction of glycine with 5,10-methylenetetrahydrofolate to form L-serine and tetrahydrofolate; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine hydroxymethyltransferase" /protein_id="PRJNA477356:DP116_19590" /translation="MTLTNSDFLFSSDPAVAELINQELQRQRDHLELIASENFTSAAV LAAQGSVLTNKYAEGLPGKRYYGGCEFIDKIEQLAIDRAKQLFGAAHANVQPHSGAQA NFAVFLTLLEPGEKFMGMDLSHGGHLTHGSPVNVSGKWFQACHYGVNQETEQLDYDQI RELALRERPKLLICGYSAYPRVIDFEKFRNIADEVGAYLLADIAHIAGLVASGLHPDP IPYCDVVTTTTHKTLRGPRAGLILTRDPELGKKLDKSVFPGNQGGPLEHVIAAKAVAF AEALKPEFKTYSAQVIDNARALASQLQNRGFKLVSNGTDNHLMLVDLRSIGMTGKQAD QLVSGVNITANKNTVPFDPESPFVTSGLRLGSPAMTTRGLGVEEFTEIGNIIADRLLN PDSAEIAEDCRRRVKALCDRFPLYSHIMIPVPAFA" gene 3205..3462 /locus_tag="DP116_19595" CDS 3205..3462 /locus_tag="DP116_19595" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19595" /translation="MRTAIVGESKGKYYIALETISDFPFAFTLLGVNTPHVKIGVEFS DHGDSNQFTRNRASSKKSEINLCITFDYKPKQPAYNKLNST" gene 3667..3864 /locus_tag="DP116_19600" CDS 3667..3864 /locus_tag="DP116_19600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19600" /translation="MLVILMDNQIFAPQQVCQSCLLADGSGQPRWRQGKLHCGQAIRK LTEQQPDQYECVMGFRVANIE" gene 4033..4512 /locus_tag="DP116_19605" CDS 4033..4512 /locus_tag="DP116_19605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492956.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19605" /translation="MAWRGGTTVQHRLLSCLPYLLPFIEVQNFAQLPLLRSLYLPFIP VIQLYYAIPFGSLIIFFALYLLVVRNEKVQHFVRFHTLQALLLSIFAYLCGAILDLIG IVQEGASISVPLFQSVMFTLIFLAVVGASIYSVVQAVRGLYTEIPLISQAAYSGTRD" gene complement(4524..5225) /locus_tag="DP116_19610" CDS complement(4524..5225) /locus_tag="DP116_19610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113589.1" /note="response regulator in two-component regulatory system with CusS; regulates the copper efflux system; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_19610" /translation="MHILFVEDESRIANFVRAGLKEQGFVVDYCDNGDDGYIRAMENE YDAIVLDIMIPGKDGLFILKHLRREGRNVPVILLTARNELDDRLEGLNLGADDYIAKP FFVEELVARIHAVVRRSMGVSGATPQEYRQNLLCVGPLKLDRITREVTCNQQVVELTT REFNLLEYLMRSPGRVFTRMQILEHVWSYDFNPNTNVVDVCIQRIRKKIDPISGTAWI ESVRGVGYRFCKPES" gene 5459..6121 /locus_tag="DP116_19615" CDS 5459..6121 /locus_tag="DP116_19615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875066.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="macrolide ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_19615" /translation="MIWMESITKSYRLDDMELPILKGIDLSIEEGEYVAIMGMSGSGK STLMNILGCLDRPTAGYYVLEGRNLSTLASDELAYIRNRRIGFVFQQFNLLARSTALE NVMLPMVYANVPKSKRRQRAIQALTRVGLAERLHNRPSQLSGGQQQRVAIARALVNNP ALVLADEPTGALDTKTSQEVMDLLTDLNNQGITIVIVTHEPDIAAQTKRTIHVRDGLV VT" gene complement(6253..6780) /locus_tag="DP116_19620" CDS complement(6253..6780) /locus_tag="DP116_19620" /inference="COORDINATES: protein motif:HMM:PF00201.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19620" /translation="MALLIQQLHNWRISFKYCGNIQLRFCYVMSSFLVCPGYMKKLTC RGQHYNVRVEKFLPHAHLLPYVDVMVTNGGFNGVQIALANGVPMVTAGQTEEKPEICA RVQWAGVGVDLKTSTPTPKQIQEAVMKIVNSSQYRQRAENFKTEMSHYDAPTLATKLL EQLASTNLPVFRTFQ" gene complement(6632..7045) /locus_tag="DP116_19625" CDS complement(6632..7045) /locus_tag="DP116_19625" /inference="COORDINATES: protein motif:HMM:PF03033.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19625" /translation="MTRFLIGTIAATGHVNPALPIAQKLVESGHEVWWYTGIGFKDKI EATGAHHVPIRTGIDLTDSSTIPQSWLEQKDALKGLDQFKFYLKHGFIDSAVTQLEDL IQILREYPAQVLLCDVFFLGMSWLHEKTDLPWAAL" gene complement(7096..7449) /locus_tag="DP116_19630" CDS complement(7096..7449) /locus_tag="DP116_19630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015160218.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19630" /translation="MEIEFAGYQIPPGWTIIISQFVTHRLSSIYTNPEEFDPDRFAPP REEDKKVPFSLLGFGGGAHVCIGREFALMEIKIFLASLLRKYHWVITPEYSAVAPVLV PPKAQNKLRVRLTTA" gene 7602..8252 /locus_tag="DP116_19635" CDS 7602..8252 /locus_tag="DP116_19635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130904.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_19635" /translation="MGRTANTKSSSKTKRAIRDAEATKQQILDAAEVEFAKHGLFGAR TEAIANSASVAPRMIYYYFQSKEGLYQAVLQRPATQFQQILEQLNLEQLPAPEALRIF LRTIIAYEISHRYRGMLLFQEANQNQGKYFQLTNWQQPIGYITQILEKGMQEGVFCKL DPYMTTLTIAGVCVFYANAYENLKHLTPDVELLSSQMIEQYTQAAINLVLKGVLSQ" gene complement(8372..9181) /locus_tag="DP116_19640" CDS complement(8372..9181) /locus_tag="DP116_19640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="5-oxo-1,2,5-tricarboxylic-3-penten acid decarboxylase" /protein_id="PRJNA477356:DP116_19640" /translation="MAQRYVRVQNPQGQVYYGLLQPSLMVHVLDAAPWLQGQLTDLIL EPESYQILAPCTPSKIIAVGKNYAEHAAEMGTEVPTEPLLFLKPPTSIIASLEEIQYP PQSQRVDYEGELALVIGDRTIKCTPEEAQTKIWGYTIANDVTARDLQKRDSQWTRAKG FDTFCPLGPWIVREVNPGARLQTFLNEEATPVQSACIDQMVFPPDFLVSYISGVMTLL PGDVVLTGTPLGVGPLHSGDRVRVEIEGIGRLENTVTVRQPVPNQTPENQN" gene 9830..10156 /locus_tag="DP116_19645" CDS 9830..10156 /locus_tag="DP116_19645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129974.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S6" /protein_id="PRJNA477356:DP116_19645" /translation="MQIVYETMYILRPDLTDEQVEQAIAKYENLLREHGADNIQIQNR GKRRLAYEINRQRDGIYIQINYTGPGNMIAILERSMRLSEEVIRYLTMKQEVKEAQAE AITPAA" gene 10454..11008 /locus_tag="DP116_19650" CDS 10454..11008 /locus_tag="DP116_19650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319393.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19650" /translation="MSQNDTSIQVISAEASKLRQELQFRDQLVQQLSQELFRLVKGNT SFMPQPEASDRYHTQLQELREQLQAVEQQVTFYQEQISARDAEIYQLRQSVQELSDRS RMLEQVVQELPQIYRRKFEERMTPVREKVAILQRENRQLQAELQSVSYRLALKTRTAS HSGIDLPNFSRTASPQNNISTSNA" gene 11070..11768 /locus_tag="DP116_19655" CDS 11070..11768 /locus_tag="DP116_19655" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19655" /translation="MLSAIEMAIAEDTTQATSVQVVAVNNLTSIDQGVGFQPWDLQTS SELQEIIYCPLTLCLPENLVVPFEGIIKACRDIAGLRHKLAQHIQVPIGDGSYWLPVV LTAYGPLYGEAITLAEESNGKKLPDNLLASDLTYYQPLHLSDVLRQSLYHMAHNLLQF LLAPPATYLVQFGLEKSEICFDRLWPFPTAPALASVGVQKPDLFTCHWYCLKALPVLD LNIIPVAQSGFKRL" gene complement(11755..12108) /locus_tag="DP116_19660" CDS complement(11755..12108) /locus_tag="DP116_19660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747211.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="anti-anti-sigma factor" /protein_id="PRJNA477356:DP116_19660" /translation="MTITSKCQVVLFQPEGRIDLQGGIALSEKMSAIVPQRNQLWVID LAKVDFMDSSGLVSLVKSLKLARQSGCRLVLCNVQAPVRLVLELTQLDSVFEIFDTYE EIFTVVQEKSLVTVA" gene 12565..13197 /locus_tag="DP116_19665" CDS 12565..13197 /locus_tag="DP116_19665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319396.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DedA family protein" /protein_id="PRJNA477356:DP116_19665" /translation="MSLEFISLENIQKVAHEYGYWAIFLGILLENLGIPLPGETVTLV GGFLAGSKELSYWLVLADAIAGAVVGGICGYWIGRTGGWSLLVRLGKLFRISEARLLT IKEQFSENASKAVFFGRFPALLRILAAPLAGIVEMPFGKFLIYNLAGAIAWASIMVTL AFFAGRIVSLEQLVAWVSQFAILALVILAAVIAVPIWLESRQKEGEEIER" gene complement(13430..14260) /locus_tag="DP116_19670" CDS complement(13430..14260) /locus_tag="DP116_19670" /inference="COORDINATES: protein motif:HMM:PF00072.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19670" /translation="MGISSKDATRTQSPRAGNPPARLPHPNAVASQRREPPEVLSAEN FGSDNFWQGADFWAVIQVSDTGEGINSEFLPHVFEHFRQADSTTTRSNNGLGLGLAIV RHLVELHGGTVTAESQGKGKGATFTVKLPLIQENTQQPKVSRETTTHHSQLNDVQILV VDDDTDTLDLLQEILQGVGAKVTTVSSANAALQVLEEFKFDVLISDIGMPQTSGYALI RQVRLKEMGQTQRIRAVALTAYSTEEDKFKALEAGFHIFLPKPVNPVDLMGAISSLLE " BASE COUNT 4067 a 3097 c 3026 g 4097 t ORIGIN 1 tcactgttcc ctgttccctg tattcttagc ggatcattaa aatttataaa tcttcaaatc 61 ggtatcgact accgataata taatcctcat tgatttcaaa agaaatccat cggcgttgta 121 aaatttctgc aacaaaaccc gttgtgttgg aacccgcaaa agaatctaaa accaaatcac 181 cttcatcagt caaaaacttg atgaagaatt ctgcaaaact ttgaggaaaa cgtgccggat 241 gaggtttaat tcctgctgct ttacaacgcc gcagataaac actattagat tctgtattgg 301 caatttctag taagtttggt ggaatagcac ccttattatc cttttgaaat ttgtcagaaa 361 tatcatgacc gctggggcgt atttttgctt tatagccatt tttcagtaat tgtttcatac 421 tttgactata aggctttaga atttttctat tatcggcttt ggggtaaggg gttttagaca 481 accaccaaac cacattcact gaatctttta cacgaattcg tctgattgtt acccactcag 541 caggagtggg tagtcgtgct ggattatagt gatagaattc ttgggcgaga aaaaagccaa 601 cttctttaca caatctcact aaaagttcat attggtaaat actcctgact ggagaacctg 661 gcagataagc accacccaaa tctaaaacaa aagagccatt atctgctaaa actcttttaa 721 attcgtaggc aaaagggaga aaccattcta tatatttttc tgcactttcg tttccgtatt 781 cttttttacg tgtgagtgca aatggaggtg aggtcaggat 
taagttaata ctgctatcct 841 ccagaaatgt aagtagtttt aggctatcac ctaaatatat tgctccattt ttttgagagt 901 agtaaggtgt aaaatctgac gtattttgtg gtgtttgctg ttcttgtgtc aagagttaga 961 acgtttttga gttaaggcgt aagcatactt ctagaaagat tgactgagaa ctttctgcac 1021 gtctacaagg gacgaacccc caaagatgaa aaactcctca gcatgaatga aatctcacca 1081 ctttttgtat ggtaggaatt tcccagacat gacgactttc acgcgatcgc ctttagaatc 1141 ttcttctttt tcaatattca aagtaaagtc tatcgcgctc ataattccat caccaaactt 1201 ctcgtgaata acctctttta tcggcattcc ataaacctgc ataatttcgt agaaacgata 1261 aatgagggga tccgtgggaa cagcaggtcc taaacctttg acagggtatt cagttaattc 1321 cctaacataa atcgtttcaa gccctaatgc ttcaaccagt aacttcgcct cttcctcaga 1381 agcactagct tgacggtaga aaacagatgc aatccacact tcgtcacgtc ccaaaatctt 1441 ttccaaatca gcaaaggtta gtcccttttc tttttttgca gccaaaagcg tttgagtcat 1501 ttctgggatg gacactggat aactcctgtg attaattgtt acttcttttc tggaaaacgt 1561 ttgtagcttt ttcactgcat tatagtctca tattggctaa aaaacatcaa aaaaccattg 1621 ctgaatggag acataaaaaa attatcagtt atgagttata aattaataat tcataactca 1681 taactgataa ctataacaat tccaaatgaa ttgtgaactt attcggtgaa gtaccttaac 1741 ctgagcttaa tgtcatgcga atgctggtac aggaatcatg atgtgagaat acaaggggaa 1801 gcgatcgcac aacgctttga ctcgtcgccg acaatcttca gcaatctctg ctgaatctgg 1861 atttaacagg cgatctgcaa taatattgcc aatctccgta aactcttcaa ctcccaagcc 1921 acgcgttgtc attgctggag aacccaacct tagaccacta gtcacaaatg gtgactctgg 1981 atcgaacggt actgtattct tgttggcagt aatattcaca ccactgacca actgatccgc 2041 ttgcttaccc gtcataccaa tagaccgtag gtctactagc atgagatgat tatccgttcc 2101 attagatacc agtttgaagc ctcggttttg aagttggcta gccaaagcgc gagcattgtc 2161 aatcacctga gcggaatatg ttttaaactc tggcttgagg gcttcagcga aagcaactgc 2221 tttagcagca ataacgtgtt ccaatggtcc accctgatta ccagggaaaa ccgatttatc 2281 cagcttttta cccagttctg gatcacgggt taagattaag ccagcccttg gaccacgtaa 2341 agttttgtgt gtcgttgtag ttacaacatc acaataggga atggggtcgg ggtgaagacc 2401 actagcaacc aatccggcaa tgtgggcaat atctgccaat aagtacgcac cgacttcatc 2461 agcgatatta cggaattttt caaaatcaat aacgcgggga tatgccgaat 
aaccgcaaat 2521 caagagcttt ggacgctccc taagcgccag ctcccgaatt tggtcatagt ctagttgttc 2581 tgtttcctga ttgacaccgt agtggcaagc ctggaaccac tttccagata cgttgacagg 2641 tgaaccgtgt gtcagatgtc ctccgtgaga caaatccatc cccatgaatt tctctcctgg 2701 ttccagcagt gtcaaaaaca ctgcaaaatt tgcctgtgcg ccagaatggg gttgcacgtt 2761 ggcatgagca gcaccaaaca actgtttagc acggtcaatt gccagttgct caattttgtc 2821 tatgaactca cagccgccgt agtaacgttt accaggtaat ccctccgcat acttatttgt 2881 cagtacggaa ccttgagctg ctaggacagc agccgaggta aagttttcac tagcaatcaa 2941 ctccaagtga tcgcgttgac gctgtagttc ttggttgatt aactccgcca ccgcaggatc 3001 ggaggaaaaa agaaaatctg agttggtcaa agtcactaat cgctatcctt atggaaattt 3061 gcacaatagt cggtaactgt ctagaggcat gagtctaaga gttgttgact aatgactcat 3121 aactgatgac atttatttat cccgacttac tcaattatcg cttgtccaat aaatggttat 3181 gcaaaacttt gaagttctgg gcggttgcgt actgcaatag ttggtgaaag taaaggcaaa 3241 tattacatag ccttggaaac catttcagat tttcccttcg ccttcacttt attgggtgtt 3301 aacactcccc acgttaaaat cggcgtggaa ttttcggatc acggggattc caatcaattt 3361 acccgaaacc gcgcatcgtc aaaaaaatca gaaattaatt tatgtattac ttttgactac 3421 aaacccaaac aaccagcgta caataaacta aactcaacat agaaattttg atgagattgt 3481 catgacccac attcttccaa agtgagcatg ggaggttggg caaaggaaga cagacaagaa 3541 tgtaacagat gtgacaacgt ccagcataac ggacgaacca tccgctacag catttattgc 3601 catccttagc agttttctcc ccctgggtat atgtgggtca tcctcaggat tgacaggaga 3661 aattggatgc tagtaattct catggataac caaatttttg ctcctcaaca ggtatgccaa 3721 tcttgtttac ttgctgatgg aagtggtcaa ccccggtggc gtcaaggtaa actccattgt 3781 ggtcaagcta ttcgcaaact gacagaacag cagccagacc agtatgagtg tgtcatgggt 3841 tttcgagttg ctaatattga atgaccaaca gctatgttta tggtggtcag ttattggtgt 3901 tctgttttca aaaaaaatct agaataaacg tctgaatatt gacgtttact aacggtgact 3961 ttttttccta cctaatcaac tgcgttatcc ttggcatagt aacatcaacg tttaacaaca 4021 ggagaaatca agatggcttg gcgcggaggt accacagttc aacatcgcct tttatcttgc 4081 ttgccttatc ttttaccttt tattgaagtc caaaattttg ctcagctgcc tttattgcga 4141 tcgctctacc tgccttttat tcctgtgatt caattgtatt atgcgatacc atttggtagc 
4201 ctgatcattt tctttgcttt ataccttctg gttgtgagaa acgaaaaagt tcaacacttt 4261 gtccgctttc ataccttgca agctctcttg ctgtctatat ttgcttattt gtgcggggca 4321 attttagacc ttataggtat tgtgcaagag ggtgcgtcaa tatcagtacc tttgtttcaa 4381 agtgtgatgt ttactttgat tttcctagca gttgtaggtg catcaatata tagtgttgtt 4441 caagccgtaa gaggacttta cactgaaatt cctctaatct cccaagctgc ttacagtgga 4501 actcgtgact agaatagtca caatcaagac tctggcttac aaaaacgata cccaactccc 4561 cgtacactct caatccaagc cgttccacta atcgggtcaa tctttttgcg aatcctttgg 4621 atacatacat caacgacgtt ggtattgggg ttgaaatcat aactccagac atgttccagg 4681 atttgcatac gggtaaagac tcgtccggga gagcgcatca ggtattccag gagattgaac 4741 tcgcgggtgg tgagttccac tacctgttga ttgcaggtga cttcccgcgt gatacgatcc 4801 aatttgagcg gtccgacaca caatagattt tggcgatact cctgcggagt cgctccgcta 4861 acgcccatac tccgacgcac tacagcatga atgcgggcaa ccaactcctc aacaaaaaac 4921 ggtttggcga tatagtcgtc cgccccaaga ttcaaacctt caagtcgatc atccagttca 4981 ttgcgagcag tcaacaaaat cactggcaca ttacgccctt ctcgtcgcag gtgtttgagg 5041 ataaacagcc catccttccc tggaatcatg atgtcgagca cgattgcatc atactcattt 5101 tccattgctc ggatgtatcc atcatcgccg ttgtcgcaat aatcaacgac aaatccctgt 5161 tccttcagtc cggctcggac gaagtttgct attcttgatt catcttcgac gaacaggata 5221 tgcatggttt tttgatggtc atccaaaact ttgatgtttt gaactattta ttgagcaaat 5281 aaagactaag atttttgcca aagccagaat agccgtattt ttatcgatta caaaattgta 5341 attcagacaa ggggcaagct gtcatcttgg attgatgtaa ttgaaactat aaacatcgta 5401 atgccccctc aactcgaccc aatgactgcg ttatggagcg actaaaatgg ctgcaactat 5461 gatttggatg gaatcaatta caaaaagtta tcgtttagac gacatggaac ttccgattct 5521 caagggaatt gacttatcca tagaagaagg ggaatacgtc gcgattatgg ggatgtctgg 5581 ttcggggaag tccacgttga tgaacattct cggttgcctt gatcgtccaa ccgcaggata 5641 ctatgttctg gagggacgca atttaagcac attggcgagt gatgaactcg cttacattcg 5701 taatcggcgc atcggctttg tgtttcaaca gttcaatttg ttggcacgtt ccaccgcgct 5761 tgagaatgtt atgttgccga tggtttatgc caatgtgcca aagtcgaaac gacgccaacg 5821 ggcaattcaa gctttgacta gggtaggact agcagaacgc ttgcataacc gtcctagcca 5881 
actttcaggg ggacaacaac agcgagttgc aattgcccgt gctcttgtca acaatcctgc 5941 tcttgtactc gcagacgaac ccaccggagc tctagacacc aaaacatctc aagaagtcat 6001 ggacttactc actgatttaa ataaccaagg catcactata gtcatcgtga ctcacgaacc 6061 cgacattgct gctcaaacca aacgcacaat tcatgtcaga gatggcttag tcgttacata 6121 gagtcgtgtt gcaaaaaata gtcaagctct agtggaagta ttgtaaaaga tgacttaata 6181 aagaactcat aactcaatag tcagaataat agcttttgtt cagcaggttg aggacgactg 6241 taactttcta ctttactgaa aagttctaaa aacaggcaag tttgttgatg ccaattgctc 6301 taatagcttt gtagcaagag ttggtgcatc atagtgactc atctcagttt taaagttctc 6361 tgccctttgc cgatactgag atgaattcac tattttcatc acagcctctt gaatctgttt 6421 aggtgtaggt gtacttgttt tgaggtcaac tcctacacca gcccattgca ctctggcaca 6481 aatttctggc ttctcttctg tctgaccagc agtaaccatt ggtacaccat ttgctaaggc 6541 aatttgtaca ccattgaacc caccgtttgt caccatgaca tcaacgtagg gcaaaagatg 6601 cgcgtgagga agaaattttt ccactctgac attataatgc tgcccacggc aagtcagttt 6661 tttcatgtaa ccaggacata ccaagaaaga agacatcaca taacaaaacc tgagctggat 6721 attcccgcaa tatttgaatg agatcctcca gttgtgtaac tgctgaatca ataaagccat 6781 gcttgagata aaacttaaac tgatctaatc ccttgagtgc atctttttgc tctaaccaag 6841 attggggaat agtgctggag tcggtcaaat ctatgcctgt gcgaattgga acatgatgtg 6901 cgccagtagc ttcaatttta tccttaaacc caattcctgt ataccaccat acttcatgac 6961 cactttctac aagcttttgg gcaattggta aagcggggtt aacgtgtccg gttgcagcaa 7021 tcgtaccaat tagaaaacga gtcatatagc taaaaaaaag ttgatgattt tggaagattc 7081 gtgcaaattt ctgatttaag cagttgttaa gcgtactcgt agtttattct gggctttagg 7141 aggaactagc acgggtgcaa cagcagaata ctctggtgtt attacccaat gatatttacg 7201 tagcaatgaa gccagaaaaa tcttgatttc catcaatgca aattctcgac cgatgcaaac 7261 gtgagcacca ccaccaaaac caagcaacga gaacggtact tttttatctt cttcgcgtgg 7321 tggcgcaaaa cgatctgggt caaattcttc tggattggta tatatagaag acagccgatg 7381 cgtcacaaat tgcgaaataa taatcgtcca acctggggga atttgatatc ctgcaaattc 7441 aatctccatt tatttcataa atatcagcat gaaacactca tagcttgtct gtcaatagtg 7501 aattaaattt ttagttcatt atctagcaaa aaagcttaga tgcttgtaaa atctatctat 7561 agaggcttgt 
ctgttgcaaa taaaattcat tttttactgt cgtgggtcga acagcaaaca 7621 ccaaatcttc atcaaaaacc aaacgagcga tacgtgatgc agaagcgaca aagcagcaga 7681 ttcttgatgc tgcggaggtt gagtttgcca agcatggact gtttggagcg cgaactgagg 7741 cgatcgcaaa cagcgctagc gttgctccca gaatgattta ctactacttc caaagtaaag 7801 aaggactata ccaagcggtg ttgcaacgac ctgcaaccca attccaacaa atactagagc 7861 agctaaattt ggagcagtta ccagcgccag aagcgctaag aatatttttg cggacaataa 7921 tcgcttatga aatttcccac cgctaccgag ggatgttgtt atttcaagaa gcgaatcaaa 7981 atcaaggaaa gtatttccag ctaacaaatt ggcaacaacc aattgggtac ataactcaaa 8041 ttttagaaaa ggggatgcaa gaaggggttt tctgtaagtt agatccatat atgacaacac 8101 ttactatcgc tggcgtgtgc gttttttatg caaatgccta cgaaaaccta aaacatctga 8161 cccctgatgt cgaattgcta agttctcaaa tgattgagca gtatacccaa gcagctatca 8221 acttggtttt aaagggtgta ctttctcagt aaggtgaaag ctcttatcaa ctgttcccca 8281 ccctcctaac ccgaaacagt cccgattttg tcttacccgg aaaatttagc atctatgttt 8341 tgttttcctt tatcgagcgt gccattcgat actagttctg attttctggt gtctgatttg 8401 gtacgggttg tctaactgtg acggtatttt ctaagcgacc aataccttca atttccacac 8461 gaacgcgatc gccagaatgt aaaggtccta cccccaatgg tgtacccgtc agtacaacgt 8521 cccctggtag tagcgtcatc accccagaga tgtaggagac gagaaaatct gggggaaata 8581 ccatttggtc aatacaggca gattgtacag gagttgcctc ctcattcaaa aaagtctgca 8641 atctggctcc ggggttgact tctcggacaa tccacggtcc caaagggcag aacgtatcga 8701 aacctttggc tcgcgtccat tgactgtccc gtttttgtaa atcccgcgct gtcacgtcat 8761 tagcaatggt gtaaccccaa atctttgttt gagcctcctc tggtgtacac ttgatcgtgc 8821 gatcgccaat cactagcgct aattctccct cataatccac tcgctgcgat tgcggcggat 8881 actgaatttc ttccagtgaa gcaattatag atgtaggcgg cttcagaaag agcaaaggct 8941 cggtgggtac ttctgttccc atttctgccg catgctctgc ataattcttg cctaccgcta 9001 taattttgga aggagtgcag ggagccaaaa tttgatagct ttctggttcc aaaattaaat 9061 cagtgagttg cccttgtagc caaggtgcag catctaacac atgcaccatt agagatggtt 9121 gtagcaagcc atagtatact tgcccttgtg gattttgaac tcgcacatac cgctgcgcca 9181 taaccattga tgagtatcct tgttgtgcaa tcaaaccgcc agtgtgcaat ctttcaatga 9241 gcgatttagc taaacgtaaa 
aaagaaaagc tggtaagatt ataaaacctg agggattccc 9301 cccgttatag cacctggcgt cgggttaagc gcgttgtagc acaagcgcgt gaagaacgcg 9361 gtcagttaga ccgtatgagc ctgtggactg gctgatagcg tctgagagtg cgccccctta 9421 ggggctagcg tggcgttagc catagccgac ttctccagtc ctgttgcagg aagtaaacga 9481 aactcaaacc cttcgacttc gctcagggta aacccaattg aatagctttg ataggtgttg 9541 attaggtttg agtaggcttt atataacgga aagtacaaaa accatatctt ttaccaaaaa 9601 agatatagta gttgggagac acatccccca agcaatccgt acagctcgct cggaaataag 9661 gataaacaat tcctaaaagt gagcaagtat cctggcaact ccattgcacg agagagcgtc 9721 aaaaagtccc ttgttgcttt gtaatgttga ttggtgttgg tatcgtaact cgcgactatc 9781 aatacaagtt aacatcggca gccatttagt ccatgaggag attagaatca tgcaaatcgt 9841 ttacgaaaca atgtacatcc tccgtcctga tctgactgat gagcaggtag agcaagcgat 9901 cgctaaatat gagaacttgc tgcgggaaca cggagcagac aacatccaga ttcaaaatcg 9961 aggtaaacgt cgtcttgctt atgaaattaa taggcaaaga gatggcattt acatacaaat 10021 caattacact ggacctggca acatgattgc tattttggag cgctccatgc gtctgagcga 10081 ggaagtgatt cgctacctga caatgaagca agaggtcaaa gaggcacaag ctgaggcaat 10141 aactccagca gcctagaggc ttttgattca ttaattcttc acgcggctct gttgccctca 10201 agtttggtga tcaagtgaga ggaaatgagc cgcgtcagtg tatctgaggt acctagaatt 10261 tttttcaaca caatctgaaa gtcatggtat aatagttagc gcctaacacg accgtataga 10321 caactcaaat gaagcggatt ccgcaaaata atttcgctaa ttttactgat aaaaatatgt 10381 caatagacaa ctgtctaatt gccgcagtcg catatgaaaa accaaaagtt ctgaatggga 10441 gctgtaagcc actgtgagtc aaaatgatac ctcaattcaa gttatctccg cagaagcctc 10501 gaagctacgc caagaattgc agtttcggga tcagctagta caacaactgt ctcaagaact 10561 cttccgactg gtgaagggca acactagctt tatgccccaa ccagaggcat ctgaccgtta 10621 tcacactcag ctgcaagaat tacgagaaca gctacaagct gtggagcagc aggtaacctt 10681 ttatcaagag caaatctcag cgcgtgacgc tgagatttat caactgcggc agtctgtgca 10741 agaactgagc gatcgcagtc ggatgcttga gcaagtcgta caagagttgc ctcaaatcta 10801 tcgtcgtaag ttcgaggagc gcatgactcc tgtcagagaa aaagtagcaa ttctacaacg 10861 cgaaaatcgc caactccagg cggaacttca gagtgtgagt taccgtctag cgttgaaaac 10921 ccgtactgct 
tctcacagtg ggatagattt gccaaatttt tcccggacag catcccctca 10981 aaataatatt tccacgagca atgcgtaaag tgttagtagt gatcgagtcc gatgggagag 11041 aaataataca atctcccacc gtaagcagga tgttatctgc aatagagatg gcgattgctg 11101 aagacacgac gcaagcaacc tccgtacaag tggttgctgt gaataatctg acttctattg 11161 accaaggagt aggattccaa ccctgggatt tacaaacgtc ttccgaattg caggagatta 11221 tttattgccc tttaaccctc tgtttgccgg aaaatctggt agtgccgttt gaaggaatta 11281 ttaaagcttg tcgggatatt gctgggttac gtcacaaatt agcacaacac atacaagtac 11341 ccattggcga tggtagttat tggttgccag tggtcttgac ggcttatgga cctctttatg 11401 gtgaggctat tactttagca gaagaatcca atggaaaaaa gttaccagac aatttactgg 11461 cgtctgactt aacttattac cagcccctgc atttatcaga tgtgttgcgt caaagcttgt 11521 atcacatggc acataatctc ctgcaatttt tattagcacc accggcgaca tatttggtac 11581 agtttggact ggaaaaaagt gaaatttgtt ttgaccgtct ttggccattt cctactgcac 11641 ctgctttggc tagcgttggt gttcaaaagc cagatttatt cacgtgccat tggtattgtc 11701 taaaagcatt gccagtactc gacttgaaca tcattcccgt cgcccaatct ggtttcaagc 11761 gactgtaact aaactttttt cttgaacaac ggtgaaaatt tcttcgtaag tgtcaaatat 11821 ttcaaatact gaatctagtt gagtcagttc caaaactaac ctgacgggag cttgaacgtt 11881 gcaaagaaca aggcgacaac cgctttgacg tgccaacttt aagcttttaa ctagggaaac 11941 taacccagaa ctatccatga aatcaacttt ggctaggtca ataacccaca gttgattacg 12001 ttgaggaact atggcggaca tcttttcgct cagagcaata ccaccctgta aatctatacg 12061 tccttcgggt tgaaacaaaa cgacttgaca ttttgatgtg atagtcatga attttactgg 12121 gtagttgaaa cattaattac aaaaatgcag aaaacagaca cagctactga ttttctctga 12181 acagtcaaaa atgctgcttc attcagttat ctaggaactg gcagcgatgg gcttaagaca 12241 gagaacatca gtatttatgc cgtataattt gtcctgactt tacccaatat aggcataacc 12301 acatgataat ttcggtaatc accgtatact ttacaaaatt tttacaattt cctcatgaaa 12361 ttataaattt tatataaaaa gaaattgttt gagtctagta agtgtacgga tgtggtattt 12421 taggcatcaa tagtcatgaa tcaatggtaa ctattgattt ttgactatga attctggatt 12481 ttgtagaacg acgcaattga cacctggccc aaaaaagagt caacatagaa gaaaagtaaa 12541 aatttgtaaa gataacggag ctggatgtct cttgagttta tatcactaga aaacatccag 
12601 aaggttgctc atgaatatgg atattgggca atttttttgg gaatattgtt ggagaatcta 12661 ggcattcctc ttccaggtga aaccgttacc ttagtcggtg ggtttcttgc tggcagtaag 12721 gaactcagtt actggctagt tctggctgac gccattgcag gtgctgtcgt gggaggtatc 12781 tgtggttatt ggattggtag aactggtggc tggtctttgc ttgtgcgcct aggtaagttg 12841 tttagaattt ccgaagctag acttttgact ataaaagaac aatttagtga gaatgcttcc 12901 aaagcggtgt tttttggacg ctttcctgct ttgctgcgaa ttttggcggc accacttgct 12961 ggaattgttg agatgccctt cggtaaattc ttgatataca acttggcagg agcgatcgct 13021 tgggcaagta ttatggtgac acttgctttc tttgctggaa gaattgtttc cctcgaacaa 13081 ttagttgctt gggttagtca atttgcaatt ttagcattgg tgattctagc tgctgtgatt 13141 gctgtcccca tatggttgga gtcgcggcaa aaagaaggtg aggagatcga gagatgagga 13201 gtcactgcct tggggagggt aagctccccc aaggaaacct tccgtcggtg agtcgtatcg 13261 gcaatgccgt ccttgtagca agtgtcgggt aaagctgcgc tgctgaaccc ctgcggggtg 13321 gatgcagtcc cgcaccgacg tttccaacct ccttaactgg atttcccgtt gggcgttggg 13381 tttcccgaac tgtatgcaaa ctgccgtgga gttgttgagc tttctgttct tattccaaaa 13441 ggctagatat tgcacccatt aaatctacag gattaactgg tttgggcaag aaaatgtgga 13501 atcctgcctc cagtgcttta aatttgtctt cttctgtgct ataagcagtg agtgcaacag 13561 cgcgaatcct ctgcgtttgc cccatttcct tgagcctgac ttgacgaatt aaagcgtaac 13621 cactcgtttg tggcatccca atgtcactaa tgagtacatc aaatttgaat tcttccaaaa 13681 cttgtaaagc tgcattggcg gatgaaacag ttgtcacctt tgcgccaacg ccttgcaaaa 13741 tttcttgcag taggtccaaa gtatcagtat catcatcaac cacaaggatt tgtacgtcat 13801 ttagttgtga gtgatgagta gttgtttcac gacttacctt tggctgctgt gtattctctt 13861 gtataagtgg cagcttgaca gtaaaggttg ctccttttcc ttttccttga ctttctgctg 13921 taactgtgcc tccgtgcaat tccacgagat gtcgaacaat tgctaatccc aaaccaagtc 13981 cattattaga acgtgtcgtt gtactatcag cttgacgaaa gtgttcaaac acatggggta 14041 gaaactcaga attgatgccc tcaccagtat cgctgacttg aatgactgcc cagaagtcag 14101 cgccttgcca aaagttgtct gagccgaagt tttctgcgct tagaacttca ggaggttccc 14161 tccgttgaga agccactgcg ttggggtgag gcagccttgc tggtgggttt cccgccctag 14221 gtgactgtgt tcgcgtagcg tctttggagg agatacccac 
agaggggtct tgttccctgt 14281 taccctc // LOCUS NODE_2367_length_14264_cov_4.68527014264 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14264) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14264) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14264 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1193 /locus_tag="DP116_19675" CDS <1..1193 /locus_tag="DP116_19675" /inference="COORDINATES: protein motif:HMM:PF00805.20" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002785269.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19675" /translation="ANLSKTDLKFANFSRSNLSKANLIEANLSEAYLSCADLSEANFS RAILNKACLNCTNLSRTNLSKAELSLAVIEAANLQNADLSLAQALGSNFKGANLTGAC LQNWNTNCKTQLDDVECDYIYLKKGYGFEFNSRYPSDRMFQPGEFTRQFKQASEIEEL VFPYGITEFFQFFQERQQQYTDKVLAIQELEPKSDSSVAVRLEVLPKANQEDAKSFYE EQLQLVKASYTLQATTQAMELYKQHNAEIIELARLALGKPSINNVNVGATAMSNSEGF TNNLQGANIANFSNQMRDNARQQANQYNYSPETKSLADAAKEIQTLLDHLSQTYRTDT MTAKCAFANEVIQRIDNDPSLTQRILSAFSAGSISALEQFLNHPAASFVINALEDWQK TRVE" gene 1298..2152 /locus_tag="DP116_19680" CDS 1298..2152 /locus_tag="DP116_19680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017659333.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19680" /translation="MKVLAVYHNKGGVGKTTTVINLAAALSKKNKRVLVIDLDSQANT TFATGLVKFQDEIHDNLKDNYIYQVIASRNDYSISEVARKSEFTNPPFYVIPSHIDLM EHEQELIQQPQALTRLLKKLDEVRKQYDVVLIDTPPSLNLYARIALITADYLLIPSDL RPFANEGLRNVRRFVNDVNEFRDSIKKDPIEILGVIASKVGTSPKFVEYTLPKMIETV EKHYGFPVLNSKIFERRETSKAIERLAEVGDLLIPDPISILDYEPNSPAAEEFKDLAK EVMQLARI" gene 2172..2924 /locus_tag="DP116_19685" CDS 2172..2924 /locus_tag="DP116_19685" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19685" /translation="MELLTSIVDIDSIQVKSPPFAPAQKTTQIDALANTIIELGGLVN VPVVQQVSVDDYELISGYLEYYAYLKACELNPRLPDRITVFVSNTKNQAAIRQQLEIL QVIEDTKQNSSQSITPKQSEIDLQIKNLESSINNNNKIIFNALEQLKADLLATIEAKL PQPISPMDSFNRILEPETAFQVQRKLELFLGASKAKKVVVRLQEVSKGKKNQPFQRFS EILDILREQQKGRSQRLISEEKMIQIIDRWND" gene complement(3085..3675) /locus_tag="DP116_19690" CDS complement(3085..3675) /locus_tag="DP116_19690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861175.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_19690" /translation="MTVVKPKRFTIEEYHRLISLGFLTEVDKIELIRGELIQMAAKGT PHTFCTTRLCRQFDRLLGDRAVVRCQEPIILPSDSEPEPDAVIARGDEADYLAHHPYP EDILLVVEISDSTLTYDQTTKLTLYAEAGISDYWLVNLQARQVERYSQPYQNIQGEFN YLSKQISLANQSVSIPGFEDALLDLSRIFPEGTVGE" gene complement(3721..4920) /locus_tag="DP116_19695" CDS complement(3721..4920) /locus_tag="DP116_19695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873002.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AGE family epimerase/isomerase" /protein_id="PRJNA477356:DP116_19695" /translation="MEHNFKELAELYKNALLNDVLPFWEKHSIDWEQGGYFTCLDRQG KVYDTDKFIWLQNRQVWTFSMLYNQLEKRENWLKIASNGANFLAQHGRDADGNWYFAL TREGKPLVQPYNIFSDCFAAMAFSQYALASGEEWAKDVAMQAYNNVLRRQDNPKGKYT KTYPGTRPMKSLAVPMILANLTLEMEWLLPSETLENVLTATVQEVMSDFLDKERGLMF ENVAPDGSHIDCFEGRLINPGHGIEAMWFIMDIARRRNDTQTINQAVDVVLNILNFAW DSEYGGLYYFMDADGHPPQQLEWDQKLWWVHLESLVALAMGVKHSSAVGDRLTGRNAC QEWYDKIHDYTWSHFADPEYGEWFGYLNRRGEVLLNLKGGKWKGCFHVPRALYLCWQQ FEALSGN" gene complement(4934..5569) /locus_tag="DP116_19700" CDS complement(4934..5569) /locus_tag="DP116_19700" /EC_number="2.7.4.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456951.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dTMP kinase" /protein_id="PRJNA477356:DP116_19700" /translation="MKGKLIVFEGVEGCGKTSQIQLTQEWLQSFQTSVVVTRQPGGTE LGLYLRKLLLETGSHSIVDKTELLLYAADRSQHVEQVIKPALQDGAIVLCDRYTHSTI AYQGWGRGLDINLIHQLNTIATSGLESDLTLWFDVDVEVGLARKRGGGDIFDRIEKET IDFHRRVQQGYAHLADSHPEQIVRVDGSLSQEAVQQKIQAILTARLKAWID" gene 5741..5923 /locus_tag="DP116_19705" CDS 5741..5923 /locus_tag="DP116_19705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316366.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19705" /translation="MSNQQQADMKLAYQAAAEMTYIVAVGLSKVMEKQKKRPLISKRQ KRVKSKSESSIVGSTI" gene complement(6240..6761) /locus_tag="DP116_19710" CDS complement(6240..6761) /locus_tag="DP116_19710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951025.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19710" /translation="MRLLVSGVILSFGLSSLVLPESSLAQAQSSASETQINAMVEALR QAAPPKSGSKDDGYYSEWRVKPETLKGWSKNCLKKEVTPAQFDSDAGLAREVVSCITR RELSKQLAASGNNETAAVRGVACWWMTGNYTGCNSGFTADYVKKVAGYYQKPGSKPSS GTAKPVTSPSPKS" gene complement(7305..8084) /locus_tag="DP116_19715" CDS complement(7305..8084) /locus_tag="DP116_19715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_19715" /translation="MYLTWLDSNSWLIEIGGQRILLDPWLVDSLTFGGQNWFFKGSRS QERPIPENIDLILLSQGLEDHAHPPTLKQLDRNIPVVASPNAAKVVQQLNYTQVTTLA HGESFTLNQSVEIKATPGSLVGLNLVENGYLLKELESGLTLYYEPHGNHSSTLKEIAP VDVVITPLIDAALPLVGAFIKGNKYALEVAQWLQPQVMLSTAAGGDVTFEGLLNSFLQ IKGSVEEFRSLLEKNNLSTQVIDPKPGERFEVKLEKRVLNV" gene complement(8122..9672) /locus_tag="DP116_19720" CDS complement(8122..9672) /locus_tag="DP116_19720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008179114.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carotene isomerase" /protein_id="PRJNA477356:DP116_19720" /translation="MLGQKEVDFIIIGSGIGGLSCAALLARYGFNVTVCESHSIPGGA AHAFERDGFKFDSGPSLYSGLSYSPSANPLKQVLDAIGEELPCVTYDTWGCYLPEGYF DTSVGADQFCEVLKQFRGDEAVVEWRELQRVMEPLARAATALPPAALRLDVGAIMTVS RFVPSLFQHIADIGKLTGPFSRIMDGVIKDSFIRNWLDLLCFLLSGLPADGTSAAEVA FMFADWYRPGVVLDYPIGGSGALVNALVRGLERHGGQLMLNAHVEQILVEGKRAVGVR LRGGKQIRARRAVISNASVWDTLKLIPQEALPKKFQERQATPECDSFMHLHLGIDAQG LPSDLACHYIVVNNWENGVTAPQNVVLISIPSVLDPSLAPPGKHVIHVYTPGSEPYEL WEGMNRKSEEYPRQKQLRAEVMWQALERIIPDIRSRCDVTLVGTPLTHERYLRRHRGS YGPAIQAGKGFFPGSGTPVPGLLCCGDSTFPGIGLPAVAASGMMTANTLAPLHKHVQM LQDIGYLG" gene complement(10030..11124) /locus_tag="DP116_19725" CDS complement(10030..11124) /locus_tag="DP116_19725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318041.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19725" /translation="MRVSVLDKDGKPLMPTKPSRARRWLKEGKAKIVYNNLNVFCIQL LVEPSGYHQQSIALGLDPGKKFTGVGVQSVKFTLFMAHLILPFSDVTKKMSGRLILRR ARRGRRINRNVAFNNRAHRQKRFDNRKQNKLSPSIRANKEMELRVTKELVKLFPITQI TYEYVKARGDKGFSPVMVGQKVMLQWLEKIAPTNAQEGWQTSILRQQLGLTKDKKNKE KQTPETHAYDGIALAASNFMKFEKFHTANTRGHHWVGDVAITSAPFRVIARPNLFRRQ LHFENPVSDAPKNRKRKGGTVTPFGLRSGDLVKAEKAGKFYIGWVGGYTQTAKTKNVS VYDHNWHRLGQFSPSKVQSIKRSTRLCIKP" gene complement(11354..12658) /locus_tag="DP116_19730" CDS complement(11354..12658) /locus_tag="DP116_19730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196005.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serpin family protein" /protein_id="PRJNA477356:DP116_19730" /translation="MPRRYGVRSARRYLLTAASVVLMGVLGYCQFMSSSTKVVAESPV FQSEYEISQNTFKMDDEKLVAANTKFGFKLFSEILKTDADKNVFVSPSSVAIALAMTY NGASGSTQQAMAKALELQGLSLEQINSSNAVLKEFLENPDPKVQLTIANSLWARQNFP FKPEFLQTNQEFYKAEVSNLDFSDPGSPAIINNWVKEKTSGKIDKIVEEITPEQVLFL INAIYFKGSWTEKFDKNTTANYPFNLISGEQKQHPMMSQTGDYKYYETEEFQAASLPY GDNGRISFYIFLPKQNSSLTAFYQNLNTENWEQWMTKFSKREGFIRLPRFKMVYNIEL NQALKALGMEEAFSKKADFSAMSEEKLFIDTVQHKTFVEVNEEGTEAAAVTSVGVRTT SVQLKPEPFQMIVDRPFFCAIRDNQTGSVLFMGSIVNPESSD" gene 12909..13814 /locus_tag="DP116_19735" CDS 12909..13814 /locus_tag="DP116_19735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749643.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphodiester glycosidase family protein" /protein_id="PRJNA477356:DP116_19735" /translation="MHYRYKVYTRRFLLAIGIGLLGLLLSPLIFYGWRCFLRPSRTDM EQVLFRGIVYKRYPLSTPRPTMIHIVTIDLKTPGVKALVTPGEPKPTDRETSARKTSD FLKEFKLQLAINASFFHHFHEKSPWEYYPHSGDPSYPIGEAISNGYRYSPPEANFPML CFSAQNRAQILKSDKCPEGTTQGVAGNQLLVYRGQAIDDNSNDDKPYPRVAAAINREG TKLWLILVDGKQPLYSEGITIAELTKTVTDLGVYTALNLDGGGSTTLVIGTNNGSKVL NAPIHTRVPMRERPVGNHLGFFAAE" gene 13885..14067 /locus_tag="DP116_19740" CDS 13885..14067 /locus_tag="DP116_19740" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19740" /translation="MKENFEGIDDTQVKSSLQVESEKPDVVVQTDKSFLVREIEHSNK EQIRPIDLSKLEAFNE" BASE COUNT 4212 a 3154 c 2869 g 4029 t ORIGIN 1 atgctaacct tagcaaaact gatttgaagt tcgccaactt cagcagatcc aatctcagta 61 aagctaacct cattgaagct aaccttagcg aagcttatct gagctgtgcc gatctcagtg 121 aagctaactt cagcagagct atactcaaca aagcttgtct caactgcact aatcttagta 181 gaactaacct gagcaaagcc gaactcagtt tagctgttat cgaagcagct aatcttcaaa 241 atgcagatct gagtttggct caagcattgg gaagtaattt caaaggagca aatctcactg 301 gtgcttgctt acaaaattgg aataccaatt gcaagaccca actggatgat gtggaatgtg 361 attatattta cctaaaaaaa ggttatggct ttgagtttaa cagtcgctac cctagcgata 421 gaatgtttca acctggagaa ttcactagac aatttaagca agcctctgaa atcgaagaac 481 ttgttttccc ttatggaatc acagaatttt tccagttttt ccaagaacga caacagcaat 541 acactgacaa agtgcttgcc attcaggagc ttgaacccaa aagtgatagc tcagttgctg 601 ttcgcttaga agtcttgccc aaagctaacc aagaagacgc taaaagcttc tacgaagaac 661 agcttcaact tgttaaagca agttatacac ttcaggcaac tacccaagca atggagttgt 721 ataaacaaca caacgctgag attatagagc tagcgagact cgctcttgga aaaccatcta 781 tcaataatgt taatgtgggg gcaactgcca tgtcgaattc agaaggattc accaataatt 841 tacagggagc aaatatcgct aatttttcta atcagatgcg cgataacgct cgtcaacagg 901 ctaatcagta taactacagt ccagaaacta aaagccttgc tgacgctgcc aaagaaatcc 961 aaacactcct tgatcatctc tcccagacct accgcaccga tactatgaca gctaaatgcg 1021 cttttgcaaa tgaggtcatt cagcgtatcg acaatgatcc ctctcttact caacgcatcc 1081 ttagcgcatt tagtgctgga agtatttcag cactagaaca gttcctcaat catccagctg 1141 ctagctttgt catcaatgcg ctcgaagact ggcaaaaaac acgtgtagaa tgacatagca 1201 ttattacagc aatgcactct aggagtcata gatgagcaga tttcagacct caaagcgcta 1261 attcataaaa aagtatttat ttataaggag taatttaatg aaagtgcttg ctgtttatca 1321 taataaaggt ggtgttggaa aaacaacaac agtcattaat cttgccgcag cgctaagtaa 1381 gaaaaataag agagttttag taattgattt agacagccaa gccaacacaa cttttgcaac 1441 agggttggtt aaatttcaag atgaaataca tgacaatcta aaagataact atatctacca 1501 agtaatcgca 
tccagaaatg attattcaat ttcagaggtt gctcggaaat cagaatttac 1561 aaatccacct ttttatgtaa ttcctagtca tattgattta atggaacacg aacaggaact 1621 aattcagcag ccacaagctc taactagact tctaaaaaag ctggatgaag ttcgtaaaca 1681 gtatgatgtt gttctaattg atactccccc atcgctcaat ctctatgcac gaattgcgtt 1741 gattacagcg gattatcttt tgattccatc agacctccgt ccttttgcta atgaaggttt 1801 acgaaatgtt cgtcgttttg taaacgacgt gaatgaattt cgtgactcaa tcaagaaaga 1861 tccaatagag attcttggtg tgattgcttc taaagtagga acttcgccga agtttgtaga 1921 atatacactg cctaaaatga tagaaactgt tgagaaacat tatggatttc cagtattaaa 1981 ttcaaaaatt tttgagcgcc gcgaaacatc taaagcgatt gaaagactag cagaggtcgg 2041 tgatttgctg attcccgatc caatttctat cttggattat gaacctaatt ctccagcagc 2101 agaggagttt aaggatttag ctaaagaagt gatgcagctt gcacgtatct aataaattca 2161 aggcaataac aatggaactc ttaacttcaa ttgttgatat cgacagtatt caggtaaaaa 2221 gcccgccatt cgctcctgca cagaaaacaa ctcaaattga tgctttggcg aatacgatta 2281 tcgaattggg tggtttagtc aatgtgccag tagtgcagca agtaagtgta gatgattatg 2341 aactaatttc aggctatctg gaatattatg cttatctcaa agcatgtgaa ctcaatcctc 2401 gtctgccgga tcgtattaca gtctttgtat ctaacactaa aaaccaagca gctattcgcc 2461 agcaactaga aatattgcaa gtaattgaag atactaagca aaattcatct cagtccataa 2521 ctccaaagca gtctgaaatc gacttacaga ttaagaattt agagtcttcc atcaataaca 2581 ataataaaat tatttttaat gcgctcgaac aacttaaagc tgatttactt gcaacaatag 2641 aggcaaaatt gcctcaaccc atctctccaa tggattcgtt taatcgtatc ttagagccag 2701 aaactgcatt tcaggtgcaa cgcaagctgg aattgtttct tggtgctagt aaagcgaaga 2761 aagttgtggt gaggttacag gaagttagta agggtaagaa gaatcaacct ttccagcgtt 2821 tttctgaaat cttggacata ttgagggaac agcagaaagg tagatcgcaa agactaatat 2881 cggaagagaa aatgatacag attatagatc gttggaatga ttaatagctt gtactcgcct 2941 ttgattgctt attcaaagct atccaaaatt acagtgcgat tgatcgtaag aacattcagt 3001 gatgaaatcg cctgctaact caatgcctca cactagcgat acaacgcgac ttgccgtcaa 3061 gtatcgctac aatgctcaac caaatcactc accaacggta ccctctggaa aaatccgact 3121 caagtccaat aaagcatctt caaacccagg aattgacact gactgattcg ccaaagaaat 3181 ctgcttgctc agataattaa 
actcaccttg aatattttga taaggctgac tgtaacgctc 3241 tacttgacga gcttgcaaat tcacgagcca gtaatcagaa attccggctt ctgcgtatag 3301 agttaacttc gttgtttggt cataagtcaa tgtagaatca gaaatttcga caacaagcaa 3361 aatatcttcg ggatagggat ggtgagcaag atagtcagcc tcatctcctc gcgcaatgac 3421 tgcatctggt tcaggctcac tatctgacgg gagaatgata ggctcttgac aacgcacgac 3481 agctctatcg cctagcaatc gatcaaattg acggcaaagt cgagttgtac aaaatgtatg 3541 aggtgttccc tttgcagcca tttggatcaa ttctcctcga attaattcaa ttttgtctac 3601 ctccgtcaga aacccgagtg aaatcagtcg atgatattcc tcgattgtga atcgctttgg 3661 tttgacaaca gtcatgacgt ttattcagat atagcggctt ccactttaaa tttaataaat 3721 ttaatttcca ctcaacgcct caaattgttg ccaacacaga tacaacgcac gcgggacgtg 3781 aaagcaaccc ttccatttcc cacctttaag atttaacaac acttctccac gccgattgag 3841 gtatccaaac cactcaccat attctggatc agcaaagtgt gaccaagtgt aatcatggat 3901 cttgtcatac cattcctgac aggcattacg tcctgttagg cgatcgccta cggcagagct 3961 atgcttaacg cccatcgcca atgcaaccaa agattctaaa tgaacccacc acaatttctg 4021 atcccattcc agttgctgtg gcggatgacc atctgcatcc ataaaataat acaacccgcc 4081 gtactcacta tcccaagcaa aattcaggat atttagcacc acatcaaccg cttggttaat 4141 cgtttgagta tcgttgcgac gacgagcaat gtccatgata aaccacatcg cttcaatacc 4201 atgaccggga tttatcagcc gtccctcaaa acaatcaatg tgcgaaccgt ccggagcaac 4261 attttcaaac attaagccgc gttctttgtc aagaaaatcg ctcatcactt cctgaacagt 4321 tgcagtcaag acgttctcaa gcgtttcgct tggtagtagc cattccattt ccagagtcag 4381 attggctaaa atcatcggta cagccagaga tttcatcgga cgtgtaccag gatatgtttt 4441 ggtatacttg cctttggggt tatcctggcg gcgcaaaacg ttgttgtaag cttgcatagc 4501 cacatccttt gcccactctt caccagaagc aagagcatat tggctgaaag ccattgctgc 4561 aaagcaatca gaaaaaatat tgtaaggctg aaccagtggc tttccttcac gggtgagcgc 4621 aaagtaccaa tttccgtcag catctcgacc atgttgtgcg agaaaattcg cgccattgct 4681 agcaattttc agccaatttt cgcgtttttc tagctggttg taaagcatgg agaaagtcca 4741 cacctggcgg ttttgcagcc agataaattt atctgtgtca taaactttcc cctgacgatc 4801 aagacaggtg aaatagccgc cttgctccca gtcaattgag tgtttttccc aaaatggaag 4861 tacatcattg aggagcgcgt ttttgtaaag 
ttcagcaagc tctttaaagt tgtgctccat 4921 aaatatcctt tttttagtca atccaagctt ttagcctagc agttaaaatt gcctgaatct 4981 tttgttgtac agcttcttga ctcaagctac catcgacacg aacaatttgc tctgggtgag 5041 agtcggctaa gtgtgcatat ccttgctgga cgcgacgatg aaagtctatt gtctcttttt 5101 caatccggtc gaatatatcc ccgcctcctc gttttcgagc aagtcccacc tcaacatcaa 5161 catcaaacca caaagttagg tcactttcta acccggatgt cgcaatggtg tttagctgat 5221 gaattaaatt gatatctaaa ccacgtcccc agccttggta ggcaatggta gagtgagtgt 5281 agcgatcgca caaaacaatt gccccatctt ggagagctgg cttgataact tgctcaacgt 5341 gttgcgagcg atcggcagca tacaaaagta attcggtttt atctacaata gagtgagagc 5401 ctgtttctaa caaaagtttt cgcagataca accctaactc tgttcctcct ggttgacgag 5461 tcacaactac cgaagtctga aaactttgta accactcctg cgtgagctgt atttgactag 5521 ttttaccgca accttccact ccttcaaata caattaattt acccttcatt ttttttataa 5581 ttttttcaaa tttttattat ctcatcatga ctagcagttt acctagtcat cctcgatatc 5641 acccgattgg gtgacccaaa tggatgaagt ctagttaata acaaacattg tattgtttgt 5701 atcaaagagt aataaacagt agcaataaac aaaattaact atgagtaacc aacaacaagc 5761 agatatgaaa ctagcatatc aagcagcggc agagatgaca tacattgtag ccgttggttt 5821 gagtaaagta atggaaaagc agaaaaaaag accactcatt agcaagcgac aaaagcgagt 5881 caaatcaaaa tcagaatctt ctatcgttgg aagtactatt taatacagat ttttaatcat 5941 acagtgattc tgttgttgca ccccaccaag gtaggggcac agcatcctgt gccttggata 6001 aatcctgctc aaagttggtc aaccttgttg attctagata gattatatcc ggctttccct 6061 tggcattcag gctcaacaaa acatgacact ctgcatcaaa aacgagtatt taagttgagc 6121 attttgccca agcgcgtagc gttgccgctc ttgcgatagc cgtggcgcaa gccataggca 6181 ataggttgac taacgcgtga gcaacttcac tcaaatcaac tacccacacc aaactcgaat 6241 caagattttg gagatggact ggttacgggt ttagcagtac cagatgatgg tttggaaccc 6301 ggtttctggt aatagcctgc aacttttttc acatagtcag cagtgaagcc gctgttgcag 6361 cctgtgtagt taccagtcat ccaccaacaa gcaacaccac gcacagcagc cgtttcattg 6421 ttaccactag cagccaactg cttacttaac tcccggcgtg taatacatga gaccacttcg 6481 cgtgctaacc cagcatcact gtcaaactgc gctggtgtca cttccttctt gagacaattt 6541 tttgaccagc ctttaagggt ttctggtttg actcgccatt 
cactgtaata tccatcatct 6601 ttactaccac ttttcggcgg tgctgcttgc cgcagtgctt ctaccatcgc gtttatttgg 6661 gtttccgacg ctgacgactg tgcttgagct aaagaagatt ctggcaaaac gagtgaagac 6721 aatccaaagc tcagaatgac tccacttact agtaatcgca ttaattttct cctcacgacg 6781 ttaaacttct ccgttatagc agaattcgta aaactgaaag cttcgagggt tgcataagtc 6841 ttttttaaac tttgtttcaa cctaggaatc attgcataaa aatatcgata tcttaaagag 6901 agcgctttat tgaaatcaag aaagtggaga tagcaaagcg aaaatatcat gaactttgct 6961 ctggtttatc ttgcatttaa acttgttttt tcgcaataag taaaagtctt ggttggaggt 7021 gagcacaatg acaggaatgc tttggtttgc aggtttttgc tttctttggc tgattggtat 7081 aaccctcata gcagaaattt ggttttttga ggaagagcag gagttttgac gacaaattgt 7141 gatgttttct ttgacagaag atggtgaaaa agccccacag acttgttccc agtctcagac 7201 tgggaataga gttatcgagg ctccgcctcg cgtttaggag aaccagaggc tccgctttag 7261 agttggtatg acctggctga gccaggtcac gaggggttga ggctttaaac attcaacact 7321 cgtttttcta acttcacctc aaagcgttct cccggtttag gatctatcac ctgcgttgat 7381 aaattgttct tctctaataa cgaacgaaat tcctcaacgc ttcccttaat ttgaagaaaa 7441 gagttgagta atccctcaaa agtgacatct cctccagctg cagtcgagag catgacttgg 7501 ggttgtaacc actgagcaac ttctaaggca tatttattcc ccttgataaa tgcgccaacc 7561 aaaggtaaag ctgcgtcaat taacggagtg ataaccacat ccactggagc aatttcctta 7621 agtgtggagg aatgatttcc atgaggctcg taataaagtg tcaaaccgct ttccaactct 7681 ttgaggagat aaccattttc taccaaattt agaccaacca gtgagccggg agttgcttta 7741 atttcaacgc tttgattcag tgtgaaactt tctccatgag caagtgttgt gacttgagta 7801 taatttaact gctgtacaac tttggcagca ttgggagaag ctacaactgg aatattgcgg 7861 tcaagctgct tgagtgttgg tggatgagca tggtcttcta aaccttgaga tagcagaatt 7921 aggtctatgt tctctggtat tgggcgttct tgcgatcgcg agcctttaaa gaaccaattc 7981 tgaccgccaa aagttaacga atcaactagc caaggatcaa gaagtatcct ttgtcctcca 8041 atttcaatca gccaagaatt gctgtctaac caagttaaat acataaactt ttgtaaacat 8101 aacacctaaa ttaattatga actaaccaag gtaaccaatg tcctgaagca tttggacatg 8161 cttgtgtaaa ggtgcgagtg tatttgcagt catcatccca ctagcagcca cagctggtaa 8221 accaatacca ggaaaagtcg agtctccaca acacaaaagc cctggaacag 
gtgtaccaga 8281 accaggaaag aaaccttttc cagcctgaat tgcaggacca tatgaacctc tatggcgtcg 8341 caaatagcgc tcgtgtgtga gtggtgtacc aaccagtgtg acatcacaac gtgagcgaat 8401 atctggaata attcgttcta aggcttgcca cataacttct gcacgtaact gcttttgtcg 8461 tggatactcc tcactttttc ggttcattcc ttcccacagc tcataaggtt cactaccagg 8521 agtgtaaacg tgaatgacgt gctttccggg tggtgctaat gagggatcta aaactgaggg 8581 aatagatatc aagacgacgt tctgaggagc tgtcacaccg ttttcccaat tattaacaac 8641 tatataatga cacgcaaggt cagagggcaa tccttgtgcg tcaataccca ggtggagatg 8701 cataaaacta tcgcattcag gtgttgcttg tcgttcctga aacttttttg gtaatgcttc 8761 ttgtggaatc aacttgagcg tatcccatac cgatgcatta gaaatgactg cccgacgcgc 8821 tcttatttgt ttcccaccac gcaaacgcac acctactgca cgctttcctt ctacaagaat 8881 ttgctcaaca tgagcgttca gcatcaattg accaccatgt cgttccagtc cccgtactaa 8941 ggcgttcact aaagcaccac ttccacctat aggatagtca agtacgacgc ctggtcgata 9001 ccaatctgca aacataaacg ctacctctgc ggcacttgtt ccatctgctg gtagtccgga 9061 gagaagaaaa cacagcaagt ccagccagtt gcgtatgaaa gagtctttga taacgccgtc 9121 cataattcgg ctaaacggtc ctgtgagctt gccgatgtct gcaatatgtt ggaataaaga 9181 cgggacaaat ctgctcacag tcataatcgc gccaacatca agacgtaagg ctgctggcgg 9241 tagtgcagtt gcagcacgcg ccaatggttc catcacacgt tgcagttcac gccattcaac 9301 cacagcctca tcgccacgaa actgttttag cacctcacaa aactggtcag caccaaccga 9361 ggtgtcaaaa tacccttctg gtaaatagca cccccacgta tcgtaggtga cgcaaggtaa 9421 ctcctcaccg attgcatcta gtacttgttt gagaggatta gcagagggac tgtaggataa 9481 gccagaataa agcgatggac ctgagtcgaa tttgaatcca tcacgctcaa aagcgtgcgc 9541 tgcaccacct ggaattgagt gactttcgca cactgtgaca ttaaagccat accgtgccag 9601 aagagctgca cagcttaaac cgccaatacc actaccaatg atgataaaat ctacctcttt 9661 ctgacctaag atagatgctg tattagaaga agtttgataa accactgagc gtaccttgtg 9721 ctagtcttgc gtcattcaag gatattgtaa tgcaacgtaa caattcttaa aactgcacat 9781 aatgccaaaa acccggttgt tgtaaccggg ttttccatag agattggggt tatatcccct 9841 gcggctcata tagatgttta gttataaagc actcaactaa caatatctac aatcagaggt 9901 ttgtctgttc cggaaaataa gaagcttttt tttcaacaga tttgcgggag tccccaggct 
9961 caacggaacg aagtggagta gcctgggcga ggttgccggg acgacggagg aacggagtcg 10021 tccaatgact tacggcttga tgcataatcg cgtacttcgt ttgattgact gtactttcga 10081 cgggctaaat tgtccaagcc tgtgccaatt gtggtcatag actgaaacgt ttttggtttt 10141 agctgtttgg gtataaccac caacccaacc gatataaaac ttgcctgcct tttcggcttt 10201 aaccaaatca ccagagcgca aaccaaacgg tgtcactgtc ccaccttttc gctttctgtt 10261 tttgggtgca tccgaaaccg gattctcaaa atggagttgg cgacgaaaca agttaggacg 10321 tgcaatgaca cggaatggtg cagacgtaat cgccacatct cccacccagt gatgtccacg 10381 agtattagca gtgtggaact tctcgaattt catgaaattg cttgctgcta gcgcgattcc 10441 atcataagcg tgagtctcag gagtttgttt ctccttgttt ttcttgtctt tagttagtcc 10501 tagctgttgt ctaaggattg aagtttgcca accttcttgc gcattagtcg gtgcaatttt 10561 ctccaaccat tgcaacatta ctttttgacc aaccattacc ggactaaacc ctttgtcacc 10621 tctagcctta acgtattcgt aggtaatttg ggtaatagga aatagtttta ccaattcttt 10681 agtgactcgc agttccatct ctttgttagc gcggatgctg ggtgacaatt tattctgttt 10741 acggttatcg aaacgctttt gacgatgtgc tctgttgttg aatgcaacat tgcggttgat 10801 gcgtctgcca cgtctagcac gtcgtaggat cagcctccct gacatctttt ttgtgacatc 10861 tgagaatggc aagatgagat gcgccataaa caaagtaaac ttgactgatt ggacaccaac 10921 acctgtaaac tttttgcccg gatctaatcc taaagcaatt gattgttggt gataaccaga 10981 tggttcgact aatagctgaa tgcagaaaac attcaggttg ttgtatacaa tcttggcttt 11041 gccttccttg agccaacgtc tagcacggct aggttttgtt ggcataagtg gttttccgtc 11101 tttgtctagt actgatactc gcataaagtg ataaacctcc gagtaaagtt atttgtccct 11161 tagcccaact tgttcagcat gtcctttctg caagcgctta caaccagtaa gcttggagat 11221 aatccgaact agggaaacat tcggaagtct gtaacagaac aagtctcatg ggctagtcat 11281 ctcttgcgtt gcaagctggt tcttgcaagc ccctaccgag cgccgttagg caggtagggg 11341 tagttgacag cacctagtca gaagactctg ggtttacaat tgaacccata aacaaaacgc 11401 ttcctgtctg attatcgcga atcgcacaga aaaaaggacg gtcaacaatc atctggaatg 11461 gttctggttt caactgcaca gaagttgtgc gtactcccac tgaagtcact gctgctgctt 11521 ccgtaccttc ttcattcacc tcgacgaaag ttttatgttg aacagtgtca atgaagagtt 11581 tttcttcact catcgctgaa aaatctgctt tcttgctgaa 
agcctcctcc atacctaaag 11641 ctttcagcgc ctgattgagt tcaatgttat agaccatttt aaagcggggt aagcgaataa 11701 agccttcccg tttgctgaac ttagtcatcc actgttccca gttctcagta ttcaagtttt 11761 gatagaaggc tgttaggcta gagttctgtt taggcaggaa aatataaaag ctgattctgc 11821 cattatcgcc gtaaggtaaa ctagctgcct gaaattcttc cgtttcgtag tatttatagt 11881 cacccgtttg cgacatcatc gggtgttgtt tctgctcgcc agatatgagg ttaaagggat 11941 aattagctgt tgtgttttta tcgaatttct ctgtccagct ccctttgaag tagatggcgt 12001 tgatgagaaa tagcacttgc tcgggtgtaa tctcttcaac aatcttgtct atcttcccgc 12061 ttgtcttctc ttttacccag ttattgataa tagcaggtga acctggatcg ctaaagtcta 12121 aattgctcac ctcagctttg tagaattcct ggtttgtctg caagaattct ggtttaaaag 12181 gaaagttttg ccttgcccaa agcgagttag caatagttag ttgcactttt ggatcggggt 12241 tttctaagaa ttcttttaag accgcgttag aggagttgat ttgttccaaa ctcaacccct 12301 gtaactcaag tgctttagcc attgcttgtt gtgtgctacc actagcgccg ttgtaggtca 12361 tggcaagggc gatcgctaca cttgaagggg agacaaaaac attcttatca gcgtctgttt 12421 tcagaatttc tgaaaacagt ttaaagccaa acttagtgtt agcagcaacg agtttttcat 12481 catccatctt gaatgtgttt tgcgatatct catattctga ctgaaacaca ggagattcag 12541 caaccacctt cgtgctacta ctcatgaatt ggcagtaccc taatacaccc atcagcacaa 12601 cgctagcagc agtcaaaagg taacgtcttg ccgaacgcac gccgtaacgt cttggcataa 12661 aattttacgt aaaattaata aatatatggt taattttacc atctttattc ccaaagtttt 12721 tattttgact acggtatttc cggcaacaaa accatacaaa agtcctgata tcgcaacaaa 12781 ctcaatctga atatcgacga agtaaactag gtaaacaact cttcattatt ttgtatcaga 12841 ttgtaatcga gaagaagttg ttatcacagg tgcactatcg ccaagaatag ctttacttcc 12901 atcaagcaat gcattatcga tacaaagttt acacacggcg attcttacta gctatcggga 12961 taggtttact aggtctgcta ttgtcaccac tcatattcta cgggtggcgg tgttttctgc 13021 gtccttcccg cacggatatg gagcaagttt tgtttcgtgg aattgtttac aaacgttatc 13081 ctctctcgac accacgtcca acaatgatcc acattgtcac cattgactta aaaacaccag 13141 gagtgaaagc acttgtcact ccaggagaac caaaaccaac ggatagagaa acaagcgcac 13201 ggaaaacctc tgattttctc aaagaattca agctgcaatt agcaattaat gctagctttt 13261 tccatcattt ccacgaaaag 
tccccttggg agtactatcc tcacagtggt gatccttcct 13321 atccaatagg cgaggctatc tccaatggat atcgttattc accacccgaa gcaaatttcc 13381 ccatgttgtg cttttctgct caaaatcgtg ctcaaatctt gaaaagtgat aagtgtcctg 13441 aaggtacaac tcaaggtgtt gcagggaacc agcttttagt ttatcgcggt caagcaattg 13501 atgataactc aaatgatgat aagccttatc cacgtgttgc agctgctatc aatcgagaag 13561 gaacaaaatt gtggctgatt ttggtagatg gaaaacaacc actttacagc gaaggaatca 13621 caattgctga attgacaaaa actgtcaccg atttgggggt ttacacagca ctgaatttag 13681 atggaggtgg atcaacgaca ctggtgattg gaaccaataa tggttcaaag gtgttaaacg 13741 ctccaataca taccagagta ccaatgcgtg agcgtcctgt tggtaatcat ttgggatttt 13801 ttgcggcgga gtgagttcta tactttcctt cttgcaccct ttgggaattc taattgcaag 13861 atagtgcaaa tagaggaaag ctttatgaaa gaaaattttg agggaattga tgacacacaa 13921 gtgaaaagtt cactgcaagt cgagagtgaa aagccagacg ttgtagtgca aacagacaag 13981 agtttccttg taagggagat tgaacactcc aacaaagaac aaatcagacc tatcgattta 14041 agcaaacttg aagcatttaa cgaatgaaag accctgaacc aatgtaaccg taagcttaca 14101 aaatcaagac aacaagaaag gaaatctttc cctataacta aactacttga aaaggggagc 14161 atcccaattt tgcaaaaata aagggaaaaa gataattcgc gagcaataat cgcgaattat 14221 cttggtgatt caatcttctg gttcttcttc ctcttgctca tcac // LOCUS NODE_2378_length_14223_cov_4.930901 14223 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14223) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14223) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14223 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(35..3007) /locus_tag="DP116_19745" CDS complement(35..3007) /locus_tag="DP116_19745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015117618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19745" /translation="MVDGLFKRGSRLIILLLGLWLFFDLTCHLRAEIFWFSEVGYLRE FLLRLLTQLIVWAVVFFTSVGFLFSNFLVTSRLKYSPSRQGDGTIPNSKYVLRTRYAN KVQNLKSPTSHLATQDSELGLRLLLPVVITLSALAGLVFIFYAQKALNLWFPNVQLPT TKLALTPWLQQVLQQMQIQQIAIPIVQSILLLVLTIGIVIYCEFCLRAIALLICLLLS FLLSTHWVTVLEYFHVTSFNSTEPLFHQDISFYVFSLPIWELLAFWLVGLFLFAICAA TLIYLCSANSLSEGRFTGFSTQQRLHLKALSSLFMLAVALHYWIARYKLLYSTSGVIY GASYTDVNVLLPIYTGLSFLAVAIAIYLLLQIIILSRARKTSFKLILYPPQLIYALGL YIFVAAVSGEILPSAVQRFVVQPNELSRERPYIERTIALTREAFNLDAIEARTFDPRG QLTASNLQENDLTIRNIRLWDTRPLLESNRQLQQIRPYYKFPGADIDRYTLKREQTDE KQQTIIAARELDYGDVPQQAQTWVNKHLIYTHGYGFTLSPVNVVAPGGLPDYYVKDIG VNNTDNQGSLLITNESVRSSVPINQPRIYYGEITNNYVMTGTKTQELDYPSQNDNVYN VYDGRGGISIGAMWRRLVFAEYLKDWQMLLTRNFTPQTKLLFRRNIKERVRAIAPFLR YDSDPYLVATDGEGTDIKGDKTYLYWIIDAYTTSDRYPYSDPGKNKFNYIRNSVKVVI DAYNGTVNFYVTDPTDPIIHTLGAIFPKLLKPLDKMPVALRSHIRYPQDFFSIQSERL LTYHMTDPQVFYNQEDLWQIPDEIYGSKQQQVQPYYLIMKLPQAQSEEFILLLPFKPV QRANLIAWLAARSDGQEYSKLLLYEFPKQQLVYGTEQIEALINQDPVISQQISLWNRQ GSRAIQGNLLVIPIERSLLYVEPLYLEAEQNSLPTLVRVIVAYNNRIVMAETLEQALA AIFQEKKPTTPPIVRPVQ" gene complement(3231..3806) /locus_tag="DP116_19750" CDS complement(3231..3806) /locus_tag="DP116_19750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113781.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_19750" /translation="MNSKKVKVQSLYTVTDEELMLTSSQNPELRFERNADGTLETMPP TGGISGNREAKVITYLLTWVESQNLGEVFSSSTGFRLPNTAVRSPDAAFVAKERLSEG WDEQEDKFINLAPDFVIEIRSKNDSLEKLKAKMEEYITNGVKLGWLIDRQNQQALVYR LDGSITQYPATAILSGDDVVPGFTLPLAKLL" gene complement(3951..5248) /locus_tag="DP116_19755" /pseudo CDS complement(3951..5248) /locus_tag="DP116_19755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652401.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="glutamyl-tRNA reductase" assembly_gap 4744..4753 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(5334..6374) /gene="glpX" /locus_tag="DP116_19760" CDS complement(5334..6374) /gene="glpX" /locus_tag="DP116_19760" /EC_number="3.1.3.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class II fructose-bisphosphatase" /protein_id="PRJNA477356:DP116_19760" /translation="MENTLGLEIIEVVEQAAIASARWMGKGEKNTADEVAVEAMRERM NKIYMRGRIVIGEGERDDAPMLYIGEEVGICSQPDAKNFCNPDELIEIDIAVDPCEGT NLVAYGQPGSMAVLAISQKGGLFAAPDFYMKKLAAPPAAKGKVDINKSATENLKILSE ALDRGIDELVVVVMKRERHNDLIKEIRDAGARVQLISDGDVGAAISCGFAGTNIHALM GIGAAPEGVISAAAMRALGGHFQGQLIYDPAIVKTGLIGESKQANLDRLKSMNINDPD KVYDAHELASGETVLFAACGITTGNLMQGVRFFQGGARTQSLVISNQSRTARFVDTIH MFDKQPKELQLH" gene 7088..7315 /locus_tag="DP116_19765" CDS 7088..7315 /locus_tag="DP116_19765" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19765" /translation="MSIEISIALFSYLLGVVIQWVGFRPKFRELDEYTNFPIIWAAKI VTMIGCFMEALTWPYSLILENELILEKNETK" gene 7553..7867 /gene="grxC" /locus_tag="DP116_19770" CDS 7553..7867 /gene="grxC" /locus_tag="DP116_19770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194728.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutaredoxin 3" /protein_id="PRJNA477356:DP116_19770" /translation="MLNSLNTLLGRHPERIKANVEIYTWQTCPYCIRAKMLLWWKGVK FTEYKIDGDETARAKMAERANGRRSVPQIFINNQHIGGCDDLYQLDTQAQLDPLLGQS AV" gene 7886..8371 /locus_tag="DP116_19775" CDS 7886..8371 /locus_tag="DP116_19775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746103.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA-specific adenosine deaminase" /protein_id="PRJNA477356:DP116_19775" /translation="MSLEYTEYLIHRQWMSRALELAQIAGDADEVPVGAVVIDSSGSL IAEGENRKERDKDPTAHAEIIAIKAAAQKLRTWRLNECTLYVTLEPCPMCAGAIVQAR IRLLVYGVDDPKTGAIRTVVNIPDSAASNHRLRVIGGILESSCRQQLQAWFVNRRHFS N" gene 8483..9256 /locus_tag="DP116_19780" CDS 8483..9256 /locus_tag="DP116_19780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210252.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="1-acyl-sn-glycerol-3-phosphate acyltransferase" /protein_id="PRJNA477356:DP116_19780" /translation="MISLNSPSDTPCEHLATTPETANVTHITTSEISPWLSPLVYFLG GHVLLPSFFGSIRVTGQKNLPQTGPVILAPTHRARWDSLLLPYVAGRCVTGRDLRFMV TVTECQGWQGWFVRRLGGFSVDPQRPSITTLRHSIELLERGEMLVIFPEGGIFRDRKV HPLKSGIARLALSAESSHPELGIKIVPIGINYSEPYPTWGTDVSIHIGSAIKVADYTK GSLKQDAKRLTGDLTKALQKLSYQESQITPRAFAEIANS" gene 9673..10251 /locus_tag="DP116_19785" /pseudo CDS 9673..10251 /locus_tag="DP116_19785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456078.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 10811..11062 /locus_tag="DP116_19790" CDS 10811..11062 /locus_tag="DP116_19790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="BolA family transcriptional regulator" /protein_id="PRJNA477356:DP116_19790" /translation="MMSPQQVEEMIKIELPDALVQVQDLTGGGDHYQVTVVSSQFANK GLVQQHQLVYGALKQAMSSEAIHALALKTYTPDAWENSH" gene 11178..11501 /gene="grxD" /locus_tag="DP116_19795" CDS 11178..11501 /gene="grxD" /locus_tag="DP116_19795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456074.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Grx4 family monothiol glutaredoxin" /protein_id="PRJNA477356:DP116_19795" /translation="MTPELKERIDNLVKQNKILVFMKGTKLMPQCGFSNNVVQILNTL GVPFQTVNVLDDYEIRQGIKDYSNWPTIPQVYINGEFVGGSDVLIELYQKGELQQIVE VALAS" gene complement(11745..11987) /locus_tag="DP116_19800" CDS complement(11745..11987) /locus_tag="DP116_19800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140429.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19800" /translation="MLQDTQTIRYYQRLTDAFVELWNRGYRMDDMRMYLDGYIAALQH GNAIEPYLIHRLEEEANRYLHDVSNFTMTQPQLDYY" gene 12546..13313 /locus_tag="DP116_19805" CDS 12546..13313 /locus_tag="DP116_19805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652928.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_19805" /translation="MGSVCIEIIEGNPHLRSLLGWHLQQLEYRVHQAASIYQAREVFL SHQPTLVILDADLPDGDGIEFCRWLHRQQQPLILMLSARNNEADIVAGLKAGADDYLC KPFGMQEFLARVEALIRRKRTPVAPAYLDYGALQIDLVQRRVRILGEFIDLTPQEFSL LYVLAQAGGVPLSRSELLRRAWPDAIDNPRTIDTHVLSLRKKVELDPRQPSLIQTIRN VGYRFNTEILNANIPNSSTKLPKERFNNQRSMLSTQR" gene complement(13327..14106) /locus_tag="DP116_19810" CDS complement(13327..14106) /locus_tag="DP116_19810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746094.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methionine ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_19810" /translation="MGKLSLKAQLWLEQVSLFASLKTQSQDNQLGYPILQDISFEVFE GERVAIVGPSGAGKTYLLRLLNRLSEPTSGKIYLQNQEYSQIPVLQLRSIVTLVSQEP KLLGMTVKEALAYPLVLRGLPKQTIQQRVSHWIEQLQIPDEWLTRTEVQLSLGQRQLV AIARALVIQPKILLLDEPTSALDVNQAEHLVEIFSQLAQNYQTTVVMVNHQLELVEKF CTRLLCLQQGRLLVNQEASEISWLNLRDKLTQAKAQDEFEI" BASE COUNT 4059 a 3038 c 2948 g 4168 t 10 others ORIGIN 1 agggtgaggt tttctctcgt taggaggagg ttgcttactg aacgggacgc acaattgggg 61 gagttgtagg tttcttttct tgaaaaatgg cagcaagtgc ttgttccaaa gtttctgcca 121 tgacaattcg gttattgtaa gcaacaatca cccttacaag tgttggcaag ctgttttgtt 181 cagcttctaa ataaagaggt tcaacataca acagggaacg ttctatggga atcactaaca 241 aatttccttg aattgctctt gaaccttggc gattccacag agaaatttgc tgcgaaataa 301 ctggatcttg attaatcaaa gcttcaattt gttctgttcc ataaaccaac tgctgcttag 361 gaaactcgta aagcaacaac ttgctgtatt cttgcccatc ggaacgtgct gctaaccaag 421 caattaaatt tgcgcgttgt acaggtttga agggtagtag taagatgaac tcttccgatt 481 gtgcttgagg tagtttcata atcaggtagt aaggctgtac ctgctgctgt ttgctaccgt 541 aaatttcgtc ggggatttgc cacaagtctt cttgattgta gaaaacttgt gggtctgtca 601 tgtgatatgt taataatcgc tcagattgaa tgctgaaaaa gtcttgtgga tagcggatat 661 ggctgcgcag agcgactggc attttatcca agggttttag taatttgggg aagatagccc 721 ctaaagtatg aataattgga tctgttggat cggtaacata gaaattgaca gtgccgttgt 781 aggcatcaat gacaactttt acagaattac ggatatagtt aaatttattt ttgcctgggt 841 cggaatatgg gtagcgatcg ctcgtagtgt aagcatcaat tatccagtaa agataagtct 901 tgtcaccttt gatatctgtt ccttcgccat ctgtagcaac taaataaggg tcactgtcgt 961 agcgtaaaaa gggagcgatc gcccgcactc gttctttaat attgcgacga aacagcagct 1021 ttgtttgcgg tgtaaaattg cgcgtcagca gcatctgcca atctttaaga tattcagcaa 1081 acaccaagcg ccgccacatc gcaccaatag agattcctcc acgtccatcg tagacgttat 1141 agacgttatc attctggctg ggatagtcta attcttgggt ttttgtcccc gtcatcacat 1201 aattattggt gatttcgcca taataaatgc gcggttgatt aataggaaca cttgagcgaa 1261 ctgattcatt cgttatcagc aacgaaccct ggttatctgt gttattaact 
ccaatatctt 1321 tgacgtagta atcaggtaat ccgccaggtg cgacaacatt tacggggctg agtgtaaacc 1381 catagccatg agtgtagata agatgtttgt tcacccaagt ttgtgcttgt tgtggtacgt 1441 caccgtagtc tagttcccgc gctgcgatga ttgtctgttg cttttcatct gtttgttctc 1501 gtttcagggt atagcgatca atgtcagccc caggaaactt gtagtaaggt cggatttgtt 1561 gcaactgacg gttgctttct agaagaggac gtgtatccca caaccggata ttgcgaattg 1621 tcaggtcatt ttcttgcaaa ttactcgcag ttagttgacc tcgggggtca aaagttcttg 1681 cttctatggc atctaaatta aaagcttctc tggtgagggc gatagtacgc tcaatgtatg 1741 ggcgttcacg actcagttca ttcggttgca caacaaagcg ttgtactgct gagggtaaaa 1801 tctctccaga aaccgccgcc acaaatatgt acagtccgag tgcgtaaatg agttgaggtg 1861 gatagagaat gagtttaaaa cttgtttttc ttgctctaga taaaataatt atttgcagta 1921 gtagatatat ggcgatcgcc actgccaaaa aactcaaccc agtgtaaatt ggcaacaaga 1981 cattaacatc ggtgtaacta gcaccataaa tcaccccgct ggtggaatac aaaagtttat 2041 atcttgcaat ccaataatgc agggcaactg ctaacatgaa taagctactt aaagctttta 2101 agtggagtcg ctgctgtgta gaaaaccctg taaatcttcc ttcactgaga ctatttgccg 2161 aacacaagta aatgagcgtt gcggcgcaga tggcaaagag aaacagtccc actagccaaa 2221 atgccaacaa ttcccaaatc ggtagagaaa atacataaaa gctaatatct tggtgaaata 2281 gcggttctgt gctgttaaaa ctagtgacat gaaaatactc tagaacggtt acccaatgtg 2341 ttgataacag aaaacttagg agtaaacaga tgaggagtgc gatcgctctc aagcaaaact 2401 cacaataaat aacaatccca attgtcaaca ccaacaacag aatcgactgc acaattggga 2461 tggctatctg ctgtatttgc atttgctgca atacctgctg taaccaaggt gtcaatgcca 2521 acttagtggt gggtaattga acatttggaa accacaaatt taaggctttc tgagcataaa 2581 aaataaacac taaccccgcc agcgcactca atgttatcac tactggtaag agcaagcgca 2641 atccgagttc tgaatcctga gtggcgaggt gtgaagtcgg actttttaaa ttttgaactt 2701 tgttcgcgta gcgtgtccgc aggacatatt ttgaatttgg aattgttccg tccccttgtc 2761 tgcttgggga atacttcaac cttgaggtga caagaaaatt gctgaataaa aaaccgactg 2821 aggtgaaaaa gacaactgcc cacactatta gttgcgttaa caagcgcaac aaaaattctc 2881 gtaaatatcc aacttcacta aaccagaata tttctgctct caggtggcaa gttaaatcaa 2941 aaaacagcca tagccctagt agcaggataa tcagtcgaga ccctcgtttg aataagccat 
3001 caaccattaa tctatgacga acgcattgca cgacctttac tagacgatct taaatcagcg 3061 ccaggtcaat aggtggattt gaggattatt atctatcaac tgtttccaga gcttttttag 3121 gcaatgtgac aaaagatgcc caaaagttat ccatattttc tcgaaatcca tctggaaaag 3181 cgttattccc tgccaaggat tgggaacggt acatgagtgt catttcactt ctacaataac 3241 tttgccaacg gcaatgtaaa cccaggtaca acatcgtcac cactcaaaat tgctgtagct 3301 ggatactggg ttattgaacc atccaggcga taaaccaaag cttgctgatt ttgacggtca 3361 attaaccatc ccaacttgac accattagtt atatattcct ccatctttgc cttgagtttt 3421 tctagactat cgtttttaga acgaatttca atgacaaagt cgggtgctag gttgataaac 3481 ttgtcctctt gttcatccca accttcgctt aagcgttctt tagccacaaa agctgcatca 3541 ggggaacgta cagcagtatt aggtaatcta aaacctgtac ttgaactaaa cacttcacct 3601 aaattctgac tttctaccca agtcaggaga taagtaatta cttttgcttc tctatttcca 3661 gaaattcccc cagtcggtgg catagtttct agtgttccat cagcattacg ttcaaaccta 3721 agttctggat tttgcgaact cgtgagcata agttcttcat cggtgacagt gtacaaagac 3781 tgcaccttaa cttttttgct gttcatgaca atgccataaa gcttgatata cagttcaatt 3841 ctagggcatt gggcaatggt gctttgcggc ttacgctcag aggatgtttg gtttgaattc 3901 gctcttagaa aggggacacg tgaccgaaag cccccttttt aactagtcat ttagctaaac 3961 tgttcgcccg catctaagtt aaacaacatt tgtaaagttt gcatacaccg gcgtcttgcc 4021 tcgacatctt gctgagctcg cagttgcacc attggatcat gtaaaatttt gttaacaatt 4081 cctttggtta aagcttcaat aacttcttga tgtttttcgc cgaattctga gcctaatctc 4141 gacaatgctt tttctagttc ttgctcgcgg attgtttcaa ctttatttcg caaacagcta 4201 attgtggtga ttgtttcgag cgatcgccac caaacatcaa aagcttccgt ctcctcttct 4261 aaaagtcctt cggcttcctg cgccattttg cggcgacttt cttggttttg tgcgactacc 4321 gcctttaaat catccacatt aaacgcctgt acgtttgtta acgtgttgac atcagcatga 4381 acgttacgtg gtacagaaat atcaattaac atcaaaagtc ggctgggttc caaaaccatc 4441 tccaatttag cgcggtcaag aatcggctcg gttgcagatg tacttgtaaa cactaaatca 4501 ctttcggtta ttaccgtcat tatttccgaa agcaagcaag ttttgatggg ttgtccgggg 4561 aagtgctttg ccaattcctc cgcccgtccg agagagcgat ttaagataca aatttgttca 4621 gcacctttgg aaagtaagtg ttgtactagc aaccgcgaca tcttgccagc acctaaaatt 4681 
gccacccgac aagcggttaa atttaccagt ttcatctgcg ccaactccac agccgctgaa 4741 ctgnnnnnnn nnngatggag actgcgccag taccaatgct ggtttcggtg cgaacccgtt 4801 tgccagctgt aatagcttgt ttaaataagc gattcaaaat tgtttttata ccgttgtatt 4861 gctgtcccag tttgtgagta tttttgactt gagccagaat ttgaccttct cccaaaacca 4921 ggctatctaa accagcagca acacgcatca agtgcatgac cgcatcttga tggagcaaaa 4981 caaacagatg ttgccgtaaa gatgtgactg gtagtttact gtgttctgag agaaactgag 5041 tcacttctcg aataccctgt tcggtttctt gagcaacaac gtaaatttcc agacggttac 5101 aagtgcttag tattgcgact tcttcaatat gaggatagct gagcaagtgc gcgatcgcac 5161 cttcagtctg tggttctgga atactcagtt tttcccgaac ttctaccggg gctgttttat 5221 ggcttaaccc caccactgct atattcattt gctaaatctt agttactaat tgagaattag 5281 gaattgggta aagggtaaat gaatattact atccttttac ccttccccaa aaattagtgc 5341 aattgcaatt ccttgggttg tttatcaaac atgtggatag tgtccacaaa tcgggcagta 5401 cgcgattggt tagaaattac caagctttga gttcttgcgc ctccttggaa gaagcggacg 5461 ccttgcatga ggttaccagt cgtaattcca caagcagcga ataagacagt ttcaccagat 5521 gcgagttcat gagcatcgta gaccttatcg gggtcattga tattcataga cttgagccga 5581 tcaaggttgg cttgtttgct ttcgccaatc aaaccagtct tgacgatcgc aggatcgtaa 5641 atcagttgac cttggaagtg accacccaaa gcacgcattg cagctgcgga aataacgcct 5701 tctggggctg caccaattcc catcagggca tgaatgttgg ttccagcaaa accacaagat 5761 atggctgcgc ccacgtcacc atcagaaatc agctggactc tcgctcctgc gtcccggatt 5821 tctttgatca agtcgttgtg gcgttcacgc ttcatcacca cgacaacaag ttcgtcgata 5881 cctctatcta gagcctcaga gagaatcttc aggttttcag tggcagattt gttgatgtcc 5941 accttcccct tagccgctgg aggagctgct agcttcttca tgtagaagtc gggagcagca 6001 aacaaaccac ctttttgaga aattgccaaa acagccattg agcctggttg accgtaagct 6061 acaaggttag taccttcaca ggggtcaaca gcgatatcaa tttcaatgag ttcatcagga 6121 ttgcagaaat ttttggcatc tggctggcta caaataccaa cttcttcccc gatataaagc 6181 atgggcgcgt catcgcgttc gccttcccca atcacgatgc gaccacgcat ataaattttg 6241 ttcatccgtt cccgcatggc ttctacagcc acttcgtcag cggtattttt ttcacctttt 6301 cccatccagc gtgcggatgc gatcgcggct tgttcaacaa cttcaataat ttctaaccca 6361 agtgtatttt 
ccacaaactc tgccctctcg gttgcttgat tttgcgtcgt tccagaaacc 6421 tgtttcagtt ttcaagtcta ccaaagggcg gatacctatg gggatacctg tggaaaatac 6481 tatcactaac ccttatgtta agttttgcat cttatgaagt tacatagcac aaaacgacac 6541 actttctcac ccatatatat caaaagcttt atagaaaatt gtgtttcttt gatggtttaa 6601 cttgactttt gcaatcaagt ttctaccaag ttgcactcaa gcgtatctgg tgaccaatac 6661 agttggtgat aagcctttct ctctcccaag cactcaagaa ctgcttgcct caatgcataa 6721 atttttgaag taaaccgtat gactcaggag gcaaatcgtg cttctcaacg ctcgttacca 6781 ggctcagcct ggtaacgaga atacagaggc tctgcctctt gttgataatg gtgcaatatt 6841 tcatcagaac tttctgagaa ctttacctaa atttagccga atttcgttgt atctacatat 6901 cgtcaactta ttttatgacc taagagtgaa agtctggcaa gtgaacacca ctttgctgtt 6961 tgtcatggtt ttacttgact tttgcaatga agtttgttcc aaattgcagt caaacgtatc 7021 tttacagtat tgctgttaaa taaaacattc tgaaatagaa ccattttgaa caacggagaa 7081 aatttctatg tcaattgaaa tatcaatagc attattttct tatcttttgg gtgttgttat 7141 tcaatgggtt ggatttagac caaaattcag ggaattagac gaatacacaa attttcccat 7201 aatttgggct gccaaaatag ttacaatgat tggttgcttt atggaggctt tgacctggcc 7261 ttatagtctg attttagaga atgaacttat tttagaaaaa aacgaaacga agtaaaaaaa 7321 cttcattact ctagactaca taaatagatg tacggtagtt tgattggtca tttagcaaca 7381 atgaccaagc cctttggggt ggatgtaaga gtataaggga aagaaagctc ctgctttcct 7441 acacccctac acccctatac cccttttaaa gctcctccag tttgtcttgt gccagccatg 7501 aattagacta aataatagat gaaaatgatt cacaaaaaac tttaaaaaca ttatgttgaa 7561 ctctctgaac accctactag gtcgccatcc tgaacgcatc aaagccaatg ttgaaatcta 7621 cacctggcaa acttgcccat attgcattcg cgccaagatg ctgttgtggt ggaaaggtgt 7681 aaaattcacg gaatacaaaa tcgatggtga cgaaacagcc agagcaaaaa tggcagaacg 7741 tgctaatgga cgccgcagcg taccgcaaat ttttattaac aatcagcaca tcggcggttg 7801 tgatgacctt tatcagctag acacgcaagc tcaattagac cctcttcttg gtcaatcggc 7861 tgtttagctt ctcctcaata agaatatgtc tcttgagtac acagaatatc tcatacatcg 7921 tcaatggatg agtcgcgctt tagagttagc acaaatagcg ggtgatgcag atgaagttcc 7981 tgtaggtgct gttgtgattg attcgtctgg aagtttgatt gcagaaggag aaaacagaaa 8041 agaacgtgac aaagatccga 
cggcgcacgc ggaaataatt gcaattaaag cagcggctca 8101 aaaattacga acttggcgtc ttaacgaatg taccctttac gtcaccctag aaccgtgtcc 8161 gatgtgtgca ggtgctattg tacaggcgcg cataagactt ctagtctatg gagtggacga 8221 ccccaaaact ggtgcaattc gtaccgttgt gaatatccct gatagcgctg cttccaatca 8281 ccgcttgcgt gtcattgggg gtattttaga atcctcctgt cgtcagcaat tgcaagcttg 8341 gtttgtgaat cgacgacatt tttctaacta acggacagag gtaaaactgt ccagtgggtg 8401 ctacacaagt tagggaagaa aatctacggt gtaactaaca gagtcttgaa tgtattcaac 8461 ctaacagcca gtagctaccg tcatgatttc gttaaactct ccctctgata ctccctgtga 8521 acatcttgct actacgccag aaacagctaa cgtgactcac attacgacct ctgagatttc 8581 tccttggtta agtcctttgg tgtatttctt aggaggtcac gttcttctac catctttctt 8641 tggaagcatt agagtcaccg gacaaaaaaa tcttccccaa actggtcctg ttatccttgc 8701 tcccacacat cgggcacgtt gggattcttt gctattaccg tatgtcgctg gtcgctgcgt 8761 aactggacga gatttgcgat tcatggtcac tgtgactgaa tgccaaggat ggcaaggctg 8821 gtttgttcga cgcttagggg ggttctctgt agaccctcaa cgcccctcaa ttacgactct 8881 ccgtcatagt attgaactcc ttgaaagggg ggaaatgtta gttatttttc ctgagggggg 8941 catttttcgc gatcgcaaag ttcacccatt aaagtcagga attgctcgtc tggctttgag 9001 tgcggaatct agtcatcctg aattaggcat aaaaattgta cccatcggca ttaattacag 9061 cgaaccttac cctacatggg gtacggatgt aagtattcac attggttctg caataaaagt 9121 agcagattac actaaaggtt ctttgaaaca agatgccaag cgtcttactg gtgatttgac 9181 aaaggctctg caaaaattaa gttatcagga atcacaaatc actcctcgtg catttgcaga 9241 aattgcgaat agttaaccta cagcagtgtt caaaagatta caccacaaat cgtagagtga 9301 ccattgtcca taaaagttat gtgatggaca tttgagatat cgtggcaatg ctcacccgaa 9361 agaaaacccc cgtaaggtag acttttgtaa caactcctgc tgattgctaa ggggaattca 9421 gtcaatttct taattctctt ttagtcccat caagcaatta gcattcttag ttgaaaaatc 9481 agaggaattt gtgcgtttgt tcatagcggt aatttaggta gtacgcagaa actatgggaa 9541 cctgatttcg ttgatgttca gtctaaatca gggaaactga atatgagact ttgttttgcc 9601 aaagttcaaa acagagttaa gataccccaa gtttccataa ctgttttttt agttacctct 9661 ttaaatgacc tcatgaactc gtttgcaacc attcctatat ttcgcttcac ttccttagtg 9721 gtgatagcaa ccctgagttt gttatcacct 
gttcatgctc aagtacagct acccactggt 9781 tctagccaac cacaacccat agatcccaac gatccaaata accttcgtcc cacagcacaa 9841 aataacagcc ttttgagcct tgacggaggg aggcgtctca tggcagaagc aagtaatgca 9901 gtttcttctc aaaactacga cgcagcagca aagaaacttc aagaagcacg agccgtattt 9961 aatcagctgt ctaatttcta tcaagaatta aattctagtt tttctggaat tgacaataga 10021 gttgcggatg atcaacgaaa aaaggcgcta gaaactgccc aaatgcgaga cgaagcttct 10081 taccagctag cattggtaca tagagcgcaa aataagccag aattagctgt accgttactt 10141 gttcaaatag ttaagagtca aaacccaaca cgtgatttag gtaagaaagc atatcagcag 10201 ttattggaat tgggctttgt gaatgctccc ttttctggag ggggtagtaa cgcttcctct 10261 tcgccctctg gtgcttcctc ttcttcccct caaaaaaatc ctcaaaagaa ccctcaaaac 10321 aaccctcaaa agaaataatc gaattacctc tgtcctttgc tcaccactgc ccctaatccc 10381 cacttgtcat tcacccgaat gataggatga gattattggt ggtggtgaga agttttgatg 10441 tttgatgcta aattgcgttt tcactattag gacttacgca aaaactcttc ttaactctta 10501 ttcctttgtg tacgagtgcg tacgagtgca gtaacctcct ctgcgtctac gttttttcaa 10561 catttcgcgt aagtcctgat tatgttattg tttgccaaag cctgagaatc tgggcacatg 10621 ccaaacgcta acgtgcacgc tacgcgccat tcgtacctcc tcataaggca cgcttgagtg 10681 tgttttgctt agcacgataa gttacctctt tgactgcaac tcactataac tctctattaa 10741 gatatatttt aaattgccca aatgcgggag ttacaatccc tattaggctt tgaaatgatt 10801 aggaattgcg atgatgagtc cccagcaagt agaggaaatg ataaagatag aactgccaga 10861 cgccctagtt caagtgcagg acttgactgg aggcggtgat cactatcaag tgacagtcgt 10921 ttcatcgcag tttgcaaata aaggactagt acaacagcac cagttagtct atggtgcgct 10981 taagcaagct atgtctagtg aagcgatcca tgctttagca ctaaaaacat atactcccga 11041 tgcttgggaa aatagtcatt agtcatgagt cattagtcat gagtcatcag tcatgagtca 11101 tgagttagga ctaatgatcc aagccaaaag acaacatcag caattagact aaaagcaaat 11161 caggaaacaa aaagaccatg acgccagaac ttaaagagcg gattgataat ttagtaaaac 11221 aaaacaagat tttggttttc atgaagggaa ccaagttaat gccccaatgt ggtttctcta 11281 acaacgtcgt acaaatctta aatacgttgg gagttccctt ccagacagtg aatgttttgg 11341 atgactacga aatccgtcaa ggaattaaag actattccaa ctggccaaca attccccaag 11401 tctatatcaa 
cggtgaattc gttggtggtt cggacgtcct tattgaactg taccagaaag 11461 gcgaattgca gcaaatagtg gaagtagcac ttgcttcgtg aaatgaagca gattgcagtt 11521 agcagttagt tcttcataaa tacagtccta ggtgctgagt gttgagtaat gagtgagctg 11581 atgagccaat tcactcgtta ctcacgactt gttacctagg actgctttgc tcggaattgt 11641 aaagaaatat gtccaaaagt tctcatcacg gccacgcagg taatcaggta tgttataatt 11701 acctgcgttg aggtgctggt actattgata acaaaaggaa gatattaata gtaatccagt 11761 tgtggttgcg tcattgtaaa gtttgaaaca tcgtgcaagt agcgattagc ctcctcttct 11821 aagcgatgaa tcagatacgg ttcaatagcg ttaccgtgtt gcagtgcggc tatatagcca 11881 tccaaataca tccgcatatc atccatgcga taaccgcgat tccataactc gacgaaggcg 11941 tcagttagtc tttggtaata gcgaatggtt tgtgtgtctt ggagcataac tgctgttgga 12001 aaactgtctg aaatagagga ttgtaattta ttcgagcacc caagggaagt ctcactttca 12061 gtttggcatt aactataaag acgatatctc ccctaaagga aaaatctcga gcagctacag 12121 actatctcga tcctattcca taaaactgga aatatgatta tcaggtaagc tattgttata 12181 attgcctcaa ccaataactt aactggcaag gtgagtgagt agtaatctgt accgctagct 12241 cgaatatgct tcaaccgcaa agctgaaaaa gttttttggc agaggttagc ttatttagta 12301 agacttcctc ggtaaatggc acccaattca aagcagagtt tgtaatcaaa agcattttta 12361 ctaagtctaa caaaaattta agattattac taacttggct ttggacaacg cgttaatcaa 12421 acgctttttc tgtattgata ttagtaaata tgaatttaat aacaattgag gtaacaagga 12481 ttacaaaacc taagacatag gctttgttac cttgaaattt atgcacgcta aggggtcttg 12541 ccaccgtggg ttcggtttgt attgaaatca ttgagggaaa tccccatctg aggtcattgc 12601 ttggttggca cttgcaacaa ctggagtatc gggtgcatca agctgcaagt atttatcagg 12661 ctagagaagt gtttttaagc catcaaccga cattggttat tctagatgcc gatttacctg 12721 atggtgatgg tattgagttc tgccgttggt tacatcgtca acaacagcct ctgattctaa 12781 tgctatctgc ccgtaataat gaggctgata ttgtggctgg tttaaaagcg ggagcagatg 12841 actacctgtg caaacctttt gggatgcagg agtttcttgc acgggttgag gcacttattc 12901 gccgtaagcg cacacccgtt gcaccagctt atttagatta tggtgctttg caaattgatt 12961 tggtacagcg tcgcgttcgc atcctcgggg agtttatcga cttaacgcct caagaattca 13021 gtttgctcta cgttttagcg caagctggag gagtaccttt gagtcgttct gaactgctac 
13081 gtcgtgcttg gcctgatgct atagataatc cacgcaccat tgatactcat gttttatcat 13141 tacggaaaaa agtagaacta gacccccgcc aacctagtct aattcaaact atccgtaatg 13201 taggatacag atttaacacg gaaattttaa atgctaatat tccaaactcg tcaacaaagt 13261 tacctaaaga aagattcaac aatcaacgtt ctatgcttag cactcagcgc tgatagagga 13321 acaacttcaa atttcaaatt cgtcttgggc ttttgcttgc gttaacttgt ctctcaagtt 13381 cagccaactt atttcagagg cttcttgatt gacaagtaag cgaccttgtt gtagacataa 13441 aagccgtgta caaaatttct caactagctc tagctggtga ttgaccataa caactgttgt 13501 ttgataattt tgagccaact ggctgaaaat ttccaccaga tgctcagcct gatttacatc 13561 tagggcagag gttggttcat ctaacaataa aatttttggt tgaatgacta aagcacgggc 13621 gatcgccaca agctgtcgtt gacccaaaga aagctgcacc tcagtccgcg ttaaccattc 13681 atctggaatt tgcaactgtt ctatccagtg actgacgcgt tgctgaattg tttgtttggg 13741 taaaccacgc aaaaccagtg gataagccaa agcttctttt actgtcattc ctaatagctt 13801 tggttcttgg gataccagtg ttactatgga gcgtagctgc aacacaggaa tttgggaata 13861 ctcctgattc tgcagataaa ttttaccgct tgtgggttca cttaaacggt tcagcaagcg 13921 taataaataa gtttttcctg caccagatgg tccaacaata gcaacccgtt ctccctcaaa 13981 tacctcaaag gaaatatcct gtaatattgg atatcccagt tgattatcct gactctgggt 14041 tttcaggctg gcaaaaagac tgacttgttc tagccacagt tgtgctttga ggctaagttt 14101 ccccatctgc gtgtcttgtt gaatttccaa attaaacctt tgttgtgata attgttcttt 14161 ttgtctcaca ggacttacgc aaaaataacg tagttgcacc cgttgcgacg taagtcgtaa 14221 cta // LOCUS NODE_2382_length_14211_cov_5.13061614211 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14211) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14211) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14211 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(270..902) /locus_tag="DP116_19815" CDS complement(270..902) /locus_tag="DP116_19815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210155.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Sua5/YciO/YrdC/YwlC family protein" /protein_id="PRJNA477356:DP116_19815" /translation="MQVSLDTLILGARAGKLISFPTDTVPALAAIPEQGRLIYAAKQR SREKPLILMAASAEEIWSFTTGSDQEYEIWHRVAKKYWPGTVTLVLPASASVPQEMIS IFDGDSAPQKDKLISDRTIGIRVPNCRIAQSILAQTGPLATTSANLSGQPPLQTMAEI SVQFPDVLTLAATEFQDEVHGDGVPSTVVKWTGENWQVLRQGATKIDFKP" gene complement(1010..1924) /gene="prmC" /locus_tag="DP116_19820" CDS complement(1010..1924) /gene="prmC" /locus_tag="DP116_19820" /EC_number="2.1.1.297" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746699.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptide chain release factor N(5)-glutamine methyltransferase" /protein_id="PRJNA477356:DP116_19820" /translation="MADKQLSVTGLQLWQWRNVAIQAANATGILPTEVDWLLQEVAGL DRLALRLESYKHQAEIPLKLSFENLDKLWQQRLNEHLPVQYVAGATPWRKYKIAVSNA VLIPRPETEYLIDLAVAAARKSTVTPSLEQGHWADLGTGSGAIAIGLADVFTTATIHA VDYSHEALLVAKANAQNLGFGERIQFYQGSWWEPLASLKGQFSGMVSNPPYIPTNIIP TLQLEVVKHEPHLALDGGIDGLDCIRHLVEISPIYLRSGGVWLIEMMKGQADTVREML HNQGSYCNIQIHKDLAGIERFALAYTKS" gene complement(1950..2789) /locus_tag="DP116_19825" CDS complement(1950..2789) /locus_tag="DP116_19825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746698.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19825" /translation="MNLLVRWGTTLTLVGSTLLATVFSGNAPVLALTEQQIKEKLDPV PVFLITNNQGVPLTRTVANNGQNGQNAQNAQKKQATVTDVFMSGQEAQAFINELRNVK GKDPKMAEMLKSLQVTPVPLGMIYQKLQENAKKPDSLVFAFNPGRQDLEGAVTLLRQN GKEVKQFPSVPVFIVRSPDKGYVSVKRKTDNKEVIPLFLSQKDAQSLLSQVKQQVPKA DIQVVDIDGVIKTLKEKNDTWLSQVSIVPSTESMQYVVSKRGNAPNQNPSAKPGAPAT PKK" gene complement(3672..3745) /locus_tag="DP116_19830" tRNA complement(3672..3745) /locus_tag="DP116_19830" /product="tRNA-Pro" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(3709..3711),aa:Pro,seq:tgg) gene complement(3918..4469) /locus_tag="DP116_19835" CDS complement(3918..4469) /locus_tag="DP116_19835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654960.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_19835" /translation="MGFWKTWFSTSEANSTTRTTPSEEYAVETVGNSDLNGVGEVHQQ ETRIVFSTERDIDLYELEELCDSVGWSRRPLRKVKKAIEHSFLVASMWQVRGNKRRLI GFARATSDHAFNATIWDVVVHPDFQGKGLGKALMKYVLKKLRSEEISNVTLFADPHVL DFYRSLGFMSDPEGIKGMFWYPH" gene 4794..5723 /locus_tag="DP116_19840" CDS 4794..5723 /locus_tag="DP116_19840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137733.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_19840" /translation="MATIEILGVPHAYELTAPTNYPHTLVFIHGWLNSRGYWQPVISR LSDDFQCLSYDLRGFGESQSKLKTDFSQEQNYFSLSTKSSRAVVDPFDSVYTPAAYTQ DLADLLQQLNVKSAWLIGHSLGGTIALWGAAQMPECVKGVICINSGGGIYLKEAFEQF RSAGQRFLQVRPKWLGQLPLIDLLFTRASVARPLERNWARQRVIDFVVADPEAALGAL LDSTTEEEINCLPGLVAQLKQPIYFLAGAEDKVMEPKYVRHLASFHPLFSYCGDNVIE IPDCGHLAMLEQPDAVATHIRSLVKNQSSVVNC" gene complement(5734..6792) /gene="tsaD" /locus_tag="DP116_19845" CDS complement(5734..6792) /gene="tsaD" /locus_tag="DP116_19845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137732.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD" /protein_id="PRJNA477356:DP116_19845" /translation="MATVLAIETSCDETAVAIVNNREVCSSIIASQIPVHQQYGGVVP EVASRQHLETINGAIAQALEQAAVDWGEIDGIAATCAPGLVGALLVGLTAAKTLAMVH NKPFLGVHHLEGHIYATYLSEPTLEPPFLSLLVSGGHTSLIHVKDCGVYQTLGETRDD AAGEAFDKVARLLHLGYPGGPAIDKQALQGNPQAFRLPEGKVSLPEGGYHPYDASFSG LKTAVLRLVQQFEKDGQSLPTEDVAASFQETVARSLTKRAITCALDYGLSTIAIGGGV AANSGLRQHLQQAAQTHNLRVLFPPLKFCTDNAAMIGCTAADHLNRGHTSPLTLGVHS RLALSQVMELYQTGTHNL" gene 6968..7465 /locus_tag="DP116_19850" CDS 6968..7465 /locus_tag="DP116_19850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459081.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Photosystem I reaction center subunit III" /protein_id="PRJNA477356:DP116_19850" /translation="MRCLFALVLAICIWFNFTPKAFAVGADLVPCSESPTFQERVQTA RNTTGDPNSGEKRFERYSQALCGPEGLPHLIVDGSLDHAGDFLIPSILFLYIAGWIGW VGRAYLQTIKKEGGDVEWKEIKIEVPKALPIMLSGFTWPIASIKELLSGQLTAKDEEI PISPR" gene 7598..7741 /locus_tag="DP116_19855" CDS 7598..7741 /locus_tag="DP116_19855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949893.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit IX" /protein_id="PRJNA477356:DP116_19855" /translation="MAEKQPNYFVQYLSLAPVLLFVNLIVTAVILILFNNWFPDLLFH PLP" gene 8056..8589 /locus_tag="DP116_19860" /pseudo CDS 8056..8589 /locus_tag="DP116_19860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654966.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="photosystem I reaction center protein subunit XI" assembly_gap 8387..8396 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(8752..9378) /locus_tag="DP116_19865" CDS complement(8752..9378) /locus_tag="DP116_19865" /EC_number="2.7.4.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867320.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="guanylate kinase" /protein_id="PRJNA477356:DP116_19865" /translation="MMQVISTKSGATTKECPPTGKLIILTGPSGVGKGTLMRSLLQRH PDLHYSVSVTTRSPRPGETNGKDYYFVSRREFEELVAAGELLEWAEFAGNYYGTPREA VINQIRSGKRVVLEIELKGARQIRASYPNALSIFILPPSMSELEKRIRGRAQDSDEAI ARRLRRAQEEITAADEFNLKIVNDDFERALNAIEAAILSEHSLARRSG" gene complement(9543..9812) /locus_tag="DP116_19870" CDS complement(9543..9812) /locus_tag="DP116_19870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF370 domain-containing protein" /protein_id="PRJNA477356:DP116_19870" /translation="MDIQLINIGFGNIVSANRVVAIVSPESAPIKRIITDARDRGQLI DATYGRRTRAVIITDSSHVILSAIQPETVANRFVITRDHHQAVDN" gene 10008..10211 /locus_tag="DP116_19875" CDS 10008..10211 /locus_tag="DP116_19875" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19875" /translation="MIATHVCWVECVNPTPTTLVNVGLSFALPSPIYVSTHLGIISKM KRKSLPAIPRAVRLEIKKTETIF" gene complement(10222..10878) /locus_tag="DP116_19880" CDS complement(10222..10878) /locus_tag="DP116_19880" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19880" /translation="MKRLNWVGTFLLAVTLVSVQAFANQTLATAKLKPHLDEPYSQMM TGHSPSTDQQISTRRNPFTATTYQANSVLAKFKTYVENKVYSISYPLEWFITRSHREL AYITNQKMTTTGEGGFPPDFIKTDVQIISENFQTSFTQHLTFSQEDGDRLVKKENMKI DGKNAVRLWYSGGETETVMTLLPYKDNNTVCIATFYTTNNSNYIPVIEKMHSSFKVLD " gene complement(11973..13886) /locus_tag="DP116_19885" CDS complement(11973..13886) /locus_tag="DP116_19885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015956020.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transketolase" /protein_id="PRJNA477356:DP116_19885" /translation="MTVATASFPINLGAYKQIALNPANPTLTNEQRETLKANIQLCRD AIVFFTATGAARGVGGHTGGAYDTVPEVVILDALFRGAPDKFVPIFFDEAGHRVATQY LMAVLHGELSAEQLVHYREADAKLPGHPELGLTPGVKFSSGRLGHIWPYINGVALANP NKVVFSLGSDGSQQEGNDAEAARLAVAKNLNVKLIIDDNDVTIAGHPSEYLPGFSVGK TLAGHGLSVNEGDGEDLDDLYRRICEAVTSDGPVALVNKRKMAVGIEGIEGSTHGHDV IPADKAIAYLEKRGLTEAVRFLKSIEKPKNTYSFIGSGDKWGSNRNVFGEAVVSVLSR LSETERKEKVMCIDSDLEGSCGLKKIHDTYPEIFVSSGIMERGNFSAAAGFGMEKGKQ GIFGTFSAFLEMCVSEITMARLNYSNVLCHFSHSGVDDMADNTCHFGINNFFADNGLD DGYETQLYFPADAAQMKACVEAVFFDPGLRFIFSTRSKTPNILDANGKELYGEGYTFT PGKDEVVREGTAGYIISFGEALYRAVDAVERLKQQGIDVGLINKPTLNVIDEQTLAKV GKAPFVLVVESFNRRTGLGSRFGTWLLERGLTPKFAHLGTHKEGCGGLWEQFPHQGID PEGIINKVKELIG" BASE COUNT 3996 a 3037 c 2914 g 4254 t 10 others ORIGIN 1 cttggcgtct tggcggttcg ttaaattagg tattcttctg gcgggaaggg agtaaaaaat 61 cgccaacttt ctgcctatgt acttctcgtt ttaattcttc ccaacttgct tttgtctcca 121 gatgcaccct tgccttaatt tcacagttga gattaaatcc tgtagtatcg gattaataaa 181 aaaaccactc cagacgcaca ggaaacagag tatgacaaag aaatcatatt gatacaaacg 241 gatttgatat tacataaaac ttatcgactt caaggcttaa aatctatctt cgtagcacct 301 tgtcgtaaaa cttgccaatt ctctccagtc catttcacaa cagtcgaagg aacaccatct 361 ccgtgcacct cgtcttgaaa ttccgtcgcc gctagagtca aaacatcagg aaactgtacc 421 gaaatctctg ccattgtttg taagggaggt tgacctgata aattagcgct ggtggttgca 481 
agaggacctg tttgcgctaa gatactttgg gcaattctac aattaggcac tcgaattcca 541 attgtcctgt cagaaataag cttatccttc tgaggagctg aatctccatc aaaaatagaa 601 atcatttctt gtggtacgct agctgaagca ggcaacacca aagtcactgt tcctggccaa 661 tatttcttgg caactctgtg ccaaatttca tactcttggt cactacccgt ggtgaaagac 721 cagatttctt cagcactcgc tgccattaat attaagggtt tttctcgact acgctgcttc 781 gcggcgtaaa ttaaccttcc ttgttcgggt attgcagcca aagcaggaac agtatctgta 841 ggaaagctga ttaatttacc agcacgtgcg cctaagataa gggtgtctag ggaaacttgc 901 atgaacagaa cccagaattc agaaccgaga actcagaatt gagttatcac tcaccagtta 961 tcaattcact gttcactgtg ctgagttggt gagttcttta ttggtgatgt tagctctttg 1021 tatatgccag agcaaagcgc tcaatgccag ctaaatcttt atgaatctga atattacagt 1081 aacttccttg attgtgcaac atttctcgca cagtatccgc ctgtcctttc atcatctcaa 1141 tcagccaaac cccacctgaa cgtagataaa tgggagaaat ttctaccaaa tggcggatgc 1201 aatctaagcc atcaattcca ccgtctaaag ctagatgtgg ttcatgctta acgacttcaa 1261 gctgtagagt aggtataatg ttggtgggga tatagggcgg gttagatacc attccgctga 1321 attgaccctt gagcgatgcc agaggttccc accaagaacc ttgataaaat tgaatacgct 1381 ccccaaaacc taaattttgg gcgtttgctt ttgcaaccag taacgcctca tggctgtagt 1441 caacagcgtg aatggttgct gtggtgaaaa cgtctgctaa tccaatggcg atcgccccac 1501 taccagttcc taagtcagcc cagtgtcctt gctctaaaga cggtgtcacc gtactttttc 1561 tagcagcagc gacagctaaa tctatcaagt actctgtttc tggtctggga atcaaaaccg 1621 cgtttgacac agcgatttta tactttcgcc aaggagtcgc tcctgcaaca tattgtactg 1681 gcaagtgttc atttaatcgc tgctgccaga gtttatctaa attctcaaaa gataacttta 1741 aaggaatctc agcctggtgt ttgtatgatt ccaaacgcaa tgccaagcgg tctaatccag 1801 cgacttcttg tagtaaccaa tcgacttcag taggtaaaat accagtggcg tttgcagctt 1861 gaattgccac gttacgccac tgccaaagtt gtaaaccagt tactgataac tgtttatctg 1921 ccatgcctct atcttaaggc agaaatttat tatttcttag gagtcgctgg tgcgccgggt 1981 ttggcagagg ggttttgatt tggagcattg cctcgcttac taacgacata ctgcatactc 2041 tcggtagatg gtactataga tacttgactt agccaagtgt catttttctc tttcaaagtt 2101 ttaataactc cgtctatgtc tacaacttga atatcagcct tgggaacttg ttgctttacc 2161 tggcttaata 
aactttgggc gtctttttga ctcaagaaca gaggaataac ttccttgttg 2221 tcagtcttcc gtttgacgga tacatatccc ttatccggag atctaacaat aaaaacaggg 2281 acacttggga actgcttaac ttctttaccg ttttggcgca gcagtgtgac tgctccttct 2341 aaatcctgtc ttccaggatt aaaagcaaac actaggctgt ctggtttctt agcattttct 2401 tgaagtttct gataaatcat ccccaaaggt actggcgtca cttgcaggct ttttaacatt 2461 tctgccattt ttggatcttt acctttgaca ttccgcagtt cgttaataaa agcctgagct 2521 tcctgtccgc tcataaaaac gtctgtgacc gtagcttgtt tcttttgagc gttttgagcg 2581 ttttgaccat tttgaccatt gttagccaca gtacgagtca gaggtacacc ctggttgtta 2641 gtaatcaaaa acacaggtac tggatctaat ttttctttaa tttgttgttc tgtcaatgcc 2701 agcactggag catttccgct aaaaactgtt gctagcagag tactcccaac taaagtcaat 2761 gttgtgcccc agcgaactaa taaattcata atttctcccc gcatcaatac ctctaaaatc 2821 aagtctgaca atatagttgg atttgcacag tattaaccag ttctagatat actacctttt 2881 ttgctcattg actgtgctgt ttttggtatt ttaatttcgc cctactaaga attgcactcg 2941 gatggtagat gattgctgta gattaacaaa agcggcgaaa agtgtactgt tcacttcaag 3001 ggtgtctttt gccattctat tggctttttc agttatcgtt gacgacaata aatatgagat 3061 cgttcctttt acatctgctg aaaatgtttc aaggtttaca tatttcctct cttgtgcgac 3121 ttaatacttg gaaattgttt tgcctaatag aaaagaaaaa atagcaaagc aacatgatta 3181 ctgcttccgg gaaaatatgc aatttcagtg tacattgtct gacaaatcaa acaaatagtt 3241 gcagcacagt ccagttttga ctttttgatt tatttttttg atggtatatc taaaaaagaa 3301 cgaaattcat ttatgtatac ttgagtcatt tgcacgctca aggtgggaac tcttaacagg 3361 gaatagggaa cagggaatag ggaactctta acagggaata gggaacaggg aatagggaac 3421 tcttaacagg gaacagggaa tagggaacag ggaataggga actcttaaca gggaacaccc 3481 gaacgcttaa ctctgaagaa ggaataaagg tgtacgaagc ttggcaaaac tcaaatagga 3541 gtgctataca aaagaacgca atttggtatt atgtgttgtt tgcttgcgct gattagtggt 3601 tatactatat gaaaaaatga aaaaacctat acctgtgttc aggtataggt tttttgctta 3661 ttactatgat atcgggatga caggatttga acctgcggca tcctgctccc aaagcaggcg 3721 cgctaccaag ctgcgctaca tcccgattaa tttattatct taacttagtt gttgcttttt 3781 gacgatactc aaaagctttt ttttaagata acttgatatg ttgccaaact tgcatcaaca 3841 ctataccata ctttgttgat 
ttttgtaaac agtttcgttc cctatgcaaa agttatgaga 3901 gtaatgtctc ttaaaaatta atgagggtac caaaacatac ctttgatgcc ttcggggtcg 3961 gacataaacc ccaaactccg gtaaaaatct aaaacatggg ggtcagcaaa aagagtcaca 4021 ttactaattt cttcgctcct gagctttttg agtacatatt tcataagtgc ctttcccagt 4081 ccttttcctt gaaagtctgg gtgaactacc acatcccaaa ttgtggcatt aaaggcatga 4141 tctgacgtcg cacgggcaaa gccaataagt cgccttttgt ttcctcgcac ttgccacata 4201 gaggcgacaa gaaaactatg ctcaatagct ttttttactt ttcgcaaagg acgacgcgac 4261 caaccgactg aatcgcatag ttcctctagt tcatacaggt caatatctcg ctcagtacta 4321 aaaactatgc gagtttcctg ctgatgaact tcgccaacac cattcaaatc tgagttacct 4381 acagtttcaa ctgcatactc ctctgagggt gttgtcctag ttgttgagtt agcttctgat 4441 gtactaaacc aagttttcca aaaacccatg ccaacgtggt tcaggtagta tatacgaggt 4501 ggttttgaca gataagagtg ctgattcact tgtgattcaa ctacctacgt tgtattccgc 4561 aacacataca gttgttttgt ctgttggcgg aattttccga tgacagtagt catcaagcgt 4621 gcatcttaca ctagtctttt tcaactttag cattttgttg tggagctaag aagaaattag 4681 ctaaaacgga aacatttaag aataaattgt gtatgacacc agaattcatg tgcctccatt 4741 ttccaatctc cttatcttgc caactcccga actgaacggg gtacgataga aatatggcaa 4801 ccatcgaaat cttgggcgtt ccacacgcat acgaactcac agctcctacg aactaccccc 4861 acaccttagt ttttatccac ggatggctca atagccgtgg atactggcaa cctgtaattt 4921 ctcggttgtc agatgatttt cagtgtctct cttatgattt aagaggtttt ggcgagtcgc 4981 aatctaaatt aaaaactgat tttagtcaag aacaaaatta tttcagccta agtactaaat 5041 ctagtcgtgc agttgttgat ccctttgatt ctgtatatac tccggctgcc tatactcaag 5101 atttagcaga tcttctgcaa cagctaaacg ttaagagtgc ttggttgatt ggtcactctt 5161 tgggagggac gatcgccctt tggggtgctg cccaaatgcc agaatgtgtc aaaggagtta 5221 tttgtattaa ctcaggcggt ggaatttatc ttaaagaagc ttttgagcag tttcgttcag 5281 cgggtcagcg gtttttacaa gttcgcccta aatggcttgg gcaattgcct ctgattgatt 5341 tactgtttac tagagcaagt gtagcacgtc ctttggagcg taattgggca cggcagcgag 5401 ttattgattt tgttgttgca gatccagaag ctgctttagg agcattgcta gattctacaa 5461 cagaggaaga aattaactgt ttgcctgggc tggttgctca acttaagcaa ccaatttatt 5521 tcttagctgg tgcggaagat aaggttatgg 
aacctaagta tgtccgtcat ttagctagct 5581 ttcaccctct tttttcttac tgtggtgaca atgtgatcga aattcctgat tgcggacact 5641 tagcaatgtt ggaacaaccg gatgcagttg cgactcacat tcgctctctt gtcaagaatc 5701 aatcgtcagt tgtgaattgt taataaagtc aagttaaaga ttgtgtgtac cagtttgata 5761 caactccata acttggctta gcgctaacct agagtgaacg cctagggtaa gaggcgaggt 5821 atgacctctg ttgagatggt cagcagcagt acaaccaatc atagcggcat tatcggtaca 5881 aaattttagg ggagggaata gcacgcgtag gttgtgagtt tgtgctgcct gttgtaaatg 5941 ttgtcttaac ccactgttgg ctgctacgcc tccaccaatg gcaatggtag aaagaccata 6001 gtcaagagca caggttattg ctcttttggt gagggaacgt gctacagttt cctgaaaact 6061 agccgccaca tcttctgttg gtaaagactg tccatctttc tcaaattgct gtaccaaccg 6121 cagcactgct gtctttaacc cactaaaact cgcatcatac ggatgatatc caccttctgg 6181 tagagaaact ttcccctctg gtagtctgaa ggcttgtgga tttccctgca atgcttgctt 6241 gtcaattgct ggtccgccgg gatatcccaa atgcaacaga cgtgccactt tatcaaacgc 6301 ttcacccgca gcatcatcac gagtttctcc tagggtctgg tacacaccac aatctttgac 6361 atgaattaag ctcgtgtgac caccagagac tagtaagcta agaaaaggag gctctaaagt 6421 tggctcactc aaataagtcg cgtaaatgtg accttcgaga tgatgaacac ctaaaaatgg 6481 tttattatgt accattgcta aggttttggc ggcagttaat ccgactaaga gcgcccctac 6541 tagtccaggt gcacaagtcg cggcgatacc atcaatttcg ccccaatcta ctgctgcttg 6601 ttccaaagct tgggcgatcg ccccatttat tgtttctaaa tgctgccggg atgcgacttc 6661 cggcacaact ccaccatact gctgatggac tggaatttgt gaggcaataa tactactaca 6721 aacttcacga ttgttcacaa tcgccacggc agtttcatca cagctagttt ctattgctaa 6781 aacggttgcc attcaagaat tttgcctcta agaacttgtt tgagaagctt taacttttat 6841 ttactttaac tttactcggt tcacgcgcaa gagtatcatg acatcctagt tctgcatatc 6901 aagattgtac aagccgcctc gttttgtaca aaaaactttt tgtttcgtca taaaaggaaa 6961 caattccatg cgttgcttgt ttgctctggt tctagcgatt tgtatttggt tcaacttcac 7021 cccaaaggca tttgccgttg gggcagatct tgtgccatgc agtgaatctc ctaccttcca 7081 agagcgggta caaactgccc gcaataccac cggtgacccc aattcagggg aaaaaagatt 7141 tgagcgctac tctcaagcgc tgtgtggtcc tgaaggttta cctcacctga tagtggatgg 7201 tagtcttgac cacgctggtg atttcttaat tcctagcatt 
ctcttcctct acattgctgg 7261 ctggattggt tgggtaggtc gtgcctattt acaaacaatc aaaaaagaag gcggtgatgt 7321 ggaatggaag gaaatcaaaa ttgaggtacc aaaggcactg ccaattatgc tgtcaggctt 7381 tacttggccc atcgcatcca taaaggaatt gctttcgggt caactgacag cgaaagatga 7441 ggaaattccc atctcgccac gctagtgagg attttagatt tggagtggtt taggagttag 7501 tagtcttttt tgacaactaa caactaacaa ctcacataac aattctcaat ccatccaaaa 7561 atcactcatt taactgactt attgaggaga atggttcatg gcggaaaaac agccaaatta 7621 tttcgttcaa tatctttctc tggcaccagt tctgttgttt gtcaatttga ttgtgactgc 7681 agttattttg attctcttta acaattggtt cccagaccta cttttccatc cattaccgta 7741 ggtttttaag aaaagttagg agtgagaaat tatgaattat gaattgtgaa ttcataattc 7801 atagttaata gttcatagtt tcaaactcct aacttatagg ttcgcctaga tgaacaacaa 7861 tatagtaaac acagttgctt gaaactggta attatataga taatgtgtga gttttcatct 7921 gaatttaaaa aaattaaaac agctaaatta atttttctaa actagacttg caaaaatgaa 7981 atttagtgac aattaagaat tattatgaat gtaaaatcat gccacaagaa ttttctttag 8041 aggcacacag aaaatatggc gcaagcagta gatgcatcaa aaaatcttcc cagcgatcct 8101 agaaatcgcg aagtcgtttt tcctgaatgg cgcgatccac aacggggcaa tctggaaaca 8161 ccgattaatg cttctccttt agtcaagtgg ttcatcaata acttgcccgc ctatcgccca 8221 gggctaactc ccttcagaag agggctagaa gttgggatgg ctcatggtta ctggattttt 8281 ggtcctttct ccaaactggg tcccctgcgc gatacgccta atgccaactt agcgggatta 8341 ctgtcaactt tgggcttgat agtccttctg actggggcta tatctcnnnn nnnnnnatct 8401 ctgtatggca acactaaccc tcctcaacca aacgtcactg tcaccacacc caatcctcca 8461 gatgctttta aatctggtga aggttggaat ggctttggca gtgctttctt aatcggtggt 8521 attggtggtg caatagttgc atactttttg actagtaatc taggtttaat tcaaggtctg 8581 tttggttaat cagtcatctg aggcatgggg tcataagacc tctatcaaaa agctgctaga 8641 cagatgatca aaaataaagg acgccaggaa aaagttatgt aaaccttttt cttgcgtcct 8701 ttacttgtga gtttacctcc agcagcgtca taggagggtt ttaccaaata attatcctga 8761 gcgtcttgcc aaggagtgtt cactcagtat ggctgcttct atggcattga gagccctttc 8821 aaaatcgtca ttaacgattt taagattaaa ctcatcagca gcagttattt cttcttgagc 8881 acggcgcaga cgacgggcga tcgcctcgtc tgaatcctgt gcccgaccac 
gtattcgttt 8941 ttctaattca ctcatagaag gcggcaaaat gaaaatgctg agggcgttag gataagaagc 9001 gcgaatttgt cgtgctcctt tgagctcaat ttctagcaca acccttttgc cagagcgaat 9061 ttggttaatt acggcttctc gcggagtacc gtaataatta ccagcaaatt ctgcccactc 9121 cagtaattcg ccagcagcaa ccaattcttc aaactctcta cggctaacga aataataatc 9181 tttgccgttc gtttcccctg gacgaggaga acgagtcgtc acggatacag aataatggag 9241 atccggatga cgctgtaaga gcgatcgcat taaagtgcct ttgccaactc cacttggacc 9301 tgtcaagata atcagcttgc ctgttggcgg gcattcctta gtagtagcac cacttttagt 9361 gcttataact tgcatcatcc gtttaacctg tgaattgatc agtagatact attcatttgc 9421 catgagtcgg acaactcctg actaacgaaa ttttgtagcc tgggtgacat aaacccactg 9481 ttattcacat gttactgctg gtatggcgca aataggcgtg ggactataga tttaaccttt 9541 gctcaattat ctacagcttg atggtgatca cgggtgatca caaagcgatt cgctaccgtt 9601 tccggttgaa tcgccgaaag aataacatga ctggaatcag tgataataac agccctagtg 9661 cggcgaccgt aagttgcgtc gatcagctga ccgcgatcgc gcgcatcggt gatgatccgc 9721 ttaatcgggg cagactctgg actgacaatg gcaactactc ggttggcaga cacaatgttg 9781 ccaaagccga tgttgattaa ctgaatgtcc ataaaaaaac tgacgctaaa cgtggtgaga 9841 aaagctttca aagaactatt ttccatgtta tccacaaaaa atagcagtta caacgcttat 9901 actcacgtaa gtcttcgttt ttaagttaat gaatgccttt tttagctatt tctgagccaa 9961 agatcctaaa aatctaccta aaggaatagg ctaatccaat ttaaaaaatg attgctacac 10021 atgtatgttg ggtagagtgc gtcaacccaa cccctacaac gcttgtaaat gttgggttga 10081 gttttgcttt gccctcgcca atctatgtat ctacgcattt aggaatcatt tctaaaatga 10141 aaagaaaatc tttaccagct attccccgtg cagtgcgact tgaaatcaaa aagacagaga 10201 caatttttta gtgttagttc atcaatctaa aactttgaag gaggagtgca tcttttcaat 10261 aacaggtata tagtttgaat tattcgttgt gtaaaaagta gcaatacaaa cggtattgtt 10321 atctttataa ggtaataaag tcattactgt ttctgtctct ccaccagaat accacagcct 10381 aacagcattt tttccatcaa ttttcatatt ctcttttttg actagcctat ctccatcttc 10441 ttgagagaag gtgaggtgtt gagtgaaact tgtttgaaag ttttctgaaa taatttgaac 10501 atcagtttta ataaaatctg gcggaaaacc tccttcacct gttgttgtca ttttttgatt 10561 tgtaatataa gctaattctc tatggcttct tgtgataaac cactccaggg 
gataagaaat 10621 tgaatatact ttgttctcta cataagtttt aaattttgcg agaactgaat tagcttgata 10681 cgttgtagca gtaaaagggt tccgccttgt gcttatctgt tgatcagtag agggcgagtg 10741 ccctgtcatc atctgagaat atggctcatc aagatgcggc ttgagctttg cagtagccaa 10801 agtttgatta gcgaaagcct gaacacttac aagtgttact gcgagcagga aagtgcctac 10861 ccagttaagt cttttcatga tttttgtcct caatctttgg ggttataaaa tgaataaaat 10921 gtgacaattt tgaatctttg atttgaataa tcaaagctgc tgcgtttgct aaatcctaat 10981 tggattgtca ttagcttgtg atagataaac gcatgacttt ctcagtattc ttatcaaggc 11041 attccatatt aaatggataa cacttccact tacctaccgt tttcagtcat agcttttaag 11101 gaagtgaatc ctgatgcttt tgtctagaga attaaataac ctggacaccc ctatttatct 11161 tgttagagta cccagtgaaa gtgttgacta gagtaaagtt gagcagagat ttttttgatc 11221 ctctttccgc gatgagagtc ctatagttat cagtagcaaa gtaccagcat ctgatttatt 11281 gacataagca ttttgactgt gcatcactgc ttttagcacc agtaaagcag gtgttaattt 11341 gaatattttc atcaccatat ttctataagg tgctttcacg atttgtccgt tagcgtattg 11401 attttattct tgaataggtt ggctaaaagc aagcaaatag ttttctgata caaagttatt 11461 taagtaacac ggcgatgaca agtaataccg attcaaaaaa tgtttgcgac agatacccca 11521 cccgcgctat cgcgcaccct ccccttgcca aggggagggt agggaggggt ttaaccgatg 11581 tgttgcattc ttttttcaaa ttggtataaa cgggtatcat ttgtaaatga caccctaata 11641 tcaatgcatc aatatagatt attagtttgt tgttataaaa taatcaagta taaatgataa 11701 caaaagttag gatcaattca agagcgtact aatcagatga ttctcctcca tcaggttata 11761 aaacaatacc gttgctgtta aggattgttt ttggagattt catacgtgta gaactttcgg 11821 atgcaacgta agattgcatt ggttattgtt cttgcaaatt atcttatcag caaccgttct 11881 gaaccctccg tttgggttat aaaaatcccc cgcaggctaa agccaaacgg aggaaaaacg 11941 acacattctt agtaaaaatg gaaataatgc gattaaccaa taagttcctt aactttattg 12001 atgatgcctt cagggtcgat tccttgatgc gggaactgtt cccataaacc accacaacct 12061 tctttgtgag ttcccaagtg ggcaaacttt ggagtcagtc cccgttcaag taaccaagta 12121 ccaaaacggc tacccaatcc tgtgcggcgg ttgaaagatt caacaaccaa tacaaacggt 12181 gctttaccaa ctttcgcaag cgtttgttca tcaataacgt tgagtgttgg tttattaatt 12241 aagccgacat caattccttg ctgtttgaga 
cgttccaccg catcgactgc acggtataat 12301 gcttcaccaa agctgataat ataaccagca gttccttcac gcacaacttc atcttttcca 12361 ggagtaaagg tgtagccttc gccgtataac tctttaccat ttgcatcgag aatattcgga 12421 gttttggaac gggtggagaa gataaatctt agtcccggat caaagaatac tgcttcgaca 12481 caagctttca tttgagcggc gtcagcgggg aagtatagct gtgtttcgta gccgtcatct 12541 aagccattgt cggcaaagaa attgtttatc ccgaagtgac aggtattatc cgccatgtca 12601 tctacgccag agtgagagaa gtgacacagg acgttggagt agttcagccg cgccattgtg 12661 atttctgaaa cgcacatctc caagaatgcg ctgaaggtgc caaagatacc ctgcttacct 12721 ttttccatac caaaaccagc agcagcggag aagtttcccc gttccatgat gccggaactt 12781 acaaagattt ctgggtaagt gtcgtgaatc ttcttcagtc cgcaggagcc ttcgaggtca 12841 ctatcgatac acatgacctt ttctttacgc tcggtttcac tcaagcgact gaggacggac 12901 accacagctt ccccaaacac gttgcggttg gaaccccatt tgtcgccaga accgatgaag 12961 ctataggtgt tcttgggttt ttcaatgctt ttgaggaatc taacggcttc agtgagtccg 13021 cgcttttcca agtaggcgat cgccttatcc gctggaatca catcatgacc atgagtcgaa 13081 ccttcgatac cttcaattcc cacagccatc ttgcgcttgt tcaccaaagc tactggtcca 13141 tcactcgtca ctgcttcaca gatacgacga tacaaatcat ccaagtcttc gccgtctcct 13201 tcattcacgc tcaatccatg accagctaat gtcttaccga cactaaatcc tggtaagtat 13261 tccgaaggat gtccggcgat tgtgacatca ttatcatcga taatgagctt gacattaaga 13321 ttcttggcaa ctgctaaacg tgctgcttct gcatcgttcc cttcctgctg ggaaccgtca 13381 gaaccgagac taaaaaccac cttgttggga tttgccagtg cgacaccatt gatgtaaggc 13441 caaatgtgtc ccaagcgtcc ggaactgaac ttcactccgg gtgtcaagcc gagttccggg 13501 tgtccaggta gttttgcatc agcttcgcgg taatgaacaa gttgttcagc gcttaactcg 13561 ccatgcaata cagccatgag gtattgagtc gcaacgcggt gtccagcttc atcaaagaaa 13621 attggcacga atttatcagg tgccccccgg aataaggcat caaggataac gacctctggt 13681 actgtatcat aagcaccacc agtgtgaccg cctacacccc tagcagcacc tgtggctgtg 13741 aagaaaacaa ttgcatcgcg acagagttga atattggctt tgagtgtctc ccgttgctca 13801 ttggttagag ttggattggc agggtttagt gctatttgct tgtacgcacc aaggtttata 13861 ggaaagctag ctgtagcaac agtcatggtt gatattcctg tatgtagatg aatgacacaa 13921 acagaaccgt 
cctcttcgac ggtacactgc aagggaaaat tatattgtta tggttgattg 13981 caacaaacac ttcgtgaaca ctcgcggatc taacatatgc cctatgagca agctaagcaa 14041 acactcattc aaaatttaaa aaaagtccaa tcaaataaca agttatttgt aagtataaat 14101 gcttatgaca ctttcggaac aagaaaaata aagaaaataa aaaattttca ctgatagagc 14161 caagtacgga atagggaaca gggaacaggg aacagggaac agggaacagg g // LOCUS NODE_2387_length_14198_cov_5.407481 14198 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14198) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14198) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..14198 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..178 /locus_tag="DP116_19890" CDS <1..178 /locus_tag="DP116_19890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317352.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19890" /translation="GEERGLQQGLQQGVGRQLIRVLQRRFGEIPQEVKARLKGESVEQ LESLMDSALSAALP" gene 473..1429 /locus_tag="DP116_19895" CDS 473..1429 /locus_tag="DP116_19895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868543.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LysR family transcriptional regulator" /protein_id="PRJNA477356:DP116_19895" /translation="MRLEQLQAFLAIAETGSFQGASRKRGVTQSTISRQIQGLEEDLG LELFHRTSQAKLTLAGERLLPRAQKICQEWQSATQEIADLLAGKQPELCVAAIHSLCA YYLPPVLQKFCHDYPQVQLRVTSLGSDRALKVLKDGLVDLAIVMNNRFLCTGKEMLVQ VLYDEPIEVLVAKNHPLAQYERIPWSELTRYPQVVFKDGYGMQRLIQDRFERLEATLH AALEVNTLDAFRGVVRQGELVAMLPQSALVEARIDPSLAVRPLACNTNSNGSSPDSSS LTRQVVMVTTQDRLQIPPIKHFWQLVRDNVPPQLHSFTKSAC" gene 1497..2558 /locus_tag="DP116_19900" CDS 1497..2558 /locus_tag="DP116_19900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455115.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19900" /translation="MSTLFRDLLKKIGSGEHTAQNLTRAEAATAMKMMLLEEATPAQI GAFLIAHRIKRPTPEEIAGMLDAYDELGPKLQPIACERPVIVLGIPYDGRTRTAPISP VTALLLAASGQPVIMHGGDCFPTKYGVPLVDIWQGLGVDWTGLSLEKSQQVFEKTGLG FVYTPKHFPLTNSIWEYRDQLGKRPPLATMELIWCPYAGNVHIIAGFVHPPTETLFQN TLALRGMTELTTVKGLEGSCELPRDRTAIIGITRTSELDETGNIPIERLHLSPRDYGF TTKNEPLGTTEELIVNIQEVLNGKTSNFMETALWNGGFYLWRCGICSDMREGIAKAEE LLTSGVITQKLEELSSNLC" gene 3876..8015 /locus_tag="DP116_19905" CDS 3876..8015 /locus_tag="DP116_19905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="non-ribosomal peptide synthetase" /protein_id="PRJNA477356:DP116_19905" /translation="MHDISNKTAVLSTAKKELPDNITDKQIVEAAPLSFAQQRLWFFD QLEPSSTAYHVIKGVKLQGDLNLGVLQQALDAIVAHHEALRTNFVAQEGNPVQIISQP RSVELVVIDLKDCPETERTTIVERLLQDEVQRPFNLASDLMLRAKLLQLSPQEHILLL VLHHIASDDWSTGILFEQLTTLYQAFLNGLPNPLPELPIQYADFAQWQRQWLSGEVLE NQLNYWKQNLAGASPVLELPTDKPRPPVHTYQGGKQNFVIPQSLSASLSALSRQEGVT LFMTLLAAFQTLLYRYTGQEDILVGSPIAGRNLPEIERLIGFFANTLVLRTDISGNPS FQELLHRVRAVALGAYAHQDLPFEKLVEELQPERSLSYHPLFQVMFVLQNTPKQTLQL PGLSLTPYDWDNVTTRFDLTLSITETEQGLQGLWEYNTDLFDAGTINRMSGHFQTLLD SVVANPQQHISELPLLTAAERHQLLYEWNDTYADYACDKCIHELFEQLVEGTPDDVVL MYEDQQLTYQQLNALANQLAHYLRTLGVGPEVLVGIFVERSLEMVVGVLGILKAGGAY VPLDPSYPKERLAFMLENSQPLVLLTQEFLFTELPEISAQVVCFDRDWQSIAQHCEEN LNQTATTANLAYVIYTSGSTGKPKGVQVTHANLCHYAQAMGQALGITAEDVYLHTASI AFSSSVRQLMVPLAAGATVKIATSEQRTDPKALFAAIQQHDVTVIDIVPSYWRNCIHT LATLEPRTRQALLDNKLRLIVSASEPLMSDIPTQWTFGFKHQARLINMFGQTETCGIV ATYPIPAQQHERVKIVPLGRPIPNTQIYLLDSHMQPVPIGIAGELHIGGLGLARGYLN RPELTEEKFIPDLFSQKEGARLYKTGDLARYLPDGSIEFIGRSDYQVKIRGFRIELGE VEAVLNQHPAVLQSVVVAREDKSGEKRLVAYVVPNQKTTVTITELRRFLKKKLPEYMV PFAFVLLEALPLTPSGKVNRSTLPAPDLVKQDLQATFVAPHDDLEKTLSQIWEEVLGI QPIGVRDDFFDLGGHSLLAVRLFAQIEKKFDKKLPLATLFQSGSVEALANILRQEEEP TAGNQVLIATHHQHTSRAPWSSLVEIQPNGSKPPFFCIHPLGGEILCYRPLALHLGSD QPVYGLQPLGLDGKHPPLTRIEDMAAHYIKEIQTIQPNGPYYIGGYSLGGIIAYEMAQ QLYSQGEKVNLLAMLDTSRPGTETRLPFVLRVFEHINNIIQEGPSYLQHKLAGWSEWG TYHIRDKYRRLLEKSELLPEGDEHLDVMGANVQALEQYTFKAYPGRMTVFRTDDKNRD DAVGVKYDPLFGWGEITSGVDVYHLPGSHLSFLDEPDVEVLAEQLKLCLEKAHAAELT N" gene complement(8118..9719) /locus_tag="DP116_19910" CDS complement(8118..9719) /locus_tag="DP116_19910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318023.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional ADP-dependent NAD(P)H-hydrate dehydratase/NAD(P)H-hydrate epimerase" /protein_id="PRJNA477356:DP116_19910" /translation="MKDRQEQIAQVIVTAQQMREIEERIFAAGMPVVALMEKVAGLIT RRVQDIYPLFCQQENKAKASSSSSSRVGILTGPGHNGGDGLVVARELYFRGYEVLIYC PFSKLKELTSQHLQYARSLGLPCYDSIQPLQDCDLLIDGLFGFGLEKTLTDPVATAIN QLNEWHKPIISIDLPSGLHTDTGEVLGTAVRATHTLCLGLWKQGLLQDQALDYVGKAE LIDFDIPLADIQAVLGNSPSIKRITKTTALSTLPLPRPPVTHKYKEGHLLLICGSRRY SGGAILTGLGARASGVGMLSIAVPESLKSLMVAQLPEALIIGCPETESGAIAQLQLPP KTDLSSFSAIACGPGLTQDASPILQEVLDSTIPLLLDADGLNILAEMRSIQTLQKRQI STVLTPHTGEFKRLFPDIPDANHDRVKATREAAAQSGAVVLLKGARTVIANSQGTVWI NPESTPALARGGSGDVLTGLIGGLLAQASSKKISVEEIVATGAWWHSQAAILAAQERT ELGVDAHTLTNYLIAVLASTSIGHL" gene complement(9841..10713) /locus_tag="DP116_19915" CDS complement(9841..10713) /locus_tag="DP116_19915" /inference="COORDINATES: protein motif:HMM:PF05036.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19915" /translation="MLSAIPNQIMAFLKLSQPSLILRLLLGGCLVFLSQSNLAYAQIN PSEVFIAQRRSVYDTLPPAPSNEPLPAVPSDSQGYTVPEVNTSSQRSIEFQAPEAPSY GSNRGYAPYKVYINDNDYGRRGTSLIPRDAFRQRFQGRSVIQVGAFRTREGARSLARR LQSNGVSSARVVDGDQVVYNEQNRRDERDVSYDYGRGDYGRGDSGRQRSSYYYVVIPA NSEELPRYRNEIRSYLGRNIYPGSDVYVIPRLEPRGPHVAVGPFVKRWQAEEWNNFLR KDSRFGNARVYYGK" gene 10843..11922 /locus_tag="DP116_19920" CDS 10843..11922 /locus_tag="DP116_19920" /EC_number="2.8.1.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455024.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA 2-thiouridine(34) synthase MnmA" /protein_id="PRJNA477356:DP116_19920" /translation="MNKVVVGLSGGVDSSTAAAILHHQGYEVIGLTLWLMKGKGQCCS EGMIDAAYICEQLGIPHHIVDIRDVFQANIIDYLVDGYSTGVTPLPCSQCNKTVKFGP MLQYARENLECASIATGHYARIQYDPATGRYQLLRAFDRNKDQSYFLYDLSQELLAGS VFPLGELQKSETRRLADEYGLKTADKPESQDLCLVESNGSMRAFLDKYLAPKKGDIVD TSGKILGQHDGVHHYTIGQRKGIGIAAAEPLYVIALDAVNNRVIVGDRTKVTEPECTV ERVNWVSIAEPSTPIRAQVQVRYRSTPVPVTVIPLENSRVRLVFDEPQFSITPGQAAV WYEGEKVLGGGIIEQFSSSDPSRQL" gene complement(12254..12712) /locus_tag="DP116_19925" CDS complement(12254..12712) /locus_tag="DP116_19925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456550.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L9" /protein_id="PRJNA477356:DP116_19925" /translation="MAKRIQLVLTQDIIKLGKSGDLVEVAPGYARNYLIPKSLATRAT PAILKQVERRREIERQRQLELKQQAQEQKAALEKVESFQIAKQVGEAEAIFGTVTTQE VAEVIQQIAGLEVDRRGITIPDISKLGTYEAEIKLYTDVTAKVSIQVVAS" gene complement(12982..13755) /gene="gloB" /locus_tag="DP116_19930" CDS complement(12982..13755) /gene="gloB" /locus_tag="DP116_19930" /EC_number="3.1.2.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868608.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydroxyacylglutathione hydrolase" /protein_id="PRJNA477356:DP116_19930" /translation="MQVIRLEALSDNYIFLLHDPRQNIAAVVDPAEAEPVLQKLKEFK AELVAIFNTHHHQDHIGGNRQLMQQFPNVIVYGGAEDRGRILGQQVFLQQGSRVEFAH RTAEVIFVPGHTRAHIAYYFPPENPSQTGELFCGDTLFAGGCGRLFEGTPTQMVHSLS KFRSLPDSTRVWCAHEYTLKNLQFALTVDADNTDLQTRYDQVKTSRSHKEATVPSLLG VEKLTNPFLRWDQPALQLAANSRDSVQTFARLRGMKDKF" BASE COUNT 4107 a 3056 c 3133 g 3902 t ORIGIN 1 aggtgaagaa cgtggacttc agcaaggact tcagcagggt gtaggacgac agttaatccg 61 agtcttgcaa cgacgttttg gcgaaattcc tcaagaagta aaagcaaggc ttaagggcga 121 gagtgtggaa caattggaaa gtttgatgga tagtgcgtta agcgcagctc tgccgtaggc 181 aatcgctgta agttctttag aagaatttct gacaattcta tctacttaat ggaataccaa 241 aaaataaatt atccgagatt cgtagggtgc gttacttcgt taacgtacct tactagctta 301 cttaagcccg aacttctttt taaacgggac agggatgaga atagaaacct ttgagtaaaa 361 aagttgtcaa aaaaattctc agcaaacata agctgtaaca agcagtacta atttgcgccc 421 gaaagtacga taacttaaat attgccataa gtgtacttca ggctacatca acatgcgact 481 agagcagttg caagcctttc tggcgatcgc agaaactggt agtttccaag gtgcatcacg 541 aaaacgcggt gtcacccaat cgactattag ccggcaaatt caaggattag aagaagattt 601 ggggttggaa ctctttcaca ggacaagtca agcaaagctg acactggcgg gtgaacgctt 661 gctacctcgt gctcaaaaaa tatgccaaga gtggcaaagt gctacacagg aaatcgctga 721 tttattagca ggaaagcagc cagaactttg tgtcgcagcc attcactcac tttgtgctta 781 ttacctacca ccagttttac aaaaattttg tcatgattat ccacaagtcc aattgcgggt 841 gacatcgctg ggtagcgatc gcgccttaaa agtcctcaaa gatggactcg tggatttggc 901 aatcgtcatg aataatcgct tcttatgcac tggcaaagaa atgctagtac aagtgctgta 961 tgatgaaccg atagaagttt tagtcgcaaa aaatcacccg ttagcgcaat atgaacgcat 1021 cccttggtcc gaactcactc gttatccgca agtcgtgttt aaggatggtt atgggatgca 1081 acgcctaata caagatagat ttgaacgctt agaagccaca ctgcatgcgg ctttagaagt 1141 gaatactcta gatgctttcc gaggagttgt gcgccaaggg gaactcgtgg ctatgctacc 1201 acagtcggca ctagttgaag cacgtattga cccaagcctt gcagttcgtc ccctagcctg 1261 caacactaac agtaatggtt cttcacctga tagttctagt ttgactcggc aggtggttat 1321 ggtgacgact 
caagaccgtc ttcaaattcc tcctatcaag catttttggc agttggttag 1381 ggacaatgta ccaccacaat tgcactcttt tacaaagtcg gcttgttagc agttataagt 1441 catgagtcat ctgtcgtttg tcatttgact tttgacaaac gacaaaggac aaaaatatga 1501 gcacattatt cagagattta ctaaaaaaga taggtagtgg agagcatact gcacaaaact 1561 taactcgtgc tgaagccgcc accgcaatga agatgatgct gctggaagaa gcgacaccag 1621 cacaaattgg cgcattttta attgctcatc gcatcaaacg tcccacgcct gaagaaatcg 1681 ctggaatgtt agatgcttat gatgaactag gaccaaaact gcaaccaatc gcctgcgaac 1741 gaccagtcat agttttaggc ataccttatg atggcagaac tcgcaccgca cctattagcc 1801 cagtaacggc tttgctttta gcagcaagcg gacagccagt gatcatgcac ggtggagatt 1861 gtttccccac aaagtacgga gtgccactgg tagatatttg gcaggggtta ggagttgatt 1921 ggactggact atcactagag aaaagccagc aagtgtttga gaaaactgga cttggctttg 1981 tttatactcc taagcacttt ccattaacaa atagtatctg ggagtaccgt gatcagcttg 2041 gcaagcgtcc acctttagcg acaatggaac tcatttggtg tccttatgca ggaaatgttc 2101 atattattgc tgggtttgtc catcctccca cagaaacatt gtttcaaaac accctagcat 2161 tgcgaggaat gactgaatta acaacagtaa aaggattgga gggaagctgc gagttaccac 2221 gcgatcgcac tgctatcatt ggcataacca gaacttcaga actcgatgag acaggcaata 2281 taccaataga acgtttacac ctgtcccctc gtgattacgg ctttacgaca aagaacgaac 2341 ctctgggtac cactgaagaa ttgattgtca atatacaaga ggtattgaat ggtaaaacct 2401 caaactttat ggaaacagct ttgtggaatg gaggatttta cctttggcgt tgtgggattt 2461 gttcagatat gcgtgagggt attgccaagg cagaggaatt gttgaccagt ggtgtgatca 2521 ctcagaagct ggaagaactt tcctcaaacc tttgctagta gtactgccat actaccagat 2581 accgttcttg tagttgtcag ccttgaactc gatagcttga ctgctagtta atctggattt 2641 gaatttcgtc taataacgtc tgtgtaaatt tactgtgaac tttaaattta gaattagcca 2701 gaagaaatta tgatcaatta ttattgttag tgtatatcgt tatatgtttt taaagaaacg 2761 ataaagatta tatgaaaaat tctagaaaac aatgctaacc tgattgttga atttgtaaaa 2821 ttatgtatta agattagtgg ctaattcctg ccaaaaagcg tattaaaaca aaacgaaaac 2881 catgctttga ggtagggcta tttaattcta tgcataacca cctgtaaatt tttttttgtc 2941 taatacccga agtccgttaa agccgactta tactcagaat aagagtcagt gtgaacaaac 3001 ttaagctatt agtgacgaac 
tgaaattcct ggttaattaa caaagagtga aaccacccta 3061 tgttttgatg caatttaaaa tgtaatggag attttgtagt tataccaacg tatgttttga 3121 taactagcaa cttgggagat tctgttagat agaaatctac caaaatgctg atttgtaaga 3181 tgatgattta ttgtattgta atagatggga gaaagcttaa gattgtaagc attaacctct 3241 agccagtaaa tgtacattga aatatcattc aaatcactaa cctatcaaaa actgtgttga 3301 aataaagcca gatgtcctga taatcaacca aacatagggg gtgatcccaa ttttttaaac 3361 tgcattttgc cttaaattga acatttgtga tgaactaaaa ctgtagtgca gaaaatatct 3421 attaccaagt tattatttaa cgtttatttt gcaattttga gaggcaaaat taagcatatt 3481 ttccaatgtt atttatgaat tgtattcaga gaaatagctt gtttgtgcgt aaattgaccg 3541 aaaaaagctt gatcaaatcc aaaaaagtca atggatgtca aaataaaaat aggtcatgca 3601 tctttttgaa taactaaaaa aagtctcctc gattgccgtt aaaaggaaga cacgcaaaaa 3661 gaaaatgcgt ttaatatata catagacaaa atttctaaaa taatatgtgt catcttccaa 3721 cagggagaag gacagaaaat gattgcacta taactggtac aaggcacaaa ttttctaact 3781 gatatcttag atagatttca gacaggaaaa tgcgacacac gggtggctac gttaatgtct 3841 attgactaag taatagcaac gaaggatgac gaaatatgca cgatatcagc aacaagactg 3901 ccgttctctc aacagccaaa aaagagctac cagacaatat cacagataag caaattgtgg 3961 aagcagctcc cttatcattt gcccaacaaa gattgtggtt ttttgatcag ttagagccaa 4021 gcagcacagc atatcacgtt attaagggtg tgaagttgca aggcgacctc aacttggggg 4081 tcttgcaaca ggctttggat gcgattgtcg cccaccacga agcactgcga accaactttg 4141 tggcacagga gggcaacccc gtgcagatca ttagtcaacc ccgctcagta gaattagtgg 4201 tgattgacct taaggattgt ccggaaactg aacgcacaac cattgtagaa cggctgctac 4261 aagatgaggt gcagcgcccc ttcaacttag catcagactt gatgctacgt gctaaattgc 4321 ttcagctatc cccacaagag catatcctgc tgttggtgtt acaccatatc gcttctgatg 4381 actggtcaac aggcatttta tttgaacagt tgacaactct ctatcaagca ttcttgaacg 4441 gattgccgaa tcctttacca gaactgccca tccagtatgc tgactttgcc caatggcaac 4501 gccagtggct ctctggtgag gtgctagaaa accaactcaa ctattggaaa cagaacctgg 4561 caggtgccag ccctgtactg gaattaccta cggataaacc ccgaccacca gtccatactt 4621 accaaggtgg aaagcaaaat ttcgttatac cccagagttt atctgcgtcg ctgtctgcac 4681 tgtcacggca agagggtgtg acactgttca 
tgacattatt agcggcgttc cagactctac 4741 tgtaccgtta tactggacaa gaggacattc tggtcggttc tcccatcgcg ggacgaaatt 4801 tgccagagat cgaacggcta attggctttt ttgccaatac cttggtacta cgcaccgata 4861 tatcgggcaa ccccagtttc caagaactat tgcatagggt aagagcagtc gctttaggag 4921 cttatgccca ccaagacttg ccgtttgaaa agctggtaga agaactgcaa ccagagcgat 4981 cgctctcata tcatccccta ttccaagtga tgtttgtctt gcaaaataca ccaaaacaaa 5041 cattgcagtt gccaggactg agtctaactc cctacgattg ggataacgtc accaccaggt 5101 ttgatttaac actgtcaatc acggaaacag aacaaggact gcaggggttg tgggaataca 5161 acactgactt gtttgatgct ggcactataa accggatgag tgggcacttc cagacattgc 5221 tggactctgt tgttgctaat ccacagcagc acattagcga attgccactg ctaacagcag 5281 ctgaacgtca ccaattactg tatgagtgga acgatactta tgccgactac gcctgcgata 5341 agtgtatcca tgagttgttt gaacaactgg tagagggcac tccggacgat gtggtgctga 5401 tgtatgaaga ccagcaactc acctaccagc agttgaacgc ccttgctaat caattggcgc 5461 actacttgag aactttggga gtaggtccag aggtactggt tggtatcttt gttgaacgct 5521 ccctagaaat ggtcgtggga gtgttgggaa ttctcaaagc gggtggagct tatgtacctt 5581 tagacccatc gtaccccaaa gagcgcttgg cgttcatgtt ggaaaactct caacccttgg 5641 tactattgac tcaggagttt ctgttcacag aacttcccga aataagcgcg caagtcgttt 5701 gctttgatag agattggcaa tcaattgctc aacactgcga agaaaactta aaccaaacag 5761 caacaactgc caacttagct tatgtgattt atacttctgg ctcaaccgga aagcctaaag 5821 gagttcaagt tacacacgct aatttgtgtc actacgcgca agcaatggga caagcgctgg 5881 gtatcacagc agaagatgtg tatctgcata cagcatcgat cgctttctct tcttccgtta 5941 ggcagttaat ggtacctctg gcggctggcg ctactgtcaa aattgcgact tccgaacaga 6001 gaacagatcc aaaagcgctg ttcgcagcaa ttcaacaaca cgatgtcacg gtaattgata 6061 tcgtcccctc ttactggcgc aactgcattc atacactggc aactttagaa ccaagaacaa 6121 gacaagcttt attagacaac aaattgcgct taattgtttc tgcaagtgaa ccactgatgt 6181 ctgatattcc cactcagtgg acgtttggct ttaagcatca ggcacgatta attaatatgt 6241 ttggtcaaac agaaacttgt gggattgttg caacgtatcc gattcctgcc caacagcacg 6301 agcgggtaaa aatcgtccct ctcggtcgcc cgattcccaa cacgcaaatt tatctgctcg 6361 actctcatat gcaaccagtc cccattggta tagctgggga 
actacacatt ggtggtttgg 6421 gactggcgcg aggctacctc aaccgaccag aattgacaga agaaaaattc attcccgacc 6481 tctttagcca aaaagaaggg gcacggctgt acaaaactgg ggacttagcc cgttatctgc 6541 cagatggtag cattgagttt atcggacgca gtgattacca agtaaaaata cgcgggttcc 6601 gtattgagtt aggagaggta gaggctgtct tgaaccaaca tccggcggta ctccagtctg 6661 tagtcgttgc tcgtgaagac aagagtggcg aaaaacgttt ggtagcctat gttgtcccaa 6721 atcaaaagac aaccgttaca attactgagt tgcgtcgttt cctaaagaaa aagctgcctg 6781 agtacatggt accattcgct ttcgtcttgt tggaagccct gcccctaact cctagcggca 6841 aagtcaaccg cagcaccctt ccagcacctg atttagtaaa gcaagatctc caagcaacct 6901 ttgttgctcc ccatgatgat ttagaaaaaa cgctttcgca gatttgggaa gaagttttag 6961 gcatccaacc cattggcgtg agggatgact tctttgatct gggagggcat tccttactag 7021 ctgtgcgctt atttgcacaa atagagaaaa aattcgacaa aaaacttccc ctagccaccc 7081 ttttccaatc aggttcagtg gaagctcttg ccaatatact ccgtcaagaa gaagagccta 7141 cagctggtaa tcaggtgtta atagcgacac atcaccagca cacatcacga gctccttggt 7201 catccttggt agaaattcaa cccaacggtt ccaagccacc tttcttctgc atccaccccc 7261 ttggtggaga aatcctgtgt taccgtcctt tggcgttgca tttgggatcg gatcaaccag 7321 tttatgggct acaaccacta gggctagatg gaaaacaccc tcctttaacc cggattgaag 7381 atatggcagc ccactatatt aaagaaatcc aaactattca acccaatggt ccttactata 7441 taggaggtta ctccttgggt ggtataattg cctacgagat ggcacagcaa ctctactctc 7501 aaggtgaaaa agtgaatctt cttgctatgc ttgatacctc tcgtccgggt actgagacgc 7561 gattgccctt cgtgctaaga gtttttgagc acataaataa tatcatacaa gaaggaccta 7621 gctaccttca acataagctt gcgggttgga gtgagtgggg gacgtatcat atccgagaca 7681 aataccggcg tttgttggaa aagtcggagc ttttacctga gggcgacgaa catttagatg 7741 ttatgggtgc taatgtccaa gctcttgagc agtatacttt caaagcgtat cctggtcgaa 7801 tgactgtgtt tcggactgac gataaaaatc gggacgatgc tgtcggtgta aagtatgatc 7861 cgttatttgg ttggggcgaa ataaccagtg gagtagatgt ttatcatctt cctggctctc 7921 acctttcctt ccttgatgaa cccgatgtag aagtattggc agagcaatta aagctttgtt 7981 tagaaaaggc gcatgctgcg gagttaacca attaatatca acagtccgct taattactta 8041 taatttgtgc gttacctcac cccagccctc tccttaataa ggagagggtg 
ccgcagacgg 8101 gtgaggttct tcgtttttta taagtgtcca atcgaagtag atgccaaaac agcaatgaga 8161 taatttgtca gagtatgggc atctactccc aactctgtcc gctcttgtgc tgctaaaatt 8221 gctgcttgag aatgccacca agcaccagtt gcaacaatct cctctacaga aattttttta 8281 gaagacgctt gcgctaataa tccgccaatc agcccagtta atacatcccc actaccacca 8341 cgcgctaaag ctggcgtact ttcgggattg atccaaacgg ttccttggga attggcaata 8401 acagttcttg cccctttcaa taacaccact gcaccacttt gcgccgccgc ttcccgcgtt 8461 gcttttaccc tgtcgtgatt agcatcagga atgtcaggaa acaatcgctt gaattcacca 8521 gtgtgtggtg taagtacggt agatatttgt cgtttttgta acgtctggat cgatctcatt 8581 tcagccaaaa tattcaaacc gtcagcatct agaagtaagg ggatagtgct atctaatact 8641 tcttgtaaaa tcgggctggc atcttgggtt aagccgggac cacaggcgat cgcactaaat 8701 gaactcaaat ctgtttttgg tgggagttgt agttgggcga tcgccccaga ctccgtttcc 8761 ggacaaccaa taatcaaagc ttcaggtaac tgtgctacca ttagagactt gagagattct 8821 ggcacagcaa tagagagcat accgacacca cttgctcgtg ctcccaaccc agttaatatt 8881 gctcctcccg aataccgacg cgaaccacaa atcaacagca aatgtccttc tttatattta 8941 tgagtaactg gcggacgggg taaaggcaga gtcgagaggg cagttgtttt tgtgatgcgt 9001 ttaatacttg gtgaattgcc cagcacagct tgtatatcag ccaagggaat gtcaaaatct 9061 attaactcgg ctttaccaac ataatcaaga gcttggtctt gcaataaacc ctgcttccac 9121 aaacctaagc acaacgtgtg tgtagcacgt accgcagttc ccaacacctc accagtatcc 9181 gtgtgcagcc cagaaggtaa atctatacta ataatcggtt tgtgccattc attgagctga 9241 ttgatagcag tagcgactgg atctgtaagc gttttttcta aaccaaatcc aaataaccca 9301 tcaattaata aatcacaatc ttgcagtggt tgaatcgagt cataacacgg taaacccaaa 9361 ctcctagcgt actgcaagtg ctgcgaagtt aattccttaa gtttagagaa agggcaatat 9421 atcaagactt catagccacg aaagtataac tctcgggcaa caaccaaacc atcaccacca 9481 ttatgaccag gacctgtgag aattcctaca cgggatgagg aagatgagga agctttcgcc 9541 ttgttctctt gttggcaaaa aaggggatag atatcctgaa cacgacgagt aataagtcct 9601 gctactttct ccatcaaagc taccacaggc attcccgctg caaaaatacg ctcttcgatc 9661 tcgcgcattt gctgtgcggt gacgatgact tgtgcaattt gttcttgcct gtctttcatg 9721 agttctgagt ccgaaaaaag tcaacaatta ggggtgtaag ggtgtgtagg ggcgctgcgg 
9781 ccttgcgccc gtacgggggg taaaggggta agaaataaaa gaataaggcg tgggtataag 9841 ttactttccg taataaactc tagcattacc aaatcttgaa tccttccgca gaaagttatt 9901 ccattcttct gcctgccaac gcttgacaaa aggtccaact gccacatgtg gtcctcgcgg 9961 ttcaagtctg gggatgacgt atacatctga tcctgggtag atatttcgtc ctaggtagct 10021 tctgatctcg ttcctgtagc ggggtaactc ttcggagttt gcaggaataa cgacatagta 10081 ataactagac ctctgtctac cagaatcgcc tctaccataa tcgcctctac cataatcata 10141 ggaaacgtcg cgttcatccc gtctgttctg ttcgttatag acaacttgat cgccatcaac 10201 gactcttgcg gaagatacgc cgttagattg cagtcgtctg gctaggcttc tagctccttc 10261 tcttgtccta aaagctccta cttgaatcac agaacgtcct tgaaaccgct gtctaaaagc 10321 atctcttgga atcagacttg ttcctcgccg tccataatca ttgtcattaa tatacacctt 10381 gtagggagca taaccacggt tagatccata agagggtgct tcaggtgctt ggaattcaat 10441 agaacgctgg gacgatgtat tcacctcagg aactgtataa ccttgtgaat ctgacggtac 10501 tgctggtagc ggttcgtttg aaggtgcagg cggaagagta tcatacactg atcgtcgctg 10561 tgctataaaa acttcgctag gattgatttg ggcatatgct aggttagatt gagataaaaa 10621 aactaagcat cctcctaaaa gcaaacgcaa aatcaaagat ggttggctca atttaagaaa 10681 tgccataatc tgattcggta ttgcactgag catattatta acagcaaatt agtagcaaaa 10741 ttttaaatga ttttcttgac taaaaaataa tcgagtgatt caatccaaaa tctaaaacat 10801 cttgatttat gtagtaattt tatacaataa gcgaagtctt ttatgaacaa agttgtagtt 10861 ggtctttctg gtggcgttga tagttccact gcagcagcca ttttgcacca tcaaggttat 10921 gaagtgattg gtttaaccct ttggctgatg aaagggaagg gtcagtgttg ctctgagggt 10981 atgattgacg cggcttatat ctgtgaacag ctaggtattc cccatcatat tgttgatatt 11041 cgggacgtct ttcaggcaaa tatcattgat tacttggtgg atggttacag tactggggtc 11101 actcctttgc cttgctcaca atgtaacaaa actgtaaaat ttggtcccat gttgcagtac 11161 gcgcgcgaaa atttggaatg cgcgagcatt gccactggtc attatgctcg cattcaatat 11221 gatccagcta ctggacgcta ccaattgctg cgtgcttttg accgcaacaa agaccagtcc 11281 tactttctct atgatttgtc gcaagagtta cttgcaggaa gcgtatttcc actgggagaa 11341 ttacaaaaaa gcgaaactcg tcgtcttgct gatgaatatg ggttgaaaac agcagataag 11401 ccagaaagcc aagacttgtg cttggtggaa agcaacggct 
caatgcgagc gtttcttgat 11461 aagtatctcg cgccgaaaaa aggagatatt gttgatactt caggtaagat actgggtcaa 11521 cacgatggtg tccatcatta cacgattggg caacgtaagg gtataggtat cgccgctgcg 11581 gaaccactgt atgtgattgc gttggatgca gtgaataaca gggtgattgt gggcgatcgc 11641 accaaagtca ccgaaccaga atgtactgta gaaagggtga attgggtctc cattgctgaa 11701 ccatcaactc ctattcgggc gcaggtgcaa gttcgctatc gttcgactcc tgtaccagtc 11761 acagttattc ccttagaaaa ttctcgtgtt cgcctagtgt ttgatgaacc tcagttcagc 11821 attactcccg gacaagctgc tgtgtggtat gaaggggaga aggtgttagg tggtgggata 11881 attgagcaat ttagttctag tgatccgtca agacaacttt gaggggttaa agaatgaacc 11941 gcagaggcgc agagaagagc cagcgcgttg cgggggttcc ccccgttgta gcgactggcg 12001 tcgcagagga agagaaaaga ggatgagtat aaaatgggtt tacccgtttt gatctcttga 12061 aagtacgaat gtaagtaagt aggcacgatt aaaccgaact atgttaactt atgtaaagcg 12121 ccaaaaacct tttaaaaata agcttttaag cgatttacat ttcttaattt agttgtgttt 12181 tttagcgccc acctacttat caagctgagc ttgatacatc ctcaagagtt ttcaacgggc 12241 attattataa gttttagcta gctacaactt gaatgctgac ttttgctgtc acgtcagtat 12301 acagcttaat ttcagcctca taagttccta acttgctaat atcgggtatg gtgataccac 12361 gccgatccac ttccagtcct gcgatttgtt gaataacttc tgccacttct tgggtggtga 12421 cagtaccgaa aatagcttcg gcttcaccaa cctgcttggc aatttggaag ctttcaactt 12481 tttccaaagc tgctttttgc tcttgagctt gttgcttcaa ttctaattgc cgctggcgtt 12541 ctatttcgcg acgacgttcg acttgcttaa gaatagcagg agtggcacga gttgccaaac 12601 ttttgggaat taagtagtta cgagcatagc caggagcaac ttccactaag tcgccagatt 12661 ttcctagctt gatgatatct tgagttaaaa ctaattgtat gcgtttcgcc atcgttgttc 12721 tttttctttg ctttctctaa aaacttaatt gcttgggctt caacaggaca gttgatatgg 12781 atgcagctat atcccaagca gcttcgcgta tagcccgaaa cctacaaatc atagcgaaat 12841 ggaaggtgcg atcgcaactc tcagagcaag aaaaataaat cttctttgca ccgagtcatt 12901 tattcctagt cgagtactgc ttcctaatga ctaatgacta atgactgcca aaacatcgtt 12961 aactttattt taacgaccta cttaaaattt atctttcatt cctcgcaacc gcgcaaaggt 13021 ttgcactgag tcgcgactgt tagcagctaa ttgtaatgct ggctggtccc aacgtaaaaa 13081 gggatttgtg agcttttcca 
ctcctaaaag cgagggaaca gttgcttctt tgtgactgcg 13141 ggaggttttc acctgatcat aacgggtttg taagtcggtg ttatctgcat ctacggtgag 13201 ggcgaattgc agatttttca aagtatattc gtgggcacac cagacgcgag tagaatcagg 13261 taacgagcgg aatttgctca gggaatgaac catttgcgta ggtgttcctt caaataagcg 13321 accacaacca ccagcaaaca gggtatcgcc gcagaataac tcacccgttt gacttgggtt 13381 ttctggagga aaataatagg caatatgagc acgggtatgt ccgggaacga agataacctc 13441 agctgttctg tgtgcaaatt caacgcgcga gccttgctgt aaaaacacct gctgtcccag 13501 tattctccct ctatcctcag ccccaccata aactatcaca ttagggaatt gttgcatcag 13561 ttggcggttt cctcctatat ggtcttgatg atggtgcgtg ttaaaaatcg ccaccaactc 13621 agctttgaat tctttgagtt tctgcaatac tggttccgcc tcggctggat cgacgacagc 13681 agcaatattt tgccttggat cgtgcagcag aaaaatgtaa ttatctgaaa gtgcttcaag 13741 acgaatgact tgcattgatt ctatctccca gagaatttgg tgtttagcat ttctcaacaa 13801 tgctatcgaa aagagagcga ctggactacc tacgcaagct atatgttgta atggacaaag 13861 gaaattgtgg gctgagcaaa tgcagatgcg tcaattcttt tggagattaa tcttatgatg 13921 gtgcgagaag ctaggttaga ggatgtgtga gcgattgcct acggcagagc tacgcttaac 13981 gccagggttc atgtaaatac ttggcgaacg agacattcag ggtaactcat gccatattca 14041 ataaatataa tgtacaaggc gtcttttgac tgaaaacatg ttgtcatgca gccaaaaatc 14101 taatgctagg atttattttg aaaagacatt catgaatgaa catcttgatt gtcagtctat 14161 agcaatccta aatcatttgt aaaattctct cttctctc // LOCUS NODE_2401_length_14121_cov_4.651571 14121 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14121) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14121) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..14121
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            <1..1000
                     /locus_tag="DP116_19935"
     CDS             <1..1000
                     /locus_tag="DP116_19935"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017315572.1"
                     /note="Derived by automated
computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="carbon dioxide transporter" /protein_id="PRJNA477356:DP116_19935" /translation="IYVAEHQFLVFFQFWKYFNGDFSFKKLLRHWWHDRINFEYAEYC MKSMMWHGGGGLDQYLDSKEFEERAQAVIAAKFKNNPIIKGVNQLFPDFLTEQLRVSS YSTGLGQFWRVMADIFLTLSDRYDQGEIKCIPDVVEHVKAGLVADANKPITYAVKIRE KVYDIIPKSTGLTFLADTAIPYVEAVFFRGTAFHGTVSYNAQAYQIPPDQTRFQYGAL YADPLPIGGAGIPPTLLMQDMRHYLPDYLHEVYKRSRRGEDDLRVQICMSFQKSMFCV TTATILGLMPHPVDTQNPDEQKMNRVFMQKWLDRLKTSRLEEVNDQSNFCLVSYPQ" gene complement(1035..2546) /locus_tag="DP116_19940" CDS complement(1035..2546) /locus_tag="DP116_19940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="arginine decarboxylase" /protein_id="PRJNA477356:DP116_19940" /translation="MLNQNQIPLLDSLKACAERPHAPFYTPGHKRGQGISKSLTDVFG KAVFRADLPELAELDNLFAPSGVIQQAQQLAAEAFGASQTWFLVNGSTCGIEAAILAT CGTGDKIILPRNVHSSAIASLILSGAIPIFIHPEYDSVLDIAHSITPTAVQAALEQHL DAKAVLMVYPTYYGVCGDVRAIASLAHQHNIPLIVDEAHGAHFAFHPQLPTPALAAGA DITVQSIHKVLGALTQASMLHVQGNRIDIDRVSKALQLVQSTSPSYLLLASLDAARQQ MALYGQELMSRTLELAQEARTRISQIPGLSVLENPPTSLIKGGKGGNGESPGFVALDK TRLTVTVSRLGLTGFEAEEILNDKLGVTAEFSSLLHLTFIISFGNTQKDIEQLVQAFS TLSKEYRKTPSPAYSLLMSEGKDLFSITRNSIHLSPREAFFAPTETSPFKETSKRICA EIICPYPPGIPILMPGEIISPCALEYLQHIQELGGFISGCADTSLRTLKVVKA" gene 2816..3049 /locus_tag="DP116_19945" CDS 2816..3049 /locus_tag="DP116_19945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875776.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19945" /translation="MIYPNPGQLPEPQLPPPLPNPEPSPEPRIPQPIPEPVPEPVPNP APQPVPGPVPEPVPAPIPQPVPAPIPQPVPAPI" gene 3116..4207 /locus_tag="DP116_19950" CDS 3116..4207 /locus_tag="DP116_19950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196550.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="redox-regulated ATPase YchF" /protein_id="PRJNA477356:DP116_19950" /translation="MLRAGIVGLPNVGKSTLFNAVVANAKAEAANFPFCTIEPNVGVV SVPDERLNVLSKISSSKQTVPARVEFVDIAGLVKGASQGEGLGNQFLSHIREVDAIVH VVRCFENDDIIHVAGSVDPARDIEIINLELGLADLAQIERRIERTRKQARTSKEGQIE LALLEKLAAALNEGKSVRQVSLTEEEAEIIKPLELLTNKPIIYGANVSEDELATGNEY VEKVREIASTENAQVVVVSAQVESELVELPEEERSEFLASLGVEEGGLKSLIRATYTL LGLRTYFTTGEKETRAWTITAGMSAPQAAGVIHSDFERGFIRAETVAYKDLATAGSMN AAKEKGQVRSEGKDYVVQEGDVMLFRFNV" gene 4334..5272 /locus_tag="DP116_19955" CDS 4334..5272 /locus_tag="DP116_19955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876153.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="isopenicillin N synthase family oxygenase" /protein_id="PRJNA477356:DP116_19955" /translation="MLEIPVIDLSSFTTGKATARQTVVQQIYQACHEIGFMYLKNTNI SHNLINQVLKQSKDFFDLPLAEKQQLAWTNEFSNQGYVGFERERLNPNNPGDLKEAFN IGKQKAIDIDVTDRLSPVFTASSSPAKNPHILNFYQACTELANKVLQAIALALELPQD FFTTNHNQQNHTLRLLHYPSLSQPPKLRQVRAGEHSDYGSITLLFQDEVGGLEVRTAS GKWIAAAPIPDTIVVNTGDLMERWTNHVFCSTKHRVMIPNDHTLNQSRYSVAFFCHPN DNTEIVCLESCQRDKSPIYPPILAGEYLLSRLQATY" gene complement(5466..6794) /locus_tag="DP116_19960" CDS complement(5466..6794) /locus_tag="DP116_19960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-grasp enzyme" /protein_id="PRJNA477356:DP116_19960" /translation="MKEQIFVVFQNLGTLVLLAIAFPFNCIVVLTSLLLNFLKQPFGK SIVVNPNSKNILIAGARMTKTLQLARSFHAAGHRVIIIDIEKFWPSGNKYSNSVAGFY TVPDPSKDLEGYVETLHAIAKKEKIDFFIPVAIFSVIHYDHGKPPLPDDVEFFHFDAD LTKILDDKFAFAETARSFGLSVPKSFKITDPEQVINFDFSQEKRKYILKSIPYDQIRR LNLTKLPCDTQAETAAFVKSLPISEKNPWIMQEFIPGKEYCTHTTARDGESRMYCCCE SSAFQVNYENVDQPEIMQWANHFTKELGKTGQLSFDFIQVEDGTVYAIECNPRTHSAI TMFYNHPGVADAYLGKEPLAESLQPLADSKPTYWLYHEVWRLNEIRSFKQLQTWVRNI LRGKEAIFEVSDPLPFLLVHHWQIPLLILDNLRRLKGWIRIDFNMGELIE" gene complement(6890..8269) /locus_tag="DP116_19965" CDS complement(6890..8269) /locus_tag="DP116_19965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206887.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-grasp enzyme" /protein_id="PRJNA477356:DP116_19965" /translation="MRKQIFAVFQNLGTLALLAIAFPFNCTVVLASLLWNFFERRYAK QVVLNENPKNILIGGGRMTKTLQLARSFHAAGHRVILFDLDKYWFSGYRFSNSVAGFY TVPDSDEDKEGYTQAVRAIAKKENIDFFVPVGIFAASYFDSECKPVLSGYCENFHFDA DTMKMLDNKFTFAQKARSLSLSVPKTFLITDPEQVLKFDFSNEKRKYILKSIVYDSVL RLDLTKLPMESHEKMALYVKSKPISKENPWILQEFIPGTEYCTHDTVRNGELTVHCCC ESSAFQVNYEKVDHPEIKKWVSHFVKELQLTGQLCFDFIQAEDGTIYAIECNARAHSA ITMYYNHPGLADAYLSKEPPAEPLLPLSDSKPTYWLYHELWRLNEIRSLKQLQKWFKN IWRGKDAIFEVNDPLPFLMVHHWHIPLLLLDSLRSLRTWVRIDFNIGKLIQFGEDVRY KTSTSATRK" gene complement(8362..8757) /locus_tag="DP116_19970" /pseudo CDS complement(8362..8757) /locus_tag="DP116_19970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743131.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 8889..9137 /locus_tag="DP116_19975" CDS 8889..9137 /locus_tag="DP116_19975" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19975" /translation="MNDAVLWSVGASEHKAVLELISCKSLISENQTTSTSRVRPSADL NSFIIIFAKNLIDEVQPKAYRTHNPEEILNTEWRSEFL" gene complement(9134..11050) /locus_tag="DP116_19980" CDS complement(9134..11050) /locus_tag="DP116_19980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859195.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-layer protein" /protein_id="PRJNA477356:DP116_19980" /translation="MRRFLLASASGASFLCLIAKPLSAQVIPNLDIANNTSKINYAAE QLPQNSSSLTESIGSINDIQKGSQTQGQLIVSTDSKQQDLPTLNQKQKSNNLGLTAKF PNQVPASNVIAQVTSVSQLSDVQPTDWAFQALQSLVERYGCIAGYPNGTYRGNRAMTR YEFAAGLNACLNRVNELVATATADLVKREDLATLQRLQQEFGAELATLRGRVDVVEAR TAELQANQFSTTTKLSGEAIFAAIGATGGAPGRNDPNIILTNRVRLNLNTSFTGKDLL ITGLQAYNFLGGSDGRGSLQESLGLAPSGFSASNARVSFEPQFPGLDVKTLSSTGSND IELYKLVYIFPIAKKLTLFAGTAVESSDAFPAITPFAGEGQEAVSRFAGLNPVVRLSG GTSGTGLASAAGFIFDISKQVDLRAFYASANANIPTSAADIQPGVSRTPLGAGVFGGS SIVATQLTFKPSSSLDIAFNYAHSYHEINILGTGLTSSDIGALAGVSLGTPVELNSFG GTVTWRLSPKVALSGYGAAMFVDDSSGRVDASTTFTSWMAGLHFNDLFKPGNNAGIIF GQPLYRTSADGDARLTPDGARRAVPYHLEAYYRFKVSDNISITPGAFVLFNPEGNSRN DTTTVGVLRTTFTF" gene 11710..12624 /locus_tag="DP116_19985" CDS 11710..12624 /locus_tag="DP116_19985" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19985" /translation="MKQYLRFAKRFGLLFTPLVATSVLFTSPSQAATFAFSDGELSLT DFNGILSTEFNSDNNGETKGITLGKNDFVNLQNIPTVETSNSPPKAFTDVVSSVSGEG RNYTGFVKSDSEIVGNFDIGAGKTFSFNFSSFLNLGTQIDASPVENAQAKGDIAFYLY DTSNIPEQTLPDLITGLLDNPNSIKKTPLSFFSLAGNINTLGNDFLINKNSSDITLSE SYKEVDLEGNEEKTLAEFRGYFQRYFEKQANITLIATRRSQARVTAPEPSTSLALVLF FGLLAIANKGRFRTKILNNSSGIKMVKL" gene 12678..13559 /locus_tag="DP116_19990" CDS 12678..13559 /locus_tag="DP116_19990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19990" /translation="MATPNPSTSAPSASPSQTATSALSYGDFFLTNLSQSFATINSDN QADTSATADGGTATVYNNAVVETDDTKVLTFATSSASGENRDFFALAETHAIIVGNLF IDAGKTLSFNFTSTLDLETLKNASQQIDNASAIREVSFFLYDTSDIPKEKLPDYLANL LSDPDSIEKKPLVFFSLSGNLNTLSNDNYLTSKKSENVTISSELKDSNFAGTQKFASI FVRGSVKRSFNNQANITLVALRRGQAKVTAPEPSPTLTLRSTKPKLLGVATQGRHQGA TSNRFSDRKTMKVTVGK" gene 13662..>14121 /locus_tag="DP116_19995" CDS 13662..>14121 /locus_tag="DP116_19995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878481.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_19995" /translation="MKTYDWIVVGAGITGAALAYELVKKDCKVLLLEQYATPNNATRY SYGGLSFWCGVTPLTRQLCDEGIARHRILSEELDADTEFRELDLLLTISADTDPASVA ASYTRFAIPPKLLSLKEACELEPLLNQEAIAAALTHILHLEIWMSKPKLRA" BASE COUNT 4221 a 2867 c 2924 g 4109 t ORIGIN 1 catttatgtt gctgaacatc aatttttagt atttttccaa ttttggaaat actttaatgg 61 agacttttct tttaaaaaac tactacggca ttggtggcat gacagaatta attttgaata 121 tgccgaatat tgcatgaaga gtatgatgtg gcatggcggc ggtggattgg atcaatactt 181 agattcaaaa gaatttgaag aaagagcgca agctgttatc gcagccaaat ttaaaaacaa 241 tcccatcatc aagggagtta atcaactctt cccagatttt ttaacagaac aattgcgtgt 301 cagttcttac tctactggtt taggtcaatt ttggcgtgtg atggctgata ttttcctcac 361 tttatcagac cgttatgatc aaggtgaaat caaatgtatt cctgatgttg tagaacacgt 421 caaagcaggt ttggtagcag atgccaataa gccgattacc tacgccgtta aaattcggga 481 aaaagtttac gacattattc ctaaatctac tggtttaaca tttcttgcag atacagcaat 541 accttatgta gaagcagttt tcttccgagg aacagctttt catggcacag tttcatataa 601 tgcccaagct tatcaaattc ctcctgatca aacccgattt caatatggtg ctttgtatgc 661 ggatccttta cccattggcg gtgcaggtat tcctcctacc ttgttgatgc aagatatgcg 721 tcattatctt ccagattatt tgcatgaagt ttataagcgt agtcgtcggg gagaagatga 781 tttgcgagtt caaatttgta tgagttttca aaaatcgatg ttttgtgtga ctacagcgac 841 gattttagga ctcatgcctc atccagttga tactcaaaat ccagatgagc agaaaatgaa 901 tcgagtcttt atgcaaaagt ggcttgatag gttaaaaact tcgcgtttgg aagaggtgaa 961 cgaccagtct aacttttgtt tggtatcata tccacaataa ttaccacaag aaatgtcatc 1021 tgatacgaca tgattcatgc tttcacaacc tttagggttc ttaagcttgt atcagcacaa 1081 ccgctgataa atccacccaa ctcctgaatg tgttgtaaat actctaaagc acatggggaa 1141 ataatttctc caggcatcaa gataggaatt cctggtggat atggacaaat gatttccgca 1201 caaatgcgtt ttgaggtttc tttgaatggc gaggtttctg taggagcaaa aaaagcttca 1261 cgaggcgaaa gatgtattga atttcttgtt atgctaaaca aatcctttcc ttcgctcatt 1321 aatagggagt aggcgggaga tggagttttt cgatactctt tactcagggt actaaaagct 1381 tgcacaagtt gttcaatatc tttttgggtg ttaccgaaac tgatgatgaa 
agtcagatgt 1441 aggagtgatg aaaattcagc cgtaacaccc agcttgtcat tcagaatttc ctcagcttca 1501 aagccagtca aacctaagcg agaaacagtg acggttaatc gggttttgtc taaagcaaca 1561 aaaccaggtg attccccgtt acccccctta ccccccttaa taagggaggt tggggggttt 1621 tccaaaaccg ataaaccagg aatttgacta atacgagttc tcgcttcttg ggctagttcc 1681 aatgtgcgag acattaactc ttgaccataa agcgccatct gctgacgcgc tgcatccaga 1741 gaagctaaga gtaaataact gggactagtc gattgtacga gttgcaaagc tttactaaca 1801 cggtcaatat ctatcctgtt gccttgaaca tgtagcattg atgcttgagt caatgcacct 1861 agtactttgt gaatagattg tacagtgatg tcagcacctg cggctaaagc tggagtgggt 1921 agttggggat gaaaggcgaa gtgtgcacca tgtgcttcat ccacaattaa ggggatgtta 1981 tgttggtggg cgagggaagc gatcgccctc acatctccac acacaccgta gtatgtggga 2041 taaaccatca acaccgcttt cgcatcaaga tgctgttcca acgcagcttg tacagccgtg 2101 ggagtgatac tgtgagcaat atctaaaact gagtcatatt ctggatgaat aaaaattggg 2161 atagcgccag agagaattaa acttgcgatc gcagaagaat gaacattccg aggcaaaatg 2221 attttatcac ctgtaccgca agtcgcaaga atcgccgctt caattccgca agtagaacca 2281 ttgacaagaa accaagtttg tgatgcacca aaggcttcag ccgctagctg ctgcgcttgc 2341 tgaatcaccc cacttggtgc aaaaaggtta tctaattctg ctaattctgg caagtcagca 2401 cgaaacactg ctttgccaaa aacatcagtt aaggatttgg aaatcccttg tcctcgtttg 2461 tgtcctgggg tgtaaaaagg agcgtgagga cgttctgcac aggcttttaa ggagtctagt 2521 aaaggtattt ggttttgatt gagcattttg gcaacagatt ttgtggcgga tgtcgcggat 2581 aagttacttt gtcagtcttc gcgcataaga ctaaatactt actcagcaaa gcttttagtc 2641 catttgaatg gacttaggct attagtctgg aacttcagtt ctaggcgggc gaggtttcat 2701 tagagaatcg tgcaagatat ccgttcatcc gttgcaaata gtcatattgg aagatgtccc 2761 cagcatctgt tgtcatctag gttggtaaaa gtcgaatgat ttcagtagag ataccatgat 2821 ttatcccaat cctggacaat tacctgaacc acagcttcct cctccattac caaatcccga 2881 accatcaccg gaaccaagaa ttccccaacc tatacccgaa cctgtaccgg aacctgtacc 2941 caatcctgcg cctcaaccag ttccaggacc agtaccagaa cctgtacctg ctccgattcc 3001 gcaaccagtt ccagcaccta ttccccaacc agttccagca cctatttgaa attccttgct 3061 agtagttttg ttttaagata gcgtttggta ccaaatccaa aatttcaaat ctaaaatgct 
3121 acgagccgga attgtcggac tccccaacgt tggaaaatcg actttgttta atgccgtggt 3181 tgccaatgcc aaggcagaag cagccaactt ccctttttgt actatagaac cgaatgtcgg 3241 cgttgtctct gtgccggatg agcggttaaa tgttttatcg aagatttcct cctcaaaaca 3301 aactgttccg gcgcgtgttg agtttgtaga tattgccggt ttggtcaaag gtgcaagtca 3361 aggtgaggga ctgggtaacc aatttctttc ccacatccga gaagttgatg caattgttca 3421 tgtcgtgcga tgttttgaga atgatgatat cattcacgtt gctggttcag ttgatccagc 3481 acgagatatt gaaatcatca atttagagct tggtttagcc gatttagcac aaattgaacg 3541 gcggatagaa cgcacccgta aacaagctcg tactagcaaa gaaggacaga ttgaattagc 3601 tttgctggaa aaattagctg ctgcattaaa tgaaggaaaa tcagtacggc aagttagttt 3661 gaccgaagaa gaagccgaga ttattaaacc acttgaactg ctcaccaata aaccaatcat 3721 ttatggtgcc aatgtgtctg aagatgaatt ggcaacgggt aatgaatacg tagaaaaagt 3781 gcgagaaatt gcatctactg aaaatgcaca agttgtcgtc gtttctgccc aagttgaatc 3841 ggaattagtt gaattgccag aagaagaacg gtctgaattc ttggcatctt tgggtgtaga 3901 agaaggcggt ttaaaatctt taattcgcgc cacttatact ttgttaggtt tgcgtactta 3961 cttcaccaca ggagaaaagg aaacccgtgc ttggacaatc acagcaggaa tgtcagcacc 4021 acaagcagcc ggtgtgattc acagcgattt tgaacgagga tttattcggg cagagactgt 4081 tgcttataag gatttagcca cagcaggttc aatgaatgct gccaaggaaa aagggcaagt 4141 tcgcagtgaa ggaaaagatt acgtcgtgca agaaggcgat gtgatgttgt tccgatttaa 4201 tgtgtaattt tgttgaaatt aaagtaaata gaaacccggt ttattcaaga aactgggttt 4261 cttaattagg cttgagaact aactccatat atatcctaga cgtcagtcat tatcaaataa 4321 ataacaaaat aacatgcttg aaattccagt tattgattta tcttccttca ccactggtaa 4381 agcaacagct aggcaaactg tcgttcaaca aatctatcaa gcttgccatg aaataggatt 4441 catgtactta aaaaatacaa acatatcaca taacttaatt aatcaagtac ttaaacaaag 4501 caaagacttc ttcgatttac ctttggcaga aaagcaacag ttagcttgga ctaatgaatt 4561 tagcaatcaa ggttacgttg gttttgaaag agaacggctt aaccccaata acccaggaga 4621 cttgaaagaa gcgtttaata ttggtaaaca aaaggcaata gatatagatg taacagatag 4681 attatctcct gtattcactg cctcatcttc tcctgcaaaa aaccctcata ttctcaactt 4741 ttaccaagct tgtacagaac ttgctaacaa ggtgttgcaa gcgatcgctt tagctttaga 4801 
attgccacaa gattttttta cgacaaacca taaccagcaa aatcatactt tgcgactgct 4861 gcactatcca tctttatccc agccacccaa actacgacaa gttcgcgctg gtgaacactc 4921 cgactatggc agtattacct tattgtttca agatgaagtt gggggattag aagtacgaac 4981 agcatcagga aagtggattg cagctgcacc aattcctgat actatcgtcg tcaacactgg 5041 tgatttaatg gaacgctgga caaatcacgt gttttgctca accaagcatc gagtgatgat 5101 tccaaatgat catacactca accagtcaag atattctgtg gcatttttct gtcatcccaa 5161 tgacaataca gaaattgttt gtctagaaag ttgtcagagg gacaaatcac ctatttatcc 5221 tcctattctt gcaggagaat atcttttaag ccgtttacag gcaacctatt aagcggttct 5281 tgaataaacc tactacagag cggcgtaaat agggcaacca tttaaaattc ctaaaaagcc 5341 cgttccataa tacttttgac ttttgacttt tgacttttga cttccgcgca gcggtactac 5401 cacctacaaa atgtctgcct tgatcttggg tgagcaaaac caatgaatac cggatttctt 5461 cgttctcact ctatcagctc acccatatta aaatctattc ttatccaacc tttgagtctt 5521 cgtaaattgt cgagaatcag taagggaatc tgccagtgat gtacgagcag aaagggtaag 5581 ggatcactca cctcaaaaat tgcttctttc ccccgcaaaa tgtttcttac ccaagtttgc 5641 agttgcttaa acgatctaat ctcattcagc cgccaaactt cgtggtacag ccaataggtt 5701 ggcttactat cagcaagagg ctgcaaagat tcagccagag gttctttacc aagataggca 5761 tccgctactc ctggatgatt gtaaaacata gtaatggcgg agtgagtacg ggggttacac 5821 tcaatcgcat aaacagttcc gtcttctacc tggatgaagt caaaggaaag ttgcccggtt 5881 tttcctagtt cttttgtaaa atgattcgcc cattgcataa tttctggctg atcaacgttt 5941 tcgtagttga cttgaaaggc ggatgactca cagcagcagt acattcttga ctctccatct 6001 cgcgcggtag tgtgagtgca gtattccttt ccagggatga attcctgcat aatccagggg 6061 tttttctcac tgatgggcaa actcttgaca aatgctgctg tttctgcttg tgtatcgcaa 6121 ggtagcttgg tgagattcaa gcgacgtatt tgatcgtagg gaatgctttt aagaatgtat 6181 ttgcgcttct cctgagaaaa gtcaaagttg atgacttgtt cgggatcggt aattttgaag 6241 gatttgggga ctgataaacc gaacgatcgc gctgtttcgg caaaggcaaa tttgtcatcc 6301 agaatcttcg tgagatcagc atcgaagtgg aaaaactcta catcgtctgg taatggtggc 6361 ttgccgtgat catagtggat gacagaaaaa atggctaccg ggataaaaaa gtcgatcttt 6421 tcttttttgg cgatcgcgtg tagggtttca acgtagcctt ctaagtcttt gcttggatcg 6481 ggaacggtat 
aaaagcctgc aacagaattg gaatatttat taccacttgg ccagaatttc 6541 tcgatgtcaa ttataatcac ccgatgccct gctgcatgaa atgaacgcgc tagctgaaga 6601 gttttcgtca ttctcgcgcc agcgatcaag atatttttgg aattggggtt gacaacaatt 6661 gacttgccaa atggttgctt gagaaagttc aacagtagcg aagtcaagac gacgatgcag 6721 ttaaagggaa atgcgatcgc cagtaataca agagtgccta agttttggaa aactacaaaa 6781 atctgctcct tcatacttat aattgtgaca tgatagtgga attagtggct ggtggcttgt 6841 aaaaatgaag cggaacaaac agatgcaatc ggagccagta aacttactat cacttacggg 6901 ttgcagaggt agaggtttta tatcttacgt cttctccaaa ctgtatgagt ttcccgatat 6961 taaaatcaat cctcacccaa gttcttaagg agcgcaaact atcgagtaat agtaaaggaa 7021 tgtgccagtg atgtaccatt aagaacggta gtgggtcgtt tacttcaaag attgcatcct 7081 ttcctcgcca aatattttta aaccactttt gcaactgctt taacgatcta atttcattaa 7141 gtctccaaag ttcgtgataa agccaataag taggcttact atcactcaga ggtaagagag 7201 gttcagccgg aggctcttta ctaagatagg catccgctaa acctggatgg ttgtaataca 7261 ttgtaattgc agagtgagcg cgagcattgc actcgatcgc ataaatcgtc ccatcttccg 7321 cctgaatgaa gtcaaaacaa agttgtccag tcagttgtag ttctttgaca aaatgactca 7381 cccacttctt aatttctggg tgatcaacct tttcgtaatt gacttgaaaa gcagatgatt 7441 cgcagcagca gtgtaccgtt aattcaccat tcctcactgt atcatgagta cagtattctg 7501 ttccgggaat aaactcttgc aatatccaag ggttctcttt actaattggc ttacttttga 7561 cataaagcgc cattttttcg tgagactcca ttggtagctt ggtcagatct aagcgtaaaa 7621 cagagtcata gacaatactt ttgagaatgt acttgcgctt ttcgttggaa aagtcgaatt 7681 taagaacttg ttcgggatct gtaattagga aggttttcgg gacagatagc gagagtgagc 7741 gtgctttttg ggcaaaggta aacttgttat ccagcatctt cattgtatca gcatcgaagt 7801 ggaaattctc acaatagcct gataacactg gcttgcattc cgagtcgaag tagctggcag 7861 caaaaatgcc taccgggaca aaaaagtcta tgttttcttt tttggcgatc gcacgcacgg 7921 cttgagtata gccctctttg tcttcgtcgg agtcaggaac tgtgtaaaag cctgctacag 7981 agttagaaaa tcgataacca ctgaaccagt atttgtcgag atcaaataga atgactcgat 8041 gccctgctgc gtgaaatgat cgcgccagtt gaagggtttt ggtcattcta ccaccaccga 8101 tcaagatatt tttcggattc tcgttcaaaa ccacttgctt ggcatacctt cgctcgaaaa 8161 aattccacag cagggatgcg 
agtacgacgg tgcagttaaa gggaaatgcg atcgccagta 8221 atgcaagagt gcccaagttt tggaacactg caaaaatctg cttcctcata gttgtaattg 8281 tgacatgata gtggaaccag tggctggtgg cgagtcagat attaaaggga taaggagatg 8341 cgctccttgt cccctgcttt cctacactcg ccaaatgatc gtcaagccgt ctcgaattgg 8401 cagtagcact tgctcaatcc gggggttatc tttgacgatg cgattgaaac aagcgatcgc 8461 ctccccattt ttggctcgct ccaaagaagg caagtttact tgcccctgat gcaacgtgtt 8521 gtccacacag atgaatccgt gaggagacaa taaattgtcg tctagcagta tttgaaagta 8581 ctacacatac tcagttttgt tagaatcaat gaacaccaag tcaaatgact cctgtgctgc 8641 tgctagcttc tggagtgttt ctaatgctgg tctcaattct acatgaattt tgccaccgtg 8701 aggggcgatt gcccaaaggg cagacgcgac gagcgcgtat cgcaaaacgt cgaacactca 8761 actagctccc aaggcgatta ttgaaacggg gagcatcttt atcaatcaaa tactcaatga 8821 ccagtgccca aattgcgaga ttgtaggcac ggttttgcag cgtccggcta ttttctccaa 8881 aagccgccat gaatgacgct gttttgtggt cggtgggtgc gtcggaacac aaggcggttc 8941 ttgaattgat ttcttgcaaa agcctaattt ctgaaaatca aaccacatct acgtctaggg 9001 tgcgtccatc tgcggatcta aattcattca tcataatttt tgcaaaaaat ctcatagatg 9061 aagtacagcc aaaagcatac agaacgcaca acccagaaga aattcttaat accgagtggc 9121 gttctgaatt cctttagaaa gtgaaagtgg tgcggagtac gcctacagtt gtcgtatcat 9181 ttctactgtt gccttcagga ttgaacaaaa caaatgctcc aggagtaatg ctgatattat 9241 cactcacttt aaaacgatag taagcttcta aatgataagg aacagcccgc cgcgcaccat 9301 caggagtcaa tctagcatca ccatctgcag aggtgcgata aagtggctgt ccaaaaataa 9361 tccccgcatt gtttcctggt ttgaataaat cattgaagtg caatcccgcc atccaacttg 9421 tgaaggtggt agaagcatca acacgaccag aagaatcatc aacaaacatt gctgcaccgt 9481 agccagataa agcaactttt ggagataaac gccaagttac tgtaccgcca aaagagttta 9541 gttcaactgg tgttcccaat gaaacaccag ccaaagcacc aatatcacta ctggttaatc 9601 ctgtacccag gatgttgatt tcgtgataac tgtgggcata gttgaaagca atatccaggg 9661 aactgcttgg tttaaaggtt aattgtgttg ctacaatact actacctcca aacacgcctg 9721 ctcccaaagg tgtgcgagaa acacctggct ggatatccgc agccgaagta ggaatattgg 9781 catttgcact ggcataaaaa gctctcagat ccacctgctt tgatatgtca aaaataaatc 9841 ctgcagccga tgccaaacca gtaccagaag 
taccaccaga gagacgtacc acggggttca 9901 aacctgcaaa gcgagaaact gcttcttgtc cttcaccagc aaaaggtgtg atggcaggaa 9961 aggcatccga tgattccacc gcagttccag caaacaaagt taatttttta gcaatgggaa 10021 agatataaac tagtttgtaa agctcaatgt cattgctacc cgtgctcgac aaagtcttaa 10081 catcaaggcc aggaaattgc ggttcaaaac tgacacgagc gttactggca ctaaatccag 10141 acggagctaa tcccaaagat tcttgaaggc taccgcgacc gtccgaacca cctaaaaagt 10201 tataagcttg caaaccagta atcagtaagt ctttgcctgt aaaactggtg ttgagattca 10261 accgcactct atttgtcaag atgatatttg gatcattcct accaggagcg cctcctgtag 10321 caccaatagc tgcaaaaatt gcctcaccac tcagtttagt cgttgtggaa aactgatttg 10381 cttgcaactc tgccgtgcga gcttctacaa catctacccg acctcgcagt gtcgccagtt 10441 ctgcgccgaa ttcttgttgc agtctctgca aggtggctaa atcctctctt tttaccaaat 10501 cagcagtcgc tgtggcaaca agttcgttga ctcgattgag acaggcgttt aagccagcag 10561 caaattcata gcgagtcatc gcgcgattgc ctcgataagt accatttgga tacccagcaa 10621 tacaaccgta acgctcaact agagattgca gtgcttgaaa agcccagtcc gtaggctgta 10681 catcagataa ttgcgataca gatgtgactt gtgctatcac atttgatgcg gggacttgat 10741 ttggaaattt tgcagttagc cccaaattgt tgcttttctg cttctgattc aaagtaggta 10801 aatcttgttg tttgctatcc gtagatacaa tgagttgtcc ctgagtttgg ctgccttttt 10861 gtatgtcatt aatactgcct atactttcag ttaaagaact gctattttgt ggtaattgtt 10921 ctgccgcgta attaatttta gatgtgttat tagcaatatc tagattcggt attacttgtg 10981 cagataatgg tttggcaatt aaacatagaa aacttgctcc gctggcagag gctagtaaaa 11041 atcttctcat ggcaactcct acacactgaa ataaagaaat ttagactgcg atctcaaaaa 11101 aacaatggtt tagtaaaatc aaggcaaact atgcaacaca atattgcttg cttcgccttc 11161 cttgtccctc agaaaaaagt tttcttttag tgaattgact aaccacttta tgagtaatta 11221 ccgcaagtgt ttaattatca taaatctatt tacgtcaata attaacacaa gctgatattt 11281 gtaattttta taattatcat ctaaccaagt attgctctat ttatcatagc ttagagtttg 11341 tattataaag tacaaaatat gactacacct acttaacaca aaattttact gtatatttgc 11401 ggtatttttt ggaactatat tgtatatttt tagtaaaatt ttaacagtat ttgataaatt 11461 ttagaagaat ttttgtcccc aaaccaagta attttactct agctagagta taaaatataa 11521 tttatgcttc 
atacacaggg atagggttaa tatcttttta tcattttttg atacataggt 11581 agattttatg tagctttcgc aaaagtagcc tgagaattac ggaatcgcga tagtttcttt 11641 tgtatttttt attaagagtc tcacaacaga gagaagcaaa ctaaatatta agaggatttt 11701 aaggatttta tgaagcaata tttgagattt gccaagcggt ttgggctact gtttactcca 11761 ttggtagcga cttctgtgct tttcacctca cccagtcaag cagctacttt tgctttttct 11821 gatggagaat taagtttgac agatttcaac ggaattcttt ccacagaatt taacagcgac 11881 aataatggtg agactaaagg tattactctg ggtaagaatg attttgtaaa cttgcaaaat 11941 attccaactg tagagacgtc aaattctcca ccaaaagcat ttactgatgt tgtcagttca 12001 gtgagtggtg aaggtagaaa ctatacagga ttcgtaaaaa gcgattcgga aattgttggt 12061 aattttgata taggtgctgg taaaaccttt tctttcaatt tctcctcttt tctaaaccta 12121 ggaacacaaa tagatgcatc tccagtagag aatgctcaag caaaaggaga cattgccttc 12181 tatttatacg atacttcaaa tataccagaa caaactcttc ctgacttgat tactggttta 12241 ttagacaatc caaacagcat caagaaaact ccgctttcct tcttttctct ggctgggaat 12301 atcaatactc ttggtaatga ttttctgatc aataaaaata gttcagatat taccttaagc 12361 gagagttata aagaagttga tttggaagga aatgaagaaa agactctggc tgagtttcgg 12421 ggatatttcc aacgttattt tgaaaaacaa gcaaatatca ctttgattgc gactaggaga 12481 agtcaggcga gagtgacagc acctgaacct tcaacaagtt tagctttagt actatttttt 12541 gggttactag ctatagctaa caaaggtaga tttaggacaa aaattttaaa taactcttca 12601 ggaattaaga tggttaaatt gtagatagta atgacgtcaa gctctatact aaaaatttgc 12661 aaagaggttt ttttcttatg gcgactccaa acccatctac ttctgcaccc tccgcgtcac 12721 ctagtcaaac ggcgacttct gctctttctt atggagattt cttcttaaca aacttgagtc 12781 aaagctttgc cactattaac agtgataatc aggctgatac ctcagctact gcagatggtg 12841 gtacagcgac tgtttacaat aacgcagtcg tagaaactga tgatacaaaa gtattgactt 12901 ttgcaactag ttcagcttct ggtgaaaata gagatttttt tgcattagct gaaactcatg 12961 caattattgt tggtaacttg tttatagatg caggtaaaac tttatctttc aattttacat 13021 cgacattaga cttggaaacc ttaaaaaatg catcacagca gatagataat gccagcgcta 13081 ttagagaggt ttccttcttt ttatacgata cttctgatat acctaaagaa aagcttccag 13141 actatctggc caatttacta tccgacccag atagtattga aaaaaaacct ttagtgttct 
13201 tttctctatc tggaaattta aacactttga gtaatgataa ctatttgacc agcaagaaaa 13261 gcgaaaatgt gacaattagt tctgaattga aagattctaa ttttgctgga actcaaaaat 13321 ttgctagtat ttttgtccgt ggttctgtaa aacgctcttt taataatcaa gcaaatataa 13381 ctttagttgc tttgagaaga ggtcaagcga aagtgacagc acctgaacct tcacctactt 13441 taactttacg ctcaacgaag cctaaattac tgggcgttgc tacccaaggt agacatcaag 13501 gtgcgacttc aaatcgtttt tccgatagaa agaccatgaa agttactgtt ggaaaataat 13561 tgcggtagca tgacaatctc atgatgaaat tggttggagt tggagttgaa gtaacaccaa 13621 agagagagaa tatcttttga gcctgttcat aaagatattc aatgaaaacc tacgactgga 13681 ttgttgttgg tgctggtatt acgggtgctg cactcgccta cgaactggta aaaaaagact 13741 gcaaagtgct tttactagaa caatatgcaa caccaaacaa tgcaactcgt tatagttatg 13801 gtggtttaag tttttggtgt ggtgtcactc cactcactcg ccaattgtgc gacgagggta 13861 tcgcacgtca ccgcatcctg tcggaagagt tggacgctga tactgagttt agggagctag 13921 atttattatt gaccatttcc gccgatacag atccagcaag tgtagcagcg tcttacactc 13981 gttttgctat ccctcccaaa ttactcagtt taaaggaagc ctgtgaatta gaacctctgt 14041 tgaatcagga ggctatagcc gctgctttaa ctcacatctt gcacctggaa atatggatgt 14101 caaaaccaaa actacgtgca a // LOCUS NODE_2411_length_14056_cov_5.32162014056 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 14056) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 14056) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..14056
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(127..218)
                     /locus_tag="DP116_20000"
     tRNA            complement(127..218)
                     /locus_tag="DP116_20000"
                     /product="tRNA-Ser"
                     /inference="COORDINATES: profile:tRNAscan-SE:1.23"
                     /anticodon=(pos:complement(182..184),aa:Ser,seq:gct)
     gene            complement(332..1264)
                     /locus_tag="DP116_20005"
     CDS             complement(332..1264)
                     /locus_tag="DP116_20005"
/inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140868.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phytoene synthase" /protein_id="PRJNA477356:DP116_20005" /translation="MLQLPDSIPRMKTPVSVDESYKLCRQLIVKYSTTFYIGTLLVEK PKRKHIWAIYAWCRRTDELVDGPASAITTPETLDLWEQQLESIFAGQPLDSIDVALVD TVQRFPIDIQPFRDMIAGQRMDLYRSRYETFEELHLYCYRVAGTVGLMSTAVMGVDTS TNTAPWNCHQQPYIPTEEAITLGIAHQLANILRDVGEDARRGRIYIPQEDLARFNYTE QDLFKGVVDERWRSLMRFQIARARQFYAKAEKGISYLSADARLPVWASLMHYSRILNI IERNDYNVFTRRAYVPQWQKLRALPAAWLRAQVL" gene complement(1345..2796) /gene="pds" /locus_tag="DP116_20010" CDS complement(1345..2796) /gene="pds" /locus_tag="DP116_20010" /EC_number="1.3.5.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314659.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="15-cis-phytoene desaturase" /protein_id="PRJNA477356:DP116_20010" /translation="MRVAIAGAGLAGLSCAKYLTDAGHTPIVLESRDVLGGLVAAWKD KDGDWYETGLHIFFGAYPNMLQLIKELGIEDRLQWKEHTMIFNQPNNPGTYSRFDFPD VPAPLNGIVAILRNNDMLTWPEKIRFGIGLLPAMILGQRYVEEMDKYSWSEWMKKQNI PPRVEKEVFIAMSKALNFINPDEISATILLTALNRFLQEKNGSKMAFLDGSPTERLCQ PIVDHITARGGEVRLNAPLKEIVLNDDGTVKHFVIRGLNGAEDEVFTADAYVSAMSVD VMKILVPTPWKEIEFFQKLEGLEGVPVINLQLWFDRKLTKIDHLLFSRSPLLSVYADM SNTCREYANPDRSMLELVLAPAKDWIAKSDEEILQATVAELEKLFPDDFTGDNPAKLL KYHVVKTPRSVYKATPGRQEYRPSQVTPIANFYLAGSYTMQRYLGSMEGAVLSGKLTA QAIVRNTAEGFSEGKPLPRFEETSQTPEKALGR" gene complement(2892..3248) /locus_tag="DP116_20015" CDS complement(2892..3248) /locus_tag="DP116_20015" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20015" /translation="MFLCQSFSLKFVHFSHKNARNIRKKQFLSQESEMYQDFYIIVSY GKRKYMKCRSKGHFCGKGCLLRLGCNCLRQGKNLLNGSVLLTQVCWGIKESKTEIVLE QAALVMDDAPRRRQRI" gene 3514..4236 /locus_tag="DP116_20020" CDS 3514..4236 /locus_tag="DP116_20020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747952.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20020" /translation="MRNDLQGLEIGQKELQKLTNLPVNDELIILIHPMKRLLSQIIER AKGSEAATVVFLGMTTLVFSYVAFDVIIRVFAHWVTIPSWLLLVISSCSVGILTQIFF YFLWKQRSRTVTQNMTHSLKILLNDVERYNAVIKAIDINDQIEAAGNSGVNLKERNKV IEALKLTRNDLVRALKTEKILRENKKFIVSKLELFADNLATLTAMQVSEQASEHGRLL NEALQIALDVQQEMKSLQSQGS" gene complement(4370..5209) /locus_tag="DP116_20025" CDS complement(4370..5209) /locus_tag="DP116_20025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874632.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_20025" /translation="MPIRQTFSKPDIQLSYLEWNQGQEPLLLLHGLADHALVWSSLGD ELSSDYHIVAPDMRGHGESGKPERDYSFESAIADLEALMDSLGWSATHVVAHSWTGKL AVIWARINPQRLRSIVLVDPIFIWKIPSFFKLTFPLLYQFLSFLKGMGPFASYESAQQ QARLLNQFQGWSPLQQQVFQASIEQKSDGTWGSKFTITARDRIFEEVMLSPGFTTPIH IPTLFVQPEKGLNRKDWQLQPYKTNLKNLRVCQVPGNHWPFLTQPEAFNQTVKAFLQE HRY" gene 5644..6075 /locus_tag="DP116_20030" CDS 5644..6075 /locus_tag="DP116_20030" /inference="COORDINATES: protein motif:HMM:PF13551.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20030" /translation="MPKRIAISRRLDAPRIANHLSAEELLIRYRQATQPIEHSHYQIM WLLATGKTPQEVAQVTGYTRIWIYQLIKRYNSDGQKALGDKRHHNPGREANLTDVEQA RLWQVLSEKAPDGGLWNGRKVADWLSEITGRHIISSSDDYL" gene 6145..6843 /locus_tag="DP116_20035" /pseudo CDS 6145..6843 /locus_tag="DP116_20035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747986.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 6872..7009 /locus_tag="DP116_20040" /pseudo CDS 6872..7009 /locus_tag="DP116_20040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312184.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="D-glycerate dehydrogenase" gene complement(7203..7766) /locus_tag="DP116_20045" CDS complement(7203..7766) /locus_tag="DP116_20045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_20045" /translation="MVASPEIYLTPEEYLQMEEQSDIKHEYIDGYIYAMAGALDSHVT IALNLATLLRNHVRGSGCRVYIADMKARIESLNRYYYPDVMVTCDQRDQETPAYKKFP YLIVEVLSDSTEAFDRGDKFADYQTLESLQEYVLINTKRQRVECFRRNDEGLWVLQSY TAENKSFRLHSIKFEGTIAELYEDVVF" gene complement(8044..9180) /locus_tag="DP116_20050" CDS complement(8044..9180) /locus_tag="DP116_20050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015983920.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_20050" /translation="MLLGFKTELNLNNYQRTQLAKHVGTARHAYNWGLGLCLGILDHN RLHPDKKIKFPTAIDLHKWLVAIVKPENSWYKDVSKCAPQYALKALREGFAKWFSKKG GRPKFKKKGRDDSFTLDGTIKVLEQKKIQLPVIGILQTYEKLPIGHCPKNITISRQAD RWFISWRIEVSTLVTEKNMDVVGVDLGVKSLATLSTGEVVNGSKSYRKYRQKLARLQR RLSRKVKHSSNWYSAVIDVAKLHRKIANIRGDTLHKLTTYLSKNHATVVIEDLNVSGM LANHKLAASVADMGFYEFRRQLEYKCKLNGSSLIIADRFFASSKTCSNCGHIKQELSL SERVFVCEQCCCQIDRDLNAAINLSRLSSNRIHACEQSAADGSG" gene complement(9185..9793) /locus_tag="DP116_20055" CDS complement(9185..9793) /locus_tag="DP116_20055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002850926.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS607 family transposase" /protein_id="PRJNA477356:DP116_20055" /translation="MSNIIGVKEAAELLGVSTKTIRRWEAEGKIKSVRTEGGHRRFEI SQLLGTKTDGSLTIGYARVSSYEQKQDLERQVIVLETYCAKHGWCFEIIQDLGSGLNY RKKGLIRLIKLICSYQVERLVVTHKDRLLRFGSELIFTICEIFGVEVIVINRTEDSTF EEDLAQDVLEIITVFSARLYGSRSHKNKQIVKQLKEVANNLK" gene complement(9860..10597) /locus_tag="DP116_20060" CDS complement(9860..10597) /locus_tag="DP116_20060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20060" /translation="MRSQQIIRHSDILNTKVITRNNGEQLGVVSQLWVDIEQRQVMAV SLLDNLIAFIGAPRYIYFDNINQIGEVILVDNKNVIEDIDVEGYSNLINCEIITETGE ILGRIQSFKFHRETGKIYSIVIAFLRLPYIPDQFLSTYELSVDEIVSTGSNRLIVFEG AEERLTQLTVGLLERLGISTPPWERNKKGMVTHKLWKNDKWDSENDDTGGSGPIPSPR RPKPGPRPNDTAEQLSIGTPWERNKKG" gene complement(10840..12315) /locus_tag="DP116_20065" CDS complement(10840..12315) /locus_tag="DP116_20065" /EC_number="2.4.1.21" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycogen synthase GlgA" /protein_id="PRJNA477356:DP116_20065" /translation="MYIVQIASECAPVIKAGGLGDVVYGLSRELENRGHCVELILPKY DCMRYDHIWGLHDAFTNLWVPWYNGAIHCSVYCGWVHGRLCFFIEPHSEDNFFNRGCY YGSNDDNMRFAFFSKAALEFLLQSNKRPDIIHCHDWQTGLVPVMLYEMYKYHGMEYQR VCYTIHNFKHQGFAGVDTLWATGLNREAYYFQYDKLQDNFNPFALNFMKGGIVYSNAV TTVSPHHAWEARYTNIGYGLGHTLHLHQDRFTGVLNGIDCDFWNPKTDRYIPSHYTKD NFQEKAKNKKALRERLLLQDVEKPIISYIGRLDEQKGVHLVHHAMYYALHNGAQFVLL GSATESSINNHFRHEKDFLNNNPDIHLELGFNEELSHLIYAGADMIIVPSNYEPCGLT QMIGLKYGTVPIVRGVGGLVNTVFDKDYDQTKPPEERNGYVFYQSDYHALEFSLERPL KLWYNNPEEFRKLALAGMEYDYSWNHPGEEYVKIYDRIRHK" gene 12940..14056 /locus_tag="DP116_20070" /pseudo CDS 12940..14056 /locus_tag="DP116_20070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015180559.1" /note="frameshifted; incomplete; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 4016 a 3035 c 2941 g 4064 t ORIGIN 1 tgttaaccgt tactaatatc gaaatcatag catatttccg caatacctac tcaaatttga 61 ctagttttga agagaaatgc gtaggaatta tgttacagtt tccaaccctt attttttctt 121 aatcaacgga gagggaggga ttcgaaccct cggtacagcc ttacgaactg tacaacggat 181 tagcaatccg tcgctttcga ccactcagcc acctctccag ggtcacgagg attaatctta 241 acagagtcta aggtgaaatg tcaatagttt atttgtcatt tgtcattcgt cacttgttat 301 cacaaacgag aaaggacaaa tgacaacttt ctcataagac ttgtgctcgt aaccatgccg 361 ctggcaaagc acgtaacttt tgccattgag gcacgtaagc acgtcgagtg aacacattgt 421 aatcgttgcg ttcaatgata ttcaaaattc gactgtaatg catcaaagat gcccatactg 481 gtaagcgagc atcggcagat aggtaagaga ttcccttttc tgctttagcg taaaattggc 541 gtgcccgtgc aatttgaaag cgcattagcg atcgccagcg ctcatccacc actcctttaa 601 acaggtcttg ctctgtgtag ttaaagcgcg ccaagtcttc ttggggaata taaatccgcc 661 cccgccttgc atcctccccg acatctcgta gaatattggc gagttgatgg gctatcccca 721 gagtaatcgc ttcttctgtg gggatatacg gctgttgatg gcaattccac ggagccgtgt 781 ttgtggaggt gtcaaccccc atcactgctg ttgacatcaa accgactgtt ccagcaacgc 841 ggtaacagta aaggtgtaac tcctcaaagg tttcgtagcg actacggtat aagtccatac 901 gctgacccgc aatcatatcc ctaaagggct gaatgtctat cgggaaacgc tgaacagtat 961 ccactaaagc gacatcaata ctatccaatg gctgtccagc aaaaatcgat tccagttgct 1021 gttcccacag atccaaagtt tctggtgtcg ttatagcaga tgcgggacca tccacgagtt 1081 cgtctgtacg gcgacaccaa gcataaattg cccaaatgtg tttgcgcttt ggcttctcaa 1141 ccagcaaagt gccgatgtaa aaagtcgtgg aatacttgac tataagctgg cgacagagtt 1201 tgtaggactc gtccacagag accggtgttt tcatgcgagg gatggaatca ggcagttgca 1261 gcattcgttg caggctgaag ggtttgcatt tgcaggcttt ttgaatttgc caccggatgt 1321 gcttcagcag tcgtgtagcg cgtcctaccg acccaaagct ttttcaggcg tttggcttgt 1381 ttcctcgaaa cgaggaaggg gttttccttc agaaaatccc tcggcggtat tacggacgat 1441 cgcctgcgct gtcagcttac cagaaagtac ggcaccttcc atactcccta ggtaacgttg 1501 cattgtgtaa cttcctgcta aataaaagtt agcaattggg gtcacttgtg agggacgata 1561 ctcttgacga ccaggcgttg ctttgtaaac tgagcgtggc 
gtcttcacga catgatactt 1621 tagcagcttt gcgggattat ctcctgtaaa atcgtcagga aacaattttt ctagttccgc 1681 aacagttgct tgcaaaatct cttcatctga ttttgcaatc caatcttttg ctggggctag 1741 aactaattcc agcattgaac ggtcggggtt ggcgtattcg cggcaggtgt tactcatatc 1801 agcataaaca ctgagcaggg gcgatcgcga gaatagcaga tggtcaattt tagtaagttt 1861 acggtcaaac catagctgca agtttatcac tggcacaccc tctaaacctt ccaatttttg 1921 gaaaaattca atttctttcc agggtgtagg taccaaaatc ttcatcacgt caaccgacat 1981 tgccgatacg taggcatccg ccgtaaatac ttcatcttct gccccattca acccccgaat 2041 cacaaaatgc ttgactgtgc catcgtcatt cagcacaatt tctttgagcg gtgcattcaa 2101 ccgcacttct ccaccccgtg cggtaatgtg atcaacgatt ggctgacaca gccgttccgt 2161 aggagagccg tccagaaaag ccatttttga gccgttcttc tcttgcaaaa accgattgag 2221 agccgttaga agaattgtcg ccgaaatttc atcaggattt atgaagttca gcgccttaga 2281 catggcgata aaaacttcct tttccactcg tggcgggatg ttttgttttt tcatccattc 2341 cgaccaagaa tacttgtcca tctcttctac ataacgctgt cccagaatca tcgctggtag 2401 caaaccgata ccaaagcgaa ttttttctgg ccatgtgagc atgtcgttat tccgcagaat 2461 agctactatg ccattcaaag gtgctggcac atcgggaaaa tcaaagcgac tgtaagtccc 2521 tggattatta ggctggttga agatcatggt gtgttctttc cactgcaatc tgtcttcaat 2581 gcccagttct ttgatgagtt gcaacatatt gggatatgcc ccaaagaaga tgtgcagccc 2641 ggtttcatac cagtcaccgt ctttgtcttt ccatgctgct actaaccccc ccaacacgtc 2701 tcggctttcc aaaacaatag gagtgtgccc tgcatctgtg agatatttag cacaggaaag 2761 tcctgctaaa cccgctcccg cgatcgctac tcgcatttaa ccttactgct cttgaatatt 2821 tttaattgtt ttatcgtttt cattatactt tgcaatccgt tacatttagc agtttttcgg 2881 tgatcgcttt ttcagattcg ctgcctcctt ctgggagcat cgtccattac tagcgcggct 2941 tgttcaagca caatctctgt cttagactct tttatacccc aacacacttg agttaacaat 3001 acacttccat taagaagatt tttcccttgg cgtaggcagt tgcaacccaa gcgcaacaga 3061 caaccctttc cacagaagtg tcctttgctt ctacatttca tgtactttct tttaccatat 3121 gacacaataa tataaaaatc ctgatacatt tcactctctt gactcagaaa ctgctttttt 3181 ctgatatttc tggcattctt gtgagaaaag tggacaaatt ttaaggagaa actctgacac 3241 aagaacatag taagtcaaca cttagaaaag tcatcccgaa acctttagtg 
gaggtatgta 3301 aaagagataa aggaaaggct aatggacttt tggctctgac atgacttgtc gtcttttaat 3361 agtgtcaact ttgacacagt attaacatgt gactctaaaa tataattttt gttaaatttt 3421 ttgaataaca aaaatggtag ttttatttgt ctgccatcaa ttttactgtt atataatact 3481 ggaaaataac tatattaaga ggatattaaa gctgtgcgta atgatttaca aggattagag 3541 attggtcaaa aagaactcca aaaattgact aatttgcctg tgaacgatga actgataatt 3601 cttattcatc ctatgaaaag actactgagt caaattatag aaagagccaa aggttctgag 3661 gctgctacgg tagtttttct tggaatgaca acactagttt ttagttatgt cgcatttgat 3721 gttattatca gagtatttgc tcattgggtg acgatacctt cctggctact attggtcata 3781 tctagttgtt cagtaggaat tttgacacaa atattctttt attttttatg gaaacaaaga 3841 agtcgtactg tcactcaaaa tatgacacat tcgctgaaaa ttctcttgaa tgatgttgag 3901 agatataatg ctgttattaa agcaatagac atcaacgacc aaatagaagc ggcaggaaat 3961 tccggagtaa acttgaagga aagaaacaaa gttatagaag cactaaaact gacaagaaat 4021 gatctagtta gggcattaaa gacagaaaaa atactaagag aaaataaaaa atttattgtt 4081 agcaaattag agttgtttgc tgataattta gcaacattaa cagcaatgca agtcagcgaa 4141 caggctagtg aacacgggcg attgctcaat gaagcattgc agattgcatt ggatgtacaa 4201 caagaaatga aaagtttaca gagtcaggga tcttaaggct gtaaagtctt aagaaacaca 4261 gggaacaggg aacagggaac agggaaagag tgtttctttc attcataacg ggtggcgcgc 4321 cacccgtgga gaactcttaa gaatctccga tcctgaaaaa cggacagtgt caatatctat 4381 gctcttgcaa aaaagccttc acagtttggt taaacgcctc aggctgtgtt aaaaagggcc 4441 aatgattacc agggacttga caaacgcgta aatttttcag attagttttg taaggttgga 4501 gttgccaatc tttgcggtta agtccttttt ctggctgcac aaacaaggtg gggatatgaa 4561 tgggggttgt aaaaccggga gagagcatca cttcctcaaa aattctatcg cgagcggtga 4621 tggtaaattt actaccccaa gtcccatcgc ttttttgttc aatacttgct tgaaaaactt 4681 gctgttgtaa gggactccat ccttgaaact gatttaacag acgcgcttgc tgttgggctg 4741 actcgtaact agcaaatggt cccatacctt tgagaaagga taaaaattgg tacaaaaggg 4801 gaaaagtaag cttgaaaaag ctgggtattt tccaaataaa aatgggatcg accagaacga 4861 tactccgtaa acgctgcgga ttgatccttg cccatataac agctagttta cctgtccaag 4921 agtgagcgac aacatgagta gctgaccatc ccaagctatc catcagtgct tctaagtcgg 
4981 cgatcgcact ctcaaaacta taatctctct caggtttccc actctcacca tgaccgcgca 5041 tatctggggc gactatatgg tagtctgatg atagctcatc tcctaggctc gaccaaacca 5101 aggcatggtc ggctaaaccg tgtaatagga gtaaaggttc ttgaccttgg ttccattcta 5161 aataagaaag ttggatatca ggtttcgaga aggtttgacg tattggcatg atcgcagatt 5221 caccagactt tgacaggatt ttcacttcta cagatttaca aatcgattca tttcttttac 5281 tttcttatca aaccaaaccg tttctacaca tccgtagatt ttattctgtc tgacaataaa 5341 acaatctaaa aagtaaggag tgaactacct acacctgtct cttgtataga ctaggtgtag 5401 gcttttagta accctgaacg gtctgtactg agatgacagc acaaactgtt tgagcctgtg 5461 ggactgttta cggggcgttg ctaatatggt ttatcagttg cgaaccttag tttactgaat 5521 ctttgacttt gtgtcctttt gagaaattgg acatgaatgg tcgctcaaag attaagtaat 5581 atcaagttcg gataaacact tataataaac atagaatcgt tgcagtgtat ctatttaggt 5641 ctcatgccca aacggattgc gataagccgg aggcttgacg ctccgcgtat cgccaaccac 5701 ctaagtgcgg aagaactgtt aattcgttat cgtcaggcaa cacaaccaat tgagcatagt 5761 cattaccaaa taatgtggtt attggcaaca ggaaaaacac ctcaagaggt ggcgcaagta 5821 actggttaca cgagaatttg gatttatcag ttaataaaac gttataactc agatggtcaa 5881 aaggctttag gagataaacg gcatcataac ccaggcagag aagcgaattt aacagatgtg 5941 gaacaagctc gactttggca ggtactttca gagaaagccc cagatggtgg attgtggaat 6001 ggtcgtaagg tggcggattg gttaagtgaa atcacaggaa gacacataat atcaagttcg 6061 gatgattact tataattaag ccgcccttgt ttttggccac cagtggtagc aagttagtcc 6121 ttgaattaag tggtgctgtt tcataatttc ttggcaacga gtgagggaac taagctacct 6181 gccctaaaat cgtttcggaa gtccgagaaa cgattagcta aagtatcgcg tcgcaaggag 6241 aaaaaacgta aaggtagcaa agcacgtcgt aagctcgcca aacgccaagg acgcgaacat 6301 caacgtattg ccaaagcaag aaaagaccat gcttttaaga ctgctcatga gttggtacga 6361 acaggcaaga aagtttttgt ccatgaggac ttaaatctaa aagccttgtc aaaacgcaac 6421 aagacgaaac aagatgagga tggtaaattt ctaccaaacg gacagacagc caagtcaggc 6481 ttaaacaaat cttggaatga tgcagcatgc ggacagtttt ttataaccct ggaacacata 6541 gccgcaaaag ctggggctag ggtcatcgca gttacccctg catacacatc tcaattacta 6601 gcgtatcgtg acgagatagt cttcactgac tgcgatacgc gcttttactg ggatgctgaa 6661 
ctttcgctta acgttgaccg cgatataaac gcgggaatca acattaagcg cgttgggctg 6721 ggactgttcc caacgctaaa acgccgtaaa gggaatctag tagtgggcga ctctactacc 6781 aatagtacct tgaaggaagt tctggaaacg ttacgagcgt gccagaagcc tacaccgacc 6841 tgagtacagg tacggtgtag gtagttcaca taacctgatt attacacctc atattggcag 6901 tgccagccgt caaacacgag aaaaaatggc aaacatggcg atcgctaatc tcattgctgg 6961 gttggagggg gagcgattgc ctaattgtgt caatcctgaa gtttattaat agtatctaga 7021 actctatcaa ggcaactttg aggggtagaa aaaaaccaca aagacgcaaa gagcgcaaag 7081 gaagaggaaa agaaaatgaa tttaaaagtt gataaaccaa gcattgttgg tttggcggac 7141 tactaagata taaatccaaa gagcttcata caattttaca atctattttt taggaattgc 7201 tgctagaaaa cgacatcttc gtaaagctcc gcgatcgttc cttcaaactt aatactgtgt 7261 agtcgaaatg atttattttc tgctgtgtaa gattgtaaaa cccacagccc ttcatcatta 7321 cggcggaaac actcgactcg ctgacgtttg gtgttaatta aaacatactc ttggagactt 7381 tctaaagttt ggtaatcggc gaatttgtct cctcgatcaa aggcttcagt ggagtcagat 7441 aaaacttcga caattaaata gggaaatttt ttatacgcag gagtttcctg atctcgttgg 7501 tcgcaagtca ccatcacatc aggataataa tatcgattta gagattcaat acgtgctttc 7561 atgtcagcga tgtaaacacg acagcctgag ccacgtacat gattacggag aagtgtagca 7621 aggttgagag caatagtcac atgggagtca agcgctccag ccattgcgta gatatagccg 7681 tcgatgtatt catgtttgat gtcgctttgt tcctccattt ggaggtattc ctcaggggtg 7741 agatagattt caggtgaagc aaccatattt aagacctcaa ggttaagcag agcaaacgca 7801 ctttaaagtc gcacatatgt tgttgatata aagttagatt catgcacatg accgaacaat 7861 cagattgatc atgatacgcc aagacttggt tgaaatcatt atggacttgg aatcggggga 7921 gaaccgccag tatcatcgtt ttccaaaccc cactcgtcat cgtcccatag gtcatgcgta 7981 actataccgc tctacaaaac ttggacaaag ttagacataa cgttgtccgg tctgcttcct 8041 gcttcatcca gaaccgtcgg cggcactctg ttcacaggcg tgaattcggt tagagcttaa 8101 cctactaaga ttaattgctg cgttcaaatc tcggtctatc tgacagcaac attgttcaca 8161 aacgaaaact cgttcagata aactgagttc ttgtttaata tgaccgcaat tagaacaggt 8221 tttacttgat gcaaaaaacc tgtctgcgat gattaaacta ctgccattga gcttgcattt 8281 gtattcaagt tggcgtcgga attcataaaa acccatatcc gcaaccgaag ctgccaattt 8341 atgatttgcc 
aacattcccg acacgtttaa atcttctatc actactgttg cgtggttctt 8401 gcttaaatag gtagttaatt tatgcaacgt atctccacga atgttggcaa tttttcgatg 8461 tagtttggca acatctatta ccgcagagta ccaattactt gaatgcttga ctttccgact 8521 caatcgacgt tgtagtctcg ctagtttttg gcgatacttt ctataacttt tgcttccatt 8581 aacaacttct cctgtagaaa gagttgctag agatttaact cctaagtcaa ctccaactac 8641 atccatgttt ttttctgtaa ccagagttga tacttctatc cgccaagaaa taaaccatct 8701 atcagcttga cgactaatag taatattttt tgggcaatga ccaattggta gtttttcata 8761 agtctgcaat ataccaatta ctggtaattg aattttcttt tgttctaaga ccttaattgt 8821 tccatctaac gtaaaagaat catcacgacc ttttttctta aatttcggtc tacctccctt 8881 tttactaaac cattttgcaa aaccttctct caatgctttc aatgcatact gtggcgcaca 8941 tttactgaca tctttatacc atgaattctc cggcttgact atggctacta accatttatg 9001 taagtcaatc gctgttggaa atttaatctt tttatctgga tgaagtcgat tgtggtcaag 9061 aattcctaaa cacaagccta atccccaatt atatgcgtgt cttgcagtac ctacgtgttt 9121 tgctagttgg gtacgttgat aattgttcag gttgagttct gtcttgaatc cgagcaacat 9181 cacactactt taaattattt gctacttctt taagctgttt gactatttgt ttatttttat 9241 gacttcttga cccatataat cgcgcactaa atactgttat tatttctaac acatcttgtg 9301 ctaaatcttc ctcaaacgtt gaatcttctg tacgattaat cacgataact tcaactccaa 9361 atatttcaca tattgtaaaa attagctctg acccaaagcg taatagtcta tctttatgag 9421 tgacaaccaa tctttctact tggtatgaac atatcaactt gattaaccgt atcagtcctt 9481 ttttacggta atttaagcca ctgcctaaat cttgaatgat ttcaaaacac caaccatgtt 9541 tagcgcagta cgtctctaag acaattactt gacgttctag gtcttgcttt tgctcataac 9601 tactgacacg cgcataacca attgttaagc tgccatctgt ttttgtccca agcaattgtg 9661 agatttcaaa tcgcctgtga cctccctctg tacgcacaga cttgattttt ccttccgcct 9721 cccaacgtct gattgtcttg gtactgaccc caagtagttc tgctgcctcc tttaccccaa 9781 taatattgga catttattca ttataatttg atgactgtcc aatattatct aattttgtcc 9841 aattatttca cttctgttgt cagccctttt tgtttcgctc ccacggtgtg ccgatactca 9901 gctgctcggc ggtatcgttt ggtcttggtc ctggtttagg tcgccgtgga cttggaatcg 9961 gcccagaacc gccagtgtca tcgttttccg agtcccattt gtcatttttc cataacttat 10021 gggtaaccat accctttttg 
tttcgctccc acggcggcgt gctgataccc agacgctcca 10081 gcaaaccgac tgttaactgg gtgagccgtt cttctgctcc ctcaaacaca atcaacctgt 10141 tggaaccagt gctaacgatt tcgtccactg aaagctcgta ggtactcaag aattggtcgg 10201 gaatgtacgg tagtctcaaa aaagcgatga ctatagagta aatctttcct gtttcccgat 10261 gaaacttgaa gctctgtatc ctgcctaata tttcaccggt ttctgtaata atctcgcagt 10321 taatgagatt gctgtaacct tcaacgtcaa tatcttcaat tacattttta ttatcaacta 10381 ggatcacctc acctatctga ttgatgttgt cgaagtatat ataccgaggc gcaccaataa 10441 acgcgatcag gttgtctagc agactaacag ccataacctg tcgctgctca atatctaccc 10501 acaattgact caccacccct aattgttcgc cattgttacg ggtgattacc ttagtgttta 10561 agatgtcgga atgtctaata atttgttgtg atctcattat gcggagtcct gcaattctcc 10621 gattcttatt catcagctta actcaagata atccccccaa cccctgcgac taaacttttt 10681 tctcatctct cccctgcaag cggggagttg gaggaagtgg aggggtatgg ggggatttat 10741 gagttccagt gataaactaa gcaactacca ttagacatct cctaaaacta atctcaaatc 10801 cttggtctgt ttgggctgat gtttaatttc tggagatgtc tatttatgtc ggatacggtc 10861 gtaaatcttt acgtattctt ctcctggatg attccaagag tagtcatact ccatacctgc 10921 aagggcaagt ttgcggaact cttctgggtt gttgtaccac aacttcaaag gacgttccaa 10981 cgaaaattca agcgcatgat aatctgactg gtagaacaca tatccattgc gttcttctgg 11041 tggtttggtt tggtcgtagt ctttatcaaa cacggtatta actagtccac cgactccccg 11101 gactataggt acagtaccgt acttcaagcc aatcatttga gtgagtccac aaggttcata 11161 attactgggg acaataatca tatctgcccc agcataaatg aggtgggata attcttcgtt 11221 aaagccaagt tctaaatgaa tatcaggatt attgttcaaa aagtctttct catggcggaa 11281 atgattattg atgcttgact ctgttgctga acccaacagc acaaattgtg ctccattatg 11341 gagggcgtaa tacattgcgt gatggacaag atgaacgcct ttttgctcat ctaaacgacc 11401 aatgtaggaa atgattggct tctcaacatc ctgcaataac agccgctctc gtaaagcttt 11461 tttattcttg gctttttctt gaaaattgtc tttggtgtag tgagagggaa tgtagcggtc 11521 tgttttcgga ttccaaaaat cacaatcaat gccgttgagg acaccagtga atctatcttg 11581 gtgtagatgc aaggtgtgac ctaatccata accaatgttg gtgtaacgag cttcccaagc 11641 gtggtgtggt gaaactgtcg tcacagcatt ggaataaaca atgccccctt tcataaagtt 11701 
cagggcaaaa ggattgaagt tgtcctggag cttatcatac tggaagtagt atgcctctcg 11761 gtttaaaccc gttgcccaga gagtatcgac tccagcaaac ccctgatgct tgaagttatg 11821 aatggtgtag caaacccgtt ggtactccat tccatgatat ttgtacattt catacagcat 11881 gacaggaact aaacccgtct gccaatcatg gcaatggata atatccggtc gcttgttact 11941 ctggagcaga aattccaaag cggctttact gaagaacgca aagcgcatat tgtcatcgtt 12001 agacccgtaa tagcagcccc gattgaagaa attatcctca gagtggggtt caataaagaa 12061 acacagccgc ccgtgtaccc aaccacaata gactgaacag tgaattgcac cgttatacca 12121 gggcacccat aagttagtga aagcgtcatg gagtccccaa atgtggtcat aacgcatgca 12181 atcatacttt ggcagaatga gttcgacgca atgtccccgg ttttctagtt ccctgctgag 12241 tccgtaaaca acatcgccta aacctccagc tttgataaca ggagcgcact ccgaggcaat 12301 ctgtactatg tacatttcta tcctcgctta ataaaaattt atttttatta tcctttccat 12361 gatgacagaa aagatgtatc cattggcaag tcactgaccc cgtcattagg tacaaatctt 12421 aggaaataat gcagtttgta tcatttgttg ccaaaaaatt ttaaccaaaa gcttttcaaa 12481 agtaaaaaat tatgcaaaaa tgtatcctaa gttaggttta caaatattgt aaggcgtatt 12541 aatgcagtcc agaatttaga ttgaatgtgc atggagagat tgaagttcgt ttgttaacac 12601 tacccaaggg cgataagcaa gtacaacaga tgaaaagtag actttccaag ctgcttgagc 12661 aggacgaaaa ataagctttc taaacgttat tttctggagt ttagactaaa gggcaggcga 12721 ttgccgtagg cagagcttcg cttaacgtta acaaaggaaa gcaggtgttg ttggtataaa 12781 atttttggaa ggggagtttg taattaacac agaccccctt cgccggaaga tgagagaaga 12841 accaccaact ctcacgcccc caagatgaaa ctagccaaac tccaagaatt acgcccgtgc 12901 tgcttacaac tatttaggca aggcacatga tgctacgcaa tgaagctgat agatgctata 12961 ttgctgaccc gtaacgctta cagtttggca gatttatccc tgtccccggt ttttcgccgt 13021 aagtggcgca gcatttacgt tcgcgttcgc gcagcgtgtc ccaaggggac tcagcgtgtc 13081 ccaaggggac tcagcgttac aagatagtag accacaacga caaaaattga tgcagttata 13141 catcaagcag atgccagcaa agggacgtct cttactagca ggcgaccaca ccgcctggtc 13201 tcgcccggat gcagtgacac tgcaagaaag gacaattgag catagcaaca cgctttgtgc 13261 caggaaatcg accaattacc attggtcagg gatatagtac cattgcttgg ataccagagg 13321 attctggcag ttgggcatta cccttgagac atgaacgcat cacaaggagc 
ggaaagccct 13381 attgagaaag caacttggca actacaacag gtgtgcgaac atttgcccag cagaccgatt 13441 tctgtttggg acagtgagta tggctgtgcc ccttttgttt tgaagactgc taacattgct 13501 acagatattc ttgtccgttt gcgttcaaat ctttgtctat ggggagcacc tccagtgtac 13561 tctggtagag ggcgtcccag aaagcatggc gataagttta agctgaatga accttctagt 13621 tggggtgaag caattcaaag tttggaggtt aaccagctca aactggggcg ggtaagagtt 13681 agcttatggg aaaatttaca cttccgtaaa acggctacgc gcccgatgtc attaatcagg 13741 gttgagcgtc ttgaccaaca aggttgctta agagtgtcaa aacctttatg gttggcttgg 13801 gtaggagaac aaatgcctcc actatctgaa gtttggctac tttatttgcg tcgctttagc 13861 gttgaccact ggtatcgttt tttaaagcaa cgcctgcact ggactctccc caagcttagt 13921 accccaaaac agtgtgttcg cgtagcgtgc ccgaagggca tacgttggag tgagttgatg 13981 ccaatgatta gttgggaact gtggttagct cgtgatattg tgagaccagc gcgcatgagg 14041 cgttttcccg ccgtag // LOCUS NODE_2422_length_13999_cov_4.966724 13999 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13999) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13999) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13999 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..3466 /locus_tag="DP116_20075" CDS <1..3466 /locus_tag="DP116_20075" /inference="COORDINATES: protein motif:HMM:PF00400.30,HMM:PF12894.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20075" /translation="GLSAFGEKDADFFFGREKFIASLVEAVNSKPLVPVVGASGSGKS SVVFAGLIPHLRTVRNVEIISFRPGKNPFDALAVALSKYCQSLVQRQPKTSGEADNRL AELEFEVNLRHDETVLFHFLENIINTSGLQRLVLVADQFEELYTLAAQEERHSFLNVL LLAVKCIPALTLVLTLRADFLGIVLDYQPMGKALQEYTPLLLTPMDKEELRDAIEKPA LKMKVELEEGLTSKLINNVGDRPGRLPLLEFALTQLWSKQKNWYLTHKAYEEIGGIEK ALAKHADEVLKNLSEEEKQQAQRVFIQLVRPGEGTEDTRRVATRNEVGKENWGLVQQL ADARLLVTGWDETEKIETVEIVHEALIREWRTLREWVSANREFRIWQERLKQEMRDWE NSDRNPETLLQGTRLAVAEDWYKQRRDELTPRARRFINASIKWRKQEQQKQRRRRQLT IFGLTGGLVVTLMLAGAAWWQWQNSAKSEIKAISESSAALFASNQKLDALKEAIRARR KLQKLLGVDADTQHQVELVLQQAVYGAVEYNRLLGHNGEVKSAVFSPDGNIIASTSDD SSVKLWSKDGILLRTIKGHKTTVYKVAISPDGQTLASASADKTIKLWKRDGTLITTLK GHEGGVLNVAFSPDGNTIASASDDSSIKLWRVSGKVSALLTTLKGHGTSVQAVAFSPD GSTIASASGDSTVKLWNKDGTLLTTLAGHKSIVWDVAISPDGKTLASASADSTVKLWN KDGTLLKTLEGHQGPVSSIAFSPDGKTIASASWDNTIKLWNKNGELLTTLNGHSDRVW GVAFNPDSKILVSVSGDKTIKLWKLDSALLTTFRGHSAAVIGVAISPNGKAIASASDD GTVKLWKQDNILPVTLDHKSAVYGVVFSPDSNTIASVGLDSAVKLWNKNGTLLNTFKG HKAGNWGVGFSPDGNTIASAGWDSTVKLWNKNGTLLKTLEGHLQPIWDVTFSPDGEMI ASASADKTVKLWNKNGTLLKTLEGHKAGVWGVAISPDGNTIASASEDKTVKLWKPDGT KLTTLEGHGGTVFGVAISPDGKMIASASADNTVKLWKMETGKLAILLATLNGHNKRVW GVAFSPDGKKLASASDDKTVILWNLARVVDQDKVMAYGCDWVRDYLKNNRDVRESDRR LCDGIGTQ" gene complement(3920..4285) /locus_tag="DP116_20080" CDS complement(3920..4285) /locus_tag="DP116_20080" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20080" /translation="MPKSYINSKSYINSKSYINSKSYINSXXXPKATLTPKATLTPKA TLTPKATSTPLKPKETKLITEELAKELGVSKDNLLSEVKKCHENFKKWSATHGKGTWD FEIETSEGKESILKFQKVN" assembly_gap 4199..4208 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(4323..5177) /locus_tag="DP116_20085" CDS complement(4323..5177) /locus_tag="DP116_20085" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20085" /translation="MSQIFIETRGKSIDYRFLVKSPSESWWRTYSQWTAFENPTLIIE NTSGATQIYLSAIPSKRKDRTQTTIRYTLVIELDRSDSNDDLFLKLISKWLEEVKTAP KNLPKKSEIGDLLDRCFPENMVEELLIKRQNEENPEEMIENLKSGISNFQFSKPKEEN TLRYKMWWGGINNQKSNNSWISLVSKLLKEGYGKALLLNLAGENDLKQLLPTDQNNQN IGLLIIENDQEPTEIPIFNRQSDFDKIGRNWPSILTKLSAIKYVVLLIFIALFVVFGR NFFLKTIP" gene complement(5193..6212) /locus_tag="DP116_20090" CDS complement(5193..6212) /locus_tag="DP116_20090" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20090" /translation="MGENNQATVIEYKIGIVGPTRVGKTSMITSLLEQGKELLAGTDV SIEAVGKTKARINRYRDELRGSLMADEFNPGGMSGTQEPFTIELAMSVGSSKLTWAIL DYPGGWIDEESRPSDRQNDWNNCQAWIKESIVLLVPIDSAVVMESSTKAELQAANTTL QISQAGEVAREWVKGRIKKGEPGLLILVPVKCETYFSDNGGKRDKSADLVDRIQKLYQ DLLNAVRQEIDGATNKPEILIQYHPIDTIGCVEIKDARWIEEGVRLGFQADYLVRPPR QPRPKGADGLLISICRQIASLEKNKKRGIFSRVWRWATEEDEKLNKAIEKLQSKDLGT RVKHL" gene complement(6224..8458) /locus_tag="DP116_20095" CDS complement(6224..8458) /locus_tag="DP116_20095" /inference="COORDINATES: protein motif:HMM:PF00350.21" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20095" /translation="MADIQNQINSVIQKRREQLPLIEVRIQQAKDVENALNEMNSALV NLGNHPKATDQLRVYLRDFQQNQFRQWIASSLDQLMNAQARLLRETINIGVSGQARVG KSTLLQTIAGLTEEQVPTGTGIPVTAVRSRLRHSTTYSRAILTLHTFETFRDQILEPY HKELKLSSCPATLADFQSFNYSEQNILSDNTQHSSIVLLERLRKMQKALPSYSADLTG ETREVSLEGLRSWVAYPTNEEEKNPNCPRRYLAVRDVLIECRFQATDVENLTVIDLPG LGELDASAEEHHVAGLKNEVDLVLLVKRPVEGLAFWKAEDGKAADILDQARGAIKQRR DFVIIVVNGDPNSELFKVLLDDITRQANEGTADKHYRVLQCNAKDSSNVRSSLLVPCL EHLAQRLTIMDRQVIESAKSEWLTTIQRIQGALKDLRDGLKRQTPDSFSSAEEFDKLV EKLRKELVVSLEEEIVLKLFQKARNPEEEDTKLIQTIRNTSNQIKKWVEEEGFGIGKE KWIRDAYETMMRDKSVVRFALEECNRIRVHISETYSRIDNYLDTKVQELWSEISCIIY KNTGQLLEGTQDGKESLEKFVEFLKNGNEQCPSLQKAVKDLLSLNISYRSHFHPRVRE KLDSLNYDEMVHNLKRENKLDEKELFKTMSDLAIQASYETEKALLSETLIPSLILHAA AEQFEDSLIRSGESEREFKRLGRSYRDEIWPSTFRDMDAQNARVAKVNRVIDKLEKIL IISN" gene 9151..10629 /locus_tag="DP116_20100" CDS 9151..10629 /locus_tag="DP116_20100" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20100" /translation="MVSPTKYWQMQILPIGEDVQQQHQREISRAKEFFKTQFPHLSNK PTLSTEENKQVQTVLWEIFRLDDDIYQRAIAGLCLRCYVSHRIFITCKNIPHTYNVSA ENLFRYTDLLPFVLNDDGKALVILDSEGKTQHILNNRDGTTQAIAKGGEFFSVDILRR FNPNLGSNESLDNWTTRLTRQNEEIKSFLWEFGLATPSDWGLLCKSIPRSLSGLFLTE DYEIVKAFQTVYQRDRLKTRQKGRCSEPTPSQLQEMLYLLQQKNIIISQNTLIDHLKR IAEDLRQDWLYKKTGSTKTVPMEVYDNSTNDYFPNPELPYHTDREPEDVELEKLQEIC KDLFEQVLSQTIGEVIHQRIENLKKSRGYKNFAQRLPEGLRLYYHENISLGEIGKIWG IEWSKARRIMQLENFLEIVQYRTEEIFLNKLLQSLDKSQLTRISHEPESLKNIVAEIR EFVWNQTFKEAKAELLSSKKQNKNSLFAKKIRIYLSDSSYAA" gene 10692..11813 /locus_tag="DP116_20105" CDS 10692..11813 /locus_tag="DP116_20105" /inference="COORDINATES: protein motif:HMM:PF08852.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20105" /translation="MSSSKKSILKTKNRRIKFSPKVVWLESESFEEARVISENNFNKF GEINQWKIYLNALARLGFEKYLKERNPNIKINQHSAAHPIDDVCYLNLGEFHLCLIIV DNLIDYFVTVPEEVITSPKRVAHFYVLLEVLEEEEQLNIHGFLRYDQLVKYCQSINLD AKSNSCYQLPLSLFDPEVNNLLLYSRFLSPTAIPLPSVAEVNDTEIQNLTQTTSISTK ALVNLTNWWLEVFEEGWQSTKNILKTLDNNYVWGYARSHSRVDHYSGAKKLDFGLLLN GQTLALVLNLKRLENNEVDVLVQVIHCYEEHRNEEYLPPGLKLKVTLNPNTSESESQE VTARKADNVIQLEFSEALGKQFKVEISFKNVVVTEDFLL" gene complement(11977..12864) /locus_tag="DP116_20110" CDS complement(11977..12864) /locus_tag="DP116_20110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203640.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UTP--glucose-1-phosphate uridylyltransferase" /protein_id="PRJNA477356:DP116_20110" /translation="MQQNKVRKAIIPAAGFGTRLFPATKVVKKELFPIIDKDGRAKPV ILAIVEEAISAGIEEVGIVVQPPDKEIFGEFFKSPPKKELFDKLSPQNQEYSKYLQEL GSRITILTQEEQEGYGHAVFCAKEWVKDEPFLLMLGDHVYSSNADEKSCASQVLDVYK QVNQSVVGLTVMSAEIIHKAGCVTGVWHLFNSLLSLTQVYEKPSIDYARQHLRVEGMA DDEFLCIFGLYVLTPKIFDFLEEHIHQNFRERGEFQLTSCLDRVRQEEGMTGYVVKGK CFDTGLPDAYRQTMIDFRI" gene complement(12851..13900) /locus_tag="DP116_20115" CDS complement(12851..13900) /locus_tag="DP116_20115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GHMP kinase" /protein_id="PRJNA477356:DP116_20115" /translation="MHIFVPGRLCLFGEHSDWAGGYRRLNPKLEVGYTLLVGTNQGLS AHVQPHPTQFILHTCVSDGTRQTLILPMEKEALLTEAKKGGFFSYAAGVAYQFLAEFA VGGLEIDNFLTDLPLKKGLSSSAAICVLVARSFNQVYDLNMTLRQEMEFAYLGEITTP SRCGRMDQACAYGNRPIAMIFDGSYTDLIELKVPKNLFFVIVDLGASKNTQKILTQLN ECYPFATNEVQTNVQKYLGSISYQITQAAVDALQKGDAEQIGLLMKQAQGEFDIHLMP ACPEELTAPVLHQLLEYEPIQPYIFGGKGVGSQGDGTAQFLVKDEESQQKVIEIIEYN FPQMQSLKLTIYATK" BASE COUNT 4123 a 2800 c 2893 g 4173 t 10 others ORIGIN 1 cggtttatct gcgtttgggg agaaagacgc ggatttcttt tttggtaggg agaaatttat 61 tgcttcttta gtggaagcag taaacagcaa accgctagtt cctgtagttg gtgcttctgg 121 gagtggaaag tcttctgttg tgtttgcagg attgattccc catttaagaa ctgttagaaa 181 tgtagagata atcagttttc gtccggggaa aaatcctttt gatgctttgg ctgtcgcact 241 gagtaaatat tgtcagtccc tagtacaacg acaaccaaaa acatcgggag aagcggacaa 301 taggctggct gaactggaat ttgaggtaaa tctgcgacat gatgaaacag tattatttca 361 ttttctcgaa aatattataa acacctcagg attgcagcgt ttagtgttag tggcagacca 421 gtttgaagaa ctttacaccc ttgctgccca ggaagaacgt cacagttttt tgaatgtact 481 gcttttggct gtaaaatgta ttccagcatt gacgctggtg ttgacgctgc gagcggattt 541 tttgggaatt gtgcttgatt atcaaccaat ggggaaggct ttgcaagaat acaccccatt 601 gttactgact ccaatggaca aggaggaatt gcgagatgcc attgaaaaac cagctttaaa 661 aatgaaggtg gaattggagg aggggttgac tagtaaactt attaataatg taggcgatcg 721 ccctggtcgt ttacctcttt tggagtttgc tttaacccaa ctttggtcaa aacagaaaaa 781 ttggtatttg actcacaagg cttatgaaga aattggcggt atagaaaaag ctttagcaaa 841 acacgctgat gaagttttaa aaaacctatc tgaggaagag aaacaacaag cacaaagggt 901 atttattcag ttagtgcgtc caggggaagg gacggaagac actaggcgtg tggcgactcg 961 caacgaggtg gggaaggaaa attggggttt ggtgcaacaa ttagctgatg cgcgtttgct 1021 ggtgactggg tgggatgaaa ctgagaaaat agaaacagta gaaattgtcc atgaagcgtt 1081 gattcgggaa tggagaacgt tgagggagtg ggtgagtgct aaccgtgagt ttcggatttg 1141 gcaagaaaga ctcaagcaag aaatgcgtga ctgggaaaat agtgatcgaa acccagaaac 1201 cttattgcaa ggaacacggc tagcagtagc agaagattgg tacaaacaac ggagagacga 1261 
actaacacca cgagcgcgac gctttattaa cgctagtatt aaatggcgca aacaagagca 1321 gcaaaaacag aggcgcagac gacagctaac tattttcggg cttactggtg gtttggtggt 1381 aaccttgatg ctagctgggg cagcttggtg gcaatggcag aattcagcaa agagtgaaat 1441 caaagcgatt agtgaatcct cagcagcact gtttgcttca aatcaaaagt tggacgcgct 1501 taaagaagca attagggcga ggcgaaaact acaaaagtta ctaggggtag acgccgacac 1561 tcagcatcag gttgaattgg tgctgcagca agcagtttat ggggcggtag agtacaaccg 1621 cttgttaggg cacaacggtg aagtgaaaag tgcagttttc agcccagatg gtaacataat 1681 cgcctcgacg agtgacgaca gtagcgttaa actatggagt aaagatggca tattgctcag 1741 aaccattaag gggcataaga ctactgttta caaagttgca ataagcccgg acggtcagac 1801 tctagcttca gcaagtgcag acaagactat taaattatgg aagcgtgacg gcacgttgat 1861 aactactctt aagggacatg aaggtggggt tttaaacgtt gccttcagcc ctgacggcaa 1921 cacaattgct tcggcaagtg acgacagtag tatcaaactc tggagagtgt cgggcaaggt 1981 atctgcctta ttgactacct taaaaggaca tggaacttca gttcaggcag ttgcttttag 2041 ccccgacgga agtaccatcg cctcggctag tggggacagc accgtcaaac tatggaataa 2101 ggacggcacg ttgctgacca cgcttgctgg gcataaaagt atagtttggg atgttgcaat 2161 aagtcctgac ggtaagactc tagcttcagc tagtgcggac agcactgtca aattgtggaa 2221 caaggacggc acgttgttaa aaactcttga aggacatcaa ggtccagttt cgagtattgc 2281 tttcagccct gatggtaaga caattgcttc ggcaagttgg gataatacta ttaaactgtg 2341 gaacaaaaac ggtgagttgt taactacgct taatggacat agcgatagag tgtggggagt 2401 cgcttttaac cctgacagca aaattctagt ttcggtcagt ggggacaaga ctatcaaact 2461 ctggaaacta gatagcgcgt tgctgactac cttcagaggt catagcgcag cagttatcgg 2521 agttgcaatc agtcctaacg gtaaggcaat tgcctcagca agtgatgatg gaacggttaa 2581 actctggaag caggataaca ttttacctgt gacgctcgat cacaaaagtg cagtttatgg 2641 agtagttttc agtcctgaca gtaatacaat cgcgtcggtg ggcttggaca gtgcagtcaa 2701 attgtggaat aaaaacggca ctttgctaaa tacttttaag ggacacaaag ctgggaattg 2761 gggagttggt tttagccccg atgggaacac tatcgcttca gcgggttggg acagcacggt 2821 gaaactgtgg aacaaaaacg gcactttgct gaaaactctt gaggggcatc tacaaccgat 2881 ttgggatgtt acttttagtc ctgacggtga gatgattgcc tcggcaagtg cagacaaaac 2941 ggtgaaactg 
tggaacaaaa atggtacatt gctgaaaact cttgaagggc acaaagctgg 3001 agtttgggga gttgcaatca gccccgacgg caatacaatc gcctcggcaa gtgaggacaa 3061 gacggtgaaa ctttggaagc cagacggcac aaagctgacc actcttgagg gacatggggg 3121 tacggttttt ggagttgcaa taagccctga cggcaagatg attgcttcgg cgagtgcaga 3181 caacacggtg aaactgtgga aaatggagac aggcaagtta gctattttac tggctactct 3241 caatgggcat aacaaaaggg tttggggagt tgctttcagc cctgacggta aaaagctcgc 3301 ttcggcgagt gacgacaaga cagtgatttt gtggaattta gctcgtgttg ttgatcaaga 3361 taaggtaatg gcgtatggtt gcgattgggt gcgtgattat ctgaagaata atcgtgatgt 3421 gagggagagc gatcgccgtc tctgtgatgg cattggcact cagtagtttc aggtaacgct 3481 ttttgaaaga tattttctta actcaacctt agccgcctgc acggctcgat ttctggttag 3541 cagtgcaggc aatttattag acctctccag aaattaaaca ttagcccaga cagcgcaaga 3601 gtttgagagg aattttcgga gatgtctaga gtgttcgtta cgaatttgtg caattttgta 3661 ctaagtttaa gtgtggagag atgagcggga caaggggcat tgcgtcatct tactacgtta 3721 tactaaatcc ggatctaata ccccctttat tatttagtcc gcgcaggcgg acttcgtttg 3781 tgtagccgcg atttctaatc gcctgggtta aaagggggtt ataaaagcgg atttagtatt 3841 atgtacccca tctcccccga aaaaatcact ctcgaaactt tctccctgtg taaaactgtt 3901 agcaagccga cgtttctatt caattaacct tttgaaactt gagtatagat tccttaccct 3961 cggaagtctc aatttcaaaa tcccatgttc ctttcccatg agttgctgac cacttcttaa 4021 agttttcatg acatttttta acctcagata ggagattatc tttacttacc cctaactcct 4081 tagctagttc ctctgtgatg agtttagttt cttttggctt tagtggggtt gatgtagctt 4141 ttggagttaa tgtagctttt ggagttaatg tagcttttgg agttaatgta gcttttggnn 4201 nnnnnnnnga gttaatgtag cttttggagt taatgtagct tttggagtta atgtagcttt 4261 tggagttaat gtagcttttg ggcaagactc tgacacaaac cgaacaacct tattctcatc 4321 actcaaggta ttgtctttaa aaaaaaatta cgcccaaaca caacaaacaa tgcgataaaa 4381 ataagtagaa ccacatattt tattgcgcta agctttgtta gaatactggg ccaattacgc 4441 ccaattttat cgaaatccga ttgtctgtta aaaatcggta tttctgttgg ttcctggtcg 4501 ttttctatta ttagcagccc tatattttga ttattttgat cggttggtag taactgtttt 4561 aagtcattct ctcctgctag gttcagtaat agggctttgc catatccctc tttcaacagt 4621 ttggaaacaa gtgaaatcca 
agaattatta cttttctgat tgtttattcc tccccaccac 4681 attttgtatc ggagtgtgtt ctcttccttg ggttttgaaa actgaaaatt ggaaatacca 4741 cttttcaaat tttcaatcat ttcttcagga ttttcctcat tttgtcgctt gattaataat 4801 tcttctacca tattttctgg aaaacaccga tctaataaat ctcctatctc actttttttg 4861 ggaagattct taggtgctgt tttcacctcc tctagccatt ttgaaataag tttaagaaaa 4921 agatcatcat tactatcaga acgatcaagc tcgataacta aggtataccg aatggtggtt 4981 tgagttctat ccttccgttt tgaaggaatg gcactcagat atatctgagt agcaccagat 5041 gtgttttcaa tgatgagagt tgggttttca aacgctgtcc attgactata tgttctccac 5101 caggattctg atggtgattt aacaagaaat ctgtagtcaa tggacttgcc ccgtgtttca 5161 ataaatatct gagacactta attacctcta atttacaaat gttttacacg tgttccaagg 5221 tcttttgatt ggagtttttc aatcgccttg tttagctttt cgtcttcttc tgttgcccaa 5281 cgccaaacac gactgaagat accccgcttt ttattcttct ctagactagc aatttggcgg 5341 caaatagaaa tgagcagtcc atccgcgccc tttggacgag gctgacgggg aggacgaact 5401 aagtaatcgg cttgaaatcc taagcgaaca ccttcttcaa tccacctagc atccttgatt 5461 tctacacatc caatagtgtc aataggatga tattgaatta atatttctgg tttatttgta 5521 gcaccatcaa tttcttgacg tacagcattt agcaaatcct gatacaactt ctggatgcgg 5581 tcaactaaat ctgcactctt atctctctta ccaccattat cactaaagta tgtttcacat 5641 tttacaggca caagtattaa taagcctggt tcacccttct ttatccttcc cttaacccat 5701 tcacgtgcaa cttctcctgc ttgactgatt tgcagcgtag tattagcagc ttgcaattct 5761 gccttggttg aggactccat aaccacagcg gaatctattg ggactaaaag gacgatactc 5821 tccttaatcc aggcttggca gttattccaa tcattttgtc tatctgaagg gcgagattct 5881 tcatcaatcc aaccaccagg ataatcaagt atagcccaag taagtttgct gctacccaca 5941 ctcatggcta attcaattgt aaagggttct tgagtcccac tcattccacc aggattaaac 6001 tcatctgcca ttaacgaacc gcgtaattcg tcacgataac gattgatccg tgcttttgtt 6061 ttgcccactg cctctatcga aacatcagtt cctgcaagta attctttccc ttgctcaagc 6121 aaggaagtaa tcattgatgt cttccctaca cgagttgggc caacaatacc aatcttatat 6181 tcgatgactg ttgcctgatt attttctccc atttaccacc tctttaatta gaaattatga 6241 gtattttttc aagcttatca ataacccgat tcacttttgc taccctggca ttttgagcat 6301 ccatatcacg gaatgtgctt ggccaaatct 
catctctata ggatctgccc agcctcttaa 6361 attcccgctc agattctcca gagcgaatca ggctatcctc aaattgttca gcagcagcat 6421 gaagaatcag agatggtatt aaggtttcag ataaaagagc tttttctgtt tcgtaggatg 6481 cttgaatggc aagatcagac attgttttaa acaattcttt ttcatcaagt ttattttccc 6541 tcttgagatt atgtaccatt tcatcataat ttagagagtc taacttttct ctaactcgcg 6601 ggtgaaaatg actccgataa ctgatattta gtgacaagag gtcttttaca gctttttgca 6661 agctcgggca ttgctcattg ccgtttttca ggaactctac aaatttttca agagactcct 6721 taccatcttg agttccttca agaagctgac ctgtgttttt gtaaataata caactaatct 6781 ctgaccacaa ttcttgtact ttggtatcta aataattatc aattctagag tatgtttcac 6841 taatatgaac tcgaatgcga ttgcactctt ctaaagcaaa tctaactaca cttttatctc 6901 gcatcatagt ttcgtatgcg tctcgaatcc acttttcttt tccaattccg aaaccttcct 6961 cctctaccca tttcttaatt tggttgcttg tattccgaat agtttgaatc agtttagtat 7021 cttcctcttc tggatttcga gccttctgaa atagtttaag aactatttct tcttctaatg 7081 aaactacaag ctcctttcgt agcttctcca caagcttgtc gaattcttct gctgatgaga 7141 atgaatctgg tgtctgtctt tttaatccgt cccttaaatc ttttaatgct ccctgaatac 7201 gctgtattgt agttagccat tcactcttcg ctgactcaat gacttggcga tccattattg 7261 tcagtcgttg tgccaaatgc tcaagacaag gtacaagtaa ggaactacgc acgttagatg 7321 aatctttggc gttacattgc aacactcggt agtgtttatc agcagttcct tcgttagctt 7381 ggcgggtaat gtcgtctaag agtaccttga aaagttctga gttaggatca ccattgacta 7441 caatgatcac aaaatctcgt cgctgtttga ttgctccacg cgcttgatcg agaatatctg 7501 ccgcttttcc atcttcagct ttccaaaaag ctaatccttc tactgggcgc tttaccagca 7561 acaccagatc aacttcgttc ttcaaccctg ctacgtgatg ctcctccgca cttgcgtcaa 7621 gttctcccag tccaggaagg tcaatgaccg ttaaattttc aacatctgtg gcttgaaaac 7681 gacattcaat caggacatcc cgcacagcta agtaacgtcg tggacaattt ggattctttt 7741 cttcctcatt ggttgggtat gcaacccagg aacgtaatcc ttcgagactt acttcccgtg 7801 tctctccagt gaggtctgct gagtaagaag gaagggcctt ctgcattttg cggaggcgct 7861 ccaacaaaac tatagaactg tgttgagtat tatcgctcag aatattttgt tcagaataat 7921 tgaaagactg gaaatctgca agtgtagctg gacacgagga gagcttcagt tctttgtgat 7981 aaggttcaag gatttgatct cgaaatgttt caaaggtgtg 
taaggtaaga atagctcgtg 8041 aataagtggt ggaatgacgt aagcggcttc gtactgctgt aactgggata cctgttcctg 8101 taggaacctg ctcctctgtc aaaccggcaa ttgtctggag aagggtactc ttaccaactc 8161 gtgcttgccc acttacacct atattaatgg tttcgcgtaa aagccttgct tgggcgttca 8221 ttaactggtc aagactacta gcaatccact ggcgaaactg attctgttga aagtcccgaa 8281 gataaactcg tagttgatcc gttgctttag ggtgattgcc aagattgaca agtgcgctgt 8341 tcatttcgtt tagcgcattt tctacgtcct ttgcttgctg aatgcgtacc tcaataagtg 8401 gtagctgttc tcggcgcttt tgaatgacac tattgatctg gttctggata tctgccattg 8461 ttcttccaat ttaaaaagta gagctttatg gtaacaggcg ataaactctt ctagagttgt 8521 acttattttg ggagatttac tgcttaccac tcaagaatct catatgatgg tttatgagtt 8581 tgttcagaag gtaattgctt gtttctcctg cttctacttc tatataagtt caagtccaat 8641 cgacctatac acttgtgcga gataaataga agtttattct taagaaacac attttcaaac 8701 ccttatagaa caaggatttt gaggaattgt gtttctttca tatgttgttt gtgcccgaag 8761 ggatagtgag aggctcacat cttgacgcga gcgattactc tacaccagtt tgcttaacgc 8821 agcatctgta taggtttaaa aaactgaaac atatatatat ttagtgattc gaggttagtg 8881 tgagtttccg ctgacactta gaagagtgcg cctatacaaa ccaagtccgc ctagtatcgt 8941 tctattatca aacagtaatc atgaaacaga ccgtaagtct gccgatatct aggaaaaatt 9001 aaattttccc ttccccattc cccaaaagtg catatgttcc caaaatcaaa cctatataaa 9061 tatagccccc agaagatgaa tcaaaaaatc tctggtgctg cttcgctgtg aatgtcatgt 9121 ttcatcaagc gttaagtaat ttgaattact atggtatcac caacaaaata ttggcaaatg 9181 cagatacttc ctatcgggga agacgttcag caacaacacc agagggaaat ttctagagct 9241 aaggagtttt ttaaaacaca attccctcat ctaagcaaca aacccacgct atcgacagaa 9301 gaaaataaac aagttcaaac agttttgtgg gaaatttttc gtttagatga tgacatttac 9361 caacgtgcga tcgccggact ttgtctacgc tgctatgttt cacacagaat tttcatcacc 9421 tgcaaaaaca tccctcacac ctacaacgtc agtgcagaaa atcttttcag gtacacagat 9481 ttacttccct tcgttttaaa cgacgatggt aaagcactgg tgattttaga tagtgagggt 9541 aaaactcaac atatcttgaa taatcgtgat ggcacaactc aagcgatcgc aaaaggcgga 9601 gaatttttta gcgttgacat tttgcggaga tttaatccca acttaggttc taacgaaagc 9661 ttagataatt ggactaccag actcacccgt cagaatgaag aaatcaagtc 
atttttgtgg 9721 gagtttgggt tagcgactcc tagtgattgg ggactactat gtaaatctat accccgttct 9781 ttatctgggc tttttttaac agaagactat gaaattgtaa aagcttttca aacagtttac 9841 caacgagata gactaaaaac acggcaaaaa ggacgctgtt ctgaaccaac accaagccaa 9901 ctgcaagaaa tgctgtattt gttgcagcaa aaaaatatta ttatttctca gaatacatta 9961 attgatcatc tcaaacgcat agcagaagac ttacgtcaag actggcttta caaaaaaaca 10021 ggtagtacca aaactgtacc tatggaagtg tatgataatt caaccaatga ttattttccc 10081 aatccagaat taccttatca cacagaccgc gagccagaag acgtagaatt agaaaaatta 10141 caagaaatct gtaaagactt gtttgaacaa gtgttatctc aaacaatagg agaggtgatt 10201 caccagcgga ttgaaaattt gaaaaaaagt agaggttaca aaaactttgc tcaacgatta 10261 ccggaaggct tgcgacttta ttaccacgag aatatatcct taggtgaaat tggtaaaatt 10321 tggggaatcg aatggagtaa agccagacgc attatgcaac tagaaaactt tttggaaatt 10381 gtccagtatc gaacagagga gatttttttg aataaacttt tacaatcact tgataaatct 10441 cagttaacaa gaatttctca tgaacctgaa tctctaaaaa atattgttgc ggaaattaga 10501 gaatttgtct ggaatcaaac attcaaagaa gcaaaagcag aattacttag tagtaaaaaa 10561 caaaacaaaa atagcttatt tgctaaaaaa atccgcatct accttagtga ttcatcctac 10621 gcagcataaa ctaggtatat ggtagaaaaa atacattgct ttttcatgag ataatcaatg 10681 ggggagaaaa tatgagtagc tccaaaaaat ctatacttaa aacaaaaaac aggagaataa 10741 aattctctcc aaaagttgtt tggctagaat cagaaagttt tgaggaagcg cgagtcatca 10801 gcgagaataa tttcaacaaa ttcggcgaga taaaccaatg gaaaatatat ttaaacgcat 10861 tagcacgact cggttttgaa aagtatctga aagaacgaaa tccaaatata aaaattaatc 10921 aacacagtgc tgctcatcca atagatgatg tttgttatct caatctaggt gaatttcacc 10981 tttgtttaat tattgtagat aacctgattg attattttgt cactgtacca gaagaagtca 11041 tcacttcacc aaaaagggtt gctcacttct atgtattgct ggaagtttta gaagaggaag 11101 aacaattaaa tattcacggt tttttgcgtt acgaccaact cgttaaatat tgccaatcaa 11161 ttaatttgga tgcaaaatct aatagttgtt atcagctacc gctttctttg tttgaccctg 11221 aagtaaacaa cctattatta tactcacgtt tcttgtcacc aactgctatt cctttaccat 11281 cagtagctga agtcaatgat acagagatac aaaatctaac tcagactaca agcatttcta 11341 ctaaagcact agttaaccta accaattggt ggctggaagt 
ttttgaagaa ggttggcaat 11401 ctaccaaaaa tattctgaaa acgcttgata ataactatgt ttggggttat gcaagaagtc 11461 actcaagagt tgatcattat tctggggcga aaaaattaga ttttggacta ctactaaatg 11521 gtcaaacttt agctttagtt ctcaatctaa aacggttgga aaataatgaa gtcgatgtgc 11581 ttgtacaggt gattcattgc tatgaggaac atcgcaatga ggagtatctt ccgcctggtt 11641 tgaagctgaa agttactctt aatcccaaca catctgagtc agaaagtcaa gaagtcactg 11701 caaggaaagc tgataatgtc attcagttag aatttagtga agccttaggc aaacaattta 11761 aggttgaaat cagttttaaa aatgttgtag ttactgagga ctttttgtta taaatttcat 11821 cactggtgtt tcgcctaaag taggctaacc gccaaataca gcagattgca agttggtgaa 11881 atacaaagat accccacccg cgctgtcgcg caccctcccc ttagcaaggg gagggttggg 11941 gaggggtgta ttcgatttag atccaaagcg ctatatctaa attctaaaat caatcattgt 12001 ctgccgatac gcatctggta atcccgtgtc aaagcacttt cctttgacaa catatcctgt 12061 cattccctct tcttgacgca ctctatccag acaagatgtt aactgaaatt cccctcgttc 12121 ccgaaaattt tgatgaatat gttcttccaa aaagtcaaag attttcggtg tcagtacata 12181 caacccaaag atacagagaa actcatcatc tgccattccc tctacacgca gatgttgacg 12241 ggcataatca atggaaggtt tttcataaac ttgcgtgagt gaaagaaggg agttaaacaa 12301 gtgccaaact cctgtcacac aaccagcttt atggataatt tctgctgaca tgactgtcaa 12361 cccaacaaca ctttgattga cttgtttgta aacatctaaa acttgacttg cacaggattt 12421 ttcatcagca tttgatgagt aaacatggtc acctaacatc agcaaaaatg gctcatcttt 12481 tacccattct ttggcacaaa aaacggcatg accatagcct tcttgttctt cctgcgtcaa 12541 aattgtgatt ctgctaccta attcttgaag atatttgctg tattcttgat tttgaggtga 12601 aagtttgtcg aaaagttctt ttttaggtgg acttttaaaa aactctccaa aaatttcttt 12661 gtctggtggt tgcaccacaa ttccaacttc ttcgattcct gcacttatag cttcttcaac 12721 aatcgccaga atcacaggtt ttgccctgcc atctttatca ataatgggga aaagttcttt 12781 ttttacgact ttagtcgctg gaaataaccg agtcccgaaa ccagccgctg gaatgatagc 12841 ttttcttacc ttattttgtt gcataaatcg tcagttttaa agattgcatt tgtggaaaat 12901 tatattcaat aatttcaata actttttgtt gactttcttc gtctttgaca agaaattgtg 12961 ctgtaccatc tccctgggaa cccacaccct tgccaccaaa aatataaggt tgaatgggtt 13021 catactcaag aagttgatgc 
agtacgggag cagttaattc ttctggacaa gcgggcatta 13081 aatgtatgtc aaattctccc tgtgcttgtt tcatgagaag accaatttgc tcagcatcac 13141 ctttttgcaa agcatcaacc gctgcttggg ttatttgata gctgatagaa ccgaggtatt 13201 tctgcacatt cgtttgtact tcattagtag cgaagggata gcactcattg agttgagtga 13261 gaattttttg agtgtttttg ctcgcaccaa gatccacaat cacaaagaac aaattcttcg 13321 gtacttttaa ctctatcaaa tcagtgtaac tgccatcaaa tatcatggca ataggacgat 13381 taccgtaggc acaagcctga tccattcgtc cacatcgaga aggcgtggta atttccccca 13441 ggtatgcaaa ttccatttcc tgtcggagag tcatgtttaa gtcatacacc tgattgaatg 13501 accgggctac taacacgcaa atagcagcac tggacgatag ccctttcttg agtggtaaat 13561 ctgtgaggaa attgtcaatc tctagtccac ctacagcgaa ctcagcgaga aattgataag 13621 caacgccagc tgcataacta aaaaaaccgc cctttttagc ttcggttagt aaagcctcct 13681 tttccattgg cagaatcagg gtttgacggg ttccatcact aacacaggtg tgaagaataa 13741 actgggttgg atgaggctga acatgagcag aaagtccctg attggtacca acaagcagcg 13801 tgtaacctac ctccaactta ggattcagac gacgatatcc tcccgcccaa tcgctgtgtt 13861 cgccaaatag acaaagacgc ccaggaacga aaatgtgcat tcagtctacc cgaaattttc 13921 ccactcttta ctaagaaatt actataactt gatgaaatta aaatcaatgg gttaacacta 13981 acacatgaat acctgtctc // LOCUS NODE_2423_length_13995_cov_4.79655713995 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13995) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13995) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13995 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 54..965 /locus_tag="DP116_20120" /pseudo CDS 54..965 /locus_tag="DP116_20120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132059.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" gene 1101..1604 /locus_tag="DP116_20125" CDS 1101..1604 /locus_tag="DP116_20125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006199025.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20125" /translation="MTTFETRVTRTYSQEDIQQILHLAIARQADDNNKEFSYEQLREI AGELEISPETLQQAERDWLEQQGEMLQRRAFNAHRQGRFKKRFGNYTIVNAFLLSIDL LGGAGLSWSLYILLCCGFAVGLDAWNTFNSKGEEYELAFQRWRRKHQVKKFFNTVVSK WLKAWQI" gene complement(1711..2559) /locus_tag="DP116_20130" CDS complement(1711..2559) /locus_tag="DP116_20130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015169255.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="PRJNA477356:DP116_20130" /translation="MAKQLKLNDYKQQIADVYNRRSHNYDESEWHLRIAHRLVEYAQI SPGYDVLDIATGTGHVAIEVAQRVGSSGRVVGVDISTEMLTLARRKVEALSLSNVELQ FADAEALNFPVNSFDRILCANAFPLMTDMEAALRQWMQFLKPNGLVGFHALADTALVG VVIWQKVFENYGVSRELSEPTGTVEKCHNLLERAGFEAIEIKTEQYGSYISLEEAKQR WTISSYPAPKFSNTLFQLSPEQLEEIKAEFDAQLQALVTEHGIWNDGTCFFVFGRKGA NSITHQ" gene 2863..3729 /locus_tag="DP116_20135" CDS 2863..3729 /locus_tag="DP116_20135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316004.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="polysaccharide deacetylase family protein" /protein_id="PRJNA477356:DP116_20135" /translation="MSQKNQSFPFPLILAFLILFLLIKLIINKPLIPILGFHGILSAN TPTSQLRDMHYPEKDLEKILEHLVRHNYWFLTTQELYDLFLKKYHEIPKEHSNQKPIM ISFDDGYKTVHTNLLPILSKLEKKYGKKVKVVLFINPGIMQREESASTHLGCQELREG LKKGFYDIQSHGLNHKNLTTLTRRELVQELQQAQIKLRQCTQDLDPQQQVASHLAYPY GASNKQVRYYASKYYLSTYLYNDKILDYDCNQNFYEIPRIPVNRKMTFQQMLEIAEGF QQDKSLQKCEGK" gene 3918..5357 /locus_tag="DP116_20140" CDS 3918..5357 /locus_tag="DP116_20140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316005.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF5009 domain-containing protein" /protein_id="PRJNA477356:DP116_20140" /translation="MQEKTVNLKRAYALDALRGFAILAMVLSGTIRYKILPAWMYHAQ EPPPAHTFNPNLPGLTWVDTVFPIFLFCLGAAIPLALSSRLAKGFTTKQVILYILKRG FLLGVFAIILQHLRPYKINPNPTQQTWWIALLGFLILFFMFVRLSVNLQLRHYIKWLP LSASIAAIILISFLQYPDGRGFSLSRSDIILVVLTNMAVFGSLAWFFTRNNLLLRLGF LGLLIALRLSATVKQSWIAILWHASPVPWIFKFDYLQYLFIVIPGTIIGDFILNWLQT PTRNEEDEEIDFSWNQLHFFSIILMMLSICLALLIGLQTRWVWQTTLLSFVLCSISWF LFVNPVNDTERLLKSFYQWGIYWLALGLLFEPFENGIKKDPSTLSYYFVTTAIALFFL MIFTILLDIFKQRKSLQLLIDNGQNPMIAYVAFANLLLPILRLSHIEPLILEFTNTPL TGFFKGVIYTLAIACLVSLFTKLKFFWRA" gene 5590..5862 /locus_tag="DP116_20145" /pseudo CDS 5590..5862 /locus_tag="DP116_20145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952078.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 5893..6097 /locus_tag="DP116_20150" /pseudo CDS 5893..6097 /locus_tag="DP116_20150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207359.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 6098..6107 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 6162..6425 /locus_tag="DP116_20155" /pseudo CDS 6162..6425 /locus_tag="DP116_20155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207359.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(6834..7457) /locus_tag="DP116_20160" CDS complement(6834..7457) /locus_tag="DP116_20160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130134.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FMN-dependent NADH-azoreductase" /protein_id="PRJNA477356:DP116_20160" /translation="MAHILHIDSSPRGERSFSRKFSGEFITAWKNAHSGDKVTYRDIG HNTIPHVDESWIAAAFTPPDARTPELAKAIELSDTLVNEFLAADRYVFGVPMYNFNVP STLKAYIDQIVRVGRTFAVTEQGGFKGLVEGKKLLVITARGGDFSPGSFAAPYDYQEP YLRAIFAFVGITDITFINVENLGAGDEVRQQSFAKAHEAIAQAVASW" gene 7645..7998 /locus_tag="DP116_20165" CDS 7645..7998 /locus_tag="DP116_20165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_20165" /translation="MKAEAQNHSRLTCEVETTLKVIGGRWKVLIIRELMDGVKRFGEL QRALDGITQKMLTQQLREMEEDGVIDRKVYAQIPPKVEYSLTPLGESLQPILYAMHEW GVKHLFEINNKKQNI" gene complement(8060..8452) /locus_tag="DP116_20170" CDS complement(8060..8452) /locus_tag="DP116_20170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20170" /translation="MDSLDKLLAELKAEYEEQKPQQHQLKANSIKPVNKMEQKSNSLI DNLLAEVKADFEQKDLAEKLQKQQEQEQERIRQEQIQAKKLEELKKQAEDWLAKLDPL SLEGLWFERFAEGYPSKLEAAIEYLQTG" gene complement(8523..9128) /locus_tag="DP116_20175" CDS complement(8523..9128) /locus_tag="DP116_20175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488911.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20175" /translation="MLDNRDYTLIIDKSGSMATQDQKGSRSRWVAAQESTFALASKCE QLDPDGITVYLFSGRFKRYENVTSSKVLQIFQENDPSGTTDLAGVLKHATDNYFQRKA AGETKANGETILVVTDGEPDDRKAVMKVIIEASRQMDRDEELGISFIQVGTDSQATRF LKVLDDELQGAGAKFDICDTITIEDMEDMTLSEVLLNAIND" gene complement(9282..9887) /locus_tag="DP116_20180" CDS complement(9282..9887) /locus_tag="DP116_20180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317914.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20180" /translation="MMSDRDYTLIIDKSGSMSTPDQVGGRSRWEMAQESTLALARKCE QFDPDGITVYVFAGKFKRYDDVTSAKVGQIFLENDPGGTTNLAGVLQDATNHYFQRKA AGQAKPNGETILVITDGEPDDRKAVFEVVVNASRQMERDEELGISIIQVGSDPQATKF LKALDDQMQGIGAKFDICDTITLDDLEDMSLADVLMNAVTD" gene 10084..12423 /locus_tag="DP116_20185" CDS 10084..12423 /locus_tag="DP116_20185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458865.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sodium:calcium exchanger" /protein_id="PRJNA477356:DP116_20185" /translation="MQEDFRLIVDLVSVFAVAACGGLLAALLRQPVLLGYLIGGMVVG PSGLGLIKELIQVETLAQFGVAFLLFALGVEFSFTELKKVQAIALGGGGLQIALTILI TVLVCGATGAWAYLPAKGIFLGAILSLSSTAVVLKCLMERNETETPHGQVMLGILVVQ DLALGLMIAVLPALHEPGETLVVAVLLALLRIGLFAAGAVVAGIWLIPPLLRLLARTE SRELFLLGVVALCLGIALLTESLGLSIEMGAFVAGLMISEVEYADQTLTYVEPLRDIF ASLFFASIGMLIDPVFLWNNLQLILGLVTLVFIGKCLIITPLVKSFRYPLKTALIVGL GLAQIGEFSFVLASSGQALGLVSRKVYLLILGTTAVTLVLTPFVLRLVPIVFDWVESV PWLKPYLSGDGQPLGVADELPIKDHVVVCGYGRVGRNLVKLLQQHNLPVVVIDQSESR IQQLREAGVAYIYGNCVSFHVLETAGVNTARAVAIALPDPMSTRLCLKRALELSPDLD VVVRATNDKSIEVLYQLGAREVVQPEFEASLEMTTYILTDLGLSSAVVQREMQEIRNR HYLDLRPELSASEVSRDLQLATQDLNKRWYSLPSGSPLVGMTLEEADMRYLTGVSLMA IRREGGEEIDYPPVQTKLEEGDRLLVVGSDDELAALDEFAKGQVAVPGESNACQWVTI STDSPLVGKTLADLDIRNKYKVMVQAMRRDGKFIRLPDGKADLQVRDQVLLCGNLLSL NQLVRFLVPRSEIPLSIPVVKAGEAEALKEFLPRDSVLD" gene complement(12505..12714) /locus_tag="DP116_20190" CDS complement(12505..12714) /locus_tag="DP116_20190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA-binding protein hfq" /protein_id="PRJNA477356:DP116_20190" /translation="MGLDTSLPSIRQVQNLIQQAVTIEIKLLTGDILIGRIIWQDSQC MCLMNENGQQMTVWKQAIAYIKPKE" gene 12915..13757 /locus_tag="DP116_20195" CDS 12915..13757 /locus_tag="DP116_20195" /EC_number="5.1.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740247.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="diaminopimelate epimerase" /protein_id="PRJNA477356:DP116_20195" /translation="MAIEFTKYHGLGNDFILIDNRSSSEPVITQEQAVKLCDRHFGIG ADGVIFALPGENGTDYTMRIFNSDGSEPEMCGNGIRCLGAFLADLEGDAKKSDQYRIH TLGGVMTPQLMPDGQVKVDMGIPRLLAGEIPTTINPANEKVINQPLEVAGKTWDVTCV NMGNPHCITFVEDVATIPLEIIGPQFEHHPVFPQRINTEFIQVVRPDYVKMRVWERGA GITLACGTGACASLVAGVLTQRCDRKATVELPGGPLLIEWSQIDQRLYMTGPAERVFT GKID" BASE COUNT 3981 a 2871 c 2918 g 4215 t 10 others ORIGIN 1 gcccaaggta gatatttcca gtaatggaca gctttgggta ggtattatat aacggttata 61 tacaagtcgg gcgtagtctc gaaaacttca ttggttatct agatgctgtg aaattaattt 121 tagccttggg attacctatg gctatggtta ttgttggtgt cgcgagttgg tggttggcag 181 gattagcaat gcaacctatt taccaatcct acagacaaat tcaacagttt acagcagatg 241 cagcacacga gttacgaaca cctttagctg caacaggcgc aacagtggaa tcagtgctca 301 tggtaccaca actggatgaa acagaagtgc gagagactct gcaaaccata caacgtcaga 361 atctgcgact gacaactttg gttgctgatt tgctgttgtt agctcgctta gatagacaac 421 ctataccaat gcgacacgaa tgttgtctgg atgagattat aagcgattta gttgaagaat 481 ttgcagcgat ggcaatcgcc gctcatgtga acctgacatc ttcaatacaa gtccatcaac 541 ctctaaacat catcggtaat tctgagcagc tttatcgtct gatttctaac ttaattatca 601 atgcgattca atacacaccc aaggaaggca aaataactgt tgtcttagac cgcagtgacc 661 attatgctgt gattcaggtt caagatacag ggattggcat tccacaaacc gagcttgggc 721 gaatttttga tcgcttttat cgggtgaata gcgatcgctc tcgtagcact ggcggttctg 781 gattagggtt ggcgattgcc caagcaattg ttcaagcgca ccacggcaac ttgaatgtgc 841 aaagtgaatt aggcaaaggt agcactttta caattctact gccttttgat accactacat 901 ttgaaggtgt 
tcgttctatt tatcgattca aatggctgtc tcgtcgtcca cgtaaattta 961 agtaatattt ttaagtaaaa ttttatattt ttcctgtgta taccatctac ccacagcacc 1021 agctatattt gctagtctaa aactagtgct aaagcatcga aacaaaactt tttattggaa 1081 atactaaaag aaacaaagtt atgacgactt ttgaaactag agtaactcgg acttatagcc 1141 aagaggatat acagcaaatt ctccatctgg cgatcgcacg tcaagcagac gataacaaca 1201 aagaattttc ctacgagcaa ctgcgagaaa ttgctggaga attagaaatt tcgccagaaa 1261 ctctccaaca agcagaacga gactggctag aacaacaagg agaaatgcta caacgacgag 1321 ctttcaacgc tcatcgtcaa ggtagattca aaaagcgttt tggtaattat acaattgtta 1381 acgccttttt gttgtccatt gatctacttg gtggtgctgg tctttcctgg tcactatata 1441 tcctactctg ctgtgggttt gcagtcggtc ttgatgcttg gaatactttt aactctaaag 1501 gcgaagaata cgaactcgct ttccaaagat ggcgtcgcaa gcatcaggtg aaaaaattct 1561 ttaacacagt cgtgagtaaa tggctcaagg cgtggcagat ttagtcacca gtcaaaattg 1621 cagacatcgc ccgttattaa aacagttatc agttatcagt catcagtcat caattgcttg 1681 ataagtgttc agagttccct gttccctgat ttattgatga gtgatgctgt ttgctccttt 1741 acgaccaaat acaaaaaagc aggtaccatc attccatata ccatgctctg tcactaatgc 1801 ctgcaattgt gcatcaaatt cagccttgat ttcttctagt tgctctggtg aaagctggaa 1861 cagagtgtta gaaaatttgg gggctggata agaacttata gtccaccttt gctttgcctc 1921 ctctaaacta atataactac cgtactgctc cgtcttaatt tcaatcgctt caaagcctgc 1981 tcgctcaagc aggttgtgac atttttcaac tgtacctgtt ggttcgctca actctcgcga 2041 aacaccatag ttttcaaaaa ctttctgcca aataacaact ccaactaaag ctgtatccgc 2101 aagtgcatga aaaccaacta acccattagg tttgagaaat tgcatccatt ggcgtaatgc 2161 agcttccata tctgtcatca aaggaaatgc attcgcgcac aaaattcggt caaaactgtt 2221 gactggaaaa ttcagggctt cagcatccgc aaattgaagt tcaacattgc ttagacttaa 2281 cgcctcaacc ttgcgccgag caagagttag catctcagtt gaaatatcca cacctacaac 2341 tctaccagaa gagccaactc tttgagcaac ttcaattgca acatgacctg ttccagttgc 2401 aatgtccaaa acgtcatatc cagggctgat ttgtgcgtat tcaaccagac gatgagcaat 2461 cctcaaatgc cattcacttt catcgtagtt gtgacttctg cgattataca catccgctat 2521 ctgctgcttg tagtcattta acttaagttg tttagccatg acattggttt gaggtgcgac 2581 tgttttttag aacctagact 
tatgataaat gctgtttggc aagtgcgaca cttcggtcat 2641 gctgtaatac aatagacctg cctcaaaaat ctacttttat ccttccttgt gagctttagc 2701 tgatgaagtc tcctacaaaa gtcaattttt tggaacagaa aatcacagat aaacacagat 2761 aaattatctg tatttatctg cgtccatact cgtccatctg cggttaaaaa atttccctaa 2821 aactcgttac atctaccatc caatagaaca aaattaatca aagtgagtca aaagaatcaa 2881 agctttccct ttcctttaat cctagcattt ttaattcttt ttttgctcat caagttaatc 2941 attaataagc ctctcattcc tattttgggt tttcatggca ttctttctgc taatactcct 3001 acttctcaat tgcgagatat gcactaccca gaaaaagatt tagagaaaat cttagagcat 3061 ttagttcgtc ataactattg gtttttaaca actcaagaat tatatgattt attcttaaaa 3121 aaatatcacg aaataccaaa agagcattcc aatcaaaagc caatcatgat ttcatttgat 3181 gacggatata aaacagtaca cacgaactta ctacccattt tgtctaagct tgaaaagaaa 3241 tacggtaaaa aagtaaaagt agtcttgttt atcaacccag ggattatgca acgagaagaa 3301 agtgcctcta ctcacttagg atgccaggaa ttgagagaag gtttgaaaaa aggtttttat 3361 gatattcaat ctcatggctt gaatcataaa aacttaacaa cactgactcg tcgcgagtta 3421 gttcaagaac tccagcaagc ccagattaaa ctgagacaat gcactcaaga tttagatcca 3481 caacagcaag tagcatctca tctcgcttat ccttacggag cttctaataa acaggtgcga 3541 tactatgcat ctaaatatta tttatcaaca tatctctaca atgacaaaat actcgattac 3601 gactgcaatc aaaacttcta tgaaattcct cgtataccag ttaatcgaaa aatgacattt 3661 caacaaatgc tggaaatagc tgaaggtttt cagcaagaca agagtctaca aaaatgtgaa 3721 ggtaaatagg caggatgaaa cattttccgt gactcctaag aaacacaggg gtgtaggggt 3781 gtaggggtgt aagggtgtaa gggaaaaagg tgtttcttga aagcgcccca ctgacggcgc 3841 accgcccgtt ataaatgaaa gaaacactct ttccctgtta agagttccct gttaagagtg 3901 tttcttagga gatttccatg caagaaaaga ctgtcaacct aaaacgtgcc tacgccttag 3961 atgcactgcg tggatttgca attttggcaa tggttttgtc aggcacgata aggtataaaa 4021 ttttgccagc ttggatgtac catgctcagg aaccaccacc tgctcataca tttaatccta 4081 atctgcctgg gttaacttgg gtagatactg tatttccaat ttttttgttt tgtctgggag 4141 cagctattcc tttagcgttg tcgagtcgtc ttgctaaagg atttacgaca aaacaagtta 4201 ttttatacat tcttaaaaga ggatttttat taggagtatt tgcaataatt cttcaacatc 4261 tgagaccata taaaatcaat ccaaatccaa 
ctcaacaaac atggtggata gctttgttgg 4321 gttttttaat actctttttt atgttcgttc ggttatctgt caatttacaa ttgaggcact 4381 atataaaatg gcttcccctt agtgcttcaa tagcagcaat tattcttatc tcttttcttc 4441 aatatccaga tggacgtgga ttctcacttt caagaagtga tattatccta gtcgttctga 4501 caaacatggc ggtttttggt tcacttgcct ggttttttac cagaaataat ttgttgctac 4561 gtctaggatt cttaggctta ttgattgctt tacggctgtc ggctactgtt aaacaaagtt 4621 ggattgctat actgtggcac gcttcacctg tgccttggat ttttaaattt gattatttac 4681 agtatttgtt tattgttatt cctggaacta ttataggaga ttttattctc aactggctgc 4741 aaactccaac tagaaatgag gaagatgaag aaattgactt ttcttggaat cagctacatt 4801 tttttagcat tattctcatg atgttgagta tttgtttggc actactcatt ggcttacaga 4861 ctaggtgggt gtggcaaaca acattactga gtttcgttct ttgttcaata agttggtttt 4921 tatttgtcaa tccagtgaat gatacggaaa ggttgctgaa atcattttac caatggggaa 4981 tttattggtt ggcacttggc ttgctttttg agccatttga gaatgggata aagaaagacc 5041 cttcgacgtt gagttactac tttgttacca cggcgatcgc actttttttc ctaatgatct 5101 tcaccatact cctagatatc ttcaaacagc gaaaatctct tcagctactc atagacaatg 5161 gacaaaaccc aatgattgcc tatgtagctt ttgccaatct tctcttgcct attctgagat 5221 taagtcacat tgaaccgttg attttagagt ttacaaatac ccccttgaca ggttttttca 5281 aaggtgtcat ttatacgtta gcgatcgcct gtcttgtcag tctttttact aagttgaagt 5341 tcttttggag agcataatcg ttaaatatac tgaaatatac ttatcataaa agttctttct 5401 tttactataa tatcggttat tataatttcg agtcactcaa atcttgcacc tcatccatag 5461 tacgccttga actcaagttc gggaattgat ttattggcgg acgaaaatga tggtgcaaga 5521 tttgaagtaa taaattaaca aacaacagaa gcttcaacgc ttcaaactgt gctactagga 5581 gatgcatcaa tgccaacaat cactttaaga gtgaaaacaa atacgatttt caagcaggac 5641 tggcggctac aatcaaatga tcctcagcta caacaacaaa acaagtattt agctcaagca 5701 ggtcagcaac ttcgcgttac gtccattgac cgtaatgctg agaaatatgg tggcgatcat 5761 tggctagtca ctttcgagca accacttcag cctaatcaag gtacagcgaa aagtacttgg 5821 tacgtttacg cacctcatgt agaagaatta tcaagtgttc catcaagttc agtcacttta 5881 acaccaagga caactacaac tttcaaacaa gactggcggg aacaatcgtc aaatttagca 5941 ccacaagaca agtatatagc cacaccaggt cagcaatttc 
gcgtttcatc tattgacaat 6001 aatgccctca gatatggcgg tgatcactgg aaagtcacat tcgtgcagcc acttcaaccc 6061 aatcagggac aagccaagag tacctggtac gtttatgnnn nnnnnnngtg cagccacttc 6121 aacccaatca gggacaagcc aagagtacct ggtacgttta tgtgcctgat gtaaaattat 6181 taactaatac atcaacctca gctttcctca gagtcagaac aaatacaact ttcaagcaag 6241 actggcgaga acaatcgtca aatttagcac aacaagacaa gtacgcagcc acagcaggtc 6301 agcaattgcg tatttcatct attgatcgca atgcttttag atatggtggc gaccactgga 6361 aagtcacatt cgtgcagcca cttcaaccca atcagggaca agccaagagt acctggtacg 6421 tttatgcgcc tgatgtcata tacgaatcgc ctttctctca gtctaatacg gagacagtgc 6481 ctaatcatat caacatcatc cagtatctgc ggcagcaaat acaggggaac acctctatta 6541 agcctgcagg ggaagtaagc ctgctttctc cctcctcgat ttttctcctc tgctttctct 6601 gtacctctgc ggttcgtttt aaaaagtttt gcaaaagtca aagccctacg ggcagggtgt 6661 aggggtataa gggtgtgagg gtgtaagggt gtaagagtgt aagggtgtga gggtgtaggg 6721 atccctccgg gcgtctacat gtgtgtgcat ctgcggtttg aaattatagt tatatgactt 6781 tcaccgattt ccatagtagg gtaggcatta caatgctcac cctacttaag agattaccaa 6841 ctagccacag cttgtgctat ggcttcatgt gcttttgcaa aagactgctg gcgaacttca 6901 tcacctgcac cgaggttttc cacattgatg aatgtaatgt ctgtaattcc cacaaatgca 6961 aaaatcgctc tgagataggg ttcttgataa tcgtagggtg cagcgaagct tcctggagaa 7021 aagtcaccac cacgagcagt gataactagc aactttttgc cttcaaccag ccctttaaag 7081 ccaccttgct cggttacggc gaaggtgcga ccaacgcgaa caatttggtc aatataagct 7141 ttcaacgtag aaggtacgtt gaaattatac atcggcacgc caaagacgta gcggtcagcc 7201 gctaaaaatt cattcactaa ggtatctgac aattcaattg ccttagctag ctcgggtgtg 7261 cgagcatctg gtggtgtaaa agcagcagcg atccacgact cgtctacatg aggaattgta 7321 ttatgaccaa tatcccggta ggtgacttta tctccggaat gggcattttt ccaagctgtg 7381 ataaactcac cagagaactt gcgggagaaa gaacgttctc cacgaggact agagtcaata 7441 tgcaagatat gtgccatgaa ttaattctca atagcaggca atagatttag gattggttac 7501 taggcttgtt ggttacgcat aaacgcaact acgaactagt cccatgattt gtagttcaac 7561 tttttatccg tactaacttt aaactatgta aagttaagat gagaagtagg cacttaaaag 7621 taagctagtt accaaaaaga aactatgaaa gctgaagcac aaaaccatag 
ccgactgact 7681 tgtgaagtag aaaccacact aaaagtcatt ggtggacgct ggaaggtttt gattattaga 7741 gaattgatgg atggtgtgaa acgctttggt gaattacagc gagctttaga tggaattact 7801 caaaaaatgc tgacccaaca actcagggag atggaggaag atggggttat tgatcgcaaa 7861 gtttacgcgc aaattcctcc aaaagtagag tattccttaa cacctttagg agaaagtctt 7921 caaccaattc tctatgcaat gcacgagtgg ggtgtcaagc atttatttga aataaataat 7981 aaaaagcaaa atatttgagc actggacttt tttcgttaat tcttaataat tgtaaggtac 8041 acaaaactca gaattttatt caccctgttt gtaaatattc aatcgcagcc tctaattttg 8101 acggatatcc ctcagcaaat ctttcaaacc aaagcccttc taaggacaag gggtctaatt 8161 tagctagcca atcttcggct tgctttttca actcctctaa ttttttcgcc tgaatttgtt 8221 cttgtctgat tcgttcttgt tcttgttctt gctgtttttg caatttttcg gctaaatctt 8281 tttgctcaaa atcagctttg acttcggcta aaaggttatc tattaaggaa tttgattttt 8341 gttccatttt gttgactggt ttgatggaat ttgcttttag ctgatgctgt tggggttttt 8401 gttcctcata ttcagctttc agttcagcta aaagtttatc aagagaatcc atgatatgaa 8461 atcctccgta ctagtccaca ttcagtttgc tcctcaattc aatgaaaagt catcaaagat 8521 gcttaatcat taatagcatt gagcagtact tctgataaag tcatgtcttc catatcttct 8581 atggtgatgg tgtcgcagat atcgaacttt gcaccagccc cttgcaattc atcatctaat 8641 actttgagaa agcgagtagc ctgggaatct gtacctactt gaataaaaga aataccaagt 8701 tcctcatcgc gatccatctg gcgagaagct tcaataatca ccttcataac cgctttacgg 8761 tcatctggtt caccatcagt cacaactaaa atagtttcac catttgcctt agtttcaccc 8821 gcagctttgc gttgaaagta gttatcagtc gcgtgtttca gcacacctgc caagtcagtt 8881 gtaccagaag ggtcattttc ttggaaaatt tgcaacacct tacttgatgt cacattttca 8941 tagcgcttga agcgtccaga aaacagataa acagtgatac catctggatc aagttgctcg 9001 catttactcg ccaaggcaaa agtagattct tgtgcagcaa cccatctact tctactaccc 9061 ttttggtctt gagttgccat actgccactt ttgtcgataa ttaaagtata gtcacgatta 9121 tctagcattt ttcaattctc cctatcagtt atcagttata ggactcatat ttgattttta 9181 acgaagctag gtacactttc tgttccctgt tccctgttcc ctgttccctg ttcccttatc 9241 actcatcagt catcaataac gaacaaatga caaaaaactg attaatcagt caccgcattc 9301 attaacacat cagcaaggct catatcttca agatcatcca aggtgattgt gtcgcagatg 
9361 tcaaatttag caccaatacc ttgcatttgg tcatccaaag ctttgagaaa cttggttgct 9421 tgaggatctg aacctacttg aattatagaa attcccaatt cttcatcgcg ttccatctgg 9481 cgcgatgcat taacaaccac ctcaaatact gctttgcggt catctggttc accatcggtg 9541 attactagaa ttgtctctcc attgggcttg gcttgacctg ctgctttgcg ctgaaagtag 9601 tgattggtgg catcttggag tacacctgct aaatttgtcg tgccaccagg atcattttcg 9661 agaaaaatct gccctacttt cgctgaagtg acatcatcgt agcgtttaaa tttaccggca 9721 aatacataaa ctgtgatgcc gtcagggtca aattgctcac acttccttgc caaagcgagt 9781 gtagattctt gagccatttc ccagcgactt ctaccaccaa cttggtcagg agtggacata 9841 ctaccgcttt tatcaataat caatgtgtaa tcgcgatcgc tcatcatagt atccctttaa 9901 ttgttaacct tgctaccttg ttgttctagt ctagctattg gattgtagtt ttaacagtta 9961 tctgtttagc tacaggcata gggcatactt taaatagcta aataaaaaaa gccatttgat 10021 actttaagag aaaaaacttt gtatcctgag gttaataggt cttaacagtt cttttataag 10081 cttgtgcagg aagattttag actcatagtt gatttagttt cagtttttgc cgttgcagcc 10141 tgtggcggac tcttagcagc gcttttaaga caacccgttt tactagggta tctcattggt 10201 gggatggtcg ttgggccatc cggactggga ctcattaaag aattgattca agtagaaacg 10261 ttggctcagt ttggagtcgc ttttttacta tttgccttag gtgttgagtt ttcctttacg 10321 gaactgaaaa aagtccaggc gatcgccctt gggggaggag gactccaaat tgccctgaca 10381 attctcatca cagttttagt ctgcggggct acaggggctt gggcatatct accagctaaa 10441 ggcatctttt taggggcaat tttgtcgttg tcttccacag cagttgttct caagtgtttg 10501 atggaacgta atgagacgga aacgccccac ggacaggtga tgctgggaat tttggtcgtt 10561 caagatttag cactaggact gatgatcgca gtcttacctg ccctccacga acctggagaa 10621 acacttgttg tcgcagtttt gttggcactg ttgcgaattg gtttatttgc tgctggtgca 10681 gttgttgcag gaatttggct catccctcct ttgttgcgac tgctagcccg tactgaaagc 10741 cgagaattat ttttattagg tgtcgtagca ctgtgtttgg gtattgccct actgacagag 10801 tctttggggc tttctattga aatgggggcg tttgtcgctg gcttgatgat ttcggaagtg 10861 gaatacgctg atcaaaccct tacctatgtt gagccattgc gagatatttt tgccagttta 10921 ttttttgcct ccattgggat gttaatcgac ccagtatttt tgtggaacaa tctgcaatta 10981 attctgggat tagtgacact ggtatttatt ggtaaatgtt taattattac 
gccactggta 11041 aaatcgttcc gctacccgtt gaaaacagcg ttaatcgttg ggttgggact ggctcaaatt 11101 ggagaatttt cctttgttct cgcaagctca ggacaagccc tggggctggt ttcacgaaaa 11161 gtatatttat tgattttggg aacaacagca gtaacattag tgcttacccc ttttgtgctg 11221 cgtttggttc caattgtatt tgactgggta gaatcagtac cttggttaaa gccctattta 11281 agcggggatg gtcagccgtt gggagttgct gacgaactac caatcaaaga ccatgtggtt 11341 gtctgcggct atgggcgagt gggacgcaat ttggtgaagt tgctccagca gcacaatttg 11401 cctgttgtgg taattgacca gtcagaaagt cgaattcagc agttgcgcga agctggggtg 11461 gcttatattt atggtaattg tgtgagtttt cacgttttag aaactgctgg agtcaacaca 11521 gcacgagcag tggcgatcgc acttcctgac cccatgagta cccgtctttg cctcaaacgc 11581 gctttggaat tgtctcctga tttagatgtt gttgttcgcg ccactaacga caaaagtatt 11641 gaggtacttt atcaactggg agcacgggaa gttgtgcaac cagagtttga agctagtttg 11701 gaaatgacga cttacatatt aactgatttg ggcttgtcat ctgctgtggt gcagcgagaa 11761 atgcaggaaa ttcgcaatcg tcattatttg gatctgcgac cggaactatc tgcgtctgag 11821 gtttcccgtg atttacaact cgcaactcaa gacctgaata aacgttggta ttctttacca 11881 tctggttcac ccctagtcgg tatgacttta gaagaagcag atatgcgcta cttaacagga 11941 gtgagtttga tggcgattcg tcgtgaagga ggtgaggaaa tagattatcc tccagtgcag 12001 accaagttgg aagagggcga tcgcctcttg gtagttggtt ctgatgatga attggcagct 12061 ttagatgaat tcgccaaagg tcaagttgct gttcccggag aaagcaacgc ttgccagtgg 12121 gttactatca gtaccgatag tccactggtt gggaaaaccc ttgcagattt ggatatccgc 12181 aacaaatata aggtcatggt gcaagcaatg cgacgagatg gcaagtttat ccgccttccc 12241 gatggtaaag cggacttgca agtccgcgac caagttttat tgtgtggaaa tttgctttct 12301 ctgaatcaac ttgtgcggtt ccttgtccca agaagcgaaa taccgctatc tatcccagtg 12361 gtgaaagcag gcgaagcaga agcactcaaa gagtttttgc ctagggatag tgtgttggat 12421 tagtcattag tcattactcc cgaaccgcta tattgcgggt taagggagtg tgcaaccaat 12481 cgtttattag caaaagacaa atgactattc tttaggctta atataagcaa tagcttgctt 12541 ccaaacagtc atctgttgac cattttcatt cataagacac atacactgag aatcctgcca 12601 tataattctc ccaatcaata tgtcgccagt tagcagttta atctcaattg tgactgcctg 12661 ctgtattaaa ttttgtactt gccgaatgct 
aggcagtgag gtgtcaagtc ccataattga 12721 agaaggggtg taagggtata agggtgtagg ggtgtaggga ataaatcagt cggtcgaggt 12781 gcttccactg attttagacc accaaatcgt agatgttggt gggttgaggt acccttatcg 12841 taagtcagta ataactgaca aaagacatta cactaatgac gtaaaaactc aaatataatt 12901 catcgacttt gacaatggca atagaattta ctaagtatca cggtctgggc aatgacttca 12961 tattgattga caatcgctcg tcatcagagc ctgtcatcac tcaagagcaa gcggttaagt 13021 tgtgcgatcg ccactttggc atcggtgcag atggtgttat ttttgcctta cctggagaaa 13081 acggtactga ctataccatg cggattttta attctgatgg ttcagaacca gaaatgtgtg 13141 gtaacggtat tcgctgtcta ggtgcttttt tagctgattt ggagggcgac gctaaaaaaa 13201 gcgaccaata tcgcattcat actctaggtg gtgtgatgac accccaactc atgccagatg 13261 gtcaagtgaa agtggatatg ggtatcccca ggctacttgc tggtgaaatt cccacgacta 13321 tcaacccagc taacgaaaaa gtgatcaatc aaccgctaga agtcgcaggc aaaacttggg 13381 atgtcacctg tgtcaatatg ggaaatcctc actgcatcac ttttgtagaa gatgttgcaa 13441 caattcctct agaaattatt ggtcctcagt ttgagcatca cccagttttc cctcaacgaa 13501 tcaatactga atttatccaa gtggtgcgtc ctgactacgt gaaaatgcgt gtatgggaac 13561 gaggtgcggg tataacttta gcttgtggga cgggcgcgtg tgcgtcgtta gtggcgggtg 13621 tgttgacaca gagatgcgat cgcaaggcta cggtggaact tccaggagga ccattgctga 13681 tagaatggtc ccaaatcgac caacgccttt atatgacagg accagcagag cgggttttta 13741 ctggcaaaat tgactaaaac taggggtgta agggtgtgag ggggtaaggg tgtaagggga 13801 ataaacaact gggggtgtag gggtgtaagg agaataaaca acagttccaa ataaaatgca 13861 tttccgcctg ttcccctata ccccaatacg gttgctgtta gtggtattta gttttgtaag 13921 atcccccacc aaagttaaaa agcttgcttg gttccctcct ttttaaggag ggctagggag 13981 gatcaatcct taacg // LOCUS NODE_2432_length_13958_cov_5.24088313958 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE   1  (bases 1 to 13958)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 13958)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..13958
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(61..678) /locus_tag="DP116_20200" CDS complement(61..678) /locus_tag="DP116_20200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875968.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_20200" /translation="MVRERVTEIERDSSLEKVERILQGAMHEFLLHGFAGTSMDRVAA SAGVSKATVYSHFQDKQGLFKALIEKLAQERFHSIFGTEPLEGDPKIVLRRLVTKALN QMLKDEEFHAFKRVVIGESGRFPELAQLCIITLVKPTIDTLKSYLASHPELKIPDPEA TARILVGSLVHFVMTQEMMNGKEILPMESDRLIDALIHFILKSAA" gene 1003..2307 /locus_tag="DP116_20205" CDS 1003..2307 /locus_tag="DP116_20205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875969.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_20205" /translation="MLEHSNLEGSSELKPIFRLPFLLVILATITSGGISVYTVQRFQN DAKTAKQQAAPVVQVTTVTALGRLEPKGEIIKLSAPASAEGSRVEQLLVREGTKVKQG QLVAILDSRDKLGAAVAEAQEQVRVAQANLVQIKAGAKKGETDAQKAAIARIQAEQGT EVQAQQATIARLEAERDTEIEAQKATIAQLQAQLNNALAEYRRYQTLYQQGAISTSFQ DTKRLTLTTAQQKIVEAQANLKRIETSREQQLAEARANLKRIETSRKQQLAEARATLD KIAEVRSVDVAAAQAEVNRAAAAVKRAEANLRQAFVRSPQEGEVFKIRTRPGELVSND GIVEMGQNAQMYAVAEVYQSDINKVRLGQPVRLLSDSVAGELSGIVDRIDSQVLRQNV INSDPTSNIDSRIVEVHVKLDQPSTLKAAKFSNLQVKAVISL" gene 2304..3497 /locus_tag="DP116_20210" CDS 2304..3497 /locus_tag="DP116_20210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316332.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_20210" /translation="MIGLITQQIQQLSRRTPLGWLQLSHQKGRFLVALAGIAFADVLM FIQMGFQAALFDSNTRLHTAMQADIFLMSLQGRNLAYLSTFPRRRLFQAMDVPGVKSA EAMYINFLDWKNPQTLKKTGVLVVGINPNKLLFDLPDVNRQLNVLKLPDTVLFDRGSQ GDYAKIIAQIDQGKSVTTEIQRRTLTVSGLFKVGASFIADGSLITSDQNFLRLFPGQQ ASSVNLGLIQLQPGYDPKLVSKTLKSYLGSSQDVKVFTKEEFIKFEKDYWQKSTAIGF IFSLGAAMGFMVGVIIVYQVLSTDVNAHMKEYATFKAMGYRNAYLLGVVFEEAIILAV LGFLPGLTVSVGLYALTRNATGLPLIMTIARAVQVQMLTIIMCMISGAIATRKVQSAD PADMF" gene 3762..4460 /locus_tag="DP116_20215" CDS 3762..4460 /locus_tag="DP116_20215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409968.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_20215" /translation="MTNQPVISIQNLNHYFGKGQLRKQVLYDINLEINAGEIIIMTGP SGSGKTTLLTLVGGLRSAQEGRLRVLGRELCGANAQQLTLARRSNGYIFQAHNLHGSL SALQNVRMGLELHKSITPAEMKRRSAEMLELVGLGNRVNYYPDDLSGGQKQRVAIARA LVSHPKMVLADEPTAALDSKSGRDVVNLMHNLAKEQACTILLVTHDNRILDIADRIVY MEDGKLAKAPATVG" gene 4598..5233 /locus_tag="DP116_20220" CDS 4598..5233 /locus_tag="DP116_20220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318043.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_20220" /translation="MSVTIPLHAIELAPGSQIAIHNLSWQDFERLLEDLGEKRNTRIA YYRGTLEIMSPLALHERPHRIIAYIITTILEEQGRNWEDFGSTTFKRPDIAGVEPDTC FYIQNASQVKGCTQMDLTVYPPCDLAVESDVTSKTTLNAYISLRVPEVWIYSNHQLTV YILQADGYVESLLSPTFPNLPVTELIPRLVQKAIDDGTRQMLRELRALLRD" gene 5651..7486 /locus_tag="DP116_20225" CDS 5651..7486 /locus_tag="DP116_20225" /inference="COORDINATES: protein motif:HMM:PF13458.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20225" /translation="MCQNCQQVIEKLAINIELFFNDLNTLANKKKLSDLDRVIICHSL LGFSRQEIANIVKLFDQKIRDRLTNNIYPRIAELMCVDQEEIAGNWVKIINFLLNPQK GYKLNPAPQLNSDNFQGSFGRQIFLYPPNQEIVKYQIEGTHFYQQGLYYQAFQCFVMA WNKERKIYGIGNPEVLIYINNCLIEYKKSLLQDKGIKIYTLAVIVPFFHNQGHVAAEI LRGIAQIQLQVNLPSFEKISLGTEINLDDIKPSVLLTLICRHIALQILIVNEPNNLYA PYNQTAEKLADLAPQLNLIAIIGHYSSEMTKNALYFYARKGLVLVNSSSTSNELSDLS VGESLSFFRLTTQDSINAKKMADYLNKKVSNQIPKKVAIIYNQNSTYSTSYRNSLKKY LEQDKERFVFIEECSYLSENYYQVEKYLKNIRQDDVNIIIIIPDGGIEPLSLDNAGLI SRLNLNNCLIAGSATFYHDNVLHWIHEQNQCHSMNQNQRQIIACIPWHWHSQENGCDS SNSIGQSFCQIGAQLWGTENLTWRSATAFDSVLIILKILEEYQSQVSQCLLTHMNQYF KEKRKHVKGVTGLIQFERSGDRLNPPAEIVAVKWDEEQQKWKWKI" gene 7763..7927 /locus_tag="DP116_20230" /pseudo CDS 7763..7927 /locus_tag="DP116_20230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459247.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DUF1445 domain-containing protein" gene 7982..9532 /locus_tag="DP116_20235" CDS 7982..9532 /locus_tag="DP116_20235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457398.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="5-oxoprolinase" /protein_id="PRJNA477356:DP116_20235" /translation="MYTTSHSDPVRLEIFKNLYQFIAEQMGIVLQNTAASVNIKERLD FSCAIFDSSGLLVANAPHIPVHLGSMSESVRSLIDDKSDTTIPGNVYLSNNPYNGGTH LPDVTAITPVFDEDEKQIIFYVASRGHQADIGGITPGSMPPHSTTVEEEGIIFDNFLL VEEGNFRETAVREVLLNHPYPARNPDQNIADFKAQIAANTRGVQELRKMVDQYGLQTV QAYMKFVQDNAEESVRRAIDVLKDGSFIYEMDNGARIQVKVTIDRQNRSAIIDFTGTS GQLNSNFNAPKSVTQAAVLYVFRTLVDDNIPLNAGCLNPLEIIIPDGCMLNPTYPAAV VAGNVETSQTIVDALYGALGVMAASCGTMNNFTFGNDRYQYYETICGGSGAGDNFDGT DAVQTHMTNSRLTDPEVLETRYPVLLESFSLRPDSGGKGKYSGGNGVIRRIRFNEPMT ANILSNHRLIPPFGLNGGQAGLVGRNWIQRHNGTEENLDSTATTQMKSGDVFVIETPG GGGFGSIC" gene complement(9576..10703) /locus_tag="DP116_20240" CDS complement(9576..10703) /locus_tag="DP116_20240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20240" /translation="MAKNHIELRRKRYFKLSSQIAQLDNAQLRSLFDNSESNESGTGW GTNHTIVFGESKVFVKRVPVTNIEYDNLFSTRNLYNLPTHCNYNVGSTGFGIFRELVT HIKTTNWVLEEAIVTFPLMYHYRIIPFSGWQTDVDMERLKDYVESRGNSENAGNYVVD RAHANYELIMFLEYIPHILETWLQENPNKLQKPLDELRTTIDFLRKKGIIHFDAHFRN VLTDGEQIYLTDFGLVLDKSFALTKDEESFFKQNTFYDYGEVLRNLGHVIRPSYYSCS ENDKRRIMEKYGIKEGLQPYEVGSILLDNIEQIHADGIMKLDEFYVASIVKYRIIITL MQDFFSNMWGNNKKDTKFDHAKLELLLKETGFISDAETQGG" gene complement(10855..13650) /locus_tag="DP116_20245" CDS complement(10855..13650) /locus_tag="DP116_20245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecA" /protein_id="PRJNA477356:DP116_20245" /translation="MLKTLLGDPNARKLKKYQPFITEINLLEEDIKALSDEELKGKTA ELKQRLKKGETLDDILPEAFAVVREAGRRVLGLRHFDVQLLGGAILHTGQIAEMKTGE GKTLVATLPSYLNALTGKGVHVITVNDYLARRDAEWMGQLHRFLGLSVGLIQQSMTPS ERKKNYDCDITYVTNSEIGFDYLRDNMATDIKDVVQRPFNYCVIDEVDSILVDEARTP LIISGQVERPTEKYLKAAEISRALQKEEHYEVDEKARNVLLTDEGFAQAEQLLEVKDL FDPEDPWAHFVFNGIKAKELFLKDVNYIVRNEEVVIVDEFTGRVLPGRRWSDGLHQAI EAKERVDIQPETQTLATITYQNLFLLYPKLGGMTGTAKTEEAEFEKIYKLEVTIIPTN RPRRRQDLSDMVFKTEAGKWGAIAKECAEMHTVGRPVLVGTTSVEKSEYLSQLLRQMA IPQELLNARPENVEREAEIVAQAGRRGAVTIATNMAGRGTDIILGGNAEYMARLKMRE YFMPRIVKPEDEDTFNIHRAAGLPSAGSGSGQGFVPGKKVKTWKASPQVFPTSLSKET EKLLKEAVEAAVREYGDRSLSELEAEDKVAVAAEKAPTDDPVIQKMREAYNRIKREYE QYTNREHDEVVQLGGLHVIGTERHESRRIDNQLRGRAGRQGDPGSTRFFLSLEDNLLR IFGGDRVARLMDAFQVEEDMPIESGMLTRSLEGAQKKVETYYYDIRKQVFEYDEVMNN QRRAIYAERRRVLEGQDLKEQVIKYAEKTMDDIVDFYINPDLPSEEWELDKLVEKVKE FVYLLADLQPAQLEDMAMSEIKAFLHEQVRIAYDLKEAEVDQIRPGLMRQAERFFILQ RIDTLWREHLQQMDALRESVGLRGYGQKDPLIEYKSEGYELFLDMMTNIRRDVVYSLF MFQPQPQPMMETPSEMV" BASE COUNT 4108 a 3034 c 2882 g 3934 t ORIGIN 1 actgataact gataactgat aactgataac tgataactga taactgataa ctgatgcaag 61 tcaagcagcc gatttgagaa taaagtgtat caaagcatca atcaggcgat cgctctccat 121 aggaagaatt tccttaccat tcatcatctc ctgagtcatc acaaaatgaa ccaatgaccc 181 gacaagaatt cgtgccgttg cctcaggatc tggtatcttt agttcagggt gagaagccag 241 gtaagacttt agagtgtcaa tcgtcggctt tactaaggta attatacaaa gctgggctaa 301 ctcaggaaaa cgaccagact ccccaatcac cactcgctta aacgcatgaa actcttcgtc 361 tttgagcatc tgattcaatg ctttggttac caagcgccgt agtactattt tcgggtctcc 421 ctcaagaggt tctgtaccga aaatagaatg aaatcgctct tgagcgagtt tttctatcag 481 tgctttaaaa agtccttgtt tatcttgaaa gtggctatat accgtagctt tagaaactcc 541 agcagacgcc gccaccctat ccatgcttgt accagcaaag ccatgaagga gaaattcatg 601 catcgcccct tgcaagattc gttcaacctt ctctaatgaa ctatcgcgct caatttcagt 661 aactctttca cgtaccattt ttcaaaaaca ctgtcctata aacaatctta tagaagttct 721 ttggctgtct gagatgactt aatattatat 
gcaacttttt tcaatatatc taaaacttac 781 gcattgacac aaaagtgagt atatgccttc aagcccatct tctctttgag gagctgtcga 841 atttgctata gaatgctact ctatagggat tctctgaatc ccgcgcttga ggcgaaagtt 901 gttgtcctat aaagtgctaa aaataggact gaactaactg atttaattta gatatattat 961 aagactaaac ggtttagttt tacaggcaac ggaacgataa ctatgcttga acactcgaac 1021 ctggagggtt catctgagct aaaacctata tttcgtctac cttttcttct agttatactt 1081 gcaactataa catcaggtgg aattagtgtt tatactgttc aacgatttca gaatgacgca 1141 aaaacagcaa aacagcaagc agccccagta gtacaagtaa caacagtaac agccctagga 1201 cgattagagc caaagggaga aatcattaaa ctttcggcac cagcatcagc agaaggaagt 1261 cgggtggaac agttgcttgt gcgggaagga actaaggtga agcaagggca gctggttgcg 1321 attttggata gtcgcgataa actaggtgca gcagtagcag aagcacagga acaggtacga 1381 gtcgcccaag caaatcttgt ccaaatcaaa gcgggtgcca aaaagggtga aacagacgcg 1441 caaaaagcag ctattgcccg catccaggca gaacaaggca cggaagttca agcacaacaa 1501 gcaaccattg cccgcttaga ggcagaaaga gatacggaaa tcgaggcaca aaaagcaacg 1561 attgctcaac tgcaggcaca gctcaacaat gctctagcgg aataccgacg ctatcaaaca 1621 ctttatcaac aaggggcgat ttctacgtcg ttccaggata ccaagcgctt gactctgact 1681 acagcgcagc aaaaaatagt agaagcacaa gcaaacctca aacgcatcga aacatcccga 1741 gaacaacaac tggcagaagc acgagcaaac ctcaaacgca tcgaaacatc ccgaaaacaa 1801 caactggcag aagcacgagc aaccttagat aaaattgcag aagtccgttc tgtggatgtt 1861 gcagcagcac aggcagaagt taatcgtgct gctgcagctg tcaaaagagc ggaggcgaat 1921 ttaagacagg ctttcgtgcg atcgccccaa gaaggtgaag tcttcaaaat tcgtacccgt 1981 cctggagagt tagtctctaa tgacggtatt gtcgagatgg ggcaaaacgc tcagatgtac 2041 gcagttgcag aagtgtacca aagtgacatc aataaagtac gtttggggca accagtacgg 2101 ctactcagtg attctgtagc tggtgaattg tcagggattg tagatcggat tgattcgcaa 2161 gttctacggc aaaatgtgat caacagtgat cccacaagca atattgactc cagaattgtg 2221 gaagtgcacg taaaacttga tcagccatcc accctcaaag cagctaaatt ctctaatttg 2281 caagtcaagg cggtgatttc actgtgattg gactgataac ccagcaaatc cagcaactaa 2341 gcaggcgaac accactagga tggctgcaac tgagtcacca gaaggggcgc tttttggtgg 2401 cattagcagg gattgccttt gctgatgttc tcatgttcat 
acaaatgggg tttcaagctg 2461 ccttatttga cagtaacacc agactgcata ctgcgatgca agcagacatt tttttaatga 2521 gtctccaagg acgtaacctg gcatatctgt ctacattccc tcgccggcga ttgttccagg 2581 cgatggatgt accaggggta aagtcagcgg aggcaatgta tattaacttt cttgattgga 2641 agaatcccca aacgcttaag aagactggag tcctcgtagt aggaattaat cctaataagc 2701 tactctttga tttaccagat gttaaccgtc aattaaatgt tctcaagcta ccggatacag 2761 ttttatttga tcgtggttct caaggagatt acgccaagat catagcccaa atcgaccaag 2821 ggaaatctgt taccaccgag atacaacggc ggacactcac tgtgagtggc ttatttaaag 2881 tcggggcttc ctttattgct gatggtagtt tgataaccag cgaccaaaac tttttgcgac 2941 tctttcccgg acaacaagca agcagcgtga atctaggttt gattcagcta caaccaggtt 3001 atgatcctaa gcttgtgtca aagaccttga aatcttatct gggtagtagt caggatgtca 3061 aagtctttac aaaagaggaa tttattaaat ttgagaaaga ctactggcaa aaaagtactg 3121 caatcggctt tatctttagc ttgggtgcag caatgggctt catggtaggg gtgattatcg 3181 tctatcaagt cctttccacc gatgtgaatg cacatatgaa agaatacgcc accttcaaag 3241 caatgggtta tcgtaacgcc tacttattag gagtggtgtt tgaagaagca atcattctgg 3301 cagtattggg ttttctacct ggtctcactg tgtctgtggg actgtatgct ctcacgcgga 3361 atgccaccgg tttaccgcta atcatgacaa tagcacgagc agttcaggta cagatgctga 3421 caataattat gtgtatgatt tctggggcga tcgccactcg taaagtccaa tctgctgacc 3481 ccgctgatat gttctaaacc catgttctac agaaccagag gactcaaaga cgcaccggac 3541 gcggagaata actaatgact aatgactaat gactaataga catctccaga aattaattat 3601 gcgttacgtg aaaccctcgt agagaacgcc acatcgctca acgcgggaaa cccgcgcacg 3661 cgagtggctc cgttacatgt aacgtctcta cattgttttt caccagatgt ctaatgacta 3721 atgaccaatg accaatgacc aatgactaat aactaatgac tatgacgaat caacctgtta 3781 tctctattca aaatctcaac cactactttg gtaaaggtca acttcgcaaa caagtgctat 3841 atgatatcaa cttagagatt aacgccggtg aaattatcat tatgacaggt ccgtctggtt 3901 ctgggaaaac cacacttctg accttagtag gtgggttgcg ttctgcccaa gaaggtcgtt 3961 tgcgagtgtt aggacgagaa ctctgtggtg cgaatgcaca acaactcaca ctagcgcgac 4021 gcagtaacgg ctatattttc caagcacaca acctgcatgg tagcttaagc gcactgcaaa 4081 acgtcaggat gggcttggaa ctccacaaaa gtataactcc agcagaaatg 
aaaagacgct 4141 cagccgagat gctagagttg gtaggattag gcaatcgtgt caattattat ccggatgatt 4201 tgtcaggagg acaaaaacaa cgggttgcga tcgcccgtgc gctggtgagt caccctaaaa 4261 tggttctcgc agacgaaccc accgccgccc ttgatagtaa gtcgggtcga gatgtggtga 4321 acctgatgca caatttggcg aaagagcaag cttgtacgat tttgttagtg actcatgaca 4381 atcgtatttt ggatatagct gatcgcattg tctacatgga agatggtaag ttagcgaaag 4441 ctcctgctac tgttggatag cagaatatat ttttgaatac gtggtgatag atacaactat 4501 ataaaaaaca aaaacgcggg ttaatgcaaa aactcctcac tggagaatgg cgcgtcaaaa 4561 ttgagaaagc aggacagcag ttaacggaga caaaaacatg agcgtaacta ttcccctaca 4621 tgccatagaa ctcgctcccg gtagccagat tgccattcat aacctgtcct ggcaagactt 4681 tgagcgactt ctcgaagact tgggagaaaa acgcaacacc cgcattgctt actaccgagg 4741 aaccttagaa ataatgtccc cgttagcatt acatgagcgt ccccaccgca tcattgctta 4801 catcatcacc acaattctgg aggaacaagg acgcaactgg gaagacttcg gctcgacgac 4861 ttttaaacgt ccagatattg ctggggttga accagatacc tgcttttata ttcaaaatgc 4921 cagccaggtc aaaggatgta ctcaaatgga tttaaccgta tatcctccct gtgaccttgc 4981 ggttgaatct gatgttacct caaaaacaac cctcaacgcc tacatatccc tgagagttcc 5041 ggaagtgtgg atttacagca accatcaact aaccgtttac attctccaag ctgacggcta 5101 cgtagaatct ctcctcagtc ccacctttcc taatttaccc gtcactgaac ttattccccg 5161 actggtgcaa aaagcaattg atgacggaac caggcaaatg ctgcgagaac tgagagcttt 5221 gctacgggat tgacatcctc ccaccgcgct ccccttaaaa cagccagaaa gctcaggata 5281 aaagcttttt gaatgcgtga atgcgtgaat gagcgaattc caaggcagcg gtactagaca 5341 gatgctgtcg tctcaaagtt ctttttggca tcttggcgtg agataatcat cctattttca 5401 tttcatgcgg cgaaataaaa cgcacgactt ccaccagcca atgaaaagat ccaataaact 5461 ttggttttat aaggttgttt tggcatctta gcgcgagata gtctggcggg gtggttcccg 5521 agatatctgc gactcgggca cctaaagata taacttgttt caacacctgt agaaattatt 5581 tccacaccaa taggataacc tgacctgaaa atcagactat cattttgaaa aaaagtgatg 5641 atatctgaat atgtgtcaaa attgtcaaca ggtgatagaa aaattagcta ttaatattga 5701 acttttcttc aacgatttaa atactcttgc aaataaaaag aaactcagtg atctagatag 5761 agtcattatt tgtcactctt tattagggtt ttctcgacaa gaaatagcta atatagtcaa 
5821 gttgtttgac caaaaaatta gagatagact gactaataat atatatccga gaatagcaga 5881 gctaatgtgc gttgaccaag aggagatagc tggcaattgg gtaaaaatta ttaatttttt 5941 actcaatcca caaaaaggtt ataaattgaa tcctgctcct caattaaata gtgataactt 6001 tcaaggaagt tttggtagac aaattttcct ttatccacct aaccaagaaa ttgttaaata 6061 tcagattgaa ggaacccatt tttatcagca agggctttac tatcaagcat ttcaatgttt 6121 tgtcatggct tggaataaag aacggaagat ttatggaatt ggtaatccag aagtcttaat 6181 ctatattaat aactgcttaa ttgaatataa aaaatctctt ttacaagaca aaggtattaa 6241 aatttatact ctagctgtaa ttgttccatt ttttcataat cagggtcacg tcgctgcaga 6301 aattttacga ggaattgccc aaatacaatt acaagttaat ctgccaagct ttgaaaaaat 6361 ttctttaggc acagagataa atttagatga tatcaaaccc agtgttttat tgactctgat 6421 atgtcgtcac atagccctgc aaattctcat tgttaatgaa cctaataatt tatacgcacc 6481 ctataaccaa acagcagaaa agttagccga cttagcacca cagttaaacc tgattgctat 6541 tattggtcat tactcaagcg aaatgacaaa aaacgcactg tatttctatg cccgaaaagg 6601 gttagtttta gtaaattcta gtagtacatc taatgaactt tccgatttat ctgtaggtga 6661 aagcttatct ttttttagat taacgactca agacagtatt aatgctaaaa aaatggctga 6721 ttatttgaat aaaaaagttt ccaatcaaat tcccaagaaa gtagctatta tctataatca 6781 aaatagtact tacagtactt cctacagaaa cagtcttaag aaatatctag aacaagataa 6841 agaaagattt gtttttatag aggaatgtag ttatctcagt gaaaattatt atcaagtaga 6901 aaaatatcta aaaaatatta gacaagatga cgttaatatt atcatcataa ttcctgatgg 6961 aggaattgaa ccattatccc tcgataacgc tgggctgatt agccgactaa atctcaacaa 7021 ctgtctcata gctggctcag ccacttttta tcatgataat gttttacatt ggattcatga 7081 gcaaaaccag tgtcattcta tgaatcaaaa tcagcgtcaa attatagctt gtattccttg 7141 gcattggcat agtcaagaaa acgggtgtga cagttctaat tctatagggc agagtttttg 7201 tcaaatcggc gctcagttat ggggaacaga aaacttaaca tggcgcagtg caacagcttt 7261 tgattcggta ttaataattt tgaaaattct agaagaatat cagagtcaag ttagccaatg 7321 tttactgaca cacatgaatc aatatttcaa agaaaagaga aaacacgtaa agggagttac 7381 gggattaatt caatttgaga gaagtggcga tcgcctcaat cccccagcag aaatagtagc 7441 tgtaaaatgg gatgaagaac aacaaaaatg gaaatggaaa atataatttt ttaaaaacta 7501 
cctactctga tagcgttttt taattgtgct tcatacagta gagatgttcc ggcggatggt 7561 ctctacacat cgttgtgtat ctgatttcaa caagaatggc tatatttatc cacacctgtc 7621 gtcaaaaatt tccacagatg aagtagaaat actcttaatt ctattgttac tctactttta 7681 tagtcttgag agaattgata gatgttgaaa attgttgctg ttctggatgg tgcgttcaca 7741 gatattatcg ctgtgactaa attatggtga tgctgtcaca cttcgtgatg atgaggttcc 7801 cgttttttgg gcttgcggag tcactactca aactgcaatt cttcaagcca agcctgaact 7861 agcaattact cattctcctg gacatatgtt tatgactgat ttgaaagatg agtctcttac 7921 tttttaaagt tggggctagc gtagaaattt acaaaatttg tagaagctaa aattgtcaac 7981 aatgtacaca acatctcaca gcgaccccgt tcgcctagaa atttttaaaa acctctatca 8041 atttattgcc gaacaaatgg ggattgttct ccaaaacacg gcagcatcag tgaatattaa 8101 ggaaagactg gatttttcct gtgctatttt tgactcttct ggattattag tcgctaatgc 8161 cccccacatt cctgtgcatt taggctcaat gagtgaaagt gtccgcagtt taattgatga 8221 taaaagtgac accactatac cgggaaatgt gtatttatcc aataatcctt ataacggggg 8281 aacacatctt cctgatgtga cagcaataac tcctgttttt gacgaagacg aaaaacaaat 8341 tattttctat gttgcttctc gcggacacca agccgatatc ggtggaatca ctcccggttc 8401 aatgcctccc catagtacca cagtagaaga ggaaggaatt atttttgata attttctctt 8461 agttgaggag ggaaattttc gggaaaccgc agtacgggag gtactcttaa atcatcccta 8521 tcctgctcgt aaccctgacc aaaatatagc tgattttaaa gcacaaattg ccgccaatac 8581 aaggggagtt caagaactcc gtaaaatggt tgaccaatac ggactccaaa cagtccaagc 8641 atacatgaaa tttgtgcaag ataatgcaga ggagtcagtc agacgggcga tagatgttct 8701 taaagatggc tcatttattt atgaaatgga taacggggca cgcattcaag ttaaagtaac 8761 gattgaccga caaaatcgca gtgctattat tgattttact gggacatctg gacaactcaa 8821 tagtaatttc aatgctccca aatccgtaac tcaagccgca gtcttatatg ttttccggac 8881 tttggttgat gataatattc ctctgaatgc tgggtgtctt aatcctctag aaattattat 8941 cccggatggc tgtatgctta acccaaccta tccagcagca gttgtagcgg gtaacgtaga 9001 gacatctcaa acgattgtcg atgctttata tggtgctttg ggtgtgatgg ctgcgtcttg 9061 tggaacaatg aataatttta ccttcggtaa tgaccgttat caatattatg aaaccatctg 9121 cggtggttct ggagcaggag ataattttga tgggactgat gcagtccaaa cccacatgac 9181 taactctcgt 
ctaactgatc cagaagtttt agaaactcgc tatcctgtac tcttagaaag 9241 ctttagtctt cgtcctgata gcgggggaaa aggaaaatac tcaggtggaa atggagttat 9301 ccgccgcatc cggtttaatg aacccatgac agctaatatt ctctccaatc atcggcttat 9361 tcctcccttt ggattaaatg gtggacaagc cggacttgta ggacgcaact ggatacaacg 9421 tcacaatgga actgaagaga atttagacag cacagcaaca acacagatga aatcagggga 9481 tgtttttgtg atagaaactc ctgggggagg aggatttggt tcaatctgtt gatgagcccc 9541 ttgttggact gcactacagc aaaaaatgat ggcaattatc caccctgggt ttcagcatca 9601 gaaataaacc ccgtctcttt gagcagtagt tctagttttg cgtgatcaaa tttcgtatcc 9661 ttcttgttat ttccccacat attagaaaag aagtcttgca tcagcgtaat aataatgcgg 9721 tatttgacga tgctggcaac ataaaactca tctaacttca taatcccgtc agcatggatc 9781 tgttcgatgt tgtcgagtaa tatggatccc acttcataag gttgtaaacc ttcttttatg 9841 ccatattttt ccattatcct gcgtttatcg ttctctgaac atgaataata agacggtcga 9901 attacgtgtc caagattccg caggacttcg ccatagtcat aaaatgtgtt ctgcttaaag 9961 aaagactctt catctttcgt caacgcaaaa ctcttatcaa gtaccaaacc aaaatcagtc 10021 aaatatatct gctcgccgtc ggtgaggacg ttgcgaaaat gcgcgtcgaa atggataatc 10081 cccttcttcc tcaaaaagtc aatcgtcgtg cgtaactcat ccagaggttt ctgaagtttg 10141 ttggggtttt cctgtagcca tgtttctaga atatgcggta tgtattcaag gaacataatc 10201 aactcatagt tggcgtgagc tctatccacc acataatttc cagcattctc actatttccc 10261 ctggactcca cataatcttt tagacgttcc atatcgacat ccgtctgcca cccagagaac 10321 ggaataatcc gataatggta cattagtggg aaagtgacaa ttgcttcttc caacacccag 10381 ttggtggttt tgatatgcgt cacaagttca cggaagatcc caaaaccagt ggaacctacg 10441 ttgtaattac aatgagttgg cagattatag aggtttctgg tggagaacag gttgtcgtat 10501 tcgatgttcg ttactggaac acgtttcaca aagaccttgg attccccaaa gacgatggtg 10561 tgatttgttc cccagcccgt acccgactca ttcgactcac tattgtcaaa caaagaacgc 10621 aattgtgcat tatccaactg agcaatttgt gaactgagct tgaagtacct tttccttcta 10681 agttctatat gattcttagc cattttcccc cgtccaacca cagtatgaga aacactatga 10741 ataaaatacg agcagtctca cactttcagg taaaaaaagt taaaatcaac gtttttccaa 10801 tgtcacagac gtgaaatgac ctcgctacac gaggagcttc acgtctgtac aacctcaaac 10861 
catttctgac ggtgtttcca tcattggctg aggctgaggt tgaaacatga acagggagta 10921 taccacatct cggcgaatat tggtcatcat atccaagaat aattcgtacc cttcgctctt 10981 gtactcaatc agcggatctt tttgaccata accccgcagt cctaccgatt cccgcagggc 11041 atccatctgt tgcaggtgtt cccgccacag tgtgtcaatc cgctgcaaaa taaagaatcg 11101 ttcggcttgc cgcattaatc ctggtcgaat ttggtcaact tctgcttctt tgagatcata 11161 ggcaatacgc acttgttcgt gcaggaaggc tttgatttca ctcattgcca tatcttctag 11221 ttgagcgggt tgcaagtccg ctagcagata gacaaattct ttgacttttt caaccaactt 11281 gtctaattcc cattcttctg agggtaagtc tgggttgatg tagaagtcaa caatgtcatc 11341 catcgttttt tcagcgtact tgatgacctg ttctttgagg tcttgccctt ctagcacccg 11401 acggcgttcg gcgtagatgg cacgacgctg gttgttcatc acttcgtcgt actcaaatac 11461 ctgcttacgg atatcgtagt agtaggtttc gacttttttc tgagcgcctt ccaaactgcg 11521 ggtgagcatt ccagattcga tgggcatatc ctcttcgact tggaaagcat ccatcaggcg 11581 tgcgacgcga tcgccaccaa aaatccgcag cagattatcc tccaagctca ggaaaaatct 11641 tgttgaaccg gggtcacctt gtcgtcctgc gcgtccccgc aactggttgt cgatccgtcg 11701 cgactcgtga cgttcagtcc caatcacgtg caacccaccc agttgtacca cttcatcgtg 11761 ttcacggttg gtgtactgtt cgtattcgcg cttaatgcgg ttgtatgctt cgcgcatttt 11821 ctgaatcaca gggtcatcag tgggagcttt ttctgctgcg acagcaactt tatcttctgc 11881 ttctagttcg cttaagctgc gatcgccata ctcccgcaca gccgcttcca ccgcttcttt 11941 aagaagtttc tctgtttctt tagaaagcga agtcgggaaa acttgtggtg aagctttcca 12001 agttttgact ttcttgcctg ggacaaagcc ttgacctgag ccacttccag cagaaggtaa 12061 accagccgcc ctgtgaatat taaacgtatc ttcatcttcc ggcttgacta tccggggcat 12121 gaagtattcc cgcatcttca aacgcgccat atactcagca ttaccaccta ggataatgtc 12181 agtacctctt cccgccatgt tagtcgcaat cgtcacagcc cctctgcgtc ctgcttgtgc 12241 gacaatctct gcttcgcgtt caacgttctc tggtctagcg ttaagcagtt cctggggaat 12301 tgccatttgc cttaacaact ggctgagata ttccgatttt tccacactag ttgttcctac 12361 caggactggt ctaccaactg tgtgcatttc ggcacattct ttggcaattg ccccccactt 12421 gcctgcttcg gtcttaaaga ccatatcaga aaggtcttgg cgtcttctcg gtctgttggt 12481 gggaataatc gtgacttcca gtttgtaaat tttttcaaac tcggcttctt 
ctgtcttcgc 12541 tgttccggtc ataccaccga gttttggata aagcaagaac agattttggt aagtaattgt 12601 tgccagagtt tgagtttccg gttgaatatc tacccgttct tttgcttcaa ttgcctggtg 12661 cagtccatca ctccaacgcc gtcctggtaa gactctacca gtaaattcgt ctacaatcac 12721 aacttcttcg ttgcggacaa tatagttgac gtctttaaga aacagttctt ttgctttaat 12781 accgttgaaa acgaagtgtg cccaaggatc ttcggggtca aataaatctt ttacctctaa 12841 aagttgttct gcttgagcaa agccttcatc ggtgagcaga acgttacgag ctttttcatc 12901 tacctcataa tgttcttctt tttgcagcgc tcttgagatt tcagcggctt ttaaatattt 12961 ttctgtaggt ctttctacct gcccagaaat aatcagcgga gttcgcgctt catcaactaa 13021 gatagaatct acctcgtcaa tcacacagta attgaatggg cgctgcacaa catctttgat 13081 gtctgttgcc atgttatcac gcaggtagtc aaaacctatc tcgctgttag taacataggt 13141 tatatcacag tcgtagtttt tcttgcgctc agatggcgtc atgctttgct gaatcagccc 13201 cacgctcaat cccaagaagc gatgtagctg tcccatccat tctgcgtccc gacgagccag 13261 gtaatcgttt acagtgatga cgtgtacgcc ttttcctgtt agggcattca aataacttgg 13321 caaagtcgca accaaagttt tgccctcacc agttttcatt tcggcaattt gccctgtatg 13381 caggatagca ccgcctaaaa gttgaacatc aaagtgccgc aagcctaaca ctcgccgtcc 13441 tgcttcccgc acaacagcaa acgcctctgg caaaatatca tccagagttt cgcctttctt 13501 cagacgctgt ttcagttctg ccgttttgcc ctttagctct tcgtctgaaa gggcttttat 13561 gtcttcctct agaaggttaa tttctgtaat gaaaggttga tattttttaa gtttacgagc 13621 gttggggtcg cccaacaaag tctttagcat ggcagattat acaaaatcaa ggggaacggg 13681 gatgattaat gattgggatt gattataaac tcttaaccca accgatccgg gtgtggtttt 13741 ggatgggttg tacatgagaa tttgctaaaa aacacaagga taaagaccca atcaagttgc 13801 taaatgattg gttttatttt tcagcttaaa tttagactgt ttttatagta tcatttcacc 13861 tccagatgag gcagatcagc ccgaccaagc agaggtcgtg attgccccaa aggaaggata 13921 cgggaagggg gaacagggaa cagggaacag ggaacagg // LOCUS NODE_2434_length_13952_cov_5.04742013952 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13952) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13952) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13952 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(119..1426) /gene="lptC" /locus_tag="DP116_20250" CDS complement(119..1426) /gene="lptC" /locus_tag="DP116_20250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140251.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LPS export ABC transporter periplasmic protein LptC" /protein_id="PRJNA477356:DP116_20250" /translation="MRGTNQFKIAKRCELKPRGVARACPLDIAVAGSDAISKLQILNF TALPHLPLVKSSLILLISLISTGLVGCAGSQTHVTTKPPVESPSPGDKDSNLTFFGVT LEQADEVGRPIWKVRAKQAKYTKEKQIGAAQSPYGELYQDGKVVYQVQAQMADIAQDG KQLFLKGKIVAIDPLNGVVLQGNELEWRPKEDLLIVRKQLHGTHKQLQAVAQEARVKT REQRMEFSGGVIANSLEPQLQMRTEHLIWRIKEETLIGDRPVQMDHYKNNQITDRGRG DSAEVNLKTKIATIKKNAQIELLDPPVQVASNSMTWNLNSETVTTNSPVRVFHQAQKV ALSANQGEMKIPQKTVYLTGNVYSVGERGQSLKSKTLTWYFDKKLVEAQGDVVYRQAD PPLAFTGQKASGDLQAETIVVNSDGSGKKVVTEIIPQDRKVKN" gene complement(1465..2109) /locus_tag="DP116_20255" CDS complement(1465..2109) /locus_tag="DP116_20255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NYN domain-containing protein" /protein_id="PRJNA477356:DP116_20255" /translation="MLNNLENDSIFTPEQVLENRGRVAIFIDGSNLFYAALQLGIEID YTKLLCRLTGGSRLLRAFFYTGVDRTNEKQQGFLLWMRRNGYRVIAKDLVQLPDGSKK ANLDVEIAVDMMALVDSYDTAVLVSGDGDLAYAVNSVSYRGVRVEVVSLRSMTSDSLI NVSDRYIDLEAIKEDIQKTPRQSYPYRVIDRSLAPIGYLEDPRESDSQQIEMQD" gene complement(2259..3860) /locus_tag="DP116_20260" CDS complement(2259..3860) /locus_tag="DP116_20260" /EC_number="6.1.1.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132148.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="methionine--tRNA ligase" /protein_id="PRJNA477356:DP116_20260" /translation="MNQVKKTQNTFALTTPLYYVNDLPHIGSAYTTMAADVVARFQRL LGHRVLLITGTDEHGQKIQRTAESKGRSPQSFSDEMSAGFVSLWQLLNIQYDRFIRTT APRHEVLVKEFFQRVWEAGDIYHGQQKGWYCVSCEEFKEERELLEGHRCPLHPNKEVE WRDEQNYFFRLSKYQEKLQALYESRPDFIQPESRRNEVLSFVNQGLQDFSISRVNVDW GFPVPVDPNHTLYVWFDALLGYVTALLEPDEEPTIKNALAKWWPINLHLIGKDILRFH AVYWPAMLMSAGLSLPERVFGHGFLTKDGQKMGKSLGNTLDPIGLVERYGSDAVRYYF LKEIEFGKDGDFNESRFIDVLNADLANDLGNLLNRTLGMVKKYCAGNVPTITNEDIPF EHALKAIGLPLGEQVRNAYEALAFNQACKAVLSLVKTSNKFLDEQAPWSLYKQGQQQA VEQVLYAVLESVRLAAYLLSPIIPDTSSDIYQQLGFGINFNEQIETATAAPFAIHGTW GVLTNQQKLGQARPVFKKIEPPKND" gene 4231..4959 /locus_tag="DP116_20265" CDS 4231..4959 /locus_tag="DP116_20265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864521.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="1-acyl-sn-glycerol-3-phosphate acyltransferase" /protein_id="PRJNA477356:DP116_20265" /translation="MSVNSPLEISRWLLAALSTKMFRYYEDRIPQDASVLVVSNHRSF MDAPILMAALSSSIRFACHHYMGQVPIMREIVTGQLGCFPLEEANQHRQQSFFQQSQL LLQSKQIVGVFPEGTKPMVEFTQANRVGEFQRGFAHLALRAQVRDLAILPIAIASLEE VNTSAVPLRVLSLFDPSEPLFNQSGWHPLVTYQRVAVLIGHPYWIKPLRKEKYQGRKA RTVVTELTNHCHSEIANLLREGCY" gene 5036..5854 /locus_tag="DP116_20270" CDS 5036..5854 /locus_tag="DP116_20270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457188.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_20270" /translation="MTISKVELKPCFLTPKRLQPEFPLFVYLPGMDGTGQLLRSQTAG LEVGFDVRCLAIPREDLTSWDDLANNVLDLIHAELEKNSQRPVYLCGESFGGCLAQKV AILAPQLFKRIILINSASAFNLRPLLTWASQLSYLVPSNLYNIGALGLLPFLASLPRI SRSDRQELLKTMRSVPPETVLWRISLLRDFCVEEKQLRRLTQPVLVIAGGSDRLLPSL AEAKRLVSILPNSKMVVLPQCGHACLLETDTNLYEIMKANDFLDSSAEAVQVLG" gene 6111..6437 /locus_tag="DP116_20275" /pseudo CDS 6111..6437 /locus_tag="DP116_20275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319191.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" gene 6595..7827 /locus_tag="DP116_20280" CDS 6595..7827 /locus_tag="DP116_20280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315762.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20280" /translation="MAEKLKKDTDETKLNKILASDTKEEKPSLLEAVAETVGCVLGTA VDVGTAAGNTAMEAGKVVVETAADVGEAAAKQSHNLISQATHTAGQVAERIGEIWLVR KLAGFLNLNWLFGAVDTVNLDKAEAEVKKLKQKYPNESPRQIAHRIMLDKATKAGGIG LASSVLPGVAAALLAIDLAATTELQSEMVYQIASLYGLDLKDPSRKGEVLAIFGLALG GGRLLKAAGLGLLRNIPFAGAVIGASSNATMIYSLGYAACRFYEAKLDASKSLDSEET LNTLKQQSENYLEKAMAQEAAMDQVLVHFIIASYPEKTWEEISTELQGLNLSSSSLKT ISENIKSPSPLDTLLNQLNRDFAVPLLAQCYKIARVKGEMTPVAQHIIDTIAAKFDID INFVQSTVDASGNFTNKQ" gene 8084..10099 /locus_tag="DP116_20285" /pseudo CDS 8084..10099 /locus_tag="DP116_20285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315761.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="PAS domain-containing sensor histidine kinase" gene 10457..10993 /locus_tag="DP116_20290" /pseudo CDS 10457..10993 /locus_tag="DP116_20290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742943.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="PAS domain-containing sensor histidine kinase" gene 11156..11632 /locus_tag="DP116_20295" CDS 11156..11632 /locus_tag="DP116_20295" /inference="COORDINATES: protein motif:HMM:PF02518.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20295" /translation="MRHTTIDMNLLLREVQRDFKLETKSRQVSWHIESLPQVQGDPDL LRLVLRNLLENALKYSKTRPLTQITVGSTTGHQEVVFFVRDNGIGFDMKYVHKLFGVF QRLHSDPQFEGTGVGLANVQRIIHRHGGRVWAESEIDNGATFYFSLPRSSGVESGE" gene 11643..12119 /locus_tag="DP116_20300" CDS 11643..12119 /locus_tag="DP116_20300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315760.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_20300" /translation="MELKRILLVEDSINDVELILTSLAENHLGNEVVVVRDGEEALDY LYRRGLYRLRREGHPVVVLLDLKLPKIDGIEVLAQLKADPELRVVPVVVLTSSREEQD LTRCYELGTNGYVVKPIDFLEFVEVIKGLGLFWAIINEPPPGSIPPARSSQGVAGI" gene 12122..>13952 /locus_tag="DP116_20305" CDS 12122..>13952 /locus_tag="DP116_20305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_20305" /translation="MSGLRFLLLEDNLLDAELINALLTENGIACELIHVKTQAEFQTA LEQDGFDLILSDYALPGFDGITALVIAQHHSPEIPFIFVTATMGEEVAIETLKSGATD YVLKQRLGRLIPSVRRALREAQYQRTCKMAEVELYRREQEFRALAENSPDAITRIDGE LRYSYVNPTIEVATGIPLEKWIGKTVAEIDYPEEFSTTWEAKLSQVFATGSESSMEFD VPSHQGRIYYQARIVPEYAPDGSVQSLLSIARDVTEYKRSEQALRENEAQLRQQKEEL ERANKIKDEFLAVLSHELRSPLNAILGWSKILRTRNLDPKSFNRALETIERNAKLQTQ LIEDLLDVSRIIRGKLTLRPYPTNLIPAIEAAIDTMRLAAQAKSIDLQFIILDSELET KFQIPNSQSPPIALQSPPSQEIAARPQQNEPLLAQNSSVESPGSLSQIPGQKFHVLGD PSRLQQIVWNLLSNAIKFTPQGGRVEVLLERVGGDEKDEGATSSSSPLPIRHSSNHAK ITVIDTGIGIKADFIPYVFDSFRQGDGSTTRKYGGLGLGLAIVRHLVELHGGTVSAES SGEGKGATFAVELPILEDKKARKIISSASSASSASSASSPLELS" BASE COUNT 3882 a 2972 c 3057 g 4041 t ORIGIN 1 tatagcagtt gccaggtaga ttaggacatg aactaatgag aaaatacgga caccataaga 61 ctttcacacg cctgttaaga gttccctgtt aagagttccc tgttccctgc tataactatt 121 aatttttcac ttttctgtct tgaggaataa tctctgttac gaccttttta ccagaaccat 181 cgctgttgac gactatcgtt tctgcttgca aatcaccaga ggctttttga cctgtaaaag 241 ctaatggggg gtctgcttgg cgataaacga catccccttg agcctcaact aattttttgt 301 caaaatacca tgttaaagtt tttgatttga gagactgacc acgctctcca acgctataaa 361 cgtttcctgt taaataaaca gttttttgcg gtatcttcat ttcgccttga ttcgcactaa 421 gtgctacttt ttgggcctga tggaagaccc gcacaggaga attcgttgtt actgtttctg 481 agtttaagtt ccaagtcata gagttactag cgacttgcac tggtgggtct agtaactcta 541 tttgtgcatt tttcttaata gtggcaattt ttgttttcaa gttgacttca gcggagtctc 601 ctcgacctcg gtcggtaatt tggttgtttt tatagtgatc catttggaca gggcgatcgc 661 ctattaatgt ttcttctttg attcgccaaa tcaaatgctc agttcgcatt tgtaattgtg 721 gttctaagga gtttgcgatc acccctccag aaaattccat gcgctgttcg cgagttttga 781 ctcgcgcctc ctgcgctact gcttgtagtt gtttatgagt tccatgaagc tgcttacgca 841 caatcagcaa atcttctttt ggtcgccact ccaactcatt accttgcaaa acaacaccat 901 taagaggatc tattgcgaca attttccctt ttagaaacaa ttgcttacca tcttgcgcaa 961 tgtctgccat ttgagcttgt acttgataaa ccactttgcc atcttggtaa agttcaccat 
1021 atggactttg cgctgcacca atttgttttt ctttggtgta ttttgcctgt tttgctctga 1081 ctttccaaat tggtctgcca acttcatctg cttgctctaa ggtgacacca aagaaagtca 1141 ggttgctatc tttgtctcca ggcgatggac tttcaacagg tggtttggtt gtgacgtgag 1201 tttggctccc agcacaacca actagccctg tggaaataag ggagatgagc aagatgagag 1261 aactcttaac caaggggaga tgaggtaatg cagtgaaatt caaaatttgc aatttggaga 1321 ttgcgtcgct tcccgccacg gctatgtcca gaggacacgc tcttgcaacg cctcgcggct 1381 tcagctcgca acgctttgcg attttgaatt gatttgtccc cctcatcctc cttatctccc 1441 tttctcctgt tgcttactgc ttcattagtc ttgcatttca atctgttggc tgtcgctttc 1501 cctggggtct tccaaatatc ctataggggc tagtgatcga tctattactc gataagggta 1561 gctttggcga ggggtttttt gaatatcttc tttaattgct tctaaatcaa tatagcgatc 1621 gctcacattg attaaactgt cgctcgtcat agaccgtaaa ctgaccactt caacccgcac 1681 accgcgatag ctgactgaat tgactgcata cgccagatcc ccgtcaccac tgactaacac 1741 tgctgtatca taggaatcta ctaaagccat catatctacc gcaatttcta catccaagtt 1801 ggcttttttg gagccatctg gcaactgtac taaatccttg gcaataacgc ggtatccgtt 1861 gcgtcgcatc cacaacagaa aaccctgctg cttttcattt gtgcggtcta ctccagtgta 1921 aaagaaggct cgcagcaatc tcgaacctcc ggttaatcga catagtagct tggtgtaatc 1981 aatttcaatt cctagttgca gtgctgcata aaataaattt gagccatcaa taaatatggc 2041 gacacggcct cgattttcta aaacttgttc tggcgtaaat atcgaatcat tttccaaatt 2101 attcaacatt gttgtcatac ctcatttttt atcacaagta ataattgata cttgtttttg 2161 cttgtagaac ggtggagaac ggtttatgat aatctttcag gactgataaa tttagtcctg 2221 ataaaatcag acaaaatcag taatggagat tgaaacaatt aatcgttttt aggaggttct 2281 atttttttaa agactggtcg ggcttgaccc agcttttgtt gatttgttag cactccccat 2341 gtcccatgaa tggcaaaagg agcggcggtg gcagtttcta tttgttcatt aaaattaatt 2401 ccgaaaccta gttgctggta aatatcacta cttgtatcag ggatgattgg agataggaga 2461 taggctgcta gtctcactga ttcaagaact gcatacagaa cctgctctac ggcttgctgt 2521 tgtccttgtt tatacaatga ccaaggggct tgttcatcaa gaaacttgtt actagttttc 2581 accagtgaaa ggacggcctt gcaagcttga ttgaaggcta gcgcttcgta agcatttctt 2641 acctgttccc caagaggtaa accaatcgct ttcaaagcat gttcaaaagg tatatcttca 2701 
tttgtaattg tcggcacatt gccagcgcag tatttcttca ccatgcctaa agtacggttt 2761 agcaaattac ctaaatcatt tgccaaatct gcattcagca catcaatgaa tctactttca 2821 ttaaagtcgc catccttgcc aaattcgatt tccttaagga agtaataacg aacagcatca 2881 ctaccatacc gctccaccaa acctatagga tcgagggtat tacccagact tttgcccatt 2941 ttctgaccat ctttggtcaa aaaaccatgc ccaaagactc tctctggtaa tgacaagcca 3001 gctgacatga gcattgctgg ccagtacact gcatggaagc gcaggatatc cttaccaatc 3061 agatgcaaat tgatcggcca ccattttgct aaggcatttt taatggttgg ttcttcatcg 3121 ggttctagca aagcagtgac ataacctaga agcgcatcga accaaacata aagagtatgg 3181 ttaggatcaa ctggtacagg aaaaccccaa tctacattca ctcgtgaaat ggagaagtcc 3241 tgcaaccctt gatttacaaa gctgaggact tcatttcgac gactctctgg ttggataaaa 3301 tccggtcgag actcgtaaag tgcttgcagt ttttcttgat atttagataa gcggaagaaa 3361 tagttttgct cgtctcgcca ctcaacttct ttgtttgggt ggagaggaca ccggtgtcct 3421 tctaacagtt ctctttcttc tttgaattct tcacacgaaa cgcagtacca gcctttttgt 3481 tgtccgtggt aaatgtctcc agcttcccac actcgctgga aaaattcttt gactagaact 3541 tcatgacgag gagcggttgt acgaataaat ctatcgtatt gaatgtttag taactgccat 3601 aacgagacaa aacctgcgga catttcatca ctaaagcttt ggggcgatcg ccctttactt 3661 tctgctgtcc gctgaatttt ttgcccgtgc tcatctgtac ctgtaatcaa gaggactcta 3721 tgccccaaaa gtctctgaaa ccttgccacc acatctgctg ccattgttgt ataagcactg 3781 ccaatatggg gtaagtcgtt tacataatat agcggtgttg tgagtgcaaa tgtgttttgt 3841 gttttcttca cttgattcat ctaaaaaaaa ctagttgtta aaaacgtttt ctatcttact 3901 ttttttggta aaaccacgca attatataac ataagtgtca cttattcaag aatatcttaa 3961 tattagcaaa tttctctagg ttgagttttt ttgatgaacg gcataatttc agctaatttc 4021 tctaaaaaat accgaaaatc tttgctcata ttccacacaa aacttgtcat tacattaaga 4081 ttattagaag aaaaacctct tttccacaaa taattttacc gattttgtga atctgaacat 4141 ttctacctta aggcagtcgc ccttatcggt taagaaatgt aaaattttaa tgaaaaataa 4201 gctgcgaaat tttcggattc tatattaaag atgagtgtaa atagccccct tgagatttct 4261 cgctggttac tggcggcgct atcaaccaaa atgtttcgct actatgagga tcgcattccc 4321 caagatgcga gtgtgttagt agtgagcaat caccgcagct ttatggatgc acccatttta 4381 atggcggctc 
tatcgagttc tattcgcttt gcttgccatc actacatggg gcaagtgcca 4441 atcatgcggg agattgttac aggacaattg ggctgctttc ctctagagga ggcaaatcaa 4501 caccgccagc aaagcttttt ccaacagtca caactcctgt tgcaatcaaa gcaaatagta 4561 ggagtctttc ctgaaggaac aaaaccaatg gtagaattta cccaagccaa ccgggtgggt 4621 gaatttcaac gaggttttgc ccatttggca ttgcgagcgc aggtgcgaga tttagcaatt 4681 ttgccaatcg caatagcatc ccttgaagaa gttaacacat ctgctgtacc tttgagggta 4741 ttaagtttgt ttgacccttc agaaccttta tttaatcagt caggttggca tccactagtc 4801 acatatcaac gtgttgctgt tctcattggt cacccatatt ggattaagcc tctacgaaaa 4861 gaaaaatacc aaggaagaaa agcaagaact gtggtgacag aattgacaaa tcattgtcac 4921 tcagaaattg caaacttact gcgtgaaggt tgctattaaa aattttagtc attagtcctt 4981 atagttagct gatgactctt gactgcctcc tcttgactgt tgactcttgt gaccaatgac 5041 tatttcaaaa gttgagctaa aaccgtgttt cctcactcct aaacgactcc agccagagtt 5101 ccccttgttt gtgtatttac cagggatgga tggaactggt caattgttgc gatcgcaaac 5161 tgctggatta gaagttggct ttgatgtccg ttgtttagcg ataccaaggg aagatctcac 5221 cagttgggat gacttagcca ataatgtctt ggacttgatc catgcagaat tagaaaaaaa 5281 ttcccaacgt cccgtttatc tgtgtgggga gtcctttggt ggttgtttgg cgcagaaagt 5341 ggcgattctt gcaccacagc tgtttaagcg cattattctc atcaactcag ccagcgcctt 5401 taatcttcgc cctttattaa cttgggcatc tcaattgagt tacttagttc catctaacct 5461 ttataacatt ggtgcactag gtttattgcc ttttttggca tctttgccac gtatttccag 5521 gagtgataga caagaactcc taaaaaccat gcgttccgta ccaccagaaa ctgtactgtg 5581 gcgaatctcg ttactgagag atttttgtgt tgaagagaaa cagttacgtc gcctgactca 5641 acctgttttg gtgattgcgg gtggtagcga tcgactttta ccttccttag ctgaagcaaa 5701 acgtttggtt agtattttac ccaattctaa aatggtggta ctaccacaat gtggacacgc 5761 ttgcttgtta gaaacagata ctaacctcta tgaaattatg aaggcgaatg attttttaga 5821 cagtagtgct gaagcagttc aggtgctggg gtagaaggag gggctacagg gctgaattca 5881 catcaaaagt cgggcgttac tcaattaaat atccatactc taggacaccc cttgcaagtg 5941 gcgtgtccca atacttctcg gttagtgagg ataatcccat gaagatcccc ccaacccgcc 6001 ttagaaagga gggctatttt ctgctctcca cccccttttt aactctgtcg ccgtaggcgg 6061 ggagatcaga aaattgggtg 
gataaagctt tatccgagaa gtattgtggc gtgtcctatc 6121 ttcttatccc atttactaac ttaagtcaga caaagttacg ccttggtgtt gtttttttgg 6181 aaaacactag tttgaatgaa gttacgctct gtttcatttc ttgacaaaac gttgataata 6241 tgagttgaga tgaatttgcc ctagatactt cagatccgga attttctggg actagatgac 6301 gagttgcttc taaagtgaga gtaacttgaa ttgtgacctt gcaacgtcaa ctcattattc 6361 gacgcttggg caagttagga gtcaatcaaa cacagacgtt gaatgcactt ataattcaat 6421 ttttaaaatt aatgtaatca aatttatact gacgtccaaa ctagctgcta gctatcgttt 6481 gagtagaagt tttttcaaac atcctctaac gacttatcct tatagaaaga tgccaagctg 6541 atcacacctt cgctacaaat aaaataggga gatttttaaa gaaggtgcta ggcaatggca 6601 gaaaaactta aaaaagacac agatgagaca aaattaaata agatacttgc ctccgataca 6661 aaagaagaaa agccatcctt gttagaagcc gttgctgaaa cagtaggttg tgttttggga 6721 acggctgtgg acgtaggaac agctgctggt aacactgcta tggaagctgg taaagtcgtt 6781 gtagaaactg ccgctgatgt aggagaagct gcagcaaaac aaagtcataa cctgatttca 6841 caagcaactc acacagcagg acaagttgca gaacgtattg gcgaaatttg gctggtccga 6901 aagctggctg ggtttttaaa tctgaattgg ctttttggtg ctgttgatac tgtgaattta 6961 gataaagcag aagcagaagt caaaaagctc aagcaaaaat acccgaatga atcacctaga 7021 caaattgccc accgcattat gcttgacaaa gcaactaaag cgggcggaat cggattagca 7081 agtagtgttt tgccgggagt cgcagcggct ttgttggcaa ttgatttagc agctacgaca 7141 gagttgcagt cagaaatggt ttatcaaatt gcatccctct acggattaga tttaaaagat 7201 ccctcccgta agggagaagt gttggcgatt tttggtttgg cgttgggtgg cggtcgtctt 7261 ttgaaagcag caggtttagg tttactaaga aatatacctt ttgctggcgc agtgattggt 7321 gctagctcta atgctacaat gatttattcg ttggggtatg cggcttgtcg gttttatgaa 7381 gcgaagctag atgcatcaaa gtcgctcgat tctgaggaga cactgaatac tttaaaacaa 7441 caaagcgaaa attatctgga aaaagcaatg gctcaagaag ccgcaatgga tcaagtgttg 7501 gttcatttta tcattgctag ctatccagaa aaaacatggg aagaaatttc gacagagtta 7561 caaggtttga acctcagttc atcttccttg aaaacaatct ctgaaaatat taaatcacca 7621 tcacctttag atacacttct gaatcaactc aaccgcgatt ttgctgtacc actattagct 7681 caatgttata aaattgcccg agtcaaaggt gagatgaccc cagtagcaca acatattatt 7741 gatactattg ctgctaaatt tgacattgat 
atcaattttg ttcagtccac tgttgatgca 7801 tccggaaatt ttacaaataa acaataattc atttgtctag tagaagagac tctactaaac 7861 aagccggttt ctcaaagaag ccggctttct tattcataga aatataccat attagagaaa 7921 tctatctcaa ggtcattcca cagaggttct tatcagtctt aaggtagatt gtagcaatac 7981 tcccagcatt tttcgtctgg caaagcaatg ttggaaattt tgtcctgcaa tcattgtatt 8041 agaagtctta aaaaacagaa gtctgaagtt ggagtaaagg cttatgccta acttcaccgt 8101 tactaaaaac attgcacaag ctgcaagtgt ctttgttatt tgcctcggtt gtgtcgtatt 8161 gattggctgg acatttgata ttgctttact caaaagtcta ctgccaggac tggtcacaat 8221 gaaagttaat gcagccgttg ctttcttact ctctggggta tcactttggc ttttgtcaaa 8281 acaagcaagc cagacttggg ggataacttc ctcgtcttct ccatctttgt cactcctgat 8341 tgcccaagga tgtgcgttcg cagttactaa tatcggtcta atcacgttat gtcaatacct 8401 atttggatgg aattgtggaa ttgacgaact gctgttccgt gactcgccaa aagccgtgga 8461 aacatctcat cctggtcgga tgggggcaaa ttctgccgtt aattttctgc tatcgggttt 8521 aagtctatgg ttattagggc agaaaactca tcgcagttct tggttaggtc agggtttgag 8581 tttgattgta ggtttgattt ctttgctaac atttgttggc tatgtctatg gggtgaaaaa 8641 tttttaccag ttcggcgttt atactacctc aatggcgtta cacacagccc taggatttgg 8701 agtgctgtgt ttgggagtgt tatacaccca tccggagcgc ggtttcatgc aaacgatgac 8761 tagtgaactg aatggtgggg cgatcgcccg caggtttata ccaagtgcaa tcgtgctgcc 8821 cttaattctg ggctggttaa tacttgtggg ccaaagggcg aaacaatatg accccgcttt 8881 cagcatatcg cttctggtgg tgtcgctaga agtcatgttt ttggcactga tttggcgaaa 8941 tgcggggttc attaatcgcg tggatagcga tcgcaaaaaa gtcgaagccg cactacagga 9001 aagtgaagaa cgctatcggc tgcttgctga aaacgttccc cagatggttt ggatgagcaa 9061 tcacgatggc tttgtagaat actacaatca gcgctggcta gactacacag gactcaaatc 9121 ccaagaaatc ttggggtgga attggcaaca gctagttcat cctgatgact taccacaaac 9181 tatagaacaa tggacaactg gactcaagac gggaaatcgc tatgaggtgc agtaccgcct 9241 gaaacgagtt gatggggttt atcgttggca tctagcacaa gctttaccat tacgtgatga 9301 ggacggtaca attctcaagt ggtttggcac ctgcactgat atcgatgacc aaaaacagac 9361 agaacaagca ctgcgtcaaa gtgaagcacg gctacggcta tttgtcaact ccgatatcat 9421 tggcattcag tttggcgatg tatatggagg attaactgag 
gtaaatgatg cctacctcca 9481 tattgttggt tatacccgta aagatttctt acgaggtaaa gtgcgctgga ctcagataac 9541 tccaagcgag tatctaccgt tagatgaagc ggcgatcgcc gaggcgaaga tcaaaggatt 9601 ttgcactcct tacgagaaag agtatatccg taaagatggt tctcgtgttc cagtcatcat 9661 tggcttcgcc ttcattgatg agcaacggga agaggtgatc gccttcattg tggacattag 9721 cgaacgcaag cgagcagaag cagaacgcga tcgctttttt acgctatcag ttgatatgct 9781 ctgcatcacc ggattggatg gttacttcaa gcgtataaac ccggcttttg agagaattct 9841 cggctacact caggcagaac tgctgagcac gccatttatt gagtttgtac atccagaaga 9901 ccgagcaaaa accgagtctg aagtggagaa acttgcaaca ggagccgtaa cactcaattt 9961 cgagaaccgc tacctgtgta aagatggttc ctacaaatgg ctggtttgga atgctgttcc 10021 cgatgttgag ttagagttgt tgtatgcagt tgcccatgat atcactgagc gcaaacaaat 10081 agagcagacg ctgcgagaac aggcagaagc tttgcgaatt agtagagaac gtctggattt 10141 ggtcattcag ggagcagaag tcggtgtctg gtactgcaac ttaccactgg acaaactgat 10201 ttggaacgac cagtgcaaag cacattttgg tctaccacct gatgctgaag tcacgattga 10261 tttatgttat caactgctgc atgatgatga ccgggaaccc acacgccaag cagttcagca 10321 cgctattgag caggcgactg gttatgatgt ggactaccgc acagtagccc cagatggacg 10381 cgtacgttgg atccgagcaa ttggacacac tttctgtgat gcggctggca caccaacgcg 10441 ctttgacggc attactattg acattactga gcgcaaacga gcacaagccg cccttgttga 10501 aagcgagaag cgctttcgcc atgttacaga cactgctccg atgatggttt ggatgtccgg 10561 tacggataaa ctctatgact acttcaataa accttggtta gattttactg gacggacgat 10621 cgagcaggaa ctagggaatg gttggactga aggtattcat cccgacgatt ttcagcgttg 10681 tctagaaact tacgtcaatt cttttgatgc ccgccaagag tttcaaatgg agtatcgcct 10741 gcgacggttt gatggagaat atcgttgggt tttagatatt ggtgtgcctc ggttcacttc 10801 agagggagaa tttttgggtt atatcggctc ttgtgtagat atagaagacc gcaagcaagt 10861 cgaagctcaa attcaacaac tcaacgaaac cttagaagaa cgggtaaaac agcgcacgtc 10921 tcaattagaa gcagccaaca gagaactaga atccttttct tactccgtct ctcatgactt 10981 gcgggcacca ttgcgccaca tcgctggatt cgttgacttg ctgcaaaaac gtcttgtgtc 11041 aacaggggtg gacgacacga gtagacgtta cctcaacact atcactgaag caacgaaaca 11101 agcgggtaaa ttaattgatg atttgttaac 
tttttcccat gtgggacgtt cgcaaatgcg 11161 tcataccacc attgacatga atctactgct gcgggaggtg caacgcgatt tcaaactaga 11221 aaccaaaagc cgtcaggttt cctggcacat agaatcatta cctcaagtac aaggagaccc 11281 cgatctgttg cggctggttc tgcgtaacct cttggaaaat gctctcaagt atagtaaaac 11341 tcgccctttg acacaaataa ctgtaggtag taccaccggt catcaagaag tggtattctt 11401 cgtgcgagat aacggcattg gttttgatat gaaatatgtt cacaagttgt ttggtgtttt 11461 tcaacgcctg cacagtgacc cacaatttga aggtacaggt gtgggattgg caaatgtcca 11521 acgcattatt caccgtcatg gcggacgcgt ttgggcagaa agtgaaatag ataatggggc 11581 gactttttat ttttcgctac caagaagctc tggagtggaa agtggagagt agagaatata 11641 cgatggaatt aaagcgcatt ctcctagtag aggacagcat taacgatgtt gagttaatcc 11701 tgacttcact ggcagaaaac catttgggca atgaagttgt cgtcgttcgt gacggtgaag 11761 aagcattgga ttacctctat cgtcgaggtt tgtatcgttt gcgtcgggag ggtcatcccg 11821 ttgtggtgct attggatctt aagttaccaa aaattgatgg catagaagtc cttgcacaac 11881 tcaaagctga cccagaattg cgagtcgtac cagtagtggt gttgacttct tcccgtgaag 11941 agcaagatct cactcgctgt tacgaacttg gcaccaatgg ctacgtagtc aaaccaatcg 12001 attttcttga atttgttgaa gtgattaaag gtttgggatt attttgggca ataataaatg 12061 aaccccctcc cggttccata cctccggcac gcagtagtca aggagtagca ggaatatagt 12121 catgtcaggt ctaagatttc tcttgctgga agataatctg ctggatgcag aactgatcaa 12181 tgccttgttg accgaaaacg gtattgcttg cgaactgata cacgtgaaaa cgcaagcaga 12241 attccagaca gcgctagagc aagatggctt tgacctgatt ctttcagatt acgctttgcc 12301 gggatttgat ggtattacag ctcttgtaat agcgcagcat cactccccag agattccctt 12361 catttttgtc accgcaacaa tgggggaaga agtcgcaatt gaaaccctca aaagtggtgc 12421 cactgactat gttctcaagc aaagattagg gcgtcttata ccatcggtac gtcgggcgct 12481 acgcgaagcc cagtatcaac gcacctgcaa aatggcagag gtagaactgt accgtcgtga 12541 gcaggagttt agagctttag cggaaaactc accagatgcg atcacccgca ttgatggaga 12601 actccgctat agctacgtaa atcccacaat agaagtagca acaggaatac cactagaaaa 12661 gtggattggc aaaacagttg ccgaaatcga ttatccagaa gaattttcga caacttggga 12721 agcgaaattg agccaagtgt ttgccacagg atctgagtcc tcgatggagt ttgatgttcc 12781 ctcacatcag 
ggacgcattt actaccaagc caggatagtg ccagaatatg ctcctgatgg 12841 ttccgtacaa tcattgctga gtattgctcg tgatgtgacg gagtacaagc gctcagaaca 12901 ggcattacgg gaaaacgaag cccaattacg gcaacaaaag gaagaattgg aacgggcaaa 12961 taaaattaag gatgagtttt tggcagtact ttcccatgag ttgcgatcgc ccctcaacgc 13021 catcctcggt tggtcaaaaa tactacgtac ccggaacttg gatccaaaga gtttcaatcg 13081 tgccttggaa acaattgagc gcaacgccaa attacagaca caactgattg aagatttgct 13141 ggatgtatcc cggattattc gtggcaagtt gactctgcgt ccctacccaa caaatttgat 13201 tccagcaatt gaagctgcaa tagatacaat gcgtctagca gctcaagcca aatcaattga 13261 tttgcaattt atcattttag actctgaatt ggaaaccaaa ttccaaattc caaattctca 13321 atcccctcct attgcactac aatcgccacc ctcacaggaa attgcagcgc gtccacaaca 13381 gaacgaacct ctgctggcgc aaaactcctc tgtggaaagc cctggttctc tatctcaaat 13441 tccaggacaa aaattccacg ttttaggcga tcctagccga ttacagcaaa tagtctggaa 13501 tctcctttct aatgctataa agtttacccc acagggagga cgagtggaag tgctgctgga 13561 aagagttggg ggagatgaaa aagatgaagg agcaacatct tcctcatccc cactccccat 13621 tcgccactcc tcaaaccacg ctaaaattac ggtgatagat acaggaattg gtattaaagc 13681 tgactttatt ccttacgtgt ttgattcttt tcgtcaagga gatggttcta ccacccgcaa 13741 atacgggggt ttgggattgg gtttggcaat tgtacgtcac ctagtagaac tgcatggtgg 13801 aactgtcagt gcagaaagtt ctggagaagg aaaaggcgca acatttgcag tggaattacc 13861 tattcttgag gataaaaaag caaggaaaat tatttcgtct gcctcatctg cctcatctgc 13921 ctcatctgcc tcatctccat tagaactgtc tc // LOCUS NODE_2444_length_13915_cov_6.125253 13915 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13915) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13915) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13915 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 65..1075 /locus_tag="DP116_20310" CDS 65..1075 /locus_tag="DP116_20310" /inference="COORDINATES: protein motif:HMM:PF00400.30" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20310" /translation="MLHNRLLNAYAQKCLDGWHTGVNDGYFFQNLAYHLHEAGRVRKL QRLLLDFRWLEAKLEKTNINALLADYDFLPQDENVQLVQGALRLSAHVLNQDKTELAT QLWGRLQSVAVPEIQAMLEQAKQWKATPWLRPLTPCFTRPGGALLRTLSGHNNEVRAI ALTPDGKYVISGSLDSTLKVWNWQTGEEVRTLRGHDEEVRAIALTPDGKYVISGSSDE TLKIWNWRTGEEVRTLKGHGSMVTAVVVTPDGNYIISGSWNGTLKVWDWQTGEVVRTL KGHSHQVDALVVTPDGKCIISGSADKTLKVWNWQIGEDVHTFTAHSSYRDLNKIKGEN TV" gene 1269..1898 /locus_tag="DP116_20315" /pseudo CDS 1269..1898 /locus_tag="DP116_20315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307825.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS701 family transposase" gene 2122..2691 /locus_tag="DP116_20320" /pseudo CDS 2122..2691 /locus_tag="DP116_20320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012413077.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS701 family transposase" gene complement(2856..4733) /locus_tag="DP116_20325" CDS complement(2856..4733) /locus_tag="DP116_20325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317320.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine-protein phosphatase" /protein_id="PRJNA477356:DP116_20325" /translation="MNSIEDSVSTSSQTPLVHRYLWATGSLAAKIQPGGKVGHRYDVT SSNIWLDTQPELLPDIPEEFSKEIVSYLRLYQHRLHIPQVYGLAYDNILLLENVPIDE TGNLYPAITDAWEEAKAVRQVYWLYQIIQLWTPLSQLGVAGSLLIPDNLRVQGWCVRL LELVETPHDVSLQQLGECWQPWVAAAKTPVAQELNNIVQQMCSEEANLDSIAAQLNIL LLSSAADLPLTLEVAGATDPGLQQTQNEDACYPSDTADPDDLLLPRVSIVCDGIGGHQ GGEVASLLAVQSLKLQLRALLTEVKEQADILPPDLIQKQIEASLRIVNNVISNCNNEQ KREGTERMGTTLVMAVQLPQIIKSLSGKESQNTHELYLAHVGDSRAYWITRDYCQLLT VDDDVAGREVRSARTLYRKALQRPEATSLSQGLGTKDGEFLRPVIQQFILEEDGILLL CSDGLSDNDWVEHSWRNYAVPVLQGELSLEDAVPQWISLANEKNAHDNISVVLTHCRV SPDPIVSLLRALRPVEVIVPEQQEQEQEQQSELAASSQALLDLLVSEPPVTDETQTPA KPQAKKKWWLLVGGLLILLVSGTSLGLFAWWRLKLNPQTFQQTCGRLPQDLQRLCPPQ R" gene 5296..6549 /locus_tag="DP116_20330" CDS 5296..6549 /locus_tag="DP116_20330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="murein transglycosylase" /protein_id="PRJNA477356:DP116_20330" /translation="MRKIFALLSFSLGIALVNPTYSVAQVPNVPSVPVPNNPQQLPPQ QPDQVEELPIPLQVTDRKTCQVTYQCLGWDEQIWGGKGKKGDKQALLASIDNSLGYLQ TNKANTVYQNYPIKEITLDRVRRSLIRFRQLVVNSKSISQLQAAVNREFVFYKSVGND GKGTVKFTAYYEPLYTASRTPTAVYKYPLYGRPLDFDGWAKPHPKRIDLEGQDGLLKD QSPLRGLELFWFRDRFEAYMIHIQGSAQLKLTDGTKTSVGFAGATDYPWTSIGRALIK DGKLSREKATMPGIIRHFRENPQDLNEYLPRWERFVFFRETNGTPATGSIGVPVTPER SIATDKSLMPPGALALIYSSFPYPKKGGGLEYRKVSRFVLDQDTGSAIKGPGRVDYFM GTGYLAGDRAGVTGGNGELYYLLLK" gene 6778..7212 /locus_tag="DP116_20335" CDS 6778..7212 /locus_tag="DP116_20335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317319.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NfeD family protein" /protein_id="PRJNA477356:DP116_20335" /translation="MPSTTLIWLLAGSGLCLLELFMPTAFAAFLTGISALIVAFLSQA ILGKLWLQIVVWLFLSTVLVILSRRFMSLPKRKTKIQHAIIAETLTEIPAGKPGRVLY EGNSWRARCDDETLIIPPNQRVYVTRREGTTLIVMPENLLDS" gene 7359..8366 /locus_tag="DP116_20340" CDS 7359..8366 /locus_tag="DP116_20340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="paraslipin" /protein_id="PRJNA477356:DP116_20340" /translation="MEQFFLLIFLALGGSALAGSAKVVNQGNEALVERLGSYNKKLEP GLNFLVPFFDRVVFRETIREKVLDIPPQQCITRDNVSITADAVVYWRIMDMEKAYYKV ENLQAAMVNLVLTQIRAEMGKLELDETFTARSQINETLLHDLDVATDPWGVKVTRVEL RDIIPSQAVRESMELQMSAERRKRAAILTSEGERESAVNNARGEADAQVLDAEARQKS VILRAEAEQKAIVLKAQAERQQQVLKAQAIAESADIIAQKMKNNPLAHQALEVLFALG YLDMGATIGKSDSSKVMFMDPRTIPATLEGIRSIVSDGQPDSNIPYTGEVSPGNNHRS S" gene complement(8491..8886) /locus_tag="DP116_20345" CDS complement(8491..8886) /locus_tag="DP116_20345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129739.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional repressor" /protein_id="PRJNA477356:DP116_20345" /translation="MRAVRTRSQERILNLLKSIKQGISAQDIYVELRNRNQSMGLATV YRSLESLKLEGLVQVRTLGNGEALYNLAQQDKHHLTCLQCGASIPINQCPVHDLEGQL QSTHKFKIFYHTLEFFGLCTQCQLAQTAD" gene 9051..9329 /locus_tag="DP116_20350" CDS 9051..9329 /locus_tag="DP116_20350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129738.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine synthase subunit PurS" /protein_id="PRJNA477356:DP116_20350" /translation="MQRKYLAKIFVTLRPSVLDPAGVAVQSGLKQMGYDDVEQVRIGK YIEVTFTSQDEDTARQNLDRMCDQMLANPVIENYRFDLIEVESQTGVY" gene 9413..10087 /locus_tag="DP116_20355" CDS 9413..10087 /locus_tag="DP116_20355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215894.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine synthase I" /protein_id="PRJNA477356:DP116_20355" /translation="MKFGVVVFPGSNCDRDVAYVTRDLLKQPTRMVWHQETDISDVDI MIIPGGFSYGDYLRCGAIARFSPVMQQVVEHAQKGKFVLGICNGFQVLTEAGLLPGAL VKNRDLHFICDRVPVKVERTDFAWTQGYQNGEIITLPIAHGEGRFYADDSTLSEIENN GQVLFRYQGDNPNGSVNNIAGICNRQGNVLGMMPHPERASDSMLGGSDGLRLFQGLLE KVGALV" gene complement(10283..12301) /locus_tag="DP116_20360" CDS complement(10283..12301) /locus_tag="DP116_20360" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20360" /translation="MSYSSEEIEAIVKRILDKTHTESDLNTLRHIMLINGDKNTLQIG KTNIDIRKARDITIQVADCIYQGDTAVAIQQAVHESVLKGLSEGLKGLSASDFPGEYS LKESIIYRHITGVNEYSTLSFLEEIMPNKLTTILGINHTPIIYTDNPVFKSLENFAVG TGNEFLISYKYHPKTNFYISAALNNNGYTKTISHGAGGLSENNTTENFTQNNKDFEND TLDYKLDELGFNFPHISASFQTSVEDAEWTSIYESEGTNIGRILQYPKLAAVKTCKLG VPFYLKVDETWIKRILKANPETRGFILFLYQYIKSLKHSFFGECYGRIVEYIPPTPYL RFLDIKNIQNRAIKIQALNLNVVENGEDKLTEVSSQERRHLFSSSTQVTQPINIILPP QHHLLIPTEFGFDTKNHQKPFSYVSENNPIGSVSFFKDTLYVSKNPDEDDSLFKNFTD VLGRKQIDWDTLYGTTPELMNTLITDTINLSPDFLAKTKPLKDFLNSHPKRFAVGSLM EVVSLQIDGKEIKIDSPDDTPKFSMSIHFNVGSCPYLMVYNTKKGYWKELGTVLYGRK HKSLQSDEIYKLGEDISKIRIEERESEITYIQSLSIIYTDSQTDTKREIMLSLSGLAS QEEGYFVLHEGQSIEIDLETLISADALDIKLKINGYYEILPDSNISRD" gene complement(12539..13567) /locus_tag="DP116_20365" CDS complement(12539..13567) /locus_tag="DP116_20365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307659.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="caspase family protein" /protein_id="PRJNA477356:DP116_20365" /translation="MSQIFTHGYALLVGVGECAYANWSLPVTVKDAQALKSVLTDPNF CAYPNDEHHIRLLHDKSATRSKILDDLEWLKVQAANDSEATVVVYYSGHGWLDQTTGE YYLIQHDVEPFDIPNSALCAKDFTNALRQIKAQRLLVVIDCCHAEGIATAKNGNPAIK LPPGLTQTAPPKNLVDNLKQGEGRAVFTSSRGQQLSWIRSDDAMSIYTYHLIEALQGA NNQPGDTVVLISNLMNHLGKSVPESAQTMHQAEQKPFFDAATEDFAIALLLGGKGLSV ADKSTTQEEAFENARQEVNATGDRSVAAGRDISNSPITSGNGNTVQIGDTNLNIGKAR DISLGKNL" BASE COUNT 3988 a 2772 c 3010 g 4145 t ORIGIN 1 gcccaaggta gatatttcca gtaatggaca gctttgggta ggtattatat aacggaattt 61 gtctgtgcta cacaaccgtt tgttaaacgc ttatgctcaa aaatgccttg atggttggca 121 tacgggggtg aacgatggat atttcttcca aaatttagct tatcacttgc atgaggcggg 181 aagagtaagg aaattacaac ggttgctgct tgattttcgc tggttagaag cgaagttgga 241 aaaaaccaat atcaatgcct tgctagctga ttatgatttt ttgccccagg atgagaatgt 301 acaactggtg caaggtgcgt tgcgtttgtc agcacacgtt ttaaatcaag acaaaacgga 361 attagcaaca caattatggg gacgtttgca gtccgttgca gtgccagaaa ttcaagcaat 421 gctagaacaa gcaaagcagt ggaaagcgac accttggttg cgtcccctca caccttgctt 481 cactcgccca ggtggtgcat tactgcgaac tcttagcggt cataataatg aggtaagggc 541 gatcgctctc acccctgatg gcaagtacgt catttctggt tccttggata gcactctcaa 601 agtctggaat tggcaaactg gagaagaggt gcgtactctc aggggtcatg atgaggaggt 661 acgggcgatc gctctcacgc cggatggcaa atacgtcatt tctggttcca gtgacgagac 721 tcttaaaatc tggaattggc gaaccgggga agaggtgcgc accctaaagg gtcatggttc 781 tatggtaaca gcagttgtcg tcactccaga tggcaactat attatttctg gttcctggaa 841 cggtactctc aaagtctggg attggcaaac tggggaagtt gttcgtactc tcaaaggtca 901 tagtcaccag gtagatgcac tcgttgttac cccggatggc aagtgcatca tttctggttc 961 tgctgacaaa actctcaaag tgtggaattg gcaaattggc gaagacgtgc atacttttac 1021 tgctcatagt tcctatcggg acttaaacaa aatcaaaggg gaaaatacag tataatgctt 1081 taccagcata gctttcacgt caaagtaaca cccaatgaaa gagacaaccc ccgcagccat 1141 gcctccgtgt ttggaaaaat ggtgtcacgg gggtctaaca cgaaaaaaat gagccagcct 1201 ttgtttgttg atctgtcggc gtagttgggt actcgtagtt ataaggctgg ctcatttttt 1261 
aggggacaat gcgtcaacgg tttgatgaag cgtttactca taaagcccaa aaaagagggt 1321 ttaggcacta tttaggggga ttattgggag aaagcgaaag aaaaaacctg tcccagatgg 1381 cttctaacgc tataggagtt gaataccatc aattgcacca ctttataacc gaaggacgtt 1441 ggtctgattc caagatcaat gaacttcgat tagagattat gaacaagtgt agtcaaacca 1501 ggatcagtcg aagatttagc ttaataattg atgattctgg tcacagaaaa agtggtaatt 1561 ttactgatgg agtgggaagg cagtacattg gagaaatcgg caaaacagac aatggcatag 1621 tagtggtaac aacacattta tatgatggac gaaaaagctt gccattagat atagagttat 1681 atcagcacgg ggattctttg ccagaaggta aacaagatcc agagtttgag aaaaaaactg 1741 aactagcaat caaattaatc gatagaacta tagagagaaa atatcaacca ggaatcgtac 1801 ttatagatgc tggatacggc aacaatacat ccttcttatt agcattagag aagagacaat 1861 taaagtactt gggaggaatc gccaaaaatc gaaaaataac tattaaaaaa agaagtcaga 1921 agtcagaatt cagaattcag aattaatcgg atggggattt agaccccaac cgattataga 1981 caccgtgtag acggatgggg atttagaccc ctcgactcgg agtcattcgt acaggagatt 2041 cgtctgttct tcgttctatt ttttcttcct tctgactcct gactcctgac tcctgagttc 2101 ttggttaatg tatcagaaaa tattcaacaa acaattagga tagacgaatt agcacaaagc 2161 ttacccaaag aagattttac cgaaattcaa ctcaatttag atcaacctaa aatagtatgg 2221 gtggcaacta gagaaataga gatatcacaa ctgaaaggaa agcgaaatat tgctatcgtt 2281 atgaatgcag cgactttctc tgaagccact gatcttgact attttatcac taatgtctct 2341 acatcaattg tcacaccaga atggatagtt cagacctatt ctcaaagaaa ttgggtggaa 2401 gttttttata gagaagccaa ggggtggttg gggttaaaag aatatcaagt tcgagataaa 2461 agaagcctac ttcgacattt tatcttggtt ttttgcgcct atacttttat cctttggcat 2521 cagatgactg ggggtctgag acgaaggtgg gcttccaaac ctttgaacac ttttactgaa 2581 gcaatagatg cctttagaac agctatatct tttcgatttt ttgattggtt gactcataat 2641 cgtgacgtgt ttgccgcttc taaagctagt ttaggctaca tttgggctta atttttgttt 2701 aagtcccatt aaaacggact gatgattcac cattatttat ttctattatt attcagtcgt 2761 ctttagacga cttcacctat gagcctgcga attaattcgc aggcggacgt ggaataagat 2821 cttacaatct cccctgacac tcaaagcccg agtctttacc tctgcggcgg acacaatcgc 2881 tgcaaatctt gaggaagtcg cccacatgtc tgctgaaatg tttgcggatt gagtttgagt 2941 cgccaccagg 
caaataatcc taaacttgta ccactcacaa gcaaaatcaa caaccctcca 3001 accaacaacc accacttttt cttcgcttga ggtttcgcag gagtttgggt ttcgtctgtt 3061 accggtggtt ccgaaacaag caaatccaaa agcgcttgag aacttgctgc caactcagat 3121 tgttgttctt gttcttgttc ttgttgttct ggtactatga cttctacagg tcgcagagcg 3181 cgaagcaatg acactatagg atctggggaa acacgacaat gggtcagaac gactgagata 3241 ttatcatgag catttttctc gtttgccaat gaaatccatt gagggacagc atcttccaag 3301 gaaagttcac cttgcaggac aggtacagcg taattccgcc aagagtgttc tacccagtca 3361 ttgtcgctca aaccatcaga acacaatagc aatataccat cttcttctaa aataaattgc 3421 tgaatcacag gacgcagaaa ttcaccatct tttgtgccta atccttgact caaggaagtt 3481 gcctcggggc gttgcagtgc ttttcgatac agagtacgag cagaacggac ttcccgccct 3541 gcgacatcat catctactgt taaaagctga cagtagtcgc gtgttatcca gtaggcacgg 3601 ctatcaccca cgtgagccaa gtaaagttca tgtgtatttt gtgactcctt acccgagaga 3661 cttttgataa tttgcggtaa ttgcactgcc atcacaaggg ttgtacccat gcgttcggta 3721 ccttcgcgtt tttgctcgtt attgcagttg ctaatgacat tattgacaat tcgcaaactt 3781 gcctctattt gtttttgaat caagtccggc ggtaagatgt ctgcttgttc tttcacctct 3841 gtgagtaagg cgcgaagttg caattttaga gattgcactg ctagcagact tgcaacctca 3901 ccaccttgat gtccaccaat accatcacaa acaatggaca cccgtggtaa taataaatca 3961 tctggatcag cagtgtcact agggtagcag gcgtcttcat tttgcgtttg ttgtaaacca 4021 ggatctgttg ctcctgccac ttctagggtt aatggcaaat ctgctgcaga tgatagaagt 4081 aaaatattga gttgagcagc aatagagtct aaatttgctt cttcactaca catctgctgg 4141 actatgttgt tcaactcttg tgctactggt gtttttgcag cagcgaccca aggttgccaa 4201 cattctccca actgctgtaa agacacgtca tgaggcgttt ctacaagttc caacagtcgc 4261 acgcaccaac cttgcacgcg caaattatct ggaatgagca aactgccagc gactcccaac 4321 tgtgatagtg gtgtccacag ttggataatt tggtaaagcc agtaaacttg tcgtactgct 4381 tttgcttcct cccatgcgtc tgttattgct gggtagagat ttccggtttc atctattggc 4441 acattttcca acaggagaat attatcataa gccaacccat atacctgagg aatatgtaat 4501 ctgtgttgat ataatcgcag ataagacaca atttctttag aaaattcttc agggatgtct 4561 ggcaatagtt ctggttgagt atctagccaa atgttcgagc tagtgacgtc atagcgatgt 4621 ccaactttcc ctccaggttg 
gattttggcg gctaatgaac cagttgccca aaggtaacgg 4681 tgaactaggg gagtttgaga acttgtagaa acactatctt ctatagaatt catggcttaa 4741 gtacagcttt gatttatgca ataaattatc cgctgagttg aaatcatgca agtcggaatt 4801 tttcgagaaa tcgatgggtt aatttgggaa catcccaaat gtttaagtta tatcaaactc 4861 tgatgatttc atatccgcag ccaccgtaat ggagtgaagc gcactaactt taaacacaac 4921 gcttgcacac cgtatgccta cggcacgctt tgcgcttacg cggagtgtct gtgcaggaga 4981 tactttgaat gctaaagttt ggctcagaca agatggcact gctaaaatct ggtcgattga 5041 atcgtttgat gaactcatgg tgcgaggttg tcgtcgttgg gtaggactat ctcaagaatg 5101 gctctaatct cagttcgagc gatagcgtaa gcgcaagcgc acgccaaggg cgaacgcgta 5161 gcggctccct gctggagcta gcgctgctgc tttgctgcag atcgttatct ctgtcatggc 5221 attgacactt cccaaccata ggtatgggcg atcaaaccag cgatacatta caccgtttag 5281 ctaattaagt aaattatgag aaaaattttt gctttgcttt cttttagtct aggaattgct 5341 ctggttaacc ccacctattc agttgctcaa gttcctaacg taccatctgt acctgttcca 5401 aacaaccctc agcaactacc accacaacaa ccagatcaag ttgaggaatt gccaatacca 5461 ctgcaagtca cagacagaaa aacgtgtcaa gtgacatacc aatgcttggg ttgggatgaa 5521 caaatttggg gtggcaaggg taaaaagggc gataaacaag cgctattggc atcaattgat 5581 aacagcttag gttatttgca aaccaacaaa gcaaatacag tgtatcaaaa ttaccccatt 5641 aaagaaatta cacttgatcg cgtccgtcgt agcttaatac gttttcgtca actggttgtt 5701 aattctaagt ctatatcaca attacaagct gctgtcaatc gggaatttgt cttttacaag 5761 tcggttggta atgacggtaa ggggactgtt aaatttactg cttactacga gccgctttat 5821 actgccagtc gcacccccac agcagtttat aaatatcccc tatatggacg cccactggat 5881 ttcgatgggt gggctaaacc gcaccccaaa cggattgatt tggaaggaca agatggcttg 5941 ctaaaagatc aaagcccgtt gcgcggtttg gagttatttt ggtttcgcga tcgctttgag 6001 gcgtatatga tacatattca ggggtctgcc caactcaagc ttaccgatgg tacaaaaacc 6061 tctgttggtt ttgcaggtgc aacggattat ccttggacta gtatagggcg ggcactcatt 6121 aaagatggta aactgtcccg agaaaaagca acaatgccag gtataattag gcattttcgc 6181 gaaaatcctc aggatttaaa tgaatacctg ccacgctggg aacgctttgt cttttttagg 6241 gaaaccaacg gtacgcctgc tactggaagt ataggtgtgc cagtcacccc agagaggtct 6301 attgccacag ataagtctct tatgcctccg 
ggggcgctag cgctgattta ctcttcattt 6361 ccctatccca aaaaaggagg agggttggag tatcgtaagg tgagccgttt tgtacttgac 6421 caagatacgg gaagcgccat caaaggtcca ggacgggttg attattttat ggggactggc 6481 tacttagcgg gcgatcgtgc tggtgtaaca ggtggcaatg gagaacttta ctacttacta 6541 ctgaaataat gctactcccg aaccgctaga aaattaccta tcttcttttt tcttccttag 6601 cgtccttagc gtccttagcg gttttttaaa taggtattct ttgggcggtt cgggagtaat 6661 agtcatcaaa atttactctc aaaaattaat ctttgctttc tggaaagact caatgcatga 6721 gtgtatctaa tcagaaaagt gcattgatgg cttaaatgta ttgtctagga tggcattatg 6781 ccaagtacta ctttaatttg gcttttggct ggttcaggtc tttgtttgtt agagctgttt 6841 atgccaacag cctttgctgc tttcctgacg ggaattagcg ccctgattgt ggcgtttcta 6901 tctcaagcga ttttgggcaa gttatggcta caaattgtgg tttggctgtt cctttccaca 6961 gtcctagtca tactttcccg tcggtttatg agtttgccaa aacgcaaaac gaaaattcag 7021 catgcaatta tagccgaaac tttaactgag attccagccg gaaaaccagg acgagtgctg 7081 tatgagggaa attcttggcg ggcacgttgt gacgatgaga cgctcatcat accacccaat 7141 caaagagttt acgtcacaag acgcgaaggg acgactctga ttgtaatgcc agaaaatcta 7201 ttggactcgt aattctctct taaatcagtg aactcttaac agggaactct taacagggaa 7261 cagggaacag aggaagcgat ttcttgaccc cccttaactt ggtaactggt aactggtaac 7321 tggtaactga taactgttaa aaaattggac aatttattat ggaacagttt tttttactta 7381 tttttcttgc tctaggcggt tctgccttag caggttctgc gaaagttgtg aatcagggta 7441 atgaagcttt ggtggaacgt ttgggaagct ataacaaaaa gttagaacca ggacttaact 7501 ttctcgttcc cttttttgat cgggttgtct tccgcgaaac tatccgagaa aaagttttag 7561 atatcccccc tcaacaatgt attacccgtg ataacgtgtc aattaccgct gatgcggtgg 7621 tttactggcg gattatggat atggagaaag cctactacaa agtggaaaat ctccaggcgg 7681 cgatggttaa tttggtgctg actcaaattc gtgccgaaat gggcaaattg gaattggatg 7741 aaacctttac cgcccgttct caaattaatg aaactttatt gcatgacttg gatgttgcaa 7801 ctgatccttg gggcgtgaaa gtgacgcggg tggaactgcg ggatatcatc ccgtcacaag 7861 ctgttaggga gtcgatggaa ttgcaaatgt cagcggaacg ccgcaaacgg gcagcaattt 7921 taacttctga aggcgagcgc gaatcagccg tcaataacgc cagaggtgaa gctgacgccc 7981 aagttctgga cgccgaagcc cgtcaaaaat ctgtcatttt 
gcgagcagaa gctgaacaaa 8041 aagcaatcgt tctgaaagca caagccgaac gccagcagca ggttcttaag gcgcaggcga 8101 tcgccgaatc tgcagatatc attgcccaaa aaatgaaaaa taaccctctt gctcatcaag 8161 ctttagaagt tctatttgcc ttgggttacc tggatatggg cgcgacaatc ggcaaaagcg 8221 atagtagcaa ggttatgttt atggacccgc gcacgatacc tgctacctta gagggtatac 8281 gctcgattgt ctcagatggt cagcctgact caaatatccc gtatacaggg gaagtgtctc 8341 cagggaataa tcatcgctct agttagaatg aggcgagtga gaacaagagg acaagagaaa 8401 tcatcactct tgactcttga ctcttgactc ttgactaatg actcttgact aatgactaat 8461 gactaataac taatgactaa taactaatga ctaatcagca gtttgagcta attggcattg 8521 agtacacaaa ccgaaaaatt ctagagtgtg gtaaaaaatc ttaaacttgt gtgtagactg 8581 gagctgaccc tctaggtcat ggacgggaca ttgattaatc gggatggaag caccgcactg 8641 tagacaagtc aggtggtgct tgtcttgctg ggctaagttg tagagggctt caccgttacc 8701 taatgtccgc acttgcacca gaccttcaag ttttaaggat tctaatgagc ggtaaactgt 8761 tgccagaccc atgctttggt tacggttacg taactcgaca taaatatcct gggcggaaat 8821 gccttgttta atgcttttga gcaagtttag aatacgctct tgactgcggg tgcgtacggc 8881 tctcatagac aattagtata gtaataaaac gccaattgta tttgaaaaag tttattttaa 8941 agtattctcc tgcttaactt ttagcttaat tctcataact tgtcctccta ttgtcgccta 9001 attgggacag aaagatatag tatgagctta gcagtattca gcgaaagcca gtgcaaagga 9061 agtatttagc caaaattttt gtcactctcc gtccttcagt tctggaccct gctggagtcg 9121 cagtgcaatc tgggctgaag caaatgggat acgatgatgt tgagcaggtg cggattggca 9181 agtacattga agtgactttc acctctcaag atgaagatac agcccgtcag aacttggatc 9241 ggatgtgtga ccaaatgctt gcaaatccag tgatcgaaaa ttatcgcttt gatttgatag 9301 aggttgagtc acagacggga gtttattaat ttgctttttc agagtcatta gtcgtttgtt 9361 ttataagctc gaatgactaa tgacaaatga ccaatgacta atgactaata acatgaaatt 9421 tggggttgtt gtttttccgg gttctaattg cgatcgcgac gttgcttacg taacaagaga 9481 cttgttgaag caaccaactc gcatggtttg gcatcaagaa actgatattt ctgacgtaga 9541 tatcatgatt ataccaggtg gctttagcta cggggattat ttacgctgcg gtgcgatcgc 9601 ccgcttttca cctgtcatgc agcaagtcgt cgaacatgct caaaagggga aatttgtcct 9661 cggtatttgt aatggtttcc aagtcttaac tgaggcaggc ttgttaccag 
gggcgttggt 9721 gaagaatcgg gatttgcatt ttatatgcga tcgcgttcct gtgaaagtcg agcgtacaga 9781 ctttgcttgg acgcaaggtt atcaaaatgg tgaaatcatt actttgccaa ttgctcacgg 9841 agaggggcga ttctacgcag atgattccac attatcagag attgaaaata acggtcaagt 9901 cctgtttcgc taccagggag ataatcccaa cggctcagtg aacaacatag ccgggatttg 9961 caaccgtcaa gggaatgtgt tgggaatgat gccacatcca gagagggcat ctgactcaat 10021 gctgggtggt agcgatgggt taaggttgtt ccagggattg ttggagaaag tgggggcgtt 10081 ggtgtagaaa gggtggggac acagaattgc tgtgtccttg tttagtgcat tttaatggac 10141 ttcgcctatt agcctgggac ttacagtcct aggcggacga taacggagtg aaaaaacatg 10201 taattatttt gaacactaat tttactagat ccaagatgta agggaagaaa aaactaagaa 10261 atcttacacg cctgatgaat gtctaatccc ttgaaatatt gctgtctgga agaatttcat 10321 agtaaccatt tatcttaagc ttaatatcca aagcatctgc agaaattaat gtttccaggt 10381 caatttcaat ggactgtcct tcatgcaata caaaatatcc ttcttcttga cttgctaaac 10441 cacttagcga taacataatt tctcgcttgg tatctgtttg agagtctgtg taaataatag 10501 acaatgattg aatataagtt atttcagatt cccgttcttc tatcctaatt tttgatatat 10561 cttcaccaag cttataaatt tcgtcacttt gaagtgactt gtgttttctg ccgtatagaa 10621 ccgtacctaa ttctttccaa taaccttttt tggtattgta aaccatcaaa taaggacagc 10681 tgcctacatt gaaatgaatt gacattgaaa atttaggtgt atcatctgga gaatcaattt 10741 ttatctcttt cccatcaatt tgtagagaga caacttccat cagtgaacca acagcaaatc 10801 ttttaggatg agaatttaag aaatctttta aaggtttagt tttagcaagg aaatccggag 10861 acaaatttat ggtatcagta attagtgtat tcattaattc cggtgttgtg ccgtacaaag 10921 tatcccagtc tatttgtttt ctccctaaaa cgtctgtaaa atttttgaaa aggctatcat 10981 cttcatcagg atttttagaa acgtacaatg tatctttgaa aaatgaaact gatcctatgg 11041 gattgttttc tgaaacataa ctgaagggtt tctgatgatt ttttgtatca aagccaaact 11101 cagtcggtat cagtaaatga tgctgtggag gaagtatgat attaataggc tgagtaactt 11161 gagtactaga cgaaaagaga tgtcgtcttt cttgagaaga aacttcagtt aacttatctt 11221 caccattctc aacaacattc aaatttaacg cttgtatctt aatagctcta ttttgaatat 11281 tcttgatatc caaaaatctt aaataaggag taggaggtat atactcaact atcctaccat 11341 aacattcacc aaaaaacgaa tgttttaagc tttttatata 
ttgatataaa aataaaataa 11401 accctcttgt ttctggattt gctttaagaa ttcgctttat ccaagtttca tctactttta 11461 gataaaatgg tactcctagt ttacatgttt tgactgctgc taattttgga tactgcaata 11521 ttctgccaat atttgtgcct tcagactcat atatagatgt ccattccgca tcttcaacag 11581 aagtttgaaa cgatgcagaa atatgaggaa agttaaatcc taactcgtcg agcttataat 11641 ctaatgtatc attttcaaag tctttattat tttgagtaaa gttctctgtt gtgttatttt 11701 cagacaaccc acctgcacca tgtgaaatag tttttgtgta accgttatta ttaagagcag 11761 ccgagatgta aaaatttgtt ttcggatgat atttatatga aattaaaaac tcatttcctg 11821 tcccaacagc aaaattttct aaacttttaa agaccggatt atctgtataa ataataggtg 11881 tatgattaat tcctaagata gttgttaact tattcggcat aatttcttct aaaaaagaca 11941 gagtagaata ttcgttcact cctgttatat gtctatagat tattgattct tttagagaat 12001 attctcctgg aaaatcacta gcagataatc ctttcaaacc ttcagacaaa ccttttaaaa 12061 cggactcatg tacagcttgt tgaatagcta cggctgtatc tccttggtaa atacaatcag 12121 ctacttggat tgtaatgtct cgtgctttgc gaatatcaat gttcgtctta cctatttgta 12181 atgtattttt atcaccgttt attaacatta tatgacgtag agtattaagg tcactttcgg 12241 tgtgagtttt atctaagatt cttttaacaa tagcttctat ttcttcagaa gagtatgaca 12301 ttgttcaacc ttgctgatat tttgatatgt tgagagtatg tttgcagatt ttccaatcac 12361 tatgcctatt tagtaaaacc aaaatatagt atcttacaga atctaaatct tgagtttcac 12421 tccctatatt taaaataatt gcaatagtag attatgtaga tacaccctag tgttatttta 12481 atctatggca cttcgtgcca tgctacgcta tcagttagac tcaaatgcag tctgtaattt 12541 aaagattttt acccaaacta atatctcttg ctttacctat atttaggttg gtatctccta 12601 tttgtacggt attgccgtta ccagaagtta tagggctgtt gctaatatct cgcccagcgg 12661 ctacactgcg atcgcctgtt gcattaactt cctgacgagc gttctcaaat gcttcttctt 12721 gcgtagttga tttatcagcg acagataacc ctttgccacc tagtaataga gcgatcgcaa 12781 aatcttctgt tgctgcatca aaaaatggtt tttgctccgc ttggtgcatt gtctgggcac 12841 tttccggtac tgattttccc agatgattca ttaaattgga aataagaaca acggtgtcac 12901 caggctgatt attagcacct tgtagggctt caatcaaatg gtaagtatag atgctcattg 12961 cgtcatcaga gcgtatccag gataactgct gcccacggga tgaagtgaaa acagcccttc 13021 cttctccttg cttgaggtta 
tcaaccaggt tcttaggagg agcagtttgt gttaaaccgg 13081 gaggcagctt aatagctggg tttccatttt tcgcagtagc tattccttct gcatggcagc 13141 agtcaatcac taccagcagt cgttgtgctt tgatttgccg cagtgcgtta gtaaagtctt 13201 ttgcacaaag ggctgagttg ggaatgtcaa aaggttctac atcgtgttgg attaagtaat 13261 attcacctgt agtttggtca agccagccgt gaccggaata ataaactaca actgtagctt 13321 ctgaatcgtt agcagcttgt acttttagcc attccagatc atctaaaatt ttactgcggg 13381 ttgcactttt atcatgcaac aggcgtatgt ggtgttcatc attaggatag gcgcaaaaat 13441 tggggtcagt aagaacagac ttaagtgctt gagcgtcctt aacagtaacg ggaagtgacc 13501 agttcgcata agcgcattcc cctactccaa caagtaaagc gtagccgtga gtaaagattt 13561 gggacataaa gttctccatc aaaacaattc gtattgttga ttactgttgg gtgttatccc 13621 attggttatt ttaatcacca gcttcctatt gaaacgggac aaatccgatc catagcgtaa 13681 tcaatcgtcc caatcaactc gtcaacgctg gctagtctta cccgatgatg gatgccttat 13741 ccctaccaaa ttcccactct tcaatttacg tcgaaaattg gtaacttcaa gtgctgcaaa 13801 ggagcgattc tgcccttccg aacgcgataa ttaaactaga aagtaattta agtattattt 13861 gacgtagcct ttaggctacc gtcaagttct ttgttgagca atatagtttt tgacc // LOCUS NODE_2452_length_13834_cov_4.691560 13834 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13834) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13834) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13834 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(140..1261) /locus_tag="DP116_20370" CDS complement(140..1261) /locus_tag="DP116_20370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744969.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_20370" /translation="MRLFKACSSWSQPWQKSSTRARTLLVAAGLSVGAVVPVYAGMTS QTLQNPTSSSKPELISQNQKPLELTLVSYAVTKEAYSKIIPLFVKKWKQERKQDVTFR ESYGGSGSQTRAVIDGLEADVVALALGQDVNQIQKAGLIGANWEKEVPNNGIVTKSVV ALVTRKGNPKQIRDWNDLVKPGVSVITANPKTSGGARWNFLGLWGSVTQSGGDDAKAL DYVSKVYKNTPVLPKDAREASDTFYKRNQGDVLLNYENEEILAKQKGESNNDVVIPQV NISIDAPVAVVDKVVDKRGTRQVAEAFVKFLFTPEAQREFAKVGFRSVNSTVAGETQN QFPKVGKLYTVSNFGGWDAVQKKFFADGTVFDKIQGGRR" gene 1645..2418 /locus_tag="DP116_20375" CDS 1645..2418 /locus_tag="DP116_20375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140028.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class II glutamine amidotransferase" /protein_id="PRJNA477356:DP116_20375" /translation="MCQLLGMNCNVPTDICFSFEGFAKRGGKTDDHRDGWGIAFFEGK GCRIFLDAKPSIFSPVADFVRHYPIHSTHVIAHIRKATQGEVILENCHPFRRELWGRY WVFAHNGDLKDFHPKDFNFYQPVGNTDSEKAFCLILERLRECFPQNKPPKEKLYSILG EITQTLAEKGTFNYLLSDGEHFFAHCSTKLSYIVRQAPFAAAHLIDQDMTVDFSELTT PSDRVAVIATTPLTDNEVWTPILPGELLVFQDGLPHKFP" gene 2591..3736 /locus_tag="DP116_20380" CDS 2591..3736 /locus_tag="DP116_20380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318656.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_20380" /translation="MGFWQQILESRQIFLKIVHKFKLNSIKGFLSLFLVGVSLSVAIA ACSGNGASDSSTSTDAGGSTAKPVSANNQDVRLTLVSFAVTKAAHDAIIPKFVEQWKK EHNQNVSFEQSYGGSGSQTRAVIDGLEADVVHLALGLDVDKIQKAGLISPGWEKEFPN NGVVSKSVAALITRSGNPKGLKTWADLAKDDVKLITADPKTSGIARWNFLALWNSAMK TGGGEQKALDFVTKVYKNVPILTKDAREATDIFAKRGQGDALINYENEIILAQQKGEK LDYIIPDVNISIDNPIAIVDKNVDKHKNREVAEGFVKYLYTPEAQQEFAKVGFRPVEE TPQTKELASKYPKIKNLGTVQDYGGWDSIQKKFFEDGATFDKIQAKK" gene 3823..4710 /gene="cysT" /locus_tag="DP116_20385" CDS 3823..4710 /gene="cysT" /locus_tag="DP116_20385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308946.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sulfate ABC transporter permease subunit CysT" /protein_id="PRJNA477356:DP116_20385" /translation="MAVSSPRSSSQRELARQKPPAWKSYLVGLAKLPWTWRITIVYLA VMLFLPVTAMLLKASTEPPAKFWEIATSEIAIATYEVTFLTALAAGLINGVFGTLIAW VFVRYDFPFKRLLDATVDLPFALPTAVAGLTLATVYSDNGWIGSLFAPFGIKIAFTRL GVAIAMIFISLPFIVRTVQPVLLEMEKETEEAAWSLGASESQTFWKVILPPLFPSILT GIALGFSRAVGEYGSTVIVASNTPFNDLIAPVLIFQRLEQYDYSGATVIGVVLLAISL VMLLGINLLQAWGRRYDAK" gene 4697..5593 /gene="cysW" /locus_tag="DP116_20390" CDS 4697..5593 /gene="cysW" /locus_tag="DP116_20390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860955.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfate ABC transporter permease subunit CysW" /protein_id="PRJNA477356:DP116_20390" /translation="MTQSDDFPQPHFHSTTGAEPNTQKRVKPRNIAPVILIGIAVFYL ALVIYIPALNVFVQAFKGGIGKFISNLSAPNFLHAAWLTLILAVITVPINTVFGLCAA WAIARRQFPGRALLLSIIDLPFSISPVVAGLMVVLLYGRNGWFGPALQALDIKVIFAF PGMVLATAFVSMPFVAREVIPVLEEIGGDQEEAAKTLGANEWQIFWRVTLPSIRWGLL YGLLLTNARAMGEFGAVSVVSGNIEGKTQSLPLFIEDSYKQYQTEAAFSAAVLLGLLA VVTLVVKEIVERNTGKKTNVKH" gene 5776..6042 /locus_tag="DP116_20395" CDS 5776..6042 /locus_tag="DP116_20395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860954.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_20395" /translation="MVSDNQLTHKRIRLRIPKDYHQEPVISRLVSNYGLTVNITAAIL GSNGIGDGWFDLDLQGTSAQIDSALIYINDLNLEIWHDIDTGSW" gene 6248..6544 /locus_tag="DP116_20400" CDS 6248..6544 /locus_tag="DP116_20400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20400" /translation="MTMIAPAFQSSNITKIRLRLHIPGHYQQEPVISRLIAIHGLVVN ITGAMLGKQTNGEGRFDLELRGTIPQIRHGLAYLESLNLKIVGKPNTEGDGWSC" gene complement(6556..7143) /locus_tag="DP116_20405" CDS complement(6556..7143) /locus_tag="DP116_20405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318651.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Crp/Fnr family transcriptional regulator" /protein_id="PRJNA477356:DP116_20405" /translation="MSISSRFQNPPEQNTKLSFKARSFLPLKHNSLLKIETGVVRIVT WHEDGTLVTLGIWGPGDIVGQAFSKLEPYQIECLTKVEVTILPLEGWFPFTDVMLAHI QQAEELMVIRSYKTVEIMLIKLLGWLAKKFGREVKSGHLIDLRLTHQDIADMLNSTRV TITRVLTQLEEQGLIHRLPLHRIVLKEEEVWHYEI" gene 7217..8242 /locus_tag="DP116_20410" CDS 7217..8242 /locus_tag="DP116_20410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875143.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sulfate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_20410" /translation="MGIVVENVTKQYGSFLAVDNVSLEIPTGSLVALLGPSGSGKSTL LRMIAGLDTPNSGHIWLLGEDATYKSVQERQIGFVFQHYALFKHLTVRQNIAFPLEIR KFAKSKIGPRVEELVDLVRLQGFGDRYPSQLSGGQRQRVALARALAVQPRTLLLDEPF GALDAKVRKELRAGLRKLHEEVGITTVIVTHDQEEAMEVADQIVVMNNGRVEQVGTSG EIYDHPATPFVMSFIGPVNVLPPTAGVLPERDLNPRTNEQVFLRPHDVLIQTSPTEDA TPAKVLRILNLGWEIRVELVLKSGETVNAFLSRDHFSKLNLKEEQRVYVTSKQAKVFP APAYAVR" gene complement(8527..9078) /locus_tag="DP116_20415" CDS complement(8527..9078) /locus_tag="DP116_20415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006103897.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20415" /translation="MQTLETYKSILETGQTTTEKALQLFDALEPVDLAFMLGRWQGSG FHTNHPMDGLLEKFNWYGKEFVDPENVHPLLFLDSQGKVIKVAPSYMGATNWVLKFPI LKNDFLKPLVILTNSLLKTEKSQARLRMTEYRGKVSATMIYDNFPVNDSFRKVDDNTV LGIMDYKNLPQPFFFILRRSVTN" gene complement(9256..11142) /locus_tag="DP116_20420" CDS complement(9256..11142) /locus_tag="DP116_20420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137807.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cell division protein FtsH" /protein_id="PRJNA477356:DP116_20420" /translation="MKFSWRVLVLWTLPALVIGFFFWQGAFSGTPTDMSKNTATTRMT YGRFLEYLDANRVSNVDLYEGGRTAIVEAVDPELDNRVQRVRVDLPSSAPELISKLKE KGVSFDAHPMRNDGAIWGLLGNLIFPILLITGLFFLFRRSSNLPGGPGQAMNFGKSKA RFQMEAKTGVKFDDVAGIEEAKEELQEVVTFLKQPEKFTAVGARIPKGVLLVGPPGTG KTLLAKAIAGEAGVPFFSISGSEFVEMFVGVGASRVRDLFKKAKDNAPCIIFIDEIDA VGRQRGAGIGGGNDEREQTLNQLLTEMDGFEGNTGIIIIAATNRPDVLDAALLRPGRF DRQVTVDAPDVKGRLEVLKVHARNKKLDPSVSLEVIARRTPGFTGADLANLLNEAAIL TARRRKEAITILEIDDAVDRVVAGMEGTPLVDSKSKRLIAYHEIGHAIAGTLLKDHDP VQKVTLIPRGQAQGLTWFTPNEEQGLITRGQLKARITGALGGRAAEDVIFGSAEVTTG AGNDLQQVSGMARQMVTRFGMSDLGPISLESQQGEVFLGRDWMTRSEYSESIASRVDA QVRLIVEECYQTAKRFIQENRIVIDRLVDLLIEKETIDGEEFRQIVAEYTSVPEKSQF MPSI" gene 11845..12372 /locus_tag="DP116_20425" CDS 11845..12372 /locus_tag="DP116_20425" /inference="COORDINATES: protein motif:HMM:PF00857.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cysteine hydrolase" /protein_id="PRJNA477356:DP116_20425" /translation="MNFCLFVIDIQNGFIAPNTSHVIQRVKSLLEQNLFEYVIFTRFR NTLDSPYVRYLNWNKLFSETEQKIVDELEPFAKLVFNKTIYTACNEETLNYLKKRDIH QVFICGIDTEGCVLKTAIDFFENNINPYILEYYSASNGGENFHQAAILVLSQLIGRSN IITEPIDKFHFSKYL" gene complement(12411..12494) /locus_tag="DP116_20430" /pseudo CDS complement(12411..12494) /locus_tag="DP116_20430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742869.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="Swarming motility protein ybiA" gene complement(12851..13375) /locus_tag="DP116_20435" CDS complement(12851..13375) /locus_tag="DP116_20435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876489.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20435" /translation="MLTKEISRSIRLSLMIAALAISQRLDAPRIASAINTISRAQETS PPFFEDVVIDQNFSPDPFIVRGMSGGSVPGREIARRRETPTGTCTGYFDEEPDHTIEL TSKFDYLKIEVRSPEDTTLIIRGPGGSWCNDDFDGKNPGMIGEWLPGTYYVWIGSFKK DRYFPYTLRITQIK" gene 13490..13732 /locus_tag="DP116_20440" CDS 13490..13732 /locus_tag="DP116_20440" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20440" /translation="MSISLALDNSDSQAEVQINAPLLLQQFSVGKANWGWSKIHVKIQ LLMSRGHACLNGKTAHQIDRRKNSEITNSVVIGSYT" BASE COUNT 3909 a 2979 c 2876 g 4070 t ORIGIN 1 gagcacctag cgttttcccg acaacaggcg actggcgtat ctcctccgga gacgctgcgc 61 gaacgccttc aggcgtgcgc tttgcgcata cccgtaaggg caacgtctct acattgctaa 121 cagcaaccgt attctttttt tagcgtcttc ctccttgaat cttgtcaaaa acagttccat 181 ctgcaaagaa cttcttctga actgcgtccc agccaccgaa gtttgaaact gtgtagagtt 241 tgccaacctt cggaaactga ttctgtgttt cgccagctac tgtagagtta actgaacgaa 301 acccaacttt cgcaaattcc cgttgtgctt ctggtgtgaa gaggaacttg acaaaagctt 361 ctgctacttg gcgagtaccg cgtttatcca ccaccttgtc tactacagca actggggcgt 421 caatggaaat attgacttga ggaatcacta cgtcattgtt agattcacct ttctgtttag 481 ctaggatctc ctcattttca tagttcaaca atacatcccc ttgatttcgc ttgtagaaag 541 tatcactggc ttcccgcgca tcctttggca gcacaggggt atttttgtaa accttagaga 601 cataatcgag ggcttttgcg tcatcccctc cagattgagt tactgaaccc cacagtccca 661 agaagttcca acgagcacct ccagaagttt tgggattggc tgttatgacg ctgactcctg 721 gtttaaccaa 
gtcattccaa tcacgaattt gtttaggatt gcctttacga gtcaccaacg 781 caaccaccga cttcgtgaca atcccattgt taggaacttc tttttcccag ttagcaccga 841 ttagccctgc tttctggatt tggttcacgt cttgtcctaa cgccagcgcc accacatctg 901 cttctaaacc atcaatgact gctcgtgttt gggaaccaga accgccgtaa ctctcccgaa 961 aagtgacgtc ctgcttacgc tcttgcttcc acttttttac aaacagagga ataatttttg 1021 agtaagcctc tttggtcaca gcgtaggaaa ccagagttaa ttcaagtggt ttttggttct 1081 gactgatgag ctcaggtttg ctacttgaag ttgggttttg caaggtttgc gaggtcattc 1141 ctgcatagac aggtacaaca gcaccaacgc ttaaacccgc agcaacaagg agggtacggg 1201 cgcgagttga agacttttgc caaggttgtg accaagacga acaagctttg aatagacgca 1261 tgattaagaa ctcctccatc actaaaaata cggttgtttg atttaaaaat cgtattttat 1321 tgatctaata caattccata tacgaattaa aaaatcaagt atctgtgcta cttaacactc 1381 atgtaataaa cttatttatt aatactcagt gagctaaaac cattactgta agctggtttc 1441 ctaacttgaa acaattacca ttgtgaggta ttgagtgggg agccaagcga agaatttttg 1501 cttacatccc attgttaatc ccaagctcct acggaacaca agagcgaaat cttctagaat 1561 gaaacattag taccagataa attgagcaat tctggattat cttgaaccca gccactagtc 1621 taaagtttga gagtttcaac accgatgtgc caactgctgg gaatgaattg caatgtacca 1681 actgatattt gcttttcctt tgagggcttt gcaaagcggg gaggcaaaac tgatgatcat 1741 cgcgatggtt ggggtattgc tttctttgaa ggaaaaggtt gtcgaatttt tttagatgct 1801 aaaccctcga ttttttctcc cgtagccgat tttgtgcgac attatcccat ccactccact 1861 catgtgattg ctcatatccg caaagctacc caaggtgaag tgattttaga aaactgccat 1921 ccctttcggc gggaactgtg gggaagatac tgggtgtttg ctcacaatgg agatttgaaa 1981 gattttcatc ccaaggactt taatttttat caacctgtag gcaacacgga tagtgaaaaa 2041 gctttttgtc tgatactaga aagactacga gaatgtttcc ctcaaaacaa gcctccaaag 2101 gagaaacttt actccatact tggggaaatc actcaaacac tagccgaaaa aggtactttt 2161 aactacttac tttcggatgg ggagcacttt tttgctcact gctcaacaaa acttagctac 2221 atcgtacgtc aagcaccttt cgcagctgct catttaattg accaagacat gactgttgat 2281 tttagtgaat tgacgactcc aagcgatcgc gttgctgtta tcgccaccac tcctctaact 2341 gacaacgaag tctggacgcc aatcctaccc ggagaactcc tagtctttca agatggttta 2401 ccccataagt ttccataact 
ggttcccaca acagtttcta cagcaatagc tcctgaaatt 2461 tccacttttt ttatcaagaa agtgatgaat ttctcttgca ttttcggcaa attctatgta 2521 gtatctctga caatgagcta taaatacggt taagttaccg ctatgccgta aaatgtcagg 2581 gaaaaacaac atgggttttt ggcagcaaat cttggaatca agacaaatat ttttaaaaat 2641 cgttcataaa tttaagctta attctataaa aggctttcta tcactgtttt tggtaggagt 2701 cagcttgagt gtagcgatcg ccgcttgctc tggtaacggt gcaagtgatt cctccactag 2761 tactgatgca ggtggttcta ctgcaaaacc tgtgtctgcg aataaccagg atgttagatt 2821 aactcttgtt tcctttgctg tcacaaaagc ggctcacgac gctattattc caaaattcgt 2881 ggaacagtgg aagaaagagc ataaccagaa tgtcagtttt gagcaaagct acggcggttc 2941 aggttctcaa actcgtgctg tcatagatgg tttagaagca gatgttgttc acttagcact 3001 aggcttagat gtagacaaaa ttcagaaagc tggattgatt tcgccaggat gggagaaaga 3061 attccccaat aatggcgttg tctccaaatc tgttgctgca ttaatcactc gttcaggtaa 3121 cccaaaaggt ctcaaaactt gggcagattt ggcaaaggat gatgtcaaac tcattaccgc 3181 tgaccccaaa acttctggta tcgcccggtg gaatttccta gccctttgga attcagcaat 3241 gaaaactggt ggaggagagc aaaaagcgct ggactttgta actaaagttt acaaaaatgt 3301 gcctatttta acaaaagatg ctcgtgaagc aactgatatt tttgcgaaac ggggtcaagg 3361 agatgcgctg atcaactacg aaaacgaaat tattctggca caacaaaagg gcgagaaact 3421 agattacatc attcctgatg taaacatctc tatcgacaat ccaatcgcaa ttgtagacaa 3481 aaacgtcgat aaacacaaaa atcgagaagt tgctgaaggt tttgttaaat acttatacac 3541 tccagaagca cagcaggagt tcgccaaagt cggattccga cctgtagaag aaacaccaca 3601 gacgaaagaa cttgcaagta aatatcccaa aatcaaaaat ctgggtacag ttcaagatta 3661 cggtggatgg gattctattc agaagaagtt ctttgaagat ggggcaactt ttgacaaaat 3721 tcaagcaaaa aaataaatag tcattagtca ttggtcattg gtcattagtc agaagtgact 3781 actaatgatt ccctcactcc cttacttcct aactcactgc ttatggctgt atcttcacct 3841 cgctcatctt ctcaaagaga actcgctcgt caaaaacctc cagcttggaa gagctacctg 3901 gttggtttgg ctaagcttcc ctggacatgg cgaattacta ttgtgtactt agctgtaatg 3961 ttgtttttac cagtgactgc gatgttactg aaagcaagta ctgaacctcc ggctaagttt 4021 tgggaaattg ccaccagtga aattgcaata gcaacatacg aagtcacttt tttaacagcg 4081 ctagcagcag gtctgattaa tggtgttttt 
ggcacactca tagcttgggt gtttgtccgc 4141 tatgattttc cttttaagcg tttgcttgat gcgactgtag atttaccgtt tgcattacca 4201 actgcggtag cagggctaac cctagcaaca gtttacagcg acaacggctg gattggctct 4261 ttatttgcgc catttggaat aaagattgct ttcactcgct taggtgtagc aatagcaatg 4321 atattcatat ccttaccatt tatcgtcaga acagtacaac ctgtactgct agaaatggaa 4381 aaggaaactg aagaagcagc ttggagcttg ggtgcatctg agtcgcaaac tttctggaaa 4441 gtcattttac caccgttatt tccctcgata ttgacaggta ttgccttggg tttctctcgt 4501 gcagtcgggg agtatggctc gactgtaatt gtagcctcca acactccctt caatgactta 4561 attgcacctg tgctaatttt ccagcggtta gagcagtatg actattctgg agccaccgtt 4621 atcggcgtgg tgttactagc aatttcgttg gtcatgctgc tgggaattaa tcttttacaa 4681 gcttggggac gaagatatga cgcaaagtga tgattttcca caaccgcatt tccattcaac 4741 gacaggagca gaaccaaata ctcaaaagcg tgttaaacca agaaatattg caccagtcat 4801 tttgattggg atcgcagtgt tctatttggc tttggtgata tatattcctg ccctcaacgt 4861 ctttgtccaa gcttttaaag gcgggattgg taagtttatt tctaacctca gcgcacccaa 4921 ttttcttcat gcagcttggt tgacactaat attggctgtg attactgtac caataaatac 4981 agtatttggt ttgtgtgcag cttgggcgat cgcccgccgc caattcccag gtcgtgctct 5041 tcttttaagc atcatcgacc tgcccttttc tatctcacct gtggttgcag gactcatggt 5101 tgtactactt tacgggcgca atggctggtt cggtcccgct ttacaagcac tggatatcaa 5161 agttatcttc gcttttccgg ggatggtgct ggcgacagca ttcgtgagta tgccctttgt 5221 agcgcgtgaa gtcattcccg ttctggagga gataggcggt gatcaagaag aagcagcgaa 5281 aacgctgggt gcaaatgagt ggcagatatt ttggcgcgtc actttgccaa gtatccgttg 5341 gggtttactt tacggcttac tcttaaccaa cgctagagca atgggtgaat ttggtgctgt 5401 ttcagtagtt tctggaaaca ttgaaggcaa aactcagagc ttacctttat ttatagaaga 5461 ttcctacaaa cagtatcaaa ctgaagcagc attctcagct gctgtcttgt taggactgct 5521 tgcagtcgtg actttggtgg tgaaggagat tgtggaacgg aatacaggta agaaaacaaa 5581 tgtaaaacat taaaacagtg aacagtaaac agtaaacagt gaacagtgaa cagtaaacag 5641 tgaacacaaa tcagcggtta tgagcaacaa gtactgataa ctgataattg gtaactgata 5701 actgataact ggtaactgat aactgataac tgataactgt taaagcagat ggttagtcaa 5761 ggggtaaatt tacctatggt ttcagacaat caactcaccc 
acaaacgaat tcggcttagg 5821 attccaaaag actatcacca agaacctgtc atttctcgct tggtgtcaaa ctacggtctg 5881 actgtgaata tcaccgctgc tattttgggt tccaatggta ttggagatgg ttggtttgac 5941 ctagacttac aaggaacttc cgcacaaatt gatagtgcgc tgatctacat caatgacttg 6001 aatctagaaa tttggcatga cattgacaca gggagttggt gatgacgaat tcgtaatttg 6061 taatgactaa gttaatagac ctcataaagt ctcttacaaa agtcaacttt ttgaagatga 6121 aaccacagag caacacagat gtacataaat aatttatctg tatttatctg cggtttcaaa 6181 tatccttaaa tcagattttt gtaagaaatc tcatgacgaa ctaaaagtta tcaattacga 6241 attattaata actatgattg caccagcctt tcaaagcagt aatatcacaa agattcgctt 6301 acgcctgcat attcccggac attaccaaca agaacctgtg atttctcggc tgattgccat 6361 tcatggttta gtcgttaaca ttacgggggc aatgctagga aaacaaacca atggagaggg 6421 acgctttgac ctggaacttc gtggaacaat accacaaatt aggcatggtt tagcttacct 6481 agaatcttta aatttaaaaa ttgtaggtaa gccaaatact gaaggagatg gctggtcttg 6541 ctgacacatg gacagctata tttcatagtg ccaaacttct tcttctttta agacaatccg 6601 atgcagggga agacgatgaa tcaacccttg ttcctccaac tgagtgagga cgcgagtgat 6661 tgtcacgcga gttgaattga gcatatcagc aatgtcttga tgagtgaggc gcaagtcaat 6721 gagatgtccg ctttttacct ctcgaccaaa tttttttgct aaccaaccta agagtttaat 6781 gagcatgatt tccactgttt tatagctacg aataaccatc aattcttccg cctgttgaat 6841 gtgggcaagc ataacatccg tgaatggaaa ccacccttct aaaggcaaga ttgttacctc 6901 tactttagtg agacactcaa tctgatatgg ttcaagttta gaaaacgctt gaccaacgat 6961 atctccaggc ccccaaatac ccaaagtcac aagcgtacca tcttcatgcc aagtgacaat 7021 ccgcacaact cctgtttcaa tcttcaacaa actattgtgc ttcaggggta agaatgagcg 7081 ggccttaaag ctaagtttgg tgttttgttc aggaggattt tgaaaacgtg atgaaataga 7141 cattgtcatc tgaatctctc ttttcgatag acaagccgta cttttttaca atatcagttt 7201 cttgaaggat attactatgg gcatagtcgt tgagaatgta actaagcagt atggttcttt 7261 tcttgctgtt gacaacgtca gtttagaaat tccaacaggt tctcttgtgg cgctgttagg 7321 accatcagga tctgggaaat caactttgtt gcggatgatt gcggggttag atacgcctaa 7381 ttcgggtcat atctggctac ttggtgaaga cgcgacatac aaatcggtgc aggaacgtca 7441 aataggcttt gtgtttcagc attacgcctt gtttaagcac ttgactgtac 
ggcagaatat 7501 tgcctttcca ctggaaatcc gcaagtttgc caagagcaag attgggccgc gagttgagga 7561 acttgtagat ttggttagat tgcagggatt tggcgatcgc tatccttccc aactctctgg 7621 aggtcaacgc caacgggtag cgctagcaag agcattagca gttcagccga gaacattgct 7681 gttggatgaa ccctttggag cgttagatgc taaagtccgt aaggaattaa gagcagggtt 7741 acgaaaactg catgaagagg ttggtataac aactgtcatt gtcacccacg atcaagaaga 7801 agcaatggaa gttgcagacc aaattgtggt gatgaataat ggtcgggtgg agcaagttgg 7861 tacctctggt gagatttatg atcacccagc aacacctttt gtcatgagct ttattggtcc 7921 tgtgaatgtt cttccaccta ctgctggtgt tttaccagag agggatttga atcctcgaac 7981 aaacgagcaa gtctttttgc gccctcatga cgttctgatt caaaccagcc cgactgagga 8041 tgctacaccc gccaaggtat tgcgcattct taacttagga tgggaaattc gggttgaatt 8101 ggttctcaag tctggagaaa cagtaaacgc tttcctcagt cgtgaccact ttagtaagct 8161 caaccttaag gaagagcagc gtgtctatgt gacatcgaag caggcaaaag tttttcctgc 8221 tcctgcttat gcagtaagat gagaagttga aatgtagcgg tttttgaatg tcatttatac 8281 caagtcgcga ctcaaaaacc tcacccccat cccctctcct taataaggag aggggtgccc 8341 gtcagggcgg ggtgaggtat acgtaatccc agtgtagctg acgctaaagc tggggtgggg 8401 tttttgaatg tattatattt gcaataccaa gtcgcggcgg gttatcttac tgcgtccata 8461 gtgaaaaaaa cctctataca aaactcaaaa ttataccttc ctcactagtt ccagttcttt 8521 gcaagcttaa tttgtgacac tacgccgaag aataaaaaag aaaggctggg gtaagttttt 8581 atagtccata attcctaaca ccgtattgtc atctactttc cgaaaagagt cattaaccgg 8641 aaaattatca taaatcatcg tggcgctgac ttttcctcga tattccgtca tgcggagtct 8701 tgcttgactt ttctctgttt tcagtaagga attagtcaga atcaccaaag gtttcagaaa 8761 atcatttttg aggattggaa acttcaaaac ccagtttgtc gctcccatat aactgggcgc 8821 aacttttata acttttccct ggctatctaa aaataacaaa gggtgaacat tttcaggatc 8881 aacaaattct tttccatacc aattaaattt ttctaataag ccatccatag ggtgattggt 8941 atgaaaccca gatccttgcc aacgaccgag cataaaagct aaatcaacgg gttcaagagc 9001 atcaaacaat tgtaaagctt tttcggttgt cgtttgacca gtttcaagaa tcgacttgta 9061 ggtttctaat gtttgcattt tcccgtgtgt cttgcttgtg tttgataagt gcttatcgga 9121 ataggttgag ctttttactt ttggttttca ccataatttt tttcaatagt cttttcctcc 
9181 ctgaaaaata gggatggtgt aaaccatccc tatttctatt tagtaaagta gggctatacc 9241 ctaccaaact ttagattaaa tactaggcat aaactgcgac ttttcaggta cgctagtgta 9301 ctcagcgaca atttgacgga attcttctcc atcaatagtt tctttttcaa tcagcaagtc 9361 tactaagcga tctattacga tgcgattttc ctgtataaac cgctttgctg tctggtaaca 9421 ttcttcgaca atgaggcgaa cttgtgcatc aacacgagat gcaattgact ctgaatattc 9481 cgaacgagtc atccagtcac gacccaagaa cacttctccc tgctggcttt ccaaggatat 9541 cggtcctaaa tcagacatcc caaaccttgt taccatttgt cgggccatac cgctgacttg 9601 ctgtaagtca ttcccagcac cagttgtgac ttccgcagaa ccaaaaatca cgtcttctgc 9661 tgcacgacca cctaaagcac ctgtaattct ggctttgagt tgaccgcgag tgattaaccc 9721 ttgctcctca ttaggagtaa accaggttaa cccctgtgct tgtccccttg gaattaaggt 9781 cactttttga actgggtcgt ggtctttgag taatgtacct gcgatcgcgt gtccaatttc 9841 gtggtaagct atcaaacgct tgctcttgct atcgaccaga ggtgtgcctt ccataccagc 9901 aacaacccga tccacagcat catcgatttc taaaatcgtg atcgcctcct tacgccgtct 9961 ggctgtgaga attgcagctt cattgagcaa gtttgctaaa tctgcacctg taaaaccagg 10021 ggtgcggcgg gcaatgactt ctaaagaaac gctggggtct agtttcttgt tacgagcgtg 10081 aacttttaaa acttccaaac gtcctttaac gtcgggtgca tcgacagtga cttgtctgtc 10141 aaatcgccct ggacgtaaca gtgctgcatc taaaacatca ggacggttgg tagcagcgat 10201 gataataata ccagtgttac cttcaaaacc gtccatttcg gtgagcagtt ggttgagggt 10261 ttgttcccgc tcatcgtttc caccaccgat acctgcacct ctttggcgtc cgacggcgtc 10321 gatttcatca ataaagatga tacagggggc gttatctttt gctttcttaa agaggtcgcg 10381 gacgcgggat gcacccacac ctacgaacat ttctacaaat tccgaaccag aaatgctgaa 10441 gaagggtacg cctgcttcac cagcgatcgc ctttgcaagc agtgttttac ctgttcctgg 10501 aggacctact aagagaactc ctttgggaat gcgtgcacct acagcagtga atttttctgg 10561 ttgtttgagg aaggtcacga cttcttgcag ttcttctttc gcttcttcaa tacctgcaac 10621 gtcgtcaaat ttcactccgg tttttgcctc catctggaag cgagcttttg attttccgaa 10681 gttcatggct tgacctggac caccaggcag gttgctagaa cgacggaaca aaaagaacag 10741 cccagtaatc aataaaatag gaaaaatgag attacccagc agtccccaaa ttgccccatc 10801 attccgcata ggatgagcgt caaaactcac tcctttttct ttgagcttgc 
taatcagttc 10861 cggagcgcta gaaggcagat ccacccgtac ccgttgtacg cggttgtcga gttctggatc 10921 gacggcttcc acaattgccg ttctcccgcc ttcatataaa tcaacgttgc ttacccgatt 10981 agcgtccaag tattctaaaa agcgaccata ggtcatgcgg gtagtggctg tattcttact 11041 catgtctgtc ggagtaccag aaaatgctcc ttgccagaag aaaaagccaa tcaccaatgc 11101 aggcaatgtc cagagtacta gaactctcca ggaaaatttc atcttaattt tgcctctaga 11161 tgcatataca acaaatcagc agcctacaga tagggctttt caaaccgatg tctataacta 11221 cgttttttat ccaatgttat gacacacgat aagttcgtcg ttagccttaa agctaacgag 11281 ccaaaacatt attaagaatc ttaattaaat ttaacttaat tttgcgtcgt tttactacgg 11341 gtgtacgctt attcatgttt caggttgccg aacacgcttc atactctcct ttacgtttgg 11401 gcaatgcttg tacgttacct aagttttgtg caccttgaag gaagaagcac ttcctttttt 11461 tgaacttggc ttgtaatgga aatttgcgtg ggagtagata cagccaccat gaggaagaag 11521 cgatcgccga gttgtcaaaa aaactagagc taaatttccc tccaagaaac gcatacacaa 11581 gggagcaggt gtcaaacttc aaacactcag cttttctata gtcaatactg cttagagctt 11641 aaccgaaccg catcggggga gcagggaggt ctcttgccgg ggagattgaa aaatttttta 11701 tgcgtgctag tgtacgcaac tcatgaagac tactcaattt tatcttgaat gcgttatttg 11761 ccgcaaacat ttttgtaatt agtataattt ttaataagtg ttactgaaaa gctagaattc 11821 cgagatttta tcaggtttaa caatatgaac ttctgcctat ttgtgattga tattcaaaac 11881 gggttcatcg ctccaaatac aagtcatgtt atacaacgcg tgaagtcttt attagagcaa 11941 aatctatttg agtacgttat cttcacaaga tttagaaata ccttggatag tccttatgta 12001 agatatctaa attggaacaa acttttttca gaaacagagc aaaaaattgt tgatgagctt 12061 gaaccatttg ctaaattggt ttttaataaa actatatata ctgcttgtaa tgaagaaact 12121 cttaactatc tcaaaaaaag agatattcat caagtcttta tttgtggaat tgatactgaa 12181 ggttgtgtct taaaaactgc tattgatttt tttgaaaaca atattaatcc ttatatttta 12241 gagtattact cagcatcaaa tggaggtgag aattttcacc aagcagctat tttagtttta 12301 agtcagctaa ttggaagaag caatattata actgaaccga tagataaatt tcatttcagt 12361 aaatatttat aacttgagtt tagcgctttt tcatcactct cgtctgagtt ttggaattta 12421 gctaatacag ccttcttcat aaactcatct ttgacttgcc tccagtctgg acgaagggga 12481 agggagcgat cgcgtagcgc gccctgtctg 
tgcccgaggc acgaagtgcc tcccgccagt 12541 cggagactgg ctcggtcttg agggcgatcg ccaccaatgt tcgctgcttc tgatcagatc 12601 aacagcaatc ccacccatcc agtaatgggg caattattgc tatttttgtt gctatatgcg 12661 tttttgacat tatgcccata ttagtccgat ggagcgaaaa tcaaatcagc cctaatgcca 12721 cgatgttcaa tcgttgttgg atagcggcta tcatctttgg actaaggatt gtttgttgtc 12781 tgcgaagctc cggtgacagt tctagttttg accaaaaatc atagcctgta aactttggca 12841 ttcatggtct ctacttaatt tgtgtaattc gtagggtata aggaaaatac ctatcttttt 12901 taaacgaacc aatccataca taatatgttc ctggtagcca ttctccaatc atgccaggat 12961 tcttgccatc aaaatcgtca ttgcaccagc tacctccggg acctctaata attaaggtgg 13021 tatcctcagg acttctcact tctattttta ggtagtcaaa tttacttgtt agctcaatag 13081 tatggtctgg ttcttcatca aaatatccag tacaagttcc tgtgggagtt tctcttctac 13141 gagcaatttc cctccccggt acagaaccgc cactcatacc acgaacaata aacggatctg 13201 gagaaaaatt ttgatcgata acaacatcct caaaaaaggg tggggatgtt tcctgcgcac 13261 gagatattgt attgattgct gacgcgatac gcggagcgtc aagcctctgg cttatcgcca 13321 gcgcagctat catcaaagat aatcttatgc ttcgagatat ctcttttgtc agcattccaa 13381 taacatactt ttttaagtat ttttagagac atgagtacta cactcgaagt tcccaaggcg 13441 aggtttttga ctagtttttt tactttttaa acatcaaata acttgttaat tgtcaattag 13501 cttagcttta gacaattcag attcacaagc agaggtgcag attaatgcac ccctactatt 13561 acagcagttt tcagttggta aagccaattg gggttggagt aaaatccatg taaaaatcca 13621 gctacttatg tctagaggac acgcttgttt gaacggaaaa accgcccatc aaattgatcg 13681 aagaaagaac tcagaaatca ccaactcagt agtcattggc tcttacacgt agaaattcaa 13741 acagtaaaca gttatcagtt accagtgagc cagcgcgaat gacggctctc cctcacttgg 13801 cgactggtga gacagcgcga atgacggctc tccc // LOCUS NODE_2459_length_13781_cov_5.14600013781 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 13781) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13781) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13781 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1414..1587 /locus_tag="DP116_20445" CDS 1414..1587 /locus_tag="DP116_20445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316101.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_20445" /translation="MTNKKWAVKRITVNLAAQEAEKLDRYCRQTGRPATDVIRELIRG LPLHSEEATEAST" gene 2142..3581 /gene="zds" /locus_tag="DP116_20450" CDS 2142..3581 /gene="zds" /locus_tag="DP116_20450" /EC_number="1.3.5.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867700.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="9,9'-di-cis-zeta-carotene desaturase" /protein_id="PRJNA477356:DP116_20450" /translation="MRVAIIGAGLAGLATAIDLSDAGCEVHIFESRPFVGGKVSSWVD GDANHIEMGLHVFFGNYYQLLELMKKVGALENLRPKEHSHTFINQGGKVGALDFRFVM GAPFNGLKAFFTSSQLSLLDKFQNAIALGTSPIVRGLVDFNGAMKNIRDLDKVSFADW FRSHGGNEGSIKRMWNPIAYALGFIDCEHISARCMLTVFQLFAVRTEASKLRMLEGSP YEYLHKPILEYLETRGTKIYTRRRVREIQFTEEENQTRVSGMVIANGDTEDLISADAY VAACDVPGIQRLLPQQWRQWSEFDNIYKLECVPVATVQLRFDGWVTELHDTQERKQLN HAAGIDNLLYTADADFSCFADLALTSPGDYYREGQGSLLQLVLTPGDPFIKESNEAIA QHVLKQVHELFPSSRELNMTWYSVVKLAQSLYREAPGVDIYRPQQKTPISNFFLAGSY TQQDYIDSMEGATISGRRAAKAILDNVKK" gene 3694..4146 /locus_tag="DP116_20455" CDS 3694..4146 /locus_tag="DP116_20455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316099.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20455" /translation="MADWLEHSVQVEVEAPIELVWSLWSDLEQMPRWMKWIDSVKVPE DNPEISIWKLDTRGLEFTWKSRIIKIVPNQIIQWESVDGLPNRGAVRFYDRHSSSIVK LTIAYAIPGIIGKIMDNLFLGRAVESTLKSDMEKFREYALQAKSNVSQ" gene 4989..5819 /locus_tag="DP116_20460" CDS 4989..5819 /locus_tag="DP116_20460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214964.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20460" /translation="MSSLDQPFASYHLWSLVAPAAWQQRWHCQMRRGFIKARQHEPQV KALLTKATASQRIGLLAQKGVYEFHHHIHLLNQSDGVEKVAQLLRLSNSTDEVKQRVL QILKKYYDKPLLLNKRIILLTRGDEGFPKPILISQQRYHFRLYAVMDCVFIESDSILH IIDFKTGKSAFDRRQALVYLLAARYLYPRYQAVASFYNLELCKKSEIISLSKDELDII ECELAEIARKHQQDLQKYQEKSSDFSKIFPPNPGSHCRFCPFHEICEFSTLKVLKSQN " gene 5950..8619 /locus_tag="DP116_20465" CDS 5950..8619 /locus_tag="DP116_20465" /inference="COORDINATES: protein motif:HMM:PF12708.5,HMM:PF14252.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20465" /translation="MTSADASLQIPSQTSNYTTYDVVPSLETNHELDARNTQQGASYT NIANSFIEAVNTSNLNSPSTKLFVENFRGISGMLCVNEFPQSDKVLNDSEYFDNLVGR SDNDFGVAGDLSKGNDQLLGHNAQENPLQKAAQESLVFIDPHLEDYQSLVAGISPEAK VVVLDPSQNGIEQITRELSNYNHTVSKVEILSHGASGRLQLGQTSLDSATLDRYSQQL QGWADALTDNADILIDGCNVAQGEQGSRFVTELSELTGADIAASTDLTGNAAQGGDWE LEYKVGQIESVSSLQPQTQRTYHATLGERINFPEGFMKSVEDYGAKPDDGIDDTVAIQ KALDDGRRDANGNSIYDDYSGRPKALYFKAGTYDVSNTLNWIGSAVTLQGQGSGATVI RLKDNTAGFNNSTAPKAVIQTPGGNSSFRQNIWNLSVDTGKGNAGAIGIDYIANNVGS MRDVTIKSEDGKGVAGLAMDRAWPGPCLIKNVQIEGFDYGITLSYSEYGPTFENITLK NQGIAGIRNENGALTIRGLNSTNSVPVVKGTSWAGMVTLLDANLQGGAPNVSAIDTAG EVYVRNVTTTGYQSVIKYNGNIVPGTSHTEYATNVYQLFDGSKQSLNLPVKETPEFQD NNPANWGRITLDPVGFTDTSKLQSVLDSGKSTIYFDFGKYFSFNETVLTVPATVKRII GFSSVVDGESHGQNGGGIKFVVQGNSTDSPLIVEGFGYGAKVDHNNSSRSVALKDGFY QYTSSPGAGELFLEDVNIQPFKVQQNQNVWARQLNNEYGGGTKIENDGGQLWILGLKT EGTGTVIESKNGAKTELLGGVIDPARSFSAEEKQRPAFVVNNSKASFIYRQIAYDPNY NYDIQVEERRNGETRRKLTSQLPQPVALFTAFQ" gene complement(8705..10390) /locus_tag="DP116_20470" CDS complement(8705..10390) /locus_tag="DP116_20470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017804460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transglutaminase" /protein_id="PRJNA477356:DP116_20470" /translation="MTFTLKRFPSTSQMLGKRTIRPLGAAALCGIAFIQDTLIAIDTV KGHLLQIDPHSDNTQILNSHQVKDFTDVTGLSVWQDTLWLTRGNHVYLCNLGSLGLEH FITLPYTADGIAVWESTIYVSCQKLGAILIFDRDTRKQITKFYAPGVGVENLAVDEEM LWVSDTVEQTVYCIDRATGEVKFSVLTPFDSPTGIAIHKNSETGKKTLYVAYVSEEPY VRDNPNADPSHELSYRDRTFIHPLNYSYNPDKRYTLSNGYLIEMSYVEEIAPLEEVYL PEVEWRIALPSETPRQKIKHIEPIGLPFTEEVVEGQRVAVFKFDALTPGERHIFGWKA VLEVRGIKYRITPSDVENIPEVSLELKSRYLVDDDDLAMDTSIVRRAAREAIGSETNV LRKMYSIRNYVYDELSYGIKPHIDTPDVVLERGVGSCGEYVGVLLALCRLNGIPCRTV GRYKCPPYAEYQQVPLQPQYNHVWLEFYIPGFGWLPMESNPDDIGSGGPYPTRFFMGL CWYHIEIGKGISFETLMSQGARLTKEDLSIGELAINHIRFTILEELPPLVDGV" gene 10695..13568 /locus_tag="DP116_20475" CDS 10695..13568 /locus_tag="DP116_20475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="magnesium-transporting ATPase" /protein_id="PRJNA477356:DP116_20475" /translation="MSAHSLPEKAAAVWHSLEVDKSLELLKTNADSGLTPQEVQQRLQ EYGPNELEETAGRSAWEILVDQFKNIMLLMLIAVAIISGVIDLLDWREKLAKGEMPFK DTIAILAIVILNGILGYVQESRAEKALAALKKLSSPNVRVIRDGKPVEIAGKDLVPGD VMLVEAGVQVAADGRLIEQSNLQIRESALTGEAEAVNKRAELVLPEETSLGDRLNVVY QGTEVVQGRGKILVTNTGMQTELGKIASMLQAVESEPTPLQQRMTQLGNVLVTGSLIL VAIVIGGGVIQNMIQGKGFSNLKELVEVSLSMAVAVVPEGLPAVITVTLALGTQRMVR RNALIRKLPAVETLGSVTTICSDKTGTLTQNKMVVQSAYTNNKTFRVTGEGYTPKGEF QLDGQTISINQSPEIQTLLVACALCNDSFLQQEQGEWKILGDPTEGALVTLAGKGGIE KDQWDSKLPRVSEFPFSSERKRMSVICRVEGVNDGVSSLTSPDPVISNLLQSEKYLMF TKGSPELILARCTNFYAGTGSTPLNDQQRRDILAENDQMASKGLRVLGFAYKPLTEIP PEGSDETSEQGLVWLGLIGMLDAPRPEVRAAVQECRDAGIRPVMITGDHQLTARAIAT DLGIAQEGDRVITGQELQRMSDQDLEQNVDLVSIYARVAPEHKLRIVQALQRRGRFVA MTGDGVNDAPALKQADIGIAMGITGTDVSKEASDMVLLDDNFATIVTATKEGRVVYTN IRRFIKYILGSNIGEVITIAAAPVLGLGGVPLTPLQILWMNLVTDGLPALALAVEPPE PDVMKRPPFSPRESIFARGLGSYMIRIGIIFAIISIILMEWAYHHVQAVTGPGLDPER WKTMVFTTLCVAQMGHAIAIRSNTRLTIEMNPFSNPFVLGAVVVTTILQLMLIYVPPL 
RDFFGTHWLSPTELGICFGFSALMFVWIELEKLVLRLMGKKPV" BASE COUNT 4064 a 2838 c 3107 g 3772 t ORIGIN 1 tctattttag agaagcaatg aaagacaaaa tttaaacttt taaaatagct ttagaattgt 61 ttcatctaga aatataaaaa acattttagg ataaatgtat tgaaaaaaac gttttattgg 121 attttttaaa aaatgaaaaa gttcaaataa tttattttgt tgtttgcaca ctgagtcatc 181 aatcaatttt tatgattact tgaggttata taaaattaaa ataaccaagg taaatagaaa 241 tgtagtgaat tgtcaaattc tcagctatca gttattaatt ctatctcttg gctaaggttg 301 attccacacc ccattcgcta aatgaaaaat gatacattta ctcataaaaa tgttttgtta 361 cgagcgaatt ttaaactagt gaatttgagc gtgttcatct ggctagtcgt gatttcaagc 421 tattgcgctt tcaaagctat tcagataacc ccagataaat actgatggat ataaacgaac 481 tgcacgccct cactaagctg gaactgcaag caggatcaga aagaataacc cgaagattaa 541 tctaactatt acagaatgtt aatgccaaaa tctggaacaa aacagaagtt ctattgtcag 601 tgaaagttaa gtggtattgt atcaatcatg aattcattta tggaggattg cttctagtgg 661 ttaccaagaa cgtttagatt tctatactga gtcaagtact tataacttct ttccttaaat 721 aagaagacta gctttcaaag gttatttctt atgaataagg gtggtgaatc gacacaagac 781 agataactct tcatttcatt tcaaatatga aatagtgtca aactggaggc gattaagagg 841 acaatgcaca agcctttaat tcaaaattat caattttgtg agtatcgctt ggatgagacc 901 agtcagacag acgtttttaa aacaagttta cttttggaaa cagtgacttg ttgatgcaaa 961 ctttaaagtc gttgagcctc acctacagat actatcgaaa cgtacagaaa aggtagctat 1021 caagcatgat gagttgagac ataaagtctt tgatgaacca gggttagaga ccaagtgaaa 1081 tgcttttgta accacaaagc tgaaaattgg tgatttggtt gacagaagaa tgttcaatca 1141 aatatataaa atttatagtc aagctttgtt catcttgttg ctaagttcat ttttctagca 1201 acgagtgtca agcgtttttc ctttattagg aggaataaaa tgagttcact ttttatattc 1261 gagaaaaaaa gaaagcatac tttgacaaac aaaaaatttt tttcgtctca accaaatgac 1321 ttgtttcctg acaaaaactc tgataatctt atcgtttcca tccctagtta atttcctttg 1381 atgatgttaa taaccagcag acagtagaac aaaatgacaa ataagaaatg ggctgtaaaa 1441 cgtatcactg tgaatctagc cgcacaggaa gcagaaaaac ttgacaggta ttgtcggcaa 1501 acgggtaggc ctgcaacgga tgttatccga gagctgatcc gtggactgcc actccactca 1561 gaggaggcta cagaagcaag cacgtaaatg ggagtggtta atagttagta aaacaactca 1621 
cctcggaaga atacagacat ctgtcggaag aattcaattt ggggattgtg tgcgattagt 1681 tgatgaatat tagtctgagg tccctgattt caaacgcgct tgaaaaaaga aaaatttttc 1741 attttgaaaa ccttgacttc aggcatgggg tattttgact caagcaattg tgacttctga 1801 ctccttatca ccgggagaat gactgcagaa atggggcgtc gcatttgggt caaaaaattc 1861 gttcagccct aataacaact cgtcggatcg tgttcggtta ctggttaacg aaaaagagct 1921 ggacgaattt tccaggcgtt cccccgatac gccggcccag ttgtgtttaa agcaccgtgc 1981 ccgaaaagct tagtccctcg atgaatcgat gggttattag ggcttgcccc actcgtttgt 2041 cctaatgcgt cacaactgac aacccaccac ttgaggtgag tcgcaaattc cgacaccaac 2101 ccagttacaa tttgtaatga aacagaaaag gtaaagacgg aatgcgcgtt gcgatcatag 2161 gagcgggact agctggacta gcaactgcta tcgatttgtc tgatgctggc tgtgaagtcc 2221 acatttttga gtctcgtccg tttgttggag ggaaagtaag cagttgggtt gatggtgatg 2281 ccaaccatat tgaaatgggg ttgcatgtct tttttggcaa ctattaccaa cttttggaat 2341 tgatgaagaa agtgggggcg ttggaaaacc tgcgcccaaa ggagcatagc cacaccttca 2401 tcaatcaagg gggaaaagtt ggtgctttag attttcgttt cgtcatgggt gcacctttta 2461 atggattaaa agcgtttttt accagttccc aactttcgtt actggataag ttccaaaatg 2521 cgatcgcact aggaaccagt ccgatagtac gcggtttggt tgactttaac ggtgcaatga 2581 aaaatatccg cgacttggat aaagttagct ttgccgactg gtttcgtagt cacggcggta 2641 atgagggtag tattaagcgg atgtggaacc ccatcgccta cgccctcggc tttatcgatt 2701 gcgaacatat ttctgcccgt tgtatgttga cagtctttca gctatttgca gtcagaaccg 2761 aagcgtcgaa actacgaatg ctggaaggtt ccccctatga gtatttacac aagccgatct 2821 tggaatatct ggaaactaga gggacaaaga tttacactcg taggagagta cgagaaatac 2881 agtttactga ggaggaaaac caaacccgtg ttagtggtat ggtcatagct aatggtgata 2941 cagaagatct gatcagtgct gatgcttatg tcgctgcctg tgatgttcca ggaattcagc 3001 gtttgttacc tcaacagtgg cgccagtggt ctgaatttga caatatatac aagttggaat 3061 gtgtgccagt tgcaacagtt caattgcggt ttgatggctg ggtgacggaa ctccatgata 3121 ctcaagagcg caaacagctt aatcatgcag caggaattga taatttgctt tacactgccg 3181 atgctgactt ttcttgtttt gctgatttgg cattgactag ccctggtgat tattacaggg 3241 aagggcaagg ttcactattg cagctcgtgc tgacacctgg tgatccgttt attaaggaaa 3301 gtaatgaggc 
gatcgcccag catgtcctca agcaagtgca tgagttgttc ccctcctcgc 3361 gagaactaaa catgacttgg tacagtgtag tgaaacttgc tcagtctttg tacagagaag 3421 caccaggagt ggatatttat cgtcctcagc aaaagacgcc tatttctaat ttcttcctag 3481 caggtagtta tacgcagcag gactatatcg atagtatgga aggtgcaaca atttcaggaa 3541 ggcgtgctgc aaaggcaatt ttggacaatg tgaagaaatg aaccgcctta gcaaagcgtg 3601 tccgcaggac ataggcgctt tagacgcgtt cgcgtagcgt gcgctttgcg ctcagcggcg 3661 tcccaaaggc tagaacgcca aggaataaca aatatggcag attggttgga acatagtgtg 3721 caggttgagg tagaggctcc catagagtta gtctggagct tatggtctga tttggaacag 3781 atgccccgtt ggatgaaatg gattgattca gtgaaggttc cggaagataa tccagaaata 3841 tcgatatgga aactcgatac taggggcttg gagtttactt ggaaatcccg cattattaaa 3901 attgtcccta accaaatcat ccagtgggag tcagtcgatg gtttgccaaa tcggggggca 3961 gtgcgttttt atgaccgcca cagtagtagt attgttaaac tgactattgc ctatgctatc 4021 cctggtatta tcgggaagat tatggataat ttgtttttgg gacgggcggt tgaatcaact 4081 ctcaagagtg atatggaaaa gtttcgagaa tacgcccttc aggcgaaatc gaatgtatcc 4141 cagtaggtag tgcttacgta gcttctcaaa aaatctaact tcctaggcac ccacggctat 4201 aagtcgaggg gctttctgct ttaaattttg tcaaaataag gcaatgctag gtgtcgcaag 4261 ttctcatctc taacagcacg ctttgcaaaa cttcatacaa gtatgtcaga ggtagagtaa 4321 agcgaattat ttctatatct agcaaggtat agataaatga caagttcttt tgtaagaagt 4381 caacaattag gcagaaattg agctaccaga gttgctaaat gtaaaaggtg catccgatac 4441 accttcaata tttgccacaa tagtataagt catttggtgc agtgagtgaa ccagcacagt 4501 gtcggatatg ttttatccca caacatagtg ttggattata ggttttgcgg tgaacatcat 4561 cgccctaact gaatttaagt aatgggcgtt gttgatgata actggtgtgg ggcaatcttg 4621 atggcgtgat ttttacctct aactcaactc aagctttgac ttgggttgct gtgcaagcaa 4681 atgcaagagc gagaaaacaa gtatcgtagg caacaagaac tcatgagtca ttgttttaga 4741 ctgtttcaag ataacttgcg atcggcgatc gctcttgtca cgttttgcac ttagaatgaa 4801 agtaaactta acaggaatta tcttcaactg tttgtgatca ctaaacttta tttgactgga 4861 atgactacta acagccactg tagccagttt atctcgtaca ttctgaatac gacaaccaag 4921 agacttaccg ctaaccgaat gtagtcacaa ctgtcaccca ccgtattgtc tctcaccaat 4981 tctgagcaat gtcatctctt 
gatcagccct ttgccagtta tcacctttgg tctttagttg 5041 ccccagccgc atggcaacaa cgttggcatt gccagatgag aaggggtttc attaaagccc 5101 gacaacacga accacaagtc aaagcgcttt tgacaaaagc gacagcatcg cagcgcattg 5161 gtttactggc acaaaaaggc gtttatgaat ttcatcacca catccatttg ttgaaccaat 5221 ctgatggtgt cgaaaaagtt gcacagcttt tgagattaag caactcaact gatgaagtta 5281 agcaacgtgt gttacaaatt ttgaaaaaat attacgataa acctttactt ttaaacaaac 5341 gtatcatttt attaactcgc ggcgatgagg gttttcccaa accgattttg atctcacaac 5401 aacgttatca ctttcgtttg tatgcagtga tggactgtgt atttattgag tctgatagca 5461 ttttgcatat catagatttt aaaactggta aatctgcttt tgaccgacga caggcattag 5521 tttatttgct tgctgctcgt tatctttacc ctaggtatca ggcggtagct tcattttata 5581 atttggaact ttgcaaaaaa tctgagataa ttagcctttc taaagatgaa ttagacatca 5641 tagaatgcga attagctgag attgccagaa aacatcagca agatttacaa aaatatcaag 5701 aaaagagcag tgacttcagt aaaatttttc caccaaatcc tggttctcat tgccgctttt 5761 gtccgtttca cgaaatctgt gaattttcta ctttaaaagt attaaaatct caaaattaga 5821 gatggtaatt ttaacagcac ataaagaatt tgtaaacaaa aataagagtt ttaccgaatg 5881 ttgacgttgg ttcaggttta agtgtagtat atgataccat aaacatcata ataacaggaa 5941 ggggactcca tgacttctgc tgacgcgtca ctacaaattc cttcccaaac ctcaaactat 6001 acaacctatg atgttgtacc tagtttggaa acaaaccatg aattagacgc acgcaatact 6061 cagcagggag catcatatac caacattgcc aatagcttca tagaagcagt caacacgagc 6121 aacctgaatt cgccgtctac caagctattt gttgaaaatt ttaggggtat aagtgggatg 6181 ctctgcgtga atgagtttcc gcagagtgat aaagtgctca atgactccga atacttcgat 6241 aacttggttg gcaggagtga taatgacttt ggagttgcgg gcgacttgag caagggcaat 6301 gatcaattac tcggtcataa cgctcaagaa aatccactgc aaaaggcggc tcaagagagt 6361 ttagttttta ttgaccctca tttagaagat tatcaaagtc tagttgctgg tatctcacca 6421 gaagcaaagg tggttgtact ggatccctca caaaatggta tcgagcaaat caccagagaa 6481 ttgtccaact ataatcacac cgtttccaaa gtagaaattc tttctcatgg cgcatcaggg 6541 cgtctgcaat taggacaaac ctctctagac tctgctaccc ttgaccgata cagccagcaa 6601 ttacaaggtt gggcagatgc ccttactgat aatgctgata ttctcataga cggctgcaac 6661 gttgctcaag gcgaacaagg tagccgtttt 
gtcacagaac tgagcgaact cacaggagct 6721 gatattgctg cttccactga cctgactggc aatgccgctc aaggaggcga ttgggagtta 6781 gagtataaag tcggtcaaat agaatcagtg tcatcgttgc aaccacaaac gcagcgaacg 6841 taccatgcaa ctctgggaga gcgaattaat tttccagaag gtttcatgaa gagcgtcgaa 6901 gactacggtg caaaacctga cgatggcatc gacgatactg tcgctatcca aaaagcgctt 6961 gatgatggac ggcgcgatgc taacggcaac tctatatacg atgattactc cggtcgtccc 7021 aaagcacttt acttcaaagc tggtacttac gacgtgagta atacacttaa ctggattggt 7081 agcgctgtca ctctacaagg tcaaggcagt ggagctactg ttatccgact caaagacaat 7141 acggctggct ttaataattc tactgctccc aaagctgtca ttcaaacccc aggtggcaac 7201 agctctttcc gtcagaacat ctggaacctt agtgttgata ctggaaaggg taatgctggt 7261 gcgatcggca tcgactacat tgctaacaac gtaggttcaa tgcgtgatgt gacgattaaa 7321 tccgaagacg gtaagggtgt tgcaggtctt gcaatggatc gcgcatggcc aggtccgtgt 7381 ttgatcaaga atgtccaaat tgaaggcttt gactatggca ttacgctttc ttatagcgag 7441 tatggtccca cttttgagaa tattacctta aaaaatcaag ggatagctgg tattaggaat 7501 gagaatggtg cactaacaat tcgtggatta aatagtacta acagtgtacc agtagttaag 7561 ggaacgagct gggctggtat ggtcacctta ttagacgcaa acctccaagg aggtgcacct 7621 aatgttagtg caatagatac agcaggtgaa gtatacgtac gcaatgtcac tactactggc 7681 taccaatcag tcattaagta caacggtaat atagttcctg gcacatcgca tacagagtat 7741 gccacgaatg tctaccagtt gtttgacggt tccaagcagt ctctcaactt gcctgttaag 7801 gaaactccag agtttcagga caacaatccg gcgaactggg gacgcattac actagaccct 7861 gtaggtttca ccgatacaag caagttgcag tcggtactcg actccggcaa gtctacaatc 7921 tacttcgact tcggtaaata cttctccttc aacgaaaccg tgctcacggt tcctgcaacg 7981 gttaaacgta tcattgggtt ttcatcagta gtcgatggag aatcccatgg tcaaaatggc 8041 ggtggcatta agtttgtcgt tcaaggaaac agtacagatt ctccgctcat tgttgagggg 8101 tttggctatg gagcgaaggt agatcacaac aattcctcac gatcagtagc gctcaaagat 8161 ggattctacc aatatacttc cagccctggt gcaggcgaat tgtttcttga ggatgtcaat 8221 attcaacctt tcaaagttca acagaatcag aatgtttggg cacgacaact caacaacgaa 8281 tacggtggtg gcactaagat agagaatgat ggtggtcagc tttggattct agggcttaag 8341 accgaaggca ccggcaccgt tattgagtcg aaaaatggag 
ccaagactga gttgctagga 8401 ggggtgatcg acccagcgcg gtcgttttcc gctgaagaaa agcaaagacc agcgttcgtt 8461 gtcaataact caaaggcttc gtttatctat cggcaaatcg cctacgatcc taactacaac 8521 tatgacattc aagtcgaaga gagacgtaat ggagaaacgc gtcggaagtt aactagccaa 8581 ctcccgcagc cagtggcact atttacagcg tttcaataag aacccttaat caaaggtgta 8641 gggtgtaggt gagaaagtca agagtcaaaa gaatcttcta cttacttttt tacaccccta 8701 ttccttacac cccatctacc aaaggcggca attcctccaa aattgtaaac cgaatgtgat 8761 taatcgctaa ctcaccaatt gaaagatctt ctttggtgag cctagcgccc tgactcatca 8821 aggtttcaaa ggagattcct ttgccaattt caatgtgata ccagcataag cccataaaaa 8881 accttgtagg atagggtcca ccgctgccga tatcatcagg gttggattcc atcggtaacc 8941 aaccgaagcc aggtatgtaa aattccagcc aaacgtgatt atactgaggt tgcagaggaa 9001 cttgctggta ttcagcgtag ggaggacatt tgtagcgacc tactgtacga cagggaatgc 9061 catttaaacg gcataaagca agtaaaacgc ccacatactc gccgcaggaa ccaactcctc 9121 gttctaaaac gacatctggt gtgtcaatat ggggtttaat accataagac aactcatcgt 9181 agacgtagtt gcggatactg tacattttcc gcagtacatt agtttcactt ccaattgctt 9241 cacgggcagc gcggcgaaca atgctggtat ccattgccaa atcatcatca tccactagat 9301 agcgggattt tagttctagc gatacttcag gtatattctc tacatcgctg ggcgtaatgc 9361 gatacttgat tccccggact tccaaaactg ctttccagcc aaaaatatgc cgttcgcctg 9421 gagtgagggc atcaaatttg aagacagcga cacgttgccc ttctacgact tcttctgtga 9481 aaggtagacc aatcggttca atgtgtttga tcttttgacg cggagtttcc gacggcaaag 9541 ctatacgcca ttctacctct ggtaaatata cctcctctaa tggcgctatc tcttccacat 9601 aagacatttc tataagatag ccattggaga gggtataacg tttatcggga ttgtaagagt 9661 aattcagcgg gtgaatgaaa gtgcgatcgc ggtaagataa ctcatgactc gggtcagcat 9721 taggattatc ccggacatat ggttcttcag agacataggc aacgtagaga gtttttttgc 9781 ccgtctcact atttttatgt attgctatcc cggtgggaga atcaaacggt gtcagtacac 9841 tgaatttaac ttccccagta gccctgtcga tacagtaaac agtttgttcc actgtgtcgc 9901 tgacccaaag catttcttca tccactgcca aattttctac cccaacccca ggggcataaa 9961 atttagtaat ttgttttcgc gtatcccgat caaaaatcag aatagctccc agcttttggc 10021 aactaacgta gatagttgat tcccaaacag caatgccatc agctgtataa 
ggcaaggtga 10081 taaaatgttc caaacccaac gaacctaggt tgcacaaata aacatgattg cctctcgtca 10141 gccaaagagt atcttgccag actgaaagac cagtcacatc cgtaaaatct ttgacttgat 10201 gggaattgag aatttgggtg ttgtcagagt gggggtcaat ttgcaacaaa tgccctttga 10261 cagtatcaat ggcgatgagt gtatcttgaa tgaaggcaat gccacaaagg gcggcagcac 10321 caaggggtcg aattgtcctt ttcccaagca tttggctggt agatggaaaa cgcttaagtg 10381 taaaagtcat attaaactgg tttaatcaag ggtttagcac acttagaatc gaagggaaaa 10441 tgtcttgatg actgagttct ctacttcttg tactcataat tatcaatatt attttgtgtc 10501 atggaattct ccctcaggag taagcacacc catattatcc tgaacattat ccggaagaaa 10561 aaatgaggaa gtgtcaagaa ttctacttgg gtgtacaatt tggtgctcaa cctgatgtaa 10621 tcttgtgttg atggttttcc agccattggt aaattatgat cgagttttgt aaccattccc 10681 tgtgacctat cacgatgtct gctcactctc tacctgaaaa agctgccgcc gtttggcata 10741 gtttggaagt tgataaatca ttggaactgc ttaagacaaa cgcagacagt ggcttaacac 10801 cccaagaagt acaacagcgg ttgcaagaat atggtccaaa cgaactagaa gaaactgcag 10861 gacgcagtgc ttgggaaatt ttggtggatc agttcaaaaa cattatgttg ttgatgctga 10921 ttgccgtagc cataatttct ggagttatag acctgctaga ttggcgagag aagctggcaa 10981 aaggtgagat gccattcaaa gacacaatcg ccattttagc gattgttatc ctcaacggca 11041 tactcggtta tgtccaagaa agccgtgcag aaaaagcctt agcagccctg aaaaaacttt 11101 cctctcccaa cgtgcgagtt attcgcgacg gcaaaccagt ggaaatcgct ggcaaggatt 11161 tggtaccagg agatgtgatg ctcgtggaag caggggtaca ggttgctgca gacggacgcc 11221 tcatagaaca atctaacctg caaatacgag aatcggcact cactggtgaa gccgaagcgg 11281 ttaataaacg ggcagaactt gtcttgcctg aagaaacatc attgggcgat cgccttaatg 11341 tagtctacca aggcactgaa gttgtccaag gacgcggcaa gattctggta accaacaccg 11401 gaatgcaaac agaacttggc aaaatagctt ccatgttgca ggcggtggaa agtgaaccaa 11461 ctccattgca acagcggatg acccaacttg gtaatgtcct cgtcacgggt tctttgattt 11521 tggtggcgat cgtcattggc ggcggtgtca tccaaaacat gatccaagga aaaggtttta 11581 gcaaccttaa agaacttgtg gaagtttctt tgagtatggc ggtcgctgtc gtgccagaag 11641 gtttacccgc agttattacc gttaccttgg cactgggaac ccagcggatg gtacgtcgca 11701 atgctttgat tcgcaaactt ccagcagtgg 
aaactctagg ttctgtcacc actatttgtt 11761 ctgataaaac tggcacctta actcagaaca agatggtcgt gcaatcagct tacacgaaca 11821 acaagacttt tcgtgtgact ggagaaggtt acacccccaa aggggagttt cagttagatg 11881 gtcaaacaat ttccataaac caatctccag aaatccaaac tttattagtt gcttgtgcct 11941 tgtgtaatga ttcgttttta caacaggaac aaggagaatg gaaaattttg ggcgacccca 12001 cagagggcgc tttggtaacc ctggcgggga aaggagggat agaaaaagac cagtgggaca 12061 gcaagctgcc tcgtgttagc gagttcccct tctcctcaga acgcaaacgc atgagcgtga 12121 tttgtcgggt tgaaggagtg aatgatggtg tatcatcctt aacatcacca gaccctgtga 12181 tcagcaactt gctgcaatct gaaaagtatt tgatgtttac aaaaggatca ccagagttaa 12241 ttttagcacg ttgtaccaac ttttatgcag gcactggctc aacaccttta aatgatcagc 12301 aacgccgcga cattttagca gaaaatgacc aaatggcgag taaaggtttg cgagtgctgg 12361 gttttgctta caaacccctc accgaaattc cgccggaagg gtcagatgag acatccgagc 12421 aaggcttagt ttggctggga ttgatcggaa tgctcgatgc accacgccca gaggtacgag 12481 cagcagtcca agaatgtcgg gatgcgggta ttcgaccagt gatgattact ggagaccatc 12541 aactgacagc acgagcgatc gccacagatt tgggaattgc acaagagggc gatcgcgtca 12601 ttacaggtca agagttgcaa cggatgagcg accaagatct ggagcaaaac gttgacttag 12661 taagcattta cgctcgtgtc gcccccgaac ataaactacg aattgtgcaa gcgctgcaac 12721 gtcgaggcag atttgtcgcc atgacaggtg acggtgttaa cgatgctcca gccctcaaac 12781 aagctgacat cggtattgca atgggtatca ctggtaccga cgtgagcaag gaagccagtg 12841 atatggtgtt acttgatgac aactttgcta ccattgtcac ggccacaaaa gaaggtagag 12901 ttgtttacac caacattcgc cgctttatca aatacattct tggaagtaat attggtgagg 12961 tcatcacgat tgcagcagca ccagttctcg gcctgggagg cgttcccctg acgccgctgc 13021 aaattctttg gatgaacctc gttacggacg gtttaccagc actagcatta gcagtagaac 13081 ctcctgaacc tgacgtgatg aagcgtccgc ccttcagtcc tcgtgaaagt atttttgcta 13141 gggggttggg ttcttacatg attcggattg gcattatctt tgccatcatc tcgattattc 13201 tgatggagtg ggcatatcat catgttcaag cagtcacagg tccaggacta gatccagaac 13261 gatggaagac aatggtattc actaccctat gtgttgccca aatgggtcat gctattgcaa 13321 ttcgctcaaa tacccgactg actatagaaa tgaatccctt ctcaaatccc tttgtactag 13381 gggctgttgt 
tgtcacaaca attttacaat taatgctcat ttacgtacca cccctgcgcg 13441 atttctttgg tactcactgg ctgagtccaa ctgagttggg tatttgcttt ggtttcagtg 13501 ctttgatgtt tgtgtggatt gaacttgaaa aacttgtctt gcggttgatg ggtaaaaaac 13561 ctgtttaaaa ggcgaatttt cacctaaaga cggtgggaaa gcagcaccca tcgttttggt 13621 gattagattt cttgcaaaaa tctgatttac agcgattttc atgtaaatag aatacacctt 13681 gttgggtttc acttcgttct aacgccactt gcaacaagag cggtaaagcc gacggcagat 13741 gcttcaagcc gggaaacccg tccaacgcac tgcctcccca a // LOCUS NODE_2463_length_13777_cov_5.044163 13777 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13777) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13777) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13777 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(103..2811) /locus_tag="DP116_20480" CDS complement(103..2811) /locus_tag="DP116_20480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860424.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphohydrolase" /protein_id="PRJNA477356:DP116_20480" /translation="MKTQQFFQSLTQKLTFWRRQYKVRHCHIASSKTSRKDGYPTARV SRSINKSLKDKNGLDAVGSPWVKVRRSCVVFVIAVVSITGVMGQKLYNQPQLQVGTVA PQTIRAPATAKIENKKKTAEKRKAATNSSLPVLMVDERINAEIEQNLQQILQEGNEIR ATVGSFPFFDTLVLSVSTQRYLRSCSNWEWQALLLAVESTDKQKSRMFVQKRGSQRGH KDARTQRNEDAPKVSAFASPYHTPSTSKEASGVNVLASSGTSQQPPVSGSYAPKTEDL KSNDLFQNIDFAQAVAQLQAYRLISSKQNLSSVIAQITKIRQGYAQAKTKLTDLVIAK PETVYDEASILNLSDEDWTKIKIEIQQSLERILTQGISPGLPQNILQNTVSIQVQTFV PKDAEPLASKLLLAVLKPNLKKDELQTQRISAKIAAQVEPVIVTVYKGEVIVRQQQKV SAWNFDVLEHYHLIEREIHWLGLIQLGSVVTASVAIFAMVEGRIKHSIRQSDRLLVLL LSLSVPLVQMMGTPYTTWSAVGLLLGSFYGPTLGVTVVGLLSLLLPMSLEISLIALVA GVSGGVLGSCVAQRLRSREELALLSLAIALTEGAVYLIIKLLISAVLGVPVHYLVLQN AGLFALSGLVWSIVGMGLSPYLEKLFDLVTPIRLAELANPNRLLLKRLATETPGTFQH TLFVATLAEAAAKQLGCNVELVRAGTLYHDIGKIHDPMAFIENQMGGPNKHDTEIKDP WMSAYIIKKHVSEGLAIARKYSLPSSVQAFIPEHQGTMQIAYFYHQAKQMALLDPSLK VMEADFRYDGPIPQSRETAIVMLADSCEAALRSLTDATREQALAMLNNILRARWQDNQ MVDSGLTREEMTQIAEIFVQVWQQFHHKRIAYPKLNAAKEGTGNRQ" gene complement(3030..3971) /locus_tag="DP116_20485" CDS complement(3030..3971) /locus_tag="DP116_20485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871845.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ADP-ribosylglycohydrolase family protein" /protein_id="PRJNA477356:DP116_20485" /translation="MRYSLTSRFKGTILGAIIGEQIASSRSKQLQGAGTLKIPSLHWS DMAILCAESLISKGRLDTQDWQKPQQQEFTKLKNSYEAILATLPVALFFHDNTVKLRQ NLLLATEIWQDDPVVRDGTLAVGYAIAKSLTEKLSPQALISQTISFVGETQTNLPQQL LEVHNLLEYGAGLETVQAELGKEQKLSNIIALAFYCFLSTIEDFRLSVLRAIQDNYRS CAIGAMTGALSGAYNSAVGIPVTWQVMLVQQSSAQDWKRTGSSKMVELADAIVAVWSG VYNLASHPDKVREESGTITPPLLSLQVTAAPRVIRSR" gene complement(4014..4472) /gene="aroQ" /locus_tag="DP116_20490" CDS complement(4014..4472) /gene="aroQ" /locus_tag="DP116_20490" /EC_number="4.2.1.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410889.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II 3-dehydroquinate dehydratase" /protein_id="PRJNA477356:DP116_20490" /translation="MLDLTERPISILVLHGPNLNLLGQREPGIYGSLTLAEINRSLEQ EAEKLQAKVSHLQSNYEGALVDAIHEALGKHQGILINAGAYTHTSVALRDAIAGVNLP TVEVHLSNIYRREDFRHHSYIAPVVIGQISGFGAQSYLLGLQALVHYIRK" gene complement(5111..7759) /gene="topA" /locus_tag="DP116_20495" /pseudo CDS complement(5111..7759) /gene="topA" /locus_tag="DP116_20495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876994.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type I DNA topoisomerase" assembly_gap 5170..5179 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 8080..8289 /locus_tag="DP116_20500" CDS 8080..8289 /locus_tag="DP116_20500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877011.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20500" /translation="MKSNLTASGNREQGTGNREQGTGNIPVIIYSASRSFGRSGFDKD TLFICPAGTFANARFVCFALTYYVS" gene 8699..9019 /locus_tag="DP116_20505" CDS 8699..9019 /locus_tag="DP116_20505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318695.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_20505" /translation="MSIYVGNLSYSVTQEDLSKVFSEYGTVTRVQLPTDRETGRPRGF GFVEMESEQAEDKAIQALDGAEWMDRVLKVNKARPREEKDSRFSGGNSGGRSNDRYSG RGRY" gene 9348..10895 /locus_tag="DP116_20510" CDS 9348..10895 /locus_tag="DP116_20510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit N" /protein_id="PRJNA477356:DP116_20510" /translation="MDFANLASQLNAGTILPEGIVIVTLLGVLIVDLILGRTSSRWIG YLAIAGLLASIVALLFEWENTNPISFGGAFNGDDLSIVFRALVALSAAGTILMSIRYV EQSGTPLAEFIAILLTATLGGMFLSGASELVMIFISLEILSISSYLLTGYTKHDPRSN EAALKYLLIGASSTAVFLYGVSLLYGLSGGQTELSAIASGIATSGFGQSLGLVIALVF VVAGIGFKISAAPFHQWTPDVYEGAPTPVIAFLSVGSKAAGVALAIRLLTTAFPLVAN EWKFVFTALAVLSMILGNVVALAQTSMKRMLAYSSIGQAGFVMIGLVAGTQAGYASMV FYLLIYLFMNLCGFTCVILFSLRTGTDQIVEYSGLYHKDPLLTLGLSISLLSLGGIPP LAGFFGKIYLFWAGWQAGQYWLVLLGLVTTVVSIYYYIRVVRMMVVKETHEMSEVVKN YPEVRWNLPGYRPLQVGLVVTLVATSIAGILSNPIFTLANNSIAHTSILQSTTVVTTQ VSAMNNE" gene complement(11009..11584) /locus_tag="DP116_20515" CDS complement(11009..11584) /locus_tag="DP116_20515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870052.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="PRJNA477356:DP116_20515" /translation="MGTHKILVIDDTTVVRVKVREMLPQGNFQVLEARDGLEGLNFIR QEKLSLILLDFVLPKVSGWEVFQEIQSQPDFKKIPLIIMSGRKKEVMEKIPEPFEYFE FLGKPFDQKQLIDAIKSAMTKAKKPRQEPAELLAVSARNTRITATSSENGVSTADIQI LNQKIASIQTEIDSLNKQLAQVVTFIQQKIK" gene 11888..12793 /gene="lipA" /locus_tag="DP116_20520" CDS 11888..12793 /gene="lipA" /locus_tag="DP116_20520" /EC_number="2.8.1.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipoyl synthase" /protein_id="PRJNA477356:DP116_20520" /translation="MTSSQKAELKSEIRAMPAWLRRSIGKASELSTVQRIIKQRQIHT ICEEGRCPNRGECYSQKTATFLLMGATCTRSCAFCQVDKGHAPMPLDSEEPEKVAQAV QLLGLRYVVLTSVARDDLPDQGAGHFVKTMETIRQLNPETQIEVLTPDFWGGAGAGQQ GQRERIYKVVKAKPACFNHNIETVQRLQGPVRRGAKYDRSLFVLQVVKEIDSCIPTKS GLMLGHGETVEEVIETMVDLRKVGCDRITIGQYMRPSLEHLPVQKYWTPEEFDELGKV AREMGFSHVRSGPLVRSSYHAGVDE" gene 12941..13111 /locus_tag="DP116_20525" CDS 12941..13111 /locus_tag="DP116_20525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318687.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Photosystem I protein" /protein_id="PRJNA477356:DP116_20525" /translation="MTADAKTAAANATAAKVGEDVAKSGAKPPYTFRVGWAVLLLAIN FLVAAYYFRIIQ" BASE COUNT 3531 a 2969 c 3097 g 4170 t 10 others ORIGIN 1 ccaacgcact ggctctcctt attaaggaga ggggtgccga gtggcgtgtg aaatccccgc 61 aagtacttgt tcactgttaa ctgttcactg tttcaagaag ccttattgcc tattccctgt 121 tccttctttt gctgcattta acttcgggta agcaatgcgc ttgtgatgga attgctgcca 181 aacttgcacg aaaatttcgg caatttgggt catttcctct cgtgttaacc cggaatctac 241 catttgattg tcttgccatc tagcgcggag gatgttgttc agcattgcta aagcttgttc 301 tcgtgtggcg tctgtgagcg atcgcagcgc cgcttcacaa gaatctgcca acatgacgat 361 tgctgtttcc cgtgactggg gtatcggacc atcgtagcga aaatctgcct ccatgacttt 421 taaacttgga tctaaaagcg ccatttgctt tgcttggtga tagaaatagg caatttgcat 481 tgttccctga tgttctggaa taaaagcttg aactgaggaa ggcaaactgt actttcgcgc 541 gatcgccaac ccttcactca cgtgcttttt gataatgtac gcactcatcc aaggatcttt 601 aatttctgtg tcgtgtttat ttggtccccc catttgattt tcaataaatg ccattgggtc 661 gtgtattttg ccgatatcgt ggtataatgt cccagcccta acaagttcga cgttgcatcc 721 taactgtttg gcagcagctt cggcaagagt cgcgacaaac agggtatgct gaaaagtccc 781 aggagtttca gtcgcaagtc ttttcaataa gaggcgatta gggttcgcca gttctgcgag 841 gcggattgga gtgactaagt caaatagttt ttctaaataa ggactcaacc ccatccccac 901 gatgctccat actaagccgg ataaagcaaa taatcctgcg ttttgtagta ccaagtagtg 961 cactggaact cccaatacag cactgatcaa gagcttgatg atcaggtaaa cagcaccttc 1021 tgttaaggcg atcgcaagac tcaaaagtgc taattcttca cgcgatcgca accgttgtgc 1081 aacgcaactg cccaaaactc cccctgatac accagcaact agagcaatga gacttatttc 1141 tagactcatg ggtagcagca gtgacagcaa tcctacaaca gtcacgccca aagtaggtcc 1201 atagaagcta cccagcaata acccaacggc actccaggtg gtgtaaggtg tacccatcat 1261 ctgtaccagt ggtacactaa ggctcagcaa tagtactaac aggcgatcgc tttgccgaat 1321 ggagtgcttg atccgcccct ctaccattgc aaaaatagca acagacgcgg tgacaacact 1381 ccctaactgt atcaatccca gccaatgaat ctcgcgctca atcaggtgat aatgctctag 1441 tacgtcaaag ttccatgcac tcaccttttg ttgttgacgg acaatcacct cacccttata 1501 
aacagtaaca atcacaggtt ctacttgagc agctatcttt gcacttattc gctgcgtttg 1561 taattcatct ttcttcagat ttggcttaag tacagctaac aaaagtttgc ttgctaaggg 1621 ttcggcatct tttggtacga acgtctgcac ttgtatactg actgtatttt gtaggatatt 1681 ttgtggtagt ccgggtgaaa tgccttgggt gagaatccgc tctaaacttt gctggatttc 1741 tatttttatt ttcgtccaat cctcatctga caagttgaga atagaagctt catcgtatac 1801 tgtctctggt ttggcaatca ccaaatctgt gagtttagtt tttgcttgag cgtatccctg 1861 gcgtattttt gtaatttgag cgattactga agaaaggttt tgcttggaac tgatgaggcg 1921 ataagcttgt aattgggcca ccgcttgtgc aaagtctata ttttggaaca aatcatttga 1981 ttttaagtct tcagtcttgg gtgcgtaaga acctgaaaca gggggttgtt gtgatgttcc 2041 tgatgaggct agcacgttga caccagaagc ttcttttgat gtggaaggag tgtgatacgg 2101 tgaggcaaaa gcagaaactt ttggtgcatc ctcatttctc tgtgttctcg catccttgtg 2161 tcctctttga cttccccgct tttgaacgaa cattcttgat ttttgcttgt cagtactttc 2221 tacagctaac agcagtgctt gccattccca attagaacac gaacgcaggt aacgctgggt 2281 agagacggat aaaaccaaag tgtcaaaaaa aggaaaagat ccaactgttg cacgaatttc 2341 gtttccctct tgcaaaatct gttgtaaatt ttgctcaatt tctgcattga tgcgttcatc 2401 taccatgagg actggcaagg aactgtttgt tgcggctttg cgcttttctg ctgttttttt 2461 cttattttcg attttcgcag tcgctggtgc tctaattgtt tgtggcgcaa cagtccccac 2521 ttgcaactgg ggttggttat ataacttttg ccccataact cctgttatag agacaactgc 2581 gatgacgaaa acaacgcagg aacgcctgac tttcacccag ggcgaaccca ctgcgtcaag 2641 accattttta tctttaagcg atttgttaat tgagcgtgac actcttgctg ttgggtatcc 2701 atcttttcta cttgtttttg atgatgcaat atgacaatgt cgcactttgt actgtcgccg 2761 ccagaaagtc agcttctgag tcaaggactg aaaaaattgc tgcgttttca tcacctatga 2821 ccgcagttac ttactttgat cggtttaaac aagacaatct agagcgggaa tatacttgaa 2881 gccgatttgc cactatagct acctacagac tttgaatgag aatttccttt tttatcttag 2941 tattaaaagc tactttcgta actcatgttt gaattacttt gtagtcttta atgttttgga 3001 atcagtcacg tttagcgata gaaaaaatgt tagcgtgacc gaatgacgcg aggagctgct 3061 gttacctgta aagacaatag aggcggtgtg atagtaccgc tttcttctct gactttatct 3121 ggatgcgaag caaggttata tactcctgac catacagcca caattgcgtc agctaattct 3181 accattttag 
aagagccagt tcgtttccag tcttgtgcag agctctgttg cacaagcata 3241 acctgccaag tgacaggaat accaactgca ctgttgtatg ctcctgataa agcacctgtc 3301 atggcaccta ttgcacaaga acgataatta tcttgaatgg ctcgcaaaac agaaagacga 3361 aagtcttcta tggtactaag aaagcagtaa aaagccaagg ctatgatgtt acttaacttt 3421 tgttccttac ccaactcggc ttgtaccgtt tctaagcctg cgccgtattc taacaaatta 3481 tgaacttcta ataattgttg tggcaagttt gtctgcgttt ctccaacaaa ggagatcgtc 3541 tgagaaatga gggcttgtgg agagagtttt tctgtgagag attttgcgat cgcatagccc 3601 actgctagtg ttccatcccg tacgactgga tcatcctgcc aaatttccgt cgccagcagt 3661 aagttttgtc gcaatttcac tgtattgtca tgaaaaaata gcgccactgg caatgtggcg 3721 agaatagcct catagctatt ttttaatttc gtaaattctt gctgttgggg tttttgccaa 3781 tcttgtgtgt ctaatctacc tttggaaatc aaactttcag cacataaaat agccatgtca 3841 ctccagtgca aagatggtat tttcaacgtc cctgcaccct gtaactgctt gctacgacta 3901 gaagctatct gttctccaat aattgccccg agtattgtgc ctttgaatcg acttgtgaga 3961 gagtagcgca tatcagaaag taagtgaaaa gtcaatatag tttttttact tttttacttt 4021 cttatataat gcactaatgc ctgcaagcct agtaaataac tttgtgcccc aaaaccacta 4081 atctgtccaa tgactactgg ggctatatat gaatgatggc gaaaatcttc tcggcggtag 4141 atattactca ggtgcacctc gactgtaggt aagttaacac cagcgatcgc atcccgcaaa 4201 gccacactcg tgtgagtgta tgcccctgca ttgattaaaa ttccctgatg ttttcctagt 4261 gcttcatgaa ttgcatctac cagagcgccc tcataatttg attgtaaatg agagactttc 4321 gcctgtagct tttctgcttc ttgttctaag gagcgattaa tttcagccaa tgtcaacgaa 4381 ccatatattc ctggctctcg ctgtcccagt aaatttaagt ttggtccatg cagtaccaga 4441 atacttatgg ggcgctcagt taaatctagc acggttaagc ttctttatcg acggcgtgaa 4501 cgatcttgca ctggaatcgg aatgagttcc ggttccggtt ccgcttctgg tcccaaaagg 4561 gcttctatta gcttgcgtgc caagtctttt agtttttcta gcaccttttc tatatagtcc 4621 attgaactga ccgctcccga gatactacca acgatgttta taatccttca cggagaaacc 4681 agagtatttt tgcaccctca gcttaatact atgatagttt acactctgtg aatgcgcgaa 4741 aatttgatgt tgttttctct tctacatttt agagtatcat atcatctcca ctacagactg 4801 catcttacta aggagggttt gataggtata gaagttaagg cggtcaaaga tttttttcat 4861 ttttataatt tcttttgttt 
ttctttatgt caagttggca agcgcttgta aacaaaagca 4921 tctgccctca gttgtacata attaacgttt tcatgctgga ttatgagggc gatcgcccca 4981 aaggggcggt cactccgtgc catcgcacgt tctagtctgg aagagtggcg atcgccattc 5041 ctaacacatt ttccaaatta cgaaaaacga ctcgatgatg tcaacttttt agcaaaactc 5101 gccttttttg tcattttttt tttgcagtgg tggttgagga cttagttgtt gatttagagg 5161 tagtagaacn nnnnnnnnnt gttttacgag tcgattttgc tgttgatgct ttggatgcta 5221 accactccag ggcggaagca agagtcacat cttttactga tacaccttct gggatgctta 5281 cattcgtttt gccgtgctta atgtaaggtc cgtaggggcc attgtagata ttgactggtt 5341 caccatctgc aggatgagag cctaattccc gttccgccgc cttagacttg ctgttagtgg 5401 tgctgcgtcc ttttttcggt tcagacaaca actctaatgc acgttctaaa gaaattgtca 5461 atatatcatc actcgctttg agggaacggt aatcttttgc accactttgg ttatgaacaa 5521 cgtaaggtcc aaagcgtcct aaattcgctt ggatttgagc gcctgtttgt ggatgaaccc 5581 ccagtgttcg cggtagtgac aaaagaccaa tagctgtctc aagagtgata ttttctgacg 5641 ttattccttt tggtaaggaa gcttgtttgg gttttgggtt ttcgtcggtc ttatcaccta 5701 attgtacgta gggaccataa ggaccaattt tgacataaat tgtttcgcca gtttctgggt 5761 gaataccgag ttcgtctgga ccaacgattt tttgccgcag cagtgtttct accttttggg 5821 ggtcaaggtc agcgggagtc aggtctttgg gaatagaggc ggtgacaaca ccatcaccat 5881 tctctttttc tatataagca ccatacttac caatgcggac tttggcatct aggttttcta 5941 gttccactgt ccgggcaaca ttggcatcta tttggctttc ctgttcctta actaaggttt 6001 ctagaccttt gtctcccaga tagaattcct tcaggtaggg taaccacgca gcttcacctg 6061 tggaaatgtc atccagggtt tgctccatct tggaggtaaa actgggatca acaacatccg 6121 gaaagtgctt ttccaacagt tccgtgacgg cgaaagctgt aaaggtgggt atgagggcat 6181 tactcaccaa ttgggcataa cctttatcaa taatggtgcc aatgatgcta gcgtaggtgc 6241 tgggacgccc aatgccttcg ctttccagag tttttactaa ggtcgcttcg gtgtatcttg 6301 cgggaggttg ggtttcgtga ccaactgctt ctaaatcagt gcatttggga ctatccccca 6361 cttttagatt aggcaaaatg acttcttggt cttccagcgc cgcctctgga tcgtcagaac 6421 cttccacata agcgcgcaag aatcctggaa agtcaatgcg tttgccagag gagcgaaaac 6481 cggcatcttc cacgagcagt tgcatgatga tttgagtttg gcgagagtct gccatttggc 6541 tggcgacggt acgcttccaa atcaggtcgt 
acacttggaa ctcgcgacca cttaaaccgg 6601 tttcttgggg agtgcggaag gtactacctg cggggcgaat tgcttcgtgg gcttcttgtg 6661 cgcctttgga tttggtagtg tattgtcttg gttgggggct gaggtaatct ttaccgtaaa 6721 gtttgtctac acagtcacgg gcggcggcga tcgcctgatc cgacaaatgc accgaatctg 6781 tacgcatata ggtaatatat ccctgttcgt acaaactctg ggcaacccgc attgtgtccc 6841 gtgcgctaag gcgcagttta cggttggatt cctgctgtag ggtcgaggtg gtaaagggtg 6901 gcgcgggttt acgcgtaaca ggacgttcct caaggtctgt gactttccaa gtttttcctg 6961 ttaggcgttc tttcagggct tgtgcttgct cttgattaag caagatcaca ttgcgatctg 7021 ggggaatttt ccctgtggcg gcatcaaaat cgccgccatt cgccagcctt gttcctccca 7081 atgtgaccag ttgggaagta aatgactgcc ctttgggaga atcagggggg tttaacgttg 7141 ctttcaaatc ccagtaagaa ccttggcgga aagcacggcg ttgacgttcc cggttgacta 7201 agagccgcac agcaacagac tgtacgcgtc cagcagataa tccccaggcg atttttttcc 7261 acagcagggg agacagggta tagcccacaa gtctatccaa aatccgccgc gtttcttgag 7321 cacgaaccaa ctgttcgtca atattgcggc agtttttcaa ggctttcttg atggcgtcag 7381 aggtaatttc gtgaaacacc atccgctttg taggaacttt tggcttgagc aactggtata 7441 aatgccaact gatgctttca ccttcccggt cttcatccgt tgccagaatc agttcatcta 7501 cttctttaag ggcgtctttg agctgagtga caactttctt tttatcttta gggacgacat 7561 acagcggttc aaagtcggcg tctacattta ccccgagctg cgcccatttc tcgcccttga 7621 cattggcggg aatttcagtt gctgattgag gaaggtcacg cacatgaccc atagacgctt 7681 ccacccggta gtctcttggc aggtagttgc gaatggtacg agctttggtc ggagattcga 7741 cgatgacaag agttgacatg gaaattttaa acaaagaata gctgctaagc taaacagtca 7801 attgaaatcg ttaagagcgc atcggttcag ctaaaacctc acgctgagcg atatgtgatt 7861 ttatccttac acaaactgcc cactaggcgc tgtcattcgc ctatgtttag cacaacttcc 7921 tcccagatgg aaggtagggc gggagcgcgg gtgtgagttt ctcgttattt caatctttaa 7981 cttataatcg gcataattgg cgactacagt tttcaaaatc ccccaacctg catcgttata 8041 caattacata acaaccaggt ttgtgtcgcc tatgccaaaa tgaaatcaaa tttaacagcc 8101 tcagggaaca gggaacaggg aacagggaac agggaacagg gaacagggaa tatccccgta 8161 attatataca gtgcaagccg ctctttcggg cggtctgggt ttgataagga tacattgttt 8221 atatgccctg cgggcacgtt cgctaacgcc cgattcgtgt 
gctttgcgct tacgtactac 8281 gtgtcctaag agtgcaggaa gcccttaagt tgatttggtg gagtgacttt taccttttgt 8341 ctttcttcat ccagaggaat gttttgtaca aatctaaatt ctcaaaaatt aagcttatag 8401 taactagctt gttcttgtta tgtattcatt cactttgccg taaataaaat ataatgaaaa 8461 tatcatatag tcatctatct gaggaagtat ttaagcaagt catagttgat atagaaaaca 8521 tagtaaaaga taatacttga aatcaatatc aaaggataga tttaaccttt ttttaaaaac 8581 agttgtgtta ggattgagtc tggagaggtc aattcggcga ttctgatctt gttattatta 8641 aaaagtgcaa gcggttctcc ggatacacat ctctttctat cggattctgg agaatttcat 8701 gtcaatttac gtagggaacc tatcctacag cgttacacaa gaagacctca gcaaagtgtt 8761 ttctgagtat ggtactgtaa cgcgagttca attgcccact gatcgggaaa ctggtcggcc 8821 acgtggtttt ggtttcgtag aaatggaatc agaacaagcg gaggacaaag ccattcaagc 8881 cctagatggt gctgagtgga tggatcgcgt gctaaaagtc aataaagcaa gaccacggga 8941 ggagaaagat agtcgcttta gcggcggtaa ctcggggggc aggagcaacg atcgatattc 9001 tggtagggga cgctattaag gcttgaagtt attgatgaat caagcttcac atctgtagtg 9061 tagcctaaag gcacaaaaag catttgcaaa gcgaaactta atatctaacg gaatgagcag 9121 tttctaaaaa actgtcacca tcagtttcta tgcaagcaga ttagtttatg actggtctgc 9181 ttaactttat aagaagaact ttttttccat cacgctcact aaacttaaac ataatttaat 9241 ctgatggtgc cactcacccg atacacgagt gaatgcaacg agagtaaaag acaatagaat 9301 gaacacagtt agaccattat gtctgatagc caaaaccaat acagctcatg gattttgcta 9361 atcttgcatc gcagctgaac gctggaacaa ttctgccaga ggggattgtg attgtcaccc 9421 tcttgggagt tttgattgtt gatttgattt tagggcggac gtcctcacgc tggattggat 9481 atctagcaat tgcaggttta cttgcttcta tcgtcgccct gttgtttgaa tgggaaaata 9541 caaatcccat ctcttttggc ggtgccttta atggtgacga cctaagtatc gtctttcgcg 9601 ctcttgtagc attatctgcc gctggcacca ttttgatgtc gattcgctac gttgaacaaa 9661 gtggtacccc tttagccgaa ttcatcgcga ttttgctaac tgctactctg ggaggaatgt 9721 ttctatcagg ggctagtgag ttggtgatga ttttcatctc tctagaaatt ctgagtattt 9781 cctcttattt actcacaggt tacactaagc atgacccccg ctctaacgaa gcagcgctga 9841 aatatctgtt gattggtgct tcgagtacag cagtgttttt gtatggagtt tcgctgttgt 9901 acggtttatc gggtggacag actgaactga gtgcgatcgc cagtggaatt 
gccacatctg 9961 gttttggtca atctctgggt ttagtgattg ctctggtttt tgtggttgca ggtattggct 10021 tcaaaatctc cgctgcgcct ttccaccagt ggacaccaga cgtttatgaa ggcgcaccca 10081 ctccagtaat tgccttttta tctgtcggtt ccaaagcagc tggggttgct ctagcgattc 10141 gcttgctgac aacagccttc cccctcgttg ctaacgaatg gaagtttgtt ttcactgctc 10201 ttgccgtcct gagtatgata ttgggtaatg ttgtcgccct tgcccaaact agcatgaaac 10261 ggatgctggc ttattcttcc atcggtcaag ccgggttcgt catgattggc ttagttgcag 10321 gaacacaagc aggatatgcc agcatggtct tttatctgct gatctacctg ttcatgaatt 10381 tatgcggctt tacctgcgtg attctgttct ccctgcggac aggaactgac caaattgtgg 10441 aatactctgg tttgtatcac aaagacccac tcctgacact ggggttaagt atttccctac 10501 tgtccttggg tggtattcca ccactagccg gattcttcgg taagatttat ctgttctggg 10561 caggttggca agctggacag tactggttag ttttgctggg cttagtcacc actgtcgtct 10621 ccatctacta ctacatccgc gtggttagga tgatggtcgt taaagaaact catgaaatgt 10681 ctgaggtggt gaaaaattat ccagaagtac gttggaattt gccgggatat agacctttac 10741 aggtgggatt ggtcgtgaca ttagtcgcca cttccatcgc tggaatcttg tcaaatccga 10801 tatttactct ggctaacaat tccatcgctc atacttcaat tttgcaatcg acaacagttg 10861 tgaccactca agtgagtgca atgaacaacg agtaagttag tcatgtgttg gtagctgtgc 10921 ataagcgcaa ctaccaacaa aagaaatctt gtgactcaca ttccccaggt agctcaaaaa 10981 gcgacgacaa ttttttaagg atttcctatt atttgatttt ctgttgaata aaggtgacaa 11041 cctgagctaa ctgtttattc aaactatcta tttctgtttg tatgctggcg attttttgat 11101 ttaatatttg aatgtcagca gttgatactc cgttttcact agaagttgcc gttattcgag 11161 tatttctagc agatactgct aagagttcag ccggttcttg acggggcttt ttagcctttg 11221 tcattgctga tttgatcgcg tcaataagtt gcttttggtc aaaaggcttt ccgagaaatt 11281 caaaatattc aaaaggttct gggatttttt ccatcacctc ttttttacga ccagacataa 11341 tgatcagagg aatttttttg aaatctggct gggattgaat ctcctggaaa acttcccagc 11401 cactcacttt aggtagcacg aaatccagca gaatcagact gagtttttct tgacggatga 11461 aattcaatcc ttcaagacca tctcttgctt ccaacacctg gaagttgcct tgtggcaaca 11521 tttctcgtac ttttacccgg acaactgtag tgtcatcgat aactagaatt ttatgagttc 11581 ccacgagtgt ttggtgtaat aaaaacttta 
tgtttagaga gtggtttcgc caattcacac 11641 actatactca atcttttctt gatttagggg agagtaattt ctcggagaaa taacaatatt 11701 aaatgtattt tgtataactt tttatttcga tttgtatcat ggtgatattg aaggcaataa 11761 ggaagagaga agaaagaatt attgtccatt aactcttctc cccctctccc tgcccctaag 11821 cgtagtgagt ccctacttgt tcactctctc actcctatac tattgtttac aaagatttta 11881 aatggttatg acttcttcac aaaaagccga actgaaatct gaaataagag caatgcctgc 11941 gtggttgcgt cgttcaatag gcaaagccag cgaactttct acagtacagc gtattatcaa 12001 gcagcgtcaa attcacacga tttgcgaaga gggacgctgt cctaaccgag gggagtgcta 12061 ttctcaaaaa actgcaactt tcttactcat gggcgcaaca tgcaccaggt cttgtgcttt 12121 ttgtcaagtc gataaaggtc atgcacccat gcctcttgac tctgaagaac cggagaaagt 12181 ggcacaagca gtgcagcttt taggattgcg ttatgttgtg ctgacttctg tcgcacgaga 12241 tgacttgcca gatcaaggag caggtcattt tgtcaagaca atggaaacta tccgacagct 12301 aaacccagaa actcaaattg aagtattgac accagatttt tggggtggtg cgggtgctgg 12361 acagcaaggt caacgtgagc gtatatataa ggtagtgaag gcgaaaccag cttgttttaa 12421 tcacaatatt gagacggtgc aacggttaca aggaccagtc cgccgggggg cgaagtacga 12481 tcgctcgctt tttgtcctgc aagtcgttaa agaaatcgac tcttgcattc ccaccaagtc 12541 agggttgatg ctaggacacg gggaaacagt tgaggaagtg attgagacaa tggtggatct 12601 tcgtaaggtg gggtgcgatc gcatcacaat cggtcagtat atgcgtcctt ccttggaaca 12661 tttgcccgtc caaaaatatt ggacaccaga agaattcgat gaattaggca aggtagcacg 12721 agaaatggga tttagccatg ttcgttctgg tcctctggtt cgcagttctt atcacgcagg 12781 ggttgacgaa taaaccatcc tgtccgggac tttgtgctac attttgttta tttttgctga 12841 tttgtgctgc aaagtctaca aaaatattct tgaagaatct ggcaatatat gacagctctt 12901 tggtatttgt taagaagtct ttaaattagg agagaacaca atgactgctg acgctaagac 12961 agctgctgct aatgctactg ctgcaaaagt aggcgaagac gttgctaaaa gtggagctaa 13021 gcctccttat acctttcgtg taggttgggc agtgttgcta ctagctatca atttccttgt 13081 agctgcctat tacttccgta ttattcaata attagttgac atcctcccac ggctgcggaa 13141 attatgtgcc cttggcgctc agcatgagcg ttcgcttacg ctgatttccg cgctcaaagc 13201 cgtgggattg tgaatctcac gacttggttt gtctggttgt gcgagtcagc ttaattggat 13261 gagtgtgatc 
aacaagatag tggcgggttt gaatcccgca ataattgcat tgctgaattc 13321 agaattctga ctcctgtaac cgcaagcgca cgcccagagg gctttagcgc agcgtctccg 13381 gaggagatac aaagcagcgt gtccgcagga catattctga attgttcttc ttaccctaca 13441 tcacggttat tcatacttgc gaagaaatat cgcgctacat atcatttcgt cgcagtagtt 13501 gtctatttgt acttgtactt acagcaattt tcgggtaaat agaccacatt gtaggggcgc 13561 aaggcattgc gcccctacga aagattgtgg ttcaaataga tgaaaagcgc tgtaaagacc 13621 agataagtgg gacaaagggt caaaggttcg gcaatgctat cccaagaaaa acaaaggaac 13681 aaagaaagtt catcaagctg aacttctgct ttgctaattc attaaaaaag ctacttctaa 13741 aaggaacatt catatctaac cttaagaggt atgttta // LOCUS NODE_2484_length_13692_cov_5.01525313692 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13692) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13692) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13692 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..870 /locus_tag="DP116_20530" CDS <1..870 /locus_tag="DP116_20530" /inference="COORDINATES: protein motif:HMM:PF13432.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20530" /translation="IQKNWNLKNENIEKFYIGLANELYKQNRLQESVYIYKEALAVNP NNVIIHENLVYILLQCGQIDEASAQYLKAIHLEPNNVKFYVDLGKIFYVKAQFPEAIE VYKQAIKLNTNGASDDLAEIHYLLGCTFCECERFENAIQEYTQAIKIKPKYAEAYAKL GDCQYRLNNFVDAANNLRKAIALKPDNPEFHLYLGRIISSQGEYREAIVEFEKAIELN IIDSTPDPIVYTHYAFALFAQNKQKKAKTEIERAIKLFQKQNMDDAAKQMEHLFEQIK KESSWKSFFQRFR" gene complement(898..1848) /locus_tag="DP116_20535" CDS complement(898..1848) /locus_tag="DP116_20535" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20535" /translation="MTRIRKQALFAALLFGNVSLSSVSSLVLDKYLKNSLKASYGYST TLLNSKTKPIHSSSFQPAIGIKFPRFKLPPIKLPPIKLPWGRLLKPLKQFIRQIFRPK KPPGVRPSKPPKPAPVIPLQYSPSNDQQVNLVSDAMSKSKQKQTVQIFLNDMSPKNAN LLEQRLEADLTRKIQAKLLTPNLSSQFPVQIRSQSQVQNLSDQTFQEAAESFYPARDA AEQVLASRYKARNGVWTNKDTKDAQLAAQQVIQEQRRQKKELVLSGTLVLSITTTTAI AVIAVESAKAKVQKNNNNRNFRTLPRNRNPVYRLFKKILR" gene 2091..3152 /locus_tag="DP116_20540" CDS 2091..3152 /locus_tag="DP116_20540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208270.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_20540" /translation="MTVQLSQPNFQRLTRIVQNLPDFANVRDRRRLVAGALQGAAQAD VIMARLDLDGSPMGVSVEVVRFFCQFGRVAYDKEALAVFLNYIQPFTGDEDKDFIVEL FQNYPLDVPASPSRGINNWRGMDSTADIKEKIIGENTLRDIYILNLALEASKAVVRIR TPEGLGTGFMIAPDLLMTNNHVIQSQEVGDKSNFSFNYQLDINGKECPTQIIGALPNG AFYTNKELDVTVVTLKDVPNFGKPLIFKSKLMRRDERVAIIQHPGGHLKKISIQNNFV AYADNQVLQYTTSTEPGSSGSPVFDDDFLVVGIHHSGGMLPEPSTQRRYLRNAGTSAV ALLNDLKNNAPEIYARLAI" gene 3347..3925 /locus_tag="DP116_20545" CDS 3347..3925 /locus_tag="DP116_20545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208137.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_20545" /translation="MVAITQQPQKMTAEEYLEWELQQDIRYEYINGEVFAMTGVTIPH NDIALNFYTTLHPHLRYRGCRVNVSDVKVQLSAQSQYYYPDVIVSCDPQDLNARKFIQ FPKLIAEVLSPGTSGKDRGDKFTDYLKIPTLQEYILIDSEKISVERFCRGEGRMWLYY PYTAEDIITLSSIEFEFPIELLYEGVAFETEA" gene 3992..6142 /locus_tag="DP116_20550" CDS 3992..6142 /locus_tag="DP116_20550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycine--tRNA ligase subunit beta" /protein_id="PRJNA477356:DP116_20550" /translation="MPSFLLEVGTEELPASFLSSAISQWKSRIPHTLEENSLTYDAVE VYGTPRRLAVLIKGLPSQQADREEEIKGPPAQAAFKDGNPTPAAQGFAKKQGVEVSAL EVRPTEKGDFVFVRKVTRGRPVAEIITELVPQWIFKLEGKRLMRWGDGDKTFSRPIRW LVALLDEAVLPIELDNGSETVKSNRISQAHRVLHPEPITIPNATDYVTTLRSASVVVD TDERVNTITQQVKESVQKLGGYAEIYPDLLQEVTNLVEFPSAVIGKFESEFLNLPTEV ITTVMVSHQRYFPVFQSSNTKDLLPNFVTISNGDPTKSDIIAVGNERVIRARLADGRF FYDADLEKPLESFLPQLETVTFQEDLGSLLKKVNRICKIAEQITEQLQLSEKERENIQ RAALLCKADLVSQMVYEFPELQGVMGQKYALASGEEEAVATAIFEHYLPRSADDILPE TLTGQVVGLADRLDTLVSIFGLGMIPTGSSDPFALRRAANAVVNITWTAHLPINLQQL LEKVATDFASEYHKDRNQLVAGLEEFFLQRIRTLLQEEKHIDYDLVNAVLGENDREYT ERALKDLLDVGDRATFLQKIRANGTLDNIYETVNRSTRLAAQGDLDTKQLDPKAIVRK ELFQKSSEEAFYNAIVELVPQTQAAQQSRDYGQLVTALEQITPTVSNFFDGAESVLVM DPNPEIKRNRLNLLGLLRNHARVLADFGAIVKNL" gene 6233..7717 /locus_tag="DP116_20555" CDS 6233..7717 /locus_tag="DP116_20555" /EC_number="6.3.2.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748369.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase" /protein_id="PRJNA477356:DP116_20555" /translation="MPKAHVIGLGKSGIAAARLLKREGWEVELSDRNTSLDLLNQQQE LATEQITVKLGYSLELSDSDLPQLIVVSPGVPWDIPVLIQARELGIKTIGEMELAWQH LQSVPWVGITGTNGKTTTTALIAAIFQQAGLNAPACGNIGYAACEVALSVENGGRINS KSRSVASKATQSQNSKLEVSPSSPASSDNQIDWVIAEVSSYQIESSSSLAPRIGVWTT FTPDHLSRHKTLENYYNIKAHLLRQSQLQVFNGDDPYLRKVGASDWFDAYWTSVKGKD NLIGSQGYYIEDGWVVEKLNANSQPERIVEVSALRMVGEHNQQNLLMSVAAARLGGID QDAIVRAVNEFPGVPHRLEHICTWEGIDFINDSKATNYDAAEVGLVSVKSPVILIAGG EAKAGDDTNWIAKIKDKAACVLLIGTAAPAFAKRLQEEGYENYEIVETMEKAVPKSAE LAKHYQASVVLLSPACASFDQYANFEQRGDHFRQLCLELVKIIN" assembly_gap 7768..7777 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 8362..9240 /locus_tag="DP116_20560" CDS 8362..9240 /locus_tag="DP116_20560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20560" /translation="MITIAGSRNCTYPVKSMKWMLTIALILSSLTYYPAKVTAQTQQP ATTKQLVQPTKGGISPQQRTSQPVFVFPKTPVRLSPVSGRRRGMGSRGNCPAVQTALT ALIPLREEQKVSKQTDKSISGIVGGLTTSERPTFWFYVPYTQDLANSSGEFILQDSAG NDISKNAIALPPKPGVIGVSLPSNTSLQVGKTYRWYLKVRCNQQQTASVPIYVEGDIQ RVNLDSRVMQQLEAAVDPAQKVAIYAANGIWFDSLTMLAQLRQKNPNDASVAEDWQSL LRSVNLDNVATAPLVK" gene complement(9392..12040) /locus_tag="DP116_20565" CDS complement(9392..12040) /locus_tag="DP116_20565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20565" /translation="MHPFALTKRLIRFALTHVVLFTLAFSLTIIPSTFAKQPQTNSVD NFHTQTTKPPTTTPQQLVSQGERFYQSGRFAEAVTVLQQAVRIYQREGDRLPQAAALT NLSLALEQLGSWKEASKAINTSLNLLGWDENNQKLNVNNPKSELLEVLAQTLEIQAGL QLAQGQADVSLKTSQQAEEIWKRLGKQYNTGVTRSRINQAQALRVSGFYRRSLDILNT VSRQLQTQPDSLEKVTALRTLGNAQQQSGDLEQSQKNLQQSLEIAQRLQLPQEISVIE FSLGNTARANGNRKNAIAHYEKAALIAPNPLTKVQAQINQLSLLVENKNTADLELSDT TSLLIPTIQSQLATLPTNQAGIYTRVNFARTITKFGNKRDIAEILATSVQQAKTIGSE RAQSYALGSLAEVYEQNSQWQEAQNLTQQALFIAQKILASDIAYRWEWQLGRLLKAQG NIEGAIAAYDSAVASLQSLRSDLVVVNREVQFNFRDSVEPIYRQSVELLLQQKGQGKP DLDKVRRRIEALQLAELDNFFRQACLSNQFVVLDKVVDRDNPNTAIFYPIILDNQLEV ILKLPNQPLIHKTSVVKRQEVEQVITKMRETIVEPDATKKFQVVSQQLYDWLIKPVEG ELKKSKVNTLVFIPDGSLRNIPVSALYDGALYLVQKYAIAISPGLQLFTPKPLAQERL NALAGGLSQPPKNEKFASLPNVKVELKLIQQSGISTTTLLDENFKSTTLGKTINAQPF RVVHLATHGQFSSKAKDTFILAADGRINVSQLDSLLKSREQKRTQPIELLVLSACETA AGDNRAALGLAGVAIRAGARSTLASLWQIGDDSTALFISEFYRQLTTGKTTAEALREA QLKLLSGTEYTRPLNWAPYVLVGNWL" gene complement(12410..12742) /locus_tag="DP116_20570" CDS complement(12410..12742) /locus_tag="DP116_20570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_20570" /translation="MAKIEQYRQFVQSLLTKYADNTSNSDVEVQLIFDTERDHYQWMS VGWRQLDRIYRCIIHFDIKDGKIWIQQNLTEVDLAEELVLMGVPTEDIVPGLIAPYKR QYTGFSVA" gene 12799..13401 /locus_tag="DP116_20575" CDS 12799..13401 /locus_tag="DP116_20575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009342847.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_20575" /translation="MTQALHKLVTFEEFAKWKPEDGRYELHDGVIVKMPQPLGGHEEV TGFLVRKLSVEFDRLNLPYFIPKTALVKPPDQESAYSPDVLVVNQSNLSSEPLWKKES TLIYGASIPLVVEVVSTNWRDDYFKKRGEYEGIGIPEYWIVDYLALGGKQFIGNPKQP TISIYHLIDDEYQVTQFRGDDRIQSPTFPEFNLTAQQIFN" gene complement(13429..>13692) /locus_tag="DP116_20580" CDS complement(13429..>13692) /locus_tag="DP116_20580" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_20580" /translation="TEPLQDTSTLSAWVRLRPKPANSAKTTISPQPTAVSNSTKVAAA TTQIVEATGWIVDKNGNIELVAQAPGVTPHSSWQTPASCPPSR" BASE COUNT 4044 a 2704 c 2949 g 3985 t 10 others ORIGIN 1 attcaaaaga attggaattt aaaaaatgaa aatattgaaa aattttacat cggtttagca 61 aacgaacttt acaagcaaaa tagattacaa gaatcagttt atatctacaa agaggcgctt 121 gctgttaatc caaataatgt gattattcat gaaaatttag tatacatttt gttacagtgt 181 gggcagatag atgaagcgag cgcacaatat ctaaaagcta ttcatcttga accaaataat 241 gtcaaatttt atgtagattt aggaaagatt ttttacgtca aagctcagtt tccagaagcg 301 attgaggtat ataaacaggc aattaaactt aacactaatg gtgcgtcaga tgatttagca 361 gaaatacact atttgttagg atgcacgttt tgtgagtgtg aacgttttga aaatgcgatt 421 caagaataca cccaagcaat taagattaag ccgaaatatg cagaagcata tgctaagtta 481 ggagattgtc aatacaggtt aaataatttt gtagatgcag ctaataattt acgaaaagca 541 attgcgctca aaccagataa tccagaattt cacttatatt taggaaggat tattagtagc 601 cagggcgaat atagggaggc aattgttgag tttgaaaaag ctatagaact caatatcata 661 gatagcacac cagatccaat tgtctataca cattatgcgt ttgcgctatt cgcacaaaat 721 aaacagaaga aagcgaaaac tgaaattgag cgagctatca aattattcca aaagcaaaat 781 atggatgatg cagccaagca aatggaacac ttatttgagc aaatcaaaaa agaaagtagt 841 tggaaaagct ttttccaacg ctttcgctaa atttagctcg taaaatcaca tgatttatta 901 cctaagtatt tttttgaata atcgatacac aggattgcga tttcgaggca aagtccggaa 961 atttcgattg ttattatttt 
tttgaacttt cgctttggca gattcaacag caattacggc 1021 aattgcggtg gttgtcgtaa tactcaaaac taacgtacca ctcaaaacta attctttctt 1081 ttgacgacgc tgttcttgaa taacctgctg tgcagctaat tgtgcatctt ttgtatcttt 1141 attggtccac accccatttc tagctttgta acgcgatgcc agtacttgct cagctgcatc 1201 acgcgctgga taaaaactct ctgcggcttc ttgaaatgtt tggtcactca agttttgaac 1261 ctgagattga ctgcgtattt gaaccggaaa ttgactgctt aaattaggtg ttaataattt 1321 tgcttgaatc tttctagtaa gatcagcctc taatctttgt tccagaagat tcgcattttt 1381 cggtgacata tcattcaaaa atatttgcac cgtctgcttc tgcttgcttt tggacattgc 1441 atcgctgaca aggttaactt gctgatcgtt cgagggagag tattgcaatg gtataactgg 1501 tgctggtttg ggtggtttgg atggtctaac ccccggtggt ttttttggtc gaaatatttg 1561 tcgtatgaat tgtttgagtg gtttgagtag tctaccccac ggtagtttta ttggtggtag 1621 ttttattggt ggtagtttga atcgtgggaa cttaatacca atagctggct gaaaactact 1681 cgaatgtatt ggttttgttt tcgagttcaa taaagtcgta gagtatccat aactagcttt 1741 gagtgaattt ttaaggtact tatctaatac caaactcgat actgatgaca gtgaaacatt 1801 gccaaacaaa agggcagcga ataaagcttg cttgcgtatc cgtgtcatag ttccatgcat 1861 ctgataaatt tttttataac ttgatgcttc tagcaactaa taagcttgtg aacatcatag 1921 tgagctttta caaactctct gcttgcaatt ccggaaaatt tactgtttgc aatataaatt 1981 tactctcgaa aatacttaaa ctatacttgg gacaataaac actaaactaa aatttcagta 2041 ttcgcagtaa caaacaaata ttatcgatac ttgctatagg agtagtgaca ttgaccgttc 2101 aactatcgca acctaacttc caacggctca cccgtatcgt acaaaactta cccgactttg 2161 ccaatgtgcg cgatcgccgt cgtttagttg caggtgcatt gcaaggtgca gcccaagccg 2221 atgttatcat ggcgcggtta gatttggatg gttcgccgat gggcgtttct gtggaggtgg 2281 tgcggttttt ctgccaattt ggacgagttg cttacgataa agaagctctt gctgtcttcc 2341 tcaactatat tcagcccttc actggggatg aggataagga ttttatagta gaactatttc 2401 agaattaccc gcttgatgtt ccagccagtc ctagccgtgg aattaacaat tggaggggaa 2461 tggatagtac ggctgatatc aaagaaaaga ttattgggga aaatacgtta cgcgatatct 2521 acattttaaa cttggcttta gaagcatcaa aagctgtggt tcgtatccgc actcccgaag 2581 gtttgggtac ggggtttatg attgcacctg atttactcat gacgaacaat cacgtcatcc 2641 aaagtcagga agtaggagac aaaagtaatt 
ttagcttcaa ctaccaactc gatattaatg 2701 gtaaagaatg cccaacacag attattggag ctttaccaaa tggtgctttt tacactaaca 2761 aagaactaga tgttacagta gtaaccctga aagatgtccc caacttcggc aaacccttaa 2821 tcttcaaaag taagttgatg cgacgagatg aacgtgtagc aatcattcag catcccggcg 2881 gacatttgaa gaaaatctcc atacagaata actttgtcgc ctacgccgac aatcaagtat 2941 tgcaatatac tacaagtaca gaaccaggtt catcaggatc acctgttttc gatgatgatt 3001 ttctagtggt tggtattcac catagcggtg ggatgcttcc ggaaccaagt acgcagcgaa 3061 ggtatttacg taatgcgggg acgagtgcag tagcactctt aaatgacttg aaaaacaacg 3121 caccagaaat ttatgctcgt ttagcgatat agttagccga tgacgataag tagcgagtta 3181 ctgaattttt aactattact cgttctgaat caagtatatt tggcgagaat tgtccagttg 3241 tgtgtaactt tgggtaggct ttatataacg gttttgacga actttcaccc tcacccacaa 3301 cgctacaatc aaaccaaacc cgaaaaagtc ttggcgttca aagattatgg tagccatcac 3361 ccaacaaccc caaaaaatga ctgccgagga atatctcgaa tgggaacttc agcaagacat 3421 tcgctacgaa tacattaacg gcgaagtttt tgccatgaca ggtgttacaa ttccccacaa 3481 tgacattgca cttaactttt acactaccct acacccacat ctgcgttata gaggttgtcg 3541 agtgaatgtg tcagacgtga aagtacaact cagtgcccaa agccagtact actatcctga 3601 tgttatcgtc agttgcgacc ctcaagacct caatgcccgc aaatttattc agtttcctaa 3661 actcattgcg gaagtcctct caccaggtac aagcggcaaa gatagaggtg ataaattcac 3721 tgattatctt aaaatcccca ctttgcaaga gtatatcttg attgactctg aaaaaatctc 3781 cgttgagcgt ttctgtcggg gagagggaag aatgtggctt tattatccct acactgctga 3841 agatattatc actctatcaa gtattgaatt tgagtttccg attgaattgc tatatgaagg 3901 tgttgcattt gaaacagaag cataacaagc acggtaaact ggttcaattg actaatataa 3961 ttaagcccag ctttccttga aaaatagtcc tatgccttca tttttattag aagttggtac 4021 agaagaactc cctgcaagtt tcctgagtag cgctatctcg cagtggaaat ctcgcattcc 4081 tcacactctg gaagaaaaca gcctgactta cgatgctgtg gaagtttacg gaactccccg 4141 ccgcttagcg gtactcatca aaggtttacc ctcgcagcaa gcggatcgcg aagaagaaat 4201 taaaggtccc cccgcacaag cagcatttaa agatgggaac ccaacaccag cagcacaagg 4261 ctttgctaaa aagcaaggtg tggaagtctc tgcgttggaa gttcgcccta ctgaaaaggg 4321 agattttgtc tttgtccgaa aagtgactcg cggtcgtcca 
gttgctgaga taataacaga 4381 acttgttcct caatggattt tcaaactcga aggaaagcgg ttgatgcgtt ggggtgatgg 4441 agataagaca ttttctcgtc ccatccgctg gttagtcgct ttgttggatg aggcggtgct 4501 accaattgaa ttggataatg gttccgagac ggtaaagagc aatcgcatct ctcaagctca 4561 tcgtgtctta catccagaac ccatcaccat acccaacgcg actgattatg tcaccacgct 4621 tcgctctgct tctgttgtgg ttgatacgga tgaacgggta aataccatta cgcagcaagt 4681 caaagaatcg gtacaaaagt taggcggtta tgcagaaatt tacccagatt tattgcagga 4741 agtcacaaat cttgtagaat ttccctcagc tgttattggt aaatttgaat cagagttttt 4801 gaacttgcct acagaggtga ttaccactgt gatggtaagt catcagcgtt attttcctgt 4861 gttccaaagt tcaaacacca aagacttact accaaatttt gtcacaatct cgaacggtga 4921 tcccacaaaa tcagacatca ttgctgtagg aaatgaaaga gttatccgcg cacgtttagc 4981 cgatggtcgc tttttctacg atgctgattt agaaaaacct ttggaaagct ttttacctca 5041 attggaaacc gtcactttcc aagaggattt aggttcgcta ctgaagaaag tcaaccgcat 5101 ttgcaaaatt gccgaacaaa tcacagaaca gttgcaatta agcgaaaaag aacgcgaaaa 5161 catccaaaga gcagctcttt tatgtaaagc tgatttggtc agtcaaatgg tgtatgaatt 5221 cccggaatta cagggagtta tgggacaaaa atatgctctt gcaagtggag aagaggaagc 5281 agttgcaacg gcaatttttg aacattattt acctcgttcg gcagatgaca tattacccga 5341 aactttgaca ggacaagttg tcggtttggc agacagactt gacacattag tcagcatctt 5401 tggattgggg atgataccca cgggttcttc tgaccccttc gctttgcgac gggcagcaaa 5461 tgcagtcgtg aatattacat ggacagccca tctacctatc aatctacaac agttactgga 5521 aaaagttgcg acagattttg cgagtgaata tcacaaagat agaaatcaat tagttgctgg 5581 gttagaagaa tttttcttgc aacgcatccg tactttactg caagaagaaa aacacattga 5641 ctatgactta gtgaatgctg tgttgggaga aaacgaccgg gaatatacag aaagggcgtt 5701 gaaggattta ttggatgtgg gcgatcgcgc aactttcctg caaaaaatcc gcgccaatgg 5761 tacattagat aacatctacg aaaccgtgaa tcgttccact cgtttagcgg ctcaaggaga 5821 tttggataca aaacaacttg accccaaagc gatcgttcgt aaagaactct tccaaaaatc 5881 atcagaggaa gcattttaca atgcgattgt ggaattagtg ccacaaactc aagctgcaca 5941 acagtcacga gattatggac agctagtcac agcattagag caaattaccc ccaccgttag 6001 caactttttt gacggtgcag aaagcgtttt agtcatggat cccaatccag 
aaatcaaacg 6061 caatcggtta aatttgctcg gattacttcg taaccatgcc cgtgttttag cggattttgg 6121 tgcgatagta aaaaatttgt agcatcagta aaaaaaagtt gtgtgatttt tgcaaataaa 6181 cgctaccata atggtgctta accagggacc tctgccacag tctatgccca ctatgcccaa 6241 agcccacgtc attggattag gaaagtccgg tattgctgcg gcgagattgt tgaaacgaga 6301 aggttgggag gtggaactca gcgatcgcaa cacctctctt gaccttctaa atcaacaaca 6361 agaacttgcc acagagcaaa ttactgttaa attggggtat tctctagaac tcagtgattc 6421 agacttacct caattgatag tagttagtcc tggtgtgcct tgggatatac ccgtgttaat 6481 tcaggcacga gaattgggta ttaaaaccat cggcgagatg gaactcgctt ggcagcattt 6541 gcaatccgtc ccctgggtgg gaattacagg cacaaacggt aaaaccacta ccacagcctt 6601 aattgccgca atttttcaac aagcaggatt aaacgctccc gcctgcggta acattggcta 6661 cgctgcctgc gaagttgcct tgtctgtgga gaatggagga cgtatcaatt caaaatcgcg 6721 tagcgttgcg agcaaagcga cgcaatctca aaattcaaaa ttagaagtct ctccctcatc 6781 ccccgcatca tctgacaatc aaatagactg ggtgattgca gaggtcagta gctatcaaat 6841 agaatcatct tcttctcttg cgcctcgcat cggtgtatgg acaactttca caccagatca 6901 cctcagtcgc cataaaacct tagagaacta ctacaacatc aaagcgcatc tgctgcgtca 6961 gtctcagttg caagtgttta acggtgatga tccatattta aggaaagtcg gtgcaagtga 7021 ttggtttgac gcctattgga caagtgtcaa aggtaaagat aatttaatcg gttctcaagg 7081 ctattacatt gaagatggct gggttgtaga aaagttgaat gcaaactctc aaccagaaag 7141 aattgtggaa gtttccgctt tacggatggt gggagaacac aatcagcaaa atttactgat 7201 gtcagttgca gcggcgcgtt tagggggaat tgatcaagat gctattgttc gcgctgtaaa 7261 tgaattccct ggtgttcccc atcgtttaga acacatttgt acttgggaag gcattgattt 7321 cattaacgac agcaaagcca cgaactacga cgctgctgaa gttggtttag tctcagtcaa 7381 aagtccggtt atattaattg ctggtggtga agccaaagcg ggtgatgaca ctaactggat 7441 cgcaaaaatt aaagacaaag ccgcctgtgt attgctgatc ggcactgctg cacctgcttt 7501 tgctaaaaga ttgcaagaag aaggatatga gaattacgaa attgtagaaa caatggaaaa 7561 agctgttccg aaatcagcag aattagcaaa acattatcaa gcatctgtgg tgcttttatc 7621 cccagcttgt gcgagtttcg accaatacgc caattttgaa caaaggggcg accattttcg 7681 gcagttgtgt ttggagttag taaagattat caattagaaa ttaagttata gtttttagtg 
7741 caattaagct tataaagatg acttgcannn nnnnnnngac agagaagcga ttgatctcat 7801 gtccggatga atacttataa aaaacgaatc acctcacccc actcgctagc cctcctttga 7861 attttgaact ttgaattccc ctctggggca gggggtgata tcatgtccgg taaattactt 7921 atgatatgtc attgcgaacg cagcgatagc ggagtgaagc aatcgcaagg tgttggattg 7981 cttcgcttcg ctcgcaatga caactattca accggacatg atatgaggtc tttttattgt 8041 aagtcatgag acgaacttga tatgatctga gatcttgcac catcctcaac aagactagaa 8101 aaaccgggta actgagccga gaaatcaagg gtgaggtgat tgtatgagtt gcaagaaacc 8161 acaggtttga agctggtgca agatgtcagt taaggggaaa tgtgctgttt gggttgggta 8221 aatcctgaac ctcgtccgga gcatcgcaaa agttttcggt ctattgacag gacaattgca 8281 taacttaact atctcaagcc tatcaagaat atagagatac tctgttaaat caacttttat 8341 caatgcttaa gtcctaatac tatgataaca atagccggaa gcaggaactg tacctaccct 8401 gtgaagtcca tgaaatggat gctaacaatc gctctgattt tgagcagcct cacctattat 8461 ccagcaaagg taacagccca aactcaacaa ccagcaacga ccaagcagct tgtacaacca 8521 accaaaggtg ggatttcacc tcaacaacgt acctctcaac cagtttttgt tttcccaaaa 8581 acacccgtac ggttaagtcc tgtatctgga cgtagacgag gaatgggtag ccggggtaat 8641 tgtcctgcgg ttcaaactgc actcactgcc ttaataccgt tgcgagagga acaaaaggtt 8701 agcaaacaga cagacaaatc aatttcgggg attgtggggg gactaaccac ctctgaacga 8761 ccaacgtttt ggttttatgt tccttacact caggatttag cgaactcaag tggtgagttt 8821 atcttacaag atagtgcggg aaacgatatt tctaaaaatg cgatcgcact acctccaaag 8881 cctggtgtta taggtgtttc tcttccatcc aatacatcct tgcaagtagg gaaaacatat 8941 cgctggtatt taaaagtccg ttgcaatcaa caacaaacag ccagtgttcc tatttatgta 9001 gaaggagaca ttcaaagagt gaatttggat tctcgtgtga tgcagcaact ggaagcagca 9061 gttgatcccg cgcaaaaggt tgcaatctat gcggctaatg gtatttggtt tgattctctg 9121 actatgttgg cacaactgcg ccagaagaat cctaatgatg catctgttgc tgaagattgg 9181 caaagtttat tgagatctgt caacttggat aatgttgcta cggctcctct ggtgaaataa 9241 cagtaaacag ttatcagtta aaacttgatt caaatgataa gttggtcaac ataaaaaacg 9301 taaaatctag aaacccggta acttctgcga aaccgggttt ctgacaccac gcattttatg 9361 tttcatttgg tggttttact taactgtttc tctacaacca attaccaact aatacataag 9421 
gtgcccagtt aagaggacga gtgtattcag taccagacag cagctttaat tgagcttcgc 9481 gtaaagcttc tgctgtagtt ttaccagttg ttaactgacg gtagaattca ctaataaaca 9541 aagcagtgga gtcatcccca atctgccata aggacgctag agtactccgc gccccagctc 9601 gaatagcaac tcccgctaat cccaatgcag cacggttatc tcctgctgct gtttcgcaag 9661 cactcagaac cagtaattca attggctgtg ttcgtttttg ctctctactt ttgagcaagc 9721 tatctaattg acttacattg atacgtccat cggctgctaa gatgaaagtg tcttttgctt 9781 tagagctaaa ttgtccgtga gttgccaggt gaacaactct aaaaggttgg gcgttaatcg 9841 tttttcctaa agttgtactt ttgaaatttt catctaataa tgtcgttgtc gatatgcccg 9901 attgctgaat caatttcaat tcaaccttaa cgttaggaag cgatgcaaac ttttcattct 9961 ttggtggttg cgagagtcct ccagcaagag cgtttaatct ttcctgcgct agaggtttgg 10021 gggtaaacag ttgcaatcct gggctaatgg cgatcgcgta cttctgaact aaatacaatg 10081 cgccatcata caatgcagac actgggatgt ttcgtagtga tccgtctggg ataaagacta 10141 aagtattaac tttacttttt ttgagttcac cttcaactgg tttaattaac caatcataaa 10201 gctgctggga aaccacttga aatttcttag tagcatcagg ttcgactatc gtctctcgca 10261 tttttgtaat gacttgctct acttcctgac gcttcaccac agaagtttta tgtatcaaag 10321 gttggttggg aagtttgagg ataacttcta attggttatc tagaataatc gggtaaaaaa 10381 tggcagtatt aggattatca cggtctacca ctttgtctaa taccacaaat tggttactca 10441 agcaagcttg ccgaaaaaaa ttgtccagtt ctgctagttg caaggcttca attcgcctgc 10501 gaactttatc caaatcaggc tttccttgtc ctttctgttg taaaagtagc tctactgact 10561 gccgataaat aggttctaca ctatcacgaa aattaaactg aacctctcgg ttaactacta 10621 ctaagtcact gcggagagac tgaaggcttg ccactgcgct atcataagca gctatagctc 10681 cctcaatatt tccttgtgct ttgagcaagc gtcccaactg ccactcccaa cgataagcta 10741 tatctgaagc caagattttt tgagcgataa acaatgcttg ctgcgtcaaa ttttgcgctt 10801 cttgccattg actgttttgc tcgtagactt ctgctaaact gccaagtgca taggattgag 10861 cacgctccga gcctatggtt ttagcttgtt gaacgctggt tgctaaaatc tcggctatat 10921 ctcttttgtt accaaatttg gttatagtgc gagcaaagtt gacacgtgtg taaataccag 10981 cttgatttgt aggtagagtt gctaattgag attggattgt tgggattagt agagacgttg 11041 tatctgaaag ttctagatct gctgtattct tattctcgac aagcaagctc 
agttgattga 11101 tttgggcttg aactttggtg agtgggtttg gcgcaattaa agctgctttt tcatagtgag 11161 cgatcgcatt ttttctatta ccattagctc tagcagtatt accaagagaa aattctatca 11221 cactaatttc ttggggcagt tgcagacgtt gagcaatttc taagctttgt tgtaagtttt 11281 tctgggattg ttctaagtca cctgattgtt gctgggcatt accaagggtg cgcaaagctg 11341 ttactttttc aagtgagtca ggttgggttt gtaactgtcg ggagactgta ttcaatatgt 11401 ctaagctacg gcggtaaaaa ccagaaactc gtaaagcttg tgcttgattg atacggctgc 11461 gcgtcactcc agtattatat tgtttaccta agcgcttcca aatttcttct gcttgctgtg 11521 atgtttttag agatacatct gcctgtccct gtgccagttg cagtccagct tgaatttcta 11581 gagtttgtgc tagaacttct agtaattctg acttgggatt gttgacattt aacttttgat 11641 tgttctcatc ccagccaagt agatttaagc tggtattaat agcttttgac gcttctttcc 11701 atgagccgag ttgctctaag gctagagaga gatttgtgag tgctgcggct tgtgggaggc 11761 gatcgccttc tcgttgatag atgcggacag cttgttgtaa aacagttacc gcttcagcaa 11821 accgtccaga ttgataaaac ctttctcctt gagagacaag ctgctgtggt gtagtggttg 11881 ggggtttggt ggtttgagtg tgaaagttgt ctactgaatt tgtctggggt tgtttggcga 11941 atgtgctggg gataatagtt agggaaaaag caagggtaaa taggactaca tgagttagcg 12001 caaatctaat gagtcgcttg gtcagtgcaa atggatgcat atttcatacc ttagcgagat 12061 gcaggacaaa aagcaggtat ttgctaaaag ctgtgggggt tgattgattt ttgggtatct 12121 gagatcttgc accattatga gtcaaccgct aatacttaga ttaaatagtg cttaaatata 12181 actcaaagac aagttccagt tgtagtccat tttaatggac ttcgcctatt agcctgggac 12241 ttccagtcct aggcggacga gaacggagtg aaataagata taatgatttt tgacattttt 12301 tgaccaggtg caagatataa ggtatctgaa aaaataacct cagtcaaaaa cgggactggg 12361 gcatatagtt agtcaatgtc atcttgaatc tcgaataagc actcgtcatt caagctacgc 12421 tgaatccagt atattgtcgt ttatagggag cgatgagacc cggaacaatg tcttctgttg 12481 gaacccccat aaggacgagt tcctcagcga gatctacttc agtcaagttc tgttgaatcc 12541 aaatcttgcc atctttaatg tcaaaatgga tgatgcaccg ataaattcgg tcaagctgac 12601 gccagcctac actcatccat tgataatgat ctcgctctgt atcgaatatc agttgaacct 12661 ccacatcact attagatgtg ttgtcagcat attttgtaag tagactttga acaaattgtc 12721 gatactgttc tatttttgcc attgcacgaa 
gatttggaaa caatagtaag gttcaactca 12781 tgaaaaggag aaacaaaaat gactcaagcc ttacacaaac tagtaacatt tgaagaattt 12841 gcaaaatgga aaccggaaga tggacgctat gaattgcatg atggagtcat cgttaaaatg 12901 ccgcagccat taggaggaca tgaggaagtt acaggttttt tggtcagaaa actgagtgta 12961 gaatttgacc gattaaatct gccgtacttc atacccaaaa cagcattagt taaaccacca 13021 gatcaggaat cagcttactc accagatgtg ctggtagtaa accaaagtaa tttatcatca 13081 gaacccctat ggaaaaaaga atcgacttta atttatggtg cttcaattcc gctcgttgtg 13141 gaagttgtta gtactaactg gcgtgacgat tactttaaaa aacgtgggga atatgaagga 13201 attggcattc ctgaatattg gattgtagac tatttagcat tggggggtaa gcagtttatt 13261 ggcaacccca aacaaccaac aatttctatt taccacttaa tagacgatga atatcaagtt 13321 actcaattta ggggagatga ccgcatccag tcgccaacat ttcctgagtt taacttgact 13381 gcacaacaga tttttaacta attcgccttt cggtcaagcg aactgatatt agcgagatgg 13441 aggacaagaa gcaggagttt gccaagaact gtgaggagtg acaccaggtg cttgcgccac 13501 gagttctata tttccattct tatcgactat ccaaccagta gcttccacaa tttgagttgt 13561 cgcagcagca actttagtac tatttgagac tgctgttggt tgtggggaaa ttgttgtttt 13621 agctgagttt gctggttttg gtcttaatct cacccaggct gatagagtgc ttgtatcttg 13681 taatggttca gt // LOCUS NODE_2490_length_13648_cov_4.99485013648 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13648) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13648) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13648 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(190..1872) /locus_tag="DP116_20585" CDS complement(190..1872) /locus_tag="DP116_20585" /EC_number="3.5.4.25" /EC_number="4.1.99.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876340.1" /note="bifunctional enzyme DHBP synthase/GTP cyclohydrolase II; functions in riboflavin synthesis; converts GTP to 
2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)pyrimidine; converts ribulose 5-phosphate to 3,4-dihydroxy-2-butanone 4-phosphate; note this protein has an additional C-terminal tail of unknown function as compared to similar bifunctional enzymes; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase/GTP cyclohydrolase II" /protein_id="PRJNA477356:DP116_20585" /translation="MSQLNTTSNQTFEFDSIDAALADLKAGHQIVVVDDENRENEGDL ICAAQFATPDIINFMAVEARGLICLAMTGDRLDELDLPLMVTNITDTNQTAFTVSIDA GPHLGVSTGISAEDRARTIQVAINPATQPSDLRRPGHIFPIRAKEGGVLKRAGHTEAA VDLARLAGLYPAGVICEIQNPNGSMARLSQLIQYAKHHNLKIISIADLISYRLQHDRL IKRETVADLPTQFGRFQIYAYRHTLDNTEHVAVVKGNPAEFGDNPVMVRMHSECLTGD ALGSLRCDCRMQLQAALKMLENAGQGVVVYLRQEGRGIGLVNKLKAYSLQDMGLDTVE ANERLGFPADLRDYGMGAQMLMDLGVHKIRLITNNPRKIAGLKGYKLEVVDRVPLLIE ANDYNSYYLTTKAKKLGHMLLQTFLLTVAINWQDDPEAVTERYERLEKLRHLAKSHDL LLQEEARPLAIALFDKPSLIVHLGFDQANVATDDWYKHKGHPYAQAISQILDELVALP YIQKLEFLISPGVDPLTNLQVQLDRQTFPVGTLPSSICCEQLGTQKIYSFSR" gene 2112..3170 /locus_tag="DP116_20590" CDS 2112..3170 /locus_tag="DP116_20590" /EC_number="1.2.1.38" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862324.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetyl-gamma-glutamyl-phosphate reductase" /protein_id="PRJNA477356:DP116_20590" /translation="MGNSRRVRVGIVGASDYGGVQLVRLLMDHPEVELVYLGEHSSAG KSFADLYPHLAHIVNQPIEAVEPETIAERCEVVFLSLPNGLAYKIAPQLLELGLIVLD LSADYRFKDLTTYTNWYGAQRTDLATAATAVYGLPELYRDRISEAQLIACPGCYPTAS LLALSPLLKQGLIVPETAIIDAKSGTSGGGRQAKVNMLLAEADNSFAAYNVVRHHHTP EIEQICSELAGHEVTIQFTPHLVPMVRGILATVYATLRDPGLVRDDLITIYKAFYRNS LWVKVCEPGVYPQTKWACGSNLCYIGIEVDPRTGRVIVMSAIDNLIKGQAGQAIQCLN IIMGWDETLGLPKLGFYP" gene 3379..3957 /locus_tag="DP116_20595" CDS 3379..3957 /locus_tag="DP116_20595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015145260.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_20595" /translation="MSNLPLYIPANLKVSEEQFVLLAAANRDVQLELTATGELIIMPP TGGNTGKRNIDIEGQLWFWNRQSKLGIAFNSSTAFRLPNGAERSPDAAWVTQARWDAL KPEEQDSFPPMSPDFAIELPYISDNMEPLRKKMQEYIDNGLRLGWLIDTKNKKVEIYR ALQPVEVLNNPTSLSGEDVLPGFVLDLQVVFS" gene complement(4270..5550) /locus_tag="DP116_20600" CDS complement(4270..5550) /locus_tag="DP116_20600" /EC_number="4.2.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454281.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphopyruvate hydratase" /protein_id="PRJNA477356:DP116_20600" /translation="MFDTAIEAIVAREILDSRGRPTIEAEVHLVNGAIGLAQVPSGAS TGTFEAHELRDKDKARYGGKGVLKAVQNVKEALAPKLINLDALNQELLDRTMIALDGS SNKSHLGANAILAVSLAAAKAGAESLGIPLYRYLGGPLANLLPVPLMNVINGGAHAAN NVDFQEFMVVPIGASSFREALRWGAEVFATLSQVLDEKGLLTGVGDEGGFAPNLESNQ VALELLVAAIEKAGYKPGEQVALALDVAASEFYKDGQYVYDGKPHSGGEFVDYLAQLV DQYPIVSIEDGLHEEDWQNWQLLTQKLGSRVQLVGDDLFVTNVTRLQKGIELKAANAI LIKLNQIGSLTETLQTIDLATRNSFRSVISHRSGETEDTTIADLAVATRAGQIKTGSL CRSERVAKYNRLLRIEDELGELAVYAGAVGLGPN" gene 5813..6442 /locus_tag="DP116_20605" CDS 5813..6442 /locus_tag="DP116_20605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876337.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="radical SAM protein" /protein_id="PRJNA477356:DP116_20605" /translation="MTQKKTDPYPALMEIPAGYLNIMGYIDESEVNGPGCRAVVWVQG CLRECPGCFNVESWSFEMNQLISIDSLAEKILSNPRNQGVTFSGGEPFWQAPALAVLA SKIKAGGLNVMSFSGFTLEQLQSQHAPAGAQDLLQQLDILIDGPYIQSLALNSPDSPV SSNNQRVHVFNPTLKDKITWASDQIEVHIFKDGSRLITGYRGGLELSEG" gene 6759..7193 /gene="gloA" /locus_tag="DP116_20610" CDS 6759..7193 /gene="gloA" /locus_tag="DP116_20610" /EC_number="4.4.1.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210883.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lactoylglutathione lyase" /protein_id="PRJNA477356:DP116_20610" /translation="MRLLHTMLRVGNLEESLKFYCDLLGMKLLRQKDYPGGEFTLAFV GYGDESENTVLELTYNWGVEKYDFGNAYGHIAIGVDDIYTTCEEIKKLGGKVVREPGP MKHGSTVIAFVEDPNGYKVELIQTDTQNHAVKQETEPQMINQ" gene 7225..9003 /locus_tag="DP116_20615" CDS 7225..9003 /locus_tag="DP116_20615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="porin" /protein_id="PRJNA477356:DP116_20615" /translation="MEFSLKQSLRQRIRSRNFLAWSLLIWGIPSQIVFFPKPANSLPN PKQQETAVSIFSDENDLYEVRIPEKFEESAGSATAADFSEASVTSFDTESESINKSGN LNINAQVSDQQSPANESGNPDRNTNTSPQQPSVSELSDVQPTDWAYQALRSLMERYGI LSGYPDHTFRGNRPMSRYEFAAGLLATMDKLDSLVANGIRNQSIQEDLITLQRLQREY RLALDDLQKRLNRIDDRVTRLEAKRFSATTKLQGQAIVAFTQGSNANSTIVSRERLNL LTSFNSKDLLFTQLESGNNGGDAIARAHKRKNLNLLGTSGLIASGGGLDYVDVDSDLK LRRLYYSFRPLSDLSVTIGAKMSPRDFIDRNRYANNEAVDFSSSFFLNNPLIVQNQID RNGGAGVAISWNPGGSPLTFRSLYIAADANLANSNASGGLFKDRYQASAEVEYSLSNQ LALRLQYTRAEINNTDINAFGVNAEYALNRNIGVFTRLGFGSYQGFNTAINQNLDINP FSWAVGFGIRNLVIPGTFGGLAVGQPFVTDGLGNTTQTNIEAFYNLELSDNVSITPTF SVVTNANNDSSNNTIWQGALRTVISF" gene 9160..11829 /gene="clpB" /locus_tag="DP116_20620" CDS 9160..11829 /gene="clpB" /locus_tag="DP116_20620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319144.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent chaperone ClpB" /protein_id="PRJNA477356:DP116_20620" /translation="MQPTDPNKFTDKAWEAIVKSQDVVRAYQQQQLDVEHLLIALLEE PTGLATRILGRCEIDASRLQQQVEGFTQRQPKVGKADQLYLSRSLDTLLDRAEEARAR MKDSYISVEHILLAFAEDERIGRRIYKSFNLDTAKLEAAIKTVRGSQKVTDQNPESRY EALQKFGRDLTEQAKSGKLDPVIGRDDEIRRVIQVLSRRSKNNPVLIGEPGVGKTAIA EALAQRIINGDVPESLKNRQLMSLDIGSLIAGAKYRGEFEDRLKSVLKEVIESNGQIV LFIDELHTVVGTGSTQQGAMDAGNLLKPMLARGELRCIGATTLDEYRKYIEKDAALER RFQQVFVDQPSVENTISILRGLKERYEVHHNVKISDSALVAAATLSARYISDRFLPDK AIDLVDEAAAQLKMEITSKPTELENIDRRLMQLEMEKLSLAGEEKATVQTKERLARIE QEIVNLTEKQQELNGQWQGEKQLLEAISALKQEEEKLRLQIEQAERAYDLNKAAQLKY GKLEGVQRDRESKEAQLLEIQSIGATLLREQVTEADIAEIVAKWTGIPVNRLLESERQ KLLQLESHLHQRVVGQQEAVSAVSAAIRRARAGMKDPSRPIGSFLFMGPTGVGKTELA RALAQFLFDSDDALVRLDMSEYMEKHSVSRLVGAPPGYVGYEEGGQLSEAIRRRPYSV VLLDEVEKAHPDVFNILLQVLDDGRITDSQGRTVDFRNTVIVMTSNIGSEHILDISGD DANYEKMRNRVMDALRSHFRPEFLNRVDDIILFHTLNRSEMRQIIRIQLKRVESLLQE QKISLEISTAACDYLVETGYDPVYGARPIKRALQREVENPVATKLLENTFVPGDTIVI 
DKADHGLTFNKKMVVKVPVVQNKTLLIEASREV" gene 11903..12238 /locus_tag="DP116_20625" CDS 11903..12238 /locus_tag="DP116_20625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876375.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_20625" /translation="MDKLAKYREIVRKIISEYTTHKPYHGQIDVGAIIDSERDRYQVL QIGWDGIRRVHGCTVHIDIIGDKVWIQYDGSSYPVAEALMEAGIPREDIVLGFHPENL RQHTGFAIS" gene 12323..>13648 /locus_tag="DP116_20630" CDS 12323..>13648 /locus_tag="DP116_20630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860549.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_20630" /translation="MQYWQPNQTIKNGRFTIQKILGGGGYGVTYSAKDNNTNKIIAIK TLNPMQQSQPDFEQEQEKFVNEALRLRGCNHPHIVQVYEMIREAGLWGMVMEYVEGQD LAVYIDQRGQLPEDEALRYIDQVGQALEYVHKQKFLHRDIKPNNILLRLGTQQAVLID FGLAREFNLGKTGSMTNAKTEGYAPIEQYERRGNFAPCTDVYALAATLYSLLTAEVPI PASFRTYVQLPPPKQFNSKISDRVNEAILKGMALEPQDRVQTVREWLELVLPTNVNPP TPQASIPNPQPQIQNQPSKNIATPPKSDDEIKLITAKMDYTRLRDLLAAGKWKEADQE TARVMLAVAARKNEGWLDVESMDNFPCEDLRTIDQLWVKYSNGRFGFSVQKRIYQSLG GRRGWDREIWEAFGDRVLWTNAGKWLYYRDMTFDLRAPEGHLPAWWTGYW" BASE COUNT 3968 a 2986 c 3100 g 3594 t ORIGIN 1 actctatgtc caccttacca gagtttgtgg ttcaattacc tgaaaattgc tgtaattatg 61 agggttttta aaattacaga aatcttgtag agacgttgca tgcaacgtct ctacaaatca 121 tcaaaattaa tgacatagac caccagagtc tcctaaatgt catattttgt atggtggcga 181 aaggttcatc taccgactaa aactataaat cttctgcgtg cctaattgct cacaacaaat 241 agacgaaggt aatgtcccca cagggaacgt ctgtctatcc aattgcactt gtaagtttgt 301 cagaggatca acaccaggcg aaatcaaaaa ttctagcttt tggatgtaag gcaaagccac 361 gagttcatcc aaaatttggc taatcgcttg cgcgtaagga tgacctttgt gcttatacca 421 atcatctgtt gcaacgttcg cttggtcaaa acctaagtgc acaattaagg atggcttatc 481 aaacagggcg atcgccaaag gacgtgcttc 
ttcctgtaac aataagtcat gacttttcgc 541 taaatgtcgc agtttctcta accgttcgta gcgttccgtc accgcttcag gatcatcttg 601 ccaattgatc gccactgtta acaaaaaagt ctgtaacagc atatgaccca gctttttcgc 661 cttggttgtc aggtaatagg aattgtagtc gtttgcctca atcaacaatg gcacgcgatc 721 cacaacttcc aacttataac cttttaaccc tgcaatttta cggggattat tggtaatcag 781 gcgaatctta tgtactccca aatccatgag catctgtgct cccataccat agtctcgcaa 841 atcggcgggg aatcccaaac gctcattcgc ttctacggta tctagtccca tatcctgcaa 901 cgagtaagct ttcaatttgt taaccagccc gattcctcgt ccttcttgcc gcaggtaaac 961 aacgacacct tgccccgcat tttcaagcat tttcagtgct gcttgcaact gcatccgaca 1021 gtcacagcgt aaagaaccca aagcatctcc agtcaggcat tcggaatgca tccgcaccat 1081 cactgggtta tctccaaatt cagcaggatt acccttgaca actgcaacgt gttctgtatt 1141 atccagagta tggcgataag cgtaaatttg gaaacgacca aactgggtgg gtaagtcagc 1201 aacagtttca cgcttgatca ggcgatcgtg ctggaggcgg taactgatta agtccgcgat 1261 gctgataatt ttaagattat gatgcttagc atactgaatt aactgagata gccgcgccat 1321 cgaaccgttg gggttttgaa tttcacaaat gacgccagca gggtataatc cagctagtcg 1381 agctaaatca acagcagctt cggtatgtcc tgcccgtttg agtacgcctc cttccttggc 1441 acgaatgggg aaaatatgac caggacgacg caaatctgaa ggttgtgttg ctgggttgat 1501 agcaacttgg atcgtgcggg cgcggtcttc agccgagatc ccagtactta cccctaggtg 1561 aggtccagca tcaatactca ccgtgaaagc agtttggttg gtgtcggtga tgttggtcac 1621 catcaatggt aaatctaact catcgaggcg atcgcccgtc attgccaaac aaatcagccc 1681 cctggcctct actgccatga aattaataat gtcaggagta gcaaattggg cagcacaaat 1741 caaatcgcct tcattttctc gattttcatc gtctaccacc acaatttggt gaccagcttt 1801 caagtcagcc aaggcggcat caatcgaatc aaattcaaag gtttggtttg aagtggtgtt 1861 aagctgcgac acagaggaat tccagctatc taacgtaatt tttttttaca aatgcttgtc 1921 ttctgattgt agctcttcta gagactatat aagcatttca tagccaagat tcgataacat 1981 accgataaat acctgcggct gaagttggat tttatgataa ctgagtgcag caagaactaa 2041 acagcaataa gtctttaata gaaaaattct gataaagacg ggttgaccta ttcaaaaagg 2101 atttacaagt tatgggcaat tctagacgcg taagagttgg gattgttggc gcgtcagatt 2161 acggcggagt acagttagtg cgactactga tggatcatcc 
agaagtggaa cttgtttact 2221 taggtgagca tagtagcgcg ggaaaatcgt ttgcagatct ctacccacat ttagctcaca 2281 tagttaacca accgatagag gctgtagaac cagaaaccat tgctgagcgt tgtgaggtgg 2341 tgtttctgtc tctaccaaat ggattggctt acaaaatcgc tccgcaattg ttagaactcg 2401 gattgatagt actggatttg agtgcagact atcgctttaa ggatttgaca acttatacaa 2461 actggtatgg tgcccagaga acagatcttg caacagcagc cacagcagtt tatggattac 2521 cagaattgta tcgcgatcgc attagcgaag cacaactgat tgcctgtccc ggctgctatc 2581 ccaccgctag tcttctggca ctttcaccac tcctcaagca agggctaatc gtgccagaaa 2641 cagctattat tgatgccaaa tcaggcactt ctggcggtgg gcgtcaagca aaagtcaata 2701 tgttgcttgc tgaggcagat aactcctttg ccgcttacaa cgtcgtccgt caccatcaca 2761 ccccagaaat cgagcagatt tgtagtgaat tagcaggaca cgaagtcacc atccaattta 2821 caccacacct cgtgcctatg gtgcgcggca tcttagcaac ggtgtatgcg acactacgtg 2881 accctggttt agtacgagat gatttaatca ctatttataa agccttctac cgtaactctc 2941 tttgggttaa agtctgtgaa ccaggcgttt atccccaaac aaaatgggct tgtggcagta 3001 atctttgtta cataggaata gaagttgacc cgcgcactgg acgtgtgatt gtcatgtcag 3061 caattgataa cctgatcaaa ggacaggcgg gtcaagctat ccagtgtctc aacatcatca 3121 tgggctggga tgagacactg gggttgccga agttggggtt ttatccatga ttggggataa 3181 ggggacaagc tatggcgtac acacatcttg cgcttttcac ctcaccctcg cttttagctt 3241 cgcaaaaatc tttccctctc cgaaatcgcg gagagggatg cccgacaggg cagggtgagg 3301 ttccgaaccg agtgcatctc ccaccagaag taatccaaga caatatatat tatgggcatt 3361 tttaatgttt aattgtctat gagcaacttg cctctgtaca tccccgcaaa cctgaaagtc 3421 tccgaagagc aattcgtact cctcgctgct gctaaccgag acgtacagct agaactaact 3481 gccacaggag aattaattat catgccccct acgggaggaa atacaggtaa gcgcaatata 3541 gatattgaag gacaactttg gttttggaac cgccaatcca aacttggtat tgcttttaac 3601 tcctcaaccg ctttccgact tcccaatggt gcagaacgtt ctcccgatgc agcttgggtg 3661 actcaagcca gatgggatgc attaaaacca gaagaacaag actcctttcc acccatgagt 3721 ccagattttg cgattgaatt accttacata agtgacaata tggaaccgct acgcaaaaaa 3781 atgcaagaat acattgataa tggattgcgt ctgggttggt taattgatac aaaaaataaa 3841 aaagtagaaa tttatcgcgc cttacagcca gtggaagtgt tgaataatcc 
aacaagctta 3901 tcaggagaag atgttttacc tgggtttgtt ttagatttac aagttgtgtt tagttagcat 3961 aagcaaaatt gggatgctcc ctggcggtta gaaaccgcgc ctacacaaac taagtctgcc 4021 tccgcagact agaacaccaa ttcagtattc accaattgtg ataaaaaacc cggtctcttc 4081 tcattgagat aacgaaatat cagtctagcg ccagcaacag aaaccgggtt tttggcatta 4141 atgataagtc tttgtgatat actacccttt tttggatacc acacaaaaaa tatttccaaa 4201 aaagattcac aaaatttagc aaaaattaag gcaatcagtg ggggcttgaa ccaccactga 4261 ttaaaaaggt taattcggtc ctaatcccac agcaccagca taaacagcga gttcgcccaa 4321 ttcatcctca atgcgtagca aacggttgta ttttgctact cgttcgctgc gacacagaga 4381 acccgttttg atttgacctg cgcgggtggc gacggctaag tcggcaattg ttgtgtcttc 4441 tgtttcgcca gaacgatggc taatgactga gcgaaaactg ttgcgagtcg ccaagtcaat 4501 tgtttgcaaa gtttctgtca acgaaccaat ttgattgagt ttaatcaaaa tcgcgttagc 4561 agctttgagt tcgattcctt tttgcaagcg cgtgacgttg gttacaaata agtcatcacc 4621 taccaactgc acgcgagaac ctagcttttg ggtgagtaat tgccaatttt gccaatcttc 4681 ttcgtgcaag ccatcttcaa ttgagacaat ggggtattgg tcaaccagtt gcgcgagata 4741 atcaacaaac tcacccccag agtgaggttt accgtcgtag acatactgcc catctttgta 4801 aaactcactc gccgcaacat ccaacgccaa agcgacttgt tctcctggtt tatagccagc 4861 tttctcaatc gctgcaacca acaattccag cgccacctga ttcgattcca aattaggagc 4921 aaaaccgcct tcatccccaa caccagtcag taaacctttt tcatccaaca cctgacttag 4981 ggtagcaaac acctccgcgc cccagcgcaa cgcttctcgg aaagaagacg ccccaatcgg 5041 gacaaccata aactcttgaa aatctacgtt attggctgca tgagcaccgc cgttaatcac 5101 gttcatcagc ggtacgggta ataaattcgc cagtggacca cctaaatagc gataaagggg 5161 aattcccaaa gactcagccc cggctttggc tgctgctaaa gaaaccgcta gaatcgcatt 5221 agcaccaaga tgggatttat tggaggaacc atctaaggca atcatcgtgc gatcgagtag 5281 ttcctggttg agggcatcca agtttatcaa ctttggagca agtgcttctt taacattttg 5341 caccgccttg agtacgccct tacccccata acgtgcttta tccttatccc tcagttcgtg 5401 cgcttcaaaa gtgccagtag acgcaccact gggaacctgc gccaatccta tcgcaccatt 5461 gaccaaatgc acttctgctt caattgttgg tctaccccga gagtcaagga tttcacgggc 5521 aacaatagct tcgatcgcag tgtcaaacat ctgttcttca tcctttgttt agcgtgcgtt 
5581 ctttctgccg tttacaaaaa gttaaatggc tcatccgacc atctagcata tgccttccgt 5641 ggagtttatg cgctgaggtt aacgaagatt tccactgggt ttgggggatc aggagaataa 5701 ggagaacata ttctagcaca ctctgttact ccctccctca ctcccttatc ctcctcaatt 5761 tttttgcatc taccctgcaa atatgtatat cccttaacaa gagaaaaaga caatgaccca 5821 aaagaaaact gacccttacc cagctttgat ggaaattcct gccggatatc tcaatattat 5881 gggctatatc gatgaatcag aggtgaatgg tcctgggtgt cgtgctgttg tttgggtgca 5941 ggggtgtttg cgggagtgcc caggttgttt taatgttgag tcctggtcat ttgaaatgaa 6001 tcaacttata tcaattgaca gccttgcaga gaaaattttg agcaatcctc gcaaccaagg 6061 agtgacattt tctggaggag aacctttttg gcaagcccct gcactggctg ttcttgcttc 6121 taagatcaaa gcgggtggac tgaatgtgat gtctttttct gggtttactc tagaacagtt 6181 acaatctcaa catgccccag ctggcgctca agatttgttg cagcagttgg acattttaat 6241 tgatggtcca tatatccagt cactagcact caattcacct gattcaccag tttcttctaa 6301 caatcaacgg gttcacgtct ttaatcccac tttgaaagat aaaattacct gggcaagcga 6361 ccaaatagag gttcatattt tcaaagatgg tagccgtttg attactggtt accgaggagg 6421 gttggagtta tctgagggat aagggagcag ggggaggggg agtgagggag agtaaaatcc 6481 tcctcatctg agatcttgta caattctcac accttgcatt ctcgtccgcc tgggatttca 6541 atcccactcg aatagcaaaa gtcgtctaaa gacgactgca caagcctttc agtctacttt 6601 agtagacttg ggttgtgagc ctcggaatta attccgaggc ggtcaaaatg ggctaattta 6661 aagtgatgca agatctcagt catcttctcc agtatgagga cagtagccag attacaatat 6721 cagcggaacc acaaaacatc aaggcgaagg ggataatcat gcgcttacta cacacaatgc 6781 tgcgtgttgg caatcttgaa gaatcgctaa agttttactg tgatctcttg ggaatgaagc 6841 tgttacggca aaaagactat cctgggggag agtttactct ggcttttgtt ggctacggtg 6901 atgaaagcga gaacaccgtg ctagaactta cttataactg gggtgtggaa aaatacgatt 6961 tcggaaatgc ttacggacat attgccattg gagttgatga tatttacaca acttgtgagg 7021 aaatcaaaaa gcttggtggt aaagttgtac gcgaaccagg tccaatgaaa cacggttcta 7081 cagttattgc ttttgtggaa gatccaaatg ggtataaagt agaacttatt caaacggata 7141 ctcagaatca tgccgtgaaa caagaaacag aaccgcaaat gataaatcag taaatataac 7201 cacaaataca aaagtatagg tgtgatggag ttcagcctca agcaatccct acggcaaaga 7261 
attcgttcta gaaatttttt ggcttggagt ttactcattt ggggaattcc tagccaaatc 7321 gtcttttttc caaaaccagc gaattctttg ccgaatccaa agcagcaaga aactgctgtt 7381 tcgatttttt ctgacgaaaa tgatctgtat gaggttagga taccagaaaa atttgaagag 7441 tctgctggct cagcaacagc cgcagatttt tcagaagcat ctgtaacatc tttcgataca 7501 gagtccgaat ctataaacaa atctggtaat ctgaacataa atgctcaggt gtcagatcag 7561 cagtctcccg ccaacgagtc tggtaatcct gacagaaata ccaatacgtc acctcagcag 7621 ccttcggttt cggaattgtc tgatgttcaa ccgactgatt gggcatatca ggcgttgcga 7681 tcgctcatgg aacgctatgg tatcctctct ggctatcctg atcacacgtt tcgtggaaac 7741 cgtcctatgt ctcgctatga gtttgcggcg ggtttgctgg ctacgatgga taaactcgat 7801 agcctcgttg ctaatggtat tcggaatcag tcgattcagg aagacctgat aactttacaa 7861 cggttacaaa gagaatatcg tttagctttg gatgacttgc aaaagagact caacagaatt 7921 gatgatcgcg tgacacgact ggaagcaaag cgattttccg cgaccaccaa acttcaaggt 7981 caagctattg ttgcttttac acaaggtagt aacgccaatt ctaccattgt atcccgcgag 8041 cgtttaaatt tattaactag ctttaacagc aaagatttac ttttcactca gctagaatct 8101 ggtaacaatg gcggtgatgc tattgctagg gcacataaga ggaagaatct gaacctcttg 8161 ggaacgagtg gtttaatcgc cagtggtggc ggactagatt atgtcgatgt tgactccgat 8221 cttaagctca gacgcttgta ttactccttt cgtcccctat cagatttatc tgtgacgatt 8281 ggagcaaaaa tgtctccacg ggatttcatt gaccgtaata gatatgctaa caacgaagcg 8341 gttgatttta gttccagttt ttttctgaat aaccctttga ttgttcaaaa ccaaattgac 8401 cgaaatgggg gtgcaggcgt ggcgatatcc tggaatccag gtggtagtcc attgactttt 8461 cgttctctct acatcgctgc tgatgcaaat ctcgctaact caaatgccag tggtggttta 8521 tttaaggaca gatatcaagc cagtgcagaa gtggaatact cactcagcaa tcaactggca 8581 ttaagactac aatatactag ggcggaaatc aacaacaccg acattaacgc cttcggggtt 8641 aatgccgaat atgccttgaa ccggaatata ggcgttttta ctcgcctggg atttgggagt 8701 taccaagggt tcaacaccgc gatcaaccaa aatttggata tcaatccttt tagttgggcg 8761 gtgggctttg gtattcgtaa cctcgtgatt cccggtacat ttgggggtct cgccgtaggt 8821 caaccttttg tcaccgatgg tttgggaaac accacccaaa caaacattga ggcattctat 8881 aatttggagc tgagcgacaa tgttagtatc acacccacat tttcagtagt tacaaacgca 8941 aataacgata 
gctcgaataa taccatctgg caaggtgcct tgagaacggt gatttccttt 9001 taaagtcagt tctgagtcat gatcaaaggg tttattgcaa agtatcctca aattgcaacc 9061 gcagatacaa gcggatacac cccaaaaatg gttggatatt aatttgacat ccttagttta 9121 aaattcaaaa tctaaaatct aaaatctaaa attgtaaaga tgcagcctac agatccaaat 9181 aaatttactg ataaagcctg ggaagcaatt gttaaatcac aggatgttgt ccgagcatat 9241 caacaacaac aacttgatgt tgagcattta cttattgcgt tattagaaga accgactggg 9301 ctagcaacac gcattctcgg tcgttgcgag attgacgcat cgcgcttaca acagcaagta 9361 gaaggattta ctcaacgtca gccgaaagta ggtaaagctg atcagctcta tcttagtcgt 9421 agcttagata cgttattaga ccgagctgaa gaagcaagag ctaggatgaa agattcctac 9481 atctctgttg aacacatcct tttagctttt gcagaagatg agcgcatcgg acgacgaata 9541 tataaaagct ttaacttaga caccgctaag ctagaggctg cgatcaaaac ggttcgcggt 9601 agccaaaagg tgactgacca aaacccagag tcccgttacg aagctctgca aaagtttggc 9661 agagacttga cagaacaagc aaaaagtggt aaactagatc cggtgattgg gcgggatgat 9721 gaaatccgtc gagttataca ggtattgtcc cgtcgcagta agaataaccc tgtactgatt 9781 ggtgaacctg gggtagggaa gacggcgatc gcagaagcat tagcacagcg tattatcaac 9841 ggtgatgttc cagaatcttt gaaaaaccgc cagcttatgt ctttggacat tggtagtttg 9901 attgctggag cgaagtatcg cggtgaattt gaagaccgat tgaaatccgt cctcaaagaa 9961 gtgatagaat ccaatggtca aattgtgctg tttattgacg agctgcacac cgttgtcggc 10021 actggttcca cccaacaagg ggcaatggat gcaggaaatt tactcaaacc catgctggcg 10081 cggggagaac tacgctgcat tggcgcaaca actttggatg agtaccgtaa atatatagaa 10141 aaagatgcgg ctttggaacg gcgttttcag caagtctttg ttgatcagcc tagtgtggaa 10201 aataccattt ctattctgcg gggactcaaa gaacgctacg aagtccatca caatgttaaa 10261 atttctgatt cagcgctggt agcagcagca acactgtctg cgcgttacat ttctgatcgc 10321 ttcttaccag ataaggcgat tgacttggtg gatgaagcag cagcacagtt aaaaatggag 10381 attacctcca aaccgacgga attggaaaat attgaccgac ggttgatgca gctagaaatg 10441 gaaaagctgt cgttggctgg ggaagaaaaa gctacagttc agacgaaaga gcgtttggcg 10501 cgaattgaac aagaaattgt caatttgacg gaaaaacagc aagaattgaa tggacagtgg 10561 caaggtgaaa aacagctact ggaagcaatt agtgcgttaa agcaagaaga agaaaaactg 10621 cggttgcaaa 
ttgaacaagc agaacgcgct tatgatttga ataaagcagc acagttgaag 10681 tatgggaagt tggagggagt gcagcgcgat cgcgaatcca aagaagccca gttgctagaa 10741 attcaaagta taggagcaac cctgttgcgc gaacaagtca cagaagccga tattgctgaa 10801 attgtggcga aatggacggg aattcctgta aaccgcctgt tagaatctga acggcaaaag 10861 ttactgcaac ttgagtcaca tctacaccag cgagttgtgg gacaacaaga agccgtttct 10921 gctgtctcag cagcaattcg tcgtgcgcgt gcgggaatga aagaccccag tcgccccatt 10981 ggttcattct tgttcatggg accaactgga gttggtaaaa cggaactcgc ccgtgcttta 11041 gcacagtttc tctttgattc cgatgacgct ttggtgcgtc tggatatgtc cgagtatatg 11101 gagaaacact ccgtttctcg gttggttggt gcgcctccag gttacgtggg ttacgaagaa 11161 ggcggtcaac tttcggaggc gattcgccgt cgtccttact cggtggtgct attggatgaa 11221 gtggaaaaag ctcatcccga tgtgtttaat attttgttgc aggtattaga tgacgggaga 11281 attactgact cgcagggacg aacggttgat ttccgcaaca ctgtcattgt catgacgagt 11341 aacatcggta gcgaacacat actggacata tctggtgatg acgccaatta tgaaaaaatg 11401 cggaaccgag ttatggatgc tttgcgatcg cacttccgtc cagaattcct caaccgcgtt 11461 gatgacatta ttctcttcca cactctcaac cgtagcgaaa tgcgacaaat cattcgcatc 11521 caactcaagc gggtggaaag tcttctacaa gaacagaaga tttccttgga gatatcaaca 11581 gccgcttgtg attaccttgt ggaaacagga tatgacccag tatacggcgc acgtcctata 11641 aaacgggcac ttcagcgaga agtagaaaac cccgtcgcga ctaagctatt ggagaatact 11701 tttgtccctg gcgacacaat tgtgattgac aaagcggatc atgggttgac ttttaataag 11761 aaaatggtgg tgaaggtgcc agtagtgcag aataagacct tgttaattga agcatcgcgt 11821 gaggtgtgag atttagtagc aggatgtgtg aggcggattg cttggttcac atcctgcata 11881 cccgtgatat taggattgtg caatggataa actagcaaag tatcgtgaaa ttgtacggaa 11941 aattatatca gaatatacca cccacaagcc ttatcacggt caaattgatg ttggggcgat 12001 tatagattcc gaacgcgatc gctaccaagt cttacagata ggttgggatg gaatccgtcg 12061 cgtacatggt tgtacggtac atatcgacat aatcggtgat aaagtttgga ttcagtatga 12121 cggtagttct tacccagtcg cagaagcact catggaagct ggtattcctc gtgaagacat 12181 tgttttgggt tttcatccag agaatttacg tcaacataca ggcttcgcta tatcttgaat 12241 cacaaagctg ttgtcagcaa acatttttcc acgccagcat attaagctct aagtatagcc 
12301 tctcagctac aggcagcgaa caatgcaata ctggcaaccc aatcaaacta taaaaaacgg 12361 tagatttacg attcaaaaaa ttctcggtgg tggtggctat ggtgtcacct acagtgcaaa 12421 agataacaac acaaataaaa tcatagccat caaaaccctt aaccccatgc agcaaagtca 12481 gcctgacttt gaacaagaac aagaaaaatt tgtcaacgaa gctttgcggt tacgaggctg 12541 caatcatccc cacatcgtcc aagtttacga aatgatacgg gaagctggac tgtggggcat 12601 ggtgatggaa tatgtcgaag ggcaagattt agcagtttat attgatcagc gcggacaatt 12661 gccagaggat gaagctctac gctatattga ccaagttggg caagctttgg aatatgtcca 12721 caaacagaaa tttttacatc gggatatcaa accaaataat attctgttac gactcggaac 12781 gcaacaagca gtattaattg actttgggtt agcgcgtgag tttaaccttg ggaaaacagg 12841 aagcatgacg aatgcaaaga cggaaggtta tgcaccaatt gaacagtacg aacgacgcgg 12901 taattttgct ccttgcactg atgtttatgc tttagcggcg acgctgtatt ctttgctcac 12961 cgctgaggtt ccgattcctg ctagttttcg cacgtacgtg cagttaccac caccaaaaca 13021 atttaactcg aagattagcg acagagtaaa tgaagcaatt ctcaaaggga tggcgttgga 13081 accacaagat agagtgcaga cggtgcggga gtggttagaa ttagtgttgc caacaaacgt 13141 aaatcctcca actcctcaag catcaatccc caatccacaa cctcaaatcc aaaatcaacc 13201 atcaaaaaat attgcaaccc caccaaaatc agatgatgag attaagttaa tcacagcgaa 13261 gatggactac acccgactgc gggatttact cgcagcagga aagtggaaag aagctgatca 13321 agaaacggcg cgagttatgc tagcggtagc tgctagaaaa aacgaaggtt ggcttgatgt 13381 ggaaagcatg gataattttc cctgtgaaga ccttcgcaca atagaccaac tttgggtaaa 13441 atacagcaac gggcgctttg gtttctctgt gcaaaaacgc atttatcaaa gtctgggtgg 13501 aaggcggggg tgggacagag aaatttggga agctttcggc gacagagtcc tttggacgaa 13561 cgcgggaaag tggttgtact acagggatat gacttttgac ctcagagcac ccgaaggaca 13621 cctccctgcg tggtggacgg ggtattgg // LOCUS NODE_2494_length_13591_cov_4.66075713591 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 13591) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13591) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13591 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(282..485) /locus_tag="DP116_20635" CDS complement(282..485) /locus_tag="DP116_20635" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20635" /translation="MVRQFWILDLNLWIDPTDGFKGLGILDLNLWIDSTDGFFGLYHQ GIIGQVYDFLVIQSSYLWKLWKT" gene 484..1650 /locus_tag="DP116_20640" CDS 484..1650 /locus_tag="DP116_20640" /EC_number="2.7.7.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126344.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit beta" /protein_id="PRJNA477356:DP116_20640" /translation="MKLVCTQSDLSTNLSLTSRAVPSRPTHPVLANVLLQADAETNQV SLTAFDLSLGIRTSFSAEVLHSGEIAIPAKLLNDITARLPEGEITLENESASIDDPLA GEGSIVTLIPKSGRYQVRAMGAQEFPELPVIENTQALHIPAGALIEGLRGSLFATSAD ETKQVLTGVHLTVQQDTLEFAATDGHRLAVVQTTNESPDTNTETRLEVTVPARALREL ERMLAHTNSQEEPVALYFDQGQVIFEWQNQRLTSRTLEGQYPAYRLLIPRQFERELVL DRRQFLSALERIAVFADQKNNVVKVSMDSEAQEITISCEAQDVGNGRESMSAQILGDD IDIAFNIKYLMEGLKALPSTEIHVQLNGNLTPVIFTPVGGFKMTYLAMPVQLRN" gene complement(1838..3016) /locus_tag="DP116_20645" CDS complement(1838..3016) /locus_tag="DP116_20645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316488.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="arsenic-transporting ATPase" /protein_id="PRJNA477356:DP116_20645" /translation="MRVILMTGKGGVGKTSVAAATGLRCAELGYRTLVLSTDPAHSLA DSFDLELGHAPKQIRPNLWGAELDALLELEGNWGAVKRYITQVLQARGLDGVQAEELA ILPGMDEIFGLVRMKRHYDEGEFDVLIIDSAPTGTALRLLSLPEVSGWYMRRFYKPFQ NISVALRPLVEPFFKPIAGFSLPDKEVMDAPYEFYQQIEALEKVLTDNTQTSVRLVTN PEKMVIKESLRAHAYLSLYNVATDLVVANRIIPEAVQDPFFQRWKENQRQYRQEIHEN FHPLPIKEVPLFSEEMCGLAALERLKETLYKDEDPTQVYYKETTIRVVQENNQYSLEL YLPGIPKNQVQLSKSGDELNITIGNHRRNLVLPQALAALQPAGAKMEEDYLKIRFAAA " gene complement(3060..3449) /locus_tag="DP116_20650" CDS complement(3060..3449) /locus_tag="DP116_20650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316487.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20650" /translation="MDIVEILKQDYQRFPDNQTYSIYAQDVYFQDQVFKFRGVEKYKL MIKFIKTCFLNPKMDLHDIRTEGDTIKTEWTLSWNTPLPWKPRISIPGWSELRLNTQE LIVSHIDYWHCSRLDVLKQHLFPLKSS" gene 3881..4369 /locus_tag="DP116_20655" CDS 3881..4369 /locus_tag="DP116_20655" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20655" /translation="METTLPPSFRNFLAASNGWQKLDNMIDRLLSTQEIDWLASRNQE LIDGWMTGIQIGSGYTPVPDEEYFVYGDEQDSTLLRNEYLQTALEIGGDSNQGLLLLN PQIVSENNEWEAWFFANWLPGAHRYRSFYELMLDLHKRLLELPKLREIPNSDENTYKK LL" gene complement(4666..5244) /locus_tag="DP116_20660" CDS complement(4666..5244) /locus_tag="DP116_20660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859463.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20660" /translation="MIPSQIADTLKELFSAASVNAIAPGSWQVETPDFRLLVLLSDDE SWLRILLPIMPVSEAQPFLEQFLEANFEETQEVRYALQQGALWGVFQHNCNSLTPEDF EDAIAQLISLYRTGLDNAFNKLIEKRIRQIVIAAKQQNQSLEATLQNLERFYAEGLMG ELDQTAQAREEVLAAWRYQLERLWNEVDSNSL" gene complement(5529..6485) /locus_tag="DP116_20665" CDS complement(5529..6485) /locus_tag="DP116_20665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653140.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20665" /translation="MTKFCLFSKNTRILFFLSTIVLCFLAGCKYINFNASSEASQTCL GGNSQFSITFFKTNNQGERYAKGINHVIIFNPKSEALDFKVNMGLSHQLYAKDNQGKF RQEYIPKLFHELGSNENAKLNGQLPIAAINADYIDTNNKPQGLNISRGVEYSGDFKNK RSSFGISADQPKQRQATLQVGRRKDNILNYNLVGGNGRFYRNGKFKDICQDLGELACQ QARNRSLVAITDKGYVILLVNDLKANSDIQVSQVNQELLPDMFDDVLEGIARNNCLGK IQEGMLFDGGMSPGLYYNNKIYVENPGPIGSVFLIYKKPLKN" gene 7049..7936 /locus_tag="DP116_20670" /pseudo CDS 7049..7936 /locus_tag="DP116_20670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017716414.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="protein kinase" gene 9234..10718 /locus_tag="DP116_20675" CDS 9234..10718 /locus_tag="DP116_20675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012163919.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ammonium transporter" /protein_id="PRJNA477356:DP116_20675" /translation="MKLESKPKLKELKCKKSWLSLNWKFCLPLTAITVLILSSAAVAQ TSTSPVKPSLSLQLAVDTVWVLFTGCLVFFMNAGFAMLETGLCRHKNAVNILAQNFIV FAVATVAFWVIGCALMFGDNTNPLFGTKGWFFDGTDQQMFKSLKSSVPQSALFFFQLV FAGTAATIFTGAVAERIKFIAFFIFSFLLIGISYPITGHWIWGGGWLSKLGFYDFAGS TVVHSVGGWAALVGAWLLGPRLYKLESRGAPIYRYLKNGSNIAMPGHNQSMATLGGFI LWLGWFGFNAGSTLKADPGAIAHILLTTNMAAATGGIAATLISWRRFGKPELTMIING VMAGLVSITASSAFVSIRSAFWIGLIAGILVFFSVLFIDTKLKIDDPVGAISVHLVNG IWGTLAVGLFSVGIEDKLRNNAVQFAPGPKPGLFCGGGLEQFLIQLLGIISVGIFTFV FSALAWSAIKATVGLRVSPQAELDGLDISEHDMDGYHGFEKKQV" gene complement(10733..11209) /locus_tag="DP116_20680" CDS complement(10733..11209) /locus_tag="DP116_20680" /inference="COORDINATES: protein motif:HMM:PF03259.15" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20680" /translation="MNIDEIRAILKKFASNMMGFQGTALVNSEGKPIATIGMDDDSAL IMAGTMIYLANRTRKEVQWEGIEQISVKGADGYVILTTCSPDIFLLVIASKVPEGMLV VDINRTVDKLKAVLKDEESQSTDSNQTKLQESVSKLISKDGKFSHPLMYRGSKIPD" gene 11306..11545 /locus_tag="DP116_20685" CDS 11306..11545 /locus_tag="DP116_20685" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20685" /translation="MRSESRSQNTRLLTNPAILTSLVLNFYTASRNKGVRQRSPLARP MPQPNGVWLPQIGDLGRGVLEKPLWAYTGDRFGAA" gene complement(11631..11870) /locus_tag="DP116_20690" CDS complement(11631..11870) /locus_tag="DP116_20690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316095.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="(2Fe-2S)-binding protein" /protein_id="PRJNA477356:DP116_20690" /translation="MTVRVRFLPDDVTVDAEVGEPLLDVADRAGVFIPTGCLMGSCHA CSVEVEDGHMIRACITAVPPRREELTINLFSDPTW" gene complement(11882..13360) /locus_tag="DP116_20695" CDS complement(11882..13360) /locus_tag="DP116_20695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317085.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobyric acid synthase CobQ" /protein_id="PRJNA477356:DP116_20695" /translation="MKAIMIVGTTSHAGKSLLTAAVCRILSRRGWRVAPFKGQNMALN AYVTANGGEIGYAQAVQAWAAGVTPWIEMNPILLKPQGDMTSQVIIKGKPMGKVSAVE YYEQYFELGWRAIEESLQHLSTEFDLLVCEGAGSPAEINIKHRDLTNMRVAKYLNAST LLVVDIDRGGAFAHVIGTLELLDPDERALIRGIVINKFRGQRSLLEPGIKWLEERTGI PVVGVIPYLEELYPAEDSLDLLERRVEKLQTELNISVIRLPRISNFTDFDPLESEPTV GVKYISPKQELGHPDALIIPGTKTTIPDLILLQKSGMAEAIQHYAAAGGTVLGICGGY QILGQMIADPEGLEGQAGRYQGLGLLPIKTVITGQKVARQRQVNSHFPQMGLPVIGFE IHQGRSRIEIPLAETENYHTLFDDANLGLVDNCLSVWGTYLHGIFDNGPWRRAWLNRL RQQRGLKSLPTGVANYREQREHILDSVAAEVERHLDLTLFLP" BASE COUNT 3981 a 2833 c 2783 g 3994 t ORIGIN 1 tgacatcggt aatcttacac cgggaaggga gtaataacag ccttgacgag taaactttat 61 ttgtaccgat ctacttattg tgttttccac aaggaacggc tcatctcaag tgttaagtta 121 agtataaaga atctttaaga taaaattact ttgtggaaaa ctttcgacaa atctgtggaa 181 aaattcatca agtataaata ctctgtggaa aacctaccct gtttttccac aagttttcca 241 cagctagcaa cctactgaag catctttgaa gtagatgtgc atcaagtttt ccacaatttc 301 cacaggtagg acgactgaat aactaaaaaa tcataaactt gaccaatgat tccttgatgg 361 tacaagccaa aaaacccgtc cgtggaatca atccataagt taagatccaa aattcccaag 421 cctttgaacc cgtccgtggg gtcaatccat aagttaagat ctaaaatcca aaattgtctg 481 accatgaaat tagtttgcac ccaaagcgac cttagtacta acctctcact caccagccgt 541 gctgtaccct cacgcccaac acatccagta cttgctaacg tgctactaca agcggatgct 601 gaaactaacc aagtcagcct aacagcgttt gacctcagtt tgggtatccg cacgagtttt 661 agcgccgaag tcttgcactc tggagaaatt gcgatccccg ccaagctgct taatgacatc 721 
actgctcgcc ttccagaggg tgaaattact ctagaaaacg aatcagcttc tatagatgat 781 cccttagcag gggaaggctc aattgtgact ctgataccca aaagcgggcg ttatcaagta 841 cgcgcaatgg gagcacaaga gtttccagaa ctccctgtca tcgaaaatac tcaagcacta 901 catattcctg cgggtgcatt aattgaagga ttacgaggtt ctctatttgc aaccagtgca 961 gatgaaacca aacaagttct caccggcgta catttaacag ttcaacaaga cacactggaa 1021 tttgcagcaa ctgatggaca tcgtttagct gtcgtacaaa cgactaatga gagtccagat 1081 acaaatacgg aaactcgctt agaagtgaca gtcccagcca gagcattaag agaactagaa 1141 cggatgctgg ctcatactaa ctcacaagag gaacctgtgg cgttgtattt tgaccaaggt 1201 caggttattt ttgaatggca aaatcaaagg ctgacgagtc gtactcttga aggacaatat 1261 cctgcttacc gtctactcat tccccgacaa tttgagcgag aattggtttt agataggcga 1321 caattcttga gtgctttaga acgaattgct gtgttcgcag atcaaaagaa caatgttgtt 1381 aaggtgagca tggatagcga agcgcaggag ataactatat cctgtgaagc tcaagatgtt 1441 ggcaatggta gagaatcaat gtcagcacag atattaggag atgatataga tattgctttt 1501 aacattaagt atttgatgga aggtttaaaa gcattgccct cgactgaaat tcatgtgcag 1561 ctaaatggaa atttgacccc agtgattttt actccagttg gtggttttaa gatgacttat 1621 ttagcgatgc ctgttcaact taggaattaa aaaagaatcc agaagtcagc aatacagcag 1681 tcataattcc tcgttccttg ccagaggcaa ggaatgcatt tgctcaaggc tctgcctaat 1741 cgctgattaa gaggcagagc ctcttataat gcattcccat gcagagcata ggaacgagaa 1801 ataggaacga gagagacgag agagacaaaa gagcctatta tgccgcagca aaacggattt 1861 tgagatagtc ttcctccatt tttgcccccg ccggttgcaa tgctgctaaa gcttgtggca 1921 agactaaatt tcggcggtga ttaccaattg tgatatttaa ttcatctcca cttttactta 1981 attgaacctg gttcttagga atgccaggta agtacaattc caagctgtat tgattatttt 2041 cttgaacaac tctaattgtg gtttctttgt aataaacctg agttggatct tcatctttat 2101 acagtgtttc cttgaggcgt tctaaggccg ccaaaccaca catctcttca gaaaacagtg 2161 gaacttcctt aataggtaac ggatggaagt tttcatgaat ttcctggcgg tactgccgtt 2221 gattctcttt ccaacgctgg aaaaaggggt cttgcactgc ttccggaata atgcgatttg 2281 cgacgactaa atccgttgca acgttatata aactcaaata agcatgagcg cgaagagatt 2341 ctttaatcac cattttttca gggttggtga caaggcgcac agaggtttga gtgttatctg 2401 ttaatacttt 
ttccagagct tctatttgtt gatagaattc gtaaggtgcg tccatcacct 2461 ctttgtctgg taaagagaaa ccagcaattg gtttaaaaaa aggttcaact aaaggtctta 2521 gtgcgacaga aatgttttga aaaggtttgt aaaaacgccg catataccag ccactgactt 2581 ctggtaaact taacagacgt agtgctgtgc cagtaggggc tgagtcaata atcaaaacgt 2641 caaactcccc ttcatcgtaa tggcgtttca ttctgaccaa gccaaaaatc tcatccatgc 2701 ctggtaaaat agctaattct tccgcctgca ctccgtctaa accccgtgcc tgtaaaactt 2761 gagtgatgta gcgctttaca gcaccccagt ttccctctag ttctagtagt gcatccagtt 2821 ccgcacccca caaattgggg cgaatttgtt tcggtgcgtg tcccagttct agatcaaaac 2881 tgtctgccag agaatgagcg ggatctgtgc tcaaaacaag tgtacgataa cccagttctg 2941 cacaacgaag tccagtggcg gcggctacgg aggtttttcc cacaccgcct ttgcctgtca 3001 ttagaattac acgcatggat ggttttgtcc aagatgagga gtatttacat ttatttacat 3061 tatgaactct ttaagggaaa taaatgctgt ttcagtacat ctaggcgcga acaatgccaa 3121 taatctatgt gagacacaat caactcttga gtgttaaggc gtaattcact ccagcctggg 3181 atagaaatac gtggtttcca aggaaggggg gtattccagc tgagtgtcca ctcggttttg 3241 attgtgtctc cctctgtgcg aatatcgtgc aagtccattt tgggatttaa aaagcaagtc 3301 ttgatgaatt taatcatcag cttgtacttt tcaacaccac gaaatttgaa gacttgatct 3361 tgaaaataaa cgtcttgagc gtagatgctg tatgtctgat tatcaggaaa tctttgatag 3421 tcttgtttga gaatttcaac gatatccata atgcaataat ccaattaaat aatccaaaag 3481 acaaagtgaa aaaacttaaa gagtgaatct cttatattat cttagagcat gtttataaat 3541 aaaagataaa aatttttatg gtgtgtcacg acgcacgatt ctacatctat aaagcaaaaa 3601 ttctagaacc ccttacacta gggttgtgtg ctaaaaaatt tctattcata tacaatttaa 3661 gcgaaacaaa ctctaaaaaa tagctaataa tatggaggac acaaatggca aacacaagag 3721 acacatttga ctggattttt ttttaaggca gtttagccga gatttactgg ctcgtttaga 3781 agatagcgaa tttacaaatt tactatcaga gattattaat aataaatggt taggctatcc 3841 tggagcaata gatactgaaa ttctcaccgt tgaagctcgg ttggaaacaa ctttaccacc 3901 ttctttccga aactttttag cagctagtaa tggttggcaa aagttagaca atatgattga 3961 tagactttta tcaactcaag agattgattg gttggcttca agaaaccaag agttgattga 4021 tggatggatg actggaattc aaataggaag tggttatact cctgtgcctg atgaagaata 4081 ttttgtttat ggagacgagc 
aagactcaac tttgctacga aacgagtatt tgcaaaccgc 4141 cttagaaatc ggtggtgact caaatcaagg attattacta cttaatcctc aaatcgtatc 4201 tgagaataac gagtgggaag cttggttttt tgctaattgg cttcctggcg cacatagata 4261 cagatccttt tatgagttga tgctagactt gcacaagcgt ttacttgagc ttccaaagct 4321 tcgggagatt cctaattctg atgagaatac ttacaaaaaa cttctttaac acttgcatta 4381 actcaaacct caactccaca ttcggctaaa cctaaataat ctctagctaa caaaacccgc 4441 aaagcgaccc tttaggaaac cactacctga aatcgacctc gatagatcta cctgattggg 4501 ggcatcgcag cacaacttgc atttcaaacc ttctttccac aattactttt gtgttgtttt 4561 ttacctcatc tccagcgaat gcgtcttttt gcacttagtt ctggaattga cattcaaatt 4621 tctagatgca agatctgagt taatgaaatg aagtctctaa aatttttata gagaattgga 4681 atcaacctca ttccacaagc gctctaactg ataccgccat gctgctaaaa cttcctctct 4741 ggcttgtgca gtttgatcta gttcacccat caatccttct gcataaaagc gttctaagtt 4801 ttgtagagtg gcttctaaag attgattctg ctgtttagct gcaatcacaa tctgccgaat 4861 acgtttttca atcagtttgt taaaggcatt atctaaacca gttcgataga gagaaatcag 4921 ttgggcgatc gcatcttcaa aatcttctgg cgttaaacta ttgcagttgt gctgaaacac 4981 cccccataaa gccccttgct gcaaggcata acgtacttcc tgagtttctt caaagtttgc 5041 ttctaaaaac tgttctaaaa aaggttgagc ctcagatact ggcataatgg gtaataagat 5101 gcgtaaccag ctttcatcat ccgacagcaa aaccagcagc cgaaaatcgg gagtttctac 5161 ttgccaggaa ccaggtgcaa tagcattgac agacgctgca ctaaataact ctttgagcgt 5221 atccgcgatt tgcgagggta tcatagtctt tattttcgct tcagtttata gcgtaccttt 5281 ttcagctagc aatgttatct gctttagtca taagtctttc ttttgacgta gctgtctttt 5341 cattaattgc tattttgtaa gaattcagga gccagaattc aggagtcaga agagttaaga 5401 attccgttaa aaatttgatg aaaggaatat accaaaaatc atagactcct tatcaattct 5461 gacgaatgat ctcattggtg ctgaattctt acgctatttt ttttaaattt cttgactgag 5521 gggcttgatt aattcttcaa aggtttctta taaattaaaa atactgaacc aattggtccg 5581 gggttttcca cataaatttt attattatag tacagccctg gagacattcc tccatcaaat 5641 aacattcctt cttgaatctt gcctaaacag ttattgcgag caattccttc aagaacatca 5701 tcaaacatat ctggtaaaag ctcttgatta acttgagata cttgaatatc ggaatttgcc 5761 ttcagatcat tgactagtaa aatgacataa 
cctttatcag taatagctac aagagagcga 5821 tttcttgctt gttggcaagc aagttctcct aaatcttgac aaatatcttt aaacttgccg 5881 ttacgataaa acctgccatt ccctcctact aaattatagt taagaatatt atcctttctt 5941 cttcctactt gaagagtcgc ctgacgctgc tttggttgat ctgctgatat accaaaagaa 6001 gaacgtttat ttttaaaatc tcctgaatat tccactcccc gagaaatatt taagccttga 6061 ggtttattgt tagtatctat gtaatccgca ttaattgctg caattggtag ttgcccgttt 6121 aattttgcat tttcattact tccgagttca tgaaataatt ttggtatata ttcttgacga 6181 aatttccctt gattgtcttt tgcataaagc tgatgagaca aacccatatt gactttgaag 6241 tctaaagctt ctgattttgg gttaaaaata atgacatggt taattccttt tgcatatctt 6301 tcaccctggt tattggtttt gaagaaagta atactgaatt gagaatttcc tcctaaacaa 6361 gtttgggagg cttcacttga tgcattgaag ttaatatatt tacagcctgc taaaaaacag 6421 agaacgattg tagatagaaa aaataaaatt cttgtgtttt tagaaaaaag acaaaactta 6481 gtcatcataa gtaaacaata tcattatcat ataaacaata aaaagctcac ataattttca 6541 gctgcgataa tttagtgagc gaactgtcag tttattaagt atcatggcaa tttataatat 6601 caagattaag aggcgagcat ggtgaaagag gcgttcgcgc cctgcgcggc tccccttggg 6661 agcatcgcgt gcgtcgcggt ttgcagcatc actactgtgt tcaacaaaag ctatatacct 6721 caagacttgt ataaacttgc aagaagattt tacagtacga agagttccag acatgatgaa 6781 ttctttgttg cggcgctcac ttacagaaaa aacttcacta gaacatgaaa cccgtttcac 6841 tatatagacg tgggagtctg gactaggcta agctaatata ctatatctac ttatactaat 6901 tccccatgaa tatgagttta atctttctta ataattttgt ctggaagcct tacacactgt 6961 ggttgttttt caccaaatat taagctcaag tttactccaa aattggtatt agttctaagt 7021 taaaatcttt gcttaaataa aaaatattgt gacctactgc atcaacccct ggtgtcaaca 7081 gcgccaaaat cctaattatg tggagcgttg tcaaacctgt gggaatagct tatttgtaaa 7141 tgagcgatat cagttagtaa aaccgttgcg cgaattgcgc caccaacaat atactgaagt 7201 ctttgaggta gatgatcggg ggacgcgaaa agtcctaaag gttctcaaag aagacgatcc 7261 gaagttgatc gaaatgttgc agcgagaggc aagcgtcttg agtactctca accacccagg 7321 aattcctaga atcgagccgg atggatactt taccttaacc ttagcggttg gctacggttc 7381 ttatatatta ttgcactgct tagtgcttga gaaaattgag gggcaaaact tatggcagtg 7441 gttagagcag aatcaaccaa tttctcaaaa gcaagctcta 
atttggttgc aacagattgt 7501 agagattcta gctcttgtgc atgaaaataa acttttccat cgggatatca agccatctaa 7561 catcatgctt aaacctgatg gtcaacttgt gttaattgac tttggctctg ttaaacatgg 7621 caccaaagct tacttgagcg gtatccgtaa gggtcttgaa ggtacgataa tattgtcatc 7681 tggttatacg cccccagaac aaatccaagg caaagctgtc aagcaatcag atttttatgc 7741 actagggcga acatttgtca acttactcac aggtaaccct ccaagttcat ttttacagga 7801 ctcgacaaca ggtcagttaa tttggcgaga cagcgctcct gggatctcga agccattggc 7861 agatttaatt gattccttga tggctttttc tccaaatcac agacctgaaa atacgcgggt 7921 tattttgcaa agactagaaa agattagtcg cgaaactgat gaattgatgc aagctataga 7981 gtcatcaatt cgttcaagtt tgtcacaact gcctcaaccg ctaaaatttt ttactgaaca 8041 aattctcaaa aataaacttc agcggttggc gaagcaaagc agaattccta aaatagctct 8101 ttacgggcgt tctggtgcgg gcaaatcatc tctgattaat gccattctgg gacggagtgt 8161 taccgagatt ggtctagcca aagcaacaac acaaatcact gaaatatacg attatgagcg 8221 gaacgggtgg aaattaagat tcgtagactc acgtggtgta ggtgactcac gggatcatgc 8281 agcctttcaa caagctatta acgacatcgt ccaaaacaaa gtagacatcc tgttattcgt 8341 tattccagct gatgagcgag catatgtgac tagcgatatt gattttctca ctgaactgaa 8401 atcaacgcac aagcgaaaac atggagttga actaccagtc attttggttt taaacaagat 8461 tgatcgaatt caacccacct cggagtggaa tccgccctat aacctttttg actcccagac 8521 tcgcgacacc aaattcaaaa cagccagaca gcaagcaaag gaagcaaata ttcgtgattg 8581 cgttatggcg agaatcaccg agtacaaaac tttaagttca atgtatgttt atatttgtac 8641 tttatgggat gaatacgacg ataaacggta caacatagaa gaactcgctc tacggatata 8701 taaatgtatt cctgacgaag ccggtaaaca gggctttgga ggagcaaccg ccgctatcac 8761 actcaagaaa gctgtcgccc gggactatac atttgtcgcc gcatgtttgg catttcttgc 8821 tggctggttt ccttttggtg accaaaaggg agtgttgtcg atacaacgtc gcttagtcag 8881 tatgattgcc caaattgcta ctaacgcaga tcagagcaat gaagccgaaa aattattgag 8941 acaattgggt gttcaacaag ctgatacgaa atcacctcta tctacgacat tggcaattgg 9001 tgaagcggca attcgttatt ttatcgagca agacagcact attgaacaag cgcagcaggc 9061 ttttaccgaa gaaaaagagc gacgagaacc agaatttcaa gaagcactta aaggaggaac 9121 taacaaagta gtgagcaaac tgcgagaaat agatcaagaa ctatatgagc 
gttacggctt 9181 gccacgtatg tatgaggatg ataaggatga tatattgtaa aatttgacag aatatgaaat 9241 tggagtctaa accaaaactc aaagaattaa agtgcaaaaa aagctggctg tcactgaatt 9301 ggaaattctg cttacccttg acagctataa ctgtgctgat tttgagttcc gccgcagttg 9361 ctcaaacttc cacatcccca gtcaaaccct ctctaagctt acaactagct gtggacacgg 9421 tatgggtgct gttcactggc tgtttagtgt tctttatgaa tgctgggttt gcaatgttgg 9481 aaacaggctt gtgtcgccac aagaatgccg tcaatatatt agctcaaaac ttcattgtat 9541 ttgcagtggc gacagtggct ttctgggtaa ttggctgtgc gcttatgttt ggggacaaca 9601 caaacccatt gtttggaaca aaaggttggt tttttgatgg aacagaccaa cagatgttca 9661 aatcccttaa gtctagtgta cctcagtcag ctctattttt ctttcaactc gtttttgcag 9721 gcaccgcagc aactattttt actggagcgg ttgctgaacg gattaagttt attgcatttt 9781 ttatctttag ctttttactc attggtattt cctaccctat tactggtcac tggatttggg 9841 gtggtggttg gttatccaaa ttaggcttct acgattttgc cggttcgact gtggttcatt 9901 cggttggtgg ttgggctgct ctagtgggag cttggctgtt gggaccacgt ttgtacaaac 9961 ttgaatcacg cggtgcaccg atttatcgct atctcaaaaa cggtagcaac attgctatgc 10021 ctgggcataa tcagagtatg gcgactctgg ggggctttat tctttggttg ggctggtttg 10081 gctttaatgc aggctcaacg ctgaaggctg atcctggtgc gatcgctcat attttgctca 10141 caaccaacat ggctgctgca acgggtggta tcgcagcaac cctgatctct tggaggcgct 10201 ttggtaagcc tgaacttacc atgatcatca atggagtcat ggcaggtttg gtatccatta 10261 cagcttcctc tgctttcgtt agcattcgca gtgcgttttg gatcggtctg attgcaggaa 10321 ttcttgtgtt cttctcggta ctttttatag acacaaagtt aaaaattgat gacccagtgg 10381 gtgctatctc agttcacctt gtcaatggta tctggggaac actagctgtg ggcttattta 10441 gcgtcggtat agaagataaa ctacgcaata atgctgtgca atttgcgcct ggtccaaaac 10501 cgggattatt ctgtggtggg ggtttagaac aattcttgat tcagttactt ggtattatct 10561 cagtgggaat tttcacattt gtgttcagcg ctcttgcttg gtcggcaatt aaagctacag 10621 ttggtctacg agtttcaccc caagcggaac ttgatggttt ggatattagt gagcatgaca 10681 tggatggtta ccacgggttt gagaagaaac aggtgtagta aatctccact ggttagtcag 10741 gtatcttact tcctctatac ataagaggat gggaaaattt accatcttta ctaataagtt 10801 ttgatacaga ttcttgcaat tttgtctgat ttgagtctgt 
gctttgggac tcctcatctt 10861 tgagtacagc cttcaacttg tccacagtgc gattgatatc tacaactagc atcccttctg 10921 gaaccttact ggctatgact agtaagaaaa tatcagggct acaggtggtt aaaatcacat 10981 aaccgtctgc acctttaacc gaaatctgct caatcccctc ccattgcact tccttacggg 11041 tgcggttagc caaataaatc atcgttcccg ccataatcaa ggcggaatca tcatccatcc 11101 caattgttgc gatcggctta ccctcagaat tcacgagtgc tgtaccctga aagcccatca 11161 tgttactagc aaattttttt aaaattgccc taatctcgtc tatattcata actggctata 11221 tatctaattc tgctagagca tacacttatt tgatgataac ctagcagatt ttcggacatt 11281 gatcgtcata gcttcgtaaa aatcaatgag gtcagaatcc aggagtcaga atactagact 11341 ccttacaaat ccagcaattc tgacctcatt ggtgctgaat ttttacactg cttcacgaaa 11401 taaaggtgtt cgacagcgat cgcccttggc gcgccctatg ccccaaccaa acggagtttg 11461 gttgccccaa atcggagatt tggggagagg ggttttggaa aaacccctct gggcatatac 11521 gggcgatcgc ttcggtgcag catagcggca gatcactagt gagaatagga catgaattag 11581 gagttatcag ttgaaaactc ataactccta actcataagt attatttcaa ttaccaagtt 11641 gggtcactaa acaaatttat cgtcaactct tcgcgtcttg gcggtactgc tgtaatgcaa 11701 gcacggatca tgtgtccatc ctcaacttcc acactgcaag cgtgacacga ccccatcaga 11761 cacccagtgg gaataaatac gccagctcgg tctgctacgt cgagtagggg ttctcccact 11821 tcagcatcaa ctgtaacatc atctggtaag aagcggacgc gaacagtcat gaaaattacc 11881 ctcaaggcaa aaacagtgtt aagtccaaat ggcgttcaac ttcagcagct acagagtcta 11941 agatatgctc tctttgttcc cggtaattag caacccctgt tggcaaagat tttaaacccc 12001 gctgttgacg tagacgattt aaccaagcac gtctccaagg accattgtcg aagataccgt 12061 ggagataagt accccagacc gataagcaat tatccaccaa gcctaaatta gcatcatcga 12121 atagagtatg gtagttttcg gtttctgcta gcggaatttc tatgcgcgat cgcccttgat 12181 gaatttcaaa cccgattacg ggtaagccca tttgtggaaa atgcgagttg acttggcgtt 12241 gacgagcgac tttctgtcct gtgatcacag tttttatggg taacagccct agtccttgat 12301 atctgccagc ttgtccttca agtccctctg gatcggctat catctgacct aatatttggt 12361 acccaccaca gatacccaaa actgtaccac ctgctgctgc atagtgttga attgcttctg 12421 ccataccgct tttttgcagc aatatcaagt caggaattgt tgtttttgta ccaggaataa 12481 tcagtgcatc aggatgtccc 
aattcttgct taggacttat atattttacc ccaactgtag 12541 gttctgattc taatggatca aagtcagtaa agttagaaat tcttggtaag cggatgacac 12601 tgatgttgag ttcagtttgt aatttttcta ctctccgttc caatagatcc agggaatctt 12661 ccgcaggata taactcttcc aagtaaggga taacaccaac aacaggaata ccagtgcgtt 12721 cttccaacca ttttattcct ggttctagga gcgatcgctg tccacggaat ttgttaatca 12781 caataccgcg aatcaaagcg cgttcatctg gatccagtaa ctccagggtt cctatgacat 12841 gggcaaaagc accacctctg tcaatatcaa caacgagtag agttgatgca ttcaagtatt 12901 ttgccacccg catattggtt aagtctcggt gcttgatgtt aatctccgca gggctaccag 12961 caccttcaca aaccaacaaa tcaaattctg tactcaaatg ctgtagagat tcttcaattg 13021 cccgccaccc gagttcaaaa tattgttcgt agtactctac agcgctgact ttacccattg 13081 gttttccctt gataatgact tgagaggtca tatctccttg tggtttgagt aagatagggt 13141 tcatttctat ccaaggggta actcccgcag cccaagcttg taccgcctgg gcgtagccaa 13201 tttctccacc attagcggtc acataagcat ttaaagccat attttgacct ttgaagggag 13261 ccactcgcca gccacggcgc gacaatatac gacaaacagc cgctgttaac agtgatttcc 13321 ctgcgtggga tgttgttccc acaatcataa ttgctttcat aaccaaatat ttgtttttcc 13381 agcttgggtc ggttaattca aaaggtgaaa ttagtcatga gttttcagac tacgaactct 13441 tgactaggga gtcaacgccc tacgggcagg gtgtaagggg gtaagggtat aggggtgtag 13501 gggaacccca tcaataaatc gcgcagcgca gtggagtgca gtggcggtct gggcttgata 13561 caaggacgcc tcaacttcgg caccatgtgt c // LOCUS NODE_2519_length_13427_cov_5.07096913427 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13427) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13427) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13427 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..833) /locus_tag="DP116_20700" CDS complement(<1..833) /locus_tag="DP116_20700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864963.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="PRJNA477356:DP116_20700" /translation="MRSGLVSNGLAGQNHEISSVIPDVSVVVPVHDEVESLPHLLEAI ASTLSNTDLSYEIICVDDGSTDGSAQFLREQGQIRNDLKAVILRRNYGQTAAMAAGFN YALGKAIVTLDADLQNDPADIPMLLAKLEEGYDLVSGWRKNRQDGAVNRLLPSKIANW LIRRTTSVYIHDYGCSLKAYRAELVADMNLYGELHRFLPALAYIEGARITEMPVRHHA RRFGRSKYGIWRTFRVLMDLLTILFMKKFLTRPMHVFGLWGLIAMTSGGAIGIYLTFV KL" gene complement(849..1571) /locus_tag="DP116_20705" CDS complement(849..1571) /locus_tag="DP116_20705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314643.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NlpC/P60 family protein" /protein_id="PRJNA477356:DP116_20705" /translation="MSFNLESKIQTLKPGEYQCLADLNIYDSPSCDRLATQAATGRYL WVTSSYQDIETLHATSVQVCLSEDDYPGWLSLSDLGLLQSCTLPYKAKTFSESEIKKR LPEVIEFTQKAMQQPNYYLWGGTVGPNYDCSGLMQAAFASVGIYIPRDAYQQEDFTQS ISIAELQPGNLVFFGTSQKATHVGLYLGDGCYIHSSGKVIGRNGIGIDRLSEQGDEVS QSYYQQLRGAGRVVKSYEPQQR" gene 1585..2529 /locus_tag="DP116_20710" CDS 1585..2529 /locus_tag="DP116_20710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314644.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine hydrolase" /protein_id="PRJNA477356:DP116_20710" /translation="MNFFFNKDDQLENLGLGILEATWAAFPTLARNQIALTWIVYDPP VLVNTGGALTPDAFWNYPIRGFTYRGVERIYPASVVKLFYLVAVNEWLEKNMVSTSKE LERAMRDMIVDSSNDATSLVVDILSGTTSGPELSPGPFETWKQQRNIVNRYFQSLGWS EMETINVCQKTWCDGPYGRERAFVGELLENRNMLTTNATARLLHSIIGGVAVSSGRSQ AMMALIKRSLNPNDLPKDAEEDQVTGFLGGALPQEAQIWSKAGWTSQVRHDAAYIELS EKRPYLLVVFTEGKANAKNREILPFVSKLVREAVVGLG" gene 2673..3464 /locus_tag="DP116_20715" CDS 2673..3464 /locus_tag="DP116_20715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129816.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YdcF family protein" /protein_id="PRJNA477356:DP116_20715" /translation="MFLYLSKLLPLFLYPLGLACLCLLVALFMLWKRPRTAALAIALA LILLLFTSNGWVSRSLVQSLEWQNLPLTEIPTAEAIVVLGGATKSALPPRPSVDLNEA GDRVIYAGQLYRQKKAPIIILSGGRIDWKGGGPSESADMATILTSIGIPKEVIVQEPD SLNTYQNAVNVKKILNSRGIRRVLLVTSAIHMPRSLLIFQHQGIDVIPTPTDFLVSQS ELQEMTSTPKGALLNVLPDVDNLHLFTSALKEYVGILAYRLRGWL" gene 3535..3759 /locus_tag="DP116_20720" CDS 3535..3759 /locus_tag="DP116_20720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744672.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20720" /translation="MSQELPQIIPSPQSEVEDTFSAYQTTHQFYTEVQARSELKQYCE WYYTTAERHRQDLEKMRGELNIMQWFRKTR" gene complement(3903..4172) /locus_tag="DP116_20725" CDS complement(3903..4172) /locus_tag="DP116_20725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875990.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferredoxin--nitrite reductase" /protein_id="PRJNA477356:DP116_20725" /translation="MAVETLVEQKLGVNTVMKIGDRIRVKESVIVYHHPEHRGEAFDI KDTEGEVISIINQWQGRPVSANLPIYVQFSKKFKAHLREAELELI" gene complement(4930..5475) /gene="pgsA" /locus_tag="DP116_20730" CDS complement(4930..5475) /gene="pgsA" /locus_tag="DP116_20730" /EC_number="2.7.8.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315464.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase" /protein_id="PRJNA477356:DP116_20730" /translation="MTLPNWITFSRLLGIPFLLYGLYNPTPQARWICLAIFLVASLTD WLDGYLARKLNQISDLGKFLDPLVDKLLVLAPLLALIELGKVPAWGVFLILGRELAIA GWRVSKTKITGANIWGKLKTVSQIIAIALLIAPLPQVWQLPSIIAFWISVVLTIISGA IYLLPEKDTPHTSGNASSPKE" gene 5798..7234 /gene="aspA" /locus_tag="DP116_20735" CDS 5798..7234 /gene="aspA" /locus_tag="DP116_20735" /EC_number="4.3.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315463.1" /note="catalyzes the formation of fumarate from aspartate; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aspartate ammonia-lyase" /protein_id="PRJNA477356:DP116_20735" /translation="MTQQINSQVRIERDSMGDRQLPSTAYYGIQTLRATENFPISGIK PLATYVDAGVLIKKATAIVNGELGCIPQDVSQAIIQAADEVLAGKFRDQFVVDVYQAG AGTSHHMNVNEVLANRALEILGEEKGNYKRVSPNDHVNYGQSTNDVIPTAIRIGGLLA LSKTLHPALEKAIAALEEKAEEFQHIVKSGRTHMQDAVPVRLGENFRAWAHILSEHQN RIYTASSDLMVLGLGGSAAGTGLNTHPQYRARVVETISDLINLPLEPASHLMAAMQSM APFVHISGALRNLAQDLVKISHDLRLMDSGPKTGLKEIQLPPVQPGSSIMPGKYNPVM AEMTSMVCFQVMGYDSAIALAAQAGQLELNVMMPLIAYNLIHSIEILGNTIAALTERC IKGITANQERCLAYAEGSLALVTALNPHIGYLNAAAVAKESLETGKSLRQIVLERGFM SETELAQVLNLEQMSAIVPLNILDGTSD" gene 7306..8277 /locus_tag="DP116_20740" CDS 7306..8277 /locus_tag="DP116_20740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141025.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thylakoid-associated protein" /protein_id="PRJNA477356:DP116_20740" /translation="MQTQTASVSLIRATSYEREALRESLETLLEPLGGMAAFVKEGNR VLLKPNLLTGARPSKECTTRPELVYAVAQMVIEAGGKPFLGDSPAFGSAKGVAVANGY QPILEELNLPIIDFHGQRYQTVSEDFDHLLLCKEAMEADVVINLPKVKSHAQLVLTLG VKNLFGCVPGKMKAWWHMEAGKDANRFGEMLVETAKAIHPNLTILDGIIGHEGNGPSG GEPRPLGILAASPDVFALDRAMVQILNVPAEQVPTVAASIRLGLVGELNNINFPHLHP DLLKIDDWRLPDKFLPIDFGMPRVIKSTFKHLYIRFIKEPMSAYAKR" gene 9201..9389 /locus_tag="DP116_20745" CDS 9201..9389 /locus_tag="DP116_20745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315461.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20745" /translation="MHSNTQSADPASTYSHRLADIVGTMIGLLTLTLPLFVIGHYSSN SVQNHQQPITYNIKADAD" gene 9736..10254 /locus_tag="DP116_20750" CDS 9736..10254 /locus_tag="DP116_20750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="inorganic pyrophosphatase" /protein_id="PRJNA477356:DP116_20750" /translation="MDLSRIPAQPKPGLINVLIEIPGGSKNKYEYDKELQAFALDRVL YSSVQYPYDYGFVPNTLADDGDPLDGIVILDEPSFPGCVIAARPIGMLEMIDGGDRDE KILCVPDKDPRYTSVKSLKDLAPHRLQEIAEFFRTYKNLEKKVTEILGWQDVDQVLPL VEKCIRAGSGKS" gene 10659..11045 /locus_tag="DP116_20755" CDS 10659..11045 /locus_tag="DP116_20755" /EC_number="4.1.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209012.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aspartate 1-decarboxylase" /protein_id="PRJNA477356:DP116_20755" /translation="MQRTLLLAKIHNCTLTATNINYVGSISIDQVLLDKAGILWYEQV HVVNVSNGERFITYAIPAAPNSGAIELNGAAARLGMSGDRLIIMSYAQFNSEELKIYS PTVVIVDNRNQLLEVRRYDDLLSRQT" gene 11058..11957 /locus_tag="DP116_20760" CDS 11058..11957 /locus_tag="DP116_20760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_20760" /translation="MSNFKQSPSQKQPTPEPLPQILSPELEFVVQFWGVRGLIPTPDS HTIQYGGNTACVEMQIAGKHLVFDGGTGLRLLGKNWLQQQSPLNAHLFFTNSQSNRIQ GFPFFAPAFKSENCFHIYGTAALNGASIKQCLCDQMLQPHFPYPLQFMQSELHFHNLT SGKVVNIDDLTITAAIINHEQKSIGYRVSWKDYSVAYLTDLSKIANEADRKCIAQLAK NVNLLIANATYTTPAAPNHNEPDFHWGTAVDLAKTAKVNQLVISHHRPDDHDDFLDQV QIEVQSVFPKALLAKEGLVLAIV" gene complement(12006..12407) /locus_tag="DP116_20765" CDS complement(12006..12407) /locus_tag="DP116_20765" /inference="COORDINATES: protein motif:HMM:PF13358.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20765" /translation="MVQLSHSQDNIFTQVYPKLNGEYFQQFLDWYLVQLGEDYAILQI DQAPAHISGAINWPENIIPLLQPPHSPELNPIERLWQYLKKSLKNELFSSLQDLRTRI QQLFEQLTFEQVISISSYNFILEALFYAASY" gene complement(12558..13067) /locus_tag="DP116_20770" CDS complement(12558..13067) /locus_tag="DP116_20770" /inference="COORDINATES: protein motif:HMM:PF13565.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20770" /translation="MSRPFKIEIAESEEELKKRLQTANLGKQKEKLMMLWWIKSGQVQ EQQDIGKRLAKDTSTVTRWIQKYRSGGLDELLEIKKAPGAKRKINERAIAALEEKLKT GKGFSRYGAIVEWLKKEQGLEVEYATVYALVRYKLGAKLKVPRPQSHKQDEKLVSEFQ KNSVSSSIS" BASE COUNT 3961 a 2845 c 2718 g 3903 t ORIGIN 1 agtttcacaa aagtcaagta aattccaatc gccccacctg aagtcattgc aatcaagccc 61 cacagcccaa aaacgtgcat tgggcgtgta aggaactttt tcataaacaa aattgttaac 121 aaatccatta atacccggaa cgtccgccat atcccatact tgctgcgacc aaagcgacgg 181 gcgtgatgac gcacgggcat ttctgtaatt ctcgctcctt caatatatgc caaagccgga 241 agaaagcggt gcagttctcc ataaaggttc atatctgcca cgagttcagc gcggtaggct 301 ttaagcgaac aaccatagtc atgaatatac acacttgtgg ttcgcctaat taaccagttg 361 gcaattttgg aaggaagtaa tcgattcaca gcaccatctt gccgattttt tcgccaaccg 421 ctgaccaaat cgtagccttc ctccaacttt gccagtaaca tggggatatc agcagggtca 481 ttttgaaggt cggcatctaa agtgacaatt gctttaccca aagcatagtt aaatccagca 541 gccatcgccg cagtttgacc ataattgcga cgcaaaatca cggcttttaa atcgttgcgg 601 atttgccctt gttctctgag aaactgtgct gaaccatctg ttgaaccatc atccacacag 661 atgatttcat aacttaagtc agtgttggat aatgtggatg cgatcgcctc aagcaaatga 721 ggcaaacttt ccacttcatc atgcacaggt accacaaccg aaacatctgg gataaccgac 781 gatatttcgt gattttgccc agctaaccca tttgaaacca gtccactcct catattcctc 841 ttctttcctc agcgttgttg aggttcgtaa ctcttgacca ctcttccagc accgcgcagt 901 tgctggtaat atgactggct aacctcatca ccttgttccg aaagacggtc aatgccaata 961 ccattacgtc ctattacttt 
gccagaacta tggatatagc agccatctcc caaatacagt 1021 cccacatgag tcgctttttg ggaagttccg aaaaacacaa ggttacctgg ttgtaattct 1081 gcgatagaaa tggattgggt gaaatcttcc tgctgatagg catctctagg tatataaata 1141 cctaccgaag cgaacgccgc ttgcatcaat cctgaacagt cataatttgg tcctaccgta 1201 ccaccccaga ggtaataatt tggttgttgc atagcttttt gggtaaactc aattacctct 1261 ggcagtcgtt ttttaatttc tgattcagaa aatgttttcg ctttataagg taaagtgcat 1321 gattgtaaca aacccaaatc cgatagtgac agccaccctg gatagtcatc ctcggacaaa 1381 cacacctgta cagacgttgc atgcaacgtc tctatatcct gatagcttga tgttacccac 1441 aaatatcgcc cagttgcagc ttgagttgcc aatcggtcac aactgggaga atcatagatg 1501 tttagatcag ctaagcactg gtactcacca ggtttgagag tttggatttt agattctaga 1561 ttaaaggaca tcggtgaaac aacaatgaat ttcttcttta ataaagacga tcaactggaa 1621 aatcttgggt taggcatttt agaagcaact tgggctgcat ttccaacttt agcgcgcaac 1681 caaattgctt tgacttggat tgtttacgat ccgcctgtcc ttgtgaacac tggtggtgct 1741 ttaactccag atgctttttg gaactaccca attcgtggtt tcacttatcg cggagtggaa 1801 cgaatttacc ctgcaagtgt cgtgaagttg ttttacctag tcgcagtcaa cgaatggcta 1861 gagaaaaaca tggtttccac atccaaagag ttggaaaggg ctatgcgcga tatgattgtt 1921 gattctagca atgatgccac gagtttagtt gtagatatct taagtggaac cacttctgga 1981 ccagaattat caccaggacc atttgaaact tggaaacagc agcgcaatat tgttaaccgc 2041 tatttccaat ctcttggatg gtcagagatg gaaacgatta atgtttgcca aaaaacttgg 2101 tgtgatggtc cttatggacg agaaagggct tttgtaggag agttgctaga gaatcgcaat 2161 atgctgacga caaatgctac cgcgaggttg ctacatagta ttattggtgg tgtggcggtg 2221 tctagtgggc gatcgcaagc catgatggct ttgatcaagc gcagtcttaa ccccaatgat 2281 ttgccaaaag atgctgaaga agaccaagtg acaggctttt tgggcggtgc acttcctcaa 2341 gaagcacaaa tttggtcaaa ggcgggttgg acgagtcaag tccgccatga tgcagcttac 2401 atagagttaa gcgaaaaacg cccttatctg cttgttgtat ttactgaagg caaagcgaat 2461 gctaagaacc gcgaaatttt gccctttgtc tcgaagctgg tgagggaagc tgttgtcggt 2521 ttaggttagt tatttctgac ttgataggtg actatagcgg ttctctgttg gatctggagt 2581 ttaaatacta gttcatacaa gcatgagtgg tttactcgca atggtgcaaa atctcagtaa 2641 actgacaaca cattttttga gaacattgcc 
aaatgttttt atatctctct aaattgctac 2701 cactgtttct ctatcctctg ggactagctt gcttatgctt actggtagca cttttcatgt 2761 tatggaaacg tccacgtaca gcagcacttg caattgcact agcgctgatt ctattgttat 2821 ttacgagtaa tggttgggtt tctcgctcac ttgtccagtc tctagaatgg caaaatcttc 2881 cactgactga aataccaacg gcagaagcta tcgtggtttt gggtggtgcg actaaatcag 2941 ctttaccacc acgaccatct gtagatttaa atgaagcagg cgatcgcgtc atttatgctg 3001 gtcaactcta ccgccaaaaa aaagctccga ttattatttt aagtggtggt cgcattgatt 3061 ggaagggagg cggtccatca gagtcagcgg atatggctac cattctcact tctattggta 3121 tcccaaaaga agttattgta caggagcctg attccctcaa tacatatcaa aatgcggtga 3181 atgtcaagaa aattctcaat tctcgtggaa ttcgccgtgt tttactggtg acttctgcga 3241 ttcatatgcc gcgatcgctt ctcatcttcc aacatcaagg tattgatgtc attccaacgc 3301 ctactgactt tctcgtcagt cagagtgaat tgcaagaaat gactagcacc ccaaaagggg 3361 cattactgaa tgtgctacct gatgttgata acttacacct atttacatct gctttgaaag 3421 aatacgttgg aattctcgct tatcgcttgc gcggatggct gtaagtcata ataaatagag 3481 atttgtagaa gaaatcccac tttcaattct taagttttaa ctgaatttgt gaatatgtct 3541 caagaattac cccagattat accttctcct caaagcgaag ttgaggatac attctctgca 3601 taccaaacaa ctcatcagtt ttacacagaa gttcaggcgc gctcagagtt aaagcaatat 3661 tgtgaatggt attatactac agcagagcgc catcgtcaag atttagaaaa aatgcggggc 3721 gaactgaata tcatgcagtg gtttcggaaa actagataac ccaattttga gtttcgtcag 3781 gttctttctt tgggaataat gtaagagtca ttagatatag taaagtacgg cctcctgcta 3841 aaaggaggct acttattttc aagagtgata ttcacaaaat gctgtcgttc agcacaaaaa 3901 agctagataa gttctaactc agcttcacgc aagtgggcct taaatttttt actaaattgg 3961 acataaatcg gcaaattagc actgacaggt ctaccttgcc attgattaat aatacttatc 4021 acttcacctt ctgtgtcttt gatgtcaaaa gcttcaccgc gatgctcagg atgatgataa 4081 actatgacgg attctttaac gcggatgcga tcgccaattt tcataacagt atttacacct 4141 agtttctgtt ccacaagtgt ctccacagcc atccctcact gaactttctt tttggaaaaa 4201 tagactgctt gcacatttta ttgctgtttg tacttcgtct tcacagacac gccgtcttca 4261 cgcctctgta actactagac aaacaaagcc tgcctgtgca ggctttatta ttcaatgatt 4321 tgccttagtc cttggttgtt tgttctccca atcaacgaac 
agacgaacaa agataaggca 4381 acatcaaagt cgcctcgtac tttggcaaga ggtctagtct ttattatctt cactcttgga 4441 cgttcctaag gtcaactatt tgattaggat tatgaaatta tcactccata ttcctctatc 4501 gtccacagat gctatcccta tgagaatgtt gtggtcgatg agtttatttc tggagtatgt 4561 tgcacaagtt tttgtagcta taccaaactt gtgatcacac atcataagac ttgattattc 4621 ccgaacatat tcggctaatt cctcaagcgg cttgtagcac ccaatcaaat ttagttgaat 4681 atctttgtta agaactagat gacaattttt tctacaggcg agtgtttgaa acgatgcaag 4741 cacctcatgg atacactttg tcaaagacta agagaatgaa taactattgg agaaagatca 4801 tgtccctgta agaatctttc acatttgaga attagatttt agaacgcttt tccaaacaat 4861 caggcagcta tctttgtgca gatagttgag aactttcttc aatcccgaac aatcatctta 4921 aaataaaaac tactcctttg ggcttgacgc attacccgaa gtatgagggg tatctttttc 4981 tggtaaaagg tagatagctc cagaaatgat tgtcagaaca acagaaatcc agaaggcaat 5041 aatggaaggt aattgccata cttggggtag aggtgcaatc agaagtgcga tcgcaatgat 5101 ttgactaact gttttgagtt tgccccaaat attcgccccc gtaatcttgg ttttactgac 5161 acgccaacct gcaatagcca actctcgtcc caaaatcaga aacactcccc aagcagggac 5221 ttttcctaac tcaattaaag cgagtaacgg cgcaagtacc agcaatttat cgactaaggg 5281 gtcaagaaat ttacccaagt cgctgatttg gttaagtttc cgtgccaaat agccgtctaa 5341 ccaatcagtc agagaagcga cgagaaaaat tgccaaacaa atccatctgg cttggggcgt 5401 aggattatac aagccataaa gtagaaatgg tatccctaga agacgagaga aagtaatcca 5461 gttaggtaaa gtcataaaca attcaaaatt caaaattcaa aattgaaaat taggaaactt 5521 aatcctgaga atccctctat agcagtccca aattattcat aagattttct cttctctttc 5581 ttggcgtcct tggcgtctat gtccttcgga cacgcttcgc taaggcggta tgcgccaagg 5641 gcgcacgcta cgcgaacgat aattctcaca actcatatag gattgctata gttgctaact 5701 acaggtgtaa caccgataga tgagatgaaa gggataaggg gacaagggga taaggagaat 5761 atacactctt gagtgatgac taattccaaa aaatagtatg actcaacaaa taaattctca 5821 agtgagaata gaacgcgatt cgatgggcga tcgccaacta cctagtactg cttactacgg 5881 tattcaaaca ttgcgggcaa cagaaaactt ccccattagc ggcattaaac ctttagctac 5941 ttacgtagat gctggcgttc ttattaaaaa agcgacagca attgtcaatg gtgaactcgg 6001 ctgtattccc caagatgtca gtcaagcaat tatacaagct gctgatgaag 
tgcttgctgg 6061 aaagtttcgc gaccaatttg tggtagatgt ctaccaggct ggtgctggaa cttctcacca 6121 tatgaatgtt aacgaagtct tggcaaatcg cgctttagaa attctcggcg aagaaaaagg 6181 caattacaag cgtgttagtc ctaacgacca cgttaactac ggacagtcta ccaatgatgt 6241 cattcccaca gcaattcgta ttggtggctt attggcacta tcaaaaacgc tacatccggc 6301 tttagaaaaa gcaatagcag cattagaaga aaaagccgaa gaatttcaac atatcgtcaa 6361 atccggcaga acccacatgc aagatgctgt acccgtgcgt ttgggtgaga attttcgcgc 6421 ttgggcacac attttgagtg aacatcaaaa ccgtatttac acagcttctt ctgacttgat 6481 ggtgctaggt ttgggaggaa gtgcagcagg gacaggttta aacactcatc ctcagtatcg 6541 tgcccgtgtg gtagaaacaa tctcagattt gataaatctc cctttggaac ctgcatctca 6601 cctgatggca gcaatgcaga gtatggcacc attcgttcat atttccggtg ctttacgtaa 6661 cttagctcaa gatttagtca aaatatctca tgatttgcgt ctgatggatt cgggaccaaa 6721 aactggctta aaagaaattc aactacctcc agtgcaacca ggttcctcaa ttatgccagg 6781 aaagtacaat ccagtcatgg cagaaatgac atcaatggta tgctttcagg tgatgggtta 6841 cgatagtgcg atcgctcttg ccgcacaagc aggacaatta gaattaaatg tgatgatgcc 6901 tctgattgct tataacctaa ttcacagcat cgaaattctt ggtaacacca tcgccgcact 6961 caccgaacgc tgtatcaagg gcattaccgc taaccaagaa cgttgtttag cttatgctga 7021 aggcagttta gccttagtca cagcattaaa tccccacatt ggttatttga atgcagctgc 7081 tgtcgccaaa gaatctttag aaactggcaa gtctttgcgg cagattgtct tggaacgtgg 7141 tttcatgagt gaaacagaat tagcccaagt attaaatctg gaacaaatga gtgcgattgt 7201 gcctctaaat atactagacg gcacatcgga ttagagaaca aagaaggaag ataaaagaaa 7261 taactcttcc ttcttgattt tttcttattc gttctttatt tcactatgca gactcaaaca 7321 gcatccgtca gcttaattcg cgccacttcc tacgaacggg aagcattgcg agaatcttta 7381 gaaacactgc tggaaccttt gggaggaatg gcggcgtttg tcaaagaagg aaaccgcgtc 7441 ttactcaaac ctaacctact aacaggcgca cgtcctagta aagaatgtac gactcgtcca 7501 gaactggttt atgcagtcgc ccaaatggtt atagaagcag gcggtaaacc atttttaggg 7561 gatagtcccg cttttggtag cgccaaggga gtcgcagtcg caaatggtta tcagcctatt 7621 ttagaagaac tgaatcttcc tatcatcgat tttcatggtc agcgttacca aaccgtcagc 7681 gaagactttg accatctgct actgtgcaaa gaagcaatgg aggcagatgt cgtgattaac 
7741 ctacccaagg taaaatctca tgcccaattg gtactaactt taggggtgaa aaatctattt 7801 ggttgtgttc caggcaagat gaaagcttgg tggcatatgg aagcaggaaa agacgcaaat 7861 cgatttggtg aaatgttggt cgaaactgct aaagcaattc atcctaactt aactatactt 7921 gatggcatta ttggtcatga aggcaatggt ccgagtggtg gtgaaccccg tccactgggg 7981 attttagcag catcaccaga tgtttttgct ttagatcgtg ctatggtgca aatcctcaat 8041 gttccagcag aacaagtacc cactgtcgct gcatcaatac ggctggggtt agttggtgaa 8101 ctgaacaaca tcaattttcc tcacctacat cccgacttac taaaaataga tgattggcga 8161 ctaccagaca agtttttacc cattgatttt ggaatgcccc gcgtcatcaa gtctacgttc 8221 aaacatcttt acataagatt tatcaaagaa cctatgagtg catatgccaa acgatagttg 8281 catttagcgc atcatataag caacaagtgc aaagtctgtt acaacaattt cacgctttgg 8341 cagagtaagc tgtaaatacc ctccgattgc ttaaccaaac cgccctcatt aagtggcggt 8401 ttttttattg cttaggcttt cgttatcaga ggtaaagcac attgagataa cacattttca 8461 gttgtgtttg gatgtggagt cattctttgg gtttaagtta aaagagttag aagaaatgat 8521 taaattttct ttcgacacgc tactttgaca ccccaaaaaa tttgtcctac gattgaccca 8581 tgaaaacatt ttttgggtgc aattactgag gaattattta catgacagtt catttctact 8641 tgcaaaaggt tgtctagtat catgttgcta gtatccagca gattgcaagt taatgaggca 8701 caccttttca actcaagagt ccagatagag agcaaatttt acttctgact tctggctcag 8761 gaattgctga attctgtatg agcacgcacg cccagagggc taacgcgtag cgtgtccgca 8821 agacatatta tgctgtatat tttgcaaagg cacgctgtcc cttagagtga caagcaagaa 8881 taatttgata tcatttatct atgaatatca tcttgccaaa aacagtcatt tttcatacta 8941 tcattttcag tgtttcctaa aatggttacg aaattaagtt aaatgagtct gtgaaattaa 9001 tgacaaagta aaacttaaaa atttcaactc caattttgta tcaagcgtct catctacaga 9061 aacaagtata gataacttaa gaaatgattg atattatttc ttaatagata gtataatttg 9121 aaaaaatagc taaaatagaa gtaagactct tcgcaaaaga gtcaaaacaa aataataagt 9181 gtcaaaaact tcaacttttt atgcattcaa atacccaatc tgctgaccct gccagtacat 9241 attcccatag gttggcagat attgtaggaa caatgatcgg tcttttaact ctgactctac 9301 ccctgtttgt tattggccac tattcttcaa acagtgttca gaatcatcag caaccaatta 9361 cctacaatat aaaagcagat gcagattaga caaaacttga ctcttgaacc gcagttgcgt 9421 
gactgttggc agagtcgtcg gcagtttgca tgcacaaaaa gggcttcgag attccctgtg 9481 agaatgagat agacgactcc tcaaccatct cttcccgtga caacctacaa ttgccatagg 9541 ttagagatat atttgtcgtc ggcgtccggc ttatgtccca ccgctttgtt tgagtacaaa 9601 tatagcgggg gacttacgcc aaatgagtta acggtggcag tatgaaataa cagtagtcca 9661 ttttagccta gaattataca ctgggagcta agcatgaagc accccctaat tcaccgttaa 9721 ctgaggactt ttgttgtgga tttatcccgt attcctgccc aacctaaacc aggtctaatt 9781 aacgttctaa ttgaaattcc aggcgggagt aagaacaaat acgaatacga taaagaactt 9841 caagctttcg ccttagacag agtactatac tcgtcggtac aatatccata cgactacggc 9901 tttgtaccca acaccttggc agatgacggc gatccattgg atggtatcgt gatcttggat 9961 gagccaagct ttccaggatg tgtcattgca gcaagaccaa tcggtatgtt agagatgata 10021 gacggaggcg atcgcgatga gaaaatcctt tgtgttcctg ataaagaccc acgttacact 10081 tccgtaaaat ccttgaaaga tcttgcacca caccgacttc aagaaatagc tgaatttttc 10141 cgcacatata aaaatttaga aaaaaaagtg acggaaattc tcgggtggca agatgttgat 10201 caggtcttgc ctttggtaga aaagtgtatt agagctggta gtggaaaaag ttaagactca 10261 cattcacaaa aaacttgaaa catttgcaat actccagcgt cagttaatta atgaacagca 10321 atctattgat tattgtctaa actgataact ggtgagcttt aaggtatctc ctatgcctat 10381 ggctaacgcc attcaaagca gcgtctaggc aggagataca tcagacatgc tcgtcgcaag 10441 gcgataagct gcgcatttgc ccgattcgta cgctttgccc taaggctaac gccagtttct 10501 tgtgtccgga aaccctcaca aacaacacca gtctcaccag tacctgtgtc aggaaacagc 10561 ggtgcaaaac tggactcacc agttttctac gaaaaaacaa ccttcaaaag acactgataa 10621 ctgatcactg ataactgatc actgataact caaacccaat gcagcgcacg cttcttttgg 10681 caaaaattca taactgcacg ctcacagcaa caaatattaa ctatgtagga agtatcagca 10741 tagaccaagt tctgttggat aaagcgggta tcttatggta tgagcaggtg cacgtagtga 10801 atgtttccaa tggcgagcga tttatcactt atgcaattcc ggctgcaccg aattcaggag 10861 caattgaact gaatggggca gcagcacgtc taggcatgag tggcgatcgc ttaattataa 10921 tgtcttacgc gcagttcaat tcggaagagt taaaaatata ctctcccaca gttgtcattg 10981 tggacaacag aaaccaactg ttagaagtgc ggcgctacga tgacctgctt agtcgtcaga 11041 cctagtttcg agcaaatatg tcaaatttta agcagtcgcc ttcccagaag 
cagccaactc 11101 cagaaccact gcctcaaatt ttgagtccag agttggagtt tgtggtgcaa ttttggggtg 11161 tacggggttt aattcccaca ccagatagcc acaccatcca atatggtggt aatacagctt 11221 gtgtggaaat gcaaatagct ggaaaacact tagtttttga tggcggtact ggtttacggc 11281 tacttggaaa aaattggctg caacaacaat cacccttaaa tgcccattta ttttttacca 11341 actcccaatc aaatcgtatc caaggttttc ctttttttgc tcccgcattc aagagcgaaa 11401 attgcttcca tatttacgga acagccgcct taaatggagc atcaattaag caatgtttgt 11461 gtgatcaaat gcttcagcct cactttcctt accctttaca gttcatgcaa tctgaattgc 11521 actttcataa tttgacatca ggtaaggtgg tgaatataga tgatcttacc atcacagcgg 11581 caatcattaa tcacgagcaa aaatcaatag gttatagagt cagttggaaa gactacagtg 11641 ttgcctatct cacagattta tctaagattg caaatgaagc ggatcgaaaa tgcatagcgc 11701 agcttgcaaa aaacgttaat ttgctcattg ctaatgccac ttatacaact cccgcagcac 11761 ccaatcataa cgaacctgac tttcactggg gtactgctgt cgatttggcg aaaactgcta 11821 aggtgaacca gttagttatt tctcatcatc gtccagatga ccacgatgat tttctcgacc 11881 aagttcagat tgaagtacaa tctgtcttcc ctaaagcatt actagctaaa gaaggtttag 11941 ttttagctat tgtatagtaa atttttgata acagggaact cttaacaggg aactcttata 12001 ccaatttaat atgaagctgc atagaaaaga gcttctaaaa taaagttata agaggagata 12061 gatattacct gctcgaatgt taattgctcg aatagttgtt gtatgcgagt gcgtaagtct 12121 tgcaaagaag aaaaaagttc attcttgaga gattttttga gatactgcca cagcctttca 12181 atgggattga gttccgggga atgaggtggt tgaagcaggg gaataatatt ttccggccaa 12241 ttaatcgcac cactgatatg agcaggagct tggtctattt gtaaaattgc ataatcttca 12301 cctaattgca caagatacca gtctaaaaac tgttgaaaat attcaccatt aagtttgggg 12361 taaacttgag tgaaaatgtt gtcctgtgaa tggctcaatt gcaccataaa tccaaaaatt 12421 tttcctctgc catttcacgg atacagtagg tttgactcca gaagcagtaa tcacttttcc 12481 cgttaaagtt tttaatccca ccctggtttc atcctgacac agataatgaa tacattttcc 12541 tggtgctaga tgtttttcta ggatattgag gatgataccg agtttttttg aaactcagat 12601 actaacttct catcctgctt atggctttgc ggacgtggta ctttgagttt tgctcctaat 12661 ttatatctca ctaacgcata cactgttgca tactctactt ccagtccctg ttcttttttt 12721 aaccactcca ctattgcgcc atacctacta 
aagccttttc ctgtttttaa cttttcctca 12781 agtgctgcga tcgcacgctc gttaattttt cgttttgctc ctggagcttt tttaatttct 12841 aataattcat ctagcccccc tgacctatac ttttgtatcc accttgtcac cgttgaagta 12901 tctttcgcca agcgttttcc aatgtcttgc tgctcctgaa cctgaccact tttgatccac 12961 cacagcatca taagtttttc tttctgcttc cctaagttgg ctgtttgtaa acgtttttta 13021 agttcttctt cgctctctgc gatttcaatc ttaaaagggc ggctcatggt tattttaatt 13081 tatcgagttt gtttcttcta ttatatgcag tttcatatta aaatggtatt aacagggaac 13141 tcttaacagg cttgatggtc tggtggtgtc cgtatcttct catgagttca tatcctaatc 13201 tacctggcaa cggctatatt tgaaataaaa atctttgaaa tttttgtaaa aaattggaaa 13261 ttgtacacaa aactcatgtt tttaaatgtg atggcaattt cacaaaaaaa tgcgacaggt 13321 cttttggtta gagagacccc cgtagatagg ttttttgtag tggaaactct gatttaacgc 13381 tgttccgcga aattcaagag tccgccgtgc ggtgggcgag gttgtcg // LOCUS NODE_2532_length_13358_cov_4.63241413358 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13358) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13358) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13358 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(42..1874) /locus_tag="DP116_20775" CDS complement(42..1874) /locus_tag="DP116_20775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310038.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_20775" /translation="MRIRKKRNGLRQSLAVFRYSGRAISLVWTTSRSLTIIFASLTVL AGLLPAAIAYIGKLIVDAVVSISQQISYKSDFVNNYRPLIYVGLEAIAVALLAASQRG LTVCQSLLRMLLGQRVNVLILEKALTLDLTQFEDSEFYDKLTNARREASIRPLSLVNR TFGLVQSALSLVTYGVLLVKFSFWAVLMLILAAMPAFFAETKFAGEAFRLFSWRAPET RQQHYLETLLAREDFAKEVKLYQLGDMLLERYHSLFNRLYGEDRDLTLRRGWWGYLLS LLSTVAFYIAYAWIVVETVVGRISLGDMTMYLTVFRQGQSTFSSALTSIGGMYEDNLY LSNLYEFLEEEVPKPKGKATKGLKPKDGIRFENVSFTYPGSLQPALKNISLHLKPGEK LAIVGENGSGKTTLIKLLTQLYTPDSGRILLDGLDLQEWDVDVLRRRIGVIFQNFVRY QFTVGENIGVGDVDYVEDKNRWMSAAEKGMALPFIEHLPEKFQTQLGKWFRDGQELSG GQWQKIALARAFMRTHADILVLDEPTSAMDPQAEFDIFNHFRALTKNQMVFLISHRFS TVRMADKIVVIEAGEVVEQGTHEELLQAQNRYATLFSLQAAGYQ" gene 2079..2342 /locus_tag="DP116_20780" CDS 2079..2342 /locus_tag="DP116_20780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459520.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3146 domain-containing protein" /protein_id="PRJNA477356:DP116_20780" /translation="MSAKYLPETTAHVRITRQSWQDGFLEGEVRAGDFEWQFQWHFRR GELSVKPYKGRALIKEPLGRFLEQRDYQLEPGGDYAFTIRAQL" gene complement(2399..2851) /locus_tag="DP116_20785" CDS complement(2399..2851) /locus_tag="DP116_20785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315186.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="resolvase" /protein_id="PRJNA477356:DP116_20785" /translation="MTSREFLPTQPAILGFDPGRDKCGLAVMGLDRQVYYHEVVLAAE AIAAIMSLRQKFPISLLVMGDQTTAKRWKQELHNQLADPLNIILVDERYTSLEARDRY WQMYPPKGLAKLLPQGLRQPPRPIDDIVAILLIERYLNRLTESVVNNE" gene complement(3284..4777) /locus_tag="DP116_20790" CDS complement(3284..4777) /locus_tag="DP116_20790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3084 domain-containing protein" /protein_id="PRJNA477356:DP116_20790" /translation="MTTGYILIAAILILGGVIATVGDRIGTRVGKARLSLFNLRPKNT AVLITILTGTLVSASTLGILFAADDGLRTGVFELENIQKDLRRKREQLNTTAKQLEDT KGELEQARKEQKAQQDRLQQTNQSLREANAKQQETQTQLNRTISQQAQTQAQLQGTQN QLSQAVASYKQAIAELQSVENEKKKLLVEIEQRKVERQRLYEEARKAISQAKTAIEKR DRELANRQQAVQQRDQKIAELDQLIKKRNTAIAAREQVIAQRESRLKELETQQDSLEQ EVARLEKYYQSYRDLRLGKLALVRGQVLATGVVHVEQPVLGRQLIVQLLQEANRNASI ELSEPGATNQANLQILQVTNEQVDQLTKQINDGREYVVRIFSAGNYVRGEKQIAFFAD AARNQVVFTGGQVIATTSADPKTMTSFQLRQRLEQLISASQFRARNAGILESIQIDGT FIRFIAQLRQYNQPIDIKAIAAEDTYTAGPLKVKLIAIQNGQIIFST" gene 4920..5120 /locus_tag="DP116_20795" CDS 4920..5120 /locus_tag="DP116_20795" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20795" /translation="MNTYYLSHLQHSAKPRLQLANPWDHLLAVYTHQATGGSHAVASA TFNLLFKHLQGSLARENPTNLA" gene complement(5435..6106) /gene="ntcA" /locus_tag="DP116_20800" CDS complement(5435..6106) /gene="ntcA" /locus_tag="DP116_20800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008233817.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="global nitrogen regulator NtcA" /protein_id="PRJNA477356:DP116_20800" /translation="MLVTQDKALASVFRQMATGAFPPVVETFERSKTIFFPGDPAERV YFLLKGAVKLSRVYEAGEEITVALLRENSVFGVLSLLTGNKSDRFYHAVAFTSVELLS APIEQVEQALKENPELSMLMLRGLSSRILQTEMMIETLAHRDMGSRLVSFLLILCRDF GVPCADGITIDLKLSHQAIAEAIGSTRVTVTRLLGDLREKKMISIHKKKITVHKPVTL SRQFT" gene 6817..7593 /locus_tag="DP116_20805" CDS 6817..7593 /locus_tag="DP116_20805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209832.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="enoyl-[acyl-carrier-protein] reductase FabI" /protein_id="PRJNA477356:DP116_20805" /translation="MLDLTGKKALVTGIANNRSIAWGIAQQLNAAGANLGITYLPDEK GKMEKKVAELVEPLNPSLFLPCNVQNDEDLKSTFNAISDKWGSLDILIHCLAYANKED LSGDFSQTSRSGFNLALEISTYSLVQLAGYAKPLMTQGGSIVTLTYLGGVRAIPNYNV MGVAKAGLETSVRYLAAEMGPQNIRVNAISAGPIRTLASSAVGGILDMIHHVEQVAPL RRTVTQLEVGNAAAFLCSDLASGITGQVLYVDAGYEIMGM" gene 7757..8380 /locus_tag="DP116_20810" CDS 7757..8380 /locus_tag="DP116_20810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015144859.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="imidazoleglycerol-phosphate dehydratase HisB" /protein_id="PRJNA477356:DP116_20810" /translation="MIPSQHEQLPGGARIAEISRRTSETDVYVRVNLDGTGVCNAATG IPFLDHMLHQISSHGLIDLEVRATGDLHIDDHHSNEDVGITLGQAISLALGDKKGIVR FGHFVAPLDEALIQVALDFSGRPHISYGLQIPTQRVGTYDTQLVREFFVALVNHSQMT LHIRQLDGMNSHHIIEATFKAFARATRMAVEIDQRRAGTIPSSKGML" gene 8538..9515 /locus_tag="DP116_20815" CDS 8538..9515 /locus_tag="DP116_20815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="multidrug ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_20815" /translation="MMSVANTPESQLNTTSIPPVVLTSELEKVYRTGFFLNQKVVSLK SCSLKVYKGETFGLLGPHGAGKTTLLKLLLGMIRPTSGRGLLLGRPLGERTLKERIGY LSENLSFSQYLTGWEFLQVCAGVFKISTKVQRQRIPQLLELVGLSQADARKKLLRRYS KGMLQRVGMAKALINDPELLLLDEPMSGLDPVERYQMRDIIIAIKAAGKTIFLNSHVV NEVEQICDRVAILSQGELISSGSVDQLLGTNHLYHVKGYGGNWEVLKKWVPNLEYQPD GSWQGKVQGDVYDFLASFRLMGGQLIGVNLSRPSLEDFFLEQIQRHNQS" gene 10235..11869 /locus_tag="DP116_20820" CDS 10235..11869 /locus_tag="DP116_20820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315180.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polysaccharide export protein" /protein_id="PRJNA477356:DP116_20820" /translation="MENTRLKFISQPFGGVVLLTAINFAFPLTSLAQNPTKQSTTQLN NYTSGQALPTGTQRNTKAPRQLVPSATQVKKNYTQGQTVPAATTPIDTKYTQRQAVPG GTQVNTNYTLGGGDRVRVSVFEVPEYSGEYQIPPGGGLNLPLIGSVSVLGLTTEEAAE LITQKYSRFLKRPIITINLLSPRPINIFVAGEVTRAGAYSLNLQAGGGNAPSAQFPTL LSALTTAQGITQSADETRVELRRRIGQGPEQVSVVNLKEIRQTGRIRQDLTLRDGDTI YVPTATTIDLADARNLASSSYAADPTKPRTVALIGEVLRPGSYLVSEGASEGGNANNT TGAPSITGQPTLMRALQLAGGITPEADVRNVAIRRRTRTGNEQSIKVDLWKLLNSGDI NQDVILQDGDTIVFPTATEVSRAEATQLATTTLAPSRIQVNVVGEVKAPGVKQVQPNT PLNQALLAAGGFNDARASRTTVDLVRLNPDGSVTKREVKVDIKQGINEQTNPILRNND IVIVNRSGIAKTGDAVGAFFNPFGTVLGIFRSLFGF" gene complement(11960..12232) /locus_tag="DP116_20825" CDS complement(11960..12232) /locus_tag="DP116_20825" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20825" /translation="MPCANIYQAQTAYELPISPLGTLYQFKIRNPEKYACGTLRYQNE ESQTSQGFENVYLSYSFFKLVLSKQNYKFKINNLHIFLLGIQSSKK" gene 12196..>13358 /locus_tag="DP116_20830" CDS 12196..>13358 /locus_tag="DP116_20830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315161.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dienelactone hydrolase" /protein_id="PRJNA477356:DP116_20830" /translation="MQFELGKYWHKASWAQKLLCPLMLVLGWGVGMPSTIAAQTVTIR LGPFQQSVAIRDLEKFAKTGKLPEGLELLSPVLNSQVRELLTKRLEVNPAVADRFIDN LVRSPGGRQFISSLGGVIPGSTTESIKATLNFALRQANGFSALSFLRAYPGENITVDA TKAVGLAVDFNPNNLQSQAFGLLLERELSVESSIPFKAAFDPAALGNQVVQQQTIVLN DQQRNRIIPVDIYWSQNSSQGNTENPLVVLSHGFGANRRFLSYLARHLASYGITVVAV EHPGSNVDAVGRASDKQNLAQLLAGTEFIERPKDITFVLDELTKLNTQIGQLQGKFNT EKVTVIGHSLGGYTALALVGGEVNLEELRQFCKDSLNIGEAPGDWLQCSAASLR" BASE COUNT 3830 a 2920 c 2812 g 3796 t ORIGIN 1 cgcaatgact atagttacgt tgttcgtgcg taagtcctgc tctactgata tcctgctgct 61 tgcaacgaaa acagtgtcgc atagcggttc tgtgcttgta acaactcctc atgtgttcct 121 tgttccacaa cctccccagc ttcgataaca acaattttgt cagccatccg caccgtagaa 181 aagcggtgag atattaaaaa taccatctga tttttcgtca gggcgcgaaa gtgattgaaa 241 atatcaaatt ctgcttgggg gtccattgct gatgtcggtt catccaaaac caaaatatct 301 gcatgagttc gcataaaagc acgcgcaaga gcaattttct gccactgacc tccagagagt 361 tcttgtccat ctctgaacca tttcccaagc tgagtctgaa acttttcggg caaatgttca 421 ataaaaggaa gcgccatgcc tttttcagcc gcactcatcc atcggttttt atcttcaacg 481 tagtccacat cccccacacc aatattctca ccaacagtaa attggtagcg gacaaagttt 541 tgaaaaatca ccccaatgcg tcgccgcagc acatccacat cccattcttg caagtccaaa 601 ccatctagta agatgcgtcc agagtcagga gtgtaaagtt gagtcagcaa cttaatcagc 661 gtcgttttgc cagaaccatt ttcaccaaca attgctaact tctctcccgg tttcaggtgc 721 aaggaaatat ttttcagcgc tggttgtaaa cttcctggat aggtgaatga tacgttttca 781 aagcgaattc catctttagg ttttaagcct ttcgtagctt tgccttttgg 
ctttggaact 841 tcctcttcca gaaactcata aagattagaa aggtacaagt tatcctcata catccccccg 901 atagaagtca aggcactaga aaatgtagac tgtccttggc gaaacaccgt gagatacatc 961 gtcatatctc ccaaggagat tctgcctaca actgtctcca ccacaatcca ggcataagct 1021 atgtaaaaag ctacagtact caacaaactc agcagataac cccaccatcc tcgccgtagt 1081 gtcaagtcgc ggtcttcacc gtagagtcta ttaaataggc tgtggtaacg ttccagcaac 1141 atatccccaa gttgatagag tttgacttcc ttggcaaagt cttctctagc caaaagggtt 1201 tccaggtagt gctgctgacg ggtttccggc gcacgccaac taaataggcg aaaggcttcc 1261 ccagcaaatt tcgtttccgc aaagaatgcg ggcatagctg ctaagatgag catcaacacc 1321 gcccaaaatg aaaatttcac cagcaaaacg ccgtaagtca caagtgacaa cgcgctttgc 1381 accaagccaa aggtacggtt gaccagagac aagggacgaa ttgatgcttc tcgtcgtgcg 1441 ttggtcaatt tgtcataaaa ctctgagtcc tcaaactgtg tgaggtcgag tgtcagggct 1501 ttttctaaaa tcagaacgtt gactcgttga ccaagcagca tccgcaacaa agactggcaa 1561 acggtgagtc ctcgctggct agccgccagt aaagcgacgg cgatcgcctc taaacccaca 1621 tatattaaag gacgataatt attgacaaaa tcactcttat atgatatctg ttgagatatc 1681 gaaaccacag catcgacaat taacttacca atataggcta tagctgctgg taaaagacca 1741 gccagcacag ttaaactcgc aaaaataatg gtcagcgagc ggctagtcgt ccacaccaag 1801 ctaatagctc gtccactgta ccggaaaact gcgagagatt ggcgcaggcc attgcgtttt 1861 ttacgaatcc tcatattctt tttatactgt aaaatcgcgc ctgattgcag cgttcactcc 1921 tatgtcttcc agcagaaatt catcctgagt tgcatttaaa gacaataggc aggatgaggg 1981 atcatatcca aatacaccaa ttggagtatt tgaggtactg tgaataaatg taaagctata 2041 acaagttagc tttatttttc aacttgacat acagcatcgt gagtgctaaa tatttaccag 2101 aaactacagc ccatgtcagg attacccgtc aatcttggca agacggattt cttgaaggtg 2161 aagtgagagc gggtgacttt gaatggcagt tccaatggca tttccgccgg ggagaacttt 2221 ctgtcaagcc ttataaaggg cgtgctttga tcaaggaacc cctcggtcgc tttttagagc 2281 aacgagatta ccagctcgaa ccaggaggag actatgcttt cactattcga gctcaactgt 2341 aaagaaaaaa gaaacgaaga tttatcaagt aaacatgttt tataatactt aaccttaatt 2401 attcattatt aaccactgat tcagttaaac gattgaggta tctttcaatt aacaggatag 2461 ctacaatgtc gtctattggt cgtggtggtt gtcgcaagcc ttgtggtaga agcttggcaa 
2521 gtcctttggg aggatacatc tgccagtagc gatcgcgcgc ttctaagctg gtgtaacgct 2581 catctactaa aatgatgttc aatggatctg ctaattgatt gtgcagttct tgcttccagc 2641 gttttgctgt ggtttggtca cccatgacga gtaaagaaat ggggaacttt tgacgcagcg 2701 acatgatagc ggcgatcgcc tcagctgcta acacaacttc gtgatagtac acttgtctat 2761 caagtcccat cacagctaaa ccacacttat cacgacctgg atcaaaaccc aaaatagctg 2821 gttgggttgg taggaattca cgggaagtca taataaaaat ttaaaattag gaagaacgca 2881 gaaggcagaa cgagagaatt ctccacaaat gaaatatatt ctatgtaaga aaaaattctt 2941 ttttctccat aaatgaattc aggcgcaaga gagtcgcaac tcacgcacca aaactccgct 3001 cctattgacg cgcgatttct aaatttcttc tgactactga gttttaacca ttttttgggt 3061 caaaagtcaa cagcttttga actgtggact tttgattata gattatgaaa acttaacaat 3121 tagtaatata taagactcct atttgatttt tgaaaaaaac tcggtacacc tttattcctt 3181 cttcgatgtt aagagttaag cgttaagagt tccctgttcc ctgttccctg ttccctgttc 3241 cctgttccct attccctgtt ccctattccc tgttccctag tagttaagtg ctaaaaataa 3301 tctgcccgtt ttggatggct atgagtttga ctttcagtgg tccagctgtg tatgtatcct 3361 ctgcagcaat agctttgata tctattggct ggttgtactg tctcaactgg gcgataaagc 3421 gaatgaaagt tccatctatt tgaatacttt ctaaaatacc agcattacga gcgcgaaatt 3481 gcgaagcaga aatcagctgt tccagccgct gtcgcagttg aaaagatgtc attgttttgg 3541 gatctgcgct tgttgttgct ataacttgac ctccggtaaa aacgacttga ttgcgtgctg 3601 catcagcaaa aaatgcaatc tgcttctctc ccctgacgta attgccagca gaaaaaattc 3661 ttaccacata ttctcgacca tcattaattt gcttggttaa ttgatcaacc tgctcatttg 3721 tcacttgcag gatctgcaaa tttgcttgat tggtggcacc aggctcactt aattcgatac 3781 tagcgtttcg attagcttcc tgtaaaagtt gaacgatcaa ctgacgcccc agaacaggtt 3841 gttccacatg aacaacgcct gtagcaagaa cttgaccgcg aacgagtgcg agttttccta 3901 aacgcaggtc acggtaagac tggtaatatt tctccagcct tgctacttcc tgttccagag 3961 aatcttgttg tgtttccaat tctttgaggc gagattcccg ttgagcaatg acttgctctc 4021 gtgctgcaat tgctgtgttt cgctttttaa tgagttgatc cagttcggca attttttgat 4081 cccgttgttg aacagcttgc tgtcgattgg ctagttcgcg atcgcgtttc tcaatagctg 4141 ttttggcttg cgaaatcgct tttcttgctt cctcatacag tcgttggcgt tctacctttc 4201 
tttgttcaat ttctaccaac agcttttttt tctcattctc aacgctttga agttccgcta 4261 tcgcttgttt atagctagcg actgcttgac ttagttggtt ttgtgtgcct tgaagttgag 4321 cttgagtttg agcttgttgg ctaatagtac ggttcagttg agtctgagtc tcctgttgtt 4381 tagcgtttgc ctcccgcaac gactgatttg tttgctgcaa gcggtcttgt tgagcctttt 4441 gttcttttcg agcttgttct agctcacctt tagtgtcttc tagctgtttt gctgttgtgt 4501 tgagttgttc gcgcttgcgc ctgagatctt tttgaatatt ctctagctca aacactcctg 4561 tgcgcaatcc gtcgtcggca gcaaataaaa ttcctagggt tgatgccgag accaacgtac 4621 cagtcagaat agttatcagt actgccgtgt ttttcggacg caggttaaac agggagaggc 4681 gggctttgcc aactcgtgtg ccaatgcgat cgcccacggt ggcaatgacg cctcccaaaa 4741 ttaaaattgc tgctattagg atgtacccgg tggtcatctt ctcctaccga cgaaatcctt 4801 ccagatacag cctactactt ccagtcattc tctgtgggaa attagtgtaa gtcaagcatt 4861 gacatgacca aatcttcacg agtctttatt cctaattcca acagaactca acagacgcca 4921 tgaacactta ctatctcagt cacctgcagc atagtgcaaa gccacgttta caactggcaa 4981 acccttggga tcaccttctt gcagtttaca ctcaccaagc aactggcggg agtcacgcag 5041 tggcttcagc tacctttaat ttgcttttta aacatctaca aggaagcctt gcacgggaaa 5101 atcccacaaa ccttgcatga aaaagatttc tttccttcat cgttctgagt caggagttgc 5161 tgagttctat tttcaagcta ggaaactatg catggcagtt gcttacagag gttgatacac 5221 gcctggtacc tcctgaacag aactcttgca acaaggaggg cattgccgac gttaatccgc 5281 tttgggttgc aaggaatgac ctaaaaggtg tcaattcgtc ttgcaataag gagggtattg 5341 ccgatgtcag ttgtagtaag tgacgtttat gagtcatttt ttctgacttt tttatctaac 5401 tcagaaatga ctgttcatag ctagtgaact ttgtttaggt aaattggcga cttaaggtaa 5461 caggcttatg aacagtaatc tttttcttgt gtattgaaat catctttttc tcacgcaagt 5521 cacctagtaa tctagtgaca gtcacgcgag tcgaaccaat tgcttcagcg atcgcttgat 5581 gagaaagctt gagatcaatt gtgattccat ctgcacaagg aacgccaaaa tcacgacaga 5641 gaattaacaa aaaactcacc aatctagaac ccatatctct gtgagcgagt gtttctatca 5701 tcatctctgt ctgtaagatc cgcgaagaca atcctcgcag cattagcatc gacagttctg 5761 ggttttcttt aagcgcttgt tccacctgtt caattggtgc tgagagcaat tctacagatg 5821 tgaaagcaac cgcatggtaa aacctgtctg acttgtttcc tgtcagcaag gataagacgc 5881 cgaaaacact 
attttcccgc agcagtgcta cggttatttc ttctcctgct tcatacaccc 5941 tggagagctt aacggcacct ttcaatagaa aatacactcg ttcggcggga tcgccgggga 6001 aaaagatggt tttactacgt tcaaatgttt ccacaaccgg cggaaacgct ccggtcgcca 6061 tctgacgaaa aacacttgct agggctttat cttgtgtcac gagcatttac cttcccctac 6121 ccaatgccgg aaaaattaaa atcgatttcc taagaataca ccggaaaaat caagaacaca 6181 ttgtcaatgt ttttgtactt tcctatactt aatctcactc actcatatag taattctaaa 6241 ccatttgtga aatttcccat aacgtcacct gtaatttgtc tagtgtctag agtccaaagt 6301 cataacaagt cacaaagaaa cgcaacttgt tttaacagag ggaatcacca agggcgagat 6361 cctcaagatg attctaacga gcagtcttac aacttaaatt ggattgctct gtttatttgt 6421 tcatcatgat acataatttt ggatgtttag atctagtcct ttttcaaagt taattacaaa 6481 agtttttgaa gaactttctg gaagtaaaaa aaatagacac actcatcctg tagaaggtgt 6541 aaaactgtag tcagcagctg gtctatggta gatagaaaaa aattctattg actacctaat 6601 acctactatc tactaccata catttcccat ccactgttta ctcctaaatc atttatcact 6661 actatgtttt gagcatatat tttaagaatt gaacgatgat tttgtggtat caaaagctga 6721 ctccagaatc gtgtttacaa atagtaccca aaacaaagac aaatcggtgt cagattctag 6781 tatgctcaag caggagtgtt cacattaaga attgatatgc tagatctgac cggaaaaaaa 6841 gccttagtaa cgggaattgc aaacaaccgc tcaatcgcct gggggattgc ccaacaactg 6901 aacgcagcag gagcgaacct gggtattacc tatttgccag atgaaaaagg caaaatggag 6961 aaaaaagtcg cagagttggt agaaccgcta aaccccagct tgtttcttcc ttgtaatgtc 7021 caaaacgacg aagatcttaa gtctaccttc aatgcgatta gcgataagtg gggatcttta 7081 gatatcctta tccattgcct tgcctatgcc aataaagaag acctgagtgg agattttagc 7141 caaacatccc gttctggctt caatttggct ttagaaatca gtacttactc cttagtgcaa 7201 ctcgctggat atgctaaacc cttgatgact caaggaggaa gtatcgtcac tctcacctat 7261 ctaggaggtg tgcgggcaat tcccaactat aatgtgatgg gagttgcgaa agcagggcta 7321 gaaacgagtg tgcgatacct agctgctgaa atgggaccac aaaatatcag agtgaatgcc 7381 atctctgctg gtccaattcg tactttagcc tctagtgccg taggaggaat cttggatatg 7441 attcatcatg tggagcaggt ggctcctcta cgacgcactg tcactcagct agaagtgggc 7501 aatgcggcag ctttcttgtg tagtgacctg gcaagtggaa tcacagggca agtcctttat 7561 gtggatgcag gatatgaaat 
catgggtatg taacgttaat attgctattt gctaaccaat 7621 aactactatt tgtgtattta ccattagtta ttaggtatta gcaaatggct ctgacccatt 7681 actccataat taagcataag ccaggttctg tatgcaaata agcgattcct ccagcacaga 7741 ctccgtccta cgtcaaatta tcccctcaca gcacgaacag ctccctgggg gcgctcgtat 7801 tgccgaaatt agccgtagga caagcgaaac ggatgtgtat gttagagtca acttggatgg 7861 gacaggagtt tgcaatgcag caactggcat tccctttttg gatcatatgc tgcatcaaat 7921 ttcctcccac gggctaattg acttagaagt tagggcgaca ggagatttgc acatcgacga 7981 tcaccacagc aacgaagatg ttggtattac cttgggtcaa gcaataagct tggcccttgg 8041 cgataaaaaa ggtattgtcc gctttggtca tttcgttgcg cctcttgatg aagcactcat 8101 tcaagtggcg ctagattttt ccggacgtcc ccacatcagc tatggcttgc aaattcctac 8161 ccagcgagtg ggaacttatg acacccagtt agtgcgcgaa ttcttcgtag ctttggtaaa 8221 ccatagtcaa atgacgctac atatccgcca attagatggt atgaattccc atcacattat 8281 tgaagcaaca tttaaggcct ttgccagagc aacgcgcatg gcagtagaaa ttgatcagcg 8341 tcgtgctggt acaattccca gttctaaggg gatgttataa actagcacaa gcacggggag 8401 tcattactac tccctactcc ctctctcatc tcccccactt gctattgagg ggagattagt 8461 cttaatggtg ttataattac cagtcaactc gctcatctgt cattacatct tattttatct 8521 gttgtattaa gcacaacatg atgtctgtcg caaacacccc tgagtctcaa ctgaatacca 8581 caagtattcc gccagttgtt ctaacctctg agttggaaaa agtctatcgc acaggctttt 8641 ttttgaatca aaaggtcgta tctctcaaaa gctgttcttt gaaagtttat aagggggaaa 8701 cttttgggtt acttggacca catggtgctg gtaaaacaac tttgttgaaa ttgttgctgg 8761 gaatgattcg tccaacatca ggacgaggat tgcttttggg cagaccatta ggcgagcgta 8821 ctctcaaaga acgcatcggc tatctgagcg aaaatctctc tttttctcaa tatctcacag 8881 gctgggaatt tttgcaggtt tgtgctgggg tatttaaaat ttctacaaag gtgcaacgtc 8941 aacgcattcc tcaactcctg gaactggtcg ggttatccca agctgacgcc cgcaaaaagc 9001 tgcttcgtcg ctactctaaa gggatgctac agcgtgttgg tatggcaaaa gcactgatta 9061 acgacccaga actgttgttg cttgatgaac cgatgtcggg tcttgaccct gtggaacgtt 9121 accaaatgcg agatattatc atagcgatta aagccgcagg taagacaatt tttctcaaca 9181 gccatgttgt gaatgaagtt gaacaaattt gcgatcgcgt cgctatcctt agtcaaggag 9241 aactcatttc ctctggttct gttgatcaac 
tcttaggaac aaatcacctg tatcatgtca 9301 aaggttacgg cggtaattgg gaagttctta aaaaatgggt tcccaatcta gagtaccaac 9361 ctgatggttc ttggcaaggt aaagttcaag gcgatgttta tgattttctt gccagttttc 9421 gtctcatggg aggacaactc attggtgtga atttgtctcg tccctctttg gaagattttt 9481 ttctggaaca gatccaaaga cacaatcagt cttagatcat ttcatacgta tgttagtcgg 9541 cttgtttttc ctcattctga ctaactgtaa ataattctag caagaaaatg ggctgacaaa 9601 aaacgcagcg gcgcgggagt tggaagaaag tttgtttcgt tatggtgacg aattcatcgt 9661 gacgactact gattcaaact ccttaacttt agtctcctca tcgtcttcag tcacagattt 9721 ttttgcccct tacaaataac agatggctga gacaaaaaat atgcactcct caaacgattg 9781 tctcttctta gtaagatttt gatgctatag cagttgcgag gtagattagg acatgaacta 9841 atgagaaaat acggacacaa ggatactttc cagcctgtta agagtttcct gctgtacgtc 9901 tttttctgct ttgatgacac gctctcttaa ctaataaatc ctacttttat tacacgtcag 9961 atttgctagt caaaaaaaat tttgttaata ctactgtacg aagagagtat ttctcagata 10021 aaatgtcatg aaaaccacaa tattgtaaaa gcaagtatga taaagtttca ctatttaaat 10081 gaacttcact aaaaattcaa atgaaataaa caattatcat tcggaaaaac aagatttttt 10141 aagaaaataa aagtgttaag gaacttgaag aaaagacaag agtcaagctt aaaatgttaa 10201 gacagtactt tactaatagt agtttcaggt aaatatggaa aatacaaggt taaagtttat 10261 cagccaacca tttgggggtg tggtactgtt aactgctatt aatttcgctt tcccgcttac 10321 tagcttagct cagaatccaa ccaaacaatc aacaacacag ttaaataatt atacatcagg 10381 acaagcccta cctacaggga cacaaagaaa tacaaaagct ccaagacaac tggtaccatc 10441 agcaacacag gtaaagaaaa actatacaca aggacaaacc gtacccgcag caacaacacc 10501 aatagataca aaatatacac agagacaagc agtaccaggg ggaacacaag taaatacaaa 10561 ctatacgttg ggaggtggcg atcgcgtcag agtcagcgta tttgaagttc ccgaatattc 10621 aggcgagtac caaattcctc cgggtggagg gctgaaccta cctttaattg gtagtgtttc 10681 tgttttagga ctgaccacag aagaggcagc tgaattaatt actcagaaat actctcgctt 10741 tctcaaacga ccgatcataa caatcaatct tttatcacct cgccccataa atatttttgt 10801 tgctggagag gtgacacgtg caggagctta cagtctcaac ttacaagcag gcggagggaa 10861 tgctcctagt gcgcaattcc caactttact atcggcactg acgacagctc aagggatcac 10921 acagtcagcg gatgagacaa 
gagtggagtt gaggcgtagg ataggacaag gtccagagca 10981 agttagtgtt gtcaatttga aggagattag acaaacaggc agaataaggc aggatttgac 11041 attgcgggat ggagatacca tctacgttcc aacagcaacg actatagact tggcagatgc 11101 acgaaattta gcttcatcaa gctatgccgc agatccgacc aaacctcgta cagtagcact 11161 cattggtgaa gttctccgtc caggttcgta tcttgtcagc gaaggtgctt cagaaggagg 11221 aaatgcaaac aacacaactg gtgcaccaag tatcactgga caaccaactt tgatgcgagc 11281 actgcaactg gctgggggaa ttacgccaga ggctgatgtt cgtaatgtag caatacgccg 11341 acgcaccaga acaggtaacg aacaatcaat caaagttgac ctgtggaaac tactcaatag 11401 tggtgacatt aatcaagacg tgattttgca agacggagat acgatcgtat ttccaacagc 11461 aacagaagtc agtcgagcag aagcgactca attagcgact acgactttgg ctccttctcg 11521 cattcaagtc aatgtggttg gtgaagtaaa agccccaggc gtgaaacaag ttcagcctaa 11581 tactccgtta aatcaagcat tgcttgcagc aggcggattt aacgatgcga gagcaagtag 11641 aactactgtt gatttggttc gcctcaaccc agatggctct gtgacgaaac gggaagtcaa 11701 agtagatatc aaacaaggaa ttaatgagca aaccaatccc atactccgta ataatgatat 11761 cgtgatagtt aaccgctctg gtatagccaa gactggtgat gctgtaggtg ctttcttcaa 11821 tccttttggc actgtgcttg gtatttttag atctttattc gggttttaaa caaagctgta 11881 aactggggac actcgctagc ccctaagggg gcgctgcgca aacgctcaaa ttcaaaattc 11941 gctcattcaa agttcaaaat tatttttttg aactttgaat tcccaacaaa aagatgtgga 12001 gattattaat tttgaattta taattttgtt tgcttaatac caatttgaaa aaagaatacg 12061 acagatacac attctcaaac ccttgtgatg tctggctttc ttcattttga tagcgcagcg 12121 tgccgcaggc atacttctcc ggattgcgaa ttttgaattg gtataacgtg cccaaagggc 12181 ttattggtaa ctcatatgca gtttgagctt ggtaaatatt ggcacaaggc atcttgggca 12241 caaaaattac tctgtccttt gatgcttgtt ttgggatggg gtgtgggaat gccttctaca 12301 atagccgccc aaacagttac tatccgttta ggtccgtttc agcagtcagt ggcgatcaga 12361 gacctagaaa agtttgccaa aaccgggaaa ttaccagaag gactggaact tttgtcacca 12421 gtgctaaact cgcaagtgcg agaattgcta accaagcggt tggaagtcaa tcctgctgtt 12481 gcagacagat ttattgataa cttggtgcga tcgcccggag gtagacagtt catatcatct 12541 ttgggtggag tcataccagg tagtacaaca gaaagtatca aagccaccct gaatttcgca 12601 
ctccgacaag caaacggttt tagcgcattg agtttcttgc gagcttaccc aggagaaaat 12661 attaccgtag acgctacgaa agctgtcggt ttagcggtag actttaaccc aaacaactta 12721 cagagtcaag cctttggtct gttgcttgaa cgagagttat ctgttgaaag tagcatacca 12781 ttcaaagcag cattcgaccc agcagcgctg gggaatcaag tcgtacagca gcagacaatt 12841 gttttgaatg accaacagcg taaccgtatt attcctgtag atatttattg gagtcaaaat 12901 tcgagtcagg gaaacaccga aaacccactc gttgtcctct cgcacggatt cggggcaaac 12961 cgacgattct tgagctattt agcgcgtcat ttagcttctt atggcataac tgttgttgct 13021 gttgaacatc ctgggagcaa tgttgatgct gtcggtagag cctcagataa acagaatttg 13081 gcacagctgc tagcaggtac tgagtttatt gaacgaccaa aagatattac ctttgtgctg 13141 gatgaactga caaaactcaa tactcaaatt ggtcaacttc agggtaagtt caacactgaa 13201 aaagtcactg tcatcggtca ttccttggga ggttataccg ctttggcttt ggtaggagga 13261 gaggttaact tggaagagtt acgtcaattt tgtaaagatt ctttaaatat cggcgaagcg 13321 cctggtgatt ggttacaatg ttcagcggct tccttacg // LOCUS NODE_2533_length_13351_cov_4.88086613351 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13351) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13351) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13351 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(119..1441) /locus_tag="DP116_20835" CDS complement(119..1441) /locus_tag="DP116_20835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319412.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="D-alanyl-D-alanine carboxypeptidase" /protein_id="PRJNA477356:DP116_20835" /translation="MLELFGSGLISVWLEMAGVTVKPINALDILAWQGSPGLVVAADP NPAGSNTVLEYLKGLQTLKLVAANQMESQGIWMQSGPMLMANHQGTTALPAASLTKVA TSLTAFKNLGPDHQFQTLVSATGPLQNGVLQGDLVITGGGDPMFVWEEAITLGNSLNQ MGIKRVTGNLVITGNFAMNFQRHPLLAGQMLKQALNAATWPRGATFQHSLMPKGTPKP QLVIAGGVKVEPIANPQPTLLLRHHSLPLKQIIREMNVFSNNEIAQMLADAVGGAEVV QSTAARLARVPQSEIQLINGSGLGPENRISPRAVCAMFMAIQQEALAYQLTLADLFPV SGFDHRGTMHSRHLPAATIMKTGTLRDVSALAGVVPTRDRGLVWFAIINRGTNVSGFR TGQDQLLQRLVQQLQVAPGVPATLTPHIAVNTIPQLGAASRNEMLYGG" gene 2000..3025 /locus_tag="DP116_20840" CDS 2000..3025 /locus_tag="DP116_20840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012406928.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate acyltransferase PlsX" /protein_id="PRJNA477356:DP116_20840" /translation="MGSTGARIAIDAMGGDHAPGEIVAGALRAREELGVEVLLVGEPQ QIKSKLPPKTHLGQVEIVPAEETITMDEEPLSGIRRKPKASINVAMDLVKSQRADAVV SAGHSGAAMASALLRLGRLRGIDRPAIGAVFPTIVAGKPVLILDVGANVDCRPKFLEQ FAVMGSVYSQYVLGIQEPSVGLLNIGEEDTKGNDAAVRANQLLRENPQISFIGNAEGR DVLSGRFDVIVCDGFVGNVLLKFAEAVGEVLLQIIREELPQGLHGQIGTAILKPNLKR IKQRVDHAERGGALLLGVDGICIISHGSSQAPSIFNAIRMAKEAVDNQVLHRIQSQNI SIQRESG" gene 3148..4140 /locus_tag="DP116_20845" CDS 3148..4140 /locus_tag="DP116_20845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874862.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-oxoacyl-ACP synthase" /protein_id="PRJNA477356:DP116_20845" /translation="MQNLGIAITGSGSAVPATFLDNHTLTELVETSDEWITTRTGIRQ RRLAKASETVTELASAASRQAIAMAGISPQDLDLILLATSSPDDLFGSACQIQAQLGA TKAVAFDITAACSGFVFGLVTAAQYIRTGVYQNVLLVGADVLSRWVDWQDRGTCVLFG DGAGAVVIQANQKDYLLGFELRSDGTQNNCLNLAYAAEPTQLTQEVNVGKGNFQPITM NGKEVYRFAVQKVPEVLDKALFRANLSVDQIDWLLLHQANQRILDAVAQRLNIPEHKV ISNLANYGNTSAASIPLALDEAVRQGKIKTGDIIAASGFGAGLTWGAAIFQWGR" gene 4279..5157 /gene="fabD" /locus_tag="DP116_20850" CDS 4279..5157 /gene="fabD" /locus_tag="DP116_20850" /EC_number="2.3.1.39" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="[acyl-carrier-protein] S-malonyltransferase" /protein_id="PRJNA477356:DP116_20850" /translation="MTKTAWVFPGQGSQSQGMGIDLLDVPSAQEKFAQAENILGWSVI EICQNNAEKLSYTLYTQPCLYVVESILADIVREKEKPNLVAGHSLGEYVALYVAGVFE WSDGLRLVKRRAELMETTAGGMMAALMNFDREQLEKVIAETPEVVIANDNSPGQVVIS GTPAAVEAVMSQVKAKRAKPLNVSGAFHSPLMAPAAAEFQDILESVEFHQAFVPVLSN VEPVPTVDASVLKKRLIQQMTGGVRWREISLALPENGIERVVEIGPGNVLTGLIKRTC SDLILENVSSVADLAL" gene 5345..5983 /locus_tag="DP116_20855" CDS 5345..5983 /locus_tag="DP116_20855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139665.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="1-acyl-sn-glycerol-3-phosphate acyltransferase" /protein_id="PRJNA477356:DP116_20855" /translation="MSRSREPYVSLLLYHAFKWSVVSPMLQVYFKGRIYGAENVPKTG PLLLVSNHASNYDPPIVSNCVRRPVAFMAKEELFKIPILGKAIQLYGAYPVSRGSADR TAIRAAMKYLDDGWAVGLFLQGTRTPDGRITDPKRGAALIAAKAKVPLLPVSLWGTQA IEQKGSRTPLSVPVTVRIGTLIDPPSSNDKEELEALTQKCTAEINTLHELGR" gene 6251..7294 /locus_tag="DP116_20860" CDS 6251..7294 /locus_tag="DP116_20860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AI-2E family transporter" /protein_id="PRJNA477356:DP116_20860" /translation="MSSFDANNLWYRLNNLALVRFLLFVASGWAIVLLLDYFQSVIVI FTFAAILAFLLSYPVQWLRHFLPHSIAVVVIFLLSITLIGGLTITVGLTVLSQGQQLI DTVSAFLNSLIPLVERIEALLRNRNLPIDLSVIEEQLRNQAISLLVNSLNIFQSMITN FVTFLLIAVVAFFMLLDGEKLWSFIIKTVPQRRRDKFTNIIRRNFLGFFRGQLILTLF LTSSTFLVFLILKVPFALILSVIVGILDIIPGIGATLGVGIITLIVLSQSVWMALRVL VACIILQQIQDNLITPRIMQGALNLNPVVVFFALLVGAKVAGLLGLFISIPIAGVLVS LFEIDEMKAEVES" gene 7464..7766 /locus_tag="DP116_20865" CDS 7464..7766 /locus_tag="DP116_20865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2288 domain-containing protein" /protein_id="PRJNA477356:DP116_20865" /translation="MTNTDLRAELRENLDEAEWEWLIPHAQRDALIFVVEELDLLDVG VAIAGDNVSQVQTWIDEALITKPSVAQIGKWNTQSAKRFKTLIVQPYVLVQEKAAA" gene complement(7837..8370) /locus_tag="DP116_20870" CDS complement(7837..8370) /locus_tag="DP116_20870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017322825.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YdcF family protein" /protein_id="PRJNA477356:DP116_20870" /translation="MILALPLMMLWGYKEIQSQFMQPQAILVLGGSTSKLEREKFTAN FAQKHPTLSIWISGGSPPKVTKQVFAKAGVDTRRLRLDYRANNTVENFTTLVDDLNKR GIKSVYLVTSDYHMRRAKIVGEIVLGSRGIDLKPVTVPSETSPEPIGKSIRDGIRAIV WVTTGYTAIDETQNNQR" gene complement(9126..10226) /locus_tag="DP116_20875" CDS complement(9126..10226) /locus_tag="DP116_20875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197743.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tocopherol cyclase" /protein_id="PRJNA477356:DP116_20875" /translation="MLTISTNSHHSIQTPHSGYHWDGSTRRFFEGWYYRVTLPAEGQT FAFMYSIEDPVGGKHHSGGAAQILGPNDEYLWRTFPDVSKFWASRNVLGLGHWGKTDL LHTKPVYLLPPEFEHHIQEGYQATATLNQGAIHDPGTGNYCRWEYEIQPIYGWGDKGS IQQSTAGWLSFLQIFEPGWQILMAHGLASGWIDWNGKRYEFTNAPAYGEKNWGGAFPE KWFWLNCNSFEGESDLALTAGGGRRGVLWWMESVAMIGIHHQGKFYEFVPWNSQVHWD IQPWGRWQMQARNLQYEVEVTGTTDSPGTPLRAPTANGLRFCCRDTMQGELNLELREF GIGKRKTILKARSFLCGLEIGGGSWDNSWQSR" gene 10255..10851 /locus_tag="DP116_20880" CDS 10255..10851 /locus_tag="DP116_20880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315777.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_20880" /translation="MSDRLNINDAYSILKLKPGASPTQVKQAYRKLVKIWHPDRFSHP HQKQEAEEKIKQINEAYNKLKSDLPSSADPPTQTTHTNIYTSRSNAELFYNWGAENAK QGRYQEALVDFTHAIRLNPNYIDAYKYRGFICSLLGYEYRASSDLNKAAQIEAKLKNK QTHSGSPSSRSSRTSRKKSKLKSLLERFLHWIRRFGGF" gene 11022..13253 /locus_tag="DP116_20885" CDS 11022..13253 /locus_tag="DP116_20885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20885" /translation="MVKLHWLEVSEYASVSFSALGLVTAALTQQMIYVAAPLTVTACL NVANRERLKYLIYQHQQQTIAKQDLLMDPLRQRLTNFDTFTQKLSATTEQQLDELRKN QQQDVNIFTQRLEEFDASFQQVSTNIQQQIQALLQNQQEQNTDDLRQRLTQLDTFTQE LKINTHKQIDPIYQRLGQLDALTQQLSTTTQQQIQTLQQTQQEQNIDVVSQRLTQLDT LVQQLSTTTQQQINPLSVRLSQLDSFAQQLNRYTQHQIQALQQTQQEQNVDDLRQHLS ELDALTQNLSANTQKQFQELQYAIALLQTELQELSLGKQQSQNLSEKKKKPEGFKLSQ VTIKSPAPPPPLQEKKQQTTNFSPNLSPVTVTQSPSSQVNHTTPEVSTPVVTITTSPP KPEVSTPVVSTTTSPPTPEVSTPVVSVPTSPPTAEVSTPVVSVPTSPPTTQGSTTESP VSYKVTSVAFSPNGQTLASACDDKTIKIWHLADKEPRMLRETGGFSCSSAVNSVAFSP DGKLLASGNDDKTIQLWDISTGKEISTFKGHKEKVYTVAFSPDGKTLASGSKDKSVKL WSIDTNKETSTLHGHCDEVFCLAFSPDGKILASAGAKKDKTIKVWYLAENKFLTLKGH SEELGGIYSIAFSPDGKTLVSGGTDKTIKIWQLSSGLELRTLGGHSDDVCSVTFSPDG KQIASGSKDKTVRLWQVETGKEIRTFTVGEDPIYCVVFSPDGKTLAAGGNGDNTVMLL PCD" BASE COUNT 3902 a 2833 c 2810 g 3806 t ORIGIN 1 gcttgcgaat aatgcttact ttacctgact ttgaggctta actgaactgt attgacttat 61 gtgtcattta tgatttgtaa aaaccaagga ctaatgacta atgactaatg actaatgact 121 aacctccata caacatctca ttacgactag ccgcacctaa ttgaggtatt gtgttgacag 181 ctatgtgggg agtgagagta gcaggaacac caggagcaac ttgtaattgc tgcaccagac 241 gttgcaacaa ttggtcttgt ccagtgcgga aacctgagac atttgtacca cggttaatga 301 tggcaaacca gaccaaaccg cgatcgcgtg taggtaccac tccggctaaa gcactaacat 361 cccggagagt gccagttttc atgatcgtag ccgcaggtaa atgtctagag tgcatcgtcc 421 cccgatggtc aaacccggag actggaaata agtcagccaa agtcaactga taagcgagtg 481 cttcctgttg aattgccata aacattgcac aaactgctct gggagaaata cgattttccg 541 gtcctaaccc tgaaccgttg attaactgaa tttctgactg tggtactctt gcaagtctgg 601 ctgctgttga ttgcacgacc tccgctcccc ccacagcatc tgccagcatt tgagctattt 661 cattgttact gaaaacgttc atctccctaa tgatttgctt taggggtagt gagtggtgac 721 gcaatagcaa agtcggttga ggatttgcta ttggttctac tttcactccc ccagcgatga 781 caagttgagg cttgggagtt ccttttggca tcagcgagtg ttgaaaagta gccccacggg 841 gccaagtagc agcattcagt gcttgcttaa gcatctgacc tgctagcagt ggatgacgtt 901 ggaaattcat ggcaaaattg 
ccggtaatca ctaaatttcc tgtaacacgc ttaatgccca 961 tctggttgag actattgcca agggtaatgg cttcctccca aacaaacatc ggatcgccac 1021 cacctgtaat gaccaaatcg ccctgtaata ctccattctg caacggtcct gtagcactca 1081 ccaaagtttg aaattggtgg tctggaccaa gatttttgaa agcagtaagt gacgtagcaa 1141 ctttggttaa ggaagcagcc ggtagagcag tcgtgccttg gtgattagcc ataagcatcg 1201 gtccagattg catccaaatt ccttggctct ccatttgatt tgctgccact aattttaatg 1261 tttgtagtcc tttgagatat tccagcaccg tattagaccc agctggattt ggatcggcgg 1321 caacaactaa gccaggacta ccttgccaag ccaatatatc caaagcattt atgggcttga 1381 cagtgacccc agccatttct agccaaacag aaattaaacc tgaaccaaac aattccagca 1441 tgcttcattg cctctgtaga atatgctatt ttgtttcgtg tatgattacg taagtgtttc 1501 ctgctgttaa gcattgagtc atgataaccg cactttttgc catgcggaag tagggcatgc 1561 aaagaagctt tggtacaaag tataaccata cgttcaaaat atttgaaaat agagggtttt 1621 catttgcacg tttagtgaat ctagttgtta atgcagggca ctcatttttg tctatcccac 1681 atttttctga ggttgacgcc taaatcaggc taattttgtg attttaagta gctttggtgc 1741 tttttgtgtt gccaaaaatc ctgtttttgt caaaagggat taaattttat cctaaatact 1801 gtggttttgg ttctcagtta cagaggtaat tcatgttatg ttgagcttgt cagcacctct 1861 tgagaaagga tgatcgatca agcatgaaga ttgaaagttt tataactcat acttcataac 1921 ttttgccttg atcagaacat ctactacttt cagtagaggc atctggtaaa atttgtaagg 1981 tttctagaag ctctgagcaa tgggatcgac tggcgcacgg atcgcaattg acgcaatggg 2041 aggggatcac gcacccggtg aaatcgttgc tggcgcactg cgagcacgag aagaattggg 2101 tgtagaagtc ttattggtag gcgaacccca acaaataaaa agcaaactgc cgccaaaaac 2161 tcatctgggg caggtggaga tcgttcctgc tgaggaaacg atcacaatgg acgaggagcc 2221 tttaagcggt atcagacgca aacctaaggc ttcgattaac gtagcgatgg atttggtaaa 2281 aagtcagcgg gctgatgctg tggtttctgc tggtcactct ggggcagcga tggcatcagc 2341 tttgctccgc ttgggacgac tccgaggaat tgaccgtcca gcaattggtg cggtgtttcc 2401 gacgatagta gcaggtaaac cagtgctcat acttgatgtg ggcgcaaacg tagactgccg 2461 tcctaagttt ttagagcaat ttgccgttat ggggtcggtt tatagtcagt atgtcttggg 2521 tattcaagaa ccatctgtcg gtttgttgaa tatcggtgag gaagacacca aagggaatga 2581 cgcggctgtt cgtgctaacc aactgctgcg 
tgaaaatccc caaatctcat ttattggcaa 2641 tgcggaagga cgtgatgtcc tttccggtcg ctttgatgtg attgtgtgtg atggatttgt 2701 gggcaatgtg ttgttaaaat ttgccgaagc agttggggaa gtcctgttgc agattatccg 2761 agaagaatta ccccagggat tacacggtca aattgggaca gcaattttaa aacctaacct 2821 caagcgaata aagcagcgcg tagaccatgc agaacgtgga ggggccttgc ttttaggcgt 2881 tgatggaatt tgcattatca gtcacggtag ctcccaagca ccttcgattt ttaatgcaat 2941 tcgcatggca aaagaagctg tggataacca agtgttgcac agaattcagt ctcaaaacat 3001 cagcattcag cgcgaaagtg gttaacagtc aattgtttcg agtcaaagtc aaactcaaaa 3061 gttttttact aacgactaaa gactgttgac taaagactaa tgactaatga gtaatgacta 3121 aaagctgata gctttggaga tgccagagtg caaaacttag gaatagcaat cacaggaagt 3181 ggatccgcag taccagcaac tttcctggat aaccacacat taacagaact ggttgaaaca 3241 tcagacgaat ggattaccac aagaacagga attcgtcaac ggcgattggc aaaagcaagt 3301 gaaacggtaa ctgaacttgc gagtgctgct agtcggcagg cgatcgcgat ggctggaatt 3361 tccccacaag acttggattt aattctgcta gccacctcta gtcctgatga cttgtttggc 3421 agtgcttgtc aaattcaagc ccaattggga gctaccaaag cggtagcttt tgatatcacc 3481 gcagcctgtt ctggctttgt gtttgggcta gtgaccgctg ctcaatacat caggactggg 3541 gtttatcaaa acgtactgct tgttggagcg gatgttctct ctcgttgggt ggattggcaa 3601 gaccggggta cttgtgtatt gttcggagat ggtgcaggag ctgtagtcat acaggcaaac 3661 caaaaggatt acttgctggg atttgaactt agaagcgacg gaactcagaa taactgcctt 3721 aaccttgcct atgcagctga acctacacaa ctgactcaag aggtcaatgt tggcaaaggc 3781 aacttccaac ccatcaccat gaatggcaaa gaggtatacc gttttgccgt acaaaaagtc 3841 ccagaagtgc ttgacaaagc tttgtttcgc gctaacctta gtgttgacca aatagattgg 3901 ctattattgc accaggcaaa tcagcgcatt ctcgacgctg ttgctcaacg cctaaatatt 3961 ccagaacaca aagttattag taatctcgcc aattatggca acacttctgc cgcctccatt 4021 cctcttgctt tagatgaagc agtgcggcag ggcaaaatca aaacaggtga tattattgct 4081 gcctctggct ttggtgctgg actaacgtgg ggtgcagcaa tcttccaatg gggaagatag 4141 ttttttatcc tttgtcattg gttatttgtt ctttgttaat aacaaatgat caatgaccaa 4201 tgaacggcag gtgctcatgg gggaaacccc caagaccgca ctgcctccca atgaccaatg 4261 actcatgact aatgactaat gactaaaact gcatgggtat 
ttcctggaca aggttcgcag 4321 tcccagggaa tgggaataga cttattagat gtgccatccg cgcaagaaaa atttgctcaa 4381 gctgagaaca tattaggctg gtctgtcatc gaaatctgtc agaacaatgc cgaaaagcta 4441 tcgtacactc tctacacaca gccttgttta tacgttgtag aaagcattct cgctgatatt 4501 gtgcgagaaa aagaaaaacc gaatttagtt gcaggtcaca gtttaggaga atacgtcgcc 4561 ctttacgtgg ctggagtatt cgagtggtct gatgggttgc gtttagtaaa gcgtcgtgcc 4621 gaacttatgg agaccactgc aggtgggatg atggcggctt taatgaactt tgatcgtgaa 4681 cagttggaaa aagtcattgc tgaaactccc gaggtagtga tagcaaacga taatagtccc 4741 ggtcaagtag tcatatcagg cacacccgca gcagtagaag ccgtaatgtc tcaagttaaa 4801 gcaaagcgtg ctaagccttt gaatgtttct ggggcatttc attcgccttt aatggcacca 4861 gcagccgctg aatttcaaga tattttagaa tcagtcgaat ttcatcaagc ttttgtacca 4921 gtgttatcaa acgtagaacc agttcctact gttgatgcaa gtgttttgaa aaaacgttta 4981 atacaacaaa tgactggagg agtgcgatgg cgagaaattt cactagcact accagaaaat 5041 ggcattgagc gagtggtaga aattggtcct ggtaatgtgc taactggttt aattaaacgc 5101 acttgctctg acttaatctt agaaaacgtc agcagtgtag ctgatttggc tctttagtga 5161 gttatgacta gtgagtcgtt aattgtgtca ctaactactc actactcact tgttccacat 5221 tcagattggt gatttccata gtttgaattt tgaattttga attttgaatt ttgagattgc 5281 ttcgcttcac tcgcaacgct acgcgatttt gaattttgaa ttgtttaacc cctcccactc 5341 gactatgtct cgaagccgtg aaccatatgt aagtttgtta ctgtaccatg catttaagtg 5401 gtcagtggtc agtcctatgc ttcaggttta tttcaaggga cgaatttatg gtgctgaaaa 5461 tgtccctaaa acaggaccac tactactagt tagcaatcat gctagtaact acgatccacc 5521 cattgtttct aattgcgtac gccgtccagt ggcgtttatg gcaaaggaag aactctttaa 5581 aatcccaatt ctcggcaaag caattcagtt gtacggtgct tatcctgtga gtcgaggtag 5641 tgctgatcgc actgctatcc gtgcagctat gaagtatctt gatgacgggt gggctgtagg 5701 tctttttctg caaggaacac gtactccaga cggacgcatt acagatccta aaagaggtgc 5761 agccctgatt gctgctaaag ccaaagtccc acttttaccc gtatctttat ggggaactca 5821 ggcgattgag caaaaaggtt cgagaactcc tctttctgtt ccagtcacag tgcgaattgg 5881 gacgttgata gatcctccca gttccaatga taaagaggaa ttggaagcat taacgcaaaa 5941 gtgtacagca gaaattaaca cgctccacga gttgggacga taatttggaa 
gagtctgaaa 6001 atatgcttaa aggaggtggc aaaagccact gaatttgtgt caatttaaaa caatgcaatt 6061 tgaatttgag atttagaatc gctaattgag tagataattg tcaataattg tctgacaaat 6121 agcatatttt ttgcgcctct ttctcaggaa aaagaccaaa aaccataatc taaaatccta 6181 aattaaagtt gatatttgtt ccacgttgct agaccaaatt cactgtcatc aagctatccc 6241 taaataatat atgagcagtt ttgatgccaa caatctttgg tatcgcctaa ataatctggc 6301 gttagtccgt tttttgcttt tcgttgcttc tgggtgggcg attgtactac ttctagatta 6361 ttttcaatca gtcattgtta tttttacatt tgctgccatt ttagcttttt tactgagtta 6421 tcctgtacaa tggctgcggc actttttacc acacagtata gcagttgttg tcattttcct 6481 attgagtatt acccttattg gtggtttgac aattactgta ggtttaaccg ttttatctca 6541 aggacaacaa ttaattgaca cagtctctgc atttttaaac tctttaatac cgctagtaga 6601 gcgaatagag gctcttcttc gtaaccgtaa cttgccaata gatttgagcg tcattgaaga 6661 acagttgcga aatcaagcta tatctttact tgttaatagt ttaaatattt tccaaagtat 6721 gattactaac tttgtgactt ttctcttgat tgcagttgtc gcttttttta tgttactaga 6781 cggcgaaaag ctttggtctt ttattataaa aacagtaccc cagcgacgac gagataaatt 6841 tacaaatata atcagacgta acttcttggg attttttcga ggtcagctga tattaacctt 6901 atttttgaca tcttcaactt ttcttgtttt cttgatactt aaagtgcctt tcgcattgat 6961 cttatcagtc atagtcggaa tactggacat cattcctggt ataggagcta cattgggagt 7021 gggtatcatt actttaattg tcttatcaca aagtgtttgg atggcactca gagtcttagt 7081 agcttgcatc atacttcaac aaatacaaga caatttaatt acgcctagaa ttatgcaagg 7141 cgctctcaat cttaatcctg ttgttgtctt ctttgcttta cttgttgggg ctaaagttgc 7201 aggattgttg ggacttttta tttctattcc gattgctgga gttttagtgt ctttatttga 7261 aattgatgaa atgaaagcgg aagttgagtc atgagtcatg agtcattagt cattaaaaaa 7321 aataatgaat ttaaagaaaa agtgtatata gtttctcgga aagtatagtt ttctgataac 7381 agattgaaag atagcaaaaa aagatgtgtt atttttgttg gatttattta tgatagtaat 7441 tgagaactaa taactcctga cgaatgacta atacagattt aagagcggaa ttaagagaaa 7501 atttggatga agcggagtgg gaatggctga ttcctcatgc gcaacgggat gctctaattt 7561 tcgtggtaga agagttagat ttattagatg tgggagtggc gatcgctggc gataacgtat 7621 cacaagtgca aacctggatc gatgaagcat tgattaccaa accctcagtt gcacaaatag 
7681 gaaaatggaa cacccagagt gcaaagaggt ttaagacttt gattgtccaa ccttatgtac 7741 ttgtacaaga gaaagctgct gcctaaaaaa gatatagagt ttttttaaat attttttact 7801 tgtgaattct tttaactatt ctttttataa ctgatttcat cgctggttat tttgagtttc 7861 atcaatagct gtgtaaccag ttgttaccca aactatagct ctaattccat cacggataga 7921 ttttccaatt ggttcgggag aagtctctga agggactgtg acaggtttta aatcaattcc 7981 tcgactaccc aaaacaattt caccaacgat tttagcacgg cgcatatggt agtcagaagt 8041 cactaaataa acacttttta tgcctctttt gttcaaatca tctaccaatg ttgtaaagtt 8101 ttctacagtg ttgtttgctc tgtagtctag acgcaaccgc cttgtatcaa ctccagcttt 8161 agcaaacact tgtttagtta cctttggagg actaccccca gaaatccaaa tacttaaagt 8221 ggggtgtttt tgtgcaaaat ttgctgtaaa cttttctcgc tctaatttag atgtcgaacc 8281 acccaaaact aaaattgctt gaggctgcat aaattggctt tgaatttcct tgtaacccca 8341 caacatcata agtggtagtg ctagaatcat aaaccgagaa gaaatacctt tagattgttg 8401 ctttttcaag ggcttatcac ctcggttgct taaacattat ttgttgcaag aatcataacg 8461 tacattatca aagcgctaaa aaatgaactc acatttgtgt ttgagtcatg tttctgttcc 8521 ataagttttg tatagcaaat ccgatgtaaa taccttcctt ttctaaagtc catatcagga 8581 gatattttta acaaaccacg actcgcaatt gtcagggttg cttgaagtta cagacgcaga 8641 ttttgtatta tctgtgccta caaataacaa catctccaga ttcaatggat aattttaacc 8701 tctatccagg caaatccaaa gatttccctc gtttatcaag tgtattagag gacgggactg 8761 gccagattcg aaccggcgac ctagcgcttc agatttgtgt gagtttcccc actctctgga 8821 ctataccttc accgtaggct ttaagccctt aggtggtagc tttctagtct ctacaccttc 8881 ctgagctgac ctgcataaga aatgacatag ttttatgctc aggcttggct cggtattccc 8941 ataaagtgtc agcattgtgg tcaaaagaca accaatccta cttttttcta tctttttagg 9001 gttcaccgaa tttaactaca ttcactcata gagtttcctc tatagtgccc gatagttcag 9061 gaggcgctcg ctctatcctg ctgagctaca gccccaaaca gacatcattc aacatgatac 9121 catttttagc gggactgcca agagttgtcc caagatccgc cgcctatttc taatccacac 9181 aagaaactac gtgctttcag gatagttttt ctttttccaa taccgaattc tcgcagctct 9241 aaatttaact ctccctgcat ggtgtcacga cagcaaaatc ttaaaccatt tgccgttggc 9301 gcacgtaaag gagtcccagg tgaatcggta gttcctgtga cctcaacttc atactgcaaa 9361 
ttccgtgctt gcatttgcca tctaccccaa ggctgaatat cccagtgtac ttgggaattc 9421 cagggtacga attcataaaa cttgccttga tggtgtatac caatcatagc tacagattcc 9481 atccaccata aaacaccgcg tcgtccgccg ccagcagtca gagcaaggtc gctttcgcct 9541 tcaaagctat tgcagttgag ccaaaaccat ttttcgggaa aagcgcctcc ccaatttttc 9601 tcgccgtaag ctggggcgtt ggtaaattcg tagcgtttac cattccagtc aatccaaccg 9661 gaagctaagc catgagccat gagaatttgc catccgggtt caaaaatctg caaaaatgac 9721 aaccaacctg ctgttgattg ttgaatactt cctttatcac cccatccata aatgggttga 9781 atttcatact cccagcgaca atagttacca gtcccaggat cgtgaattgc tccttggttc 9841 aaggtcgcgg ttgcttggta gccttcttga atatggtgtt caaactcagg tggaaggagg 9901 tacacaggtt tagtgtgtag gaggtctgtt ttaccccaat gacctaagcc caaaacgtta 9961 cggcttgccc aaaatttgct cacgtctgga aacgtccgcc ataaatactc gtcattggga 10021 ccgaggattt gtgctgcgcc gccactgtgg tgtttaccgc cgacggggtc ttcaatagaa 10081 tacatgaaag caaaagtttg accttctgcg ggtaaggtga cgcggtaata ccaaccttca 10141 aaaaagcgac gagtactacc gtcccagtga taaccgctgt ggggtgtttg gatagaatga 10201 tgagaatttg tggaaatagt taacatagat ttacgatgct gcctatttcc cagtatgagc 10261 gatcgcctta acataaatga tgcctactct atcctcaaac tcaaacccgg tgcatcacca 10321 acacaagtca agcaagccta tcgtaagcta gttaagattt ggcatcctga tcggttttct 10381 catccgcacc aaaaacagga agcagaagaa aaaattaaac aaatcaacga agcttacaac 10441 aagctgaaat ctgatttacc aagttctgct gatccgccca ctcagacaac acacacaaac 10501 atttatacaa gtcgctccaa tgccgaactt ttctacaact ggggagcaga gaatgcaaaa 10561 caaggaagat accaggaagc acttgttgac tttactcatg caattcgcct caatcctaac 10621 tacatcgatg cttacaaata ccgtgggttt atttgctccc tacttggata tgagtatcga 10681 gccagttctg atttaaataa agccgcacaa atcgaagcga agttgaaaaa caaacagact 10741 cattctggat ccccatcatc aagatcctcc agaacgtccc gaaaaaaatc aaagctcaaa 10801 tcccttctag aaaggttttt acactggatc aggcgctttg gtgggtttta agaagtcacc 10861 cggatgcact tgtgtttgtc atctcttgtt gacatgagta cttctgggca tagattcgag 10921 aaatttatta gcagaagatg aggatagggt agtcaaaccg cattacgcgc tcaagttatt 10981 ttctttctga tttaggcacc aagcacagca ctcacctgat tatggtcaaa ctccactggt 
11041 tagaagttag cgaatacgcc tccgtctctt tttctgcact gggattagtc acagccgcac 11101 tcacgcaaca aatgatttat gtagctgctc ctttgactgt aacagcttgt ttaaacgttg 11161 ctaatcgaga aagattaaag tatttgatct accagcatca acaacaaaca attgccaaac 11221 aggatctact catggatcca ctccgacaac gactgacaaa cttcgatact ttcactcaaa 11281 aacttagtgc aacaactgaa cagcagcttg atgagttacg aaagaaccag cagcaagacg 11341 tcaatatttt cacccaacgt cttgaagaat ttgatgcctc ttttcaacaa gtcagtacaa 11401 acattcaaca acagatacag gcattactgc aaaatcaaca agaacaaaat accgatgatc 11461 ttagacagcg cttaactcaa ctcgacacct ttacgcaaga gttaaagata aacactcaca 11521 aacagataga ccctatttat cagcgccttg gacaacttga tgcattgaca caacagttga 11581 gtacaacgac acaacagcag atacagacat tacagcaaac ccagcaggaa caaaatattg 11641 atgttgtaag tcaacgactc actcaactcg acacccttgt acaacagttg agtacaacga 11701 cacaacagca aattaatcca ctttctgtgc gcctttctca actggactct tttgcacagc 11761 aattaaatcg atacactcaa caccagatac aggcattaca gcaaacccag caggaacaaa 11821 atgtggatga tcttagacaa cacctgagtg aacttgatgc cctgacacaa aatttaagtg 11881 caaatactca aaaacagttt caagaacttc agtacgcgat cgccctacta cagactgaac 11941 tacaggagtt gtctttggga aagcagcaat ctcaaaacct atccgaaaaa aagaaaaaac 12001 cagaaggttt caaactatct caggtaacga ttaaatcacc agcaccacct cctcctcttc 12061 aggaaaagaa gcaacagaca acaaattttt ctccaaatct atctccagtg acggtgacac 12121 aaagcccgag ttctcaggtc aatcatacaa cacctgaggt ttctacgcct gttgtcacca 12181 taacaacatc acccccaaaa ccggaagttt ctacacctgt tgtcagcaca acaacatcac 12241 ccccaacacc ggaagtatct acacctgttg tcagcgtacc gacatcaccc ccaacagctg 12301 aagtttctac acctgttgtc agcgtaccga catcaccccc aacaacacaa ggttcaacaa 12361 cagaaagtcc tgtttcctat aaagtgacgt ccgtcgcttt tagtcccaac gggcaaacat 12421 tagccagtgc ttgtgatgac aaaaccataa aaatttggca tttagctgac aaagaacctc 12481 gcatgcttag ggaaaccgga ggattttcct gttctagcgc agtgaattct gtagcattca 12541 gtcctgatgg taaacttttg gcaagtggaa atgatgataa aacgattcaa ctgtgggata 12601 tttccacagg aaaggaaatc tctaccttca aagggcataa agaaaaagtt tatactgtcg 12661 catttagccc agatggaaaa actttagcta gcggtagtaa 
agataagagt gtcaagcttt 12721 ggtctataga tacaaataaa gaaacttcta cactccacgg gcattgtgac gaggttttct 12781 gccttgcttt cagtccagat ggaaagattt tagccagtgc tggtgctaag aaggataaaa 12841 ctatcaaagt gtggtatctg gctgaaaata aatttttgac tcttaaaggt cactccgaag 12901 aattgggagg aatttattct atagctttta gtccagatgg gaaaaccctt gtaagtggcg 12961 gtacagacaa gacaattaag atttggcaac tctcaagcgg tttagaactc cgcactcttg 13021 gaggacactc tgatgatgtt tgttctgtga cattcagtcc agatggcaaa cagatcgcaa 13081 gtggtagcaa agacaagacg gtgaggcttt ggcaagtgga gacagggaaa gaaatccgta 13141 ctttcactgt tggtgaagat ccaatttatt gtgttgtgtt tagtccagat ggtaagactt 13201 tggcagcggg tggcaatggt gataacactg tcatgctttt gccttgtgat taagctaggc 13261 attgcacctg gagactgcaa atgaaatcag cacctcaaga aatgcaaatc aacttttccg 13321 gtaagacagt aaaaaagact atatttataa c // LOCUS NODE_2568_length_13173_cov_5.037582 13173 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13173) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13173) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13173 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(727..1380) /locus_tag="DP116_20890" CDS complement(727..1380) /locus_tag="DP116_20890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745675.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20890" /translation="MLPPEKPALDPHPLSPYVAWAGNCNRDPILEVFKKKLPQNEGHV LEFASGSGMHINYFAPHFQHLSFQPSDMNEEAFENIRRLTQQTQAKNVHPPIKLDLTQ PQTWSVLAGKKFDTIFCINVFQVAPIAIADGMMECTANLLKDSGSLFIYGPFKVHGKY TTPSNEEFHRTLLSYKVPEWGLKDIADITKYAQKHRLNLKEKIDMPSNNFTLVYSFR" gene complement(1452..1781) /locus_tag="DP116_20895" CDS complement(1452..1781) /locus_tag="DP116_20895" /inference="COORDINATES: similar to AA sequence:RefSeq:YP_002517285.2" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20895" /translation="MTQVFLYAEYQISIPFSEIDWVPINLEMKKFPGLKSKTWLSGVN TNTVGGFYEFDSVENAQSYIDNLLIPFAKQVNGNLTVKLFDGYVTKEASIGMSSPFYL ADSHDRK" gene complement(2020..2490) /locus_tag="DP116_20900" CDS complement(2020..2490) /locus_tag="DP116_20900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456604.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4281 domain-containing protein" /protein_id="PRJNA477356:DP116_20900" /translation="MSIDQIFNIANIFVLPFWTLMFLLPKWKVTQRVMESYLPFVPLA GVYLYLFVTSITPENAQALSNPQLADIARFFADEKAAATGWVHFLVMDLFVGRYIYLE GQKTGVWTIHSLALCLFAGPMGLLSHIFTTWITKAVSPTLKNEVAEVAEKTVSS" gene complement(2745..3104) /locus_tag="DP116_20905" CDS complement(2745..3104) /locus_tag="DP116_20905" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20905" /translation="MKKIFSITCLIALLCLLGSCSTSYRTVQIKDVSKPEQITLSKAP LQGSIYSINIQVTGKLDGTANIGLTMNGKPYLSEKMHNSVNFEWEYDWYSDVAVIQYK PIDVKSGKLTISYRFFG" gene 3607..5397 /gene="typA" /locus_tag="DP116_20910" CDS 3607..5397 /gene="typA" /locus_tag="DP116_20910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="translational GTPase TypA" /protein_id="PRJNA477356:DP116_20910" /translation="MTLPIRNVAIIAHVDHGKTTLVDALLKQSGIFREGEDVPDCVMD SNALERERGITILAKNTAVHYGDTLINIVDTPGHADFGGEVERVLGMVDGCILIVDAN EGPMPQTRFVLKKALEKGLRPIVVVNKIDRPQADPHGAIDKVLDLFLELGADDDQCDF PYLFASGLSGYAKEQLEDEAKDMQPLFEAILRHVPAPVGDAEKQLQLQVTTLDYSEYL GRIVIGKIHNGIIRMGQQAGLVKENGEIVKGKITKLMGFEGLKRIEITEASAGNIVAV AGFADANIGETITDPNEPQALPLIKVDEPTLQMTFWVNDSPFAGQEGKLVTSRQVRDR LLRELETNVALRVEETDSPDKFLVSGRGELHLGILIETMRREGYEFQVSQPQVIYREV NGQPCEPYELLVLDIPEDAVGSCIERLGQRKGEMQDMQVGGNGRTQLEFIIPARGLIG FRGEFMRMTRGEGIMNHSFLDYRSISGDIEARNKGVLISFEEGVSTFYAMKNAEDRGV FFIVPGTKVYRGMIVGEHNRPQDLELNVCKTKQLTNHRASGGDELVQLQAPVDMSLER ALEYIGPDELVEVTPQSIRLRKLAKKLAKR" gene 5879..6244 /locus_tag="DP116_20915" CDS 5879..6244 /locus_tag="DP116_20915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875343.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin--nitrite reductase" /protein_id="PRJNA477356:DP116_20915" /translation="MISPEVNTKSSEHSLEAMRHFSEQYAKRTGTYFCSEPSVTAVVI EGLAKHKDELGAPLCPCRHYEDKQAEVKATFWNCPCVPMRERKECHCMLFLTSDNDFA GDKQEISLETIKQVRDGMA" gene 6241..6642 /locus_tag="DP116_20920" CDS 6241..6642 /locus_tag="DP116_20920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756675.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF309 domain-containing protein" /protein_id="PRJNA477356:DP116_20920" /translation="MNEEMPDEFWQGVEQFNTGEYYACHDTLEALWIEATEPDKTFYQ GILQIAVALYHLGNHNLRGAVILLGEGSNRLRRYPSEYGGIDVDELLIQSAALLKALQ LTQPEKIAALNLGQDEGLPLPRIIRINNEEA" gene 6904..7410 /locus_tag="DP116_20925" CDS 6904..7410 /locus_tag="DP116_20925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216562.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="OstA family protein" /protein_id="PRJNA477356:DP116_20925" /translation="MMPHYQLPHFHMRRLALALILPVVVLGAMTFPNQLQTATAQTPQ KSGQNRPLYIRGKVQEYNSKTQVATIRGEVELSYPARGIQATAAQAQYFTRERQIILS GNVYVLQQGSNSIKADSVTYLIDEARFVATPKQGSQVESIYMVNDTPVNNQAPGSAPA TAPVKKSN" gene 7440..8168 /gene="lptB" /locus_tag="DP116_20930" CDS 7440..8168 /gene="lptB" /locus_tag="DP116_20930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210966.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LPS export ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_20930" /translation="MKIVLENIHKTYGKRVIVNRVNLSVAQGEIVGLLGPNGAGKTTT FYIATGLEKPEKGKVWLDNIDMTPLPMHRRARLGIGYLAQEASVFRQLSVRDNILLVL EQTNVPSWEWARRVDTLLREFRLEKVANNKGIQLSGGERRRTELARALAAGREGPKFL FLDEPFAGVDPIAVSEIQQIVAQLRDRDMGILITDHNVRETLAITDRGYIMREGQILA SGTSKELYNNPLVRQYYLGDNFQA" gene 8667..9788 /locus_tag="DP116_20935" CDS 8667..9788 /locus_tag="DP116_20935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140079.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="permease" /protein_id="PRJNA477356:DP116_20935" /translation="MDRYLARELLPTFLFGVGAFASIGVTIDSMFELIRKIVESGLPI SIAVQVFTLTMPEFIVLAFPMSTLLATLMTYSRLSSDSELVALRGCGVSVYRMVLTAV MLSLVVTGMTFVFNEHLAPASKYRATQILNAALKSETPTIVKRQNIFYPEYQKVKKKD GSGNRKILTRLFYADEFDGKEMKGLTIIDRSTEGLSQIVVSESGEWNPSQNVWNFHNG TIYLVAPDRSYRNILRFEHQQIKLPRTALDLAQKSRDYGEMNISQALEQLEVERLGGD EQKIRKLQVRIQQKISLPFVCVVFGLVGAAMGTVPQRTGKGTSFGISVIVIFSYYLLW FITGALGEAGIFSPLVAAWMCNFLGFGVGFLLLRRVAQK" gene 9928..11391 /locus_tag="DP116_20940" CDS 9928..11391 /locus_tag="DP116_20940" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20940" /translation="MSRHHHPPAYLRYLKARLWYLARPSFWGTAIFLSVVGLAIKEYW THPDFFTQWQKNLVADNKPVNSSVLEEDKAMTPDINNLPPVPYNIGNNRANPAAKNLA SKQNIQAKNSKTLLEALNSKSQTSTSDNKLKLNVKEGDSTPVIELENPFLAQAQNLLQ FKNLPNGSNSLGVNALTPSLYQQNLAQNSFGLGTGLPNQTTSNQNGVSESALQTALNQ VKSQQSTNSNRITSTQKNPFEPSSLLSTENRNSQTLVPSTGFNTNTINPLNVGTGSTQ SGAISGTSYIQPGTTPGTTYPQPSFNNLQPQNLSGGTSYIQPGTGNQLQTSIPGTAYP QPSFNNLQPQNLSGGTSYIQPPTANQLQTSIPGTAYPQAKPELVGIPQTPSAIPNRSA IILDQVIKNRLNNINTAQPLSTVTQPTSVPPSSNTLNAPNYTSNQGNVINNAPLTSNN DGSVGIQQAPAPVPQYIYPSSTQIPGQYTGGGQINRY" gene 11511..12062 /locus_tag="DP116_20945" CDS 11511..12062 /locus_tag="DP116_20945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015219692.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_20945" /translation="MDTAPLVRWLPVEKPESLKPKSTEPRLTVELVPQTCWYSNVRSE VSTGEWDKLKRVTFERANYRCEVCSGRGLKHPVECHEIWDYDDERHIQTLTGLIALCP ACHECKHMGFANIKGRGELATDHLAKVNGWTIHQAKTYVRECFQVWQERSQHEWDLDI TYLEQFEISSHSSSPRRGDRPGN" gene 12142..>13173 /locus_tag="DP116_20950" CDS 12142..>13173 /locus_tag="DP116_20950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137423.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="magnesium chelatase ATPase subunit D" /protein_id="PRJNA477356:DP116_20950" /translation="MPAPTTTPLSTAFPLTAVVGQEAIKSALLLAAVDPGLGGVAIAG RRGTAKSVMARAIHTLLPPIEVVSGSISNCDPNRPEEWDDQLLADSSLSLPTSEDGSS EVKTEIIPATFVQIPLGVTEDRLLGSVDVEQSVKQGETIFQPGLLAQAHRGVLYIDEI NLLDDQISNQLLTVLSEGRNQIEREGISFQHPCKPLFIATYNPEEGALREHLLDRIAI ALSADGVLGLDQRVQAVEQAIAYSLSPQEFLQQYSEDLDALKTQIILAREWLKDVCIT HQQISYLVEEAIRGGVQGHRAELFAVRVAKASAALDGRTQVNADDLRRAVELVIVPRA TVVQTPPPQE" BASE COUNT 3884 a 2860 c 2830 g 3599 t ORIGIN 1 ttcatccact catgtaaagg taggggtttt catcctcccg cttgcagccc gataattcaa 61 tacgattgag tttcagctaa aaacaccgaa ccaatgtacg ttggggagga cagtcgcaca 121 agaagggttc gtccaacgga agagtcagtt gccagcagtg gggttaccct caaagtagca 181 ctgatctgac tgtgttgcga tcgctgtgga cattttacct ttaataaggt tgagctattt 241 agtactaaat tccacaagtt cgtagcgaac gacagcttct ttcattgagt tttagcgttt 301 tttgaggcac gggatgccgt gcccctttag catgaccgta aacgttatac catttctcta 361 tgaacttgag cgattattta accccccgcc tccggcgtcc cacgccactt ctggtggggg 421 gggggaaccc ccgcactctt tgtggctccc cttagcaagg ggagccagtg cgaagggaaa 481 cgctctgccg acttgtagca tctggcgtgg gactacaggg gggtaactat taagcgcatt 541 gttacagaga actcgtatta acagcaaccg catacctcaa gatgtcgtga ggtacaaaaa 601 aagaccccct tcatccgctg ctaaaggggg tgcaaaatat acagtggtct aatttctata 661 tagatgtagg cgaaatcgac ttatgcagta gaagaggtag tgcctttttt gaaaaatgac 721 ttcttcttac ctgaaactgt agactagagt gaagttattg gatggcatat caattttttc 781 
tttcaggttc aaccgatgct tctgggcata cttggtgata tctgcgatat cttttagtcc 841 ccattccggt actttgtagg agagcagggt tctgtgaaac tcttcatttg aaggtgttgt 901 gtacttacca tgcactttaa atggaccata gatgaataat gagccactgt ctttgagcag 961 gtttgcggta cattccatca tcccatctgc aatggcaatc ggcgcaacct ggaacacatt 1021 gatacagaaa atagtgtcaa atttttttcc agccaaaaca gaccaggttt gcggctgagt 1081 cagatccagc ttgattggcg ggtggacatt ttttgcttga gtttgttgcg tcaagcgtct 1141 aatgttttca aatgcctctt cattcatatc cgagggctga aaacttaggt gctggaaatg 1201 gggagcaaag tagttgatgt gcataccgct accggaagca aactctagaa cgtgaccttc 1261 gttctgaggt aactttttct taaacacttc caaaattggg tcacggttgc aattccctgc 1321 ccaagcaaca tagggactca gcggatgcgg atcaagtgcg ggtttttctg gtggtaacat 1381 ttgttcctac ttaattgtgc tgcatttctt ttgatattga gtgtagagtg caataattct 1441 aaccccacac tttatttcct atcatgagaa tctgccagat aaaaaggaga actcatgcct 1501 atactggctt ctttcgttac ataaccatcg aaaagcttta cagtgagatt gccattaact 1561 tgttttgcaa atggaatcaa taagttatca atataacttt gtgcattctc tacagaatca 1621 aattcataaa atccaccaac agtattagtg tttacaccac ttaaccaagt cttggatttt 1681 aatcctggga attttttcat ttctagattg attggaaccc aatcaatctc actgaaagga 1741 atagatattt gatactcagc gtataagaat acttgagtca taaatatttt tgagtatttc 1801 ttgaattttc ctttagttat tttattctat caaatgatat ttttctctta gcttatcata 1861 atgatttaag ttttaactca tcttagatag ctgcaaaggg tttgggtgaa cattttatct 1921 aagttcaccc aaaccgcatt ggatttctgt cacaagtgaa aaccataccc cttatcgagt 1981 attaattcta ggtacagtag tcaaattgct ctgtgtcacc taagaggata ctgttttctc 2041 ggcaacttct gctacttcat ttttaagagt tggggaaact gcctttgtta tccaagtcgt 2101 gaaaatatga gaaagtaatc ccattggacc agcaaataag caaagcgcca acgagtggat 2161 tgtccaaact cctgttttct gtccttctaa ataaatgtaa cgcccaacaa ataaatccat 2221 gactaagaaa tgaacccaac ctgttgcagc agctttctca tcagcaaaaa atctggcaat 2281 atctgctaat tgaggatttg ataaagcttg ggcattttct ggtgtgatgc tagtaacaaa 2341 caaatacaaa tacacgcctg ctaagggcac aaaaggaaga taggattcca taaccctttg 2401 tgtcactttc catttgggta ggagaaacat taatgtccaa aatggtaaaa cgaaaatatt 2461 agcaatgttg 
aaaatttggt cgatagacat aagagatgta aaattatact ggtacaataa 2521 tacttgatta tggcaactca cagatgatat ataggaatcc gatttaattt ctgaaaaaat 2581 ctaagtatat gtagggggag ccagtgcgtt gtgttatcac tcttgagtac agcaccaaaa 2641 ccgtcggatg gtgggttagc cagaatcaaa tatgagtcct atagatgaat tttattacca 2701 gaatcttgaa tgcattaatt accccaatcg taccgattgt gagattaacc aaaaaagcga 2761 taggaaatgg tcaacttacc agatttaaca tcaattggct tgtattggat aacagcgaca 2821 tcagaatacc aatcatattc ccattcaaaa ttcacactgt tatgcatttt ctctgacaga 2881 tagggtttgc cattcatcgt taaaccaatg ttcgccgtgc catcaagttt gccagtaact 2941 tgaatattaa tcgaataaat acttccttgt aatggtgctt tgctgagggt gatttgttct 3001 ggtttgctga catccttaat ctgaaccgtt cgataggatg tggaacacga accaagtaaa 3061 cacaatagag cgatcaaaca agtgatagaa aaaatctttt tcatcataaa tgtaaagctg 3121 ccggattttt cgcgccttca atgaaacgat gagtccagcg ccgtaggcga ctggcgtaag 3181 tgcaaagcct gccgctattc catacaaagc agtgtaggcg tgcaggagat acccggtaag 3241 gcgtggccgt tcccactagg gttgtcaaag cagaacgaaa gcctgtcttg aagctgaatt 3301 tatcaccttt tcctcctctg taccagttac ggacgcagaa aaaagtgacc acccgtgagg 3361 gttgatctgt cggtgtggtc gggtataagt ggggtaaata cggctgctca cttttttcac 3421 ttttttacga ctgaatgcgt tagcatcttg gcgattcaaa aaaacacttc aaccataaag 3481 agttaatata gaattgttaa ggcttgttta cagaatttat aaaaccaccc gaattctgtt 3541 acaacattcc tccactagat attaagttct aaaatcccca cgctaatttt atccaaacgc 3601 gctattatga cgctcccaat ccgtaacgtc gccattattg cccacgttga ccacggcaaa 3661 acaacccttg ttgatgctct cctcaaacaa tccggcatct tccgtgaagg agaagacgtt 3721 ccggattgcg tcatggactc caacgccctt gaacgggaac ggggaattac aattcttgcc 3781 aaaaatactg ctgttcacta tggagacacg ctgatcaaca ttgttgatac gcccggacac 3841 gctgactttg gtggcgaagt tgaacgtgta ctcggcatgg ttgacggctg cattctgatt 3901 gttgatgcaa acgaaggtcc catgccccaa acacgctttg ttttgaagaa agccttggaa 3961 aaaggactgc gccccattgt tgtcgtcaat aaaattgacc gtccccaagc agatccacat 4021 ggtgctatag ataaagtgct ggatctgttt ctggaactag gggcagatga tgaccagtgt 4081 gattttcctt atctgtttgc ttctggttta agtggctacg ccaaagaaca gctagaagat 4141 gaagccaagg atatgcagcc 
actgtttgaa gctatcctgc gccatgtacc agcaccagtg 4201 ggtgacgccg agaaacaact acaattgcaa gtcacaaccc tagattattc tgagtatctg 4261 ggacggattg tcattggtaa aattcacaat ggcattatcc gcatgggaca gcaagccggc 4321 ttggtaaaag aaaacggcga aattgtcaag ggaaaaatca ccaagttgat gggctttgaa 4381 gggctgaagc ggattgaaat cacagaagca tccgctggca atatcgtagc tgtggcagga 4441 ttcgccgatg ccaacattgg ggaaacgatt actgacccga acgaacccca agcattgcca 4501 ctcattaagg tagatgaacc gacattgcaa atgaccttct gggtgaatga ttcgcccttt 4561 gcaggtcaag aaggaaagtt ggtgacatca cgacaagtgc gcgatcgcct ccttcgcgaa 4621 ctagaaacca acgtcgccct acgcgttgaa gaaaccgact cccccgataa attcctagtt 4681 tccggtcgtg gagaactcca cctgggcatc ttaatcgaaa ccatgcgccg cgaaggttat 4741 gagtttcagg tatcacagcc acaggtgatt tatcgcgagg tcaacgggca accttgcgaa 4801 ccttacgaac ttctggtatt agacattcct gaagatgccg ttggcagttg cattgagcga 4861 ctggggcaac gcaaaggcga aatgcaagat atgcaagtgg gtggtaacgg acgcacccag 4921 ctagaattta ttatccccgc ccgtggatta atcggttttc ggggtgaatt catgcgtatg 4981 actcgtggtg aaggtatcat gaaccacagt ttccttgact accgttctat ctctggtgat 5041 attgaagctc gtaacaaagg tgtcttaatc tcatttgagg aaggcgtttc cactttctac 5101 gccatgaaga acgccgaaga tagaggcgta ttctttattg tacctggtac gaaagtttac 5161 agaggcatga ttgtcggaga acacaatcgt ccccaagact tggaattgaa cgtctgcaag 5221 acgaagcaac tcaccaacca ccgtgcatcc ggcggtgatg aactggtaca gctgcaagca 5281 ccagtcgata tgagtctaga gcgtgcttta gagtacatcg gaccagatga attggtggaa 5341 gtcactcctc agtcaattcg tctgcggaag ttagcgaaaa agctggcgaa acgctaaaca 5401 ctatgaagtc ctgagtgctg agttttgagg aaatatcact caaggctcag cactctttgt 5461 gtatcaggac ttatgcaaaa ataatctagt tgtgtcattg cgacgtaagt cgtaactgca 5521 gtcttgttgt ttagtaactc gtccgtaagt cctatgtatg tctaccgcaa gaatcggtcg 5581 cgatgaaatg cattcaaatt tcacgcatga aaatttcaat tgcgtgcaaa aaaacggcaa 5641 ctataatctg atatccagat tctgccgatt ctacgatgct ctgctccccg tacagttttt 5701 gccaattcaa aatactgaaa tgtgccctat gttctaagcc ttgggttacg aactcgcgcc 5761 tcgacacaca ctttacaaaa tttaaaatcg ctcaaaagaa ggggagtcat ccctagttct 5821 agctcccata ggctgcggta caatgcaaag 
aaatgtttaa attatgagtg ccgatcccat 5881 gatttcacca gaagttaaca caaaatccag cgaacatagc ctagaggcga tgcggcattt 5941 ttcggaacaa tacgccaagc gcactggaac gtacttctgt tctgaaccct ctgttaccgc 6001 agttgtcatt gaaggacttg ccaagcataa agacgaactc ggagctcctt tatgtccctg 6061 tcgccactat gaagacaaac aagcggaagt gaaagctact ttttggaact gtccatgtgt 6121 accaatgcgc gagcgcaaag aatgtcactg tatgctattc ctcacatcag ataatgattt 6181 cgctggtgat aagcaagaaa tttctttaga aacaatcaag caagtacgag atggtatggc 6241 atgaacgaag aaatgcctga tgagttttgg caaggtgtag agcagtttaa tactggtgag 6301 tattatgctt gtcatgatac cttggaggct ctgtggattg aagcaacaga gccagacaaa 6361 actttttatc aaggcatttt gcaaattgcc gtggcgctgt atcatttagg caaccataac 6421 ttacgtggtg cggtcatttt gttgggtgag ggtagcaatc gcttgcgacg ttacccatca 6481 gaatatggcg gcattgatgt agatgaatta ttaattcaga gtgcggcatt gttgaaggca 6541 ttacaactaa cacagccaga aaagattgcc gctcttaacc ttggtcaaga tgaaggcttg 6601 cccttgccta ggattataag aattaataat gaagaagcgt aaagaattaa aaaaattatt 6661 aaaaaaatct taacaaatta ttaaaatact gtagattcaa gctagctggc agtggatgca 6721 aatctgtgtg ctttgatagt cagctagacg accagatgaa accttaaaag tcgagacagc 6781 gtcaatactt ctagttttag ccagcagcta tcctcgaata gcccagaacg cgtattttat 6841 gatctctggt taaagtagtt gagaaaattg gaataacaaa aaaatcataa cttctaaatt 6901 gctatgatgc cccactatca attgcctcac tttcacatgc gtcgtttggc actagcttta 6961 atactaccag ttgtggttct gggcgcaatg acgtttccta accaactgca aacagctact 7021 gcacaaactc ctcaaaaatc tgggcaaaat cgtcctcttt acatccgtgg aaaggtacaa 7081 gagtataact ctaagacaca agtggcaact attcgcggtg aggtggaact gtcttatcct 7141 gcacgaggta ttcaagcaac tgctgcgcaa gcacaatact ttacccgcga acgtcagatt 7201 attctcagtg gcaatgtcta tgttttgcaa cagggcagca atagtatcaa ggctgactca 7261 gtgacgtatc tgattgatga agcgcgattt gttgctacac ccaagcaagg tagtcaggta 7321 gaatccatct acatggtcaa cgacactcca gttaacaacc aagctcctgg atctgctcct 7381 gcaacagcac ctgtgaagaa atctaattag cacagataat tcataaagga cgcgagagag 7441 tgaaaattgt cttagaaaat attcataaaa cttatggcaa gcgagtcatt gttaatcgcg 7501 tcaacctttc agttgcccaa ggcgaaatcg ttgggttact 
tggtcccaat ggcgctggta 7561 aaacgacgac tttttacata gccacaggtt tagaaaaacc agagaaggga aaagtctggt 7621 tagacaatat agatatgact ccactaccaa tgcacagaag ggcacgacta ggcattggtt 7681 atttggcaca ggaagcaagt gtttttcgcc aactcagcgt gcgagataac attcttttgg 7741 tgctagaaca aacgaatgtg ccaagctggg aatgggcaag gcgagtggat accttgttgc 7801 gggagtttcg tttagaaaaa gtcgctaaca acaaaggaat tcaactttct ggaggtgaaa 7861 gacgccggac agaattggca agagctttag cagctggtag agaaggacca aaatttttat 7921 ttttggatga accatttgcg ggagttgacc cgatagcagt ctcagaaatt cagcaaattg 7981 tggcgcaact gcgcgatcgc gatatgggca ttttaatcac cgatcacaac gtccgcgaaa 8041 ctctagccat cacagatcgc ggatatatta tgcgtgaggg gcaaatcctt gcctctggca 8101 cctccaaaga actctacaat aacccccttg tgcgccaata ctatctgggc gataacttcc 8161 aggcgtaaat taaaaaaata aatcataaat aaaaaatatt ttctatttac tctttttaat 8221 tagaatttcg taatctggct ttattttgtt taatatttat tttgatagct gttacatttt 8281 tgatgcttca atcttcttgt gggtgagatt gttgctatcc taaaccagga tttttgactg 8341 gagtttgcat agtatggtgc ataccaaatg tttgatgagt cgtgagaaaa aacctagtac 8401 gaatgactaa cgacccttcg ggtatctcct tcggagacgc tgcgcgttag ccctttgggc 8461 gtgcgctttg cgcatacgcc agtcgcctgt tgtcgggaaa acgccagatg ctccacttgg 8521 ggagacccca agaccgcact ggctcccctc ccgcagcgct ggtctcacta atgactaata 8581 actaataact aacgactaat aactaataac taatgcctaa atttcagtct ttctacacca 8641 tcaattcgct gctgcctttt tctattatgg atcgctactt ggcgagggaa ttgctgccga 8701 catttttgtt tggtgttggg gctttcgctt caatcggcgt aacaatagat tctatgttcg 8761 agctaatacg gaaaatcgtg gaatccggac taccgataag catcgccgtt caagttttta 8821 cgttaactat gccagagttt atcgttttag ctttccccat gtccacgctc ctggctactt 8881 tgatgactta cagtcgtctt tccagcgata gcgaacttgt agctttacgc ggctgcggag 8941 ttagtgtgta tcgcatggta ctcactgctg tgatgttgag tttggtcgtt acaggaatga 9001 catttgtgtt taacgaacac cttgcaccag catcgaaata ccgagcaact caaatcctga 9061 acgcagccct caagtccgaa acaccaacta ttgttaagcg gcaaaatatt ttctaccccg 9121 aataccagaa ggtgaaaaaa aaggatggta gcggtaatag aaaaatctta acacgcttat 9181 tttacgctga tgaatttgac ggcaaagaga tgaaaggctt gacaattatt 
gaccgttcta 9241 cagaaggttt gagtcaaatt gttgtatcag aatcaggcga gtggaatcca tctcaaaatg 9301 tctggaattt tcacaacggt accatctatt tggttgcccc tgaccgctct tatcgtaata 9361 ttctcaggtt tgaacaccaa caaataaaat tgccccggac tgctcttgat ttggcacaaa 9421 aaagccgaga ctatggcgag atgaacattt ctcaagcact ggaacaactg gaggttgaac 9481 gtctcggtgg tgatgaacaa aaaattcgca aactccaagt caggattcaa caaaaaattt 9541 ccttaccatt tgtctgtgtc gtttttggtt tggtcggtgc agcaatggga accgtacctc 9601 agcgtactgg aaaagggaca agttttggca tcagtgtcat agttattttt agttattatt 9661 tgctttggtt tattactggt gctttaggag aagctggtat tttttctccc ttggtagctg 9721 cttggatgtg taactttctg ggatttggtg taggtttctt gttattgagg cgagttgcac 9781 aaaaatagtt ttttgacaac aatcacacct ggggttaata aattacaatt gtggagtagg 9841 taggataaaa aaaactccat tagtaaaccg ctggtctaaa ccttttttct tttgccaatt 9901 tatttgtaaa cttagactct attgcccatg tcacgccacc accatcctcc cgcctactta 9961 cgctatctca aagctagatt atggtattta gcacgaccta gtttttgggg aacagcaatt 10021 tttttatctg tcgtagggct ggctattaaa gaatactgga cacaccccga cttttttact 10081 caatggcaaa aaaatctagt tgctgataac aagcctgtca actcctctgt tttagaggaa 10141 gacaaggcaa tgacaccgga cattaataac ttaccaccag tcccgtataa tattggcaat 10201 aaccgagcaa acccagcagc aaaaaatctt gcctctaaac aaaacattca agcaaaaaac 10261 agcaaaactt tattagaagc cttaaatagc aagagccaga cttctacaag tgataataaa 10321 ttaaagctta atgtcaaaga aggtgattcc acacccgtta tagagttgga aaatcctttc 10381 ctagcacaag cacagaattt attacaattt aaaaatcttc caaatggtag caattcacta 10441 ggagtgaatg ccttaactcc ttctttatat caacaaaatt tagcacaaaa ttcctttggt 10501 ttgggaacag gattacctaa tcaaaccact tctaatcaaa acggcgtctc agaaagtgcg 10561 ttgcaaacag cgcttaacca agtaaagagc cagcaatcta caaactcaaa tcgtatcacg 10621 tcaactcaga aaaacccctt tgagccatca tctttactat ccaccgaaaa cagaaatagt 10681 cagactcttg ttccaagcac tgggttcaat acaaacacaa tcaatcccct gaatgtagga 10741 acaggttcta cgcagtcggg agcgatttct ggaacaagtt acatacaacc aggaactact 10801 cctggaacaa cttatcctca accaagtttc aataatctgc aaccacaaaa tttatctggt 10861 ggaacaagtt atatacaacc aggaactgga aatcaactac 
aaacttccat tcctggaaca 10921 gcttatcctc aaccaagttt caataatctg caaccacaaa atttatctgg tggaacaagt 10981 tatatacaac caccaactgc aaaccaactg caaacttcca ttcctggaac agcttatcct 11041 caagcgaaac cagaattagt aggtataccg caaactccaa gtgcaattcc taatcgttct 11101 gctattattc ttgaccaagt tataaagaat aggttgaata acataaatac tgctcagcca 11161 ttgtctactg ttacacaacc aacatcagta cccccgtctt ccaatacact caatgctccc 11221 aactacacct caaatcaagg aaatgtaatt aataatgccc ctttaacgtc aaataatgat 11281 ggaagtgtag gcatacagca agcgccggct ccagtgccgc agtatattta tccatcttct 11341 actcagattc caggacagta tactggtggc ggacagataa ataggtacta gtaaccgtaa 11401 gatctatgtc acgaacctat ttaaatgttt cataccaaga aaaagacgaa gcaaagaaac 11461 taggagccag atgggatgct agtgctcgtc gctggtatgt accagaagga atggatacag 11521 cacctttggt gcgctggtta cctgttgaga aaccggagtc tttgaagcct aaatcaacag 11581 aaccccgttt gacggtagaa cttgttcctc aaacttgttg gtatagcaat gtgcgctcag 11641 aggtttcaac gggagagtgg gacaaactca agcgagtgac ttttgagcga gcgaattacc 11701 gttgtgaagt ttgtagtggt cgtggtttaa aacatcctgt ggaatgtcat gagatttggg 11761 actatgacga tgagcgacac atccaaactt tgaccggtct gattgcatta tgtcctgctt 11821 gtcacgaatg taaacatatg ggatttgcaa atatcaaagg tcgaggggaa ctcgccaccg 11881 atcatttggc gaaggtgaat ggttggacaa tccaccaggc aaagacttat gtgcgagaat 11941 gcttccaagt ttggcaagag cgtagccagc atgaatggga tttagatatc acctatctcg 12001 agcaatttga gatttcgagt cattcttcat cgcccagacg tggcgacaga cctgggaatt 12061 gagcagtttt gggtaactac agcgcgtgcg ccgcgctatt atttggtatc ctagaaagct 12121 ggactagcta tgacttaaac tatgcctgcg cctaccacca cccctcttag tactgccttt 12181 cccctgacag ccgtagtcgg tcaagaagcg attaaatcag ccttgctgct tgctgcagta 12241 gacccagggt tgggaggagt ggcgatcgca ggtcgtcgcg gtacggcaaa atctgtgatg 12301 gcgcgtgcta ttcacaccct acttccaccc attgaagttg tctctggttc tatcagcaat 12361 tgcgatccca atcgtccaga agaatgggat gaccaacttt tggcagattc ctccctcagt 12421 ctccctacaa gcgaagatgg aagcagcgag gtaaaaaccg aaatcatccc cgcaactttt 12481 gtacaaattc ctttgggagt gacagaagac agactcttgg gttctgtaga tgtggaacag 12541 tctgtcaaac agggtgaaac 
tatttttcaa ccaggtttac tcgcccaagc acaccgaggc 12601 gttctctaca tagatgaaat caatttatta gatgaccaaa tatccaatca gcttttaaca 12661 gtattatctg agggacgcaa ccaaattgaa cgcgagggca ttagttttca gcatccctgc 12721 aaacccttat ttatcgcgac ttacaaccca gaagaaggag cattgcggga acatttatta 12781 gatagaattg cgatcgccct cagcgctgat ggtgtcctcg gtttagatca aagagtccaa 12841 gcagtcgagc aagctattgc ttactcccta tctcctcaag aatttctcca acagtatagc 12901 gaagacctag acgccctcaa aacccaaatc atcctggcgc gggaatggtt aaaagacgtt 12961 tgcatcaccc accaacaaat ttcctactta gtagaagaag caattcgtgg aggcgtacag 13021 ggacaccgcg ctgagttatt cgccgtgcgt gttgctaaag cctcagctgc tttggatgga 13081 cgcacacaag tgaatgctga cgatttgcgg cgtgctgtgg aactcgtgat agtcccacgg 13141 gcgactgttg tgcagacacc tccaccccaa gaa // LOCUS NODE_2593_length_13039_cov_5.02418413039 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 13039) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 13039) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13039 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 193..993 /locus_tag="DP116_20955" CDS 193..993 /locus_tag="DP116_20955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aspartyl/asparaginyl beta-hydroxylase domain-containing protein" /protein_id="PRJNA477356:DP116_20955" /translation="MESFNEYHLDPNQFNFLNSFQDNWQVIRDEFTNFIKHASHEELQ FSYDVLGPKSKTIKTKGNSKYSAFGILFQGIFIEEYIQLHQIEYPNYRQNDASQKALA LREKYFPNLAKVIKKVNLINDNIIRNVYFGTFHPGLDIKLHVNDNPHMNRGYLGLIVP PGDIAMKICHEQLYWHEGKFMVLDHSYPHCPHNYTNYDRTVLVVDFLKTDKPREDLIR FEQEQVTQRMQDNPYSLGVFGKSDKAKTEDFIKYGLAHQLEWDKALGA" gene 1039..1338 /locus_tag="DP116_20960" CDS 1039..1338 /locus_tag="DP116_20960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317060.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_20960" /translation="MLSGIKQQVVVGKDGKIEIQTSELAEGTVVEVIVLVEQDVVETN ANQSIQQDETEYLLSTEANRYHLMTAIQNIETKTNLVSFTPEEWNEEYNIRSFGI" gene 1304..1567 /locus_tag="DP116_20965" CDS 1304..1567 /locus_tag="DP116_20965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017750039.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Txe/YoeB family addiction module toxin" /protein_id="PRJNA477356:DP116_20965" /translation="MKNITFAHSAFEQFNDWAAQDKKIHRKIITLINDILRQPFTGLG KPEPLKHELTGYWSRRITDEHRLVYEVTETEIIILSCRFHYDD" gene complement(1762..3180) /locus_tag="DP116_20970" CDS complement(1762..3180) /locus_tag="DP116_20970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873136.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase" /protein_id="PRJNA477356:DP116_20970" /translation="MSKPIEIRNPWTGKFDYVIIPLPPKLIVQQCNRLRRAQRGWLQL GLEGRIEVLQQWKQAIISGRDKLLEALVNDTGRLSISQLEIDSFLSSIDRWCKLAPEL LQETVKDTAISFIELQQTAVPYPLVGVISPWNFPLLLSTIDTIPALLAGCAVIVKPSE IAPRFTGPLLASINAVPKLRDVLTFIEGAGETGAALIEYVDLVCFTGSVATGRKVAEA AAKRFIPAFLELGGKDPAIVLSSANLDLATSAILWGSVVNTGQSCLSIERIYVAESIF EKFVEQLVAKVQRLELAYPRLENGEIGPIIAERQAAIISDHLLDAVEQGAVVHCGGEV ENYGGSWWCRPTVLTSVNHSMKVMTEETFGPIMPVMPFSTIEEAVNLANDSIYGLSAA VFASSEAQALEVARQIDAGAISINDAALTALMHEGEKNAFKFSGMGGSRMGPAALTRF MRKKAFLVKTKNMSDPWWFDSH" gene complement(3184..4242) /locus_tag="DP116_20975" /pseudo CDS complement(3184..4242) /locus_tag="DP116_20975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009756753.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="nitrilase" assembly_gap 3735..3744 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(4332..5093) /locus_tag="DP116_20980" CDS complement(4332..5093) /locus_tag="DP116_20980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316574.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="red chlorophyll catabolite reductase" /protein_id="PRJNA477356:DP116_20980" /translation="MIDQQPDLDNTALFEQLWGITNELRQKLEARFELHPDPSTKDLQ TYSSITGDAQGSLNTFSGPEIDRLVHSWIRNPKLGFSHIRLIIWLGPHIRVPHLACAF ATIPHLFFYMDYIPRSDMFTDLEYLDRYYEPVNQTYLTLLEDSRFEPHISKNVYMRQA QSPTSLCYTSKFSDEVCPLVRTTAHEMMDRWLLWIDKAESVAENERAALSERDLIILR TVIERDPDNKIAVPLLGAELTDKLVRALWGGDRPK" gene 5155..6090 /locus_tag="DP116_20985" CDS 5155..6090 /locus_tag="DP116_20985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316575.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NmrA/HSCARG family protein" /protein_id="PRJNA477356:DP116_20985" /translation="MSHSPDSTELNTGKVLLIGVTGGTGGNVVKGFLEQGVTNLRAIT RKIDLNRPSLAKMNDAGVELVEANLDDEASLAAAFAGIAAVYCHATSADSAKPDPQEV ERAKRVVQVAKKADIKHFVYNSAGGADRNSGISHIEQKYKVEQVLKNAGLPTTMLRAC LFMEEFWKKYTRPSILKGVFPFSIQPNKPIHLITTKDMGRVAAYVIKHPSNYIGQEIE LAGDVLTPKQMTQAFSNAQGIPVVYKETPAWIFLLLLRKELFDLIQWYRTKGYQADVQ RLREEEFPGLLTTFSEFLEETNWANQELTYESLGF" gene 6216..7151 /locus_tag="DP116_20990" CDS 6216..7151 /locus_tag="DP116_20990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015155238.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycerol acyltransferase" /protein_id="PRJNA477356:DP116_20990" /translation="MRNQGYDDKIFNPFSELLDKLSIFTSSLGDDSKASRRRFDGWSL SERNPEIIKAWMPIWEWFYRYYFQVETSGWHHMPSAGKVLIVGSHNGGLGSPDTSMFM YDWFRTFGYERLAYALMHPSAWDTPIFAVPGAPVGAIIAHPKMASAALRKDAALLVYP GGAQDLFRPYTWRNRIHLANNKAFIKLALREEVPIVPIISHGAHSTLIVLADFYQQMK QFHEWGFPWLLDGNTGVFPVYLGLPWGLGIGPLPNIPFPMQIHTHVCAPIVFERYGRV ASRDRTYVDACYEQVRAMMQAELDELVQKYDCSEG" gene complement(7254..7637) /gene="arsC" /locus_tag="DP116_20995" CDS complement(7254..7637) /gene="arsC" /locus_tag="DP116_20995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864783.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="arsenate reductase, glutathione/glutaredoxin type" /protein_id="PRJNA477356:DP116_20995" /translation="MFVCKKNSRRSQMAEGFARTLGDGKITVNSSGLESSYVDPTTVQ VMSEIGIDISHQTSKPLDNFKAEDYDAVISLCGCGVNLPEEWVIRDVFEDWQLDDPEG QDIETFRRVRDEVKERVVKLISSLS" gene complement(7705..8361) /gene="arsH" /locus_tag="DP116_21000" CDS complement(7705..8361) /gene="arsH" /locus_tag="DP116_21000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412686.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="arsenical resistance protein ArsH" /protein_id="PRJNA477356:DP116_21000" /translation="MTTHHKPRILFLYGSLRERSYSRLLAEEAARIIEEFGAEVRFFD PRELPIHNSVPDTHPKVQELRELSLWSEGQVWSSPEMHGNITGIMKNQIDWIPLSLGA VRPTQGRTLAVMQVSGGSQSFNAVNTLRILGRWMRMFTIPNQSSVAKAYQEFNEDGTM KDSPYRDRVVDVMEELYKFTLLLRDKVDYLTDRHSERKEKAAKETIRVANTALEVNAN " gene complement(8403..9095) /locus_tag="DP116_21005" CDS complement(8403..9095) /locus_tag="DP116_21005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864785.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aquaporin" /protein_id="PRJNA477356:DP116_21005" /translation="MKSFFKQISVHRYQTLAEAIGTFTLVFAGTGAVMVNDISKGAIT HVGISFVFGAIVVALIYAMGHLSGAHFNPAVTLAFWTSGFFPKHRVLPYILAQSLGAI VASALLLMSLGRVANLGVTLPLNGNWLQSLVLETVLTFILMLVIFGSGLDRRAHIGFA GLAIGLTVGVEAAFMGPITGASMNPARSFGPALVGAIWQYHWVYWVGPILGAQLAVIV YKQLSNDFQDIR" gene complement(9343..9672) /locus_tag="DP116_21010" CDS complement(9343..9672) /locus_tag="DP116_21010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316584.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_21010" /translation="MQSPSTTTPDLIVLGFHALSDPLRISVIELLRNKELCVCDLCDA LEVSQSKLSFHLKTLREAGLVRSRQEGRWIYYSLNIAQFAALEQYLSEFRRLYQILPA RLCQEPS" gene 9965..10852 /locus_tag="DP116_21015" CDS 9965..10852 /locus_tag="DP116_21015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859531.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carboxyvinyl-carboxyphosphonate phosphorylmutase" /protein_id="PRJNA477356:DP116_21015" /translation="MQPSSQKLRQLLERPEILIIPGVYDCLGAKLAEQLGFELVATSG FGIAASTLGVPDYGLMTATEILASTGRIAQSVSIPLIADMDTGYGNALNVIRTIKDAV QQGIAGVLLEDQEWPKKCGHFEGKRVISMAEHVGKIRAAVQARGDSGLVIIARTDARA PLGLEEALSRGRAYIDAGADILFVEAPQSVEELQKIASAFPDVPLVANIVEGGKTPQL SASQLQELGFKIVFFPVSALLAATKVMTACLRQLKEQGTTVDFQELVEFKEFQNLIGV PEYLQMEKQFTSVKDSTSV" gene complement(11107..11343) /locus_tag="DP116_21020" CDS complement(11107..11343) /locus_tag="DP116_21020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011243300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21020" /translation="MTTNLLQRITYNPDVCHGKPCIRGLRYPVELILELLSSGMTIDE ILADYEDLEREDILAALQFAVRLSQVKSIYKIAS" gene 11743..13039 /locus_tag="DP116_21025" /pseudo CDS 11743..13039 /locus_tag="DP116_21025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208327.1" /note="frameshifted; too many ambiguous residues; incomplete; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 12779..12788 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 3743 a 2847 c 2807 g 3622 t 20 others ORIGIN 1 agagaagaag aggaggatgt tcatatccta tttaggaagg ctgtataagt ttatagctat 61 aactctaagc atgagctttt aattaaaaaa tactctgttc tagtcaggaa ttccaatatc 121 gtatcggtag gcagaccaaa tctcatcacc atgtaacttt tttttattca atcatcttca 181 ggagaaaaga tcatggaaag ttttaatgag tatcatctag acccaaatca atttaatttt 241 cttaacagct ttcaagataa ctggcaagtg attagagatg agtttacaaa ctttatcaaa 301 cacgcatctc atgaagaatt acaattcagt tatgatgttt tgggacctaa aagtaaaacg 361 attaaaacta aaggaaattc taaatacagt gcttttggta ttttatttca aggtatcttc 421 attgaagaat atattcaact tcatcaaatc gaatatccta attataggca aaatgacgca 481 tcgcaaaaag cactcgcatt aagagaaaaa tatttcccaa atttggctaa ggtaatcaaa 541 aaagtcaact taatcaatga taacatcatc agaaatgtct attttggtac attccatcca 601 ggcttggata ttaagctgca tgtaaatgat aaccctcata tgaaccgtgg ctatctagga 661 ttgattgtgc caccagggga tatagctatg aaaatatgcc atgagcagct ttattggcat 721 gaaggaaaat tcatggtttt agatcatagc tatccacact gcccgcataa ttacactaat 781 tatgacagaa ccgtcttggt tgtagacttt ttaaaaacag ataagcctag agaagactta 841 atacgctttg aacaagagca agtcacacaa cgtatgcaag ataatcctta cagtttaggt 901 gtttttggta aaagtgataa agcgaaaaca gaagatttta tcaagtatgg tttagctcat 961 cagttagagt gggataaagc cttaggagct taatttcatc tgcacactgt ctggaaaaac 1021 ttattgcgag ggtttgacat gttaagtggc attaaacagc aagtggtggt tggcaaggac 1081 ggcaaaattg agattcaaac gtctgagtta gcagaaggaa ctgttgttga agtgattgtt 1141 ttagttgaac aagatgttgt tgaaacaaat gcgaaccaat ctattcaaca agatgagaca 1201 gagtatttgc tgtctactga agccaatcgt taccacttga tgaccgctat tcaaaacatt 1261 gaaacaaaaa ccaacctggt cagttttacg cctgaggaat ggaatgaaga atataacatt 1321 cgctcattcg gcatttgaac aattcaacga ctgggcagca caagataaaa aaattcatcg 1381 caagattatc accttaatta acgatattct tcgtcaacca tttactggac ttggcaaacc 1441 tgaacctcta aagcatgaac tgaccggata ttggtcacga cgaattacgg atgagcatcg 1501 
tttagtgtat gaagtgactg aaactgaaat catcattttg agttgtcgat ttcactatga 1561 tgactgaagc tgtttgcgat cgcccgtata cgcccagagg gttttctttg agaaaaccct 1621 ctccccaaat cgaagatttg gggcactctg ctgcgcaggg tggggcgtag ggcgcgctgc 1681 gccatcgcct attctcccaa ccatgcgata cgcaaaagct aaagccagag gcttgccttt 1741 tcacaacctt gctaagttca ctcaatgact atcaaaccac caagggtcac tcatgttctt 1801 tgttttcacc aaaaaagctt ttttccgcat aaaacgtgtc agcgccgcag gtcccatgcg 1861 tgatccaccc ataccagaaa atttgaaagc atttttttct ccctcatgca taagagcagt 1921 cagagctgca tcgttgatac tgatagcacc tgcatctatc tgacgcgcga cttccaaagc 1981 ttgtgcttcc gacgaggcaa aaactgctgc acttaatcca taaattgagt cattcgccaa 2041 gttcaccgct tcttcaattg tggaaaaggg catgactggc ataatcggac caaatgtctc 2101 ttcggtcatc accttcatag aatggttgac agaagtgaga actgtaggac gacaccacca 2161 agatcctccg taattctcaa cttcaccgcc acagtggaca actgctccct gttccactgc 2221 gtctagcaga tggtcgctga taattgctgc ctgtctttct gcaataatcg gaccaatttc 2281 tccattttcc agcctagggt aagctaattc aaggcgttgc actttggcta ccagttgttc 2341 aacaaacttt tcaaaaatag attccgcaac gtagatcctt tctatagaga gacacgactg 2401 tccagtgttg acgacagaac cccacaaaat tgctgatgtt gctaagtcta aattagcgga 2461 cgacagtaca atagccgggt ctttacctcc taattccaaa aaagcaggaa tgaagcgttt 2521 agctgctgct tctgctactt ttcgacctgt tgccacactt ccagtaaagc ataccaaatc 2581 cacatactcg atgagtgcag ctccggtttc acctgctccc tctataaaag ttaatacatc 2641 acgtaatttt ggaactgcat taattgatgc caacagtggt ccggtaaagc ggggagcaat 2701 ttcactgggt ttgacaatca cagcgcaacc agcgagtaat gctggaattg tatcaattgt 2761 tgaaagcaag agaggaaagt tccacgggct aatgacccca actagagggt aaggaaccgc 2821 tgtttgttgc agttcgataa aagaaattgc tgtatctttg acagtttctt gcagcaattc 2881 tggtgctaat ttacaccacc tatcaatgct agagagaaat gagtcaattt ctaattgcga 2941 tattgataat cttcctgtat cattgaccaa cgcttcgagc agtttatcgc gtccagatat 3001 aatcgcttgc ttccactgtt gcagaacttc gattcttcct tctaaaccaa gctgcaacca 3061 acctctttgt gctctccgca agcggttgca ttgctgtact atcagctttg ggggtagtgg 3121 tataatcaca taatcaaatt ttccagtcca ggggttgcgg atttctattg gtttggacat 3181 gacttatatt 
acccctagtt tagttaggcg ctctattgtt tcttgctgtg tttggataaa 3241 gtgtttgcgg tgtacttctt tttccagcat actgttagct gggtaaaagt gcgagtgatg 3301 ataactttct gcgtagagtt caaaccgctg tcgtgaaagt aaattattta accctggacg 3361 gcggcggtag cggcgtaaag cagccaagtc tatttcagca aaagctgcca tactctcccc 3421 ttcgcctgtc tcagctaaaa ccagccctcg atagtcaatg attttagagc caccatcggc 3481 agaagcgaca gggatgggga tattagcgat gccagcagta ttggatgaaa ctacgtatgc 3541 catgttttcc acagcgcgac agatttttgc cgcttctttg ggagcgcgtt ctttgccata 3601 aacttctgag gttgagtgca gaaaaatctc tgctccacgc attgctagac accgcgctac 3661 ttctggatat aaaatttctt ctgatgcgat cgcccccaaa ttcccaattg cagtctttgc 3721 aaccggaaac acacnnnnnn nnnncacacc ttcaagtcca taacagtcaa gatatttttc 3781 ccaaacatca tgcggtgtcg gagcaaacat agaattcagc cgccgatacc gcaagacaac 3841 tgaccctgat gggtcgataa caaagcaagt ttgaaagtac agctctggaa agttggggtc 3901 aagctcgtag gcgttacctg caagaaaaat actctgtttt tgtgcaattt tactgagtgc 3961 ttcatattca ggacccgcca tttctagaca agctttttcc ccccagacag ccagtgactc 4021 tcccattgga aagcctgtga ggaagtattc cggtagaaca attaaacgac agtcgaagcc 4081 gatgaaagcg atgctggcag caatttgtcg tgccaaacgg ttgatagtgt cttgcatcaa 4141 agaacgggct tcctgacggt cggttgcttg gttaacggcg tgacacgtta cctggagtgc 4201 taaggctcga aatgagtcaa tggtttctgg attggctgtc ataccctgct ttgtctaaat 4261 tatcattatg ttactctgtg atgcgatcgc gaagcgggcg ctctgcgcca tcgcctttct 4321 aaggcagtgc gctactttgg gcgatcgcca ccccatagcg ctcgcaccag tttgtcggtc 4381 aattctgccc ccaacagcgg cacagcgatt ttgttatccg ggtcgcgttc gatgaccgta 4441 cgcagtataa ttaagtcacg ctcagacaga gcagcacgct cgttttctgc caccgattct 4501 gctttatcta tccagagcag ccagcgatcc atcatctcgt gagcagttgt gcgaacaaga 4561 ggacatacct cgtccgaaaa tttgctcgta taacatagac tggtagggga ttgagcctga 4621 cgcatataca catttttgct gatatgcggc tcaaagcgtg aatcttccaa taatgtcaag 4681 tatgtttggt ttacaggttc gtagtagcga tccaaatact ctagatcagt gaacatatcg 4741 cttctaggga tgtaatccat gtagaagaac agatgcggta ttgtggcaaa cgcacaagcg 4801 agatgaggaa cacggatatg cggtcccaac cagataatca ggcgtatgtg actaaagccc 4861 aattttgggt tgcggatcca 
cgaatgcact aaccgatcta tttcaggacc agagaaggtg 4921 ttgagcgagc cttgggcatc tcctgtaata ctggagtagg tctgtaaatc ctttgtagat 4981 ggatctggat gtaactcaaa acgagcctct agcttctgtc gaagctcgtt tgtaataccc 5041 cacaattgct caaataaagc tgtgttgtct aaatccggtt gctggtcaat cattgtttgc 5101 atccctgtac tttgtataga cttaatttag accaaaaact tgaaggaagc gaacgtgtca 5161 cattctcctg acagcacaga attgaacacc ggtaaagtcc tgcttatagg agtcactgga 5221 ggtacaggtg gaaatgtggt caagggattt ctcgaacaag gagtgaccaa cctacgtgcc 5281 ataactagga aaatagactt gaatcgtcct tccttggcaa agatgaacga tgcaggagtc 5341 gagcttgttg aggcaaactt agacgatgaa gcttcactcg cagcagcctt tgcaggaatt 5401 gcggctgttt actgccacgc caccagtgcg gactctgcca aacctgaccc ccaagaggtc 5461 gagagggcaa agcgagttgt acaagttgca aagaaagctg acattaagca cttcgtttat 5521 aactctgcag gtggagcaga tagaaattct ggaatcagcc atattgagca gaagtacaag 5581 gtggagcagg tcctaaaaaa cgcaggctta ccgacaacaa tgttgcgggc ttgcttgttt 5641 atggaagaat tttggaagaa atacacccgc ccgtccatcc tcaaaggagt cttcccattc 5701 tctattcagc caaataaacc catacatctg attacaacta aagacatggg tcgcgttgct 5761 gcttatgtta taaaacatcc ctccaactac atcgggcaag aaattgagct tgctggcgat 5821 gtgctgactc caaagcagat gacgcaggcg ttttccaacg cgcaagggat tccagttgtc 5881 tacaaggaga cacccgcttg gatcttctta ctcctcctgc gaaaggaact gttcgatttg 5941 attcagtggt atcgcactaa gggttatcaa gccgatgttc agcgcttgag ggaagaggag 6001 ttccccggac ttttgactac atttagtgaa tttcttgaag aaaccaactg ggcaaatcag 6061 gaactcacct acgaaagtct tggcttctaa cctctcaacg ggttaccaac ctggtgaagt 6121 acgtttttca cggtattaac tctgagaaaa aagataatgg tctacctcaa ttgcttagga 6181 aacgctataa ggtaaaagta ttcactaata tttttatgcg caaccaaggc tacgacgaca 6241 aaatttttaa cccattctct gaattattag ataaattaag cattttcaca agcagcttgg 6301 gtgatgactc caaagcctct cgtcgtaggt ttgatggttg gtctttatcg gagcgaaacc 6361 ctgaaattat aaaagcttgg atgccaatat gggaatggtt ctaccgctac tattttcagg 6421 ttgaaacttc cgggtggcat catatgccat cggctggaaa agtgttgatt gtcggttccc 6481 ataatggcgg gttgggttca ccagatacgt caatgttcat gtatgactgg ttccgcacct 6541 ttggctatga gcgattagcc tacgccttga 
tgcacccgtc agcctgggat actcctatct 6601 ttgctgtgcc aggtgctccg gtgggagcta ttatagcaca tccgaaaatg gcgagtgcag 6661 ctctacgaaa agatgcagca ctactggttt acccaggagg tgctcaggat ttgtttcgtc 6721 cctacacttg gcgaaaccga attcatttag caaacaataa ggcgttcatc aagctggctt 6781 tgcgggaaga ggtgccaatt gtacccatca tttctcatgg tgcacactca actttgattg 6841 tgctagctga tttctaccag cagatgaaac aattccatga atggggcttt ccttggttac 6901 ttgatggaaa cacgggcgta ttcccagttt accttgggct accttggggc ttggggattg 6961 gacctctacc caatatcccg ttccccatgc agattcatac tcatgtctgt gcaccaattg 7021 tatttgagcg ctacggtcgt gttgcgtctc gcgatcgcac ctacgtagat gcttgttacg 7081 agcaggttcg tgcgatgatg caagctgaac tagatgagtt agtacaaaag tatgactgtt 7141 cagaggggta acaagaatat ataccaatcc tattttattt gtgaaaatcg cgcatgatca 7201 gattcctgac ttctttaaga agtcgggaat cttgttgttc acgaatgatt tgtttaactt 7261 aacgatgaaa tcaacttcac aactcgttct ttaacttcat ctcgaacacg gcgaaaggtt 7321 tctatatctt gcccttctgg atcatcaagt tgccaatctt caaacacatc tcttatcacc 7381 cattcttcag gtaaattgac tccacaacca caaagagaaa tcacagcatc gtaatcttct 7441 gctttgaaat tatctaaagg tttggaggtt tgatgactaa tatcaatgcc aatttctgac 7501 atcacttgga cagttgttgg atctacataa cttgattcca gtccagaact gttaacagta 7561 attttaccgt ctcccaatgt gcgagcaaat ccctctgcca tttgagaacg acgggaattt 7621 ttcttacaaa caaacatcac ttttttcatg gcttttgtgg ttattagttg ttgatagatt 7681 aacgagtgaa aacttttatg agcttcaatt agcgttaacc tcaagagcag tattcgctac 7741 tctgatggtc tcttttgctg ctttttcttt acgttcgctg tggcggtctg tcaggtaatc 7801 caccttgtca cgcaacagca aggtgaactt atagagttct tccatgacat caacgacgcg 7861 atcgcggtaa ggcgaatcct tcatcgttcc gtcttcgtta aactcctggt aagctttagc 7921 gacactagac tgattcggaa tcgtaaacat tcgcatccaa cgccctaaaa tgcgcagggt 7981 gttgacagcg ttgaaagact gagaaccccc actgacttgc atgactgcta aggttctgcc 8041 ttgagtcggt ctgactgctc ctaaacttag aggaatccag tcaatttggt ttttcatgat 8101 acccgtgata ttaccatgca tctcaggaga agaccatact tgaccctcag accacaagct 8161 taattcccgc aattcctgca ccttgggatg tgtatcaggt acactgttgt gaattggtaa 8221 ttcccgtggg tcaaaaaacc ggacttctgc accaaattcc 
tcgataattc gcgcagcttc 8281 ttctgctaga aggcgactat aggaacgctc tcgcaaagag ccatataaaa ataagattct 8341 cggtttgtga tgagttgtca taatttcagc ccaatataac agcctctctt ggatttcctc 8401 ggttatctaa tatcttgaaa atcgttagaa agttgcttat aaacgattac agctaactgc 8461 gctcccaaaa ttggtcccac ccaataaacc cagtggtatt gccaaattgc tcctactaac 8521 gccggaccaa aagaacgtgc tggattcata cttgcacccg taattggtcc cataaacgct 8581 gcttctactc caacagtcaa ccctatagct aaccctgcaa agccaatatg tgcgcggcga 8641 tccaatccag aaccaaaaat cacaagcatc aaaatgaacg tgagtactgt ttccaacacc 8701 aaagattgca accaatttcc attgagtgga agagttaccc ccaaattagc aactcttccc 8761 aagctcataa gtagtaaagc agaagctact attgccccca aggattgtgc caaaatatac 8821 ggtaacactc gatgtttggg aaaaaagcca cttgtccaaa acgctaacgt cactgctggg 8881 ttaaagtgtg cgccgcttag atgtcccata gcgtaaatca aagcgacgac gatcgcacca 8941 aacacaaagc taatccccac atgagtaata gcacctttac ttatatcatt taccatgact 9001 gcgccagtac ctgcaaatac caaagtaaag gtgccgattg cttctgctaa tgtctgatat 9061 ctatgtacag agatttgttt gaaaaaagat ttcatcaacc taaaactgct tttgtttgcg 9121 taagacctct gcaagcgtta tgttaagctg agcttctttg ccaaacacag ttcctacctt 9181 tccagtgttt accagtgcaa gtgccaactt gaccagtctt gcgttcattg acctcactcc 9241 cttgacgtat taagagtata tatccatcat ttcaaaaaaa gttgatttgt caacctatag 9301 ggtgagccat gttacagaaa ttaatcaaaa aaagttgata tttcaggatg gctcttgaca 9361 caaacgagca ggcaaaattt ggtacaagcg gcgaaactct gacaggtact gctctaaagc 9421 agcaaattga gctatgttga ggctatagta aatccagcgt ccttcctgac gagagcgaac 9481 caaaccagct tctctgagag ttttcaagtg aaaagacagc tttgactggc tgacttctaa 9541 agcatcgcac aagtcacaca cacaaagctc tttatttcgc agtagttcga taacgctaat 9601 tctcagtgga tcggaaagag catggaaacc gagaacaatt aaatcaggag tggttgtgga 9661 gggggattgc attaataaaa attaaggtga aagtgttcac tctctattgt ggaatagaaa 9721 attgaagttg caagtattaa ctctgggtaa aatgcaaagc tttataccta aaaatcgtgc 9781 taataccgtg atatacgcaa gtatttattt gattttttaa tttttgtttt tgaattttga 9841 attgttcatg ccatgcacct gttgtgctaa tctaaacaaa cagttatcag ttatcagtta 9901 tcagttatca attgtccact gttcactgtt cactgttccc tgttccctgt 
tccctgattt 9961 attcatgcaa ccttctagcc aaaagctgcg acaattacta gagcgtccgg aaattctcat 10021 cattcctggc gtttacgatt gtttgggagc aaaactggct gaacaattag gttttgaatt 10081 agtggcgact agtggctttg gtatcgccgc ctctacactt ggtgtaccag actatggttt 10141 aatgactgca acagaaatac ttgccagcac aggacgaatt gctcaatcag tgagtatacc 10201 tttaatcgct gatatggaca ctggctatgg taatgcatta aacgtcatcc gaactataaa 10261 ggacgccgta cagcaaggaa ttgcgggagt tttgctagaa gaccaagaat ggccaaaaaa 10321 atgcggtcac tttgaaggaa aacgagtcat ttcgatggct gaacacgttg gcaaaatccg 10381 cgcagctgtg caagcgcgtg gtgacagtgg tttggtgatt attgctcgca ctgatgctcg 10441 tgctcccttg ggtttggaag aagcattgag tcgtggtaga gcttatattg atgctggggc 10501 agatatcttg tttgttgaag cgccgcaatc tgttgaggaa ttacaaaaaa tagcttctgc 10561 gtttccagat gttcctctgg tagccaatat tgttgaaggt ggcaaaactc ctcaactctc 10621 ggcgtcacag ttgcaggagt taggttttaa gattgtgttt ttcccagttt ctgctttact 10681 tgcagcgaca aaggttatga ctgcttgctt gcgtcaattg aaagaacagg gaactacggt 10741 tgattttcag gagttggttg agtttaagga atttcaaaat ctgattggtg tacctgaata 10801 tctgcaaatg gagaagcagt ttacaagtgt gaaagattcg acgagtgttt gatacttgct 10861 tcaaatttaa taggatctac gcggaagaga tcccccaacc cccttaaaaa gggggctttt 10921 aagattcccc cgtttttaag gctagggggg atcgaggttt cgggttttca gtgcgtaata 10981 tcaaatccgt ttgtatcaga attatttctc cttcctcttt ctctgagctc tctgcgtctc 11041 tgcggttttt taatcattca gagaatcgag caagtcgaac aggtaattga gcgtcaacaa 11101 gaaatttcac gaggcaattt tatagatact ttttacttgg ctaaggcgaa cagcaaattg 11161 cagagcagct aaaatgtctt cccgctctaa atcctcataa tctgctaaaa tttcatcgat 11221 tgtcatgcca gagctaagta attcgagaat caattctaca ggataacgaa gtcccctgat 11281 acaaggcttg ccgtgacaca catcaggatt ataggtaatg cgttgtaaaa ggttagtggt 11341 catattgttt tttcgtaaag tatataaggt tgtatatgtc taattttgca ttgagtgttg 11401 aactgtttat gattatttat tctattcttt tttgtgctac tctataaata atggaaaagg 11461 tgtgcgcaga tagacccctc ctcacaatga tgcaatagct cgtaaaggca agtggtcgta 11521 aaaagtatta aaataagggg cttgtgtcct tgttccaggg tagggatgca ccacaaagaa 11581 aattttctct ttcccttaca cccttacacc 
cccatacccc tacacccttt tgttggtcac 11641 aaatgcgaga ctcagcaatc tgccaatgtt tccgtaatca cgggcaatgc aattgaagca 11701 tcttaatata tatatagtta gataaaacac ccgcgcctat ccatgaacga acagcgccag 11761 caagcctatc tcaacctgat tgaaagactg ctgaattgcc cgagtgagga agaagcagag 11821 attttagaag cgaaccaaga tttacttgat gctgaattct tggggatgct ggaagctgtg 11881 gcagaaaatt tctcaccgca gggagatgaa aatacagcga gttggttgag aaatctggca 11941 aatcagttga gggaagcgtt gaatttatcc tcaccaactg ctgcgaacac ttcagaagaa 12001 gacttacaag cttattggca atttttgatg gaggtactgc aagcaaccgc agaaagccga 12061 ggtgaggcgc aggtagttta cccgttgctg gtagaaaaca caaaatatct taatctgaat 12121 ttagcggaag tcttgcggcg ttgggcgaca aataagctgg cggaagttga gacggatgag 12181 gcacaattca tcgctggagt tattgttaat tttagtaatc tgattcagca attccccttg 12241 ggtaacaaag ctagcaacat ggaaattgcg atcgctggtt atgaagtcgt gtcaaccgtg 12301 ttcacccgcc aaagctttcc ccaagattgg gcaatgacgc aaaataatct cggggctgct 12361 tactccaaca gaatacacgg ggagaaagca gagaatcttg agagtgcgat cgcttcttac 12421 actgcagcat tgtccgttta tacccgccaa agctttcccc aagattgggc aatgacgcaa 12481 aataatctcg ggaatgctta ccgtaacaga atacacgggg agaaagcaga caatcttgag 12541 agtgcgatcg cttcttactc tgcagcattg tccgtaagaa cccgccaaag ctttccccaa 12601 caatgggcaa tgacgcaaaa taatctcggg cttgcttact ctgacagaat acacggggag 12661 aaagcagaga atcttgagag tgcgatcgct tcttacactg cagcattgtc cgtttatacc 12721 cgccaaagct ttccccaaga ttgggcaatg acgcaaaata atctcgggaa tgcttaccnn 12781 nnnnnnnnca aagctttccc caagattggg caatgacgca aaataatctc gggaatgctt 12841 acctttacag aatacgcggg gagaaggcag agaatcttga gagtgcgatc gcttcttaca 12901 ctgcagcatt gtccgtaaga acccgccaaa gctttcccca agattgggca atgacgcaaa 12961 ataatctcgg ggctgcttac tccaacagaa tacacgggga gaaagcagag aatcttgaga 13021 gtgcgatcgc ttcttacac // LOCUS NODE_2608_length_12977_cov_5.23951412977 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12977) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12977) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12977 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(214..1449) /locus_tag="DP116_21030" CDS complement(214..1449) /locus_tag="DP116_21030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128830.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aluminum resistance family protein" /protein_id="PRJNA477356:DP116_21030" /translation="MNSSEQLREAEQALLQIFSGIDAQVKQNLQRVLYAFRHNHVGAH HFAGVSGYGHDDLGRETLDRVFAVVVGAEAAAVRVQFVSGTHAIACALFGVLRPGDEM LAVVGAPYDTLEEVIGLRGQGQGSLLEFGIRYQQLDLTLEGTINWQALSHAVKDKTRL VLIQRSCGYSWRPSLSIADIEKIIHLVKQQNPNTICFVDNCYGEFIETQEPTHVGADL IAGSLIKNPGGTIVTAGGYVAGRADLVEAAACRLTAPGIGSYGGATFDQNRLLFQGLF LAPQMVGEAMKGTHLTGYVFDKLGYPVNPAPFAKRRDVIQAIKLGSAQKLIAFCRAIQ QNSPIGSYLDPIPDDMPGYESKVVMAGGTFIEGSTSEFSADGPLREPYVVYCQGGTHW THVAIALEAVIEAVGEANT" gene 1536..2366 /locus_tag="DP116_21035" CDS 1536..2366 /locus_tag="DP116_21035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197143.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl-CoA desaturase" /protein_id="PRJNA477356:DP116_21035" /translation="MTIATSTKSQLNWLHITFFAGLHLGVLFALFPSNFSWKAFGVFL FLYWLTGGLGITLGFHRLITHRSFETPKWLEYFLAFCGTLTCQGGPIDWVGMHRIHHL HSDKEQDPHDSNKGFWWSHIAWMFYNSPAFADVPRFTKDIGDDPFYQFLQKNMILIQV ALGIVLLLLGGWSFVVWGVFVRLIFVWHCTWFVNSATHKFGYRTYDTSDRSTNCWWVA LLTYGEGWHNNHHAYQFSARHGLKWWEIDLTWMTVQLLQLLGLATNVKLAPEKAVSGQ " gene 2597..3649 /locus_tag="DP116_21040" CDS 2597..3649 /locus_tag="DP116_21040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty acid desaturase" /protein_id="PRJNA477356:DP116_21040" /translation="MTISTIESHKSAHERGLSDLKLKDIIKSLPRECFEQNRRKAWTK VLLSVLMVCLGYFSLIIAPWFLLPLAWIFTGTALTGFFVIGHDCGHRSFAKRRWVNDL VGHVMMMPLIYPFHSWRIQHNYHHTHTNKLDEDNAWQPITTEVYESCGKISQWGFIAF LRYRLWWIGSIIHCGVLHFNWSLFRKKDQASVKFSVAVVVLFAAIAFPILIATTGIWG FVKFWLLPWLVYHFWMSTFTLVHHTAPDVPFVAESKWNQAIAQLNGTVHCDYPSWVEI LCHDINVHVPHHISTAIPSYNLRMAYRSIKENWETYLHDECRFSFSLMKRITDQCHLY KADSGYQSFKDYFAGQ" gene 3839..4918 /locus_tag="DP116_21045" CDS 3839..4918 /locus_tag="DP116_21045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty acid desaturase" /protein_id="PRJNA477356:DP116_21045" /translation="MQSNIIKTNNPHEVESLEDTTKLPFTLQDLKTAIPSECFQPNVG KSLFYFFRDILIIGLLYAIAHFLNSWFFWPIFWVMQGTMFWALFVVGHDCGHQSFSNK KWLNDLIGHLCHTPILVPYHGWRISHRTHHKNTGNIDNDESWYPVSESQYKVMDWVEK AGRYYVFLLAYPVYLFMRSPNKEGSHFFPSSPLFKPSEKWDVITSTTLWTLMVALLGF LTYQWGWMWLLKYYVGPYLVFIIWLDLVTFLHHTEPDLPWYRGEDWTFLKGAISSIDR DYGFINHIHHDIGTHVAHHIFLNIPHYNLKKATEAIKPIMGDYFRKSDEPILSALVRS CNTCHFVPDTGGKVYYTSNKKLAKS" gene complement(5056..6153) /locus_tag="DP116_21050" CDS complement(5056..6153) /locus_tag="DP116_21050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006617265.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-CoA desaturase" /protein_id="PRJNA477356:DP116_21050" /translation="MTQTQTRVSFAKNVGFRKELNKRVDAYFESENIAKRDNLAMYLK TAIILGWVISAWTFTLFGPPIMWMKLFGCVLLGLGIAGVGFSVGHDANHGGYSSHKWV NSIFGLTYDVIGLSSSLWRFRHNFLHHTYTNILGHDVEIHGDGLVRMSPYMEHKWYHK FQHFFIFIVYAIIPFYWSVADVNLALFKRRYHEHVIPVKPLEILILLVGKLVWLGVFV GIPLVVGYTPLQVIVGFSIAYITYGLLICNVFMLAHVLAPVEFLQPHPESNHIDDEWA IAQVKTTVDFAPKNSFLNWYLGGLNYQVVHHLFPHVCHIHYPKIAPILAEVCVEFGVP YSVYPTFGEALVANFRWLKLMGTSPPESTAL" gene complement(6339..7604) /locus_tag="DP116_21055" CDS complement(6339..7604) /locus_tag="DP116_21055" /EC_number="1.2.1.41" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111526.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutamate-5-semialdehyde dehydrogenase" /protein_id="PRJNA477356:DP116_21055" /translation="MTVDAFDVSPEPIERARRAHQAALKLGTTKGVDRSRAVLAMAQA MTRSFDDILEANTLDLEASREMAVPDLILDWLKLTPKRLEMAVDILQRLGELSDPLRR VRHADYQLDDSQTYTQLMPLGVIAFIYEAFPELGAIAAALCIKTGNSLILRGGTEASH SNAAIANALLSAVTEVGLPPGCLQLITAEEGSSIRDIVTQDQCLNLVIPYGRSSLVQQ VARQSTAPVLKSAMGNCYLYWSPNGSLEMVRWLIIDSHQSEPDPVNAIEKVLIHRQAL PSSLLTLWNSLKEKGFEIKGDAELVEAFPQLQLAKEAEWGSAFLTKTVAFKLVDSLEA AIAWINQYSSGHADCIVTESYQESRQFALGVNSASSYINASPRFSRNPSRGDAVFLGM SNQKGHRRGLISLESLTTVKHIVQGNGRF" gene complement(7609..8016) /locus_tag="DP116_21060" CDS complement(7609..8016) /locus_tag="DP116_21060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21060" /translation="MQPQKADKIDFNSEYPCPCRRRGQLIPITLTEAFGCNRCQQIFV VEDNGYVLEQLSTTYPYKRAWRWRGNSWQVVHPRLGESYLPVALGIIFVLVIIWLPLA LRLANGYSIIAWAMVAVLLAILPALMVWLTYRR" gene complement(8486..8557) /locus_tag="DP116_21065" tRNA complement(8486..8557) /locus_tag="DP116_21065" /product="tRNA-Gly" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(8523..8525),aa:Gly,seq:gcc) gene complement(8658..9227) /locus_tag="DP116_21070" CDS complement(8658..9227) /locus_tag="DP116_21070" /EC_number="2.5.1.78" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745133.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="6,7-dimethyl-8-ribityllumazine synthase" /protein_id="PRJNA477356:DP116_21070" /translation="MAVFEGTFTQAEPLRFGIVIGRFNDLVTNKLVEGCQDCLKRHGI DVNPHGSQVDYIYVPGSFEVPVVARQLAVSGRYDAVICLGAVIRGQTPHFDYVSSEVA KGIAAAAFQTGVPVVFGILTVDSMQQALERAGIKSNHGWDYAMSALEMASLMRQLRSN LAEPFSGHSLVSIAEKCQRRTAVCRIVSD" gene complement(9426..9614) /gene="psbZ" /locus_tag="DP116_21075" CDS complement(9426..9614) /gene="psbZ" /locus_tag="DP116_21075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652374.1" /note="controls the interaction of photosystem II cores with light-harvesting antenna; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein Z" /protein_id="PRJNA477356:DP116_21075" /translation="MTFIFQLALLALVSLSFVLVIGVPVAYATPQNWNESRKLLWIGS GAWIILVLVVGALNFLVV" gene 9786..12560 /locus_tag="DP116_21080" CDS 9786..12560 /locus_tag="DP116_21080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872716.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="poly(A) polymerase" /protein_id="PRJNA477356:DP116_21080" /translation="MDLILCHTTADFDSLGAAVGLTRLLPGSKIVLSGGSHPTVGDFL ALHRDEYELMERRSVNPNEIRSLIVVDTQQRDRIGKAAQWLDLPHINKIIIYDHHQEQ QSDIPATEINISPVGATTTVIVEQLQKQQISLTTAEATVMALGIHVDTGSLTFDYTTP RDALALAWLMQQGASLSVVSQYVDPGLSPQLQQLLKVSLEKIQRIYVRGYQIAWVNLK TDAFVPGLSSLASQLMELTEIDALLLFHEYGLAEDESRLTVIGRCSRGAAPLGRSHIQ GINLNELFQPLGGGGHSQAASLNLRGVDSEEILNQLLEGIKATIPHPPIARDLMSSPV RTIRPETTIAEAQRTLLRYGHSGLCVADAQGQLLGIISRRDLDLALHHGFSHAPVKGY MTVNMRTITPETTLPEIESVMVTYDIGRLPVLENGQLVGIVTRTDVLRELHQGREEEQ GSRGAGEQRSRGESHVKLPSLAELHSRLAPQLWQLLTKASREAEKRGWHLYLVGGAVR DLLLAEEASGTLLIKDIDLVVDGFHKAADVGAGVELAKALQQLYPTARLEIHGAFQTA ALLWHKDPELDSLWVDIATARTEFYPYPAANPEVEASSIRQDLYRRDFTINAMALRLT PPRAGALLDFFGGLLDLQAKQIRVLHANSFIEDPTRIYRGVRFAMRLGFHLEPQTEEY IRYAITSGVYDRTTRENSKTPALQTRLKTELKHILEAPYWKPALQLLAELGALQCIHP TLGLDEELLGQLRLLERCLRRFDSQQTLIHWQMRLEALIAHIAAEYRGRVARNLLLPE DSINRLEVLAQAQANVNKILPSLQAHREVDEVRPSQIQQLLRQYDVPMLILIAVQSSR GVRRKIWQYLTVWRNVQPILNGNDLRKLGYKPGPQYRQMLDDLLAATLDGVVCNRDEA EEFLNKHYPQ" gene 12699..>12977 /locus_tag="DP116_21085" CDS 12699..>12977 /locus_tag="DP116_21085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317384.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_21085" /translation="MIRAYAAFQQGEELKPFEYEPGPLGSEDVEIDVEYCGICHSDLS MLKNDWGMSQYPIVPGHEVVGTIANVGDAVKKLQVGQRVGLGWNSRSCM" BASE COUNT 3650 a 2910 c 2768 g 3649 t ORIGIN 1 ttcatccact catgtaaagg taggggtttt catcctcccg cttgcagccc gataacgata 61 tgtagagacg ttgcccttcg ggtatctcct gcggagacgc tgcgcgaacg gcagttgctt 121 caagtcgggg aacccgctgt tagcacctgc ctcacatgca acgtctctac cgtattggac 181 gcaattgaaa accgctctat tcccaaagga gatctacgta ttcgcttctc ccacagcttc 241 tatgactgct tctaaagcta ttgctacatg agtccaatga gttccccctt ggcaatacac 301 cacatacggt tcccgcaatg gaccatcagc tgaaaattca gaggtgcttc cctcaataaa 361 tgtcccgcca gccatcacga ccttactttc ataaccaggc atatcatctg ggatggggtc 421 aagataagaa ccgatgggtg aattttgttg aattgcccga caaaaagcaa tgagtttttg 481 ggcagaacct agtttaattg cttgaatgac atcgcgacgc ttagcaaacg gtgctggatt 541 gactggataa cctagtttgt caaatacata acccgtgagg tgagtccctt tcattgcttc 601 tcctaccatt tgtggggcga gaaacagccc ttgaaacagc aggcgatttt ggtcaaaagt 661 ggcacctccg taactgccaa ttcctggtgc tgtgaggcga caagcagcgg cttctaccaa 721 gtcagcgcga ccggcaacat aaccgccagc agttacaatc gtaccaccgg ggtttttaat 781 caacgaccct gctatcaaat cagcaccaac atgggtaggt tcctgagttt ctatgaattc 841 gccatagcag ttatcaacaa agcaaatggt attaggattt tgctgcttga ccaagtggat 901 aattttttca atatcggcta ttgataagct aggacgccaa gaataaccac aagaacgctg 961 aatcagcacc aaacgagttt tgtcctttac cgcatggctt aaagcttgcc aatttatagt 1021 tccttctaga gtaagatcta attgttggta gcgaatgcca aactcaagta gtgaaccttg 1081 accttgacct cgtaatccaa tgacttcttc aagagtatcg tagggtgcac caaccaccgc 1141 tagcatttca tcaccaggac ggagaacacc aaataaagca caagcaattg cgtgggttcc 1201 tgaaacaaac tgtactcgta cagcagccgc ctcagcgccc accaccaccg caaaaactcg 1261 atctaaagtt tctctcccca agtcatcgtg tccgtatcca gagacaccgg cgaagtggtg 1321 tgcccctaca tgattatggc gaaaagcata taatactcgt tgaagatttt gcttgacctg 1381 agcgtcaatt ccagaaaaaa tctgtaacag tgcttgttct gcttcacgca gctgttccga 1441 gctattcatt agttcctctt ttaagaaaac ataaaaaatt 
tgcggatttt tatatcgcgc 1501 agattaggct tgacaattcc tattcaggtt aatgcatgac aattgctaca tccacaaaat 1561 cccaacttaa ttggcttcat atcacatttt tcgccggttt gcatctcgga gttttgtttg 1621 ccctgtttcc aagtaacttt agctggaaag cattcggtgt attcttgttt ctctactggt 1681 taactggtgg cttgggtatt accttaggat ttcaccgcct gatcacccac cgtagttttg 1741 aaactcccaa gtggctggag tatttcttgg ctttttgtgg gacactcact tgccaaggag 1801 gaccaatcga ttgggtagga atgcatcgta tacatcattt gcactctgat aaagagcaag 1861 acccccacga ttccaataag ggcttctggt ggagtcacat agcttggatg ttttataatt 1921 cacctgcttt cgcggacgtg cctcgcttta ctaaagatat tggcgacgac ccattttatc 1981 agtttctgca aaagaatatg attttgatcc aggtggcgct gggtatagtc ctgttgctgc 2041 tgggtggttg gtcatttgtc gtttggggag tttttgttcg cctgatcttc gtgtggcact 2101 gtacttggtt tgtcaacagt gctacccata agttcggcta tcgtacttat gatacaagcg 2161 atcgctctac aaactgctgg tgggtagctt tactaaccta tggtgaaggc tggcacaata 2221 accaccacgc ctatcaattc tcagctcgcc acgggttgaa atggtgggaa attgacctaa 2281 cctggatgac cgttcaattg ctacaattac tgggtctagc tacaaatgtc aagctggcac 2341 cagaaaaagc agtaagcggt caatagttag caattcttat cttatttaat gttcaatttt 2401 gaattttgaa ttttgaattt tgaactactt atgctccccc gcttctaaaa ggctactaaa 2461 gcatcagcgg gggtttattg caggtgcggt agggtacatt agattgctat aatccaagac 2521 gcaaatatta aaaaactttg cgacagccac ttgtttagtg atttggtagt gaggtttgtt 2581 gtcttttcag gtattcatga ctatatcaac aatcgaaagc cacaagtcag ctcatgagcg 2641 tgggttatcc gaccttaagc taaaagatat tatcaaaagc ttgccacggg aatgttttga 2701 gcagaatcgc cgcaaagcct ggacaaaagt tctgctcagt gtcttgatgg tctgcttggg 2761 ctatttcagc ttgatcatcg ctccttggtt tctcttacct ctagcgtgga tttttacggg 2821 tactgcttta accggttttt ttgtcatcgg gcatgattgc ggtcatcgtt catttgctaa 2881 acgtcgctgg gtcaacgatt tagtgggaca tgtgatgatg atgccgttaa tttacccgtt 2941 ccacagttgg cgtattcagc acaattatca tcacactcac actaacaagc tggatgagga 3001 caacgcttgg cagccaatca ccacagaagt ttatgaaagt tgtggaaaaa tttcgcaatg 3061 gggtttcata gcatttctaa ggtatcgatt gtggtggata ggctcaatca ttcattgcgg 3121 agttttgcac tttaattggt ctttattccg caagaaagac caagcaagtg 
tgaaattttc 3181 tgtggctgtc gtcgtgcttt ttgctgcgat cgcattcccc atcctcatcg ccacaaccgg 3241 gatctgggga tttgtgaaat tttggctact tccgtggctg gtttaccatt tttggatgag 3301 tacttttacg ctggttcacc atactgctcc agacgttcct tttgtggcag aatctaagtg 3361 gaaccaggct atagcacagc taaatggcac agttcattgc gattaccctt cttgggtaga 3421 aatcctttgc cacgatatca atgtccacgt cccccatcat atttcaactg ctattccttc 3481 ttataatttg cggatggctt atcgcagtat caaagaaaat tgggagacgt acctgcatga 3541 tgaatgtcgg ttttcttttt ctttaatgaa gcgaattaca gatcagtgtc acctgtataa 3601 agccgatagt ggctatcagt cttttaaaga ctattttgca ggacaataag ctcatagctc 3661 taagaaatag attttggcaa ttctgtaaac acaagtcttg ttactgaaat tgccagatat 3721 taccttgctg ctttgcattt ccaaaactta gggatagcca aacccaaagt catgcgattg 3781 caaagtttag atcactttgg ctcaacaact tcctttttga ctccagtaaa aaaatcaagt 3841 gcaatcaaat atcatcaaga caaataatcc tcatgaggtt gaatctcttg aggatacaac 3901 caagctgcct tttactcttc aagatttgaa aacagcaatc ccatctgaat gttttcagcc 3961 caatgtcggg aaatcacttt tctacttttt tcgtgacatc ctgattatcg gtttacttta 4021 tgcaattgct cattttctaa attcttggtt tttctggcca attttttggg tgatgcaagg 4081 aacgatgttt tgggctttgt ttgtcgttgg acatgactgc ggacaccaat ctttttctaa 4141 taaaaaatgg cttaatgatt tgattgggca tctttgccat acaccgatac ttgttccata 4201 tcacgggtgg cgaattagcc atagaactca ccacaaaaat actggtaaca tcgataatga 4261 tgaaagctgg tatcctgtga gtgaatcaca gtacaaggtg atggattggg tggagaaagc 4321 gggtcgatat tacgtgtttt tattggctta tccggtatat ttgtttatgc gttctcctaa 4381 caaggaaggt tcgcactttt tccccagtag tcctctgttt aaaccatcgg aaaaatggga 4441 tgttattacc agcaccacac tctggacttt aatggttgct ttgcttggtt tcctcaccta 4501 tcagtggggc tggatgtggt tgttgaaata ctacgttgga ccttatcttg tctttatcat 4561 ttggctagat ttagtaacat ttttgcacca cactgagccg gatcttccct ggtatcgtgg 4621 agaagattgg acttttctca aaggtgctat ttctagcatt gaccgtgatt acggttttat 4681 caatcacatc catcatgata tcggtactca tgttgctcac cacatattcc tgaacattcc 4741 tcattacaat ttgaagaaag caactgaggc gattaaaccg attatgggtg actatttccg 4801 caaatctgat gaacccattt tgagtgcatt agtccgttcg tgcaatacct gtcattttgt 
4861 tccagatact ggaggaaaag tttactacac atctaataaa aaattagcga aaagctaata 4921 tgaagtgaag agtgagaagt tcatgaaaaa ctcctcactc ttcactctgt aactttggta 4981 cttctgctga gaaaaaattg cgttcaataa caaagaaact gcgtttttag ttagaggcag 5041 gatgtgtatt ttcttttata acgcagtgct ttccggaggc gaagtaccca tgagtttcaa 5101 ccaacgaaag tttgcaacaa gtgcttctcc aaatgttgga tacacactgt atggaacacc 5161 aaattctaca caaacttctg ccaatatagg agctattttc ggatagtgaa tatgacaaac 5221 atggggaaat aaatgatgca ccacctgata gttaagacca cccaaatacc aatttaaaaa 5281 ggaattcttt ggtgcaaaat caactgttgt tttgacttga gcaatcgccc attcatcatc 5341 aatatggttg gactcaggat gtggttgaag aaactctacc ggagcgagaa catgagctag 5401 catgaaaaca ttgcaaatca gcagaccata ggttatgtaa gcaatgctaa atcctacaat 5461 tacctgtagg ggagtatatc caacaacaag tggtattcct acaaagactc ctagccaaac 5521 caatttaccc accaacaata tgagtatttc taaaggtttg acaggaatta catgctcatg 5581 gtatctgcgc ttgaataaag cgagattgac atcagcaaca gaccaataaa acggaataat 5641 tgcatagact ataaaaatga agaaatgttg gaatttatga taccatttat gctccatgta 5701 tggagacata cgcactaaac catctccgtg tatttcgaca tcatgtccta atatattggt 5761 atatgtatga tgcaggaaat tatggcgaaa tcgccacaaa gaactggata aaccaatgac 5821 atcataagtt aaaccaaaga tagaatttac ccatttgtga ctagaataac caccatgatt 5881 ggcatcatgt cctacactaa aaccaacgcc agcaatacct aatccgagaa ggacacagcc 5941 aaatagcttc atccacatta tcggtggacc gaagagagta aatgtccaag cagaaatgac 6001 ccatcccaag ataattgcag tcttgaggta cattgctaga ttatcgcgct tcgcaatgtt 6061 ctcagattca aaataagcgt caactcgttt attgagttcc tttctaaaac caacgttctt 6121 ggcaaaactt actctcgttt gtgtttgtgt cataaaaact aatcgttcaa atgagctatg 6181 aattgacttg aagaagtttg ttatctggca agtatccaag tagcgttgct cagatagaaa 6241 cacacaagcc aaatcatctc attgtggcag attatgccaa ctatctacca gaaaaaaaga 6301 aaatttttcc aaataagcaa caaaccttgt gcgtatgatt aaaatcgtcc gtttccctga 6361 acaatatgct taacagtggt caggctttct aagctaatca atccgcgtcg atgacctttt 6421 tgattggaca tccctagaaa taccgcatct cctcgcgatg gattccggga aaaacgcgga 6481 gaagcattga tgtagctcga agcactattc actcccaggg caaactggcg actctcctgg 6541 
taagattcag taacaataca gtcagcatga ccgctgctgt attgattaat ccaggcgatc 6601 gcagcctcta gactatccac caatttaaaa gcgactgtct tcgttaaaaa agcgctcccc 6661 cactctgctt ctttagctaa ctgtagctga ggaaaggctt ctaccaattc ggcatctccc 6721 ttaatttcaa agcccttctc ctttaagcta ttccacaatg tcagtaaaga agatggcaag 6781 gcttgtcggt gaatgagtac tttttcaatt gcgttcaccg gatctggttc gctttgatgg 6841 ctatctataa tgagccaacg taccatttct aaactaccat tcggcgacca gtaaaggtag 6901 cagttgccca tagccgattt gagaactggg gcggtggact ggcgcgccac ttgctgtacc 6961 aagctagaac gtccgtaggg aatcactaaa ttcaagcatt ggtcttgcgt cactatatcc 7021 cgaatagaac ttccttcttc tgctgttatc agctgtaaac aacctggtgg taaaccaact 7081 tcagtcacag cactcaaaag tgcattggcg atcgccgcgt tagaatgact ggcttcagta 7141 ccgcctctga ggatcagact gttgccagtt ttgatacaca acgctgctgc gatcgctccc 7201 aattctggaa acgcctcata aataaacgca atcacaccca aaggcattaa ctgtgtataa 7261 gtctgagagt cgtctagttg atagtcagca tgtctgactc gccgtagtgg gtctgataat 7321 tcccccagcc gttgtaaaat atctactgcc atctccagcc gttttggagt cagcttcagc 7381 caatccagaa ttaaatctgg caccgccatt tcccgactcg cttctagatc caaggtgttt 7441 gcttctagaa tgtcgtcaaa tgaacgtgtc atcgcttgtg ccatcgctaa cacagcacga 7501 cttctgtcta ctccttttgt ggttcctagc tttagggcag cttgatgagc gcgtcttgcg 7561 cgctcaattg gttcggggct aacatcaaaa gcatccactg tcattcaatt atcgcctgta 7621 ggtgagccat accatcagtg ctggcaaaat agccaacagc accgccacca ttgcccaagc 7681 aataatgctg taaccatttg ccaatcgtag tgccaatggt aaccatataa tcactagcac 7741 gaaaataatc ccaagcgcta caggcaaata gctttctcct aatcgaggat gaacaacttg 7801 ccaactattc cctctccagc gccaagcccg cttgtaggga taagtggtgg aaagttgctc 7861 cagtacataa ccattatctt cgaccacgaa tatctgctga cagcgattgc aaccaaatgc 7921 ttctgtcagc gtaataggaa tcaactgccc ccgccgacga cagggacacg ggtattcact 7981 attgaagtca attttgtctg ctttttgagg ttgcacaagc gaacttttta cttgagcgtg 8041 tttgccaata gctgtgaaaa tagcatcccc tgatttacgt tttcaggcaa gctattttcc 8101 catgaacttt ctttataaca ttagccagtc tggcttactc ttttttctgg acagattttc 8161 ttgaaattca gtggtagtac aatcctatca ctgaactaca ctgataatag ccagatctca 8221 taactgaata 
agcacaaaga ctaaagtgaa ttttcttttc acttcaacct ttattcttca 8281 atatcgcctt aacgtcagta tcaattcaat atttttttac agtatgatac taaccactgt 8341 acagatcttt ttatagagta gcgcatctta catcattttt ttttcgctac acttaaattt 8401 tttataatca taattcatat atattggcat tttacagcag taatttgcca aagtcttatc 8461 tttatacatt gctttttatt ttcaaagcgg gtaacgcgat tcgaacgcgc gacattcacc 8521 ttggcaaggt gacgctctac cactgagcta tacccgcaaa ctccactctt actactattc 8581 catagtatgc attatttgtc aacccctttt atttgtttag tagttagtgg aaaaagttag 8641 ccactaacca ctaactacta atcactgact attctgcaga cagctgtcct acgctggcac 8701 ttttcagcga tgctgacgag agaatggcca gaaaaaggct ctgcgaggtt tgaacgtaat 8761 tgccgcatca gactagccat ttctaaagca ctcatggcgt aatcccaacc atgattgctt 8821 ttaatgccag cacgttctaa agcttgctgc atggagtcca ctgttaaaat gccaaaaacc 8881 acaggaacac cagtttgaaa cgcagcagca gcaattcctt tagcgacctc agaagaaacg 8941 taatcaaaat gaggcgtttg tcctcggatc acagcaccca gacaaataac cgcatcataa 9001 cgaccagaga ctgccaattg gcgagccaca accggaactt caaaactgcc tggtacgtaa 9061 atgtagtcta cctgagagcc gtgaggattg acatcaatac cgtggcgttt caagcaatct 9121 tgacaacctt ccaccagctt gttagttaca aggtcattga atcgaccaat aacaattcca 9181 aaccgcaaag gttctgcctg agtaaaagtt ccctcaaaaa ctgccataac cgcctctgat 9241 tcatctatac ttcttgccaa tatacattgg ttctccaaaa agtataggag cagagttatc 9301 gacaaatcca gcaaaaagaa gattgagttg accgaatcct gtgagggaga taggggaatc 9361 ccccggcgca tctcccacca cttacggcga gtacgcgcta tagtggggga cttacgccga 9421 atttgttaaa ctaccagaaa gtttaacgcc ccaacgacaa gcactaaaat gatccaagct 9481 cctgaaccaa tccacagcag ttttctagat tcattccagt tctgtggagt ggcataggca 9541 acaggaacgc caatgaccag aacaaaagac agcgaaacca gagctaataa agccagttgg 9601 aatataaaag tcattttccc ttctcccgaa acagcaaacg agttacaaaa gataaaatat 9661 tgtgcaggtg aacaccgaca gtttttcctt atttgttaag ctaacagaaa attgacacat 9721 tttagtcatt tgtccttgat cctttgtgct gacgattaat gactcatgac ccatgactga 9781 taactatgga tttaattctg tgccatacaa cagctgattt tgactcacta ggagccgccg 9841 taggactgac acgcctgtta cctggaagca agattgtgtt gagtggcggt tctcatccaa 9901 ccgttggaga ttttttggca 
ctacatcggg atgagtatga gctgatggaa cgacgttcgg 9961 ttaatcccaa cgaaattcgt tctttaattg tggtggatac ccaacagcgc gatcgcatcg 10021 gcaaagctgc ccaatggtta gatctacccc atattaataa aattatcatt tatgaccatc 10081 accaagaaca acaaagcgac attcccgcaa cagagataaa tatttcccct gttggagcga 10141 caacaactgt gattgtcgag caattgcaaa aacaacaaat ttccctaaca accgcagaag 10201 caactgtgat ggcattgggt atccatgtgg atactggttc tttgacattt gactatacaa 10261 caccacggga tgctctggct ttggcttggt tgatgcaaca aggggcaagt ttgtcagtcg 10321 tgagtcagta cgttgatcct ggtctttctc cccaattgca acagttgttg aaggtgtcgc 10381 tggaaaaaat acaacgtata tatgtacgag gatatcagat tgcctgggtc aatttgaaaa 10441 cagatgcttt cgtgccagga ttgtcaagtc ttgcctcgca gttgatggag ttaacagaaa 10501 tagatgcctt actgcttttt catgagtatg gtttagcaga agatgaatca cgcttgacag 10561 tgattgggcg atgctccaga ggagccgccc ctttggggcg atcgcacatt caaggtatta 10621 atctcaatga gttattccag ccactgggtg gcggtggaca ttcccaagca gcatcgctga 10681 acttaagagg ggtagactca gaagaaattt taaaccaact tcttgagggg ataaaagcaa 10741 caattcccca tcctcctatt gcccgagact tgatgtcctc gccagttcgc accattcgcc 10801 cagaaaccac aattgcggaa gcacagcgaa ctttgttgcg ctacggacat tctggtttgt 10861 gtgtagccga tgctcaagga caactactag gtattatttc tcgacgagat ttagatcttg 10921 ccctgcatca cggctttagt catgctcccg ttaagggtta tatgacagtc aatatgagga 10981 caattacccc agagacgaca ctgccagaaa tcgagtcagt catggtgaca tacgatattg 11041 gacgcttgcc agtattggag aacgggcagc tggtgggaat tgtcacccgt actgatgttt 11101 tgcgggaatt gcatcaaggg agagaggagg agcaggggag cagaggagca ggggagcaga 11161 ggagcagggg agaaagccat gttaagttgc ctagcttggc tgaattgcat tcacggctag 11221 caccccagtt gtggcaattg cttaccaaag cttcacgaga agcagaaaaa cgtggttggc 11281 atctttacct tgttggagga gcagtacgcg acttgctact tgctgaagaa gcttctggca 11341 ctttgttgat taaagatatt gacttagtcg ttgatggttt tcacaaagcg gcggatgtgg 11401 gtgcgggtgt ggaactggca aaagcactac agcaacttta cccaacagca cgtttagaaa 11461 ttcacggggc ttttcaaact gcggctttgc tgtggcataa agacccagaa ttagattcat 11521 tgtgggttga cattgcgact gccagaacag aattctatcc ttatccggca gcaaatccag 11581 
aagttgaggc gagttcgatt cgtcaagatt tgtatcgtcg agattttacc atcaacgcaa 11641 tggcactgcg actgacgcct ccccgcgctg gtgcattact cgactttttt ggcgggttgc 11701 tagatttgca ggcaaagcaa attcgggtgt tgcacgctaa cagctttatc gaagatccta 11761 ctcgcattta tcgcggtgtg cgctttgcga tgcgactcgg atttcactta gagcctcaga 11821 cagaagagta tatccgctac gctatcacga gcggcgttta cgatcgcacc acccgagaaa 11881 atagcaaaac tccagcactg caaactcgac tgaaaacgga attaaaacat attctagaag 11941 ccccctactg gaaaccagct ttgcaattac tggcagagtt gggagctttg cagtgcatcc 12001 atcccaccct tgggctagat gaggaacttc tggggcaatt gcgtttgctg gaacgttgct 12061 tacggcggtt cgactcccag caaaccctta tccactggca aatgcgcttg gaagctctca 12121 ttgcccacat agcagcagaa tatcgagggc gcgtggcaag gaatctgcta ttgcccgaag 12181 atagtatcaa taggctagaa gtattagcac aagctcaagc taatgtcaac aaaattctcc 12241 cttctttgca agcacacagg gaagtggatg aggtgcgtcc gagtcaaatt caacagttat 12301 tgcgccagta tgacgtgcca atgctgattt taatagctgt gcaaagttca agaggtgtta 12361 ggcggaagat ttggcaatat ttaactgttt ggcgtaacgt tcagcctatt ttgaatggga 12421 atgatttgag gaaactggga tataaacctg gaccgcagta tcggcaaatg ctcgatgatt 12481 tgctggcggc gacgttggat ggagttgttt gcaatcggga tgaggctgag gagtttttaa 12541 ataaacatta tcctcaataa agttttgcaa cattttatga actttccata acgagtacta 12601 agatttcttt tcaatttgac ctctggcgct cgcaaagtca gagaaaactt tacaataact 12661 actgaaaaaa cttaacatca tcaactgagg tcaggtctat gatccgcgcc tacgcagctt 12721 ttcaacaagg cgaagaactg aagccttttg agtacgaacc aggaccactt ggcagtgagg 12781 atgtggagat tgatgtggaa tactgcggga tttgccatag cgacctcagt atgctcaaga 12841 atgattgggg aatgagtcag tacccgattg ttcccggaca tgaagtggtt ggtacaatcg 12901 caaatgttgg cgatgcggtg aaaaaactcc aagtcgggca acgggtaggg ctgggttgga 12961 attcgcgatc gtgtatg // LOCUS NODE_2616_length_12949_cov_5.42236712949 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12949) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12949) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12949 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(710..2320) /locus_tag="DP116_21090" CDS complement(710..2320) /locus_tag="DP116_21090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013257631.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="PRJNA477356:DP116_21090" /translation="MYSQAVDRSILTPLTFIQRNAKVYSQKVAIIYNQKRFTYGEFAN RINCLASALRHAGLEKGDRVAFFCFNTPPMLEAHFAVPLAGGILVSINTRLTSQEVVY ILNDCGAKFLFVDTELANIIRPVQNNLETVKHIINIDDLEGFEPLDGEDYEAFLQTGK PTPLSWIITDEMETISINYTSGTSGKPKGVMYSHRGAYINSLGEIIETTLTPNSVYLW TLPMFHCNGWCFTWGVTAIGGTHICLRKFNPGNIWKLIQQKEVTHLNAAPAILISLFN HPNCPQLLEKPLTITTAGAPPSPTLIEKITKIGAKIIHVYGLTEVYGPYTVCEYQSDW DNLMIAEKAKLMARQGVPYIGADGLRVVDQNMNDVPADGQTMGEVVMRGNMVMTGYYN DPEVTERVFRGGWFHSGDLAVMHPDGYIEVRDRMKDIIITSGENVSSIEVEQCLYRHQ AVLECAVIAVPNARRGEVPKAFVTLKERAQVTEQELIQFCRNQIAAFKCPTKIEFTTL PKTSTGKIQKYLLREKEWIGYEKRIYGG" gene complement(2436..2882) /locus_tag="DP116_21095" CDS complement(2436..2882) /locus_tag="DP116_21095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127023.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21095" /translation="MFLHRTTFIIAATMIALGSSIALAKPNLLLPQVVAQNPTPTKRQ QPGQSGSLKDLNLTPQQMQQIKTIRTQSKDQIAQKRQAIRQAQQELKAIMAGTASKDQ VRDKYNQIRTLRQELADVQFDNTLAIRDVLNPEQRQKFAEQMHNKR" gene complement(3126..4082) /locus_tag="DP116_21100" CDS complement(3126..4082) /locus_tag="DP116_21100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878907.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty acid desaturase" /protein_id="PRJNA477356:DP116_21100" /translation="MSSDAMTQPELLTSSADKPHNPRQILSVRELSALNTRSNYKGLV QLAFHLTVTGCSGYLWVTNFGNWSLAIPALIIYGFSIAAMFAPMHECVHRNVFTNNSV NDAVAWCAGLLSFYNSTFFRYYHKWHHLYTRVPDKDPELTDPKPSNLGKYLLIISGLP WWEGKIRGHFRAAIGQLDDCPFIPETARGEVIRSTRLQLAVYAAGFILSIAVRQPWFI VYWLLPLLVGQPILRFILLAEHTGCTLDANLLTNTRTTLTLLPVRFFMWNMPFHAEHH LYPSIPFHALPKAHLQLSSHFTHIEPGYIKVNRDIITKYTTP" gene complement(4304..5077) /locus_tag="DP116_21105" CDS complement(4304..5077) /locus_tag="DP116_21105" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="ferredoxin reductase" /protein_id="PRJNA477356:DP116_21105" /translation="MLDQIAPLIAGTSDALSTVANLQTLFSENVGLGDESRIHNLLDE YLDLETLQKGLPLYVSVYQSQGSLNDFFSAIIASVNLINTQHSEFLHIQSLERTQQRV AILASAAIPLLFKAEQISVNETNYFYSDGGQGGVKNVQGNTPITPLLELANCTHIIVN HLSDGSLWDRNTFSNAFPNTTVLEIRPGRVIKPQGLKDLIGFDPNNIQSWIEQGYEDT QRCYESVARALHIHEIANQARKQRDEAIDELDNDNFHID" gene complement(5276..5479) /locus_tag="DP116_21110" CDS complement(5276..5479) /locus_tag="DP116_21110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_000102620.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21110" /translation="MTTKPYRVGLVLSGGGARGAYHVGVIKYLADMNISIDAVAGASI GALNGAVVAAAPNLQEASTRLSK" gene 5811..6185 /locus_tag="DP116_21115" CDS 5811..6185 /locus_tag="DP116_21115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872962.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF433 domain-containing protein" /protein_id="PRJNA477356:DP116_21115" /translation="MTPASNGQAVIIRTERGLTIAGTRITLYDLMDFITAGYPHSFIR YQFSVTDEQFNAAMSYIEANRAEVETEYQTVLKKVEENQKYWEERNRERFARIAKMPP PPGMEVAWEKLQKAKARLDSKA" gene 6182..6544 /locus_tag="DP116_21120" CDS 6182..6544 /locus_tag="DP116_21120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ACP S-malonyltransferase" /protein_id="PRJNA477356:DP116_21120" /translation="MIFLIDHNLERQSVILLGNIASQGWLDIIPIRFVTFEQVALPVD SDDRLVWRFAQQNKMILLTANRNMKDEDSLEQVMREENSPTSLPVLTIGKVDRLYDPD YRERCANRNTVQLRIDPP" gene complement(6694..7476) /gene="xth" /locus_tag="DP116_21125" CDS complement(6694..7476) /gene="xth" /locus_tag="DP116_21125" /EC_number="3.1.11.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876175.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="exodeoxyribonuclease III" /protein_id="PRJNA477356:DP116_21125" /translation="MKIATWNVNSIRTRLGHVIDWLRVSDVDILCMQEIKVVDAQFPC APFEELGYNLYLSGQKAYNGVAIASRQPLEDVSCGFTPTLLNIQPDWDEQKRVITGKI DGIRIVNLYVPNGSAVGSEKYEYKLGWLTVLQEYLRSLLLSQAAICVCGDFNIALEDK DIHQSAKSENHIMASEPERQALRDILLLGFADAFRKFTSEGGHYTWWDYRAASFRRNL GWRIDHHYLTPALYERAKSCTIDVAPRKLPQPSDHAPVIVEF" gene 7847..8914 /locus_tag="DP116_21130" CDS 7847..8914 /locus_tag="DP116_21130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_21130" /translation="MKIAQVAPLWERVPPPTYGGIELVVSRLTDELVRRGHDVTLFAS GDSQTLAKLEAVYPRALRLEPEVKEYAAYEMLELSQVYQRAQEFDIIHSHVGITALPL ASFVQTSTVHTVHSSFTPDNTNIYTHHKQQAYVSISNAQREINLNYVNTVYNGINLED YPFVAQHQEPPYLAFLGRFAPEKGPHHAIAIAKKAGWHLKMAGKVDTVDSKFFEQEIA PHIDGEQIQYLSEINHAAKVELLGNAAITLFPIGWQEPFGLVMAESMATGTPVIAMSL GSVPEVIAHGVSGFVCQSYDEMATMIPAALELNRQTCREHVENKFSVSQMVDGYEAVY RKIIQGRIHSNGRIHDAKIKV" gene 9054..9239 /locus_tag="DP116_21135" CDS 9054..9239 /locus_tag="DP116_21135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198678.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21135" /translation="MSMRTNDCPCCGGSLLRHIRHGELYWFCKTCWQEVPVLTVSQMP NLEGRTKASVAQTAVKA" gene complement(9576..11219) /locus_tag="DP116_21140" CDS complement(9576..11219) /locus_tag="DP116_21140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749482.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_21140" /translation="MQPSDLDPKILPLSQSYARKHNKASMPKIANNPKIPRSLPSHVN FHEQAKDVSGNHKNCTSEIPKTDVTKHKSENQLNCSETNGKNHNLEHSTQESTAIVAK ITDVTEASGSSPEAASDDTQEQGFLPILKNANFLALWGGQVFCQLADKVYLVLMIAII NTQFQTSEQSISGWVSTLMMAFTIPAVLFGSLAGVFVDRWSKKVVLVATNIWRGILVL AIPPMLWLTHDWEPVAGMLPVGFAILLGVTFLVSTLTQFFAPAEQAAIPLVVKEQHLL SANSLYTTTMMASVIVGFAVGEPLLGIADQLWSQLGGSHGLGKELVVGCSYAIAGLVL LLLVTHEKPHQSEKESPHVFADLQDGFRYLKENYRLRNALVQLIILFSIFAALTVLAV RMAELIPNLKASQFGFLLAAGGVGIALGATVLGLFGQRFSYTQLSLCGCMGMAASLIG LSIFTKQLWLALLLITLLGVFGALVGIPMQTAIQTETPPEMRGKVFGLQNNVINIALS LPLALTGVAETFVGLRTVFLVLAAIVFSGGILTCYTSSK" gene complement(11206..12141) /locus_tag="DP116_21145" CDS complement(11206..12141) /locus_tag="DP116_21145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA repair protein RecO" /protein_id="PRJNA477356:DP116_21145" /translation="MSKTYKATGINLKAQALGESDRLVTILTTEFGLIRAVAPGSRKH NSSLGGRSGMFVVNELLIAKGRSLDKITQAQTIKSYPGLAKDLGKLGASQYLAEIVLS VALSEQPQEELYELLNEHLNRLEALPNKETLNVLAHLAHGIFHLLALAGLTPQVQICC ITGRSIIPDFTDPTWQVGFSIPTGGTICLSAWERLKEEKRKQQDFSTSLPHYASSSTS SAIPATVKEKPANYQTVIHRQEIPVISSRLNATELAILQHLSQPEIIQDVTKNDSWLS VEQILRSYTQYYIGRPIRSAALIDSYFAANYDATV" gene complement(12208..12888) /gene="deoC" /locus_tag="DP116_21150" CDS complement(12208..12888) /gene="deoC" /locus_tag="DP116_21150" /EC_number="4.1.2.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653609.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="deoxyribose-phosphate aldolase" /protein_id="PRJNA477356:DP116_21150" /translation="MAADYTDIDIAPFIDHALLIPTATSKQVEQWCEEADRFHFAAVC VYPTYVKQAAELLHGKKPKVCTVIGFPTGATTSATKLYEAQEAVENGATELDVVINLG WLKAGNTDNVHQEIAEICEETGQTVKVILETNLLTDAEKRLAAEICMDAGAAFLKTST GWNGAVTVEDVRLLKEVTQDRVGIKASGGIRTIDQALDLIMAGATRLGTSRSIDLLHQ RHKLDKSE" BASE COUNT 3734 a 2738 c 2702 g 3775 t ORIGIN 1 gtttgggagt gaggggttgg gtgagcgtcc caggacgtct tctctcgcgt cgagacttgt 61 aaagtgtaac atctagagct tacagacttt tattaagatt tatatacgag ttgttttcca 121 cgaagtcttg cgggttctgg cttcccgata cctggcactg gggtactcct gctcgtacac 181 tatttttatg atgagttttg tagtacgaca cagacaaact gcgggttata gcagttggcg 241 aagagaaatt tttctgcaat tttctctatt tcttcgttat tggtttccca gtcattgcaa 301 ggaagtttca tttgttcagg agagaacgat aagcctccgg cttgacgcaa agcgtatcgc 361 acttcttgga aaagttagcg atacgcgcct tagtgcgtct gttgctcctt tgcgatggta 421 caaagtgccc gctggggtgc gataagcgaa gcttaacgcc tccggcgtat cgctctacac 481 aaaatactta agatcaatac ggttcagtta gagccaaaaa ccttacactg ggtaggttgg 541 ggaggacact tctgtgcggg ggttcccccc gttgaagaaa gtgtccgttt aagcgtagcg 601 caacccaaca agaaacgttt tgtgttgggt ttccttacgt caacccaacc taccttatgc 661 ttaactgaac cgttttgtac ttaagatagt taagttgatg aactcgcaac taaccgccgt 721 aaatcctctt ttcataacct atccattctt tctcccgtaa cagatacttt tgaattttac 781 cagtgctagt ttttggcaga gtcgtaaatt caattttagt agggcattta aaagcagcaa 841 tttgattacg gcaaaattgg attaattctt gttctgtaac ttgcgccctt tctttgagag 901 taacaaaggc ttttggcact tcaccacgcc tagcattcgg tacagcaata actgcacact 961 ctaacacggc ttgatggcgg taaagacatt gttcaacttc tatactggag acattttccc 1021 cgctagtaat gatgatatct ttcatacggt cgcgcacctc aatataaccg tcagggtgca 1081 ttacagctaa atcaccacta tgaaaccatc ctcctctaaa tactctttca gtcacttctg 1141 ggtcattata atagcctgtc atcaccatat ttcctcgcat caccacctct cccattgttt 1201 gtccatccgc tgggacatca ttcatatttt gatctacaac tcgcaaacca tctgcaccga 1261 tataaggtac gccttgacgt gccatgagtt ttgctttttc tgcaatcatt aaattgtccc 1321 aatctgattg atattcgcaa actgtataag gaccataaac 
ttcagtcaaa ccataaacgt 1381 gaataatttt tgcgccaatt ttagtaattt tttcaatcaa tgttggcgat ggtggtgcgc 1441 ctgctgtagt gatagttaga ggtttttcta ataattgcgg acaatttgga tgattgaaca 1501 gagaaattaa aattgcaggt gcagcattta aatgagtgac ttctttttgt tgaattaact 1561 tccagatatt accggggtta aacttacgta aacaaatatg agtaccacca atagcagtta 1621 ccccccaagt aaaacaccag ccattgcagt gaaacatggg caaagtccag agataaacag 1681 agttaggtgt tagtgttgtt tcaatgattt ctcctaaaga gttaatatat gcacctcgat 1741 gagagtacat cacacctttg ggtttaccag aagttccact agtgtaatta atcgaaattg 1801 tctccatttc atctgtaatt atccaagata aaggagtcgg tttaccagtt tggagaaatg 1861 cttcataatc ttcaccatct aaaggttcaa aaccttctaa atcatcaata ttgattatat 1921 gtttaacagt ttctagatta ttttgaacag gtcgaattat atttgctaac tctgtatcaa 1981 caaagagaaa tttcgcaccg caatcattca agatataaac aacttcttgg gaggttaaac 2041 gagtattaat agatactaaa ataccacctg ctaaaggcac agcaaaatgg gcttctagca 2101 tgggtggagt attgaaacaa aagaaagcga cgcgatcgcc tttttcaagt ccagcatgac 2161 gcagtgctga ggctagacaa tttatccgat ttgcaaattc gccgtaagtg aagcgttttt 2221 gattatatat aatcgcaact ttctgactat agactttagc attgcgttga ataaaagtta 2281 gaggtgtgag aatactgcga tcaactgctt gagagtacat agctaatgct aacagtgaat 2341 attcactgca tgagcatacc atacagtcat gcatttcttg tatttacata caatccagta 2401 cttgggtttc atactcgtgc tgctttggaa gtttactacc gtttgttgtg catttgctca 2461 gcaaacttct gccgttgttc tgggttcaaa acgtctcgaa ttgcaagtgt attgtcaaac 2521 tgaacatcag caagttcctg cctgagtgtt ctgatttggt tatatttatc gcgcacttgg 2581 tctttggatg ctgtccctgc catgatagct ttcaattcct gctgcgcttg acgaattgcc 2641 tgtctttttt gggcaatttg gtctttggat tgagtacgaa tcgttttaat ttgttgcatc 2701 tgctgaggtg tcaggtttaa atccttcaac gatccagatt gaccaggttg ctgacgcttt 2761 gtaggtgtgg ggttttgagc aacgacttga ggtaacagta aatttggctt ggcaagagcg 2821 atactactac caagagcaat cattgtagct gcaattatga aagtagtacg atggagaaac 2881 ataaaacttg tcttctgtaa ttagtaggtg aatagaacgt actcacgacg cacacatgcg 2941 atctgccgct atgctgcagc gaagctatcg cctgtaagga tgattggctt tgctcaattg 3001 cgacttcaaa gattcagtct tgctttgtgc ctgaaagggt tgggaattta 
attttcgtag 3061 gttggggagg acagtgcccg aagggcagtg gctccccaac ctacagttaa aattttaacc 3121 cttatttatg gtgtggtgta cttagtaata atatccctgt tgacttttat atagcccggc 3181 tcaatgtggg taaagtgtga acttaactgt agatgggctt taggcagtgc gtggaaggga 3241 attgatgggt ataaatggtg ctctgcatga aatggcatat tccacataaa aaatcgcaca 3301 ggcaagagcg tcagtgttgt gcgtgtattt gtgagcaaat tagcgtcaag agtgcaacct 3361 gtatgttctg ccagtagaat aaaacgcaga atcggctgac ccactaagag cggtagtagc 3421 caataaacga taaaccaagg ctgtctgact gcaattgaga ggataaaccc agctgcataa 3481 acagctaatt gcaatcgagt ggagcgaatc acttcacctc gtgctgtttc tggtataaat 3541 ggacaatcgt caagttgacc gatcgcagcg cgaaaatgcc cacgtatctt tccttcccac 3601 cagggtaaac cactaattat taacaaatat ttacccagat tgctcggctt aggatcagtg 3661 agttccggat ctttgtcagg aacgcgagtg tagaggtggt gccacttgtg gtagtaacgg 3721 aagaaggtac tgttgtaaaa tgagagcaag cccgcacacc aagcaacagc atcattcaca 3781 gaattgttag tgaagacgtt tctgtgaacg cattcgtgca taggtgcaaa catggcagcg 3841 atactaaagc cgtaaattat cagtgcaggt attgctagcg accaattgcc aaagtttgtt 3901 acccacaggt agccactgca tccggtgaca gttagatgaa aagccagttg aaccagccct 3961 ttgtagttag agcgcgtatt caaagcactt aattctcgta cactcagaat ctggcgggga 4021 ttgtggggct tatcggctga ggaagtcaat aattctggtt gagtcattgc atcactagac 4081 ataatgatta taattaatcg taattaaacg tacgatacaa taaattttta tcaagcgttg 4141 gtaaattgaa ccctctccgt gtctttagac cgcagagggg taagacacat caagtgaaat 4201 ttattgccca aatttttgga gtcgctgtat aatcaaagct tgccaaatag cacgtttctc 4261 tcaagaaatt tttgcaacac aacagtcgca atgcctgtta ggtttagtca atatgaaaat 4321 tatcattgtc taattcatct attgcttcat ctcgttgctt tctagcctga tttgctattt 4381 catgaatatg gagagcacga gcaacgcttt cataacaacg ttgagtatct tcataacctt 4441 gttcaatcca agattgaata ttattgggat caaatcctat taaatctttt agaccttgag 4501 gttttataac tcttcctgga cgaatttcta aaacagtggt gtttggaaaa gcgtttgaaa 4561 aagtattacg atcccataaa ctgccatcac ttagatgatt gacaataata tgggtacaat 4621 ttgctaattc aagcaatgga gtgatgggag tattgccttg gacatttttt acacctcctt 4681 gaccaccatc cgaatagaaa tagttcgttt cgttaaccga aatctgttca gctttaaata 
4741 acagaggaat agcagctgaa gcaagaatgg ctacgcgttg ttgagttcgc tccagggatt 4801 gaatatgtaa aaattcagaa tgttgagtat taattaagtt tacggaagct ataattgcag 4861 aaaagaagtc atttaagcta ccttgacttt gatatacgga aacatataaa ggtaaacctt 4921 tttgaagtgt ttctaagtct aaatactcgt ctaacaaatt atggatacga ctttcgtccc 4981 ccaaacctac attttctgaa aatagagtct gaagattagc tacagttgat aatgcatcac 5041 ttgttccagc tattagtgga gcaatttgat caagcaatat ctgacaaatg tatttaatta 5101 tttttattct gttaatcttg ataggactat tagcagcaag tgagttccaa atttcttcta 5161 agtaggtggg cgctaaaaaa cacaactaaa ttaagaaatg taaatcgctt aaaagcttat 5221 ttttaaaagg tttttggcgc tttacataag ttaacatagt tcggtttaat cgtgcctact 5281 tacttaaacg tgttgatgct tcctggagat tgggtgctgc tgctacaact gcaccattta 5341 aagcaccgat actagcacca gcgacagcat caattgaaat gttcatgtca gctaggtatt 5401 taatcacgcc tacatgatat gcacctctag caccaccgcc agataaaacc aagcctactc 5461 gataaggttt agtagtcatt gttgtgctgt attcgttttt tgttttcata tatcacatga 5521 atgtccttct ggctgaagcc aaaatcccat acgacggcgt taagcgtagc tctgccgtag 5581 gcaatcgcct aatcttaata cgcatcattc attcttgacc cataagttca gtacaagaat 5641 taacttcata cctctgcttt ttccaaattt tcagcgtaat ttctaattga caaaactaag 5701 attatcttaa actctcaccg atgagtacat attcatttgc gtttatcggt agcatcgaac 5761 ttgctttttt cataaggtaa tatgaaagca atatttgaga gggagtgaat atgactccag 5821 catctaatgg acaagcagtt atcattcgta cagagcgtgg actaacgatc gcaggcactc 5881 ggatcacgct ctatgacctc atggacttca tcacagctgg ctatccacat tcgttcattc 5941 gttatcagtt ctctgttacc gacgaacaat ttaatgcagc tatgtcctat attgaagcaa 6001 atcgtgctga agtagaaaca gagtatcaaa ctgttttgaa aaaggtagag gaaaatcaga 6061 agtattggga agaacgcaac cgtgagcgtt ttgcccgaat tgcaaaaatg ccaccgccgc 6121 ccggaatgga agttgcttgg gaaaaacttc aaaaagcgaa agccagactt gattctaaag 6181 catgattttt ctgattgacc ataaccttga gaggcagtct gttattctct tgggcaacat 6241 cgctagtcaa ggttggcttg atataatccc tattcgcttt gttacttttg agcaggtagc 6301 gttaccagtt gacagtgatg atcgccttgt ctggcggttt gcccaacaga acaagatgat 6361 tttactgact gctaatcgaa atatgaagga cgaagactcc ttagaacaag tgatgcgtga 6421 
agaaaattca ccgacttcac tacctgtact tacgattggg aaggttgatc gcctttatga 6481 tcctgactat agagaacgct gcgccaaccg caatacagtt cagttaagga ttgatcctcc 6541 ctagccctcc ttaaaaagga gggaactaag cccccttttt aagggggttg ggggatctta 6601 acaaaagtaa acaccactaa ctgaaccgta ttgccttata cccttatacc cttataccct 6661 tataccctta cacccctagt tgttgactaa attctaaaat tcgacaatca ctggtgcatg 6721 atcgctgggt tgaggtaact ttctgggagc gacatcaata gtgcagcttt tggcacgctc 6781 atacaaagca ggcgtaagat agtgatggtc aattcgccaa cccaaattac ggcgaaaaga 6841 tgctgctcga tagtcccacc aggtgtagtg tcccccttcc gaggtaaatt tgcgaaaagc 6901 atcagcaaat cctaataaca ggatatctcg taacgcttgg cgctctggtt ctgatgccat 6961 gatgtgattt tcgctctttg cactctgatg aatatcttta tcttccagag caatgttaaa 7021 atctccacac acacaaatag cagcttgtga taaaagaagc gatcgcaaat actcctgcag 7081 cactgttagc cagcccagct tgtattcata cttctcgctt cccactgctg aaccattggg 7141 tacgtaaaga ttaacaattc ttataccatc tattttaccg gtaattaccc gtttttgctc 7201 atcccaatct ggttggatgt ttaataaagt cggcgtaaaa ccacaactga catcctctag 7261 tggttgacgg ctggctatag caacaccgtt gtaagctttt tgccctgata gataaaggtt 7321 atatcccaat tcctcgaaag gtgcacaggg aaattgtgcg tctacaactt taatttcttg 7381 catgcacagg atatcaacgt cactcaccct cagccaatca atgacgtgcc ctaaacgagt 7441 gcgaatcgag ttgacattcc aagtagcaat tttcatctta gtcagtcgtc attctccacg 7501 tgtttgcgtc ctctgatcac ccatctcctc ctcctgcaac atttttagaa ttgatcccgt 7561 ttcggggttt tagtactttt cgttacaaaa attttaagcc ttaatgagcc ccaagataga 7621 tgaatcgtta tctcagcaaa cagtaatctt atagtatact cccgctatag gaagaatggc 7681 tagtatgaaa ttagccaact actatatcag ggtagggtta accgatattg cactcgcgtg 7741 agtcagtcag gttaatctaa gaaacagatg attaagtaca aatgctctat catcagatat 7801 aggtcctagt aataggactg tcttacttgc tgaaccaaaa ctacctatga aaatcgctca 7861 agtagccccc ttgtgggaac gagttccacc tccaacatat ggaggaatag agctagtggt 7921 gagtcgcttg acagatgaac tagtgcgtcg tggtcacgat gtcactctgt tcgcatctgg 7981 cgattcccaa acattggcga agttagaagc agtttatcct cgtgcattgc gcttagaacc 8041 ggaggtcaaa gaatacgcag cgtacgaaat gctggaactg agccaggttt accagagagc 8101 acaggaattc 
gatatcattc attcccacgt ggggataacg gcattacctt tggcgagttt 8161 tgtacaaaca tccactgtgc atactgtgca cagcagtttt acacctgata acactaacat 8221 atacacccac cataagcagc aagcatacgt cagcattagt aacgcgcaac gcgaaataaa 8281 cctgaactac gttaacacag tctacaacgg aattaatcta gaggattatc cttttgttgc 8341 ccaacatcaa gaaccgccat acttggcatt cttaggacgc ttcgcgccag aaaagggacc 8401 gcaccatgcg atcgccattg cgaaaaaagc tggctggcat ctaaagatgg caggaaaggt 8461 tgatacagta gactctaagt tctttgaaca agagattgcc ccacatattg atggagagca 8521 aattcaatac ctaagtgaaa ttaaccacgc cgcgaaagtt gaactgctgg gtaatgctgc 8581 gatcactctc ttcccaatag gctggcaaga accctttggt ttggtgatgg ctgaatcaat 8641 ggcaactggg acaccagtta ttgccatgag tttaggttcc gtaccagagg ttattgccca 8701 cggtgtctca ggtttcgtct gtcaaagtta cgatgaaatg gcgacaatga ttccagctgc 8761 tttggaactg aatcgtcaaa cttgtcgaga acatgtagag aacaagttta gcgtgagcca 8821 aatggttgat gggtacgagg cggtgtatag aaaaatcata caaggccgca ttcattcaaa 8881 cggtcgcatt catgatgcaa aaatcaaagt ttaattaacg attttttaag ctgagcggct 8941 tgccgtgact tgccccgagc gaagccgaag tggcgtagtc gaagggagtc gaagcttgaa 9001 cgcatctact acatctataa aatcaattgc tttttaataa ggaggacaac gatatgagta 9061 tgagaaccaa cgattgcccg tgttgtggcg gttccctcct acgtcatatt cgtcatggtg 9121 aactctactg gttttgcaaa acctgctggc aagaagtgcc agtattaacc gtgagtcaga 9181 tgccgaactt ggaaggaaga actaaggcat ctgttgctca aacagcagtc aaagcttagt 9241 gtcagaagaa cagtgaacag tgaacagtaa acagtgagtc cagcgctgca ggcgggtttc 9301 ccgccgtagg cgactggcga acccgaaggg tgaacaaggg gtggacgagt ccgtttccta 9361 cttggtaact gataactgat aactgataac tggcaactgc tataacttgt aacaacattt 9421 tctagacgct acgtataacg cctagctaca actcgggttg acgcccagca agtctcgtga 9481 gaattcagta ctcaggtaag tctaccataa caacagaact tttcagtgaa taatcggttg 9541 taaaaagctg aatattaact ctttttatct gatagctact tagaagaggt ataacaggtc 9601 aatataccac ctgaaaagac aatcgcagct aataccaaaa acactgtccg taatccaaca 9661 aaggtttctg ctacgcctgt tagtgctaag ggtaaggaaa gagcaatgtt aatgacgttg 9721 ttttgtagac caaagacttt cccacgcatt tctggcgggg tttctgtttg gatagctgtc 9781 tgcataggaa tacccaccaa 
cgccccaaag acacctaaca gtgttattaa aagcagggcg 9841 agccataatt gtttagtaaa tatcgacaaa ccgataagag atgctgccat acccatacat 9901 ccacacagac ttaattgagt ataggagaag cgctgaccaa atagacctaa aacagttgcg 9961 ccaagagcaa taccaacacc tccggctgct aataaaaagc caaattggga agcttttaag 10021 ttaggaatta attctgccat gcgaacggct aaaacggtca atgctgcaaa gattgagaac 10081 aaaattatca gttggacaag cgcattgcgc aggcgataat tttctttgag ataacggaaa 10141 ccatcttgca agtcagcaaa aacgtgagga gattcttttt cgctttggtg aggtttttca 10201 tgagtgacaa gcagcaataa aacgagtccg gcgatcgcat aactacaacc cactacaagt 10261 tctttgccaa gtccatgact accacctaat tgcgaccaca gttgatctgc tatccctaac 10321 aatggttccc caacagcaaa cccaactatc actgacgcca tcattgttgt tgtgtatagt 10381 gagttggctg aaagtaaatg ctgttctttc accaccaaag gaatagccgc ctgttctgct 10441 ggtgcaaaaa actgcgtcag cgtagaaacc aagaaagtca ctcccaaaag aatagcaaaa 10501 cctactggta acattcccgc gactggttcc caatcatgag tgagccacag cattggtgga 10561 attgccaaaa ccagtatacc gcgccaaata tttgttgcta ctagcaccac ctttttcgac 10621 cagcgatcta caaatacacc agccagggaa ccaaaaagga ctgcgggaat ggtaaaagcc 10681 atcatcagtg ttgaaaccca accactaata ctctgctcgc ttgtttgaaa ctgagtatta 10741 ataatggcaa tcatcaaaac cagatacact ttatctgcca gttgacaaaa gacttgacca 10801 ccccaaagag ccaaaaagtt agcgtttttt aagatcggca aaaatccttg ctcttgtgta 10861 tcatctgatg ctgcttctgg actcgaacca gaagcctctg ttacatctgt tatttttgct 10921 acaattgctg ttgactcttg cgtgctatgt tctaagttat gatttttgcc atttgtctca 10981 ctacaattca gttgattttc tgatttgtgt ttggtgacat cagtttttgg gatttctgat 11041 gtacaatttt tatgatttcc agaaacgtct tttgcctgct cgtggaagtt aacatgactg 11101 ggtaaagacc ttggaatctt tgggttattg gcgattttag gcattgatgc cttattgtgt 11161 ttcctagcgt aactttgtga taacggcagg atttttggat ccaaatcaga cggttgcatc 11221 atagttggca gcaaaataag aatctattaa agcagcagag cggatagggc gaccaatgta 11281 atactgagtg taggagcgca aaatttgctc aacggataac cagctatcat ttttagttac 11341 atcttggatg atctctggtt gcgacagatg ctggagtata gcaagctctg ttgcattcaa 11401 gcggcttgat ataactggta tctcctgccg atgaataact gtttggtagt tagcaggctt 11461 
ctctttcaca gtagcaggaa tggcggaaga agttgaggaa ctggcgtaat gaggtaggga 11521 ggtagaaaaa tcttgttgtt ttcttttctc ctccttcaga cgttcccaag cactcagaca 11581 aattgttcca cctgttggta tactaaatcc tacttgccaa gttggatctg taaaatctgg 11641 tatgatagag cgtccagtga tgcaacagat ttgtacttgg ggagttaatc cagccaaagc 11701 taaaaggtga aatatcccat gagccagatg tgctaaaaca tttaatgttt ctttgttagg 11761 taaagcttct aaacgattga gatgttcgtt gagtaactcg tagagttctt cttggggttg 11821 ctcgcttaaa gcaacagaga gtactatttc tgctaaatac tggctagcac ccaactttcc 11881 taagtctttt gcaagacctg gataagattt tatcgtttgt gcttgagtga ttttatcgag 11941 cgatcgccct ttagcaatca gtagctcgtt taccacaaac attccactcc taccacctag 12001 actggagttg tgtttccgtg aacctggggc gactgctcga atcagaccaa actcagtagt 12061 caaaattgtc actagtctat ctgattctcc cagagcctga gctttaagat taattccagt 12121 tgctttatac gttttactca ttagtcatta gtcattagtt attagtcatg agtcatgagc 12181 ttaagacaaa agacaaagaa caaatgacta ctcactcttg tccagcttat ggcgctgatg 12241 aagcaaatcg atactgcgag aagtgcctaa tctggttgca cccgccataa ttaagtctaa 12301 agcttggtcg atggtgcgaa tacctcctga ggctttaatt ccaactctgt cttgtgtgac 12361 ttctttcaaa agtcgtacat cttctactgt cacagcgcca ttccaaccag tactcgtttt 12421 gagaaatgct gctccagcat ccatacagat ttctgctgct aatctttttt ctgcatccgt 12481 cagcaggtta gtttctaaaa tcacctttac cgtttgccca gtttcttcgc atatttcggc 12541 aatttcctgg tgtacgttgt cagtgttgcc tgcttttaac cagcccaaat ttatcaccac 12601 atctaactca gttgcaccat tttccactgc ttcttgagct tcatacagct tcgttgcaga 12661 agttgttgcc ccggtaggaa agccaatcac tgtacaaact ttgggctttt taccgtgcag 12721 aagttctgcg gcttgcttca cataagtcgg gtagacacaa accgccgcaa aatgaaatct 12781 gtctgcttct tcacaccatt gttcaacctg ctttgaggta gcggttggta tcagcagggc 12841 gtgatctata aatggcgcaa tgtcaatgtc tgtataatct gctgccatcg cctcttttgc 12901 taatgattaa ttattaacct ttctcaaaaa atttcaagct atttagata // LOCUS NODE_2630_length_12872_cov_4.81774212872 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12872) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12872) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..12872 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(233..1873) /locus_tag="DP116_21155" CDS complement(233..1873) /locus_tag="DP116_21155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318100.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metallophosphoesterase" /protein_id="PRJNA477356:DP116_21155" /translation="MTSPPQLLTDPFVQLPTETSVQIVWFTEFAGYGHIVTYGENLAQ TALATTTQLKRIREDQNSRLGNQTQNGQVYQHPVQRDIWRHEAQITDLTPDTRVSYLV TSVREDGESVSSKVFTLAPKPSPGKPLKILLTSDHQIKPMVAANLQKVVETVGRVDAV WFAGDLVNIPDRASEWFDDHRGGAFFPCLQGRANYEMNSDGVKTIYSGGEIIQYAPMF TCIGNHEIMGRRDHGSLDDEFDATIPRTVALKFYGQESLKENSFNTDTYEEIFTLPQS QEGGKTYYAVSFGDVRLVVLYATNMWRTFNLDAEARGRYREREKDFNTPENWGYGQHI YEPIAKGSTQYNWLAQELNSPEFQQAKYKVVMLHHPPHTLGGNIVPAYTNPVQIIERD ADGNIKAIRYEYPKDADFLIRDVVPLVEAAGVQLVFYGHSHLWNRFCSQSGMHFLETS NVGNSYGAAWGDNKREVPVGYKEDYAELGDPYGLEPVVPTIVPLLGEDGKPMPYIASN DITVFSIFDTGTGTVSSYRFDIRKPDSEVIKFDEFRLK" gene 3059..3754 /locus_tag="DP116_21160" CDS 3059..3754 /locus_tag="DP116_21160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_21160" /translation="MNRQQFLSKLQRFQELLVMSSVGALPTILLGPKLRNILYSSIFA RIGKAVFIQEGVEFINTSCIEIGNGVFIFKGARIDGRGHQNNRIYLNDKVAIERNVSI GCLEDSYIDIGQETFIGPGVCIAGPGDIKIGKRCLIAANSGIYANNHNFADPNEPIKY QGITRKGIVIEDDCWLGHGVVVLDGVTIGQGSVIGAGAVVTKNIPPFSVAVGVPAKVM KSRTNKQLVNTRD" gene 4196..5257 /gene="mtnA" /locus_tag="DP116_21165" CDS 4196..5257 /gene="mtnA" /locus_tag="DP116_21165" /EC_number="5.3.1.23" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317574.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-methyl-5-thioribose-1-phosphate isomerase" /protein_id="PRJNA477356:DP116_21165" /translation="MNSSTNQVYPVIWHNNSVSLIDQTRLPNEYAFVEIHRCSDMAQA IKTMIVRGAPAIGVAAAYGMYLGAREIETSDRNQFLTELEKVAQLLRTTRPTAVNLFW AISRMLKIAYETLGTVEDIKQTLLNTARSINAEDLQTCYAIGDNGLRVLPQTPEKLTI LTHCNAGALATAGYGTALGVVRSAFREGRLARVFADETRPRLQGAKLTTWECVQEGIP VTLITDSMAAHCMKQGLIDLVVVGADRIAANGDAANKIGTYSLALVAKAHNIPFYVAA PLSTVDFSLSDGNEIPIEERNPEEIYQIGETIITPTGAEFYNPAFDVTPAQLITAIIT ENGVFAPGELAKSRIKQFA" gene complement(5482..5823) /locus_tag="DP116_21170" /pseudo CDS complement(5482..5823) /locus_tag="DP116_21170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317573.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="sugar ABC transporter ATP-binding protein" gene complement(5848..6561) /locus_tag="DP116_21175" CDS complement(5848..6561) /locus_tag="DP116_21175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015213211.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21175" /translation="MTDPLIVSGTHSNIDSLRQPLVSGSINTQQQVIPQLANLGDTGL DVLMEFLFGRKKNPATWVDGKVYQVLYNSDSSKAKEFLQAYFPNGIVPLTSECGIDYS SVQQLLAACDFQAADRMTLQKMCELAGPGAVQRKWLYFTEIENFPATDLKTINTLWLI HSEGKFGFTVQREIWLSLGKNWENLWEKIGWKKGNNWTRYPNEFTWDLSAPRGHLPLS NQLRGVRVIASLLSHPAWN" gene 6549..6731 /locus_tag="DP116_21180" CDS 6549..6731 /locus_tag="DP116_21180" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21180" /translation="MGLSCYYSWNAVSEQAIGYQVLWTGVALVAKSSLQEIAKNAHSL LYFGYPLLKTKDGLHE" gene 6996..8417 /locus_tag="DP116_21185" CDS 6996..8417 /locus_tag="DP116_21185" /EC_number="1.1.1.42" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458176.1" /note="Converts isocitrate to alpha ketoglutarate; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADP-dependent isocitrate dehydrogenase" /protein_id="PRJNA477356:DP116_21185" /translation="MYDKITPPTTGAKITFKNGEPIVPDNPIIPFIQGDGTGIDIWPA TVKVLDAAVETAYKGKRKISWFKIYAGDEACDLYGTYQYLPQDTQTAIKEYGVAIKGP LTTPVGGGIRSLNVALRQIFDLYACVRPCRYYAGTPSPHKNPEKLDVIVYRENTEDIY LGIEWRQGNEIGDRLISILNNELIPATPEHGNKRIPLDSGIGIKPISKKGSQRLVRRA IKHALQLPKNKQMVTLVHKGNIMKYTEGAFRDWGYELATTEFRHETITERESWILSNK EKNANISLEENARMIDPGFNALTTDKQAQIVKEVETVLNSIWETHGNGKWKDKIMVND RIADSIFQQIQTRPDEYSILATMNLNGDYLSDAAAAIVGGLGMGPGANIGDECAIFEA THGTAPKHAGLDKINPGSVILSGVMMLEYLGWQEAADLIKKGLGDAIANSQVTYDLAR LLEPPVEPLKCSEFAEAIIKHFS" gene complement(8558..8752) /locus_tag="DP116_21190" CDS complement(8558..8752) /locus_tag="DP116_21190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317570.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21190" /translation="MELIVFGLVVVYAGGAWKFFNGFNRTNFQRSLPNRLKLALLWPA LFATNKSYRQNFRKALKSQK" gene complement(9227..9760) /locus_tag="DP116_21195" CDS complement(9227..9760) /locus_tag="DP116_21195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318830.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="translation initiation factor IF-3" /protein_id="PRJNA477356:DP116_21195" /translation="MSVIERKRNRDLPQINERIRFPKIRVIDTDGSQLGILTPSEALQ LAEEKELDLVLLSDKAEPPVCRIMDYGKYKFEQEKKAREARKKQHTADVKEVKMRYKI EEHDYNVRVKQAERFLKDGDKVKATVMFRGREIQHSDMAEQLLKKMATDLEAFGEVQQ MPKKEGRNMMMLISPKK" gene 10292..11209 /locus_tag="DP116_21200" CDS 10292..11209 /locus_tag="DP116_21200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458816.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_21200" /translation="MDTIVHWQQRVGNQRDWVWRGWQTRYTYIRPAQNNRKTTPLILL HGFGASIGHWRQNLEVLGEHHTVYALDMLGWGASEKAPVNYSVHLWAEQIYDFWKAFI CQPVVLVGNSLGSLVCLAVAAAHPEMVEGIVMMSLPDPSLEQEAIPPILRPIVMGIKN LVASPLLLKPLFQVLRRPNIVRRWASIAYANPEAITDELVEILVGPSQDRGSARAFTA LLKATIGINFSPRVKTVLPTLQIPMLLIWGQKDRFIPPALATQFANFNEKLELLNLED VGHCPHDERPELVNQAILDWIDKYSNPKS" gene complement(11429..12442) /locus_tag="DP116_21205" CDS complement(11429..12442) /locus_tag="DP116_21205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874637.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="short-chain dehydrogenase" /protein_id="PRJNA477356:DP116_21205" /translation="MQLKPINQQVVAVVGASSGIGRITALKFAMRGAKVVVAARSEQG LKSLVDEIRGNGGEARYVIADVSEFEQIKAIADQTVAEYGRLDSWVHVAATGVISPFE KITPQEFERIVQVNLMGQVYGAMVALPHLKREGRGALIHVSSMQGRRSLPLQSSYCAA KHGLEGFLESLRVELQYEKLPISVTSILPATINTPYYNKVGTKLGVKPTGIPPYYQPE LVADAILYAAEHPIRDFIVGDVGRVLDVLQKISPSLVDSLLSVIGIPGQHTGESKSEN APNNIFAPVEGYDHVEGDFSKLAIPSLLDWLDMNPPVKWGAFALAVLGIAVFVSDWHS RNS" BASE COUNT 3760 a 2791 c 2617 g 3704 t ORIGIN 1 caacgcactg gctcccctta cacccctatc cccttacacc cctagttctt gacaagggac 61 tcgcatggaa agtgcgatat cacgagtgag tctatccgtg catggcgata cgcatgattg 121 cgtctctagt tctgcgcgat cgcgtagcgt caagcctccg gcttatcgca cttactacag 181 aaaaatcatt tgcatgcgat gctaccacgc gatcgccatg gcggagtgtt ccttatttca 241 accgaaactc atcaaactta ataacctccg aatctggctt gcgtatatca aaacgatagc 301 tactcactgt acccgtaccc gtgtcaaaga tgctgaaaac cgtgatgtca ttacttgcaa 361 tgtagggcat tggtttacca tcttcaccca acaacgggac aattgtgggc accacaggtt 421 ctaaaccata tggatcacca agttcagcat aatcctcttt atagccaaca ggcacttctc 481 gcttgttgtc accccaagca gcaccgtaag aattaccaac attagaagtt tctaaaaagt 541 gcattcccga ctgactacaa aagcggttcc acaaatgaga atgcccgtaa aatactaact 601 gtacaccagc tgcttcaact aaaggcacaa catcacggat gaggaaatct gcatctttgg 661 ggtactcgta acggatagcc ttgatgttgc catcagcatc acgttcaatg atttgtacag 721 gatttgtata tgctgggact atgttaccac ccaatgtatg gggcggatga tgcaacatca 781 ccactttata ctttgcttgc tgaaattcag gactattgag ttcttgtgct agccaattgt 841 actgggtact ccctttggct attggttcat aaatatgttg cccgtaaccc caattctctg 901 gggtattgaa atctttttcc cgttctcgat atctacctct agcctctgca tccaagttaa 961 aagtccgcca catatttgtc gcgtacagta cgaccaaacg tacatcacca aagctgactg 1021 cgtagtaagt ttttccacct tcctgacttt gtggtaaagt gaaaatctct tcgtaggtat 1081 cagtattaaa agaattttct tttaaagatt cctgtccata gaacttaaga gcaactgtac 1141 gaggaatcgt agcatcaaat tcatcatcta gacttccatg atcacgacgt cccataatct 1201 catggttacc aatgcaagta aacatcggcg cgtattgaat gatttcgccg cctgagtaga 1261 ttgtcttgac 
tccgtcgcta ttcatttcat agttagcacg accttgcaaa cagggaaaga 1321 aagcaccgcc acgatgatca tcaaaccatt ctgaggcgcg atcaggaata ttaaccaaat 1381 caccagcgaa ccaaaccgca tcgactcgtc caacagtttc aaccactttt tgcaaattgg 1441 ctgcaaccat tggtttgatt tgatggtcag aagtcagtag aattttgagt ggtttaccag 1501 gactaggctt gggtgcaagc gtaaacacct tgctactgac actttcgcca tcttcccgca 1561 cactcgttac aaggtaagaa actcgcgtgt cgggagtcaa gtcagtgatt tgggcctcgt 1621 gtcgccaaat atcacgttga acaggatgtt gataaacctg tccgttttgg gtttggtttc 1681 ctaagcgtga attttggtct tcgcgtatgc gcttaagttg ggttgttgta gcaagagcag 1741 tttgtgctaa attttcaccg taggtaacta tatgtccata accagcaaac tcagtaaacc 1801 acacaatttg cactgaggtt tctgttggta gttgcacaaa cggatcggtc aacaactggg 1861 gtggtgatgt cataatcttt gctcagataa acacatgctt atcaggttaa gccatagtcg 1921 tgaaagttag gttaagcgtt gagtttacaa tcaaaccgct tcttcaacac gctatttatt 1981 tgctattaga gcagaaaagt caatatacgt ctgtagtctg atgttattgc gcataccata 2041 aatttataaa tggtatggtt gataaaaaac agcgtcatga caagtgtgac ctaattagtt 2101 ggtaaaccat atgcgtgcaa catgctaaaa gtagcacatt caaaaaactg gcttcagcaa 2161 agccgttata gtaatcgggg aagtgcttac ttgtttgcac cgatggttaa ttttaacgtc 2221 atggcagaag ttggagatac tcccccctag tattaattcc acttgaatag aaataccaag 2281 ataaactcaa caacaatgac atttttctgt cggtgaatgg aggagaaatt tctcctgtga 2341 ctatgaaata gccgcaaaaa atgtttttgt gtttaagaag cagttggttt aactgcatct 2401 ggtggataaa gattataact tcaagagcag caagtcacct caaagaaaat tattgtatcg 2461 ggaatagtgt aaaaatcctt gtttttatat tatctgagaa aggaaacatt agcctattta 2521 gatggtttga tggttaatgc ttccacaaat ttaattgcaa ggcttttaaa ctcgtccaga 2581 gaaaaacact gcttgtgtat ctacctaact cacatcttgt cgcaatcaaa tagcatgact 2641 tttcgtcact cagtattaaa caaaaagtcc tatatttagc gataaaagcc aataaaaata 2701 tgtagctaaa tcgctcatat ttttatcttc tacggtttta tacggagaaa agccagaaat 2761 gagctaatat gaaaccatta caagtttttc ttgcaaatca acactattga tgtgtagatc 2821 caaaaaaact atgtgtcgtg catctaactt gaggatttga cagtacctgc accaaagagt 2881 ttccctgtga ctgaaattta aacattagga aactatgggg tgtttttgaa atggctctaa 2941 attcacttga tggaatgaaa 
aatatcagca tcagtgcaaa cacaaagtta tgtaacacgg 3001 tttggcatca agctcataca aacttaattc ttgtcaaaaa ctcttacgta aggtatttat 3061 gaatagacag caatttttat ccaaattaca gcgcttccaa gaacttttag tcatgagttc 3121 agtaggcgct ctacccacaa ttctgttagg tccaaaactg cgtaatatac tctacagcag 3181 tatttttgct cgtataggta aagcagtttt tattcaagag ggtgttgaat ttatcaatac 3241 ttcttgtatt gaaataggga atggggtgtt tatttttaaa ggtgctcgta tagatggacg 3301 aggacaccaa aacaatagaa tatatttgaa cgataaagta gccattgagc gtaatgttag 3361 cattgggtgt ttggaggatt catacataga tattggacaa gagactttca ttggtcctgg 3421 cgtttgtatt gctggacctg gagatatcaa aataggtaag cggtgtttga ttgcagcaaa 3481 ttctggaata tatgcaaata atcataattt tgcagacccc aatgagccaa taaaatatca 3541 aggtattacc cgcaaaggaa ttgtgataga ggatgactgc tggctaggac atggtgttgt 3601 agtcttagac ggagtcacca tcggtcaggg tagtgtcatt ggtgcaggag cagttgtgac 3661 caaaaatatt cctccgttct cagttgctgt gggagtcccg gcaaaagtga tgaaaagccg 3721 cactaataag caattggtga acaccagaga ctaaaaaaga ggaggctcag cctcccagcc 3781 cttgttccca ggctcaacct gggaacgaga aatcgaggct ctgccttata ttgactggag 3841 tttacattaa gacacaatca atgagcgctc ttgcccagac ttgtttccaa gtctaaatga 3901 agcggcattg ttatcaattg atgacaaagt agttaagatt ttgaaattaa gatttcatta 3961 acttaataaa aatttataat ttgtcatatt aacatcgtaa ctctcatagc agaaagctca 4021 aaagatagga ttttcgcctt aagtaagctg tttcctgaaa attcctttcc tcaggcattt 4081 acccatttac aacggttcca cccaaaaaca gggagaagct acagatacga aaatatggta 4141 taacctgtag aatttctaac gttacaattt gaaaaaattc attacgttaa attccatgaa 4201 ctcttccaca aaccaggttt accctgttat ttggcataac aactcagtct cactgattga 4261 tcaaacccgt ttacctaacg agtatgcgtt tgtagaaatc catcgctgct cagatatggc 4321 gcaggcaatt aaaactatga ttgtacgagg tgcacctgca attggcgtgg ctgcggcata 4381 tggaatgtat cttggggcac gggaaattga aactagcgat cgcaatcaat ttttgaccga 4441 attggaaaaa gtagcccaac tgctacgtac gactcgtccg acagcagtca atttattttg 4501 ggcaatttca cgaatgctga aaatcgccta cgaaactctt ggaacagtag aagacatcaa 4561 acaaaccctg ctgaatacag cccgatctat aaacgcggaa gatttgcaaa cctgctatgc 4621 gattggcgac aatggcttga gagtcttgcc 
tcaaactcca gagaagctga caattctaac 4681 ccattgtaac gctggagcat tagcaaccgc aggttatgga accgctttgg gtgtcgtgcg 4741 ttctgctttc cgagaagggc gtttagcgcg ggtgttcgct gacgaaactc gtcctcgctt 4801 gcaaggtgca aaactgacga catgggaatg tgtccaagaa ggcatcccag ttacgctcat 4861 tactgacagt atggcagctc actgcatgaa gcagggtttg attgatcttg ttgttgttgg 4921 tgctgataga attgccgcta acggagacgc cgcaaacaaa attggtacgt atagtttagc 4981 actcgttgct aaagcacata acattccttt ctatgttgct gcaccccttt ccaccgttga 5041 tttttcttta tccgacggta atgaaattcc aattgaggag cgtaatccag aggaaatcta 5101 ccaaattggc gagacaatta ttacaccaac tggcgcagaa ttttataacc cagcttttga 5161 tgtcacacca gcccagttga ttacagcaat catcacagag aatggagtat ttgctcctgg 5221 ggagttagca aaatcccgga tcaagcaatt tgcttaagtg caatcgccct taagagaact 5281 taaaaccaga actcagaata ggtatgacaa agctaaaact gtgttccctt gtaaatgtaa 5341 tcattctgag ttctgcattc tgcattctga ctccttttac ttttttgtcc aaagtctatt 5401 gaaaacgaca ggagatatta ggaggtattg aaatcaatac ccagcttaat atcaataacc 5461 aagtctttga gaactgatat ataaaccccc acgaattaag gtttctcgtc ccaaaggttc 5521 cacgaccttc acctcaacga tcaactctcc caactcctgt tgtcctctgt ggtctctgcg 5581 cttgtgcggt tcacttaaat aaatattctc cggacgaatt cccaaatcaa atccttgccc 5641 cggtgcaagg cgtattctct ctcgcatagc atgaggacaa ggtaacagct gaccactcac 5701 atcaaaacca tgattggctc ttagaatatt catcggggaa tttcccaaga aagttgctac 5761 catgcggttc gctgattggg agtagatatt ttggggtttg ccgatttgtt gaattcgccc 5821 ttgtaactaa taactaagga ctaatgacta attccaagca gggtgagata gtaaagaagc 5881 aatcactcgc actccacgta gttggttgga caaaggtaga tgacctctcg gagcgcttag 5941 atcccaagta aactcgttag ggtagcgcgt ccagttattg ccttttttcc aaccaatttt 6001 ttcccaaaga ttctcccagt ttttacccag gctcaaccaa atttctctct gtacggtaaa 6061 gccaaacttt ccttctgaat ggattaacca tagggtgttg atggtcttga ggtcagtggc 6121 tgggaaattt tctatttcag taaaatacaa ccattttctt tgcactgccc ctggtcctgc 6181 cagttcacac attttttgta gagtcatgcg atcagctgct tgaaagtcgc aagctgcaag 6241 taattgttgc acagaactgt aatcaatgcc acactctgat gtcagaggta cgatcccatt 6301 cgggaaataa gcttgcaaaa attcttttgc cttggatgaa 
tcagaattat aaagaacctg 6361 gtatactttc ccatccaccc aagttgcagg atttttttta cgtccaaata aaaattccat 6421 caggacatcc aagcctgtat ctcccaggtt agctaactgt gggatcacct gctgttgggt 6481 gttaatagac ccagagacca acggttggcg gagggagtcg atgttactat gagtgcctga 6541 tacaatcaat gggtctgtca tgctattact cgtggaatgc agttagtgaa caagctattg 6601 gctatcaggt gttgtggacc ggcgttgcac tggtagcgaa gagtagttta caggaaatcg 6661 caaagaatgc ccactcatta ctttactttg gttatccgtt actaaagact aaagacggtc 6721 ttcacgaata agttcacaat tcatttgggt ttgctatata ttcaaagagt atactaataa 6781 agcgtgcagg ctataagata gcgaagattt gaagaaatct tacgcgcctg tgcccacgat 6841 gcaaacaggg gaaaaaactc tgcgcctcat aaactttaca catctatcaa aacaagacac 6901 agcaaaatct actagcctta catacatgag tctccctcct ttgagttcat accccaaagg 6961 atactatgtt gcagcgtagc catcaggagt gtttgatgta cgacaaaatt accccgccca 7021 caacaggagc aaaaattacc ttcaagaatg gagagccaat tgttcctgac aacccgatta 7081 tcccctttat tcaaggtgat ggcacgggga ttgatatttg gcccgcgaca gtaaaagtcc 7141 ttgatgctgc ggtagaaacg gcatacaagg gcaagcggaa gataagttgg tttaagattt 7201 atgccggaga tgaggcttgt gatttatacg gaacatatca gtatttaccc caagacactc 7261 agacggcaat taaagaatat ggtgttgcca ttaaaggacc tttgacaact cctgtcgggg 7321 gtggcatacg ctcccttaat gtagcattgc ggcaaatttt tgacctttac gcctgcgtgc 7381 gtccttgccg ctactatgca ggtacaccct cacctcataa aaatcccgaa aaacttgatg 7441 tcattgttta ccgggaaaat acggaagata tttatttggg gatagagtgg cgacaaggta 7501 atgaaatagg cgatcgccta atttccatcc tcaacaacga actcattccc gcaacaccag 7561 aacacggtaa caagagaatt cctctagact caggcattgg catcaaaccc atcagtaaaa 7621 agggttctca gcgcttggta cgacgtgcca tcaaacacgc cttgcaactc cccaaaaata 7681 agcaaatggt gactttggtg cacaaaggca acatcatgaa gtacactgaa ggcgcttttc 7741 gcgattgggg ttacgagttg gcaacgaccg agtttcgcca tgagaccatt accgaaagag 7801 aatcttggat tttgagtaat aaggagaaga acgccaatat ctccttagaa gaaaacgccc 7861 gaatgattga ccctggattt aacgccctga ccacagacaa gcaagctcaa attgtcaagg 7921 aagttgaaac agttcttaac tcaatttggg aaactcacgg taacggcaag tggaaagata 7981 aaatcatggt caatgaccgc attgccgaca gtattttcca acaaattcaa 
actcgcccgg 8041 atgagtattc cattctggca acaatgaact taaatggtga ttacttgtct gatgctgcgg 8101 cggctattgt tggcggactg ggtatgggac ctggggcaaa cattggcgat gaatgtgcca 8161 tttttgaagc cactcacggt acagcaccca aacacgcagg gttagacaaa ataaatcctg 8221 gttcagtgat tttatccggt gtaatgatgc tggagtatct gggttggcaa gaggcggctg 8281 atttgattaa gaaaggttta ggcgatgcca ttgcgaacag tcaggttact tacgacttag 8341 caaggttgct ggaacctcca gttgaaccct taaaatgttc tgaattcgct gaggcaatca 8401 ttaaacattt tagctagtgc aaacttacta aaattttgct ttctagcctc ctctctatta 8461 acggggaggg ggttttcata agctggatta gcgccagcat aacgcacctt tttcatcaaa 8521 acataaacac actcactcgc aactcagaga aaaaaagtta tttctgactt ttgagagcct 8581 ttctaaaatt ttgacgatag gatttatttg tggcaaacaa agctggccac aataaagcta 8641 attttaggcg attgggcaga cttcgctgga agttagtccg gttaaatcca ttaaagaact 8701 tccaagcacc acctgcataa accacgacta atccaaatac gattaattcc attcagcaca 8761 actccacaag caattggtca caattttagc taagttttac caacaccata atggaacttt 8821 ataacgatat cgaaaaccaa tgagtcagtc acaagtgagc aaaagcagaa actgcttgtg 8881 aatactttga ctgtcgcttt tttcggcatc atacgttaat taatacattt atatatcaag 8941 tttagagtag cttcgtgtat tatgacaatt cactcgtcag ttattcagaa attgtctcaa 9001 aaaatgtaaa aatctacact ataacgattg cgtagagcat ggcttgattg aaattttcat 9061 tgtagtatgt agagtgcgat acgcctcttg gcgtctgctt tcccttgcaa tcgctcgttc 9121 ctctagcccg ccttcaatgg ttaggatggc tacaaacaag agtaggggaa acctctcaac 9181 ggaagtatcc cccttgtctg atgtgctaag tttgcttatt tattgatcac tttttgggtg 9241 agatgagcat catcatgttt cgcccttctt ttttgggcat ttgctgaact tcaccaaatg 9301 cctctaaatc agtcgccatt ttcttgagca attgttctgc catgtcgctg tgttgaattt 9361 cgcgacccct aaacatcaca gttgctttga ctttatcccc gtctttgaga aaacgctctg 9421 cttgcttaac tcttacatta tagtcgtgtt cttcaatctt gtaacgcatc ttcacttctt 9481 tgacgtcagc agtgtgctgc tttttgcgag cttcccgtgc cttcttctct tgctcaaact 9541 tgtatttccc atagtccata atccgacaaa ccggtggttc agctttgtca ctcagcagca 9601 ctaaatcaag ctctttttcc tccgctagtt gtaatgcttc tgatggggtc aggattccta 9661 actgagaacc atcggtatca atcacccgaa ttttcgggaa gcgaattcgt tcgttaattt 
9721 ggggcaggtc gcgatttctt tttctttcaa tcactgacat gatttgtggg agctgtttgt 9781 taagaaagtg gcttagttgt ttgttgtatg cctcagaagt aaaatactaa aatcactgag 9841 gactagaaaa tgaacttata gttctagggt agctaattcc ctagagcagt ttcattgtct 9901 tcgttgtatc tgccatttta ctcattttga aagcacttct gtctaatcgt taatctaaat 9961 tacacaatag tagcaatttt taatcattgt aacagttttt ataaaaccat tgactttatg 10021 gcttttaagt gtaattccaa aagggcggtt taagataata cagtattcac atgaacaaat 10081 gtatctttat ttactagtat atataatttt ctcagtatta attctaagcc aaaaaaggtg 10141 gaaatgcact tagcctagaa tatatttaaa gactatcttc attttagcca aactagagtc 10201 acaactctcc acagatttag cactaaggag ttgagctaac tttgacaaag tggaatgaat 10261 agtttttctg tgcgataaat tggagaaaac tgtggacacc attgtgcact ggcagcaacg 10321 ggttggcaat caaagagact gggtttggag aggctggcaa actcgctaca catacattcg 10381 tcctgctcaa aacaatcgca agacaactcc cctgatcttg ctacacggat tcggtgcatc 10441 tattggtcat tggcgacaga atttagaggt actaggcgaa caccatactg tttacgctct 10501 tgacatgtta ggttggggcg cttctgaaaa agctcctgta aattacagcg tacatctttg 10561 ggcagaacag atttacgatt tttggaaagc atttatctgt caaccagttg tactcgtagg 10621 taactctctt ggttcacttg tttgcttggc agtcgccgct gctcatcctg agatggtgga 10681 aggtatcgtg atgatgagtt tgcctgatcc gtcattggag caagaggcga ttcctcccat 10741 tttacgacct attgtcatgg gaatcaaaaa tcttgttgct tcgccactct tacttaagcc 10801 tttgttccag gttttgcgcc gaccaaacat agtacgtcgc tgggctagta ttgcctatgc 10861 caacccagag gcgatcactg atgaactggt agaaattcta gttggtcctt ctcaggatcg 10921 aggttctgcc cgtgccttca ccgccttgtt gaaggcgacc attggtatta actttagtcc 10981 tagggttaag acagtactgc caaccttaca aattcccatg cttctgattt gggggcaaaa 11041 agaccgattt attccccctg cacttgctac tcaatttgct aatttcaacg aaaagttgga 11101 actgcttaat ttagaagatg tgggacattg tcctcacgac gaacgtccag aactggtgaa 11161 ccaggcgatt ttagattgga ttgataaata tagcaatcct aaatcctgag tcgctacggg 11221 cacgccctcg catatgccta cggcacgcta cgctttgcgc ttacgcttgc gtgcgctttg 11281 cgcaatagat tttatccaag ggaatccaga cgtaaccgac cacgggggcg ataagcctcc 11341 ggcttgacgc tgcgcgtcaa gcgaactaaa acacagtgta tctcaatgag 
tgaaggagtt 11401 atggggaaaa atcccaaaga gtctactatc atgaattgcg tgaatgccag tcactcacaa 11461 acactgcaat tcccaacacc gccaatgcaa aagctcccca cttcacaggt ggattcatat 11521 ctaaccaatc taacaaagaa ggtattgcta gcttgctaaa gtctccctcg acgtggtcat 11581 agccttcaac tggtgcaaaa atattgtttg gtgcattctc tgattttgat tctccagtgt 11641 gctgccctgg aataccaata accgacaaca atgagtctac taaagaaggc gagatttttt 11701 gtagtacatc taacacccta cccacatctc cgacaataaa atcgcgtatg gggtgttccg 11761 cagcataaag gatagcatcg gcaacaagct caggctggta atatggtgga ataccagtag 11821 gtttaactcc tagcttggtg ccaaccttgt tgtaataagg tgtgttgatt gttgcaggca 11881 atattgatgt cacgcttatg ggcaattttt cgtattgcaa ctcgacgcgc aaactttcta 11941 agaaaccttc caaaccatgc ttggcagcac aataggaact ttgtagcggc agactccgcc 12001 taccctgcat tgaggataca tgaattaacg ccccacgtcc ctcacgtttg agatgaggta 12061 gtgctaccat tgcaccgtat acctgtccca tcaagttaac ttgaacaatt cgttcaaact 12121 cctgcggcgt gattttctcg aaaggcgaaa tgacacctgt agcagcaaca tgaacccaac 12181 tatccaaccg tccatattct gccactgttt gatctgcgat cgcctttatc tgctcaaact 12241 cactcacatc tgcaattaca tatctagcct caccaccatt gcctcgaatt tcatctacca 12301 aggactttag cccttgttca ctacgagcag caaccactac ttttgcacca cgcattgcaa 12361 acttcagtgc cgttatccgc ccaataccac tggaagctcc aactaccgca acaacttgtt 12421 gattaattgg ttttaattgc ataatgtttg tttcatctct tctgttctgt ctccttccat 12481 gtattaactt tgattaagca tccatacatc tttcataagg cttataaaaa aaaattggcg 12541 ttgctgaatt tgggaatgaa ttatggttgt agcacgatgt tcaattaatt tcctgtaacc 12601 ctgtagaact ttcggatgca acgtctctac attaatggga atttgaaaat tcatatcttg 12661 attcagcaac gccaaaaaat ttgtatacaa aaggagagtg attagtacaa gcgtgtacaa 12721 acatgtatat caagcagtaa ttgcaacagg atactactac tttttatcga gtgaaacccg 12781 catgatttat aggtgtaaag ttatttgcct aaattttata atataaataa gtcttaaaaa 12841 ttacacttca attttatgaa aatactctca tc // LOCUS NODE_2636_length_12842_cov_4.416673 12842 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12842) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12842) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..12842 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..762 /locus_tag="DP116_21210" CDS <1..762 /locus_tag="DP116_21210" /inference="COORDINATES: protein motif:HMM:PF13676.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_21210" /translation="PKLSREGRAPYFHILHWLANNDKWSLQLDFAITRYLQHRASVEP LLDKGLIQNFIARNHEDFYDVIHYDSLTKILSIEDSKFVFFLRNLSWHEFAVKAGYLS VSFTGRYDFALSFAGANRNVAEAIARKLTEAEIAVFYDQDEQHRILANDVEEYLKPIY DSQAQFIIALLSKDYPTRIWTKFESEQFETRFSKSEVIPIWYSDSLPGMFDKTNRVGG LTYNIDTDMESQVDHIVNTLIKKLGETRRKITGAQ" gene 879..1229 /locus_tag="DP116_21215" CDS 879..1229 /locus_tag="DP116_21215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320651.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21215" /translation="MHFLVKSLLKSIQMNSIPISPKVIISYSHDSREHKDRVLQLADQ LREDGIDCNIHQYYESDPPPQGWPRWANDEIDAANFVLIVCTELYNRRFRSHENKTMA KPFKWTYKGKVLAI" gene complement(1623..2321) /locus_tag="DP116_21220" CDS complement(1623..2321) /locus_tag="DP116_21220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317475.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21220" /translation="MLKMMSNGRVPNKPVKQRPNESHEPVSAQRARKLILEHRAWDGM RVLGHLDLSGASELYNLPENLSCESLDIRDCVNLTTLPKGLHITYWIELAGSGITSLP AGHGFVLRWRGVQVSDSIAFESQSITGQDILNVENVELRRVLIERLGYETFLQQVGGL IRDRDKDAGGERQLVYIPFEDDEPLMVLKVTCPSTGHTHILRVPPYMRSCHQAAAWIA GFTNPDDYHPLIEA" gene complement(2321..2662) /locus_tag="DP116_21225" CDS complement(2321..2662) /locus_tag="DP116_21225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410057.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21225" /translation="MLNDRNNSQANQPGVLYRHGDVLIGRIASLPVGAQRRIGATLAH GEVTGHSHRIQQSNAVQLWVHGSNLFLEVKEPSATLIHEEHRAIELPQGLYRVWRQRE YRPDAYVEVTD" gene complement(2878..3708) /locus_tag="DP116_21230" CDS complement(2878..3708) /locus_tag="DP116_21230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407562.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_21230" /translation="MRIHHLNCGCMCPIGGALFDGFSRGLTACLVCHCLLVETNQGLV LIDTGFGQRDIKAPLSRLSPFFMNLNRIKFEQKYTAIAAIEQLGFRARDVRHIVLTHL DFDHAGGLEDFPEAIVHVMLPEIEAAQERRGFISSQRYRPGQWDEVKQWKYYSAKGEP WFGFEAVRDLDGLPPEILLIPLAGHTRGHAGIAIETPEGWLLHAGDAYFYRHEIGTSK PDCTPGLRAYQWLMEVDRKARLYNQQRLRELSLNHSSDVRLFCSHDAIEFKAFADQNN " gene 4151..5839 /locus_tag="DP116_21235" CDS 4151..5839 /locus_tag="DP116_21235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015192145.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(FAD)-dependent dehydrogenase" /protein_id="PRJNA477356:DP116_21235" /translation="MKQVEAVVANINDLKDGEMQQVCVGETEVLLSRLDGKFYAVGAH CSHYKAPLAEGVLSGHYVVCPWHNACFDVTNGDQTEPPGLDSLACYTVRIEGEKVIVS VPEKTTGLRSPEMAQFDPNVDKRTFVILGAGAAGSHAAEALRVAGYQGRIVMITQEDK LPYDRTKLSKDYLIGDTSREEMPLRSPDFYKEHAIEVLLNKRVEQVQTTTGAIALSDG DSLTYDALLVATGGKPRQLDIPGADLQNIFTLRSFDDTNRTLTLTEQKRQVVVIGSSF IGMEMAAGLSQRGSQVTVVSPDSVPFEKILGEQIGKQFQQVHEENGVSFKLGRKAVQF EGSSKVEAVILDNGDRLTADIVIVGIGVQPATQFLEGVNLHPKDKSVVVDEYLRAAEG IYAAGDIARYPDWRTGEPIRVEHWRVAAEQGRIAAHNMAGKPVKFKGLPIFWTMQFQF PLRYVGHAESWDEMIVDGDLQKQEFIVCYVKNNQVLAVATSHKDTETAAIFELMRSNQ MPTLDELRSGAVDFVQRQRQVNGDFLPTKKEKGTELADIVGNQWYFLNFQPGKL" gene complement(5769..6565) /locus_tag="DP116_21240" /pseudo CDS complement(5769..6565) /locus_tag="DP116_21240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_086558172.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene 7162..9134 /gene="cadA" /locus_tag="DP116_21245" /pseudo CDS 7162..9134 /gene="cadA" /locus_tag="DP116_21245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015163016.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="cadmium-translocating P-type ATPase" assembly_gap 8828..8837 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 9148..10029 /locus_tag="DP116_21250" CDS 9148..10029 /locus_tag="DP116_21250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747687.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipid kinase" /protein_id="PRJNA477356:DP116_21250" /translation="MTRRALLLVNRHARRGQHSLPQAVKQLRELGFDLIEEDTEKPNH LREMIHRHRNQVDLVIIGGGDGTLNAAVDALVETQLPLGILPLGTANDLARTLGIPPT LPDACQTIAKGELQRIDLGWVNGKHFFNVASLGLSVQITERLNQELKRRWGVLAYAAT ALGVIWQARLFRADIRLDGELIRVKTVQIAVGNGRYYGGGMIICENAAINDQKLHLSS INVRHWWQIVALLPAMKQGRHKAWSGIHTDKCQEIEVYTYRLHAINTDGELTTNTPAK FRLIPKALSVLIPRKYS" gene 10156..11103 /locus_tag="DP116_21255" CDS 10156..11103 /locus_tag="DP116_21255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019497117.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cation transporter" /protein_id="PRJNA477356:DP116_21255" /translation="MTIAPKFEHNHQSDTTAVLSTQKVRRLWIVLGLRSSLLLMELAA GFWTRSLSLLAISGHMLSDIFTLGLALFAAKLSQRPAVGQATFGYRRAEILVALLNGL TLIAIATLIAWKAVGRFQSPEPLSGLPTLIVAALGLAVNSLLISLLYFESHHDLNLRG AFLHVVADAAGFLGVILAASMVYWLNWLWADPVASLFVASLMSLSAFPLVWDSLRVLM ELAPQSTDVALVEAALSSFAGVRQVEMLHIWTITSGQVALCAHLVVESMSAFERDKLL EQLQTRLTQEFKVCESTLQMTALNEGDSAPFTLRDRTDY" gene 11212..11916 /locus_tag="DP116_21260" CDS 11212..11916 /locus_tag="DP116_21260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019497123.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_21260" /translation="MASKLLNVLAYTLIPVAAATVGGGIAAWRTPGPKLKSVVQHFAA GVVFAAAAGELLPDLVHEKSLPATIIGGAFGVAVMLAVKQLVKKASGSISLIATVGVD VLIDGLIIGIGFAAGAKEGILLTIALTIEILFLSLSVSTTLSQANASRTRVMITTLGI ALLLPLGAVIGSALLGGLSGFSLATFLAFGLVALLYLVTEELLVEAHEIPDTPLTVAI FFVGFLVLIVIEEMLQ" gene complement(11954..12457) /locus_tag="DP116_21265" CDS complement(11954..12457) /locus_tag="DP116_21265" /inference="COORDINATES: protein motif:HMM:PF07282.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_21265" /translation="MRERIRNIVDEAHKQVAHYICSQFDVILLPSFETSEMVSQRGGR VPRHKATGEPVRVVKKSIKLRSKSVRAMLSWSHYRFKEYLKFKAQEWNTQVIEVSEAY TSITCTKCGHIHTKLGGNKKFKCPSCGHTLPRDLNGSLGIYLKALVERPELLQWREHL LPMGNQS" gene complement(12473..>12842) /locus_tag="DP116_21270" CDS complement(12473..>12842) /locus_tag="DP116_21270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_086558172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_21270" /translation="GNSENTIDYLQYLLTQSPNQRLLIFWDGASYHRSKEVRGFLSEV NQGLPTDQWKIHCVRFAPNCPEQNPIEDIWLQAKTWVRRFCALIPSFSLLKWMFEWFI RHTTFDFPTLRMYGVFSKIK" BASE COUNT 3520 a 2741 c 2962 g 3609 t 10 others ORIGIN 1 cctaaattgt cccgtgaagg gcgtgcccca tactttcata ttttacactg gctagctaat 61 aatgataaat ggtctcttca actagacttt gcaattacaa gatacttaca gcatagagca 121 agtgttgaac cattacttga caaaggtttg atccagaatt ttattgctcg aaatcatgaa 181 gatttttatg atgttataca ctatgattca ttaactaaaa ttttaagtat agaagactct 241 aaatttgtat tttttcttag aaatttatca tggcatgaat ttgcagtaaa agcaggttac 301 ttatctgtaa gctttactgg tagatatgat tttgccttat ctttcgcagg tgcaaataga 361 aatgtagctg aagccattgc tagaaaatta acggaagcag aaattgctgt tttctatgat 421 caggatgaac agcatcgaat tttggcaaat gatgtggaag aatatttaaa accaatttat 481 gacagccaag cacagtttat catagcttta ctcagtaaag attatccaac gcgtatttgg 541 actaagtttg aatccgaaca gtttgaaact cgttttagta agagtgaagt gattccaatt 601 tggtatagtg actctctacc aggaatgttt gataaaacta atcgagttgg aggactgact 661 tataatattg atactgatat ggagtctcaa gttgaccata tagttaatac tttaattaag 721 aagctaggag agacacgtag aaaaataact ggtgctcaat aatatcttat tcaagcagta 781 attatataaa tcttagtact ttactacttt aataaaaaac agcaggcgta agaaaagatt 841 taaaactgtg gataaatact taatttttga tataatatat gcacttttta gtcaagtctc 901 tgttaaaaag catacaaatg aactccatcc ccatttctcc caaagtaatt atcagctaca 961 gccatgactc 
acgagaacat aaagaccggg tgttacagct agctgaccaa ttgcgggaag 1021 atgggattga ttgcaacata caccagtact acgaatcaga tccaccacct caagggtggc 1081 cccgttgggc gaacgacgag attgatgcag ctaactttgt cctgatagtc tgtaccgagt 1141 tatacaatcg acgcttcagg agtcacgaga ataaaacaat ggctaaacct tttaagtgga 1201 catacaaggg taaagtgttg gctatttaat ggttggttta tttacgccgt gccgtactag 1261 cttcgtcaaa atcaaatagc agtcctaaat aatttataaa attatctctt ctctcttttg 1321 ttggcgttct tggcgacgcc tgtcgcctaa gtcgggaaag ccttcattcg cgctagttcg 1381 tctatgtcct gcggacacgc tgcgcgcttg gcggttgata atttttctaa ctcaaatagg 1441 attgttatat caggttcggt taaacactta taatatctgt aggttgggtt gaggaacgaa 1501 acccaacact tgatccttgt tcatgttggg tttcactgcc gttcaaccca acctacattg 1561 ataaattgta ttataagtaa tcacccgaac ttgatattac atattaactt tacgaattaa 1621 tactacgcct caatcagagg atgatagtca tctgggtttg taaagcctgc aatccaggcg 1681 gctgcttgat ggcaactccg catgtaaggg ggaacgcgta ggatatgagt gtgtcctgtg 1741 gatggacaag tgactttcaa gaccataagt ggttcgtcat cttcaaaagg aatgtagaca 1801 agttgacgtt ctccaccagc gtctttatcg cgatcgcgaa ttaaccctcc cacttgctgc 1861 aaaaatgttt cgtatcccag acgctcaatc agtacgcggc gcaactcaac attttctaca 1921 ttcaggatat cctgccccgt tatggactgg gattcaaagg caatgctatc actcacctgc 1981 acgcctcgcc aacgcaaaac gaaaccatgc cccgcaggca aactggtaat cccacttcca 2041 gccaactcga tccaataagt gatgtgcaga cctttgggaa gggtagtcag gttcacacag 2101 tctctaatat ctaaactttc acaactgaga ttttcaggta gattgtaaag ttccgacgcc 2161 ccacttaaat ccagatgacc caaaacgcgc ataccatccc atgcgcgatg ctcaaggatg 2221 agttttcgag cacgttgggc cgatacaggc tcgtggcttt cgtttgggcg ctgcttaact 2281 ggtttattag gcacgcgtcc attcgacatc atcttgagca ttagtcggtt acctccacgt 2341 aagcatcagg gcggtattca cgttgtctcc aaacgcggta aagcccttgg ggtaattcaa 2401 tcgcccgatg ttcttcatga atcaatgtgg cacttggttc tttaacctct agaaaaaggt 2461 tactaccatg cacccacaac tgaacagcat tagattgctg aatgcgatgg ctgtgtcctg 2521 tgacttcacc atgagccaga gtagcaccta tacgcctttg agcaccaact ggtaaactag 2581 caattcgtcc gatcagaacg tcgccgtgtc tgtagagaac accaggttgg ttagcttggg 2641 agttgttacg gtcgttcaac 
atgttctgat cataacaata cctttgccaa aaatatctca 2701 agtgttgctg caagcgagaa atctttaggg cgttttgagt ctgccgttgt tcaaagcagc 2761 gggcaaggga gggcatacca agaatcccac tgctttttat tgaagataag cattgcgaaa 2821 tcttctcacg ctgtgggagt gtcatcaatt acaggtgtag attatttatt acacctgtta 2881 attattttgg tcggcgaacg ctttgaactc aatggcatca tggctgcaaa agaggcgcac 2941 gtcactactg tggttgagcg ataactcacg caaccgttgt tgattataaa gtcgagcctt 3001 gcggtctacc tccatcaacc attggtaagc acgcagacca ggcgtgcagt ctggtttaga 3061 ggttccgatt tcgtgccgat aaaagtaggc atcgcccgca tgtaaaagcc aaccttctgg 3121 tgtctcgatg gcaatgccag catgaccgcg cgtgtgacca gcaagcggga tgaggagaat 3181 ttctggtggt agtccatcga ggtcgcgcac tgcctcgaaa ccaaaccaag gttcgccctt 3241 tgctgaataa tatttccact gttttacttc gtcccactgg ccaggacgat agcgttgcga 3301 tgagataaag ccgcgccgtt cttgcgctgc ctcaatctca gggagcatta catgcacgat 3361 cgcttctgga aaatcttcca acccacctgc atggtcgaaa tcgaggtggg taagcactat 3421 gtggcgcaca tcgcgcgcgc gaaagccaag ctgctcaata gctgcgatcg ccgtgtattt 3481 ttgttcaaac ttgatacgat tcaaattcat gaagaacggg ctgagtcttg ataatggcgc 3541 tttgatatcg cgctgaccga agcccgtatc gatgagaaca agtccctgat ttgtctcgac 3601 gagcaagcag tggcagacga ggcaggctgt tagcccgcga ctaaaaccgt cgaaaagtgc 3661 cccgccaatc ggacacatgc aaccgcaatt aagatgatga atacgcataa atgatttcac 3721 gctcgtatca atctctctta aaaacgaaaa agtatggata atcaacatac ttaggggtga 3781 aaaggggtga aaaaatttct gacttggata tgatcaggat ttaatcgctc ttgtgcactt 3841 ttgctgtcct agtctaaaat ccagtaattg agactcgctt tgaaaatttg cctatgccca 3901 aaggctgaat ttctcacttc aagtaatgta cccttgattc gttgaaaaat tcataaattt 3961 tggaatacag ttaagtgtcc aagcaagaac tacctaagta ttttcctatc gtggatttgt 4021 ttaattaatt cacagggata gggaacaggg aacagggaac agggaacagg gaacaggtta 4081 agaagtgttt tcatgtatcc gagtgtacgc agttcatgat ggctacttaa gtgttatttg 4141 agaatttggt atgaaacaag tagaagcagt cgtcgcgaat attaacgact taaaagatgg 4201 tgaaatgcaa caggtgtgtg tcggcgagac agaagttttg ctgagccgat tggatggaaa 4261 gttttatgcc gtcggtgcac actgtagtca ttataaagca ccactggcag agggggtgtt 4321 gagtgggcat tatgttgttt gcccttggca 
caatgcctgt tttgatgtga caaatggcga 4381 ccaaacagag cctcctggct tggattcttt agcgtgctat acggtacgca ttgaaggtga 4441 aaaggtcatt gtcagcgtac cagagaaaac aacagggttg cgatcgcccg aaatggcaca 4501 attcgacccc aacgttgata aacgcacatt tgttatttta ggagcggggg cggctggttc 4561 tcatgctgca gaagctttgc gagtcgctgg atatcagggg cgaatcgtca tgattactca 4621 ggaagacaag ttgccctatg atcgcactaa gcttagtaaa gactacttaa ttggtgacac 4681 atcaagagag gaaatgccgt tgcgctcgcc agatttctac aaagaacacg caattgaagt 4741 gctgttgaat aagcgagttg agcaggtaca gacaacaaca ggtgcgatcg ccttgagtga 4801 tggtgattcg ttgacttatg atgccctgtt ggtagcaaca ggaggaaagc cacgccagct 4861 cgatattcca ggtgcagact tgcagaacat tttcacatta cgtagttttg atgataccaa 4921 tcgcactttg acactcaccg agcaaaaaag acaggtggtg gtgattggtt cgagttttat 4981 tggtatggaa atggctgctg gactgagtca gcgaggctca caagtaaccg ttgtttcgcc 5041 cgattctgta ccttttgaga aaatcttggg tgagcaaatt ggcaagcaat ttcagcaggt 5101 tcatgaggag aatggcgttt cctttaaatt gggcaggaaa gcggttcaat ttgaaggtag 5161 tagtaaagta gaagccgtaa tattggataa tggcgatcgc ttaacagccg atatagtcat 5221 tgtaggaatt ggcgtacagc ctgcaacaca gtttcttgag ggtgttaatt tgcatcccaa 5281 agataagagt gttgttgttg atgaatacct gcgtgcagct gagggcattt acgccgcagg 5341 cgatattgct cgttatcccg actggcgtac aggagaaccg attcgcgttg aacactggcg 5401 agttgcggca gaacaagggc ggattgcagc tcataacatg gcaggaaaac cagtcaaatt 5461 taaaggactt cctattttct ggacgatgca attccaattt cccttacgct acgttggaca 5521 tgctgaaagt tgggatgaga tgattgtgga tggcgatctg caaaaacagg aatttattgt 5581 ttgctatgtg aaaaataatc aagtattggc agttgcaacc agccataagg ataccgaaac 5641 agcggctatt tttgaactga tgcgctccaa ccaaatgcca acactcgatg aattacgcag 5701 cggtgcagtt gattttgttc agagacagcg tcaagttaac ggtgattttc tacccacaaa 5761 aaaagaaaag gggactgagt tagcagatat tgtaggtaat caatggtatt ttctgaattt 5821 ccagcctggt aaactttaag taggaattct cggtcaagat agtcaactgc cccgtaatat 5881 gtctgtttat ctcgctcatt cacgacggga actgtaactt cttggtctgt ttttccccat 5941 acatagccaa tcgtgtctcc ccacatcaaa tggcattcat caagcaacaa tactctcaac 6001 tttcctgttt cgatttcctc tcggtggttt gccagcaatg 
ttccaatctc tttttttttg 6061 ccgcaacagc atccgggtca gctttgggat tcgactttgt agttttcttc cagctaattc 6121 ctgctgcatc gaacaggtcg tagtaacttt gctttgactc ataaactacc tcatactcaa 6181 aagcgagttt atactccagt tccccaagct cccagcaatc ctttgtttgt agccaactca 6241 acacctcttc tcgttgttcc ctactcaagt aactctttct tcccctatgg ttcagccgca 6301 gtccatctat tccatcttgc tcataggctt gtttccaacc tgttatcgaa cccagcgaca 6361 catctagaat tgtttgaatt tcttcatata agtacccttg ataaaccagc ttgactgcta 6421 gcgctttcct tacctcacga gcatttggac gcagagcaat aaattcttgc agaacggact 6481 tcgcatcaag gacggctcct tcagccaaca ttgcagttgc tacttcctgt gggagtcttg 6541 cggtttgtag atgctgattt atcattatct gctcttacac ttgtacacct gtgcgtattc 6601 tctcgtactc tttttcaaaa atcaaatagg aatcctatat aaagtagttt gctagaattc 6661 aaactaacca tatgactatg aaatttgcat gaaattttta gatgatttta gataaaaaac 6721 ttactttttt agaaagaagc aactactaga acaaattcct ggatttgact atcttgatag 6781 tgaatccagt ctggttgagc tattagcttg agttttcaag aaataccagg gtatagttcg 6841 gtatttttgg atatttcaat aattaataaa gagccattct cattttcaaa acaataaaag 6901 taatcattat cattttctct aaaaagaata gccactaatt ttgcttgact caaagttaaa 6961 ctcttaatat aaactgaaga ttaaatgggt aatttttata aaagaatacc cgatctgctt 7021 tcaaccacct cgacctaccc agacagttaa ggtcgaggat tgcgaagagt caaactctgt 7081 aaggagaccg ctcctaaagt taccacgctt gtgacggatg atttcaaccc ctattctttg 7141 ctgtaagcga gaaagacaat catgtcaaag ccacattcaa aacattcggg ctgttgcgag 7201 catgaccacg accacagcca gaccaatcac agccacgatg atcacgacca cgaacacgac 7261 cacgatcatg gtcacggcga cggagatttt aacctcaagt cggaattgac tactgttgtt 7321 ttggttgtta ttttatttat cctcggttca atttttgaaa aacagttaca caatacacct 7381 tactcggttg gtgaatatct ggttttcatc ccagcctatc ttttaagtgg ctggaatgtt 7441 ttgactagcg ctggacgcaa tattctccga ggcagagtgt ttgatgagaa tttcttgatg 7501 actgtagcga ccttaggcgc tttagccatc catcaactac ccgaagctgt aggggtaatg 7561 ttgtttttca aagtcggcga attattccaa gaacttgcgg tcagtcgttc taagaaatct 7621 atcaaatctg ttttagacgt tcgccctgac ttcgccaact tgaagacgac gaatggctct 7681 gttaaaaaag tatcgccagt tgaagtagcc ataggggata ccattgttgt 
caagccaggg 7741 gaaaagattc ccttagacgg tgatattgtg gagggtggtt ctcaggttga tacctctgca 7801 ctaactggag aatcagtccc aagaactatg aaagctggag aaccagtttt ggcagggatg 7861 attaacaaaa cgggagttct ctccatcaaa gtcaccaaac tatttggtga gtcttccata 7921 tctaagatat tagatttggt gcagaatgcc agcagtaaaa aagctgaatc tgagaaattg 7981 attaccaagt ttgccagata ctacacgcca gttgtcgttt ttggctcatt ggcagttgcc 8041 ctacttcccc ccttatttat tcctggtgca acttctgccc aatgggttta ccgcgccctc 8101 gtcctgttgg taatttcctg cccctgtgga ctggtaatta gtatcccact tggctacttt 8161 ggaggtgttg ggggtgctgc caaacgaggg attttagtta aaggctctac ttatttagat 8221 acactcacag cagtcaaaac ggtagtcttt gacaagacag gaactctgac aaagggtgta 8281 tttaaagtgg cacagattgt gccgaaaaat ggtttcaccc aagaagagct actccgactt 8341 gcagcagaag tagagtcaca atccaaccat cccatagctc agtctatccg ggatgcctat 8401 ggtcaagaaa ttgacccatc cttaattgaa gcttacgaag aaattgcagg tcatggaatc 8461 cgagctttgg tagaaaatcg gttggtgatt gcgggaaatg accgcttatt gcatcgggaa 8521 aacatcgttc atgatgtctg taacgttgag ggtacagtcg ttcatctagc ggtagataag 8581 cgttacgctg ggtacattat aattgctgat gaactcaagg acgacgcgat acaggcgatt 8641 caagccttga agaaattggg gattgaaacc atcatgttaa cgggagatag tcaagctgta 8701 gccgagaggg tggcgcaaaa tctggggtta gattcctacg aagctcagtt attaccagaa 8761 gacaaggtga gtgcaattga gaaaatcttg agccgctctg gaaaagataa taaagttgtc 8821 tttgtagnnn nnnnnnncca gttattgcta gagccgacgt gggcatggcg atgggtgggt 8881 taggttctga tgctgcgata gaaactgcgg atgtggtgct gatgacggat gcaccctcaa 8941 aggtggcgga agcaatacaa gttggcagaa agactcatca aattgtttgg cagaatatca 9001 ttcttgcttt ggtagtcaaa ggcgtgttta tcgctttagg aattttcggt ttagcaacga 9061 tgtgggaggc ggtttttgcg gatgtgggcg tagcgctact ggctattttt aatgctacca 9121 gagtcctgag ataggaggtt gatactcatg actcgacgcg ccctgctgtt agtgaaccgt 9181 cacgctcgaa gaggacagca tagtttaccc caggcagtaa aacaactacg agagctaggt 9241 tttgacctga ttgaagagga caccgagaaa ccgaatcacc tgagagagat gattcatcgt 9301 caccgcaatc aagttgactt ggtgattatt gggggagggg atggaacact gaatgctgct 9361 gtggatgcac tggtagaaac tcaacttcct ttggggatct taccactggg aaccgccaac 
9421 gatctcgccc gcactttggg gattcctcca actctacccg acgcctgtca gactattgca 9481 aagggtgaat tacagcgcat tgacctcggt tgggtgaatg gaaagcactt tttcaatgtt 9541 gccagtttgg ggttgagtgt ccaaattacg gagcggctca atcaagagct gaagcgccgt 9601 tggggagtgc tagcctatgc agctacagct ttaggagtga tttggcaagc ccgactcttt 9661 cgagcggaca tccgtctcga cggcgaatta attagagtta aaactgtgca aattgctgta 9721 ggaaatggtc gatactacgg tggcggcatg ataatttgtg agaatgcagc gatcaatgac 9781 caaaaactgc atctgtccag tataaatgtc cgtcactggt ggcagatcgt tgccttactg 9841 ccagcaatga aacaggggcg acataaagcg tggtcgggta tccacactga caaatgtcaa 9901 gaaattgaag tctacactta ccgactgcac gctatcaaca cggatggtga attaactacc 9961 aacacacctg ctaaattccg tctgatcccc aaagctttat cagtgctgat acccagaaaa 10021 tattcttaag gatgtctcaa tgattatgca gatccctaga ggctttacat gccctcaatg 10081 agttaacgca gtagcgtcag gtaaagtttt tgaatcgata ttttcaaaaa gagattagga 10141 acgaggtaac gtaaagtgac tattgctccg aagtttgaac acaatcatca aagcgacaca 10201 acagctgtgt taagtaccca aaaggtacgg cgactttgga ttgtcttggg actgcggagt 10261 agccttttgc tgatggaact ggctgctggg ttctggactc gcagcctttc actgttagcg 10321 atctccgggc atatgctctc ggatatcttt actctgggat tagcgctctt tgcagcaaag 10381 ctgtcccagc gtccagctgt gggtcaggcg accttcggct atcgacgggc agaaattttg 10441 gtggcgctgc tgaatggatt gaccctaatc gcgatcgcta ccctaattgc ctggaaagca 10501 gttggtcgat tccaatcccc agagccatta tcaggtttac ccacgttgat tgtggcggca 10561 ctgggcttgg cggttaacag cttgctcatc agcttgctgt actttgaaag tcaccatgac 10621 ctgaatttgc gaggtgcttt ccttcatgtg gtagccgacg ccgctggctt tttgggcgtg 10681 attttggctg ctagtatggt ttattggttg aattggctgt gggcagaccc agtcgccagt 10741 ttgttcgtgg caagcctaat gagcctcagt gctttccccc tcgtttggga tagcttaagg 10801 gtgctgatgg aacttgcgcc ccaatctact gacgtagctc ttgttgaagc tgccctaagt 10861 tcctttgcag gtgtccgaca agtagaaatg ctgcatatct ggacgattac gtctggacaa 10921 gtagctttgt gcgcccatct tgtcgtagaa tctatgagcg cctttgagcg ggataagtta 10981 ttagagcagt tgcaaacccg cttaactcaa gaatttaagg tttgcgagtc tactctacag 11041 atgacagccc tcaatgaagg cgattctgct cctttcacgc tacgcgatcg 
cactgactac 11101 taacaaatct ttgctagaga tgaattgcca gggacaggga ttatcaaagc aacaaaacta 11161 aaaatcactg actcatgaac acgtatctta acaaacataa agggggtact cctggctagc 11221 aagctcctaa atgtgttggc ttacaccttg attccggtag cagccgctac cgtgggtgga 11281 gggatcgctg cgtggcgaac gccgggaccg aagctaaaaa gcgttgtcca gcattttgca 11341 gcaggcgtgg tgtttgccgc cgctgctggt gagttgttgc cagacttagt tcatgaaaag 11401 tcgctacctg caactattat tggcggcgca tttggagtag ctgtaatgct tgccgtgaaa 11461 caattggtca agaaagcctc tggatccatc agtctaattg caacggtggg cgtggatgtg 11521 ctcattgacg gtctgataat tggtattggg ttcgctgcgg gagccaaaga ggggatactg 11581 ctgacgattg ccctaacaat tgaaattttg ttcctcagtt tgtccgtatc gacgacgctc 11641 agccaagcca atgcatcgcg tactcgggta atgataacga ccttggggat tgccttgctg 11701 ctcccactag gcgcagtgat aggctcagcc ttgcttgggg gactttcagg tttctctttg 11761 gcgacgttcc tcgcgttcgg attagtagca ctgctgtact tggtaacaga ggaactactt 11821 gtcgaggcac atgaaatacc cgatactccg ttgaccgtgg caatattctt tgtgggtttc 11881 ttggtgttaa tcgtaattga ggagatgctg caatagtttg acgaagcgta ccttactatc 11941 cgaacgtttt aagttaactt tggttaccca ttggtaacaa atgctccctc cattgcagga 12001 gttcgggacg ttctaccaat gccttaagat atatcccaag gctcccattc aaatctcttg 12061 gcagtgtgtg accacagcta ggacatttga attttttgtt tccacccaat ttggtgtgaa 12121 tatgtccaca ctttgtacag gtaattgaag tgtaagcttc tgacacttca attacctgtg 12181 tgttccattc ttgtgcttta aactttagat attccttgaa tctataatgc gaccaactga 12241 gcattgctct tactgacttt gatctgagct ttatagactt tttcactacc cttacgggtt 12301 cgccagtcgc tttatgccgg ggaacccgtc caccgcgctg gctcaccatt tctgatgttt 12361 caaagctagg tagtaaaatc acatcaaatt gagagcagat ataatgagca acttgcttat 12421 gtgcttcatc tacaatatta cgaatgcgtt ctctcattct tctataggac tcctatttga 12481 tttttgaaaa aactccgtac atcctaagag tgggaaaatc aaaggtggtg tgtcgaatga 12541 accactcaaa catccacttc aaaagcgaaa aagaaggaat caaggcacag aaacgtcgaa 12601 cccatgtttt ggcttgcaac caaatatcct caataggatt ttgttctggg caattgggag 12661 caaagcgaac gcagtgtatt ttccattgat cggttggcaa accttgattc acctcgctca 12721 agaaacctcg aacctccttg gaacggtgat 
aactcgcacc atcccaaaaa attagcaatc 12781 gctgattagg ggactgagtt agcagatatt gtaggtaatc aatggtattt tctgaatttc 12841 ca // LOCUS NODE_2652_length_12792_cov_4.931303 12792 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12792) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12792) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12792 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..389) /locus_tag="DP116_21275" CDS complement(<1..389) /locus_tag="DP116_21275" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21275" /translation="MTSSNRYSAKFLFFGLWIVSITIWFISLFWLWLLMDFQFVMWTL EASEWPSRIVLWLRPLIWLIIALVPPGLCFYLYRKHRVFWVIPILLVIGVTLLKVSFD FGRIDTQGTIFDTPLPKGEGILHSSSEL" gene 489..893 /locus_tag="DP116_21280" CDS 489..893 /locus_tag="DP116_21280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21280" /translation="MTTQTLTLQIPEILYQRLVNTAHAQRRPIEEVIVHALQVGSPPE WDDVPEEFQADLAALDKLDDNSLWQIVRSCKTADQMERYNFLLLRNSSGNITDAEQLE LIKLRHEADRFILCKAQAAVLLRWRGHHVPTP" gene 1033..1239 /locus_tag="DP116_21285" CDS 1033..1239 /locus_tag="DP116_21285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020734543.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21285" /translation="MFTGETTPLFNPRTQRWSEHFTWSSDATKVEGLTTIGRATIVCL RMNNPVIVVARRRWTIIGWHPPDD" gene complement(1425..1937) /locus_tag="DP116_21290" CDS complement(1425..1937) /locus_tag="DP116_21290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008185524.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cupin domain-containing protein" /protein_id="PRJNA477356:DP116_21290" /translation="MKLPQQKTFKITKASVILPLAILAFGSVVVNSQESPSPNTYTQS VSREVLASGYPTQDQKQILELVRYTIAPRTKLPTHIHPGMQIERVEAGTLTYTVVQGE AKVTKANGTQLILQKGKTIQLTVGDSLIEPAGMVHYGENQTNKPIILLSASLFDANQP KAILTNPENR" gene complement(2102..3355) /locus_tag="DP116_21295" CDS complement(2102..3355) /locus_tag="DP116_21295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316226.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21295" /translation="MNSQEPHSRLQPQEEWMDPSTNPKDWSWSFWPVVPLYPYGRRRT LCREIVKDTIWTFDQLQGILYTIVPIRMTVVKLWAGGLLVYAPVAPTTECVRLVNELV AVHGEVKYIILPTSSGLEHKVFVGPFARRFPSAQVFVAPHQWSLPFNLPLSWLGFPQK RTQVLPEDSRQAPFADEFDYAVLDINLGRGSFAEVAVFHRQSRTLLVTDSVLSLPEEP PAIIQLDPYPLLFHARDNTFEVIEDNEANRRKGWQRICLFAIYFRPSALEVTGLVQTF RDTFQAPNHSPKAYFGLFPFRWKQNWKQSFDTLRGHGRPFVAPILQILILPQAPRQVL HWADKVATWDFGRIISCHFDSPIEAVPHEFRRAFAFLEKKPLGSENSFGSSSQPLVEE DFRFIKELEANLVRRGIATPAKEKV" gene complement(3835..5361) /locus_tag="DP116_21300" CDS complement(3835..5361) /locus_tag="DP116_21300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490604.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferredoxin:protochlorophyllide reductase (ATP-dependent) subunit B" /protein_id="PRJNA477356:DP116_21300" /translation="MKLAYWMYAGPAHIGTLRVASSFKNVHAIMHAPLGDDYFNVMRS MLERERNFTPVTASVVDRNVLARGSQEKVVDNITRKDAEEHPDLIVLTPTCTSSILQE DLANFVERASLEAKADVLLADVNHYRVNELQAADRTLQQIVQYYIEKARKKGQLPEGK TEKPSVNIIGISTLGFHNQHDCTELKRLMADLGIEVNAVIPEGASVHELKNLPRAWFN LVPYRELGVMAARYLEEQFGIPTVDITPMGVVETARCIRKIQQIINAQGADVDYEEFI NQQTLYVSQAAWFSRSIDCQNLTGKKAVVFGDSTHAAAMTKILAREMGIHVVWAGTYC KYDAEWFSQQVSEYCDEVLITEDHGAIGDAIARVEPSAIFGTQMERHVGKRLDIPCGV IAAPIHIQNFPIGYKPFMGYEGTNQIADLVYNSFTLGMEDHLLEIFGGHDTKEVITKG ISADSDLAWTKEAQAELNKVPGFVRGKVKRNTEKFARDRNFSQITLEVMYAAKEAVGA " gene complement(5915..6457) /locus_tag="DP116_21305" CDS complement(5915..6457) /locus_tag="DP116_21305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651780.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21305" /translation="MDRNLATIYLSVLVGLLLIAVVSIFRQVFKSRRVEGSMSRLRKK LTKESGTTQEYYELASIYSEKKLFSQAVTLFQKALKAAEEDRVDGEEEQQELAYIYNG LGYAYFAQEQYDIAIRQYKEALKIKPDYVVGLNNLGHAYERKKLNAQALQAYEEALKL QPTNTTAKRRSDSLRRLVSA" gene complement(6574..7395) /locus_tag="DP116_21310" CDS complement(6574..7395) /locus_tag="DP116_21310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997579.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_21310" /translation="MDIGCNRWKAACRLHLLISAAIFFLLTFSLVIADTQLSASAAEM PEIQRRGYIRIAVKDNLRPLGFRDTNGNLQGLEIDLAKALAVDLVGKADAVKLQPVAN GDRLSAVLDHKVDLAIARVTATASRSRLVSFSVPYYFDGTVLVTKSTSFQRLSDLAKR KVAVLNNSSTIADVRYYIPNAELVGVNSYQAAFALLENNAADAFAADASILSGWVQQY PQYRLLSTKLSTQPLCVVMPKGLQYDPLRRQVNQAIARYINSGWLKQRATYWGLR" gene complement(7408..7764) /locus_tag="DP116_21315" CDS complement(7408..7764) /locus_tag="DP116_21315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197431.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L20" /protein_id="PRJNA477356:DP116_21315" /translation="MTRVKRGNVARKRRKKILKLAKGFRGSHSTLFRTANQRVMKALR NAYADRKKRKRDFRRLWITRINAAARQHGMSYSQLIGNLKKADVQLNRKMLAQLAVLD PASFGKVAELASQSKG" gene complement(7787..7984) /gene="rpmI" /locus_tag="DP116_21320" CDS complement(7787..7984) /gene="rpmI" /locus_tag="DP116_21320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L35" /protein_id="PRJNA477356:DP116_21320" /translation="MPKLKTRKAAAKRFRATGSGKIVRRKAFKNHLLQHKTSNKKRDM SKMAVVDERDAENVRLMLPYL" gene complement(8147..9721) /locus_tag="DP116_21325" CDS complement(8147..9721) /locus_tag="DP116_21325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308838.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="mechanosensitive ion channel family protein" /protein_id="PRJNA477356:DP116_21325" /translation="MNILIILAEVVFLILIFSLFNLLIGIIFRQFNKVSWLQGRTANI TFLRRNISRLLILICVVLCLALIAMNGVIIYRGGNVKEFQLNLVRSIPTQFWLNFLTA SLKSVSLLMLVKFSIPPLQRGVDWVCDYAKKADQIKANDESTEAFFKVLKQTITNTIW ISSAILCAKFFYLPEVVSKYLYIALKIYIIVTVGLLIVKAVATIVDTLDALSLKYSSS NNLLRFYERLRHLIPLFKKCLEYVLYVGIVNLVVPEIEFIAWISAYTPKIVQIIGIFF ISNVLIEVAYFILDEFYLRTTDSDDSNRQKRLTLIPLMRSFTKYFIYFTAGVTILKLI GIDPAPILAGAGIVGIAVGLGAQNLINDVVCGFLILFENYYLVGDYVEVGKVEERNVQ GMVEAIELRTTHVRHPDGQLQIIRNGDIGSIINYSKQYIYARVEVSVSYDSNLDHVYR VVEKVGQQLKADEHDVLEPTRVAGIEHFGEHNLLLLTLTKVKPGKHIHIQRVLRKILK ETFSQEEIELCGFSKD" gene complement(10353..11024) /locus_tag="DP116_21330" CDS complement(10353..11024) /locus_tag="DP116_21330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017299523.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heme-copper oxidase subunit III" /protein_id="PRJNA477356:DP116_21330" /translation="MENPIHLLEEILDESPIHFYKLRRYLPIWLHRFLPIGGGKADDE HGKTLFGFTVFLLSESIVFLSFFFTYIALRLTTTNWLPPGVSGPELSSFTIFNTLVLL SSSLVIQSAENALKRRQIRKFRLLWLITSAMGTYFLIAQAIEWSHLNFGLTTGVVGGT FYVLTGFHGLHVLVGVLLQILMVIRSFIRGNYNKGHFGVSATTLFWHFVDVIWVILFS LLYIW" gene complement(11050..12711) /gene="ctaD" /locus_tag="DP116_21335" CDS complement(11050..12711) /gene="ctaD" /locus_tag="DP116_21335" /EC_number="1.9.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007355017.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome c oxidase subunit I" /protein_id="PRJNA477356:DP116_21335" /translation="MTNNSIEDMGRAGESEKPNTNWRDYLGFSKDHKVIGVQYMVTTF IFFLIGGLLAMIIRGELVTPESNLVDRSLYNGLFTLHGTVMIFLWIIPFMAGLANYVV PLMIGARDMAFPLLNAIAFWIIPPAGLLLMSSFLLPGGPAQAGWWSYPPISIQNLSGR LLNGEFIWILSVILLGISSILGGVNFITTIVWMRAPGMTFFRMPIFVWTVLSAQMLQL FCLPSLTGALILLFFDLSFGTNFFKPSQNGDPIIYQHLFWFYSHPAVYVMALPAFGIF SEILPAFSRNPLFGYRSVAIASFGIALVSIFVWVHHMFASATPDWMRILFMVSSMLVA VPTGVKVFAWTATVWNGRLHLLTPMLFALGGVVMFIFGGITGVMLSSVPFDIHVNNTY FVVGHFHYIVHNTITMAIFAAIYFWFPKITGRMYAEGWGKVHFLLTFIGANLTFFPMH AVGLQGMLRRVSSYDPRYQGWNVIASLGGFLLGMATLPFIANMVGSLLQGSKASDNPW HATGLEWKTSSPPPTENFEEIPVVNEPPYNYNNRSEPTPEAVIQE" BASE COUNT 3690 a 2815 c 2818 g 3469 t ORIGIN 1 agttctgacg atgaatgtag aatcccctcg cctttaggca ggggagtgtc aaaaatagtt 61 ccctgtgtgt caatgcgacc gaaatcaaag cttaccttga gcagtgtaac cccaataact 121 agaaggatag gaataaccca gaatactctg tgcttccgat agaggtaaaa gcacaatcca 181 ggcggtacca aggcaataat tagccaaatc aaaggtcgaa gccaaagtac aattcgactg 241 ggccactcac ttgcttccag cgtccacatc acaaactgaa agtccattaa taaccataac 301 cagaacagac taatgaacca aatagtgata gacacaatcc acagtccaaa gaaaagaaat 361 tttgcagaat aacgattaga tgaagtcata gtctccctca atggtgatgg cggcggatgc 421 cgtagctaag tttgaactac gctaaaataa agacggttta atcaaggcat gggtagggaa 481 ttgaactaat gacaacacag acactaactc ttcaaatccc agaaatactt tatcagcgtt 541 tagtcaacac tgcccacgcg caacgtcgcc cgattgaaga ggtaattgtt catgctttac 601 aggtgggtag tcctccggaa tgggatgatg taccagagga atttcaagct gacctcgcgg 661 ctttagataa actagatgac aatagcttgt ggcaaatagt ccgcagttgc aaaacagcag 721 accaaatgga gcgatacaac tttctgctct tgcgtaactc tagtggtaat attacagatg 781 cagaacaatt ggaactaata aaactgcgtc acgaagctga ccgttttatc ctatgcaaag 841 cccaagctgc tgtgttactc cgttggcgag gacatcatgt gccgactcct taaatcagtt 901 atcagttatc agttatcagt taccaggcgg gctggtgggg gatgcgctcc ccgccaccaa 961 cgccttccac caattggtgg gggacgtaag acccaaggat ttaactgata actgtttact 1021 gttcactgtt cactgttcac tggggaaaca acacctctat ttaatccgcg tacacagaga 1081 
tggtctgagc attttacttg gagttctgat gctacaaaag ttgaaggttt aactactatt 1141 ggtagggcta caattgtttg cctgcgaatg aataatccag taatagttgt tgcccgacga 1201 cgttggacaa ttattggctg gcatcctcct gatgattgac tagaatagaa tttaggcgta 1261 gggtgcgcta attaggagtg ggatctgcta tggactcaca gaagttgaca gcggatagag 1321 tagctaatgt tttatacctt ctctttaagt atttcagcgc ttgtttataa ttaagcaccc 1381 aagacaatta ttgtggaagg attgagccta aagttgcggc aatttcatct gttttcagga 1441 ttggttaaaa tagcttttgg ttgattagca tcaaatagag aagcagacaa tagaataatc 1501 ggtttgtttg tttggttttc tccataatga accattccag caggttcaat taaagaatct 1561 cctactgtaa gctgtatagt tttccctttt tgaagaatca attgtgtgcc attagctttc 1621 gttactttag cctctccttg cacaacagta taagttaaag ttcccgcctc tactcgttca 1681 atctgcattc ccggatgaat atgagtaggg agttttgttc ttggtgcgat tgtatagcgt 1741 acaagctcaa gaatctgctt ttgatcttga gtcggataac cactcgctaa aacttcacgg 1801 ctaacagatt gagtgtaagt attgggtgag gggctttctt ggctgttaac aactacactg 1861 ccaaaagcca ggatagctaa ggggagaatc acagacgctt ttgtgatctt aaaagttttt 1921 tgttggggta atttcataaa tattaatttt ccttaacatc tctgatacaa atggcactaa 1981 ggtaagctga aacacttgtg gtttaagcag tttgcaattg cttacgtaat tcatgccggg 2041 gtttcgtttg attcgggcgt aactatagtt ttgtgttttt tcatttttat agatgtatca 2101 atcaaacttt ctctttagct ggtgtggcga ttccgcgccg aaccagattt gcctcaagtt 2161 ccttgatgaa tctaaagtct tcctccacca gaggctgact tgagctgcca aatgagttct 2221 cacttcccaa gggtttcttc tcaagaaaag caaaggcccg ccggaattca tgtggaacag 2281 cctcaattgg cgagtcaaag tgacaggaaa taattcgccc aaaatcccaa gttgcaactt 2341 tgtcagccca gtggagtact tgtctcggtg cttggggaag aatgaggatt tgcaaaattg 2401 gtgctacaaa tggacgccca tgtcctcgta gggtatcaaa tgactgcttc cagttctgtt 2461 tccatcgaaa cggaaacaaa ccaaagtacg cctttggtga gtggtttggt gcttgaaaag 2521 tatcgcgaaa tgtctgcacc agtccagtga cctccaaggc acttggacga aagtaaattg 2581 caaataaaca tatccgttgc catcccttgc ggcggtttgc ttcattatcc tcaatgactt 2641 caaaggtatt gtctctagcg tgaaacagca agggatatgg atctaattgg atgatagctg 2701 gtggttcttc tggcagagat aggacagagt cggttacgag tagggtgcgc gattgcctgt 2761 gaaaaaccgc 
aacttccgca aaagaacccc gtcccaagtt gatgtccaac actgcatagt 2821 caaactcatc agcaaagggc gcttgtctgc tgtcctctgg aagcacttga gttcgttttt 2881 ggggaaagcc aagccaactt aacggcaggt taaacggcaa actccactga tgcggggcaa 2941 caaaaacctg tgcactggga aagcgtctgg caaaaggacc aacgaaaact ttgtgctcca 3001 aaccagaact cgttggcaga atgatatact taacttcacc atgaacagcc accaactcgt 3061 tcaccaagcg cacacactca gtcgtcggag caacaggcgc atagacaagt agaccccctg 3121 cccagagctt tacgacggtc atacgaatcg gtacgatagt gtagaggatg ccttgaagtt 3181 gatcgaatgt ccagatagtg tccttaacta tttctctaca cagtgtccgt cgcctaccgt 3241 aggggtagag tggtacaaca ggccaaaagg accatgacca atctttcgga ttggtgcttg 3301 gatccatcca ttcctcttgc ggctgtaacc ttgagtgagg ttcctgtgag ttcatcgtcg 3361 ctcgacctca gaagtgctcc ttttcaattt ctaacattgg aaaggttgtt gttgatgttg 3421 gaagaagtta ttgaggacaa aatatagcga ttcttacttg catactacac acacgcacgc 3481 ggtaggggca cggcattgcc catacgtgtc aacttaacgt tcaagcctta attccacgtt 3541 gatttaacct cacccccctt cgggttcacc agtcgcctct gtcgggaaag ccgtcattcg 3601 cgctggattc accgtaaagc tacgcttaac gtcccctctc cttagtaagg agaggggcgg 3661 ttttggcgta agacaaaacc ggggtgaggt agagcgacaa ttgtgggcaa gtagaataaa 3721 gcatctcggt aacctaagaa caagagtttc agcttaagtt gacaccaatg ggctagcctt 3781 tacccctaca aataacgtgt attgtaccca attgagaact cgcgctttta aaaactatgc 3841 acccacagct tctttcgcag cgtacatcac ttccagagta atttgactga aattgcgatc 3901 gcgagcaaat ttctcggtgt tgcgcttgac tttaccacgt acaaagccag gaaccttgtt 3961 caattctgct tgagcttctt tagtccatgc caaatcggaa tcagcagaaa tgcccttggt 4021 aataacttcc ttggtatcat gtccgccaaa gatttccaag aggtggtctt ccattcccaa 4081 ggtgaaggaa ttataaacca aatctgcaat ttgatttgtc ccttcgtaac ccatgaatgg 4141 tttgtaacca atggggaagt tctggatgtg aatgggtgct gcaatcacac cgcaggggat 4201 atccaagcgt ttaccaacgt ggcgttccat ttgggtaccg aagatagcag agggttcgac 4261 gcgggcgatc gcatccccaa ttgcaccatg atcctcggtg atcagcactt catcacaata 4321 ctcgctcacc tgctggctga accactccgc atcgtacttg cagtaagttc cagcccagac 4381 aacgtgaatg cccatttctc gtgccagaat cttagtcata gcagcggcgt gagtgctatc 4441 accaaacaca acggctttct 
taccagtcaa gttttgacag tcaatcgaac gagaaaacca 4501 tgcagcttgt gatacataca gagtttgctg gttgataaac tcttcgtaat caacatccgc 4561 tccttgagca ttaattatct gctgaatttt acggatacaa cgagcagttt ctacgacacc 4621 cattggtgta atatctacgg ttgggattcc gaattgttct tcaaggtaac gagctgccat 4681 aacaccaagt tcccggtaag ggacaaggtt aaaccaggcg cggggcaggt tcttcaactc 4741 gtgaacggaa gcaccttctg gaatcacagc attcacctca atacccaagt cagccatcaa 4801 ccgcttgagt tctgtgcagt cgtgctggtt atggaaaccg agggtagaaa taccgatgat 4861 gttgacagaa ggcttttcgg ttttgccttc aggcagttga cctttcttgc gggctttttc 4921 gatgtagtat tggacgattt gttggagagt gcgatcggcg gcttggagtt cattaacgcg 4981 gtagtggttc acatccgcca gcagcacatc ggcttttgct tccagagatg ctctttccac 5041 aaagtttgct aagtcttctt gcaaaatgct ggaagtgcag gtgggagtta acacaatcaa 5101 atctgggtgt tcttccgcgt ctttgcgggt gatattgtcc accacttttt cttgtgaacc 5161 gcgtgccaaa acgttgcgat caacgacact ggctgtcact ggcgtgaagt tcctctcccg 5221 ttctagcatg gaacgcataa cgttgaagta gtcatcaccc aagggcgcgt gcataatagc 5281 atgaacattt ttaaaggaac tggcaacccg cagagttcca atgtgggccg gacctgcata 5341 catccagtaa gccaatttca tctttagtgt tctccctttt attgaaacga actatatccg 5401 ataataggtg gacgtatagc gtgcttgttt ttttaaaccg tttctgattt tcgcagaact 5461 tgtagcccct aagaaatgag atttaacaga attgtaatag aggtgcgatc gcccgtatat 5521 gcccagaggg gtttcttgaa ggaaacccct ctccccaaat ctccgatttg gggcaaccaa 5581 gcgttcgcgc agcgtgtcct ctggactcag cttggttggg gcatagggcg cgctacgcga 5641 tcgctgaaac ctccttacta attaggattg ggcttctgac ttaaacaaca tttatgttgt 5701 caaagtgata cgaatcttaa cttttaggga aatttcttat aactagggac tgtggctgta 5761 agccagatat tttgagcaaa cacctacttg attgttatcg aacagttttt acggctattg 5821 atggatgttt gcctctacga aaggtaatat tggagaggtt ttgcctcccc aagctattcc 5881 caagcaaagc gcgggaacga gaaatactcg taaatcatgc agatactaag cgccgcaaag 5941 agtcagaacg gcgttttgcc gtggtgttag ttggctgaag tttcagtgct tcttcataag 6001 cttgcagggc ttgagcattt aattttttcc gctcgtatgc atgaccaaga ttatttagcc 6061 ctaccacata atcaggtttg attttgaggg cttctttata ctgacgaatg gcgatgtcat 6121 actgttcttg agcaaagtaa gcatagccta 
atccattgta aatataggct agttcctgtt 6181 gttcttcttc accatcaact ctatcttctt cagcggcttt gagagctttt tgaaagagag 6241 ttaccgcctg ggaaaatagt tttttctcag aataaatact ggctaactcg taatattctt 6301 gagtggtacc gctttctttg gtcagttttt ttcgcaacct tgacatagaa ccttcaactc 6361 tacgactttt gaagacctgg cggaaaatac tgacaactgc aattaagagc agacccacta 6421 aaactgacag ataaatagtt gctagattgc gatccattgt ctttagtaaa agtttcttct 6481 tatctatata tcagcttata tgaaacaatg aaaattatga attatgaata ctaaaaaatt 6541 tcatttctaa tttctaactg agaaatttct aatttagcgt aatccccaat atgtagcgcg 6601 ttgcttaagc cacccagaat taatataacg ggcaatggct tggttaacct gtcttcgtaa 6661 tggatcgtac tgcaatccct tgggcatgac gacacataac ggttgggttg atagcttagt 6721 tgacagtaag cgatattggg gatattgttg cacccaacca cttaaaatac tagcatctgc 6781 ggcaaaggca tcagctgcgt tattttctaa cagtgcgaac gctgcttgat aagaatttac 6841 tcccactaat tcggcatttg gtatataata gcgcacatcg gcaatggtgc tggaattgtt 6901 gaggacggca actttccgtt ttgccaaatc actcagtcgt tgaaatgatg tgctttttgt 6961 gactaataca gtgccatcaa agtagtaagg aacactgaag cttactaaac gagaacgtga 7021 ggcagttgct gtgactctgg cgatcgccag atcaacttta tgatccaaga ctgcggatag 7081 gcgatcgcca ttggctacag gttgtaattt caccgcatct gctttgccca ctaagtctac 7141 tgccaatgcc ttcgctaaat caatttctaa gccttgtaag ttaccgtttg tatctctaaa 7201 tcctaacgga cgcaagttat ctttgacggc aatcctgata tagccccgcc gctgaatttc 7261 gggcatttct gcggcagatg ctgataattg tgtatctgct atgacaaggg aaaaagttaa 7321 gaggaaaaaa atggcagcgg atatgagcag atgtaaccga caagctgctt tccatctgtt 7381 acatccgata tccatctgtt tccaccttta tcctttagat tgacttgcca attcagcgac 7441 tttaccgaag cttgctggat cgaggactgc taattgtgcc aacattttgc ggttgagttg 7501 gacatcagct tttttcagat tcccgatcag ctggctgtaa ctcattccat gttggcgtgc 7561 tgcagcgttg atgcgggtga tccagaggcg gcggaaatcg cgtttgcgtt ttttgcgatc 7621 ggcataggcg ttccgcagcg ccttcatgac tctttgattc gccgttctaa acagagttga 7681 gtgagaaccg cgaaaacctt tggcgagttt gagaattttt ttgcggcgtt tgcgagcaac 7741 attaccgcgt tttacccgtg tcatattgtt ttagttcctg aacaaattac aaataaggga 7801 gcatgaggcg cacgttttcg gcatcgcgct catcgacaac 
tgccattttc gacatgtcac 7861 gtttcttatt ggaggttttg tgctgtagca ggtggttttt gaacgctttg cgacgtacaa 7921 ttttaccgct acctgtagcg cggaatctct ttgccgcagc tttacgggtt ttgagtttag 7981 gcatgggctg gctttgattg gacacaatcc atcattataa actaagaatt ttgaaacgaa 8041 cggcatcagg cgctgggaaa cgttttttgc aaaattggga tatttcggta ctgccaactc 8101 aactaaaatt acgattgttt acagctgttt tctgctttgg tgggcatcaa tccttggaaa 8161 aaccacaaag ttcaatttct tcctggctaa aggtttcctt taagatcttg cgcagaacac 8221 gctggatatg aatgtgtttt ccaggtttga ccttcgtcag cgtcagaagc aacaggttat 8281 gctccccaaa atgttctatt ccagcgactc gcgtgggttc gagaacatcg tgctcatccg 8341 cctttaactg ctgtcctacc ttctcaacga ccctgtagac atgatctaaa ttggagtcat 8401 aggaaacgct aacttccacc cttgcatata tatactgctt agagtagttg ataattgatc 8461 caatatcccc attgcgaata atctgtaatt gaccatcagg atgtcggaca tgagttgttc 8521 tcaactcaat cgcctctacc attccctgaa catttctctc ttccactttc ccaacttcga 8581 catagtcacc caccaagtaa tagttttcaa acagaatcaa aaatccacaa acaacatcat 8641 taatgaggtt ttgtgcccca agaccaactg ctatacccac aatccctgca cctgctaaga 8701 taggcgcagg atcaatgcca ataagcttga gtatagtcac tccagcagtg aagtagatga 8761 aatatttcgt gaaactccgc attaagggaa taagcgtcag ccgtttctga cggtttgagt 8821 catctgaatc tgtagttctt aggtagaact catcaaggat aaagtaagcg acttcaatca 8881 aaacattgct gataaagaaa atcccaataa tttgaacaat tttgggggta taagcactta 8941 tccaagcgat aaactctatt tctggaacaa cgaggtttac tatgccgaca taaaggacgt 9001 attccaaaca ttttttgaag agtggaatta gatggcgtaa acgctcgtag aaacgcagca 9061 gattgttaga actggagtat ttaagactaa gggcatcaag agtatcaacg atagtagcaa 9121 cagctttgac aatgagtaaa ccaactgtga cgatgatata gatttttaag gcaatgtaaa 9181 gatattttga aacgacttct gggaggtaga aaaatttagc gcatagaata gcagatgata 9241 tccaaatagt attagtgata gtttgtttta aaaccttaaa aaaagcttca gtactctcat 9301 cattagcctt gatttgatca gctttcttgg cgtaatcaca aacccagtct acaccccgct 9361 gtaatggtgg tatgctaaac ttaaccagca tcagcaggct cacacttttc aagctcgccg 9421 tcaagaaatt gagccaaaat tgagttggaa tactgcgaac taagttaagt tggaattcct 9481 tgacgtttcc gcctcgataa atgatcaccc cattcatagc aatcagcgcc 
aaacacagca 9541 ccacacaaat caaaatcaaa agtctgctga tatttcggcg caggaaggtg atattcgcag 9601 ttctcccttg aagccaagaa accttattaa actgcctgaa gattattcca atcagcaagt 9661 taaataggga aaaaatgagt attaaaaaga cgacttcagc caggataatt agtatattca 9721 tcttttgagc atcaatgtca aacaattgag atagtgtgat taatattatc aactgaagcc 9781 agccctgcca agattaagta cttacgccaa agactgcaaa aacgctttca caaaaaatat 9841 acttgttcaa aatccaattt actttatgca attagtcaag aactagtcgc tagtcaacta 9901 tcgtattttg attgtgacca gatttaacag ttatcagtca tcagttatca gtcatcagtc 9961 atcaagcaat tgataactgt tcattgttca ctgttttaac agcttctacc atgatgctct 10021 ccacaactag ggtgttaaat aaagctgtat agttggtggt taaaatacgt attcgactat 10081 gtgttttttt atgatcatat cagcaacatg acgaggatgc aaggacagat aagacagtta 10141 agcagtacag ttaaaaaaaa taattaggat agttaagata ctggacttta gactaaatga 10201 gctaaataat acgcttggca tgcatgtatt taacaatact tttcactgag ggcagcgatg 10261 ttcatactgc ccctacgatc gcgatcatcg cctattggga aaaaactttg ataattattg 10321 tgctagatac actctactag ctatatccaa gcttaccaaa tataaagaag tgagaacaga 10381 ataacccaaa tcacatcaac aaagtgccag aacaaagttg ttgcactcac accaaagtga 10441 cccttattat agttaccgcg aataaaagaa cgaatcacca tcagtatctg tagaagcaca 10501 cccacgagaa cgtgcaaacc gtggaagcca gtcagcacgt agaatgtccc cccaacaact 10561 cccgtagtca gcccgaagtt gaggtgactc cactcaatcg cttgggcgat caagaagtag 10621 gttcccatag ctgaagtgat cagccataac aaacgaaatt tgcgaatttg acgacgcttc 10681 agagcatttt ctgccgattg aatgactaag ctactggaaa ggagcactaa tgtgttaaaa 10741 atcgtaaaac tagataattc tggtccggac acaccaggtg gtagccagtt ggtagtcgtc 10801 aaccgcagag caatgtatgt aaagaaaaaa cttaagaaga cgatgctttc cgacagcagg 10861 aagacggtga aaccaaacaa cgttttgcca tgttcatcat cagcttttcc cccacctatc 10921 ggtaggaacc ggtgcagcca aatcggtaga tatcgccgca acttatagaa gtggatagga 10981 ctttcatcta atatttcctc taataaatga ataggatttt ccatgattgc ttcttggatg 11041 aaattgcagc tattcttgga tgacagcttc tggtgtcggt tcggatctat tgttgtagtt 11101 gtaaggcggt tcgtttacaa ctggaatttc ctcaaagttc tctgttggag gcggtgaaga 11161 agttttccac tccagccctg tagcatgcca aggattatcg 
gatgctttag agccttgtaa 11221 taaagaacct accatatttg caataaaggg taacgtcgcc atcccaagca agaacccccc 11281 aaggctggcg atgacgttcc atccttgata gcgtggatcg taggaggaaa ctcgacgcag 11341 cataccttgt aaacctacag catgcatggg aaagaaagtc aagttggcac caatgaacgt 11401 taataagaaa tgcaccttgc cccagccctc ggcgtacatc cgcccagtta ttttggggaa 11461 ccaaaagtaa atcgcggcaa agattgccat cgtgatggtg ttgtggacga tgtagtggaa 11521 gtgacccacc acgaagtaag tgttgttgac gtgaatatca aatggcactg aactgagcat 11581 gacaccagtg ataccaccga aaataaacat caccacacct cccaaggcaa acagcatcgg 11641 tgtcaaaaga tgcagcctgc cattccagac ggtagcagtc caagcgaaca ctttaacgcc 11701 agtgggtacg gcgaccaaca ttgaggaaac catgaagagt atccgcatcc agtcgggtgt 11761 ggcactggcg aacatatggt gtacccagac gaaaatgcta actaaagcaa ttccaaagga 11821 ggcgatcgcc actgaccgat aaccaaacaa aggattacgg gaaaacgccg gaagaatctc 11881 cgagaaaata ccgaaagctg gtagcgccat aacgtaaacc gctggatggg aatagaacca 11941 aaacaggtgc tggtagatga ttggatcccc attctgcgac ggtttgaaga agtttgtccc 12001 gaaactgagg tcaaaaaata gtagtatcaa tgcaccagtg agagagggca ggcaaaagag 12061 ttggagcatt tgagcactga ggactgtcca gacgaatatt ggcatacgga agaacgtcat 12121 ccctggggcg cgcatccaaa caatggtggt gataaagttg actcccccta aaattgatga 12181 gatgcccagc agtatgacac tcaaaatcca aataaattcg ccgttgagta acctacccga 12241 taagttctgg atactaatgg gtgggtaaga ccaccagcca gcttgcgcag gaccaccagg 12301 cagcaagaaa cttgacatta acaaaagacc tgctggggga atgatccaga aggcgatcgc 12361 attcaggagt ggaaaagcca tatcccttgc cccaatcatc agcggtacta cgtagttggc 12421 aagacctgcc atgaacggga taatccacag gaagatcatc accgtgccgt gtagcgtaaa 12481 caaaccatta tagagggagc gatccaccaa atttgactca ggcgtcacca gttcgccacg 12541 gataatcata gccaacagcc cgccaatcaa aaagaaaatg aaggtcgtca ccatgtactg 12601 aacaccgatg actttatggt ctttgctgaa gcccaagtag tcccgccagt ttgtattagg 12661 cttttcggac tcgccagccc tacccatatc ttcaatagag ttatttgtca ttcaagacct 12721 tcttttcaac taggaggatt caccagaact aagggtgtag gggtgtaggg gtggttaaaa 12781 tacgtattcg ac // LOCUS NODE_2657_length_12749_cov_5.33031412749 bp DNA linear BCT25-JUN-2018 
DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12749) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12749) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12749 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(244..984) /locus_tag="DP116_21340" CDS complement(244..984) /locus_tag="DP116_21340" /inference="COORDINATES: protein motif:HMM:PF05050.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21340" /translation="MIFFVDQIRQFSMKNLRPLIHKTGFDIVRFPPRKYNFLITLLKN HNVDTVIDVGANLGQYATEIRSEGFDGRILSIEPVPEVFKSLQKAKAKDSKWSGFNFA LGAENGTTQINLTNFSDLSSILEPTEYAKSVAPAFEVKEKIQIQLKTLDTFWNENKLN NSKVFLKLDCQGFEEQILQGANESLDKLVGVQMELSLKALYKNQRLYNDSIAFMKEKK FELYHILPVFSDVNTGQLLDMDGFFFKS" gene complement(1373..1663) /locus_tag="DP116_21345" CDS complement(1373..1663) /locus_tag="DP116_21345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875779.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21345" /translation="MTNLQQTYQVIEETIKKPPIPHEPQKQSLKAWAMYCLRDRGFKV IYAQNADFAVETRNGEKIYFKVANDGSDLDNQFGWILWDRTTKNVSFIPPES" gene complement(1714..2175) /locus_tag="DP116_21350" CDS complement(1714..2175) /locus_tag="DP116_21350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875780.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="VOC family protein" /protein_id="PRJNA477356:DP116_21350" /translation="MQQSPQSLKSVLTSGDLRRVHHIALNVHDMQASRYFYSHILGLH ELTGDEVPATLVDLVAQGKVANFVTPDGTILDLFWEPDLPPPDPNPERTFTRAYHLAF DIAPELFEQAVEVIRQNQIQIAHGPVTRPTGRGVYFYDPDGFMIEIRCDPQ" gene 3003..>3786 /locus_tag="DP116_21355" CDS 3003..>3786 /locus_tag="DP116_21355" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21355" /translation="MVESRQASDRQSVNTTNSITTSQPYAGTVGTNVGVGVSRTDDAN RVSIGQPSVGSSGTNIGAGLSTDANTNSGTNIGAGLSTDANNSSNSSSISSNGQSGAQ AANGGKNVTLTRGLIGGLIGGTLGSVAAVFAGKRIAQGVSVAAKGLGEAAKTIGGGLS QTGKGVGQAVKSIADGATEAVVGSAVDTAKGVAEGTKQVVAGTLDAVQNTAQGVNQAV QAGAEIVQDTAKSVAEGARQTVTGTLDAVQNTAQGVNQAVQGA" assembly_gap 3787..3886 /estimated_length=100 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <3887..4426 /locus_tag="DP116_21360" CDS <3887..4426 /locus_tag="DP116_21360" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21360" /translation="LDAVQNTAQGVNQAVQGAADTAKDTANRAAEGTRQAALGATDTV KDTAQGVNQAVQGAADTAKDTANRAAEGTRQAAQGAADTVKDTAQGVNQAVQSGAQNV KPSENQSNQYQDRTVADNQQINKYEVEMDRTPVVYISSPEHKELRVDPIVPVDAETSL GSDKEDLLEEEIVRIDDYR" gene 4658..5263 /locus_tag="DP116_21365" CDS 4658..5263 /locus_tag="DP116_21365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314676.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21365" /translation="MNGNQELFDQLDGNTDVATEQPQAYEVGQDTSNGLSKIAIGALI GATLGALAGALTIKGTAEKVNQTVKNVGDKVKDAAQNFNQTVQGVGGAVNTIAEGVND TVKDVGESVRGSALGVNDTVNTTVGAVKTTAINVNDTVNNTIDIIKGAAEGVTHSVTN TMDVAKSSVEDAKPSGTQNVNVANLPNNQTTYMLVPVENQK" gene 5393..7108 /locus_tag="DP116_21370" CDS 5393..7108 /locus_tag="DP116_21370" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21370" /translation="MSYNEHQYEKNADRMKEQRTTPRVGVGLGGGLVGAAIGGLLGRR VGGVSGAVIGAVAGALVGKGTAERVNRTVDSLVDAANSVAETINHNVNGVGNAVKDTV EETKTSVVSVVQAVKDTVEEAKPSVLSTVQAVKDTVEEAKPSVVDVVQAVKDTVEEAK PSVVGAVQAVKDTVEEAKPSVVNAAKNVAEAVNQSINDVKTTLKNTVKNTVDEVNQSV VGVEETAKNADEEVKPSGTHNNEIHQEPLVSEQLSKSSKEYPQVVEDVMSSSSHCPQV VEDVMSSSSHCPQDEDFIPPTVILPASPPSTLPLTPDRRGDFLMQELRVESQENSNLS DEVDIKEEIAQGLNSSQQPKSFEEIDIKDIQEIKYNNIQQKEKEFNHSQQETTQQLQQ PTLQPKTEKKQKITGIAGIIVGVSIISLIGVTLGFLPKQNQLVMKSSASSQSLSSIPE TTPERTPPTMTDGWIFLGHINKASDSVLVGKTLIKSSESTDSRIIPSVGSIVAVAVEP GIKLRDNRPQPPNFSPQQQKALAILKPQEKLKILKVEFVNPSSTTESPIKVWAKVRRC GDACL" gene 7117..7605 /locus_tag="DP116_21375" CDS 7117..7605 /locus_tag="DP116_21375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858887.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4385 domain-containing protein" /protein_id="PRJNA477356:DP116_21375" /translation="MFDYSLDFKNINFREHPELYRVGKGEQGVLLVEPYKSEILPYWR FKTPEIAKESSEKIYQMFFEYLEEDDFVGADMARKFLQMGYTRSRRYANHKSGRKYKT NPQKETCQEAQMKARKDILPNEVDPVKAESAAIFKEKWMQAKTNENYLQLLAKHKQMY ER" gene complement(7949..8164) /locus_tag="DP116_21380" /pseudo CDS complement(7949..8164) /locus_tag="DP116_21380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314679.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 8407..>10561 /locus_tag="DP116_21385" CDS 8407..>10561 /locus_tag="DP116_21385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411679.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RNA-binding transcriptional accessory protein" /protein_id="PRJNA477356:DP116_21385" /translation="MLNIPQLLATELNLKPYQVQNALELLAEGATIPFIARYRKERTS EMNEVQLRELNDKYNYLSELEERKLVILNSVAEQGKLTDELKQKIESCLQKTELEDLY LPYRPKRRTRATIAREKGLEALAEFIKSLNNKNAVSASLDAEAAKYISQEKGVKTAED ALKGASDILAEEVAEKAELRAYLRDYLLEEGVFVSRIKDDHPSGTTKFEMYRNYQMRV KNIAPHNMLALCRGEAEEVLSFEISFDEDVVLSYLESKEIKTKVRAIRDFYQAMLKDA FNRLMKTSLISEVIGEKKTYADIESIKTFEANLRELLLSAPAGMKPTLAIDPGFRTGC KVAILDQTGKFLEYQAIFPHQAAEQRLKAAQTLKNLIEKYKVELIAIGNGTASRETDE FVLEVLQAMERKPVKVMVNESGASIYSASKVALEEFPDLDITVRGAISIGRRLQDPLA ELVKIDPKSIGVGQYQHDVDQKLLKKKLDETVESCVNYVGVDLNTASKELLTFVSGIT SSVANNIVAYRNQHGVFKNRRQLLKVPKLGPKAFEQAAGFLRIRGSENPLDNTAVHPE SYSVVEAIASDLSLPLTQITQIAEKLKKANLKKYVTDTVGEPTLRDILSELEKPGRDP RAEFKYATFKEGIKEISDLKEGMELEGIVTNVANFGAFVDIGVHQDGLVHISQLADRF VDDPKKIVKVGQVVKVRVLEVNEKLKRISLSMKSVK" assembly_gap 10562..10571 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(10689..12068) /locus_tag="DP116_21390" CDS complement(10689..12068) /locus_tag="DP116_21390" /inference="COORDINATES: protein motif:HMM:PF00331.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21390" /translation="MRNHNNQQKWKRRRQTSSTLLLGFVSFLIIFGIHITLTLAQEAV LPVQQTKGNNLIAQISSDDAWRQAAANNINTYRKGDLTVVVTDANGTPVPNAEVHVAM KRHAYNFGSAVDEKTLLTSSSNSDVNKYQNNIPQLFNEGVIENGLKWPQWENLESRPQ AIEAVKWLRDKNLKVRGHNLVWPSWNNSPPSLQALYNNTLNQQGKAAADQVLRDRIIA HIRDEVGYLKGQINDWDVINEASDNHDFQDILGESVLVDWFKAAHEADPNAVLYINEN FYDNSTHGDQYESQIQYLVNNGAPIGGIGIENHLMNGTISIPQLVSILDRLGKFNLPI KITEFDIFTSNRQTQADITRDCMTAVFGHPATNGFVMWGFWDGAQWFENGPIYDRDWT VKPSGKVYMDLVFKEWWTDVVGKTDANGEYKIRGFLGDYEVTASNNLVSKTVSTTLSR NGTKLTIGM" gene 12307..12648 /locus_tag="DP116_21395" CDS 12307..12648 /locus_tag="DP116_21395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314681.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_21395" /translation="MPVYQVRLINSGIGLDRTIQVPDDQYILDIAQEAGIRLPSGCQQ GECSACVAKIISGEVDQSEQKFLRPHEVEAGYTITCVAYPLSDCTLETHQEQVLYKTS LYFQSNAAPSE" BASE COUNT 3958 a 2438 c 2760 g 3483 t 110 others ORIGIN 1 aatctcaaaa ttaagaatct ctcctttttg aatttgccta agttgaattt tgaatgtttg 61 agtgatcgtg tggggtgtta cgggtgtaag ggtggagttc ccctacatcc ctaagtcctt 121 acggacacgc tgcgcgaacg gggaagtcaa aaatcaaaaa tcaaaaggta agaagtatta 181 attttgactt tttacttgat acttttgact tgttttgccc ctatccccct acacccctag 241 ttattaactc ttgaagaaaa agccatccat atcaagaagc tgtccggtat ttacatcgct 301 aaatactggg agtatatgat aaagttcaaa tttcttttct ttcataaagg caatggaatc 361 attatatagt ctttgatttt tatacaaagc tttcaatgac agttccattt gaacgccgac 421 aagtttgtcg agagattcat tagccccttg tagaatctgt tcttcaaagc cttgacagtc 481 taattttaaa aaaactttgg aattatttaa cttattttcg ttccaaaagg tatctaacgt 541 ctttaattga atttggattt tttctttaac ctcaaaagca ggtgcgacac ttttagcata 601 ttctgtgggt tctaagatag agcttaaatc ggaaaaattc gttaagttta tttgagtggt 661 tccattttcc gctcccaatg caaaattaaa acctgaccac tttgagtctt tcgctttcgc 721 tttttgaaga gatttaaaaa cttctggtac 
tggttctata gataaaatac gtccatcaaa 781 accttcgctt ctgatttctg tcgcatattg accaagattc gccccaacat ctataacagt 841 gtctacatta tgatttttta ataacgtgat taaaaaatta tatttgcgag ggggaaatct 901 cacgatatca aaccctgtct tgtgaatcaa aggacggaga ttcttcattg aaaattgccg 961 gatttgatca acaaaaaaaa tcatattttt aaaaattaag tgatgataag gtataaaact 1021 aggaaacttg cttttagcga gagtgtttcg gagactttcg cataaatact acaacgaaaa 1081 aaccccagag acaagagtgc gttttttgag ttttattaag tacgttaata ttcaaaaacg 1141 taaaacagag tctttgcgat gacttcgttt cacgggatcg gaatgtgagg ctgactgcaa 1201 atggttatgg tgttgaaatt atcttaagcg caaggggatt tcaagctatt ttgctttagt 1261 tgggaaagtg cgataagccg gaggcttgac gctacgcgta tcgctccaca tttcaatcat 1321 ctacttcccc gtaaggggat taaaacacga caaacgacag aacacgcaac agctaagatt 1381 ctggtggaat gaagctgaca ttttttgtag tcctatccca aagtatccaa ccgaattgat 1441 tatccaaatc acttccatca tttgcgactt taaaataaat cttttcaccg tttctggttt 1501 caacggcaaa atcagcattt tgagcatata ttaccttaaa acctctatct cgtaggcaat 1561 acatcgccca agctttcaat gactgctttt gtggctcatg tggaatcgga ggttttttta 1621 tcgtctcttc gataacttga tatgtctgct gtaaattagt cattagtcat tagtcattta 1681 gtcaagaatt tttttattac cttgtttccc tagctattgt gggtcacagc gtatctctat 1741 cataaagcca tcggggtcgt agaaatacac gcctctacca gtaggacgtg taactggtcc 1801 gtgggcaatt tgtatttgat tttgtcttat gacttcgact gcttgctcaa acaattcagg 1861 agcaatgtca aacgccaagt ggtatgctct ggtaaaagta cgttctggat ttggatctgg 1921 cggtggtaag tctggttccc aaaataaatc gaggatagta ccatcaggag tgacaaagtt 1981 agcgactttc ccctgtgcga ctaaatctac aagagttgca ggaacctcgt caccagtcag 2041 ttcgtgcaaa ccaagaatat gactgtagaa gtatcgagag gcttgcatat catggacgtt 2101 aagggctatg tgatgcactc gtcgcaggtc acctgaagtc agaacacttt ttagggattg 2161 aggactttgt tgcatggcag aactctattg actgattaag ggagaagcta cactaaccag 2221 tttctagtat gacttttccc tgtattaaac gcttcactat tgataagtat tatcatctca 2281 tcaaactggg attatttcat aaatatgacc atattgaatg aaatcacaga aaaacatcga 2341 attggcaaac gtaacagtta cattgctctg tttccctaat tttttgctcg atttctctaa 2401 tatttttcct aattgttatc aagtagatga actggacttt 
aaactaattt cgagagcgaa 2461 acgccaaaat ttttacatat aaagagtttt ggagaactat agcagtccta aatcatttgt 2521 gagaaaccac taatgacaaa tgacgaatga ctaaggacag ccttcacgaa aaatctcaca 2581 actcaaatag gattgctata gctcattttc tagttagatg aatgaagcga accttaaaag 2641 ctttacacag actacaattg ggataattta tctaattttt catctcaact ccaggatgaa 2701 gaattatcag caatacttca tttgaagtta tatcagtaaa aagcaaaaaa tatattaata 2761 aaaataaata tttacaaaat atgagatgaa agtcaggaat aaaataatac caaataaact 2821 agacacggga aatatgtata tttatctaga aaaaaagatt atctcttcag agagactata 2881 ccatcaacta tgtctgtctt tagggcggtt aacttattat ttataatcgt taaatttttt 2941 agtgaagaca acatacagct gagcgctaac tttttaattt tttaattaag gaaacgattg 3001 acatggttga gagcaggcag gcttcggata ggcagagtgt aaatactact aacagcataa 3061 ctacaagtca accttatgct ggtacagtag gcacaaacgt aggggttggg gtatccagaa 3121 cagatgatgc gaaccgcgtg agcatcggtc aacccagtgt gggttcatca gggacaaaca 3181 taggtgcagg gttatctaca gacgccaata cgaattcagg gacaaacata ggtgcagggt 3241 tatctacaga cgccaataat tcttccaata gttcttcaat tagcagtaat ggtcaatccg 3301 gtgctcaagc agcaaacgga ggcaaaaatg tgacgctaac tagaggactg ataggcggac 3361 tcattggtgg tacattaggt tcagtagcag ccgtttttgc tggtaaaaga atagctcagg 3421 gcgttagcgt tgctgcaaaa ggtttaggag aagccgcaaa gactatcggt ggaggtctta 3481 gccaaacagg aaaaggtgtc ggacaagcgg taaagagtat tgctgatggt gcaaccgagg 3541 ctgttgtcgg tagcgcagtg gatacggcaa agggtgttgc tgagggaact aagcaagtcg 3601 tagccggtac attggacgca gtacagaata cagcacaggg tgtgaaccaa gctgtacaag 3661 ctggagcgga gatagtccag gatacagcaa agagtgttgc tgagggtgct aggcaaaccg 3721 taacaggtac attggacgca gtacaaaata cagcacaagg tgttaaccaa gctgtacaag 3781 gtgctgnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3841 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnttgg acgcagtaca 3901 aaatacagca caaggtgtta accaagctgt acaaggtgct gcagacactg caaaggatac 3961 agcaaatcgt gccgctgaag gaactaggca agccgcacta ggtgctacag acactgttaa 4021 ggatacggca cagggtgtga accaagctgt acaaggtgct gcagacactg caaaggatac 4081 agcaaaccgt gctgctgaag gaactaggca agccgcacaa ggtgctgcag 
acactgttaa 4141 ggatacggca cagggtgtga accaagctgt acaaagtggc gcacaaaatg ttaagccctc 4201 tgaaaatcag agcaatcaat atcaagatcg gacagttgca gacaatcagc aaatcaataa 4261 atatgaagtt gaaatggatc gtacaccggt ggtctacatt tcaagcccag agcacaaaga 4321 gctacgtgtt gatccaattg tgcctgtaga tgctgaaacc tcattgggtt ctgataaaga 4381 ggacttgctt gaggaagaaa ttgttcggat tgacgattac agatagctat ttaacattgc 4441 gagggtcatc aatcaaacct agatgtaaca acgatgcccc cttttaaata aaggtagtcc 4501 aataggacta cctatcaaca taggaagctc gtaagttata cgagatagtt tttccggctt 4561 agactttgtc aaatcaaaga caaaggtcta agcgttgatt ttttgtcgtg taaccacaga 4621 gaaattaacc agttttggta caaaggaaac gattattatg aatgggaacc aggagctttt 4681 cgatcagctt gacggcaata ctgatgtagc tacagagcaa cctcaggctt atgaagtagg 4741 tcaagatacg agtaatgggt tatctaaaat agcgattgga gcacttattg gtgctacctt 4801 gggcgcacta gccggggctt taactattaa aggtacagcc gaaaaggtta accaaactgt 4861 aaaaaatgtc ggggataagg taaaggatgc agcccaaaat tttaaccaaa ctgtacaagg 4921 tgtagggggc gcagtaaata caatagctga gggtgttaac gacactgtaa aagatgtagg 4981 ggagagcgtc aggggttcag ctttgggcgt taatgacact gtaaacacaa cagtgggtgc 5041 tgttaagact acagcgataa atgttaacga caccgtaaac aatacaatag acatcatcaa 5101 gggtgctgct gaaggagtta cccactctgt gacaaataca atggacgtag ccaagagttc 5161 agttgaagat gccaagcctt caggtactca aaacgtcaac gtagccaacc tacctaataa 5221 ccagacaacc tatatgttag tcccagtaga aaaccagaag tagagagtag ggtgggtatt 5281 gcccacccta ctggttttaa gtcttaccta gaattgtttg gttggtcata gataacgagt 5341 tgcttcagct agcggttatg cctctttggt gtcattttag gagataatta gtatgagtta 5401 caatgagcac cagtatgaaa aaaatgctga ccggatgaag gagcaacgca cgactcctcg 5461 agtaggtgta ggtctaggcg gtggattagt tggagcagcc attggtggtt tactcggtcg 5521 ccgagttgga ggagtttctg gtgctgtaat tggcgcagta gctggagctt tggttgggaa 5581 gggtacagct gagcgtgtta accgtacggt agatagttta gtagacgcag ctaatagtgt 5641 agctgagact attaaccata atgtaaacgg tgtaggaaat gccgtaaaag atacagttga 5701 ggaaaccaaa acatctgtag taagcgtagt acaagcagtc aaggatacag ttgaggaagc 5761 caagccatcc gttctaagta cagtacaggc agtcaaagat acagttgagg aagctaagcc 
5821 atccgttgta gatgtagtac aagcagtcaa agatacagtt gaggaagcca agccatccgt 5881 ggtaggtgcg gtacaagcgg tcaaagatac agttgaggaa gccaagccat ctgtagtaaa 5941 tgcagctaag aatgttgctg aggctgttaa tcagagtatc aatgatgtta aaactacatt 6001 aaaaaataca gtaaaaaata cagttgatga ggtcaaccaa tctgtagtag gtgtagaaga 6061 aaccgccaaa aatgcagatg aggaagtcaa gccatctggt actcacaaca atgagataca 6121 ccaagagcca ttggtgagcg agcaactgtc gaagtcaagc aaggagtatc ctcaagttgt 6181 tgaggatgtg atgtcaagtt catctcattg tcctcaagtt gttgaggatg tgatgtcaag 6241 ttcatctcat tgtcctcaag acgaagactt tatcccgcct acggtcattc ttcccgcatc 6301 acctccttct actctcccgt taacgccaga cagaagagga gattttttga tgcaagagtt 6361 gagggtggag tctcaagaga atagtaactt gtcagatgaa gttgatatta aagaagagat 6421 tgctcaagga ttaaattctt cacaacaacc gaagagtttt gaagaaatag atataaaaga 6481 tatccaagag attaaatata acaatattca acagaaagaa aaagaattta accacagtca 6541 acaagaaact actcaacagc ttcagcaacc aactcttcaa ccaaaaactg aaaaaaaaca 6601 aaaaataact ggaattgctg gaattattgt aggagtttct atcataagtt tgataggtgt 6661 aactttggga tttctcccaa aacaaaatca gttagtcatg aaatcatcag catcttctca 6721 aagtctatct tcaataccag aaacaacacc ggaaagaaca ccgccaacaa tgacagatgg 6781 ttggattttt ctaggtcata tcaacaaagc ctcagattca gtattggttg gaaagactct 6841 catcaaaagt tcagagtcta ctgattcacg gattattcca tccgtagggt cgatagtagc 6901 tgtcgcggtt gaaccaggta ttaagttgag ggacaataga ccacaaccgc caaactttag 6961 tcctcaacaa caaaaagctt tagctattct taagcctcaa gagaagttaa aaattctgaa 7021 agtagagttt gtgaatcctt cgagtaccac tgaatcacca ataaaagttt gggcaaaagt 7081 ccgcagatgt ggtgatgctt gcttataata ttatgtatgt ttgattattc tttagatttt 7141 aaaaatatca actttcgaga acatcctgaa ctgtatcgcg taggtaaggg tgaacaggga 7201 gtgcttttgg tagaaccata caaatcagaa attcttccct actggcggtt taaaactcca 7261 gagattgcaa aagagtctag cgaaaaaatc taccagatgt tttttgaata tttagaggaa 7321 gatgattttg ttggggcgga tatggcgcgg aagtttttac agatggggta tacgcgatcg 7381 cgtcgctacg ccaatcataa aagtggtcga aagtacaaaa ccaacccgca aaaagaaact 7441 tgtcaagaag cccagatgaa agcaagaaag gatatcttac caaacgaagt agatccagtt 7501 
aaagctgaat cagcagcaat ctttaaagaa aagtggatgc aagccaagac gaatgaaaac 7561 tatctccagc ttttagcaaa gcataagcag atgtacgagc gttaatcctc ttaaaggaga 7621 ggaagtgatt ttttacttta gttgattggt aggcagactt caattatcta ttgattcaat 7681 caacttatag gatatataaa cattttttat atcctaaaaa aactaaaacc gtttatccat 7741 ctgtgaacta tcacatggac aaggtgataa ttccgtggcg ctttacctac tcatttctct 7801 cctaaatttc cggactgaaa cttggagttt agtagataat atgctgatac ttacgtatca 7861 tttcaacctc gtaatcctga ggtttggaaa gaacctgttc ccggttatca ggggttaatg 7921 gttcttcttt ggctacttcg tcaagaatga ttttatactc attttgacgt aagagttttt 7981 tggtggtagt gagggtaaaa atcagtagca aaagcagtat ttgccaaggt gcgaagatta 8041 aacctaaaac tatacagata gctgcgaata tccccacaaa gtacccgatt tcatcggtac 8101 ttttcttgaa gatatagcca ccaactaaac cagtaaaaag tggaagcaga aaaaacagag 8161 ccatagtttc tcacttctga acaaaagcag aagcacttat gtgtgcatat gagagctgct 8221 gtgggcaaca agtgtgtttt atttcacaat ctacgttgtt ttagccatga caagctacct 8281 ataacaacag cgtataaaga caagattccg gaatcttctg ttaaactagc ttaacacttt 8341 ctggttgcac tatttaatct atctaacgta ttaccttaaa aagtagcgtt ttcaactgag 8401 acaccaatgc tgaatattcc tcaattacta gctacagaac tcaatctgaa accatatcag 8461 gtgcaaaacg cgctagaact tttggcggag ggtgcaacaa ttccctttat tgcacgatat 8521 cgcaaagagc gtacaagcga gatgaatgaa gtccagttgc gcgaactgaa tgataagtat 8581 aattacttaa gcgaattgga agaaaggaaa ttggtgattt tgaattccgt agccgaacaa 8641 gggaaactca ccgacgaact caaacaaaaa atcgaatcct gtttacaaaa aactgaactt 8701 gaggatttat atcttcctta ccgtcccaaa cggcgcactc gcgccactat cgctagagaa 8761 aaaggactgg aagcgcttgc ggagtttatc aaatcgctga acaataaaaa tgctgtttcg 8821 gcgtcgctgg atgcggaagc agcgaagtat atttcccagg agaaaggagt caagacggca 8881 gaagatgcac ttaaaggtgc ttctgacatc ttagcggaag aagtcgcgga aaaagcagag 8941 ttgcgagcat acctgcggga ctacttgctg gaagaaggag tttttgtctc ccgtatcaaa 9001 gatgatcatc cgtccgggac aaccaagttt gagatgtacc gtaattatca aatgagggtg 9061 aaaaatattg cacctcacaa tatgctggcg ttgtgtcggg gtgaagcaga ggaggtgttg 9121 tcctttgaaa tctcctttga tgaagatgtc gtgctttcct atttagagtc aaaggaaatt 9181 aagacgaaag 
tccgcgccat ccgagacttt tatcaggcga tgctgaaaga cgcattcaac 9241 cgcttgatga aaacttcctt gataagtgaa gtcattggtg aaaagaaaac ctacgccgat 9301 attgagtcta tcaaaacttt tgaggcgaat ttgcgggagt tgctgttgtc tgcgccagca 9361 gggatgaaac cgacgctagc aattgatcct ggttttcgca ctggatgcaa agttgctata 9421 ctcgaccaaa cagggaaatt tttggaatat caagcgatat ttccccatca agcagctgaa 9481 caacgtttga aagctgcaca aactctcaaa aatttgattg agaagtacaa agttgagtta 9541 atcgccattg gtaatggtac agcttcccgc gagacagatg aatttgtttt ggaagtgctg 9601 caagcaatgg aacgaaaacc agtcaaggtg atggtgaatg agtcgggtgc atctatatat 9661 tctgcaagta aagtcgcttt ggaagagttt cccgatttag atattaccgt tcgcggtgct 9721 attagtatcg gtcgtcgttt acaagatcca ctcgcggaac tggtgaaaat tgatcctaag 9781 tccatcggcg tcggacaata ccagcatgat gtggatcaaa agttgttgaa aaagaagttg 9841 gatgaaactg ttgaaagctg tgttaactac gttggtgtgg acttaaatac tgcttcaaaa 9901 gaacttttga cttttgtttc tgggattacg tccagcgttg ccaataacat tgtcgcctat 9961 cggaaccagc atggtgtatt taaaaaccgc cgacagttgc tgaaagtccc gaaattggga 10021 ccaaaagctt ttgaacaagc ggcggggttc ttgcggattc gtggaagtga aaacccattg 10081 gataacactg ctgttcatcc agagagttac tcggtggtgg aggcgatcgc ctccgacctc 10141 agcctaccat taacccaaat cacccaaatt gccgaaaaac tgaaaaaagc caatctcaag 10201 aaatatgtca ccgacaccgt tggcgaacca acactacgcg acatcctcag cgaattggaa 10261 aagccaggaa gagatccacg cgctgagttt aagtatgcaa cattcaagga aggaatcaag 10321 gaaatttctg atctcaaaga ggggatggaa ctagaaggta tcgtgaccaa tgttgctaac 10381 ttcggtgctt tcgttgatat tggtgtacat caagatggtt tggtgcacat ctcccaattg 10441 gcagatagat tcgttgatga cccaaagaaa attgtcaagg tgggacaagt tgtgaaagtg 10501 cgggtgttgg aagttaatga gaaattgaag cggattagtt tatcgatgaa gtcagtgaag 10561 cnnnnnnnnn ngtagactca cttaaagtca caaaggtgct aatatcaagt ttgcttaatc 10621 acttataact gcagtttaga taaagggtag cgcaatcgcg ctaccccttt gactgctgcg 10681 atccgaagtt acatccctat tgtcaacttc gtaccgttgc gcgaaagtgt tgtggacact 10741 gttttactca caaggttatt gcttgcggtc acttcatagt ctccaagaaa gccccgaatc 10801 ttgtattctc cattggcgtc agttttccca accacgtcag tccaccactc cttaaagacc 10861 
aagtccatgt aaactttgcc actcggcttg acagtccaat ctctgtcata aattggtcca 10921 ttttcaaacc actgtgcacc atcccaaaaa ccccacatta cgaagccatt tgtcgccgga 10981 tggccaaaaa cagccgtcat acaatcgcgc gttatatcag cctgtgtctg cctattagat 11041 gtgaaaatat cgaactcggt aattttaatg ggaaggttaa acttacctaa gcggtcaagg 11101 atgcttacga gttgcggtat cgaaatagta ccattcatta ggtgattttc tataccgata 11161 ccaccaattg gtgcaccatt attaactagg tactgaatct gactttcata ttggtctccg 11221 tgagtgctgt tgtcatagaa attctcgttg atgtaaagaa cggcatttgg atctgcttca 11281 tgggcggctt taaaccaatc tacaagaacc gactcaccga gaatgtcttg aaaatcgtga 11341 ttgtcagatg cctcattgat aacatcccag tcgtttatct gaccctttaa gtacccgacc 11401 tcgtcgcgga tatgggcgat gatgcgatcg cgcaaaacct gatcagccgc tgcctttccc 11461 tgctggttca gtgtattgtt gtaaagtgcc tgcaaactgg gtggcgaatt gttccaactt 11521 ggccagacaa gattgtgacc acgtactttg aggtttttgt ccctaagcca cttgacagct 11581 tcgatagctt ggggacgtga ctccaagttt tcccactgtg gccatttcag accattttcg 11641 attacccctt cattaaatag ctgtgggata ttgttttgat acttgttaac atcagagttg 11701 cttgaacttg tcagtaaagt tttttcgtca actgcactac caaagttgta ggcgtggcgc 11761 ttcatcgcaa cgtgtacctc tgcattaggg acaggtgtac cgttggcatc ggtcactacc 11821 accgtcaaat cacctttgcg gtatgtgttg atattgttgg cagctgcttg acgccacgca 11881 tcatcagagg aaatctgtgc gatgagatta ttgccttttg tctgttgaac cgggagaacc 11941 gcttcttgcg ctagcgtcag tgtaatatga ataccgaata taattagaaa gctaacaaaa 12001 cccaatagca gtgtgctact tgtttgccgc cttcgtttcc atttctgctg gttgttgtga 12061 ttacgcatat gaccctttgt tgcttttttt acaccactgg tttgatttca aagccaagta 12121 ttagctagct caatcagcat tgttagcata ttacactaat gtgttactga cgataaagaa 12181 tttgttactt gacatcagga acgtttccac tccgaaatag cgtaaattat tttacactaa 12241 attctgcgga cgttatgttc tgatcgctcg tctttagtct aaactccagt tgtataaaaa 12301 taacaaatgc cagtttatca agttcgactt atcaattccg gaatcggatt ggatcgcacc 12361 attcaagtcc cagatgacca atatattctt gatattgctc aagaagctgg tattcgtcta 12421 ccatcgggat gtcaacaagg agaatgttcc gcctgtgttg ccaaaataat cagtggagaa 12481 gttgatcaaa gtgagcaaaa gtttctgcgt ccacatgaag tagaagctgg 
ctacacaatc 12541 acctgtgtcg cttatccttt atcggattgc actttagaaa ctcatcaaga acaagttttg 12601 tataaaacat cgctttattt tcagtcaaat gcagcaccat cagaatagtt ctaaaggaca 12661 tttattcaat cagtaccagc cgcagtgaac aatgaactgg taactgataa ctgataactg 12721 gtaactggta actgataact gataactgt // LOCUS NODE_2666_length_12722_cov_5.82332012722 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12722) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12722) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12722 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 26..607 /locus_tag="DP116_21400" CDS 26..607 /locus_tag="DP116_21400" /inference="COORDINATES: protein motif:HMM:TIGR01766" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_21400" /translation="MDNTSHRLHQTSNRQEGIRASNKAKQYFEQVAKSHAYIANVRRD FLQKTTTDISRKYYRIRIEDLNISGMMKNEKLSEAISTLGLYEFRRMLTYKEAFYGTK VELVDRWIPSSKTCSKCGSVQSMTLSERVYICGAGCGHKMCRDLNAAINLNNAPNHKI RAASAKLTPTDRLGADSPGRSRKQTPKSSGYAI" gene complement(705..1190) /locus_tag="DP116_21405" CDS complement(705..1190) /locus_tag="DP116_21405" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21405" /translation="MHPEQLAKPLLVAAALSTIIIPIGVDAVLLANGHMNNPAWLPHA KLHCAMSFFAAVSLGSAALAILKVRPTSDRFSMGLAAFLGSAFWIGLIAAGFWPGTSY GFLNDPVLGNIREPELGGITIYPNVVLAAISIAIAVVGYWLTGQAKSPIAMAQSARQK Q" gene complement(1408..2247) /locus_tag="DP116_21410" CDS complement(1408..2247) /locus_tag="DP116_21410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015328901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_21410" /translation="MKETRYLERRTIQLFGLTIDRHISAPNELEFPGYNYHLLCFLLS DGNQQKLTRIGKQESEKPQAKGDFWICPAEVSGLWAWDSTDESLMFVIDPLFVRRMAE EIDGLDANNVELLSTVSAHDPQISAIARLFQTEFDTGCLGGQLYVESLTQVFIIHLLR QYCAFAPKKLHDESLPNNRLQQVVDYIHSYLDRPLHLAELAEISGISQYHFCRLFKQS MGVAPYQYVLQQRMEKAKKLLQQRKYAIAEIALLVGCTDQSRFAKHFKKYFGVTPGMF LRK" gene complement(2370..3200) /locus_tag="DP116_21415" CDS complement(2370..3200) /locus_tag="DP116_21415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309267.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cobalt ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_21415" /translation="MQEWLLEFEQVHYTYPGAQQSALNGLSVRIPARKRCALIGQNGC GKSTLFLLANGLYKPDKGTISWQGKPLQYNRDYLTNLRQKVGLIFQDPEQQLVASTVE EDISYGLCNLGLPEAEIQQRVEQALVEFGLTELAQRPVHHLSLGQKKRVSIADVMVLR PELLLLDEPTAYLDRLHTRNLMATLRKIHASGTTILMATHDLDLVYRWADWILVMDRG RLILEGKPQDVFTQREILEELQLGVPLIYEMLFADGVAEEGPVLERLRQRILNLFRYF " gene complement(3175..3987) /gene="cbiQ" /locus_tag="DP116_21420" CDS complement(3175..3987) /gene="cbiQ" /locus_tag="DP116_21420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobalt ECF transporter T component CbiQ" /protein_id="PRJNA477356:DP116_21420" /translation="MNIQIDTLAYTNRLRWLAPEQKLIFATILLVITAFAHPFIQILI AVWISFWTVMYAGIPAKTYLKLVYVATLFWLTSLPALVINGVDLSHIHLVPKDSLGGL TFGSYYIYLSHKGIEQGLTILTRAIASLSCLYFVMLTIPFSELLQTLRRIGVPVLITE LLLLMYRFIFVLLNTTAELWTAQQSRGGYRTFRTGMKSLALLIGQLLKRTIENYRQVS LSLAARGFNGELRVWHPHRYQKSKRYAIEAIVGCVVLIALEFWQHAGMVTRI" gene complement(3971..4285) /locus_tag="DP116_21425" CDS complement(3971..4285) /locus_tag="DP116_21425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209164.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="energy-coupling factor ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_21425" /translation="MNQSKKGLSNWLLLGGVVILAAAPFIFARNAEFAGADDRAAKAV TEIQPGYQPWFKPLMNVPSCEVQTFLFASQAALGAGTLGFLIGLYKGRSEQRNSNHEH SD" gene complement(4272..4958) /locus_tag="DP116_21430" CDS complement(4272..4958) /locus_tag="DP116_21430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318563.1" /note="catalyzes the ATP-dependent transport of cobalt; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein CbiM" /protein_id="PRJNA477356:DP116_21430" /translation="MHIMEGFLPPGWAVFWWIVALPFFILGLRSLTRITKANPELKLL LALAGAFTFVLSALKMPSVSGSCAHPTGTGLGAVLFGPLAMSVLGSLVLLFQALLLAH GGLTTLGANAFSMAIAGPFAAYWIYHLMMRLSGKQKIAIFLAAALADLLTYVITSIQL ALAFPAPAGGFVASFVKFAGIFAVTQVPLAISEGLLTVLVWNWLQSYTPQELELLKLI KREPQTNESI" gene 5537..6547 /locus_tag="DP116_21435" CDS 5537..6547 /locus_tag="DP116_21435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314552.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="copper oxidase" /protein_id="PRJNA477356:DP116_21435" /translation="MPNNSVLEQEKLLSRRNLLKLGLAGTGVASVALWQILNSRSQSR VKVPPLATQTTSDLATPSKMVREFEYGTLKRENGRVIREFRLTAGTSPIRLNSAVSFN IWDLNGRIPGPTLRAKQGDRVRVLFFNQAGHSHSLHFHGVHPSEMDGIRPVRHGTATI YEFDAEPYGVHLYHCHIQPVTRHVAKGLYGMFIIDPPTPRPPADEIVLIMGGYDVNDD NRNEYYAFNGLPNYYKDNPIRIYQNQLIRLYLLNIIEFDPAVTFHLHANFFKVYPTGM TLKPSHESDVITMGTAERHILEFAFRYPGMYMFHPHQDAIAENGCMGHFDVITESQEK MT" gene 6892..7392 /locus_tag="DP116_21440" CDS 6892..7392 /locus_tag="DP116_21440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314551.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase family protein" /protein_id="PRJNA477356:DP116_21440" /translation="MNIRLTHLLNLPGVAVESCHSSQDSVSFQLRVLTKGTYCPHCRN YTEELHQTRPILVRDVPVDSKEVYLKLPRRQFYCRVCQRYITERLQFIDWRRRYTQRY EENIYTQVNRFNIEQISQQQHLGVEQVKNIFNHINQKRKKQHLELSEKRYSYEKYGKN NKKLSP" gene complement(8739..9647) /locus_tag="DP116_21445" /pseudo CDS complement(8739..9647) /locus_tag="DP116_21445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863218.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="phosphoenolpyruvate synthase" gene complement(10263..11609) /locus_tag="DP116_21450" /pseudo CDS complement(10263..11609) /locus_tag="DP116_21450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314578.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="phosphoenolpyruvate synthase" gene 12215..12664 /locus_tag="DP116_21455" CDS 12215..12664 /locus_tag="DP116_21455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747302.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MgtC/SapB family protein" /protein_id="PRJNA477356:DP116_21455" /translation="MPSIYYLPANDWLNITFRLCLALLVGAIIGIERQIRHKPAGLRT HMLVSFGAALFTLVSLVTGADKPNGDALSRVIQGIAAGVGFLGGGEILRESSQESART EIKGLTSAAAIWVCAGLGIAAGCGLWQLGLIAALLSLLVLNVFKRFE" BASE COUNT 3737 a 2735 c 2684 g 3566 t ORIGIN 1 cgtccggttc taacggagag gtgcgctgga taataccagc catcgactcc accagactag 61 caatcggcaa gaaggcatca gggcgtcaaa taaagcaaag caatactttg aacaagttgc 121 aaaatctcat gcttacatcg ctaatgtacg acgtgatttt cttcaaaaaa caacaacgga 181 tattagtcgg aagtactacc gcataagaat agaggattta aatatctctg gaatgatgaa 241 gaatgaaaag ctttcagaag caatttctac tctagggtta tacgaattcc gtcggatgct 301 cacttataag gaagctttct atggcacgaa agtcgagctt gttgaccgat ggattcctag 361 ttctaaaacc tgctctaaat gcggtagtgt tcagtccatg actttgtcag aaagagttta 421 tatatgcggg gcaggatgtg ggcataaaat gtgccgtgat ttaaacgcag caataaacct 481 aaataacgct cccaaccata aaatacgtgc ggcaagcgcg aaattaacgc ctacggacag 541 gttaggagcc gactcccctg gaagaagtag gaagcaaact ccaaaaagtt caggttatgc 601 gatttgaacg ttttgcgtag attttgtatt gcactgaagc tagatagcgc caatctccga 661 gggagaattc ttttgcgcga tacgcacaag ctaaagcctc cggcttattg cttctggcga 721 gcactctgtg ccatcgcgat tggcgacttt gcctgtccgg ttaaccaata gcccactaca 781 gcaatagcaa tcgagatcgc agcaagcacc acatttggat aaatcgtgat tccacctaac 841 tcaggttcac gaatattgcc gagaaccgga tcattgagaa acccgtagga cgttccaggc 901 caaaacccgg cagcaattag cccaatccag aaagcggaac cgagaaatgc agcaagaccc 961 attgaaaacc gatcacttgt cggacgcacc ttcaaaatcg ctaaggctgc acttcccaac 1021 gagacagctg caaaaaatga catagcacaa tgtagttttg cgtgaggtag ccaagcgggg 1081 ttgttcatat gaccatttgc gagcaaaact gcatcaactc cgatcggaat gatgatagtt 1141 gaaagtgcag cagcgacaag cagaggttta gcgagttgtt ctggatgcat gttagtatct 1201 tcccattcaa gagtcaggcg aaggaaacga gttgaatttt acctgagctg tagaactcac 1261 aacagagatg gtacaattcg ttatcggatc aaagataggc gtttcttgcg ggatttacat 1321 gatgatttag gtcattcttg ccatcgtggt ttggatagca gagttctaga ttgcaaaatt 1381 aactttatta ctgaggtgtt gtgatgacta ttttctcaga aacattcccg 
gtgtgactcc 1441 gaaatatttc ttgaagtgct ttgcaaagcg actttgatcc gtacagccaa cgagcagagc 1501 aatttcagcg atcgcatatt tcctttgttg caatagtttt ttagcttttt ccattcgttg 1561 ctgaagcacg tattgatagg gtgctacacc cattgactgc ttaaataatc ggcaaaaatg 1621 atattgactg atgccggaaa tttctgctag ttctgctaga tgaagcggtc gatctaaata 1681 gctatggatg tagtccacca cttgctggag tcgattgtta ggcaaacttt catcgtgcag 1741 ctttttcggc gcaaacgcac agtattgtcg cagcaagtga atgataaaca cctgtgtcag 1801 cgattcgaca taaagttgcc cacccagaca gcctgtgtca aattctgttt gaaacagacg 1861 agcgatcgct gaaatttgtg gatcgtgggc acttactgta ctaagcaact caacgttgtt 1921 cgcatccaat ccatcaattt cctctgccat tcgacgcaca aagagtggat caatcacaaa 1981 catcagcgat tcatcagtac tatcccatgc ccaaagcccc gacacctcag cagggcaaat 2041 ccagaaatct cctttggctt gaggtttttc agattcttgt ttgccgatgc gagttagttt 2101 ttgttgattc ccatcgctta acagaaaaca aagtagatga taattgtatc cggggaactc 2161 aagctcgttc ggtgctgaga tatgtcgatc aatggttaac ccaaataatt gaatagtgcg 2221 gcgctcaaga taccgtgttt ccttcataca aacctaatgg gtagattaac acgcgctaga 2281 tgctctggat agaatatctt ttattgtggg tgcgagaatt gggtacagca tctaatatat 2341 gaatgtttga tcaacgccta tctacttaac taaaaataac gaaataaatt caatatccgt 2401 tgtcgcaacc gttccaaaac aggtccttct tcggcaactc catctgcaaa taacatttca 2461 tatatcaatg gcactcctaa ctgtaactct tctaaaattt cgcgttgagt aaacacatcc 2521 tgcggctttc cttccaaaat gagtcttcct ctatccataa ctaatatcca atctgcccag 2581 cgataaacta aatcaagatc atgggttgcc atcaagattg ttgtcccaga tgcatgaatt 2641 tttctcaagg ttgccatcaa gttacgcgta tgtaatctat ccagataagc cgtgggttca 2701 tctagcaata agagttctgg tcgtagcacc atcacatctg ctattgaaac tcgctttttt 2761 tgtcctaaac tgagatgatg tactggtctt tgagcaagtt cagttaaccc aaattctact 2821 aaagcttgct cgacacgctg ttgaatttct gcttctggta accccaaatt acacaaaccg 2881 taagaaatat cttcttcaac agtagaggct accagttgtt gttctggatc ttgaaaaatt 2941 aacccgactt tttggcgtaa attcgtcagg tagtcacgat tatattgcag tggcttaccc 3001 tgccatgaga tagttccttt gtcaggcttg tatagaccat tagctaacaa aaataatgtg 3061 cttttaccac atccgttttg accaattaat gcacatcttt tacgagcagg aattcgtaca 
3121 ctcagaccat tcagagccga ctgttgtgca cctgggtaag tataatgtac ctgctcaaat 3181 tcgagtaacc attcctgcat gctgccaaaa ttccaatgct attaagacta cacagccaac 3241 aattgcttct atagcatagc gcttggattt ttgatagcga tggggatgcc aaacccgcag 3301 ttctccatta aaaccccgtg ctgctaaact taaggagact tgacgataat tttctattgt 3361 tcgctttagt aattgtccaa tcaacagtgc taaacttttc attcccgtgc gaaaagtacg 3421 atagccacca cgagattgtt gagcagtcca taactcagca gtagtattga gaaggacaaa 3481 aataaagcgg tacatcagca ataaaagctc agttatcagt actggaactc ctatacgacg 3541 tagtgtttgc aaaagttcgg aaaaggggat agttagcatg acaaagtata ggcaggataa 3601 agaagcgatc gctcgcgtta aaatagtcaa tccctgctca atacctttgt gactgagata 3661 tatataataa gagccaaaag tcaaaccacc gagtgaatcc tttggcacta aatgtatgtg 3721 agaaaggtct actccattga taactaaagc tggtaaacta gttaaccaga agagagtagc 3781 aacataaact aattttagat aagttttagc aggaattcct gcatacatca ctgtccaaaa 3841 acttatccac acagcaatca agatttgaat gaaaggatga gcaaaggcag tgataactag 3901 caggatagtg gcaaatatca atttctgctc tggtgctaac catcgcaacc gattcgtata 3961 agcaagcgta tcaatctgaa tgttcatgat tactgtttcg ctgttcagag cgccctttgt 4021 acaaaccaat caaaaaccca agtgttcctg cacctaaagc agcctgagac gcaaataaaa 4081 aggtttgcac ttcacaacta ggcacattca tcagtggttt aaaccaaggt tgatatccgg 4141 gttgtatttc agtaacagct ttggcggctc tgtcatctgc acctgcaaat tcagcgttac 4201 gggcaaagat gaatggtgca gctgccaaaa tcacaactcc tccaagtagc agccagttac 4261 ttaatccttt tttagattga ttcattggtt tgaggctccc gtttgattaa tttcaatagt 4321 tccaattctt gaggagtgta agattgtagc cagttccata ccagtacagt cagtaaccct 4381 tcactaatag ctaagggtac ttgagtcact gcaaaaattc cggcaaactt gacaaatgag 4441 gcaacaaatc ctcccgctgg tgcaggaaaa gcaagagcga gttggataga tgtaataacg 4501 tatgtcagta aatctgctaa ggctgctgct agaaagatag ctattttttg tttaccactt 4561 agccgcatca ttagatgata tatccagtaa gcggcaaatg gtccggcgat cgccattgaa 4621 aaagcattag cccccagtgt tgtcaagccc ccgtgagcta gtaacagcgc ttgaaacagc 4681 aaaactaaac tacccagtac cgacatcgct aaaggtccaa atagcactgc tcctaaccct 4741 gttcctgtag gatgagcgca acttcctgat actgaaggca ttttcaatgc tgacagtaca 4801 
aaagtaaacg ctcctgccaa tgctagaagt aattttagtt caggattggc tttggtgatg 4861 cgagtgagcg atcgcaatcc caaaataaaa aacggtaatg ccacaatcca ccaaaaaact 4921 gcccagccag gtggtaaaaa accttccata atgtgcatag cgtaggctgg tttcggtaag 4981 cacacaatca agtaaaaact gaccactgcc atcaagatga gattcactaa ccttactttg 5041 gcttttgact tgggaatagt tttcaacata aacagtactt gcagatgcta aagcttttag 5101 tataaatcgc tctggctctt gactcaagaa atatctttgt cggcaaaggt tcccgctttt 5161 tagttccgtt tgagaaggtg ttgaatagta aaaagtcttg gctaccgtct tgtattgagc 5221 tactaggaaa tatatgttat acgtatcctc aatacttgta agtttaaaca caattatact 5281 tatctgacta cggtgtacaa tcacgaaaaa attaaaattt tttcacaaga cttttcaagg 5341 ttatacgagc aaatttgcaa catttagata atgtatccag ttttttgaga acttacctca 5401 tttatcttga taattattac ttgtcagact gaagaaatat tgacaataat ttttgactgt 5461 tttactattt tttgaggaaa caattgaaaa aaattagcaa ttttcttcca ttttgccgct 5521 agccatttct aaaatcatgc ctaacaattc tgttttagag caagaaaaac tattgagtcg 5581 ccgtaatctg ttgaagctgg gcttagcagg tactggtgtg gctagtgtag ctttatggca 5641 aatacttaat tcccgaagtc aatcacgggt aaaagtgcca cctctggcga cacagacaac 5701 ttctgattta gctaccccct cgaagatggt acgagaattt gagtatggta cgctgaagcg 5761 agaaaatgga cgtgtgattc gagaatttcg attaactgct ggtacttccc caattcggct 5821 caatagtgct gtttctttca acatttggga tttaaatggt cgcataccgg gaccaacact 5881 ccgggcaaaa cagggcgatc gcgtgcgggt gttgttcttc aaccaggcag gacattccca 5941 ctctctacat tttcatggcg ttcacccctc agagatggat ggcattcgtc cagttcgcca 6001 cggtacagca actatctatg aatttgacgc ggaaccctat ggtgttcatt tgtaccattg 6061 tcacattcaa ccagtcactc gtcatgttgc caaggggcta tatggaatgt ttatcattga 6121 tccccccact ccacgtcccc cagcagatga gatagtcctg attatgggtg ggtatgatgt 6181 gaacgatgat aaccgtaatg aatattacgc cttcaatggt ctgcccaact actacaagga 6241 taatccgatc cgcatttacc aaaatcagtt gattcgactt tacctgctta acatcattga 6301 attcgaccca gcagtaacgt ttcacctaca cgccaacttt tttaaggttt atcctacggg 6361 aatgacttta aaaccgtctc atgaaagtga tgtgatcaca atggggacag ccgaacgcca 6421 catcttggaa tttgcctttc gctatccagg tatgtatatg tttcatccac atcaggatgc 6481 gatcgcagaa 
aacggctgta tgggtcattt tgacgtgatc accgaaagtc aggaaaaaat 6541 gacataaatt cgggaagcgg ttgaatactc caaagatcat tgatcattat tttttataga 6601 agagaatttt tgccctacta tgtggtaaaa cttgttttag tatggcaaca aacatctgaa 6661 taacaaaaaa agattcatac ctaaaatttg ccaggggcgc ataaaactaa ggggtatgaa 6721 ttgaatcgtg catctcctac gaggagaaaa gcctactagt taatcattgc tacttatgca 6781 tcgccaaaaa aatcctgatt tgattcttgc tgtgcaatga cgaggtcatg acagaaccaa 6841 tgaatcatat aattcatggt gcaaaatgta acatcaaatg acttttttcc gatgaacata 6901 cgtctaacac atttactaaa cttacctggt gtagcagtag aatcttgtca ttcttctcaa 6961 gattcagtat cttttcaatt aagggtttta accaaaggta catactgccc acattgtcgt 7021 aactacacag aggaattaca tcaaactaga ccaattctag ttagagatgt accagtagac 7081 agtaaagaag tttacctaaa gttaccacgt cgtcagtttt attgcagagt gtgtcagcga 7141 tacattactg aacgattgca gtttatcgat tggcgaagga ggtacacaca aaggtatgag 7201 gaaaatattt atacccaagt caatcgtttc aatattgagc aaattagcca acagcaacat 7261 cttggtgttg agcaagtcaa gaacattttt aaccacatca atcaaaagcg aaaaaaacag 7321 catcttgaac tcagcgaaaa gaggtatagc tatgagaagt atggaaaaaa taacaaaaaa 7381 ttatcaccat aaagtcaaag aataagagta ttatttttgt tgactgtaga ttttttttgt 7441 aaaagtctgt atgctgaaag aagtgtctta aacatagcgt taaagcatac actttagaaa 7501 agtttcccta gatcatccgg gcatcctatt aatgacatag cgggagatga cttcctaaat 7561 tacggttttc ctcccatcta aatagttgtt tgttgaacac taactgagga tttttgatag 7621 tgctagattc acgtgttgat tcaatttttg gagagttccc tcaacaactg gatattctca 7681 acgttagagt ctatagactg agtatgttat ctgctagcat ctttctattt ttgagcttat 7741 ttaacttaca gcattctagt ccttggcatt agacatctgg tgcagattca tttgaaaccc 7801 ttgtaatagg tgggctgaaa tctaattaat acagaggtct atggatggaa ataactagac 7861 agttaaaatc tgcacaaagc tgattttata ggcattatag cttaccctgc ggcaaaggct 7921 ttgccctaca gcatacctgc tttcttgtac tagcaactta gcatatatgc gttaccgtgc 7981 ataatagttg ctggcactgc taatatcatg tcagattaaa acagtgaaca ctgaacaggc 8041 aacgccaagg tgagcgtcag ccgaacggga acagtgaaca gtgaacaatg aactgataac 8101 tgataactga taactggtaa ctgataactg ttaaataacc gttcgcagcc agtaaaggga 8161 agcaatccca aagacgcaag 
caaattgcgt tgtttgacgg agtttgcccg agagtttagc 8221 aagagtttgc atatcttttg tgttactccg aagaaatact cttgagggag tgagtgggga 8281 aagtgtttct ttcattcgta acgggttgca atcgctgccc gtgacactca acggctgttt 8341 gaaacagcac ctgttcatac caagttgcaa ttctggtggt agcataagtc aattgcattc 8401 aaagcagcgt gccctctggg catacgaaac gaggcgagcg taagcttcag gagagttttc 8461 cgcgacgagc gctaacgcgt tcgcgcccga tttgtgccgc aaccataccc taaaggtgga 8521 gcaaagcaat gtgcttaacc acaaactatt acagcagaac gcaacttaag tattaagtaa 8581 cgatttccag cttgtagctg aaaaagcaac atacagtatt caatattcag tttttcccag 8641 ttgaattgtt accccttgca ataccttata aggtagcgca aggggttttt tttgcttagt 8701 cacaacatca gcaacacttg aacgatagtc ttgaaagact actgctgatt tagttgacgc 8761 cgcgctgctt ctaaaatgat cctttgttca gcacgggcga tcgccaaata tgtccgttca 8821 acagcttccg gttcaacaga aatagaagtg atgccccatt gcaccaattg ctcaataatt 8881 tcaggataca gcgctggcgc ttgaccgcaa atagaacaag gaattcctgc actatttgcc 8941 atttgaataa gttgagcaac agcacccata accactggat gacgttcatc aaacattttt 9001 gcgagttgtc cttgctctct atccactccc aaaagcagtt gagtcagatc attagtacca 9061 attgaaattc cctgtacacc tgctttcaca tattctggca gtaaaaacag cacgcttggc 9121 acttctgcca taatccacaa ttgaaactgt ggtacttggg ttaaccttgc ttgctcaact 9181 ttttgacggc aaaaagaaaa ctcttccact gtacgaacaa aaggtaacag taaatggata 9241 ttgctgtatc ctgcgttttg tacagcggct attgcagcca gttctaactc aaaaactgcg 9301 ggatttttca catagctgaa agttccacgt tcccccagca ttgattgcgt ggaagattgt 9361 acattatcat tcaaagatag cagttcatga gaacgccaat ccaaagaccg ataaaaaaca 9421 ggtcttggtg caaaagcttg ggcgaagtga atcatctgct cacgccaacg ctctaacaat 9481 ttggttcggt gtccttcaaa gagccaagta ctcgggtgtt tcccttcgag tatattcaga 9541 accatcagtt ctgagcgcaa taatcccaca ccatctacag gtaaactttg cgcttgttct 9601 atcacactag gctggctcaa gttaaccaac agttttgtag tgatcatcgg gagactcagg 9661 cgtgatgggg agatggaaga cgcggagaat aaatatgtcc ccttgtctac ggacactttc 9721 ctcaagggga gccagcctcg tgctttggtg tcctctccta gttcccttgt ctccgtatca 9781 tcttcccctt gtcgatggac actttcctca acagaagcat cagctccaga agagttttcc 9841 gacaaaggcg tctggggttg aaaaactcca 
gcatggtgag acagcgcgaa tgacggtgag 9901 ggggtcgcag tcttggcggt tcccgtcgat tgcgttcgcg cagcgtctcc gaaggagtta 9961 cccccgaacc cggagggctt tcccgacaga ggcgactgcg ttagcgaagc gtgaccggag 10021 gtcatacccg aagggaagtg tcctccccta gtctccttcg cgcctaattc cgactccgta 10081 ggaacctcaa gttcttcttc gctcttaatc cctaacgcca atctgctcaa gtcggaaagc 10141 tcgccggacg gaagcttccg cccggcagac tttgcggaaa acccacccgc ggggctggtt 10201 cctccgtcgg aaccccgagt tccctttgtc cccgtgtcct cttccccttg tcctatttcc 10261 ctagcttctc ttgtgagacg ataaacttcc cccttgtcgc catcaatcag taatttttcg 10321 ccagtttgaa cgattgctgt ggcatctgct acgctcacca ctgctggtat tcccaattct 10381 ctcgcaagga tagcagcatg acttgtgata cctcctttgt cagtaataat gccagcagct 10441 tgttgcagta gcggtaacca atcaggtgcg attgttggtg tgactaaaat gactcccttg 10501 ggtaattgtt ctggtttttg ttgtgcattc acaataatat gagcagttgc tgtcacacgt 10561 cccggtgatg ctcctaatcc tttgagagag tggaaattat aaattgctag aggtgtactc 10621 atttgcgtta ggtaaagcgt actcgttgtc tcacaagaga tagtccactc aaaggtaaaa 10681 ttgacgccag tttctttgac aatttgactt gctagttgaa tgagttgttg cagatattct 10741 tctggcaaag cgtactgttt ttgttggtct tcttccaaca ggtaagcaac aagacttgaa 10801 tcatttgagg caagcgcacc tgaagcaggc acggtttgca aagcagctac gggcgcttta 10861 tcattaagac ggtaagctac gatcttattg cccaattgcc gttctttgac agtaccagtt 10921 tccggatgga tgtagtaaac atctggcaag acttcacctt gagtcattgc cactcctaat 10981 ccccatgttg ctaaaatttc ccattcagag gagttggcat tgagtaagcc gctggcgatc 11041 gcatttctaa aaggctgtac caaaactccc aaattcattt gttgcaggtc tattcctgcc 11101 cattgccagt acaataaact tttagcacga aacatctgac tccaggtgcg cttcaatgcc 11161 agagcaatag cctcttcatc acattcacat acctgagatt ctagcaatcc agatgtctta 11221 gtggccccat gctttatggt aggcacgaca aaagttggtc gaaaaatcag atatgctgat 11281 tttaattctt tagctgcatt gacaatttga ctaagtaggt gttgtggcaa atgagcagca 11341 ctcatctctt ggcgcaagcg tctagcaacc tgctgaagtt gacgccaatt tttgacatct 11401 aagtgtagcg aagaataagg taagtcagcg actaacgcct ccgaactgtt aagagtttct 11461 agaaattctc gcaaaacttc tgctgaaacg acaaaaccag gcactacagg ataaccgcgc 11521 tgcataattc 
tgcttaaata aaatgctttg tcaccaactt tggcgcggtc ttgtagttta 11581 atttgatcaa gccagtagag tttatccact caattacttt gttatctgtt gtcagtagct 11641 acgataatgg aattactgta atcatataag tagcgacggc ttcacgaata gtaaatatag 11701 gaatctggtt tgatttggtg agaccagtgc tgcaggaggg tttcccgaca gaggcatctg 11761 gcgttagccg taaggcgtgc gctttgcgca tacccgaagg gtgaaattac ttgcgtaggg 11821 agggaatagg gaacacccga acacccgaac agggaatagg gaatagggaa gaaggctgtt 11881 tgaagtgtac gaagcttcgc aaaaatcttt ggaggagtcc taatagtcag atctcatcca 11941 ctgaaaaact accaaaatta ggttacagca tttgtccaac atctatgttt atgacgctga 12001 cttagcttgt ggttatgcgc ttaaagttta aataagtccg tttacctgaa aaaacccaat 12061 atctctcgtt acaaggtaga agacacaaga gaagaaaatt ttttaacgat tttttatatt 12121 ttacgttaca ttatatgtgc ctattaagat agtaccatta agtacacgag tgattgtata 12181 ttaatctcca tcaagaattg ataaggattt tccattgcct agcatctact atcttccggc 12241 taatgattgg ttgaacatca cttttaggtt gtgccttgct ttactcgttg gggcaataat 12301 tggcatagaa cgccaaatta gacacaaacc agctggtttg cgaactcata tgctggtgag 12361 ttttggtgca gcgctgttca ctctcgtatc tttggtaaca ggtgcagata aaccaaatgg 12421 ggatgctctt tcgcgagtga ttcaaggtat tgctgctggt gtaggatttc ttggtggagg 12481 agaaatttta cgggaatctt cccaagaatc agcgcgaact gaaattaagg gactcacttc 12541 agcagcagct atttgggttt gcgctggttt gggaattgct gctggatgtg gtttgtggca 12601 gttaggatta attgcggctt tgttgtcttt gttagtttta aatgttttta aacgatttga 12661 ataaaaaaaa gggaacaggg aacagggaac agggaacagg gaacagggaa cagggaacag 12721 gg // LOCUS NODE_2669_length_12712_cov_4.52666512712 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12712) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
            strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 12712)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..12712
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(21..989)
                     /locus_tag="DP116_21460"
     CDS
complement(21..989) /locus_tag="DP116_21460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745295.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chlorophyll a/b binding light-harvesting protein" /protein_id="PRJNA477356:DP116_21460" /translation="MAISTDKTFAANTDSPWLVGNARLIDLCGLLLGAHVAHAGLIMF WVGSTTISEVVRFIPGIPMYEQGFTLLPHLATLGWGVSAGGEVVDTYPYFVIGMLHLV ASAVLGAGGLFHVFRGPAILHNAGGRAAKFHYEWNDPKKLGFILGHHLIILGFGALLL VLKAMFFGGIYDTNIDNVRLITNPTLDPKTIFGYLVGLKDNSWTLLGMASVDNLEDVI GGHIWISLILIAGGAWHILVAPFGWVRRIFPIANGEEILSYSLLGLALMGFISAVFVG YNMTVFPPEFYGKDRLGSANIQFFLGVLTFSGFVWHYWRSRQEEGV" gene complement(1132..2181) /locus_tag="DP116_21465" CDS complement(1132..2181) /locus_tag="DP116_21465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651804.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chlorophyll a/b binding light-harvesting protein" /protein_id="PRJNA477356:DP116_21465" /translation="MQTYGNRNVKYDWWAGNARFANLSGLFIGAHVAQAALTTLWAGA FTWFELSRYNPAVPMGEQGLILLPHLATLGFGVGEGGQVVNTYPYFVIGALHLISSAV LGAGALFHTFKGPRNLKDASGRARKFHFEWDDPKKLSFILGHHLIFLGVGALLLVGKA MFWGGLYDSTIHDVRLVSEPTLNPFVIYGYQTHFVSVNNLEDLVGGHIYVGLMLIGGG IWHILTEPFAWAKKLLIFTGEAILSYSLGGIALAGFVAAYFCAVNTLAYPVEFYGPVL ELKFGVTPYFADSVKLANGAYTSRAWLANAHFFLAFFFLQGHLWHALRAIGVDFKQVE RALNAVGSDTVSSES" gene complement(2269..2781) /locus_tag="DP116_21470" CDS complement(2269..2781) /locus_tag="DP116_21470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114247.1" /note="An electron-transfer protein; flavodoxin binds one FMN molecule, which serves as a redox-active prosthetic group; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flavodoxin FldA" /protein_id="PRJNA477356:DP116_21470" /translation="MSKKIGLFYGTQTGKTESAAEMIRDEFGSNVVTLHDMSKVEASD FDEYECLIIGCPTWNIGELQSDWDGFFPELDDIDFSGKLVAYFGTGDQIGYADNFQDA IGILEEKISEQGGKSVGSWSTDGYDFTESKAVKNGKFVGLALDEDNQSDLTKERVKAW VAQLKKEFGS" gene complement(2802..4259) /locus_tag="DP116_21475" CDS complement(2802..4259) /locus_tag="DP116_21475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129072.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chlorophyll a/b binding light-harvesting protein" /protein_id="PRJNA477356:DP116_21475" /translation="MAVQTPMLIKRWSGNARLTNLSGRLLGAHVAHAGLIVFWAGAMT LFEVTKYDPSQPLYEQGLILLPHLATLGFGVGNGGQIVDTYPYFVIGILHLVSSAVLG AGGIYHALVGPQVLPENQTFFGSFGYDWKDEDKMTTILGIHLLLLGLGAWLLVAKALF WGGLYDPAVACVRVISQPTLNAVRIFGYLFGVFGSQGMAAVNNLEDVVGGHIWVGLLC IGGGFWHIFTKPLKWAKRVLFWSGEAYLSYSLGAIAYMGFLAAYFVTVNDTVYPTVFY GPLGLSTTESGIVTVRTWLATSHFALAAVFLAGHIWHALRVRTIAAGLDFQQGVINYA DIPEVGNFATPVNASGIAQIYLQNLPIYRQGLSPFARGLEIGMAHGYFLIGPFVKLGP LRNTESANLAGLAATIGLLFILSVCLSLYASASFQQKKPAIGELPDNMKTAKSWSNFN VGWTVGSCGGAFFAFLLLNNSHLFVELVHSLTNQG" gene complement(4629..4862) /locus_tag="DP116_21480" /pseudo CDS complement(4629..4862) /locus_tag="DP116_21480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315178.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="RAMP superfamily protein" gene 5180..5397 /locus_tag="DP116_21485" /pseudo CDS 5180..5397 /locus_tag="DP116_21485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407020.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="NAD(P)-dependent oxidoreductase" gene 5688..6710 /locus_tag="DP116_21490" CDS 5688..6710 /locus_tag="DP116_21490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407021.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_21490" /translation="MAISLKPLGNQVVVITGASSGIGLVTARMAAKQGAKLVLAARNE DALRQLVDEIRGLGGEAIYVVADVGQEEDVNRIAQRAIAEFGGFDTWVNNAGVSIFGR CMDVSIPDMKRMFDTNFWGVVYGSRAAVNHFKQRQSGSGALINVGSFLGDRAVAVQST YSASKHALHGWTDALRTELEAEGAPVSVTLIHPGRIDTPYNEHARSYMPKQPAHRGMI YPPEAVAEAILYSAEHPKRDMFVGFQAKALAVLAGISPRLTDKLIELWAFPSQQSDRP SRDPEDNALYRAGYGMHERGTHQGWIRSGSLYVKAEKHPVTTTIIVAGLGTLIWWLTS SAVQGY" gene complement(7155..9536) /locus_tag="DP116_21495" CDS complement(7155..9536) /locus_tag="DP116_21495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoketolase" /protein_id="PRJNA477356:DP116_21495" /translation="MTLASPPQTKPLTGEELQKMNAYWCAANYLSVGQIYLLDNALLK EPLKLEHIKPRLLGHWGTTPGLNFIYVHLNRMIKKYDQNMIYIAGPGHGGPGILANAY LEGTYSEYYPNISQDAEGMKKLFVQFSFPGGVGSHCTPELPGSIHEGGELGYSLVHAY GAAFDNPDLIVACVVGDGEAETGALATSWHSNKFLNPVYDGAVLPILHLNGYKIANPT VLARLSHQELESLFIGYGYKPYFVEGSDPETMHQLMAATLDTVIHEIKEIQEEARNNG NTKRPQWPMIILRSPKGWTGPKEVDGEKVEGFWRSHQVPFGELASKPEHIKLLEEWMN SYNPQELFDENGKFIPELAELAPKGNRRMGDNPHVNGGLLLRDLRMPDFRNYAVDVSE SGKVISEATRVTGKFLRDVMKLNLESRNFRIFGPDETKSNRLDAVFEVTDRTWEAEKF PYDVNLSPEGRVMEILSETTCQGWLEGYLLTGRHGFFSCYEAFIHIVDSMFNQHAKWL DSCLDIPWRRSIASLNYLLTSHVWRQDHNGFSHQDPGFLDVVVNKKAKVIRVYLPPDA NTLLSVTDHCLRSRNYVNVIVAGKQPALQYLNMEAAVKHCTKGIGIWEWASNDQGSEP DVVMACAGDIPTMETLAAVDILRQNFPDLKVRVVNVVDLMKLQPESEHPHGLSDKDFD SIFTTDKPIIFAFHGYPWLIHRLSYRRTNHANLHVRGYKEEGTTTTPFDMVVVNDLDR FHLAMDVINRVPHLGYKAAYVKQMLQDKLIEHKQYINRYGEDMPEIQNWKWSY" gene 10295..10762 /locus_tag="DP116_21500" CDS 10295..10762 /locus_tag="DP116_21500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015144386.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_21500" /translation="MKIRLFRTQDTEQIAQLFHETVREVNIRDYSNNQVKAWAPDDIY FRNWSEVCLKRITYIADEEGIIVGFGELEPNGHINCFYCHKNYQRRGVGSQIYQAIEA KALELGLDRLFTEASITAKPFFQHMGFSVVKEQQVTRRGETFTNYVMEKFLSC" gene complement(10822..11244) /locus_tag="DP116_21505" CDS complement(10822..11244) /locus_tag="DP116_21505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317942.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="putative toxin-antitoxin system toxin component, PIN family" /protein_id="PRJNA477356:DP116_21505" /translation="MKVVVDVNVWISALLWGGVPDKVLILAEDKKITIFANDALFLEL EMTLRRRKFQSKIQSLDLNVDDVINATKDVIQMCPDISVDAPQLRDSKDNKILAAAVA ASAEVIITGDLDLLVLTEFNQIPILTPQDFLSRHFPES" gene complement(11241..11495) /locus_tag="DP116_21510" CDS complement(11241..11495) /locus_tag="DP116_21510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317941.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21510" /translation="MIVKVTDDGKLEIPSELLEKLKPLTEYEVSVTEDEIVFKKIRKP LTLKELRRRVQKAGSDPNQPTLKEISRIVKEVRQELWTEK" gene complement(11626..11823) /locus_tag="DP116_21515" CDS complement(11626..11823) /locus_tag="DP116_21515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317938.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21515" /translation="MKVIHGTWIPNAAADFIQPGCFYLWVETPELKKRRQKNQQIHPG HLVQDEEELYLRVKTKNQNLI" gene complement(11945..12097) /locus_tag="DP116_21520" CDS complement(11945..12097) /locus_tag="DP116_21520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320178.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lmo0937 family membrane protein" /protein_id="PRJNA477356:DP116_21520" /translation="MLGILWGLVVLLFAFWVLGLVLHIAGNLIHIALVLAITLAIYNF LKARDV" gene complement(12213..12545) /locus_tag="DP116_21525" CDS complement(12213..12545) /locus_tag="DP116_21525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656204.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YnfA family protein" /protein_id="PRJNA477356:DP116_21525" /translation="MKEILKSLLCFIWAGLFEIGGGYLVWQWLREGKPVWWGIAGGIG LALYGMLATLQPANFGRVYAAYGGVFIAMAMLWGWKVDGVAPDRYDILGGCLALLSVL IIMYAPRG" BASE COUNT 3693 a 2920 c 2766 g 3333 t ORIGIN 1 gtactggctc ccctacaccc ttacacccct tcttcttgac gcgatcgcca atagtgccaa 61 acaaaacctg aaaacgtcaa tacacccaaa aagaactgga tatttgctga gccaagacgg 121 tctttaccgt aaaactctgg cggaaaaact gtcatgttat acccgacaaa cacggcagaa 181 atgaaaccca tcaaagccaa acctagcaaa ctgtaggaaa gaatttcttc accatttgca 241 attgggaaaa ttcgcctaac ccaaccaaaa ggtgccacaa gaatgtgcca cgcaccaccc 301 gcaattaaaa ttaaacttat ccaaatgtga ccaccgataa catcttctaa gttgtcaaca 361 cttgccattc ctaaaagtgt ccagctatta tctttgaggc caactaagta gccaaagatt 421 gtctttggat caagagtagg attggtaatc aggcggacat tgtcaatatt ggtgtcataa 481 attccaccaa aaaacattgc ctttagcacc agtagcaatg caccaaaacc tagaataatc 541 aggtgatgac ctaagatgaa acccaatttt ttgggatcgt tccattcata atgaaatttt 601 gctgcccgtc ctccagcatt gtgcaaaata gctggaccgc gaaacacatg aaaaagtccc 661 cctgcaccca aaacagcaga tgcgactaaa tgcaacattc caatgacaaa gtagggataa 721 gtatcgacaa cttcaccccc agcactgact ccccaaccta aagttgccaa gtggggtaac 781 aaagtgaatc cctgctcata cattggtatg cctgggataa agcgtaccac ttctgatatg 841 gtcgttgatc ctacccaaaa cattatcaag cctgcgtgag ccacgtgagc gcccaaaagc 901 agaccacata agtcaattaa acgagcatta ccaaccagcc agggggaatc tgtgtttgct 961 gcaaaggttt tatcagtaga aattgccaca attgaattcc tttgtgaaca ttctgagatt 1021 ttgggaatct aaatgactcg tgtctgaccc aaaaattacc tgttgaagcg aagtagggtg 1081 ggcatgggca atgcccacca gtactaccaa aatcttacaa tctataaagt tttacgattc 1141 gctgctaaca gtatcactgc caacagcatt taatgctctt tcaacttgtt tgaagtcaac 1201 gccaatggct cgcagtgcgt gccacagatg tccttgcaag aagaagaaag ctaagaagaa 1261 gtgggcgttt gccaaccaag cacgagacgt ataagcgcca tttgccaact taacgctatc 1321 ggcaaaatag ggtgtaacgc cgaatttgag ttctaaaaca ggaccgtaaa attccacggg 1381 ataggctaag gtgttcactg cacaaaagta agctgcaaca aatcctgcta aggcaatacc 1441 acccaaagag taggaaagaa tggcttcacc 
tgtgaagatt aaaagcttct tcgcccaagc 1501 aaaaggctct gtaaggatgt gccaaatacc tccaccaatg agcataagac ccacatatat 1561 atgaccgcca actaaatctt ctaagttgtt aacgctgaca aagtgggttt gatagccgta 1621 gataacaaat gggttgaggg taggctcact cacgagtcgg acatcatgaa ttgtggagtc 1681 gtagagtcct ccccagaaca ttgctttgcc cacgagtagt aaagccccta ctcccaagaa 1741 aatcaagtga tgtcctaaga taaagctgag ttttttgggg tcgtcccatt caaaatgaaa 1801 tttacgagcg cgaccagagg cgtctttcaa gttccgagga cctttgaagg tatggaagag 1861 cgcacctgca cccaaaacag cggatgagat caggtgcaaa gctccaatca cgaaataggg 1921 gtaagtattc actacctgac cgccttctcc cacaccaaag cctaaagtcg ctaagtgggg 1981 cagcaaaatc agcccttgct cacccatcgg taccgcaggg ttgtatcgag agagttcaaa 2041 ccaagtaaat gcacccgccc agagggttgt cagtgcagct tgcgccacat gtgcgccgat 2101 aaataagcca gagagattgg cgaagcgagc gttaccagcc caccagtcat atttgacgtt 2161 tcgattaccg tatgtttgca ttgcttttat cttgataagt tattgctagt catcatcaac 2221 aaagttgatt tccacttggc ttttaagttg aggaaatgga aaacttagct acgaaccaaa 2281 ttctttcttt aactgagcaa cccaagcttt gactcgctct tttgttaaat cagattgatt 2341 gtcttcatca agtgccagtc ccacaaattt gccattcttg acggctttag actcagtaaa 2401 gtcatagcca tcagttgacc aagagccaac acttttacca ccttgctcgg aaatcttttc 2461 ttctagaatc ccgatcgcat cctggaagtt atcagcataa ccaatctggt ctccagtacc 2521 aaaatatgca accagtttgc cactgaaatc tatgtcgtcg agttcgggga aaaagccatc 2581 ccaatcactt tgaagttcac caatattcca agtgggacag ccaataatca ggcattcata 2641 ttcatcaaaa tcactcgctt ccactttgga catatcgtgc aatgtgacga cattactacc 2701 aaactcatcg cgaatcattt ccgcagcaga ttcagttttg ccagtttgag taccgtagaa 2761 cagaccaatt tttttcgaca ttttcacacc taatagattg cttagccctg attcgtcaac 2821 gaatgcacca gttcaacgaa caaatggcta ttattcaaaa gcaggaaggc aaagaacgct 2881 ccgccacaac tacctactgt ccagccaaca ttaaagttgc tccaagactt agcggttttc 2941 atgttgtctg gtaattcgcc aatagccggt ttcttctgtt gaaatgatgc acttgcatac 3001 aacgacagac aaactgacaa aataaacagc agaccaattg tggcagctag tcctgctagg 3061 ttcgctgact cggtgttccg caaaggaccc aacttcacaa atggaccaat caggaagtag 3121 ccgtgcgcca tgccaatttc caatccccgt gcaaagggag 
acaatccctg acggtaaatc 3181 ggcagattct gcaaatatat ctgcgctatg ccagaagcat taactggggt agcaaaattc 3241 ccaacttccg gaatatccgc ataattgatc actccttgtt gaaagtcaag ccctgcggct 3301 atggtacgca ccctcagtgc atgccagatg tgaccagcca aaaagaccgc tgccaaagca 3361 aagtgactgg ttgcaagcca ggtgcgaact gtcacaattc ccgactcagt ggtactgagt 3421 cccagtgggc catagaacac tgtggggtac actgtgtcat tgactgtgac aaagtatgcc 3481 gcgagaaagc ccatgtatgc tatagcaccc agactgtagg acaggtaagc ctcaccagac 3541 caaaacagta cccgttttgc ccatttcaag ggttttgtga agatgtgcca gaatccgcca 3601 ccaatgcaga gtaatcctac ccagatgtgt ccgccgacga catcttctaa gttgttaact 3661 gctgccatcc cctgactgcc aaaaacaccg aaaagatagc caaagatgcg aacagcgttg 3721 agagtcggtt gactaatcac ccgcacacaa gcaactgccg gatcgtacaa accaccccaa 3781 aacagtgcct tagccactaa cagccaggct cctaacccta acagcagcag atgaatacca 3841 agaatggtgg tcatcttgtc ttcatctttc cagtcatagc caaacgaacc aaagaaggtc 3901 tgattctcag gcaagacttg cggacccaca agggcatggt aaatgccacc agcacccaga 3961 acggctgaac tgaccagatg cagaatgcca atgacaaaat acggataggt atcaacaatt 4021 tgtccaccat tgccaacacc aaaccccaga gttgctagat gtggcaatag gattaaccct 4081 tgctcgtaca agggttggga ggggtcatac ttggtgacct caaataaggt catggcacct 4141 gcccaaaaga caatcaaccc tgcatgagca acatgagcac ctaaaagtcg tcctgagagg 4201 ttggttaacc gagcgttccc tgaccacctc ttaatcaaca taggagtttg aacagccatg 4261 ttggtcatga ccttctctcc tgaaaaattg tcaaaacaac tggaatacgc attttgctta 4321 atgaataagt atctcaataa gtcttgcgtt gatcgtatca gacttcttgc aaattattct 4381 caatttattt gttaattttt gtaaatctct agcctgagat ttttaccttt cttcacacaa 4441 cgacaaatca gcccatcaga taacaccctg aaaggttaac aggacatgat atcaaacact 4501 ctgaagaaac acaggggtgt aagggtgtag gggtgtaggg gtgtaagggt gtaagggtgt 4561 aaggaaaaaa ggtgtttctt tggttgcggg ggtgtaatcg cgccccttgg gcactcatat 4621 catgtccgtc accagagttt ttgaaaacct tgtggtcgat tttgttgaga attgagaaac 4681 tgcacaaact gccgggactc agcagaatca tcgggaaata aagtcaaaag ttcaaagtat 4741 tttggggttg tctttggaat tggtcttcca tttaagtctt ggggatttct gactgaccta 4801 accaaagggt acatccgata ccataatcga cctattcgcc ccatttgtcc 
agtcaccgat 4861 gaaataatgt atacctagct aaagttttgt atcgaagact tattaagtat taatttagat 4921 ggtttgccct ctcgggaatt cccaattcct tcaaacacat ttgtattagg aaacacacaa 4981 taagttaaaa ctaatctcat tctgtagtta gaagttgggt agaggaatac acagttaatt 5041 attaactaac aagaacaaag aatcaaacaa cgcgcataag aaccaccgag agcagatgct 5101 gcgctagccg ctcgaacaag agcagtcgta cgaatgcgct gtagtttttt tactcaaaac 5161 cgggttgaaa ggagagttaa tgacctcaca gaatctacaa gaacaaagac cgaagccgcc 5221 gttccccgaa caacaacaac aggagccggg actagagtca cagatgaacc cgaagccgga 5281 cagggcgcga aaattcctat cgtggatccg gaaaactgca atcaagagtc ggatgcgcag 5341 gaaaccgttc gtatcgtcga gaaggagggg cgcaagtgcg tcgcgatcgc aggcgaggca 5401 tagggcgcgt ggtgagtaca agcgattaca caattaattt tgaattttgt atctcctgcc 5461 cagacgctga gcgcaaagcg cacgctccgc gaacgcgtta gcgcagcgtg ccgtaggcat 5521 acgcgcagcg tgcccgtatc tcctgcggag accgcacggt acaggcctca agccgtgccg 5581 caggcatagg gcatattttg aattttgaat gtttgagaag cgagtgacgt tctgctttat 5641 acccaatgac aaataaacct ataaaaattt tgtttgcata aaaagttatg gctatttcac 5701 tgaaacctct gggaaaccaa gtcgttgtca ttacaggtgc ctctagtggc attggcttag 5761 tgaccgcaag aatggcagca aaacagggcg caaagcttgt cctggcagca cgaaatgaag 5821 atgctctgcg tcaactggtt gatgaaattc gcgggcttgg gggggaggcg atctatgttg 5881 tcgctgatgt cgggcaagaa gaagatgtga atcgcatcgc ccaaagagcg atcgctgaat 5941 ttggcggctt tgatacgtgg gttaacaatg caggcgtatc gatctttggt cgctgcatgg 6001 atgtttcaat ccccgacatg aagcgcatgt ttgacacgaa cttctggggc gttgtgtatg 6061 gctctcgcgc cgccgtcaac cacttcaaac agcgccaaag cggcagtggg gcgttaatca 6121 atgttggcag ctttctgggc gatcgagcgg tggcggttca gtcaacctat tcggcttcaa 6181 aacatgcgct gcatggttgg acagatgccc tgcgaacgga gttggaggcg gaaggtgctc 6241 ctgtatcagt aacattgatt caccccggac ggattgacac tccatacaac gaacatgccc 6301 gcagctacat gccaaaacaa cccgcgcatc gcggcatgat ctacccaccc gaagcggtcg 6361 ctgaagcaat cttgtattcg gcagaacatc cgaagcgaga catgttcgtc ggtttccaag 6421 ccaaggctct ggcggtgttg gctggtatct caccacggct cacggacaag ttgatagagc 6481 tttgggcgtt tccctcgcaa cagtctgatc gcccctcgcg cgaccctgaa gacaatgcgc 
6541 tctatcgggc tggctacggt atgcatgagc gggggactca ccaaggctgg attcgatcgg 6601 gcagcctcta cgtgaaagct gagaagcatc cggtgactac aaccattatt gttgctggtc 6661 ttggcacctt aatctggtgg cttacatcct ctgcggttca aggttactaa attgccaaaa 6721 ttctgttaga caatccatcc ctgtacgccc cgagatcatc ttggcttggg gggctataag 6781 acccacacca taatttttat ttggtgtggg atacatggac accccaagtt aattttatgg 6841 gattcaatat ccctaaaatt aacaataaaa tgcggtagtt tttgacagga tttgctaata 6901 aaaatgatac aatagaaata ccgaatagcc atcggttgac ttgactcaaa cggacacgtc 6961 ttagcagata cgcgcattgg gtgaagttga aatttaggcg aatggcacac gagagggagc 7021 gcagtttgca ctgaagtttg ctgccgatgt gcctagtcat gtcgatatgt ttagcaagtg 7081 aactacccac acgctccctt gcggtacagt gtgggcttcc aacttcacgc gcgaagaagt 7141 taatttgaaa ttccctaata cgaccacttc caattctgaa tttccggcat atcctcaccg 7201 tacctattaa tgtactgctt gtgttcgatg agtttgtctt gcagcatctg cttaacataa 7261 gctgctttgt aacctaggtg tggtacgcgg tttatcacgt ccatcgccag gtggaagcga 7321 tccaaatcgt tgacgacgac catatcaaaa ggcgtggtag tcgttccctc ttctttgtaa 7381 ccccgcacat gcagatttgc atgattcgtg cgacggtaac tcaaacggtg aatcagccaa 7441 ggataaccgt ggaaagcaaa gataatcggc ttgtcagtgg tgaatatact atcaaagtct 7501 ttatcgctga gtccgtgggg gtgttcactc tcaggttgaa gtttcatcaa gtctacgacg 7561 tttaccaccc gcaccttcaa gtctgggaag ttttgacgca gaatgtccac ggctgctaag 7621 gtttccattg tcggaatatc tcctgcacat gccattacca catctggttc actaccttgg 7681 tcattgcttg cccattccca aataccgata cctttggtgc agtgttttac tgctgcttcc 7741 atattcaagt attgcaaagc aggttgcttc ccagcaacga taacgttgac atagttacgg 7801 cttcgcaaac agtggtctgt taccgacagc aaggtattag catcgggggg taaatacacg 7861 cgaatgactt tcgctttctt gttaaccaca acgtcgagga aaccggggtc ttggtgagag 7921 aagccgttgt ggtcttgccg ccaaacgtga gaggtgagaa ggtagttgag ggaagcaatt 7981 gatctgcgcc aaggaatatc aaggcaactg tcgagccatt tggcgtgctg gttgaacatt 8041 gaatctacaa tgtggataaa cgcctcgtag caggagaaga agccgtgacg accagtgagg 8101 aggtatcctt ctaaccaacc ctggcaagtg gtttcactga gaatttccat cacccgacct 8161 tcgggagaca ggttgacatc ataaggaaat ttctcagctt cccaagtccg gtctgttacc 8221 
tcaaatacag catccaagcg atttgatttg gtttcgtctg gaccaaatat gcggaagttg 8281 cgggactcca aattgagttt catcacatcc cgcaggaatt ttcccgtcac cctagtggct 8341 tcggaaataa ccttgcctga ttcagaaacg tcaacggcat aattccgaaa atcaggcatt 8401 ctcaagtcac gcagcagaag accaccgttg acatgggggt tgtcacccat gcgtcggttt 8461 cctttggggg ccagttctgc gagttctgga ataaacttgc cattttcatc gaagagttct 8521 tgtggattgt aactgttcat ccactcttct agaagtttga tgtgttctgg cttgcttgct 8581 aactcgccaa aaggaacttg gtgcgatcgc caaaaaccct ctaccttctc cccatcaact 8641 tccttgggtc ccgtccaacc tttggggctt ctgaggataa tcatcggcca ttgcggacgc 8701 ttagtgttgc cattgttacg cgcctcttcc tgaatttctt tgatttcgtg aatcacagta 8761 tccaaagtcg ccgccatcag ctggtgcatt gtttccgggt cagacccttc gacaaagtaa 8821 ggcttgtaac cgtaacctat aaataaactc tccagttctt gatgactcaa tcgcgccagc 8881 actgtcgggt tcgcaatttt gtagccgtta agatggagaa tagggagaac agcaccatca 8941 tacacggggt tgagaaactt attcgagtgc cagctcgtag cgagagctcc cgtttccgcc 9001 tcaccatcgc caacaacaca agcaacaatc aagtcggggt tgtcaaatgc agcaccatat 9061 gcatggacga gagagtaacc aagttcaccc ccttcgtgga tagaaccagg aagttcagga 9121 gtgcaatggc taccaacacc gccagggaat gagaattgga cgaagagttt cttcatcccc 9181 tcagcatctt gggagatgtt gggatagtat tcgctgtagg taccttctag gtaagcgttt 9241 gcaagtattc caggaccgcc gtgacctgga ccagcaatat agatcatatt ctggtcatac 9301 tttttgatca tccggttgag gtgaacgtag ataaagttca aacctggagt tgtcccccag 9361 tgacccaaaa gtctgggttt gatgtgttcc agctttagcg gttctttaag cagtgcgttg 9421 tcgagtagat atatttgtcc aactgaaaga taattagctg cacaccagta ggcgttcatt 9481 ttctgcagtt cttcacctgt taaaggcttt gtttgtgggg gacttgctag agtcatatcg 9541 aactccttga caaaaggatt ttgatgaatt tacccgtatg aagtttagtt atctttgaca 9601 tgacggtcat ctaacttctg accaatcttg ccaatattaa tttcacaaaa gtcgcaagtt 9661 aatgcttgtg ataagtagct caacctaatt aaatgtaaaa tgtcattgcg agcgaaacgc 9721 agtggagcga agcaatcgca agaatcgtat tttatgtttt tttatgttga cctacttact 9781 cgcaagtgca tttgggtaac gtataccatc actctgcgat agttatgtta actacagtta 9841 aagacgtgat taattttgaa tacttgctta cgaaacacag ggaacaggga acagggaaca 9901 gggaacaggg 
aactcttaac aagcaagaag gaggatggtg tttctttcat atgttactag 9961 cggttaagag cgcatgagtt taaggctcag atcttgcacc atagtattcg tccgccaaga 10021 aataaatttc ttggctcaaa accaaagtcc gttaaaacgg actgggtaag tctttgagtc 10081 cgttttaacc gacttgatct attagccttg tagacgccgg aggcggcttc ccgcagggta 10141 cttgagttca aggcgtactc actggtgaag tgcaagattt gagtttagca attaaacccg 10201 tccaaatatt actggtagtt gaaagtcatc ttgtcaatgg acagccgctg aaacggtgtc 10261 agatttgaaa tacattatag gcagggctat cttaatgaaa atacggttgt ttcgtacaca 10321 agacaccgag caaatagcac agttattcca cgaaacagtg cgtgaagtca acatccgcga 10381 ttactcaaac aatcaagtca aagcgtgggc accagatgat atttacttta gaaactggtc 10441 agaagtttgt ttaaaaagaa ttacttatat tgcagatgaa gagggaataa ttgttggttt 10501 tggggagtta gaacccaatg gtcatattaa ttgtttttac tgccataaaa actatcagcg 10561 tcgcggagta gggagtcaaa tctatcaggc aattgaggca aaagcattgg aattaggatt 10621 ggatcgcctg tttactgagg cgagtattac tgcaaagcca tttttccagc atatgggatt 10681 ctcagttgtc aaagaacaac aagtgacccg tcggggagaa acttttacta attatgtcat 10741 ggagaagttt ttaagttgct aaggcaaatc aaaatcttgc agaagcagga ctctcaaaat 10801 tcgtcgagat tcgtgcttat atcagctttc aggaaaatgg cgagagagaa aatcttgagg 10861 tgtcaaaatg ggaatttgat taaattcagt caagactagt aaatctaaat caccagtaat 10921 aatgacttcc gcactagctg caacagctgc agctaaaatt ttattatctt ttgaatcgcg 10981 tagttgaggg gcatcaacag aaatatctgg acacatctgt atgacatctt ttgtcgcatt 11041 aatgacatca tctacattca aatctaaaga ttggattttg gactgaaatt ttctacggcg 11101 taaagtcatt tctaattcta gaaataaagc atcattagca aaaatagtaa ttttcttgtc 11161 ttctgcaagt attaaaactt tatctggaac acctccccac aacaaagctg atatccaaac 11221 gttaacatca actacaactt tcatttttca gtccaaagtt cttgtcgtac ttccttgact 11281 atccgactga tttctttcag tgtaggttga ttggggtctg aacctgcttt ttgtactcgt 11341 cttcttaatt cttttaatgt tagtggttta cggatttttt tgaacacaat ttcatcttct 11401 gtgactgaaa cttcatactc agtcaatggc tttagttttt ctaacaactc tgatgggatt 11461 tctagttttc catcatctgt cactttgaca atcatcttta tatctacagt gtttattttc 11521 taatttccat tttactatgt acgtccgttg aatatgagcg tactgaaatc cgttagtgta 
11581 aggttaagcc tgttgccaaa aatttatcaa tcaatgcagt tcacctcaga tcaaattttg 11641 attcttcgtt ttcactcgca gatataattc ctcttcatcc tgtactagat gtccggggtg 11701 aatttgttga tttttctgac gacgtttttt gagttcaggg gtttctaccc acagataaaa 11761 acacccaggt tgtataaagt cagccgctgc gttgggtatc cacgtaccat gaataacttt 11821 catgagtcca cgcttttggt gagtctagaa tttctagagg gtctagatta ctttaccaaa 11881 agtaagtatg gactatgaca taactatgca aaaaattcag attctgtttg caaagccgct 11941 atagctacac atcacgggct tttaaaaaat tatagatcgc aagagtaatg gctaaaacta 12001 acgcgatatg aatgagattc cctgcaatgt gaagtactag acccaatacc caaaaagcaa 12061 acagtagaac gacaagaccc caaagtatgc ctaacataat ttttgattca ggtttctagt 12121 agaggatttc tatcagagta attcctagag actacaaaaa aataatttcg atacaaattt 12181 ttaaataatt tttaaacaca ctcatcttct ttctatcctc tgggtgcata catgataatc 12241 aatacactca ataacgctaa acagcctccc agtatatcgt agcggtctgg agccactcca 12301 tcaactttcc aaccccaaag cattgccatt gcaataaaca cgcccccgta ggcagcataa 12361 actctaccaa agttcgcagg ctggagagtc gcgagcatac cgtataaagc taacccaata 12421 cctcctgcaa ttccccacca gacaggtttt ccttctctta accattgcca tacgagataa 12481 ccgccaccaa tctcaaacaa gccagcccag ataaagcaaa gtaaggattt gagtatttct 12541 ttcaattgag aatatcctca gcgaaccagt gaagattgtt tacaaattat agattgtctc 12601 aagtttgtgt attggcaaga gtataggaaa acacaacaat tgtgacttat tctatagcag 12661 ggaacaggga actcttaata ggcgtgtgat tgtctcgtgc tgtccgtatt tt // LOCUS NODE_2676_length_12687_cov_5.04623212687 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12687) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12687) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12687 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(189..890) /locus_tag="DP116_21530" CDS complement(189..890) /locus_tag="DP116_21530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314795.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="flavodoxin family protein" /protein_id="PRJNA477356:DP116_21530" /translation="MKFFIVHAHPEPNSFNGALTRYAKEVLYTTGHEVIISDLYAMQF NPVSDRRNFTYNKELNYYNQQSEEMYATEVDGFAPDIKAEMEKLDWCDVLIFQFPLWW FGLPAILKGWVDKVFAMGRTYGGGRFYDNGVFQGKKAMLSLTTGGALTMYTEIGINGE IRTILYPINHGIFRFVGFDVLPPFIVWGASRIGDERRQAYLEEYKQRLLTINATPPIV YPSLKDFDEDFQLKQ" gene complement(1008..1739) /locus_tag="DP116_21535" CDS complement(1008..1739) /locus_tag="DP116_21535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thiol:disulfide oxidoreductase" /protein_id="PRJNA477356:DP116_21535" /translation="MIELYYWTTPNGHKITMFLEEVSLPYTVIPVNIGAGDQFKPEFL TISPNNRIPAIVDHEPAGGGEPISVFESGAILLYLAEKTGKLIPADLRDCASARLRER VEVLQWLFWQMGGLGPMAGQNHHFSQYAPQKIEYAINRYVNETGRLYAVLDKRLADRE FLAGDYSIADIASYPWIVPYERQGQKLENFPHLKRWFETIQVRPATIRAYEKAEAFKN QALDIEKSRDLLFSQSAKTIQKPTT" gene complement(2033..2233) /locus_tag="DP116_21540" CDS complement(2033..2233) /locus_tag="DP116_21540" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21540" /translation="MPTARLCKSGNAGRTPHASTEDSDAPQWLLFWENAAMLYAWVPQ DPYGYALCARLRAYALCARPEG" gene complement(2324..3073) /locus_tag="DP116_21545" CDS complement(2324..3073) /locus_tag="DP116_21545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_21545" /translation="MKKLEGKVALVTGGTSGIGLATAKRFVAEGAYVFITGRRQTELD AAVKAIGKNLTGVHSDASNLADLDRLFATIKQEQGHLDVIFANAGGGELAPLGEITEE HFDKTFNTNVKGLLFTVQKALPLLPEGASIILNASATSIRATPAFSVYSATKAAVRSF ARNWTLDLKERKIRVNAISPGVVPTPGYNLLGLNDEQVQAFVDSQASTIPLGRVGTPD EIAKAVVFLASDDSSFVNGIELFVDGGMAQI" gene complement(3397..4068) /locus_tag="DP116_21550" CDS complement(3397..4068) /locus_tag="DP116_21550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314798.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitroreductase" /protein_id="PRJNA477356:DP116_21550" /translation="MVQELNLQTKPLDVPSAIIQRRSIKTFKPDPISPELLHQLVKLT VAAPSSYNIQDWRIVLVQDDAQKAALAAAAWNQKQVVEAPVTFVFAADATAGEQDLTP ILNQGLETGAWNQGTVNYFKTNIPQYQAGLGDKRREYAIKDAMIAATHLVLAAESLGL STCFMNGWIEEKVKEVIGAADNRDIAIAVLVPVGYAAEPRRNPGRLPFSHNVFVDRLG NPYEG" gene complement(4151..5266) /locus_tag="DP116_21555" CDS complement(4151..5266) /locus_tag="DP116_21555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314799.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alkene reductase" /protein_id="PRJNA477356:DP116_21555" /translation="MSTNINLLSPYKLGDLELPNRIVMAPLTRNRAGEGNVPHQLNAI YYTQRASAGLIISEATQVSPQGQGYPYTPGIHSQEQVEGWKLVTDAVHQHGGRIYLQL WHVGRISHPDFQPNGDLPLAPFAIAPKGQVLTYEGMKPYVTPRALETSEIPEIVEQYR KGAENALAAGFDGVEVHGANGYLLDQFLRDGTNKRTDKYGGSVENRARLLLEVTEAVV GVWGAKRVGVRLSPSGTFNDMQDSNPLATFGYVAQALNQFDLAYLHIFEAIEADIRHG ATVVPTSHIRERYHGTIMVNGEYTPEKGNAVLAKGEAELVSFGTLFISNPDLPRRIAL NAPLTEADSTSFYGGGEKGYTDYPTVEEQYPSLIGAK" gene complement(5376..6221) /locus_tag="DP116_21560" CDS complement(5376..6221) /locus_tag="DP116_21560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412474.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_21560" /translation="MDLKLHGKSALVSASTAGIGFAIARGLAQEGASVIITGRSSERV EQAICALRSSVCAAPPKGLAIAKITQTNPEAKISGVVADLAKKEGASQVFQQVPHIDI LVNNLGVYEPKAFSDITDEDWFNIFEANVLSGVRLSRHYLPKMLEPNWGRVIFISSES AIQIPVEMIHYGMTKTAQLAIARGLAQSTIGTQVTVNSVLAGPTRSEGVDDFVAKMAK DRGISPSEVEADFFKNVRPTSLIKRFATTDEVAAMVVYLSSPIASATNGAALRVDGGV VQSVV" gene complement(6396..6719) /gene="trxA" /locus_tag="DP116_21565" CDS complement(6396..6719) /gene="trxA" /locus_tag="DP116_21565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314800.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thioredoxin" /protein_id="PRJNA477356:DP116_21565" /translation="MSSVANVTDATFKEEVLNSEVPVLVDFWAPWCGPCRMVAPVVDE IAAEYTGQVKVVKLNTDENPTIASNYGIRSIPTLIVFKGGRQVDTVVGAVPKTTLSKT LAQYL" gene 6798..7148 /locus_tag="DP116_21570" CDS 6798..7148 /locus_tag="DP116_21570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314801.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_21570" /translation="MRFLHHPDRKHISLAGVLYALGDPVRLEIVRRLAVKEEQCCADF DFAIAKSTMSNHFKILRESGVVLTRKEGTQHINMLRKEDLSALFPGLLDAVLRSAKPL SVGCSSQQTTSQQV" gene complement(7382..8287) /locus_tag="DP116_21575" CDS complement(7382..8287) /locus_tag="DP116_21575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314803.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar ABC transporter permease" /protein_id="PRJNA477356:DP116_21575" /translation="MNRLTAKDWVLITRRLTPYLFLLPALFILGLTVFWPALQAFYLS FTRYEYDLTQMPQWVGFSNFQRLWNDRVFWQTLFNTLLYLVGVVPILVIAPLALAILV NQKLRGMHWFRAAYYTPVVISMVVAGIAWKWLYAENGLLNQLLKGIFPEGIPWLTSPR FALFSVMAVTVWKGLGYYMVIYLAGLQSIPTDIYEAAAIDGSDGVRKHWDITVPLMKP YIALVAVISAISATKVFEEIYIMTQGGPRNSSKTIVYYLYEQAFSNLEISYACTIGLV LFLIILGLSIVRLTIDRQGGDNITA" gene 8388..9116 /locus_tag="DP116_21580" CDS 8388..9116 /locus_tag="DP116_21580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874581.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3891 domain-containing protein" /protein_id="PRJNA477356:DP116_21580" /translation="MIANLDQNGWEVIYHRAHALLAAQIAGHWHPEKRPLRWLETIAA ISHHDDLEKEWEGNYLTEAGAPLDFTLAKGTDVNSVRRHTQNARYRGRWVAMLISMHV SFLYEGKRGESPELDSFLDEQLQNQKQWRHELNITKDEAVEAYEFFQWCDRLSLILCN RVLPANERALEIATLPDGKRYDVIESSDGNVTVQPWPFLEKKFTVNVEASYLEQLKFD TNDELTQALQTAPIKTLEWTFVRL" gene complement(9361..9852) /locus_tag="DP116_21585" CDS complement(9361..9852) /locus_tag="DP116_21585" /inference="COORDINATES: protein motif:HMM:PF05099.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21585" /translation="MSDIKQALPASEFLKKNLGISEIPLEAYLSYGYALLTIAGADGE VSEAELNWLLNHQRMAGAPEEAIEKYKTFEYKNADLENLLTKITVDVPTWSKSRSLLY HAIQMARADDNYSLEEQKAVKKAAKLLKVEDDIALALNRLVETEEAVTALRKALLQTE VLA" gene complement(9947..11173) /locus_tag="DP116_21590" CDS complement(9947..11173) /locus_tag="DP116_21590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314805.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="PRJNA477356:DP116_21590" /translation="MHQNIIHESSNTYSEKPSNQRLRLAYLDGIRGLAALYVVLVHCW EPSLADALQPALLWLPMAKFLRYGIFAVVIFIVLSGYCLMLPVVRSDKKYFSGGLLGF FKRRIRRILPPYYAALLLCTLIGIFILWIEPADAFGLEDERFHILKDLFSPRFSLHDV VVYLLLFQNFGSLYINKINGPTWTVAVEWQIYFLFAVLLIPIWRRLGLFYTVAIAFAV GITLNYLMGELSFSVHPWFLGLFALGMTAAEINFSQKPSLMWMKNSLPWHKLAALFTC FGFLTEWIRFNLIQQIPEWVIHYFIALGTACFLTYCTNFLINGKPLPPAVQFLESPRL VALGVFSYSLYIIHAPIVWLVYQILLSQQLSPSILAFRWFVIAVPLSIVVAYVFYRFC ERPFMSHLPAKVKSQG" BASE COUNT 3629 a 2805 c 2799 g 3454 t ORIGIN 1 cccgacaaca ggcgactggc gtatgcgcaa agcgcacgcc caaagggcta acgcgaagcg 61 tctccgaagg agatacccga agggctacgt ctctacaatc gtgtattgca cgcatatgcg 121 caaagcgcac gcaagcgagg gctatcgaga agcgctatat gcaagtcata cgctaaagca 181 gcttatcatc actgcttcag ttgaaaatct tcatcaaaat ctttcagcga gggatagaca 241 ataggaggcg ttgcattaat cgttaacaac cgctgcttat actcttctaa ataagcttgg 301 cgacgctcat ccccgatgcg acttgcgccc caaacgatga aaggcggcaa aacatcgaaa 361 cccacaaagc gaaaaatccc gtgattaatc ggatagagaa tggtacgaat ctcaccattg 421 atccctattt cggtgtacat ggttaatgca ccacctgtgg tgagagacag catggctttt 481 ttgccttgaa aaactccatt gtcataaaac ctcccgccac cataagttcg ccccattgca 541 aagactttat caacccaacc cttgaggata gcaggaagtc caaaccacca aagaggaaac 601 tgaaaaatca gcacatcgca ccagtcgagt ttctccatct ccgctttgat atctggggca 661 aagccatcaa cttcagtggc atacatttct tcactttgct gattgtagta gtttagttct 721 ttattataag taaagttgcg 
gcggtcagaa actgggttga actgcatggc ataaaggtca 781 gaaataatca cctcatgtcc tgtggtatac agtacttcct tggcatagcg agtcagagca 841 ccattgaaac tgttgggttc tgggtgggcg tgaacaataa agaatttcat cttgatatcc 901 ctcaacactt atccgagtct aatatgctgc ttataataca gatactaaac atggataaac 961 ggataagggg agtcagtgct gtgtgttccg atactcccct aaaattttta agttgtgggc 1021 ttctgaattg tttttgccga ctggctaaat aacaaatctc ttgacttttc tatatcaaga 1081 gcttgatttt tgaaggcttc tgccttttcg taagcacgaa ttgttgcggg acgcacctga 1141 attgtttcaa accagcgctt taggtgagga aagttctcta acttttgacc ttggcgctca 1201 tatggcacaa tccacggata gctagcaata tcggcgatgg aataatcacc agcgagaaac 1261 tctctatcag caagccgttt atctaggaca gcgtataaac gtcccgtctc attcacataa 1321 cggttaatag catactcaat tttttggggt gcgtattggc tgaagtggtg gttctgtccc 1381 gccatcggtc ccaaaccgcc catttgccaa aataaccatt gcaagacttc aacgcgttcg 1441 cgtaagcgtg cgctagcgca atcgcgcaaa tctgctggaa tcaacttccc agttttttct 1501 gctaaataca gcaagatagc accagactca aaaactgaaa tcggttcacc acctcctgca 1561 ggttcatgat ccacaattgc ggggatacga ttgttaggag aaattgtgag aaactcaggc 1621 ttaaattgat ctcccgcgcc aatgttgaca ggaataacag tgtaaggcaa actgacttct 1681 tcaaggaaca tcgtgatttt atgtccgttt ggtgtcgtcc aataataaag ctcaatcatt 1741 ggttgtttcc taatgtttaa atgaactgtt tgtcacaaga agcttatctg atatcttgca 1801 ccctcctccg aacgcgggtc gctcatgggg agccagtact gaaccgtgtg cgtggcacga 1861 agtgccatag gagggtttcc caaaagaggt atctggtgag accagcgcgc atgctgtcgg 1921 ggagccagtg cggtgagtcc agcactgcgg tccttgtttc ccgacaacag gtgactggcg 1981 tatctcctga gcccttcggg cacggccccg tgccgaacgg agacgcttcg ctttagccct 2041 ctgggcgtgc gcaaagcgca tacgccctaa ggcgtgcgca aagcgcatac ccgtaagggt 2101 cttggggaac ccaggcgtag agcatagccg cgttttccca aaagaggagc cactgcggtg 2161 cgtcgctgtc ctccgttgaa gcatgtggcg tgcgacccgc gttcccgctc ttgcaaagcc 2221 gtgccgtagg catagggcgt tggaacgaat ctccgattcg aggcaacaaa gctccgcttt 2281 gttggggcaa acggaccggc gaagccatcg cttttagggt gcttcaaatc tgtgccatac 2341 cgccatcgac aaacaactcg atgccgttca caaagctgct gtcgtctgaa gcaagaaaga 2401 caacggcttt ggcaatctca tcgggcgtgc 
ctactcttcc caacgggatg gtgctggctt 2461 ggctgtccac gaatgcctgc acctgctcat cattcagtcc cagaagattg tagccaggag 2521 taggaaccac gccaggacta atggcgttaa cccggatctt gcgctctttg aggtcgagtg 2581 tccaattacg ggcaaacgat cgcacggcgg ctttggtagc actgtaaacg ctaaaggctg 2641 gggtggccct gatagaagtg gctgaggcgt tcaggatgat agaagcgccc tctggcaaca 2701 gaggcagtgc cttctgcacg gtgaacagca aacctttgac gttagtattg aacgttttgt 2761 caaagtgttc ttcggtgatt tcgccgagtg gtgcgagttc tccaccgcca gcattggcga 2821 agatcacatc gaggtgtcct tgctcttgct tgatcgtggc gaacaggcga tcgaggtctg 2881 ccagattgga ggcatcgctg tgaacacccg tgaggttttt accgatcgct ttcacggcag 2941 catcaagttc agtttggcga cgacccgtga tgaagacata ggcaccttcg gcgacaaagc 3001 gcttggcagt ggcaagaccg atgccgctgg taccgccggt gacaagagcg acttttcctt 3061 ctagtttttt catgatgatg agtcctgtga ttgctgagga gtgaaagagt cgtccgcgac 3121 ggatccttga aaattggatt tgtcccggta acaactcgat ggattgtgga acgctcaatt 3181 gcttggatcg aaggctctcc ggagtttgat caaaaacagt taaaggacat ggttgaaatt 3241 gtttgatttg gtagatgatt ttggtgcgat tgactttggt gcaaagtcaa tcgctcatga 3301 gtcaatcaaa tgagcagtac ctaatgactt gctgtctacc cattgtttgc tcgtgatgtt 3361 ttttactttg gcgagaatac gtggacagtt ctcatctcaa ccttcataag ggtttcccag 3421 tctatccaca aagacgttgt gggaaaatgg cagacgtccg ggattacgac gtggttctgc 3481 tgcatagcca acaggtacaa gaactgcgat cgcaatatct cgattgtccg cagcaccaat 3541 cacttccttg actttctctt caatccaacc attcataaag caggtggaca agcccagact 3601 ttctgctgct aacaccaaat gagtcgcagc aatcatagca tctttgatgg catactcccg 3661 ccgtttgtct ccaagtcctg cttgatactg cgggatattg gttttaaaat agtttaccgt 3721 gccctgattc catgcacccg tttcgagtcc ctgattcaaa atcggtgtta aatcttgttc 3781 tccagcagtc gcatcagcag caaagacaaa tgtcacgggc gcttcaacga cttgtttttg 3841 attccaggct gctgctgcaa gagctgcttt ttgtgcgtca tcttgcacaa gaactatccg 3901 ccaatcttga atgttataac tactgggtgc tgctacagtt agcttcacga gttggtgaag 3961 tagttctgga gatataggat ctggtttgaa agtcttgata gaacgacgct gaataatggc 4021 gctaggtaca tccagaggtt tagtttgaag gttgagttct tgaaccatga gacctatccg 4081 ttaattttca caaagattat aagcttggat cacagctgtt 
cgtcattcgt acatttataa 4141 gaacaacgaa ttattttgct cctattaaag agggatattg ctcttctact gtcggatagt 4201 cagtataacc cttctcccca ccaccataaa aactcgtcga atctgcctca gtcaggggtg 4261 cattcaaagc tatacgtcgg ggtaaatcgg gatttgagat aaatagtgtg ccaaaagaaa 4321 ctagctctgc ttctcctttt gccaaaactg cattgccttt ttcaggagtg tattccccat 4381 taaccatgat tgtaccgtga tagcgctccc gtatatgact ggttggcacc acggtcgccc 4441 catgtcggat atctgcttct atcgcctcaa aaatatgcag atatgctaaa tcaaactggt 4501 tcaaggcttg ggcaacataa ccaaatgttg cgagcggatt ggagtcttgc atatcgttaa 4561 acgttccact cggagacaaa cgtaccccaa cccgttttgc accccacact ccaacaaccg 4621 cttcggttac ctccaacaga agtcgggcac ggttttctac agaaccgcca tatttatctg 4681 tacgtttatt cgtaccatcc cggaggaatt ggtcaagtaa ataaccattg gcaccgtgaa 4741 cctccacccc atcaaaacca gccgccagag cattttctgc tcctttgcga tattgttcga 4801 caatttccgg aatctccgac gtttctaaag cacgaggagt cacatacggt ttcatacctt 4861 cataagtaag tacctgaccc ttgggtgcga tcgcgaatgg tgctaacggt aagtcaccat 4921 tcggttggaa atcaggatgt gaaatgcgtc ctacatgcca tagttgcagg taaattcttc 4981 ctccgtgctg atgcactgcg tctgtgacta acttccaacc ttcgacttgt tcttgcgagt 5041 gaatacctgg tgtataagga tacccttgtc cttgtggaga aacctgagtg gcttcagaga 5101 taatcagccc cgcagaggcg cgttgagtat agtagatggc attgagttgg tgcggtacat 5161 taccctctcc tgcacgatta cgggttaagg gagccatcac tatgcggttg ggcagttcta 5221 aatcacctaa cttgtaagga gagagtaaat tgatgttagt gctcatagat gaaatctcca 5281 aagtctgtcc tatgctaaaa atcaaaattc ctggttgatt cagtcatcac tcagacggat 5341 gaaattgata actgataact ggtaactggt aactgttaaa caactgactg cacaaccccg 5401 ccatccaccc gcaaagctgc accattggtg gctgaagcta tcggactgga tagataaaca 5461 accattgctg caacttcatc agttgttgca aagcgtttga tcagggaagt tggacgaaca 5521 tttttgaaaa agtcagcttc gacttcacta ggactaatac cacggtcttt tgccatcttt 5581 gctacaaagt catcaacacc ttctgaccga gttggtcctg caagaacaga gttgactgtg 5641 acttgagtcc caattgtcga ttgtgctaaa ccccttgcaa tagcaagttg agccgttttg 5701 gtcataccgt agtgaatcat ctcaacaggg atttgaatag cagactcact ggagataaaa 5761 ataacacgtc cccagtttgg ctccaacatc tttggtaggt agtggcgact 
cagccggact 5821 ccactaagaa cgttagcttc aaagatgttg aaccagtctt catcggtaat atctgaaaat 5881 gcttttggct cgtaaacacc aagattgttg actaggatat caatgtgagg aacttgttga 5941 aagacttgcg aggctccttc tttttttgca agatcggcga cgactccgga aatttttgct 6001 tctgggttgg tttgcgtaat tttagcgata gctagcccct tagggggcgc tgcgcaaacg 6061 ctgctgcgca aagcgcagat cgcctgttcc actcgttcag aagaacgtcc agtaataata 6121 actgatgcac cctcttgtgc gagtccccga gcaattgcaa aaccaatacc tgcagttgaa 6181 gcactcacca gcgcagattt accatgcaat ttcaagtcca ttgcgttcgc tccagcaaat 6241 gataggattt acgcattatg cttccgtatt ggtacttaga tgtggattgt agagtttggg 6301 gtggagtcat ttttgtgccc aaccgcatac actcttctat ccactagacg ggctaaaatc 6361 gtaattcgtc attcagttat tgatgacgaa tttaactaca gatactgtgc taaagtctta 6421 ctcaacgttg tctttggcac tgcacccact acagtatcga cttgccgacc gcctttgaat 6481 actatcagcg taggaatgct gcgaattccg taattgctgg caatagtagg attttcatct 6541 gtattgagct tcaccacttt tacctgtcct gtatattcag cagcaatctc gtctaccacg 6601 ggagccacca tacgacaagg tccacaccaa ggtgcccaaa agtctactaa cactggaact 6661 tcactgttaa ggacttcttc cttgaacgtg gcatctgtta catttgcaac ggatgacata 6721 cttgccctcc tattagttcg tttattcgat aatatcgaat aattaaaata tagtcaactt 6781 gatgagataa tgcaagtatg agattcttac atcacccaga ccgaaaacat atttctttag 6841 caggagtgct gtacgccttg ggtgatccag tgcggctgga aattgtgcgg cggctggcag 6901 ttaaggaaga gcaatgctgt gctgactttg attttgccat tgccaaatcc accatgtcca 6961 atcacttcaa gattttgcga gagtcaggtg tcgtcttgac tcgcaaagaa ggaacacagc 7021 acattaatat gttgcgcaaa gaggatttgt cagcactttt tccagggttg ctggacgccg 7081 tgttgcgctc agctaagcct ttgtctgttg gttgttcatc tcaacaaaca acatcacagc 7141 aagtttaatg cttgaagatt ggaaccgctt ccatctgcgg tttcatattc aaataaattg 7201 acttttgtat gaaatatagc cgttgccagg tagattagga catgaactaa tgagaaaata 7261 cggacatcac gagattttca accctgttcc ctgttaagcg ttccctgtta agcgttccct 7321 gttccctgct atatgtccta atagagttgg cgattgctat accacgaaaa cagctccaac 7381 atcaggctgt aatgttatca ccgccctgtc ggtcaatcgt taatcgcaca atcgatagtc 7441 ctaaaataat taggaacagc actaacccaa ttgtgcaagc atagctaatt tccaaattac 
7501 taaatgcttg ttcatacaga tagtaaacta ttgttttcga gctattgcgt ggtccgcctt 7561 gcgtcatgat gtagatttct tcaaacacct tggtggcaga aatagccgaa atcacagcaa 7621 ctaaagctat atatggtttc atcaagggta cagtaatatc ccaatgctta cgcacaccgt 7681 ctgagccatc aattgctgca gcttcgtaga tgtcggtggg gatggattgc aacccagcta 7741 aataaatgac catgtagtag cccaatcctt tccacaccgt cacagccatg acgctaaata 7801 aggcgaagcg gggactcgtc agccaaggga ttccttctgg gaaaataccc ttgagcaatt 7861 gattgagtaa gccattttct gcatacagcc acttccaagc aatcccggcg acgaccattg 7921 aaatcacgac tggtgtatag tatgcggctc taaaccagtg cattccgcgt agtttctgat 7981 tgactaaaat tgccagtgct aagggggcga tgactaagat tggtaccaca cccacaagat 8041 acagcaaggt gttaaataaa gtttgccaaa aaactcggtc gttccacaag cgctgaaagt 8101 tagaaaaacc tacccattgg ggcatctgag ttaaatcata ttcgtagcga gtaaagctga 8161 ggtaaaacgc ttgtagtgcg ggccaaaaga cggttaatcc caagataaac agggcaggga 8221 gcaaaaataa gtaaggagtc agtcgccgtg tgatgagaac ccaatctttg gctgtcaatc 8281 tattcatagg aagattttta tcgtgctgtt gagggatgaa tttcgagttt ggataagtgc 8341 aaaatgaaaa agtaaattaa tgttatgaga gcttttaaac ctaatttatg attgctaatc 8401 tagatcagaa cggttgggaa gttatttatc atcgcgccca tgctttactt gcagctcaaa 8461 tagcgggaca ttggcaccca gaaaagcgcc cgttgcgttg gttagagaca attgcggcta 8521 tttcccatca cgatgatttg gaaaaggaat gggaaggcaa ttacctcact gaagctggtg 8581 caccgctaga ttttacctta gccaagggaa ctgatgtaaa tagtgtacga cgacacacgc 8641 aaaatgcccg atatcgggga cgatgggttg ctatgctgat ttctatgcat gtgagctttc 8701 tctatgaggg taagcgtgga gaatcaccag aattggatag ctttttagac gaacaactcc 8761 agaatcagaa gcaatggcgt catgagttga atattacaaa ggatgaagct gtggaagctt 8821 atgagttttt tcagtggtgc gatcgcctct cactcatcct ctgcaaccgt gtgttacctg 8881 ctaatgaacg tgcgttagaa attgcaactc tgccagacgg caagcgctat gatgtgatag 8941 agtcgtctga tgggaatgtc acagtacaac cttggccttt cctagaaaag aagtttactg 9001 ttaatgtgga agctagttac ctagagcaac tgaaatttga cactaacgat gaactcacgc 9061 aagcacttca gacagcaccg ataaaaactt tggagtggac gtttgttcgg ctctgattag 9121 cgatcatccc gggaacatcc caaatatgta agctatatca aactcagatg attagttgtc 9181 
cgcagcgtta gcgcagcgtg ccctccgggc atacggagca aagcggagtg tagacgcgcg 9241 ctttgcgctt acgctttact cggttaccct cgctgcggac aaacatgttt ttgcatgcca 9301 atgcatgctc ccatcatcgc cagttactca accggatggg gatgtgggga ttctgctcga 9361 ttaagctaaa acctcagtct gtagtagcgc cttgcgtaat gcagttactg cttcttctgt 9421 ctctaccaac ctattaagag caagcgcaat atcatcttca accttcaaca gtttagctgc 9481 tttcttaacg gctttttgct cttcaagtga gtagttgtca tcagcgcgag ccatttgaat 9541 tgcatgatac agcaatgacc ttgattttga ccaagtagga acatcaactg taatcttagt 9601 cagtaggttt tctaagtcag catttttgta ttcaaatgtt ttgtatttct ctattgcttc 9661 ttcaggagcg ccggccatgc gctgatgatt tagtaaccaa ttaagttcgg cttctgaaac 9721 ctctccatcg gctccagcaa tagtgagtag tgcatatcca tagctaaggt atgcttcaag 9781 gggaatttcg gaaataccta agttcttttt cagaaattct gaagcaggca gtgcttgttt 9841 gatatcgctc atcaattacc tttttgattt gtttgagaat ttaatatttt gcagaacctc 9901 ttttacaata ccacaaggat tatgattttt ttatacacgt agctcttcag ccctgagact 9961 tcactttggc tggaaggtgg gacataaaag gtcgctcaca gaatcggtag aatacgtagg 10021 caactactat tgacaatgga acagcgatga cgaaccatct aaaagccagt attgaaggag 10081 acaattgttg gctcagtagg atttgatata cgagccatac tataggagca tggatgatat 10141 aaaggctgta agaaaatact cccagtgcaa cgagccgagg tgattctaag aattgcacag 10201 caggaggaag tggtttaccg tttattaaga aattggtgca atatgtgaga aaacaagccg 10261 tgccaagggc aatgaaataa tggataaccc attctgggat ttgttgtatc aaattaaatc 10321 gtatccactc tgtcaaaaaa ccaaagcaag taaatagggc agcaagctta tgccaaggta 10381 atgaattttt catccacatc aaggatggtt tttgggaaaa attaatttct gctgctgtca 10441 ttcccaaagc aaaaagacct aaaaaccaag gatgaacaga aaaggagagt tcacccatca 10501 gataattaag tgtaattccg acagcaaagg caattgctac tgtataaaat aaccctaaac 10561 gtctccatat tggtattaac aaaacagcaa acaaaaaata gatttgccat tcgacagcaa 10621 cagtccatgt tggaccatta attttattga tatataagct gccaaaattt tgaaaaagta 10681 agagataaac aaccacatca tgcaaagaga accgaggaga aaataaatct ttcaaaatgt 10741 ggaatctttc atcttccaag ccaaaagcat cagctggctc tatccacagt atgaatattc 10801 ctatcaatgt gcacaagagc agagcagcat aatatggtgg tagaatccgt cgaatccgtc 
10861 gcttaaagaa acctaacaag cctccggaaa aatatttttt atcggaacga actactggaa 10921 gcatcaaaca gtagccagaa agaacaataa aaataacaac ggcaaaaatt ccatatctga 10981 ggaactttgc catcggtaac caaagcaagg caggttgtaa agcgtctgca agactgggtt 11041 cccagcaatg cactagaacc acatatagag ccgccaagcc acgtatacca tctaagtaag 11101 ccagacggag tctctggtta ctaggttttt cagaataagt atttgaagat tcgtgaatta 11161 tattttgatg cattggtaat gaagaaatga tgaaaggcta aagatgaaaa attaaacgca 11221 cgaaaatcgg ttacacgagt ttttgaaaat aaactcatgc tatgagcaag gtatttgact 11281 accttaagaa caaaaggtat taagacaaat ttttctgtca cttagtgtca aaaaatggct 11341 tttttagtta aaaggacact cttgacatca aactctattg ataacaaagc taaagcaaaa 11401 aaatcaggtt taatcttcta atgacacaga ggaaagcttt ctcctaatat taacaaagtc 11461 ttgaagacga gaagtctttc ttaggtaaga aaacaactta taaatatcac tcgtgcgatg 11521 gtaggtgaca acctaaatat ataagtatta acgacttata tagatatttt gctttcaggt 11581 aaaactggat tttttgttac caaaagagaa aaaatgacgc tgttcacgaa caatttagta 11641 ttgctatatg tgaacaaact tttgtattga acaaagtaca gttgcttaca aattcacacc 11701 gttatatgta gcctgaaatt gtgatatttt ctcaggaagc taagctccgc atatctacag 11761 cactcgcaac tgaaaattgt tatgaatttg tggagttctg tactaagttg tgtcttatga 11821 aagctcttta tcttgcaagg ctgtgtagag tagaagtcac gcctctaacc ttgaccaaga 11881 aagcttttga aatctataaa attcaggggg gagttacacc accaggaacc ctgacggcat 11941 gaactcttgg aactactagc tcatacccta caaaaactgt tccttttagc agtggttgct 12001 ggatgtgagt cagaaggatt tcacgaagaa aatctagctt aaacataaca aatgacctct 12061 gagcttccat cgtacagttc gctcgtcact gcacatactc tagtaaatgc acagctgctc 12121 tcaacattag acttaatagc aaaaatattg attttatttt tataactaca gataaatgat 12181 tgtaaaaaac gttcctctac tgttgccatt agtaagccat ttccttaatt cctcttgtga 12241 aggactagaa aaaggtgata gaaacaagta gacaatagta catactaaca acctatatgt 12301 agttatttgt gtatactttg aacatagcca aagatactta cttttgctaa gagattaatg 12361 tttctatagt caagtaaacc ataacatcta tgacaataca aatcgtcatg ttgtgcccac 12421 tctttaatag atttttatgg aaatgaatgt aatcttcatc gaatcttgat tgaaagaaaa 12481 aacttccaaa gggcaatctc actgaattca gattgctcaa 
ttaaacgtct atcctccatg 12541 tagaaaatgc gaactgtttt gcctcgacga gcaggaaagc ttcggtatgc cccggtataa 12601 ttataatagg gatatccatc tgggggttgg ctagtgcggt acgttgctta taaactgtac 12661 cccataaggg ttacagggga ttaagcg // LOCUS NODE_2679_length_12677_cov_5.06409412677 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12677) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12677) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12677 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1912 /locus_tag="DP116_21595" CDS <1..1912 /locus_tag="DP116_21595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315855.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="glycoside hydrolase family 10 protein" /protein_id="PRJNA477356:DP116_21595" /translation="SPPPPPSAPKIATAPQPRPIPSITPPKSEEAIDQLEQKVRFDVV PNSQAPISRTEVLTYQHELENLIGRVESANLAALALSENPNNPALAKTQQAQVASTRP GVAVPTTEQALDAAREVVKNLPDLIAEKNYAQARQQWLIARGNLWNQFPLNKRLAQPE IRAMWLDRGTIIRAGNEQGLAQIFDRIAQAGINTIFFETVNAGYTIYPSKITPQQNPL VSGWDPLASGVKLAHERGIELHAWVWAFAAGNQRHNQLLNINPNYPGPVLAAHPDWAG YDNRGQMIPSGQSKPFFDPANPQVRQYLLSLYEEIVSRYDVDGLQLDYIRYPFQDPGA NRTYGYGKAAREQFQQLTGTDPAKISPKQQQLWQKWTEFRTQQIDSFVAQVSQQLRQK RPNLILSAAVFPLPEQERIQKLQQHWEVWARRGDIDLIVPMTYAQDTPRFERLAQPWI TTSTQLGSSLVVPGIRLLSLQTVGAFDQIQLVRDLPVAGYALFAAQNFTNDLNQVFSN TQGSAQRAQKEPIPHRQPFQTAWVRYTALQREWKLAQQNNKLRITSTTLSSFNSQAEV VQSALNQLAINPDNSKLATARTSLLTLQSRFREWMRLQALENPYQVRVWENRLATIEK LLRYGERVQLHR" gene complement(2133..2342) /locus_tag="DP116_21600" CDS complement(2133..2342) /locus_tag="DP116_21600" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21600" /translation="MCINVCKNLLAQNSFLSIKLCIEQETTQKEVALVYLVVGTATAT RTTDALVAFAHAMLWLMLPERRCGL" gene 2335..5256 /locus_tag="DP116_21605" CDS 2335..5256 /locus_tag="DP116_21605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009460030.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_21605" /translation="MHTFAQLISLPSLDSAIDFFPLTVAPETLLSDVIALMKKHQRQT RCVLIVENLQIVGWFTEHDALHLLFSDVNLQTAKVSEVMRTSVPTLKYSEIPSVVFIL SFLRQYNLFCVPLIDEQGQLVGLCTYETICQVIEQQARDIVSNSSHCCTFKSCEDITE QKRTTETLCESEEQFRKLADELPILIWMKDANGLNTFVNQSCVEFTGRPLEELLGEGC VEGIHPEDKQHCWDTYQAAFKNHQRFQYTCRYLRPDEECRSLINIGVPRFASDGSFVG YIGCSIDITEHVETEAALQQAQAELKQVNAELETRVEERTRAVKEMNRQLIFEMTDRL YAEEQLRQSQQMLQLIMDNIPQGIFWKNTASVYLGCNRIFAKMFGFESAENVVGKTDY DLVVNEQEADFYCESDHRVMQTDTPEYDIIFRHIRKNGKQAWLEASKIPLHDHEGNVV GVLGTFEDITERKQAEENLQLRDRAIAASSNGIIIADVTMPGSPIIYANSALEEITGY SVEEVIGKNYSFLHDYDIDQPGMTELHDAIAQGKSCTVVLRNYRKYGTLFWNEVSISP VYDNHGQFTHYISIQSDITERKQAEVALLVSQERLQYLLSSSPGVIYSSKIHSEHSIT FMSANVIATLGYEAQEFTASSRFWASHIHPEDLQQALVAVSKVLEQGHNSHEYRFLHK DGTYRWMYDQAKLVQDNAGDALEIVGYWLDITERKQLEEELKASLHKEKELNELKSRF VSMTSHEFRTPLSTILSSSELLEHYRHKWTEEKQLSHLHRIQTSVKRMTDMLNNVLVI GKVEAGKLDFRPVPFDLVEYCHYLVEELQLNVNNQHAINFSSQSDSMPCCMDEKLLGH ILGNLLSNAIKYSPTGTTVRFTLTFHHARAVFTIQDQGIGIPPADLPHLFDSFHRATN VGNIQGTGLGLAIVKNCVETHHGEITVKSEIGLGTIFTVTLPLNNYVPSEASHDQNFS H" gene 5234..6454 /locus_tag="DP116_21610" CDS 5234..6454 /locus_tag="DP116_21610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740453.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diguanylate cyclase" /protein_id="PRJNA477356:DP116_21610" /translation="MTKILVIEDEKLVRENIIDLLAAEDFDTIAAANGRVGLDLAVSQ IPDLILCDLMMPEVDGYGVLTTLREEPVTASIPFIFLTAKSARADFRQGMDLGADDYL TKPFTRTELLNAITSRLAKKATLAKQVSTKFDAQTFSPKVQMMEMYLRRAIEREEFEQ FLVYYQPIVDIHSGQIIGAESLLRWQHPELGMVTPTELIPLAESTGLILPISDWVLNK VCKQIKNWHNQGFTDLRVAVNVSGNQLKQPNFSQKIIHLLLANNLVPDSLVLELTENI IMPDINQAISTMNEIHSFGVKIAIDDFGAGYSSLIYLKQLPIHTLKIDRYFIQCIAND SQKAVITTALIQMAHNLNLHIVAEGVETEQELAFLRQHKCDAIQGFFFSHPLPAREFE KILFANKRLNLSNV" gene 6663..7757 /locus_tag="DP116_21615" CDS 6663..7757 /locus_tag="DP116_21615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138766.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_21615" /translation="MYKILVIEDETNVRQNLLELLTYEDFDVIAAENGQLGVKLAQKE IPDLIICDVMMPELDGYGVLKTLRQQSITATIPLIFLTAKTEKTYLRQGMELGADDYL TKPFTRAELIAAISSRLKKQVAIRQQSQRRLDDLRSSITMSLPHEMRTPLNGILGFSE LLMKEADTLSRHEIFEMAEGLHKSGKRLHRLVQNFLLYTELEMISTDPQRMKNLESHK TVFPSMALEKLITEKAQQAGRYTDFQVNLQSPCCVQICETRLSKIFEELIDNAFKFSI SGTQVYLKSTAVTNQLIISLSNYGRGMTAAQIAELGAYRQFERQLYEQQGSGLGLIIA KRIVELYGGELRIHSKLGEKTVIQVVLPCI" gene complement(7858..8187) /locus_tag="DP116_21620" CDS complement(7858..8187) /locus_tag="DP116_21620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859905.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_21620" /translation="MSKDVKSVIDEFYSSVNMTAKELKSWLETKESKSVGQKEGDDES IGHKSGRHIVELLQNKKDDYTDDEISHMKKVISYIHRHSAQQPDGDIEHTHWRYSLMN WGHDPLK" gene complement(8193..8411) /locus_tag="DP116_21625" CDS complement(8193..8411) /locus_tag="DP116_21625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318721.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2945 domain-containing protein" /protein_id="PRJNA477356:DP116_21625" /translation="MTDELKKGDKVKWNTSQGETTGEVEKKLTSPTQIKGHHVAASKD NPEYLVKSDKTGKEAVHKPDSLEKIEES" gene complement(8751..9707) /locus_tag="DP116_21630" CDS complement(8751..9707) /locus_tag="DP116_21630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315809.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="protoheme IX farnesyltransferase" /protein_id="PRJNA477356:DP116_21630" /translation="MIETNVSRHHQTFLQVIQSYYQLTKPRIIPLLLITTAGSMWIAA KGEVDPLLLLVTLTGGTLAAASAQTINCVYDRDIDYEMERTRHRPLPSGKIQSRDALI FASALAVISFTLLYVFANLLAALLAMSGIVFYVLIYTHFLKRHSTQNIVIGGAAGAVP ALVGWAAVTGTLSWGAWVVFAIVFLWTPPHFWSLALMIRDDYAKVGVPMLPVIAGTTP TVRQIWLYTLVTVAATFSLIYPLQISGMIYGVIAVTLGAVFIYKAWQLLHNPEDRNLA KGLFLYSISYMMLLCLGMVIDSLPISHHVINVVVDKAHLLIG" gene complement(9781..10743) /locus_tag="DP116_21635" CDS complement(9781..10743) /locus_tag="DP116_21635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="heme A synthase" /protein_id="PRJNA477356:DP116_21635" /translation="MSEFVLEQQNITTAPPQVPQERIRRLVWRMCIATLILMAIGSAT RVMNAGLACPDWPLCYGELVPTKQMNFQVFLEWFHRLDAALIGVSAIALVGMSWWSRR SLPRWLPWASTFALFLIVFQGVLGGLTVTELLRFDIVTAHLGTALLFFTTLLVIGTAL TPYQPTGTVGNLPWVGLTAAILVYLQSLLGALVGSRWALHQCFGTSQLCTVMYSHIAG IVPPTVATLAVVFLSWRTPALHPALRRLANIAGGLLIVQLLIGVATFRLHLQVEPLTV SHQAVGAALLGTLVCFTVLALRDWVHKRGLNANNVTINANTARE" gene 11421..12503 /locus_tag="DP116_21640" CDS 11421..12503 /locus_tag="DP116_21640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome C oxidase subunit II" /protein_id="PRJNA477356:DP116_21640" /translation="MKIPSSIWTLLIGIGLTLVSLWYGQNHGLLPTAASDEAPLVDGL FNTMMTVSVGIFVLVEGVLIYSAIRYRRRAGDNADGPPVHGNVPLEILWTAIPTVIVI GISVYSFDVYNEMGGFSPHAIHEAPMTSQVMNMPGAAIAATLSDTPPSTEPNLNQEKS DEAMQDPATAAVRNADQIPQKRNAPGVGSVAPSLGPTPENEGKPPAFVVNVTGLQYAW IFTYPDTEVTSGELHVPIGREVQLNMTANDVIHAFWVPEFRLKQDVIPGRQSEIRFTP NKEGDYTLICAELCGPYHGAMRTQVVVQKPEEFEQWIQEQEVASADELKQAVAVNPAV LTPDEFLAPHTSHMGIHPEMLHQIHH" BASE COUNT 3793 a 2726 c 2669 g 3489 t ORIGIN 1 atctccccca cctcccccat ctgctcccaa aattgccaca gctcctcaac ccagacctat 61 tcccagcatc acacctccaa aatcagagga ggcgattgac caacttgaac aaaaagtgcg 121 gtttgatgtt gttcctaact cacaagcacc gattagcaga actgaagtgc tcacctatca 181 gcatgagcta gagaatctta ttggtagagt cgagagtgcc aatttagcag cactggctct 241 ttctgaaaat cctaacaatc cagcgttagc aaagacgcaa caggcacagg tagcatcaac 301 aagacctggg gtagcagttc ctaccacaga acaagcttta gatgctgctc gtgaagttgt 361 gaaaaacctg cccgacttga ttgcagaaaa aaactatgct caggctcgtc agcaatggtt 421 gatagcaaga gggaatttgt ggaaccaatt tcctctgaat aagaggttag ctcaaccaga 481 aatcagggca atgtggcttg atcgaggaac gattatccgt gcggggaacg agcagggact 541 ggctcagatt tttgatcgca tagcgcaagc cgggattaat accatatttt ttgaaacggt 601 taatgctggc tacaccattt atcccagcaa gattacaccc cagcaaaacc 
ctttagttag 661 tggttgggat ccactagcat ctggtgttaa gttagctcat gagcggggta tagaattgca 721 cgcttgggtt tgggcttttg ccgctggtaa tcaacgtcac aatcaactcc tcaatataaa 781 tcctaattac ccaggaccag tacttgcggc tcatccggat tgggctggct atgataatcg 841 cggtcaaatg attccttctg gacagagtaa accattcttt gatccagcga atcctcaagt 901 acggcagtac ttgctcagct tgtatgagga aattgtcagt cgctatgacg tggatggttt 961 acaactagac tacattcgct atcccttcca agatccaggt gctaaccgga cttacggtta 1021 tgggaaggcg gctagagagc agtttcaaca actgactggc acagatccag cgaagatttc 1081 tccaaagcaa cagcaactgt ggcaaaagtg gacagaattt cgtactcagc aaattgatag 1141 ctttgttgct caagtgtcac aacaactgcg acaaaagcga ccgaacttga ttttatcagc 1201 tgctgtattt ccccttccag aacaagaacg cattcaaaag ctacagcagc actgggaagt 1261 ttgggcaagg cgcggtgata tcgatttaat tgtgcctatg acttatgccc aagatacccc 1321 gcgctttgag cgactggcac aaccttggat aacaacttct acacaactgg gttcttccct 1381 ggtggtacca ggaattcgcc tactttcttt gcaaacagtc ggagcattcg atcaaattca 1441 actagttaga gacttgcctg ttgctggtta tgctttgttt gctgcacaaa atttcaccaa 1501 tgacctcaac caagttttta gtaatactca aggtagcgct caacgcgcac aaaaagaacc 1561 tattccccac cgccaacctt ttcaaactgc ttgggttcgt tacactgctt tacaacgtga 1621 gtggaagttg gcacagcaaa ataacaagct acgaataact tccacaacgc tttcaagttt 1681 taattctcaa gcagaagttg tgcaaagtgc tttaaatcag ctggctatta acccagacaa 1741 tagtaagttg gcaacagcaa gaacatcgct tttaacctta cagtctcgat ttagagagtg 1801 gatgcgtcta caagcgttgg aaaatccata tcaagtcaga gtttgggaaa accgtcttgc 1861 cacgatagag aagttgttgc gttatggaga acgggtacag ttgcaccgct agggagttaa 1921 gtaaggtgag cattgcaatg cctaccctac ttaacttctt gagtgaagcg tattggttca 1981 ccaagaagca gtgcactggc ttgcctctgt taatatatat gaagcagaaa tatagtataa 2041 atcctcactt ttttaaggga atatacgcaa gatcttagcg atagacaagg taaaaatgaa 2101 aatacaccag gacttacgca cgaacaacgt aactatagtc cgcagcgacg ttcgggaagc 2161 ataagccaaa gcattgcgtg tgcaaatgcg acgagtgcgt ctgttgttcg cgtagccgtg 2221 gccgtaccca ctacgagata caccagcgct acctccttct gagttgtttc ctgttcaata 2281 cataacttaa tagatagaaa actattttgt gctaagagat tcttgcaaac atttatgcac 2341 
acctttgctc aattgatttc actgccatca ttagattctg caattgactt ttttccccta 2401 actgttgcac ctgaaacact tttatcagat gtcattgctc tgatgaagaa gcatcaacgg 2461 caaacaaggt gtgtgttgat agttgagaat ttgcagatag tagggtggtt tacagagcat 2521 gatgcattac accttctatt ctcagacgtc aacctacaga ctgccaaagt gtctgaagtc 2581 atgagaactt cggtaccaac actaaaatat tctgagattc ctagtgtagt atttatactg 2641 tcatttttac ggcaatataa tttgttttgt gtaccactca ttgatgagca aggtcaactc 2701 gttggattat gcacttatga gacgatttgt caagtcatag aacaacaggc gcgggacatt 2761 gtatcaaatt cgtctcattg ctgcacgttt aagagctgcg aagatattac cgagcaaaaa 2821 agaactactg agacattgtg tgagagtgag gagcaattcc gcaaactggc agatgagtta 2881 ccaatactca tttggatgaa ggatgctaat ggcttgaaca catttgtcaa tcaatcctgt 2941 gtagaattta caggacgtcc tttggaagag ttgctaggtg aaggttgtgt agaaggaatt 3001 catcctgaag ataaacagca ctgttgggat acataccaag cagcattcaa aaaccatcaa 3061 cgtttccagt atacctgtcg ttatttgcgt ccagatgaag agtgtcgttc gctgatcaac 3121 ataggcgttc caagatttgc gtctgatggc agctttgttg gttacattgg ctgctctata 3181 gatattactg aacatgtaga gacagaagca gcattacaac aggcacaggc agaactaaag 3241 caggtcaacg cagagttaga aacgcgagtt gaggaacgaa caagggctgt caaagaaatg 3301 aatcggcagc tgatttttga aatgactgat cgcctgtatg cagaagagca actgcgccaa 3361 tcgcagcaaa tgttgcagtt gatcatggac aacatccccc aaggtatttt ctggaaaaat 3421 acagcttccg tgtacttagg ttgtaatcgc atctttgcca aaatgtttgg ttttgagagt 3481 gcagaaaatg ttgtgggcaa gactgactat gatttagtcg tgaacgaaca ggaggcagat 3541 ttctactgcg aatctgatca cagagtgatg cagactgata cgcccgaata tgacattatc 3601 tttcgccaca tccgaaagaa tggcaaacaa gcttggctgg aggcgagtaa aatcccactt 3661 catgatcatg aagggaatgt cgtgggggtt cttggtactt ttgaagacat tactgagcgc 3721 aaacaagcag aagaaaattt acaactgcgc gatcgcgcga tcgccgctag cagtaatggc 3781 atcatcattg ctgatgtcac aatgccaggc tcgccgatca tatatgctaa ctcagcattg 3841 gaggaaatca ctggttattc tgtagaggag gtcattggga aaaactacag ttttctccac 3901 gattatgata tcgatcaacc gggtatgacc gagttacacg atgccattgc ccaaggaaaa 3961 agttgcaccg tagttttacg taactaccgc aaatatggta cactcttctg gaatgaagtc 4021 agtatttccc 
cggtttatga taatcatggt caatttactc attacattag tattcaaagt 4081 gatatcacag aacgcaagca ggcagaggtg gcgctcttag tttcgcaaga gcggttgcaa 4141 tatttacttt cttctagtcc tggtgtgatt tatagcagca agatccacag cgagcatagc 4201 attacgttta tgagcgcaaa cgttatcgcg acactggggt atgaagcaca ggaatttact 4261 gcctcttcca ggttttgggc tagtcatatt cacccagaag acttacagca ggcgcttgta 4321 gcggtgtcaa aagttcttga gcaagggcac aatagtcacg agtaccgctt tttgcacaag 4381 gatggcacat atcgctggat gtacgatcaa gcaaagctcg tgcaggataa cgctggtgat 4441 gcattggaaa ttgtcggtta ctggctggac atcacagaac gtaagcaact agaagaggaa 4501 ttaaaagctt cgctgcataa agaaaaagaa ctgaatgaac tgaaatctcg ctttgtctca 4561 atgacttccc atgaatttcg tacgccgctg agcactattc tttcttcatc agagttgtta 4621 gaacactacc gccataaatg gactgaggaa aaacaactgt ctcacctgca tcgcattcaa 4681 acttcagtca agcgtatgac tgatatgtta aacaatgtct tagtgattgg gaaagtggag 4741 gcaggaaaat tagattttag acctgtgccc tttgatttag ttgaatactg ccattatctc 4801 gtggaagaat tacaactgaa tgtcaataat caacatgcaa tcaactttag tagtcaatct 4861 gattccatgc catgctgtat ggatgaaaaa ttgctagggc atattcttgg taatttactc 4921 tcgaatgcaa ttaagtattc tccaacgggt actactgtca gatttactct gacgtttcat 4981 catgcccgag cagtctttac aattcaagac caaggaatag gtattccacc agcagactta 5041 cctcacctgt ttgattcttt tcacagagct acaaatgtag gtaacatcca aggcacgggg 5101 ctaggactag caattgtaaa aaactgcgtg gaaacccatc atggtgaaat tacggtaaaa 5161 agtgaaattg ggctgggaac tatatttact gtaactctgc cgttaaacaa ttacgttcca 5221 tcagaggcaa gccatgacca aaattttagt cattgaagac gaaaaactag tacgagaaaa 5281 cattatagat ttattagccg cagaagattt tgatactatt gctgctgcca atggacgcgt 5341 tggattagat ttagcagttt ctcaaattcc tgatttaatt ttatgcgact taatgatgcc 5401 agaagttgat ggttatggtg tcttgacgac attacgtgaa gaaccagtca cagcaagtat 5461 tccatttatt tttctcacag caaaatctgc tagggctgac tttcgtcaag ggatggattt 5521 aggtgctgat gactatctga cgaaaccatt tactcgtact gaactattaa atgcgatcac 5581 aagccgctta gcgaagaaag caactttggc aaagcaagtc tctaccaaat ttgacgccca 5641 gactttctct cccaaagtac agatgatgga aatgtatttg cgtcgtgcta tagaacggga 5701 ggagtttgag caatttctgg 
tttactacca acctatagtc gatattcatt ctggtcaaat 5761 tatcggtgct gaaagtttat tacgttggca gcatccagaa ttaggaatgg ttactcccac 5821 agaactgatt cctttagcag aatcaactgg tttaattctc cctattagtg attgggtatt 5881 aaataaagtt tgtaaacaaa taaaaaattg gcataatcaa ggattcacag acttacgcgt 5941 agcagttaac gtatcgggaa atcaattaaa gcaacctaac tttagtcaaa aaattattca 6001 tcttttattg gcaaataatt tagttccaga tagcttagtg ttagagttaa cagaaaacat 6061 cattatgcca gatatcaatc aagcgatttc tactatgaac gaaatccact cttttggagt 6121 caaaattgcg attgatgatt ttggtgccgg atattcttct ttgatttact tgaagcaatt 6181 accgattcat accttaaaaa ttgatcgata ttttattcaa tgtattgcta atgattcaca 6241 gaaagcagtc attacaacgg cattgattca aatggcgcat aatcttaacc ttcatatagt 6301 tgctgaaggt gtggaaacgg aacaagaact tgcttttttg cgtcaacata aatgtgatgc 6361 aatacaaggg tttttcttta gccatccatt accagcaaga gaatttgaaa aaatattatt 6421 tgctaacaaa cggttgaatc tgtcaaacgt ataatatttg ttagtcaatg tcaatcacgc 6481 attatcaatt atcaattatc aattggagat agaaatgatc gttagaaaaa caattgataa 6541 ttgacaattg atattttaat cagccatgtc aaaaactact aagaaccaag taaagtgcgg 6601 taaatagtag tcaattaaat caaaaaaact ctctaacttt caataattaa tctttattga 6661 atatgtataa aattttggtc atagaagatg agacaaatgt cagacagaat cttttggaat 6721 tactaaccta tgaggatttt gatgtcattg ctgcagaaaa tggtcagctt ggtgtgaagt 6781 tagctcaaaa agaaatcccc gacttgatta tttgtgatgt catgatgcca gaactagatg 6841 gttatggcgt tttaaaaaca ttacgtcaac aatctataac agcaacgatt ccgttgattt 6901 ttttaacagc taaaactgag aaaacttact tacgccaagg gatggaattg ggagctgatg 6961 actatttaac aaagcctttt acccgcgcag aactcatcgc tgcaatttct tcccgattaa 7021 aaaaacaagt tgctattcgt cagcaatcac aaagaaggct ggatgatttg cgtagtagta 7081 ttaccatgtc tcttcctcac gaaatgagaa caccactaaa cggtattttg ggtttttcag 7141 aacttttaat gaaagaagcc gatacccttt ctcggcatga aatttttgag atggcagaag 7201 gtcttcataa atcgggaaaa cgcttacaca ggttagttca gaacttcttg ttatacacag 7261 aactcgaaat gatatcaaca gacccacaac gcatgaaaaa cttggaaagc cataaaactg 7321 ttttcccttc aatggcgttg gaaaaattaa ttaccgaaaa agctcaacaa gcaggacgtt 7381 atacagattt ccaagtcaat ttacaaagcc 
cttgttgtgt acaaatttgc gaaacaagac 7441 tttctaaaat ttttgaagaa ttaattgata atgcctttaa gttttcaata tcaggaacac 7501 aagtttatct aaaaagcact gcagttacca atcaactgat tatctcactt agcaactacg 7561 gacgaggtat gacagccgct caaattgctg aattgggagc ataccgacaa tttgagcgcc 7621 aactctatga acaacaaggt tcaggtttgg gtttaatcat cgctaaacgt atagttgagt 7681 tatacggagg ggaactgcgt attcacagta agctaggaga aaaaacggtt atccaagtcg 7741 tgcttccgtg tatttgagcc tactcaaagt cgtacccacc gttgaatatt ttagtctttt 7801 tgcgctggta tatttattgg ttctcaggag ttttgcactt ttataactcc taacctctca 7861 ttttaggggg tcatgtcccc aattcattaa agagtaacgc cagtgagtat gctcaatatc 7921 accatcaggt tgctgagctg aatgacgatg gatataacta atgacttttt tcatgtgtga 7981 aatctcatca tcagtgtagt catctttttt attttgtagt agttcgacaa tatgcctacc 8041 ggatttatgt ccgatagatt catcatctcc ttctttttgt ccaaccgatt ttgactcctt 8101 agtttctagc caagatttaa gttcctttgc tgtcatatta actgaggagt aaaactcgtc 8161 aatcactgat ttgacatctt tactcataac gttcatgact cctcgatttt ctctaaagaa 8221 tcaggtttat gaactgcctc ttttccagtt ttgtcacttt taaccaagta ctctgggtta 8281 tcttttgagg cggcgacatg atgtccttta atctgtgtag gtgaggtgag tttcttttcc 8341 acttcacctg tggtttcacc ttgcgatgta ttccacttca ctttgtcgcc ttttttcaac 8401 tcgtcagtca cgggtttttt cctgcttgat gttttccacc cgttaattat gtttggacta 8461 tcagaagaat taatctttct attgacataa aaagtggtaa attcctctgt tgaaagggct 8521 accccgtacc agatatctgc gttgcaaaaa tgtgttctcg gtttggtctg ttgttaccgc 8581 tggcggcgtt actgtgggaa cctgtccagt aaatcgtgtt attcaaacgg tgcataattg 8641 actaaatatg aactggtgtt actgcatatg aaacaaagtt gctccgtgcg actaaaagcc 8701 tcgcggagca acaacagttt agaatcctcg tttttcacaa cgagtagaaa tcaacctatc 8761 agtagatgcg ccttatctac cacaacatta atgacatgat gactgatggg aaggctatca 8821 attaccatac ctagacataa caacatcatg tatgagatgg aatagagaaa taatcctttg 8881 gctaaattac ggtcttctgg attgtgcagt aattgccaag ctttgtaaat aaaaactgct 8941 cccaaggtaa cagcaatgac tccgtaaatc attccactga tttgcaaagg ataaatcaat 9001 gaaaatgttg ctgctactgt aactagggtg tataaccaaa tctgtcgcac cgtaggcgta 9061 gttcctgcaa tgacgggtaa cattgggacg ccaacttttg 
cgtaatcatc ccgaatcatc 9121 aaagcaagag accagaagtg aggcggtgtc cacaaaaaga caatcgcaaa tactacccat 9181 gctccccagc ttaacgtacc agtgacagca gcccaaccta ccaaagccgg aactgcgcct 9241 gctgctccac caataacgat attttgagtg ctatgtcgct tgagaaagtg cgtataaatc 9301 aggacataaa atacaatccc tgacattgct agcagtgcag ctagtaagtt ggcaaatacg 9361 taaagcaatg taaaggaaat gacagctaaa gcactagcaa aaatcagagc atcgcgcgac 9421 tgtattttac ctgaaggcag agggcgatga cgtgtgcgct ccatttcata atcgatatct 9481 cggtcgtaga cacagttaat cgtctgggca cttgcagctg ccaaagtacc accagtaagt 9541 gtaacgagta acaacagtgg gtctacttct cccttagctg caatccacat acttccagca 9601 gtggtaataa gtagcaaagg gataattcta ggcttcgtca gctggtagta gctttgaata 9661 acctgtagaa atgtttgatg gtggcgagag acattagtct caatcatttt ggcactattt 9721 cctttttttc aaacctaaat aactgttcac tacggctggt actgttgact gttcagtggg 9781 ttattctctt gctgtgtttg cgttgatagt cacgttgtta gcattgagcc cacgcttatg 9841 aacccagtcg cgcagtgcaa gaactgtgaa acacactaaa gtaccaagca aagctgctcc 9901 tactgcttgg tgagagacag tcagtggttc gacctgaaga tgtaaccgaa aagtggcaac 9961 tcctatcagg agctgtacaa taagcaatcc accagcgata tttgctagtc gtcgtaatgc 10021 aggatgcagt gctggtgttc gccatgaaag aaataccact gccaatgttg ccactgttgg 10081 aggcactatg ccagcgatat gactgtacat cacagtacaa agttgggatg tgccaaagca 10141 ttggtgtagt gcccaacgag agccgactaa agcacctagt agactttgta gataaaccaa 10201 aatagctgct gttaaaccta cccaaggcaa gttacctaca gttccagttg gctggtaagg 10261 ggtaagtgca gtaccgatga ctagtaaggt tgtgaaaaat aacagcgctg ttcctaagtg 10321 agcggttaca atatcaaacc gcaaaagttc agtaaccgtg agtccgccca agactccttg 10381 gaaaacgatt aaaaatagcg cgaatgtgga tgcccaaggc agccatctgg gtaaagagcg 10441 acgagaccac caggacattc caacgagtgc gatcgcgctg acaccaatta acgccgcatc 10501 caacctatga aaccactcca agaaaacctg gaaattcatt tgcttggttg gcaccagttc 10561 cccgtaacac aagggccaat cagggcaagc aagtccagca ttcatcacac gggtggcact 10621 gcctattgcc atcaaaatca aagtggctat acacattctc cacaccaagc gacgaattcg 10681 ttcctgtgga acctgcggcg gcgcagtcgt gatgttttgt tgttctagaa caaattcgct 10741 catgtaaaga taccttttgc cgctcttttt 
gattctgtct acgtttcaac catttgtcag 10801 taccgatcca tctcccaccc tagcgtacaa ctaaggttcg agactggctc tcactatact 10861 gtgaatcacc atagctactt tttgcctgaa gctttaggga atttttaagt tgtgagcagg 10921 cacactcaaa ggcgtaccac ccgctacaaa tgaaagaaac agctttttca cgcccttcgg 10981 gttcgccagt cgcctacgga gggaaaccct cctgcagcgc tggtctcacc agataccaag 11041 tgagggagac cctcatcaag tactggctcc cctacaccct tataccctta cacccctaca 11101 cccctatgtt ttttgagacg attgcgaaga acttagaacc agaattcacc gaaattaagc 11161 cagagtgtga aagccctggg aaggtgagtt ttttctggtt tggcattgat tcatgctaac 11221 cagttataaa attttttccc gaatcgacat ccaccactcg acacatccga gaaacagatt 11281 ctgttcaaga tcccaaaaat cagcctaaaa attaataaaa ttttcatgcg ttcccccaac 11341 atatctctta ggctggacta gtgtagtaag tgagtaaact gctcagtcaa atatagtaaa 11401 gccgttaact caaaaaaacc gtgaaaatcc caagttccat ctggacgtta cttataggca 11461 tcgggctgac cctagtcagc ctctggtacg gtcaaaatca tggtctacta ccaacagcag 11521 cttctgacga agccccgtta gtagacggtt tgttcaacac aatgatgacc gtctccgtag 11581 gtatatttgt gctcgtagaa ggtgttttaa tttactctgc tatcagatat cgtcggcgtg 11641 ctggtgataa tgcagatggt ccgccagtac atggcaatgt accactagaa atcctttgga 11701 cagcgatccc aacagttatt gttattggca tttctgttta cagcttcgac gtttacaacg 11761 aaatgggcgg cttcagcccc catgccatcc atgaagcccc aatgacgtca caagtcatga 11821 acatgcctgg ggcagcgatc gccgcaactt taagcgatac tccccccagc acagaaccta 11881 acctaaatca ggaaaaatct gatgaagcaa tgcaagaccc tgcgacagca gcagtccgca 11941 atgctgacca aattcctcaa aagcggaatg cccctggtgt aggaagtgtt gctcctagcc 12001 ttggacctac acctgaaaat gaaggaaaac cacctgcatt tgtggtgaat gtcacgggtt 12061 tgcagtacgc ctggattttt acctaccctg acactgaagt cacttctggt gaactgcacg 12121 ttcccatcgg gcgcgaagtg caattgaata tgacagcgaa cgatgttatc catgccttct 12181 gggtaccaga gtttcgcttg aagcaggatg tgatccccgg tcggcaaagt gagattcgtt 12241 ttacacccaa caaagaaggt gattatacgc tcatctgtgc cgaactgtgt ggtccttacc 12301 acggtgcgat gaggacacaa gttgttgttc aaaagccaga agaatttgaa cagtggatcc 12361 aagaacagga agttgctagc gctgatgaac tcaagcaagc agttgctgtt aatccggctg 12421 tcctaacccc 
agatgaattt ctcgctcctc ataccagcca tatgggaatt catccagaaa 12481 tgctacatca aattcaccat tagtcattag taattagcaa ttgactcctg acttctcaat 12541 atggttcgga taagatcaga acgcctgtgg cgaccccctt tccaaaggcg taaataaata 12601 aaatagcccc cctttccaag gggagccagc gcgttgggga gccagtactt gatgagggtt 12661 tccctcactt ggtatct // LOCUS NODE_2689_length_12624_cov_5.830933 12624 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12624) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12624) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12624 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1287) /locus_tag="DP116_21645" CDS complement(<1..1287) /locus_tag="DP116_21645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="helicase" /protein_id="PRJNA477356:DP116_21645" /translation="MYLTKNQVKQTATFKLKLPFVGESKGGYQTQPLPGCPKMPPSLQ LRGYQRQAVNNWFANNGRGTLKMATGSGKTITALAIACELYKQINLQVLLVVCPYRHL VTQWARECAKFNLQPILAFESLHNWQSQLSTQLYNVRSGSQLFLTVITSNSTLIGDGF QSQLKYFPEKTLIIGDEAHNLGAPKLEESLPRRVGLRLALSATPERYFDEGGTQSLFD YFGPVLKPEFTLRDAISNGALVHYLYYPILVELTEIESRAYAKLTQKIGRALLYRDRE NLDLADLEDNEDLKPLLMQRARLIGAAENKLNALRELMSTRRETSHTLFYCSDGSQEV GRSSLRQLKAVVKTLGVELGYKVSTYTSQTSIEEREVLRRQFESGELQGLVAIRCLDE GVDIPAIQTAVILASSANPRQFIQRRGRVLRPHPGKE" gene complement(1373..1978) /gene="lexA" /locus_tag="DP116_21650" CDS complement(1373..1978) /gene="lexA" /locus_tag="DP116_21650" /EC_number="3.4.21.88" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410809.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="repressor LexA" /protein_id="PRJNA477356:DP116_21650" /translation="MERLTEAQKQLYEWLAEYIRGHQHSPSIRQMMQAMNLKSPAPIQ SRLEHLRNKGYIEWSEGKARTIRVLQALKQGVPILGTIAAGGLIEPFTEAVENIDLAH LSLPPQAYALRVTGDSMIEDSIVEGDLVFLRPVLEPDQLKNGTIVAARVDSIGTTLKR FYRLGDRITLKPANPKYDPIEVNAIQVQVQGSLVAIWRNYN" gene complement(2139..3065) /gene="argF" /locus_tag="DP116_21655" CDS complement(2139..3065) /gene="argF" /locus_tag="DP116_21655" /EC_number="2.1.3.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ornithine carbamoyltransferase" /protein_id="PRJNA477356:DP116_21655" /translation="MAALIGRDLLSLADFSPTEVLELLQMASRLKSQKLRLRCHKVLG LLFSKASTRTRVSFTVAMYQLGGQVIDLNPNVTQVSRGEPVQDTARVLDRYLDILAIR TFAQQELEIFANYAKIPVINALTDLEHPCQVLADLMTVQECFGTFSGLTLTYVGDGNN VANSLVLGCALVGMNVRIATPNGFEPNAAIIETARLIAANKTEVLLTHDPEIATKGSH VIYTDVWASMGQEQEADNRMPIFQPYQVNEQLMSLAQSEAIVLHCLPAHRGEEITDEV MEGPQSRIWDQAENRMHAQKALLASILGAEEF" gene 3451..3522 /locus_tag="DP116_21660" tRNA 3451..3522 /locus_tag="DP116_21660" /product="tRNA-Thr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:3483..3485,aa:Thr,seq:tgt) gene complement(3751..4920) /locus_tag="DP116_21665" CDS complement(3751..4920) /locus_tag="DP116_21665" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21665" /translation="MGRKNHFPFSHALIHECIRQKSTPDLLLTNITMGNNTKYLQRLL ISTGNPKNIFELSLLLGTVLPIIFFTILSHTQQKIIEVETIKYTLLFLAMAHVSTTVY FYNVKAFRINIISQNKFMYIYTPLILFVFCGCIFTFSSQFFKPYLLLFYWLWQAYHYG KQNIGVYSFISYSQTKKPICRLEKISITLGILAGILPTWKVIGYNVAPSYLHNLINFF YWMGQPVFFAGLILSIYVFIVKNSCFTLLKSIFFFLLVFFFLPIYLSDNILVTFHTYA TAHGIQYIIFMTVIAINSEQAKDKAQKQSTLLKSLFLLSLYLIVGGLIFFYSQDLNKL VFIQNHSILSSCTDFLLGGLLGLTMAHFVVDAHAWKLSQVNQRKFILEKFSFIFK" gene complement(5319..6389) /locus_tag="DP116_21670" CDS complement(5319..6389) /locus_tag="DP116_21670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315886.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional riboflavin kinase/FAD synthetase" /protein_id="PRJNA477356:DP116_21670" /translation="MLNLSQNGCSVWVTSSTELARTPTFVALGKFDGVHRGHQKVIQP ILPPFNRAEDVGNFAETLFASSQTPHVYSTVVTFNPHPQEFFTGQPRTWLTPLDEKIH QLRSLGVEQLVLLPFDKELSALSAEQFVEKILVQQLQAARISVGQDFCFGSNRSGTAV DLKLLAAQYGIPVSIVPLETSACHGPCDSSDVSVASAEETRISTSLIRQLLLRGDLQS ANQLLGRAYTLIGTVIKGQQLGRTIGFPTANLQLPKDKFLPCHGVYAVRVLIYDETPD NPENIYLGVMNIGHRPTVNGTYQSVEIHLLDWSGDLYGKKLLVQLERFLRPEQKFSSL EALKTQIQQDCDIARAFFSAES" gene complement(6594..7514) /locus_tag="DP116_21675" CDS complement(6594..7514) /locus_tag="DP116_21675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315887.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" /protein_id="PRJNA477356:DP116_21675" /translation="MSRIENQFTVHFWGVRGSIPCPGPHTVRYGGNTPCVEMQVGGRR LIFDGGTGLHVLGQSLLSNMPIEAHVFFTHSHWDHMQGFPFFSPGFVKGNTFHIYGAI APDGSTIEQRLNDQMLHPNFPVPLQIMQANLDFYDVISGRPIHIHDITIETAPLNHPG EAVGYRVNWRGGAIAYITDTEHYSDRLDENALRLARNADILIYDCTYTDEEYNSPIQP KIGWGHSTWQEGVKIARSANVKTLVIFHHDPSHDDEFLDRVGQEATQKFPGAIMAHEG MVLQVPVPIPLSESFPIRKFSTYRLQSEQL" gene complement(7797..8594) /locus_tag="DP116_21680" CDS complement(7797..8594) /locus_tag="DP116_21680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876237.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="5'/3'-nucleotidase SurE" /protein_id="PRJNA477356:DP116_21680" /translation="MNILISNDDGVSALGIRTLANTLAQAGHEINVVCPDRERSATGH GLTMHQPIRAEIVESIFHPGVKAWACDGTPSDCVKLALWALLDSPPDLVLSGINQGAN LGTEILYSGTVSAAMEGLIEGIPSVALSLTSHINKDFQPAANFAKILVGQLAQNPLPE LMLLNINIPAVKWEEIAGACVTRQGVRRYIDVFDKRVDPRGKTYYWLTGEVVEDVEPP LGLNLSENIPIDVHMIRKNYISITPLQYNLTYANGLNQLSQWNLKFP" gene 9158..9904 /locus_tag="DP116_21685" CDS 9158..9904 /locus_tag="DP116_21685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859578.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotide exchange factor GrpE" /protein_id="PRJNA477356:DP116_21685" /translation="MDEDKHQNNTNQQSGEESEEKQAMTSDSTAEINANVTQSETEPV ATPTDVSQNTPTQNQDNSVAAKVEGASTDLTAKEAALHQQIESLKAQLEERSTQYMRI VADFENYRKRNQKEKEDLEQQIKRNTITELLPIVDNFERARAQIKPQNDGEMTIHKSY QGVYKLLVDSLKRLGVSPMRPENQPFDPNLHEAVLREPTDEYSEGTVLEELVRGYYLG DRVLRHAMVKVAAPKEDAPSSEENSVESSQ" gene 10119..12077 /gene="dnaK" /locus_tag="DP116_21690" CDS 10119..12077 /gene="dnaK" /locus_tag="DP116_21690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996602.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molecular chaperone DnaK" /protein_id="PRJNA477356:DP116_21690" /translation="MGKVIGIDLGTTNSCVAVLEGGQPIVITNSEGERTTPSIVGFGK NGERLVGQLAKRQAVTNAENTILSIKRFIGRRWEDTAGERSRISYTCVKGRDDTVDVL LHGHDYTPQEISAMILQKLKQDAENFLGEPVTQVVITVPAYFTHAQRQATKDAGTIAG LEVLRIINEPTAAALAFGLDKQDQEQLILVFDLGGGTFDVSILQLGDGVFEVKATCGN NHLGGDDFDHCIVLWMITRFQQEEKVDLSGDKMALQRLREAAEKAKTELSSRISTSIN LPFITADQTGPKHLQMELTRAKFEELTTHLVKAIIEPMSQALKDAQLKPQDIDRIILV GGSTRIRAVQNALSKFFDGKTPDRSVNPDEAVALGAAIQAGVLGGDVDHLLLLDVTPL SVGIETLGEVFTKIIERNTTIPSSQSQIFSTAVDGQTSVEIHVLQGERAMARDNKSLG KFLLQGIPPVPRGVPQIEVSFEIDVNGILKVAAQDKGTLREQSILITNTGGLSANEVE KMRQQAEMFAEDDKRRMQLVELKNQADNLLYSYESTLKHNGGMIDDQMKVLAHEKVIQ LQAAITDESISIAQFQQQLDNFQQTLFDIGSQVYNPVNSQTDTIIAASDNQLTLESAA PPSGTSTPPFDFDLKDESMMQVGYEAID" BASE COUNT 3837 a 2637 c 2573 g 3577 t ORIGIN 1 ttcctttcct ggatgaggac gcaaaactct tccccgccgt tgaatgaact ggcggggatt 61 agctgaactt gccaaaatca ccgccgtttg aattgcagga atatcaactc cttcatctaa 121 acagcgaata gcgactaaac cttgcaattc tccgctttca aattgacgcc gtaaaacttc 181 tctttcttct atagatgttt gggatgtgta ggtactgact ttataaccca gttctacccc 241 caaagttttg acgacagctt tcagttgacg taagctagaa cgtcccactt cttgtgaacc 301 atcactacaa tagaaaagtg tatgacttgt ttcccgacga gttgacatta actcccgcaa 361 agcatttaac ttattttcgg ctgcaccaat taatcttgct cgctgcatta ataacggctt 421 caaatcttcg ttatcttcca aatctgccaa gtcaagattt tcacggtctc gatataatag 481 tgctcgtcca atcttttgcg ttagtttagc ataggcacgg ctttctattt ctgttaattc 541 tactaatatg ggataataaa gataatgtac taaagcacca ttggagatcg catctcttaa 601 agtaaactct ggtttgagaa caggaccaaa ataatcaaat aaagattggg tcccaccttc 661 atcaaaatac ctttctggcg tagcagataa agccagtcgt aacccaacac gacgaggtaa 721 actttcttcc aacttcggtg cgcctaagtt atgtgcttcg tctccaataa ttaaagtctt 781 ttccggaaaa tatttgagtt gagattgaaa gccatctcca attaaagtag agttactagt 841 aataactgtc agaaacaatt gagaaccaga acggacatta taaagttgtg tggaaagttg 901 actctgccaa ttgtgtaaac tctcaaaagc taaaataggt tgcaaattaa attttgcaca 961 ttctcgcgcc cattgcgtga caagatgtcg ataaggacaa 
acgacgagca agacttgtaa 1021 gttaatttgc ttgtataatt cacaggcgat cgccagcgct gtaattgtct tcccactacc 1081 agtcgccatt ttcagcgttc ctctaccgtt gttggcaaac cagttattaa cggcttgtcg 1141 ctgataccct cgcaattgca gagatggtgg cattttaggg catcctggta atggttgtgt 1201 ctgataccca cccttacttt cgcctacaaa tggtaatttt aatttgaatg tagctgtttg 1261 cttaacttga ttttttgtca aatacataga aaggcaataa cgtcaaggta gtggtgagtg 1321 gaaagatgag gaagttattt ccctcatctc cccaatcccc tttctctatt tatcagttgt 1381 aattacgcca aatggcgaca agggaacctt gcacctgtac ctgaatagcg ttcacctcta 1441 tgggatcgta tttcgggttt gcaggtttga gggtaatgcg atcgcctagt cgataaaacc 1501 gcttcaaagt cgtaccaata ctatctactc tggcggcgac gatagtccca tttttcagtt 1561 gatctggttc tagtactgga cgcaaaaata ccaaatctcc ctccacgatt gaatcttcaa 1621 tcatgctgtc accagttacc cgcaaagcat atgcttgggg aggtaatgac aaatgagcta 1681 aatctatatt ttcgacagcc tcagtgaagg gttctattaa tccaccagcg gcgatcgttc 1741 ccaaaattgg tacaccttgt tttaacgctt gtagaactct aattgtccgt gctttacctt 1801 cactccattc aatgtacccc ttattgcgta aatgttccaa gcgactttga attggtgcag 1861 gtgattttaa attcatcgcc tgcatcattt gtcgaatcga aggtgaatgc tggtgtcctc 1921 gtatgtattc agccagccat tcataaagtt gtttttgagc ttctgtcaga cgttccataa 1981 atttgtaggg aggtgaatac aaatgctcct agaacattag tactacaaaa aacctcccta 2041 cgcactattt attttagata ttttgataaa ttcaagttag aacagagaag aaatgagtca 2101 atgatcgagt aaaaatctaa ctatttatga ctcacacctt aaaattcttc tgcccctaag 2161 atgcttgcaa gtaaagcttt ttgagcgtgc atccgatttt cagcttgatc ccaaatacgt 2221 gactgaggac cttccataac ctcatcagta atttcttcac cacgatgtgc tggtaaacag 2281 tgtaaaacaa ttgcctcgct ttgagcaaga ctcatcagtt gttcattcac ttgataaggt 2341 tgaaaaatag gcattctatt gtctgcttct tgttcttgcc ccatacttgc ccaaacatca 2401 gtgtaaatga catgagaacc ttttgttgca atttctggat catgagttaa gaggacttcg 2461 gttttattgg ctgcaatcaa acgtgctgtt tctataattg ctgcatttgg ttcaaatcca 2521 ttgggtgtgg caattctcac attcattccc accaaagcgc aacccaatac cagggaattt 2581 gcaacattat tcccatctcc tacataagtc aaagtcaatc cagaaaaagt gccaaagcat 2641 tcttgcaccg tcatcaaatc agccaagact tgacaaggat gttctaaatc 
agtcagagca 2701 ttaatgacag gaattttggc ataattagca aaaatttcca attcctgctg tgcaaaggta 2761 cgaatcgcca aaatatcgag gtatctatcc agaactcgcg ctgtatcctg cactggttcc 2821 ccgcgactaa cttgagtcac attgggattg agatcaatca cctgtccacc cagttggtac 2881 attgcgacag taaaactgac tcgtgttcga gttgaagcct tggagaacaa cagccctaac 2941 actttatgac accgcaatct cagcttttgt gattttagcc gagatgccat ttgcagaagt 3001 tctagaactt ccgttggact aaaatctgcc agacttaata aatcccgtcc gatcaacgct 3061 gccatgcttg ttgctctaca aagaacagtt ttgtctttct gtttggtgta cccagccgtg 3121 tcaaaaaaat cttgccttaa aactacacca gatattttct gtctagccta tagttttaag 3181 gtggctttgg tatatcacta gtttatcaat atcgtcagtg aaaataatca caagtttgtt 3241 tgattttcat tacagtcacc gcgcttgtaa gtatgcttct aggtggtgac atcggcagta 3301 tgataaaaat gagatatttt cagcgcaatt tggacaaata taactcttca cactagtata 3361 ttgtcaaatc agcttgtcta tggtatagtc ataaattagt gtgatgttca aaaaagacaa 3421 ggcaatttct attttagaaa agccgagtat gccagcatag cacagtggta gtgcatccga 3481 cttgtaatcg gaaggtcgtc ggttcaaatc cgactgctgg cttttgtggt tgtcgattaa 3541 catattattt gtcgtcgtta acaaattaat gtcatcgaac tgttctgagt gaaggtgcaa 3601 ggtcggatta ttgatgttta cgctgcaaat aaagctgatt ttatcaagtt gaaggatgga 3661 actgaaattc ggttggatag aattgtttca gtcaataata agctgctttc ctcttactct 3721 aattcatcgc acactaactc ctaggtaaac ctacttaaaa ataaatgaga acttttctaa 3781 gataaacttt ctttgattta cctgacttaa tttccaagca tgagcatcta ctacaaaatg 3841 cgccatagtt agccccagta atcctcccag aagaaaatcc gtacaacttg aaaggatact 3901 gtgattttga ataaagacaa gcttgtttaa atcttgactg tagaagaaaa tcaagccccc 3961 cactattaaa tacagtgaaa gcaaaaacaa tgacttcaaa agagtagact gtttttgggc 4021 tttatccttt gcttgttcag aatttatagc gataactgtc atgaatataa tgtattgaat 4081 tccatgtgca gttgcataag tatgaaatgt cactaaaata ttatcagata agtaaatcgg 4141 caaaaagaag aacactaata gaaagaagaa aattgacttg agtaatgtaa aacaagagtt 4201 ttttacaata aaaacataaa tactcaaaat tagacctgca aagaagacag gctgtcccat 4261 ccaatagaaa aagtttatca aattatgtaa ataactagga gctacgttgt agccgataac 4321 tttccaagtt ggaagaatac cagctaatat tcctaaagta atagaaattt tttctaatct 
4381 gcatataggt ttctttgttt gagaatacga aataaaactg taaactccta tattttgctt 4441 accataatga taagcttgcc atagccaata aaaaagcaaa agataaggtt tgaaaaactg 4501 acttgaaaaa gtaaaaatac acccgcagaa tacaaacaat atcagaggtg tatatatgta 4561 catgaattta ttttgagata ttatgttaat ccggaatgct ttaacattat aaaaataaac 4621 tgttgttgaa acatgcgcca ttgctaagaa taataaagta tatttaatcg tctcaacttc 4681 tatgattttt tgttgtgtat gactgagaat cgtaaaaaat ataatcggta aaactgtacc 4741 tagtaataaa cttaattcaa atatattctt agggttaccc gtgcttatca aaagccgttg 4801 taaatattta gtattgttgc ccatagtgat gttagttagc aataaatcgg gggtggattt 4861 ctgtcttatg cactcatgaa taagtgcatg ggaaaatggg aaatggtttt ttcttcccat 4921 ttccttattg cctcagtaag aggatatgtg ttgaagtcta aaatgtcatg ctctacgcaa 4981 acgggtagtg tgcaccttgg acgctcatac caagttgcgc cttaatgatc tcatcaggtg 5041 cccctctgcg atgctttgga gagagggtaa agtgacaagt atcaatataa cacgcgtaac 5101 agtatctcaa agtttgacgg taaaaaggaa ctcctcgttc cgcccaatga gaaagttagt 5161 tcccaagacg tgaaaaacgt cggtgcagga ctggactaac cagttgcctc caccgacgtt 5221 tccctcaaaa cagtactagt ctcacttcgc tctactcaga atgacaaaat gacacttttg 5281 ggacagacat ccccttacta ttcccccaat aagctagact aagactctgc actaaaaaaa 5341 gctctagcga tatcacagtc ttgttgaatt tgtgttttta gggcttctag agaagaaaat 5401 ttttgttctg gtcgcaaaaa tctttctagt tgcactaaca actttttgcc atacaaatca 5461 ccagaccaat ctaacagatg gatttccaca gattgatacg taccgtttac cgttggacga 5521 tgacctatat tcatcacgcc caagtaaata ttctctggat tgtctggtgt ttcatcgtag 5581 attaaaacgc gaacagcgta gacaccgtgg cagggcaaaa acttgtcttt tggtaattgt 5641 aggttggctg tgggaaagcc aatagttctg cccagttgtt gacctttgat gacagtaccg 5701 ataagagtgt atgcacgtcc taacagttga tttgcgcttt gaagatcgcc tcggagtaaa 5761 agttgccgga tgagtgaagt gctaatgcga gtttcctcgg cggaagcaac actgacatca 5821 ctgctgtcac aaggtccatg acaagcagac gtttctagag gaacgataga aacaggaatg 5881 ccgtactggg cggcgagcaa cttcaaatcc acagctgtac cactgcggtt ggaaccaaag 5941 caaaaatctt gcccaacgct aattcgtgca gcttgcagtt gttgcaccag aatcttttct 6001 acaaactgtt cagcagataa ggcagataat tctttgtcga agggtagcag gacgagttgt 6061 
tctactccca gcgatcgcaa ctgatgaatt ttttcatcca gtggtgttaa ccaagtacgg 6121 ggttgcccag taaaaaattc ttgtggatga ggattaaagg tgacaaccgt tgagtaaacg 6181 tgtggagttt gagaggacgc aaacagcgtc tcggcaaagt tccccacgtc ttctgctctg 6241 ttgaagggcg gtaatatagg ttgaataacc ttttgatgac cacggtgtac gccgtcaaac 6301 ttgccaagag caacaaaagt tggagttcga gccaattcgg ttgaagaagt tacccacaca 6361 gaacacccat tttgagacaa atttagcacg tcgattctgg gatttgaggt tatattaagc 6421 tctcagctaa tacagttcta agaaccaaag ccctatgcct tccctagtgg tgggtatgct 6481 cgtgcaactg ttcgttgaca cctcggcagg cgaaaagctt ctcattgttg tgcacgagtt 6541 gtgtgctagc gactgattgc tgaaagctcc taatcaccaa tctaatccaa aatttaaagt 6601 tgctccgatt gcaatctata agtcgaaaat ttcctaatag gaaaagattc tgataaaggg 6661 attggtacag gaacttgaag caccattcct tcatgtgcca tgatagcacc aggaaatttc 6721 tgtgtggctt cttgtccgac acgatccaaa aactcatcgt catgagatgg atcgtgatga 6781 aaaatgacca gagttttgac attagcggat ctggctatct ttacaccttc ttgccatgtg 6841 gagtgtcccc aaccaatttt aggctgaatt ggggaattat actcttcatc agtgtaggtg 6901 caatcgtaaa tcaggatgtc agcgttacga gctaagcgca gggcgttttc atctagtctg 6961 tcagaataat gttcagtatc agtaatataa gctattgccc caccacgcca attaactctg 7021 tacccgaccg cttctcctgg atggttgagt ggagctgttt ctatggtaat gtcatgaatg 7081 tgaatcggtc gccctgatat aacgtcgtag aaatctaaat ttgcctgcat aatctgcaag 7141 ggaacaggaa aatttgggtg cagcatttgg tcattaagac gctgttctat ggttgaacca 7201 tcgggagcga tcgcaccata aatatgaaaa gtgtttccct taacaaatcc tggagaaaag 7261 aaagggaaac cctgcatatg atcccagtgg gagtgggtga aaaaaacgtg tgcctctatc 7321 ggcatattag acaacaaaga ttgccctaaa acgtgcagtc ctgtgcctcc atcaaaaatt 7381 aaacgtctgc cacccacttg catctctaca caaggggtat taccgccata acgaacggtg 7441 tgtggtcccg gacaggggat actgccacga acgccccaaa aatgtacggt aaattgattc 7501 tctatcctag acatgggtgt tgcttgctgg actgagcggg cactcaataa aaaaaattgt 7561 tacttatttt gttcagttta ttctgtttca cgacttttag tagtagtaga atttcttggt 7621 aatacagcat accttgtttc aggttagttt tgtctttact tatgaaaaac ttcaaggcag 7681 aagttttgga gtgcaaacgt ttccgtataa gttattttca cttaacttcc tactttataa 7741 ccgaaatttt 
gctgtcatgg ttttatccat atacttgccg tcataaagaa aagacattac 7801 ggaaatttca aattccattg agataattga ttcaatccat ttgcataagt aaggttgtat 7861 tgtagtgggg ttatactgat gtagttttta cggatcatat gcacatctat aggtatgttt 7921 tcagaaagat ttaaacctaa tggaggttcc acatcttcca caacttctcc cgttaaccaa 7981 taataagttt tcccacgcgg gtctactcgt ttatcaaaaa catcgatgta acgtcgtact 8041 ccctggcggg tgacacaagc tcctgcaatt tcttcccatt taacagcagg gatattgatg 8101 ttaagtaaca tcaactctgg tagaggattt tgcgccagct gtcctactaa gattttggca 8161 aagttagcag ccggttgaaa atccttattt atatgactag taagactgag ggcgacactg 8221 ggaatgcctt caatcaaacc ttccattgcg gcagatacag taccagagta gagaatttca 8281 gttcctaaat tagcaccctg attaatgcca gatagaacta aatctggtgg agaatctaac 8341 aaagcccaca gcgccaattt aacacagtct gaaggagttc catcacaagc ccaagctttg 8401 acgccaggat ggaaaatcga ctcaactatt tcagcgcgaa ttggttgatg cattgttaac 8461 ccatgcccgg ttgcggaacg ctctcgatcc gggcaaacta cattgatctc atgacctgct 8521 tgcgctagag tgtttgccag agtacgaata cctaaagcag aaacaccatc atcgttgcta 8581 atcagtatat tcataaatat ttaatccatc gtcatagtca atagttagga gttaagagtc 8641 aacactcaag aagtgatggc gtaattggta ctccacccac atgcgcctaa accctgtagt 8701 aaatctctag gatagtttac tagagtctat attccgcacc ctcaactggc tgttagagct 8761 tttgattctc aagttgatgt agagatatcc gcctgacaag atctcgcaaa aaacaatcag 8821 atctggcgaa aaagtaatac caagttgcgg cgaaatcacc ttactctcca tctcctttgg 8881 aataccttgg agacggtaac ccgtccaagg agggtatacg ctaactgact cagcatttgt 8941 caaaatagat gacggtggtc attctcaagc taaacaatga agaatgaaat attccgtact 9001 tcattcatag tttatacaat cttataagaa ttcgtccgca actttgttgc ctagttcata 9061 cttcatactt cagggaacca gtttttgcac ccgaattcaa ttgttgcaag atatggggat 9121 aactacccaa ttcagcatcc agaaagagtg cacaatgata gacgaagata aacaccaaaa 9181 caacacaaac cagcaatcag gtgaagaaag cgaggaaaag caagcaatga caagcgactc 9241 tacagccgaa atcaatgcca acgtaaccca aagcgagact gagccagtgg caaccccaac 9301 cgatgtctcg caaaatacac ctacacagaa tcaagacaat agtgttgctg caaaagttga 9361 gggcgcaagt actgacttga cagccaaaga agcagcactt caccaacaaa ttgagtccct 9421 aaaagcgcag ttagaagagc 
gtagcactca atacatgcgg attgtggctg attttgagaa 9481 ttaccgcaaa cggaatcaaa aagagaaaga agatttggaa cagcagataa aacggaacac 9541 gattactgaa ttactaccaa tcgttgataa ttttgaacga gcaagagcgc aaatcaaacc 9601 ccaaaatgat ggggagatga cgattcacaa aagttaccaa ggtgtttaca agctattagt 9661 agattcctta aagcgcctgg gtgtctcacc aatgcgtcct gaaaaccaac catttgatcc 9721 caacttgcac gaggcagtat tgcgcgaacc tacggatgaa tattctgaag gaacagtgtt 9781 agaagagtta gtacgcggat attacttggg cgatcgcgta ctgcgccatg caatggtgaa 9841 ggtggctgct ccgaaggaag atgcaccctc ttccgaggaa aattcagtcg agtccagcca 9901 atagctcagc tgtattcgct cgccttactg atgaaaagtc cctggaaaac aggggcgagc 9961 aatctaacca cacccgacag atagtcacgg aaacatcacg cccaaagttt agcaaccttt 10021 acccggtgtt cgtgtatcgc taacaccgga ctgaagttgc tcaaaacttc agagaccaag 10081 tagtcagtaa aaatactgat gtcaaacaac tgatatttat gggaaaagtt attgggatcg 10141 atttaggcac taccaacagt tgtgtcgcgg ttctcgaagg tggtcaacca attgtcatta 10201 cgaattcaga gggtgaacga acgactccaa gtattgtggg attcggtaaa aatggtgaac 10261 gcttggttgg tcaactggca aaacgtcaag ctgtcacaaa tgctgaaaac actattctca 10321 gtattaagcg atttatcggt cgtcgttggg aggacactgc aggtgaacgc tcgcgcattt 10381 cttacacctg tgtcaaaggt cgagatgata ctgttgatgt gctccttcac ggacatgatt 10441 atacaccaca agaaatctcc gcgatgatcc tgcaaaaact caaacaggat gcggaaaact 10501 ttttaggtga acctgtcact caggtagtaa tcacagtacc tgcatatttt acacatgccc 10561 aaagacaagc gaccaaagac gctggcacta ttgcgggact agaagttttg cggattatca 10621 atgaaccgac tgcagctgcc ttagcgtttg ggttggacaa gcaagaccaa gagcagctca 10681 ttctcgtatt tgacttagga ggcggtacct tcgatgtctc cattctacag ttgggggatg 10741 gagtctttga agtgaaggcg acttgtggca acaaccattt aggtggggac gattttgatc 10801 actgtattgt cctctggatg atcacacgct tccaacaaga agagaaagtt gacctttctg 10861 gggacaaaat ggctttgcaa cgtttgcggg aagcagcgga aaaggcaaaa acggaacttt 10921 ccagtagaat cagcacttca attaacttgc cttttatcac tgctgatcaa acaggaccaa 10981 agcatttgca gatggaactc acccgcgcca aatttgaaga gttaacaaca catctggtta 11041 aagcgatcat cgaaccgatg agtcaggcgc tcaaagatgc acaactcaaa ccacaagaca 11101 ttgatcggat 
aatcttagtg ggcggttcca ctcggattcg ggctgttcaa aatgccttga 11161 gcaaattttt tgatggcaaa actccagatc gttctgtcaa ccccgacgaa gcagtcgcac 11221 tgggagcagc tattcaagct ggggtgctgg gtggtgacgt agatcattta cttttgttag 11281 atgttacgcc cttgtctgtg ggaattgaaa ccttgggaga agtgttcact aaaattatcg 11341 aacgcaatac cacaattcca agtagccagt cccaaatttt ttctacagca gttgatggac 11401 aaacctctgt ggaaattcac gtccttcaag gtgaacgggc aatggcacgg gataacaaga 11461 gtttgggcaa atttcttctt cagggaattc cgccagttcc gcgtggtgta ccgcaaattg 11521 aagtctcttt tgaaattgat gttaatggca tcctcaaagt tgcggcacaa gacaaaggca 11581 ctcttcgaga acaaagtatt ctcattacca atacaggtgg cttaagtgcc aacgaagttg 11641 aaaaaatgcg gcaacaagct gaaatgtttg ccgaagatga caaaaggcgt atgcaactgg 11701 ttgaactgaa aaatcaagca gataatttac tgtatagtta cgaatcaacc ttaaagcata 11761 atggcggtat gattgatgac cagatgaaag ttttggcaca tgaaaaagtc atacagctac 11821 aagcagcaat cacagatgaa tctatttcta tagcccaatt ccaacaacag ttggataact 11881 tccagcaaac tttgtttgac attggctctc aggtgtataa tcccgttaat agccaaactg 11941 atacaataat cgcagcttca gataatcaat tgactttaga atcagcagcg ccccctagtg 12001 gaacgtcaac accaccattt gactttgatt tgaaagacga gagtatgatg caggttggtt 12061 acgaggcaat agactagaga attgccaaga gtttgggtag tgtgcaaaaa tgacagatcc 12121 tagattctga ctcctctaca cttttaatta ttgaatctct gaaattggct tgtcaggttg 12181 ccctacacca tacatatccg gtaataccat tactttcatg aacggttttc cacacttttt 12241 agtgcggtta gctcgtctgt aaagtaagcg gagttcttgc aatatacgta gcacaattgt 12301 aaaagagaca gcctcaggct attgcttttg tcgtttacca tgagaaaagc acctgttgct 12361 gttgtcagta gtgagtaaaa attttctggt cttaacttag tctttgattt caaaactaat 12421 tccagattcc tttgactcag tgctcgacat tcagaactgt tatggtgatt tttgtataaa 12481 ggaaaaggta aaattttagt attaataaaa cttaactatt gccttctgcc ttcccgaact 12541 tccgaacggc ggaactcgtc gttaaaagtt ctgtttgcat agcccgcttt ctacctttta 12601 actttcacct ttgctctatg gctc // LOCUS NODE_2695_length_12606_cov_5.36164412606 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 12606)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 12606)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
source 1..12606 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 427..4194 /locus_tag="DP116_21695" CDS 427..4194 /locus_tag="DP116_21695" /inference="COORDINATES: protein motif:HMM:PF01590.24,HMM:PF02518.24,HMM:PF03707.14" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21695" /translation="MYSELTGTYKLELVGLSFAIAVISSYTALDLSGRVQLAWNRRLL WLLGGAVAMGMGIWSMHFIAMLAFELPQPVTYDVWTTLLSLLFAVVASSIALSLLSRS ISTALLIGGGVCMGLAIASMHYTGMAAMRLQAKLEYDLKLVSLSVLIAIIASFAALWL AFRLRKNQDLKGAIWQKLGSAFLMGIAISGTHYTGMWATHFMPHKHLSKLQSPVMNQF WLAVAIGVAALFMLTLALLTSFFDQSLTDHLLQQKALEESEKRFRMLIREMQVGVLLL NCNAEILISNQAANNLLNRNPHDKQRQVFGAGWLLLREDGTPLPEEEQPVRQAIALQK PIHNIVMGIEDRTRQNQRWLLVNADPQIGNDGRVERVVCTFSDITQRKQVEATLQLIV EGTAYTTGDEFFRSCVRYIAKVLQVFYVFVSEFTNDTKTELRTLAFWNSVDFDENFNY DIAAITHEHCKLVFGGTCCCHFNDELSILLLRNKDIAQLNLHSYFVPLVNSNGEVIGY LVVIDVKPLEIDLSKESCVKIFAARAGAELERKLAEELLAKSAERERAISFVIQRMRQ TLEIEKIFSATTQELRQALSCDRVLVYRFHPDWSGDIVCESVAEGWKALKRLQKNQSQ FTQNAVNQTDCVIKSVGDADNIIKETYMYDTQGGYFRSGVTSRCVSDIYKAGFDPCYV EFLEQFQARAYITVPIFCNNQLWGLLATYQNSAPRQWKEAEIKMVVQIGAQLGVAIQQ AELLVQTQKQSTELKQAKEAADKANRAKSEFLANMSHELRTPLNAILGFTQLMNRDTS LKTEHQKYLNIINRSGEHLLVLINDILQMSKIEAGGMTFNENKFDLYYLLNSLEAMLK LKAQSKGLNLIFERTLQVPQYITTDESKLRQVLINLLDNATKFTEKGSVTLRVSVDQG SRDIQQQQHSPQFHLLFEVTDTGPGIHPNEFDKLFQAFEQTATGLKSGEGTGLGLSIS HKFVQMMGGEIKVSSTLGVGTQFSFFIPIEEATETKTQTLEFTSHKAIGLASEQPAYR ILVVEDQQTNRMLLVNLLNNLGFQVQEATNGQDALTLWRIWQPHLIFMDIRMPLMDGC EATRLIKQRERKTHQHSHQTIIIAITASAFEEEKHKILSAGCDDILSKPIQEQDIFGK ISKYLGVQFLYQENTTNTDITPPICKVMTNFPNLASVMGTMPTEWIRQLHNAACGGND LLILQLMEQIPADKIDLIEALNVLVENFDFEQIIELTENRTQNSERITQQS" gene complement(4505..5635) /locus_tag="DP116_21700" 
CDS complement(4505..5635) /locus_tag="DP116_21700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876569.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldo/keto reductase" /protein_id="PRJNA477356:DP116_21700" /translation="MKYRRFGKTNLRLSVFSLGTMRYLASQENAWQTIHKAIVLGINH LETARGYGKSEEYLGEAISVGLPVNRCQVHITTKIPPTADADTMRRYIDESLERLKLD YLDCLGIHGLNTWEHLDWVKAKNGCMQAVQEAIADGRVRHVGFSTHAPLEIILAAIDT DLFEFVNLHYYYFFQRNAEAIQKAYEKDMGVFIISPADKGGRLYTPPQTLKDLCDPFS PLELNYRFLLSDSRITTLSIGPAHPEELTEPLRVADCDQKLTPEEITVFERLENRQKA VLGTDKCSQCYACLPCPENINIPEVLRLRNLAVAYDMGDFGQYRYRMFENAGHWFAGM KASRCTECGDCLPRCPEELDIPTLLQDTHSRLNGSSGRRLWG" gene complement(5701..6198) /locus_tag="DP116_21705" CDS complement(5701..6198) /locus_tag="DP116_21705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012406891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional nuclease family protein" /protein_id="PRJNA477356:DP116_21705" /translation="MLEMKVAGIALDAITRSPIVLLKDASDRRALPIYIGQEQARAIM GALENQKAPRPLTHDLIVNMLDAWNIVLERIIIHSLQKDTFYAALIVKQGDVKKEIDA RPSDAIAIALRTNTPIWVMEEVIADASIPVDRDADEAEQQAFREFISDLRPEDLIKRF GSGET" gene 6538..7218 /locus_tag="DP116_21710" CDS 6538..7218 /locus_tag="DP116_21710" /EC_number="2.5.1.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="riboflavin synthase" /protein_id="PRJNA477356:DP116_21710" /translation="MFTGIIQALGTIKPLEFSSWQITCVTQPSNVIMQDLATGDSIAV DGICLTVEKILKNGFIATASPETLRRTTLGQEETQQRYVNLEASLRVGGKVGGHFVMG HVDGIGQLVSAEATQTSWELIFTAPGAIARYMVSKGSIAINGISLTVADYQSELSQFK VAVIPVTYKETNLQYINPGSWVNLEGDILGKYVEKFLCFGNPDLEEATQTIPNDITPG FLVEHGYL" gene complement(7325..9001) /locus_tag="DP116_21715" CDS complement(7325..9001) /locus_tag="DP116_21715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315644.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="M23 family peptidase" /protein_id="PRJNA477356:DP116_21715" /translation="MAQRNNSAENRLHQLWQQCLSTRRFASTLPAQSLAWLGSITMLS NGGLVFAQTESAIDNIVPTAESSQPAPSVNRVKRDTFRHNNSSPAVEETKSQSDEFSQ RRVRLRQRLSQAKRPSPAVALRNYKRHAQSSQPEVAIRKFKPRVQVSVRSESTRNWRT RLQRAPQVEVPQFNISVSKEQPQQEVSQENNWPVPRHLAHWREPYANSTKIDNLRAAL NSRQGTASSTDDSTPKDYNNAYIDPGEYNTGATDRYQAPNSVVITQQSNTELPRKKAS WIRRSKPATLATVPPVRHVERGERNSLDRPTYRTVSRSMTPSRHRSYRTAYRSITKDT YHPNRFIPDFSSPTTVSSVPIAPVGGILPAPMTAENVAPRISNITYDIPLAAVLPQVN YGGVYGGRLASGPGLMYPLSIPSAISSLFGWRTHPITGDRSFHSGTDIGAAMGTPVLA AYTGKVESADWLGGYGMTVIVNHSNAQQTLYGHMSEIFVQPGQLVQQGTVIGRVGSTG LSTGPHLHFEVRQLTPEGWVATDPGAQLESALGQLMQSLHTAQTPQRPGS" gene complement(9256..10110) /locus_tag="DP116_21720" CDS complement(9256..10110) /locus_tag="DP116_21720" /EC_number="6.3.4.15" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458377.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="biotin--[acetyl-CoA-carboxylase] ligase" /protein_id="PRJNA477356:DP116_21720" /translation="MALDQQKLETALQVERKSPYLQFSLHLFETVSSTNQILWDLLAQ GAESGCVVIATDQTAGRGQWGRQWMSSAGGLYLSVAIANGNIAAENRTAFCLPKLYAT NSYQLTLATAWGIAQELRNCGVPVDIKWPNDLILVSRKLGGILTETKIHKGVITQAVI GVGLNWANPVPETGINLEMWQANQQTKFVSCLEMLISKVLIGIESGIQCLFQEGVDIL IYRYLELLMNIGEKVYVNNTVGTIIGVTNTGELRVRMETSELKSVKRSEIYLQPGTIS LGYCHSYH" gene 10184..10987 /gene="pgeF" /locus_tag="DP116_21725" CDS 10184..10987 /gene="pgeF" /locus_tag="DP116_21725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318243.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidoglycan editing factor PgeF" /protein_id="PRJNA477356:DP116_21725" /translation="MHAWHWRTWEGLPYLTCSLLEPWHHGFFTQQFSPSYPSELTKVL HPEASAYRLKQVHGNTVLTPKEIVALLAEGGDQVDGEGDDALVSGDGLISNQPLQAVW VATADCTPVLIADEKTGQVAAVHAGWRGTALKIVPQAIARMQAQGSKLEDLRIAMGPA IAGEVYQVSEQVAAEVGASIIPQNDEKAIVAALYKLPNSPLLPDPNPERVRLDVRRVN TLQLEQLGISSQQMAIAPYCTYQTPEYFFSYRREKQKKVQWSGIVSNTL" gene 11138..11803 /locus_tag="DP116_21730" CDS 11138..11803 /locus_tag="DP116_21730" /inference="COORDINATES: protein motif:HMM:PF11523.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21730" /translation="MARRKKTQILDKEFETQKELKEYVQKILYAYELKELLSPEHFQF IRSLLNNHPDSYEKIGDGIESIWIQENVVNRGKSRGFWFQRIDRSIDNFSYKVCIESP PSVTSHFIMACREAVDSYVNAYREKTFQGVEILQCPTTGDSITLKESYVAHSPLHFKE LAEAFRKNEDLVLSEQLFQVHRDGDFAMSFADEALRQKWIKYHSENATLEIRSKSVLG VKV" gene 11946..12308 /locus_tag="DP116_21735" CDS 11946..12308 /locus_tag="DP116_21735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181242.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21735" /translation="MTTVAILPISNPNGEKSYRAIAGDKQSVGKTAGQALDALTAQLG ETEFSALLVIQSFRPDPFFSTEQQKRLSELMSVWRTARDQGQALPPEQQAELDTLVDT ELRAATARTAALMQQLSQ" BASE COUNT 3649 a 2685 c 2741 g 3531 t ORIGIN 1 attccctctt ccccccctat ccccagaggc gaccggtgag tcccccctat ccccagaggg 61 aaccggtgag tccccctatt cctaaagctg cctatttgta tcaacgttaa agtgaaacga 121 tattagaccc cgtcctacca cattcaggga tggggaattt tatagaattc gtgaagaaaa 181 tgcgcaattt tacgtatttt actgaaaaat aaaataattt cctcatcact ttgtcactta 241 gaattaacag aaaaacttat attctttacc taatggttca aatgtgaaag ataaaattta 301 tctgacgaaa ttgcagctgg tccggtaggg catgttgctg catctctaca cagcaccagt 361 ataggacttc atggaaaata gttggttcac atttcagtag tctgttgggt aaacgcataa 421 aaaaatatgt acagcgaatt aacaggcacg tataaattag aattagtcgg actttcgttt 481 gcgatcgcag ttatttcttc atacacagcc ttagacttat caggaagagt gcagcttgct 541 tggaacaggc gtttactctg gctgttgggg ggggctgtcg caatgggaat gggcatttgg 601 tcaatgcact ttattgccat gctggctttt gaattacctc agcctgtgac ttacgatgtg 661 tggactaccc tgctgtctct gctgtttgca gtggttgctt ctagcatcgc tttatcgttg 721 ttgagtcgtt ctatctcaac agcactttta attggtggtg gggtttgcat gggactagcc 781 attgcttcaa tgcactacac gggcatggca gctatgcgac ttcaggcaaa gcttgagtat 841 gatttaaagt tagtcagttt gtcggtgctc attgccatca ttgcctcttt cgcagctctt 901 tggctagcgt ttcgactgag aaaaaatcaa gacttgaaag gggcgatatg gcagaaactt 961 ggcagtgcct ttttaatggg aattgccatc agtggaacgc actacacagg aatgtgggca 1021 actcacttta tgccccacaa acatttatca aaactgcaat ctccggtaat gaatcagttc 1081 tggctggctg ttgccattgg ggttgcggca ttattcatgt taactttagc cttattaact 1141 tctttcttcg accaatcttt aacagatcac ttattacaac aaaaagcttt agaagaaagt 1201 gaaaaacgct ttcggatgct gatccgagag atgcaagtag gagttttatt gctcaattgc 1261 aatgctgaaa ttctcatcag caatcaagca gcaaacaatc tcctgaatcg aaatccccac 1321 gataaacagc gtcaagtatt tggtgctggt tggttgttgt tgcgtgaaga tggtacgcct 1381 cttccggaag aagaacaacc cgtgcgccaa gcaatagcac tgcaaaaacc cattcacaat 1441 atagtcatgg ggattgaaga 
tcgcactcgc caaaatcaac gctggctatt agtcaatgca 1501 gatccccaga taggaaatga tggtcgtgta gagagggttg tctgcacttt tagtgatatc 1561 actcaaagaa agcaagttga agccacactg caattaattg ttgagggaac agcttataca 1621 actggtgatg aattctttcg ttcatgcgta cgctatattg ccaaagtgtt acaagttttt 1681 tatgtgtttg ttagtgaatt tacgaatgat accaaaactg aacttcgtac cctggcattc 1741 tggaatagtg ttgattttga tgaaaacttc aactacgaca tcgctgctat tacccacgaa 1801 cactgtaaac tcgtttttgg cggaacatgc tgctgccatt ttaatgatga gttatcaata 1861 cttttactta gaaacaaaga tatcgcccag ttgaatcttc acagctattt cgttcctcta 1921 gtcaactcaa acggcgaggt tatcggttat ttggtggtga tcgatgttaa gcctttagag 1981 attgatttaa gcaaagagtc ctgtgtgaaa atttttgctg ctcgtgcagg ggctgaacta 2041 gaacgtaaac tagcagaaga attgctcgct aagagtgcag aacgagaaag agcaatttcc 2101 tttgtgattc aacgcatgcg tcaaactctg gaaattgaga aaattttcag tgcgacaaca 2161 caagagttgc gacaagcgtt aagctgcgat cgcgtcctcg tttatcgctt tcatcctgac 2221 tggagtggag acattgtctg cgagtcagta gcagaaggtt ggaaagcgct aaagcgctta 2281 caaaagaatc aatcccaatt cacacagaac gcggtgaacc aaaccgattg tgtgatcaaa 2341 agtgttgggg atgcagataa tatcataaag gagacctata tgtacgacac ccagggcggt 2401 tatttccgtt ctggcgtaac ttctcggtgc gtttcggata tttacaaagc tggatttgac 2461 ccttgttatg ttgaattctt agaacaattt caagcacggg catacatcac agttcccatc 2521 ttttgcaaca atcaactttg gggcttgcta gcaacgtatc aaaattctgc tcctcgtcaa 2581 tggaaagaag cagaaatcaa aatggtagtt caaattggag cacaattggg agtcgcaatc 2641 caacaagcag aattgttagt acaaacgcaa aagcaatcta ccgaactcaa acaagccaag 2701 gaagccgctg ataaagccaa tcgtgctaaa agcgaatttt tagcaaatat gagccatgaa 2761 ctaagaacac cactcaacgc aattcttggc tttacccaac tgatgaaccg cgacacttcc 2821 ttaaaaacag agcatcagaa atacctaaat atcattaatc gtagcggcga acatctgttg 2881 gtgttaatta acgacatttt acagatgtcc aaaattgaag cgggaggtat gacatttaat 2941 gagaataaat ttgatttata ttacctcctt aatagtttag aggcaatgct caaactcaaa 3001 gctcaatcca aaggcttaaa cctgatattt gagcgtaccc ttcaagtccc gcaatatatc 3061 acaactgacg aaagtaaatt gcgtcaggtt ttaataaact tactagacaa tgccactaaa 3121 tttaccgaaa aaggaagtgt cactcttcgg 
gtatcagtgg accaagggag tagagacata 3181 caacagcagc aacacagtcc acaattccat ctcctgtttg aggtcacaga cactggtcct 3241 ggcattcatc ccaatgaatt cgataaatta tttcaagctt ttgaacaaac tgcaacaggg 3301 ttaaaatctg gtgagggtac tggcttgggt ttatccatca gtcataagtt tgtgcaaatg 3361 atgggaggag aaatcaaggt tagcagtacc ttgggggttg gaactcagtt tagctttttt 3421 attcctatag aggaggcaac agaaacgaaa acacaaactc ttgagttcac gagtcataaa 3481 gccattggct tagcttctga acaacccgct tatcggattt tagttgtcga ggaccagcaa 3541 accaatcgta tgctattggt caacctgcta aacaacctag gctttcaagt acaagaagcc 3601 acgaatggtc aagatgctct cactctctgg agaatatggc aaccgcactt aatttttatg 3661 gatatacgga tgcccttaat ggatggctgt gaggcgactc gcttgattaa gcaaagggaa 3721 aggaagacac atcaacattc tcatcaaacc atcatcattg ccataacggc aagcgctttt 3781 gaagaagaaa aacacaaaat tttgagtgct ggctgtgatg atattttaag taaacccatt 3841 caagaacaag atattttcgg aaaaataagc aaatatttgg gcgtgcaatt tctttatcaa 3901 gaaaatacaa caaatacaga tatcactcca cctatttgta aggttatgac caattttcct 3961 aacctggctt ctgtgatggg aactatgcct accgagtgga tacgacagct tcacaatgct 4021 gcttgtggtg gcaacgattt gctcattctt caactgatgg aacaaattcc agcagataaa 4081 atagatttga ttgaggcttt gaatgttttg gttgaaaatt ttgactttga gcaaattata 4141 gaactgactg aaaacagaac tcagaattca gaacgcataa ctcagcaatc atagcagttc 4201 tcatttgaat cacatacact aagtagccat catgaactgc gtacacaaga acacatgaaa 4261 agatctctta gcctgttccc tgttctctgt tccctgttcc ctgttccctg ttccctattt 4321 tcaagtcaga taggtagaaa taaacagagc tatgttaaga aatgtaaaaa cgcctcaaac 4381 ccttactaat gaatggcagg tgctgtagcc tgtgaaaccc gtccaccgca ctgcctccta 4441 atgaccaatg actaatgact gccaaaacat cgttaacttt atttgtaccg acctacttag 4501 tttctcaacc ccacaatcgc cgtcctgatg atccatttag cctactatga gtatcttgta 4561 acaaagttgg tatatccaac tcttctggac accggggcag acagtcgcca cattctgtac 4621 aacggctagc tttcattcca gcgaaccagt gaccagcatt ttcaaacatt ctgtagcggt 4681 attgcccaaa atcacccatg tcataggcaa ctgcaagatt gcgtaaccgc aacacttctg 4741 gaatgttgat attttctggg cacggtaaac acgcatagca ctggctacat ttgtctgttc 4801 ccaaaacagc cttttgacga ttttctagac gttcaaagac 
cgttatttcc tctggggtta 4861 acttttgatc acaatctgca acccgcaaag gttctgttaa ttcctctggg tgtgctggtc 4921 cgatgctgag agtggtgatt cgggagtcac taagtaaaaa ccgataattt aactctaaag 4981 gtgaaaaagg atcacacaaa tctttcaagg tttggggtgg tgtatacagg cgtcctcctt 5041 tatcagcagg agaaataata aatacaccca tatctttttc ataagctttc tgaattgcct 5101 ctgcgttgcg ctgaaaaaaa tagtaataat gcagattgac aaattcaaat aaatctgtat 5161 ctatcgccgc caaaataatc tctaaaggcg catgggtaga aaaaccaacg tgtcgcaccc 5221 gaccatcggc aatagcttct tgcaccgcct gcatacagcc attcttggct tttacccagt 5281 caagatgttc ccatgtgttt aaaccgtgaa ttcccaggca atctagataa tctaacttta 5341 atcgttccag agattcatcg atataccgac gcatagtgtc agcatccgct gttggtggaa 5401 ttttagtggt gatgtgaact tggcaacggt taactggtaa ccctacagaa attgcctcac 5461 caagatactc ctcacttttt ccgtatcctc tagcagtttc taaatgatta atccccaaca 5521 ctatggcttt gtgaatggtc tgccatgcat tttcctgtga agctaagtag cgcattgtcc 5581 ccaaggaaaa gacagacaag cgtagattcg ttttcccaaa acgtcggtat ttcattttta 5641 acagagttcg gagttaagag tttgctgcaa gtcactcata actcattgtt tatatctcat 5701 ctatgtttcg ccactaccaa aacgcttgat caaatcttca gggcggagat cggagataaa 5761 ttcccggaag gcttgctgtt cggcttcatc tgcatcccga tctacaggaa tagaagcatc 5821 cgcaatcact tcttccatga cccaaatagg agtatttgta cggagggcga tcgcgatcgc 5881 atcactagga cgcgcgtcta tttctttttt gacatcgcct tgcttcacaa ttaatgctgc 5941 ataaaatgta tctttctgta atgaatgaat gatgatccgt tccaaaacga tgttccatgc 6001 atccagcata ttcacaatca ggtcatgagt taagggtctt ggagcctttt gattctccag 6061 tgcgcccata attgccctag cttgttcttg accaatgtaa attggcaatg cacggcggtc 6121 tgaagcatct ttcaagagta caatcgggct gcgggttatg gcatctaatg ctatgccagc 6181 gactttcatt tcaagcattg gctaagcctc taaaatcctt tgagcgctat ggtaatttgt 6241 aacaaatcgt acttgtgttg taatcatttt ggggttagga taaacataac tcttcattcg 6301 ctataattgc ctctattaaa ctcctaaatg gatgtgaaca ctagagtaag tcaaacaagc 6361 ctaattcgac aataatctac ttttctaaat gtgcgcagtc ataaatgtaa atattttgaa 6421 cttttgacat aatttttagt ctgaaattat acttaaaatt ttgaaatgtt tgtgagaatt 6481 actagctgct tttgggcaaa aataagcaat aaatagagtt agttttgcaa 
aaaagccgtg 6541 tttacaggaa taatccaagc attaggaacc ataaaacccc tggagttcag ttcttggcaa 6601 atcacttgtg tgactcagcc atctaatgta attatgcaag atttagcgac aggtgacagc 6661 atcgctgtag atggcatctg cctaacggta gaaaaaattt taaaaaatgg atttatcgct 6721 acggcttcac cagaaaccct gcgccgcaca acactaggac aagaagaaac acaacagagg 6781 tacgttaact tagaagcatc gctgagagtg gggggtaaag tgggcggtca tttcgttatg 6841 ggacacgtgg atggtattgg tcaactggtg tcagcagaag cgacacaaac ctcttgggaa 6901 ttgattttta ccgctccagg tgcgatcgcc cgctatatgg tctccaaagg aagtattgct 6961 ataaatggca tcagcctaac agtagccgat tatcagtctg aactatcgca gtttaaagta 7021 gcagtgattc ccgtcactta caaagaaacc aatctccaat acatcaatcc gggtagttgg 7081 gtgaatttag agggagatat tcttggcaaa tacgtcgaaa aattcctttg ttttggcaat 7141 ccggatctgg aagaagctac acagacaatt ccaaatgaca taacacccgg attcttagtt 7201 gaacacgggt acttgtaagc aaaaagggac aaccagaaag aggaaggagg aacagagcga 7261 gggaaaagtt aatacttaat aacttttttc cttcacactt tcactccttc actcccacac 7321 ttttctagct acctggtcgc tgaggagttt gagctgtatg taacgattgc attaattgac 7381 caagggcgga ctctaattga gcgcctggat cagtcgcaac ccatccttcg ggtgtcagct 7441 ggcgcacctc aaagtgcaag tggggtcctg tggacaaacc agtgctacca actcgtccaa 7501 taacggttcc ctgttgcacc aattgaccgg gctgaacaaa gatttctgac atatgaccgt 7561 aaagagtttg ttgggcgttg ctgtggttaa ctatcactgt cataccgtag ccgcccaacc 7621 aatcagcact ctctacttta ccagtgtagg ctgccaaaac aggtgttccc atagctgcgc 7681 ctatgtctgt accagagtgg aaactgcgat cgcctgtaat gggatgagtt cgccaaccaa 7741 acagagaact aattgcagag gggatagaaa ggggatacat caatcccgga ccacttgcga 7801 gtctaccacc atagactcca ccgtagttta cttgcggcaa tactgctgct agcgggatat 7861 cataagtgat attactgata cgaggtgcaa cattttccgc agtcattggt gctggtaaga 7921 tcccacctac tggagcaatc ggtactgaac tcactgttgt aggggaggaa aagtcgggga 7981 taaaccggtt gggatgatat gtatctttag ttatacttct ataagccgtt cgataactgc 8041 gatgccgact aggagtcatg gaacgagaaa cagtcctata agtgggacga tccaaagagt 8101 ttcgttcacc tctttcaaca tgcctgactg gcggaacagt tgctaatgtg gcaggtttgc 8161 ttcttctaat ccaacttgct ttctttcggg gtaattcagt attggactgt tgtgtaataa 
8221 ctacagaatt cggtgcttga tacctatcgg tagcaccagt gttgtattcg cctgggtcga 8281 tataagcgtt gttgtaatct ttcggagtgg agtcgtcagt agaggaggca gtcccttggc 8341 gtgagttcag cgccgcgcgg aggttatcga tcttggtgga attggcgtag ggttccctcc 8401 agtgcgcaag gtgccgtgga acaggccagt tgttctcttg ggaaacctct tgttgaggtt 8461 gttccttgct gacacttatg ttgaattgtg gaacctctac ttgaggtgct ctctgtaacc 8521 ttgttctcca gtttctagta ctttcgctac gtacagaaac ttgcacacgg ggtttgaatt 8581 ttctgatcgc cacctctggc tgtgaacttt gtgcatgacg tttgtaattt ctgagagcga 8641 ctgcaggact tggtcttttt gcctgagaga gtctttgtct gagtctaacc cggcgttgag 8701 aaaattcgtc ggactgcgat ttcgtctctt ctactgcggg gctgctattg ttgtgtctaa 8761 aagtatctct tttgactcta ttcaccgatg gtgctggttg agaactttca gcagtgggaa 8821 caatattatc gattgctgat tcagtttgag caaacaccaa gccaccatta cttagcatag 8881 taatgctgcc aagccaagcg agactttgtg ctgggagcgt agacgcaaag cgtcttgttg 8941 agaggcattg ctgccacaat tgatgcaaac ggttctcggc agagttattg cgctgcgcca 9001 ttgttttttt gatgttttgt gtattgtaga gcatgaacta cctatactcg cgtaaggtgt 9061 gagtataggc ttcgagtctc attcgctgtt gctgtgttgg gtatactcat gactgagtat 9121 tcggaaccaa atctgatcta attatggtta gtagactgaa tgaaacaaga atcccatcgc 9181 ccttttaggc gtgggagtgt caaagcttcg gctccgctca gctttgaccc tgagcggagc 9241 cgaagggtca attggttaat gatatgaatg acagtaaccc aaactgattg taccgggctg 9301 aaggtaaatt tctgatcttt ttactgattt cagttcggat gtttccatac gaacccgaag 9361 ctccccagta tttgtcacgc caatgatagt acctacagtg ttgttaacat agactttctc 9421 acctatgttc attagcaact ctagataacg atatattagt atatcgactc cttcttggaa 9481 caagcactgt ataccggatt ctattccaat taaaacttta gaaatgagca tttccaaaca 9541 cgagacgaat ttagtctgct gatttgcttg ccacatttcc agattgattc cggtttctgg 9601 tacagggttt gcccaattca accctacacc aataactgct tgggtgatga ctcctttgtg 9661 tattttggtt tctgtcaaaa taccgccaag ctttcggctg actaatatta aatcattagg 9721 ccatttaata tcaacaggaa cgccgcagtt tcgcaactct tgagcaattc cccaagcagt 9781 agccagggtg agttggtagc tgttggtagc atatagtttg gggaggcaaa aagcagtcct 9841 attttctgcc gctatattgc cgtttgcaat agccactgaa agatataatc ccccagctga 9901 
tgacatccat tgacgacccc attgtcctct tcctgctgtc tggtcagtcg caatgacgac 9961 acatcctgat tcagcccctt gggctagcaa gtcccagagg atttggttcg ttgaagaaac 10021 agtttcaaaa agatgtagcg aaaattgtaa ataaggactt ttgcgttcta cttgtagagc 10081 agtttccaac ttttgttgat ccaatgccat agcagtttac taaatttcca agtgctagga 10141 taaattggcg gctattgatg tttgttttta ggtttggtta aagatgcacg cttggcactg 10201 gcgcacatgg gaagggttac cctatctaac ctgtagtcta ttggaaccct ggcatcatgg 10261 cttttttact cagcagtttt cacctagtta tccgtcagaa cttacaaagg tgctgcatcc 10321 agaagcatca gcttatcgct taaaacaggt acatggcaat accgtcctga ctccaaaaga 10381 aattgtggct ctcttagcag aaggtggtga ccaggtcgat ggagaaggtg atgatgccct 10441 agtctcagga gatgggttaa tatccaatca gcctctgcaa gcagtatggg tcgcgacagc 10501 tgactgcacg cctgtactga ttgctgatga gaaaacagga caagtagcag cagtgcatgc 10561 gggttggcgt ggcactgcgc tcaagatagt accgcaagcg atcgcccgaa tgcaagcaca 10621 aggtagcaaa cttgaagatt tgcgaattgc catgggacca gcgatcgcag gtgaagttta 10681 tcaagtctca gagcaagttg ctgctgaagt tggggcaagc attataccac aaaacgacga 10741 aaaagcaatt gtcgctgcat tgtacaagct accaaattct cccttactac cagatccaaa 10801 tccagaacgg gtacggttgg atgtgcggcg ggtgaatact ttacagctgg aacagttggg 10861 gattagttcc caacagatgg cgatcgcccc ttattgtact tatcaaaccc cagagtattt 10921 cttttcttac cgccgggaaa agcagaaaaa agttcagtgg tcgggtattg ttagcaatac 10981 tctctaagta tctggaactc acgaattgtt atcagtgaga tgcccatctg gtatgtaaca 11041 gcgcatcaaa aatctagagg actgcgtacc cctcgtccac cacagcccac aatttaaaaa 11101 aagctctctc tcaactcaaa gtgacgttag attgaaaatg gctagaagga aaaaaactca 11161 aatcttggac aaagagtttg aaacgcaaaa agaactaaaa gaatatgttc aaaaaatttt 11221 atacgcatac gagctaaagg aactcttgtc tcctgaacac ttccaattta tcaggtcgct 11281 tctcaataat catcccgatt cctacgagaa aattggtgat ggcatagaaa gcatctggat 11341 acaagaaaat gtagtcaatc gtggaaagtc gagaggtttc tggtttcaac gtatagatcg 11401 ttcaatcgat aacttcagtt acaaagtatg tattgaaagt cctccatcag tgacaagtca 11461 ttttatcatg gcatgtagag aggctgtaga tagttatgtc aatgcttatc gagaaaaaac 11521 ttttcagggg gtagaaatat tgcaatgtcc cactactggt gactcaatca 
ccttaaaaga 11581 atcatatgtc gcgcattcac ctttacactt taaagaactg gcagaagcat tcagaaaaaa 11641 cgaggattta gtcttatcgg aacaattgtt ccaagttcat cgtgatggtg atttcgctat 11701 gagttttgct gatgaagctc ttcgacaaaa gtggatcaaa taccactctg aaaacgctac 11761 ccttgagatt agaagcaaga gcgtacttgg agtcaaagtg tgaaacagat gtggcataat 11821 tgcgctgagt tcacaaccat gttatgtttg gggcgatcaa ttttgaatca aagctacgta 11881 cctcagttca cagagcgata ttgcttaggg ttataatttc aggcaagcga atcccaattg 11941 gaagaatgac tacagtagct attttgccaa tctctaatcc aaacggtgaa aagtcttatc 12001 gcgctattgc aggtgataag caatcggttg gcaaaactgc ggggcaagcc ctagatgcgt 12061 tgacagctca gttaggtgaa actgagttca gtgcactgct tgttattcaa agctttcgtc 12121 ccgatccgtt ctttagtaca gagcaacaaa aacggttatc ggagttgatg agtgtatggc 12181 ggacagcacg agatcaagga caagcgttgc cgccagagca acaagcagag ctagatactc 12241 tcgttgatac tgaactgaga gctgcaacag cccggacagc cgcattaatg cagcaattga 12301 gtcaatgaac cgttacataa gacctaccca aagttatcca atgctagaca ttgttcgcca 12361 ttggtctagc tgaactttgg tgcttacttc ctgattcaac ctggtgatgt cggcatcctc 12421 cagtccacag gctcactccg tctaactgac ggtgatggac gaaagaccta ttttgaccca 12481 aatatgacta tattgtcaat gcacacagct gtacaaaaac cgaaccaaaa tcaaaagtta 12541 cccaagattg agtaagtagg taggcgggaa aaaaccaaac tatgttaaga taagtaaata 12601 cggaga // LOCUS NODE_2707_length_12542_cov_5.17073812542 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12542) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12542) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12542 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(51..245) /locus_tag="DP116_21740" CDS complement(51..245) /locus_tag="DP116_21740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017711731.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21740" /translation="MAVQVAGLRQHVDPHWNRGLSYLKIGLRWLKGVIYKGRQLLSPI PLLPKDQEPCFASNKARQDI" gene 660..1265 /locus_tag="DP116_21745" CDS 660..1265 /locus_tag="DP116_21745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407332.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21745" /translation="MTSRKRSNSNKHQQPKKTTSSVAKKTATPEEIFYSVAHAIPGRI RFRIPRLVKDSEYANKLKQVIESDSRITSVRVNPTAASIVVSYQLGVISDKQMRSHLV NLIQTAPNIVLPPRVTTKSIMRAIFDALINLIDSTRNINQARNAIVYRRFRTDIWERL LSGAKTIIKRLKSATMFVLPNKRWRLPSPDGDATGTALRSA" gene 1433..2545 /locus_tag="DP116_21750" CDS 1433..2545 /locus_tag="DP116_21750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316499.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PA-phosphatase" /protein_id="PRJNA477356:DP116_21750" /translation="MGTHFHNLLNQNPLVRAIHTAVKGRARYKVNGLSRSESLKRYLE LRLSKEEGIGQVRANHDTGNVLVLFHSDFSPNAIASLLERIVLDYRKQGTKLPVRTAY ISTAPEEAKNLSINTRKLNQLTASVQKQESTQTHVKKQLEQAGSQLILVSGTAVSTLV LCTGLLHKYGLDERILLAIQKLHTPLLDRIMLGITSFGDPVFLVLICLALQTGLLYHN RRTQAITLSIAAIGAVSLNCFLKLLFGRARPDLWNWIIDVGQHSFPSGHAMVSIVIYG FLGYILAKEFPQWRGRIFALTVVLIVAIGFSRLYLGVHWPTDVVAGYAVGLVWLIACI KRMELREKYYSSAKYLYTILGLTHSHKPDKVQTIYA" gene complement(2532..2993) /locus_tag="DP116_21755" CDS complement(2532..2993) /locus_tag="DP116_21755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318131.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex ATPase subunit type 1 TsaE" /protein_id="PRJNA477356:DP116_21755" /translation="MRIFLADTTATRKLGITLGQSLNAGNVILLEGDLGAGKTTLVQG MAEGLGITDPIVSPTFTLINEYTQGRLPLYHLDLYRLESNEVAALNLETYWEGVEVTL GIVAIEWAERLPYKPDSYLSVRLTYGDENTRQVEITSHNCAISEEITTMRI" gene complement(3227..4345) /locus_tag="DP116_21760" CDS complement(3227..4345) /locus_tag="DP116_21760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948836.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_21760" /translation="MLLSIKTKLKLSKTQEIIMSKHAGIARFTYNWGLATWNSLFKDG LKPNKYILKKFFNNHVKPEFEWIKEKGICQKITQYAFDNLGDAFSRFFTGIGGYPNFK KKGRHDSFTIDAGGQPIPVGGKSIKLPTIGWVKTYEGLPHTTCKSITISRTADSWFIA FAYEQEHEPTTSQYDVVGLDLGVKELATLSTGVVFPNPKHYKTHLEKLRRLSRKFAIK TKGSSNRNKAKIQLARHHARVANLRKDTLHQITTFLCKNHAKIVVEDLNVSGMLSNHK LAQVIADCGFHELKRQLEYKAKKFGCEIIIADRWFPSSKTCSNCGHIQDMPLKERTYN CKSCGHSMDRDLNAAINLSRLAKACCKSVRSANKLTEG" gene 4403..4582 /locus_tag="DP116_21765" CDS 4403..4582 /locus_tag="DP116_21765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015079730.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21765" /translation="MTFRKNNKLGFTSDRPFDKDPVCFKVLPGVKEKLKAVPDWQERL RGFVDELIKGVENCQ" gene complement(4633..6000) /locus_tag="DP116_21770" CDS complement(4633..6000) /locus_tag="DP116_21770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996455.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21770" /translation="MSIGFLRLALKALQKQSRSRTSHRVNQWFKWLSPGLSVKRWLLI SVGGMILASLGFAIWIRLTPIFWAIKFLRRILGVITDIVPYYISGPLVILGGLLLLLW GQTRTVSSITQVLRSEGDEELIDVLLAHRRLYRGPKIVVIGGGTGLSTLLRGLKAYSA NITAIVTVADDGGSSGRLRQEFGVLPPGDIRNCLAALADEEKLLTELFQYRFKAGDGL TGHSFGNLFLTAMSDITGDLERGVAASSKVLAIRGQVLPATLSDVRLWAELADGRRIE GESKITEAGGTIVKIGCIPANPPGLPAAIKAIKEANYIIMGPGSLYTSVIPNLLVPEI ADAIAASDAPRIYICNIMTQPGETQGYTVADHIRAIDAACGRPLFNAVLVHKKSPSER ALIRYAQQQSHPVFLDREAIAQLGRRIVIANVMHEDETGCVRHNSQRLARVLWRWYSG GHYKK" gene 6172..6663 /locus_tag="DP116_21775" CDS 6172..6663 /locus_tag="DP116_21775" /EC_number="3.1.22.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="crossover junction endodeoxyribonuclease RuvC" /protein_id="PRJNA477356:DP116_21775" /translation="MEQRILGIDPGLAILGFGAVICQKNQDRVQHDSVNLIDFGVINT PAHTEIGQRLCTLYEDLHTLIKEFQPELVAIEKFFFYRMANTIPIAQARGVIMLVLAQ HELPTVEFTPAQIKQALTGYGNAEKQEVQQAVARELNLDYIPHPDDAADALAVALTAW FQI" gene complement(6920..7636) /locus_tag="DP116_21780" CDS complement(6920..7636) /locus_tag="DP116_21780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_21780" /translation="MAPAKILVVDDDPAVRNLIQRFLIKQSYQVESAEDGKTALALFE QFNPDLVILDVNLPDVIGFNLCQEMQSRNGVFVLMLTSRADEADKIRGFSKGADDYLT KPFGLGELEVRVAAILRRQRVVTTAEQKRLVFEKLMIDPVRREVTLNSQPVPLTALEF DLLHFLASHPGRVWRRAELIQEVWDYEYVGDQRVVDVHIGQIRKKIEIDASQPALIQT VRGVGYKFESSSQGQQFEKS" gene 8317..8946 /locus_tag="DP116_21785" CDS 8317..8946 /locus_tag="DP116_21785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651352.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_21785" /translation="MSDEQNPYEKLGLSENASFDEIQDVRNRLLEQHSGDGKRLEAIE AAYDAILMERLKMRQEGKIKVPERIRFPERLMPSLAKESQTPRQQSPAWLQRILDKPT LTETLLPGAWYVGLSAISVFYQPGSDQVLQLTLVVGVCVSIYFLNRKEKKFGRAALLT LIGLTTGLIAGGLVAKWLIPQVQMINVTRNQFSSVVTFVLLWLISSFLK" gene complement(9001..9699) /locus_tag="DP116_21790" CDS complement(9001..9699) /locus_tag="DP116_21790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743253.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HAD family hydrolase" /protein_id="PRJNA477356:DP116_21790" /translation="MLRLITDFDGPIIDVSERYYRVYQFCLEKIQCPNQQVQQLAKAE FWQLKRSRVPEKKIGIISGLDEAQAQEFSQLRRQTVHTESYFEYDTLAPGAVEALLKV QQAGVDLVVMTMRRIRELDYAFKKHDLGTFFPENRCYCLSNDYVKTRDTDDKPLLMAR AIQELPPATDTWMVGDTEADIISATKHNIKVIAVECGIRDRAQLEQYQPNLIARDLSS AVDLVLERTLQQKS" gene complement(10010..11182) /gene="hppD" /locus_tag="DP116_21795" CDS complement(10010..11182) /gene="hppD" /locus_tag="DP116_21795" /EC_number="1.13.11.27" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198936.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="4-hydroxyphenylpyruvate dioxygenase" /protein_id="PRJNA477356:DP116_21795" /translation="MKIDHVHFYVEDAKIWRDWFVRHLGFQVVADSAVLPTFLEKADN IPWKAREDDTFHTCTQVVKSGPICFLLSSPLLPTSPVAEFLRYHPPGVADVAFCVENL EEIISLAQVHGAKILQPIQQNQYGQECIKWSKIAAWGSLTHTLIETTGGGNEETRESQ RITESGSVEKYPSLFPSSSSWFAGIDHIVLNVNAGDLEHAVAWYENILNFQGQQTFNI KTSRSALHSQVMVSRSGDVQLPINEPGSANSQIQEFLNVNQGSGIQHIALRTRNIVHA IAQFRASGLSLLPVPQDYYSQVRQRKGLPLSIDELDTIAGQEILVDWKQETFFGSKNN RTPLLLQIFTQPIFGQPTFFFEFIERRYQAQGFGEGNFRTLFAAIENEQIKRGSLG" gene complement(11691..12476) /locus_tag="DP116_21800" CDS complement(11691..12476) /locus_tag="DP116_21800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749051.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_21800" /translation="MIPVLQVGDRAINNNELIPLLKQYGILPQLMREIIIDSAIANDT LTPEENTQAYKQFYQQHQLNSEDDLQAWLQVRGLNREQLDYLATRNIKLENFKRSTWG DKLQSYFLQRKAKLDRVVYSLIRVKDICIAQELYFRIQEGEQSFSELAREYSQGPEAQ TGGLIGPVELGVPHPVLANMLASVQPGQLLPPTSLGDWIVVVRLEKLLPSQLDEEMRQ RLLNELFEHWLQTQLKENLQNIQEKSPQHQVEQNAVKSVNSEQ" BASE COUNT 3582 a 2726 c 2614 g 3620 t ORIGIN 1 tgaggctagg ggactctctc tattctatgt cctaaccttc ttggcaactg ctatatatct 61 tggcgagctt tattggaagc aaaacaaggt tcttggtctt taggcaacag agggattggt 121 gatagtaact gtcgtccctt gtaaataaca cctttaagcc agcgtaaacc aattttgaga 181 tagctcaaac cacgattcca atgaggatca acatgttgac gtagaccagc gacttgtaca 241 gccatgccct ccgttgtgct gtaaagaaaa gctaaggctg caattacaac tgggagattc 301 atgccatcaa aacgttcttg gtggagtatg actccagtta gatatagcaa tgctatttga 361 gttgtaagat tgggtgttat ggtcgtcctt tatcctttgt ttctagtcct gtgtagcgac 421 aaattacaaa tgataaagga caaaaaggac gaagccgacg caattagtag gtaggtcaaa 481 gtcaacccaa ctatgtaaag gtttattaac tgggaaaaaa gcttaattta agttaacctc 541 aaactattta taaaactaaa catagttaac ttttctaacg cctaccaagt ttctgacaaa 601 ttctaaaaaa ttagaaattt tagatatctt gtttcctttt ttaaactgaa 
acctttagta 661 tgacttctag aaaacgctct aattctaaca agcatcaaca accgaaaaag acaacctcta 721 gtgtagctaa gaaaacagcg acacctgaag agatatttta tagcgttgcc catgcaattc 781 ctggacgaat tcgttttcgt attcctcggc tagttaaaga ttctgagtat gctaataaac 841 tcaagcaggt gatagaatct gactctagga ttacgagtgt ccgtgttaac ccgacagctg 901 catctattgt cgtcagttat caactaggcg ttatttcaga taagcaaatg cgatcgcatc 961 tggtcaatct tattcaaacc gccccaaata tagttttgcc accacgagta acaacaaaat 1021 ctattatgag agctatcttt gatgctctca tcaacctaat tgacagtaca cgtaatatca 1081 accaagcgcg taacgctatt gtgtatcggc ggtttaggac agacatttgg gaacggctac 1141 tctctggcgc gaaaaccatc atcaagagac taaaatctgc taccatgttt gtcttgccaa 1201 acaagcgatg gcgattgcct tctccggacg gagacgctac gggaacagca ctgcgttccg 1261 cgtaatcgca aagcataacg aatcaagggt aagccgtgcg tagcgcggct tctcatagag 1321 tcgctggcgt tgtgtcgttt ctgccaacac agttatcatt gacatcgttt ccggttaatg 1381 ttccacctgt caaatttaga tacagtttga ctttaatcag ctaagtagaa aaatgggcac 1441 tcatttccat aacctcctca atcaaaatcc tttagttcgg gcaatacaca ctgctgtcaa 1501 aggaagagcg agatacaagg taaatgggct ttctcgttcg gaaagtctca aaagatacct 1561 tgaattgaga ttatcaaaag aggaaggcat tggacaagtt cgtgccaatc atgatacagg 1621 aaatgttctt gttcttttcc attcagattt cagtccaaat gctatagcct cgcttcttga 1681 aagaattgtc ttggattaca gaaaacaagg cacaaaatta cctgtaagga cagcttacat 1741 cagcacagcg ccagaagaag caaaaaattt atctataaac acaagaaaat taaatcagct 1801 gacagccagc gttcagaaac aagaaagcac ccaaacacac gtgaaaaaac aattagagca 1861 agcggggagt caactgattc tagtatcagg aactgctgtt tctactcttg ttctatgcac 1921 agggctgctg cacaaatatg gtcttgatga acgtatttta ctagcaattc agaagctgca 1981 tacgccactt cttgatcgca ttatgcttgg tattacttct tttggcgacc cagtcttttt 2041 ggtgttgatt tgtttagcat tgcaaacggg actgctttat cataaccgtc gcactcaagc 2101 aattaccttg agcatagctg caattggtgc tgtaagctta aattgtttcc taaaactgct 2161 gtttggtaga gcacgtccag acctatggaa ctggattatt gatgtgggtc aacacagctt 2221 tcctagtggt catgcaatgg tgtcaatagt gatttatggc tttctaggct atattttggc 2281 aaaggaattt cctcaatggc gaggacgaat ttttgccttg actgttgtct taattgttgc 2341 
aataggtttt agtcggcttt atctcggtgt acactggcct actgatgtgg tagctggcta 2401 tgctgtaggt ttagtatggt taattgcctg tattaagagg atggaactga gggaaaaata 2461 ttactcatca gctaaatatt tatacacaat tttaggactg acacattcac acaaacctga 2521 caaagtacag actatatacg catagtagtg atttcttcgc tgatagcgca attatgcgag 2581 gtaatttcca cttgacgagt gttttcatcc ccataagtca agcgtacact aagataacta 2641 tctggtttat aaggcaaacg ttccgcccac tcaatcgcga caattcctaa tgtcacctca 2701 acgccttccc agtaagtttc taagtttaag gcggcgactt catttgattc cagacgatat 2761 aaatccagat ggtaaagggg aaggcgtcct tgtgtgtact cattaatcag agtgaaagtt 2821 gggctaacaa ttggatcagt gatgcccaaa ccttccgcca tcccttgtac taaagtcgtt 2881 ttgccagcac ctaaatcacc ttctagtaaa atgacattac cagcatttag tgattgacca 2941 agagttattc caagctttcg cgtcgcggtt gtatctgcca gaaaaattct cattatcact 3001 cacccccctt cacttcattc aaaattatga atatgcccta cacgcaattc gcagcttctc 3061 cctgggaggt acgaaattca aaattatttt ttgttcgcgc agcctgtccg taggacatac 3121 tttgaatgag tgaattttga attggtttgt cttgttattt tctgctatat aaatcctaga 3181 caagtctaga catttacttc ttacttcaca gggagcatgg gagcggttat ccctcagtaa 3241 gcttattcgc agagcgaacg ctcttgcaac acgctttagc caaacgcgat agattgattg 3301 ctgcgttcaa atccctgtcc atcgaatgtc cacaactttt gcagttgtaa gttctttctt 3361 ttagcggcat atcttgaata tgtccacaat tggagcaggt tttacttgat ggaaaccaac 3421 gatcagcaat gattatttca caaccaaact ttttagcctt atattccaac tgccgtttca 3481 actcatgaaa tccgcaatca gcaatcactt gtgccaattt gtggttagat agcatcccag 3541 aaacgttcaa atcttctact actatttttg cgtggttctt gcataagaaa gtagtgattt 3601 ggtgaagagt atctttcctg agattggcta ccctagcatg atgccttgca agctggatct 3661 tggctttgtt tctattacta gaacctttcg tctttattgc aaactttcgt gatagtctgc 3721 gaagcttttc tagatgggtt ttgtagtgtt ttgggttagg aaacactaca ccagtagaca 3781 gtgtagccaa ttctttgaca cctaaatcaa gtcccacaac atcgtactgc gaagtcgtcg 3841 gttcatgttc ctgttcataa gcgaaagcaa taaaccaact gtcagcagtt cgcgatattg 3901 tgattgattt acatgtggta tgtggcaatc cttcataggt tttaacccaa ccgatagtgg 3961 gaagttttat tgacttacca cccactggta ttggttgacc gccagcatca atagtaaaag 4021 aatcgtgacg 
acccttcttt ttgaaattag gatagccacc aatgcccgta aaaaatctgg 4081 agaaagcgtc acctaaatta tcgaaagcgt attgggtgat tttttgacaa atgccttttt 4141 ctttaatcca ttcaaactca ggttttacgt ggttgttaaa gaactttttc aaaatatact 4201 tgttaggctt taatccatct ttaaacaagc tattccaagt agcaagcccc cagttgtaag 4261 taaatctagc tattccagca tgtttactca ttattatttc ttgagtttta cttagtttta 4321 attttgtctt gatggataaa agcattgttc gacctctaat gcagtgttag tgttagtata 4381 tcaacaacta cagcatattg ccatgacttt ccggaaaaac aataagttag gctttaccag 4441 tgatcgcccg ttcgataaag accctgtatg ttttaaggtg ttgccaggag ttaaggaaaa 4501 gctaaaagct gttcctgact ggcaagaacg gcttagaggg tttgtcgatg aactaattaa 4561 aggtgtagaa aattgtcaat agaggtatag gcttgtctac gttattggcg gcttaggaca 4621 aactcaacct cttcactttt tataatgccc tccactatac cagcgccaca acactcgtgc 4681 tagtcgttgg gaattgtgac gcacacaacc tgtttcatct tcatgcatca cgttagctat 4741 cacaattcgt cgtcccaact gggctatggc ttctcgatct aggaaaacag gatgggactg 4801 ttgttgggca tagcggatta gtgcgcgttc actaggagat tttttgtgta ccagtacagc 4861 gttgaatagt ggtctaccgc aagcagcatc aatcgccctg atatggtcag caacagtgta 4921 tccctgagtt tccccaggct gtgtcatgat gttacatatg tagatacggg gtgcgtccga 4981 ggcggcgatc gcatcagcaa tttctggtac caacaaattg ggaataacgc tggtgtaaag 5041 actaccagga cccataataa tgtaattagc ttctttgatt gctttgatag ccgctggcaa 5101 accaggggga tttgctggaa tgcagccaat cttgacaata gtaccacctg cttcggttat 5161 tttagattcc ccttctatac ggcgaccatc ggctaactca gcccagaggc gcacatcgct 5221 gagggttgct ggcaaaactt gtccccgaat ggctaaaact ttagaactgg cggcgacacc 5281 tctttctaaa tcgccagtaa tgtcactcat cgctgtcaag aacaaattgc caaaactgtg 5341 accagtcaaa ccatccccag ctttaaagcg gtactgaaac agttctgtca ataatttttc 5401 ttcgtcagct agtgctgcta aacaattacg aatatctcct ggtggtaaca cgccaaattc 5461 ctggcgtaac cgccctgaag aaccgccgtc atcggcgaca gtaacaatgg cggtaatatt 5521 ggcgctgtaa gctttcaaac ctctaagtaa tgtagaaagt ccggtaccac caccaatgac 5581 tacaattttc ggtccccggt acaatcggcg atgtgccagc aagacatcaa taagttcctc 5641 atcgccttct gatctgagta cctgagtaat tgaacttact gtccgagttt gtccccacag 5701 aagcagcaac aagccgccca 
gtatcaccaa aggaccgctg atgtagtacg gtacaatatc 5761 ggtgatcact cccagaatgc gcctcaaaaa cttaattgcc caaaaaattg gcgttagcct 5821 aatccagata gcaaatccca gacttgctaa aatcatacct ccaacactaa tgagcaacca 5881 tcgtttcact gatagcccag gggacaacca cttgaaccat tggttcactc gatgggaagt 5941 gcgactgcgt gactgctttt gcagggcttt gagggcaagt cttagaaaac caattgacat 6001 acctgatcca cagcagtact actaacaatg attaacagaa aatgtacacc acctggttga 6061 aatccaagtg tggaattgtt acacttttca atccgattgg actaaaaatg catagtctaa 6121 attgccagta taccgagtca atacgctttg tgatcaaaaa acaacaaatc aatggaacag 6181 cgaattttag gaatagatcc gggactagca attttaggat ttggggcggt tatatgccaa 6241 aaaaatcaag acagggtgca acatgattca gtcaacctaa tagactttgg tgttatcaat 6301 acgccagctc atacggagat cggacaacgg ctatgtacgt tgtacgagga tttgcacacc 6361 ctgatcaaag aatttcaacc tgaactggta gcaatagaga agttcttctt ctatcgtatg 6421 gcaaatacta tccccattgc tcaagcgcgg ggagtcatca tgttggtgtt ggcacagcat 6481 gaattaccaa cagtggaatt tactcctgcc caaatcaaac aggcgttaac gggatatggt 6541 aacgctgaga aacaggaagt tcagcaagct gtggcgcggg agttgaattt agactacatt 6601 ccccatccag atgatgcagc agatgcattg gctgtcgccc tgacggcgtg gtttcagatt 6661 tagttgtccc ttaataaaaa atgaggcaag tctatttaag aacagcagac tcaaagagaa 6721 aatgtttcat tttgtctttt atctttgctt ctgcaacgcc cagatcttga agtcaagaaa 6781 ctgaggcttt aaaacaatgc ctggttagtg cttgtgcagt agatttgatg aagcagttgt 6841 atgaaggtta attttttaac cagattcccc ggcaaaataa ccgtgattca cattatttcg 6901 ccaatatttt cgacgattat taagattttt caaattgctg accttgagaa gaagactcaa 6961 acttataccc gacgccacgt acagtttgaa ttaacgctgg ttggctggca tcaatctcaa 7021 tcttcttgcg aatttgaccg atatggacat ctacgaccct ttggtcgccg acgtactcgt 7081 agtcccaaac ctcctgaatc agttcagcac gccgccaaac tcgaccagga tggctagcta 7141 aaaaatgcaa caggtcaaat tccaaagcag ttaaaggtac tggttggcta ttaagtgtga 7201 cctcccgccg tacggggtca atcattagct tttcaaagac tagcctcttc tgttctgccg 7261 tagtaacaac acgctgacgc cttaaaattg ctgctactct cacttctagt tcccctagac 7321 caaagggctt ggtgagataa tcatccgcac ctttagaaaa gccgcgaatc ttgtcggctt 7381 cgtctgcacg acttgttagc atcaatacaa 
aaactccatt acggctttgc atctcttggc 7441 agaggttaaa tccgatgacg tctggtaaat ttacatctag aattaccaag tctggattaa 7501 attgctcaaa cagcgccagg gcagttttac catcttcggc agattctacc tgatagcttt 7561 gcttaatcaa aaagcgttgg attaagttcc gaaccgcagg gtcgtcatca acaacaagaa 7621 tcttggcagg agccatgacc atcactttac acaaaaattt ataggattaa caagcgaatt 7681 ccgtcaaaaa atggttttcg cattggagac aatatgtatc tacaaggcta aggtatgatt 7741 tccctgatta ttcaaatact agtttaatca ggattatcaa caattagggc gtaaatattt 7801 tggcaatgct accctttgct ccagtattcg gcttttttgt tcattttgga tccacaaggc 7861 aagtcctcta accttagcca cgacaactgg ttcaacttgg acacaagttt atcccaggtg 7921 tggaataatt taaaaccctt taattttaaa tcgagatgcg acttcatggc acgaggaagc 7981 aaaagacaat tatttacaac tcacctcgaa gtgccgtaaa ttgttgaatc tgaaacacaa 8041 gagatttttt catcgtaaaa ttgccttcta ggtatttttt gcttaaattc tagagtaaag 8101 ccctagttaa tgaacgacat gaggggatac acggagctat ggctatatgt taaagtgact 8161 atagttaaaa ttagtaaact agtgtcgctg acctgtgcta atatccgggt aaaggatacg 8221 aattgaccaa aacattggtt tagaccaaga taaaaatata gtattttttg tttagcaaga 8281 cattgccgct gactgaggct aaagtaataa actcccatga gcgacgaaca gaatccctac 8341 gaaaaacttg ggctatcaga aaatgctagc tttgatgaaa ttcaagatgt tcgcaatcgt 8401 ctattggagc aacacagtgg cgatggtaaa cgtctggaag ctatagaagc tgcgtatgat 8461 gcgattttga tggagcgctt aaagatgcgc caggaaggca aaattaaagt gcctgaacgc 8521 atccgatttc cagagcgtct aatgccatcg cttgcaaaag aaagtcaaac tcctcgccag 8581 cagtcaccag cttggctgca acgaatccta gataagccaa cattaacaga gactttgctg 8641 cctggagctt ggtatgtagg attgagtgcc attagcgttt tttaccaacc cggaagcgac 8701 caagtattgc aattgacatt agtagttggg gtgtgtgtca gtatttactt tctcaatcgt 8761 aaggaaaaaa agtttggtag agcggcttta ttaacgctaa ttggtctgac tacaggctta 8821 attgcaggag ggctagtcgc caaatggctt atccctcaag tacaaatgat aaatgtgact 8881 cgaaaccagt tttctagtgt ggtcacattc gtattactat ggcttatcag tagcttttta 8941 aagtagatgg gtttttggat agtaggtaat aaagaatagc tgccttcatc caattccaac 9001 tcagcttttc tgttgtaggg ttctttctag aactagatct acagcagaac ttaaatccct 9061 agcgattaag tttggctggt attgttctag ttgagcgcga 
tcgcggatgc cacactccac 9121 agctataacc ttaatgttgt gttttgtggc agaaatgatg tcagcttctg tatcccccac 9181 catccaggta tccgtggcag gggggagttc ttgtattgcc cgcgccatca acaaaggttt 9241 atcatcagtg tcacgggttt tgacatagtc gttactcagg cagtaacaac gattttctgg 9301 gaaaaatgtc cctaaatcgt gtttcttaaa agcataatct aattcccgaa ttcgacgcat 9361 agtcataacg actagatcaa caccagcctg ttgaactttt aacagtgctt ctacagcacc 9421 aggagctagg gtgtcatact caaagtaaga ttcggtatgc acagtttgtc ggcgcaattg 9481 agaaaactct tgtgcctgcg cttcatctag accagaaata ataccaattt ttttttcggg 9541 aacgcgcgat cgcttcaact gccaaaattc tgcttttgca agttgttgta cctgttggtt 9601 tgggcattga atcttctcca agcagaattg gtaaacacga tagtagcgct cagaaacatc 9661 aatgatggga ccatcaaaat ctgtaataag tcttaacatc tttaattaaa atgtaatttt 9721 tacttacaga atatcccaaa tcatggtgtt ttgtgcgttt gaaaaattga tcaatcaatt 9781 ttagatttct aatatgtcat ccaaccgaga aaaataagcc agcctgcact aattgagcgc 9841 cagaggcgcg tgaagcagct tatgcctacg gcacgttgcg cgtatctcct caggagacgc 9901 tatgcctacg gcacggctgc gcctatcgct aacgcgcccc aggcgtgcgc ttcgcgcata 9961 cacccaagcc gcgtgccctg tggacagttg gctcattttt ttgaacgttt cagcccagac 10021 tcccacgttt gatttgctca ttttcaattg ctgcaaataa agtgcgaaag ttaccttcac 10081 caaaaccttg agcttggtac cgacgctcaa taaactcaaa gaaaaaggtc ggctgtccaa 10141 agataggctg ggtgaaaatt tgcaatagta aaggcgttcg gttgttcttc gagccaaaaa 10201 aagtctcttg tttccaatcc accaaaattt cctgtcctgc aatagtatcc agttcgtcta 10261 tagatagggg aagccctttt cgctgccgaa cttgtgagta gtaatcttga ggaactggga 10321 gtaacgatag accactcgcg cggaactggg ctattgcatg aacaatattg cgtgttcgta 10381 aagcaatatg ttgaattcct gatccttggt tcacgttcaa aaattcttga atttgagaat 10441 tggctgaacc tggctcatta attggcaatt gtacatcacc gctacgcgaa accatgactt 10501 ggctatgcaa ggcagagcga gatgttttaa tgttaaacgt ctgctgtccc tgaaaattaa 10561 ggatattttc gtaccaagcc actgcatgct ctaaatcacc tgcgtttaca tttagcacga 10621 tatggtctat gccagcaaac caagaagatg aagaaggaaa gagggaagga tatttttcca 10681 cactcccaga ctctgtgatt ctctgactct ctcgtgtttc ctcatttccc ccccctgttg 10741 tttctatcaa agtatgtgtc agggaacccc 
aagcagctat tttgctccac ttaatgcatt 10801 cctgaccata ttgattttgc tggatgggtt gcaggatttt ggccccgtgg acttgcgcaa 10861 gggaaataat ctcttctaaa ttttcaacac aaaaagcaac atctgccaca ccaggcgggt 10921 ggtagcgcaa aaactcggct actggactgg tgggtaatag cggggaagac aacaaaaagc 10981 agatgggacc acttttgacc acttgcgtgc aggtgtgaaa ggtgtcatcc tctctagcct 11041 tccaaggaat gttgtctgcc ttttctagaa aagttggaag aacagcagaa tcagcgacta 11101 cttgaaaacc taaatggcgt acaaaccaat cgcgccatat tttggcatct tccacataga 11161 agtgaacgtg atcaattttc atgctctttg aaaatccagc acaactgttt atctgtttgt 11221 aaattttgcc tcttttactg gattttcagg tcattcctaa gcaagaatct ggattgttca 11281 acacaagtaa ggaatgagga ttcactgatt ttatagataa aacacttgtg gaggacattt 11341 gagaaattcg tggcaatact cacccaagag aaaaccgctg tataaaaatg atgtgtttca 11401 aaccgagttg cgtatctcca tttgaaacct tgataactgt ttaccctagt gggaacggcc 11461 acgcctgacc caagagcgcc agtcgccagg gcgtgggaga cccctatgcc tacggcacgg 11521 ctggggcttt cacgggtatg cctggggtac gcctgaccca agagcgccct tacgggttcg 11581 gcacttcctt caagtcgggg aacccgccca acggagtgcc tcaccagttg cctgccttcg 11641 ggttaccctc ccgcacttgg ggttgtcgtt accgtcattc gcgcgcttac tcactgttca 11701 ctgttcactg atttaacagc gttttgttca acttgatgtt gtggagattt ttcctgtata 11761 ttttgtaaat tttcttttaa ctgggtttgt agccagtgtt caaatagttc gttaagcagt 11821 cgctgtcgca tttcttcatc taactgagat ggcagtagct tttccagacg tacaaccaca 11881 atccaatctc ctaaagaagt tggtggtaaa agctgaccag gctgaactga tgccagcata 11941 tttgccaaaa ctggatgggg aactcccaat tccactggac ctattaagcc acctgtttga 12001 gcctcaggac cttgagaata ctcgcgagcc agttcgctaa aagactgctc tccttcctga 12061 attcggaagt aaagctcttg agcaatacaa atatctttga ctcgaatcag agaatagaca 12121 accctatcta actttgcctt gcgctgaaga aagtaggact gtaacttatc tccccaagtg 12181 cttcgcttga agttttccag tttgatgtta cgtgtggcta gatagtcaag ttgttcacgg 12241 tttagaccac gaacttgtaa ccatgcttgc aggtcatctt cagaattcaa ctgatgctgt 12301 tggtagaatt gcttatacgc ctgagtattt tcttctggtg ttagagtatc gttggcgatc 12361 gcgctgtcga taataatttc tcgcatcaac tgcggcagta ttccgtactg cttcagtagg 12421 ggaatgagtt 
cgttgttgtt aatggcgcga tcgccgactt gtaaaactgg aatcataacc 12481 tcctacatgt tatagttctt acattatggt caatagagtc aattaactct tcagatttta 12541 tc // LOCUS NODE_2725_length_12450_cov_5.772892 12450 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12450 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(189..956) /locus_tag="DP116_21805" CDS complement(189..956) /locus_tag="DP116_21805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21805" /translation="MAEAYKAERTISAVFKEQKQVDDVIRRLLDRGVSRDHISVMGRN FQSETRIAGFITKRDVILGGLRSGAVFGSLFGSFLSLLTGVGVLFIPFVGPIVAAGPI GAVLLGAASGAIAGSAGAGLVSVLTTLGMPEDKAAVYQTRLEAGEFLVMAEVPGDRSG EYQLLIQTAGGEEVQTIEKALPRACAGGCNSPEDLSPEIRSHLSGEAQGKFIERYNTV IKETNDEAKAEHAAWEMIHQQYDEDENGVWSKAKTTV" gene complement(1097..1636) /locus_tag="DP116_21810" CDS complement(1097..1636) /locus_tag="DP116_21810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA starvation/stationary phase protection protein" /protein_id="PRJNA477356:DP116_21810" /translation="MRKLNIGLTDEQRQGVINLLNQDLADAYVLLVKTKKFHWDVVGP QFRSLHQLWEEHYQTLTETIDSVAERVRALGGYPVGTLEGFLKIATLKEEAGTVPTAT GMVTRLVEDHEQIIRNLRDHIDQSSENFHDEGTADFLTGLMEGHEQMAWMLRSFIEGQ ELQPDGSQTLGQTKTPVGV" gene complement(1897..2256) /locus_tag="DP116_21815" CDS complement(1897..2256) /locus_tag="DP116_21815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21815" /translation="MKNIGLGLSMLRPVRFLIVVFTCALLIFSYAFPAAAIGTQRSNP EKGEAQLLETQKKTDEVTAKPPLGLKETTERANEGLNEVQGAADINKMNRPENSQQAT SAEEQVENFLEKITGKK" gene 2256..2558 /locus_tag="DP116_21820" CDS 2256..2558 /locus_tag="DP116_21820" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21820" /translation="MKLLQSNGDPKDTDSFKRWQARILDTAYHKVWIDYSLPDLVSIE ILLSSGCKAILIFKQKPLLQFSPTRVLKFSCTSIFQSLIAASISILSRFNLIFFLL" gene complement(2867..5701) /locus_tag="DP116_21825" CDS complement(2867..5701) /locus_tag="DP116_21825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862167.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chemotaxis protein" /protein_id="PRJNA477356:DP116_21825" /translation="MFNKTDAAKNIDSQNETSVSDSEDFTMESNGIKRPNEANTEIRR KYTQNRASLTLKRLSLGTKATLVAIAIGTLPVLAIGALAYLVASNSLSNKISQIQQAE AKGLADKVNRFMIDRYNNIQLISSLPIFTDSKFSNTFSQKEKQELLDKFVDTYKVYDS IAVFNLNGDVIAQSQGDSITNQSNSQDFKAVRQNDRFYISQPEVTKNTRNLGVIIAAP VKRTTGETIAIVRARIAVKALDNVIKNSVESGRKYYLTDGSEKFFLATDKKDLSRSAI AIFPSLAQRQAQKKDDTFITVNQTDNQQQLVSYSSRTRDGLPNLKWRFILATDTATAF EPQRQLLLTVAIGTGLTALIVALIAVWLAKRTTEPIAKATAAVAKLGLGELDTRLEVE SEDELGILSHNINQMASQLQALMKDQELDTERAKLLTEITLRTRRSLKVEDIYKTAVR EVRQAIKTDRVVIYKFNLETLDGNVVAESVTAGLPRMLGVHIDDPCFRERHVDSYKDG RVRAISNIYQDSSLSNAACYIKMLEKFAVKANLVAPIIIAGQLVGLMIAHHCDSPRNW QQTEIDLFRQLATQVGYALEQAQLLEEVEKARRVAERVSEDERQQKETLQMQLLELLS EVEGAASGDLTVRAEVTAGEIGTVADFFNSIVESLRLIVTQVQQTATQVNQAIWTNSG AIGELAEEALIQAEEINRTLDAVDHMSQSIQQVAYSADQAATIANSAAQTAKKSGIAM DMTVQNILYLRETVGETAKKVKRLGESTQQISRVVALINQISMQTNLLAINAGIEAAR AGEEGQGFAVVAEEVGELAARSAAATTEIEQIVENIQRETTSVVQAMEMGTTTVVEGT RIVEDAKQNLAQILEISHQIDMLVQSISLATASQAQTSQTVSQLMKDIAASSQRTSDS SIRVSQSLHETVEISQQLQASVGTFKVNLN" gene complement(5766..6260) /locus_tag="DP116_21830" CDS complement(5766..6260) /locus_tag="DP116_21830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chemotaxis protein CheW" /protein_id="PRJNA477356:DP116_21830" /translation="MNSSKLVSLKQNQNNLGDGYLKFRLNRHTSATLPMRHTQEAVVV PPETISSMPNMPACIIGLINWRSRIVWVIDLPRMFNLEYLDNRQRQYNVIIIRVDSVL LGLVVQEIQGTTKFLPDEIRSPMGQVASSLTPYLRGCVVQQKEILLVLDAQAIVQSSI LRSD" gene complement(6266..6643) /locus_tag="DP116_21835" CDS complement(6266..6643) /locus_tag="DP116_21835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457039.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_21835" /translation="MSITLLGTILIVEDSLSELELMSHYLVESGYNVIQATGAKEALE KAQLQNPDVIVTDVVMPGMSGFELCRSLKRNPVTQKVPIVICSSKNQEIDRLWAMRQG ADAYVTKPYTREQLLRAIKSVVI" gene complement(6710..7777) /locus_tag="DP116_21840" CDS complement(6710..7777) /locus_tag="DP116_21840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319179.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_21840" /translation="MSTTPIGSYKFFQKLHPLSLLAQLTSRRATGCLQVFTESASWSI YLEDGKLVYASSDKMFERLENLLGRLSQQTHTLNSTSLMRVRLIFDQNKDHQSTPQPD YQAICWLVNEEYITPPQAAVLIDELAREVLESFLIIKQGSYEFKSETPLNELPKFCRL DLRLLVENCQKQLRHRQQTQSTVDTQVTSHFVSTTEVSATRHQVTPGEELPKQNNFDI SHLNSNKDSHQSREKSAYTVACIDDSPTVLNSIKHFLDESTFSVVMISDPVKALMQIL RSKPDLILLDVEMPNLDGYELCSLLRRHSAFKNIPIIMVTGRTGLIDRAKAKMVRASG YLTKPFSQSDLLKMVFKHIDN" gene 9493..10572 /locus_tag="DP116_21845" CDS 9493..10572 /locus_tag="DP116_21845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948836.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_21845" /translation="MLLSIKTKLKLTAEQKTIMSKHAGIARFTYNWGLATWNSFVKDG LKPNKFILKKFFNNHVKPEYGWIKEKGICQKITQYAFDNLGDAFSRFFSKKGGYPRFK KKGHHDSFTIDASGKPIPVGGTSIKLPTIGWVKTYEGLPHTTCKSITISRTADSWFIA FAYEQEHEPTVKQHEIVGVDVGVKELATLSTGVVFPNPKHYKTHLEKLRRLSRKFTNK QKGSNNRHKAKIQLAKHHAKVANLRKNTLHQVTTFLCKNHAKIVVEDLNVSGMLSNHK LAQVIADCGFHEFKRQLEYKAKKFGCEIIIADRWFASSKTCSCCGVKKETLTLGERVF ECEHCGHVMDRDLNAAVNLSRLAKA" gene complement(10740..10824) /locus_tag="DP116_21850" tRNA complement(10740..10824) /locus_tag="DP116_21850" /product="tRNA-Leu" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(10788..10790),aa:Leu,seq:tag) gene complement(10995..12182) /locus_tag="DP116_21855" CDS complement(10995..12182) /locus_tag="DP116_21855" /EC_number="5.1.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319178.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alanine racemase" /protein_id="PRJNA477356:DP116_21855" /translation="MLSHEPKSSVTSDQQWDTYPWLSQRAWVEINLSALSYNVKQLLS ILSPQTQLMAVVKADAYGHGAVTVAQTVLDSGASWLGVATVPEGIQLRKAGIKAPILI LGATHTREQIHAIAQWKLQPTLISPKQALVFSNTLEAINYKTPLPVHVKLDTGMSRLG TNWQEAAEFVQLVQRLPDLTIASIYSHLATADSLDPTTMKQQQQRFEQVIAQLRTLGI EPPCLHLANSAATLSDKALHYDIVRVGLAVYGLYPADHLCLSIDLKPVLQVKARVTQV KTIPPGTGVSYSHQFIASDELRLAVIGIGYADGVPRNLSNKMQVLIRGQRVPQIGTIT MDQLMLDVSALPDVQEGEVVTLLGTEGKEQITAEDWANQLNTISWEILCGFKHRLPRV AVA" BASE COUNT 3460 a 2724 c 2461 g 3805 t ORIGIN 1 catgtggcgt ccctgggcta aagccacagg gttttcatct cacccactat aaattgcctt 61 gaaaagtagg tcagagtcaa gagtgaaggt aacaagtcaa cagtcaacag tcaacagtca 121 acagtcaagc actttttgat taatgactaa tgactaatga ctaatgacta atgactaatg 181 actatttgct aaacagttgt ttttgccttt gaccagacgc cattttcatc ttcatcatat 241 tgttgatgaa tcatctccca agctgcatgt tcagcttttg cttcatcatt tgtctctttt 301 atgacagtgt tataacgttc tatgaattta ccctgagctt caccagaaag atgagagcga 361 atctctggag ataaatcctc tggactgttg cagccaccag cacaagcccg aggtagtgct 421 ttctcaattg tctggacttc ttcgccacca gcagtttgta tcagcaattg gtattctcca 481 gagcgatcgc caggcacttc tgccatcact aagaactcac cagcctctaa gcgagtttgg 541 taaactgctg ctttgtcttc tggcataccc aatgtggtca aaacagagac caaacctgca 601 cctgcacttc cggcgatcgc accacttgca gctccgagca atacagcacc aatcggacct 661 gctgcgacaa tcggaccgac aaagggaata aatagtacgc ctacacctgt gagtaagctg 721 aggaaggaac caaacaagga accaaagact gccccgcttc tcaagcctcc cagaatgaca 781 tctcttttag taataaaacc tgcaattcgg gtttctgact gaaagtttct gcccataacc 841 gaaatatggt ctctggacac accccggtct agtaaacgcc gaataacatc atcaacttgc 901 ttctgttctt taaaaaccgc tgagatagta cgctctgcct tatatgcttc tgccacttga 961 acactccttt ttgcttttcc agttggcaga ctgagggggt agatgcactc ctatttactc 1021 atacctcaat ctgccatttt aaactcacac ctttagctgt taactgttca ctgatttggg 1081 cgaatgccaa cacagattac acgcctacag gagtttttgt ttgtcctagg gtctgtgagc 1141 catctggctg taactcttgt ccttcaataa atgagcgtag catccaagcc atctgttcat 1201 gtccttccat caatccagtt aggaagtcag 
cagttccctc gtcatggaaa ttctcactgg 1261 actggtctat gtggtcccgc aagttgcgaa taatctgctc atggtcttcc accagtcgag 1321 tgaccatacc agttgctgtg ggaacggtac ctgcttcttc cttgagagta gcaatcttga 1381 gaaatccttc caaagtacca actggataac cacccaaagc acgaacccgc tcagctacag 1441 aatcgatagt ttcagtcagt gtttggtagt gctcttccca aagttggtgc agagagcgga 1501 attgaggtcc gaccacatcc cagtggaact ttttcgtttt tactaacagt acataggcat 1561 ctgccaaatc ttgattcaac agattaatca caccttgacg ctgttcgtcg gtcaaaccaa 1621 tgtttaactt gcgcatgact ttaaatccct tagctcacta ctttattaac cttcacaaat 1681 tcaaccccta ccatcatcaa cctaaggaaa gattgttttt tataactaaa tataaataga 1741 catctcaata tacaaatcga gatgtctacc caatggtgaa atcaaaaatt aactgctcaa 1801 aacccgagaa cattgagcaa caatttattt gttacaccca attttaaaaa ccacaccttg 1861 gtttcatgaa aaggcatgga aagtaagaaa ctttgcttac tttttacctg taattttttc 1921 caaaaagttc tcaacttgtt cttccgccga agttgcttgt tgagaattct caggacgatt 1981 cattttatta atatcagccg ctccctgaac ctcgttcagc ccctcgtttg ctctttctgt 2041 tgtctctttc aatccaagag gtggttttgc agtgacttcg tcagtttttt tctgagtttc 2101 gagtagctgc gcttcacctt tttcaggatt gctcctttgg gtaccgattg cagcagcggg 2161 aaaagcataa gagaaaatca gtaaggcaca agtgaataca acaatgagaa atcgcactgg 2221 acgcaacata gataaaccaa gaccaatatt cttcattgaa actcctccaa agtaatggtg 2281 atccaaaaga cacagatagt tttaaacgtt ggcaggctag gattctagac actgcctatc 2341 acaaagtttg gatagactat agtttacctg atttggtatc catcgagata cttttgagca 2401 gtggttgcaa agcaatctta attttcaaac agaagcctct acttcaattt agcccaacaa 2461 gagtgttaaa atttagctgt acttctatct ttcaaagctt gattgcggct tcaatttcaa 2521 ttttatcaag attcaaccta atcttttttc tactttgaat acacagaaag aggaatcgtc 2581 ctgtgggctg aaatcttttt tctgtactta actatatata atcaatctac agatagataa 2641 aatttgtaca attttggcag gttaagcaat aatggtttgg tgcattaact gtgaattatt 2701 tttaatatac taatcattct taacataata tatagcagtc ctatttgagt tgtaaaactg 2761 ctcgttatga ccgtccttag tactggattg cacaagttcg tagtgaatga cagcttcttt 2821 cattaagttt cacatggcgt ttttggggca cggcatgctg tgcccattaa ttcagattca 2881 ccttgaaagt accaacactg gcttgcaatt gctgggaaat 
ctccacagtt tcgtgtaaag 2941 attgggaaac ccgaatagaa gaatcgctcg tgcgctgtga gctagcagca atatctttca 3001 tcaattggct gactgtttgc gaagtttgcg cctgtgatgc tgtggcgagc gaaatcgact 3061 gcacgagcat gtctatctga tgtgaaatct ccaaaatctg agccagattt tgcttggcat 3121 cttcgacaat acgtgtacct tcgacaacag tcgtggttcc catctccatc gcttgcacaa 3181 ctgaggttgt ttcgcgttgg atgttctcga ctatttgttc aatttctgtg gtggcggcgg 3241 cgctgcgggc ggctaattcc ccaacttctt ctgccacaac cgcaaaacct tgaccttcct 3301 cacctgcacg tgctgcttcg attccagcgt tgatggcgag caagttggtt tgcatggaaa 3361 tctgattaat taatgccacc acacgcgaga tttgttgcgt tgattctccc aggcgcttga 3421 ctttcttggc ggtttcgcca actgtttcac gcaaatacaa aatgttttgc actgtcatat 3481 ccattgctat gccacttttt ttggcagttt gggcggcgct attggctatt gttgctgctt 3541 gatcggcgct atatgccact tgttgaatcg attggctcat gtgatcaact gcatcgagtg 3601 tgcggttgat ttcttctgct tgaatcagtg cttcttctgc caattcaccg attgctccag 3661 agtttgtcca gatcgcctga ttcacctgag tagcggtttg ttgaacttga gtgacaatta 3721 agcgcaaact ttcaacgata gagttaaaaa agtcagcaac tgtgccaatt tctcccgctg 3781 tcacttcagc acgcaccgtc aagtcaccac tggctgcacc ttctacttca ctaaggagtt 3841 ccaaaagttg catttgcagg gtttcttttt gttggcgttc gtcttctgat actctttctg 3901 ctactcgccg tgctttttcc acctcttcaa gaagttgtgc ttgctctaga gcatagccaa 3961 cttgagtggc gagttgtcta aacaaatcta tctcagtttg ttgccaattg cggggactgt 4021 cgcaatgatg agcaatcatc aagcctacca attgccctgc aataataatt ggtgctacca 4081 aatttgcctt gactgcaaat ttctccaaca ttttgatgta acaagctgca ttgctcagac 4141 tagaatcttg atagatgttg gatattgccc gcactcgacc atctttataa ctgtctacat 4201 gacgttctcg aaaacaggga tcgtcgatgt gcacgcctaa cattcttggt aaaccagccg 4261 tgactgattc agcgacgaca tttccatcca aggtttctaa gttgaattta tagatgacaa 4321 ccctgtctgt cttgattgct tgacgaactt ctctgacagc tgttttgtaa atgtcctcaa 4381 ctttcaggct tctacgagta cgcaaggtaa tctctgtaag taactttgcc ctttcagtgt 4441 cgagttcttg gtctttcatt agcgcttgga gttgcgacgc catttggttg atgttatgac 4501 tcaaaatccc aagctcgtct tctgattcca cctccaaacg ggtatcgagt tctcctaatc 4561 caagttttgc cacggctgcg gttgctttcg caattggttc tgtggttcgt 
ttcgctaacc 4621 acacggcaat caatgcgaca attaatgctg tcaatcctgt cccgattgca actgttagta 4681 ataattgtct ttgaggctca aatgcagttg ctgtatctgt cgctagaata aatcgccact 4741 tcaaattggg caatccatct cgcgttctgg atgagtagct gacaagttgc tgttggttat 4801 cagtttgatt aacagtgata aaagtatcgt cttttttttg tgcttgccgt tgtgctaaac 4861 taggaaaaat tgctatggca cttctactta gatctttctt atccgttgcc aagaagaatt 4921 tttccgaacc atcagtcaga tagtatttac gtcctgattc tacagaattt ttgatgacat 4981 tatctagagc tttcacggct attctagctc tgactattgc gatcgtttct cctgtggtac 5041 gttttacagg tgctgcgata ataacaccca aatttcttgt attttttgtg acttctggtt 5101 gactgatgta aaaacggtcg ttctgccgta ctgctttaaa gtcctgacta ttgctttgat 5161 tggtaattga atctccctga gattgtgcaa tcacatcacc atttaaatta aatacagcaa 5221 tgctgtcata gactttgtaa gtatctacaa acttatccag caattcttgt ttttcttttt 5281 gagaaaaagt gttgctaaat ttagaatctg taaatattgg tagactagat ataagttgaa 5341 tattgttata ccgatctatc atgaaacggt tgactttgtc tgccagaccc ttggcttctg 5401 cttgttgaat ttgagaaatt ttattgctta aggaattact ggcaactaag taggcaagcg 5461 ctccaattgc taagactgga agcgttccga tagcgatcgc cactaatgtc gctttggtac 5521 ctaaacttaa ccgcttcaaa gttaaagatg cacgattttg tgtatactta cgacgtattt 5581 cagtatttgc ctcgttaggt cgttttatcc cattactctc catagtgaaa tcttccgagt 5641 cagaaacaga tgtttcattt tgactatcaa tattcttagc tgcatcagtt ttattaaaca 5701 taaagatatt ctcctgaggt ggtgaatgaa cataggaaaa atcagtgttt tctttggagt 5761 acaccctaat cactgcggag aatagaagac tgtacaattg cttgtgcatc taaaacaagc 5821 aatatttctt tttgttgaac aacgcaacca cgtagatagg gggttaaact ggatgcaact 5881 tgtcccatgg gagagcgaat ttcatcaggc aaaaatttag ttgtaccttg tatctcttgt 5941 acaactaaac ctaaaagcac tgaatctacc cggataatta tcacattata ctgccgttgc 6001 ctattatcta gatattcaag attgaacatc cttggcaaat cgatcaccca aactatgcga 6061 ctgcgccagt ttatcaatcc tatgatacag gcaggcatat ttggcatcga tgagatggtt 6121 tcaggtggca caacaaccgc ttcttgcgtg tgcctcatgg gtagggtggc agatgtatgt 6181 ctatttagcc gaaacttaag atagccatcc cccaagttat tttgattttg ttttaaagat 6241 actagtttgg aactattcat ttgattcaaa tgactactga tttaatagct cgtaggagct 
6301 gctcccgagt gtaaggcttt gtgacataag catcagcgcc ttgtctcatt gcccacaatc 6361 ggtcaatctc ctgattttta gaactgcaaa taacaatcgg cactttttga gtcaccggat 6421 ttctcttaag agaacgacac aactcaaaac cgctcattcc tggcataacc acatcggtga 6481 cgataacgtc tgggttttgt aattgcgctt tttctaaagc ttcctttgca cccgtggctt 6541 gtataacgtt ataaccactt tctacgagat aatggctcat cagttctaat tcactgagag 6601 aatcttcgac aatcaaaatt gtaccaagca aagtaatact cacttctaca tctcctatct 6661 cactacatcg actttcaaaa aacgttgtga ctgttctttt gtgttgacct caattatcaa 6721 tatgtttaaa caccattttc aacaaatctg attgagaaaa aggtttagtc aaatagcctg 6781 acgctctgac catttttgct ttggctctgt ctatcaatcc tgttctacca gtcaccatga 6841 ttatagggat attcttaaaa gctgaatgcc tccgcaacaa agaacatagc tcatagccat 6901 ctaaatttgg catttctaca tctagtaaaa ttaagtcagg tttgctgcgg agaatttgca 6961 tcaaagcttt caccggatcg ctgatcatca caacggaaaa tgtactttca tccaaaaagt 7021 gcttgataga atttaagact gttggactgt catctataca ggcaactgtg tatgcacttt 7081 tctcacgaga ttggtgagaa tctttgttgc tgttaagatg agagatatca aaattattct 7141 gttttggcaa ttcttcccca ggcgtaactt gatgccgcgt tgccgaaacc tcagtggtag 7201 acacaaaatg agaggtaacc tgtgtatcaa cggttgattg cgtctgttgg cgatggcgta 7261 actgcttttg acaattttct acaagtaatc gcaagtccaa acgacaaaac ttcggtagtt 7321 catttaacgg agtttcactc ttaaattcat agcttccttg ttttatgatt agaaatgact 7381 ccagtacttc tcttgccaat tcatctatca ggactgctgc ttgaggagga gtgatgtatt 7441 cttcgttgac taaccagcaa atagcctgat aatctggttg cggtgttgac tgatgatctt 7501 tgttttggtc aaatatcagt ctgacacgca taagagatgt actgttgaga gtgtgtgttt 7561 gctggctcaa acgccccaaa agattttcaa gccgttcaaa cattttatct gaagaggcat 7621 aaactagttt accgtcttct agatagattg accaagaagc cgactctgta aatacttgta 7681 agcagcctgt agcacgccga cttgttaatt gtgctagtag agatagcgga tgtagtttct 7741 gaaaaaactt gtagctacct ataggagttg tgctcatttt gcttttactc aggctaaatt 7801 cttgctagtg tgagctaaat ttatgcaggg ttttccacaa gtccgttgaa acggttcata 7861 gcttttgtta tgaaaagtac aaaagtcttt ccttaagatt tgtgagaaaa ctcagcggca 7921 gcacccttct attaagatag aaaaaacctc agataactcg caaattgcag tgaaatttac 7981 
taacttatac tgagattagc agacatcttt acacaacaag taaaaagaat gtaaagacta 8041 gatgaagcaa atcacctcaa agtgagcctt gtctaatggc aaccgagata atactgcttt 8101 ttattgccac aggatccctg ctatctttag tactcgtact taagtataat acggtttttg 8161 gaaccgagat aaatctcagg taaattttgc taaaacaaca gtgttataac tttaaaattg 8221 tataaaatta acatcaagta tccgggtaac agaatagtgc caatgtctgt tatatatccc 8281 aaaagtgaaa tctgccaatc aaattattaa tacaattttg aaatttatag agttttaatt 8341 gatacgaaat caaacttaac actaagaatg agaaaagaat tgtaatgact gttgcatata 8401 gtgtgttgta tcaagtgtca tcgtagccat ttttacttat gaagattttt tgacaggaca 8461 aatagataca ttttagacga gtgcttttta aagaaaaatt atagtattca aacaaagagc 8521 taaaaaaaac ttctgccacc ttttgtttct attacatatt tttatacttt tataaattaa 8581 tgattctcag ttagccgtct acacatttcc ctctttttgg acaaattcct tgtgtatgga 8641 ggcaaatact gtatgtatct cctccacaag ataccgtggt ttggcttacg ttcgccaagc 8701 cttaaaacga caaaaccatg cccctgatca attttgaatt gatcaggggc gttggggaat 8761 aaacagtcac acgttacaca ggtgacaaaa aagacctgcg aacataacca cttaattact 8821 attctttatt gacgtactcc cctgactaaa gtctagggga ttcttaaagg atgttgacat 8881 acaggctaaa gccatgtatt ttacactctt attcctgttt catggactct taactagact 8941 gtgaccagtc cccgtcagac cgtctccgca ggcaattttg ctttacatct acgacgcgtc 9001 ccaccgcaga taaggtacag tccgaagact acgtttctgc tctcggagtc cagttatgga 9061 tatcctgcat gggtcgtcac ccagtacctt cagaaattat aacagagcta gtccttagac 9121 ggctaaagcc tagacaaccc ttccctagta ttgacaaccc tagacgtctt taattaactc 9181 atcaacgaat tctctaagcc gtgcttgcca gtcagggacg gcttttagct tatctttcac 9241 acctgggagt accttgaaac aaactggatc tttgtcaaag ggcttgtcac tagtgaaacc 9301 atagttgttc tttttctgga atctcattga cgcaacatgt agtagtgagt atactgacta 9361 tatcacgagt attgaggtga cgcattggga caaagaaatg agacaagcct ttgtctgtta 9421 atttgtcagc acccacaagt attcgttgta ataaggcttg tctcatttct ccagtcattc 9481 ccccggtgaa caatgctttt atccattaag acaaagttaa agctgacagc cgaacaaaaa 9541 accatcatga gtaagcatgc agggattgca cgttttacat acaactgggg tttggctacc 9601 tggaatagtt ttgtcaaaga tggattaaag cctaacaagt tcattctgaa aaagttcttt 9661 aacaatcacg 
tcaaacctga gtacggctgg atcaaagaaa aagggatttg tcagaagatc 9721 acccagtatg cttttgacaa cctgggtgac gctttctctc ggtttttctc aaagaaaggt 9781 ggctatccca ggtttaagaa gaaagggcat catgactctt tcactattga tgcaagtggt 9841 aaacctatac ctgttggcgg cacatcaata aagttaccga ctattgggtg ggttaaaacc 9901 tacgaaggac tgccgcatac tacttgtaag tcaatcacaa tatctcgtac tgctgatagt 9961 tggtttatcg ctttcgctta cgaacaagag cacgagccaa ctgttaaaca acacgagatt 10021 gtaggcgttg acgtcggtgt gaaggaacta gctacactct ccactggcgt agtgtttccc 10081 aaccctaaac actacaagac ccacttagaa aagcttcgcc gattatcccg aaagtttact 10141 aacaaacaga aaggttctaa taatcgacat aaagcaaaaa tccaactggc taaacatcac 10201 gcaaaggtag ccaatctccg aaagaacact cttcaccaag tcactacttt cttatgcaag 10261 aaccacgcaa aaatagtagt agaagatttg aatgtttcgg gaatgctatc taaccacaag 10321 ttggctcaag tcatcgctga ttgtggattc catgagttta agcgccagtt ggaatataaa 10381 gctaaaaagt ttggttgtga aatcatcatc gctgaccgtt ggtttgcatc aagtaagacc 10441 tgttcctgtt gtggcgtcaa gaaagaaaca ctcaccctag gtgaacgagt ttttgaatgc 10501 gaacattgcg gtcatgtgat ggacagagat ttaaacgcag ccgttaatct atcgcgtttg 10561 gctaaagcgt gaaagcttac cgagggataa ccgctcccat gctccccgtg acgtaagaag 10621 taaatgtcta gacttgtcta gggtttatat agcagaaata acaggttgaa aacctgttgt 10681 gtcgggtttc atctcctcct taaaaggtag gagttctcac cctcccactt gcaacccgat 10741 agtgcggatg aaaggacttg aaccttcacg ccttgcggca ctagaaccta aatctagcgc 10801 gtctaccaat tccgccacat ccgcattagt agatgattat atcataacag aattatgtct 10861 tgcaatatct aacggttagt tatagggatt aaaccccgtc cttaaggacg gggtctttcc 10921 ttgttattag tggttcgtcg ttagttgttt gagtacctac ttagcactag ccacgaacca 10981 ataatcagct ttgactaagc caccgcaaca cgaggtaaac ggtgcttaaa cccgcagaga 11041 atttcccaag aaattgtatt taattgattt gcccaatctt cagcagtaat ttgttctttt 11101 ccctctgttc caagtagggt gacgacttcg ccttcttgca catctggtaa tgcactcaca 11161 tctagcatca actgatccat tgtaattgtg ccaatttgcg gcacccgttg accgcgaatc 11221 aacacttgca ttttgttgga aagattgcga ggaactccat ctgcgtaacc aattcctatc 11281 acagcaaggc ggagttcatc tgaggctata aattgatgac tgtagctgac gccagttcca 11341 
ggaggaatgg tttttacttg cgtgactcgt gctttgactt gtaaaacggg tttcaagtct 11401 atacttaagc ataaatgatc agctgggtag agtccgtata cagctaaacc cacacgcaca 11461 atatcgtagt gcaatgcttt gtcgctcaaa gtagcagctg aatttgctag atgtagacaa 11521 ggcggttcta tccctaatgt tcttagttga gcaatgactt gctcaaatcg ttgttgctgt 11581 tgtttcatcg ttgtaggatc aagactatct gccgttgcca agtgagaata tatactagcg 11641 atcgtcagat caggtaaccg ctgcacaagt tgaacaaact cagcagcctc ctgccaattt 11701 gttcctaacc tggacattcc cgtgtctaat ttaacgtgta cagggagtgg agtcttataa 11761 ttgatggctt ctagagtatt ggaaaagact aaagcttgtt tgggagatat tagtgttggt 11821 tgaagtttcc attgagcgat cgcatgaatc tgctctcggg tatgagttgc acctagtatc 11881 aagattggag ctttaattcc cgcttttcgc aattgaattc cttctggaac tgtagcgact 11941 cctaaccaac tggctcctga gtctaacaca gtttgggcga ctgtgactgc tccatgtcca 12001 taagcatcgg cttttaccac cgccatcagt tgggtctgcg gtgacagtat gcttaaaagc 12061 tgcttgacgt tgtacgacaa tgctgacaaa ttaatttcca cccacgcacg ttgggacaac 12121 caagggtaag tatcccactg ctgatcagag gttacactgc tctttggctc atgactcaac 12181 atcttgaatg actcctatca caccactgtt tggcttctaa atcatttaga ttattactca 12241 ctgctataag cctaatcatg atcaattgat tttggaattg gcacaagatc atgaacagag 12301 agtgcgggaa gaagggagaa aaatttttag ctcttgaccc ttacgggttc gccagttgac 12361 gccaggtgct ccacttgggg agcacctgcg ttgggcggct gtgccgactt gtagcatgtg 12421 gcgtgagacc ccaagaccgc actggctcct // LOCUS NODE_2733_length_12397_cov_5.259925 12397 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12397) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12397) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12397 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 326..631 /locus_tag="DP116_21860" CDS 326..631 /locus_tag="DP116_21860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878167.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4090 domain-containing protein" /protein_id="PRJNA477356:DP116_21860" /translation="MTAEINQQSQTTKGADAVDEAIASGIDFDGSPIPPAKLELYHKV MGLEGNRQRSGVSNTMRSRIVRIGAKHIPQEELNQQLTDAGFAALKEKEIAFFYGGK" gene complement(678..1268) /locus_tag="DP116_21865" CDS complement(678..1268) /locus_tag="DP116_21865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132231.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="septum formation inhibitor Maf" /protein_id="PRJNA477356:DP116_21865" /translation="MGTPPFVLASASVARRRLLQIAGIEPLVCPSDFDESQVQIDEPG LLVKTLAQRKAETVAPQFQSALIMGCDSVLAVNGEIHGKPANPEEARQRWQKMQGHFG DLYTGHAFIDLPQNRTLVKCQVTRVYFAQLSSDTIDAYVATGEPLKCAGAFALEGRGG LFVEKLEGCHSNVIGLSLPLLRQMLADLGYSVTDFW" gene complement(1315..1857) /locus_tag="DP116_21870" CDS complement(1315..1857) /locus_tag="DP116_21870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458976.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II oxygen evolving complex protein PsbP" /protein_id="PRJNA477356:DP116_21870" /translation="MWKKIILILLLVFAFSLSHSNVAVAAERNYVDTTDGYEFSYPNG WVQVKVANGPDVVFHDIIEISENISVVISPVPEGKTLKELGTPTEVGYKLGKNALAPE GSGRKAELVDAEERESNGITYYLLEYAIALPNNQQRHNLASVAVSRGKVFTINASIPE RRWGRIKKLIEESVNSFSVY" gene 1927..2550 /locus_tag="DP116_21875" CDS 1927..2550 /locus_tag="DP116_21875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320866.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21875" /translation="MLTTYELRWFSPGMIPENIETWFKQNCLIDPIQPPEEREDVYLY SPKSDFLGIKLRQGRLEVKWRKAELGAVRFGDFVEGKAEKWGKWLCSDATEESFQPNL VLSNSSWVSVQKVRHSQLYQVLPEFPPQPVSVDEHIENGCTVELTHLIIQGNAWWSLA FEASGEDACLMENLQATASWVFDTYRELKLLAINSYAFPSWLALVHQ" gene 2910..3953 /locus_tag="DP116_21880" CDS 2910..3953 /locus_tag="DP116_21880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2278 domain-containing protein" /protein_id="PRJNA477356:DP116_21880" /translation="MNLSEGYGVLKCRAIAGQKELGQGTPHYQVHVKDDQFSYRLAIN VRSSQQPFDLLYFVDDNFEHFITDKLDQLDFGFKKLENSDRQPGKIALDYIRGNLFKV NQMKPLPFNLPGENNDLNELIDSYIQRAIETQAVVYVFGEPWGPENKPDKVFGFQPGR GVHNLHMNQGNSGRFAQENGVYQDGALLIHYPSALSSGDYWVAVFFAFQSQSFHTDDS TGNPIDKIVGSEPVEPGTSAVKARVRIIGALVNPRGEDSGKESVTLINSSPQKIDLNG WAIADKLKRKQPLDGFSLEPGGVITVPLSPEKIQLSNDGGILSVLDSEGTKVDGVSYT KEDAKEQGWTLVF" gene 4201..4866 /locus_tag="DP116_21885" CDS 4201..4866 /locus_tag="DP116_21885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873602.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ComF family protein" /protein_id="PRJNA477356:DP116_21885" /translation="MQPLIKNLESLLNLFLKSNCPLCQRPTSQEFCQDCTRQLQKCHL SDPNLKQKSMPVFAWGTYGGILKSAITTMKYAKQPQIARPLGRWLGEAWLLHSPVYNK VVVVPIPLHPDRQKNRGYNQAALIAQSFCETTGLKLKQNGLARVRSTEAQYSLSASKR AKNLAEAFEIGKDLRRHPEVSVLLVDDIYTTGATAMSAMQTLNQAGIKVVGLAASAVT LKG" gene 4991..5452 /locus_tag="DP116_21890" CDS 4991..5452 /locus_tag="DP116_21890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459012.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase" /protein_id="PRJNA477356:DP116_21890" /translation="MINKGLAVGLKQVTIIPATYLAIAISTTPAFAQANKFYNPISIP VGNEITDTLSDKDIPTGQGGFARDYMVKLKKGDNLAIDLVSESFDCMLTLLAPNGTTV AENDDGPDGTSNSLLFTRVAETGNYVIRVRSFGETGIGAFKLKVTRLQPAK" gene complement(5512..6294) /locus_tag="DP116_21895" CDS complement(5512..6294) /locus_tag="DP116_21895" /EC_number="2.7.8.26" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318152.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenosylcobinamide-GDP ribazoletransferase" /protein_id="PRJNA477356:DP116_21895" /translation="MTNKYQWWKQLLLKLYASVTFYTSIPLPYVNELNFTRVSQLAPQ VGLIIGAILGLFDTGMIFLGVPILTRSVLVVCLWIFLTGGLHLDGAMDTADGLAVGDP KRRLEVMADSATGAFGAMAAIALILMKTAALTDLSKNRWLTLMAACGWGRWGQQLAIF RYPYLKPTGKGVFHKQAICSYKDLLPGLLLMIGLCGLQIVFDKQRLFFVLGTILAGSA IATLTGAWFNHKLGGHTGDTYGAVVEWTEALFLCVLTILEKH" gene 6683..7135 /locus_tag="DP116_21900" CDS 6683..7135 /locus_tag="DP116_21900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318153.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21900" /translation="MTHHLPNAKIPAPCIINTGVVVNKLDMRRLLSDLGRVRYIYTYQ GQLQSEGEGDVMEVFANPQRSTLIANHALYLNISSFDYLELKQSKEKETYFDLVQEGV CLRLIPLSTPFQERQQRCLNINDLEVMMEQVLSARWDAEFDDDNSDLL" gene 7452..8597 /locus_tag="DP116_21905" CDS 7452..8597 /locus_tag="DP116_21905" /EC_number="2.4.2.29" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114516.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA guanosine(34) transglycosylase Tgt" /protein_id="PRJNA477356:DP116_21905" /translation="MSAKFSFQHLANCSQTKARAGIFFTPHGPVETPRFMPVGTLANV KTLTPAQLQDTGAQMVLCNTYHLHLQPGEAIVAGGGGLHKFMGWNGPILTDSGGFQVF SLSEMRKISEEGVTFRSPHDGQIINLTPERSIEIQNILGADVIMAFDECPPYPASRED VEAATQRTYRWLERCIVAHQRSDQALFGIVQGGVYLDLRSDAAIALRKLDLPGYAIGG VSVGEPPELIAKIVQATAPLLPPEKPRYLMGVGTYREIAIAIASGVDLFDCVIPTRWA RHGTAMVQGERWNLKNAKFREDFTPLDETCPCYTCQNFSRAYLSHLVRSQEILAYTLL SIHNITELIRFTQRIRKAILEDRFSTEFAQWLTPESGECSVVDNSPA" gene complement(8617..9777) /locus_tag="DP116_21910" CDS complement(8617..9777) /locus_tag="DP116_21910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_21910" /translation="MLKAYKYRIYPTNEQAILLAKSFGCVRWFYNYALNLTSETYKAT GKGLSRNDTINLLPCLKKQYEWLTEPPSQCLQQVALDLSSAFLNFFEKRAKYPSFKKK GQKQSIRFPQGIKLDGDYLTLPKLGKVYCKVSRLPEGKLKSVTVSLTSSGEYYAACLY DDGKNKLVSSSQGKAIGVDMGIKHYAITSDGTKHGNPKYYRKYEVKLSKKQKQFSRKQ KGSNNRNKARRKVAIVHAKITRCREDFLHKLSRKLVDENQVIVVENLAVRNMVKNSKL AKSISDAGWGQFCTMLNYKAEWEGKIYIEVDRFFPSSKTCSNCLHQVDNLSLDIRSWQ CPKCQTTHDRDINAAINIRDEGLRLLAGGHLATASGQRVRPSKGTAFRGNVG" gene 9957..10094 /locus_tag="DP116_21915" CDS 9957..10094 /locus_tag="DP116_21915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein K" /protein_id="PRJNA477356:DP116_21915" /translation="MEAALLLAKLPEAYQIFDPLVDVLPLIPLFFLLLAFVWQAAVGF R" gene 10407..10703 /locus_tag="DP116_21920" CDS 10407..10703 /locus_tag="DP116_21920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874262.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="iron ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_21920" /translation="MGNITFVKENKEVVAANGANLRLKAMQNNVDIYKVFGKMMNCGG NGQCGCCVVEVVEGMENLSPRTDTENRLLKKKPANCRLACQTLVNGPVSVVTKP" gene 10799..10903 /gene="psbM" /locus_tag="DP116_21925" CDS 10799..10903 /gene="psbM" /locus_tag="DP116_21925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein M" /protein_id="PRJNA477356:DP116_21925" /translation="MQVNDLGFVASILFVLVPAVFLIVLYIQTASREG" gene complement(11001..11852) /locus_tag="DP116_21930" CDS complement(11001..11852) /locus_tag="DP116_21930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136745.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="universal stress protein" /protein_id="PRJNA477356:DP116_21930" /translation="MIEKILLAVSGLGHAEEMLKTLADVPSIQRSKVTVLHVVSDRSS AEAMTTKWEEGGKILANAIQYLNLDPSKVSSILRQGDPKRVVCEVADEIDADLIIMGS RGLKRLQSILSNSVSQYVFQLSSRPMLLVKDDIYVKKVKRVMVAIDNSDAAKYCLSLA LFLLRDIKGGELILANINTDLRGKESETTEVSPEQNPVLAAAYAEAKKYGVPARCVIS SGKPGEEICRLADERNIDLLLLGSPDRRPSIAKNLVDIDRLLGASLSDYVRVNATCPV LLARTPG" BASE COUNT 3690 a 2541 c 2597 g 3569 t ORIGIN 1 gggatagggg atataagagc gtgggggctt gtatccctca gtttcgaggc aggtgtacaa 61 aaatacaaat ccttcttacc cccatacccc tacaccctta gcttcgatca aaaataactg 121 attttttgac ctaatcctcc atatctctag ctttttgctt accttcactt tcttgtgttc 181 acaatgcctg gtaaaattct ctagtaaggt gattcaaaca gtgaacagta aaaactgata 241 actgttaact gataactgtt tagacccaac gggacacgac actgataact gctaactgtt 301 aactggtaac tgatataaaa ctgccatgac tgctgaaatt aaccaacaaa gccaaacaac 361 caaaggtgct gatgcagttg atgaagcaat tgcaagcgga atagattttg atggttcccc 421 tattcccccc gctaaactgg aactttatca caaagtcatg gggctagaag gaaacagaca 481 
gcgcagtggt gttagcaaca cgatgcgatc gcgcattgtg cgaattggtg caaaacacat 541 tccccaagaa gaactcaacc aacaactgac agatgctggt tttgcggcgt taaaagaaaa 601 agaaattgcc ttcttctacg gtggaaagtg atgtagaaag caagaagaaa gaagaaaatt 661 tcttctttct ttacgattca ccaaaaatcc gtgacactgt atcctaaatc cgccagcatt 721 tgccttagca atggtaaact tagcccaatc acgttgctat ggcaaccttc aagtttttcc 781 acaaacaaac caccacgacc ttcaagagca aaagcaccag cacattttaa aggttcacct 841 gtggcaacat atgcgtcaat cgtgtcactg ctcagttgag caaagtaaac tcttgttact 901 tggcacttga ctaaagtacg gttttgaggt aagtcaatga aggcatgtcc cgtgtaaaga 961 tcaccaaaat gaccttgcat tttttgccag cgttgacgcg cttcttcggg atttgctggt 1021 ttaccatgaa tttcgccgtt gacagccaaa acggaatcgc aacccataat caaagccgat 1081 tgaaactgag gggctacagt ttctgctttt cgttgggcta gagtttttac cagtagccca 1141 ggttcatcta tttggacttg cgactcatca aaatcactgg gacaaaccaa aggttcaatc 1201 ccagctattt gtagtaagcg gcgtcgcgct acagaagcag aggcgagtac gaagggtgga 1261 gtacccatag gaggagtgag ggggatgagg gagataagga aaagaaaata ataactagta 1321 aacagagaaa gaattcacgg attcttcaat taattttttt attctgcccc agcgtctttc 1381 aggaattgag gcgttgattg taaaaacttt gccacggcta acagcaacac tagcaaggtt 1441 atggcgttgt tggttgttag gtagtgcgat cgcatactct aaaaggtagt atgttattcc 1501 gttggattcc cgttcttcgg catcgactaa ttcagctttg cgaccggaac cttcaggagc 1561 aagagcgttt tttcccaact tatatcctac ctctgttggt gtccctaatt ctttgagagt 1621 tttgccttct ggaactggac taatcactac agagatattt tcactaatct caatgatgtc 1681 gtgaaacacg acatctggac cgttggcaac tttaacttgc acccagccgt taggatatga 1741 aaactcatag ccatcagtgg tgtctacgta attacgttca gccgcaaccg ctacattaga 1801 atgactcaag ctgaaagcaa acaccaagag caaaatcaat ataatttttt tccacatttc 1861 tacgatttcc tttaggctgg aagcaacaca aggcaaggtt atatttcgtt tttcattgtc 1921 ccatgtatgc tgacaactta cgaactgcgc tggttttctc ctggcatgat accggaaaat 1981 attgaaactt ggtttaaaca gaattgtctg atagacccaa tacaaccacc agaagaacgc 2041 gaagatgtat atctttactc accaaagtct gatttcttgg gaataaaact acgccaagga 2101 cggctggaag tcaagtggcg caaagcagaa ttaggcgctg tgcgttttgg cgattttgta 2161 gaaggtaaag 
cagagaaatg gggtaagtgg ctgtgttctg atgctacaga agaaagtttt 2221 caaccaaacc tagttctcag caattcttca tgggtaagtg tccagaaagt tcgccattca 2281 caactttacc aagttttacc tgaatttcca ccacaacctg tttccgtaga tgagcatatc 2341 gaaaatggtt gtacagtaga actgacccat ctgataattc aagggaatgc ttggtggagt 2401 cttgcatttg aagcttctgg tgaggatgct tgtttgatgg agaaccttca agcaacagca 2461 agttgggtat ttgatactta tcgcgagtta aagttactgg ctattaattc ttacgcattc 2521 cctagttggt tagcactcgt tcatcaataa gcgtcaaaaa acaagcaaaa atcacccaca 2581 agggtactgc tgtgctcaat aacattgagc tatcatttcg gagaatgcgg tagtttcaag 2641 gaaaagtgtg taactacaat gacgcaaatc aattaaaaag cttggcaata acgagaaact 2701 cagggtaaaa tttgagactc ttaagtaggg tagcactcac caatcatgaa tcctaaaagt 2761 agtagtcatt tataatgagt agctgctcaa agaatgaatg acctttgcct acgaaaagct 2821 atatccagct gggagttcgt gactctctgg tgggcacact tgaattgcga gatgaatata 2881 cagtcaattt catcaaaagg aataagttta tgaatttgag cgaaggttat ggtgtattga 2941 agtgtcgtgc gatcgcagga caaaaggaac ttggtcaagg tactccccac tatcaagttc 3001 atgtgaagga cgatcaattt tcgtatcgtc ttgcaatcaa tgttaggtca agtcagcaac 3061 ccttcgattt gttgtatttt gtcgatgata attttgaaca ttttatcacc gacaaactcg 3121 atcaattgga ctttggcttt aaaaagcttg agaattccga tagacaacct ggaaaaattg 3181 cgctggatta catccgaggc aatttattta aagtcaatca gatgaaacca ttgcccttta 3241 atttacccgg agaaaacaac gatctcaacg agttgattga ctcgtacatt caacgggcaa 3301 ttgaaactca agcagttgtg tatgtcttcg gtgaaccttg gggaccagaa aataagcctg 3361 acaaagtgtt tggcttccaa ccaggaagag gcgttcataa tcttcacatg aaccaaggta 3421 acagcggtag gtttgcccag gaaaatgggg tttaccaaga cggggcgtta ttaattcatt 3481 acccatctgc actctcgtct ggtgactact gggttgcagt attctttgca tttcagtcgc 3541 agtccttcca tactgatgac agcactggta accccataga taagattgta ggaagtgaac 3601 cagtagaacc aggaacttca gcagtgaaag cgcgggtgag aatcattggt gctttagtca 3661 atcctcgtgg ggaagattct ggcaaagaat cggttacttt aataaactct tcaccacaaa 3721 agatagactt gaatggttgg gcaattgctg ataaactcaa gcgcaagcaa cctcttgatg 3781 ggtttagcct tgagcctggt ggagtgatta cagtaccttt aagtccagaa aagattcagc 3841 tttctaatga tggtggaatt 
ctctctgttc ttgattcaga aggcacaaaa gttgacggtg 3901 tttcctacac caaagaagat gctaaagagc aagggtggac tctagttttt tgatgctgga 3961 acactgatgt tgaatccgtg aatttaagcg taagcgagtg tcttaacttg aattttattc 4021 ctctccatac tcccgtttgg gaaattgacg catgatgaat tcaaagtatg cccttttagt 4081 gcctagggca tactttgagt ttttggcttc ataccaagga actgaaaatt gcaagcgtgt 4141 gcgtaggaca tatatttgag tcgggaaggt catagtcgaa ttgaagaagg acagttattg 4201 atgcaacctt tgattaaaaa tctcgaaagc ttattaaacc tttttctcaa atctaattgt 4261 cctctgtgtc aacgtccaac ctcccaagaa ttctgtcaag actgcaccag acagttacaa 4321 aaatgccatc tcagcgatcc taatcttaag caaaaatcaa tgcccgtatt tgcatggggg 4381 acatatggcg ggatactgaa gtcagcaatc actacaatga aatacgcaaa gcaaccccaa 4441 attgcccgtc ctttaggtcg atggttggga gaagcatggc tattacattc gcccgtgtac 4501 aataaagttg tggttgttcc tatcccactc caccccgata gacaaaaaaa ccggggatat 4561 aaccaagctg cattgatagc acaaagtttc tgcgaaacaa ctggattaaa attgaaacaa 4621 aatggtttgg caagggtgcg atcaactgaa gcacaatatt ctttatcagc gtccaagcga 4681 gcaaaaaatt tagctgaagc ctttgaaatc gggaaagatt tgcgtcgcca cccagaagta 4741 tcggtgcttt tggtcgatga catttatacg acgggtgcga ctgctatgtc tgctatgcaa 4801 acactcaatc aagctggaat aaaagttgtt gggttagcag caagtgctgt taccttgaaa 4861 gggtaaaaat taatttgatt ttattttgaa tcttgaatta ttatcccaca cctgtgagta 4921 aataggcaaa ctctgtctaa gaaaactata attaaccaga tgattctgtt tttgtgggaa 4981 taatctttgg atgataaata agggtctggc tgtgggattg aaacaagtta caatcattcc 5041 tgccacgtat ctcgcgatcg cgataagtac aacaccagct tttgctcaag ctaataagtt 5101 ttataatcca atttctatac ctgttggtaa tgaaattacc gacacactct ctgacaaaga 5161 cattcccaca ggtcagggcg gatttgcccg tgattatatg gtgaagttga agaaggggga 5221 taatttagca attgacctgg tgtctgagag ttttgactgt atgctcacac tgctggcacc 5281 caatggaaca acagtggcgg aaaatgatga tggtcccgat ggtaccagca attccctact 5341 ttttacccgc gtcgcggaaa ctggcaatta tgtcattcgc gtccggtctt ttggagaaac 5401 tggaattgga gcttttaaac ttaaggtaac acgactacag ccagctaaat gaaattatga 5461 attgcaaatt attgcaaata atcaatgaca aaacattttg agaatttaca attaatgctt 5521 ttccaaaata gttagtacac aaagaaataa 
agcttccgtc cactcaacaa ctgcgccgta 5581 agtatcacct gtatgtccac ctaatttatg gttaaaccag gcaccagtga gagtggcgat 5641 cgcacttcca gcaagtatcg ttcccaacac gaaaaacaaa cgttgtttat cgaagactat 5701 ttgtaaacca cacaaaccta tcatcaataa cagtcccggt aataaatctt tataagaaca 5761 aatcgcttgt ttgtgaaata cgcctttacc agttggtttt aggtaaggat aacggaaaat 5821 tgccagttgt tgaccccagc gcccccaccc acaagcagcc atcaaagtta accaacggtt 5881 tttacttaaa tcggttaatg cagctgtttt cattaatatc aaggcaattg ctgccatagc 5941 cccaaacgca cctgtagcac tatctgccat cacctctagt cgtcgctttg gatcacccac 6001 tgctaaacca tccgcagtat ccattgctcc atctaagtgc agtccaccag tcaaaaaaat 6061 ccaaagacac accaccaaaa cgctgcgagt cagtatcggt acacccaaaa aaatcatgcc 6121 agtgtcaaat agccctaaaa ttgccccaat gattaaccct acttgcggag cgagctgcga 6181 caccctcgtg aagtttaact cgtttacata cggcaaagga atgctagtat agaatgtaac 6241 agaagcatac agctttaaca gtagttgttt ccaccactgg tacttgttag tcatgtttat 6301 cacattaatg aacagtttct gtactttatt cttaaaacat tacactgtta atttcggtac 6361 atatatgggc atgaattttt aaaggtttag ataagattta cactttatag aaactcatcc 6421 gaaaaatttt gttattcttg tcccagtgtt aagatttcaa cactttgtat tttgtataaa 6481 gttaacagtt ttctattggg agttaagcag taacaattta gttcactagc agtttagtag 6541 tgctaataat tagaaattcg ctatttacat ttttagttat tcgctcttac attgcccata 6601 tttgattaaa atgtaggggg taaagaagct gttgctgctt ggctgcgtta catctcacct 6661 gcgctcttga agttgtttga ttatgactca tcatctacct aatgcgaaga taccagctcc 6721 gtgcattatc aacacaggcg ttgttgtgaa taagcttgat atgcggcgat tgctgtcgga 6781 tttaggtcgt gtccgctaca tctacactta ccaaggtcag ttgcagagcg agggtgaggg 6841 ggatgttatg gaagtgtttg ccaatccaca acgctctacc ctaatcgcaa atcatgctct 6901 ttacttaaat atttctagtt ttgattactt ggaactgaaa cagtcaaaag aaaaagaaac 6961 ctactttgac ttagtacaag aaggggtgtg tctgcgactg attcccctct cgacgccttt 7021 tcaagagcgc caacagcgtt gcttgaatat caacgatcta gaagtgatga tggagcaagt 7081 cctatcagcc agatgggatg cagaatttga tgacgataat tccgacttgc tttaaagtca 7141 tcagcgatta gatatcagtt tttgaggatt tcttcgcaag ttattcagat attctatcgt 7201 ttttgcgaga gtgaacaaca gcgtctcgtt aaattactcc 
cgaagcgcca gaaaaatacc 7261 tgttttttaa ccgcacgagg caagagaggg cacagacaga ggagtaaatt actataggta 7321 atcttagacc accgaaggga gtaggataaa ccgtgagcta aaaagctgac gcccaagagc 7381 cagcgcaaag cgtgagggtt tcccgttttc acggcgactg gcgttgctga atatttgagg 7441 ttaattctat cttgagtgca aaattttctt ttcaacatct tgccaactgt agccaaacga 7501 aagctagagc tgggattttt tttacccctc atggtccagt ggaaacccca cgatttatgc 7561 cagtggggac tttggcaaat gtcaaaactt taactccagc ccagctccag gatacgggag 7621 cacaaatggt attgtgtaat acttatcatt tgcacctcca accaggtgaa gcaattgtgg 7681 ctggaggtgg aggactgcac aaatttatgg gctggaatgg tccgattctg actgattccg 7741 gtgggtttca ggtttttagc ttgagcgaaa tgcgaaaaat cagtgaagaa ggtgtcacgt 7801 ttcgctcacc ccatgatggt caaattatta atttaacacc agaacgctcg attgagattc 7861 agaatattct gggagcagat gtgatcatgg cgtttgatga atgtccgccc tacccagcca 7921 gtcgggaaga tgtggaagca gcgacgcaac gaacttaccg ttggctagaa cgttgtattg 7981 tggcgcatca acgtagtgat caagcattgt ttgggattgt acagggagga gtatatcttg 8041 atttgcgctc tgatgcggct attgctttaa gaaagttaga tttgccggga tatgctattg 8101 gtggtgtgag tgtgggcgaa ccgccagaac tgattgcaaa aattgtgcaa gcaacagcac 8161 cactgttacc accagaaaaa ccgcgttact tgatgggtgt ggggacttat cgagaaattg 8221 caattgcgat cgcctctggt gtagatttat ttgactgcgt gattcccact cgctgggcga 8281 gacatgggac ggcgatggtg caaggggaac ggtggaactt aaagaatgca aagtttcgtg 8341 aagattttac gcctttagat gaaacatgcc cttgttacac gtgtcaaaac tttagccgcg 8401 cgtatttgtc tcatttggtg cgatcgcaag aaatattggc ttacaccttg ttaagtattc 8461 acaacatcac tgaactcatt cgctttaccc aaagaatacg caaggcaata ttagaagacc 8521 gattttctac agaatttgcc caatggctga ctccagaaag tggcgagtgt tcagtagttg 8581 acaactcccc agcctaaagg cgtggggatt cttccttcat ccaacgttgc ctctaaaagc 8641 agtgccttta gatggtctta cacgttgtcc agaagcggta gcgagatgcc ctcccgccaa 8701 aagtcgtaaa ccttcatctc tgatattgat agcggcatta atatccctgt cgtgcgtcgt 8761 ttggcattta ggacattgcc aactacgaat atctaaactt aggttatcta cttgatgtaa 8821 gcagttacta caagtctttg aactagggaa aaatctatca acttcaatgt agatttttcc 8881 ctcccactca gccttgtaat ttaacatagt gcagaactga ccccatccag 
catcactgat 8941 tgatttagct aacttggaat tcttcaccat gtttctgaca gccagatttt ctacgactat 9001 gacttggttt tcgtcaacta atttacgact tagtttatgt agaaaatctt cacgacatct 9061 agtaattttt gcgtgaacaa tagccacttt cctacgtgct ttgttgcgat tgttagaccc 9121 cttctgtttg cggctaaatt gcttttgttt cttggaaagc tttacttcat atttgcggta 9181 atatttggga tttccatgct tggttccatc agacgtaata gcatagtgtt ttatccccat 9241 gtctacacca atagctttgc cttgtgacga tgaaacaagt ttatttttac catcatcata 9301 gaggcaagca gcatagtatt cacctgatga ggtcagcgaa accgtaacag atttaagttt 9361 tccctctggc aatcgagata ctttgcagta aaccttaccc aattttggca atgtgagata 9421 atcgccatct agttttattc cttgaggaaa gcgaatagat tgcttttgcc ctttcttctt 9481 aaaacttggg tacttagcac gcttctcaaa gaagttgaga aaagcgctgg atagatctaa 9541 tgctacttgc tgtaaacact gcgatggtgg ctcggttaac cattcgtact gctttttcaa 9601 acaaggaagt aagttaatgg tgtcattacg acttaacccc ttaccagttg ctttgtaagt 9661 ttcactagtt aggttaagag catagttgta aaaccagcgt acacatccaa acgacttagc 9721 aaggagtatt gcttgttcat tagtcggata gattctgtac ttgtaggctt tcaacattct 9781 gcttaccaaa ccactaacac tatggtaaag taaaacacaa aagtaaagcc gtgcttctag 9841 cacggggttt taaacccaaa atcttcgata aagactcagg actcatagag gactaaagcg 9901 tgttaagata cttgtcaatt ctgaaagctg tgttggatag gtggatttaa aaaattatgg 9961 aagcagcatt gttattagca aaactgcctg aagcttacca aatcttcgac cctctggtag 10021 atgttctacc actcatccct ttgttcttct tgttgctggc tttcgtttgg caagcagctg 10081 ttggttttag ataaaatcgg tggtgcaatt tatcgcgttt tcaccatcat aaaaatatga 10141 catatcaagt ggggtgggca ttgcctaccc cacttttaca gcagttttca attgggtaga 10201 acaaaagttg cctctttgag acacagcaat gctttgtgtc gtaccacgtg gtttgtcgtt 10261 tatgcaaatc aaaagcaata tatcttaact taaagtgaca attattaata ttacgtaata 10321 aaataagaaa agagatcata cattattctt aaagcaagtt aaaacttagc tttccactca 10381 ggcttacagt aaaagaggta tcaggcatgg gtaatatcac attcgtcaaa gaaaataaag 10441 aagtcgtagc agcaaatggt gcgaatctac ggctgaaagc catgcaaaac aacgtagata 10501 tatataaagt ctttggcaag atgatgaatt gtggaggcaa cggtcagtgt ggctgctgcg 10561 ttgttgaagt cgttgaagga atggaaaatc tttcaccccg cacagatacg 
gaaaatcgat 10621 tattgaagaa aaagcctgca aattgccgtc ttgcttgtca aaccttagtt aatggtccag 10681 ttagtgtggt aacaaagcct taatggtgaa taatcaagag gtaagaaatc actcgttgaa 10741 gccgggataa tgatatcctt aaattgttca gtggatatca ttgagaggca ttctttccat 10801 gcaagttaat gatctgggct ttgtggcgag cattttgttc gttttagtgc ccgctgtttt 10861 tttaatcgtt ctttacatcc aaactgctag ccgtgaaggc tgattaatag gaaaaaagat 10921 agttaagacc tattttgtca atcacacgaa aacccctata cttatttgtg taggggtttt 10981 gtttatcaac cgcagtgact ttaacctggg gttcgcgcta acaagactgg acaggttgca 11041 ttaactcgaa catagtcaga taaagaagct cccaaaagtc ggtctatatc gaccaaattc 11101 ttggcaatcg aaggacgccg atctggagaa ccgagcaata ataaatctat gttccgttca 11161 tctgccaagc gacaaatttc ttcacctggt ttaccactgc tgatcacaca acgagctggt 11221 acaccatatt ttttcgcttc cgcataagct gcagctaaaa ccggattttg ttctggacta 11281 acttcagttg tttcagattc tttgccacgt aaatctgtat taatattagc cagaatcaac 11341 tctcctccct taatatctcg taaaaggaat aaagccaaac ttaagcaata ttttgccgca 11401 tcagagttat caattgccac cataacgcgc ttgacttttt tgacataaat gtcatctttg 11461 acaagcaaca tggggcgaga agataattgg aaaacatact gactgacaga gttcgataaa 11521 atcgattgta accgcttgag tccgcgcgaa cccataataa tcaagtcagc atcgatttcg 11581 tccgctacct cgcaaacaac acgtttagga tcaccttgcc gcaaaatcga agaaactttg 11641 ctgggatcta agttcaaata ctggattgca ttagccagaa tcttgccacc ttcttcccac 11701 ttagttgtca tggcttcagc agaagaacgg tcagaaacaa catgcaatac tgtaactttt 11761 gaccgttgga ttgagggcac atctgccaaa gttttgagca tttcttctgc atgacccaat 11821 ccagagacag ctagcaaaat tttttctatc atcatggcgt cttgttgttc aggttgcttt 11881 gggaattagc cattggtcat tggtcaatag tcaatagtac aagtcgtgaa agaatgacaa 11941 ttgacaatga catttaagtt tatttacgcc tagttactta aagttccaac taagcttcat 12001 taatttcagc ttactcctaa ttctgactga tgagcttaac gaataattat taaatttatt 12061 agttttaatc tatgtttact atttttcatt aatgaataag ttaagtattg caaagaagtg 12121 ctaatttttg ataaaaaatc aagtttgcta tagaattttt attttatttt tgcgaagctt 12181 cgtacagttt atgttaacca gttatcagta atcagttatc agttaagtaa gtgggttgaa 12241 tctcagattt tgcaccctcc gggtatgacc 
ttcggtcacg ctgcgctaac accagtcgct 12301 catgggggaa accacggcag gtgctacaac gggacgccac atgcttcaag tcggcaaagc 12361 cgcccaacgc agtggctcgg gaacccccgc aacgcac // LOCUS NODE_2764_length_12263_cov_5.14760812263 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12263) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12263) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12263 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(136..960) /locus_tag="DP116_21935" CDS complement(136..960) /locus_tag="DP116_21935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319448.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_21935" /translation="MNFSSARQYFAQEIQQPDEHIDLAKAALYIAQEEYPNIDPEEYL NAFDTMAVELQERLPSQRYPLRVIQSINQYLYDDLGFAGNQKDYYDPRNSFLNNVIER RLGIPITLALVYMEVSRRVDFPMVGVGMPGHFLIRPEIPDTEIFIDAFNFGEVMFPQD CQERLNQVYQQPVQLQPEFLATVSKRQLLARILANLKYIYLNQQELEKALAAVERILL LFPGAALELRDRGLLCYELGLFAQAANDLETYLIKAPQAEDAVTIRQILSLLKRMS" gene 1475..2068 /locus_tag="DP116_21940" CDS 1475..2068 /locus_tag="DP116_21940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019339678.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="(2Fe-2S)-binding protein" /protein_id="PRJNA477356:DP116_21940" /translation="MENTSTTTSERVSILDQIPSSNAPLTCSVPSAGTIAVTLHINGT EYNLQIDPRVTLLDVLREYIGLTGTKKGCDHGQCGACTVLVDGYRINSCLTLAVSYDG EQITTIEGLAEGEDLHPVQEAFLHHDAFQCGYCTPGQIMSAVGLLLEGHAKTDADIRE QMSGNICRCGAYANIFAAVRELCNGEDNSNPNASAHQ" gene 2139..3137 /locus_tag="DP116_21945" CDS 2139..3137 /locus_tag="DP116_21945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015122376.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="xanthine dehydrogenase family protein subunit M" /protein_id="PRJNA477356:DP116_21945" /translation="MNPFTYVRAANSDEAIATLTREPQAMFIAGGTNILDLMKEGVHT PSQLVDIRKLPSTEIVTKDDGGIRIGATARNSDVAYNSIVQERYPVLSEAILAGASAQ LRNMATVGGNLMQRTRCSYFHDTAFACNKRQPGSGCAALEGFNRMHAVLGTSEHCIAA HPSDMCVALVALDAVVQTHGPRGERSIPITDFHLLPGETPHLETVLEHGEIITAVDLP EIPLNRRSHYLKIRDRASYAFALVSAAVVLEIDEEVIRNARIALGGVGTKPWRSLEAE EVLVGAPATQETFTAAANAAMQEARPYRHNEFKIELAKRTIKEALKTVAAISGGQA" gene 3134..5446 /locus_tag="DP116_21950" CDS 3134..5446 /locus_tag="DP116_21950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019507307.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="xanthine dehydrogenase family protein molybdopterin-binding subunit" /protein_id="PRJNA477356:DP116_21950" /translation="MSSGVISPPKEREQVVGKPINRVDGRLKVMGAASYAAEIPQENI AHAVLIQSTIAKGRIKNIETSEAEKAPGILTILTHLNAPKLNQMQQGDIVKGALGEKL VPLQSDEVFYDGQHIGVVVAETLEEAKYAASLVRVTYEEEKPSVEIESESPKAYQPKQ FFGEELQVQRGDVTKAFAAADVKIEQTYTTPIEHHNPMESSASIAVWNDNQLTIYDAT QWVIGTRNVVAYTLGIPEESIRIISHFVGGGFGCKGFTWWHSILAAVAARVVSRPVKL MVTRQQMFTSCGHRSRTIQQLALSATKNGKLTAIKHVTTSQTSEVDEFIEPCGLTTRM LYACPNLKVVHHVVRVNTGNPTPMRAPGEAPGMFALESALDELAYELGIDPVELRIIN HADVNPHTGKPWSSKYLKECYQLGAERFGWSRRNPTPGSMRDSDYLIGWGMATATYPG YRSPASAKAQLFADGRAVVSSATHDLGTGTYTVMTQIAADALGLPVERIEFKLGESSM PLAPVAGGSQSAASVAPAVQGSAQELRSRVIRLAIDDESSPLYGVAQEAILTENGRVF LKNEPSRGETYAELLQRNNLPILEVEAIANTAASESQQNSDNKVVRICVGKDENSDQQ QYAFQSFGAQFAEVRIHPRLGQVRVTRFVSAIDVGRILNHKTARSQILGGITFGIGMA LMEETVLDQQSGRFVVRNLADYHVPVQADVSDIDVLFIDKPDPHISPIGVRGVGEIGI TGVAAAVANAIYHATGKRIRELPITPDKLL" gene 5622..6785 /locus_tag="DP116_21955" CDS 5622..6785 /locus_tag="DP116_21955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008184823.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XdhC/CoxI family protein" /protein_id="PRJNA477356:DP116_21955" /translation="MKELQDILTDFLAIKSRGQTAVLATVVKVKGSTYRRPGARMLMT QDGCMTGSISGGCLENDVFEHAKQVMASGEPILVKYDPEVAEEIIWALGLGCNGAVHV LIERLDKQLTFIAQCLTKRHSGVLATVFCVDGQVQAKVGNHLMLDSDKNMTTEIADST LNQAIITDAQAALQEQKSKVQTYQFSTGRVEVFIEFIKPPTPLLIFGAGQDTIPVVRF AKELGWHVTVVDHRPTYLTPEKFPNADKLILTSAEAAHKNVLLEDNTVAIAMTHNYFH DRELLKMLLPSAVRYIGVLGPKRRTAELLEDLHSIGMFYTQEQLNRVYAPVGIDIGAD TPVEIALSIIAEIQAVLAKRTAGLLRDRIGPIHHPIDEPDVQAIQSRQQHLNV" gene 6782..7405 /locus_tag="DP116_21960" CDS 6782..7405 /locus_tag="DP116_21960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859178.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nucleotidyltransferase family protein" /protein_id="PRJNA477356:DP116_21960" /translation="MINSPQAENATSAIGAIILAAGASTRMGQPKQLLQFQGRSFLRH TVEVVVASVCNPIIVVLGAYAEKMRQEVSQLPVLVVENSQWDEGMGASIKVGMTALNA AAEEIEGVVLTLCDQPFISCNVINQLVAAYHSTGQGIIASEYAQTLGVPALFSHKFFS DLTSLEATSGAKQVIKKYSHEVFCLPFAAGAIDIDTPQDYERLLSQS" gene 7696..8100 /locus_tag="DP116_21965" CDS 7696..8100 /locus_tag="DP116_21965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fasciclin domain-containing protein" /protein_id="PRJNA477356:DP116_21965" /translation="MADIVDTAVNAGSFSTLVAAIQAANLVDTLKGAGPFTVFAPTDD AFAKLPAGTVDALLQDIPKLQKILTYHVVSGKVTSAEVVKLDSAPTVEGSQVKIDASN GGVKVNDATVTTPDVTADNGVIHVIDTVLIPA" gene 8468..9916 /gene="atpD" /locus_tag="DP116_21970" CDS 8468..9916 /gene="atpD" /locus_tag="DP116_21970" /EC_number="3.6.3.14" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit beta" /protein_id="PRJNA477356:DP116_21970" /translation="MVTTAEKTNIGYITQVIGPVVDVKFPNGKMPQIYNALTIKGTNE AGQNISVTTEVQQLLGDNQVRTVAMSSTDGLVRGLEVTDTGAPISVPVGKATLGRIFN VLGEPVDNRGPVNSDEKLPIHRDAPKFTELETKPSVFETGIKVVDLLTPYRRGGKIGL FGGAGVGKTVIMMELINNIATQHGGVSVFAGVGERTREGNDLYNEMIESGVINKDNLN ESKIALVYGQMNEPPGARMRVGLSGLTIAEYFRDVNKQDVLLFVDNIFRFVQAGSEVS ALLGRMPSAVGYQPTLGTDVGALQERITSTNEGSITSIQAVYVPADDLTDPAPATTFA HLDGTTVLSRGLAAKGIYPAVDPLDSTSTMLQPNIVGEEHYNTARAVQSTLQRYKELQ DIIAILGLDELSEDDRLTVARARKVERFLSQPFFVAEVFTGSPGKYVKLEETIKGFQR ILAGELDDLPEQAFYLVGNIDEAIAKAEKLKG" gene 10113..10526 /locus_tag="DP116_21975" CDS 10113..10526 /locus_tag="DP116_21975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209815.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit epsilon" /protein_id="PRJNA477356:DP116_21975" /translation="MTLTVRVISPDKTVWDAPAEEVILPSTTGQLGILSGHAPLLTAL DTGVLRVRANKNQAWIAIALLGGFAEVEQDEVTILVNSAERGDTINIEEARAALSEAE ARLNQVQAGERQSQIQANQAYKRARARFQAAGGTV" gene 10946..12139 /locus_tag="DP116_21980" CDS 10946..12139 /locus_tag="DP116_21980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320114.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA replication/repair protein RecF" /protein_id="PRJNA477356:DP116_21980" /translation="MYLKTLHLRQFRNYKQQQVEFSAPKTILVGNNAQGKSNLLEAVE LLATLRSHRMARDRDLIQDGETIAMLVATLERQTGISDLSLTLRRNGRRTVALNGETL RRQMDFLGVLNAVQFSSLDLDLIRGGPEGRRNWLDTLLIQLEPVYAHILQQYNQVLRQ RNAFLKSTLQHQSPSLREAGARLQEKSLHQVSAKAQTPQHSELDIWNAQLATTGTRVI RRRDRAIQRLAPIAKAWHASISGGTEILELNYSPNVSLDQNNPEQLQQAFLEKIEQRA VAELHQGTTLVGPHRDEIELTINQTPCRQYGSQGQQRTLVLALKLAELQLIEQVVGEP PLLLLDDVLAELDPSRQNQLLDAIQDRFQTLITTTHLGSFDSQWLNSSQILAVDSGEI SQANK" BASE COUNT 3498 a 2797 c 2812 g 3156 t ORIGIN 1 gagcaagaag agagatttaa ttatttagtg caagtttata gagaatcggt attactaaaa 61 ttcacttatg cattgtcatt aaaatttaga tatttctact aggacttcat gactcatttt 121 caacacgact gaatgtcacg acatcctttt aagcaaactc aggatctgtc gaattgtaac 181 agcatcttcg gcttggggag cttttatcag ataagtttcc aagtcgttag cagcttgagc 241 aaatagaccg agttcgtagc acaaaagacc gcgatcgcgc aattctagtg ccgcgccagg 301 aaacagtagt aaaatccgtt caacggctgc taaagctttt tccaactcct gctgattcag 361 gtaaatatat ttcaaattcg ccaatatccg tgctaacaac tgcctcttac tcactgttgc 421 taaaaattcc ggttgcagct gcacgggttg ttgataaact tgattgaggc gttcttgaca 481 atcttgtgga aacatgactt cgccaaaatt gaacgcatca atgaaaatct ccgtatctgg 541 aatctcagga cgaattagaa aatgtcctgg cattcctacg cccaccatag gaaagtctac 601 tcttcgggag acctccatat aaacaagtgc taaggtaata ggaattccca accgacgttc 661 aatgacatta tttaaaaagc tgttgcgagg gtcatagtaa tctttttgat tcccagcaaa 721 tcctaaatca tcataaagat actgattgat actttgaata actcgcaagg gatatctctg 781 ggatggtaag cgttcttgta gctcgacagc cattgtatca aaggcattaa gatattcctc 841 tgggtcgatg ttgggatatt cttcttgtgc tatgtataaa gctgcctttg ccaagtcaat 901 atgctcatca ggctgctgga tttcttgagc aaaatattgc cgcgctgacg agaagttcat 961 aagcagatgt tcttgaatat aggttcaaaa aaaagacgaa agcgtatatg atgctttttc 1021 tacttttata ttaaacttgc tattgcgatt tcaaaaatag aaattcaaaa ttcagcttac 1081 agttaagtca atagtgggat tgacagtagc tcaagcacac tcggtgctgg aattttccct 1141 atttttgcat cactttttcc cgaaagatta aaattttagt attttatact aaatttgctt 1201 gaaatgctat 
ctaatttaaa tgctctctaa aaagtataaa gaagttgctt gtctgtggtt 1261 gctcatgtca atgatacaga gcttaatata cgcaaccaac aacgctggac aagctgctgt 1321 cagttatagg taaaaacaat cgttcacaaa attaaatttt taacactggt tgatttgatg 1381 aaaataaaaa catagatgta catgaataaa ctaatgtata tatgttcatc tgtggttatc 1441 aaatacccaa cagccttctt gaaggagcga gaacatggag aacacgagca caactacttc 1501 agaacgcgtt tcaatactcg accaaatacc atcgtcaaat gcgccgttga cctgttctgt 1561 accttctgct ggtaccatcg cagtcacatt acatatcaac ggtacagaat acaatttgca 1621 gattgatccg cgtgtcactt tactggatgt cctacgagaa tatattgggc taacaggtac 1681 taaaaagggc tgtgaccacg gacagtgcgg tgcgtgtact gtacttgttg acgggtatcg 1741 tatcaactca tgcctgacgc ttgctgtctc atatgacggt gagcaaatca cgactatcga 1801 gggtttagcc gagggtgaag atttgcaccc agtacaagaa gcatttctcc accacgatgc 1861 ttttcagtgc ggttactgca ctccagggca gattatgtct gctgtaggac tgctcttaga 1921 aggacacgcg aaaaccgatg ctgacattcg cgaacaaatg agcggcaata tttgtcgatg 1981 tggcgcttac gcaaatattt ttgcagcagt tcgtgaattg tgcaacggcg aggacaattc 2041 taacccaaac gcctcagcac atcagtgaac agtgaacagt gataactgat aactgatcac 2101 tgatcactgt tttaagcaaa ataacggagg caaaacggat gaatccattt acttatgtac 2161 gcgctgctaa ttcggacgag gcgatcgcaa cactcactcg cgagccacaa gcaatgttca 2221 ttgcaggtgg taccaacata cttgatttaa tgaaagaagg tgtacataca ccaagtcaac 2281 ttgtagatat tcgcaaacta ccgtcaacag aaattgtcac taaagatgac ggtggtatca 2341 gaattggagc gacagcgcgc aacagcgatg ttgcatacaa ttcaatcgtt caggaacgct 2401 atcctgtgct gtcagaagcc attcttgcag gagccagcgc tcaactacgg aacatggcaa 2461 cagtcggcgg gaacttgatg cagcgaacac gttgctccta ctttcatgat acagcctttg 2521 cctgcaacaa acgtcagcct ggttcaggct gcgccgcatt agaaggtttc aaccggatgc 2581 atgcagtctt aggcacaagt gaacactgta tcgccgcaca tcccagtgac atgtgtgttg 2641 cgctcgttgc gcttgatgca gttgtccaga cacacggacc tagaggtgag cggagtattc 2701 ccatcacaga ctttcacctg ctacctggtg agacaccaca tttagaaaca gtgctggaac 2761 acggtgagat aattacagca gttgatttgc cagaaatacc attgaacagg cgatcgcact 2821 acctaaaaat acgagacaga gcgtcttacg cctttgcgct cgtatctgca gccgtggttc 2881 tcgaaataga tgaggaagtg 
attcgcaatg cgcgtattgc tcttggtggt gtaggaacaa 2941 agccgtggcg ttctttggaa gcagaggaag tattagtcgg tgcaccagct acacaagaga 3001 ctttcaccgc agcagcgaat gcggcgatgc aggaagctag accatatcga cacaatgaat 3061 tcaaaattga gttagcaaaa cgaaccatta aagaagcact caagaccgta gcagcaatct 3121 caggaggtca agcatgagtt caggtgttat ttccccccca aaagaaagag agcaagttgt 3181 cggtaagcca attaaccgtg ttgacggacg actcaaagtc atgggtgcag cttcctatgc 3241 agcagagatt ccgcaagaaa atatcgctca tgctgtgctc atccaaagca ctattgctaa 3301 aggtcggata aaaaacatcg agacttctga ggcagagaaa gcaccaggca tacttactat 3361 tctcacccat ctcaacgcgc caaaacttaa tcaaatgcaa caaggagata tagtcaaagg 3421 tgcgcttggt gaaaagttgg tcccactgca atcagacgaa gttttctatg acgggcaaca 3481 tattggcgta gttgttgctg agacgttaga ggaagctaaa tacgctgcat ccctcgtccg 3541 tgtcacttat gaggaagaaa agccaagcgt tgaaatagaa tcagaatcac cgaaagcata 3601 tcaaccaaaa cagttttttg gcgaagaact gcaagtgcaa cggggcgatg taaccaaagc 3661 ctttgctgca gccgacgtca aaattgaaca gacttacaca acgccaattg aacaccacaa 3721 cccgatggag tcgtcagctt ctattgccgt gtggaacgac aatcaactca caatctacga 3781 tgccacacag tgggtgattg gcacacgcaa tgttgttgcc tacacgcttg gtataccaga 3841 ggaaagtatc cgtatcatct cgcacttcgt cggaggtgga tttggttgca aaggttttac 3901 atggtggcac agtatcttag cagcagtcgc cgcgcgtgtc gttagtcgtc cagtcaaact 3961 tatggtcacg cgccagcaaa tgtttacctc ttgcggtcat cgttcacgca ccattcaaca 4021 gctggcactg agtgcaacaa aaaacggaaa gctcacagcc attaagcacg tgacgacatc 4081 gcaaacttcc gaagtagatg aatttattga gccatgcgga ttgacaacaa ggatgcttta 4141 tgcgtgtcca aacttaaaag tggtgcatca tgtagttcga gtcaacactg gcaatccaac 4201 accaatgcgg gcaccaggag aagcgccagg tatgttcgcg ttggaatcgg ctctggatga 4261 gttagcgtat gagttaggaa tcgaccctgt agaacttcgc atcatcaacc atgctgatgt 4321 aaatccgcac acaggcaaac cctggtcgag caaatacctc aaagagtgtt accaactggg 4381 tgcagaaaga ttcggctggt cacgccgtaa ccccacacca ggttctatgc gcgattctga 4441 ctatctcatc ggttggggaa tggcaactgc aacgtatcca ggctaccgtt ctccagcgtc 4501 agcaaaagcg caacttttcg ctgatggacg tgctgtcgta tcgagtgcaa cccacgatct 4561 cggtacaggc acttacacgg ttatgacaca 
aattgctgct gatgcgcttg ggctacctgt 4621 tgaacgaatt gagtttaagc tgggtgaatc gtccatgcct ttagcacctg ttgctggtgg 4681 ttcgcagtca gcagcaagtg ttgcaccagc tgtccaggga tcagcacaag aattacgtag 4741 tcgggtgatt cgtcttgcga ttgatgacga atcgtcaccg ttgtatggag tcgcacaaga 4801 ggcaatctta actgagaacg gacgtgtttt cctcaagaac gagccatcga gaggtgaaac 4861 ttacgcagaa cttttacagc gcaacaatct gccgatacta gaagttgaag cgatcgcaaa 4921 cactgctgcc tcagagtcac aacaaaattc agacaacaaa gtcgtgagaa tctgcgttgg 4981 caaagacgaa aactccgacc agcagcagta tgcgtttcaa tcttttggcg cgcagttcgc 5041 tgaggtacgc attcatccac gtttgggaca ggtacgcgta acgcgcttcg tgagcgctat 5101 tgatgttgga cgcatcctca accacaaaac agcacgcagc caaattctag gcggtatcac 5161 ttttgggata ggcatggcat taatggaaga gactgtactc gaccaacaaa gcggtcgttt 5221 cgttgtgcgt aacttggcag attatcacgt tcctgtccaa gcagatgtct ctgacattga 5281 cgtgctattc attgataaac ccgacccaca cattagccct atcggtgtgc gcggtgttgg 5341 tgaaatcgga attactggcg ttgcagcagc agttgccaat gccatctatc atgcaacagg 5401 aaagcgcatt cgcgagttgc caataactcc tgacaagctg ctgtaaacct acaccttgca 5461 gcattgtttc tcgttcccat gctcagcatg ggaatgcata acttgaggct ccgcctcaat 5521 ataatcattg ggaggcagag cctcctttta ggcattccca gcctcaggct gggaacgaga 5581 gatataactt cttgcaatcc ctagtcaatt ctcccgaaaa catgaaagaa ttacaagata 5641 tccttaccga cttcctggca attaaaagtc gtggtcaaac tgcagttctt gctacagttg 5701 tcaaggtcaa aggttctact taccgacgac ctggtgctcg aatgctcatg acccaagatg 5761 gttgtatgac aggttctatt agtggcggtt gtctggaaaa tgacgttttt gaacacgcca 5821 aacaagtcat ggcttcaggt gaaccgattt tagtaaaata tgacccagaa gtcgcagagg 5881 aaattatttg ggcacttggg ttaggttgta atggagcggt tcatgttctc attgagcgtt 5941 tggataagca attaacattc attgcccaat gtctaacgaa aagacactca ggagttttag 6001 caactgtgtt ctgtgtggac ggtcaagttc aggcaaaggt gggaaatcac ttgatgcttg 6061 actcagataa aaatatgact actgaaatcg cagattctac tcttaaccaa gctattatta 6121 cggatgctca agcagctttg caagagcaaa aatcaaaagt tcaaacttac cagttctcaa 6181 ccggtcgtgt agaagttttc attgaattca tcaagccgcc aacaccgcta ctcatttttg 6241 gagcaggtca agatactatc ccagttgtac gctttgctaa 
agagctaggt tggcacgtca 6301 ctgttgtaga ccaccgaccg acttatctaa ccccagaaaa atttcccaat gctgataagt 6361 tgattcttac cagtgcagaa gctgctcata aaaatgtgtt gttagaggat aacacggtag 6421 cgatcgcgat gacacataac tacttccacg acagagaact cctaaaaatg ttgttgccat 6481 ctgcagtacg ctacataggt gtgcttggtc caaaacgcag aaccgcagaa ttactagaag 6541 atttgcactc cataggaatg ttctacactc aagaacaact caaccgagtg tatgctccag 6601 ttggtattga cattggcgct gatacaccag tggaaattgc gctttccata attgctgaga 6661 ttcaggctgt ccttgccaaa cgtactgctg ggttgttgag agatagaatt ggaccgattc 6721 atcatcctat cgatgaacca gatgttcaag caatccagtc ccgtcagcaa catctcaacg 6781 tatgataaac tcgcctcagg cagaaaatgc aacctcagct attggcgcaa tcattctggc 6841 ggctggcgca tcaactcgca tgggtcagcc gaaacaactc ctacaatttc aagggcgcag 6901 cttcctgcgt catactgtag aggttgttgt tgcctcagtt tgcaacccta ttatcgttgt 6961 gctgggagcg tatgcagaaa aaatgcgtca agaagttagc cagcttccgg ttctggtggt 7021 agaaaactcg caatgggacg aggggatggg tgcttctatt aaagttggta tgacagcact 7081 caacgctgct gctgaagaaa ttgagggagt cgtgctgaca ctgtgcgacc agccatttat 7141 ttcctgcaat gttatcaatc aacttgtcgc tgcttaccat tctacaggtc aggggattat 7201 tgcttctgaa tatgctcaaa cattgggagt tccagccctt tttagccaca agtttttttc 7261 agacctcacc agtttagaag cgacatcagg cgcaaaacaa gttattaaga aatattccca 7321 tgaggtcttt tgcctccctt ttgcagcagg tgctattgat atcgatacac cgcaggacta 7381 cgagcgatta ctaagtcaaa gttagtgaca tcctcccaca ccaatatcat tgattatggt 7441 gtgggcttcc caacgttgcg gttgggcttt cctgtttccg ccgtgttcaa ctatagcaaa 7501 cacagcagtt ttctatgatt gatagccaga acacaccata tcgccagtaa ggcatacgat 7561 acgctccgtg ttttgcatta atccacgtcg tgaataatta aaaaaataat acggtatttt 7621 tatcggctta tagttgttat tatccaaaaa gtagtttata cttgttaaca aatgaaataa 7681 aactaggtga aacacatggc tgacattgta gatactgcag ttaacgctgg ttctttcagt 7741 actctagttg cagcaatcca agctgccaat ctggtagata ctctcaaagg cgctggtcca 7801 ttcaccgttt ttgcacccac agatgatgcg tttgctaagc ttccagcagg cacagtagac 7861 gcattgcttc aggacattcc aaaactccag aaaatcctga cgtatcatgt tgtttcaggc 7921 aaggtgacat cagctgaagt agttaagctt gactcagctc ctacagttga 
aggttcacaa 7981 gtgaaaattg atgcttctaa tggcggcgtc aaagtgaatg atgccacagt cacaacaccg 8041 gatgttactg ctgataatgg tgtcatccat gttattgaca cagtgttgat tcctgcataa 8101 ggaaacactt cacccagtaa atatcgcttt tggggcagcc actacctaca gtcggtgccc 8161 cacattaaaa ttagaaaaca ccacattaac caaatctcta gttattactg aaagttgacc 8221 ggttttttcg gaatctggtc tagctgctgt gtctgccatc aatagacaca gcatttgtta 8281 tagatgaact ttttttgtta acgaatatag cagcataacc taattaccac tccacagcct 8341 gaatcttaag gagaaattta agcgcaatag aggggatcgt tcagtgctaa tcttgaagag 8401 acaagagcaa aatcttatgt ccttgacgac ccgatcccaa atatagcttc aatcattaag 8461 gctcagcatg gtcaccacag cagaaaaaac aaacattggt tacattaccc aagttattgg 8521 tccggttgtt gatgtcaagt tccccaacgg gaaaatgccg caaatctaca acgctttgac 8581 aatcaaaggt actaacgaag ccggacaaaa catctcagtt actaccgaag tgcagcagct 8641 gctaggtgac aaccaagtcc gaactgttgc gatgagttcc actgatggtt tagtgcgtgg 8701 tctggaagtt actgatactg gcgctcccat cagtgtgcct gttggtaaag ccacgctggg 8761 tcggattttc aacgtccttg gcgaacctgt ggacaatagg ggacctgtca attcggatga 8821 aaaattgccc atccaccgcg atgctcccaa attcacagaa ttggaaacca aaccttcggt 8881 gtttgaaacc ggaattaaag ttgttgacct gctaactccc tatcggcgcg gtggtaagat 8941 tggtctattt ggcggtgctg gcgtaggcaa gaccgtcatt atgatggagt tgattaacaa 9001 catcgccaca caacacggcg gagtgtcagt gttcgctgga gtgggtgagc gcacccgtga 9061 gggcaatgac ctttataatg aaatgattga atctggggta atcaacaaag ataacctcaa 9121 cgaatcgaag attgccctag tgtacggtca gatgaacgag ccacccggag caagaatgcg 9181 ggttggtctg tcgggtttga caatagcaga gtacttccgc gatgttaaca agcaggacgt 9241 gctgctgttt gttgacaaca tcttccgatt tgtgcaagca ggttcagaag tatcagctct 9301 gttgggacgg atgccttctg cggtaggata tcagccaaca ttaggaaccg acgtaggtgc 9361 actgcaagag cgcatcactt ccaccaacga agggtcaatt acctcaattc aagcagtgta 9421 cgtacctgcg gacgacttaa ccgaccccgc acccgcaacc acattcgctc acttggacgg 9481 aacaacagtg ctatcgcggg gtttggctgc taaaggaatt tatccagctg tggatcccct 9541 agattctact tccaccatgt tgcagcccaa cattgttggc gaagaacact acaacactgc 9601 tcgtgctgta caatcaacac tgcagcgtta caaggaactt caagacatca tcgcgattct 
9661 cggtttggat gaactgtctg aagatgaccg tctgactgtg gcgcgtgcac ggaaagttga 9721 gcgtttcttg tcccagccgt tctttgtggc agaagtgttc actggttctc ctggtaagta 9781 cgtgaagctg gaagaaacca tcaaaggctt ccagagaatt cttgctggag agttggatga 9841 tctgccagag caagctttct acttggttgg gaatattgat gaggcgatcg ccaaagctga 9901 aaaactcaag ggctaatatt cagtgaacag tcatcagtga acagtgagac cagtgctgcg 9961 gtccttgttt ccctcgtgcc ggctctggcg ttagccgtca ggcgtgcgct ttgcgcatac 10021 ccgaagggtt atcaggataa gtatgataac tgttcactga tacttgatga ccagtaactg 10081 ataactgata actgataact gataactgaa atatgacatt gactgttcgt gtaatttccc 10141 cagataaaac tgtgtgggat gccccagctg aagaagtcat tttgcccagc accactggtc 10201 aactcggtat cctcagcgga cacgcaccac tgttaacagc cttagatacg ggcgttctgc 10261 gagtacgtgc caataaaaat caggcttgga tagcgatcgc cctgttaggt ggctttgccg 10321 aagttgaaca agatgaagtc acaattctgg taaacagtgc tgagcggggt gacacaatta 10381 acatagagga agcccgtgct gctttgagtg aagcagaagc ccgtttgaat caagtacaag 10441 caggagagcg ccaaagccaa atccaagcaa accaagcata caaacgcgcc cgcgctcgtt 10501 ttcaagctgc aggtggtaca gtctagctgc ttgtaggtag ggttacttta tctacaaatt 10561 gttctctgac agaagtttcc agtaagtctg aaggaacatg aggttataag aggcttagct 10621 tgcttttacc atactttagg cgtggctggc agactgctgt ctgaggacaa atactaccaa 10681 gggagtgcat tattcgcttt tgccagttct gcgttgcgta tgtcccaagg ggagctactg 10741 cggtcttggg gtttcacgcc actggctccc caagtggagc aagtggcgtg acacgcgctc 10801 gtcgcaaagc gtcccgttga gggatacgcg aacacagaga ataaaaaaac cgcagaggcg 10861 cagagaacac agagaaagag agaaagagag aaagagagaa agagagaaag agatagggag 10921 agggaaaatt ctctgcgcgc gtattatgta cctgaaaacc cttcatctgc gacaatttcg 10981 caactataaa cagcagcaag ttgagtttag tgctcccaaa accattttgg taggcaataa 11041 tgctcagggc aagtccaatc tgttggaggc ggtggaattg ctggcgacat tgcgatcgca 11101 cagaatggca cgcgatcgcg atttaattca agatggagaa accatagcca tgcttgttgc 11161 cactttagag cgacaaactg gcatcagtga ccttagtctc accttgcgtc gaaatggtcg 11221 ccgtaccgtt gctttgaatg gtgagactct gcggcgtcaa atggattttc ttggtgtctt 11281 aaatgcagta cagttctcta gtttagattt agatcttatt cgcggtggtc 
ctgaaggtcg 11341 tcgaaattgg ctggatactc tgttaatcca acttgaacct gtctatgctc atattttaca 11401 gcaatataat caggttttgc gacagcgcaa tgccttctta aaaagcactt tgcaacacca 11461 gtcgccatcg ttacgggaag cgggtgcgcg tctacaagaa aaatcactgc atcaggtcag 11521 tgcaaaagcc caaactccac agcattctga gttagatatt tggaatgcac agttggctac 11581 cacgggaaca cgggtgattc gacgacgcga tcgcgccata cagcgactag cccctattgc 11641 caaggcttgg cacgccagta tcagtggtgg tacagaaatt ctagaactca actactcacc 11701 caatgtctca ttagaccaga acaatccgga acaattacag caagcttttc tagaaaaaat 11761 agagcaacgt gcggttgcgg agttgcatca aggcacaact cttgttggtc cccatcgcga 11821 cgaaatagaa ttgacaatta atcaaacacc atgccgtcaa tacggttctc aaggtcagca 11881 gcgaactttg gtactagcct taaagctagc agaattgcaa cttatagaac aagtggttgg 11941 cgaaccaccc ctgctactac ttgatgatgt cttagcagaa cttgatccat ctcgacagaa 12001 tcagctactc gatgccattc aagaccgctt tcaaaccctg atcaccacta ctcatctggg 12061 ttcttttgat tcacagtggc taaattcttc gcaaattctt gctgtggatt ctggtgaaat 12121 atcgcaagca aataagtaag tctgcccgtc attagtcaac cgtctcccaa ttcgccaata 12181 cagtcgtcat ttatttatgg ggatttccct gttcactgtt cactgttcac tgttcactgt 12241 tccctgttcc ctgttccctg ttc // LOCUS NODE_2766_length_12257_cov_5.33805912257 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12257) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12257) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12257 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..526) /locus_tag="DP116_21985" CDS complement(<1..526) /locus_tag="DP116_21985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006635808.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="calcium-binding protein" /protein_id="PRJNA477356:DP116_21985" /translation="MGTNLTEISSLTLVSVADDGTQGNNYSRKPTISADGRFVAFYSA ANNLVAGDTNNSSDIFVRDLLTGTTTRVSVADDGTQGNNNSSNPAISADGRFVAFDST ASNLVAGDTNNTSDIFVRDLLTGTTTRVSVADDGTQGNGFSYTPAISADGRFVAFESS ASNLVAGDTNNISDI" gene complement(663..2063) /locus_tag="DP116_21990" CDS complement(663..2063) /locus_tag="DP116_21990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015118003.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alkaline phosphatase family protein" /protein_id="PRJNA477356:DP116_21990" /translation="MNKTVVLNVVGLSPSLLGEHTPSLSRWAASGKVVPIASVLPAVT CTVQATYLTGKMPNEHGIVANGWYFRDECEVKFWRQSNKLVQAPKVWDMARSLDPSFT CANLFWWYNMYSSADYAVTPRPMYPSDGRKLPDIYTQPQEWRSPQGGSSGASPLQAEL GQFPLFNFWGPNTSIASTQWIASSAKWVEERCNPTLTLIYLPHLDYCLQKFGPEDNLV TKDLQEIDAVCGDLIQYYENRGAQVIVLSEYGITPVSQPVHLNRILRENGLLAVREEL GRELLDPGASQAFAVADHQIAHVYVNDPFYIPKVRSLLENTEGVAQVLGEDEKPAYGL NHPRSGELVAIANSDAWFTYYYWLDDNRAPDFARTVDIHRKPGYDPVELFVDPDIKFP QFKIGFKLLKKQLGFRYLMDVIPLDATLVRGSHGCITASPANGPLFMTRQTHLLESES IAARDVCQLILRHLTL" gene complement(2675..3895) /locus_tag="DP116_21995" CDS complement(2675..3895) /locus_tag="DP116_21995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194647.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="xylose isomerase" /protein_id="PRJNA477356:DP116_21995" /translation="MKIETNHNFHLTYCTNIHPGEEWSKVFANLKQYIPALKAQLAPE KPYGIGLRLADVATKELLQGNALAEFQSWLTELDLYVFTLNGFPFGGFHWQVVKDQVY APDWSKQERLEYTLRLVNILAQLLPADMEGSISTLPLSYKPWFKGNQLFLASLTSSAS LNLAQVVAEMARIRTETGKLLHIDLEPEPDGLIENAAEVVDYFQRHLLPIGGEYLAKH LGISLEAAEALILEHVRVCYDTCHFAVEYENPVSVFKQFQAAGIQVGKIQISAALQVN IPNDREQRSLVKKRLLPFAESTYLHQVIARESTGRLRHYTDLEKALPDLDYTTDREWR IHFHVPIFIHDYQLFQSTQDDISTVLDLLQINHACSHLEIETYTWEVLPTEMKLDLLA SIQREYEWVLSKML" gene complement(4240..5478) /locus_tag="DP116_22000" CDS complement(4240..5478) /locus_tag="DP116_22000" /EC_number="4.2.3.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317601.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-dehydroquinate synthase" /protein_id="PRJNA477356:DP116_22000" /translation="MVIDIRQKATLNMQPIRQSFSVTFHYGVHFTNGVFDLKNSLLAQ VIAGDGEVGPKKVLAIVDSGLLQSRRTLLKQIVAYCDRHPDTLKLAAEPIIIKGGEAA KNDPNELSKIHQIIDEVGVCRHSYILGIGGGAVLDLVGYAAATAHRGIRLIRIPTTVL AQNDSAVGVKNGINAFGKKNFLGTFAPPYAVLNDFSFLTSLNDRDWRCGIAEAVKVAL IKDPDFFEYISTHADALICRDLNSMQQVVYRCAQLHMDHIANNGDPFEVGSSRPLDFG HWAAHKLEQLTNHRLRHGEAVAIGIALDSTYSYLIGLLSRLEWQRILNTLLSLNFLLY VPELSEKITELEHPRCLFRGLSEFREHLGGELTLTLLQGIGKGIEVHEVDISLFRQAT WLLGEFYNSSLPASGWECVM" gene complement(5472..6116) /locus_tag="DP116_22005" CDS complement(5472..6116) /locus_tag="DP116_22005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194649.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DsbA family oxidoreductase" /protein_id="PRJNA477356:DP116_22005" /translation="MLIDIYHDTVCPWCRIGHKHLFDALAQYQQTVKIRWHPFLLDNS IPAAGCEFRSYMQQRKGIEPQAIEKLFDHVRNIGHAAGVKLDFNKIHLAVNSILSHRL IALAPDDIKNDVVQAVYKAYFEEGLNLGDLDVIVAIGTKYGMNSTVLRLQLNGNALAD AVLAESTFARLNGITSVPFYVINNKVKIDGSHSSEVFLQALNRAALIEISTKIW" gene complement(6061..7242) /locus_tag="DP116_22010" CDS complement(6061..7242) /locus_tag="DP116_22010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408011.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional chorismate mutase/prephenate dehydrogenase" /protein_id="PRJNA477356:DP116_22010" /translation="MTQELLKQTDQSLCDPQRDTHTASLALGGDGDCLRQSYAERVEH QIHTSDRHTLLTASKISAIEEQLATLGSLLAQAGVPESVRVNLVNSCYAALSTVDSSS STQITPRRITIIGGAGRMGRFFTQQLTAAGHNVRILENEDWEYADNLLSCAELVIVSV PIQWTADVIKRAAQYLAPTTALCDITSIKTEPMSAMLEHHRGPVMGLHPMFGPNVKSF AGQKVVACPGRNDDSFEWLLDFIKSQGGEVIVSTPEEHDYMMVIIQATRHFSRFSVGV FLAQEKIDIERSLCMSSPSYRQEIDIIKRLFTQSPHLCVDIMLATHDRCHAIAKLADT YNRLARLVAQKDRGAIIQEFETAQNFFAEESITSPNLTHENHADRHLSRYRMPVVPNR S" gene complement(7423..8718) /locus_tag="DP116_22015" CDS complement(7423..8718) /locus_tag="DP116_22015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317598.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="colanic acid biosynthesis glycosyltransferase WcaI" /protein_id="PRJNA477356:DP116_22015" /translation="MRILIYSYNYHPEPIGIAPLMTELAEGLVKQGHQVRVITGMPNY PQREIYQEYRGKWYMTEQKNGVTIQRSYLRIKSKPNLIDRLLLELSFVFTSLPQALNG WQADVILLTVPPLLVSLPATLLGWLHKCPVVLNVQDILPEAAVRIGLIKNQWMIRALQ AIEQFAYRSAHTISVIADGFVENLVDKGVPAQKIVCIPNWVNVNFIRPLPKKSNSFRI AHQLQDKFVVLYSGNIALTQGLETVIETASSLRDIPQIVFVIAGEEKALKRLEAYCQK CGADNVLLVPLMPREKLPEMLAAADVGLVVQKRNVISFNMPSKIPLLLASGRPIVASV PVSGTAARAVRNSGGGIVVAPESPQALAEGIVDLYNNPRKVAKLGYFGRLFAVEHYSF EQALAEYEALFSEVIATKSQSPSLGMLPKLTSSESMIDI" gene complement(8889..9803) /locus_tag="DP116_22020" CDS complement(8889..9803) /locus_tag="DP116_22020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194652.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polyprenyltransferase" /protein_id="PRJNA477356:DP116_22020" /translation="MNAATLNSSRLWAYLQLLRPANIVTAWADILAGFAASGCIVFIS LSSSDLISLAWLLLATTGLYGGGIVFNDVFDAEIDAIERPERPIPSGRASRQGASILG GLLLSAGILAASQVSWLSATLGCGIAAAAVLYDAYSKHNPIFGPLNMGLCRGGNLMLG VSAVSPMVSNYWFLALIPIVYIAAITTLSRGEVHGGKSSTGGITLILIGMVIAALLGL SLLENYHLLAVLPFVILLAVRVLLPFIKAVSQPSPENIRTAVRAGVLSLIVLDATVAA GFASLPYGLLILCLLPISIALSQIFAVT" gene complement(9837..10745) /locus_tag="DP116_22025" CDS complement(9837..10745) /locus_tag="DP116_22025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317596.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrolase TatD" /protein_id="PRJNA477356:DP116_22025" /translation="MYIDPHIHMSSRTTDDYQKMRDSGIVAVIEPAFWFGQPRTSVGS FQDYFSSLVGWERFRAAQFGIKHYCTIGLNSKEANNEPLAEQVMELLPLYACKEGVVA IGEIGYDDMTPQEDKYFRLQLELARELDMLVMIHTPHRNKKAGTSHSMDVCIEHGLDP SKVIVDHNNEETVQEVLDRGFWAAFTIYPNTKMGNARMVEVVRQYGCNRIIVDSSADW GVSDPLAVPKTAQLMRERGIPEAHIRAVCYENALTAYSQSGQMHESDWLNPSPIDQRQ LFSGNSVLRGQKPVVESPTREYALIE" gene complement(10810..11703) /locus_tag="DP116_22030" CDS complement(10810..11703) /locus_tag="DP116_22030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317595.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22030" /translation="MVCASYLEKTTISANDLLHGWLKQRLNSQALTWLEQKIEQIKTA VNTLVFFSAFSAVPRYTGKNDLQLTKVELEAACYARTGWFPSHWSVDQAARILLVLSL PQDNEQKYLQTLERVFTAADVGELVALYQTLPLLPYPEKFRARAAEGVRSNMTAVFNA VALRNSYPAEYLSDLAWNQMVLKALFVGSPLYLIQGLEERANPELAQMLVDYAHERWA AKRTVSPELWRLVGRFADNAMLADLERAIQDPDPVQQAAAAIATSSCPLPQAQQLLAR YPNLQAQIQAGHLTWSRLTIS" gene complement(11824..>12257) /locus_tag="DP116_22035" CDS complement(11824..>12257) /locus_tag="DP116_22035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408013.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="PEP-CTERM sorting domain-containing beta-propeller repeat protein" /protein_id="PRJNA477356:DP116_22035" /translation="IGIEVDDQENVYVADSINSRVQVFDKNGKFLTSFGEPALDASGN PVPPPGLTDGPFGNPLDLRPGRFNWTGGTSLKDGKLYVGDFFQGRVQVLNVVKDGATK VPEPASLLGLGLLGIGATATTLRKKQQKATLNLEEPEKVAV" BASE COUNT 3364 a 2678 c 2785 g 3430 t ORIGIN 1 ggatatcgct aatgttgttg gtgtctccgg ctaccaggtt gctggcactt gactcaaacg 61 ccacaaagcg cccatcagcg gaaatggcgg gggtatacga gaagccattc ccctgggtgc 121 catcgtcggc gacagaaact cgggtggtgg tgcccgttag gaggtcgcga acgaagatat 181 cgctagtgtt gttggtgtct ccggctacca agttgctggc agttgagtca aacgccacaa 241 agcgcccatc agcggaaatg gcggggttag aggagttgtt attcccctgg gtgccatcgt 301 cggcgacaga aactcgggtg gtggtgcctg ttaggaggtc gcgaacgaag atatcgctac 361 tgttgttggt gtctccggct accaggttat tggcagcaga gtaaaacgcc acaaagcgcc 421 catcagcgga aatggtgggc ttacgggagt agttattccc ctgggtgcca tcgtcggcga 481 cggagaccag ggtgagagac gagatttcgg ttagatttgt acccatgata aatatttatt 541 ctcacggtat atacataagt atacggtaaa ggtcacgtaa gaaaaagttt attctgcgtt 601 tggtggaaat tggtgtatgg actaagaaac gacaaaacga ccgatttgtt tatacctatt 661 aattaaagag tgagatgtct gagaattaat tgacaaacat ctctggctgc gattgattca 721 gactcaagta agtgagtttg gcgagtcata aataatggtc cattggcagg agaagcagtg 781 atacaaccgt gagaaccacg cactaaggta gcatctaagg gaataacatc cattaagtag 841 cgaaaaccaa gctgtttttt cagcagtttg aaaccaattt tgaactgggg aaatttgata 901 tcgggatcaa cgaataattc cacaggatca taaccaggct tgcgatgaat atcaactgtt 961 ctggcaaaat caggggcacg attgtcatca agccagtagt agtatgtaaa ccaagcatct 1021 gagtttgcga tcgccactaa ttctcctgac cttgggtgat tcagaccgta agctggcttt 1081 tcgtcttcgc ctaaaacttg agcaacgcct tctgtgtttt ctaggagcga tcgcactttc 1141 ggaatataaa acgggtcatt gacataaaca tgagcaattt gatggtcagc tacagcaaat 1201 gcttgacttg caccaggatc aagcagttct cgtcccaatt cttcgcgcac agctagcaaa 1261 ccattttccc gcagtatccg attgaggtgg acgggttggg atacgggtgt aataccatat 1321 tctgacaaaa caataacttg agcgccgcga ttttcgtagt attgaatcaa atcgccacaa 1381 acagcatcaa tttcttgtaa 
atccttggtg actaagttat cttcaggacc gaacttttgc 1441 aagcaataat ctaaatgagg tagataaatc agtgtcagcg tcgggttaca gcgttcttcc 1501 acccacttag ctgaagaagc gatccactgg gtagaagcaa tcgatgtgtt gggaccccag 1561 aaattgaaca gaggaaactg accgagttca gcttggaggg gcgatgctcc agaggagccg 1621 ccttgcggcg aacgccattc ttgtggttgt gtataaatat cgggtaattt tctaccatca 1681 gacggataca ttggtcgcgg ggtaactgca tagtccgccg acgagtacat gttgtaccac 1741 cagaacaaat tggcgcaggt aaaggagggg tctagcgatc gcgccatatc ccacacttta 1801 ggagcctgta caagtttatt agactgtcgc caaaatttta cttcacactc gtcacgaaaa 1861 taccacccat tcgcaacaat gccatgttca ttcggcattt tgcctgtcag ataggttgct 1921 tgaactgtgc aggtgacagc aggtaatacc gatgcaattg gaacgacttt cccagaagcc 1981 gcccaacgtg acagcgacgg tgtatgttcc cctaacaggc tgggtgacaa tcctacgacg 2041 ttgagaacaa ctgttttatt catagtcgca aattccaggg agtcggcacg atttcatgtc 2101 agaatgatga tttcaatctt agttctctaa ccaaaactca gattcgattc acgaaaatct 2161 tttttgaaga ctgtttaagg tatactttat tttccttttt aagacagatt taattgaaag 2221 attcgatttc attgcgtgcg cctaacttaa agattagaat gcttgagttt tgagaaaact 2281 cttgttcaag acattgagag taaacatggg tataatatat ctgttttcta tacgagcaga 2341 atagctttta cctggtgtag gagacttgtg aacttcaaac ataatagctt atcttatatc 2401 ttgtacctat ccgtagaaac ctcggtttag tgaaaagcag cgcaataagt tatattactt 2461 ttattcaact tgattgctac attgaaaact ttgagtgttg tactggttca tgaaatgaca 2521 tcaatttttt ttggtgcaag atgtcagtta tttgatttca tcaagtgttt ttgtgttgtg 2581 gcttccaact caaacactta ctagatactg attttgaatg agaagcggta attcaataga 2641 aacattaatt gaatgtttcg ctttcatatc cttttcataa catttttgat aacacccact 2701 catactcccg ttggattgat gctagtaaat ctagtttcat ctcagtcggc agcacttccc 2761 aagtataagt ttcaatttcc aggtggctac aggcatgatt gatttgtagc aagtctaaca 2821 cagtagaaat gtcgtcttga gtagactgga acagctggta atcgtggata aaaatgggga 2881 catggaagtg aatccgccat tcacgatctg tggtataatc caaatctggt aaagcttttt 2941 ccaaatctgt ataatgacgt aatcgtcctg tactttcgcg tgctatgact tggtgtaagt 3001 aggtagactc ggcaaaagga agtaaccgct ttttcactag agagcgctgt tctctatcgt 3061 tgggtatgtt cacttgcaac gcggcactga 
tttgaatctt accaacttga atacctgctg 3121 cttgaaattg cttgaaaacc gaaactgggt tttcatactc tacggcaaag tggcaagtat 3181 cataacaaac acgcacgtgt tctagtatga gagcctcagc tgcttctaaa gaaatcccta 3241 aatgctttgc taaatattct cccccaatcg gcaacaagtg cctctggaaa taatcaacaa 3301 cttcggcagc attttctatt aatccatctg gttctggttc caaatctatg tgcaggagtt 3361 ttcctgtttc ggttcggatg cgtgccattt ctgccacgac ctgtgctaag ttgaggctgg 3421 cgctgcttgt taaggatgcc aggaaaagtt gatttccctt gaaccaaggc ttatatgaca 3481 acggtagtgt agaaatgctg ccttccatat cggctggcaa gagttgtgcc aggatgttta 3541 ccagtcgtag ggtatactcc aatcgctcct gcttagacca gtctggtgca taaacttggt 3601 ctttgacaac ctgccaatga aatccaccga agggaaaacc atttaaggta aatacgtata 3661 agtccagttc tgtcaaccat gactgaaact cagctaaggc gtttccttgc agcagttctt 3721 tggttgcgac atccgccaaa cgcaatccaa taccataggg tttttctggg gctagctgtg 3781 ctttgagtgc aggaatgtat tgctttaagt tggcaaagac tttgctccat tcttcaccag 3841 gatgaatatt cgtgcagtag gttaggtgaa agttgtgatt tgtttctatt ttcatctgta 3901 tttttaatcg cgcctgttat ccttgactca acttgttatg aaagcctgct tagaggtttg 3961 gaaccgcaga tggacacaga tagacgaggc agtgcgttgg ggagccactg ccgtgcgcgg 4021 gttccccgcg ttgaggcatg tggcgtgagc cagtacttga tgagtgtttc cctcacttgg 4081 tatctggtga gaccagcgcg aatgacggct ctccctccgt aggcgactgg cgaacccgaa 4141 gggcgtccgg cgtgccgctc ttgttgcacc tgccgtgcag atagtttatc tgtgtcattt 4201 gtgtccatct gtggtaaaat tcaaaaaatt tcaacagtct cacatcacac attcccaacc 4261 tgaggcaggt aaggaagaat tgtaaaactc gcccaacaac caagttgctt gccgaaatag 4321 agatatatct acttcatgaa cctctatacc tttaccaata ccttgcaata gcgtgagggt 4381 taattcaccg cccaggtgtt cgcgaaactc actcagcccc cggaacaagc aacggggatg 4441 ttccagttct gttatttttt cagacagttc tggtacgtac aatagaaagt ttaatgacaa 4501 caaagtattt aatatccgct gccactctaa ccgagacagc agcccgatta ggtaggaata 4561 ggtactgtct aaggcaatgc cgatcgccac tgcttcccca tgacgtaatc tatggtttgt 4621 cagctgctca agtttatgtg ctgcccagtg accaaaatct aatgggcgag atgaacccac 4681 ttcaaaggga tcaccattgt tagcaatatg atccatgtgc aattgagcac aacggtagac 4741 gacttgttgc atggagttca agtctcgaca gatgagtgca 
tcagcgtggg tactgatgta 4801 ctcaaagaaa tcaggatcct tgatgagcgc gacttttact gcttctgcga ttccacaacg 4861 ccagtcacgg tcatttaggg aagttaagaa gctaaaatca tttaaaactg catagggtgg 4921 agcgaaagtg ccaagaaaat tctttttgcc aaaggcgtta attccgttct tgactcctac 4981 agcagaatca ttttgtgcta acactgttgt aggaatacga atcagacgaa ttcctcggtg 5041 tgctgttgct gctgcatatc caactaagtc taatacagca ccaccaccaa ttcctaatat 5101 ataagagtgg cggcaaaccc ccacttcatc tatgatttgg tgaatttttg agagttcgtt 5161 tggatcgttt ttggctgctt ctcctccttt gatgatgatt ggctcggcag caagtttgag 5221 tgtgtctgga tggcgatcgc aataagcaac gatttgcttg agtagcgtcc gtcgagactg 5281 caacaatcct gaatctacaa ttgctaagac tttttttggt ccaacttccc catcgccagc 5341 aatcacttgt gccagcaggg aatttttcag atcaaacaca ccattagtaa aatgcacccc 5401 gtagtggaaa gtcactgaga aactttgccg gatgggttgc atattgagtg ttgccttttg 5461 cctgatgtct attaccatat ttttgtggat atttctataa gcgcggcacg atttagagct 5521 tgaaggaaca cctcgcttga gtgtgaaccg tcaattttga ctttgttgtt gataacgtaa 5581 aaaggcacgc tagtgatgcc attcaatcga gcaaatgttg actcggctag aaccgcatcg 5641 gcaagagcgt taccgttcaa ttgcagtcgt aatacagtag aattcatgcc atattttgtg 5701 ccgatagcaa caataacgtc aaggtcaccg aggttcaaac cctcttcaaa gtaagccttg 5761 taaactgctt gcacaacatc gtttttgatg tcatctggtg ctaacgcaat caatcgatgg 5821 gaaagtatac tattgacagc cagatgaatc ttgttaaaat ctaatttgac tcctgctgca 5881 tgaccaatat ttcgcacgtg gtcaaatagc ttctctattg cttgcggttc tatgcctttt 5941 ctctgttgca tgtagctacg aaattcgcag ccagcagcag gaatggaatt gtctagaaga 6001 aatgggtgcc accgaatttt cactgtttgt tgatattgtg ccaaagcatc aaaaagatgc 6061 ttatgaccga ttcggcacca cgggcatacg gtatcgtgat agatgtcgat cagcatggtt 6121 ttcgtgagtc aggttgggag aagtaatgct ttcctcagca aaaaagtttt gagcagtttc 6181 aaattcttgg atgattgcac ctcggtcttt ctgtgcgact aaccttgcca agcggttgta 6241 agtatcagct aacttagcaa tggcatgaca tctatcatgc gtagcaagca taatgtccac 6301 acataagtgt gggctttgag taaacaaacg cttgatgatg tctatttctt gacggtagct 6361 tggacttgac atacacaaac tgcgctctat gtcaattttt tcttgtgcta agaaaacacc 6421 aacgctaaat ctggagaagt gccttgttgc ctgaataatc accatcatgt 
agtcgtgttc 6481 ttcaggtgtt gagactatca cttctccacc ctgacttttg ataaagtcta ataaccactc 6541 aaatgaatcg tcgttccgac ctggacaggc tactaccttc tgtccagcga aggatttgac 6601 atttggacca aacatgggat gcaatcccat aaccggacca cgatggtgtt caagcatcgc 6661 tgacatcggc tcagttttga tacttgtgat gtcacaaagt gctgtggttg gtgcaagata 6721 ttgagccgca cgctttataa catctgctgt ccactgaatt ggaacagaga ctatcacaag 6781 ttcagcacaa ctgaggaggt tgtctgcata ttcccagtct tcattttcga gaattcttac 6841 attgtgacct gctgctgtca gttgctgagt aaagaatcta cccattcttc cagcaccacc 6901 tatgatggtg attctccggg gtgttatctg tgtagacgaa gatgaatcaa ctgtggaaag 6961 ggcagcataa caactgttaa ctaaatttac ccgaactgat tcaggaacac cagcttgagc 7021 aagtaaagag ccgagagttg ctagttgttc ttctatagct gaaattttgg atgctgttaa 7081 aagtgtatgg cgatcgctcg tgtggatctg gtgttctacg cgttcagcgt agctctgccg 7141 taggcaatcg ccatcgcccc caagggcaag gcttgctgtg tgagtatcgc gctgcggatc 7201 acacaggctt tgatccgttt gcttgagcaa ttcttgagtc ataataggtt atttacggcg 7261 aatgagcaga tgcaaacaac acaaagacgc tctgaggcga agttgtagag acgttgcatg 7321 caacgtctct acgaaatttc tataatttaa aaacacctcg gaattaatga catgaaccat 7381 tagtgactaa tgattaatga ccagtgacta atgactaatg acttaaatat caatcatcga 7441 ttcgctagag gtcaattttg gtaacatacc taaagaaggt gattgggatt ttgtagcaat 7501 aacttcacta aataaagctt cgtactcagc taaggcttgc tcaaacgaat aatgttctac 7561 agcaaacagc cttccaaagt aacctagctt tgctactttt cttggattgt tatacaaatc 7621 cacgattcct tctgctaacg cttgcggtga ttctggtgca acaactatgc cgccgccact 7681 gttacgcact gcccgtgctg ctgtaccaga gacgggaact gaggcgacaa tgggacgacc 7741 actagccaac aacagtggga ttttggaagg catattgaag gatataacat tgcgcttttg 7801 tactaccaaa cccacatctg ccgctgccag catttcgggt agtttttctc gcggcattag 7861 aggtacaagc aacacgttgt cagccccaca tttttgacaa tatgcctcca atcttttcaa 7921 tgctttttct tctccagcaa tgacaaagac gatttgtggt atatctcgca gactagatgc 7981 agtttcgatg actgtttcta atccttgtgt caaggcaatg ttgccggagt aaagtacaac 8041 aaatttatct tggagttgat gggcgatgcg aaatgagttg ctcttttttg gtaaggggcg 8101 gataaaattc acatttaccc aattgggaat gcaaacaatt ttttgagctg ggactccctt 
8161 gtcaactaaa ttttcaacaa acccatcggc aatcacgcta atagtgtgtg cgctacggta 8221 tgcaaattgc tctatggctt gcaaagcacg aatcatccat tgatttttga tgagtccaat 8281 gcgcacagcg gcttctggta atatatcttg tacatttagt acgactggac acttatgtaa 8341 ccaaccaagt agcgttgctg gtaaggatac caatagtggt ggtactgtca ggagaataac 8401 atccgcttgc caaccgttca gagcttgtgg caaacttgta aacacaaaac tcaactccaa 8461 gagcagtcta tctatcagat ttggtttaga ttttatccgt aaataactac gctggatggt 8521 cacgccgttt ttttgttcgg tcatgtacca cttgccccga tactcttggt atatctcccg 8581 ttggggataa ttaggcattc ctgtaatcac tcgcacttgg tgaccttgct ttaccagtcc 8641 ttccgccagt tcagtcatta aaggggcaat tccgattggc tctggatgat agttgtaaga 8701 ataaataaga atccgcatta tccgtttctt tgttaggtaa ggtcaaatcg acagtgtgag 8761 ttttgttgtt ttcttacaag atgcacggtg agtcgcggtc tataggctaa gctgagtagt 8821 ataaatccag catattccct gatgtttact cctgatatta gctctaagat agaattttaa 8881 cgtgtgcctc aagtaacagc aaatatttgt gacaaagcta ttgaaatagg caatagacag 8941 aggatgagta atccgtatgg caaactggca aaaccagcag caactgtggc gtctaaaaca 9001 atgagtgaca agactcctgc acgcacggct gtgcgaatat tttctggaga tggttgactc 9061 actgctttga taaaaggtag taagacccgc acagccagca atataacgaa tggcaagact 9121 gcaagcaggt gataattctc taacaacgag agtcctaata aagcggcaat aaccatgcca 9181 ataagtataa gggtgatacc accagtacta cttttacctc catgaacttc acctcgacta 9241 agagttgtga tggcagcaat gtaaacaatg ggaatcagcg ctaaaaacca atagtttgat 9301 accattggtg aaactgcgct cacaccaagc atgaggttgc caccgcgaca aagccccatg 9361 ttgagcggac caaaaatagg gttatgcttg ctgtaagcat cgtagaggac tgcagcagct 9421 gcaatcccac aacctagagt ggcactcaac caggagactt gagaagctgc cagtatgcca 9481 gcactcaaaa gcaaacctcc cagaatagat gcaccttgac gggatgctct accgctggga 9541 ataggtcgct caggtctttc tatagcgtct atttccgcat caaaaacatc attaaaaaca 9601 ataccaccgc catataaacc tgtagttgct agaagcaacc atgcaagtga tatcaaatca 9661 gaggaagata gggaaataaa aacaatgcat cctgaagcag cgaacccagc tagaatatct 9721 gcccacgcgg taacgatgtt agcaggtcgc agtaactgga gatatgccca aaggcgagaa 9781 gaatttaagg ttgcagcgtt catagcaagc aggtggagga ggaaaagatt gactttttac 9841 
tctatcaatg catattcacg agttggagac tcaacgacgg gtttttgtcc ccgaagtacc 9901 gagttaccac taaatagctg tcgctggtcg atgggcgatg ggttgagcca gtctgactcg 9961 tgcatttgac cgctttggct gtaagcagta agtgcatttt cataacacac agctcgtata 10021 tgcgcttctg gaattcccct ttctcgcatg agttgagcag tcttaggaac cgcgagtgga 10081 tcgctcacac cccaatcagc actgctgtct acgatgatgc gattgcatcc atactggcgt 10141 acaacctcaa ccatgcgggc attccccatt ttggtattgg ggtaaattgt aaaagctgcc 10201 caaaaacccc gatccaaaac ttcctgcaca gtctcttcgt tgttgtgatc tacaatcact 10261 tttgatggat ctaatccatg ctcgatacag acatccatac tgtgactagt gcctgctttt 10321 ttattgcggt gaggtgtgtg aatcatcacc agcatatcca gttctcttgc gagttctagc 10381 tgcaagcgga agtatttgtc ttcttgcgga gtcatgtcat cgtaaccaat ttcaccaatg 10441 gcaacaactc cctctttgca agcgtatagg ggcaaaagtt ccataacttg ctctgccaat 10501 ggttcattat tagcttcttt ggagttcaaa ccaattgtac agtagtgttt tataccaaat 10561 tgagcagcac ggaatcgctc ccagcccact aagctgctaa agtaatcttg gaatgagcca 10621 acgctcgtac ggggttgtcc aaaccagaac gcaggttcaa tcacagcgac aatgccagag 10681 tcccgcatct tttggtaatc gtcggttgtg cgggaactca tgtgaatgtg aggatctata 10741 tacatcatgg cagcggtgtt gattgttaca aaattagtca taggcttgta gtcagcgctt 10801 gagcgctgac tacgaaatag ttaagcgact ccaagtcaaa tgccctgctt gaatttgggc 10861 ttgtaggttt ggataacggg caaggagttg ttgagcttgt ggtaaaggac acgaggaagt 10921 ggcgatcgcc gctgctgctt gttgcactgg gtcggggtct tgtattgctc gctctaaatc 10981 cgctaacatc gcattgtcgg caaatcgccc caccagtcgc caaagttcgg gagagactgt 11041 gcgctttgct gcccaacgtt catgggcgta gtcaactagc atctgagcta gttctggatt 11101 tgcacgttcc tcaagtcctt gaataagata caaagggctg cctacaaata aggctttcaa 11161 taccatctga ttccaagcaa ggtcacttag atactctgct ggataggaat ttcgcaaagc 11221 gacagcgttg aaaactgctg tcatattact gcgaactccc tcagccgcac gagcgcgaaa 11281 cttttctggg taaggtaaaa gtggtaatgt ttggtaaagt gccactaact ctcccacatc 11341 agcggctgta aaaactcgtt ccaacgtttg taagtacttc tgctcattat cctgtggtaa 11401 actcagtacc agtaagatac gagcagcttg atctacactc caatgactgg gaaaccaacc 11461 tgtccgggca taacaggctg cttctagttc cacttttgtc agttgtaaat 
catttttgcc 11521 tgtataacga ggcacagcgc taaaagcact gaaaaatact aacgtattga cagcggtttt 11581 gatttgctct atcttctgtt ctagccaggt aagagcttga ctattaaggc gttgcttcag 11641 ccaaccatgc agtaagtcat tagcactaat tgttgttttt tccaaataac ttgcgcaaac 11701 cattacttta taagtattta gttatgtaat agagctaaca cttctagaaa tacgttccca 11761 agctacagcc tgggaacgag gctaaacaaa attaaggttg aggaagtaca aaactacgag 11821 tgtctaaaca gcaacttttt ctggttcctc taaattgagg gttgcctttt gctgcttttt 11881 ccgcaatgtg gtagcagtag cgccaattcc caacaagcct aaacctaata gtgaagcagg 11941 ttcaggtact ttggttgcgc catccttaac tacgttcaat acttgaacgc gaccttggaa 12001 gaaatcgcca acataaagct tgccatcttt gagggatgtg ccacctgtcc agttaaatct 12061 gcctggtctg aggtcgagag ggttgccaaa gggaccgtca gttaatcctg gaggcgggac 12121 tgggttgcct gatgcatcca gggctggttc accgaaggaa gtcaagaact taccgttttt 12181 atcgaatacc tgaacgcggc tgttgataga atcagctaca tagacattct cttggtcgtc 12241 cacttcgatg ccaattg // LOCUS NODE_2767_length_12257_cov_4.85510612257 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12257) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12257) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12257 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 92..322 /locus_tag="DP116_22040" /pseudo CDS 92..322 /locus_tag="DP116_22040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182965.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 471..893 /locus_tag="DP116_22045" CDS 471..893 /locus_tag="DP116_22045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome B6" /protein_id="PRJNA477356:DP116_22045" /translation="MKRRHFFTWIGIGSLISSLFVKVRTSEPNTTAISSASGDWQRVG TVAELDKTGQLLNEKSPVGSVLVVGTSKSKNLIAVNPTCTHMGCTVEWLNKERMFLCP CHASEFKLDGKVQMGPATKPLSTYTTKIEGNFVMVKRI" gene 1028..1894 /locus_tag="DP116_22050" CDS 1028..1894 /locus_tag="DP116_22050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194538.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_22050" /translation="MQKPKSIEIPKATRYQNAAIDYYMGVTGSSYLHYGYWETLPHSG EELTLTSLRAAQEAYTAKLFSFIPKGISTVLDVGCGIGGNAKYFLERGFSVEGLAPDA LQQEKFLKNTNNQVPFYLTRFEDFQPTHSYDLILFSESSQYIAVDDLAQGAARLLSSG GYLLIADMMRSDPEYQEGIFSNCHVTSVLQEALERAGFTLIKAEDISNSVAPTIDLSL DYFRTFGLTTMKYISDVVAIAVPPLHALGRWAFKRWLDKPIVEGLAARTIFERHLCYS IQLWQLSIHKLN" gene 2222..3016 /locus_tag="DP116_22055" CDS 2222..3016 /locus_tag="DP116_22055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PhzF family phenazine biosynthesis protein" /protein_id="PRJNA477356:DP116_22055" /translation="MGQTITQVDAFSNTPFAGNPAAVCILPTPQSEDWMQKVAQEMNL SETAFLVRQEDGFHLRWFTPTTEVPLCGHATLASAHVLWSQGHLLPDEVARFHTKSGV LIAKKHGNWIELDFPVNSSEQITAPPELGEALGVPMRTVMKNSLGYLVEVESEDLVRQ IQPNFELLKALPLANVIVTSLTQEDEEYDFVSRFFAPGVGINEDPVTGAAHCCLAPFW RDRLGKDEFLAYQASSRGGVLKVRYEGGSRVYISGQAVTVLRGELI" gene complement(3094..3465) /locus_tag="DP116_22060" CDS complement(3094..3465) /locus_tag="DP116_22060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22060" /translation="MADVNGTWLGTYWQQGIPTRFEVTLIQSGNTLTGRILDDSNLGE AQLTGEVVGRRISFIKRYFTTSPDPITYIGTISENEDYMQGQWSIKLWESGPWEARRS GDALLADLQTRIEKKVSLTGA" gene complement(3530..3898) /locus_tag="DP116_22065" CDS complement(3530..3898) /locus_tag="DP116_22065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323208.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_22065" /translation="MDANELKQRYVAGERDFPAVKLVRAKLIQAMLAGVNLFAADLSG ANLAKAKLWGANLGAANLAGANLTRANLSGANLHEANLRGAKLNFAKLYGANLSGACY DDSTRFSRGFDPVSRNMRKL" gene complement(3880..4203) /locus_tag="DP116_22070" CDS complement(3880..4203) /locus_tag="DP116_22070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22070" /translation="MTSAVTFKQLIEKNPKCLESLPKALQGLWYDKKGEWDTAHEIVQ DASDADSAWVHAYLHRKEGDLINARYWYRRSGQPESKVELDQEWEQIASSLLSTVNQA WMQMN" gene complement(4405..4602) /locus_tag="DP116_22075" CDS complement(4405..4602) /locus_tag="DP116_22075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318414.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22075" /translation="MFPFPFPFWDNNCYGSEPVSRKTRLERKLKFLKTMRDDLDTRLA GLNAAISNVEQQLNQENVTQV" gene 5466..6701 /locus_tag="DP116_22080" CDS 5466..6701 /locus_tag="DP116_22080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015116325.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="3-dehydroquinate synthase" /protein_id="PRJNA477356:DP116_22080" /translation="MSTIQAKFSATEDAFHVEAYEKIEYSLIYVDGVFAIKNPQLAEA YKKFGRCLVVVDANVNKHYGSQIEQYFKYYDIDLTVFPITITEPNKTIESFEKIIDAL AQFKLVRKEPVLVVGGGLITDVAGFACSAYQRSSNYIRIPTTLIGLIDASVAIKVAVN HKKLKNRLGAYHASQNVFLDFSFLGTLPTAQVRNGMSELVKIAVVNNKEVFDLLDKYG EELLSTHFGNIDATPEIKDVAHRVTYESIKSMLNLEVNNLHELDLDRVIAFGHTWSPT LELAPRVPIFHGHAVNIDMAFSVTLAARRGYITTQERDRILSLMSRLGLALDHPLLDE ELAWRATQSITCTRGGLLRAATPRPIGNCFFVNDLTREELAAALSEHKHLCESYPRGG DGVELYPDAYNPELVGSEA" gene 6701..7531 /locus_tag="DP116_22085" CDS 6701..7531 /locus_tag="DP116_22085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876764.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_22085" /translation="MSQVVEKPTARPVTPLGILAKQLETILQTLHQQTSDELQANLNQ AWLLAAGLDPYLEECTTGESPALAALAQKTAHEAWNQKFHEGATVRELEQEMLSGHVE GQTLKMFIHMTKAKKVLEVGMFTGYSALAMAEALPPDGELVACEVDPYTAEFGQTAFQ QSPHGAKIRVEVGAALDTLQKLADARESFDFVFIDADKKEYVKYFQILLEKDLLVPGG FICVDNTLLQGQVYLPEENRTLNGEAIAQFNHIVAADSRVEQVLLPLRDGLTIIRRLP " gene 7896..8243 /locus_tag="DP116_22090" /pseudo CDS 7896..8243 /locus_tag="DP116_22090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197687.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="D-alanine--D-alanine ligase" gene 8396..9478 /locus_tag="DP116_22095" CDS 8396..9478 /locus_tag="DP116_22095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196876.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class I fructose-bisphosphate aldolase" /protein_id="PRJNA477356:DP116_22095" /translation="MTTTLSAPHSIESLLGKEAEDLLTYKAKVSKDLLHLPGPDFVDR VWLNSDRNPQVLRNLQTLYSTGRLGNTGYVSILPVDQGIEHSAGASFAPNPMYFDPEN IVKLAIAAGCNAVATTLGVLGSVSRKYAHKIPFIVKLNHNELLTFPNQFDQVLFASVE QAWNLGAVAVGATIYFGSEQSTRQIQEISQAFKRAHDLGMVTILWCYLRNNAFKEDKD YHLAADLTGQANHLGVTIEADIIKQKLPENNNGYGAVAKATGKSYGKTNERVYTDLTS DHPIDLTRYQVLNCYCGRAGLINSGGASGKSDFAEAVRTAVINKRAGGTGLISGRKTF QRPFEEGVKLFHAIQDVYLSEAVAIA" gene complement(9609..9827) /locus_tag="DP116_22100" CDS complement(9609..9827) /locus_tag="DP116_22100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015136695.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22100" /translation="MLQTIEGIYRNGKIELTEMPQGITESRVFVTFLETKSTTWSEMI MQHQGVAESIIFESYRDELLSPNSLSSF" gene 9827..10033 /locus_tag="DP116_22105" CDS 9827..10033 /locus_tag="DP116_22105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317124.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22105" /translation="MQFTFQQTQGCRGKTSQKYQVICRSGEEVGNDQSQRAFGARYYD RNPHERSPIEQEFARNKAQRGDDD" gene 10997..12163 /locus_tag="DP116_22110" CDS 10997..12163 /locus_tag="DP116_22110" /EC_number="2.6.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952110.1" /note="catalyzes the formation of oxaloacetate and L-glutamate from L-aspartate and 2-oxoglutarate; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aspartate transaminase" /protein_id="PRJNA477356:DP116_22110" /translation="MKLAQRVSQVAPSITLAIAAKAKAMKAEGIDVCSFSAGEPDFDS PAHVKAAAQKALDEGKTKYGPASGEPKLREAIARKLKTDNGLDYKAENVIVTNGGKHS LFNLILALIEPGDEVIIPAPYWLSYPEMVTLAGGVSVIVPTDVSTGYKITPEQLRKSI TPKTKLFILNSPSNPTGMVYTPEEIEAIAQIIVETDILVVSDEIYEKILYDGAKHVSI GSLGPEIFERTIISNGFAKAYSMTGWRIGYLAGPIDLIKATITIQSHSVSNVCTFAQY GAIAALEDSQDCVEEMRQAFAKRREVMYERLNAIPGLTCPKPDGAFYLFPDIRKTGLK SLDFSNRLLEKEQVAVIPGIAFAGDDNIRLSYATDMATIEKGMDRFEKFVKSLI" BASE COUNT 3478 a 2592 c 2642 g 3545 t ORIGIN 1 agctccttct tttcgtgtcc tcaataacgt tttcagcctt aactgaaccg tattgtacta 61 taaggtatac aggctatcaa cataagaagg tcaaaaagct tgtgtcaaac atccaaaact 121 cggtaaatat ttttgcgcta atgctaaaca attgtctcat ttgaagaaag gcgcaccaga 181 tattgacgct accgacgcga ataacgaaaa ggctctgctt ggggtgacag atgcagaaag 241 cgatgctgct gtatctatgt ttggctgtga ttgcgttgca tcgataaatt gcttgcgtcg 301 cctgcgcaat agccttcctt agaattgatt ttccttgatt tttaagatta gaaaaagcaa 361 gggagcaagg 
gaagagattt tttcttttcc cttactcttt gacagcttgt tatcagacat 421 tgtattgttc gagagattgt ctaaatattg ttctacagaa tttcaataaa atgaaacgtc 481 gtcatttttt tacttggatt ggtataggtt ctttgataag tagcctattt gtgaaagtta 541 gaacttctga gccaaacact acagcgatat catcagcatc tggagattgg caaagggtgg 601 ggacggtagc tgagttggat aaaactggtc agttattaaa tgaaaaatca cctgttggtt 661 cagttttagt agttggtact tctaaaagta aaaatttgat agctgttaat cccacctgca 721 ctcatatggg ttgtacagta gaatggctga ataaggaaag aatgttttta tgtccttgtc 781 acgcttctga atttaaactt gatggcaagg tgcaaatggg tccagccaca aaaccactct 841 caacttacac gacaaagata gagggtaatt tcgtgatggt gaaacgcatt tagtttggga 901 tttgtcaagt gtaaaagagc acttaacgag aaagtaaagt ttggggaact gacacggaac 961 atgcttgcgc gatcactcgt attattaaaa agaaatgtgc agcagtatcc accaacttta 1021 accctaaatg cagaaaccaa aatcaattga aattccaaaa gcaactcgct accaaaatgc 1081 agcgatagat tactatatgg gagtcacagg ttcctcctat ctccattatg ggtattggga 1141 aacattacct cacagtggtg aggaattgac tctaacttct ctgcgtgcag ctcaagaagc 1201 ttatacagcc aaactgttca gttttatccc aaagggaata agcaccgtgc tggacgtcgg 1261 ctgtggtatt ggtggtaatg cgaaatactt tttggaacgc ggtttcagtg ttgagggatt 1321 agcacccgac gcactccaac aagaaaagtt tcttaagaat accaacaatc aagtaccttt 1381 ctacttaacg agatttgaag attttcaacc aacccactcc tacgatctca tcctgttcag 1441 cgaaagcagc caatatattg ctgttgacga tttggctcag ggtgcggctc gtttactgag 1501 tagtggcggc tacttgctga ttgccgatat gatgcgctct gatcccgaat accaggaggg 1561 tatcttttcc aattgtcatg tcacaagtgt tcttcaggaa gccttggaac gggctgggtt 1621 cactttaatc aaagcagagg acatctcaaa ctcggttgca ccaacgattg acttgtcttt 1681 ggattatttc cgtacttttg ggctgactac gatgaaatat attagtgatg ttgtggcgat 1741 cgctgtccca ccattacatg cattgggacg ttgggcattt aagcgctggc tggacaaacc 1801 aattgtcgag gggttagcag cacgtacaat ttttgaacgc catctgtgtt attccatcca 1861 actctggcag ttatcaattc ataaattgaa ttaaaatgac tgcagaaaca aacacccaag 1921 attggcaaca aactttcatt agcaattggt tacaatgctt gggttggtta tttagcagca 1981 tcccactggc ggcgcgccac ccgctatgaa tgaaaaaaac actctttccc tgttccctgt 2041 tccctgttcc ctgttcccta ttaagcgtgt 
ttcttcagag gagaacgtgg tgttgtcatt 2101 tgcagcacca attaagcgca tttaaatatt gctaacgtat caccgctatc gttactttgc 2161 ccgaaaatgt ctgggcaaat gtcgcgtcaa tgtatcagta tatttaaaac aaattcgaga 2221 tatgggacaa actattactc aagtagacgc atttagcaac acaccttttg caggtaaccc 2281 cgctgcagtc tgcattttac ctactcccca gtctgaagac tggatgcaaa aggtggcgca 2341 agaaatgaat ttatctgaga ctgcgttttt ggtgagacag gaagatggtt ttcatctgcg 2401 ctggttcacc cctacaacag aagtgccact gtgtggtcat gctaccctag ctagcgctca 2461 tgttctttgg tcacaggggc atcttttgcc agatgaagtc gctcgtttcc acaccaaaag 2521 cggagtgttg attgctaaaa agcacggaaa ttggattgaa ctcgattttc ctgtaaattc 2581 ttcagaacaa ataaccgctc ctcctgaact cggtgaagct ttgggtgttc cgatgagaac 2641 ggtgatgaaa aattccttgg gttatctggt agaggtggag tctgaagatt tagtgcgaca 2701 aatacagccg aacttcgagc tactgaaagc gttacccctt gccaatgtta ttgttactag 2761 cctaacacag gaagacgaag aatacgattt tgtctctcgc ttttttgcac caggagtggg 2821 aattaatgaa gatcctgtga ctggggctgc tcattgttgt ctggctccgt tttggcgcga 2881 tcgccttggt aaagatgagt tcttagcata tcaagcatct agtcggggcg gtgtattgaa 2941 ggtgcgttat gagggtggtt ctcgtgtcta tataagtgga caagcagtta cggttctgcg 3001 tggagaatta atttgaaaat gctattcgcc actccctcac tccctcattc cctgtctcaa 3061 atttagttta gataccggaa aattcctgtc aggttatgca cccgttaagg aaaccttttt 3121 ttctatacga gtttgtaaat ctgccaaaag cgcatcccca ctgcgtcggg cttcccaagg 3181 accagactcc caaagtttta tactccactg tccttgcata taatcttcat tttctgagat 3241 tgtgccaatg taggtaattg ggtcgggtga agttgtgaag tagcgcttga taaagctgat 3301 acgacgccca actacctcac cggtaagctg ggcttctccc aagttgctgt catccaaaat 3361 cctaccagtc aatgtgttac cactttgaat caatgtgact tcaaaacggg taggaattcc 3421 ttgttgccag taagttccaa gccaagtgcc atttacatca gccatagcca gtttttcgca 3481 acttatctgt gttatcttta aactataaaa tcggattgct atatcttggt caaagttttc 3541 gcatatttct actgacgggg tcaaaacctc gggaaaagcg cgtactatcg tcgtagcaag 3601 ctccactcaa atttgctcca tagagcttgg caaagtttag cttcgctccc ctaagattag 3661 cttcatgcaa gttggcaccg cttaggttag ctcgcgtcaa gttggcacca gctaagttag 3721 cagctccaag gtttgctccc cacagtttag ctttagctag 
gtttgctcca cttaaatccg 3781 ctgcaaataa attgactcca gcgagcattg cttgaattaa cttggctcta acaagcttca 3841 ctgctggaaa atctctctct cctgctacat aacgctgctt cagttcattt gcatccatgc 3901 ttggttcacc gtgctcagca aactggaagc tatctgctcc cattcctggt ctaattctac 3961 ttttgactct ggctgaccgc tacgtctata ccaataacga gcattgatca aatcaccctc 4021 tttgcggtgc aggtaggcgt gaacccaagc actatcagca tcgctagcat cttgtactat 4081 ttcgtgtgca gtatcccact cacctttttt atcgtaccaa aggccctgca acgcttttgg 4141 cagtgactca agacattttg gatttttttc tattaattgc ttgaatgtta ccgcactcgt 4201 cataaatcca cctagaaaat atcaagcctc tatttctagg atacaaattc taggctacaa 4261 ataagttctg actgacgaaa gaattttcat acataccaca gtagctatct acccttttaa 4321 gtagatgttc cctagcaaag ctagggaacg agacaactag cttatgtgtg agcaacgata 4381 tctcgcttga gcaccacaac tctattagac ttgagtgaca ttttcctgat ttaattgctg 4441 ctcgacatta ctgatagcag cgtttagacc agccagtcta gtgtctaaat catctcgcat 4501 agttttgagg aatttcagct tacgctctaa gcgagttttg cgagacacgg gttcgctacc 4561 gtagcagttg ttgtcccaaa aaggaaaagg aaacggaaac atagtttgct tttatagaaa 4621 cgaccttatt taaattgtgg ctaaatccta ggcgagataa cggataagag agtgggggat 4681 aaccgactat cgtgagtggg ggttgagctt ccggtagagt tgggttactc atatagatgc 4741 atattattaa ctctttgtct actgttaata agacgcgata tttgttcata aaagtaccaa 4801 ataaaggtct catagcactt atttctctag cttgactaag gtgagctatt ttatttcatt 4861 ctttagatag aagctttctt aaaccgaggc attataaaaa gttattgtac tttgtccagt 4921 gaagagttta ttgctttgta ctgccaagga aaaactacca aacttaaaag tgtaaatcag 4981 ttgctagtaa taacgggttg tattttttta tcaagacttc gatttgattt aggcaatgtg 5041 ttgtcggtat tccagactca aattctcaga ggaagaataa aatctaataa aattataaaa 5101 cttatttttt ttttataaaa tgtgataatt tgagaatata agtcaaaata cggtgatcat 5161 tcgatagcta tcaaccctgg gagacattag gatttttctg caaaaagtgg aaaattaaca 5221 tatgtcagct aattttgctg aaaagtgagc taattcaatt aaaattcctt actaaagtat 5281 gctcaaaacc ttttaagacg ggggttcttt atttgtttta gttttttgac ctccagtctt 5341 cttgtccatt gggcactttg cttatggata gtggtgtagt tattgggtgt actgaggtaa 5401 atactgtctt gcgaagagtc tctcattttt ttgagggata ttgtcaacat 
tgaaggtcaa 5461 aaataatgag taccatccaa gcaaagtttt cagcgactga agatgctttt cacgtagaag 5521 cttacgaaaa aattgagtac agcctcattt atgttgatgg ggtttttgca atcaaaaatc 5581 cacaactggc agaagcctat aaaaaatttg gacgttgctt ggttgttgta gatgctaatg 5641 tgaataagca ttacggcagt caaatcgagc agtatttcaa atattacgat attgacctga 5701 cggtttttcc gatcacgatc actgagccaa acaagactat tgagtctttc gagaagatta 5761 tcgatgcttt ggcccaattc aagctggttc gcaaagaacc agtgctagtg gttggtggtg 5821 gactgattac agatgttgca ggttttgcct gttctgctta ccagcgcagt agcaactaca 5881 ttcgtattcc tactactctg attggtttga ttgatgcgag tgttgctatc aaggtagcag 5941 ttaaccacaa gaagctgaaa aaccgccttg gtgcttatca cgcttcgcaa aacgttttcc 6001 tcgatttctc gtttttgggt actttgccaa cagcacaggt tcgcaatgga atgtcagaac 6061 ttgtgaaaat tgcagtggtt aacaacaaag aggtttttga tttgttggat aagtatggcg 6121 aagaactgct ttccacacac tttggcaaca ttgacgcaac accagaaatt aaggatgtag 6181 cgcatcgcgt cacttacgag tctattaaaa gcatgctaaa tttggaggtt aacaacctac 6241 acgagttaga cctagatagg gttattgctt ttggtcatac ttggagtccg actcttgaac 6301 ttgcgccccg cgtacctatt ttccacggtc atgcagtcaa tatagatatg gctttttctg 6361 tcacgcttgc agcacgacga ggctatatta ccacacaaga acgcgatcgc attctcagtc 6421 tcatgagccg tctcggtctt gctctcgacc atcctctctt agacgaagaa cttgcatggc 6481 gtgctaccca atccatcacc tgcacacgag gcggtctact acgagctgct acgcctagac 6541 caattggcaa ttgcttcttt gtcaatgatt taactcgtga ggaactggct gcagcattat 6601 cagaacataa gcatctgtgc gaaagctacc ctcgtggtgg cgatggcgta gaactctacc 6661 cagatgctta caacccagaa cttgtcggga gtgaagccta atgtcgcaag ttgtcgaaaa 6721 gccaaccgct agacctgtta caccattggg gattttagct aagcagctag aaacgatttt 6781 gcagacactt catcaacaga catctgatga gctacaagct aacctcaatc aagcatggct 6841 tttggcagca ggtttagacc cctatttaga agaatgtacc actggtgaat caccagccct 6901 cgcagcgcta gctcaaaaaa cggctcatga agcttggaac caaaagttcc acgagggagc 6961 gacagtacga gaattagagc aggaaatgct ttctgggcat gtggaaggac aaactctcaa 7021 gatgtttatc cacatgacaa aagctaagaa agtgctggaa gtaggtatgt tcactggcta 7081 ttctgcactc gcaatggcgg aagctttacc gcctgatgga gaactggtcg cttgtgaagt 
7141 tgacccttac accgcagagt ttgggcaaac tgctttccaa caatctcccc acggtgcgaa 7201 aattcgtgtg gaagtgggtg cagcactgga cactttgcag aaactggcag atgcaaggga 7261 gtcgtttgat tttgtgttca tagatgcgga taaaaaggag tatgtcaagt acttccaaat 7321 cctgctggag aaagatttgc tcgttcctgg agggtttatc tgcgtagata acactctact 7381 tcaagggcaa gtctatcttc ctgaagaaaa tcgcactctc aacggcgagg ctattgctca 7441 gtttaaccac atcgtcgctg ccgattcccg tgtggaacaa gtattgttgc cactgcgaga 7501 tggcttaacc attatccgac gattaccgta gttgtcatac atttagcgtg cacttttatt 7561 gtgcacagag aactcagaat tcagaattat cgaaaaccgc acttatattg ttcccttttt 7621 cttacttcat agatctctga gttctgagtt cttcaacaac tctcgtccat cacctccttc 7681 tcctacagcg ctggtaggcg aaggagagaa tatgaatttc acatattatt ttttgttcgc 7741 gcagagtgcc cgtcgggcat atttttcacg ctcgcgaatg cgtgaatgtt gagaagcgaa 7801 agttatgaca ctaccagaca attccagaca gttgaattcg tagtcttgcc gtttttttca 7861 atggcgaaac tacgaatctc aaatcacgaa ttcttatggc acaatctctt tcgttatcat 7921 ctaatagagc agacaatgat ttttactgcg atttgtcacg cctttacgcc gaaggctgtc 7981 tgacagcaac ggcagatcca tcgcgctatg actttgtgat tgcatacatc acaccagatc 8041 gccagtggcg atttcctcgg tcccttagtc gagaagatat tgctgtcgcc aaaccgattc 8101 ctgtgtttga tgctatacag tttataacag cgcaaaacat tgacctgatg ttgccacaaa 8161 tgttttgtat ccctggaatg actcactacc gcgcgctatt cgacctgctt aagatccctt 8221 acgtaggcaa tactccggat gtcaatgagt gtttattaat aaatcaatca aaagtagtat 8281 ataaatttct gcttagatta ggcatataaa cttaatgctt ttataaccaa gtgaaaaata 8341 gtacgaatgc tatattaaag gcaaagctat gtgctgacgt tttgaggaaa aggcaatgac 8401 gacaacacta tctgcgcctc attctatcga gtccttgcta ggcaaagaag cagaagactt 8461 gctcacctat aaggcaaaag tttctaagga tttactacat ttgccaggac cagattttgt 8521 ggatcgagtt tggttgaaca gcgatcgcaa cccccaagtc ttgcgtaatc tccaaacact 8581 atactccaca ggacggctgg gaaatactgg gtatgtctct attctccctg tagaccaagg 8641 gattgaacac tcagccggtg cgtcatttgc gcccaatccc atgtactttg atcccgaaaa 8701 tattgttaag ctggcaatag cagcaggctg caacgctgtt gcgacgacat tgggagtttt 8761 aggttcagtt tcacgcaaat acgctcacaa aattcccttt atagtcaaac tcaaccacaa 8821 
cgaactgcta actttcccca atcaatttga ccaagttttg tttgcttcag tcgaacaagc 8881 ttggaatttg ggtgcagttg ctgtgggtgc gacaatttat tttggttcag aacaatctac 8941 caggcaaatt caggaaatca gccaagcttt caaacgcgcc catgatttgg ggatggtgac 9001 gattctttgg tgctatctgc gaaacaacgc tttcaaggaa gacaaagatt atcaccttgc 9061 agctgacctg acaggacaag cgaatcatct cggtgtgaca attgaagctg acattattaa 9121 acaaaagttg cctgaaaata acaacggtta tggagctgta gcaaaggcga ctggtaagag 9181 ttacggtaaa actaatgagc gggtttacac agatttgaca agcgaccacc caatcgattt 9241 aactcgttac caggtactca attgttattg tggacgtgca ggattgatta actctggtgg 9301 cgcgtctgga aaaagtgact ttgcagaagc tgttcggact gcggtaatta acaaacgcgc 9361 tggtggaaca ggactcattt ctgggcgcaa aacattccaa cgtccgtttg aggaaggagt 9421 gaaattgttt cacgcgattc aggatgttta cttgtctgag gcggtggcga tcgcctaggg 9481 cagagctacg cttaacgcct agggcagagc tacgcttaac gcctaaacga ctctcagtcc 9541 gcctgcgatt caaatcgcag gctaatagcc aaagtcctct caagaagact aaataaatat 9601 gaagtccatt aaaatgaact taggctatta ggtgatagca gttcatctcg atacgattca 9661 aaaatgatac tttctgctac accttgatgt tgcataatca tttccgacca agtagtggat 9721 ttcgtttcta ggaaagtaac aaagacgcgg ctttcagtaa taccttgcgg catttcagtg 9781 agttcaattt tgccgtttct atatattcct tcaatagttt gtagcattgc agtttacctt 9841 tcaacaaaca caggggtgta ggggcaaaac aagtcaaaag tatcaagtca tatgtcggtc 9901 tggcgaagaa gtaggaaatg accagtctca acgagctttt ggggcaagat actatgatcg 9961 taacccacat gagcgtagcc caattgaaca agaatttgct cgtaacaaag cgcagcgtgg 10021 agatgatgac taagcaattg agaatgtgaa atgtaaaaaa cttgacaacc ataggtcata 10081 cattctatgt ttgagggagt gtgttgaaac tcatcgacac cagtcccccg atccttgaaa 10141 accgcataat tccgtttagt ggtgtcgatt gctcttgtag taagggtttc agcctacaag 10201 atgaagagtt tattgcaaat aatcttgccc ttattgataa ttaaaaaagt ggtgtcgatt 10261 tggcgtccca aatcccctcc cagtaaggct ccctaagcga actctttcca ttacctcttc 10321 ccctaacggg gatggaaact actacttgtc ttgtatacta actgttaatt agtactgtct 10381 ttccattacc tcttccccta acggggatgg aaacattgga gtgggctcgt caaactgtcc 10441 atactgtgac tctttccata acctcttccc ctaacgggga tggaaaccta atatttctat 10501 
taggttaaaa agaatattac aagtttcttt ccataacctc ttcccctaac ggggatgaaa 10561 taccaaatct ttgtatcaaa ttccgaatta ttgaatgagc gagaaaattt ccgatgggaa 10621 aactaaatct gaggcttaaa caccttgcga tttttagttc ctacagttag ttgctcaaga 10681 attcccaagt catcccccac acttaataat gccgcttaag ttggtaggga tgctgggatt 10741 tttctatttg tgggggttgg tcgttgagta gaaaattacg tagagaaaaa atcacaaaaa 10801 aacacattta ctctctgtat ccaccatcca cagttcatct gagagataac aaacagcgaa 10861 gcgattgcct acggcagagc tacgcttaac gccccatcaa ttttattctt tcaacaaagg 10921 aattgttaag agacaaatat caatcttatt ttctgagata ttgtaaagta agtcagaagt 10981 ttgccaaacg agtgacatga agctggcgca acgagtaagt caagtagcac cttctataac 11041 cttagctatt gcagccaagg ctaaggctat gaaggcagaa ggaattgatg tttgtagttt 11101 tagcgctgga gaaccggact ttgatagtcc agcgcacgtc aaagctgctg cacagaaggc 11161 tttggacgaa ggtaaaacta agtatggacc tgcatctggg gaaccaaagt taagagaagc 11221 cattgcccgc aagttaaaaa ctgacaatgg tcttgattac aaagcagaaa atgttatcgt 11281 taccaacggt gggaaacatt ctctgtttaa cttaatactg gcgctgattg aaccaggaga 11341 tgaagtcatc atccccgcgc cctactggtt aagttatccg gaaatggtga cgcttgcggg 11401 tggagtctct gtgattgtcc ccacagatgt ttcgacaggt tacaaaatta caccagaaca 11461 gttgcgtaag tcgattacgc ctaagacaaa gctttttatt ctcaactcgc catctaatcc 11521 tactggtatg gtttacacac cagaggaaat tgaggcgatc gcccaaatta tcgttgagac 11581 tgacattctt gtcgtttccg acgaaattta tgagaagatt ctctacgatg gtgccaaaca 11641 cgtcagcatt ggttccctag gaccagagat ttttgagcga actattatca gcaatgggtt 11701 tgccaaagct tactccatga caggatggcg cattggatat ttggcaggac ctattgattt 11761 gattaaagca acaattacta tccaaagcca tagtgtgtcg aatgtttgta cgtttgctca 11821 atatggggcg atcgcagctt tggaagattc ccaagattgt gtggaagaaa tgcgtcaagc 11881 tttcgccaaa cgtcgagaag tcatgtatga aagactcaac gctattcctg ggctgacttg 11941 tccaaaacca gacggcgctt tctacttatt tcccgacatt cgcaaaacag gactaaaatc 12001 tctggatttt tctaacaggt tgctagaaaa agagcaagta gcagttattc ctggaatcgc 12061 ctttgctggt gatgacaata ttcgcctttc ctatgccacc gatatggcaa caattgaaaa 12121 gggaatggat agatttgaaa aatttgtcaa atcacttatt tagcgatacc 
gttcagttaa 12181 ggattgatcc tccctagccc tccttaaaaa ggagggaatt aagccccctt tttaaggggg 12241 tcgccgtagg cgggggg // LOCUS NODE_2773_length_12233_cov_5.467647 12233 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12233) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12233) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12233 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 240..1946 /locus_tag="DP116_22115" CDS 240..1946 /locus_tag="DP116_22115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875948.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA mismatch repair endonuclease MutL" /protein_id="PRJNA477356:DP116_22115" /translation="MASTIYALPAEVVHLITAGEVIDSLASVVRELVENSLDAGATRI VVSLWPGQWRVRVADNGCGMTLDDLQKAATAHSTSKICSCADLWKITSLGFRGEALHS LTTLADLEILSRPLGGDCGWRVVYGYAGEALYVEACAIAPGTVVTVSHLFGNCSGRRE GLPTLAQQIKAVQATIHQIALCHPHITWQVWQNDREWFSLCPATSIGQLLPQILHQVK EADLQEVKLEIPNPENSEFGIQKSGLHLVIGLPDRCHRHRPDWIRVAVNGRMVKSSEL EQTILSGFYRTLPRDRYPITFLHLLISPEHINWNRNPAKTEIYLNQLSYWQEQITQAI AQALRINSTSVKDAVHTTRVGKLLKAAEEKGGYNVNRSLTCEEHKTHPEMLLATSLQL KAVAQVNNTYIVAEHPGGVWLVEQHIAHERVLYEQLCDNWKIIPIEPPVILYQLSPAQ VSQLQRIDLDIEPFGEQLWAVRTVPTLLQQRDDCADAILELSWGGDLQAAQVAVACRS AIRNGTTLSLQEMQTLLDEWQRTRNPRTCPHGRPIYLSLEESALARFFKRHWVIGKSH GI" gene 2685..3413 /locus_tag="DP116_22120" CDS 2685..3413 /locus_tag="DP116_22120" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22120" /translation="MRNKLFHRLIATSVLITVSIPNITLAGDHQPQTWIQSCDKAFIE LNNHPFDNHEQSRLAKTVTNQYENLKKELKLDATLAENAPNKYERKYLVTLLQECDQS LPSEVQNFSTNTLPQEDSIRIDYSVRIDNNLFRSFKGSIKKLISSIGVDPDTINFAHI RRYFQRTITRYQDAIKQFQDAIKQFQEVRVPIKTILANVGFVFSLVGSAFIASLVSYP KSSRENTTESESSRSIPIKKSSKN" gene complement(3764..5806) /locus_tag="DP116_22125" CDS complement(3764..5806) /locus_tag="DP116_22125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197049.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="PRJNA477356:DP116_22125" /translation="MPYTGSANFVGREYELSLLKKKLQKPGTVAISAVAGMGGVGKTE LATQYARQHEADYPGGICWLNARDSNLAAEIVQFTQLYMNLKVPQVLQGRQLSPEQQV EWCWQNWRPTEGLVLVVLDGITNLNNCQQLIPKGNRFRVLMTTRLRNLDTNIEEISLD VLSPEEALQLLTKLVGEKWEKRVQKEEETAKQLCEWLGYLPLGLELVGGYLAKKLHHW TLAKMLQRLKQQRLQDEAINRHKQQLQKTFSTAQLGVLDAFELSWLELDPTTQVVGEL LSLFAPDIFAWEWVDSATSRLNWDATEVETAVKQLYERHLIQWVEDKSGDRDDCYKIH PLIREFLKVKQAASEQADELKRAFAETMVAITQKFPEPITQQSINSVKDAIPHLAEVA KTLTDAVSDKNLILLFVSLGKFYKGQGLYASAQEWYEQCVPLIESRLGKQDPDFATSL NNLANLYSLQGRYNKAEPLYQQALALRGRVLGEEHPDFATSLNNLANLYFFQGRYEEA EALYKQALELTGRVLEGEHPDFVTSLNNLATSLNNLANLYFFQERYDKAEPLYNQALE LTGMVLGEKHPDFATSLNSLANLYFFQRRYDEAELRYNEALEIRKRVLGKEHPDFATS LSSLAKLYESQERYDEAELLYNQALEITTSVLGKEHHMTVTVDKSLESLRNKKSKS" gene complement(5855..6289) /locus_tag="DP116_22130" CDS complement(5855..6289) /locus_tag="DP116_22130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002747830.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22130" /translation="MEPLTAGAIAIGTVIATKALEKTGEKVGETLWNKTGEFLVTLKK HSPHTVVAIEKVLSQPLDYGKAVLEVESAAKANPEVNQAVQELVAQAEANPPLNLAQV LQDIAQALKSQSPQNQTWISTIEKIVNFAQRDINIQNQNISI" gene complement(6488..7816) /gene="nifE" /locus_tag="DP116_22135" CDS complement(6488..7816) /gene="nifE" /locus_tag="DP116_22135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase iron-molybdenum cofactor biosynthesis protein NifE" /protein_id="PRJNA477356:DP116_22135" /translation="MKSEDVFNKNPDYKTQNRESVLREGQEDCAFDGAMMTLVPITDA AHLIHGPSGCINNCWENSSSLSFGTMLYKARFTTDMDEKDIIFGGAKKLYNAILELQR RYKPAAVFVYSTCVSALIGDDLNGACAEAAVQTGTPIIPVDCPGFVGRKNQGIRIAGE ALLEHVVGTAEPDFTTPYDINLIGESNMADAMWNIIPLFDKLGIRVLCKITGDTRYKE VCYAHRAKLNVITSSKALFKMAKKMEERFGIPYIQEFFYGIENINQGLRNIAAKLGDS DLQKRTEKFIAFETSALEAKLAFYRTYVQGKRIVIDIQDVKSWSIICAAQKLGIEVIP ISTVKSSQEDRAKIQKLLGKDSIIIQQNSPEEILQIIHENKADMLITGERYQYASVKA KIPLLDIKAELHHSYAGYTGILEVAQKLYATLTSPIWKQVRKPAPWENRC" gene complement(8053..9021) /gene="nifH" /locus_tag="DP116_22140" CDS complement(8053..9021) /gene="nifH" /locus_tag="DP116_22140" /EC_number="1.18.6.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867592.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase iron protein" /protein_id="PRJNA477356:DP116_22140" /translation="MSVDEKIRQIAFYGKGGIGKSTTSQNTLAAMAEKGQRILIVGCD PKADSTRLMLHSKAQTTVLHLAAERGAVEDLELEEVMLTGFRGVKCVESGGPEPGVGC AGRGIITAINFLEENGAYQDVDFVSYDVLGDVVCGGFAMPIREGKAQEIYIVTSGEMM AMYAANNIARGILKYAHSGGVRLAGLICNSRKTDREDELISTLASRLSTQMIHFVPRD NIVQHAELRRMTVNEYAPDSNQGNEYRKLADKVINNKMLAIPTPIEMEELEELLIEFG ILESEENAAKMVGISKAQEDAEKKQEDLEGEALEALKKGNVEVVKK" gene complement(9464..10594) /locus_tag="DP116_22145" CDS complement(9464..10594) /locus_tag="DP116_22145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869127.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" /protein_id="PRJNA477356:DP116_22145" /translation="MKTDNDLRNNLKSIRTRLGMSQQELASLAGVTRQAISGVESGQC APSVAMTIRLAKALGCKVEDLFWLEQDLPQVEAVLAKSIPGSQQQRVSLARIGGQWIA YPLVGNDAFRMEMISADAQIDSQTDTNTCLVRLLDDIDRFENTVVIAGCAPVLSLWAR ATERWYPQLRVQYKFANSMAALESLCRGETHIAGMHLYDSQTGEHNIPFVREVLAGKE AVLITLGIWEEGLLVQSGNPMQITTVTDLVEAKAKIINREKGAGTRLLLERKLQEEQL PCDSVKGFDSIARSHHSVADAVVSGFADAGISTASVATAFGLEFIPWHSSRYDLVILK EYLQESPVQQLLNTLERRSVHSQLEIIGGCDTSKTGDLVATI" gene 11220..11912 /locus_tag="DP116_22150" CDS 11220..11912 /locus_tag="DP116_22150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864578.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HAD family hydrolase" /protein_id="PRJNA477356:DP116_22150" /translation="MSPFDLVIFDCDGVLVDSEQITNSVFAAMLSEIGLPVTLEYISN TFVGQPMTACLEIIQESLGKPIPDNFVTEFRQRNKKDFSKKLKPIKGIHDVLSQIKTS YCVASNSIHEMIQTKLGVTGLLPYFEGKIFSVTDVARGKPHPDLFLYAAEKMKAHPQR CVVVEDTPIGVQAGFSAGMKVFGYAEYSNPQKLQAAGASVVFSSMRLLPNLLEQFDTP LAKAEGYHPQTN" BASE COUNT 3406 a 2718 c 2476 g 3633 t ORIGIN 1 tttttagaac tttacattaa ttcttttgaa gctatgcaaa tttagacaaa gttgtagtaa 61 gatttgcagt tattatggct tatttttcaa acatacgctc ggagcgattc aaacagggaa 121 cagaaaacag tgaacaggga acagtgagca gtgaacaatt aataactggt aactggtaac 181 tggtaactga taacttgaat tcccccgcaa gaggtaatcc atcctctaca ctgtaattta 241 tggcgtcaac tatttatgct ttaccagcag aagtcgtaca tctgattaca gccggagagg 301 ttatcgactc cttagcctct gtggtgcgag aattggtaga aaattccctt gatgcaggtg 361 caacacgaat tgttgtttct ttatggcccg ggcaatggcg cgtacgtgtg gcagataatg 421 gttgtggaat gaccttagat gacttacaga aagcagcaac ggcacacagt accagtaaaa 481 tttgcagttg tgcggatttg tggaaaatta caagtttggg gtttcgtgga gaggcgttgc 541 atagtttgac aacgctggca gatttagaaa tattgagtcg tccacttggt ggagattgcg 601 gctggcgagt tgtctatggc tacgctggag aagcattgta tgtggaagct tgtgcgatcg 661 cccccggtac agttgtcaca gtctctcatc tatttgggaa ttgttcaggt 
cgtcgtgagg 721 ggctacctac actagcacag caaataaaag cagtgcaagc aacaattcac caaatcgcct 781 tatgtcatcc tcatatcact tggcaagtct ggcaaaatga ccgagagtgg tttagtctat 841 gtccggccac ttcaatagga caactgctac cccagattct acatcaagtt aaagaagcag 901 atttacaaga ggtaaaactg gaaataccca atcctgagaa ttcagaattc ggaattcaga 961 agtcaggact ccatttagtg ataggattac cagatagatg tcatcgtcat cgtccagact 1021 ggatacgagt tgctgtcaac ggaaggatgg tgaagtcatc ggaattagaa cagactattt 1081 tgtcaggatt ttatcggaca ttaccgcgcg atcgctatcc aatcactttt ctgcatctat 1141 tgatttctcc agaacatatt aactggaacc gtaacccagc aaaaacagaa atttatctta 1201 accaactcag ttactggcaa gaacaaatta cccaggcgat cgcgcaagca ctccgcatca 1261 attctacatc tgttaaagac gctgttcaca caacgcgagt tggaaaatta ctcaaagctg 1321 cagaagaaaa aggcggatac aatgtcaatc gttcgcttac ttgtgaagaa cacaaaacgc 1381 atccagagat gttgctagca acgtctctac aactaaaagc cgttgctcaa gttaataaca 1441 cttatatcgt ggcagaacat cctggtggag tctggttagt tgaacagcac attgcccatg 1501 aacgagtttt gtacgagcaa ttgtgcgaca attggaaaat tattccaatt gaacctccag 1561 tcattctcta tcaattatct ccggctcaag tatcgcaact acaacgaatt gatttggata 1621 tagaaccatt tggtgagcaa ctttgggcag ttcgtacggt tccgacactt ttacaacagc 1681 gagacgattg tgcagacgcc attttagaac tgagttgggg aggtgactta caagcagcac 1741 aagttgctgt cgcctgtcgt agtgctattc gcaacggtac aactctcagt ttacaagaaa 1801 tgcaaacact tttggatgaa tggcaacgca cgcgcaaccc tcgtacctgt ccccacggac 1861 gtccaattta tttatctttg gaagagtcag ctttggctcg ttttttcaaa cgtcattggg 1921 tgattgggaa aagtcatggg atttagattg acaatgaaca gttgagcgta gggtgggcag 1981 tgttgttaat tgccagatta tttgtctgcg ttgacaaatt aatgtctttt ttgacaaatt 2041 attgccactt ggcgtggaac tagcgaagac actgacttgg aaatagaggc tgaaaaagtt 2101 tatcctatat aggtttcagc cttttacttc ttgtgtttgg ttaaaacaca agaagtctac 2161 aagtagacta ctcacaacca ggagaaatta atactttgag tcacccattc tatgtaatag 2221 tagttatctc gttcacaacg ttttgccggg ttgctgaact acgaactggc acatgcttct 2281 agattcgtgc agtagtctaa tgacataatg gttctaacct gtactcactc cccttaatat 2341 catgtccggt tgaagactta tcattagggc tgacgtcgag aggcggtgac actctgacgc 2401 
ggagaattat taattacaag taattaatac cgtttttata tgaagatgca tagaaagaac 2461 cccaccgcct gcggcacctc cccttcacaa ggagagccag tgcattgcgg ggatctcccc 2521 ccaactccac tctgtgggga tcccaaagcc ccccattgta gcacctggcg tgaggttagg 2581 aggggtaaaa aaatatccag cttcatagag aattgatgta agaggacttg atataacccg 2641 gtttagagcg gaatgctact cataccccta aggactatct ctatatgcga aataagttat 2701 ttcataggct aatagctact tctgtcttaa tcacagtatc tattcctaat attacccttg 2761 ctggcgatca tcaaccgcaa acatggatac aatcatgtga taaagctttt attgaactta 2821 acaaccatcc ttttgataac cacgagcagt cgaggctagc aaagacggtt acgaaccaat 2881 acgaaaattt gaaaaaagaa ttaaaacttg atgctacgct tgcagaaaat gctccaaata 2941 agtatgaaag gaagtattta gtaacattgc tacaggagtg cgatcagagt ctgccaagtg 3001 aggttcaaaa tttttcgaca aatacacttc cacaagaaga tagcattcgg atagactatt 3061 ctgtccgcat tgataataat ctattccgct cattcaaggg atcaatcaag aagttaatca 3121 gctccatagg agttgatccc gacacaataa actttgctca cataagaaga tactttcaac 3181 gtaccataac acgatatcaa gatgccataa aacaatttca agatgccata aaacaatttc 3241 aagaggtcag agtcccaata aaaacaatat tagccaatgt cggctttgtt tttagccttg 3301 tcggctctgc ttttatcgcc tctttggtat catacccaaa aagttctcgc gaaaacacaa 3361 cagaaagtga aagttctagg agtattccaa taaaaaaatc atccaaaaat taagcagaag 3421 agccaaagat ttcaagaaac cattttttca agtttggtaa acgttgaatc tgttgttcaa 3481 cttgatacaa actatctggg agccgccgag ataccgatac cgttggcgaa tgtactacat 3541 atttgtaaat tcaggtatta cgtaagttct aactcttaat ctgtcgttgt ggcttttagt 3601 tgtaacactt ttgtttatac ttttctaaca ctttcgccaa cagtatccat atctcggcgg 3661 atacccagta aaagtttatc gatagaaaat tccagcatat tgtatccccc aactcaaaga 3721 aacgcctgat aaaagttttg cgaaactttt caatctaccg acattaactt tttgattttt 3781 tatttcgcag cgattctaga cttttatcaa cagtaactgt catgtgatgt tcttttccta 3841 gcacacttgt tgttatttcc agagcttgat tatacaataa ttcagcttca tcataacgct 3901 cttgggactc atagagcttt gctaaactgc tcaggctagt ggcaaagtcg ggatgttctt 3961 ttcccagcac tctttttctt atttccaggg cttcattata gcgtaattca gcttcatcat 4021 aacgcctttg gaagaagtag agatttgcta aactgttcag gctagtggca aagtcgggat 4081 gtttttctcc 
cagcaccatt cctgttagtt ctaaggcttg gttatacaat ggttcagctt 4141 tatcataacg ctcttggaag aagtagagat ttgctaagtt gttcaggcta gtggctaagt 4201 tgttcaggct agtgacaaag tcgggatgtt ctccttccag cacccttcct gttagttcca 4261 gggcttgctt atacaatgct tcagcttctt cataacgccc ttggaagaag tagagatttg 4321 ctaagttgtt caggctagtg gcaaagtcgg gatgttcttc tcccagcacc cttcctctta 4381 gtgccagggc ttgctgatac aatggttcag ctttattata acgcccttgt aaggaataga 4441 gatttgctaa gttgttcagg ctagtggcaa agtcgggatc ttgttttcct aaacgggatt 4501 ctattaatgg tacacattgc tcataccact cttgtgcgga tgcgtacaaa ccttgtcctt 4561 tataaaattt acctaaacta acaaagagca gaattaaatt tttatcactg actgcatcag 4621 tcagagtttt tgccacctct gctaaatgtg ggatagcatc cttgactgag ttaattgatt 4681 gttgagtaat tggctcaggg aatttctgag tgatcgccac cattgtttct gcaaaagcgc 4741 gtttcaactc atcggcttgt tccgatgctg cttgcttgac tttcaaaaac tcccgaatta 4801 agggatgaat tttgtagcaa tcatctctat ctccactttt atcttccacc cattgaatca 4861 aatgacgctc gtaaagttgc ttgactgctg tttcaacttc cgttgcatcc caattcagcc 4921 gagaagttgc agagtctacc cattcccaag caaagatatc cggtgcaaat aaacttaata 4981 actcacccac aacctgtgtc gtaggatcga gttccagcca acttaattca aacgcatcca 5041 aaaccccaag ctgtgctgta ctaaaggttt tttgcaattg ttgcttatgg cgatttatcg 5101 cttcatcttg gagtcgctgc tgtttcagcc gttgcaacat cttagctaaa gtccaatggt 5161 gaagtttttt agccaaatac ccccccacca attccaaacc caaaggtaaa tatcccagcc 5221 attcacacaa ttgtttcgct gtttcctctt ccttttgcac ccgtttttcc catttttcac 5281 ccactagctt tgtcaacaat tgcaaagctt cttctggtga caacacatcc agagatattt 5341 cctcaatgtt ggtgtcgagg tttcgcaacc gcgttgtcat caacacacgg aagcggttac 5401 cttttggtat aagttgttga caattattca aattcgtgat accgtctaac actaccaaca 5461 ccaaaccttc tgttggtcgc cagttttgcc agcaccactc cacctgttgt tctggactaa 5521 gttgtcgtcc ttgcaacacc tgcggcactt tcaaattcat gtaaagctgg gtaaactgca 5581 caatctccgc agctaaattt gagtctctag catttaacca gcaaattcca ccaggataat 5641 cggcttcatg ttgacgtgca tactgcgttg ctaattcggt tttgccgaca cctcccatac 5701 cagcaacagc agaaatcgct actgtacctg gtttttgtaa tttttttttg agtaaagata 5761 gttcatactc tcgtccaaca 
aagttggcag agcctgtata aggaatatat tttgggggat 5821 ttaactttac ccgttcccgt tcgggcggct gtagtcatat actgatgttt tggttttgaa 5881 tattaatatc tctttgagca aaattaacaa ttttctcgat tgttgagatc caagtttgat 5941 tctgtgggga ttgtgattta agtgcttgtg ctatatcttg gagaacttga gccaaattta 6001 atggtggatt agcttctgct tgtgctacca attcctgcac agcttgattg acttcaggat 6061 tagctttagc tgctgattcc acctctaata cagctttacc gtaatcgagt ggttgagaga 6121 gcactttttc aatagcaact acagtatggg gagagtgttt tttgagagtg acgagaaact 6181 caccagtttt attccataga gtttctccta ccttttcgcc cgttttttcc aaagcctttg 6241 tcgcaatcac agtcccaatt gcgatcgccc cagcggttag tggttccatg tgctacctca 6301 taagaatggc atcctattta cttattacca catgatttta gttctgaatt ccgcaattga 6361 ggtaaaaact tatagcttaa gtttttactc gtaagcaatg cgccaatcta cgaaagggca 6421 ttttcgtgta gcacagcctt acaacagctt attttcgtca ggttaatgct cccaatttca 6481 acctacttca acatctgttt tcccacgggg cgggtttgcg tacttgtttc catataggac 6541 tagtaagagt cgcatacagt ttttgtgcca cttctaaaat ccctgtatag cctgcataag 6601 aatgatgaag ttccgccttt atatctagca aaggaatctt agctttaaca gaggcgtatt 6661 gataacgctc tccagtaatt aacatatcgg ctttattttc gtgaattatc tgtagaattt 6721 cctctggact attttgttgg ataataatgc tatctttacc cagcaacttt tgaattttag 6781 ctctatcttc ctgactactt ttaacagtgc tgatagggat aacttctatc cccaattttt 6841 gagcagcaca gataattgac caacttttca cgtcttgtat atcaataaca atacgtttgc 6901 cttgcacata ggtgcgataa aaagcaagtt tcgcctctaa agcgctagtt tcaaacgcaa 6961 taaacttttc tgtacgtttt tgtaagtcgg agtctcccaa cttagcagca atattccgta 7021 agccttgatt gatgttttct atgccgtaga aaaactcttg aatgtagggg ataccaaaac 7081 gctcttccat cttttttgcc attttaaaca gcgctttgga agaagtaatc acattcagct 7141 tagcacggtg ggcataacag acttccttat agcgagtatc acccgtaatt ttgcaaagga 7201 ctcggatacc taatttgtca aatagcggta taatgttcca cattgcatcc gccatgttgg 7261 attcaccgat gaggttaata tcataaggtg tagtaaagtc tggttcagct gtaccaacaa 7321 cgtgttccaa caaagcttca ccagcaatgc ggataccctg atttttacga ccaacaaatc 7381 cgggacagtc tacaggaata atgggagttc ctgtttgcac agcagcctct gcgcaggctc 7441 cattaaggtc atcaccaata agagcggaaa 
cgcaggtaga ataaacaaat accgcagcag 7501 gtttgtagcg tctttgcaat tctaaaatgg cattgtataa ttttttagct ccaccgaaga 7561 tgatatcttt ttcatccata tcagtcgtaa aacgagcttt gtatagcatg gtaccaaaag 7621 agagactgct gctattttcc caacagttgt tgatgcaacc gcttggtccg tgaattaaat 7681 gagcagcatc ggtaattggt actaaagtca tcattgcacc atcaaaagca caatcttctt 7741 gaccctctct caggacagat tctcgattct gcgttttgta atctgggttt ttattaaata 7801 catcttcact tttcatgcaa gtagaaattc tctagagaca tcggaaaaaa aaggaatgga 7861 aaagagcgat ttcccattcc ctaaaatgta accaaccaat aaaaggagaa aacaaaaaac 7921 taaactaatc gttttaccgc gcgaacattt acagattcat gctgacacgg aaaacctaga 7981 aatcgtagaa ctatacatta gccattcaga ctacctccac atctctatct tcaccagtgc 8041 tttgtagcgc acttatttct taacaacttc tacgtttcct tttttcaatg cttctaaggc 8101 ttcgccttca agatcttctt gctttttctc ggcatcttct tgagctttgg atatgccaac 8161 catttttgcg gcattttctt cactttcgag aataccgaat tcaatcaaca actcttctaa 8221 ctcctccatt tcaatgggtg taggaatggc aagcatttta ttgttgataa ccttgtctgc 8281 taatttacgg tattcgttac cttgattgct atcaggtgca tactcgttca cagtcatccg 8341 gcgtaactca gcgtgttgca cgatgttgtc acgggggacg aagtgaatca tctgggtgct 8401 taatcgggat gctagtgtgc taatcagttc gtcttcccgg tcagttttgc ggctattaca 8461 aatcagacca gccaagcgca caccaccaga gtgagcatac ttgagaatac cgcgagcgat 8521 gttgttagca gcgtacatcg ccatcatttc accagaggtc acaatataga tttcttgtgc 8581 cttaccttca cggataggca tagcgaaacc accgcagaca acgtcaccca acacgtcgta 8641 gctaacgaag tccacatctt ggtaagcacc gttttcttct aagaagttga tggcggtgat 8701 aataccacga ccagcgcaac cgacaccagg ttcgggacca ccagattcta cgcacttaac 8761 accacggaag ccggtgagca tcacttcttc gagttccaaa tcttctactg cgccccgttc 8821 agcagcgagg tgaagtacgg tggtttgggc tttgctgtgg agcatcaaac gggtggaatc 8881 agccttcgga tcgcatccca caatcaaaat gcgctgaccc ttttctgcca tagcggcgag 8941 ggtattttga gaggtggtgg acttaccgat accgccttta ccgtagaatg ctatttgtct 9001 aattttttcg tcaacggaca tgatgagttt tctcctgtaa ttgataaatt aatcagtggg 9061 tctggtagtt tatcaggcgt tcactaatgg aagtgacagc tctgtttgtg accgcatgta 9121 gaacgtcgcg ccaggcgtgt agtacttttg ccaacaggag 
gccattggca aacaacagcg 9181 cctacggggg gcgatcgctc taccgcatac atagaaagaa gaaatatcat gtgagtcttt 9241 agttagtgat tatttgtcct accccaacaa gaaattactt gaagggcatt tttttcaaaa 9301 attaagcatt ttgtagaggt ttttccctca tccacaaaaa agcgctcttt gaagtcagaa 9361 ttcttattta tttttttact tatgatcagg caaggtatgt actatcccta tggcgtttcg 9421 ccaccctgag gggatttccc tgttgcctgt tgcctgttcc cttttaaatt gttgcgacta 9481 aatccccagt tttactagta tcacaaccac cgatgatttc tagttgtgaa tgaacagatc 9541 gacgctctaa cgtattgagt aactgctgta ctggtgattc ttgcaagtat tccttgagaa 9601 tcactaagtc gtagcgtgag gaatgccaag ggatgaattc caatccaaaa gctgtagcca 9661 cagacgctgt agaaattcct gcatccgcaa atcctgagac gacagcatca gcaacactat 9721 gatggcttct ggcgatactg tcaaatcctt taacagaatc acatggcagt tgctcctctt 9781 gcagtttacg ttccaaaagc aaccgagtgc cagctccttt ttcgcggtta ataatctttg 9841 cttttgcttc tactaagtca gtcactgttg tgatttgcat cggattgcca gactgtacta 9901 ataatccctc ttcccaaatt ccaagggtga ttaaaactgc ttcttttcct gctaagactt 9961 cacgaacaaa gggaatatta tgctcccctg tttgggagtc atacaggtgc atcccagcaa 10021 tatgagtctc acctcggcac aaactctcta acgcagccat gctgttagcg aatttatact 10081 ggactcgcag ttgaggatac cagcgttcgg tggctcttgc ccatagcgaa agcacaggcg 10141 cacaaccagc aatcacaact gtattttcaa atctgtctat atcgtcaaga agtctgacta 10201 ggcaggtatt tgtatctgtc tgactatcga tttgggcatc agctgaaatc atctccatgc 10261 gaaaagcatc attcccaacc aggggatatg caatccattg cccgccaatc cgcgccaaac 10321 tcactcgctg ttgttgacta ccagggatag attttgccag aactgcttcc acttggggca 10381 aatcctgctc tagccagaat agatcttcga ccttgcaacc cagcgccttt gctagacgaa 10441 ttgtcatagc gacagaaggg gcgcattgtc ctgactctac accactaatt gcttgacgtg 10501 tcacacctgc taagctagcc aattcttgct gactcatgcc taagcgggtt ctaatggact 10561 tcaagttgtt acgaaggtca ttatccgtct tcatttggct ctatctcttg gtgacaaaaa 10621 gcgtagcaag ttgatgcgga tttaccagag taatagacca agattcaaat aatcttaatt 10681 tccccttaag attattgcca tgagcgcttt tttgattccc tttttgattc cttaaaataa 10741 gcttagcatg gatatcgcta atttatttgc ctgtggaaat tttttttgct cactcctgat 10801 tcagttggta acatgtagca tattttttta 
ctttttcttg attcatcaat gacttataat 10861 attcctaact gcaacagttc aagtattctg taatagtatt attttttatt atttagtata 10921 aataaactaa ataaaaattg cgtagagcgc accaccctta tggggaattg ggcaaggcga 10981 cgcgcctaaa cttcttcaac acggttcggt taagatcccc cgcctgcggc gaccccctta 11041 aaaagcaggg ctgtttcatt ccctcaaagg ctcataatgt caagggtttg tgaagcgcga 11101 ttgctctgta ttttattttc tgaaagtccc ttattaaatg gcgacttcca ctttaaaatc 11161 aaactcaact ctaacttcag ccatgagaaa atgagttgag ggttgaagta ggaaaatcaa 11221 tgagcccatt tgacttagtt atctttgatt gtgatggtgt acttgtagat agtgaacaaa 11281 tcactaactc ggtttttgcg gcaatgctta gtgaaatcgg gttaccggta accttggaat 11341 atatatctaa cacattcgtc ggtcaaccta tgacagcatg tttagaaatt attcaagagt 11401 ctcttggaaa acctattcca gataattttg tcaccgaatt taggcaacgc aacaaaaagg 11461 atttctcaaa aaaattaaag ccaattaagg gcatacatga tgttttatct caaattaaaa 11521 cttcttattg tgtagcttca aatagcatac atgagatgat tcaaacaaag ctaggtgtca 11581 cagggttact gccatatttt gagggaaaaa tatttagtgt tacagatgtc gcacgcggta 11641 aaccacatcc agatctattc ttatatgcag ccgagaaaat gaaagctcat ccacagcgat 11701 gtgtggtagt tgaagatact ccaatcggtg tgcaagccgg atttagcgct gggatgaaag 11761 tttttggtta tgcagaatac agtaatccac aaaaattgca agcagcaggc gcatcagttg 11821 tttttagttc tatgagatta ctgccaaatt tacttgagca atttgacact cccctcgcta 11881 aagccgaggg gtatcacccc cagacgaatt gataaattct gaccccttcg gggttcgccc 11941 ttcgggtatc tcctaagccc tgcgggcacg cactcgcgtt cggagacgcg ctcgtcccaa 12001 ggggacacgc tgcgcgttcg ccttcggcgt gcgctttgcg cttacgcgaa cgcagtcgcc 12061 tacggaggga gaccctcctg gagcgctgtc tcaccagtcg cagcctacgg ctcgctccgc 12121 taacatgggg gaaaccacgc cagacgctac cctagtggga acggccacgc cttacggcta 12181 tcgggaagcg gagatccggt gttagcgcct ggctcaacgg acaggtgcgc gac // LOCUS NODE_2781_length_12197_cov_4.766019 12197 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12197) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12197) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12197 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(39..230) /locus_tag="DP116_22155" CDS complement(39..230) /locus_tag="DP116_22155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744511.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22155" /translation="MLWIFKMPKTKFKPDNQVAYDKVPVQFKVLPGVREKLKAVPNWQ ERLREFVDQLITDSAIQFS" gene 282..1367 /locus_tag="DP116_22160" CDS 282..1367 /locus_tag="DP116_22160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006616249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_22160" /translation="MLLGFKTELKLNNQQRTALMKHCGVARHAWNWGLALTKQILDHN KTNSAIKIKFPTAIDLHKWLVALVKAENEWYYECSKSTPQQALMALREAWKRCFNKTA GVPKFKKRSRRDSFTLEGTVKILGNNKIQVPVIGVLKTYERLPQVLTKSATISRIASR WFISFRFDVEEQDLSNTSVVGVDLGVKNLATLSTGEVVNGAKSYKKYEAKLSRMQWLN RHKIIGSNNWKKAQMQIAKLHRKISNIRKDTLHKLTTLLTKNHGVIAIEDLNVSGMMA NHKLAKSIQDMGFFEFRRQLTYKCELYGSKLVVVDRWFPSSKTCSHCGSKKETLTLNE RVFECSHCGFVIDRDLNAAINLKKIAS" gene 1488..1823 /locus_tag="DP116_22165" CDS 1488..1823 /locus_tag="DP116_22165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315715.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tRNA-binding protein" /protein_id="PRJNA477356:DP116_22165" /translation="MTEISYDDFEKVEIRVGKVIKVEDFPQARKPAYKLWIDFGDLGV KKSSAQITKLYQKENLTHRLILAVTNFPPRQIADFMSEVLVLGVVLDDGEVVLIQPEK EVPLGQRIF" gene complement(1880..2191) /locus_tag="DP116_22170" CDS complement(1880..2191) /locus_tag="DP116_22170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315716.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3067 domain-containing protein" /protein_id="PRJNA477356:DP116_22170" /translation="MTGQELHQLLLEKWGRSYDVQLRRTQGKIFLQVMWKYLEQVSFP LNEVEYQEHLDSIANYLNGLGGETQVKTYIRETRERPRLGKAVSIPLDLGERAAEWLI Q" gene 2418..2957 /locus_tag="DP116_22175" CDS 2418..2957 /locus_tag="DP116_22175" /EC_number="1.10.9.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006278029.1" /note="Rieske protein, with cytochrome b6, cytochrome f, and subunit IV, makes up the large subunit of the cytochrome b6-f complex; cytochrome b6-f mediates electron transfer between photosystem II and photosystem I; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome b6-f complex iron-sulfur subunit" /protein_id="PRJNA477356:DP116_22175" /translation="MAQFSESMDVPDMGRRQFMNLLTFGTVTGVALGALYPVVNYFIP PASGGAGGGTTAKNELGNDVSVSKFLESHNVGDRVLVQGLKGDPTYIVVDSKEAITDY GINAVCTHLGCVVPWNAAENKFKCPCHGSQYDATGKVVRGPAPLSLALSHAKAEEDKI VLTQWTETDFRTGEDPWWA" gene 3221..4222 /locus_tag="DP116_22180" CDS 3221..4222 /locus_tag="DP116_22180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996608.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="apocytochrome f" /protein_id="PRJNA477356:DP116_22180" /translation="MRNVCQIASLTRSARTIIKTLLVAIATVTFFFTSDLVLPQAAEA YPFWAQETYPESPRQPTGLIVCANCHLAAKPTEVEIPQSVLPDTVFKAVVKIPYDTSV EQVGADGSKVGLNVGAVLMLPEGFKIAPEERIPEEWKEEIEGLYYQPYTEDQENVVII GPLPGEQYQEIVFPVLSPNPATDKKIHFGKYAVHVGGNRGRGQVYPTGLKSNNTVYTA SAAGTISKIAKTEDEDGNVKYELSIQPESGDVVVDTIPVGPELIVSEGQVVKKDEPLT NNPNVGGFGQDDGEIVLQDATRVQWLIAFVALVMLAQVMLVLKKKQVEKVQAAEMNF" gene 4342..4665 /locus_tag="DP116_22185" CDS 4342..4665 /locus_tag="DP116_22185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22185" /translation="MTDKTYPIASAKIGTQDGFRLPRAFSKDHPHLVNTSGYVQVLSK NTFLVQLDTDEVEEVDETESIMMSLFLDFLMKDAVKNPEQLVAYTKEMSDEMDELLCD VVLDA" gene 4662..5144 /locus_tag="DP116_22190" CDS 4662..5144 /locus_tag="DP116_22190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system YhaV family toxin" /protein_id="PRJNA477356:DP116_22190" /translation="MNSFTSDGWRIFFYPLFDKQWVELSHRVRTLKTELSKQEFMTHG DVKLLKGLNIGIKEKITQDPFASHFVLHKPLHRYGRLKKMGLPARYRLFFRAFKEQKI IIIIWLGFPRKEGDKNDCYQVFAKKVTNIDFPENVDELLAECEVTDFKEQKKDSLETK " gene 5389..5640 /locus_tag="DP116_22195" CDS 5389..5640 /locus_tag="DP116_22195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456029.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22195" /translation="MNAFYAMLYQGNMTKAEALRQAQVAMITDNYSRVVQQQNKAILE STRDLRAAQQRSYRFLLKVANNLSHPYYWAPFILIGNGL" gene complement(5728..7317) /locus_tag="DP116_22200" CDS complement(5728..7317) /locus_tag="DP116_22200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006104699.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1957 domain-containing protein" /protein_id="PRJNA477356:DP116_22200" /translation="MAIGYVALVLHAHLPFVRHPESDYVLEEEWLYEAITETYIPLLR VFEGLKQDGVDFKITMSMTPPLVSMLRDPLLQERYDAHLAKLEELIELEIEHNVHNGH IRYLAEHYASEFKAIREVWERYNGDLVTAFKQFQDTNNLEIITCGATHGYLPLMKMYP QAVWGQIKVACEHYEENFGRPPKGIWLPECAYYEGVERMLADAGLRYFLTDGHGILYA RPRPRFGTYAPIFTETGVAAFGRDHESSQQVWSSEVGYPGAAEYREFYKDLGWEAEYE YIKPYIMPNGQRKNTGIKYHKITGRGLGLADKALYDPYWAREKAAEHAANFMYNREQQ VGHLHGIMQRPPIIVSPYDAELYGHWWYEGPWFIDYLFRKSWYDQKTYEMTHLADYLR ANSSQQVCRPSQSSWGYKGFHEYWLNQTNTWIYPHLHKAAERMIEIAKREPEDELEWK ALNQAARELLLAQSSDWAFIMRTGTMVPYAVRRTRSHLMRFNKLYEDVKIGKIDSGWL EKVELMDNIFPNINYRVYRPL" gene complement(7542..8909) /locus_tag="DP116_22205" CDS complement(7542..8909) /locus_tag="DP116_22205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome P450" /protein_id="PRJNA477356:DP116_22205" /translation="MTNLSTQRFAWRTPKGNLPLPPGRFGLPIVGESISYLRDPARFI EQRQKRYGTIFKTHLFGRPTIVFIGADATRFLFTNESQRFEMTNTPSFEVLLGANSIG VKTGTAHQILRKQLFQAFEPRALAEYATTIEDMTRRYLHRWERMGTLTWYSELKKYTL DIACKLFVSVETASDENLEKVYETWSDGLLSIPXXXVRAREQLLAKIDEMINQRQQHP SSNEDVLKILLQAQDEEGNRLSLEEVKDNVLGMLVAGHETLTSALTSLCLLLAQHPQV LQAARAEQEQLGLTQPLTQESLKQMTYLEQVLKEVLRIAPPVVRSGSRKVLESCEFGG YLIPQSWDVFYQIQETHQDQNVYAQRERFDPERFAPERVENKQKVFSYIPFGGGIREC LGKEFARLEMKVFAALLIREYQWELLPGQNLERVVLPFSRPHDGLKVKFWRHKRGASC SVSEF" assembly_gap 8322..8331 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 9041..9631 /locus_tag="DP116_22210" CDS 9041..9631 /locus_tag="DP116_22210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019499265.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="PRJNA477356:DP116_22210" /translation="MSKPVKSPPGRPRSAQSHQAILQAALELLAEVGFERMSIDAIAA RAGVGKPTIYRRYKSKEELVADAIESCRQEYVVPDTGSLWGDIDALINSAAQITFTPL GRQTVAMMISTASSNPQFAQVYWTKYLQPRRQAFAVVFERAKSRNEIQADLDSDLVFD LISGIMLYALVFQATTEPWEAYIRRTLHLLLREPSS" gene complement(9738..10826) /gene="rsgA" /locus_tag="DP116_22215" CDS complement(9738..10826) /gene="rsgA" /locus_tag="DP116_22215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874195.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribosome small subunit-dependent GTPase A" /protein_id="PRJNA477356:DP116_22215" /translation="MRGEGLEATDQLLGTVLAVQANFYRVQLDIGMRQMSEQSSPHPP MLLCTRRTRLKKIGQQVMVGDRVVVEEPDWAGGRGAIAQVLPRESELDRPAIANINQI FLVFAVADPPLEPYQLSRFLVKAESTGLSVILCLNKCDLIATEQQAEINQQLVTWGYQ PIFISLSKSINIDKKVDKLKDNMTVIAGPSGVGKSSLINTLIPNANLRVGEVSGKLAR GRHTTRHVELFELPSGGLLADTPGFNQPDFDCVPEQLVTYFPEARQRLAVDSCRFSDC LHRDEPGCVVRGDWERYQHYLDFLEEVIKRQTQLNQQADPESTVKVKTKGKGKTQYEP KLESKKYRRTSRRTQLQQLQQLYQETDE" gene complement(10823..11080) /locus_tag="DP116_22220" CDS complement(10823..11080) /locus_tag="DP116_22220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456792.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sulfurtransferase TusA family protein" /protein_id="PRJNA477356:DP116_22220" /translation="MSLSSLSTPNAQLDLRGTPCPINFVRTKLRLEQMNPGELLEVWL DSGEPIEQVPDSLTMAGYQVEQITDCVSYFSLLVRRPVSAS" gene complement(11077..>12197) /gene="dnaJ" /locus_tag="DP116_22225" CDS complement(11077..>12197) /gene="dnaJ" /locus_tag="DP116_22225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315722.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_22225" /translation="YEILGVSRDADKEEIKHAYRRQARKYHPDVNKEPGAEDRFKEIN RAYEVLSEPETRARYDRFGEAAVNGGSGAGFQDMGDMGGFADIFESIFSGFAGSAGGT TQRRRSGPVRGDDLRLDLKLDFREAVFGGEKEIRISHLETCEICSGSGAKPGTRPRTC STCTGTGQVRRVTRTPFGSFTQVSTCPTCNGTGMVIEDKCDACDGKGAKQVTKKLKIS IPAGVDNGTRLRISAEGDAGQRSGPPGDLYVYLFINEDEEFHRDGINILSELKISYLQ AILGCRLEVNTVDGPQELLIPPGTQPNTVMKLENRGVPRLGNPVSRGDHMITVLIDIP NKLTIEERELLEKLAKIKGDRTGKGGLEGFLGNFFHQR" BASE COUNT 3451 a 2697 c 2589 g 3450 t 10 others ORIGIN 1 tgcttgcgaa taatgcttac tttacctgac tttgaggctt aactgaactg tattgcgcta 61 tcagtaatca attgatcaac aaactctctt agccgttctt gccagttggg gacggcttta 121 agtttttccc ttactcctgg caagactttg aactgtaccg ggactttatc atatgccact 181 tgattatctg gtttaaactt tgtttttggc attttaaaaa tccaaagcat ctgatatact 241 agatattgta caccttttta aaaagttgaa aggatagctt aatgctgtta ggtttcaaaa 301 ctgaattgaa actaaataat caacagcgca cagcattaat gaaacattgc ggtgttgcac 361 ggcacgcttg gaactgggga ctggctttaa caaaacaaat ccttgaccac aataaaacca 421 attccgcaat taagatcaag tttcctactg ccattgactt acataaatgg ctggtggcat 481 tagtaaaagc tgaaaatgaa tggtattacg agtgcagcaa atctacacca caacaagcat 541 tgatggcttt gcgtgaggca tggaagcgtt gttttaacaa aactgctggc gttccgaagt 601 tcaagaagcg aagtaggcgc gattctttta cattggaagg cacggtcaaa attttaggta 661 ataacaaaat ccaagtacct gtaattggtg ttctcaaaac ttatgaacga ttaccacaag 721 tattaacgaa gtctgcaaca attagccgta tagcttccag atggtttatc agcttccgat 781 ttgatgtaga agagcaggac ttaagtaata caagcgttgt aggcgttgac cttggcgtta 841 aaaacctagc aacattgtcc actggtgaag tggttaatgg cgcaaagtca tacaaaaagt 901 acgaagccaa actcagcaga atgcagtggt tgaatcgtca taaaatcatc ggttcaaaca 961 actggaaaaa agcgcagatg caaatagcta aactgcatag aaaaatatcc aacatccgaa 1021 aagatacgtt gcataagctc accacactat taaccaagaa ccacggtgta atagccattg 1081 aagacctcaa cgtatccgga atgatggcta accataaact ggctaaaagc atccaagata 1141 tgggattttt tgagtttcgt aggcaactaa cctacaagtg tgaactttat ggttccaaac 1201 ttgttgtagt tgaccgatgg tttcctagct 
ctaaaacttg ttcccattgt ggcagtaaaa 1261 aagaaacact cactttgaat gagcgagtgt ttgaatgtag tcattgtggt tttgtcattg 1321 accgcgattt gaacgcagca atcaatttga agaaaatcgc cagttaggcg aggttagcct 1381 gtggactgga tagtgccgac actaccagat tgaaacagga agtaagcgaa actcaaacaa 1441 gaatgaatag ctttgatagg tgttgattag gtttgagtag gatttatata acggaaataa 1501 gttacgatga ttttgaaaaa gtagaaattc gtgttggcaa agtcatcaaa gtagaagatt 1561 ttcctcaagc ccgaaaacca gcatacaaac tttggataga ttttggtgat ttgggcgtca 1621 agaagtctag tgctcaaatt accaaattgt atcaaaaaga aaatttaacg cacaggttaa 1681 tattagctgt caccaatttt ccacctcgtc aaatcgctga ttttatgtct gaagtccttg 1741 tgttgggtgt cgtgctagac gatggtgaag tcgtgttgat tcagccagaa aaagaggtac 1801 ctctgggtca aagaattttc tgacaatctc ctcatttttc tgtcttaggt ttcgccatag 1861 ttggtgaaag tatttacaac tactggatca accattccgc cgcacgttct cctaaatcta 1921 acggaatact cacagcttta ccaagtcgtg ggcgttctcg tgtttctcga atataggttt 1981 ttacctgtgt ttcaccacct aagccattca gataattggc gatgctatcc agatgctctt 2041 gatattctac ctcatttaat gggaaagaca cttgctctaa atatttccac ataacttgta 2101 agaatatttt accttgtgtg cgacgcaatt ggacatcata ggagcgtccc cacttctcaa 2161 gcaacagttg gtgtaattcc tgtcccgtca tatcttttct caaatatttt tacatttagt 2221 tatgatgtta agttccatga ctcaagtatg ataggactat aaagaaatgt aatcctacta 2281 agaaagatgc tggaaagatt tttgcacaaa gcatcctggc atccttgtag gcattatcct 2341 tttcactcag acaagagttt cctaatgaaa aactcaagtc atatttgtta acaaaataca 2401 caaagaacgc gtaggttatg gctcaatttt ctgaatcaat ggacgtgccc gatatggggc 2461 gtcgtcaatt catgaatctt ctgacatttg gaactgtaac cggagtggct ctgggagcat 2521 tgtatcccgt tgtcaactat tttattcctc cagcaagcgg tggtgctggg ggtggtacaa 2581 cggcaaaaaa cgagctaggg aacgatgtca gtgttagcaa gtttttggaa agtcataatg 2641 taggcgatcg cgttctcgtt caaggactca agggagaccc cacctatatt gtggtagata 2701 gcaaagaggc gattactgat tacgggatta acgccgtttg cacgcactta ggttgtgttg 2761 ttccttggaa cgcagcagag aacaagttta agtgtccctg tcatggttca cagtacgacg 2821 cgacgggtaa ggttgtccga ggtccagcac ctctgtcttt agctttgagt catgcaaaag 2881 ccgaagaaga caaaatcgtc ctgactcaat ggacggaaac 
cgacttccgt acaggcgaag 2941 atccttggtg ggcttaacag ttatcagtta tcagttatca gttatcagct catcttggct 3001 tgataactct tgagtgttta ctgttcactc aaaaggctgt tcactgattt aatcagtgca 3061 tagtctatag tttggagtcg aaatttgact cttgactcat gactgttgac ccttcgggtt 3121 cgccagatgc ctgttgtcgg gaaaccctcc cgcagcactg gtctcactaa tgactaatga 3181 ctaatatcag agaatcgttg tcctagttgt ccttatagag atgagaaatg tttgtcaaat 3241 agcgagttta actcgcagtg ctagaacaat tatcaaaacc ttgctcgtag cgatcgccac 3301 agtgacattt ttcttcacca gcgatctagt cctacctcaa gcagctgaag catatccctt 3361 ttgggcacag gaaacctacc ccgaaagtcc tcgccaaccg acagggctga ttgtgtgtgc 3421 taactgtcac ctagcagcaa agcctaccga agtcgaaatt ccccagtctg ttctgcctga 3481 cacggtattt aaagctgtgg taaaaattcc ctacgataca agcgtagagc aggtgggtgc 3541 cgatggttct aaagttggct tgaacgtcgg tgctgtgctg atgctaccag aaggcttcaa 3601 gattgctcct gaagaacgca tccccgagga gtggaaggaa gaaatcgagg gtttatacta 3661 ccaaccctac accgaagatc aagaaaacgt cgtcatcatt ggaccgctac ctggtgaaca 3721 atatcaggaa atcgtcttcc ccgtcctttc tcctaaccct gcaactgaca agaaaattca 3781 ctttggtaag tacgctgttc acgtaggtgg taaccgtgga cgcggacaag tttaccccac 3841 tggcctaaag agcaacaaca cggtatacac ggcttctgct gcgggtacaa ttagcaagat 3901 tgccaaaaca gaagatgaag acggtaacgt caagtacgaa ctcagcatcc aacctgaatc 3961 tggggacgta gtagttgaca ctatacccgt tggaccagaa ttgattgtct ctgaaggaca 4021 agtggtcaaa aaagatgaac ctttgaccaa taatccaaac gttggtggat ttggtcaaga 4081 tgacggcgaa atcgtgctac aagatgctac gagagttcaa tggttgattg catttgtcgc 4141 acttgttatg ttggctcaag ttatgcttgt cctcaagaag aagcaggttg aaaaggttca 4201 agctgcagag atgaatttct aaattcagag ccaaattatc agtcatagga caggtatttg 4261 acctgtcttt tttatgtttg tttgacgata ttataataaa tgttataata aaagaattgc 4321 cttcataaag gagtaactta gatgactgac aagacctatc ctatagcttc tgctaagata 4381 ggcacgcaag atggttttcg cttacctcgt gctttttcta aagaccatcc tcatttagtt 4441 aatacttcag gttatgtgca agtcttatct aaaaacacct ttttagtgca attagatact 4501 gatgaagttg aggaagtaga tgaaacggaa tctattatga tgagtctgtt cctcgacttt 4561 ttgatgaaag atgctgttaa aaatcctgag caactcgttg cttatacaaa 
agaaatgagc 4621 gacgaaatgg atgagctact ttgtgatgtt gttttggatg catgaactca ttcacatctg 4681 atggatggag aatatttttc tacccattgt ttgataagca atgggtagaa ttgtctcatc 4741 gggttcgtac tttaaaaact gagttatcta aacaagagtt tatgactcat ggagatgtaa 4801 agctactaaa gggcttgaat attggtatta aagaaaaaat tactcaagat ccttttgcat 4861 ctcactttgt attacacaaa cctttacaca ggtatggtcg tctgaagaaa atggggctac 4921 ctgcccgata tagattattt tttagagcat ttaaagaaca aaaaattatt atcattattt 4981 ggttaggatt cccacgtaaa gaaggggata aaaacgattg ttatcaggtt tttgcaaaaa 5041 aagttactaa tattgacttt ccggaaaatg tagatgaact gcttgcggaa tgtgaggtaa 5101 cagattttaa ggaacaaaag aaagattcac tggaaaccaa atgattagca aagatactgc 5161 ataagaaaaa gcacaacgct tagcagccaa ccagatactc tttgatatca aagcacttca 5221 tcccaactcc attgagcgct aaaccgtctc taacgtgttt ccaagattga tctagcaatc 5281 ttaaaacttg ctgtgcaacc ttagcaggaa gacttaaaac taccagaggc aaaacagctt 5341 ttggctacca aaggtttttc tggcttagaa aaccgtacac aaaccttgat gaatgctttc 5401 tacgcaatgc tgtatcaagg taatatgaca aaggctgagg ctttacgtca agcgcaagtg 5461 gcaatgatta cggacaacta ctcaagggtt gttcaacagc aaaacaaagc tattttagaa 5521 tctacgcgcg atctgcgcgc agcgcagcag cgaagctatc gcttcttgct aaaagtcgct 5581 aataatctta gtcaccccta ttattgggca ccattcattt taattggcaa tggcttgtaa 5641 gaacagggtt tggtgttgtc caccctagtc tttgcacagc aggtgaggac atcctcacct 5701 accgcccctg ttatttccgt tttgctacta caatggacga tacactcggt agttgatatt 5761 cgggaaaata ttgtccatca attcaacttt ttccagccaa ccgctatcga ttttgccgat 5821 tttaacgtct tcgtaaagct tgttaaagcg catgaggtgc gatcgcgtcc ttctaactgc 5881 ataaggtacc atcgttcccg tccgcataat aaacgcccag tcggaagact gcgccaacag 5941 cagttccctc gctgcttggt tcaacgcttt ccactccaac tcatcttctg gttcccgctt 6001 tgctatttca atcatccgtt ctgcggcttt atgcaaatgt gggtaaatcc acgtatttgt 6061 ttggttcaac caatactcat ggaatccctt ataaccccaa cttgattgcg aaggacggca 6121 gacttgctga cttgaatttg cccgcaaata atctgccaag tgggtcattt cataggtttt 6181 ttggtcatac catgacttgc ggaacaggta atcaataaac caaggacctt cataccacca 6241 atgtccatat aactcagcat cgtagggaga aacaataatt ggcggacgct gcattatacc 
6301 atgaagatgt cctacttgct gctcacgatt atacataaaa ttggcagcgt gttctgcagc 6361 tttttcccgt gcccaatacg ggtcgtagag tgccttatct gccagtccta agccgcgacc 6421 tgtaattttg tgatacttaa tgcccgtatt tttgcgttga ccgttaggca taatgtaggg 6481 cttaatgtac tcatattctg cttcccaacc caagtctttg taaaactcgc ggtattctgc 6541 agcaccagga tagccgacct cagaagacca tacctgctga gaagattcat ggtctcgacc 6601 aaaggcagca acaccagttt ctgtaaaaat tggggcataa gtgccaaatc gtggacgagg 6661 acgggcataa agaataccgt gcccatcagt gaggaagtag cgcaaccctg catcggctaa 6721 catccgctct acaccttcat agtaggcgca ttctggcagc caaatacctt tgggtgggcg 6781 accaaagttt tcttcgtagt gttcgcaagc tactttaatt tgtccccaca cagcttgcgg 6841 gtacattttc atcagtggta ggtagccgtg ggtagcaccg caggtaatga tttctaaatt 6901 gttagtgtcc tgaaactgct taaaagctgt gactaagtcc ccgttgtagc gttcccaaac 6961 ttctcgtatc gccttaaact cactggcgta atgctcggct aaataacgaa tatgaccgtt 7021 gtggacatta tgctcgattt ccagttctat aagttcttcg agtttggcta aatgagcgtc 7081 gtagcgttct tgcaaaagag gatcgcgtag cattgacaca agaggtggtg tcatgctcat 7141 cgtgatttta aagtcaacac cgtcttgctt taagccttca aagactcgca gcaaaggaat 7201 gtaagtttcg gtgatggctt catagagcca ttcttcttct agcacataat cactttctgg 7261 gtgacgaacg aagggcagat gtgcgtggag tacgagcgca acgtagccga tagccataat 7321 tatagttccg ggatggtact tgttagatga aaggttggga atttggcaga tgttaagaat 7381 attttaagac tttatgggga tcgttacatt ctttagtcaa ctctccaaat tatgcctacg 7441 cattgatgca catctcattc atgcccttcg caactcattt cataaaaggc aaaaggtcaa 7501 actcttgacg aatatctaac agtgatcgat ctaaaagcgg ttcaaaattc agaaacggaa 7561 catgatgctc cccttttatg tcgccaaaat ttcaccttta agccatcatg aggacgggag 7621 aacggtagta caacccgctc tagattttgt ccagggagta attcccactg atactcacga 7681 atcaacaacg ctgcaaacac cttcatttcc aatctggcaa attccttgcc taaacattct 7741 cgaattccac caccgaacgg aatatagcta aagacctttt gtttgttctc gacccgttca 7801 ggtgcaaaac gttctgggtc aaatcgttca cgttgagcat agacattctg gtcttgatga 7861 gtctcttgga tctggtaaaa cacatcccaa ctttgcggga tgagataccc accgaactcg 7921 caggactcaa gcactttccg cgaaccactg cgaactactg gaggtgctat tcgtaaaact 7981 
tcttttagca cttgctctag ataagtcatt tgcttgagag attcctgtgt tagcggttgt 8041 gtcaacccga gttgctcttg ttctgcacgg gctgcttgca gcacctgtgg atgttgagca 8101 agcaataaac acaaggatgt gagtgctgaa gttaaagttt cgtgtccagc aacgagcatt 8161 cccaacacgt tatctttcac ttcctccaag ctcaaacgat taccctcttc atcctgcgcc 8221 tgcaatagaa ttttcaaaac atcctcatta gaggaaggat gttgttggcg ttggttgatc 8281 atttcatcaa tctttgcaag aagctgctca cgcgcacgaa cnnnnnnnnn nggaatagat 8341 aacagcccgt cactccaagt ctcataaact ttttctaggt tttcatctga ggcagtttca 8401 acgctcacga acaacttaca ggcaatatcc agcgtatatt ttttcagttc ggaataccaa 8461 gtcaatgttc ccatgcgttc ccatcgatgc agataacgac gggtcatgtc ttctattgtc 8521 gtagcgtact cggctaatgc tcttggctca aaggcttgga acaactgctt acgaagtatt 8581 tggtgagcag tacccgtctt tactccaatt gagtttgctc ccagtaacac ctcaaaactt 8641 ggagtattgg tcatctcgaa tcgctgactt tcattggtga acaaaaaacg ggttgcatct 8701 gctccaatga aaacaatcgt tggacgacca aacagatgag ttttgaaaat tgttccatat 8761 cgcttttgtc gctgctcgat aaaacgggct ggatcgcgca aatagctgat ggattcgcca 8821 acaatgggca aaccgaagcg accaggaggt aacgggagat ttccctttgg ggttcgccaa 8881 gcaaatcgct gagtcgagag attggtcatg ctgttttctg ctgtttaatg gataaagaag 8941 tttttaaatt acgatactgt atcgtaattt aaaagtaaat gtcaattgga caactgcttg 9001 gtaagttgtt tagatacgga tgccttagaa tgacgatcgc atgagtaaac ctgtcaaaag 9061 tccgccggga cgaccccgta gtgcccaatc acatcaagca attctgcaag cagccctgga 9121 actgctggca gaagttggat ttgagcgcat gagtattgat gcgatcgcag cccgtgcagg 9181 agttggcaaa cccacaattt accggcgcta caaatcgaaa gaagaactag ttgcagacgc 9241 gattgagagt tgtagacaag agtatgtcgt tcctgacact ggtagcctct ggggtgatat 9301 tgacgcgttg atcaacagtg ctgcacaaat cacatttacc cctttgggac gacagacagt 9361 tgctatgatg atcagtactg catccagcaa tcctcagttc gctcaggtct actggacaaa 9421 atacctacaa ccccgacgac aagccttcgc tgttgtattt gaacgcgcaa aatcgagaaa 9481 tgaaattcag gcagatttag attctgatct cgtttttgac ctgataagtg gaatcatgct 9541 ttatgcactc gtgtttcaag ctacaacaga accttgggaa gcatatattc gtcgtacctt 9601 gcatcttctt ctgagagaac catcaagctg actttccctg aacaaaaagc taggggagtg 9661 tgaataagaa 
aaatatcatt tcactcgatt aatgcccaag cagcataagc atatttgcgt 9721 aagccgttgg tgatttctta ctcgtccgtc tcctgataca actgctgcaa ctgctgcaac 9781 tgcgtcctgc gtgaagtccg gcgatatttc ttactttcta gcttcggttc gtattgagtc 9841 tttcccttgc ctttcgtttt taccttgaca gtagattcag gatctgcctg ttggttaagc 9901 tgagtttggc gcttaataac ttcctctaaa aaatctaagt aatgctgata tcgttcccaa 9961 tctccacgga ctacacaccc cggttcgtct cgatgcagac aatcactaaa acgacagcta 10021 tcaactgcta atctttgtct tgcttctggg aaataagtta ctaattgttc tgggacacaa 10081 tcaaagtcag gttgattaaa accgggagta tctgctaata aaccaccact tggcaattca 10141 aataattcca cgtgacgagt ggtatggcga cctcgtgcca gtttaccaga aacctctccc 10201 actcgtaagt tagcattggg aatcagtgta ttgatcaggc tggacttacc aactccagaa 10261 ggtccagcta tcactgtcat gttgtctttg agtttatcta cttttttatc aatatttata 10321 cttttgctca gactaataaa tattggttgg tagccccaag ttaccaactg ctgattaatt 10381 tctgcctgtt gttctgttgc aattaaatcg catttattta agcataaaat cacacttaaa 10441 cccgtagact cagctttaac cagaaagcga ctaagctggt acggttccaa aggcgggtca 10501 gcaacggcaa atactaagaa aatttgatta atattagcga tcgctggacg atctaattca 10561 ctttcacgag gtaatacttg agcgatcgcc cctcgtccgc cagcccaatc tggttcttct 10621 acgactacgc gatcgcccac catcacctgt tgcccgattt ttttcaaccg tgttctgcga 10681 gtgcatagaa gcatgggggg atgaggagaa ctttgctcac tcatctgtct catccctata 10741 tctaactgca ctcggtaaaa attagcttgc acggctagaa ctgtgcctag taactgatcg 10801 gtagcttcta agccttcccc tctcatgagg cactcacagg gcggcggact agtagagaaa 10861 aataactaac acaatctgta atttgctcta cctgataccc tgccattgtc aaactatcag 10921 gaacctgctc aatcggttcc cctgaatcta gccagacttc tagtaattct ccgggattca 10981 tttgctccag acggagtttc gtccgcacaa aatttatcgg gcaaggggtg ccacgcaaat 11041 cgagttgagc attaggagtt gagagggaag atagactcat cgttggtgaa agaagtttcc 11101 caaaaatcct tctagtccgc cttttccagt gcgatctcct tttattttag cgagcttctc 11161 cagaagttca cgttcttcaa ttgtcaactt attaggaata tcaattaaca ccgtaatcat 11221 gtggtcgcca cgactgacgg gattacccaa gcgcggtact ccacgatttt ccagcttcat 11281 gacagtattt ggctgagttc ctggtggaat caacaattcc tgtggtccat ctacagtatt 11341 
tacctccaag cgacagccca aaattgcttg caggtagcta attttgagtt ccgagagaat 11401 attgattcca tcccgatgga attcctcgtc ctcattgata aacagataaa cgtataaatc 11461 tccaggagga ccactgcgtt gacctgcatc accttctgcg gagatccgca agcgtgtacc 11521 gttgtccacc ccagctggaa tggatatttt tagtttcttc gtaacttgtt ttgcaccctt 11581 accgtcgcaa gcatcacatt tatcttcgat caccatcccc gtaccattac acgtaggaca 11641 agtcgaaact tgggtgaagc taccaaaggg cgttctggtg acgcgacgta cttgacctgt 11701 tccggtacaa gtcgaacaag tgcgagggcg agttccgggt ttagcgccag atccgctaca 11761 aatctcacaa gtctctagat gagaaatgcg aatttccttt tcaccgccaa agactgcttc 11821 tcgaaagtct agctttaggt ctagccgcag atcatcgccc cgcactggtc cactgcgtcg 11881 tctttgggtt gtaccacctg cactaccagc aaagcctgag aaaatgcttt caaagatatc 11941 agcaaacccg cccatgtcac ccatatcttg gaagcctgca ccagatccac cattcacagc 12001 agcttcacca aagcggtcat aacgtgcgcg ggtttccggc tcagaaagaa cttcataagc 12061 acggttaatt tctttgaagc gatcttccgc tcccggttct ttgttcacgt ctgggtgata 12121 cttccgggct tgacggcggt aagcatgttt gatttcttct ttgtcggcgt cacgagagac 12181 acctagaatt tcataat // LOCUS NODE_2805_length_12050_cov_4.78099212050 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 12050) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12050) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12050 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..523 /gene="dndD" /locus_tag="DP116_22230" CDS <1..523 /gene="dndD" /locus_tag="DP116_22230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130702.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="DNA sulfur modification protein DndD" /protein_id="PRJNA477356:DP116_22230" /translation="IITSAAKVQQTLKLFREKLTLRKLNKLEVEVTECFRYLLHKSDL VHRVAIDTHTFSLSLYDLQGKPVPKHRLSAGEKQLLAIAFLWGLARVSGRRLPVAIDT PLGRLDSSHRSNLVERYFPSASHQVILLSTDTEIGDKEVKILRENEAIAREYLLKYDS STRQTTVQPGYFW" gene 710..1099 /gene="dndE" /locus_tag="DP116_22235" CDS 710..1099 /gene="dndE" /locus_tag="DP116_22235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315872.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA sulfur modification protein DndE" /protein_id="PRJNA477356:DP116_22235" /translation="MEPPIDRIKLSQTAKDQLLKLRRNTKIDQWNILCRWAFCRSLAE PTPPSPVPIPQDSNVEMTWRVFGGEMSDIFLLALKQRCYNDGFPTDKQTLATQFRLHL HRGIGYLAGDPNIKKIEDLVELATKKI" gene 1178..2641 /locus_tag="DP116_22240" CDS 1178..2641 /locus_tag="DP116_22240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA phosphorothioation-associated methyltransferase" /protein_id="PRJNA477356:DP116_22240" /translation="MFSESYVEIERHRAAIVRTDLSRPVRLAIEWSIINKDTTFFDYG CGHGGDVERVANLGYTSAGWDPYYYPDTPRICANVVNLGYVLNVIEDPEERRQALCQA WELAHKVLIVAAQVLINAPSKTQVAYSDGIVTRRNTFQKYYDQEELKKYIDETLNVDA VPVALGVYFVFRDEAEKEEFKAIRFFSRTSTPKVRIHTKRFEDYKEMLEPLMAFFTHR GRLPVKGELDNEQELLSEFKNFRRAFAVVLQATDEAEWDAIAYRRSLDIQVYLALTHF DKRPTSHKLSPEMRHDIKAFFGSYEEACEVADAKLFSLGKTGVVQTACEKSKIGKHTR SALYVHVSALGELDPLLRIYEGCASRTIGRVDDATLIKFCLDEPRISYLFYPEFDTDA HPALKASITIDLKTLRITHRDYEQRANPPILHRKETFVTSNYSYYQEFAQLTQQEQKL GLLKHKSEIGTREGWTKCLAEHGVEIRDHQIHLVNEN" gene 2828..4075 /locus_tag="DP116_22245" CDS 2828..4075 /locus_tag="DP116_22245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859399.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AmpG family muropeptide MFS transporter" /protein_id="PRJNA477356:DP116_22245" /translation="MAALVLLGFSSGLPYLLTGNTLTAWMTVENVDLGTIGWFSLVSL PYSLKFLWSPLLDRYKLPILGRRRGWLIATQIALIVAIALMAGQQPGTVLQLLAINAI AIAFLSATQDIAADAYRTDVLEPLEVGAGAGLFVSGYRLAIIVAGALALILAAHLPWK SVYLLMALFMVIGIFGTLLAPEPKKVTPPDSLAQAVILPFGEFFQRLGVYQAPLTLAF IIFYKLGDAFLSKMSTPFLLKTGFTLTDIGAIQIGMGSIATIVGALAGGSILSKIGIN RSLWVFGILQALSNIVYFFLATLGQNYQFMVIAINVENFCGGLGTAAFIGFLMSLCNQ RFSATQYALLSSLMAVSRDILAAPAGAIAEITGWQIFFLISIAVAVPGLLLLPLFAPW NQKPRAIKRPGLNDDEEDLWGKN" gene 4060..4482 /locus_tag="DP116_22250" CDS 4060..4482 /locus_tag="DP116_22250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315864.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_22250" /translation="MGQKLIIIIATAILLITGVLLGYVLSQLVLGYLTPNLLTLLGTF SLIIIFGTLYYVLFWEFRRQQAQTSPAARRRPSRREQQREPDANLKNRLISLAGDATV AQRLVDQAKQDFPGMPENWYWERAIADLERDNPPAPTA" gene 4729..4953 /locus_tag="DP116_22255" CDS 4729..4953 /locus_tag="DP116_22255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859397.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22255" /translation="MAIFRQYIAPFFVVLIFIIALVAVSSRIFLPSDMAAPAPVEESG EVGSVNTPLFTLSTLSGQSPQRTASPSLSV" gene 4946..5497 /locus_tag="DP116_22260" CDS 4946..5497 /locus_tag="DP116_22260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115906.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_22260" /translation="MSEELLPNYRIRRGSTLDRALLVKFMQRTYQEAFPQQDFSHLAR TVEQYFSRETPLWWVDFVEVEQEEAPEAGGESNSKLGVSPSLPSSSPIAGLWVGNAID QVQGDRHAHIFLLYVVPEHRRQGIGTALMRYVENWARARGDHQIGLQVFQTNTAALNL YNQLGYETQSLWMVKSLHPYKYD" gene 5590..6348 /locus_tag="DP116_22265" CDS 5590..6348 /locus_tag="DP116_22265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876076.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HEAT repeat domain-containing protein" /protein_id="PRJNA477356:DP116_22265" /translation="MYDEDDLSLLDVETELESPLDHMEPVTAETETPKPNPEQMLALL EHQDPQQRMLAARAFCDIQDARAIPHLIRLLTDTCPLVRVSAAYGIGRNPSLDAVEPL IAQLNRDWNGYVRKGVVWALGNCRDRRSLAPLADALKTDISAVRLWAATALAQMAEVG YEAVVGAIPPLISALVQDPVAPVRSNCAWTIGQLCRELPSNVVYATAIDALIQAFAED QDLGVREDAKASLLGVGDPRGLQLIETLEQEGWF" gene complement(6416..7750) /locus_tag="DP116_22270" CDS complement(6416..7750) /locus_tag="DP116_22270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740451.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sorbosone dehydrogenase family protein" /protein_id="PRJNA477356:DP116_22270" /translation="MKHLRCLLLFLLLTTAAACNQTRASQEEATPESSPSAQLAQSST QVKNGIRTEPFSPAPIRINVADLPKPYATDSASKSPEVVPIPENPTLRVPQGFVVNVF ADGLDAPRWLALTPNGDVLVTETGQNRIRLLRDTNGDGVADVKQTFASQANGLNRPFG MAFAGNSFFLGNTDAVLRFPYTKNQQQITGSGTKIASLPAKGYNNHWTRNVVVSPDGK KLYVSIGSGTNADEEPLPRASVQVMNLDGSQQQTFASGLRNPVGLDFQPVTKELYATV NERDGIGDELVPDYLTRIQQGEFYGWPYAYLTAKNLDPRQKTGDASKRPDLVRRTRTP DVLFQSHSAALGLQFYDGQTFPEKYRNGAFVAFRGSWNRDRGTGYKIVFVPFDTKGRP QGYYEDFLTGFLLDPTVPTTWGRPVGLLVLPDGSLLVTEEANNRIYRIQYRG" gene 7965..8588 /locus_tag="DP116_22275" CDS 7965..8588 /locus_tag="DP116_22275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130292.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1003 domain-containing protein" /protein_id="PRJNA477356:DP116_22275" /translation="MRHNADDTANKHFSSGANELKLKASRNQVFTAPLPDPITKNIEA ISSLHIQEVRDIPAHQRILEAIATFFGRSTFLYSLLVILALWIFGSFFDRFLPFNLPS FSWSNQGLDAAALVISTGVLVRQTRQENFAEQRAQLMLQLNLLSEQKIAKIIALLEEL RTDLPDVINRHDSEAELMQEPADPIAVLEALQKNLAQELSSTEENNS" gene 8669..8917 /locus_tag="DP116_22280" /pseudo CDS 8669..8917 /locus_tag="DP116_22280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130291.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="alpha-amylase" gene complement(9223..10476) /locus_tag="DP116_22285" CDS complement(9223..10476) /locus_tag="DP116_22285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine protease" /protein_id="PRJNA477356:DP116_22285" /translation="MSLSSSEEQRFLPVLRQVISHALIAVLSVGLTLTTLWAFPNLQI LKRSSLQGAVAPITPTVETTTPPTANSQEVPIRSFVSAAVNRVGSAVVRIDTDRTIKM RAPDPYFSDPFFRDFFGNDFSAMPQEYHQRGEGSGFIIDPNGMILTNAHVVSGADSVT VTLKDGRKLKGEVKGVDQPSDLAVLKINEKDLPVAALGNSQDLKVGDWAIAVGNPLGL DNTVTLGIISTLNRSSAQVGIPDKRLDFIQTDAAINPGNSGGPLVNEQGEVIGINTAI RADAQGIGFAIPIDKAKLIKDALVRGDKIPHPYIGVRMMTLTPEFAKQSNSDPNTTLT LPQINGVLVVQVMPNSPAATAGVRRGDVITQVDEQAITTAEQLQDLVELSRINQPLQI KVQRGEQTQQLSVRPGELGEAAMKS" gene 10725..11735 /locus_tag="DP116_22290" CDS 10725..11735 /locus_tag="DP116_22290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316725.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)-dependent alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_22290" /translation="MKVWEVHSKEGIEALTLVEKPEPQPKAGQVLLKMRAASLNYRDL LTVKGAYGSKQKLPFVPLSDGVGEVVAVGEGVSRVKVGDRVAGIFMQTWLEGEFSLDK SKSALGGAIDGILAEYVTLDENGVVHVPEHLSDEEAASLPCAAVTAWNALTTDGKLKA GDTVLIQGTGGVSLFALQFGKIMGVKVIATSSSDLKLEKLKQLGASELINYKTTPNWD EKVWQLTNEVGVDRIIEVGGAATFNKSLRAVRYGGYISLIGVLSGFSADVSTVSILHK GITVQGIYVGSRDMFETMNKAIALHGIKPIVDRVFPFEEVRQALEYMESGAHFGKIAL RF" BASE COUNT 3417 a 2695 c 2706 g 3232 t ORIGIN 1 tattattacc tctgcagcta aagtccaaca aacacttaag cttttccgcg aaaaattaac 61 cctgagaaaa ctcaacaaac tagaagttga agtcaccgaa tgcttccgct acctcctcca 121 caaatcagac ttagtccatc gcgtcgcaat tgacactcac accttcagcc tctcgctgta 181 cgatttacaa ggtaaacctg tccccaaaca tcgactttct gcaggcgaaa aacaactctt 241 agcgatcgcc ttcctctggg gattagcccg tgtttctgga cgtcgcttac cagtcgcaat 301 tgacacaccc ttaggacgct tagactcctc ccaccgcagt aacttagttg aacgttactt 361 tccatcagcc agccatcaag ttattttgtt atccacagat actgagatag gagacaaaga 421 agtcaaaata ttgcgagaaa atgaggcgat cgcccgcgaa tacctcctca aatacgactc 481 atccacccgc caaacaacag ttcaaccggg atatttctgg taaactcccc ttccgggaag 541 ggcgcgtgaa gcgaaggatc ggaataagaa tctttttttg 
aattttgcgg aaagttgcta 601 ggaggagcca gtacttgatg agggtctccc tcacttggta tctggcgttg gtttcccgac 661 agagcaaact ttccaagacg aattttgaat ttaagtccga aggacgcgca tggaaccccc 721 aatagatcgt atcaaactct cccaaacagc caaagaccaa ctcctcaaac tcagacgcaa 781 caccaaaatc gaccaatgga atatcctgtg tcgttgggcg ttttgtcgtt ccctagccga 841 accaactcca ccctcacccg tcccaattcc tcaagatagc aacgtcgaaa tgacatggcg 901 cgtctttggc ggcgaaatgt ccgatatatt cctactcgcc ctcaagcaac gctgctacaa 961 cgacggcttc ccaacagaca aacaaaccct agcgacacaa tttcgcctcc acctacatcg 1021 cggtattggt tacctcgcag gtgatccaaa tatcaagaag attgaagatt tagttgaatt 1081 agcgacaaaa aagatataag gattaataaa ccacagatgc acgcagatga attatccgcg 1141 tgcatttgcg tttatctgcg gttcccttaa tatctttatg ttttcagaga gttacgtaga 1201 aatagaacgc catagagctg cgattgttcg cactgactta tcgagacctg tgcgattggc 1261 aatagaatgg tcaattataa acaaggatac cacgtttttt gattacggtt gcggacacgg 1321 tggcgatgtg gagcgagtag ccaacctagg ctacaccagt gcaggctggg atccatacta 1381 ctaccccgat accccgcgca tttgcgctaa tgtcgttaac ttgggctacg tcctgaacgt 1441 aattgaagac ccagaggaac gtcgccaagc tctttgccaa gcttgggaac tcgcccacaa 1501 agtcttaatt gttgcggctc aagttttgat aaatgctccc agcaagactc aagttgctta 1561 cagtgatggt atcgtcactc gccgcaatac ctttcagaaa tattacgatc aagaagaact 1621 caaaaaatac attgatgaaa ccctaaatgt tgatgcggta ccagtcgcgc tgggtgtgta 1681 ttttgttttt cgagatgaag ccgaaaaaga agaatttaaa gccatacgct ttttctccag 1741 aacctcaaca ccaaaggtac gcattcatac caaacggttt gaggactaca aagagatgtt 1801 ggaaccactg atggcttttt ttactcatcg tggaagactt ccagtcaaag gcgaattgga 1861 taacgaacag gaattactca gtgaatttaa gaactttcgc cgtgcttttg ctgtggtttt 1921 acaagcgact gatgaagcag aatgggatgc gatcgcctac cgtcgttctc ttgacatcca 1981 agtttatctt gctctcaccc acttcgataa acgcccgaca tcgcataaat tatcaccaga 2041 aatgcgccat gacatcaaag ccttttttgg cagttatgaa gaagcttgcg aagttgctga 2101 tgctaagctt ttcagtttgg ggaaaaccgg agtcgtacaa actgcttgcg aaaaaagtaa 2161 aataggcaaa cacactcgta gcgcccttta cgtccatgtt tctgcgcttg gtgaacttga 2221 ccccttgctg cgaatttacg aaggctgcgc tagtcgcact attggccgtg 
tcgatgacgc 2281 aacactgata aagttttgtc tagatgaacc gcgaatatcc tacctgttct acccggaatt 2341 cgatactgat gctcatccag cattaaaggc gagtatcacg attgatttaa aaactttgcg 2401 cataactcac cgagactacg agcaaagagc aaatccacca attcttcacc gtaaagaaac 2461 atttgtgacc tctaactatt catattacca agaatttgct caactcaccc aacaagaaca 2521 aaaattagga ctactcaagc ataaaagcga gattggtact cgtgaaggtt ggacaaaatg 2581 tcttgcagaa catggggtag aaattagaga tcatcaaatt catctggtta acgaaaatta 2641 ggattcagcc aaaaaatgaa aaaccctaac tgcgtaagat agtggaactt tgcgttatat 2701 ataggcatct ttgcctgata ttacatcgta aatctgacaa gatatctaaa attcttccac 2761 tttaggagaa ataaataaaa cagtgcaaac aatttcatca ttactacaag tttttaaaag 2821 cccgaagatg gcggctttgg tgttattagg tttttcatca ggtttaccct atttgttaac 2881 tggtaacact ttgacggctt ggatgaccgt agaaaatgta gacttgggga ctattggctg 2941 gtttagtctt gtcagtttgc cttattcctt gaaatttctt tggtcacctt tgctcgatag 3001 atataaactc cccattttgg gacggcgacg gggctggtta atagcaacgc agattgcttt 3061 aattgtagcg atcgccctca tggctgggca acaacccggc acagtactac aactgcttgc 3121 gataaacgca atagcgatcg cttttttgag cgctacacaa gacattgctg ctgacgctta 3181 ccgtaccgac gttttagaac cactggaagt gggtgctggt gctggtcttt tcgtctcagg 3241 ttatcgactt gccattatag tagcaggtgc gttggcatta attctggctg cccatttgcc 3301 ttggaaatcg gtttatctgt taatggcgct tttcatggtt attgggattt ttggcacttt 3361 gctcgcgcca gaaccaaaga aagttacccc tccagattcc ttagcacaag cggtcatact 3421 accatttgga gaattttttc agcgcctggg tgtataccaa gccccgttga cattagcatt 3481 catcatattt tataaactag gcgatgcctt tctgagcaag atgtccacgc catttttact 3541 caaaactggc ttcaccctaa ctgatattgg cgctattcag atcgggatgg gttcaattgc 3601 tactattgtt ggtgccctag caggtggttc gattttgagc aaaatcggta taaatcgctc 3661 gctttgggtc tttgggattt tacaagcatt gagtaatata gtatactttt ttctcgcgac 3721 actcggacaa aactaccaat ttatggtcat cgccatcaat gtagaaaact tctgtggtgg 3781 tttaggaaca gctgccttta ttggtttttt gatgagtttg tgcaaccagc gttttagcgc 3841 gactcaatat gctttactat ctagtttgat ggctgtcagt cgtgatattc tcgctgcccc 3901 agctggcgca atagcggaaa ttacgggttg gcagatattt ttcttaatta gcattgccgt 
3961 ggctgtacca ggactattac tattaccatt gtttgcccct tggaaccaaa agccacgggc 4021 aattaagcga ccaggactca acgatgatga agaggattta tggggcaaaa actaatcatt 4081 attatcgcaa cggctattct gttaattact ggcgtcttac ttggctacgt tctttcccag 4141 ttagttctgg gatacttgac gcctaactta ctaactcttt tagggacttt cagtctcatt 4201 ataatttttg gtacacttta ttacgtttta ttttgggagt ttagaagaca gcaggcgcag 4261 acttccccag cagcaagaag gcgaccctcc cgaagagaac aacagcgtga acctgacgca 4321 aatctcaaaa acaggctcat ctctttagcc ggtgacgcga ctgttgccca acggttggta 4381 gatcaagcca agcaagactt tcctggtatg ccagaaaatt ggtattggga aagagcaatt 4441 gctgacttgg aacgcgataa ccccccagcg ccaaccgcct gaactcctat taagtgagat 4501 tctttcaaga aacacgctta acagggaacg cttaacaggg aacagtgaac agtgataact 4561 gataactgat aactgacgac tggtaactgg taactagtaa ctgataactg ataactgata 4621 actgttttaa gaagggaggt ggtgtttgtt tcattcgtta ccagcagcgt gtaaccgcgc 4681 acgagtttaa ggctcacaaa gatcaaccgt tcgttgcaaa atgaaactat ggctatattt 4741 cgtcagtaca tcgcgccgtt tttcgtagtg ttaattttca ttattgctct tgtcgcagtc 4801 agctcccgca tctttttacc ctctgatatg gctgcacccg caccagtaga agagagtggg 4861 gaagtgggat ctgtaaatac cccgctattc acgttgtcta ccttgtctgg tcagtcccct 4921 caacggactg cctcaccttc tttaagtgtc tgaagaattg ctaccgaatt acagaattcg 4981 ccgtggctct accttggata gagcgctgct ggtaaaattc atgcagcgaa cttaccaaga 5041 agcctttcca cagcaagatt tctcccatct cgcgcgaaca gttgaacaat acttctccag 5101 agaaactccc ctttggtggg ttgattttgt agaagtagaa caagaggaag caccagaagc 5161 gggaggagaa agcaattcca aattaggagt ctctccctct ctcccttctt catcccccat 5221 tgcgggttta tgggtaggaa atgccataga tcaagtgcag ggcgatcgcc acgctcacat 5281 ttttttactc tacgttgtgc cagaacaccg acgacaaggt atcgggacag ctttgatgcg 5341 atatgtagaa aattgggcaa gggctagagg tgatcaccaa atcggattac aagtctttca 5401 aaccaacaca gccgcattga atctttacaa tcagctagga tatgaaaccc aatcactgtg 5461 gatggtaaaa tctcttcatc cttacaaata tgactgaaat tgctgcaaga gacaaattgt 5521 aaattctaat actggtatag caaaaccgca gtggaaattt agagaaattc tctcgcattg 5581 tgtctaaata tgtatgacga agacgacctt agcctactag atgtcgaaac agaactagaa 5641 
agccctctgg atcacatgga accggtcact gccgaaacag agacgccaaa gcccaatcca 5701 gagcagatgc tagcattatt agaacatcaa gatccgcagc aaaggatgct ggccgcgcgt 5761 gctttttgcg acatacaaga tgctagggcc atcccccatc tgattcgctt attaactgat 5821 acctgtcctt tggtacgggt gagtgcagcg tatggtatcg gacgcaaccc aagcttagat 5881 gctgtggaac cgttgattgc ccagttaaac cgagattgga atggctacgt gcgtaaaggg 5941 gttgtttggg cgttggggaa ctgtcgcgat cgccgcagcc tagcacctct agcagacgct 6001 ttaaaaaccg atatttccgc agtacgcctg tgggcggcta ccgccttggc acaaatggca 6061 gaagtgggat atgaagctgt tgtgggagca atacccccct tgatttcagc cttggtacaa 6121 gatcccgtag caccagtacg cagtaattgt gcttggacaa ttggacaact ttgccgcgaa 6181 ttgccttcta atgtcgttta tgctactgcc attgatgcac tgattcaagc ctttgctgaa 6241 gaccaagatt tgggagtgcg cgaagatgcg aaagcctccc ttttaggagt tggtgacccc 6301 cgtggacttc agttgattga aactctggaa caggaaggat ggttttgatt ttagttatta 6361 gtcattagtc attagtcatt agtacaagac gattttttga ctgaatgacc aatgactaac 6421 ctctatactg aattcgataa atacggttat ttgcttcttc cgtgactaaa agactaccat 6481 cgggcaaaac gagtaagccc acaggacgtc cccaagtcgt tggtacagtt gggtctagga 6541 gaaatccagt cagaaagtct tcatagtatc cttgcggacg ccctttagtg tcaaatggga 6601 caaagacaat tttatatcca gtgccgcgat cgcgattcca agaaccacga aaggcaacaa 6661 aagcaccatt acgatatttt tccgggaatg tctgaccgtc ataaaactgc aaccccaatg 6721 cagctgagtg tgattgaaat agcacatctg gagtacgagt gcgacgtact aaatctggtc 6781 gtttgctagc atcacccgtc ttttgacgtg gatcgaggtt tttagctgtt aggtaagcat 6841 aaggccaacc atagaattct ccctgctgaa tgcgtgtcaa gtaatctggt accaactcat 6901 cgccaatacc atcacgttca ttaacagtgg cgtaaagttc cttagtcaca ggctgaaaat 6961 ccagaccaac tgggttacgt aagccagagg caaaagtctg ctgctgggaa ccatctaagt 7021 tcatcacctg tactgaagcc cgtggcaaag gttcttcgtc tgcgttagtt ccggaaccaa 7081 tcgaaacata tagcttttta ccatcgggtg agacaacaac attacgtgtc cagtggttat 7141 tgtagccctt agcaggcagg ctggcgattt ttgtcccaga accagtgatt tgctgctgat 7201 ttttagtgta gggaaaccgt aagactgcat cagtgttgcc caggaagaag gaattaccag 7261 caaaagccat accaaagggt ctatttagtc cgttggcttg acttgcaaag gtttgtttga 7321 catcagccac 
cccatcaccg ttggtatcac gcagtaaacg aatacgattt tgcccagttt 7381 cagttactaa aacatcaccg ttgggagtta aagctaacca gcgaggagca tccaaaccat 7441 cagcaaaaac gttgactaca aagccctgag ggacacgcag agtcgggttc tctggaattg 7501 gtactacctc aggtgacttt gaggcgctat cagtggcgta gggttttggc aaatcagcaa 7561 cattaatgcg gataggtgct ggtgaaaatg gttctgtacg aataccattt ttcacttgtg 7621 tagaactctg tgctagttga gcagatggtg aagattcggg tgtcgcttct tcttgggaag 7681 cacgagtctg gttacaggct gctgctgtag tcagtagcag aaaaagcagc aagcaacgca 7741 gatgtttcat ggtaggaatt tcaacaatga tgcagggctt cttaaactta gatcagatgt 7801 tttaaaacta tcaccattcc aaagtgcgat tttaccagtc atccgttagg tgaaaactgc 7861 tgtaatcacg gtataatagg atgttcagcc tcttaatatc catttgccga cggcatcaga 7921 agataagcaa taatcgagta acatctagat tcaggagttg ctgcatgagg cacaacgcag 7981 acgacacagc caataaacac ttcagttcag gtgctaacga actcaagctc aaagcttctc 8041 gaaatcaagt ttttacggct ccattaccag acccgattac aaaaaatatt gaagcgatta 8101 gttcacttca tatccaggaa gttcgagaca ttcccgccca ccagcggata ctagaagcga 8161 tcgctacatt ctttgggcga tcaacatttc tgtatagttt gctagtcata ctggctttgt 8221 ggatttttgg cagttttttt gatcgctttt taccattcaa cctgccctcg tttagctggt 8281 caaatcaagg cttagacgca gctgcattgg tgatttcaac cggagtgctg gtgcgacaaa 8341 ctcgccaaga aaactttgcc gaacagcggg cgcaactgat gctacagctt aacctgctct 8401 ccgagcaaaa aattgccaag attattgctc tattagaaga actccgtacc gatttgcctg 8461 atgtgataaa tcggcatgat tcggaagctg aattgatgca ggaaccggct gacccgatcg 8521 cggtattgga agcactccag aaaaacttgg ctcaagaact gtcatccaca gaagaaaaca 8581 atagttaatt gcagtttttt agtagatttt tgtttctatt tagagatatc aaagaaaagt 8641 atagtctttg tgtcgaaaag aaaaatttta cgattatttc gatcacccta atacaatcgg 8701 ctggacacgc ttaggagatg aagaacatcc tggcggtatg gcagttgttt tgagtaatgg 8761 ggaagagggg actaagtgga tggaagttgg gcaacccaac agcacttaca ttgacatcac 8821 tgaacatatc agcaaaccca tcacaacaaa tgatcaaggc tgggctgatt ttcgatgcag 8881 tgctggttcc gtttctgtgt gggttccaca gtcgtagtcg gggtgacgct cgtacatacg 8941 ttatacgttc ggaaagcata attgcgtcgc cccttgggcg atgactcggc aaaggggcgc 9001 tatggctacc cctgaagaag 
tagcggcagt cgttcatgga ggaaaccccc tctccagagc 9061 gcctcggctg cttgttcgtc attccgcccg gtcgggagaa tcgcgcgggc agagatatta 9121 ccaattatca gttatcagtt atcagttatc agttatcagt tatcagttaa aaaaagctga 9181 ttattttgat aactgttcac tgtttactgt tcactgattt catcacgact tcatagccgc 9241 ttccccaagt tcacccggtc gcacgcttag ctgttgagtt tgttcacccc gttgtacctt 9301 aatttgtaaa ggttggttga ttcgactgag ttctactaag tcttgcaact gttcagcagt 9361 cgtgatcgct tgttcatcaa cttgggtaat gacatctcct cgacgcacac cagcagtagc 9421 agcaggacta ttgggcatca cttgtaccac caatacacca tttatttgtg gtagagtcag 9481 cgtcgtattg gggtcactat tggattgttt tgcaaattct ggtgtgagag tcatcatccg 9541 aacgccaatg taaggatgtg ggattttgtc accacgaacc aaagcatctt taataagctt 9601 agctttgtca ataggaatcg caaacccaat tccctgagca tctgcacgaa ttgctgtatt 9661 gatgccaata acttcacctt gttcattgac caacggacca ccagagttac caggattgat 9721 agcagcatct gtttggataa agtccaagcg tttatccggg atgccaactt gagcgctgga 9781 tcgattcaac gtgctgataa tacccaaagt cactgtgttg tctaatccta acgggttgcc 9841 aactgcgatc gcccagtcac ccactttcaa gtcttgggag ttaccaaggg cagctactgg 9901 caaatctttc tcattaattt tgagaactgc caaatcagaa ggctgatcca cccctttcac 9961 ctctcctttt aacttgcgtc catccttgag ggtgacggtc acagaatcag caccgctgac 10021 aacatgagca ttggtgagaa tcatgccatt ggggtcaata ataaaaccag acccttctcc 10081 gcgttggtgg tactcttgag gcatagcaga gaaatcatta ccaaaaaaat cacggaaaaa 10141 tggatcacta aaatacgggt caggtgcacg cattttgatc gtacgatctg tatcaattcg 10201 cacaactgca gagccaactc gattgacagc agcactaacg aaactgcgga taggaacttc 10261 ttgggagttt gctgttggag gagtcgtcgt ttctactgtg ggagttatgg gtgcaacagc 10321 accctgtaga gatgatcgct taagaatttg taaattggga aaagcccata acgtcgtcaa 10381 agttaacccc acactcaaga cagcgattaa agcatggcta atcacttgac gtaaaacggg 10441 caagaaacgt tgttcttcag aagatgacaa agacatagta atttctgcta ttacaaagag 10501 cttataggtt gcataaaccc gaagaatcaa gtttttattg acaattcatc aaagttgtgc 10561 actttaattt caaaaaaatc agtgtcaata atatcggcac ctttacgacc attatgactg 10621 taggctgtga cgttcggctt gcagccacta gcaagcagaa aaaccttgat tgattgatga 10681 atttgtgata tgcaaacagt 
agaaaaaagc aatcggaaac cgctatgaaa gtctgggaag 10741 ttcactctaa agaaggaata gaagcattaa cactggttga aaaaccagaa ccacaaccaa 10801 aagcaggaca ggttctcctt aagatgcgtg ctgcttccct gaattatcgc gacttgttga 10861 cagtgaaagg ggcatatgga tctaagcaaa agctaccgtt tgttccgttg tctgatggtg 10921 tgggggaagt ggttgctgtt ggcgagggag tgagtagagt taaagtaggc gatcgcgtcg 10981 ctggcatttt catgcaaacc tggttagaag gagagttttc actagacaaa tcaaaatcag 11041 cgttaggggg agccattgat ggtatcttgg ctgagtacgt gacgcttgat gaaaatggcg 11101 ttgttcatgt cccagaacat ctttctgacg aagaagcagc atccttacct tgtgctgcag 11161 tgaccgcttg gaacgcccta acaacagatg gcaagctgaa agctggcgat actgttctca 11221 tacaaggtac cggaggagtg tctttatttg ccttgcagtt tggcaagata atgggagtaa 11281 aagtcattgc cacttctagc agcgacttga agttagagaa attaaaacaa cttggggcat 11341 ctgaacttat caattacaaa accacaccaa attgggatga gaaagtttgg caactgacaa 11401 atgaagtggg agttgatcgg atcattgagg ttggcggtgc tgcaaccttc aataaatcct 11461 tacgtgctgt acgctacggt ggatatatca gtttgattgg agtgctttct ggattcagcg 11521 cagatgtcag tacagtatcg attctgcaca aaggaattac cgtacaaggt atctatgtag 11581 gcagtcgcga tatgtttgag acaatgaaca aggcaattgc tttacatggt atcaaaccca 11641 ttgttgaccg agtgtttccc tttgaggaag tgcggcaggc gttagagtat atggaaagtg 11701 gggcacattt tggtaagatt gcgctgcgct tttagcacca ataaaccgaa aggtagaaag 11761 gcatctgtca atgaagccgg aagcctacac tgacctgacg gtacagtgta ggtagttcac 11821 cgctgtaagc ctacgcgaat cgagcacatc gcacccaact gcaacaaaaa acctcggcga 11881 aaccaaccga ggtgatgagg agaagtccaa atgtcgataa gtaccaacgt cgaaatctaa 11941 aaatattagc tgacgcccaa gacaagcgaa actagagcgg taacaaatat acgtagctcg 12001 tttttttggt ttgttttttt attttgaact tatgtaaata aatataacaa // LOCUS NODE_2819_length_12005_cov_4.90418412005 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 12005) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 12005) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..12005 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..979) /locus_tag="DP116_22295" CDS complement(<1..979) /locus_tag="DP116_22295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997941.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor, RpoD/SigA family" /protein_id="PRJNA477356:DP116_22295" /translation="MPTVETQTENFNAKFTADMVRTYLREIGRVPLLTREQEIVFGKQ VQQMMTLVDAKEALAKKLSREPSLQEWAAHVRKSETDVKQIVHQGKRAKQKMIEANLR LVVAIAKKYQKRNMEFLDLIQEGTLGLERGVEKFDPTRGYKFSTYAYWWIRQAITRAI AQQGRTIRLPIHITEKLNKIKKVQRELAQKLGRSPTPAEIGKELELEPAQIREYLNMA RQPVSLDVKVGDNQDTELQEMLEDEGPSPEYYMTQEFLRQDLNNMLSELTPQQREVLA LRFGLLDGNEMSLAKVGERLNLSRERVRQLEHQALAHLRRRRANVKEYIA" gene complement(2136..2765) /locus_tag="DP116_22300" CDS complement(2136..2765) /locus_tag="DP116_22300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316330.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_22300" /translation="MGRAADRDKGRVSLEKAKIILDGAMQEFLIHGYAGASMDRLAVA AGVSKPTLYNYFQNKEGLFTALIERLAQEKLQAILESQDSQGLQGETEVVLRRIAIKL LNTITGDQQLLAFIRLVIGESGRFPKLARSFVSSLDKPIILALTQHLATRHELLDPEA GARVIFGTLVYFVIIQEILCTEDVLPMEYERLIDTLVSLFSSQSQKSEE" gene 2935..4152 /locus_tag="DP116_22305" CDS 2935..4152 /locus_tag="DP116_22305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152858.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_22305" /translation="MASDLRDVQITDCCIVGGGPAGAVLALLLARQGISVTLLETHKD FDRDFRGDALQPSVMEIMEELGLSERLLQLPHSKARQPQMHTEQGDFTFIDYSHLKTR HPYLTVLPQVHFLELIIEQAQQYPSFRLIMGANVQELLEEEGVIRGVRYRGQGGWHEI RATLTVGADGRHSRVRQLAGFEFPSRNTPPLDVLWFRLPRYRDEPDELNALVSHGQFI VLMNRNDQWQVGYVIPKGEYQKLRSQSLEVFRQSIVKAVPQFSDRIEQLQRWSQIAYL SVESGCLKRWYRAGLLLIGDAAHIMAPFGGVGVNYAIADAVVAANLLSKPLKAKQVNL RDLAKVQRRRELPTHIIQVFQSMIQQQMLIPGLDSTVPFQPPALMRLPFFPRFLARFL SFELVPVHVAPAA" gene complement(4317..6047) /locus_tag="DP116_22310" CDS complement(4317..6047) /locus_tag="DP116_22310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317538.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_22310" /translation="MNLGQPLGSVIQGSLTGGLEVRLHPDVSVEDMRVGKFLVVQGVR SRFFCMLTDVALGIANQRIIANPPSWEDTFLRDILAGGGTYGTINLAPMLMFTPESEE SFSPTNGKSANPFVPSATGLASFQPQTSTTMELLPVKTIPSHFSQVYEASEEDFRRVF GWEDDPQRKNFSIGKPLDMEVPVCLDLNRFVERSNGVFGKSGTGKSFLTRLLLAGTIR KNAAVNLIFDMHSEYGWEAVSEGKQMSTVKGLKQLFPSSVEVYTLDPESTKRRGVPHA QELYLSYDQIEVEDIKLCSRDLGLSEASLDNANILFAEFGKSWILQLINMTNEEIKSF CEEKRGHQGSITALQRKLLRLENLKYMRAACPQNYVNQMLRSLEAGKNIVVEFGSQSN MLSYMLVTNMITRRIHQHYVTKAEKFLQTKNPNDRPTQLMITIEEAHRFLDPATVQST IFGTIAREMRKYFVTLLVVDQRPSGIDNEVMSQIGTRITALLNDEKDIEAIFTGVSGG GALRSVLAKLDSKQQALILGHAVPMPVVVRTRPYDTIFYEEIGETAWEEKPDEEVFAA AELAKADLGF" gene complement(6281..7078) /locus_tag="DP116_22315" CDS complement(6281..7078) /locus_tag="DP116_22315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654996.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutathione S-transferase" /protein_id="PRJNA477356:DP116_22315" /translation="MLELYQFELSQYSEKVRLILDYKGLDYRKIEVTPGVGQVELFRL TGQRQVPVLKDGSRYIVDSTEIAKYLDSQYPVRPLIPTDPKKRGLCLMMEEWADESIG IKGRKALFSAISQSQNFRKSLLPTTTPDVLKSLVEGVPNDILRVLGFGVGYSPDVIRS AIADLKQDLEALTLLLADSPYLVGDEPTLADFAVAGLSILLKFPSGPYLNLPETIRGK GVPELVDNPVYAPFFAWRDRLYAQYRKPLLTYTTTGSPSAPTSINID" gene 7558..8433 /locus_tag="DP116_22320" CDS 7558..8433 /locus_tag="DP116_22320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317540.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fasciclin domain-containing protein" /protein_id="PRJNA477356:DP116_22320" /translation="MKVENSNLLTKLAGIIGVTGISLLTGLPTGANEVLNSNSSIFKE ATYNDAQRFLVNAQYTQPSESVTAATKSKKTPPAKNTTVAQRRGGLNPAPSILQECPY NRAACPGGSDTSTPPATPPGGEIPTTPPATPTTPPAPGTQTPPKEPAAGTESKNIVAV AESNGSFTMLTKALKAAGLAETLQGKGPFTVFAPTDAAFAKLPQDAVQDLLKPENKEV LVKILRYHVVQGSVTSKDLKSGEVKSIEGGPINVKVDPKTGVTVNDAKVVQPDIKASN GVIHVIDNVILPPDL" gene 8603..9241 /locus_tag="DP116_22325" CDS 8603..9241 /locus_tag="DP116_22325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016858778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_22325" /translation="MLLELRQLRIQPGQQLLLEDVSWQQFENILAELGEHRAARLSYS HGFLEIMVPLPEHEKAKEMIGDLVKILLNERSINYDSLGSTTLRSEKMTQGVEPDACF YIQNQAAIIGKNRLDLSIDPPPDLAIEIDLTSRTQLENYQILGVPELWRYGKQGLQIN ILQSGKYVESNFSPTFPDIPIIELVNQYVQQSQVVGSSQAIQAFRSWVRDNI" gene 9450..10256 /locus_tag="DP116_22330" CDS 9450..10256 /locus_tag="DP116_22330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flagellar assembly protein H" /protein_id="PRJNA477356:DP116_22330" /translation="MKTDTIFYRLFQSFPSIFFELINQPPETADTYQFSSVEVKQLAF RIDGVFLPKSNPSSPIYFVEVQFQPDKKFYSRLFTEIFLYLDRSELTNNWRGVVVYPS RSLDVGETERYIELLTSGRVSRIYLDELDSAAEQSIGIGTVKLVIEPESGAATKAREL INLAKQQIADEITQREFLELIETIIVYKFPLKSREEIEQMLGLSELRQTKVYQEAKQE GKQEGKLEAIPFMLSLGATVEQIADALGLDIELVRLVAAKAKPNQEQSGE" gene complement(10268..11776) /locus_tag="DP116_22335" CDS complement(10268..11776) /locus_tag="DP116_22335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873290.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phenylacetate--CoA ligase family protein" /protein_id="PRJNA477356:DP116_22335" /translation="MNSEAQRQRAIKAFTDFLQTPLDTLLQQHINTNTSAAALALFHD MAASVPAYKHFLANHAVNPDEYQTLEDFQRLPQLVKENYLQRYPLAQLCRNGQLETCD MIAVSSGSTGKPTFWPRFFADEMQIATRFEQVFHDSFHADTRRTLAVICFSLGSWVGG MYTTNCCRYLASKGYPITLITPGNNKEEIFRVIQELGSAFEQVVLLGYPPFLKDIIDT GIARGIEWQRYQIKLVMAGEVFSEEWRSLVGERVGSQHCCYDSVSLYGTADAGVLGNE TPLSICIRRFLAQNPEAARALFGESRLPTLVQYDPISRFFEVIDGTLIFSGNNGIPLV RYNILDTGGIISYDAMLQFLKKWGFNPVAEIEQMGGRGVRALPFVYVFGRSNFTVSYF GANIYPENVTVGLEQPVIREWVTGKFVLQVQEDADKNRFLSVVVELAPRVEESEEKRE AIASSILTQLQRLNSEFANYVPPEYQMPQVALAPTGDPEYFPVGVKHRYTRK" BASE COUNT 3413 a 2628 c 2486 g 3478 t ORIGIN 1 tagcaatgta ctctttaacg ttggcacgac ggcgacgcag atgagcaagc gcttgatgtt 61 ctaactgacg aacgcgttcg cggctaaggt tcaaccgttc gccaacttta gccaaagaca 121 tttcgttacc atctaataaa ccaaagcgca gggctaaaac ttctcgctgt tggggggtga 181 gttctgacag catattattc aagtcttggc gtaagaattc ctgcgtcatg taatactctg 241 gtgatggccc ttcatcttcg agcatttctt ggagttcggt atcttggtta tctccaacct 301 tgacatccaa agaaactggc tgacgcgcca tgttcagata ctcacggatc tgagctggtt 361 ctaactctaa ctctttgcca atttcagcgg gagttggtga tcttcccaac ttctgagcaa 421 gttcccgctg gactttctta attttgttta gcttctcggt gatatggata ggtaggcgaa 481 ttgtacgacc ttgttgggcg atcgcacgag taatagcttg gcgaatccac cagtaagcgt 
541 atgttgagaa tttatagccc cgagttgggt caaatttctc gacaccccgc tctaatccta 601 gtgttccttc ttggataaga tccagaaact ccatgtttcg cttctggtat tttttggcaa 661 tagcaacaac caagcgcaaa ttcgcttcta tcatcttctg ctttgctcgc ttgccttgat 721 gtacaatctg tttgacatca gtctcagatt tacgaacatg agcagcccac tcctgtaagc 781 taggttcgcg gcttaatttc ttcgccaaag cctccttagc atccacgagt gtcatcattt 841 gttggacctg cttcccaaag acaatctctt gctcacgggt tagcagtggt acacgaccaa 901 tctcgcgcag ataggttcgc accatatcag ccgtgaattt ggcgttaaag ttttcagttt 961 gggtttcaac agtgggcatt ggtgcgtttt cctctactcc gtaaacaaaa attaatactc 1021 aataactaag gcggattggg aaagttagta cacgtattac gcaaccaggg aaactatcag 1081 gtgaaaattg atttctacag attctgtcaa gacgaggact ctgttggatt agtttcccca 1141 ccttccctaa attttgaaat cgtttctaag catgctggct tgttttcaag aattctttta 1201 atccttaagt cgctggtgac gaccttaatt atacgaagtc agtatgagtt ctacttaatc 1261 tatgataaga caaatcatcc caagagtaaa gtagtttcgt taaatctttc tattataatg 1321 caaaattata tggttacaca aaaagcctga ttttcttttt ttaaacttac tacatatata 1381 gtgactctaa gatcccatga tcagtcctcc acctatagcc ggataaccga acagaatcga 1441 ttctcctgat ggagggatgt cacgaaatca agaaatcagg aatatgagat tgagaattag 1501 tagttgatat attcttgttt gtgttaaccc aaatttacaa ttcgcacttt ctccactaaa 1561 aactcagtag ccgaaaaaga caagcttggc ttgagagcaa gctcaaaagc tacgtaggtg 1621 tctgagtcat gaattttaag ccaattccaa ctcccctttg aactgaaata cgacttccgg 1681 tgtcaggcaa agtagtcttc aagcagggca tacaagagcc acacgcgacg ataatcagtc 1741 ttaatatctc ctagggaaat gccctttaac attttcctaa aacaagtgca ccgcagttga 1801 actttatttg ctgatagcat cgcaggatca gtggcaaata acttacccac tcgcttgagc 1861 agcctctgcc caaaatcttc ctactctcgc aatactacgc ccccgtgaat ccgcaggaag 1921 gaactatcta ctataataac atctttctcg ttatagatgg caaaagctga gaatggaatc 1981 atcgcagtca cagcatatgg tttgtttgag gtgggataac atttcctagc tccgtttgaa 2041 gtctataaac tgaaacaggc ttgctccctc gcgattctct tgggttattg atggaggtag 2101 tatttgagat tttcggtgag aagtttgctc ataggttatt cctcgctttt ttgactctgt 2161 gaagaaaaca gagaaaccaa agtatcaatc aagcgctcgt actccatcgg taaaacgtct 2221 tctgtacaga 
gtatttcctg aataattaca aagtatacga gtgtcccgaa aataactcgt 2281 gctcctgcct ccggatcgag taactcgtgc cgcgtcgcga gatgctgagt gagagcaaga 2341 attatcggtt tatccaaact actaacaaaa gaccttgcta gctttgggaa gcgacctgat 2401 tctccaatca ctaatcgaat gaaagcaagg agttgttgat cgcctgtaat cgtgttcaat 2461 agtttgattg caatgcgccg cagaacaact tccgtttctc cctgcagccc ttgggaatct 2521 tgagattcca aaatcgcttg aagtttttcc tgtgccaatc gctcaatcag agccgtgaat 2581 aatccttctt tattttgaaa gtagttgtaa agggttggtt tagaaacacc tgctgcgact 2641 gcaagacgat ccatactcgc accagcatag ccgtgtatca agaattcttg catcgcacca 2701 tccaggatga ttttggcttt ctccaaagac acgcgcccct tgtcacgatc agcagctcta 2761 cccattttca aagtgcctcc tccaaaattt gacctgcaag tggtgagtgc aagggaaaaa 2821 gttcacgaaa actccctcag cacttaatca tattttacta aacggtctag ttaaattact 2881 aaactgttta gtaaaatata atcaggcaat gtaaagtaat taggaggcaa tcctatggct 2941 agtgacttgc gagacgttca aatcacagat tgctgcatcg ttgggggtgg tcctgctggg 3001 gcagttctcg ctctgctgtt agcacggcag ggaatttcgg tcacgctttt agagactcac 3061 aaagactttg atcgcgactt tcgtggtgac gctcttcaac catccgttat ggaaattatg 3121 gaagaacttg ggttgagcga gcgtttgcta caactccccc acagcaaggc tcgccagcct 3181 caaatgcaca ctgaacaagg ggacttcaca tttattgact atagccatct caaaacccgc 3241 cacccctatc tgacagtgct tccacaagta cattttctgg agttgattat tgagcaagcg 3301 caacagtatc caagctttcg cttgattatg ggtgctaatg tccaggaatt gcttgaagag 3361 gaaggagtca ttcgaggagt tcgctatcga gggcaagggg ggtggcacga aattcgcgca 3421 accttaactg tgggcgcaga tggacggcac tctcgtgtgc ggcagctggc tggctttgaa 3481 ttccccagta gaaatacacc tccattagat gttttgtggt tccgcttgcc tcgctatcgg 3541 gacgaacctg atgagctaaa tgccctagtt agtcatggac aattcattgt gctaatgaat 3601 cgcaacgatc aatggcaggt tggctatgtg attcccaaag gtgagtatca aaagctacgc 3661 agccagagtt tagaagtatt tcgccagtca atcgtcaaag cagttccaca gtttagcgat 3721 cgcattgaac agttacaaag gtggtcacag attgcttatc tttctgtaga aagtggatgt 3781 ctcaaacgct ggtaccgcgc tggcttgcta ttgattggag atgccgctca catcatggct 3841 ccgtttggag gtgttggtgt taactacgca atcgctgatg cggttgttgc agcaaatcta 3901 ctgagcaaac cacttaaagc 
aaagcaagta aacttacgtg accttgctaa agtgcaacgt 3961 cggcgagaac ttccaaccca tattattcag gttttccagt cgatgattca acagcaaatg 4021 ttgatacctg gactggactc aactgtacca tttcaaccac ccgctttaat gcgcttacca 4081 tttttcccaa ggtttttggc gaggttcctc agttttgaac ttgtacctgt tcatgtagcg 4141 ccagcagcct aagagaatgt ctctagaaca ccgattcagt aataaacgcg agactgataa 4201 acccggtttt tacgtctttt caaaccaaaa tttccttatt gcgacccaaa gaaacccagc 4261 tttttcgtca taactctgaa tactgaatcg gtcttctagc tttactgtgc ggcgctttaa 4321 aacccaagat ccgctttcgc taattccgcc gcagcaaaca cttcttcatc tggcttttct 4381 tcccaagcag tctcgccaat ttcttcataa aatatggtat cgtaaggacg agtacgcaca 4441 accacaggca taggaacggc gtgacctaaa attaaggctt gttgcttaga atccaacttt 4501 gccaacacag atcgcaacgc acctccacca gacaccccag taaaaattgc ttcaatgtct 4561 ttttcatcat tgagcaaagc tgtgatgcga gtaccaatct gggacatcac ttcattatct 4621 atgcccgatg gacgttgatc aacgaccaga agtgttacga agtatttccg catttcgcgg 4681 gcgatcgttc caaaaatggt actttgtaca gttgcagggt cgaggaaacg gtgcgcttcc 4741 tcaattgtaa tcatcagttg cgttggtcta tcgttgggat ttttggtttg taaaaacttt 4801 tctgccttag tgacgtagtg ttggtgaatc cgtcgggtaa tcatattagt caccaacata 4861 taagagagca tatttgactg ggagccaaat tccacgacaa tattcttccc agcttctagg 4921 gaacgcaaca tttgattgac ataattttgc ggacaagccg cacgcatata ctttaaattt 4981 tctaatcgca ggagtttgcg ctgtagtgct gtgattgaac cttggtgtcc tcgcttttcc 5041 tcacagaaag acttgatttc ttcgttggtc atatttatca attgaagaat ccaagacttg 5101 ccaaattcag caaataaaat attggcgtta tctaaacttg cttctgaaag tcctaaatca 5161 cgagagcata atttgatatc ttcgacttca atttggtcgt aacttaagta aagttcttga 5221 gcgtgaggaa cgcctcgacg cttggtggat tctggatcaa gagtgtagac ttcaactgaa 5281 ctgggaaata gctgttttaa cccttttaca gtactcattt gtttgccctc agaaacagct 5341 tcccagccat actctgagtg catatcaaaa atcaaattta ccgcagcatt tttgcggatt 5401 gtcccagcta aaagtaaacg tgtcaaaaag gatttacctg ttcctgattt cccaaaaacg 5461 ccgttgcttc tttccacaaa acggtttaaa tcgaggcaaa ctggtacctc catgtccagc 5521 ggttttccaa tggaaaagtt ttttctttga ggatcgtctt cccacccaaa tacccggcga 5581 aaatcttctt ctgaagcttc atacacctgg 
ctaaagtggc tgggaatcgt tttcactgga 5641 agcaattcca ttgttgtact ggtttggggc tgaaatgatg ccaaaccagt cgctgatgga 5701 acaaagggat ttgcagattt gccgtttgtt ggagaaaaag attcttcaga ttcgggggta 5761 aacatcaaca tcggtgcgag gttgatggta ccatatgtcc ctcctcctgc taagatgtct 5821 cgtaaaaaag tgtcttccca actcggagga ttcgcaataa ttctttggtt ggcaattcct 5881 aacgccacat ctgtgagcat acaaaaaaaa cgcgatcgca ccccttgcac aactaaaaac 5941 ttacctaccc gcatgtcttc tactgaaaca tcggggtgca atctaacttc taatcctcca 6001 gtgagagagc cttgtataac tgaacctaat ggctgtccca aattcatttc attaatcaca 6061 cagtgatgag cttgctgtta tcagttacca gttatcagtt atcagttatc agttcccagt 6121 gtcgtatccc tttgttcact gttcactgtt cactgtttac tgttcactgt tcactgtttt 6181 atgttgcatt gtaattgtag aagacaaaat tggtgatgta acttttgtcc tttgtcattt 6241 gttaaaaact aatgactaat gactaatgac taatgactaa ttagtcaatg ttaattgaag 6301 taggcgcact tggacttcct gttgtggtgt aagttagaag cggtttacga tattgggcgt 6361 agaggcgatc gcgccaagca aagaacggtg cataaactgg attatcgact aattcaggaa 6421 ctcccttgcc tctgatggtt tctggtaaat ttaaataagg accactcgga aacttcagca 6481 atatcgataa acctgccaca gcaaagtcag ctaaagttgg ttcatctcct actaaatagg 6541 gactatctgc taacaacaat gtcagtgctt ctaaatcttg ctttaagtcg gcgatcgcac 6601 tgcgtattac atctggactg taaccgacgc caaaacccaa aactcttagt atgtcgttgg 6661 gaactccttc gaccaagctt ttaaggacat ctggtgtcgt cgtaggtagt aaagatttgc 6721 ggaaattctg gctttggcta atggcagaaa acagtgcttt gcgacctttg atgccaattg 6781 attcatccgc ccattcttcc atcattaaac ataaaccccg cttttttgga tcggttggta 6841 tcaatgggcg taccggatat tgtgagtcta aatacttagc tatctccgtt gaatctacta 6901 tatacctact accgtctttc aagacaggta cttgtcgctg accagtcaac cgaaatagtt 6961 ctacctgccc aactcctggc gtcacctcga ttttacggta gtccagccct ttataatcaa 7021 ggataagacg cactttctct gagtattgag atagttcaaa ctgatataat tccagcattt 7081 tttctttcct gtacgtggtt gacttaaata aattttacaa gtttattttg gttaggagaa 7141 tttagattcc aatcgacgcc tgataaactg aataagggtg gataaatttg taaggtgatt 7201 ttacattctg agttcttctg acagaactct gcggtgaaat tctggttgac tggatttttc 7261 tctataaaag tgcatctatt atgcctttca atcgtggcat 
aatgcaacta tgcgacattt 7321 ttccttttca ggaaacatgc ggaggtaatt ctatcgtctc tggcttcctt gttcaatcaa 7381 acttccttta tacagacagc taattatact cataattctg caaattgttc aacattttca 7441 tactgatact taatttattt atggctacaa atctcccttc tttagttaga ttttctgtat 7501 agcaagatga tataaatcat tatggtaatg tgtggaaatg aaagccaaac aaaatttatg 7561 aaggttgaaa acagcaattt gctgaccaag ttggctggta taataggagt gacgggcatc 7621 agtcttctta ctggtctacc cactggagca aatgaggtat taaattctaa ctctagtatt 7681 ttcaaggaag caacatataa cgacgctcaa cgttttttag taaatgctca gtatactcaa 7741 ccgagtgagt ctgtaacagc agcaacaaag tccaaaaaaa ctccaccagc taaaaataca 7801 acagtggcac aaagacgggg aggattaaac cctgccccca gtattttgca ggagtgtccc 7861 tacaaccgtg ctgcttgtcc tggtggatct gacacttcta cacctcccgc gactccccct 7921 ggaggtgaga tacccacaac accaccagct acacccacga cgcctcctgc acctgggaca 7981 caaacgcctc ctaaggaacc agcagcgggt acagaaagta agaatatcgt agcagttgca 8041 gagtccaatg gttcctttac aatgctcacc aaggctttga aagcagctgg attggcagaa 8101 accttgcaag gcaaaggtcc tttcaccgtc tttgcaccta cagatgcagc atttgccaaa 8161 ttgccacaag acgctgtaca agatttattg aagccagaaa ataaagaagt cctggtgaag 8221 attttgcgat atcatgtggt gcaaggttcg gtaacgtcca aagatttgaa atctggcgaa 8281 gtcaaaagta ttgagggtgg tccgattaat gtcaaggtag atccaaaaac tggtgtaaca 8341 gtcaatgatg ctaaggtggt tcagcctgac atcaaagcta gtaacggcgt tattcatgtt 8401 attgacaacg tgattttacc tcctgacttg taagtttttg acactggtgc attgtgtaac 8461 aaaactaggt gaaataaata ttttaaaaaa acccgtttca agaatgattt ggacgggttt 8521 tttcatattg gtaaacctgt tatatcaagc gatgtacgta acgaagtcga tatattgtaa 8581 taaaagagtt ggatgaatat ttatgctatt agagttaaga caattgagga ttcaaccggg 8641 acagcaattg ctacttgagg acgttagctg gcagcaattt gaaaatattt tggcagaatt 8701 gggagaacat cgtgctgcca gactttccta tagtcatgga tttttagaaa ttatggttcc 8761 tttgcccgaa catgaaaaag ccaaagaaat gattggcgat ctggtaaaaa ttttattgaa 8821 tgaacgaagt atcaattacg attctttggg ttcaacaacc ttaagaagtg aaaaaatgac 8881 tcagggagta gaaccagatg cctgttttta cattcaaaat caagcagcaa ttattggaaa 8941 aaatcgccta gacttgagta tagatccacc cccagattta gctatagaaa 
ttgatttaac 9001 ttcccgcacg cagttggaga attaccaaat tttgggagtg ccagaacttt ggcgatatgg 9061 aaaacaagga ctacaaatta acatcttgca aagtggaaag tatgtagagt cgaattttag 9121 tcctactttt ccagatattc cgataattga gctagtaaat caatacgtcc agcagagtca 9181 agttgttggt agcagccaag caattcaagc ttttagaagt tgggtgcggg ataatattta 9241 attttggctt tactttgaga tcggagtcag tggaggtttt ttgatgccta ccacaaacaa 9301 acagcgggga atgcgaattg cccatcaaga gggtcagcct ctagaagaat agtgaggtaa 9361 agccttacag tattcattcc cagccggagt ctgggaacaa caaatcaagt aaaatcaaaa 9421 cttacgccac caacccagca tcaatcagcg tgaaaacaga cacaatattc taccgcctat 9481 tccaaagctt ccccagcatc ttctttgaac tcatcaacca accacccgaa accgctgaca 9541 cctaccaatt ttcatccgta gaagtcaaac aactcgcttt ccgcatagat ggtgtatttc 9601 tccctaaaag taatccatca tcccccatct acttcgtcga agttcaattt caacctgata 9661 aaaaattcta ctctcgcttg tttacggaaa tctttcttta cctggataga agcgaactta 9721 ccaacaactg gcgcggggtg gtggtttatc caagccgcag tcttgatgtg ggagaaacag 9781 aacggtatat tgaattactc acatctgggc gagtcagccg catctatctt gatgaattag 9841 actccgcagc agaacagtcg attggtattg gtacagttaa actggttata gagccagagt 9901 caggcgcagc tactaaagct agggagttaa ttaaccttgc caaacaacaa atagctgatg 9961 aaattaccca acgggaattt ctcgaattga tagaaacgat tattgtctat aaattccctt 10021 tgaaaagtcg ggaggagata gagcaaatgt tgggattaag cgagttaagg caaactaaag 10081 tttatcaaga agctaagcag gaaggtaagc aagagggtaa attagaagcc attcctttta 10141 tgttgagttt aggtgcaact gtagaacaaa tagctgatgc gttggggttg gatattgagt 10201 tagttaggtt ggttgctgct aaagctaaac caaaccaaga acaaagcggc gaatagaaca 10261 ctctctccta cttacgggta tagcgatgct tcactccaac agggaaatat tctggatcgc 10321 ctgtgggggc tagtgcaact tgcggcatct ggtattctgg aggaacgtag ttagcaaatt 10381 cgctattgag gcgctggagt tgggtgagaa tggaagatgc gatcgcctcc cttttctctt 10441 cactttcctc cactctcggt gctaactcca caaccacaga taaaaatcga ttcttgtctg 10501 catcttcttg cacctgcaac acgaatttac ctgtcaccca ttctctaatg actggttgct 10561 ctaatcccac tgtcacattt tctgggtaga tattcgctcc aaagtaagaa actgtaaaat 10621 tagagcgtcc gaaaacatag acgaatggta gcgcacgaac acctcttcca 
cccatctgtt 10681 ctatttccgc cactggatta aagccccatt tttttaaaaa ctggagcatg gcatcataac 10741 tgattatccc tccagtatcc aagatgttgt aacgcactag gggaatgcca ttatttccag 10801 aaaatatcaa cgtgccgtct ataacttcaa aaaagcggct tatgggatca tattgtacca 10861 gtgttggtaa acgagattct ccaaacaagg cacgcgccgc ttctggattt tgtgctaaaa 10921 aacgacgaat gcagatactc aaaggtgttt cgttacccaa tacacctgca tctgctgttc 10981 cataaagtga gactgagtcg tagcagcaat gttgggaacc aactctttca cccactaaac 11041 tgcgccattc ttcgctgaat acttctcccg ccatcaccaa ctttatctga tatcgctgcc 11101 attcaattcc acgggcaatt ccagtatcaa taatatcttt aagaaatggt ggatatccta 11161 ataagacaac ttgctcaaaa gccgaaccga gttcttgaat aactcgaaat atttcttctt 11221 tattattacc aggagttatc aaagtaatcg gataaccttt gctggcaaga tagcggcagc 11281 aattggttgt gtacattccg cctacccagc ttcctaaact gaaacaaatt acagctagag 11341 ttcgtctagt atctgcatga aaactatcat gaaatacctg ctcaaaacgg gtagcaattt 11401 gcatttcatc tgcaaaaaaa cgcggccaaa atgttggttt tcctgtggaa ccagaggaaa 11461 cagctatcat gtcgcacgtt tctagctgtc cgttgcggca taattgggct agaggataac 11521 gttgcaggta attttctttt acgagttggg gtaatctttg gaaatcctct aaagtttggt 11581 attcatcagg attaaccgca tgatttgcta agaaatgttt gtaagcaggt acactagcag 11641 ccatatcatg aaataacgcc aaagcggctg ctgaggtatt agtatttata tgctgttgta 11701 ggagtgtgtc taagggagtc tgtaaaaaat ctgtaaatgc tttaattgcc cgctggcgtt 11761 gtgcttccga attcatgacg ttggtttgtg taaaaaattc cgattgctta ttccaccaag 11821 taaaacttag accactgtga cttattgtga atattttaac ctttggcagg cgtaatgaaa 11881 tattaatacc atttctttgt aaggctgcgc caaattttct tgacttcttc tttctttctt 11941 tctttgtgta ctttgcgacg ccagtcgcct caagtcgggg gacccgccca cggcgctggc 12001 tcccc // LOCUS NODE_2848_length_11910_cov_5.00742311910 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 11910) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11910) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11910 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(663..1964) /locus_tag="DP116_22340" CDS complement(663..1964) /locus_tag="DP116_22340" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22340" /translation="MDTNETGSIASIIEKRLPLARKIVDVEAELKFLDSAVQHLQNSQ NQLLKRLDDSSVPSLLENICLTTQQLSINTELEALAKLKERFCHKTLNDEKYASLRQQ QLNLLQNQVKDELEKVNKALDLATHDDIHGLALFEVKFKELWLAVTNSLEELLTELRQ KREAVDMDFKKQVEAVLQTCRSDTGIPSIQEIKQRFCLEKSYETIYEKYLNEIRAHLS KHLSSLDIGLERSLNKVKSQVTQILMDKGHLEKLTPAKGTEFMEAIAKQIPDELIPGI PSQLKYGLQTLAKFKLSYRGFLQYRIRKCLDGLTPNQPATLKLSASSSAEQVLLNLKI AYAEAVSKCEKALKELLCEPSQVIYAIVEEFVDCILRARDVESEWRIFLQDVRRQIWQ EFQHLADTPLPKPSLWANAHKEATKDDHTQQPYEVSPTHDF" gene 3334..4536 /locus_tag="DP116_22345" CDS 3334..4536 /locus_tag="DP116_22345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746322.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sorbosone dehydrogenase family protein" /protein_id="PRJNA477356:DP116_22345" /translation="MKARNLLIGCALLLTITLPGNAQQSQELRIEQVEGYLVTPQQLE FDESLLQQLQLPAGFRINVFAKDLGNPRMLAVAPDGTIYVTRREQGDVLALLDSNQDG RADEIRTVASGYKYINGITIYQNRLYFVTDRQLYAAPLQSPGVIGEPQELINDLPDAG QHPNRTLAFGPDGLLYITVGSTCNACRDTNPENATILRAQPDGSNRTIYATGLRNTIG FAWHPETGELWGMDHGSDWRGNEQPPEELNLIAEGANYGWPFCYADRRPDVYLPANPT GTTKEEYCANTQPPVLTYTAHSAPLAMTFYTASQFPEEYRNDAFVTMRGSWNRNPPSG YKVVRVRYENGKPVKFEDFITGFLDEEKLTQFGRPVGLAIAPDGSLLFTDDTNGVIYR VSYVGTGN" gene complement(4620..6791) /locus_tag="DP116_22350" CDS complement(4620..6791) /locus_tag="DP116_22350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198961.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipopolysaccharide biosynthesis" /protein_id="PRJNA477356:DP116_22350" /translation="MTPPIVKRYLIAFDKYKWIGLASFGLVVAGSMAVALQPQQSRYV ANGALTYNRPTVSFSATGSEINLQGQELSKDILLSEQVIDAAAAKVNVKPTKIRQNFV LTMPEKDSKTGGDKQLTPTLIELKYNDTKIQAAQDTLQALMEAMIALSAEINTRRLKA VIGKVNERLPQAKKELQAAEQKLEVYDRKERPAILAAENGSLLNAVTSSQIQQRQTQL VLAGIDTQIRSIQEKLGLTVNQAYVSSALSADPIIANVRVQLYQIESQIEVLKKEGLR PEHPTMIQLRNQKEAYENQLRQRGSEVVGGKDGTAPLPATIAIRSRSNLDPARQQLAN QMVALQTQRETLQQQFLDLTKDEARLRNEYSLIPNKQLERSRLEQEVALKKAVYDQMQ VKLTDAKTAEAETVSSLGIARPPIVIDANKDRLSPPITLGIGSLLGLLVGGGVIFLLG SLETTFKTKEDIRDSLKQREVPLLGELPLMPVDGLDLGEVPVVLSLDSLYLEFYEKFR SNLRRVGGRKLKVVLITSTSSGEGKTVSAYNLGIASARAGKRTLIIEADLRSPSHSSS LGVTSDPDAIIEPLRYYANLGEYIRLVPDIENLYILPSPGPVRQSAAILESSEIRRLM EDVRERFDLVILDTPALSLSNDALLIQPYTDGIVLVTRPNNTQENILTEAIDQLVESD LELLGVVINGADITMPLSYSYMGSNYPPEEPQARVINGARR" gene complement(6932..8347) /locus_tag="DP116_22355" CDS complement(6932..8347) /locus_tag="DP116_22355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742823.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sugar ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_22355" /translation="MRLFNALSSAGLYVGVFVVTIVQPVVAQTSTQKRTSGQAPAPLP NTPARQVPAPLPNTPAGQLLRQPPPPAPSSTDFIAPGLQEGMSPQLNRYLLGPGDSIN VLVQRTPGPYRLGPGDSIGVSVLRFPDLSFQALINPEGNIVVPLLGTLSLKGLTLQQA QEKIRSGYNRFVVNPNVTLSLLAQRPEFNFSAQINPEGNIIVPQVGTVSLQGLTLEEA QEKIRLALSRQLVNPLVSLSLIGQRPVQVTINGQVNRPGVYPITSATPRVSDALLLAG GSAMMADLRQVQVRRKLVDGSVVSQTIDLYTPLQNGGDIPNLRLQDGDAIIVPRRELA NDKSYDRALVSRSTLATPQIRIRILNYAGGGIVTVPLPNGSTFIDVLAGINTDSANLS EIALIRFDPERGRAVKQTINGKKALSGDASQNVPLQDNDVIVVGRSLIGKITNIIGTI ARPFYDIRTFLRFFGVNDYRY" gene complement(8462..9217) /locus_tag="DP116_22360" CDS complement(8462..9217) /locus_tag="DP116_22360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994672.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyanoexosortase B system-associated protein" /protein_id="PRJNA477356:DP116_22360" /translation="MISFSKFLKEQRFSQIVALLLLLLLLLIGAIPGYLTGHWEWQQL PRITRLQELKDLRQKGLILPGWQTIEQREQEIGEGTWSYQFIQKKSDQTKAVLLLRPQ TGSKDQPVVEWTEINSLWQWQIAQYRSADFTVKPQGTESNAAETKVQARFFRASTNQQ TFAVLQWYAWFNGGSSSLFPWFIVDQLAQLQKKRAPWVAVSILIPMEPLGQVETTWNE AKSLGQTVQATLMAGPLSSRSPMHLYKKSKVLR" gene complement(9248..10120) /gene="crtB" /locus_tag="DP116_22365" CDS complement(9248..10120) /gene="crtB" /locus_tag="DP116_22365" /EC_number="3.4.22.-" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198958.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cyanoexosortase B" /protein_id="PRJNA477356:DP116_22365" /translation="MQIRHQLKNANTTRLITPAILGILLLLYAPVLFYWVQGWLNKSI SIEHEYFSHGIIGLPFAAYLCWLNRKKWQRLPNISHPLGVVLLVLGGVFYLSGVSEWV NISFPTILTGLCFWLKGIPGLKLQGFALILVLLATPTSVPYLITPYTLPLQSFIAGTA GFILSQLGMQVTVDGINIYVGGRIVEVAPYCAGLKMLFTTLYVGLMLLYWTGALSSRR KSIWFLSIAVLISICANIVRNTLLSFFHGTAQDGAFDWLHNGWGGDLYSAVMLLSLVP TLNWIEKYFAEVSE" gene complement(10228..11394) /locus_tag="DP116_22370" CDS complement(10228..11394) /locus_tag="DP116_22370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="erythromycin biosynthesis sensory transduction protein eryC1" /protein_id="PRJNA477356:DP116_22370" /translation="MNNKTVMVPFVDLTLQHQPIQTQLQQAIQGVTERGDFVLGQALK EFEAAFAAASGVEYGVGVASGTDAIALGLQACHIGSGDEVILPVNTFVATLIGVIRAG AKPIFVDCDPQTALIDLEAAARAVTPQTKAILPVHLYGQMVSPRQLLDFADTFKVLIF EDAAQAHLAEREGYRAGSVGTAAAFSFYPSKNLGAFGDGGMLLTRNPDVAQKMLRLRN YGAAQKYIHVEEGTNSRLDTIQAAVLHEKLPYLPQWNRDRLNAAQQYDIELAPLASSG IIPIQNHSGEGHVYHLYVIRVDDSCAVQRQQLQEQLAQVGIQTGVHYPIPCHLQPAFT HLGYQVGDFPNAEMLAKQILSLPMYPGLSHSQVQQVVSAIASTLSAEQQKVLCI" BASE COUNT 3332 a 2567 c 2514 g 3497 t ORIGIN 1 accttgtaca acttgtcaat agtgctaaaa aatctagttt catagagtaa atttcaattc 61 aagagagtgg tattatgatt gtcaaagaaa ttttcactaa tttgaaatta ttggtgtata 121 gcacagaaca gcctattcac acatttaaca atgaataatc acaaaaatgc tagtttagta 181 agaagaaaca acatattttc aaaaatccgt ttttcccctt atccgttgag tgttcggatc 241 agtgtactac caacgtattg aagtttttag aacaaaatat aatagcctgt atttagctct 301 aaatttaagc taaatgatgc tctggaaaca gataactttt tttctatttt gtttattgaa 361 aaaatgaaaa acttaaaaat aatatttagt ttttttaaca taagaaaagc tttaatacta 421 tattgcctat agcagcgcct cccaaagaat atgacaaatg cttaaatctt gggagcgttt 481 cattcctaag ctgtcaacat ccccatttgg gtacactaat accacttggc attcatcaaa 541 agccgatcgc gccagggtga cgcagcgcta tgcaccgacg taacccgaag gtaggcgact 601 cgtgtacata 
ctctgcccac ttgaggttcc ctcagttata gccagtggcg cgtggctggg 661 acttaaaagt catgagtggg agacacctca tatggctgtt gggtatgatc atctttggtg 721 gcttccttat gtgcattggc ccaaagggac ggctttggaa gaggagtatc ggctaaatgt 781 tgaaattcct gccaaatttg ccgtcgcaca tcttgtagaa agattcgcca ttcactttct 841 acatctctag cgcgaagtat gcaatctaca aattcttcta caattgcata aatcacttga 901 cttggctcgc acagcaattc tttcaaagct ttttcacact tactcacagc ttctgcgtaa 961 gctattttga gattgagtaa cacttgttca gcagaggacg atgcagaaag cttgagggtt 1021 gcaggctgat ttggagtcag tccatctaaa catttgcgaa tgcgatactg tagaaatcct 1081 cggtaagaaa gcttaaactt agctagggtt tgcaacccat acttaagttg actcggtatc 1141 cctggtatca gttcatcagg tatttgcttg gcgatcgctt ccataaattc agttcctttc 1201 gctggtgtca gtttttccaa gtgtcccttg tccattagta tctgagtcac ttgcgattta 1261 actttgttaa gcgatcgctc taaaccgata tccaatgacg acagatgttt cgacaaatga 1321 gcacggattt cattcaggta cttttcgtat atagtttcgt agcttttttc tagacaaaaa 1381 cgctgtttta tttcttgaat tgagggaatt ccggtatcac ttcgacaagt ttgcaaaaca 1441 gcttctactt gctttttaaa gtccatatca actgcttctc gcttttgcct aagctctgtg 1501 agcagttcct ctagactgtt tgttacagcc aaccaaagtt ctttgaattt tacctcaaac 1561 agcgcaagcc catggatatc atcatgggtt gctaaatcta aagccttgtt gactttctct 1621 aactcatctt tcacttggtt ttggagtaaa tttaattgtt gttggcgtaa cgaggcgtat 1681 ttctcatcat taagtgtctt atggcaaaag cgctctttga gttttgccag cgcttctagt 1741 tcagtattaa tgcttaattg ttgtgtggtt aaacagatgt tctccaaaag gctaggcaca 1801 cttgaatcat caagtcgttt cagcaactga ttctgagaat tttgaagatg ttggactgct 1861 gagtccaaga atttaagttc cgcctctaca tctactattt ttctagctaa gggaagacgc 1921 ttttcaataa tgctagcaat actcccagtc tcatttgtat ccattgacgc gcgtgtttta 1981 ctgttattga ttatatgtgt ggagtttgac gaaacgatat atacagtaga tagagaaacc 2041 tactcgtacg tatgctaagt aaatattcac tctcactgct caaatatatg cctagaggta 2101 atagcatttc actttaaggt tgcatggggg caaccagttc ttgaactggg ggagtgaaat 2161 gaattataat tttcctggca ggttataacc tctgcgaaag tgatagcgtg caactcttcc 2221 gcaagatgcg taaagtttta aaaaataatt ttgattaatt atttgaacat tcctccttta 2281 gatagatgat tttatatatc 
atctgacgta tattatcaca tttttcccaa taaaatcagc 2341 tcaattaaca atcttgtatg aaaaaacgtt aaaattgcgc agattttgat gcttatataa 2401 acgcgctaca cctatggaca gtcagaaaca acttccttgg caacaacaga gggttattag 2461 ctacagccgg ttataggtga ggaaaatcaa cgacttggtc aagcagctgt ctactcagaa 2521 tgcaaatttc tgacttgacc tagaaaaaac ttagaactgg agtttagtgg ctcatgaact 2581 tttgacactc cccacgctta gaaggcgggg gattcttgcg tcacggggat tccaattgac 2641 accgcttcat taaaagcgga aaaatccccg gttgggctgg ctaaattttt atagcagttg 2701 ccaggtagat tagggcatga actaatgaga aaatacggac accaccagac catcaagccc 2761 gttaagagtt aagcgttccc cttcgggttc gccagtcccc ctacccttac gggaagccgc 2821 tgagtcgcag agcgacacgc ttcgcgaacg cgtctacgtg acggagaccg ccaagacggg 2881 ggctggtctc acctgttaag agttaagagt tccctgctat acgtgtaaga ccccgtgacg 2941 gaaaaatccc ccaaatgtta actgctatac caaaatccga attttttatg aaacgatttt 3001 tccggagttg tgctgatttt tatacgttta gacccaaccc accaatatct caaatgtcca 3061 tcaaataagc ttttctgggc aatgccaccc taccatttgt ggtctaatca taagaagaag 3121 cgctccttaa cagaggagtt ttgacaagtt taaggcagta attcctatcc cattagaaag 3181 atgtggttat cactgctata gctctaaact atcttacgaa aattcatccg aaatgattac 3241 ccaaagaaag gatgaaggat aaaaaatttt atacttcata cttttacttt tagtaactta 3301 gtcttctcct atcaacattt aaggaaagat gccgtgaaag caagaaactt actcatcggt 3361 tgtgctttac tgttgaccat aacccttcct ggcaatgcac aacagtcgca ggaactaaga 3421 atagagcaag ttgagggcta tctggtaacg cctcagcaac tggaatttga cgaatcgctg 3481 ctgcaacaac tccaactacc tgcgggattt cgcatcaatg tgtttgccaa agacttgggc 3541 aacccccgga tgttggcagt agcccccgat ggtactatct acgtgacccg ccgcgaacaa 3601 ggtgatgtgt tggcactgct tgactcgaat caagatgggc gtgctgacga aatacggacg 3661 gtggcgtcgg gttataaata catcaatggc atcacgattt atcagaatcg tctctacttt 3721 gtcacagaca gacaacttta tgccgctcct ttgcagtcgc caggggtcat aggcgaaccg 3781 caggaattaa ttaatgactt gccagatgcg ggacaacacc caaaccgcac ccttgccttt 3841 ggtcctgacg ggttgcttta cattacagta ggtagtactt gtaacgcctg tagagacacg 3901 aacccggaaa acgcgactat cctgcgtgca caacccgatg gtagtaatcg gacgatttat 3961 gccacgggct tacgtaatac tatcggcttt 
gcttggcatc cggaaactgg ggaactgtgg 4021 ggcatggatc atggttctga ctggcgtggt aatgagcaac ctccagaaga attgaatctg 4081 attgcggagg gtgcaaacta cggctggcct ttttgttatg ccgatcgccg tccggatgtg 4141 tacctaccag caaatcccac aggcacgaca aaggaggaat actgcgccaa tacacaaccg 4201 ccagtactca cctacacggc tcatagtgcg ccgttggcaa tgacatttta tacagcatcc 4261 cagtttcctg aagaataccg taatgatgct tttgttacga tgcgcggttc gtggaatcgc 4321 aaccctccct caggttacaa agttgtgcgg gtgcgatacg aaaatggcaa accagtaaaa 4381 tttgaggact tcatcacagg atttttggat gaagaaaaat tgacacagtt tggcagacct 4441 gtgggtcttg cgatcgcacc tgacggttcc ctgttgttca cagatgatac caacggtgtg 4501 atttaccgag tgtcatatgt aggtactggc aactagcttt acctttactt ttttctcttt 4561 tctttgagaa ttatattaaa catttggtgt gtgagcgcaa gctttcacac ctcaactccc 4621 tatctcctag ctccatttat tactcgtgct tgtggttctt caggtggata gtttgaaccc 4681 atatatgagt aagacagagg catggtgata tcagcaccat taatcacgac tcccaacaac 4741 tcaaggtcag attccaccaa ttggtcgatc gcttcagtga gtatattttc ttgtgtattg 4801 tttggtcttg tcactagcac aatgccatcg gtgtaaggct gtattaacaa agcgtcattt 4861 gataaactga gagcaggggt gtctaaaatc accaaatcaa aacgctcgcg tacatcctcc 4921 atgaggcgtc gtatttcact agattctaga attgccgctg attgacgcac aggacctggg 4981 ctaggaagaa tgtataaatt ttctatatct ggcactaagc gaatatactc gcccaagttg 5041 gcgtaataac gcaggggttc aatgatagca tcggggtcgg aagtcactcc tagggatgaa 5101 gagtgagaag gcgatcgcaa atctgcttca ataatcaagg ttcgtttgcc tgcacgggcg 5161 gatgctattc ctaagttata agcactaact gtcttacctt ctccactgct ggtgctggta 5221 atcaatacca ccttcaattt tctcccgccc acgcggcgca gattactgcg gaacttttca 5281 taaaactcta aatacaaaga atctagagaa aggacgactg gcacctcacc tagatctaac 5341 ccatcaactg gcatcaaagg caattcgccc aacagaggaa cttctcgttg tttgaggctg 5401 tcgcggatat cttccttggt tttgaaggtt gtttccaatg atcctagtaa aaatataact 5461 ccaccaccta ccaacaatcc tagtaaactg cctataccca gggtaattgg cggactcaga 5521 cgatccttat tagcatcgat cacgattggt ggtctagcaa tcccaagact gctgactgtc 5581 tcagcttctg ctgtttttgc gtctgttagc tttacctgca tttggtcata gacggctttt 5641 ttgagtgcaa cttcctgttc taggcgcgat cgctccaatt 
gcttattggg tatcagagaa 5701 tactcattcc gcaatcgtgc ttcatctttt gtcaggtcaa ggaattgctg ttgcaacgtc 5761 tcgcgctgag tttgcaaagc gaccatctga tttgccagct gctgtcgggc tggatctagg 5821 ttgcttcgag aacgaatggc aatagtagct ggaagcggtg ctgttccatc tttaccgcct 5881 acaacctcac taccacgttg ccgaagttgg ttctcataag cttctttctg attccgcaac 5941 tgaatcattg tggggtgttc ggggcgcaaa ccttctttct ttaaaacttc aatttgggat 6001 tcaatttggt aaagttgtac tcgcacgtta gcaataatcg gatcagcact caaagcagaa 6061 gagacataag cttgattgac ggttaaaccc agtttttctt gtatgctgcg aatttgagta 6121 tcaattccag caagaactaa ttgagtttgt cgctgctgta tttggctact tgtgactgcg 6181 ttgagtaaac ttccattttc tgcagccaat atagcaggtc gttctttgcg atcatacact 6241 tctagcttct gctcagccgc ttgtagttct tttttcgcct gtggtaagcg ttcattaact 6301 ttgccaataa ctgcttttaa tctccgtgta ttaatttccg cactcagcgc gatcattgcc 6361 tccatcaacg cctgcaatgt gtcttgtgct gcctggatct tagtatcgtt atatttcagt 6421 tcgatgagtg tgggtgtgag ttgtttatct cctccagttt tagagtcttt ctcaggcatc 6481 gtaagcacga aattttgacg tatttttgtt ggcttgacat tgactttagc tgccgcagca 6541 tcaatgactt gctctgatag caaaatatct ttgctcagtt cctgcccttg cagattaatt 6601 tcactgcctg ttgcggaaaa agaaactgtt ggacgattgt atgtaagtgc gccgttcgct 6661 acatatctag attgttgtgg ttgcaaagcc accgccattg atcccgctac aactagacca 6721 aaactagcta gtccaatcca cttgtactta tcgaaagcaa tgaggtagcg tttaacaatt 6781 ggtggagtca tggcagcagc aaatcaaaac taggtataaa ttcgactata gtttttagat 6841 aatttcggtt ttgaggagtt tgaaattata aattatttca taagttcatt cctcttcctt 6901 ctcttacaca cggctggcaa ggactactaa tctagtatct atagtcattg acaccgaaga 6961 agcggagaaa ggtacgaata tcgtaaaagg gacgggcaat tgtaccgatg atattagtaa 7021 ttttaccaat caggcttcgt cctactacga taacatcgtt atcttgaagt gggacatttt 7081 gagatgcatc ccctgacaga gctttttttc cattaatcgt ttgcttgacg gctcgacctc 7141 tttctgggtc aaaacgaatg agagcgattt cactgaggtt tgcactgtca gtattaattc 7201 ctgccaatac gtctataaaa gtactaccat taggcaaagg gacagtgact attcccccac 7261 cagcataatt taaaatacga attctaatct gcggtgttgc taaagttgag cgagacacca 7321 atgcgcggtc ataactctta tcgtttgcaa gttcccgacg tggcacaatg 
atcgcatcgc 7381 cgtcttgcaa gcggaggtta ggtatgtcac cgccattttg taatggagta tacaggtcta 7441 tcgtttgtga aaccacagac ccatcaacca gtttccggcg tacttggact tgacgcagat 7501 ctgccatcat cgcagaacca ccagctaaca acaacgcatc actaacgcga ggcgttgctg 7561 aggtaatagg ataaactcca ggtcgattga cttgcccatt aatagtgact tgaactggtc 7621 gctgtcctat caatgatagt gacacaagtg ggttaactaa ttggcgactc aaagccaagc 7681 ggattttttc ttgcgcttct tccaaagtca agccttgtag ggatactgtc cctacttgcg 7741 gcacgatgat gttgccttca ggattaattt gagccgaaaa attgaactct ggacgctgcg 7801 ccagcaatga caatgtcaca tttggattga cgacgaaacg attatagccg gagcgaattt 7861 tttcttgtgc ttgttgtaga gtcaacccct tgagtgacaa tgttcccagt aatggcacca 7921 cgatgtttcc ttcggggtta atcagggctt gaaagcttaa atctgggaag cgtaaaactg 7981 aaactccaat agagtctcct ggtcctaaac gatagggacc gggagtacgc tgaaccaaaa 8041 cgttaatgct atctcctggt cccaaaaggt agcggttcaa ttgcggcgac atcccttctt 8101 gaagaccagg agcgataaag tcagtactac taggcgcagg tggaggaggt tgtcgtagta 8161 attgtccggc tggagtatta ggtagaggcg cgggaacctg tcttgctgga gtattaggta 8221 gaggcgcggg agcctgtcct gacgttcgtt tctgcgttga tgtttgagca acaacaggtt 8281 ggacaattgt taccacaaaa acgcctacgt acaaaccagc agaagacagg gcgttaaaca 8341 agcgcatacg acaaccaaga acaatagaca ttggcactat aacttactta tagtaggaag 8401 tgccaatgta cataacagag ttataacttt actttaaatt aagacgctaa aaagtacttt 8461 tttatcgcaa aactttactt tttttataca aatgcattgg tgaacgactg gacaaaggac 8521 ctgccatcaa agtggcttgc actgtttgac cgagagattt agcttcattc caagttgttt 8581 ccacctgccc taatggttcc ataggaatga gaatactcac agcaacccaa ggagcgcgtt 8641 ttttttgcaa ctgtgctagc tgatcgacga taaaccacgg gaaaagtgat gaactaccac 8701 cattgaacca ggcataccat tgcaaaaccg caaaagtctg ctgatttgtt gaggcacgaa 8761 agaacctagc ttgtactttt gtttcagcag catttgactc tgtcccttgc ggtttaacag 8821 taaaatcagc agagcgatac tgagctatct gccactgcca aaggctgttt atttccgtcc 8881 actccaccac aggctgatct ttagatcctg tttgtggtcg caaaagcaga actgctttgg 8941 tttggtcgct ttttttctga ataaactgat aagaccatgt gccttcccct atttcttgtt 9001 ctcgctgttc tatcgtctgc caaccaggaa ggatcaaccc cttttgacgt aaatctttca 
9061 actcctggag acgagtgatt cgtggtaatt gctgccattc ccagtgtcct gtcaggtaac 9121 caggaattgc tcctataagc aacagtagta acagcaacaa aagcgctact atctgagaaa 9181 accgttgctc cttgagaaac ttggaaaaag aaatcatcag tgaagtttac agtatctaga 9241 caaaatttta ctctgaaact tctgcaaaat atttctcaat ccaatttaga gttggcacca 9301 atgaaagcag cataacagca gaatacaaat caccacccca accattatgc agccaatcaa 9361 aagctccgtc ttgtgctgtg ccatgaaaga aggacagtaa tgtgttacga acgatattgg 9421 cgcaaatact aattaagact gcaatagata aaaaccaaat acttttgcgg cgcgaagaca 9481 aagcacctgt ccaataaagc agcatcaagc caacataaag agtcgtaaat agcattttca 9541 gccctgcaca gtaaggtgca acttctacaa ttcttccccc aacatatata tttatgccat 9601 cgacagtgac ttgcatacca agctgactga gtatgaagcc agcagtaccc gcgatgaaac 9661 tctgtaaagg taatgtgtag ggagtaatga ggtagggaac tgatgttggt gttgcaagaa 9721 gtactaatat taaagcaaat ccttgcaatt ttaagccagg aattccctta agccagaaac 9781 acaaccctgt cagtatagtt ggaaaggaaa tgttgaccca ctcgctgaca ccactcaggt 9841 aaaaaactcc tcccagtact aacaacacca cacccagagg atgggaaata tttggtagtc 9901 tttgccattt ttttcggttc aaccaacaaa ggtatgcagc aaacggtaaa ccaatgatac 9961 cgtggctaaa atattcgtgt tctatactaa tacttttgtt gagccaacct tgcacccagt 10021 aaaataacac aggagcataa agcagtagca aaatgcctaa tatagctggg gtaatcaggc 10081 gtgtagtatt tgcatttttc agctggtgcc gaatttgcat ggcttttatg ataatgaatt 10141 atcaattgtg agtttgataa atactgagtg aagaataatg agtattactc ttctgattca 10201 ctcagtattt atcatgacat tattttttca aatacacagt accttttgtt gttcagcact 10261 gagggttgat gcgattgctg atacgacttg ctgaacttgg ctgtgactta aaccaggata 10321 cataggtaag gatagtattt gcttcgccag catttcagca ttggggaagt ccccgacttg 10381 atagcctagg tgggtgaatg ctggttggag atgacaagga attggatagt gaacgccagt 10441 ttgaattccc acttgtgcga gttgttcttg aagttgctga cgttgtacgg cacaagaatc 10501 atctactctg atgacataaa gatgataaac gtgtccctcg ccgctgtggt tttgtatggg 10561 aataattcca cttgatgcta gaggtgctag ttcaatgtcg tactgctggg ctgcattgag 10621 gcgatcgcga ttccattgtg gtaaatatgg taatttctcg tgcaacactg ctgcttgaat 10681 tgtatctagg cggctattgg ttccctcttc aacgtgaatg tatttttgag ccgcaccata 
10741 attccgcaag cgtaacattt tttgagcgac atctggattt cgcgttaata acatcccgcc 10801 atctccaaat gctcctaaat tcttgctggg atagaaacta aaagctgctg cagttcccac 10861 agaaccagcg cgatatcctt ctcgttcagc gagatgggct tgggctgcat cttcaaaaat 10921 gagtactttg aaagtgtcgg caaagtctaa caactgccgt ggcgacacca tctgcccgta 10981 aagatgtact ggaagaattg ctttggtttg aggtgtaact gcccgtgctg ctgcttctaa 11041 atcaattaaa gctgtttgtg gatcgcaatc tacaaaaatt ggctttgccc ctgcacgtat 11101 caccccaatc agtgtcgcaa caaaggtgtt tactggcaaa ataacttcat ctccagaacc 11161 aatgtgacag gcttgtaaac cgagggcgat cgcatcagtt cccgaagcaa caccaactcc 11221 atactctaca ccagacgcag cagcaaaagc tgcttcaaac tctttgagtg cttgccctaa 11281 aacaaaatcc cctcgttccg ttacaccttg gattgcttgt tgtaattgag tttgaattgg 11341 ttgatgctgt aaagttaggt ctacaaaagg aaccattaca gtcttattat tcatttttag 11401 gcaagtccct agtatttttt gtcaaaattg ctgttctaaa tactatagtt ttgtttaagt 11461 tttattatga agaaatcgta aatttttgat taactttata tttgtgacat ttttagactt 11521 ttggagatga ttgcctaaat gattgacaca ttccatctaa tttttagata ttagaaccgt 11581 tttctttgaa atttaatgaa tgaatctaac aactcaaatc atattgcaat agcatattta 11641 tatataactg atattgcgca ctttcaactg acatgatcga tttttactgt tttgtgcctg 11701 ctaagaatgc tataggaatc tggtttggtt tatgttagct tgcgtggcgt taggcgggct 11761 tccgtggcgt cagccatagc catacttact agttgagcta gggaacaggg aacagggaac 11821 agggaacaga aactgtacgg agttctgttc aaaaatcaaa taggaatcct atatctgaga 11881 tacttaacct tccgtttata gggtatttac // LOCUS NODE_2861_length_11842_cov_4.92126911842 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11842) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11842) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11842 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 118..2337 /locus_tag="DP116_22375" CDS 118..2337 
/locus_tag="DP116_22375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455783.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphoketolase" /protein_id="PRJNA477356:DP116_22375" /translation="MTATTPKETLAAPNFCEGIQYFGEAIPGFETYGGTPAINSGGSA NPGDPAAAFQTLLTADALRYLILQVTASKASGHPGGFASQAEAYAALVMLGYKNIITE VGHHAPGFYSAMFLDRSLEDMGIYTVQQLRDRFREKHGLLGHLSGYIPGILAPAGPLG QGQHFAMSAALLHKDKLFPFTVGDGGLGEPYIMSSIAHFHTAFPGVTNFLPVLVWNGY SQEHHSMVSLKTNEQMIAYWYGNGFEKVVLVDAKDFDDQNQTGDYVDSTAFSFEQRLA FTKAVLTGVDEAARSALSGKLTVFIIKQLKGAGVHARGAKSHNLYPKDTLDAPHIVSA LKERALPPEVWQLVRTNCERAGGGPAAKTVVTEFEYELPELGELPLEEYPVGGEPKVS TTSMGRLVGKVGQIDRNFLVTNADGNEASGIANINSALKIIHPTTDDLYNQAPNGQVY EPLSEDACAGLAAGLSLMGARTLWCSYESFAINGLPIWQTVTQAMAELRRRTPSTITL FTAGALEQGRNGWTHQRPEIEAYFASLMRNGNVFPLFPPDANSIQVCYDWALTTKNKG IVITASKSPLPIRTTFEQTSQALRDGAIVLQEVDSNQGGSKKVVFAVIGDMTLIPVFE AAAFLETEGIAVKIVSVINPRRLYRPHDTAWDICSEADGGFVDDQKFAELFDGDALIG VTGGAAGMLEPIMLRSTAKRDTFAWKRGETTASAGELMAFNGLTAQALAKRGIELVH" gene 2442..3287 /locus_tag="DP116_22380" CDS 2442..3287 /locus_tag="DP116_22380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203144.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="restriction endonuclease" /protein_id="PRJNA477356:DP116_22380" /translation="MADLLKAIKTIVDNPIPDLVSYYQGKNRINSIGDALECFVKDIF AETLSQSEQALKNERYSEVFSYIGNQNNPPDLILKNGDAIEVKKIESLKASIALNSSY PKSKLYYDSPLITKHCRQCEDWQEKDIIYVIGVPENKKLKILWFIYGDCYAADREIYE RIAKKISSGITEIKDVEFSETRELGRVNKVDPLGITYLRIRGMWGITNPIYVYDYIFQ TQATENLQVVVIMKEEKYLSFSPESREEIEAISNVNFQNQNVNIRDPNNPAQLLRAKI FIYRI" gene 3321..4388 /locus_tag="DP116_22385" CDS 3321..4388 /locus_tag="DP116_22385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152561.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_22385" /translation="MNIVSLFSGCGGLDLGFHQAGFNIVWANEYDKSIWDTYELNHPD VKLDRRDIRVIEPDEIPECVGIIGGPPCQSWSEAGAGRGINDSRGQLFYDYIRILREK RPLFFLAENVSGILAQKHNKALTNILYQFKDAGYEVTYKLLNACNYGVPQDRKRVIII GYREEMGGTFEFPLESNHILTLRDAIYELSDIEPTPVAGEVSKTHPLVPNHEYMEGSF SSIYMSRNRVRTWDEPSFTIQAGGRHAPIHPQAQKMIWVEKDKWIFDPNSLKPYRRLS VRECARIQTFPEKFIFKYKHIGDGYKMVGNAVPVLLAKKLATKIIIDIKEYQNFGVCN HVRRHKYPTQLTLFEPSSSFA" gene complement(4411..4635) /locus_tag="DP116_22390" CDS complement(4411..4635) /locus_tag="DP116_22390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357366.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4926 domain-containing protein" /protein_id="PRJNA477356:DP116_22390" /translation="MIAELVVVILTTHISEYGLEQGDIGTVVLVHQGGKGYEVEFLTE YGETVAIVSLLAAQVRSIGSREIAHAWVMD" gene 4840..5199 /locus_tag="DP116_22395" CDS 4840..5199 /locus_tag="DP116_22395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996024.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22395" /translation="MTNTQGMVRLNLDLSPELNQVLEELAKKTGVTKSDVLRQAISLM QILVTAKEQTHKLGINEADQLIATEIIMPSEDIPTEHPLETFMESFGAWEDERTPEEI IKEIYDSRTISKSEYSL" gene 5196..5597 /locus_tag="DP116_22400" CDS 5196..5597 /locus_tag="DP116_22400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PIN domain nuclease" /protein_id="PRJNA477356:DP116_22400" /translation="MTYLLDTDTCIYWLTNRYSVRQKVRQVGWNQISICIITAAELYF GAFNSNRIEENFARAEFFIKQLPVLPLTDSAVRRFGELKAELRRLGQPIGDFDLLIAS VALTGNYILVTNNTRHYQRITELQLENWTLP" gene 5702..7228 /gene="lysS" /locus_tag="DP116_22405" CDS 5702..7228 /gene="lysS" /locus_tag="DP116_22405" /EC_number="6.1.1.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874072.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lysine--tRNA ligase" /protein_id="PRJNA477356:DP116_22405" /translation="MSEEDIRATRLEKVEQLRQLGMNPYAYHWESSHHAAQLQEKYAD LPNGEEVDLEVTVAGRVIARRVFGKLAFFTLEDETGTIQLYLEKNRIQESMADVDADA FNHLKQLTDVGDILGASGTIKRTEKGELSVFVKKYTILTKSLLPLPDKWHGLTDVAKR YRQRYVDLIVNPEVRQTFRRRALITAGIRRYLEQRGFIEIETPVLQAEAGGADARPFI TYHNTLEMELFLRIATELHLKRLIVGGFEKVFELGRIFRNEGISTRHNPEFTTIEIYQ AYADYNDMMALTEGIITTVAQEVLGTLQITYQGTPVDLTSPWRRATMHDLVKEYTGLD FNSFQTLEEAKAASKNAGLEGVKDCPSIGKLLNEAFEQKVEENLIQPTFVIDYPVEIS PLAKPHRSKPGIVERFELFIVGRETANSFSELTDPIDQRQRLEAQAARKAAGDLEAQG VDEDFLTALEYGMPPTGGLGIGIDRLVMLLTDCASIRDAIAFPLLKPEKSESSTESDT " gene complement(7300..8613) /locus_tag="DP116_22410" CDS complement(7300..8613) /locus_tag="DP116_22410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743392.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbon dioxide transporter" /protein_id="PRJNA477356:DP116_22410" /translation="MVQTPDKVQAKLPPSTHEFAEVIHRLEAGGAMLPDTPENLMQII GLYKAYAVPMDFYWRDLLYIAEQVFLDPFPFFKYFIPQEYLDRHNHYAGDDAELRVWR GEATAHPELLAFMEKGETFKMPKLLHHLFHDRINMEFAEACMRAMLWHRGMGGKFDPY LDTEEYKANADRAIKAYFKGNPFMLGLYKLFPDMFIEQCRQMSYYANLGLFWEVMAPV FFEMSDLYDEGKITSVPEAMNFIVNGIFAAANRPIYHHVYIRGECYEIVPKSKGFVWL HEAALPYVEAVFYRTAPFRGTKSYNAQAGQVPIDQKDFHYGILYADVFPVGTAGIPPT LLMQDMLHFLPPYLIDYYKNYCRGEEDTLIQLGISFQRSMYCVTSAVIQALRTVLCHP LDDPDPEHLQANRDFFESQLNRFTRPEYGIRNAARLRDIQRQDYR" gene complement(8652..10154) /locus_tag="DP116_22415" CDS complement(8652..10154) /locus_tag="DP116_22415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit M" /protein_id="PRJNA477356:DP116_22415" /translation="MLSTLIWLPILGAAVISFLPRAIPAINVRLTALTVTGLVLLWNI FLLFKFDISLPGMQLQEYLPWNETLGLSYQLGVDGLSILMLLLNSLLTWIAIYSSNQQ TERPRFFYSLILLVSAGVAGAFAAQNLLLFFLFYELELIPFYLLISIWGGEKRAYAGM KFLIYTAVSGALILATFLGTVWLTGSTSFDYNTLSTQALSTTLQIILLVGIVLGFGIK IPLVPLHTWLPDAYVEASAPIAILLGGVLAKLGTYGVLRFGMALFPEAWSILAPSLAT WGAVSAIYGAVTAIAQKDIKRMVAYSSIGHMGYILLAAAASTSLALIGAIAQMFSHGI ILAILFHLVGVVEAKVGTRELDKLNGLMSPIRGLPLTSALLILGGMASAGIPGMTGFI AEFIVFQGSFSVFPVPTLLCVVASGLTAVYFVILLNRTCFGKLDNDLAYYPKVLLSEK MPAFILAALILFLGVQPSWLVRWSETTSTAMVAVIPPVEKTVTTQVALNQ" gene complement(10439..>11842) /locus_tag="DP116_22420" CDS complement(10439..>11842) /locus_tag="DP116_22420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864510.1" /note="Catalyzes the transfer of electrons from NADH to ubiquinone; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit F" /protein_id="PRJNA477356:DP116_22420" /translation="LVTYLLVGLWFSQPLVVTGARDAFLTKRVGDLFLLMGVLAIWSQ AGTWNYTDLAEWATTANVNPTLITLTCLALIAGPMGKCAQFPLHLWLDEAMEGPVPST ILRNSVVVATGAWVLIKLQPVFSLSPIATSAMVAIGVVTAVGGSLIAIAQVDIKRCLS YSVSTYMGLVFIAVGTQQDEAALLLVLTHALASALLVMSTGAIVWNSITQDVSLLGGL WSRRPVSGIAYIVGTLGLIGFPPLGGFWALLKLASGLWTTQPWLVGVIIIVNTLTAFS LTREFSLIFGGKPKQMSDRSPEVSWQMALPTIILLGFTLHLPLVLQSLSLLPTWATLN KDVALLLIWSSVFGCSISAVIYLGNVIPKPIRLPLKPLQDLLAYDFYTPKLYRVSIVF SVDLISKLADMIDRLIFDGIVNLVGLVSILSGESLKYSTSGQTQFYAFTVLLGVGILG MVVSWQHWGSHLLNLMF" BASE COUNT 3689 a 2373 c 2523 g 3257 t ORIGIN 1 tccttatatt tatgttgcca aagaacaata aactgttaca acctgaaagt tgtaaaatct 61 ttgttaaatt gaagagcatc tccagtgtct cctcatactt agattaaggt acttagcatg 121 acggcaacaa ccccaaagga aactttagcg gctcctaatt tttgtgaagg tattcaatac 181 tttggtgaag caatcccagg ttttgaaaca tatggtggaa cacctgccat taattcaggt 241 gggagtgcca atcctggtga tccagccgcc gccttccaaa ccttacttac tgctgacgct 301 ttgcgttacc taatactgca agtcacagca agtaaagctt ctggacaccc aggtggtttt 361 gcttctcaag cggaagctta tgcggcgctt gtcatgctcg gttacaaaaa cattattacc 421 gaagtgggac accatgcccc tggattttat agtgccatgt tcttggatcg atcgctagag 481 gacatgggaa tttatacggt gcagcaattg cgcgatcgct tccgagaaaa acacgggctt 541 ttaggacacc tttctggcta cattcccggt attctcgcac ccgcaggtcc tttaggacaa 601 gggcaacact tcgcaatgtc agccgcattg ctgcacaaag acaagctttt ccccttcacc 661 gttggcgatg gtggattggg tgaaccgtat atcatgagtt caatagcaca cttccacacc 721 gctttccctg gtgtcaccaa cttcttacca gtattggtat ggaatggcta cagccaggaa 781 caccacagca tggtttccct taaaaccaat gaacaaatga tcgcatactg gtacggtaac 841 ggttttgaaa aagtcgtgtt agtcgatgcc aaagactttg acgaccaaaa ccaaacaggc 901 gattatgttg atagtaccgc cttttccttt gagcagcggc tagcttttac caaagctgtg 961 ctgacaggtg tagatgaagc agcacgttct gcactgagtg gtaagctaac cgtatttatc 1021 ataaaacaac tcaaaggtgc gggagttcat gcacgaggtg ccaagtctca caacctttat 1081 cctaaagaca ctttggatgc tcctcatatt gtaagtgccc tgaaagaacg tgccttaccg 1141 
ccagaagttt ggcaactagt gcggacaaat tgtgaacgcg caggcggagg tcctgcagcg 1201 aagacagttg tgacagaatt tgaatatgag ttgccagaat taggcgaatt gcctttagaa 1261 gaatatccag tgggtggcga accgaaagtt tccaccacat ctatgggaag attggtggga 1321 aaagttggac agatagatag aaacttcttg gtaactaacg ccgacggtaa cgaagcatct 1381 ggtattgcca atatcaactc tgcacttaag attatccacc ctacgacaga tgacttatac 1441 aaccaagcac caaacggaca agtatacgaa cctttaagtg aagatgcttg tgcggggtta 1501 gctgctggtt tatcgttgat gggtgctagg actttgtggt gttcttacga atcttttgcc 1561 atcaacggtt taccaatttg gcaaacagtc acgcaagcaa tggcagaatt gcggcgtcgt 1621 actccctcaa ctattacatt attcaccgct ggtgcattag agcaaggacg caacggttgg 1681 actcaccaac gtccggaaat tgaagcttat tttgcttcgt taatgcgcaa tggtaatgtc 1741 ttcccattgt ttccccctga tgctaacagt attcaagttt gttatgactg ggcattgacg 1801 actaagaata agggaattgt gattactgct agtaagtcac cattgccaat tcgcaccact 1861 tttgaacaaa cgagtcaagc tttacgcgat ggcgcgatag tgttgcaaga agttgattct 1921 aatcaagggg gaagtaagaa agttgtattt gctgtaattg gcgatatgac gttaatccca 1981 gtatttgaag ctgctgcttt tctagaaact gaaggtattg ccgtcaagat tgtttctgtt 2041 atcaatcctc ggcgtttata tcgtccccat gatactgctt gggatatttg ttctgaagct 2101 gatggtggtt ttgtggatga tcagaaattt gccgaattgt ttgatggcga tgcactcatt 2161 ggtgtgactg gtggtgctgc ggggatgcta gaacccatca tgttacggag tactgcgaag 2221 cgggatacct tcgcgtggaa gcgcggggag acgacggcga gtgctggcga gttgatggcg 2281 tttaatggtt tgacggcgca ggcgttggcg aagcggggga ttgagttagt gcattagatt 2341 tagcaatctg atacgatacg gtattgccgc tcttcacaca tgaggagcgg cataatcttt 2401 attatctaat tagaatatag taataaacat aacatacatt tatggcagac ttacttaaag 2461 caattaaaac aattgttgat aatcccattc cagatttagt aagttactac caaggtaaaa 2521 atagaattaa cagtattggt gatgctttgg aatgttttgt aaaggatatc tttgccgaaa 2581 ctttaagtca aagcgagcag gctctaaaaa atgagagata ttctgaggtt ttctcttata 2641 taggtaatca gaacaaccca cccgatctaa ttttaaaaaa tggtgatgca atagaagtca 2701 aaaaaataga atcactaaaa gcaagtattg cattaaacag ttcctatcca aaatctaaat 2761 tatattatga tagtccactg atcaccaagc attgtcgtca atgtgaagat tggcaagaaa 2821 aagacattat 
ttatgtaatt ggtgttccag aaaataaaaa actgaaaatt ctttggttta 2881 tatatggtga ttgctatgct gccgacagag aaatctatga aagaattgct aaaaaaatta 2941 gtagcggcat tacagaaata aaagatgttg aattctctga aactagagag ctaggaagag 3001 tgaataaagt agatcctcta ggaatcactt atttgaggat tagaggaatg tggggaataa 3061 caaatcctat atacgtttat gattatattt ttcaaactca ggcaacagag aatttacaag 3121 tagtcgtaat tatgaaagaa gaaaagtatt tatctttttc tccagaaagt cgagaagaaa 3181 tagaagctat ctctaatgtt aacttccaaa accaaaatgt taacattaga gatccgaata 3241 atcctgccca acttcttcgt gctaaaatat ttatctatag aatatagaat tccaaaaaag 3301 ttaagttata atgaatcaat atgaatatag tgtctctatt ttcaggttgt ggaggtttag 3361 atttaggctt tcatcaagct ggatttaaca tagtctgggc aaacgaatat gataagtcaa 3421 tttgggatac ttatgaactt aatcatccag atgtaaaact agatagaaga gatatcagag 3481 ttattgaacc tgatgaaata cctgagtgcg taggtatcat tggtggtcct ccttgtcaaa 3541 gttggagtga agctggtgct ggacgaggaa taaatgatag cagaggtcag ttattttatg 3601 attacattag aattcttaga gaaaaaagac ctctattttt tctagctgaa aatgtcagtg 3661 ggattttagc ccaaaagcat aacaaagctt tgacaaatat tttatatcaa tttaaagatg 3721 ctggatatga ggttacttat aaattactga atgcttgcaa ttatggagta ccccaagata 3781 gaaaacgggt aattattata gggtatagag aagagatggg gggaaccttt gaatttccct 3841 tggaaagcaa tcacatttta actcttagag atgctatcta cgagctaagt gatattgaac 3901 cgacaccagt agcaggagag gtatctaaaa ctcatccatt agtacctaat catgaatata 3961 tggaaggtag cttttctagc atttatatgt caagaaatag agtcagaacc tgggatgagc 4021 catcttttac aattcaggca ggtggtagac atgctcctat acatcctcaa gcacaaaaaa 4081 tgatttgggt tgagaaagac aaatggattt ttgatcctaa ttcattaaaa ccgtaccgga 4141 gattatccgt tagagaatgt gcaagaattc aaacgtttcc agaaaaattt attttcaaat 4201 acaaacatat aggtgatggc tataaaatgg ttggaaatgc tgttcctgta ttgctagcaa 4261 aaaagcttgc tactaaaata attatagata taaaagaata ccaaaatttt ggtgtctgta 4321 atcatgtgcg tcgtcacaaa tatccgacac aactcacatt atttgaaccc agttcttctt 4381 tcgcttgaga cttaatttta tgcaatacca tcaatccata acccatgcat gagcaatttc 4441 tcggctacca atggaacgaa cttgtgcagc taagagtgaa acaattgcca cggtttcccc 4501 atactcggtg agaaattcca 
cttcatatcc cttaccacct tgatgtacca aaacaactgt 4561 acctatatca ccctgttcta acccgtactc tgaaatatga gttgttagaa taaccactac 4621 aagttctgca atcatttttt tcaccttttc aattgatata gcaatccgcg catcagttgt 4681 aaaaatctct cttttcctcc tctgtgttct ctgcgcctct gcggtttatt cactaaaggt 4741 ttattttaca aatcagatag gactgctata ggcagtcaca aattatcttg accatctgta 4801 agctatgatt tactaaagat tttacacaag gattaaacca tgactaacac tcaaggaatg 4861 gttcgcttaa atctagattt gtcaccagaa ttgaaccagg tattagaaga acttgcgaaa 4921 aaaactggtg tcactaaaag cgatgttctg cgtcaggcga ttagcttaat gcaaattctt 4981 gttacagcta aagagcaaac tcataaatta ggtattaacg aagcagatca gctgatagct 5041 acagaaatca tcatgccctc tgaagatata ccaacagagc atccactgga aacatttatg 5101 gaaagctttg gtgcttggga agacgaacgt acaccagaag aaatcatcaa agagatatat 5161 gatagccgta ctatttctaa gtctgaatat agtctgtgac ttatttactt gatactgata 5221 cttgtattta ttggcttact aatcgttatt cagtcagaca aaaggtgagg caagtaggat 5281 ggaatcaaat ttccatttgc attatcactg ctgctgaact gtactttggt gcgtttaatt 5341 ctaatcgaat tgaagaaaat tttgctcgtg cagaattttt tattaaacag ctacctgttt 5401 tacctcttac tgattctgct gtcagacgct ttggagaatt aaaagcagaa ctccgcagac 5461 taggacaacc aattggcgat tttgatttac tcattgccag tgttgctctt acaggaaatt 5521 atattttagt tacgaacaat acccgtcatt atcagcgcat taccgaacta caactagaaa 5581 actggacttt accataagct tgacttttga cttgttgtac tagtttggcg tctgagaaag 5641 ctacagctac tcttacaatt acatacaata atgaaatctt gccctattta gttcgttaaa 5701 catgtcggaa gaagatatcc gtgctacgcg gctagaaaaa gtagaacagc tgagacaact 5761 ggggatgaac ccctatgcct accattggga atccagccat cacgcggctc aattacagga 5821 aaaatatgct gatttgccta acggtgaaga agttgattta gaggtgactg tggctggacg 5881 cgttatagcg cgtcgtgttt tcgggaaatt ggctttcttc actttggaag atgaaaccgg 5941 cacaattcag ctttatttgg aaaaaaatcg cattcaagaa agcatggcag atgttgatgc 6001 tgatgctttc aatcacctca agcaactcac agatgtcggt gatatcttgg gggctagtgg 6061 gacaatcaaa cggactgaaa agggcgagct atctgtcttt gtaaaaaaat acactatttt 6121 aaccaaatcc cttttgcctt tacctgataa atggcatggg ttgacggatg ttgctaagcg 6181 ataccgtcag cgttacgttg atttgatagt 
aaaccctgaa gtgcgtcaaa cttttcgccg 6241 tcgcgcgctc attactgctg gtattcgtcg ttacttggaa cagcgtggtt ttattgaaat 6301 tgaaactccg gttttacaag cagaagcggg tggtgcagat gcgcgtccgt ttatcacgta 6361 ccacaatact ttagagatgg agttgtttct gcgaattgca acagaactcc atcttaagcg 6421 cttgattgtc ggtggatttg aaaaggtgtt tgaactgggg cggattttcc gcaatgaggg 6481 aatatcaact cgtcacaacc ctgaatttac cacaattgaa atttaccaag cttatgccga 6541 ttacaacgat atgatggcgc taacagaagg tatcattacc accgttgccc aagaggttct 6601 cggcacgcta caaattacct atcaaggcac accagtggat ttgacttccc cttggcgacg 6661 tgcgacaatg cacgatttgg taaaagaata tacaggctta gatttcaact ccttccaaac 6721 tttggaagag gcaaaagctg ctagtaaaaa tgctggtttg gaaggtgtca aagattgccc 6781 ttcaatcggt aagttgctga atgaagcctt tgagcaaaag gtagaggaaa acctcattca 6841 accaactttt gtgattgact acccagtaga aatttcgcca ctggcaaaac cacaccgttc 6901 aaagcctggt atagtagagc gatttgagtt atttatcgtc ggacgggaaa ctgccaatag 6961 tttctcagag ttgactgatc ctatcgacca aagacaacgt ttagaagcac aagctgcacg 7021 aaaagctgcg ggtgacttgg aagcgcaagg agtggatgaa gatttcttga cggcgttgga 7081 atatggtatg cctcccacag gcgggttagg aattggaatt gacaggttgg tgatgttgtt 7141 aacagattgt gcgagtattc gggatgcgat cgcattccca ttactcaaac ctgaaaaatc 7201 cgaatcatct acagaatcag acacatagcc agaaggtgcg ttacggcaaa atgtaacgca 7261 ccttcttaaa ctgatactca tgcaagaagt caggaggaat tatctataat cctgtctttg 7321 aatatcccgc aggcgagcag cattacgaat accatactca ggacgagtaa aacgatttaa 7381 ctgagactca aaaaagtccc gatttgcttg taaatgttca ggatctggat catctaaagg 7441 atgacaaagt acagtccgca atgcttgaat aactgcagaa gtgacgcagt acattgaacg 7501 ttggaaacta attcccaact gaatcaatgt atcttcttct cctcggcaat aattcttgta 7561 gtaatcaata agatacggtg gcaagaaatg caacatatct tgcatgagca gtgttggtgg 7621 aatacctgca gtacctactg gaaagacatc cgcataaaga attccatagt gaaagtcttt 7681 ttggtctatt ggcacttgtc ctgcttgagc attataagat tttgtacccc ggaagggagc 7741 agtacgataa aaaactgctt ccacataagg taatgcagct tcgtgcagcc agacaaaacc 7801 tttagattta ggaacaattt cgtagcattc accacgaata taaacatgat ggtaaatagg 7861 acgatttgca gccgcaaaaa ttccatttac tataaaattc 
attgcttctg ggacactcgt 7921 aatttttcct tcgtcgtata agtctgacat ttcaaaaaat acgggagcca tgacttccca 7981 gaacaaaccc agattagcgt agtaagacat ctgacgacac tgttctataa acatatctgg 8041 gaacagtttg tagagtccta acatgaatgg gttacctttg aagtaagctt tgattgccct 8101 atcggcatta gctttgtatt cttcagtatc cagataaggg tcaaatttcc cacccattcc 8161 tctatgccaa agcattgctc gcatacaagc ttcggcaaat tccatgttga ttcgatcatg 8221 gaataagtga tgcaataact taggcatttt gaaggtttca cccttttcca taaatgccaa 8281 gagttctgga tgagcagtcg cctcacctcg ccaaactctc aactcagcat catcaccagc 8341 ataatgattg tgacgatcta aatactcctg gggaatgaag tatttaaaaa agggaaatgg 8401 atctaaaaat acctgctcag caatatataa taagtcgcgc cagtagaagt ccatcggtac 8461 tgcatatgct ttgtaaagac cgatgatttg cattaagttt tctggcgtat cgggcaacat 8521 tgcaccgcct gcttctaacc gatgaatcac ttctgcaaat tcatgagtag aaggaggtaa 8581 tttagcttga accttatctg gagtttgtac cattttgatt tcctcaaata aagtagattt 8641 tcaaaggaag tttattgatt gagagcaact tgagtcgtta cggttttttc aacaggagga 8701 ataacagcaa ccattgctgt acttgttgtc tcactccaac gcactaacca actcggttgt 8761 actcccaaga ataagatgag ggctgctaaa ataaaagctg gcattttctc agacaacaga 8821 actttaggat agtaagctaa gtcgttatca agtttgccaa aacaggtacg gttaagcagg 8881 atgacaaagt aaactgcagt taagccactg gcgacgacac acaacagtgt tggtacggga 8941 aagacggaga aactgccttg aaacacgata aattcagcaa taaatcctgt cataccagga 9001 ataccagcgc ttgccatacc ccctaaaatg agtaaagcac tcgtcagggg taatccgcgt 9061 ataggactca ttaagccatt gagtttatcc aattcacggg ttcccacttt ggcttccaca 9121 actcccacca gatggaagag gatggcgagg ataataccgt gactgaacat ttgggcgatc 9181 gcaccaatca gtgctaacga agtactagct gctgctgcta acaatatata ccccatgtgc 9241 ccaatggaac tatatgcaac catgcgcttg atatcttttt gagcaatagc tgtgactgcc 9301 ccgtatatag cactgactgc tccccaagtt gctaaactgg gtgcaagaat actccaagct 9361 tcgggaaaca gcgccatccc aaatcgtaaa acgccataag ttcctaactt tgctagcact 9421 ccaccaagaa gaatcgcaat cggggctgag gcttcaacgt aagcatctgg taaccaagtg 9481 tgtaagggaa caaggggaat tttgatacca aaccctagca cgattcctac aagtagaatg 9541 atttgcagtg ttgttgataa ggcttgagtc gagagtgtat tgtaatcaaa 
actagttgaa 9601 ccagtgagcc aaactgtacc cagaaatgtt gctagaatta atgctcctga aacagcggta 9661 taaattagga acttcatgcc agcataagcc cgcttttcac ctccccatat ggaaatcagg 9721 agataaaagg gtattaattc tagttcgtaa aacaggaaga agagcaataa attctgtgct 9781 gcaaatgcac ctgcaactcc tgcactgact aataagataa gggagtaaaa aaaccgggga 9841 cgttctgtct gttggttgct gctgtaaata gcaatccagg tgagcaaact atttaatagc 9901 agcatcagta tggaaagtcc atcaacccct aattggtaac ttaaaccaag ggtttcattc 9961 caaggtaagt attcctgcaa ctgcatgcca ggaagagaga tatcaaattt aaacaggagg 10021 aaaatattcc aaagaagaac taatccagta acggttaggg ctgttaaacg aacattaatt 10081 gcgggaatgg cacgaggtag gaaactaata acagcggcac ctaaaatagg tagccaaatt 10141 aaggtactga gcataaattt gttagtgatt tgactgatct aaagaactta aaaaaaccgc 10201 tccagacgca gagaagagcc agcgcgttgc ggagccagtg ctgaagacgg gtttcccgtc 10261 gcaggcatct ggcgttgggg ttccccccgt tgtagcgact ggcgtcacag aggaagagag 10321 aacagagaga tgcacaagct ttggacgaaa acaaatttta ttgttagtta ttgattgtga 10381 gttgactgct tttatgggcg caagcagggt aagcgcattt taataagcag tctggaaact 10441 aaaacatcaa atttaaaaga tgagatcccc agtgttgcca actcactacc attcctaaaa 10501 tacctactcc caatagcacg gtgaatgcat aaaactgggt ttgtccggat gtgctatatt 10561 tcaaactttc tccactcaat atggaaacta agccaactaa gttaacaatt ccatcgaata 10621 tgaggcggtc gatcatgtct gctagttttg aaattaagtc aacgctgaaa actatgctca 10681 cccgatacag ttttggagtg taaaagtcgt atgctagcaa gtcttgcaaa ggtttcaatg 10741 gtaagcgaat tggtttggga atgacattgc ctagataaat aacagcacta atgctgcacc 10801 caaaaacact tgaccaaatt agcaggagtg cgacatcttt gttgagagtt gcccaagttg 10861 gtagtagtga taagctttgt aacactaagg gtaaatggag agtaaagcca agcaaaatga 10921 tcgtgggtaa agccatttgc caactgactt caggtgagcg atcgctcatt tgctttggtt 10981 ttccaccaaa tattaaacta aattccctgg ttaagctaaa ggctgttaaa gtgttgacta 11041 ttataatgac tcccacaagc caaggttgtg ttgtccacag tcccgacgct aatttcagca 11101 atgcccaaaa gccgcctaac ggtggaaacc caattaatcc tagagtccct actatatatg 11161 ctatgcctga tactggacgg cgtgaccaaa gtccccccag aaggctgaca tcttgagtga 11221 tgctgttcca aacaattgca ccagtactca tgactaggag 
tgctgaggct aaggcgtgag 11281 tgagaactag taacagtgct gcttcatctt gttgtgttcc tacggcgata aatactaaac 11341 ccatgtatgt actgacggag tatgataagc agcgtttgat atcaacttgg gcgatcgcaa 11401 tcaaagaacc acccactgct gtcacaacac cgatagctac cattgcagaa gtcgctattg 11461 gcgacaagct gaaaacaggt tgcagtttaa tcagcaccca agcaccagtc gcaacgacta 11521 cagaattccg taaaattgta ctgggaacag gtccttccat tgcctcatcc aaccacagat 11581 gcaagggaaa ttgagcacac ttacccatcg gaccagcaat taatgccaga catgttaacg 11641 ttattaaggt tgggttgaca ttggcagtag tcgcccattc tgctaaatcg gtgtagttcc 11701 aagtcccagc ttgtgaccat attgccagaa ctcccatcag cagaaataaa tctcccaccc 11761 gtttggttaa gaaagcatct ctagcacctg tgacaaccaa aggttgacta aaccaaagtc 11821 ctactaataa gtaggttact aa // LOCUS NODE_2863_length_11839_cov_5.13891711839 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11839) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11839) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11839 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 3..182 /locus_tag="DP116_22425" CDS 3..182 /locus_tag="DP116_22425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130504.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S21" /protein_id="PRJNA477356:DP116_22425" /translation="MILGENEGIESALRRFKREVSKAGIFPDVKKHRHFETPLQKRKR KAVAKQKQRQNRFRY" gene complement(193..3993) /locus_tag="DP116_22430" CDS complement(193..3993) /locus_tag="DP116_22430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22430" /translation="MLKWLVKWIKKYLRFFFRKKQTRSVYPIEEQKMAPPPALTNADL EFLFTQLLEGVHQAKGEQWAIQYLQRMEKRISNERWINWLLDFGERLLKSPAPNNYLA EQMVQLGELGIGRIGDLSYDIGIRLLTRNLGNPYWENDQTQDTETITDTTLPSRFSEE RQNKSTEDSDNILRQLTPAVMPAPFLNPPQEEYTDNQEELIWEYDEPIVEFRTPDSAW TLSAKQEKEESDWVEQSVEPEAKTAARQDFSDFQSPVAVKLDELLVRLEHSTSLVQQL ASGLGYQSEISINTGLPINQQRTTLEQAEAWFYEGLKQAKAGNLSGAIASYNKAIEIH PNSQEYWFNRGLTLFYLGHFSEAIASYDKAIALKPDFYKCWYYRGGALGELGHFEDAI LCFDKAIEIKFNYPEAWSGRGIALQKLGRPLEAVASYDKAIILQPQDQDNWYYRGLAL AQDGRNYDAIASYEKALEIQNDFHLAWYKRGVELCELGEFEDAIASLEEATEIQNDFS EAWYALAGALNKVGRSEDAIASYEKATQIDPNSHEVWIDKGVVQFSLGRWDDAISSWE KALEIKPDYYLGWFNCAVALDNLGQREEAIACYDKAIEFYPQFELAWYNRAIVLFYLQ RFEEAIASYDSALQIKPDYWEAWIARGNAAENIVYRDNSHFTFLSPAELYERVYEEKL ASYDQGLKYVDQNTQPEGWGRLHLALGNAYYDRGKRHPTPSYYWQQALTAYNQALHTL TPEPFARLHLEVLQNLIKTLVVLGQTSQAQELHQYGTDFIQFLLSETTHSDEEKKQLA LKYTSFEQLAVDIAVQSGEIAQAIEIAERGKNACFTWLLYGWTDEIRSFSYQSIQQLL NPTTAIIYWHISPCALRTFIIKYESPEPIVVFTPVVNGVIDEVPLPQVVRRLVEFEDW LKDWHQQYREYQQLQTNDASDKSVHSWQAEMEQRLLNLKHILNISAIVNELEDITNLI LIPHRDLHRFPLHALFNISSLWEQEFSKQRNYTCTYLPSIQIGFSFKSQQLLQVQNQP LLIVEPQESINYPTPQFAGLESEVISQMFSYCTRIQGLQATKTQVENALSENYNIFHF TGYVTENSNQPHKSEFVLTGEEKLTIEEICKKPITSYDLITLSTWETVMTNNQNITTE HVGIVTAFLAGGVNHVLSTVWTVESAAIALVMIEFYRRLQQNKSAAKALAETTAWLKE LTAGELKQWYEELLNQLPQEGQKIRASLTTELYRSREMSAETKLYNHPYYWAAFKIAG KFSF" gene complement(4268..4954) /locus_tag="DP116_22435" CDS complement(4268..4954) /locus_tag="DP116_22435" /inference="COORDINATES: 
similar to AA sequence:RefSeq:WP_017317525.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22435" /translation="MFKRLIQWLKKFLQRLFGSNQTPAKTEGNIHKEPPPPLSDTDLE FLFTELLEGVHQARGQAWAQKWLNNIEHRVPEERWVQWLERFGARLLASPTPNNELAS RMVQLGELEIGEVGNAAYDIGMQLLTRTVGEPIWEYQGPDAVKQTPVTTQTQNEQPED MEIENLPEGEYQSVTLDELFVLIQQDENLRKQICEQLGMETDDPEIIIQALLDQFQAA NESTTNQAES" gene 5277..7742 /gene="dnaX" /locus_tag="DP116_22440" CDS 5277..7742 /gene="dnaX" /locus_tag="DP116_22440" /EC_number="2.7.7.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207639.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit gamma/tau" /protein_id="PRJNA477356:DP116_22440" /translation="MSYEPLHHKYRPKSFAELVGQEAIATTLTNAIRAAKIAPAYLLT GPRGTGKTSSARILAKSLNCLKSDKPTPQPCGVCDVCQGITKGYSLDVIEIDAASNTG VDNIREIIERAQFAPVQCRYKVYVIDECLTGDSLVLTDRGLVRIDDPNIKGKRVLSYS DSSLKWEFKKVLRWLDQGERQTLVIRTTNREIKCTGNHLIKTDQGWVPAKDVKEGMRI LSPGWEQMPVENGWQISNSILSPQWTTSLETVESVYLAGIEKVYDIEVADNHNFVANG LLVHNCHMLSQAAFNALLKTLEEPPKHVVFVLATTDPQRVLPTIISRCQRFDFRRIDL EAMAKHLSYIAYQENINITIEAITLIAQIAQGGLRDAESLLDQVSLLSGDVTPERVWD LVGSVSEHDLLFLLNAIAKDNPEAVLDCTRKILDRGREPLIILQNLAAFYRDLLIAKT ASGRNDLVACTPQTWQALIDFAQQFDTHTILLGQQHLRTAEVQLKNTTQPRLWLEVTL LGLLPSAHISVPQNGRGAVYPPRISPPTVSPSPGKSEQNSITSLEPPSVSAPENNSKT VEQVKTDQTDTFSKTPTTPSSPEETPSSRHPPEPRPEIEAPTSVSQKEEVSQSTEIED LTQVWQQVLSNLEEISKQALLRQMCHLIEFDGTVARVAVKEKWYKQVQTYHPIIVAAF KKTFKCDVKINLEIATTSTSIPPQTSPQKKPKTRPSYEPPSAPSPEVPKSTKPPTTKS PNAANTAAVQSSDTTHRSDKPSQTQPSASTPDWEVDEVAAAAQRLAQFFDGEVIRLTD DIERATSTTPSEWEGEAEADDEF" gene complement(7818..9236) /locus_tag="DP116_22445" CDS complement(7818..9236) /locus_tag="DP116_22445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457511.1" /note="Derived by automated computational analysis using 
gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_22445" /translation="MPANSWPENDSYNELDPLDSLFSDLSGLEELQEEETSASVSLPY RFQGRRRKAALVLTIVWSGTIALHLVSWGTLFVLGLTTIIGFHALVIVFARAKPHPEQ MQEDLPSVSVLVAAKNEEAVIGRLVKNLCSLDYPGGEYEVWVIDDNSSDKTPQLLSEL AQKYDQLKVLRRKPEAGGGKSGALNQVLALTKGEILAVFDADAQVSPDLLQRVIPLFQ RENVGAVQVRKVIANAKENFWTKGQASEMAFDTFMQHQRNANGGVCELRGNGQFVRRK ALLCCGGWNEETITDDLDLTLRLHLEKWDIQCVFNPTVEEEGVTNAIGLWHQRNRWAE GGYQRYLDYWDLILRNRMGTRKTWDLFVFLVLQYILPTAAVPDLLMAIARHRPPIFSP VTGLTITLSVTGMFAGLWRIRRQDKEVKLSTYLLLLLGTLRGTLYLLHWIVVMSSTTA RMSVRPKRLKWVKTVHQGEDGE" gene complement(9190..10164) /locus_tag="DP116_22450" CDS complement(9190..10164) /locus_tag="DP116_22450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317522.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="radical SAM protein" /protein_id="PRJNA477356:DP116_22450" /translation="MAPLVANYYLTYRCNARCHFCNIWSLEPGKEADFETIKHNLNDL RRLGVKYVDFTGGEPLLRYDVREIYTEAKRLGFATSITTNTILYPKKAKEIQGLVDFL NFSLDGADADTHDQSRGVKIFDNLVESVAIAKSLGEYPVLNHTVTAQNYHRIEEIGKL GKDLGVRVWLNPAFTAYEHYNSNKNPTPEMVTAIQVTAKKYNNVGYNKAALAFIEAGG NDTKNPRCKAVDAVIAISPNDELLLPCYHFAQSGVPINGRLYELYRESEEVEKYRQSQ GKLQVCEGCTVWCYLIPSFFMGVDKYWWLNQVTYASEFLARKRFLQRA" gene 10598..10969 /locus_tag="DP116_22455" CDS 10598..10969 /locus_tag="DP116_22455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112342.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22455" /translation="MSLEQLHPATQQQTSVYLPYIQGNKRNFLPHAITLYQKGILEGY RKIEGSDNIPFVATWNVATLPSDLTRCRMQFDGNAELSYELMMASFEFINFLIEFIEN YERYRVTDFSQVFYRKLLRLE" gene 11425..>11839 /locus_tag="DP116_22460" CDS 11425..>11839 /locus_tag="DP116_22460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874768.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22460" /translation="MPKSAKYLLIGSTEAYCGKSATVLGLSYQLQQKGLDIAYGKPLG TCLSESTGTMVEEDVQFIADSLKLSKNRVVPTMLALNEVAVQKRLQGLDKTNYQKLLA QEYSQIQVGDLVLLEGPGNLEEGSLFDLSLLQVAEV" BASE COUNT 3336 a 2631 c 2421 g 3451 t ORIGIN 1 aagtgattct tggcgaaaat gaaggaattg agtcagccct ccgcagattt aagcgagaag 61 tctccaaggc aggaattttt cctgacgtga agaagcaccg tcactttgaa acaccccttc 121 aaaaacgtaa acgcaaagca gttgccaagc aaaagcagcg tcagaatcgt ttccggtatt 181 gatgggaagt gtttaaaaag aaaacttacc tgcaatctta aatgctgccc agtagtaagg 241 gtgattatag agctttgttt cggcggacat ttcacgactt ctatataatt ctgttgttaa 301 acttgcccta attttttgcc cttcttgagg tagttggttg agtaattctt cataccattg 361 tttcagttca ccagcagtca gttctttaag ccatgcggtt gtttccgcta atgctttggc 421 tgctgattta ttttgttgga gtcgtcgata aaactctatc ataaccaaag caatagcagc 481 tgattccaca gtccacacag tactcaagac atgattaact cccccagcta ggaaagcagt 541 gactattcca acatgttcag tcgtgatatt ttggttgtta gtcatgactg tttcccaagt 601 tgaaagagtt ataaggtcat aacttgtaat cggctttttg cagatttcct ctatagttaa 661 tttctcttca cctgttaata caaattctga tttgtgtggt tgattggaat tctccgtaac 721 atagcctgta aagtgaaata tattgtaatt ttcagataaa gcattttcta cctgtgtttt 781 tgtcgcttgt aagccttgaa tgcgtgtgca ataactaaac atctggctaa tgacttcaga 841 ttcaagtcca gcaaattgtg gtgtcgggta atttatactt tcttgaggtt ccacaatgag 901 taacggttgg ttttgcacct gcaagagttg ttgggatttg aaagaaaacc cgatttggat 961 gctaggtaga taggtacatg tgtaattacg ctgctttgag aactcttgtt cccaaagaga 1021 
ggaaatattg aaaagggcgt ggagagggaa tctatgtaag tcgcggtggg gaattaaaat 1081 cagattggtg atgtcctcaa gttcattcac aattgcagaa atgttgagga tatgttttaa 1141 attcaataac ctctgttcca tttctgcttg ccatgaatgg acacttttgt ctgatgcatc 1201 gttagtttgg agttgctgat attccctgta ttgttgatgc caatctttta gccaatcttc 1261 aaattcaacc aagcgtcgta ccacctgggg taaaggcact tcatcaatca ccccattaac 1321 tacaggtgta aaaacgacaa taggttctgg agactcgtat tttataatga aggtgcgtaa 1381 ggcgcaaggg ctaatatgcc agtaaataat tgcagtggtg ggattgagaa gttgttgaat 1441 tgattgatag ctaaatgatc tgatttcatc tgtccaacca tacagaagcc aggtgaaaca 1501 agcatttttt ccacgttcgg cgatttctat cgcttgcgct atctcaccag actgtacggc 1561 tatatcaact gccaactgtt caaaacttgt atacttgagt gctaactgtt ttttctcttc 1621 atcagagtga gttgtttcgc tcaataagaa ttgtataaaa tctgtaccat attgatgtaa 1681 ttcttgcgct tgggaagttt gccccaaaac cacaagtgtt tttatcaggt tttgcaaaac 1741 ctccagatgt agtcgggcaa agggttctgg ggtcagagta tgaagcgcct gattgtacgc 1801 agtgagggct tgttgccaat aataggaggg tgtaggatgt ctcttacctc ggtcgtaata 1861 ggcattgcca agtgctaaat gcaatctgcc ccaaccttct ggctgtgtat tttgatcaac 1921 atatttcagc ccttggtcat aacttgccaa tttctcttca tagacgcgtt catacagttc 1981 tgctggactc aagaaagtga agtgtgagtt gtccctataa actatatttt ctgctgcatt 2041 ccctcgggcg atccacgctt cccagtagtc aggtttgatt tgtaaagcgc tatcatatga 2101 ggcgatcgcc tcttcaaatc tctgcaaata aaatagcact atagcccggt tgtaccaagc 2161 taattcaaac tgaggataaa attctatcgc tttatcataa caagcgatcg cctcttcacg 2221 ctgtccgaga ttgtccaacg ccacagcaca gttgaaccag cctaaatagt agtcgggttt 2281 gatttctaaa gctttttccc aggatgaaat cgcatcatcc cagcgtccca aactgaactg 2341 taccacacct ttgtcaatcc agacttcatg agaatttgga tcgatttgag tcgctttttc 2401 ataagatgcg atcgcatctt ctgatcttcc tactttattc agcgcaccag ccaaagcgta 2461 ccaagcttcc gagaagtcgt tttgaatttc tgttgcttcc tctaagctag caatagcatc 2521 ttcaaattct cctaattcac aaagttccac acctcgtttg taccaagcta ggtgaaagtc 2581 attttggatt tctaaagctt tttcatagga ggcgatcgca tcataatttc gtccatcctg 2641 cgccaatgcc agacctcgat aataccaatt gtcttggtct tgtggttgca gtatgattgc 2701 tttgtcgtaa 
cttgcaacgg cttccaatgg acgccctaat ttctgcaacg ctatacctct 2761 gccagaccaa gcttctgggt aattaaactt aatttctatc gctttatcaa agcacaaaat 2821 cgcatcttca aagtgtccta gttcacctag tgcgccgcct cggtaatacc aacatttata 2881 aaagtctggt ttgagggcaa ttgctttgtc gtaagatgca atcgcctctg aaaaatgtcc 2941 taaataaaat agtgttaacc ccctgttaaa ccaatattcc tgcgagttgg ggtgaatttc 3001 tatagctttg ttgtaagatg caattgctcc cgataaatta cctgccttag cttgcttaag 3061 accttcgtaa aaccaggctt ctgcttgttc taaagtcgtc ctttgttgat ttattggtaa 3121 acctgtattg attgatattt cgctttgata tcctagccca gaagccagtt gctgaactaa 3181 acttgtactg tgttctaacc taactaacaa ttcatccagc ttgacagcca ctggtgactg 3241 aaaatcggaa aaatcttgcc ttgcagcagt ttttgcttct ggttccacgc tttgttctac 3301 ccagtcagat tcttccttct cttgtttagc tgacaaagtc cacgcagaat ctggtgttcg 3361 gaattcaact attggttcat catattccca tattaattcc tcttgattgt cggtatactc 3421 ttcctgtgga ggatttaaaa aaggtgcagg cattactgct ggtgttagtt gcctgagaat 3481 attgtctgaa tcctctgtgc ttttgttttg tctctcctca gagaatcgac ttggcagtgt 3541 tgtatcagtt atggtttctg tgtcttgtgt ttgatcattt tcccaatacg gattgcctaa 3601 gtttcgcgtc aataatcgta tgccaatgtc gtaagataaa tctccaatcc tcccaatacc 3661 taactcgcct agttgcacca tttgttctgc taagtagtta tttggtgcag gtgattttaa 3721 caatctctcc ccaaaatcta gtagccaatt tatccagcgt tcattggaga tacgcttttc 3781 cattctttgt aaatactgaa tcgcccactg ttcccctttt gcctgatgca caccttccag 3841 cagttgggta aatagaaatt ctaaatccgc atttgtcaat gcgggtggag gtgccatctt 3901 ttgttcctct ataggataga cagaacgagt ctgctttttc ctgaagaaga acctcaaata 3961 cttcttaatc cacttgacta gccacttgag catttcggga attattggag attggattgg 4021 tcatcataag attttagcga gcacagtgtt aaaaaagtta tattgacttt aaaaacctat 4081 aataattgaa tgagcgatag tacggtcagt tgggaatcta aatagcatac taacagtcca 4141 aaaaacaagg tgattgacga tttacagtat ttttacctag tcaagcaact aatagcacca 4201 aattttaccc atacggcgaa aatttagtat ttgtatatac aaaatgcttt cagagtcagg 4261 cattgaatta cgattcagct tgatttgtag tagattcatt cgctgcttgg aattgatcaa 4321 gcaaagcttg gataattatt tctggatcgt ctgtctccat tcctaattgc tcgcaaattt 4381 gcttgcgtaa attctcgtct 
tgctgtatta ggacaaataa ctcatccaaa gtaacgcttt 4441 ggtattctcc ttctggaaga ttttcaattt ccatatcttc gggttgctca ttttgagttt 4501 gtgtcgttac aggagtttgc ttgacagcat ctggtccttg atactcccat attggctcgc 4561 caactgtgcg cgttaacaac tgcatcccaa tatcataggc ggcgtttccc acttctccaa 4621 tctccaactc acctagttgt accattcggg aagctaattc gttattgggt gtgggtgatg 4681 ctaacaatct tgcaccaaaa cgctccaacc actgcaccca acgttcttct gggacgcgat 4741 gttcaatatt atttaaccat ttctgcgccc atgcttgccc tcgtgcttga tgcacacctt 4801 ctaatagttc tgtaaacaga aattccaaat ctgtatcact taggggtggg ggcggttctt 4861 tgtgaatatt tccctctgtt ttggctggag tctgatttga gccaaacaag cgttgcaaaa 4921 attttttgag ccattgaatt aaccgcttga acatctcccg caccggtaag cgttggtggt 4981 catcataaga ttgtagcggt gtataaggac aagctgctat gcgtttgtca actacactca 5041 catatcttac tcatatcaag taaaatgtac agacaaaaaa gttttctgtc tctatagtga 5101 agaaaaatta acacaaagta gaaaaatatc cccaagctag actttctgga taatatggta 5161 tatttgtact aaaaaacaat catcaatgtt tgaagttacg tcataaattt aaaatgatct 5221 aagcccaaat cgtgggtgct agttattaac catcaaaggc attgacatca ttatccatgt 5281 cttacgaacc cctgcaccac aagtatcgcc caaagagttt tgctgaactg gtgggacaag 5341 aggcgatagc taccacccta acaaacgcta tccgcgccgc caaaatagcc cctgcttacc 5401 tgttgactgg tccaagaggt acagggaaaa cttccagtgc tcggattttg gcaaaatcgc 5461 ttaattgtct caaaagtgac aaacccaccc ctcaaccctg tggcgtgtgt gatgtctgtc 5521 aaggaattac taagggctat tctttagatg ttattgaaat agacgctgcc agcaatactg 5581 gtgtcgataa tattcgcgaa attatagaaa gagcgcaatt tgcccctgtg caatgtcggt 5641 acaaagttta tgttattgat gagtgcctga ctggagattc tttggtcttg actgataggg 5701 gacttgttag aatagatgat cccaatatca aaggtaaaag ggttctcagt tatagtgatt 5761 catcgctcaa gtgggaattc aagaaagttc tgagatggct agatcagggt gaacgtcaaa 5821 ctctggttat tagaacaact aaccgagaaa tcaaatgtac gggcaatcat ttaattaaaa 5881 cagaccaggg atgggtacca gcaaaagacg taaaagaagg aatgaggata ctatcccctg 5941 gatgggagca gatgccagta gaaaatggtt ggcaaatctc aaacagtatt ctatcccctc 6001 aatggactac aagtttggaa acggtcgagt ctgtttacct cgctggaatt gagaaagtct 6061 atgacattga agtggcagac aaccataact 
ttgttgctaa cgggttactt gtccataact 6121 gtcatatgct cagccaagcc gcattcaatg cgctactgaa gacactagaa gaaccaccga 6181 aacatgttgt ctttgtacta gcaacaactg acccgcaaag agtcttacct accatcatct 6241 cccgatgtca acgcttcgac tttcgcagaa tagatttaga ggcgatggcg aagcatttga 6301 gttatatagc ttaccaagaa aatattaaca ttaccatcga agcgataact ctaatagccc 6361 aaattgctca aggcggattg agagatgcag aaagtctcct cgaccaagtc agtttattat 6421 cgggtgatgt gacacctgaa agagtttggg atttagttgg ttcagtcagc gaacacgatt 6481 tacttttctt gttgaatgcg atcgcaaaag acaacccaga agctgtttta gattgcactc 6541 gtaaaatctt agatcgtggt cgagaacctc taattattct ccaaaatctc gctgcatttt 6601 accgagattt actcatcgcg aaaactgcgt ctggtcgcaa cgatttggtt gcttgtactc 6661 cccaaacttg gcaagcactg attgattttg cccaacagtt tgacacacac acaatcttgc 6721 taggacagca acacttgcga accgctgaag tgcaactgaa aaacaccaca caaccgcgct 6781 tgtggttaga ggtgacgttg ctgggattgt taccctcagc tcatatttct gttccacaaa 6841 acggtagagg cgcagtttac ccgccaagaa tttcgccgcc tacggtttca ccatctccag 6901 gaaaaagtga acaaaactca attacaagcc ttgaaccacc ttcagtatca gcaccagaaa 6961 acaattcaaa aactgtagaa caagtcaaaa ctgaccaaac cgataccttc tcaaaaactc 7021 ctactacccc ctcatcacca gaggaaaccc cgagttctcg tcatcctcca gaaccacgcc 7081 ccgaaataga agcacctacc tcagtatccc aaaaagagga agttagtcag agtacagaaa 7141 tagaagactt aactcaggtt tggcagcaag tgctgagtaa tctcgaggaa atctcaaaac 7201 aagcgctatt gcgtcaaatg tgccatctta tagaatttga tggtactgta gctcgtgttg 7261 ctgttaaaga aaaatggtat aaacaagtac aaacatacca ccccataatc gtagcagcat 7321 ttaaaaaaac tttcaaatgc gacgtcaaga taaatctgga aatagcaacg acatcaactt 7381 ctatcccacc tcaaacttct cctcaaaaga agcccaaaac tcgtcccagt tatgaaccac 7441 catccgcgcc ttctccagaa gttcccaaat caaccaaacc accgacaaca aaatctccaa 7501 atgcagcaaa tactgcggcg gtgcaaagtt cagacacaac tcatcgcagt gataagccct 7561 cacaaactca accatccgca tcaacacctg attgggaagt tgatgaagtc gcagccgccg 7621 cacaacgtct ggcacaattt tttgatggag aagtcatccg attgacagat gacatagaac 7681 gagcgacttc tacaacccca agtgagtggg aaggtgaagc agaagctgat gatgagtttt 7741 aaatcagtga acagtgaaca gtgaacagtg aacaaaataa 
ctaccgataa ctgataactg 7801 gtaactggta actctgttca ttctccatct tcaccttgat gtacagtttt cacccatttc 7861 aagcgcttgg gacgtaccga catccgggcg gtagtgctac tcatgactac tatccagtga 7921 agcaaatata atgtgccacg caaggtccca agaagcagca gaagataggt agatagtttg 7981 acttccttgt cttgacgtcg tatacgccac agaccagcaa acatccccgt tactgacagc 8041 gttatcgtca agcctgtcac tggactgaaa atgggtggac gatgtcgtgc gatcgccatc 8101 aacaaatctg gcacagccgc tgtgggtaag atgtactgca gcaccagaaa cacgaataga 8161 tcccaggttt tgcgcgtacc catgcggttt ctgagaatca aatcccagta atccaaatac 8221 cgctgatacc caccttctgc ccaacggttg cgttgatgcc acaagccaat ggcattagtc 8281 acaccttctt cttctacggt tgggttgaac acacactgaa tatcccactt ttctagatgt 8341 aagcgcagtg tcagatccaa atcatcggta attgtttctt cattccagcc accgcaacac 8401 aggagtgctt ttcgccggac aaattgaccg ttaccgcgca gttcgcatac accaccatta 8461 gcattgcgtt ggtgttgcat gaatgtatcg aatgccattt ctgacgcctg accttttgtc 8521 cagaaattct ctttggcatt ggcgatcacc tttcgtacct gcactgcacc tacgttctct 8581 cgttgaaaca acggtatgac acgctgtaac aagtctggtg aaacttgggc atcggcatca 8641 aataccgcca aaatttctcc ttttgtcagc gctaaaacct gattcaacgc tcctgatttt 8701 cctccccctg cttctggctt tcgtcgtagt actttcagtt ggtcgtattt ctgggctagt 8761 tctgatagta actgcggcgt tttatcgctg ctgttatcat caatgaccca gacttcgtat 8821 tcccctcctg ggtaatctag actacaaaga ttcttgacta atctgccaat aactgcttct 8881 tcatttttcg ctgccaccaa aacagataca gagggcaaat cttcctgcat ttgttctggg 8941 tggggttttg ctcgggcgaa cactatcact aaggcatgaa atccgatgat agtcgtcagt 9001 cccaagacaa acagagttcc ccaagaaact aaatgcagag cgatcgtacc actccagact 9061 atcgtcaaaa ctagagcggc tttgcgtcta cgaccttgaa accgataggg aagagacacg 9121 cttgctgatg tctcctcctc ctgtaactcc tctaagccag agaggtctga aaatagggaa 9181 tcgagcggat caagctcgtt gtaagaatcg ttttcgggcc aggaattcgc tggcataggt 9241 tacttgattt aaccaccaat atttatctac acccataaag aagctgggta ttaggtaaca 9301 ccaaactgtg caaccttcac atacctggag cttaccctga gattggcggt atttttcaac 9361 ctcctcagac tcgcgataca gctcgtaaag tcgcccatta atgggaactc cactttgggc 9421 gaagtggtag caaggtagca gcagttcatc gttaggagaa atagcaatga 
cagcatctac 9481 cgctttacag cggggatttt tagtatcatt gcccccagcc tcaataaatg ctagtgcagc 9541 cttattgtaa ccgacattgt tatatttctt tgcagtgact tgtattgccg ttaccatttc 9601 tggtgtcgga tttttgttcg agttgtagtg ttcgtaagct gtgaaggcgg gatttagcca 9661 aacccgaaca ccaagatctt tgcctaattt ccctatttct tctatccggt ggtaattttg 9721 agcggtaaca gtgtggttga ggacagggta ttctcctaga gacttggcga tcgccaccga 9781 ctcaaccaag ttgtcaaaaa ttttcactcc ccgagactga tcatgagtgt ccgcgtctgc 9841 accatctaag gaaaaattta aaaagtctac taatccctga atttccttag ctttcttcgg 9901 atacagaata gtgtttgttg ttatactcgt ggcaaaaccc agacgtttcg cctctgtata 9961 aatttctctg acatcataac gcaggagtgg ttcaccgcct gtaaagtcca catatttgac 10021 tcccaagcgg cgtaaatcgt tcaaattgtg cttgattgtt tcaaaatctg cttctttacc 10081 aggttcaagt gaccaaatat tacaaaaatg acatctagcg ttacaacggt aggtcaagta 10141 gtagtttgca accagtggag ccatagggta aaaccactta agtgttcttt aggacatctt 10201 taaacaaaaa gtcaaaactg aaaagtgagc cagtgcggta ctgagtcagc gctgttggta 10261 ggataactcg aaggcaggcg actgacgtgc ataatttctc agcataaagg aaccagtgcc 10321 ttgaggaacg taagcgccct ggttaggttc ccccgagtgt acacacagcc tttgtcggtc 10381 atgccttcct tgtagacgcc tatgggcgac ttcccgtagg gtagtatctg gcgtcaactg 10441 gtgttaggca tctgcttcgc tcatacccgt aaggatcaaa acacaaaaat tattatcctc 10501 acctcaatga atactaaaca aacatttttt aggaatacta ttgacacaaa tcattaaact 10561 acgcaaactg tcaagtaact taaggagtat ttataatatg tcactcgaac aactgcaccc 10621 cgctactcaa caacaaacaa gcgtctattt gccttacatt cagggtaaca agcgtaactt 10681 cttgccccat gcaattactc tttatcaaaa gggtattttg gagggatacc gcaaaataga 10741 aggcagtgat aacattccct ttgtcgccac ttggaatgtt gcgactctcc cctcagactt 10801 aacccgttgc cgaatgcagt ttgatggcaa tgcggagttg agttatgaac tcatgatggc 10861 aagctttgag tttattaatt ttctcattga attcatagag aattacgagc gctatcgcgt 10921 cactgatttt tctcaagtat tttatcgcaa actcttgcgt ttagagtaag gtattatacg 10981 ctattttaac caccattatc tatcattgtc ggttttgcct atgctaccgt aaaaaaagct 11041 gttattagaa tcatcaacac ttgcttaagt tgagcaatca ttacctatgg tcaagtaggt 11101 ttacaacttt aactgcaagg ttcgacctaa aatgagtaaa 
acctacttgg gttgtgaaaa 11161 gctgtcagca attatttatg agattatctt gcggaaaacc ctgtgccttg ctggacttag 11221 cgtctgggca ggagataggg gtatacctgt gtgccacggg caatgcccgc aagccttgct 11281 cctgcgtgcc tcctgcagag gaggtgtgcg aatcgcgcgc gcagcgtatc cgtaagtttt 11341 gtggataggc tatctacagc taaaatcaaa ctgattgcta aacgttgaaa gttacctaaa 11401 cttttgtaaa ttaggagtgc atgtgtgcca aaatccgcta agtatttatt aattggatct 11461 acggaggctt actgcggtaa atctgcaaca gtcctgggtt tgtcttatca actacagcaa 11521 aaggggctgg acattgccta tggcaaaccg ctaggtactt gtttaagtga atctacagga 11581 acgatggttg aggaagatgt tcaatttatt gctgatagcc tcaagttatc aaaaaaccgc 11641 gttgtaccca cgatgctggc tttaaatgaa gtcgctgtgc aaaaacgttt gcaagggtta 11701 gacaaaacca attatcaaaa attgctagca caggaatatt cgcaaataca agtcggcgac 11761 ttggtgttgc tagaaggtcc tggtaacttg gaagaaggca gtttgtttga tttgtctttg 11821 ctgcaagtag cagaagtgg // LOCUS NODE_2866_length_11825_cov_5.07102811825 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11825) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11825) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11825 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..480 /locus_tag="DP116_22465" CDS <1..480 /locus_tag="DP116_22465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457400.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydantoinase/oxoprolinase family protein" /protein_id="PRJNA477356:DP116_22465" /translation="SLLASTPPASSEEEIVLKVNLKYEGTNSTLTVDFTSDAAVMRQD FEAEHKSRYGFIQLEKTLIVESASLEVIQKMDTPEESLITRTRSIDEPPASVETVRMF TADKWHDTPVYRREYLQPEDSISGPAIIVEKISTIVVEPLWEARLTERNHLILQRKN" gene 879..1193 /locus_tag="DP116_22470" CDS 879..1193 /locus_tag="DP116_22470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015078326.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22470" /translation="MNANKINSNGRFEINDLIDDAVKNAVARRSQVIDSEDALLVLAE TEAQSIIGGAAAAISESKVSPLITGKIAVSEPTPKPPIKAVVCPPIIVGLIAVDPIAS KA" gene 1238..1675 /locus_tag="DP116_22475" CDS 1238..1675 /locus_tag="DP116_22475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012595959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogen fixation protein" /protein_id="PRJNA477356:DP116_22475" /translation="MENIATENTTLCPSARPESADSVVFGVVSGTVTKPRIAYLKQPQ PVTDELIAKSSPATPAEIFRTAGPCVESGCMHFDGKDCRLAQRIVENLSAVAEELPPC SIRRNCRWWEQEGKTACMRCPQVVTDSYNPSQLMREVATPTAL" gene complement(1774..2256) /locus_tag="DP116_22480" CDS complement(1774..2256) /locus_tag="DP116_22480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NifU family protein" /protein_id="PRJNA477356:DP116_22480" /translation="MTDDTLEELVKEISRYEAIISEWDETYRGVVVGLKRAIEALHKE ALTRLIKTVKQESMPALRSAVKDEVVYGVLLYHELVKPPKPPLTQRVQQALDEVRPGL KSHNGDVELVDIKPPDTVEVKLIGACGNCPTSTLTLSQGIEQAIKSYCPEIVNVVAVR " gene complement(2316..2540) /locus_tag="DP116_22485" CDS complement(2316..2540) /locus_tag="DP116_22485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320976.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22485" /translation="MFELGWFSAKLFLQGKLIRDPVHFVQQTAIGTSIGLLLLVLLAF VKLPLLLSVVISSLVTGTMMPFLLKDFKAK" gene complement(2553..2789) /locus_tag="DP116_22490" CDS complement(2553..2789) /locus_tag="DP116_22490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747423.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22490" /translation="MTDLFTSLIAGDINLITGLILLVVAILFSIVGGAIGGIMLAGKE FGYQFSATLGGLLAPAGVIPVVILGLVVLKVLIN" gene 3728..4717 /locus_tag="DP116_22495" CDS 3728..4717 /locus_tag="DP116_22495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126474.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrogenase" /protein_id="PRJNA477356:DP116_22495" /translation="MTNVLWLQGGACSGNTISFLNAEEPTVCDLIADFGIKILWHPSL GLELGENLQALLWDCISGKIPLDILVFEGTVVNGPKGTGNWNRFAERPMKDWLLDLSK VAKFVVAVGDCATWGGIPAIAPNPSESQGLQFLKRQKGGFLGTEYLSKAGLPVINIPG CPAHPDWISQILVAMHIREPPKVIATGRLSDITLDEFHRPQTFFKSFTQTGCTRNIHF AYKASVAEFGQRKGCLFYDLGCRGPMTHSSCNRILWNRVSSKTRAGMPCIGCTEPEFP FYDLEPGTVFKTQTLMGAPKDIPPGMNRQDYALLTLVAKNMVPTWAEEDIFTV" gene complement(5069..5557) /locus_tag="DP116_22500" CDS complement(5069..5557) /locus_tag="DP116_22500" /inference="COORDINATES: protein motif:HMM:PF08239.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22500" /translation="MAYLLKVTYSQGTNFKQSLKDSSDLGLCEKYNVSSGKSFRVKER VEADKDHYRVTLFDKLGNSGCPQYDTWYVFKTHVDIQPETSGSVAEVVVINASSGLNV REQPDTSARELGKIPNGSRVSVYGEGKDNDGFRWIKVKSNQWVASEGWVATEYLKILS VR" gene complement(5663..6442) /locus_tag="DP116_22505" CDS complement(5663..6442) /locus_tag="DP116_22505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315128.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22505" /translation="MNLKDQNWKNFTTNHLRDWHGIWTRYSPEGEVTESFQSLRSFQS HSKETEIVQKNHYAYSDGRRVEQSWEYNQLSNSLSNGLFHPQNESMRGIFFESGHAAW VSTKLKTDSYFAVELFFKIQELRHSVGIVYDESGRLFRTANIREDATGFPSQYWSNEI NQLPERDFSGNWQGTAVTITPDLKISEPVVTQLHWIGEGHKTFFLPDGVSISCPGKVS VGISFTMAANWLVKTSEMHQLLVNYDECGDFSALTLELLYL" gene 6513..6983 /locus_tag="DP116_22510" CDS 6513..6983 /locus_tag="DP116_22510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrogenase maturation protease" /protein_id="PRJNA477356:DP116_22510" /translation="MLTIIGCGNLNRSDDAVGVLIAQRLQQYLAQYPLPHVRVYDCGT AGMEVMFQARGSKKLIIIDASSTDSEPGAIFKVPGKELEALPEPSYNLHDFRWDHALA AGRKIFKEDFPLDVTVYLIEAENLDFGIDLSPVVQHSADLVFEEITAIVRQSKK" gene 7105..8571 /gene="xylB" /locus_tag="DP116_22515" CDS 7105..8571 /gene="xylB" /locus_tag="DP116_22515" /EC_number="2.7.1.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015156466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="xylulokinase" /protein_id="PRJNA477356:DP116_22515" /translation="MTDVVVGLDLGTGGVRAIAVDLQGQIIAKETRTYPLLIPQPGWT EQNPSDWVEASLDALFDVAQQLDGHQVIALGLSGQMHGMVPLDAEGRVIRPAILWNDQ RTGKAVDAIEAIIPRQELIQRTGNPAITGFQLPKLVWLRTEEPQAYARLWQILLPKDY IGYVLTGELVTEPSDASGVGCLNLATRQWDTDILNALNINPALFPPVVESTAIAGRLK SEIAARVGLPAGLPVVAGGGDNAAAAIGLGISSSNLNRGSLSIGTSGVIFAPCERPIP DPEGRVHLFCHVDGGYHQLGVTLAAGGSLRWYRDTFAPQITYTELMDMAERSLPGARG VLFMPHLSGERSPHLDPDTRGAWVNLSLAHTQADIIRAVLEGVAFSLRAALEVISEIT PVHQLLATGGGARSNIWLQILADVLQTELITPKTEEGAAYGAAILAMVGVGAYPNLEA AFKILPQDASVVQPHVNAVYEAGFKRYTLLYDALKAVR" gene complement(8635..9033) /locus_tag="DP116_22520" CDS complement(8635..9033) /locus_tag="DP116_22520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010479853.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nuclear transport factor 2 family protein" /protein_id="PRJNA477356:DP116_22520" /translation="MKTDNLRVNQLSPETYEWYLKYLEALDSLHIEAYSRFLADDCSV QSNNNPPMEGKQVIMQGLAAYWKTFASLEHDLLNIYGSDSSFVLEALNHYKRNDGKPV TVRAVAFTDRNEEGLVTSVRFYTDTTPLFA" gene 9321..9890 /locus_tag="DP116_22525" CDS 9321..9890 /locus_tag="DP116_22525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860382.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_22525" /translation="MVSTIKRRYTLDEYRALEEKMEGRSEYRDGEIVPMPGGTLKYSR ISGNIFAFLKFLLRDTQFEPINSDLRLWIPEHRRGVYPDVMIFEGEPQLNDERLDEVL NPILIVEVLSPSTADYDRQNKFRIYRSIPSFREYLLVEQDEPFVERYSKQTQGWLLTE FNGLERSISLESVGMELPIAEIYRGVVFE" gene complement(10700..11530) /locus_tag="DP116_22530" CDS complement(10700..11530) /locus_tag="DP116_22530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015156655.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_22530" /translation="MTAKAQDRFAGKTILITGGAGDIGKATAHRFAGDGADVVLLDLN EPKMADVAEELKKYNVSVGTFRCDVTVCDDVAKAFTSAVEQLGRIDYVFNNAGYQGVF AKTDEYPEDDFQKVIDINVVGVFHILKAAAQHLRFSPWETPRANGGGGAIVNMASYAG VVGPPNMLAYAASKFAVIGMTQTAAKDLAPYGIRVNALSPALIGPGYMWTRQTELQAA VGSQYFDANPKVVEQQMIDSVPMRRLGSLEEVANGVAFLMSDEASYITGFNLEVTGGQ " BASE COUNT 3382 a 2537 c 2469 g 3437 t ORIGIN 1 tcactgcttg cttcaacccc acccgcaagc agtgaagagg aaatagtcct aaaagtcaat 61 ttaaaatacg agggaactaa ctctaccttg actgttgatt ttacctcaga tgcggcagtg 121 atgcgacaag actttgaggc tgaacataaa tctcgttatg gtttcattca attagaaaaa 181 accttaattg ttgaatcagc aagtctggaa gtcattcaaa aaatggatac tcccgaagaa 241 tccttaataa ctcgtactcg ttctatcgat gaacctcccg cttctgtgga gacagtaagg 301 atgtttactg ctgacaaatg gcatgatact cctgtttatc gacgagaata tttacaaccg 361 gaagatagta ttagtggacc tgcaattatt gtggaaaaaa ttagcacgat tgtagttgaa 421 ccattatggg aagcaaggtt aactgaacgt aatcatttga tattgcaacg taaaaattga 481 ttaattcata tcttgcaccc attctcccag cactcgttcc caggcttagc ctgggaacga 541 taaatcgagg ttctgcctct gattgagaat agtgcaagat ctcagctata gaacacgagg 601 ttttttgtga cattcgagag atattcggtc gtgcatctct atatatatct gtgtcaaaag 661 aattactgaa tgaaaaagca agaataaact tgatacaaaa tcctgggttc gcaagaccct 721 ttattatcaa gctttttgat catcattgtt aaaaaatatt atctattttg caaaataagt 781 tgtctgttat gggtataatt ctgtctcgta acaaattagg caattaagcg 
gttcagtttt 841 tgaacctaac accactcata attcccagga aagtaaaaat gaacgcaaac aaaataaact 901 ctaacggcag atttgaaata aatgacctga ttgatgatgc cgtgaagaat gcagtagcac 961 gacgcagcca agttatagat tcagaagatg ccttgttagt tttggctgaa acagaagcgc 1021 aaagcataat aggtggtgcg gcggcagcaa tttccgaatc taaggtttcg ccgcttataa 1081 ctggtaagat agcagtctcc gaaccaactc ccaaaccacc tatcaaagct gttgtttgtc 1141 cgccaattat agttgggcta attgctgttg acccaatagc ctcaaaagca taatcttcga 1201 gccttaacaa agatactctt cgttggaggt tcgattgata gaaaacattg ctactgagaa 1261 tactacactt tgccccagcg ctagaccaga atcagcagat agtgttgtct ttggtgtcgt 1321 cagtgggacg gtgacaaaac cccgtatagc ttatctgaag cagccgcagc ctgtcacgga 1381 tgaactaata gcaaagtcta gccctgctac accagcagaa atttttcgga cagcaggacc 1441 ctgtgtagaa tcaggttgta tgcattttga tggaaaagat tgtcgtttgg cacagcgaat 1501 tgtagagaat ttatctgcag tagctgagga acttccaccc tgttctatcc gccgaaattg 1561 tcgttggtgg gaacaggaag gcaagacagc ttgtatgcgt tgtccgcaag ttgttacaga 1621 tagctacaat ccatctcaac ttatgcgaga ggtggcgaca ccaactgccc tttaagcaaa 1681 acttccatgt gtagttaagt catttgacac tcgctaagcc ctatgtcttt cggacatagg 1741 gcttagagtt gacgggtgct gatcaccaac tcctcaacgt actgcaacca cgttaacaat 1801 ttccggacaa tacgacttaa ttgcctgttc aattccttga gataaagtta gagttgaagt 1861 cggacaatta ccacaagctc caattaattt gacttcaact gtatctggtg gtttaatatc 1921 cactaattcc acatcaccat tatgactttt taaacctgga cggacttcat caagcgcttg 1981 ctgaacacgt tgcgttagcg gtggtttagg aggctttaca agttcgtgat aaagcagaac 2041 tccatacaca acttcgtctt taacagcact acgtaaagct ggcatagatt cttgcttcac 2101 agttttaatt aaacgtgtca atgcctcttt gtgtaatgct tcaatggctc gttttagacc 2161 tacaacgaca cccctgtaag tttcatccca ctcagagata attgcctcat aacggctgat 2221 ttctttgact aattcttcaa gtgtgtcatc ggtcattgtt caattgccaa ataaattctt 2281 tcggatttta ctcaggactc aaaatgacta aaaactcatt ttgctttgaa gtcttttagg 2341 agaaatggca tcattgtacc tgtgactaag ctagaaatta caactgatag cagcaagggt 2401 agtttcacga atgccagtaa cactagtaac agtaaaccta tgctagtacc aattgcagtt 2461 tgttgtacaa aatggacggg atcacgtatt aattttcctt gcaaaaataa cttagcagaa 
2521 aaccacccaa gctcaaacat aactcctcct agttaattga taagaacctt cagtacaact 2581 agccctaaga tgactactgg aatgacacca gcaggagcta atagcccacc aagcgtggcg 2641 gaaaactgat acccaaattc cttaccagct aacattattc cgccaatcgc accaccaaca 2701 atagagaata aaattgccac tactagcaat atcaatccgg tgatgagatt aatatcacct 2761 gctattaagc ttgtgaataa gtctgtcatt ttgtgttctc ctaaatagca aaaacttaaa 2821 aagttttcta taaataatta ttgcctcatt cccttcttcc ctgtctgata ctaggaatac 2881 tctttagaag ggctgctgcc tcccttactc gcggcagagt cgcttggatc gcatttcctg 2941 cttagacctg gaaacgagat tttaacctgg aaacgagatt ttaatgagat tttaacgaga 3001 ttataaaagg gttttggctt caattaacac cagtcaacaa tgctgtgtgc ctctacagtc 3061 aatctgttta tattcaattg aaaactgctg tccaaacatt tttttgattt tattttggtt 3121 gcaatactcc tatagtaaac ccaaaaaagt ctatcaattg ctagtttata tttttttata 3181 aatactctat ctcaagtgag agttaatcaa tgtactgttt tcatgttatt gttgatacta 3241 ttgaggcatt taaaataact taatcaatca tttactgatg cagatgaatg acaataagat 3301 taagaaaaaa acatgaattg actaacttct cattgcttgg gcgaaaattt ttatatgaaa 3361 atttttactt ttttatgatt attacctgtg tgatttattt cacattttga ggaatcaaat 3421 aactggtttt ttttgtcata ttttcatcat attttctaat tttgtagagc gtcaaagaaa 3481 aaaatagctc ataattaaag ctaaatgtga agacacagga acaggtaaga cctccacagg 3541 gttttgctaa gtcttgcctg tttcagttac tggcgaacat aagatgattt tatttcaaca 3601 gaattagggg ggttagagta tatacaaagc ctattttatt ggaaaagatt tttagcaatt 3661 gttctctgtt gaaactttac caaactcatc agagttacac attctcaaca ataagaagac 3721 atagcaaatg actaacgtac tatggttgca aggtggtgct tgttcaggta acaccatatc 3781 ttttctgaat gccgaagaac ctaccgtctg tgatttaatt gctgattttg gcattaaaat 3841 tctttggcat ccatctttag gactggaact aggtgaaaac ttacaagcac ttttgtggga 3901 ttgtatttcc ggcaaaattc ctttggatat cctggtattt gaaggtacag tagttaacgg 3961 tcccaaaggg actggtaact ggaatcggtt tgctgaacgt cccatgaaag attggttact 4021 tgacctctca aaagttgcca agtttgttgt agcagtggga gattgtgcga catggggagg 4081 aatccccgcc attgcaccca atcccagtga gtcacaggga ttgcaatttc tcaaacggca 4141 aaagggtggt tttttaggaa cagagtattt atcaaaagca gggcttcctg tgatcaatat 4201 
tcctggatgt cccgcccatc ctgattggat tagtcaaata ttagtggcga tgcacataag 4261 ggagccacca aaggtgatcg caacaggacg tcttagtgat ataacccttg acgaattcca 4321 tcgcccgcaa acatttttca aaagttttac acaaacaggc tgtacccgca acatccactt 4381 tgcctataaa gcatccgttg ctgaatttgg tcagcgcaaa ggatgtttat tctacgactt 4441 aggttgtcgc ggtccaatga cccattcttc ctgcaaccgt atcttgtgga atcgcgtttc 4501 ctcaaaaacc cgtgctggaa tgccttgcat aggctgtact gaacctgaat tccccttcta 4561 tgacctcgaa ccaggaacag tatttaagac gcaaaccctc atgggtgctc ccaaggacat 4621 accgccagga atgaacagac aagactatgc cttgctcacc cttgtcgcaa agaatatggt 4681 acctacttgg gctgaagaag atatctttac ggtttagtcc ttagtcatta gtcattagca 4741 aaggacaaat gacaaagaac aaaggataaa aaacaaatgg gaattcaaac attagacatc 4801 tccccagtcg ggagagtaga aggcgattta gatgtccgat gactcatgtc tcgtttgcac 4861 tgttcacgcc catgatgcga agacgggtaa gaaattagcg cgtttccgta cggcgtaagt 4921 gtcgctttag tgaaaatgaa taacaagatt ctccaaatga gacgactcac ccgaaaaccc 4981 ataagactgt cattagtcat tagtcctttg tcttttgtga tttagtgact catgactaat 5041 aacaatcctt acaatacggt ctgcaagttt atcggactga taaaatcttc agatactcag 5101 tcgccaccca gccctcagaa gcaacccatt ggttagattt aaccttaatc caacgaaatc 5161 cgtcattatc tttgccttca ccgtacacgg agactcttga tccattggga attttaccaa 5221 gttcacgagc actcgtgtct ggttgttccc gaacatttag accactcgaa gcattgatga 5281 cgactacttc tgcaacagaa ccagatgttt ctggttggat atcgacatgg gttttaaaaa 5341 cataccaagt gtcgtattga ggacatccag aatttccaag tttatcaaac agagtcactc 5401 tatagtggtc tttgtctgcc tctactcttt ctttaactct aaaagacttg ccgctgctga 5461 cattatactt ctcacaaagt cccagatcac ttgagtcctt aagactctgt ttgaaattcg 5521 ttccttgtga gtatgtgact ttcaataggt atgccattga ggtatctcct tgttagattt 5581 tcaacaccaa cttaaatgtt agtgcctttt acagaatacc cacactgaga gtatgaaaag 5641 tgcttcccct atttagatgt cttcaaaggt acaatagttc cagcgtcagt gccgagaaat 5701 ctccacactc atcatagttg acaagcagtt gatgcatctc ggaggttttc actaaccaat 5761 tagctgccat agtaaaggaa atgccaacac tcactttgcc tggacaactg atagaaactc 5821 cgtcaggcaa aaagaaagtt ttgtgtcctt ccccaatcca gtgtaactga gtcacgactg 5881 gctcagaaat 
ctttaaatca ggagttatag tgacagcagt tccttgccaa ttaccgctga 5941 aatctcgctc tggcaattga ttgatttcat ttgaccagta ttgactagga aagccagtag 6001 catcttcgcg aatatttgcc gttctgaaca aacgaccact ttcatcataa acgataccta 6061 cactatgtct taattcctga atcttaaaaa atagttcaac ggcaaaatat gaatctgttt 6121 tcaacttagt tgaaacccaa gcagcatgac ctgattcaaa gaaaatcccc ctcatggatt 6181 cattttgtgg atgaaaaagt ccattagata agctattcga gagttgatta tattcccaac 6241 tttgttcgac tcttctgccg tcactgtatg cgtagtgatt cttttgaact atttcggtct 6301 cctttgaatg actctgaaag cttctgagac tttgaaacga ttctgttacc tcaccttcgg 6361 gagaatatct tgtccaaatt ccatgccagt cacgcaaatg gttagtcgtg aagtttttcc 6421 aattttgatc ttttaaattc ataaaatata agttttttgt gtttatggta gtgtaacact 6481 caattgttta gctattagcc gttagctcta aaatgctcac aatcattggt tgcggtaatc 6541 tcaatcgtag tgacgacgct gtaggcgtac tgattgccca acgcttacag caataccttg 6601 cccaatatcc tcttcctcat gtgcgagttt atgactgtgg tactgcaggc atggaagtga 6661 tgtttcaggc aagaggtagc aaaaagttaa ttatcattga tgcaagttca acggattctg 6721 aaccaggggc tatatttaaa gttcctggaa aagaacttga agctttgcca gaacctagtt 6781 ataatttgca tgattttcgc tgggatcatg ctttagccgc aggtagaaaa atctttaaag 6841 aggattttcc actagatgtg actgtttact taattgaagc agaaaatctt gattttggaa 6901 ttgacttaag tcccgttgtt caacattctg ctgacttagt ttttgaagaa ataactgcaa 6961 ttgttagaca gagtaaaaag taattactag gactaggatt ttccgaacgt ttcggaacct 7021 tttcatacat tgatgtgaga agattagact aaaagtccag attttttgaa gtccttcact 7081 agattgagaa tttgggaggg ggaattgact gatgtcgtag ttggcttaga tctaggtaca 7141 ggaggagtgc gggcgatcgc tgttgaccta caagggcaaa ttatcgcaaa agaaaccaga 7201 acctatcccc tgttaatccc acagcccggt tggacggaac aaaacccatc ggattgggtc 7261 gaagcgagtc tggatgccct gttcgatgtt gctcaacaac tagatggaca ccaagtaatt 7321 gctttgggtt tgtctggaca aatgcatggt atggttcctc tggatgcaga gggcagagtc 7381 attagaccag caattttgtg gaatgaccaa cgcactggta aagccgttga tgcgattgaa 7441 gccattattc cccgtcaaga attgattcag cgtactggaa atcctgcaat tactgggttt 7501 cagcttccga agcttgtgtg gttgcggact gaagaaccgc aagcatatgc tcgactttgg 7561 cagattcttt taccaaaaga 
ttatatagga tatgtgctaa ctggcgagtt agtaacagag 7621 ccgtctgatg cgtctggtgt tgggtgtttg aatttggcga ctcggcaatg ggatacggat 7681 attctcaatg ctcttaatat caacccagca ttgtttcccc cggtagtcga gtctacggcg 7741 atcgccggac gactgaaatc agaaatcgcc gcccgcgtgg gactacctgc tggattacct 7801 gtggttgcag gcggaggcga caatgcagca gcggcgatcg gtctgggcat ctcatcaagc 7861 aacctgaacc ggggcagtct gagtattggc acatcgggtg tgatctttgc accctgcgag 7921 cgcccaattc ccgatccaga aggtcgggtg catttattct gtcatgtgga cggtggctat 7981 catcagctgg gagtgacgct ggcggctggt ggttctctgc gttggtatcg agatacgttt 8041 gcaccgcaga tcacctacac tgagctgatg gacatggcag agcgatcgct tcctggtgct 8101 cgtggtgttc tatttatgcc ccatctttca ggagagcgta gtccccatct cgatccagat 8161 actcgcggtg cttgggtgaa tctgtcatta gctcatacgc aggcagatat tattcgtgct 8221 gttcttgaag gcgtggcatt tagcttgcgg gcagcattgg aagttatcag cgaaattact 8281 cccgttcatc aacttttggc aacaggtgga ggtgcacgat ccaacatctg gttacaaatt 8341 ttagcagatg ttttgcaaac agaactaatt actcccaaaa cagaagaagg agccgcttac 8401 ggagcagcga ttctggcaat ggtgggggtt ggtgcatacc ccaacttaga agctgcattt 8461 aagattttgc cgcaggatgc gagtgtggta cagccgcatg taaatgctgt gtacgaagca 8521 ggttttaagc gatacacgtt attgtacgat gccctgaaag ccgttcgttg atatgaagtg 8581 gatcatcctt atttagctag tagggtgcgt tagtataaat caatcaggtt ctggttatgc 8641 aaacagaggt gttgtatccg tgtaaaacct cacagaagta accaaaccct cttcattgcg 8701 gtcggtaaaa gcgactgctc taacggttac aggtttacca tcattgcgtt tgtaatgatt 8761 taaggcttca aggacaaaag acgaatcact gccataaata ttcaacaagt cgtgttccaa 8821 actggcaaag gtcttccagt aagcagcaag cccctgcatg ataacctgtt tgccttccat 8881 tggagggttg ttgttcgatt gtacggaaca atcatccgca agaaatctac tatatgcctc 8941 aatgtgtaag ctatccaatg cttctaagta tttcagatac cactcgtaag tttcgggtga 9001 aagctgattg actctgagat tatcagtttt catcaatcac tcctcttgat tacttaaaga 9061 tttgatcacg ttatctcact atgcaccaat tttggcaatt gctcaccttt agagtttttc 9121 agcctaccag aatcttcgtc gtgatttcta acacaacttc attgatctca atgttgctcg 9181 ccaagcgcgt taagcgtagc tctgccgtag gcaatcgcct aagtttggtt gtcgtcgtag 9241 acatcgctgc atcacctcac aactaaattt 
gtgacaaaat agttttgttg tcctacagcc 9301 aatgcaatcg gctacgatct atggtttcta ccattaagcg ccgctacact ttggacgaat 9361 atcgcgcgct cgaagaaaaa atggaaggac gcagcgaata tcgagatgga gaaatcgtac 9421 ccatgcccgg aggaacgctt aaatacagcc gcatcagtgg taatattttt gcctttctca 9481 agtttctgct gcgcgatact caattcgagc caattaacag cgatttgcgg ctatggattc 9541 ctgaacatcg acgcggagta tatccagacg tgatgatttt tgagggcgaa ccacaactaa 9601 acgatgagcg cttagatgaa gttttgaatc ccattttgat tgttgaagta ctatctcctt 9661 ccaccgcaga ttacgaccga caaaacaaat ttcggatata tcgatcaatt cctagtttta 9721 gggaatattt attagtcgaa caggatgaac cctttgtcga acgctatagc aagcaaactc 9781 aaggttggtt gctcactgaa tttaacggct tggagcgatc tatttcccta gagtcagttg 9841 ggatggaatt gccaattgcg gaaatttatc gcggcgttgt ttttgagtaa aataactctg 9901 ggtgaaagaa cggacaatat caacatcgga attacgacgc acaagaccac gaacaatctg 9961 attgttaaag ttctcgtctg ccaagaatcg caacatctta gtttatcttt atgggttgcg 10021 tctcgccaat aagcgatcgc gtatcccaac agggttaaaa cgtctttgta aatctatgct 10081 atggtttggc gatcgctttc tctaagaatc cctgctggaa gcgcgttcaa acagtgaaca 10141 gttaacagtg aacaagttac atagggggat ttcacacgcc acatctctca agtcggcaga 10201 gccgcccacg agagtggctc cccaactgaa accagcaacg cagtagttgt tggggtttta 10261 gatccaacgg gacacgacac tgataactga taactcaata cagttcagtt aaggataaag 10321 taggttgggg agccaccgcc gtggacgggt tccccggctt gaggcgagtg gcgtttgacc 10381 taaggaaacc caacacagaa tgtttattgt tgggttgcgc tacgcttaaa cgccagacgc 10441 caagtgaggg aaagcccttc gggtatgcct acggcacgcc tgacggctaa cgccagatgc 10501 ctggctgtcg ggaaacaagg accgcagcac tggtctcacc gtcattcgcg ctggctcccc 10561 aacctacgtg gtataaggtt tttagctcta actgaaccgt attggatgat cgagccgaaa 10621 tttgatcaat gagaaataat ttttgctcaa acatgtggaa atttttattt ttgaattttg 10681 aattttgaat tttgaattgt tattgccctc cagtaacctc caagttaaac ccagtaatat 10741 agcttgcttc gtcactcatc agaaacgcta ccccattcgc cacctcttca agacttccca 10801 aacgccgcat tggcactgaa tcgatcatct gttgctcaac caccttggga ttggcatcaa 10861 agtactgcga tcctactgct gcctgcaatt ctgtttgccg cgtccacata taaccgggac 10921 ctatcagcgc aggagagagt 
gcattcaccc ggatgccgta gggagccaaa tcttttgctg 10981 ctgtttgagt cattccaatg accgcaaact ttgaagcggc atatgccagc atatttggcg 11041 gaccaactac ccctgcataa cttgccatgt tgacgatcgc gcctccaccg ccgttcgccc 11101 ttggcgtctc ccaaggggag aagcgcaagt gctgggcagc cgccttgaga atgtggaaaa 11161 cgccaacaac gttaatgtcg atgacctttt gaaagtcgtc ctcgggatat tcgtccgttt 11221 tggcaaacac tccttgatag ccagcgttgt taaagacata gtcgatacgc ccaagttgct 11281 ctacagcact agtaaaagct ttggcaacat catcacaaac cgtaacatca caacgaaacg 11341 tgccgactga aacattgtac ttttttagtt cctcagccac atctgccatc ttcggttcat 11401 tcaaatccag cagaactaca tctgcgccat caccggcaaa acggtgtgca gttgccttgc 11461 caatatctcc cgcaccgcct gtaatcagga tggtcttgcc tgcaaagcga tcttgtgctt 11521 tcgccgtcat agttttaact tcttattgat aaaaattgca aatgcccgtc aatttctcag 11581 ttttaagaac agacttgtat aatttctatt attgagcttt tgcttaaagt tagagcataa 11641 aatccaaaat tgagtttgat tttttgacac aaactttctg gatctgtcat ctggttcaac 11701 tgcgtgagtt gtaatacaag tgcggtttcg gtattgcggt agcttgctga aaatttctga 11761 aaaataagtg agtttttatt aaaaaatatt acataaatta aattttttat tacactattt 11821 tgtca // LOCUS NODE_2869_length_11818_cov_4.909632 11818 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11818) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11818) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11818 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 114..1076 /locus_tag="DP116_22535" CDS 114..1076 /locus_tag="DP116_22535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126474.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrogenase" /protein_id="PRJNA477356:DP116_22535" /translation="MTNLLWLQGGACSGNTMSFLNAEEPTACDLVTDFGINVLWHPSL GLELGTNLQTMLWDCLLGKIPLDILVFEGTVINAPNGTGEWNRFADRPMKDWLNDLSQ VANYIVAVGDCATWGGIPAMAPNPSESIGLQFLKRKEGGFLGKEFRTKSGLPVINIPG CPAHPDWITQILVAIATNRIGDIALDDLHRPQTFFNTYTQTGCTRNIHFAYKASTTEF GQRKGCLFYDLGCRGPMTRSSCNRILWNRVSSKTRAGMPCLGCTEPEFPFYDLKPGTV FKTQTVMGVPKDLPTGVNKKDYALLSIVAKDTMPAWAEDDFFTV" gene 1200..2795 /locus_tag="DP116_22540" CDS 1200..2795 /locus_tag="DP116_22540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126473.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome-c3 hydrogenase" /protein_id="PRJNA477356:DP116_22540" /translation="MGIQTLDISPVGRVEGDLDVRVEIEDGQVVNAWTHAELFRGFEV ILRGKDPQAGLIVTPRVCGICGASHLTSAAWALDTAWETEVPRNAILARNLGQIVETI QSIPRYFYGLFAIDLTNKKYQYSPYYSEACRRFAAFTGKSYELGITISGKPVEIYALF GGQWPHSSYMVPGGVMCAPTLTDVTRAWSILEYFRTNWLEPVWLGCSLERYEQIQTYE DFMVWLDESPSHANSDLGFYWRMGLDIGLDKYGVGVGRYVTWGYLPHEAKYQKPTIQG RNAAVIMKSGVYDSFTDTHTLMDQTFVRENTTYSWYEELTSDIHPFDRTTKPSQNNVK DFNGQYSWSTAVRHKDLGRLEAGPLARQLVAGGKHGESWQHYDGFILDAFKQMGGASI HLRQLARVHEIVKLYRQAERCLREFRLNDTWYIKPKEKDGRGWGATEASRGSLCHWLE IEGGKIKNYQIMAPSTWNIGPRDAEGIRGPIEEALVGTPIFDSSDPVEVGHVARSFDS CLVCTVHAHDAKTGKELARFRTA" gene complement(3010..3810) /locus_tag="DP116_22545" CDS complement(3010..3810) /locus_tag="DP116_22545" /inference="COORDINATES: protein motif:HMM:PF10127.7" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nucleotidyltransferase" /protein_id="PRJNA477356:DP116_22545" /translation="MQSQQLQTIHDTLLSTLDFRPIFITVSGAHLYGFESYNSDLDLR GCHYPVGLEFLRYNKSSDTIDRSFEDRNIFREETDLVSHSFFKYLYLLVRKANGYVLE QLYSPLVIQTSPLHQELKTLGKECICKELKYHYGGFFKNQTQLLTKEKKQVKLVLYQA RIIASSVYAAQNGQIEANLVKANQATGIFDEAKLNELIEIKRQGEKNNFPDSRQFNYW QEVISNKWELIDQSFEKSDLPSFDSQKVSTMANNLINRNFQFLPISRY" gene 4593..5297 /locus_tag="DP116_22550" CDS 4593..5297 /locus_tag="DP116_22550" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22550" /translation="MNLIHFWKSYFALTSNFLVSIVPATQLITLLGLLLGVHQTASAA TLTFEQASVGTLSTYTESGFTTSAVSGPWAVSDSYGKPAPFIQFRKEAGLNPLTATIQ ITNDDSKFTFGSVDLYSSVTPIPYVITGSLNSTAIFSFEGTVPNTFGNFKTVVNPNSN YLIDSLIISLTNPAVTCCSNPMGLDNITVTPISTTSVPEPNSSLFSLLGLPVVTWLSR RNLTPGSTIRRDKKLS" gene complement(5460..5597) /locus_tag="DP116_22555" CDS complement(5460..5597) /locus_tag="DP116_22555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873662.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22555" /translation="MSCELCERQVQPLTVHHLIPRQKKGDHGSKINICSACHKLVRSI L" gene complement(5707..6744) /locus_tag="DP116_22560" CDS complement(5707..6744) /locus_tag="DP116_22560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="metal ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_22560" /translation="MSKKLRLTQSLRAALVALTIGLFGCHNLGSIGNTSAHGTLNSGE QNANVPRVVATTSILCDLTKQIAEETIHLTCLIPPDTNPYFYQPKPEDQEAIQEAKLI LFSGYNLEPNLLKLMKASKSSASKIAVAQRAVPQPLNFEGEDTTVSDPYVWHNAKNGI RMVDVISNNLSKVVPENASLYSKNAVKVKNELTKLDDWIKSRISSVPAQQRELITTHD AMGYYAKAYGLSYESALEAISDTEKPSATRVQALATYIKKSKIPTLFTETTTKNSNWI NSVTQNTKAKVSQRKLFVNNLGAPGSEGDTYQKMMVANTRTIVEGLGGTYLIFEPTLS SSQQKSDGGNQ" gene complement(6987..7214) /locus_tag="DP116_22565" /pseudo CDS complement(6987..7214) /locus_tag="DP116_22565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492450.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(7345..8244) /locus_tag="DP116_22570" CDS complement(7345..8244) /locus_tag="DP116_22570" /inference="COORDINATES: protein motif:HMM:PF01844.21,HMM:PF08388.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22570" /translation="MILHEELSVIKKAQEVIANWLKGMGLELKPTKTKISHTLCEYNG NVGFDFLGFNVRQYKVSKYSSGKTMKGYKTLIKPSKDKVKRHLRQVGEVIDKNKGTPQ EALIGKLNPIIRGWCNYYSTVISTKIFASVNHFTYEKLRAWAKRRHPNENAHQISNKY WLIGQGGGWTFAARKEEKTYKLLKHNEVPIIRHIKVEGYRTPYDGDWSYWATRTGKHP QLPNKVAKLLNQQKGKCQYCGLYFTPESLMEVHHLDENHKNQKWNNLALVHRHCHDQI HGIQNHDIGNQVLETDFLVENLF" gene complement(8311..10140) /locus_tag="DP116_22575" CDS complement(8311..10140) /locus_tag="DP116_22575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006545877.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="maturase" /protein_id="PRJNA477356:DP116_22575" /translation="MRNAETVLGIIHERGKRGLPLEDVYRQLFNPDLFLKAYGKIYRN KGAMTPGATDETVDGMSKAKIDTIIHDLRYERYRWMPVRRIHIEKKNSLKKRPLGIPR WSDKLLQEVVRLILEAYYEPQFSPTSHGFRSGRGCHTALSEIYSKWIGTKWFVEGDIA QCFDSLNHQILLDILKEKIKDNRFLRLIENLLAAGYLEEWRYNATLSGSPQGAILSPI LANIYLDKLDKFVENTLIPKYNYGQGRQPNPEWQRLQRQAQRLKRKGLLVEAHIARRL MQQVPSLDPLDPNYRRLRYIRYADDWLIGFSGPHQEAEDIKREIGTFVREHLKLELSE TKTLISHARTEAARFLGYDIVVLNNNQKLDRRGHRSINGQIGLRVPLDVVKSKCTRFL LHGKPIHRAELVHDSVFSIVAHFQQEFRGIVEYYRLAYNLHQLNRLKWVMERSLVQTL AHKLRVSVRTIYRRYQTTLQTHNGSYIGLQVTVERGEGQKPLIANWGGISLSRNMKAV LNDSPLRIVGSRTELERRLLADTCELCGSHEDVQVHHIRALKDLHKKGRTPPPYWVEI MASRQRKTLVVCRQCHMEIHAGQVTQRTTIDMETLESRVLRKA" gene complement(10338..10568) /locus_tag="DP116_22580" CDS complement(10338..10568) /locus_tag="DP116_22580" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22580" /translation="MAKAKGDSSMLIKQKKLEDAYNFVLQDPRDMVRAKLPKPQFPVM EAYIPRLTASHGGVTVIGWLSRRHLNIPHSER" gene complement(10629..11457) /locus_tag="DP116_22585" /pseudo CDS complement(10629..11457) /locus_tag="DP116_22585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015146100.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 3083 a 2583 c 2476 g 3676 t ORIGIN 1 caatagaata aaaaaataca tgatatgaaa aataaagaat gaaaaacagt tgagacgcga 61 caaataacgt cccctgaatc atccagggca aacaatagaa aaaaagatag ctgatgacta 121 acttactatg gctgcaaggt ggtgcatgtt caggcaacac catgtcattt cttaacgctg 181 aagaacccac agcttgtgat ttagttactg actttggcat aaacgtcctt tggcatccat 241 ctttaggact ggaattaggt acaaacttgc agacaatgct gtgggactgt cttttgggca 301 agattccctt ggatatcctg gtttttgaag gcacagttat taacgcaccc aacggtacag 361 gagaatggaa ccgttttgcc gatcgcccca tgaaagactg gttaaacgat ctatcccaag 421 ttgctaacta tatagtcgca gtgggagact gtgcaacttg gggaggaatt cccgcaatgg 481 cacctaatcc tagtgaatcg attggattgc aatttctcaa acgcaaagaa gggggctttt 541 taggaaaaga attccggaca aaatcgggat tacccgtgat caacattcct ggatgtcccg 601 cccatccgga ttggatcaca cagatattag tggcgatcgc caccaataga attggtgata 661 ttgcccttga cgacttacat cgtccacaaa ccttcttcaa cacctatacc caaacaggct 721 gcacccgcaa catccacttt gcctacaaag catcaaccac cgaatttgga caacgtaaag 781 gatgcttgtt ttacgactta ggttgtcgcg gaccaatgac ccgttcctcc tgcaaccgca 841 tcttgtggaa ccgcgtctcc tccaaaaccc gcgccgggat gccttgttta ggttgcactg 901 aaccggaatt tcccttctac gacctcaaac caggaaccgt atttaagaca caaaccgtca 961 tgggagttcc caaagactta cccacaggag tcaacaaaaa agactatgcc ctactctcca 1021 tcgtggcaaa agacacaatg cccgcttggg cagaagatga cttttttaca gtttagtcat 1081 tagtcattag tcattagtca ttggtcatta gtcattggtc attggtcatt agtcattggt 1141 cattagtcat tagcattcat tcaaagcaca aagtacaaat gacacaggac aaaaaacaaa 1201 tgggaattca aacattagac atctccccag tcgggagagt agaaggcgat ttagatgtcc 1261 gagtcgaaat agaagatgga caagtcgtca acgcttggac acatgccgaa ctctttcgag 1321 gatttgaagt catcctacgc ggaaaagacc cccaagcagg attaattgtc acacctcgcg 1381 tgtgcggtat ttgtggcgct tcccacctca caagtgcagc ttgggcatta gacaccgctt 1441 gggaaacaga agttccgcgt aatgcaattt tggcaagaaa cttaggtcaa attgtcgaaa 1501 caatccaaag catacctcgt tacttttatg gtctatttgc aatagattta accaataaaa 1561 agtatcagta tagcccttat tattcagaag cttgcagacg 
ctttgccgct ttcacaggca 1621 agtcttacga acttggcatc acaatttctg gtaaacctgt cgaaatttat gccctctttg 1681 gcggtcaatg gccccacagc agttatatgg ttccaggtgg cgtgatgtgc gcccccacct 1741 taacagacgt gactcgcgcg tggtcaattc tagaatactt ccgcaccaat tggttagaac 1801 ctgtgtggtt aggttgttca ttagaacgct acgaacaaat tcagacttat gaagacttca 1861 tggtatggct agatgaaagt ccaagtcatg caaattctga cttaggtttt tattggcgca 1921 tgggcttaga tataggtcta gataaatatg gtgttggtgt aggtcgatat gtcacttggg 1981 gatatttacc tcatgaagca aaataccaga agccaacaat acaagggcga aatgcagccg 2041 tgatcatgaa gagtggagtc tacgacagtt tcaccgacac ccacactctt atggatcaga 2101 ctttcgtccg ggaaaataca acttactcct ggtatgagga actgacatca gacattcacc 2161 cctttgatcg cacaactaaa ccaagtcaaa ataatgtcaa agacttcaat gggcaatact 2221 cttggtcaac agcagtacgt cacaaagact taggacgttt ggaagcagga ccactcgcac 2281 gtcagttagt cgctggtggt aaacatggag agtcttggca gcattacgat gggtttatcc 2341 tagatgcatt taagcaaatg ggtggtgcca gtattcatct acgacagctt gcacgagttc 2401 acgaaattgt caagttatac agacaagcag aacgctgctt gcgcgagttc cgcctgaatg 2461 acacctggta tatcaaaccc aaagaaaaag atggacgcgg ttggggtgca acagaagcat 2521 cgcgcggttc cctgtgtcat tggttggaga ttgagggcgg taagattaag aattaccaaa 2581 ttatggcacc gagtacttgg aatatcggac cccgtgatgc agaaggaata cgcggaccga 2641 ttgaagaggc tttagtcggg acacctattt tcgactccag cgatccagtg gaagtgggac 2701 atgtggcgcg atcgtttgac tcttgtttgg tttgtactgt tcacgcccat gatgctaaga 2761 cgggtaagga gttggcgcgt tttcgtactg cttgacactc tcccaccgcg ttccctggag 2821 taacaggtat agccggggat tttaagtgaa atacacatgt actgtagggt gggcattgca 2881 atgcccaacg ccagatgcct acggagggaa accctcctgc agcactggct cccctactta 2941 attcttgact tttaagaagc gagggtattt tgaaacagga attcaacgtg ctttagctgt 3001 tgaaggatgt tagtaacgag atatcggtaa aaactgaaaa ttgcgattta ttagattgtt 3061 agccattgtg ctgacttttt ggctgtcaaa acttggtaga tctgatttct caaaactttg 3121 gtcgattaat tcccatttgt tgctaattac ctcctgccaa taattgaact gcctgctgtc 3181 tggaaagttg tttttctctc cctgtcgctt aatttcaatt agttcattta acttagcttc 3241 atcaaaaatg cctgtggctt ggttggcttt gactaaatta gcttcgattt 
gaccattttg 3301 agcagcatac acactggagg ctataattcg ggcttggtaa agaacaagtt tgacctgctt 3361 tttctcttta gttaataact gagtttggtt cttgaaaaaa ccgccataat gatattttaa 3421 ctctttgcag atacactcct ttcctagtgt cttcaattct tggtggaggg gcgaagtctg 3481 aataacaagg ggtgagtaaa gctgctctaa tacgtagccg ttggctttgc gaactagaag 3541 atagaggtac ttgaaaaaag agtgggaaac caaatcagtt tcttcgcgaa aaatgtttct 3601 gtcttcaaaa cttctatcaa tcgtgtcgct gcttttattg tatcgcaaga attctaagcc 3661 tactggatag tgacagcccc tcaaatccag gtcagaatta taagactcaa aaccatacag 3721 atgcgcaccc gaaaccgtaa tgaatatagg ccgaaaatct aaggtggata aaagtgtatc 3781 gtggattgtt tgtaattgtt gagattgcat aggggatttt atagtttctc ctctggtcaa 3841 aactgctacc tttgggtagg aaaatagaca tgacagaata cgcgcgagca atgtttagaa 3901 gtagaacacc gtacctataa cagtttgcag ttatcagaaa agttggctca acacagaact 3961 acagcatcaa agtgcagcgt tccggattct actcaagtat tcaaacagcc actcagcata 4021 aaaacgttgc atatttccct gaatttgtgt ataaattact gtctctgtta aagtaagata 4081 caccttttag gagattgttt ttgtactacc gccaagaaat gcgctcacag attaaggttt 4141 tcaagggttg acgtagacgc gcagcaatag ttggagtttg tagttgggtg aaaaattcag 4201 cccaactgat ttttcgtaat gacgttggac agcaagttac tacatctcat gaacttttgt 4261 aacaaaaaat gaggttttgc ataaaacact acgcttgaag ctgataagta aactgaagtt 4321 cacctatcac ttttgttaaa gaggtagtac catgcgtctt tagtggttgg caatcttaag 4381 cgtcataatt atgtttctgg ttctgacgat acaaacaagt aggtatcctc aaagcctggg 4441 ctaagtcaaa agcaaagaaa caaagttata tgatttttat tggcttttct ctgtaaaata 4501 cgggctttag ttttcatact gggtgactaa atagcatcaa tttttattgg tgcaagatgt 4561 gcgtgtattt attttcatct ttagagggaa aaatgaatct tattcatttc tggaaaagtt 4621 attttgcctt gacttctaat tttttggtat cgatagttcc tgctacacag ctgattactt 4681 tactcggcct gctgcttgga gtacatcaaa cggcatctgc tgcgacgctt acgtttgagc 4741 aggcatctgt agggacgcta tcaacctata ctgaatctgg ttttacaact tcggcagttt 4801 ctggtccctg ggctgtaagt gactcatacg gtaaaccggc tccctttata caatttagga 4861 aggaggctgg tctaaacccg ctgactgcta caattcaaat tacaaatgat gactctaaat 4921 ttacattcgg ctcggtcgat ctttactcca gtgtaactcc aattccttat gtaataacgg 
4981 gatcgttaaa ttcaacagcg atattttctt ttgaaggtac ggttccaaac acttttggta 5041 attttaaaac tgtagtcaac ccgaactcga attacttaat cgacagtctg atcatctctc 5101 taactaatcc ggcagtaaca tgttgctcta atcctatggg gcttgataat atcactgtga 5161 cacctatttc cactacttca gttcctgagc caaatagttc tctattttct ttgctgggac 5221 tccctgttgt cacttggctt agccgacgga atctgactcc tggctcaaca attcgtagag 5281 acaaaaaatt gtcttgaata tgttcgattt atgaaacata aaaagctgag tcgaaggtta 5341 tccgctcaga aaaagcattg atggttttcg gttacaacaa ttttgataac caaccattac 5401 tcccctacaa tcgccatagt ttacaagtta tttgttcaat aacttcttaa ttttgctttt 5461 caaagaatag agcgaacaag cttatggcag gcagaacaga tatttatctt cgatccatga 5521 tcaccctttt tctgtctagg aatcagatga tgaaccgtta acggctgcac ttgtctttca 5581 cacaactcgc aactcatgct gttgacctaa tgctactgat actatttgtc ttattacgta 5641 atacttaaga tggtaatggt gggttttttt cccattatcc attaccaatt actgaattct 5701 taacagttac tggttccccc catcactttt ttgttggcta cttgagagtg ttggttcaaa 5761 aatcaaatac gttcccccta atccttctac aattgtgcgc gtattggcaa ccatcatttt 5821 ttggtaagtg tctccttcgc ttcctggcgc acctaagtta ttaacaaata gcttcctttg 5881 tgacactttc gcttttgtat tttgagtcac agaattaatc caattagagt tctttgttgt 5941 tgtttctgta aaaagcgtag gaatcttaga ttttttaatg tatgtagcca gtgcttgtac 6001 tcgtgttgca ctaggttttt ccgtatcgct tatagcttct aaagcagatt cataagaaag 6061 accgtaagct ttggcataat aacccattgc atcatgagtt gtgattaatt cgcgttgttg 6121 agcaggaaca ctagatattc ttgacttaat ccaatcatct agttttgtga gttcattttt 6181 gacttttacc gcgtttttac tatacaatga tgcattctct ggtactactt tactcaggtt 6241 attgctaata acatctacca tcctgatacc atttttcgca ttgtgccaaa cataaggatc 6301 agatactgtt gtatcctctc cttcaaagtt tagcggctga ggaacagcac gttgagcaac 6361 tgcgattttt gatgcactac ttttacttgc cttcataagc ttaagtaaat taggttctaa 6421 attgtaccca ctgaaaagaa tcaatttagc ttcctggatg gcttcttgat cttctggttt 6481 tggttgataa aagtagggat ttgtatcagg gggaattaag caggtgagat gaattgtctc 6541 ctctgcaatt tgcttggtta agtcacacaa tatacttgtc gttgctacga ctctaggaac 6601 attagcattc tgttcaccag agttgagagt gccatgagca gaagtgtttc ctatactacc 6661 
taaattatga catccaaaca atccaatggt aagagcaacg agagcagcac gtaatgattg 6721 agttaatcgt aacttttttg acataagaaa ttatatgaaa acaaacacaa taataatggt 6781 gtatcttcct tttaggagtt gtttttgcca cctatattgt ggattattag gcaattatag 6841 aaaatttctc ataaacgaaa ccccctgttt cactactcct actaatttcc tcatgtcggg 6901 agaaacaagg aaaacttttt cgtggaggat gaaacaaggg gtatcgtagc tttactgttc 6961 tttgttcttt tacaggaagt ttagtgttag cagtctacta attctggtgt ttggttaaga 7021 atttgactct taatggaaat aaaatcttgg taagcagcag aatctaatct atttgggtga 7081 aatacaccaa cgctgtgaca attctgaaac gccaatttta taccaaaata acgctcaact 7141 gccccacgaa tagcatctcc acccatcatc cgcaataatt gagcacgttc tttttcactt 7201 ggtggtgcta cagcgctggc taaggacagg gagtcacctc cctgcctcgc ctacaaaacc 7261 gtgcttgaaa gtttcccttc acacggctcc tcaacaagtt ggtgcttgtc acgcatacct 7321 cttacccttc ttgggcttag gcggttagaa taggttttct actagaaaat ccgtttctaa 7381 cacttggttg ccgatatcgt ggttttgtat gccatgtatt tggtcatggc aatggcggtg 7441 tacaagtgcc aggttattcc atttctggtt tttgtgattt tcatccagat gatgaacttc 7501 cattagcgac tctggtgtaa agtatagacc gcagtattgg cattttcctt tttgctgatt 7561 aagtagtttt gctactttat ttggcagttg aggatgttta cctgttcttg ttgcccagta 7621 actccaatct ccgtcatagg gagttcgata gccttctact ttgatgtgcc ttattattgg 7681 cacttcgttg tgcttaagca acttgtaggt tttctcttcc tttctggcgg cgaatgtcca 7741 accccctcct tgcccaatta gccagtattt gttgctaatt tggtgtgcat tttcattggg 7801 gtgtcggcgt tttgcccaag ctcttagttt ttcataggta aagtggttga cggatgcaaa 7861 gattttcgtg cttattaccg tggagtagta attgcaccat cctctgatga ttggatttaa 7921 tttaccaatc aatgcctctt ggggggtacc cttatttttg tcaattactt ctcctacttg 7981 tcttaagtgc ctctttacct tgtctttgct tggtttaatg agagttttat aacctttcat 8041 agtcttccca gaactgtatt tgctcacctt atattggcga acgttaaatc ctagaaagtc 8101 aaatcccaca ttgccgttgt attcacaaag cgtgtgtgag atttttgtct tggttggttt 8161 caactctaaa cccatgcctt taagccagtt tgcaattacc tcttgagctt tcttgataac 8221 gctgagttct tcatgtaata tcacaaagta gagtaggcga ccagcgagtt actaaaagta 8281 cttttctgtc cgcccctccc caaaccgggc ttacgctttt cggagtaccc ggctttccag 8341 tgtttccatg 
tcaatggtag ttctttgggt tacctgtcct gcgtggattt ccatatgaca 8401 ttggcgacaa actactaaag ttttacgttg tcgggaagcc ataatttcaa cccagtaagg 8461 tggtggagtt cttcctttct tatgtagatc cttgagtgct cgaatgtgat gcacttgcac 8521 atcctcatgt gaaccgcaaa gttcacatgt gtctgccagc agtctacgtt caagttcagt 8581 ccttgagcct acaatacgta agggagaatc atttaggact gctttcatat ttctcgaaag 8641 cgaaattcct ccccagttgg ctattaatgg cttttgccct tctccacgtt caactgttac 8701 ttgcagtcca atatatgaac cgttgtgagt ttgaagtgta gtttgatagc gacggtatat 8761 tgtcctgacg ctaacacgca atttatgtgc gagcgtttga actaatgacc gttccattac 8821 ccatttcaaa cggttgagtt ggtgcaggtt ataagccaat cgatagtact ctacaattcc 8881 ccgaaattcc tgctgaaaat gagctacgat gctgaaaaca gaatcgtgaa ctaattctgc 8941 acggtgtatc ggtttaccgt ggagtaggaa gcgagtgcat ttgcttttca caacatctag 9001 tggtactctt aaaccaattt gcccattgat actgcggtgt ccccgcctat caagcttttg 9061 gttattgtta agcactacga tgtcataacc gagaaaacgg gcagcttcag tgcgtgcgtg 9121 ggaaattagc gttttggttt cggacaattc aagtttgaga tgttctcgaa caaatgtgcc 9181 gatttcgcgc ttaatatctt ccgcttcctg atgtggacca ctgaagccaa ttagccaatc 9241 atcagcgtag cgaatataac gcaggcggcg atagtttggg tcaagtgggt ctagggatgg 9301 aacctgttgc attagtctac gcgcaatatg agcctcgact aatagaccct ttcttttcaa 9361 acgttgagct tgcctttgta atcgttgcca ttctggattg ggttggcgtc cttgaccata 9421 gttatatttt ggaattaaag tattttcgac aaatttatcc aacttgtcta agtagatatt 9481 cgccagaatg gggcttagta ttgcgccttg tggacttcca cttaaggtcg cattgtatcg 9541 ccattcctcc aaatatcccg ccgcgagcaa gttttcaatt aaccttagaa aacggttatc 9601 cttaattttc tccttgagaa tatcgaggag aatttggtgg ttgagtgagt caaaacattg 9661 agctatgtca ccttccacaa accattttgt ccctatccac ttactgtaga tttcgcttag 9721 ggctgtgtga catccgcgtc ctgaacgaaa accgtgggac gttgggctaa attgcggttc 9781 gtaatacgcc tctaatatca gtcgtaccac ttcttgcaat agtttatctg accaacgagg 9841 aatgccgagc gggcgctttt tgagagaatt tttcttctca atgtgtatgc gtcgtacagg 9901 catccatcga tacctttcat aacgtaggtc atggattatg gtatcgattt ttgcttttga 9961 cattccgtct acggtttcgt ctgttgcacc gggtgtcatt gcacctttgt tgcggtaaat 10021 tttcccgtag gctttgagga 
acaaatcggg gttaaacaat tgtcgataaa catcttccaa 10081 tggcagtccc cgtttgccac gttcatgaat aattcccaac actgtttcgg cattgcgcat 10141 ctcgcgcacc tcccttgttg agttttacaa cacctgtcac cctttcccat gtaaacagct 10201 ttcctgttct ctgagtacta cggtgactcc gttaccctgt gcctctcggc acggtgggca 10261 atcccacttt ccctaggagc tagatgtatc gagcgcgact taggtgcctc ttccgtttcc 10321 ttcagtaagg tcattcctta ccgttcactg tgaggaatgt tcaggtggcg acgacttagc 10381 caccctatca cagtgacccc gccatgagac gctgttagtc gggggatgta cgcttccatc 10441 actggaaact gaggtttggg cagtttggct ctcaccatat cacgcgggtc ttgcaaaacg 10501 aaattataag cgtcttctaa cttcttctgc tttatcaaca tgctactgtc ccctttggct 10561 ttcgccatta ggtaagttgt tgacccagag gtcatctttt cctaacctct cctctctgta 10621 agaggagtct agcgcctggt taaatggcgc acgtcatctg cgtagcggat aagtcttggt 10681 tctagtcttg gatttttcac caattctttt acctgctctt ccaacccatg tagggcaata 10741 ttagctagta atggggatat tacacctccc tggggtgttc cctcattggt tggaaaaagc 10801 ccttttcctt gttcaaacac tccagctttt agccatccct taataacacg tctaattgtt 10861 ggagtagtcc ccaatttttt cagcagacct ttgtggtcga tgcggtcaaa acattttgct 10921 atatccgcat caagcacata tttgggtttt ctactgatgc tgaggtagat tgcttctata 10981 gcatcatggc atgagcgtcc tggtctaaag ccgtagctat ttggctcgaa tctcgcttcc 11041 cattctggtt cgagagctag cttaactagc gcctgcaaac agcggtcgtg tactgtgggt 11101 atacctaatg gtcttttttc ctgtgttcct ggtttcggaa tccagactct tctggtagga 11161 gaggttttct cctctagctt gaggttcttt accagtgcga gacgttcttt cggggtcagg 11221 cttttgcacc cgtccacacc agccgtcttc ttcccctggt tgtcttgtgt tacccgtctg 11281 accgcaaagc acaatgctct cccaggattt tatcagcaat ttttggagtt tatggactgt 11341 cttgacatca ccacgattcg aggcttggaa tatgcgcttt tgcaacttga acgttttctg 11401 ctctagcttg cgccaaggga tttcgttcca ttcatacgta gtcgttaact cgctcatgac 11461 ttattaattg ctacttgtac ctattccttc cagcttgccg tgcctccgtc tgcgtatcct 11521 tggcattacc caaggcattt gcttgtgagg caatcccacc cgtccatagc atccggttaa 11581 ctccttctca gaaaagctaa cttttctgag ggctatgggg ggttacttcg ttcctagctt 11641 tcgttttgac gctggcttta ggattctctc tgtccaccgg gtttaatttg ggtgcaacta 11701 
ggtcaatatc atacttgcct agctcttagc agtcaaatac acaattgctg cccttacatc 11761 ctttccgctt gtaaaaatct acgcatatct gggcaacaat cctcaaatct acgtagag // LOCUS NODE_2888_length_11737_cov_5.403612 11737 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11737) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11737) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11737 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 303..1550 /locus_tag="DP116_22590" CDS 303..1550 /locus_tag="DP116_22590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747202.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="competence/damage-inducible protein A" /protein_id="PRJNA477356:DP116_22590" /translation="MSAEIICVGTELLLGDILNSNAQYLAQQLAGLGIPHYYQTVVGD NPTRLKQVIEIASQRAEILIFTGGLGPTPDDLTCETIADFFGAPLVERPDIIEDITKK YVQRGRVMTASNRKQALIPQGAEVLPNPTGTAPGIIWQPRSGLTIFTFPGVPSEMYRM WEDTAVPFLKSQGWGKEIIYSRMLKFWGVAESALAEKVAPYLNLPNPTVAPYSSKGEV KLRVSAKAASQSQAQDLIAPIEKQIKDIAGLDYYGADNDTVASVVGELLRGAGETLSV AESCTGGGLGQMLTEISGSSDYFWGGIISYDNSVKVGLLGVNSQDLAKYGAVSSIVAE QMAAGVRSRLSTTWGLSITGIAGPTGGTETKPVGLVYVGLAGPNEQVQSFEYHFGAGR GRSLIRHLSACTALDNLRRKLLI" gene complement(1820..2818) /locus_tag="DP116_22595" CDS complement(1820..2818) /locus_tag="DP116_22595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459766.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="UDP-N-acetylenolpyruvoylglucosamine reductase" /protein_id="PRJNA477356:DP116_22595" /translation="MTISQAVANVCKVAGMTNSRLIKSNNSIESKEIYLPGTDCVIKS QVTLAAYTSYRVGGPAEWCVAPRNLEALQASIQFAKAHELPVTILGAGSNLLVSDCGI PGLVVVTRHLRHSHFDSETGQLTVAAGDPIPSLAWQIAERGWQGFEWAVGIPGTIGGA IVMNAGAHNSCIADILVNVQVLLPDGTLETLTREQLNYSYRTSILQGSDRIVTQATFQ LQPGANPTEVLAATKQHKEHRLNTQPYHLPSCGSVFRNPKPHAAGWLIEQTGLKGYQI GKAQVALRHANFIVNCGGASAWDIFNLIHHVQHQVQERWSIPLEPEVKMLGEFQAA" gene complement(2974..4437) /locus_tag="DP116_22600" CDS complement(2974..4437) /locus_tag="DP116_22600" /EC_number="6.3.2.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407251.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramate--L-alanine ligase" /protein_id="PRJNA477356:DP116_22600" /translation="MKDTVEFGGRPFHFIGIGGIGMSALAYVLAKRHLPVSGSDLRPN HITRRLESIGTHIFGKQEASNLEFFRPHSHPSEVTLNTQEELSAFKKAKLPQVVCSTA INTTNLEYKAALELGCPIFHRSDVLAALITQYNSIAVAGTHGKTTTSSMIGYMLLEAG LDPTILVGGEVNAWEGNARLGQSQYLVAEADESDGSLVKHSPEIGIITNIELDHPDHY DTLEEVVETFQTFAKGCKTLVGSIDCTTVRDRLQPAISYSLNPDSAADYTVTNVDYKA DGTTALVWERSKALGVLNLRLLSKHNLSNALAAVAVGRILGLEFAEIAKGLATFEGAR RRFEFRGEVNGITFIDDYAHHPSEIRATLAAARLQARPGQRVVAIFQPHRYSRTLTFL EEFAQSFSHADLVVLTDIYSAGEPNLGQVSGEKLATEVAKEHPQVIYQPTLASVCEYL QQTLHCGDLALFLGAGNINQVIPEVMATQCPPAKVTS" gene 5496..6512 /gene="gap" /locus_tag="DP116_22605" CDS 5496..6512 /gene="gap" /locus_tag="DP116_22605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876187.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I glyceraldehyde-3-phosphate dehydrogenase" /protein_id="PRJNA477356:DP116_22605" /translation="MIRVAINGFGRIGRNFARCWVGRQNSNIDLVGINDTSDPRTNAH LLKYDSMLGKLKDVDITADDNSIIVNGKTIKCVSDRNPENLPWKDWEIDLIIEATGVF TSKEGATKHLNAGAKKVLITAPGKNEDGTFVIGVNHHDYDHEKHNIISNASCTTNCLA PIAKVLHEKFGIIKGTMTTTHSYTGDQRLLDASHRDVRRARAAAINIVPTSTGAAKAV ALVLPDLKGKLNGVALRVPTPNVSMVDFVTQVERPTITEEVNQALKDASEGQLKGILD YSELPLVSSDYQGTDASSIVDANLTFVMGGDLVKVMAWYDNEWGYSQRVLDLAELVAQ KWTK" gene complement(6820..7869) /locus_tag="DP116_22610" CDS complement(6820..7869) /locus_tag="DP116_22610" /EC_number="2.7.4.16" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thiamine-phosphate kinase" /protein_id="PRJNA477356:DP116_22610" /translation="MNSELSSLQVQDIGEQGLLARLQRFCPPEIIGDDGAVLSTEPGQ SLVVTTDVLVDGVHFSEITTSGEDAGWRAVAANLSDIAAMGATPLGITVGLGLPGEVA VSWVEELYQGMTQCLQKYNTPIVGGDIVRSPTTTIAISAFGQVDPLRIIRRTKAKVGD MIVVTGVHGACAAGLQLLLHPELGQNLSDSDSEALLLSADRTVLIHAHQRPKPRLDVL PILWEILDSYTPHSVAGMDSSDGLADAVVQICRASGVGAVIERTQIPLPKQFNHWLTQ EQALEYALYGGEDFELVLCLPKEQAYAFVQQLGEGAAIVGKMTDGSTVILHDQTEEYP DQVLNLDRGFQHFSH" gene complement(7985..9187) /locus_tag="DP116_22615" CDS complement(7985..9187) /locus_tag="DP116_22615" /EC_number="2.6.1.83" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874631.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LL-diaminopimelate aminotransferase" /protein_id="PRJNA477356:DP116_22615" /translation="MQFAKRIQPLQSNVFADMDKAKTKALACGQELIDLSLGSSDLSP EAHVIEAIAQSLHNPSTHGYLLFHGTQVFRQAAANWYTERFGIPVNPETEVLPLIGSQ EGTAHLPLAVLNPGDFALLLDPGYPSHAGGVHLANGQIYPMPIRAENGFLPVFTDIPT PVLAKSRMMVLSYPHNPTAAIAPLSFFKEAVSFCQQHNIVLVHDFPYVDLVFEESNDS ELGTENWDRSLAPSILQADPDKSVSIEFFTLSKSYNMGGFRIGYAIGNAQLINALRQI KAAVDFNQYRGILNGAIAALTGPQTGVKTSVNTLRQRRDTFVSALHRIGWYVPTPKAT MYIWAKLPERWSQDSIGFCTQLVEKTGVAASPGAGFGKSGEGYVRFALVHDTPVLETA VERIAKFL" gene complement(9397..9726) /locus_tag="DP116_22620" CDS complement(9397..9726) /locus_tag="DP116_22620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_044521338.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thiol reductase thioredoxin" /protein_id="PRJNA477356:DP116_22620" /translation="MSSSVVTITDAEFETEVLRANKPVLIYFWASWCGPCQLMSPLVN SAATTYSDRLKVVKIEVDPNPVAVKQYQVEGVPAFRLLQGDKLLASAEGVISKDKLLS LLDSHLN" gene complement(9961..10734) /locus_tag="DP116_22625" CDS complement(9961..10734) /locus_tag="DP116_22625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874629.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PspA/IM30 family protein" /protein_id="PRJNA477356:DP116_22625" /translation="MGLFDRIKRVVSANLNDLVSKAEDPEKMLEQAVAEMQEDLVQLK QGVAQAIAAQKRTEKQYNDAQNEINKWQRNAQLALQKADENLARQALERKKTFTDTSN ALKASLDQQTTQVETLKRNLIQLESKISEAKTKKEMLRARITAAKAQEQLQGMVRGMN TSSAMAAFERMEEKVLTQEARAQSAAELAGADLENQFAALESSDVDDELAALKAQISI PGGSPNQPQLPQQTTPPKTNKNEVVDAELDSLRKQLDQM" gene complement(10901..11597) /locus_tag="DP116_22630" /pseudo CDS complement(10901..11597) /locus_tag="DP116_22630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195907.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="PspA/IM30 family protein" assembly_gap 11159..11168 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 3116 a 2578 c 2591 g 3442 t 10 others ORIGIN 1 gtctttacct acgcagcatt cctgattgcc tctactgctg gtaagtttgg ttgatccggt 61 tgcgagtcgt aggtcagtca tcaaaatccc ctgccgcaag gtgggggatt tggatatcac 121 gtttgctttt ttttgtcaat taatcttttg ctcgcccctc ccgcagcagg ttgtcgtcac 181 ctcaccccgc ccttacgggc acccctctcc ttaataagga gaggggcagg agtgagttat 241 tgaggcgcaa cttggtattc aatgatatat tttgacgacc aaacgaaaca attccctatt 301 ttatgagtgc agaaattatt tgtgttggaa ccgaactcct gttaggagat atccttaata 361 gtaatgctca atatttggca cagcagttag ctgggctagg tattccccac tactatcaaa 421 ccgtagtcgg agataaccca acaagattaa aacaggttat agaaattgca agtcaaagag 481 cggaaatttt gatttttact ggaggtcttg gtccgacacc agatgacttg acttgtgaaa 541 ccattgcgga tttttttggt gctcctttag tagaacgtcc agacattatt gaagatatta 601 ccaagaaata tgttcaacgc ggtcgggtta tgaccgccag caaccgcaag caagctttga 661 taccccaagg tgctgaagtt ttgccaaacc ctacaggaac agcacccggt attatttggc 721 agcctcgttc aggattaacg attttcacct ttcctggtgt gcccagtgaa atgtaccgga 781 tgtgggagga tactgctgta ccttttctca aaagtcaagg ttggggtaaa gaaattattt 841 acagtcggat gttaaagttt tggggtgtgg cggaatccgc actagcagaa aaggtcgcgc 901 cttatctcaa cttgcctaac ccaacagttg caccttattc cagtaaagga gaagtaaaat 961 tacgagtttc tgctaaagct gcttcccaat cacaagcaca ggatttgata gcccccattg 1021 agaaacagat taaagatatt gcggggttag attattatgg tgcagataat gacaccgttg 1081 cctcagtagt aggtgagttg ttgcgtggtg caggggaaac cctttctgtg gcagaatctt 1141 gcactggtgg cggattaggg caaatgttaa cggagatttc tggaagttct gattacttct 1201 ggggtgggat tatttcttat gataattccg taaaagtagg gcttctgggt gttaactcac 1261 aagatttggc aaagtatggg gctgtcagtt ctattgttgc agaacaaatg gctgcgggag 1321 tgcggtcgcg tctttcgacg acttggggat tgagtattac tggtattgct ggacctacag 1381 gtggtacgga gactaagcct gttggtttgg tttatgtcgg gttagctgga ccgaatgaac 1441 aagtgcaaag ttttgaatat cattttggtg cagggcgagg tcggtcttta attcgccacc 1501 
tgagtgcgtg tacggcattg gacaatttgc gaagaaagtt gttgatatga tgtccggcaa 1561 atcaccctta atatgtcatt gcgagtggaa cgaagtgaaa cgacgcaatc gcaaggtttt 1621 gggattgctt cccttcggtc gcaatgacgg ttctttgaac ggacatgata tgagtcatta 1681 aaaaaactgc ctctacacaa ataaagcccg cgcaagcggg ctaccaataa gaaatatttc 1741 gtgctcttgt aaataaactt ctccttgttt tatgaagaag gggagaaaaa gttatatttt 1801 tctcataaaa cttggttcat caagcagctt gaaactctcc tagcattttt acttctggtt 1861 ctaacggaat agaccaacgt tcttgtacct gatgttggac atggtgaatg agattgaaaa 1921 tatcccaagc actagcgcca ccacagttaa cgataaaatt agcatgccgt agtgcaactt 1981 gcgctttgcc aatttggtag cctttgagac ctgtttgttc aattaaccag cctgcggcat 2041 gaggttttgg attgcggaac acactaccac aactaggcaa gtgatagggt tgagtgttta 2101 gtcgatgctc tttgtgttgt ttggtagctg ctaaaacttc tgttgggttt gcccctggct 2161 ggagttgaaa agtggcttgg gtgacgatgc gatcgcttcc ttggaggata gaggtgcggt 2221 agctgtagtt caactgttca cgggtgagag tttctaaagt tccatctggc aaaagtactt 2281 gaacattaac taatatatct gcgatacaac tattgtgtgc acctgcattc atcactatgg 2341 cacctccaat agttccaggg ataccaacag cccactcaaa tccttgccat ccccgctctg 2401 caatctgcca tgctaagctt ggaattgggt ctcctgcagc tacagttaac tgacctgttt 2461 ctgagtcaaa atggctatga cgcaaatgac gagtcacaac aactaaacct ggtatcccac 2521 aatcactgac gagcaagtta gaacctgctc ctagtattgt tactggtaat tcatgtgctt 2581 tcgcaaactg aatactggct tgcagagctt ctaagtttcg aggggcaaca caccactcag 2641 caggtccacc gactctgtat gaagtatacg ctgcaagcgt tacttgagat ttaataacgc 2701 aatcagtacc aggtaaataa atttctttac tttcgatgga attattagat tttatgagcc 2761 tgctatttgt cataccagca actttgcaga catttgctac tgcctgggat attgtcatat 2821 cttttagaat gcgctgaatt tagatgtttg tatgccgtaa aaagacgaca ccaatgaaaa 2881 gccatgtaac tattgctaaa cggacactat gcgaataagg gaaaccaact tgtacggaaa 2941 gttgtactcc ggttttggga taaaaaactt tccttaagag gttaccttag ccggtggaca 3001 ttgtgtcgcc atgacttcgg gaatcacttg attaatattc ccagctccca gaaacagcgc 3061 caagtctcca caatgcagag tttgctgtaa gtactcgcag actgaggcta aagttggttg 3121 atatattacc tgcgggtgct cttttgcaac ctcagttgca agtttttcac cactaacttg 3181 ccccaagttt 
ggctcgcctg cactgtaaat gtcagtcagg acaactaaat cagcatgact 3241 aaacgattga gcaaattcct ctaaaaaggt aagcgtgcgg ctatagcgat ggggttggaa 3301 gatagcaaca actctctgcc ctggtcttgc ctgtaagcgt gcagcagcaa gagtagcacg 3361 aatttcacta gggtgatggg cgtagtcatc aataaaagtg atgccattaa cttcaccccg 3421 gaattcaaaa cgccgtcttg cgccttcaaa ggtcgcaaga cccttagcta tttctgcaaa 3481 ttctaagccc aagatccgac caaccgccac tgcggctaga gcattactga ggttgtgctt 3541 actgagtaag cgcaagttca acacacccaa agccttgctc ctttcccata caagggctgt 3601 ggtgccatcg gctttatagt ccacgttggt gacagtgtag tcagcagcag aatcaggatt 3661 taggctatag ctaattgctg gttgtaagcg atcgcgtaca gtcgtgcaat caatgctgcc 3721 taccaaagtt ttacagcctt ttgcaaatgt ctgaaaagtt tccaccactt cttctaatgt 3781 gtcgtagtgg tcgggatggt ctaactctat gttagtgata atacctattt ctgggctatg 3841 tttcaccaaa gaaccatctg attcatctgc ttctgctacc aaatactgac tttgccctag 3901 tcgggcattg ccttcccaag cattaacttc tccacctact aaaatcgttg gatcaagacc 3961 tgcttcaagt agcatgtaac caatcatgct acttgttgtc gtttttccgt gagttcctgc 4021 aacagcaata ctattgtatt gagtaattaa tgctgccagt acatcagacc gatggaaaat 4081 agggcaacct aattctaatg ctgctttgta ctctaaatta gttgtgttaa tggctgttga 4141 acaaacaact tgaggtagtt ttgctttttt gaaagcagat agttcttctt gagtgtttaa 4201 ggttacttca ctagggtgag aatgaggacg aaaaaattcg aggttgcttg cctcttgttt 4261 accaaaaata tgagtaccga tagattctaa acgtcgagta atatgatttg gacgaagatc 4321 tgagccagat actggcagat gacgctttgc tagcacatat gctaaagcag acatacctat 4381 gccgccaata ccaatgaaat gaaatggtct accgccaaac tctactgtat ctttcattga 4441 tttatcctct atcaccacac cacacaatca tcctaaatga cacgcgtatc ataacaggaa 4501 ttctattttt actgatactg cctctgtacc atttttataa ccgataagtt atttgtttca 4561 ctattgtttt tcccattcca atcgatttta cctttgttag tcattttttc tctttcttaa 4621 cagatattat gattattctg aaagttcaag ctaaatcggg aaactctcaa tcgtcattgt 4681 aaatatatgt tttttgctgc accgctaaaa tttgctataa tcataagttg gtaaataaag 4741 tttttgcaaa ttgaagaaag cctaagcgta agtgaataga gcaatgagca attattgcag 4801 gcacttttga tccacttcca taaggaagac atggttcgag ccttgacagc tttggacgaa 4861 gttcttttag aacgagtcgt 
ataagtgcct tgataaaagc cttatcgtaa agatatattg 4921 cagcgaacga ctacagacaa cagatggttg actgtatcac tcgtttttga aaactttttc 4981 tgggatttgt gatgctatga acaccctgat tgccttgtct gctgattacc ccaatactca 5041 ctgatactag attattggtg aagtgaacat ttcaaacctt acgtcattgg taccgttgag 5101 aaaaattagg acaattgtgt gattggttaa ttgcaccttg actgccaagc tgtgagaaca 5161 tagcttaagg ttagttaatg tgcaacactg tgcagcaaaa aactgtaaag caaaattcta 5221 gcacagacta gtagttactg catcaggctg tagtaatatt ttcatcaaat ctgattcaca 5281 aacttttcta tgatagccaa tcaattccct atttagttcc accaaggagt tggggcttat 5341 attgataacc acaacctgta tacaaacagg taggaagaaa atatttattt ttttataatg 5401 taaagcttta tagttataaa tactcccccc tttgcggtat gatcggggtc atcagtagct 5461 tttgttaagt ataagtaaac agagggcaag acgctgtgat tagagttgca atcaacggtt 5521 tcgggcgcat cgggcgtaac tttgcacgtt gctgggtagg tagacaaaat agtaatatcg 5581 accttgtcgg cattaacgac acatccgacc ctagaaccaa tgctcacctg ctcaaatatg 5641 actcgatgct agggaaatta aaggatgttg acatcactgc cgatgataac tcaatcatcg 5701 ttaacggtaa aaccattaag tgtgtctccg accgcaaccc agaaaacttg ccctggaaag 5761 actgggaaat tgacctaatt atcgaagcaa caggggtttt cacctccaaa gaaggggcaa 5821 caaaacatct taacgctggt gcaaagaaag ttctaatcac cgcccctgga aagaacgaag 5881 atgggacttt tgtgattggc gtaaatcatc acgattacga tcacgaaaaa cataacatca 5941 tcagtaacgc tagctgtacc accaactgct tggctcccat tgctaaggtg ttgcacgaga 6001 aattcggtat catcaaaggc acgatgacga ccactcacag ttatactggt gaccaacgct 6061 tgctagacgc ctctcaccgt gatgttaggc gggcacgtgc tgcagccatc aacattgtgc 6121 cgacctccac aggtgcggca aaagctgtag cattggtact gccagacctc aaaggtaagc 6181 tgaatggcgt ggcgttgcgc gtacccaccc ctaacgtttc gatggtggac tttgtgaccc 6241 aggttgaaag acctaccatt accgaagaag tcaatcaagc cctcaaagat gcctccgaag 6301 gtcaacttaa aggcattttg gattacagcg aactccccct ggtatcatcc gattatcaag 6361 gtactgatgc ctcttcaatt gttgatgcta acttgacgtt tgtcatgggc ggcgatttag 6421 tgaaagtcat ggcttggtat gacaacgagt ggggttacag ccagcgagtc cttgacttgg 6481 cagagttagt agcccaaaaa tggactaagt agttcattca ctatggcggt tctcatttga 6541 atcacataca ctactacgaa taatttagag 
acgtagcccg caggcgacta gcaaatccac 6601 agggctacgt ctctacatgt ctgtattgca tgcgacacca gaagcgctat ataaaacaat 6661 accgttcagt gaaggattga tcctccctag ccctccttaa aaaggaggga actaagcaag 6721 ttttttaact ctgttggggg attttacaaa actaaatatt actaactgaa ccgtattgct 6781 atataagaca gtcttttacc aatgaccaat gaccaatgac taatgactaa aatgttgaaa 6841 cccccgatca agattgagaa cttggtcggg gtattcttct gtttggtcat gcaaaatgac 6901 tgttgatcca tctgtcattt ttcccacaat agccgcgccc tcaccaagct gttgtacaaa 6961 agcatatgct tgctcttttg gcaagcacag tactagctca aagtcttcgc caccgtataa 7021 ggcatattcc agggcttgtt cttgtgtcag ccaatgattg aattgttttg gtaaaggaat 7081 ttgagtgcgt tcaatgacag caccaacacc actggcgcgg caaatttgca ccacagcatc 7141 tgctaagccg tcgctactgt ccataccagc tacagagtgt ggggtataag agtctagaat 7201 ttcccataga attggtaaga catctagtcg gggtttgggt ctttggtgtg cgtgaattaa 7261 tactgtgcga tctgctgaaa gcagcagcgc ttcgctatcg ctatcactca ggttttgccc 7321 taactcagga tgcagcagca attgtaagcc agcagcacat gccccgtgaa cacctgtgac 7381 aactatcata tctcccactt tcgcttttgt acggcgaata atacgtaagg ggtcaacttg 7441 accaaaagcg gaaatggcta tcgtggttgt gggcgatcgc acgatatcac caccaacaat 7501 tggggtattg tacttttgca agcattgtgt cattccctgg tataattcct ccacccaact 7561 gacagcaacc tcgccaggga gtcctagtcc gacagtaatt cctaagggag ttgcacccat 7621 tgctgctata tctgataagt tagcagcaac ggcgcgccaa ccagcatctt caccagaagt 7681 cgtgatttca ctgaaatgga caccatcaac caacacatca gttgtgacta ccaaagattg 7741 tcctggttca gtagaaagca cagccccatc atctccgata atttctggag gacaaaagcg 7801 ctgcaatctt gctaaaagac cttgctctcc aatatcttga acttgcaaag aggataattc 7861 actgttcaca ttgacactcc cactgggctt tagccgtatg cccaaggcgc acgcaagcgt 7921 aagcgcaaag cgcagcgtgc tgtaagtatt gggctttctg cttttattac tttaagaaat 7981 tttcttataa aaactttgca attctctcga cagcagtttc caatactggt gtatcatgca 8041 ctaaggcaaa gcggacatat ccttctccag atttgccaaa gcctgcacct ggtgaggctg 8101 ctacgcctgt tttttctacg agctgagtac aaaatccgat cgaatcttga ctccaccgtt 8161 ctggtaactt tgcccaaata tacattgttg ccttgggcgt gggaacatac caaccaatac 8221 ggtgtaaagc gctgacaaag gtatctcggc gttggcgcaa 
ggtgttaaca gaggttttga 8281 ctccagtctg tggacctgtc agggcagcaa tagcaccgtt caaaattccc cgatactgat 8341 taaaatcaac tgctgctttt atctggcgta aagcattaat taactgggca ttaccgatgg 8401 cgtagccaat acggaagccg cccatgttgt atgacttgga gagggtgaaa aactcaattg 8461 agacgctttt gtctggatca gcttgtaaaa tggagggagc aagggatctg tcccaattct 8521 cggttcctag ttctgagtcg ttactttcct cgaagaccaa atcaacgtag gggaaatcgt 8581 ggactaggac gatattgtgt tgttgacaaa aactaactgc ttccttaaag aaagatagtg 8641 gtgcgatcgc cgcagttgga ttatgaggat agcttaagac catcatccgc gactttgcta 8701 atacaggcgt cggaatatct gtaaacacag gtagaaaccc gttttctgcc cgtattggca 8761 ttgggtaaat ttgaccgttc gccaagtgaa ctccccctgc atgggaggga taacccggat 8821 caagcaataa agcgaaatct cctgggttga ggactgctag aggcaaatgt gctgtgcctt 8881 cttgggagcc aatgaggggc agtacctctg tttctggatt gacgggaata ccaaaccttt 8941 cagtatacca gtttgcggct gcttggcgaa aaacttgagt cccgtgaaat agtaagtagc 9001 cgtgggtact tgggttgtgt aaagattggg cgatcgcctc aatgacgtgc gcctcaggtg 9061 atagatctga agaccccagt gacaaatcta ttaactcttg tccacaagcc aaagcttttg 9121 tcttggcttt gtccatatca gcaaatacat ttgattgcag gggttgtata cgtttagcaa 9181 actgcatttt tcgtcctttg tcctaagtca attgtcttta gtccttagtc aatagcaaag 9241 aactaaagac taaggactaa cgacccttcg ggtatgcgca aagcgcacgc caagggcgaa 9301 cgccagtcac ctacggtggg aaaacgctag gtgctacaac ggggagaacc cccgcaacgc 9361 actggctccc ctcctgcagt gctggctcac caatgactaa tttagatgag aatctaagag 9421 gctgagtaat ttgtctttgc tgatgactcc ttcagcggat gccaaaagtt tgtctccttg 9481 aagaagtcta aaggctggta caccttcaac ctggtactgc ttgactgcaa ctgggttagg 9541 gtcaacttcg atttttacca ctttcaggcg atcgctgtag gtggtagctg cggagtttac 9601 cagtggcgac atcaattgac aaggtccaca ccaggaagcc caaaaataaa ttaatacagg 9661 cttattggct ctcaacactt cggtttcaaa ctcagcatca gttatggtta caacactgct 9721 gctcattgca ctctccagca atacccatta atcataactt gcccacagac tactctatcc 9781 caatctgagg acattaacat actctgttat ttatccctgg tcaaccttgt ggtcttgagc 9841 caatgtatca aaaaccttga taaactcttt gtcaaaaaca caaaagatta tcaaggtttc 9901 tcttgaaagt agtataaatc cgcttctggt gaacccaaat gaaagcgctt 
gtcgtaaaaa 9961 ttacatttga tccaattgct tgcgtagcga atccaactca gcatcaacca cttcattctt 10021 atttgttttt ggaggagtgg tttgttgcgg tagttggggt tgatttggtg aacctccggg 10081 tattgaaatt tgcgctttca aagctgccaa ttcatcatca acatcgctac tttccaaagc 10141 tgcaaattgg ttttctagat ctgcacctgc caactcggct gctgactggg cacgggcttc 10201 ttgtgtcaaa actttttctt ccatgcgttc aaaagcagcc atagcactgc tggtattcat 10261 accacgcacc atgccttgaa gttgctcttg agcttttgct gccgtaatcc ttgctctgag 10321 catttctttc ttggtctttg cctcagaaat tttgctttcc agttggatta agttgcgctt 10381 gagagtctca acctgagtag tttgttgatc tagactggct ttgagtgcat ttgatgtgtc 10441 agtaaaagtt tttttacgtt ccagtgcttg ccttgctagg ttttcatctg ctttttgcag 10501 tgccagttgg gcattgcgct gccacttatt gatttcgttc tgagcatcat tgtactgttt 10561 ctcagtccgt ttttgggctg ctattgcctg agcaacgccc tgttttagct gtaccaagtc 10621 ttcctgcatt tctgcaacgg cttgttccag cattttttcg gggtcttcgg ctttactgac 10681 caggtcgttg aggttagcac tgacgactcg tttaatgcga tcaaataatc ccataacttt 10741 gtttttcctc ttgtgattta cgcgcctcgt tgggtagtta gcttttttac tatttcatag 10801 ctacttattt caatgtaatc gtttcggttt ggattggcgc ttctgagagt gttcttattt 10861 tttaagataa actccaagtt ggatcattgc ccaatcttca tcaatctgga gttttaggta 10921 gctgttgtct tttttgtgat tcagtggaaa gctgcttttg gagtgatgct agttctgcat 10981 caacatcacc agcctgttct aaagaagcaa attgctgagc caaattatcc gtactcgttg 11041 tggaaagagc ttgtgtttga gcttctagtc gcaaaatttt ttcttccata cgctcaaagg 11101 ctttgatgct atctaagttg gaagttgtgc ccaacatttc ttgaattttg tatgacgcnn 11161 nnnnnnnngg cgcgagcaat gtacatatct ttttttgttt ttgcttcggc aatttttagc 11221 tctagggtac gcatatcttt tttaattcta gccacaacag tactttgttc gtctatctga 11281 gtggcaagag cttcgcccgt ttgttgatat gctcgccgtt tgataagggc ttcccgtgct 11341 aaagattcgt taccttgttg aagtgctagt tgagcgcggc gataccattc ttctgctgta 11401 gattgagcat tagccatttg tcgttctgtg cgcttttggg tggcgatcgc ttgcgctatc 11461 ccttgtcgca attggatcct attctcctcc atctgttgga cagtttcttc cagaactttt 11521 tctggatctt ctgcactcac agttaaacta ttgagattcg cgcgaatcac ccgcatgata 11581 cggtcaatta gtcccatgtc ctaccctcga 
tgaactttta aataggggtg taagggttaa 11641 cagtcgtcag ttatcagtta tcagtgagcc agcgcgaatg atggctttcc ctcacttggc 11701 gactggcgtt cgcccttggc gtgcgctttg cgcatac // LOCUS NODE_2903_length_11673_cov_5.29833011673 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11673) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11673) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11673 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 747..1826 /locus_tag="DP116_22635" CDS 747..1826 /locus_tag="DP116_22635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459515.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M4 family protein" /protein_id="PRJNA477356:DP116_22635" /translation="MARKNKKSVMSECEHLLDTRCPICCIVPPHMLEHVAVNGNPQQR EWAFRTLNVSAQFRGRRNVVGAVNFAVSPGQKRRTIFDAKNSEQLPGVLVRGEGDPPS QDTAVNEAYDAAGATYDLFKEIFERNSVDDKGLRLDSTVHYNVKYDNAFWNGDQMVYG DGDGEIFQCFTKSIDVIGHELTHGVTQHEAGLIYFGESGALNESFSDVFGSLVKQRVK NQTADQADWIIGEGLLTPKTKGVGIRSMKAPGTAYDDPVLGKDPQPAHMKNKYTGTDD NGGVHINSSIPNYAFYLAAVEIGGYAWEKAGKIWYITLRDRLKSKANFKQAANITIKV AGELYDQGSKEQKAVQNAWQKVGVL" gene 2001..2288 /locus_tag="DP116_22640" CDS 2001..2288 /locus_tag="DP116_22640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22640" /translation="MRISFERTGGFAGISRKKTVDTANIPANEADQLPRLVEAADFFN LPEKITASTTQPDRFQYKLTVEEEGREHTVTVSEAALPGTLRPLIEWLQQK" gene complement(2360..3628) /locus_tag="DP116_22645" CDS complement(2360..3628) /locus_tag="DP116_22645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747716.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome P450" /protein_id="PRJNA477356:DP116_22645" /translation="MKHLSDYNFFDPELLVCPYEFYKLAQEQAPIMELPSPKTDAKLF LVTRYDLVIEILKNTKVFSSNFSTLLAGKEEQDPELQKISAQGWPQMNTLLTADPPEH ERFRSLVNKAFSSSRVNKMHDLIQQIVDELIDSFIDKSKCEFVSEFAVPLPLKVIAQQ LGVPQADLPKFKQWSDSFIARLSHMLFKEQEIECAKDVLAFQHYFHDVIESRQKQPQD DLITDLVQAEVAFERSLDTAELLSIIQQILVAGNETVTSAIAGGMLLLTKNPQQMKLL QTDLSQVENFVEEVLRMETPTAGMWRVVTQDTKLEDVDLKAGSLVMIRFDAANRDPIK FPEGESFDVRRHNASNHLSFGHGIHYCLGAMLARKEMQIAYERLLLRIKNIRLAQGDY QYLPNILMRGLKHLYIEFDKVAVEQSETQH" gene 3837..4847 /gene="trpS" /locus_tag="DP116_22650" CDS 3837..4847 /gene="trpS" /locus_tag="DP116_22650" /EC_number="6.1.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tryptophan--tRNA ligase" /protein_id="PRJNA477356:DP116_22650" /translation="MGRQRVLSGVQPTGNLHLGNYLGAIRNWVEIQSQYENFLFIADL HAITVPHEPATLAANTYTLATLYLACGLDLNHSTIFVQSHVSAHSELTWLFNCITPLN WLQDMIQFKEKAVKQGENVNVGLLDYPVLMASDILLYQPDKVPVGEDQKQHLELTRDI VNRFNHFFAKPNQPVLKLPEPLIRKEGARVMSLTDGTKKMSKSDPSELSRINLLDSPD EITKKIKRCKTDPIRGLEFDNPERPESNNLLTLYMLLSGKTKEEVAAECQDMGWGQFK PLLTDTTIEALKPIQEKYQTVTNDKGYLESVLRHGKHKAEAIANQTLNEVKAAMGYSI PL" gene 4910..5830 /locus_tag="DP116_22655" CDS 4910..5830 /locus_tag="DP116_22655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459374.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methylenetetrahydrofolate reductase" /protein_id="PRJNA477356:DP116_22655" /translation="MLDTINPTALTSFRTAAKTGQFLITAEVTPPKGGNPEHMITMAA TLKGRVHAVNITDGSRAVLRMCSLVASAILLQNGIEPICQIACRDRNRIAIQADLMGA HALGIYNILALTGDPVKAGDHADAKGVFDLEAVRLLQLIHKLNQGLDCNEKPLTDGAL DLFPGAAVDPQCASWSGLQSRFERKLAAGAQFFQSQLITDFDRLEKFMDKIAAGCDKP ILAGIFLLKSAKNAQFINRCVPGVNIPDHIIDRLAQAKDPLEEGMKIAAEQVKIARQL CQGVHMMAVKREDLIVPILDMAGVASISKL" gene 5976..7427 /locus_tag="DP116_22660" CDS 5976..7427 /locus_tag="DP116_22660" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22660" /translation="MLPKDKDLILQEKVKLYNNIAKINNCEGRLLAELEIYPTPRIIW EFEILGNVQCNFPSLSLDSNTKNPLIGHCFSIEQPICTGDSSNIVGPLRAIRGATAQA VYGDMEDTAHKFIFYLPNTRFQYNSVRQEILIKILKEVGSDTEVSWENGGRYVQSAID NIWSIRLDIWTDALNWLNPQNRNTGSLITTVGTLYQRKYKPTEPETLSELQTITLSNA LEQLKHLCLFLSYANGGYIGPLFIEGYEYTQNRSHPIQTSCAVALTFQTTPLEQLCYS WVTVDSDLKVYMECFPAFKGIMQNPTWRQTYDFTLTQYFQATRPGMRWQVVATAVGAA LERLSYTILVEEETNATTKAACELLFDIKQSQTAKQCWNLGKSSGQENISVTGKRLRL LLERIGLTKSKGYDDIDDVPSFLEVRNDAVHPRVGSMTIEHRWKFIKQAIQWIDEVLL WRLGYSSKYLDRTQEWESSTLPRYDLSLRASSW" gene complement(7508..7843) /locus_tag="DP116_22665" CDS complement(7508..7843) /locus_tag="DP116_22665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742293.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_22665" /translation="MDTLDSYRQITEKILSEYAAVPYAYGEIQTEVVFDRKNDRYLLV NVGWDGDRRVHGCIIHIDIINNKLWIQRDGTEHGIAKDLTEAGIPENHIVLGFREPEL RQYTGYAVA" gene complement(7831..8247) /locus_tag="DP116_22670" CDS complement(7831..8247) /locus_tag="DP116_22670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652823.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fatty-acid synthase" /protein_id="PRJNA477356:DP116_22670" /translation="MPSRDIYHNTVKNALLKDGWTITHDPLKLELGKKDLYVDLGAEQ LIAAEKEKSKIAVEIKSFVGRSDIDDLEKALGQYILYNDILSEKEPERVLYLAIRNGV YIDLFEEPIGKLLLSKGRVKLIVFDPSMEVILKWIP" gene complement(8494..8640) /locus_tag="DP116_22675" /pseudo CDS complement(8494..8640) /locus_tag="DP116_22675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317951.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="MBL fold metallo-hydrolase" gene complement(8667..9116) /locus_tag="DP116_22680" CDS complement(8667..9116) /locus_tag="DP116_22680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318036.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" /protein_id="PRJNA477356:DP116_22680" /translation="MQTTDNSVFIDTNILVYANLALSPFHLQATERLQAIEEQGIELW ISRQTLREYLAAMTRKGDLTGEISVSSLVEDVRYFSNRFRVAEDNFHVTERLLTLMEE ILSGGKQVHDANIVATMLVYGIPQLLTHNTGDFTRFSELITVLPLQE" gene complement(9103..9339) /locus_tag="DP116_22685" CDS complement(9103..9339) /locus_tag="DP116_22685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22685" /translation="MKAITTIATVTEDGKITVQLPPDIPAGEHKLVVVIDEKPLVEKQ IIKEKRPPLKFSAYPVGLVSESLTFRREDLYAND" gene complement(9417..9605) /locus_tag="DP116_22690" CDS complement(9417..9605) /locus_tag="DP116_22690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22690" /translation="MKVIETIATVTEDGKMTVQLPPDIPAGEHKVVVIIAKQPLPKKP ETKEKHPPLNFPVDNYSS" gene complement(9620..10834) /locus_tag="DP116_22695" CDS complement(9620..10834) /locus_tag="DP116_22695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875693.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22695" /translation="MLPTELLSHRQNGEEIIPKRLKIDNNHLAIATELTSCFQKAVGE TQGTLERQLSELEGDTPDYRVKRGLAYILKSSFCTFEVVSPLEPQMLRERVFALAAKS PPSRELTETTLTKIADELSHELEREVLPKQVGEGLYADLSENKILIAFDAPAPHDLLH RYNLSQVQGVFYRASQLVINAHRNVPGQYKLLFRYLKLFQLMSYIEGDADHGFTITID GPTSLFTPSTRYGLAIAKLIPALLHVTKWSLSTILQIRDPYTNTWKTGRFTLNSECGL VSHYSTGKPYDSMLEESFADKWESTKTDWVLEREVDLIPIPGSVMIPDFRLVHPDGRS LLLEIVGYWRPEYLQKKFSQVRRAGRDDLILAISERLNLEKAGVELNDVPARIVWFKD KLLPKAVLAVME" gene complement(10881..>11673) /locus_tag="DP116_22700" CDS complement(10881..>11673) /locus_tag="DP116_22700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318677.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="ATP-dependent helicase" /protein_id="PRJNA477356:DP116_22700" /translation="YNQLIQVRNDFLKQSKISLGSLQGWQTFVQMSARSQAGRRAMLA HRQAKEIALGTDGKIRVLADILTEHYPERVLIFTADNATVYHISQNFLVPAITHQTPV KERHEILTKFREGEFKTLVASHVLNEGVDVPAASVAIILSGTGSTREYIQRLGRVLRK GKDNKKQAILYEVVAEDTSEEGTSARRRGMEKKNVSPPETLRERAASPEDIPPKRQER QEKEEKKGNLRVVYGSGKEKSLKAAEQLEINYKVQKKPPINTDES" BASE COUNT 3468 a 2479 c 2374 g 3352 t ORIGIN 1 gaacagggaa cagggaatag ggaacaggga acagggaaca gggaacaggg aacagagaac 61 agagaacgct taactcttaa gaaggaataa aagtttacgt acaaattttt ttcaaaaatc 121 aaataggagt cttatatcta aaatactcaa tacatgaatg aattgtatat gaaataacat 181 tttttcagta taaaagcata attaatgact ttggaaatat ttttttttaa ataattatgt 241 gtgggaatag ttaaaggtta tgagtatgac ataaaaaaca aaatggtcac gagggattta 301 accccctgac catgtatcaa ccaagtttta caacacacaa ccacacacaa tttagtgggg 361 tcttttatta tgttaaacaa tcttatgatt ttggtctaga tttttactag cgacatctcc 421 tatatacaat tttttatttt tgagaatatt tacatcaaaa agaactcagt cttaaataaa 481 ccattattat taagtggaag catcagtgga aattctcata aaatagctag ttaccggaaa 541 agcgtttgcc aggaatacta attttgacac tggtattcgc gcttatgtta cccgaatgag 601 taagccaaca ctgaaaaatg ctcttgtaga ctgtttgcga tgactttgct 
tcatctttcg 661 tttggaatga tgactctgct cgttgttgta tagactaaga gatactgttt ggcattactt 721 atctgaaaaa ggagaagatc ttaaagatgg ctcgaaaaaa caaaaaatct gttatgtctg 781 aatgtgagca tctacttgac actagatgcc ccatctgttg tattgttcca cctcatatgc 841 tggaacacgt tgcagtgaat gggaacccac agcagcgtga gtgggcgttt cgcacattaa 901 atgtttcggc acaatttcgt ggacggcgga acgtagtagg tgctgtgaat tttgcggttt 961 ctcctggtca gaagcgtcgt actatctttg atgcgaaaaa tagtgaacaa cttcctggcg 1021 tactcgtacg tggagaaggc gatcccccaa gtcaagatac ggcggtcaat gaagcctacg 1081 atgcagctgg tgcaacttat gacttgttca aagaaatttt tgagcgcaat tctgttgatg 1141 ataagggact gcgtttagat tctaccgtgc attacaacgt taaatatgac aacgcctttt 1201 ggaatggcga ccaaatggtt tacggtgatg gtgacggaga aatatttcaa tgcttcacca 1261 aatctataga tgtgattggg catgagttaa ctcatggggt gactcagcat gaagctggtc 1321 tgatctattt tggagaatct ggagcactca atgagtcatt ttctgatgtt ttcggctctc 1381 tggtcaaaca gcgagtcaaa aatcagaccg cagatcaggc agactggatc attggggaag 1441 gtctattgac gcccaagaca aaaggcgttg gcattcgctc aatgaaagcg cctggaacag 1501 cgtatgatga tccagtactg ggcaaagacc cccaaccagc gcatatgaaa aacaaataca 1561 ctggtacgga tgataatggc ggagtacata tcaactcaag tattcccaat tacgcctttt 1621 atctagcagc ggtggaaatt ggcggctatg cctgggaaaa ggctgggaaa atctggtaca 1681 ttactttacg cgatcgccta aaatctaaag ccaattttaa acaggctgct aatatcacta 1741 tcaaagttgc tggagaactt tacgatcaag gaagtaaaga acaaaaagca gtgcagaatg 1801 cttggcaaaa ggtaggagtt ctctaagagg atgaattatg aatcatcaaa gttttaacag 1861 ttgcttgata actgttgact gttgtaatga atgataactc atcattgagt gttatctttc 1921 ttgataaact gggaaatatg ttatgacctt cccagttttt attcattctt gcaaaatata 1981 gcaagacgga gagcaagaaa atgcggatat catttgaacg cacgggtggt tttgcaggga 2041 taagtagaaa aaaaactgta gatacagcaa acattccggc aaatgaagct gatcaactac 2101 cgcgattagt agaagccgca gatttcttca atctacccga aaagatcaca gcttctacga 2161 cacaacctga ccgctttcag tacaagctaa cggtagaaga ggaaggaaga gaacacacag 2221 tcacggtgag tgaagctgcg ctaccaggaa cgctaagacc cttgattgaa tggcttcaac 2281 aaaaatagaa ttaagaaaaa atgtttttgt ttttcattct gtagacagat gaaaaaatag 2341 
attgattaga aaactcattt taatgttggg tttcactttg ttcaactgct accttgtcga 2401 actcaatata gaggtgcttg agtccccgca tgagtatgtt cggcaagtac tggtaatcac 2461 cttgtgccaa gcggatgttc ttaatccgta gcagcaaccg ctcataggcg atctgcattt 2521 ccttgcgagc aagcatagcg cccaaacaat aatgaatgcc atgaccaaaa gataggtggt 2581 tgctggcatt atggcggcgc acgtcgaaac tctcaccctc tggaaatttt atggggtcgc 2641 ggttagcagc atcgaagcgg atcatgacca aagacccagc tttgagatca acgtcttcta 2701 gttttgtatc ttgcgtcaca actcgccaca ttcccgctgt cggagtttcc atccgtagaa 2761 cttcttcaac aaagttctct acctgggaaa gatctgtttg tagcaacttc atctgctggg 2821 gattctttgt cagcaacaac atcccacccg cgatagcact ggtgacagtc tcattgcctg 2881 ccactagtat ttgctgaatt atactcagca gttcagctgt gtcgagcgat cgctcaaaag 2941 ccacctccgc ctgtaccaag tccgtgatca aatcatcctg aggctgtttt tggcgtgact 3001 ctatgacatc atgaaaataa tgctggaaag caagcacatc tttagcacac tcgatctcct 3061 gctctttaaa aagcatatgg cttaggcgcg caataaaaga atcagaccat tgcttgaact 3121 tcggcagatc tgcctgcggc acacctaact gctgagcaat aactttgagc ggtaagggta 3181 cggcaaactc actaacgaac tcacacttgc tcttatcaat aaaactgtca atgagttcat 3241 caacaatctg ctgaatcaag tcatgcattt tgttgacacg cgacgagctg aatgccttat 3301 tcaccagtga tcggaatcgc tcgtgttctg gcggatctgc cgtgagcaag gtgttcatct 3361 gaggccagcc ttgagcagag attttctgca attccggatc ctgttcttct ttcccagcca 3421 gtaaagtaga gaaattgcta gagaatactt ttgtattttt aagaatttct atcaccaagt 3481 catagcgggt aacgaggaag agtttggcgt cagttttggg actaggtagc tccataattg 3541 gtgcttgctc ctgtgctagc ttgtagaatt cataagggca cacaagcagt tcagggtcaa 3601 agaagttgta gtccgacaga tgcttcatgc ttctaatagt ggtcagtgat ggtttgcttt 3661 gagtcactgt acagtcaaca tcatcttaac aagttaataa cgcaacatat taaacatcca 3721 caacatgcgc tgcttatatt tgcgtgaaat ataaagaaat gtcaaaaatt ttctgcaaca 3781 atggaagtca gttgtaaagc cgcattgtcg gatttcaaaa cattatattg gagtttatgg 3841 gtaggcaacg agttctttcc ggagttcaac caactggcaa tttacattta ggcaactact 3901 tgggtgcaat tcgcaattgg gtagaaatcc aaagccagta tgaaaatttt ctcttcatag 3961 ctgatttaca cgcgattact gtgccccacg aaccagcaac gctggcagct aatacctata 4021 ctctagctac 
tttatatctg gcttgcggac ttgatctcaa ccattctact atctttgtgc 4081 aatctcatgt ttctgcccac agtgaactga cttggttgtt taactgcatc acgcccctga 4141 attggctgca agacatgatc cagttcaagg aaaaagcagt caaacaagga gaaaatgtga 4201 atgttggctt gttggactac ccagtgctga tggcatctga tattttgctt tatcaaccag 4261 ataaagtacc agtgggtgaa gatcaaaagc aacatttgga actgacgcgg gatatcgtca 4321 acaggtttaa tcacttcttt gctaaaccaa atcagccagt gctgaagtta ccagaacctt 4381 tgattcgtaa agaaggtgca agggtgatga gtttaaccga tggtactaag aaaatgtcga 4441 agtccgatcc gtctgagttg agtcgaatca atttgctaga ttcacctgat gagattacaa 4501 aaaaaattaa gcgctgcaaa actgatccga ttcgcggttt agagtttgac aacccagaac 4561 gtcctgaaag taacaatttg ttaacattgt atatgctgct ttctggaaaa accaaggaag 4621 aggtagcggc ggagtgtcaa gatatgggct gggggcaatt taagcctttg ttaacggata 4681 cgacaattga agcccttaaa cctatccaag aaaaatatca gacggtgacg aacgacaaag 4741 gttatttgga gtctgtgttg cgccatggaa agcacaaagc agaggcgatc gccaaccaaa 4801 ctctcaatga agtcaaagct gcgatgggtt actctatacc cctataagct tttctcatga 4861 ggttatcttg tgttatagct gtacccagag acagaaatgg tcatttttta tgctggatac 4921 gataaatcca actgctttaa cttcttttcg tacagcggca aaaacaggtc aattcctgat 4981 taccgccgag gtaacaccac ctaaaggcgg taatccagaa cacatgataa caatggcggc 5041 gactcttaag gggagggttc atgccgtcaa tattactgat ggtagccgcg cagtattacg 5101 gatgtgttca ttagtagcgt cggcaatttt gttacaaaat ggcattgagc cgatttgtca 5161 gattgcttgt cgcgatcgca accgtattgc catacaagcc gaccttatgg gcgctcatgc 5221 tctaggtatt tataacatct tagctttaac tggtgaccct gtgaaagcag gcgaccatgc 5281 cgacgctaag ggcgtgtttg atttggaagc agtgcgctta ctgcaactga ttcacaaact 5341 caatcaaggc ttagactgca acgaaaaacc tctgactgat ggtgcgctag acttatttcc 5401 tggtgcagca gtcgatccgc aatgtgcaag ttggtctggt ttgcaaagtc ggtttgagcg 5461 taaattagcg gcgggagcgc agttttttca aagtcaattg attacagact ttgatcggct 5521 agaaaagttt atggacaaaa tagccgcagg ctgtgataaa ccgattttgg caggaatttt 5581 tctgttgaaa tcagcgaaaa atgctcagtt tatcaatcgg tgtgtaccag gggtgaatat 5641 tcctgaccac ataattgata ggttagcgca agcaaaagat ccgcttgaag aagggatgaa 5701 aattgctgct gaacaagtca 
aaatcgcacg tcaattgtgt caaggtgtgc acatgatggc 5761 ggtgaagcgt gaagatttga ttgtgccaat tttggatatg gcgggtgttg catcgattag 5821 caaactttaa attttaaatg tcgggatttt ggattggctg gttttcgcta atccaaaatc 5881 tgagatacac taaattctca attacaagta atggcaagga agggcacgtt tgccgtagtt 5941 aagatatcga ctaagcttta cttaaaccat caaaaatgct tcctaaagat aaagatttaa 6001 ttctccaaga aaaagtaaaa ctctacaaca atattgccaa gataaacaat tgtgaaggtc 6061 gccttttggc agaattggag atttatccca cgccacgcat tatttgggaa tttgaaatcc 6121 ttggcaatgt tcagtgtaat tttccttcac tctcacttga ctcaaacaca aaaaatccgt 6181 tgattggaca ctgtttttcc atagaacagc cgatatgcac tggcgactct tctaatattg 6241 ttggaccact cagagctata aggggtgcta ctgctcaggc tgtttatggt gacatggagg 6301 atacagcaca taaatttatc ttttacctgc ctaatactcg ctttcagtac aatagtgtcc 6361 gccaagaaat attaataaaa atcctaaaag aggttggcag tgatacagaa gttagttggg 6421 agaacggagg taggtacgta caatcagcta tagataatat ttggagtatt cgcttagata 6481 tatggactga tgcactaaat tggctcaacc cccaaaatcg gaacacaggt agcctcatta 6541 ctacagttgg aaccctttac cagcgaaaat ataagccgac agagcctgaa accttatctg 6601 aacttcaaac tattacactc agcaatgctt tagagcagct caaacatctt tgcttgtttc 6661 tgtcttatgc caacggcggc tatattggtc ctcttttcat tgaaggttac gagtatactc 6721 aaaaccgatc gcatcccatt caaacttctt gtgcagttgc tctgacattt caaaccactc 6781 cactagaaca attgtgttat tcatgggtga ctgtggacag cgacttgaaa gtgtacatgg 6841 agtgcttccc cgcttttaag ggcataatgc aaaacccaac atggagacag acctatgact 6901 ttacactcac tcagtatttt caagcaacgc gaccaggaat gcgttggcaa gttgtagcaa 6961 ctgctgtagg tgctgccctt gagcggttaa gttacaccat tcttgttgaa gaagaaacta 7021 atgccacaac aaaagctgct tgcgaattac tctttgatat caaacaaagt caaaccgcta 7081 aacaatgctg gaatttaggt aaaagttctg gtcaggagaa tattagtgta actggcaaac 7141 gtctgagatt gttactcgaa cgcataggtt taactaagag taaaggatat gacgacatag 7201 acgatgtacc ttcgtttctt gaagtgcgaa atgatgcagt tcaccctaga gttggcagca 7261 tgacaattga acaccgctgg aagtttataa aacaagcaat tcagtggatt gatgaagttt 7321 tgttatggcg actaggttac agcagcaagt acttggatcg tacgcaagag tgggagtcgt 7381 ctacacttcc ccgttatgac ctaagtttac 
gtgcttctag ttggtaactt tgctcactca 7441 tagctttcat tgtttgtgct aaacaagtta ctgaggaagc ggcaaaactc aacttcatac 7501 atcaagacta tgccacagca tatcccgtat actgcctgag ttcaggctct cgaaacccta 7561 agacaatgtg gttctcgggg attccagctt ctgtcaaatc tttggcaata ccgtgttcag 7621 taccatcgcg ctgaatccag agtttgttgt tgataatgtc gatgtgaata atacaaccgt 7681 gaactcggcg atcgccatcc caccccacat tcaccagcaa atatcggtcg ttcttgcgat 7741 caaagacgac ttcggtttga atctcaccgt aagcgtaagg aacagcagca tattcagaga 7801 gaattttttc agtaatctgt ctatagctgt ctagggtatc catttgagaa ttacctccat 7861 gctagggtca aacacaatga gcttgacacg acctttgctt aataacagtt taccaattgg 7921 ttcttcaaac aggtctatgt aaactccatt gcggatagca aggtataaaa ctcgttctgg 7981 ttctttttct gaaagaatat cattgtagag aatatactgc cccagcgcct tttctaaatc 8041 atcgatatca gagcgaccaa caaaactttt aatttcgact gctattttag acttttcttt 8101 ctcagcagcg atgagttgtt ctgctcccaa gtcaacatat aaatcttttt tacccaactc 8161 aagttttaga ggatcgtggg ttattgtcca tccatctttg agcagagcat ttttaacagt 8221 gttgtgataa atatcgcgtg aaggcataat ctttaatcag tgaacaatga acagtgagat 8281 ctgataactg ataactgtta aatgtctact ttatagacaa aatataaccc aatacggttc 8341 agttagagct aaaaacctga ttttgcgtag gttggggagc gtaagctcca ggagggtttc 8401 cctcacttgg cgtctggcgt ttgaggaacg aaacccaaca ttctcggggc tttgttgggt 8461 aacgctatcg ctctacccaa cctacaatta tccttaactg aaccgtattg aaatataacc 8521 acaactatga ttagcaggaa ttgcctccat catcttgtta ggaaaagcta attttagatt 8581 ccgcattaat tcaataaagc tattctgttc atgtcttgca aacctcggat tacaacgttt 8641 gggacaggta caaaatatca gatgatctac tcttgtaaag gcaaaacagt gattaactca 8701 gaaaatctag taaaatctcc agtgttatga gttaagagct gaggaatacc gtaaactagc 8761 attgtggcga caatattcgc atcatgcact tgcttgccac cactaagaat ttcttccatt 8821 agcgtcaaca acctttcagt cacatgaaag ttatcttcag caactcgaaa gcggttggag 8881 aaataccgca cgtcttccac tagggaagaa acagaaattt ctccagttaa gtcacctttc 8941 cttgtcatag ctgctaaata ttccctgaga gtttggcgac tgatccatag ttcaattccc 9001 tgttcttcta tagcttgaag tcgttccgta gcttgtaagt gaaacggtga caaagctaaa 9061 ttggcataaa ccaaaatatt cgtgtcgatg aagacagagt 
tatcagtcgt ttgcatataa 9121 atcttctcgg cggaaagtaa ggctttccga gactaatcct acaggatagg cagaaaattt 9181 tagaggcgga cgcttttcct taattatctg cttctctacc aaaggttttt catcaatcac 9241 caccaccaat ttatgctcac ctgctgggat atctggtggt aactgtactg tgattttccc 9301 atcttctgta actgtagcaa ttgtcgtaat cgctttcatc tcatttctcc taaaatatga 9361 cggtagcgtc cccaatgatc atacatatcc tcacgtctta aagaaagatt ttcaggtcaa 9421 gaactgtaat tgtctactgg gaagtttaaa ggtggatgct tttcctttgt ctctggtttt 9481 ttgggtaaag gctgtttagc aattatcacc acaactttat gctcacctgc tggaatatct 9541 ggcggtaact gtactgtcat ttttccatct tctgtcactg tggcaattgt ttcgataacc 9601 ttcatatcat tcctcctttt tactccatca ctgctaacac tgctttcggc aacaatttat 9661 ctttaaacca aacaatccta gcaggcacat catttaattc aactccggct ttctccaaat 9721 tcaatcgttc agaaatagcc aaaattaaat catcacgtcc tgcacgccgt acctgtgaaa 9781 acttcttttg taaatattct ggtcgccaat aaccaactat ttctaataac agcgatcgcc 9841 catcaggatg cactaatcta aaatcgggaa tcatcacact tccaggaata ggaattaagt 9901 caacctctcg ctctaacacc caatccgttt tcgttgactc ccacttatca gcaaaagact 9961 cttccaacat actgtcgtat ggtttaccag tggaataatg agacaccaaa ccgcattcag 10021 aattgagagt aaaacgtcca gttttccatg tatttgtgta tggatcacga atttgtaata 10081 tcgtggaaag actccatttt gtgacatgaa gtaaagccgg aatcagcttc gctatagcca 10141 aaccataccg cgtactggga gtaaacaaac tcgtcggacc atcaatagta attgtaaaac 10201 cgtggtcagc atcaccctca atataagaca ttaattgaaa caacttaagg taacgaaaca 10261 aaagcttata ttgtcccgga acattgcgat gagcatttat caccaactga cttgctctat 10321 aaaacactcc ctgcacctga gataaattgt atcgatgcaa taaatcatgg ggtgctggtg 10381 catcaaaagc tattaaaatt ttattttcag ataaatccgc atataatccc tcacctactt 10441 gtttcggcaa aacttcgcgt tcaagttcat gactcaattc atcagcaatt tttgttaaag 10501 tcgtttctgt caactcgcga ctcggaggag acttcgccgc caaagcaaaa accctttctc 10561 gtaacatttg aggttccaga ggactcacca cctcaaaagt gcagaaactg cttttgagga 10621 tataagctaa accccgcttg actctataat ctggagtatc cccctccaac tccgacaact 10681 gacgctcaag cgtaccctga gtctcaccta ccgctttttg aaaacaactt gttaactcag 10741 tggcgatcgc caaatgatta ttatcaatct 
tcagtctctt cgggattatc tcttccccat 10801 tttgcctgtg gcttagtaac tctgttggta gcattggact tttgaaccgc agataaacgc 10861 agataattat tgcaacatta tcagctttca tccgtattta tcggcggttt cttttgaact 10921 ttataattta tctccaactg ttcagcagcc ttaagactct tttccttccc actcccgtaa 10981 acaacccgca aattcccttt cttctcctct ttctcttggc gttcttggcg ttttggcggt 11041 atgtcctccg gagacgctgc gcgttcgcga agcgtctccg gaggagatac gtttttcttc 11101 tccatacctc ttctcctcgc cgaagtcccc tcctcactcg tatcctccgc caccacctcg 11161 tacaaaatcg cctgcttctt attatcctta ccttttctta aaacccttcc caatctttga 11221 atatactccc gtgtcgaacc agtaccagat aaaataatcg caacagaagc cgcaggaaca 11281 tcaacacctt cattcaacac atgagaagca accaaagttt taaactctcc ctcacgaaac 11341 tttgttaata tttcatgccg ttccttgaca ggagtttgat gagtaattgc tggaaccaga 11401 aaattttgag aaatgtggta aactgtcgca ttatcagcag taaaaatcaa aactctttct 11461 ggataatgtt ctgttaatat atcagcaaga actcttatct tcccatcagt acccaaagca 11521 atttctttgg cttgacggtg cgctaacatt gctctacgtc cggcttgcga tcgcgcactc 11581 atttgcacaa atgtttgcca accttgaaga ctccctaaag atatctttga ttgcttcaaa 11641 aagtcattgc gaacttgaat cagttgattg tac // LOCUS NODE_2911_length_11631_cov_5.65298911631 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11631) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11631) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11631 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1036 /locus_tag="DP116_22705" CDS <1..1036 /locus_tag="DP116_22705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317192.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="cobalamin biosynthesis bifunctional protein CbiET" /protein_id="PRJNA477356:DP116_22705" /translation="SHSIEEILRRRGQSVCVLASGDPMCFGIGVTLTRHIPISEMTII PAPSSFSLACARLGWSLTEVETLSLNARPPALIQTAIYPGARLLILSEGKETPAIVAD ILTKRGFSGSKITVLEHMGGSQERIIAGTAASWRTTEIADLNTIAVECIIDAGVVPLS RLAGLPDDAYHHDGQLTKREVRAITLSALAPTPGQLLWDVGAGCGSIGIEWMRSHSRC RAIAIEQNSTRLQYIADNASALGTPYLQIVAGKAPAALKDLPQPDAIFIGGGATTEGL FEVCWEALRPGGRFVANAVTIESEQKLLQWHNQVGGELIRVAVQRAAPIGGFLGWKPM VPVTQWVVRK" gene 1299..1793 /locus_tag="DP116_22710" CDS 1299..1793 /locus_tag="DP116_22710" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22710" /translation="MKLVIVLTSLLIGFASSLGISALLALSSFAAQATPTTSNAQIAS QPPHRSKKQLVMTEDNAPYSELMMNGWRAFSKEEALRLFGKAKSRAQGFKDPKRVQEA EDAIKNVDQIDFAANRRKAADNLERGVSGMREEGRIDDANALQSLANRVRSGEVKGIY DMMR" gene complement(1880..2431) /locus_tag="DP116_22715" CDS complement(1880..2431) /locus_tag="DP116_22715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307870.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_22715" /translation="MSPSEKNRRIRYWCGDESRFGLQTIPGKLITLKGVKPIGLTQWK RDNFYLYGVVEPMSGENFILEFSHLDTMCFQIFLEKFAVEYPEDLHIIQVDNGAFHFS NYLKVPSNIILLFQPAHSPEVNPIERFWEEIKKHLSWECFQTLNELQEAVWKQLSKFT TSQVKSIAGWDFIINALFVSGFS" gene complement(2452..2982) /locus_tag="DP116_22720" CDS complement(2452..2982) /locus_tag="DP116_22720" /inference="COORDINATES: protein motif:HMM:PF13551.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22720" /translation="MCRVLKLDIKEQASELQMLLKQQKTASGKERIQVLYLLKTRQVE TVQHLAVLVGRGRITVQRWLKLYRQGGLISLLDQKKSPGRPKTIPLEVRLRGAQTPEG VSHLQKELSQSEGFKSYEEIRTWLRASEGIEASYKVVHEVVRYKLKAKLKTPRPCSIK QNKGVAEDFKKNSHLG" gene complement(3051..4187) /locus_tag="DP116_22725" CDS complement(3051..4187) /locus_tag="DP116_22725" /inference="COORDINATES: protein motif:HMM:PF00149.26,HMM:PF03130.14" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22725" /translation="MRRRAAQALEKIGDSTAVTGLIQALNDEDSDVRRRVVGALVKIG DSTAVTGLIQALNDEDSDVRRSAAQALRNITSPEKLPELTHFLLTTTETHLLNIISNI QDRCKFYNYTLTQPPRPQPTPTSISMTYILHLSDLHFGTLENADLWCSQLADDLKIEL DRSRLDVLILSGDIANKSTSEEYDAAKEFLDKLCQEFQLKQEQIVIVPGNHDLNWQFS EDAYTPHRRKNYQGQLKPGRDIDGGDYIEVRDEEKYKQRFAHFSKFYESIKHQPYPLE YDEQGILHHFKEQNILVLGLNSAWELDHHYKSRPSINSSALSKALNSIRQNQEYQNCL KIAVWHHPLDSPYEDPPGSPDAYGGKPSCSTGLTVSKKAIFYNV" gene complement(4160..5509) /locus_tag="DP116_22730" /pseudo CDS complement(4160..5509) /locus_tag="DP116_22730" /inference="COORDINATES: protein motif:HMM:PF13646.4" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 4206..4215 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 5716..6174 /locus_tag="DP116_22735" CDS 5716..6174 /locus_tag="DP116_22735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197271.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22735" /translation="MSNLSPATPEPENQSLTEFERNILEKEYFFLQTTIEDYNKQIWV IKALGITGTGAILALMLQQKTNGSAIALIGCAIPVFFWILESQWKYFQRGFYPRVAEI ESILSNHGLRCPCIYGGWTHAVKHSSYSPKRSNYLKDGLLNPSVYVSYVL" gene complement(6538..8448) /locus_tag="DP116_22740" CDS complement(6538..8448) /locus_tag="DP116_22740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745211.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heavy metal translocating P-type ATPase" /protein_id="PRJNA477356:DP116_22740" /translation="MLYRQRLTQFTKEHTEAIAAILCGVLLFFGWFALHLNFLGWALL LLPAAYVIGGYESAREGLTTLFKEKELDVDLLMIVAALGAAILGLWQREYHLILDGGI LILIFAISGALEGYAMQRTERSIRSLMSLTQDTARVLRSGGEEIISISQLKVGDEIVV KPGELIPTDAMIVSGLSTINQAAITGESLPIEKTVGEEVFAGTLNGYGALKLKVHQPP ESSLLQRVIRLVEQAQTEAPPSQQFVERFERRYALVIVVAGLLLAILPPFVLNWDWET TIYRALTFLVVASPCALMAAIMPTLLSGIANGARQGILFKNGAQLEMIAKVRAIAFDK TGTLTTGQLQVFQVIPTRGYTEADVLKVAASVESSSEHPIGEAIVQAAQDLNWSRAVE VRAIPGQGIVGINGHQEVIVGKANFVKQYVTHLPSELKDAADLREQEGKTVVWVAQQE QVLGVVVIADQIRAEAVRAIAHLKKLGVEQIVMLTGDNKRTAHSVAQAVGIDQVYAEL LPEDKLDVIRRLQKEYQTVAMVGDGINDAPALAQASVGIAMGKIASDVALETADIILM ADRLEKIEAAMRLGKRAQAIVKQNITVALGFIVLLLVGNFVGGINLPLGVIGHEGSTV LVTLSGLRLLRK" gene 8580..9203 /locus_tag="DP116_22745" CDS 8580..9203 /locus_tag="DP116_22745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878852.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_22745" /translation="MHAIITPQRIELPPGAVLKLLGSWQDYLALSEQLGDRTVPRIKY RLGEILLMAPLPEHGRKASLIADIVKVLLDHLEQRYDSFTPITMKLPEVTGIEPDYSF YIENWKAVVGKNRIDWESDPPPDIVIEIDVTSYTDISDYLPYKVPEIWLLKNNQIEIY RLQGEVYTTAESRYYPNISEIVQQCLQIADSQTTSDAIRWLRKFLQG" gene 9208..10317 /locus_tag="DP116_22750" CDS 9208..10317 /locus_tag="DP116_22750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745201.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MerR family transcriptional regulator" /protein_id="PRJNA477356:DP116_22750" /translation="MTKSLTIKELTSAVGGGMTPRMVRHYHELGLLPQPVRSPSNYRL YTEKDVLRLQRIVALKQQGFQLNHIRQILEVEPEEDTTASLMTQLQQQYRAVIQQISQ LRQTASALEGLLGRDRDCEIIQAEVLAQLKLLEVDTSGGLGGLEKLWNGLDAQVHAHP EAFTESLERLLPDLYSRSEIEQDLISKLVLACGDVSLASFVRLSDRAIATSRQALKSG CQIVADIPTVAAALDRTRLAHLGCPVETLIDNSHITTATEAEKAFWQHQEWQEKLHQV SPGCVLVIGYAPSVLLSVCKAIQNQEIQPALVIGMPIGFSHAPAAKRQLIQSCVPFIT VEGSLGGGLLAATVLNSLVESLIEKPDCHCYLGNY" gene 10423..11415 /locus_tag="DP116_22755" CDS 10423..11415 /locus_tag="DP116_22755" /inference="COORDINATES: protein motif:HMM:PF01656.21" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22755" /translation="MRLAVYGKGGSGKTTISTCLLQFLNYFCDFQVLGIDGDHNLHLA EELGATPVEMQRGNKEPARDIGNDLDWLRSYFAGTNPRITADLPMIKTTPPGEGSRLL RLREAAEWRDRYVTLIDGVELIRVGDFTQDDYRKKCFHSKTGAIEIFLKHVWEDNNEA IIVDMSAGKDVFASPLPSLFDRNIYVVKPTRKSVTNAGDFLAHAKNFGIDMLVVATDI RSRQEVAAIEHYLMRPVDAVIPHDPFFVQRDSLVLSRPPHVREASFTVLDAFYSFTDL LRQLPSHNWEKLLTRMFYHHEETAHDWAGVNFTAQIQPGFNPFERFQSNKLATA" BASE COUNT 3178 a 2612 c 2507 g 3324 t 10 others ORIGIN 1 cagtcattct atagaagaaa ttctccgtcg ccggggtcaa tctgtgtgcg tcttggcaag 61 tggcgatccc atgtgctttg gtatcggtgt cactcttacc cgccacatac ccatttctga 121 gatgactatc atccccgccc cttcatcctt cagccttgct tgtgccagac taggatggtc 181 actcaccgaa gtagagacat taagcttaaa cgctcgtccc cctgctctca tccaaaccgc 241 tatctaccca ggagcacgtc ttctaatttt aagtgaagga aaggaaaccc cggcaattgt 301 cgctgatatc ttgacaaagc gtggttttag tggtagcaaa attaccgtcc tagaacacat 361 gggcggttct caggaaagaa ttatcgcagg cacagccgca tcatggagaa caacagaaat 421 agctgattta aacaccatcg ctgtagagtg tattatcgat gctggagtcg tgccgttatc 481 tcggctggca ggactaccag atgatgccta tcatcatgat ggacagttga ctaagcgtga 541 agtgcgagca ataacacttt ctgctttagc tcccacacca ggacagttgt tatgggatgt 601 gggtgcgggg tgtggttcga ttggtattga gtggatgcgg agtcattctc ggtgtcgggc 661 gatcgccatt gaacaaaact ccacccgact acaatatatt gctgacaacg cctccgctct 721 tggtacacct tatttacaaa ttgttgcagg caaagctccc gccgcactca aagacttgcc 781 tcaacccgac gccatcttta ttggtggtgg tgcaacaaca gaaggtttat tcgaggtgtg 841 ttgggaagct ttgcgtccgg gtgggcgttt tgttgctaat gctgtgacga ttgaaagtga 901 gcagaagttg ttgcaatggc acaatcaggt aggtggagag ttgattcgtg ttgctgtgca 961 acgggctgct cccattggcg ggtttttggg gtggaagccg atggtgccag tgacgcagtg 1021 ggtggtgagg aagtagttgg aagcgagtat cctttcgagt tatgtatcaa gtggataatg 1081 cttgatttgg attgtgtcgt gtatcaggtc aagcagcttt ttttggggtt gttttctaaa 1141 aaaaatatgc aaaatatgat gtaaaaatgc catgtattac ccataaatat gaagatatag 1201 tcattgtaga agaagtggca acataaaact ctatggaaag tctctttgaa cagagctctg 1261 ttcattgaac 
agattttaat ttcaatggag tgaattgcat gaaattagta atagtattga 1321 cctccttact gataggtttc gccagttcgt taggaatatc agcattgctc gctttgtcat 1381 catttgccgc acaagcaaca cccaccacgt caaatgctca aatcgcttct cagccgccgc 1441 acagatcaaa aaaacaactg gtgatgacag aagacaacgc accttattct gagttgatga 1501 tgaatggatg gcgcgctttt agcaaggagg aagcgcttcg tttatttggt aaagcaaagt 1561 caagggcaca gggtttcaaa gacccaaaac gagtacaaga agctgaagac gctatcaaaa 1621 acgtagatca gattgatttt gcagcaaacc gaagaaaagc cgcagacaat ctagaaaggg 1681 gtgtctctgg tatgcgagaa gaaggcagaa tagatgatgc caatgcgctc caatctctag 1741 ctaatagagt tcgtagcggc gaagtgaaag gtatctatga tatgatgcgt tgacgcgcac 1801 atttcactag tgcatgatga cgaaagtttc cgttcagttt tagagatggg gcgaataggg 1861 agggctgttt cataccattt cacgaaaagc ctgatacaaa tagagcgttg ataataaaat 1921 cccatcccgc aatagatttg acttgagatg ttgtgaattt actcaattgt ttccatacgg 1981 cttcttgtaa ctcgttcaaa gtctggaagc attcccaact caaatgtttt ttaatttctt 2041 cccaaaaacg ttcaatcgga ttgacttccg gtgaatgtgc cggctgaaac agtagaatga 2101 tatttgacgg tacttttaga taattgctga aatgaaatgc gccattatcg acttggataa 2161 tatgtaaatc ctctggatat tctactgcaa atttttctaa aaatatttga aagcacattg 2221 tgtctaaatg cgagaattca agaatgaagt tctctccact cattggttcg actacaccat 2281 ataggtaaaa gttatctcgt ttccattgag ttaagcctat aggtttaact cctttgagtg 2341 tgattaattt tcctggtatg gtttgaagac caaatctgct ttcatctcca caccaataac 2401 gaattcttcg gtttttttct gacggtgata tcaggtattt cttgatgagt ttcaaccaag 2461 atgggagttt tttttaaaat cctctgccac acccttgttt tgctttatac tacaaggacg 2521 tggtgttttt aacttcgctt ttagcttgta acgtactact tcatgaacga ctttgtacga 2581 tgcttcgatg ccctcggatg ctctcaacca agtacggatt tcctcgtaac ttttaaatcc 2641 ttcgctttga gatagttctt tttgaagatg cgatacgcct tcaggcgtct gcgctccgcg 2701 caatcgcact tcgagcggga tagtttttgg tcgcccagga ctcttttttt ggtcgagcag 2761 actgattaat cctccttgtc tgtaaagttt taaccacctt tgtactgtta tcctacctct 2821 gcccactagc actgcaagat gttgtactgt ctcaacttga cgcgttttga gtagatataa 2881 tacttggatt ctttctttcc ctgaagctgt tttttgttgc ttcaacagca tttgaagttc 2941 agacgcctgt tcttttatat 
ccagcttgag aaccctacac atttttgttc gttattaatt 3001 aatgcccctg gaattatatg tatcatactt ttcgtgaaat ggtatgacag ctaaacgttg 3061 tagaaaatcg ctttctttga tacggtgaga ccagtgctgc aggagggttt ccctccgtag 3121 gcatctggcg aacccggagg gtcttcgtag gggctgtcga ggggatggtg ccaaactgcg 3181 attttcaggc agttttgata ttcttgattt tggcggatag agttaagggc tttgctgagg 3241 gcagaggagt tgatgctagg gcgggatttg tagtggtgat ccaactccca agcagaattt 3301 agccccaaga cgagaatgtt ttgttctttg aagtggtgca aaatgccttg ctcgtcatac 3361 tctaaaggat agggttggtg tttgatggat tcatagaatt tgctgaagtg ggcaaagcgc 3421 tgtttatatt tttcttcgtc tcgaacttcg atgtagtcac caccatcaat atctcttcct 3481 ggtttgagtt gaccttgata gtttttgcgc ctatgtggtg tgtaagcatc ttctgagaat 3541 tgccaattga ggtcatggtt accaggaacg atgacaattt gctcttgctt gagttggaat 3601 tcttggcaga gtttgtctaa aaactctttg gctgcgtcgt attcttctga ggtggatttg 3661 ttagcaatgt caccggagag gatgaggaca tcgaggcggg agcgatcgag ttctatcttg 3721 agatcatctg cgagttggct acaccataaa tctgcatttt caagcgtgcc aaagtgcaag 3781 tcggagagat ggagaatgta agtcatggag atagaggtgg gtgttggttg tgggcgaggt 3841 ggttgagtta gggtgtagtt gtagaatttg caacgatctt gaatgttaga gattatattt 3901 agtagatggg tttctgtggt tgttaataaa aaatgagtca attctggtag tttttcagga 3961 ctagtgatat ttcgtaacgc ctgagcagca ctcctacgca catcagaatc ttcatcgttc 4021 aaagcttgaa tcagtccagt gactgctgta gaatccccga tttttactaa cgccccaacc 4081 actcttctac gcacatcaga atcttcatcg tttaaagctt gaatcagtcc agtgactgct 4141 gtggaatccc cgattttttc taacgcctga gccgccctcc tacgcacacc agaatcttca 4201 tcgttnnnnn nnnnncgatt ttttctaacg cctgagccgc cctcctacgc acaccagaat 4261 cttcatcgtt caaagcttga atcagtccag tgactgctgt agaatccccg atttttacta 4321 acgccccaac cactcttcta cgcacatcag aatcttcatc gtttaaagct tgaatcagtc 4381 cagtgactgc tgtggaatcc ccgatttttt ctaacgcctg agccgccctc ctacgcacac 4441 cagaatcttc atcgtttaaa gcttgaatca gtccagtgac tgctgcggaa tccccgattt 4501 ttcctaacgc ctcagccacc ctccaacgta caaagtaatc tttatcgttc aaagcttgaa 4561 tcagtggagt cactgctgct gagtttccga tttttactaa cgccccagca gcactcctac 4621 gcacataaga attttcatcg tttaaagctt 
gaatcagtcc agtgactgct gtggaatctc 4681 cgatttttcc taacgcctta gctgcactac tacgcacatc ataatcttca tcgtttaaaa 4741 cttgaattag tgcagtcact gctgcggaat ccccgattct tcctaacgcc tcagccgccc 4801 tccaacgtac aaaataatct tcatcgttca aagcttgaat cagtggagtc actgtagaat 4861 acatgatttt ttctaacgtc ttattagcac taataagcac ataaaaaact tcatttttca 4921 aagcttgaat cagtggagtt actgctgctg agtttccgac tttttctaac gttttattag 4981 cgctactacg cacatcaaaa tttttatttt tcaaagcttg aattagtgga attactgctg 5041 agtttccgat tttttctaac gcctcagcag cactccaaca cacaaaataa tcttcatcgt 5101 tcaaagcttg aatcagtgga gtcactgctg ctgagtttcc gatttttcct aacgcctcag 5161 cagcactcca acgcacaaaa taatcttcat cgttcaaagc ttgaatcagt ggagtcactg 5221 cggaattccc gatttctcct aacgcctcag cagcactcct acgcacagaa gaattttcat 5281 cattcaaagc ttgtagtaag agcgaaactg catgttcaga acgtgtgatg cttaataatt 5341 caactttaat tttttgaaaa acttctaacc caacaactaa cgccaccgtt tgcacctgaa 5401 attctggttt taccgcacct gctagccttg ctgccaactt aagatccacc tctaaagcta 5461 actttaccac ccgcagcgct tgcttctctt cttccagcaa ctccaacatc acggctaggg 5521 gttcagtcca tttcaggtaa ttcaaatatt cccgtttcag cttgtcgtca ctgaagtgcc 5581 gtagctgctt ccgtaaactc tcggcggtgt tcgagcaaag ttaatcgtag tcatcagcga 5641 aaagctgtgt tttcatctcc gcctagagtc ctttcctgat ttatcccaat ctcagcaacg 5701 cctcaagatt tctctatgtc taatctatca ccggcaactc cagaacctga aaaccaatca 5761 cttacggaat ttgaacgcaa tattttggaa aaagaatact ttttcttgca aacgactatc 5821 gaagactaca ataaacaaat ttgggttatc aaagctcttg gtatcacagg cactggggct 5881 attttagcgt tgatgttaca gcaaaaaact aatggaagtg cgatcgcttt aattggttgt 5941 gcaattcccg tatttttctg gatattggaa agtcaatgga aatactttca gcgcggtttc 6001 tatccccgtg ttgcagagat tgaatctatt ctcagcaatc atggattacg atgtccttgc 6061 atatacggtg gatggactca tgctgtcaag catagctctt actcgcccaa acgtagcaat 6121 tacctcaagg atggtttgtt aaaccctagt gtgtatgtga gctatgtgtt ataaattggg 6181 tttttgttac tcatggctat aattgcacct agcttagcaa aataattaga ctgagatttt 6241 gcaccattct caacaaacgt agggtgggca ctgcctacca acattaaaat gagggtttgc 6301 gatttggatg ggcagtgccc accctataaa agattgtttc 
gtttgactcg cggcggacat 6361 attatgggta atttgacgga ctgtctactt atatagcaat cctaaatcat aagccctgcg 6421 ggcacgctgc gcgttagcct ctggcgtgcg cttgcgctta cgtgaacaac aagattcccg 6481 acttcgcaga aattgtcgag aatctaagcc ttccataaag cgatcgccct acacctgtca 6541 cttcctcagc aatcgtaaac cactcagagt caccaacacc gtagaacctt catgcccaat 6601 cacaccgaga ggtaagttaa tacccccgac aaaattacca accaaaagca aaacaataaa 6661 acccaaagca acagtaatat tctgcttgac gatcgcctgc gctcttttac ccaaacgcat 6721 cgctgcttct attttctcca atctatctgc catcaatata atatctgcgg tttccaatgc 6781 gacatcactg gcaatcttgc ccatcgcaat tcccacagat gcttgagcta aggctggggc 6841 atcattgata ccatctccca ccattgccac agtttgatac tctttttgta aacgacggat 6901 gacatcaagc ttatcttctg gtaaaagttc tgcatacact tgatcaattc ctactgcttg 6961 ggcgacgctg tgagcagtgc gcttattatc gcctgttaac atgacgattt gttcaactcc 7021 cagttttttc aaatgggcga tcgctcttac tgcttctgct ctgatttgat ctgcgatcac 7081 taccacgcct agcacctgct cttgttgtgc aacccaaaca accgttttac cttcctgttc 7141 tcgcaagtct gctgcatcct tcaactcact tggtaaatga gtcacatact gtttgacaaa 7201 attcgctttt ccgacaatta cctcttggtg cccgttaata cccacaattc cctgccctgg 7261 tattgctcgt acctcaactg ctcttgacca attcaaatcc tgagccgcct gtacaatagc 7321 ttcgcctata ggatgttctg aagaagattc gacagaagca gcgactttta agacatctgc 7381 ttctgtatat ccacgagtag gaatgacttg aaagacttgc agctgtcctg ttgtgagagt 7441 accagttttg tcgaaggcga tcgcccgaac tttagcaatc atttccaact gggcaccatt 7501 tttgaacaaa atcccctgtc tcgcaccatt cgcaatccca gaaagcagtg tgggcataat 7561 tgcagccatc agcgcacagg gagaagctac caccaaaaaa gtaagagcac gataaatcgt 7621 cgtttcccag tcccaattta aaacaaacgg cggcaaaatt gccagcaaca acccagccac 7681 cacaataacc agcgcatatc tccgttcaaa tcgctcaaca aattgttgag aaggaggtgc 7741 ttctgtttgt gcttgttcta ccagacgaat cacacgctga agtaaactgc tttctggtgg 7801 ttgatgcact ttaagcttga gcgcaccata gccgttgagt gtacctgcaa aaacttcctc 7861 acctaccgtc ttctctatag gtaaggactc tcctgtaatg gcggcttgat tgatggtact 7921 caaaccagaa acaatcattg cgtcggtggg aatgagttcg cctggtttga caacaatttc 7981 atctcccacc tttagctgac tgatggaaat gatttcctcc cctccagaac 
gcaaaaccct 8041 tgctgtatct tgtgtcaagc tcatcaaaga gcggatgctg cgttccgttc gttgcatagc 8101 gtagccttcc agcgccccac taattgcaaa gatgagaatt aagattccgc catccagaat 8161 aagatggtat tccctttgcc acaagcccaa gatcgcagca cccaacgccg ccacaatcat 8221 cagtaaatca acatccagtt ctttctcttt gaaaagggtt gttagtcctt cacgggcgct 8281 ttcataacca ccaatgacgt aagctgcggg taatagtagt agcgcccatc ctaaaaaatt 8341 gagatgtagg gcgaaccatc caaaaaacaa cagcactcca caaaggatgg cggcgatcgc 8401 ttctgtatgt tctttggtga attgagttag acgttgtcgg tagagcatga gagtgaaatt 8461 atgaacactc tctcaagcta aaccttgaca ttaatgtcaa tgtcaagcgc aaaagtggta 8521 tgatgggatg cgagaaaaca aagtaggtca ttttgtgtct gtctcccttg caaagaacca 8581 tgcacgcaat tatcacgcca caaagaatag agttacctcc aggggctgtc ttaaaacttt 8641 tgggaagttg gcaagactac cttgctttga gtgaacagtt gggcgatcgc actgtacctc 8701 gtattaaata tcgactcgga gaaattttgt tgatggctcc tttaccagag catggacgca 8761 aagccagttt gatagcggat attgttaaag ttttgcttga ccatttggag caacgatatg 8821 attcatttac tccaataact atgaagttac cagaggtaac aggcattgaa ccagattata 8881 gtttttacat tgaaaactgg aaagcagtag taggcaaaaa tcggattgat tgggagtctg 8941 atccaccgcc agatatagtt atcgaaatag atgtgacaag ttatacagat atttctgact 9001 atcttccata caaagtacca gaaatttggt tgttgaaaaa caatcaaata gaaatatata 9061 gactacaggg tgaagtttac acaacagcag aaagcagata ctaccctaat atttcagaaa 9121 ttgtgcagca atgtctacaa attgctgact cgcaaacgac aagtgacgca atcagatggt 9181 taaggaaatt tttgcaaggg tgacagaatg accaaatctc ttactatcaa agaacttacg 9241 tcagcagtgg gaggcgggat gactccgcgc atggttcgcc actatcatga attggggtta 9301 cttcctcaac cagtgcgatc gcccagcaat taccgtctct acaccgaaaa agacgttctc 9361 agattacaac gcatcgtcgc actgaagcag caagggtttc aactcaacca catccgccaa 9421 atactggaag tggaaccaga agaagacaca actgctagtt tgatgacgca attacagcaa 9481 caatatcggg ctgtcataca gcaaatttca caattacggc aaacagcatc tgcattggaa 9541 gggttattgg ggcgcgatcg cgattgtgaa atcatccaag ctgaagtttt ggcacaactg 9601 aagctactag aagtagacac atccggagga ttaggcggac tggaaaaact ctggaatggg 9661 ttggatgctc aagttcatgc tcatccagaa gcttttactg aatcgttgga acgcttgcta 
9721 cctgaccttt atagccgttc tgaaatagaa caagacctca tttcaaagct ggttttggct 9781 tgtggagatg tgagtttggc gtcgtttgtc aggttgagtg atcgggcgat cgccacgagt 9841 cgtcaagccc tcaaatccgg ctgtcagata gttgcggata tcccaacggt agctgctgct 9901 ctagatagaa caagattagc tcatttggga tgtccggtag aaaccttaat tgataactct 9961 catattacca ctgccacaga agcagaaaaa gcattttggc aacatcaaga atggcaggaa 10021 aaattacacc aagtctcccc aggttgtgtg ttggtgattg gctatgctcc aagtgtgcta 10081 ctctctgtat gtaaagccat ccaaaaccaa gaaattcaac cagcgttagt cattggaatg 10141 cccattggct ttagtcatgc tcctgctgct aagcgacaac tgatacaatc ttgcgtacct 10201 tttattacag ttgaaggaag cttgggaggc ggcttattag ctgcgactgt cctgaattct 10261 ttggttgagt cacttattga gaaacctgat tgtcattgtt acttaggcaa ctattaataa 10321 tagtataaaa aaatgctttt atagcacggt ttcagaatgt cccatacgag aataagatcc 10381 aatatcgttg tgaatcaaaa ataggacaaa acaaagtaaa caatgagatt agctgtttat 10441 ggcaaaggag gaagtggaaa aactactatt agtacttgcc ttttgcaatt tctgaactat 10501 ttctgtgact tccaagtttt aggtattgat ggcgatcaca atttgcatct agctgaagaa 10561 ctgggtgcaa cgcctgtcga aatgcagcga gggaacaagg aacctgcacg ggatattggt 10621 aacgatttag attggttgcg ttcttacttt gctggtacca atcctagaat tactgcggat 10681 ttacccatga ttaaaacaac tccaccaggg gaaggatcgc gtttgttgcg gttaagagaa 10741 gcagccgagt ggcgggatcg ctatgtgact ctcattgatg gagttgaatt gattcgagtt 10801 ggtgatttta cgcaagacga ctatcgcaag aagtgctttc acagcaaaac cggggcgatt 10861 gagattttct taaagcacgt ttgggaagat aacaacgagg caattattgt tgatatgtca 10921 gcaggtaaag atgttttcgc ttcaccctta cctagcttgt tcgacagaaa tatctatgtt 10981 gtcaaaccaa cgcgcaagtc agtcaccaat gcaggggatt ttctcgcaca cgccaaaaac 11041 tttggtattg acatgctagt ggtggcgact gacatccgtt cccgtcagga agtcgcagca 11101 atagaacatt atctcatgcg acctgtggat gcagtcattc cccacgatcc attttttgtc 11161 caacgggact cgttagtact gtctcgtcca cctcatgttc gcgaagccag ctttactgta 11221 ttggatgctt tctacagttt taccgatttg ttgcgtcagc ttccctcaca taattgggaa 11281 aagctgctca ctcgaatgtt ttaccatcac gaagagactg cacatgattg ggcaggagtc 11341 aactttacag cacaaattca acctggtttc aatccatttg agcgctttca 
aagcaacaaa 11401 ctcgccacag cttaacagtt aacagttatc agcgcatctt gccaaatctt tctctgacaa 11461 aatatgagtt ttgaaataca gcaatcctag ataaatcgtg aacaaaaaaa cgtagacgcg 11521 gagcggtgag acagcgctgc aggagggttt cccgacagag gcgactgcgt tcgcgtagcg 11581 tctccgcagg agatacccga agggcttccc gcagggtacc gcaaagggga g // LOCUS NODE_2955_length_11443_cov_5.518440 11443 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11443) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11443) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11443 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(322..708) /locus_tag="DP116_22760" CDS complement(322..708) /locus_tag="DP116_22760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997619.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_22760" /translation="MKVLLDTHAFIWWVTDDSQLSSTARSIIADPSNVLFLSAASAWE IVIKVRLGKLNLPEPPETYIPSRLTMNRLESLPIQMVHALQVTNLPDLHRDPFDRIII AQSQVEKMPIVTVDSQITQYPVDVIW" gene complement(692..955) /locus_tag="DP116_22765" CDS complement(692..955) /locus_tag="DP116_22765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875156.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system Phd/YefM family antitoxin" /protein_id="PRJNA477356:DP116_22765" /translation="MYNAELPANLADFGELLRRVLAGEEVILSQAGTPIARIVPLTNQ PLPRIPGLDSAKVFIAPDFDEPLPEDVLNDFLIPLDIQNESIT" gene 1315..2238 /locus_tag="DP116_22770" CDS 1315..2238 /locus_tag="DP116_22770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198279.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribonuclease D" /protein_id="PRJNA477356:DP116_22770" /translation="MPYLTSASEMRSLVAEYTQAHTLWIDTEIAEYKSRSPRLSLIQV LDDPTDLSGDRIHILDILNHPDIVAEFIDNIMLNPQIEKVFHNASFDVKYLGNKKATN ITCTLEMAKKIPYYALPVPNYQLKTLAAELCDFKNIEKQEQTSDWGKRPLTEEQIEYA YLDCIYLAQIHQRLIELNKEINPEPTTENLTSLNARYAQLEQQWKIIDSEFAHLQERI KKAMLAQNLPESSYFKLSSSERKTVKVAFVELAKLIQTQDIDFDFPITLTQKLQKDLG ENLEQLSVDIEKTTAWRLIPKNQESQENTDE" gene 2399..3202 /locus_tag="DP116_22775" CDS 2399..3202 /locus_tag="DP116_22775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748234.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22775" /translation="MKAVCIAAFTVFVALATVSCSNSDDVLVTEIGVRPVGRPVATPS QSKDFYIEGQYQQIKGNSQGAIASYTKAINLSPNYGTAYNSRGLARFDVGDQQGAIED YNQALSINSNDAQAYNNRGNARAAQGDRDGAIDDYSQAIRLNPKYAEAYNNRGNALSA KGEKNQAVDDYSQAIRLNPNFAVAYNNRGNARSAQGDRDGAITDYNEAIRLAPNFAAA HNNRGNAYAALGDKEKAMQDLQRAAETFDKQNNKLLYQQVMNNIKELGQ" gene 3285..3581 /locus_tag="DP116_22780" CDS 3285..3581 /locus_tag="DP116_22780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743192.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22780" /translation="MSNTLTDIGTLIDSHPGIHGGCPLIAGTGVTVRRIAIWYKQGLR AEEIAARIGHLTIAQVYAALTYYHINREEIDADIAAQEAEADRIEALHKASRQS" gene 3578..3946 /locus_tag="DP116_22785" CDS 3578..3946 /locus_tag="DP116_22785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015177588.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22785" /translation="MSQIRLYMDEDSGDIALVLALQNRGVDVITTLSVNRLKYPDEEQ LIWARSQGRVLYSSNIQDFYRLHTAFLTQEQPHSGMILVQQQRYSIGELMRGILRLVA AKSAEEMENEVEFLSTWIEE" gene 4114..5085 /locus_tag="DP116_22790" CDS 4114..5085 /locus_tag="DP116_22790" /EC_number="2.7.1.39" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314817.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="homoserine kinase" /protein_id="PRJNA477356:DP116_22790" /translation="MSVIPSVSITVPGTTANLGPGFDCIGAALTLYNEFKFTRLDIPP YLGKVKITVMGAEAERVVTDESNLLYQAFVKFYQYREQTPPPVEIEIHLGVPLARGLG SSATAIVGGLVGANLLEGEPLSQLEVMELAIALEGHPDNVVPALLGGCRLAATGVEGE PQRRREQKSWEICEVPWCEEIVPVVAIPDFELSTKEARQVLPTEVSRADAIFNAAHLG LLLRGLETGRGDWLRAALQDRLHQPYRKALIRGYDAVYSAAVEAGAYGMVISGAGPTL LALTDKAHSQAVAQAMTAAWMEEGIKAEVRSLCVDTQGAKSIQNSKL" gene 5541..7154 /locus_tag="DP116_22795" CDS 5541..7154 /locus_tag="DP116_22795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407323.1" /note="shuttles electrons from NAD(P)H, via FMN and iron-sulfur (Fe-S) centers, to quinones in the respiratory chain; subunit D, with NdhB and NdhF are core membrane components; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit 4" /protein_id="PRJNA477356:DP116_22795" /translation="MNMIDFPWLTTIILLPMVAATAIPFIPDKEGRTLRWYGLGVAFT DFALMIYAFWHNYDFQSSAFQLVEKYSWIPQIGLNWSVAVDGLSMPLILLTGLINTLA VFAAWKVTTKPRLFYGLMLVMYSAQIGVFVAQDLLLFFLMWEIELVPVYLLISIWGGE KRRYAATKFILYTAAASIFILVAGFAMAFSGDTVTFDMATLGMKEYPKAFELLVYVCF LIAFGVKLPIFPLHTWLPDAHGEAPAPGSMILAGVLLKMGGYALIRINMEMLPDAHVY FAPVLAILGVVNIVYGACCAFAQTNLKRRLAYSSVAHMGFVLIGIASYTELGTSGAML QMISHGLIAASLFFLAGVTYERTHTLMMDKMGGIAKVMPRTFALFTAGAMASLALPGM SGFVGELMVFLGIATSDVYSSSFKVVVVLLSAVGVILTPIYLLSMLREVFYGKQNEEL VLDALVLDVKPRELFITACLILPIIGIGFYPKLATQTYDVKTMQVATHARQVLPVVAR QQPTSLYSLIFTAPTLADSQVPGLVNIAE" gene complement(7415..7912) /locus_tag="DP116_22800" CDS complement(7415..7912) /locus_tag="DP116_22800" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22800" /translation="MLNVTIGVRTALIKYLASVGLLMTALPSALVSQSASAAPLVGGI LWSADLSNQAQTSTQNLTKRQIHIAVIGEVTAPGAYFLADSTPSDERFTESRSLPTVI KALQAAGGVTLYADTCRIQVRRQASDGSKQTIPVDLCKYRLKKDLSQDIRLQDGDAVV VPFAP" gene complement(8125..8334) /locus_tag="DP116_22805" CDS complement(8125..8334) /locus_tag="DP116_22805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315987.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_22805" /translation="MKTFTAIVERDLDTKLYVGYVPGFPGAHSQGETLDELQDNLRQV IEMLLEDEELVFQTEFVGIQQIVIQ" gene complement(8390..9793) /locus_tag="DP116_22810" CDS complement(8390..9793) /locus_tag="DP116_22810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22810" /translation="MSESAQGIPGLPDLTHPSELIQFGTHLLKASLLLVFLVLALGIA IALLSFALRQNQQQQEILIGEWAVSYSQLLRGLQHFTLVLILLVGGFFLCSTLSNRYH HWEQAKVAQVAESVAGERLEQIAPQIRYMTEEPYVYTTQVNGKIVKVNEKRKLSRFLT LSGSQIQVKIDQSPDVRGRRAVYKIDYTADYKVVNQLKDINSFFFEASPPIGYSLLQS YKVERDATRLAQTNPGDYSFPFQLQPGEQTSLRVTYKAQGAPRWVYSANGQLLSNFRL LANANFPGADFASGIVPSEIKNDRQGTQFTWVFDDNVSVKNPFGVYTYTDPIRHTGVL PRLLLLAPTIFFWWILLLYLSLPMSLTNVAIAGSVFFACLLSLTYLSRFINPQLAWTL ISLVLLALTWGLGSKNRSVSLAALICTIAGAVLPVFGLLVPFSGLTLSLAGLLSVIWL AVRHWYGLYRLELEDRR" gene complement(9860..10393) /locus_tag="DP116_22815" CDS complement(9860..10393) /locus_tag="DP116_22815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017322174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_22815" /translation="MKADELLAQYAAGKRDFTEVNLSEAFLEGADLSGAILDRAILDG ADLSRVNLSHASLIEADLNGANLNQANLTEANLSGAILDGAILEGAILDGANLSQADL TIAKLIGTQLREADLQEANLNAVSLNEADLSHADLAKADLTQADLKNAELHEANLNQT NLDNANLEGTILDEGKG" gene complement(10538..10780) /locus_tag="DP116_22820" CDS complement(10538..10780) /locus_tag="DP116_22820" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22820" /translation="MTANENNNSTPIEKNDLVGLNRDVDENLTEGLVGRVVECNEDAF DVKFPLPGDKEVSAKLPREDIDFLVGTKQLENKNEN" gene complement(10998..11252) /locus_tag="DP116_22825" CDS complement(10998..11252) /locus_tag="DP116_22825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319654.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR03643 family protein" /protein_id="PRJNA477356:DP116_22825" /translation="MKLPLLDSEIIDRIIEMAWEDRTPFEVIEKQFGLQEKQVIALMR REMKASSFRMWRERVTGRKSKHLQKREFVAGRFKSANQKT" BASE COUNT 3210 a 2405 c 2478 g 3350 t ORIGIN 1 ccgttttttc caaagccttt gtcgcaatca cagtcccaat tgcgatcgcc ccagcagtca 61 gtggttccat gtactacctc gtaagcgcgg tatcttattt atttattgat acatcgtttt 121 tgttctcaat tccgcaatca aatcagtgtt gaatttagca atttagcaat tctgactttt 181 tataatggct cagtcatgtt tagtgaattc attgacgata attgggtagt tttgaaatta 241 tgagcgataa gccggaggct tgacgcaaag cgtatcgcat tgtgtgcccc ccgtaggaga 301 tgttcccaaa gcgtgattgc attaccaaat cacatcaaca gggtactgag taatctgact 361 gtctacagtt acaattggca ttttttccac ttggctctga gcaataatta tcctgtcgaa 421 aggatcacga tgtaaatctg gtaaatttgt tacttgtaaa gcgtgaacca tttgaatcgg 481 tagactttca agtcgattca ttgtcaaacg acttgggata taggtttctg gaggttcagg 541 taaatttagt tttcctagac gtaccttaat aacgatttcc cacgcactcg cagcacttaa 601 aaacaacaca ttacttgggt cagcaattat actgcgagcc gtggatgaca gttgagaatc 661 atctgtgacc caccaaatga atgcatgggt atcaagtaat actttcattc tggatgtcta 721 aggggataag aaaatcgttt aaaacatcct ccggtaaggg ttcgtcaaag tcgggggcaa 781 tgaatacctt agcactgtcc aatcctggaa ttcgaggtaa tggttggtta gttagaggaa 841 caatgcgagc aatgggagta ccagcttgag aaagaataac ttcttctccc gccaaaactc 901 gacggagcaa ttctccaaag tcagccaagt tagcagggag ttctgcatta tacattgata 961 gtcactccga attgttgcaa ctagtgaact acttttatag gtttggtgtg atggagtgtc 1021 gcttgatgtc ttcaacaagt atatcaagca gcaagattct ccatctgcta actaatacca 1081 atttgaaaaa agaatgcaac agatacacac tcccaaaccc ttgtgatgtc tggctttctt 1141 ccggattgaa ttttgaattt tgttagcgca gcgggacgca gtccattttg aattggtata 1201 aatcctaccc ctgctttaga tacaatgtat aatctacgca gtcagagatg atctaagcaa 1261 aaatcacacg ctagaactta attgctcttg cccatttttc cgcaaattca ccccatgcca 1321 tatttgactt cagctagcga aatgcgttct cttgttgctg aatacactca agctcatact 1381 ttgtggattg atacagaaat cgcagaatac aaaagtcgta gtccgagact atcgctgatt 1441 caggtattag atgatccaac agacttgagt ggcgatcgca tccatatttt 
agacatactc 1501 aaccatcctg atattgtggc tgagtttatc gacaacatta tgctcaatcc ccaaattgaa 1561 aaagtctttc ataatgctag ctttgatgtg aaatatctgg gtaataaaaa agctacaaac 1621 atcacttgca ctctggaaat ggcaaaaaaa attccctact atgccttacc agtacctaac 1681 taccaattaa aaactttagc agcagaactt tgtgatttta agaatataga gaaacaagaa 1741 caaaccagcg actggggaaa gcgaccactc acagaagaac aaatagagta tgcttaccta 1801 gactgcattt atcttgctca aatccatcaa cgtttgatag aattaaacaa agaaattaat 1861 ccagagccta ccacagaaaa cctaacatca ctgaatgcaa gatatgcaca gcttgagcaa 1921 caatggaaaa tcatagactc ggagttcgcg catttacaag agcgaatcaa aaaagctatg 1981 cttgctcaga atctaccaga aagttcgtac ttcaaacttt cttcctctga gcgaaaaaca 2041 gtgaaggtag catttgtaga actcgcaaag ctaatacaaa ctcaagacat agatttcgat 2101 tttccgatta cactaactca aaagctacaa aaagatttag gagaaaattt ggaacaactg 2161 tctgtagata ttgagaaaac cactgcttgg cgactgattc ccaaaaatca agaaagtcag 2221 gaaaatactg acgagtgatc aacagttatc agttaacagt tatcagttat cagttatcag 2281 ttatcaggta ggaaacggac tcgtccaccc cttgtttact gttcactgtt tactgttccc 2341 tgttccctgt tccctgttcc ctgttcccta tgtttcttca gaagatgact ggtaaagaat 2401 gaaagctgtc tgcatagctg cgtttaccgt atttgtcgcc cttgcaaccg tttcctgtag 2461 caacagtgac gatgttttag taacagaaat tggagttcgt cctgttggac gccctgtagc 2521 aacaccatct caatccaaag acttttatat tgaaggtcag tatcagcaaa ttaaagggaa 2581 ttcacaagga gcgatcgctt cttataccaa agcaattaac ctcagtccca actatggaac 2641 tgcatataac agtcggggac ttgctcgctt tgatgtagga gatcagcagg gggcaataga 2701 ggattacaat caagccctta gcataaattc taacgatgct caagcttata ataaccgagg 2761 aaatgctcgc gccgcacaag gagacagaga cggagcgata gacgattaca gccaagcgat 2821 tcgcctcaat cctaaatatg ccgaggcata caataatcgg ggaaatgctc tttcggcaaa 2881 gggagagaaa aaccaggcgg tagacgatta cagccaagcg attcgcctca atcctaactt 2941 tgctgttgct tataataatc ggggaaatgc tcgcagtgct caaggagata gagacggagc 3001 aatcacagat tacaacgaag cgattcgcct tgctcctaat tttgcagcag cacacaataa 3061 ccgaggaaat gcttatgctg cactaggtga taaagaaaaa gctatgcaag acttgcagcg 3121 agcagcagaa acctttgata aacagaataa taaactactc tatcaacaag tgatgaacaa 
3181 tattaaggag ttagggcagt agtgttagac gctttcaagt atttggaaca acacttggat 3241 tgcgtttgtt aaaataggca gcaatggtta tgtcatttaa cactatgtca aatactctca 3301 ctgatattgg cactctcatt gacagtcatc ctggaattca tggaggctgt cctctgattg 3361 ctggcactgg tgtcacagtg cgacgaattg ctatttggta taaacaaggt cttagggcag 3421 aagaaattgc cgctaggatt gggcatttaa cgatagcaca ggtttatgca gctttgacgt 3481 attatcatat caatcgagaa gaaatcgatg ccgacattgc tgcccaagaa gcagaagctg 3541 accgcataga ggcattacat aaagccagca gacagtcgtg agtcaaattc gcttgtacat 3601 ggacgaggat tctggtgata ttgcattagt cctggctcta caaaatcgtg gtgtagatgt 3661 aatcacaact ctaagtgtaa accgattaaa atatccagac gaagaacaat taatttgggc 3721 aaggtcgcaa ggtcgcgttt tgtacagttc taacattcaa gatttctatc gtttacacac 3781 agcttttttg actcaagaac aacctcactc agggatgatt ttagtacagc agcaacgtta 3841 ctcaatcgga gaactgatgc gtggtatttt gagattggtt gctgctaaat cagcagagga 3901 aatggaaaac gaagtagagt ttcttagcac ttggatagag gaataaatgt gagcgatacc 3961 gctcacacaa acctaaacac ttatttgttg ttctttggca cgtcgctgta gtatttgcca 4021 tgcggtagac gtcacacaag agattcagat aaccgctgag caaattaatc agttacgtga 4081 gatgctgatt gattaattag tttttactga caaatgtctg ttattccttc tgtatctatc 4141 actgttccag ggacaactgc taatctagga ccaggttttg attgtattgg tgcggctcta 4201 acgctgtaca acgagtttaa gttcactcgc cttgatatac cgccttacct tgggaaggtc 4261 aaaattactg tcatgggtgc ggaggctgaa cgagttgtca ctgatgaaag taatttgctc 4321 tatcaagcgt ttgtgaaatt ttatcaatat agagagcaga caccgccacc tgtagagatt 4381 gagattcatc ttggtgttcc actggcgcgg ggtttgggga gttcggcgac agcaatagtc 4441 ggtgggttgg ttggtgccaa tctgttggag ggggagcctt tgagccagtt ggaagtgatg 4501 gagttggcga tcgctctaga aggacatccg gataatgtgg taccagcttt gttggggggg 4561 tgtcggttgg cggcgacggg ggtggaaggt gaaccgcaga ggcgcagaga acaaaagagt 4621 tgggagattt gtgaggttcc ttggtgtgag gagattgtac cagtggtagc gattccagat 4681 tttgagcttt caacgaagga ggcgcgccag gttttgccta ctgaggtaag tcgtgcggat 4741 gcgattttta atgcggcgca tctgggattg ttgttgcgtg gtttggaaac tggcaggggt 4801 gattggttga gggctgcttt gcaagatagg ttgcatcaac cttatcgaaa agcattgatt 4861 
cgaggttacg atgcggttta ttctgctgct gttgaagctg gtgcttatgg gatggtgatt 4921 agtggtgctg gtccaacgct gttagctttg acggataaag ctcattctca agctgtggct 4981 caagcgatga cagcagcttg gatggaggag gggataaaag cagaggtgcg atcgctctgt 5041 gttgatactc aaggagcaaa atcaattcaa aattcaaaac tctaaatttc aaatatagat 5101 acgtggttat atgtcaaccg tattgtttga gatattcata ggtaacttta tctgtcttct 5161 tgtttaagta ctgagccttt aactagcgcg gttacaccct cgcaacgaat gaaagaaaca 5221 cctttttccc ttacacccct atacccttac acccctacac ccctacaccc ctgtgtttct 5281 tgaaagaact ctcctcgcca gataaccatc ctaggacaga gttaatataa aaatttgtaa 5341 cggtggctgt tgccgccgtt ttttgttgac gtagcaaaac tttacaatta gaatggattc 5401 gagcttgaaa tgagtattta ttcttatgca aatctaaaga tttggttttt gtaaggaaac 5461 aatgttcaca aaactatgaa ttgcttaata taatgtaatt aaaaagaaaa acataaagtg 5521 attgaaagtg attgtcagcg atgaatatga tcgattttcc ttggttaaca accataatcc 5581 tattgccaat ggtggctgct acagccatcc cctttatccc agataaagaa ggcagaactc 5641 tccgctggta tggtttggga gttgcgttta cagactttgc cctgatgatt tatgcttttt 5701 ggcataacta cgattttcaa agctctgcat tccaattagt tgaaaaatat tcttggattc 5761 ctcaaatagg tttgaattgg tctgtagcgg ttgatggctt atcgatgccg ttgatactgt 5821 tgacaggctt gattaacaca ctcgcagtat tcgcggcttg gaaagtaact accaagccgc 5881 gtttgtttta tggtttgatg ctggtgatgt atagcgccca aattggcgtg tttgttgccc 5941 aggatttgct gttgttcttc ctaatgtggg aaatcgagtt ggtacctgtg tacttgctga 6001 tttctatctg gggaggagag aagcgccgct atgcagctac caaattcatt ctctacactg 6061 ctgctgcatc aatatttatc ttggtagcgg gctttgcaat ggcattctct ggagacaccg 6121 ttaccttcga catggcaact ttgggaatga aggaatatcc caaagcattt gaactcctag 6181 tgtatgtgtg cttcttgatt gcttttggtg tcaagctgcc gattttccct ctgcacacct 6241 ggttacccga tgctcacggt gaagccccag cacctggttc gatgattttg gctggtgttt 6301 tgttgaagat gggtggttat gcacttatcc gcatcaacat ggagatgtta cccgacgctc 6361 atgtttactt tgctcccgtg ctagcaattt taggtgtggt gaacattgtc tacggtgctt 6421 gctgtgcctt tgctcaaaca aatcttaagc ggcgcttggc ttactcgtca gttgctcaca 6481 tgggttttgt cctgattggt attgcttcct atacagagtt gggcaccagc ggcgcaatgc 6541 tacagatgat 
ttctcatggt ttgattgctg caagcttgtt cttcctcgct ggcgtgactt 6601 acgagcgcac tcacaccttg atgatggaca aaatgggtgg tatcgcgaaa gttatgccca 6661 gaacgtttgc tctcttcaca gcgggtgcaa tggcttctct ggcgttacca gggatgagtg 6721 ggtttgtcgg tgagttgatg gtttttctcg gaattgccac aagcgatgtt tacagttcta 6781 gcttcaaagt tgtcgtcgtc ctcctctcgg ctgttggtgt gattctgact ccgatttact 6841 tactgtcaat gctgcgcgaa gtcttctacg gtaagcaaaa tgaagagtta gtgctggatg 6901 cactcgtact ggatgttaaa ccacgcgaac tgttcatcac tgcttgtttg atacttccca 6961 tcatcggtat cggcttctat cctaagttgg cgacacaaac atacgacgtg aagactatgc 7021 aagtcgcaac tcatgcccgt caagttttac cagttgtcgc tagacagcaa ccaacaagtt 7081 tatattccct tatatttaca gcaccaacac tggctgattc acaagttcca ggtttggtta 7141 atattgctga gtaatatttg ttcccagaag gctggcgcta tcaaagaatc ctagaaagat 7201 tataaatggg gggtgattaa gcaatcaccc cctaaaattt agtaataaag tcacatcagc 7261 tcatgacttt agtcccatgt aattcctcgg taaaattttt agtattaatt cggcattaac 7321 atcaagcact caaaacctga cccgaattag acacgggtta ggtttttttc tgtagttata 7381 taaaagagtt aggctggagt tgaggacgca atcatcatgg ggcgaatggt actacaactg 7441 cgtcaccgtc ctgcaacctt atatcttgac ttaagtcttt ttttaacctg tacttgcaca 7501 aatctacagg aattgtttgt ttcgatccgt cagaagcctg ccgacggact tgaattcggc 7561 aggtatcagc atataaggtg acaccgccag ctgcctgaag cgctttgatt acagtaggca 7621 atgaccgaga ctctgtaaaa cgctcatccg atggtgtaga gtctgccaga aagtatgccc 7681 ccggagctgt cacttctcca atcacggcaa tatgaatctg ccgctttgta aggttttgag 7741 tgctagtctg tgcctgattg cttaggtcag cagaccagag aattccgccc accaacggag 7801 cggctgacgc tgattgggat actaacgcgc ttggtaaagc ggtcatcaaa agaccaacgg 7861 aagctagata ttttattagg gctgtccgaa ctccgattgt gacgtttagc atgcttttag 7921 acttctggtt ttttatttgt taattggcaa tcgctcgcct aaagttccct ctgctttgac 7981 cccatatttt cgtctaaagt ttcaccttga gaatgcgctc cagcaaaccc aggaacatat 8041 ttgacgcact tcaataaacc caagattttc cagtatccga actacttcct gtggtttgag 8101 aacaggtatg ttgctcataa gtgtttactg gatgacaatt tgttgtatac caacaaattc 8161 tgtctggaaa actaattcct catcttccaa aagcatctcg atgacttgac gtaagttatc 8221 ttgtaattcg tctaaagttt 
caccttgaga atgcgctcca ggaaacccag gaacatagcc 8281 aacataaagt ttagtatcaa gatctctctc aacaatcgca gtaaatgttt tcatcggact 8341 tgagttttta aactagttta acggaaaaaa tgcaatagaa aagacctcat tatctacggt 8401 cttctagttc taatctatac aacccatacc aatgacgaac tgcaagccag ataactgaaa 8461 gcaagcctgc aaggctgaga gtaagaccac taaatggtac taataatcca aaaacgggta 8521 atacagcccc agcaattgta cagataaggg cagctagaga aacactgcga ttttttgacc 8581 ccaaacccca tgtcaatgct agtaagacaa gtgaaatgag agtccaagct agctgaggat 8641 ttatgaagcg actgaggtaa gtcaaactta gtaaacaagc aaagaaaaca ctacctgcaa 8701 ttgccacatt tgtcaagctc atgggtagtg ataaatacag cagcaatatc caccagaaaa 8761 atattgtcgg tgctagcaat agcagtcgcg gcaggacacc agtgtgacga atgggatcgg 8821 tgtaggtgta tactccaaat ggatttttca ctgaaacatt atcgtcaaag acccaagtaa 8881 attgagttcc ttgtctgtca ttttttatct cactgggtac gataccactg gcaaaatctg 8941 ctccaggaaa gttggcgttc gctagcagac gaaagttaga aagcaactgc ccattagcac 9001 tataaaccca gcgcggtgct ccttgtgctt tataagtgac tcttaaactc gtttgttctc 9061 ccggttgtag ttgaaaagga aagctatagt caccaggatt tgtttgtgcg agtctggtag 9121 cgtcacgctc tactttataa ctttgcaata aggagtaacc aatgggtggt gatgcttcaa 9181 agaaaaaact attaatatct ttgagttggt taacaacttt atagtcagca gtatagtcta 9241 ttttataaac tgcccgacga cctcgaacat ctgggctttg gtcaatctta acttgaattt 9301 gcgagccact taatgttaaa aaacggctga gctttcgttt ttcatttacc ttgactatct 9361 tgccattaac ttgagtggtg tatacatatg gctcttcagt catgtatcgt atttgaggcg 9421 ctatttgttc taacctttca ccagcgacac tctcggcaac ttgagctact tttgcctgtt 9481 cccaatgatg gtaacgattg ctcaaagttg aacaaaggaa aaaccctccc accagcagga 9541 ttaacaccaa ggtgaaatgc tgtaatcccc gcaatagttg agagtaacta actgcccact 9601 ctccaatcaa aatctcttgc tgttgttgat tttgacgtag ggcaaagctc aacaaagcaa 9661 tggcaattcc caaagctaaa accagaaata ctaataacag tgaagcttta agtagatgcg 9721 ttccaaattg gatcagttcg ctaggatgcg ttaagtcggg taatcctgga atgccttggg 9781 ctgattcaga catggagggg tgttggaaga gtttggagat tctgactgag aaaatctcct 9841 aaaagtttct taccaaaagt tagcctttac cttcatccag gattgttcct tctagatttg 9901 cattatccaa gttggtttgg tttaaattcg 
cttcatggag ttctgcgttc ttgagatctg 9961 cttgggtgag atctgctttc gccaagtcag catgactgag atctgcttcg tttaaagaaa 10021 ctgcattgag gttagcttct tgtagatcgg cttcgcgtaa ttgagtaccg attaactttg 10081 cgatcgtcaa atcagcttga ctcaaattag caccgtccaa aatggctcct tctagaattg 10141 ccccatctaa gatggctcca ctcaaattag cttccgtgag atttgcttga tttaaattgg 10201 ctccatttaa gtcagcctct attaagcttg cgtgactgag attaaccctg cttaaatcag 10261 ctccatccaa aatggctctg tctaaaattg ctccactaag atcagctcct tccaagaatg 10321 cttcgctcag gtttacttca gtaaaatctc ttttaccagc agcgtattgt gccaatagtt 10381 catctgcttt cattgcttcg cctccgacaa gttttgactt cttttgtcag ctacattagc 10441 aattttttgg gagtatgcca aatttacaaa aatcgctccc aattttttga gggcagaggg 10501 gtactctgcc ctcgaagatg aaactaatta attttcatta attttcattt ttattttcaa 10561 gctgcttcgt tccaaccaaa aagtctatat cttcccttgg cagtttggca gacacttcct 10621 tgtctcctgg aagaggaaac ttaacatcaa atgcatcttc gttgcattct acaacccttc 10681 ctactagacc ttctgtcagg ttctcgtcta catcgcgatt cagtccaact aagtcgtttt 10741 tctcaatagg tgttgaattg ttattctcat tagccgtcat atttccgacc tccagttact 10801 actaactatt agagtgggct gtcaatgctt tcgcaaagta ttttgttaaa gtattctgtt 10861 aataatcaca catcagttgc aaatgcactt cctacctttg cccgatattg acgctttgta 10921 ccttacctat ggatatacga ataaaggaag agctgatatg cggggtgctg acctaagccc 10981 ttgtaagaat gagagtttca agtcttttga ttagctgatt taaaccgacc tgcgacaaat 11041 tctcgctttt gtaaatgctt gcttttgcgt ccggtaacac gttcgcgcca catccgaaaa 11101 ctggatgctt tcatttcacg gcgcatgagg gcaatgacct gcttttcttg caaaccgaat 11161 tgtttttcaa ttacctcaaa tggagtccgg tcttcccatg ccatttcgat aatacggtcg 11221 atgatttcgg aatcaagtaa tggtagtttc attggtggtt aacaatgttt gtaagagaat 11281 gtttgtaaag tctaccgttg tattgagatc ccccctagcc cacgccagac gctaccctac 11341 gggaagccgc ccttcgggcg tctacaagtc gggagaccct cctacagtac tggctcaacg 11401 gacaggtgcc tcaagtcggg aaacccgccc acggcactgt cct // LOCUS NODE_2956_length_11441_cov_3.27033211441 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11441) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11441) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..11441 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..913) /locus_tag="DP116_22830" CDS complement(<1..913) /locus_tag="DP116_22830" /inference="COORDINATES: protein motif:HMM:PF00361.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22830" /translation="MSFDALVQYVAADAKSSIAGFLPELVLCVAIVLLLVVRVAGWRV RGDAYLISTIGALAALLVAVLDAREGSLFLPLVEKSPHSPRVGAELFTGMIVYDHFTQ FWRILLAGFLVLFLRLTRLTGVPDDEDGPDIYSLVIGSTIGLMSMASANHVLMIFLAM ELASVPTYILAGLNKGRRQASEAALKYAVYGAAASGVMIYGMSLLCGVLGSLHLPTMA ERLAATLGDPALADRQTMLVLGGLFFAVGLAFKLATFPFHFWAPDVFEGATAEMGAFL SVASKAGALALTMRVALTFAGGPLVPEP" gene complement(910..2682) /locus_tag="DP116_22835" CDS complement(910..2682) /locus_tag="DP116_22835" /inference="COORDINATES: protein motif:HMM:TIGR01972" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit M" /protein_id="PRJNA477356:DP116_22835" /translation="MTDAVWFSLIVFVPLVGAILCAGLPRGSDDAARWITLGASLVVF ALAAALLVVGPKGAEDAFRPGVGTIQHVVTLEWIPSFGVYYFLGVDGISFWLLLMTTF VSVLAAAASWSIDKNVQSYSALFLILLTGMVGVFLALDLLLFYVFFELVLLPMYFLIG VWGGPRKEYAAIKFFIYTLFGSALILIAVLMLYFASDLTKLTDDQLRAAHLPADAVAQ IAANRIDGPPVRTFNIVALQTLARETDVFRAAPLFWGQTLEWWAFVLLTIGFLVKVPS VPFHTWLPDAHVEAPTAISMLLAGVLLKMGGYGLIRIALPICPIAAQVMGPVLAGLGA VSIVYGALAALAQTDFKRLVAYSSVSHMGYVLLGLSLWASGPDAALVDSWSMGLSGAT YQMIAHGLTSAGMFFIVGVLYDRLHHRDLNRFGGLAGEMPVGAGMAAVVFFASLGLPT LCGFIGEVFVLLSTWSRSPVAAAIGAVTTIFTAGYILTALQRVFYGVKPEGIEREPVA DVTPREIAVLAPIVAFSVVFGVLPQLVFRYVEPATTHIAREVVAAGARPGVLAPIAAA RSRPADPPAAVVEVARDAAKGATR" gene complement(2701..5025) /locus_tag="DP116_22840" CDS complement(2701..5025) /locus_tag="DP116_22840" /inference="COORDINATES: protein motif:HMM:TIGR01974" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit L" /protein_id="PRJNA477356:DP116_22840" /translation="MIETIPSLLLTAAAAPLFAALVACAVRLAGRGRSRAPALVSTIA VAVSLLSSLVALAAWASHHGVGSERAGDAPNVLRGTWLTFAQFGGLRFDVSYSIDALT IVFFCVIGIVATCVHVYAIAYMHEETHASVEDHEIVSPLGDHLHRPGRYAAFFQHLSL FTFSMFGLVLTGNLLVLFAFWELVGATSYSLIGFYRERDSAASAARQAFVVNRVGDAG FLLGILILANLCGTLELHDLDGRLGIVSQFRMAANHFAWAVPARVGDANYGLLVLAGL GVCCGCVGKSAQAPLHAWLPDAMAGPTPVSALVHSATMVAAGVYLVARLAPVFPAEVL LVLAYVGAITLTLGALFALTATDLKRVLAFSTISQLGYMTLAVGVGGWNAATFHLTTH AFFKSLLFLGAGAVIHATHVQDLRRLGGLLKPLKVVAVTMLLACVAMVGAGLPLIDVG TSGYYSKDAILAQALAFSQRNPQHAVLFYLPLGAALLTAAYIFRLWFLTFVGPPRDPV ARQHVHEPSRLMTVPLVVLATLAISAGWTFGLDAGLSPLLGQAVLPTARPGAHARSEP ESADSTAHVADVPSARTGREPLAEPDDIAQGRWLPSLVIPAEAASHTPALHLAASLGG FAAGLAGFAIAAAFYGWRWLSADELARSFAPLSRALRSEGRFTSLYRTAFVAPTLAVA RAAKWFDRVVLDGIVDGSARLAVWLARVDDRFDRGIVDGFVNAISRVIYETALSLRVV QTGRLRQYVLILAVGAAGLFLVASWWLSPPAARP" gene complement(5073..5417) /locus_tag="DP116_22845" CDS complement(5073..5417) 
/locus_tag="DP116_22845" /EC_number="1.6.5.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012912073.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit NuoK" /protein_id="PRJNA477356:DP116_22845" /translation="MDLLTQPIGATHYLIVGAILFVCGLLCMAVKRNAVGLLMGVELV LNAALVNWVAFSSKSFRADVDVALGLDGHLAALFVIILAAGEAAAALAIALNFYNNHA TIDVDRGDQLKG" gene complement(5419..5925) /locus_tag="DP116_22850" CDS complement(5419..5925) /locus_tag="DP116_22850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020470259.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22850" /translation="MAVYLIFALCGTSGLFFLAGAEFVGAAQLMVYVGGTLVLLIFGV MLTSPGPFASLNTRPADWVLGLGVGGLLLAAVLIPAIFSVADWRPNPRADVELAAPRL EVKSAEIGLALIGVRADRVRDAEDPLAPGRSGWLLPFELSSVYLLVVLIGAAYLARAK RSARTHEA" gene complement(6021..7215) /locus_tag="DP116_22855" /pseudo CDS complement(6021..7215) /locus_tag="DP116_22855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020470260.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hydroxyacid dehydrogenase" assembly_gap 7099..7108 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 7541..8008 /locus_tag="DP116_22860" CDS 7541..8008 /locus_tag="DP116_22860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002645575.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_22860" /translation="MLPYDFDASIGFWICMANQAVQRELADRLEPQGITYRQAQVLGN LIHSGEMTQATLARKMFIEPPTLVGILNRMERDGWIGRTICPSDRRKKYVRINPEAAD AWSKVVECAYAVRAKASDGLTDEELATLKRLLEKVRANLGAGVEIADLAACDD" gene complement(8033..8731) /locus_tag="DP116_22865" CDS complement(8033..8731) /locus_tag="DP116_22865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009102309.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_22865" /translation="MSRHKILVIEDETALLEALRYNLEREGFEVLTATDGLSGLQRAQ SVLPDLIVLDLMLPVLEGLEVCRRLRDDANTRTIPIVMVTARGEEIDEVVGFQMGADD YVAKPFKVRPLLQRVKALLRRTNVREDQTASIAEQGVEIDRWRHRATLDGRVLPLTPT EFRLLWSLARQPGRAFSRHELMDASMGEDVASLDRTIDVHVKSLRQKLGDRAELIETV RGVGYRFKEASASN" gene complement(8800..9711) /locus_tag="DP116_22870" CDS complement(8800..9711) /locus_tag="DP116_22870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007418983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar phosphate isomerase/epimerase" /protein_id="PRJNA477356:DP116_22870" /translation="MTSPLDRRTFLAAGVGAAAALASGGPGSSASAQDAPVKKYKKAV KIGMVRAGATLVDKFKILKEIGFDGIELDSPNGFDKSEVLAARDESGLPIHGVVDSAH WKETLSHPSEEVRAKGLAALETAIRDAHAYGATSVLLVPAVVNKEVSYADAYTRSQAE IRKALPLAAELKIKILFENVWNNFLLSPLETARYIDEFESEWIGAYFDVGNVVLYGWP EQWIRTLGKRIGKLDVKEYSRKVAREKGTGAGFGVELLEGDCDWPAVMRALEEIGFTG WGTAEIPGGDETRLRAIAERMDKIYAS" gene 9710..>11441 /locus_tag="DP116_22875" CDS 9710..>11441 /locus_tag="DP116_22875" /inference="COORDINATES: protein motif:HMM:PF07583.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22875" /translation="MKSPRGSRTAEMARGTRPRAKAYQAAQTAGKPLARPRGNRRRPP VNFPCGRAAPVRLTDRRATITLDASTNFDGLGSRSMRRRASNGRNATPAVVLAALAWS ASLAAAAETTVFPESITLDDVFARRQVLVTAAGPRGPEDRTSQAKYVSLDPHVATVDE RGYVAPVAVGKTALEIAIDGRKLRVPVEVTGLDGGRPVDFAREVVPLLTRFGCNAGGC HGKASGQNGFKLSLFGFDAAYDHKAIATEARGRRVFPAAPDQSLLLLKAIGAVPHGGG RRLVKDSDPYLVIRRWIEQGLPATAPDAPTIKSVRIEPHERTMRKHEKQQLAVIAEYS DGSTRDVTRQAQFNTNEGAVAVSSDDGLVSANDLPGEAAIMAAYLGQVAVFKVLVPTG EPLDAIPEFTPNNFVDELVLAKWKKLGLRPSPTCTDEEFLRRVTIDLAGRLPTVDEAS AFAADSSATKREAAVDRLLASPDYAAYFAMRWGTILRNSRLAGADKAAYAFHNWLKDL IAQNRPYDELVRGVVAASGEWQDAPAINWYWQMRDDQLHQPVGDTAQVFLGLRLQCAQ CHHHPYERWSQ" BASE COUNT 2072 a 3964 c 3901 g 1494 t 10 others ORIGIN 1 gcggttcggg cacgagcggg ccgccggcga aggtcagggc cacgcgcatc gtgagggcga 61 gcgctccggc cttcgaagcg acggagagga acgcgcccat ttcggccgtc gccccttcga 121 agacgtcggg agcccagaaa tggaacggga atgtggccag cttgaacgcc agcccgaccg 181 cgaagaagag cccgccgagc accagcatcg tttgacgatc ggccagcgcc ggatcgccca 241 acgtcgcggc cagccgctcg gccatggtcg gcaggtggag cgagccgagg acgccgcaga 301 gcaggctcat gccgtagatc atgacgccgg aggccgcggc cccgtagacc gcgtatttga 361 gcgccgcttc gctcgcctgg cggcggcctt tgttgagccc cgcgaggatg tacgtgggga 421 cgctggccag ctccatcgcc aggaagatca tcagcacgtg attggccgag gccatgctca 481 tcaggccgat cgtcgagccg atcacgagcg aataaatgtc ggggccgtcc tcgtcgtcgg 541 ggacgccggt caggcgggtc agtcgcaaga ataggacgag gaagccggcc agcaggatcc 601 gccagaactg cgtgaagtga tcgtaaacga tcatccccgt gaacagctcc gcgccgaccc 661 gcgggctgtg cgggctcttt tccaccagcg gcaggaagag cgaaccctcg cgcgcgtcga 721 gcaccgcgac cagcagcgcc gccagcgcgc cgatcgtcga gatcagataa gcgtcgccgc 781 ggacgcgcca gccggcgacg cggacgacga gcagcagcac gatcgcgacg cagagcacca 841 gctcggggag gaaccccgcg atcgagctct tcgcgtcggc cgcgacgtat tgcacgaggg 901 cgtcgaaact caccgcgtcg ctcccttcgc cgcgtcgcga gcgacctcga cgacggccgc 961 gggcgggtcg gccggccggc tccgcgcggc ggcgatcggg gcgagcacgc cgggccgcgc 1021 gccggccgcg acgacttccc gcgcgatgtg cgtcgtggcg 
ggctcgacgt agcggaacac 1081 caactgaggc agcacgccga agaccacact gaacgcgacg atcggcgcga gcaccgcgat 1141 ctcgcgcggg gtcacgtcgg ccaccggctc gcgctcgatc ccttcgggtt tgacgccgta 1201 gaacacgcgc tgcagtgcgg tgaggatgta gcccgcggtg aagatcgtcg tgaccgcgcc 1261 gattgcggcc gcgacgggcg agcgactcca cgtcgagagg agcacgaaga cctcgccgat 1321 gaagccgcac agcgtgggca gcccgagcga ggcgaagaag acgaccgcgg ccatccccgc 1381 gccgacgggc atctcgcccg cgaggccgcc gaagcggttc aggtcgcgat ggtgcaggcg 1441 gtcgtagagc acgccgacga tgaagaacat cccggcgctg gtcaggccgt gcgcgatcat 1501 ctggtaggtc gcgcccgaca gccccatcga ccacgaatcg accagcgccg cgtccggtcc 1561 gctcgcccac agcgacaggc cgagaagcac gtagcccatg tggcttaccg agctgtacgc 1621 gaccaggcgt ttgaagtcgg tctgcgcgag cgcggcgagc gccccgtaga cgatgctcac 1681 cgcgcccagg cccgcgagca ccggccccat gacctgcgcc gcgatcgggc agatcggcaa 1741 cgcgatgcgg atcaggccgt agccgcccat cttcagcagc acgcccgcca gcagcatcga 1801 gatcgcggtg ggggcctcga cgtgggcgtc gggaagccag gtgtggaacg gaacgctcgg 1861 gaccttcacc aggaagccga tcgtgagcaa cacgaacgcc caccactcca gcgtctggcc 1921 ccagaacagc ggcgcggcgc ggaagacgtc ggtctcgcgg gcgagcgttt gcagggcgac 1981 gatgttgaac gtgcggaccg gcgggccgtc gatccgattc gcggcgatct gagccaccgc 2041 gtcggccggc aggtgcgccg cccggagctg gtcgtcggtg agcttggtga gatcgctggc 2101 gaagtagagc atcagcaccg cgatcaggat cagcgccgag ccgaacagcg tgtagatgaa 2161 gaacttgatc gccgcgtatt ccttccgcgg gccgccccag acgccgatga ggaagtacat 2221 cggcaagagg accagctcga agaagacgta gaacagcagc aggtcgagcg cgaggaacac 2281 cccgaccatg cccgtcagca ggatcaggaa cagggcgctg tagctctgca cgttcttgtc 2341 gatcgaccag ctcgccgcgg ccgccagcac gctgacgaag gtcgtcatca gcagcagcca 2401 gaagctgatc ccgtcgacgc cgaggaaata gtagacgccg aacgacggga tccactcgag 2461 cgtcacgacg tgctggatcg tgccgacccc gggtcgaaaa gcgtcttccg cgcccttcgg 2521 cccgacgacg agcagcgcgg cggcgagggc gaacacgacg agcgacgcgc ccagcgtgat 2581 ccagcgggcc gcgtcgtcgg agccgcgcgg caggccggcg cagagaatcg ccccgacgag 2641 cggaacgaag acgatgaggc tgaaccaaac ggcgtctgtc atgccggtgt cggttgcgcg 2701 ttagggacgc gcggccgggg gcgaaagcca ccagctcgcg accagaaaca 
ggccggccgc 2761 gccgaccgcg agaatcaaca cgtattgccg cagccgcccg gtctgcacga cgcggagcga 2821 gagggccgtc tcgtaaatca ctcgggagat cgcgttgacg aacccgtcca cgatcccgcg 2881 gtcgaagcgg tcgtcgacgc gcgccagcca cacggccagc cgcgccgagc cgtcgacgat 2941 cccgtcgaga accacgcggt cgaaccactt cgcggcgcgc gcgacggcca gcgtcggcgc 3001 gacgaacgcg gtgcggtaga gactcgtgaa ccgaccctcg ctccgcagcg cccgcgacaa 3061 cggcgcgaac gagcgggcaa gctcgtcggc cgagagccac cgccagccgt agaacgcggc 3121 cgcgatcgcg aagcccgcca gtcccgccgc gaatcccccc agcgacgcgg ccaggtgcaa 3181 cgccggcgtg tgcgaagcgg cttccgcggg aatcaccagc gacggcagcc agcgaccttg 3241 cgcgatgtca tccggctcgg ccaggggctc acgccccgtc cgagccgagg gaacatcggc 3301 gacgtgagcc gtcgaatcgg ccgactccgg ctcggatcgg gcgtgagccc ctggccgagc 3361 cgtgggaaga acggcttgac cgagaagcgg cgacaagccc gcgtcgaggc cgaacgtcca 3421 gcccgcggaa atcgcgagcg tcgccagcac gacgagcggc acggtcatca gtcgcgacgg 3481 ctcgtggacg tgctgacgcg cgacggggtc gcgcggcgga ccgacgaacg tcaggaacca 3541 cagccggaag atgtaagcgg ccgtcagcag cgccgccccg agcggcagat agaacaaaac 3601 ggcgtgctgc ggatttcgtt gactgaacgc gagggcctgg gccaggatcg cgtccttcga 3661 gtagtagccc gacgtgccga cgtcgatcag cggcagcccc gcgccgacca tcgcgacgca 3721 ggcgagcagc atcgtcaccg cgaccacctt cagcggcttc agcagcccgc ccaggcgtcg 3781 caggtcttgc acgtgcgtgg cgtggatgac ggcccccgcg ccgaggaaca acaggctctt 3841 gaagaacgcg tgcgtcgtca ggtggaacgt cgcggcgttc cagccgccga cgccgaccgc 3901 gagcgtcatg tagccgagct ggctgatcgt cgagaaggcc agcacgcgtt tgaggtccgt 3961 cgcggtcagc gcgaagagcg ccccgagcgt gagcgtgatc gccccgacgt acgccaggac 4021 gagcagcacc tccgcaggaa acacgggggc cagccgcgcc acgaggtaga cgccggccgc 4081 gaccatcgtc gccgagtgaa ccagcgccga gacgggcgtc ggaccggcca tcgcgtcggg 4141 aagccaggcg tgcaacgggg cctgcgcgct cttgccgacg cagccgcagc agacgcccag 4201 tccggccagc acgagcagtc cgtaattcgc gtcgccgacg cgcgccggaa ccgcccaggc 4261 gaaatggttc gcggccatgc ggaactggct cacgattccc aggcgaccgt cgaggtcgtg 4321 cagctcaagc gtcccgcaga ggttcgccag gatcagaatc ccgagcaaaa accccgcgtc 4381 gccgacgcgg ttgacgacga acgcctggcg ggcggccgac gcggccgagt cgcgctcgcg 
4441 gtagaagccg atgaggctgt aggaggtcgc gccgacgagc tcccagaacg cgaacagcac 4501 cagcaagttg cccgtcaaca ccaggccgaa catgctgaac gtgaagagcg acaggtgctg 4561 aaagaacgcg gcgtatcgcc ccggccggtg caaatgatcg ccgagcggcg agacgatctc 4621 gtggtcctcg acgctcgcgt gcgtctcctc gtgcatgtac gcgatcgcgt agacgtgcac 4681 gcaggtcgcc acgatcccga tcacgcagaa gaagacgatc gtgagcgcgt cgatcgaata 4741 cgagacgtcg aaccgcagcc cgccgaactg ggcgaacgtc agccacgtcc cgcgcaacac 4801 gttcggcgcg tcgccggccc gctcggagcc gacgccgtga tgactcgccc aggccgccag 4861 ggcgacgagc gaagagagca gcgagaccgc gaccgcgatg gtcgagacga gggccggcgc 4921 gcggctgcga ccgcgtcccg ccagccgcac ggcgcaggcg accagggccg cgaagagcgg 4981 cgcggccgcg gccgtgagca gcaaggacgg gatggtctcg atcatcagcg acgcggggcg 5041 tcctcgaggg aaatcgacgg cgacgcggga cttcagccct tgagctggtc gccgcggtcc 5101 acgtcgatcg tggcgtgatt gttgtaaaag ttcaaagcga tcgccagggc cgcggcggcc 5161 tcgccggcgg ccaggatgat gacgaagagc gcggccaggt gaccgtcgag cccgagcgcg 5221 acgtcgacgt cggcccggaa cgacttgctc gaaaacgcca cccagttgac gagcgccgcg 5281 ttgagcacca gttcgacgcc catcaacagg ccgaccgcgt tgcgtttgac ggccatgcac 5341 aacaggccgc agacgaacag gatcgccccg acgatcaggt aatgcgtcgc gccgatcggt 5401 tgcgtgagca ggtccatctc aggcctcgtg cgttcgcgcg gagcgcttgg cccgggcgag 5461 gtaagccgcg ccgatcagga ccacgagcag gtacacgctc gacaactcga acggcagcag 5521 ccaccccgag cggcccggcg cgagcgggtc ctccgcgtcg cggacgcggt cggctcgcac 5581 gccgatcagg gccaggccga tctcggccga cttcacttcc agtcgcggag ccgccagctc 5641 gacgtcggcc cgcgggttcg gccgccagtc ggcgacggag aagatcgcgg ggatcagcac 5701 ggccgcgagc aacagcccgc cgacgcccag cccgagcacc cagtcggccg gccgcgtgtt 5761 cagcgacgcg aacggccccg gcgaggtgag catgacgccg aagatcagca gcaccagcgt 5821 gccgccgacg tagaccatga gctgcgccgc gccgacgaac tcggcccctg cgaggaagaa 5881 caggccgctc gtcccgcaca gggcgaagat caaatagacc gccatgcgga cgacgttgtt 5941 cgacagcacc acggccaggg ccgagccgca ggcgacgagc gcgaacagca ggaagaagaa 6001 cgcgtgcgga ttgaactcgt tcacgtccgc tccccttccc ggctctcgac cgagatcgcg 6061 ggggccgagc gacgggcggt cgccggagcg gccgcgttcg cggggccggg ctcgatcggc 6121 
gcgacgaccg cggggaaccg atgctcggcc gcgggacgcg tcagcccggg ggcgaatccc 6181 aagagtcccg cggggagcag cagttgccag aacgacacgc cgaccagcgt gaagcaggag 6241 atcggcaggc agtacttcag gcacatgtgc atgacctggt cgatccgcag ccgcggcaac 6301 gaccagcgga cccacatcat caccaccatg ccgaacgagc acttgagcag gaagttgacc 6361 aggccgagca tccggcccca gaagccgggc cagccctcgc cggtcaggcc gagccactcg 6421 ccgatcggca gcggtccgct ccaaccgccg cagaagagca ccgcggccag cccggagacg 6481 aggaacatcg agccgtattc ggccatgaag aagtaactcc agcggatgcc ggagtattcg 6541 gtgtggaaac cggccaccag ttcgctctcg gcctccgcca ggtcgaacgg ggcgcggttg 6601 acgctcgcga tcgcgcaggt gaagaacacc cagaacgcga tgaaggtgaa cgggtcgtgg 6661 aacacgtacc agttccagaa gccgccggtc tgctgccggg cgatttcggt caggttcatg 6721 gtccccgcga gcaccaccgg cggcacgacg cacagcccca gcggcacctc gtagctgacg 6781 acctgggcgg cctcgcgcat cgcgccgaag agcgaccact tcgagccgct ggcgtaaccg 6841 gcgaggatca cgccgaacac ttccaatccg agcacggcca acaggaagaa cacggccgag 6901 tcgaggtcct gggcgaccca gccgtcgctg aacggcagcg ccatgtaggc cacgaagccg 6961 gcgcagaagc tcacgtacgg agccagccga aagaggagcg cgtcggcctc gtcgggaaga 7021 atgtcttcct tggtgatcag cttcacgccg tcggcgagcg actggagcca gccgaacatg 7081 ccgccgacgc gggtcgggnn nnnnnnnncc ccagatgaag acgagcgcgc cgaccgcgat 7141 caggttgagg atcagaaacg cctgcacgcc ggcgaccagc acgtcggccg tggccgcggg 7201 caggaacgaa ggcaggagcg agcgaaacgt ttcgagcacg gcttccggcg ggccttcgtc 7261 gcggtcggga ctttcgggag gcttcgcgcc cccggcgggc gtcgcgcgta taaatccgta 7321 tcgacggcgg cggtgcgcgt cgcgagcgcg gggattgtcg tgaaaccccg tttcgtccgc 7381 aagccccggc cgtcgctcgc gcggcgggcc gaatggtcgc agcccgagcg atcgcccgcg 7441 gcggcgtttg ttgccaagcg ccgcaacccg ggatagtctc tagagaaacg acggcgcatc 7501 gcgccccgcg cggccgtctt cgagaaaccg aggtcgagag ttgctcccgt acgatttcga 7561 cgccagcatc ggtttttgga tctgcatggc caaccaggcc gtccagcgcg agctggccga 7621 ccgtctggag ccccaaggaa tcacctaccg ccaggcccag gtgttgggaa acctgatcca 7681 ttcgggcgag atgacgcagg cgacgctggc ccgcaaaatg ttcatcgagc cgcccacgct 7741 cgtcggcatt ctcaatcgga tggagcgcga cggctggatc ggccggacga tctgcccgtc 7801 cgaccgccgc 
aagaaatacg tccgcatcaa tcccgaagcg gccgacgcct ggtcgaaggt 7861 cgtggagtgc gcgtacgcgg tccgggcgaa ggcctcggac ggcctcaccg acgaagagct 7921 cgccacgctc aagcggctcc tggaaaaggt ccgcgccaac ctcggcgcgg gcgtcgaaat 7981 cgccgatctg gccgcctgcg acgactgacc gaatcgcgcg accgcggccg gctcagttcg 8041 acgcgctggc ttctttgaag cggtagccga cgccgcggac cgtctcgatc agctcggcgc 8101 ggtcgcccag cttctgccgc agcgacttga cgtgcacgtc gatggtgcgg tcgagcgagg 8161 ccacgtcttc ccccatgctc gcgtccatca gctcgtgccg cgaaaacgcc cggccgggct 8221 gacgcgccag cgaccagagc aggcggaact cggtgggcgt caggggcagg acgcggccgt 8281 cgagcgtcgc gcggtgccgc cagcggtcga tctcgacccc ctgctccgcg atcgacgcgg 8341 tctgatcctc gcggacgttc gtccgtcgca ggagcgcttt cacccgctgc agcagcggtc 8401 gcaccttgaa cggcttggcg acgtagtcgt cggcccccat ctggaagccg accacctcgt 8461 cgatctcctc gccccgcgcc gtgaccatca cgatcgggat cgtccgcgtg ttcgcgtcgt 8521 cgcggagccg gcggcagacc tcgagccctt cgagcaccgg cagcatcagg tcgaggacga 8581 tcaggtcggg gagcaccgac tgggcccgct gcagcccgct cagaccgtcc gtcgcggtga 8641 ggacttcgaa gccctcgcgt tccaggttgt agcggagagc ttccagcaac gcggtttcgt 8701 cttcgatgac cagaatcttg tgacggctca tgcgatctcg gggcggaatg tacgccgacg 8761 ccacgcgggc cggatcggcc cacgaactca cgacgcgaat caactcgcgt aaatcttatc 8821 catccgttcg gcgatcgccc gcaggcgagt ttcgtcgccg cccggaattt cggccgtgcc 8881 ccagccggtg aagccgattt cctccagcgc ccgcatcacc gccggccagt cgcagtcccc 8941 ttccagcagc tcgacgccga aaccggcccc ggtccctttt tcgcgggcga ccttgcggct 9001 gtattccttg acgtcgagct tgccgatccg cttgcccagc gtgcgaatcc attgctcggg 9061 ccaaccgtag agcacgacgt tccccacgtc gaaataggcc ccgatccact cgctctcgaa 9121 ttcgtcgatg taacgggcgg tttccagcgg gctgagcagg aaattgttcc agacgttctc 9181 gaacaggatc ttgatcttca attcggcggc cagcgggagc gccttgcgga tttcggcctg 9241 cgaccgcgtg tacgcgtcgg cgtacgagac ttccttgttc accaccgccg gcacgagcag 9301 cacgctcgtc gcgccgtacg cgtgggcgtc gcggatcgcg gtttcgaggg ccgccagccc 9361 cttggcccgc acttcttcgc tcgggtgcga cagcgtctct ttccagtggg ccgaatcgac 9421 gaccccgtgg atcggcagcc ccgattcgtc gcgcgccgcg agcacctcgc tcttgtcgaa 9481 cccgttgggc gaatcgagct 
cgatcccgtc gaagccgatc tccttgagga tcttgaactt 9541 atcgacgaga gtcgccccgg cccgcaccat cccgatcttg acggcctttt tgtacttctt 9601 caccggcgcg tcctgagcgg aagcggacga gcccggcccg cccgacgcga gcgccgcggc 9661 cgcgccgacg ccggccgcga ggaaggttcg acgatcgagg ggagacgtca tgaagagtcc 9721 tcgcggctcg cgaacggcgg aaatggctcg cggaactcgt ccgcgggcga aagcatatca 9781 agccgcgcaa acggccggca agccgctcgc ccgcccgcgg ggtaaccgtc gccggccccc 9841 ggtcaacttc ccgtgcggac gcgccgcgcc ggttcgattg accgaccgcc gggcgacgat 9901 tacgctcgac gcatccacga actttgatgg gttagggagc cgatccatgc gacgacgggc 9961 ctcgaacggg cggaacgcga cgccggcggt cgtgctggcg gccctggcct ggagcgcgtc 10021 gctcgcggcc gcggccgaga cgaccgtctt ccccgagtcg atcacgctcg acgacgtctt 10081 cgcccgccgg caggtgctcg tcaccgcggc cgggccgcgc gggccggagg atcgcacctc 10141 ccaggcgaaa tacgtctcgc tcgacccgca cgtcgcgacc gtcgacgaac gcggctacgt 10201 cgcccccgtc gcggtcggca agaccgcgct cgagatcgcg atcgacgggc gaaagctgcg 10261 cgtcccggtc gaggtgaccg gcctcgacgg cggccgtccc gtcgatttcg cccgcgaggt 10321 cgtccccctg ctcacccgct tcggctgcaa cgcggggggc tgccacggca aggcgagcgg 10381 ccagaacggc ttcaagctga gcctgttcgg attcgacgcc gcgtacgatc acaaagccat 10441 cgcgacggaa gcccgcggcc gacgcgtgtt tcccgcggct cccgatcaga gcttgctgct 10501 gctcaaagcg atcggcgcgg tcccgcacgg cggcggccga cggctcgtca aggattccga 10561 tccgtacctc gtgatccgcc ggtggatcga gcagggtttg cccgcgaccg cgcccgacgc 10621 gccgacgatc aagagcgtcc gcatcgagcc gcacgagcgg acgatgcgga agcacgagaa 10681 acagcagctc gcggtgatcg cggaatacag cgacggctcg acccgcgacg tcacgcggca 10741 ggcgcagttc aacaccaacg aaggcgcggt cgcggtttct tccgacgacg gcctcgtgag 10801 cgccaacgac ctgccgggcg aagcggcgat catggccgcg tatctcggtc aggtggcggt 10861 cttcaaagtg ctggtcccga cgggcgagcc gctcgacgcg atccccgaat tcacgcccaa 10921 caacttcgtg gacgagctgg tgctcgcgaa gtggaagaag ctcggcctgc gcccgtcgcc 10981 gacgtgcacc gacgaagagt tcctccgccg cgtgacgatc gacctggccg gccggctccc 11041 gaccgtcgac gaggcgtccg ctttcgcggc cgattcctcc gcgaccaaac gcgaggcggc 11101 ggtcgaccgc ctgctggcgt cgcccgacta cgcggcctac ttcgcgatgc ggtgggggac 11161 gatcctccgc 
aactcgcgac tcgccggcgc cgacaaggcc gcgtacgcgt ttcacaactg 11221 gctcaaggac ctgatcgcgc agaatcgtcc ttacgacgag ctcgtgcgcg gcgtggtggc 11281 cgcgtcgggc gagtggcaag acgcgcccgc gatcaactgg tactggcaga tgcgcgacga 11341 tcagttgcat cagccggtcg gcgacacggc ccaggtcttc ctcggtctgc ggctgcagtg 11401 cgcccagtgc caccaccatc cgtacgaacg ctggagccag g // LOCUS NODE_2958_length_11435_cov_4.52495611435 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11435) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11435) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11435 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1212 /gene="pruA" /locus_tag="DP116_22880" CDS <1..1212 /gene="pruA" /locus_tag="DP116_22880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="L-glutamate gamma-semialdehyde dehydrogenase" /protein_id="PRJNA477356:DP116_22880" /translation="VGKPVREADAEVSEAIDFCRYYASEMEQLDPGYNYDVAGETNRY IYQPRGIAVVISPWNFPLAIATGMTVAALVTGNCTLLKPAETSSVIAAKLTEVLVDAG IPKGVFQYVPGKGSQVGAYLVNHADTHVIAFTGSQEVGCRIYADAAILKPGQKHMKRV IAEMGGKNAIIVDESADLDSAVVGVVQSAFGYSGQKCSACSRVVVLEPVYDTFVQRFV EATKSLNIGETELPSTQVGPVIDANARSRIREYIEKGKAEAKVALEMPAPDNGYFIGP VIFKDVPANGIIAQEEIFGPVVAVIKVKNFKEALEVANGTNYALTGGLYSRTPSHIEK AQDEFEVGNLYINRNITGAIVARQPFGGFKLSGVGSKAGGPDYLLQFTEPRTVTENIQ RQGFAPIEGAD" gene 1309..1749 /locus_tag="DP116_22885" CDS 1309..1749 /locus_tag="DP116_22885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874357.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_22885" /translation="MKDVIVRVADWYKEFSAIQAIRKSVFQEEQGVDPDLDFDGEDET SQQLIASLNGEYVGTARVRYLDDKTAKIERLAVLPVGRGYGIGKKIMEKAMDVIASKN IPEVVVHAQEYIKGLHQQLGFQEEGEVFEEAGIRHVTMRKKLNQ" gene 1833..2495 /locus_tag="DP116_22890" CDS 1833..2495 /locus_tag="DP116_22890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015156386.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase" /protein_id="PRJNA477356:DP116_22890" /translation="MTSLTLVIGNKNYSSWSLRPWLAMKQMGLEFTEIRIPLYQVGSS AQVRRYSPSGKVPVLLHDEITVWDSLAILEYLIEQFPALPWLPLAAKPRAIARSICAE MHSSFANLRQHMPMNIRAYVPGGEVPASVQADITRITTIWHECREAFGMGGNFLFGAF TIVDAMYAPVVTRFITYGVQLDSVCSTYAEAILALPAMQNWIAAAKHETETIESYTNP PN" gene 2503..5064 /locus_tag="DP116_22895" CDS 2503..5064 /locus_tag="DP116_22895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phospholipid carrier-dependent glycosyltransferase" /protein_id="PRJNA477356:DP116_22895" /translation="MTAKDKQNRFNSLLILGIIWLLGALCDRIWFALDRSVPAWDQAD YLTGSLNYWHALQNPQWFNQEWWQSFWLLSSKIPPLTYIVTAIVQNIFGIGPDQATLM MLLFSAILLTSVYGLGTVLFSETVGLWAAALCQILPALYDFRLEFLLDYPLTAVVTLS FFCLTVWRITLNKEKEERGRILPHSPPLSLPHSLPPLLWAVAFGLTFGLALMVKQTAL FFLLTPIVWVGVGALRHRRWGRLLQLLGGLCFSVLVFGPWYRTNWLIILTSGKRATID SAIAEHQPGLDKLESWIYYWNQLPYQVSLPLLFVPLVSLLIWWGRSKFASENPKLETI TSSKANQQNSSLTWLLVFWVGGYFLSCLNINKHERYVLPYLPVLTVLLAYGLTRWRSL FGSRVRWGTFGIAVILMLLNLFPVGGVVGGWVTQVLSPNAQRYPYMKEELPHRQVIAQ IIQTEPHLRSTLGVLPSTPEINQHNFNYYGALQNFQVYGRQVGTRKKYVDQDGRSLEW FLTKTGEQGPVKEAQAAMVKTVEQGGNFQLNKSWNLPDNSQLMLYHQQTPTLEVRQTS QQNSQAKIALSQVIVPEKAPPGVPVPVSYEWSGSWDELQHGLVLLTWKNKNSKWIHDH GIAMGTLHPGAKKPEGTFQVIEKMAMLPPSTVAPGTYTPEAIYLNRLSGESYPVSVPK VTLQIDAQATATQAPELDLVSQLETLGAQLPKGTEAVSQVFEEVGRINQYDAIQDYLV QARLTLEYRLQHFTPNRDWAYALALANVLQRRVEGAIASLKQVTHLDPENPYAHAYLA FVQLYNWQPAAAQKSLEPSLAKNPNLPEFRVLSGVAGLMQGDFVKAWKDLSTLRK" gene complement(5183..6403) /locus_tag="DP116_22900" CDS complement(5183..6403) /locus_tag="DP116_22900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877706.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_22900" /translation="MEHLNSSQQTQRLAGLEGQKPGLDQKVTAITTTDSNSDIQKNSD IQKPVPPSKRNVVPKVLLAVLLGGGAIASATYAYRSWQYNQQYAAKFQETDNAYVTAN VSPITSRVSGIVTEVTVNDNQMVSPRDVLVKIDKGDYQASLTQAKASLELAKQQAELA RENIKKDLLILSAFESNSAPNQPVNREKALQAQTINQQKQVHQQQYKTALAAVAQKQA EVKKAELQLSYTNITPLVVGKVANKNVSVGQQVQAGQSLITIVQPNPWIIANFKETQL EKIQPGQKVTIKIPAFQNREFRGRVDSMSPTSFGKVALPPQENGTAHSSLAQDVQRVP VKIVFEPESIQGYESRMTPGMSAVAVVETNNSLPVQKTPNLTPAQKAPNVTNPKDPKD TKKEEKKDNSTERL" gene 6804..7733 /locus_tag="DP116_22905" CDS 6804..7733 /locus_tag="DP116_22905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874355.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cobalamin-binding protein" /protein_id="PRJNA477356:DP116_22905" /translation="MTNDSVRIVSLIPGGTEILAGLGLTDAIVGRSHECDYPPEIQDR PVCTQARINSSAPSGEINDKVNYLLQSALSIYQINTDVLEKLQPTHIVTQDQCDVCAV SLKDVEEAVATIANTSPQIISLQPNVLKDIWEQIQQLGNVFGVDSLQLIENLEARVKI VDQKTQGLSQTEHLPTVVCIEWTDPLMVAANWIPELVTEAGGQPLFSITGSPSTTFKW ETLISSNPDIIIFMPCGFDLNRTRQEAQLLTQRPEWQKLHAVQSGRVYITDGNSYFNR PGPRLVDSLEILAEILHPEIFQYGYKQKGWELL" gene complement(7745..9529) /locus_tag="DP116_22910" CDS complement(7745..9529) /locus_tag="DP116_22910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316586.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="RluA family pseudouridine synthase" /protein_id="PRJNA477356:DP116_22910" /translation="MMVLHPLSDFIDSNFPVSDSSANYYYQGRCPQSGELLRLPRTPL VEAIAYSLMQHLAIDNRYSSEGKMCGVLLVSLPTGEQRILKAFSGLLNGDSVVEGWVP PIPGRDQVALQEASTLAQLDAIKQELITLKQLPERLQYETLSREFEVRLQQMSDRHRD RKNQRHEKRQILCKTLAGETLHFALEQLNEESRREGIERKQLKRQRNEELKPLQQLIS AADTRIRELKQQRQELSRQLQAQMHAAYTLINFLGQSLSLQQLIPGGMPTGTGDCCAP KLLHYAAKHNFKPLAMAEFWWGSSSTHQDKVQGEFYGACTERCQPLMGFLLSGLTQSK PTTDTSITERTSPSLLATLKSFPLLNKERDARQGRVRFRGSVECLYTGQLLGFDTVYE DEWLIVVNKPPGLLSVPGRYFDTQDSVITHLRHLLPDGTGLVAVHRLDQETSGILMLA RDLQTYRQLSQQFEQRLVRKIYEAVLSDSITTERGVISLPLWGNPQNRPYQQVDWQYG KPSVTHFQMIAVEGNYTRVEFTPLTGRTHQLRVHAADVRGLGVPILGDRLYGCRTAAS RLHLHARNLYFEHPQSGETIHLQAQTPF" gene 9877..10602 /locus_tag="DP116_22915" CDS 9877..10602 /locus_tag="DP116_22915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007630334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22915" /translation="MSNPVKLFFSYSHKDEVLRDELATHLSMMKRQGVIEAWHDREIS AGGEWANAIDDNLNAADIILLLVSANFVDSDYCYDIEMQRAMERHEAREARVIPIILK PVDWSDAPFAKLQGLPKNVKAVTTWQDRDEAFLNIAQGIRRVVEEMAKSKTSSTASET TTPAMTSGELTDRQRRRLKQEEDSLQQQYDRESEKLSRLRQAYSIETDTATKFKLEKQ IQESETELHRLDRQLEEIEQKLV" gene 10624..>11435 /locus_tag="DP116_22920" CDS 10624..>11435 /locus_tag="DP116_22920" /inference="COORDINATES: protein motif:HMM:PF13676.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22920" /translation="MPPMESIEVFISYHQKDEDLREELEKHLASLQREKVITSWSDRK IVAGQEFKGEIDKSLNQAGLILLLVSPDFIASDYHWTVEVTRALEQNAAGKASVIPVL LRYADWETPPIDELSPLPKNRKPIKSWNDRDEAFLEVVKGIREEVKRLVASSNYSPPK HATQELDKRQYQVTSLINEAHRLYEDKNFEEAVLKYKAALRLDPNSVLAHYNLGIALK NQGKLDEAIASYRKSLQIDQNYPSVHYNLGIALYNQGKFDEAIASYRKALQI" BASE COUNT 3110 a 2354 c 2697 g 3274 t ORIGIN 1 gttgggaagc cagtgcggga ggcggatgct gaggtttccg aagcgataga cttctgtcgc 61 tactacgcct cggagatgga acagttagac ccgggttata attacgacgt tgcaggggaa 121 acaaaccgtt atatctacca gcctcgggga attgctgttg tgatttctcc ttggaatttc 181 cctcttgcta ttgcaacggg aatgactgtt gcagccttgg ttacaggaaa ttgtactttg 241 ctcaaacctg cggaaacatc ttctgtgatt gcggcgaagt taacggaagt tttggtcgat 301 gctggaattc ctaaaggtgt ttttcaatac gtgcctggta agggttcgca ggttggtgct 361 tatctagtga atcacgctga tactcatgtg attgctttta caggctcaca agaagtaggg 421 tgtcggattt acgcagatgc ggcaattctt aagcctggtc aaaagcatat gaaacgagta 481 attgcggaga tgggcgggaa gaatgcgatc attgtggatg aaagtgctga tttagactct 541 gctgttgttg gggtggtgca gtcggcgttt ggttacagcg gacaaaaatg ttctgcttgt 601 tcacgggtgg tggtgctgga accggtatat gatacttttg tgcagcggtt tgtggaagcg 661 acgaaatcgt tgaacattgg ggagacggag ttaccgagta cgcaagttgg accagtgatt 721 gatgcgaatg cacgctcgcg cattcgtgag tatatcgaaa aaggtaaggc agaagcaaaa 781 gtggctttgg aaatgccagc accggataac gggtatttta tcggtcctgt tatctttaaa 841 gatgtaccag caaatggaat catagcccaa gaagaaattt ttggtccagt tgtcgcggtg 901 attaaagtga agaacttcaa agaggcgctg gaagttgcta acggtactaa ctacgccttg 961 actggaggac tttactctag aactccttcg cacattgaga aggcgcagga tgaatttgaa 1021 gtcgggaatt tatacattaa ccgcaatatt acgggtgcta ttgttgcacg acagccgttt 1081 ggtggtttta agctttctgg tgttggttct aaggctgggg gaccagatta tctgctgcaa 1141 ttcacagaac cgcgtacggt gacggagaat attcaacgtc aaggttttgc gcctattgag 1201 ggagcagatt aacaattcaa aattaaagta agttagtagg gtgggcagcg ttgcccacct 1261 tattttatta tcgaaccacg aagacgcgaa gggcgcgaag agatttgaat gaaggatgtc 1321 atcgttagag 
ttgctgattg gtataaggag ttttcagcaa ttcaagcaat caggaagtct 1381 gtttttcaag aagaacaagg tgtagatcct gatttagatt ttgatggaga agatgaaact 1441 tctcagcagt tgattgcttc tttaaatgga gagtatgtgg gaactgcgag ggtgagatat 1501 ttggatgaca agactgcgaa gattgaaagg cttgctgttt tacccgtagg tagagggtat 1561 ggtattggca agaaaattat ggaaaaggca atggatgtta tagcgagtaa gaatattcca 1621 gaagttgtgg ttcatgcgca agagtatatc aaaggtttgc atcaacagtt gggttttcag 1681 gaagagggag aggtttttga agaggctggc attcgtcatg tgacaatgag gaagaaatta 1741 aatcagtgaa caactgataa ctgataactg atcactgagt gccaatagaa aactgctgta 1801 aatgtttgca aagtacctca atacacaatc gtatgacttc actgactttg gtcattggca 1861 acaaaaacta ttcctcttgg tcgctgcgcc cgtggctagc gatgaagcag atggggctgg 1921 agtttacaga aatccgcatt ccgctttatc aagttggttc ctcggcgcaa gtgcggcgtt 1981 attcaccatc cggtaaggta cccgtgttac tccacgacga gatcacggtt tgggattcgc 2041 tagcgatttt agagtactta attgagcagt ttccagcttt gccttggtta ccccttgcag 2101 caaaaccgag ggcgatcgct cgttccatct gtgcggaaat gcactccagc tttgccaatt 2161 tgcgccagca tatgccaatg aacattcgcg cttacgttcc aggtggtgaa gtacctgcct 2221 ccgtgcaagc cgatattacc cgtattacaa ccatttggca cgaatgtcga gaagcatttg 2281 gcatgggcgg aaattttttg tttggcgcat tcacgattgt agatgccatg tatgctcccg 2341 tggtgactcg gtttataacc tacggcgttc agcttgactc ggtttgcagc acttatgcag 2401 aagcaattct ggcgcttcct gccatgcaaa actggattgc tgctgctaaa cacgaaacgg 2461 aaacgattga aagctacaca aacccaccga attgaactta agatgactgc taaagataag 2521 cagaataggt ttaatagcct gctgatactt gggataattt ggttattggg ggcgttgtgc 2581 gatcgcattt ggtttgcttt ggatcgttct gtccctgctt gggatcaagc agactattta 2641 actggtagtt taaattattg gcacgcgttg caaaatcccc agtggtttaa ccaggagtgg 2701 tggcaaagtt tttggctgct gtcttccaaa ataccgcctt taacttacat cgttacagct 2761 attgttcaga atatctttgg catcggacca gatcaagcaa ctctgatgat gctgttgttt 2821 agtgcaattt tactaacttc agtctatggt ttgggcactg tgctatttag cgaaactgta 2881 ggtttgtggg ctgctgcact atgccaaatt ttacctgctc tgtatgactt tcgtttagaa 2941 tttctgctgg attatccctt gacagcagtt gtaactttga gtttcttttg cctcaccgtt 3001 tggcgaatca cactgaacaa 
ggagaaagaa gaacgaggga gaattttgcc tcactccccc 3061 cctctctcac tccctcactc tctccctcct ctcctctggg ctgtagcctt cggcttaact 3121 ttcgggttgg cgttgatggt gaagcagacg gcgttgtttt ttttgttgac gccgatagtc 3181 tgggttggag tgggtgcgct acgtcatcgg cgttggggac gcttattaca actactgggt 3241 ggtttgtgtt tttcggtgtt ggtttttggt ccttggtatc gtacgaattg gcttatcatc 3301 ctcacttctg gtaaacgggc gacaattgat tctgctattg ctgaacatca gcctggtctt 3361 gataagctgg aatcttggat ttactattgg aatcagttac cttaccaggt ttcgttacct 3421 ttgttatttg tacctctggt aagtttactt atatggtggg ggcgttcaaa atttgcgtct 3481 gaaaatccaa aactcgaaac tataacctca agcaaagcga atcagcaaaa ctcatcactc 3541 acatggttac ttgtgttttg ggttggggga tattttctct catgtttgaa cataaacaaa 3601 catgaacgtt atgtcttgcc ttatttacca gtactgacgg tgcttttagc gtatgggttg 3661 acgcgttggc gcagtctgtt tggaagccgt gtccgttggg gtacttttgg tatagcagtg 3721 atattaatgt tgcttaacct gtttccggtg ggaggtgttg ttggtggttg ggtgacacaa 3781 gttttgagtc ccaatgctca acgttatcct tatatgaaag aggagttacc ccatcgacag 3841 gtgattgccc aaattatcca aacagaaccg catttgcgtt ctactctggg agtgttacca 3901 tcaacaccag aaattaacca acacaatttt aattattacg gggcgctaca aaactttcaa 3961 gtttatggac gtcaggtggg aaccaggaaa aagtatgtcg atcaggatgg gcgatcgctt 4021 gagtggttcc tcacaaaaac aggtgaacaa ggaccagtca aagaagccca agctgctatg 4081 gtaaagactg ttgagcaagg tgggaatttt caactcaata agtcttggaa tttgcctgat 4141 aacagtcaac tcatgcttta tcatcaacaa acgccgacac tagaagtcag acagacttct 4201 caacaaaact cacaagccaa aattgcccta tcacaagtaa ttgtaccaga aaaagctccg 4261 ccaggagtac cagtacctgt aagctacgag tggtctggtt cttgggatga attacaacat 4321 gggttagtgc tgctaacttg gaaaaataaa aattcaaaat ggatacacga ccacggtatc 4381 gccatgggaa ctttgcatcc tggtgcaaaa aaacctgaag ggacatttca agtgattgaa 4441 aagatggcga tgttgcctcc ttctactgta gcaccaggaa cttatacccc agaggcaatt 4501 tacctcaaca gactatcagg agaaagttat cccgtctcag tgccaaaagt aacactacaa 4561 attgacgctc aagccactgc tacacaagca ccagaattgg atttagtgag tcaattagag 4621 actttaggcg ctcaattacc taaaggcact gaagcggtga gtcaagtttt tgaggaagtt 4681 ggacgtatta atcaatacga tgcaattcag 
gattatttgg tacaagcaag gctgacttta 4741 gaatatcgat tgcaacattt taccccaaac cgagattggg catacgcttt ggcgttagca 4801 aacgtgttgc agcggcgagt ggagggtgcg atcgcatctc tgaaacaagt cactcactta 4861 gatccagaaa atccttatgc tcacgcttat ctagcatttg ttcaactata caactggcaa 4921 ccagcagcag cccaaaaatc tttagaaccg agtttagcca aaaatcccaa tttaccagaa 4981 ttcagagttc tttcaggagt tgctggtctt atgcaaggcg attttgtcaa agcatggaaa 5041 gatttgtcaa cattgaggaa gtaaagaaag caggattgtg gaaacttatt ggcacagatg 5101 tcaattgatt tcacgttcac tccctcactc cctcactccc tcactccctg tctcaaacgt 5161 agtctgttga caccggagta tgttaaagtc tctcggtgct attatctttt ttctcttctt 5221 tcttcgtgtc cttagggtct ttggggttcg ttacattagg tgctttttgg gcgggagtta 5281 aattaggcgt tttttggacg ggaagagagt tattcgtttc gacaaccgca accgctgaca 5341 ttccaggagt catgcgagat tcataacctt gaatactctc aggctcaaat actattttga 5401 ctggaactcg ttgtacgtct tgagcaaggc tggaatgagc tgtgccattt tcttgtgggg 5461 gaagagcaac cttgccaaag gaagtggggg acatactgtc taccctaccg cgaaactcgc 5521 gattttggaa agcagggatt ttaattgtca ctttttgccc aggctggatt ttttctaatt 5581 gagtttcttt aaaattggca ataatccaag gatttggctg cactattgta ataagacttt 5641 gtcctgcttg tacctgctgc cctacagaaa cgtttttgtt agcaactttt ccaacaacta 5701 gcggtgtaat attggtgtag gatagctgaa gttcagcttt tttaacctct gcctgctttt 5761 gggcgacagc agctagcgct gttttgtact gttgctgatg tacttgtttt tgctgattta 5821 tggtttgtgc ttgtaatgct ttctctctgt taactggctg gtttggtgct gagttggatt 5881 caaaagcact aagaatgagg agatcttttt tgatattttc ccgtgctaat tctgcttgct 5941 gcttagctaa ttctagagat gcttttgcct gtgtcagaga tgcttggtag tcgcctttat 6001 caatcttcac caatacatcc cttggagaca ccatttggtt atcattgact gtcacttcgg 6061 tgactattcc cgatacgcgg gaagtgatcg gggaaacatt tgcggtgaca taggcgttat 6121 cggtttcctg aaacttcgcc gcatactgct ggttatactg ccatgaacgg tatgcgtaag 6181 ttgcggaggc gatcgctcct ccccctaaca acactgctaa cagcactttc ggtaccacat 6241 tgcgcttaga tggcggtaca ggtttttgga tatcgctgtt tttttggata tcgctgttag 6301 agtctgtggt agtgatagct gtcacctttt ggtctaaacc tggtttttgt ccctctagtc 6361 ctgcaaggcg ttgggtttgt tgtgaagaat ttaggtgttc 
catcagcgac cttcattatt 6421 acaaattatc aatagtttta cttgaggcac ggtagagata catttatttg ttggtcgttt 6481 tgattgatgc tcttattgta taaactgttc cccacagcat tgcacgtgga gattccataa 6541 ggaattcaag aattcccata caccccgaat aacttatcaa ctcaaatatt gcatcagtat 6601 atattttatc taaataatgc cagttggctt gagtttacag tcgttagcac ccgatggctt 6661 gaaaggttga ttcagaagac tcaatggtta tccaatttat aacctatttc cccagtaccc 6721 cttaggcatt gacctagaga aagattcctc agttgtagaa ttttgccaaa attgaggtat 6781 gacagctgtc gtggaggtag aaaatgacaa atgacagtgt aagaatagtc tccctcattc 6841 ctggtggaac agaaatttta gcaggtttgg gtttgacgga tgcaattgtt gggcgatcgc 6901 acgaatgtga ttatccccca gaaattcaag accgccctgt ttgtactcaa gcacgtatca 6961 atagtagcgc cccaagtggt gagattaatg ataaggttaa ttatctgttg caatctgccc 7021 taagtattta tcaaattaac acagatgttt tagagaaatt acaaccaact cacattgtta 7081 cccaagacca atgtgatgtc tgtgctgtta gcctaaaaga tgtggaagaa gcagttgcaa 7141 ctattgcaaa tacctctcct cagattattt ccttacagcc aaacgttctt aaagatattt 7201 gggagcagat tcagcagctg ggtaacgttt ttggggtaga ctcgctgcaa ttgatagaaa 7261 atctggaagc acgggtgaaa atagtagacc aaaaaacaca aggtctttcc caaacagaac 7321 acctcccaac tgttgtttgc attgagtgga ctgatcctct gatggttgct gctaattgga 7381 ttcctgaatt agtaaccgaa gccggaggac aaccattatt tagcatcaca ggttcgcctt 7441 ccactacttt caaatgggaa acactcattt ctagcaaccc ggatatcatc atctttatgc 7501 cctgtggctt tgatttaaat cgtacccgtc aagaagccca actactaact caacgtccag 7561 aatggcaaaa gttacatgct gtccaaagtg gcagagttta catcactgat ggtaattcct 7621 acttcaatcg tccaggacca cggcttgtag attctctaga gattttggca gaaatcctgc 7681 atccagaaat ctttcaatat ggatataaac aaaagggttg ggaacttttg taggtttgtg 7741 tgagctaaaa tggcgtttgt gcttgtagat gaatggtttc tcccgactgc ggatgctcaa 7801 aataaaggtt cctcgcatgc aaatgtaacc gacttgctgc tgtacgacat ccataaaggc 7861 gatcgcccaa aatcggtacc ccaagtcctc tgacatcagc agcgtgaact ctcaactgat 7921 gggtacgtcc tgttagtggc gtaaactcga cgcgagtgta gtttccctca actgctatca 7981 tctgaaagtg agtcacgctg ggttttccat actgccaatc aacttgctga taaggacgat 8041 tttgaggatt tccccatagt ggcagtgaaa tcacaccccg ctcagtagtg 
atagaatcgg 8101 aaagtacggc ttcgtaaatt ttgcgaacca accgctgctc aaactgctga ctcagttgac 8161 gataagtttg caaatcgcga gctaacataa gtatgccaga tgtttcctga tccaggcgat 8221 gcacagccac aagccctgtc ccatcaggta acagatgacg caaatgagtg ataacactgt 8281 cttgagtgtc aaaataacga cctggaactg acaggagtcc tgggggtttg tttacaacaa 8341 ttaaccattc gtcttcataa acagtgtcaa atccgagtaa ttgcccagta taaagacact 8401 ccacactccc tcggaacctc accctgccct gtcgggcatc cctctcctta ttaaggagag 8461 ggaaagattt tagcgtagct aaaagcgagg gtgaggttcg ctcagttatg ctagtatctg 8521 tagttggttt gctttgagtc aatcctgaca gcagaaaccc catcaatggc tgacagcgct 8581 ctgtacaagc gccgtagaat tctccttgca ctttatcttg atgagtagaa gaggaacccc 8641 accaaaattc tgccattgcc aatggtttaa aattatgctt tgctgcatag tgaagtagct 8701 ttggtgcaca acaatctcct gtgccagtgg gcatacctcc tggtattaat tgctgcagtg 8761 atagagattg tccgagaaaa tttatcaggg tgtaagcagc gtgcatctgc gcctggagtt 8821 ggcgggatag ttcttgacgc tgttgtttca attcccgaat tcgtgtatct gctgctgaaa 8881 tcaactgctg aagcggcttt aactcttcgt ttcgttggcg ttttagttgc tttcgttcaa 8941 ttccttctcg acgactttct tcattgagtt gttcaagggc aaagtgaagt gtttctcccg 9001 ccaaggtttt gcagagtatc tggcgttttt cgtgtcgttg gtttttgcga tcgcgatggc 9061 gatcgctcat ttgttgcagg cgcacctcaa actcacgaga tagcgtttca tactgcagcc 9121 gttctggcag ttgcttgagg gttatcagtt cctgcttgat agcatctaat tgagctaatg 9181 tgctcgcttc ctgtaaagca acttgatctc gtcctggaat gggcggtacc cagccctcaa 9241 ctacgctgtc gccattcaga agaccagaga aagctttgag tatcctttgt tccccagtag 9301 gcagtgaaac cagtaatacc ccgcacattt tcccttcaga ggaataacgg ttatcaatgg 9361 caaggtgttg catcaaactg taggcgatcg cctctaccaa aggagtgcgc ggtagtcgca 9421 gtagttcccc actttgagga caacgccctt gatagtaata gttggcagag gagtcgctga 9481 ctgggaaatt agagtcaatg aaatctgaaa gcggatgcag aaccatcata gaaggtattg 9541 ttggaaatct gtttgttctc catatttgaa attattacgc tctactcaac tgaaaactgc 9601 tgtatgcgtt ccggtgcatt gcttgtgtat ttaataaaaa cgctacaatt ttacacaaca 9661 ctgaacccat aagaattttc tgtcaagctt aagacttacg caaaagagat ccctcaacag 9721 agttaaagag ggggctttta agattccctc cttttgaagg gaagctaggg ggtatcgagg 
9781 tttcgagttt tcagtgccta agtcctgaag ctttcatctc gggaactcag cctatatatt 9841 catatacatg aactcaataa agtagatttc aaaaatatgt ccaatccagt taaactattt 9901 ttttcctact cccataaaga tgaagtattg cgagacgaat tggcaactca tctgagcatg 9961 atgaaacgtc agggagtgat tgaagcttgg catgatcggg aaatcagtgc tggtggggaa 10021 tgggcaaatg cgatcgatga caatcttaac gctgcagata taattttatt gctggtgagc 10081 gctaactttg tggattcaga ttactgctac gatattgaga tgcagcgagc aatggaacga 10141 catgaggcaa gagaagctcg cgtgattccg attattctca agccagtaga ctggagtgat 10201 gcaccttttg ccaaactgca aggacttcct aaaaatgtca aagctgtgac aacttggcaa 10261 gacagagatg aagctttttt gaatattgct caaggaattc gcagagtcgt tgaagaaatg 10321 gcgaaatcaa aaacctcttc aaccgcttct gaaactacta caccagcgat gacgagcggt 10381 gaattaacag ataggcaacg tcgcagattg aagcaagaag aagattcact ccaacaacag 10441 tatgaccgag agagcgaaaa attaagtcgg ttgcgccaag catactcgat tgaaacagat 10501 acagcgacaa agtttaagct agagaaacaa attcaggaaa gcgaaacaga actacatcgg 10561 ttagatcggc agcttgagga aatcgagcaa aagcttgtgt agttatctta atatatataa 10621 ataatgcccc ctatggaatc tattgaagtt tttatttcct accaccaaaa ggatgaggac 10681 ttacgcgaag agttagaaaa gcatctagcg tctttgcagc gggaaaaagt tattaccagt 10741 tggagcgatc gcaaaatcgt cgcggggcaa gaatttaaag gtgaaattga taagtctctc 10801 aaccaagctg ggcttatcct gctgttggta agtccagatt tcattgcttc agattatcac 10861 tggacagttg aagttacacg agcattagag caaaatgcag caggaaaggc tagtgttatc 10921 ccggtgttac tgcgttatgc agattgggaa actcctccca ttgatgaact gtccccactg 10981 cctaaaaatc gcaaaccgat aaaaagctgg aatgaccgag atgaagcatt tttggaggtt 11041 gtcaaaggaa ttcgcgaaga ggttaagcga ttagttgcta gctcgaatta ctcaccaccc 11101 aaacacgcta cacaagaact ggacaagcgc caatatcaag ttacaagtct gataaacgaa 11161 gcacatcgtt tgtacgaaga taaaaacttc gaggaagcag tcctcaaata caaagcagcc 11221 ctccgtcttg atccaaactc tgtactcgct cactataacc tggggattgc gctgaaaaat 11281 caaggcaaac tcgatgaagc gatcgcctct taccgtaaaa gcttgcaaat cgaccaaaat 11341 tatccaagtg ttcactataa cctggggatt gcgctgtaca atcaaggcaa attcgatgaa 11401 gcgatcgcct cttaccgtaa agccttgcaa atcga // LOCUS 
NODE_2970_length_11365_cov_5.250928 11365 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11365) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11365) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11365 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 143..841 /locus_tag="DP116_22925" CDS 143..841 /locus_tag="DP116_22925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867607.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tungstate transporter permease" /protein_id="PRJNA477356:DP116_22925" /translation="MNTIIEGAIKAFELLTTGNSDVFQVMTMTLLVSGTATAISVCLG LPLGLWLALVDFVGKQVLTSLVNFGMGLPPVVVGLVVSLFLWRSGPLGNFDLMYTPTA MILAQAIIAFPIVAGFSFAAILTINPKLRLRLLSMGATEWQANWLLIKEARLGLMAAI IAGFGRVISEVGASMMVGGNIKGQTRVLTTAIVTEVEKGNYDVALAIAFILLIIAYSI IVSLTLLQYDKKIL" gene 838..1992 /locus_tag="DP116_22930" CDS 838..1992 /locus_tag="DP116_22930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017322645.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_22930" /translation="MNMMTEFKSFVNLQSYPVARQENLTRMNEYVLNMQQVNVDSKKR KNILSIENFAVRPGELVAVLGPNGAGKSTLLRTINLLQPYRGQMQLFGQDVCHTNKTL LRRRSALVFQETLLLNDTVFNNVAKVLEFRGIPANKIKQKVHTALATFGCEHLANRSA RSLSGGEAKRVCIACGLVADSELLLLDEPSASLDVGIRAEIIEKIRQSAQARGSAVIL VSHNFTDILHFAERAIALFDGCIVQDDKLEMLIRRPANEQLARLVGMDNIIPCRVERG SRGYFIKLANSIEFLYPGEVRTPITACCLSGDAFYIDDTTSSVPYQPGMVTIKGRVER IVRGIGIYTIWVKVGEQTLIARVPQSHISGNVYQHEIIKLAFHPRDAHFI" gene complement(2036..2734) /locus_tag="DP116_22935" /pseudo CDS complement(2036..2734) /locus_tag="DP116_22935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308086.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="amidotransferase" gene complement(2792..3376) /locus_tag="DP116_22940" CDS complement(2792..3376) /locus_tag="DP116_22940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_22940" /translation="MTKPRAIRKSVHDRILNTASDLFYREGIRNVGIDRIIAESGVAK MSLYNHFKSKDALIEAWLRQQDEQWCQWLKTTIEQRTSDPAKQLLAIFDALREWFEGP DFRGCAFINASVELANPDHPGHRVALEHQQSIYHYIKSLAQSAEVSSPEQLARQLLLL VQGAIVVAMMEGSCSTASQAKKAATMLIQTASKL" gene 3438..4286 /locus_tag="DP116_22945" CDS 3438..4286 /locus_tag="DP116_22945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308088.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADPH-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_22945" /translation="MTNPIELMRSAPMRSALLSRYGEIPFNGEIAWNDFLSTIISHRS VRSYLSDPLPPLTLELLVAAAQSAATSANLQSWSVVAIEDQERKEELSRLAGGQAHIK EAPLFLVWLADLARVARVAESRGLSHDALEYLELLIKAIVDASIAAQNATLAAESLGL GTGYIGAIRKSTQEVATLLNLPPLVFPVFGLCVGYPNPEVQTAIKPRLPQSAVLHRET YKLGEQDEAIAHYNDIIKNFYTEQKMNVAGDWSQHSAERIATLDYLKGCKYLREALNN LGFKLL" gene 4406..4774 /locus_tag="DP116_22950" CDS 4406..4774 /locus_tag="DP116_22950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488817.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="addiction module toxin RelE" /protein_id="PRJNA477356:DP116_22950" /translation="MSWVVEFHQDFEPEFDALPEEVQNKLLARANLLEAFGSELGRPN VDTLNGSRHSNMKELRFKAADGVWRVAFAFDPRRHAILLVAGDKSGSSESRFYKQLIK TADARFDAHLAQLKTGQEEK" gene 4784..5131 /locus_tag="DP116_22955" CDS 4784..5131 /locus_tag="DP116_22955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015179995.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_22955" /translation="MGRILKEKLEQLSPERRKKIEEESSLLIAEEMTRQQLRLALKLT QEQMAELLQIDQGNVSRLEQRTDLMLSTLRKYIVAMGGELRLVVEFPNRPPVTLVGFS EIEESEEGAIASE" gene 5626..5907 /locus_tag="DP116_22960" CDS 5626..5907 /locus_tag="DP116_22960" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22960" /translation="MSDDFEGGVGGEGGDLNSLLLKLVVKNAVKAGKKALDSKCSQCW EMFNVGDTRCCSSRMCLNCIQKHITQERFLFLRSTYFSCPNCSEKLKLS" gene complement(6546..6755) /locus_tag="DP116_22965" CDS complement(6546..6755) /locus_tag="DP116_22965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016516502.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_22965" /translation="MWQDEILDEIHKIREEHAKSFNYDLDAMFADWQKKQAESGREIV SLPPKHALTMRWSRPSKGDDFSVQE" gene complement(6759..7043) /locus_tag="DP116_22970" /pseudo CDS complement(6759..7043) /locus_tag="DP116_22970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015784984.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DNA-binding protein" gene complement(7062..7838) /locus_tag="DP116_22975" CDS complement(7062..7838) /locus_tag="DP116_22975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017288274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="flagellar assembly protein H" /protein_id="PRJNA477356:DP116_22975" /translation="MYDDTCRFLAENFSADFASWLLGESMTLTELKPSELSLEPIRAD ALILLESDDSVLHLEFQTRPKRDIPFRMLDYRVRVYRRYPDKTMRQVVVYLQPTGSDR VRQTSFSLERTRHEFDAVCLWEQPVTLFLQYPGLLPFATLSQTTDPEATLRSVAQTID QISDPITQANLTAASAILAGIKLEEDVIYRLLRRDIMQESVIYRSIQEEAEARAEARK QREIAVNLLRQGVTINIIASATGLSIEEVQQLQQQITESP" gene complement(7881..9302) /locus_tag="DP116_22980" CDS complement(7881..9302) /locus_tag="DP116_22980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320718.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase associated protein N" /protein_id="PRJNA477356:DP116_22980" /translation="MHLSHSLIISKNSTIVNAISFNKEVKMSAGLTFDNCDHSKDPIV GCALEGIANMVAGMKDVSIVIHSPQGCASTVAAGYDNHEVDFTKRKVGCTRLFESDIV MGASDKLKGMIKEADQSFHAKVMFVVGTCAADIIGEDIEGLCKNIQPQVNAKLVPILA GGFRGNAYDGLEMGLEALLPFIKKRQTKRRGRKPRIVNIIAPQANVNPTWWADLHWIT QTLKSLRIKVQTVFSHNSSFAELEQAGDATANILLSHDVGYKFARKMQQTHDIPLILD DIPLPIGVKNTTRWLQALAAHFKIEERVEPLIKQGEEMVVDTLRRRALMIIPRYRNCR VAVSTDGTLGIGLVRMLFEELEMIPEVLLFRSAMRESRPILEQELQSLGLSPSVVFSA DGYQIKQALTEIDTDAVFGSAWEKYIAEELGIQISFDVFSPTNRETYLDKPYFGYEGM INMMEVVANDWDRAFRSKHIHWT" gene complement(9686..11215) /locus_tag="DP116_22985" CDS complement(9686..11215) /locus_tag="DP116_22985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867600.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase associated protein E" /protein_id="PRJNA477356:DP116_22985" /translation="MATSIDGNVVFYGNLSELYRLAKEGKIKTSLQGSHTRPCKFWTA TKILSGIKDAIVISHGPSGCAYGVKRAYKLTNSRNSGSPYEPVVTTNMSEKSVIFGGE KELRGAILEVDQKYNPDAIVVATSCASGIIGDCVDDVVNKARTEIQSEIMTIHCEGFA GEYRSGFDIVFRQIVDFMEPPTPERKAQLADAVNIVGAKMGPERTEVETDVKELVRLI EGMGARVHSVIAGDCTLDELKRAPSAAVNCTLCLDLGYTIGKAMSDKFGTPLNSTILP YGISATEKWLQGAAKYLGMEEQADALMKQEYAAIKTEFEEAKKYIEGKLAIIEGHDAI KCLSIAHMLERDFGMRAVIYNFHPWSTEARETSVDYLLETGLDPEILITKGTLAFGKY ESMKQTEDELLNFIGGLDAESVVYFGSSLSFPHIPVVDLNAILNRPRFGYRGALKVAK CIKTALQYGFRPRSSLTKQMVFPKNSGLASAQSLTGKLAQDLPDCTVYAEKRRGKCCQ D" BASE COUNT 3206 a 2465 c 2388 g 3306 t ORIGIN 1 aggacatgaa ctaatgagaa aatacggaca ccaccagacc atcaagccta ttcccctatt 61 ccccattccc tgttccctgc tataagagca ttaaggattg cccaactcca tatttcggta 121 ttcttttaca gtttttgaac agatgaatac gatcattgaa ggagctataa aagcctttga 181 gctattaacc actggcaata gcgacgtttt ccaagtgatg acgatgacat tattggtatc 241 tggaacagct acagctatta gcgtgtgcct tggcttaccg ttaggactat ggctagcttt 301 agtagatttc gtcggcaagc aggttttgac tagcttggtt aactttggta tgggattacc 361 gccagtcgta gtcggcttag tggtcagtct gtttttgtgg cggtctggac cgttgggaaa 421 ctttgacttg atgtatacgc ccacggctat gatcctagcc caagcaatca ttgctttccc 481 aattgtagct ggtttcagtt ttgcagcaat tctcactatt aatccgaaac tgcgcttgcg 541 actgctatca atgggggcga cagagtggca agccaattgg ttgctgatca aggaagcacg 601 gttaggatta atggctgcta taatcgctgg ttttggtcgc gtcatctctg aagttggcgc 661 atccatgatg gtagggggca atatcaaagg gcagacaaga gttctgacca cggcgattgt 721 tacggaagtg gagaaaggga attacgatgt ggctttggcg atcgccttca ttctactcat 781 catcgcatac agcattatag tatcgctgac tctcttgcaa tacgataaga aaatcctatg 841 aacatgatga ctgagtttaa atcgtttgtt aatctgcaat cctatcctgt agcacgacag 901 gaaaatctta cgaggatgaa cgagtatgtt ctgaatatgc agcaagttaa cgtcgatagc 961 aaaaaacgta aaaatatcct atcaattgaa aattttgcag tccgtccagg agaacttgtc 1021 gctgtacttg gtcctaatgg cgctggtaaa agcaccctgt taagaacgat caatctgttg 1081 cagccttacc gtggtcaaat gcaattattt 
ggacaggatg tttgtcatac caataaaaca 1141 ttattgcgtc gtcgttctgc cctagtcttt caggagacat tactgctgaa tgataccgtg 1201 tttaataatg ttgccaaagt cttagaattt cgaggcatac cagcaaacaa gattaaacaa 1261 aaagtacata cagcactcgc cacttttggc tgcgaacacc tggcaaatcg gtcagctcgt 1321 tccctgtccg gtggtgaggc aaagcgagtt tgtattgcct gcgggttagt cgctgattca 1381 gaattgttgc ttttagatga gccttctgcc tctttagatg tgggaatacg tgctgagatt 1441 atcgagaaaa ttagacaatc ggctcaagcg aggggatcag cagtcatatt agtcagtcat 1501 aactttaccg acatactgca ttttgcagaa agggcgatcg cattgtttga cggctgtatt 1561 gtacaggatg ataaattgga aatgctcata cgccgaccag cgaacgagca acttgcaaga 1621 ctagtaggaa tggataatat tattccgtgc cgggtagaac ggggcagccg tggttatttt 1681 atcaagctcg caaacagtat agagttttta tatcctggtg aagtgagaac gccaattact 1741 gcctgttgcc tgtctggtga tgctttttat attgatgata caacttcttc agttccatac 1801 cagccaggaa tggttactat caaagggcga gtggaacgaa ttgtacgtgg catagggatt 1861 tatactattt gggttaaggt gggagaacaa actttaatcg ccagagtgcc tcaaagtcat 1921 atatctggga atgtctatca acatgaaatt attaaactgg catttcatcc tagagatgca 1981 cattttattt agaaaaaata ggcgatttta tacaaggaag aggagttata aataattagt 2041 cattaaaagc agcaaaatac tctaaaattt tacacatggc agtgtttata ttcctaaaat 2101 catcatctcg tgcgagcatt tattccggtt tatgaatata tttgccagca actaattctg 2161 aagcacaatt ttcaataatt tggcgaacgc tgtctttagt tgactctagg tgaaattgta 2221 aacctaggac tttctcacca taaagaaaag cttgatttga acagacttca ctataggcta 2281 gtcgtacagc acctggaggt aagtcaaaag tatcaccatg ccaatgaaac acagtgaaag 2341 gctcagtaag tgacgcgaaa acaggataat tttgtgcttc tgttgttagt ttaatcggat 2401 accaacctat ttctttttcc tgacctttgt acaccttaga acctaaaaca tcagcaatta 2461 gctgcgagcc aagacaaatg ccaatcacta ctttattttt cttgattgct tctgcaacaa 2521 attttttttc cgaagtcaac caaggatact tatcatcctc ataaatattc atcggactac 2581 ccattactat cacccaatct aggtcatcaa cagatggtaa cgtgtcaccg ttgtaaaact 2641 tagttgcgga aataatctta ccctgctgta tcgcccactg ttcaatgcta gcaagtcctt 2701 caaatgacac gtgttgtaag taatgaattc tcattgcttt cccccattgt ttatcaatcc 2761 tccatgataa accttatgca atccctgatt ctcacaattt 
tgatgcagtc tgaattagca 2821 ttgttgctgc ctttttcgct tgtgatgcgg ttgagcaact tccttccatc atcgcaacca 2881 caattgcacc ttgcacaagc agaagcagtt gtcgggctaa ctgttccggt gatgaaactt 2941 ctgctgattg agcaaggctc ttaatgtagt ggtagatcga ttgttgatgc tctaatgcca 3001 ctcgatgtcc aggatggtca ggattggcta attccactga ggcattaatg aatgcacaac 3061 ctcggaaatc tggtccctca aaccattccc gtaacgcatc aaatattgct agcagttgtt 3121 tggctggatc agaagttcgt tgttcaatcg tggtttttag ccattgacac cattgttcat 3181 cctgttgtcg cagccaagct tcaatcaatg catcttttga cttgaaatga ttgtagagcg 3241 acattttagc aacgccagat tcggcaatga ttctgtcaat gccaacgttc cggattccct 3301 ctcgataaaa caagtctgat gccgtattca ggatgcgatc atggacagac ttacggattg 3361 ctctcggttt agtcatttaa atcacatcct ttttttcttt gaaatcgtaa aaaagctttg 3421 gagtttaaaa gttaactatg actaatccta tagaactaat gcgatctgcg ccaatgcgca 3481 gtgctttgct atcgcgctac ggtgaaattc ccttcaatgg cgaaattgcc tggaacgact 3541 tccttagtac aataatatct caccgttcag ttcggtctta tctatctgac cctctaccac 3601 ccttaaccct agagttgtta gttgcagctg ctcaatctgc tgcgacttct gccaatctcc 3661 aatcctggag tgtggtagcg attgaagatc aagagcgcaa agaagaatta tccagactag 3721 caggaggaca agcccatatt aaagaggctc ctttattctt ggtttggtta gcagatttgg 3781 ctcgtgtggc tcgcgttgct gaaagtcgcg gactatctca tgatgctcta gaatacttag 3841 aactgttgat taaggcgatc gtcgatgctt cgatagcagc gcagaatgcc actctcgctg 3901 ctgagtccct tggtttggga acaggatata ttggcgcaat ccgcaaaagt actcaagagg 3961 tagcaacgct actgaatttg ccacctttgg tgttccctgt gtttgggttg tgtgtgggtt 4021 atccaaatcc tgaagtacaa acagccatta agccacggtt acctcagtca gctgtactgc 4081 atcgagaaac ttataaattg ggagagcaag atgaggcgat cgctcactac aacgacatca 4141 tcaaaaattt ttatactgaa caaaagatga atgtcgctgg tgattggtca caacactcag 4201 ccgagcgcat cgcaacttta gactatctca aaggatgcaa gtacttgcgt gaagctctga 4261 ataacttagg tttcaagtta ttataaccca atacgctaat ggaaagcaag ctgcgcgtag 4321 tctgtcgtaa agcgacttag cttttcaaag caattacaaa gctttaatat acccttgaaa 4381 acatatgttt ttgagtgcat aattaatgag ctgggttgtt gaatttcatc aagattttga 4441 gccagaattt gatgctctac cagaagaggt gcaaaataag ttacttgccc 
gtgccaatct 4501 attggaagct ttcggttctg aattgggacg acctaatgtt gatacgctta atggttcgcg 4561 gcatagcaac atgaaggaac taaggtttaa agcagcagat ggtgtttggc gtgtcgcgtt 4621 tgcattcgac cccagacgac atgcaatatt actagtcgca ggtgacaaat caggtagtag 4681 tgaaagtcgt ttttacaagc aacttatcaa aacggctgat gctagattcg acgcgcactt 4741 agctcaatta aagactggac aagaggaaaa gtgaggataa tctatgggga gaattttgaa 4801 ggaaaaacta gaacagttgt caccggaacg acggaaaaag attgaagaag aatctagctt 4861 gcttattgcg gaggaaatga ctcgacagca actgagactt gcactcaaac tcactcaaga 4921 acagatggca gaacttctgc aaattgacca agggaatgtt tctaggttgg aacaacgcac 4981 tgatttaatg ctctcaactt tgcggaagta cattgtagca atgggaggag agttacggct 5041 tgttgttgaa tttccaaacc gtccgcccgt caccctagta ggtttttcag aaatcgaaga 5101 atcagaggag ggagcgatcg cttcggaatg agttagagtc tacgccagtt gcgacaaagg 5161 acggaacctt tgagggcgct tatgctcgct ccccaagtga tgttttacta cagttcaacg 5221 agcgatcact tcttaactta acagtgaaca acaggtttat aatacaatac ggttcagtta 5281 aggataactg taggttgggt tgaacgtagt gaaacccaac aaactcctat aaatgttggg 5341 tttcgttcct caaacgctct taccctctgt cgggaaagcc cttcgggtat gcctacggca 5401 cgcctgacgg ctaacgccag atgcctgcgg agggtaaccc tcccgcagca ttagtctcac 5461 cgtcattcgc gctggcgacc ctacctacac agtttaaggt ttttggctct aactgaactg 5521 tattggttta taatagacac gaatgcgact gcgctttaag ttgttagcca tttgatctgt 5581 cggcggctag gttttgaaat ccatgcgtct tgaggagaac tctttatgag cgatgatttt 5641 gaaggtggag tcggtgggga aggtggggat ctaaactcac tattgcttaa gttggttgtc 5701 aaaaacgctg taaaggctgg gaaaaaagcg ctggatagca agtgttctca gtgttgggaa 5761 atgtttaatg tgggtgatac gcgctgttgc agttcgagaa tgtgtttgaa ctgtatacaa 5821 aaacatataa cacaagagcg gttcttattt cttcggagta cttatttttc atgtccgaac 5881 tgtagtgaga aattaaagct atcctgattg agatcgacgg gtgggcaatt gtccacctca 5941 ctttttgtgt agcatatcat tgtgaaatga gcagaagtct cgtgtgttaa gcgcgcaggc 6001 taatatactg ctgaaccaag acttcaaaga gtttatatca ttattaatgg cgatcgctct 6061 cactcatatt atgattagtc ctcaaataat catcaatata attacatcct ttaacaacta 6121 aatcatctaa atcaaaccct gatcacagag aatgtagagt taccacctaa tactactgtt 
6181 gaataaccaa tgatactacc atagtaaagc gatcgccctt catcataata agcgaagagc 6241 gatcgcctaa aatcatcggc taataaagaa ggcgatcgca tgaagtaaga agttaacttg 6301 aaatatttcc taactcccca ggtgacacta cggaaatttt cccactcacg aacccagaaa 6361 tatcacgggt aacaattaca tctaatcctt gcgttactgc acaagtatat tgaaccgtat 6421 cttcaaaatt tcggaaatta agagcgatac gcccttggcg tctgcgctcc tcgcaatcgc 6481 ttgttcaata atgctcctat caactaggca aatatataaa tcgcctacgg atttaaaccc 6541 atcattcatt cttgtacgga aaaatcatca cctttggacg gtcggctcca acgcattgtt 6601 agagcgtgct ttggtggtaa gcttacgatt tctcttccac tttctgcttg ctttttctgc 6661 caatctgcaa acattgcatc taagtcgtag ttaaaagatt tagcgtgttc ttctcgaatt 6721 ttatgaattt catctaaaat ttcatcttgc cacatagctt aatctcctaa taattcgtag 6781 ggggtgcaaa ttattggtaa attgtatcca aaatcaaggc tgattgctgc aagcttcctt 6841 tgaatttgag cgtttgcaat atgcttacag ttccacgtta gcaggtaatc taacccgtga 6901 acagttgctg ccgcgatgtg aaccgcatcg tcggaagctt tgggaggaag attactgcgc 6961 gcaagaaact gtgctgacaa attttgtaca gtctaattga gatcaactaa cagtaacccg 7021 ttgaggattt gcaaacgttt caattgatcg caagtttaag tctatggaga ttcggtaatc 7081 tgctgttgta gttgttgcac ctcctcaatc gacaagcctg ttgcagaagc aataatgttg 7141 attgttactc cttggcgcaa aagattaaca gcaatctctc gttgctttct tgcttctgcc 7201 cttgcttctg cctcttcttg aatagaacga taaatcacag attcctgcat gatgtccctc 7261 cgcaataagc gataaataac atcttcttct aattttatcc cagctaaaat tgctgaggca 7321 gcggtcaaat tggcttgggt tattgggtct gaaatttggt cgatggtttg tgcaacagaa 7381 cgcaatgttg cctctggatc agttgtttga ctgagggtag caaagggaag tagtccaggg 7441 tattgtagga ataaggttac gggttgctcc cacagacaaa ctgcatcaaa ttcatgacga 7501 gtgcgctcta gggaaaaact ggtttgccga acgcgatcag atccagtggg ctgaagatac 7561 accacaactt gtcgcattgt tttgtcagga taacgtctgt aaactcgcac gcgataatcc 7621 aacatacgga atggaatatc ccgtttggga cgggtttgaa attctaggtg caagacggaa 7681 tcatcagact caagtagaat taaggcatct gcgcgaatgg gttcaaggga gagttcggag 7741 ggttttagct ccgtcagcgt catggactct cccagcagcc aactggcaaa gtcggctgaa 7801 aagttttcgg caaggaatcg acaagtatcg tcatacataa atagattgta cgactttgta 7861 
atgccttagc gaagacttct ttatgtccaa tggatatgtt ttgaacgaaa agctctatcc 7921 caatcgttgg caaccacttc catcatattg atcattcctt catagccaaa atatggcttg 7981 tcaagataag tctctctgtt agtagggctg aatacatcaa aagagatttg gattcccagt 8041 tcttctgcta tgtatttttc ccatgccgag ccaaaaacag catcagtatc aatctctgtt 8101 aaggcttgtt ttatttgata tccatcggca gaaaatacca cgctaggaga aagaccaagg 8161 ctttgcaatt cttgctctag aattgggcga gattcacgca tcgctgagcg aaacaacaaa 8221 acttctggaa tcatctccaa ctcttcaaag agcattctca ccagtccgat accaagcgtc 8281 ccatctgtag atacggcaac tctacagtta cgatatcggg gaataatcat taatgctcgt 8341 cggcgaagcg tatctacaac catttcttcg ccttgtttga tcagcggttc cactctttct 8401 tctatcttga aatgtgccgc caatgcctgc aaccaccgcg tagtgttttt tacacctatt 8461 ggtaaaggta tatcatctag aatgagggga atgtcatggg tctgttgcat tttccgagca 8521 aacttatacc cgacatcatg actgagaaga atattggcgg ttgcgtcacc agcttgttcc 8581 agttctgcaa aggaactatt gtgggagaat accgtttgca ccttaattct taaagatttt 8641 aacgtctggg ttatccagtg caaatcagcc caccaagtgg gattcacatt tgcctgtggt 8701 gcaataatgt ttacaatcct cggctttctg cccctcctct tagtctgcct ttttttgatg 8761 aaaggaagta aagcctctaa tcccatttct aagccatcat aagcattgcc ccgaaaccca 8821 ccagcaagta taggaacaag tttagcattg acctgtggct gaatattttt gcataatccc 8881 tcaatatcct caccaataat gtccgccgcg caagtgccaa cgacaaacat gaccttagca 8941 tgaaaagatt gatccgcctc tttaatcata cctttgagtt tatcagacgc tcccatgaca 9001 atgtctgact caaaaagacg agtacaacct actttgcgct ttgtaaaatc aacctcatgg 9061 ttatcatagc cagctgcaac agttgacgcg caaccttggg gagaatgaat cacaatactg 9121 acatctttca ttccagcaac catgtttgca ataccttcta aagcacagcc aacaattggg 9181 tctttgctat ggtcacaatt gtcaaaagtt agtcccgcag acattttcac ctctttattg 9241 aaagatattg cattgactat ggtagagttc ttacttatta tcaatgaatg cgataggtgc 9301 aaaactccat gtctttagac cacagatata gcagcaaagt taattaaaat tgccgtcaat 9361 tatggtaata tgactttagt ggaagggtaa ttaaccactt taaaaaccca attgctgcgt 9421 aaaaagcaga cactgcacct tgacaattga aaatatgggg ttctattcca tagttctagt 9481 ttcctcccag gaataggtaa aattttcatg ggaactagac gctcagcgtt taactcaaac 9541 aaattttgga 
ataatccaac taccgtgggg cacacggaaa gttaaagctt gaggagagaa 9601 ccacctctgg gttaggtaac gcaaggagct tagtctaagt ggactcgttg aaccaagaat 9661 ctcctctcat gatggtagga gagtgtcaat cctgacaaca ttttccccgc ctcttctctg 9721 catatacggt acaatcgggc aggtcttggg ctaattttcc tgtgagtgat tgagcagatg 9781 ctaaccctga gtttttagga aataccattt gcttggttaa tgaactgcga ggtctaaagc 9841 cgtattgaag tgcagtttta atacacttag ctacctttaa ggctccgcga taaccgaatc 9901 tgggacgatt taagatagca tttaaatcga caacgggaat gtgtgggaaa ctcaaggaag 9961 aaccaaaata taccaccgat tctgcatcaa gaccaccaat aaaattcagt aattcatctt 10021 ccgtttgttt catcgactca tacttaccaa aagccagcgt ccccttggta attaagattt 10081 ccgggtctaa cccagtttcc aataagtagt ctacactggt ttcccgtgct tccgtgctcc 10141 agggatggaa gttatagatc accgcacgca tcccaaaatc acgctctaac atatgggcaa 10201 tggacaagca cttgatagca tcatgccctt cgataatcgc taacttccct tctatatatt 10261 ttttagcttc ttcaaattcg gtcttgattg cggcatattc ctgcttcatc aaggcatccg 10321 cctgttcttc catccctaaa tattttgcag ctccttggag ccatttttct gtagcgctaa 10381 taccataagg aaggatagta gagttcagcg gagtaccgaa cttgtctgac atggctttgc 10441 cgatagtata gccaagatcc aaacataagg tacagttgac tgccgcactt ggtgctcgct 10501 taagttcatc taatgtgcag tcaccagcaa tcacactgtg aactcgtgct cccattcctt 10561 ctatcagtcg caccagttcc ttgacatcgg tttctacttc tgtcctttca ggacccatct 10621 ttgcacccac aatattgaca gcatcggcta gttgtgcttt ccgttctgga gtgggcggct 10681 ccataaagtc tacgatttgc cgaaatacga tatcaaatcc actacgatac tcccctgcga 10741 atccttcaca atgtatcgtc atgatctcag actgtatttc agtgcgtgct ttattcacca 10801 cgtcatcaac acaatcgccg ataattccag atgcacaact cgttgccaca acgatcgcat 10861 ctggattata tttttgatct acttctaaaa tcgcgcctcg taattctttt tcccctccaa 10921 aaatcaccga tttttcactc atgttggtag tgactacagg ttcataaggg gaaccactat 10981 tgcggctatt ggttagcttg tatgctcgct tgactccata agcacagcca cttggaccgt 11041 gagaaattac gatcgcatct ttaatcccac ttaaaatctt cgttgcagtc caaaatttgc 11101 agggacgggt atgactacct tgtaagctag tcttaatctt accctctttt gctaaacgat 11161 acagctcgct taaatttccg taaaatacaa cgtttccatc tatactggta gccataatac 11221 
ctcttcaatc tctcatatca tgtttgatcg aacatttgta ttaacaaaac ccctccccaa 11281 ccctcccctt ggtaagggga gggtgccgtt aggcgggtgg ggtatatttg ggaattgtta 11341 agtaattaag cgaacatgat attac // LOCUS NODE_2993_length_11282_cov_5.37302911282 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11282) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11282) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11282 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(65..1471) /locus_tag="DP116_22990" CDS complement(65..1471) /locus_tag="DP116_22990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316086.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LytR family transcriptional regulator" /protein_id="PRJNA477356:DP116_22990" /translation="MTIQRTSAQDDYSTDTSNLSDKKANTDKSGRWLWFWVGMSGIAM VSATAGALLAVSLTGKPLMQASLSPDEAAVFDSDRIAGDGLKFSQLTRPVNILIMGMS VLPPDVRNPPTQSKNLRYLPQVNSFDGLADVMLLVKFEPSHKKVMMLSIPRDTRTQID GHGTRKINSTNVIGGPALTAKTVSNLLGDVGIDRYIRINVLGVGKLIDALGGVTVYVP KDMKYQDDSQHLYINLKKGKQHLNGDQALQLLRFRHDENGDIGRIQRQQMVMRSLMEQ SVNPATVAQLPKVLNVVKEHIDTNLTLEELLALIGFGVRTNRSNMQMLMLPGQFGQKG GYWIPDSQRIHSMMAQYFDVQTDSTSGVVNPGSLRVAIQNSTGNDKANLRPLIRTLQK LGYTNVYVAKSWGEPLEETHIVAQQGDGNSAESIRNTLGFGEVRVESTGNLGSDISIQ LGKDWLRQTQINQNPVQP" gene complement(1760..2154) /locus_tag="DP116_22995" /pseudo CDS complement(1760..2154) /locus_tag="DP116_22995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196003.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 2259..3536 /locus_tag="DP116_23000" CDS 2259..3536 /locus_tag="DP116_23000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494374.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AI-2E family transporter" /protein_id="PRJNA477356:DP116_23000" /translation="MQTRKLLNWWETLTPLARFLTITLFAPLLVLNGWAISAIFHYFH SLIVILVGASVLAFLLNYPVSWMQRQGAKREQVAILVFLLALSVLLALGVTLVPLALT QAQQLVARIPELIDSGRSQLMILNEKAESFGLPINLDALVSQINDRVKGQLQAIAGQV LNLAVITVTSLLDFILTMVLTFYLLQHGDELWESLVYWLPVRFREAFSNTVRLSFQNF FISQLISATCMASALIPIFLWLKVPFGLLFGLTIGIMALVPFGGSVGIALTTLLVTLQ DFWMGARVLIAALIVQQILENFIAPRILGSFTGLNPVWVLISVLTGARIGGLLGVIVA VPSAVVIKTVLSAIRPYSSSSGNNVEVLPQGSSQSCNTENHDTGGGSITPDSKAYSAE VAAPVSVEASPPTAQTNHPGANPSKTVAPQWNP" gene complement(3505..4833) /locus_tag="DP116_23005" CDS complement(3505..4833) /locus_tag="DP116_23005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316083.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serpin family protein" /protein_id="PRJNA477356:DP116_23005" /translation="MKQKFSNAGENFLQRRYGVRLGRRYVLAAASVVLMGVIGCSQVN SSTSAFAESGLPRSESPIPKSMSYPDIKLITANTKFGFKLFSEVLKNDSGKNIFVSPS SVAIALAMTYNGASGSTKQVMAKALEFKDLNLEQINSSNAVLKKLLENPDPKVQLTIA NSLWANKEVSFNPDFLQRNRDFYTARVANLDFTDISSPAIINDWVSEKTRGKINKIVE KIEPSQVLFLINAIYFKGSWTNEFDKQQTAEYPFYLSSGQQKQHLMMSQSGDYKYYEN QQFQAVSLPYGKDGKISFYIFLPKQNSSLESFYQNLNAENWENWMTQFSKQQGFIRLP RFKMDYDITLNNALTAIGMGEAFSNQANFSAMGKDLKISEVKHKTFVEVNEEGTEAAA ATSVRIMPLSAVEPSLEPFRMIVERPFFCAIRDNQTGSILFMGSIVEPQS" gene complement(5126..6718) /locus_tag="DP116_23010" CDS complement(5126..6718) /locus_tag="DP116_23010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878456.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S8 and S53 subtilisin kexin sedolisin" /protein_id="PRJNA477356:DP116_23010" /translation="MANKQSWIIWGLGACCLSAPVLAAVESPFGTNGIDALRLHQLPY NLIGRKIAIGQVEIGRPGRFGLDKAVSKNRSVSIAGAFLRNELAKSNSGVDPHAYNVA SMMVSTDKGFPGVAPGARLYSSAVGYGKSLGQPQECLSAQHIALQNGGDVRAINFSFG EPLNRDPRPEAVLDGNALLTLCIDWSSRVHNSLYIIAGNQGKGGIPIPTDNYNGVNVA FSSRRGGIYNKVDVSNLAAVSEGLAMRLAGKEINLGPRRAIGIVAPGNNIPLRNPDGK KNKVTGTSFAAPQVTATVALLQEFGDKQLRTKQPNWSIDSRRHEVMKAVLLNSAEKIQ DNGDGLRLGMTRTLIDKQNKDWLASDAYNDPKIPLDSQMGTGHLNVFRAYQQFSAGEW NPTQTVPALGWDYRTVNTEASVEYELAKPLKQGSFVSITLTWDRLVELNDRNKNGQFD AGENFRDRGLNNLDLYLVKADDKNANAEAVCSSISEVDSVEHIFCPVPTTGNYKIRVQ FRKKVNEATQAYALAWWSVPVR" gene 6880..7587 /locus_tag="DP116_23015" CDS 6880..7587 /locus_tag="DP116_23015" /EC_number="5.1.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742756.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribulose-phosphate 3-epimerase" /protein_id="PRJNA477356:DP116_23015" /translation="MTQTQSKKPIVIAPSILSADFSRLGDEIRAVDQAGADWIHVDVM DGRFVPNITIGPLIVEAIRPVTQKPLDVHLMIVEPEKYVEDFAKAGADHIYVHAEHNA SPHLHRTLGQIKEVGAKAGVVLNPGSPLELIEYVLELCDLVLIMSVNPGFGGQSFIPE VVPKIRKLRQMCDERGLDPWIEVDGGLKANNTWQVLEAGANAIVAGSAVFKAKDYAEA INGIRNSKRPTPQLATA" gene complement(7768..8451) /locus_tag="DP116_23020" CDS complement(7768..8451) /locus_tag="DP116_23020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861309.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23020" /translation="MVQELQDFAFSQQGSMTIVRVLGYGLLLLALFDIIEMFIPPNFM NPVWEFQTMGMLVEKVPVPLIGLVLVFFGELHSRTKWEIPILKFLSWLTLLFGILFFL LIPLGLASTIRLNTQNAAQVKAVSTQQVSRAEELEKQLSQVTPDQIDKFFKIQGRSLD GKNPQEIKNQLLTEVSKAKKNIKTQAEATQSLRGLNLIKTSAKWNLGAVVAGTLFISI WKGTRWARN" gene complement(8523..8600) /locus_tag="DP116_23025" /pseudo CDS complement(8523..8600) /locus_tag="DP116_23025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316079.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="cyanoexosortase A system-associated protein" gene complement(8830..9264) /locus_tag="DP116_23030" /pseudo CDS complement(8830..9264) /locus_tag="DP116_23030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878461.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(9355..9795) /locus_tag="DP116_23035" CDS complement(9355..9795) /locus_tag="DP116_23035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457162.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23035" /translation="MSVLKQSRQIIIACVLILVLTLTTACGASNATIADRTTNLPVAG RNITYGELEQGNTPAGQTFGNWVVQTSKGLITDAYVRENNKLGIVISSKVPPSDVRPL AKSLLEGFHRNFPNEDLKVLVYAPDKKLILTAQYDVQTNQVQYT" gene complement(10069..10251) /locus_tag="DP116_23040" CDS complement(10069..10251) /locus_tag="DP116_23040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015189097.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CsbD family protein" /protein_id="PRJNA477356:DP116_23040" /translation="MSLEDRAKATAKNIEGKVQEAVGNVTGDPEDKAEGQAKQAESQV RHGVENVKDDVKDALK" gene complement(10584..11003) /locus_tag="DP116_23045" CDS complement(10584..11003) /locus_tag="DP116_23045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23045" /translation="MSITSSGEVRSPRRKNLKLNGETTVENNQGESWDKVEHSSDQQT RVELPGKLEVHDTLAISGIRPVSSSDLQVVETKNIMGIRPTTANTFHVVDTLNMSGAR PIGSSDLVISETYSVMGNRPIASNTIDDSESLMGFLD" BASE COUNT 3170 a 2363 c 2237 g 3512 t ORIGIN 1 caagttctga cgatgaatgt agaatcccct cgcctttagg caggggagtg tcaacttgtt 61 cgacctatgg ttgaacaggg ttctgattga tttgtgtttg ccgcaaccag tccttaccca 121 gttggatact aatatcagaa ccaagattac cagtgctttc tacgcgcact tctccaaatc 181 ctaaagtgtt gcgaattgat tcggcgctgt taccgtctcc ttgttgggcg acaatatgag 241 tttcttctag aggttcaccc cagcttttgg ctacataaac gttggtatag ccaagttttt 301 gcaaagttct aatcaatggt cggagattag ctttatcgtt acctgtgctg ttttgaattg 361 cgactcgcaa agatcctgga ttgaccacac cagatgtaga atctgtttgc acatcaaagt 421 actgagccat catgctatga atgcgctggc tatctggaat ccagtagcct cctttttgac 481 cgaattggcc aggtagcatt aacatttgca tattggagcg atttgttctt accccaaaac 541 cgattagtgc tagtagctcc tcaagtgtca agttagtatc aatgtgttct tttacgacgt 601 taagaacttt gggcaattga gctacggttg ctgggttgac agattgctcc atcagtgatc 661 gcatgaccat ttgctgccgc tgaatccggc caatatcgcc attttcatca tgtcggaagc 721 gcagaagttg cagtgcctga tcgccattga gatgctgttt tcctttcttt aaattaatat 781 ataagtgctg agagtcatct tgatatttca tatctttggg tacgtacact gtaactccac 841 ccaaggcatc aatgagctta ccaactccca acacgttaat gcggatatag cggtcaattc 901 ctacatcacc taacaaatta ctcacagttt tggcggttaa agctggtcca ccaatcacat 961 ttgtggaatt aatttttctt gtaccatgtc catctatttg tgtccgagta tctctgggaa 1021 tagaaagcat catcactttt ttgtgcgatg gctcaaattt gaccaagagc ataacgtcag 1081 caagaccatc aaaagagttg acttggggca gatatctcag atttttgctt tgagtgggag 1141 gatttcgcac atcaggagga agtacgctca tccccatgat taaaatattc acggggcgag 1201 ttaactgtga aaatttcagt ccatctccag caatgcgatc gctatcaaac accgctgctt 1261 catctgggct taaagaagct tgcattaaag gcttacctgt taaagaaacc gccaacaatg 1321 ctcctgctgt tgctgatacc attgcaatcc cgctcatacc cacccaaaac cacagccaac 1381 gtccagattt gtcagtgttg gcttttttat cacttaagtt gctagtatct gtagaatagt 1441 
cgtcttgcgc cgaagttctt tgaatagtca cgcgactcct cacacttatc tgaataaaat 1501 ctggataatc tgctgctatg gaaacagaca aattaaccca tgatagctaa ctcttagatt 1561 aacaatgata aattttacac ttcagtatga accacaacac ttattggagt aattctccta 1621 ttttttgata aatccggaga ggttttctta gaagtatgac aatatcaaaa gagatttgac 1681 tagacctgtt aagaataaaa tacatatttt tttggcaaat taagtaaaaa ccctgtcaac 1741 tgctttgatg actaaagaat tatgacaggt atttgccaag cgttctttcc acaaaacgac 1801 tcacgggcgt ggcaatcgcc ttaatctgct cttaactaga agaacaatta accagatatt 1861 aatcaaaaag tagcgacagg tgagaaagtt atgaacgata agtagatgca atgttaaaaa 1921 aattgaagtt gtcgccccag ttgcgagtat taggtaaccc aaaaaccagg taaataccaa 1981 agttattgac agataactaa ctcgaacttg ttcccaactt tccggatggt ggtagagtgt 2041 ccatagtgag gagaaaaaat ctctgaccgg ataatggtag ataagcacca gtccaccaga 2101 ggtggaggag tcttgaggat ggtaagaatt gtcagataat ttaagaattt tcatttgcct 2161 gagtgcaatc cgaaataaga gggatgaatc atagtggtgt agtcagagat aataggctgt 2221 aaagagaatt ttatttagtt gtatccaaca attggcagat gcagacacgt aagctcttaa 2281 actggtggga aacacttaca cccctagcac gattcttgac aatcactctg ttcgctccac 2341 tgctagtgct caatggttgg gcgatttcag caatttttca ttactttcac tcgctaattg 2401 ttattttagt cggagcgtca gtgctagcat ttttgctaaa ctaccctgtg agctggatgc 2461 agcgtcaagg tgctaagcga gagcaagttg ctattttagt ctttctactc gctttgtcag 2521 ttttactggc gttgggtgtc acccttgtcc cgcttgctct gactcaggct cagcaactgg 2581 tggcgcgtat accagagtta atagattctg ggcgatcgca gttgatgatc ctgaacgaga 2641 aagcagaaag ttttggttta cccataaacc ttgatgcttt agtctcacaa attaacgacc 2701 gtgtcaaggg acaattgcag gcgatcgctg gacaagtttt aaatttggct gttattacag 2761 tcacaagcct ccttgacttt atcttgacaa tggtattaac tttttatctc ctacagcatg 2821 gggatgaact gtgggaaagt ttagtatatt ggctacctgt tagattccgc gaggcttttt 2881 caaatacagt acgcctcagc tttcaaaatt ttttcattag ccagttaatt tcagccactt 2941 gtatggcctc ggccttgatt cccatctttt tgtggttgaa ggtcccattt ggcttactat 3001 ttggcttgac cattggtatc atggcacttg tcccatttgg tggttctgta ggtattgctc 3061 taacgacttt gctcgtgaca ctacaagact tttggatggg ggcaagggta ttgattgctg 3121 cacttattgt 
ccagcaaatt ctggaaaact ttattgctcc gagaatttta gggagtttta 3181 cgggtttaaa ccctgtttgg gtgctgattt cagttttgac aggcgcaaga attggtggac 3241 ttttgggtgt gatcgtagcg gtacccagcg ctgttgttat caaaacagtt ttaagcgcta 3301 tacgtcctta ctcttcatcc agtggcaata atgtagaagt acttcctcaa ggaagttcac 3361 aatcttgcaa tacggaaaat catgacactg gagggggtag tataactcca gattcaaagg 3421 cttattctgc tgaggtagct gcacccgttt ctgtcgaagc ctcgcctcca actgctcaaa 3481 caaatcatcc tggggcaaat ccttctaaga ctgtggctcc acaatggaac ccataaataa 3541 tatgcttcct gtctgattat cccgaatcgc acagaagaag ggacgctcaa caatcattcg 3601 gaatggctct aagctcggct ctaccgctga taatggcata attctcacag aagtagcagc 3661 agcagcttct gtaccttcct cattgacttc aacaaaagtt ttatgcttga cttcgcttat 3721 ttttaaatcc ttacccatag ctgaaaaatt tgcctgattg ctaaaagcct cacccatacc 3781 tatggctgtc agagcattgt tgagcgttat gtcatagtcc atcttaaaac ggggcaagcg 3841 aataaatccc tgttgtttgc tgaactgagt catccaattt tcccaattct ctgcattcaa 3901 gttttgataa aagctttcta gactagagtt ttgtttaggc aagaaaatat aaaagctaat 3961 cttaccatct ttaccataag gtaaactgac tgcctgaaat tgttgatttt catagtattt 4021 atagtcacct gattgtgaca tcattaggtg ttgtttctgc tgaccggagg agagatagaa 4081 aggatattct gcggtttgct gtttatcaaa ttcatttgtc cagctacctt taaaatagat 4141 agcgttgatg agaaatagca cttgtgatgg ttctattttt tcaactatct tatttatctt 4201 cccgcgtgtc ttctcactaa cccaatcatt aataattgca ggagaactta tatctgtgaa 4261 gtctagatta gcaaccctag ctgtgtaaaa atctctattt cgttgcagaa aatcgggatt 4321 gaaactaacc tctttatttg cccaaagcga attagcaatt gtcagttgca ccttaggatc 4381 tgggttttct aataactttt tcagcaccgc gttagaggaa ttaatttgtt ctagattcaa 4441 gtccttgaac tccagtgctt tagccattac ttgttttgta ctgccactag cgccattgta 4501 ggtcatggca agggcgatcg ctacgctaga aggtgaaaca aaaatgttct tgccactatc 4561 gtttttcaaa acttcggaaa acagtttaaa gccaaatttt gtgttggcag taatcagttt 4621 aatatcagga tatgacattg atttcggtat cggagattct gagcgaggta aacccgattc 4681 ggcaaaggca ctcgtgctac tattgacttg agagcatcct ataacaccca ttaagacaac 4741 actcgcagct gccaaaacgt aacgtctgcc taaacgaact ccataacgtc tttgtaggaa 4801 attttcccca gcattgctaa 
atttttgctt cattttacca cccataatag ctagagtctt 4861 tattttgact atcgcatctg gtgtgaggtg atatctgatc cgttacagtt ttggcaacag 4921 tgattgtaca gtttgaattt tttttagcta tgggtagttt gatttctgag tcaataagta 4981 ttagttacta gtttttagat ttcatcactg tttgtggcgg tggatgaatt accgttccta 5041 gcccctgatg cgatatagct ttctcatcca cataccaaat cagttgattc gccgctttct 5101 gtgctgtacc tagctaggga atttcctacc tcacgggtac actccaccaa gccaaagcgt 5161 aagcttgagt tgcttcatta actttcttgc gaaattgaac acggattttg tagttacctg 5221 ttgtaggaac agggcagaaa atatgttcga cactatcaac ctcacttatt gaggaacaga 5281 cagcctcagc atttgcattt ttgtcatcgg ctttgactag ataaaggtcg aggttgttta 5341 agccgcgatc gcgaaaattt tctcctgcgt caaactgacc gtttttattc ctgtcgttta 5401 gctctactaa cctatcccaa gttaaggtga tagaaacaaa actcccctgt tttaagggtt 5461 ttgctaactc atattctaca gatgcttctg tattgactgt gcgatagtcc caaccaagag 5521 ctggtacagt ttgcgttgga ttccattcac cggcgctaaa ttgttggtaa gcgcgaaaaa 5581 cattcaaatg accagttccc atttgcgaat ctaaaggaat ttttgggtca ttatatgcat 5641 cagaagccaa ccagtcttta ttttgtttat caatgagcgt ccgcgtcatt cctaagcgca 5701 aaccgtcgcc attatcttga attttctcag ctgagttgag cagtactgct ttcatcactt 5761 catgtctgcg agaatctatg ctccagttag gttgttttgt tcgcagttgt ttatcaccaa 5821 actcctgtaa cagagcgact gtagcagtta cttgaggcgc tgcaaagctt gtccctgtga 5881 ctttattctt cttcccatct ggatttcgca agggaatgtt attcccggga gcaactatac 5941 caatagcacg acgtggacca agattaatct cttttccagc aagccgcatc gctaatcctt 6001 cactgacagc cgccaaatta gaaacatcaa ctttattata aatccctcct cgacgggatg 6061 aaaaagctac gttcactccg ttataattat cggtaggtat gggaatgccg ccttttccct 6121 gattgcctgc aatgatgtac aaagaattgt ggacgcgact agaccagtca atacataaag 6181 ttagtaaagc attgccatct aaaacagcct ctggtcgtgg gtcacggttt aaaggttcgc 6241 caaagctaaa gttaattgcg cggacatctc ccccattttg tagtgctatg tgctgtgcag 6301 acaagcactc ttgtggttga cctaagcttt ttccataacc cacagcagat gagtacaatc 6361 gcgctcctgg ggcaactcca ggaaaacctt tatctgtact aaccatcata ctagcgacat 6421 tgtaggcgtg ggggtcaaca ccgctatttg atttagcaag ttcattgcgt aggaacgctc 6481 ctgctatgga tactgagcga tttttagaga 
cggctttatc taacccaaac ctaccgggac 6541 gcccaatctc tacctgacca atagcaatct tgcgaccgat taaattatat gggagttggt 6601 gtagtcttag agcatcaata ccattagtgc caaaaggaga ttccacagca gcaagtaccg 6661 gcgcactcaa gcagcaagca cctaatcccc aaattatcca actttgtttg ttcgccatag 6721 tcagttagaa atcatgagtt gttatttgtt tgttatcaga aagcactact aacgactaat 6781 taatcctcac gaatgaaaaa gtggggatca gatagatgag cgtttgctca agttttgtta 6841 caatgctcac agacgaggac gctttaaaac ccactagcca tgacccaaac acaatctaaa 6901 aagcccattg taatagctcc atccatccta tcagcagatt ttagtcgttt aggagacgaa 6961 atacgtgctg tagatcaagc tggcgcagat tggattcatg tcgatgtaat ggacggtcgc 7021 tttgtaccta atattacgat aggtcctctg attgtggagg cgattcgccc agtgacacaa 7081 aagcctctgg acgtccactt gatgattgtg gaaccagaaa agtacgtaga agattttgct 7141 aaagcagggg cagatcacat ctatgttcat gctgaacata atgcttctcc tcacttacac 7201 cgcacgttgg gacaaatcaa agaggttggt gcaaaagcgg gagtggttct taaccccggt 7261 agccctttgg aactgattga gtatgtccta gaactgtgtg acttggtgtt gattatgagc 7321 gtcaaccctg gttttggcgg tcaaagcttt attcctgaag ttgtgccaaa aatccgcaag 7381 ctgcgtcaaa tgtgcgacga acgtggtctt gatccttgga ttgaagtgga tggaggattg 7441 aaagcaaata atacctggca agttttggaa gctggggcta atgcgatcgt ggctggttca 7501 gccgtattta aagctaagga ttatgctgaa gcaataaatg gtattcgcaa cagcaagcgt 7561 cctactccac aactcgcaac agcgtgaata attgaataag aaaagaaatg ttttttctcg 7621 ttccctggct cagccaggga atgcataact tgaggctctg cctcaatata atcattataa 7681 tcattgggag gcagagcctc cctttaggcg ttcccatgca gagcatagga acgagagaaa 7741 taaatctcta aatatccccg cgtcatatca atttcttgcc caacgcgttc ccttccaaat 7801 actgataaac aaagttcctg caaccacagc accaagattc cattttgcag aagtcttaat 7861 caagtttagt cctcgaagag actgagttgc ttctgcttga gtttttatat ttttcttagc 7921 ttttgagact tctgtcaaaa gttgattttt tatttcttga ggatttttgc catctaatga 7981 acgaccttga atcttgaaaa acttatctat ttgatcaggt gtaacttgac tcagttgctt 8041 ttccaactct tcagcacgag aaacttgttg agtggataca gctttaactt gagcagcatt 8101 ttgagtgttt aaccgaatgg tactagcaag tcctaaaggt atcagtagaa aaaataatat 8161 tccaaataat aaagttaacc aagataaaaa ctttaaaata 
ggaatttccc attttgttcg 8221 agaatgtagc tctccaaaaa acactaacac tagcccaatt aaaggcactg gtactttttc 8281 aactagcatc cccattgttt gaaattccca aacaggattc ataaagttgg gcgggataaa 8341 catctcaatg atgtcgaaca atgccaataa taataaacca tagcccagca ctctaacgat 8401 tgtcatcgag ccttgctgac tgaaggcaaa gtcttgtagt tcttggacta tgggaagaaa 8461 tttatcgcta tttgattttg tcattttgct aaccgagtaa aactaaaata attgacgata 8521 atttaggact tgggaaagcg tggttcccac caccgatacc aagaaaacca agcgttttct 8581 aaaacttggt aagctgcatc tacttgtctt aacgcaaaat ttaaagtgaa tattgttatt 8641 aagtaaacat tacatatcat gaaaagttag cctagatcaa gcttccagat gattcatcac 8701 ctaaattacg atataaaaat ttggtaaatt tacgactatt aaataaaaaa gatgggcatt 8761 cttgcccacc attaaaactt aagaacccaa cactcaactg ttatcggtga tgattcccta 8821 accatttagc tgctctgcca acacttcgcg gaagatatcg ggatggttgt gataggcgaa 8881 ggacgcaagt ttactaacat cgtcagcagt catacgatta ggagcgtggg tggaaagccc 8941 gagttgttgc tccaaatgac gatcatctaa tcctctttgc ttgaagtgtt tgaagaaggc 9001 gcgggcgaca tcatcccgtt cattaggttt aatttgagca attgcctttt gcaattctgg 9061 ctccatttgg ttggctggaa tgttgtcatg atgtaaagat tgaccaaata actgacggcg 9121 ttcttcgtgt gttgagcgct gagcaaaatc atcaaaatta tcgtactctg ttgttgagcc 9181 ggtgggagca tcacctagag actcaacatt tccctgagcc aaatcactca tgatgtgacg 9241 tctgtattga tcgctggtgc tcattttact tatctccttt ctcttatcaa actttttcaa 9301 tttctcgttt ttaaatcaac tgacttcgct caagtttttg atcaaattcc gcaactaagt 9361 gtactgaact tgatttgttt gtacgtcata ctgagctgtc aaaatgagtt tcttatcagg 9421 cgcatacacc aaaactttca agtcttcatt ggggaagttc cggtggaaac cttctagcaa 9481 ggattttgct aagggacgta catcgctggg aggtacttta gaggaaatga caatgcctaa 9541 tttgttgttc tcacgcacat aagcgtctgt aattaatcct tttgatgtct gtacaaccca 9601 gttaccaaaa gtttgtccgg ctggggtatt tccttgttct aattccccat aggtaatatt 9661 tcgaccagca actggtaagt tggttgtgcg atcagctatt gtcgcattac tagcgccaca 9721 ggcagttgtc aaagttaaga ctaaaatcaa gacacaagct ataatgattt gacggctctg 9781 cttcagcaca ctcatgttta ttacctcctt aaaataagtt ttcagtcaaa tgaagcggaa 9841 gatttcttcg gtgttaattg agtctatttt tcttgaattt tctacactca 
ctgagtaaag 9901 ttattttgac atatcaagta ctagattaaa aaacgaagcc cactgacgcg gacttcgttc 9961 ccgcgctttg ataacgcgcg tttactcagt tgttaagttt aacagcgtag gtactaagat 10021 aaatacactc tcatgaaacg ctcaaactta gcagaaaaag cattagtcct atttcagtgc 10081 atcctttacg tcatctttga cattttcaac gccatgacgt acttggcttt cagcttgctt 10141 tgcttgacct tctgctttat cttctggatc gccagtaaca tttcctactg cttcttgaac 10201 tttaccttca atgtttttag ctgtagcttt tgcacgatct tctaaactca ttttgacctc 10261 cagaccttct tttaattacg tttcaaagtt gataagtaac cttgaacttg ttttttatta 10321 ctcttacaaa ctagcttaat cttctgacag cagccttcca cctctagata tattagttcc 10381 ctatactaaa agagagagga gaagtgacta ttagctactc ctttaacact agttagttat 10441 tcaaaaactt ttgatactac tagacactaa accatttaag aataaggttt tatgaataaa 10501 ttttaatgtc gcttatttaa ttaatgaaaa agccggcttt cttacgaaaa ccgggtgaat 10561 aaagtaatct gtacaaattt tatttaatca agaaaaccca ttagactttc agaatcgtca 10621 atagtatttg atgctattgg gcgatttccc atgacagagt atgtttcaga aatgacgaga 10681 tcacttgaac caattggacg tgctccagac atgttaagag tgtcaacaac gtggaatgtg 10741 tttgctgtag tcgggcgtat acccattatg ttttttgttt ctacaacttg caaatcactg 10801 gagctcacgg gacgtattcc agagatagct agtgtgtcat gtacctccag tttacctggt 10861 aattctaccc gcgtttgctg gtcactactg tgctctactt tatcccaact ttcaccctga 10921 ttattttcca ctgtagtttc tccgttaagc ttcagatttt ttcgccgagg acttcttact 10981 tcaccagagc tagttatgct cattgcttct cacctggtgc attgttaact tattttacaa 11041 tattctggaa aagttaaatt tttccttaat atctaagttt aactttaaaa aatcttgtgt 11101 aggtataatt ttttttgaga ttcatatagt atttttgtaa aatttaataa agaaacattg 11161 caaaacctaa tctcccaatt actggcgacg aaactcattt ctacagtaaa accaccttgt 11221 gtccaagttg cttagcatat aaaatagaag cagctaaatt aatttttatg acgcaatttc 11281 at // LOCUS NODE_3000_length_11262_cov_5.039618 11262 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11262) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11262) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11262 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(261..596) /locus_tag="DP116_23050" CDS complement(261..596) /locus_tag="DP116_23050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23050" /translation="MHTIARTPTTDMEVTSIRLERELKDKLKELAGNQGYQALIRDIL WNYVQQKSGEWKPRFSRADIRASIAATAQQEERCVLTGQLIQPKQPMLLGLTRNGDMV PLSVESLAG" gene 667..915 /locus_tag="DP116_23055" CDS 667..915 /locus_tag="DP116_23055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113794.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23055" /translation="MKFSVKSVYPISTSLRVVNCRLSWESAAVGVEYLFMGDGDWNKW QITQGNYDSQYASIRALNAVAKLERQSGRYKQQNAVPI" gene 1013..1165 /locus_tag="DP116_23060" /pseudo CDS 1013..1165 /locus_tag="DP116_23060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315075.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="HNH endonuclease" gene 1205..1621 /locus_tag="DP116_23065" CDS 1205..1621 /locus_tag="DP116_23065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453719.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23065" /translation="MLKLTYTETSFCLECLVQSLEEWVQARVILALRVGRSLCVEPST ASFLLPVNLPGVEILKALVNRENSEIIALCNCDHECVEVTLRGYWLSNGSQDAEGVFI TAMSDSFYGEHSASPTEFFLHKLWQEAQHRASVMSE" gene complement(1665..1904) /locus_tag="DP116_23070" CDS complement(1665..1904) /locus_tag="DP116_23070" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23070" /translation="MAQNVYYQRNRLYINSHSIMRRRLNLKIFFKIRIKKFIKLEYFL THLKPLFFIYLLLNTRPDLFDCTCISEAIGIKKEQ" gene 1903..2280 /locus_tag="DP116_23075" CDS 1903..2280 /locus_tag="DP116_23075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23075" /translation="MSRRFLPPLSPDDLPPTRVTQLDELLLRQLEEAICKRFFKACSS MTRALLSNCHWYFQINGGSLILVIVSYDMESYWHIANAIPQIVNKLKLFSNTAKIRFC PPSKEGIPWEIAVDEISDEKDVS" gene 2284..2640 /locus_tag="DP116_23080" CDS 2284..2640 /locus_tag="DP116_23080" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23080" /translation="MVKNIIRWHSFVDCESSAARGKPSCWFWRQRYALTPAVRHGAVA SAIGIGHTRKNQGVSFKNAFPSSWEQSYYHTEFGWAKHAERLCKAHPPDWQLGDHYLK ISIASVDKMIYKLINI" gene 3022..3459 /locus_tag="DP116_23085" CDS 3022..3459 /locus_tag="DP116_23085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949923.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_23085" /translation="MPRKEQGWITFQTSEEERKILEEFSKHSQRTKTEILRELVRSLN KESQQAPSLPTQQYETEEAWTSRIEVFGRKKPLKVSSRNILKGIIKRVVTGAVNTEVT LEIVHKVELTSIITRVSADELDLSEGTEAYAVIKSNDIVIARD" gene complement(3587..3871) /locus_tag="DP116_23090" CDS complement(3587..3871) /locus_tag="DP116_23090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319188.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23090" /translation="MAKAIWNGAVLAESDNTVVVESNHYFPPDSINKQYFKESNTHTT CPWKGLASYYNVEVDGQLNKDAAWYYPDAKEKAKQIEGYVAFWRGVKVEA" gene 4126..4833 /locus_tag="DP116_23095" CDS 4126..4833 /locus_tag="DP116_23095" /inference="COORDINATES: protein motif:HMM:PF00498.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Forkhead-associated protein" /protein_id="PRJNA477356:DP116_23095" /translation="MHSECHFLDVQEPDGNKYTIKLEQDRFTLGRSNSNDIILPNSDK TISRNHCVLECKANCWWVVDESSANGTFVQRCNRDTRIDVRQHGKYQLNNGDTILILG KFLEENEPVFWRLTFRDSEQTVQPPAFLEYSLSQQKLFLVKSSEQQEIHLSRRERNLI HYMAERNQANNNKPVVCEYQQMITAIWGDSCGHTNNDVTRLVWVIRKKVESDPGEPEF LITEKARGYSLKVKLIA" gene 5075..7309 /locus_tag="DP116_23100" CDS 5075..7309 /locus_tag="DP116_23100" /inference="COORDINATES: protein motif:HMM:PF07714.15,HMM:PF12849.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23100" /translation="MVQEQPSKYNNGKSSKPTNSISNEKIEQALEQSLQNQRGIVSNW LPGYELKGGRYIIQEELSTGGFGVIYRAKDSRKNQVLVIKVLKSDLYNEQESDSFEEN FVKEAINLAICKNLYIVPFKDLIKTSGKWCIVMEYIEGEDLNKWVRKNGPLSEADALR YIRQIGEALKVLHEHNLLHRDVNPKNIIRRANGLEAVLIDLGIARKFSHNQTEQHTPI LTQDFAPIEQYREKYKRGAYTDVYALAATLYFLLTGNAPIASWRRLNGEPLKTPQQIK GKQYISEQVNDAILLGLQIFPQERPETIQEWLEKLPPDESKIEEATYQYNRTTNTNKE ERDVSKRTQPPNDTQPNQPNPRKLLSLPAAFTSPASFMVALAIFSSLGTSFISAGLWL LLTICFIYVDFFITKRCTIKQQIGLVISTLSPPLVILLYIFALRHWKAQEVEPLGWAP ETFYYGGSTTWAPIRRDVDPQIQKFFPNFKLKLVNPKSVNPGSETGIKQLLGLLDGEE RLTFAQSSRPFTGDEYKKAHTQGFLLKQIPVAIDGIVVAVNPKLNISGLTISELKAIY QSKITNWNQVKGSNNLNLPIKPYSRKKEVGGTVKIFVEDVLENSDFGTTVEFKEDTTS ALKLVQSDMGSIYYASAPEVVKQCNIKALPLGYRFNQLVSPYQKRFLSYKDCIQEGRN RVNIEVFRNKQYPLIRKLYVILKEDHPDQQAGEAYANLLLTAEGQQLLEKAGYASLYD TQAN" gene 7403..7705 /locus_tag="DP116_23105" /pseudo CDS 7403..7705 /locus_tag="DP116_23105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319452.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DNA helicase RecQ" gene complement(7814..9601) /locus_tag="DP116_23110" CDS complement(7814..9601) /locus_tag="DP116_23110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006635997.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_23110" /translation="MQIVLRSISYFRKDLWLIATLLVLIGASVVLNLLNAWPIAILVD TVLSPTPKPNWIHTLFLAPFGEGRLNRIFGMAIVGMIIKILSDTVIMLRKMLNYRIQY NGTMRVRTELYDKLQALSLGWHNSQSQGDAIYRLSYDSLGPWGVIDVLIGSTAASVTL TAMIGIMLSRHVLLTVFALSFTPLLILANWYFEGEIKRRATNSKRTDALMTSTMQQAI ELIGLIQSFGREATESRRFAQVVEQSVAASMRLNWLETLYPLAVQVIFALGGGVIFGY GGYLVYRDQFLRPVPNGLTIGDLIVFMAYLAQFWDPLGWVLGFTTKIQTFVASCDRVF AVIDLAPAIADEPDAQPLPVRPRTLALCDVSFEYSPGRPVLRKISATIEPGQMVAFLG PSGTGKSTLLNLLPRFYDPMEGHVLLDGVDLRTFKVCDVRKHMAIVTQSSPLFPGTIA ENIAYGRADAEFHEIQEAAIESGAAEFIETLPEQYDTIVTEGGQNFSGGQRQRLAIAR ALLTNAPILILDEPTSSLDLKHEQWVIQTLQRLRRLKTIILVTHRLETAVDCDQIFVM QGGEIVESGTHAELLGLRGLYERMLGYRF" gene 9793..>11262 /gene="recQ" /locus_tag="DP116_23115" CDS 9793..>11262 /gene="recQ" /locus_tag="DP116_23115" /EC_number="3.6.4.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319452.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA helicase RecQ" /protein_id="PRJNA477356:DP116_23115" /translation="MSQNPNLEQALKYYFGYDNFRLGQRQIIEQALQNRDLMIVMPTG GGKSLCFQLPALLKKGLTVVVSPLIALMQDQVEALRDNGIAATFLNSSLNAYKVRSRE ENILSGKVKLLYVAPERLLSERFLPFLDLVHHQIGISAFAIDEAHCVSEWGHDFRPEY RQLKSLRKRYPDIPVLALTATATERVRADIIQQLGLKQPSIHIASFNRTNLYYEVRSK TKYAYAELLELVRETEGSAIVYCLTRKKVDELTLKLQHDKISVLPYHAGLSDEERTKN QTRFIRDDVRVMVATIAFGMGINKPDVRLVIHFDISRNLESYYQESGRAGRDGEPSRC TLFYSYGDVKTIEFLINQKPDPQEQLIAKHQLRQIIDYTEGTDCRRTIQLGYFGERFS GDCGNCDNCRYPKPIEDWTIEAMKFLSCVARCKERFGMGHIIDVLRGGKTQKITQYEH DKLSTHGIGKDKTVDEWRMLGRSLLHQGLLEQTADGYAVL" BASE COUNT 3323 a 2373 c 2535 g 3031 t ORIGIN 1 taatttaact cataattctt tacaatattt tatttttact cagctaatgg gtttttatgt 61 gtcaagtcgt tgttgaaact cctggtgttc aaacagcctc actcagttgc aaggcatcca 121 aatctattgg ggctaacagt ctaataattt gatatcttaa actattaaat ttactgataa 181 ataaagataa agcaaaattc tgaatgcagc ccgttgcagt atcatataga acaactgagc 241 tgcattcagt tgaaaaaagt ttaaccagct aaactctcga cacttagagg aaccatatcg 301 ccatttcttg tcagtcctag taacattggt tgttttggtt gaatgagttg acctgttagt 361 acacagcgtt cttcttgttg tgctgttgcg gctatgctag cccgaatatc agctcgcgaa 421 aatcggggct tccactcacc tgatttttgc tgcacataat tccagaggat gtctcgaatc 481 aaagcttgat atccctgatt accagctaat tctttgagtt tatctttcaa ttcccgttct 541 agacggatgc tggtgacttc catatcggtg gttggtgtgc gagctattgt atgcatcagt 601 ttttctcctt aaagtatgga caggatagta atacaagtgt agtatgttta aaggaatgtt 661 caataggtga aattttctgt gaaatccgtt taccccatct ctacttcctt gcgtgttgtc 721 aactgtcgat taagttggga atcagccgct gtgggagtgg agtatttatt tatgggagac 781 ggtgactgga ataaatggca aatcacacag ggcaattatg attctcaata tgccagcatt 841 cgtgcactga atgcagtagc aaaactagaa cgacaaagcg ggaggtacaa gcaacaaaat 901 gctgtaccga tatagaagca ccaagacttc aacaaggtca aattccaacc cccgcttcca 961 ctcaataaga agcgggggtt tgtttataag gagtcaagct gtgacaagcg cgatgcaagt 1021 tttagaacaa tccgtggtgg tgttttatca aaatgatttg ccactgtgtc gggttaatat 1081 cgagcaagcg attgtattgt tagtaacaga taaagctgat ccactatatt tttatcgcca 1141 
aagcggatcg gaagttcact cacccacctg caagcaaacc tggaataaca ggagagcata 1201 agcaatgctg aaattaacat acactgaaac gagtttttgt ttagagtgtt tagttcaatc 1261 gttagaagaa tgggtgcaag cgcgagtgat tttggcactg cgagtaggga gatctctgtg 1321 tgttgaaccc agcactgctt cctttttact tcctgttaat ttgccaggag tagaaatact 1381 taaagctctt gtcaatagag agaacagtga aattatcgct ctgtgcaatt gtgatcacga 1441 gtgtgtggaa gtgactctgc gaggttattg gctatcaaat ggttctcagg atgctgaggg 1501 cgtttttatc accgcaatga gcgattcatt ttacggtgaa cactctgctt cacctactga 1561 gttttttctc cacaaacttt ggcaagaagc tcaacacaga gcctctgtta tgagcgagta 1621 atgagttgtt ggctttcagc ctggttgctg aaagctgaca actgtcattg ttcttttttt 1681 atgcctatag cctcacttat acaagtacaa tcgaataagt cagggcgtgt atttaataac 1741 aaataaatga agaaaagagg ttttaaatgt gttaaaaaat attccagttt tatgaatttt 1801 tttattctaa ttttgaaaaa aatcttgaga tttagacgtc tcctcataat tgaatgcgaa 1861 tttatgtata agcgattccg ttgatagtaa acattttgtg ccatgagcag acgttttctt 1921 cctcctcttt ctccagatga tttgcctcca actcgggtga cacaacttga tgagttatta 1981 ctgagacaac tagaagaagc tatctgcaaa cgctttttca aagcttgtag ttcaatgact 2041 cgggctttat tgtctaattg tcactggtat ttccaaataa acggtggcag tttaatactg 2101 gtcattgtta gctatgacat ggaaagttat tggcatatag caaatgctat tcctcagata 2161 gttaacaaat taaagctatt ttccaatact gccaaaatcc gtttctgtcc tccatctaaa 2221 gaaggcatac cttgggaaat tgcggtagat gaaatatcgg atgaaaagga tgtgtcttaa 2281 cacatggtta agaatatcat caggtggcac tcttttgtag attgtgagtc cagcgccgca 2341 cgagggaaac cctcgtgctg gttctggcgt cagcgctatg cgctcactcc tgccgttcgg 2401 cacggggccg tggcgtcagc cataggcata ggacataccc gtaaaaatca aggcgtttct 2461 ttcaaaaacg ccttcccttc ttcttgggag cagagttatt atcatacaga gtttggttgg 2521 gcaaaacacg ctgagcgatt gtgcaaagct cacccccctg attggcaatt gggcgatcat 2581 tatctaaaaa ttagtattgc ttccgtagat aaaatgattt ataaactgat aaatatatag 2641 caatcttatt tgatttgtgc tcagctaggt acagtttatg ttctctgttc cctgttaaga 2701 gttccctgtt aagagttccc taactcaact agtaagttca aaaccaaacc ggattcctat 2761 aaattgatac atatataatt tctattttca gctggtagta gcaaatacaa gtaagttagt 2821 gttagattgt 
gctagaactg ggaataagtc aaacaatcct gagctaaata ttcacaagca 2881 aactggtaaa tgctgtgcca agaaaaacaa gttgtcttct cataggaaaa cagcggcata 2941 gagatttatt tttacaagga ttgacacttg gtaggatcat accaagaaaa actcataatt 3001 accaactaaa ttggtaaaaa tatgccaaga aaagaacaag gatggattac ttttcaaacc 3061 tcagaggagg agcgaaaaat tctagaagag ttttcaaagc actctcagcg cactaaaaca 3121 gaaattttgc gggaactggt gcgtagtctc aacaaggagt ctcaacaggc cccatcacta 3181 ccaactcaac agtatgaaac agaggaggct tggacttcta ggatagaagt ttttggtcgc 3241 aagaaaccac tcaaagtcag ctcccggaat attcttaaag gaatcattaa acgagttgtg 3301 actggggctg ttaacactga agtcacgcta gagattgttc acaaagttga gctaacctca 3361 attatcacaa gagtgtcggc agacgagcta gatttgtctg agggtacaga agcttacgca 3421 gtgatcaagt ccaacgatat tgttattgct agggattgac atcttccccc actaacctta 3481 tagttcgtca gtgacaaaat tatacaaaag ccctcactcc tgattcctct tccaagttgg 3541 aaaaggactt aggggtgagg gtacttttag aagtattaaa tttttactaa gcttcgactt 3601 tcacacctct ccagaaggcg acataacctt caatttgctt tgctttctcc tttgcatcag 3661 gataatacca agcagcgtcc ttgttgagtt gtccatcaac ttcaacattg tagtaactgg 3721 cgagaccttt ccaaggacaa gttgtgtggg tgttgctttc cttgaagtac tgcttgttaa 3781 ttgagtcagg gggaaagtaa tggttgcttt ccacaactac ggtgttatcg ctttcggcta 3841 acacagcacc attccagatt gcttttgcca tagatgacaa gtgctaataa cgctacacat 3901 tacattatga ttcttctggt gagcatcttg cccacgcata aaatggctac actgagcaaa 3961 tttttatcaa ggaaaattca aggcattttg gattgacatt ggaagatttt cagatcactt 4021 gtcaattgct gtaggtactg ggagcaaaaa taattatcag taaaaccagc ctcactgcat 4081 attagcagct gaggtcactg ataattagga ggaggtattt tctttatgca ttcagagtgt 4141 cattttcttg atgttcaaga gccagatggc aataagtaca ccattaagct agaacaagac 4201 cgctttacct taggacgaag taatagcaat gatataatcc ttcctaactc agataaaaca 4261 atctcacgga accattgtgt cttagaatgt aaggcaaact gctggtgggt ggtagatgag 4321 tcaagtgcga atggaacttt cgtgcagcga tgcaatcgtg atactcgaat cgatgtgcgc 4381 cagcatggga aataccaact taacaatggg gacaccatcc tcattcttgg caagttcttg 4441 gaggaaaatg aacctgtctt ttggaggcta acctttcgag attctgaaca aactgtccaa 4501 cctccagcat tcttagagta 
cagcttaagt caacaaaagc tttttttagt caaaagtagt 4561 gagcaacaag agattcactt aagtcgaaga gaacggaatc tcattcacta tatggcagaa 4621 cgcaatcagg caaataacaa taagcctgtt gtgtgtgagt atcaacagat gataacagca 4681 atttggggag attcgtgtgg tcacacaaat aatgatgtta cacgccttgt ctgggttatc 4741 cgcaaaaaag tagagtcaga tcctggtgaa cctgagtttc tcataacgga gaaggcaaga 4801 ggttacagcc tgaaagttaa actgattgcg tgagtcaata acaacaactg ccatgtcttt 4861 aaggaacagt ggtctttgtt aagagactgt tcccagaaac ctaaatactg tacccaatat 4921 ttggcatagc gctcattttt tagccttaaa catatcttaa tactcgtact ttagcttaga 4981 agcagattgg aactgttctg cgacacatgg gctatgagcg atcgcagact caaactgcta 5041 gccaaattct tcaacaagac ttgggatagg gattatggtt caagagcaac ctagcaaata 5101 taacaatggc aaatcaagca agccaacgaa ctcaatttca aacgagaaaa tagagcaggc 5161 gttagaacag tcacttcaaa atcagcgagg cattgtgagt aattggttgc ctggttatga 5221 gttaaaaggt ggtaggtaca ttattcaaga agaattgagt acaggaggct ttggtgttat 5281 ctatcgcgcc aaagatagcc gaaaaaatca ggttcttgtt atcaaagttc tcaaaagcga 5341 cttatataac gaacaagaat ctgacagttt tgaagaaaac tttgtcaaag aagcaattaa 5401 tctcgcaatc tgtaaaaatt tgtatattgt accatttaag gatctgatta agacatcggg 5461 taaatggtgc attgtaatgg aatatattga gggtgaagac ctgaacaaat gggttagaaa 5521 aaatggtcct ctttccgaag ctgatgcttt acgttatatt cgccaaattg gtgaagctct 5581 aaaagtactt catgaacata atcttttaca tcgagatgtt aaccccaaaa atattattag 5641 gagagcaaat gggttagaag cagttttaat tgatctggga attgctcgaa agttttctca 5701 taatcagaca gagcagcata ctccaatttt gacacaggac ttcgcaccaa ttgagcagta 5761 tagagaaaag tataaaaggg gagcttacac tgatgtatat gccttagcag caacactgta 5821 ttttttgttg acaggaaacg caccaattgc ctcttggagg aggttaaatg gagagccact 5881 gaaaaccccg cagcagatta agggtaagca gtatatcagc gagcaggtaa atgatgccat 5941 tctcttagga ttgcaaatat ttccacaaga gcgccctgaa acaatacaag aatggctaga 6001 aaaactacca cctgatgaaa gcaaaattga ggaagctaca taccaatata accgtacgac 6061 aaacacaaac aaagaagaaa gagatgtttc caaacgaaca caaccaccaa acgatactca 6121 acccaaccaa ccgaatccac ggaaattatt gagtctgccg gctgcattca cgagtcctgc 6181 tagctttatg gtagcgctgg caattttcag 
ttcattagga accagtttta ttagcgctgg 6241 gttgtggctg ttactaacaa tttgtttcat ttatgttgat ttttttatta ctaaaagatg 6301 cacaattaaa cagcaaattg gtttagtgat tagtacccta agtcccccac tggtaatttt 6361 actttatatt tttgctttac gacattggaa ggcacaagaa gtggaaccgt taggttgggc 6421 cccagaaaca ttttattacg gtggtagcac gacttgggca cccatccggc gtgacgttga 6481 cccacagatt caaaaatttt ttcctaactt taagttgaag cttgttaatc caaaaagcgt 6541 caatcccggt tcggaaactg gtattaagca attgttaggg ttgttagacg gggaggagag 6601 gctaaccttt gctcagtctt ctcgaccgtt tacaggagat gaatataaga aagctcatac 6661 acagggattt cttttgaaac agatcccagt ggcaattgat ggcattgtag ttgcagtcaa 6721 tcctaaatta aacatctctg gtttgactat ttccgaactc aaggctattt atcaaagtaa 6781 aattaccaac tggaaccaag tcaaaggttc aaataatcta aatctaccaa ttaagcccta 6841 ctcacgtaaa aaagaggttg gcggcacggt taaaatcttt gtggaagatg ttttagaaaa 6901 ttcagatttt ggcaccacag tagagtttaa ggaagacaca acctcagctc ttaaactagt 6961 acaaagtgat atgggcagta tctactatgc ttcggctccg gaggtggtta aacagtgcaa 7021 tatcaaagca ctaccactag ggtatcgatt taaccaacta gtttctccct atcaaaaacg 7081 gtttctatct tataaagatt gcatccaaga agggcgcaac cgagtaaata ttgaagtatt 7141 tcgcaacaag cagtatcccc ttattcgtaa gttgtatgtc atccttaaag aggatcatcc 7201 cgatcaacaa gcaggagaag cttatgcaaa tttgctactg actgctgagg gtcaacaact 7261 gctggaaaaa gctggatatg ctagcttata tgacactcaa gctaattgag tactgtaagt 7321 taagcatcaa tcacaacctg ttctacagac attgactatc atgtaaaaca agcaatctgc 7381 ctcctttgca acacagtctc ttatgtcaca gaatcccaat ttagaacaag cgctaaaata 7441 ttactttggc tacgataact ttcgcctcgg acagcgacaa atcattgagc aagcgctaca 7501 aaatcgggat ttaatgattg taatgccgac tggtggcgga aagtcgctgt gttttcaact 7561 accagcacta ctcaaaaaag gtttaactgt ggtggtgtcg ccgctgatag ctttgatgca 7621 agaccaggtg gaatcactgc gagataatgg gattgccgca acatttctta atagcagtct 7681 gaacgcttat aaagtgcgat cgcgcttggg ggcaccaaaa gctttttccc gctactgcac 7741 acaaactggc acgcttgttc taccaaatat ggacgatcgc aggactttct gccgaccttg 7801 gcatggacta ctatcaaaaa cggtagccca acatacgttc gtatagcccc cggaggccaa 7861 ggagttcggc atgggtaccg gactcgacaa tctcaccccc 
ctgcatcaca aaaatctggt 7921 cgcagtctac ggcggtttcg agccgatgcg tgactaagat gatggttttc aagcggcgta 7981 gacgctggag ggtctgtatc acccactgct cgtgctttaa gtccagcgaa ctcgtgggtt 8041 cgtcgaggat gagtatagga gcgttagtga ggagcgcgcg cgcaatagct aggcgctgcc 8101 gctgcccgcc tgagaagttc tgaccgccct cagtaactat ggtatcatac tgctcgggca 8161 aggtttcgat gaactcggct gccccggatt cgatcgccgc ttcttgaatc tcgtggaact 8221 cagcatctgc gcgaccgtag gcgatattct ccgcgattgt tccggggaag agtgggctgc 8281 tttgcgtgac gatcgccatg tgtttgcgga cgtcacagac cttaaaagtg cggaggtcaa 8341 caccgtcgag gaggacgtga ccctccatcg ggtcgtagaa gcgcggcagg agattgagca 8401 gcgtgctctt tcccgtcccg cttggaccca aaaacgccac catctgaccc ggctcgatag 8461 tagcactgat cttccgtaag acgggccgcc cagggctgta ctcgaagctc acgtcgcaaa 8521 gggctagtgt gcggggacgg acgggaagcg gctgcgcgtc aggctcgtcc gcgatcgcag 8581 gcgcaaggtc aattactgca aagacgcgat cgcatgaggc tacgaatgtc tggatcttag 8641 tagtgaaccc caacacccaa cctaacgggt cccagaactg cgcgaggtaa gccatgaata 8701 cgatcagatc gccgatggtc aagccgttcg gcaccggacg caagaactgg tcgcgataga 8761 ccaggtaccc gccgtagccg aagattaccc cgcctcccag ggcgaagatt acctgaacgg 8821 cgagcggata gagggtttcg agccaattga gccgcatcga ggcagcgacg ctttgctcga 8881 ccacctgcgc gaagcgccgc gactcggtcg cttctcgacc gaaagactgg atcagcccga 8941 tcaactctat agcttgctgc atggtggagg tcatgagcgc gtcggtacgc ttcgagttcg 9001 tagctcgacg cttaatctct ccctcgaagt accaattcgc aagtatcaga agcggcgtga 9061 acgaaagtgc gaacacggtg agaaggacgt ggcgcgacag catgattccg atcatcgccg 9121 taagggtgac cgatgcggca gtggagccga tgagtacatc gattaccccc cacggtccga 9181 ggctgtcgta gctcaaccga tagatcgcgt cgccctgcga ctgggagttg tgccagccaa 9241 gactgagcgc ctgcagtttg tcgtacagtt cggtccgaac ccgcatggtt ccgttgtact 9301 gaatgcggta gttcaacatc ttacgcagca ttatgacggt atcgctcagg atcttgatga 9361 tcatcccgac gatcgccatt ccgaaaattc ggttcagcct gccttcgccg aacggggcaa 9421 gaaacagcgt atgaatccag ttgggtttcg gggttggtga gagaaccgta tcaactagaa 9481 tcgctatggg ccaagcgttg agaaggttca aaaccaccga tgccccgatc agaaccaaga 9541 gcgtggcgat cagccataag tccttgcgga agtaggagat ggatcggagg 
acgatctgca 9601 tgtggcgaac ctcatatctt gtaccaggtt agcttgtacc aggttacatt gatgtgcaga 9661 ccatagcgtt tggcacaggc tttggactag atgccagaat cccacccatg cccgcgaagc 9721 ctaggggtgg gagtgtcaat attgattatg atgtaaaaca ggcacaatgc ctcgtttgca 9781 acacagtcac ttatgtcaca gaatcccaat ttagaacaag cgctaaaata ctacttcggc 9841 tacgataatt ttcgcctcgg acagcgacaa atcatagaac aagcgctaca aaatcgggat 9901 ttaatgattg tcatgccgac tggcggcgga aagtcgctgt gttttcaact accagcgctg 9961 ctgaaaaaag gtctcactgt ggtggtgtcg ccgctgatag ccttgatgca agaccaagtg 10021 gaagcactgc gtgataatgg aattgcggca acatttctta atagtagtct gaatgcttat 10081 aaagtgcgat cgcgcgaaga aaacatcctc agtggtaaag tcaaactcct atacgtcgcc 10141 ccagaacgcc tgctgagtga aagatttctc ccctttctcg acttagtcca tcatcaaatt 10201 ggtatttctg cctttgctat tgacgaagca cactgcgtct ctgagtgggg acacgacttt 10261 cgtccagaat atcgccaact gaagtcactg cgcaaacgct acccagatat tcccgtcctt 10321 gcgttgactg cgacagcaac tgagcgtgtc cgtgctgata ttatccaaca acttgggtta 10381 aagcaaccaa gcatccatat cgccagcttc aaccgcacaa atctttacta cgaagtccgt 10441 tctaaaacga agtatgctta cgctgaattg ttggaactcg tcagggaaac agaaggttca 10501 gcaattgtct attgcttgac gcgcaaaaaa gttgatgaac tgactctgaa actgcaacac 10561 gataaaattt cagtcttacc ttatcatgcc ggattatctg atgaagaacg caccaaaaat 10621 caaactcgat ttattcgtga tgatgtgcgt gttatggtgg caacaattgc ctttggtatg 10681 ggaattaaca aaccagatgt gcggctagtc attcactttg atatctcccg caatctggaa 10741 agttactatc aggaatcagg gagagcagga agagatggag aaccatctcg ctgtactctt 10801 ttctacagtt acggtgatgt caaaacaatt gaatttctca tcaaccaaaa accagatcct 10861 caagaacagt tgattgctaa acatcaactg cggcagatca tagactacac tgaaggaaca 10921 gattgccgac ggacaattca gttaggatat tttggggaac gtttttctgg tgattgtggt 10981 aactgtgaca actgtcgtta tcccaaacca atagaagatt ggacaattga agccatgaag 11041 tttttatctt gtgtggctcg atgtaaagaa agatttggta tgggtcatat tatagatgtg 11101 ttgcggggag gaaaaaccca aaaaattact caatacgaac atgacaaact ttccactcac 11161 ggtattggta aagataaaac tgtagatgaa tggcgaatgc tggggcgttc tctcttgcac 11221 caagggttat tagaacaaac tgctgatggt tacgctgtat 
tg // LOCUS NODE_3056_length_11047_cov_5.529658 11047 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 11047) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 11047) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11047 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1134 /locus_tag="DP116_23120" CDS <1..1134 /locus_tag="DP116_23120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017877110.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23120" /translation="TLSHELRTPLNAILGWTQILRRGKIGPAELKKGLETIERNSRAQ AHIIEDLLDMSRIISGKVQLNVKNVNLRSVIEAAIESVLPSVQAKNIRLQTVFDALPI FITGDPNRLQQVVWNLLSNAIKFTHSGGRVRVLLEKMDKHVELIVSDNGQGIKSEFLP YVFDRFRQADSSTTRKFGGLGLGLSIVKQLVELHGGTVQAISPGEGQGATFTVLLPLV HRESNSNAPAVAYSFDDDRLTTTQCEETNLQSTKVLVVDDEIDAQELVKRVLEECGAN VLTASSVDEALELVQTQKPHVVVSDIGMPGKDGYEFIRRLRGLPSSMGGDIPAAAVTA FARFEDRIRALRSGYQTHVAKPVEPAELIAVVASLAGRHDSHI" gene 1249..1953 /locus_tag="DP116_23125" CDS 1249..1953 /locus_tag="DP116_23125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphatidate cytidylyltransferase" /protein_id="PRJNA477356:DP116_23125" /translation="MLFTALGISPLLGNLIATALTFVYVFGLVALLNIFVKNLGLPQD ISRKITHIGAGSVIGFLPLYSDLHWSKYLNVAIFVVWIVLLVQKGLFAQPDDEAVNTM TRTGDRRELLKGPLYFVIVATICGTLLYKTFPGIVAMTTLGWGDGVAPIIGSQYGKLK YTILSNKSWEGSFSMFVFAFAASVFFVWLIIPGQLNLIRILLLAFIATVVEGCSPKEI DNILIPVVVFVAASFI" gene complement(2206..3129) /locus_tag="DP116_23130" CDS complement(2206..3129) /locus_tag="DP116_23130" /inference="COORDINATES: protein motif:HMM:PF04471.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23130" /translation="MYHSLYCEQQIAIPYSSLRLKKVIMSLARKFNWTVIDDKQFEEI VYEIVKAKNPIQIIWRSDTGGKGRDIQATFLIQDTFGEFVQEIYFIEAKHYQSGVSPT DIMPALSWAMAEKPSVFVIATSSHLTNPCRDFIDSWKKNNPNVRVNIWERKDIESFIL SKTSTRKAAVSLGILPPSINDILPENPKQARDNPLYPAIAYRYLMTEDEISQIDDFKF FLEDVKQTISGNCDKYTPFEIDMGLYNWSVFLNYLQAQIGLQLSLMNYILALENNASI EELQMLSAKVKESAEKITDDNESKPRWLLID" gene complement(3147..6023) /locus_tag="DP116_23135" CDS complement(3147..6023) /locus_tag="DP116_23135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="excinuclease ABC subunit A" /protein_id="PRJNA477356:DP116_23135" /translation="MSNSQQIATSLNGHLPNSRDNQNTIRIRGARQHNLKNIDLELPR DRLIVFTGVSGSGKSSLAFDTIFAEGQRRYVESLSAYARQFLGQVEKPDVEAIEGLSP AISIDQKSTSHNPRSTVGTVTEIYDYMRLLFGRAGEPHCPICDRCIAPQTIDQMCDRI LELPDRTRFQILAPVVRGKKGTHTKLLSGLASQGFVRVRVDGEVRELSDAIDLDKNVT HTIEVVIDRLVKKPSIQERLVDSLTTCLKQSSGIAAIEIIKDTDNQDLPSELVFSENF ACPEHGAVMDELSPRLFSFNSPYGACSHCHGIGSLRRFSPELVVPDPKAPVYSAIVPW SEKENSYYIELLYKVGQEFGFELQTLWSQLTEEQQNVILHGTAETHKSQNSGFKGVLP MLQRQYDGSSELIKQKLEQFLIDQQCPVCKGKRLKPEALAVRLGQYQILDLTGSSIRE CRERIDKLELSQRQMQIADLVLREIRARLQFLLDVGLDYLTLDRPAMTLSGGEAQRIR LATQIGSGLTGVLYVLDEPSIGLHQRDNGRLLKTLTKLKDLGNTLIVVEHDEETIRAA DHLVDIGPNAGIHGGNIVAEGDLQALLKSEESLTGAYLSGRRVITTPAERREGNGLSL VIKNARRNNLRNIDVEIPLGKLVAVTGVSGSGKSTLINELLYPALQHHLTRKVPFPKE IDAIKGLDTIDKAIVIDQSPIGRTPRSNPATYTGVFDVIREVFAETIEAKTRGYKRGQ FSFNVKGGRCEACSGQGVNVIEMNFLPDVYVQCEVCKGARYNRETLQVKYKDKSISDV LNMTVEEALEFFKNIPKAQAKLQTLVDVGLGYIHLGQPATTLSGGEAQRVKLGTELSR RATGKTLYLIDEPTTGLSFYDVHKLLDVLQRLVDKGNSILVIEHNLDVIRCADWLIDL GPEGGDKGGEVIAVGTPEEVAQNSRSYTGQYVKQVLQQYSRAKV" gene 6236..6878 /locus_tag="DP116_23140" /pseudo CDS 6236..6878 /locus_tag="DP116_23140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008053754.1" /note="frameshifted; Derived by 
automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" gene complement(6885..7295) /locus_tag="DP116_23145" CDS complement(6885..7295) /locus_tag="DP116_23145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020483082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" /protein_id="PRJNA477356:DP116_23145" /translation="MIVVDTNVIAYFFLQGEQTQQAEAVYEQDPQWVAPYLWRSEFRN VLALYLRQGYLCLEDAIQIYQEAETLMQQGEYAIDAIAVLQLTFESGCSAYDCEYVVL AQQLGVPLITADRKLIASFPLVAIPMATFLQNLA" gene complement(7292..7540) /locus_tag="DP116_23150" CDS complement(7292..7540) /locus_tag="DP116_23150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011112154.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_23150" /translation="MCMVTLTLKNIPETLYDKLKQNAAHNRRSLNSEILVCLEQAVAT PKLKNTEVLDRVRTLRQKTVNHLLTQTELSEAKNEGRL" gene complement(7604..7813) /locus_tag="DP116_23155" CDS complement(7604..7813) /locus_tag="DP116_23155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006757575.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_23155" /translation="MAQLIVRNLDEEVVRRLKLRAAQNGRSAEAEHRAILEAVLLERS SQSSLKQLLQSMPDVGEDTDFLLST" gene complement(7980..>11047) /locus_tag="DP116_23160" CDS complement(7980..>11047) /locus_tag="DP116_23160" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23160" /translation="NPALLTVNPSALFFNQIAAAARIENNSASAGKNPAGFDAFGLRV PDGKSLLLVGGNISMQGGRLNAYGGRVELGGLISPGNVGLNVDGNKLSLDFPASSTRA DISLTNGASVFVQAGGGGDIVINSRNLEVSGSQLIAGFGEGLGNAGAKAGDITLGATE QIKVVNSYIFNNVGAGAAGNGGNIRINADSLSVFNGAQLSAATLGKGDAGSVIIDARK NVSFDGVRDRQYPSAAFGDVGQGGEGNGGGIYITTGELSVTNGAQLQASTYGKGDAGN VIIDASKRVSFDGVVDGVYTFNSYAASYVVNGAQGKGGNIQITTGELSVTNGARLQTY TNGKGDAGNVIIDASKDVSFDGSSIFSIVGQKGEGNGGNIRINADSLSVFNGAQLQAI TNGKGDAGNVIIDARKDVSFDGSSIFSIVGQTGEGKGGNIRINADSLSVFNGAQLQAT TNGKGDGGSVIINARKNVSFDGVRDRQYPSAAFGDVGQGGEGNGGGIYITTGELSVTN GAQLQASTYGKGDAGNVIIDASKRVSFDGVVDGVYTFNSYAASYVVNGAQGKGGNIQI TTGELSVTNGAQIYADTAGKGDAGNIQIKANNSISVFGTSSISGSSSALFTSTTSTGK GGNIIVDTNAFRTSDGGVLNAQTFNDGEGGSITVTAKLVEVLNGGQLLATSSGNGRAG KITVNATDRIIVNGTDATYVDRVKKFGTKVPNAFATKGVDNVSVDEVNKPASGLFVRS QSSGSAGDIEVTSPQIRLDNSGRFIAESASGNGGNITLQIRDLLLLRGGSLISATAGT AQAGGNGGKIEINTPKGFIVAFPGQNNDITANAFNGSGGVVKIKAAGIYEIKPLSRDD LEGLPPTDRDPRKLPTNDITAVSQEGGPQLDGLITINNPNADLSYSFVSLPADVFDPS KQIAQGCSAFDQPNASDFKVTGRGGLPPSPDEPLSSDAVWEDTRLGATTAQRLDSKTT ATKPKSESDTVEIIPATGWVFNGKGEVTLVSNGSSTNNLGTTPATCLAR" BASE COUNT 3045 a 2530 c 2253 g 3219 t ORIGIN 1 accctctctc acgaactccg aacaccgctg aatgcaatcc taggttggac tcagattttg 61 cgtagaggaa aaattggtcc tgcggaattg aaaaaggggc tggagacgat tgagagaaac 121 tcgcgagcgc aagcacatat tatcgaagac cttttggaca tgagccgaat catctcgggt 181 aaagttcaac ttaatgttaa gaatgttaat cttcgctcgg ttattgaagc agcgattgaa 241 tctgtccttc catctgtgca agccaagaac atacgacttc aaacagtttt tgatgcgctt 301 cctattttca taactggtga tccgaacaga ttgcaacaag tcgtgtggaa tcttctttcc 361 aacgccatta aattcaccca ttcgggtgga agggtgcggg tgttgcttga aaagatggat 421 aaacatgttg agttaatagt cagtgataat ggtcaaggca tcaaatctga gtttttgcca 481 tacgttttcg accgttttcg gcaagcagat tcttccacaa cacgcaagtt tggcggtttg 541 gggttagggc tttcaattgt caagcaactc gttgagcttc atggaggaac cgtacaagca 601 ataagcccag gagagggaca aggagctaca tttactgtcc tgcttccact cgttcataga 661 gaaagcaata gtaatgcacc 
agcagttgca tacagttttg atgatgacag actcacgacg 721 actcaatgtg aggaaacaaa ccttcaaagt acgaaagttc tcgtagttga tgatgaaatc 781 gatgctcaag agttggtcaa gagagttctt gaagagtgcg gggctaacgt cctgactgct 841 tcctcagtgg atgaggcgct tgagcttgta caaacccaga aacctcacgt ggtagtgagt 901 gatattggta tgcccggaaa agacgggtat gagtttatcc gcagactgcg aggacttcca 961 tcatccatgg gaggagatat tccagctgca gcagtcaccg cctttgctcg ttttgaggat 1021 cgaattcgtg ctttgcgctc tggctaccaa acgcatgttg ctaagcccgt tgagccagca 1081 gaactcatcg cagttgtcgc gtctcttgct ggtcgtcatg actctcacat ttaggttatc 1141 acggatgagc tgattatttt tggtgaactt ctagactcag cgccttcgac acagctactc 1201 tacttcataa gggaaatttg ccgtgtttaa atcttaggtg ctgagtctat gttgtttact 1261 gctttaggta ttagtccgct actaggaaat ctcatagcaa cagcgctaac gtttgtttat 1321 gtgtttgggc tagtagcgct gttgaatatt tttgtaaaaa atttaggact accgcaagat 1381 atcagccgta aaattacgca tattggtgct gggtcggtta ttggtttttt gcctctttac 1441 agtgatttac attggtcgaa gtacttgaat gtggcgattt ttgtggtgtg gatagttctc 1501 cttgtacaga aagggctatt tgctcaacca gacgatgaag ctgttaatac tatgactcgc 1561 acaggcgata ggcgtgaact tctcaaagga ccgctttatt ttgtgattgt ggcaactatt 1621 tgtggaacac tgctgtataa aacttttcct ggaattgtcg caatgacaac tctcggttgg 1681 ggcgatggtg ttgcaccgat aattggctct caatatggca aactcaaata cacaattctt 1741 agtaacaaaa gctgggaagg aagtttttca atgtttgttt ttgctttcgc cgccagcgtc 1801 ttttttgttt ggctgattat accaggtcaa ctaaatctta tccggatttt attgctagca 1861 tttattgcca cagttgttga aggatgtagt ccaaaagaga ttgataacat ccttattcca 1921 gtagtggttt ttgtggcggc aagttttatc tgatgaaatt atcaacatac tctataccag 1981 tcctaaatga ttcgtgaaac tttttttgat tggcgttctt ggcgacgcca gtcgcctcaa 2041 gtcgggagac ccgcccacgg cgctggctcg tctatgtcct gcggacacgc tgcgctaagg 2101 cggtttgtaa aaaaaataat tttcacaaat gaaataggat tgctatagct gtaaagttat 2161 tactgcaagt tgtagttggg gaaaacaaac tgcattatta gattattagt caataagcag 2221 ccaacgaggt ttagattcat tgtcgtctgt aattttttct gcgctctcct taaccttcgc 2281 agataacatc tgtaattctt ctatactagc attgttctct agagctaaaa tgtaattcat 2341 gagagataac tgaagtccta tttgtgcttg 
taaataattt aaaaatacac tccagttgta 2401 aagccccata tctatttcaa aaggtgtata tttatcacag tttccagaaa tagtttgttt 2461 gacatcttct aaaaaaaatt taaaatcatc tatttgggaa atttcgtctt ctgtcatcaa 2521 gtagcgataa gcaattgctg gatataaagg attatctctg gcttgtttag ggttctcagg 2581 taaaatatca ttgatagatg gaggtagtat tcctaaacta acagcagctt ttctagtaga 2641 agttttagat aaaatgaagc tttcaatatc ttttctttcc caaatgttaa ctctgacatt 2701 tggattattc ttcttccaac tgtcaataaa atctctacaa ggattagtta aatgactgga 2761 agtagcaata acaaaaacag atggtttttc tgccattgcc caactcaaag ctggcataat 2821 atctgttgga gacacaccag attgataatg ctttgcttca atgaaataaa tttcttgaac 2881 aaattctcca aaagtgtctt gaatcaggaa tgtagcttga atatctctgc ctttacctcc 2941 tgtatcactt cgccatataa tttgaattgg atttttagct ttgactatct catatacaat 3001 ttcttcaaat tgtttgtcgt caattacagt ccagttaaat tttcttgcta gtgacataat 3061 tacttttttt agtcttaagg aagaataagg tattgcaatt tgttgttcac aataaaggga 3121 atggtacatt atgtaccacc cacacactat acctttgctc gggaatactg ttgcaacacc 3181 tgcttaacat actgcccagt ataggaccta gaattctgag ccacctcctc tggtgtacct 3241 acagcaatca cctctccccc tttatcgcca ccttctggtc ccaaatctat caaccaatca 3301 gcacaacgaa tcacatctaa attgtgttca atcactaaaa tcgaattgcc tttatccacc 3361 aaacgttgca atacatctaa caacttatgg acatcataaa aagataaacc tgttgtcggt 3421 tcatcaatca gataaagtgt cttacctgta gccctgcgag aaagttctgt tcctaacttc 3481 acccgctgcg cttcaccacc agataaagtg gtcgctggtt gtccaagatg aatatatccc 3541 aacccaacat caaccaaagt ttgcaattta gcctgagctt tgggaatgtt tttaaaaaac 3601 tccaaagcct cctcaacagt catgttgaga acatcagaaa tagacttgtc tttgtacttc 3661 acctgcaaag tttcgcggtt atatcttgca cctttgcaaa cttcgcactg tacataaaca 3721 tctggcaaaa agttcatttc aatcacattc acgccctgtc cgctgcaagc ttcacagcgt 3781 ccacctttaa cattaaagga aaattgccct cgtttataac ctcttgtttt cgcttcaatt 3841 gtttctgcaa acacttcgcg aataacatca aaaacacctg tataagtcgc tgggttagaa 3901 cgtggtgttc taccaatagg tgattgatca atgacaatgg ctttgtcaat agtatctaat 3961 cccttaattg catctatctc tttaggaaat ggaactttgc gagttagatg atgttgcagt 4021 gcggggtaaa gtaactcatt aattaaggta gatttaccgg 
aaccagaaac accagtcacc 4081 gcgacaagtt tacccaaagg tatttctaca tctatattcc tgagattgtt gcgacgagca 4141 tttttgatga ccaaactcag cccatttcct tctctgcgtt cagctggagt tgtaatcacc 4201 cgccgtccag acaaatatgc acccgtcagc gactcttccg actttaacaa tgcttgcaaa 4261 tcaccttcag cgacaatatt ccccccatga atacctgcgt taggaccaat atcaacaagg 4321 tgatcagcag cacgaatcgt ttcctcatca tgctctacta caatcaacgt attacctaaa 4381 tcctttaact ttgttaaagt tttcagcaac cgcccattat ctcgttgatg caaaccaata 4441 ctcggttcat ccaaaacgta caaaacgcca gtcaagccag aaccaatttg agttgctaga 4501 cgaattcgtt gtgcttcccc gcccgaaagc gtcatcgctg gacgatctag tgtcagataa 4561 tctaacccca catctaacaa aaattgcaat cgtgctctaa tttctctcaa taccaaatct 4621 gcaatttgca tctgacgttg actcaactct aacttatcaa tcctctcccg acactcgcga 4681 attgaactac cagtcagatc caaaatttgg tactgtccta atcgcaccgc caacgcctcc 4741 ggtttcaacc gcttcccctt acacaccgga cactgctggt ctatcaaaaa ctgttctaac 4801 ttttgcttaa ttaactccga acttccatca tactgccttt gcaacatcgg cagcacccct 4861 ttaaaccctg agttctgcga cttgtgtgtc tctgcagttc catgcaaaat gacgttctgt 4921 tgttcctcag tcaactgact ccacaatgtc tgcaactcaa acccaaattc ctgacctacc 4981 ttatacaaca attctatata ataggaattc tctttctccg accaaggaac aatcgcagaa 5041 tacacaggtg cttttggatc aggaactacc aactctggcg aaaatcttct taaactccca 5101 atcccgtgac agtgcgaaca agcaccataa ggagaattaa acgagaacaa gcgcggtgat 5161 aattcatcca taaccgcccc gtgttccgga caagcaaagt tctcggaaaa gaccaattct 5221 gagggtaagt cttgattatc tgtatctttt ataatttcaa ttgccgcaat tccacttgat 5281 tgtttgagac aagtcgtcag agaatcaacc aaacgctcct gaatagaggg ctttttcact 5341 agacggtcaa taaccacttc tatggtatgt gtaacatttt tatccagatc aattgcgtca 5401 gaaagttctc gtacctcgcc atccacccga acacggacaa aaccttggga agctaaacct 5461 gataacagct tggtatgagt tcctttttta ccccggacga caggcgcaag gatttgaaag 5521 cgggtgcgat ctggaagttc tagaatgcga tcgcacatct gatctatcgt ctgaggagca 5581 atacagcgat cgcatatcgg acaatgaggt tcacccgccc gtccaaacaa caaccgcata 5641 taatcataaa tctccgtcac cgtccccacc gtggaacgag ggttatggga agtagacttt 5701 tgatcaatgg aaattgctgg acttaaaccc tctatcgcct ccacatcagg 
tttttccacc 5761 tgtcccaaaa attgtcgtgc ataagcgctg agggattcta catagcggcg ttgtccttct 5821 gcgaaaatag tatcaaatgc tagggaagac ttgccagaac cagaaacgcc agtaaagaca 5881 attaggcgat cgcgtggcaa ttccaaatca atattcttga gattatgctg ccttgcaccg 5941 cgaatgcgga tggtgttttg gttgtccctg gagttgggaa gatgtccatt taaggacgtg 6001 gctatctgct ggctatttga catattggcg cgttgagaga aagcagtagg ggcgaaacag 6061 ttcttaatgt tatcattgtc agctttacta tggtagaaca atagtactgc tttaggtggc 6121 gatcgcccgt atgtgcccag aaggtttttt tgtgcgaaaa ccttctcccc aaatctccga 6181 tttggggcac tctgctccgc agagtggggc acagggcgcg ccaaaggcga tcgccatgac 6241 tactaccaca gctaaaaaac taacttttga ggaatttcta gaacagtacc cagatggctt 6301 aggcatttat gaacttgtga atgggcaaat tgtacaggtt gaaccgacta gagcgcataa 6361 aaatgtagcg cgatatcttg tcaagtcttt cgacagagag attgaacgtt tgggacttga 6421 ttacattgtt gacaaggata ttgtcgttaa aactgtaaca aattttctta aagagcaagg 6481 cagaaacccg gatgtgagtg tagtgagtgc atcaaagtgg aactcaaatg taatggcata 6541 tggtgctttg actgaaccaa tccagctagc tgtggaagtc gtctcaacga attgggaaga 6601 cgattacgtt gataagctag acgagtatca acgactcggt atccctgagt attggattgt 6661 ggattatttg gcgttaagcg tagctctccg aaggaatcgc ttctagaagt tatttgggta 6721 atcccaaagt ccctacgatt tttgtttacc aattggtaga agatcaatat caagtccaaa 6781 aatttacgag tgatgagcgc attgtatcgc ccacttttcc agaactagag ctaactgttg 6841 agatggttgt gagcgcaagc caaatgcgaa aactttgact cctgttacgc taagttttgt 6901 aaaaatgttg ccattggaat tgcaactaat gggaaactcg caatcagttt gcgatcagct 6961 gtaatgagag gcactcccaa ctgttgtgcc aagacaacat attcacagtc ataagcagaa 7021 caacctgatt caaaagtaag ctgtagaact gctatggcat caattgcata ctcaccctgc 7081 tgcatcaagg tttctgcttc ctggtaaatc tgaatggcat cctctaaaca gaggtatcct 7141 tgacgcagat acaacgccag aacgttacgg aattcggatc tccaaagata aggagccacc 7201 cattgtggat cttgttcata aactgcttct gcttgttgcg tttgttcacc ctgcaaaaag 7261 aagtaggcga tgacattcgt gtctacaaca atcacagcct tccctcgttt ttcgcctcag 7321 acaactctgt ttgtgttaac agatgattga ctgttttctg gcgaagagta cgaactctat 7381 ccaaaacttc agtattcttg agttttggag ttgctaccgc ttgttctaaa caaactagaa 
7441 tttcgctatt tagactacga cgattatgag cggcgttttg cttcagtttg tcgtaaagtg 7501 tttctggaat atttttaagg gttaacgtga ccatacacat cagggataac aatggaacta 7561 ttatggaacc ataatggaac cataataact caaaaagtaa atgtcaagta ctaagtaaaa 7621 agtcagtatc ctcaccaaca tccggcatcg actgcaaaag ttgtttcagt gatgattgag 7681 aagaacgctc aagcaaaact gcttccagaa tcgcccgatg ttccgcttcg gcagaacgac 7741 cattttgagc agctctaagc ttcaatcggc gaactacctc ctcatctaag ttacgaacga 7801 tgagttgtgc catagctttt ttttcacgct tgtgctaaga aatcttactt aatgatagga 7861 ttgatatcat ttttttctac tttttgttgt tgaacgtttc cacatcttgc ggggcttgtc 7921 ttcgacatcg ctctttgggt actgcgctgt gcgcggctct ccataattgg agtactgttt 7981 tatcttgcta agcaggtagc aggagtagtc cccaaattat tggtactgga gccattggaa 8041 accagcgtca cttcgccctt accgttgaat acccaacccg tagcaggtat aatttcaact 8101 gtatcagact cagactttgg ttttgttgct gtcgttttag aatctagtcg ctgtgctgtc 8161 gtcgcaccta aacgagtatc ttcccagact gcatcactgc tgaggggttc atcaggactg 8221 ggaggtaaac caccgcgtcc agtcacttta aagtcacttg catttggttg atcaaaagca 8281 gaacaacctt gggcaatttg cttggagggg tcaaatacat ctgcgggtaa ggagacgaaa 8341 ctgtaactga ggtcagcatt ggggttgttg atggtaatta gaccatctaa ttgcggtcca 8401 ccttcttggg aaacagcggt aatgtcattt gttggtaatt tacgcggatc tcgatcagta 8461 ggaggcaacc cttcaaggtc atccctactg agtggcttaa tttcataaat accagcagct 8521 ttgattttga ctacgccacc agatccattg aaggcattgg cagtgatgtc gttgttttgt 8581 cctgggaatg cgacgataaa acccttaggg gtattgattt cgatcttgcc accattgccg 8641 ccagcttgtg cagtacccgc agtggcagag atgagactcc caccgcgtag cagcagtaaa 8701 tcccttattt gcagcgtaat attgccacca ttacctgatg cagattcagc aatgaacctt 8761 ccggagttgt ccaaccgaat ttggggtgag gtaacttcaa tgtctccggc gcttcctgaa 8821 ctttgagaac gaacaaacaa accactggct ggtttgttaa cttcatctac actaacatta 8881 tctacacctt ttgtggcaaa agcatttgga acttttgtgc caaatttttt tacccggtca 8941 acataagtag catctgtgcc attaacaatt atccggtcag tagcattgac cgtaatctta 9001 ccggctcttc cattaccaga ggaggtggcg agcaattgtc ctccattgag gacctcaacc 9061 agttttgctg tcactgtaat actgccacct tcaccatcat taaaggtttg tgcattcaac 9121 
actccaccat cagatgtgcg aaaggcgttc gtatctacaa tgatattgcc acctttgcct 9181 gtagaagtgg tagatgtaaa taaagcacta gaggagccgc tgatggagct agttccaaaa 9241 acactgatag aattgtttgc cttgatttga atattacctg catccccttt tcctgcggta 9301 tcggcataaa tttgagcgcc attggtcaca gataattcac ctgtggtaat ttggatgttt 9361 cctccctttc cttgtgcacc atttaccaca tagctggcgg cataactgtt gaaagtgtag 9421 actccatcga caacaccatc aaaagagact cttttactgg catcgataat cacattaccc 9481 gcgtcccctt ttccataggt actggcttga agttgagcgc cattggtcac agacagttca 9541 cctgtggtaa tgtagatacc cccgccgttt ccttctcctc cctgtcccac atcaccgaag 9601 gcagcactgg ggtactgacg atcacgaacg ccatcgaaag agacattctt gcgggcattg 9661 ataatcacac tacctccatc cccttttcca ttggtagtgg cttgaagttg agcgccattg 9721 aagacagaga gagaatcagc gttaatgcga atgttcccgc ccttgccttc tcccgtttgt 9781 cctacaatgc tgaagatgga actgccatcg aaagagacat ctttgcgggc atcgataatt 9841 acattacctg catccccttt tccattggta atggcttgaa gttgagcgcc attgaagaca 9901 gagagagaat cagcgttaat gcggatgttc ccgccgttgc cttctccctt ttgtcctaca 9961 atgctgaaga tggaactgcc atcgaaagag acatctttac tggcatcgat aattacatta 10021 cctgcatccc cttttccatt ggtataggtt tgaagtcgag cgccattggt cacagataat 10081 tcacctgtgg taatttggat gtttcctccc tttccttgtg caccatttac cacatagctg 10141 gcggcataac tgttgaaagt gtagactcca tcgacaacac catcaaaaga gactctttta 10201 ctggcatcga taatcacatt acccgcgtcc ccttttccat aggtactggc ttgaagttga 10261 gcgccattgg tcacagacag ttcacctgtg gtaatgtaga tacccccgcc gtttccttct 10321 cctccctgtc ccacatcacc gaaggcagca ctggggtact gacgatcacg aacgccatcg 10381 aaagagacat tcttgcgggc atcgataatc acactacctg catccccttt tccaagggta 10441 gcggcactca gttgagcgcc attgaagaca gatagagaat cagcgttaat gcggatgttc 10501 ccgccgttgc cagctgctcc tgcgcccaca ttattgaaga tataactatt tacaactttt 10561 atttgttctg tagcacccag tgtaatatct cctgcctttg ctccagcatt acctaagcct 10621 tcgccaaaac cagcaatcag ttggcttcca gaaacctcta aattgcgcga gttgattaca 10681 atatctcctc caccacctgc ttgtacaaat acactagcgc cattggttaa agatatatct 10741 gctcgtgtac tagaagcagg aaagtccaag cttaatttat tgccatccac atttagcccc 
10801 acatttccag gagaaattaa tcctcctaac tcaactcttc caccataagc attcagtcgt 10861 ccaccttgca tgctaatgtt gccacccact agtagcaaac ttttaccatc cggaacacgt 10921 aaaccaaatg catcaaatcc tgctgggttt tttccggctg atgctgagtt attttcaatt 10981 cgtgcagcag ctgctatttg attgaaaaat aaagctgaag gatttaccgt tagcagtgca 11041 ggattat // LOCUS NODE_3087_length_10951_cov_4.837004 10951 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10951) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10951) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10951 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 144..1700 /locus_tag="DP116_23165" CDS 144..1700 /locus_tag="DP116_23165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012413103.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="deoxyribodipyrimidine photo-lyase" /protein_id="PRJNA477356:DP116_23165" /translation="MKILWFRRDLRLTDNECVAEASANDAPVLPCFIIDPWFYQKWTD VGKARVRFLFESLENLDQNLRSLGSRLYLFEGNSVNILQEMTQQLMQQGHKPKLYFNR DVQVEYGIERDSTIVNFYQQLNLDYHIGLNNFLQIDDDHRDQWFNEYYNYVRQISHPT PIHINTPQISFNLPQLTFTELKHKYHAFYETKKVYFKGGETQAQKTLDSFLTKRFYGY HWKLSRPWAAQQKATSHLSPHLTFGTISVRNVYQRTKVRAAELADTPKAEFSLKAFRD RLRWHDSFTQRLYFHPEIAYTNFYPEFDEYYRPDELTPQQQELFHAWQEGITGFPMVD ASMRQLKTMGWMNFRMRAMCATFLTINCGISWHHGAKHYMNCLVDGDLAINNWQWQMQ AGITNPLSDTFRIYNPNKNIEEKDSDLRFIYHWIPKLLGYSLPEILEGKYIEHGLYPS PILDWAQTRLVNGKIVSDIRKRVKQRLLIEGGEECENALATKTAVNKYVESKDKQYQQ FKKLESQLGQ" gene complement(1989..2255) /locus_tag="DP116_23170" CDS complement(1989..2255) /locus_tag="DP116_23170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114670.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23170" /translation="MSKEAVKVILWEENQKDFLKKFYDLQFIPRIGEEIYLKKENWIV TRIEHDLEVSEINVYMELKKKDKQHKGTKGSSQWLEQMHQNDGL" gene 2503..2748 /locus_tag="DP116_23175" CDS 2503..2748 /locus_tag="DP116_23175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB family transcriptional regulator" /protein_id="PRJNA477356:DP116_23175" /translation="MSIATITSKGQTTIPKEIREKLNLRPGDRIHFIIEPDGKVYIQP LNIQVEELSGILHKPEREPVSIEEMNEAIEQCAGNLS" gene 2745..3158 /locus_tag="DP116_23180" CDS 2745..3158 /locus_tag="DP116_23180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006623697.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="twitching motility protein PilT" /protein_id="PRJNA477356:DP116_23180" /translation="MNGLDTNVLVRYLVQDDIEQGRLAAEYIKQVKASGQTCFINNIV LCELVWVLKSSYKLGRSEIIDVLEKILRTDVFDFENRETAWLSVQDMKKGKADFSDYL IVKLNKQASCSETATFDTKLQKVEEIRLLSTYKNA" gene 3321..7771 /locus_tag="DP116_23185" /pseudo CDS 3321..7771 /locus_tag="DP116_23185" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997616.1" /note="frameshifted; too many ambiguous residues; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 4806..4815 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 5920..5929 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 7807..8070 /locus_tag="DP116_23190" CDS 7807..8070 /locus_tag="DP116_23190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997617.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB family transcriptional regulator" /protein_id="PRJNA477356:DP116_23190" /translation="MEITKVSNSGQVIIPEQLRKSHGWEAGQELIVIDTGDGILLKPK KPFAETTLNEVAGCLKYQGTPKSLDDMDDAIRQGVEESWHGGS" gene 8057..8446 /locus_tag="DP116_23195" CDS 8057..8446 /locus_tag="DP116_23195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006669891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PIN domain nuclease" /protein_id="PRJNA477356:DP116_23195" /translation="MVAVDTNIIVRLLTQDDEAQYQKSLEIFQTQTIFIPDTVILETE WVLRFAYKFKPVEICAALRKLFGLPNVNVSNASLVSQALQWHETGLDFADAFHLAQSQ NYAEFYTFDEKFVKKAKGITSCEVKLP" gene 8449..8634 /locus_tag="DP116_23200" CDS 8449..8634 /locus_tag="DP116_23200" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23200" /translation="MVSRCEFIANRCDGTNQHIKTGECLKTLRSDRPYENMNITGVKG LADAEITTLKATSTPGA" gene 8843..9073 /locus_tag="DP116_23205" CDS 8843..9073 /locus_tag="DP116_23205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316508.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB/MazE/SpoVT family DNA-binding domain-containing protein" /protein_id="PRJNA477356:DP116_23205" /translation="MNTAKLLMNGENQTVVLPKEFQFQGNEVYIKKIGNAVVLISKEN PWQTLFDATELFSEDFMENREQPSLEVKEALK" gene 9195..9419 /locus_tag="DP116_23210" CDS 9195..9419 /locus_tag="DP116_23210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AbrB family transcriptional regulator" /protein_id="PRJNA477356:DP116_23210" /translation="MKKIQNVSKQTQEKLNFQSDTRINFIIQPDSKVYIQPLNIEVEE LSGILHKPKRKPVSIDAMNQAVEQYVNNLS" gene complement(9431..9805) /locus_tag="DP116_23215" CDS complement(9431..9805) /locus_tag="DP116_23215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cytochrome c" /protein_id="PRJNA477356:DP116_23215" /translation="MENQITKPETLIWRIILTALAILLVVLVSIFAVRIMRTADPYVK SVLSLKGDPTLGHAIFQINCAGCHGWEADGRVGPSLQGVSKHKSPYGLIHQVISGETP PMPKFQPNPQEMADLLSYLETL" gene 10059..10172 /locus_tag="DP116_23220" CDS 10059..10172 /locus_tag="DP116_23220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316458.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome b6-f complex subunit PetG" /protein_id="PRJNA477356:DP116_23220" /translation="MVEPLLDGIVLGLIFVTLSGLFYKAYEQYKRGNQLGL" gene complement(10302..10847) /gene="rsmD" /locus_tag="DP116_23225" CDS complement(10302..10847) /gene="rsmD" /locus_tag="DP116_23225" /EC_number="2.1.1.171" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875762.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="16S rRNA (guanine(966)-N(2))-methyltransferase RsmD" /protein_id="PRJNA477356:DP116_23225" /translation="MSLRIYGNRPLKTLPGKETRPTSARVREAVFNIWQGTIEGCRWL DVCAGTGSMGAEALCRGASRVVGIDKSHRACAIIQQNWQNLASTEQKFQVMRGDVVQL LPKLSGQQFDRIYFDPPYASELYERVIEAIAHSGLLDPNGEIAVEHNPQDWNPPVIPS WEICREKVYGNTALTFYSVVD" BASE COUNT 3390 a 2130 c 2435 g 2976 t 20 others ORIGIN 1 gaattacact gtgttgcgtt cagaaatatc acctgttaaa gggacaaggg agacaagggg 61 gacaagggag agagagcact aagatgtatg aacgcaactt agtataagct tagctgacaa 121 gaaaattata gtacaattga accatgaaaa ttctatggtt tcgacgagat ttacgattaa 181 ctgataatga atgcgtagca gaagcatcag ccaatgacgc ccccgtattg ccctgtttta 241 tcatcgaccc gtggttttat caaaaatgga cggacgtggg taaagcacgg gtaagatttt 301 tgtttgaatc tttagagaac cttgaccaga atttacgctc tttgggtagt aggctgtact 361 tatttgaagg gaattccgta aatattctgc aagaaatgac tcaacaattg atgcagcaag 421 gacacaaacc gaaactttac ttcaaccgcg atgtacaggt tgaatacgga attgaacgcg 481 atagcactat cgttaatttc taccaacaac ttaaccttga ctaccacatt 
ggtctaaaca 541 actttctgca aatcgacgat gatcaccgcg accagtggtt taacgaatac tataactacg 601 ttagacaaat atctcaccca actcctattc acattaacac cccacagata tctttcaatc 661 taccccaact tacctttact gaactcaaac acaagtacca cgctttttac gaaacgaaga 721 aagtgtactt caaaggtgga gaaacgcaag cgcaaaaaac tttagactca tttcttacca 781 aaagatttta tggctatcat tggaaactct cccgcccgtg ggcggcacaa cagaaggcaa 841 cctcccatct ttcccctcat ttaacatttg gtacaatctc tgtgagaaat gtgtaccagc 901 gtacaaaggt acgggcagca gaacttgcag atacacccaa agcagaattt tccttaaaag 961 catttcgcga tcgcctgcgt tggcacgata gttttacgca acggttatat tttcatccag 1021 aaatagcata tacaaacttt tatccggagt ttgatgagta ttaccgacca gatgaactga 1081 ctccacagca gcaagaatta tttcatgctt ggcaagaggg tataactggt ttcccaatgg 1141 ttgatgcgag tatgcgtcaa ctcaaaacaa tgggttggat gaatttccgc atgcgagcga 1201 tgtgtgctac tttcttgact attaattgcg gtatctcttg gcatcatgga gcaaaacatt 1261 acatgaactg cttggtagat ggcgatttag ctattaacaa ttggcaatgg cagatgcagg 1321 ctggtattac taatcccctt agtgatactt tccgcatata caatcccaat aagaatattg 1381 aagaaaaaga ctcagatttg cgcttcattt accactggat accgaagtta ctcggttata 1441 gcttacctga aattcttgaa ggcaagtata tagagcatgg tttataccca tcaccaattt 1501 tagattgggc gcagaccagg ctggttaatg gcaagattgt ttcagatatt cgtaagcgtg 1561 tgaaacaacg cctactgatt gaaggtggcg aggagtgcga aaacgcttta gcaactaaaa 1621 cagcggtaaa taaatacgtt gaatccaaag ataaacagta ccaacaattt aagaaattgg 1681 agtcacaatt agggcagtaa cactagaggt gacaagcatg tgcataagtt aatttttctg 1741 cacctaataa aaggcaggtg agtaatcact acccgcctta accatcgggt ttgttaattc 1801 ctctcgtctt ctatgcccct cgcctttggt gctgagggat tttatttgag tttataaagc 1861 aagcatcccg aacacggaga gaatgcttgc tatgtggcgg ttacggatgt tttgatcgga 1921 ttaactgact accgcaaaag ctagattttt tgtacatcgg gatattgcac aaaaaactct 1981 accatccttt agaggccatc gttttggtgc atttgttcca gccattggga agaacccttg 2041 gttcctttat gctgcttgtc ttttttcttt aattccatgt aaacattaat ctcgctaact 2101 tctaaatcat gctcaattct ggttactatc cagttctcct tttttaggta aatttcctct 2161 cctattctgg gaataaactg taaatcgtag aactttttta gaaaatcttt ttggttttct 2221 
tcccataaga ttactttgac tgcttctttg ctcattgatt cagtagttta aggaaataag 2281 ggagcaagtt tatcttaagt aattaatttc tgaattactg cttatctcaa accaactttt 2341 gtcttattca tgtaatacgc taattcactt tagatagcaa tcaattttag agtgacgtgt 2401 tacgccttca ggcgtctgcg cagtgcagac gcctctggtt gactgtcagc tcgttaattc 2461 cttacaataa agtaagaatt attttaaaag taaggaaatc tcatgtccat tgctacaatt 2521 accagtaaag gtcagacaac tatcccaaag gaaattcggg agaaactaaa cttgcgtcca 2581 ggcgatcgca ttcatttcat tattgaacca gatggcaaag tctacattca gccgctgaat 2641 attcaagttg aggaattgtc tggtatcctc cataaaccag aaagagaacc agtctctatc 2701 gaagaaatga atgaagcaat tgagcaatgt gctggtaact tgtcatgaat ggactagata 2761 ctaacgttct agttcgttat ctagttcaag atgacatcga acaaggtaga cttgctgcgg 2821 agtacatcaa gcaagttaag gcaagtggtc aaacctgttt tatcaacaac atcgttcttt 2881 gtgaactcgt ttgggtactc aaaagttcct ataaacttgg caggagtgaa attatcgacg 2941 ttcttgaaaa aatcctgaga acggatgtgt ttgattttga aaacagagaa acggcttggt 3001 tatcggtaca ggacatgaaa aagggaaaag ctgatttctc ggactatctt attgtgaagt 3061 taaacaagca agcaagctgt agtgaaaccg ctacatttga cacaaaattg caaaaagttg 3121 aggaaatccg gttactttct acttacaaaa atgcttaaaa aatgtctgat aagtcacata 3181 gaatgcaccc ttagctgaaa taccaggcgt taacataagt ataagcaaca gaaaaatttc 3241 cccttttata caaaaaagct cctgtgatac cgattaatct tgactcactc atcagcgcca 3301 taactggtat tgccaaccca atcattaaag aaaagattca gcgtaacgaa atagtcatca 3361 aattattaaa gaaatttaac cttgatcctg aacatcctcc cgctgatttc agtggtgttt 3421 acgcttatac tttggtagaa tacggagtag gcaaacccaa gccatttctc gaactttttc 3481 gacaagaaga aataaaacaa gctttccgca aggcattaga ccataacaac ccctcaattc 3541 ttctctctga agttgatgct tttctcgata cttatgcgtt gggtgatgac atcaaaactt 3601 tgggaattga catcaggcga gaaatcgccg ccttcgcaac ggtttttatc gaagttgcta 3661 agcgcagccg gacacctgct gatgttttga tgaaccagca gataggttcc ctacataaaa 3721 gaatagcgag tatccaagaa caactccaaa ggctgccaac actcgaagga atgagaacag 3781 agatagcgcg gttagcagca cagagtgtag agacgtcgaa tgcaacgtct ccacatgaaa 3841 agaaatgtaa agccatagct ttagcccagc agatgcaagg ctggtttgaa acgttgggct 3901 accgctttga 
gaagtatgag gtttgggaag acaactattt tgagtggatt ctcaacatcc 3961 cggtacgacg taatcgattt gaccgcattc ttatacgtgg aattgaaggt gaggcgggac 4021 taagtgatgt catggctttg cgtcagtcag tagaggcgca aagaactgat gaagggtggt 4081 tagtgacggc gagacgtatt tcacgggcgg cgcgtgatga agtggagaaa ccagaaaatc 4141 gccagcttga ttgttatact tttgacgaac tcatagacca ggatgctgat tttagtgatt 4201 atctcgactg gctagaagct gaggtgaaac gccgagaaat tgacaaaaaa tatgtgccgt 4261 tggcttgtac taaagaagaa tttgactcac ttagcaagcg ccggatagga gttagtcgct 4321 acgatgaacg cgacggctgg attgatggtt atctcgattt gtggttagat gaccctgcaa 4381 aagagcatat ctccatttta ggggaatttg gcactggaaa aacttggttt gcgtttcatt 4441 atgcttggat tgcactacaa cgttatcggg atgcgcaaaa acgcggtgtt gaacgtcccc 4501 gtcttcctct gataattacg ctgcgcgact atgccaaagc actcaatgtt gagaatgttt 4561 tggcaggttt cttttttacg caacataata ttcgcttaaa tagcgaggtg tttgaccagc 4621 ttaacagcat gggcaaattg ctgctgattt tcgatggctt tgatgaaatg gcggcgaagt 4681 gcgatcgcca acaaatgatc aataactttt gggaattggc gaaagtcgtc gttcccggtg 4741 ctaaagttat cctgacttgt cgcaccgaac attttccaga agcaaaagaa ggacgtgctt 4801 tactcnnnnn nnnnngcgtc taccgccaac ctaacaggag aaacgcccca gtttgaagtc 4861 ctagaactag aaaaattcaa cgacgagcaa attcgtcagg tgttgtcctt ccaagctgaa 4921 tctaccacag ttgaccatgt gatggacaat ccacaactgt tggacttggc acgtcgtccg 4981 gtgatgacag aattaatttt ggaagcattg ccagatattg aagcaggtaa gcctgtggat 5041 atgtcgcggg tttacctgta tgcggtgcgg cggaagatgg aacgagacat taaagcagag 5101 cgaactttta cttctttggc agataagctc tactttttat ctgaactttc ctgggaaatg 5161 ctatctactg accagatgag tttaaattat cgcctttttc ccgatagaat tcgtcgctta 5221 tttggtgatg ttgttcagga ggagaaagat ttagaccact ggcattatga catgatgggg 5281 caaacgatgc tgattcgtaa tgccgatggt gattatactc cggcgcatcg ctcgctgtta 5341 gagttttttg tcgcttataa gtttgcggct gagttggggg cgttggctga tgattttatt 5401 gaattagcgc aggtgcagtt gcatataaat aatcatgcat caattgatta cacttggtca 5461 tcttattttc gtcgccaggt gaatgaggtg gggggaattt tgtcaagtgc accactgagg 5521 agatttgcaa gtgagtcgtt ggagaagttg agggaaagtt ttgggaaagc gcctttaaca 5581 aaagcggtga tagatctgct 
tttgccgatg cttgacctaa cccccctaac cccccttccc 5641 ttgtagggaa ggggctgggg gagaggtcac acaaatcgcc gctgctagaa atcctacaag 5701 cgacacgggg caaaaccgaa gccaaggttg gttatgttgg agggaatgcg gcgacgctgt 5761 tggtaaaggt tgacaaggcg gcgttggagg gtaaagacct ctctagtgct gtgattaaag 5821 gcgcagatct tactaaggct agcctgcgcc gtgtcaattt tgcggaggcg aatctggcta 5881 aatctgtttt tcccaaagtc ttcggtgcgg ttttttgtgn nnnnnnnnng cgtttagtcc 5941 agatggcaag ttgtttgcta caggtgatgc taattgcgag attagcttat ggcaagttgc 6001 agattgtaaa aaagttttgc tctgtaaagg gcataccgat tggatacgat cagtcgcctt 6061 tagttccgac ggtataactt tagccagtgc tagttttgac gatacattga agctttggga 6121 tatccgtacg ggaaaatgtt tgaaaacttt gcacggacat accaatcggg taaattcagt 6181 agcaatcaat cccaatggta caatcctagc cagtggcagt gatgaccaaa cagtaaagct 6241 ctgggatatt cacactggca aatgcttaaa aattttgcaa gggcatatcg gttcggtaca 6301 gtcagtggct gttagtgccg atggtacaac cttagccagt agcagtgatg accaaacagt 6361 gaagctctgg gatatccgca caggagaatg tctgaaaact ttgcagggtc atactaatcg 6421 cgtccggtca gtagcaataa gtgcagatgg tacaacctta gccagtagca gtgatgatca 6481 aacagtgaag ctctgggata ccaatacagg agaatgcttg aaaacgttac aggaacatac 6541 caactgggta cggtcagtaa cattcagtcc tgacggtgca accctagcca gtagcagtta 6601 tgacaaaaca gtgaagctgt ggaatactca tactggagag tgtttaaaaa ccttacaggg 6661 acatactagc tcagtacggt cagtaacatt cagtcctgac ggtgcaactc taattagtgg 6721 tagcgatgac caaacagtga agctatggag tatcaatacc ggtaaatgtc tgaaaacctt 6781 acagggtatt atttgtccga taaggtcaat agcaataagt tcagatggta caaccttagc 6841 cagtagcagt gaagacccaa cagtgaagct atgggatacc actaccggag aatgcctgaa 6901 aaccttgcag gggcatacca gtcgagtaaa ttcagtagta attagtcctg atagtacaat 6961 cctagccagt agcggttatg accaaacaat aaagctatgg gatatccata ccggagaatg 7021 cctgaaaact ttgcagggat ctaaaagagc tgtaaattca gtggctttca gtcctaacgg 7081 tacaactcta gctagtggga gtgcagacaa aacaataaag ctatgggata tccataccgg 7141 agaatgtctg aaaactttgc aaggacatac cgactggtta tggtcagtga cattcagtcc 7201 taacggtaaa gttttagcta gtggaagtta tgatcaaaca ataaagctat gggacatccg 7261 cacaggagaa tgtctgaaaa ctttacaagg 
acataccact aggaataatt ctgtagcaat 7321 cagccctgat ggtaaaattt tagctagtgg tggtcaagac caaatggtaa agctatggga 7381 tatccatagt ggtgaatgcc taaaaaccct gcaaggacat accagttgga taccctccgt 7441 tgtctttagt cctgatggta agacttttgc cagtgcaagt aacgataaaa cagtcaggct 7501 atgggatatt cacagtggaa aatgcttgaa aattttgcag ggacacaccc attgggtaaa 7561 ttcagttatt ttcagcacgg atggtcaaac tcttgttagt gggagttggg atgaaacaat 7621 aaagctttgg gatgttaaca cgggtgagtg tctaaaaact ctgatagata gaccctacga 7681 aaacatgaat atcacaggtg ttaagggttt aaccgaagcc gaaatcacca cccttaaagc 7741 attaggcgca gtggaagaag gagaaatgta aaattaaaaa taccagataa gcaaaccttc 7801 ccacacatgg aaataacaaa agtatcaaac tctggacaag taattattcc cgaacagtta 7861 cgaaaatctc atggctggga agcaggtcag gagttgattg taattgatac aggtgatgga 7921 attcttctca agcctaaaaa accttttgca gaaactacat taaatgaagt tgcaggttgc 7981 ttaaaatatc aaggcacacc aaaatcctta gatgatatgg atgatgctat ccgtcaaggt 8041 gtagaggaat catggcatgg tggcagttga tactaacata attgtgcgtc tattaactca 8101 agatgatgaa gctcagtatc aaaaaagtct agaaattttc cagacacaaa cgatttttat 8161 cccagatact gtgattttag aaactgaatg ggtgctgagg tttgcttata agttcaaacc 8221 agttgaaata tgtgcagcct taagaaaact ttttggttta ccaaatgtta atgtgagtaa 8281 tgctagcctg gtttctcaag cactacaatg gcatgaaact ggtttagatt ttgccgatgc 8341 atttcatcta gctcaaagtc aaaactatgc tgagttttac acctttgatg agaagtttgt 8401 gaaaaaagct aaggggataa ctagctgtga agtaaaacta ccttgagcat ggtaagtagg 8461 tgtgagttta tagctaatcg gtgcgatggt acaaatcaac acattaaaac aggtgagtgt 8521 ttaaaaactt tgagaagcga tcgcccttat gaaaacatga atatcacagg tgttaaaggc 8581 ttagcagatg ccgaaatcac cacccttaaa gccactagta caccgggcgc gtaaataaac 8641 tacccattcc aaatcaatga aacgcttgcg ctgtatacct gacagatgag tagaaaaaag 8701 gaacagagcg agaaagggcg atcgcctgac ttctaaaaac gcttacgcct ctatatagct 8761 ataatattgc acaaagaaat taagtgaact ttcccaacac ccatgacagt tcaagaaatc 8821 aatagaacag agagattcaa atatgaatac tgctaaatta ttgatgaatg gtgaaaatca 8881 aaccgtagta ttgcccaagg aatttcagtt tcaaggtaat gaagtttata tcaaaaagat 8941 aggcaatgcc gtagtcttaa tttcaaaaga aaatccttgg 
caaactttgt ttgacgcgac 9001 tgaactcttt tctgaggact ttatggaaaa cagagaacag cctagcttag aggttaaaga 9061 ggctttgaaa tgagattttt gctgaatacc aaaaagtgtt taaattgact caaaaatcag 9121 ttccagtttt agtccatttt aatggacttt gcctattagc ctgggactta tagtcctagg 9181 cggacgaaaa cggagtgaaa aaaatccaaa atgtaagcaa acaaacccag gagaaactga 9241 acttccaatc ggacactcgc ataaatttca ttattcaacc agacagcaaa gtctacatac 9301 agccgctgaa tatagaagtt gaagaattat ctggtatcct ccataaacca aaaaggaaac 9361 cagtttctat tgacgcaatg aatcaagccg ttgaacaata tgttaataac ctgtcataat 9421 tggctgtaag ttaaagcgtt tctaaataac tcaacaaatc tgccatttct tgagggttgg 9481 gttgaaattt cggcatgggt ggagtttcac cactaatgac ttggtgaatt agaccatatg 9541 gggatttgtg ctttgagact ccttgtaaac taggaccgac acgtccatct gcttcccagc 9601 catgacagcc cgcacagttg atttgaaaga tggcgtgtcc gagtgttggg tcccctttga 9661 gggatagaac gcttttgacg tatggatcgg cagttctcat aatccgaaca gcaaaaatgc 9721 tcacaaggac tactagcaga atcgccagcg ccgttaatat gattcgccaa atcagagttt 9781 caggtttggt aatctggttt tccaaatgtt tacttttaaa gtcaaagtgt acaaaaagtt 9841 ttcatttcat acacatagct taaaagttct tgatggtgtc tgcaagcaca gacaaatgtt 9901 tttttgctaa ctcagcaggg gtagaggtga ggcaaatcag gcagatgagg ggatgagggg 9961 gatgaagagg atgaagaaga tgaggggatt tctgccaaat ttggtagaat caatgacggg 10021 aagaaaatat tgaacagtag caccaaagga gatttaacgt ggttgaaccc ttgcttgacg 10081 gtattgtact aggtctaatc tttgtaacct tgtctggatt gttttacaaa gcttacgagc 10141 aatacaaacg tggcaatcaa ctcggtctat aaagttatcc aaaccgagga cacagggata 10201 tagcaccttc tgttggaggt aggggagatc ataagaaatt ctgaattctg aattttgaat 10261 tttgaatttt gaattttgaa ttgatttatc tccttgtccc ctcaatccac aacgctgtaa 10321 aaggtcaaag ccgtgtttcc gtaaactttt tcacggcaaa tttcccaaga aggaataact 10381 ggaggattcc aatcttgagg attatgttcg actgctattt caccattagg atctagaagc 10441 ccagagtgag cgatcgcctc tatcacccgc tcatataact cacttgcata cggcgggtcg 10501 aagtaaattc tgtcaaattg ctgtcctgat aattttggca aaagctgtac aacatctccc 10561 cgcattacct gaaatttctg ttcagtactc gctaagtttt gccaattctg ttggattata 10621 gcacaagctc gatgggattt atcaattccc acaactcggc 
tggctcctct acacaaagcc 10681 tccgcaccca ttgaaccagt tccagcacac acatccagcc aacgacaacc ttctattgtt 10741 ccctgccaaa tattaaaaac ggcttcgcgt acccgcgcac tggtgggtct ggtttctttc 10801 cctggtaaag tttttaatgg tcgattaccg taaattctta agctcattag tcatttatag 10861 cagggaacag ggaacaggga atagggaata gggaacaggg aacagggaat agggaacagg 10921 gaacagggaa cagggaacag ggaacaggga a // LOCUS NODE_3096_length_10922_cov_4.982148 10922 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10922) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10922) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10922 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(25..480) /locus_tag="DP116_23230" CDS complement(25..480) /locus_tag="DP116_23230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491506.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_23230" /translation="MIRLLVVDDQDIFRQDLATLLSASADLDVVGQASHGREALALTQ HLQPDVILMDVRMPVYDGVTATREIIQRYPWIRILVLTTFDDDEYVWQSLQAGALGYL LKHTPIEQIVTAIRSVYLGYCQLGPTIVPKVVAQLKPNPSRPKDDYYYL" gene complement(477..1649) /locus_tag="DP116_23235" CDS complement(477..1649) /locus_tag="DP116_23235" /inference="COORDINATES: protein motif:HMM:PF07730.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor histidine kinase" /protein_id="PRJNA477356:DP116_23235" /translation="MKLNMFSSPSFCTILRYVEWVMMIGVTLGFILDGMFQVSITTTL QVFFCLCGYTWLSLVFPIYRPLWQRQAYVLSGIILALFMRVTGIGIELFLYLYIAKSC FLLNRKNLILTVIFTVLLGVLTFVLALPEYVQLNSALCVNLEEHKQIQQMILSYLSDN LVVTVFIIAFSFMVIVEQKSRKRAEALAEQVETLATTLERTRIARDIHDSLGHSLTNL NSRLTVAQQKLRQHDIDGVCEVVDTAQFLASQCIEDVNRSLKMMRQSDFNLNQALTTL LEQLRHNQALSVQWEINLPQLSLQTSHHIYCIVKEGLINIQKYAHASAVRFRGLSTPE GILIELADNGQGFDPKIPHLGLGLQGIEERVQLLRGQLTINSTPGSGTQIQIILPL" gene 1739..2440 /locus_tag="DP116_23240" CDS 1739..2440 /locus_tag="DP116_23240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YdcF family protein" /protein_id="PRJNA477356:DP116_23240" /translation="MIPRLRWKRFWSGLGTVLLVVYFSATFPLTIAVAKKGLVTFIPP DPGTTADAIVVLGRGVPFRDSRVEVVAELWRNHRAPFIFASGLGDGSEIVQQLKAKGI PDTALGEEHCSQTTKENALFTASILQPRGIKQILLVTDSPHMLRSRLTFKSVGFEVMS HTSPIPSEFTPTKKAMLMFYEYMGLVSYGLRGDFLKHNLPQQKNPPVAKVKNSANFNL QKQHIEDNRDISYPL" gene complement(2526..3065) /locus_tag="DP116_23245" CDS complement(2526..3065) /locus_tag="DP116_23245" /inference="COORDINATES: protein motif:HMM:PF08872.8" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23245" /translation="MLSNQFQRLEQEDDVLFLGDKIFNAVQIVKKVLEYFEPKGNDLI FSCQDKFVKKYFKKRNITGIFNRVEWEFSLKPEIQCELLVPNNNGRQKGKLEIRVTLE FSPFKKQFSSVKDGGSLSHSLSITDSKTSEDFEIKVSLDFYPEERLLEGIYHQERTTE SELRIFPTDTLRVSPFGTR" gene 4414..5769 /gene="dnaB" /locus_tag="DP116_23250" CDS 4414..5769 /gene="dnaB" /locus_tag="DP116_23250" /EC_number="3.6.4.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873813.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="replicative DNA helicase" /protein_id="PRJNA477356:DP116_23250" /translation="MSEELNFQGHGMDRLPPQNIEAEEAILGGILLDPEAMSRVSDRL IPEAFYLNPHKDIYQAAVRLHTQGKPTDLLAVINWLNDHDMLSRVGGRNKLISLVDRT VSAVNIDALADLVMEKYLRRRLIKGGNEIVHLGYQTETELPTVLDEAEQKVFSITQER SQVGLVHISNTLINTFQDIETRHQGIALPGIPCGFYDLDGMTSGFQRSDLIIVAGRPS MGKTAFCLNLAHNIAASYKLPVAIFSLEMSKEQLAQRLLASEAGIETGYLRSGRISQT QWEPLSRAIGMLSDMPIHIDDTPNITVTEMRSQARRLQTELNTELGLIIIDYLQLMEG AGDNRVQELSKITRNLKGLARELNVPVIALSQLSRGVESRTNKRPMLSDLRESGSIEQ DADLVIMLYRDDYYNSDTADRGIAEVIIAKHRNGPTGTVKLLFDPQLTKFKNFARPNN Y" gene complement(6024..6551) /locus_tag="DP116_23255" CDS complement(6024..6551) /locus_tag="DP116_23255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746397.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FKBP-type peptidyl-prolyl cis-trans isomerase" /protein_id="PRJNA477356:DP116_23255" /translation="MKAILLSMGFMLACVLVLVFAQISGNKQDIAVATQLTETTPAPI QVQENNTLIASKKNMSEANVVTTPSGLKYTELKEGTGATPKTGQTVVVHYTGTLEDGT KFDSSRDRGQPFSFKLGVGQVIKGWDEGLSTMKVGGRRQLIIPSDLGYGARGAGGVIP PNATLIFDVELLKVQ" gene complement(6632..6955) /locus_tag="DP116_23260" CDS complement(6632..6955) /locus_tag="DP116_23260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308854.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23260" /translation="MDNNNWMKQLLTIGLGTTSLVADKLREVSEQLVKDGKLDPEQAK AVMDDMVQQLKSEQGNWDSNMQRQLRNMLQDLGVPRQSEVDELRGRIDRLERQVRDLE NKLWR" gene 7254..7931 /locus_tag="DP116_23265" CDS 7254..7931 /locus_tag="DP116_23265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205921.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23265" /translation="MLEFNLPKYLPSAEELPDSDDTPVDNELQELIPGLLKAILLILW AERMDWFFGIDMGIYYHPDKPAIVPDGFLSLGVERVYDEELRPSYVLWDENVLPTLVL EVVSQNYRKEYSTKKEEYAALGVLYYVIYSSRRRRKPRLEVHRLVDDKYELQEGNPVW LPEVGLGIGCERGTYGGVTREWLYWYNEQGKRYPTPEERIQAAEQRTRRLAEKLRELG VDPESIG" gene 8082..>10922 /locus_tag="DP116_23270" CDS 8082..>10922 /locus_tag="DP116_23270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120403.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NTPase" /protein_id="PRJNA477356:DP116_23270" /translation="MDFLTAWGVSTAVGFLFQPVMKQFAQDLGKDLLKDILKDVFKAI PSQILQKLKKEEIDIAAGKALKEFLSLVQQELEDADIPDNEIREYYNEPLQKFLKDEK VKEILGTPFQDECNELDTRSLQTLWNEKKLLHLPRKFDWTKLGKRYVKKVKAIIQDSS ELRAILDSKNLEKIEKNTTETAGIIPDFNFKQYQEAIRERYANIKLDSIDTSGYAYNE LRLWRIFIPQNVREVHQVLPQVHELPKEHLRRLREKNELEAEEIAIEELERHKRDYFE QKVYSVKDIVEDKQNYKYIVILGDPGSGKSTLLQYLALDWVEQTLDQLHKKNHLPIPL LIELRAYMRRREDKECNNFLEFFHKCSGVIQNLNQHQLQEQLKAGNALVMFDGLDEVF DPGRREDVITDIHRFTNEYPNVQVIVTSRVIGYKAQRLRDAEFKHFMLQDLEPEQIQD FINRWHSDTFTDKIDGERKKERLQRAIDTSNSIAELAGNPLLLTMMAILNRNQELPRD RPELYNQASRVLLHQWDVERALVEDKRLDPKTIDYKDKQAMLRQVAYHMQTSEKGLAG NLISANDLERILTEYLKTIEVDKAREVARVMINQLRTRNFMLCFLGADYYAFVHRTFL EYFCAWEFVWQFKETRSLGIEELKTQVFGKHWHDETWHEVLRLIAGMIEPKFVGEILE YLMVQNGESEKFSNLFLATKCLSDVRNRTVIAKTAQQLLNRFKDLTKYDLNYYYDPRQ DEEERKLVQKIRTEAVAAIATTWKDDADTFAILKTRATADNSFVRRAAIEALASNYKD DAQTLAILKTRATDNDDWDVRYAAVQALASNYKDDAQTLAWLKTRATVDDNSGVRRAA VKALASNYKDDAQTLAFLKTRATADDSSGVRSAAVEALVSNYKDDAQTLAFLKTRATA DDSSGVRSAAVEALVSNYKDDAQTLAFLKTRAT" BASE COUNT 3396 a 2168 c 2250 g 3108 t ORIGIN 1 gccacagggt tttcatctca cccactataa atagtaataa tcatcttttg ggcgtgaggg 61 attgggtttc aactgagcaa cgactttcgg aacaatcgtc ggtccaagtt gacagtatcc 121 cagataaact gaacgaatag ctgttacaat ttgttctatt ggagtgtgtt tcaataagta 181 accaagtgcc cctgcttgta gtgactgcca gacatactcg tcatcatcaa atgtcgtgag 241 tactaatatc cgtatccaag gataacgctg aataatctca cgagttgccg tcactccatc 301 gtaaactggc attcggacat ccatcaaaat aacatcaggt tggaggtgtt gcgtgagggc 361 gagcgcctcg cgtccatgac tcgcttgtcc aactacatcc aaatccgctg aagctgataa 421 tagtgtggct aagtcctggc gaaagatatc ctgatcatca acaactaaaa ggcgaatcat 481 aacggaagta tgatttgaat ctgcgtacct gaacctggag tactattgat ggttaattga 541 cctctcaata gttgaactcg ttcttcaata ccttgtaatc ctaagcctaa atggggtatt 601 tttggatcaa acccttgacc attatctgct agttctatca agataccttc aggcgtagac 661 aaacctcgaa aacgaaccgc agaagcatga gcatactttt gaatgttaat tagtccttct 721 ttgactatac aataaatgtg atgactagtt tgcaatgaca 
gttgtggtaa gttaatctcc 781 cattggacgc tcaaagcttg attgtggcga agctgttcga gcagagtcgt taaagcttga 841 ttgagattaa aatccgattg acgcatcatt ttcaacgagc gattgacatc ctcaatacac 901 tgactcgcca aaaattgagc tgtatctaca acttcacaaa ctccatcaat atcgtgttgt 961 cgaagttttt gttgggcgac agttaaccga ctgttgaggt tcgttagact atgtcctaga 1021 gaatcgtgaa tgtcgcgtgc aatccgagtg cgttctagtg tagttgctag ggtttccact 1081 tgctcagcta aagcttcggc acgttttcga cttttctgtt ctacaatcac cataaaacta 1141 aaagcaatta tgaagacagt gacgacgaga ttatctgata gatagctcaa aatcatctgt 1201 tgaatttgtt tgtgctcttc tagattgaca cataacgctg aattcaattg aacatattca 1261 ggaagtgcca aaacaaaggt taatactcca agaagaacag tgaatataac agttaatatc 1321 agattttttc gattcagtaa aaagcagctt ttagcaatgt acaagtagag aaataattcg 1381 attccaattc ctgtcaccct catgaagagt gctaatataa ttccggaaag gacatatgct 1441 tgcctttgcc aaagcggacg gtaaattgga aagactaaac ttaaccaagt atatccacaa 1501 agacagaaaa aaacttgtaa agttgttgtg attgaaactt gaaacattcc atcaagtata 1561 aaacctaaag tcactccaat catcatcacc cactcgacat aacgcaaaat cgtgcagaat 1621 gatggggaag agaacatatt cagtttcatg gcaagcaaag cagcaaagat tgcctaaaag 1681 atactaaaaa ctagagtaga gtattccctt tgaatttctc tatgacattt taccttggat 1741 gattccacgc ctgcggtgga agcgtttttg gagtggtttg ggaactgtgt tactcgtggt 1801 ttatttttct gctacttttc cactcaccat cgctgtcgcc aaaaaaggat tagttacttt 1861 tattcccccc gatcctggta caaccgcaga tgcaattgtt gtactcggac ggggggtacc 1921 atttagagat tcaagggttg aggttgtagc tgaactgtgg aggaatcatc gcgcaccatt 1981 catttttgcc agtggtttag gagatggttc tgaaattgtc caacaactca aagcaaaagg 2041 tattcctgat actgcgttag gagaagaaca ctgttctcaa accaccaagg aaaacgcact 2101 gttcaccgca tcaatcttac aaccacgggg aatcaagcaa attctgctag tcacggattc 2161 tccacacatg ttgcgttcgc gactcacttt taaaagtgtg ggctttgaag tcatgtcgca 2221 tacaagtccc ataccatctg aattcactcc gaccaaaaaa gcaatgctca tgttttatga 2281 gtatatggga ttagttagct acggtttgcg aggggacttt ctcaaacaca acttacctca 2341 acagaaaaac ccacctgtcg cgaaagttaa gaactcggca aattttaacc tgcaaaaaca 2401 gcatatcgag gataacagag atatcagtta tcctctttaa gcacagctat 
gggtaagcga 2461 gtcgtacaaa agtccttctg tatttcctag ggagacgcta cgtgttcaaa gtatgcactt 2521 gcgactcagc gtgtcccaaa gggacttacg cgcagcgtgt ccgtagggaa tattctcaat 2581 tctgactctg tggttctttc ttggtgataa attccctcta gcaacctttc ctcaggatag 2641 aaatctaagg aaacttttat ctcaaaatct tctgaagtct tactatctgt gattgataga 2701 gaatgagata gactcccacc atccttgaca gaggaaaatt gttttttgaa gggagaaaat 2761 tctaaagtaa ctctaatttc aagtttccct ttttgtctac cattgttatt aggaacaagc 2821 agttcacatt ggatttccgg tttcagggag aattcccact caactcgatt aaatatgcct 2881 gtaatattcc tctttttgaa gtatttttta acaaatttgt cctgacacga aaaaattaaa 2941 tcatttccct ttggctcaaa atattcaagc acttttttaa ctatttgaac tgcattaaaa 3001 attttatctc ctaaaaaaag aacatcatcc tcttgttcca atcgctggaa ttggttactc 3061 aacatgttaa aataacctcc ttactaatcc cgagttcttg agattctctt acaatccttg 3121 ccacaaaatt gaatttcgtc aaataaagta catggtagat gcctagaagg cggtgataca 3181 gccctctgtg gcgggtttac cgcctcctgg tgactgaaag aacagccata ggactgtaga 3241 cgcggggagc gctgcttacc gctggctagg gtagagtagg gtaggataga cagacgaata 3301 tagacaatta atttacttta tttttctgcg tagattgcgt gttcaaaact acattttttc 3361 atattttact atactattca acttaattaa agtcttgtga gaaatagatt ttcctaacaa 3421 aagatgtaaa tatttacact tatttcctct tgttggaatg taaagacagg tcatattgta 3481 aaagttgtta tatatactca attattgatt tcagactttc acactgaagt agtcatcatt 3541 attgtatatt caagatttaa aaaaaaatta tgataaaaat atcccgtata ttttcttaga 3601 taaatgacaa acgttcagtg tatttactgt gtaaaaagga aatatatatc tttgtctagt 3661 ttcatcaaaa catgacataa gcataaatta agctaattac tctattgtta tttaatattt 3721 aatttttatt gaagaaataa ctttttgata actgacgaaa atatagcggt tctcatttga 3781 atcacacaat gtgaaagtgc atgtactttc actagctccg tgtgtattgc acccaacacc 3841 accaagcgct atattcactc tgttgaggat gttttagaag tattgtttta aatacgtgga 3901 tcttcgatat ccctctaaat ccacggctag tgctacaacg ggacgccaca tgcaccgact 3961 gtcgggcgtt gcgcgacgcc agtcgcactc aaagcgggaa cctccaaggg cgcttacgct 4021 ccccctgttg aggaggtgct gttcagatct acgacaattt tgtgtactca atgagacttt 4081 ccaaacatcc tcttgtgaag aagtttaacg cgttgcaaaa gtgctgtcga actaacagca 
4141 gggaaatatc ctgttgtgca tagtttgggg tttttatttc aaacaggtat gagcatattt 4201 gaatttctgt taattttgct atcttcacac actcctaaaa tggtgcgtta cactacatta 4261 acgaacccca cgcattcaat atttatttgt atagcagtta ttcatctcaa cgggtagaga 4321 cggttcaaac agcacacata ttgaaaatct ctattttact ttataagagt aaaaatttaa 4381 tcataatgtc aaatctcaaa tttcaactta tttatgtctg aagaacttaa ttttcaaggt 4441 catggtatgg atcgcctccc tccccaaaac attgaggcgg aagaagcgat attggggggg 4501 attttgctag atccggaagc gatgagtaga gtgagcgatc gcctcattcc agaagccttt 4561 tatctaaacc ctcataaaga tatctatcaa gccgctgtca ggttacatac tcaaggtaaa 4621 cctacagact tgcttgcagt gataaattgg ctaaacgacc acgatatgct aagccgtgtt 4681 ggcggtagaa ataaattaat ttcattagta gatcgtacag tgtcagccgt taatatcgac 4741 gccttagcag atttagttat ggaaaaatac ctgcggcgtc ggttaatcaa aggtggtaat 4801 gaaattgtcc atctgggtta tcagacagaa actgagttac caactgtttt agatgaagca 4861 gaacaaaaag tcttcagcat cactcaagaa cgttcccaag ttggtctggt tcacatttct 4921 aacaccctga ttaatacttt ccaagatatt gaaactcgcc atcaaggtat cgccctacca 4981 ggaattcctt gcggctttta tgatttagac ggtatgacga gcggttttca acgttctgat 5041 ttgattattg ttgctggtcg cccatcaatg ggaaaaaccg ctttctgtct caatcttgct 5101 cataacattg ctgcttcata taaattacca gttgctatct tcagcttaga aatgtcaaaa 5161 gaacaactgg cgcagcggtt gttagcaagt gaggcgggaa ttgaaactgg ttatttgcgg 5221 agtggacgca tcagccaaac tcaatgggaa cccttaagcc gtgcgattgg tatgctctct 5281 gatatgccaa ttcatattga cgatacgcca aatattacag tcacagaaat gcgaagtcaa 5341 gcccggcgtt tgcaaacaga attaaataca gaactaggac tcattattat agattacttg 5401 caattaatgg aaggagcggg cgataaccgc gtgcaagaat tatcaaaaat tacccgaaat 5461 cttaaaggtt tagcgcgtga attaaatgtc ccagtgattg ctctttctca gttgagtcga 5521 ggagtagaat cacgtacgaa caagcgtccc atgttatctg atttgagaga aagtggaagt 5581 attgagcaag atgcggattt agttattatg ttgtacagag atgattacta taacagcgat 5641 accgcagata gaggaattgc tgaggtgatc atagctaagc accgcaatgg tccaacaggt 5701 actgttaaac tcttgtttga cccacaattg acgaagttta aaaattttgc cagaccgaat 5761 aattattgaa aatgaatgag tagttcgtgg taagttattc agaactattt tattttatgc 5821 
atgttcaccc ggaaataaat ttccgggcta atagccgaag tccgttaaaa cggactaaat 5881 acaataaaag tcggttttaa ctgacttaag gtactcggtg tgggacttga gttccaggta 5941 gatttcggaa gaatgaaata accaagggtt tgttatcaat agttatgatt aataactaac 6001 aacaaactac taactatgaa ttattactga actttcagca actccacatc aaaaatcaaa 6061 gtagcgttgg gtggaatgac accaccagca ccacgagcac cataacctaa gtctgaaggt 6121 ataattaatt gacgacgacc acctactttc atagtactta gtccttcgtc ccagcctttg 6181 ataacttgtc caacacctag tttgaaacta aagggttgac cgcgatcgcg tgagctatcg 6241 aacttagtac catcttctaa agtcccggtg tagtgaacta caactgtttg tcctgtttta 6301 ggagtcgccc cagtcccctc ttttaactca gtgtacttaa gtccagaggg cgtggtgacg 6361 acattggctt cagacatatt ttttttgctc gcaataaggg tattgttttc ttgtacctga 6421 atgggtgctg gagttgtttc ggtcaactgg gtcgcaacag cgatatcttg tttgttaccg 6481 ctaatttgcg caaataccaa aaccaaaaca caagccagca tgaagcccat gctgagtaaa 6541 atcgctttca aaatgcatcc tccctactca gtattttgga aatgtaagat aaccagtagg 6601 acaaaactta gtttaaatca caaagcgacc cttaacgcca aagcttattt tctaaatcac 6661 gcacttgacg ctctaaacgg tcaattctac cacggagttc gtccacttcc gactgacgag 6721 gcacccccaa atcctgtaac atattccgca gttgtcgttg catattagaa tcccaatttc 6781 cctgctccga cttcaactgc tgtaccatat catccataac cgcttttgct tgctcgggat 6841 caagcttacc atccttaaca agttgctcgc tgacttcccg cagtttatct gctaccaagg 6901 acgttgtccc aagacctatc gttaataact gcttcatcca gttgttgttg tccataatcc 6961 ctttctatca cttttttcaa atcaaacgtc cctttctcat aaatctaccc tatagatcta 7021 tggtgcaagg ataattaata tagactacac ctctattctg acaaacacac acaaaggatc 7081 aaaagtgtga taatcgaaca attttaggtg aaagttccgc ccgaacaaac agtcaagtga 7141 gtatcaagtg cggaaatttc cgcagaaata agcaggcgag aattacccac tattcttaaa 7201 tttgaatgaa atttgaaatc gctattttag tcagttctgg gggaaaacca gtcatgttag 7261 agtttaacct gccgaagtac ttaccctctg ccgaagaatt acccgactct gacgatactc 7321 ctgtggataa cgaattgcaa gaattgatac caggcttgct gaaagcgata ctgctgatac 7381 tatgggcaga acgcatggac tggttttttg gcatagacat gggtatttat tatcacccag 7441 ataaaccggc aattgtgcca gatgggtttt tgagtttggg ggtagagcga gtttatgatg 7501 aggaattacg 
accaagttat gtcctatggg acgaaaacgt tctcccgact ttagtgctag 7561 aagttgtttc tcaaaattat cgcaaagaat acagcacgaa aaaagaagaa tatgcagcgc 7621 tgggtgtgtt gtattatgtg atttactctt cccggcgtcg ccgcaaacca cgtttggaag 7681 tgcatcggtt agtggatgat aagtatgaat tgcaagaggg aaaccctgtt tggctaccgg 7741 aagttggttt aggaattggt tgtgaacgcg ggacttatgg aggcgtaacg cgagagtggc 7801 tttattggta caatgagcaa gggaaacgtt accccacgcc agaagaacgt attcaagcag 7861 cagaacaacg aacacgaagg ctggcagaaa aattgcgtga gttaggagta gatccggaga 7921 gtattggtta aggcgatcgc atttcttgag ttaaggttag ttaaggcgat cgcatttctt 7981 gagttgtccg attgctaaaa ttaccttaaa ctcaggaaac atgactgacg ctggtgtata 8041 tactcttagt caagagcttg gcaaacaaga gtattacgcc tatggacttt ttaaccgctt 8101 ggggagttag tactgctgtt ggatttctct tccaaccagt aatgaaacaa tttgcccagg 8161 atttaggcaa agatttatta aaagatatac tcaaagatgt attcaaagct atccccagcc 8221 agattttaca gaaactgaaa aaggaagaaa tagatattgc tgctgggaaa gcacttaaag 8281 agtttttgtc actcgtccag caagaattag aagatgctga tattcctgat aacgaaatta 8341 gggagtatta taacgaaccg ctacaaaaat ttctgaaaga tgaaaaagtt aaagaaatcc 8401 tgggaactcc ttttcaggat gaatgtaatg aactagatac aagaagttta caaactcttt 8461 ggaacgaaaa gaaattgtta catttgccaa ggaaatttga ttggacaaag cttggtaagc 8521 gttacgttaa aaaagtcaaa gcgattattc aagactcatc tgaactgaga gcaattctag 8581 attccaaaaa tcttgaaaaa attgaaaaaa acacaacaga gactgctggc attatacccg 8641 acttcaattt caaacaatat caagaagcaa ttcgcgagcg ctacgccaat ataaaactag 8701 atagtataga taccagcggt tacgcctata acgaattgcg gctatggcga atctttattc 8761 ctcaaaacgt gcgagaagtt caccaagtct taccacaagt tcacgaactc cccaaagaac 8821 atctccgacg actgcgagaa aagaacgaac tagaagcaga agaaattgca atagaagaat 8881 tagaacgtca caagcgagat tattttgaac aaaaagttta ttcagttaaa gatattgtcg 8941 aagacaagca gaactataaa tatatagtga ttttgggtga tccaggttca ggtaaatcta 9001 cattgttgca gtaccttgca ttagattggg tagaacaaac acttgatcag cttcataaga 9061 aaaatcatct gccaattccg ttactgattg agttacgtgc ttatatgcgg cgacgggagg 9121 acaaagaatg caataatttt ctggaattct tccataaatg tagcggtgtc attcagaacc 9181 tcaatcagca tcaattgcag 
gaacaactca aggctggtaa cgccttggtg atgtttgatg 9241 gtttggatga agtttttgat cctggtaggc gcgaagatgt gattacagat attcatcgct 9301 tcactaatga gtatcccaat gtgcaggtga ttgtcacttc tcgcgtgatt ggatataaag 9361 cgcaacggct gcgagatgct gagtttaagc atttcatgct gcaagattta gaaccagaac 9421 aaattcagga ttttatcaat cgttggcatt ctgacacttt tactgacaag attgatggag 9481 agagaaaaaa agagcgacta caaagagcca ttgacacgtc aaactcgata gcagaacttg 9541 caggaaatcc cttattgtta acgatgatgg caattttgaa tcgtaaccaa gaattgccaa 9601 gagacagacc agaactttat aatcaagcgt cgcgggtgct gctgcatcag tgggatgtgg 9661 aacgtgcttt ggtggaagat aagaggttag atccaaaaac aatagattat aaagataagc 9721 aagcaatgtt gcgtcaagtt gcttaccata tgcaaactag tgagaaaggt ttggcaggca 9781 atttgataag tgcgaatgat ttagaaagga ttctgactga gtatttaaaa accatagaag 9841 ttgacaaagc tagagaagtt gcgagggtga tgataaatca actgcggact cgcaacttta 9901 tgttatgttt cctgggtgcg gattattatg cttttgttca tcggacattt ttggaatatt 9961 tctgcgcttg ggaatttgtc tggcagttta aagagacgcg atcgctggga attgaagaac 10021 tcaagacgca agtttttggc aaacactggc atgatgaaac ttggcatgaa gtcttgcggc 10081 tgattgcagg aatgattgag ccgaagtttg ttggggaaat tttggaatat ttaatggtgc 10141 aaaatggcga gtcggagaaa tttagcaact tgtttttggc gactaagtgt ctttctgatg 10201 tgaggaatag aacggtaatt gcaaaaactg ctcagcaatt actaaaccgc tttaaagact 10261 taaccaaata cgacttaaat tattattacg atcccaggca agatgaggaa gagagaaaac 10321 tcgttcagaa aattcgtacc gaagcagttg cagcaattgc aacaacttgg aaagatgacg 10381 ctgatacttt cgccatcctc aaaactcgcg ctactgctga taattcgttt gtgcgacgtg 10441 cagcaatcga agcattagcc agcaactaca aagatgacgc ccagacttta gccatcctca 10501 aaactcgcgc tactgataat gatgattggg atgtgcgata tgcagcagtc caagcattag 10561 ccagcaacta caaagatgac gcccagactt tagcctggct caaaactcgc gctactgttg 10621 atgataattc aggtgtgcga cgtgcagcag tcaaagcatt agccagcaac tacaaagatg 10681 acgcccagac tttagccttc ctcaaaactc gtgctactgc tgatgatagt tcaggtgtgc 10741 gaagtgcagc agtcgaagca ttagtcagca actacaaaga tgacgcccag actttagcct 10801 tcctcaaaac tcgtgctact gctgatgata gttcaggtgt gcgaagtgca gcagtcgaag 10861 cattagtcag 
caactacaaa gatgacgccc agactttagc cttcctcaaa actcgtgcta 10921 ct // LOCUS NODE_3103_length_10877_cov_5.363057 10877 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10877) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10877) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10877 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(100..543) /locus_tag="DP116_23275" CDS complement(100..543) /locus_tag="DP116_23275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="VOC family protein" /protein_id="PRJNA477356:DP116_23275" /translation="MHHASIRTGNIHRAIAFYEQLGFTVCERFTTGYTLACWMEGLNG RIELIQIPEPKPAPDAFADEHYVGYYHLSFDLTEITPDLSSWLTNLKERLWVLSKSDP DLLQPLKILLEPTQQQIGQKIYEVTFIADTDNLPLEFLRVLATLS" gene complement(656..1168) /locus_tag="DP116_23280" CDS complement(656..1168) /locus_tag="DP116_23280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214191.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR02652 family protein" /protein_id="PRJNA477356:DP116_23280" /translation="MMNPALQYPIFGAEIQCPHCRQTIPALTLTDTYLCPRHGAFEAN PETGELIHLQSGRHWRRWNGEWYRQHTHPDGIRFEIHEALDKLYTQGYRATRVVIARR YQELMSGYLERSTHWRSGQSEATSARLYGLPVEFSADSLDDPCWDVINFDLEKEPGVP VRYPYFRLFE" gene 1374..1904 /locus_tag="DP116_23285" CDS 1374..1904 /locus_tag="DP116_23285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315579.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="gamma carbonic anhydrase family protein" /protein_id="PRJNA477356:DP116_23285" /translation="MSIASYWSSPDISQAAFIAANAIVMGSVKIAAGVSIWYGAVVRA DVESIEIGECTNIQDGAILHGDPGKPTVLEDHVTVGHRAVVHSAYIEHGSLIGIGAVV LDGVRVGAGSIIGAGAVVTKDVPPLSLVVGVPGKVVRQISEAEAAELIEHAQRYKKLA LVHAGKGTDIGFKAPE" gene 2114..2242 /locus_tag="DP116_23290" CDS 2114..2242 /locus_tag="DP116_23290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013189929.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem II protein Y" /protein_id="PRJNA477356:DP116_23290" /translation="MSLPDMRLIIVLAPLLIAAGWAFFNIGAAALNQLQNFLNKEA" gene 2388..3662 /locus_tag="DP116_23295" CDS 2388..3662 /locus_tag="DP116_23295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional folylpolyglutamate synthase/dihydrofolate synthase" /protein_id="PRJNA477356:DP116_23295" /translation="MDIDSLLQPFQHFGVHLGLERIVKLLANLGNPHHQLPVIHVAGT NGKGSVCAYLSSILTQAGYQTGRYTSPHLVDWTERICLNEEPISSEEFCQLLLQVQAA IPQGEESPTQFEVITAAAWLYFAQQKVDVAVVEVGLGGRLDATNVCSKPLITIITSIS REHWQVLGPTVADIAREKAGILKLGCPAVIGLLPSEAEQVVRSRALELQSPLITPQPS QSISPGWAEYQTQNSLFRNSNLIKYPLPLQGQIQLTNSALALAAIEILQTQGWQISES AIIQGMANTKWLGRMQWTTWKNHKLLVDGAHNPAAAQVLRDYVNTLNVKSVSWVMGML STKEHEEIFKALLRQNDRLYLVPVPDHSSANPGELAKLASYICPELSFCHIYPELLSA LEAAFASTDDLVILCGSLYLVGHFLGTTDLGA" gene 3659..3904 /locus_tag="DP116_23300" CDS 3659..3904 /locus_tag="DP116_23300" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23300" /translation="MKGVRQERIILLKIEIVLNRWVEFSVGSSWIYWKQQNIKKTKKK MQSTPNKTSPSTIKLTEKEVISDIREYWALKGWRCVK" gene complement(4001..4084) /locus_tag="DP116_23305" /pseudo CDS complement(4001..4084) /locus_tag="DP116_23305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309811.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" gene complement(4195..5247) /gene="hisC" /locus_tag="DP116_23310" CDS complement(4195..5247) /gene="hisC" /locus_tag="DP116_23310" /EC_number="2.6.1.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313271.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidinol-phosphate transaminase" /protein_id="PRJNA477356:DP116_23310" /translation="MSYFRPAIDAMTGYIPGEQPKPGTKIIKLNTNENPYPPSPKAME VLHNLDGEWLRRYPDPFARDFCQAVSDALGVPADWVIVGNGSDELLNVLIRCCAEGNN RKVVYPMPTYVLYRTLSAMQPAQVLEVPYPADFQLPIKELVAANGVITFIASPNSPSG HVVPLDDLRELAQQVSGIVVVDEAYVDFAEYSALPLVQEFENVIVLRTLSKGYSLAGL RMGFGVANPKLLAGLFKVKDSYNIDAIAQAVGTAAMRDQAYKNACADKVKISRIQLAQ DLKNLGWEVLDSHANFVLATPTQGNAESIYLGLKERGILVRYFKQAGLEDKLRITVGT DEQNQTLIEALVSVKK" gene complement(5570..5863) /locus_tag="DP116_23315" CDS complement(5570..5863) /locus_tag="DP116_23315" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23315" /translation="MKRFFKKVIVCTFVMLSLCFLFQNPSFAKTSDSQAQPQVQESTS NPSSNLDKGEWNITSVPIRPLKPGFEWEVKSEENGKKLVILHNGEEALTLVKE" gene complement(6142..7455) /locus_tag="DP116_23320" CDS complement(6142..7455) /locus_tag="DP116_23320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874082.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sugar ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_23320" /translation="MVRFKKFKKLIIWALLGVLTSWIVSCGTSNVGTTTKQASSGVAA VQNVEFWTMQLQPQFTDYFKSLITSFESQNSGIKVSWVDVPWTAMENKILTAVSAKTP PDVVNLNPDFASQLAGRNAWLDLDAKVPKEVRSSYLPNIWQASTLNGKSFGIPWYLTT RLTIYNTDLLKQAGISKAPVTYTELAQAAQKIKDKTGKYAFFVTFVPQDSGEVLESFV QMGATLVDAEGKAAFNSPEGKAAFQYWVDLYKQKLLPKEILTQGHRRAIDLYQAGETA FLFSGGEFLENIGKNAPAIAKVSATAPQVIGDNGKKNVAVMNVVIPRDSKQPDAALKF ALFLTNDENQLTFAKAANVLPSTVKSLADSYFKDAPANASTLDKARVISAEELRQAEI LTPRLKDIKRLQKAIYENLQAAMLGEKSVDQAVEDAAQEWNSTRS" gene 7756..8874 /locus_tag="DP116_23325" CDS 7756..8874 /locus_tag="DP116_23325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015112022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pilus assembly protein PilM" /protein_id="PRJNA477356:DP116_23325" /translation="MKLFNSLFGKLHQGVGIELSPQRVNLVQLRKLRQGLKLETLTSV SLPEGVFVNGQIVDSPTMAQCIQQALVEGNIKASRVATAIPGRESIIRIISVPAELDD KELRHMVLNHEAGLHLPLPCEDADLDYQKLGYFIDDDGIEKIQILLVAIRKEITETYI KTFELAGLQIDVLDINSFALIRTLREPLRQFGSQEAVVLVDIEFDCTEIAIIVNGIPH FLRTVPIGIDLMQTVLSEAMGLPVSQALELLYEMSIPSTHINGEKTDAHDDIIEINSG MVALMRLLEELADELRRSIDFYLNQNESLEVAQIMLAGPGGGLGQLDDFLTQQLNLPT SQIDPVAALSLQIDEEKYPTVERSGLGIVLGLGMREAD" gene 8918..9685 /locus_tag="DP116_23330" CDS 8918..9685 /locus_tag="DP116_23330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457347.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fimbrial protein" /protein_id="PRJNA477356:DP116_23330" /translation="MYSIDINFLKNRKTENKFEEKRLGISLPTGNLTPVYIGVVVGIF FPALVGIGWWFVQIKNTALDNNITQLKQENESLESQIQSLNKVQVETKKIKQETQALV SVFDQIRPWSAMLQELRDRIPTTVQIDSIKQIAPTTLTQGQPALNSTGGIEISGFARS FSDVNDFTLTLQQSRFFKAAQTKIMTAELVDFPLPPTGNSINSSQIKPPQIVKYTIQS SLSDVPASEFIRELEQKGTVGLVSRIRSMQQTGIIPK" gene 9682..10446 /locus_tag="DP116_23335" CDS 9682..10446 /locus_tag="DP116_23335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315586.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pilus assembly protein PilO" /protein_id="PRJNA477356:DP116_23335" /translation="MTLSDDFDFIEDTEFATTSSSYPVAFGITLTPKVSGIILGVLGL AGAAYMLMNFVMPAWDNFQQQQTKQNELQQSIEQKKTYIKQIDKVQQEQALAKQQQIQ VLALFANEKTLDTLLLDLNRLVESTNTQVSADAVQAKLKKYVPTGKAEPITDGSFGQL VNGKLKRSSINIEIIGTYEQTQLIFRNIERLQPLLIVRDYRSSSLTPEPTTQQDKVMV GSVEPTVISTSFQLQALMPLSQEEIAAVKAVQSKTK" gene complement(10439..10738) /locus_tag="DP116_23340" /pseudo CDS complement(10439..10738) /locus_tag="DP116_23340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015159760.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="signal transduction protein" gene complement(10753..10877) /locus_tag="DP116_23345" /pseudo CDS complement(10753..10877) /locus_tag="DP116_23345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010873091.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=3 /transl_table=11 /product="four helix bundle protein" BASE COUNT 3163 a 2263 c 2305 g 3146 t ORIGIN 1 cttgaatata ataatttgac ttttctgagt ggtctgtgat cttaacaaca caaaaccaga 61 gcaaattagt attctacaga taaaaaatca gaaattcaac taggagagtg tcgctaacac 121 gcggagaaat tctagaggca aattatcagt atctgcaata aaagtgactt cataaatttt 181 ctgacctatc tgctgctgtg taggttctaa aaggattttc agtggttgta gcaggtcagg 241 atcacttttt gataacaccc acaaacgttc ttttaaattt gtcaaccaac tagataaatc 301 cggtgttatt tcagttaaat caaacgagag atgataatac cccacgtagt gttcatctgc 361 aaatgcatct ggggcaggtt ttggttccgg aatttggatc agctcaattc tgccgttgag 421 tccttccatc cagcatgcaa gagtatagcc tgttgtgaag cgttcacaga ctgtaaaccc 481 tagttgttcg tagaaggcga tcgcccgatg aatatttcct gtacgaatag aagcgtgatg 541 cataattttt ttcctttgtt atttgtcatt tgtcctttgc taaacgacaa aggacaaaaa 601 acaaatgact caacggcaaa gccttcaaag cagcaggttg tctgacaact aacaattact 661 caaataatcg gaaataggga taccgtactg gtacaccagg ttctttttcc aaatcgaaat 721 taatcacatc ccaacaggga tcatccaagg aatcagcgct aaactccaca ggtaaaccat 781 aaagtcgtgc cgaagttgcc tccgactgtc cagaacgcca gtgagtgcta cgctctagat 841 aaccactcat caactcttga tagcgtctag caataactac ccgtgttgct cgatagcctt 901 gggtataaag cttatctagt gcttcatgaa tttcaaaccg aatgccatcg ggatgagtat 961 gttgccgata ccactcgcca ttccatcttc gccaatgacg tcctgactgt aggtggatga 1021 gttccccagt ttcgggattt gcttcaaacg ccccgtgacg cggacataaa taagtgtcgg 1081 ttagtgtcaa agccgggatt gtttggcggc agtggggaca ctgaatttct gcaccgaata 1141 tggggtactg caaggctgga ttcatcatga agtgcgtaca aacatatttt tgttttcacc 1201 agtagtgttg gttcctttta cacttctgct ccaacttgtt ggacaccgat ttactgccta 1261 gtgagccttg tggctctaca ccgtgatatc ctagttttgc aacctatact ggtgcaacct 1321 atactggttg gagttcctaa gcgtgttcct gggtaccaca aatctattct accgtgtcta 1381 tcgcttccta ctggtcttct cctgatattt ctcaagctgc ctttatagca gcgaacgcta 1441 ttgttatggg ttcagtaaaa attgcagctg gggtgagcat atggtacggg gcagttgtga 1501 gggcagatgt ggaatctata gaaattggcg aatgtacaaa cattcaggat ggagcaatat 1561 tacacggtga tcctggaaaa cccacggtgt tagaagatca 
tgtcactgtg ggacatcgtg 1621 ctgtcgtaca ctctgcttat attgaacacg gcagcttaat cggaattggc gcagttgttc 1681 tagatggagt acgggtgggt gccggtagca ttattggtgc tggcgcagtc gttacgaaag 1741 atgtaccgcc cttgtcccta gttgtgggtg ttcctggtaa agtcgtccgc caaatttccg 1801 aagccgaagc agcagaactg attgaacatg cccaacgtta caaaaagtta gcgttggttc 1861 atgcaggcaa gggtacggat attgggttta aagcaccgga gtaagaggga gcaggggaag 1921 caggttcgct agtgggaagt tcttcgacat aggattttcc tctcccccac acccttacac 1981 ccccacaccc ttacacccct agtttttgta aagattcttt acgtattgtt tggcaaagtt 2041 gactgcttca gctacaaaat aaataatgag aaaagttgac aaaactttat taaagcggaa 2101 agaggttgta gatatgagtc ttccagatat gcgtttgatc attgttttag caccacttct 2161 cattgctgct ggttgggcat tttttaacat tggtgccgct gctctcaatc aattgcaaaa 2221 ctttttgaac aaagaagcgt aaagaaagaa gcataaagtc tgattcacac cataaaatac 2281 caaccgactg ttttgattca tagggcagtc ggttttgttt atcagttatc agttaccagt 2341 tcctgttcac tgtgtaaaaa attccaccta ctacctgact accttttgtg gatatcgact 2401 ccctacttca acccttccaa cactttggcg ttcacctcgg gttagaacgt attgtcaagc 2461 tgctggcaaa tctaggtaat ccccatcatc aactcccagt gattcatgtt gctgggacta 2521 atggcaaagg ttccgtttgt gcctacctta gttctatcct cactcaagca ggttatcaaa 2581 ctggacgcta tacctcacct catttggtag attggacaga acgcatttgc cttaatgaag 2641 agccaatttc ttcagaggaa ttttgccaat tgttactgca agtgcaagca gcaattcccc 2701 aaggagaaga atcaccaact caatttgaag tcattaccgc tgctgcttgg ttatattttg 2761 cacaacaaaa agtggatgtg gcggtggtag aagttggatt aggaggacgc ttggatgcaa 2821 caaacgtctg ctcaaaacct ctgatcacta ttatcacttc gattagccgt gaacattggc 2881 aggttcttgg tcctactgtt gctgatattg ccagagaaaa agctggtatt ctcaaacttg 2941 ggtgtcctgc tgtgattgga ctattgccca gcgaagctga acaagttgtg cgatcgcgtg 3001 ctttagaatt acaatcccct cttattacgc ctcaaccttc acagtctatt tctccaggat 3061 gggcggagta tcaaacacaa aattcactct ttagaaattc aaatttaatt aaataccctt 3121 tgccgttaca gggacaaatt cagttaacga attcagcttt agctttagcc gctatagaaa 3181 ttctacaaac gcagggttgg cagatttctg agtcagctat tattcaagga atggcaaata 3241 caaaatggct ggggcggatg caatggacaa cttggaaaaa ccataaattg 
ttagtagatg 3301 gcgctcataa cccagccgcc gcccaagttt tgagagatta tgtgaacaca ctcaatgtaa 3361 aatcggttag ttgggtgatg ggaatgctct ctacgaaaga gcatgaggag atattcaaag 3421 ctttactgcg acaaaatgac cgattgtatt tagtgccagt tcctgatcat agttcagcaa 3481 atcctgggga attagcaaaa ctcgcttcat atatctgtcc agaattgagt ttttgtcaca 3541 tctacccaga attattatca gcccttgaag cagcttttgc ctcgacagat gatttggtaa 3601 ttttgtgtgg atcgctgtat ttagttgggc attttttagg aacaactgat ttaggagcgt 3661 gaagggtgtc aggcaggaga ggataatatt gctcaagata gagatagtat taaatcgatg 3721 ggtggagttc agtgttggga gttcttggat atattggaaa caacaaaata taaagaaaac 3781 aaagaaaaaa atgcaaagta ctcctaacaa aacatcccct tctactatta aactaaccga 3841 aaaggaagtt atctcagata taagggaata ctgggcgctg aagggatggc ggtgtgtaaa 3901 gtagggagca agatgttccc actatcttta tgtacatcca ttgggatgtt cccaaattta 3961 attgcttagt tttctgaaaa ccaagttaat tacaaatgcg ctaaattgta tcaggatcaa 4021 tatttaactc ccgcaattta gccgccaatc tttctgcttt atttatagct aattctttct 4081 gccttgaggt ataaaagcca gattttaagc cttttttact tccgaaccct atggcaaact 4141 gaagttgcgg taattctaag ggaagaagtg ttttttcttc ttcctacttc cttcttactt 4201 ctttacgctc actaacgctt ctatcaacgt ctgattttgc tcatccgtac caactgtaat 4261 ccgcagctta tcctctaacc ccgcttgctt aaaataacgc accaaaattc ctcgttcttt 4321 caaccccaag taaatagact ctgcatttcc ctgagtcgga gttgccaaaa caaagttagc 4381 gtgagaatca agcacttccc atcccaaatt ctttaaatct tgcgctagtt gaatccgtga 4441 tattttcact ttatctgcac aagcattctt gtaagcttga tctcgcattg cagctgtacc 4501 gactgcttgg gcgatcgcat caatgttgta actatctttt accttaaata accccgccag 4561 cagttttgga tttgccaccc caaaacccat ccgcaaacct gctaacgaat accctttcga 4621 caaagtacgt aacacaatca cgttctcgaa ttcttgcacc aagggcaacg cgctatactc 4681 agcaaagtca acgtaagctt catcaactac aacaattcct gagacttgct gcgctaattc 4741 tcgcagatca tctaaaggta ccacatgacc agagggacta ttgggagaag caataaacgt 4801 aattactcca ttcgccgcaa cgagttcctt aatcggtaac tgaaaatcgg cagggtaagg 4861 aacctcaagt acttgtgcgg gttgcatggc tgataaagtc cgatatagca cgtaagttgg 4921 catgggatac acgactttgc gattattccc ctccgcacaa cagcgtatga gcacattcag 
4981 caactcatca cttccattac ccacaatcac ccaatctgca ggtactccca acgcatcgct 5041 gactgcttga caaaaatctc ttgcaaatgg atcagggtag cgtcgtagcc attcaccatc 5101 taaattgtgc agcacctcca tcgcttttgg ggagggagga tagggattct catttgtgtt 5161 gagtttgata atttttgttc ctggtttggg ctgttcacca ggaatgtagc cagtcatcgc 5221 atcaatagca ggacgaaagt agctcatgtg tgaatggtgg gaacttatgt aagtcggggt 5281 tcgttgtaga caacagccca catcgcttca caaacagttt tgtttgtttt acattttgga 5341 atgataacta gttgtcacca aacaaggcaa aagaaaaatc ttgagtagta tgtttctttt 5401 tactttttca attgagtgag actgccaaaa atccctgatg gcgatgctcc agcagggagc 5461 gctagcattg ctgatcgctc tccccaggct tttatttatg ttcaatcctt gcatttactt 5521 gaattagggc tgaagtaagt tcaaccccag ccctaacaat taatgcttgc tattctttca 5581 caagtgttaa agcttcttcg ccgttatgta aaatgactaa cttcttacca ttttcctcag 5641 atttcacttc ccactcaaag cctggtttga gtggacgtat aggaacagat gtaatattcc 5701 attcaccttt atctaagtta gaacttgggt tacttgtgga ctcttgaact tgtggttgcg 5761 cttgagaatc acttgtcttt gcaaagctag gattctgaaa cagaaaacac aagctaagca 5821 taacaaaagt acaaacgatt acctttttaa agaaacgttt catgacaaga tattaccttc 5881 tatcacttta ctacctaaac gcttgtaaca cttaagctaa agatagaata tcatctcggt 5941 cgcccagagg cgcgatgccc tgcccgaaat ttcaattcgt ttcgcgaaat tctgggattt 6001 tgccatcacc aggcgatcgc tcagatcacc ccaccagatg tgctacccaa cgggtgcgcc 6061 gcttttgcgc cagcgtcccg taagggatac gcgtctacgt atagccgtac aggtacagtg 6121 tgcggcttgt tttggattaa gctaactcct agtgctgttc cactcctgcg ctgcatcctc 6181 cacagcttga tccactgact tctcgcccag catagccgct tgtaagttct cgtaaattgc 6241 cttttgcagc cgttttatat ccttgagcct tggagttaat atctctgctt gtcgcagttc 6301 ttctgcgctt atgactcgcg ccttatctaa agtcgaggcg ttcgcgggtg catctttgaa 6361 gtagctgtca gcaagggact tgactgtaga tgggagaaca tttgctgctt tagcaaaggt 6421 gagttgattc tcgtcatttg tgaggaacaa ggcgaattta agagcagcat cgggttgttt 6481 actgtcacgg ggaataacaa cgttcataac agcgacattt tttttgccat tgtcacctat 6541 gacttgcggt gcggtggcgg aaactttggc tattgctggc gcatttttac caatgttctc 6601 cagaaactct cctccagaaa acaggaacgc tgtttctcca gcttggtaca aatctatagc 6661 
tcgacgatgc ccttgtgtca atatttcttt gggtagaagt ttttgtttgt ataagtctac 6721 ccaatactga aacgccgctt taccttctgg ggagttaaaa gcagctttac cctcagcatc 6781 cactaaagtt gcccccattt gtacaaacga ctctaacacc tctccagaat cttgagggac 6841 aaaggttaca aaaaaggcat acttaccagt tttgtcttta attttttgag ccgcctgcgc 6901 taattcggtg taggtgacag gtgctttact gatacctgcc tgttttaata aatcggtgtt 6961 ataaatggtt agccgcgtgg tgagatacca gggaattcca aaactcttgc cattgagcgt 7021 gcttgcttgc cagatattcg gtaaatagga ggaacgaact tcttttggga cttttgcatc 7081 taaatctaac caggcatttc ttcctgcaag ttgggaagca aaatccggat taaggttgac 7141 aacatcaggt ggtgtttttg ctgaaacagc tgttaaaatt ttgttttcca ttgcagtcca 7201 aggcacatca acccaactca cttttatacc agaattttgt gactcaaaag atgttattag 7261 gcttttaaag tagtcagtaa attgaggttg gagttgcatg gtccaaaatt caacattttg 7321 tactgctgcg actcctgatg atgcttgttt tgtggtagta ccaacattgc ttgtaccaca 7381 actaacaatc caactggtca atacaccgag taatgcccaa ataattaatt ttttgaactt 7441 tttaaatcta accatcgtgc ttgtacttta tttaagtcta tcgggtgatc agtacttatg 7501 aaacaattgt ggagattata atttaaaaca aattcttagt aatgatatca atccttaaat 7561 cagtgaacaa tgaacagtga acagtaaaca gtcctgcatg catatggtgg aggattaaac 7621 ccacttagca tgcgttccac gctcttggtg agggatcgtt acccaagctg agaaagctga 7681 taactggtaa ctgataactg ataactgtta atatcatttc atttttaaat tagcaatccg 7741 acgggcaaaa ctgtagtgaa gctattcaat agtttgtttg gaaaattaca tcaaggtgtt 7801 ggtatagaac tgtcaccgca acgtgtgaat ttagttcaac tgcgtaagct acgccaaggt 7861 ttgaaactag aaactttaac atcggtatct cttcctgagg gagtttttgt caatggtcaa 7921 attgttgact ccccaacaat ggcgcaatgt attcagcaag ccttagtgga aggtaacatc 7981 aaagcttctc gtgtggctac tgctataccg gggcgagaat caattatacg catcatatct 8041 gtgccagccg agttagatga taaagaatta cggcacatgg tactaaacca cgaagcaggt 8101 ttgcatttac cattaccttg tgaagacgct gatttagatt atcagaaact cgggtacttt 8161 atagatgatg atggaattga aaaaatacag atacttttag tagctatacg gaaagaaatt 8221 actgagacat atataaagac ctttgagtta gcaggattgc aaatcgatgt tttagatatt 8281 aatagttttg ctttgattcg cacacttcgt gaaccacttc gacaatttgg ctcgcaagaa 8341 gcagttgttc 
tcgtggatat agaatttgat tgtacagaaa tagcgatcat tgttaacggg 8401 attccacact ttttgcgcac agttccgatt ggaatagatc tcatgcaaac tgtcttatct 8461 gaagcaatgg gcttacccgt ttcacaagct ttagaactgt tgtatgaaat gtctatccct 8521 tcaactcata taaatggaga aaaaactgac gctcatgatg acattattga aatcaattca 8581 ggtatggtag cactcatgag actgttggaa gaactcgcag atgaactgcg ccgttccatc 8641 gatttttatc tcaaccaaaa tgaaagtttg gaggtggcgc aaattatgct ggctggacca 8701 ggagggggac tgggacaact tgatgatttt ttgacacaac aattgaattt gccaacctct 8761 caaatcgatc cagttgcagc tttgtcattg caaattgacg aagagaaata ccccactgtg 8821 gaacgttctg gattgggaat agtgcttggt ttaggaatgc gagaggcaga ttaacaattc 8881 aaaattcaaa attcaaagtt caaaaaaaga aatgtaaatg tacagtatag atattaactt 8941 tctcaaaaac cgcaaaactg agaataaatt tgaagagaaa cgactaggaa tatctctacc 9001 tactggcaat ttaacgccag tgtatatagg agtggtagtg gggatatttt tcccagcttt 9061 agtgggaatt ggttggtggt ttgtgcaaat taaaaatact gcattggaca ataatatcac 9121 acaactgaaa caggaaaatg aaagtttaga aagtcaaata caaagtctga ataaagtcca 9181 agtagaaaca aagaaaatca agcaggaaac tcaagcttta gtgagtgttt ttgaccaaat 9241 tcgtccttgg tcagcgatgt tgcaagagtt gcgcgatcgc atacccacaa cagtccaaat 9301 tgatagtatc aaacaaatcg cacccactac actaacacaa ggacagccag cactcaattc 9361 tactggaggt atagaaattt cagggtttgc tcgctccttt agcgatgtca acgatttcac 9421 gttgactttg caacaatctc gcttcttcaa agctgcacaa acaaaaatta tgacagcaga 9481 attagtagat ttcccgttac caccaactgg gaactctatt aatagctcac aaatcaaacc 9541 accccaaatc gttaaatata caattcagtc tagtttaagt gatgttcccg cctctgaatt 9601 cattcgtgaa ttagagcaaa aaggcacagt tggacttgtc agtcggattc gcagtatgca 9661 acaaacagga atcattccaa aatgacgctg agtgatgatt tcgattttat agaagacacg 9721 gaatttgcga caacttcctc aagttatccc gttgcttttg gcatcaccct cacgccaaaa 9781 gtgagtggca ttatcttagg agttttagga cttgcaggag cagcttatat gctgatgaat 9841 ttcgtgatgc cagcatggga taattttcaa cagcagcaaa caaaacagaa cgagttacag 9901 cagagtattg agcaaaagaa aacttacatt aaacaaattg acaaagtaca acaggagcaa 9961 gcacttgcca aacagcaaca aatccaggtt ttagctttat ttgctaatga gaaaactttg 10021 gacactttac tactagattt 
gaatcgcttg gttgagtcta ctaatactca agtttctgct 10081 gatgcagtcc aagctaaact gaaaaaatat gtgccgactg ggaaggctga acctattact 10141 gatggtagtt ttggtcaatt ggtgaatggc aaattaaaac gcagtagtat taatattgaa 10201 attatcggaa cttacgaaca aacacaattg atttttcgca atatagagcg gttacaacct 10261 ttgttaattg tcagagacta tcgatcatca agtttgacac cagaacctac aactcaacag 10321 gataaagtta tggtgggtag tgttgaacca acagttatta gtacatcttt tcagttacaa 10381 gcattaatgc cactgagtca agaggaaatt gcagcggtga aagcagtgca atcaaagact 10441 aagtagcata tggcagcagc aatgtatcct caatctcctg gcgaacttcg cggctgacgt 10501 agcaatcgct gttgaggcaa tcaactagca gtttattagc atcgtagtac tgcttcagga 10561 gttccttttg agcatcagtg aattgccaat catgaccaat attacgatgt tcaatcatta 10621 cagatctaag ttgttcagtc caagttgaac cgttttcctc ccaccattgt tggtatgctt 10681 cttcgtcttc tgggttgggc tgttggtctt tgagttgttg gagcgatcgc cttaactctg 10741 ggtcgaggac tcgggagagg aatcgggaga cgaagaggga gacggagagg tcttgcaaga 10801 ggactcgaaa gacggagagc cagagggaga gggagagcca gagggagagg gagagagaga 10861 gggagaggga gagggag // LOCUS NODE_3121_length_10816_cov_5.32803610816 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10816) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10816) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10816 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(69..272) /locus_tag="DP116_23350" /pseudo CDS complement(69..272) /locus_tag="DP116_23350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199759.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="inorganic phosphate transporter" gene 274..525 /locus_tag="DP116_23355" /pseudo CDS 274..525 /locus_tag="DP116_23355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860518.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 1014..1682 /locus_tag="DP116_23360" CDS 1014..1682 /locus_tag="DP116_23360" /inference="COORDINATES: protein motif:HMM:PF14518.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23360" /translation="MMLTTVTQHLLNVRYRVLEGLKDSASIKSVLAGKADKDIYIKYL TNVYQYAQHSPKVIALAASRCMNTHPQLAKYLLHHAEEEQGHDLWALADLQDLGVNES TVKLAYPVPSCSAMIGFVYYTAGYANPVGLFGWLYVLEAMGNDIGGIIAEQLNDGLSL SNTALRFVAGHGISDRDHTTDLTEVMNTYVKNPQDVADINHVADVIADLYVRMFTEIA KIRV" gene 1682..2806 /locus_tag="DP116_23365" CDS 1682..2806 /locus_tag="DP116_23365" /inference="COORDINATES: protein motif:HMM:PF00027.27" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23365" /translation="MTLSVAVAKTEAELNEICHFRYRIYVKELKILPPEADHSKQILR DSLDDYGVSYALLKDGQVVGSLRSIFLKDVPNPKPLINKFNLKPAIETFGTSAIITTS RFILDPQLRHGSAVFRLIEAGYKEGRSRGVRLNYGDCSPYLLPFYEHLGYSRYISAYN DSSFGYKFPLLMLVGDHAWFERVHSPLRRLAFCYPQDTKARQWFESTYPEYFGLESAP FLPEKVFFDTLSQRLGNNPLRKIFLLRGLDQTEANLFLKEATIIKTSPGDCVVRQGNY DKTFYVMLSGIAEVIDNQVPDHPAKILSAGELFVENHILTPEPCAANLIACSVCNILV VPGEFFGKFCKQEPIIASKILLNLVQLGLKSRGCNEMRAT" gene complement(2914..3882) /locus_tag="DP116_23370" CDS complement(2914..3882) /locus_tag="DP116_23370" /inference="COORDINATES: protein motif:HMM:PF07859.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23370" /translation="MREDIFARMTADTRAVVQKMQAHAVSPPPNADFVQLTRDGYRQV IGLAGDAQEVETVEDHTVPSTPAIPLRLYKPRVEYNDTPLPALVYFHGGGFISGGFDT HDRPLRVLANASGCAIALVDYRLAPESPFPAAPEDCFAGLQWVIEHAQELGINPSKVS VGGDSAGGLLATVVCLMCRDRNASRPIAQILIYPDTDLAINTRSWYELDFLHPAQSRE NKLSQIAMYVPNQAEREQPYASPLRAPNLSNLPPALIITAELDPQRDEGEAYAQQLRD AGCLVTHTRYPGVIHGFYQMGGVIASARAAIAEVGAYLQLRHTGAS" gene complement(4020..4922) /locus_tag="DP116_23375" CDS complement(4020..4922) /locus_tag="DP116_23375" /inference="COORDINATES: protein motif:HMM:PF12833.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="PRJNA477356:DP116_23375" /translation="MQQQQPVIVRTNQPGATLQILPNQPTLSSHLLKWDGITVRYAWQ PAWETPEHCNTQHLILIRYSHDYVNSERQLDDKRQREQFSQGDTVLIPVNVRHKARWD GAGEIISISLEPTQLAHLAHESVDGDRLELLPQFAHPDLLIHGIGVALKRELEAGEAS SRLYVDALTTALGAHLIRRYATRHHYLGNYTDGLPKLKLQLVIDYIQAYLDRDLSVGE LAAIVQMSRYHFGRLFKQSTGLTAHQYVLQCRIEKAKQLLRDPELPITEVYQQVGFQS QSHFTKVFRRHTGVTPKAYRTDRK" gene complement(4986..6272) /locus_tag="DP116_23380" CDS complement(4986..6272) /locus_tag="DP116_23380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="inorganic phosphate transporter" /protein_id="PRJNA477356:DP116_23380" /translation="MLVIIVALLAFYLAWNLGANDVANSMGTSVGSKAVTLKQALIIA GVLEFTGAVLFGQEVSQTLATEIVNPALFADTPQMLVTGMVAVLLSGGLWLQIATSRG LPVSSSHAVVGAIAGFSWVALGVSAIDWSTIGQITIGWIITPVISGAIAALFYSQIKR WILDQPNPLVQLNEWIPWLSAVLLGIFGVIVLPSLTQGLATFLIEQVGFKIPAHDIPL VTGAVATVGLTFYSWRQLEDKGTRRQADKADKETREKTQIQNPVERLFGRFQLLSACF VAFAHGSNDVGNAIAPLAAIAYINRTGSVPTDGFNIPIWILILGGVGIVTGLAILGKK VIATIGESIISLQPSSGFCAELATATTILLASRLGLPVSTSHALVGGVVGIGLVQDIK SIQFKTLQGIAAAWVITVPASAVLSAAIFSVLRIKF" gene complement(6622..7668) /locus_tag="DP116_23385" CDS complement(6622..7668) /locus_tag="DP116_23385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119308.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sorbitol dehydrogenase" /protein_id="PRJNA477356:DP116_23385" /translation="MKAQVFRGVNQLSYEELPVPTLEPDEVLVQVQVVGLCQSDIKKI RYPLYEPPRIFGHETAGTISAVGSQVTGWQVGQRVAVMHHIPCMRCDYCLNDNFSMCD TYKNISTTAGFNASGGGFAEYVKVPGHIVRNGGLIPIPDNISFEEASFVEPTNCCLKA VKKAQIAPGQTVLITGAGPIGLMFIMLVKYFGAKAIATDLLPSRIEKALSVGAEAAFD ARDANLPAKIHDLTNGMGVDVTLLAVPSDKAFFQALDCTRKGGKILFFAEFPDEMEIP VNPNLLYRREIDLMGSYSSSYRIQSLAADIVFNRRIDVQALISDRYRLQDLSAAVEQA IAPTAETYKILIYP" gene complement(7782..8345) /locus_tag="DP116_23390" CDS complement(7782..8345) /locus_tag="DP116_23390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952734.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PIN domain-containing protein" /protein_id="PRJNA477356:DP116_23390" /translation="MPSALLDACVLFPMYLRDTLLSTAEAGLYVLYWSQNILDEAMRN LILKGKISVEQAENLQETMKAAFPEAMVKVPVELEEVMTNHPKDRHVLAAAVTANANV IVTSNLTDFNAQALTPWNLKAQSPDDFLCLLFEDYPEEMIQVLRQQSQKYRRRPLSYQ ELLAFLSKKDGANLQKLVAKIRSYESR" gene complement(8373..8831) /locus_tag="DP116_23395" CDS complement(8373..8831) /locus_tag="DP116_23395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015080351.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_23395" /translation="MTTILTHNVPIEAVKTEQEAQSIKQVEDILNSQGSQVKLIGTNG EQIDIPESLYQVLRHVVHAMASGQAVSLVPHSYEMTTQQAAEFLNVSRPYVVKLLEQG EIPYIKVGSHRRVCFEDLVRYKEQRDKKRSQLLKQLIEMSEEAGLYEEEK" gene complement(8907..9590) /locus_tag="DP116_23400" CDS complement(8907..9590) /locus_tag="DP116_23400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312175.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_23400" /translation="MRKQTRDCDTVVGKVKTKRVTKSVRQIRDADVTQQQILDAAELE FARHGLKGARLSAIANHAHITTATIHYYFENKEGLYKAVLQRPIDEVQAMVSQLNLDH LPPEDAMAHIIRTAIAYEASNPHRQMLWFQEASQNQGLYFKQANVWSLYEHLLKVLER GITEGCFRPLDPILTLTHILSVCIFYFTVHENWKHLTPEIDRLSPEMVEKHIEAAIKF VLAGVKNTP" gene 9715..10704 /locus_tag="DP116_23405" CDS 9715..10704 /locus_tag="DP116_23405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869804.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="(2Fe-2S)-binding protein" /protein_id="PRJNA477356:DP116_23405" /translation="MTMLQGAPWLLAHRSMLKPNQPVKVSLYGNDYVLWQDSTGKISC LPNACPHMGAMLSEGWCVTKPDGSSVVVCPFHALEFDGSGCTILPGSNKPTKSLMEPL ELVIQGDFIWSYGGYEPKIPIPTIMNEIAGEYEFIGFTGERSIKTDFLSLLLNMHDYN HQNGTHRELFEIEQVQLKQFIDDGFHSHAYIDQPRKKPTLRHILKNPVLIAMPKVLQA HLENFFPCMAVMHGENAILSLKECHFYIPESPNHSRVFILMFIKAHTPIAHLIKGNLL RLIDIVVEQDADILSKLYANTPQHIKLNNEVGMDWVRRNFESFPTVVEPNFSR" BASE COUNT 3044 a 2460 c 2339 g 2973 t ORIGIN 1 ctctccttgc taaggagagg ggaggttttg gcgtcagcca aagccggggt gaggtaacac 61 gcgcgttatc aaaactttat cctcaacaca gaaaagataa cagcactaag aaccgcactt 121 gcaggaactg taatcaccca tgctgcggca attccttgaa gagttttgaa ttgaatagac 181 ttaaaatctt gcaccagtcc aatgccaaca acaccaccaa ccagagcatg ggaggtagag 241 acgggtaaac ccagtcggga ggcaagcagg atatacgaac aagcgatcgc ttgttttgac 301 aaagccatta aactcaagcc tcactcctac aaagcttgga ataagctagg ttacgcctta 361 gtgagactgg gacgcgatga cgaagcgatt gaaagcttcg acaaggcgct ggaaatgaaa 421 ccagactacg ctagtgctta ctacaacaaa gccgcctgtt atgcactgca aagacagatt 481 gagttcgcta tagaaaattt gcaaccagca gtggacattc gttaaaaaat aatagaatgt 541 tgcttttgtc aattagaaaa tatgctgatg acaaagcata tttcaggtat taaattaact 601 taagactgat tttttcgagc aaaagtgaat gacgagtaca aatagtacgc gacaagcctt 661 tatgctttaa gctccgctta tcgctcccac aaaaattatg ttttcaagca cggtttttgt 721 caccttttta agtaagatta 
cttattgaca aaagcgcgat aaccataaca tgtcaaggat 781 agcagtggat atgaatccta gcgagaaaga agctgcaact gacgcagatt tgctgtgaaa 841 ttgccagata taaacgtttt aggcagtttt tcgcgaaatt tgcccagtca acccaccctg 901 aacctatgaa aaaggtcagg ggcatgctac taaagcttgg aataaccgag gttacgtttt 961 agtgaggcta ggacgcgatg actctcgtat ttgttcgcta acaaaggaga cttatgatgt 1021 tgacaacggt cacgcagcac ttgttgaatg ttcgctatcg ggttttagaa gggctaaaag 1081 actctgcatc aattaaatct gtattagcag gaaaagctga caaagatatc tacatcaaat 1141 acttaaccaa tgtatatcaa tatgctcaac acagtcccaa ggtgattgca ctcgcggctt 1201 cccgttgcat gaacactcac cctcaacttg ccaaatactt gcttcatcat gcagaagaag 1261 aacaaggaca cgacttgtgg gcgctagcag acctccaaga tttaggtgtg aatgaatcta 1321 cagtgaaatt ggcataccca gtaccatcat gctcagcgat gattggattt gtctattaca 1381 ctgcagggta tgcaaatcct gtagggttgt ttggatggtt atatgtcctc gaagcaatgg 1441 gcaatgacat aggtggaatc atcgctgaac agctcaatga tggactcagt ctatccaaca 1501 cagcgctcag atttgtggct ggacacggca ttagtgatcg ggatcatacg acggatctga 1561 ctgaggtcat gaacacttat gttaaaaatc ctcaggatgt tgctgatatt aaccacgtag 1621 ctgacgtaat tgcagacctt tatgtccgaa tgttcacaga aatcgcaaag ataagggtct 1681 gatgacttta agtgttgcag tcgccaaaac agaggcagaa ctcaatgaga tttgtcactt 1741 ccgctatcgc atctatgtca aagaattgaa aattttgcca ccggaagcag accactcaaa 1801 gcagatttta cgcgattccc ttgatgatta cggtgtctcc tatgcgcttc taaaggacgg 1861 tcaagttgtt ggttcactca gatccatctt ccttaaagac gtaccaaatc ctaagcctct 1921 gatcaacaaa ttcaacctca aaccagcaat agaaaccttt ggtacctcag ccattattac 1981 taccagtcga tttattctcg atcctcaact acgtcacgga agcgccgtgt ttcgtttaat 2041 agaagctggt tataaagagg ggcgatcgcg tggagtacgt ctcaactatg gagactgcag 2101 tccctatcta ctgccgttct acgaacacct tggttacagt cgctatatat cagcatataa 2161 cgattcgtct ttcggttaca agtttccgct tctgatgttg gtgggtgatc atgcttggtt 2221 tgagcgtgtt cactcccccc ttaggcgtct tgccttctgt tatcctcaag atacaaaggc 2281 gcgtcaatgg tttgagagca cttacccgga atactttgga cttgaaagtg ccccattctt 2341 acctgaaaaa gtctttttcg acaccctaag tcagcgcttg ggcaataatc ctttacgtaa 2401 aatattccta ctgcgtggac ttgatcaaac 
cgaggctaat ctgttcctga aggaagctac 2461 aatcattaag acttcaccag gcgattgtgt tgttagacag ggaaattatg ataagacgtt 2521 ctacgtcatg ttatcaggca tcgcagaagt cattgacaat caagtccccg accatccagc 2581 taagattcta agcgctggtg aactctttgt tgagaatcat atcctgacac ctgaaccttg 2641 tgctgcaaac ttaattgcat gctcagtctg caatatattg gtagtgcctg gagaattctt 2701 tggcaagttt tgtaaacaag aaccgattat tgcttcaaaa atacttttga atcttgtcca 2761 actaggacta aagtcgcgag gctgcaatga aatgcgtgcg acgtgaaggc gtgtgttaag 2821 tacacctcag ccgcttcata aagcggcaga tttgagcagg tttagcccaa ccctaaaaaa 2881 agtgcggaaa tcatgcagcg ctcgcatttg tcattaactc gcccctgtat ggcgaagttg 2941 caggtaagcc ccaacctcag ctattgcagc tcttgcggat gcaatcaccc cgcccatttg 3001 atagaagccg tgtataacac cagggtatcg ggtatgtgtg acaagacagc cagcgtcgcg 3061 caactgctga gcataagctt ctccctcgtc ccgctgagga tcgagttctg ctgtaattat 3121 gagtgcaggt ggaagattcg acaggttagg agcacgcaga ggcgaagcgt atggctgttc 3181 acgctcagcc tgattgggaa cgtacattgc aatctgactc aatttgtttt cacgtgattg 3241 ggctggatgg agaaagtcta attcatacca tgatcgagtg ttgatcgcca agtctgtgtc 3301 aggataaata agaatctgtg caattggacg actggcatta cgatctcgac acatcaaaca 3361 tactaccgta gcaagcaacc caccagcact gtcaccacct actgaaactt tagagggatt 3421 aataccaagc tcttgggcgt gttcaataac ccactgcaat ccagcaaagc agtcttctgg 3481 tgcagcagga aagggtgact ctggagctaa tcggtagtct accaatgcga tcgcgcaacc 3541 actcgcgtta gcgagaaccc gaagtggacg atcatgggta tcgaagccac cactgataaa 3601 cccgccaccg tgaaagtaca caagagccgg tagtggagtg tcgttgtact caactcgcgg 3661 tttgtacaat cgaagtggga tagcgggtgt cgatggaact gtgtggtctt caacggtttc 3721 tacttcttga gcatctccag caagcccaat cacctgtcga tacccatcgc gtgttaattg 3781 aacgaaatca gcattaggag gtggggacac tgcgtgcgct tgcatttttt gcacaacggc 3841 acgggtatct gctgtcattc tggcaaaaat gtcttctctc atgtgacctc gttcctttgc 3901 ttttccaatg ttgaatctta aataatttca actgtaaatg tcttgtcaat ttgtgcgaat 3961 aattggtcaa attgtgcgat acgcgtagcg tcaagcctcc ggcttatcgc gtacgattac 4021 tattttctgt cggtacgata agctttgggc gtgactccag tatgccgacg aaacacctta 4081 gtaaagtggc tttgactttg aaatccaacc tgttgataaa 
cttcggtaat tggtaattct 4141 ggatctcgca gtagttgttt tgccttctca atccggcatt gtaatacata ctgatgagcg 4201 gtcaatcctg tggactgctt aaacagacga ccaaagtggt agcgactcat ctgtacgatt 4261 gcagccagtt cccccacgct taaatcccga tctaagtaag cttgaatgta atcgatcact 4321 aactgtaatt tcagctttgg caatccatct gtgtaattgc caaggtaatg atgacgggtg 4381 gcataacgac gaatgagatg agcaccaagc gctgtagtta aagcatcaac ataaaggcga 4441 ctactggctt cacccgcttc tagctcccgt ttcagggcta ctccaatccc gtgaatgagt 4501 aaatctgggt gggcaaactg cggtaagagt tcaaggcgat cgccatcgac tgattcatga 4561 gcaaggtgag caagctgagt tggttctaat gagatagaga tgatttctcc tgctccgtcc 4621 caacgggctt tgtgtcgcac attaacggga atgagcaccg tgtctccctg gctaaattgt 4681 tcacgctgtc ttttgtcatc cagttggcgt tctgaattga cataatcgtg tgagtaccgg 4741 atcaagatca ggtgttgtgt gttgcaatgt tctggtgttt cccaggcagg ttgccaagca 4801 tatcgcacag tgatgccatc ccatttcaat aagtgactcg atagcgttgg ctgattgggg 4861 agaatctgta acgtagcccc tggttggttt gtccgaacaa tcactggctg ctgttgttgc 4921 atattcttct ctatcgttgc ttacagacta attttttctg tgttgaggat aaactttttc 4981 taaactcaaa actttatcct caacacagaa aagatagccg cgctaagaac cgcactcgca 5041 ggcactgtaa tcacccatgc tgcagcaatt ccttgaagag ttttgaattg aatagactta 5101 atatcttgca ccaatccaat cccaacgaca ccaccaacca aagcatggga ggttgagaca 5161 ggtaaaccca accgggaggc aagcaggata gttgtcgcag ttgcgagttc agcacaaaat 5221 ccactactgg gttgcaagga aataatactt tcgccaatcg tggcgataac tttttttccc 5281 aaaatcgcta aaccagtgac aattccaacg ccaccaagta ttaaaatcca tatggggatg 5341 ttgaaaccat ctgtaggtac actgcccgtg cggttaatgt aagcaattgc agctagagga 5401 gcgatcgcat tccccacatc attagaacca tgagcaaaag caacaaaaca agcactgagc 5461 aattgaaatc gaccaaataa cctttcaaca ggattttgga tttgggtttt ttcccttgtt 5521 tccttgtctg ccttgtctgc ttgtctcctt gtccccttgt cttccaactg tcgccagctg 5581 tagaatgtga gtccaactgt agctacagca ccagttacca atggaatatc atgagcaggt 5641 attttgaaac caacttgctc aatcagaaaa gttgcgagtc cctgagttag tgacggtagc 5701 acaatcacac caaaaatccc cagcaaaaca gcactcaacc aaggaatcca ctcgtttaac 5761 tgtactaatg gatttggttg atctaaaatc cagcgcttga tttgactgta 
gaataaagca 5821 gcaattgccc cgctgatgac tggggtgata atccaaccta tcgtaatttg tccaattgtt 5881 gaccaatcaa tcgcactcac tcccaaagca acccagctaa atccggcgat cgcaccaaca 5941 accgcatgag aagaagagac aggcaaaccc cgtgatgtag caatttgcaa ccacaaaccg 6001 ccagaaagca gcaccgccac cattccagta actagcattt gcggtgtatc ggcaaacaaa 6061 gctggattga caatttctgt cgccaaagtt tgcgatacct cttgtccaaa tagtaccgca 6121 cccgtaaact ctaacactcc agcaataatc aatgcctgtt tgagggtaac agctttggaa 6181 ccaacagaag ttcccattga gttagcaaca tcatttgctc cgagattcca agcaagataa 6241 aaggcgagaa gtgcaactat gatgacaagc attttgtttg tgtggaagat gaggggagtg 6301 tgggagtgtg ggagtgtggg agtgtgagag tgaagcgcgt ttgcgcagca ccccctgcgg 6361 ggctagcgag ccgtaggctg ggggatgagg aagatgactc agatcttaca cctgcttttt 6421 cgtccgccta gaaataaatt tctaggctca aagctcaagt aagctaaagc tcactggata 6481 tatatttcag tccgttttaa cggactttgg ctataagcct tgaacttgag ttcaaggcgt 6541 actcaccggt aaggtgcaag atttgagatg agggagatga ggaaaggttt tttagctatt 6601 ccccttgtcc ccttctccct tctacggata aatcaaaatc ttatacgttt ctgcagtagg 6661 agcgatcgcc tgttccactg ctgctgataa atcttgtaat cgatagcgat cgctaatcaa 6721 tgcttgcacg tcaatgcgtc gattaaagac aatatcagcc gctaaacttt gaatgcggta 6781 ggatgaactg taactgccca taaggtcaat ttccctgcgg tagagaagat tcggattaac 6841 tggaatttcc atctcatcag gaaactccgc gaaaaagaga attttcccac ctttgcgagt 6901 acaatcgagg gcttgaaaga aagctttatc acttgggact gcaagtaggg taacatcaac 6961 acccatgcca ttggtcaaat catgaatctt tgctggtaag ttggcgtcac gggcatcaaa 7021 tgctgcttct gcacctacac tcaaagcttt ttcaatccgg gatgggagta agtcggtggc 7081 gatcgccttc gccccaaaat acttcaccaa catgataaac attaacccaa ttggtcccgc 7141 accagtaatc aaaactgttt gcccgggagc aatttgggct ttcttgactg ctttgaggca 7201 gcagttcgtt ggttctacaa aacttgcttc ttcaaaactg atattatcag gaatgggaat 7261 caacccacca ttacgcacaa tatgaccagg aactttgaca tattcagcaa aacctccacc 7321 actagcgtta aatcctgctg tcgtagagat gtttttgtaa gtatcgcaca tcgagaaatt 7381 atcatttagg cagtaatcgc aacgcataca agggatatgg tgcatcaccg ccacccgttg 7441 tccaacttgc cagcctgtga cttgcgaacc gaccgctgat atagttccag cagtttcatg 
7501 tccaaaaatg cgcggcggtt catacaaagg ataacgaatt tttttgatat ctgactgaca 7561 taaccccaca acctgcacct gtaccagcac ttcatccggt tccagtgtcg gtacgggcag 7621 ctcttcgtaa gaaagttgat taacgcctct aaatacctgt gctttcatgt taaattctca 7681 ccgctcaacg atcatacatt taacacttag gggcgttctt tatgttactt gctacaaaat 7741 tttcgtcacg gaaatctttt gtggtagttt aaattacatg attatcgcga ttcataagag 7801 cgaattttag caaccaattt ctgtaaatta gcaccgtctt tcttacttaa aaatgcaagt 7861 aattcctgat acgaaagtgg tcgccttcta tacttttgag actgttggcg taatacttga 7921 atcatttctt caggatagtc ctcaaacaaa agacagagga agtcatcagg agattgtgct 7981 ttcagattcc aaggtgtcaa agcttgagcg ttaaagtctg taagattact agtaacaata 8041 acattagcat tagcagttac agcagccgca agaacatgac gatcttttgg atgatttgtc 8101 attacttcct ctaattccac aggtactttt accattgctt caggaaaagc tgctttcatt 8161 gtttcctgaa gattctcagc ctgttcaaca gaaattttcc ctttaaggat aaggttcctc 8221 attgcttcat ctaagatgtt ttgcgaccag taaagtacgt acaaaccagc ttcagcagtc 8281 gatagcaagg tatcgcgtag atacatcgga aataagacgc aggcatccag gagagcgcta 8341 ggcatagcaa ttatactcta ctaatgtcac aattactttt cctcctcata caagccagct 8401 tcttcgctca tttcaataag ttgcttcaaa agttggctac gctttttatc ccgttgttct 8461 ttatacctta ccaaatcctc aaaacaaacc cgtcgatgtg aacccacctt aatgtaggga 8521 atttctccct gttctaataa cttaacaaca taaggtcgtg aaacgttgag aaattcagcg 8581 gcttgttgtg ttgtcatttc ataactgtgg ggaactagag atacagcttg tcctgaagcc 8641 attgcatgga cgacatgacg cagcacttga taaagtgact caggaatgtc aatttgttct 8701 ccattagttc ctattagttt cacttgagaa ccttggctat tgagaatatc ttctacttgc 8761 ttaatagatt gggcttcttg ttcagtcttg actgcttcta taggcacgtt atgtgttagt 8821 atagttgtca tttttttgat attttataac tctatgcaac ctacgataac acaaaccgta 8881 ttacataaac gaaatatccg aaatggttaa ggagtgttct tcacacctgc caaaacaaat 8941 ttgatcgccg cctcaatgtg cttttctacc atctctggac tgaggcgatc aatctcaggg 9001 gttaggtgtt tccaattttc gtgcacagta aagtagaaaa tgcaaacgct gagaatatga 9061 gtcagtgtta aaattgggtc gagtggacga aaacacccct ctgttatacc tcgctcaaga 9121 actttgagaa gatgctcgta taaactccac acgttggctt gcttgaaata caggccttga 9181 
ttttgactcg cttcttggaa ccacagcatc tgacgatggg gattgctggc ttcgtaggcg 9241 atcgcagtcc gaatgatatg cgccattgcg tcttcgggcg gcaaatggtc gagatttagc 9301 tgactaacca ttgcctgaac ttcatcaatc ggacgctgca acacggcttt gtataagccc 9361 tctttattct cgaagtagta atgaattgtg gcagtcgtga tgtgcgcatg atttgcgatc 9421 gcactcaacc tcgcaccttt aagaccgtgc ctagcaaact ccaattctgc ggcgtcaaga 9481 atctgctgct gcgtcacatc tgcatctcga atctgacgaa ccgatttggt aactcgctta 9541 gtcttgactt ttccaaccac agtatcacaa tctctcgttt gttttctcat actaccaagc 9601 aagagatatc ctgtttcgtt gagttaacac gcttattgcg gaattgccaa ttgacgacgg 9661 aatcacctat agcgtaactt actaattaat tagtaagtcg tttaacaagt atttatgaca 9721 atgctgcaag gtgcaccctg gctattagca caccgttcca tgctcaagcc aaatcaacct 9781 gtgaaagttt ctctctacgg taatgactac gttctctggc aagacagcac aggcaaaatc 9841 agttgcttac caaatgcttg ccctcatatg ggtgcgatgc tttcggaagg atggtgtgtc 9901 accaagccag atggtagtag tgtggtggtt tgcccgtttc atgcattgga atttgatggc 9961 tctggctgca ctatcttacc aggttctaat aagccaacaa aatctttgat ggaaccattg 10021 gagttagtga ttcaaggcga ctttatttgg tcttacggtg gttatgagcc gaagattcct 10081 atcccaacaa ttatgaatga aattgcaggg gagtatgaat ttattgggtt cacgggcgag 10141 cgcagtatca aaactgattt cctaagcctc ctgttaaata tgcacgacta caaccaccaa 10201 aatggcaccc atcgtgagtt atttgagatt gaacaagtgc agttgaaaca gtttatcgat 10261 gatggatttc actcccacgc ctacattgat cagccgagga agaaacctac actccgccac 10321 atcctcaaaa atcctgttct catcgcaatg ccaaaagtgc ttcaagctca tttagaaaac 10381 ttcttcccat gcatggcggt gatgcacggt gagaatgcaa tcctcagcct caaagagtgt 10441 catttctata ttccagaatc accaaatcac tcccgcgtct tcattttgat gtttataaaa 10501 gcgcatactc ccattgctca tctgataaaa ggaaatctcc tgcggctgat agatattgtt 10561 gtagagcaag acgctgacat tttgagcaaa ctctatgcca atacacctca gcacatcaag 10621 ctcaataatg aagtcgggat ggattgggtg cggcgaaatt ttgagagttt tccaacagtt 10681 gttgaaccga atttctctcg gtaacggctt acgttagtgt aaaaaattca gaaatagcag 10741 atagtgtctt actcccttcc cggtgcaaga ttacctatct tctttttgat tttcttggcg 10801 tccttggcac gccagg // LOCUS NODE_3208_length_10496_cov_5.88411110496 
bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10496) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10496) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10496 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 723..2048 /locus_tag="DP116_23410" CDS 723..2048 /locus_tag="DP116_23410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318877.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="REC domain-containing diguanylate cyclase" /protein_id="PRJNA477356:DP116_23410" /translation="MSCLKILVVEDEKKNALDIKKRLQKLGHYVSEITEYGDKAIQKL GEVNPNLVLVDICLLGIIDGVRLADIVMNDFQLPVLYLTEDDSEIEQIYENRQTEPFS YITKPVAEQDLQIAIEIAVYKHQTKIKLQEQQQKFMAILKSMGCAVMITDTCGCIHLM NPIAEELTGWKQEEAVTKKLAEILSLVDKDTGVLIKNLATQVIQTGVVLNLPETLTLI AKDDTEIQIGGNIAPIRDDNGNLIGSVVVFQDITQRKQTEAQLVRNAFYDALTGLPNR VLFLERLSQVFERRKRRNNDRYAVLFLDVDGFKGINDSFGHGAGDNLLIEIARRLESC LRSADTVARFGGDEFAILIEDIKDISDTTNVAKRIQETLKLPIYIEEHKISISASIGI ALSCSSYEQPENLLRDADMAMYEAKQQGKARYVVFNSQKSYHTNRPVKK" gene 2209..2766 /locus_tag="DP116_23415" CDS 2209..2766 /locus_tag="DP116_23415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007353101.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sugar O-acetyltransferase" /protein_id="PRJNA477356:DP116_23415" /translation="MEKTEKQKMLAGELYLASDLELIAGRNFALRLQRMYNSTTEEQL EERSQILQELFGKVGQNINIMPPFQCDYGKNIYAGDELYMNFGCVILDCNTVHIGDNV LCAPYVQIYTAYHPTDPEIRLSGKELAAPIRIGHNVWIGGGAIICPGVTIGDNTTIGA GSVVVKDIPANVVAAGNPCRVIRHL" gene complement(3064..3315) /locus_tag="DP116_23420" /pseudo CDS complement(3064..3315) /locus_tag="DP116_23420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311363.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(3385..3816) /locus_tag="DP116_23425" CDS complement(3385..3816) /locus_tag="DP116_23425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nucleoside deaminase" /protein_id="PRJNA477356:DP116_23425" /translation="MDEFMEAAIAQAKQGRQEGGIPIGSVLVKDDKIIGKGHNKRVQD GDPVTHAEIDCLRNAGRIGSYRGTTLYSTLMPCYLCAGAVVQFGIKKVIAGESKTFPG AKEFMVSHGVEVIDLNLDECEQMMSEFIQEKPELWNEDIGK" gene 3842..4765 /locus_tag="DP116_23430" CDS 3842..4765 /locus_tag="DP116_23430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861027.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleoside hydrolase" /protein_id="PRJNA477356:DP116_23430" /translation="MSKQLVLMDHDGGVDDYLATMLLMTMDHIQPLGIVVTPADCYAE PAVSATRKILDLMGCSDIRVAQSTVRGINPFPRLYRRDSFVIDHFPILNQSDAIRTPL LAEPGQDFMIKVLQEAPEPVTLMVTGPLTTVATALDKAPDIEEKIQRIVWMGGALNVG GNVEKNWEPGQDGSAEWNVYWDPISAARVWQTQIEIIMCPLDLTNTVPVTSEIVYKMG KQRHHPISDLAGQCYALVIPQDYYFWDVLATAYLAHPEFYQLREWETEIITTGLSQGR TKIVPGGRKIFAMDQVDKEAFYAYILQQWAR" gene 5290..6333 /locus_tag="DP116_23435" CDS 5290..6333 /locus_tag="DP116_23435" /inference="COORDINATES: protein motif:HMM:PF00353.17" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23435" /translation="MTTNISYTFTYNYGDGEYYKGYGYTTSDSNYYAGQRIDQPSPNE TGLTGYYIIDSVNPTSLTDYTGEVVVTHYYDKDTNLIADPGGNSGGEEPGHLHNYPHN PSTDVNLGVGHNGLGSEAGLAHIDGHYFGFDRETFAVFNNSNEADLAPENPGDTVIKF PNGAFYGSRGDDVIEGGAGDQVIYGGKGNDLIFGDGILGGSSAPSGNDFLIGGDGNDW LYGGRGNDKLNGGWGDDYLNGYGGYSDSETDTLTGGPGADTFGLGYNWRSSSDVDIYY LGSGNAVITDFKASEGDKIRIGGSIGDYTLVQNQNLIGSSGLDTAIYRYGDLIAILQD TTNVIASRDFTIT" gene complement(6821..7120) /locus_tag="DP116_23440" CDS complement(6821..7120) /locus_tag="DP116_23440" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23440" /translation="MKQVANPKGSINGVKLQIWATWLFYAVLIDLADTVANEFEMESE KISIEMLFRSFYHFNHAYNRGFSHDLIAYLTSPKNRDLGIVKYQPKSQQRDTLDL" gene complement(7117..8067) /locus_tag="DP116_23445" CDS complement(7117..8067) /locus_tag="DP116_23445" /inference="COORDINATES: protein motif:HMM:NF033592.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="IS4 family transposase" /protein_id="PRJNA477356:DP116_23445" /translation="MNTTGIQSTVVLPAIGTTRPDSGFTFNGAYVLTLLWRQVASVRE LHRLLNREDLLWCKATCVSQQALSKRFLEFPASIFEQVMMELIPKLQARWVKRKNRPL PVSIRFAKTKFQRILAVDGSTLEALFRHLESLQEKTVCLAGKIYTIVDITTYLPVQIT FEENPNCSDAKMWDWLHGCVPNGTLLIFDRGFYDFTEFAALMTNGVAWITRLKKASYR VQRTLTHTPQVVDQIIELGHKRGRAKPIIVRLVEIRRENTWYRYITSVTNPVDLPPYL VADLYGRRWHIETAFGLVKRLLKMSYLWTGEPLRWTGSPA" gene 8252..8491 /locus_tag="DP116_23450" CDS 8252..8491 /locus_tag="DP116_23450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23450" /translation="MVDNIYERVQLKGHQNSVNSAAFSPDGQRIVTASFDKTAKVWRV GGFDDLLARGCDWLQDYFVTHPEARERLWVCRARR" gene 8981..9199 /locus_tag="DP116_23455" CDS 8981..9199 /locus_tag="DP116_23455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874594.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23455" /translation="MIEKTSTLQKVIEAVEALDIDAQILLIDIITKRLNQQRRDELLK EVAQAQHDYEQGSVRRGSVEDLMAELED" gene 9436..9510 /locus_tag="DP116_23460" /pseudo CDS 9436..9510 /locus_tag="DP116_23460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008179751.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system mRNA interferase toxin, RelE/StbE family" gene 9704..9994 /locus_tag="DP116_23465" CDS 9704..9994 /locus_tag="DP116_23465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23465" /translation="METMKVRSHIGADGILQIQMPTDFKDTSVEVVLVVQPLSEQETA ASESTQVQYNAWGKPTTKKSISHAIALMQQLRREVALDQTSIREMIEEGRRF" gene 9994..10425 /locus_tag="DP116_23470" CDS 9994..10425 /locus_tag="DP116_23470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310380.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="VapC toxin family PIN domain ribonuclease" /protein_id="PRJNA477356:DP116_23470" /translation="MQFVLDCSVAISWCLVDENNPTANAILAMMPDSEAFVPGIWSLE VANTLLVAERRNRMTQQQSQQAIILLQSLFIQVDTATDANALAATITLGRQEGLAAYD AAYLELALRLGLPLATLDTRLAEAATRCGVELIVVDEKRQK" BASE COUNT 3120 a 2003 c 2300 g 3073 t ORIGIN 1 agtttacaaa taagcagtta ttaaccattt aatcaaaaat gtaagtgttt tgaagatgaa 61 ccgcaaaact agaaggttat actgtgttta tcaacagtaa taccactgaa aaacattccc 121 aaaaacatgg tagaaaacat aattttagga cttgatgaaa tctagaaaaa ccgtataatc 181 attaacactt tatgcctacg gcacgcctgc aaagtatgcg caaagtgtgc cgctaggcat 241 acaaagcagc gtaggcgtgg aggagataca gatttacccg acgggagagc aaatcaccag 301 ataagcttga taagcgactt gcgaagaaac agagaaccaa cttttttgca atactagaga 361 gtcgccgctt ttgtttacta gttaatgtcc aaatttgtct gttcatgacg gcggcgcgat 421 caccctcaag gaaagtcaat ctgttttatt gaaatgtatt aaattgccgt tttttttgct 481 attttcgcat aacagtgatc gatgtgataa tatttgtact catcatttac tctggctctt 541 cacactacgt acaaggatga ttttgatact ttttttactt acttttcagc agaaccctga 601 ttgacaaaag cgatgttttt tggagaatag cacatttgac aagatttttt ctgagccaat 661 aactttattg agattgagta acgagtgcaa aaaatataac tttagctaga ggagattttg 721 atatgtcttg tctgaaaatt ttagttgtag aggatgagaa aaaaaacgca ttagatatta 781 agaagagatt gcaaaaatta ggtcactacg tttccgaaat tactgaatat ggagataaag 841 caatacaaaa actgggggaa gttaatccaa atttagtctt agttgatatt tgcttattag 901 gaataattga tggtgtgaga ttagctgata ttgttatgaa tgattttcaa cttccagttt 961 tatatttaac agaagatgat tctgaaatag aacaaatata tgaaaataga caaacagaac 1021 cttttagcta cattactaaa ccagtagcgg aacaagatct gcagattgct atagaaatag 1081 ctgtttacaa gcatcaaacg aaaataaaat tgcaagaaca acagcaaaag tttatggcaa 1141 ttcttaaaag tatgggttgt gcagtgatga tcacagatac ttgtggttgt attcatctga 1201 tgaacccaat agcagaagaa ctcacaggat ggaaacaaga agaagcggtt actaaaaaat 1261 tagcagaaat attaagctta gttgacaaag atacaggagt tctgattaag aatttagcta 1321 cacaagtcat acagacaggt gtcgttttga atttaccaga gactttaaca ttaattgcta 1381 aagatgatac cgaaatccaa attggaggaa atattgcacc 
catccgcgat gataacggta 1441 atttgattgg ttcagttgtc gtttttcagg acatcactca acgcaaacaa acagaagcac 1501 aacttgttcg gaatgctttc tatgatgccc taacaggact acccaataga gttttatttt 1561 tagagagact aagccaagtg tttgagcgtc gaaaaagacg gaataatgat cgctatgctg 1621 tcttgttttt agatgtggat ggctttaaag gaattaacga tagttttggg catggtgctg 1681 gtgataattt gttgatagag attgctcgac gtttagaatc atgcttacgc agtgccgata 1741 cagtagcacg atttggtggt gatgaatttg ctattctcat cgaagatatt aaagacattt 1801 ctgatacaac caatgttgct aaacgtattc aagaaacctt aaaattacca atttacatag 1861 aagaacataa aatatcaatt tcagctagca ttggtattgc tctaagctgt tctagttatg 1921 aacagccaga aaatttgctg cgggatgctg atatggcaat gtatgaagca aagcaacagg 1981 gaaaagctcg ttatgttgta tttaattccc agaaatcata ccatacgaat agaccggtca 2041 aaaaatagta agtacgaata tgtgactgct cgtgattatg cagtcataga aaaaatcaat 2101 cttagtattt gtaaaaatag gaatatgatc tatttttagt taatattgac aatttgatag 2161 attaaagcga tgtttgtcgt atcataatca aaaataataa attctgttat ggaaaaaaca 2221 gaaaaacaga aaatgctggc aggcgaattg tatcttgctt ccgatttaga attgattgct 2281 ggaagaaact ttgcacttcg tttacaaaga atgtataact ctacaactga agaacaactg 2341 gaggagcgat cgcaaatcct acaggaatta tttggtaaag tgggacaaaa tattaatatt 2401 atgccaccat ttcaatgtga ctatggtaaa aatatttatg caggcgatga attatatatg 2461 aattttggct gtgtgattct agactgtaac acagttcata ttggagataa cgttttgtgt 2521 gctccttatg tccaaatcta tacggcatat cacccaacag atccagaaat tcgcctttct 2581 ggtaaagaac tcgctgcgcc aataagaatc ggtcataatg tctggattgg tggcggtgca 2641 attatttgtc ctggtgtgac aattggcgat aatactacta ttggtgctgg tagtgttgtt 2701 gtgaaagata tacctgcaaa tgtcgttgct gcaggcaatc cctgtagagt cattcgacat 2761 ttgtaatctc tagcttataa gagttatttc caaatacttc tgcactgatt gtatgggcat 2821 tctaattctg gattgatagt aagtcgtgct taagcagtgg ggtttatccg acaagcgtgc 2881 ttttttgttc tttctttaat ttcatttata ctgtgctggg cattctaatt ctgagtaaga 2941 gttgttgcct tcaagcactg gagaatcagg atactccagt gtttttttaa tttatgtggg 3001 ttttcataaa ttgagttgac cgctcagaga aatctaaaca aagctaccca aagatagtca 3061 atgctagaca aacccgtttg taatcttaat ccaaagtcct aaaactacag 
ggataaacgg 3121 caatcccact atcccatcca tgaagtttgg gattacttta cgagctatat tcatgcttcc 3181 attggtatca gcattgattg ttctaccagt ggaagttttg tataaaccac gcttaacacg 3241 tttaccacta aaaacaggct tgatgtcaga tttactgttg aaagttggta gtgtatcacc 3301 atctaaagca cttgcagata ggatgacgtt tgatgcgttt ttgcgtcaaa caggtgttgt 3361 gtgaaataat ccaaaatccg ctatctactt gccaatatcc tcgttccata actcaggttt 3421 ctcttgaata aattcgctca tcatttgttc gcattcatca agattgaggt caatcacttc 3481 cacaccgtga gataccataa attctttagc acctggaaaa gtttttgatt ctccagctat 3541 gactttttta atgccaaatt gtacaaccgc cccagcgcac agataacatg gcattaaggt 3601 tgaatagagt gttgtacccc tataactgcc gattctacca gcattgcgaa ggcaatcgat 3661 ttcggcgtgg gtgactggat cgccatcttg cacacgcttg ttatgtcctt tgccgataat 3721 cttgtcatct ttgacgagaa ccgaaccgat gggaattccg ccttcttgtc tgccttgttt 3781 tgcttgtgcg atcgcagctt ccataaactc atctattttc tcattcattt ttttattctc 3841 catgtctaaa caacttgtct tgatggatca cgatggtggt gtagatgatt atctagcaac 3901 tatgctgctg atgactatgg atcacataca gccacttggt attgtcgtca ctccagcaga 3961 ttgctatgcc gaaccagccg taagcgctac acgtaaaatt ctagacttaa tggggtgttc 4021 tgatatcaga gttgcccaaa gtactgtacg cggtattaac cctttccctc gactctatcg 4081 ccgtgattca tttgttattg accattttcc cattctcaat caaagcgatg caattcgcac 4141 accactgctt gcagaacctg gtcaagattt tatgataaaa gtgttacaag aggcaccaga 4201 acctgttacg ttaatggtca ctggtccgtt aacaacagtc gcaactgcac tggacaaagc 4261 accagacatt gaagagaaaa ttcaaagaat tgtttggatg ggtggtgctt tgaatgttgg 4321 tggtaatgtg gaaaaaaatt gggaaccagg acaagatggt tccgcagaat ggaacgtgta 4381 ttgggatcca atttctgcag cgcgggtatg gcaaacccag attgaaatta tcatgtgtcc 4441 tttggatttg actaacactg tacctgtgac atcagaaatt gtttacaaaa tggggaaaca 4501 acgtcaccat cccatatctg atttagcggg acaatgttat gcactggtta ttccccaaga 4561 ctattatttc tgggatgtgc tggcgactgc ttatcttgct catccagaat tctatcaact 4621 gcgagagtgg gaaacagaaa ttatcaccac aggtttaagt caagggcgta caaaaatagt 4681 accaggaggg cgtaagattt ttgctatgga tcaggtagat aaagaagctt tttacgctta 4741 tattttgcag cagtgggcac gttgaatttt tgaccagtta tcaaacaagt tgtttatgcg 
4801 actcatccag agtaaaaaaa ataccgggcg attataaatc gcacgccaga tgctacggtg 4861 aggttccgag attttgggta attcgatcac ttgtgtgtac accgtagcct ttaaggagag 4921 ggaaagattt tagcgccgat tcaagcgatg gtgaggtgaa aagcgatatg tatgtacagg 4981 gtagcttctt attaaggaga agggatgaaa aggatagaca gggttttttt atgaactttt 5041 gtaaaaagtt ctggaatgga acagtaacct tttgaggtta atcagaaatt cctgccgatt 5101 accggacctc aacagccaga aattaatatt tgagaatttg cataaaactc tggctgtttg 5161 agtggagtta catatactag atttcaatca ctgcgtggca ttaggttgca actatgccag 5221 aaagactaga agagcaaacg attgatgtct ataaggagtt aatcacaccg cttgtaaaag 5281 gagatacaaa tgactactaa tatttcctat actttcactt acaactatgg tgatggggaa 5341 tactacaaag gctatggcta tacaacatct gactctaact actacgctgg tcagcgtata 5401 gatcagccct cacctaacga aactgggctt actggttact atatcattga ctctgtaaac 5461 ccaacctcac taactgatta caccggagag gttgtagtaa cgcactacta tgacaaagat 5521 actaatttga ttgcagatcc aggtggtaat tcaggaggag aggaacccgg acatttgcat 5581 aactaccccc ataatccatc tactgatgta aacctaggtg tcggacataa tggattaggt 5641 agtgaggctg gacttgctca cattgatggt cattacttcg gatttgatag agaaaccttt 5701 gctgtattca ataattccaa cgaggctgat ttggctcctg agaatccagg tgataccgta 5761 attaaatttc ccaacggtgc attctatgga agtagaggtg atgacgtaat tgaaggagga 5821 gcaggcgacc aagtcattta tggcggtaag gggaacgacc ttatctttgg tgatgggata 5881 ctaggcggga gctcagcccc tagtggaaat gacttcctga ttggcgggga tggtaatgac 5941 tggctctatg gtggtcgtgg aaacgacaag ttaaatgggg gttggggtga tgattacctc 6001 aatggatacg gaggttacag tgattctgaa acggatacct tgactggtgg tcctggggcg 6061 gatacgtttg gactcggtta caactggcgt tcttcttcag atgttgatat ctactatcta 6121 ggctccggaa atgctgttat cacagatttt aaggcatcgg agggcgataa gattcggata 6181 ggtggcagca ttggtgacta tacgttggtg caaaaccaga acttgattgg cagctcaggc 6241 ttagataccg ctatctacag atacggtgat ttgattgcca tcctgcaaga caccaccaat 6301 gtcattgcct ctcgagattt tacgattacc tagctggttg aggaaacgac atgaaaactt 6361 gtcgtcccac caaaagtgcc aaagtctaag ctcatacgaa tttgaaccgt tgattgtagc 6421 cgtaggtaat ttcaatcgct actggctcct cctcaccttc ccttaaggtt ttagcatact 6481 
ttcaagaaac accgcagcgt ggcagaggat tgagcaacga ggtcaaagcc cggagattga 6541 gcctttttct ggaggttttt ctgggtgcga tcgcacccag aaatcatcat caggtgatcg 6601 cacgtagcta tcaccctcaa gcgatcgctt gaggggtttg agatttttct cgttcccagt 6661 ctctggctat tggtgacaac ttaaagcgat cgccaaaatg tcaagtgggg tgtggagata 6721 tagcagtcgc cacaattgtt cggacatttc cctgttccct gttccctgtt ccctgttccc 6781 tcttaatcaa aacacgatgt cctaaacgat atgtctgttg ctataagtcc aaagtgtctc 6841 gctgttgaga tttgggttga tatttaacaa ttcccaaatc tcgattttta ggagaagtaa 6901 gataagcaat aaggtcgtgt gaaaatcctc gattataggc gtgattaaaa tgataaaatg 6961 agcgaaacaa catctcaatc gaaatttttt cagattccat ctcaaattcg ttagcaacgg 7021 tatcagctag atcaattaaa acagcataaa ataaccaagt tgcccaaatt tgtaacttga 7081 caccattaat cgagcccttc gggttcgcca cttgcttcaa gccggggaac ccgtccaacg 7141 cagtggctca ccagtccaca aatatgacat tttcaacaat cgtttaacta gaccaaaagc 7201 agtttcgatg tgccaccgac gaccataaag gtcagcgaca aggtaaggag gtaagtcaac 7261 tggattggta acagaagtaa tgtatctata ccaagtattt tcgcgtctaa tttcgactaa 7321 acgaacaata attggttttg ctctccctcg cttgtgtccc aattcaatta tttggtcaac 7381 tacttgtgga gtatgagtca gagttctttg tacacgatag cttgcctttt ttaaacgagt 7441 aatccaagca acgccatttg tcatgagtgc ggcaaactca gtaaaatcat aaaaacccct 7501 gtcaaaaatc agtaaagttc cattgggtac acaaccatgc aaccaatccc acatttttgc 7561 atccgaacaa tttggattct cttcaaaagt aatttggact ggtaaataag tcgtgatatc 7621 aacgattgtg taaattttac ccgctaaaca aacagttttt tcttgtaggc tctctagatg 7681 acgaaaaagt gcttctaatg ttgaaccatc cacagccaag attctttgga attttgtttt 7741 tgcaaaccta atacttacag gtagtgggcg gttttttctc ttaacccatc ttgcttgtaa 7801 ctttggaatt aattccatca ttacttgttc aaatatcgat gctggaaact ctagaaatcg 7861 cttagacaat gcttgttgcg atacgcaagt cgctttacac cataacaagt cttctcgatt 7921 taataaccga tgtaactcgc gcactgatgc tacttgtcgc cacaagagcg ttaatacgta 7981 tgcaccatta aaggtaaacc cagaatccgg tcgcgtagtc ccaattgctg gtaatacgac 8041 agttgattgt ataccagtgg tgttaatagc tcctccatcc gcgcggcgat cacttcgttt 8101 tctggctggc gcatgtttct atcccgcttg tggtctgggt ttcccactct tggatttccc 8161 atcgtttttt 
cccttgatgc acctgttgac aagcttacct ttttcttaag ttgtcaccaa 8221 tagtctctgg ctgggaatgc actattagga gattgttgac aacatctacg aacgggtgca 8281 actcaaaggg catcagaact cggtcaatag tgcagcattt agccctgatg gtcaacgcat 8341 tgtcactgca tcatttgaca aaacagccaa ggtgtggcgg gttggcgggt tcgatgattt 8401 gctggcgcgg ggttgcgact ggctgcagga ttattttgtc acccatccag aggcgcggga 8461 gaggttgtgg gtgtgtcggg cgaggagata ggaggaggga gggagcaggg agagaggagt 8521 tactaaactc agatcttgca ccatacttta tcgtccgcca agaaataaat ttcttggctc 8581 aaagtctaag tccgttaaaa cggactcttg ttagtttttc agtctgtttt aacagacttg 8641 gtttattagc ccagaacttg agttctgggc gtactcagcc ggaggtgcaa gatctgagta 8701 aaggcgttaa gctagctgcg ccttcggcaa tcgcccacac gaatcacttt tggcgttaag 8761 ctagctgcgc ctccggcaat cgcgcacacg aatcactttt ggcgatcgct cacacgaatc 8821 acttttggcg ttaagcaatg ctagcgctcc ctgctggagc atcgctcaca cgaatcactt 8881 ttggcgttaa gcaatgctag cgctccctgc cgtaggcaat cgctccttgt gcgctgagtg 8941 actctgttaa attagactta ttccccaatt cattacgagt atgatagaaa aaacatctac 9001 attgcaaaaa gtcattgaag ctgtagaagc attagatata gatgctcaaa ttttgctgat 9061 agatattatt accaagcgcc tcaatcaaca gagacgagat gagttgttaa aggaagttgc 9121 acaagcacaa catgattatg aacagggaag tgtacggcga ggttctgttg aagatttgat 9181 ggcggagtta gaagattgaa aaatttagtt tggagttctg catttgttcg tgcgtttaaa 9241 cgtttagtac gtcaaaatcc acaattacgt tctcaagtag aggagacgct aaaaaaacta 9301 gctgatacca agttgcgttc tgttctagta gtttgtggtt aggggagatt gcttcgctcc 9361 acttcgtttc gtatgccctc cgggcacgct gcgcgaacgc aatgacaggt gctacgtttg 9421 attgcaactt ggtatgaaga ccctttctct cccagtttac acagtcacaa acttaaaggt 9481 gatttagatg gtatatggtc gtgttctatg cagcgaatta ttggcgaagt gtatggtagt 9541 ttaaacgaag aaatgatggt attaattgat gctgcattga aattgcattt gggattaggt 9601 tgagatatga ctctaaacaa gagcgatcgc tcattaagac catcgatttt tgagatatga 9661 cttatataga ttaacgtgta gcagtgaatg attagaggtg gatatggaaa caatgaaagt 9721 gcgatcgcat atcggggcag atggcatatt gcaaatccaa atgcccactg attttaaaga 9781 tacctctgtt gaggtggtgt tagttgtaca gccattatct gagcaagaaa ccgcagcatc 9841 agagtcaacc caagtgcaat 
acaatgcttg gggaaagccg acaacgaaaa aatcgataag 9901 tcatgcgatc gctctcatgc aacaactacg ccgagaggtt gcgttggatc aaacatcaat 9961 ccgcgaaatg attgaagaag ggcgaagatt ttaatgcagt ttgttttaga ttgttcggtg 10021 gcaattagct ggtgtttagt ggatgaaaat aaccctaccg ctaatgccat actggcaatg 10081 atgccagatt ctgaagcatt tgtaccagga atttggtcat tagaggttgc aaatactcta 10141 ctggtagctg aacggcgtaa ccgtatgact cagcaacaat cacagcaagc tattattttg 10201 ttacaatccc tgttcattca agttgatacg gctacagatg ctaacgcatt agctgcaacc 10261 ataacactag gacgacagga aggtttagct gcttatgacg cagcttattt ggaattagcg 10321 ctgcgattgg gattaccctt ggcaactctg gatactcggt tagctgaagc agcgactcgt 10381 tgtggtgtgg agttaatcgt tgttgatgag aaacgccaga aatagacggg taataccaat 10441 tcactttaaa gttagaacat agcgggcaag cagggggcta ggggtcccct ctgggg // LOCUS NODE_3292_length_10163_cov_5.69618110163 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10163) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10163) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10163 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(6..1013) /locus_tag="DP116_23475" CDS complement(6..1013) /locus_tag="DP116_23475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875926.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_23475" /translation="MKAVLMTAAGDPEVLQVQDVQNPAVPLGETELLVHLRAAGINPI DTKLRKRGTFYPDKLPAILGCDGAGVVEAVGASVQKFRVGDEVYFCNGGLGGHQGNYA EYTTVDERFVARKPASISFAEAAAAPLVLITAWEALYERGRLEPGERVLIHAGAGGVG HVAIQLAKLKGADVSTTVSTQEKADFVQKLGANHVILYKETDFVQAALDWTNGEGVDL AFDTVGGETFHKTFPAVRVYGDIVTILEPDANTVWKSARQRNLRVGLELMLTPMLQGI VEAQQHQAEILEQCAKWIDAGKLKIQVSDTFPLEEARSAHHLLETGSVTGKIVLLMGD K" gene complement(1235..1807) /locus_tag="DP116_23480" CDS complement(1235..1807) /locus_tag="DP116_23480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860350.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_23480" /translation="MTLSQYHPYLSPEQYLETEKSSPIKHEYIQGQIYAMSGASDVHV TITANLVTLLRNHIRGTGCRVYVADMKARIETLNIFYYPDIMVTCDQRDTKFEYFKRY PSLIIEVLSPSTEAFDRGDKFSDYQELETLQEYVLISQTRQRVDCFRRNSEGRWVLYS YRRNQDLELTSVNFSCSLAEVYEDVLFSEI" gene 2058..2549 /locus_tag="DP116_23485" CDS 2058..2549 /locus_tag="DP116_23485" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23485" /translation="MARIIYETENSLEITELRAEVLSKRNCGEVLFEIKKLIPDETIE RSENLTAILDLFVSQLGYSSLGLRWKEVNQGEAQKILKFIMTKDLAYSVQLMSLEEAE KIVVKLFQIFPGNCKFFTNALFRNNYSGISAWDSITKATFDTGIIVVSERRIGILWVQ DED" gene 2865..4136 /locus_tag="DP116_23490" CDS 2865..4136 /locus_tag="DP116_23490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739695.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23490" /translation="MSKLALPLKIAGGAILLLSFFSSRVNSVTISEGFENGTKGSYAA GDVTLSTGVWNFNDALIGNLSTDVKTGTQSARIRNNGKVTMKFDRTGAGTVTIKHAKF GSDASTTWELWCSTNSASSWSKIGSTITTSSTSLQTATFTPNLSGTVRCEVRKTDGTS NRTNIDDIEISDYGSSPPSSSLPPGSVPFFDNINNPVSGLAYGSPADVTPPAPVPNSF DTAVTNLCGAPGTVVSRAGFQSMMQNNSTVLANIKQYVGGYLKPGRTTDAAFLDDLTD VWFNAQAFDHIFCGEPVQGGSIGGLHFVARYVELQEKGLAGRLDNNTSREEVVPGTIY TIGVVMKVGSGTAQSSIKGYPYTLNAEEILSKASLGYKNNPNTTSTNTVCNLSVTDEG KTFTAIFVRRDGGIRTFYPDATPDSNPNCTQ" gene complement(4645..5631) /locus_tag="DP116_23495" CDS complement(4645..5631) /locus_tag="DP116_23495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459831.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3641 domain-containing protein" /protein_id="PRJNA477356:DP116_23495" /translation="MIQTAITPFKQKLASPLTKKEITVLQINLGKRCNLACNHCHVEA GPKRTEELSPEICQQLIEIIHKFPQIQIVDLTGGAPEMNYGFKPLVEAARATGKQVIV RSNLTIFFEDGFDDLPEYFAKHKVRVVASLPCYLSDNVDKMRGAGVYDGSIKALQWLN QLGYGKEPDLIVDLVYNPQLPKDEKFSLTPDQTKLERDYKQYLQENFDITFNNLFTIT NLPVGRTKLSLERKKLYAPYLQFLESHFNTGTIEHLMCRDEISIDYLGYVYDCDFNQM MNLPAKTRDGENLTVAKLLQAGSLDLINEVQTAAYCYGCTAGTGSSCGGALL" gene complement(5811..6779) /locus_tag="DP116_23500" CDS complement(5811..6779) /locus_tag="DP116_23500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212770.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_23500" /translation="MSYLETAAQFYSEVAQTPQVGLCCVQSTPLQLPGLKIPRKMQEM NYGCGTTVHLTELGNQPTVLYVGVGGGLEALQFAYFSQRPGAVIAVDPVAEMRQAAAR NLEIAAQENPWFDTSFVEIREGDAFNLPVLDDSVDIVAQNCLFNIFEPEDLTRALQEA YRVLKPGGKLQMSDPIATRPIPQHLQKDEQLRAMCLSGALTYEQYIQRIISAGFGQVE IRARRPYRLLDSQTYNLEENLLLESLDSVAFKVIIPEDGACVFTGKTAIYAGSESFFD DDAGHILQRGIPASVCDKTAAKLAALMPEQIMMTDSTWHYNGGGCC" gene complement(6891..7565) /locus_tag="DP116_23505" CDS complement(6891..7565) /locus_tag="DP116_23505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138262.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_23505" /translation="MTSHILLVEDEVKLARFVELELSYEGYKVSVAHDGLTGLTTARE SHPDLVILDWMLPGLSGLEICRRLRSTGDQVPIILLTAKDEVSDRVAGLDAGADDYVV KPFSVEELLARVRAHLRRTQEADTNTLQFEDLSLNRRTREVFRGTRLVELTAKEFDLL EYLLIHPRQVITRDRILEEVWGYDFMGDSNIIEVYIRYLRLKLEANEEKRLIQTVRGV GYALRE" gene 7770..9020 /locus_tag="DP116_23510" CDS 7770..9020 /locus_tag="DP116_23510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015117068.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_23510" /translation="MSLKLHNWQQERRTLEILSSLSYRRGELQSYLRQITCGVCELLE LDWSVVTLCQDGYEKVLASSIDLGDDNDQVYLLHGSLTGTVVQTGHSLVVPDAKICND YGEAPEGYQAYLGVPLCTPEGKIVGTICSFHKTSRQFSADEVRIAELFAERAATAIDN YFLYQQQRQFNQILEAEVARRTEELRAAQAKLVEQERLAAIGEFAACIIHEIRNPFTT MKMGLNFFQKLDLSAPAKERLFLALDEAHRLERLLKEILLYAKPQTLQLEKIDINEFI PEILPSLQNMSEAVGREVDFYPAMNEVKVEADKDKLKQVFINLVRNAFEAIPKGETVK LQIESNTNRNQVCIHVHNGGEPISPEILPKLTQPFYSTKSSGTGLGLAISKRIVEVHS GELLMKSNLAEGTTVSVRLPIVTA" gene complement(9041..9328) /locus_tag="DP116_23515" CDS complement(9041..9328) /locus_tag="DP116_23515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869617.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase" /protein_id="PRJNA477356:DP116_23515" /translation="MTSFTQLGPQNDLLQPVRQWFDSIEIHNAKLAHSLCKLIPAQCP FERDITLFGRKLFHIPPMCKLNPLYEEVVTLRFKALCYLADECGEDVTAYC" gene complement(9579..9932) /locus_tag="DP116_23520" CDS complement(9579..9932) /locus_tag="DP116_23520" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23520" /translation="MLTKGISITNQSGEKVSAEVTVKDKRAAIARITPGGFSGGKPSP CAESTGQSGCAGLGRRAGEAIAPRVRFVCPVSQTWLYPVYGRSVGLTTDIPFGLVRVQ TYGASPNQSKSENQG" BASE COUNT 2999 a 2124 c 2082 g 2958 t ORIGIN 1 tcccatcact tatctcccat tagcagaaca attttacccg tcactgaccc agtttcaagc 61 aaatgatgag ctgatcgtgc ttcttctagt ggaaatgtgt cgctaacttg aattttcaac 121 ttccctgcat caatccactt agcacattgt tctagaattt ctgcttgatg ttgctgcgcc 181 tctacaattc cttgcaacat tggtgtcagc attaattcta aaccaacgcg gagattacgt 241 tgtctggcag atttccaaac ggtattggca tctggttcta atattgtgac tatgtcgcca 301 taaactcgca ctgctggaaa agttttatga aaagtctctc cacctacagt gtcaaaagct 361 aagtctacac cttccccgtt tgtccaatcg agtgctgctt gcacaaagtc tgtttctttg 421 taaagaatga catggttagc accgagtttc tgaacaaaat ctgctttctc ctgagtactt 481 acagtcgtgc tgacatcagc acccttaagc ttcgccagtt gaattgctac atgaccgact 541 ccaccagcac ctgcatgaat taatacccgt tctccaggtt ccaagcgtcc ccgttcgtat 601 aaagcttccc aagcggtgat gaggactaac ggcgctgcag cagcttctgc aaaggatata 661 gaagcgggtt tacgggcaac aaatcgctca tcaacagtcg tatattcagc ataattgcct 721 tgatgtccac ccaagccacc attacaaaaa tagacttcat cgcctacgcg aaatttctga 781 acactagcac ccacagcctc aacaacacct gcaccatcac atcctaaaat agcaggcaat 841 ttatcagggt aaaaagtgcc acgtttacga agtttggtgt caatagggtt aatgcctgct 901 gctcgtaagt gtaccagaag ttcagtttca cccaaaggaa cagcaggatt ttgcacatct 961 tggacttgta gaacttcggg atcgccagct gctgtcatta aaactgcttt cattgcttcg 1021 cttcctcttc tgtgctaaag ggtagttgca tttgacaatt taccgcaaag ggtagggcat 1081 tatgagcaat tcaaaattca atgtatgccc tacgggtacg cttgttctac aaaaatactg 1141 tattacttta tataaaaaac taagcgctat tatgaatata agttgatgta cagtaaggac 1201 aattcatcct ttaattatgc aaccttgagc atctctatat ttccgagaat aaaacatctt 1261 cgtaaacctc tgcgagggaa caggaaaaat ttacactggt taactctaaa tcttgatttc 1321 ttctatagct gtaaagcacc catcttcctt cagaattgcg tcgaaaacaa tcaacacgct 1381 ggcgagtttg actgattaag acatattctt gcaatgtttc taattcttga tagtcactaa 1441 atttatcgcc ccggtcaaaa 
gcttctgtcg atggtgataa aacttctata attaagctgg 1501 gatagcgttt gaaatattca aattttgtat ctcgttggtc gcaagttacc atgatatcag 1561 gatagtagaa aatattcagt gtttcgatgc gggctttcat gtcagcgacg taaacacgac 1621 accctgttcc tcggatatga tttcttagca gggtaactaa gttagctgta attgttacat 1681 gaacatcact tgcaccggac attgcataaa tctgcccttg aatatattca tgtttaattg 1741 ggctagattt ttcggtttct aggtactgtt ctggggaaag atagggatga tattgactaa 1801 gagtcattga ataactccag ttgtatggct gtggtttatc gttgctaggg ttgtttcgtt 1861 tctattttaa gttgatactt tgatttattt tcccctttga tttttgagaa tcactcgtgg 1921 aaactcttaa cagggaacag ggaacaggga acagggaaca gggaacaggg aactcttaac 1981 gcttaacacc gaagaaggaa aaaggtgtac gtagctgagc gaaaatcaaa taggagtctc 2041 atagtaacat aaaaaaaatg gctagaatta tttacgaaac tgaaaattcc ttagaaatta 2101 cagaactcag agcagaagtt ttgtcaaaac gtaattgtgg agaggtgttg tttgagatca 2161 aaaagttaat accggatgaa actattgaaa gatctgaaaa tctgactgca attcttgatt 2221 tatttgtgag ccaattaggg tattccagtc ttgggttgcg ctggaaagaa gttaatcaag 2281 gagaggcgca aaaaatctta aagtttatca tgacgaaaga tttggcttat tctgtgcaat 2341 taatgagctt agaagaagca gagaaaattg ttgttaaact ctttcaaatt tttccaggta 2401 attgcaaatt tttcacaaat gcgttatttc ggaacaatta ctctggcatc agtgcatggg 2461 attcaataac aaaagccaca ttcgatacag ggattattgt tgtcagtgag agacgaattg 2521 gtatcctctg ggttcaagat gaagattaag cagaagtcag tgaagaagta gggaagcacc 2581 cgagcagcgg gcaaagggag gagagtatac tctgattgaa ctcataaagt ttttaaaaat 2641 ctcacacgtg tagagatgtt gcatgcaacg tctctacata ggttagcctt tttgcaaatt 2701 atctgatttg aaccatattg aaatgatcaa ttcaagaaaa cttagcattt caatggttaa 2761 taacttgtta atcaacaatt aatcaaacaa taagtgattg ataacgttat aactatcaat 2821 tcctatttca atgagtaatg agttaagcat taaccctgtt aaatatgtct aagttagcat 2881 taccattgaa gatagcagga ggagccatct tactgctttc ctttttttca tctagagtaa 2941 atagtgttac aatttcagag ggatttgaaa acggtacgaa aggcagttat gcagcaggtg 3001 acgtcaccct cagtacaggc gtttggaatt ttaatgatgc gctcatcggt aatctttcaa 3061 ccgatgttaa gaccgggacg caatctgctc gcattcgcaa taacggcaag gtgacgatga 3121 aatttgatcg tacgggtgct ggtacagtta 
ctatcaaaca tgccaagttt ggttctgatg 3181 ctagtaccac ttgggagttg tggtgttcaa ccaatagcgc atcttcatgg tcaaaaatag 3241 gttcaacaat tacgacaagc tctacatcac ttcaaacagc aactttcact cctaatctct 3301 ctgggacagt tcgctgtgaa gtccgcaaaa ctgatggaac ttcaaacaga accaatattg 3361 atgacattga aattagcgat tatggctctt ctcctccttc ttcttcctta cctcccggtt 3421 ctgtaccatt ctttgacaat atcaacaacc ctgtctctgg tttagcttac ggtagcccag 3481 ccgatgttac acctcccgcg ccagtaccaa acagctttga cacagcagtg actaatcttt 3541 gtggagcacc tggtacagtc gtcagtcgtg cgggcttcca gtctatgatg cagaacaact 3601 ctactgtctt ggcaaatatc aagcaatatg tgggaggata tcttaaacca ggacgcacca 3661 cagatgcagc tttcttggat gacttgactg atgtctggtt taatgcgcag gcttttgatc 3721 atatcttttg tggggaacca gttcaaggcg gttcgattgg gggactgcac tttgtcgctc 3781 gttatgtgga acttcaggaa aagggtttag ccggacggtt agacaacaat acatccagag 3841 aagaggttgt tcccggtaca atttacacta ttggcgttgt tatgaaggta ggtagtggta 3901 ctgctcagtc ttctatcaaa ggttatccct ataccctcaa cgctgaagag attctatcga 3961 aagcatctct aggttacaag aacaacccaa ataccacctc gactaacaca gtttgtaatt 4021 tgagcgtaac tgacgagggt aaaacattca cagccatatt tgtgaggaga gatggcggga 4081 ttcgcacttt ctatcctgat gctacccccg atagtaaccc caattgtacg cagtaatgat 4141 gaagtctggt taatgggcta tctatcaaga accgcgcctc gtactgcgat cgctaaggtg 4201 tgaggagatt tcaactccgc atccatccca aaaaaataaa agccgtggaa cattcaattc 4261 cacggctctt gttgtagtgt gaggagctaa actttctatc atcataatgc taactgacga 4321 gtataaacat gctcgtaacg attttggtaa cagaaatttt tctgccagta ttcacaacct 4381 ttctcgcctc tgggaactaa cacccttgcg tatcatcctc acattgggtg agtttctgtt 4441 tagattccat aaaaaaataa ataaccgtgg aagcaccaga tgagtccacg gctataaatg 4501 tggtgtgagg agacttaact taaaatcatc ataaacttct tggagtcttg gtaaggcgct 4561 cgtaacagtt ttggtaacga gattattcac caagccggat aaatcacttg ctcaagcctt 4621 tcccagacaa tcatgcaagc aacttcacag caaagcgcca ccacagctag aaccagtacc 4681 ggctgtacag ccataacaat aagctgctgt ctgtacttca tttatcaagt ccaaagaacc 4741 agcttgtaat aacttggcga ctgtcaggtt ttccccgtcg cgagtttttg caggtaaatt 4801 catcatttgg ttgaagtcgc agtcatacac ataacccaga 
taatcaattg aaatctcatc 4861 ccgacacatc aaatgctcaa ttgtaccagt attgaaatga gactctaaaa actgcaaata 4921 aggagcatac agtttctttc tttctagaga cagttttgtt ctcccaactg gtaagttggt 4981 aatcgtaaat agattgttaa aagtaatatc aaaattttct tgtaaatatt gcttgtaatc 5041 tctttccagc tttgtttggt ctggagtgag ggaaaatttc tcatctttcg gtaattgtgg 5101 attgtaaact aaatccacga ttaaatctgg ttctttccca taacctagtt ggttcagcca 5161 ttgcagtgct ttaatggaac catcatacac gccagcaccg cgcattttat ccacattatc 5221 tgacaaatag cagggcagag aagcaacaac cctaactttg tgtttggcaa agtattctgg 5281 caaatcatca aatccatctt caaaaaaaat ggtcaaatta gaacggacaa tcacctgttt 5341 ccctgttgct cttgctgctt ctaccagtgg tttgaaacca taattcatct caggtgcacc 5401 accagtcaaa tcaacaattt gaatttgagg aaatttatga attatctcaa tcaattgttg 5461 gcagatttct ggagaaagtt cttctgtgcg ttttggtcca gcttctacat gacaatggtt 5521 acaagcaagg ttgcagcgct tacctaggtt aatttgtaaa acagtgattt cttttttggt 5581 taacggagag gcaagttttt gtttgaaagg tgttatcgct gtttgaatca ttgtctcact 5641 tatatagatg caaaatgaaa gatgtgtctt ttttggattt ttctctcgtt cccatgcaga 5701 gcatgggaat gcatcatttg aggctctgcc tcccaatgat tatattaagg caaagcctca 5761 agttatgcat tccttgcctg aggcaaggaa cgagaaaacg tacgctaaaa ttaacaacaa 5821 ccgccgccgt tataatgcca agttgaatca gtcatcatta tttgctctgg cattaaggct 5881 gcaagtttgg cagcagtttt atcacacact gatgctggaa ttccacgctg aagaatatga 5941 ccagcatcat catcaaaaaa tgattcagaa ccagcataaa tagctgtttt tcctgtgaac 6001 acacaagcgc catcttcagg aatgataact ttgaaagcga cagaatctag actttctaaa 6061 agaagattct cttctaagtt gtaagtttga gaatcgagca aacggtaggg gcgacgagcg 6121 cgaatttcta cttgcccaaa gcctgcactt ataatacgct gaatgtactg ttcataagtg 6181 agtgcgcctg ataaacacat tgctcgcagc tgctcatctt tttgcagatg ttgcggaatg 6241 ggacgagtcg caattggatc actcatctgc aattttccac ctggttttaa tacccggtat 6301 gcttcttgca aagcacgggt taaatcctct ggttcaaaga tattgaaaag gcaattttgc 6361 gctacgatat caacagaatc atcaagaaca ggtaagttaa aggcatcgcc ttcgcgaatt 6421 tctacaaagc tggtgtcaaa ccaagggttt tcttgagcag caatttccaa gttacgtgct 6481 gcagcttgac gcatttctgc tactggatca acagcaataa cagcaccagg 
acgctgagaa 6541 aagtaagcaa attgcaaagc ttctaaacca ccgccaactc caacatacag taccgtgggt 6601 tgatttccaa gttcggtgag atgaacagtc gtcccgcaac catagttcat ttcctgcatt 6661 ttgcgaggaa ttttcaaccc tggtaattgc aagggtgtgc tttgcacaca acaaagtccg 6721 acttgcggtg tttgggcgac ttcactataa aactgcgccg ctgtctcaag ataactcatt 6781 gcactcgctg atgttgacta ccgttcttaa gtgtagcggt aagtaaatca gtgaattcat 6841 atcaaatggg taatttattt atcgaacaaa tgcagtatgg ggcgttcaga ttattctcgc 6901 agtgcgtaac caactccacg tacagtttgg atgagacgtt tttcctcatt cgcctccaat 6961 ttcaggcgca agtagcgaat gtaaacctca ataatgttgg agtcgcccat aaagtcgtag 7021 ccccagactt cttcgagaat gcgatcgcgc gtaataacct gtcgtggatg gatcagtaaa 7081 tactccagta aatcaaactc tttggcggtt aactcaacta agcgcgtccc tcgaaaaact 7141 tcccgcgtgc gacgatttaa actcaagtct tcaaattgca aggtattagt gtcggcttct 7201 tgggttcttc gcaagtgtgc gcggactctg gctaacaact cttcgacgct aaaaggtttg 7261 acaacataat catcagcacc tgcgtctaaa ccagcaacgc gatcgcttac ctcatcttta 7321 gctgttaaca atataattgg tacctgatct ccagtgcttc gtaaacgtcg gcaaatttcc 7381 aacccagata aaccaggtag catccaatcc aatatcacta aatctggatg agactctcgt 7441 gcagtcgtca gtcctgttaa cccatcgtgt gcaacactaa ctttataacc ctcataactc 7501 agttccaatt ccacaaatcg agcgagttta acttcatctt caacaagtaa aatatgcgat 7561 gtcatgaata tttaataaca gataaagtta cttttcttta catctactta ctacaaaacc 7621 tgatactttt gtattgatat tatacagaaa ctaagtgaaa aaccctgata agaattgttg 7681 cactttttga cgactcaatt tgacaacaga attgatcacc ctcatttgta tatgaatata 7741 ctattgtttc atcaagcaag gagcattcta tgtccttgaa attacataat tggcagcaag 7801 aaagacgcac actagaaatc ttatcttcac tcagttatcg cagaggtgaa ctccagagtt 7861 acttgcgaca aatcacgtgt ggtgtttgtg aattgctaga actagattgg tcagtcgtca 7921 ctctttgcca agatggttat gagaaagtgc tagcgagtag tatagacttg ggagatgata 7981 atgatcaagt ttatttactg catggtagct taactggcac agttgtgcaa actggacaca 8041 gtctagtggt accagacgcc aaaatttgta acgattatgg tgaagcacca gaaggatatc 8101 aagcttattt gggggttcct ctgtgcactc ctgaaggaaa gattgttgga acgatttgtt 8161 cgtttcacaa aacgtcgcgc cagttttctg ctgatgaagt tcgcatagct gagttatttg 
8221 ctgaacgtgc ggcgacagcg attgacaatt attttctgta ccagcaacaa cgacaattca 8281 atcaaatcct agaagctgag gtagcgcgga ggacggaaga attaagggcg gctcaagcta 8341 aacttgtaga acaagaacgt ttagcagcaa taggcgaatt cgctgcttgt attatccacg 8401 aaattcgcaa ccctttcaca acaatgaaaa tggggttaaa cttttttcaa aaacttgatt 8461 tatctgcgcc agcaaaagag cgattatttt tggcactaga cgaagctcat agattagaaa 8521 gattgttaaa agaaatattg ttatatgcca aaccgcagac actacagtta gaaaaaatag 8581 atattaatga attcattcct gaaattctgc cctcgctgca aaatatgtca gaagctgtag 8641 gacgggaagt tgatttttat ccagcaatga atgaggtgaa agtagaagca gataaagata 8701 aactgaaaca agtctttatt aatcttgtgc ggaatgcatt tgaagcaatt cccaaaggag 8761 agactgtgaa attgcaaatt gagagtaaca caaatcgaaa tcaagtctgt attcatgtcc 8821 acaatggcgg tgaacctatt tccccagaga ttttacccaa gctaacccag cccttttact 8881 ctacaaaatc ttctggaact gggttaggac ttgccattag caaacggatt gtggaagttc 8941 atagtggaga gcttttgatg aaatcaaatc ttgcggaagg taccacggtc agcgtgcgat 9001 tacctatagt tactgcatga tatcactacc agacgtgcta ttagcaatag gctgtaacgt 9061 cttcaccaca ttcatcagca agataacaca aagctttgaa acgtaaagtc accacttctt 9121 cataaagagg atttaactta cacataggtg gaatgtgaaa tagcttgcga ccaaatagtg 9181 taatgtcacg ttcaaaagga cattgagccg ggatcaattt gcagagtgaa tgagctaatt 9241 tggcattatg aatttctatt gaatcaaacc attgacgaac aggttgcaga aggtcgtttt 9301 gtgggcccag ttgagtaaag ctagtcataa tgatttacct cgtcttgata attgcaaagt 9361 tggtcttgtt tgttgggttt gaaccctttc ctatatatct atagggagca gttgtctgtc 9421 ttatgcagtt gttttttgtt ctcttttttg ggaacactca ttactacaaa ctccagttct 9481 agtgttccgg acgatttccc cgatattttt gttgcacata gaatttttct actgaaaggg 9541 tataaacagg cttgaggggc gtttttgcat caccgcccct acccttggtt ttcactcttc 9601 gattggtttg ggctagcgcc gtaagtctgg actcggacta aaccaaaggg gatgtctgtc 9661 gttagcccga cgcttctacc gtaaactggg taaagccacg tctgcgaaac agggcaaacg 9721 aaacggacgc gcggagcgat ggcttcgccg gccctacgcc ccaatccggc gcagccggat 9781 tgccctgtgc tttcagcaca gggggagggt tttcctccag aaaaccctcc gggcgtgata 9841 cgggcgatcg ccgctctttt gtctttaaca gtaacttctg ctgatacttt ctctccagat 9901 
tgatttgtta tgctaatacc cttggttagc attaacccgt ctggtaaatc aataaaaagc 9961 tctgaaacga gcgtaaccca acaaaacctt tcggttggtg ttgggtttcg ttcctcaaga 10021 tgttagcgta gtgagcgaaa gatgtatggt gtgcaaactg ttcagtgtac tgctatacaa 10081 aacttgacta aactttttca aacctagcca ttcaactaaa gaaaacttta gagggtctgc 10141 ttcctgcttc taccaaggga gtc // LOCUS NODE_3313_length_10088_cov_5.17910910088 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10088) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10088) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10088 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..231 /locus_tag="DP116_23525" CDS <1..231 /locus_tag="DP116_23525" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23525" /translation="LIVQVNSKLQEVFKQNLSIVEIFQNPTIKSLAQYFSQKSEDVPS IQSMRDRAQKQIQAINHRQKQQLSKQGKKIYG" gene 224..4840 /locus_tag="DP116_23530" CDS 224..4840 /locus_tag="DP116_23530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358392.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23530" /translation="MASATNLDSLEGIAIIGMAGRFPGAKNIEQFWHNLRSGIESISV FTDEELISSGIDPAVVSDPNYIKVSAVLEDTDLFDASFFGFNHREAEITDPQHRLFLE CAWEALENAGYDSTRCESRIGVYAGASLNNYLSFNLNHDQIGSAITFQKLIGSDKDFL TTRVSYKLNLTGPSFTVQTACSTSLVATTLACQSLVNYQCDMALVGGVSIRVPQKTGY LYQEGGILSPDGHCHAFDAMARGTIVGNGVGVVVLKRLADALADRDSIHAVIKGSAIN NDGSLKVGYTAPSVDGQAEAIAEAQALAGIEPETVSYIEAHGTGTSLGDPIEIAALTK VFRASTQKKGFCAIGSVKTNVGHLDAAAGITSLIKTVLALKHKQIPPSLHFEKPNPQI DFANSPFYVNTKLSEWKTNGTPRRAGVSSFGIGGTNAHVILEEAPVVELSDPIVLKSR PWQVLMLSAKTSSALETATANLANYLQQHPDLNLADVAYTLQIGRQGFEHRRTVVCRS IEDALDALVDPKRVLTGIQETQERPVAFMFPGQGAQYVDMGKELYQSEPIFRDQVDLC CQLLQPHLGLDLRSLIYPNESESKVAAEKLQQTDITQPALFVIEYALAQLWMSWGISP GAMIGHSIGEYVAACLAGVMSVADALALVAARGRLMQQLPSGAMLSVPLPEEEVRALL DEKLSLAACNAPALCVVSGTHDAIDAFQNKLQGIECRRLLTSHAFHSSMMERILERFQ KEVSKVKLHPPKISFISNVTGTWITASQATDPNYWATHLRSCVHFSQGISVLLQEPNR ILLEVGSGRTLCTFALKHSDAVGLSSLPHPKEKDSDVAFLMNTLGKVWLSGVQIDWSR FYAHQRRYRIPLPTYPFERQRYWIESQKNTRDVNLSQTALEQKLDIKDWFYIPSWKRS VPPISFETRRLTVEKQCWLVFVDTCGIGTQILEKLKRENQNVITVKVGEQFCHSGECE YTINPQNKNDYDALLKAIRNLGQIPTIIAHLWNITPSEYISSRLESCEKAQDIGFWSL VFLAQALGEQNISDSIQIDVVSNNMQQLLDEDELCPEKATILGPCKVISQEYSNITCR SIDIKLPQSGTRQWEQLINHLLTELAASTSEQVIAYRGNQRWVQCFEALPIESQTSTT ARLREEGVYIITGGLGEIGLIFAEHLAKTVQAKIVLIGRSGLPPKAEWEQWSSSHDDQ DLLSTKIKKVQILEELGAEVLVLTADVANLEQMQALVNQVRDRFGEIHGVIHAAGVPG AGLIQLKTTELATNVLEPKVKGTLVLDAVLQDINLDFLVLFSSITATAGGFGQVDYCA ANAFLDAFAHYNFYQQQIPTVSINWDWWQGNNWADSLMSAVPEFQAEFKQMRERYGIS FVEGVDAFSRILSTKLPQVVVSTQNLQTVIDKFKSFAAPISSEKLETSEQSKPKHPRP ILGIAYVPPSSDLEQKIADIWQELLGIEQVGINDNFFDLGGHSLLATQLVSQLRKDFQ VELSLRHIFEAPTIAELALMIEDLILRELEELTEDEANVYAVRE" gene 4881..5744 /locus_tag="DP116_23535" CDS 4881..5744 /locus_tag="DP116_23535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358391.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="3-hydroxybutyryl-CoA dehydrogenase" /protein_id="PRJNA477356:DP116_23535" /translation="MKIQVVGVVGAGVMGIGVAQNLAQTSHQVILVDISEEILDKAKK EIKNNIRFQGFFNKNEKAENPDNILHRIKFSTNYKFLESAEFVIENATEKWDIKKEIY AQLDAICPPETVFAANTSAISITRIGSATKRADKIIGMHFMNPVPMKPMVEMIRGYHT SDETISTAKELLAQMGKEGILVNDSPGFVSNRVLMLTINEAIFLIQDQVASVSEVDRI FKTCFGHKMGPLETADLIGLDTILFSIEVLYESFNDSKYRPCPLLKKMVDAGLYGRKS GQGFYTYNRAI" gene 5800..6051 /locus_tag="DP116_23540" CDS 5800..6051 /locus_tag="DP116_23540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358390.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl carrier protein" /protein_id="PRJNA477356:DP116_23540" /translation="MKEIQPKIKEFLSRFFRNYDLQPDEDIFALGFVNSMFAMQLVLF MEQEFQISIDNEDLEFDNFRTINAMTRLIERKTAFVVQK" gene 6215..7399 /locus_tag="DP116_23545" CDS 6215..7399 /locus_tag="DP116_23545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409825.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="PRJNA477356:DP116_23545" /translation="MKIELTTQQKDAQAQFRAFVDSEVMPYANHYDQEECTPPKLIEK LAQKGYLGAILPKVVGGIGMDFITYGLLNEEIGRGCSSLRSLLTVHCMVAHAVSKWGN KSQKEYWLPKLASGEVIAAFALSEPNVGSDAKSIETTATLSGDSYVLNGQKKWITYGQ IADVFLVFAQCAGKPSAFLVEKNSPGLLIKPISGMLGVRASMLAELHFRDCRIPQENV VGKLGFGFSYVASSALDYGRYSVAWGCVGIAQACLEACIQYTSQRKQFGAYLKEHQLI RQMITEMIANVKAARLLCYQAGYLKDISDPSSIIETSIAKYFASTTATKVANNAVQIH GANGCTNEYSVARYLRDAKIMEIIEGSTQIQQITIADYGYQEYMSQPTPAVISQNLLA RT" gene 7404..8474 /locus_tag="DP116_23550" CDS 7404..8474 /locus_tag="DP116_23550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409824.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23550" /translation="MISTKAELSNYKQADKKAIKCVVWDLDNTIWDGVLLEDDHVELR SQVVDIIKTLDSRGILQSIASKNDYTRAMEQLQEFGLHEYFLYPQINWNSKSSSIQEI AKSLNLGIDTFAFIDDQLFELEEVNFSYPEVLCINAAELAHLLDMPEMNPRFITEDSK LRRLMYISDIERNNGEKEFVGTQSEFLATLNMCFTMSSAQEEDLQRAEELTVRTNQLN TTGYTYSYDELNHFRQSDKHKLLIASLEDKYGSYGKIGLALVECQEFVWTIKLLLMSC RVMSRGVGTILLNYIMTLAKNNKVRLRSEFVSNNRNRMMYVSYKFAGFKETEKKGDLQ ILENDLTRIQPYPEYVNIKIMD" gene 8513..>10088 /locus_tag="DP116_23555" CDS 8513..>10088 /locus_tag="DP116_23555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="non-ribosomal peptide synthetase" /protein_id="PRJNA477356:DP116_23555" /translation="MDNKKDEILKRRSKLSPMQRELLEKRLRGEVNSHSQLKVIPRRS QTSPAPLSFAQQRLWFLHQLEPSSPHYSELACLRLTGALKIDALEQSLNDIVQRHEAL RTTFEIVEEQAVQVIHPTITVALPVVELHLMSEAVRQAQIEQLTTEIAQKPFDLASGS LLRAMLLQTGLQEHVLLFAIHHIAVDGWSIGVLIRELVALYEAKSCEKTSPLPELPIQ YADYAIWQRQWMQGELQKTQLSYWKQQLAGASTLALPTDRPRPPVQSFRGAVTCFKLS PTLTNTLRSLSNREGVTLFMTLLAAFQTLLYRYTGQEDICVGSPIANRNQTEIQGLIG FFVNTLVLRTCLSGNPSFLELLGGVRQVCIGAYANADIPFEQLVEELHQERNLNHTPL FQVMFALQEDTQKDLTLPGLTLSWLPIHSQTARFDLTLQVVDGEPELRGSLEYNTDLF NAETITRMAEHFRTMLSGIVANPQARLSDLPVLTAAQLHRQVVEWNDTSTNYPTDVCI HQLFEAQVERTPDAVAA" BASE COUNT 3042 a 2002 c 2200 g 2844 t ORIGIN 1 ctgatagttc aggttaatag caaattacaa gaagttttta aacagaattt atcaattgta 61 gaaatattcc aaaatccaac tattaaatca ctagctcaat atttcagtca aaaatcggag 121 gatgtgcctt ctatccaatc aatgcgcgat cgcgctcaaa agcagattca ggctatcaac 181 caccgccaaa aacaacaatt aagcaagcaa ggcaaaaaga tttatggcta gcgcaacaaa 241 tttggattcc ttggaaggca tcgctataat tggcatggca ggacgttttc ccggtgccaa 301 aaatattgag cagttttggc ataatttacg atctgggata gagtcaattt ctgttttcac 361 agacgaggag ttaatatctt ctgggataga tcctgctgta gtgagtgatc ccaattatat 421 caaagtcagt gctgtcttag aagatacaga tttgtttgat gcttccttct 
tcggctttaa 481 ccatagagaa gctgaaataa cagacccgca acaccgtctc ttcttagagt gtgcgtggga 541 agcacttgaa aatgctggtt atgactccac tcggtgtgaa agtcggattg gagtgtatgc 601 gggtgccagc ctaaataatt atttatcatt taacttaaac catgaccaaa tcgggtcagc 661 aattaccttt caaaagttaa ttggcagtga taaagatttc ctcaccactc gtgtctctta 721 caaactaaat ctcactggac caagttttac agttcaaacg gcttgttcta cctcattagt 781 tgcaacaaca cttgcttgcc aaagtttggt gaattaccaa tgtgatatgg cattggtagg 841 tggagtttcc attcgggtgc cccaaaaaac aggatatttg tatcaagaag gaggaatatt 901 atcgcctgat ggtcactgtc acgcctttga tgctatggca agggggacaa ttgttggcaa 961 tggtgtggga gtggtggttt taaagcggct tgcagatgct ctagctgacc gtgatagcat 1021 ccatgcagtc attaaaggtt cagcaatcaa caatgacggt tcacttaaag ttggctacac 1081 agcacccagc gtggatggtc aagccgaggc tattgcagag gctcaagctc tagctggaat 1141 tgaacctgag acagtctcct atattgaagc tcacggaacc ggaacatctc tgggagatcc 1201 tatagaaatc gcggcgctga caaaagtttt tcgtgccagc acacagaaaa agggattttg 1261 tgcgatcggt tcagtgaaaa ctaatgttgg tcatctggat gcagcggctg ggataacaag 1321 tctcatcaaa actgttttag ccctcaaaca caaacaaata ccacctagtt tgcatttcga 1381 gaaacccaac ccccagattg actttgctaa cagtcctttc tacgtcaata ctaaactttc 1441 agaatggaaa acgaatggca ctccccgacg tgcaggagtt agttcctttg gtattggtgg 1501 aactaatgcc catgtcattc ttgaagaagc cccagttgta gaactctcgg atccgatagt 1561 tctaaaatct cgtccttggc aggtgttaat gctatctgcc aagacaagtt cagcactgga 1621 aacagctacg gcaaatctag ctaattatct tcaacagcat cccgatttaa acctcgcgga 1681 tgtagcttac acgttgcaga ttgggagaca gggtttcgag catcggcgta cggttgtttg 1741 ccgttcaatt gaagatgctc ttgatgcact tgttgatcca aagcgagttc tcactggtat 1801 tcaggaaact caggagcgcc ctgttgcctt tatgtttcct ggacagggcg ctcagtatgt 1861 ggatatgggc aaagaacttt accagagtga gccgattttt cgggatcagg tcgatttgtg 1921 ttgtcaattg ctccaacctc atttgggatt ggatttgcga tcgctcattt atcccaacga 1981 gtctgagtca aaagtcgcag cagagaaact acaacaaact gacatcactc agccagcgtt 2041 gtttgtcatt gaatatgctt tggctcagtt gtggatgtcg tgggggattt ctccaggtgc 2101 aatgattggt cacagtattg gggaatatgt cgccgcttgt ttggctggtg tgatgtctgt 2161 
tgctgatgct ttagcgctgg tggctgctcg tgggcgactg atgcaacaac ttccatctgg 2221 tgccatgctg tctgtaccat tgccagaaga agaagttaga gcgctactgg atgaaaaatt 2281 atccttagct gcctgtaatg caccagcttt gtgtgtggtt tcaggaaccc atgatgccat 2341 agacgcattt caaaacaagc tccaaggtat agagtgtcgt cgtctgctta cttcccatgc 2401 ctttcattcc tcgatgatgg aacgcatctt ggagcgattc cagaaggaag ttagtaaagt 2461 caaattacat ccgccaaaaa tttcctttat ttctaacgtt acaggaactt ggattacagc 2521 atctcaagct acagatccta attactgggc gacacatcta cgctcctgtg ttcatttttc 2581 acaaggaatc tctgtactac tgcaagaacc aaatcgcatt ctgctggagg taggatcagg 2641 acgcactttg tgtaccttcg ctcttaagca ttcagatgcg gtagggctgt cttcattacc 2701 gcatcctaaa gaaaaagact cagatgtagc gttcttaatg aacacattag gcaaagtttg 2761 gctatctgga gtccaaatag attggtctag attttatgct catcagcgtc gctatcgcat 2821 ccccttacca acatatccct ttgagcgtca gcgttactgg attgaatcgc aaaaaaatac 2881 acgcgatgtc aatctgagcc aaacagcatt agaacaaaag ctcgacatta aagactggtt 2941 ttacattcct tcctggaaac gttccgtacc acctatatct tttgaaacta gaagattaac 3001 agttgaaaag caatgctggc tggtttttgt cgatacatgc ggcataggaa cccaaatctt 3061 agaaaaacta aaacgtgaga atcaaaatgt tataactgtc aaagttggag agcaattttg 3121 tcacagcggt gagtgcgaat ataccattaa tcctcaaaac aaaaatgact atgatgcatt 3181 actaaaagcc attcgtaact taggtcaaat ccccacaatc attgctcatt tatggaacat 3241 tacacccagt gagtatatat catcgagact tgaaagttgt gaaaaagctc aagatatcgg 3301 cttctggagt ttggtgtttc tggcgcaagc acttggagaa cagaatattt ctgactctat 3361 ccaaattgat gttgtctcca acaatatgca gcagttgctt gatgaagatg agttgtgtcc 3421 agaaaaagcg actattctag gaccatgtaa agtcatttct caggaatatt caaacattac 3481 ttgccgtagt attgatatta agcttcccca atcaggtact aggcaatggg aacaactgat 3541 aaaccatctt ttgacagaac ttgcagcctc aacatctgag caagtcatcg catatcgtgg 3601 taatcagcga tgggtacagt gttttgaagc gttaccaata gaaagtcaaa ccagtaccac 3661 agccaggtta cgagaagaag gcgtatatat cataactggt ggattggggg aaataggact 3721 tatatttgca gaacacttgg caaagacagt gcaagcaaaa attgtactga ttgggcgttc 3781 gggattacct ccaaaagcag agtgggagca atggtcatca agtcatgacg atcaggatct 3841 gttgagtaca 
aaaatcaaaa aagttcagat tctagaggaa ttaggtgcag aagttttagt 3901 cctgacagca gatgttgcca accttgaaca aatgcaagct ctagttaatc aggtacgcga 3961 tcgctttggc gaaattcatg gagtgattca cgccgcagga gttccaggtg caggtttaat 4021 tcaactcaaa acaaccgaat tagcaacaaa tgttcttgaa cctaaagtga agggaacact 4081 tgtgttagat gctgtactac aagatattaa tttagatttt ctagtcttgt tttcctcaat 4141 cactgctacc gcaggtggat ttggtcaggt agattactgt gcagcgaatg cttttcttga 4201 cgccttcgct cattacaatt tttaccaaca gcaaattcca actgtctcta ttaactggga 4261 ttggtggcaa ggtaacaact gggcagattc attgatgtca gctgttccag aattccaagc 4321 tgagtttaag cagatgagag aaagatacgg tatcagcttt gtagaaggtg tagatgcgtt 4381 tagccgtatt ctatctacaa aactacctca agtcgttgtc tcaacacaaa acctacaaac 4441 cgtaattgat aaatttaaga gttttgcagc acccatttcc tcagaaaaat tagagacgtc 4501 tgagcaatcc aaaccaaaac acccaagacc tattttagga attgcctatg ttcctcccag 4561 cagtgatctt gagcaaaaga ttgctgatat ttggcaggaa ttgctgggta ttgagcaagt 4621 aggtattaat gataacttct ttgacttggg cggacactcc ctactagcta cccaactcgt 4681 ttctcaactt cgcaaagact tccaagtaga actatcttta cgtcacattt ttgaagcacc 4741 gactatagcc gagttggctt tgatgattga agacctgatt ttaagggagt tggaagaatt 4801 aacagaggat gaagctaatg tttacgccgt ccgagaatga agttgaaaat cttagtagca 4861 aagtgtggtc agttgaaatg atgaaaattc aagtagtcgg tgtagttgga gcaggtgtaa 4921 tgggaatcgg ggtagcgcaa aacctcgccc aaactagtca tcaagtgatt ctagtagata 4981 tttccgaaga gattctggat aaagcaaaaa aggaaatcaa gaataatatc cgtttccagg 5041 gttttttcaa taaaaacgag aaagcagaga atcctgacaa cattctgcat cgaattaagt 5101 tttctactaa ctataaattt cttgaatctg cagaatttgt gattgaaaat gctacggaaa 5161 agtgggatat taaaaaggag atatacgcgc agcttgatgc aatttgtccc ccggaaactg 5221 tatttgcagc caatacctct gccatttcca ttactcgcat tggttcagcc actaagcgtg 5281 ctgataagat tattggtatg catttcatga accccgtgcc aatgaagcca atggttgaaa 5341 tgattcgcgg gtatcatact tctgatgaaa caatttcaac cgccaaagaa ttgttggcgc 5401 agatgggtaa ggaaggcatc cttgtcaatg actcgccggg ttttgtttct aatcgtgtgc 5461 tgatgttaac gattaatgaa gccattttct tgatacaaga ccaagttgct tcagtgtcag 5521 aagtagatag aatttttaaa 
acctgcttcg gacataaaat gggaccgcta gaaactgctg 5581 acttgattgg attggatacg attctattct caattgaagt tttgtatgaa agcttcaacg 5641 acagcaaata cagaccctgt cccttgctga agaaaatggt agatgcagga ttgtatggtc 5701 gtaaaagtgg acaaggtttt tatacttaca atagagcaat ttgaaatcag aaattaagac 5761 gaatcttcta gctatccttg gtaaggagaa attgaaaata tgaaagaaat acaaccaaaa 5821 ataaaagaat tcctttcacg ttttttccgc aattacgact tacagccaga tgaagatatt 5881 tttgcacttg gctttgtcaa ttctatgttt gccatgcaac tcgtcttgtt tatggaacaa 5941 gaatttcaaa tcagtattga caatgaagac ctagagtttg ataacttcag gacgataaat 6001 gcgatgactc gtttgattga acgcaagaca gctttcgttg tacaaaaatg agtcaactta 6061 ataaaaaacg aagaacctca cccgcctccc caaccctctc cgcgagttcg gagagggttg 6121 gggaggggta acgcgaggac tgcaatgata aataatcatt cgaacttgat agaaaagata 6181 aaaattcaac ctatttaaac atttaaatag tacaatgaaa atagagttaa caactcaaca 6241 aaaagacgct caagctcaat ttagagcttt cgtagactcc gaggttatgc cttatgcgaa 6301 tcactacgac caagaagaat gtactcctcc aaagctcatt gaaaaattag ctcaaaaagg 6361 gtatttgggt gctattttac ccaaagtagt gggtggtata ggcatggact ttattaccta 6421 cggtcttctt aacgaagaaa ttggacgggg atgttcttca ctgcggagtt tgctcacagt 6481 tcactgcatg gttgctcatg ctgtttctaa atggggcaat aagtctcaaa aagagtattg 6541 gttgccaaag ttagcatctg gtgaagttat agctgctttt gccttaagcg aacctaatgt 6601 gggtagcgat gccaaaagta tagaaactac agcgacactt tctggtgact cttatgtctt 6661 aaatgggcag aagaaatgga ttacctatgg acagattgct gatgtctttt tggtgtttgc 6721 tcaatgtgca gggaaacctt ctgctttttt agttgaaaaa aacagtccag gactcttgat 6781 aaaacccatt tctggaatgt tgggtgttcg ggcttcaatg ttagcagaat tgcactttcg 6841 ggattgtcgg attccacagg aaaatgtggt aggtaagttg ggttttggtt tctcctatgt 6901 tgcatcctct gcactggatt atggaagata cagtgttgca tggggttgtg tgggtattgc 6961 tcaagcctgt ctagaagctt gtattcagta tacaagtcaa cgaaagcagt tcggagctta 7021 tttgaaagaa caccaattaa ttcgacaaat gatcactgag atgatagcca atgtaaaagc 7081 agcaagatta ctgtgctatc aagctggcta tctcaaagat atcagcgatc caagctcaat 7141 aattgagact tcaattgcca aatattttgc atccacaact gcaaccaaag ttgctaataa 7201 tgctgtacaa atccacggtg ctaatggttg 
caccaatgaa tattctgttg ccagatattt 7261 acgagatgcc aaaatcatgg aaattattga aggaagtacg caaatacaac agataaccat 7321 cgctgattac ggttatcagg agtatatgtc acaacctact cctgccgtaa tttctcaaaa 7381 cttactggca aggacgtgag aagatgatta gcacaaaagc tgaactgtca aattataaac 7441 aagccgataa aaaagctatt aaatgtgtag tttgggattt agataacaca atttgggatg 7501 gtgttttatt agaggatgac cacgttgagt tgcgctctca ggtagtcgat attatcaaaa 7561 cattggacag tcgaggcatt ttgcaatcta ttgccagtaa aaatgattac accagagcaa 7621 tggagcaact ccaagagttt ggtttacacg aatattttct gtatcctcaa attaattgga 7681 attccaaatc tagttcgatt caggaaattg ctaagtctct caatcttggt atcgatacat 7741 ttgcttttat agacgaccag ttatttgaac tcgaagaagt taatttttca tatccggaag 7801 ttctttgtat caacgctgct gaactagcgc atttactaga tatgccagaa atgaatcctc 7861 gttttattac agaggattca aaactgagaa ggttgatgta tattagtgat atagaacgaa 7921 acaatggcga aaaagaattt gttggcaccc aatcagaatt tttagctaca ctcaatatgt 7981 gttttactat gtcttccgct caagaagaag atttacaacg agccgaagaa ctaacagtca 8041 gaactaatca attaaataca actggttata catattccta tgatgaactg aatcatttcc 8101 gacagtcaga taagcataaa ctgctcattg ccagtttaga agataagtat ggcagttacg 8161 gtaaaattgg cttggctctt gtggaatgcc aagagtttgt gtggactata aaacttttgc 8221 tgatgtcttg tcgcgttatg tccagaggtg tcggcacaat tctgctgaat tatattatga 8281 cattggctaa aaacaacaag gttcgtttgc gttctgagtt tgtttcaaat aatcgcaatc 8341 gtatgatgta cgtatcttat aaatttgcag ggtttaagga aactgagaag aaaggagatt 8401 tgcaaatttt agaaaatgac ttaacgcgaa ttcaacccta tcctgagtat gtgaatatca 8461 aaattatgga ttagaaaaaa ttgctagttc ttgtagtttg atagggggga ttatggacaa 8521 caaaaaagat gaaattttga aacgacggtc aaagctttca cccatgcagc gggaacttct 8581 cgaaaagcgg ctgcggggtg aggttaactc tcactctcaa ttaaaggtta ttccaagacg 8641 ttctcaaaca agtcctgctc ccctatcttt tgctcaacag cggttgtggt ttctccacca 8701 gttagaacct agcagtcccc attatagtga actagcatgt ttacggttga caggtgcgct 8761 taaaatcgat gcactagagc aaagtcttaa cgatattgtg caacgtcatg aagctctacg 8821 tactactttt gaaatagtgg aggagcaagc agttcaggtc attcacccca ctataactgt 8881 agcgctacca gtggtagaat tgcatttgat gtcagaagca 
gtgcgacaag cccaaatcga 8941 gcagctaaca acagagatag ctcaaaaacc ctttgacttg gcatctggtt cgttgctacg 9001 agctatgctg ttacaaacag gcctgcagga acacgtgttg ctgttcgcaa tccaccacat 9061 tgctgttgat ggctggtcga taggagtgct gatccgggaa ttagtagcac tctacgaagc 9121 caagtcttgc gagaagacat ccccactgcc tgaacttccg attcagtatg cagactatgc 9181 tatctggcaa cgtcagtgga tgcaaggaga gctacaaaaa acgcagcttt cctattggaa 9241 acaacaattg gcaggcgctt caacgttggc tttgcccaca gatcgtccac gacccccagt 9301 ccaaagcttc cgaggtgcag ttacctgttt taagctatca ccaaccctaa ccaatacgct 9361 cagatcccta agtaatcggg agggggtaac tctgtttatg acgttgttgg cagcgtttca 9421 aaccctacta taccgttaca cagggcaaga agatatctgc gttggctccc caattgcgaa 9481 tcgcaaccaa accgagattc aagggttgat tgggtttttt gtcaataccc ttgtgctacg 9541 tacctgtctt tctggtaatc caagtttttt agaattactg ggtggagtgc gtcaggtgtg 9601 cataggtgcg tatgctaatg cagatatacc ttttgagcaa ctggtggaag aacttcacca 9661 agagagaaat ctcaatcaca cgcccttatt tcaggtcatg tttgccttgc aggaagacac 9721 tcagaaggat ttgacactgc ctggtttgac tctaagttgg ctccccatac acagccaaac 9781 tgccaggttt gatttgacct tgcaggttgt agacggcgag ccagaattga gggggtcgtt 9841 agaatataat actgacttgt tcaatgccga aaccattact cggatggcag agcatttccg 9901 tacaatgctc tcgggtattg ttgcaaatcc acaagcaaga ttatcagatt taccagtact 9961 gacagcagcc cagttacatc ggcaagtagt ggaatggaat gatacttcta ccaattaccc 10021 aacagatgta tgtattcatc agttgtttga ggctcaagta gaacgcacac ctgatgcagt 10081 cgcagcgg // LOCUS NODE_3333_length_10037_cov_4.427870 10037 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10037) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10037) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10037 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(148..807) /locus_tag="DP116_23560" CDS complement(148..807) /locus_tag="DP116_23560" /inference="COORDINATES: similar to AA 
sequence:RefSeq:WP_015199937.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF2232 domain-containing protein" /protein_id="PRJNA477356:DP116_23560" /translation="MVRQTKDSNKNSMKLEVPLRMVETAFLASTASLIWFINFYLPLG PVLRIFFPVPIALVYLRWGKRAAWMSAVTSGLLLLVLMGPVRSLLFVIPFAFMGVLLG STWNRRIPWIVSITLGALLGTIGVFFRIWLMSVLTSEDLWIYVINQVTDLIEWVVLKL GILVSPSVFWIQVGAIALIIFNNFIYLFTVHLAAWLLLDRLGNPIPRPPRWVQVIMDY E" gene complement(847..1545) /locus_tag="DP116_23565" CDS complement(847..1545) /locus_tag="DP116_23565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409873.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Crp/Fnr family transcriptional regulator" /protein_id="PRJNA477356:DP116_23565" /translation="MEDRYSVQALTNPLRYLAPFFQGLPETVVEQALTHLVTRTHPAN QVILLENDWGGSVYFIASGWVKIRTYNLEGKEVTLNILGQGELFGEMAALDEVPRSTD VITLTSTVISSMPAQDFVKLLQTEPMAGVRLAQLMARRLRQVNRRLRLRESDSQSRVA DTLLFLAEGQGKKGDEGIKIPNLPHRELSSLSGLARETVTRVLTRLEKKGLIKREQEV ICIPDLSGLEKMIV" gene 1985..6298 /locus_tag="DP116_23570" CDS 1985..6298 /locus_tag="DP116_23570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23570" /translation="MNVTSRATETQQLIAEIDRLLTNKRLPRLFSNQASELRQVLERI RDFLVNLSESSAQVQTPQELQSQQSPSLAKSVAQDDHQSLPQQQSLQQQENINVVVGQ ENNPFAVVLGPLQEEIKALLQERSNLAEEIRQLEQKRLHNYSLAQQLANQEQMISEFL QVLKSRIVSDLTPQTRETAGNSQMQHLVTSYQSNTESATYSTLPVVESQEQVERLSRL AKQLDQKLLALDGTVNVVFEALQGNIHTYHESLSQALARMHSKGVQGEQLLASFITNF TQQLQQQTSINQASVENVKDETPQLVESNELVSELPEVTPNNQALSLEAQQESTNASD LNAVVFQDREDAQNTVETPDKSAVHHNSSQFIRDQVDQLYASLFGVDVTDVATEDKVT DITDEFFDQPTATVSAPSEVTIITDELSTLTLSELTDELFEQSITTPAGDQVTDVTNQ LSAPDIDQFFEQQTTTTPVTTDDDTTVTDVTTTKVTNVIDELFDQPTLTSSADTSPPT VTHQPQSEFLYEITFEAPQVPTKPVTDVTNSSSVSNLSETSPDAATNQQQLDSIVEIP DPWLEELEADLVKLSTDDAKQTQDVSLETTEQSSIPQENILAELPPVEEYTQVVSFDS TSPVSPTTENIITVLTDLLADTNRESLAANIAPITSEAVETPPQNIAESTAGGIQNRD VEESSENNVPASPIENVLFPEENQSPEIVDISLEEAQLDQLEQDLASFDGEINALLQP LTQSENQEKAETEPNSSIIEPQAELEKSVVKEEIDGSSVSISGSTNNDARKSIWYLGI DLGTTGISAALLNRSTTEVYPLYWSAAQTQEEATSIKRSFRLPAEVYLPTASVTSTET ESSHPQDQIAPAAVAEEKVPENMATSSPSPQSSAATHNLFSAQLKPYLQIALPYKSEG QKWEPVLQLNEFSTVPLVWVVRSLSKLLLTFTEDRSSTTLGLSAAAVGLDQETFRRII NEITGVICTCRSNWSEQYRFNIREAILISKLVQHPQQVFFLEEGIACLLSELDGAEGE IVKITDSEGTRFAKSSDRPLVGNTFVLNIGAAATEMALVNLPENVEDLTHSDFMLHGF AYAGKELEQDIICQLLLPSKWRQPRTTSQEDTKTYTTNSKNWQPAILGLDQMSFSSLG LEELNLPRAGEPDVGERIRLQQRLESSLLGKAIIDAAIALKLILQHQESFTLELADQQ WVLQRRDLESQVFVPFVRRINREINRLLVAKGIPTEAINQAIFTGGVASVAAVSRWLR QKLPNAKIIQDLYLGENNSPNCSRVAYGLAVLPLHPQVLDVPRQQYTDYFLFTELLQI LPSRAISFSEVIQLFENRGINTRTCEQRLMAFLEGELPSGLIPTSTDAIWLTQGSCDN SNYKAITAAPLFEKQGSLTYRPNTEQLQAITRYLDAIKPSIQQSLEEPYTVNFALGIV D" gene 6692..7789 /locus_tag="DP116_23575" CDS 6692..7789 /locus_tag="DP116_23575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873334.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_23575" /translation="MDNTRKRIVIQGDMALLIRGLIIGKVLTLVVIGGLFWWLVRPRL LITSNINSSPSQNTNTTSSTRLAFGTTADIPVGSFKYGGSTAWAPIRQLIDSQIQNAH PELQLNYVKPTNGSPGSGSGIRMLLDGQLDFAQSSRPITVEESTTAKQRGFTLDQRQV GIDGIAVVVNPSLTLPGLTIDQLQQIYHGEITNWKQVDGPDLPITPFAQRPEDADTLI FVNNKSSGTSSNKDLNNQAFSSNVQYVHSTTQAVRRLSKTPGGLYYASARTLVPQCSV RLLPLGQTPTKFIPPYREPKVSTEECLHKRNQLNTKAIKNGSYPLTTNLFVIIKHNNG REQRAGEAYAKLLLTDQGQEAIEQAGFIRVR" gene complement(7865..8914) /locus_tag="DP116_23580" CDS complement(7865..8914) /locus_tag="DP116_23580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006623257.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_23580" /translation="MSKHAGISRFTYNWGLATWKALYESGYQPNHLTLKKFFNNEVKP VLTWIKEKGICQKITEFAFDNLGKAFKNFFQKRADYPKFKRKGRNESFTINAGGKPIN LGGKRIKLPTIGWVSTYESLPHTTTTKLTISKSAGDWYISCSYEISPEITKKEHEYVG VDVGIKTLATLSTGVIFFNPKAFKKAQKTLTRLQRQLSRKVRGSHRYKKQKLRISKLH RRIANIRKDATHKATTFICKNHAVVALEDLNTSGMLKNHRLAGAVSDANFYEFRRQVE YKVIRYGGTVVFVDRFYPSSKTCSNCGEIREISLSERVYVCQKCQHTQDRDLNASKNL QKYARMVKPCLDVKG" gene 9465..9962 /locus_tag="DP116_23585" CDS 9465..9962 /locus_tag="DP116_23585" /inference="COORDINATES: protein motif:HMM:PF13463.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="PRJNA477356:DP116_23585" /translation="MNKADTIRQVALDWKVTRPEIDPSPMVRVLAVLRSALELEKATQ KLFARYDLNTATFGVLATLRRSSPPEGMTLSQLAQFVLVTPASITNRVDRLEARGLVE RYDATNDRRCWLVRLTQKGYDLIDELIPQHVENERQLLSGLNEQEQEQLYLLLLKLLA SLEDD" BASE COUNT 3023 a 2222 c 1999 g 2793 t ORIGIN 1 gtgttccgtg ttccctgctg cacttaaatt caaatcaaaa ataattttga gagaagtctg 61 atcctttaga aaccttctta cccccttaca cccctaattg ttagaaacct tcttaccccc 121 ttacacccct acacccctag ttgactatca ttcataatcc atgatgactt gcacccagcg 181 tggagggcgg gggatcgggt tacccaggcg atccaacagc agccatgctg ctaagtgtac 241 tgtaaataag tagataaagt tgttgaagat aattaaggcg atcgccccca cttgaatcca 301 aaacacactg ggactcacca aaattcccag tttgagaaca acccactcaa tcaagtccgt 361 cacttggttg atgacataaa tccaaaggtc ttcactcgtc aacacagaca tcaaccatat 421 acgaaaaaag actcctatgg tgcctagtag tgcacccaaa gtgatggaaa caatccaagg 481 aatacgacga ttccatgttg atcccaaaag tacccccata aaggcaaagg ggatgacaaa 541 cagtagactg cggactggtc ccatcagcac caaaagcagc aatccagatg tgactgctga 601 catccatgcc gctcgtttgc cccaacgaag gtatactagg gcgatcggca cgggaaaaaa 661 tattcgtaat acaggtccta agggaaggta aaaattaata aaccaaatca agctagcagt 721 gcttgctaaa aatgccgttt ctaccattct taacggcact tccagcttca ttgagttttt 781 gttagagtct tttgtttgcc tgaccattaa tttcaaggga gttaagatga gaagaacgaa 841 gaacttttaa acgatcattt tttctaagcc tgacaaatcg ggaatacaaa tcacctcttg 901 ctctcgttta atcaatcctt ttttttcgag tcttgtcagg actcgtgtga cagtttcccg 961 tgctagtcca cttaaactac tcaattccct gtgaggtaaa ttgggaattt tgattccctc 1021 gtctcctttt tttccctgtc cctctgctaa aaacaataac gtatctgcta cccgcgattg 1081 actgtcagat tcccttaacc gcaatcgccg atttacttgc cgtaaacgac gtgccatcag 1141 ctgcgccagt cggactcccg ccattggttc tgtttgaagt aacttgacaa aatcttgtgc 1201 tggcatacta ctaatgaccg tggaagtcaa ggtaataaca tcagtggaac gaggcacttc 1261 atcaagagct gccatttcac caaataattc tccttggccg agaatattca gtgttacttc 1321 tttgccttcc agattataag tacgtatctt cacccatcca ctagcaataa aatatacgga 1381 tcccccccag tcattctcta acaaaatcac 
ctgattagct ggatgagtgc gggttaccag 1441 atgagtaagt gcttgttcta caacggtttc tggtaaccct tgaaaaaagg gtgccaagta 1501 ccttaaggga ttggtaagtg cttgcacgct ataccggtct tccattacga catcttgttg 1561 ataaaagctg taatatacac aaaaaataac gctactttta taagcctagg ctattaaagt 1621 acagccatta gaatctaaaa atagattatg ttacaaaaca aagcgagcca agcgtttttt 1681 atcaaaagta ttgcgttctc gcgctaagat ttcgccaaag tctcgcaata ataaatagtc 1741 tacagcttac agctccctgg ttgaagacta gcagcactgt aaacatcaat acttaaaact 1801 caaaactata ggcaacctaa ataaataata ctcatggtca ctaatttttg ctaccctata 1861 atttttagct gttttcagta aactacaata atatacattg ctggatgccc atctaatgcc 1921 taacatactg caacagttaa aaaaatttct atcgccacct gacttaaaat tctagtaatg 1981 gaaaattaac gtgacttccc gcgcaacaga aactcaacaa ctgattgcag agattgaccg 2041 cttactcacc aacaagcgct tacctaggct tttctctaat caagcatcag aactacggca 2101 agttttagaa cgaattcgcg actttttggt caatttatca gaaagttctg ctcaggttca 2161 aaccccccaa gagctacaat cacaacaatc tccctcacta gcaaaatctg ttgctcaaga 2221 tgatcaccaa tctttaccac agcagcaatc tttacagcaa caggaaaata ttaatgttgt 2281 ggtgggacaa gaaaacaatc catttgcagt agtgctggga ccgttgcaag aagaaatcaa 2341 agcactgttg caagagcgat caaatcttgc cgaggaaatc aggcaattag aacaaaagcg 2401 gctgcataac tactccttag cacagcaatt agcgaatcag gagcagatga tttccgaatt 2461 tttgcaagtg ctgaaaagcc gaattgtgtc tgatttgaca ccacaaacga gagaaactgc 2521 cggcaattct caaatgcagc atttggtgac aagttatcag agtaacacag aatctgcaac 2581 ttattcgact ttacccgtag tagagtcaca agagcaagtg gagcggttat caaggcttgc 2641 gaagcagtta gatcaaaagt tactcgccct tgatggaact gtgaatgttg tttttgaggc 2701 attacagggc aatattcaca cttatcatga gtctttgtcc caggcgctgg cgagaatgca 2761 cagcaaagga gtgcaagggg aacagttgtt ggcaagtttc atcaccaatt ttacacagca 2821 gttgcaacaa caaacttcca tcaaccaagc gtctgttgag aatgtcaaag atgaaacacc 2881 acagttagta gaatcaaacg agttagtttc agagttacct gaggtgacac caaataacca 2941 agcactcagc ttggaagcac aacaggagag tacgaatgca tcagatttga atgcagtcgt 3001 ttttcaggac agggaagacg cacaaaacac cgtagagact cctgacaaaa gtgctgtaca 3061 ccacaatagt tctcagttca ttcgtgatca agtagaccaa 
ctttatgcca gtttgtttgg 3121 tgttgacgtg acagatgtgg caacagaaga taaagtcacc gatatcacag atgagttctt 3181 tgatcagcca acagcaactg tatcagcacc cagtgaagtc acaattatca cagatgaatt 3241 atctactctc acattatctg aacttacaga tgagttattt gagcaaagca taaccacacc 3301 tgctggggat caagttacag atgtcacaaa tcaattatct gcacctgata tagatcagtt 3361 ttttgagcag caaacgacaa ccacacccgt gacaacagat gatgacacaa ctgtgacgga 3421 tgtgacaacc acgaaggtga caaatgtcat agatgagtta tttgatcaac caacactcac 3481 ttcatctgcc gacacttcac caccaactgt tactcatcaa ccacaatcag agtttctgta 3541 tgaaataact tttgaagcac cgcaagtgcc aacaaagccg gtgacagatg tcacaaattc 3601 ttcatcagtg tcgaatctct ctgaaacttc tccagacgct gcaacaaatc aacaacaact 3661 cgattctatt gttgagattc cagatccctg gttggaggaa ctagaagccg atcttgtgaa 3721 actcagtact gatgacgcca aacagacaca agatgtcagc ttggaaacca cagaacaatc 3781 atccattcct caggaaaata tattggcaga attaccgcct gtggaagagt atactcaggt 3841 tgtttctttt gacagtacta gcccagtgag tccaacaact gaaaatataa ttacagtgtt 3901 gactgattta ttggctgata caaaccgcga gtcactagca gcaaacatag caccgatcac 3961 ttccgaagcc gtagaaactc ccccacagaa tattgctgaa tcaacagccg gtggaattca 4021 aaatcgtgat gtagaagagt cttcagagaa caacgttcct gcttcaccga ttgagaatgt 4081 gctatttcca gaagaaaatc aatctccaga aatagttgat atttccttgg aagaagcgca 4141 gttggatcaa ttggaacaag atttagcgag ttttgatggg gagataaatg ctttgttgca 4201 gcctttgact caaagcgaaa atcaagaaaa agcagagact gagccaaact caagcataat 4261 agaaccacaa gcggaacttg aaaaaagtgt ggtaaaagaa gaaatcgacg gttcttctgt 4321 ctccatttct ggttccacaa ataacgacgc ccgtaaatct atttggtatt taggaattga 4381 tttgggtaca accggaattt ctgcggcttt gttaaatcgc tccacaactg aggtgtatcc 4441 tctatattgg tcagcagcac aaacccaaga ggaagcaact tctataaaac ggtcgtttcg 4501 tttaccagca gaagtctatc tgccaacagc ttcggtgaca agcactgaaa cagaaagttc 4561 acatccacaa gaccagatag cacctgcggc tgtggctgaa gaaaaagtac ctgagaatat 4621 ggctacgagt tctccatcac ctcagtcgag tgctgcaacg cataacttat tctcagcgca 4681 gttgaaaccc tatctgcaaa ttgctttacc ttacaagagt gagggacaaa aatgggaacc 4741 agttttgcaa ttaaacgaat tttctactgt tccattggtt tgggttgtgc 
gatcgctctc 4801 aaaattgctc ttaacgttca ctgaggatcg cagtagcacg acgttgggtt taagtgcggc 4861 tgctgttggt ctagaccaag aaaccttccg tcgtattatc aatgagatta ctggtgttat 4921 ttgcacttgc cgatccaact ggtcggaaca atatcgcttc aacattcgag aagctatact 4981 catcagcaaa ctcgtacagc atccgcaaca agtctttttt ctggaggaag ggattgcttg 5041 tttgctgtca gaacttgacg gcgctgaagg tgaaatcgtg aaaatcacag atagtgaggg 5101 gactcgtttt gcaaaaagca gcgatcgccc tcttgtgggt aatactttcg tccttaatat 5161 cggtgctgct gcaacagaaa tggcgttagt taatttgccg gaaaacgtgg aagacctcac 5221 ccatagtgat tttatgcttc acggttttgc ttatgctggc aaagaattag agcaagacat 5281 tatttgtcaa ttgctattac catctaaatg gcgacaacca cgcacgactt ctcaggaaga 5341 taccaaaact tataccacaa attccaagaa ttggcaacca gcaattcttg gtttagatca 5401 gatgtccttt tcgagtttag gcttggaaga attgaacctc cctcgggctg gggaaccaga 5461 tgttggagaa cgcattcgcc tacagcaacg gttagaaagt tcacttttag gaaaggcgat 5521 tattgatgcg gcgatcgctc tcaagctgat tttgcaacac caagaatctt ttaccctaga 5581 actagcagat cagcaatggg ttttgcagcg acgagactta gaaagccagg tatttgtccc 5641 atttgttcga cgcatcaacc gagaaatcaa tcggttactt gttgccaaag gcataccaac 5701 agaagcaata aatcaagcta ttttcacagg tggggttgct tctgttgcgg cggtgagtcg 5761 ttggctgcgg caaaaactgc ccaacgctaa gattatccaa gatttgtatc tgggtgaaaa 5821 taactctccc aattgcagtc gggttgcata tggtttagca gtacttcctc tgcatcctca 5881 agtcttagac gttcccagac aacagtacac cgactatttc ttgttcacag aattgctgca 5941 aattctacca tcaagagcca tatcctttag cgaagttatc cagttatttg agaatcgtgg 6001 cattaatacc cgcacttgtg agcaacggct gatggctttc ttggaaggtg agttaccttc 6061 tggcttgatc cccaccagca cagatgccat ttggctcacc caaggttcat gtgacaattc 6121 taactataag gcgataacag ccgcaccact ttttgaaaaa caaggaagtc tcacttatcg 6181 tcccaatacc gaacaactcc aggctattac tcgttatctc gacgccatca aacctagtat 6241 tcagcaatct cttgaggaac cttacacggt gaatttcgcc ttgggaatcg ttgattagca 6301 ttttgtcccc ttatccccag aggggacttt gaaaccctcc gttgtggcga ctggcgttca 6361 aacagttatc agtatttttt tgtatcattt atcgttcctg aaagcaaaat gtgataatct 6421 catatattta taggaaatca aagtagacta attaccaaca cctgtcgaaa gcatttacca 
6481 agtcaaactc tcggaacaaa aaagcatcat atacctctct cttgagagtg gtttttatca 6541 caaaaaatca tcattttcaa ctttcttaaa cttttttttc aatataaaaa atcttatttc 6601 tgtacagatt taggtgaaaa atactaatgt ataaataaca gccataatca tactgaaata 6661 tttacaacga ttaggggtca atcaaaaaat tatggacaat acccgaaaaa gaatcgtcat 6721 tcaaggagac atggcactcc ttatcagagg cttaattatt ggcaaagtgt tgacacttgt 6781 ggttattggg ggactgttct ggtggctagt gcggccacgc ttgttgatta ccagcaacat 6841 taactcttcg cctagtcaaa atacaaacac aacctctagt actagattag cctttgggac 6901 aactgctgat attcctgtcg gttcattcaa atacggtggc agtacagcgt gggcacctat 6961 tcgacaatta atagattctc aaatccaaaa tgctcatcca gaattgcagt taaattacgt 7021 aaaacccact aacggcagtc ctggttctgg ttcaggcatt cgtatgttgc ttgacgggca 7081 attggacttc gcccagtctt ctcgtccgat aacagttgaa gaatccacta cagccaagca 7141 gcgaggtttc acacttgatc aacgtcaagt tggtattgat ggaatagcag tggttgttaa 7201 cccatcactc acgctgcccg gtttaacaat tgaccaattg caacaaattt atcacggtga 7261 aattactaac tggaagcaag tagatggacc agacctaccc atcactcctt ttgcccaacg 7321 tccagaggac gcagatacac tcatatttgt aaacaacaaa tcaagcggca catcaagcaa 7381 caaagactta aacaatcaag cgtttagctc taacgtccag tatgtccact ctactacaca 7441 agcggtgcgt cggctcagta aaaccccggg cggtttgtat tacgcttctg ctcgtacact 7501 agtccctcaa tgtagtgtga ggcttttgcc acttggtcag actcctacta agttcatccc 7561 cccctaccgt gaaccaaaag tgtcaaccga ggagtgccta cacaagcgca atcagctcaa 7621 tactaaagca atcaaaaatg gcagctatcc actcaccaca aacctgtttg tgattattaa 7681 gcataacaac ggtcgggaac agcgggcggg agaagcctat gccaaacttt tactcaccga 7741 ccaagggcaa gaggctattg agcaggctgg atttattaga gttcgttaaa tttgtattca 7801 ctgctacata aaatctggac aattccagac atgacttgac gtttcggcgg gagcatggga 7861 gcaattatcc cttaacgtcc aggcacggtt taaccatccg tgcatatttt tggagatttt 7921 tagatgcatt taaatctcgg tcttgggtgt gctgacattt ctgacacaca taaacccgct 7981 ctgatagact aatttctcga atttctccac aatttgaaca agttttacta gatgggtaaa 8041 atctatccac aaatacgaca gttccaccgt atctaatgac tttatattca acctgtctac 8101 gaaattcata aaaattagca tcgctgacag cacccgctaa ccgatgattc ttcaacatcc 8161 
ccgaagtatt taaatcttct aaagcaacta ctgcgtggtt tttgcagata aatgtagtcg 8221 ccttgtgagt ggcatcttta cgtatattag caatccgtcg atgcagttta gatattctca 8281 gtttttgttt tttataacga tgactacctc tcaccttcct cgataattgt cgttgtaacc 8341 ttgttaaggt tttttgggct tttttgaaag cttttgggtt gaaaaaaatc actcctgttg 8401 ataaagttgc taaagttttt atcccgacat caaccccgac atattcatgt tcttttttgg 8461 tgatttctgg tgagatttca taagaacaag aaatgtacca atctccggct gatttcgata 8521 tagtcagctt tgtggttgtt gtgtgtggta aggattcata agttgatacc cagccaatgg 8581 ttggtaattt tatgcgttta ccacccaagt ttattggttt accaccagca ttaattgtaa 8641 aactttcatt tctccctttc cttttaaact tgggataatc tgcacgtttt tgaaagaagt 8701 ttttaaaggc ttttcctaaa ttatcaaatg caaattctgt gattttttga caaatccctt 8761 tttctttaat ccaggtgaga actggtttga cttcattatt aaaaaatttc tttaaagtca 8821 aatgatttgg ttgatatcct gactcataca aagctttcca tgttgccagt ccccagttat 8881 atgtaaatcg ggatatgcca gcatgttttg acatcagaat tttttgggat gatgtcaatt 8941 ttagtttggt tttgatcgag gtcacgcatt tgtcccctag aaaatgagcc agccctggtg 9001 ttggttgatc tgtcggcgaa gttgggtact cgttgtaata gggctggctc attttcttcg 9061 tgtaagacct cgtcagcaac acaaatctac tctgcatgag catggtatgt ttcaaatcat 9121 cacataactt atggaggtag gtctagtcgg ttgatgtaaa ttttcttaaa ataggtagac 9181 ggttttgtga ggcggtttgt cagtaaatgt cgggctttgt ccagttaata gtctactagg 9241 acattgctct tcaagaataa ctactattga gtcttaaagt atagtactaa atatttttac 9301 gttaaactat ttaattgttg aaatacacat aaagtaatat ttagcgctaa aatattaatg 9361 gttgaactcg ccgctctact catgctttta agttgctaga gtattgtttt tgggcagttg 9421 aacattcctg gtcactcagt cgtaggcaac tctcaaagtt aaagatgaac aaagcagata 9481 cgattagaca agtagcgctc gactggaaag tgacgcgacc ggaaattgat ccaagcccga 9541 tggtacgagt actggcagtg cttcgttcag ccttagaact cgaaaaggcg acacaaaagc 9601 tttttgcccg gtatgacttg aatacagcaa cgtttggagt cctcgcaaca ctgcgccgtt 9661 cttcgccgcc tgaggggatg accctttctc aattagctca atttgtccta gtcacaccag 9721 cttccatcac caatcgcgtt gatcgtttgg aagcacgggg tttagtcgag cggtatgatg 9781 ccacaaatga tcgacggtgt tggttagtgc gtttgactca gaagggatat gacctcattg 9841 atgaactgat 
tccgcaacac gtagagaatg agcgccagtt gctttctggt ttaaatgagc 9901 aggaacagga gcaactttat ttgctgctac ttaagctact ggcaagtcta gaggatgact 9961 aaaggataga acacagcaag cactttatag tgggtgagat gaaaaccctg tggctttagc 10021 ccagggacgc cacatgc // LOCUS NODE_3340_length_10024_cov_4.349885 10024 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10024) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10024) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10024 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..173) /locus_tag="DP116_23590" CDS complement(<1..173) /locus_tag="DP116_23590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319887.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbon dioxide transporter" /protein_id="PRJNA477356:DP116_23590" /translation="MVNIKDKPVNYPLFEYIERLESGGALLPDTEENLIEVVGILKSY GIVLDAYSKNLIYV" gene complement(260..1762) /locus_tag="DP116_23595" CDS complement(260..1762) /locus_tag="DP116_23595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194883.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit M" /protein_id="PRJNA477356:DP116_23595" /translation="MLSILLLVPLLGAALIGFLPFGMTPKLSRKVALVVASIAFLWTV VISSQFNPGETNQQFTEFLPWIDALGLNYYLGIDGLSLPLLLLNGLLTWIAIDSTDEN IARPRFYYSLLLLLNAGVTGAFMAQDLLLFFLFYEIELIPLYFLIAIWGGERRGYAAT KFLIYTAVSGIMILASFLGMVWLSGSSSFALDALNTSVLSVETQVLLLLGILIGFGIK IPLVPLHTWLPDAHVEASTPISVLLAGVLLKLGTYGLLRFGMNLLPNGWSYVAPVLAT WAVISVLYGASCAIAQNDMKKMVAYSSVGHMGYVLLAAAAATPLSVLGCIMQMISHGL ISALLFLLVGIVYKKTGTRDLGVLQGLLNPERGMPVIGSLMVLGVMASAGIPGMVGFI SEFIIFRGSFAVFPVQTLLSMIGTGLTAVYFLILMNRAFFGRLSPQVVNLPRVSWGDR IPAVVLAVLIVIFGVQPNWLVRWTEPTMTTMVNIPTPVATVSFVKEKTKT" gene complement(1802..3658) /locus_tag="DP116_23600" CDS complement(1802..3658) /locus_tag="DP116_23600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141171.1" /note="Catalyzes the transfer of electrons from NADH to ubiquinone; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit F" /protein_id="PRJNA477356:DP116_23600" /translation="MNQILFSTSWCIPFYGLIGALLTLPWAIGLIRRTGPRPAAYVNW LATFLAFIHSLFVFVDIWNTEPQSFLFTWFKAADFELSFALEISPISVGTTVFITGLS LLALTYALGYMEKDWAIARFFGLMGFFEAALSGLAISDSLFLSYALLEMLTLSTYLLV GFWYAQPLVVTAARDAFWTKRVGDLLLLMSVVTLSTIAGSLNFSDLYEWAQTADLNST TSTLLGLGLIAGPAGKCAQFPLHLWLDEAMEGPNPASVMRNSLVVAGGAYVLFKLQPI LALSPVALNALIVMGTVTAIGASLVALAQIDIKRAMSHSTSAYMGLAFLAVGMQQGGV ALMLLLTHAIAKALLFMSSGAVIFTTNTQDLTEMGGLWSRMPATTTAFVVGSAGMVTL LPLGSFWAMLGWADGLALVSPWVVGVLVVVNSLTALNLTRLFRLIFWGQPQPKTRRTP EVGWQMALPMVTLTIMTLLLPLMLQQWYLLPDWESINWIVVTLLVSSGALGIGVGSNI YLHKAWSRSRVLTWRFLQDLLGYDFYIDRLYHVTVVSIVAVLSKISAWSDRYLVDGFI NLIGFAAIFSGQTLKYSVSGRSQGYLLTILVGISLLGFLISWSLGLLNTLPF" gene 4291..4599 /locus_tag="DP116_23605" CDS 4291..4599 /locus_tag="DP116_23605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194885.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbon dioxide-concentrating mechanism protein CcmK" /protein_id="PRJNA477356:DP116_23605" /translation="MPIAVGMIETKGFPAVVEAADAMVKAARVTLVGYEKIGSARVTV IVRGDVSEVQASVAAGVEAARRVNGGEVLSTHIIARPHENLEYVLPIRYTEAVEQFRT " gene 4788..5135 /locus_tag="DP116_23610" CDS 4788..5135 /locus_tag="DP116_23610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="BMC domain-containing protein" /protein_id="PRJNA477356:DP116_23610" /translation="MSIAVGMVETLGFPAVVEAADAMVKAARVTLVGYEKIGSGRVTV IVRGDVSEVQASVAAGVESVKRVNGGQVLSTHIIARPHENLEYVLPIRYTEDVEQFRE NVNAIRPFGNRRP" gene 5142..5444 /locus_tag="DP116_23615" CDS 5142..5444 /locus_tag="DP116_23615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747341.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbon dioxide concentrating mechanism protein CcmL" /protein_id="PRJNA477356:DP116_23615" /translation="MQIAKVRGTVVSTQKDPSLRGVKLLLLQLVDEEGQILPEYEVAA DSVGAGVDEWVLISRGSAARQVLGNEQRPVDAAVVAIIDTVHVQDRVIYSKKDQYR" gene 5958..7625 /locus_tag="DP116_23620" CDS 5958..7625 /locus_tag="DP116_23620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbon dioxide-concentrating mechanism protein CcmM" /protein_id="PRJNA477356:DP116_23620" /translation="MAVRSTAAPPTPWSKSLAEPNIHETAFVHPFSNIIGDVFVGANV IVAPGTSIRADEGTPFHISENTNIQDGVVIHGLDQGRVIGDDQNEYSVWIGTNASITH MALIHGPAYVGDNSFIGFRSTVFNARVGEGCIVMMHALIQDVEIPPGKYVPSGAIITS QQQADRLPDVQARDKQFAHHVVGINQALRSGYICAADSKCISTVRKELTKSYTSNNGF ERSNDVARSSLGVETVDQVRYLLDQGYKIGTEHVDQRRFRTGSWQSCTPIQARSVGEA IAALESCLADHSGDYVRLFGIDPKGKRRVLETIIQRPDGVVNAPANFKAPTTNQTNKS YSDNGYSNGSGSGKLSAETLDQVQQLLAAGYKIGTEHVDERRFRTGSWQSCQPIEVTS TQEVVSALEECMENHQGEYVRLIGIDRKAKRRVLESIIQRPNDPVGSSSSSKSTASAP TSEVPVNYSARSAGTATSTRLSSEVVEQLQQLINGGYNISVEHVDQRRFRTGSWSSAG QIQTRSAQQAAAALENYLNQYQGEYVRLIGIEPKAKRRVLETIIQRP" assembly_gap 7705..7714 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 7894..8622 /locus_tag="DP116_23625" CDS 7894..8622 /locus_tag="DP116_23625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874098.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="PRJNA477356:DP116_23625" /translation="MSVPPLRPSYDFDSYISGEVIIHPSAVLAPGVILQAAPNSKIII GSGVCVGMGSILQVHEGTLEVEAGANLGAGLLMVGKGKIGANACIGAATTIFNCSVEP GQVVPAGSVLGDTSRRISESPTQSEQTTQSESSTTNPTSSSTQSENGTGRQKGQGDKQ IGEQGDKQESSAISPSSSRQSPQVGEPAHGAASSSSLSPPVSTQDSQISSYIYGQESI QKLLVTLFPHKQSLNKPISDGEPE" gene 8688..9458 /locus_tag="DP116_23630" CDS 8688..9458 /locus_tag="DP116_23630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874099.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="microcompartment protein" /protein_id="PRJNA477356:DP116_23630" /translation="MEAYSQRSISNIQTSRRRDALSESALGLVSTRSFPAMVGTADMM LKTAGVHLVGYEKIGSGHCTAIVRGGIADVRLAVESGVQTAEQFGQLVSSLVIPRPYP NLEVILPINSRLSALSQDSYNRLSNQAIGLVETRGFPALVGAADAMLKAADVQLAAYE KIGAGLCTAIIRGSVANVAMAVEAGMYEAERIGELNAVMVIPRPLEELEQTLPVASCW IEERQPLKLPVNIKEQVAETELVQLPDLSRLPLKMTEE" BASE COUNT 3001 a 2311 c 2258 g 2444 t 10 others ORIGIN 1 acataaatga gatttttaga gtaggcatct aaaacgatgc cgtaactttt cagaataccg 61 acgacttcta tcagattttc ttcggtatca ggaagtaatg cccctcctga ctctaatcgc 121 tcgatatatt caaataatgg atagttaaca ggtttatctt ttatgtttac cattgctata 181 tctcctactt aaaatacata ttaaaagtta ggtttgtagt atgcatttgg gcacatctta 241 caaacctaag aacagtcttt taagttttgg ttttttcctt aacaaacgat accgttgcca 301 ctggggttgg tatattaacc atcgtcgtca ttgtcggttc agtccaacgc acaagccaat 361 tgggttggac tccaaaaatg acaataagca cagctaaaac tacagcgggg atacgatcgc 421 cccaagacac gcgtggcaaa ttgacaactt gtggagacaa gcgcccaaaa aaggcacgat 481 tcatcagaat taagaagtaa acagcagtta agccagtacc tatcatagac aacaaagttt 541 gcactgggaa aactgcgaaa ctgccgcgaa aaatgataaa ttctgaaata aatcccacca 601 ttcccggtat accagcgcta gccatcactc ccaaaaccat caagcttcca atcacaggca 661 tacctcgttc tgggttgaga agtccctgaa gaactcctaa atcacgggta cctgtttttt 721 tatacacaat ccctaccagc aaaaacagca 
gggctgaaat caagccgtga ctaatcattt 781 gcatgatgca tcccaagaca ctcaaaggtg ttgcagctgc ggcggctaaa agcacatagc 841 ccatgtgtcc aacggaactg tatgctacca tctttttcat gtcgttttgg gcgatcgcac 901 acgatgcacc atacagcacg ctgataactg cccacgttgc taaaacgggg gctacataac 961 tccaaccatt cggcaacaag ttcatcccaa accggagtaa gccgtaagtt cccaacttca 1021 acagtacccc agctaataat acagaaattg gtgtggaagc ctcaacgtga gcatctggca 1081 accaagtatg tagaggaact aagggaattt tgatgccaaa accaattaat attcccagca 1141 agagtaagac ttgtgtctct acagacaaaa cgctagtatt caaggcatcc aaagcaaagc 1201 tagaagaacc actcagccaa accatgccca gaaaacttgc taaaatcatt atcccggaaa 1261 cagcggtgta aatcaaaaat ttagtcgctg catagccgcg ccgttctcca ccccaaatcg 1321 caatcagaaa atacagggga atcagttcga tctcgtaaaa caagaaaaat agcagtaaat 1381 cctgtgccat gaaggctcca gtcacgccag cgtttaacag caggagcaaa ctgtaataaa 1441 atcgaggacg cgcaatattt tcgtcagtgc tgtcaatggc gatccaagtc aacaagccat 1501 ttaatagcag caaaggtaat gacagaccat ctattccaag ataataattt aatcctaacg 1561 catctatcca aggtaggaat tcggtaaatt gttgattcgt ctccccagga ttaaactggc 1621 ttgaaatgac tacagtccac aagaaagcga tactcgcaac aaccaaagcg actttacgag 1681 aaagttttgg tgtcatgcca aaaggcaaga aaccgatcaa agccgcaccg agcaacggca 1741 ccaaaagcaa aatactgagc ataggaaaca agtcggtaag gtgaaggggg ggacaagggg 1801 actaaaatgg caatgtattc agcaagccta atgaccaact gatgagaaaa cctaggaggc 1861 taatgcctac aaggatggtc aataggtatc cttgagagcg accagaaacg ctatacttta 1921 aggtttgtcc actaaaaatt gctgcaaacc caattaagtt aatgaaacca tctactaggt 1981 agcgatcgct ccatgctgaa attttagaaa gcaccgcgac tatactaacg actgtcacat 2041 gataaagtcg gtcaatgtaa aagtcataac ccaacaagtc ttgcaaaaat ctccaagtaa 2101 gaactctaga tctcgaccaa gctttgtgca gatagatgtt agaccctacc cctatcccca 2161 gcgctccaga ggatactaac aaggtcacga caatccaatt tatactttcc caatccggta 2221 agagatacca ttgttgtagc atcaggggta ataacaaggt cattatcgtc agcgttacca 2281 ttggcaatgc catctgccaa ccgacttctg gagtgcgacg ggtctttggt tgtggctgcc 2341 cccaaaatat taagcgaaat aggcgtgtca aatttaatgc cgtcaaacta ttgaccacca 2401 ctaacacacc aacaacccaa gggctaacca aagccaagcc 
atcagcccat cctaacatcg 2461 cccaaaaact gcccagcggt aataatgtca ccattccagc agaaccaaca acaaaagctg 2521 tggtggttgc tggcatcctt gaccacagac cccccatttc tgttaaatct tgagtattgg 2581 tagtaaatat tactgctcca gaactcataa ataagagtgc cttggcaatg gcatgagtta 2641 gcagcagcat caaagcaact cctccttgtt gcatcccaac tgccaaaaac gctaacccca 2701 tgtatgcact tgtggagtga gacattgctc gtttgatgtc aatttgagcc agcgccacta 2761 acgatgcacc aattgcggtg actgtaccca tgacgattaa agcatttaat gcgactggcg 2821 acagcgctaa aattggttgt agtttaaaca acacatacgc cccaccagcc accaccaagg 2881 agtttcgcat cactgaagct gggttgggac cttccatcgc ctcatctaac cacaaatgta 2941 gcggaaattg agcgcattta cctgcaggtc ctgcaatcag ccctaaacct aataaagttg 3001 atgttgttga atttaaatct gccgtctgcg cccattcata caagtcagaa aagttcaagc 3061 tacctgctat ggtggaaagc gtcacaactg acatcagcag caacaagtct cccacgcgtt 3121 tagtccaaaa tgcatctcgt gctgctgtga cgactaacgg ttgagcatac cagaaaccca 3181 ccagcaagta agtcgaaaga gtcaacattt caagaagggc atagctgaga aataaagagt 3241 cactgatagc taagccactc agcgctgctt caaaaaaccc catcagccca aagaaacggg 3301 caatagccca gtccttttcc atgtaaccaa gggcgtaagt tagtgccagt aaacttaatc 3361 ctgtaatgaa aactgttgtc ccaacactaa ttggcgaaat ttctaaagca aaggatagct 3421 caaaatccgc agctttgaac caggtaaaca aaaaactttg tggttctgta ttccaaatat 3481 caacgaatac gaacaggctg tggataaaag ccaaaaaggt tgccaaccaa ttgacataag 3541 cagctggtct tggtcctgtg cgccgaatta atcctattgc ccacggtaaa gtcaacaggg 3601 cacctattaa cccataaaaa ggtatacacc aacttgttga aaagagaatc tgattcattc 3661 aaatgttcct ttacagcttt gagcaagaat tcctttttca gtttatgaaa tgattgatcc 3721 tagtttctgc ttttaacaat tcaaatttta ttatttagaa ttattctttc attgtttgta 3781 tcgaaaacac aaacaaaaat gtcgtatatt tcatcatata ttttttctta cccccttact 3841 tgtctttctt ttgacagcaa taatttataa ctaaaagatt ctagtttata aggaatataa 3901 taaaaaatta tatactttat tctgattgtc ttcacaaaga gtaacagcca gggaagaact 3961 gtcctaacac ttcctaagtt ttccgtcatt cttcttgagg cagaatttcc acctttacga 4021 acctgtttca gcagatagta taaaaaaatt ataaaacaga agtgtctgct ctaggaatct 4081 gtgaagttcg gttgagcaga attgtcgttc tcaagtaata ttaatttttt 
ttaattaagt 4141 ttattataaa ctattatggt aatttatatt gtcagggacg ccaagctttt ctatgcttaa 4201 tttggaaagc ttttaaagct caagatgagc atccccgtaa aaaattttcc acctgacagt 4261 atattgcact aaatttgtgg gagttctgcg atgccaattg cggttggaat gattgaaaca 4321 aaagggtttc cggcagtcgt tgaagctgct gatgcgatgg tgaaagccgc tcgcgttact 4381 ctagtagggt atgaaaaaat cggtagtgct cgggtgactg tgattgtccg aggagacgtt 4441 tctgaagtac aagcttcagt tgctgctgga gtggaagcag ctagaagagt caatggcggt 4501 gaggtgcttt ccactcacat cattgctcgt cctcacgaaa acctggaata cgtactgcct 4561 attcgttaca ccgaagccgt ggagcagttc cgcacttagc aattttggat tttcgctctt 4621 cagtgctggg ctaaaattaa gcggcagaaa gcaactgcaa aaaatcaagc tttcagccaa 4681 cagcccaagt gctgacggct tgaaaaatct gaaattgttg tcggggttgc cgtattgggc 4741 tacgacgtcg gctgtattca aatcttaaaa acaaaggatt aaaacttatg tcaattgcag 4801 taggaatggt ggaaactttg gggtttccgg cggtagtaga agcagcagat gcgatggtga 4861 aagcagctcg cgtgacttta gtaggctacg aaaaaattgg tagcggtcgc gtaaccgtga 4921 ttgtgcgggg agacgtttct gaagtacaag cttcagttgc tgctggagtt gaatcagtca 4981 agcgagtcaa tggtggacaa gtgctgtcta cccacatcat tgctcgtcct cacgaaaacc 5041 tggaatatgt gctgccaatt cgttatacag aagacgtaga acagttccgg gagaatgtga 5101 acgcaattcg tcccttcggc aacagaagac cataatctgt catgcaaatt gctaaagttc 5161 gcggcacagt agttagcact caaaaagacc caagtctcag aggtgtcaag ctactgctgt 5221 tgcaattagt agatgaagaa ggacaaattt tgccagaata cgaggtggca gctgatagtg 5281 tgggtgcggg agtagatgag tgggtactta ttagtcgcgg tagtgccgcc cgtcaagttc 5341 tgggtaacga acaacgtcca gtagacgcag cagtggtggc cataattgat acagtccacg 5401 ttcaagaccg tgtgatttac agcaaaaaag accagtatag atagtcatta ggaggcagtg 5461 cggtggacgg gttccccggc aggagccagt gcggtcttgg ggtctcccca agtggagcac 5521 ctggcgtcat ctgccgttca ttagtgagcc agcaggatct tgagggaaac ccccatgagc 5581 gactggcgct cttgggtcag acgaaggcag gcatacccgt aagggtcatt aggagccagt 5641 gcggacgcca catgcctctg tcgggagacc ctcctgcagc agtggctcgt cttggggtct 5701 cacgccacat gctacaagtc ggcacagccg acggcagatg ctacccttcg ggaagccgcc 5761 cttcgggcgt ctacaagccg ggaaaccctt tcggcagttc ctcatggggg aaacccccaa 
5821 gaccggactg cctcaccaac gcactgcctc cccaacgcag tggctcccca agtagggcat 5881 ctggcgttca ttagctgtaa acaaaggact tttgacaaag gacttttgac aaaggagaac 5941 agaggaggaa tctagcgatg gcagtccgca gcacggcggc acccccaacc ccgtggtcaa 6001 aaagtttagc tgagccaaac atccatgaaa cagcgtttgt gcatcccttt tctaatatta 6061 ttggggatgt atttgtaggc gcaaatgtaa tcgttgctcc gggaacttcg attagagcgg 6121 atgaaggcac accctttcac ataagtgaaa acaccaacat tcaagatggt gtcgtgattc 6181 atgggttaga ccaaggtcga gtcattggtg atgaccaaaa tgagtactct gtatggattg 6241 gcaccaatgc atccattact cacatggcgc tcattcatgg accagcttat gttggggata 6301 attccttcat tggctttcgc tccacagtgt tcaacgccag agtcggagaa ggttgcatcg 6361 taatgatgca tgctctgatt caagatgtgg aaatacctcc gggaaaatac gtgccttctg 6421 gagcaataat tacgagtcag cagcaagctg atcgcttacc agatgtgcag gcacgcgata 6481 aacaatttgc tcatcatgtg gttgggatta atcaggcctt gcgctctggt tacatctgtg 6541 cagcggatag taagtgtata agtactgttc ggaaggaatt gactaaatct tatacaagca 6601 ataatgggtt tgaaaggagt aatgacgtgg caagaagtag cttgggtgtt gaaacagtcg 6661 atcaagtacg ttatctgtta gatcaaggat ataaaattgg tacggaacac gtagaccagc 6721 ggcggttccg tactggctct tggcaaagtt gtacaccaat acaagcaagg tcagttggtg 6781 aggcgatcgc agctttggaa agttgtttag cagatcactc tggcgactac gtccgccttt 6841 ttggcattga ccccaagggt aaacggcgcg tgttagaaac cattatccaa cgcccagatg 6901 gagtcgtaaa tgcgcctgct aacttcaaag ctcctaccac caatcaaaca aacaaaagct 6961 acagcgataa tggttacagc aacggcagtg gtagtggtaa actcagtgcc gaaactctag 7021 accaagtaca acaacttctt gcagctggtt acaaaattgg tacagaacac gtagatgagc 7081 gtcgcttccg tacaggttca tggcaaagct gccagcccat tgaagtcact tccacacaag 7141 aagttgtcag cgcattggaa gaatgtatgg aaaatcatca aggtgagtac gtacgcttga 7201 ttggtattga ccgcaaagct aaacggcgcg tattagaaag cattatccaa cgtcctaacg 7261 acccagttgg ctcatccagc agttctaagt ctacagctag tgcgccaact agtgaggtgc 7321 cggtcaatta ttctgccaga tcagcaggaa cagcaaccag tacccgcctc agttccgagg 7381 tcgtagaaca gctacagcaa ctcatcaatg gtgggtataa cattagcgta gaacacgtag 7441 accagcgacg ttttcgcaca ggttcgtggt caagcgctgg gcaaattcaa actcgttctg 7501 
cacaacaagc cgcagcagca ttagaaaact atctaaatca ataccaaggg gagtacgtgc 7561 gactgattgg tattgagccc aaagctaaac gtcgcgtact ggaaacaatt atccaacgcc 7621 cataagacaa agggacaaac aaaaagagct ggggacaagg ggacaagaca attcaaaatc 7681 accgaagatt tgaatctcct gagcnnnnnn nnnngccccg tgccgaacgg agacgctgcg 7741 cgaacggagg aaacctccgc tcaaacttct ctcaaaattc aaaattcaaa attcagaatt 7801 tagaattaga actctctccc tcatcctcct tactccccca ctccctcact ccctcactcc 7861 ctcctatttc attggcttcc gaggttacat atcatgtctg tgccgccact gcgcccaagt 7921 tatgactttg attcttatat aagtggcgag gtgataattc atccaagtgc agtacttgca 7981 ccaggtgtga tactccaagc agctccaaac agcaaaataa ttattggttc gggggtctgt 8041 gttggtatgg ggtcaattct ccaagtccat gaaggaaccc tagaagtaga agcaggagca 8101 aacctgggag ccggtctttt gatggttggt aaaggcaaaa taggggcgaa tgcttgtatt 8161 ggggcagcaa caacaatttt taattgttct gttgagccgg gacaagtcgt acccgctggt 8221 tctgttctcg gagatacaag tcgccgtatt tctgaatctc ctacacagag tgaacaaaca 8281 acccagtcgg aatcatccac aactaaccct acttccagta gtacgcagtc agaaaatggg 8341 acagggagac aaaaaggaca aggagacaaa cagataggag aacaggggga caagcaggaa 8401 tcatctgcca tctctccctc atcctcacgg cagtcgccac aagtcgggga acccgcccat 8461 ggcgctgcct cctcatcttc cttatccccc ccagtttcca ctcaggattc tcaaataagt 8521 agttacattt atgggcaaga aagcatacaa aagctgctgg tcacattgtt tccccataaa 8581 caatcgttga acaaacctat atctgacggt gaacctgaat aagtgttaat tgttaagttg 8641 tcagcaatca aagagcaact aacaattgag taccctggag aactgaaatg gaggcataca 8701 gtcaaaggtc tataagcaat atccagacat cgcgccgtcg agacgcactc agtgaaagtg 8761 cgttgggatt agtgtctacc cgcagttttc cagcaatggt tggtacagcg gatatgatgc 8821 tgaaaaccgc tggagttcac ctggttggat atgagaaaat tggcagtggt cactgtactg 8881 cgatcgtcag gggtgggata gctgatgtac gtctggcggt agagtctggt gtgcaaactg 8941 ctgaacaatt tggtcagttg gtttctagct tggtcattcc ccgtccttat cccaacctgg 9001 aagtgatact gcctatcaac agccgcctga gtgcactttc tcaagacagc tacaaccgcc 9061 tgagcaacca agcaattggt ttagtagaaa cgaggggatt tccagcgttg gtgggagcag 9121 ctgatgccat gttgaaagct gccgatgtcc aattggcagc ttacgaaaaa attggtgcgg 9181 gtttgtgtac 
agccattatt cgtgggtccg tggcaaatgt cgcgatggca gtagaagctg 9241 gtatgtacga ggcagaacgc attggagagt tgaacgcagt aatggtcatt ccaagaccat 9301 tggaggagtt ggagcaaact ttgccagttg caagttgctg gatagaagaa cgtcaaccgt 9361 taaagttacc cgtcaacatc aaggagcaag ttgcagaaac agagttagta caattgccag 9421 atttatccag gttacctcta aaaatgaccg aagaatgact gcgagtgaaa aagttaagag 9481 ttaagagcta agagtagcca gttgtgtaaa cgttcccccc ccttaagcaa actggattcg 9541 aggaatgagt cctttgtctt ttgcgtttac aaatgaccct ccgggtctgt tcattcgccc 9601 tgcaggcacg cacttgcaca gggaacagcc gtatgccctt cgggtatgcg caaagcgcac 9661 gccaaaggcg aacgccagtc gcctacggag ggagagccgt cattcgcgct ggtctcacca 9721 gtcacccgct ttcgggttac cctcaaagca gtgctggact tagcccactg gctcacaaat 9781 gacgaaccac tcatcactca taactcgtaa cttgagctat ctaagatcaa ggtgtgtaac 9841 cttggctgga aggtgtctta cgctgttgtt gctagtttga atcataactt tttccagttg 9901 ataagggaaa taccattatt agaatagttt tcataggttt ttatctatgt atacatccgc 9961 cttatattcc aatacgcttg ctgttaagga ttgatcctcc ctagccctcc ttaaaaagga 10021 ggga // LOCUS NODE_3351_length_10005_cov_5.20593010005 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 10005) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 10005) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10005 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 153..593 /locus_tag="DP116_23635" CDS 153..593 /locus_tag="DP116_23635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875321.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23635" /translation="MKMPGRFKIVPAALTMSLVFTNAHLAEAQVVQITSRLQPDPLIV NGTSGGSLASNCGNIATKPNQVIRVTESLPYLRLTVESGGQPTLLIHSPGGRFCVLAD KYSGGKPEISGYWQAGNYSLYVGELSRGQYNYTLSISQQKILTK" gene complement(535..744) /locus_tag="DP116_23640" CDS complement(535..744) /locus_tag="DP116_23640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017304034.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23640" /translation="MPLTLRWKTASNIHKETSSLFYSVPCSLLRVPYSLFPIPCSLLR VPYSLLIILLVSFVAKLIKYSYTVL" gene complement(757..1728) /locus_tag="DP116_23645" CDS complement(757..1728) /locus_tag="DP116_23645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_23645" /translation="MQIYGPDPITFEGVTGHRYWFNEPFTFTGVADIDERLSGFPVAV FLPHNRPAHQTPLVIGLQGMSAPYGWNAFIVPTLTQMGIAVALFDTPFAGERSLVRTF SAVVQNEIKPLVDGGIAFDTQLLLRIFRCTTRDIGKVVDFCYNRYSLTDGRVALFGVS MGVLQSAHAFTANGLGERLLGAIGHANLQSFAKSWGYPLLSELAASPLGKLAEALLER FQPELKPVIKVMQVAKKLKDGDEYSRACNPMTYIDQVKPHRRVRLLLGATDHIVNIRD ARWCAKQFPDGVCYVVPGMGHGQTHNGRSFVDHVRYFLVTQLADWRG" gene 1922..3616 /locus_tag="DP116_23650" CDS 1922..3616 /locus_tag="DP116_23650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_23650" /translation="MLRLEHISKIYPTGEVLKDVNWEVKAGDRIGLVGVNGAGKSTQL KIITGETEPTSGEIIRPASLHIAYLNQEFEVEPTRTVREEFWTVFKEANAVQLSLAQV MREMESATPEELDKLINKLDRLQRQFEALDGYILDTRIGKILPEMGFGLEDGDRLVSA FSGGWQMRMGLGKILLQEPDLLLLDEPTNHLDLETIEWLENYLKGLNTPMVIVSHDRE FLDRLCTQIVETERGVSSSYLGNYSAYLQQKAENQTAQLSAYERQQKELDKQQAFVDR FRASATRSTQAKSREKQLEKVERIEAPTDDLRTLHFRFPPAPRSGREVVNIKNLTHVY DDKILFLGANLFIEKGDRIAFIAPNGAGKSTLLRLIMGVEQPTEGSLTLGEHNVLPGY FEQNQAEALDLKKTVMETIHDEVPDWKNEEVRTLLGRFLFSGDTVFKAVAALSGGEKA RLALAKMLLQPANLLILDEPTNHLDIPAKEMLEEAIQNYDGTVLVVSHDRYFISKVAN KIVEIREGDFRVYLGDYHYYLDKIAEEKEEAKLAAIAAEKAAKKAAKASKSGTKKK" gene 3962..5560 /locus_tag="DP116_23655" CDS 3962..5560 /locus_tag="DP116_23655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864692.1" /note="catalyzes the interconversion of 2-phosphoglycerate and 3-phosphoglycerate; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2,3-bisphosphoglycerate-independent phosphoglycerate mutase" /protein_id="PRJNA477356:DP116_23655" /translation="MTKAPVAPVVLVILDGWGYCEESHGNAIAIANTPVMDSLWAAYP HTLIRTSGKAVGLPEAQMGNSEVGHLNIGAGRIVPQELVRISDAVEDGSILRNPALVK ICQEVFSRNGKLHLVGLTSSGGVHSHITHLFGLLDLAKIQGISQVCIHAITDGRDTTP TEGVKALGLLQDYIDRIGVGRIVTVSGRYYAMDRDRRWDRVKRAYDVMTQDGPGNGLP AVEVLQASYAERVTDEFVIPVRIAPGAIEPGDGVIFFNFRPDRARQLTQAFVNPEFNG FERQQITPLSFVTFTQYDPELRVGVAFEPQNLSNILGEVISKHGLKQFRTAETEKYAH VTYFFNGGLEEPFAGEDRGLINSPMVATYDSAPAMSAQAVTEVAIAAIEKRVYSLVVI NYANPDMVGHTGNIPATVKAVETVDQCLGRLLSSIGKVGGTAIITADHGNAEYMLDSN GNPWTAHTTNLVPLILVEGEKAKIPGHGTDVALGSDGKLSDIAPTILDILQLPQPPEM TGRSLLQNAEYEVQRSRTPVQLGM" gene 5681..5914 /locus_tag="DP116_23660" CDS 5681..5914 /locus_tag="DP116_23660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015141098.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecG" /protein_id="PRJNA477356:DP116_23660" /translation="MTIANVVEVIWALSAVGLIVLVLLHSPKGDGIGAIGGQAQLFSS TKSAENTLNRVTWALTAIFLGLTVVLSAGWLPK" gene 5924..6748 /locus_tag="DP116_23665" CDS 5924..6748 /locus_tag="DP116_23665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316912.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase" /protein_id="PRJNA477356:DP116_23665" /translation="MGRQTQKLILSSRKSTIRTTITKTVVLQRFIVSLALAIGTALLV ILTSFQSSAILVTSSSSSPPPLKPHPLPPTLTQWQDSTNSGDYFSQVSSTQVGSLVWS QFPVRVYVESPQAVNSKLAEEWVKTVVQAVQEWSVYLPLAIVEQPKDADITIVRKTPP LQTSPNSKILRARSAQTTYEVYVSKNNLLSHCFTILLSPSQTGQYLLAATRHEFGHAL GIWGHSPLPTDALYFSQVRNSPSISPRDVNTLKRVYEQPTSLGWSLTSVSTLQQEN" gene 7372..9069 /locus_tag="DP116_23670" CDS 7372..9069 /locus_tag="DP116_23670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321482.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="beta-Ig-H3/fasciclin" /protein_id="PRJNA477356:DP116_23670" /translation="MYNLKRLSTVSTALMALGIITATVSPTLVCAQTLPQETTPSTTP STTPSTTPSTTPTTTPSTTPATTNFPDVGADYWAQPFIQALAARNVITGFPDGTYRPD QPVTRAEFAAMIQKAFNQNRVRQLAAGGFQDVPSTYWGASAITQAYETGFLSGYPGNL FRPNQQIPKVQAIVALTSGLGLTASGDASNSLGTYYTDASSIPSYAVNNVATATQANL VVNYPNVSVLNPLVPLTRAEAAAHLYQALVKLGQVPPLANNVAAASYIVGRTTASAPT NTDIVSVAASSNSFTILTSLLKTAGLADVLQQPGPYTVFAPTDEAFSALPQQTLQQLQ QPENREALIKILRYHVVPGSLSADQLAPGNLKTAEGLPLNVKVNGGSQIAVNNARVIQ PNIQASNGVIHAVNRVLIPSDVSLNTQNGGGSGDNITPGRATRGVSSYIGVGGNIGFG GDTALGDGNFAVFSKIGLTRNFSVRPSAVIGDNPIVLVPITFDLSQRGTGQGFNIAPY VGAGVAITTGDNTDVGLLLSGGVDVPLTRRFTLTGSVNAAFIDDTGVGLMLGVGYNF" gene 9739..>10005 /locus_tag="DP116_23675" CDS 9739..>10005 /locus_tag="DP116_23675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867642.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23675" /translation="MLSKLQVRHPFLFLALPLATLASLSYFTVAKAQITPDNSLGAEN SVVTPNVQINGTPSDRIDGGAIRGDNLFHSFQEFNVNAGRGAYFS" BASE COUNT 2905 a 2163 c 2202 g 2735 t ORIGIN 1 tttcttaata tttaaccata taattatttt tccccaaaga aaactttata cagttcactt 61 atgaaacaaa gacagattgt tctcgtcact taattttaaa atattatgta tcaaattttt 121 cttgataaaa agtcgggagc ataacaataa atatgaagat gcctgggcga tttaaaattg 181 ttcccgcagc attaacgatg agcttagttt tcaccaatgc tcatttggct gaggcgcaag 241 ttgtgcaaat tacttctcgt ttgcagccag atccgctgat tgtgaatggt acgtccggcg 301 gatcacttgc aagcaactgt ggtaacattg ccactaaacc caatcaggtg atacgggtga 361 cagaatcact accttatttg cgcttaacag ttgaaagtgg gggacaacca actctattaa 421 ttcatagtcc tggaggacgc ttctgtgtgt tggcagataa gtactctggg ggcaagccgg 481 aaatttctgg ttattggcag gcaggaaatt actctttata tgtaggagaa ttatctagag 541 gacagtataa ctatacttta tcaatttcgc aacaaaagat actaacaaaa taattaagag 601 ggaataggga actcttaaca gggaacaggg aatagggaat agggaatagg gaactcttaa 661 cagtgaacag ggaacagagt 
agaagaggga tgatgtttct ttgtgtatgt tgctagcggt 721 tttccatcga agcgtgagag gcacactatt gtcaaactac ccccgccagt ctgccaattg 781 cgtcaccagg aaatagcgca catgatcaac aaacgatctc ccattatggg tttgtccatg 841 tcccattcct ggaacaacgt aacaaacacc atcggggaac tgcttggcgc accaacgggc 901 atctcgaatg ttaacgatgt gatcagtagc accaaggagt aagcgaacgc gacggtgagg 961 ctttacttgg tcgatatagg tcatcgggtt gcacgcccga ctatactcat cgccatcttt 1021 gagttttttc gctacttgca taaccttaat gacaggttta agctcaggtt ggaatcgttc 1081 tagcaatgct tccgctagtt tacctagagg cgaagctgct aactcgctta aaagcgggta 1141 gccccagctt ttggcaaagg actgtaaatt agcatgacca atcgccccca gcaatcgttc 1201 tccaagaccg ttagccgtaa aggcatgggc actttgcaaa actcccatac ttacaccaaa 1261 cagagccacc cgaccatcgg tgagactgta gcgattgtag caaaaatcaa ccactttccc 1321 aatgtcacgt gtcgtacaac gaaaaatccg cagcagcaat tgggtgtcga acgcgatacc 1381 gccatccact agcggcttaa tttcattttg caccacagca ctgaaggtac gcaccaagct 1441 acgctcacct gcaaagggtg tatcaaacaa agctacagct atacccattt gcgttagcgt 1501 tggcacaatg aaggcgttcc agccataagg agcagacata ccttgcagtc cgatgactag 1561 aggtgtctgg tgtgcaggtc gattgtgcgg gaggaagact gcaaccggaa agccacttaa 1621 tcgttcatca atgtcagcaa ctccagtaaa agtgaatggc tcattaaacc aataccgatg 1681 accagtaaca ccttcaaaag taatcggatc gggcccgtag atttgcatac caggtaaata 1741 acagaagctt aaccgaaaat tctgtttaaa aactcattcg ttcgtctaca tccaattttg 1801 aattgctatt ttatctcgaa agagttgacg actcaaattt ttgagttaaa ttttgttaag 1861 attggactca atacaatcac ttctgccttc tctttcaatt ccctaacctt ggagatatcc 1921 catgctgcga cttgaacata tcagtaaaat ttatcccaca ggcgaagtcc tcaaggatgt 1981 caactgggaa gtcaaagctg gcgatcgcat tggtttagtc ggtgtcaacg gtgccggaaa 2041 atccacccaa ctcaaaatta tcacagggga aactgaacct acgtctggcg aaatcattcg 2101 tcctgccagc ttacacatag cttatctaaa tcaagagttt gaagtcgaac ccactcgcac 2161 tgttagagaa gaattttgga cggtttttaa ggaagctaac gctgtgcagc tgtctctagc 2221 gcaggtgatg cgagagatgg aaagcgcgac tccagaggaa ctggataaac tgattaataa 2281 gttagatcgc ttacaacggc agtttgaagc attagatggc tacatcttgg acacacgaat 2341 tgggaagatt ctaccagaga tggggtttgg 
cttagaagat ggcgatcgcc ttgtaagtgc 2401 ttttagtggt ggttggcaaa tgcgcatggg tttgggcaaa attctattgc aagaacctga 2461 cttattacta ttagacgagc caacaaacca tttagattta gaaacaattg agtggttaga 2521 aaattatctc aaggggctaa ataccccaat ggtaatagtt tctcatgacc gcgagtttct 2581 tgaccgtttg tgtacccaaa ttgtggaaac agaacgcggt gtatcctcca gctaccttgg 2641 taactattca gcatatttgc aacaaaaagc tgaaaatcaa acagcgcaac tttcagctta 2701 cgaacgccag caaaaagaat tagataagca acaagctttt gttgataggt tccgcgctag 2761 tgcgactcgc agtacccagg caaaaagccg cgaaaaacaa ctggaaaaag tcgaacgcat 2821 cgaagcacct acagatgatt taagaacgtt gcacttccgt tttccccctg caccccgtag 2881 tggtcgtgag gtggtgaata tcaagaattt aactcacgtc tatgatgata agattttgtt 2941 tttgggtgca aatcttttca tagaaaaggg cgatcgcatt gcttttattg ctcccaatgg 3001 tgcaggtaaa tccactttgt tacgcctcat catgggtgtt gaacaaccca cagaaggatc 3061 actgacattg ggtgaacaca atgttctccc tggatacttt gaacaaaacc aagcggaagc 3121 tttggatttg aaaaaaactg ttatggaaac tatccatgac gaagttcctg attggaaaaa 3181 tgaagaagtt cgcacccttt taggaagatt tctcttcagc ggtgacactg tatttaaagc 3241 agtggcggca ttgagtggag gtgaaaaagc acgtcttgcc ttagcaaaaa tgctcttaca 3301 acccgctaat ttactaatac tggatgagcc aaccaaccat ttagacattc ctgcaaaaga 3361 aatgttggaa gaagctattc aaaattatga tggcacagta cttgttgttt cccatgatcg 3421 ttatttcatt tccaaggtag caaacaaaat tgtggaaatc cgtgaaggag atttccgcgt 3481 ctacttaggc gactaccact attatttaga taaaatagca gaagaaaaag aagaagcaaa 3541 attagcagca attgctgctg aaaaagctgc taaaaaggct gctaaagctt ctaaaagtgg 3601 cacaaaaaag aaatgagatg tagatgtagc agtcctaatt cattcatgag aatctctatt 3661 tgtgtctgac cttcgtgtct tttgtgtcct ttgtgatagc ctgcgggaag cagcgattcg 3721 caaagcgaca cctgctttgt gaatccgtct atgttcaaga aaaagacaca aaaccaatag 3781 gtgtacagta gtatagttat gataccaaat acaagaaatt gccaaagata aacaaagttt 3841 aagaaatgtc taaaagccca cttcgagcaa aataaagtag atcataagca gaaattcact 3901 tgcggctgct attagcaatg gtatcattcg ggtattacaa aaagtaaagg gcagttttac 3961 tatgaccaaa gcacctgtag ctcccgtggt gctagtcatt ttagacggat ggggctactg 4021 cgaggagagt catggaaacg cgattgctat tgctaacact 
ccggtgatgg atagcttatg 4081 ggcggcttat ccccacaccc tcatccgaac atcagggaaa gcggtggggt taccagaagc 4141 tcaaatgggc aactcagaag ttggtcattt gaacataggc gctggtagaa ttgtccccca 4201 agaattagta cgcatctcag acgcagtaga agacggttcc attctcagaa atccagcact 4261 tgtcaaaatt tgccaggaag tgtttagtcg aaatggcaag ctgcatctgg ttggactcac 4321 ttcctcaggt ggagtgcatt cgcacatcac tcatctattc ggactacttg acttagcgaa 4381 gattcaggga atatcccaag tttgcataca cgccattact gatggacgtg acaccactcc 4441 aactgaaggg gtgaaagcat tggggcttct tcaagactat atcgaccgta taggagtcgg 4501 gcgtatagtc acagttagcg gtcgctatta cgcgatggat cgcgatcgcc gttgggatcg 4561 ggtcaaacgc gcttacgacg tgatgacgca ggatggacct ggaaatggtc taccggctgt 4621 ggaagtccta caagcatcgt atgcagaacg tgtgacggac gaattcgtga tcccggttag 4681 aattgcacct ggtgcgatcg aaccaggaga cggagtcata ttttttaact tccgccccga 4741 cagagcaaga caactcaccc aagcttttgt caatccagaa tttaacggtt ttgaaagaca 4801 gcaaatcacc ccgctgtctt ttgttacctt tacacagtat gacccagaat tacgagttgg 4861 ggttgctttt gaaccgcaaa atttgagtaa tattcttggg gaagtcattt ccaagcatgg 4921 tttaaagcag tttcggactg cagaaacaga aaaatacgct cacgtcactt acttcttcaa 4981 tggcggtttg gaagaacctt ttgcaggaga agaccgagga ctgataaatt ctccgatggt 5041 ggcgacttat gatagcgctc ctgcgatgtc agcacaagca gtgacagagg tggcgatcgc 5101 agccattgaa aagcgcgtat actcgcttgt tgtcatcaac tatgctaacc cagacatggt 5161 aggacacaca ggtaacatcc cagcaactgt caaagccgtt gagacagttg accagtgttt 5221 aggtcgtcta ctctccagta ttggtaaagt tggggggaca gcaattataa ctgctgacca 5281 cggcaacgct gagtatatgc tagatagtaa cggcaatccc tggacagcgc acacgaccaa 5341 cttagttccc cttattttgg tggaaggtga aaaagccaag attccaggac atggtacaga 5401 tgtcgcattg gggagtgatg gtaagctctc tgacattgca cccacaattt tggatatcct 5461 acaactacct caaccaccag aaatgacagg gcgatcgctg ctgcaaaacg ccgagtatga 5521 agtgcaacgt tctcgcactc ctgtacaact ggggatgtga aagagtgaaa aatgatgaat 5581 tatgtaaaga aacaaaactc ataattcatc atttttaatt tttcatctat aattaaaaat 5641 tggtctatct taaaaatcgt tattaactat attagctacc atgaccattg ctaacgtcgt 5701 agaagttatt tgggcacttt ccgctgttgg tctcattgtt ttggttttgc 
tgcatagtcc 5761 caaaggtgat ggcataggag ccattggtgg acaagcccaa ctgttcagca gcactaagag 5821 tgcagaaaat acattaaacc gagtgacttg ggcactcaca gccatttttc ttggtttaac 5881 tgtggtttta agtgctggtt ggcttcccaa ataacatcaa gatatgggga gacaaaccca 5941 aaaactcata ttgagcagtc gtaaaagtac tattagaact accattacaa aaaccgtagt 6001 cttgcagcgc ttcattgtct cactagcatt agctattggt acagcgctgc ttgttatttt 6061 gacgagtttt caatccagcg ccatccttgt tacctcttcc tcatctagtc cacctcccct 6121 caagcctcat ccactaccac caacactgac gcagtggcaa gatagtacta atagcggtga 6181 ctacttttct caagtcagtt caacccaagt tggttctctc gtttggtcgc agtttcccgt 6241 tcgggtttat gtagaatcac cccaagcagt gaatagtaaa ctagcagaag aatgggtgaa 6301 gaccgttgta caagcagttc aggagtggag tgtttatttg cctttggcga tcgttgaaca 6361 gccgaaagac gcagacatta ctattgtgcg aaaaactcca ccattgcaaa cttctcctaa 6421 tagtaaaata ctccgtgcgc gttctgctca aactacttat gaagtgtacg ttagcaaaaa 6481 taatctttta tctcactgct tcacgattct gttgagtccc agccaaacag gacagtatct 6541 ccttgcagcc actcgccacg aatttggtca tgcattggga atatggggac atagtccgct 6601 accaactgac gccttatact tttctcaggt tcgcaactcg ccgtcgattt ctcctagaga 6661 tgtgaatacc ttgaagcgag tttatgaaca gccaacgagt ttgggatggt ctttaacaag 6721 tgtctccaca ctacaacagg agaactaaag aaatgcttca gattctttgt ggtgattgtt 6781 ttgtaaacaa aagtcaagaa gaacatctcc tataaggctg aagcttaagc agccggagga 6841 gggatttttg aaaatttgta gggtggctca atctgacttt tcacggcatg tggaaaaggt 6901 cattttgagg gtttcccaac ccctgatttt tccgttggtc attttcatta atccacaact 6961 attgacaata atgctgtgga aaaagccacg cagtttcgtg gtttgaggac ttaccgtgaa 7021 aagtcagggc tcaattcgta tcacagataa ctaataaaat ccttgcctga gatagatgtt 7081 taatctcctc ttaaagagct atatcaaagt atataaagaa tgttttgtaa tcattttctt 7141 tttctaacac gaaaagtttt ttgctgcatt ctttatcgcg tttttagttc atttaagcct 7201 acgtatttca aaacattttg aattttccat gagttgtttt attgagatct aagttctaac 7261 aataacttgt tcagtggttg attcaagtgc aatatatcaa ctgagtctta tgacttctct 7321 ggatttacgg taggcgagat gtcatctttt atgtgaggag caaaaaaact tatgtataac 7381 ttaaagcgtt tgtcaacagt cagcaccgca ttgatggcgc taggaataat aactgctaca 
7441 gtcagtccca cattagtttg tgctcaaact ttacctcaag aaaccactcc atcaaccact 7501 ccatcgacaa ctccatcgac cactccatcg acaactccaa cgacgactcc atcgacaact 7561 ccagcgacaa ctaactttcc tgatgttggg gcagattact gggcacaacc atttattcaa 7621 gctttagccg caagaaacgt aattactggt tttcctgatg gcacttatag accggatcaa 7681 cctgtgactc gtgctgaatt tgcagcaatg attcagaaag ctttcaacca aaaccgagtt 7741 cgccagttag ctgctggtgg atttcaagat gtaccttcta cctattgggg ggcttctgca 7801 attactcaag cctacgaaac cggatttcta tcagggtatc cagggaactt gtttagacca 7861 aatcagcaaa ttcctaaggt gcaggcgatc gttgctttaa caagtggttt aggtttgact 7921 gctagtggtg atgcaagcaa tagtcttggt acttactaca cagatgctag ttctatccca 7981 agctatgctg tgaataatgt ggcaactgca acacaagcta atcttgtggt taactatcca 8041 aatgtgagtg tactcaatcc tcttgtgcct ctgactcgtg cagaagcagc agcacattta 8101 tatcaagctt tagttaaact aggacaggtg ccacctcttg ctaacaatgt cgccgctgct 8161 agttatattg taggtagaac cactgctagt gctccaacta acacagatat tgtatccgtt 8221 gctgcctcta gtaattcttt tacaatcttg acttctttat tgaagacagc aggtttggct 8281 gatgttctac aacaacctgg tccttacact gttttcgccc ccaccgatga agcattttct 8341 gctttgcctc aacaaacctt acagcagttg cagcagccag aaaacagaga agcactgatt 8401 aaaattttaa gataccatgt ggtgcctggt agccttagtg ctgatcaact tgcacctggt 8461 aatctcaaaa ctgctgaggg cttaccttta aacgttaagg tcaatggtgg tagtcaaatt 8521 gcagtcaaca atgccagggt gattcagccg aatatccaag caagcaacgg tgtaattcat 8581 gcggttaata gagtccttat accgtctgat gtgagtctta atacacagaa tggcggaggt 8641 agtggagata atattacgcc aggtagagcg actcgtggtg tttccagcta tattggggtt 8701 ggtggtaata ttgggtttgg cggtgataca gctcttggtg atggaaactt tgcagttttc 8761 agtaaaattg ggctaacacg aaacttctcg gtgcgacctt cagcagtgat tggggataat 8821 cccattgttc tagttcccat taccttcgac ttgtcccaac gaggaacagg acaaggattt 8881 aacatcgcgc cttacgttgg tgcaggtgtg gctattacaa ctggtgataa tactgatgtt 8941 ggtttgcttc tctctggcgg tgttgatgtg ccgttaactc gtagatttac attaacaggt 9001 tctgttaacg cagcttttat agatgacact ggtgtcgggt tgatgctagg agttggctac 9061 aatttctaag taacggaacc tcaccctgcc ctgtcgggca tccctctcct tagcaaggag 9121 
agggaaaatt tttttgttgg taaaagtgaa ggtgaggtag tctgacaaaa aatacttttt 9181 tgttggcgca agcttgtgca cgacttaggg aacctcaccc tgccctgtcg ggcatccctc 9241 tccgcgattt cggagaggga aaattttttt gtaggtaaaa gcgagggtga ggtagtctga 9301 caaaaaatac tttaaactcg tttccagcct caggctggaa atgccggtga cgaggctctg 9361 cctcaaccta tagcgcttct gacttcggtg caatacagtc gtcacctcac cccgcatttg 9421 ctgacgcaaa cgctcccctc tccttagcaa ggagaggggt tgggggtgag gttgaaaaat 9481 gtacttcatg caaacgagaa ccgctatatt gcgcttacaa gttgggcatt gcccgcccaa 9541 cgcactggct ttctttatta aggagaggga aaattttttg taggtaaaag ggagagtgag 9601 gtagtctgac aaaaaatact ttgtcacaaa atcttacaaa taagtactat ttgcgtacaa 9661 acctgatatt agtactcaag agccacaatt tgacttattt taaagtattt ttatctacct 9721 catatccaca ctgctaaaat gctgagcaaa ctacaggtgc gtcacccatt cttgttcctc 9781 gccttacccc ttgcaaccct agcaagcctt agctacttca ccgtagccaa agcacaaatt 9841 actcctgata acagtctcgg tgcagaaaat tctgtagtca cgcccaacgt ccaaattaac 9901 ggcaccccta gcgacaggat tgatggaggc gcaattcgtg gggacaacct gtttcacagt 9961 tttcaggaat ttaatgtaaa cgctggtaga ggagcttatt tttcc // LOCUS NODE_3362_length_9969_cov_5.665322 9969 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9969) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9969) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9969 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 318..779 /locus_tag="DP116_23680" /pseudo CDS 318..779 /locus_tag="DP116_23680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457215.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="RtcB family protein" gene 1106..1825 /locus_tag="DP116_23685" /pseudo CDS 1106..1825 /locus_tag="DP116_23685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457215.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="RNA-splicing ligase RtcB" gene complement(1893..4085) /locus_tag="DP116_23690" CDS complement(1893..4085) /locus_tag="DP116_23690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186201.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="low-complexity protein" /protein_id="PRJNA477356:DP116_23690" /translation="MAYEFQRANLRGEDFRGKNLAGANFSHADIRGVNFSHAVLIGAN FRNAKAGLPTSWAISLVALSLILSLFAGLIAAYAGAFIGNLLSSKVYGHIFFGVFSLI ALAIFLIVIFWQGLGATLATLAEIIAACLIAGLAFFPENNLGGHLVIGAVFTTIALAG GMASVVNMAIAVAVGRIMALPMARAITGLMAFVGAVFGALFGVRADSSASVEAIVVAF FIAGLVALVAIASGIYVGWQAIYGDKKYQLIQALAIGIVAKKGTSFRGANLTDADFTQ ATLKSVDFRKAILTRTCWFQTQKLEQTRLEGTYLENPKVRQLVVTKDGVEKNFDHLNL RGLNLQDANLQDASFICTDLSEATLHKANLFGAKLAQAQLYQANLNEACLTGAYIQNW GVSTDTKLERVKCEYVYMQLPTKEDPDPCRKPDNRNESFKEGDFADFIAPIIKTLDLY QTQNVDLREVAKKFKTLDLFHYEGIDPSAAAIALTQLAENHPDAELEVVALEGRGQEK IRLQAKVAGDANRSELYNEYFQTYSQVKSLPYNDLQSLLLGVKEKDERIRSLEQLLEN AIQQPKFYVETYQTQGEFIMSQSKGNISISGTQGNISGVAGAGENLSMTGVAIGAISG SVTNTINQLPDSSESDKPGIKELLTELQAAIEADTNLSDEDKAEALEQVKAIAEAGQK PEDGAMQKMVKSAIKLLKGTVADLPTTVQLVEVCGKVLPTIATFFGLA" gene complement(4104..4640) /locus_tag="DP116_23695" CDS complement(4104..4640) /locus_tag="DP116_23695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186200.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23695" /translation="MFSKHWKYKIAILTPLVVLQAIFSPSLAQTPSRQLPTAQCTDIE IQKHIQQLNKAERLDFDALVACQSKAVPALIKALTLIKALKNKDENTRIIIIAALGEI GSQATPAVPLLNELLVKDESRDVVRMIDYALIQIEPCLGCLLIRDVKHNTLRYVNNNP PVMCRIPAIRAVLRWKCP" gene complement(5444..6595) /locus_tag="DP116_23700" CDS complement(5444..6595) /locus_tag="DP116_23700" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23700" /translation="MKPKTPIMDSKVQQRISVQRPHRVTHYKLARLVLLGVITFVLSG VPPIMATESKPTPKADFSPVMLGSNTTSETRDPAYWPFDAHSPWNMPIGSEAQFEPVS SSEWTTEALKYGLQVNTTDWSIPIFMADASDPIRSIYSTDYDKLAFEVHVPDAAVPDS NQDAHLHIIDETHNSVIEMLWAKRRADSNLEAPYPNKIDLKGPGVFDTYHGSCAYGGS CTAGLIRKGELHNGIRHALRISLTTAVLNKNTPSGKPYVWPANWADDDGKGSSYTGTG NVYMGSLLAIPRDVNIEAIVGPPGTPIYELARALQDYGAYVVDRGHLNLYGEPSAEEE VNQLSWQGLQVLPKYLQVVANNGSERVGGGGTPRRPLAPSFEVVDGTQS" gene complement(6636..8648) /locus_tag="DP116_23705" CDS complement(6636..8648) /locus_tag="DP116_23705" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hemolysin" /protein_id="PRJNA477356:DP116_23705" /translation="MSTINVTSNADNGAGSLREAIAQAQSGDTIQFDSSLANQTITLS SGQLTVDKNLTVDGAGATGLTISGNNASRIFDVATPGSSFSLRNLTLANGKSSGEGEN GAGGAIRTVTSDKLTTLNVENSKFKNNASSEGGAIWGGFNTANTITNSLFEGNDGTAG KSERGGGAIAVNANSTLTVKGSEFDNNKGTNGGAINTVLSTLTVEDSTFRNNDSTAGG PIGPNTIGYGGAIYTDGANASGPNFDYGPIGGTITIRNSQFEGNKGAGQGGGLFLYAY PPDKIIVDNSTITQNEVVPDSQGDSLGGGLRIGNGEFTINNTTFTDNRALEQGGGLWV GEQSPGTITKSTFSGNRAESADGKNGMGGAIALANGSNPVTIDGTTVANNYAGWQGGG ISGGGSSTTVKNSIFADNVAYNGGNGWNIKNQATEQLRDGGGNTQWPAKNSNDPTDIN VTASINIAQPDLSHLQNSTQSVGNSLNDNGSNLATASNLTPVSVSENGLNNGTSVDPI NNTSSNSANDSLTPVAVNGNSSSNGSDGSLTQVPGNDRSSSSTPIFTVSGDKLLCGNI TQADHLYSGLSQDTLTQGKGIDNSVLTPSQGSELNSQLNSGQNYVGLLDTLTPSHFST SQPAQDSSNVDNIQPRSLANLEKVNTPTLTSNPQFFSGQSYSTSTL" gene complement(9218..9472) /locus_tag="DP116_23710" CDS complement(9218..9472) /locus_tag="DP116_23710" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23710" /translation="MYSQMLRAFTLDSPILTESQLVRVPLLIQGMTPKAKVIVLNSTE NKNAQIAKVFIPHIYISGVPISHQQPETQESGTVQLNKQI" BASE COUNT 2761 a 2131 c 2136 g 2941 t ORIGIN 1 tttgagccaa gaaatttatt tcttggcgga cgaaaagtat ggtgcaagat ctgagcctcc 61 aactagcggc gtaccgcaag tgaaatatga aagaaacaca ttttccttaa atctctaaaa 121 gccttgctat atatggattt aaaaatgtgt ttcttaggag taatgcccat cgttatctgg 181 ttgtggtggg cattatccat ttaactaaca catgtctatt tgtaagtagg aaataaaatg 241 aaatactgtg caaaattaaa attgcgatga aactgccctt atagtcctaa atcttaaatt 301 caaaattcta aagaattatg gcttacgaag agttaaaact ttcaacacca tcgcctattt 361 tatcttgggc gaatcaccct ttgggttcag aagaaaccaa gatggcaaag aatgtggcat 421 cgctgccatt tgtgtataag cacctcgcat tgatgccaga tgttcacttg ggtaaaggtg 481 ccttagttgg ttctgtcatt gcaacaaaag aagcaattat tcctgcggct gtgggcgtgg 541 atattggttg cggtgtctgt gccattaaaa caccatttaa tggtgatcaa ctggaaggca 601 agctaaagaa aatccgcttg gatcttgaag cagaaattcc tacaggtttc aacgaaaata 661 aagacgttga aaaaagtgtc accaattggc aacgctggcg tgatttccaa gacttgcatc 721 aaggtgttca aaatttgcaa ggtaaagcga tgaaacaaat gggttctcta ggagggggta 781 aagaattgcc ccgtactgta gtaatacagt aatgaaaact cgtccaaatc ggtgaacgct 841 gagattgcta acaccgaggg aagtggtaaa acaccccgta gagagcagag ggacttgggc 901 atcctaactc actagttagg atgaaggtgt gctccgaact ataccgaaca ccgaaggtat 961 agaaccaagc agaaatgact tggtcgattg caactttcat ctatgatatt gtggagtaaa 1021 tgaaagtgtt aagcagaagt cgaaatagtg ttgaactaag actgtggaga ttgagcaagc 1081 ttgacaagca attagtaaca aattgaatca cttcatagaa gtttgcctcg atacagagaa 1141 ccaagtttgg ctgatgttgc attccggttc ccgccatatt gggaataatc tagcccagtg 1201 ccacattaat acagctaaag aattagcaaa aatggcgggt aataaattac ctgaccccga 1261 tttagctcat tttgtcgctg gtacgccaga atttaaagca tactggcatg atttgcagtg 1321 ggcgcagaat tatgcgcgtt tcaaccgtga ggtgatgatg gggcgtttca agcgcatcgt 1381 cgaaaagcat ctaacgggtg gtaaagcgac gaagccttta ttggaagtga actgccatca 1441 caactacgct gaaaaagaag tgcattttgg tgaggatgtc tacgtgactc 
gcaaaggtgc 1501 agtccgtgca acacaagaag actatggcat tatccccggt tcgatggggg ctaaatctta 1561 catcgttaag ggtaagggtt ctgccaacag tttttgcagt tgttcccacg gtgccggacg 1621 tttgctatca agaagtaagg caaaaaatac ctacacgcta gatgatttga ttgagcaaac 1681 aaaaggtgta gagtgtcgta aagatacagg cgttttggat gaaattccca gcgcttataa 1741 gccgatagat caggtgatga ataatcaagc ggatttggtt gaagttgtag caacactcaa 1801 gcaagtcctt tgtgtgaagg gttgaatttc gggtgggcaa tgcccaccct actttttcag 1861 agattttaac tgggtagttg acaatgctta gcttatgcca aaccaaaaaa tgtcgcaatt 1921 gtcggcaata ccttaccaca aacctcaaca agttggactg ttgtaggtaa gtcggcaact 1981 gttcctttta gtaattttat tgcgcttttt accatcttct gcatcgctcc atcttcaggc 2041 ttttgacctg cttcagcaat agctttcact tgttctaaag cttccgcttt gtcttcgtca 2101 ctcaaattcg tatcagcttc aatagccgct tggagttctg ttagcagttc cttaattcct 2161 ggtttatctg attcactaga gtcaggcaac tgattaattg tattagtaac actaccgcta 2221 attgcgccga tcgcaactcc tgtcattgat aaattttcac ctgcgccagc aacaccgcta 2281 atattaccct gtgtaccact gatgctgata tttcctttac tttgtgacat aataaattct 2341 ccttgagttt gataagtttc tacataaaac tttggctgtt gtatagcatt ctctagtagt 2401 tgttctaaac tgcgaattct ttcatctttt tctttaactc ctaatagtaa cgattgtaag 2461 tcattgtaag gtaaagattt gacttgacta tatgtttgaa aatattcatt gtaaagttca 2521 gaacgatttg catcacctgc aactttagct tggagtcgaa ttttttcttg ccctctgcct 2581 tctagggcga caacttctag ttctgcatct gggtgatttt cagctaactg tgtaagggcg 2641 atcgccgcag cacttggatc aataccttca tagtgaaaca agtcaagagt tttaaacttc 2701 ttggctacct ctcgcaagtc aacattttga gtttgataca aatcaagcgt cttaataatt 2761 ggagcaataa aatcagcaaa gtctccttct ttaaagcttt cgttacgatt atcaggttta 2821 cggcaaggat cggggtcttc ctttgttggt agttgcatat aaacgtattc gcacttcacc 2881 ctttctagct tagtgtcagt ggaaacgccc caattttgaa tataagctcc tgttaagcaa 2941 gcctcattta agttagcttg atataattgc gcttgagcaa gcttggcacc aaataaatta 3001 gctttgtgca aagtcgcttc actcaaatct gtacagataa agctagcatc ttgcagatta 3061 gcatcctgaa gattcaaacc gcgcaaattc agatggtcaa agtttttttc gacaccatct 3121 ttcgtaacaa ccaattgtcg cacttttgga ttttctagat atgtgccttc caaacgagtc 
3181 tgttccagct tttgagtttg aaaccagcaa gtgcgggtga ggatagcttt tctaaagtct 3241 acacttttga gggtagcttg ggtaaagtca gcatctgtta aattggcacc gcgaaagctt 3301 gttccttttt tggcgacaat gccaattgct aacgcttgaa tgagctgata ttttttgtct 3361 ccataaatag cttgccagcc aacataaata ccagatgcga tcgccaccaa agcaaccaat 3421 ccagctataa agaaggcaac aactatagct tcaacagaag cagaggaatc tgctctcact 3481 ccaaacagag caccaaatac agcacctaca aaagccatca atcctgtgat cgctcttgcc 3541 ataggtaaag ccattattct ccctacagcc acagcaatag ccatgttgac aacgctagcc 3601 attccaccag ctaaagcgat cgtagtgaat actgcaccaa taacaagatg ccctccaagg 3661 ttattttcgg gaaaaaatgc caatcccgca attagacatg cagctataat ctcagcgagt 3721 gttgctagtg ttgctcccaa gccttgccag aaaataacta ttaaaaagat tgctaaagcg 3781 attaaggaga aaactccaaa aaagatatgt ccataaactt tactactcaa caaattgcct 3841 ataaaagcac cagcatatgc agcaattaat cctgcaaata atgatagaat gagtgacaaa 3901 gcaaccaagc taatagccca agaggttggc aaaccagctt tagcattgcg gaagtttgca 3961 ccaatcaaaa cagcatgact gaagtttaca cctcggatat cggcatggct aaagtttgct 4021 ccagcgaggt tttttcctct gaagtcttca ccccgtaaat tagcacgttg aaattcatat 4081 gccattgctg tttatccgtt gaactagggg catttccatc ttagtacggc gcgtattgct 4141 ggaattcgac acatgacagg agggttatta tttacgtaac gcaaagtatt atgtttgaca 4201 tctctaatca ggaggcaccc taagcatggt tcaatttgta tgagagcata atcaatcatc 4261 ctcacaacat cgcgactttc atccttaact aataactcat ttaataaagg tactgctggt 4321 gtggcttgcg aaccaatttc acccagtgca gcaattatta tgatacgagt gttttcatcc 4381 ttatttttca gagcttttat aagagtcagc gcttttataa gagcaggaac tgcttttgac 4441 tgacacgcta ccagtgcatc aaaatctaat cgttcagctt tgtttagttg ctgaatatgc 4501 ttttgaatct cgatatcagt gcattgagca gtaggtaatt gccttgatgg agtttgagct 4561 aaggaaggtg agaatatagc ttgtagtaca actaaaggcg tcaatattgc tattttatat 4621 ttccagtgtt tactaaacat tattttcagt tttatatcca gatattttta cattgcctcc 4681 tgtcaattag cattgtcaaa tttctctaca atgacaataa atctatcgaa ctttagggca 4741 tttccatctc aacacaattc gtattgctcg acttcggcag attataggct gttttttatt 4801 tgtgtaagaa gtggttgctt tttcgacgta agctacacct gactggttac agttttgagt 4861 
accatttact acttcacaat caggtgttgt aatagatgtg ttattcctat cagaacgaat 4921 atgattatgt ctatgtcctg cattctttgc agaagattga gaaatagttg acctatctga 4981 tgaagtttga gcaagcactg ttatagttgt taacacagag aacgttactc ctcctaccac 5041 agctaaggtc aacataactg ctttaggttt ccaacgcttt cttaacacta aacacctcct 5101 gtgaaaagcg tccatttcac ttctgtagta taagaaaaat atacaaaagg aagtgagatg 5161 attccggatt gtgctaaaat ttttctgcaa aaacgagacg ctggcaagag attccagtga 5221 aaatacgtca aaatttcgat tgtcgctgga tttagctaat agacaggtat gaaattgttt 5281 ttatataaag aaaattaagg atatggcaat cctatttcat ttgtgaacat cgggcatatt 5341 tagattcccg acttctttaa gaagtcggga atcttgttgt tcacgttatt taggaatcag 5401 gtagaaaaca gcacaaaaaa gcaaaacaaa agtgtgtatt tggctaggac tgtgtaccat 5461 caacgacttc aaacgaagga gcgagtgggc gacgaggtgt cccgccacca ccaacacgct 5521 ctgaaccatt gttcgctacc acctgaaggt actttggcag tacttgcaga ccttgccatg 5581 aaagctgatt gacctcttcc tctgctgagg gttcaccgta caggttaagg tgaccccgat 5641 ccactacata agcaccgtag tcttggaggg cgcgagcgag ctcgtagatc ggtgtgcccg 5701 gtggtccgac aatagcctca atgttaacgt ctcttgggat tgccaacaac gaacccatgt 5761 acacattccc cgtcccagta tagctgctgc ccttgccgtc gtcgtcagcc cagtttgcag 5821 gccatacata aggctttcca ctaggggtgt ttttatttaa gacagcagtt gtgagcgaga 5881 ttcgcagtgc gtgtcgaatc ccgttgtgga gttccccctt gcggatcagt ccagctgtac 5941 acgagccgcc gtaggcacac gagccgtggt aggtgtcaaa gactccagga cccttcaagt 6001 caattttgtt ggggtacggt gcttcaaggt tgctatcagc ccgcctcttt gcccagagca 6061 tctcgatgac ggaattgtgg gtttcgtcga tgatgtgcaa atgagcgtct tggttggaat 6121 ccggtacagc ggcgtctggc acgtgcacct cgaaagcaag tttgtcgtag tccgtgctat 6181 atatgctgcg aatcggatca gatgcatcag ccataaagat cggaatagac cagtctgttg 6241 tattgacctg gagaccatac tttaacgctt ccgtcgtcca ctcggaggaa ctcaccggct 6301 cgaactgcgc ctcggagccg attggcatat tccaagggct gtgtgcatca aatggccaat 6361 atgcggggtc gcgtgtttcg cttgtcgtgt tgctacccaa cataacgggt gagaaatcgg 6421 cttttggggt cggcttgctc tcagtcgcca taattggggg cacgccggag aggacaaaag 6481 taatcactcc cagcaggaca agccgtgcaa gcttataatg tgtgacgcgg tgtgggcgtt 6541 gcactgatat 
gcgttgctgg accttggaat ccattatcgg tgttttaggc ttcatagtca 6601 gcgattaacc caccatgttt tgtcatgcga taaccttaca gagttgaagt agagtaactt 6661 tgtccagaga aaaattgggg attactggta agcgttggag tattgacttt ttccaaattc 6721 gccagtgacc gaggctgaat gttgtccaca ttcgagctat cttgagcagg ctggctggtt 6781 gaaaaatgac tcggagttag agtgtccaat aagccgacat agttttgtcc agagttcaac 6841 tgagaattta attcagagcc ttgactagga gtgagcacag agttgtcaat tcccttgcct 6901 tgtgtgagag tgtcctggct aagaccactg tagagatgat cagcctgagt aatattacca 6961 cacagtagct tatcaccact gacagtgaat ataggggtac tgctgctgga gcgatcatta 7021 ccgggcactt gagtaagact accatccgaa ccattgctag agctatttcc attaacagcg 7081 acaggggtga ggctatcatt ggcactgttg ctagaagtat tgtttatggg gtctacagaa 7141 gtaccgttat ttaagccatt ctcactaaca gatactgggg taaggttgct tgctgtagcc 7201 agattggagc cgttatcatt caaagaattt cctacgcttt gagtactgtt ttgcaaatga 7261 ctcaaatcag gctgtgcaat attaatagat gctgttacat tgatatcagt aggatcattc 7321 gagtttttag ctggccattg ggtatttccc ccaccatccc tgagttgctc agtcgcttgg 7381 ttcttgatgt tccaaccgtt accgccattg taggcaacgt tatctgcaaa aattgaattt 7441 ttaacagtag tggacgaacc tcctccagaa atgcctccac cttgccaccc agcgtagttg 7501 ttagcaacag tcgtaccatc gatagtgacg ggattagagc cattagctaa agcgatcgct 7561 ccacccatcc catttttgcc gtcagcactc tcggctcgat taccagagaa cgtactcttg 7621 gtgatggtac ctggcgattg ttctcctacc cacaaaccac caccttgctc cagtgcccga 7681 ttatctgtga aggtggtgtt gttgatagta aactcaccat taccaattcg caaaccaccg 7741 ccgagtgaat caccttgact atccggaacg acctcatttt gagtgattgt ggaattgtca 7801 acgatgatct tgtctggggg ataagcatac aaaaacagtc ctcccccttg tccagcacct 7861 ttgttaccct caaactggct gttgcgaatg gtaattgtgc cccctatagg accataatcg 7921 aaatttggtc cagaagcatt cgctccatca gtataaatgg ctccaccata acctatggta 7981 ttgggaccta tcggacctcc agcagttgag tcattattgc ggaaagtaga gtcttctact 8041 gtgagtgtgc tcaatacagt gttgatcgca ccgccattgg ttcccttgtt attatcgaac 8101 tcactacctt tgaccgtcag agtactgtta gcattgactg cgatcgctcc gccaccgcgt 8161 tcgcttttac cagcagtacc atcgttaccc tcaaacagac tgttagtgat ggtgttagcg 8221 gtgttaaagc ctccccaaat 
cgcacctcct tctgaagagg cgttattttt aaatttgcta 8281 ttctcaacgt tgagagtggt gagtttatca ctcgtaactg tccgaattgc acctccggca 8341 ccattttcac cttcaccact ggattttcca ttggcaagag tcaaatttcg caagctaaaa 8401 cttgagccag gtgttgcgac atcgaaaatt cggctagcat tattgccact aatcgttaga 8461 ccagtagcac ctgcaccatc aacggtcagg tttttgtcaa ctgtgagttg acctgagctg 8521 agggtaatgg tttgattggc aaggctagag tcaaattgaa tcgtatcacc tgattgggct 8581 tgagcgatcg cctctctgag ggaaccagca ccattgtcag cattagacgt tacgttaata 8641 gtactcatgt tctatcaatc ctcgtttgtt gtttgttttg accaaataac tcatgaacaa 8701 tgaccaacca agtggtctag atctacgatc agcacctatc agtacgttgc ggctgaatag 8761 gcaataactt tctgaccatg acctgatagg catgatcata tgaagtgcac ttcatacaga 8821 ctcgtgctga ttatgtcatc accgatagtt gcactctcaa ctctagttaa ctcaagcaag 8881 agcgttgctc aagggcggat ctagtcaccc ttagcacaaa cagagctacc atagagtctg 8941 gtacttgcct ctaatgttga tgctcttgcc tactcttgtt aacagttatc agttatcagt 9001 tatcaagtac cagccgcagt tatcagtttg aagaagaatt cagcaattct gcaattcagt 9061 attcagaacg aagaaataaa gaattgcgac ttctgtacgg gcgctgcggc cttgcgcccc 9121 tactgactcc tgactccttc actgcgtagg tagctgttca ctgttcactg ttcactgttc 9181 actgttcact ggtttaagag gcagagttgt ttttgtttca gatttgttta ttgagttgca 9241 ctgtccctga ttcctgagtt tcaggttgtt ggtgggagat cgggacacca gagatataaa 9301 tatgtggaat gaacactttg gcaatttgtg catttttatt ctctgtactg ttcaagacaa 9361 tcaccttggc ttttggtgtc atgccttgaa tcagcaaagg gactcttact agttgagact 9421 cagtcaatat tgggctatct agagtgaaag cccgaagcat ttggctatac atagtaaatt 9481 cttgtgatga cgggtttgct gtagatactt aacttacgca actgtaagca cctgtaggtt 9541 gcattagtat tccagcgatc aagaagcatg agttgcttag cttttggtag acacccgtta 9601 ttcagtcgat ttactctggg caagcaagtt tttgaatttt cctgcttgat actgtcaaag 9661 aacttttaac actctttgct cttactctga tgaagcttta aaagctgaag ttgttttttg 9721 aaagtatgca ccttaaatat tatttgcacc tttttctacc gaaaatgtca atttatttct 9781 aacttcataa agcttttatc tttttgctca aaaatcttta tgataaattt ttaactgact 9841 tctccattct tgttgactaa agtccaaaaa cgtgcattaa gtgagaaaca tctatcgtat 9901 aggattcccc gaaagatttt tgcgaagctt 
cgtacagttt atgttaacca gttatcagtt 9961 atcagttat // LOCUS NODE_3372_length_9955_cov_4.653333 9955 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9955) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9955) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9955 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 175..1068 /locus_tag="DP116_23715" CDS 175..1068 /locus_tag="DP116_23715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015146773.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="EamA family transporter" /protein_id="PRJNA477356:DP116_23715" /translation="MNFKGELAALGVAFLWALTSVIYSRLGKKIFPLAMNLSKGAIAI AMAFLTILLSGEQLLPAIDSIRFILLLLSGAVGIGVGDTAYFAALNSLGARQTLLLKT LGPPMAAIVSTIFLHEQLSYVAWVGILLTILGVAWVISERVKNATINDKLIVGVSFAL LSAFTDAMGAVLSRAALAETTINSLWSAMVRLVGGVLILLLWLPMKREPVRASLKELR SGRILGIVILCAFLSTYLGFWLQQISLKFSPAAIAKSLNATSPLFVLPFAFFIGEKIS LRAILGVLVAIAGMGLLFIYR" gene complement(1206..1409) /locus_tag="DP116_23720" CDS complement(1206..1409) /locus_tag="DP116_23720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008186983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23720" /translation="MKCCFKRGIYFQKVDSRKTSQICPNCGTETGKKELSRVHVCENC GYTTDRDVAAAQVVLNRGCAVVG" gene 1553..2791 /locus_tag="DP116_23725" CDS 1553..2791 /locus_tag="DP116_23725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312306.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF445 domain-containing protein" /protein_id="PRJNA477356:DP116_23725" /translation="MTLDWSHLWLYVSPPIVGGIIGYFTNDIAIKMLFRPYRAIYIGG RKIPFTPGLIPRNQERLAKNVSDTIMGSLLTPQELQKLARRLLQTERIQGAILWILRL AIDQLKSDKEQKTAKILSGILRDLLGESLPRLLKVLARREDFLEAQVNQVFDQILLDF QLTEEQASRLADWLLQVVIPPEILRLAVIDFLTDRTIQTIDESFREKTSGTYWVVANL FGLRNTLTRLRTYCLDEKDATNARLQELIQQLQVRDRLKQLLQSFSLENLPIGTVRQL RKTTRESVRHYVQTRGSDLLQGLGSSVDWENIGMLLVNRLSSSPVVNASLEVVSKELA LVLERYLEKDLEAIVAQAIPILAIDQVIVDRVKSTSPADLEAAIEGIVKNELQAIVNL GGVLGFIVGLVQTTFLLFAQ" gene complement(3031..4503) /locus_tag="DP116_23730" CDS complement(3031..4503) /locus_tag="DP116_23730" /EC_number="5.4.2.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873985.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoglucosamine mutase" /protein_id="PRJNA477356:DP116_23730" /translation="MVSYITRTQGTISGNSTTNIDESEKKVNEGGFGLNLTSLLGTPL FGTDGIRGRVGDLLGAPLALQVGFWAGTVLRSHASNPGPVILGQDSRNSSDMLAMSLS AGLTAAGLEVWHLGLCPTPCVAYLTSITNAIGGVMISASHNPPEDNGIKFFGADGMKL SQALQTEIEAGVRGMASANCNHCGRYYSRLDLVGSYVEALKKPLHGVMNFQGMKIVLD LAWGAVVGLAPSVFKEMGAEVIYLHNEPDGDRINVNCGSTHLGILQAAVQEHNADLGF AFDGDADRVLAVDNTGRQVNGDYILYLWGQKLKQKQQLPSDLIVSTVMANLGFERAWK QVGGQFIRTAVGDQYVQAEMLRTGGMLGGEQSGHILCRHYGMTGDGLLTALHITALVQ QAGVPLSEMVSQSFQTYPQLLQNVRVEDKSKRLGWKECEPLQQAIARAEAAMGDSGRV LVRASGTEPLIRVMVEAEAAELVNYWTNELVTQVQQHLVA" gene 4879..5592 /locus_tag="DP116_23735" CDS 4879..5592 /locus_tag="DP116_23735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heme oxygenase" /protein_id="PRJNA477356:DP116_23735" /translation="MSSNLATKLRVGTKKAHTMAENVGFVKCFLKGVVEKNSYRKLVA NFYFIYSAMEEEMEKHANHPIVSKINFPQLHRKETLEQDLTYYYGVNWREQIQLSAAG KAYVERIREISEKEPELLVAHSYTRYLGDLSGGQILKGIAETAMNLSDGGTAFYEFDE IPDEKGFKAKYRQALDELPIDDATADHIVDEANAAFGMNMKMFQELEGNLIKAIGLML YNSITRRRTRGSTELATAE" gene 5911..6747 /locus_tag="DP116_23740" CDS 5911..6747 /locus_tag="DP116_23740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320484.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" /protein_id="PRJNA477356:DP116_23740" /translation="MNILEFTLLVWLSSATAGFLGALTGLGGGVVLVPLLTIVFGVDI RYAIGASLVSVIATSSGAASAYVKEGYTNLRLGMFLEVATTFGAITGATIAAFVPTRI LAVVFGFVLLYSAYLSRQPRSEHADDTPPDPLATRLKLNSTYPTPESEQPYNVRAVPV GFSLMFVAGVLSGLLGIGSGALKVLAMDQFMRIPFKVSTTTSNFMIGVTAAASAGVYL KRGYIDPGLAMPVMLGVLLGALLGARVLVKARVDVLRNIFSIVIVLLAIQMIYNGFLG RI" gene 6749..7132 /locus_tag="DP116_23745" CDS 6749..7132 /locus_tag="DP116_23745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307576.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1634 domain-containing protein" /protein_id="PRJNA477356:DP116_23745" /translation="MSQTRRNLSERQVEILVGNLLRYGVLIATAIVLFGGVLYLIYHG KEAPNYQIFRGEPPAFTSPEGVANSALSGRRRGIIQLGLLLLIATPVARVAFSLLAFM RQRDIIYIILTVIVLTGLMISLIGA" gene 7284..9815 /locus_tag="DP116_23750" CDS 7284..9815 /locus_tag="DP116_23750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315487.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="U32 family peptidase" /protein_id="PRJNA477356:DP116_23750" /translation="MKADRKPTPKMTLPTFGRPELLAPAGYWDCAKAAVENGADAIYF GLDRFNARMRAQNFTEADLPKLMEFLHRRGVKGYVTLNTLIFPQELREAEQYLRSIIS AGVDAVIVQDIGICRLIRHLSPDFPIHASTQMTITSAAGVEFAKSLGCQLVVLARECS LKEIEKIQRQLPEHQASLPLEVFVHGALCVAYSGQCLTSEALGGRSANRGECAQACRM PYELISDGKVVDLGNRKYLLSPQDLAGLEILPELVQAGVSCLKIEGRLKTPEYVANVT RVYREALDRVMADLDVGDKKKFSTSFQEHYNLEMAFSRGQYTGWFRGVNNQELVHAHF GKKRGVYLGEVTRISNEKVIVRLQAPVKPGDGVVFDCGHPEAQEQGGRVYAVEQKAKE TVLTFGRRDLNLRRIHVGDKIWKTNDPELDKQLRQSFAGENPQFQRPIHIEVYGEVGQ SVTAIARDELGHIVQVESTIPLEEAHTKPLTPERLHEQFARLGNTPFYLGSLTNHLNG AFMLPVSELNRLRREIVAQLEELRSQPKRWRLNFHASLQDLLPSTSKVQATPNSPSLI VLVRNFKQLQAALEAGIETLYCEFEDPRAYPEAVQMVRQASKANTQHSIWVAPPRITK PGENWILQLVRSCEADGYLVRNYDHLQFFANERCIGDFSLNVANPLTADYFQHQFGLE RVTASYDLNITQLEDLLTSCPPHWFEVTIHQHMPMFHMEHCVFCAFLSQGTDYTNCGR PCEKHEVKLRDRVGTEHILQADAGCRNTVFNGTAQTAAEYVQHFIELGVRNFRIEFLN ETPETMTQTIHRYQQLLQGEMTGSQLWRELKLQNQLGVTRGSMGA" BASE COUNT 2637 a 2169 c 2252 g 2897 t ORIGIN 1 atccccagac tcagggagcg catctaccct ctgatagctc gtcattaccg aactttacct 61 ttttatgact agccatcttc tgcacctttt gtttactcgc ttcattttct ggcatggcga 121 tgcttcatgg aggttagatg ttttatgctg gcaaattttt ctgacttatt tttaatgaat 181 tttaagggag aactggcagc tctgggagtg gcgtttttgt gggcgctgac ttcagttatc 241 tacagtcgct taggaaagaa aattttcccg ctggcgatga acttgagcaa aggtgcgatc 301 gcaattgcga tggctttcct caccatcctt ttgagtggtg agcaactgct gccagcaatt 361 gactcaatcc gcttcatact cctacttctc agcggtgctg tcggaattgg tgttggtgat 421 actgcttatt ttgcggcatt gaacagcttg ggagcaagac agacgctgct attgaagaca 481 ttaggtccac cgatggctgc aattgtatcg actatttttt tgcacgaaca actgtcatat 541 gttgcttggg ttggtatttt gttaactatt ttaggtgtgg cttgggtaat tagcgaacga 601 gttaaaaatg ctactatcaa tgacaagctg attgttggtg tcagctttgc tctgttatca 661 gcattcacag acgcaatggg tgcagtttta tcccgtgcag cgctggcaga aacaactatt 721 aattccctgt ggagtgcgat ggtacggcta gtgggtgggg tattaatact gctgttgtgg 781 ctgccgatga agcgagaacc agtccgcgcc tctctaaaag aattgcggtc tgggcggata 
841 ctgggcatcg ttatactgtg tgcttttctg agtacttact tgggcttttg gctgcaacaa 901 atctctctta aattcagtcc tgcagccatt gctaagtccc tcaatgctac gagtcctctg 961 tttgtcctgc cgtttgcttt ctttattgga gaaaaaatca gtctacgggc aattctgggc 1021 gtgttggtgg cgatcgctgg gatggggctg ttgtttatct accgctgatt gcggataacg 1081 agtattttca tagccaaaat ccttataaaa ttattgacct cccacggctt tagcgtagcg 1141 tgggattctt gaatcatagg agaggagttc tactaaactt actttcctac gcgcatcttg 1201 atcgtttacc caacgactgc acaccctcta ttgaggacga cttgagcggc tgctacatct 1261 ctatccgttg tgtagccgca attctcacaa acatgaacgc gtgacaactc ttttttgcct 1321 gtttcggttc cacaattagg acaaatctga cttgtttttc tgctatcgac tttctgaaaa 1381 tagataccac gcttaaaaca acattttatt tcttgaaact ctgccttggc ttcatgcctc 1441 ccgcgttatt ttaggtatga tggaaatgct agtgataacg aaagcttagt agttgttcat 1501 tatttagtga ttagtaattc actaatgatt aataactcat aactaatgac taatgacttt 1561 ggattggtct catctttggc tttatgtctc tcccccaata gtgggtggaa ttattggcta 1621 cttcactaac gatatagcca tcaaaatgtt atttcgtccc taccgtgcaa tttacatcgg 1681 tggacgaaag atacccttta cacctggatt gattccccgc aaccaggaac ggcttgctaa 1741 gaatgtctcg gatacaatta tggggtcgct actgacaccc caagagttgc aaaaactagc 1801 gcggcgactg ttgcaaacag aacgcatcca aggagcaatt ctctggatat tgcgactggc 1861 aattgaccaa ctaaaatccg ataaagagca gaaaactgct aaaatattgt cgggaatttt 1921 gcgggatttg ttaggagaat ctttaccacg tcttttgaag gtattggcgc ggcgagaaga 1981 ttttttagag gcgcaagtta atcaagtttt tgaccagata ttactggact ttcaacttac 2041 tgaagaacaa gccagccgcc ttgctgattg gctgttgcaa gtcgtcatac cgccggagat 2101 tttgcgacta gcagtcattg actttttgac agatcgcaca atccaaacta ttgacgaaag 2161 ttttcgggaa aaaaccagcg gtacctattg ggttgtggca aatctgtttg gtttacgcaa 2221 tactttaacg cgcctacgga cttactgctt ggatgaaaaa gatgctacta atgctcgctt 2281 gcaggaattg attcaacagt tacaggtgcg cgatcgcctc aagcaacttc tacaaagttt 2341 ttctttggaa aatttgccaa taggtacagt acgacaactg cgaaagacga cacgcgaaag 2401 cgtccgccat tacgtgcaaa cacgcggtag cgatttactc caaggattag gtagttccgt 2461 tgactgggaa aatattggta tgttgctggt gaatcgcctc agttcctctc ctgttgtgaa 2521 
tgcttcactg gaagtggtta gcaaagaact ggctcttgtt ttagagcgat atttggagaa 2581 agacttggaa gcaattgtag ctcaggcaat tcccattttg gcaattgacc aagtaattgt 2641 tgaccgcgtg aaatcaactt cgcctgctga tttagaagca gcaattgagg gaattgtcaa 2701 aaatgaatta caggcaattg tgaatttagg aggtgtttta ggctttattg tgggattggt 2761 gcaaacaacg tttttgttat ttgctcaata gactttttgc aaaaagatta tttttcagcc 2821 aatacggttg ctgatgacaa aatacataat gtttgtagag acgttgcatg tgagtccagc 2881 gctgcgggag ggtttccaac gccagatacc tatggaggga aaccctcctg cagtactggc 2941 tcctccgcag gcgactggcg aatccgaagg acaacgtctc tacacgtgtg aagtctttca 3001 aaataatcct taactgaaca gtcttgattt ttaagccact aagtgctgtt gaacttgcgt 3061 caccaattca tttgtccaat aattaacaag ttcagcggct tcggcttcca ccataactcg 3121 aatcagaggt tccgtaccag aggcgcggac taaaactctg ccagaatctc ccattgctgc 3181 ttcggcgcgg gcgatcgcct gttgcaaagg ttcacattct ttccatccca aacgcttgga 3241 tttgtcttcg actcggacat tttgtaacag ttgcggatag gtttgaaagc tttgagacac 3301 catttctgat aaaggaacgc cagcttgttg caccaaagct gttatatgta gggctgtgag 3361 taaaccatct cctgtcatac catagtgacg gcaaaggata tgaccggatt gttcgccccc 3421 aagcattcct ccagtccgca gcatttccgc ctggacgtac tgatcaccaa ctgctgtgcg 3481 aatgaattga ccaccgactt gcttccatgc tctctcaaaa cctaagtttg ccataacagt 3541 ggaaacaatg aggtcacttg gtagttgttg cttttgcttt agcttttgtc cccacaggta 3601 caggatgtaa tcaccattga cttgtcttcc ggtgttgtct acagctaaaa cgcggtctgc 3661 atcaccatca aaagcaaaac ctaaatcggc attgtgttct tggactgctg cttggagaat 3721 tccgaggtga gtggaaccgc agttgacgtt gatgcgatcg ccatctggtt cgttatgtaa 3781 gtagatcact tctgccccca tctccttaaa taccgatggt gctaaaccaa caactgctcc 3841 ccatgccaag tctaaaacaa ttttcatgcc ctgaaaattc atgacaccgt gcaggggttt 3901 tttcaacgcc tcaacataac ttcccactaa gtccaagcgc gagtaatacc gtccgcaatg 3961 attacagttg gcagaagcca taccacgcac tcctgcttct atttccgttt gcaaagcttg 4021 ggatagcttc atcccatccg cgccaaaaaa tttaatgccg ttgtcttctg gcgggttgtg 4081 gctagcagaa atcatcaccc cacctatcgc attggtgatg ctggtgagat atgcaacgca 4141 aggggtagga catagcccca aatgccaaac ttctaatcct gcggctgtta accctgcact 4201 caaagacatc 
gccagcatat cgctagagtt tctggagtcc tgtccaagga taactggtcc 4261 tgggtttgaa gcatgactac gcaaaacagt acctgcccaa aagcccactt gtaatgctaa 4321 gggtgcacct agcaagtctc ccactcgtcc ccgaatacca tctgtaccaa acaggggtgt 4381 acccagaagt gatgttaagt tcaacccaaa accgccctcg ttaactttct tttccgattc 4441 atcaatattg gtcgttgaat tccctgagat agtaccttga gttcgagtta tgtatgaaac 4501 cataaataga aaaaccccac acacttacac tggagaatag cactttcaac aatattcaat 4561 aatatttaac tattcctgct ggacgccttt caatcggcaa actttactat ttattgtttc 4621 atttttcact caaagtcgtt gtaagtattt tttaagtatt tttttgtatt tataaagata 4681 ttgtaaacta ctgggtgaca tcaaaaattg ctctcaacac ttatcactag gaacttagaa 4741 atatctcgga tgatctcaac ttaaacattt ttaatataat ttctcatcaa atagttctac 4801 tgtaacgttt tagtggcttt tgtagccgtt cgtaaattag acttcagcag ttgacatttt 4861 atagtcgagc attttagcat gagcagcaat ttagcaacaa aattacgtgt cgggacgaaa 4921 aaagcccaca caatggcaga aaatgttggt tttgtcaagt gttttttaaa aggagtcgta 4981 gaaaaaaact cctatcggaa gctggtagct aacttctact tcatctactc agcgatggaa 5041 gaggagatgg agaagcacgc caatcatccg attgtttcaa aaatcaattt tcctcaactt 5101 caccgtaagg aaactttaga gcaagacctc acttactact acggggtaaa ctggcgcgag 5161 caaatccaac tatctgcggc tggtaaagcc tacgtagaac ggattcggga aatctctgaa 5221 aaagaaccag aactattagt tgctcactct tacacccgat acctagggga tttatctggt 5281 ggacaaattc tcaaaggaat tgctgaaact gcgatgaacc tttctgacgg aggaaccgcc 5341 ttctatgaat ttgatgagat tccagatgag aaggggttca aagcaaaata ccgtcaagct 5401 ttggatgaac taccaataga tgatgcgact gctgatcaca ttgttgatga agcaaacgcc 5461 gcctttggca tgaacatgaa gatgttccaa gagttggaag gcaatctcat caaagccatt 5521 ggactcatgc tttacaacag catcacacgt cgtcgtacac gcggtagtac tgaactcgca 5581 actgctgagt aaatacaatt agcataatct tttgaacaat tgctaggggc agcgcctctg 5641 ggatgactct tgtagaaagg ggaatcgcgg ggatgttgcc cctagttgcg tcttgagcac 5701 aatttacgca ttcaaaaatc aacaggcgta ggttttgact gttcatagga tgtggaaaag 5761 ggggatctgt ttatctacta cagctttatg ttaacgggcg taaaattctt aatctgatgg 5821 aattaggaga ctgctttaat atacaatgca agcgttgcca ttgaactgtt tgataaccaa 5881 cgtttcattc aaaagcgaag 
gtgcgattgt ttgaatattt tagaatttac tctattggtt 5941 tggcttagtt ctgccaccgc aggctttttg ggagcactga caggcttagg cggtggagtg 6001 gtacttgtcc ccttgttaac tatagtcttt ggcgttgaca ttcgttacgc aataggtgct 6061 tctctggtat cggtaatcgc aacttcttca ggtgctgctt ctgcatacgt taaagaaggc 6121 tataccaatt tacgcttggg aatgtttttg gaagtagcaa caacatttgg agcgattaca 6181 ggcgcaacta tagccgcttt tgttcccacc aggatactcg ctgtggtgtt tggatttgtt 6241 ttactttaca gtgcctacct atcacgtcaa cctcgttctg aacacgcaga tgatactcca 6301 cctgatccct tagcaactcg tttaaagtta aatagcactt atccaacacc tgaaagtgaa 6361 caaccttaca atgttcgtgc tgttcctgta ggatttagtc tcatgtttgt cgccggagtg 6421 ctttccgggt tacttggtat tggttctggc gcactcaagg tactagcgat ggatcaattt 6481 atgcggattc cgtttaaggt ttccactact accagtaatt ttatgattgg ggtgacagca 6541 gcagcgagtg cgggtgtgta tctaaaacga ggttacatcg atcctggact agcaatgcct 6601 gtcatgttag gagtactttt aggcgcattg ttgggagcta gggtattggt aaaagccaga 6661 gtagacgttt taagaaatat ttttagtatt gttattgtgc tgttagctat tcaaatgatt 6721 tataacggct tcctagggag gatttaaagt gtcccaaact cgacgtaatt tgagtgaaag 6781 acaagtcgaa attttggttg gtaatttatt gagatacggg gtacttatcg ctactgctat 6841 cgttttattc ggcggagtgt tgtacctaat ttatcatggt aaggaagctc caaattatca 6901 aatttttcgc ggtgagcctc cagcgttcac ttctcctgaa ggagttgcaa attcagcatt 6961 atcaggtcgt cgtcgtggca ttattcaact tggattgttg ttattaattg ctactcctgt 7021 tgctagagtt gctttttcct tattagcttt tatgcgtcag cgagatatca tttatattat 7081 tttgactgtg attgttttga ctgggctgat gattagtttg ataggtgctt aagagataat 7141 tatctacttt tttgactgtg aataattaca aaaaagttga gaggatgcaa ggcgttccaa 7201 taaggattgc aaaccttctc tccaccatca atgaccacaa gcgatagaat cgcaatggtg 7261 ctctctaatc catcctcaac ttgatgaaag ccgatcgcaa accaacccct aaaatgactt 7321 tacctacctt tggtcgccct gaactactcg cccccgcagg ttactgggac tgtgcaaaag 7381 ctgctgtgga aaatggggca gatgctattt attttgggtt ggatcggttt aatgcgcgaa 7441 tgcgggcgca aaactttacg gaggcagatt tgcccaagtt gatggaattt ctgcaccgtc 7501 gaggtgtgaa gggctatgtc accctgaata cactgatttt tccccaagaa cttagagaag 7561 cagagcaata tcttcgttcg attatcagtg 
caggtgtgga tgcggtgatt gttcaagata 7621 tcgggatttg tcgtctcatt cgtcacctat ctccagattt tcccatccat gcttccacgc 7681 agatgacgat cacgagtgca gcaggagtgg aatttgctaa atccttgggg tgtcaattgg 7741 tagtacttgc tcgtgaatgc tccctgaagg aaattgagaa aatccagcgc cagcttccag 7801 aacaccaagc ctcacttcct ctggaagttt ttgttcatgg tgctttgtgt gtagcatatt 7861 ccggtcaatg cttgacgagt gaagctttag ggggacgttc tgccaaccgt ggcgaatgtg 7921 cccaagcttg ccgaatgccc tacgagttaa tctcagatgg caaagttgtg gatttaggaa 7981 atcgcaaata tctgctgagt cctcaagact tggctgggtt agagatactg ccagaattgg 8041 ttcaagccgg ggtcagttgt ctgaaaattg aaggtcgctt gaaaacacca gagtatgtcg 8101 ccaatgtgac tcgtgtttat cgagaagccc tggatcgggt gatggcggat ttggatgtag 8161 gagacaagaa gaaattctcc acatcttttc aagaacacta caacttggag atggcgtttt 8221 ctcgcggaca gtatacgggt tggtttcgcg gtgttaacaa tcaggaactc gttcatgctc 8281 actttggtaa aaagcgcgga gtttacttgg gcgaagtcac tcgcatcagt aacgaaaaag 8341 tcatagtacg gttacaagcc ccagttaagc cgggtgatgg tgttgttttt gactgcggtc 8401 acccagaggc gcaggaacaa ggcggtcgag tttatgcggt ggaacaaaaa gctaaggaga 8461 cagtgctgac ttttggtcgc cgtgatctca acttgcgacg aatacacgta ggggacaaaa 8521 tttggaaaac caacgaccca gaacttgata agcaactacg tcaaagtttt gctggagaaa 8581 atccgcagtt tcagcgtcca attcacatag aggtgtatgg agaagttggt cagtcagtaa 8641 ctgcgatcgc ccgcgacgaa ctcggtcaca ttgtccaggt agaatctaca atcccccttg 8701 aggaggcgca caccaaaccc ctcaccccag aacgtttaca cgaacagttt gctcgtcttg 8761 gcaatactcc cttttatcta ggaagtttga caaatcacct caatggtgcg ttcatgctac 8821 ccgttagtga gttgaaccgt ttacggcggg agattgtcgc acagttggaa gagttgcgtt 8881 cccaacccaa acgctggcga cttaactttc atgcttctct acaagacttg ctcccttcca 8941 cttcaaaagt gcaagcgact cccaactccc catcgctgat tgtgcttgtg cgaaacttca 9001 agcaactaca agctgcactg gaagcgggaa tcgaaaccct ctactgtgaa tttgaagacc 9061 cccgcgctta cccagaagca gtgcagatgg tacgccaagc aagcaaagca aatactcagc 9121 actcgatctg ggttgcgcct ccgagaatta ccaaacccgg ggaaaattgg attttgcaac 9181 tggtgcgttc ctgtgaggcg gatggttatc tggtacggaa ctatgaccac ctccagtttt 9241 ttgccaatga gcgttgcata ggagatttct ctctcaacgt 
tgctaatccc ttgacggcgg 9301 actactttca gcatcaattt ggtttagaac gggtaacagc gtcttacgat ttgaacataa 9361 ctcagttaga agacttgctg acaagttgtc cacctcactg gtttgaagta acaattcatc 9421 aacatatgcc catgttccat atggagcatt gcgtcttttg tgctttttta tctcagggga 9481 ctgactacac caactgcgga cgcccctgtg agaagcatga agtaaaattg cgggataggg 9541 tagggacaga acacattctc caagccgacg caggttgtcg gaatactgta tttaatggca 9601 ctgctcaaac ggcagcagaa tacgtacagc attttataga gcttggtgta cgcaattttc 9661 ggattgaatt tctcaatgag acacctgaaa caatgacaca aacaatacat cgttatcaac 9721 aactgctgca gggggaaatg actggctctc aactgtggcg agagttaaag ttgcaaaatc 9781 agcttggtgt cactcgtggt tcaatgggag cataactgaa tcgtcaaaaa ctttgctaaa 9841 cttactcaaa atttgtaaat ttcagagttc cctgttctct gttccctatt ccctgttaag 9901 cgttccgctt catttttcat agacaaggat ggacaaccct ccacaatgtc tactc // LOCUS NODE_3374_length_9951_cov_4.3516579951 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9951) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9951) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9951 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(204..2084) /locus_tag="DP116_23755" CDS complement(204..2084) /locus_tag="DP116_23755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859563.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23755" /translation="MPVSNSVIRRYTPPTCTLEVLAQSSALSQWMGKSVLKQLQFELR FDDPRLPEEKRIAIRGDSDQLETLCAVVTGYVQEFLEMSPESFWANFSGTNFPGTEDV SVVSDETEQTNVYNPPSQKIPTRNSFTQLPHADIKIKPSDHLTHNLFLGSLANPASGP VIQLSLLQLFDLATALDEYSADVVALPNLTPRSTRSGGIPAWAPVAAVLVIGVGLAPF TWQYANRVRQQQTARKPISTEQKIALEASPSQGLSTSTAVPTLAPSSSLPSPPLPPFG STLAVPNASPSIAQTLPSVPVTPQTSAKSTFPSIPQTSPSIGNVAKAPLPPLGNPLSI SGTTKTPTISTLPKTTTPKQEIALQPKLQPNFTALTPKSEPNSTSVQKNSTPNNLSST NNTSSTSTGASLLRGTSSKDATRMPAARQGLTRSVSERDTAVTPGGNPPMPYGQASPP TALAPQGTAPSNGLAPQNNAASSLATSPQSSTSTSREPVYSQPGSRQSGTDALISRLR EARAKRANLSTEVATNSTVRSGTLFDTPQVSEARDVLKKRWQPPSGLRQSIEYSLSVG VDGTIEQILPMGKAARDYVDRTGMPLIGEPFVSPNKNGQSVKIRAIFSPDGKVQTFPE TE" gene complement(2151..2756) /locus_tag="DP116_23760" CDS complement(2151..2756) /locus_tag="DP116_23760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859562.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3038 domain-containing protein" /protein_id="PRJNA477356:DP116_23760" /translation="MLKVMHSAADSPAPTSQWEDLIKLPAPNSVHWDNIKTQLDLVLL ALETLTGIGSEAMLQAAINLNLESRVPDRVALWRLRQSNPLRKGQGGRKKLDVEEARA LVLITCYLAKQHQELIRRAVGLLEQMAEDNREPHQAALLGDYIDTFCNTYQERMEEDA TISTNELTHLGLKLLIDLLFYSGPGGHRRLWLALIDRSTKF" gene complement(3329..5797) /locus_tag="DP116_23765" CDS complement(3329..5797) /locus_tag="DP116_23765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="endonuclease MutS2" /protein_id="PRJNA477356:DP116_23765" /translation="MIQSETLELLEWSRLCQHLSTFAATKLGTIAARHLQIPASQAQS EQLLAQTKEIYELENRLGTGLSFDGIQDIGDSLERAELQGILQGDELLAIATTLAGTR NLRRVIDKHPDLTVLNDLITDLRTYPELEQEIHRCIDERGQVTDRASQKLGEIRTNLR QSRSQITQKLQNILQVKANAVQEQIITQRGDRFVIPVKAPQKDAIPGIVHDTSTSGAT LYVEPNSVVPMGNQLRQLLRREQIEEEAIRRVLTEQVAAVKPDLERLLAIVTTLDLAL AKARYSLWLKANPPRFINRDENESITLRKLRHPLLEWQHYHEQGHSVIPVDLLIQPQI RVVTITGPNTGGKTVTLKTLALAALMAKVGLYVPAREPVELPWFDMVLADIGDEQSLQ QNLSTFSGHIRRISRILSALDEDEETEKAGEAREAGEAGEEKTIPSSLILLDEVGAGT DPVEGSALAISLLQYLADHALLTIATTHFGELKALKYEDERFENASVEFDESTLSPTY RLLWGIPGRSNALTIAQRLGLKLAVVESAKTQLGGATDEVNQVIAGLEAQRRRQETKA EQAQDLLQQAERLYKEVSEKATALQERERALRVSQEVAVQQAIAQAKGEIAQVIRRLQ QGKPTAQDAQSATKALGEIADKFIPEPPPKQKVGFLPKVGDRIRIPKLGQTAEVLTAP DQDRELTVRFGIMKMTVKLEDVESLDGQKPEPSVKEPKSQRAEEPRSKGESKITTLTE TPAIRTSKNTVDIRGRRVPDAEIILDKAISEATGPIWIIHGHGTGKLRQGVHAFLKQH PRVSRYEAGEQADGGSGVTIAYIG" gene 5954..7117 /locus_tag="DP116_23770" CDS 5954..7117 /locus_tag="DP116_23770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009556227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GuaB3 family IMP dehydrogenase-related protein" /protein_id="PRJNA477356:DP116_23770" /translation="MDIEIGRGKTARRAYGIDEIALAPGNRTLDPSLADTQWRIGNIE REIPIIASAMDSVVDVRMAVLLSELGALGVLNLEGIYTRYADPEPILDRIASVGKEEF VPLMQELYAEPIKAELIEQRIREIKQQGGIAAVSATPVGASKFGSVVAKAGADLFFIQ ATVVSTAHLSPESVVPLDLAKFCQEMPIPVILGNCVTYEVTLNLMKAGAAAVLVGIGP GAACTSRGVLGVGVPQATAIADCAAARDDYYQETGKYVPVIADGGLITGGDICKCIAC GADGVMIGSPFARAAQAPGRGYHWGMATPSPVLPRGTRIRVSTTGTLEQILTGPAQLD DGTHNLLGALKTSMGTLGAKNIKEMQQVEVVIAPSLLTEGKVYQKAQQLGMGK" gene 7434..7697 /locus_tag="DP116_23775" CDS 7434..7697 /locus_tag="DP116_23775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002804222.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23775" /translation="MKFFPTLKLLLEKSHMSPRIEIAETHVSVHSSHHTPPGQASGGS LIIKLFLNHINCHMQNSVYHCKAVVFAVVEFSAKSNILIGEGL" gene 7700..8023 /gene="trxA" /locus_tag="DP116_23780" CDS 7700..8023 /gene="trxA" /locus_tag="DP116_23780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312996.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="thioredoxin" /protein_id="PRJNA477356:DP116_23780" /translation="MSEAAQVTDSTFKQEVLDSAIPVLVDFWAPWCGPCRMVGPVVDE IATQYKDQVKVVKVNTDENPNVASQYGIRSIPTLMIFKGGQRVDMVVGAVPKTTLSNT LEKYL" gene 8472..9548 /locus_tag="DP116_23785" CDS 8472..9548 /locus_tag="DP116_23785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012409106.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome D ubiquinol oxidase subunit II" /protein_id="PRJNA477356:DP116_23785" /translation="MTSSASFDTLESLQADIAKLFEDLPMSKHRRYIIQALATIVRLA HKSEEIERLDWKILSSSLTDMERGFQLFYAYRHVRKVTIFGSARLSPQTPEYRMAYEF ARYVCRLGFMVMTGGGGGIMQAGNEGAGRENSFGLNIQLPFEQQANPIIEGDPKLIHF KYFFTRKLFLLKESDAIALFPGGFGTQDEAFECMTLSQTGKFGPIPLILIDHPGGDYW RSWSEYIDKQLVQKGLVSPDDPSLYTVTDNLDVACNAITSFYQVYHSSRYVGDRLVIR LKVELSDAEVEQLNHDFSDILVTGRIEKSQALPQEAQDETFELPRLLLYFNQRDLGRL YQLIAVINRMGTPSAEEKAHPERK" BASE COUNT 2511 a 2198 c 2370 g 2872 t ORIGIN 1 tgactagtaa tgagtagaga tgggtaggga ttaagttaca gttccttacc ccgttattaa 61 aattaatttc atgttcctta gggggctggt ttgaccagcc ctcctagttt tcaccctata 121 tctacagtac ggcggcatag ggtgcgcaaa gcgcacgtag tctccaaggt gcaccgaaac 181 gtcagttcat agccctgtct gacttactcc gtttctggaa aagtctgcac cttgccgtca 241 ggactaaaaa tagctcggat ttttactgat tgtccatttt tattaggaga aacaaaaggt 301 tcaccaatca gaggcattcc tgtacggtca acataatctc ttgcagcttt tcccattggt 361 aaaatttgct caattgtgcc atcaacacct actgacaaac tgtactcaat gctttgcctt 421 
aatccagaag gcggttgcca gcgttttttc aatacgtctc tagcttcaga tacttgaggt 481 gtgtcaaaca atgtgccact acgaactgta ctattagtgg caacttctgt ggaaagatta 541 gccctttttg ccctcgcttc ccgcaatcta gaaattaatg catcagtacc actttgtctt 601 gagcctggtt gtgagtaaac tggttcacgg ctggtacttg tgctggactg aggtgaggtt 661 gcaagtgatg aggcagcgtt gttttgggga gctagtccgt tgctgggagc agtgccttgc 721 ggagccagtg ctgttggcgg cgaagcctgc ccgtagggca taggagggtt tcccccagga 781 gtgactgccg tatctctttc agagacgctt cgcgttagcc cttggcgtgc cgcaggcata 841 cgcgtagcgt ctttggagga ggtacccctc aggagtgagg ctcctgttga ggtactggaa 901 gtgttatttg tgctggacaa attgttcggt gtcgaattct tttgaactga tgtagaattc 961 ggctcgctct ttggcgttaa tgctgtaaaa ttgggctgta atttgggttg aagagcaatt 1021 tcttgctttg gtgttgtggt ttttggaagg gttgagatcg ttggagtttt tgttgttcca 1081 gatatactca acggattgcc taaaggtggt agaggtgctt tggcgacatt cccaatacta 1141 ggagatgttt ggggtataga aggaaatgtg gactttgcag atgtttgagg cgtaacaggt 1201 acactaggca atgtttgagc gatcgatgga gaagcattgg gcactgctaa ggtagaccca 1261 aagggtggga gtggcggcga aggtaaacta ctagagggag ccagtgtagg aacagcagtg 1321 gatgtggaca gaccctgtga gggggaagct tctagagcaa tcttttgttc tgtcgaaatt 1381 ggttttctgg ctgtctgctg ttgtctgaca cggttcgcat attgccaggt gaatggtgct 1441 aaacccacac ctatcaccag aactgctgca acaggtgccc aagcaggtat acccccactc 1501 cttgtagaac gaggggtgag atttggtagc gccacgacat cagcagaata ttcatccagg 1561 gctgttgcta aatcgaataa ttgcagcaga ctcagttgaa tgacaggacc agatgctgga 1621 tttgctaaag aaccaagaaa caaattgtga gttaagtgat cgctaggctt gatttttatg 1681 tctgcatgtg gtaattgggt aaaagagttt ctcgttggaa ttttttgtga agggggattg 1741 taaacatttg tttgttcagt ctcatcagag actacgctta cgtcctctgt tcctgggaag 1801 tttgttcctg agaaatttgc ccaaaagctt tcaggagaca tttccaaaaa ttcttggacg 1861 tagcctgtca ctactgcaca caaagtctcc agctggtcgc tgtcgcctct aattgcgatt 1921 ctcttttcct ctggcaatcg cggatcatca aagcgaagtt caaactgtag ctgcttgagg 1981 acagatttac ccatccactg agacaaagct gaactctgcg ccaagacttc tagcgtgcag 2041 gtgggcggtg tataccgacg gatcacagaa tttgatacag gcatggcagg aagtgcgagc 2101 aggatatttt 
taatttttga tttttgatca gagaatctca aatctgaaat ctagaatttg 2161 gttgaacggt cgataagtgc tagccacaga cggcgatgtc caccaggtcc actgtaaaaa 2221 agtaaatcta taagcagttt taatcccagg tgggttaatt cattagtgga gattgttgcg 2281 tcttcctcca tccgctcttg ataagtgttg cagaaagtat caatataatc tccaagcaaa 2341 gcagcctgat gaggttcgcg gttgtcttcc gccatctgtt ctaaaagacc aacagcacgg 2401 cgaattaatt cttggtgctg tttggcaagg taacaagtaa tcagaacaag cgccctagct 2461 tcctccacat ctaatttttt tcgtcctcct tgacctttac gtagcggatt tgattggcgc 2521 agtcgccata gcgctacacg gtctggcact cttgactcta aattcagatt aattgccgct 2581 tgtagcattg cctcggagcc aatgccagtc aatgtttcta acgctaacag caccaagtcc 2641 agctgagttt tgatattgtc ccaatgaact gaatttgggg ctggaagctt tattaaatcc 2701 tcccactgag aagtcggagc gggtgaatcg gcggcagagt gcataacttt tagcataggt 2761 gatcaggctg gtgagggctg ttgctgttac ccattttgaa accagactct caagggttaa 2821 gattggtttc caatcactat cagccgaagc cgggagctac agcttctatt actttggtga 2881 acagcagagg tctttgtctt aactctatga tgaaaaactg ttttgaaaaa agtataatcc 2941 taatcaattg tgctttatct gtcaaacaca aataactttt tttatttttt agtaacaata 3001 aacatttcta cataaatcat cgctagtatc caatcgctat tcaaaatagc ccaattgttc 3061 agtggtgatt attttagtta tactcagcgc cacaatcaaa acccggattt ttccaaaaac 3121 cggtgagacc agcgctgcgg gagggtttcc aacgccagat acctacggag ggagacactc 3181 atcaagtact ggctcctccg caggcgactg cgtatgcgca aagcgcacgc ccaaagggct 3241 aaagcgcagc gtgtccgttc ggcctcaagc cgtgcccgaa gggctcagga catacccgaa 3301 ggggtttctg tgttttgtga ttttagtttt aaccgatgta agcaattgtc acaccactac 3361 cgccatcagc ttgttctcct gcttcataac ggctgactct ggggtgctgc tttaagaagg 3421 cgtgaactcc ttgccgtagc ttacccgtgc catgtccgtg aataatccat attggtcctg 3481 tggcttccga aattgctttg tctaaaatga tttccgcatc cggtacccta cgcccgcgta 3541 tatctaccgt atttttagac gtgcgaattg caggagtttc tgtcaaggtt gtgatttttg 3601 actctccttt gctcctcggt tcctcggctc tttggctctt gggttcctta acgcttggtt 3661 caggtttttg accatctaaa gattctacgt cttctagctt cactgtcatc ttcatgattc 3721 caaagcggac agtcaactcc ctatcctgat caggggcggt taagacttct gctgtttgcc 3781 cgagtttggg tatacggatg 
cgatcgccca ctttgggaag aaatcccact ttttgtttcg 3841 gaggaggttc tggtataaac ttatctgcta tctcacctaa agcttttgtt gcgctttgag 3901 catcttgtgc tgttggttta ccttgctgca agcggcgaat cacttgagct atttcacctt 3961 ttgcctgggc gatcgcctgc tggactgcta cctcctgcga gacacgcaaa gcacgctctc 4021 tttcctgcaa tgctgtggct ttttcggaga cttctttgta taaacgctca gcttgctgca 4081 acaaatcctg tgcttgttcg gctttagttt cctgacggcg gcgttgcgct tccaacccag 4141 caataacctg attgacttcg tctgttgctc ctcccagttg cgttttcgca ctttccacaa 4201 ctgctaattt caaccccagg cgttgggcaa ttgttaaggc gttggaacgt ccaggaattc 4261 cccaaagtag gcggtaagtt ggtgaaaggg tactttcgtc aaattctaca gaggcatttt 4321 caaatcgctc atcttcgtat tttagcgctt tgagttcacc aaagtgagtg gtcgcaattg 4381 tcaacagagc atggtctgcg agatattgta ggagcgatat tgccaaggca ctcccttcaa 4441 cggggtctgt tcctgcaccc acttcgtcca gaaggattaa ggaagatggg atagtctttt 4501 cttcccctgc ttcccctgct tcccttgctt cccctgcttt ctctgtttcc tcgtcctcat 4561 ctagtgctga caaaatccga ctaatgcggc ggatgtgacc agaaaaagtg gataagtttt 4621 gctgcaggga ttgttcatca ccgatatctg ccagtaccat gtcaaaccac ggcaattcca 4681 ctggttcgcg tgcaggcaca tataagccga ctttcgccat caatgctgcc aaagctaggg 4741 ttttcaacgt cacagttttt ccgccagtat ttggtccggt aattgtgact acccgaattt 4801 gtggctgaat cagcaaatca acgggaatta cggaatgtcc ttgttcgtgg taatgctgcc 4861 actccaaaag tggatgacgt aacttccgca aggtaatgct ttcattttcg tctcggttaa 4921 tgaagcgtgg aggatttgct tttagccaca aactataacg cgcttttgcc aatgccaaat 4981 ccaaggtagt tacaattgct aacaaccgtt ccaaatctgg cttgacagca gcgacttgct 5041 ctgttaagac acgacggatt gcttcttctt ctatttgttc tcttctgagc aactgtcgca 5101 actggtttcc cattgggacg acagaatttg gttccacata caacgttgcg ccgctggtag 5161 aagtatcgtg gacaatgcct ggtatggcgt ctttttgagg tgcttttacg ggaatgacaa 5221 agcgatcgcc cctttgagta ataatctgtt cttggacggc attcgctttt acctgtaata 5281 tattttgcag tttttgggtg atttgactgc gtgattgccg taagttcgtg cgaatttcac 5341 caagtttttg gctggcgcgg tcagtgactt gacccctttc gtctatacat ctgtgaattt 5401 cttgttctaa ttctggataa gtccgtaaat cggtaattaa atcattgagt actgttaaat 5461 ctgggtgttt gtcgatgaca cgacgtagat 
ttctggtgcc agcaagagtg gtggcgatcg 5521 ccaacaattc atctccctgc aaaattcctt gcagttcggc gcgttctagg gaatcaccaa 5581 tatcttggat tccatcaaac gaaagccccg tacctaggcg gttttccagt tcgtaaattt 5641 cttttgtttg tgctaacaac tgctcacttt gggcttgcga cgcaggaatt tgtagatggc 5701 gcgcagcaat agtccccagc tttgttgccg cgaaggtaga caaatgctgg cagaggcgag 5761 accattcaag taattctaga gtttcagatt ggatcaagga ctgctaatct gacatggatt 5821 ctaaaattat tgtaacaatt cttttctcta agctgcataa cctttctaga ttaatgacct 5881 ttggaaaaac ccctgcaaat cctcccataa ttttgatacc ctatctaaaa cagatttggg 5941 agcatcaaga agcgtggata ttgaaattgg gcggggaaaa acagctcgca gggcctacgg 6001 catagatgaa attgctctcg cccccggtaa tagaacactc gatccaagtt tggcagacac 6061 tcagtggcgt atcggcaata ttgagcgcga aatcccaatt attgccagtg cgatggatag 6121 cgtagttgat gtccgtatgg ctgtgctttt gtcagagcta ggagcattgg gcgtcctgaa 6181 cttagaagga atttacactc gttatgcaga tccagagcca attttagacc gtattgcctc 6241 tgttgggaaa gaagaatttg ttcccttgat gcaagaactt tatgctgaac cgataaaagc 6301 agaattaatt gaacaacgaa tcagagaaat taaacaacaa ggtggtatcg ccgccgtgag 6361 tgcgactccc gtaggagcaa gcaaatttgg ttctgtagtg gcaaaagctg gtgcagattt 6421 attctttata caagcaactg tagtttctac tgcacacctt tctccagaat ctgttgtacc 6481 actcgatttg gcaaagtttt gtcaagaaat gcccatacct gtgatactag gaaattgcgt 6541 gacttatgaa gtgactctca atttgatgaa agcaggcgca gcagccgtac tggtgggaat 6601 tggccctggt gcagcgtgta cctctcgtgg tgtgctgggt gtcggtgtac cacaagcaac 6661 ggcgatcgct gactgtgcag cagcacgaga tgattattat caagaaactg gcaaatacgt 6721 tccagtaatt gctgatggtg gtttaatcac cggtggcgac atttgtaagt gcattgcttg 6781 tggtgcagat ggagtgatga ttggttcccc ctttgccaga gctgctcaag ctcccggacg 6841 gggatatcat tggggaatgg caactcctag tcccgtcttg ccacgtggca cgcgcattcg 6901 tgtcagcacc actggcaccc tagagcaaat cctcactgga ccagcacaac ttgatgatgg 6961 gactcataat cttttgggag ccttgaaaac aagtatgggc actttgggag caaaaaatat 7021 taaagaaatg caacaagtgg aagttgtgat tgctccttct ttgttaaccg aaggtaaagt 7081 ttatcaaaaa gcacaacaat taggtatggg aaaatgaagc gtgagaatac tgcgggcgca 7141 ttcgttaaaa ggagcgatcg cgctatgcac aagcgtaacc 
ggaaggtagg cgactgctgt 7201 gcatggtttt ccgacttgag gagccactgc gcccttggtg aggcagtgct tggctagctt 7261 agagaagttt ggatcggaca ccgcagttgt gattcctagc ccctcttagg ggcggcactc 7321 aaaggaccca gagctacgct taacgcctcc atagcgctgg caactaatga ctctttatca 7381 cccagctgta atagtcaaca ggagtgatac tattacagct gggagataag ttaatgaaat 7441 ttttcccgac tcttaaactc ttactggaaa aatcgcatat gtcgcctaga atagagatag 7501 cggagacgca tgtttccgtt cactcctcac accacactcc gcccggacaa gcttcgggcg 7561 gttccttaat aattaaatta tttttgaatc acatcaattg ccatatgcaa aactctgtgt 7621 accactgcaa agcagtggtt tttgctgtgg tagaattttc agcaaagtca aatatattaa 7681 taggcgaagg tttataggca tgtcagaagc cgcacaagtt acagactcta cctttaaaca 7741 ggaagtattg gatagcgcaa ttcctgtttt agttgacttt tgggcaccct ggtgcggacc 7801 ttgtcggatg gttggccctg ttgtcgatga aattgctacc caatacaaag accaagttaa 7861 ggtagtgaaa gttaacaccg atgaaaatcc caatgttgcg agccagtatg gcattcgcag 7921 tattcccaca ttgatgattt ttaaaggtgg gcaaagagtt gatatggtgg tcggtgccgt 7981 gccgaaaact acactatcta acactttgga aaaatatctt tgaacacaac cgccggaaaa 8041 ctgctacaga gtatttgatt tcaaagggca ttcggtaaag attggtaaac agtggagggg 8101 cgacaatcgc ctctccataa atttatgaac tttatgaact ttatgaactt tttagttcag 8161 ccaatgtttg gcgagtttct ctgttgtcag cagttattta taaaataaat gttaaattat 8221 tgtagggatg tagtttgctg cgtgtgttgc aaacacaacc accaaacaaa tcgttttctc 8281 ttatcaagat aggttggcga cgataactgt cggcagcgcc ttggattatc tttaaccaaa 8341 ggtgcttttt gttgtgaaaa aatacctaaa agtcaatagt caaaaaactt tagactttgc 8401 actccacgta acgatttatc attccacatt aaatttagat aaaatgtaat gccattttgg 8461 cagcagttct catgacctca tctgcgtcgt ttgacacatt agagtctctt caagcggata 8521 tcgctaaatt atttgaagac ttaccgatgt caaagcatcg gcggtatatt atacaggcac 8581 tagcgactat agtgcgtctg gctcacaaga gtgaagaaat tgagcgcctc gattggaaga 8641 tattgtcgtc ttctttaaca gacatggaac gcggtttcca gctgttttat gcttaccgac 8701 atgttcgcaa agttactatt tttggttcgg ctcgcttatc accacaaaca ccagaatatc 8761 gcatggcgta cgaatttgct cgctatgtgt gtcgattggg atttatggtt atgacaggtg 8821 gtggcggcgg aattatgcaa gcaggtaacg aaggtgctgg acgggaaaat 
tcttttggct 8881 taaacattca gcttcccttt gagcaacagg caaacccgat tattgaggga gatcctaagc 8941 tgattcattt taagtatttc ttcacccgga agctgtttct cctcaaagag agtgatgcta 9001 tagctttgtt tccaggtggt tttggcaccc aagacgaagc ttttgagtgc atgactttaa 9061 gccagacagg taagtttggt ccaatacctt tgattttaat tgaccaccct ggaggcgatt 9121 attggcggtc ttggagcgaa tatattgaca agcaactggt gcagaaaggt ctggttagtc 9181 ctgatgaccc cagtctttac acggtgacag acaacttgga tgtggcttgc aatgctatta 9241 ccagttttta ccaggtttat cactctagcc gctacgtggg cgatcgccta gttatccgtc 9301 tgaaagtaga gttatccgat gctgaagtag aacaactcaa ccatgatttc agcgacattc 9361 ttgtgacagg acgcattgaa aaaagtcagg ctttaccgca agaagctcaa gatgagacgt 9421 ttgagctacc ccgcctcctt ttgtacttta atcaaagaga tttagggcgg ttgtaccaac 9481 tcatcgcagt gattaaccgc atggggactc cttcagcaga agaaaaggcg catccagaaa 9541 ggaagtagcc cgttaggtcg agcctcgggc cgtaggcata gggcgttggg cactcccaac 9601 aacgtgaact acctacacgc tcccttgcgg tacagttagt cgcttcccaa ttcattgggg 9661 attacctcaa tcttcataga ttttttacgt cctaaggtaa aaaataaggt tttgagcgaa 9721 cagtaggcag tgcccaacgc cagatgctac cctatggctg acgccacgcc ttacggtgag 9781 tccagcgctg gctctccaac gcactggctc ccttacaaaa attgcaataa cgcaattggt 9841 gcactgcctc ctcatcttct tcattgttct tcatattctt gatccaatac ggttaggttg 9901 acactcccct gcctaaaggc gaggggattc tacattcatc gtcagaactt g // LOCUS NODE_3379_length_9932_cov_5.0209589932 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9932) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9932) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9932 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 558..812 /locus_tag="DP116_23790" CDS 558..812 /locus_tag="DP116_23790" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational 
analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23790" /translation="MRPALREGFPPQATGEPGGLSGMRKAHAKGQLRGREKRRSRTGL TSHLGIAPLFTPNALPHPSTVLRNAAPFLLFVLFLFIIKK" gene 1202..1888 /locus_tag="DP116_23795" CDS 1202..1888 /locus_tag="DP116_23795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NYN domain-containing protein" /protein_id="PRJNA477356:DP116_23795" /translation="MNLAGFNKERNQYKLVKEQSHWQEERHTKTTDQDKDGKMSVTKI EQSIFGNLNRGRVAIFIDGANLFHAGLQLSLEIDYAKLLCCLTENAKLLRAFFYTGVD RTNEKQQGFLLWMRRNGYRVVAKDLVQFPDGSKKANLDVEIAVDMMNLAPYYDTAVLV SGDGDLAYAVNALSYQGVRVEVVSIRSMTSDSLIDVTDYFVDIETIKQYIQKDSHSCY NYRPLSNSSL" gene 2396..3109 /locus_tag="DP116_23800" CDS 2396..3109 /locus_tag="DP116_23800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998858.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_23800" /translation="MNILIVEDEPEIAQLIQISLEKEGFCCRISSDGLNALRMFQEQS PDLIILDLMIPGLDGLEVCARIRHKPGAKDPYILMLTAKGEEIDRVIGLSTGADDYMV KPFSPRELIARVRALLRRSLRHGSQNQVYRTKHFIVDVEQRIISRQMNSQEPEMLDLT TLEFNLLSTFISNPGRVWDRTQLIDKLWGDNFFGDERVVDTHIARLRKKIEPDPTNPT FVKTVVGVGYKFEDSSVIL" gene 3106..4221 /locus_tag="DP116_23805" CDS 3106..4221 /locus_tag="DP116_23805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998857.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_23805" /translation="MTKTGWRWAKSLPLASRLFISHLLVMIVGVVSLVIISKVSSPRF FVLHLERLEQRGYDLFDVRSELVNSFEFAWRRSTVWSVFAGATAAGGLSYWVSRLIMQ RLMEMEQITQKFATGEFDARLPLSDIPELNRLGASFNRMAISIEGVEARRRELIGDMT HELRTPLTVVRGYLEELAGGTIEPTPEIYLRLAKETRRLERLVNDLQELSKAEAGYLP IKTQSLNLRPLLESMVEKFADQLLDDGPVLRLECPSKLPLVLADIDRTEQVLVNLLGN AVRHTTKGSITVRAWAEASKVWIAVTDTGTGIAKEDLPYVFERFWRADKSRDRHSGGT GIGLTISRRLVELQGGQISVESQLGSGSTFRFFLPLA" gene complement(4612..4884) /locus_tag="DP116_23810" CDS complement(4612..4884) /locus_tag="DP116_23810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860233.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GlsB/YeaQ/YmgE family stress response membrane protein" /protein_id="PRJNA477356:DP116_23810" /translation="MNIIAWVVLGLLAGAIAKAIYPGYQGGGILATIVLGIVGAFIGG ILGVFLTTGKIILTAHAGFSLPGLALAVLGALIAIFIWYSFARRTV" gene 5375..7777 /locus_tag="DP116_23815" CDS 5375..7777 /locus_tag="DP116_23815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742555.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="S9 family peptidase" /protein_id="PRJNA477356:DP116_23815" /translation="MDNFPINPPIIEQVWQSPPEPINQILDAPSPPAILLSPNREWMV ELERPLLLPISLLAETEVPLAGFLINPKTNAPARLNPFQSMRVRAIAPASPSESIASP QMSKTVTLPDNAQIGFIKWSPDSRKLAFTLTQATGLELWFVDVADFIPKRLTQPVLNA AYGEPYRWLSDETLICKFILSERPKPPSEPTVPPGPLIQENLKGKSPTRTYTNLLQNP HDEALLEYYLTSTLEKVTLDGQRTLLVESSLIHEAIPSPDSKFILLTTLQRPFSYQVP ISYFPKKIEVIEDTGKFVYQVADVPLFVRRTTKFDEVRTGRRGISWRSDTKSTLSWLE ALDEGDPTRDVPKRDALFELDAPFTDTPKQLWESEYRFHNLAWGKEDVALVSERWYDT RKERIWRIYPQAPETPPQLLLDRSYEDKYNDPGSALMTVGPYRYRVLRFAPQGNIIYL SGRGASPNGAYPFLDSFDLETGQKQRLWQCQDPYLEEIAYLLDDEAQTVITRRQSQTQ PPNYFLFNRRDNQILTALTHYQDPAPALAGVYSELVKYQRADGVQLSAKLYLPPGYER ERDGPLPMLFWVYPEEFKDKEFAGQITKSENTFSRPNRASVLFLLTQGYAVLSGPTLP IIGEGDTEPNDTYVEQLIAGAQAAVDYVVKRGIADPHRIGIGGHSYGGFTTVNLLAHT DLFRIGIARSGAYNRTLTPFGFQGEQRNFWEAQQTYIHMSPFTHAAKVKAPLLLIHGE KDTNPGTYPLQTERLYEALKGLGATVRLVVLPLEDHGYRSGEGVAHVLWEMVNWCDKY LK" gene 8270..9055 /locus_tag="DP116_23820" CDS 8270..9055 /locus_tag="DP116_23820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317414.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uridine phosphorylase" /protein_id="PRJNA477356:DP116_23820" /translation="MTDAKLYHIGFGKNDLGEQPPTIALLSGDPERARLIAQSYLHSV RLLSENRGLNSYVGILPNGKALLSATSGMGAPSTSIVVNELIQVGIRLIIRVGTCGSI QEHVSVGSIVITSAALCRQGAANDIAPVEYPAAADPFVTVALVEAARELKVKYHLGIT ASVDTFYEGQERTQSVNRHLLRSLRGVTEEYRSLNVLNYEMECGTLFKMAGVYGFRAG CVCGVVAQRTVGEEVILGQKDFATDNAIKVAVQAAQHWKEPEA" gene 9244..>9932 /locus_tag="DP116_23825" CDS 9244..>9932 /locus_tag="DP116_23825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197919.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="polysaccharide deacetylase family protein" /protein_id="PRJNA477356:DP116_23825" /translation="MQLAPLFPIFHRILKPTFPNCLWCGDANSKIIALTFDDGPHPEY TPSLLKVLDRYNVTASFFWLGACVNRSPGVAKQVCDRGHWIGLHGYDHQNFPMLSPTE LKHSLEKTQAAIYNACHLAPDKVRDVRPPNGLFTPTTLKLLNQWNYRPVMWSVVPEDW VRPGIATVVQRILQQVCNGSLIVLHDGMCGGQDVTATTEILIPQLMEQGYQFVTVDTL WQEARGVGALGS" BASE COUNT 2867 a 2076 c 2163 g 2826 t ORIGIN 1 aaattgataa tatggaaata atgataaaat ttcatataga atctcaaggt aaattctata 61 aattagcaaa tcgcaaataa caaaaccctt ttctaatttt ggataagaat cagtgcttta 121 agtcgccgtt aggaactgcg tacactagga cagatgaaaa tttttctttt cttctgagtt 181 cttggtcagg agttgctgag ttctattttc aactcggttg gacaaaaata ttatgcgtcc 241 gtcgcaagcg ttgcggacgc gagtgcgtgt ccgtaagagg cgaacgtgtc gcataccgga 301 ggtgatagcc gtgggttgag tatatccaga acgcaccctg ctttgaacac agggggcata 361 gtggtgttta tttatgtcca gttagcaatt gtgagaatat gaaaagcccc tgtagagttt 421 aaatattttg tttgtggaaa aagtcactcc aaatcctaat atcctgcaca gacgctatgc 481 ctgactccta tggcacaggt ctcatccttt gagttcacct aacgggagaa ccctagtagg 541 tacggcgacg ccttacagtg agaccagcgc tgcgggaggg tttccctccg caggcgactg 601 gcgaacccgg agggctatcg ggtatgcgca aagcgcacgc caaggggcag ttaaggggac 661 gcgaaaaacg tcggtctagg actggactca ccagtcactt gggcatagcg cctttattca 721 ccccaaacgc gctacctcac cccagcactg tcctgcgcaa cgcagcgcct tttctgcttt 781 tcgtactatt tttatttatt atcaagaaat aacgtggtat gcatgcccaa aatatacggt 841 ctttatattt ttggtgattt ttggaatttt catcagaaaa tttatcaata caatatgatg 901 ttatatatag gtgagtcgca cgcctagtta caataaggtt aatgttgata tatcactcaa 961 attatgctaa tcaataaata tgctgctttt acttgagcag gtgcatatgt tgaattagag 1021 agttcaaagt ataaagtcaa aaatcttgtt tgagctttgg actttggaaa gttaagatga 1081 gttgagtcta tttaacatta ttgacattag actctttttt gtcaaggtga taacctacta 1141 caagttatct attgtcaatt tacctaaaaa gacttgcaaa atttgaaatt cagaggtatc 1201 aatgaatctt gccggtttta acaaagaaag aaatcaatat aaactagtca aagaacaatc 1261 gcattggcaa gaagaacgtc acacaaagac tactgatcaa gataaagacg gaaaaatgtc 1321 agtgactaag attgagcaat ccattttcgg 
taatttaaat cgtggtcgag tagccatttt 1381 cattgatgga gcgaatttat ttcatgccgg tttacaactt agtcttgaaa ttgactatgc 1441 taagcttctt tgttgtttaa ctgagaatgc aaagctttta cgtgctttct tttacactgg 1501 agtagatcgc acaaatgaaa agcaacaggg ttttctgctg tggatgcgtc ggaatggtta 1561 tcgtgtcgtt gccaaagatt tagtacagtt tccagatggc tcaaaaaaag ccaatttaga 1621 tgtagaaata gctgtagata tgatgaattt agctccttac tacgatactg cagtgttagt 1681 tagtggtgat ggagatttag catatgcagt taacgctctt tcttatcaag gagttcgagt 1741 tgaagttgtc agcatacgct caatgacgag cgacagtctg attgatgtta ctgattattt 1801 cgttgacata gaaacaatca aacaatacat tcaaaaagat tcccattcct gctacaacta 1861 ccgacctcta tcgaattcta gtttgtaaaa gacaaatttc caaaagacgg cttatactat 1921 ttcactaaca tcctgataca actcactccc cttgcctacc catctcgcca gatcaacaac 1981 gacttaccca ccatgttcta accttaaagt caaatgatat tactataaac ttggctcgca 2041 agaaaagaaa cataattaag agattgtaaa tgttttttcc aaaaataacg cagaacacat 2101 aagtcaaaaa tcaggagaaa aaatagaact aaaaagaaga ataaaaatct gaattctgta 2161 caactgaaga cgaagcatca agggtacaag cccccactaa agatgcagat ttagtgggaa 2221 ataattgtgt ctttatttga atcctccact cattgcattc ctagctgcgg agttctgtgt 2281 tctgagttct tttttaataa aatttatctg taaacctcaa gtttgatggc aaaatatcgg 2341 ttacaatagc aacccaggaa atgaccttgt gcattctgct tgccaataat ttagtatgaa 2401 catcctcatc gttgaagatg aacccgaaat tgcccagtta atccaaatat ctctggaaaa 2461 agaaggtttt tgctgtcgca ttagcagcga cggtttgaac gccttacgaa tgtttcagga 2521 gcaatcacca gatttaatta ttttagatct aatgattcct ggtttagatg gtttggaagt 2581 ctgtgcaaga attcggcaca aacctggtgc aaaagacccg tatattttga tgcttacggc 2641 taaaggtgag gaaattgatc gcgttattgg cttatctact ggcgctgacg attacatggt 2701 caaacccttt agccccagag agttgattgc tagggtacga gcgctattgc ggcgtagtct 2761 ccgccacggt tcacaaaatc aggtgtaccg tactaaacac tttatagtag atgtagagca 2821 gcgcataata agtcgccaaa tgaattctca ggaaccagaa atgctagatt taacgaccct 2881 agaattcaat ttgttaagca cctttattag caatcctggt cgagtttggg atcgcaccca 2941 actcatcgac aaactttggg gagataactt ttttggtgat gagcgcgtgg tggatactca 3001 cattgctcgg ttgcgaaaaa aaattgagcc agaccctact 
aatccaactt ttgtgaaaac 3061 tgttgttgga gttggctata agtttgaaga ttcttctgtt attttatgac aaaaacgggg 3121 tggcgttggg caaagtcatt acctttggca tcacgcctat ttatctccca cttactggtg 3181 atgatagtag gagtcgttag tctcgtcatt atcagcaaag tctcttcccc tcgctttttc 3241 gttctgcatt tggaacgatt ggaacaaaga ggatatgact tatttgatgt tcgtagtgag 3301 ttggttaaca gctttgaatt tgcttggaga cgaagcactg tatggtcggt gttcgctggt 3361 gctacggcgg ctggaggatt gagttactgg gtgtctagac tgatcatgca gcggttgatg 3421 gagatggaac aaatcaccca aaaattcgcg actggtgaat tcgatgcaag actaccttta 3481 tcagatattc cagagttgaa tcgattaggt gctagtttca accgtatggc aataagtata 3541 gagggggtag aagcccgacg ccgggaactg attggagata tgacgcatga gctacgaacc 3601 ccactgacag tcgtacgcgg ttacttagaa gaacttgctg gtggaactat tgaaccaact 3661 cccgaaattt atttacgact cgcaaaagaa acaaggcgtt tagagcggtt ggtcaacgat 3721 ttgcaagaac tctctaaggc agaagcgggt tatcttccga taaagacaca gtcacttaat 3781 ctacgtcctt tattagagtc aatggtcgag aaatttgctg accaactgct agatgatgga 3841 ccagttctgc gcttagagtg tccatcgaaa ctccccttgg tattggctga tattgaccgt 3901 acagaacagg tgctggtcaa tctacttggt aatgcagtgc gtcataccac taaaggttca 3961 attactgttc gtgcttgggc tgaagcctct aaggtatgga ttgcagtcac ggatacaggg 4021 acggggattg ctaaagaaga tttgccgtat gtatttgagc ggttctggcg agctgacaag 4081 tctcgcgatc gccactctgg aggaacaggt attggtttaa ccatctcccg tcgtttagtc 4141 gaactgcaag gcggtcagat ttcagtagaa agtcagctag gatcgggcag cacgtttcgt 4201 tttttcttgc ctttggctta agtttagctt caacgggcaa aagcccctta tatcaaatcc 4261 agttaatgac ttactgcatc tggaggtgtc gctatcctgt tgataactaa atcaacattg 4321 aaaagcataa aatagagttg tagcacaagc gcgttgtgtt agcgctcttt atcacacatc 4381 taagccataa actttggtgc gttaagacgt ttgtcctaac gcaccctaca ttttacgttt 4441 tttaaggttg agcgacttaa caggatagtg acatcttccc tagttttcta caaagtgcca 4501 ccaataatta aaaacaaaat attccgaaaa aaactctaga ggatttaata attcgtaact 4561 cgcaattgta tcgatcatta cgaattacga atgataaatt attaacttgc attaaacagt 4621 tctacgagca aacgaatacc aaatgaaaat tgcaattagt gcaccaagaa cagctaaagc 4681 taaaccaggt agactgaaac ctgcatgagc agttaagatt atttttccag 
tagtcaagaa 4741 aacccctaaa atcccaccga taaacgcacc tacaattccc aacacaatag tagcaagaat 4801 cccaccacct tgataaccag gatagatggc tttagcaatt gcaccagcta aaagacctaa 4861 gaccacccat gcgatgatgt tcatttgtta aattccttcc tgttttggtt gccaataata 4921 ttaagtctaa cactgcaata cttttagtca tctttcacaa gaaataactt tagaaacctt 4981 aaggaataca acaggagaaa ggtagtgtag aactagcttg attcttctca gtaaaaaaag 5041 atgttgcaat tatgactgag attataagtc ggttagaaat ttatgagtca aaatacttct 5101 caactgatag taatgagtat ttaaatttac cgaaagtatc aggagttaac agacatgagt 5161 atagctagaa ctaagaatat tcttttatca attgaagctg agcatttcca gacgagtgtt 5221 cacgaatcct gcgcggattg ctgtagaact gaggagcttg tcttctgtgc agcaatgcat 5281 aagtgtgaat aaactagaaa agtgaccttt tcgggatgta tcctaaatca ggtgttggta 5341 atttgaaata tgagcttgcc ttcacggact aaaaatggac aatttcccga taaatccccc 5401 aattattgaa caagtctggc aatctcctcc agaaccgatt aaccaaattc tcgacgctcc 5461 atcgccacct gctattttgc tgtctcctaa cagagaatgg atggtagagt tggaaagacc 5521 attgttgctt cccatttcct tacttgcaga aacggaagtt cccttggcag gttttttgat 5581 caatcccaaa actaacgcac ctgctcgtct taaccctttt caaagcatga gagttagggc 5641 gatcgcgccc gcgtctccct ctgaaagcat cgcttccccc cagatgagta aaacggtgac 5701 gcttcctgac aacgcccaaa ttggcttcat aaagtggtcg cctgatagcc gaaaacttgc 5761 ctttactctc actcaagcca cgggactaga actatggttt gtggatgttg cagattttat 5821 tcccaagcgg ttaacacagc cagttctcaa cgctgcatat ggagaaccat accgttggct 5881 gtcagatgag acgcttattt gcaagttcat tttgagcgaa cgccccaagc ctccctctga 5941 accaaccgtt ccaccaggac ccctcattca agaaaatctc aaaggtaaaa gcccaactcg 6001 tacctacaca aatctactgc aaaatcctca cgatgaggcg ctacttgaat actacttaac 6061 ctcgaccttg gagaaagtga cgctagatgg tcaacgtact ttgttagtgg agagcagtct 6121 cattcatgaa gctatacctt ctcctgatag caaatttatt ctactgacga cgctgcagcg 6181 accattttct taccaagttc caatctcgta ttttcccaaa aaaattgaag ttattgaaga 6241 tacaggaaaa ttcgtttatc aggttgcgga tgtccctctt tttgtccgtc gtacaactaa 6301 attcgatgaa gtgcggacag gacgccgagg aatttcttgg cgtagtgata cgaaatctac 6361 actatcttgg ttagaagctt tagacgaagg tgatccaaca cgtgacgttc ccaaacggga 
6421 tgcgttgttt gaattagatg ctccttttac agatacacca aagcaactct gggaatctga 6481 gtaccgcttc cataaccttg cttggggtaa agaagatgtc gccctggttt cggaacgatg 6541 gtacgatacc cgtaaagagc gaatttggcg catatacccc caagcacctg aaacaccgcc 6601 gcaactactt cttgaccgca gttatgaaga taaatacaat gaccctgggt cagccttgat 6661 gactgtggga ccttacaggt atagagttct gcgctttgca ccacaaggca atattattta 6721 ccttagtggg cgaggagcat cacctaatgg agcatatccg tttttagata gcttcgattt 6781 ggaaacaggg cagaagcaac gcttatggca atgtcaagac ccatatcttg aggaaatcgc 6841 ttatctgctg gatgatgaag cacaaactgt gattactcgt cgccaatctc aaactcagcc 6901 acccaattat tttctcttta accgtcgcga taaccaaatt ctcacggcgc tgactcatta 6961 tcaagaccct gctcctgcgt tggctggagt ttacagtgag ttggtaaaat atcagcgagc 7021 agatggtgtg caactgtcgg ctaaattata tctgcctcct ggttatgagc gagagcgaga 7081 tggtcctctg ccgatgttat tctgggttta tccagaggaa tttaaagata aggaatttgc 7141 cggacagatt acgaaatctg aaaacacttt cagccgtccc aatcgtgctt ctgttttatt 7201 tcttctcact caaggctatg cagttctttc tggtcccacc ttaccaatta ttggcgaagg 7261 tgatacagaa ccaaatgata cttacgtaga gcaattgatt gctggagcac aagcagcagt 7321 agactatgtg gtaaagcgtg gcattgccga ccctcatcgt attggtattg gaggacactc 7381 ttatggaggg tttactacag taaatttgct tgcacatact gatttattcc gcataggtat 7441 tgctaggagt ggtgcttaca accgaaccct gactcccttt ggttttcaag gagagcaacg 7501 caatttttgg gaggcacaac aaacttacat tcatatgtct ccttttactc atgctgcaaa 7561 agtcaaagca cctcttttgc tgattcatgg ggaaaaagat accaatcctg gtacctatcc 7621 attacaaacg gaacgactgt atgaagcgct caagggattg ggagcaactg ttcgtttggt 7681 cgtgttacct ttagaagacc acggctatcg ctctggtgaa ggagttgctc atgtactttg 7741 ggagatggtg aactggtgcg acaagtatct caagtgaact accacacacc gaccgttagg 7801 cagtacggtg taggcttccc aattcaacgg ggattgcctc tagaactccg tagttctgtt 7861 ggtcttacat tccctccaag ggcaggagtc ctagttccta agacccaaac tttttcttgc 7921 aacatttacc cttaggggtt cggcacttcc ttcaagtcgg ggaacccgca agggcggagt 7981 gcctcacctt ttagcttggt tttcgcttac ggcattgatg ctcaagatgt acataattta 8041 ccatattttg ttagtttcct ccatgtcttc ttccctcgtg ctacgcacat tggttcagaa 8101 
gactgtcctc aactcactcg caattcatga gcgaaaatgt ggtttactcg tggcctcccg 8161 gtttattcgc gtctcaacac tgactgagta cagtacagtg tgagccaact tggctgttta 8221 aggtaattca tgcaacaagt tatcacacct tgggacaaaa agctgaatca tgactgatgc 8281 caagttgtat catatcgggt ttggaaaaaa tgaccttgga gagcaaccac caaccattgc 8341 tttattatct ggtgatccgg aacgagcacg gctgattgct cagtcctatt tgcactctgt 8401 acggctttta tctgaaaacc gggggttgaa tagttatgta ggtattctac ctaacgggaa 8461 agcgctttta tctgcgacaa gtggaatggg agcgccatct acaagtattg ttgttaacga 8521 gttgatacaa gttggcattc ggctgattat tcgtgtcggt acttgcggct ctattcaaga 8581 acacgtatca gttggtagca tcgtaattac gagtgctgcc ttgtgtcggc aaggagctgc 8641 taacgatatt gcacctgttg agtatcccgc agctgctgat ccgtttgtga cggtagcttt 8701 ggttgaggcg gcgcgggaac tgaaggtgaa atatcatctg ggtattaccg catcagttga 8761 cactttctac gagggacaag agcgaactca gtctgtgaat cgacatttgt tgcgctcatt 8821 gcgtggcgtg acagaagaat atcgctcttt gaatgtgttg aactacgaaa tggagtgtgg 8881 gactttgttc aagatggcgg gggtgtatgg ttttagagca ggctgcgtgt gtggtgtcgt 8941 tgctcaacgt acagttggtg aagaggttat tttgggacag aaagactttg caactgacaa 9001 tgctatcaag gtagccgtac aagcagcaca gcactggaaa gaacctgaag cataaaaaat 9061 ttacaaattt tgaaaaatat tacaaatatt tacaatgtcc tgattctatc ttttggtcga 9121 tggttaatac ttcgtaagaa agttttttca gaaataatcc ggaacagaga attaagctct 9181 cgttctagta caagtatata aaactaaaaa tagataaaac tcaaaatagc aagtaggctt 9241 aaaatgcaac tggctccctt gtttccaatt ttccaccgca tcctcaaacc aacgtttcct 9301 aactgccttt ggtgtgggga tgcaaactcc aaaattatcg cattgacgtt tgatgatggt 9361 ccccatcctg aatacacgcc atcactgttg aaagttttag accgttacaa cgtcacagct 9421 agtttcttct ggctgggtgc ttgcgtcaac cgttcaccgg gtgtggctaa acaggtgtgc 9481 gatcgcggac attggatcgg attgcatggc tatgatcatc aaaattttcc catgctctcc 9541 ccaacagaac ttaagcacag cttagaaaaa acccaagctg ctatctacaa cgcttgtcat 9601 cttgcacccg ataaagtgcg cgatgtccgt ccgcctaatg gtttgtttac acctacaaca 9661 ttaaaattac tgaatcaatg gaactatcga cctgttatgt ggagtgttgt accggaggac 9721 tgggtaagac caggaatcgc cacagtggtg cagcgaattc tccagcaggt ctgtaatggt 9781 tcactaattg 
ttttgcatga cggtatgtgc ggcggacaag atgtcaccgc aacgacagaa 9841 atcctcattc ctcaactaat ggaacaaggc tatcaatttg taacggtaga cactctctgg 9901 caggaagcga ggggggtggg ggctttgggg tc // LOCUS NODE_3408_length_9836_cov_5.690318 9836 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9836) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9836) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9836 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 106..1278 /locus_tag="DP116_23830" /pseudo CDS 106..1278 /locus_tag="DP116_23830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875631.1" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hydrogenase formation protein HypD" assembly_gap 986..995 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 1329..1484 /locus_tag="DP116_23835" CDS 1329..1484 /locus_tag="DP116_23835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457442.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="4-oxalocrotonate tautomerase" /protein_id="PRJNA477356:DP116_23835" /translation="MSHVTVQVAECHSIRLKRKLAQAVTHALVSTLNTKPEWVTVHVD KFEREKN" gene 1750..2967 /locus_tag="DP116_23840" CDS 1750..2967 /locus_tag="DP116_23840" /inference="COORDINATES: protein motif:HMM:PRK07236.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23840" /translation="MQVKADITQQNSQPLRVVIAGGSMSGLCAGLALHCIGCDVEIFE RASYQGVTPTFGSVQSHGAAVVVQMEINQFLAEHGISIPEAVGMTSCKRQYITGDGSI IWEESTPQVMISWDMLYHQLRKVFPDERYHQGNSVIGFQLSDDCVVVHFEDEREEKCD LLIGADGVDSTIRQQLILNAMPQYAGYIAWRGLIDENVLSSDVAKFFADKFTFFNGPS MQTLCYLVPGPNGELDEGKRRLNWLWYFNVSDGEELNAVMTNHQGRVRRFFMPQGEVR EEVVQQMRVVAKRYLPEIFQYFFELTDKPFIQPIYDLSVPRMVFGRVCLIGDAAFVVR PHTAAGISKAVTNAIELAQGLQESGGDVVAALEQWEPIQLAMGNYLKVLGVTLGNRSR LGHPFGEFKIHGC" gene 2957..3136 /locus_tag="DP116_23845" CDS 2957..3136 /locus_tag="DP116_23845" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23845" /translation="MVAERTCTERSRSNRSKIQKYQRNFVGWVEDMRPNNPVNVGFRF AQPNLPLIVRTYAKL" gene 3252..4355 /gene="hypE" /locus_tag="DP116_23850" CDS 3252..4355 /gene="hypE" /locus_tag="DP116_23850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407167.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydrogenase expression/formation protein HypE" /protein_id="PRJNA477356:DP116_23850" /translation="MNFSLSDSPQNFLFQKIEKVRRHQGKVRDTHITLAHGSGGKAMR DLIDDIFVNTFDNPTLSQLEDQASFDLASFMKQGDRLAFTTDSYVVDPLFFPGGDIGT LAINGTVNDLAMSGAKPLYLSCSVILEEGLAVETLRRVAQSMQAATKKAGVQVVTGDT KVVHRGAADKLFINTSGIGVIPTGVNISAHNIQPGDAVIINGELGNHGTAILIARGEL ALETDIESDCQPLNGLVETILNVCPDVHAMRDATRGGLATVLNEFALGSGVGIRLDEQ SIPVREEVKGVCELLGLDPLYLANEGKLVVVVGRENADAVVSAMKSHPAGKDACIIGE VIPSPSGVVFLKTAFGAERIVDMLVGEQLPRIC" gene 4569..4910 /gene="hypA" /locus_tag="DP116_23855" CDS 4569..4910 /gene="hypA" /locus_tag="DP116_23855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011321273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydrogenase maturation nickel metallochaperone HypA" /protein_id="PRJNA477356:DP116_23855" /translation="MHELGITQNVVAIVAEYANGTQVKRVLLEIGKLSAIMPEAVQFC FDICTQGTVLEGAKLDILETPGLAKCRQCGAEIPLEKPFGTCRCGSVHLDLIAGEELK IKEIEIEEVCV" gene 4901..5770 /gene="hypB" /locus_tag="DP116_23860" CDS 4901..5770 /gene="hypB" /locus_tag="DP116_23860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194510.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hydrogenase accessory protein HypB" /protein_id="PRJNA477356:DP116_23860" /translation="MCVTCGCSDDAEVKMTNPETGEVATMDSSNNTHHHHTHTLPDGT VITHSHSHDHVTEASQIHAKVHGTTLALEHDILAKNNLIAAQNRGWFKGRNILALNLM SSPGAGKTTLLTRTINDLKHQLPINVIEGDQETTNDAKKIQETGCKVVQINTGTGCHL DAAMVERGLQQLKPPLDSVVMIENVGNLVCPALFDLGELFKVVILSVTEGEDKPIKYP HIFRASQVMILTKIDLLPHVQFDVQRCVEYAKQVNPQIQVFQVSAITGTGLENWYEWL SSKVANLSTSTSV" gene 5971..7134 /locus_tag="DP116_23865" CDS 5971..7134 /locus_tag="DP116_23865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315144.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="PRJNA477356:DP116_23865" /translation="MSQKKAALKFVILLGIVSLCADATYEGARSITGAYLGVLGASAT VVGLVAGLGELIGYGFRLVIGYISDQTRKYWGITTLGYVFNTAVVPLLAFAGRWEVAA GLMIAERTGKAIRTPPRDVLLSHAASQVGRGFGFGLHEALDQIGAVIGPLAVAAVIYL KGGYQRGFAILIVPAVLGLCVLLVAQKLYPNPRDFEQETPTLKGEGLPQVFWIYLGAV ALVAAGYADFPLIAYHFHKGAIATEQTIPLLYAMAMGVDAVAALVFGRLFDRVGISIL AIAVFLSLMFAPLVFLGSSNLALLGMILWGIGMGAQESILKAAVAGIVPIDKRASAYG IFSAGYGLSWFLGSALMGILYDQSVSLLVGFSVVIQLAAIPILLLVGKHSIQS" gene complement(7129..7500) /locus_tag="DP116_23870" CDS complement(7129..7500) /locus_tag="DP116_23870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126490.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23870" /translation="MKNKPFLPPEDLPSTEATHLDKILRSQLEHSTGRCFFEACDQIT RALLSNCQWYITTDGGYLMLVIDCPDIVSYWHIVSNIAPIGNRLERFASSAKIRVYPP FGKGTPFEISVNEISAYRDWL" gene complement(7557..7892) /locus_tag="DP116_23875" CDS complement(7557..7892) /locus_tag="DP116_23875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873667.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23875" /translation="MISEDILAQEFMRVVTHYYPKVGELLDGCYVKVISSYWGRPPKR LRYIGIYCSEKMMPDVQTHKEILRELAENMGLVQVVFLNATRLLRDPMSNVKRIHPRL WLDLHWLTT" gene complement(8211..8522) /locus_tag="DP116_23880" CDS complement(8211..8522) /locus_tag="DP116_23880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868835.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23880" /translation="MLLTTTDAGHIALEFLMAEWNISDEDRQWFVIVNSRLTGQNWYI VELAVAGFPDRWYVQVYDTGACDPNYTFISPIHGSDRYIDMTDLPVMVAEVLFSERNV R" gene complement(8526..8777) /locus_tag="DP116_23885" CDS complement(8526..8777) /locus_tag="DP116_23885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315148.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23885" /translation="MTAQREDLYFNLIDQLLRCPNGQEPEVLEAKPELLDAGLIQTML QVASGFAHNNNPDGAQFLIHVARELSKELGLYPEIPKKE" gene complement(8786..9214) /locus_tag="DP116_23890" CDS complement(8786..9214) /locus_tag="DP116_23890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873670.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23890" /translation="MKLKLVQLLTVCTVTLVVLSPEIVVFSMVWERHMELLAQTKNST CEVNQNSTSKVGLALQEGTNSYQDSSHVEFSVTNFVKQSLNHKTQDKIVKILQWFLLL FPIILGLLVFFYDRYLVYRAGVFQQQVEMLERLWQQGIEQ" BASE COUNT 2828 a 1874 c 2163 g 2961 t 10 others ORIGIN 1 cgccagtcgc ctacggaggg agaccctcct gcagcgctgg ctcgtcttgg cggtttgtaa 61 aaaaatagtc gtctaaaaca atcaattttt tcatctaatt ttaatatgaa atacgttgat 121 gaattccgag acccgagaaa agctgatgct ttattccacg ccattcaaga attatctcaa 181 cggctcaaaa agcctatcaa aataatggaa gtttgtggcg gtcacactca ttctattttt 241 aaatatggaa ttgaagaaat tttaccggaa gcaattgaat taattcatgg tcctggttgt 301 cctgtatgtg tgatgccaaa gggaagacta gatgatgcta tttctatctc cctgaatcct 361 aacgttatct tcacgacttt tggagatgcc atgcgcgttc ctggttccca aaaaaacttg 421 ttgcaagcta aagcacaagg cgcggacatt cgtatggttt actctcccct cgatagtcta 481 caaattgcta aggacaaccc tgataaagaa gttgtcttct tcgccttggg ttttgaaaca 541 actgccccca gtactgcttt tactatcctg caagcagcat atgaaaaaat tcctaacttc 601 agtatgtttt ctaatcacgt cctcgtgatt cctgccttaa aagcactttt agataatcca 661 gatttacaac tcgatggatt tattggtccg ggtcatgtca gtatggtgat tggcactgaa 721 ccgtatgagt ttatttccca agagtatcaa aaaccgattg tgatttctgg ttttgaacct 781 ttagatatca tccaatctat ttggatggta ttggagcaga ttgtagaaaa tcgttgtgag 841 gttgagaatc aatataacag gttggtagag aagacgggaa atcaggtggc gatcgccgcc 901 atgaataaag tttttgaaaa ccgagaaagt tttgagtggc gcggtttagg ggagattccc 961 aaatctggat taaaaatccg gactgnnnnn nnnnngactg aatatgccga atttgatgca 1021 gaattaaaat ttaccattcc taatcaaaaa gctgctgacc ataaagcgtg tctttgtgga 1081 gaaatcctca aaggagtgtt gaaaccttgg cagtgtaaag tttttgggac agcttgcact 1141 ccagaaacac cgattggaac ttgcatggtt tcttctgaag gtgcttgtgc agcttattat 1201 aagtatggtc gcttttctca cctggctaaa aaaatgatgc ttgataaaac acttgacaaa 1261 gaaaaggtaa gagtgtagaa aaatcacaag tcaaaaatca acactgaaaa aatcattggg 1321 gaatgcatat gtcacatgtc accgtccaag ttgctgaatg tcattctatt cgactgaagc 1381 ggaaactagc gcaagcagtc actcacgcac ttgtttctac tttgaatact 
aaacctgaat 1441 gggtgacagt tcacgtagat aaatttgaac gcgaaaaaaa ttaggctgtt ggaggtctac 1501 tacattcaga taagcatagt ggtagacacg atcacaaaac cacaagataa ttcacgcatt 1561 caaaaatgga attcttcatt ttgaattttg agattgcgtc gcaacgctga cgcaacgctg 1621 cgcgattttg aattggagcg tgatcgcccc ttgggcggct ccccttggga gcatcgtggg 1681 tcactatcag tgatagagtc aaacctgtag actttgagca agtgagaaaa actctacaga 1741 agcatcttca tgcaagtcaa agcggacatc acgcaacaaa actcacaacc ccttcgtgtc 1801 gttattgcag gtggttcaat gagtggttta tgtgcaggtt tagcgctgca ctgtatcggt 1861 tgtgatgtag aaatttttga acgcgcctcc taccaagggg taacgcctac atttggtagt 1921 gttcaaagtc acggtgctgc tgttgttgtg cagatggaaa tcaatcagtt tctggcagaa 1981 catggtatta gcataccaga agcggtgggt atgacttcgt gcaagagaca gtatataact 2041 ggagatggca gcattatttg ggaagaatca acaccccaag ttatgatttc ttgggatatg 2101 ctgtatcatc aactgcggaa agttttccca gatgaacgtt accatcaagg taacagtgtc 2161 attggttttc agttgagtga cgactgcgtg gtggtgcact ttgaagacga aagggaagaa 2221 aaatgtgatt tactcattgg ggctgacggt gttgattcaa ccatacgtca gcaactgata 2281 ctaaatgcta tgcctcagta tgcaggctac attgcatggc gagggttgat agacgaaaat 2341 gtactttcat cagacgttgc aaaattcttt gccgataaat tcaccttttt taacggaccc 2401 agcatgcaga cactgtgcta tctggtccct ggaccaaacg gggagttaga cgaaggaaaa 2461 cgccgtctca actggctttg gtattttaat gtctcagatg gcgaagagtt gaatgcagtc 2521 atgaccaacc accaaggacg agtacgaaga ttttttatgc ctcaaggaga ggttcgagaa 2581 gaagtcgtcc aacagatgag agtagtagca aagagatatt taccagaaat ctttcagtat 2641 ttttttgaac taacagacaa acccttcatt caacctatct acgacttatc tgtaccacgt 2701 atggtttttg gacgggtgtg tttaattggc gatgctgcat ttgtcgtccg tcctcacacg 2761 gctgcgggga tttctaaagc tgtgacgaat gctattgaac tcgcccaagg gttacaggaa 2821 tctggtggtg atgtggtggc ggcgttagaa caatgggaac caatccagtt agcgatgggg 2881 aattacctca aggttttggg cgtcacttta ggaaatcgtt ctagactggg tcaccccttt 2941 ggggaattca aaattcatgg ttgctgagcg cacttgtact gagcggagtc gaagtaatcg 3001 aagcaaaatt caaaagtatc agagaaattt tgtaggttgg gtcgaagata tgagacccaa 3061 caaccctgta aatgttgggt ttcgctttgc tcaacccaac ctaccattaa tagtcaggac 
3121 ttacgcaaaa ttatgaaaaa ataaaccgca aaggacgcaa agatcacgaa gttaagagag 3181 tttaaaagag tttttgcgta agtcctaaat agtatggata aatccgctaa aatatttaaa 3241 cagaatttag aatgaacttc tccttatctg actcaccaca gaatttcctc tttcaaaaaa 3301 tcgaaaaagt ccgccgtcat caaggtaaag tgcgagacac tcatatcacc ctcgcacatg 3361 gtagtggtgg caaagcaatg cgcgatttaa ttgatgatat ctttgtcaat acttttgata 3421 atccaactct gtctcaatta gaagaccaag ccagttttga tttggcaagt tttatgaaac 3481 agggagacag acttgctttt acaactgatt cttatgttgt tgaccctttg ttttttccgg 3541 gtggtgatat tggaacatta gccatcaatg gtacggttaa tgatttagca atgagtggtg 3601 ccaaaccgtt atatcttagc tgtagtgtca ttttagaaga aggacttgct gtagaaactt 3661 tgcggcgtgt tgcgcaaagt atgcaagcag caacgaaaaa agctggcgta caagttgtga 3721 ctggtgacac aaaagtagtc catcgtggtg cagcagataa attatttatt aatacctctg 3781 gtattggtgt tattccaacg ggagtaaata tttctgccca taatattcag ccaggagatg 3841 cagttattat taatggtgag ttgggcaatc atggcacagc aattttaatt gcccgtgggg 3901 aattggcatt agaaactgat attgaaagcg actgtcagcc gttgaatggt ttagttgaaa 3961 ctatcctaaa tgtttgtcct gacgttcatg ctatgcggga tgcaacacgc ggtggtttag 4021 ccacagtgtt aaatgaattt gctctgggtt ctggtgtggg aattcgctta gatgagcagt 4081 ctatcccagt gcgtgaagaa gtcaagggcg tttgcgaact tttaggttta gatccattgt 4141 atttggcaaa tgaaggtaag ttagttgttg tggtaggacg cgaaaatgct gacgctgttg 4201 tctcagctat gaaatctcac ccagccggaa aagatgcttg tattattggt gaagtcattc 4261 cctcaccttc tggtgttgtc tttttaaaaa ccgcttttgg tgctgaacgt attgttgata 4321 tgcttgttgg tgagcaatta ccaagaattt gttaattcac cggaatgcac ctcatccata 4381 gtatgccttg aacttaagta ccctacgaga agctgcccaa agggcgtcta ctcagaattt 4441 ctttctttgc ggactaacat tatggtgcaa aatctgagtt aattcaagaa tttgttaaat 4501 ctggtacgtt gcactgcaaa aatcacctct accattgaag attgactaag cattcactca 4561 gcacttcaat gcatgaatta ggaattactc aaaatgttgt ggctattgta gctgaatatg 4621 ccaatggcac acaagtcaaa agagtattgt tagaaattgg taagctttca gctattatgc 4681 cagaagcagt gcaattttgt tttgatattt gtactcaagg aactgtttta gaaggggcaa 4741 agttagatat tttagaaacc ccaggattag caaaatgccg tcaatgtggt gccgaaattc 4801 
ctttagaaaa accttttgga acctgtagat gtggtagcgt gcatttggat ttaatagctg 4861 gggaagaact gaaaattaag gaaatcgaaa tagaggaagt atgtgtgtaa cctgtggttg 4921 ttctgatgat gctgaagtga aaatgaccaa tcctgaaaca ggcgaagtcg caacaatgga 4981 ctcatcaaat aatacccacc atcatcatac tcatacttta ccagatggta ctgtcatcac 5041 tcattcccac agtcacgacc atgtcacaga agcatctcaa attcatgcca aggtacatgg 5101 cacaactctc gctttagagc atgatatttt agcaaaaaat aatctaatag ctgcccaaaa 5161 tagaggatgg tttaaaggta gaaatatcct cgcattaaat cttatgagtt ctcctggtgc 5221 aggaaaaaca actctcttaa ctcgaaccat caatgattta aagcatcagt tgcctatcaa 5281 cgttattgaa ggcgaccaag aaacaacaaa tgatgccaaa aaaattcaag aaactggttg 5341 taaagttgtc caaatcaaca ctgggacagg ctgtcattta gatgcagcaa tggtggaacg 5401 agggctacaa caactcaaac cacctctcga ttcagtggtg atgattgaga atgttggcaa 5461 tcttgtttgt ccagctttat ttgatttagg agaacttttt aaagtcgtca ttctctcagt 5521 cacggaagga gaagataagc cgataaaata tcctcatatc ttccgcgcta gtcaggtgat 5581 gattctcacg aaaatagatt tgctgcctca tgtacaattt gatgttcagc gttgtgtaga 5641 atacgcgaag caagttaatc cccaaattca agtttttcag gtttctgcaa taacaggaac 5701 tggattagag aattggtatg agtggctatc tagcaaagta gcgaatctgt caacttcaac 5761 ttctgtatag caatccgcgc aggattcgtg aaattctctc ttctctcttt tcttggcgtc 5821 cttggcgacg ccagtcgcct acggagggag accctcctgc agcgctggct cgtcttggcg 5881 gttcgtaaaa aaatagtttt cacaaatcat ctaggattgc tatataactg atatgctagt 5941 gcgaacaatg gttacttgaa atatcagggt atgtctcaga aaaaagcggc tttaaagttc 6001 gtgattttgc tgggtattgt cagcctctgt gcagatgcaa cttatgaggg ggcgcgtagc 6061 attacggggg cttatcttgg agttttaggc gctagtgcta ctgtagtagg gcttgtagcg 6121 ggtttgggag agttaattgg ctatggtttt cgcttagtga taggttatat cagcgaccaa 6181 acgcgaaaat actggggaat tactactctt ggttatgtct ttaatacagc agtcgtgccc 6241 ctcctagctt ttgcaggacg ttgggaagtc gcagcaggac tgatgattgc tgaacgcaca 6301 ggaaaagcga ttcgtacccc tccacgagat gtgctgcttt cccatgctgc aagtcaagtc 6361 ggcagaggtt ttggctttgg cttgcatgaa gccctggatc aaattggtgc cgtcatcgga 6421 ccattagctg tagcagcagt gatttatttg aaaggagggt atcagcgtgg tttcgcaatt 6481 ttgattgtgc 
cagctgtctt agggttatgt gtgcttttag tagcacaaaa actttacccc 6541 aaccctcgtg attttgaaca ggaaacccca acactcaaag gagaaggttt accacaagtc 6601 ttttggattt atctaggtgc tgtagcactt gttgctgctg ggtatgcaga ttttcccctg 6661 attgcttatc atttccacaa aggagccata gcgactgagc aaacgattcc cttgctttac 6721 gctatggcta tgggagttga tgctgtagca gcgctggtgt ttgggcgtct ttttgatcga 6781 gttggcattt ctatcctcgc aatagcagtt ttcctgtcat taatgttcgc cccgttagtc 6841 ttcctaggaa gttccaatct tgcccttttg ggaatgattt tgtggggtat tggaatgggg 6901 gcacaggaat caattttgaa agctgctgtc gctggtatag taccgataga caaacgtgct 6961 tctgcttacg gtatttttag cgctggctac ggtttgtcgt ggtttttagg gagtgcctta 7021 atgggaattt tatatgacca atcagtcagc ttactggttg ggttttcagt cgtcattcaa 7081 ctcgctgcta ttcccatttt gttgttggta ggaaagcact caatacagtc atagccaatc 7141 tcgataagct gatatctcat taacgctaat ttcaaatggc gtccccttcc caaatggtgg 7201 ataaacgcgg atcttagcgc tactggcaaa ccgttctagc ctatttccaa tgggtgcaat 7261 attgctgact atatgccagt aactgacaat gtcaggacaa tcaatgacta gcatgaggta 7321 gccaccatct gtcgtaatat accactgaca gttagacaat aatgctcgtg ttatttggtc 7381 acacgcttca aaaaagcatc tgccagtgga atgttcgagt tgacttcgca atattttgtc 7441 aaggtgcgtc gcttctgttg aaggtaaatc ctctggagga aggaagggtt tatttttcat 7501 tggaacccct tcaggttgca aggtgggttt ggaggaagtt gggtttacga ttattcctat 7561 gttgtcagcc aatgcaaatc caaccacaaa cgtggatgta ttcgcttgac attcgacata 7621 ggatcacgca atagtcgcgt tgcgttgagg aaaactacct gaacgagtcc catattttct 7681 gctagttctc gcaaaatttc tttgtgagtt tgcacatcgg gcatcatttt ttcagagcaa 7741 taaattccta tgtatcgcaa gcgcttagga ggtcgtcccc agtaagagga gataactttc 7801 acataacagc cgtctagtaa ttctccgact tttgggtaat agtgagtgac gaccctcatg 7861 aattcttgtg ctaggatatc ttcactaatc atgaaacact atgattatgt aaggattgtc 7921 acatgattta atataacata atttggctct cttcatgtga aaaaatatcc gaaattatta 7981 taaaattaag ggtttacctt tatcacaata gttagaagtc atcatggaaa agccttccat 8041 taaaaattca taaagaaaaa gcgtagagta tgcattctat actctacgct tgtcgttatc 8101 tagcgataga aaatgagtca ttttttaaac tcaaaattat catgtttatt tattaatgtg 8161 atttttatat acacattaat 
atttaaaatt tgaattttga actctcattc ttatcgaaca 8221 ttacgctcgg aaaacaacac ttctgctacc ataactggta aatccgtcat gtcaatatat 8281 ctgtctgaac catgaattgg tgaaataaaa gtgtaatttg gatcgcacgc tccagtatca 8341 taaacttgga cataccacct atcaggaaat cctgcgacag caagttctac gatgtaccaa 8401 ttttgaccag ttaggcgaga attaacaatg acaaaccatt gtctgtcctc atctgaaata 8461 ttccattcag ccatgaggaa ttctaacgct atgtgtcccg cgtcggttgt agttaataac 8521 attatttact ccttttttgg aatttcagga tacaatccca gttccttaga tagctcacgt 8581 gcaacatgga taagaaactg agctccatct ggattattgt tgtgtgcaaa tccacttgca 8641 acttgtaaca ttgtctgtat caaaccagca tcaagcaatt ctggcttagc ttctaaaact 8701 tctggttctt gaccattggg acaacgaagt agctggtcaa tgaggttgaa atataagtct 8761 tctctttgtg ctgtcatagt tggttctatt gttcaatacc ttgttgccaa agtctttcta 8821 acatttctac ttgttgctga aaaacacctg cgcgataaac gagatatctg tcatagaaaa 8881 aaacaagtag tccaagaatt ataggaaata ataaaaggaa ccattgtaat attttgacaa 8941 ttttgtcttg tgttttatga ttgagagatt gcttgacaaa attggtgaca gaaaattcta 9001 catgagagct atcttgatag gaatttgtac cttcttgcag agctaatcca actttacttg 9061 tactattctg attcacttca caggttgaat tctttgtttg tgctagtaac tccatatggc 9121 gctcccaaac catactaaat accacaattt ctggagaaag caccactaaa gtaactgtac 9181 aaactgtgag tagttgaaca agtttcaatt tcattcagct atcatgagta actgttaact 9241 attaactatt tgggaactaa gctgtcaccc ttggggtatc tcctcctcca gacagcactt 9301 tgtaagtgct gcaaagactt gcaccaaccc atagcgttct gaaagaaaga caaagaacat 9361 ggctgcctca ctgtttttct tgcggattga attttgtaag cgcagccgtg ccctaggcat 9421 ggggcttagg gcatatttta aattttgaat ttttaagaag ccccccctcc ttcctagact 9481 ggcgtctaga agtttggggg gtagagggga atctctgcgt aaatcctata tgttttatat 9541 cagttgtctt tgctgaggag ggtttatcag tcgcagatca agattattaa attaattatg 9601 attgtactca aagcctgtca ttcttatctc atgagtgtta caaaaatttt ttggggtaga 9661 ggtgctagac ttttcgtgta gtggtactag tatttacgtt gaggggggtc tggctgttgt 9721 gtactcaaca tcaaacttat atggagtttg gtattgacaa gttttatctg ctattttata 9781 ggaaattaag cttgtactca aaaatggtat gattatataa atatatagtt gagttt // LOCUS 
NODE_3418_length_9807_cov_6.093519 9807 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9807) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9807) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9807 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..539 /locus_tag="DP116_23895" CDS <1..539 /locus_tag="DP116_23895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002797234.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="group II intron reverse transcriptase/maturase" /protein_id="PRJNA477356:DP116_23895" /translation="VWIPKPGKDEKRPLGIPTMYDRALQALIKMALEPEWEAKFEPNS YGFRPGRSCHDPIGAIHTSINHKPKFVLDADISKCFDKIDHSALLKKLNTFPTIRRQI RAWLKVGVMDKKQFQETSEGTPQGGVISPLLANIALHGMEERIKLFAETLPGSKKGNR SKLSLIRYADDVRHLTRC" gene 600..830 /locus_tag="DP116_23900" CDS 600..830 /locus_tag="DP116_23900" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23900" /translation="MTKVRGDCSMLIKQKKLEDAYNFVLQDPRDMVRAELLKPQFPVM EAYIPRLQRLKCGVTVIGWLSRRHPNFPHSER" gene 1028..2824 /locus_tag="DP116_23905" CDS 1028..2824 /locus_tag="DP116_23905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013871683.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="maturase" /protein_id="PRJNA477356:DP116_23905" /translation="MRNAETVLGIIHERGQRGLPLEDVYRQLFNPDLFLKAYGKIYRN TGAMTPGATGETVDGMSRAKIDTIISDLRYERYQWMPARRVYIEKKNSLKKRPLGIPR WSDKLLQEVIRLILEAYYEPQFNPTSHGFRPGRGCHTALSEIYSKWVGTKWFVEGDIA QCFDSLNHTVLLTILREKIHDNRFLRLIENLLKAGYLEEWRYNATLSGSPQGAILSPI LANIYLDKLDKFVENELIPKYNRGKARQPNPEWQRLQGLAQRLRKKGLFSEAHIARKL MQQVPSLDPQDPNYRRLRYVRYADDWLIGFSGPRQEAEEIKRLIGNFLRENLKLELSE TKTLISHPRTEAARFLGYDIVVLNNNQKLDRRGHRSINGQIGLKVPPDVVKSKCARFL LHGKPIHRAELIHDSVFSIVAHYQQEFQGIVEYYRLAYNLHQLNRLKWVMERSLTQTL AHKLRTSVSTIYRRYQTTLQTRNGPYIGLQVTVERGEGQKPLIANWGGISLKRNMKAV LNDSPLQTVGPRTELECRLLANTCELCGSQENVQVHHVRALKDLQKEGRNSPPYWVQI MAARQRKTLIVCQQCHMDIHAGRATQRTNTDM" gene 2896..3880 /locus_tag="DP116_23910" /pseudo CDS 2896..3880 /locus_tag="DP116_23910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316660.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational 
analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="restriction endonuclease" gene complement(3928..4134) /locus_tag="DP116_23915" CDS complement(3928..4134) /locus_tag="DP116_23915" /inference="COORDINATES: protein motif:HMM:PF13359.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23915" /translation="MFKNSKLTLNKDIQCLVDKGYLLIKKLHLNSQIPYKKPKNGKLS NEEKKKNRHLAKNRVIGDALGQRK" gene complement(4404..4649) /locus_tag="DP116_23920" CDS complement(4404..4649) /locus_tag="DP116_23920" /inference="COORDINATES: protein motif:HMM:PF13613.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23920" /translation="MKYEQVKHLTTAEFKRCCGVKPETFEQMVEVVQTHNQNKQKTGR PSKICLEDQVLMTIKYWREYRTYFHIGLSWGVAESTA" gene complement(4895..5533) /locus_tag="DP116_23925" CDS complement(4895..5533) /locus_tag="DP116_23925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216529.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="histidine phosphatase family protein" /protein_id="PRJNA477356:DP116_23925" /translation="MTLNLYLLRHGETTFSQSGNFCGETDAELTSEGMQMAESFADVY QKLKWEAVYVSPMKRTIATAKPFCDAIGMNMQLRDGLREGSYGKWETKSKSFVQENYA ENYVKWLTEPAWNAPIGGETAVDIANRSMPVIAEIQEKHPQGNVLVVSHKATIRIMLC SLLGIDLGRYRYRVNILVASVSMVKFDVNGPLLEILGDRHHIPDHIRSRPGT" gene complement(5918..6724) /locus_tag="DP116_23930" CDS complement(5918..6724) /locus_tag="DP116_23930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="uroporphyrinogen-III synthase" /protein_id="PRJNA477356:DP116_23930" /translation="MLTSIQLPLHGKRIIVTAPRNYAARLSQQLINQGGMPFLMPTIE TCPLENFTELDAALQHINAFDWIAFTSRNGIDAFFQRLEDLEINPLVLTNLCFCAIGL DAERLADLGVKVDLVPKEPSPAGVIAELAKIPDIAQKNILVPVPEVVGVPEPDVIPNF VAGLEKLGMKVTRVPTYRTRSLEKDIYEVELNLIRQGKIDVIAFSSTAEVAGFLQMVD SKSDYERCAIACFGPYTAANAKKLGLNVSIVAQDYSSFAGFAEALSVALP" gene complement(6850..8181) /locus_tag="DP116_23935" CDS complement(6850..8181) /locus_tag="DP116_23935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010996002.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amino acid ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_23935" /translation="MQKMNAAFALALTTLAVGFLAAACDTTPPNSPTGNGTSASPGGG STTTTTTTSGKGLKIGSLLPATGDLASIGQQIIPSIPLLVETVNACGGVNGEPVTLVA VDDQTDPKAGAAGMTKLATVDRVAGVVGSFASSVSTAAVSIAAQNKVMLISPGSTSPV FTEQAKQGKFQGFWARTVPPDSYQGPAIAALANKRGYKKVSTVVINNDYGVGFEKAFV QAFEKSGGTIVNKNNPVRYDPKATTFDTEASAAFAGKPDAVLGVFYVETGSLLLKAAY QQGLLQGVQVMLTDGMKSNEFPGQVGKTSDGKYIVSGVIGTVPGSNGKALEALTKLWQ SKRGGSAPGEFAPQGWDAAALLALAAQAAKENTGDAIAKKIREVSNGPGVEVTDVCEG LKLLKEGKKINYQGASGNVDVDQNGDVIGIYDVWTVGDDGKIKTIDKVSPK" gene complement(8501..8785) /locus_tag="DP116_23940" CDS complement(8501..8785) /locus_tag="DP116_23940" /inference="COORDINATES: protein motif:HMM:PF13358.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23940" /translation="MGRGVCSGCFSSHKVSGIREAIESVGAHLIYLSPYSPDFSPIEN FWSKVKDFLRSQETRTYPDLDKAITDALETINLSDILGWFKHCGYRTASN" gene 8971..9480 /locus_tag="DP116_23945" CDS 8971..9480 /locus_tag="DP116_23945" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23945" /translation="MNQNLPQKKLDRESLYQLSKEQLADIIIEQAIAIQQLQATIKEL KQEIQQLRVSRDLDSKTSSKPPSGDLLKKPEKQNSETEPHSATQKRKPGGQPGHIGKT RKGFDRVDRDQVLRPQICLACGNTEFATEPVKVETQQVAQLVERPIEIVEYHRHSCQW SALWSYTER" gene 9570..>9807 /locus_tag="DP116_23950" CDS 9570..>9807 /locus_tag="DP116_23950" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="IS66 family transposase" /protein_id="PRJNA477356:DP116_23950" /translation="MKNNQVLLWELGEIDIGVGSLVTTNERVEQAIKPSILELSSWVQ QEQPNIHVDETPWSVKGLKKEVRSQNSEFRINRMG" BASE COUNT 2969 a 2164 c 2066 g 2608 t ORIGIN 1 gcgtatggat tccaaaacct ggtaaggacg agaaaagacc tctgggtatc ccgacaatgt 61 atgaccgcgc tttgcaagct ctcataaaaa tggcacttga gccagaatgg gaagctaaat 121 ttgaaccaaa ctcatatggg ttcagaccag gacgctcatg tcatgatcct atcggagcta 181 tacatacaag cattaatcat aaaccaaaat ttgtgcttga tgccgacata tcaaagtgct 241 tcgataaaat cgaccactcg gcgctgctga aaaaattaaa tacattcccc accattcgcc 301 gtcaaatacg tgcttggtta aaagtgggag tgatggacaa aaagcagttc caagaaacat 361 ctgagggtac accgcagggt ggcgtgatat cgccgctttt agcgaatatt gcccttcacg 421 ggatggaaga acggattaag ttatttgcag aaacattacc tggtagtaaa aagggtaatc 481 gctcaaaact atcattaatc cgatacgccg atgacgtgcg ccatttaacc aggtgctaga 541 ctcctctcac agagaggaga ggttaggaaa agatgacctc tgggtcaaca acttacctaa 601 tgacgaaagt cagaggggac tgtagcatgt tgataaagca gaagaagtta gaagacgctt 661 ataatttcgt tctgcaagac ccgcgtgata tggtgagagc cgaattgctc aaacctcagt 721 ttccagtgat ggaagcgtac atcccccgat tacagcgtct caaatgcggg gtcactgtga 781 tagggtggct aagtcgtcgt cacccgaact ttccgcacag tgaacggtaa ggaacgacct 841 tacttaagga aacggacgag gcacctaagt cgcgctcgat acatctagcg cttagggaaa 901 gtgggattac ccgccgtgcc gagaggcaca gggtaacgga gtcatcgtag tactcagcgg 961 acgggaaagc cgtctacatg gggaaggatg acaggtgtta tgaaactcga cttgtgaggt 1021 acgcgagatg cgaaatgccg aaactgtgtt 
gggaattata catgaacgtg gtcagcgggg 1081 actgccactg gaagatgttt atcgacaatt atttaacccc gatttgttct taaaagctta 1141 cggtaaaatc tatcgtaata ctggggcaat gacaccaggt gcaacaggcg aaaccgtaga 1201 tgggatgtct agagcaaaaa ttgatactat tatttctgac ttacgttatg aaaggtatca 1261 atggatgcct gcgcgtcgtg tttacattga gaagaaaaac tctctgaaaa agcgcccact 1321 tggcattcct cgttggtcag ataaattgtt gcaagaagtt atccgcctaa tattagaggc 1381 gtactatgag ccgcagttta atccaacctc tcacggcttc cgtccaggac gcggatgtca 1441 tacagcctta agcgaaatat acagtaagtg ggtaggaaca aaatggttcg tggaaggtga 1501 cattgctcaa tgctttgact cactcaacca tacagttctc ctcaccatac tgagagagaa 1561 aatccacgac aaccgctttt tgaggttaat agaaaacttg ctgaaagctg gatatttaga 1621 agaatggcgg tataatgcca ctctaagtgg aagtccacaa ggtgcgatac taagcccgat 1681 tcttgcgaat atctacttag acaagcttga taaatttgtt gaaaatgaac taattccaaa 1741 atataaccgt ggtaaagcac gccaacccaa tccggaatgg caacgactac aagggctggc 1801 tcaacgtttg agaaagaagg gtctatttag tgaggcacat attgctcgca aactaatgca 1861 gcaagtacca tctttagacc cacaagaccc aaactatcgc cgcctacgtt acgttagata 1921 tgccgatgat tggctaattg gctttagtgg tccacggcaa gaagctgaag agattaagcg 1981 cttaatcggc aatttcttga gagaaaatct caaactcgaa ttgtccgaaa ccaagacgct 2041 aatttcccac cctcgaactg aagctgctcg ttttcttggt tacgacattg tagtacttaa 2101 taacaaccaa aagcttgata ggcgcggaca ccgtagcatt aacgggcaaa ttggtttaaa 2161 agtaccacct gatgtggtca aaagcaaatg tgcccgcttc ctacttcacg gtaaaccaat 2221 acaccgtgca gaattgattc acgattccgt ttttagtatc gtggctcatt atcagcagga 2281 atttcaggga atcgtggaat attatcgatt ggcttacaac ttacaccaac taaatcgttt 2341 gaaatgggta atggaacggt cattaactca aacgctcgcg cataaactgc gtactagcgt 2401 atcgacgata taccgtcgct atcaaaccac acttcaaact cgcaatggcc catatattgg 2461 attgcaagta acggttgaac gtggagaagg acaaaagcca ttaatagcta actggggagg 2521 aatctctctg aagcgaaata tgaaagcagt cttaaatgat tctcctttac aaactgtagg 2581 tccaagaaca gaactcgaat gccgactctt ggctaatacc tgtgaacttt gtggttcaca 2641 agaaaatgtt caggttcatc atgttcgggc acttaaggac ttacaaaagg aaggaagaaa 2701 ttcaccgccc tattgggttc aaattatggc agcacgacaa 
cgtaaaacct taattgtgtg 2761 tcaacaatgt catatggata tccatgcagg acgcgcaacc caaagaacta acactgacat 2821 gtaaacactg gaaagccggg tactccgaaa ggcgtaagcc cggtttgggg aggggcggac 2881 agaaaagtgc ttttagtaac tcgctggtcg cctactctac ttcgtcatct tacacgaaga 2941 cataactgtt gtccaaagat gtcagattat tatatctgag tggttaaaag gcatgggttt 3001 ggaattgaag ccatcaaaaa cacgcttaac tcatacctta aataagtatg agagtgaaga 3061 accgggattt aactttcttg gtttcaacat taggcagttt aaagtgggta aatatcactc 3121 aaaacaaggc ttcaaaacaa tcatcactcc aagcaaagaa aagcagaaga tacattacga 3181 acgaataact agtatcattt acgaccacgg tcaagcgcca caagtagcgt taataagccg 3241 actcaacccg gtaatccgag gatggtctaa ctactattcg acagtaataa gtaaagagtc 3301 ttactctaag ctagagcatc tcatgtatct taagctaaaa gcctgggcag aataccgtca 3361 cccaaataaa tcaggagaat ggatagccaa gaaatattgg cgaaccattg gcggtgataa 3421 ctgggtattc gctacaccga ataaaggtaa aaatccaatg cgtttaatga aacatacgga 3481 aacgcctata attcggcacg taaaagttaa aggcgatgcc agtccatacg atggcaacct 3541 agtttattgg agttcaagaa tgggcacaca tcctgaaatg ccaaaaagag tggcaacact 3601 tctaaagcaa caaaaaggaa aatgcgccca ctgcggattg tatttccgtg aggaagatgt 3661 actggaaatt gaccacaaaa tacccctaaa gaaaggggga aaagatgaac ataaaaatct 3721 ccagttatta catagacact accacgatgt taagacagct aaggatgact tggttggagg 3781 tatgcactta gaaaagcacc aaattactga ggagccggat gaagcgaaag tttcacgtcc 3841 ggttttgaag acgagtcgct ctggtgacgg agcggcttag tttaatagta cccttgtgca 3901 cccacgaatc aacaaacaaa ggctggctca ttttctttgt cccaatgcgt caccaattac 3961 cctattttta gctaaatgac gatttttctt tttctcttca ttgcttaact taccattttt 4021 cggctttttg taaggaattt gactattgag atgaagtttt ttaattaaga gatagccttt 4081 atctaccaag cattgaatgt ctttatttaa cgttaatttg ctgtttttaa acaatttaaa 4141 atcatgggtt ttacccttag cgtgcgctgt acaaataatt tcaccagtct tctgattaac 4201 aactacttga gacttgaaag tatgtctttt tttcttgcca ctgtaatata gtttttgttt 4261 tttttaggac gttcaatagg gctttctgtc acatcaatga caactacgtc tatattgaaa 4321 tcagccgaaa gtaattgttt ttttcctggt agggtgaata ttctagactg aatcaaaata 4381 ttttctactt tccgcttagg agatcaagct gtagattctg ctactcccca 
agatagacct 4441 atatggaaat aggttcgata ttctctccag tacttaatgg tcattaatac ttggtcttcc 4501 aaacaaattt tgcttggtct tccagttttt tgtttatttt ggttgtgggt ctgcactact 4561 tcaaccattt gttcaaatgt ctccggtttg acaccacaac accgtttaaa ctctgctgtt 4621 gttagatgtt taacttgttc gtatttcata aggttatctt atgatatgaa cctagcgtta 4681 agcgtagctc tgccgtaggc aatcgccttt taccagtccc catcttaccc cgactgggac 4741 ttttgcaaga ggtctattca attctttgtt cattaagata gttcctgaat gtcctattgg 4801 atcggaatgg ataaatgcac aacagcttat ttccagtcac gatcattatc tgtccactta 4861 acttgaatgc tgctgatacg gaggcaagcg actattatgt tcccggacga gagcgaatat 4921 gatcgggtat atgatggcga tcgcctaata tttctaacaa aggaccatta acgtcaaatt 4981 taaccatact taccgacgcg accaaaatat tcacccgata gcgatagcgt cccaaatcaa 5041 ttcccagtaa actgcaaagc ataatccgaa tcgtagcttt atgggatact actaaaacat 5101 taccttgggg atgtttttct tgaatttcag caattacagg catagaacgg ttagcaatat 5161 ctaccgcagt ttctccacct attggtgcat tccaagccgg ttctgtcaac cattttacat 5221 agttttctgc ataattctct tggacaaacg atttactctt agtttcccat ttgccgtaac 5281 taccttctct aagtccgtca cgcaactgca tattcatacc gatagcatca cagaatggct 5341 tagcagttgc aattgtgcgc ttcatcgggc taacataaac cgcttcccac ttcaattttt 5401 gataaacatc ggcaaaactc tctgccatct gcatcccttc agaggtcaac tccgcatcag 5461 tttcaccgca gaaattacca ctttgactaa aagtagtttc tccatgtcgc agtaaatata 5521 aattgagtgt catagcttgt atcgcgattg ggataaagga gtttgcgcaa aaataaaata 5581 ccatcaatct cgcaatcaag tatgtgacac caggacaaag taatcaacca aaaaagtcaa 5641 tcaatcacaa acttttgcga tcgcgtaagc gcaaagcgca ggctccgcca acgcgtagcg 5701 tgtccgttcg gcctcaagcc gtgcccgtta ggcgcagccg tggcggaacg ccatagggct 5761 caggactcag cgggcgctct gcgccatcgc cctgagtgag tctcccgtat gggcgatcgc 5821 cgcaactttg ccaaattcag cttatatgcc aaccgccgca agtgagtacg aagaatttaa 5881 taccaaaaag ttattgtaat gcccaaagag cgattgccta cggcagagct acgcttaacg 5941 cctcagcaaa ccccgcaaat gaactataat cctgggcaac tattgagaca tttaagccta 6001 acttctttgc atttgctgcg gtatagggtc caaagcaggc gatcgcacaa cgttcgtaat 6061 cactttttga atcaaccatt tgcaaaaaac ctgcgacttc cgcagtacta ctaaaggcaa 
6121 tgacatctat tttcccttgc cgaatcagat ttaactcaac ctcataaata tctttttcta 6181 aactccgcgt cctataagtt ggtacacggg taactttcat gcctaatttt tctaaccctg 6241 caacaaagtt tggaatcaca tcgggttccg gaacgcctac aacttctgga actgggacaa 6301 gtatattttt ctgagcaata tccggaattt tagccaattc agcaataact cctgctgggc 6361 tgggttcttt cggtactaaa tcgactttga ctcctagatc tgctaatctc tctgcatcta 6421 atcctattgc acaaaaacat agatttgtca agacaagggg gtttatctcc aaatcctcta 6481 gacgttggaa aaatgcatcg ataccatttc tactggtgaa agcaatccaa tcaaatgcat 6541 tgatatgttg cagtgcagcg tctaactcag taaagttttc tagtggacaa gtttcgatgg 6601 ttggcatcaa aaacggcata cctccttggt taatgagttg ttgagataac ctggcggcat 6661 aattacgagg tgcggtaacg ataatccgct taccatgtaa gggcagttgt atagatgtaa 6721 gcagtttgat agacctcctt tcatcccacc actaacccat aggggtatag tgggggagtc 6781 gaattttgtt aaacttctgg gcagcgcgtt agccgagagt acggcatcat acctcttatg 6841 ttttaacttc tacttcgggc taactttatc aatcgtcttg attttaccgt cgtccccaac 6901 tgtccagaca tcgtagatac cgataacatc gccgttttgg tcaacatcca cgttaccact 6961 ggctccctgg tagttgattt ttttaccttc tttcaataac ttcagtccct cgcatacatc 7021 agtcacttct accccaggac cgttggaaac ttcgcggatt ttcttagcta tggcatcacc 7081 tgtattctcc ttagcagctt gtgctgctag cgctagcaag gcggcggcgt cccagccttg 7141 aggagcaaat tcacctggtg cagatcctcc tcttttagat tgccagagtt tggtaagagc 7201 ttctaaggct ttgccgttag aacctggtac cgtaccaatc actccagata cgatatattt 7261 accatcactg gttttaccga cttgtccagg aaactcattt gacttcattc catctgtcag 7321 catcacctgc actccctgca gcaaaccttg ctggtacgct gcttttagta gcaaactacc 7381 tgtctctacg taaaaaacgc caaggactgc atcaggcttg ccagcaaaag cagcagatgc 7441 ttcagtatca aaggtggttg ctttgggatc gtagcggaca gggttatttt tattaacaat 7501 agttcccccc gatttttcaa aagcttgcac aaatgctttt tcaaagccga cgccatagtc 7561 gttatttatg acgacggtag aaactttttt ataacctctt ttattagcaa gtgcggctat 7621 ggctggtccc tggtagctat caggtggaac agtacgtgcc caaaatcctt gaaatttacc 7681 ttgttttgcc tgttcagtaa atacggggct ggtactacca ggagaaatca gcatgacttt 7741 attttgagct gcaattgaga cagcagcagt agaaacgctg ctggcaaagg aaccaacgac 7801 
acccgcaacc ctatctacag ttgccagttt ggtcatacca gcagcaccag ctttaggatc 7861 agtttggtca tcaacagcaa caagagtcac aggttcaccg ttcaccccac cgcaagcgtt 7921 gacagtttcc acaagtaagg ggattgaggg tatgatttgc tgtccaatag aagccaagtc 7981 accagtggct ggtagcaggg aaccaatttt caatccttta ccactagtag tagttgttgt 8041 tgttgttgaa cctcctcctg ggctggcact tgtgccatta ccagttgggc tattaggggg 8101 ggtagtatca caagcagctg ctaggaaacc aactgccagg gtagtcaaag ccaaagcaaa 8161 ggcagcattc attttttgca tatattattt agtttcactc ctcatttatt caggagccta 8221 aaagtatgat tcaattttga cgatgaatta aattctttag ttgctccaga tgataaataa 8281 taacatcccc tttagaggat gtctgagaag tcaaaaatgt caagtccaac gtagcgacgt 8341 tgagggcttt acgcttacgc ctaattactt acaataaaaa cacctcaccc ccatcacctc 8401 gcggaaatcg cggagaggga atgagggtga ggttcttcgt tttttataaa cctatcttac 8461 ttttccgtta cacaatacct attcacaact atagcggttt tcagttggat gcagtacgat 8521 aaccacaatg tttaaaccaa ccaagaatgt cacttaaatt gatagtttct aaagcatctg 8581 taatcgcttt atctaaatca ggataagttc ttgtttcttg tgaacgtaaa aaatctttaa 8641 cttttgacca aaaattctcg atgggcgaga agtctgggga atatggtgat aaataaatta 8701 aatgagcacc aacagattca atcgcttctc taattcctga caccttatga gagctaaaac 8761 aaccactaca cacgcccctt cccataaatt cgggactaaa acttgatgaa tataaataga 8821 aaatgcattt ttatcatttc cacccttaaa actcatcgcc ccaacaatcc cacgtaaagc 8881 gttaagcgaa gctctaccga aggtaatcgg taaggattca ggtgacgggg gtattgacaa 8941 aggggacgct tgagggagag tatcgcaatc atgaaccaaa acctgcctca aaaaaaacta 9001 gaccgggaaa gtctgtacca actgtccaaa gaacagctgg cagatatcat cattgagcag 9061 gcgatcgcca tccaacagtt acaggcgacg ataaaggaac tcaaacaaga gatacaacaa 9121 cttcgtgtca gtcgagacct ggatagtaaa acttcatcaa aaccaccatc tggggacctc 9181 ctgaaaaagc cagaaaagca aaactcagaa actgagccac actctgcaac tcaaaaaaga 9241 aaaccaggag gacagccagg acacattgga aagacccgca agggatttga tcgtgtagac 9301 cgtgatcaag ttctccgtcc acaaatatgt cttgcttgtg gtaacacaga atttgcaacc 9361 gaaccagtca aagtcgaaac gcaacaagta gctcagttgg tagaacgtcc gattgaaata 9421 gtggagtacc accgccacag ttgtcagtgg agcgcattgt ggagctatac agaacgctaa 9481 ttggtcgcca 
caaataattc cacggcaaga cttgggagtg aggctgcaag cattcttagg 9541 atggatgggc aattatggac atctgccata tgaaaaacaa tcaagttttg ttatgggaac 9601 tgggagaaat cgacattgga gttgggagtt tggtaaccac caatgagcga gtagaacaag 9661 caatcaaacc aagtatcttg gaattgagta gctgggtaca acaagagcaa ccaaacattc 9721 atgtggatga aaccccctgg tcagttaaag ggttaaaaaa agaagtcaga agtcagaatt 9781 cagaattcag aattaatcgg atgggga // LOCUS NODE_3432_length_9775_cov_3.123765 9775 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9775) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9775) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9775 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..907) /gene="recA" /locus_tag="DP116_23955" CDS complement(<1..907) /gene="recA" /locus_tag="DP116_23955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012912710.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="recombinase RecA" /protein_id="PRJNA477356:DP116_23955" /translation="MAKKSKAVAKESGGALAAALERDPNLKNTLAQIEKAFGAGSIMA LGSGEAPKIEGVPTGSLSLDIALGGQGIPKGRIIEIFGPESSGKTTLALHVIASAQKA GGIAAFVDAEHALDPSWAKKLGVELDTLLVSQPTSGEEGIQITEMLVRSNAVDVIVVD SVAALVPQKELDGDIGDSHVGLQARLMSQAMRKLTGAIAKCKTVVIFINQIREKIGVM FGSPETTPGGRALKFYSSCRIDVRRISQLKDGEEVVGQRVRAKVVKNKVAPPFRVAEF DMMHTNGISVEGDVLDLAMEHKLVVR" assembly_gap 1276..1285 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1287..4934) /locus_tag="DP116_23960" CDS complement(1287..4934) /locus_tag="DP116_23960" /inference="COORDINATES: protein motif:HMM:PF00072.22,HMM:PF00512.23,HMM:TIGR00229" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23960" /translation="MAGVKLGCIGANRGQCRSPDDPTPRDPVWTGQKGLLLSMTTAPG APIVQRLRDYGVAAFAVAIATVLRMSLSPFMADRAPFAFLFVAVLLVARFVGFGPAAF ATLVGAATTAFFIVDPVGSLIIRDPGHQVAFVVYIVVGFGMAGLGGLMRVAQDRADRL AAEAILQREELRITLASIGDAVVVTDADAKIVSLNPVAELLTGWTSAEAVGRPLGEVF VIVNEETREPVESPVDRVIREGVIVGLANHTVLVARGGKETPIDDSAAPIFNAQQRLV GVVLVFRDVAERRGVETALRESKRGLEQLADSMPQIVFASSAAGQPEYYNRRWYEYTG VASGDFGRESWAPFIHPDDLPRVTAEWTRCLAAGEPWEAEMRLRDKHGEDRWHLVRSV PVRDDAGKIVRWFGTSTDIHERIVVERRLQTKARVLESMAEGVSLTDENGIIVYTNPA EDRMFGYEPGELLGQHVRVQNDYPPEENERRVAAVIDELNRGGVWTGEWANRRKDGTS FVTQARITALEAAGMRYFVCVQEDVTERRAAAESLRASEERLRLALDAGRMGTWDWNV RTNAVTWSPSLEAIHGLPAGTFAGTFEASVADVHPDDRERVLGAVRRTLEEDLDYHLE YRVVWPNGSQHWVEARGQLLKDERGRPERMTGVCVEITERKRAEHDAKFLADASATLA GLVDMEATLQKVATLAVPAFADWCAIDMLDETGALKRVAVVHIDPSKVELAHVAHRKW PPDPKAPTGVWRIIRTGESEITPEITDEMILGSVKDSEFAQILIDLGLRSYMAVPLEA RGRVLGVITFISAESGRRFEPRDLALAEDLAHRAAVAIENSRLYHEVREADRRKEDFL SLLAHELRNPLAPVRNGLEILKMAGRDPNAVDMVVEMMERQVQHLVRLVDDLLDVSRI MRNKIELRRERIDLVAVIERAVEIARPGVDAGGHTLVVSPPAEPVPLHGDLVRLVQVV GNLLNNAARYSDPGGKIELSGERVGDRAVIRVRDRGIGIAPEMLSKIWDMFVQGDRRP 
SRSHGGMGIGLTLVRSLVEMHGGRVEARSEGLGRGSEFTIHLPVADAPEPTPASAAAP AIEAAKPRRLLVVDDNVDAAESLAVLLRLGGHDARVAHDGPTALEMAAADPPELAFLD VGMPVMDGYELARRFRSHPSLKDVVLVALTGWGQESDRRRTKEAGFDAHEVKPVEPAA LQRLLADGLQS" gene complement(5029..6975) /locus_tag="DP116_23965" CDS complement(5029..6975) /locus_tag="DP116_23965" /inference="COORDINATES: protein motif:HMM:PF02518.24,HMM:TIGR00229" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PAS domain S-box protein" /protein_id="PRJNA477356:DP116_23965" /translation="MDFFVHLFDTSDFPARWYCGNWSAAHGWLHVASDLGVWSAYFAI ALALGYFVVRRRDLPLRGLLALFGAFLLLCGTTHLLDAAVFWWPAYRLAALFKLATAV VSWATVVALVRAAPIVLTTRAPAELEREIAARLAAEETLQRNNQHLEARVRERTAELE RTVASLRDLTRTLEGRVAERTTALAASENRFRAIFNSMFQFIGLMEPDGTLLEANETA LVAGGITRDEAVGRPFWETRWWTVSPETQSRLREAIARAAQGELVRYEADVRGAGDRV ITIDFSIKPVFDDAGRVVLLIPEGRDVTDRKRAEAALRESEEQFRTAFDAAPIGMAIV APDGRWLRVNEALRELVGYSEEELLRTTFQTLTHPDDLETDLSLVRDVIAGRRRTYQL EKRYFRKDGRIVDVLLSVGLVRDDNGAPVHFISQIKDVTEQKRAERQTRASLKEKELL LKEIHHRVKNNLQIVSTLLDLQSEYIDDPRALAMFQECRGRVRSMALIHERLYRSHDL ARVDFAEYVRRLADDLYHAYKLSDDEVRLELDVEVPPLPIDVAIPCGLLLNELMSNCF KHAFPDSMEGRVRVSLRPDGPDAIVLVVADDGVGLPPDFDFRAATSFGLQLVNTLAEQ LGAEIDSPRGDGARFVVRFPAPTR" gene 7344..8402 /locus_tag="DP116_23970" CDS 7344..8402 /locus_tag="DP116_23970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010034810.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="gfo/Idh/MocA family oxidoreductase" /protein_id="PRJNA477356:DP116_23970" /translation="MLRIGVVGIGFMGMIHYLAWRRVSRARVTAISTRNAQRLAGDWR DVKGNFGPPGEVMDLSGVQKYARYEDLLADPEVDVVDLCSPPDKHCEMTLAAFAAGKH VLVEKPIALHPHEAERMTAAAKESGKRLLVAHVLPFLPEFAYAVDAIRGAAFGKFLGG FFRRIVSEPTWIPDFFNPNTVGGPIVDLHIHDAHLIRVLAGMPASVVSSGRTRGDVVE FFTTQFRFADPKLHVVAQSGVIAQQGRPFTHGFEIHLEGATLVFDSQALVGQTEGTGV PLTVLDKSGGSFRPPLGSSDPVDGFVAELTEAADAIETGRPSPLLAGELARDALVLCH RQTEAVRSGEIVAVAAVR" gene 8428..9567 /locus_tag="DP116_23975" CDS 8428..9567 /locus_tag="DP116_23975" /inference="COORDINATES: protein motif:HMM:PF13432.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23975" /translation="MNAATRVIVVALAAAIGMASLGDASRAAQDSAETRRAKAAYDRA IGYYQQRQYVLARAKFDEALAADAKFADALVGRGLTSIAEGKMLDALHDLDAAVAINP KYTAAWFYRGGVLGGLNRNDEAVASFDKVVALDPKMAAAHHDRGLALDKLGKYAEADA AYAQALSIREDAQTYVDRAHARTRLGKLSEALADLDRAIALAPKHAQAYFNRGRLREK LGRRDEAAADRKAALEHDPDFAAYCVQQAVRSLNQGRPLDAREMCDAAIDLSPKLGPA YVARGMSFVATSDWKRAEDDFQTALGLDPKSADAHCGLGVVQEARKELPAADAAFSRA IELAPQNPQFWRRRGAVRSALGQARTAESDFAQAAALEEAHRRKE" BASE COUNT 1456 a 3457 c 3380 g 1472 t 10 others ORIGIN 1 aacgaacgac gagcttgtgc tccatcgcca ggtcgagcac gtcgccttcg acgctgatgc 61 cgttggtgtg catcatatca aattcggcga cgcggaaggg aggggcgacc ttgttcttga 121 cgaccttggc ccgcacccgc tggccgacga cttcttcgcc gtccttgagc tgcgagatgc 181 ggcggacgtc gatgcggcac gagctgtaga acttcagcgc gcggccgccg ggcgtcgttt 241 cggggctgcc gaacatcacg ccgatcttct cgcggatctg gttgatgaag atcaccaccg 301 tcttgcactt cgcgatcgcg cccgtgagct tccgcatcgc ctggctcatc agccgggcct 361 gcaggccgac gtgcgaatcg ccgatgtcgc cgtcgagctc cttctgcggc acgagggcgg 421 ccaccgagtc gaccacgatc acgtcgaccg cgttgctgcg gacgagcatc tcggtgatct 481 ggatgccttc ttcgcccgag gtcggctggc tgacgagcag cgtgtcgagc tcgacgccca 541 gcttcttggc ccagctcggg tcgagggcgt gctcggcgtc cacgaaggcc gcgatgccgc 601 cggccttctg 
ggcgctggcg atcacgtgca gggcgagggt cgtcttgccg ctcgattcgg 661 ggccgaagat ctcgatgatg cggcccttgg ggatgccctg accgccgagc gcgatgtcga 721 gcgagaggct gccggtcgga acgccctcga tcttcggggc ctcgcccgag ccgagcgcca 781 tgatcgaccc cgcgccgaag gccttttcga tctgcgcgag ggtgttcttg agattcgggt 841 cgcgttccag agccgcggcg agcgcgccgc ccgattcctt cgcgacggcc ttcgatttct 901 tcgccatgcg tgcctcggtt gccgtagcca tccaacgtcc tttcgacccg caagaactgg 961 ttgtaacgaa tctctgagcg gtcgccagtg acgctcgcga gggtcgtgct cctcgggagc 1021 gtgacgggcg ttccgatgtc cgtgcgtata ctatacatcg gttcaaaaac gcccgtcaat 1081 cgaaaccgtc gaaaactttt ccaccggccg cgactggcgt actgagagtt tcggacgccc 1141 ccgtgacacc ttcggaccgg ggcgcgaagt tttcgccgcg aggcgcgcga acgacgcggt 1201 tcgcgcccga tttcgggtcg cgacggcccc cgcgcgagag acgcggcggg agcggaaaac 1261 gcccgcggct acgacnnnnn nnnnngctac gactgcaagc cgtccgccag gagccgctgc 1321 aacgcggccg gttccacggg cttcacttcg tgggcgtcga agccggcttc cttggtacgc 1381 cgacggtcgc tctcctggcc ccagccggtg agcgcgacga gcacgacgtc cttgagcgac 1441 gggtgcgagc ggaaccgccg ggccagctcg taaccgtcca tcaccggcat cccgacgtcg 1501 aggaacgcca gctcgggcgg gtcggccgcg gccatttcca gcgccgtcgg cccgtcgtgc 1561 gccaccctcg cgtcgtgacc gcccagccgc agcagcaccg ccaggctctc ggccgcgtcg 1621 acgttgtcgt cgacgaccag gagccgccgc ggcttggccg cttcgatcgc gggggccgcg 1681 gcgctggcgg gcgtcggctc cggggcgtcc gcgaccggca ggtggatggt gaactcgctg 1741 ccgcggccga gcccttcgct ccgggcctcg acgcggcctc cgtgcatctc gacgaggctg 1801 cggacgagcg tcagcccgat gcccatgccg ccgtgcgagc gactgggccg gcggtcgccc 1861 tgcacgaaca tgtcccaaat cttcgagagc atctcgggcg cgatgccgat gccccggtcg 1921 cggacgcgga tcacggcgcg gtcgcccacg cgctcgcccg agagctcgat cttgccgccg 1981 gggtcgctgt agcgggccgc gttgttgagc aggttgccga cgacctgcac gagccgcacc 2041 agatcgccgt gcagcggcac cggctcggcc ggcggcgaga cgacgagcgt gtggccgccc 2101 gcgtcgacgc ccggccgggc gatctcgacg gcgcgctcga tcaccgcgac caggtcgatc 2161 cgctcgcggc ggagctcgat cttgttccgc atgatgcgcg agacgtcgag caggtcgtcg 2221 accagccgca cgaggtgctg gacctgccgc tccatcatct cgacgaccat gtcgaccgcg 2281 ttgggatcgc ggccggccat 
cttcaggatt tcgaggccgt tccgcaccgg cgcgagcggg 2341 ttgcgcagct cgtgcgccag gaggctgagg aagtcttcct tgcggcggtc ggcctcgcgc 2401 acctcgtgat agagccgcga gttttcgatg gcgaccgccg cgcgatgggc gaggtcttcg 2461 gccagcgcca ggtcgcgcgg ctcgaatctc cgccccgact cggccgagat gaaggtgatc 2521 accccgagca cgcgaccgcg ggcctcgagc ggcacggcca tgtacgaccg cagcccgaga 2581 tcgatcagga tctgcgcgaa ctcggagtcc ttcaccgagc cgaggatcat ctcgtcggtg 2641 atctcgggcg tgatctccga ctcgcccgtg cggatgatcc gccagacgcc ggtcggcgct 2701 ttcgggtccg gcggccattt gcggtgggcc acgtgggcga gctcgacctt ggacgggtcg 2761 atgtgcacca cggccacgcg tttgagcgcg ccggtctcgt cgagcatgtc gatcgcgcac 2821 cagtcggcga acgcgggaac ggccagcgtc gcgaccttct gcagcgtggc ctccatgtcc 2881 acgagccccg cgagcgtcgc gctcgcgtcg gccaggaatt tcgcgtcgtg ctcggcccgc 2941 ttgcgctcgg tgatctcgac gcagacgccg gtcatccgct ccgggcggcc ccgctcgtcc 3001 ttgaggagct ggccgcgggc ttccacccag tgctgcgagc cgttcggcca cacgacgcgg 3061 tattcgaggt gatagtcgag atcctcttcg agcgtccggc ggaccgcgcc gaggacgcgc 3121 tcgcggtcgt cgggatggac gtccgcgacc gaggcctcga acgttcccgc gaacgtcccc 3181 gcgggcaggc cgtggatcgc ttcgaggctg ggcgaccagg tcaccgcgtt ggtgcggacg 3241 ttccagtccc acgtccccat ccgccccgcg tcgagcgcca gccgcaaccg ctcttcgctc 3301 gcccgcagac tctccgcggc ggcgcggcgc tcggtgacgt cttcctgcac gcagacgaag 3361 tagcgcatgc cggccgcttc gagcgcggtg atgcgggcct gcgtgacgaa gctcgtcccg 3421 tccttgcgac gattggccca ctcgcccgtc cagacgccgc cgcgattcag ctcgtcgatc 3481 acggccgcga cgcggcgttc gttttcttcg ggagggtagt cgttctgcac ccggacgtgc 3541 tggccgagca gctcgcccgg ctcgtagccg aacatccggt cttcggccgg attggtgtaa 3601 acgatgatcc cgttctcgtc cgtcaggctg actccctcgg ccatgctttc gagcacgcgg 3661 gccttggtct gcaggcggcg ctcgaccacg atccgctcgt ggatgtcggt gctcgtgccg 3721 aaccagcgga cgatcttccc cgcgtcgtcg cggaccggca cgctgcggac gaggtgccag 3781 cggtcttccc cgtgcttgtc gcggagccgc atttccgctt cccacggctc gcccgcggcg 3841 aggcagcgag tccattccgc ggtgactcgc ggcaggtcgt cgggatggat gaacggggcc 3901 cagctctcgc ggccgaagtc gcccgacgcg acgcccgtgt attcgtacca gcgccggttg 3961 taatactcgg gctgccccgc ggcgctcgac 
gcgaagacga tctgcggcat cgagtcggcg 4021 agctgctcga gcccgcgttt cgattcgcgg agggccgtct cgacgccgcg tcgctcggcg 4081 acgtcgcgaa agacgagcac gaccccgacg agccgctgct gcgcgttgaa gatcggcgcg 4141 gcgctgtcgt cgatcggggt ctctttgccg ccgcgggcga caagcacggt gtggttggcg 4201 aggccgacga tgacgccttc gcggatcacg cgatcgaccg ggctctcgac cggttcgcgc 4261 gtctcctcgt tgacgatcac gaacacctcg ccgagcggcc gaccgacggc ctcggcgctc 4321 gtccagccgg tgagcaactc ggcgacgggg ttgagcgaga cgatcttcgc gtccgcgtcg 4381 gtgacgacga ccgcgtcgcc gatgctggcg agcgtgatcc gcagctcttc gcgttgcagg 4441 atcgcctcgg cggcgagtcg gtcggcgcgg tcctgcgcca cgcgcatcag cccgccgagt 4501 ccggccatgc cgaagccgac gacgatgtac accacgaacg cgacctggtg tcccggatcg 4561 cggatgatca gcgaaccgac cggatcgacg atgaagaaag cggtcgtcgc ggcccccacg 4621 agcgtcgcga acgcggccgg gccgaagccc acgaagcggg ccaccagcag cacggccacg 4681 aacaaaaacg cgaacggcgc gcgatccgcc atgaacgggc tgagcgacat tcgcagcacc 4741 gtcgcgatcg ccaccgcgaa cgcggcgacg ccgtagtcgc gcaggcgttg gacgatgggc 4801 gcaccggggg cggtcgtcat cgagagaagc aatcccttct ggccggtcca aaccgggtcg 4861 cgcggagttg gatcgtcggg cgaccggcac tgaccccggt tcgcgccgat gcatcctagc 4921 ttaactcccg ccaacgggca gtgcgacctc cccacgaccg ggtttcgtcc cccgcgcggg 4981 gcgtcgaacc gccccgcacg aggagcgacg cgaccgtcgc gtcgagcgtc agcgcgtcgg 5041 ggcggggaac cgcaccacga accgcgcgcc gtccccccgc ggcgagtcga tctcggcccc 5101 gagctgctcc gccagcgtgt tcaccagttg cagcccgaac gacgtcgcgg cgcggaagtc 5161 gaagtcgggc gggagcccca cgccgtcgtc ggcgacgacc agcacgatcg cgtcggggcc 5221 gtcgggccgc aacgacaccc gcacgcggcc ctccatcgag tcggggaacg cgtgtttgaa 5281 gcagttcgac atcagctcgt tcagcaacag cccgcacggg atcgccacgt cgatcggcaa 5341 cggcggcacc tccacgtcga gctccagtcg cacctcgtcg tcggagagct tgtacgcgtg 5401 gtacaggtcg tccgcgagcc gccgcacgta ctcggcgaaa tcgacccgcg ccaggtcgtg 5461 cgagcggtag agccgttcgt ggatcagcgc catcgaccgc acgcggccgc ggcattcctg 5521 gaacatcgcc agggcccgcg ggtcgtcgat gtattccgac tgcaggtcga ggagcgtcga 5581 gacgatctgg aggttgttct tgacccggtg gtggatctcc ttgaggagca gctctttttc 5641 tttgagcgac gcccgcgtct ggcgctcggc ccgcttctgc 
tcggtgacgt ccttgatctg 5701 cgagatgaag tgcaccggcg cgccgttgtc gtcgcggacc aagccgacgg agagcaacac 5761 gtcgacgatc cggccgtcct tgcggaagta ccgcttctcg agctgatacg tccggcgtcg 5821 gccggcgatc acgtcgcgca cgaggctcag gtcggtctcg aggtcgtcgg gatgcgtgag 5881 ggtctggaag gtcgtccgca gcagctcctc ctcggagtaa ccgaccagct cgcggagcgc 5941 ctcgttgacg cgcagccagc gaccgtcggg cgcgacgatc gccatgccga tcggcgcggc 6001 gtcgaacgcg gtgcggaact gctcctcgct ctcgcggagc gccgcctcgg cccgcttgcg 6061 gtcggtgacg tcgcgtccct cggggatcag cagcaccacg cgtcccgcgt cgtcgaacac 6121 cggcttgatc gagaagtcga tcgtgatcac gcggtcgccc gccccgcgga cgtcggcctc 6181 gtaacgcacc agctccccct gggcggcgcg ggcgatcgct tcgcggagcc gcgactgcgt 6241 ctcgggcgag acggtccacc accgcgtctc ccaaaacggc cggccgaccg cctcgtcgcg 6301 cgtgatgccg ccggcgacca gcgccgtctc gttggcctcg agcagcgtcc cgtcgggctc 6361 catcaacccg atgaactgga acatcgagtt gaagatcgcg cggaagcggt tctcgctggc 6421 cgcgagggcg gtggtccgtt cggcgacgcg tccctcgagc gtgcgtgtca ggtcgcggag 6481 cgacgcgacg gtccgttcca gctcggccgt ccgttcgcgg acgcgggcct cgaggtgctg 6541 gttgttccgc tgcagcgtct cctccgcggc gagccgcgcc gcgatttcgc gttcgagctc 6601 ggccggggcg cgggtcgtca acacgatcgg agcggcgcgg acgagcgcga cgacggtcgc 6661 ccacgagacg accgcggtgg ccagcttgaa cagcgccgcg agtcggtagg cgggccacca 6721 gaagacggcc gcgtcgagca ggtgcgtcgt tccgcaaagc agcaggaacg cgccgaagag 6781 cgcgagcaga ccgcggagcg gcaagtcgcg acgccgcacg acgaagtaac ccagcgccag 6841 cgcgatcgcg aagtacgccg accagacgcc caggtcggac gcgacgtgca gccagccgtg 6901 agcggccgac cagttcccgc agtaccaacg cgccggaaaa tcggacgtgt cgaacaggtg 6961 gacgaagaag tccacggcct ctccctcgcg acgagatcgc gcgctcgtcc cccgcgcgac 7021 gacggactcg acggaccggc ccggacgacg cgagcgcttc aagggtgaat atagaacgcg 7081 gacggtcggg ctgacgagcc gccgctgatc gacggaggga ggaccgcgga gtgcttccct 7141 ctacgtcctt tggcccgcgg cgagcgcccc cgcgcggccg cgggaccccc gctcgcgcgg 7201 cccgggcggc cgatgaactt tcacgcgacg cgtcgtaaga tgcggccacc gcgacgacgc 7261 gtcgtccgtc gcccccgcga cgcgcgcgac gaccggcgga tcgagccccc gcgaatcgac 7321 gacaggcgaa aggccctcag cacatgctcc gcatcggcgt ggtcgggatc 
ggcttcatgg 7381 gcatgatcca ttacctggcc tggcgacgcg tctcgcgcgc ccgggtgacc gcgatcagca 7441 cccgcaacgc ccagcggctc gccggcgact ggcgcgacgt gaaggggaac ttcggcccgc 7501 ccggggaagt catggacctc tccggcgtgc aaaaatacgc gcgctacgaa gacctgctgg 7561 ccgatcccga ggtcgacgtg gtcgatctgt gctcgccccc cgacaagcat tgcgagatga 7621 cgctcgccgc gttcgcggcc ggcaagcacg tcctcgtcga gaagccgatc gcgctgcacc 7681 cgcacgaggc cgagcggatg acggccgccg cgaaggagtc gggcaagcgg ctgttggtcg 7741 cgcacgtgct gccgttcctg ccggagttcg cgtacgcggt cgacgcgatc cgcggggccg 7801 cgttcggcaa gttcctcggc ggcttcttcc gccggatcgt ctccgagccg acctggatcc 7861 ccgacttctt caatccgaac accgtcggcg ggccgatcgt cgatctgcac atccacgacg 7921 cgcacctgat ccgcgtgctg gccgggatgc ccgcgagcgt cgtcagctcc ggacgcaccc 7981 gcggcgacgt cgtcgagttc ttcaccacgc agttccgctt cgccgacccg aagctccacg 8041 tcgtcgctca gagcggcgtg atcgcccagc aaggccgccc gttcacccac ggcttcgaga 8101 tccatctgga aggcgcgacg ctcgtcttcg attcgcaggc gctcgtcggg caaaccgaag 8161 gaacgggcgt gccgctcacc gtgctcgaca agtcgggcgg ctcgttccgc ccgccgctcg 8221 gctcgagcga tccggtcgac gggttcgtgg ccgaactgac cgaagcggcc gacgcgatcg 8281 agaccggacg accgtcgccg ctcctggccg gagagctcgc ccgcgacgcg ctcgtcctct 8341 gccatcgcca gacggaagcg gtccgctccg gcgagatcgt cgcggtcgcc gcggttcgtt 8401 gaacgaggac gtcgaggaga ccgtgcgatg aatgcagcga ctcgcgtgat cgtcgtcgcc 8461 ctcgcggccg cgatcggaat ggcctcgctc ggcgacgctt cccgcgccgc gcaagactcg 8521 gccgagacgc ggcgggcgaa agccgcttac gaccgggcga tcggctatta ccagcagcgg 8581 cagtacgtgc tggcccgcgc gaagttcgac gaggcgctcg cggccgacgc gaagttcgcc 8641 gacgcgctcg tcggccgcgg gctcacctcg atcgccgagg ggaagatgct cgacgcgctg 8701 cacgacctcg acgcggccgt cgcgatcaac ccgaaataca ccgcggcctg gttctaccgc 8761 ggcggcgtgc tgggcgggct gaatcggaac gacgaagcgg tcgcgtcgtt cgacaaggtc 8821 gtcgcgctcg atccgaaaat ggccgccgcg catcacgacc gcggcctggc cctcgacaag 8881 ctcggcaagt acgcggaggc cgacgccgct tacgcccagg cgctctcgat ccgcgaggac 8941 gcccagacgt acgtcgaccg cgcgcacgcc cgcacgcgat tgggcaagct ctccgaagcg 9001 ctggccgacc tcgatcgggc gatcgcgctc gcgccgaagc acgcccaggc ctatttcaat 
9061 cgcgggcggt tgcgcgagaa gctcggccga cgcgacgagg ccgcggccga ccgcaaggcg 9121 gcgctcgagc acgatcccga tttcgccgcg tattgcgtgc aacaagcggt gcgctcgctc 9181 aatcagggcc gcccgctcga cgcccgcgag atgtgcgacg cggcgatcga tctttcgccg 9241 aagctcggcc cggcgtacgt ggcgcgcggg atgtcgttcg tggcgacgtc cgactggaag 9301 cgggccgagg acgacttcca gaccgcgctc ggcctcgacc cgaaaagcgc cgacgcccac 9361 tgcgggctgg gcgtggtgca ggaagcccgc aaagagctcc cggcggccga cgcggcgttc 9421 agccgggcga tcgagctcgc gccgcagaac ccgcagttct ggcgtcgccg cggggcggtc 9481 cgttcggccc tcggccaggc ccgcacggcc gaatccgact tcgcgcaggc cgccgctctg 9541 gaggaagcgc accgccggaa agagtgagcg acgcgcgcgg gtcgtcggtc gccgacgcgc 9601 gcggcatcgc gacgcgcgat ccgcggaaaa cgaaaccgcc cggcgaacgg cggtgaagcc 9661 ggctcgtcgg gcggatcgag gtcgcgaggt tgaacgcggg gcgattacgg attcgccggg 9721 gcggccggag ccggagtctc gctcgcagcg gccggcgggg tcgccggggc ttcag // LOCUS NODE_3440_length_9748_cov_4.575364 9748 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9748) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9748) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9748 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(315..2192) /locus_tag="DP116_23980" CDS complement(315..2192) /locus_tag="DP116_23980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320888.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit gamma" /protein_id="PRJNA477356:DP116_23980" /translation="MRVAQNNQFDYVKIGLASPERIRIWGERTLPNGQVVGEVTKPET INYRTLKPEMDGLFCERIFGPAKDWECHCGKYKRVRHRGIVCERCGVEVTESRVRRHR MGFIKLAAPVAHVWYLKGIPSYISILLDMPLRDVEQIVYFNSYVVLSPGNAETLTYKQ LLSEDQWLEIEDQIYSEDSILQGVEVGIGAEALLRLLADITLEQEAESLREEITTAKG QKRAKLIKRLRVIDNFIATGSKPEWMVMAVIPVIPPDLRPMVQLDGGRFATSDLNDLY RRVINRNNRLARLQEILAPEIIVRNEKRMLQEAVDALIDNGRRGRTVVGANNRPLKSL SDIIEGKQGRFRQNLLGKRVDYSGRSVIVVGPKLNIHQCGLPREMAIELFQPFVINRL IRSGMVNNIKAAKKLISRSDPSVWDVLEEVIEGHPVLLNRAPTLHRLGIQAFEPILVE GRAIQLHPLVCPAFNADFDGDQMAVHVPLSLESQAEARLLMLASNNILSPATGRPIIT PSQDMVLGAYYLTAENPDATKGAGRFFTSLDDVIMAYEQQQVDLHSYIYVRYDGEVET SEPDVEPVEVIPNEDGSRTLLFKFRRVKEDAHGNMIYQYLRTTAGRVIYNKAIQEAIA S" gene complement(2549..5929) /gene="rpoB" /locus_tag="DP116_23985" CDS complement(2549..5929) /gene="rpoB" /locus_tag="DP116_23985" /EC_number="2.7.7.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876965.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit beta" /protein_id="PRJNA477356:DP116_23985" /translation="MTNETIMEPAFLLPDLIEIQRSSFRWFLEEGLIEELNSFSPITD YTGKLELHFLGQNYRLKEPKYSVDEAKRRDSTYAVQMYVPTRLINKETGEIKEQEVFI GDLPLMTDRGTFIINGAERVIVNQIVRSPGVYYKSEIDKNGRRTYSASLIPNRGAWLK FETDRNDLVWVRIDKTRKLSAQVLLKALGLSDNEIFDALRHPEYFQKTIEKEGQFSEE EALMELYRKLRPGEPPTVLGGQQLLDSRFFDPKRYDLGRVGRYKLNKKLRLQVPEPIR VLTPQDILAAVDYLINLEYDIGNTDDIDHLGNRRVRSVGELLQNQVRVGLNRLERIIR ERMTVSDAEVLTPASLVNPKPLVAAIKEFFGSSQLSQFMDQTNPLAELTHKRRLSALG PGGLTRERAGFAVRDIHPSHYGRICPVETPEGPNAGLIGSLATHARVNQYGFLETPYR PVENGRVRFDVSPIYMTADEEDDLRVAAGDLPVDENGDIIGPQAIVRYRQEFSTTTPE QVDYVAVSPVQIVSVATSMIPFLEHDDANRALMGSNMQRQAVPLLKPERPLVGTGLEA QAARDSGMVIVSRTDGEVVYVDATEIRVRPRVTSTDGKPTAALNHQGPSSSGRDHSSK SNSSDIRYHISKYHRSNQDTCLNQKPLVYMGERVVAGQVLADGSSTEGGELALGQNIV VAYMPWEGYNYEDAILISERLVQDDIYTSIHIEKYEIEARQTKLGPEEITREIPNVGE DALRQLDEQGIIRIGAWVESGDILVGKVTPKGESDQPPEEKLLRAIFGEKARDVRDNS LRVPNGEKGRVVDVRLFTREQGDELPPGANMVVRVYVAQKRKIQVGDKMAGRHGNKGI ISKILPLEDMPYLPDGSPVDIVLNPLGVPSRMNVGQVFECLLGWAGHNLGVRFKLTPF DEMYGEESSRTIVHGKLQEARDETGRNWVFNPDNPGKILVYDGRTGEPFDRPVTVGVA YMLKLVHLVDDKIHARSTGPYSLVTQQPLGGKAQQGGQRFGEMEVWALEAFGAAYTLQ ELLTVKSDDMQGRNEALNAIVKGKAIPRPGTPESFKVLMRELQSLGLDIAVHKVETQA DGSSLDVEVDLMTDQGNRRTPPRPTYESLSRESMDDEE" gene 6006..6221 /locus_tag="DP116_23990" CDS 6006..6221 /locus_tag="DP116_23990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_23990" /translation="MWANGAIKAANKDFGKGIITHFAAKRCQFPPAINFFTEIKWYNF LLLSFVFFTSPSKKRVMPQLYTVLACT" gene complement(6622..7407) /locus_tag="DP116_23995" CDS complement(6622..7407) /locus_tag="DP116_23995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129319.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="deoxyribonuclease" /protein_id="PRJNA477356:DP116_23995" /translation="MQLIDTHVHINFDIFQSDLEAVRSRWQQAGVIHLVHSCVEPSEF KSIQALAHRFAEMSFAVGLHPLDAAKWTEQTAEEIVSLASSDSKVVAIGETGLDFYKA DNYDQQRKVFETQLEIASSANLPVIIHCRNAAVEVREVLQKWKKLKGESVRGVMHCWG GSPEETQWFVDLDFYISFSGTVTFKNAKQIQESAAMVRSDRLLIETDCPFLSPVPKRG ERRNEPAYVRYVAQQVATVRGETVDAIATLTTRNACELFGLTI" gene complement(7478..7780) /locus_tag="DP116_24000" CDS complement(7478..7780) /locus_tag="DP116_24000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456876.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S20" /protein_id="PRJNA477356:DP116_24000" /translation="MANTKSALKRAEIAERNRLRNKAYKSAVKTLMKKYFAAVDAYAA NQSPESKQEVMTRMSEAYSKIDRAVKRGVLHPNNGARKKSKLAQRLKAHTQPAATA" gene 8272..8398 /locus_tag="DP116_24005" /pseudo CDS 8272..8398 /locus_tag="DP116_24005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008186223.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(8709..>9748) /locus_tag="DP116_24010" CDS complement(8709..>9748) /locus_tag="DP116_24010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_24010" /translation="SVSLNLSRAYKNFFDGRAQYPKFKSRHHRQSIQYPQKVKQVNDC LKFPGTLGVVKAKIHRLLDGTIKTVTVSKCPSGKYYASVLIEYEGDYPTSSTDGKVIG VDLGIKDFAITYDGEKVSKYPNPKHLAKYEKKLAKKQRIAARKVKGSNRRRKAIKIVA RVYEQVSNVRQDYLHKLSRKIVDSNQVVVVENLLVLGMVRNPYGFATCYNGGNPRNAV AHHKLAKAISDVGWGTFVNFLSYKLEHNGGKLVEINRWFPSSKLCSNCHYQIKELPLD IRNWICPSCGTHHDRDGNAAINIRAEGIRVLSSSGTGEANANGEEVRPNRGRKPVMWH SSVKLEAPTIP" BASE COUNT 2390 a 2313 c 2176 g 2869 t ORIGIN 1 cccactcgct gtctgccgtg catggtttcc cgacttgagg agccactgcg gtgcggtgaa 61 tccagcgcga atgacggctt tcccgacaga ggcgactggt gttagcgcag cgtgaccgga 121 ggtcataccc gaagggaggt tccctccgtt gaagcatgtg gcgtgcacct gtccgttgag 181 gcagtgcgcc cttgggtagc actaggcgtg gtttccccaa ggaagaactg ccgttcaagc 241 agtcttgcgg tcaacactca agaattaaaa aaactttgga ctcttgagtg ttgacttttg 301 actttgggtc gcggttaact ggcgatcgct tcttgaattg ccttattgta aataacgcga 361 cctgctgttg tacgtaaata ctggtaaatc atgtttccgt gagcatcttc tttgactcga 421 cggaacttaa acagcaaagt ccggctacca tcttcattgg gtatcacttc aaccggttct 481 acatctggct cacttgtctc tacttcgccg tcatagcgta cgtaaatata ggaatgcagg 541 tcaacttgct gctgctcgta agccatgatg acatcatcaa gagaagtaaa aaatcgtcct 601 gcgccttttg tcgcatcagg attttctgcg gttaagtaat acgcccctaa caccatgtct 661 tggctaggtg taatgattgg tctgccagta gctggtgaca aaatgttgtt agacgccaac 721 atcagtaagc gtgcttccgc ttgagattcc aaagataacg gtacgtgtac cgccatttgg 781 tcaccatcaa agtcagcgtt aaatgctgga caaacgagtg ggtgcagttg aattgctcga 841 ccttcgacta agattggctc gaaagcttga attcccaaac ggtgtagtgt cggcgcacgg 901 ttaagcagaa cagggtgtcc ttcaatcact tcttctagaa catcccaaac acttggatca 961 gaacgagata tcaacttttt ggctgctttg atgttgttca ccattccgct acgaatcagc 1021 cgattgatga caaatggttg aaatagctca atggccattt ctcgtggtaa accgcactgg 1081 tgaatgttaa gctttggtcc gacgacgatc acagaacgtc cggagtagtc aacccgttta 1141 ccgagtaagt tttgccggaa acgtccttgt ttcccttcga tgatgtcgga caaagacttc 1201 agtggtcgat tatttgcacc tacaaccgta cgtccccgac gaccattgtc aatcaaagcg 1261 tccactgctt 
cttgcagcat ccgcttttcg ttgcgaacaa taatctcagg agccaaaatt 1321 tcttgcaggc gtgccaaacg gttgttacgg ttaatgactc ggcgatataa atcattcaaa 1381 tcgctggttg caaatcgtcc accgtccaac tgcaccattg ggcgtaagtc tgggggaatg 1441 acaggaatga cagccataac catccactca ggcttagatc cggtagcgat gaagttgtca 1501 atcacccgta gccgcttaat cagctttgcc cgcttttgac ctttggccgt tgtaatttct 1561 tcccgcaagg attcagcttc ttgctctaag gtaatatcag cgagtaaccg caacaaagct 1621 tctgcaccaa taccgacctc taccccttgc aggatggaat cttcgctgta aatttggtct 1681 tcaatttcca accactggtc ttcactcagt aactgtttgt aggttaaggt ttcagcattg 1741 ccaggactaa gaacaacata ggaattgaaa taaacaattt gctcaacatc ccgcaaaggc 1801 atatctagga ggatggagat atagctggga atgcctttga gataccagac gtgagcgaca 1861 ggtgctgcga gttttatgaa acccatgcgg tggcggcgca ctcgcgattc agtcacttct 1921 acgccacagc gttcacacac aattcctctg tgacgcactc tcttatactt gccacaatgg 1981 cattcccaat ctttagctgg accaaagatg cgctcacaga acaagccgtc catctccggt 2041 ttcaaagtac ggtaattgat cgtttctggc tttgtgactt caccaactac ctgaccgtta 2101 ggtaaagttc tctcacccca aatccgaatg cgttctggtg atgccaagcc gattttaacg 2161 tagtcaaact gattattttg ggctactctc atctttgtcc aaattgtgtt ctaaagttat 2221 gacttttaag gaacacggct aataaattcc gtgctggtgt gagtattcag ttgtgagtta 2281 ttagttatca gctattactt gttcagtatt tactgttcac taataacagt tcaccctgcg 2341 ggttcgccct atgcacaaag cccacgaatc gggctttgaa catacagtgt gccggaaggg 2401 cttacggcac cggctttcac gaagtgtgcc cgtaagcgca aaccacacaa atcgggcgaa 2461 gcttgagtgt gtcctttgga cataggactt acggacacac tgctttgaac agtactgctt 2521 tcactgttga gtgatgactg ttcagttttt attcctcgtc atccatcgac tcgcgtgaca 2581 aggattcata ggttggacgt gggggtgtac ggcgatttcc ttggtctgtc atcaaatcga 2641 cttcgacatc caaagaacta ccgtctgcct gcgtctctac cttgtgtacg gcaatatcca 2701 agcctagtga ttgcagttct cgcatcaaca ctttaaagga ttctggtgtt ccaggtcgag 2761 gaattgcctt tcctttgacg atcgcattca gtgcttcatt ccgtccttgc atatcgtctg 2821 atttcaccgt cagcaactcc tgcaaggtgt aagcagcacc gaaggcttcc agtgcccaca 2881 cttccatttc accgaaccgc tgaccacctt gttgtgcttt accacccaag ggttgctggg 2941 tcaccagtga gtatggacct 
gttgaacggg cgtggatctt gtcatccacc aagtgaacca 3001 gtttcagcat gtatgccaca cccacagtca ctggtcggtc aaagggttcg ccggtgcgac 3061 cgtcgtacac caaaattttt ccaggattat ctgggttaaa cacccagttt ctacctgttt 3121 cgtcccgtgc ttcttgcaat ttgccatgca cgatggttcg agatgattct tcaccataca 3181 tttcgtcaaa gggagtgagt ttgaaccgca ctccgaggtt atgaccagcc caacccaaca 3241 gacattcaaa cacttgtcca acgttcatcc ggcttggtac gcccaagggg ttgagaacaa 3301 tgtctactgg tgagccatct ggcaagtacg gcatatcttc gaggggcaag attttggaaa 3361 taattccttt attcccgtga cgtcctgcca ttttgtcgcc gacttggatt ttacgctttt 3421 gcgccacata gacccggacg accatgtttg cccctggtgg cagttcatcg ccttgttcgc 3481 gggtaaacaa gcgtacgtct acgacgcgtc ctttttcacc gttgggaact cgcagggaat 3541 tgtctctcac atcccgtgct ttttcaccga aaattgcccg caacagtttt tcttctggtg 3601 gctggtcaga ttcacctttg ggtgttactt ttcctaccag gatatccccg gattctaccc 3661 atgccccaat gcgaatgatc ccctgttcat ccaattgacg caaggcatct tcaccaacgt 3721 tgggaatttc tcttgtaatt tcttctggtc ccaacttggt ttgtcttgcc tcaatttcat 3781 atttttcaat gtgaattgag gtgtagatat cgtcttgtac cagccgttcc gaaatcaaaa 3841 ttgcgtcttc gtagttatac ccttcccaag gcatataggc gacgacgata ttttgtccca 3901 gcgccagttc gcctccttca gtagaagaac catccgccag cacttgacct gcgacaacgc 3961 gttcacccat gtaaactaac ggtttttggt tgagacaggt atcttggtta gagcgatgat 4021 atttggaaat gtggtatcta atgtctgagc tattagattt ggaagaatgg tcacgtccgg 4081 atgagctagg gccttgatga ttcagcgcgg cagttggctt tccatccgtt gaggtaaccc 4141 gtggacgcac acgaatttct gttgcgtcta catataccac ttctccgtcg gtgcgcgata 4201 caatcaccat accagagtct ctggcggctt gagcttctaa gcctgtcccg accaaaggac 4261 gctctggttt cagtaggggt actgcttgcc gttgcatgtt cgatcccatc agggcacggt 4321 ttgcgtcgtc gtgctccaaa aatggaatca tgctagtagc aacagacaca atttgcacag 4381 gagagactgc cacgtagtcc acttgttctg gtgtggtcgt ggagaattcc tggcgatagc 4441 gtactatggc ttgcggtccg atgatgtcac cattttcatc aacgggcaag tcgccagcag 4501 caacccgcag gtcatcttct tcatctgcag tcatgtaaat tggtgatacg tcaaaccgca 4561 cccgtccatt ttctacaggt cgatagggtg tttctagaaa accatactgg ttgacgcgag 4621 catgagttgc caaggaacca atcaagccag 
cattgggacc ttctggagtt tctacggggc 4681 aaatgcgtcc gtagtgggat gggtgaatat cccgtaccgc gaaacctgca cgttcacgtg 4741 ttagacctcc agggccaagt gctgataaac ggcgtttgtg ggtgagttct gccagaggat 4801 ttgtctggtc catgaactgc gacaattggc tagacccaaa gaattctttg atggctgcga 4861 ccaaaggttt ggggttgact agggaagcag gagtcagtac ttcagcgtcg gatacggtca 4921 ttctttcgcg aatgatgcgt tctaagcgat ttaaaccgac tcggacttgg ttttgcagca 4981 attcgcccac acttctgact cgacgattcc ctaagtgatc gatgtcgtca gtattaccga 5041 tatcatactc aagattgatc aggtaatcca ctgctgccaa gatgtcttgg ggagtcaaga 5101 cgcgtatggg ttctggcact tgcaagcgca gttttttgtt gagtttataa cgtccaacac 5161 gaccaaggtc gtagcgtttg ggatcaaaga agcgcgagtc gagaagttgt tgtcctccca 5221 agactgtggg cggttcacca ggacggagtt tacgatacaa ctccatcagg gcttcttctt 5281 ccgaaaattg cccttctttc tcaatcgtct tttggaaata ttctgggtgg cgtaatgcgt 5341 caaagatttc attgtctgac aacccaagag ctttcaggag gacttgcgcc gaaagtttgc 5401 gggttttgtc tatacgtacc cacaccaagt cgttacggtc tgtttcaaat ttcagccaag 5461 ctcctcggtt ggggattaaa ctagctgaat acgtacgccg cccgttcttg tcaatttctg 5521 atttgtagta cactcctggc gatcgcacta tctggttgac aatcacccgc tcagcaccgt 5581 taatgataaa cgtgcctcgg tctgtcatca gaggcaaatc cccaataaaa acttcttgtt 5641 ctttgatttc tccggtttct ttattaatca aacgagtggg gacgtacatc tgtaccgcgt 5701 aggtactatc ccgccgtttt gcttcgtcta cactgtactt tggctctttt agtctgtagt 5761 tttgacccaa aaagtgcagc tccaatttgc ctgtatagtc tgtgatcgga ctaaaggagt 5821 tcagttcttc tatcagtcct tcttctagaa accaacggaa gctggaacgc tgaatttcga 5881 tcaagtccgg caacaaaaag gcgggttcca taattgtttc gtttgtcatg cctctacctt 5941 tgtcaaccta agttttactt aatggtgtgt gggaacttct cctgacaccc cctactcacg 6001 agcaggtgtg ggcaaacgga gcaattaaag ctgcaaataa ggattttggc aaggggatta 6061 ttacccattt tgccgccaaa cgctgccaat ttccaccagc tattaacttc tttactgaaa 6121 tcaaatggta taactttttg ttactttctt ttgttttctt cacttccccc tccaaaaagc 6181 gcgttatgcc tcagctttac actgttttgg cttgtacgta actacctgga tttaaaccca 6241 gagggataat atccttatgc tcagaggaca atattcccct gagcctactt tgtgtatgga 6301 tagcttgtct tggcgggtgg tggtcacacg gtaatttgtt 
taacaattct ctatagggaa 6361 aatatgaaac tagtgtttac acaacgggtt tagatgacca atcgtcgaaa gttatagtcg 6421 tgatggaatt ggagcatcca gttgtagaga ttaagaggat ggttaatttg ctcgtttgac 6481 tcaactgtca gattcaatag ttgcttcttt aaccattatg gcttatgcaa gtctgttttg 6541 gaggtaataa tcatagcata ttttagtcct cctaaagttt gattttttga ttttctgagt 6601 ttatgcacta ggggagatgt tctatattgt taagccaaac aattcacagg catttcgggt 6661 ggtgagagtt gcgatcgcat caacagtttc tccccgcact gttgccactt gttgtgcaac 6721 atagcggaca tacgctggtt cattgcgccg ttcacctcgt tttgggacgg gtgaaagaaa 6781 tggacaatct gtttctatta acaggcgatc gctcctcacc atagcagccg attcttgaat 6841 ttgtttcgcg tttttaaatg tcactgttcc gctaaagctg atgtagaagt ccaaatcaac 6901 aaaccattgg gtttcttctg gacttccacc ccagcagtgc atgacacccc gaacgctttc 6961 ccctttgagc tttttccatt tttgcagcac ttcccttacc tccactgccg cattgcggca 7021 gtggataatg actggtaggt ttgccgaact tgcgatctct agctgtgtct caaagacttt 7081 gcgctgttga tcatagttat ctgctttata gaaatccagc ccagtttccc ctattgctac 7141 aactttagag tcagaactag ctaaagaaac aatttcttct gctgtttgtt cagtccattt 7201 ggctgcatct aagggatgca accccacagc aaagctcatt tcggcaaaac ggtgagccaa 7261 agcttgaatg ctcttaaact cagatggttc tacgcaggag tgcactagat gtatgacccc 7321 cgcttgttgc cagcgtgatc gtacggcttc taaatctgac tggaaaatgt caaagttgat 7381 gtgaacgtgg gtgtctatca gctgcattgc ttttcctttg tcttttgccc ttcgtctttt 7441 gccatactaa agataaaaga ggaaagactg ctgactatta cgcagttgct gctggttgtg 7501 tatgcgcttt tagcctttga gccagttttg acttttttct ggctccattg ttaggatgaa 7561 gtacacccct tttcacggct ctgtcgattt tgctataagc ttccgacatc cgagtcatga 7621 cttcttgttt tgattccggg ctttgattag ctgcgtatgc atctacagcc gcgaagtatt 7681 ttttcatcag ggtcttcact gccgatttat atgctttgtt acgcagtcgg ttgcgttctg 7741 cgatttcggc gcgcttgaga gcagactttg tattcgccac agtcagttcc agaaagacta 7801 tttatatata aacacactac tagacttact aatatagcat cattattgca aatatttata 7861 gtgctagaga aaaattttac cagatacacg tcttgtggat gcgatgagca gcagcttgta 7921 tccgcaaaaa aacaggactt gctattgcct aacggcaacg ctgcgctaac acccacaagt 7981 tgggatagcg aggatttcgc ttgaaagtgt ggtcaaattg ccgaccattg 
tccccgaaaa 8041 attgagctga cttttcacgg ttcgtccgag aaacaccaaa cagcgtaggt ttttctacag 8101 gcagattctc aatagtttta gatgaatgaa aatgagtgac tttgaaaatc agcgtttgaa 8161 aaactcaatt tatgcgatac gccaaggtcc gtctgttgct gcgcgcaatc ggtaaccgat 8221 gaaagtgact tttccagatg ccatcaaaag tcaggtttta gatgtcatac tcaccacgat 8281 agggacaaga acgctgcaaa gaatatcaaa gcagaaagta tcagatgggt attgtcctct 8341 gggacggggg actggtgcga gtggaggcga tgtcagacca aatcgaggat cctaccctac 8401 agaccaggcg attgtccata aatctggaag cccccaccat accgcttgca gttggtggtg 8461 ggtagttcac gacccacatc tattttataa aaagctttct tctaaagcct gctaccatca 8521 ataaaactct ttcattttgc taaatcaatt ttacattgtg attgaaatct gttgaagtga 8581 gatttgtaag aaaaattttt ctccattcgg gaaatggaaa ttttcctgag cagttatact 8641 gcaatttttt ttcaaaaata atactcagta atattttgag tggtagtgaa ctacccacca 8701 ccaaccgctt acggtatggt gggggcttcc aacttcacgg aggaatgcca catcacaggc 8761 ttacgcccac gatttggtct tacttcctct ccattggcgt tagcctcccc cgtcccagag 8821 gaggacagca ctctgatgcc ttctgctcta atatttatcg cagcattacc atctctatcg 8881 tgatgagtac cacaacttgg acaaatccaa tttcttatat ctagtggtaa ctctttgatt 8941 tggtaatgac agttggagca aagcttggaa ctaggaaacc atcggtttat ttcaactaac 9001 ttcccgccat tgtgttctag tttataagac aagaagttga caaaagttcc ccatcccaca 9061 tcagatattg cttttgctag tttgtggtga gccactgcgt tgcgggggtt ccccccgttg 9121 tagcaagtgg cgaacccgta agggttacga accatgccca agacgagtag gttttcaact 9181 accacaactt gattgctatc aactatcttt ctagatagtt tgtgtaagta gtcttggcgg 9241 acattgctaa cctgttcgta taccctagct acaatcttta ttgcttttct acggcgatta 9301 ctccctttaa ctttacgtgc agcaatacgt tgtttcttgg ctagtttttt ctcgtattta 9361 gctaaatgtt taggattcgg atatttggaa actttttcac catcataagt aattgcaaaa 9421 tccttgatac ctaagtcaac accaataacc ttgccatctg tactagatgt tggataatca 9481 ccctcatatt cgatcaacac agaagcgtag tatttacccg atgggcattt actaacagtg 9541 acagtcttga tagttccatc aagtagacga tgtattttag cttttacaac gcctagggtt 9601 ccaggaaatt tcagacaatc attgacttgt tttacctttt gtggatactg gatggactga 9661 cgatgatgtc ttgacttgaa cttaggatat tgtgctctac catcaaaaaa gtttttgtac 
9721 gcacgactaa gattgagact tacagatt // LOCUS NODE_3453_length_9703_cov_5.151430 9703 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9703) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9703) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9703 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(155..2263) /locus_tag="DP116_24015" CDS complement(155..2263) /locus_tag="DP116_24015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456032.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24015" /translation="MNAKAQFLLKTDKGKDKHRDIETSPHFLVSQQQFAQKKQSPRGS TSSVSSSEQPLPPPPAIAGVISYGNEISLNGRIFSGAWLQQREKTGSLSIHLSDGALR QLFGVDFLDSSNPDKQPIQWFSSHTKPLVLSSLLTSGYRYLDVTNFAKTAQWQMQVVN NTLIVTTPSAKITNISQEKQVVGDREARPTANPVLPEQGSPNRRFGATRRSLVGQYTG DRIIVALDRPTLWQITQGLSVKKPQPQIEEEGNSSTQLPSSPPRSSSPPNRQWIITLD GIANPVLAERYTPSNTVGEQDNSKFSSPSSPPLIQQVEVVNNQTIIHLGVPVGLTPRI STVANPNRLIIEIRPDAMVEKNITWAPGLNWRQRFVNLGKERFPVVWLEINPRTSGIK LKPIRTTTNTLVGTAPLIQTAQQYSAVAAINGGYFNRNNQLPLGAIRRDGVWLSSPIL NRGAIAWNDSGQFYIGRLNLVEALVGNNNIQLPILTLNSGYVQSGIARYTTAWGATYT SLTDNEIILVVQKNQVINQLTGAKAGEIAVPIPQDGYLLTLRGNATSNAATLPVGSSI RITSSTAPTALGRYSHIVGAGPLLLQNRQIVLDGKSEKFSNAFITQKAIRSGICTTTT GNLIIAAVHNRVDGGGPTLAEHAQLMQLLGCVDALNLDGGSSTSLYLGGQLLDRSPNT VARVHNGIGIFLPLPGVQRK" gene 3292..8007 /locus_tag="DP116_24020" CDS 3292..8007 /locus_tag="DP116_24020" /EC_number="1.4.1.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872453.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutamate synthase large subunit" /protein_id="PRJNA477356:DP116_24020" /translation="MNKKRMNQEMTSLNTSSGCNNQGQRWLVEERDACGVGFIAHRQN YANHEILEKALAALTCLEHRGGCSADQDSGDGAGILSAIPWTLFQQDLATRGIQLPST ENTAVGMIFLPQDPQAAQTARAVVEQIAESEKLTVLGWRVVPVQPTLLGVQARENQPQ IEQIFLASQDKSGDQLERQLYITRRRIGQAIRNTINSNWSEDFYICSLSSRTIVYKGM VRSAVLGDFYDDLKNPSYTSAFAVYHRRFSTNTMPKWPLAQPMRLLGHNGEINTLLGN INWMMAREASLDHPVWGVGAASGEETRIDELKPFVYIDNSDSATLDNVFELLVRSERS PLEALMIMVPEAYQNQPELRNYPEIVDFYEYYSGLQEAWDGPALLVFSDGRKVGATLD RNGLRPARYCITKDDYIVVASEAGVVNIDEANILEKGRLGPGQMIAVDLETQEVLKNW EIKARIAKSKPYGEWIRQYRQELKSLVSGESQPVNGNGVNGNGHSTTNKIERLALLQH QVAFGYTTEDVEMVIQPMAMEGKEPTFCMGDDIPLAVLSEKSHLLYDYFKQRFAQVTN PAIDPLRESLVMSLKVELGERGNLLQPKPEYARRIKLDSPVLLEAELQAIKLSGFATA ELSTRVEIAAGPQGLKAAVQSLQAQAAESVRAGAKILILSDRISNGIDTEYTYIPPLL AVGAVHHHLIREGLRMKTSLVVETAQCWSTHHFACLIGYGAGAVCPYMTLETVRDWWF DPKTQQFMERGKITKISLEQAIANYRKAVEGGLLKILSKMGISLLSSYQAAQIFEAIG IGQDLLELGFYGTTSRIGGISVDELAQEVLAFHSKAFPELTTKKLVNLGFVNYRPNGE YHMNSPELAKALHKAVDGKKYDHYEVYKQYLANRPLTALRDLLDFQSDRPPISLEEVE SVSDIVKRFCTGGMSLGALSREAHETLAIAMNRLGGKSNSGEGGEDPVRYKILDDVDQ TGHSRTLPHLKGLHNGDTATSAIKQVASGRFGVTPEYLMSAKQIEIKIAQGAKPGEGG QLPGPKVSPYIAMLRRSKPGVTLISPPPHHDIYSIEDLAQLIFDLHQINPKAQVSVKL VAEIGIGTIAAGVAKANADIIQISGHDGGTGASPLSSIKHAGSPWELGLTEVHRVLME NSLRDRVILRVDGGLKSGWDVLMGALMGAEEFGFGSIAMIAEGCIMARICHTNNCPVG VASQKEELRKRFTGMPEHVVNFFYFIAEEVRSLLAKLGYRSLTELTGRADLLKLREDV HINKTAALNLDCLLQLPNTRENRTWLVHEQVHSNGVVLDDQLLADPDIQAAISNHSCV SKTVAVVNTDRTVGTRLAGFIASQYGDNNFEGQIHLNFKGSVGQSFGAFNLPGMTLTL EGEANDYVGKGMNGGEIIIKPPANATYNPSQNVIIGNTCLYGATGGVLFARGLAGERF AVRNSNGTAVIEGAGDHCCEYMTGGVVVVLGKVGRNVAAGMTGGLAYFLDEDGLFPEL VNREIVKLQRVITPAGEKQLQELIKLHAERTGSQKARMILENWQEFLPKFWQLVPPSE AESPQANPQAVEEKQLSSV" gene 8379..8514 /locus_tag="DP116_24025" /pseudo CDS 8379..8514 /locus_tag="DP116_24025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320172.1" /note="response regulator in two-component regulatory system with CusS; regulates the copper efflux system; 
frameshifted; incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" gene complement(8583..9503) /locus_tag="DP116_24030" CDS complement(8583..9503) /locus_tag="DP116_24030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polysaccharide deacetylase family protein" /protein_id="PRJNA477356:DP116_24030" /translation="METNKSYVWSHGILIAAVCFCASLSVGLVMPVNPNTSEAQTKQT TNVKETGAKVGTQKRIETFKAAMLTSWQEEAKTKGFSYDVPSRFHGAVVESAKLTKGE KVIALTFDDGPWAGSTAEVLDILKKNNIKATFFVVGQMLKTYPELGKRIVAEGHVIGN HTWHHWYHYFNPQAAAYEIDSTTDLIYQITGARTTLFRPPGGIMHNGVAAYAKNRNYT LVMWSADSVDYSRPALPKLIRNVMRNSKPGGIVLMHDGGGNRSRTVQALPEIISNFRK QGYRFVTVPELLEIENRELQLAAAQKSNTK" BASE COUNT 2746 a 2061 c 2250 g 2646 t ORIGIN 1 cgtacggatc taactgagag gtgcggtgga taataccacc catcgactca accagggaaa 61 ttcccatcat taaaatcgcg ccgtggcatt agcgaagtag cggcttggat cttttacctt 121 caaaacaagt tttcttcact gttaaacgtt aagactactt cctctggact ccaggtagcg 181 gtaagaatat accaataccg ttgtgaacgc gagcaacagt attaggggaa cgatcaagca 241 attgtcctcc taagtaaaga ctagtagaac taccaccatc caaattcaga gcatcgacac 301 agcctaaaag ctgcatgagt tgggcatgtt ctgctaaggt aggtccccct ccgtcaacac 361 gattgtgtac agcggcaata atcagatttc ctgtggttgt tgtacatata ccactgcgaa 421 tagctttttg agtgataaag gcgttgctaa atttttcgct cttgccatca aggactattt 481 gacgattttg cagtagcagt ggtcctgctc ctacaatgtg ggagtaacga cccaaagctg 541 taggagcagt agaactcgtt attcttatag aagagccaac gggtaaggta gcagcattgc 601 tcgtagcgtt accgcgtaat gttagtaaat agccatcttg tggtattggg acggcaattt 661 caccagcttt agctcctgtt agctgattga taacctggtt tttctgcaca acaagaatga 721 tttcattgtc agttaaggaa gtgtaggttg ctccccacgc agtggtataa cgggcaatac 781 cactctgaac atagccacta ttgagagtca aaatcggcaa ctgtatattg 
ttgttaccta 841 ccaaagcttc caccaagttg agacgaccaa tgtaaaattg cccagaatca ttccaggcga 901 tcgctcctcg gttaagaatt ggacttgata accacactcc atcccgacga atggcaccta 961 gaggtaattg attattacgg ttaaaataac caccgttaat tgctgcgact gctgagtatt 1021 gttgtgcagt ttgaatcaaa ggagcagtgc ctaccaaggt gttagtcgtt gtcctgatag 1081 gttttaattt tattccggag gtacgcggat taatttctaa ccacacaaca ggaaagcgtt 1141 ctttacctaa gtttacaaac cgctgtcgcc aatttaaccc tggtgcccat gtaatatttt 1201 tttccaccat tgcatcgggt cgaatttcaa tgatcagacg gttggggttc gctacagtac 1261 tgatccgagg agttaaaccg acaggaacgc ccagatggat gatcgtttgg ttgttgacga 1321 cttctacctg ttggatgagt ggtggggatg agggagagga gaatttggaa ttgtcttgtt 1381 cccctactgt atttgaagga gtgtaacgtt ctgccaacac tgggttggct attccatcaa 1441 gggtaatgat ccactgtcta tttggtgggg aggaagaacg tgggggtgag gaggggagtt 1501 gtgtagagga atttccctcc tcctctattt gaggttgagg ttttttgacg cttaaccctt 1561 gtgtgatttg ccagagagtt gggcgatcta aagctacgat gatgcgatcg cccgtatact 1621 gcccaaccag gctacgcctg gttgccccga atctccgatt cggggaaccc tgttctggaa 1681 gaacagggtt ggcagtaggg cgcgcttcgc gatcgcctac aacttgcttc tcctgagaaa 1741 tatttgtgat ttttgcgctt ggtgttgtaa ctattaacgt gttatttacg acttgcatct 1801 gccattgggc agtttttgcg aaattcgtaa cgtctaaata tcgatatcca cttgttaata 1861 aactacttaa cactagtggt tttgtgtgag atgaaaacca ctgtattggt tgcttgtctg 1921 gattgctgct atctaagaaa tctaccccga ataattgcct gagtgctccg tcacttaaat 1981 gaatactcaa cgagcctgtt ttttcgcgtt gctgcaacca ggctccagaa aaaatgcgac 2041 cattgagaga aatttcgtta ccataagaaa ttaccccggc tattgcagga ggtggaggta 2101 atggttgctc agaagaactc acagaggaag tactcccccg tggagattgc tttttttgag 2161 caaactgctg ttgagagacg aggaaatgag gagaagtctc tatgtccctg tgtttgtctt 2221 tccctttgtc agtttttaac aagaactgcg ctttggcatt cataccagta gctattaagc 2281 atagtagtgt cacaagtatt gatgacacga aaatcgtgac atacctacca ttgcttttat 2341 tgacagtagc actgcaatgt tgttggcatt ttgaagttgg tacttctccc ataattcgca 2401 atttacttgg taaaaacaat tttttcaagc attaaattta gggtatagct gtcgctctcg 2461 ttatcaggca cagccttgta atggtattgc ttttgagtat aaggaaattt ttgtagaact 
2521 taatatactc aatacctcgt acccgtcatc cacttttaac agaaaagcta agtactttag 2581 ttatcaaaat caaaaaagaa tgacatgagt tttgctctcg aacgggatgt gatattgtat 2641 atatgggctt tttatctctt gaaaacaacc aaaaaatttg ttctagtaca actatgcaat 2701 caggacgagt gctaggtgac acgagtgctc gtgatacaca gcagctttag ttgccaaaac 2761 taaagtgatg agatttttca tgctggctca cagttgtaat gcaagtaaaa tcatgctgct 2821 aacaacacaa ttgacgacaa ggagcttaaa aatttagaaa acctaaggag atataggata 2881 gatgaatagg aattgtcaaa tgagtctata gtgtaatacg ctgattttgt taccaaatta 2941 acgataattg tagatttaaa agtcgtgaag acatatgttg agatgtattt caggcaagca 3001 acacaaggct agacctggaa actagccttc gtcgtcgaaa atctgagaat ttgaggaata 3061 gaactataaa caaaacattg ctggggcaaa agtcgtctcc ttaactcagc atcaacacaa 3121 aaataacgag atgattttaa cacggggtca gcaaaataaa acccgaagtt aaaaattttt 3181 ttcctaattt tcagtcattc tgtttttctg catatctctt cattctgatt tagcaaaatt 3241 ttcccaacaa cgaagtcgta aaaatcgtaa ccaaagcctc agggacagtc tatgaataaa 3301 aaacggatga atcaagaaat gacatcgtta aatacttcct cagggtgtaa caatcaagga 3361 caaaggtggc tcgtagaaga acgagatgct tgtggtgtgg gttttattgc ccatcgccaa 3421 aactatgcca atcatgaaat actagaaaaa gcgttggctg ctttgacttg cttggaacac 3481 cgaggcggtt gtagtgcgga tcaagactct ggtgacggcg cgggaatctt gagcgcgata 3541 ccttggacgt tgtttcaaca agatttggca acgcgcggga tacaacttcc ctctactgaa 3601 aatactgctg tagggatgat atttttaccg caagatccac aagcagccca aacagcacgt 3661 gcagttgttg agcaaatcgc agagtcggaa aaattaactg tattgggctg gcgagtagtc 3721 ccagtgcaac caacattact aggggtacaa gcaagagaaa atcaacctca aattgaacaa 3781 attttcttag cctcccaaga caaaagcggt gatcaactgg aacgtcaact ttatatcacc 3841 cgtcgccgca ttggtcaagc cattcgcaat actatcaaca gcaactggtc agaagacttt 3901 tatatctgct ccttgtctag ccgcacaatt gtttacaaag gcatggtgcg ttcagcagta 3961 ttgggagact tctatgacga tttaaaaaat ccgtcataca caagcgcttt tgctgtgtat 4021 caccgccgct ttagtaccaa taccatgccc aaatggcccc tcgctcaacc gatgcggctt 4081 ttgggacaca acggcgaaat taatactctc ttaggtaaca tcaactggat gatggcacga 4141 gaagccagtt tggatcatcc agtatggggc gttggcgcag cctctggaga agagactcgc 4201 
attgatgaat taaaaccatt tgtctacata gacaatagcg actcagccac tttagataac 4261 gtgttcgagt tactggtgcg ttctgaacgt agcccattgg aagccttgat gattatggtt 4321 ccagaggctt atcagaatca gccagagttg cgtaattatc cagagatagt tgatttttac 4381 gaatactaca gtggtctgca agaagcttgg gatggaccag cattgttggt gtttagtgat 4441 gggcgcaaag ttggtgcaac actagaccgt aatggtttaa gaccagctcg ttactgcatt 4501 accaaggatg actacatagt cgtggcttct gaagcaggag tggtaaatat agatgaagcc 4561 aacatcctgg aaaaaggtag acttggtcct gggcaaatga ttgccgtgga tttagaaact 4621 caagaagtgc tgaaaaattg ggagataaag gcgcgcattg ccaaaagcaa accatacggc 4681 gaatggatac gccagtaccg ccaagaactc aaatctttag tgagtggtga gtcacaacca 4741 gttaatggga atggtgtcaa tgggaatgga cattccacaa caaacaaaat tgaaagactt 4801 gccttgctgc aacaccaagt cgccttcggt tacaccacag aagatgtgga aatggtgatt 4861 cagccaatgg cgatggaagg taaagagcca actttctgca tgggggatga tattccctta 4921 gcagtgctgt ctgaaaaatc tcacctgctt tatgactatt tcaaacagcg ttttgctcaa 4981 gtgacgaacc cagcgattga tcccttgaga gaaagcttgg ttatgtcgtt gaaggtagaa 5041 ctgggtgaac gcggtaactt actccagcca aagccagaat atgcccgcag aatcaagctg 5101 gattcaccag tgctgcttga ggcagaattg caggctataa agttgtcggg atttgcgaca 5161 gcagaattat caacccgagt tgaaatcgcc gcaggtccac aaggattaaa agcagcagtt 5221 caatctttac aagcacaagc cgctgaatcg gtgcgggcag gagctaaaat actcatcctc 5281 agcgacagga taagtaatgg cattgatact gaatacacct atattccccc tctactggcg 5341 gtgggcgcag tacaccatca cctgatccgc gaaggactgc ggatgaaaac atccctcgtt 5401 gtcgagacag cgcagtgctg gagtacacat cactttgcct gtctgattgg ctacggtgca 5461 ggtgctgttt gcccgtatat gactttggag actgtgcgtg attggtggtt tgatccgaaa 5521 acccaacagt tcatggaacg gggtaaaatc actaaaatta gtttggagca ggcgatcgcc 5581 aactatcgca aagcagtaga aggtggtttg ctcaaaattc tctccaaaat ggggatttcc 5641 ttgctctcta gctatcaagc agcgcaaatc tttgaagcaa ttggcattgg gcaggattta 5701 ttagagttgg gattctatgg gacaacttcc cgcattggtg gtattagtgt agatgaactc 5761 gctcaagaag tgctggcgtt ccacagcaaa gctttcccag aactgacgac taagaagcta 5821 gtaaatctgg ggtttgtcaa ctaccgtccc aatggtgagt accacatgaa cagcccagaa 5881 ctagcgaaag 
cacttcataa agctgttgat ggcaagaaat acgaccacta cgaagtttat 5941 aagcagtatc tcgcaaatag accgctcaca gcgttgcgtg acttgctgga tttccaaagc 6001 gatcgcccac ccatttctct tgaagaggta gagtcagtta gtgatattgt caagcgcttc 6061 tgtactggtg ggatgtcgtt gggagcattg tcgcgggaag ctcatgaaac attggcgatc 6121 gccatgaacc gccttggtgg taaatccaac tctggcgaag gtggcgaaga tcccgtgcgt 6181 tacaaaatcc tagatgacgt agaccaaaca ggtcactcgc gaactttacc acacttaaaa 6241 ggattacaca acggtgacac tgcaacgagt gcgatcaagc aagttgcatc aggacgcttt 6301 ggtgtgacgc cagagtacct gatgagcgcc aaacaaattg aaatcaaaat cgcccaaggt 6361 gcaaaaccag gggaaggtgg acagctccca ggaccaaaag ttagccccta cattgcgatg 6421 ttgcggcgtt ctaagcctgg agtgacacta atttcaccac caccccacca cgacatctat 6481 tccatcgaag acttggcgca gctcattttt gacttgcacc aaattaaccc gaaagctcaa 6541 gtctcagtga agttagtcgc agaaattggt attggtacca tcgctgctgg tgtagcgaaa 6601 gcaaacgctg atatcattca aatttctggt cacgatggcg gtacaggagc ctcacctctt 6661 agttctatta aacacgctgg gtcaccgtgg gaactcggct tgactgaagt ccatcgtgtg 6721 ctaatggaaa atagcttgcg tgacagggtg attttgcgcg tcgatggtgg cttaaagagt 6781 ggctgggacg tgctgatggg cgcattgatg ggcgcagaag aatttggttt tggttcgatc 6841 gctatgattg ctgagggctg tatcatggcg cgaatttgcc ataccaataa ctgtcccgtg 6901 ggagtcgctt ctcagaaaga agaactgcgc aagcggttta ccggaatgcc agaacacgtc 6961 gtcaacttct tctactttat tgctgaggaa gtccgtagcc tgttagcaaa acttggctac 7021 cgttccttaa cagaacttac tggacgtgct gacttgttga agcttcgaga agacgtacat 7081 ataaataaga cagctgcgct gaatttagat tgcttgcttc aactaccaaa caccagagaa 7141 aatcgcactt ggctagtaca tgagcaagtc cacagcaacg gcgtagtttt ggatgaccaa 7201 ttgcttgctg atccagacat tcaagctgcg atctccaacc actcttgtgt tagcaaaaca 7261 gtagcagtgg ttaacaccga cagaactgtt ggtacacgct tagcagggtt cattgcttcc 7321 cagtatggcg acaataattt tgaaggacaa attcacctaa atttcaaagg tagtgttggg 7381 caaagttttg gtgctttcaa tcttcccggt atgactctca ccttggaagg tgaagcaaac 7441 gactacgttg gtaagggaat gaatggtggt gaaatcatca tcaaaccccc agcaaatgcg 7501 acttataacc catcacaaaa cgtgattatt ggcaatacct gtctttacgg tgcgacgggt 7561 ggcgtcttat ttgccagagg 
tttagccgga gaacgtttcg ctgtacgtaa ttccaacggg 7621 acagcggtga ttgaaggcgc tggggatcac tgctgtgagt acatgactgg tggtgtggtt 7681 gtcgttctgg ggaaggtggg acgcaatgtc gcagccggga tgactggcgg attagcgtac 7741 ttcttagatg aagatggtct gttccctgag ttagtcaacc gagaaattgt taaactccag 7801 cgggttatca ctccggcggg cgagaaacaa ctacaggagt taatcaaact tcatgctgaa 7861 cgcactggtt cacaaaaagc gaggatgatt ctagagaatt ggcaggagtt cttgcctaag 7921 ttctggcaat tagttccgcc ttctgaagcg gaaagtccac aggctaatcc tcaagctgtg 7981 gaagaaaaac agcttagttc agtttagtga actcaccccg cctttatagt cattgaaggc 8041 ggaaatccac cacttgcaca caaaaaccca aaaaagtttc tgtgtgcaac gttcttttag 8101 gggtgaggca ttgtccccct tcaaaataac agcccgactg ttttctggat tacatttttg 8161 taatgttagc caaaggataa tgactttaaa atggagttag caattcaaat tcatgcccta 8221 gtgacaggga atgtaggcga gcatccgtat actgcttacc ctcttgtttg ctgaacaggc 8281 ttaagcgaat cagaggttcg gggctgtggt gagcgacttt gagcagtggg gcgctaaagt 8341 gatcgccaaa gtcaagcttc ccggttgcac accaatcgct tgatcacatc acgcgaaaag 8401 tcactcgcaa ccaacaagta ataccattca ccagccgcga gtttaatctt cgccaaatat 8461 ttaatgccct attccggacg gatgtttacc cgcagccaaa tattggaaca cgttattttc 8521 tgacgaattc catcgttggc actgattttt tatgatgatg agtggttagt agtaactaat 8581 aattatttag tattactctt ttgtgctgct gccaactgta gttctcggtt ttctatttct 8641 aatagttctg gtacggtgac aaagcgatag ccctgcttcc taaagttgct aataatttct 8701 ggtaaagctt gtacagttct agagcgattt ccaccaccat catgcattaa tacaattcca 8761 cctggtttgg aatttctcat gacgttccta attaattttg gcagagcagg acgactgtag 8821 tctacggagt cagcagacca cataaccaga gtgtagtttc tatttttagc ataagcagcc 8881 accccattat gcataattcc acctggtggt ctgaataaag tggttctcgc acctgtaatt 8941 tgataaatta agtctgttgt gctgtcaatt tcataggcag ctgcctgtgg gttaaaatag 9001 tggtaccaat gatgccaagt atgattgcca ataacgtgac cttctgctac aattcgtttc 9061 cctagctccg gataagtctt cagcatctgt ccaacaacga agaatgtagc ttttatatta 9121 ttttttttca ggatatccag tacctctgca gtagaaccag cccaaggacc atcatcgaag 9181 gtaagtgcaa tcactttttc ccctttcgtc agttttgcac tctcaacaac tgcaccatga 9241 aagcgcgatg gtacatcata tgaaaatccc 
tttgtttttg cttcttcttg ccaacttgtc 9301 agcatcgccg ccttaaaagt ctcgatgcgc ttttgagttc ctactttggc tcctgtttcc 9361 ttcacgttcg tagtttgttt tgtctgagcc tccgaggtat taggattgac aggcattacc 9421 aagccaacgc ttaaactagc gcaaaaacaa actgcggcaa tgagtattcc atgtgaccaa 9481 acatacgatt tattagtttc cacgtcaaag ctccttgggg tggaatcatg agttatgtgt 9541 ctaagggtgt gcgacagcac attttttctc gatcagagat agctaggtgc tttctgataa 9601 tcctaaaaat ttaaatatag attaacagaa atataaagat taagctgata aatgatagag 9661 gtttagagta aaccagtata atctagcttt aaactaacat atc // LOCUS NODE_3476_length_9621_cov_5.094606 9621 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9621) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9621) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9621 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(146..793) /locus_tag="DP116_24035" CDS complement(146..793) /locus_tag="DP116_24035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_24035" /translation="MIRVFVVAASPVVRAGLSAVVATSSKLTVVGTASDLDALTREFE QLQPDVLLLDVSGHFQELVWEKLLSSQQQPYPAIVVLTDELDSLDLEAALRAGVRSIL PSSSTESEIVAAVEAIALGLIVLHSDTIEFLLPLKESSVREKDTAHPVQALTPREIEV LQMLGSGLSNKAIAKRLNISDHTVKFHVSSIFQKLGVSTRTEAVTVGVRLGLIML" gene complement(790..1623) /locus_tag="DP116_24040" CDS complement(790..1623) /locus_tag="DP116_24040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315980.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase S1" /protein_id="PRJNA477356:DP116_24040" /translation="MNLTTITRLTDEFASVAENLRKVTVKVRSSSFGSGSGVIWQSTE RDTLIITNAHVATNNKATVELSDGRVFEAVRTNIDPTKDLAALKIDATDLPTATIGHS DALRVGELVLAVGNPFGDSGAVTTGIIHANHQRVVMADIVLFPGNSGGPLADCLGRVI GINTMIVNNLAVAIPSLTVERFLHGNRQKLGVMLQPVLVNATNKRNLGLLVLSVNSGS SAEAAGVLVGDVLIGVSGKLFTTPHDLSNYLEHTNNSEPLPLQILRGGRQLVCQVAEV V" gene complement(1666..2586) /locus_tag="DP116_24045" CDS complement(1666..2586) /locus_tag="DP116_24045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315981.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="PRJNA477356:DP116_24045" /translation="MSSPNALVALSNNIADIVEQVGGAVVAVNAGQRFSPSGIHWRNG IIITSDESLRRYDEVTVTLSNGSTVPVTLIGRDPTTDIAVFKVENAQIPVGKIGDAKT LKVGHLVLGLGRSSEGDIRAAIGAVSVVSGAWRSMIGGNIDQFIRPDITLYPGFAGGP LVDAAGLVVGMNTSGRRGTALTIPASTVNRVIDELLAKGHIARGYLGLGMQPVRLPNN LRTALNLTSVGGVIVVNVEPNAPADKAGVLIGDVLITFDGTPVDDTGDVLAFLNSGDR VGKTIKVQVIRAGALVELAIAIGERSASEE" gene 2884..3471 /locus_tag="DP116_24050" CDS 2884..3471 /locus_tag="DP116_24050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HdeD family acid-resistance protein" /protein_id="PRJNA477356:DP116_24050" /translation="MTTNISGDINKSGDINKRINNSLLIGILLTAFGIVAIALPSVST IFAETWIALILISAGAAKLSYAFQTRHEGGFVWKLLLSILYFATGVMLFVYPFTGILT LTLLLASFLLTEGVFELILAFRLRPQQNWTWALGNGIVTLILGAMIWFQWPFNAPWVI GTLVGASILSTGVSRVMLSLNVRSALNQSDSIAQA" gene complement(3597..4004) /locus_tag="DP116_24055" CDS complement(3597..4004) /locus_tag="DP116_24055" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24055" /translation="MELHNMHEYFVNLNISINNSVDKTNLIEIELVKAEPEVTPSHGT KQQQTAVPDGILFWLPLSLMTFWMIVAFRLSNAWKVTQHRLTTAKILSQVPCNKCQYF KNNPYLKCAIHPTTALTEEAINCSDYSPNNTDS" gene complement(4191..5543) /locus_tag="DP116_24060" CDS complement(4191..5543) /locus_tag="DP116_24060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874709.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_24060" /translation="MPIESNSNSSQSPKRESVPEDILQPDDFEDAGENGLVPDELATQ QALSAISALQSPRHTAALQQARSSFQGRNQAVTVKPRALLLILVAIAIIFIGIAINNW LIGISGTLVALLISSAVLLPELINAIQQWFSVQERSLFVAFFGFVASLIGFVKFSGLG DRILTIGRRINWEASGTLAEWFGALGQILIAIIAVYVAWRQYVISKDLTIQQNLLTVQ QNIITQQQTIDSYFQGVSDLVLDEEGLLEDWPQERMIAEGRTAAILSSVDGSGKAKIL RFLSRSKLLTPLKRDRHLGRAILDGNGGYAEDRKAGLRVIDLGVMLAGADLSGTDLRW TDLSEANLVRADLTGCDLVKANLSRTILYEVKLEGADLNGIRLFYGSAETASPRSRTE PPNYETGEHTGAVVENVDFSDVRRMSEVARYYCCAWGGEQTRGTIPGGCEGIPNKLGR " gene complement(5814..6236) /locus_tag="DP116_24065" CDS complement(5814..6236) /locus_tag="DP116_24065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015155749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pilus assembly protein" /protein_id="PRJNA477356:DP116_24065" /translation="MRRRVLLDTGPLVAFLKRQDQFHSWVTAQWATIEPPLLTCEAVI SEACFLLRNVYGGQEAVIAIVNSGVIQIPFRLDEETGIIGELLKTYQSVPMSLADACL VRMAEQYADSVLLTFDSDFLVYRKNKNQVIPVIMPQDK" gene complement(6236..6475) /locus_tag="DP116_24070" CDS complement(6236..6475) /locus_tag="DP116_24070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006670690.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24070" /translation="MSIEQIIVEKLRTLPPEKQQEALDFVEFLQTKTRKREFSHQEQQ PGVSALTLAQKWAGCLEGGPSDLSTNKKYMDGYGE" gene complement(6490..6867) /locus_tag="DP116_24075" CDS complement(6490..6867) /locus_tag="DP116_24075" /inference="COORDINATES: protein motif:HMM:PF14218.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24075" /translation="MVNSFGDGWNTQKRCDTIAQRLESFRQDGLIGFSHRSDPKTPNQ SAICANTKLDRSNCNLLVTLKPGADGYDSLRRMLEALKNGTSVEQGSNGSTVPILALG STFVSVENLLAAEDLKAGLDTSK" gene 7184..8071 /locus_tag="DP116_24080" CDS 7184..8071 /locus_tag="DP116_24080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874711.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_24080" /translation="MKKLLITGASGFLGYHLCHLAKREWEVYGTYFSHSLEIPDIKML KVDLTDFQELKQIFSDFQPAAVIHTAALSQPNFCQTHPEASYAINVTASCNIAGLCAD SCIPCAFTSTDLVFDGLNAPYRETDPVCPVNIYGEQKVMAEEGMLERYPMTAICRMPL MFGMQTPTATSFIQPFIQTLEQGKELNLFIDEFRTPVSGKTAAKGLLLALEKVKGRIH LGGKERISRYDFGRLLVDIFQLPAHQLKACRQKDVKMAAPRPSDVSFDSSKAFSLGYQ PLSLQEELQELRENFSTYI" gene complement(8452..9519) /locus_tag="DP116_24085" CDS complement(8452..9519) /locus_tag="DP116_24085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994744.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3616 domain-containing protein" /protein_id="PRJNA477356:DP116_24085" /translation="MDYSHRINQVLLTFTDSFKEHRNDLSALLLTPEKHLWLGSDETS TIERLSFVDVSNFAEHKQFHVADFISLPAPEEEEIDIEGLAYADYYLWLIGSHSYKRK KPKPEFSDEKNIKRLTKIASEPNRYIIGRIPLVDGELLPLCQHPKKPDVQLSAAKVEV TQQGNLLMEAIADDPHLGFFVKATIPGKDNGFDLEGIAICKNRIFLGLRGPVLRGWAI ILEIELENSGTGLLRLKNIGDDDKRYKKHFLFLNGLGIRDLFLDGEDLLILAGPTMDL DGPVQVYRVKNGLHLQEKVLNYPEFVLDIPYGNRDEHAEGITLFHDIAGVPSVLTVYD SPAKNRLVGDGSVLADVFKLM" BASE COUNT 2747 a 2108 c 1958 g 2808 t ORIGIN 1 cgagtagaca ttgttgtcgg gttgtccatc cttgtctatg aaaaatgaag cgggtgcagt 61 gaagggcggg gtgaggtctt tttattgtaa gtaattaggc gaacttgatg cggctggtac 121 ttgcggctgg tacttgataa ctgagtcaaa gcatgattaa acccaacctc acaccaacag 181 tcacagcttc tgtgcgcgtc gagacaccca gcttttgaaa aatagatgag acgtgaaatt 241 taacagtatg gtcggagata tttaaacgtt tggcgatcgc cttattactc aatcccgaac 301 caagcatctg caaaacttct atttctcgtg gtgtgagtgc ttgcacagga tgtgcagtgt 361 ctttctctcg tacactcgac tctttgagag gaagcagaaa ctcgatagta tcagaatgta 421 gcacgattag tccaagggcg atcgcctcaa cagccgccac aatttccgac tctgtgctgc 481 tactcggcaa tatgctgcgt acaccagcac gcagcgctgc ttccaagtct agactatcga 541 gttcatcagt tagaacaaca attgcagggt atggctgctg ttgtgaagag agcaattttt 601 cccagactaa ttcttgaaag tgaccactca catccaacag cagcacatcg 
ggttgtagtt 661 gctcaaattc ccttgtcaat gcatccaaat ctgatgcagt acctacgact gtgagtttag 721 aactcgtcgc cacgacagct gataacccag ctcgcaccac aggggaagca gcaactacaa 781 acacacgaat cataccacct ccgccacctg acaaactaat tgccttccgc cgcgcagaat 841 ttgtaacggt agtggttcgc tgttgttagt gtgttcaaga tagttgctta aatcatgagg 901 agtggtaaac aattttccgg aaactccaat aaggacatca ccaacaagca caccagcagc 961 ttcagcagaa cttccagaat tcaccgacaa caccaataaa cccaagttgc gtttgttggt 1021 tgcattcacc agcacaggtt gcagcatcac accaagcttt tgacgatttc catgcaaaaa 1081 acgttctaca gtaaggcttg gtattgctac agccaaattg ttgacaatca tggtgttaat 1141 tccgatgact cgcccaagac agtcagcaag tggaccacca gaattaccag gaaacagcac 1201 aatatcagcc atgactacac gttgatgatt ggcgtgaata atccctgttg tcacagcacc 1261 gctatcacca aaaggattac caactgctaa gaccaattca cctactcgta aagcatcaga 1321 atgaccgata gttgcagtgg gtaaatcagt cgcgtcaatt ttgagagcag ctaaatcctt 1381 tgttgggtct atattagtac gtacagcctc aaaaactcgt ccatccgata gttctacagt 1441 tgctttgttg ttagtagcga catgagcatt cgtaataatt aaagtatctc tttcggtgga 1501 ttgccagata acaccagaac cacttccaaa agaactacta cgcactttca cagtcacttt 1561 gcgtagattt tcagccaccg atgcgaattc atcagtcaag cgtgtgattg ttgtgagatt 1621 catgttttac ttttcaatta tgaatgggtg aattttgaat tggtatcact cttcgcttgc 1681 ggatctttcg ccaattgcta ttgctaactc aactaatgcg ccagcacgga taacttgcac 1741 tttgatggtt ttgccaacgc gatcgccact attcaaaaat gccaacacat cacctgtatc 1801 atctactgga gtaccatcaa aagtgatcaa tacatcccca atcaacacac cagccttgtc 1861 agcaggtgca tttggctcta cattaacaac aatcactcca ccaacagaag ttaaattgag 1921 tgctgttctc aggttgtttg gtaaacgtac gggttgcatt cctaaaccca ggtaaccgcg 1981 tgcaatatgt ccttttgcca acagttcgtc aatcacccga ttgactgtag aagctggtat 2041 ggtaagagca gtaccgcgtc gtcccgatgt gttcatgcct accactaaac ccgcagcatc 2101 tacaagtggt ccacctgcaa acccagggta aagcgtgatg tctggacgga tgaactggtc 2161 tatatttcca ccaatcatac tccgccaagc accactgaca acactcactg cacctatagc 2221 tgctctaatg tcaccttcgc tacttcttcc tagtcccagt accaggtgac caactttgag 2281 tgttttggca tcgcctattt tccccactgg gatttgtgca ttttcgactt tgaaaacagc 2341 
gatatcagtg gtagggtcac gtccgatgag tgtgacaggt acagtactac catttgataa 2401 agtgactgtg acttcgtcat agcgccggag tgactcgtcg gaagtgataa taattccatt 2461 gcgccagtgg ataccgctag gggaaaaacg ctgaccagca ttcacagcaa caacagcacc 2521 accaacttgc tctacaatgt cagctatatt gttggacaaa gccactaagg cgttaggtga 2581 agacatttta ttcctctact ttttgataga taacacagga gtaacttttc ctgatgttgc 2641 gattgtgtcg ttttgttcgt gtgaacagca tcagaagaat ggggggaatt tgccctagaa 2701 atatggctag gtaaggtgga aaagaagggg tgtaagggta taagggtgta agggtatagg 2761 ggggtaaaga aggaagaggt aggtagagct tccttggacg tatatcttaa gggtctacaa 2821 agttcttgag aaattttcta gcctaaatct tagagttcaa ttaactacag ggaaaagatt 2881 tttatgacga cgaacatttc tggtgatatc aacaagtctg gtgatatcaa caagcgtatt 2941 aataactcgc tcttaattgg cattctcctg actgctttcg ggattgtagc aattgctctg 3001 ccttctgtct caacaatctt cgctgagact tggattgctt tgattctaat ttctgcgggt 3061 gcagccaagc tgagttatgc atttcaaacc cgccatgagg gagggtttgt ttggaaactc 3121 ttgctgagta ttctctactt cgcaacaggc gtgatgctgt ttgtttatcc ttttacagga 3181 attctcacgc tgactctgtt gttagctagc tttttgctaa cagaaggtgt atttgagcta 3241 attctggcat ttcgcttacg tccgcaacaa aactggactt gggcactggg taatggtatt 3301 gttaccctaa tcttgggtgc aatgatttgg ttccaatggc ccttcaatgc tccttgggtg 3361 attgggacat tggtaggtgc cagcattctc tccactggtg tttcacgtgt gatgctatca 3421 ttgaacgttc gttctgcttt gaatcaatct gactccatag cacaagcata aggcgatcat 3481 agagaagaac taaccaacaa agaactcaga actcagcaca gtaaacagtt atcagtaaac 3541 agtgaacaat aactgaattc tgagttctgt attgtgtctt cttcatgacg aagtcttcaa 3601 ctatctgtgt tattaggaga gtaatcagag caattgatag cttcttcagt taaagcagtt 3661 gtagggtgta tcgcacattt gagataagga ttgtttttga agtattgaca tttattacaa 3721 ggaacttgag aaaggatttt ggctgttgtc aatctatgtt gtgtaacttt ccaagcattt 3781 gaaagccgaa aagcaacaat catccaaaag gtcattaagc tcaagggaag ccaaaagagt 3841 attccatcgg gtaccgctgt ttgttgctgt tttgtcccgt gtgatggtgt aacctctggt 3901 tctgctttga ccaattctat ttctattaaa ttagttttat caacactatt atttatagat 3961 atattaagat taacaaaata ttcatgcata ttatgcaatt ccattaaaag aagaagaatt 4021 aaaacaagaa 
gagattttgc acgcttgaga gatccataca gatgtatggg aaaaatccgt 4081 ctcaagtcgt gcaagttgtg ttaagctttg ctaattctgt tttatagctg aaaaaaattt 4141 ttgatatctc tatccaaggt tttggttgtt gcttatcttg ttgcttataa ttatctacct 4201 aacttattcg gaattccttc gcaaccaccg ggaatagtac ctctggtttg ttctcctccc 4261 caagcgcagc aatagtaacg cgcaacttca gacattcgcc ggacatcact gaaatcgaca 4321 ttttctacaa cagcccctgt gtgttcgcca gtttcatagt ttggcggttc agtgcggcta 4381 cggggcgagg cggtttcagc ggaaccatag aataagcgaa ttccgtttaa atctgcaccc 4441 tcaagtttta cttcatataa aatagttcgg gagaggttag cttttactaa gtcacagcca 4501 gtgagatcag cacggacaag atttgcttcc gacagatcag tccaacgcaa atcagtacca 4561 ctaagatctg caccagcaag catcacccct aaatcgatga cacgcaaacc cgcctttcga 4621 tcttctgcat aaccaccatt accatcaaga atagctcgac caagatggcg atcgcgcttc 4681 aaaggtgtca gcaatttgga acgcgagaga aaacggagaa ttttggcttt accactacca 4741 tccacactac tcaaaattgc tgctgtgcgt ccttcagcaa tcatcctctc ttggggccaa 4801 tcttctaata acccttcttc atccaatact aaatcagaga ccccttggaa ataagaatca 4861 attgtctgct gttgggtaat gatattttgc tggacagtca gcaggttttg ctgaatggtc 4921 agatctttgg aaatgacgta ctgtcgccaa gcaacataga cagcaataat ggcgatgaga 4981 atttgcccca aagcgccaaa ccattccgcg agtgttccag aagcttccca gttgattcgg 5041 cgtccaatag ttaggatgcg atcgcctaaa ccgctaaatt tcacaaagcc aatcagactt 5101 gctacaaatc caaaaaaagc aacaaacagc gatcgttctt gaaccgaaaa ccactgttgt 5161 atcgcattta tgagttctgg taacagcacc gctgaagaga tgagcaaagc taccagagtt 5221 cccgatatcc caatcaacca gttattgatg gcaattccga taaaaatgat ggcgatcgcc 5281 actaaaataa gcaacaatgc cctaggtttg actgttacag cttgattgcg tccttggaag 5341 ctagaacgcg cttgttgaag ggctgctgta tggcgtggag attgtagggc tgaaattgcg 5401 gaaagagctt gttgagtggc gagttcatct ggtacaaggc cattttcacc tgcgtcctcg 5461 aaatcatctg gttgtaagat gtcttcggga acagattctc gtttaggaga ctgcgatgag 5521 ttggaattag attcaatcgg catgattata ttgtcgcacc agcacgccta agttcaacag 5581 atctctcatg gtaatcacga aagtatggcg gtaggcaaca gtcgtaggta ataaaacagg 5641 gcgatggctt tttggtggaa ccttaatatg ggtgatgatt ggacttgctt gagaaattta 5701 actgtaatta gcctgttttt 
ttaagtattt tttcgtacac agaaagcgat cactctgaaa 5761 cctgacttaa gcggttgctg aatcaattgg ggcgctataa atagacgtta tagctattta 5821 tcctgaggca taattacagg aattacttga tttttgttct tacgatagac gagaaaatca 5881 ctatcaaaag ttaacaaaac actatcagca tactgttcag ccatccgcac taaacaagca 5941 tcagccaaag acatcggtac agactggtag gttttaagca attccccaat aataccagtt 6001 tcttcatcta agcgaaaagg tatctgtatc actccactat tcactatggc gataaccgct 6061 tcttgaccac cgtaaacatt tcgcagcaaa aaacaagctt ctgaaatgac tgcttcacaa 6121 gttaataaag gcggctcaat ggtagcccac tgcgctgtca cccagctatg aaactgatcc 6181 tggcgcttga ggaaagctac taatggacct gtatctagta gaactcgccg cctcattact 6241 ccccatatcc atccatgtac tttttgttag tagaaaggtc tgatggtcct ccttccaaac 6301 aaccagccca cttctgagca agagttaaag ctgacacccc tggctgttgt tcttgatgag 6361 aaaactctct tttgcgagtt ttcgtttgca aaaactccac aaaatcaagc gcctcctgtt 6421 gtttttcagg gggcaaagtt ctcagctttt ctacaataat ttgttcaata ctcatttcac 6481 actattatgt tatttactag tatctaatcc agctttgaga tcctcagctg ccaacaggtt 6541 ttcaacgcta acaaatgtcg aaccaagagc aagaataggg acagttgaac cattcgagcc 6601 ttgctcaaca ctagtaccat ttttcagagc ctcaagcata cgccgcaagg agtcgtagcc 6661 gtcagcacca ggtttaagcg tcaccagcaa attgcaatta ctgcggtcaa gtttagtgtt 6721 ggcacatatc gcggattgat tgggtgtttt ggggtcagag cgatgactaa aacctattaa 6781 accatcttgg cgaaagcttt ccaagcgttg ggcgatagta tcgcaccgct tctgagtatt 6841 ccagccatca ccaaagctgt tgaccatcga agccagggct ttgtccgctg atcattacga 6901 tacattacca tccacacctc accgccgttt tgtgtgtcgg gttgcaaggc gcaagaaaag 6961 gggtcttttg agccgaagcc gaatatacct ggagtccagc caatagcaac accgtaagcc 7021 aagactaaac cagcgacacc cgctacagca ccaataatgc ctacagtcaa attatctatg 7081 tcaaatatat gtcttttttt agacatcgac tctgctctca aatcgacaca ctgtcgtgat 7141 atttttaaca tagcgtatct acgttagagt atacattgaa attatgaaaa aactattaat 7201 tactggtgct agtggttttt tagggtatca tctttgccat ctggcaaaac gagaatggga 7261 agtctatgga acttactttt ctcattcttt agaaattcca gacatcaaaa tgctgaaagt 7321 tgatttgaca gattttcaag aactgaaaca aatatttagt gattttcaac cagcagcggt 7381 cattcatacc gccgcactat cccaaccgaa 
tttttgtcaa actcatccag aggcatcata 7441 tgcaataaat gtgacagcat cctgcaatat tgctggactt tgtgcagatt cttgtattcc 7501 ttgcgctttt acatcaacag acttagtttt tgatggctta aatgctccct atcgtgaaac 7561 agatcctgta tgccctgtca atatttatgg tgagcaaaaa gtcatggctg aagaaggtat 7621 gctagagcgt tatcctatga ctgctatttg tcgtatgcca ttaatgtttg gtatgcaaac 7681 accgacagca acaagcttta tccaaccatt tattcagact cttgaacaag gaaaagagtt 7741 aaatttattt atagatgaat tccggacacc cgtgagtgga aaaactgcag ccaaaggatt 7801 gttattagct ttggaaaaag tcaaaggacg tattcactta ggaggtaaag aacgaatttc 7861 gcgctatgat tttggtcgct tattagtgga tatttttcaa cttcctgctc atcaattaaa 7921 agcctgccga caaaaagatg ttaaaatggc agcaccgaga ccatctgatg tttctttcga 7981 cagttccaag gctttttctt taggctatca acctctctct ttgcaagaag aattacagga 8041 attacgcgaa aatttttcta cttacattta agtgcagagt ttgttcatgt tgtcttcgtc 8101 aacttttaaa ctctactctg ctcaaaatgt cgtctagttg tgtttgctca agctacacaa 8161 gtgaaaaatc cgagggtgtg tagccacctg catcgggaga ttgacaatgt tcgatgagtt 8221 ggagcagatt tgttaagtag ttatcagcta agttggcgat cgcctctttt gaatgttgtt 8281 ctaacttgaa cttccccaca gtcaaaaaag gctaatgctt acgtaaagcg gatttcaaag 8341 ttttttgaaa ccgccttact gggtagatct gacttttgcg cctctagcca gtttgggtta 8401 catttgggct taatttttgt ctaagtcctg ctaagagtcc tgctcttttt tttacatcag 8461 tttaaaaaca tctgccagta cactaccatc tcccaccagc ctattttttg ccggagaatc 8521 ataaactgtc agtactgaag gtaccccagc tatatcatga aataaagtaa ttccctcagc 8581 gtgttcgtct ctatttccat atgggatatc aagcacaaac tctggataat tgagaacttt 8641 ttcttgtaaa tgtaggccat tctttacacg ataaacttgc accggaccat ccaaatccat 8701 tgtgggtcct gctaaaatca acaagtcttc accatctaaa aataaatccc ggattcctaa 8761 accattcaag aagagaaaat gcttcttgta tctcttgtca tcatctccaa tatttttcag 8821 tctcagcagt ccggtaccag aattttctaa ctctatttcc agaatgatag cccaacctcg 8881 taacacaggt ccacgcaaac ctagaaaaat tcggttttta caaatggcta ttccttctag 8941 atcaaagcca ttatctttac ctggaattgt cgctttcaca aaaaagccca agtgtggatc 9001 atcggctatt gcttccatga gcaagttacc ctgttgtgtt acctcaactt tggcagcgct 9061 taattgtaca tcaggttttt ttggatgttg gcataatggc 
aataattcac catccaccaa 9121 gggaatacgt ccgatgatat agcgattcgg ttctgatgcg atttttgtga gtcttttaat 9181 atttttctca tccgaaaatt caggtttagg ttttttgcgt ttgtagctgt gagaaccaat 9241 taaccacagg taatagtcag catatgccaa accttcaata tcaatttctt cttcttctgg 9301 tgcaggtaaa ctgataaagt ctgctacgtg gaattgttta tgttctgcaa agttacttac 9361 atctacaaaa gatagccgtt caatcgttga ggtttcatcc gaccccaacc acaagtgctt 9421 ttccggggtt agcagcagtg ctgaaaggtc gttacgatgt tctttaaaac tatcagtaaa 9481 agtcagtaaa acttgattga ttcgatgtga atagtccatt gtatatactc ctgtagatat 9541 tttttagtct tgttttttat gaaactgatt ttttcaggat tttatactaa aaatggctga 9601 taataacttg aaaataataa t // LOCUS NODE_3480_length_9611_cov_4.6618889611 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9611) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9611) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9611 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 53..901 /gene="mgtE" /locus_tag="DP116_24090" /pseudo CDS 53..901 /gene="mgtE" /locus_tag="DP116_24090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115761.1" /note="internal stop; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="magnesium transporter" gene 1170..1622 /locus_tag="DP116_24095" CDS 1170..1622 /locus_tag="DP116_24095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869457.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-tyrosyl-tRNA(Tyr) deacylase" /protein_id="PRJNA477356:DP116_24095" /translation="MRVIIQRVKSSQVTVNGEIVGKIGRGLNLLVGIADTDTDAELDW MVRKCLELRLFGDQQQESRWEKSVQEIGGELLVISQFTLYGDCRKGRRPSFDRSAIPK VAQDLYNRFVDKLRESGLRVETGEFGAMMQVSIENDGPVTLLLEREIV" gene 1727..2065 /locus_tag="DP116_24100" CDS 1727..2065 /locus_tag="DP116_24100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem II reaction center protein Psb28" /protein_id="PRJNA477356:DP116_24100" /translation="MAQIQFSRGKNEEVIPQVRLTRSKTGDSSTATFIFQNPQALDSK STEEITGMYMIDEEGEIVTREVKGKFINGKPEVLEAIYLMRSKDEWDRFMRFMERYAE ENGLEFSGKS" gene 2652..3059 /locus_tag="DP116_24105" CDS 2652..3059 /locus_tag="DP116_24105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_24105" /translation="MLMLSCEPSTLRVLVVDDHELTRLTLKLAFSGQENLQVVGLASN GQEAVEMVKRYHPDVIVLDLHMPVMDGWRASGHIKAIAPNTQIIAYSSVEDNKCQETR ELASLDAFCKKETPTTELIALVRKLGQSSSNDW" gene complement(3213..3392) /locus_tag="DP116_24110" CDS complement(3213..3392) /locus_tag="DP116_24110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456861.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24110" /translation="MDAQIIKERVLKVQSQREYLLSLLEQPNLGTLRVDVNQALEEMD DLIDEYRRTFPQTEV" gene 3830..4378 /locus_tag="DP116_24115" CDS 3830..4378 /locus_tag="DP116_24115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24115" /translation="MLGQISLGTLGISIGGVLTIIGFAAYAADNATLNLIGFFYGIPL LLGGLALKANELKPVPLSQPTTPQVLTLRKEQATPTQNQIRQDITRYRYGQDVHLYQA LSFLGLGATDDDIPAVTELQETEINGAYALILKFDSPAVPIEVWQQKQEKMTSFFGPG VEVKVTQPESEKIDLTLIATKK" gene complement(4437..4697) /locus_tag="DP116_24120" CDS complement(4437..4697) /locus_tag="DP116_24120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319902.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chlororespiratory reduction protein 7" /protein_id="PRJNA477356:DP116_24120" /translation="MPDSLMYQQDYFVVLETNQPEQFLSTLELLEKLKGILHKLKFED LPPDLRSFESQDAQAKYLIDTSCELDVGAGEYLQWYAVRLEK" gene 5360..6394 /locus_tag="DP116_24125" CDS 5360..6394 /locus_tag="DP116_24125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748653.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diguanylate cyclase response regulator" /protein_id="PRJNA477356:DP116_24125" /translation="MSGISPFFMTKETPVILVADDDKSMRMLLREAMEKEGYRVVEVS DGKQCLDVYASVKPDLVLLDAVMPIMDGFTCCKQLSKISSNFGNSTLANPVMESSLNT SAISVLWNSTPILMITGLDDPESVDRAFEAGATDYVTKPIHWAVLRQRVRRLLQQAQL YKQFEAATHALQELVNIDGLTGVANRRRFDHYLNTKWLNLAHEQLPLSLILCDIDYFK LYNDRYGHPAGDACLQKVAAVLNRLAQRNEDLVARYGGEEFAVIMPNTHAAGAVHVAA SIQAGVRELQMDHSESEVSRYLTLSLGVATTFPNFESSLTTLGMAADKALYQAKAQGR NRIILKQVKC" gene complement(6482..6553) /locus_tag="DP116_24130" tRNA complement(6482..6553) /locus_tag="DP116_24130" /product="tRNA-Thr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(6519..6521),aa:Thr,seq:cgt) gene 6577..6849 /locus_tag="DP116_24135" CDS 6577..6849 /locus_tag="DP116_24135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3493 domain-containing protein" /protein_id="PRJNA477356:DP116_24135" /translation="MTQQNNKTRINSEQYARLKAEATAPYRGLRQFIYITCGASGFIG VVIFLSKLLAGRDVESALPNLALQIGVVAIMVFLWRWEQRRQKRPK" gene complement(7305..7538) /locus_tag="DP116_24140" CDS complement(7305..7538) /locus_tag="DP116_24140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747918.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24140" /translation="MNIKLIIFSGVMTALVGAVIGLAAAQIGQRNFNQLKYETQYYKD LHNRYALIGASLGFIIGVGQECVRELKADGDDK" gene complement(7927..9558) /locus_tag="DP116_24145" /pseudo CDS complement(7927..9558) /locus_tag="DP116_24145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015155299.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hydrophobe/amphiphile efflux-1 family RND transporter" BASE COUNT 2777 a 1969 c 2033 g 2832 t ORIGIN 1 ccatttttgt aaactgtccc gcaagtagcc tttcacccag ctattcagct tgcggcgttt 61 gaccgggatt ttgtctctac gggagttggt cacttctcaa cctgagcaaa agattggcaa 121 cgtcatgact caagatgttg tgtttgtgta tacggataca gatcaagaag aagtcgctag 181 gatgatacaa cgatatgact ttgttgctgt gcccgtggtg gatcgagaac agcgtcttgt 241 tggtattatc accgttgatg atatcataga tatcttacaa gaagaaacaa ctgaagatat 301 ctacaccttg ggtggtgggg tacaatcggg aggcgacagc tattttcaat cgaatataat 361 ttcagtggct cgcaagcgag ttgtctggtt gttagtctta cttgtaacca acactgttac 421 aggaactatt attaaatcac aagaagatat tttgtcaaaa gtggtgacgc tggcggcgtt 481 tatccccttg ctaactggta ctggtggtaa tgttggggcg cagtcttcca cagtggtgat 541 tcgcgggatg aacacggatg aaattcgtgc tttgggacca ttgcaagttg ttggacggga 601 agcacttgca ggtgcgttac tgggaggaat actgggaacg atcgccaccc tctgggcttt 661 ccttttggaa aagaatttgg aggtggcgtt aagcgtagct ctgccgtagg caatcgcagt 721 cggaagtagt cttgtagcta tttctatttt agcctccgtc tctggttccg cactaccgtt 781 tttgttccgc caactccgtt tagatccagc attgatgtca gcacccttta tcaccacagc 841 cgtggatgtt ttgggtgtgt tgatttactt taacttggca cgggttattt taaaaatgta 901 agaagaactc agcagtcagc agacaggaaa aggagaaaaa atcaacattc agagttaagc 961 gttaagagtt aagcgttaag agttaagcgt taagcgttcc ctgttaagag ttaagcgttc 1021 cctattctgg cttctgaacg gcagtcgcac gccacttgct acaacggggg gaacccccgc 1081 aacgcagtgg ctccccaagt cgggaaaccc gcccacggcg 
ctgcctcctc ctggattctt 1141 ctttgagaac ttcaaaatct aaaatagata tgcgcgttat tatccaacga gtcaaatcat 1201 ctcaagtgac cgtcaacggt gaaattgttg gtaaaattgg tcgaggccta aacttgcttg 1261 tgggcattgc cgatacagac accgatgctg aacttgattg gatggtgcgt aaatgtttgg 1321 aactgcgact gtttggtgat caacaacagg aatcgcgttg ggaaaaatct gtacaagaga 1381 taggcggcga gttactagtc atcagccagt ttacgcttta tggtgattgt cgcaaaggtc 1441 gccgtccctc ctttgaccgt tcagcgattc ccaaagtagc tcaagatttg tataatcgct 1501 ttgttgataa gttgcgtgaa agcggtttac gggtggaaac aggtgagttt ggggcaatga 1561 tgcaagtctc gatagagaat gatggtcctg tgactttgtt actagagcga gagatagtat 1621 aagggtttat acggtaaaag aaaatctcag cagccgttag acttctatgt acgaaatgat 1681 gagagaatat taaactaacg ataataaaac tttaaattca ctcatcatgg ctcaaattca 1741 gttttccaga ggtaagaacg aagaagtcat tccacaggtg cgcctgacgc gatcaaagac 1801 tggtgacagt tcaacagcaa cgtttatctt ccaaaatccc caagctttgg atagtaagag 1861 tactgaagaa atcactggga tgtacatgat tgatgaagaa ggagaaatcg tcactcgcga 1921 agtcaagggt aaattcatca acggtaagcc agaagtatta gaagccattt atctgatgag 1981 atccaaagat gagtgggatc gctttatgcg gtttatggag aggtatgctg aagaaaacgg 2041 tctggaattc agcgggaaat cttaaactgt catgagtcaa acacaaactc ggactgtata 2101 aagccagaaa aagaatacca ttgttgagtg tatacgctta tcgtaactgc gttttagaca 2161 cttttaccgc tctggacgta tttgcctttt ccacagaggt tttcaggcat ttttcattct 2221 ttgaaagttc tcaagtccta ctcaagcgcg tgttggattt ctgtgtacca ttgaaatccc 2281 ctttcgggca agttgagttg tttgttaagg cgaaatactg cccagtttgg tattacccat 2341 gcctattctg ggcttattct aaactttaca accgcatcaa ttcgtttatt tggtcatctg 2401 ctaaccatcg ttgaactgtg catcgatatg cgttcctaaa ttttttgcga ctcaacgcag 2461 acagtgtcgc attctttttt caaattggta taacttattc taaattgttt gaaaaagttt 2521 tgtctttctt aattattcga ttttttgcta tgagaaaata tgacgttatt caaccattag 2581 atagagtaag tgtaaccaac tattttcata aactgtagag gttatcataa gtcgtcttta 2641 ctctaacgtg gatgttaatg ttgtcctgtg agccttctac attacgtgtt ctagtggttg 2701 acgatcacga actaactcgt ttaaccctaa agttagcctt ttctggtcag gaaaatcttc 2761 aagttgttgg tttagccagc aatggtcaag aagctgtaga aatggttaaa 
cgctatcacc 2821 ctgatgtgat agtactagat ttacatatgc cagttatgga tggttggcgt gcttctggtc 2881 atattaaagc tattgctcca aatacccaaa ttattgctta ctcctcagtg gaagacaaca 2941 aatgtcagga aacaagggaa ttggctagct tggatgcttt ttgcaaaaaa gaaaccccaa 3001 caacagaact tattgctcta gttagaaaat taggacaaag ttcatccaat gattggtagc 3061 tggataaatg atacagcaaa gaaacttggg aaatataaaa ctgggtatct gggaactgga 3121 gaatatggag atcttgaaga gcgtcggtca aactgtcagt tagagtttga cgacgctttt 3181 ttttgtaatt tgtatccctt gttggaactt aactaaactt ctgtttgtgg aaaggtgcgt 3241 ctatattcat caatgagatc gtccatttct tctagagctt gattcacgtc aactcttaaa 3301 gttcccaagt ttggctgttc caacaaactc agtaaatact ctcgttgact ttgaacctta 3361 agcactcgtt cttttataat ttgtgcatcc attgctattt tcctgttttc aactgataga 3421 tttacatcat aacaattggg cataacgcct gaaaaggtac aaatagcgct gtcactgttt 3481 caggtgatac gatagttatc tgttgtttca gcttcgagac caaaaccttt tcatcttgga 3541 gcttgaaaaa cagttcaatt tttaacttcg ctatagccca agctgtcgtt gctatgtaat 3601 tggaactggg tgacaaaacc gcactccata tcaacaagct agacaggcat gactatgcaa 3661 attgtaacta ttgtaccatt ttttactgct ttttgcgatc ctttggttaa aaaatatcac 3721 ttgtgttcta ctgaaatcta cgataatgga ctgctgctct tcctaacttg ttcgttatca 3781 ccgatgatag aaaaataatg gtactaagaa tacataacga actaaaatca tgttaggtca 3841 aatttccttg ggaacgctag gtataagcat cggcggtgtt ttgactatca ttggttttgc 3901 tgcctacgct gctgacaatg ccacactcaa tcttattgga tttttttacg gaatccctct 3961 tctactagga ggattagcac tcaaagctaa cgagcttaag cctgtgccct tgagtcaacc 4021 gacgacgcca caagtgttga cacttagaaa ggagcaagca actcctactc aaaatcaaat 4081 tcgccaagac attactcgat atcgttacgg tcaagacgtg catttatacc aagcgctctc 4141 atttctgggt cttggagcta cagatgatga tataccagcc gtcacagagt tacaagaaac 4201 agaaattaat ggagcttatg ctttgatttt gaaatttgat tcgccagcgg taccaatcga 4261 agtttggcaa caaaagcagg aaaaaatgac cagttttttt ggtcctggag tagaggtaaa 4321 agtcacacag ccagaatcag aaaaaattga tttgacgctc attgctacta aaaagtagtt 4381 gttattcaca ttagttctga gttgtttggt aatcactcag agctaatgaa tcatatttat 4441 ttttctaaac gaactgcata ccactgtaaa tactccccag caccgacatc taactcgcaa 
4501 ctcgtgtcga ttagatattt tgcttgtgca tcttgtgatt caaaagatcg caaatcgggt 4561 ggcaaatcct caaatttgag tttgtgtaaa attcccttga gcttttctaa taactctaaa 4621 gtagaaagaa attgttctgg ttgatttgtt tctaaaacga caaagtaatc ttgctgatac 4681 attaatgaat ctggcataag taagtgactc caatcaagtc taagataaag gacaggtata 4741 aattcgtgaa attagtatcc tatataacaa tcctaaatga ttagtgaaat ctttcatata 4801 ggctatggct tgtccaaagt cgtaacaaat gactcccagt tcgctctttt tcaagagagt 4861 tagaacaatt gcaaaaatct agatttgatc agagtttgag gtgtactaaa tatatgatta 4921 tctaaacaat atctcatttt aagggacaag cgctcgtggc aaacaccaag gcggatctgc 4981 ccttctttga caggttaact tttggggaaa ctctctccct gatgtaacta ggggcaagat 5041 gcatctggta tcttggggca tagggcgatt gccatcttcg ccaacgctaa aatccaaatt 5101 ttccttccat ttttagaaac tcaatttcaa atcggcgaag cgtacgaaat gacttatgat 5161 agctgcgggc atgcctatgg cacgtcttca acttcaaagt agcgtattcc tttaggacta 5221 gtgtaatcgc aaagactcta ttttctgttt ttcaatgttg acttacttat acaagtcacg 5281 gtgttagata attgcgtaat gatttcacgc cgtaatgtat atgttgttta atatttttgg 5341 ctcttatgga atgcaactca tgtcaggcat aagcccattc ttcatgacca aagaaactcc 5401 tgttattctt gtggctgatg acgacaaatc gatgcgaatg ctgttgcgtg aagctatgga 5461 aaaagaaggc tatcgagtcg tcgaggtgag tgatgggaaa cagtgtctag atgtatacgc 5521 atctgtcaaa cctgatttag ttttgctaga tgctgtcatg ccaattatgg acggctttac 5581 atgctgtaag caactgagca aaatatccag taattttggt aactcaacac ttgctaatcc 5641 agtgatggaa tcttcattga acactagtgc aatttcagta ctttggaata gtactcccat 5701 attaatgatt actggcttgg acgatcctga gtcagtggat cgcgcttttg aagcaggagc 5761 aaccgactat gtcaccaagc ctattcactg ggctgttttg cgtcagcgtg tgcgaagact 5821 gctgcaacaa gcacaactct acaaacaatt tgaggctgct acccatgctt tgcaggaact 5881 tgtcaatata gatgggttaa ctggagtggc taatcgtcgg cgctttgacc actatttaaa 5941 taccaaatgg ttgaacttag cacacgagca actgccttta tcactgattt tatgcgatat 6001 cgactatttt aaactttata acgatagata tggtcaccct gctggagatg cctgtttgca 6061 aaaggttgct gctgtcttga accgtttagc tcaaagaaat gaggatttag tagcgcgtta 6121 cggtggtgaa gaatttgctg tgatcatgcc taacactcat gctgctggtg cagttcacgt 6181 
tgcagctagc atacaagctg gagtcaggga attacaaatg gatcattctg aatctgaggt 6241 tagccgttac ctcaccttga gcttaggggt agccaccacg ttccccaatt ttgaatcttc 6301 tttaacaact ttgggtatgg cagcagataa ggcgctttac caagcaaaag cacagggacg 6361 caatcgcatc atactcaagc aagtgaaatg ttgagcggtt caagctttac actagagata 6421 atcaaactac aattaaaata acaattagtt tcactaaaat ttcgcaaact ccttttaaaa 6481 aagccgatga cgggatttga acccgtgacc tgctgattac gaatcagctg ctctaccact 6541 gagccacatc ggcatacctt cggattatta tagcttatga cacaacaaaa taacaaaact 6601 cgtattaatt cagaacaata cgcccgtctc aaggcagaag caacagcccc ctatcgtggc 6661 ttacggcaat ttatctacat tacttgtggt gcttctggtt ttattggtgt agttatcttt 6721 ctttctaagc tacttgcagg acgtgatgtt gagagcgcgt tgccgaattt agcacttcaa 6781 ataggcgtgg ttgctattat ggtttttctt tggcgttggg agcagcgtag acaaaaacgt 6841 cccaaatagc aattgattac agtttacttc tcacgtgttt gaagtttatc tttcttggtg 6901 attcattatt tttcataatc aatttgaaaa tatttttagc ataaaatttt aagaatttcc 6961 ttttatgaaa tgaattacaa gcttttgcga taagccggag gcttgacgct ccgcgtatcg 7021 cgtctcatac ctttagaaaa gtaagctaat aaaaatgttg atatctccat catagctaaa 7081 ttccagttag tcattgctag tatagtagtg ccgataacta gcaaacaaag taagctgtat 7141 ttaatcttca ttcattctca ctttttccaa ataaaatata aaattttcat tcaaaagttt 7201 tccggcaaga ctcatcttta ggaacaacac gctttggaga actttctttt gctctacaag 7261 ttagaaagaa aagtcctcct gactcagcct gatttaatag gcttttattt atcatctcca 7321 tcagctttca gttcacgcac gcattcttgt ccaacaccaa taatgaatcc taagctggca 7381 ccaattaaag catatctgtt gtgcaaatct ttgtagtact gagtttcata tttaagttga 7441 ttaaagttac gttgaccaat ttgagcagca gctaagccga taactgcacc gacaagagca 7501 gtcataacac ctgagaagat gattagctta atattcataa tttctcaaac tcagattgtg 7561 atgaggtaaa actatacaat tagcgctgaa ataaattgtc tgaggagaac agttaccagt 7621 accagccgca gtgaacagct acctacgcag tgaacagtta tcaattgctt gatgactgat 7681 aactgataac tgataactga taactgataa ctggtaactg gtaactgata actggtaact 7741 gctaactgat tagactagtg gtgcaagaag aaagagtaaa acagatacac aacagtgata 7801 gtaaacaaaa tgtggtttat ccaatcgttt gcattccatt gtttgaacat ttgtgtgacc 7861 tccatagaca 
aactttagta gacaaacttt aaacttcttc tgccccgtta tcgctgcaac 7921 ttggtatcac tcattttgtg gagttgtcct aaatgtcggg agtgcttgtt ccttctgatc 7981 tggttgaagc tgaggcggct gaccctgctc tggatgctgt ggcgaactgg gcttgttagg 8041 tttacccggt ttcaagaagc gctcttctag gtttttaata acaacataca gcacggggac 8101 taaaaacaaa cttaagattg tcgaaatcag ataaccgcca aacagcgctg ttcccaaaga 8161 ccaacggctc atagcgccag cgccagaagc tatcaccaag ggccagaaac caactaaacc 8221 agcaaaagca gtcatgagaa tcggtcgcaa gcgttcctca caagcgcgga ttgcagcctt 8281 agtgatattc atccctaact ctaccgactg gttagcaaat tccacaatca gaattgcgtt 8341 cttacttgcc aaaccaatga gcatcaccag cgcgacttga gcatagatgt tattgttaac 8401 cacaggccaa acaccgcctg tttggaaaag attggcacgg aacacgactg caccaagcgc 8461 acccaagatt gccagaggta cggtaatcat gatgatggtg ggatcgacgt agctttcgta 8521 ctgcgccgct aacaccaaga aaaccatgac aaaagctaaa ccaaaaataa tgggtgctgc 8581 accgccagag gatttctcct gaagggcagt ccctgtccaa gcaaagccaa atcctggttg 8641 taaaacttcg ttggcgatcg cctccattgc ttgaattgcc tgtccagtac tatagcccgg 8701 tgctggagct ccttgaattt tgattgacgg ataaacgttg tagtgagtga taacaggcgg 8761 ataagtaatt ctttccaatt gcaccaaaga actcagttga acgagattcc cattttggga 8821 gcgtacgtac agacgactga gatcttctgg attagaacgc actgtcccct ctgcttgggc 8881 gtaaacccga tacagtcgcc caccaagcac atattggttg atatagcttg cacccaggta 8941 ggtctgcatg gtgctaaaaa tatcactgac tcggatgttt tgtgctttgg cttggttgcg 9001 atcaatggaa atttgcatca tcggagcatt gaaggtgaat tgagaaaaga cacctcctag 9061 ttccggtcgc ttctgggctg ctgcaatcac cttttgggta ttgtcgataa gcgcttccat 9121 aggcagggct tgtttgtttt ggatatacat ctcgaagcca cctgtactac ccaagccatc 9181 tacgggtggt gcattgacgg cgatcgccac cgcaccagga attttttctc gcaacccttt 9241 attaattttt tgtagaatgc caaagactga atgctcgcct ccatgacgct cttcccagtc 9301 tttaagcttc acgaaaaaca gcgacttgtt gctcgcgttc ccctcaaacc caaagccagc 9361 gtttccgaca acatgctcaa cttccggcag tggttttacg acttcttcag taatcttctt 9421 gaccagatca acgctgtagt tgagagatac gcttggtggt gcttcgacaa tcgcaaagaa 9481 ataaccttga tcctcttcgg ggataaaccc ttggggagtc gtctggtaaa tccaacctgt 9541 caagactaaa cccccaataa 
gggacttcca aggaataaat taatcaacaa aaaccaagaa 9601 actagattta c // LOCUS NODE_3483_length_9597_cov_4.8036059597 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9597) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9597) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9597 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 297..671 /locus_tag="DP116_24150" CDS 297..671 /locus_tag="DP116_24150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" /protein_id="PRJNA477356:DP116_24150" /translation="MVDSADLPEVPLRALVWMGDSRKNIRAFPSEVQKAVGYALQLVQ AGETPLDAKPFKGVGSGVYEIVKRYDTDTYRAVYAVKIGEKIYVLHAFQKKSNKGIKT PQTDVDLIKQRYKDALAREEQP" gene 668..994 /locus_tag="DP116_24155" CDS 668..994 /locus_tag="DP116_24155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" /protein_id="PRJNA477356:DP116_24155" /translation="MTEESVFEESSGNVFADLGLEDAEELFTRGKIGIVVLHLLKQRN LKQREISKLLGIPQPEVSYLMRGEFQRFSEGKLLTFLKRLDTEITLHLRPRHAGTQTG ETVVSL" gene complement(1138..2061) /locus_tag="DP116_24160" CDS complement(1138..2061) /locus_tag="DP116_24160" /EC_number="2.7.1.23" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017804282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(+) kinase" /protein_id="PRJNA477356:DP116_24160" /translation="MELKQVIIAYKARDSQSKRWAEICAKQLEHRQCQVLMGPSGPKD NPYPVFLASAGQPIDLAVVLGGDGTVLTGARHLAPAGVPILAVNVGGHLGFLTESVDE FQDTERVWDRLLEDRYAVQRRMMLQASVFEGNRTNLEPVSERYLALNEMCVKPASADR MITSILEMEIDGEVVDQYQGDGLVISTPTGSTGYTVSANGPIIHDGMEAITITPICPM SLSSRPLVLPPGSVVSIWPLGDYDLSTKLWTDGVLATSIWPGHRVDVRMADCRAKFII LRENNSYYQTLREKLLWAGTRIRYSSASHAN" gene complement(2066..2371) /locus_tag="DP116_24165" CDS complement(2066..2371) /locus_tag="DP116_24165" /EC_number="1.6.5.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009343500.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit NuoK" /protein_id="PRJNA477356:DP116_24165" /translation="MQIQYFLLLAAALFCIGIYGLITSRNAVRVLMSIELLLNAVNLN LMAFSNYLDSVAIKGQVFTVFVITVAAAEAAVGLAIVLAIYRNRDTVDMEQFNLLKW" gene complement(2451..3062) /locus_tag="DP116_24170" CDS complement(2451..3062) /locus_tag="DP116_24170" /EC_number="1.6.5.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319171.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit J" /protein_id="PRJNA477356:DP116_24170" /translation="MNLAEGVQIVSFGILAVLLIGTALGVVLFENIVHSAFLLAGVFV SIAGLYLLLNGDFVAAAQLLIYVGAVNVLILFAIMLVNKRQAFVPFPTAWVRKALTGV VSLGLFALLSTMVLATPWSLSTAIPPSNTIVLIGQHFFTDFLLPFELASILLLIAMVG AIILARREYLPEQTISPEQQQVLTLPERPRELVSIGESSRNIQ" gene complement(3167..3748) /locus_tag="DP116_24175" CDS complement(3167..3748) /locus_tag="DP116_24175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019493874.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit I" /protein_id="PRJNA477356:DP116_24175" /translation="MLKFLKQVGDYAKEAVQAGRYIGQGLSVTFDHMRRRPVTVQYPY EKLIPSERFRGRIHFEFDKCIACEVCVRVCPINLPVVDWDFDKATKKKKLKHYSIDFG VCIFCGNCVEFCPTNCLSMTEDYELSTYDRHELNYDSVALGRLPYKVTLDPMVTPLRE LVYLPKGVMDPHGLPADAPRPGARPEDLVEQEK" gene complement(3803..4921) /locus_tag="DP116_24180" CDS complement(3803..4921) /locus_tag="DP116_24180" /EC_number="1.6.5.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit NuoH" /protein_id="PRJNA477356:DP116_24180" /translation="MNSGIDLQGTFIKSLMDLGLSAGAAKSIWMPVPMILMIIGATVG VLTCVWLERKISAAAQQRIGPEYIGPLGLLAPVADGLKLVFKEDVVPAKSDSVLFTLG PIIVVLPVFLSYLIVPFGQNLVITDVGMGVFLWIALSSIQPIGLLMAGYASNNKYSLL GGLRAAAQSISYEIPLALAVLAIAMMSNSLSTIDIVEQQSGYGILGWNVWRQPAGFII FWIAALAECERLPFDLPEAEEELVAGYQTEYSGMKFALFYLSSYVNLVLSALLVSVLY LGGWDFPIPINLIASALGVSETNPVLQVVTASLGITMTVLKAYFLVFLAILLRWTVPR VRIDQLLDLGWKFLLPVGLVNLLLTAALKLAFPFAFGG" gene 5068..5298 /locus_tag="DP116_24185" CDS 5068..5298 /locus_tag="DP116_24185" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24185" /translation="MVFAPLPTQEFLLMVKIGICEENLQSWVESVCAAPPPGLASIGG ARGDKTIQNRVALRSALRRNLKIQNSYPPLPL" gene complement(5303..6439) /locus_tag="DP116_24190" CDS complement(5303..6439) /locus_tag="DP116_24190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="citrate synthase" /protein_id="PRJNA477356:DP116_24190" /translation="MMVCEYKPGLDGIPAAQSSISNVDGQKGILEYRGIRIEELAEKS SFLETAYLLIWGELPTSKELAAFEDEVRSHRRIKYHICDMMKCFPESGHPMDALQASA AALGLFYSRRDLNNPVYIRDAVVRLIAKIPTMVAAFQLMRNGNDAIRPHEDLDYSANF LYMLNEKKPDPLAARIFDICLILHAEHTMNASTFSARVTASTLTDPYAVVASAVGTLG GPLHGGANEEVIQMLEDIGSVENVRPYIEDCLERKAKIMGFGHRVYKVKDPRATILQN LAEQLFTKFGHDKYYDIAVEMERVVEEKLGHKGIYPNVDFYSGLVYRKLGIPTDLFTP IFAIARVAGWLAHWKEQLEENRIFRPTQVYDGKHDVSYIPIDQR" gene complement(6555..7049) /gene="sixA" /locus_tag="DP116_24195" CDS complement(6555..7049) /gene="sixA" /locus_tag="DP116_24195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphohistidine phosphatase SixA" /protein_id="PRJNA477356:DP116_24195" /translation="MELYLIRHGIAEERRPDLKDEERSLTKEGREKTEKVAQRLRKLG LHFDLIATSPLIRARQTAEIFIATGLSSKVEECSYLAPDGGIESWVVDWLSPRNYSPQ TQLALVGHEPSLSAWAEILLWGQAKATLVLKKAGMIGVKLPEKGSPLGRSQMFWLTPP KYLL" gene complement(7228..8511) /locus_tag="DP116_24200" CDS complement(7228..8511) /locus_tag="DP116_24200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652771.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional oligoribonuclease/PAP phosphatase NrnA" /protein_id="PRJNA477356:DP116_24200" /translation="MHFNSSVTQISSLPLTGDPTPDDFEIDSKDLVEVPLTRHSGTAL AGEGHFLGLRNNSLVNQKSEELHNTFLAHKQERQLIILQDFPDPDALSCAWAYQLIAQ QYDIKCDIIYAGALSHQENIALVKLTGLPAQRWTVQTLKSKDLSSYQGFVLIDNQGTT SQLVPIVQEVGIPIVAVIDHHSLQSDIKSEFFDVRPSVRATATIFTQYLQYGLLTLDS SINQHVKCATALMHGLRSDTNRLMQAQEEDFMAAAYLSKFYDAQLLNAILQANRSKRV MDVIERSLKNRIVQNNFSIAGVGYLRYDDRDAIPQAADFIVTEENVHTALVYGIVHDE DEELEVVIGSLRTTKLTLDPDEFIKEAFGQDSSGRFFGGGRTSAGGFEIPMGFLSGGN ENSAYAKMKWEVFDSQIKQKLLRLVNPKDNPIQSE" gene complement(8928..9425) /locus_tag="DP116_24205" CDS complement(8928..9425) /locus_tag="DP116_24205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457046.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_24205" /translation="MGKVLVLNASYEPLNITSWRRAVVLLIKGKAERVEFNNKQLYLG FPLPTVIRLRHYVRVPYKEIPLTRRNILHRDGHTCQYCGYTGDDLTLDHVVPRSRGGG DSWENIVTACVRCNVRKGSRTPNEAHMNLRHNPRRPYSSLYFEVTKHLKSGTHQEWQK YVIGL" BASE COUNT 2747 a 2258 c 2035 g 2557 t ORIGIN 1 gagcgctaag gcgcacgctt cgcgttggcg gagcctgtcc gcaggactta cacaactcta 61 ttatgattac tatccctggg tgaagttgtg catgacttag caatgaacac tttactatta 121 tctagcctgg agagaattta gcaaagtttt attgattatc tgactcatgc ttgttgaatt 181 gtaaatttgg taactgacag ctataacggc tttttgtaat tagaaggaaa agacattaca 241 caggtgacat ttcatgctac ggtgctgact tgaacttata ggaaatttca tatattatgg 301 tcgatagtgc ggatcttcca gaggtgccgt tacgcgctct tgtgtggatg ggggactctc 361 gcaaaaatat tcgtgcgttt ccttcagagg tacaaaaagc ggtagggtat gcgttgcaac 421 tagtgcaggc aggggaaaca ccactggatg ccaagccttt taaaggcgtt ggaagcggcg 481 tgtatgaaat tgtcaaacgc tacgataccg acacatacag agcggtttat gccgtgaaga 541 ttggagaaaa gatctatgtc ctgcacgctt tccaaaagaa atcaaacaag gggattaaaa 601 ccccacagac tgacgttgac ctgattaaac aacgctataa ggacgcactc gcacgggagg 661 aacaaccatg acagaagaaa gcgtttttga 
agaaagtagt ggtaacgtct ttgctgacct 721 cggcttagag gatgcagaag aactttttac ccgtggcaag attgggattg tggtacttca 781 cctcttaaaa caacgtaacc tgaaacagcg ggaaatcagc aaacttcttg gcattcccca 841 gccagaagta tcgtatctga tgagaggaga gtttcaacgg ttcagcgagg ggaagctgct 901 cactttcctc aagcgccttg ataccgaaat caccctgcac cttcgccctc gtcatgcggg 961 aacccaaaca ggagaaacag tggtatcgtt gtaggggctt tactggatct gtgaattcac 1021 caaaaaccga ttgtgatagt gccgggtgcg gaatgtggga tacaaacaaa gcctgcagaa 1081 gcaggcttgt ctggtagttg agataaaaag ctctattgag aaactagaaa acaactctca 1141 attagcatga ctcgcactac tgtagcgaat tcttgttcct gcccacagca acttctctcg 1201 cagcgtctga taataggagt tgttctcccg taaaattata aacttagccc gacaatctgc 1261 catccgcaca tcaacacggt gtccgggcca aattgaagtc gctaacaccc catcagtcca 1321 caacttggta cttaaatcgt aatcacccaa gggccaaatg ctgacgacag aaccaggggg 1381 taacacgagg ggacgactgg aaagactcat cggacagatg ggagtgatgg taattgcttc 1441 cataccatca tgtataatcg gaccattcgc agaaacggta taacccgtag aaccagtggg 1501 agtagaaata actaacccgt ccccttggta ttgatcgacg acctcaccat caatttccat 1561 ctctagaatt gaggtaatca tcctatccgc agaagcgggt ttaacacaca tttcatttaa 1621 agcaaggtag cgttcagaca ctggttccaa atttgtccta ttcccctcaa agactgaggc 1681 ttgcaacatc atccgccgct gcacagcgta acggtcttct aagagtcgat cccaaactcg 1741 ttcggtgtct tgaaattcgt ctacagactc ggttaaaaac cccagatgac ctcccacatt 1801 cactgctagt attgggacgc cagccggggc tagatgtctt gcaccagtta aaacagtacc 1861 atcaccgcca agtaccacag ccaagtcaat tggttgtcca gctgaagcca gaaataccgg 1921 atatgggtta tctttcggtc cacttggtcc catcaaaacc tggcattgac gatgttctag 1981 ttgctttgca cagatttctg cccagcgttt actttgggaa tctcgtgctt tataagcaat 2041 aatgacctgc ttgagctcca cagaattacc acttcaggag attaaactgc tccatatcaa 2101 cagtgtcgcg gttacgataa atggcaagca cgatcgctaa acctaccgcc gcctcagctg 2161 ctgctacggt aatgacaaat acggtaaata cctgaccctt tattgctact gagtctaggt 2221 aattggaaaa cgccattaag ttcagattaa cggcattgag cagcaactca atcgacatca 2281 gcactctgac agcattacgg ctagtaatta agccataaat accgatgcag aacaaagctg 2341 ctgcaagtaa taaaaagtac tggatttgca taaactttgg 
tgtcccccca ttgtcagtta 2401 acaattatca gttatcagtt atcacttcgc gctgttcatt aataactgta ttactgaata 2461 ttgcggctcg attcaccgat tgagaccaat tctctgggac gttctggtaa cgtcaaaact 2521 tgctgttgtt ctggagaaat agtctgttca ggcagatact cgcgacgtgc caaaattatc 2581 gcgcctacca ttgctattaa cagcaaaata gaagcgagtt caaaaggcag taaaaagtca 2641 gtgaagaaat gttgaccgat taaaacaata gtgttgcttg gaggtatggc agtagacagt 2701 gaccaaggag ttgccaacac catcgtactt aaaagtgcaa acaatcccaa actgactaca 2761 cctgttagtg ctttgcgtac ccaagcagta gggaagggca caaaagcttg ccgcttgttt 2821 accagcataa tggcaaacaa aatcaacacg ttcactgcgc caacataaat tagcaattgc 2881 gctgctgcta caaagtcacc atttagcaat agatatagtc cagcaatgct aacaaatacg 2941 cctgctaaca gaaaggctga atggacgatg ttttcaaaca gcaccacacc cagtgctgtt 3001 ccaatcaaca acaccgccag tatgccaaaa gaaacaatct gtactccttc cgcaagattc 3061 actgtatttt tgtcctcagt taaaagtcaa cagttaaaag tcaacactca agagtcaaaa 3121 gtcaagagtc aaaaactatg cactgttgac tcttgactaa ttatctttac ttttcctgct 3181 ctacaaggtc ttctggacgc gcaccagggc gaggtgcatc agcaggtaaa ccgtggggat 3241 ccataacacc cttgggtaaa taaacaagtt cacgcaaagg tgtgaccatt gggtctagag 3301 tgaccttgta gggcaaacga ccaagcgcca cgctgtcata gttcaattca tggcgatcat 3361 aagtagaaag ttcataatct tctgtcattg ataaacagtt ggtgggacaa aattccacgc 3421 agttaccaca gaagatacaa actccaaagt cgatgctata gtgtttgagc tttttctttt 3481 tggttgcctt atcgaaatcc caatccacta caggcaggtt gatcggacaa acccgaacac 3541 aaacttcgca ggcaatgcac ttatcaaact caaagtgaat cctaccgcga aatcgttcgc 3601 taggaatcaa tttttcgtaa gggtactgca cggtgactgg acgccgccgc atatggtcaa 3661 aggtaacaga cagaccctga cctatgtagc gaccagcttg aaccgcttct ttggcgtaat 3721 caccaacttg cttgaggaac ttcaacatat tgtgtctctc tctttttaag ctatcagcgt 3781 catttgtctt tcgtgctaat gactaaccac caaaggcaaa gggaaaggca agtttcaggg 3841 ctgcagttag tagcaagtta accaaaccaa caggcagcaa gaacttccat cctaaatcta 3901 gcaattggtc aatacgaacg cgtggtactg tccagcgcaa caggattgcc agaaagacca 3961 ggaagtatgc tttcagtacg gtcatagtga ttcccaagga ggcggttaca acctgcaaca 4021 cgggatttgt ttcactgact cccaaagcgc tagctatcag gttaatagga 
attggaaaat 4081 cccaaccacc cagatacaac actgatacta gcagggcaga aaggactagg ttaacgtagg 4141 aactcaggta gaaaagagcg aatttcatac cagagtattc agtctgataa ccagcaacca 4201 gttcttcttc tgcttcaggt aagtcgaagg gtaatcgttc gcattcggca agggctgcta 4261 tccaaaagat gataaacccg gctggttgac gccatacgtt ccagccaaga atgccataac 4321 cagattgctg ttccacgatg tcgatggtgc tgaggctatt ggacatcatg gcgatcgcca 4381 gcactgcaag tgccaacgga atttcatagc taattgactg cgctgctgcc cgcaaccctc 4441 cgaggaggga gtatttgttg ttggatgcat aaccagccat caacaagcca attggctgaa 4501 tactagacaa agcaatccac aagaaaaccc ccattccaac gtctgttatg accagattct 4561 gtccaaacgg cacaatcaga taggacagaa acactggcag aacaacgatg attggaccaa 4621 gggtaaacag cacactgtca gacttggctg gcaccacatc ttctttaaaa acaagcttga 4681 gaccatctgc tactggagct agcaaaccta aaggaccaat atactcagga ccaatccgct 4741 gctgtgcggc ggcggaaatt ttccgctcta accaaacgca ggttagcaca ccaactgtag 4801 caccaataat cattagtatc atcggcactg gcatccaaat tgacttggct gcaccagcgc 4861 ttagtcccaa atccatgagg gattttataa aagttccttg cagatcaatt cctgaattca 4921 tgtcttttgc tcttcaagtc ctttagatca ttgtccttta caactatcca cacttgttaa 4981 taactgtaaa ttcgcccatt aatatttctg cctaaatcat gcaaaattta agattttttg 5041 caattcccat gcctagtata tcgttggatg gtttttgccc ctttacccac acaagaattt 5101 ctacttatgg ttaagattgg gatttgcgaa gaaaacctcc agtcgtgggt agagagcgtt 5161 tgcgcagcgc cccctccggg gctagcgagt atcgggggag caaggggaga taagacaatt 5221 caaaatcgcg tagcgttgcg gagcgcgttg cgacgcaatc tcaaaattca aaattcttat 5281 cccccattac ccctttagca ctctatcgct ggtcgatggg gatataagaa acgtcgtgct 5341 taccgtcgta aacctgggta ggacggaaaa tccggttttc ttctagctgt tctttccaat 5401 gtgctaacca accagctaca cgggcgatcg caaatattgg tgtaaacaaa tctgtaggaa 5461 ttcccaactt cctatacact aatcctgaat aaaagtcaac attgggataa attcctttgt 5521 gacccagttt ttcctccacc accctttcca tttctacggc aatgtcataa tacttgtcat 5581 gcccgaactt ggtaaacagc tgttctgcca agttctgtaa aatggtggcg cgtgggtctt 5641 tcaccttgta cacacggtgt ccaaacccca taatcttggc tttgcgttcc agacaatcct 5701 ctatgtaggg acgcacattt tctacagagc caatgtcttc caacatctga atgacttctt 
5761 cattcgctcc accgtgtagc ggtcctccta acgttcccac cgcactagct accactgcgt 5821 atggatcagt tagagtagac gctgttaccc ttgcactgaa ggtggaggcg ttcattgtgt 5881 gttcagcatg aagtatcaag cagatatcaa aaatccgcgc cgccaatgga tcaggttttt 5941 tctcattgag catgtacaga aaattggcgg aatagtccaa gtcctcatgg ggacgtattg 6001 catcattccc atttcgcatt aactggaacg ccgccaccat tgtcggaatc tttgctatga 6061 ggcgaacgac agcatcccga atgtaaacag gattattcaa gtcacgacgc gaataaaaca 6121 agcctaatgc cgcagcagag gcttgcagtg catccatcgg atgaccgctt tctggaaagc 6181 acttcatcat gtcgcaaatg tgatatttaa tccgccggtg agagcgaact tcatcctcaa 6241 atgctgcgag ttctttcgat gtgggcaact caccccaaat aaggagataa gcagtttcca 6301 aaaatgaact tttttctgct agttcctcaa tccggatgcc acgatattct agtattccct 6361 tttgcccgtc aacattgcta atactggatt gggcggcggg aatgccatcc aaaccaggct 6421 tatattcgca caccatcatg gcaacaccat ctgctttttt ttgaaagatt ttttcaggct 6481 aacttaccag aaattgttat caccataaca gtatccgttc tcctaaaaat tctttcagaa 6541 acgttactct actgttagag caagtacttg ggtggcgtca accaaaacat ctgactacga 6601 cccagagggg agcctttttc tggtagtttt accccaatca tacccgcttt tttcaaaact 6661 aacgttgctt ttgcttgtcc ccataaaaga atttctgccc aagcgctcaa agaaggttca 6721 tgtccaacca aagcaagttg agtttggggt gaataatttc ttggactcaa ccagtccaca 6781 acccaacttt caataccacc atctggcgcg agataggaac attcctctac cttggaactc 6841 agtcctgtgg ctataaaaat ttctgctgtt tggcgagctc gtattaaggg actggtcgca 6901 atcaaatcaa aatgtaagcc cagctttcgc aatcgctggg ctactttctc cgttttttcc 6961 cgaccttctt ttgtcagtga gcgttcctcg tctttgagat ctggtcgcct ttcttcagca 7021 ataccatgac ggattaaata cagttccacg gcggaatgtc ctttatgagc gtaagctctt 7081 tggagggagc gagtgctaag aaagcgctcg tcggtgcaag tattatttta ccgacagatt 7141 ggaggaaata ctccagagat ctttagtcct tagtcctttg tctttggaaa ctaaggacta 7201 atgactaata actaatgact aatgattcta ctccgactgt attggattat ctttcggatt 7261 cactaatctt aggagcttct gcttaatctg agaatcgaaa acttcccatt tcatcttcgc 7321 ataagcagaa ttttcattac ctccagataa gaaacccatc ggaatttcaa acccacctgc 7381 gcttgtccgt ccaccaccaa aaaagcgtcc gctgctatct tgaccgaagg cttctttaat 7441 
aaactcatcg gggtcgaggg ttagtttggt tgttcttagg gaaccaatca ccacctctag 7501 ttcttcatct tcatcgtgaa caataccgta caccaaagct gtgtggacgt tttcttctgt 7561 cacaataaaa tctgccgctt ggggaatagc atcgcggtca tcgtaccgta agtaaccaac 7621 accagcaata gaaaagttat tttgcacgat gcggtttttg agcgatcgct ctataacatc 7681 catcactcgc ttagaacgat tcgcctgtag aattgcgttc agcaactggg cgtcataaaa 7741 tttgctcaaa tacgctgctg ccataaaatc ttcttcctgt gcttgcatca gtctattagt 7801 atccgatcgc aagccatgca tcaaggcagt cgcgcatttt acgtgttgat ttatgctgct 7861 atctagagtc agtagcccgt actgtaggta ttgagtgaaa atcgttgctg ttgctcgtac 7921 agagggacga acatcaaaaa attctgattt gatgtcactt tgtaaactgt ggtggtcaat 7981 aaccgccact attggtatcc ctacctcttg cacaataggt actagctgag aagtcgttcc 8041 ttggttatct atgaggacga aaccttggta ggaagataag tctttgcttt tcagtgtttg 8101 tactgtccaa cgctgcgcgg gtaaccctgt cagcttgacc aaagcaatat tttcttggtg 8161 gctcaaagca ccagcgtaaa tgatgtcaca ttttatgtcg tattgctgag ctatgagctg 8221 gtaagcccaa gcacacgata aagcatcagg atctggaaaa tcttgaagaa taattaactg 8281 gcgttcttgt ttatgcgcaa gaaacgtgtt gtgcagctct tctgattttt gatttaccaa 8341 tgaattgtta cgcagaccca gaaaatgacc ctcgcctgcc aatgctgtac ctgagtgcct 8401 agtgagtgga acttctacta aatctttgct gtctatttca aaatcatctg gagtaggatc 8461 acccgtcaat ggcaaactcg aaatctgagt cacagaagaa ttaaagtgca tacgaaacgt 8521 ttttataagt gtgaggctcc ttttttttca tcaagagata ctacagtttg tatttcaaca 8581 cttcagccac acattgcgat ctttgcaaaa atatcagagc atgacatatt atctctgagc 8641 ctcttttttt gtcaagaact tatcgttgct aacttgccag gaaaacgcct ttagtaccta 8701 cccctgatct tttggtagat ttagaacgca ccctgttgtt tttggcgtta gtatgtgtta 8761 tactagcgaa agtgctctcg atggatacgc agaattgctg tctcctccat cagctttttc 8821 ggttgttgaa atccataaag agccgtgaag aagctaaaat catcaagcaa aggcaaaaaa 8881 gcgaaaaact gaccactgcc tgccttgttc aaattcagca gaaggtgtca gagtccgata 8941 acgtattttt gccattcttg gtgagtacca cttttgaggt gtttggtgac ctcaaagtaa 9001 agactactat aaggtcggcg tggattatga cgtaagttca tatgagcttc gttgggcgta 9061 cgactaccct ttctgacatt gcagcggaca catgctgtaa caatgttttc ccaggaatcg 9121 ccgcccccac 
gcgatcgcgg aacgacatga tccaaagtta aatcatcccc agtgtaaccg 9181 caatactgac aagtgtgacc gtcgcggtgt agtatgttcc gacgagtcaa aggaatctcc 9241 ttatagggaa cgcgcacata atggcgtaac ctgataaccg tcggaagcgg aaagcccaag 9301 taaagttgct tattattaaa ctctactcgc tctgctttgc ctttaatcag taacacgaca 9361 gctcgccgcc agctcgtgat attgagcggt tcgtaagagg cgtttagaac taaaaccttc 9421 cccattgatt agcgctcaag ctattatttt tacatatatt aacacaatta tattcagctt 9481 tggtgatgaa aaaaaattat actattccaa tagttaaaac taagaattgt aaaaagtcaa 9541 gggtcaaaag tgagccagtg cggtggacgg gttccccggc ataaaggagc cagtgcg // LOCUS NODE_3517_length_9511_cov_4.837986 9511 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9511) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9511) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9511 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..563) /locus_tag="DP116_24210" CDS complement(<1..563) /locus_tag="DP116_24210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137905.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phenylalanine--tRNA ligase subunit alpha" /protein_id="PRJNA477356:DP116_24210" /translation="MTNSLEAQLLELREEGEKAIAAADTLERLEELRVSYLGKKGQLG ALLRSMGQLSAEERPKFGAIANTVKEALQNSLDQQRATLEGAKIQAQLDAETLDVTMP GIYRPQGRVHPLNGIIDKALDIFVGLGYTVATGPEMETDYYNFEALNTPPDHPARDMQ DTFYLPDGNLLRTHTSSVHIRYMEPEEP" gene 812..892 /locus_tag="DP116_24215" /pseudo CDS 812..892 /locus_tag="DP116_24215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006102808.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="DUF86 domain-containing protein" gene 1111..1413 /locus_tag="DP116_24220" CDS 1111..1413 /locus_tag="DP116_24220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015154538.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24220" /translation="MSHEADLVKPAVLLTIIGETVLKDAILKLLKSYNVNDYTITQVQ GEGSHGRRMGDMVGYNTNIEIKTILSLEVSNDILQSIKDYQGKQAVIAFRSYVEIL" gene 1543..2296 /locus_tag="DP116_24225" /pseudo CDS 1543..2296 /locus_tag="DP116_24225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016951334.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 2392..2490 /locus_tag="DP116_24230" CDS 2392..2490 /locus_tag="DP116_24230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315732.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24230" /translation="MRDKLIHDYFNTDVEILCKAVQDDVPQLKIMI" gene 2634..4646 /locus_tag="DP116_24235" CDS 2634..4646 /locus_tag="DP116_24235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874200.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II/IV secretion system protein" /protein_id="PRJNA477356:DP116_24235" /translation="MAHSLPQRRSTAVTTRSQFSPFGNKLVQSGFVNSEQMRQALIES RKSGRSLTEVLESISGRQLSPELYRLHKKQQLFELKILYGVESVDPEVSQIGTRTIGQ LIDTLIPVDICRRHHLVPLAKNAEENPPSVMVAMVDPDNLEAWDDLNRILRPQGWTLQ RIVITQEDYQQLINQYLDEVTVRQKHLEQEKFTDINQDLENLDNLNLDDVPEDKETDL GAAMKGAEDAPIINLVNRILAKALHEMVSDIHVEPQEENLLIRFRKDGVLRQVFDPLP KKIIPAVTTRFKIIANLDIAERRLPQDGRMRRVFEGRKLDFRVSTLPSRYGEKVVLRI LDNSATQLGLDQLITDAETLQIVKDMVSRPFGLILVTGPTGSGKTTTLYSALSELNCS GINICTVEDPIEYSLPGITQVQVIREKGLDFATTLRAFLRQDPDVLLVGETRDKETAK TAIEAALTGHLVLTTLHTNDAPGAIARLAEMGIESFMISGSLIGVLAQRLVRRVCPSC CIPYTPTTTELARYGLSASQQAGVTFYKANTLTLEQIHQAKANNQLCPECHGVGYKGR CGVYEVMRITERLQGLITEDAPTERIKEVAVEEGMKTLLAYSLDLVCQGYTTLEEVER VTFTDTGLEAELKAKRKRSLTCRTCHAGLEAEWLDCPYCMTPRFQE" gene 4796..5917 /locus_tag="DP116_24240" CDS 4796..5917 /locus_tag="DP116_24240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012406947.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="twitching motility protein PilT" /protein_id="PRJNA477356:DP116_24240" /translation="MEMMIEDLMEKLIKMGGSDMHLSAGLPPYFRLSGKLTPIGDQPL SADECQWLIFSMLNNSQRRILEENWELDCSYGMKGLARFRVNVYKERGAYAACLRALS SKIPSFEKLGLPDVVREMSEKPRGLILVTGPTGSGKTTTLAAVIDLINRTRSEHILTV EDPIEFVYEPIQSVIHQRQLGEDTKSFANALRAALREDPDIILVGEMRDLETISLAIS AAETGHLVFGTLHTSSAAQTVDRIIDVFPSEKQTQVRVQLSNSLVAVFSQTLVPRKNP KSGEFGQVMAQEIMVVTPAISNLIREGKTSQIYSAIQLGGKMGMQTLEKVLADLYKAG VISLEDAMSKTSKPDEIKRLIGSTTLALGTRTGVAVKAY" gene 6119..7330 /locus_tag="DP116_24245" CDS 6119..7330 /locus_tag="DP116_24245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456786.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II secretion system F family protein" /protein_id="PRJNA477356:DP116_24245" /translation="MPIFIARVRDSRGKSKREKIVANSLAQARIYLRQQGYIVQEIKK ESQGFDFKEFKTKFTYVSVKDKAVFSRQFAALVNAGVAIVRSLSVLAQQCSNPKLKQA ILSINTDVQSGMNLSEAMRKYPDCFDGLYVSMVEAGEVGGVLDEVLNRLAKLLEDIAR LQNQIKSALAYPVVVGFLAIAIFLAMTIFIIPIFANIYKELGIQLPALTQFMLQVSEV LRSRWLLVIIGVITVGIAYRLYYKTQAGRETIDRLLLKMPLFGELIQKSSIARFSRTF GSLTRSGVPILTSLEIVRDTSGNQVVAKAIDTARAEIQQGGMISIALQKEKVFPPMAI QMISIGEETGELDAMLMKIADFYEDEVEQTVKALTSILEPIMIVVLGGMVGTILLSMY LPMFKVFDKLG" gene 7440..7901 /locus_tag="DP116_24250" /pseudo CDS 7440..7901 /locus_tag="DP116_24250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128281.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="16S rRNA (uracil(1498)-N(3))-methyltransferase" gene 8052..8321 /locus_tag="DP116_24255" /pseudo CDS 8052..8321 /locus_tag="DP116_24255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318759.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="16S rRNA (uracil(1498)-N(3))-methyltransferase" gene complement(8455..9018) /locus_tag="DP116_24260" CDS complement(8455..9018) /locus_tag="DP116_24260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874441.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24260" /translation="MIGFIKNLITGVLNFFTGLLGGNKNGYYLELDEAAEATKEAAKQ AASKAQEAVEPVVSKAQEAVGSVVSKAQEAVEPVTSKAQEAIKPVAAKAKKTAKSVAT QAATNNGTKDAVKPQPNVQLVQTAVGVKPEPVQSEKAKAIIQKEPTETTFAPKYLAVP NTSNGRRRPGANMSSFLDMANQVKTSK" BASE COUNT 2664 a 1893 c 2179 g 2775 t ORIGIN 1 ggttcttcag gttccatata acgaatgtgt actgacgagg tatgagtccg caggagattt 61 ccgtctggca ggtaaaaagt atcctgcata tcacgggcgg ggtggtcggg tggggtgttc 121 agagcttcaa aattgtagta atctgtttcc atctctggtc cagtagctac ggtgtagccc 181 aacccaacaa agatatccaa agctttgtcg atgatgccat tgaggggatg aacacgacct 241 tgaggacggt aaattcccgg cattgtcaca tctaaagttt cagcatctag ctgtgcctga 301 attttcgccc cttccaaggt tgcacgttgc tgatctaggc tgttttgcag ggcttctttg 361 actgtgttgg cgatcgcccc aaattttggt cgttcctccg cactcagttg tcccatactc 421 cgcaacagcg ccccaagttg accttttttt ccgaggtagc tgactctaag ttcctccaga 481 cgttctagcg tatcagcagc ggcgatcgct ttttctcctt cctctcgcag ttctaaaagt 541 tgagcctcta aactgttagt cattagtttt tagtcattac actattcatt agtcatagat 601 ccaaggcttt ggactactgg agatataaca tatcgttgac accgttattc tgccgtatga 661 agaacttgaa tcgtcacaag acaattgatt tgtcaaagac aatattataa cccaaagttc 721 tttttacctg 
tttagccgct ctgtaagcaa gccagaatca gatttacata agctgttatg 781 ctgctagcta tcagtggggg tagtgcttta tattccctgg agagatattg caggaatgcg 841 ggataagttg attcatgatt atttcaatac agatagctat cagtgggggt agtgctttat 901 attccctgga gagatattgc actggagttt agactaaaag tcgttactca ctcctataac 961 agttcaaccg gactaagcaa cctgttgaca ccttgtggac acggatatgt caggaaatca 1021 atgtttgcgc aaaaaaaacc caatcatgtg aagattaaag aagtcatcaa ttaacaaacg 1081 caacataaaa cctgttctat ggagttaccc atgtcccacg aagctgattt agtcaaacct 1141 gccgttctcc tgaccatcat tggtgaaaca gtgctgaaag acgctattct caagctattg 1201 aaaagctaca atgtaaacga ctacaccatt actcaagtgc agggtgaggg tagccacggg 1261 agacgcatgg gagacatggt gggctacaac accaacatcg aaattaaaac aattttgtca 1321 ctagaagtat ctaatgatat tcttcaatct atcaaagact atcagggtaa gcaggcagtg 1381 atcgcctttc gctcttacgt agaaatattg tagtgattaa tactttaact tgagtttaga 1441 ctaaatgggc aatgtcaaaa aaaagggggt ctgattgaat acagtcccct tttggcagaa 1501 gctgagaaaa gaactaccat tactcagctg ccaatatgaa acttgccaaa ctccaagaat 1561 tacgcccgtg cggcttacag gcatttaggc aaagcccatg atgccacatt tgaattgact 1621 gatgccatac tgctgacccg taacgcctac agcttggcgg atttatctct gtctccagtt 1681 ttccgacgca agtggcctag catttatgtt cgcgaagcgt gtccccttgg gactcagcgt 1741 tacaagatag cagaccacag cgacagaaat tgatgcaatt atacatcaaa cagatgaagt 1801 ttggcgattg tatttacgtc gctttacgat tgaccactgg tatcgttttt taaagcaacg 1861 cctgcattgg acactgccaa agctcagtac tcccaaacag tgtgttagcg tagcgtgccc 1921 gttaggcgca gccgtggcgt tagccatagg gcatacgatg gagtgacttg atgccaatga 1981 ttacttggga gttatggtta gctcgtgata ttgtgagacc agcgcgcatg aggcgttttc 2041 ccgccgtagg cgactggcga acccgaaggg ttgctgataa tcctttaccc tggcagaagt 2101 caattgacaa attgactcct ggaagagttg cccaagctat gggaggaatt ttattagtac 2161 tcctgcacga tcgcccaaac ctcgcggaaa gtcacccggt tggactcccg gacaaacacg 2221 actacgtagg attcgctacc caatcatcaa aaaaggtacg aatacttccc gtaaacgaca 2281 gccacaatcc gcttaagatc caaaattgtt catcgttgag acttgtatct catcaactca 2341 gcattgctga gtcgttttgc attacttagt ctaaactcca gtattgcagg aatgcgggat 2401 aagttgattc atgattattt 
caatacagat gtggaaattc tttgcaaggc tgttcaggac 2461 gatgttccac aattaaaaat tatgatttaa caagttttag aagatttaac aggtgatgcg 2521 tcactctaaa ctacttaaga attattgtct caatttttat cagtgtagat gtcaattcta 2581 tctcacagct tgggaatact ctaattagag tttccgtgac taattgctca tatatggctc 2641 actcgttacc acaaaggcgt agtactgctg ttactacaag atctcagttc tcgccctttg 2701 gcaacaaact agtgcaatct ggctttgtca atagcgaaca aatgagacaa gcactgattg 2761 aaagtcgcaa gtctggcaga tccctgacag aagtgctgga atctatttcc ggacgacagt 2821 tgtcaccaga gctatatagg ttacacaaaa agcagcagct gtttgaactc aaaatattat 2881 acggcgttga atctgttgat ccagaagtga gtcagattgg cacaagaaca attggtcaac 2941 tgattgatac cctcattcca gtagatatct gtcgtcgcca tcacttggta ccactagcaa 3001 agaatgccga ggaaaacccg ccctcagtta tggtggcgat ggtagatccg gataatttag 3061 aggcttggga tgatttgaac cgcatcttgc gtccgcaggg atggactttg cagcggatag 3121 tcattactca agaggattac cagcaactta ttaaccaata tttggatgag gtgactgttc 3181 gacaaaagca cctggaacag gaaaagttta cagacattaa tcaagactta gaaaatctag 3241 acaatctcaa tttagatgat gtacctgaag ataaagaaac tgacttaggt gcagcaatga 3301 agggtgctga ggatgccccg attatcaatt tagttaacag aatccttgcg aaagcactac 3361 atgagatggt ttctgatatt cacgtagaac cgcaggaaga aaacttactc attcgttttc 3421 gtaaggatgg tgtgttgcgt caggttttcg atcctctgcc caaaaaaatc attcctgctg 3481 tcacaactcg tttcaaaatt attgccaacc tagatattgc tgaacgacgt ttacctcaag 3541 acggacgcat gagacgagtg tttgagggac gcaagctgga cttccgcgtc agtaccttgc 3601 ccagtcgcta tggggaaaag gtcgtactac gaattttgga taactctgca acccaattgg 3661 gattggatca gctgattact gatgcagaaa ctttgcaaat tgtcaaggat atggtcagtc 3721 gtccctttgg tttgattttg gtgaccggac ccactggttc tggtaaaaca acaacgttgt 3781 actcggcact ttcggaactg aactgttcag gtattaatat ttgtacggtg gaagacccaa 3841 ttgagtacag tttacctgga attactcaag tgcaggtgat tcgggaaaaa ggtctggatt 3901 ttgcgacaac tttacgggct tttttgcgac aagatcccga tgtgcttctg gtgggtgaga 3961 cgcgagacaa ggaaacggcg aaaacggcaa ttgaggcagc attaactggt cacttggttt 4021 tgacaacttt gcatacgaat gatgcaccgg gggcgatcgc ccgtttggca gaaatgggca 4081 ttgagtcttt catgatttct ggttctctga 
ttggtgtatt agcacagcgc ttagtacggc 4141 gtgtctgtcc cagttgttgc attccctaca ctcccaccac aacagaactt gctcgctacg 4201 gtttatcagc aagccaacaa gcgggtgtta ccttttataa ggcaaatacc ttaactttag 4261 agcaaattca ccaagctaaa gcaaataatc agctttgccc agaatgtcat ggtgttggct 4321 acaagggacg ttgtggtgtt tatgaagtta tgcgaatcac agaacgtttg caagggctga 4381 ttaccgaaga cgcaccaacg gaacgcatca aagaagtggc ggtggaagaa gggatgaaaa 4441 ctttactggc ttacagtttg gatttagtgt gtcaaggtta caccactttg gaagaggttg 4501 aacgagtcac gtttactgat acgggtttgg aagcagagtt gaaagctaaa cgtaagagga 4561 gtctgacttg tcggacttgt catgctgggc tggaagctga atggttagat tgtccatact 4621 gtatgacacc aaggtttcaa gagtagttac tcttttcccg ccaaaagatt actgatttta 4681 aaaccgcaga ggcgcagaga acgcagagaa aatacgcttc gctgtcagag gtaattttag 4741 agtaccgaag ggaatcgtca ctagtcatta gtttggaata aaggaagtta gaaaaatgga 4801 aatgatgatt gaagatttga tggagaagtt gattaaaatg ggtggttcgg atatgcattt 4861 atctgcaggt ttgcctccct actttcgcct cagtggtaaa ctgacaccta ttggtgatca 4921 gccattgtcg gcagatgagt gtcaatggtt gatttttagt atgcttaata attcccagcg 4981 tcgaatctta gaggaaaact gggagttaga ttgttcttat ggtatgaagg gattggctcg 5041 ttttcgggta aatgtttata aagaacgtgg tgcttatgcg gcgtgtttgc gggcgttaag 5101 ttccaaaatt ccaagttttg aaaaattagg tttaccagat gttgtacggg aaatgtcaga 5161 aaaaccaagg ggattaattt tggtaacagg tcccacaggt tctggtaaaa caacaactct 5221 agctgctgtt attgatttaa ttaatcgcac tcgttcagag catattttga cggtagaaga 5281 cccgattgaa tttgtctatg aaccgattca aagtgtgatt catcaaaggc aacttggcga 5341 ggatacgaaa agctttgcca atgctttaag agcagcattg cgggaagacc cagatatcat 5401 tttggtgggt gaaatgcgcg atttggaaac tatttcctta gcgatttccg ctgcggaaac 5461 aggacacttg gtctttggaa ctttgcacac gagttcagca gcacaaactg tagatagaat 5521 tattgacgtt ttcccatcag aaaaacaaac tcaagtgcga gtgcagttgt ctaactcact 5581 tgtggcagta tttagccaaa ctttggttcc tcgaaaaaat ccaaaatcag gcgaatttgg 5641 tcaagtgatg gctcaagaaa ttatggttgt gactcctgct atttctaacc taattcgtga 5701 aggtaaaaca tctcaaatct actctgctat tcagcttggt ggaaaaatgg gtatgcagac 5761 tttagaaaag gtattagcag atttatataa ggcaggagtt 
atctctttgg aagatgcaat 5821 gtctaaaact tctaagccgg atgagattaa gcgtcttatc ggtagtacaa cactagcact 5881 aggaacaaga acaggagtag cagttaaagc atattaaaac acagggaata gggaataggg 5941 aatagggaat agggaacagg gaacagggaa cagggaacag agaacaggga acagggaaca 6001 gggaacaggg aacagggtag aaggggcact ttcaatcaac aagcagtcct ggttgggggc 6061 tgtgagtttg ttatttttga attttgaatt ttgaattttg aattttgaat tgcttaatat 6121 gccaatattc attgctcgtg ttagggactc gcgaggaaaa tcgaaaagag aaaaaattgt 6181 ggctaattcc ttggctcaag ctcgtattta tcttaggcaa cagggttata ttgtacaaga 6241 aatcaaaaaa gaatctcaag gctttgactt caaagaattt aaaaccaaat ttacgtatgt 6301 ttcggtgaaa gataaagccg tattttctcg tcaatttgct gctttggtaa atgcaggagt 6361 tgcaattgta agaagtttga gtgtgttagc ccagcagtgt agcaatccta aacttaaaca 6421 agcaattttg agtattaaca ctgatgtgca aagtggtatg aatctttctg aggcaatgcg 6481 gaaatatcct gattgctttg atggtttata tgtcagtatg gttgaggctg gggaagttgg 6541 tggtgttttg gacgaagtgc taaatcgttt agccaagttg ttggaggata ttgctcgctt 6601 acaaaaccaa attaaatcag cattggctta tcctgttgtt gtgggatttt tggcgatcgc 6661 catctttctc gcgatgacga tttttatcat cccaattttt gccaatattt ataaagaatt 6721 gggtattcaa ttacctgctc tgacgcagtt tatgctacag gtaagtgaag tgttaagaag 6781 tcgttggctt ttggtgatta tcggcgttat cacagtaggt atcgcctatc gactgtacta 6841 caaaactcag gctggacgtg aaaccataga ccgtcttttg ctgaagatgc cgttgtttgg 6901 tgagttaatc caaaaatcat cgattgcacg ttttagccgt acctttggtt ctttaactcg 6961 ttcaggagtt ccgattctga cttccttgga aattgtgcgg gatacgtcag gaaatcaggt 7021 ggttgctaag gctattgata ctgcgcgtgc agaaattcaa caaggaggta tgattagcat 7081 tgctttgcaa aaagagaaag tttttccacc aatggcaatt cagatgatta gcatcggaga 7141 agaaactggt gaattagatg ctatgttgat gaagattgct gatttctacg aagatgaagt 7201 ggaacagaca gtcaaagcac taaccagcat tttggaaccg attatgattg ttgtcttggg 7261 ggggatggtt ggtacaattt tgctatctat gtacctgcct atgtttaagg tgtttgataa 7321 gcttggctaa ttatcagtta tcagttatca gttatcagtt atcagtactt gttgtcgtct 7381 gaaaccttga taattgttca ctgttcactg tttactgttc actgataact gttgatatca 7441 tgtcacaact gcaacgaatc gcaatagcac catcccaact tcaacaagag 
caaatattgc 7501 tgacgaaaga gcaacaacat tatctggaac gtgttttgcg cttgcgcgag ggcgatcgct 7561 ttatcgcaat aaatggcaag ggtcaatggt ggctggcgca gttggaagga gaaaaagcac 7621 aaattttaga gtcgctaacg gtggaaacgg agttacccgt atccataacg ttgatggtag 7681 ccttgcctaa aggaaatgga tttgatgatg tggtgcggtg ttgtactgag ttgggagtcg 7741 ctgttcttgt tccggttgtg agcgatcgca ctttacttga tcctagtcct caaaaattcc 7801 aacgctggtt gcgtattgca caagaagctg ctgagcaatc agagcgttct tttgtaccta 7861 caatattaga acccgtttct tttataacta gtcttgccac agggacagca aatgaccctt 7921 cgggtatctc ctgcacagac gctacgcgtt cgccctctgg gcgtgcgctt gcgcttacgc 7981 agtcgccaag tgagggaaag ccgtcattcg caccgcagtt tctacatgag ggaaaccctc 8041 ctccgaactg ctcgctgtct caccgttaca tttgtgaggc tcgtggcagt tatcctcatt 8101 taaaagaggc aatcaatcaa gtttttacaa catctgagat tatcattatc actgggccag 8161 agggaggatg gacagataaa gaattgaagc acgctattga cgctggattt caacctgttt 8221 ccctcgggcg tcgcatcttg cgggcagtca cagccccaat tgttgcctta tctgttgtgg 8281 cagcacagtt agaatcagtg acattcgatt cagcaagctg aaaacagtat accttttgta 8341 gggacagctt tagatataat aatgaagaaa attttgtaaa aatagttact tttttataaa 8401 tatcatcaat taatagtagg gtgagcaacg cccaccctaa agaatatcca aaacctactt 8461 agaagttttc acttgatttg ccatgtctaa gaaagaactc atgttcgcac ctggacgccg 8521 acgaccattg ctagtattag gtacagcgag atattttggt gcaaaagtgg tttcagttgg 8581 ctccttttga attattgctt tggctttttc tgattgaaca ggctcaggct taacaccaac 8641 tgcagtctgt actaactgaa cattaggttg tggcttaaca gcatcttttg ttccattgtt 8701 tgtggctgct tgtgttgcaa ctgacttagc cgttttctta gcttttgctg cgactggttt 8761 aatggcttct tgagctttgg atgtgactgg ttcaaccgct tcttgagctt tcgagactac 8821 tgatccaacc gcttcttgag ctttcgagac tactggttca accgcttctt gagctttgga 8881 tgctgcttgc ttcgctgctt cttttgttgc ttcagcggct tcatcgagct ctaggtagta 8941 gccgttcttg tttccgccca aaagcccagt aaagaaattc agaaccccag tgattaaatt 9001 tttgataaaa ccgatcatta caatttctcc tgcctcttat acctggaatg tgaccgtaat 9061 ccttagaaaa tcacggaata tttttaattt ttatcaagat attcttgacc tacccattag 9121 cctgtacaac cagttacggt gacagccttc taacggttta tttcttaaca aagtggctat 
9181 gactgaacag ggctagtgct tttcactctt tactttcagg cacttgccac ctgtaaacag 9241 tcagatttta ttattaaatt ttactgtaac ttgaggcaat atttcgacag cctgcttaac 9301 aaaagttaac ttgtactttc agatatgcgt atcgactctt gaaagaccct attccctatt 9361 ccctattccc tattccctgt taagagttaa gcgttcccta ttccctattc cctattccct 9421 attccctatt ccctgttaag agttaagagt taagagtagc cctgtcaagg tgactttttt 9481 gcagtcttca aaagtttggc agttgagaca t // LOCUS NODE_3533_length_9450_cov_4.016285 9450 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9450 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 123..2210 /locus_tag="DP116_24265" CDS 123..2210 /locus_tag="DP116_24265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009554090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="adenylate/guanylate cyclase domain-containing protein" /protein_id="PRJNA477356:DP116_24265" /translation="MLLVVSLCAMLASTFICSNAGKSILTEKIFNQLTSLRAVKTYQI QDYFENLYNHSQTLSEDLTVVAAMQEFKTAYRQLEESKVPADFDKKIDTYYQTKFLTK LARTNEGSPVLASYTPKTTAARYLQYHYIAANPNPPGKKLLLDQPGDASSYSRVHARY HPIFRNIVEKYGYYDMFLIDPEGSVVYTVFKEADFTTNLTNGPYKESNLAEAIAAARG ANGKGYVKIVDFKPYSPSYGAPAAFIAAPIFNGPEFIGILAFQLPVDKINNVMTGNKH WKQNGLGDSGETYLVGPDYLMRSASRFQIEDPKGHAKTLRSIGTDENTVKKIEEFKTT ILLQEVQTKAVKEALFGKQGTQVINDYRDIPVLSSYAPLDIDGLKWAILAEMDVSEAY APIHSFEKTILIAATLIIALITLVAMSLTAIFVKPIKTLIASARKVGAGEFDAVVKSG SQDEFGELAKSFNQTIDSLRAETQLIEQKNRENEALVLNIFSPAIAKRLKQGDREIAD QISNVSVLFSDLERFTKLSQSMSPQEVVGVLNELVTAFDEMTEKYGIEKIKTIGDGYM AVCGLSVPRLDHDKRMVEFASEMLAFVRRFNYEKDLHLDLRIGINSGDVVAGVIGKDK LLYDVWGDTVNTANRLKSACPPGGIFVSQNICDRLRDLYEFEPVGEIQESGKQKLVAW QLKGIQQTVSVPR" gene 2218..3690 /locus_tag="DP116_24270" CDS 2218..3690 /locus_tag="DP116_24270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019502396.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="small-conductance mechanosensitive channel" /protein_id="PRJNA477356:DP116_24270" /translation="MHNISLHNQDFIWMLTVAFGFPILVIVLGEVVHLLQMRRKPIAA TVRIVKNLVLPVFMFMIFFKYILKVNTGGSFVKIVETLFWISVIHAALSLINTILFAE AEANTWRAKMPKLLTDLFRLFLVLVGTAIVLALVWGADLAGMVTALGVGSVVIGLALQ DTLGSIMSGITLLLERPFNVGDWLRVGEKVEGQVIDINWRSVRLLTLQRQVIIVPHQV IGKEIVCNHSLPERLYNQRIKIGFSYDSPPNLVKQVLTSTALSTQGIVAEPEPESKTS SYDETAIMYEVEFFIEDYENVEQILDRFMTRVWYAARRNNLVLYRYRYEYSSEPAVKT DTPSSQLTQNLNSIPGFVPLTKQQENLDDLAKGSTLQHFGAGEKVIRQGDSDNALYVI IAGQAAVTVKNESGKEQEVMTLSRGEFFGAMALFRGESNPVSVTAINDLEVLVIHSDV VDTMIERQPSLAREIGQIVEARRKSVNMAQQAEVPVNRHF" gene complement(3799..4476) /locus_tag="DP116_24275" CDS complement(3799..4476) /locus_tag="DP116_24275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877231.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="PRJNA477356:DP116_24275" /translation="MEKSLQTVFHEIASVSNEQELRLALIDTVAEYFGVQHWGIHLLD EPFLTEIDVQDVPGVCMESNPVGRYVVERHAPAHEQLVLPTGEWKHFCSRHDHEHVMS GPIVCDGRLVGTLNLARASGTPAFNVNDLADMSALCLHVSAKLATLRAKPKTSVSPLV SRLTPRELEIAELVARGLTNAEIGEKLWITQNSVKQALKRMFRKLEVSARAEMVAKLQ DMVSVIG" gene complement(4542..5720) /locus_tag="DP116_24280" CDS complement(4542..5720) /locus_tag="DP116_24280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317152.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24280" /translation="MRVPVLSPSGKPLMPTKPSRARRWLKEGKARVVHNDLGIFQIQL VVCPRTTNMQPIAVGIDPGKYFTGMGVQSAKFTLWLAHLQLPFQIVRERMEQRRMVRR GRRGRRINRKVAFKVRAHREKRFDNRGGRKIPPSIRANREFELRVLDELSLIYPISTV VYEIVKARGDKGFSPVMVGQKWQLTKLGNDWDDVREVEGWQTANIRQQLGLHKQKYSK GDTIPATHAVDGIALGCSAFTRYGAIDCQSMGWKGHVSITPAPFTVIRRPPVSRRQLH LMVPLKGGTRRKYGGTVTRHGFRKGDLIKTPSGEIGYCSGDTEKALSVSDADWRRLGR FSPKKSRLVQRNTGLIVLPTKRLSNLLAMRVHVACPFGLKPCRRHRTEDSVHATRTNQ " gene 6102..6758 /locus_tag="DP116_24285" CDS 6102..6758 /locus_tag="DP116_24285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458101.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24285" /translation="MTVTQVSAQELFRAAYENRYTWDTNFPGYTADVTYKQDEQVFTG KIRINSNLKAEVFEIEDEQAKQVIHNQAWEIAIHRIRRSFEQTHGENTFRYGATDETG AVEIFLGGKSEGDYYKVRNNEVSLVHRHIHNVVVTINTFSSHDTGEGYLSHEYDSVYH DPKTGEQKGGRSEFTDEYEKVGDYFILNRREIRTQTAAQPSIQEIIFSNIQLLEPVAA " gene 7121..7621 /locus_tag="DP116_24290" CDS 7121..7621 /locus_tag="DP116_24290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651685.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HugZ family protein" /protein_id="PRJNA477356:DP116_24290" /translation="MSQFETALAAYQSFTDSFQSLIISTVSADNTPNASYAPFVIDKS KKIYIYVSGLSTHTQNLHAVPKASVLFIDDESQTKQIFARRRLTFDCTATLVERDTEL WNQIVDSFEARFGEMVQILRDLPDFRIFQLTPSKGRFVIGFGAAYEVDPNDLSTLTHV TGESKG" gene 7772..9250 /locus_tag="DP116_24295" CDS 7772..9250 /locus_tag="DP116_24295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320645.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase GatCAB subunit B" /protein_id="PRJNA477356:DP116_24295" /translation="MTSAALVKTEYEAIIGLETHCQLSTKTKIFSSSSTAFGADPNTN IDPVCMGLPGVLPVLNEKVLEYAVKAGLALNCQIAKYSKFDRKQYFYPDLPKNYQISQ YDLPIAEHGWLEIELVDADGNPVRKKIGITRLHMEEDAGKLVHAGSDRLSGSTYSLVD YNRAGVPLVEIVSEPDLRSGQEAAEYAQELRRILRYLGVSDGNMQEGSLRCDVNISVR PIGQKEFGTKVEIKNMNSFNAIQRAIEYEIERQTAAVEAGERIIQETRLWEEGSQRTI SMRIKEGSSDYRYFPEPDLAPIEVSEEQLQEWRAQLPELPALKRDHYESELGLTAYDA RVLTEERATAEYFEAVIAAGGNPKAAANWITQDVAAYLNKNKDLKITEIGLTPANLAD VITRIEKGKISNAIAKEKLPDLLNGTSPEELFKGKELITDPSVLESIIDEVMAANPKE LEKYRNGNTNLKGFFVGQVLKKTNKLAEPKLTNQLVEQKLNG" BASE COUNT 2721 a 2119 c 2045 g 2565 t ORIGIN 1 tacttagata aaaaaagaaa atctatttta tgcttttgaa gaaattaaaa aatattaaca 61 atgttaactt tccgcgattt ttcaatagcc taggtgtaag tatcaagaca aaacttttag 121 taatgttgtt ggtcgtcagc ctttgcgcta tgctcgcgag cacctttatc tgctctaatg 181 ctggtaaatc tattctcacc gaaaaaatat tcaaccaatt aaccagcctt cgggctgtca 241 aaacttatca aatccaagat tattttgaaa acctttacaa ccatagccag actttaagcg 301 aagatttgac agtcgtcgcc gctatgcagg agtttaagac ggcttatcgc caacttgaag 361 aatcaaaagt cccagcggat tttgacaaaa aaattgatac ctactaccag acaaaatttc 421 taactaaact ggcgcgaact aacgagggtt cgcccgttct cgcatcttat acgccaaaaa 481 caactgctgc ccgttaccta cagtaccact acatcgcagc taaccccaac cccccaggca 541 aaaaacttct ccttgaccaa ccaggagacg ccagctctta cagccgcgtt cacgcccgct 601 accacccgat ttttcgcaat atagttgaaa aatatggtta ctacgacatg tttttgatcg 661 atcccgaagg aagcgttgtc tacacggttt ttaaagaagc cgattttacc acaaacttga 721 ccaacggtcc ctacaaggag agtaatttag cagaagcgat cgccgccgca cgaggggcaa 781 acggaaaggg ctatgtgaaa atagtagatt ttaaaccgta cagcccttcc tacggtgcgc 841 ctgcggcttt tatcgctgcc ccaatcttta acggcccgga gtttattggt attttggcgt 901 tccagcttcc cgttgacaaa attaacaatg tgatgactgg gaacaaacat tggaaacaaa 961 atggtctggg cgattccgga gaaacttatc tggtcggacc agattatcta atgcgttctg 1021 cgtctcgttt ccagattgaa gacccaaaag gtcatgcaaa aaccctacgg tctatcggca 1081 ctgatgaaaa tactgtcaag 
aaaatcgaag agttcaagac tacaattctt ctacaggaag 1141 tacaaacaaa agcagtgaag gaagcgttgt ttgggaaaca gggcacgcag gtgatcaacg 1201 attatcgtga tattccagtt ttgagttctt atgctccact agatattgat ggattgaagt 1261 gggctatctt agcggagatg gatgtttctg aagcctatgc cccaattcac tcgtttgaaa 1321 aaacaatttt gattgcggct actttaatta tcgcgctgat cacacttgtt gccatgtcgc 1381 tgaccgcaat ctttgtcaaa cccatcaaga cgttaatcgc tagcgcccgc aaggtggggg 1441 ctggagaatt tgatgccgtt gtcaaatctg gttcgcaaga tgaatttgga gagttagcta 1501 aatcatttaa ccaaacgata gacagccttc gcgccgaaac tcaactaatc gaacagaaaa 1561 atcgcgaaaa tgaagcatta gtcttaaata tatttagtcc cgcaatagcc aaacgcctca 1621 aacagggcga tagagaaatt gccgatcaga tttctaatgt ctcggtcttg ttttccgacc 1681 tcgagcgatt caccaagctt tctcaatcaa tgtcccctca agaagttgtt ggcgtgctta 1741 atgaactggt gactgcattt gatgaaatga cagaaaaata cggcatagaa aagattaaaa 1801 ctatcggcga tggttatatg gctgtatgcg ggcttagtgt accgcgtctg gatcatgaca 1861 aacgtatggt cgaatttgct tcggagatgc ttgcatttgt tcgtcgattt aattatgaaa 1921 aagatttgca cttggacttg cggattggta ttaactccgg agatgtggtt gctggagtta 1981 ttggcaaaga taagttgctc tatgatgttt ggggcgatac cgttaacact gctaataggt 2041 tgaagtcggc ttgtccgcca ggaggaattt tcgtttctca aaacatctgc gatcgcctgc 2101 gcgaccttta cgaatttgag ccggttgggg aaatccaaga aagcggaaag caaaagctag 2161 tggcttggca acttaaaggt atccagcaaa cagtcagcgt tccaaggtag aaaacaaatg 2221 cataacatca gcttgcacaa ccaagatttt atctggatgt taactgtagc attcgggttc 2281 ccaatccttg tcatcgtact gggagaggtt gttcatctgt tgcaaatgcg ccgtaaacca 2341 atagcagcta cggtgcggat tgtcaagaac ctcgtcttgc ccgtcttcat gttcatgata 2401 ttctttaaat atattctgaa agtgaacact ggcgggagtt ttgtcaagat tgtcgaaacg 2461 ctattctgga tatctgttat ccacgcagcc ctctccctaa tcaatacaat tctgtttgca 2521 gaagctgagg caaatacatg gcgcgcaaaa atgcccaaac tgctaacaga cctcttcagg 2581 ttgtttttgg tattggtagg aaccgcaatt gttctagcac tcgtgtgggg tgcagatttg 2641 gcaggtatgg tgaccgcctt gggtgttggc tcagttgtga tcggtttggc gcttcaggac 2701 acattgggca gtattatgtc tggcattacg ctgcttcttg agcgtccttt caatgtgggt 2761 gattggttgc gtgttggaga gaaagttgaa 
ggtcaggtga ttgatatcaa ttggcggtct 2821 gttcgcctcc tcacactcca gcgccaagtc attatagttc ctcatcaggt tattggaaaa 2881 gaaatcgtct gtaatcacag cctaccagaa cgcctgtata accaacgcat caagataggc 2941 ttctcctacg atagtccccc caaccttgtc aagcaagtgc taacaagtac tgctctatct 3001 acacagggga tagtagctga acctgagcca gagagcaaaa cctcatctta cgatgaaact 3061 gcaattatgt atgaggtgga attttttatc gaggactatg agaatgtgga gcaaatactt 3121 gacagattca tgacgcgagt ctggtatgca gcacggcgaa acaaccttgt tctctatcgc 3181 taccgatacg aatattcttc agaacccgct gtcaagacag atactccttc cagtcagttg 3241 acacaaaact taaattcaat tcctggattt gtacccctca ctaaacaaca agagaacttg 3301 gatgacctag ccaaaggttc aactttacag cattttggtg caggagagaa agtcattcga 3361 caaggtgatt ctgataacgc tttatacgtt attatcgctg gtcaggctgc ggtgactgtt 3421 aagaacgaat ctggaaaaga gcaagaggtt atgactcttt cgcgtggtga gttctttggt 3481 gcaatggcat tgtttcgtgg agaatcgaat ccagtatcag tgactgcgat taacgatttg 3541 gaagttcttg tcatccattc agatgttgtc gatacgatga tagaacgtca accaagcctt 3601 gcccgtgaga taggtcagat tgtagaagca cgaaggaaat cggtaaatat ggcgcagcaa 3661 gcggaggttc ctgtcaatcg tcatttttag atatgtcaga tagcgcacct tgaatagatt 3721 gatgcgtgcg acgtgggatt actggagcgg acaaatacac aagtcgattc gtaaaagata 3781 tttagatata gttaactttt aaccaataac tgataccata tcttgtagct tggcgaccat 3841 ttcagcacga gctgaaactt ccagcttgcg gaacatccgt ttgagcgctt gcttgacaga 3901 attttgtgtg atccaaagtt tttcaccaat ttccgcattg gttaaccctc gcgctaccaa 3961 ctcagcaatt tctaactcac ggggagtcag acgactgact aacggggaaa cagatgtttt 4021 tggttttgcg cgtaaagttg cgagttttgc ggaaacatga aggcataaag cactcatgtc 4081 cgcgagatca ttgacgttaa aagcaggagt cccactcgca cgagcaagat taagcgttcc 4141 cacaagacga ccatcacaaa caattggtcc actcatcacg tgttcatgat catggcgcga 4201 gcaaaaatgc ttccactctc ctgttggtaa caccaattgc tcgtgagctg gtgcatgacg 4261 ttcaacgacg tagcgtccaa ccgggttgct ttccatacac acaccaggaa catcctgaac 4321 atcaatctct gttaggaatg gctcatccaa gagatggatt ccccaatgct ggacgccaaa 4381 atactccgcc actgtatcta tcagagctag tcgtaattct tgctcattgc tcacagaagc 4441 tatttcatga aatacagttt ggagagactt ttccatcttt 
attttccgag tggggactcc 4501 aacttcagcg aagcaagtcg gagaggaaac gagggtctaa tttactggtt cgttcgcgta 4561 gcgtgcactg aatcctcagt cctatgcctg cggcacggct tgagaccgaa tggacacgct 4621 acgtgaacgc gcattgccag taaattagac aatctcttag ttggcaacac gataagtccc 4681 gtgtttcgct gtaccaaccg agattttttt gggctgaatc gccctaatct acgccaatca 4741 gcatcactga ctgacaaagc tttttcagta tcgccactgc aataaccaat ttcaccacta 4801 ggagttttta ttaaatcccc tttgcgaaac ccatgacggg ttacagtgcc accgtattta 4861 cggcgtgttc caccttttag tggaaccatc agatgcaact gacgacgaga taccggagga 4921 cggcgaataa ctgtgaatgg ggcaggtgtg atactaacat gacctttcca gcccattgat 4981 tgacaatcaa tcgcaccgta tcgagtgaaa gcgctacacc ccaaagcaat accatctaca 5041 gcgtgagtcg caggtatagt atcaccttta gagtactttt gtttatgcaa acctaattgc 5101 tgcctaatat ttgcggtttg ccacccctcg acttcgcgta catcatccca atcgtttcct 5161 agtttcgtta gttgccactt ctgaccaacc ataaccggac tgaatccttt atcaccacgc 5221 gctttaacta tttcgtagac gacggtagaa atggggtaaa tcagcgacag ttcgtcaagt 5281 actcgtaact caaattcccg gttagcccgg atacttggag gtattttgcg ccctccgcga 5341 ttatcaaatc tcttctcccg atgagcgcga actttaaatg cgactttgcg gttaattcga 5401 cgacccctgc gtcctctacg caccatgcgt cgttgctcca tgcgttcccg cacaatttga 5461 aacggcaatt ggaggtgagc caaccaaagg gtaaacttag cagactgtac acccatgcct 5521 gtgaaatatt taccggggtc gataccaact gctatcggtt gcatattggt tgtcctaggg 5581 caaacaacta actgaatctg gaaaataccg aggtcgttat gtacaaccct tgcttttcct 5641 tcttttaacc aacgtcttgc cctactgggt tttgttggca tcagtggttt cccacttggt 5701 gataaaactg gtactcgcat aaagagataa tccttctccc aaggggagac gcgaaacgcg 5761 aacgagtaaa gttgtaagtc ccctcgccca cccaaactca gatgtcttgg ctttacccaa 5821 caaccgagca aacagtttta cagataatcc ggtctgggga agcaaccgga agtgttgacc 5881 gagtttaggc tcactgggct attcaacact tgcgttacta agtctgctct tagcaatccc 5941 taaccgttac tccgtttgcg cagtgcgcca tcaggcatag gtaggttagg gtagttgaaa 6001 ggtgtaccca cttggggtct atgcgaaact taataattac ttctatgcta atagcagcaa 6061 aggtaaaaag ctaatataac aggtgatata taggagaaac aatgacagta acacaagttt 6121 ccgctcaaga actttttcgt gctgcttacg aaaaccgcta cacttgggat 
acaaacttcc 6181 ctggttacac tgcagatgtc acctataagc aagacgagca ggtgtttaca ggtaaaattc 6241 gtatcaactc aaatctcaaa gcggaagttt ttgagataga agatgagcaa gccaagcaag 6301 tgattcacaa tcaagcatgg gagatagcga ttcatcgcat tcgtcgctcc tttgaacaaa 6361 cccacggcga gaatacattt cgctatggtg cgactgacga aactggtgca gttgaaatct 6421 ttctaggcgg aaagtctgag ggcgattact acaaagtccg caataatgag gtgagcttag 6481 tccaccgtca tatccacaat gttgttgtga cgatcaacac tttcagcagt catgatacag 6541 gagagggcta cctgtcccat gaatatgact ctgtttacca cgatccaaaa acaggtgaac 6601 aaaaaggcgg aagaagcgag tttacagatg agtacgaaaa ggttggtgat tactttattc 6661 taaatcgccg agaaattcgt acccagacag cagcacaacc atctattcag gaaattatct 6721 tttctaacat tcagttgttg gaacctgttg ctgcttaagt tttttgatag ggctaatata 6781 tgcaacgtgc atatattagc ctgtattttc atagacttat tttttaccaa gttacttata 6841 agttttactt gggttccata taaagaatct aaatctaata actatttagt tcttcttgtg 6901 tttttaacct ctgttgtacg acagataact ggtgcagaca gaatcacaaa agcaatcaaa 6961 gccaaattag taaacattgt ttaacccctg ttatttttta agtccttctg ggtataagac 7021 tacagatggt tccggcgaag ttccagaaat ttctggtatt taggacactt ttcaaacctt 7081 catatataaa ctactttttt atctacaggt tgtatcatct atgtcccaat ttgaaaccgc 7141 cctcgccgcc tatcaaagtt tcactgactc tttccaaagt ctgattatca gcaccgtgag 7201 tgctgataac actcccaatg cgagctatgc tccttttgtg atagataaaa gtaaaaagat 7261 ttacatttat gttagtggtc tttctaccca tactcaaaat cttcatgctg tccccaaagc 7321 aagcgtatta tttattgatg atgagtctca aactaaacaa atctttgccc gtcgccgtct 7381 aacttttgac tgtacagcta ccttggtaga acgtgatact gagttgtgga accaaattgt 7441 agatagcttt gaagcgcgtt ttggagaaat ggttcagata ttgcgggatt taccagactt 7501 ccgcattttt caactgacgc caagcaaagg tcgctttgtc atcggctttg gtgctgctta 7561 tgaggtagat ccgaatgacc tcagcacctt aactcatgtt actggtgaaa gcaaaggata 7621 ggaaaggaat aaccgcaacc aacccaaata tcgttttgtg atttgttgtc ggttttcctg 7681 tcaagaagca tcaggattta aacattcaaa gcaagtccta ataaagctaa aatagtacca 7741 ctatgaagga ctatgcctaa tttgtgactt tatgacttct gctgctttag tcaaaactga 7801 atacgaggcg attattggtc tggaaaccca ttgtcaactc agcaccaaaa ccaaaatttt 
7861 ctcttctagc tctacggcat tcggtgctga ccccaatact aacattgacc cggtttgcat 7921 gggtttgccc ggtgtcttac ccgtactgaa tgaaaaagtc ctagaatatg ctgttaaagc 7981 aggtttggcg ctcaattgcc aaatcgctaa atatagcaaa ttcgaccgta agcagtattt 8041 ttatccggat ttaccgaaaa attaccaaat ttctcagtac gatctaccca ttgctgaaca 8101 tggctggtta gaaattgagt tagttgatgc tgacggtaac cctgttcgca aaaagattgg 8161 catcacgcgc ctgcatatgg aggaagatgc agggaaattg gtacatgcgg ggagcgatcg 8221 cctttccggt tccacctact ctctagtaga ttacaatcgt gctggtgtcc ctttagtaga 8281 aattgtttct gaaccagatt tgcgttctgg acaagaagct gctgagtacg cccaagagtt 8341 gcgtcgaatc ctacgttacc tcggcgtcag cgacgggaat atgcaagaag gttctctacg 8401 ctgcgatgtc aatatctccg tgcgtccaat tggacaaaag gaatttggta ctaaggtaga 8461 aatcaaaaac atgaactcgt tcaacgccat ccaacgggcg attgaatacg aaattgaacg 8521 gcaaactgca gcagtagaag ctggcgaacg tatcatacaa gaaactcgtc tgtgggaaga 8581 aggttctcaa cgtacaatta gtatgcggat taaggaaggt tctagcgatt accgctactt 8641 ccccgaacca gatttagctc ccatcgaagt ttcagaagaa caattacaag agtggcgcgc 8701 tcaacttcct gaactcccag cactcaaacg cgatcattat gaaagcgagt tggggctgac 8761 tgcttatgat gcgcgagtgt tgacagaaga acgtgcaaca gctgaatatt ttgaagcggt 8821 tattgctgca gggggaaatc ccaaagctgc tgcgaactgg attactcagg atgtcgccgc 8881 ctacctcaac aaaaataaag atctcaagat caccgaaatt ggcttgacac ccgctaacct 8941 ggctgatgtg atcactcgga ttgagaaagg gaaaattagt aatgcgatcg ccaaagaaaa 9001 acttccagac ctcctaaacg ggacttctcc tgaagaactc tttaaaggta aggaactcat 9061 caccgatccc agcgtactgg aatctatcat tgatgaagtc atggctgcta atcccaaaga 9121 actcgaaaag taccgtaacg gtaacaccaa tctcaaaggc ttctttgtag gacaagttct 9181 gaaaaagacc aataaactcg cagaacccaa actcacaaac caattagtgg aacaaaagct 9241 gaacggctaa actctttaca gacgttgggt tcaacgtctc cattgcttcg agtgagtgct 9301 tgcttaacaa ttctttacac aaaatttcct aataaatcag gaaaaggggg tgacaccccc 9361 caccccaaac agtatatact gtgtttagta gatgcaccct gatgatctgg agagctgccc 9421 aagcggctct ccttttttat gtcttgtaag // LOCUS NODE_3544_length_9386_cov_4.7091429386 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae 
SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9386) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9386) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 
##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9386 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..266 /locus_tag="DP116_24300" CDS <1..266 /locus_tag="DP116_24300" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24300" /translation="ACGVPGLKPQGFHLTLYKKYISDTVFFGNMTLCRLTHEPSTLFK KKVQELENIYIGEMGVTEISLVICNAVCHPKTKKVIGTYKLSQ" gene complement(484..1077) /locus_tag="DP116_24305" CDS complement(484..1077) /locus_tag="DP116_24305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310245.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sulfite oxidase-like oxidoreductase" /protein_id="PRJNA477356:DP116_24305" /translation="MVGKFFRKPTKQDGERVPPGQHLTKGFPVLTYGETPEVSTDEWE FRVWGLAKPATFTWSDFLALPQHEFTADFHCVTRWSKLDVKWTGIKVTDFMKLIEVDA KADHVMEHCYGGYTTNISMKDFVREENFFAFQVFGEPLPAEHGGPLRLVVPHLYAWKS AKWINGLEFLEREELGFWERNGYHRRGEPWAEERYSY" gene complement(1241..1426) /locus_tag="DP116_24310" CDS complement(1241..1426) /locus_tag="DP116_24310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015116804.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="50S ribosomal protein L32" /protein_id="PRJNA477356:DP116_24310" /translation="MAVPKKKTSKSKRDKRRATWRHKATVEAQKALSLGKSILTGRST FVYPNAQQEEEEEEEES" gene 1730..3460 /locus_tag="DP116_24315" CDS 1730..3460 /locus_tag="DP116_24315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase C14" /protein_id="PRJNA477356:DP116_24315" /translation="MANNWAIAIGINQYQFFQPLDCAQADAEAVKDFLVTNGGFLPQQ CLVITETSPPFEDRSTYPTKENILLLLEDFAATSWHPQDRLWFFFSGYGVNYKGQDYL MPTEGDPNRVEQTGIEMRSLMQSLHVANLDALLLLDINRAFGTYTDAYVGKETIELAK ELQISTILSCQPEEFSHESRELGHGFFTAALLEALRYGNGNHLTHLEKYLSVRTPELC QHYWRPTQNPVMILVPKRQAQVISPQLEDKREAKVKEVAKEVAEVAGKNTPSSSSSPS SPHPQKKSSNNHFLPLLLLWSIATMLVLCLIMVVLLRDRVGFKLAQILPPSFRSATNN KKIVNVSPQPEASPSPTILTQPETQLIPQVTSSPQVTPTPDESKQSNQALLELEKMSL SQTQASDLRQAINTAAKIPPDDPKYEQAQDNIKIWSRMILDLAQTRAQQKQYANAISA AQLVPKDQAVYSKAQTSIKQWRLKAKQYVTNTTLIDAAVGLIRDGQASTYNRAIEVAK KVPRGEPGFDDAQKSINSWSENILRLAKQRANRKEFKAAIETAALVPEETAAYKQAQN AIAQWQKNGQ" gene complement(3615..3989) /locus_tag="DP116_24320" CDS complement(3615..3989) /locus_tag="DP116_24320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862724.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase" /protein_id="PRJNA477356:DP116_24320" /translation="MKVYHNTTRKIFLASWVLLNQTQQDAPAAQMHRPSAKPEFGILQ SIRRWLDNVPIRDRQLAHRLCKFIPSQCPFERDVKLFGKTLFHIPPMCKLNPVYEELV GLRFRALCYLADECGEDVTQYC" gene 4732..5079 /locus_tag="DP116_24325" /pseudo CDS 4732..5079 /locus_tag="DP116_24325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866091.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="anion transporter" gene complement(5244..6035) /gene="budA" /locus_tag="DP116_24330" CDS complement(5244..6035) /gene="budA" /locus_tag="DP116_24330" /EC_number="4.1.1.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acetolactate decarboxylase" /protein_id="PRJNA477356:DP116_24330" /translation="MKFKHYFWIAVVLITALSVVLPGRTQQNISFDSLFQTSTISALA AGVYDGETTFKELKNYGNFGLGTVNALDGEMIGLDGKFYQVKSDGIAYSIPDSTKTSF AVVTFFKSEKRIHLEGTMNYQQMQQSLDRQLPTKNSPYAIRIQGTFPYLKVRSVPKQT PPYRPLVDAVKDQSIFELKNVKGVLVGFRTPNYMQGINVNGYHLHFLTENRKIGGHLL DGKFQNDQVEIDTKSDVQIALPKTTQFEQADLGDGKAAEVNKIER" gene complement(6177..7949) /locus_tag="DP116_24335" CDS complement(6177..7949) /locus_tag="DP116_24335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130994.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribonuclease J" /protein_id="PRJNA477356:DP116_24335" /translation="MAKNETLAALKIIPLGGLHEIGKNTCVFEYDDEIILLDAGLAFP TDGMHGVNIVLPDMTYIRENRHKIKGMIVTHGHEDHIGGIAFHLKQFDIPVIYGPRLA LAMLEGKLEEAGVRERTELRSVLPRDVMRIGKHFFVEYIRNTHSIADSFTVAIHTPVG LIIHTGDFKFDHTPVDGEHFDLQRLAEHGEKGVLCLMSDSTNSEVPGFTPSERSVYPN LDRVFSQATGRLFVTTFASSVHRINMVLQLAQKYKRVVGVVGRSMLNLIAHARNLGYI KCDDNLLLPLQSVRNVPDENVLILTTGSQGEPMSAMTRIANQEHPHIRIRQGDTVVFS ANPIPGNTIAVVTVIDKLMMQGANVIYGRDKGIHVSGHGCQEDQKLMIALTKPKFFLP VHGEHRMLVKHSQTAQSMGIPAENMVIIQNGNIVELTEESIRVAGKVASGLELVDTSG SGMVSAKVLQERQRMAEEGIVTIAMALDWNGKLIAKPEIHLRGVVTSIERSLLQKWVQ QRIEEILSVRWSEFAPSFDSENQEVDWGGLQGLLERELARSLRKELHCQPSVTLLMQI PDEPPVKVSDGRRRRTRTAAQVAS" gene complement(8258..9155) /locus_tag="DP116_24340" /pseudo CDS complement(8258..9155) /locus_tag="DP116_24340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205884.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="4-hydroxy-tetrahydrodipicolinate synthase" assembly_gap 8386..8395 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 2692 a 2083 c 1860 g 2741 t 10 others ORIGIN 1 aggcatgtgg cgtccctggg ctaaagccgc agggttttca tctcactctt tataaaaaat 61 acatctctga tacagttttt tttggaaata tgactttatg tagattgaca catgagccga 121 gcacattatt taaaaagaaa gtccaagaac tggaaaatat ttatattgga gaaatggggg 181 tgactgaaat tagcctcgtc atatgcaacg ccgtttgtca tcccaaaaca aaaaaagtta 241 taggtactta taagttaagt cagtagggcg gataatacca taggtgtcaa cttaagcgca 301 gagtcttaca gagtaagctt tttatgccca acaaaccttc gatcccccct taatcccccc 361 taaccaaggg gggagacaga attttagccc ccttggttag ggggttgggg gatctcaaat 421 gactgtcatc aaagcatttt agttttaatt tgacaccaat tgggaaacgc cccaccaacc 481 ttattaatag ctgtaacgtt cctcagccca aggttcacca cgacggtgat agccattgcg 541 ttcccaaaaa cccaactctt cccgttccaa gaactctaaa ccattaatcc acttggcact 601 tttccaagcg tagaggtgag gaacgacgag tcgtagtgga cccccatgtt ctgctggtaa 661 gggttctcca aagacttgaa aggcaaagaa gttttcttct cgcacaaagt ctttcattga 721 gatatttgtc gtgtagccac cgtagcagtg ttccataaca tggtctgctt tggcgtctac 781 ctcaatgagt ttcataaaat ctgtaacctt gataccagtc cacttgacat caagttttga 841 ccagcgcgtc acacaatgga aatccgctgt gaattcgtgc tgtggaagcg ccagaaaatc 901 tgaccaggta aaggttgcag gttttgccaa accccatacc cgaaactccc attcgtctgt 961 gctaacttca ggagtctcac cgtaggttaa tacaggaaat cccttggtca agtgctgacc 1021 gggaggaacg cgttccccat cttgcttcgt aggcttccga aaaaactttc ccaccatagt 1081 tgcagaaaaa taacttttct gacaatactc taatttttat atcaagatga agtaatcagt 1141 cataagtaat gggtaatggt aaccagttag tggttagtga gataataact aacaactaac 1201 atccgacaag caaccagtta ccaattaccg aatagtcgtc ttatgattcc tcttcttcct 1261 cttcttcctc ttgttgagca tttggataga caaaagtaga acgtccagtc aaaatagact 1321 tgcctagaga aagagctttc tgagcttcaa ctgtggcttt gtgcctccat gtagctcgac 1381 gtttatctcg tttggacttc gatgttttct tcttaggaac agccatagta acgggtactc 1441 taaatgtaga caaccttttt attctaaggc atagaaacaa atgtagtgag 
tgttaagtag 1501 aaatcgctct cttatggttt gcgtcgatct ttctccttga agcaaccaaa aaactgcaac 1561 tcctcaactt ccacgcttgt cttagactct gaggagacgg cagatcctat aacggtcgaa 1621 tcagaatcac ctaaactatt agacaaggtt agtagacata ggttagcctt gttccttaca 1681 aagtcaatca ctttcctagg caattaaaca acgagtaaga aaaatcgcga tggcaaataa 1741 ctgggcaatc gcaattggta tcaatcaata tcagttcttt caacctttag actgcgctca 1801 agctgatgct gaagcagtca aagatttttt ggtgacgaat ggtggttttt taccgcaaca 1861 gtgtctggtg ataacagaga cttccccacc gtttgaagat agatctacct atcccactaa 1921 agaaaatatt ttactactgc ttgaagattt tgctgctact tcctggcacc cgcaagaccg 1981 attatggttt ttcttcagcg gttatggcgt caactataaa ggacaagatt atttaatgcc 2041 caccgaaggc gatcccaatc gcgttgaaca gacgggtata gaaatgcgat cgctcatgca 2101 gagtctgcac gttgcgaacc tcgatgcatt gttattgctt gatatcaacc gtgctttcgg 2161 aacttacaca gatgcttacg ttggaaaaga aacaatagaa ctagctaaag aactacaaat 2221 ttccactatt ctttcttgtc aaccagaaga attttctcac gaaagtcgcg aactagggca 2281 tggattcttt acagcagcgc tgttagaagc tttacgttac ggtaatggca atcatttgac 2341 gcatttagaa aaatacctga gtgttcgtac gccagaactg tgtcaacatt actggcgtcc 2401 cacacaaaac cctgtcatga ttttagttcc taagcgacag gcgcaggtta tttctccaca 2461 attggaggat aaacgtgaag cgaaagtcaa ggaagtagca aaggaagtcg cagaagttgc 2521 aggaaaaaat actccctcat cttcctcatc tccctcatcc cctcatcctc aaaaaaaatc 2581 gtcaaacaac cattttttgc cgctgcttct actgtggagc atcgctacga tgcttgttct 2641 atgcttaatt atggttgttt tgttgcgcga tcgcgtagga ttcaagttag cgcagatact 2701 accaccatct ttcagaagtg ctactaacaa caaaaagatt gttaatgtat caccacaacc 2761 cgaagcctca ccatcaccaa ccatactcac ccaaccagaa acccaattaa tcccccaagt 2821 tacttctagt ccacaagtca caccaactcc tgacgaatct aaacaatcca accaagcact 2881 gttggaattg gagaaaatgt ccctgagtca aactcaagcg agtgatttga gacaagcgat 2941 taatacggct gctaaaattc cgccggatga tcccaagtat gaacaagctc aggacaacat 3001 caagatttgg agccgcatga ttttagattt agcacaaact cgtgctcagc aaaaacagta 3061 tgcaaatgct atttccgctg cacagttagt ccccaaagac caagctgtat actcaaaagc 3121 acaaacatct attaagcagt ggcgattaaa agcaaagcag tatgtgacca atacaaccct 
3181 tattgatgct gctgttggct taattcgcga tggacaggct tctacctaca atcgcgctat 3241 agaagttgct aaaaaagttc cacgaggtga accaggtttt gatgatgctc aaaaatctat 3301 caactcatgg agtgaaaata ttttgcgtct tgccaaacaa cgagccaacc gaaaagaatt 3361 caaagctgcg attgaaaccg ctgctttagt tccagaggaa acagccgcat acaaacaagc 3421 gcagaatgca attgcacaat ggcaaaaaaa tggtcaatag ttactagttt ttggtcatga 3481 gtagagacgt tgcatgtgag gcagcgcgcg ttcgagtgag ggttttcctc gtgccggcga 3541 tcgctctccc gttgggcaac gtctctacac gtgtgaaatc tgaaacaatc cttaacagca 3601 acagtactga tttattaaca gtactgtgtg acatcttcgc cgcattcatc agctagatag 3661 catagcgctc gaaaacgtaa gccaaccaac tcttcgtaga cagggttgag cttgcacatt 3721 ggcggaatat gaaacagggt tttcccaaat aatttcacat cgcgctcaaa aggacactgg 3781 gaaggaatga acttacacaa gcgatgagct agttggcgat cgcgaatcgg aacattgtct 3841 aaccatcgcc gtatggattg gagaatacca aactcaggct tggcggaagg acgatgcatc 3901 tgggctgctg gagcgtcttg ttgtgtttga tttagcaaca cccagcttgc caaaaatatt 3961 tttcttgtgg tattatgata taccttcatg gttcactcct ctattcgact gaggctcttg 4021 aactacccga tgaacattta caactttgcc acgacttgtt ccggaagaac cgtgtcctga 4081 ctgagataca atctgatgat ttgccttatc tgatgactga gtagctactg gcatcagagc 4141 ctgttctata ttacgtcaag ttaagcgcaa tttccttttg atcagaggtg cagtagagat 4201 atcttgtttt taacttgatg taattaaaat ttcagtaaag acatctacct ttgggtagag 4261 aaaggtgtaa cgtttactaa aattttatca caaaataaaa cacatttgac gtcagtctct 4321 attgcgtaag tataaatact gcaattttgc attcaggtag tcattagcta agagcaaaac 4381 tgcgtaaatt tgacgctgag agcaactttt aaatattaca accttcccca aaaaaatact 4441 tatagataag agaggctaag taggagagag aaagaataca aagaagaaaa ttaattgaaa 4501 atcaagaatt tcgatttttt caaacctatt tttcggcgat tttacccaca ttttcactct 4561 cagtttcaaa gtggtaatct tacattggaa tatgagtaat tttgctgata gctacagagc 4621 tgctgataac agcattataa atttgtgatt cttctgcgac aacttgcaat ctatggcgta 4681 ttagggctga cttacctggg gttagcactg ggctatcttc caggtctacg catgaaccgt 4741 gcgacaattg ccctagtggg ttctgcattt ctcattggtt tgggtgttct caatttgcaa 4801 gaagcatggg atgctattga tgccactacc atcgtgtttt tgttaagcat gatggttgtc 4861 
aatgccaacc tatcctttgc tggttttttc cagcaagcac tagcagtgct gttgagtttc 4921 acccgcagtc cttttggctt gttaatcgcc ttaacttgtg ctagtggtat cctctgggcc 4981 ttttttctca atgacaccat cgcgctgatt tttacaccat tgactttgag gctgacacaa 5041 gcgctgaatt tgaatccgat tccctattta cttggactgt ttggttcagt cgccaactta 5101 atcgtcgcag aagcagccgg tgagttaggc tacaagttaa ctttttggga acatctgcgc 5161 tttgggatac cactgacggt gttaacttta ctgctagtat atctgtgggt tcagtaaata 5221 tctttgatgc cgtttccatc aacttatcgc tcaattttgt tgacttcagc agctttacca 5281 tctcctaaat cagcttgttc aaattgagta gttttaggca aggctatctg aacatctgat 5341 ttggtatcaa tttcaacttg atcattttgg aattttccat ccaaaagatg tccgccaatt 5401 tttcgatttt cagttagaaa atgcagatga tagccattca cattaattcc ttgcatataa 5461 tttggtgtac gaaagcctac caaaacacct ttgacatttt taagttcaaa aatggattgg 5521 tctttcactg catctactaa aggacgatat ggtggagttt gcttggggac gcttctaact 5581 ttgaggtaag gaaaagtgcc ttgaatgcga atggcgtaag gagaattttt cgtcggtaac 5641 tgtcggtcta aagattgctg catttgttga taattcatag tcccttctag atgaatccgt 5701 ttctctgact taaagaaggt gactactgcg aaagaggttt ttgtcgagtc tggaatggaa 5761 taagcaattc catcagattt cacttgataa aacttgccat ctaacccaat catttctcca 5821 tcaagggcat tcactgttcc caaaccaaaa ttaccataat tttttaattc tttgaaagtc 5881 gtttctccgt cataaactcc tgctgcaagt gcgctaatgg tggatgtttg aaatagggaa 5941 tcaaatgaaa tgttttgctg tgttcttcct ggaagaacga cactcaatgc agtgatgagc 6001 acaacagcta tccagaaata atgcttgaat ttcaaccaat aggaatacac catacaatga 6061 aaactggcgc agtttgataa taaattaggg tggatagcaa cgctattcca cccttgtgca 6121 gtttcatttt tggttagtta accaacaaac ttaagactct acactctata acttaactat 6181 gatgcgactt gagctgcagt acgagtgcga cgccttctcc catcagaaac ttttactggt 6241 ggttcatcag gaatttgcat caataaagtt acggatggtt gacaatgcag ttcctttcgc 6301 aatgagcgtg caagttccct ttccaaaagc ccttgcaagc caccccagtc tacctcttga 6361 ttttcactat caaaagaggg ggcaaattct gaccagcgaa cgctgaggat ttcttcaatc 6421 cgctgttgta cccacttttg caacaaggag cgctcgatgc tcgtgacaac acctcgcagg 6481 tgaatttctg gctttgctat caatttgcca ttccaatcta gcgccatcgc aattgtgaca 6541 ataccctctt 
ctgccatacg ttgccgttct tgcagcactt tagcactaac catgccacta 6601 ccggaagtat ctaccagttc cagaccagat gcgactttac cagcaacacg aattgattct 6661 tcagtaagtt cgacgatatt cccattctga ataatcacca tgttttcagc aggaatgccc 6721 atactctgag ctgtttggga atgcttgacg agcatccgat gttctccgtg aactgggagg 6781 aaaaacttgg gtttagtcag agcaatcatc agcttttgat cttcctgaca tccatgacca 6841 gagacgtgaa ttcctttatc acgaccatag atgacatttg ctccctgcat catcaactta 6901 tcgatgacgg tgactacagc aattgtattt ccaggaatgg ggttcgctga aaagacgacc 6961 gtatctcctt gacgaattct aatatgagga tgttcttgat tggcaatgcg cgtcatcgct 7021 gacattggct caccttgaga accagtagtc agaatcaata cgttttcatc tggtacatta 7081 cggactgatt gcaacggtag aaggagatta tcatcacact tgatatatcc tagattacgg 7141 gcatgggcaa tcaaattcag catggaacga cctaccacgc ccacgacacg cttatacttt 7201 tgcgccagtt gcaaaaccat gttaatgcga tgcacgctgg aggcaaaggt tgttacaaat 7261 aatcgcccag tggcttgact gaagactctg tctagatttg gatacacaga acgttctgaa 7321 ggcgtgaatc ccggtacttc tgagtttgtt gagtcactca tcaagcacag tacgcctttc 7381 tctccatgtt cggctagtct ttgcaagtca aaatgctcac catctacggg cgtatggtca 7441 aatttaaaat ctcctgtatg gatgatcaaa ccaactggtg tatgaattgc tacagtaaag 7501 ctatcagcga tggagtgggt attgcgaata tattctacaa agaaatgttt gccaattcgc 7561 atcacatcgc gcgggagaac acttcttaat tcagtgcgct cgcgcactcc tgcttcttcc 7621 aatttcccct ctagcattgc caatgctagt ctaggaccat agatcacagg aatatcaaac 7681 tgctttaggt gaaaggctat cccaccgata tggtcttcat gaccgtgggt gacaatcata 7741 cctttaattt tgtggcgatt ttcccgtata taagtcatgt ctgggagtac tatattgact 7801 ccatgcatcc cgtctgtggg aaaagccaat cctgcatcta acaggataat ttcgtcgtca 7861 tactcaaaaa cacaggtatt tttaccaatt tcatgcaaac cgcccaacgg aataattttt 7921 agggcggcta aagtttcgtt tttagccata ttctccttac tcgtgttctt tgttgataaa 7981 aatgtgaaat ttaaagattg ttctttgtca atcaaatgtg aaatctaaga ttgttgtaaa 8041 actgacaaga caaacaggtt atgaaatcag tctgaaaaca gctgactttt tatcagatgt 8101 tcaattgttg cttaattaat tcggctttaa aagctcaaaa ttcataatat atttatccag 8161 tcttagcaca ggggttttac ccagattttg gtaaaataaa acactttttt ataaaaatct 8221 tcagcaaatt agcccgtcag 
gtttcattat attcttgcta aatcaaacca agttctttga 8281 gaacgagttt taacttttga ctgatttctg agtcagcttc gcacaatggc ggacgagttg 8341 aaccaacctc ccaacctaaa agatttaatg ctttcttaac taggannnnn nnnnnggatg 8401 gggtttgtcg ttgcaaataa agctttaaac agaggaaaga gttggagatg tatttccctg 8461 gctacttgaa ttttacccga atcaaaggct tgaaccatct tttgtagttc gtttcctacc 8521 agatgagaag ccacgctgac gacacctttt gcccctattg ccaacagagg caaagtcatg 8581 taatcatcac cagagtaaat ctggaattct tttggcgtca agcgacgaat ttcacttgct 8641 tgatctatat tgcctgtcga ttccttaatt ccaataatat tgtcaatctc agctaaccga 8701 gcaacagttt ccggttgaag attttgacca gtacgaccag ggacattgta caacagcaac 8761 ggtaagtcag gcgtcgcttg agcgatcgca gcaaagtgct gataaagccc tgcttggggg 8821 ggtttattgt agtaaggaac aacctgcagt acaccatgta ctcctatttt aaccgccttt 8881 tgggtagcgg caattgcttc tttggttgaa ttagaaccac accctgctat cacaaaagct 8941 tttcccgcca ccgcttgcaa caccacagaa aacaactgat attcttcatc ccaacttagg 9001 gtaggagatt ctcccgtggt tccacacacc acaattgtat ctgtaccatt gtcagctaga 9061 tgtgctgcta gtgctgcagc aacatcataa tttacactgc cgtcttcctt aaacggcgta 9121 atcatagcgg ttagaactct gccaaaatct accaccctct tttcactcct aaatattctt 9181 ctttgttgtt ctttgttctt cgctttttgt ggatattggt ggttttatat tgtgataacc 9241 tatttcacta ttactttcaa gcgtaattta cacattatcg ccatatcatt agtcctttct 9301 ctggcactca taaccacatc actcatgact catggaccct tcgggttcgc cagtcaccta 9361 cggagggaaa ccctcctgca gtgctg // LOCUS NODE_3565_length_9296_cov_5.2477009296 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9296) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9296) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9296 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(376..1149) /locus_tag="DP116_24345" CDS complement(376..1149) /locus_tag="DP116_24345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859351.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="1-(5-phosphoribosyl)-5-((5- phosphoribosylamino)methylideneamino)imidazole-4- carboxamide isomerase" /protein_id="PRJNA477356:DP116_24345" /translation="MEVIPAIDLLEGRCVRLYQGDYEQSQVFSDNPVEVANQWVEQGA TRLHVVDLDGAKTGKLVNLPAIEAIAQAVSVPIQIGGGVRDRSRVQQLLNIGVQRVIL GTVAVEQPQLVQELCQEFPEQIVIGIDARNGMVATRGWQETSEVFATQLAVQMQELGA AAIIYTDIHRDGTLSGPNIEALRSLAATISIPIIASGGVSSVTDLLSLLALESQGVTG VIVGRALYTGDVSLREALRAVGQGRIQDIPPNIDFSAFA" gene complement(1234..2388) /locus_tag="DP116_24350" CDS complement(1234..2388) /locus_tag="DP116_24350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408885.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lysophospholipase" /protein_id="PRJNA477356:DP116_24350" /translation="MKFLCRSLLCFSGCVGFLAVDSILSSSFSFAAEQFVVRYGFFEQ SLPLQDLRNYAETQQVSSNLKSFLSYLKPKQQKMLQEALQIKMSLDIVAVDKLLDSGI GKMLLSVGSKSVIRRDKAAKEALRAAIILGAKSTKGLGIISFLEAYPSEKLVVDVPTV LDILSQSRLFSNSSNLPPKDNLSSTPIWHIATQYQTLASQGKQFSGCLFGDSVSAQLG NSLGEGTFNFALNGLSTISLVEQLEILALGKVKCQKTIIAIGGNDAWYGLSNELFTEK LKEAISLVRAMGTKEIFLIPAFYSTVAASKDPSIAAPLSKVEEINTVMNQVGTTENVP VKSSGVQPLYENNVLKDNLTSDGDHLNAEGLKIYRNALLEILASDSANKN" gene complement(2491..2835) /locus_tag="DP116_24355" CDS complement(2491..2835) /locus_tag="DP116_24355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208942.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF3593 domain-containing protein" /protein_id="PRJNA477356:DP116_24355" /translation="MISKETLFALSLFPYLGFLWFISRTPQMPRLALYGFYGTLVFVA VTIPAGIYAQVHYGKALANVDWLHGSAEFFLTLANILVVLGFRQAIIQKQQVKASNSS TQNLSSPTFSQE" gene complement(2832..3179) /locus_tag="DP116_24360" CDS complement(2832..3179) /locus_tag="DP116_24360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013191627.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24360" /translation="MNALSIPTWIIHVSSVIEWIAAIWLIWKYGEVNGNRAWWALSFA MLPALVSAMCACTWHYFDNAESLEWLVTLQASMTLLGNFTLWAAAVWIWRSTKSAETA NNSVSTKTIESKQ" gene 3233..4285 /gene="csaB" /locus_tag="DP116_24365" CDS 3233..4285 /gene="csaB" /locus_tag="DP116_24365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127946.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polysaccharide pyruvyl transferase CsaB" /protein_id="PRJNA477356:DP116_24365" /translation="MRVLLSGYYGKGNGGDEALLATLLQMLPPSVTPVVLSGNPEETN SRYRVEACDRMAPLTVLQALRSCDAFIWGGGSLIQDTTSAVSPFYYGAIMTLAQKMGL KTIAWGQGIGPVKRPVTRWLARQNFGGCTKVSVRDRASAALLTDWQIPFILAPDPVWA LQSKPVPGLWDLPAPRVAVTLRTHPQLTQTRLANLTRALVDFQKATQTFILLLPFQKS EDLEIAQAIQPQLKDVSKIMCLEEPELLKGVFRGVEMAIGMRLHSLIMAASEGCRCFA LSYDPKVNRLMEDLDMPGWDLANLPDDPNFISKTWMEHYANGDPLSAEKIQSLVDRAL IHREVLSEGLTVNSIK" gene 4350..4838 /locus_tag="DP116_24370" CDS 4350..4838 /locus_tag="DP116_24370" /inference="COORDINATES: protein motif:HMM:PF00583.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_24370" /translation="MSFIFSPMNFESACVILTWQYDEPYNFYNLDSGEISESVQQFLD PHNAYYTITNGHNDLIAYCCFGPDARVSGGDYSVEAVDVGLGIRPNLTRRGLGLRVVD AVMNFAQNKFTPILFRVTVADFNKQALRICEKAGFQTAQRFQRDLDGKDFVVLTLKDN ER" gene complement(5108..5452) /locus_tag="DP116_24375" CDS complement(5108..5452) /locus_tag="DP116_24375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24375" /translation="MSNIKVTIVQEVQTWQGVTVAPHRFGGIEFQVNGREIGHLHGDY QADIPFTARIRKELVESGKASLHQIYPKSGWISFYIHGVENVPLLLELLRMNYDRLAK NRLEVEVEEIAA" gene complement(5591..6304) /locus_tag="DP116_24380" CDS complement(5591..6304) /locus_tag="DP116_24380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747159.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rubrerythrin" /protein_id="PRJNA477356:DP116_24380" /translation="MDLSNSNTAKNLSEAFAGESMANRKYLFFAEVTRQLGMTEVSKL FRETANQETEHAFAHFRLMHPELVVDDVASLSEEQKKAIAARCLELAIEGETYEYTTM YPEFTEAARTDRDHKAVVEFEAQQAESREHAQIFRKAAHNFGLLTPIEHHHANQYTEA LQALEGVAPAPKAASDNPATQKWICRQCSMIYDPVEGDPDSGIAPGTAFEAIPEDWHC PICNASKKTFVPYEEAVAA" gene complement(6449..7210) /locus_tag="DP116_24385" CDS complement(6449..7210) /locus_tag="DP116_24385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009627632.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24385" /translation="MPNQTFTTTLNQSILTKPIISACNLNHYFGEGELRKQALFDINL DIYPGEIVIMTGPSGSGKTTLLTLMGGLRSAQEGSLTILDQQMCRANQQQMMQVRRQI GYIFQAHNLLTFLTAKQNVRMSLELHDEYLEQDIDGMSADILKAVGLGNRVDYYADSL SGGQKQRVAIARALVSHPKIVLADEPTAALDKKSGRDVVEMMQKLAKQQGCTILLVTH DNRILDIADRIVYMEDGRLADNPALTLGEPHRFFD" gene complement(7273..7902) /locus_tag="DP116_24390" CDS complement(7273..7902) /locus_tag="DP116_24390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015200834.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyclase/dehydrase" /protein_id="PRJNA477356:DP116_24390" /translation="MSQFLRSCLFAGYFSVIATIAVTPSVYAELFNSPVDKLPVAERV KLRNGQALVTGEKGKYTAKVLVTASPDIAWEVLTDYDNFSKFLPNTVSGKVLEVNGNQ KVVEQVDTRQVFLMNVQSRIRSAITETAKNRIDFRQIDGDLQSLDGYWKIEPVAPYSG AKANQVLITQVVEAQPKSGTPKKIFYNLFKDSLGDILTAVKREVDRRNR" gene complement(8067..9236) /locus_tag="DP116_24395" CDS complement(8067..9236) /locus_tag="DP116_24395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_24395" /translation="MFRKLFCKTPLAWLQVRREKTRLAVALAGIAFADVLMFVQLGLN DALYDSAANFHNTLRGDLFLINPQSEAVQSFKSFPRERLYQAAGFEGVKSINSLYVGS AQWRNPQNHQSERTVLTLGINPAKPAFTLPEVNQQIDKLKPLNRVLYDRAGRPEFGDI PALFQQSSPLLIQVKNKQIWVTGLFTLGVSFGTNGTMITSDSTFIRLFTERKPEQIDV GLISLKPGVNLKQVQAQIRATLPNDVLVLTREEFANREKKYWSSSTPTGFIFGLGTII GFVVGIVIVYQILYSDVSEHLPEYATLKAMGYSNTFLVSILIQESLILAIMGFIPGFV LSSGLYLIAGTATLLPIGMTPSRTALVLFLTVIMCVVSGGIAMRKLQSADPADIF" BASE COUNT 2753 a 1999 c 1876 g 2668 t ORIGIN 1 gaaaatagag acgcataatc aaacgtctct agagaggaaa cccataaatt gtaaaactga 61 aaaatgaaaa ttgaaaattt aattttataa aataaatatt atttagtcag gcactttact 121 ttcatccagt acttttgttg ttgttgtaga aactttggtt ggaatctggt aattccagga 181 tttcttgcgt agtctacctc tagtaagaag tgtttgacga gcgcattcag aactgaatga 241 ttgcttacaa cttgagatac acgacgggaa atgacgactc cggaaaattt taaaaaataa 301 ccggctacaa gccggtaaaa attttggaac aacaaacgtg ctaaggcgtg tctctacacc 361 agagagtaga aaacattagg caaacgctga aaaatctata ttgggcggaa tatcctgaat 421 acgcccttga ccgacagcac gcaatgcttc tctaagagaa acgtcccctg tatacagcgc 481 acgaccaaca atcacaccag tcacaccttg tgattctagc gccaacagac tcagcaaatc 541 ggtaacagaa ctcacaccac cagaagcaat gattggaatg gaaattgtcg ctgcgagcga 601 tcgcaatgct tctatatttg gtccagaaag cgtcccgtca cgatggatat cagtatagat 661 aatggcggct gctcccaatt cttgcatctg tacagccagt tgagttgcga aaacttccga 721 ggtttcttgc cacccgcgag ttgctaccat tccattacga gcatctatac caataacaat 781 ctgttctgga aattcttggc agagttcttg cacaagctgg ggttgttcta cggctacagt 841 tcccaaaata acgcgttgta cccctatgtt gagtagctgt tgtacgcgag agcgatcgcg 901 cactcctcca ccaatttgaa tgggtactga cacggcttga gcgatcgcct caattgctgg 961 cagatttact agtttacctg ttttcgcccc atccaaatcg actacgtgta atctagtagc 1021 cccctgttca acccactggt ttgctacctc aactgggttg tcgctgaaaa cttgggattg 1081 ttcataatct ccctgataaa gtcgcacaca gcgaccttct agtaaatcta tcgctggaat 1141 aacttccata taacttttcc tttttaaaag actcccctcc cagtttaaga tgacatatgg 1201 tcttggaggc ttattcatgt gctcatccaa 
aactcaattt ttgtttgccg aatcactcgc 1261 caaaatttct aataacgcgt ttctataaat ctttaagcct tctgcattga gatgatcgcc 1321 atcacttgtt aaattgtctt ttaaaacatt attttcatat aaaggttgta caccggatga 1381 tttaactggt acattttccg ttgttccaac ttgattcatc acagtattaa tttcttccac 1441 tttagacaat ggggctgcta tactcgggtc tttactagca gcaactgttg agtagaaagc 1501 aggaatcaga aaaatctctt tagttcccat tgctcgtact agagagatcg cctctttcaa 1561 tttttcggta aatagctcat tactcagccc ataccaagca tcatttccac caatagcaat 1621 aatagttttt tggcatttca ctttaccaag agctaaaatt tccagttgtt caactaatga 1681 aattgtactt aacccattta aggcaaaatt aaaagttccc tcaccaagag aattcccaag 1741 ttgagcagaa acagaatctc caaataagca gcctgagaat tgcttacctt ggctcgctag 1801 tgtttgatac tgggttgcaa tgtgccatat aggtgtagaa cttaagttat cctttggtgg 1861 caaattagat gagttagaaa ataaacgtga ttgacttaga atgtctagaa ctgtaggcac 1921 atccacgact agtttttcac taggataggc ttccagaaaa ctaataattc ccaagccctt 1981 tgttgatttc gctcctagga taattgcggc tcgtagtgct tctttggcag ctttatcacg 2041 acgtataaca cttttggaac caacagataa aagcatttta ccaataccac tatccaatag 2101 cttatccaca gccacaatat caagagacat ttttatctga agtgcctctt gtaacatctt 2161 ttgctgttta ggctttaggt aactcaggaa agatttgaga ttagaagaaa cttgttgagt 2221 ttcggcataa ttccgtaaat cttgtagagg aagtgattgc tcaaaaaatc catacctgac 2281 aacaaattgt tccgctgcaa aactaaaaga tgaactaaga atactatcaa cagctagaaa 2341 tcctacacag cctgagaaac acaacagaga acgacacaaa aacttaattg ctaaagatct 2401 tttcaatgac ataaatgatt ttttccaggt tggtaaaaca tttcacttat cagttctttt 2461 acggtttcct gaaaatatga tttggcaatt ttactcctga gaaaaagttg gggatgacaa 2521 attttgcgtg gaagaattag aagccttcac ttgttgcttt tgaatgatag cttgtcgaaa 2581 acccaacaca accaagatat tagcaagcgt caagaaaaat tctgcgcttc cgtgcagcca 2641 atcaacattt gccaaagcct ttccataatg cacttgagca taaatccccg ctggaatggt 2701 gacggcgaca aagacaagag taccgtaaaa accatacagc gccaaacgag gcatttgtgg 2761 agtgcggctg ataaaccaca agaaacccaa gtagggaaac agggaaaggg caaagagggt 2821 ttcttttgaa atcattgctt tgattctata gttttggtcg agacggaatt atttgcagtt 2881 tcggcagact tagtggaacg ccaaatccac accgccgctg 
cccaaagggt aaaattaccc 2941 aataaagtca tactagcttg cagtgtgacc aaccattcta gagattctgc attatcaaaa 3001 tagtgccaag tgcaagcaca catggcgcta accaaagcag gtaacatagc aaaggacaat 3061 gcccaccaag cgcggttacc attaacttca ccgtatttcc aaattaacca aatggcggca 3121 atccactcaa taacactaga tacgtgaata atccaagtag gaatcgaaag agcgttcata 3181 aaagtcaaaa agtcaagagt caaaagttat aaattaaaaa tcaaaagaaa aaatgcgggt 3241 actactgtct ggatattacg gtaagggaaa tggtggggac gaagctttgt tagcaacgct 3301 tctacagatg ctaccaccat cagtcacacc tgtggttctc tctggcaatc cagaggaaac 3361 taacagtcgc tatcgtgtgg aagcgtgcga tcgcatggca ccattaaccg tactacaagc 3421 tttgcgctcc tgtgatgcct ttatttgggg tggtggtagc ttaattcaag atacaaccag 3481 tgctgttagt ccattttatt atggggcaat catgacatta gcccaaaaaa tgggtttaaa 3541 aaccattgct tggggacaag gaattggtcc tgtgaagcgt cctgtgactc gttggttagc 3601 acggcaaaat tttggtggtt gtacaaaagt gagtgtgcgc gatcgcgctt ctgcagcatt 3661 actcactgat tggcaaattc cttttatctt agctcccgat ccagtttggg cgctgcaaag 3721 taaaccagtt cccggacttt gggatttacc tgctcccaga gtggcggtga ctttgcgaac 3781 tcatccccag ttaacccaaa cacgccttgc taacctgact cgcgcacttg ttgattttca 3841 aaaagccaca caaactttta tattattgtt gccttttcaa aagagtgaag atttagagat 3901 agcccaagca attcaaccgc aacttaaaga tgtcagcaaa attatgtgtt tggaagaacc 3961 ggaactttta aaaggagtgt ttcgcggcgt ggaaatggcg ataggaatgc gtctacatag 4021 tttgattatg gcagcaagtg aaggttgtcg ctgctttgct ctcagttacg accccaaggt 4081 gaatcgattg atggaagatt tggatatgcc tggttgggat ttggcgaatt tgccagatga 4141 tcctaatttc attagtaaaa catggatgga gcattacgcc aatggcgatc cattgtcagc 4201 agaaaaaatt cagtctttgg tagatagagc gctgattcac cgtgaagtat tgagcgaagg 4261 actcactgta aattcgatca agtgaatttt ggtcaattgt atttgaatcg tatttgatta 4321 gactcctcac aactgtacaa ctcaaactta tgtctttcat cttctcacca atgaactttg 4381 agagtgcatg tgttattctg acttggcaat atgacgagcc gtataacttc tacaacttag 4441 actctggtga aatctccgag agtgtacagc aatttttgga tccccacaat gcttattaca 4501 cgattaccaa tggtcataat gatcttatag cctactgttg ctttggacca gatgcgcgag 4561 ttagtggagg tgattacagt gtcgaagcag tcgatgttgg gcttggtata 
cgtccgaacc 4621 tgacgaggcg gggacttggt cttcgtgtag ttgatgcggt aatgaatttc gctcaaaaca 4681 aatttacacc cattctgttt cgtgttacag tcgctgattt taataagcag gcgttacgga 4741 tatgtgaaaa agccggattc cagacagccc aaaggttcca gagagatctt gatggaaaag 4801 atttcgtagt cttgacgcta aaggataacg aacgatgaaa aatcttgaaa aagaatacag 4861 aatgtagaac tcaccaactc agaatgcaat tagtacggca gttcttggca aatttaactg 4921 cctatgatga acttgaacac gtttcattaa caaaggcttt tttaaataat ccgcaatcgt 4981 ttttaagaca tttataaact atcaaaagtt cattcatttt ttacgctcaa gcacaataaa 5041 tttcttatgt gggcagacaa gatgtccaac ccacaagagt tgataaaaat tgtacatagc 5101 aagatagcta cgctgctatt tcctcaactt caacttccaa ccgattcttt gccaatcggt 5161 cataattcat tcgcaacagt tctaacagta aaggtacgtt ctcaacacca tgaatgtaga 5221 aagaaatcca accgcttttt ggatagatct gatggagtga agctttacct gattctacga 5281 gttctttacg tatgcgagct gtaaaaggaa tatctgcttg ataatctcca tgcaggtgac 5341 caatttctcg accattgacc tgaaactcta taccaccaaa gcgatgaggt gcgacagtta 5401 caccttgcca agtttgtact tcctgcacga ttgtgacttt tatgttactc atagctatca 5461 tttctcctga ggcatgactc aaccttatac actgattgaa acgtaatgat tatgacttgc 5521 aataagctga ggtaaagggc gtaatgtatt acgcccctac atcacacaag ccaaaactat 5581 taattaatgc ttatgctgca acagcttcct cataaggaac aaaagttttc ttagaagcat 5641 tacaaatagg acagtgccaa tcctctggaa ttgcctcaaa cgctgttcca ggagcaattc 5701 cagaatcagg atcgccttct acgggatcat aaatcatcga acactggcgg caaatccatt 5761 tctgggttgc aggattatcg cttgctgctt tcggtgcggg agcaactcct tccagagctt 5821 gaagcgcttc tgtatactgg ttagcatgat gatgttcgat gggtgtcagc aatccaaagt 5881 tgtgtgctgc tttgcggaaa atttgagcgt gttcacggga ctctgcttgt tgtgcttcaa 5941 actcgacaac tgccttatga tctctatctg tacgggcggc ttctgtaaac tctgggtaca 6001 tggtggtata ctcataagtc tcgccctcaa tcgctaactc taaacaacga gctgcgatcg 6061 ctttcttttg ctcctcactc aatgaagcca catcatccac caccaactct ggatgcatta 6121 accggaagtg agcaaaagcg tgttctgtct cctgatttgc tgtttcccga aacagctttg 6181 acacttcagt catccccaac tggcgagtga cttctgcaaa gaacagatac ttacgattcg 6241 ccattgattc ccctgcaaaa gcttcagaca agttctttgc agtgttcgag tttgataaat 
6301 ccattgtgtt ttgcctctcc attcttctct caaagactac cgtgataatg gcagtccatc 6361 taaagcgtac tcgtaatgag ttcgtactat tttattgtac tcataacgag tatgaaatgg 6421 gagagttttc acaatatttt tctgctggct agtcgaaaaa tctatgtggc tcacccagtg 6481 tcaaagctgg gttatctgcc aaacgaccat cttccatgta aacgatgcgg tcagcaatgt 6541 cgagaatccg gttgtcatgt gtgaccaaaa gaattgtaca gccctgctgc tttgctagtt 6601 tctgcatcat ctccacaaca tctcgtccgg attttttgtc aagggcggct gtgggttcat 6661 cggctaggac aattttggga tggcttacaa gagcacgggc gatcgccact ctttgcttct 6721 gtcctccaga aaggctatct gcatagtagt ctacgcgatt tcctaaccca actgctttga 6781 gaatatcagc ggacatccca tctatatcct gttccaagta ttcatcatgc agttctaaag 6841 acattcgtac attttgttta gctgttagga acgtgagtag attgtgtgcc tgaaaaatgt 6901 aacctatttg ccgccgcact tgcatcattt gctgttgatt tgctctacac atttgttgat 6961 caagaattgt aagacttcct tcttgcgctg aacgtaaacc tcccatgaga gtcagtaatg 7021 tggttttacc agaaccagaa ggaccagtca taattacaat ctctcctgga taaatgtcga 7081 gattaatatc aaacaaagct tgtttgcgaa gttctccctc accaaaatag tggttcaggt 7141 tgcaagctga aataatcggt ttagttagta tcgattgatt gagcgtggta gtgaaagttt 7201 ggtttggcat ggttgttttt ttttgattct aaggttttca tgattaacag ttaagggaaa 7261 gaaagctttt gctcaacgat ttcgcctatc gacttctctt ttaactgcag tcagaatatc 7321 acccaatgaa tctttgaaaa gattgtagaa tattttcttt ggggttcccg attttggttg 7381 agcctccact acctgcgtaa ttaaaacttg attcgctttt gcgccagaat aaggagctac 7441 aggttcaatt ttccagtatc catctaagct ttgtaaatct ccatcaattt gacggaaatc 7501 tatgcgattt tttgcagttt cagtaatagc tgaccggatt cgtgattgaa cattcattaa 7561 aaatacctga cgcgtatcta cctgctccac aactttctga ttgccattca cttcaagtac 7621 tttacccgac actgtattcg gcaaaaattt agaaaagttg tcataatcag tcagcacttc 7681 ccaagcaata tctggcgaag ctgtcaccaa aactttagca gtatattttc ctttttctcc 7741 agtgactaaa gcctgaccat ttctcaattt tacccgttca gcaacaggta atttatcaac 7801 aggactatta aataattcag catagacaga tggggtgact gcaattgtag ctatcacaga 7861 aaagtaccct gcaaacaaac aactacgcaa gaattgcgac atatggtcat tccctcgatt 7921 tagatctgtc aattgacatc ttcccacctc caagcccgta gggctaagca gttttgaata 7981 
ggagtacaga ctgtaaagtc aagcaagagc agtgtgatca acaaaattgt ctgcgtcata 8041 acttcagcat ccataacttg aactctttag aaaatgtctg ctggatcggc tgactgtaac 8101 ttacgcatgg cgattccacc cgaaacaaca cacattataa ctgtcaaaaa caatactaaa 8161 gcagtacgac tcggtgtcat gccaataggt aacagagtcg cagttccagc aatcaagtac 8221 aacccactgg aaagaacaaa tcctggtata aaacccatga ttgccaagat gagggactct 8281 tgaatgagaa tgctaacgag gaaagtattg ctgtagccca ttgctttcaa cgtggcgtat 8341 tcaggtaagt gttctgacac atcagagtag agaatttgat aaacaatcac aattcccacg 8401 acaaagccaa tgatggttcc aagtccaaaa atgaagccag taggagtgct actcgaccag 8461 tatttcttct ctcgattggc aaattcctct cgtgtcagta ctaagacatc gtttggtaat 8521 gtagctcgta tctgagcctg cacttgttta agattcacgc caggttttaa cgaaatcaga 8581 ccaacatcaa tttgctctgg tttgcgctca gtaaacagac ggataaaagt agaatcacta 8641 gtaatcatag tgccatttgt tccgaaagag acaccgagtg taaacaatcc agtaacccaa 8701 atttgtttat tttttacctg aatcaaaagc ggagaagact gttgaaaaag ggcaggaata 8761 tcgccgaatt caggtcgccc tgctcggtcg tataacactc gatttagcgg cttgagtttg 8821 tctatctgct gattgacctc cggtaacgta aatgcaggtt tagcagggtt aataccaagc 8881 gtcagaactg tgcgttcact ttgatggttt tggggattac gccactgagc actaccaacg 8941 tataaagagt taatagactt gactccctca aatcctgcgg cttgatataa acgctctctt 9001 ggaaaactct tgaaggactg cacagcttca gactgtggat taatgaggaa caaatctcct 9061 cgcagtgtat tgtgaaaatt cgctgcagag tcatagagcg cgtcattgag tccaagttgg 9121 acaaacatga ggacatctgc gaaggcaatt cctgctaaag caacggcgag acgcgttttc 9181 tctcgtctga cttgcaacca tgctagtggt gttttacaaa agagtttacg gaacatttca 9241 gaggtgcgat tcgttactag ctgaatcacg gtaatcagtc cgcacataag ctagtc // LOCUS NODE_3569_length_9292_cov_5.1429049292 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9292) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9292) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9292 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 
complement(294..1316) /locus_tag="DP116_24400" CDS complement(294..1316) /locus_tag="DP116_24400" /inference="COORDINATES: protein motif:HMM:PF02195.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24400" /translation="MTTISIPTVEYLPIANILINGGTQSRVKLNWDVIAEYAEAITLD AIFPPILVFYDGKNYWLADGFHRLHATKKAGRQEIAVEIHPGNRRDALLYSVGANANH GLRRTNADKRRAVNIMLQDEEWNHWSNREIAKRCGVSEFMVRQMRESICDKNADTKKR TVQRQGKTYTLDTTHIGEGMTSINIAESSKPHPDGGCERVLLCTDNLQLSQCDHQIDV LIGQLDSSTYVANGEQTPSFKESEVQLFNGQPLTNVHPQLEHQSYPTPSTETINLQDI MINKIAMEIMHLSPEQLTKVISTSARNGLSKFHLNTIIEAAKQALNEGNQQEYFHKSY AVVGIE" gene 2371..3195 /locus_tag="DP116_24405" CDS 2371..3195 /locus_tag="DP116_24405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198659.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_24405" /translation="MDTTRIVIKAGRTESQYWKDLWRYRELFYFLAWRDILVRYKQTV IGIAWALLRPFLAMIISTVVFGNLAKLPSEGVPYPILVFAAMLPWQFFASSLTECSLS LINSSHLISKVYFPRLIVPVSSVIVSFVDFLISGIILLGLMAWYDFVPDWRILALPFF ILIAFAVAIGGGLWLGALNVKYRDFRHIVPFLVQFGFYISPVAYSSSVISPSWRLLYS LNPMVGVIDGFRWAILSGQSKLYLPGFILSTGLASLLLASGIWYFRKTERTFADVI" gene 3223..4524 /locus_tag="DP116_24410" CDS 3223..4524 /locus_tag="DP116_24410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019489327.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24410" /translation="MSDNVVRVENLGKKYTISHQKRGANSTLRDAIATSAKAISRKFL TPFDKKIPNPTHEEFWALKNVSFDIKQGEVVGIIGRNGAGKSTLLKILSRITEPTTGQ ILITGRVASLLEVGTGFHQELTGRENIFLNGAILGMSKAEIKKKFDEIVSFAEVEKFL DTPVKHYSSGMYVRLAFAIAAHLEPEILIVDEVLAVGDVQFQKKCLGKMGNVAKEGRT ILFVTHNMSMVESLCDRGILLEEGTLCVDGTSEEAVRVYLEKSYSLAQELPLSQRKDR TGSGRVRVSSFRILNEKGHEEQVLQSGKNYYFEIGYSNYIGKRLSNVVVSLAVADERG TFDLLLRSNFTNDYLTLNSEQGYILCGIENLPLVNGLYQVSIYLSHADSETLDDLQEA VSVVVDGGDFFGTGNPGLPNFCKFLVKADWSTSHAHLFSHM" gene 4643..6199 /locus_tag="DP116_24415" CDS 4643..6199 /locus_tag="DP116_24415" /inference="COORDINATES: protein motif:HMM:PF00534.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24415" /translation="MEAKMSYKILYYNWDPYFEKSSLGGGVSIYCRSLVEHFSQESDY QVNFLYSGVDYTFFGKQPYIKQVRNNRHPSVPTFSIVNSPVFSPSHFYFHNPIGNIEN PELEKYFQQFLVEHGPFQIIHFHNLEGLTATCLKIAKESGAQVVFSLHNYWSVCPQVN LWKLESSPCSNYLEGRACVSCLDTKIHVELELTLRKLNHLGSLLGQDQQSLPMAITKK ICRGIYSQIWYQVITRTKSPLKKPLEVTSADLPQEADLYRYRREEIVSLINRYVDVAL SVSERTTAIYKQYGVNPALLTTKYIGSQAAQFQVPPQNPAAYTPEQPFKLIYMGPARK DKGFYFLLEELRSLPQEELSSLELVVASRIYDSVELGMSIEQKGRLLSLAQSLHRFRF YPGYKYENIPNMLDGIHLGVVPSQWEDNLPQVTFELIACRVPVLCSNRGGAQEFVRHP AFIFDPSKQGDFEYKLRTIRENPHLLTEFWQEARPVKTVEQHFHELNEVYQTDIGKSL SLVGASSSAD" gene 6599..6988 /locus_tag="DP116_24420" CDS 6599..6988 /locus_tag="DP116_24420" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24420" /translation="MKNAIIKVVAVAAMVFTFAFSWVSSSAYALNLPGLNQLAIIKVT NLREGNLSQNYLAFEALREARAPLPPIPLPSKLNEDKKYGYIATPLYAEFEQNIVKQL NERNSAIGFPERYEYIEGGLDKLESLN" gene complement(7351..7467) /locus_tag="DP116_24425" /pseudo CDS complement(7351..7467) /locus_tag="DP116_24425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015186563.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS982 family transposase" gene complement(7596..8645) /gene="gap" /locus_tag="DP116_24430" CDS complement(7596..8645) /gene="gap" /locus_tag="DP116_24430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314598.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type I glyceraldehyde-3-phosphate dehydrogenase" /protein_id="PRJNA477356:DP116_24430" /translation="MTRLKVGINGFGRIGRLVFRAGINNPNIEFVGINDLVPPDNLAY LLKYDSTHGTYKGSVEAKDDGIVVNGHFIPCVSIRNPAELPWGKLGAHYVVESTGLFT NHEGAANHLKAGAKRVIISAPTKDPDRVPTLLVGVNHHLFDPAKDTIVSNASCTTNCL APIAKVINDNFGLTEGLMTTVHAMTATQPTVDGPSKKDWRGGRSAAQNIIPSSTGAAK AVALVLPELKGKLTGMAFRVPTPDVSVVDLTFKTAKSTSYKEICAAMKQASQGELKGI LGYTEDEVVSTDFQGDVRSSIFDAGAGIELNSNFFKVVAWYDNEWGYSNRVIDLMLSI AQKEGLLEHVLAAVA" gene complement(8779..9057) /locus_tag="DP116_24435" CDS complement(8779..9057) /locus_tag="DP116_24435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129076.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24435" /translation="MMLTLFCAVSGWAILMFLIWNVWITLKTGASHLKTLHQIPCSGC EYFTNDYRLKCTVHPVKACSPEALGCLDFEPKTDCCNANQKRPRKLCK" BASE COUNT 2659 a 1836 c 1942 g 2855 t ORIGIN 1 acttgccaaa cttactaggg agctatcgat gtaataccaa attcctttca aggcaattcg 61 tctagctgga agatttaaat aaaatacaaa actcactgct ttatagtcat ggagtttgta 121 gtcttgatat cggtcaaaaa tgaaagtaaa aaaccccgca aaactttacg gagtttgagt 181 gacagagttt atcgaggaat gggcgaaaag taatatggaa tctggcaaaa ttgagtaagt 241 ttgtccggtg gctatagttg tgcatctgca cgagtgtatc tttccattcc ctattactcg 301 atgcctacaa ctgcataact cttgtgaaag tattcctgtt ggtttccctc attgagcgct 361 tgtttagctg cttcaattat ggtgttgaga tgaaatttac tcagtccatt cctggcagat 421 gtggagatga ccttggtcaa ctgctccgga gacagatgca tgatttccat tgcaatttta 481 ttaatcatta tgtcttgaag gttaatcgtt tctgttgagg gcgtagggta agattgatgt 541 tcaagttgtg ggtgtacatt ggtcaacggt tgtccattga aaagctgtac ctcggactct 601 ttaaaactgg gagtttgttc accattagca acgtacgtgc tactatcaag ctgccctatt 661 aacacgtcaa tttggtgatc acactgactc agttgaaggt tgtctgtgca cagaagcacc 721 cgttcacatc cgccgtctgg atgtggtttt gaagattctg caatgtttat tgatgtcatt 781 ccctcgccaa tgtgagtggt gtcaagtgtg taagtttttc cttggcgttg tacagtgcgt 841 tttttggtat ctgcgttttt atcgcagata gattcacgca tttgacgaac cataaattca 901 gaaacgccac aacgtttggc aatctcccga ttactccaat gattccactc ctcatcttgt 961 agcattatat tcaccgcacg gcgtttatct gcgttggtac gtcgtagacc atgattggca 1021 ttggctccca ccgagtacag caaagcgtcg cggcggttac ccgggtgtat ttctacagca 1081 atctcctgac gcccagcttt tttagttgca tgcagccgat ggaaaccgtc tgccagccag 1141 tagttcttgc cgtcgtaaaa gactaaaatc gggggaaaga tggcatccaa cgttattgct 1201 tccgcatatt ctgcaatgac atcccagttg agtttgacgc gtgactgagt accaccattg 1261 atgagaatgt ttgcaatggg aaggtactcg actgttggga tagatatagt cgtcataact 1321 attgtgcttt aggagcttta aaagatatga gtgaattcta ggagctacca ataaaagcta 1381 aaacccaatt ctttatgcac tggaaagtga catatttttt aagcaactcc ttgaatccat 1441 agaaaacctt aagtaataat tacttttata ttgcagattt 
ttcaatctgc ctatttatac 1501 acttttatgt attacttata caattctatg taattggata taaatgtagg taaagtttga 1561 gagtgatctc agatactttt taatttttcc agaaaattga atataaatca atttacatac 1621 tttcaagatg tttttctcaa ataatgttaa ttgtttaagt aaactacttg ataataacca 1681 acctaattgc tgccactgac ctttgctctg cgcaatgtta ttaattaaaa gtttacagtt 1741 aggttttgtc agttcaccgg actcataata ctaggttctg ttttcctagg atatttattc 1801 tttttcccaa atttgtgtgt tgaaagttga attatgctgt atactttgaa ctaagagctt 1861 tcactaagct taagcatatc aataacccca cccaaagcag cggagcttgt ggatactcaa 1921 attcatataa ctagctaact gggggcaatc tcgaaaattt ttatctaata cagactttcg 1981 gttatttctc tagacttaat atctgtaaac cggctagtgt agctagccat aataccttcc 2041 ttaaaggatt ggctcaagga aagaggatta gccaaaatca gccacattaa caaaataggt 2101 catgtcaaat agggatttca cttctagtgg ttcaatttcc atcagataac gctcttgcat 2161 taagctttga accaaaagtg taaatgtcgt gtatgcggta tacacctatt taattgttag 2221 tttgaatcat gagttgatta tccagcgtca aatcggaagt tttgtgatgg cgtgtataaa 2281 tcatgtcagg catttttcag ctttgcataa gtctaaatag tgtttacata gctagcatcc 2341 aacagtaatt tatttgttat ttatttgtaa atggatacta caagaattgt tatcaaagcc 2401 ggtcgaactg aaagccagta ttggaaagat ttgtggcgct atcgagagct gttttacttc 2461 ctagcttggc gcgatatttt ggtgcggtat aagcagactg tgattgggat tgcctgggct 2521 ttgcttcgac catttttagc aatgattatt tccacagtgg tgtttggcaa tctagcgaag 2581 ttaccctcgg agggtgtgcc atacccaatt ttagtatttg cagcaatgct accctggcaa 2641 ttttttgcta gttccttaac agagtgcagc ctcagcttga tcaatagtag tcacctgatt 2701 tctaaagttt attttcctcg cttgattgtg ccagtcagtt cggtgattgt tagcttcgtg 2761 gattttctaa tttccggcat catcctgtta ggattgatgg cttggtatga ctttgtgccc 2821 gattggcgca tcttagcgct accgttcttt attctgattg cctttgcagt ggcgattgga 2881 gggggacttt ggttaggggc gctgaacgtc aagtaccgtg atttccgcca cattgtaccg 2941 tttcttgtgc agtttggctt ttatatatct ccagtcgctt atagtagcag cgtgatatcc 3001 ccttcttggc gcttactcta ctctttgaat ccaatggtgg gggtgattga tggctttcgt 3061 tgggcaattt tgagtgggca gtcaaaactc tacttgccgg gattcattct atctacggga 3121 ttggcttctc tgcttcttgc tagtggtatt tggtacttcc gtaaaactga 
acgcacgttt 3181 gcagatgtta tttaacttgt tatttaacta gggagtgaga gtgtgtctga taatgtagtc 3241 agggttgaaa atttaggcaa gaaatataca attagccacc aaaagcgggg agctaacagt 3301 actttgcggg atgcgatcgc cactagcgcc aaagccatca gtcgtaaatt tctaacacca 3361 tttgacaaaa aaatacccaa cccaactcat gaggaatttt gggcgttaaa gaatgtttct 3421 tttgatatta agcagggcga agttgtcggc attatcggtc gcaatggggc aggaaaatca 3481 actcttttaa aaattttaag ccggattact gaacccacaa ccggacagat tttaattaca 3541 ggaagagtcg caagtttgtt ggaggtggga acaggttttc accaagaatt aacaggacga 3601 gaaaacattt tcctgaacgg tgctattttg ggtatgagca aggctgagat taaaaagaag 3661 tttgatgaaa ttgtgtcttt tgcagaagtg gagaagtttt tagatactcc tgtgaagcat 3721 tattcatctg ggatgtacgt acgccttgct tttgccatag cagcacacct agaaccagaa 3781 attttgattg tagacgaagt actagcagtg ggggatgtgc aatttcaaaa gaaatgtctg 3841 ggaaaaatgg gtaacgtggc gaaggaaggg cgaactattt tattcgtgac tcataatatg 3901 agtatggtgg aatctttatg cgatcgcgga attctccttg aagaaggcac actgtgtgta 3961 gatgggactt cagaagaagc tgttagggtt tacctagaaa agtcttacag tctcgctcaa 4021 gagttgccct taagccaaag aaaagaccgc actggttctg gaagagttag agtttctagt 4081 tttagaatct tgaatgaaaa aggtcatgaa gagcaagttt tacaatctgg taaaaattac 4141 tattttgaaa taggatactc taactacata ggaaagcgtc tgagtaatgt cgttgtcagt 4201 cttgctgtag ccgatgagag aggtactttt gacctcttat taaggagtaa tttcaccaac 4261 gattatctaa ctcttaattc tgagcaaggc tatattctct gtggtataga aaatttgcct 4321 ttagtgaatg gcttgtatca agtttcgata tatctatcac acgcagatag cgaaacgctt 4381 gatgaccttc aagaagctgt atctgtggtt gtagatggag gtgatttttt tggtactggt 4441 aatcctggtt tgccaaactt ctgtaagttt ttagttaagg cagattggtc tacatcacac 4501 gctcatttat tctctcatat gtaatcaatt ttattttcaa aaactaaaga ttctgtttct 4561 agctatgaaa cagaattaag cgatagtgaa attattgttt catctaattt cgatattacc 4621 tttatcaagg ttactctcat taatggaggc aaagatgtca tacaaaattc tttattacaa 4681 ctgggatcct tactttgaaa aatcatcgtt ggggggaggt gtatctatat actgccggag 4741 tttagttgag cattttagtc aagaaagtga ctatcaagtt aattttcttt atagtggggt 4801 tgactacacc ttttttggga aacaacccta cattaaacag gttcgtaaca atcgtcatcc 
4861 tagtgtacca accttttcaa tagttaattc tcctgttttc tctccttctc acttttattt 4921 tcataaccct ataggaaata ttgaaaatcc tgagttagaa aaatactttc aacagttttt 4981 agtagaacat ggaccttttc aaattattca ctttcataac ttggaaggat taactgctac 5041 ttgccttaag atagctaaag agagcggtgc tcaggtggta tttagcttgc acaactattg 5101 gtcggtttgc cctcaagtta atttatggaa attagaaagt agcccatgct caaattatct 5161 tgaaggaaga gcttgtgttt cctgcctaga cacaaaaata catgtagagc tagaactgac 5221 tctcagaaaa ctgaatcatt taggaagctt gctaggtcag gatcagcaat cacttcctat 5281 ggctataact aaaaaaattt gcagggggat ctacagccaa atttggtatc aagtcataac 5341 tcgtacaaaa tctcccctga agaaacctct agaagtgacg agtgcagatt tgcctcaaga 5401 agccgatttg tatcgctatc ggcgagagga aatcgtttct cttataaacc gctatgttga 5461 tgtagcgttg tctgtttctg agcgtaccac tgccatttat aagcaatacg gagttaaccc 5521 agctctattg acgacaaaat atataggcag ccaagcggct cagttccaag taccaccaca 5581 aaacccagca gcgtacactc cagaacagcc ttttaaatta atctacatgg gaccagcaag 5641 aaaagataag ggattctact tcttgttaga agaactacgt tctttaccac aggaagaatt 5701 gagttcgctt gagttggttg tcgcaagccg aatttatgat tcggtcgagt tgggaatgtc 5761 aattgagcag aaaggacggt tgttatctct ggctcagtct cttcaccgtt tccgtttcta 5821 ccctggctac aaatacgaga atattcctaa tatgttggat ggaatccact tgggagttgt 5881 cccatcacaa tgggaagata acctgccaca agtcactttt gaactcatcg cctgtcgcgt 5941 gccagtactt tgctcaaatc gaggaggtgc acaggagttt gttcgccatc cggctttcat 6001 ttttgaccct tcaaaacaag gagacttcga gtacaaactc cgtactattc gagagaaccc 6061 acacttactc accgagtttt ggcaagaggc gcgacctgtc aaaacagtag aacagcattt 6121 tcatgagtta aacgaagttt atcagacaga catcggaaaa tcgctttccc ttgttggtgc 6181 gagttctagt gctgattaat gtaacatgag ttcgggataa ggaaagggac aggtagtaag 6241 ctgaaaataa cgtcaaatcc agcacctgtc ctatgttcag tttagatgct ttattttgcc 6301 acgtagatga ttgcagcaca agcagattgt ttgctgtaac gcaaggacag ttaagacact 6361 taagcaaaat atgtcttaag gatgaaactt atctacaccc tgtagccgct aatgcttgat 6421 atcaaagggt tagcgcgatt ttatttatcc aatgtgtaat tttaacaacc tatttttcct 6481 taactgtctt aactgtcctt gaattctcac aaaaaatttg aacaattaag aaagtgaagt 6541 
gctctggtat acacttcaag caactttcaa taaatttcat ttattcagga gtgagacaat 6601 gaagaatgca attatcaaag tagttgccgt agccgcgatg gtgtttactt tcgcattttc 6661 ttgggtgagt agttctgctt atgcactcaa tttacctggt ttgaatcagt tggcgataat 6721 taaagttact aatctgaggg aaggcaattt atctcagaac tatcttgcat ttgaagcttt 6781 acgagaagca cgtgcaccgc taccgcctat accattgcct agcaaattaa atgaagacaa 6841 aaaatatggg tatattgcaa caccgttata tgcagagttt gaacaaaaca ttgttaaaca 6901 attaaacgaa aggaactctg caattgggtt tccagagcga tacgagtaca tagaaggagg 6961 tctagataaa ctagaaagcc ttaattaaat caccataagt accctggttg aacaaatttc 7021 gatatccctt ataaaagtct agggaagggt agaagcactg gtgttgtgtc gatcgtccag 7081 tgcttttagt tgttatttac gctcaattct ttgctatacc tgcggtgtag cattttcccg 7141 tattccgttc cgaatcagca agcccacacc caacagcgcc actaatccgt cagcaatcgc 7201 ccatttagga atatcagcaa aactccagat tcgtgtttcg tattgggtca tgagctagta 7261 tagatatgcc atctgccaaa ggtacaaatc aaaaaattcg cgggactttt taagtttcac 7321 attggaattc ttgaaaacct gatgcagttt acctttcgca cgaatacgtt ttattcctcc 7381 gtgtataaat cgctttttct gccattgagc ttcaaaaatt ttacataaat catctacgtg 7441 gcaaaataaa gcatctaaac tgaacatagg acaggtgcta gatttgacgt tattttcagc 7501 ttactacctg tccctttgct tatcccgaac tcaggttaat gtagttatca gtaaccagta 7561 acaactgata actgataact gataactgtt aactgctaag ctacagcagc aagcacatgc 7621 tccaacagac cttctttctg tgctatggac aacatgaggt caataacacg attcgagtaa 7681 ccccattcgt tgtcatacca agcaacaacc ttaaagaagt tagagttgag ttcaatacca 7741 gcacctgcat caaagatgct agaacgaaca tcgccctgaa aatctgtgga aacgacttcg 7801 tcttctgtgt aacccaagat accttttagt tctccttgcg aagcttgctt catggcagca 7861 caaatttctt tgtagctggt agatttagca gttttaaaag ttaaatcgac taccgagaca 7921 tcaggagtag gaacccgaaa cgccatgcca gttaacttac cctttaactc tggcaaaacg 7981 agtgctacag cttttgctgc gcctgtagaa gaaggaataa tgttctgtgc cgcgctgcga 8041 cctcctcgcc agtccttttt gctgggacca tctacagtgg gttgggtagc agtcatcgcg 8101 tgaactgtgg tcatcagtcc ttcggttaac ccaaagttat cattgatgac tttagctatg 8161 ggagccaaac agtttgtggt acagctggca ttagaaacaa tggtgtcttt tgccgggtca 8221 aataggtgat 
gattgacacc taccaataga gtaggaactc tgtctgggtc ttttgtgggg 8281 gcggaaatga taacccgctt tgctcctgct ttgaggtggt ttgcagctcc ttcatggtta 8341 gtgaaaagac ctgtagattc aactacataa tgtgcaccca atttacccca gggcaattct 8401 gctggattcc tgattgacac acaaggaatg aagtgtccat tgacaacgat accatcatcc 8461 tttgcctcaa cactgccctt ataggttcca tgagttgagt cgtactttag taggtaagcg 8521 agattatccg gtggtactaa gtcgttaatc ccgacaaact cgatgttggg gttgttgatc 8581 ccggcgcgaa acacaagacg cccgatacga ccaaatccat taataccaac ttttagccta 8641 gtcaagatga aacctcctgt tatcggagac taaacaaagg gagaaagtca aaagaaatta 8701 tcttttactc tttacttccc aatcctgaca gtcccggtac gctaaggcgt atttacttgg 8761 catcttttaa gatttttgct atttacacag ctttctgggt cgtttttggt tagcattaca 8821 acaatcggtc tttggttcaa agtcaagaca gcccagtgct tccggggaac acgctttcac 8881 tgggtgaact gtgcacttga ggcggtagtc attggtaaaa tattcacaac cagagcaagg 8941 aatttggtgc aaagttttga gatgacttgc tcctgttttc aaagttatcc agacattcca 9001 aatcaagaac atcagaatag cccatcccga gacagcgcaa aaaagtgtaa gcataattgt 9061 aaagccccaa aaataatctc aaaaaattgc ctgcagctaa ctcctggtaa ctacaggcat 9121 gaaacattta tttagctgtt gttgagaaaa actccattaa gatcacgctt gaccaaaacg 9181 cgcgggtgta ggccctacac acgccagaac aagcgtgagg gaaaccctcc tatggcacaa 9241 agtgccacgc tgcgcgaaca gtactggctc ccctacaccc ctttcttggt ca // LOCUS NODE_3591_length_9221_cov_5.1495759221 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9221) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9221) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..9221
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(<1..599)
                     /locus_tag="DP116_24440"
     CDS             complement(<1..599)
                     /locus_tag="DP116_24440"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015111886.1"
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HPP family protein" /protein_id="PRJNA477356:DP116_24440" /translation="MVKQLNSYAGSTRKAVSRINFFQKLKGDNTKLPPKHSARAIILA WLGGFLAISAVALLSNTFSVTLVLGSFGASCVLVFGFPDVPFSQPRNVLGGHFLSSFI GLVCLTLFGATWWSVAIAVGTAISVMMLTRTTHPPAGSNPVIIYLSKPAWSFLLFPTL FGAIVLIVVALLYNNFVRESRYRSEKSKQSYPKIVNARQT" gene complement(606..1205) /locus_tag="DP116_24445" CDS complement(606..1205) /locus_tag="DP116_24445" /inference="COORDINATES: protein motif:HMM:PF00501.26" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24445" /translation="MNRQFNTRQSRKQQRYVHQSSPITISVRIRGEPMHSYVPLSPTT FLERSGQAFPNRSAIAYPDGTVTYAELLHRSRCLAQVLKRLQVGYGDRVALLTENNRQ IIEAHFSIPAIGAVIVMLNPWLAEQDLLSLLDYCEAKVLIVDASLSEKIFLNPQVALS HLQQVIVINHGVANPIYSGVLDYETCLAGENGDFRLDLR" gene complement(1406..1969) /locus_tag="DP116_24450" CDS complement(1406..1969) /locus_tag="DP116_24450" /inference="COORDINATES: protein motif:HMM:PF12973.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24450" /translation="MSQNNQMSKKPPSGYNGTNKGNGPIPTNGQLMSNGQIKFDPFPY NSINDREFKFLGMENVSLPSELEFVPYQLDVRKGVEIHTIFSLVDQDPQGPDAAILKY EAGAFVALHEHIGYEMVLVLTGDYIENGVTFLPGSLILRSPGTFHTMASINGCTILAT RYIKVKQRPDLWNEFAVLHEDVKPVSR" gene 1933..3618 /locus_tag="DP116_24455" CDS 1933..3618 /locus_tag="DP116_24455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878140.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24455" /translation="MEVFLTFGCSVTFAFQYLRRLNFSFEEIFFGSVNYIILIAYFEL WQQYQGLMTERLEALQIAATAVQTKTLTQELRENARQAAHKLAGVLGMFDREAGTTLA RQIEQILLEDEALGQGDKGDKKLLSLVRELGELLNLADSEPFSPSKTSRLLLIDPDTN LTQELQELAHSLDLSWKQVTTLELAKTWLQSHSPDLVVLAVDDMEQQEDSLALVADFA ARTPPIPVLVITSRDGLLDRVTVARAGGQGFLVRPVTAPRVWEIATQLLQRSRSRSVS VLAVDDDPVFLAGLRSMLEPWGIQMTELDDPLRFWEVLRRTAPDLLILDVDMPQLGGI ELCQAVRTDPDLQGLPILFLTAHRDRETIQQIFATGADDYVSKPVVAAELLTRITNRL ERTRLLKTLSTKDPLTGLANQPQSYRDLELLIQSMKDADKSANSDSSLLPDTFCLVVF FITELPAINLQYGHATGNQILQRWGRLFQATFRGAEVLGYWGNGEFVVGTPGLTKQQT SDRLSEILTTLRQQVFTAPDNSRFQVSFRFDIAEYPTDGLTLQFLYQIASQAS" gene complement(3659..4819) /locus_tag="DP116_24460" CDS complement(3659..4819) /locus_tag="DP116_24460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208061.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="spermidine/putrescine ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24460" /translation="MAQHLTQNQAGRTAFESLDVELHNVFKFFNQDPAVHGVDLGVRH GEFFSILGPSGCGKTTTLRLIAGFETADAGKVLIQGQSMTSVPPYRRPVNTVFQSYAL FNHLNVWDNVAFGLKLKNIHRSEVQSRVKEALELVKMEGLRSRFPSQLSGGQQQRVAL ARALVNRPAVLLLDEPLGALDLKLRKEMQVELSNLHQELGLTFIMVTHDQEEALSLSN RIAVMNQGKIEQIGTPSDIYEQPRTSFVADFIGDTNLFEGEIAGVDASSVIIFTKTGF SIVVARQHDTPTEISQAVVVSVRPEKIQLSLYKPSLQTNCFEGRLVNIMYLGTHVNYI VQLTNGIRITVLQPNTFGNLPGRETPIYAWWAETDCLALLKNSEHKIQNTDF" gene 5029..6882 /gene="mrdA" /locus_tag="DP116_24465" CDS 5029..6882 /gene="mrdA" /locus_tag="DP116_24465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860865.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="penicillin-binding protein 2" /protein_id="PRJNA477356:DP116_24465" /translation="MSTVKSRPTSKNREVRTVGQGLQPIFFIVFTLLMITGINIRLGY LQIIEGHKFKERAAANRIRTISKQPERGNIFDRNGKLLATTRYPRSVYLWPMAHTKPS WSVVGPRLEKILNISQEEIEKKLDEAGASATSLVRIARDLSEAQITALKEYENELPGV EINTEAVRYYPHGKELAHILGYTRELTPEQLKKKKPEGYRLGDVIGQMGVEKAYEQLL RGEWGGQQVEVDSKGRPIRVLGEKQAKAGNDIHLTIDLDLQKAAEKALTLRRGAVVAL DPNTGAILAMVSHPTFDPNIFSKQKLSQKDWQSLQGSEHPLVNRALSAFPPASTFKII TTTAGLESGKFSPSTVLQTYGSLTVGGVTFGEWNHSGFGALGFQRAIAMSSDTFFYQV GRRVGGPTLIEWTRKYGFGKKTGFEFSDEESKGNVPDDAWKRKVWKMPWTVGDTINMS IGQGALQVTPLQSAIMFSVPANGGYRVKPHLLKDHEEAKNWRESLNMKPSTIKVLREG LRQVITGGTGTLLNVPSIAPAAGKSGTAEAGINRPNHTWFGAYAPADKPEIVVVAFGE NSGGHGGTDCGPVVLKVLEEYFQKKYPGKYQKSESEESKAKTQNSRRRSGD" gene 7285..9156 /gene="mrdA" /locus_tag="DP116_24470" CDS 7285..9156 /gene="mrdA" /locus_tag="DP116_24470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316961.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="penicillin-binding protein 2" /protein_id="PRJNA477356:DP116_24470" /translation="MRLLQQPQLASKKETRTVGRGFQPLFFIIFTLLMTAGIGARLVY LQIAEGANYRKRAEANRIRMIPKQPERGNIFDRNGKLLATTRYPRSVYLWPMAHTKPS WSVVGPRLSKILNIPVEEMEKKLDDSGFNSSLVRIARDLNDAEITVLKEYANELKDVE IHTEAVRYYPHGQELAHVLGYTREITPQQLKRKKQEGYRPGDMIGQMGVEKAYEKLLR GERGGQQVEVDGKGRPIRVLGEKQAKAGSDIHLTIDLNLQKAAEKALTGRNGVIIALN PNNGAVLAMVSHPTFDPNLFSKRRLSKKDWESIQGENHPLLNRAVASAFPPASTFKIV TVTAGLESGKFSPDTVLQTYGSLTVGRVTFAEWNHAGFGPLGFTKALAMSSDTFFYQI GRGVGGPTLIEWARKYGFGEKTGIEFNTEETKGHIPDQAWKQRTWKIPWTVGDSINMS IGQGALQTTPLQVSVMFCIPANGGYKVKPHLLKDHEEAKNWRESLNMKPSTVKTLREG LRKVIAEGTGKALNVSTIPPTSGKSGTAEAWKNRVKANHAWFGAYAPADKPEIVVLAF AEHSGGGGGSVAAPMVLQVMEEFFQKKSLNKSDKADTEALKLRTHSPRMRRQNESRE" BASE COUNT 2667 a 1930 c 2116 g 2508 t ORIGIN 1 gtttgtctag cattgactat ctttgggtag ctttgtttag atttctctga gcggtatcta 61 ctttcacgga caaaattatt atagagaagc gcaacaacga ttaaaactat tgccccaaaa 121 agagtgggaa acagcagaaa actccatgct ggtttgctca ggtaaataat aactggattg 181 gagcctgctg gtgggtgagt tgttcgggta agcatcatca cagaaatggc agtaccaaca 241 gctatagcaa ctgaccacca ggttgctcca aaaagcgtca gacataccaa accaataaag 301 gaactcaaaa aatgacctcc aagcacattt cgcggttgtg aaaaaggcac atctggaaac 361 ccaaatacca aaacgcagga cgctccaaaa gagcctagta caagtgttac agaaaatgta 421 ttggagagaa gcgcaactgc actgattgcc aaaaatcctc ctaaccaagc aagtattatt 481 gctctagctg aatgcttagg aggtagtttc gtattatccc cttttagctt ctgaaagaaa 541 tttatccgag acactgcttt tctggtgcta ccagcatagc tgtttaattg cttaaccatg 601 tagattcacc tcaaatccag cctaaaatct ccattttcac ccgccagaca ggtttcgtag 661 tcgaggactc cagaatagat tgggttggca acgccatgat taataacgat aacttgttgc 721 aaatgcgaaa gagcaacctg agggttaagg aagattttct cgctcaagga agcatctaca 781 atcaaaactt tcgcctcgca atagtccagc agggatagga ggtcctgctc tgccagccaa 841 ggattaagca tgacgattac ggcaccgatg gctggaatgc tgaagtgagc ttcgattatc 901 tgtcgattgt tttctgttag gagagcaacc cgatctccat accccacctg gagtcgcttc 961 agcacttgtg ctagacagcg agagcggtgc aatagctcgg cgtaagtcac tgttccatca 1021 
ggataggcga tcgcgcttcg gttaggaaat gcctgaccac tgcgctctaa gaaagtggtt 1081 ggacttaagg gtacatagga atgcataggt tctcctctaa tacgaactga tatagtaatt 1141 ggacttgatt ggtgaacata tctctgttgt ttcctgcttt gtcgcgtgtt aaattgcctg 1201 ttcataaatc aaataagact actaactgtg ctggttgatt tatctcattt ttaggcgttc 1261 taatgtgcgc ggttcatgat agctacttag tattttggct caattgctat ttgcagtgtc 1321 ggaatatatc cgtaaccaaa acaaagaaac caccgttgaa gatgctggta ctctgattca 1381 aaatcagagt acttcacacc aataactatc ttgaaacagg cttaacgtct tcgtgcagta 1441 cagcaaactc gttccacaaa tcaggtcgct gtttgacttt gatataacgc gtggccaaaa 1501 tggtgcagcc attaatagaa gccattgtat gaaaagtgcc tggtgagcga agtattaatg 1561 aaccgggcaa gaacgtgacg ccattctcaa tataatcccc ggttaacact agaaccattt 1621 cataccctat atgttcatgc aaggcaacaa atgcgccagc ctcatattta agaattgcag 1681 catcaggacc ttggggatct tgatctacta aactaaatat tgtgtgaatc tcaacacctt 1741 ttctcacatc taattgatag ggtacgaact caagctcact aggcaaagaa acattttcca 1801 tacccaagaa cttaaattct ctatcgttga tggaattata tgggaatggg tcgaacttta 1861 tttgaccatt gctcatcaat tgaccatttg ttggaatagg accgttacct ttgtttgtgc 1921 cgttgtatcc cgatggaggt ttttttgaca tttggttgtt ctgtgacatt tgcttttcaa 1981 tatttacgta gattgaattt ttcttttgaa gaaattttct ttggcagtgt caactacatt 2041 attttgattg cgtattttga actgtggcag caatatcagg ggttgatgac cgaacgcttg 2101 gaagctttgc aaatagcggc gacggcagta caaacgaaga cgctgactca ggagttacgc 2161 gaaaatgcta ggcaggcagc acataaactg gcgggtgtgt tggggatgtt tgaccgggaa 2221 gcaggaacta cactggcgcg gcaaattgaa cagattttat tagaagatga agctttagga 2281 caaggtgaca aaggggataa gaagttgttg tctttggtgc gggagttggg tgaattgcta 2341 aatttagctg attcagaacc attttcgccc tcaaaaacat ctcgactgct gttgattgac 2401 ccagacacaa atttaactca agagttacag gaactagcac actcactcga cttgagttgg 2461 aaacaggtga caacgttgga attagctaaa acctggctcc agtcgcactc cccagattta 2521 gttgtgctgg ctgtggatga tatggaacag caggaagata gtttggcact ggtggcagat 2581 tttgccgcta ggacacctcc aataccagtg ttagtcatta cctctagaga tggactatta 2641 gatcgtgtca ctgtagcccg tgctggcgga caaggatttt tggttagacc tgtgaccgct 2701 cctcgggttt 
gggaaatcgc aacacaacta ttacagcgtt ctcgttcaag aagtgtgagt 2761 gtattagctg ttgatgatga cccggtgttt ctggcgggac tgcgctctat gctggaacct 2821 tggggtatac aaatgacaga attggatgac ccgttacgct tttgggaagt cctgcgacgt 2881 actgctcccg atttgctgat tttggatgtg gatatgcctc aactaggggg tatagaactg 2941 tgtcaggcag tacgcactga tccggacttg cagggactgc cgattttatt tcttactgct 3001 caccgcgatc gcgaaactat ccaacagata tttgcaactg gggcagacga ctatgttagt 3061 aagccagttg tggctgcgga actgttaacc cgaatcacca atcgtttaga gcgcacccgc 3121 ttgctcaaaa ccctttctac taaagaccct ctgactggat tggcaaatca accacaatca 3181 tatcgtgatt tagaactttt aatccaaagc atgaaggatg cagataaaag cgcaaactct 3241 gattcatccc ttctccctga tactttttgt ctggtggtgt tttttattac agaattacca 3301 gcaattaatc ttcaatatgg tcatgctaca ggcaatcaga ttttacaaag gtggggtcgt 3361 ctattccaag ctacctttcg gggtgcggaa gtactcggtt attggggcaa cggagagttt 3421 gtggtaggga cgccaggatt gaccaaacaa caaacaagcg atcgcctcag cgaaatcctg 3481 actaccctac ggcagcaagt gtttactgca ccagacaaca gccgctttca agtcagcttt 3541 aggtttgata tagctgagta tcccactgat ggattaactc tacaattcct gtatcaaatt 3601 gctagtcaag cttcgtgaaa accgtatact atctgctatg tagtctcttt ttttacaatc 3661 aaaaatctgt attctgtatt ttgtgttctg agttctttaa taaagccaaa caatcagttt 3721 cagcccacca agcataaata ggagtttcac gacctggcaa attcccaaaa gtattaggct 3781 gcaaaacggt gatgcgtata ccgtttgtca attgcacaat atagttgacg tgggttccaa 3841 gatacatgat gttgacgagc cgtccttcaa agcagttcgt ttgcaaactg ggtttataaa 3901 gcgaaagctg aattttttct ggacggacgc tgacgacgac tgcttgggat atttcagttg 3961 gtgtgtcatg ctggcgagca accacaattg agaatcctgt ttttgtaaaa attatgacac 4021 ttgaagcatc tacacctgcg atttcacctt caaataaatt cgtatcacca ataaaatcag 4081 ctacaaagct tgtgcgtggt tgttcgtata tatcactcgg ggtgcctatt tgttcaattt 4141 tgccttgatt catcacagca atacgattcg acaaggacaa tgcttcttct tggtcgtgag 4201 tcaccatgat aaaggtcaac cctagttctt gatgtaaatt tgagagttcg acctgcattt 4261 ccttacgcag ttttaagtct agcgccccaa gaggttcatc taataacaag actgcgggtc 4321 gattcacaag cgcccgtgct aatgctactc tttgctgttg accaccagag agttgactgg 4381 gaaagcgcga tcgcaaccct 
tccattttca ccagttccaa agcttcttta actctgcttt 4441 gaacttccga cctgtgtatg ttttttaact ttagtccaaa cgcgacgtta tcccaaacat 4501 ttaagtgatt aaacaaagca tagctttgaa agactgtgtt gacaggtcgg cgataagggg 4561 gaacgcttgt catggactga ccttgaatca acaccttacc agcatcggct gtttcaaacc 4621 ctgcaattaa gcgtagggta gttgttttgc cacacccaga aggacctaaa atactgaaga 4681 attctccgtg tctgacgcct aagtctaccc cgtgtactgc tggatcttgg ttaaaaaact 4741 tgaaaacgtt atgcagttca acatccaacg actcaaaagc cgtcctccca gcctgattct 4801 gcgtcaaatg ttgagccata atccaccagt tagatcctat tatcttcagt ttttttgttg 4861 attaatatag tcgattcggc aaaccacagg ttacactttg gttgattcga cttaccaaga 4921 agaatttaca aaaaatatgc tgttccggtt atcttaacgc ggaatacaga aaagaatttc 4981 acctaacttt ataccaataa aaattatttt ggaattggat atttttctat gtctacagta 5041 aagtcccgtc ctaccagcaa aaatagagaa gtacgtacag tcggacaagg tttgcagcct 5101 atatttttta ttgtatttac cttgttgatg atcactggta tcaatattcg cttaggatat 5161 ttacaaatta ttgaaggaca taaatttaag gaacgagccg cagcaaaccg gattcggacg 5221 atttccaaac aaccagaacg aggtaatatt tttgaccgca atggcaaact actagcgaca 5281 actcgctatc cgcgttctgt gtacttatgg ccaatggcac acactaagcc atcctggtca 5341 gttgtcggtc cgcgtcttga gaaaattctt aatatatccc aagaggagat agaaaagaaa 5401 ttagatgagg ctggtgctag cgccacttct ctggtacgga ttgctcgcga cctcagcgaa 5461 gcacaaatca ctgcattgaa ggagtatgaa aacgaactgc cgggtgtcga aattaataca 5521 gaagccgtac gctactaccc gcatggcaag gagttagctc atatactagg ttatactagg 5581 gagttgacgc ctgagcagtt gaaaaaaaag aaaccagaag gctaccgctt gggagatgta 5641 attggtcaga tgggggtaga aaaagcgtat gaacaactgt tgcggggtga atggggtggt 5701 cagcaagtag aagtagatag taaaggtcgc ccaattcgag ttttaggaga aaaacaggca 5761 aaagcaggta atgatattca cctgactata gatttagatc tgcaaaaggc agccgaaaaa 5821 gctttaacac ttcgtagggg tgcagttgtt gcactagatc ccaatactgg ggctatttta 5881 gcaatggtgt ctcatcccac ctttgaccca aatatcttct ccaagcaaaa actttctcag 5941 aaggactggc aaagtctaca aggtagtgaa catcccttag tcaatcgcgc cttgagtgct 6001 tttccacctg cgagtacttt caaaatcatt accacaactg cggggttgga atcaggtaaa 6061 ttttctccta gcacagtgct gcaaacttac 
ggttctctga ctgttggggg agttaccttt 6121 ggtgagtgga accactcggg atttggcgca ctgggatttc agcgtgcgat cgccatgagt 6181 agcgatacat ttttctacca agttggtcgc agagtcggtg gtccaacttt gattgaatgg 6241 actcgcaagt atggatttgg caaaaaaacc ggctttgagt tttcggatga agaatcaaaa 6301 ggtaatgttc cagatgacgc ttggaagcga aaagtgtgga aaatgccttg gactgtgggc 6361 gacactatca atatgtccat tggtcaaggt gctttgcaag tcactccact acaatctgca 6421 atcatgtttt ctgtccctgc taacggtggc tacagagtaa aaccacactt gctcaaagac 6481 catgaagaag caaagaactg gcgggaatct ctgaatatga aaccaagtac gatcaaagtt 6541 cttcgcgaag gattgcgcca agtgatcaca ggtggtacag ggacactatt gaatgtacca 6601 tcaattgccc ctgcagcagg taagagtggt actgctgaag ctggtattaa tcgcccaaat 6661 catacttggt ttggagcata tgctcctgct gataagccag aaattgtggt tgtagcgttt 6721 ggtgaaaact ctggtgggca tggtggtacc gactgcggac cagttgtctt aaaagtccta 6781 gaagagtatt ttcagaaaaa gtatccaggt aaatatcaaa aatctgagtc tgaagaatca 6841 aaagcgaaaa ctcaaaactc aagacgtaga tctggagatt aacagttatc agttatcagt 6901 tatcactaaa gagcgcatct agcccctcta ggggcgctac gcaaacgctt gggtaacggt 6961 ccggtgacct agtatagctt gaattagggg cgatcgcttc ggcgatcgcc tctaccatgt 7021 ggtttgtagt ttctacccga aaaagcgttg taaatttgga ctaccgtctt tgccaaatta 7081 acagtataat cattggtttt ctctgaatag cgctaattga tgtatttcct tatgcaaaat 7141 actgacaaca atttggttta ttctatataa caaaaaccaa atgacaaatg ctgcactact 7201 ggggttgttt ttctttcaat cccaggtaga aatttggcgc gatcgtatac agttcatgcg 7261 attctcggaa tcgctatttg tgacatgagg ctactgcaac aacctcaatt agcgagcaaa 7321 aaagagacac gtactgtagg ccggggtttc cagccgttgt tcttcattat atttacctta 7381 ttgatgacag ctggtattgg tgctcgtcta gtatatttgc aaattgcgga aggtgcaaat 7441 tatcgcaagc gagctgaggc aaaccggatt cggatgattc ccaaacaacc agaacggggt 7501 aatatttttg accgtaatgg caaactacta gcaacaactc gctatccgcg ttctgtgtac 7561 ttgtggccaa tggcacacac caagccttcc tggtcagttg tcggtccgcg cctgtccaaa 7621 attctcaaca ttcctgtaga agagatggaa aagaaattag acgactctgg ttttaactct 7681 tctttggtgc ggatcgctcg tgacctgaac gacgcggaaa ttacggtatt gaaggagtac 7741 gccaatgagc tcaaagacgt ggaaattcat acagaagctg 
tacgctacta cccacatggt 7801 caggagttag cccatgtact gggttatacc cgagaaataa cgccgcagca gttgaaacga 7861 aagaaacaag aaggttatcg cccaggagat atgattggtc aaatgggggt agaaaaggcg 7921 tatgaaaaat tgttgcgggg tgaaagaggt ggtcagcaag tagaagtgga tggaaaaggt 7981 cgccccattc gagtgctagg agagaaacag gcaaaagcag gtagtgatat tcacctgact 8041 atagatttga atctgcaaaa agctgctgaa aaagctttaa cgggacgtaa cggcgttatt 8101 attgcactaa accccaataa cggtgctgtt ttagcaatgg tgtctcatcc cacctttgac 8161 ccgaatctct tctccaagcg aagactttct aaaaaagatt gggaaagcat acaaggtgaa 8221 aaccatccct tgctgaatcg tgctgttgct agcgcctttc cacccgccag tactttcaaa 8281 atagtcactg taacagcagg actggaatcc ggcaaatttt ctcccgacac agtgctgcaa 8341 acctacggtt ctctgactgt tggcagagtc acgtttgcgg agtggaacca cgctggattt 8401 ggtccattgg gatttacaaa agccttagca atgagtagcg atactttctt ctatcaaatc 8461 ggtagggggg taggtggtcc aaccttaatt gaatgggctc gcaaatacgg atttggtgaa 8521 aaaactggca ttgagttcaa caccgaagaa acgaaaggtc atattccaga tcaagcctgg 8581 aagcaaagaa cctggaaaat tccttggact gtaggcgata gcattaatat gtctattggt 8641 caaggtgctt tgcaaaccac accactgcaa gtttctgtaa tgttctgtat cccagccaat 8701 ggcggttata aagtgaaacc acatttgctc aaagaccatg aagaagcaaa aaactggcga 8761 gaatctctga atatgaagcc aagtaccgtc aaaactctgc gtgaaggact gcgaaaggtg 8821 atagctgagg gtacgggtaa agctcttaat gtgtcaacaa ttcctccaac atccggtaaa 8881 agtggtacgg ctgaggcatg gaagaatcgt gtcaaagcaa accatgcatg gttcggagca 8941 tatgcccccg ctgataagcc agagattgtc gtgttggcat ttgctgaaca ctctggcggt 9001 ggtggcggta gtgttgctgc cccaatggtg ttacaggtga tggaagaatt ttttcaaaag 9061 aaatctttga ataagtctga caaagctgat actgaggcgt tgaaattaag aactcacagt 9121 cccagaatga ggcggcagaa tgaatcaaga gaatgacgag gaaaggaggg ttgggaactg 9181 taacttaatc cctgctcacc tctgttcatt gctgatcgtt t // LOCUS NODE_3604_length_9182_cov_5.1309309182 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 9182)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 9182)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..9182
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1103) /locus_tag="DP116_24475" CDS complement(<1..1103) /locus_tag="DP116_24475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197580.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aromatic ring-hydroxylating dioxygenase subunit alpha" /protein_id="PRJNA477356:DP116_24475" /translation="MSVVSQAAQSHAQSHNTRQLGINPNHWYVVALSREVKTQPVGVR LWKQAIALYRDTKGQVHALEDRCPHRLVKLSHGQVIGDELECAYHGWRFNNQGECAAV PYLAENQKLPSCKIRRYPVKEQDGFIWLFPGDVETLHATSLQPMGVPEWEHLNYIATV SIINCNAHYSYLIENLMDMYHGHLHQDYQAWTQASLQDIDEDTHRVDAHYTAQSYYKI DKIWSISQLFFPALRRLHPEPLDVSYVYPHWISTLGKDFKIYCLLCPVNERQTKAYLI HFTSLNAFWRLHKLPVWFRRFVKDSLFGAAQKLLDGLVRQDVQMIEEEQQAYLENHQR RGYELNRALVSVQRLMRSQVEPHPHKATLYIPSP" gene 1267..1770 /locus_tag="DP116_24480" CDS 1267..1770 /locus_tag="DP116_24480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873443.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YbjN domain-containing protein" /protein_id="PRJNA477356:DP116_24480" /translation="MTSYQSNQETVATESTSTDELINEIIEEANAINYVEIIENVIDT LEQDDSAMVSHPSEGSYRWKFKYGTVEVFVQLTGTTDEDTLTVWSAVLKLPAKDEPRL MRELLEMNASTTFEARFAIIENQVVVLTTRTLADLSPGEASRLITIVATIADNNDEAL ESEFGVS" gene 1821..2501 /locus_tag="DP116_24485" CDS 1821..2501 /locus_tag="DP116_24485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873442.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipoate--protein ligase family protein" /protein_id="PRJNA477356:DP116_24485" /translation="MAIDSWLLKQHQSGHPPALRFYTWSPPAISLGYHQHKYPEYWQH LVWQGEKVDLVRRPTGGRAVLHQGDLTYAVVTSGVTSGMTSGFVGSRVQVYQKICEFL IQGWRSLGIQLQYGTAGRGYIHNPNCFGTATGADLILPDGTKLIGSAQVRRGGAILQH GSIRLQPSAELFTQVFGTESFSPVHLPQNLDIEKIIVALIAAAEDCFGMELEVQPLSE SEWEAILT" gene 2631..3650 /locus_tag="DP116_24490" CDS 2631..3650 /locus_tag="DP116_24490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24490" /translation="MLPKNKRKFRVIGIAFLSTVCIWFFGSYRVIANEFQSLHFQTSR IIKVSLKQPNNQQLCSHYQNIGRQLEPVSNQVLSGSEILFKTFADDKVLSIPAEPLPT IHQAAKKAKVPVIMYHDIISNKKVIYDITPQELEKHFELIKSTQMTPISLNRLFAHLR TGSPLPKKPILLTFDDGYGGQYEYAYPLLKKYGYPAVFAIHTSGVGVNAGRTHVTWEQ LRTMANDPLITISSHSITHPALTKVSDQQLYKEVVESKQILEAQLDRSIIYFTYPYGN YDSRVKKIVAEAGYLAALTIGDPTEMFANQSKDLFAISRFEKSRLEKVIPQAWGSPNM PQCNS" gene complement(3915..4279) /locus_tag="DP116_24495" /pseudo CDS complement(3915..4279) /locus_tag="DP116_24495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015217690.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="RNA-dependent DNA polymerase" gene complement(4276..4473) /locus_tag="DP116_24500" /pseudo CDS complement(4276..4473) /locus_tag="DP116_24500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314602.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="reverse transcriptase" gene complement(5031..6529) /locus_tag="DP116_24505" /pseudo CDS complement(5031..6529) /locus_tag="DP116_24505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009785485.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="RNA-dependent DNA polymerase" gene complement(7328..7876) /locus_tag="DP116_24510" CDS complement(7328..7876) /locus_tag="DP116_24510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NYN domain-containing protein" /protein_id="PRJNA477356:DP116_24510" /translation="MLSSLPRAVLLVDGYNIIGAWSCLKRTRDTAGLEAARCELIEAL TSYSAFQGYETQIVFDAQYQNSCSNREIITELLSVHYTDFGQTADTYIEKVSASLRSS LAQSLISRMIVATSDRAQQLMVQGYGAEWMSAQQLCYEVQATVCRVRQKSKVRKHSNT RFLANSIDAKARQRLAELRMGL" gene complement(8062..8745) /locus_tag="DP116_24515" CDS complement(8062..8745) /locus_tag="DP116_24515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317922.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24515" /translation="MAELGIEVKDLSFSWSSGENVIKSCSLEVPKGEFWMLLGTNGSG KSTLLRLLAGLLTPKSGDIQLSPTVGFVFQNPDHQLVMPTVGADVAFGLVEEKLPPAA VRTRVEEALEAVNLLSLQKRPIYALSGGQKQRVAIAGALARRCEVLLLDEPTALLDPD SQLELVASVRRLVKNRGITALWVTHRLDELNYCDGALLLERGSLLDQGEPQRLKRRLM ELHDKTPET" gene complement(8764..9039) /locus_tag="DP116_24520" CDS complement(8764..9039) /locus_tag="DP116_24520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013192327.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24520" /translation="MFSIDLTVKNTAFPISIERKTSEDAEAVYQLIVAAMQTGSPDIV ELTSEGKTEKKVAVRASEISGVQLIQKDGAAGGGTGRPPGFFAIAAE" BASE COUNT 2370 a 2068 c 1997 g 2747 t ORIGIN 1 ggagagggga tataaagcgt agctttatgg gggtgaggtt caacctgact cctcatcaat 61 ctttgcacgc tcaccaacgc ccgattcaac tcataacccc gcctctggtg attctccaaa 121 tacgcctgct gttcctcttc aatcatctgc acatcttgac gcaccaaacc atcaagtaat 181 ttttgcgctg caccaaacaa actatctttc acaaaccgcc gaaaccatac aggtaatttg 241 tgtaatcgcc aaaaagcatt taaagatgtg aaatgaatca aataagcctt ggtttgtctc 301 tcattcacag gacacaataa acagtagatt ttgaaatctt tgcccaatgt ggaaatccag 361 tgaggataca cgtaactgac atccaaaggt tcaggatgaa gacggcgtaa agcgggaaaa 421 aataattgcg aaattgacca aattttatct atcttataat agctttgtgc tgtataatga 481 gcatcaacac gatgagtatc ttcgtcaata tcttgcagtg atgcttgtgt ccaagcttga 541 taatcctgat gtaaatgtcc atgatacata tccatcagat tttcaattaa atatgaataa 601 tgggcgttac agttaataat tgaaaccgtg gcaatgtaat tgagatgctc ccattcggga 661 acacccattg gttgtagaga cgttgcatgc aacgtctcta catctccagg gaacagccaa 721 ataaatccgt cttgttcttt gactgggtaa cggcggattt tgcaactagg gagtttctgg 781 ttctctgcta gatagggaac tgctgcacat tcaccttgat tgttaaagcg ccaaccgtga 841 taagcacatt ccaactcgtc gccaataact tgcccgtgac tgagtttaac taagcgatga 901 ggacatctat cttctagggc gtggacttgt cctttggtgt cgcggtagag tgcgatcgcc 961 tgtttccaca atctcacacc cacaggttga gttttgacct cacgcgaaag cgcaacaaca 1021 taccagtgat tagggttaat acccaactgg cgagtattat gactttgagc atgactttgg 1081 gcagcttggg aaacaacaga cataaattga cacatcctcg ctattgttag aatgtaccgc 1141 aaatcttgat aatgttcccc ttgcgctttg atagacagcc tacgacttaa ggagctaaaa 1201 ataatcacag cacaagctat aaagtagtct agatgcaaca attatgctgt tgaggtaaaa 1261 atctgtatga caagctacca atctaatcaa gaaacagttg cgaccgaatc cacatctact 1321 gatgaactca tcaatgagat tatcgaagaa gcaaatgcca tcaactacgt ggagataatc 1381 gaaaacgtta tcgatacgct agaacaagat gacagcgcaa tggtgagtca tccttcagaa 1441 ggtagttacc gttggaagtt caagtacggc acagtggaag tgtttgtcca 
actcactgga 1501 acaactgatg aagatacctt aacagtttgg tctgcggtgc tgaagttacc cgccaaagat 1561 gaacccaggt tgatgcgaga gttgttagaa atgaatgcat ctactacctt tgaggcacgt 1621 ttcgctatta ttgaaaacca agttgttgta ctcacaacac gcactctcgc agatttgtct 1681 cctggcgaag cttcccgctt aattactatt gtggcaacta tcgctgataa caacgacgaa 1741 gcattagaat cggaatttgg tgtgtcttga ctttagtgtg gcgattcatt ccgttattac 1801 aagcctctgg tgatgtgcag atggcgattg actcctggtt gttaaaacag caccagtcgg 1861 gacatcctcc tgctttgcga ttttacactt ggtcgccacc tgccatttcc ttgggctacc 1921 atcagcataa atatcctgaa tattggcagc atctggtttg gcaaggtgag aaagtggatt 1981 tggtacggcg tcctacagga ggacgtgcag tgttacatca aggtgattta acctacgctg 2041 ttgtgacatc tggtgtgaca tctggtatga catctggatt tgtcgggagt cgcgttcagg 2101 tgtaccaaaa aatttgtgag tttttgattc aggggtggcg atcgctcggc atacaattac 2161 aatacggtac agctggacgc ggttacatcc acaaccccaa ctgtttcggc actgcaactg 2221 gtgcagattt aattttgcca gatggtacta aactcatcgg tagcgcccaa gtgcgacgag 2281 gtggagctat tttacaacat ggttccatcc gtttgcagcc ctcagctgag ttatttactc 2341 aagtctttgg cacagaatct ttttctcctg tgcaccttcc ccagaactta gatattgaga 2401 aaattattgt cgctttaatt gctgctgctg aagattgttt tggtatggag ttagaagtgc 2461 aaccactctc tgagtctgag tgggaggcaa ttttgactta gccacgtaac tattaactca 2521 aacacctgtt gggcaggggc aagcttcacg aaaactccaa ctgattttat acctacttct 2581 gtgcgtcaca aaacataatt acgaattatt ctgaattctt agataaaaaa atgctgccaa 2641 aaaataagag aaagtttcgt gtcataggta ttgcgttttt gtccacagtt tgtatttggt 2701 tttttggttc gtaccgggtg attgcaaatg aatttcaaag tctgcacttt cagacatcac 2761 gaattataaa agtttcctta aaacagccca ataatcaaca attatgtagc cactatcaaa 2821 atataggtag gcaacttgag cctgtaagca atcaagtatt atctggatca gaaatattat 2881 ttaagacttt tgcagatgac aaagtgttat caatacccgc cgagccttta ccaacaatac 2941 accaagctgc aaagaaagct aaagttcccg tcattatgta tcacgatatt atctcaaaca 3001 agaaagtcat ttatgatata acaccccaag aattagagaa acattttgaa ctgattaaat 3061 cgactcagat gactcctatt agcttaaatc ggctttttgc tcacttgcga acaggaagcc 3121 cattaccaaa aaaacctatc ttgttgactt ttgatgatgg ttatggtggg cagtatgaat 
3181 atgcttatcc tctattaaaa aaatatggtt atccggctgt ttttgctatt catacgagcg 3241 gtgtaggagt taatgctggt cgtactcatg ttacttggga acaattaaga actatggcta 3301 atgacccact cattacaata tcttcccaca gcataactca tcctgccctc acaaaagtat 3361 cagatcaaca gctttataaa gaagttgtgg aatctaagca gatacttgaa gctcaattag 3421 atcgttctat tatttatttt acttatcctt acggtaacta tgattccagg gtaaagaaaa 3481 tagttgcaga agccggatat ttggcagcac ttacaatcgg cgatccaact gagatgtttg 3541 caaaccagtc aaaggattta tttgcgatca gtcgttttga gaaatctcgt ctagaaaaag 3601 ttattcccca agcttggggc agcccaaata tgccacagtg taattcttga aggggcgtta 3661 taccgggttg agtcgatggg gactattatg tcccgcacct ctcatttaga tccggacgtg 3721 cccatttctg tgcatccggc tcccgatgtt ctaggatttc tccttgctca tgtgatgata 3781 ttggtggcat gactgatgga ctgctatgca gttcttgggt ttccaattcg gttgagtcga 3841 tgggtggtat tatccgccgc acctctcagt tagatccgag cgtgcccatt tctgtgcact 3901 cggctcccga tgttctaggg tttccccttg ctcatgtgga cgtagtcgtg acagctttca 3961 tgtaccgcta gaaggttttt agttttccag ttattgtgat tgccgtcgat gtggtgtaag 4021 tgtacccgtt cttcactgag catctttaat ccacaatgtc cacatgcatg gttttgccgt 4081 ttcagggcaa tagaacggtt tcgccgtcgt agagtttgct attgcgctca ctccagtagg 4141 gagtatctcc gtcaaaaggt gatttttctc cttggacgtt gacaaattta ttttcggagt 4201 aaggtactgc tgggaacgct ttgtccagta atttcttact agagtatcgg tcattctttg 4261 cttctttatt gaataccttg ttacttgtct aatagccagt aaccgtgcag ccttagattt 4321 cagaatcagc ttttgaagtg accgcgtttt acgcatgtcg cctgctttaa ctgctttata 4381 caaccgaact tgtaggcgaa acaaattgcg tctgaatttc ttccagggta acttgctcca 4441 agattcacta gtattgttac tgtgcctaat catactctac tctatcctga ttgttttctg 4501 aacaccttgc agcaattacg ccgcatccta cccgaattaa aggaattcag cctctcgtct 4561 agcttacctg ggtttcgacc gttcccaaga cctttaactc gtttttattc gttccccgga 4621 tgggattttt atgtgccgtc aggtggaacc agttcaaccg ctagaatccc ttactcttgc 4681 cgttttctca ccaattcatc ggcggggttt tatctctagc gaggtcaggg ggttttgtgt 4741 tatgccctgc ttcggaatag gttgcttttc taggttctgt tttcaccttg tgcactccct 4801 attagcgcaa agctctgcgg gggaagcttc ccccgcaaga ctttgcacca gtgtcagccc 4861 
ataacagccg tctggttacg ccctgttccc agcttcactc tgcgaaactc cgagttcgct 4921 cggtgtgggc aggtaaggag tcacctctga gtttgagggg aagggctttc accttcatcc 4981 tgtccggagt tcaatccctg tagggattgg ctaatatatg tacggactga ttaccgtgat 5041 tctactagca acgaatcgca ctggtgattt ccgtctacgt ggtgtagatg gactgtatcg 5101 ccaggaatca tcttcatccc acaaattccg catgaatggt tttgccgttt taaggcttta 5161 gaggtgtgtc cgtcatagag tttactgttt cgttcactcc agtacgttaa atctccgtcg 5221 aatggtgact tatcgccttt gaccatgacg aatctgtttt cggagtaggg aactgctgga 5281 aatgccctat ctagtaattc tttactggag tatcggttat tctttgcttc cctgttgaat 5341 accttgaatg ttctgcgttg gatgtgatac aacgagttac gtgccccatc catcttgcag 5401 tagcggtggt aattcctcca tcctctaacc actggtgtta gctttgtagc tctcactttg 5461 gaaccataat tggagcagtt gacgatagct tttactttct tacggaatgc tttgaagtta 5521 tccacagagg gggtacttct aaatttcccg ttgctctgca ctttgaagtg ccagccgagg 5581 aagtcaaatc catctgtcgt ggcggttatc ttggtcttct tttcgttgat attcattcct 5641 ctttctgcaa ggaattggct tattttttcg agtacctcta ttgcgttatc tttgggtctg 5701 agtatgatga ccatgtcatc tgcatatcta atgcaggctt tgaccatcgt tccatctgta 5761 cttttgtatt gatgtatgtc ttctattcca ttgagcgcaa tgtttgctag taatggacta 5821 actactccac cttgtggtgt tccttgctca ggaaactctg gattaactcc tgccttaagg 5881 catcggaaga tacctatctt tgtgcctttg gggcaaatga gtttgtccat tatggttttg 5941 tggcttatcc tgtcgaagca tttttgaatg tcgagttcga tgactcgctt ctctattcct 6001 ttacacctgg agtttaggtg gttaaatacc agtctttgtg catcgtgtgc acttcttcct 6061 agtctaaacc cgtagctctt ggcgtggaac gttgcaacgc gcaagctggt tctagggcat 6121 acttcacaag gcattgccat gctctgtcag ccatagtggg gactttgagg gttcggaagg 6181 ttccgtcttt ctttggtatt gggatttccc tgagtttgct atgttgccaa tcgtgtacat 6241 gcttcgcgag catccctccg agtgcaaatc gctcctcgta gttgagagac tttttgccgt 6301 caatccctgc ggtcttttta ccagtgttta actgagatac ttgacgaata gccaaaaatc 6361 ttgcagcttt ggatttcaga atcagctttt gaagtgaccg cgctttccgc ttgtctcctg 6421 ctttaattgc tttgaacaac ctgacttgta ggcggaataa gctccggcgg aatttcttcc 6481 aggggagatt cttccaagat tcactagtct ggtgactgtg cctaatcatg ctctactcct 6541 tgtgattgtt 
tttctgaaca cctcagcgaa attactcgct gtcctacccg aatcaagaga 6601 attctgtatc tcgtcatacc taccttgggt tcgacctccc aggactcttg attcgttttt 6661 attcgttcct cggatgggat atatgtgccg tcaggtgtaa ccacttcaac cgctagaacc 6721 tcttactctt actgcttgta ccttgattca tcagcgaggt tttaactcta gcgaagtcag 6781 ggggttcttt gttatgccct gctcatacct aggttgcttt ttttaggttc ttttcacctt 6841 gtgcactccc tgttaacgcc agtgtcagcc cataacagcc gtctggttgc gtcctgttcc 6901 cagcttcact ctgtagaatt ccgagtctac tcggtgtggg caggtgagga gtcacctctg 6961 agtttgaggg ggaggaattt cacctccatc ctgtccgaag ttcaacttgt tgacggtaca 7021 acgaagttga catgcttatt tacgggctga ttaccgagat ttagcatgta actaatcgca 7081 ccaaggttag tacccatcaa agaagcgatt cgcaattttg ggagtatcgt cagacacgtt 7141 aagacgttgg cagatggaag gcaaaatgac tagccagaga acagacgcag gtcacacaga 7201 acatagggac acgcacccca aggcaaacgc acgctacttt gtatgccaga ttcgtgcgct 7261 ttgcccatag agtgatagcc gtggctgtac ccactgggtg tattttgaat ttggaatttg 7321 gaatttctta tagtcccatc cgcaattcag ctaaacgctg acgtgccttg gcgtcgatag 7381 aattagctaa aaaccttgtg ttagaatgtt ttcgtacctt agatttttgc cgcacccgac 7441 aaacagtggc ttgtacctca tagcacagtt gctgtgctga catccattca gccccgtacc 7501 cctgtaccat caactgctgt gcgcggtctg atgtagcaac aatcatccga gaaataagag 7561 actgggctaa tgaggagcgt aaagaggcag aaactttttc aatgtaagtg tctgctgtct 7621 gcccgaagtc tgtataatga accgatagaa gctctgtaat aatttctctg ttgctacagc 7681 tgttttgata ttgagcgtcg aatacaattt gagtctcata accttgaaac gcactgtagc 7741 tagtcaaagc ttcgattaat tcgcaacgcg ctgcttctaa tccagcagtg tcacgggttc 7801 ttttcaagca agaccaagcg cctatgatgt tgtagccgtc cacaagcaaa acggctcggg 7861 gtaaggaaga gagcatgatt ttggatcaca attttctaga gttaacaaaa ttcattcata 7921 aaattcatag taattgaagc atgggttgta acactatgtt acaaaaagaa aaatgtatat 7981 aagggtgtag agaaaaatcg atggaaaata tgccctatgg gcatgcaagg caaacgaaat 8041 ttcccctaca cccttgtatc cttaagtctc aggagttttg tcgtgcaatt ccatcagccg 8101 tcgtttgagg cgctgcggtt caccctgatc caataaagaa cccctttcta gcagcaaagc 8161 gccgtcacag taatttaact cgtctaagcg gtgggtaacc cagagggctg tgataccacg 8221 atttttgact aggcggcgga 
cactagctac gagttccaat tgactatctg gatcaagcaa 8281 agcagttggt tcatctaaca ataggacttc acagcgacgg gctaaagcgc cggcgatcgc 8341 cactcgttgt ttctgtccac cactcaaagc ataaatagga cgtttttgca gtgagagtaa 8401 attgactgct tccaacgcct cctcaacgcg agttctgaca gcagcaggtg ggagtttttc 8461 ttctaccagc ccaaaagcca catcagcacc aactgttggc atgacgagtt gatgatcagg 8521 attttggaag acaaagccca cagtaggtga aagttgaatg tccccagatt taggggttaa 8581 tagccctgcg agcagtctga gtaaggtgga tttgccactc ccatttgtac ccaagagcat 8641 ccaaaattca cccttgggta cttccagaga gcaagactta atcacattct ctccggatga 8701 ccagctaaaa cttaaatcct taacttctat ccccagttcc gccatgagta taccactcgc 8761 cgattactca gctgctatgg cgaagaaacc aggcggtctg ccagttccac caccagcagc 8821 accatctttc tgaattaact gaactccaga aatctcacta gcacggacgg cgactttctt 8881 ttctgtcttc ccctcgcttg tgagttccac aatatcaggg ctgccagttt gcatagccgc 8941 cacaatcagc tgatacactg cctcagcatc ctcagatgtt ttacgttcta tagatatggg 9001 gaaagctgtg ttctttacgg ttaagtcgat actaaacatt tcgcttcaat gaaattaagg 9061 tgcaggactc attttaacgt tagcttgcgt ggagcgagtt tgaggaggtg actcatggga 9121 aaagttaact ttcggttaaa aaaatgaaaa aaacactaga aaaatctgag aactttatat 9181 ac // LOCUS NODE_3639_length_9063_cov_4.7999569063 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9063) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9063) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9063 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..202 /locus_tag="DP116_24525" CDS <1..202 /locus_tag="DP116_24525" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24525" /translation="LKSGDPFGSRSWGKPPRPRCLTTTLAPQFIDGENKSNNPYSKPL NSLMGFSYPYTPTALHPYTLSC" gene complement(205..831) /locus_tag="DP116_24530" CDS complement(205..831) /locus_tag="DP116_24530" /inference="COORDINATES: protein motif:HMM:PF08547.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24530" /translation="MRKKLVVASILLCLMNSSLANAEVNHFEDPRLTVSTVEADWSIK TDKTERPQVGASVATLIQAQKELQFQGTLAAIEGNGGLVGFAFVETPLSQDLSQYKYI EFYAKSKEARVVYTLTLKDEQTQQDAGTLTFEQEFVVGTDWTKVKLPLSEFQPKIRGR LVDSFQLDLGQVRSLSFQINRSKQEPDSPIPLEFALDVGSEVLVTNQE" gene complement(1058..2371) /locus_tag="DP116_24535" CDS complement(1058..2371) /locus_tag="DP116_24535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408025.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alcohol acetyltransferase" /protein_id="PRJNA477356:DP116_24535" /translation="MMDNRELGKMEKAMETLNRDGAGFLVVTVGRIKGPLNAEMVRLA LDLVQRRHPRLNSRIVGSLDNLHFQTGEMPLIPLRVVDKLHDEQWTDVVLEELNEKID SSKGLLRAVLILPMNKSCVNYLILTVHHSISDGLSCLQLYSEILAYCQSFASNDPITQ VISLPTLPPVEELLPESKKELRKEPLRNLPEPLGVETLGFEKCVPMELRRCGLVHRQL DAELTQQLVNICRQEKTTVQGALCAAMLFAAVRKIRTGQTNDVRVSCRSPIDLRQRLK PVISNGNMIALVSSVMSFHTLQLNTSFWELARDIKQQLHAGLEREEIFTGVLTFSQNV ELLLRQPHEVLATVSVSNVGRVNIPRVYGSLELEEISFVPSIAAYGGVFYAAVTTFQG KIFLNFPFSEPAISLHTMENLINSAVSCIIDACQGYCQLKPNTHE" gene complement(2535..3221) /locus_tag="DP116_24540" CDS complement(2535..3221) /locus_tag="DP116_24540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873828.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24540" /translation="MANLQHLALLQIGALAWNEWRRKNPQIEPDLRDAELNGANLYTA NLIKANLSAANFSVANLSAATLTQADLTHANLIGADLTEANLKGAFLCEANLIGTELK GANLRDADLGHAKLIRANLCFANLIAANLIAADLSKANLYEAEVIGAYLYKADLYKAN LRQAHLSGAYLLRANLSEADLSKADLRWTNLQGANLAGANLRGAKLEGAKLRGANLSG VDFQDTIISE" misc_feature complement(2649..2699) /locus_tag="DP116_24540" /note="possible 23S ribosomal RNA but 16S or 23S rRNA prediction is too short" gene complement(3225..3866) /locus_tag="DP116_24545" CDS complement(3225..3866) /locus_tag="DP116_24545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408263.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24545" /translation="MGIELRSFVFLDSLQPQHAAYMGTVAQGFLPLPGDSSLWIEISP GIEINRITDVALKAASVRPGALVVERLYGLLEIHSSSQGETRAAGQAILATLGVKKDE CLKPRVVSNQIIRNIDAYQAQLLNRTNRGQMLLAGETLYVLEVEPAAYAALAANEAEK AAAINILAVQPVGSFGRLYLGGQERDILAGAQGALTAIENVAGRVNPQSGRQE" gene complement(3967..5589) /locus_tag="DP116_24550" CDS complement(3967..5589) /locus_tag="DP116_24550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009632494.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="monoamine oxidase" /protein_id="PRJNA477356:DP116_24550" /translation="MARSALMAFLQRAYKITQVSLETGIPADEIVDILQQKTTRRRLI YGGLGLASAISAVTWHDGSDSVAFGTIPKVLVVGAGIAGLVAAYRLYQAGVPVDIVEA RNRIGGRIYTLQNALGTSIPVDLGGEFIDTKHTSLRSLAQELGLQIADLYAADKDLIQ GTLYFQGRKISEKEILQWFTPLVQKIKRDLAAIGKTPVTYRTHNQVAIKLDNTSITQY LEEAQVHPLLSQRIQVAYTGLYGREAREQSSLNMLLFIGTDTNSFQISAESDERYQIV GGNDQVPRLLARLLTNSIETGTQLEAITTRNDGSYRVSLRSGNRSFERTYERVLLALP FSTLRQVSLNVDLPAVKKKAIAQLGYGNNAKLITAYQERIWRTRYNSTAFVSCDLDFQ TIWEASRYQPGSNGLLTNFTSAQSSLLLGQGSAESQAQKLLTQLEKIFPGIASVRKGE AIRAYWPREPYTRGSYACYLVGQWTTIAGAEQENVGNLFFAGEHCSQRFQGYMEGGCR TGEMAAVQILRSLGLKKSAAQQQARITEKQTS" gene complement(5705..6997) /locus_tag="DP116_24555" CDS complement(5705..6997) /locus_tag="DP116_24555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198315.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_24555" /translation="MLVLEAKLEGSSEQYTILDEMIRTARFIRNSCVRYWMDNRDVGK NEISAYTAILASLYEWAGKLNSMARQASGERAWQSINRFYSNCKKKVPGKKGYPKFRR EQTHGSVEYKTTGYKLSKDRRSITFTDKFDAGTFKMWGTRDLHFYQLNQIKRVRVVRR ADGYYCQFCVDQERLEKREPTQTTVGLDVGLNHFYTDSKGNTVENPRYLRKGEKALKR LQKRVSRKKKGSSNRTKARKKLALKHLKVSRQRKDFAVKLARCVVISNDVVAYEDLKV RNMVKNTKLAKSISDAAWSLFRQCVEYFGKVFGVATVAVPPHNTSQNCSHCGNKVQKS LSTRTHRCPHCGTVLDRDPSGSQSPTEGDPPAALSHHNAALNILEIALRTVGHTGTKT SGDENDLCLGEETPSGKPTRGTRKSKAQVLESHGFLSK" gene complement(7326..8246) /locus_tag="DP116_24560" CDS complement(7326..8246) /locus_tag="DP116_24560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317290.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="manganese catalase family protein" /protein_id="PRJNA477356:DP116_24560" /translation="MFYHTKKLQYFRPPEKPDAIYAKKIQELIGGVFGEMTVMMQYLF QGWNCRGPAKYRDMLLDIGTEEIGHVEMLATMIAHLLDKAPIKLQEDGAKDPVVGAVM GGSNTQNVITDIMGAAMNPQHAIVSGLGALPADSVGFPWNGRFIVASGNLLADFRSNL HAESQGRLQAVRMYEMSNDPGVKDTLSFMIARDTMHQNQWEAAIEDLKESGLESTPVP SSFPLELEKREFAYQFWNNSEGTESSEGRWAKGPSPDGKGEFQYVENPQPLGPEPLPP QSDPRLHGTPLNKQAQGSADTLINRTTIGG" gene complement(8540..8881) /locus_tag="DP116_24565" CDS complement(8540..8881) /locus_tag="DP116_24565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114917.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cupin domain-containing protein" /protein_id="PRJNA477356:DP116_24565" /translation="MSDTSVKKVDSAYSPKGQLGQKYLASGKSVSMRLWENEQPGESK EPATREYETVGYVITGRAELHLEGQVVLLEPGNSWVVPKGASHTYKILEPFTAVEATS PPAQVHGRDEN" BASE COUNT 2391 a 2136 c 1884 g 2652 t ORIGIN 1 tctcaagtcg ggagaccctt tcggcagtcg ctcatggggg aaacccccaa gaccgcgctg 61 cctcaccacg acactggctc ctcaatttat tgatggagaa aataaaagca ataatcctta 121 ttccaagccc ctcaattcat tgatggggtt ttcttacccc tacaccccta cagccttaca 181 cccttacacc cttagttgtt gactttactc ttgatttgtg accaatacct ctgaacctac 241 atctaaagca aactccagtg gaatgggtga atctggttcc tgcttgctgc gattgatttg 301 aaaagagagc gatcgcactt gccccaaatc caattgaaag ctatctacaa gtctcccacg 361 aatttttggt tgaaactcac tcaaaggcaa cttcactttt gtccagtctg tcccaacaac 421 aaactcttgc tcaaatgtga gagttccagc gtcctgctga gtttgctcgt ctttgagtgt 481 tagcgtgtaa acgactctgg cttctttaga tttagcgtag aattctatgt acttgtattg 541 gctcaaatct tgggataaag gtgtttctac aaacgcaaac ccgacaagtc ccccattacc 601 ctcaatcgca gccagtgtac cctgaaattg taattccttt tgcgcttgga tcaatgttgc 661 aacgctagca ccaacttgcg gtctttcagt tttatcagtc ttgatcgacc aatctgcctc 721 aacagttgag accgtcagtc tggggtcttc aaaatgatta acttctgcgt ttgccaaaga 781 agaattcatt agacaaagaa gaatagaggc tacaactaat ttttttctca tgttaagaat 841 tgtacactgt 
cgctaccact tgttcataat atggcagcaa accgtcttgc caaatcctca 901 atgtcgtcca agcgagcgat aagccggagg cttgacgcaa agcgtatccc acggctacaa 961 acgaagtcca cgcaagttaa cttaccacaa agcctgcgga ggcaagctac tctccggaca 1021 gtacctaggc ttgtaaactt gctatgggtt attgatttca ttcatgggta ttgggcttaa 1081 gttgacaata tccctggcaa gcgtcaataa tacaggacac tgcgctgttt ataaggttct 1141 ccatcgtatg gagactgatt gccggttcag aaaacggaaa attcaaaaat atttttccct 1201 gaaaagttgt gacagcagca tagaaaacac ccccatatgc agcaatcgat ggcacgaagc 1261 tgatttcttc tagttcgagt gaaccgtaaa ctctggggat gtttactcga cccacgttgc 1321 tgacactcac cgtcgccaat acttcatgag gttgtctgag aagtaactcg acattctgac 1381 tgaacgtcaa tactccggtg aatatttcct ccctctctaa accagcgtga agctgttgtt 1441 ttatatcccg tgccaattcc caaaatgatg tatttagttg taaagtgtga aatgacatca 1501 cagacgaaac cagcgctatc atattcccat tgctaataac tggttttaga cgttgacgta 1561 aatcaatagg cgatcggcag ctcacacgca catcattagt ctgacctgtc ctgatttttc 1621 ttaccgctgc aaacagcatt gccgcgcaca aagcaccttg cactgttgtc ttctcctgcc 1681 tacaaatatt tacgagctgc tgggttaact cggcatctag ctgtctgtga accagaccgc 1741 aacggcgtaa ctccatcggt acacatttct caaaacccaa ggtttcaaca cctaagggtt 1801 ctggtaggtt acgcagaggc tctttcctca actccttttt cgattcaggc agtagctctt 1861 cgacgggtgg aagtgtgggc aagctaataa cttgagttat tgggtcattg gaagcaaaac 1921 tctggcaata tgccaatatt tctgagtaca gttggaggca tgacaaccca tctgatatac 1981 tgtgatgcac tgttaggatc agataattta cacacgattt attcattgga agaatgagta 2041 cagctctgag taaacctttg ctactgtcga ttttttcatt cagctcttcc aaaacgacat 2101 cagtccactg ttcatcgtgt aacttgtcta caacacgcaa aggaatcaaa ggcatctctc 2161 cagtttgaaa gtggagatta tccaaggaac caacaatacg agaattcaga cgagggtgac 2221 gacgctgaac aagatctaga gccagtctga ccatttctgc atttagaggt cctttgatgc 2281 gaccaaccgt tacaactaag aaacctgcgc catcgcgatt caaagtttcc atagctttct 2341 ccatcttgcc aagttctctg ttgtccatca tgaatacctc gttttcacta ctgcatttat 2401 tgtctcattt gtcaatgtgt aaattttgat aattcctagg catttctttt tataattagt 2461 taattttgag taggtattgc tgaaaatatg ctaattctat tcaccaattc tgaattcttc 2521 ttttgacaca tgtattactc 
agatataatt gtgtcctgaa aatccacacc actaaggttg 2581 gctcccctga gtttcgctcc ctcaagctta gctcctctga gattagctcc tgctaagttc 2641 gctccttgga ggttagtcca tctcaagtca gctttactca agtcagcttc actcaaatta 2701 gctcgtaaca agtatgcacc actcaggtga gcttgcctga gattagcttt gtacaagtca 2761 gctttgtaaa ggtaagcgcc aatcacttct gcttcgtaca ggtttgcctt acttaagtca 2821 gccgcaatta agttcgctgc gattaagttt gcaaaacaga ggttagcacg aattagcttt 2881 gcatgaccca aatcggcatc tctcaagtta gcgcctttca actcagtccc aatcaagtta 2941 gcctcacaca ggaaagcgcc tttgagattc gcctcagtca aatcagctcc aatcagattg 3001 gcatgagtta aatcagcttg tgtcagtgta gcagcgctta agttagcaac actgaagtta 3061 gcagcgctta aattagcttt tatgaggttg gctgtgtaga ggttagcgcc attgagttca 3121 gcatctctaa gatctggttc aatttgggga tttttccttc tccactcatt ccatgccagc 3181 gcacctattt gtagtaaagc tagatgctga agatttgcca tgttctactc ctgacgccca 3241 ctctgaggat ttacccgacc agctacattt tcaatcgccg ttaacgctcc ttgagcacct 3301 gcaagaatat ctcgttcttg tccacctaag taaagccgtc caaagcttcc tacaggttga 3361 acagcaagga tgttaatcgc cgctgctttc tccgcttcat tcgcagccag tgcagcataa 3421 gcagcaggct caacttctaa tacatacagc gtttccccag ccagcagcat ttgtccccgg 3481 ttcgtgcgat tgagaagttg tgcttggtaa gcatcaatgt tgcggataat ctggttagaa 3541 acaacacgcg gtttgagaca ttcgtctttt ttgactccta gtgtcgccaa aatggcttga 3601 ccagcagctc gcgtttcccc ttgggagcta gaatgaattt ccaatagacc atataatcgt 3661 tcgaccacga gtgctccagg acggacagag gctgctttca gtgctacatc cgtaatccga 3721 ttaatttcga taccaggaga gatttcaatc cacagcgatg aatctcctgg taatggtaag 3781 aaaccttggg ctaccgttcc catatatgct gcatgttgag gttgcaggct gtcgagaaat 3841 acgaaactgc gtagttctat tcccaaggtt ttgctctcca gtcataaggg gaaaagttat 3901 ctgttgcaag atgttaagtg taaccaaccc taaactacca taaggctggg tactggaatt 3961 gctggattat gaagtttgtt tttcagtaat tcgcgcttgt tgttgagccg cgctcttttt 4021 caaacccaaa gaacgtagaa tttggacagc agccatttca cctgttctac aaccaccttc 4081 catataacct tgaaatcttt gggagcaatg ttcaccagca aaaaacagat tccctacatt 4141 ttcttgttca gctcctgcaa ttgttgtcca ctgtccaaca agatagcaag cataggaacc 4201 tcttgtataa ggttctctgg gccaatacgc 
acgaatagct tcacctttgc ggacactagc 4261 aattcctgga aatatctttt ctagttgtgt gagcagtttt tgagcttgag attcagccga 4321 accttgccca agaagtaaac tgctttgagc gctggtaaaa ttggtgagta gaccatttga 4381 acctggttga taacgagaag cttcccagat ggtttgaaag tctaagtcgc aagagacaaa 4441 ggctgttgag ttatagcgtg tgcgccatat gcgttcttga taagcggtaa ttaacttggc 4501 attgttaccg taacctaatt gggcaatcgc cttctttttc accgctggca aatctacatt 4561 taacgagact tgtcgcagag tactaaaagg taacgccagc aaaactcgct catatgttcg 4621 ttcaaaagag cggttaccag aacgtaaact gactcggtag ctaccatcat ttcgagttgt 4681 aatagcttct aattgagtcc ccgtttcaat agaatttgtt aacaatcgcg ctaataaacg 4741 aggaacttgg tcatttccac caacaatctg ataacgttca tcactttcgg cactaatctg 4801 aaagctgttc gtatctgtgc caataaaaag cagcatattc aaactagact gttcccgagc 4861 ttctcgaccg tataaaccag tataagcaac ttgtattctc tgactcagaa gcgggtgaac 4921 ttgtgcttcc tctaagtact gagtgatgga ggtattatct aacttaatag caacctgatt 4981 gtgtgtacga taagtaacag gtgtcttgcc gatagcggct aaatctcgct tgattttctg 5041 caccaatgga gtgaaccact gtaagatttc tttttctgat atcttacgcc cctgaaaata 5101 caatgttcct tgtatcaaat ccttatcagc agcatacaaa tctgcaattt gcaaacctaa 5161 ttcctgcgct aaagagcgta agctagtatg ttttgtgtca ataaattctc cacccaagtc 5221 tacaggtatt gaagtcccta aagcattttg tagcgtatat attcgaccac cgatacgatt 5281 tcttgcctca acaatatcca caggaactcc agcttgataa agacgataag cagcgacgag 5341 tccagcaatt cccgcaccta ccaccaagac tttaggaatt gtaccaaatg caactgagtc 5401 acttccatca tgccacgtca ctgcactgat ggcacttgct aatcccaaac ctccatatat 5461 taagcgacga cgagtggttt tttgctggag aatatcaact atttcgtccg caggaattcc 5521 tgtttctaaa gatacctgag tgattttgta agctcgctgt aagaatgcca tcaacgccga 5581 tctcgccatt gcatttgcct ttattgttta ttgttatttc aagacaaaac acaccagcta 5641 gctgattagc tactctcatg agtttgacat cctcccacgg ctgcccaaat atcgcactgc 5701 gtgctcattt ggacaaaaag ccgtgggatt ccaagacttg cgccttagat tttctggttc 5761 cacgagtcgg cttacctgaa ggagtttcct cacccaggca gaggtcgttc tcatccccag 5821 acgttttcgt tcccgtgtgc cccacggtac ggagtgctat ctcaagaatg ttcagtgctg 5881 cattatggtg agacagcgct gcaggagggt ctccctccgt 
aggcgactgc gaacccgaag 5941 ggtcacgatc aagaacagtt ccgcaatgtg ggcagcgatg agttctggtt gacagcgact 6001 tctggacttt gttaccacaa tgcgagcagt tctgagaagt attgtgtgga ggtacggcga 6061 ctgttgccac tccgaatact ttcccaaaat attcaacgca ttgacgaaac aatgaccatg 6121 ctgcatcact gattgacttt gcaagtttgg tgttcttgac catgttccgc accttcaaat 6181 cttcatacgc cacgacatcg ttagatatca ctacgcatct tgctaacttc acagcaaaat 6241 ctttacgttg acgacttact ttgaggtgct taagcgctag ttttttccta gctttggttc 6301 tgttgctcga ccctttcttc tttctagaaa ctcgtttttg tagacgtttg agcgctttct 6361 cccctttgcg tagatagcgg ggattctcga ccgtgtttcc cttgctgtca gtgtagaagt 6421 gattcaaccc aacatctagc ccgacagttg tttgtgttgg ctctcgtttt tctaaacgtt 6481 cctgatcaac acaaaactgg caataataac catctgcacg ccgtacaacg cgaacacgtt 6541 ttatctggtt caactggtag aaatggaggt cacgagtacc ccacatctta aaagtcccag 6601 catcaaactt gtcagtgaaa gtgatagaac gacggtcttt tgacagcttg tatccagttg 6661 tcttgtactc cacacttcca tgagtttgtt ctctgcgaaa cttgggatat ccttttttac 6721 caggaacttt ctttttgcaa ttggaataaa aacggttgat agactgccac gctctttcac 6781 cactagcttg acgcgccatt gagttaagct tacccgccca ctcatacagg gaagcaagta 6841 tagcggtgta tgcactaatc tcgttttttc ctacatcccg gttatccatc cagtatcgaa 6901 cgcatgagtt acggatgaat ctggctgtac gaatcatttc gtccagaatt gtgtattgtt 6961 cgctacttcc ttctaacttt gcttctaata ccagcatgaa taacctcacc tcctcatgat 7021 taaccttact ttgtttattg tgatgtaaac ttcacatata aagttcctaa ttttggaaaa 7081 tagcacagat atttagactg caacgcgcat ttgcatcgta ttcatgagcc agtcgcttgc 7141 gggggttccc cccgttgtgc gacctggcgt ccccacccct gaaaggggat gggctttcta 7201 ctctcgatac tttgtaacat tgttcctaaa aaagaaaact gacacatacg ccttgtatgt 7261 gccagtagat tacgtgctaa tttttgcagt gacaatcgaa atcaccaagc aggggagtgt 7321 caatattagc caccgatagt cgtgcgatta atcaaagtgt ctgcgctacc ttgggcttgt 7381 ttgtttaaag gcgtcccatg cagtcttgga tcgctttgag gaggtagtgg ttctggtcct 7441 agtggttggg gattttctac atactggaat tctcccttac cgtctggaga aggacctttt 7501 gcccagcgac cttcagaact ttcggtacct tcagagttgt tccagaactg ataagcaaac 7561 tcgcgcttct ccaattccaa cgggaaggaa ctgggaactg gagtactttc 
caatccggat 7621 tcttttaagt cctcaattgc agcttcccac tggttttgat gcatggtgtc gcgagctatc 7681 atgaaactca gggtatcttt cactcctggg tcattgctca tctcatacat tcgcactgct 7741 tgcaagcgtc cttgagattc ggcatgaaga ttggagcgaa aatctgctag taggttacca 7801 ctagcgacga taaagcgacc gttccagggg aaaccaacgc tgtcggctgg gagagcaccc 7861 aaaccagaga caattgcatg ttggggattc attgctgccc ccatgatatc tgtaatgaca 7921 ttctgtgtat tggaaccacc cataaccgca cccaccactg ggtcttttgc accatcttct 7981 tgcaatttga tgggtgcttt atccagcagg tgagcaatca ttgtggcaag catttcgacg 8041 tgaccgattt cttcggtacc aatatctaag agcatatcgc ggtatttagc aggtccgcga 8101 caattccagc cttggaacag gtactgcatc atcacggtca tctcaccaaa gacaccaccg 8161 atgagttctt gaatcttctt agcgtaaata gcgtctggtt tttctggtgg tctgaagtac 8221 tgtagcttct tggtgtggta gaacattatt attacctaat tagttgaaca tgggtctggc 8281 acaaaatgag gctaatctga tttttgtgcc gcctatgctt tctaggaaac ttatttcatg 8341 cttgcgatac atcactctag agttaaataa aaagtactga aaaaataagt aaatcgactg 8401 agggcagttc gtttccaagt gaatctagag atgcacatcc agaaacttca tttatcataa 8461 aatgaagaaa aaaccagaag tttagcttct ggtttactat aactaagaga gaattttgtc 8521 aaaatttact aagtataaat taattttcgt ctcgaccgtg aacttgggcg ggtggactgg 8581 ttgcctcaac agcggtaaat ggttctagaa ttttgtaagt gtgggatgct ccttttggta 8641 cgacccagga atttccaggc tctagtaaaa cgacttgacc ttcaaggtgc aattctgcac 8701 gaccagtgat gacataacca acggtttcat attcgcgtgt tgctggttct ttggattcac 8761 cgggttgttc gttttcccaa agacgcattg aaacagattt accagaggcg agatactttt 8821 gacccagttg acctttggga gagtaggctg agtctacttt tttaacgctt gtgtcagaca 8881 tatttatttc ctcagttagg tttatgaaaa atgaaccgca gaggcgcaga gagcgcagag 8941 aaagaaaagg aagaggagag ttggtgagtg ctaaactttg atggttttgc ctgtgcgggc 9001 tgcttcgtaa atgagctgct tgacactccc ctgcctaagg gcgaggggat tcaacattca 9061 tcg // LOCUS NODE_3644_length_9050_cov_5.2762659050 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 9050) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 9050) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..9050 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(105..725) /locus_tag="DP116_24570" CDS complement(105..725) /locus_tag="DP116_24570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210083.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_24570" /translation="MVKSSPKLLTVDEFITYYGDSDRHELIDGELIEMEPTGPHEQVS AFIGRKLNVEIDRQDATYFIPHRCLIKLLGTDTAFRPDLIVLDQTRLINEPLWQKEPV ITLGNSIKLIAEVVSTNWQNDYARKVEDYATLGVPEYWIVDYLGIGGKEYIGKPKQPT VTICTLLEDEYQKQLFKNNDQLVSLTFPNLQLTAKQVFTAGQSVPM" gene complement(766..1557) /locus_tag="DP116_24575" CDS complement(766..1557) /locus_tag="DP116_24575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743210.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="endonuclease/exonuclease/phosphatase family protein" /protein_id="PRJNA477356:DP116_24575" /translation="MVKLVTINILYDMADWEQRRTLLVDGLQAEQADLIALQEVKLPE NTAAWLADKLDMPYVHLVPDNRPTSISDMGVGNAILSRHPFVEEAVLDLQSQGRVAQY VQVNLDGQPLVFCNGHYYWYPGHHPERTKQVQLLIDWLSQLPSSIPIVAVGDFNGTPQ TPAIMLMKQHFTSAYAKYHGQEPEYTCPTPLARRSWKKSLRLLVRDLISNRTLRPWRG TLDYIFINQHLRVRSCKIILNQPARESRTLYPSDHFGIAADLELV" gene 1624..2757 /locus_tag="DP116_24580" CDS 1624..2757 /locus_tag="DP116_24580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214529.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="branched-chain amino acid ABC transporter permease" /protein_id="PRJNA477356:DP116_24580" /translation="MAEYLIFLTISTSIFALFGLGLNLQWGFTGLINFGHVAFMTLGA YTTVLLTLKGVPLVLSAIIGAVIAALLGLIIGFSTLRLREDYLGIVTIGVGELIRLVV NNQDLPVGNEWRSGAFGVQSYPIPLATLVPNLFIKLLLIGLLTLLVLITLWQLWRWIG VSRVSSSANSQKKVGSKQEFISHLVIGVFLALLTVAIYISGVIGLYDYKATTGLMLVS LIVLAFVFWRLEILVRSPWGRVLKAIREDEEIPKALGKNVFWYKLQSLMLGGAIAGIA GAFIAWQLSAIYPNNFEPIETFTAWIIVILGGSGSNLGTILGAVLYFMYYEGTRNLDK IISLDSDRLSALRIMIIGLILMVLMIWRPQGILGKKEELTLGK" gene 2747..3532 /locus_tag="DP116_24585" CDS 2747..3532 /locus_tag="DP116_24585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746114.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24585" /translation="MANNESSQFPLLAATGVCKNFGGIKAVDNAEIQVSSGSITGLIG PNGAGKTTLFNLLSNFIRPDKGRVTFDGEPIQNLQPYQIAQKGMVRTFQVARTLSRLS VLENMMLAAQKQTGEKFWQVQFQQPKIAKEQKEFQERALFLLESVGLAHKAYDYAGGL SGGQRKLLEMGRALMTEPKLILLDEPAAGVNPRLIDEICDRITGWNNSGMTFLIIEHN MDVIMSLCDRVWVLAEGRNLADGTAQEIQSHPKVLEAYLGKSA" gene complement(3720..4286) /locus_tag="DP116_24590" CDS complement(3720..4286) /locus_tag="DP116_24590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24590" /translation="MLVLALQGSKPVKVQLDEQELFYGELKDFVSPGVGAAFSFGLGL AGALLWTWQQSLRQSSELEKQLSNLQNQISEKDSQIHTLKVSPSSPMLSQLRWFLDED ESKVQAAISTANASSVKQAAPPISTKKANEPTFSQAVTKPLVITSTEYETQPITVSQL TAQTATSTFPSAQSALGFTQKYRKRADA" gene complement(4844..5410) /locus_tag="DP116_24595" CDS complement(4844..5410) /locus_tag="DP116_24595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746109.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="PRJNA477356:DP116_24595" /translation="MNNEEFKILLRFFKALADESRLKILGILANQECSVEELAALLRL KEPTISHHLARLKELNLVTMRPEGNARLYQLDNEALQTMGKEMFTLEKIASLGEDVDT EAWESKVVKTYIEGNHLKEIPTSRKKRLVILKWLASKFEEGITYPEHTVNEILKCYHP DYTTLRRELISYQLMRKENGVYWRFVQK" gene 5622..5798 /locus_tag="DP116_24600" CDS 5622..5798 /locus_tag="DP116_24600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999735.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_24600" /translation="MLKHGMSVAQAAKYLQVEQSTLYIALQKGQIPDLRRNGRTVVSE GALLDYRARTQTLL" gene complement(6205..6924) /locus_tag="DP116_24605" CDS complement(6205..6924) /locus_tag="DP116_24605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Crp/Fnr family transcriptional regulator" /protein_id="PRJNA477356:DP116_24605" /translation="MSASQNHCKPIKNRLLASLPKEEQERLQPHLELIPLEYKQLLYI PNEPIQYVYFPNYGVISLITIMQNGDAVEVATIGNEGMVGLPILLGANTIPGQALVQV PGEGLRIKVDVFQREVTPGSPLYKLLQRYTQALFNSIAQLVACNRLHSIEERFCRWVL MTQDRVGKNEFPLTQEFLGQMLGVRRASVSVVAAMIQKAGLITYKRGKMTILDREGLE DVTCECYVMIKDEFESLVDDN" gene 7175..7843 /locus_tag="DP116_24610" CDS 7175..7843 /locus_tag="DP116_24610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872495.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chemotaxis protein CheB" /protein_id="PRJNA477356:DP116_24610" /translation="MGHTVVSHFPNVAYNVVAIAASRGGLKAISQILSTLPAEFPAPI TLVQHLSPQHRSHMAEILSRRTALQVKEAEEGELLRPGTVYIAVPNKHLVVNPDATLS LSDAPKINFVRPAGDKLFTSVASSFKSRAIGVVLTGKDGDGVLGVLAIKKYGGTVIVQ DEASSECFSMPKSAIDTGKVDFVLPLDAIAHRLLTLVTTQEVHSQELSQKSSNSSAGL TSGR" gene complement(7798..8640) /locus_tag="DP116_24615" CDS complement(7798..8640) /locus_tag="DP116_24615" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24615" /translation="MDFFNGDSKSTFLEDLQSELDNSQHHNIWNFTVLSDTIEIIEKI VMKISQFDQTGVQLENTKHKDSLIITSLDLNSGQRSEVEDTVELLCEQVQTLKQLLYH KELELQEIQQELLHTNEDLCAALNSPSLALDSAKESVKEILVSKNPIGETLPTRFSTI YNSTVKPSELGHKEKSNSIQPLISAPDNSILINNQAYQIKITALIKQGREIQAKSKIL REQATEIRAKSREVEAQFMEVGTKFIGSRASFLLREPNFRHKQKPKAIHLPDVSPADE LLDF" BASE COUNT 2488 a 1838 c 1997 g 2727 t ORIGIN 1 gtgtaggggt gtaggggtgt aggggtgtag ggagcaggag aatatgagaa tttttatgat 61 tagttatggt gtacgcacct gatcagggct acttcccaaa taaactacat gggaacagac 121 tgtcctgctg taaagacttg ttttgccgtg agttgcaagt tgggaaaggt taaggaaaca 181 agttggtcgt tatttttaaa taactgtttt tgatactcat cctctaataa tgtacaaatt 241 gtcaccgttg gctgcttagg tttaccaata tattctttgc caccaatgcc cagataatca 301 acaatccaat actcaggtac ccctagtgtg gcgtaatctt caaccttgcg ggcatagtcg 361 ttttgccagt ttgtactgac cacttcagca atgagtttaa ttgagtttcc taatgtaata 421 acaggttctt tttgccacaa cggctcattt atcagtcgag tttggtctaa cacaatgaga 481 tcaggacgaa atgcggtatc tgtccctaat aatttaatta ggcatcgatg gggaataaaa 541 taggttgcat cctggcggtc aatttctaca ttcagttttc gcccaatgaa ggcggatacc 601 tgttcgtgtg gtccagttgg ttccatctca attaattcac catcgattaa ctcatggcga 661 tcgctatcgc cgtagtaggt aataaactca tcaacagtta aaagttttgg cgatgatttt 721 accattgtga ttattcccga taattcttgc ctacccaaag ttttttcaaa ctaattctaa 781 gtcagcagca atgccaaagt ggtcggaagg gtacaaagtg cggctttctc gtgcaggctg 841 gttaagaatg attttgcacg aacgtacccg caaatgttga ttgatgaaga tgtaatctaa 901 ggtaccgcgc cagggtcgta aagtgcgatt agatattaag tcccgtacaa gtaggcgcag 961 tgattttttc cagctacggc gagcaagagg tgtagggcag gtgtattctg gttcttgtcc 1021 atggtactta gcataggctg atgtaaaatg ttgtttcatc agcatgatag ctggagtttg 1081 gggtgtgcca ttaaaatcac ccactgcgac gattggtatg gacgagggaa gctgacttaa 1141 ccagtcaatg agtaactgca cttgtttagt acgttctgga tgatgacccg gataccaata 1201 ataatgtccg ttgcaaaaca ccaatggctg accatctaaa ttaacttgca cgtattgagc 1261 aactcgtccc tgagattgta aatcgagaac tgcttcttcc acaaatgggt ggcgactgag 1321 
aattgcattt cccaccccca tgtcgctgat acttgttgga cgattgtctg gaactagatg 1381 cacgtatggc atatcaagct tatcagcaag ccaagcagct gtgttctctg gcagcttaac 1441 ttcttgtaaa gcaatcaagt cagcttgttc tgcctgaagt ccatcaacta agagggttcg 1501 ccgttgttcc cagtctgcca tgtcgtacaa gatattgatt gtaacaagct taaccatttg 1561 ctagtgccat gaagcgtgtt ttacatcgcg taatatcaat taataactaa tgactcatga 1621 cgtatggctg aatatctcat tttcttaacg atttctactt caatttttgc attgtttggt 1681 ttggggctga acttacagtg gggttttacg ggtttgatta actttggtca tgttgccttt 1741 atgacgttgg gcgcgtacac cactgtgttg ttaaccttaa aaggagttcc tttggtgctg 1801 tcagcaatta ttggggcagt tatcgcagca ttattaggat tgataattgg tttctcgact 1861 ttgcgcttgc gagaagatta cctaggaatt gtgactattg gcgttggcga actgattcgt 1921 cttgtggtga ataaccagga tttacctgtt ggtaatgagt ggagatctgg agcgtttggt 1981 gtgcaaagct atcctatacc gctggcaact cttgtgccta atctcttcat caaacttttg 2041 ttgattgggc tgttaacgct acttgtgctg attactttat ggcagttatg gcggtggatt 2101 ggtgtttcta gagtgtcaag ttctgcgaac tcacaaaaaa aagttggtag taagcaagaa 2161 tttatatcac acctggttat aggggttttt ttagcgctgt taacagtggc aatttatata 2221 tctggtgtta ttggcttata tgactacaaa gccacaacgg gtttgatgtt ggtatcactt 2281 atagtattgg cttttgtctt ctggcggttg gaaattttag tgcgatcgcc ctggggtcga 2341 gttctcaaag ccatccgcga agatgaagag attcccaaag ctttaggaaa aaacgtcttt 2401 tggtataaac tacaatcact gatgcttgga ggtgcgatcg caggaattgc aggtgctttc 2461 atagcatggc aactttccgc aatttaccca aataattttg aacccataga aactttcact 2521 gcttggatta tcgtcatttt aggtggttct ggtagtaatc tcggcacaat tttgggtgca 2581 gtgctttact ttatgtacta cgaaggtacg cgcaacttag ataaaattat ctctctcgat 2641 tcagaccgtt taagtgcatt acggattatg atcatcggtc ttatcttgat ggtactgatg 2701 atttggcgtc ctcaaggtat cttagggaaa aaggaggaac tcacccttgg caaataacga 2761 gtcatcccaa tttcctctgc tggctgcaac tggagtttgt aaaaactttg gtggtattaa 2821 ggctgtagat aacgccgaaa ttcaagtatc aagtggtagt atcactggat tgattggtcc 2881 taacggcgca ggtaaaacga cattatttaa cttactctca aatttcatcc gtccagacaa 2941 aggacgagtc acctttgatg gcgaacccat acagaatttg cagccatacc aaattgctca 3001 aaagggaatg 
gttcgtacct ttcaggtagc ccgaactctg tcgcggttgt cggtgctaga 3061 aaatatgatg ctagcagcac aaaaacaaac aggtgagaaa ttttggcaag tccaattcca 3121 acaacccaaa attgctaaag aacaaaaaga attccaagaa cgagcgcttt tcctcttaga 3181 gtctgtagga ttagcgcaca aagcatatga ttacgctgga gggttatcag ggggacagcg 3241 caagttgctg gaaatgggaa gagcactgat gactgaacct aagttaatct tattggatga 3301 accagctgct ggggtaaatc cgagactgat tgatgagata tgcgatcgca ttaccggctg 3361 gaacaacagt ggtatgactt ttttgatcat cgagcacaat atggacgtta tcatgtcgtt 3421 gtgcgatcgc gtttgggtgc tagctgaagg acgaaatctc gctgacggaa cagcacaaga 3481 aattcagagt catccaaaag ttttggaggc ttatctggga aaatcagcat agcagcaatg 3541 tagagacgtt gcatgcaacg tctctacaga gttttgatga tacacaaacc aatcaaaact 3601 catgcgatag accactacct tgtccttaac ttaagtgaac agctacctac gcagtaccag 3661 ccgcagtgaa cagtgataac tgataactga taactgataa ctgataactg attgctgatt 3721 caagcatcag cgcgcttacg atacttttga gtgaatccca aagcagattg agcagagggg 3781 aatgtagaag tcgctgtttg tgcagttaat tgactcactg ttattggttg agtctcatac 3841 tctgtgctag taatgaccaa cggtttggta acggcttgag aaaacgtggg ttcgtttgcc 3901 ttttttgtgg aaataggtgg cgcagcctgt ttaacagaac ttgcatttgc agtagatatt 3961 gctgcttgaa ctttactctc atcttcgtct aagaaccatc tcagttgaga aagcatcgga 4021 ctggatggtg aaactttaag cgtgtgaatt tgagagtctt tttcggatat ctggttttgg 4081 agattggata actgcttttc taactcagaa gattgacgta aggattgttg ccatgtccac 4141 aatagcgcac ctgcaagtcc tagtccaaaa ctgaatgcag ctcctactcc aggagaaaca 4201 aaatccttta gttccccata gaaaagttct tgctcgtcta gttggacttt cacaggctta 4261 gacccttgca gggctaaaac taacatgaac gaggcaaaaa gcgtacctga aatcactacg 4321 gtcgggagta agattttttt gagcataacg aaagcctact ggtattaact gtcaagacaa 4381 cttaatcaaa gacccgtacc cgtgctcaaa ggctgtacct gctacctcaa aagtttcaaa 4441 ttcctaattc aaacagtgaa cagtgaacag taaacaggga ctgataactg gtaactgata 4501 actgataact gataactgat aaaggcaaca gaaggatacg gattttactt actttctcat 4561 ccaaagttag taaatttatt gaccatttgt ttgtaaattt ttgatgaagt agaccattat 4621 tttatgaagt ttttgtatgt tttatctttt gactgcttaa aaatactgca attggtataa 4681 aaagtttgaa atttctgtca 
acaaactagg ggtgtagggg aaagaaaaat gaaagcaagg 4741 agttttttga tgggggggaa accaattcac gtgaaaagcc tattcttccc ctacacccct 4801 acacccttac acccttacac gcttacaccc ttacacccac tcgttatttt tgtacaaaac 4861 gccaatacac tccattttcc tttcgcatca actggtagct gattaactct cgccgcaagg 4921 tggtgtagtc aggatggtag cacttgagaa tttcattgac tgtgtgttct gggtaggtta 4981 ttccctcctc aaatttactt gccaaccatt tgagaatgac taagcgcttt ttgcgactgg 5041 tgggaatttc tttcaggtgg ttaccttcaa tataggtttt gactactttg ctctcccacg 5101 cttcagtatc tacatcctca cctaaagaag ctattttctc aagggtgaac atttccttac 5161 ccatagtttg taaagcttcg ttatccaact gatacaaacg agcattaccc tctgggcgca 5221 tagtcaccaa attgagttcc ttgagtcttg ctaaatgatg cgatatggtt ggctctttaa 5281 gtcgtagtaa cgccgccaac tcctccacac tacactcctg atttgccaag atacccaaaa 5341 ttttgagtcg gctttcatct gctaacgcct taaaaaaacg caacagaatt ttaaactcct 5401 cattgttcat cttttgttat gatgttctta gatatcgatc taattaggtt tgcatcaaat 5461 tgttgtgctg ttcttgttcg acacagcaca gacatggtgt tttggttttt tgaagttatg 5521 ttaattctgt tacgaattgc taagacaatt gagatttctt aagtagttct ttacatttca 5581 caatgtttcg ccgctttcaa aatagcaaag gagtattgaa tatgcttaag catggtatgt 5641 cagttgcaca agcagcgaaa tacttacaag tagaacagtc cactctctat atcgctttgc 5701 agaaaggaca aattcctgac ttgagaagaa atggacgaac tgtagtgagt gaaggagctt 5761 tacttgatta tcgagcgaga acacaaacct tactataaga atttctaaat gtacaattaa 5821 atatttcaaa ttctctaatt gcttaatgca tggtaatgaa gagcgtaatt ataagtaatt 5881 gcaaattcat gtattaacaa tttaaaatat tcttctggaa gtcatataaa gttaattccc 5941 aaaaaggaag aaatgaaaat gcgcctcttc ctttggaaat gattggcatt atctgatatg 6001 ttgcacctgg tcaaacaaat gtcaaagtta ttacatattg tttgactccg ttgtcgtcca 6061 cctgggactg taagtcccag tctcatagcc aaagtccatt aaaatggact cacagtggaa 6121 ctggtttttg agtagattta aacacttttt catacgagca aagcgtggtt aactcgtaat 6181 gatgcaaaat atcagttatt gtggtcaatt gtcgtccacc aagctttcaa actcgtcttt 6241 gatcattaca taacactcac aggtgacgtc ctccagacct tctcggtcga gaatcgtcat 6301 ttttccacgc ttataggtga ttagccctgc cttctgaatc atagcggcaa ccacactcac 6361 actggcgcga cgtacgccca gcatttgccc 
taagaactcc tgagtgaggg gaaactcgtt 6421 ttttccgact cggtcttgag tcatgagtac ccaccggcaa aatcgctctt ctatggagtg 6481 caggcggttg caagcaacca gttgggcgat tgaattaaac agggcttgcg tgtagcgttg 6541 cagcagcttg taaagtggac ttcctggagt cacttcgcgt tggaatacat ctaccttaat 6601 tcttaagcct tcgcctggaa cctgcaccaa agcctgacca ggaattgtat tggctcccag 6661 taatatgggt agaccaacca ttccttcatt gcctatcgta gccacttcca ccgcatcacc 6721 attttgcata atggtgatta gagagatgac accatagttg gggaaataga cgtactggat 6781 cggttcgttg ggtatgtaga ggagttgctt gtattctagg gggattagtt ctaggtgagg 6841 ttggaggcgt tcttgttctt ccttggggag ggaagccaac agccgatttt taatgggttt 6901 acaatggttt tgggatgcgg acatagagaa cctttttgag ccagtggtat ctcaatcaga 6961 taactcacag catcagttga ggtcgtctgg cgaaagatca actcttttgt aaagaagtta 7021 tgtacgcaag cgcacataat ttccagaaaa tacagtacaa tattatttgt accctagggt 7081 acataagcgt taaactcgta attccatatt cgggcacgca acaagaaata ctttgcaaaa 7141 tcatcaagca gaagagaaaa acgtgatttc gcacttggga cacacagttg tctctcactt 7201 tccgaatgtt gcttataatg tagtggcgat cgcagcttct agaggtgggc taaaagccat 7261 tagccaaatt ctttcgacat tgccagctga gtttccagca cctattacat tggtacagca 7321 tttgtctccc cagcaccgca gccatatggc agaaatcctc agccgccgca cggctttgca 7381 ggtgaaggag gcagaggaag gcgaattgtt acgtccaggg acagtctaca ttgctgtccc 7441 taacaagcac ttggtggtta acccggatgc taccctttct ctctcagatg caccaaagat 7501 aaactttgtc cgccccgcag gtgataaact gttcacgtca gtggcgagta gttttaaaag 7561 tcgagcgata ggagttgtcc tgactggcaa agacggcgat ggggttttgg gtgtgctagc 7621 gataaaaaag tatggcggta cggtgattgt tcaggatgag gcgagtagcg aatgtttcag 7681 tatgccgaaa tctgcgatag acactgggaa agttgatttt gttcttcctt tggatgcgat 7741 cgctcaccgc ttactcaccc tagttacgac acaagaagta cactcacaag aactatctca 7801 aaaatcgagc aattcatctg ccggactaac gtccggcaga tgaattgctt tcggcttttg 7861 tttgtgccta aaattaggct cacggagcag aaatgaagct ctagagccaa taaatttagt 7921 tcccacttcc ataaactgag cttcaacctc tctagatttc gcccgaattt cagtagcttg 7981 ttctctaagt attttagatt tcgcctgaat ttcacgacct tgctttatca gcgccgtaat 8041 cttaatttga tacgcctgat tattgataag tatagaattg 
tctggagcac ttatcaaagg 8101 ttgaatagaa tttgattttt ctttgtgccc taactctgaa ggtttgacgg tcgaattata 8161 aatagtgcta aagcgtgttg gtaaagtttc acctattgga tttttactta ccaaaatttc 8221 tttaactgat tctttcgccg aatcaagtgc tagcgacggt gaattaagtg cagcacacaa 8281 atcctcattt gtatgaagta actcttgctg tatctcttgc agttctaact ccttgtggta 8341 cagcagttgc ttgagagttt gtacttgctc gcacaaaagc tctactgtat cttcaacttc 8401 tgatctttga ccagagttaa ggtcaaggga tgttattatc agactatctt tgtgtttagt 8461 attttctagt tgaacacctg tctgatcaaa ttgactgatt ttcatgacaa ttttttctat 8521 aatttctatt gtatcggata ataccgtaaa attccaaatg ttgtgatgtt ggctattatc 8581 cagctctgac tgtaagtctt ctaagaatgt tgacttggaa tcgccgttga aaaagtccat 8641 gcactacgct cgcaagctta tctacttatt ttagccgcta acaagtaaca agagcagctt 8701 tctcgttcct tgccgtactg ctctctcgtt cccatgctca gcgtgggaat gcactatcgg 8761 aggctctgcc tcccaatgat tatattgagg cagagcctca agtcaggcgt tcccatgcag 8821 agcataggaa ccagaggaaa gattttagcg ccgattcaag cgagggtgag gttttctctc 8881 gttaggagga ggttggttac tgtaaattac caagacgccc tcggaacctc accctgccct 8941 atcgggcatc cctcacgcca ggtgcttcaa gtcgggcatt gcccgcccaa cgcactggct 9001 ctccttggta aggagaggga aagattttag cgccgattca agcgagggtg // LOCUS NODE_3682_length_8929_cov_5.2409298929 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8929) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8929) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8929 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 38..1288 /locus_tag="DP116_24620" CDS 38..1288 /locus_tag="DP116_24620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315481.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR00303 family protein" /protein_id="PRJNA477356:DP116_24620" /translation="MTNEFIRIYTQIDQAQEWMSKYRGCLPVFACVLGFTETGLIPGI SAAGRTCEDRKYTACADAEFLYYGAEHKPKYSLPPLAAGASPVIISRAVVEALNIPVY LFNAGLPQSPAVPAIDLGGYSAGCLSQGAAMELATVDHLFNQGLVWGERLGRELQNRY LILSECVVGGTTTALAILTGLGIPAVGKVNSSHPVCNHGQKWAVVQAGLEKMRNREVG EEEKIGVKEGMREGERKGNAHSLIPSSSVIDPLKLVAAVGDPMQVVVAGMAIALSRSC GVLLAGGTQMLAVYALISAIAQVYALSWRPEEVVIGTTRWVAEDLTGGTIELAQLVGQ SNITPDGVTPPLLATELSFADSRYPQLRAYEQGFVKEGVGAGGACIAAHLVRGWQQHQ LLQAIEDQLEQYQNKSEYRSQNSE" gene complement(1448..1831) /locus_tag="DP116_24625" CDS complement(1448..1831) /locus_tag="DP116_24625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315482.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24625" /translation="MEPLQKQVLTLGNKLDALSEVISQLDSKVSQAISEGCLVKAQEA KDNLLENSEARRYQLKGQSSFSSELEHKDVITDGIYPDSTNFQGGEKSLTPEIQIQRL TAQLTAAYNRIAALEEQLLLKRIHS" gene 2569..3711 /locus_tag="DP116_24630" CDS 2569..3711 /locus_tag="DP116_24630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315483.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="polyamine ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_24630" /translation="MERRSFLLGTTTLALSQLLVGCANNKQVLNVEILQGSIPGQVVN QFSRLQPKMQLKFTPVEQLEDLFKKLQTWQKKTKTTDEGGWRRFLPFAKSQKVTKADL VTLGDYWISYAIEKKLIQPLDVTKIQQWSALPKKWQELVTRNDKGLVDSKGNVWAVPY RWGSTVIIYRRDKFQELGWTPKDWSDLWNEKLQGRISLLNQPREVIGLVLKKLGKSYN TENLDTVPNLEKELQLLNQQVKFYDSTRYLEPLIIGDTWLAVGWSNDIAPFLGRYPEL ATVVPDSGTALSADVWVTPASQDRQSLLYEWINFCLKGDVAKKISLLTKTNSPIPTNI AESDFQKPLRSLLVINPKVFDKSDFLLPLSQQTMAQYESFFAKMKG" gene complement(4101..7829) /locus_tag="DP116_24635" CDS complement(4101..7829) /locus_tag="DP116_24635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872395.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component system sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_24635" /translation="MLNNLTIGNKIGASVGFGLMMLAAIGVVSYQATNNLIETSGWQT HTYQVLGDLNNLRSQLKDTEGAQRRYLLTGEKSELQSYNSAIPAVEESLKSLRRLTVD NPNQQRRIDALDPKIKQKLAIMQSSISLRDSQPLDAVRQQGLTDNSNNLTNEIRRLTL DLETEERNLLEQRSRRTQAATHQAMNTITYGIPLAFLLLTLIGIYLTRNISKPLQELS KVTEKLAIGDLSVSTTANNRQDEIGVLSQAFNEMVANLRETTKTNSEQNWLKSNVAKF SQMLQGQRSLETVASMILCELAPLVNAQQGVFYIMDSVDEQPMLKLIGSYAYQQRKHL SNQFRLGEGLVGQCALEKQKIILTDVPSDYIHISSGLGEAKPLNIIVLPVLFENQVTA VIELASFQRFGEISLIFLEQVTQTIGVVLNIIAADIRTQELLQQSQALTQQLQIQQEE LKQSNQILEEQTETLQASEKLLKQQQEELQQSNEELQQLNAEIEEKAELLSIQKKEVE RKNQQLEQTSQSLEEKAEQLALSSKYKSEFLANMSHELRTPLNSLLILANLLADNVEG NLSAKQVEYSRTIHSAGRDLLVLINDILDLAKIESGTMSININQMLFLELRDNIEGTF RQVAIDKKLSFTIELAPELPTTIETDVKCLQQVLKNLLANAFKFTERGEVSLRMFVAK QGWSSDQETLNGAATVIAFAVRDTGIGIASEKQNIIFEAFQQADGSTSRKYEGTGLGL SISRKIAHLLDGEIKLESRLGQGSTFTLYLPQAGGQEDRGTRGASTAGGFPSTGVWRR QGERGVTQGKSNTVWESFLTASLTPSLTPNSKEDTRTPDLPTPLIDDRGNIQPGDRVL LIVEDDLMFARILLDMARQKEFKVIVAHNGSTGLALVQKFQPSAIILDIRLPEIDGWT VLDRLKHNPSTRHIPVHIMTVEEGRQRGLQQGAIAYLQKPVSNDALHQALTKIKGFVD 
RSVKNLLVVEDDSTQRYSIVELIGNSDVETTAVGTGAEALAAIRSQHYDCVVLDLGLP DMSGFELISQIKQQPDGEALPIIVYTARELTRAEDTQLRRIAETIIVKDVRSPERLLD ETALFLHRVQANLPIPKRQMLEQLQSSDPVLAGKKVLIVDDDVRNIFALTSMLERYQM QVLYAENGKDGIEVLQNNPDVDMVLMDMMMPELDGYQTTRLIRQNNQFKSLPIIALTA KAMQSDRDKCIEAGASDYISKPVDTEQLLSLLRVWLYR" gene 7990..8715 /locus_tag="DP116_24640" CDS 7990..8715 /locus_tag="DP116_24640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009633378.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NGG1p interacting factor NIF3" /protein_id="PRJNA477356:DP116_24640" /translation="MITGSNLILLQEIAEFLDQFFAVERYSQEERGGVYLPSTRPVRR LGLVLEPCAQLQEWTNTQHLDALFLHRPWKLEPEQLSPDIGVISYHLPFDERLTFSFN PRLAQVLGMSSLEVLGKKDGRAIGMIGEIPTQSFAHFGHCVNQIFAGHEQVRAAQSAE VTRVAVVGAMTDLLVREAATRGANVYITGQLRQPAEQALLETKIGVIAVGHRRGEVWG LRSLAGVLRERWSSLEVVVPHHS" BASE COUNT 2458 a 1832 c 1913 g 2726 t ORIGIN 1 cacccctaca cccctacacc cctgtgtttc tttaataatg accaatgagt ttattcgcat 61 ttatactcag attgaccagg ctcaagaatg gatgagcaaa tatcgcggtt gtttacccgt 121 atttgcttgt gttttaggat ttactgaaac tggtttgatt cctggaattt ccgcagctgg 181 tcgtacttgt gaggatagaa aatacactgc ttgtgcagat gctgagtttt tatattatgg 241 tgcagaacat aaacccaagt attctctacc gccgttagcg gctggggcgt cccccgttat 301 tatttctcgc gctgtggttg aggcgctaaa tattccagtt tatttattca atgctggttt 361 acctcagtcc cctgctgttc cagcgattga tttaggtggt tattcagctg ggtgtttaag 421 tcaaggcgct gctatggaac tggcaacggt agaccacttg ttcaaccagg ggctggtttg 481 gggagaacga ttgggtagag aactccaaaa tcggtattta attttgagcg agtgtgttgt 541 aggaggaacg acaaccgccc tagcaatttt aactggctta ggcataccag cagtcggaaa 601 ggttaacagc tctcacccag tttgcaatca cgggcaaaag tgggcagtgg tgcaggctgg 661 gctggagaag atgaggaata gagaagttgg ggaagaggaa aaaataggag tgaaggaggg 721 aatgagggaa ggagagagga aaggaaacgc tcactccctc atcccctcat cttccgtaat 781 tgatcccctt aagttagttg cagctgtggg cgatcccatg caagttgtag tcgctgggat 841 ggcgatcgct cttagccgca gttgtggcgt tttgcttgct ggtggaacac aaatgctagc 901 
agtatatgct ctaatcagtg ccattgctca agtttacgcc ttatcctggc gaccggaaga 961 agtcgttata ggaacaaccc gttgggtagc agaagacctc acggggggca caattgaact 1021 agcgcagcta gttggtcaga gcaacataac tcctgacgga gtgactccac cgctactagc 1081 gactgaactg agttttgctg attcccgtta tccccaactt agggcctacg agcagggttt 1141 tgtaaaagaa ggtgtcggcg caggaggggc ttgtatcgct gctcatctag tccgaggttg 1201 gcagcaacat caacttttgc aagcaattga agaccaatta gagcaatatc aaaacaagtc 1261 agaatacaga agtcagaatt cagaatgatc ccatgtctaa aagccaaggg tgagaattca 1321 tatagtcgag cacaaaacac gctgaattca agcgtaaatc gaacaagaat gcaatgaaat 1381 tagctaaaaa cggtgtagaa gccttcaagc tgagttccta tgggaggagt tttaagttct 1441 ttgttggtta agaatgaatt cttttgagca gtaattgttc ttccaaagca gcaatgcgat 1501 tgtatgctgc tgttaattgc gctgtcagcc gttggatttg aatttctggt gttaaacttt 1561 tctctccgcc ttgaaaattc gtgctatctg gataaatgcc atctgtgatg acgtccttat 1621 gttcaagctc tgaactgaaa ctgctttgcc cttttaactg atagcgcctt gcttcgctat 1681 tctccagtaa attatcctta gcctcttgtg ctttcactag gcagccttct gatattgctt 1741 gagaaacttt actatcaagc tgcgaaatca cctcagagag ggcatccagt ttgttaccca 1801 gagttaggac ctgcttttgt aatggctcca ttgaatactc tcatccgctt attttctcta 1861 tagtattcaa tctattgacc acgttaacta tacttatgat ttttttaact tttaatatgt 1921 tccgttgaaa tgcaatactt aaattcttaa ttttaaatat atctacatat tttctacaaa 1981 tcttagtatc aatttttatt gacaaaaatt aattttgcgg atttgtttga gaaatacttg 2041 cgcttcacgt tttcttagtg ctgagtgtgt aggaatctaa agctaatcgt cgtgttaatt 2101 tcttagtaaa cctctggtga taaatgctct ggattaagcg acaaaaacct gttttttgcc 2161 agaaatcaat tcacgacgta aaacctttac ctcaacgcct ttgattttag ggttagaaac 2221 tgtataggtt tttcggagcg gactgtaaaa caaaaagtga gagactccgt actcaatact 2281 ggctgttttt gtgaataatt acttaggtaa aagatcacta atagcttaca ccagtatcaa 2341 attgaaaaag gaaattccaa ataaaattgt caagaaactc tgacaatcag cgatattgac 2401 tagtatgacc taaaaataaa tattttttca gataaaaata aaaactataa atcaaccaac 2461 agagatatgt ttaactcatc ccctggaatg gtctaactta attcttgcta cacttaggtt 2521 tgtaaaactg atactactaa cgcttttcca agctaaaagt caaaatcaat ggagcgacgg 2581 tcttttttgc 
taggtacaac tacactcgca ctttcacaac tgcttgttgg gtgtgctaac 2641 aacaaacagg tgctaaacgt ggaaatattg caaggttcta tccctggtca ggtagttaat 2701 caatttagcc gtttgcagcc aaaaatgcaa ttaaagttta ctcctgtaga gcaattagag 2761 gatttattta aaaaattgca aacttggcag aagaaaacga agaccactga tgagggagga 2821 tggcgtcgct tcttaccatt tgcgaaatct caaaaagtta ccaaagctga cttagtcact 2881 ttgggagatt actggataag ttatgcaatt gagaagaaac tcatacaacc actggatgtg 2941 acaaagatac aacagtggtc tgctttgccc aaaaaatggc aagaattggt aacgcgtaac 3001 gacaaaggtc ttgtcgattc caaaggaaac gtttgggctg ttccttatcg ttggggtagt 3061 acggtgataa tttatcgtcg agataagttc caagaattag gttggacacc gaaagattgg 3121 agtgatttat ggaatgagaa actgcaaggg cgcatttctc ttttaaatca accacgggaa 3181 gtcattggtc tagttttaaa aaagctggga aaatcttaca acacagaaaa cttagacaca 3241 gttccaaatt tagaaaagga attgcaatta ttaaatcaac aggtaaaatt ttatgattct 3301 actagatact tggaacctct gattattggg gatacttggt tagcagttgg ttggtcaaac 3361 gatatcgcac catttcttgg acgctatcca gaacttgcca cagtggttcc tgactcagga 3421 actgcacttt cggcagatgt atgggtaact cctgcaagtc aagataggca atctttgtta 3481 tatgaatgga taaatttttg tttaaagggt gacgttgcta aaaaaatatc tttactgact 3541 aaaacaaatt caccaattcc tacaaatatt gctgagtctg attttcaaaa accattacgt 3601 agcctgttag ttattaatcc taaagttttt gataaaagtg attttttact tcctttatcc 3661 cagcaaacaa tggctcagta tgaatctttc ttcgccaaaa tgaaggggtg atgagggaga 3721 gaggggtaag aaactgtaac ttaaaatctg aaaataacta gaaaaagttt cccgaattgt 3781 ccaggtattt atggtatcat ttgaacggtg gcttgattca catccgatag atacgcctag 3841 ttaactgtga aggttaaaaa gctcgttctg gggcttgtgc gccactaaca acaactgaat 3901 acttttaagt ctcataccga aattcatttc agtatctcta actggggatt agtggtcaag 3961 gaacctgagt tatggagcga tggaaccaag ccgaaaggcg cttctaaccc gtgcgaactt 4021 acaccgtgga gttgagtttg tacagattta tccacttgtt ggtagttttg tccagacttt 4081 tagaagcggt atacggaaca ctatcgatat agccaaacgc gcaaaagaga aagcaactgc 4141 tcggtgtcaa caggtttgga aatataatcg gaggcaccag cttcaataca cttgtcgcga 4201 tcgctttgca tagctttagc agtcagagca ataatcggca aagatttaaa ttgattgttt 4261 tgccgaatca gacgagttgt 
ttggtaacca tctaattctg gcatcatcat atccatcaaa 4321 accatatcaa cgtctgggtt attttgcagg acttcaatgc catccttacc gttctcggca 4381 tacaaaacct gcatttgata acgctcaagc atacttgtga gggcgaagat gttacgcaca 4441 tcatcatcga caatcagcac ttttttgcct gcgagtacgg gatcagacga ttgcaattgt 4501 tcgagcattt gtcgtttagg tatcggcaaa tttgcttgca ctcgatgcag gaacaaagcg 4561 gtttcgtcga gtagacgttc gggtgaacgc acatccttga caataatcgt ttctgcaatg 4621 cgtcgcagtt gagtgtcttc ggctctagtt agttctctag cagtgtagac gatgatgggt 4681 aatgcttcac catctggttg ctgcttaatt tgtgaaatca gctcaaaccc gctcatgtct 4741 ggcagtccta aatccagcac cacacaatca taatgctgtg agcgaatcgc cgctagtgct 4801 tctgctcccg ttccaacagc agtcgtttca acatcgctgt tacctatcag ttccacaatg 4861 ctgtaccgtt gagtggagtc atcttccacg actagcagat tctttacact gcggtccaca 4921 aaacctttta ttttcgtcaa tgcttggtgc aacgcgtcat tactaacggg cttttgcaga 4981 taggcgatcg ccccttgttg caacccccgc tgtcgcccct cctcaactgt cataatatgc 5041 actgggatat gacgggtaga agggttatgc ttgaggcgat cgagcactgt ccaaccatcg 5101 atttctggta agcggatatc tagaatgatt gctgaaggct gaaatttttg taccagcgcc 5161 aaacccgtac tgccattatg ggcgacgatg accttaaatt ctttttgtcg tgccatatct 5221 aaaaggatac gcgcaaacat gagatcatct tcgacaatca gaagtacgcg gtctcctggt 5281 tggatgttac ctctatcgtc aatcaaggga gtggggaggt cgggtgttcg cgtatcttct 5341 ttggagtttg gggtgagtga aggagtaagg gacgcagtga gaaaactttc ccaaactgtg 5401 ttgctctttc cttgtgttac tcctctttct ccttgtctac gccagacacc tgtggaggga 5461 aaccctcccg cagtgctggc tccccgtgtc cccctatcct cttgtcctcc tgcttggggt 5521 aagtaaagcg taaacgtgct cccctgacct aagcgactct caagtttgat ttcgccatcc 5581 aacagatggg cgattttacg gctaattgac aagcctagcc ctgtaccctc atatttacga 5641 ctagtgctgc catctgcctg ttgaaatgct tcaaaaataa tattttgctt ttccgaggca 5701 atacctatac ccgtatccct gacagcgaag gctataaccg tagctgcacc attcaaagtt 5761 tcctggtcgg aactccaccc ctgctttgcc acaaacatcc gtaagctgac ttctcctcgc 5821 tctgtaaatt taaaggcgtt tgctaacaga tttttcaaca cttgttgtaa gcattttaca 5881 tcggtttcaa ttgttgttgg cagttctgga gccaattcaa tagtgaaaga aagttttttg 5941 tcaatagcga cttgcctaaa tgtaccttct 
atgttgtcgc gcaattctag aaacaacatc 6001 tggttaatgt taattgacat agttccagat tcaatttttg ccaaatctaa aatgtcgtta 6061 attaacacca aaaggtcacg accggctgag tgaattgtac ggctatattc tacttgtttt 6121 gcactcaggt tcccctcaac gttatctgct aacaagttcg ccaaaatcaa taagctattg 6181 agcggtgtcc gcaattcgtg ggacatattc gccagaaatt ctgacttgta ttttgacgat 6241 agcgctagtt gctctgcttt ctcttccaat gattggcttg tttgttccag ttgctgattt 6301 ttgcgttcaa cttccttttt ttgtattgat agtagctcgg ctttttcttc tatctcggcg 6361 tttagctgct gcaattcttc gttagattgc tgcaactctt cctgctgctg cttcaagagt 6421 ttttccgatg cttgcaacgt ctcggtctgt tcttctaata tctgattact ctgcttaagt 6481 tcttcttgct gaatttgtag ctgttgcgtt aaagcttggg attgttggag gagttcttga 6541 gtgcggatat cggctgcaat tatgttcaat accacaccta tggtttgtgt aacttgctct 6601 aggaaaatca gagaaatctc tccaaagcgt tgaaaagaag ctaactcaat cactgctgtc 6661 acctgatttt caaacagcac aggtaacaca ataatattta atggtttggc ttcgcctaaa 6721 ccagaactga tatgaatgta gtcacttggg acatctgtga gtataatttt ttgtttttct 6781 agagcgcatt gtcccaccag tccttcacct aagcggaatt ggttcgacag atgcttccgc 6841 tgttggtatg catagctacc gataagcttg agcattggct gttcatcaac agagtccatg 6901 atgtagaaaa caccttgttg tgcattaacc aatggggcaa gttcacacag tatcatgcta 6961 gcgacagttt ctaaacttct ctgaccttgc agcatctggc taaatttagc cacgttagac 7021 tttaaccaat tttgttcact atttgtctta gttgtctcgc gtaaattggc aaccatctcg 7081 ttaaacgcct gcgataacac accaatttcg tcctggcgat tgttagcagt cgtactcaca 7141 gataagtctc ctattgccag cttttctgtg actttagaga gttcttggag aggtttagaa 7201 atgttcctag ttaaataaat accaattaaa gttaagagta aaaaagctag aggaatacca 7261 taagtgatag tattcattgc ttgatgggtt gctgcttgag tccgcctaga gcgctgttct 7321 aataaattcc tctcttctgt ttccaaatct aaagttaggc gacggatttc attcgtcaaa 7381 ttgttgctat tatccgtcaa tccctgttgc ctgacagcat ctaatggttg actgtcacgc 7441 aaggaaatcg agctttgcat tatcgctaat ttttgtttaa tctttggatc gagagcatca 7501 attctacgct gctgattagg gttatctacc gttaatcttc tcagggattt gagactttct 7561 tcaacagctg gaatcgcact gttgtaagat tgtagctcac ttttttctcc tgtcagcaga 7621 taacgacgtt gtgctccctc agtatctttt agctgagacc 
ttaaattatt caaatcccca 7681 agtacttggt aggtatgggt ttgccagcca gaggtttcaa taagattgtt tgtcgcctgg 7741 tatgaaacaa ctccaatagc ggcaagcatc attaacccaa atccaacgct tgcacctatt 7801 ttgttaccta ttgtcagatt attaagcatc tctcattagt tatatttaga caattgccgc 7861 tattttttgg ttaagtgttt ggttgctatt gtataaattt taagatttat aaatatgatt 7921 tctgttattg cttacaaaaa acataaaatt aagaattaat cgttgaacgt tcataataag 7981 acattatgaa tgatcactgg aagcaacttg attcttttac aagaaatagc agaatttcta 8041 gaccaattct ttgctgttga gcgctactct caagaggaaa gaggtggtgt atatttgcct 8101 tcaacacgtc ctgttagacg tttgggcttg gtactagaac cgtgtgcgca gttacaggaa 8161 tggacaaaca ctcaacattt agatgcactg ttcttacatc gtccttggaa acttgaacca 8221 gaacaactgt caccagatat cggtgttata tcttatcatt taccatttga tgagcgcttg 8281 acgttttctt ttaatcccag attggcacaa gtgttgggaa tgtctagttt agaagtgctg 8341 ggaaagaagg acggcagagc aatcggtatg attggagaga ttccgactca aagctttgca 8401 cattttggtc attgtgtcaa tcaaatcttt gctggacacg agcaggtacg tgcagcccaa 8461 agcgctgaag tcacacgagt tgctgttgtt ggtgcaatga ctgacttgct cgtgcgtgaa 8521 gcagcaactc gcggtgctaa tgtctacatt accggacagt tacggcagcc tgctgaacaa 8581 gcattgctag aaacaaaaat tggtgtcatc gccgttggtc atcgtcgtgg tgaagtatgg 8641 ggtttgcgat cgcttgctgg agtcctacgc gaacggtggt ctagcttaga ggtggttgtg 8701 cctcaccatt cataattaag ttatagcagg gaactcttaa cagggaactc ttaacaggga 8761 acagggagat cttaacaggg aacaggcttg aaactctctc agtgtctaag ttttatgttc 8821 agtttctgtg ctaaaacacc tggcgacagc tatagtatga cagttgacga ttaaccgtta 8881 tataaatcct gctcaaacct aatcaacacc tatcaaagct attcaattg // LOCUS NODE_3684_length_8927_cov_5.3066958927 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8927) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8927) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8927 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(26..1557) 
/locus_tag="DP116_24645" /pseudo CDS complement(26..1557) /locus_tag="DP116_24645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316286.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="all-trans-retinol 13,14-reductase" gene 1652..2434 /locus_tag="DP116_24650" CDS 1652..2434 /locus_tag="DP116_24650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207238.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24650" /translation="MTEPLIELKGVSKSFGNNKVLDNVDLTIYRGEALGIIGPSGTGK STILRVIAGLIALDSGDVFIKGVHREGLIEDRTDPVGIGMVFQQAALFDSLTVEENVG FSLYQYSKLRRSRIRELVHEKLEMVGLSEIGDRYPAELSGGMRKRVSFARAIMSNPDN PKDTPEVLLYDEPTAGLDPIASTVIEDLIRQLQCIQGVCSTYAIVTHQQSTIRRTADR LVFLYQGKVQWQGSVSEIETTDDPLIRQFISGSISGPIHVAG" gene 2522..4156 /locus_tag="DP116_24655" CDS 2522..4156 /locus_tag="DP116_24655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196991.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="MCE family protein" /protein_id="PRJNA477356:DP116_24655" /translation="MRSLLTSRFASARTFREGSVGLLLLLGLGVFGLVMLWLTKFNVA RSSYKAIVEFANAGGMQKGAVVRYRGVKVGTISALRPGPNNVEVEIEISKSNLIIPSN VTVDANQSGLIGESIIDITPKTQLPPGAVVGKPLEKNCNPQIIVCNGSRLKGQIGISM DELIRSTTQLATVYSDQKFYGNVNKAVENTAVAAANIAELSRNFSVLSKDLQQQLNSV SATTNTIQQATTQLSASSTKTLSQFGNTADQFSTTAKELRLTNTSVSKLINNLDTLVT SNRSSLVAALNNITETSNQLRKTVSSLSPAVNRVTQGELIKNLETLSANAAQASANLR EVSNSLNNPNNVVVLQQTLDSARVTFENTQKITSDLDELTGDPAFRQNLRQLVNGLSS LVSSTQQMQDHVQIASTLDSVKATVHNSNTAISAPKINKQQTSFSLPLTAAKIADKTF QPTTITLTNPTPSTRQQQMMFNLSPGTTKSADKTFEPTTITLTNSTPSTEQQSVDTKS ASKEPQPTSGQVTPSLAQENLLRKLREHREQEKLGE" gene complement(4425..4673) /locus_tag="DP116_24660" CDS complement(4425..4673) /locus_tag="DP116_24660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744922.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24660" /translation="MRDCEFALSVLEQFFATVTILPLSQPVLDRAVQLRQQRRMWLGD AMIAGTALAHNRTLVTRNITDFIWIAELRLLNPFDFCS" gene 4845..5483 /locus_tag="DP116_24665" CDS 4845..5483 /locus_tag="DP116_24665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015120364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I restriction endonuclease subunit R" /protein_id="PRJNA477356:DP116_24665" /translation="MTQTKAITDAITSLADAENRFGFVRIESEGFFSEWCEELPALTE ADKANLDVLRRRYLYHRTFGNLLEGTVMLLLVSPLLALAGFYDPPFRIKAEESVDVVV DDGEEILRGRIDVLVLQNRFWVMVLESKKTTLSVWSALPQALAYLMANPNPHLPVFGM VTNGDDILFVKVVQADTPQYDLSRVFAPFTSVRELYSVLQILKRFGQAVSSV" gene complement(5639..5929) /locus_tag="DP116_24670" CDS complement(5639..5929) /locus_tag="DP116_24670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875483.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF3288 domain-containing protein" /protein_id="PRJNA477356:DP116_24670" /translation="MSKANKDQQHPLYNRDRPIIENLLVGEATDYNLAELARLRIRYS SFPGARDIQSDLDKILVQWGLSEEELFEKTRALHSRGGIYKSRGKKEEEDWN" gene 6332..8788 /locus_tag="DP116_24675" CDS 6332..8788 /locus_tag="DP116_24675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310273.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease ATP-binding subunit ClpC" /protein_id="PRJNA477356:DP116_24675" /translation="MFEHFTSEAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGSGV AAKVLTDLGVTLKDARREVERIIGRGSGFVPPEIPFTPKVKSLFEQGFKEARTLGNNY IGTEHLLLGLTEAGEGVAAKVLQNLGLDLSQIRTAVIRQLGEATSVSGSRGGGSSRRT QNLTLEEFGRNLTKQAQIGNLDPVVGREKEIERAVQILGRRTKNNPVLIGEPGVGKTA IAEGLAQRIVNQDVPDILQDKQVISLDMGSVVAGTRFRGDFEERLKKIMDEIRTAGNI ILVIDEVHTLVGAGGTEGGMDAANILKPALARGELQCIGATTLDEYRQHIERDAALER RFQPIMIGEPSVEETIQILYGLRSAYEQHHKVQISDAAVVAAANLSDRYISDRFLPDK AIDLIDEAGSRVRLRNSLKSVDRELKRELALVTKQKQEAVKTQDFDKAGKLRDQELEL ETKLRDSQEDKSVNSPIVDEEDIAQIVASWTGVPVNKLTESESEMLLHLEDTLHQRLI GQEQAVTAVSKAIRRARVGLKNPNRPIASFIFSGPTGVGKTELAKALASYFFGAEDAM IRVDMSEFMERHTVSKLIGSPPGFVGYDEGGQLTEAVRRKPYTVVLFDEIEKAHPDVF NMLLQILDDGRLTDAKGRTVDFKNTLIIMTSNIGSKVIEKGGGGLGFQFSEDEAEATY NRIKMLVNEELKSYFRPEFLNRLDEIIVFTQLKKDEIKQIADIMLREVASRLTEKGIT LEVSDRFKDRVLQEGYNPSYGARPLRRAIMRLLEDSLAEAMLSGEITEGDTAIVDVDD DGQVKVNKSEKRELLLANLG" BASE COUNT 2486 a 2019 c 2047 g 2375 t ORIGIN 1 ctacacccct acaccccttt tttcactact tcaccagcat attcgcacac aaaataccag 61 acgcagctac tgcaggtaca ccaattcccg gcgtggtgct atcaccgacg cgatataatc 121 cctgtattgg tgtgtgtggt cctggaaaca tcccttgtcc cgcttgaata gccggaccat 181 aagttccctg atatcttcgc aaataataag catgggttaa gggtgtgcca ataagttcca 241 acacaacacg ctccctgata tctggaataa tttgctctaa ggcacgatat aaagtttgcg 301 ctttctgttg ctttttgtgt tcatattcgt cgttccgttc ccatcccgca tatggttcta 361 gggtgtaagc gtgaactaca tgatgtccct gtggggcaag tgttgcatcc cacacactag 421 gaattgatat catgcaagta tttcccggta ctgtgatatc cttgccactg tcgtgaacga 481 cgacatgatg tcctgtgaga ttttccaaac catctgcacg aatacctaag tgtaaatgca 541 gaaaactatc aaccgctggt gtattcaatg cagctttgcg gaaagacgag ggcaaatcct 601 cgggacgtaa cagcttggtg taagtatccc aaatactggc attggaaata acaatgggtg 661 ctttgaggat ttcacctttt gtcaactgga caccaataac tttgtgattc tcaactaaaa 721 tttgctccac atgatgtccc aaaagcaact gaccgcccca acgttccaac ccccgcacca 781 aagccttgac aatagctgca ctgccaccca caggatactc aactccggcg 
cgggagcgtt 841 cacctaacat aaaagctacc tctggagcaa tagtgccatg tgcctttaaa ccagatagta 901 aaaagcattc taagtcgatg agccgccgta tccaggggtc ttgtactgtt gcatccatga 961 catgacccac agatgcttga acaagaggta aatgaggtag catctttaac aaagagggta 1021 agtaacgtcc gattaacacc ggaattactt gccaatctgc tcgtaaagct agtgtaggaa 1081 tacctttcat cgcatcgtac aatggtaaca agcgttcctc aaagcgcttg aattccttcg 1141 ctccttgagg cgtaattttc gttaattctt cacggtaacg cgactgattg ctataaacag 1201 gtaaactcgc ttcaggaaaa tggtacagtc caagggggtc gtaaggtaca gattctaaag 1261 attcgcccaa aacgtcaaga acttgtttga cgggattcaa acttccggca cttgtcagac 1321 cacaatagaa tgagggaccg gaatcaaatt caaatccccg tcgtctgaag gtatgagcag 1381 cacccccagg tattgtatgg ctttcacaga caattactcg cttacccttc gggttcgcca 1441 gtcgcctacg gagggaaacc ctcctgcagc gctggactta ccgtaacgag caagtaatgc 1501 agccgctgtt aacccgccaa taccactacc gataacaatg acatcactat cctgcatttt 1561 tgttatgtgt ccgctgcttt gcaaagtcca gggtcaagag ttcaaaattt agagtaactt 1621 attttaacta atgactaatg actaatgact aatgactgaa cctttgattg aactaaaagg 1681 cgtttctaaa tcctttggta acaataaggt tttagataac gtggatttga cgatttaccg 1741 aggcgaagca ttaggaatta taggtcctag tggtactggt aaatcaacaa ttttacgggt 1801 gattgctggg ttaattgctc ttgattctgg agacgttttt attaaagggg tgcaccgaga 1861 aggattgatc gaggatagga cagatccagt tggtattggt atggtgtttc agcaagcggc 1921 gttgtttgat tctttaacgg tggaggagaa tgtgggtttt tcactttatc aatactcaaa 1981 actgcggcgt tctcggattc gagaactcgt tcatgaaaaa ttggagatgg tggggttatc 2041 ggaaataggc gatcgctacc cagccgaact ctctggaggt atgcgaaaac gagtcagctt 2101 tgctcgtgcg attatgtcta accccgataa ccccaaagac actccagaag tgttactcta 2161 cgatgaacca acagccggac ttgatcccat tgcttcgact gtgatcgaag atttgattcg 2221 gcagttgcaa tgtatacagg gagtttgtag tacctacgct attgtcactc accaacaaag 2281 tactattcgt cgcacagctg acagattggt atttctttac caaggtaaag tgcaatggca 2341 aggtagtgtc agtgaaatag aaactacaga cgatcctttg attcgacaat ttattagtgg 2401 aagcatttct ggaccaattc atgttgctgg ttagggtaat tttggatttg ggattttaga 2461 tagaagaaaa acctacaacc tcaaatctaa aatctaaaat ccataggtga ggaaaaataa 
2521 aatgcgaagt ttattaacaa gccgcttcgc ctctgcgcga acttttcgag aaggttctgt 2581 tgggttgttg cttttgttgg gactgggagt gtttgggtta gtcatgttgt ggctaactaa 2641 attcaatgtt gctcgcagtt catacaaagc tattgttgaa tttgcaaatg caggcggtat 2701 gcaaaaaggg gcagtggttc ggtatcgtgg tgtcaaggta ggcacgattt ctgctcttcg 2761 acctggacca aacaacgttg aggtggaaat tgaaattagc aaatccaact tgattattcc 2821 tagcaatgtc acagttgatg ctaatcaatc tggattgatt ggcgaaagca ttattgacat 2881 cacaccaaaa acgcagctac ctccaggggc tgtggtaggt aaaccgttag aaaaaaattg 2941 caatcctcaa attatcgtct gtaatggttc tcgattaaaa ggtcagattg ggatcagtat 3001 ggatgaactg attcgctcaa caactcaatt agcaactgta tacagtgacc agaaattcta 3061 tggaaatgtt aataaggctg tagaaaatac tgctgttgct gcggcaaata ttgctgagtt 3121 aagtcgtaac ttcagtgttt taagcaaaga ccttcaacaa caactgaact ccgtttcggc 3181 gacgactaat acaattcaac aagcaacaac tcaactcagt gcatctagta ccaaaactct 3241 gagtcaattt ggtaatactg cagaccaatt tagtaccact gctaaagaac ttcgtttaac 3301 aaacacatct gttagtaagc tgattaataa tcttgatact ctagttacta gcaaccgctc 3361 ttcacttgtt gcagctttaa acaatatcac cgaaactagc aatcaactac gtaaaacagt 3421 tagcagtctt tcacctgcgg ttaatcgtgt cacccaagga gaattaatca aaaatttaga 3481 aactttgtct gcaaatgctg cccaggcttc tgctaatttg cgcgaagttt ccaacagcct 3541 caacaatcct aataacgtgg tggttctgca acaaacttta gattctgcac gagtcacatt 3601 tgaaaacacc caaaagatta catctgattt ggatgaattg acaggtgatc cagcttttcg 3661 acaaaatctg cgccaactgg tgaatggttt aagtagcttg gtttcctcca ctcagcagat 3721 gcaggatcac gtacaaattg ctagtacttt agactcagtc aaagctactg ttcataattc 3781 taatactgca atttccgcac ccaagataaa taaacagcag acgtcattta gtctcccatt 3841 gactgctgcg aagattgctg acaaaacgtt tcaacctact actattacct tgacgaatcc 3901 caccccaagt actcgtcaac agcagatgat gttcaatctc tcacctggta cgactaaaag 3961 tgctgacaaa acgtttgagc ctactactat tactttgacg aattccactc caagcactga 4021 gcaacaatca gttgatacta agagtgctag taaggaacct caacccacta gtggacaggt 4081 gactccctct ttagctcaag aaaacctttt gaggaaattg agggaacatc gtgaacagga 4141 gaagttgggt gagtagagaa taggacaggt aatggtaggt tggggagcca gcgcgaatga 4201 
cggctttcca acgccagatg ctccacttgg ggagcatctg gcgttgcgga gagcagcgcc 4261 ttgcgggggt tccccccgtt gtggcgactg gctgtcgggt tcctcctttg ggtgccacct 4321 ggcgtgaaac cccaagaccg cactggctcc tccgtaggcg tctggcgttt aagcgtagcg 4381 caacccaaca catatcagtt aggtgaagca tgatatgaca aacattatga gcagaaatca 4441 aatggattga gtaatcgaag ttcagcaatc caaataaaat cggttatatt tcgagtcacc 4501 aaggttcggt tatgcgccag tgcagtaccc gcaatcattg catctcctaa ccacatcctg 4561 cgctgttgtc gaagctgcac tgctctgtct aaaacaggtt gtgacaaagg cagaatcgtg 4621 actgttgcaa agaactgttc caacaccgaa agcgcgaatt cgcaatctct aattgctgat 4681 actcccttcc cactacaaaa ttacctatct tcttttttct tacttcgtgt ccttcgcgtc 4741 tagccttcgg caacgccaag ggcgaacgcg gttcgttaaa taggtattct tctgacggga 4801 agggagtaaa ctaaagcttt tcccagaact tggaggttag cagcatgaca cagacaaaag 4861 ccatcaccga tgcgattacc agtctggcgg acgcagaaaa ccgctttggg ttcgtccgca 4921 ttgagtccga gggttttttt tcggagtggt gcgaggaatt accagccctt acggaagccg 4981 acaaagcgaa tcttgatgtc ctgcgacgca ggtatcttta ccaccgaact tttgggaatt 5041 tactggaggg gacggttatg ctgttgctgg tttccccgtt actggcactt gctgggtttt 5101 acgatccgcc gttccgcatc aaggcagaag aatcagtgga tgtggtggtg gatgatggtg 5161 aggagatttt gcgtggacgg attgatgtgc tggtgctgca aaatcggttt tgggtgatgg 5221 tgctggaatc taaaaaaacc acgctgtcgg tgtggtcggc attaccgcaa gccctggcgt 5281 atctgatggc gaacccaaac cctcatctgc ccgtctttgg tatggtgacg aatggcgatg 5341 atattttgtt tgtcaaggtg gtacaggcgg atacgccaca gtacgatttg tcacgggttt 5401 tcgcgccgtt tacttctgtt agggaactgt acagcgtttt gcaaatcctc aagcggtttg 5461 gtcaggcagt ttcctctgtg tgattaccca cagtaccata agtcatctaa aaagaacgaa 5521 aacggttttt ccaattcttg ccagctaatc acgggaaaat ctcacaacca aacataaaaa 5581 tgtatttccc ctacacccct atacccttac acccttacac ccctttttca agctagtact 5641 aattccaatc ttcttcttct tttttcccgc gacttttata aataccccca cggctgtgta 5701 aggcgcgtgt tttctcaaaa agttcttctt cgctcaaacc ccactgcacc agaattttgt 5761 ccaaatcgct ttgaatatct ctcgcgccag gaaaactgga atagcgaatt cgcaatctag 5821 ctaattctgc taaattataa tcagtcgcct caccaaccag taagttctca atgatggggc 5881 gatcgcgatt 
gtacaacggg tgctgttggt ctttatttgc tttactcatc ttgattctag 5941 gttgtcaatg gagagtcgta agtgaagcgc tgttcattgt gacataaact accgctaatc 6001 gccaacgact gacttcaata attatccgtc tattcgcttg tattcaagct cttgacgggt 6061 aacacgccta aactcttacg gctctcacta ctgtcagtcg actttgaata aagatacatt 6121 atccataaga aagtacacaa aagttcaggt gagggtgaat catatagtga tgacaatgaa 6181 agtgcttctt gtatgctgag tgatcacaac cgagtgctaa gtataaataa catagcaccc 6241 cccattcacc aactcagaac tcattaggac ttcattctct atactactga aatcacccat 6301 agcccaagcc attcgggagc acaaacccaa tatgtttgaa cacttcacat ccgaagccat 6361 taaagttatt atgctagccc aggaggaggc gcgtcgcctg ggacacaatt tcgtagggac 6421 ggagcaaatt ctcctcgggc tgattggaga aggatcaggg gttgctgcca aagtgctgac 6481 cgacttgggc gttaccctta aagatgcacg tcgcgaagtc gaaagaatta taggtcgggg 6541 ttctggcttt gtacccccgg aaattccttt tacccccaag gtgaaaagtt tgtttgagca 6601 aggctttaag gaagctcgca ccctgggaaa taattatata ggtactgaac acttactctt 6661 gggattgact gaagctggtg aaggtgtcgc cgctaaagtc ctgcaaaatt tagggcttga 6721 tttgtcgcaa attcgcacag cagtgattcg tcagttgggt gaagcgacat cggtttccgg 6781 ttctcgtggc ggtggttctt caaggcgtac ccaaaattta actttagagg aatttggtag 6841 aaatcttacc aaacaagcgc aaatcggcaa tctcgacccc gttgtcggtc gcgagaagga 6901 aattgagcgt gctgtccaaa ttttgggacg ccgcactaag aataatcctg tgctgattgg 6961 ggaaccaggt gttggtaaaa cagcgatcgc cgaaggtctt gctcaacgca ttgtaaacca 7021 agatgtcccc gacattctac aagacaagca agtcatcagc ctcgacatgg ggtcagtggt 7081 ggctggtact cgcttccgtg gcgattttga agaacgcctc aagaaaatca tggatgaaat 7141 ccgcacagcg ggtaatatca tcctggtgat agatgaagtt cacactttgg ttggtgctgg 7201 cggcacagaa ggtggtatgg atgcagccaa catcctcaag cccgcattgg cacgaggtga 7261 actccagtgt attggcgcaa ccactcttga tgagtaccgt caacacattg agcgcgatgc 7321 agctctagag cgtcgtttcc aaccgattat gatcggtgaa ccatcggttg aagaaaccat 7381 ccagatatta tacggcttgc gctctgctta cgaacagcat cacaaagtcc aaatctctga 7441 tgcagcagtc gttgctgcag ctaacttgtc agaccgctac attagcgatc gcttcttacc 7501 tgataaagcc attgacttga ttgatgaagc aggaagccgt gttcgtctac ggaactctct 7561 caagtcggtt gatcgcgaac 
tcaagcgcga attggcgcta gtgactaaac aaaaacagga 7621 agctgtcaaa actcaggact ttgacaaagc tggtaaactg cgcgatcaag agttggaact 7681 cgaaacaaaa ttgcgcgatt cacaagaaga taaatctgtc aatagcccga ttgttgatga 7741 ggaagatatt gctcagattg tcgcttcttg gactggtgtc ccagtcaaca agttgactga 7801 atctgaatca gagatgctgt tgcacttaga agacactctt caccagcgtc tcatcggtca 7861 agagcaagca gtgaccgcag tttctaaagc cattcgtcgc gccagagttg ggttaaagaa 7921 tcctaaccga ccgattgcaa gctttatctt ctctggtccc acaggagttg gtaagacaga 7981 actagccaaa gcattggctt cttacttctt tggtgctgaa gatgcaatga ttcgcgtgga 8041 tatgtccgaa ttcatggaac gccacaccgt ttctaagctg attggttcac ctcctggttt 8101 cgttggatac gacgaaggcg gacaactgac tgaagcagta cgtcgcaaac cttatacggt 8161 agtgctattc gacgaaatcg aaaaagcgca tcccgatgtc ttcaatatgc tactgcaaat 8221 cttggatgac ggtcgtctta ctgatgcgaa aggtcggact gtggacttca agaacacgct 8281 gattatcatg acctctaaca tcggttctaa ggtgattgaa aaaggtggcg gtggtttagg 8341 gttccagttt agcgaagatg aagctgaggc gacttacaac cggattaaaa tgctggtcaa 8401 cgaagaactg aaatcatact tccgtccaga attccttaac cgtcttgatg agattattgt 8461 cttcactcaa cttaagaaag atgaaatcaa gcaaattgct gatatcatgt tgcgcgaagt 8521 cgcaagccgc ttgactgaga agggaataac tctggaagtg agcgatcgct tcaaagaccg 8581 tgtcttacaa gaaggctaca accccagcta cggcgcaaga ccgttacgtc gggcgattat 8641 gcgcctctta gaagattctc tggctgaagc gatgctctca ggtgagatta cagaaggtga 8701 cacagctatt gtggatgtgg acgatgacgg tcaagtcaaa gtcaacaagt ccgagaagcg 8761 agaactgtta ctggcaaatc ttggctaaga ttcggtaaaa tgtcgctttt tgattgatag 8821 cgctttttaa tagtagggtg ggcatgggta atgtccacct tagttttttg gggcgatagc 8881 gtaagcgcaa gcgcacgccc agagggcgtt agcgcagcgt ctccgca // LOCUS NODE_3699_length_8877_cov_4.5562238877 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 8877) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8877) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8877 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..654 /locus_tag="DP116_24680" CDS <1..654 /locus_tag="DP116_24680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uroporphyrinogen-III synthase" /protein_id="PRJNA477356:DP116_24680" /translation="IAHISDFHWLILTSTNGVGYFFERLFAQGKDARALAKIKIAVVG EKTAQRLNQHSLQPDFIPPNFVADSLVENFPEELTGKKVLFPRVESGGREILVQQFTA KGAEVIEVAAYQSCCPKSVSPSAELALESGTVDVITFASSKTVQFFYQLAQKIFFEDS QNNQSLVTDALNGVCIASIGPQTSKTCRSIFGRVDVEAEEYTLDGLTQALIKWSTNS" gene 878..1468 /locus_tag="DP116_24685" CDS 878..1468 /locus_tag="DP116_24685" /EC_number="2.7.1.24" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995922.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dephospho-CoA kinase" /protein_id="PRJNA477356:DP116_24685" /translation="MIQRIIGLTGGIATGKTTVANYLASAYNLPILDADIYAREAVSV GSPILQEIAQRYGEEILLTDGNLNRQKLGEIIFNKQEERIWVEGLIHPYVGDRFLKEI ALSPAQTLVLVIPLLFEAGMTDLVTEIWVVSCSEQQQLQRLMQRNHLTLDQAQARIKS QMSIAEKVARADVVLDNSSTLEVLLKQIDTAISKIL" gene complement(1490..2344) /locus_tag="DP116_24690" CDS complement(1490..2344) /locus_tag="DP116_24690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865451.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="deoxyribodipyrimidine photo-lyase" /protein_id="PRJNA477356:DP116_24690" /translation="MTKDMRRDFANRDELVAYLREQFPKATERDNHISKTVGGRKAAV EALQKVDPARYAKTRNFFTGAVTRLSPYIRYGVLSLREIRDDVLGRVKHQDDATKLVN ELGWRDYWQRLYVKLGDRIWKDEEEYKTGYTIAEYAPKLPDDIKQGTTGRVCIDSFSQ DLRETGYLHNHARMWMAAYIIHWRRIRWQAGAKWFLEHLLDGDPASNNMSWQWVASTF SHKPYFFNRENLERYTEGVYCRKCPLYGKCDFEGSYEELETRLFPKGEFTKQPNSQSW QKGKKGKR" gene 2550..4070 /locus_tag="DP116_24695" CDS 2550..4070 /locus_tag="DP116_24695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015184261.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24695" /translation="MSLCINPHCQKPENPDNILFCQNCGSELLLEGRYRAMRRLGGGG FGVTYEVNEVRSNIPKVLKVLINNQPKAVELFQREAEVLALLNHPGIPKVESNNYFVY FPRNSQQPLHCLVMEKVEGLDLYEYMRQRDHRPINQKLAVQWLTEIVTILQQVHSQNF FHRDIKPPNIMLRADGHLVLIDFGTARALTQTYWSAQSQGNVTGVVSAGYTPTEQMSG QAVPQSDFFALGRTFVYLLTGKEPTDPAIYDSYNNESRWRSHTPNILPTLADLIDRMM AHLPSQRPANTQEILQRLAAIDQALNPPQRPFTSTPPVNSPGVPPTQPVRTPQPTSSQ PVSGSKPPVKFQLTPIEKRLWNQWVFANVVGMMTFGLLLPITQWLVLRRRIRRAGWWL LASIVYLYPLQRFFVTGRVDQYQLSCALLIELITQWLVLRRQVLHAGWWISAYIVGAV SGFLVGMIASVAIYSVVRITEPIVVYVVGTAADGATYGAITGRELIRLLRHPVSQP" gene complement(4094..4648) /locus_tag="DP116_24700" CDS complement(4094..4648) /locus_tag="DP116_24700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310315.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_24700" /translation="MTLTIQDLERLQEKLQEDQRDYQLELQEGNIIVMGPSDIESSEI GAEFIYLLKTWVNPRKLGRVFDSSGGFIMPNTDLRAPDVSFVSAQRLKRTVRDFANLV PDLVVEIKSKTDRVRPIEEKIQLFLQLGAQVGILINPDKRNVSVYRRVGEVELLAGED KLIIPELFPGWEIAISELWPPVFE" gene 4774..5400 /locus_tag="DP116_24705" CDS 4774..5400 /locus_tag="DP116_24705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865929.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="restriction endonuclease subunit R" /protein_id="PRJNA477356:DP116_24705" /translation="MIQVIQAQNVTLGYLEERFGLQQAENEDFFTEWFDILPDITDLE KQYLDRVKFHFLRLVKRPPLLEEAVKLVVLSPLLSLAGFYDDPFFIKSEQSIEISLED EGEIVRGRIDVLVIQQQLWLLIIESKRASFSLLEAIPQALAYMLANPQLEKPLFGLVM NGSDFIFLKLTRVNQPEYALSDQFTLLRRENELYKVLSILKKLGTILI" gene 5406..6572 /locus_tag="DP116_24710" CDS 5406..6572 /locus_tag="DP116_24710" /EC_number="2.8.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874145.1" /note="catalyzes the removal of elemental sulfur from cysteine to produce alanine; involved in NAD biosynthesis; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IscS subfamily cysteine desulfurase" /protein_id="PRJNA477356:DP116_24710" /translation="MSSRPIYLDSHATTPVDERVLAAMLPFFTEHFGNPSSINHLYGW EAEAAVKQTREILATAINATPEEIVFTSGATEANNLAIKGVAEAYFQKGQHIITVATE HSAVLDPCKYLKTLGFEITILPVQKDGLIDLTELEKAFRPETILVSVMAANNEIGVLQ PLAEIGARCRERNVLFHTDAAQAIGKIPLDVQAMKIDLMSLTAHKIYGAKGIGALYVR RRNPRVQLAPQQHGGGHERGMRSGTLYTPQIVGFGKAVEIALEEQATENQRLTQLRQR LWEKLSQLEGIHLNGHPTQRLPGNLSISVEGVDGSALLLGLQPVMAVSSGSACSSATT APSHVLMALGHSEQQAYASIRFGIGRFNTIEEIDKVAEHTISTIQSLRKQALRV" gene 6950..7768 /locus_tag="DP116_24715" CDS 6950..7768 /locus_tag="DP116_24715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745907.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="vancomycin resistance protein" /protein_id="PRJNA477356:DP116_24715" /translation="MNQLPQSLKALIRQKLKDTKALWQGYAFHHAYIQDTENSSYCYQ WSEITTPIKQRSGFPEVNENRLWNMQLAAKDIHGLILNPGQIFDFWNRVARPTVANGF REGPTLLGNRLMTDVGGGLCQISTTLFQALLWADCDILERYNHSIDAHGETRFFTLGQ DATVAYGYKNLITRNNSQIPLQLRLQVLGEKAVVVASVWSTEPLPVEVKITSTVLEKL PAPSAHDMCGWRVETIRYVRLKECPNTEWQTNYHTLDVYKPHVKLHQNLSPIPV" gene 7728..8771 /locus_tag="DP116_24720" CDS 7728..8771 /locus_tag="DP116_24720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012163059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Ldh family oxidoreductase" /protein_id="PRJNA477356:DP116_24720" /translation="MSSYIKISAPSLYSFLDEVLSPYQLLGEVAEALKTHLVEANLCG MDSHGLQQIIGYVKSLQSGRINPQPQLRVNSERPTMIRIDGDRTPGQYAGQVAMDTAL AAARQFGMAVVGVSNSNHFGMAGYYTRMAAEAGMIGFATSDTNGVDLAPYGGRKAKLG NNPISWGIPTGTPQPIILDMAAGTVSGGKVKHFGYQGLPIPLGWGLTEAGEPTQNPKQ VAVNLPASYKGSGLAFVADLLCGPLLGTAAAMFKTKAIHDSANGTGHFFWVLDVEAWT NREEFEERVQSAIASLKQTPRLDVNQPIYYPGELEAITREERLLTGIPIPQALIDDLA VYFGEDSVSGLSG" BASE COUNT 2523 a 1912 c 1914 g 2528 t ORIGIN 1 atcgcccaca tatccgactt ccactggtta attctcactt ccactaatgg tgtaggctac 61 ttctttgaaa gactcttcgc acagggtaaa gatgctcgtg ctttagctaa aatcaaaatt 121 gccgttgttg gtgaaaaaac ggctcaacgt ctcaatcaac acagtctaca accagacttt 181 attcctccta actttgttgc agattctttg gtggaaaact ttcctgaaga actcacaggt 241 aaaaaagttt tgtttcccag agtcgaaagc ggcggacggg aaattttagt ccaacaattc 301 accgcaaaag gcgcagaagt gatagaagta gcggcttatc aatcttgctg tcccaaaagt 361 gtatctcctt cagcagagtt agcccttgaa agtggaactg tagatgttat tacctttgcg 421 agttccaaaa ctgtacaatt tttctatcaa cttgcacaaa agatattctt tgaagactct 481 cagaacaatc aatctttggt aactgatgca ttaaatggag tttgtattgc ttccattggt 541 cctcaaacct ctaaaacttg tcgttccata tttggtcgtg tagatgtaga agctgaggaa 601 tataccttag atggattgac tcaagcactg ataaaatggt caacaaattc ataatcataa 661 ctgcggatgt acaagctttg cacagagata atttatctgc gtccaaaact cgtgcatctg 721 cggtttcata ttcaaaaaac taaaaataac tcacatcaag ttcacctaat tacttacaat 781 aaaaaacctc acccccatcc cctctcctta ctaaggagag gggtgcccgg agggcggggt 841 gaggttcttt ttttaatagt catccggacg tgatatcatg attcaacgca tcatcgggtt 901 aactggaggt attgcgacag gcaaaacgac tgttgctaat tatttggcta gtgcttacaa 961 cttgcccatt ttagatgcag atatctatgc cagagaagct gtatctgtcg gttcgcccat 1021 tttacaagaa attgctcaac gctatggaga agaaatttta ctcacagacg gtaacctcaa 1081 ccgtcaaaag ctgggtgaga ttatttttaa caagcaagag gaacgcatct gggtagaggg 1141 tttaattcat ccttatgtgg gcgatcgctt tctcaaagaa attgccctat cacctgcaca 1201 aacattggtg ttagttatcc ctttgctatt tgaagcagga atgaccgatt tagtcacaga 1261 
aatttgggta gtcagttgtt ctgaacaaca gcaactgcaa agattgatgc agcgaaatca 1321 cttaacttta gaccaagcac aagcccgtat caaaagtcaa atgtcgatag cagaaaaagt 1381 agcccgtgcg gatgttgtgt tggataattc ttccactctt gaagtgctgc tgaaacagat 1441 agatacagcc atttcaaaaa ttttgtagct gtgacaactg cttttagttt taccttttac 1501 ctttctttcc tttttgccaa ctttggctat taggttgttt cgtaaattcc cctttaggaa 1561 acagccgcgt ttctagttct tcataacttc cttcaaagtc acacttacca tataaagggc 1621 attttcgaca ataaacgcct tcggtgtagc gttccaggtt ttcgcggttg aaaaaataag 1681 gtttgtggct aaaagtgctg gctacccact gccatgacat attattgctt gcaggatcac 1741 catctagaag atgttccaga aaccattttg cccccgcttg ccaacgaatg cgtcgccaat 1801 ggatgatata agctgccatc cacattcgtg catggttgtg caagtatcca gtttctcgta 1861 aatcctgact gaagctgtcg atgcacactc gtcctgtggt gccttgtttg atatcatcag 1921 gtaattttgg ggcatattca gcgatagtat atccagtttt gtattcctct tcgtctttcc 1981 agatgcgatc gcctagcttc acataaagcc tttgccaata gtcacgccaa cccaactcat 2041 tcacaagttt agttgcatca tcttggtgtt tgacgcgccc aaggacatca tcccgaattt 2101 ctcgtaagct gagaacacca tagcgaatat aaggagataa tcgcgttacc gcacctgtaa 2161 aaaagttacg tgttttcgcg tagcgtgcag ggtctacttt ttgtagtgcc tcaacagcag 2221 ctttgcgtcc acccacagtt ttgctgatgt ggttatcgcg ttctgtggct ttgggaaatt 2281 gttcgcggag gtaggctacc aactcatcgc ggttggcaaa gtcgcgtcgc atatctttag 2341 tcattgtgta tcaaagttgc aggttaagat tgccgaacga caaaagtcgt ggctagacaa 2401 acgaatcttg tggaggcagg ctagtctctt attatatcga attcatactt aaaatcagca 2461 acgccagatt tatttcaggt tgtgtatcag gcagcagctt attttctcct gtaggttatg 2521 atgtttgcat atttactgtt gccaaaccaa tgagcctgtg cataaatcct cactgccaga 2581 agcctgagaa tcctgacaac atcttgtttt gtcaaaattg tggctcagaa ctgttgctag 2641 aaggacgcta tcgggcgatg cgtcggctag gaggtggtgg ctttggtgtg acatatgagg 2701 tgaacgaggt acgaagcaac attcctaaag ttctcaaagt tctgatcaac aatcaaccca 2761 aagcggtaga actatttcag cgggaagcag aagttcttgc tctgttgaat catccaggaa 2821 ttcccaaagt ggaatcaaac aactactttg tctattttcc tagaaatagt caacagcccc 2881 tgcattgtct ggtgatggag aaagtagagg ggttggactt gtatgaatac atgagacagc 2941 gagatcatcg 
ccctataaat caaaagttag cagtgcaatg gttgactgag attgtcacta 3001 tcctgcaaca agtccacagc cagaactttt ttcatcggga tatcaagcca cctaacatta 3061 tgcttagggc agatgggcat ctagtgctaa ttgattttgg cacagcacgg gcacttacac 3121 aaacctactg gtcggcacag tctcaaggta atgttacagg agttgtttcc gcaggctata 3181 ctccaacaga gcaaatgagt ggtcaagctg taccacagtc tgattttttt gctttgggac 3241 gcacgtttgt atatttgctc acaggaaaag aaccaactga cccagcaatt tatgactctt 3301 acaataatga gtcacgctgg cgtagtcata ctcctaatat cttgccaact cttgctgatt 3361 tgattgaccg aatgatggcg catttaccca gtcagcgacc tgctaatact caggagattt 3421 tacaacggtt ggcagcgatc gaccaagctt tgaatccgcc tcagcgacct ttcacatcaa 3481 ctccaccagt aaattctcct ggagttccac ctacgcagcc agtaagaaca cctcaaccaa 3541 catccagcca accagtaagt ggatctaaac cacctgtgaa atttcagctt acacccatcg 3601 aaaaacgctt atggaatcag tgggtttttg ctaatgttgt gggtatgatg acctttggct 3661 tattgcttcc aatcacgcaa tggttggtac tacgccggag aatccggagg gctggctggt 3721 ggttgttggc aagtattgta tatctatatc cgctccaacg gttttttgtg acaggtcgtg 3781 tagatcaata tcagctctct tgtgcgctcc tcatcgaatt aatcacgcaa tggttggtgc 3841 tgaggcgaca ggttttgcat gctggttggt ggatatcggc atatattgta ggcgctgtct 3901 cgggctttct tgtggggatg attgcaagtg ttgctatata ctcagtggta cgcattacag 3961 agccgatagt cgtttatgtt gttgggactg ctgcagatgg tgctacttat ggagcaatta 4021 caggacgtga gctaattcga ctcttacgac atccagtttc acaaccttaa agataacagt 4081 tctaccgttg atattactca aacacgggag gccatagttc agaaatagcg atttcccaac 4141 caggaaataa ctctgggata attagcttat cctcacccgc cagcagttcc acttcaccaa 4201 ctcgacgata tacactaaca tttcgcttat ctggattaat cagtatccca acttgcgccc 4261 ccaattgcaa aaatagctga atcttctctt caattggacg aacacggtct gttttcgatt 4321 taatttccac aaccaaatcc ggaaccaagt tagcaaagtc ccgtacagta cgtttcagtc 4381 tttgtgctga tacaaaagaa acatctggtg ctcgtaagtc agtatttggc atgatgaagc 4441 caccactgga atcaaacacc cgtccaagtt tgcgggggtt aacccaggtt ttgagtaagt 4501 aaataaactc agccccaatt tcactcgatt caatatctga tggtcccatg acgatgatgt 4561 taccttcttg tagttctagc tggtagtcgc gctggtcttc ttgtagtttt tcttgcagtc 4621 tttctaaatc ctggatggtg 
agagtcatag aaacctcaac agtgagtctg ctcttatggc 4681 tatcttactc gtcaatgact aaacgtacaa ttttgccaga ataggatatg acatcagagt 4741 ttaacattaa gataataacg agtataaaaa actatgattc aagtcatcca agcacaaaac 4801 gtcacccttg gctatttaga agaaagattt gggctacaac aggctgagaa tgaagatttt 4861 tttacagaat ggtttgatat tttgccagat attacggatt tagaaaagca atatctagac 4921 agagtaaagt ttcattttct tcgtttagtc aaacgtcctc ctttattgga agaagcagtg 4981 aaattagtgg tgttatcccc cttactcagt ctggctggat tttatgatga tccttttttt 5041 attaaaagtg aacaatctat agagatttcc ttggaggacg agggagaaat tgtacgtgga 5101 cgtattgatg tgttagttat ccaacaacaa ttatggttat tgataattga atctaaaaga 5161 gctagttttt ctcttctaga agctattccc caagcactcg cttatatgct agctaatcct 5221 caattagaaa aacctctatt tgggttagtc atgaatggta gtgattttat ttttctgaaa 5281 cttactaggg taaatcagcc agagtatgct ttgtctgacc aatttacact tttgagacgc 5341 gagaatgaat tatataaggt tctcagtata ttaaagaaac tgggtacaat tttaatttga 5401 taattatgtc tagtcgccct atctacctcg actctcacgc taccacacct gtagatgaac 5461 gagtactagc agcaatgcta ccctttttta cagaacactt tggtaacccc tccagcatca 5521 atcaccttta tggctgggaa gcagaagctg ctgttaagca aacacgagaa attttagcaa 5581 cagccataaa cgccactcca gaagaaattg tgttcaccag tggtgcgaca gaagcgaata 5641 atttagctat caaaggtgtt gcagaagcct attttcaaaa aggacagcat attattactg 5701 ttgcaacaga acatagtgca gttcttgacc cttgcaagta tttaaaaact cttggttttg 5761 aaatcacaat ccttccagtc caaaaagatg gactgattga tttaacagag ttagaaaaag 5821 ctttccgtcc tgagacaatt ctcgtttcgg tgatggctgc gaataatgaa attggggttt 5881 tgcagccatt ggcagaaatt ggtgcaaggt gtcgagaacg caatgtcctt ttccacaccg 5941 acgccgccca agctattggt aaaatccctc ttgatgtgca agcaatgaaa attgacctaa 6001 tgtcgctaac agcacataaa atctacggtg ccaagggtat tggtgcatta tatgtccgtc 6061 gccgcaatcc cagagtacaa cttgctcctc agcagcatgg tggtggacac gaacggggta 6121 tgcgttctgg cactttgtat acaccgcaaa tcgtaggatt tggtaaggcg gttgagattg 6181 ctttagaaga acaagcaaca gaaaaccaac gcctcacaca gttaagacaa agattgtggg 6241 aaaaactttc tcaattagaa ggaattcatc tgaacggaca tcccacccag cgactcccag 6301 gaaatttaag tatcagtgtt gaaggagtag 
acggatctgc actattgctg ggattacaac 6361 cagtgatggc ggtgtcttct ggttctgctt gttcctcagc aacgactgca ccttcccatg 6421 ttctcatggc attgggacac tcagaacaac aggcttatgc gtcgatacgc tttggtatag 6481 gacgcttcaa cacaattgag gaaattgaca aagttgcaga acatactatt tctacgattc 6541 aaagtttacg caagcaggct ttgagggtgt aaattgaatc gtatcagatg caagaaacgc 6601 agaaactata gcggttctga tttgaatcac atacatcacc cataatgtag agacgtagca 6661 tgtgagtcca gcgctgcggt ccttgtttcc ctccggagcc acttctgtgc gggggttccc 6721 cccgttgaag aatgtggcgt caggcgactg gcgaacccgg agggctacgt ctctacaact 6781 gtatattgca cccaacacca gaagtgctat agattttatc tgcgtgcaat actccgtgta 6841 tctgcggttt catatcaaag ttcacgtaaa taaactttac ttgcaatgcg taacagttaa 6901 ctgctaaaag gtattactta ataactgaaa attcagattt aaaaacagca tgaatcaatt 6961 accccaaagc ctaaaagcct taattcgcca aaaacttaaa gatactaaag cgctttggca 7021 aggctatgca tttcaccatg cttacataca agacacagaa aactccagct actgctacca 7081 atggagtgaa ataaccactc caattaaaca gcgttctggt tttcctgaag tcaacgaaaa 7141 tcgcctctgg aatatgcaac tggcggcgaa ggatattcat ggactcatct taaatcctgg 7201 gcaaattttc gacttttgga atcgtgttgc gcgtcctact gttgcgaatg gctttcgaga 7261 gggtcccact ttgctaggaa atcgcctcat gactgatgtc ggaggtggat tatgtcaaat 7321 ttctacgact ctgttccaag cactcctttg ggcagattgt gatattctag aacgctataa 7381 ccactcgatt gatgctcatg gagaaacgcg ctttttcact cttggtcaag atgcaacggt 7441 ggcttatggc tataaaaatc tcatcacaag gaacaatagt caaattccct tgcagttacg 7501 cttgcaggtt ttaggtgaaa aggctgtggt tgtagcaagt gtctggagta ctgaaccatt 7561 acctgttgaa gttaaaatca cctcaacggt tttagaaaaa ctccctgcac ctagcgcaca 7621 cgatatgtgt ggttggcggg ttgaaacaat tcgttatgtg cgtctaaaag aatgtccgaa 7681 tacagaatgg caaactaact accatacact tgatgtctac aaaccccatg tcaagctaca 7741 tcaaaatctc agccccatcc ctgtatagtt ttttagatga agttctctca ccctaccagc 7801 tattaggaga agtcgcagag gctttgaaga cgcatctggt agaagcaaac ctttgtggaa 7861 tggattctca cggtttgcaa caaattatcg gttatgtcaa aagtctacaa agtgggcgaa 7921 tcaatcctca gcctcagttg cgcgtgaatt cagaacgccc gactatgatt cggattgatg 7981 gcgatcgcac tcccggacag tatgccgggc aagtagcaat 
ggatacagcc cttgccgctg 8041 cacgtcagtt tggtatggca gttgttggcg tcagcaacag caatcacttt ggtatggcag 8101 gttactatac ccgcatggct gctgaggcag gaatgattgg ttttgctacc agtgatacga 8161 atggtgtaga ccttgctccc tatgggggaa gaaaagcaaa acttggtaac aatccgattt 8221 cctggggaat tccgacaggg acaccacaac ccatcatttt ggatatggca gccggaacgg 8281 tgagtggggg taaagtcaaa cactttggtt atcagggttt accgataccc ttgggttggg 8341 gactcacgga agcaggagaa ccaacccaga accccaagca ggtggcggtg aatttgccag 8401 cttcttataa aggttctggg ttggcgttcg tcgcagattt attatgcggt ccacttttgg 8461 gtaccgcagc agccatgttt aaaactaaag caattcatga tagtgctaac ggtacaggac 8521 atttcttttg ggtactagat gtagaggctt ggacgaatcg agaggaattt gaggaacggg 8581 tgcaaagtgc gatcgcctcc cttaaacaaa ccccccgcct tgatgtcaac caaccgattt 8641 attatccagg agaactagaa gccatcactc gcgaagaacg tctattaacg ggtattccca 8701 tacctcaagc tttgattgat gatttggcag tttattttgg ggaagatagt gtttctgggt 8761 taagtggtta gatttcaaaa cagttatcag ttaccagtta tcagttatca gttatcagtt 8821 accagtttga agaagaattc agcaattctg aattcagtat tcagaaggaa gaaaaaa // LOCUS NODE_3732_length_8773_cov_4.8573078773 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8773) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8773) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8773 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..549 /locus_tag="DP116_24725" CDS <1..549 /locus_tag="DP116_24725" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24725" /translation="DKTSYHLQDKRIQAVLAMKPFSSGIFGQKGLRQIQVPVMLIAGS NDVVTPAVVEQICPFSWLSSSDKYLVLMQNGTHVFDNQEFASRSFPIPGQLAHPHPAL ARRYLKAMSLAFAKTYVAGQSQYREYLNPSYIKALSQPPLPLFLVRSLTTTQLSQTQN LSCPGSQNFSSPEKPKIQNSRF" gene 611..1684 /locus_tag="DP116_24730" CDS 611..1684 /locus_tag="DP116_24730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009784901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AI-2E family transporter" /protein_id="PRJNA477356:DP116_24730" /translation="MLNIISNHSNQKLPRWLNLGLAFPIIFLNTWLIFLVCQYLQPIV TILVTASLIAFLLDYPIVFLEQRGVKRSWAVGLVLMLTLLLLGVLGFVLGPLVFQQLV EFGNRLPAWIEAGRQQLQTLNEQSMLHILPIDFNEITAEVTSQLSGTVQSLTTQIINV ILDTINSAFNLLLTVVVTIFLVLYGKSLWDEIFNWFFSSWNHQIQSSLKQSFQGYFAG QAITASILSVVLILAFLALEVPFGLLFGLGIGVASLIPFGGFVSITLISLLVAFQNVW LGVKVLVAAVILGQINENIVAPRLIGNITGLNPAWVLICLLIGSKLAGVLGLLVAVPT ASFVKKIAFTLNNSTSDFEMKLP" gene 1776..2522 /locus_tag="DP116_24735" CDS 1776..2522 /locus_tag="DP116_24735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865801.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24735" /translation="MQTLDSSQSKPSPTTEPVIYVHNLNHYFGSGDLRKQALFDINLD IYPGDIIIMTGPSGSGKTTLLTLMGGLRSAQEGSLKILGQEICGASKKQLTQIRCNIG YIFQAHNLMTFLTAKENVRMSMELHDEFLDQDMDAKAVAMLESVGLGHRADYYPENMS GGQKQRVAIARALVSQPKIILADEPTAALDKKSGRDVVELMQKLAKEQSCTILLVTHD NRILDIADRIVYMEDGRLIKDGIDAAAKMS" gene complement(2990..3115) /locus_tag="DP116_24740" CDS complement(2990..3115) /locus_tag="DP116_24740" /inference="COORDINATES: protein motif:HMM:PF00355.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24740" /translation="MALGRVIDGCIECPYHGFRYASEGHCTLIPCEGKKAKISMK" gene complement(3125..3271) /locus_tag="DP116_24745" CDS complement(3125..3271) /locus_tag="DP116_24745" /inference="COORDINATES: protein motif:HMM:PF00355.24" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24745" /translation="MFLQKWWPIAESKSIRKKPVGLRRLDEDLVLWRNNNGQIVCQSS QCAH" gene 4432..5238 /locus_tag="DP116_24750" CDS 4432..5238 /locus_tag="DP116_24750" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24750" /translation="MPQDHIRNIFLQQELKMLTATQTPLVSYQAMQKAARIIKYVYQF YFPLHGLSADDICTYYPILTSVEATIYQADLIMEQGQSLNLTHSPNDDASSLKLLKYS LINLLKELNCYDSVIEQELVRGEEFIQLENKIMVGGLIKHSDVMRVAELRSSDVRLLH LILFRLLGKPYDEKLLSLIWPVEVIADIEDDFRHYAADVAENSYNTYRMFVTLYQDKA PQYIKAELEHYENLFQDKIATFADDEKQRLMAIYSQFRLNHFSAIPEPIL" gene 5328..6356 /locus_tag="DP116_24755" CDS 5328..6356 /locus_tag="DP116_24755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009633723.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fatty acid desaturase" /protein_id="PRJNA477356:DP116_24755" /translation="MLTKTFSTKDIVVSYYINQEKPLWNVIAIFYVLSGYCGGLALLV LHNIWLNILGTMLLAHSLILSAYFSHEFMHGTIFRSMRWNTVGGNIMLWLNGGCYARF KDMAQEHIFHHVKKLDSVVFDLPAFINNLPAPIRGLILALEWLYFPVISFILQFWALT APFWNPKRRDERLRVIILLIVRSSLFTLLGLVSLKALVLYFVAYTGMVTVLRFMDAFQ HTYEAYPVGLPFPKRNDVYEQANTFTTLISRRYWWLNLLVLNFGYHNAHHALMKCSWH SLHELDRDLFARQQTQYLTLPQLLKNYHRFRIARIFLGQGTAVDKQGNLEFDNFYGAV GVSFLVKA" gene 6515..6982 /locus_tag="DP116_24760" CDS 6515..6982 /locus_tag="DP116_24760" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24760" /translation="MMDYISKAFLVWFLGFFPASEIFVAVPSGIALGLDYCSTVVWSV TGSYIAILLIHYGYEFFSQIPQVKTWLDHFSSQRFKNWIDIYGIGFVLLITPLLGVWV MSVTMKVFKMDSGRFLVYSFISVVISAVALTALTYTGIDLATNGYKMITSIAG" gene 7386..8773 /gene="dxs" /locus_tag="DP116_24765" /pseudo CDS 7386..8773 /gene="dxs" /locus_tag="DP116_24765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494906.1" /note="too many ambiguous residues; incomplete; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose-5-phosphate synthase" assembly_gap 8451..8522 /estimated_length=72 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 2473 a 1779 c 1820 g 2629 t 72 others ORIGIN 1 gataaaacaa gctatcactt gcaagacaag cgcattcaag cagttttagc catgaaacct 61 ttcagtagtg gcattttcgg gcaaaaaggt ctgcgtcaga ttcaggttcc tgtaatgctg 121 attgcaggta gcaatgatgt tgtcacacca gcagttgtgg aacaaatctg cccctttagc 181 tggctttcta gttctgataa gtacttggtt ttgatgcaaa acggaaccca cgtttttgac 241 aaccaagaat ttgcgagtcg tagtttccct attcctggtc agttagctca cccccaccca 301 gccctcgccc gccgctatct taaagcaatg agtttggctt ttgcaaaaac ttatgtcgct 361 ggtcaatcac agtatcgtga ataccttaac ccatcctaca tcaaagcatt gagtcaacct 421 cctctgccgc tttttcttgt acgttctctc actacgactc agttatctca aacacaaaat 481 ctttcttgtc caggctcaca aaacttctca tctccagaga agcctaaaat tcaaaattca 541 agattttgaa ttattaatat gtctcatctt gctattgact caaacaagca aactgaattt 601 ttaacaactt atgttaaaca taatttcaaa tcattcaaat caaaaacttc ctcgttggtt 661 aaatttagga ttggctttcc cgatcatttt tttaaacact tggctgatat tcttggtttg 721 ccaataccta cagcctattg tcactatttt ggttacagcc agcttaattg cttttttatt 781 agactaccct attgtgtttc tagaacagcg aggagtgaaa cgcagttggg cagttgggct 841 agttctgatg ttgacgctct tgctgttggg agttctggga tttgtcctag gtcctcttgt 901 ttttcaacag ttagtagaat ttggcaaccg cttaccagct tggattgaag ctggaaggca 961 acaattgcaa accttgaatg agcaatcaat gttacacata ttacctattg atttcaatga 1021 aatcacggct gaggtaacta gtcaactctc tggtacagtg cagtctttga cgactcaaat 1081 tattaacgtc attctcgata ccattaacag tgcctttaat cttctgttga ctgtggttgt 1141 gacaattttt ctggttttat atgggaaaag cctttgggat gaaattttta actggttttt 1201 ttctagctgg aatcaccaaa ttcagtcatc tctcaagcaa agttttcagg gctactttgc 1261 gggtcaagcg attacggcgt ctatcctcag cgtagttttg atacttgctt ttctggctct 1321 agaggtgccc tttggactgc tgtttggact gggaattggg gtggcgagcc taataccatt 1381 tggtggtttt gtgagtatca cattaataag cttgttagtt gcgtttcaaa atgtttggct 1441 aggggtcaag gtgttggtag ctgccgttat cctcggtcaa atcaacgaga 
atatagtggc 1501 tcctcgtctc ataggaaaca ttaccggact caatcctgcc tgggtactca tttgtttact 1561 cattggctca aagctcgcag gagtattagg tttacttgta gcggtaccta cagccagttt 1621 cgttaaaaaa atagccttta ctctaaataa ttccacttcg gattttgaaa tgaaattgcc 1681 ttaagatatg tcatttgttc gtgaagctgt tgactgttaa cagtgaaaaa ataacagtct 1741 ttataatcac ttaagtactt aaaaacgtaa atcttatgca aaccctcgac tcttctcaat 1801 ctaagccatc tccaacgaca gaacctgtta tttatgtcca taatctcaac cattactttg 1861 gtagcggtga ccttcgcaaa caagcattat tcgacattaa cttggacatt tacccaggtg 1921 acattatcat tatgactgga ccctcaggtt caggaaaaac gacactatta accctgatgg 1981 gtggactgcg gtctgctcaa gaaggtagtt taaagatttt agggcaagaa atctgcggcg 2041 cgagcaagaa gcagttgacg cagatccgct gcaatattgg ttatattttc caagcacata 2101 acttaatgac gttcttgaca gctaaggaaa atgtgcggat gtcaatggag ttgcatgatg 2161 agtttttgga tcaggatatg gatgccaaag cagttgctat gctcgaaagt gttggtttgg 2221 gacatcgtgc cgattactac ccggagaata tgtcgggtgg acagaagcaa cgggtcgcta 2281 ttgcccgtgc tctggtttcc caacctaaaa ttattttggc agacgaaccc actgctgcgc 2341 tggataaaaa atctgggcgt gatgttgtgg aattgatgca gaagctagcg aaagaacaaa 2401 gctgtacaat tttgctggtg acccatgaca accgcatcct tgatattgcc gatcgcattg 2461 tctacatgga agatggtcgt ctgattaaag atggtataga tgctgctgcc aagatgagtt 2521 agacctcacc atgcgatcgg gttgtttcta acagttgatc gactgaatct agagcggtca 2581 actgttaact gtcaggcgtt acctggagca aatctactgt actcccatat ttcaaacaaa 2641 tacgttgaac tatagatatc aaaaaagtgt atgaagcgga aggtaatttt ggcagttttt 2701 gctacctcaa gtgttgagtt gggacagcga aagttcacaa aacacagaga cacagagcag 2761 acctcccgaa cctcaaattt ttcttctgta agttttacgc atttagtatg caccctagtt 2821 gaccacggtt ttgaagaagc gtcgctccgt tcgttgctga agcgactaag tttaatcaat 2881 ttgtagagga cttgcgccac ttgcactgct taatcttttc aagtggaaat acacaggttt 2941 tggtcttctt gttttactga tgggttattt ttgctcagat gcactagttt catttcatgc 3001 tgattttagc ttttttccct tcacagggaa taagagtgca atgcccttcc gaggcataac 3061 ggaaaccatg ataggggcat tcaatacacc catcaatcac ccgacctaat gccaggttga 3121 ctcctcaatg agcacactgg ctactttgac agacaatttg tccattgttg tttcgccaca 
3181 gtactaaatc ctcatcgagt cgtctgagtc caactggttt tttacgtata ctttttgatt 3241 cagcaattgg ccaccacttt tgtaaaaaca tgattttctc gtgcgcttag caaaacaaac 3301 tgctttatgt atcggtcaag agtactgaat tgtaagattt cacttgagtt gtaccataac 3361 tgataaagtg cagccactgg taatacaaaa gtacgctcaa gataaaatcg gaatatacaa 3421 agctttgatt gtgaagaata agtttgattg gacataaatg ttgcatctgt tgctgacaca 3481 aaaaatatgg cggctaaaaa ggcgcacaat gtaccaatcc actataaaaa tgagattttc 3541 ttcattaaag tttgaatatt tagtacttta gacagcgagt cagtaaatac cactgggcat 3601 tcgtgacaat ttaaaacaac gttgcctttg tcaattagca attcggctta aaatcgggta 3661 aaagatgtat cagtaaatac tgaattttcc gggcaagagt gaatcatgcg tctaaagatt 3721 gcgcgataag cctccggctt gacgctgtat cgcatgattg cgtgcactct gcgcaacagc 3781 cgtggcgtta gccataggtg ggtgccccgc accatggtgt gcagtgccct ttgggcgatc 3841 gcttccccaa aaattatgtt ttctaccacg atttccgaaa aatttttaag tagcatgact 3901 tattgacaaa cgcgccataa ccttaagttg tcactaatga ccactgaatt gtaagataca 3961 gtgagttttg aagacaaaga gcaatcagag tacaagcatc gtgtcctaga tatgcttatg 4021 gactaagatg cgtttaattt agattgctgt ggtgcgtaca ataggcattt aagattgaga 4081 tggtcttaag ttgatgccta ataaaatcaa gtaaagacat ttcacctgtc attttttctg 4141 tgattaagca gcttagggaa aatccgtgcc tttttgctgc ttgtgcaaaa tacaggacaa 4201 aagtgacgcc aacaagaggc aagtatttag gcagaaatta caggaaagct gagccagata 4261 aacttcaagt ctgattttgg gatgaacgtg ggtttattag tgctaaagct attagacata 4321 tccgcacaaa aaggtgaaac cgatagagaa aaaccttttt ggtaattttc ggatgagtct 4381 attaaacgca atagttttgt ggaaaaaggc acctgaaaaa atcatcagta agtgccacaa 4441 gaccacatac gtaatatttt tctacaacag gagttaaaaa tgctaactgc cactcaaaca 4501 ccccttgttt cttatcaagc aatgcagaaa gctgctcgca tcattaagta tgtttatcag 4561 ttttactttc cacttcatgg tctctctgcg gatgatattt gcacttatta ccctattctg 4621 acatccgtag aagcaacaat ttatcaagct gatttgatta tggaacaagg gcaaagcttg 4681 aatctgacac atagccctaa tgacgatgct agtagtttaa aactgttgaa atacagttta 4741 atcaatcttt tgaaagaact gaattgttat gattcagtga ttgagcaaga attagtaaga 4801 ggagaagaat ttatccaact agaaaacaag ataatggttg gaggattaat aaaacattca 4861 
gatgtcatgc gtgttgcaga attacgatct tctgatgttc gccttcttca tctcatttta 4921 tttcgtctgc tcggtaaacc ctatgatgaa aagcttttgt ctctcatttg gccagtagaa 4981 gttattgctg atattgagga cgatttcaga cattacgctg ctgacgttgc tgaaaatagc 5041 tacaacactt atcgtatgtt tgtgacactt taccaagata aagcccctca gtacatcaag 5101 gcagaattag agcattacga aaatcttttt caggataaaa tagcgacttt tgctgatgat 5161 gaaaaacaaa gattaatggc aatctactct caatttcgcc ttaatcattt ttcagctatc 5221 cctgaaccaa ttttgtagtc atttagctga ttctgtgacc aagaatgata gtgttttttt 5281 aaaacacttg ttgtgaggaa actaattctc ttaccggagc agtcattatg ttgacgaaaa 5341 ctttttcgac aaaagacatc gtcgtttcct attacatcaa tcaagagaag cccttgtgga 5401 atgtcattgc tatcttctac gttttgagtg gctactgcgg tgggctagca ctcttggtat 5461 tgcataacat ctggctgaac attttaggaa cgatgttact cgcccacagc ttgattctgt 5521 cagcctattt ttcccacgaa ttcatgcacg gcacgatttt cagaagtatg cggtggaata 5581 ccgttggtgg caacattatg ctgtggctga atggcggttg ttatgccaga tttaaagata 5641 tggcgcaaga gcatatcttt catcatgtta aaaaattgga ttctgtggtt tttgacttgc 5701 ctgcctttat taacaatctt ccagcaccaa tacgcggctt gattctagcg ctagagtggc 5761 tatattttcc ggtgatttct ttcatacttc agttttgggc attgactgct cctttttgga 5821 atccaaagcg tcgagatgaa aggctgcgtg tcattatcct gcttatagtg cgttcttcgt 5881 tgtttaccct actgggatta gtgtcactca aagcattagt gctttacttt gttgcataca 5941 ccggcatggt taccgtattg cggttcatgg atgcttttca acacacttat gaggcatatc 6001 ctgtaggctt accgtttccc aagcgcaatg acgtctacga acaagccaat acctttacaa 6061 ccttgatttc tcggcgatac tggtggttga acttgctagt gttgaacttt ggctaccata 6121 acgcccacca cgcactcatg aaatgctctt ggcacagtct tcatgaacta gacagggatt 6181 tattcgcaag acaacaaact cagtatctga cactgccaca attactcaaa aactatcacc 6241 gctttcggat tgctcgtatt ttcttaggtc aaggtacagc cgtagataag cagggtaatt 6301 tggaattcga taatttctat ggtgctgttg gtgtgtcttt tctggtgaag gcttaatagt 6361 ttgttgcaaa aacttgtgct tctctaaaaa cctcgtgaat gctgagtatt gataggccaa 6421 gagtcaacgt tcattaacca gaaaaagttg ttgactaaat attgacttag ttagtttatt 6481 ttcatctaac tagtcaactg aaaaagggtc aaagatgatg gattacattt caaaggcatt 6541 tttggtttgg 
ttcctcggat tttttccagc ctcagaaatt tttgtagctg tcccctcggg 6601 catcgctctt ggtctggatt actgttcaac tgtagtttgg tctgtcacag gtagttacat 6661 agcaattctg ctaatacact atggttatga gttttttagt caaataccac aagtcaaaac 6721 gtggctcgac catttttcat cccaacggtt taaaaattgg atagatattt atggaatcgg 6781 attcgtcctg ctaataactc cattacttgg tgtgtgggtt atgagtgtaa caatgaaagt 6841 tttcaaaatg gattccggac gattcttggt ttactctttc atcagcgtgg ttatatctgc 6901 tgtggcgcta acagcactta catacaccgg aatagacttg gctacaaatg gctacaaaat 6961 gataactagt atagctggct aaaatgtcag gctttcccca aaaaaggaga aatttgtttc 7021 ttggatgaca acgagaagaa aatgagataa gatgaacgtg aacttgataa tttgaccaca 7081 gaaaccaact aaaaaattaa ggtttcatac atagtcagta ttttgcgttc aaaacagtca 7141 ttggtagaag gaatacaggt agatagggat agttagaggc agtaaaaagt ggttgctcca 7201 tccaatgaat ctaccagaca aagaacaaca agaacatttg attgactgtt gagtgaatac 7261 caactcaatt gcaaatcaac ccaactcctt tgttgaataa agtcacactt aattgggttg 7321 atatgatgcc cccctttgac cagtttcaag catttgaaat cacttgcctt ttacaggaga 7381 tatatatgca tctgaaagat ttgactcatc ctaaccagtt gcacggtctg tcaattgaac 7441 aactccagca aatcgctgat cagattcggc aaaagcactt agaaacaata gcagcgagtg 7501 gtggtcacct tggtcctgga ttgggtgtgg tagaactcac gctagcgctt taccaaactc 7561 tggatttgga atgcgataaa gtcatttggg atgtgggtca ccaagcctat ccgcataaac 7621 taattactgg tcgctacaat tcctttcaca cactacggca aagaagcggt attgcaggat 7681 atcttaagcg ctgtgaaagc aagttcgacc actttggtgc aggacatgct tccacaagta 7741 tctctgctgc tttaggtatg gcgatcgccc gagatttgaa aggcgaaaca tttaaagtcg 7801 cagccgtgat tggcgatggt tctcttactg gtgggatggc actggaagcg attaaccatg 7861 caggtcactt gcccaacact aacctgctgg tagttcttaa cgacaatgaa atgtctatct 7921 ccccgaacgt tggtgcgctt tctcgctatc tcaacaagat gcgcttgaat ccatcagtgc 7981 agcatttatc agacaacctc aaggaacaaa tcaagcatct accctttgtg ggtgattcca 8041 tttctccaga acttggacgt cttaaggaag ggatgaaacg tttggcggtg accaaagaag 8101 gagcggtgtt cgaggaactg ggctttacct acatcggacc tgtggatggt cataatctcg 8161 aggaactgat tgctacgttt gaacgggcac atcagataac aggaccagtt ttggtgcatg 8221 tggcaactgt gaagggcaaa 
ggctacgaat ttgccgaaaa agaccaagta ggctaccacg 8281 cccaaacccc ctttaatctc gacaccggca aagccatccc ttccagcaaa cccaagcctc 8341 ctgcttatac taaagttttt gctcacactt tggtgaaact cgccgaacaa aaccccaaaa 8401 tcgtcggtct cactgctgcc atgtcaacgg ggacaggttt agatctgtct nnnnnnnnnn 8461 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8521 nnggtttaga taagctgcaa gaaaagctgc ccaatcagta tattgatgtc ggcattgcgg 8581 aacaacacgg cgttacccta gcagctggct tagcctgtga gggtatgcgc ccagtcgtag 8641 caatatactc caccttcttg caacgcgctt tcgaccaaat tattcatgat gtctgcattc 8701 aaaacttgcc tgtcttcttc tgcatggaca gggcgggaat ggcgttgctg aatggaagta 8761 tgaatcaggc gat // LOCUS NODE_3740_length_8758_cov_5.4863848758 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8758) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8758) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8758 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..273 /locus_tag="DP116_24770" CDS <1..273 /locus_tag="DP116_24770" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24770" /translation="GFGGSQSTGTAKTATPSPPSGRLQVGRPDGRCSTWGNPKTALPP QRAGSPRPHCLLMTLTGMPAFAQRANASSLGVGDPPAGLDSLMTND" gene 285..3341 /locus_tag="DP116_24775" CDS 285..3341 /locus_tag="DP116_24775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879163.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_24775" /translation="MSQDKELEIQMQFLEEATDYLNTLEGVLLELNTNSRISLEKINA ALRAAHSIKGGAGMMGFRALSDLAHRLEDSFKVLKTRKNSLEINTELQSLLLSGVDWL RQIVELLSEGNVIDEQWLSTFCYPVFEELHARLGDPTPEDAATMLSPEDGQDIIPLLF ETEVEGCLQRLESVLADSEQPCLKEEVAIMAAELGGLGEMLQLPAFTQLCESVANLLE QAASSEIETVARVALEAWRRSQALVMTNQLETLPTEIQLEDLAPQAAAYTQTQLLQVE VAPPLFPETEVAETDILQPEETQETWLDREIITADFEALEAAFADESNAQVQVPPNPV VSQEIPTTNYKFVEQKAEQTVSNNNKSEPQENTVRVPSKQLEQINDLFGELIIQRNGL NLQLERLRKLIRNLNQRVQVLGRENQQLRTAYDRIATQSVLSSSVPFLALPSRQNVED FIGFGDENQTQSGFDSLEMDSYNELNLLSQEVMETIVQVQEVTSDIQLSVDDTDLFAR KLTKTSKQLQRKLTQVRMRPLSDIVDRFPRALRDLCVEYGKNVQLKIEGAGVLIERNI LEALNEPLMHMLRNAFDHGIEDPATRRACGKSEQGLIEIKGTHQDNRTIITLRDDGRG IPLDKIRARAIAMGLEPSLIAQATDEELLSLIFEPGFSTSDQVTALSGRGVGMDVVRS NLKQVRGDVKVDTVAGKGTTYTISVPFTLSVARVLLIESNRMLLAFPTDAVSEIFLLN HEQVFTMATSEVLNWQGNMLPLIRLGQHLEFNCPRYDNPNLETPPAINASSILIVNQG NNQLVAVQVDRCWGEQEVAIRRVEGNIPLPSGFSNCTILGDGRVVPLVNANELFYWIV TNERTPRTNQLPSPRLKTAFLTPADDQSLLPSNQKGTVLIVDDSINVRRFLALTLEKG GYQVEQAKDGQDALEKLQGGLRVQAVICDIEMPRVDGYGFLGRIKSNTDFRDIPVAML TSRSSDKHRQLAMQLGATAYFSKPYNEQELLKTLDEIIFPLAGASN" gene 4254..4610 /locus_tag="DP116_24780" CDS 4254..4610 /locus_tag="DP116_24780" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24780" /translation="MNLLVLKILAIIFLIVGEILTIYAEMMAARNFSINSNFFGMFFQ AFLTVTLAGLFVISGYMLGFKAFKNIWLITAFSITSILLVEPILAYIVFRQIPTKGAT LGLFLGTLGMLATIFE" gene 5117..6139 /locus_tag="DP116_24785" CDS 5117..6139 /locus_tag="DP116_24785" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24785" /translation="MIAKLFKIQQTPFQAEPPLYLKAGKKFFSLIKEPGETEFIHQID YIRVPGKTNDFLIYRSLNDRNIRPEIIVHTKSTLASQHENDASEILAKKTGFWSYKDI LSPEALASVKLPSPEVATKEHIQEILAYDKVTEGDSLRFFNYVASTSLDMPKEFWPVM VLWLCDEMNHHAGFKTAYHKLFGVAPVMEAALAVEESDFGQFDSIMKEPFKLLVALTY DEASTINGYKHDLAVYKIIGQPFTNFLRRVNADESWHFSKFANLAAKYFPERIAEVPK ILEEVKSLDGRPYKRTFLFDHDPSVEAQFTKAAQDKACDIVLKVLTKKMDWWRSQQNG CRLNSY" gene 6555..>8758 /locus_tag="DP116_24790" CDS 6555..>8758 /locus_tag="DP116_24790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495725.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chemotaxis protein" /protein_id="PRJNA477356:DP116_24790" /translation="MTKNNRISYPVAETKITTDRKVVTSLSQWFYNLPISRKQLIALI ASELVSILGIGIGATLIITNGLRTQLFEQAKSEVAVLDTNYNIKINQMGFGFRGQSDN TAIIKASVIYDSMKTISPDLKAEVKRILENEIKARQIEYATLVGKDFKIIVNANSDRE GDVFNPNNLVSEVFKNPKQIKASRIVSWSEINKESPPLPNTFSNHDALIRYTVTPVRD QNTKAVVGALISGDIVNGKQPIVKNTLEATGGGYSAVYLRKSTGEFALATSVAQGESK DFNQALSNVELPKKETESLLKEAASSTEGTVVTRRMVVGNETYTIAAKAVPNKIVEEA DGLSAVFDGQSSAILVRGTPEKFFNHLLLQSLLEQTLTIVVALIIIAIWAVILRRTII KPIENLQQAAQKFAAGDRSSRAEIFATDEVGQLAINFNIMAERTLEQIRYQEDETKLA LQLNEITTAMRESLDTEKILKAAVSSTRKAIRAERVLFYRLDENYQGTVIAKSVNYEG SATLRTSITEPFHPEEHFQNYKTSEVKVVENIYEADLSQQYLKQLESFAVKAYLLAPI FLNKKLYGLLIVHECSDYRKWQDIEITLFKQVAIQVGYALEQSELLQHVEQSCQIAET ASIQERQQKEALQMQLLKLLSEVEGAARGDLTVRAEVKQGEIGTVADFFNCIVESLRL IVSKVQVSALQVNQAIWTNSEAIGELANSALIQAQEINRTLDAVDHMSQSIQ" BASE COUNT 2674 a 1766 c 1911 g 2407 t ORIGIN 1 gggttcgggg gttcgcaatc gacgggaacc gccaagactg cgaccccctc accgccctcc 61 gggcgtctac aagtcgggag acccgacggc agatgctcca cttggggaaa ccccaagacc 121 gcactgcctc cccaacgcgc tggctcccca agaccgcact gcctcctaat gacccttacg 181 ggtatgcctg ccttcgccca gagggctaac gccagttccc tgggcgtggg agacccgcct 241 gcaggactgg actcactaat gactaatgac tgatagcaaa tattatgtca caagacaaag 301 aattagaaat ccagatgcag tttctggagg aagcgactga ttacctgaat accctggaag 361 gggtattgct ggaacttaat accaacagcc gtatctctct agaaaaaatc aatgccgcac 421 ttcgagctgc ccactcaatt aaaggtggcg caggcatgat gggatttcgt gccctgagtg 481 atttggctca ccgtctggaa gattccttta aagttttaaa aactagaaaa aactctttag 541 aaattaatac ggagttgcaa agtttattgc tatctggagt agactggcta cgtcagatag 601 tggaattgtt atcagaaggc aatgttatag atgagcagtg gttatcgact ttctgttatc 661 cagtttttga agaactgcat gcgcgtttgg gtgatccaac cccagaggat gcagcaacta 721 tgctgtctcc agaggatggg caagacatca ttcctttact gtttgaaaca gaggtagaag 781 gatgtttgca acgcttagaa tccgtgttgg cagatagcga acagccctgt ttaaaagaag 841 aagttgcgat tatggcagct gagttaggtg gtttgggtga aatgctgcaa ttaccagctt 901 ttactcaact ttgcgaatca gtggcaaacc 
tgcttgagca agctgcttcc tctgagattg 961 aaactgttgc gcgtgtggca ttggaagcat ggcggcgatc gcaagcgttg gtgatgacaa 1021 atcaactaga gaccttacca acagaaattc agctagagga tttggcgcct caagccgcag 1081 catatactca aacacaactg ctacaagtag aagtagcacc acccctgttt ccagaaacag 1141 aagttgccga aacagacatt ctgcaaccag aagaaacaca agaaacttgg ttggatcgtg 1201 aaatcattac agctgatttt gaagctttag aggcagcttt tgcggatgag agtaacgctc 1261 aagttcaagt gccaccaaat ccagttgttt ctcaagaaat tccgacaaca aattacaaat 1321 ttgttgaaca gaaagccgaa caaactgtta gtaacaacaa taagagtgaa cctcaggaaa 1381 atactgttcg agttcctagc aagcaacttg agcaaattaa tgatttgttt ggggaactga 1441 ttattcagcg caacggattg aatctacaac ttgagaggtt acgcaaactg attcgtaatt 1501 tgaaccagcg agtgcaagtt cttggcagag aaaatcagca attacgtaca gcttacgata 1561 ggatagcaac tcagagtgtg ctgtcatcta gtgttccatt cctggcattg ccctcacgtc 1621 aaaacgtgga ggatttcatc gggtttggcg atgagaacca gacacaaagc gggtttgatt 1681 ctttagaaat ggatagctat aacgaattaa acctgctttc tcaggaagtg atggaaacta 1741 tcgttcaggt acaagaagtc accagtgaca ttcaactcag tgttgacgat acagatttat 1801 ttgcccgcaa actcaccaaa acatccaagc agttgcaaag aaagctgact caagtacgga 1861 tgcgtccgct atctgatatt gttgatcgct ttcctagggc tttgcgcgac ctatgtgtag 1921 agtacggcaa aaacgtccaa ctgaaaattg agggtgctgg tgtcttaatt gaacgtaaca 1981 tcttggaggc tttaaatgag cctttgatgc atatgttgcg aaatgccttc gatcatggta 2041 tagaagaccc agcaacacgc cgcgcctgcg gtaagtcaga acaaggatta attgagatta 2101 aaggcacgca tcaagacaat cgcacaatca ttactctcag agatgatgga cgtggaattc 2161 cactagacaa aatccgtgcc cgtgctatag ctatggggtt agaacctagt ctcatagcgc 2221 aagctactga tgaagaactg ctctcgctca tttttgagcc aggatttagc acctctgacc 2281 aagtcacagc tttgtccggt cgaggtgtcg gtatggatgt ggttcgcagt aacctcaaac 2341 aagtacgagg ggatgtcaaa gttgatactg tagcaggaaa ggggacaact tatacgatat 2401 cagtgccgtt tacactgtca gtcgcaagag tcttactgat agaaagcaac cgaatgcttt 2461 tagcatttcc cacagatgcg gtttcagaaa tattcttgct gaatcacgag caagtgttca 2521 caatggcaac cagcgaagtc cttaattggc aaggaaacat gttaccgttg attcgtctgg 2581 gtcagcattt agagtttaat tgcccccgct acgacaaccc 
aaacctagaa actccccctg 2641 caattaatgc ctctagcata ctcatcgtca atcaaggcaa taatcagctc gtggcggtac 2701 aagtagatcg ttgttggggt gagcaagaag ttgctatccg tcgagttgaa ggaaatatac 2761 ctttacccag tggctttagc aactgtacga ttctgggtga tggtcgggta gtcccactag 2821 tgaatgccaa cgagttattt tattggattg ttaccaacga acgcacgccc agaacgaatc 2881 aactaccatc gccaaggtta aagaccgctt tcctgacgcc agccgacgac caatcactgc 2941 tgccaagtaa tcaaaaaggt acggttttaa ttgtagatga ctcaattaat gtccggcgtt 3001 tcttagccct cactctagag aaaggagggt atcaagtaga acaagcgaaa gatggtcaag 3061 atgcgctcga aaaacttcag ggtggtttga gagttcaggc tgttatttgt gatattgaaa 3121 tgcctcgtgt tgatggttat ggctttttag gtcgtatcaa atcaaatact gactttagag 3181 atataccagt cgctatgcta acctctcgca gtagcgataa acatcgtcaa ttggcaatgc 3241 aactaggtgc tactgcctac ttctccaaac cttacaatga gcaagaatta ctgaaaactt 3301 tggatgaaat catttttcct ctggcaggag cttctaacta gagaatcatg ctgaaagtta 3361 taaattaagc ttgtcaagag ttcacacgag acagtgcgtt gcgagcagtt tgttggcagg 3421 ggagccacta ccaagaaact gctctgccga tttgtcgcaa gtggcgtatc aagctcagca 3481 gacgactgcg aggttccact gatattgggg caaccgttcc taacctctaa tatagcgttt 3541 atggagttgc ttgcaataca tggctctccg tgaaagtgca tgcactttca cattataggg 3601 attaaacccc gtccttaagg acggggtctt tccttattta tccgtggtgg tgtatgtgat 3661 ttaaatgaga accgctatat ctcgtgcgac tcttccctcc agtaagaatg acacattaga 3721 acgcaactta gtataaaatt ttttttattt tggcagtcga acaagcggat tttttcttgc 3781 cactataact gaaagacatg tttagtaagg actttggcag ctagtatcgt tcagctttct 3841 tgtagagtga ggctctctct atctaaaggc tgatggcatt ttttgatacc aaaagatata 3901 aaactagaat gtgacataag cagcaaatat tcagcaaata tacagagcaa gctttcaaat 3961 gataaatctg tgttctggca ctccagaaca cggaggtttg cactggtgta gtgttgtgac 4021 ctttttgcaa cacctttttc aaactcacgc ccgtaattgg atgtgaagtt atcaagttgg 4081 cattggttgt atattcagtg tcacaattga cggcagctaa atttatctaa ttaccaccca 4141 ttgactgaag aggagtctag atggcaattg atttaagaat tatacaaaat agatgaaatc 4201 atgactatac agttagtatg tgctagattg cataccattt ttgggaaaac aaaatgaatt 4261 tgttagtctt gaaaatccta gcaattatct ttttaattgt cggagaaata 
ttgaccatct 4321 atgctgaaat gatggcggct agaaatttct caattaattc caattttttt ggaatgtttt 4381 tccaagcttt tctaacagtt accttggcag gattatttgt aatttcaggt tacatgcttg 4441 gattcaaggc tttcaaaaat atttggttga taactgcttt ttctataact tccattttgt 4501 tagtcgagcc tattctcgct tatatagttt tccgtcaaat accaacgaaa ggagcaacgc 4561 ttggattatt tttaggaacg ttaggaatgc tagctaccat ttttgaatga taatttattc 4621 acaattcagc tattttatga agtacacaag ctattcgagg aactgactag taagtataca 4681 atactgaaga actttaccgc tgtgatagtg atggcaaaat tggtcgattt attgacctta 4741 agcagcggga tatcaagttt attttacaac tgcacatcat aaagctgaga aaactctact 4801 atctcaggat gttccttcag ctcacttaac gtttgtggaa tcgagagcga ctactgcaaa 4861 cttacgagca gagcttgaat ccgtgtaagc tgttgcgcgt atatattggg ctttttttcg 4921 ttacggcaag tcaaaactca agccgaaaaa ttgagctagt gctcgactac agttccacag 4981 gctacaggag ccagtgcgcc cttgtgggaa tgccccttac aaataagtat tcaattttaa 5041 ggcgttacag cttatttatt tttgtgagca gggattggac tggcaatgca agcgatagag 5101 ttattgatgg agcataatga tagcaaaatt atttaaaatt caacaaactc cttttcaagc 5161 tgagcctcct ctgtatctga aagcagggaa gaagtttttc tcccttataa aagagcctgg 5221 ggaaactgaa tttattcatc agattgacta tatccgtgtg cccggcaaaa ccaatgattt 5281 tttgatctat cgctcactaa acgatagaaa tataagacct gaaatcatag tacatacaaa 5341 gagtacactt gcttcccagc atgagaatga cgcgtctgaa atactagcaa aaaagactgg 5401 tttttggagt tacaaagata tactttctcc cgaagcatta gcttcagtta agttaccttc 5461 accagaagta gcaaccaaag agcatattca agagattctc gcttatgaca aagttaccga 5521 aggtgattca cttcgcttct ttaattatgt tgcctcaacc tctcttgata tgccaaagga 5581 attctggccc gtaatggtgc tatggctttg cgatgagatg aatcatcatg ctgggttcaa 5641 gacagcgtat cataaactgt tcggcgttgc tccagtcatg gaggctgcac ttgctgtaga 5701 agagtcagat tttggacagt ttgactcgat tatgaaagag ccttttaagc ttcttgtagc 5761 ccttacctat gatgaagctt ctaccatcaa tggatataag cacgacttag cagtctataa 5821 aataattggt cagccgttca ctaacttttt aagaagggtg aatgctgatg agagctggca 5881 cttctcgaag tttgcaaatt tggctgcaaa gtacttccct gaaaggatag ctgaagtccc 5941 caagattttg gaggaagtaa aatcactgga tggaagacct tataagcgca cgtttttgtt 
6001 tgatcatgat ccgtctgtag aagctcaatt cacaaaagca gcgcaagata aagcttgtga 6061 catcgtcttg aaggtcttaa ccaagaagat ggattggtgg aggtctcaac aaaatgggtg 6121 cagacttaac agctattgag gctaaactcg aaaagttaaa tagctcggca caaataaagt 6181 taatccgtga aggctgtcgt tagtgagcca ctgcggtggc tcctccctag gcgactgacg 6241 tccagcgtgc cgtagacagt tgcacagccg cctcaagtgg cgtaagcgca agcctccgca 6301 agacttacgt tgtagacgtg ccataggcat acccggaggg tcatgactca ttgtttgatg 6361 agtgaatcta cggtgaacat ggcttttcag tagattcact catgcggttg agagatgaca 6421 ggtcattaat tgcttagtat attcatttat acataataga tattttcggt aacatttcag 6481 tatatttact aatgcataat ctagtcttca gggcggaatg tagtagaaaa tatgcaattt 6541 aagtataaac tctcatgact aaaaataata ggatatcgta cccagttgca gaaacaaaga 6601 ttacaactga tcgaaaagtt gttacgtctt taagtcaatg gttttacaac cttcctatta 6661 gccgcaaaca gttgattgct ttaattgcct ctgaattagt atctattctg ggaataggta 6721 ttggagccac cttaataatt actaacggtt tgcggactca actgtttgaa caagctaagt 6781 cagaagttgc agttttagat actaattaca atattaaaat taatcaaatg ggctttgggt 6841 ttcggggaca atcagataat accgcaatta ttaaagcatc tgttatctat gattcaatga 6901 aaaccataag tccggattta aaagcagaag tgaagcggat tcttgaaaac gaaatcaaag 6961 ctcgccaaat tgaatatgct acgttagtag gtaaagattt caagattatt gtcaacgcta 7021 atagtgaccg tgaaggtgat gtttttaatc ctaacaactt agtcagtgag gttttcaaaa 7081 atcccaaaca aattaaagcc agtagaatcg ttagctggtc agaaatcaat aaagaatcac 7141 ctcccttacc aaatactttc agcaatcatg atgctctcat tcgttataca gtaacccctg 7201 tcagagatca aaacacaaaa gctgttgtgg gtgctttaat atctggggat atagttaatg 7261 gcaaacaacc tattgttaag aacactttag aagcaactgg aggcggttac agtgctgttt 7321 acttacggaa atctacggga gagtttgccc tagccacttc tgttgctcaa ggtgaatcta 7381 aggatttcaa ccaagctctc tctaatgtag agttaccaaa aaaagagacg gaatctctac 7441 tcaaagaagc agcaagttca accgaaggaa cagtggtcac tagacgcatg gtagtaggaa 7501 atgaaaccta caccatagca gcaaaggctg tccctaataa aatagttgaa gaagctgatg 7561 gtctatcagc agtttttgac ggacagtcaa gcgctatttt agtgcgggga actccagaaa 7621 aatttttcaa tcatttactg ctacaaagtt tgctggaaca aaccctcacc attgttgtcg 7681 
cactcatcat tattgctatt tgggcagtta ttcttaggcg gacaattatt aaacctatcg 7741 aaaatctaca gcaagcagcg caaaaatttg ctgctggcga tcgctcttct cgggctgaga 7801 tttttgctac tgatgaagtt ggtcaattag ccattaattt taacataatg gcagaaagaa 7861 ccttagaaca aatccgatac caagaagatg aaacgaaact agcactgcag ctcaatgaaa 7921 taactacggc gatgcgtgaa tcactagaca ctgaaaaaat cttgaaagca gcagtttcta 7981 gcacgcgaaa agccatacga gccgagcgag tgctttttta tcgtttagac gaaaactatc 8041 aaggtacagt tattgctaaa tccgttaatt acgaaggttc agcaacttta agaacaagca 8101 taactgaacc ctttcatcca gaagaacact ttcaaaacta taagacaagt gaggttaaag 8161 tcgttgaaaa catttacgaa gccgatttga gccagcaata ccttaaacag ctagaatcat 8221 ttgcggtcaa agcgtattta ctggcaccta ttttcctcaa caaaaagcta tatggtttat 8281 tgatagtgca tgagtgttcg gattaccgaa agtggcaaga tatagaaatc actttgttta 8341 agcaggtggc aattcaagtt ggctatgctt tagaacaatc agagcttctc caacatgtag 8401 aacaaagttg tcaaattgct gaaacagctt caattcagga acgacaacaa aaagaagccc 8461 tgcaaatgca acttttgaaa ctccttagtg aagtagaagg tgcagccaga ggtgacttga 8521 cggtgcgtgc tgaagtcaaa caaggggaaa ttggtacagt tgccgacttt ttcaactgta 8581 tcgttgaaag tttgcgcttg attgtgtcga aagtgcaggt atcggcattg caggtaaatc 8641 aggcgatctg gacaaactct gaagcaatcg gtgaactggc aaactcagca ctgattcaag 8701 cccaagaaat caaccgcaca ctcgatgcag ttgatcacat gagccaatcg attcaaca // LOCUS NODE_3780_length_8671_cov_6.034819 8671 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8671) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8671) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8671 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..206 /locus_tag="DP116_24795" CDS <1..206 /locus_tag="DP116_24795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748670.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="IS607 family transposase" /protein_id="PRJNA477356:DP116_24795" /translation="LVFAICEEFETEVVIINKSNEEVPFEQELVQDMIELITVFSARL YGSRSKKNKKLIDGMTQVVKEVS" gene 206..1291 /locus_tag="DP116_24800" CDS 206..1291 /locus_tag="DP116_24800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006616249.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_24800" /translation="MLLGFKTELKLNNLQRTALMKHCGVARHAWNWGLGLTKQILDHN KANPNSKIKFPSAIDLHKWLVALVKSEHEWYYEVSKSTPQQALMALRESWKRCFNKTA GVPRFKKKGRRDSFTLEGTVKILGNNKIQVPVIGVLKTYERLPQVKPKSCTISRQANR WFISFRIETETHSTEHTDVVGVDLGVKTLATLSTGEVITGAKSYKKYESKLSRMQWLN RHKIIGSANWKKAQMQIAKLHRKIANIRKDTLHKLTSLLAKNHGRIVIEDLNVSGMMA NHKLAKAIADMGFYEFRRQLTYKCELYGSKLVVVDRWFPSSKTCSNCGTKKETLTLDE RVFECGHCGFSLDRDLNAAINLSKIAS" gene 1412..1774 /locus_tag="DP116_24805" /pseudo CDS 1412..1774 /locus_tag="DP116_24805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744260.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" gene 1986..2078 /locus_tag="DP116_24810" /pseudo CDS 1986..2078 /locus_tag="DP116_24810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007356698.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="aspartyl protease" gene complement(2135..2701) /locus_tag="DP116_24815" CDS complement(2135..2701) /locus_tag="DP116_24815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358022.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_24815" /translation="MTTTTSEKVRLTTADLDLFPDDGKRYEIIDGELFVTRAPHWKHQ EVCVKIGTQLEIWSTQTGLGRVAFAPGIIFTDTDNVIPDVVWVSHQQLTQLLDEAGHL TGAPELVIEVLSPGEKQEKRDRELKLKLYSIQGVHEYWIFDREKQKVEIYRREKAVLK LVATLYKDDNLTSPLLPGFSCAVERLFG" gene complement(2755..3072) /locus_tag="DP116_24820" CDS complement(2755..3072) /locus_tag="DP116_24820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308858.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Txe/YoeB family addiction module toxin" /protein_id="PRJNA477356:DP116_24820" /translation="MTKKKKKKAEAEEIKPVVVNRTPGFSSKFKEDLAWWFKQDFDKA LKILDLVTAVMQDPFEGIGKPEPLKYMDADMWSRRIDLEDRLVYRVGNTQIDFLTCRY HYE" gene complement(3044..3349) /locus_tag="DP116_24825" CDS complement(3044..3349) /locus_tag="DP116_24825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453816.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prevent-host-death protein" /protein_id="PRJNA477356:DP116_24825" /translation="MLSYKITSPTDARNDFFKLLDLVVENHQVYIINRRDSENVALIA ESDLVSLVETVYLLRSPANARHLLDAIEESKTGKIQPQTITELQQELGIDQEEEKES" gene 3556..4830 /locus_tag="DP116_24830" CDS 3556..4830 /locus_tag="DP116_24830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017719957.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NCS2 family permease" /protein_id="PRJNA477356:DP116_24830" /translation="MTMAYILVVNPRILSNAIFLQQPGDLFNQLLLSTAISSAIATAL MGLLSNYPFALAPAMGLNAFFAFSVVLDLQMNWRLALTSVLIESLIFISLILCNIHIQ IIKAIPASLKHATVAGIGLFIAYIGLSGVPEPPTLGAGIIVASKATLTTIGSFKQPAT LLSAFGLLLTATLVARHIKGAILWGIFATALLGWILGIAQSPEKIIALPEWPQDLISQ AFTGLSYLTPKQIWNFVSVTLTFLFVTSFDTIGALTGLGQQAGYINKNGELPRATKSL LAGAMGITIGALMGTSPCATYLESASGISEGGRSGFTSVVVAVLFLVSVLFTPLFAAV PTFATAPALIMVGFLMMSSVQNINWNDPAEAIAAFLILLTMPLTYSIAEALAIGFIIY PLIKVSQGLAHQVNKTVYFLAAISVFHLVLKS" gene 4849..7635 /locus_tag="DP116_24835" CDS 4849..7635 /locus_tag="DP116_24835" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017721691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NHLP bacteriocin export ABC transporter permease/ATPase subunit" /protein_id="PRJNA477356:DP116_24835" /translation="MEVLYRLKANERLLLDDQEKVWVVQTGSVAVFATKVNNEMPVGD TQSVKPPAYRLTEATQNEHRRYLFSVGAGAALFGAATKIGESLSLVAVAIEATELSQI SIADLVSLVATGEAEAMLQTATQALAKLTEGIALVNGWINHLSETFSTEPTAANFSHY LSLIKSINSENAASLPSILADLHSEFFDYLKKLQQQETEIAFRQFQQREQLNRQVVNS ALSKLASVLQPQQETVSFQGTPLLVAAGAVGRAMGITINPPAQSEDISRVKNPVEAIA RSSQIRTRRVVLEYGWWLSEYGPLLAYTQEEKRPVALLPAGKRYIFFDPLAQTRTFVN QAVAARLAPQAYQFYRPLPKVNNALALFQFGIKGYEKDIILVLVTGIIGTLLGMVVPQ ATALLVDNAIPDSDQSLLWQIGLALLAIAFGRSAFGMSQGILALRVENAADSALQPAI WDRLLKLSPAFFRSYSSGDLLNRTLSVNQIRRILSGATQRTLLSGLFALLNSVLMFVY SWQLALVGVGIALLTAAITAVSGLLLVRFSRRLQELDGEINGLTIQLINGVAKLRVAQ AEERAFAAWANQYSQRTRLTATSQQIKDSVSVLNEILSLLTSALLFGLAVLFLQTATA SGSGGFTTGTFLAFNSALGTFIGGVSNLSNTVTTILAIVPLWERAKPILQQELEYDSN KVDPGRLTGRVILDHITFRYREDGPPILNDVSLYAEPGEFVAIVGPSGSGKSTILRLL LGFETPLSGKVYYDGFDLAELDLVAVRRQFGVVLQNGRIGSGSIFENISASALISHNE AWEASRMAGFAADIEQMPMGMHTVISEGGTNLSGGQRQRLLIARALVNKPKIILMDEA TSSLDNRTQAIVTESLDQLNATRIVIAHRLSTIRNADRIYVMEAGRVVQVGTFEELAE QEGLFSQLVARQME" gene 7632..8102 /locus_tag="DP116_24840" CDS 7632..8102 /locus_tag="DP116_24840" 
/inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24840" /translation="MKQNEYGFIAQSGFSSSHLQPDRTHILFLKKVRTPLSPLFPVAC CLLPTSSLSIIHYRVRLISYDSTTEPHTPGASTGGTRQPVATTGRHMRDCRETRPTQW LGNPRNALPPQDRAGSPLSRTQPRPPGNQFPGLQFKSTKVDSKAYAVVFRRLLL" gene 8406..>8671 /locus_tag="DP116_24845" CDS 8406..>8671 /locus_tag="DP116_24845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749118.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24845" /translation="MNSPQPSDENNRYVKTILILAANPNSTSRLQLDKEIREIDEGLK RGNKREQFKLEQKWAVRQRDFYRAILELNSTSLSPDPLVFKTVL" BASE COUNT 2411 a 1838 c 1914 g 2508 t ORIGIN 1 agcttgtttt tgcaatatgt gaggagtttg agactgaagt tgtcattatc aataagtcta 61 atgaggaagt gccttttgaa caagaattgg tgcaagacat gattgaactg attactgtgt 121 ttagtgctcg cctttatggc tctagaagca aaaaaaacaa gaaattaatt gatggtatga 181 cgcaagttgt taaggaggtt tcatagtgtt gttaggtttc aagactgagt tgaaactgaa 241 taatctgcaa cgtacagcat taatgaagca ttgtggcgta gcgcgtcatg catggaattg 301 gggacttggt ttgaccaagc aaatccttga ccacaacaaa gcaaatccca attccaagat 361 taaattccct agtgccattg acttgcacaa gtggttagtg gcgttagtaa agagtgagca 421 tgaatggtac tacgaagtca gtaaatcaac gccacaacaa gcactgatgg ctttgcgcga 481 gtcctggaaa cgatgcttca acaaaaccgc tggcgttcct aggtttaaga agaaaggtag 541 acgtgactcc ttcacgcttg aaggtacagt caaaatttta ggcaacaaca aaattcaagt 601 acctgtaatt ggtgtcctca aaacttatga gcgactaccg caagttaaac ctaaatcttg 661 tactatatcg cgccaagcaa ataggtggtt tataagtttt cggattgaaa cagaaactca 721 cagcactgag catacagacg ttgtaggagt tgacctcgga gttaagactc tagcaactct 781 ttccaccggt gaggtgatca ctggcgcaaa gtcctacaag aaatatgaat ctaagttgtc 841 cagaatgcaa tggttgaatc gccataaaat aatcggttca gccaactgga agaaagcaca 901 aatgcagata 
gctaagctcc atagaaagat agccaatatc cgaaaagata cgttgcacaa 961 gcttacctca ctacttgcta agaaccacgg caggatagtg attgaagacc tcaatgtatc 1021 tggaatgatg gcgaatcata aactagctaa agcaatcgct gacatgggat tttatgagtt 1081 tcgtcgccag ttgacataca agtgtgaact gtatggttca aaacttgttg ttgttgaccg 1141 atggtttccg agttccaaaa cttgttcaaa ctgcggaaca aaaaaagaaa cactcacttt 1201 agatgaacga gtgtttgagt gtggtcattg tggtttcagt ctcgacagag atttgaacgc 1261 tgcaatcaat ttgtccaaaa tcgccagtta ggcgaggctc gcgcctgtgg actggttaac 1321 gccgacgtta ccaggatgaa gcaaggaagt aaacgaaact caaagtagat atttccattt 1381 atgggtagct ttgggtaggt gttatgtaac ggaaagtgag tcagttattc aaaacgcgac 1441 ttcggtcaaa ttgatagtag aagttgttag tacggcagtc gcttcaagcc gggaaacccg 1501 tccaacgcgc tgccttacca attggcaaga cgactactac gacaaacgtc gtgattatga 1561 agctatgggc attcccgaat actggattgt agattatgcc gccttgggag ggcgtgaatt 1621 tatcggctat ccaaaacaat gcactatctt tgtatacgaa ctaattgacg gggaatatgt 1681 aaaaacaacg tttagagaca gtgatgtgat tgtctcccct accttcccac aatttaacct 1741 caccgcgcaa cagattttta atttggcttt gtaattggat gaacaaaccc gcagaacgca 1801 acaatgttag tagccagttt gaattttgaa tatttgagca gcgagtggta cattaataga 1861 cgcaataggc gcaaatattg aagaatctct gcgatgtgaa actatggaag ttatgtcacg 1921 tcaaaagcac ccaaatgccc cttttttcct ttttaggatt aacgaaacag ggtttttggg 1981 actatatgat tgagggcagg tttgatgatg gagacgagtt attctttgaa atgcagttga 2041 ttggtgacgg cttggaacta acagtagatg tcatgttagg caaaatattg gataaatgtg 2101 ccaaaaagta ccgaggcttt tgtcaacggt actctcaccc aaacaaacgc tcaacagcac 2161 aactaaaccc tggcaaaaga ggactggtta aattatcatc cttatacaaa gttgcaacta 2221 gtttcaaaac agccttttcc cgacgataaa tttctacctt ctgcttttcg cggtcaaaaa 2281 tccagtattc atgcacacct tgaattgaat acagctttaa ctttaactca cgatctcgct 2341 tttcttgctt ttccccagga gacaatacct caatcaccaa ctctggtgca cctgtcaaat 2401 gtccagcttc atccagtaat tgtgtcaatt gttgatgact tacccaaacc acatcaggaa 2461 tcacgttgtc agtatcagta aaaatgatgc ctggtgcaaa agcaactctt cccaatccag 2521 tctgagttga ccagatttct aattgtgtac ctatttttac acaaacttcc tgatgcttcc 2581 agtgaggcgc tctagtcaca 
aataattctc catcaataat ttcgtaacgt ttgccgtcat 2641 caggaaataa atctagatca gcagttgtca atcgtacttt ttctgatgtt gtagttgtca 2701 tatcatccca ccttgttttc tcatttattg atatttctga caagtttaag tgcatcattc 2761 atagtgatac cgacaagtta gaaaatcaat ctgggtgttt cccacgcgat agacaagcct 2821 atcctctaaa tctattcgcc gcgaccacat atctgcatcc atatacttca atggttctgg 2881 tttaccgata ccctcaaatg ggtcttgcat gacagccgtc accaaatcta agatttttaa 2941 agctttgtca aaatcttgct tgaaccacca agctaaatct tctttaaatt tagagctaaa 3001 accaggagtg cgattcacta caactggctt aatttcctca gcttcagctt tctttttctt 3061 cttcttggtc aatccccaac tcctgctgaa gttctgtgat agtttgaggt tgaatctttc 3121 ctgtttttga ctcttctatt gcatccagta ggtgacgtgc atttgcaggc gatcgcaata 3181 gataaaccgt ttcaaccaaa ctcactaaat ctgactcagc aatcaacgct acgttttcgc 3241 tatcacgacg gttgatgata tacacttgat gattttctac taccaggtct aacagcttaa 3301 aaaagtcatt tctggcatcg gtcggggatg tgattttgta ggatagcatc cgtttctgta 3361 caaacttctg tacatatatt ttaaattgtc aatcatcttg ttgacaattt ccaaaaaact 3421 ttttctagca ggttaataac agtcaaatat gctacatata ttctgatttc ctgcaccgcg 3481 tttcaccaga ctgccgggaa acaggtagct gctactattc caccagagcg atcaccgtag 3541 tgtactagga cgtttatgac tatggcttac attttagtcg tgaatcccag aatattatca 3601 aatgccatat ttctccagca gcctggtgat ttgttcaatc agcttctttt gagcactgct 3661 atctcctcgg cgatcgcaac tgctctaatg ggactgttgt ctaattatcc ctttgccctg 3721 gctcccgcaa tgggtctaaa cgcttttttt gctttttcag ttgtgttgga cttgcagatg 3781 aactggcgac tggcgctaac gtctgtccta atcgaaagct taattttcat cagcttgatt 3841 ttatgcaata tccatatcca aattatcaaa gcgattccag cttccctcaa acatgctact 3901 gtggcgggta ttggtttgtt catcgcctat attggtcttt ctggtgttcc tgaacctccc 3961 actctaggag caggtattat tgttgctagt aaagcgacct taacaactat tggctctttt 4021 aagcaaccag ctactttact ctcagctttt ggtctattgc ttaccgctac tttggtagca 4081 cgtcatatta aaggagcgat attatggggt attttcgcta cagcgttgtt aggctggata 4141 ctaggcattg ctcagagtcc tgaaaaaatt attgccctac ctgaatggcc acaagacttg 4201 attagtcaag cattcacagg cttgagttac cttacaccca agcaaatatg gaattttgtg 4261 tctgtcacac ttaccttttt atttgtaact 
tctttcgaca ccattggcgc actcactggt 4321 ttgggacaac aagcaggtta cattaataaa aatggtgaat tgcctcgcgc taccaaatct 4381 ttgttagcgg gtgctatggg cataactatt ggggcattga tgggtacttc tccatgcgcg 4441 acttatctgg aatctgcctc tggcatatct gaggggggac ggagtggttt cacatctgtg 4501 gtagtagcag tcctattttt ggtttctgtg ttatttactc ccttatttgc agcagtgcct 4561 acttttgcaa cagctcctgc attgattatg gtgggttttt tgatgatgag tagcgtgcaa 4621 aacattaatt ggaatgatcc agcagaagca attgctgcat tcctaattct tctaactatg 4681 cccctgactt actctattgc agaggcattg gcaattggtt tcattattta ccctttgatt 4741 aaagtttccc aaggactggc tcatcaagta aataaaactg tatattttct agcagctatt 4801 tctgtgtttc acctcgtact aaagagttaa gaagacttgc gccaatatat ggaagtcctg 4861 taccgcctga aagcaaatga acgactactg ctagatgacc aagaaaaagt atgggtcgtg 4921 cagactggct ccgtcgcagt gtttgccaca aaagtcaata atgagatgcc agtaggcgat 4981 acacaaagcg tcaagcctcc agcttatcgc ctcacagaag ctactcaaaa cgagcatcgc 5041 cgttatttat tcagtgtagg tgcaggagcg gcgttgtttg gtgcagcaac caaaatcggt 5101 gaatctttga gtttagtagc agtagcaatt gaggcaacag aattatccca aatttccatt 5161 gcagatttgg tgagcctagt ggcgactggt gaagcggaag cgatgctgca aaccgcgacg 5221 caggcgttag cgaagctcac cgaaggtatc gctttagtga atggttggat aaatcactta 5281 agtgagacat tcagcacgga acccacagca gcaaattttt cccattattt atccttaata 5341 aagtcaatca actcagaaaa tgccgcttca ctaccaagca ttttggcaga tttacattct 5401 gaattcttcg actatttaaa gaaactccaa cagcaagaaa cagagatagc ctttcgccaa 5461 tttcaacaac gagagcaact caatcgtcag gtcgtcaatt ctgcactttc taagttagct 5521 tctgtgttgc aaccgcagca agaaactgta tcttttcaag gtacaccatt attagtagcg 5581 gcgggagcag taggacgagc aatggggata acgataaatc ctccggcgca gtctgaggac 5641 atcagtcgag tcaaaaatcc agtggaggct attgctcgct cttcccaaat tcgcactcgt 5701 cgcgtggtgc tagagtatgg ctggtggctg agtgagtatg gtccactttt agcttatacc 5761 caagaagaaa agcgtccggt cgctttgtta ccagcaggaa aacgttatat tttctttgac 5821 ccgcttgcac aaactcgcac atttgtcaat caagcagtag cggcaaggct agcaccgcaa 5881 gcataccagt tttaccgacc attacccaaa gttaataatg cacttgcgtt gtttcagttt 5941 ggcatcaagg gttatgaaaa agacattatc ctagttctgg 
taactgggat aattggcact 6001 ctgttgggaa tggttgtgcc gcaagcaaca gcgcttttgg tggataatgc aattccggat 6061 agcgatcaga gtttattatg gcaaatagga ctggcattgt tggcgatcgc ctttggaagg 6121 tcagcttttg gcatgtccca aggtatccta gccctacgag tggaaaacgc tgctgatagc 6181 gccttgcaac ctgcgatttg ggatcgactc ctcaaactaa gtcccgcctt ttttcgctct 6241 tattcctctg gcgacttact taaccggact ttatcagtaa accaaattcg tcggatacta 6301 tcgggggcaa cgcaacgcac cctattaagt ggactgtttg ctttactcaa ttcagtgcta 6361 atgtttgtct atagttggca actcgcctta gttggggtgg ggatagcttt gctaacagct 6421 gcaatcactg ctgtctctgg cttgctgtta gttcgctttt cgcgacggct gcaagaattg 6481 gatggcgaaa ttaacgggtt aacaatacaa cttataaatg gagttgccaa gttgcgggta 6541 gcgcaggcag aagaacgagc gtttgcagct tgggcaaatc agtacagcca gagaactaga 6601 ctgacagcaa catcgcaaca aattaaagac agtgtttctg tgttgaacga aatactatct 6661 ttgctaactt cagcactgtt gtttgggttg gcggtgctat ttctgcaaac ggctactgcg 6721 agtggtagcg gcgggttcac aacaggaaca tttttggcgt ttaattcagc gttgggaact 6781 tttatcgggg gagtgagcaa cctcagcaat actgtgacta ctattttggc aattgtacca 6841 ctgtgggagc gggcaaagcc gattttgcag caggaactag agtacgattc taacaaggtt 6901 gatccagggc gcttgacggg tcgtgtgatt ttagaccata ttacctttcg ttaccgtgag 6961 gatggtccac caattctgaa tgatgtcagt ctttacgcag aacctgggga gtttgtcgcg 7021 attgtcggac catcaggaag tgggaagtca actatactca ggttgttgtt agggtttgaa 7081 actccactat caggaaaggt gtactacgat ggttttgact tggcagaatt agaccttgtg 7141 gcggtgcgaa ggcagttcgg ggtggtgctg caaaatggtc gaattggatc tggctcaatt 7201 tttgagaaca tatctgcctc tgcgttgata tcgcacaatg aagcctggga agcgtcgcgg 7261 atggcggggt ttgctgctga tattgagcaa atgccaatgg gaatgcacac ggttattagt 7321 gaaggtggta cgaatctttc aggaggacaa cgccagcgat tattgattgc tagagcactt 7381 gtcaataagc cgaaaattat cttaatggat gaagcgacca gttctttgga taatcgtaca 7441 caagcgattg taactgaaag tttagatcaa ttgaatgcaa ctcggatagt gattgctcac 7501 cgccttagta cgattcgcaa tgctgaccga atttatgtga tggaagcggg gcgcgtggta 7561 caagtgggta catttgagga actagccgaa caagaggggt tgttttctca actggtagcc 7621 agacaaatgg aatgaagcaa aatgaatatg ggtttatagc ccagtcgggg 
ttctcatcct 7681 cccacttgca gcctgatagg actcatattt tatttttgaa aaaagtccgt acacctctat 7741 ctcctttatt tcctgttgcc tgttgcctgt tgcctacctc tagtttgtcc attattcact 7801 atcgtgtccg gttaattagt tatgattcca cgactgaacc ccacacgcca ggtgcttcaa 7861 cggggggaac ccgacagcca gtcgccacaa cgggacgcca catgcgtgac tgtcgggaaa 7921 cccgcccaac gcagtggctc gggaaccccc gcaacgcact gcctccccaa gaccgcgctg 7981 gctcaccatt atcaagaacg caacctcgtc cgcctgggaa tcaattccca ggcttacagt 8041 tcaagtctac taaagtagac tcaaaagctt atgcagtcgt ctttagacga cttttactat 8101 gagactgggg tttaaaccct aggcggttgt tggcacaagt gcaagatctc agttttaacg 8161 gacttggact ttgagccaag aaataaattt cttggcggac gaaaagtatg gtgcaagatc 8221 tgagcctacc tacttatcag tcatcagtca tcaattgctt gataattgtt cactgtttac 8281 tgttcactgt tcactgtaat gactggtgca taagttaatt tcaatttcat ctataagaat 8341 taggtataag tgtttttcat atagcacgtt tggattgtaa gcgtatcaaa ttttactatt 8401 gttctatgaa ctctccccaa ccttctgatg aaaacaaccg ttatgtcaaa actattttga 8461 ttttggcagc caatccaaat agtacttcca ggttgcaatt ggataaagaa atacgggaaa 8521 ttgatgaagg gttgaaacga gggaataagc gtgagcagtt taagttagag caaaagtggg 8581 cagtacgtca gcgtgatttt taccgagcaa tcttagaatt aaactcaacg agcctgtcac 8641 cagacccgtt ggtcttcaaa accgtgcttg a // LOCUS NODE_3844_length_8434_cov_4.552453 8434 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8434) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8434) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8434 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..2677 /locus_tag="DP116_24850" CDS <1..2677 /locus_tag="DP116_24850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316023.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_24850" /translation="KLKRPLSLWNPLDYLRLLYWVFYFPQALRWYVDTFGGGYIPKEE MNWRKGWELLRQNSTQRNLLFQGLVLTVVTPLALSRLVQYIGVPLAWLGVALGVALGV VFGVTYGVTYGVGSGVVFGVGSGVAFGVAFGVAFGVALGVAFGVALGVGLGVADGVTS GLTSSVASGVASGVALGMALAVTSGVTLGVAGGMAFAVALSALGVAFAVALGVAILRP ENWLLGLPLVLRSPHRQWQIPRVTPLPLYSLSFWLKNWLRQDWETGLHNANQLLAYTL QFIPVIQAVNLVLAETPPEQVVFRVVQLAEDPFDWQLVRFTSASLNETLKSNFIKDFF SFRWFFFFPRRWKEQLQARFFTDTRLDTPARAAAAGFWHLRAAGFWWYLRKEELAKAM AAFAQIRSLPYGEEMFTLAQTLAAFNEAKKPDAIATVKIPPFPQEPLLRPVTWQTLTS LRSVVENAKIIQRSQSAYQRSLALNRALGELTEILDNPDTLPQAERGLIVDIARTWRK ALEGIAKEVGEISITKPVSNPYIIGDPVVGDRFVGREDVIRQLQELWMSGSQLQSVVI YGHRRMGKTSILRNVASFIKPSVQVAYVNLLEVADASQGVVEVLMAISDAISEVMQIP PPSNEDLLSLPQPTFRRYLTQVEQELGTKGLIIALDEFEKIEELISAEKIPVNFMGYL RGLTQKSSKIAFVLAGLHTLEEMTADYFQPFYASVITIKVGFMEAGATHQILANPAIQ DFPLDYTPEALDKIYELTHGQPYLVQLVAFQLVRRYNDEVFNTGRARDHILRIGEVEA VVNDSEFFQRGRYYFDGVWGQAARGAGGQREILQALASHPEGLSLEMLSDCTNIEIAL LQEALNTLMRHDVVEEIEGRWQIIVELFRRWVLQL" gene 3023..5500 /locus_tag="DP116_24855" CDS 3023..5500 /locus_tag="DP116_24855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743755.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24855" /translation="MTWTPLRERINKLPLVLAGPILRRTEPNAVTVWVALKESRNVTL EIFDTNKRRLFSGSQKTIQVGTYLHVVAVTAKTSSNVLCHGENYLYNLYFGHGETLNQ PGVLTQYGSIENIIYPPYDLPSFALVPSHLNDVRIIHGSCRKPHGESLDALVTVDKMI REALEQDSKKRPHQLFLTGDQIYVDDVADALLFMLMDASQTLLGWTETLPDVQNSEEL NPGKRNNLATHTAGLTASFSKLNKISNVAKSHLFTLGEFLAMYLFAWSDVLWIKPEDF PSFEDVFPDARNVHPDVKISGENRTSFTEDVIYLQYFQVAIKDIRRALANVPTYMIFD DHEITDDWFLNIAWCDRTLNKPLGRRILQNGLLACAICQAWGNTPEQFEQGKPGEALL KATEAWLASFGTNPQYEQEIALRVGLPSVADIKTSQPRRVIHQEGSLKWHYTVTAPEY EVLVLDTRTWREFPGKDFDFPGLLSAEGCDEQISKVVRPQNTRVTLVIAPSPVIGLPF LETLQKTAKTVAEKLGAAAWGFDPEAWGLEETAFERLFSRLALRALPAQQSRVIILSG DVHYSFAARLQYSAIRPFQSSKNVKTELVVAQFTSSSLKNEAKGFGGSHSLHKKGFVP FAIIKYLPTAEVIGWENKGGNVLEIGGFYTLVDQTVQHFPWRVKCSPAKVDLVQERDW FRVLQITKQPEWWYRINFLSAKTEAINEPRNYNSPQFYSVKAPLPGQERKQPLEQYLA MARNSHDYIGKKGKGREVVGLNNIGEITFEFVDGKEIAVQTLWWRLESREKGKLLEPF PLTRCEVSLAFDNPNYPMDEVLKEVKW" gene complement(5777..6370) /locus_tag="DP116_24860" CDS complement(5777..6370) /locus_tag="DP116_24860" /inference="COORDINATES: protein motif:HMM:PF00293.26" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NUDIX hydrolase" /protein_id="PRJNA477356:DP116_24860" /translation="MSIIELLEELRAIAQLGLNYSKDIYDRERYERLLTLASSQYSEL SSLPSSEIEKRFRAELGYITPKIGVSAAILNDQGKLLLVQRTDDSTWCLPCGWAEVRE TPQQSIMREVQEETGLLVEVGSLIRLGYRMPGDFGQPHTTYHLQFYCTVVGGTLQESH ETINVGYYDISSINKWHVDHQIEAEAAHQYWGFLKNK" gene complement(6392..7279) /locus_tag="DP116_24865" CDS complement(6392..7279) /locus_tag="DP116_24865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_24865" /translation="MLRIKNLNKAYGKRKVLQDLTLHIEHGKIYGLLGANGSGKTTTI NIICNLVKADSGYVTIYNQEVSEETKKIIGIAPQENLLYKTLSCEENLNFFALIYGLD SHTRRQQVEATLSAVNLLDRAKSPVETLSGGMQRRLNIAVALVHQPKLVILDEPTTGL DIEARYEIWELIRQLKNQGITILLTTHLLDEAERLCDKIGILKSGRILAEGSLLELRK RIPAQEIVIVQTSEEELAIARALEYGFTPRRYGRDLAFWVPEQLELKEILSRFDGIPL DSIARQPVRLEYIYLELTQ" gene complement(7352..8125) /locus_tag="DP116_24870" CDS complement(7352..8125) /locus_tag="DP116_24870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113207.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter permease" /protein_id="PRJNA477356:DP116_24870" /translation="MKYWRETIAVTQRILIELLRRKRSLIFWSIFPISVLILNGMILA ERAKLSMDRAFENAAPSTLVGAALFFSCLGGSVATVVAEREQQTLKRLFISPLSGMSY FLGIFLAHSCIGFGQTLLVYTIAGFWGATFKGSIFLGLIIIIMSIVAYVGLGFILGTQ LARRTEDVNALVAAFGVPLLILGGVFLPSSLFPKTLLDIANFNPIYHMNEALLGVSAQ NNTVSDIASHFQFLTVFSLVMVVGGWFSYRRMLMVEKRL" gene complement(8127..8243) /locus_tag="DP116_24875" CDS complement(8127..8243) /locus_tag="DP116_24875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24875" /translation="MGYGLGLRVKSPFGLIRGDLGINDDGDIRFEITSGQRF" gene complement(8253..>8434) /locus_tag="DP116_24880" CDS complement(8253..>8434) /locus_tag="DP116_24880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012593725.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24880" /translation="HDANNDGILNRNLLGIPTEEFGFSQNPKILTGPPKFGDCAVLVA GPQTNIQIQLLNFLS" BASE COUNT 2416 a 1759 c 1853 g 2406 t ORIGIN 1 caaactcaaa cgccccctct ccctgtggaa cccgctggat tatctacgcc ttttgtattg 61 ggtgttttac ttcccccaag ctttgcgatg gtatgtggac acctttgggg gtgggtatat 121 ccctaaagag gaaatgaatt ggcgcaaggg atgggagttg ctgcgacaaa atagcactca 181 acgaaatttg cttttccaag ggttggtgtt gacagttgtt acacctctag ctttgagtag 241 acttgttcaa tacataggcg ttcctcttgc ttggttaggc gtagcgttag gcgtagcgtt 301 aggcgtggtg tttggcgtga cgtatggcgt gacgtatggc gtggggagtg gcgtggtgtt 361 tggcgtgggg agtggcgtgg cgtttggcgt ggcgtttggc gtggcgtttg gcgtagcgtt 421 aggcgtggcg tttggcgtgg cgttaggtgt ggggttaggc gtggcggatg gcgtgacgag 481 tggtctgacg agtagcgtag cgagtggcgt ggcgagtggc gtggcgttag gcatggcgtt 541 agccgtgacg agtggcgtca cgttaggcgt ggcgggcggc atggcgtttg ccgtggcgtt 601 aagcgcgtta ggcgtggcgt ttgccgtggc gttaggcgtg gcgatactcc gtccagaaaa 661 ttggcttttg ggtttacctc ttgttctgcg atcgccccac agacaatggc aaattccccg 721 tgttactcca cttccccttt actccctgtc tttttggttg aaaaattggc tgcgtcaaga 781 ctgggagact ggtttgcata atgccaacca attgctagct tacactcttc aattcatccc 841 cgtgattcaa gctgttaatc tggtactggc ggaaacccct ccagagcaag tcgtctttcg 901 tgtcgttcag ctagctgaag atccttttga ttggcaactc gtgcgtttta cgtcagcttc 961 tctaaatgaa acgctcaagt caaactttat caaagatttc ttttctttcc ggtggttctt 1021 tttctttcct cgtcgttgga aagaacaact ccaagctcgt tttttcacag acactcgctt 1081 agacacccca gcccgtgctg ctgcagctgg tttctggcat ctccgtgcag ctggtttttg 1141 gtggtatctc cgtaaagaag aactagcgaa agctatggcg gcttttgcac aaatccgttc 1201 tcttccctac ggtgaagaaa tgttcaccct cgctcaaact ctagcagcct ttaacgaagc 1261 aaaaaaacca gacgcgatcg ccacagttaa aattcctccc tttccacaag aacctctgct 1321 gcgtccagtt acctggcaaa ctctcacatc tctacgttct gtcgttgaga atgccaagat 1381 tatacagcgc agtcagtctg cttatcaacg ttccttagcc ctcaaccgcg ctttagggga 1441 actcacagaa attctagata atcctgatac tctgccgcaa gcggaacgcg ggttaattgt 1501 ggacattgcc 
cgaacttgga gaaaagcctt ggaggggata gccaaggaag ttggggaaat 1561 ttccatcacc aagcctgtga gtaatcctta cattattggc gatccagttg tgggcgatcg 1621 cttcgttgga cgagaagacg tgatcagaca gttgcaggag ttgtggatga gcggttctca 1681 actccaatct gtcgttatct acgggcacag gcggatgggc aaaacttcca tcctccggaa 1741 tgtcgccagt tttataaaac caagtgtaca agtcgcttat gtcaaccttc tagaagttgc 1801 agatgcttcc caaggagtcg tagaagtgct gatggcaatt tctgatgcca tttctgaagt 1861 gatgcaaatt ccacctccaa gcaatgagga tttgctgagt cttccccaac cgacatttag 1921 aagatacttg acacaggtgg agcaagaact agggactaag ggcttaatta tcgccttaga 1981 cgagtttgag aaaatcgagg aattgatttc ggcggaaaaa atccctgtaa attttatggg 2041 atatctccgg gggttgacgc agaaaagctc taaaatcgct tttgttctgg caggtttgca 2101 caccttggag gagatgacag ccgattactt tcaacccttc tacgccagtg tgattacaat 2161 caaagttggc tttatggaag ccggggcgac tcaccaaatt cttgcaaatc cagctatcca 2221 agactttccc ctcgactaca cgccggaagc ccttgataaa atctacgaac tcacccacgg 2281 acaaccttat ttagtacaac ttgtcgcttt tcaactcgtc cgccgttaca atgacgaagt 2341 ttttaacaca ggacgcgccc gtgatcatat cctcagaata ggggaagttg aggcagttgt 2401 caacgactct gagtttttcc agcggggacg ttactatttt gatggcgttt ggggtcaggc 2461 ggcgcggggt gctggtggtc agcgggagat acttcaggcg ttagcgtctc atccagaagg 2521 attaagtctg gaaatgttat ctgattgcac aaatatagaa attgctcttt tacaagaagc 2581 gctcaatact ctgatgcgtc atgatgttgt tgaagaaatt gagggacgct ggcagattat 2641 tgtagaacta tttcgtcggt gggttttaca attgtagaat atttgttctc tagcaggact 2701 tacgcaaaat gatgaaaaaa cgaaccacaa aggacgcaaa ggacacgaag tggcttcccg 2761 aagggtagga cacaaaggaa taagagtttt agagagttat tgcgtaagtt ctatctaaaa 2821 gtcaatacgg ttcggataag atccccccgc ctgcggcgac ccccttaaaa agggggtaga 2881 tgggtaagat agcccccctt tttaaggggg gttgggggga tctcctttga tataacctgc 2941 ttaaccgaac cgtattgatc taaaggtgac agatgttgga gaagaaatat gattaaaatg 3001 tactattgat ggcagaaaat ccatgacatg gacacctcta agagaacgga ttaacaaact 3061 gccacttgtt ctcgctggac caattctacg acgcactgaa cccaatgctg taactgtatg 3121 ggtggctttg aaagaaagcc gtaatgttac tctagaaata tttgacacga ataaaagaag 3181 actttttagt ggtagtcaaa 
aaacgattca agttggtact tatttgcacg tcgttgctgt 3241 cacggctaaa acttcatcaa atgttctttg tcatggagaa aattatcttt acaatttata 3301 ttttggtcat ggggaaacgc taaatcagcc cggtgtctta actcaatacg gttcaattga 3361 aaacattata tatcctccat atgacttacc aagttttgcg cttgtgccaa gtcatttaaa 3421 tgatgtacga attattcacg gttcttgtcg caagcctcac ggtgaaagtc ttgatgcgct 3481 tgtgacagtg gataaaatga ttagggaagc attagaacaa gactcaaaaa aacgaccgca 3541 tcaactcttc ctcacaggtg accaaattta tgtggatgat gtggctgatg ctttactttt 3601 catgttgatg gatgctagcc agacgctttt gggatggacg gaaactttac cggatgtcca 3661 aaattctgaa gaattgaatc caggaaagcg taataacttg gcaacacata ctgctggctt 3721 gacggcaagt tttagcaaac ttaacaaaat ttctaatgtt gccaaaagtc atttattcac 3781 attgggtgaa ttcttagcaa tgtatttatt tgcttggagt gatgtgctgt ggataaaacc 3841 cgaagacttc ccaagctttg aagatgtttt tcccgatgcg cgaaacgttc atcctgatgt 3901 aaaaatatct ggagaaaata gaacttcatt tacagaagac gttatctatt tacaatattt 3961 tcaagtagca attaaggata ttaggcgtgc tttagcaaat gttcctacat acatgatatt 4021 tgatgatcat gaaattacag atgactggtt tttaaatatt gcgtggtgcg atcgcactct 4081 caataaaccc ctcggtaggc gcatactcca aaatggttta cttgcttgtg ctatctgcca 4141 agcttgggga aacacaccag aacaatttga acaaggaaag ccaggagaag cactactcaa 4201 agcaaccgaa gcatggttag cttcttttgg tacaaaccca cagtatgaac aagaaattgc 4261 cctgcgtgtt ggtttgccat ctgttgcaga tattaaaaca agtcagccaa ggcgagtcat 4321 tcatcaagaa ggttctttaa aatggcacta tactgtaact gctccagaat atgaagtctt 4381 agttttggat acgcgcacgt ggcgggaatt tccagggaag gactttgatt ttcccggact 4441 tttaagcgct gaaggatgcg atgaacaaat ctcaaaagtt gttcgtcccc aaaatacacg 4501 agtgactttg gtaatcgcgc ccagtccagt tattggctta ccgtttttag aaaccttaca 4561 aaaaacagct aaaactgttg cagaaaaatt aggtgctgcg gcttggggtt ttgatcctga 4621 agcttggggt ttagaagaaa cagcatttga aaggttgttt tccagactcg cactacgagc 4681 attacctgca cagcaaagtc gtgtcattat cttatctggt gatgttcatt atagttttgc 4741 agctcgcttg caatactcag caatacgtcc tttccaaagt tcaaaaaatg tcaaaactga 4801 acttgttgtt gcccaattta cgagcagttc gcttaagaat gaagccaagg ggttcggtgg 4861 aagtcattca ttgcacaaga aaggttttgt 
tccttttgca attatcaagt atctgccaac 4921 agcagaggtt atagggtggg aaaataaagg aggaaatgtt ctagaaattg gtggttttta 4981 tactcttgta gaccaaacag ttcaacattt tccttggaga gtcaaatgta gtcctgctaa 5041 agtggattta gtccaagaac gcgattggtt tagagtctta caaatcacga aacaacctga 5101 atggtggtac cggattaatt ttttatcagc gaaaacagaa gcaattaatg aaccgagaaa 5161 ttacaattct ccacaattct attctgtcaa agcacctcta ccaggacaag agcgtaagca 5221 gcctttggaa caatacctcg caatggcaag aaactctcat gattatatag gtaagaaagg 5281 aaaaggcaga gaagttgtag gattaaacaa tattggcgaa atcacttttg agtttgttga 5341 tggtaaggag atagctgtac aaactctttg gtggcgtttg gaaagtcggg aaaaggggaa 5401 gcttttggaa ccttttccgt taactagatg tgaagtatcg ctggcgtttg ataatccaaa 5461 ttatccaatg gatgaggtat tgaaggaagt gaaatggtga gtcatgagat ttttaaaaac 5521 atctaatttt taacgttcgc gtagcgtgcg cccttggcgc ataccgccaa gacgccaaga 5581 acgccaagaa aagagagaag agagaatttt acgaatgata tgatttagga ttgcaataac 5641 ttagttatat tcgctatctc tataatcatt ttggctaata tcaaatccgt ttgtaccgga 5701 actatctctc cttcttctct gcgttctctg cgtctctgcg gtttttttaa taattcagat 5761 tcaaccaaaa acgatatcat ttatttttta aaaatcccca atactggtgt gctgcttcgg 5821 cttcaatctg atgatcgaca tgccatttat tgatagagga tatatcgtaa tacccaacat 5881 taatagtttc atgggactct tgcaacgttc caccgacaac cgtacaataa aactggagat 5941 ggtaagtggt atggggctgt ccaaaatctc caggcatacg ataaccaaga cgaatcaacg 6001 aaccaacttc cacaagcaaa cctgtttctt cttgtacctc gcgcataatc gactgctgcg 6061 gcgtttctcg aacttcagcc caaccacacg gtaagcacca cgtactatca tcagttcgct 6121 gaaccaacaa caacttgcct tgatcgttta atattgcagc ggacactcct attttcggtg 6181 taatgtaccc cagttctgcg cgaaatcgct tttctatttc tgaactaggg agggaagaga 6241 gttctgaata ctgtgaggaa gcaagagtga gaagtcgttc gtagcgttcg cgatcataaa 6301 tgtcttttga atagtttaat ccaagctgtg cgatcgctcg cagttcttct agcaattcga 6361 taatactcat ggataaattt ttgaaataga tttattgcgt taactctaaa taaatatact 6421 ctaaccgcac tggctgacga gcaatcgaat cgagaggaat accatcaaag cgagaaagaa 6481 tttcttttaa ttctagttgt tctggcaccc aaaatgccaa atcacgacca taacgtcggg 6541 gtgtgaaacc atattcaagt gctcgtgcga tcgccagttc 
ttcttctgaa gtctgcacaa 6601 tcacaatttc ttgtgctgga attctttttc taagttctaa gagacttccc tcagccaaaa 6661 ttcgaccact ttttaaaatc ccgatcttat cacaaagacg ctcagcttca tctaataaat 6721 gagttgtcag taaaattgta atgccttgat ttttcagttg tcgaatcaat tcccaaattt 6781 cgtatcgcgc ttcaatatcc aaacctgttg ttggttcatc aagaataacg agcttcggct 6841 gatgtaccaa tgcaacagca atattcaaac gtcgctgcat tcctccactg agggtttcta 6901 ccggactttt tgccctatcc aatagattaa cagctgacaa agttgcttct acttgttggc 6961 gacgagtgtg agagtctaat ccatagatta atgcaaaaaa attcagattt tcttcacaag 7021 atagagtttt atatagtaaa ttttcttgag gggcaattcc aattattttt tttgtttctt 7081 ccgaaacttc ctgattgtat attgtaacat acccactatc agctttgact aagttacaaa 7141 taatattaat tgttgttgtt ttaccagatc catttgcacc caaaagacca taaattttcc 7201 catgttcaat atggagcgtc aaatcttgaa gaacttttct ttttccgtaa gccttattca 7261 aatttttgat tcttagcata tttcggaaat tagcataatt gcaaccgcaa ggcacgcaat 7321 agtccatctg caatcaaaag taaaaatcat ctcacagtct cttttctacc atcaacattc 7381 gtcgataaga aaaccatcca ccgacaacca tgaccagaga aaatactgtt aaaaattgaa 7441 agtgagatgc gatatctgat actgtgttat tctgagcaga tactcccaaa agtgcttcat 7501 tcatatgata tattggatta aagttggcga tatctagtag tgttttagga aacaatgaac 7561 tgggcaagaa gacgccaccc aaaattaaca aaggaactcc aaacgctgct accaaagcgt 7621 taacatcttc ggtacgacgc gctaattgtg tacctaaaat aaatcctaag ccgacataag 7681 cgactatact cataataata atgatgagtc ctaaaaatat agaaccttta aaagtagcac 7741 cccaaaatcc agcaattgta taaacaagca atgtttgtcc aaagccaatg cagctatgag 7801 ccagaaaaat tcccaaaaaa taagacatac cactcaatgg ggaaataaaa aggcgtttga 7861 gagtttgctg ttctcgttct gctaccacag ttgcgacact accacccaaa cagctaaaaa 7921 acaatgctgc accaactaaa gttgagggtg ctgcattctc aaaagcacga tccattgata 7981 attttgcccg ttctgccaaa atcatgccgt taagaattaa gactgagatt ggaaaaatac 8041 tccaaaaaat taagctacgt ttacgacgca ataattcaat taaaatccgt tgagtcacag 8101 ctattgtttc acgccaatac ttcatattaa aaccgctgac cactggtaat ttcaaatctt 8161 atatctccgt catcgttaat tcccaagtca ccccgaatta acccaaacgg tgactttact 8221 cgtagtccta acccgtaacc aatgcgttga ctttagctta aaaagttcag 
caattgaatt 8281 tgaatgttgg tctgtggacc ggcaactaga accgcacaat cgccaaattt gggcggacct 8341 gtgagaattt tcgggttttg agaaaagcca aactcttcag tggggatacc caatagattg 8401 cgattgagga tgccgtcgtt gttggcatca tgat // LOCUS NODE_3854_length_8422_cov_4.8973358422 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8422) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8422) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8422 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..761) /locus_tag="DP116_24885" CDS complement(<1..761) /locus_tag="DP116_24885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872992.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_24885" /translation="MDSPPFKTPLANILVIDDTPENLHLLAAMLTEQGYKVRSVTKGS AGLRGAAAAPPDLILLDINMPEMNGYEVCQQLKESDRTRDIPVILISAMNDVLDKVKA FSVGGVDYITKPFQVQEVLARVENHLTIRNLRSSLQEQNAKLQQEIHERKQAEEKFSK AFRSSPSPIAITTLKEGRFLDVNSSYLTMSGYSLEEIIGQNVAELNGITFEKYTNTID KLLETGSLQNYEMEFRTNTKLQSNVVGYKKGLRKQT" gene complement(742..1929) /locus_tag="DP116_24890" CDS complement(742..1929) /locus_tag="DP116_24890" /inference="COORDINATES: protein motif:HMM:PF00072.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24890" /translation="MINLLGNAIKFTDTGGVTLRVQMGQGTEIQNPKDILPNAHLLFE IQDTGKGIAPEELDNLFQPFVQTASATQVKEGTGLGLTISRQFVQLMGGDIRLKSEVN RGSSFDFNIIVQQAQSCEVAPPIRKEKVIGLATGQRVYRILVADDRKENCDLLTQLLN SIGFETRAVANGQEAIAQWQTWHPDLIWMDMRMPVMNGYEATRQIRAAEQSNSTIITQ SQYSRTPIIALTASAFEEQRSSILGAGCDDLICKPFREEIIFNKMAEYLGVQYVYAEE QENFIQTRFISQGFTEVSQPGDCKNLLSTEARDSNDSRPVVATGFGKLTPQDLLIMPS EWITALHQAAIQVDAELIFQLIDVIPKTHHTVAQELTDLVDRFCFDEIIDLIPDDDGQ PAV" gene complement(1926..2333) /locus_tag="DP116_24895" CDS complement(1926..2333) /locus_tag="DP116_24895" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24895" /translation="MSANLTSLTSQLRATLGKMEVALGAIAYGRASLNADAIVWTGDD GKVQWCNAAFDKLVSQPHILVLNMKLSDLLPLKQAGEAVAPESYPNMRVLVQQYEATE YEFQQGDYSLNKLNLPRIYLNISSQTKASSSKS" gene complement(2357..2728) /locus_tag="DP116_24900" CDS complement(2357..2728) /locus_tag="DP116_24900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412137.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diguanylate cyclase" /protein_id="PRJNA477356:DP116_24900" /translation="MPQAVMKTFDEMKQETDVPVGKADHQGFVTYVNDCFTSVFGWST DEIKDQLITVIIPEGFHAPHHLGFSHFLTTEKSTILNHPLRLVGITKDGRKIEAEHLI MAEKHQGEWVFMAMLRPLNPA" gene 3059..4144 /locus_tag="DP116_24905" CDS 3059..4144 /locus_tag="DP116_24905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456475.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="spermidine/putrescine ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_24905" /translation="MAKRRHFLKSIAAVSSLSLASCGWRLGDVRAYSATKRHSDQLFV YTWEQYTDKELFKTFNAQTGTKLLADVFDSNETMLSKLQAGGGGAYSVIYPSDYMVRK MVDLGLLTELNHKRLIGLDNLFTRFQNPNYDPNNRYSLPFNWGTTGFLYNSEKLKTPP EDWDYLWQNQQQLSKRMTLMNDVREVMGAVLRMLGYSYNSKDENEIKQAYEKLKVLKP ALATFTTDAWRNQILAGDLLLAMCYSSDAVKISKENPKLKYVIPKSGSSLWTDTIVIP KTAPNIEGAYAWMNLMLLPEVAAQTSQRLSLSTPNRAGFEQLPKKVQNNSSLFPSESI LDKCERLAPLGQTQEVYERYWTQLTSG" gene complement(4169..4612) /locus_tag="DP116_24910" CDS complement(4169..4612) /locus_tag="DP116_24910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316967.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24910" /translation="MLYLAQVHKNEFLNQLQLRLLAREETENMWSLIPEEALILLGKG KSLTENLLVLVELSPAGDIERLENATNWVLNLVKAYLTTGITPEFLQQEADRAEEWRQ NLTLQNQDLARRSLELEARREQIQALEESLKREKSGVQKEESTSS" gene complement(4746..7469) /locus_tag="DP116_24915" CDS complement(4746..7469) /locus_tag="DP116_24915" /EC_number="6.1.1.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875830.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="valine--tRNA ligase" /protein_id="PRJNA477356:DP116_24915" /translation="MIASNINLPSLYEPFSTEAKWQKFWEENQLYKADPNKGGESYCI VIPPPNVTGSLHMGHAFEMSLIDTLVRYHRMKGRNTLWLPGTDHASIAVQTILEKQLK KEGKTRYDLGREKFLERAWLWKAESGGAIVHQLRRLGVSVDWSRERFTMDEGLSKAVL SAFVQLYEEGLIYRGEYLVNWCPASQSAVSDVEVENQEVNGNLWHFRYPLTDGSGDIE VATTRPETMLGDTGVAVNPNDDRYKHLIGKTVTLPIMNREIPIIGDEFVDPSFGTGCV KVTPAHDLNDFEMGKRHNLPFINIMNKNGTLNENAGSFQGQDRFVARKNVVARLEDDG FLVKVEDYKHTVPYSDRGKVPIEPLLSTQWFVKIRPLADNALEFLDQKNSPEFVPQRW TKVYRDWLVKLKDWCISRQLWWGHQIPAWYAVSETRGEITDNTPFVVAKSEAEAQEKL IAQFGEDVKIEQDPDVLDTWFSAGLWPFSTLGWPEQTQDLATYYPTTTLVTGFDIIFF WVARMTMMGGHFTGQMPFKDVYIHGLVLDENGKKQSKSAGTGIDPLLLIDKYGTDALR YTLVKEVAGAGQNIRLEYNRKTDESSSVEASRNFANKLWNAARFVMMNLDGQTPQQLG KPSVSELSDCWILSRYYQVVKQTNNYIDNYGLGEAAKGLYDFIWGDFCDWYIELVKSR LQKDSEPTSRRTAQQTLAYVLEGILKLLHPFMPHITEEIWQTFTQQSEDSKQSLSLQS YPEAETNLIDSTLEEQFELLFGTIRTIRNLRAEADVKPAIKVTVNLQTESEKEREILT AGQSYIKDLAKVENLVFAGEQNKETFADVVGTVQVLLPLAGVADINVLRAKIEKRLSK VEGEIKPLSNRLNNPNFVEQARPDVVEGARNALAEAEKQAEILRDRLRRLA" gene 7538..7957 /locus_tag="DP116_24920" CDS 7538..7957 /locus_tag="DP116_24920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747889.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GxxExxY protein" /protein_id="PRJNA477356:DP116_24920" /translation="MNRRGAENAERRELGEEMRQLTGVVIGAAIEVHRMLGPGFLESV YHKALEVEFQMRGIPYKSKPPVAVNYKGYQVGEGELDFLISDVLIVELKAVEKLAPIH EAQVISYLKMTNHSLALLINFNVPILKEGIKRIVLSS" gene 7997..8374 /locus_tag="DP116_24925" /pseudo CDS 7997..8374 /locus_tag="DP116_24925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410999.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="MFS transporter" BASE COUNT 2322 a 1939 c 1688 g 2473 t ORIGIN 1 gtctgttttc gtaatccctt tttataccct accacatttg actgcaactt agtattagta 61 cgaaattcca tttcatagtt ttgcaaagac ccggtttcta gtaatttgtc aatagtgttg 121 gtatactttt caaaagtaat tccattaagt tctgctacat tttgcccaat gatttcctct 181 aaagaatagc cgctcatggt gagataactg ctattaacat caagaaagcg tccttctttt 241 agagtggtga tagcgatcgg gcttgggctg gaacgaaaag ctttactaaa cttttcctca 301 gcttgtttgc gttcgtggat ctcttgttgt aacttggcgt tttgctcttg gagtgatgaa 361 cgcaaattgc gaatcgtcag atgattttct actcgtgcca gcacctcttg cacctgaaac 421 ggttttgtga tataatctac tccacctaca gaaaaagctt tgactttatc tagtacatca 481 ttcattgcac tgattaaaat cactggaata tcgcgagtgc gatcgctttc tttaagctgc 541 tgacaaacct catagccatt catttctggc atattgatat caagcaaaat taagtcgggt 601 ggtgctgcag cagctccccg caaaccagcc gaacctttgg tgacgctgcg aactttgtaa 661 ccttgttctg tcaacatagc cgcaaggaga tgcaagttct caggggtgtc atcaatcact 721 aagatgttag ctaatggtgt tttaaacggc gggctgtcca tcgtcgtctg gtatcagatc 781 gataatttcg tcaaaacaga aacgatccac taaatcagtg agttcctgag ctacagtatg 841 atgggttttg gggatcacat caatcaactg aaaaatgagt tcggcatcaa cttgaatggc 901 tgcttgatgt agcgcagtaa tccactccga aggcataatc agtaagtctt gtggtgttag 961 cttcccaaat ccagttgcga caacgggacg actgtcgtta gaatcacggg cttcggttga 1021 aagcagattc ttgcaatccc ctggctgaga aacttctgta aagccctggc tgatgaatcg 1081 cgtctgtata aaattctctt gttcctctgc gtacacatat tgtactccca gatactcagc 1141 catcttgtta aagatgatct cctcccggaa cggcttgcag atgaggtcat cacaacctgc 1201 acctaaaata ctggaacgct gttcttcaaa agcgctggct gtaagggcaa taatcggcgt 1261 tcgggaatat tgactttgag ttattattgt gctattactc tgttctgctg cccgaatttg 1321 tcgagttgct tcatatccgt tcatcacagg catacgcata tccatccaaa tcaagtcggg 1381 atgccatgtt tgccactgag cgatcgcctc ttgaccatta gctactgcac gtgtctcaaa 1441 accaatcgag ttgagtaact gtgtcaataa gtcgcaattc tctttgcggt catctgctac 1501 caaaattcgg tatactcttt gacctgttgc cagcccaatc actttttctt tacgtatcgg 1561 tggtgcaact tcacatgatt gtgcctgttg aacaatgata ttaaaatcaa 
agctagatcc 1621 acgattgact tcgctcttga ggcggatatc acctcccatc aactgcacaa attgacgact 1681 gatagtcagt cccaagcctg taccttcttt aacttgagtt gcacttgcag tctgaacaaa 1741 aggttgaaag agattgtcta attcctctgg tgcaattcct ttaccagtat cttggatttc 1801 aaataacaaa tgggcattgg gcaaaatatc tttaggattc tgtatttccg tcccttgtcc 1861 catttggact cttagggtga ctcctcctgt atctgtaaac ttgatagcat ttcccaaaag 1921 gttgatcaag acttgctgga gcttgccttc gtctgcgaag atatattgag gtaaattcgg 1981 ggcaaattca acttgttaag agagtagtcg ccctgctgaa attcgtactc cgttgcttca 2041 tactgttgaa ctagtacccg catatttgga tacgactctg gcgcaacagc ttcacccgct 2101 tgctttaggg gtaacaaatc gcttagcttc atgttcagca ccaagatatg gggttggctt 2161 acaagtttgt cgaaagcagc attacaccac tgcacctttc catcatcgcc cgtccaaaca 2221 atagcatcag cgttaagcga agctctgccg taggcaatcg cccccagcgc tacttccatt 2281 ttgcccagag ttgcccgtaa ttggctggtt aaactagtta aattagcact cattatggac 2341 tcctttgaca gtagaattaa gcggggttta ggggacgcag cattgccata aacacccact 2401 ccccctgatg tttctctgcc ataatcagat gctccgcttc aatttttcta ccatctttgg 2461 taattccaac caaacgcagc gggtgattga gaatagtgga cttttcggtg gtgagaaagt 2521 gggaaaagcc gagatggtga ggggcatgaa aaccttcagg aataatgacc gtaatcagct 2581 gatccttaat ttcatctgta ctccaaccaa aaacagaggt aaagcaatca ttcacatagg 2641 tgacgaaccc ttgatgatcc gcttttccca cgggtacgtc tgtttcctgc ttcatttcat 2701 caaaggtttt cataactgct tgaggaataa agagtcgaaa ataaactgtt tactatttac 2761 tacagatgtg gcattgaaaa ttcctacagc gcgtgtaaaa ctttcatgtg ttgggcgcac 2821 aacaaagaaa tgaagttttt cttacccctt caagacaaat ggtaccattg acgcacccac 2881 ccctttaagg ggtagggtgt acaaaaaagc gcattttttc gtccccgtta cacctttaca 2941 cctctacacc cctagttttt gtcaatatca tgcgttcatc agcgatgaaa tcagacttcg 3001 agacatatcc tcactctttg gtgcgatact gacgacattg actcttgagt tttaactaat 3061 ggctaaaaga cggcactttt taaaaagcat agcagcagtt tctagcttat ctttagcgag 3121 ttgtggctgg agactcggtg atgtgcgagc ttattctgct acgaaacgtc atagtgacca 3181 actttttgtt tatacttggg agcaatatac agataaagaa ttattcaaaa cttttaacgc 3241 tcaaacagga actaaactac tggcggatgt gtttgactcc aatgaaacca tgttaagtaa 
3301 gctgcaagct gggggtggag gcgcttatag tgtcatttac ccaagcgact acatggtgcg 3361 gaagatggtg gacttggggt tactaacaga attaaatcat aagcgcttaa ttggtttaga 3421 taatttattt acccggtttc aaaatcctaa ttacgaccct aacaaccgct atagtttacc 3481 ttttaattgg ggaacaacag gttttcttta taattctgaa aagctaaaaa ctccgccaga 3541 agattgggat tatctttggc aaaatcaaca acaactgtca aaacggatga cgttgatgaa 3601 tgatgtccga gaggtgatgg gtgcagttct gcggatgctg ggttactctt acaattcaaa 3661 agacgaaaac gaaatcaaac aagcatatga aaaattaaaa gttctcaaac cagcacttgc 3721 aacctttacc acagatgctt ggcgcaatca aattttggca ggagatttac tcttagcaat 3781 gtgttactca tcagatgctg tgaaaatctc taaagagaac cctaaactaa aatatgtgat 3841 tcccaaaagt ggttcctcac tatggacaga tacaatagtc attcctaaaa cagccccaaa 3901 tattgaaggg gcttatgctt ggatgaattt gatgctacta ccagaagtag cagcccaaac 3961 aagtcaacgg ctgagtcttt ctacacccaa ccgcgctggg tttgagcaat tgccaaaaaa 4021 agtgcagaat aattctagct tgtttccgtc agaatcaatt ctagataagt gtgaacgttt 4081 agctccttta gggcaaactc aagaagttta tgaacgttat tggactcaat taacaagtgg 4141 ctaggatact gaggctatca tccaaaatct aagaactagt gctctcttct ttttgaacgc 4201 cgcttttctc tcgtttcaaa ctttcttcta aggcttgaat ttgttctcta cgggcttcca 4261 attctaatga acgacgtgct aaatcttgat tttgcaatgt taggttttgt cgccactctt 4321 ctgctcgatc cgcttcctgt tgcaaaaatt ctggagtaat accagtggtt aggtaagctt 4381 tcaccaaatt tagcacccaa tttgtggcat tttccagtct ttctatatcg cctgcggggg 4441 aaagttctac caaaacgagt aagttctcag ttaaagactt gcccttcccc aaaagaatca 4501 aagcttcttc tggaatgagc gaccacatat tttcagtttc ttcacgtgct aataatcgca 4561 actggagctg atttaaaaat tcgtttttat gcacttgggc gagatacagc atagaatttt 4621 ctgtctgttt ataaacattt catacataaa tctagtataa attctttaga agctagtggt 4681 tagtagttaa gaatactttt atataagtaa ctactaacaa ccaacaacta aatcattcct 4741 tgaaactagg ctaacctacg taggcgatcg cgcaaaattt ctgcttgttt ttccgcctct 4801 gctaaagcat tccttgctcc ttctaccacg tctggtcgtg cttgttccac aaaattagga 4861 ttatttaacc tattactcag aggtttaatt tctccttcta ccttgctcaa gcgtttttca 4921 attttggctc gcagcacgtt gatatcagca acgccagcaa gaggaagtaa cacttgcacc 4981 
gtaccaacaa cgtccgcaaa cgtttcttta ttttgctcac cagcaaagac taaattctca 5041 acctttgcca aatctttaat ataagactgt ccagcagtga gaatttcccg ttccttctcg 5101 ctttcagttt gcaaattcac cgtcactttt atcgccggtt taacatccgc ttccgctcgc 5161 aaattgcgaa tcgtgcgaat tgtaccaaac agtagttcaa actgttcctc caaagtggag 5221 tcaattaagt ttgtctcagc ctcagggtaa gattgtaaag ataaactttg cttggaatct 5281 tcgctttgtt gcgtgaaagt ttgccaaatc tcctcggtaa tatgaggcat gaaaggatga 5341 agtaatttca aaattccttc cagcacatac gcaagagttt gttgtgcagt gcgacgagat 5401 gtgggttccg aatccttctg gagacgagat ttaacaagtt caatatacca atcgcagaaa 5461 tcgccccaga taaaatcgta aagtcccttt gctgcttctc ctaatccata gttgtcgata 5521 tagttattag tttgtttaac aacttggtag tagcgagaga gaatccaaca gtcacttaac 5581 tcactgacgc ttggttttcc cagttgttgc ggcgtttgcc catccaaatt catcatcaca 5641 aaccgcgccg cattccacaa cttgttcgca aaattgcggg atgcttccac cgatgacgac 5701 tcatccgtct tgcgattgta ctccaagcgg atattttgcc ctgctcctgc gacttcctta 5761 accaaggtat aacgcaacgc atcagtaccg tacttgtcaa tcagcaacag cggatcaata 5821 ccagtaccag cactcttaga ctgcttctta ccattttcat ccaataccaa cccatgaatg 5881 taaacatctt taaacggcat ttgccctgta aagtgcccac ccatcattgt cattctagca 5941 acccagaaaa agatgatgtc aaaaccagtg acaagggtag tggtaggata gtaagttgcc 6001 aaatcctgag tttgttcagg ccagcccaaa gtcgaaaagg gccagagtcc ggcagaaaac 6061 caggtatcca agacatctgg gtcttgttct atcttgacat cttcgccaaa ctgtgcaatt 6121 aatttctcct gtgcttcggc ttctgatttt gccacaacaa acggtgtatt gtcagtaatt 6181 tctccccgtg tttcactcac cgcataccaa gcagggattt ggtgtcccca ccagagttga 6241 cgagagatac accaatcttt cagcttcacc agccaatcac gatacacctt tgtccaacgt 6301 tgcgggacaa actctggcga atttttctga tcaagaaatt ctagcgcatt atctgccaga 6361 ggacgaattt tgacaaacca ctgagtcgag aggaggggtt caatgggaac tttaccgcga 6421 tcgctataag gaaccgtatg cttataatcc tccaccttca ctaaaaaccc gtcatcctcc 6481 agacgagcaa ccacattctt tctagcaaca aagcggtctt gtccttgaaa cgaaccagca 6541 ttttcattca aagtgccgtt tttattcata atattgataa acggcagatt gtgacgcttg 6601 cccatttcaa agtcattaag atcatgtgct ggagtcacct tcacacaacc tgtgccaaaa 6661 ctggggtcaa 
caaactcatc cccaatgatg ggaatttccc gattcataat aggcagagtt 6721 acagttttgc caatcagatg cttatatcta tcatcattgg gattcaccgc cactccagta 6781 tcacccaaca tcgtttctgg tcgagtcgtc gccacctcaa tatcaccaga accatccgta 6841 agaggatagc gaaaatgcca gagatttcca ttcacctcct gattttccac ctccacatcc 6901 gacaccgcag attgggaagc cggacaccag ttgaccaggt actcaccacg ataaattaac 6961 ccttcctcgt aaagttggac aaaagctgac aacaccgctt tggataagcc ctcatccatc 7021 gtaaagcgtt ctcgtgacca atcaaccgag acgcctaaac gtcgcaattg atgaacaatc 7081 gcccctccag attccgcctt ccatagccaa gcgcgttcta gaaacttctc gcgtcccaaa 7141 tcgtagcgag ttttaccttc ttttttgagt tgcttttcca gaattgtttg cacagcgatg 7201 ctggcgtggt cagttccggg aagccacagg gtattacgtc ccttcatccg gtgataacgc 7261 acaagggtat caatcagcga catctcaaag gcgtgtccca tgtgcaggct accagtaacg 7321 ttaggcggtg ggatcacgat acagtaagat tcaccacctt tattgggatc agctttgtaa 7381 agttggtttt cttcccaaaa tttttgccac ttggcttcag tggagaaggg ttcgtaaaga 7441 ctaggaaggt taatgtttga tgcgatcatg ctgggaaaag gatactgtgg aaggactttt 7501 ataaattttg ccacagggtt aaggaggaaa agaggaaatg aaccgcagag gcgcagagaa 7561 cgcagagaga agagagttag gagaggagat gagacaacta acgggagtag tgattggggc 7621 ggcgattgag gtgcatcgga tgttgggacc agggtttttg gagtcggttt atcacaaggc 7681 tttagaggtg gaatttcaga tgcgtgggat accttacaag tctaagccgc cagtagcagt 7741 gaattacaaa ggatatcagg ttggcgaagg cgaattagat tttctcatta gtgatgtcct 7801 cattgttgaa ttgaaagctg tggagaaatt agctcctatc cacgaagctc aagttatttc 7861 ttaccttaaa atgacaaatc attccctcgc ccttctcatc aacttcaatg tccctatcct 7921 caaagaaggt attaaacgta tcgtactctc ttcttaattc tctctgcgct ctctgcgcct 7981 ctgcggttca ttatctatga aaaatcgaac actgtcccgt tccccttgga ctttcatccc 8041 taccttatac tttgcctctg gcgtacctta cgtcatcatc aacacagttt ctgtcatttt 8101 ctacaaaaaa ctcggaatca ataatactca aatcgcctta tggacgagtt ttctttatct 8161 tccttgggtc atcaaaatgt tctgggcacc tatcgttgat acttactcaa ccaaaagaac 8221 atggattctt gccagccaat ttgctatgtt ctcttgtttg ggtttgatag ccttttcttt 8281 acagttacca aatttctttt ttatctccct agcagcattg accataggag cattcatttc 8341 cgcaacttat gatattgcta 
ctgatggttt ttatagtggg tgagatgaaa accctgtggc 8401 tttagcccag ggacgccaca tg // LOCUS NODE_3890_length_8339_cov_5.024626 8339 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8339) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8339) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8339 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..385) /locus_tag="DP116_24930" CDS complement(<1..385) /locus_tag="DP116_24930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874698.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sirohydrochlorin chelatase" /protein_id="PRJNA477356:DP116_24930" /translation="MASAYLLVSHGSHDPRPEIAMQHLAEQLCHKIQSDLVTMTTGGI TSQLKCETLVGTAYLELNPEPLHQQIIKFAKNALDSGCHSLKIQPLFLLPGVHVMEDI PAQVALAEQAISQDIKIFLQPYLGCC" gene complement(524..1222) /locus_tag="DP116_24935" CDS complement(524..1222) /locus_tag="DP116_24935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454989.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="manganese catalase family protein" /protein_id="PRJNA477356:DP116_24935" /translation="MFFHKKETIRPVNIADPNPRFAQLLLEQFGGATGELTAALQYWT QSLHCEHAGIKDMLQDIAIEEFSHLEMIGKLIESHTKNADQTEAYKSTLFAVRGVGPH LLDSQGQAWSAAYINEGGDVVRDLRADIAAEAGARQTYEALIKLAPDEGTKEALVHLL TREISHTQMFMKALDSLGKLTDPFFGNIQPDSTVDIYYNLSSNGKDERGPWNSEPTFR YIADPVAEQKKAEK" gene complement(1413..1793) /locus_tag="DP116_24940" CDS complement(1413..1793) /locus_tag="DP116_24940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198214.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24940" /translation="MRAITIITCLGVVAVSVLGMAMANTNPSQAKYEEYAVQRLSEYL KSNVCKKSQNPLENLIQMNCDKLLEAANPRMREIISISTEKQDFLIFSVYRTDLKLND WVPSYKFETVGALENFYTYNAQKQ" gene 1906..1978 /locus_tag="DP116_24945" tRNA 1906..1978 /locus_tag="DP116_24945" /product="tRNA-His" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:1939..1941,aa:His,seq:gtg) gene 2308..2394 /locus_tag="DP116_24950" CDS 2308..2394 /locus_tag="DP116_24950" /inference="COORDINATES: protein motif:HMM:PF05545.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24950" /translation="MVLYCCVLWMFRHKNRPCFKDYAEIINT" gene 2930..5350 /locus_tag="DP116_24955" CDS 2930..5350 /locus_tag="DP116_24955" /EC_number="2.4.1.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455178.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sucrose synthase" /protein_id="PRJNA477356:DP116_24955" /translation="MSELLQAVLDSEEKSDLRSLISELRQQEKKYLLRNDILNLYGEY CSKYEKPEQFYTASHLGKLVYYTQEIIQEESSLCFIIRPKIASQEVYRLTANLSIEPM TVQELLDLRDRLVNRYNPNEGDLLELDFGPFYDYTPVIRDPKNIGKGVQFLNRYLSSK LFADPKQWLESLFNFLRLHQYNGVQLLINQRIQSQQQLSEQVKKAISFVSSRPSEQPY EEFRFDLQVMGFEPGWGHTAQRVQESLSILDELIDSPDPQTLEAFISRIPMIFRIVLV SAHGWFGQEGVLGRPDTGGQVVYVLDQAKNLEKQLQEDAMLAGLEGLNVQPKVIILTR LIPNSDGTLCNQRLEKVYDTENAWILRVPLREFNPNMTQNWISRFEFWPYLETFAIDS EKELLAEFHGRPDLIVGNYTDGNLVAFLLARKLGVTQCNIAHALEKSKYLFSNLYWQD LEEKYHFSLQFSADLLAMNAANFIISSTYQEIVGTPDSVGQYESLKCFTMPELYHVVN GIELFSPKFNVVPPGVNENAYFPYTRTQDRVESDRARIEEMLFTLKDDSQIFGTLDDP TKRPLFSMARLDRIKNLTGLAECFGRSQQLQDQCNLILVAGKLRVEESGDNEERDEIV KLYQIIEQYNLYGKIRWLGVRLTKSDSGEIYRVIADHKGIFVQPALFEAFGLTILESM ISGLPTFATQFGGPLEIIQDKVNGFYINPTNLEETAEKLLEFVQNCEQNSNYWNQISK QAIDRVFSTYTWKIHTSKLLSLARIYGFWNFISKENREDLLRYIEALFYLIYKPRAQQ LLEQHKYR" gene complement(5401..6030) /locus_tag="DP116_24960" CDS complement(5401..6030) /locus_tag="DP116_24960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875406.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24960" /translation="MAVQAAMSIISIENKSILNPAEINIAYKISSNILRAKNFQVAAK DTTEDVITDTQCFHLCAEPEMLLPFDAIPQLVAEWENTVRNFQRSPLKESQKIMLAST SFFGFQVASEPSAKRVIRQRKQRATDIGNDQIVPQINTSQSLPVLSFRSSGLAVKVLQ QLLSSNGYTIRPDGVFGALTEAAVKAFQNKRNLPVDGIVGQRTWYELTK" gene complement(6644..6979) /locus_tag="DP116_24965" CDS complement(6644..6979) /locus_tag="DP116_24965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491911.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" /protein_id="PRJNA477356:DP116_24965" /translation="MAHQAQPEEQPLAKSESQPKAQPTSYPGIQPLAQSVKMQAVSSS GLPTLRFGNTGSTVRTLQRLLISNGYFVQVDGVFGALTEVAVRAFQSSRGLKADGVVG SRTWALLSG" gene complement(7311..8247) /locus_tag="DP116_24970" /pseudo CDS complement(7311..8247) /locus_tag="DP116_24970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119400.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 2471 a 1698 c 1674 g 2496 t ORIGIN 1 gacagcagcc taagtatggt tgtaaaaata tcttaatatc ttgactgata gcctgctctg 61 ccagtgcgac ttgggctgga atatcttcca tcacatgaac tcctggcagt agaaatagtg 121 gttggatttt gagactatga catccagaat ctaaagcatt cttagcaaat tttatgattt 181 gctgatgcag gggttcggga tttagttcca agtacgctgt accgaccaag gtttcacatt 241 tcaactgaga agttatacct ccagttgtca tagtgactaa atcgctttgt attttatgac 301 ataactgttc tgccagatgt tgcatagcaa tttccggacg cggatcgtga ctcccgtggg 361 ataccagcag atatgcagat gccattagca aggacgttac gtcaaatgtt tcagtttatt 421 aaatatagcg taggttgaat taggtgacaa aatgactatt agccaggagc tgttagttcc 481 tggctgttgt aaggattaat tgagaatctt gttagatagc gacttatttc tcagctttct 541 tttgttctgc aacggggtca gcaatgtaac ggaaagttgg ttcagagttc caaggaccgc 601 gctcatcctt accgttgcta gacaagttgt agtaaatgtc tacagtgcta tcaggttgga 661 tattaccgaa gaatgggtca gttaacttac ccaaagagtc cagcgccttc atgaacattt 721 gagtatgaga aatttcccgc gtcagaagat gaacaagtgc ttctttggtt ccctcatctg 781 gggctaactt aatcaatgcc tcataagttt gacgtgcacc agcttcagca gcaatatcag 841 cacgcaaatc gcgtactacg tcaccacctt cattaatata ggctgctgac caagcttgac 901 cttgactatc tagtaagtga ggtcctacac cccgcacagc aaacaaggta cttttgtaag 961 cttcagtttg gtcagcattt ttggtatgag attcgatgag cttaccaatc atttctaagt 1021 ggctaaactc ttcgatagca atatcttgca acatatcctt aattccagcg tgttcgcagt 1081 gtaaggactg tgtccagtat tgcaatgctg ctgttaattc acctgttgca ccgccaaact 1141 gctctagaag cagttgagca aaacgtggat ttggatcagc aatgttaaca ggtcgaattg 1201 tttctttctt atgaaaaaac acaatagttc tcctttcttt taggtatttg tgccaatttc 1261 ttaagagctt cgtgaatcag actcaagaaa atcggaaaat tccttactct cattacaaaa 1321 tgttaatggt ggcagcttgc taaaaccttg atcctgggaa agacttacta tctaagtagc 1381 tctatcgaaa gtcgctatta tttttatgta aactattgct tttgggcgtt gtatgtgtag 1441 aaattttcca aagcccccac cgtttcaaat ttataagaag gtacccaatc attaagtttt 1501 aaatctgtac ggtaaacact aaaaataaga aaatcttgtt tttctgtact aatagagata 1561 atttccctca tacgcgggtt tgctgcttct agcagtttat 
cgcagttcat ttgtattaaa 1621 ttttctaatg gattttgact tttcttacag acattacttt tcaaatactc gctcagcctt 1681 tgtactgcat actcttcata tttagcctga ctaggatttg tattcgccat tgccataccc 1741 aagactgata ctgcgacgac tcccaggcat gtaataatag tgatagctct catttgtctc 1801 aactgctatt tgccatagat attagactcc agattaactg gatccgttcc tttggcaaaa 1861 gcacttgatc tagtcagcag ctagtgttat aatatggatg taatggcgag cgtagccaag 1921 cggttaaggc agaggtttgt ggtacctcca ctggtgggtt cgactcccat cgttcgccct 1981 gaggtgattg ttagctgttg acaagaacct gacttagcaa tcaaaccgaa atcccccaac 2041 tcaagcttgg gggattttgt taatacaatt tttaacagtt atcagttatc agttatcagt 2101 catcagtcat caattgcttg ataactgttc actgcgtagg tagctgttca ctgattgaag 2161 gttagagaaa ttttttgtct tcattctcga aggaaacgtc cttggctttc tactgagaag 2221 ttatcctatc tagaaaatgc caagcagaaa aatgaaagtc aacccacata aggcaaggaa 2281 agagaaaata atgatgttgc ctatactgtg gtcttgtact gttgcgtgtt gtggatgttc 2341 aggcataaaa accgaccttg ctttaaagac tatgctgaaa ttataaatac ctgaacagat 2401 attttccttg gtagaataaa atacttgact ttaattcaat taagtttatc tttttcggta 2461 actaaaaaat aaaaaataaa cttttgtttc agttttttca gtaatgatca cgaaaaaact 2521 gaataagacg aaaagaagtt tgattgctac tcccaaatat atttcccctc gcaacttaac 2581 taaaatttat gtaagagtag ctgtaaaagt ctattctttg aaggaaaacg gatttttcca 2641 ggatatcttt aggtatctat actaacaaag accattaggt agcttgatgt tttcgcgctt 2701 tgtgcttaac aaaaagacaa taaagactac gcctttagtc agagtacaaa atactcatat 2761 ataagataag cgtaaaaagg tagagagatt taaaaatact cacagtagta ataattgcta 2821 attgccaagg tgtagtagtt gtttatacga ttaccaatgg gcagttggct attagctatt 2881 gctcagtcaa acgctgaatc caaaacctac aattatctca caagttacca tgtctgaatt 2941 acttcaagct gtgctagata gtgaagaaaa aagtgatttg cgttccctga tcagcgaatt 3001 acgtcagcaa gaaaaaaagt acttgctgcg aaacgacata ctcaatttat atggtgaata 3061 ttgctctaag tacgaaaaac ctgagcaatt ttacactgct tctcacttag gaaaacttgt 3121 ttactacact caagaaatca ttcaagagga atcaagcctc tgcttcatta tccgcccaaa 3181 gattgctagt caagaggttt atcgactgac agccaacttg agtatagaac cgatgacggt 3241 tcaggaactg ttggatttgc gcgatcgcct ggtcaatcga tacaacccca 
acgaaggcga 3301 cctcttggag ttggattttg gtccatttta tgactacacc ccagtgattc gcgatccaaa 3361 aaatattggc aaaggggtgc agtttctcaa ccgctatcta tccagcaagc tatttgctga 3421 ccccaagcaa tggctagaaa gcttgtttaa cttcttgcgc ctacaccaat acaacggtgt 3481 tcaactgctt atcaaccagc gaattcagtc acagcagcaa ctgtctgaac aagtcaaaaa 3541 agccatctct ttcgtaagtt ctcgtcccag tgaacaacct tacgaagaat tccgatttga 3601 cttgcaagtg atgggctttg aaccgggttg gggacacact gcgcagcgag tccaggaaag 3661 cttgagtatt ctggatgaat tgattgactc tcctgatcct cagaccttgg aagccttcat 3721 ctctcgcatc ccgatgattt ttagaatcgt cttagtttct gcccacggtt ggtttggaca 3781 agagggagtt ttagggcgtc ccgacactgg cggtcaagtg gtttacgtcc tcgaccaagc 3841 gaagaacttg gaaaagcaac tgcaagaaga tgcgatgctg gctggtttag aaggattgaa 3901 tgtccagccg aaggtgatta ttctcacccg ccttatcccc aacagtgatg gcaccctttg 3961 taaccagcgc ctagaaaaag tttacgatac agagaatgcc tggattttgc gagtgccttt 4021 gcgggaattc aatcccaaca tgacacaaaa ctggatttcg cggtttgaat tttggcccta 4081 tctggaaacc ttcgccattg attctgaaaa agaactgctg gcagagtttc atggtagacc 4141 agatttaatt gttggcaact atactgatgg aaatttggta gcgtttttgc tagcacgcaa 4201 actaggtgtc acacagtgta acattgccca tgcgttagaa aagtccaaat acttgttcag 4261 taacctctat tggcaagatt tggaggagaa atatcatttt tcgttacaat tcagcgctga 4321 cttgttggcg atgaatgctg ctaactttat catcagcagc acttaccaag aaattgtggg 4381 cacgccggat agtgtaggac agtacgaatc actcaagtgc tttaccatgc cggaactgta 4441 ccacgttgtc aatgggattg aactgtttag tcccaagttt aacgtggtac cgcctggggt 4501 aaacgagaat gcttatttcc catatactcg tacgcaagac agagttgaga gcgatcgcgc 4561 cagaatagaa gaaatgctct tcactctcaa agacgactct caaatttttg gtactcttga 4621 tgatccaacc aagcgcccac ttttctcaat ggcgcgtctt gaccgtatta aaaacctcac 4681 tggtttagcc gaatgctttg gtcgcagtca acagttgcaa gaccaatgta acctcatttt 4741 ggtagcgggt aagttgcgtg tagaagaatc aggcgacaac gaagaacgcg atgagattgt 4801 caaactttac caaatcatag agcagtacaa tctctatggt aaaattcgct ggttgggtgt 4861 gcgtttgacc aaatctgact ctggcgaaat ctatcgggtc attgctgacc acaaaggaat 4921 ctttgtacaa cctgctttat ttgaagcatt tggcttgaca attttagaat cgatgatttc 
4981 aggattacca acttttgcta cgcaatttgg tggcccatta gaaatcattc aagataaggt 5041 aaatggcttt tatatcaacc caaccaattt agaagaaaca gccgaaaaac ttcttgagtt 5101 tgtccagaac tgcgaacaaa attccaatta ttggaatcaa atttccaaac aagccattga 5161 ccgagttttt agcacttata catggaaaat tcacacttcc aagctgctat cattagcacg 5221 catttatggc ttttggaact ttatctcaaa agagaatcgg gaagatttgc tgcgctatat 5281 cgaggctttg ttctatttaa tttacaagcc aagagcacaa cagctattag agcagcataa 5341 gtatcgttaa gttgtcagtc atttgtctat tgtcctttga aaagaacaat agacaaatga 5401 ttattttgtc aattcatacc aagttctttg accaactatg ccatccactg gtaaattccg 5461 tttgttttgg aaagctttaa cagcagcttc cgtcaaggca ccaaaaaccc catcaggtcg 5521 aatagtataa ccattagacg ataacaactg ttgcaagact tttacagcaa gacctgagct 5581 acgaaaactc aaaactggca aagattgact agtatttatt tgaggaacaa tttgatcatt 5641 acctatatca gtagctcttt gctttctttg acgaataacc cttttagcag aaggttctga 5701 agctacttgg aacccgaaaa aagaagtaga agcaagcata atcttctgac tttcttttag 5761 cggactcctt tgaaaatttc gcacagtgtt ttcccactct gctaccaact gaggaatagc 5821 atcaaagggc agaagcattt ctggctctgc acatagatga aagcattgag tgtctgtaat 5881 cacatcttct gttgtgtcct ttgctgcaac ctgaaaattc tttgctctaa gaatattaga 5941 agatatttta taagcaatat tgatttctgc tggattaagt atacttttgt tttctatact 6001 tatgattgac atagcagctt gaacagccat atcaggctgc acaaattcag gtggtgtaat 6061 ttgagcagtt gacactatct gagatagttg cctttgtgtt aacttctcct ggttattgtc 6121 tcgctgaact agtcgcagtt gcggtgcaat ggtcggagat agttgtcctg ttgttaacac 6181 gctcgtcatc agcagggcaa tatctgtcat tgtatttcat tttactaaca cccatgtttc 6241 ctttcacagg aaaaattgct ggtgttccgg aatagccagg acagaaaaat aaaaaaatct 6301 cataaattct ttgaaaagaa tcagcaaaat ataattcagc agtatgtgtt aaggtgtaat 6361 gcattcaaaa tctgctgcgg acttgtgaag gtcgtccttt gtcaaaagtc atttgtcatc 6421 tgtactgctt tggacaaatg aaagtcttca gatcacgata tgtagattat accgcagcag 6481 cttatttgtc aaggtatata caaaatacat aaaatataga agactcatga aatccaaaag 6541 tttttttcca gaaatcctga ggaattaatt ataaaaaata cgaattatga gttatcaaaa 6601 gtttcacaat ccataattaa tagtatatcg actcaatttt tgactagcct gacaataaag 6661 
cccaagttct ggaaccaacg acaccgtccg cttttaaacc gcgtgaactc tggaaggctc 6721 taacagcaac ctctgtcagc gccccgaaaa ctccatcaac ttgaacaaag tagccattag 6781 agattaaaag tcgctgtaaa gttcttacag tgcttcctgt gttaccaaac cgtagagtag 6841 gtagacctga actggagaca gcttgcatct tgacagattg tgctaatggt tgtatgccgg 6901 gataggatgt cggttgtgcc ttaggctgtg actcagactt tgctaatggt tgttcttcag 6961 gttgtgcttg atgtgccata aaatcaattt ataaaactat ttttcataaa tttcaaccgt 7021 atctatattg ccaaagaaat cactcaccca gaagaagtag ttttcgcttc ttggcagtgc 7081 gtacataggt caacagccag catgagttgc tcgctattat tctacggata ttattcatat 7141 tcatgtcatg aactttgttt ccgcattttt tgctaatttt aattgagaaa aatgagaaag 7201 tggggaaaaa taggaagata ggggaaaatt ttctccccta ctagtgcagg ctaaatttat 7261 tactttatca gttaccagtt accagttacc agtgtcgtgt cccgttggat ctaaaaccct 7321 tcttcacaaa tggaatcaaa agccttaata taacttttgc gccgccagag aaaagctgca 7381 cccaaagcac ctaaacttaa cagaccgaat atagaatctg attccgggac ttttttgaca 7441 agagtgtcat aagtgtagcc gtacttccgc caatccagag tgtctaccgg tctgacaata 7501 tcatcggatg cgagacttgg taaatatggt ttgatatcta tgctaccaat tgaaccgggt 7561 atagaatcac ctccgtccca aggccacatt cgatcataga gattgtgtcc gtaaggttct 7621 gtacctggaa cagcataaaa ttcgctacct tggcgaccat tattctgcca ttcggcccaa 7681 agacgatcta tgttggtatg aagcagccag aaaactgggt cgttgggaga accaagagtg 7741 ttactcatgg tgccgaatgt tagtatgcgt cgatccgtct cgtttggatt aggattagcc 7801 acactcccac caaccaatcc atgaacgagg ttatgattac atgggttttg acattcactt 7861 tgagtgccgt ctgcatcgac ttaaatattc acggtgccaa ggtaaaaatc ccccatgttc 7921 atgagccgca tcagaaccac ttgcaggtcc tgtagaaggt gtgtcttgcg gtaacataac 7981 ctgaccaccc aattgcatgg ggtcgaatgt cattgctcct gcatgaatgg caacagtctc 8041 gtcatacaga ctcacgccct tgtcattatt gccaggtatg gtttttaatg ttttgatagc 8101 attaacaaag tctgcttttt cctccggtgt caggtcaaga acgttctttc tcaccaaggt 8161 ggaagccatt gctggaattt ccaacatagt tatggcaata gccatcccaa cacttattgt 8221 tgtgatgatt ttttgaataa gcttcatagt cttgtctgta ctttactaga aacgtctaga 8281 acttacgcat tgacaagcaa gacaaaaagt gcataaggag aatgcagaaa aaaacagcg // LOCUS 
NODE_3900_length_8321_cov_4.305589 8321 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8321) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8321) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8321 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 452..1162 /locus_tag="DP116_24975" CDS 452..1162 /locus_tag="DP116_24975" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24975" /translation="MNFSDQVGHAIRFKNNDSEPRINQQLSGRFRINVAEAAKLASMQ RNREDALSGFVRMKIREWKASNRELQDLARIARIAKSAPSNVLGGMGVGNKTGPGYAR AFGFASYDDMRAAAYRWWQESGAEGAKQASDPDSAAGKAVDAVLGLGQGTRPQLETIV AAYAHPRFTGRDVDWWVTTLLEEVRLDRASAERDALQRKVTRRTQKEIRETAAKRREP KVEPEPSSDGHRRKAIGG" gene complement(1149..1580) /locus_tag="DP116_24980" CDS complement(1149..1580) /locus_tag="DP116_24980" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24980" /translation="MEKSAKRSLAIAAFLAFWAIAGVFGWAYWQDAKADADRNGVEMT ASYGGSCGPDYASVVARNRTSKTVRKVNFHLNVYERGNSENLNHEWPEWTYVLKPHAQ ESACFRLPPSARPRDGKALNLTLDARPWSVDYFAEGDFIPQ" gene 1666..2052 /locus_tag="DP116_24985" CDS 1666..2052 /locus_tag="DP116_24985" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24985" /translation="MSRDEWFSTDYALRSAPFEPVTGCHWCNGTSPEDDGPCEKQSCQ DAAEDDHRLARILSARKAADACVVARVRAMQMAARYIAEGDSPYSTRVVSCREQADAW ASDAEGWARKARELSSVTVDAVEAAQ" gene 2049..2327 /locus_tag="DP116_24990" CDS 2049..2327 /locus_tag="DP116_24990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24990" /translation="MTARNLTHEVETSVTLRGEDYWASGVVTLIDCHQLAVAHNEEPW DHLEIQSVVEDRFTASPRSKPVDLLPLLSADERHMVATALYEAVCREL" gene 2469..3389 /locus_tag="DP116_24995" CDS 2469..3389 /locus_tag="DP116_24995" /inference="COORDINATES: protein motif:HMM:PF12705.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_24995" /translation="MKTLSHNQIRTWRNCKRQYGFGYVQLRVPRKTPTPLLVGRSWDD CLQEWWQGEGAQDRLLRAAKVAMHEGDPFARAKLCAMLIGYSAMWGDVPCRLIATQVE FTVPIVHPVTGAVHPECNLTGFIDAVAEIDGRLVVVESKSSGEDITLGSAFWRRVAVM DPQVSTYLPAARALGYDVRDCIYDVARKPELRPKKDETPDAFQERIVLDMHARPEWYF QRQTLVRLEREEREHALDVWHFADEIVSASRTGRYPRNPDQCQKFGRACDYAPVCAGE ASLDDPHLFKDAAKRERKDNAGTNSARPAA" gene 3328..4242 /locus_tag="DP116_25000" CDS 3328..4242 /locus_tag="DP116_25000" /inference="COORDINATES: protein motif:HMM:PF13479.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_25000" /translation="MRRRERGKTMLERIQRGPQRRPLAVCVYGVPGVGKTTFGASAPG ALVWCLEDGASFIDVAKLPAPESWNEALGVLQELADKPHDFKTLVVDTLDALEVLAVA HVCAEAKKKTLADFAWGGGYAALANEWRRFLSGLDNLRAKRAMHIVLIAHEHRKRHDD PELGQYEMYRPKLQEKAWGLTNEWCDAVLFAQFDAAVFEKEGQKNRAIVSGRRVLRTV RETGFVAKNRLGLPRKVDLDWKSFEAAAVPPSTDVLRKELVELCTLAGEDVATKAHGF LADRGESVETLRAAIETVKTYLAEKSAA" gene 4266..4829 /locus_tag="DP116_25005" CDS 4266..4829 /locus_tag="DP116_25005" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25005" /translation="MIKQDGKYRAKAVDVMLGESGVKGTPYVGVEFKVVEQGESLGET VKWSGWLSEKAGPSGKTVAERTIESLRACGWTGDDLGCFATGLHGLDANEVEIVVEMK PYDGPNDSHKGKSFPEVKWVNSLAGGRGLRKDDAMDAVKAARLGERFRGLALALKPAE QPSNDQLQGTSFPYGANAAGGGKGRAF" gene 5533..6297 /locus_tag="DP116_25010" CDS 5533..6297 /locus_tag="DP116_25010" /inference="COORDINATES: protein motif:HMM:PF13392.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25010" /translation="MSLRPHLGHERGPRAARPRRGETANGPRSGVRSRAGRDVPSLRA PHHPDERTAVAGGGRVGAEERHRTRLVRGGAIVKPDTWLIYEDVPPDAKEIPGYPGYF VSDGGKVWNTNYFRRPHVLRPLRSAKGHLSVQLWVNGKPSRQQVHRIVLLAFVGPCPE GMEACHFPDPDPSNNFVANLRWDTSTENSADQKRLNRHPHGVKHGQAKLTEDDVREIR RLRSEGKLHREIALIFGISRKQVSDICIGRYWSHVA" gene 6294..6929 /locus_tag="DP116_25015" CDS 6294..6929 /locus_tag="DP116_25015" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25015" /translation="MKIDTPQPPRPTIDQVNAAYREGLFACGIRTSYELYLVRLGGQP IPMRGVTWIPGNRSMWSQTPQGWEPFDWDESAPSWFKSDSLVFPFGRVPVAEWRGWPH YNWQEIASLVQWPGNSGNFIPFREWKVEHVFLGTFGEFDSAKRLNLAGLWSIERCIAF PDEAPFARAAWSDPVNYLLYADHVDSQGCSLTAMALRRVAELTAKEAEVKA" gene 6932..7375 /locus_tag="DP116_25020" CDS 6932..7375 /locus_tag="DP116_25020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009298420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25020" /translation="MSIHPLTLKEAARFVAAHHRHHRAPQGGLFAIGCAADGESSPRG VVIVGRPVSRMLDDGWTAEVTRLCTDGSRNACSILYAAAWRAARAMGYRRLVTYILDS EPGTSLEAAGWKCVGEAGGGKWSRVDRPRVDLHPTQKKMRWEATQ" gene 7372..7641 /locus_tag="DP116_25025" CDS 7372..7641 /locus_tag="DP116_25025" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25025" /translation="MTPFPDLSGLRRAGFDREADLLAELMRRPAWWIVRRAGTVGPEF DARRWDVREAAETTAKFWDDYYAPTKHSVVPLTAAAVIADQRGES" gene 7638..>8321 /locus_tag="DP116_25030" CDS 7638..>8321 /locus_tag="DP116_25030" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25030" /translation="MSDDIRDLITILCDDPAELRAVFGPLAELPAWWGVRDSIWGWRS YPATRDLAEQDRAYFDRVPLSGAVTVHAXXXRHHARGGDRRPKARRAVSATKPKPIVV PKRGVSPILRTFFERWRDDLPEADREMLRPYAPRLKGTGDGRDGVRPWLIIDWWVRAQ TPAWLDLARLPTEAASLRGLNELRAGVDLTAANKAANEVKARAVATRAATWAATWAAT WAATRDATWD" assembly_gap 7853..7862 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 1470 a 2631 c 2908 g 1302 t 10 others ORIGIN 1 ccgaagcacc accccacgaa cccgccagcg gcgaagccgg cgaggaagcc gagggacgcg 61 gcggtctcca gcgcggcagc tgtcagagcg ttgacagatc gagaatcatt gtattttaca 121 cgcattgttc aggtacccct ctcggtcagt tcagccgcac gctcgtgcag cactgcgggc 181 gtttgtcgga gcggaaatcc gacgcgttca gcatcgacag cgcgaccgga cggtcgcagg 241 ctcgatgcgt ggtcactcgg cggcggacga agacgactcg accggcgcct ctgccggaac 301 atcgcccttc gaccacttcg cgatccgatc tcgcgtctcg cggcggggtg tcgcgccgtg 361 acaccagtag tggaccttgg aaggcgcgac acgaagcgag aacgcggcct tctgctgcgc 421 gtgctcgccg aaccgcttgg agatgtgcgc ggtgaacttc tcagaccagg 
ttggacacgc 481 catacggttc aagaataatg attctgaacc gagaatcaac cagcagcttt cgggccgttt 541 cagaatcaat gttgcagaag ccgctaagct tgcgtctatg cagcggaatc gcgaggacgc 601 gctaagcggg ttcgttcgga tgaagatccg cgaatggaag gcaagcaacc gcgagctcca 661 ggacctggcg cggatcgctc gaatcgcgaa gtccgcgccg tccaacgttc ttggtgggat 721 gggcgtcggg aacaaaaccg gccccgggta cgcgcgcgcg ttcggcttcg ctagctacga 781 cgacatgcgg gctgcggctt accgctggtg gcaggagagc ggtgcggagg gcgcgaaaca 841 ggcatcggat ccggactcgg ccgcgggcaa ggccgtcgac gccgtccttg ggctaggcca 901 ggggacacgg ccgcagcttg agaccatcgt ggcggcgtac gcacacccaa gattcactgg 961 ccgcgacgtg gattggtggg tcacaacgct cctcgaggag gttcgcctag accgagcatc 1021 ggccgagcgc gacgcgctcc agcggaaggt cacgcgtcgg acgcagaagg agatccgcga 1081 gacggccgcc aagcgacgag agccaaaggt cgagcccgag ccgtcatcgg acggccaccg 1141 ccggaaggct attgggggat gaagtcgccc tccgcgaagt agtcgacgct ccaagggcgc 1201 gcatcgaggg tcaggttcaa cgcctttccg tcgcgcggcc gagcgctagg cggcaagcga 1261 aagcacgcgc tctcctgcgc atgaggcttc aagacgtacg tccactcggg ccactcgtgg 1321 ttcaggttct ctgagttgcc gcgttcgtag acgttgaggt ggaagttcac cttccgaacc 1381 gtcttggacg ttcggttgcg tgccacaacg ctcgcatagt cggggccaca cgagcctccg 1441 tacgacgccg tcatctcgac gccgttccgg tccgcatcgg ccttcgcgtc ctgccagtag 1501 gcccagccga acacaccagc gatggcccag aaagcaagga acgcggcgat cgcaagggac 1561 ctcttggcgc tcttttccat ggagtcgacg ctaccaggct cggtgagaca ctatttctga 1621 accgaccgct tgacgtctgg ttcagaatca ttattcttga ttcgcatgag ccgcgacgaa 1681 tggttttcga ccgactacgc gctgcgtagc gcgccctttg agcccgtcac cggctgccac 1741 tggtgcaacg gcacgtctcc ggaagacgac ggcccctgcg agaagcagag ctgccaggac 1801 gccgccgagg acgaccaccg gctggcgcgg attctgagcg cccggaaggc ggcggacgcg 1861 tgcgtggtgg cgcgggtcag ggccatgcag atggccgctc ggtacatcgc cgagggggac 1921 tcgccgtact cgacgcgtgt ggtgtcctgc cgcgagcagg cggacgcgtg ggcctcggat 1981 gccgaggggt gggcgcgcaa ggcgcgggag ctctccagtg tgacggtgga cgcggtggag 2041 gcggcgcagt gaccgcccgc aacctgaccc acgaggtcga gaccagcgtg acgctccgtg 2101 gcgaggacta ctgggcctcg ggcgtcgtca ccctcatcga ctgccaccag ctcgccgtgg 2161 
cgcacaacga ggagccttgg gaccacctcg agatccagtc ggtggtcgag gaccggttca 2221 cggcgtcgcc ccggtcgaag ccggtggacc tcctcccgct gctctcggcc gacgagcgcc 2281 acatggtggc gacggctctc tatgaggcgg tgtgccgtga gctctgacct cttcgcccct 2341 ctgcgggccg tcatcgagcg gcttgaggcg gagcgcgacg agtaccgcgc tgacaacgaa 2401 tggctggagc gcgagaacgc gcggctccgg cggcgggttg aggagctgga ggggaaggag 2461 cgcgccgcat gaagacgctc tctcacaacc agatccgcac ctggcgcaac tgcaagcgtc 2521 agtacggctt cggctacgtc caactgcgcg tccctcgcaa gacgcccacg ccgctcctcg 2581 tgggccgctc gtgggacgac tgcctccagg agtggtggca gggcgaaggc gcgcaggacc 2641 gtctgctgcg ggctgcgaag gtggccatgc acgagggcga cccgttcgcg agggccaagc 2701 tctgcgcgat gctcatcggc tactcggcaa tgtggggcga cgtcccgtgc cgcctcatcg 2761 ccacgcaggt cgagttcaca gtccccatcg tccacccggt aacgggtgcg gtgcatcccg 2821 agtgcaacct gaccgggttc atcgacgcgg tggccgagat cgacggtcga ctcgttgtcg 2881 tcgagtcaaa gtcctcgggc gaggacatca cgctcggctc ggcgttctgg cgtcgggtgg 2941 cggtgatgga tccgcaggtt tcgacctacc tccccgcggc ccgcgcgctt ggctacgacg 3001 tgcgcgactg catctacgac gtcgcgcgga agcccgagct ccggccgaag aaggacgaga 3061 cgcccgacgc gtttcaggag cgcatcgtgc tcgacatgca cgcgcgcccc gagtggtact 3121 tccagcggca gacgctcgtg cgcctggagc gtgaggagcg ggagcacgcg ctcgacgtct 3181 ggcacttcgc cgacgaaatc gtgtcggcct cgcggaccgg ccgttacccc cgcaacccgg 3241 accagtgcca gaagttcggc cgggcctgcg actacgcccc cgtgtgcgcg ggtgaggcgt 3301 ccctagacga cccgcatctc ttcaaggatg cggcgaagag agagaggaaa gacaatgctg 3361 gaacgaattc agcgcggccc gcagcgtagg ccgctcgcgg tctgcgtgta cggggtgccc 3421 ggcgtcggca agacgacgtt cggggcgagt gcgcccggtg cgctcgtgtg gtgtctcgag 3481 gacggcgcgt cgttcatcga cgtggcgaag ctgccggcgc ccgagtcgtg gaacgaagcg 3541 ctcggcgtgc tccaggagct ggccgacaag ccgcacgact tcaagacgct ggtggtcgac 3601 accctcgacg ccctcgaggt gctggcggtg gcgcacgtct gcgccgaggc gaagaagaag 3661 acgctggcgg acttcgcgtg gggcggcggg tacgcggccc tggcgaacga gtggcgtcgg 3721 ttcctctccg gtctcgacaa cctccgggcg aagcgcgcga tgcacatcgt gctcatcgcg 3781 cacgagcacc ggaagcgcca cgacgacccg gagcttgggc agtacgagat gtaccggccg 3841 aagcttcagg 
agaaggcgtg gggcctgacg aacgagtggt gcgacgccgt gctgttcgcg 3901 cagttcgacg ccgcggtgtt cgagaaggaa ggccagaaga accgcgccat cgtcagcggg 3961 cggcgcgtgc ttcgcaccgt ccgcgagacg gggttcgtgg cgaagaaccg cctcggcctg 4021 ccgcgcaagg tggacctcga ctggaagtcg ttcgaggccg ccgcggtgcc gccgagcacc 4081 gacgtgctgc gaaaggagct cgtcgagctc tgcacgcttg cgggggagga cgtagccacg 4141 aaggcgcacg gcttcctcgc ggaccgcggc gagagcgtcg agacgctccg tgccgccatc 4201 gagaccgtga agacctacct ggccgagaag tcggcggcct gacccgagag accaaaggag 4261 agaccatgat caagcaggac ggcaagtaca gggcgaaggc ggtggacgtg atgctcggcg 4321 agagcggcgt gaagggcacg ccctacgtcg gcgtcgagtt caaggtggtg gagcagggcg 4381 agagcctcgg cgagaccgtg aagtggagcg gctggctcag cgagaaggct ggcccctcgg 4441 gcaagacggt cgccgagcgc accatcgaat cgctccgcgc gtgcggctgg acgggcgacg 4501 acctcggatg cttcgcgacc ggcctgcacg gcctcgacgc gaacgaggtg gaaatcgtcg 4561 tcgagatgaa gccgtacgac gggccgaacg acagccacaa gggcaagtcg ttccccgagg 4621 tgaagtgggt gaactccctc gccggcggtc gtgggctccg taaggacgac gccatggacg 4681 ccgtgaaggc ggccaggctc ggcgagcggt tccgcggcct cgctctcgct ctcaagccgg 4741 cagagcagcc gtcgaacgac cagctccagg gcacgtcgtt cccctacggc gcgaacgccg 4801 ccggtggtgg caagggtcgc gcgttctgag tgactgccgg gccgttgcga gcggcccacc 4861 tatcccccgc gcgacgccta tccccccaag gccggcgcga ttcgggcttc gaggcccgac 4921 gggggaccac aaccgcccct cacccacgac aacggagatt gaccatggcc acgaagaaga 4981 agatcgcgac gacgaagaag accactggcg aacgctacgt gctcgtcacc accgcgcacc 5041 ggggcgtgtt cgcaggattc gcgaaggaga cggacggcga cgtcatcaag cttcgcgcgg 5101 gacgcctctg cgtctactgg agccgtgacg tgaagggttt catggggctc gccgcgaacg 5161 gcccgagcgc gtcatgcagg atcggcccgc ccgcggacat tgagctgcgg tccatcacga 5221 gcgttgtcgc cgtgacggac gaggcgaagg ctcgctggga ggcggcgccg tggtcgtcct 5281 gaggggcgct atcccggagt ggtgcggaag ctccggctac ggctccggct acggctacgg 5341 ctacggcgac ggcgacggct ccggctacgg ctacggctac ggctccggct acggcgacgg 5401 ctccggctcc ggctcttaac cccttgggag actccttgtg aatctcacga ttgtcgtggc 5461 cgaccgtggg ttcgtcttcg tcgctgcgat cgagcagcac ccgcgcgacg ctcagatgtt 5521 cctcgcctac ggctgtcatt 
gcgtccgcat ctggggcacg aacgcgggcc tcgggcagct 5581 cgcccgcgaa ggggcgagac cgcaaacggt cctcgatcgg gagtgcgatc acgggccggt 5641 cgagatgtcc cgtcactacg tgctccgcac catcccgacg agcgaacggc tgtggccgga 5701 ggtggtcgag tgggcgcgga agaacggcat cgaacaagac tcgttcgcgg aggtgcgatc 5761 gtgaagcccg acacgtggct gatctacgaa gatgttcccc cggacgcgaa ggaaatcccc 5821 ggatatccag ggtacttcgt ttctgacggc ggaaaagttt ggaacaccaa ctacttccgc 5881 cgaccgcacg tccttcgacc actcaggagc gcaaaggggc acctctctgt gcagctgtgg 5941 gttaatggaa aaccctcgcg acagcaggtg catcggatcg ttctgttagc gttcgtcggc 6001 ccatgcccag aggggatgga ggcatgtcac tttcctgacc cagatccatc caacaacttt 6061 gtggcgaacc tcaggtggga cacctccacc gaaaactctg ctgaccaaaa gcgactcaac 6121 aggcacccgc atggagtgaa gcatgggcag gcgaaactga cggaagacga tgtcagggaa 6181 attcggcgtc tccggagcga ggggaaattg catcgcgaga ttgccctaat cttcggcatc 6241 tctcgcaagc aagtcagcga tatttgcatc ggacgatact ggagccacgt ggcatgaaaa 6301 tcgacacgcc ccagccgccg cggccgacga tcgaccaagt caacgccgcg tatcgcgagg 6361 ggctgttcgc gtgcggaatt cgcacgtcct acgaactgta tttggtgcgg ctgggcgggc 6421 agccgattcc gatgaggggc gtgacgtgga ttccgggaaa ccgatccatg tggtcgcaga 6481 ctccgcaagg ctgggagccg ttcgattggg acgagagcgc tccgtcgtgg ttcaaaagcg 6541 actcgctcgt cttccccttc ggccgcgtgc cggtcgccga gtggcgcgga tggccgcact 6601 acaactggca ggagatcgcc tctctggttc agtggcccgg caacagcggc aacttcattc 6661 cgttccgtga gtggaaggtc gagcatgttt ttctcgggac gttcggcgag ttcgattccg 6721 ccaagcgact caaccttgcc ggcctctggt ctatcgagcg ctgcatcgcc ttccccgacg 6781 aagccccgtt cgctcgcgcc gcgtggtccg atccggtcaa ctatctgctg tacgcggacc 6841 acgtcgattc tcaggggtgt tccctgacgg cgatggcgct gcggcgcgtg gcggaactca 6901 cggcgaagga agcggaggtg aaggcgtgag cctgtcgatt cacccgctga cgctgaagga 6961 ggccgcgaga ttcgtagcgg cccaccatcg gcatcatcgc gcgccgcagg gcgggttgtt 7021 cgccatcggg tgcgcggccg acggggagtc atcgccgcgg ggcgtcgtga tcgtcggtcg 7081 ccccgtgtcc cgaatgctcg acgacggctg gacggcggaa gtgactcggc tctgcacgga 7141 cggcagccgc aatgcgtgct cgatcctgta cgcggctgcg tggcgtgcgg cgcgagcgat 7201 gggctatcgg cggctggtga cctacatcct 
cgacagcgaa ccggggacga gccttgaggc 7261 cgcgggctgg aaatgcgtgg gcgaagcggg cggcgggaag tggagccgag tcgatcggcc 7321 gcgtgtggac ttgcacccga cccaaaagaa gatgcgatgg gaggcaaccc aatgaccccc 7381 ttccccgacc tttccggtct gcgtcgcgcg ggcttcgacc gcgaggcgga tttgctcgcc 7441 gagctgatgc ggcggccggc gtggtggatc gtccgtcgag ctgggaccgt cggacccgaa 7501 ttcgacgcgc ggcgatggga tgtgagagag gccgccgaaa cgaccgcaaa attctgggac 7561 gactactacg ccccgacgaa gcattccgtc gtcccgctca cggcggcggc cgtcatcgcg 7621 gaccagcgag gtgagtcgtg agcgacgaca tccgcgacct gatcacgatc ctgtgcgacg 7681 acccggccga gctgcgggcg gtcttcgggc cgctggccga gctgccggcg tggtggggcg 7741 tgcgggacag catttggggc tggcggagtt acccggcgac gcgcgatttg gccgagcagg 7801 atcgcgcgta tttcgatcgg gtgccgctta gcggagccgt caccgtccac gcnnnnnnnn 7861 nncgccatca cgcccgcggc ggtgatcgaa gaccgaaagc gaggcgtgct gtgagcgcga 7921 ccaaaccaaa gccgatcgtc gtccccaagc ggggcgtctc gcccatcctc cgcacgtttt 7981 tcgagcggtg gcgcgacgac ctgcccgaag cggatcgcga gatgctgcgg ccctacgcgc 8041 cgcggctcaa gggaaccggc gacggccgcg acggcgttcg gccttggctg attattgact 8101 ggtgggtgcg agcgcagacg ccggcctggc tcgacctcgc gcggctgccg accgaggcgg 8161 cgtcgctgcg cggactgaat gagctaaggg cgggggtcga tttgaccgcg gcgaacaaag 8221 ccgcgaacga agtcaaagca agagccgtcg cgactcgggc cgcgacttgg gccgcgactt 8281 gggccgcgac ttgggccgcg actcgggacg cgacttggga c // LOCUS NODE_3960_length_8139_cov_5.3414158139 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8139) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8139) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8139 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 268..1950 /locus_tag="DP116_25035" CDS 268..1950 /locus_tag="DP116_25035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314628.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25035" /translation="MLTGKLKSNKYKKAKKQKSQETPTLSLKERLAQKRKAAQSRKEF SNFLMIAAFGGIFFGILMAFVGGIKAAVPGLLGIVTIALSYKYPRQALFAFLIYVPFA GTITYYLGNSPILQLAKDSFYIPALIGLWQMCRKQRLPLIIPRAIKIPLFILLGLCLL TLVFVNGVQQFNPSLTQLLDDTAVEIPIGVGILGLKVLLGYVPLIACAYYLIRNKEDF LFLSRLQIVLILVCCALGLIQYLLLSKGICQGTRYLEGAAQFKASIEARCYFGGSLIY SPDQGIIRLPGTFVAPWQWAWFLISSIFFAFASGFSDPSRIWRVLSLSSMAAVAVNAV VSGQRAALALVPASFVALLLLTGQIRNLKRFLPIAVGLAMILGIAIINNPDILQERTA SLNERWEASPPLEFITQQFANVWKNLQPLGHGLGRATNSARALGATKLVETYYPKILY EIGPLGLLGFLILVTTLTIVCFQTYRSIKNPNFRSYAAALWVFILFISYNTYYYPLDV DPVAVYYWFFAGVLLKLPEIDKQEKLQQALAKGKQRKKGSRGSAWVAEENNF" gene 1981..2967 /locus_tag="DP116_25040" CDS 1981..2967 /locus_tag="DP116_25040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875124.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_25040" /translation="MNLPLISVIIPTYCREEPLRDSLADVLKQDYPNYEVLVVDQTPK HQLEIETYLEELAAANKIKWFRLSWASLPGARNYAVRRAAGEIILFIDDDVQLTPGFL AAHAKNYLEKPEVGAIAGRVFDRMKLGDSGGNLQISYLPPQAMDPGIAWYYIDLVHTI QPQEVLTARGCNMSFRREIFTKYGLRFDERFRGSAVREESDFCLRLRQTGYKIWYDPE AYLVHLGEETGGCHDISMRSLQYQLTFYHNHFLMGLKNLTATQALRLYFRLFDCHVLG HPPCNKSGSPIKIVTRGVFFSLGFLKALGSVVQSIWNDGQIYTRMDEHVEVI" gene 3269..4444 /locus_tag="DP116_25045" CDS 3269..4444 /locus_tag="DP116_25045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015215848.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycosyltransferase family 4 protein" /protein_id="PRJNA477356:DP116_25045" /translation="MKILVASHTYIVDLNCEKLRILSQLEPEIEVTVVVPKKWKPGGV QNKIIETQFRDEGKFRIVPVSNFSQNHQGLLTFGADLISLLWEFRPQIIQVEQGSKGL AYAEMITLNQLLGLKAKNVFFTWWNLPYHLKFPVSLLEKYNLTHSHGIISGNQDGAEI LREHGYKGSIKIMPQLGVDETLFTPTPQPELGNKFGIQPGDFVVGFVGRFVQEKGLLT LINALASLRDKSWKCLLLGRGSLKSELMNKAAESNIQDRIILVESVPHDEVPNYINLM STLVLPSETNYNLKNITSVGWKEQFGHVLIEAMACKVPVIGSNSGEIPYVIGDAGLIF PEGDAKALADCILKLMEKPEFAKKLGEMGYQKAMSQYTNKALAKQQLEFYKELVDSH" gene 4883..6076 /locus_tag="DP116_25050" CDS 4883..6076 /locus_tag="DP116_25050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314625.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyl transferase group 1" /protein_id="PRJNA477356:DP116_25050" /translation="MQVLQIVPSISLIYGGPSQMILGLAPALARQGVEVTVLTTDSNG DSGQKPLDVPLNCPIEQDGYKIIYFRCAPFRRYKFSVDLLRWLNRHAHEFNLAHIHAL FSPVSSFAATICHRQKLPYILRPLGTLDPADLRKKKQLKQLYAAILERPNLAGAAAIH FTSVQEAKVSERFGVSTRDLVIPLGVKPVKRVGEAETEEVCRKLGIPKDVPLVLFMSR IDPKKGLNLLLPALEALLAEKLNFHFVLAGTNPQDPSYEEKIKLQIQASVLRSHTTIT GFVTGELKSTLLQAADIFVLPSYYENFGIAVAEAMAAGTPVVISDQVHICQEVRDSES GWVGATDVSELTNMLRIALQNPAERQRRGLCAKEYALKNYSWDAIAHQTIEAYNQILS DLDVK" gene 6211..6849 /locus_tag="DP116_25055" CDS 6211..6849 /locus_tag="DP116_25055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019488594.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peroxidase" /protein_id="PRJNA477356:DP116_25055" /translation="MALHLGDTVPNFTQASSTGDIDFYEWAGDSWVVLFSHPADYTPV CTTELGRVAKLKPEFDKRNVKVIALSVDDVESHKGWIGDIEETQSTNLNYPILADADR KVSDLYDMIHPNANALLTVRTVFIIDPNKKLRLSLTYPPSTGRNFDEILRVIDSLQLT DNYSVATPADWKDGDDTVIVPSLKDPEVLKEKFPKGYKEVKPYLRLTPQPNK" gene 7116..7700 /locus_tag="DP116_25060" CDS 7116..7700 /locus_tag="DP116_25060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114077.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_25060" /translation="MLSSPIFLRMPSDLQMTDEQFFEFCQVNRDLRIERDKFGEISIM PPTGSETGNRNFNIALQLGIWSEQNGTGICFDSSTGFKLSTGADRSPDASWMKLERWN TLTGEQQQKFAPICPDFVVELRSSSDNLQPLKDKMEEYMKEPGIQLGWLIDRKHRKVY IYRPGMPEECLDNPATVSGESVLPGFVLNMSKVW" BASE COUNT 2352 a 1690 c 1736 g 2361 t ORIGIN 1 ggcgaacgcg cagcgtctcc gttcggcctc aagccgtgcc cgaagggctc aggagttacc 61 cccgaacccg gagggcttcc cgcagggtag cacctggcgt gaggggatgg gggtgaggtc 121 tttttactat tccctattct ttgtgtttct ttcttcacag acaaaaacac agattctcaa 181 gaatcattta agactgctat attcaaaata atatgtaaaa aaaacatcta tagctagtga 241 ctttatgcgc atatctacag caacgccatg ctgacaggaa aattaaaatc caacaaatac 301 aaaaaagcaa aaaaacagaa gagtcaggaa actccaaccc ttagcctcaa agaacgttta 361 gcccaaaagc gtaaagctgc acaatcgcgc aaagaattca gcaatttttt aatgattgct 421 gcttttggcg gtatcttctt tggtattctc atggcttttg tcggtggaat taaggcagca 481 gttccaggtt tattaggaat tgttactatc gctttgtcct acaaatatcc ccgccaagcc 541 ctttttgcgt ttcttattta cgtgccgttt gcgggtacta ttacttacta cctcggcaat 601 agtcccatac tccaattagc gaaagactcc ttttacattc cagcgctgat tggactttgg 661 caaatgtgtc gtaagcagcg gctaccccta atcattcccc gagccattaa aattccactg 721 tttattttgt taggtttatg tttgctaaca ctagtgtttg ttaatggagt gcagcagttc 781 aatccgtcct tgactcaatt attagacgat acagctgtag aaatacccat aggcgtgggg 841 atactcggtt taaaagtctt gttgggctat gtacccttga ttgcttgcgc ttactatctc 901 
atccgcaata aggaggattt cctattttta tcgcgcctgc aaattgtcct tatactagtg 961 tgctgtgcgc tgggattaat tcaatacctc ttattatcga agggaatatg tcaaggcaca 1021 agatatttag aaggagcagc ccaatttaaa gcatcaatcg aagctcgatg ttattttggt 1081 ggttctctga tttatagtcc cgatcaaggg ataattcgtc taccaggaac ctttgtcgca 1141 ccttggcagt gggcatggtt cttgatttcc agtatctttt ttgcctttgc ttccggcttt 1201 agtgatcctt ctagaatctg gcgagttctc agtttaagtt ctatggcagc agtggctgtc 1261 aatgcagttg tctccgggca gagagcagcc ttagcattgg tacctgcgtc tttcgtagct 1321 ttgctattgc taactggtca aattcgtaac ctcaaacggt ttctccccat agcagtagga 1381 ctcgcgatga ttttgggaat tgcaatcatc aacaaccctg acattctcca agaaagaaca 1441 gcaagtctga atgaacgttg ggaagcttca ccacctctgg aatttatcac acagcaattt 1501 gcgaacgttt ggaaaaattt acaaccccta ggacatggat tagggcgcgc gactaactct 1561 gctcgtgcgc taggtgcaac caaactggta gaaacttact atcccaaaat actatatgaa 1621 attggaccgt tgggactact aggattttta attttagtca caactttaac aatcgtgtgc 1681 ttccagactt atcgttccat caaaaatcct aatttccgca gttacgcagc tgctttgtgg 1741 gtgttcatat tgtttattag ttacaacacc tactactatc ctcttgatgt cgatccagta 1801 gctgtctact actggttctt tgctggagtc ctgttgaaat tgccagaaat agacaagcaa 1861 gaaaaactcc aacaagcact agcaaaaggg aaacagagga agaagggaag caggggaagc 1921 gcttgggttg cagaggaaaa taatttttga ctaatgacta atgactaatg actaatgact 1981 atgaatttac ctttaatttc tgttattatc ccgacgtact gtcgtgagga accactgcgc 2041 gatagccttg ctgatgtcct gaaacaagac tacccaaatt atgaagtctt ggttgtggat 2101 caaactccaa aacatcagtt agaaattgaa acttacctag aagaattagc tgctgctaat 2161 aaaataaagt ggtttcgttt gagttgggca agtttgcctg gggcacggaa ttatgcagtg 2221 cggcgagcag caggtgaaat cattttattt attgatgatg atgtgcaact aacccctgga 2281 tttttagccg cgcatgcgaa aaactattta gaaaaaccag aagtgggggc tatcgctgga 2341 cgggtatttg acagaatgaa attaggtgac tctgggggaa atttacagat ttcatatctt 2401 cctccccagg caatggatcc aggaattgct tggtactata tagatttggt acacacaatt 2461 cagcctcaag aagttctcac agccagaggt tgtaatatgt cctttcgtcg cgaaattttt 2521 accaaatacg ggttgagatt tgatgaaagg tttcgtggta gcgcagtgcg agaagaatct 2581 gatttttgtt 
taaggttgcg gcaaacagga tataaaattt ggtatgaccc agaggcatat 2641 ttggtacatt taggagaaga gacaggaggt tgtcatgata ttagtatgcg ctctctccaa 2701 taccaactca ctttttacca caaccatttc ttaatgggac tgaagaacct taccgctacc 2761 caagctttac gcctttactt tcgtttattt gactgtcatg ttcttggaca tcctccttgc 2821 aacaaaagtg gttcacccat caaaattgtg actcgtggcg ttttcttctc tttggggttt 2881 ttaaaagcct tgggtagtgt tgttcaatca atttggaatg atggtcaaat ttacactcgc 2941 atggatgaac atgttgaggt tatttaaccg cagattaacg cagacggatg cagatagatg 3001 atatcaaatc cgtttgtatc ggaattattt ctcctttctc tccacgccag tcgccaagtg 3061 agggaaacaa ggactgcagc gctggctcct ctgcgttctc tctcttgaaa aggtatgatc 3121 ggaggaaacc tccgctcaga cttttcgctg cttcctctgc ggttttttaa ttattcagat 3181 tcaaccggaa acgatatgac agatttatct gcgttaatcc gcgtgcatct gcggtttaaa 3241 ataaaaaatt accaatcctg agatatcaat gaaaattctt gtagcaagtc acacttatat 3301 tgtagacctt aactgtgaaa aattacgcat tctttctcaa ctcgaaccgg aaattgaagt 3361 cacagtagtc gttccaaaaa agtggaaacc tggcggtgtt caaaacaaaa taattgaaac 3421 tcaattccgt gatgaaggca aatttcgcat agttccagtt tctaacttta gtcaaaatca 3481 tcaaggactc ctgacatttg gcgctgattt gatatctttg ttgtgggaat ttcgccccca 3541 gattatccag gtagaacaag gttctaaggg attagcttat gctgagatga ttactttaaa 3601 tcagctatta ggactaaagg caaaaaatgt gttttttacc tggtggaatt taccctatca 3661 tctgaaattc ccagtttctt tattagagaa atataatcta acccatagtc atggcattat 3721 ttcgggcaat caagatggcg cagaaatttt acgggaacac ggatacaaag gttcaatcaa 3781 aattatgcct caactgggtg ttgacgaaac tctgtttact cccacaccac aaccggaatt 3841 aggaaataag tttgggattc aaccaggtga ttttgttgtg ggatttgttg gacgctttgt 3901 ccaagaaaag ggattactga ctctaataaa tgctttagct agtttgagag ataaatcttg 3961 gaaatgtttg ctcttgggac gtggttcgtt aaagtcagag ttaatgaaca aagcagcgga 4021 atctaacatt caagacagaa tcatcttggt agaaagtgtt cctcatgatg aagttccaaa 4081 ttacatcaac ttaatgagta ctttagttct accatcagaa acaaactaca atcttaaaaa 4141 tatcacttct gttggctgga aagaacaatt tggtcatgta ctcatagaag caatggcttg 4201 caaagtacct gttattggtt ctaattctgg cgaaattcct tatgtgattg gtgatgctgg 4261 attaatattt cctgaaggag 
atgcgaaagc gctagctgat tgcatcctca aattgatgga 4321 aaaacctgaa tttgcgaaaa aattaggcga aatgggttat caaaaagcaa tgagtcagta 4381 cacgaataaa gctttggcaa aacagcaatt ggagttttat aaagaacttg tcgatagtca 4441 ttaatcactg ggttattcgt aaacttacat tttgcacctg gtaaaacaag cgtcaaattc 4501 attacatatt atttgactcc gttattgtcc gcctaggact gtaagtccca ggctaatagg 4561 cgaagtccat taaaatggac tcaaactgga actaattttg agtcaattga aatactattt 4621 tatacaagta ttaacggttt tctcgtaatg ctgcaagatc tcagtaaaca aacaacaaac 4681 ttcttgtaaa agtcaattat atgaaactga gacaactatc tgcgtgcatc cgcacggcag 4741 gtgctaccct tcgggaagcc gcccttcggg cgtctacaag tcggggaacc cgcccaacgc 4801 actgcctcgt gcatctgcgg ttccaaatat tctcagctca tatcttacaa gcagtctaat 4861 aacaaagaac aaatgacaaa ttatgcaagt tctacaaatt gtcccctcaa tttcactcat 4921 ttacggtggt cctagtcaaa tgatactagg acttgctccc gcattggcac ggcaaggagt 4981 ggaagttact gttctcacca ctgacagcaa tggtgattct gggcaaaaac ctttggatgt 5041 tcccttaaat tgtcctattg aacaggatgg ctataaaatc atttatttcc gttgcgcccc 5101 ttttcgtcgc tacaaattct ctgttgactt actcaggtgg ttaaatcgtc atgctcatga 5161 gtttaattta gctcatattc atgctctgtt ttcccctgtc agtagttttg ccgcgactat 5221 ttgtcacaga caaaaactac cttacatttt gcgtcctctg ggaactttag atcctgctga 5281 tttacgtaaa aaaaagcaac tgaaacagct ttatgctgca attctggaac gtccaaattt 5341 agctggtgca gccgcaattc acttcactag cgttcaggaa gccaaggtat cagaaagatt 5401 tggagtatca acccgagatt tggtgattcc cttgggtgtg aagcctgtta agagggtggg 5461 ggaagctgaa acagaggagg tgtgtcgtaa gttaggtata ccaaaggatg tgcctttagt 5521 gctgtttatg tcccggattg acccaaaaaa agggttgaat ctactgcttc cagcgttaga 5581 ggcgctttta gctgaaaaat taaattttca ctttgtccta gctgggacta atcctcaaga 5641 tccaagttat gaagaaaaga taaaattaca aatacaagct tctgtgttgc gatcgcacac 5701 cactatcact ggctttgtca ctggtgaatt aaaatctacc cttctacaag ctgctgatat 5761 attcgtcttg ccatcctatt acgaaaattt tggtattgct gtagctgaag cgatggcagc 5821 tggaacgcca gtagtcatat cagaccaagt gcacatttgt caagaggtgc gtgatagcga 5881 gtcgggatgg gtgggtgcaa cagatgtgtc agaactcact aacatgcttc gtatagcttt 5941 acaaaatcct gcggaacgcc aacggcgggg 
attgtgtgct aaagagtatg ccctgaaaaa 6001 ttatagttgg gatgcgatcg ctcatcaaac cattgaggcg tataatcaaa tcttgtccga 6061 cttagacgta aaatagtaaa tctgtttgtg caaacttgca caacttaaaa ttttgtttca 6121 tgacgctaaa gcccaatcgg caaaccggtg ttttgtgcga tcatggcaaa atagaataca 6181 tatgcctttt tgaacgacag ggaatctgac atggctctcc atttaggcga cacagtacca 6241 aactttaccc aagcctcctc cacaggagac atagactttt acgaatgggc tggcgacagc 6301 tgggttgtac ttttctccca ccccgcagac tatacacctg tttgtacaac agagttaggg 6361 cgagtggcta agctaaagcc agaatttgac aaacgcaatg tcaaagtcat cgctctcagt 6421 gttgatgatg tagaatctca caaaggatgg attggagata ttgaggaaac tcaaagcacc 6481 aacctcaatt atccaatttt ggcagacgcg gatcgtaaag tttctgacct ttacgacatg 6541 attcacccca atgcaaatgc attgctgact gtccgtacag tcttcatcat cgatcccaac 6601 aagaaactac gtctgagctt aacttatcct cccagcacag gacgcaactt tgatgaaatt 6661 cttcgggtga ttgactcgct gcagttgaca gacaactaca gtgtagcgac accagctgat 6721 tggaaagatg gcgatgacac agtcatcgtt ccttcgctaa aagatccaga agtgctgaag 6781 gagaaattcc ccaaaggata taaggaagtg aagccctatc tgcgcttgac tcctcaacct 6841 aataagtaat acgcgcgttg cattcataac ttagacttca ccccgcctta acgggcaccc 6901 ctctccttta taaggagagg ggtaggggtg aggtaaaatc ctgaaatcat tttacacaag 6961 tgctgtgtat tcatagctat cacctcaccc cgccctgacg ggcacccctc tggtgaattc 7021 gcgcagaggg gcaaacagat gcgagatttc ccatctctac agcatttttg cgtgagaata 7081 aaagctagtt gggtaggccc ttagagaaaa taactatgct ttcatctcct atttttttac 7141 ggatgccatc agatttgcaa atgacagatg agcagttttt tgaattctgt caggttaacc 7201 gagatttacg tattgaacgt gataaatttg gtgagatatc aattatgcct cctactggtt 7261 cagagacagg taaccgtaat tttaacatag ctctccagtt gggaatttgg tcagagcaga 7321 atggaacagg tatttgtttt gactctagca ctggatttaa gctatcaacc ggtgcagata 7381 gatcgcctga tgcttcttgg atgaaacttg aacggtggaa tactctaact ggcgaacaac 7441 agcaaaagtt cgctccaatt tgtccagatt ttgttgtaga actccggtcg tcttctgata 7501 atcttcagcc tttaaaagac aaaatggaag aatatatgaa agaaccgggg atacagttag 7561 gttggttgat tgaccgtaag catcgcaaag tttatattta tcgtcctgga atgccagagg 7621 aatgtttgga taatcctgct actgtcagtg gtgagtcagt 
tttgccaggg tttgttttga 7681 atatgagcaa ggtttggtag taagtagctc aacctaatta aacgtaaaat gtcattacgt 7741 aagcgcaaag cgcacgctaa gagcgaacgc gcagcgtggc cgaagggcat acgaaacgca 7801 ggcgtaggtc gggttaagcg tagcgcaacc caacaataaa cgtttggtgt tgggtttcct 7861 tacgtcaaac gccagacttg ccgtgagagc ggaaagccgt catgagctag gctccccaac 7921 ctacgttatt gattgttaat cgtcaaaaaa tattacaaat taagtattag agcgtacaaa 7981 cctgttatta atactttcat ccacaatttt actcgttatg tagtgctttt cattaccaga 8041 aaccacaaac cacatgtagg gttacagcat tgctttgtgc ccctaattaa actcaacgag 8101 cctgtcacca gacccgttgg tcttcaaaac cgtgcttga // LOCUS NODE_4004_length_8042_cov_4.6401658042 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 8042) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 8042) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..8042 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 199..864 /locus_tag="DP116_25065" CDS 199..864 /locus_tag="DP116_25065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318756.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipase" /protein_id="PRJNA477356:DP116_25065" /translation="MNNVKKQRNPVLLVHGIFDTGRVFDTMIPYLNQRGWTVYDLDLV PNNGNLGLDTLAQQVANYIDATFEPEQPLDIVGFSMGGVVSRYYVQRLGGINRVQRFV TISSPHHGTWIAYCREGIGCIQMRPDSVLLQDLNRDAGMLKQIDFTSIWTPYDLMIVP ATSSQMPVGREVIVPVLNHSWMLTDSRSLAAVAEALTIQNSLIQNSKFKNKQPSTGKY SVC" gene complement(892..1353) /locus_tag="DP116_25070" CDS complement(892..1353) /locus_tag="DP116_25070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188425.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glyoxalase" /protein_id="PRJNA477356:DP116_25070" /translation="MNPTLFHLAFPVTDIAQAKAYYVDGLGCTPGRENRNALILNLYG HQLVAHVTKEPPTPQRGIYPRHFGLIFTLEHDWEDLLTRAQQRQLLFREEPKHRFVGS ALEHRTFFLEDPFYNIMELKYYRYPEAIFGSAEYTQIGDTPAGSRPLGERL" gene 1390..1701 /locus_tag="DP116_25075" CDS 1390..1701 /locus_tag="DP116_25075" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25075" /translation="MEVVLKSLTLAILEHQPMRKLIIQLFIVILVVSGSIGYVFVNRA TPASSCTITSLTAQVGNGTLSYSQVGNGRSILLLHGLFADKEQWNTIMCRLSLVASFA G" gene 2090..4192 /locus_tag="DP116_25080" CDS 2090..4192 /locus_tag="DP116_25080" /inference="COORDINATES: protein motif:HMM:PF07693.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25080" /translation="MNLSDQQRKKLQDALISAFPSRESLEMMLRAQGVDIGNLQILSN SNYSELVFQLIEHQERIGRLETFVHGVRRLNPGNPKLRAIAQELLTPGSVPNTSFAEN TKQPDSSQDSIQNYIYNKGASNDLVGDRDQLGFDCYVKAFADIIESPNTKPPLTIGIF GSWGTGKSFLLDHITTELEQRSQQRKQEKQSQPKKSTNEEQKDSPYPHVHVIKLNAWE YSAAKAIWPGLVRKIMNQLEKELAWGFPGLFWKKFLLNLKREYEDNKSKIILFFAILL GFLVLNLFKLKLDLRLIWGALLALGVTGTFKLVADTISKPLSQWMETVLKGTDYGKQI NYMEEIYSDLELLAQRLKNNNGRVLIIIDDLDRCEPLKAVEVLQAINLLLNFKSFIVC LGIDARIITRAIEQFYKNMLGPAKASGYEYLDKIIQIPFRIPESTPDEIELFLSDLLG DSSPSTSTASSSSERTTQATSNSNVQNQGAPVQKNTPVEPPALITFNDKERVAFKRYT RFLQANPRHLKRLVNVYRLVRTLAEYKQEDFITDNPDATICWLVICSQWPYTAYAMLY YFQELLERWEEEKENLKPRIYDEEKVSESSLEYLYQEVLKQLEQSQDAKNKQRKLDHD PDLLRMLLKKGEPLTWEQLYIVNRYTVNFNPAVESTIKSEVPVKVSTDDLSNMPDGFY AFLDILSKRKQPTDTHQT" gene complement(4437..7163) /locus_tag="DP116_25085" CDS complement(4437..7163) /locus_tag="DP116_25085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869439.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type I secretion system permease/ATPase" /protein_id="PRJNA477356:DP116_25085" /translation="MASRENSKTNGKIAGELKLLDDESSENSVLASQSWNQPPLCWLT AEQQNRLQEVSETLRYKLGEKIWSQEAGGFQFLIISGNIRLREESVGKTMTVLQPGDW FGDLNNFSVECKAVAASKEVVVVRWTTAIWAELSTPQIEGFWLGNKEDTGTREQINSK FQIPNSKSETSASSASLPDSFTPSSSPPSSPSLSPSSTQLSTYPFVSSWNTGAACLTM VAQQLDNPVQLEWVQRQLRGQQPKHLVEAAEKLGFVLRRLQVSWSQLQQLSFPALLRW EASGKTEGGWIVAYALKGDRLTVANPLNPNHTCESIPRSVVEETWDGQLWQVEVIQKQ EKFNLNWFTPAVWQYRGLLGEVLLASFTLQLLGLATPLITQVVIDKVMVQESLPTLDV MAIALLLTSTFEAVLGILRLFIFTHTARRLDLSLSAQLFRHLMRLPLAYFESRRVGDT VARVQELEQIRQFLTGTALTVVLDSIFAVVYLGLMFYYNIPLTFVALAVLPLFATLTV VATPILRNWLNETFNRSADSQSFLVETITGIHSVKAHAAEPAARDRWEGLFARFVRTG FKASTTSNISSNIGDFLTNFSNLLILWFGAKLVIDHKLTVGQLVAFQMLSGRVTAPLL RLVQLWQTFQQVLLSVDRIGDILNVAPEAEPGTGLVLPSLKGQVNFEQVFFRYRQSAE PVLRGISFSVEPGQFVGVVGRSGSGKSTLSKVLQRLYQIESGRILIDGFDIKSADLAS LRQQIGVVLQEDFLFNGSILDNITLGNPDITSEQVVEAARYAVAHDFISELPHGYETN VGERGTALSGGQRQRIALARLFLSQAPILILDEATSALDSETEQQVLENLQKVSANRT VFLIAHRFAPLKRADLILVLEKGVIAERGNHLDLLRQKGLYWSLYQRQQANI" gene complement(7112..7870) /locus_tag="DP116_25090" CDS complement(7112..7870) /locus_tag="DP116_25090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860356.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_25090" /translation="MENLSFLTVDEKPISYVQAVKYLQASGKLGQFIGDILRQYIIEQ ELEKREDILISSALTEQTIIDFRLKNKLTEPQKFQEWLQSNGTNYDTFHSSVAYGFKV EKLKAMVTESKLQEYYIERKIYLDRVVVSRIIVETQELAEELLTQIEEGTSFEQVARE YSLSDEKIVNGMMGPVSRGTMPDKLRACIDIASPGQLVGPIELEGRYGLFRVEQFLSA SLENTQLRQALQNELFEKWLAEKIQKLTVKLQVN" BASE COUNT 2373 a 1622 c 1711 g 2336 t ORIGIN 1 ttcgcgcaat cgcatggcat tattatctga taatttgtca ccttgacagg gctagagtta 61 agagttaaga gttaagcgtg tttcttcaaa acagtactac tcaaaccaat acggagactg 121 tcatagcttt ttggtaaagg gtgatgagac tccactagat taggaaaaga tagaacttaa 181 cattattttg ttttgtaaat gaataacgtc aaaaaacagc gaaatccagt gctattggta 241 cacggtattt ttgatacagg tcgagttttc gacacaatga ttccttacct gaaccagaga 301 ggttggacgg tgtatgacct agacttagta ccaaataatg gcaatttagg tcttgacaca 361 ttagctcagc aagttgctaa ttatattgat gcgacttttg aaccagaaca acccttggat 421 atagtgggct tcagcatggg gggtgtcgtt agccgctatt acgtccaacg actgggagga 481 atcaaccgcg tgcagcggtt tgttaccatc tcatcaccac atcatggcac ttggattgct 541 tactgtcgtg agggtatagg ttgcattcaa atgcgtcctg acagtgttct actacaagat 601 ttgaatcggg atgctggtat gttaaagcag atagatttta cctctatctg gacaccttat 661 gacttgatga ttgtccccgc aactagttca caaatgcctg tgggacgcga ggtgatagta 721 ccagttttaa atcactcctg gatgctgaca gattccagaa gcttggctgc ggtggcggaa 781 gcgttgacaa ttcaaaattc actcattcaa aattcaaaat tcaaaaacaa acaacctagc 841 actggcaagt attctgtctg ttgacaatat gctaaaaaaa tgcattgcag cttaaaggcg 901 ttcgcccaaa gggcgactcc ctgctggagt atcgccaatt tgcgtatact cagcactgcc 961 aaaaattgct tcaggataac ggtaatactt caattccata atgttataaa aaggatcttc 1021 taaaaagaaa gtgcgatgtt ctaaggcaga accaacaaag cggtgtttgg gttcttctct 1081 aaaaaggagt tgtcgctgtt gcgccctcgt gagtaaatcc tcccaatcat gttctaaagt 1141 aaaaattagc ccaaaatgtc ttgggtatat cccacgctga ggtgtgggtg gttcttttgt 1201 aacatgagct accagttgat gaccgtagag attgagaatt agggcgttgc ggttttcccg 1261 tccaggagtg cagcctaaac catcgacata gtatgctttt gcctgggcaa tgtctgtgac 1321 gggaaaggca agatgaaata 
atgttgggtt cattgcttgt gtgggtaaga ctccctcaag 1381 ttttaccaga tggaagttgt gctaaagtcc cttacactag ccattttaga gcatcaaccc 1441 atgagaaagt tgatcattca actatttatc gtcattctgg tagtcagcgg atcaataggc 1501 tatgtgtttg tgaatcgcgc aacgcctgcc tcaagctgca cgatcacaag ccttactgca 1561 caggtcggca acggtacatt gtcctacagc caggtgggga acggacgctc gattctactg 1621 ttgcatggtt tatttgccga caaggagcag tggaacacca tcatgtgtcg gctgtctctt 1681 gtggcgagtt ttgcagggta agctaatctg atgtattttg gtttgtgcga aatgcaagtt 1741 ctattcgttg tgcaaaagct gatctggaag ttgctccaaa accctttcaa atggtgcagt 1801 gttattgagt ccccgcgtta aaactagcgc accttgaagc gcagggcggt aggttgataa 1861 agtggaatag ttgacttgtt agccaagtca tgattttgtg gattaaaaaa aacgtctgtt 1921 caaagcgcaa ccaaagcaag cccctagcgg cgagtgcttg gggttgggtt gcattcttgt 1981 ttcactgcaa gcggttttat ctggaatgtt attgtctgcg tgtgtagaat aatgtgaatt 2041 ttgttctaca aatcccagtc tcctagtata tatacactgg ttttgagaaa tgaacctatc 2101 agatcaacag cgtaaaaaac tacaagatgc tctaatttct gcttttccct caagggaatc 2161 tctggagatg atgcttaggg cccagggcgt ggatataggg aatcttcaaa tattgtctaa 2221 ttctaattat tcagaacttg tctttcaatt aatagaacac caagaaagga taggcaggtt 2281 agaaactttt gttcatggcg tacgaaggtt aaaccctggg aacccaaaat tgcgagctat 2341 tgctcaagaa ctcttaactc ctgggtctgt acctaacact agttttgctg aaaacactaa 2401 gcaacccgat agttctcaag actctattca aaattatatt tataacaagg gagcaagcaa 2461 tgatttagta ggagatcgtg accaactagg cttcgattgc tacgttaaag cttttgccga 2521 tataattgaa tcccccaata ccaaaccacc attgactatt ggtatatttg gctcttgggg 2581 tacgggtaaa tcgtttctac tcgatcatat aacaacagaa cttgagcagc gttctcaaca 2641 acgtaaacag gaaaagcaaa gccagcccaa gaaatctact aatgaagagc agaaagattc 2701 gccttatccc catgttcatg ttataaaatt aaacgcttgg gaatacagtg cagctaaagc 2761 tatttggcct gggttggtgc ggaaaataat gaatcaatta gaaaaagagc ttgcctgggg 2821 gtttcctggt ctattttgga agaagttttt actaaactta aagcgtgaat atgaagataa 2881 taaaagtaaa ataattttat tttttgctat attacttggt tttttagttt taaatctctt 2941 taaattaaaa cttgatttga ggctaatttg gggtgctttg ttagcactag gagtcactgg 3001 aacattcaag cttgtagctg atacaatatc 
taaacctctt agtcagtgga tggaaacggt 3061 cttgaaggga actgactatg gtaaacagat taattatatg gaggaaattt attcagattt 3121 ggagttactt gcacagagac tgaaaaataa taatggtaga gtcttaatca tcatagatga 3181 cctagatcgg tgtgagccac tgaaagccgt tgaggtgttg caagcaatta acttactcct 3241 caactttaaa agctttattg tatgtttggg gattgatgct cgtatcatta ctcgtgcaat 3301 cgaacaattt tacaaaaata tgttggggcc tgcaaaagct tcaggatatg aatatttaga 3361 taagattata caaattccat tccgtattcc agaatctact cctgatgaga ttgaattatt 3421 cctttccgat ttgttaggag attcatcccc atcaacttct actgcctcct catcatcaga 3481 gcgcacaaca caagcaactt caaattcaaa tgtacaaaat cagggagcgc cagttcagaa 3541 aaatacacca gttgaacctc cagcacttat tacatttaat gataaagaac gggtagcatt 3601 caaacggtac acaagatttc tgcaagccaa cccccgtcat ctcaaaagat tagtgaacgt 3661 ttaccgcctt gttcgcactt tagccgagta caagcaagaa gacttcatta cagacaatcc 3721 tgatgccaca atttgttggc tggtgatatg cagtcagtgg ccgtacacag catacgcaat 3781 gctgtattac tttcaagaat tgttagaacg ttgggaggag gagaaagaaa atctaaaacc 3841 aagaatatac gatgaagaaa aggtttcaga aagctcatta gagtatcttt atcaagaagt 3901 gttgaaacaa ttggagcagt ctcaagacgc aaagaacaaa caaagaaaat tagatcatga 3961 cccagattta ctacgaatgc tattgaaaaa aggtgagcct ttaacctggg aacagctata 4021 tattgttaat cgctacactg ttaatttcaa tccagcagtt gagtcaacca taaaatctga 4081 agttccagta aaagtaagta ctgatgattt atctaatatg ccagatgggt tctatgcatt 4141 tttggatatt ttatcaaaga ggaaacagcc aacagatacg caccaaactt aattctaccg 4201 tcccaagact ccacaatttg accgacaggg ctttgaagtt gcgtttcaaa tatgacctgg 4261 ccgtttttgt aaattacaaa tgcctcatta tgggcattgc ccttgatgtc aactcaagcc 4321 aaaactctcc taaaatctcg ttcccagtct ccgactggga acgatgtaag acgaggtaat 4381 gagcttaact acaaagttaa gtataaatat gcaattaaaa acaccaatta ttgatatcag 4441 atatttgcct gctgtctttg ataaagtgac cagtacaaac ctttttgccg caacaaatcc 4501 aaatgatttc cgcgttcagc aataacgccc ttctccagta ccaaaatcaa atcagcacgt 4561 ttaagagggg caaaacggtg ggcgatgaga aacacggtgc ggttagcgga aactttttgt 4621 aggttttcca ggacttgctg ttcagtttca ctatctaaag cactggttgc ttcgtccaaa 4681 attaaaatcg gtgcttgcga gagaaataat cgtgccaaag 
ctatgcgttg acgttgtccg 4741 ccagataaag ctgtaccccg ttctccaaca tttgtttcgt agccgtgggg taattcactg 4801 ataaagtcat gagcaacagc atatcttgca gcttctacga cttgttctga ggtgatgtcg 4861 ggattaccga gagttatatt gtctaagata gaaccgttga ataaaaagtc ttcttgcaga 4921 actacaccaa tttgttggcg cagcgaagct aaatcagcgc ttttaatatc aaaaccatca 4981 atgaggatgc gtcctgattc aatttgataa aggcgttgca acactttaga aagagtactt 5041 ttcccagaac cactgcgtcc aacaacacca acaaactgtc ctggttctac agaaaaggag 5101 attcctctta agacgggttc agcgctttgt ctgtagcgga agaaaacttg ctcaaagtta 5161 acttgacctt tgagggacgg taacactaaa ccagtgcctg gttccgcttc tggggctacg 5221 ttaagaatat caccaatacg gtcaacagaa agcagcactt gttgaaaggt ttgccaaagc 5281 tgtacgaggc gtaaaagtgg tgctgtgact ctaccagaaa gcatttgaaa tgcgactagt 5341 tgaccgactg ttaatttatg atcaataaca agcttggctc caaaccaaag aataagcaag 5401 ttggaaaaat tggtgaggaa gtcaccaata ttgctactaa tgttagaggt ggtagaagct 5461 ttgaaacctg tgcggacgaa gcgagcaaat aaaccttccc agcgatcgcg cgctgctggt 5521 tctgctgcgt gtgctttaac tgagtgaatt cctgtaattg tttcgacaag aaatgattgg 5581 ctatcagcac tgcggttaaa ggtttcgttt agccagttgc ggagaattgg cgttgcaacc 5641 actgttaaag tggcaaataa cggtaagact gctaacgcta caaatgtcag ggggatattg 5701 tagtaaaaca tcaatcccaa gtacacgaca gcaaagatgc tgtccaaaac cacggttaac 5761 gccgtacctg tgagaaactg gcggatttgt tcgagttctt gaactcgtgc gactgtatct 5821 cctacgcgcc gagactcaaa ataagccaag ggtaaccgca tgaggtggcg aaacagctgt 5881 gctgataaac ttaaatctag acgacgggcg gtatgggtaa aaataaacag tcgcaggatt 5941 ccaagaactg cctcgaatgt tgaggttaac aggagtgcga tcgccataac atcaagagtt 6001 ggcaaactct cctgcaccat caccttatca ataacaactt gagtaatcag tggtgttgct 6061 aaccccaaaa gctgcaatgt aaaagacgct agcagcactt ctcccagtaa tcctcggtac 6121 tgccaaactg ctggagtgaa ccaattgagg ttaaattttt cttgcttttg gataacttct 6181 acttgccata actgcccatc ccaagtttct tcaaccactg atcgtggtat actctcacaa 6241 gtgtgattgg gatttagggg gtttgcgaca gttaggcgat cgcccttgag tgcatacgcg 6301 acgatccaac ctccttctgt ttttcctgat gcttcccagc gtaacaaggc tggaaatgat 6361 agctgttgta actgactcca actgacttgc aagcgccgca gcacaaatcc 
taacttttct 6421 gctgcttcca caagatgttt tggttgttgt cctcgcagtt gacgttgtac ccattcgagt 6481 tgtacgggat tgtctagctg ttgcgccacc attgttaaac acgcagcacc tgtgttccag 6541 ctggagacga atgggtatgt tgacaattgt gtggaggagg gggagaggga aggcgaggat 6601 gggggagatg atgaaggagt gaaggagtca gggagtgagg cagatgaggc agaggtttct 6661 gatttggaat ttggaatttg gaatttagaa ttgatttgtt ctcttgtccc ggtgtcttct 6721 ttatttccta accaaaatcc ttcaatttgt ggcgttgaaa gttctgccca tattgctgtc 6781 gtccagcgta ctacgacgac ttctttactt gcagcaacag ccttgcactc tacagaaaag 6841 ttgtttaagt cgccgaacca atctcctggc tgtagtacgg tcattgtctt accgacgctt 6901 tcttcccgca agcggatgtt tccagaaata attaagaact ggaaacctcc cgcttcttgt 6961 gaccagattt tttctcccaa tttgtagcga agagtttctg acacctcttg cagtcgattt 7021 tgttgttcag cagttagcca gcacagtggt ggttgattcc aagattgaga tgctagcacg 7081 ctattttctg aagattcgtc atctaaaagt tttaattcac ctgcaatttt accgttagtt 7141 tttgaatttt ctctgctagc catttctcaa acaactcatt ttgtagtgct tgccttagtt 7201 gagtattttc taatgacgcg cttagaaatt gttccacacg aaacaatcca tagcgtcctt 7261 ctagttctat tggtcctacc aactgtccag gactggcaat atcaatacat gctcttagtt 7321 tatctggcat tgttccccga cttacaggtc ccatcatacc gttcacaatt ttttcatctg 7381 atagtgaata ctctcgggca acttgttcaa agcttgttcc ttcttcgatt tgggttagta 7441 gttcttcagc caactcctga gtctcaacaa taattcgtga aactaccacc ctatctagat 7501 aaatcttacg ttcaatgtag tattcttgta gctttgattc tgtcaccatc gctttgagtt 7561 tttctacctt aaaaccataa gcaactgatg aatgaaaggt atcataattt gtcccgttac 7621 tttgtagcca ttcttgaaat ttctgaggtt ctgtcaattt attttttagc cgaaagtcaa 7681 ttatagtctg ttctgtcaaa gcagaactaa tgagtatatc ttctcgtttt tccagttctt 7741 gttcaattat gtactggcga agaatgtctc caataaattg ccccaatttt cccgaggctt 7801 gcaaatattt taccgcctgt acataagaaa tcggtttttc atctacagtt aaaaatgata 7861 aattttccat gaagttaaaa atttaatgaa tagaaaaaaa tgacaaaaaa acaacataca 7921 actttttttc aattcaaatt gcatttgaat tgacaattct aaagaaaaaa aagagtgttg 7981 caatcataag tagttttctc agaattgaac cctctgcgac ctgaaggtac gcagaatcac 8041 gc // LOCUS NODE_4075_length_7895_cov_4.937755 7895 bp DNA linear
BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7895) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7895) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7895 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(167..1639) /locus_tag="DP116_25095" CDS complement(167..1639) /locus_tag="DP116_25095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25095" /translation="MVVKLQRPILVGGLGLSFSLWMLQNLHHSIVQVGEFGLLSLLAV GGGLWLLKSNRTQDSSEWLNTLPVDRTTVEATIAKAEAVINQLAQEAENPLTVGTLRE QLTHLTAELDRQEMIAAVTGGKSVGKTTLIEVLESTWCQNVETGSFSFVEFQETPPLF VEAGENSDADVLTRTKKSDVVLFLINGDLTDSEFQSLQQLKAVNQNTILVFNKQDQYL PDERVSVLQSLKHRMSSIVVATAASPHPIKVRKHQEDGCVQESLELPTPDIQQLTQQL GEMLTQQPEQLVWATTMRKALLVKTEAKNLLNEIRRDRAVKSIEQYQWIAAAAAFANP VPALDILATAAINAQMVMDLGGIYQQKFSLDQAQTVAGTMGSLMVKLGLVELSSKAVS TVLKTNAITFVAGGLVQGVSAAYLTRIAGLTLVEYFQQQEVAINSGTVLNLDKLRQTL QSVFQQNQQMALLQSFVKQGVKHLTPQAQQVQLVESEISA" gene complement(2317..2517) /locus_tag="DP116_25100" CDS complement(2317..2517) /locus_tag="DP116_25100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453750.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25100" /translation="MSTQEKARALMMRHYQLIKNRQQSMLERTGEELGLPGEVSHYWN PIQGKIDPTARMTYDRSNATMS" gene 2815..3873 /locus_tag="DP116_25105" CDS 2815..3873 /locus_tag="DP116_25105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_25105" /translation="MSDTLTKLTYQTFQQGKNYFGLAHKLLSSRLMNIVSPSEHREIK PIPNEILLKAQQRLNNLLEVDWEDAERGVYPKSVLFDNPWEDFFRYYPMVWLDLPQIW ERAQHKRYQEFTQDVDTNGYPSYYVQNFHHQTNGYLSDLSASLYDLQVEILFGGSADP MRRRILAPLKEGLEVFSDVSPRQIRILDVACGTGRTLKLIRAALPQAALFGTDLSPAY LRKANELLSQIPGELPQLLQANAEKLPYLDNYFHAVTSVFLFHELPAAARQQVIEQSY RVTKPGGIFIICDSIQMSDSPEMKPMMENFHETFHEPYYKHYTTDDLVERLEKAGFEN IRIQVHFMSKYFIARKPA" gene 3950..4849 /locus_tag="DP116_25110" CDS 3950..4849 /locus_tag="DP116_25110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873336.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF455 domain-containing protein" /protein_id="PRJNA477356:DP116_25110" /translation="MAVAYPRKFQNAIGARDILTQVVGDRDVHLITLNRYRYSEQRSC KDLTEVIEQLNGKPAELVRDLSHHISDEARHAMWLTDLLVELGANVGTPPGSSYIDEF DRLIDREFFNPEHNLEDSIIAGLAAINVTEKRGCEYFSAHIYALKQAPQTEENIKIRK TIEKILPEEAGHVRWGNRWLGQLADKSPEHRQKVEQAKLKYAAIEQAAFEAGMDITLG AELRRVAKLLEVANTMPVWQRPQYLMERLTQTLLAPDLQMTRIDVVQRVWNRDPQALM ERFVPMFLNGLKGMQDNRQKTKA" gene complement(5702..5920) /locus_tag="DP116_25115" CDS complement(5702..5920) /locus_tag="DP116_25115" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25115" /translation="MRAAISFLITGLFFSSLAINTQAPDIQVSSVDSQQLMAAVGNIK KSPKAPYRGSGRRCMQVLEQINSTHPVV" gene complement(6026..7798) /locus_tag="DP116_25120" CDS complement(6026..7798) /locus_tag="DP116_25120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318180.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_25120" /translation="MVHQTYTADVLVVGGGTGGTAAAIQAARRGAKTILVSEFPWLGG MLTSAGVSAPDGNELEAFQTGLWGAFLQELQHRQPGGLDNCWVSFFSYDPRIGAKIFA DWVQELPNLLWITGQVPLEVFQEGSCVCGVRFADLTVNAKITLDATELGDLLALADIP YRWGWELHSEWGEPSAPADFNHLTQRYPVQAPTWVVVMQDFGEAVAPEIPSAPNNDPS QFVGAWESYGAEQFLNYGRLPGGLFMINWPIRGNDYGEGVGRLIETEETKREFLQECF WHSQNFAHFIQNQLGRRYGLADNIFPHLAHSWFALHPYYRESRRVVGLTTVREQDILP ITGGMVAPLNIDAIAIANYANDHHYPGVEFQVQPKSMRWGGRRTGTPFTIPYRCLIPI ETDGLLVCEKNISVSHIANGATRLQPVVMGIGQAAGMAAALCVELDLSPRNLPVRVLQ EALLHDKYAPAAIIPLLNMLPKDPDWLHWQSYYLEEPELYQVNGNCPCLSSCQRHDVD SNNTITVRKIDCFQGIFHHISEQDYKFTITAPSIHRGRTWQIVTLRSHIHEQLQTFCD QQLLTICGRMNHALNWLIVENVSR" BASE COUNT 2244 a 1807 c 1602 g 2242 t ORIGIN 1 ccctgttccc tgttccctgt tccctgttcc ctgttccctg ttccctgttc cctctagtaa 61 gtttgtttac gtctatacaa acaaaacccg ccttaacagc tttaggaaag ttgagtctgt 121 ataggcggac tttgtttgtg taagtgtaaa tttcacctga ctaaggctaa gcgctgatct 181 cagattcaac aagttgtact tgctgtgctt gtggcgtcaa atgcttcacg ccttgcttaa 241 caaaactttg caataaagcc atttgctggt tctgctgaaa tacgctttgc aaagtttgcc 301 gcaacttgtc gagattcaac actgttccag agttaatagc gacttcctgc tgttggaaat 361 attcgactaa agttaaacct gcgatgcgag tcagataagc tgcactcact ccttgcacca 421 agccaccagc aacaaaggtg atagcgttag ttttaagtac tgtacttact gccttagaag 481 acagttccac taaacccagt ttcaccatca aacttcccat tgtaccagca acagtttgcg 541 cctggtccaa ggaaaatttt tgctgataga taccacccaa atccataacc atttgagcgt 601 taattgcggc tgttgctagg atatcaagcg ctggtactgg gttggcaaat gctgctgcag 661 cagctatcca ctggtattgt tcaattgatt tgacagcgcg atcgcgtctg atttcattca 721 gcaagttttt cgcttcagtt tttaccaaca aagcttttct cattgtagtt gcccacacca 781 actgttctgg ttgctgtgtc aacatttcac ccaactgctg cgttaactgt tgaatgtctg 841 gtgttggtag ttccagggat tcttgcacac aaccatcttc ttgatgcttc cgtactttga 901 taggatgagg agacgccgca gtcgctacaa caattgagga catccgatgt ttcagcgact 961 gcaacacact cacacgttca tctggcaaat actggtcttg tttattaaaa actaaaatag 1021 tgttttgatt gactgctttt agctgttgta 
agctttgaaa ttccgaatct gttaaatcac 1081 catttattag gaacaacaca acatcagatt tctttgttcg tgtcaaaaca tctgcatctg 1141 agttttcgcc tgcttctaca aacaaaggag gtgtttcttg gaattctacg aacgaaaagc 1201 ttcccgtctc tacgttttga caccaggtag attctaagac ttcaatcaaa gtcgttttac 1261 ctaccgattt gccaccagta acagcagcaa tcatttcttg tctatctaac tcagccgtca 1321 gatgagtaag ttgttctcgc aatgtcccta cggttagagg attttctgct tcttgtgcca 1381 gttggttaat cacagcttcg gctttggcaa tggtagcctc tactgttgta cggtctactg 1441 gtagagtgtt taaccattca gaactatctt gagtgcgatt tgactttaat aaccacaaac 1501 cgccgccaac ggctaacaga ctcaacaaac caaattcacc cacctgcact atcgaatggt 1561 gcaaattttg taacatccac agagaaaagg acagtcccaa tcctcccact aaaatcggtc 1621 gctgcaactt cacaactatg attctccgcg ctatgctgtt tctacttcag gatatctcaa 1681 aactcggctt ctccgaatca tattgccaga tgcaaaaagt tttctttact aatggtgttt 1741 ctaagattaa ctcccaaaaa tcacactgtg gtttttcatc tcaagccaag tgtttttcac 1801 tacttttcgt cactcataaa attgagcctt gtatccatct accaccctcc gggtatgcgc 1861 aaagcgcacg ccttacggct aacgccagtc gcctgttgtc gggagaacgc cagatgctct 1921 acttggggag accccaagac cgcactggct cccctcccgc agcgctggac tcaccgggag 1981 gaacatagat gcaacatata acaacggtga acaattttca aaaatcgttc accgttgtta 2041 gccgtcattg gcgttggcag gcataccctt agtggtggtc aataagatag tgatgagttc 2101 ggtgccctac actcgctagc cccattaggg gcgctacgca aacgcaagaa gtcaaaagta 2161 tcaagtcaaa agttaaaagt gacagtgttc tttatttttg aacgccaggt gctacaagcg 2221 ggggaacccc gccaacgcac tggctccttt tgatttttga tttttgactt ccccgtaggg 2281 gtgcccttac accatgcaac ccctatttgt tgacacttag ctcatagtcg cgttgctgcg 2341 atcataagtc atgcgagcgg taggatctat ttttccttga atagggttcc agtagtgaga 2401 gacttctccg ggaagaccta attcttcacc tgtacgctcc agcatcgatt gttgacgatt 2461 cttaatgagc tgatagtgac gcatcatcaa tgcacgggct ttttcttgag tagacatatc 2521 aaagttctcc ttaattgttt ctgtaatgtc ataatttagt gactttcatt aattattata 2581 caaaataatt ctgtaacata agctacggaa tatttatttt aataaaactt catacagtca 2641 caagagtagg gaactcagat ccacaaagga gctagggggt tgcgcagaga agttcctaca 2701 tcttgctaaa gactgatttt gcctcatgac atagaccttt 
cttaggattc atggtttatt 2761 aattaaaatt tttaaactaa tataaagaaa agtaaaaaaa ctcaagcaat tctcatgtct 2821 gacaccttaa ctaagctgac ttatcaaact tttcagcagg gtaaaaatta ttttggtcta 2881 gctcataaac tgctaagctc gcgcttgatg aatatcgtct ctcccagtga gcacagggaa 2941 attaaaccca taccaaacga gattttacta aaggctcaac aaaggctgaa taatctcctc 3001 gaagttgatt gggaagatgc tgagcgtggt gtctatccta aaagcgtgtt gtttgataat 3061 ccttgggaag actttttccg ctactatcca atggtgtggc tggatttacc ccaaatttgg 3121 gaacgtgctc agcacaagag atatcaagag tttactcaag atgtagacac aaatggttat 3181 cccagctatt atgtgcagaa cttccaccac cagacaaatg gctacttgag cgatttatca 3241 gcgtctttgt atgacttgca ggtagaaatt ctctttggtg gttctgctga tccaatgcgg 3301 cggcgcattc tggctcctct taaagaaggg ttagaagtgt ttagtgatgt gtcaccacgc 3361 cagatacgta tactagatgt tgcttgtgga actggtcgta ctttaaagtt gattcgagca 3421 gctttgcctc aagcagcttt gtttggtaca gatttatcac cagcttattt gcgtaaagca 3481 aatgaactct tgtcgcaaat tcccggagaa ttaccacaac ttttgcaggc aaatgcagaa 3541 aagttaccat acttggacaa ctacttccat gctgtcactt cagttttcct cttccatgag 3601 ctacctgcag cagcacgtca acaggttata gagcaatcct atcgggtgac aaaaccaggg 3661 ggaatcttta tcatctgtga ctcgattcag atgagtgatt ctccagaaat gaaaccaatg 3721 atggaaaact ttcatgagac ttttcatgaa ccttactata agcactacac tactgatgac 3781 ttggtagagc gtttagaaaa agcagggttt gagaatattc gtatacaggt tcacttcatg 3841 agcaagtact ttattgctcg aaagcctgct taatgtaaag aaaagctacg gtggctacat 3901 aaaattaata taatattaat actagtttac ataactctca taaaaagtta tggctgttgc 3961 ttacccacgc aaatttcaaa acgctatagg tgcaagggac atcttgacac aagttgtcgg 4021 cgatcgcgat gtgcatctga tcactctcaa ccgctaccgc tatagcgaac aacgcagttg 4081 caaagacctc actgaagtca ttgaacaact caacggaaag ccagccgaac tggtacggga 4141 tttatctcac catatctccg atgaagcccg tcacgccatg tggctgactg atttgttagt 4201 ggaattggga gcgaatgtag gaacacctcc tgggtcttct tatattgatg agtttgaccg 4261 tctgatagac cgagaatttt tcaacccaga acacaaccta gaggacagca ttattgccgg 4321 attggcagca attaacgtga ccgaaaaacg gggctgtgag tacttctctg cccacattta 4381 cgccctcaag caagcgcccc aaaccgaaga aaacatcaaa atccgcaaaa 
cgattgaaaa 4441 aattctgcca gaggaagcag gacacgttcg ctggggtaac cgttggttag ggcagttagc 4501 agataaaagt ccagaacatc gccaaaaggt tgagcaagcc aagctcaaat atgctgctat 4561 tgaacaagca gcttttgaag ctgggatgga tatcaccttg ggtgcagaac tgcgtagagt 4621 tgccaaactg ctagaagtgg caaacacaat gcctgtatgg caacgtcctc agtacctgat 4681 ggaacgctta acgcagactc tgttggcacc ggatttgcaa atgactcgga ttgatgtcgt 4741 tcagcgagtt tggaaccgag atccacaggc attgatggaa aggtttgtgc cgatgttcct 4801 caatggcttg aaagggatgc aagataaccg ccaaaaaacg aaggcataga acaaatgttt 4861 tgtatgtgat atgtgtgggg tgggcaatgt tatccctttc aaggtaagta aaaactggtt 4921 taaaacacca ttattttgac tctagcttcg tgttcttcgg gacatagaca ccaagaacac 4981 gagcaaaaga gtctcaactc tacttgcact ttgacaggga tatgtgggca atgtccaccc 5041 tgttttttgc caaatttcac ccgaatagct gtttcataaa ttctcgctta ccagctatcc 5101 ataacctcag tagtctttct tcaacgtgtt cttgaacaaa gttgttgtac tcaagagaga 5161 tttctcaata gtcatatcaa atccgcttca tatagatgtc cgcagcgact gcggacatac 5221 gataagtaat caaccggact tgatatcaca atatctaaaa gatatgggct ttaactaaac 5281 tgctagccca actaataccc tttcaagaga taacatgaaa gttgcgttca ggaatttcct 5341 gcttattttt tatttaagtg gctcttaatg cttgcaatac tagcagctag tactatttca 5401 cgaaattttt catacatatt caaacctcaa aaacagcacg gaaatcaagc catttgtatg 5461 acagtaaagt gaaacggtat aacttgcatc gcatctctat gcaaaagtta acaggtaact 5521 aagtaaacaa aaaattatgg gtgaaataca gaactattcc ctagagtgat tgtattttta 5581 gtatcacgtt tatactaggg agggtgtgtt actaattttc aaaacaccct ggcttacact 5641 atttagattc tctttgtaca aaacaattac caccccaaca aatagcatta actcccagtt 5701 gttagacaac aggatgtgta ctgttgatct gttccagcac ttgcatgcaa cgacgaccac 5761 tacccctgta gggggctttg ggagactttt tgatattgcc tactgccgcc atcaactgtt 5821 gtgagtcaac gctagatacc tgtatatcag gtgcttgggt gttaatagct aaggagctaa 5881 agaacaatcc tgtgatcaga aatgaaatgg cagcacgcat aataaatgtc tattttagaa 5941 ctcttccttt cactactact tagttttctt tactgcatcc ggaaaaatcc ggaaatagta 6001 tttattttta cttttaaatt tccttctatc tgctgacgtt ttcaactatc aaccaattaa 6061 gagcgtgatt cattcgaccg caaatggtga gcagttgctg atcacaaaaa gtttgcaatt 
6121 gctcatggat gtgcgatcgc aaagtcacaa tctgccaagt acgcccccta tggatacttg 6181 gcgcagtgat agtgaatttg taatcttgtt cagatatgtg gtgaaagatg ccctggaaac 6241 agtcaatctt tcgtacagtt atagtattat tgctatcaac gtcatggcgt tgacaagagg 6301 ataaacaagg acaatttcca ttaacttgat ataactctgg ttcttctaaa tagtacgatt 6361 gccagtgcaa ccaatccgga tcttttggta gcatattcaa taatggaata attgccgcag 6421 gtgcatattt atcatgcaat aaagcttctt gtaaaactct aactggtaaa ttccgggggc 6481 ttaaatccaa ctcaacacat aaagccgcag ccatacctgc agcttgaccg atacccatca 6541 ccacaggctg caatctggtt gctccattag caatatgaga aacagaaatg ttcttctcgc 6601 ataccagtaa accatctgtt tctataggaa ttaaacaacg gtagggaatg gtgaaggggg 6661 ttcccgtcct ccgtcctccc cagcgcatcg atttgggttg cacttggaac tcgacacccg 6721 gataatggtg gtcgttggcg tagttagcga tcgctattgc atctatattc aaaggtgcaa 6781 ccatacctcc agttatgggc aaaatgtcct gttctcgaac ggtcgttagt cccactacgc 6841 ggcggctttc ccggtagtaa gggtgaagcg cgaaccagga gtgagccaga tgagggaaga 6901 tgttatcagc taaaccgtag cgacgaccaa gctggttttg gataaaatgg gcaaaatttt 6961 ggctgtgcca aaagcattct tggagaaact cacgctttgt ttcttctgtt tctattaatc 7021 gccccacacc ttcaccgtag tcattgccac gaattggcca attaatcata aatagaccac 7081 cgggcaaacg tccgtaattc aaaaactgtt ctgcaccata gctctcccaa gcaccaacaa 7141 actgtgatgg gtcgttgttc ggtgcgcttg ggatttctgg tgcaactgct tcaccaaagt 7201 cttgcatcac caccacccaa gtcggagctt gcacgggata tctttgtgtg agatgattaa 7261 aatctgctgg ggcgcttggt tctccccact ccgaatgtaa ttcccagccc caacggtaag 7321 gtatatcagc taaagctaat aaatctccta actctgtagc gtcgagagtt atcttggcat 7381 tgacagtcaa atctgcaaat cggacgccac aaacgcaact cccctcctgg aaaacttcta 7441 acggcacttg tcctgtgatc caaagcaaat tgggcaattc ctgcacccaa tctgcaaaaa 7501 tctttgcccc aatacgcgga tcgtaactaa aaaagctaac ccaacaattg tctaatcctc 7561 ctggctgtcg gtgctgtaac tcctgtaaaa acgcacccca taaccctgtc tgaaaagctt 7621 ctagttcatt cccatctggt gcagatacgc cagcggaagt tagcattccc cccaaccaag 7681 gaaattcact cacaaggata gtttttgccc cccgtcgcgc tgcttggatg gcggcggctg 7741 ttcctcctgt tccaccgccg acgactagaa cgtcagctgt gtatgtttga tgaaccattt 7801 
ttgaacctcg ctaaaacagt gctaagtacc gaatactaac tacaactcat cattaatagg 7861 gttcggaaat caaaattatg ccaaaacagg gaaat // LOCUS NODE_4082_length_7886_cov_4.897842 7886 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7886) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7886) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7886 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..345) /gene="nifH" /locus_tag="DP116_25125" CDS complement(<1..345) /gene="nifH" /locus_tag="DP116_25125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877702.1" /note="nitrogenase iron protein; nitrogenase component 2; with component 1, a molybdenum-iron protein, catalyzes the fixation of nitrogen to ammonia; nitrogen reductase provides electrons to the nitrogenase complex; in R. etli there are three essentially identical copies of nifH which are actively expressed during symbiosis; Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="nitrogenase reductase" /protein_id="PRJNA477356:DP116_25125" /translation="MMSDEKIRQIAFYGKGGIGKSTTSQNTLAAMAEMGQRIMIVGCD PKADSTRLMLHSKAQTTVLHLAAERGAVEDLEIEEVMLTGFRGVKCVESGGPEPGVGC AGRGIITAINFLE" assembly_gap 469..478 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(682..1038) /locus_tag="DP116_25130" CDS complement(682..1038) /locus_tag="DP116_25130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743417.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="group 1 truncated hemoglobin" /protein_id="PRJNA477356:DP116_25130" /translation="MSTLFDKLGGQQGLEQVVDEFYKRVMADNTLSKFFANTNMDKQR QKQVDFFAKIFDGPDQYKGRSMDATHTGMNLQQQHFDVIAKYLNEALAARGVSSEDAN AAVGRVESLKGTILNK" gene complement(1249..2154) /gene="nifU" /locus_tag="DP116_25135" CDS complement(1249..2154) /gene="nifU" /locus_tag="DP116_25135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Fe-S cluster assembly protein NifU" /protein_id="PRJNA477356:DP116_25135" /translation="MWDYTDKVMDLFYNPKNQGELEDSTEPGIKIAVGEVGSIACGDA LRLHLKVEETTEKILDARYQTFGCTSAIASSEALVDLIRGSTLDEALKLSNKDIANYL GGLPQAKMHCSVMGQEALEAAIYNYRGIAHEVHEDDDEGALVCTCFGISDAKIRRVIL ENNLTTAEEVTNYVKAGGGCGSCLATIDDIIASVRKESATPVSHSLNKNTTSPQATKL LTPVQKIALIQKVLDEEVRPVLIADGGDVELYDVDGDSVKVLLQGACGSCSASTATLK IAIESRLRDRVSKDLVVEAVEPSLL" gene complement(2245..3459) /gene="nifS" /locus_tag="DP116_25140" CDS complement(2245..3459) /gene="nifS" /locus_tag="DP116_25140" /EC_number="2.8.1.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cysteine desulfurase NifS" /protein_id="PRJNA477356:DP116_25140" /translation="MHKDCIYLDNNATTKVDPAVLEAMLPYFSDYYGNPSSMHTFGGQ VGKAVRLAQQQVAALLGADESEIIFTSGGTEGDNAAIRAALLAQPEKRHIITTQVEHP AVLNLCQQLETQGYSVTYLSVNRQGQLDLNELEASLTGNTALVTIMYANNETGTVFPI EQIGLRVKESGALFHVDAVQAVGKIPMNMKTSTIDMLTLSGHKIHAPKGIGALYVRRG VRFRPMIIGGGQQRGRRAGTENVAGVVALGKAAELELLHLEEATAREKRLRDRLEQTL ITTIPDCEVNGDPAQRLPNTTNIGFKYIEGEAILLHLNKHNICASSGSACSSGSLEPS HVLRAMGLPYTILHGSIRFSLCRYTTEAEIDAVLAVMPSIVERLRALSPFKSDNAGWL QQQEKAVLGVGR" gene complement(3586..3933) /locus_tag="DP116_25145" CDS complement(3586..3933) /locus_tag="DP116_25145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314877.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="PRJNA477356:DP116_25145" /translation="MAYKITSKCISCKLCLSACPTGAIKIVDGLHWIDPNLCTNCDGT AYSVPQCAAGCPTCDGCIKERGDYWESWFATYNKLVTKLTKKEEYWDNWFNFYSQKFS EQIQKHQTSGIEA" gene complement(4183..5646) /gene="nifB" /locus_tag="DP116_25150" CDS complement(4183..5646) /gene="nifB" /locus_tag="DP116_25150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459221.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogenase cofactor biosynthesis protein NifB" /protein_id="PRJNA477356:DP116_25150" /translation="MTQSTRLVTSLVTESTPTKAKSSGCGGCSDDSSATVEMEEKLKQ RIAQHPCYSEEAHHHYARLHVAVAPACNIQCNYCNRKYDCANESRPGVVSELLTPEEA AHKALVIAGKIPQLTVLGIAGPGDPLANPEKTFRTFELIAEKAPDIKLCLSTNGLMLA DYVDRIKQLNIDHVTITMNTVDPEIGEKIYSWVHYNRKRYRGIEGAKILLEKQMEGLQ ALKEADILCKVNSVMIPGINDEHLTEVNKVIRSKGAFLHNIMPLISAPEHGTHFGLTG QRGPTPKELKTVQDNCAGNMKMMRHCRQCRADAVGLLGEDRSQEFTKDKFMEMTSEYD LEKRQEVHAGIEKFKEELKVAKEKALTAVETTNGASVQSSPILVAVATKGGGLVNQHF GHAKEFQIYEVDGNKVSFVGHRKVDHYCQGGYGEKATLENIIQTIADCKAVLVSKMGD SPKEKLQTLGIQTVESYDVIETVALEFYQQYLQEQGN" gene complement(6145..6381) /locus_tag="DP116_25155" CDS complement(6145..6381) /locus_tag="DP116_25155" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25155" /translation="MLSISSKDNHTNKILKKPELHFFLEKNDILYTELNFAIFWSSAP LSAPDFQNQNFSSGSALGIISSAFEYKVKNRKKR" gene 6627..7571 /locus_tag="DP116_25160" CDS 6627..7571 /locus_tag="DP116_25160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877095.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)-dependent alcohol dehydrogenase" /protein_id="PRJNA477356:DP116_25160" /translation="MKAVVIRRYGSPEVFQYEDVEPPKIKPNQLLVKVNATCVNPVDW KMRKGMLKIITGNKFPMILGFDLSGEVIEVGSQVTQFKPTDVIYGSISPSGGAYAEFA AVLEKNAALKPTNMTYEEAASVPVAALTALQGLRDLGNIQPNQKVLVNGGSGGVGIFA VQIAKALGAQVTAVCSTKNIDFVKSLGADRIIDYTQQDFTQELGQYDIILDAVAKQSF SGCKQALKPNGIYVTTLPSLESFVQIILTALIPGKKAKFILEKPNTQDLVYLKELIEA GKIRSVIDRTYPLQELAAAHTYSETERAVGKIAIAVAN" BASE COUNT 2047 a 1701 c 1718 g 2410 t 10 others ORIGIN 1 ttccaagaag ttgatggcgg tgatgatacc ccgaccggcg caacctacac caggctcagg 61 accaccagat tccacgcact taacaccacg gaagccggtg agcatcactt cttcgatttc 121 caggtcttcc actgcacccc gttcagcagc caagtgaagc acggtggttt gagctttgct 181 gtgcaacatc aaacgggtag agtctgcttt ggggtcgcaa cccacaatca taatgcgctg 241 acccatttct gccatagcgg caagggtgtt ttgggaggtg gtagatttac cgataccgcc 301 tttaccgtag aatgctatct gtctgatttt ttcgtcggac atgatgagtt tttctcctgc 361 aattgattgc taattgattg gttggtttgg tgggtctttg ctttttgtga gtcacgcctg 421 taaaattaca gcggtgaccg cttgtagaac gaatcaggtg gctttagcnn nnnnnnnngc 481 tctaccccta aaactatcca aagcaagaac atcatttatc ctcgttaaat ttttttgggt 541 ttacgcactt tacaagtcac tcgtaatgtg cattgtcctg aaacaccaag taagacttga 601 cccccgctac cctaaaattc aaaattcaaa gttctaaatt caaaattttg aattcccccc 661 aacgggggcg tagctgctac atcacttgtt caaaatagtg cccttcaaac tttcgacgcg 721 accaactgca gcgtttgcgt cctctgatga cactccgcgc gcagctaatg cctcattcag 781 gtattttgca ataacatcga aatgttgttg ctgtagattc atacctgtgt gtgttgcgtc 841 catggagcga cccttgtatt ggtctggacc gtcaaaaatc ttagcgaaga aatccacttg 901 tttttgacgt tgcttgtcca tgttggtgtt ggcgaaaaac ttactgaggg tgttgtcagc 961 catgacgcgt ttgtagaatt catccactac ttgctcaaga ccctgttgtc caccaagttt 1021 gtcgaataat gtgctcatat ccttttcctt tgaagcgagt tggttggata atgaaattcg 1081 cgctcttagc taaaaagctt cacttgcgtg ctcttagcga gtgaatttag ctgtccgttg 1141 ttagtaggac tcaagcattg ataggaagga agataatgat atgagtcaaa tcctacttga 1201 tgtttcaggg taatgcatac tagggtatct ttgtaaagtg cgtaggtttt agagcaacga 1261 tggctcaact 
gcttctacga caaggtcttt gctgacgcga tcgcgcaatc tagattcaat 1321 cgcaatcttg agtgttgctg tgctagcaga acacgaacca caagcacctt ggagcaaaac 1381 ttttacagaa tcaccatcta cgtcgtatag ttctacgtct ccgccatcgg cgatcagaac 1441 aggtctgact tcttcatcta agactttttg aatcagtgcg attttttgca caggtgtcag 1501 cagttttgtg gcttgcggac tagtcgtatt cttgttcaaa gaatgactta cgggggtagc 1561 agattcttta cgcacagatg caattatatc atcaatcgtt gccaaacaag atccgcatcc 1621 gccacctgct ttgacataat ttgtgacttc ttcggcggtg gtgagattgt tttctagaat 1681 cacacgccgg atttttgcat cactaatgcc gaagcaggtg caaactaacg ctccttcatc 1741 atcgtcttca tgaacctcgt gagcaatgcc tcggtaatta taaatagctg cttctagcgc 1801 ttcttgtccc atcaccgagc aatgcatctt tgcttgtggt aatccaccga gatagttagc 1861 aatatctttg ttgctgagct tgagagcttc gtctagagtt gaacctctaa ttaagtctac 1921 cagagcttct gaagatgcga tcgcactagt acaaccaaaa gtttgataac gagcatcaag 1981 aattttttca gttgtttcct caactttcag gtgcaatctc aaagcatctc cgcaagcaat 2041 gctaccgact tctccaacag caattttgat tccaggttct gtagagtctt ccaattcccc 2101 ttgatttttg ggattgtaga agagatccat cactttatct gtatagtccc acatagccga 2161 tttcagattt gttgattttg aactttggat tgagtcataa ggaatttggg gagtagggag 2221 agagaaccgt ctcccttatc ttcctcatct ccctacacca agcactgctt tttcttgttg 2281 ttgcagccaa cccgcattat cactcttgaa gggtgagagg gcgcgtagac gttctacaat 2341 tgagggcatg actgccagaa ctgcatcaat ttcggcttct gttgtgtaac ggcaaaggct 2401 aaagcgaatg gaaccatgta agatggtgta gggtaagccc attgctcgca gaacatgaga 2461 gggttccaaa gagccagaac tgcaagcaga accggatgag gcacaaatgt tgtgtttgtt 2521 taggtggagg agaattgctt caccttcaat atatttgaaa ccaatattgg tggtgtttgg 2581 caatctttgt gctggatcac cgttgacttc gcagtcagga attgtggtga tgagagtttg 2641 ctctaggcga tcgcgtaaac gtttttctct agctgtcgct tcttctagat gcaataattc 2701 tagctcagct gctttgccta aagcaacaac tcctgcaaca ttttctgttc ctgctcttct 2761 accgcgctgt tgtcctccgc cgataatcat cggacggaac cgaactccgc gccgcacata 2821 caatgcacca atccctttcg gtgcatgaat tttgtgacca gacagagtca gcatatcaat 2881 ggtgcttgtc ttcatattca tggggatttt ccccactgct tgcaccgcat caacatggaa 2941 gagtgcacca ctttctttta 
cgcgcaatcc aatctgctca attgggaaaa ccgtaccggt 3001 ttcgttattg gcatacataa tcgtcaccag ggcagtattg cctgtcaacg aggcttcgag 3061 ttcattcaaa tccaactgcc cttgacgatt caccgaaaga taggtgacag aataaccttg 3121 agtttccaac tgttggcaca gattcagcac tgctgggtgt tcgacttgtg tagtgatgat 3181 gtgtcgcttt tcaggttgtg ctaacagtgc tgcgcgaata gcggcgttat ctccctcagt 3241 tcctccactg gtaaagataa tttctgattc atcggcacct aggagggcgg ctacttgttg 3301 ttgtgcaagt cttaccgcct ttccaacttg cccaccaaag gtgtgcatac tggagggatt 3361 gccgtaataa tcgctgaagt agggcagcat cgcctctaga actgctggat ctaccttagt 3421 ggtggcattg ttgtctagat atatgcaatc tttgtgcatc gcttccaatc ttactttgat 3481 gacaatcaaa aatttatgaa aatggttgtg ggatgtcata acagttatca gttatcagtt 3541 accagttatc aattgttcac tgtttactgt tcactgttca ctgatttaag cttctattcc 3601 tgatgtttga tgtttctgta tttgctcaga aaatttttgg gaataaaagt taaaccagtt 3661 gtcccaatat tcttcttttt ttgttagttt tgttacgagt ttattataag tagcaaacca 3721 ggactcccag taatccccac gttctttgat gcaaccatcg caggtgggac aaccagccgc 3781 acactgaggg acgctgtaag cagtaccatc acagtttgtg cagaggttgg ggtcaatcca 3841 gtgaagaccg tcaactattt tgattgcacc agtgggacag gcagaaagac acaatttgca 3901 ggaaatgcac ttgctggtaa ttttgtaagc catgactatt ccccttacta tcttagaaat 3961 tgtggattgg gaattgggta ctgggtagta ggtattgggt aatggggaga aggaaaagtg 4021 aaaagttaaa agttaaaaat tcctctgatt tctaactttt gacttttgac tcttacgagt 4081 tcgccagttg acggcacttg ctacaagtcg gcggagccgc ccaacgcagt gcctccttta 4141 tgccggggaa cccgtccacc gcactgactc actttcgacc ttttagttcc cttgttcttg 4201 gaggtattgc tggtaaaatt ctaaagcaac cgtctcgatc acatcatagg attcaaccgt 4261 ctgaattccg agtgtttgca atttttcctt gggactgtca cccattttag aaaccaacac 4321 tgctttgcaa tcagctattg tctgaatgat attttctaga gtggctttct cgccgtatcc 4381 accttgacaa taatggtcaa ctttgcggtg tccaacaaag gaaactttat tgccatcaac 4441 ttcataaatc tggaattcct tggcatgacc aaagtgttgg ttcaccaatc cgccaccttt 4501 agttgctact gcaactagga ttggactgct ttgtacagat gcgccattgg ttgtttctac 4561 agcagtgaga gccttttctt tggctacttt cagttcttct ttaaacttct caatccctgc 4621 atgaacttct tgacgttttt ctaagtcata 
ttctgaagtc atttccatga atttatcttt 4681 agtaaattcc tggctgcggt cttcacctaa tagaccgact gcatcagcac gacactgacg 4741 gcaatgacgc atcatcttca tgttaccagc gcaattatct tgcactgtct taagttcttt 4801 tggtgtgggt ccgcgctgac cagttaatcc aaagtgggtg ccgtgttctg gtgcagaaat 4861 cagcggcata atattgtgaa ggaatgcgcc tttggagcga atgaccttat taacttctgt 4921 taggtgctca tcattaattc ccggaatcat caccgagttg actttgcaca aaatgtcagc 4981 ttctttgagg gcttgtaatc cttccatctg cttttcaagc agaatttttg ctccttcaat 5041 gcctctataa cgcttgcggt tgtagtgaac ccaagaataa attttttccc cgatttctgg 5101 gtctaccgtg ttcatggtaa ttgtaacatg atctatgttg agttgtttaa tgcgatcaac 5161 atagtcagcc agcattaaac cgttagttga cagacaaagc ttgatatctg gcgctttttc 5221 tgcaatcaac tcaaaggtgc ggaaagtttt ttctggattt gctagtggat cgccaggacc 5281 tgcaattccc aacacggtta attgaggaat cttgcctgca atcactaaag ctttgtgtgc 5341 tgcttcttct ggagtcagca attcactcac cactccagga cgactctcgt tagcgcagtc 5401 gtatttacga ttgcagtagt tgcattgaat attgcaagca ggggcaactg caacgtgcaa 5461 ccttgcatag tggtgatggg cttcttcact gtagcaggga tgctgggcaa tccgttgttt 5521 gagcttttcc tccatttcca cagtggcgct gctatcatcg ctgcatccgc cgcaaccact 5581 tgattttgct ttggttggag tggattctgt aactagagag gtaacgagtc ttgtagactg 5641 tgtcattgaa tttcgcatgc ttgtcaggtg gttgccaccg tggaagatgc catagccact 5701 ctgttgaagt aagaatggaa gccgcgtctt gtactcgcgc tctttaccca aagttttcaa 5761 gggctggtgt gtcaacgcta aagacttgag ggaaatttgc gcgttaaggc gcgacttctg 5821 ttcacagggg catggtgcta tctcatccct ccactcactc actgcgctgt ttgtgtttgc 5881 gctttaagtg ttggcgatat aagttttata tcagttgtct tgaataaggg ctgcgcttgt 5941 gctctggata gaatttattg gattgattag acttgtactc aatactaaaa acctttaccc 6001 tttaagcatt tgaggagtta caaaattttt ttctgggtag gggtactaga atttttcggt 6061 tggggttcga gtatttatgg aaatgggggc aagtatttgt gtactcagac aaaactcaat 6121 ccctgcaagg acttgagcac aatcttacct tttttttcgg ttctttacct tgtactcaaa 6181 ggctgatgaa ataatgccta gagcactgcc tgaggagaaa ttttgatttt ggaaatctgg 6241 agcagataac ggagcgctac tccagaaaat tgcgaaattc aattctgtat acagaatatc 6301 atttttttcg aggaaaaaat gcaattcagg tttttttaat 
attttgttgg tatggttatc 6361 tttgctagat attgatagca ttttcctcac taattttggc tggctttcag gcgtaaacaa 6421 ggaaaaataa ggttttttac cgagttaatt tgacctgtta atagaatttt atctcaatta 6481 actgattttg aacaggtgga aaaagatgtc tgtcaaaaaa atcatgatta ttggttgctg 6541 aaaaactacc aacaccatac tctcgttaca ctgagaaaaa tattgtcctc attctgccca 6601 aaaatcaggg tactaagagt aaaactatga aagcagtggt tattcgtcga tatggctcac 6661 ctgaggtgtt tcagtacgaa gatgtagaac ctccgaaaat taagcctaat cagttacttg 6721 tcaaagttaa tgcgacttgc gttaatcctg ttgattggaa aatgcgcaaa ggaatgttga 6781 aaattatcac gggtaacaaa tttcccatga ttctggggtt tgatttatcg ggagaagtga 6841 tagaagttgg ttctcaagtt acgcaattca agccaacaga tgttatctac ggctctatta 6901 gtccatcggg aggagcttat gcagaatttg cagccgttct agaaaagaat gcggctctta 6961 aaccaacaaa catgacttat gaagaagcag cctctgtccc cgtagcagca ctcactgcac 7021 tgcaagggct gcgagacttg gggaacattc aaccaaatca aaaggttctt gtgaatggcg 7081 gttctggtgg tgtgggtatt tttgcagtgc aaattgctaa ggctctaggt gcacaagtaa 7141 ccgcagtttg cagtacaaaa aacattgatt ttgtaaaatc tttgggagca gaccgcatta 7201 tcgactatac acaacaagac ttcacccaag aattagggca gtacgacatc attttggatg 7261 cagttgctaa gcaatccttt tctggttgta aacaagcttt aaaaccgaat ggaatttacg 7321 tgacgacact ccccagtctt gaaagctttg tgcagattat tttgacagcg ttgattcctg 7381 gtaaaaaagc aaaatttatc cttgaaaagc ctaatactca ggatttagtt tacctgaaag 7441 agttgattga ggctggtaaa attcgctcag tgatagaccg cacctatccc ctacaagaac 7501 tcgctgcagc tcatacatat agcgaaacag agcgagcagt aggtaaaatt gccattgccg 7561 ttgcgaatta acctttggaa ctttttttgt gggaactatc aaaagacgag tcgaaaattg 7621 tactagaaaa gcctgcaaat gcaggctttt ttatttttgt catatcaagt tcgtctaatt 7681 acaataagaa acctcacccc gccctccggg cacccctctc cgcgagttca ccagagggga 7741 tgggggtgag gttcttcgtt ttttataagt attcatccgg acatgatatc agatgcagaa 7801 aatgcataaa aatattgctt ttatttcata aaaaacgctc aaaattctta tccaaacgga 7861 taaaatagat ccacacagtt aatcaa // LOCUS NODE_4091_length_7853_cov_34.444345 7853 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7853) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7853) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..7853 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..464) /locus_tag="DP116_25165" CDS complement(<1..464) /locus_tag="DP116_25165" /inference="COORDINATES: protein motif:HMM:PF00496.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_25165" /translation="MHGERAAMAIARFVADRADLVLGGRFTDWPLLGMAGIQESALRI DPAAGLFGFAITRREGFLSTAAGRQAVAMAINRPSLLATWRSDWAPAETILPEALDSA RPPARPDWAGLNQDIRLALARTRVQSWAADNGAPVVRIHVPPGSGGTLLWARV" gene 1260..2648 /gene="fumC" /locus_tag="DP116_25170" CDS 1260..2648 /gene="fumC" /locus_tag="DP116_25170" /EC_number="4.2.1.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015458802.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class II fumarate hydratase" /protein_id="PRJNA477356:DP116_25170" /translation="MTATRTERDSLGDVDVPATAYWGAQTQRSISNFPFPATERMPIA IIHALGVVKQAAARVNRRHDLPADLANAIDTAAGEVAAGRLDDQFPLVIWQTGSGTQT NMNVNEVIAGRANELLTGARGGKSPVHPNDHVNRGQSSNDSFPTALHIASAVGVHHRL LAGLDRLHAALDAKARAWSAIVKIGRTHLQDATPITLGQEFSGYAHQIARARERIDAA SAEMLLLAQGGTAVGTGLNAPAGFDQAVAAEIAAITGLGFRTAPNKFEALASNDPLVQ LSASLATLAVALTKIANDIRLLGSGPRSGLGELRLPENEPGSSIMPGKVNPTQSEMLT MVAAQVIGNHQAVTLGGLQGHLELNVFKPLIGAAVLRSIDLLAVAMTSFAERCVEGIE PDRARIADLVDRSLMLVTALAPAIGYDNAAQIAKHAHAEGLTLREAGLALGLVDQATF DRLVRPEAMVGD" gene 2701..2916 /locus_tag="DP116_25175" CDS 2701..2916 /locus_tag="DP116_25175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007406046.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25175" /translation="MGATTLGALIGGAIDNMSGDDGVADGAMIGAITANVLKVAVPVI ATYLVGWAVLRGIEQAADRLMEKGQTA" gene 2913..3110 /locus_tag="DP116_25180" CDS 2913..3110 /locus_tag="DP116_25180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017979342.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25180" /translation="MIGRIIGAMVGREIDRRDGAGGIKGAAMGWIAAGALRRMGPLGL VLGGGYVAKKIYDRAKGRGRI" gene 3168..3821 /locus_tag="DP116_25185" CDS 3168..3821 /locus_tag="DP116_25185" /EC_number="2.7.4.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007406092.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="guanylate kinase" /protein_id="PRJNA477356:DP116_25185" /translation="MAFTPDPDPHGFHRRGVLFVLSSPSGAGKSTIARRLLAAEPNLG MSVSATTRPIRPGEQDGVDYHFISTDRFKQMVADQAFLEWAHVFDHRYGTPKAQVEAM LARGQDVLFDIDWQGAQQLFQLKGGDVVRVFILPPSMRELRRRLDARGTDAQDVIQRR MERAEREASHWDSYDYVLVNDDIEACFEQVQTILHAERLKRSRQPGLIGFIRELGKS" gene complement(3837..4532) /locus_tag="DP116_25190" CDS complement(3837..4532) /locus_tag="DP116_25190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019370880.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_25190" /translation="MRRFWRDAAVAPVPDGWSVALDGKPVHTPGRALLTLPGERLAEA VAAEWRGVGDVLDPRAMPLTGLANAAIDRIAPAPEAFARQLAAYGESDLLCYRADDPP GLVAQQHALWDPPLDWARSRYDVAFALTAGVMHVDQPMATTDRLRAAVLAFDAFGLAG LSPIVTTTGSLVLALWLTEGAADPDTVWTAACCDEDWQADQWGREPLAEQARAARHAE FQAGTTFLALLGD" gene complement(4529..4750) /locus_tag="DP116_25195" CDS complement(4529..4750) /locus_tag="DP116_25195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010544346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25195" /translation="MTDADLQARNRYFTIVAVNLAATVGAIFGLVLMGRSAGLEGRLL GAAILIAGVYVMAVVPRSLARRWRTPPGS" gene complement(4747..5406) /locus_tag="DP116_25200" CDS complement(4747..5406) /locus_tag="DP116_25200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010544345.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="haloacid dehalogenase" /protein_id="PRJNA477356:DP116_25200" /translation="MNRLAIFDCDGTLVDGQANICLAMERAFDDHRLPPPDRHAIRRI VGLSVPDAVAQLAPTLDVRRHLAVAEDFKRHFQAMRSNGGLLDEPLFEGIAEGLARLA SAGWRLGVATGKSDRGLRMVLEHHGLAGHFLTLQTADRHPSKPHPSMVLTAMREAGAT PDATVMIGDTSYDMAMAVAAGAHPVGVGWGYHEPHELADAGAVFVAAHATALFNYLEA R" gene complement(5403..6014) /locus_tag="DP116_25205" CDS complement(5403..6014) /locus_tag="DP116_25205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010218443.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="FMN-binding negative transcriptional regulator" /protein_id="PRJNA477356:DP116_25205" /translation="MHPNPTFRWQDEAAVRAFVRARSFAQLFATTPDGPAVAHLPVTL ADDDTLRFHLARSNRLASHLAGATGLIVVNGPDGYISPDWYGLGPDEVPTWNYLAVEI VGQIELVDHAAMMDQIDRLGQEHERALAPKPEWRRDKADPGKIDRMASAIRGFRLIPS AWRATAKLNQNKPEAARLAAADAVAARGQHDLAQWMRDPPARP" gene complement(6014..7102) /locus_tag="DP116_25210" CDS complement(6014..7102) /locus_tag="DP116_25210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007406124.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA pseudouridine synthase" /protein_id="PRJNA477356:DP116_25210" /translation="MSEVRQFTVDADDDGIRLDRWFKRHLPDTSFNTVSRWARTGQLR VDGARATPGDRVAAGQSIRVPPPEAAPASAPRPARQRPALSEEQVAFAQSLVIHRDAQ AIVLNKPPGLATQGGTRTHDHVDGLLDALQFDQTGRPKLVHRLDKDTSGALLLARTAR AAAYFSKHFSGRSAKKVYWALVIGVPEIEDGMIDLPIAKQPGTGGEKMHVDEAEGQPA RTRYRVIERAGNRCAWVELQPFTGRTHQLRVHMAAIGHPIVGDGKYGGQAAFLTGGIS RKMHLHARRIRIDHPDGGRLDVTADLPAHIAATMDTLGFDPALGEAALIDEAPLPPSR ERQKAKARQHAKAVRKERRGERRGRGQR" gene complement(7099..7476) /locus_tag="DP116_25215" CDS complement(7099..7476) /locus_tag="DP116_25215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010339636.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fluoride efflux transporter CrcB" /protein_id="PRJNA477356:DP116_25215" /translation="MWNLLLVMLGGAIGAGARYAVGRASLAALGPGYPWGTLMVNLAG GLAMGLLAAWLARGASGGEPVRLLLGVGVLGGFTTFSAFSLEIVTMMERGDWIAALAY ALLSVVGAVAALMAGLAVGRVLA" gene complement(7536..7817) /locus_tag="DP116_25220" CDS complement(7536..7817) /locus_tag="DP116_25220" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25220" /translation="MIPNARLVEAGGTSSGSCALRVASFTTDLPLQTALDWYYTRATD AGYTAEHQAEGDDHVLGGTRARDDAAYVVFLKPAKQGGTAIELIANRGA" BASE COUNT 1260 a 2701 c 2689 g 1203 t ORIGIN 1 acgcgtgccc acagcaacgt gcccccgctg ccagggggta catggatgcg aacgaccggc 61 gcgccattgt cggcggccca ggactgcacc cgcgtccgcg cgagcgccag gcggatgtcc 121 tggttcagcc ccgcccagtc gggccgggcg ggcggccgcg ccgaatcgag cgcttcgggc 181 aggatcgtct cggcgggcgc ccagtcgctg cgccaggttg cgagcaggct gggccggttg 241 atcgccatcg ccaccgcctg ccggccggcg gcggtcgaca gaaagccttc gcgtctggta 301 atcgcaaagc cgaacaggcc ggcggcggga tcgatgcgga gcgcgctttc ctggatgccc 361 gccatgccca gcagcggcca atcggtgaag cgcccaccca gtaccaggtc cgcccgatct 421 gccacgaatc gcgcgatcgc catcgctgcg cgctcgccat gcacgcgcac ccgatcggca 481 ggacgcggct cttcgctttc cggcattcgc tgcggatcga acaccggggt cagggtaacc 541 agtcccggcc gctgctccat gaagcggaac gggccgctgc cgaccaggcc acgcgaccgg 601 acgatcgcaa gctcgggctg ggcgagcaac tgcaacaggt tgggcaccgg gcggcgcaac 661 cggatttcga cgatctgcgg cgtcatctcg gccacttcct cgatattggc gaggaaaggc 721 tgaagcggat tggcactggt gtcggcgcgt gcccgatcca gcaccgcgac gacgtcgcca 781 gcggtgaccc gcgaaccgtc tggccattcg gcctcggcca ggcggaagat gtagctggcg 841 ccgtcgtcga tcacgatcca gcgttgggcc aaccccggtt cgacctggcc attggcatcg 901 aacctcacca gcccctgtgc ggtcgcggcc agtgcgacgc gctgcggttc ggtcagcgcg 961 cggcgcgagg ggtcggccag gtcgatgctg ccgccgatga cgctgacgtc taccggctgg 1021 tcatcctggc tgcggtcgca ccccgtcagg gccggcaggg tcagcatcgc aagggtcagg 1081 atcgggaggg caaggcgtcg gggcatcagg ctgctcgaca gacgacggga cccccggctc 1141 ctatcgtttg tacccggcaa atggaatccc gttcgtgtcg ggacgggcgg tacgggcttg 1201 cgctatccgc ccgcgccccc aagctaggcg gatcaatgcc ccagattccg gagaccgacg 1261 tgaccgccac tcgtaccgaa cgcgattccc tgggcgatgt cgacgtcccg gcaactgcct 1321 attggggcgc tcagacccag cgatcgatca gcaatttccc ttttcccgcg accgagcgga 1381 tgccgatcgc gatcatccat gcgctgggcg tggtgaagca ggccgcagcc cgggtgaacc 1441 ggcgccacga cctgcccgcc gatctggcaa atgcgatcga 
tacggcggcg ggggaggtcg 1501 cggcggggcg gctggacgac cagtttccgc tggtcatctg gcaaacgggc agcggcaccc 1561 agaccaacat gaacgtcaat gaggtgattg cggggcgggc gaacgagctg ctgaccggcg 1621 cgcggggcgg caagtcgccg gtgcatccaa acgaccatgt caaccgcggc caatcgtcca 1681 acgacagttt ccccaccgcg ctccacattg catcggcggt tggggtgcac catcgtctgc 1741 tggctgggct ggaccggctg cacgcggcgc tggatgcaaa ggcgcgtgcc tggtccgcga 1801 tcgtgaagat cggccgcacg catctgcagg atgcaacccc gatcacgctg gggcaggagt 1861 tttccggcta tgcccaccag atcgcccgcg cgcgggaacg gatcgatgcg gcgtcagccg 1921 aaatgctgct gcttgcccag ggcgggaccg cggtcggcac tggcctgaac gcgccggccg 1981 ggttcgatca agcggtcgcg gcagagatcg cggcgattac cgggcttggg ttccgcacgg 2041 cgcccaacaa gttcgaggcg ctggcgtcga acgatccact ggtccagctg tccgccagcc 2101 tcgccacgct ggcggtcgcg ctgaccaaga tcgcgaacga tatccggctg ctcggttccg 2161 gtccccggtc tggcctgggc gagctgcggc tgccggagaa cgaaccgggc agctcgatca 2221 tgccgggcaa ggtgaatccc acgcagagcg aaatgctcac catggtcgcc gcccaggtga 2281 tcggcaatca ccaggcggtg accctggggg ggctccaggg tcatctcgaa ctcaatgtgt 2341 tcaaaccgct gatcggcgcg gcagtcctgc ggtcgataga cctgctggcg gttgccatga 2401 ccagctttgc cgagcgctgc gtagagggca tcgagcccga ccgcgcgcgc atcgccgacc 2461 tggtcgaccg atcgctgatg ctggtcaccg cacttgcccc ggcaatcggc tatgacaatg 2521 cggctcagat cgccaagcat gcacatgccg aagggctgac gctgcgcgag gcagggctgg 2581 cgcttgggct ggtcgaccag gcgacctttg atcggctggt gcgtccggaa gcgatggtgg 2641 gggattgaac cccgtccccg cctggcgcgt tggacagcca ggatcttgag gagaaacgca 2701 ttgggtgcga cgacactggg cgcgctgatc ggtggcgcga tcgacaatat gagcggtgat 2761 gacggcgtgg ccgacggcgc gatgatcggc gcgatcacgg ccaatgtgct gaaggtggcg 2821 gttccggtca tcgccaccta tcttgtcggc tgggcggtgc tgcgcgggat cgaacaggca 2881 gcggatcgct tgatggagaa gggacagaca gcatgatcgg acgcatcatc ggcgccatgg 2941 ttggccgcga aatcgaccgg cgcgacggcg cgggcgggat caagggcgcc gcgatgggct 3001 ggatcgcggc gggcgcgctg cgccgcatgg gccccttggg gctggtgctg ggcggcggat 3061 atgtggccaa gaagatttac gatcgggcca agggacgcgg ccggatctga cccttaggcc 3121 ttctgccgcg ggttgacggg ggggcgttgg ctgcggcaag cgcggccatg 
gccttcaccc 3181 ccgatccaga tccgcacggc tttcaccggc gcggcgtgtt gttcgtgctg tcgtcgccgt 3241 cgggcgcggg caagtcgacc attgcgcggc gcctgttggc ggcggaaccc aatctgggca 3301 tgtcggtatc cgccacgacc cggcccattc gccctggcga gcaggacggg gtggattatc 3361 acttcatcag caccgaccgg ttcaagcaga tggtcgccga ccaggcgttc ctggaatggg 3421 cgcatgtgtt tgaccaccgt tacggcaccc ccaaggcgca ggtcgaagcg atgctggcgc 3481 ggggtcagga cgtgctgttc gacatcgact ggcagggcgc gcagcagctg tttcagctga 3541 aaggcggcga tgtcgtgcgg gtcttcatct tgccgccgtc gatgcgggag ctgcgccgtc 3601 ggctggatgc gcgcggaacc gatgcccagg acgttatcca gcggcgcatg gaacgcgcag 3661 agcgggaagc gagccattgg gacagctatg actatgtgct ggtcaatgac gatatcgagg 3721 cctgtttcga acaggtccag accatcctgc acgccgaacg gctcaagcga tcgcgccagc 3781 cggggctgat cgggttcatt cgggaactcg gcaaaagctg acccggcgct ggcgggtcag 3841 tcgccaagca gcgccagaaa tgtcgtaccc gcctggaact cggcatgacg ggcggcacgc 3901 gcctgttcgg ccagcggctc gcggccccat tgatcggcct gccaatcctc gtcgcagcag 3961 gccgcggtcc agacggtatc gggatcggcg gcgccctcgg tcagccacag cgcgagcacc 4021 agcgatccgg tcgtggtgac gatcggggaa aggcccgcaa gcccgaaggc gtcgaatgcc 4081 agcaccgccg cccgcagccg gtcggtcgtc gccattggtt gatcgacatg catcaccccc 4141 gcagtcagcg cgaacgccac gtcatagcgc gatcgggccc aatcgagcgg cggatcccac 4201 agcgcatgct gctgcgccac cagccccggg ggatcgtcgg cgcgatagca gagcaggtcg 4261 ctttctccat aggcggcaag ctggcgggca aatgcctccg gcgccggcgc gatccggtcg 4321 atggcggcat tggccagccc ggtcagcggc atcgcccgcg gatcaagcac atcgccaacc 4381 ccgcgccatt ccgccgccac cgcctcggcc agtcgctcgc cgggcagcgt cagcagcgcg 4441 cgtccgggcg tgtgcaccgg cttgccgtcc agcgcgacgg accagccgtc cggcaccggc 4501 gccactgccg catcgcgcca gaagcgcctc atgatcccgg tggcgtccgc cagcgccggg 4561 caaggctgcg cggcaccacc gccatcacat atacgcccgc gatcaggatc gcggcgccca 4621 gtagccgacc ctccagccct gccgacctgc ccatcagcac cagcccgaaa atggcaccca 4681 ccgttgccgc cagattgacc gcgacgatcg tgaaatagcg gttgcgcgcc tgcaggtcgg 4741 cgtcggtcat cgggcctcca gataattgaa cagcgcggtt gcatgcgccg cgacgaatac 4801 agccccggca tccgccagtt cgtgcggctc gtgatagccc cagccgaccc cgaccggatg 
4861 cgcgccggcc gccaccgcca tagccatgtc atagctggtg tcgccgatca tcaccgtggc 4921 gtccggcgta gcgcctgctt cgcgcatggc ggtcagtacc atcgacgggt gcggcttgga 4981 cgggtggcgg tcggcggtct gcagcgtcag gaaatgcccc gccagcccat gatgctccag 5041 caccatgcgt aggccgcgat ccgacttgcc ggtggccacc cccaagcgcc aaccggccga 5101 tgccagcctt gccagccctt cggcgatccc ttcgaacagc ggttcgtcca gcagcccgcc 5161 attgctgcgc atcgcctgaa agtggcgctt gaaatcctcg gcaaccgcca ggtggcggcg 5221 gacgtccaag gtgggcgcca gctgggccac ggcatcggga acgctcaatc ccacgatccg 5281 gcggatcgcg tggcgatcgg gtgggggcag gcgatggtcg tcgaatgcgc gttccatcgc 5341 caaacagata ttggcctggc catccaccag cgtcccgtcg cagtcgaaga tcgccagccg 5401 gttcatggcc gcgccggcgg gtcccgcatc cattgcgcca gatcgtgctg gccgcgcgcc 5461 gcgacggcat ccgccgctgc cagccgcgcc gcttcgggct tgttctggtt gagcttcgcg 5521 gtcgcccgcc aggccgaagg gatcaggcga aagccccgga tcgcgctcgc catcctgtcg 5581 atcttgccgg ggtcggcctt gtcgcgccgc cattcgggct ttggcgccag cgctcgctca 5641 tgctcctggc ccaggcggtc gatctggtcc atcatcgccg catggtcgac cagctcgatc 5701 tggcccacga tctcaacggc cagatagttc caggtcggta cttcgtccgg gcccagcccg 5761 taccagtcgg ggctgatata gccgtccgga ccattgacga cgatcaaccc ggtggcgccg 5821 gccagatggg acgccagccg gttcgaccgc gccaggtgga accgcagcgt gtcgtcgtcg 5881 gccagcgtga ccggcagatg cgccacggct ggtccatccg gcgtggtcgc gaacagctgg 5941 gcaaagctgc gcgcgcggac aaaggcgcga accgcggcct cgtcctgcca gcgaaaggtc 6001 gggttggggt gcatcagcgc tgcccccgcc ctcgccgttc gccgcggcgt tccttgcgca 6061 ccgccttggc gtgctggcgc gccttggcct tctggcgttc gcgcgacggc ggcagcggtg 6121 cctcgtcgat caacgcagcc tcgcccagcg ccggatcaaa gcccagcgtg tccatcgtcg 6181 cggcgatatg ggcgggcaga tcggcggtga cgtccagccg cccgccatcg gggtggtcga 6241 tccggatgcg gcgggcgtgc aggtgcatct tgcgactgat cccgccggtc aggaacgcgg 6301 cctgaccgcc atatttgccg tcgcccacga tcggatgtcc gatcgctgcc atatggacgc 6361 gaagctggtg ggtgcgcccg gtaaagggct gcaattccac ccaggcgcac cgattgcccg 6421 cgcgttcgat cacgcgatag cgggtcctgg cgggctggcc ttctgcttca tcgacatgca 6481 tcttttcacc gccggtgccg ggctgcttgg cgatcggcag gtcgatcatc ccgtcctcga 6541 
tctccggtac gccgatcacc aacgcccaat agaccttctt ggcgctgcgc cccgaaaaat 6601 gctttgagaa atatgccgca gcacgcgccg ttcgcgccag cagcagcgcg ccggaagtat 6661 ccttgtccag ccgatggacc agcttgggcc ggccggtctg gtcgaactgg agcgcgtcga 6721 gcagcccgtc gacatggtcg tgcgtgcgcg tcccgccctg ggtggcaaga ccgggcggct 6781 tgttgagcac gatcgcctgg gcgtcgcgat ggatcaccag cgactgggcg aacgccacct 6841 gctcctcgct cagcgccggt cgctggcggg cgggccgtgg cgcgctggcc ggcgcggctt 6901 cggggggcgg aacccggatc gactggccgg cggcgacccg gtcgccgggc gtcgcgcggg 6961 cgccatcgac ccgcaactgc ccggtgcgtg cccagcgcga cacggtgttg aagctggtgt 7021 cgggcaaatg ccgcttgaac cagcggtcga gccggatgcc gtcatcgtcg gcatcgacgg 7081 tgaactggcg cacctcgctc atgccaggac ccgcccgacc gcaagccccg ccatcagcgc 7141 cgccactgcg ccgaccaccg acagcagcgc ataggccaag gcggcgatcc agtcgccgcg 7201 ctccatcatc gtcacgattt ccagcgagaa ggccgaaaag gtggtgaagc cgcccagcac 7261 ccccacgccc agcaacagcc gtaccggctc gccaccgctg gcgccgcgcg ccagccatgc 7321 ggcgagcagc cccatggcga gcccgcccgc caaattgacc atcaacgtac cccagggata 7381 gccgggtccg agcgccgcca gcgacgcgcg cccgaccgca tatcgcgcgc cggcaccgat 7441 cgcgccgccc agcatcacca gcaacaggtt ccacatcgcc ctgccctagc gcggaaggcg 7501 cgccggggga accgcccaac gaaggcgggg gggggtcagg ccccgcggtt cgcgatcagc 7561 tcaatcgcgg tgccgccctg cttggccggc ttcaggaaca ccacataggc ggcatcatcg 7621 cgcgcgcggg ttccgcccag cacatgatca tcgccctccg cctgatgttc ggcggtgtac 7681 ccggcatcgg tcgcgcgggt atagtaccag tcaagcgcgg tctggagcgg caggtcggtg 7741 gtgaagctcg ccacgcgcag cgcgcagctg ccgctcgatg tgccgccggc ttcgaccagc 7801 cgggcattgg ggatcatcgg caggtccttg ggcaggcgct gggcccaggc cgg // LOCUS NODE_4099_length_7819_cov_5.148635 7819 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7819) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7819) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7819 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 
complement(145..339) /locus_tag="DP116_25225" /pseudo CDS complement(145..339) /locus_tag="DP116_25225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131608.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="N-acetyltransferase" gene 757..2187 /gene="rbcL" /locus_tag="DP116_25230" CDS 757..2187 /gene="rbcL" /locus_tag="DP116_25230" /EC_number="4.1.1.39" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009634155.1" /note="type III RuBisCO; involved in carbon fixation; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribulose-bisphosphate carboxylase large subunit" /protein_id="PRJNA477356:DP116_25230" /translation="MSYAQTRTQTKSGYQAGVKDYRLTYYTPDYTPKDTDILAAFRVT PQPGVPPEEAGAAVAAESSTGTWTTVWTDLLTDLDRYKGRCYDIEPVPGDDNQFIAYV AYPLDLFEEGSVTNMLTSIVGNVFGFKALKALRLEDLRIPVAYLKTFQGPPHGIQVER DKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSAPFQRWRDR FLFVADAIHKSQAETGEIKGHYLNVTAPTCEEMLKRAEYAKELKMPIIMHDYLTAGFT ANTTLARWCRDNGILLHIHRAMHAVIDRQKNHGIHFRVLAKTLRMSGGDHIHTGTVVG KLEGERGITMGFVDLLRENYVEQDRSRGIYFTQDWASMPGVMAVASGGIHIWHMPALV EIFGDDSVLQFGGGTLGHPWGNAPGATANRVALEACIQARNEGRNLAREGNDIIREAA KWSPELAAACELWKEIKFEFEAVDTV" gene 2286..2684 /locus_tag="DP116_25235" CDS 2286..2684 /locus_tag="DP116_25235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867713.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="chaperonin family protein RbcX" /protein_id="PRJNA477356:DP116_25235" /translation="MDIKQIAKDTAKTLQSYLTYQALKTVLAQVSETNPPLALWLQRF SADKIQDGEAYIKELFQEKPELALRIMTVREHIAEQVTEYLPEMVRTGIQQANMEHRR QHLERITHIDTSAPSPEPETQTTSDPNNEQ" gene 2763..3098 /locus_tag="DP116_25240" CDS 2763..3098 /locus_tag="DP116_25240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875878.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ribulose bisphosphate carboxylase small subunit" /protein_id="PRJNA477356:DP116_25240" /translation="MQTLPKERRYETLSYLPPLSDAQIAKQIQYILNQGYIPAIEFNE NSEPTVYYWTLWKLPLFGAKSTQEVLNEVQACRSQYGNNFIRVVGFDNIKQCQVLSFI VHKPNSSRY" gene 3265..4545 /locus_tag="DP116_25245" CDS 3265..4545 /locus_tag="DP116_25245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016859875.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribulose 1,5-bisphosphate carboxylase" /protein_id="PRJNA477356:DP116_25245" /translation="MSYYISPRFLDKLAVHITKNYLDLPGVRVPLILGIHGRKGEGKS FQCELVFERMGVEVTHISGGELESPDAGDSARLIRLRYRETAELIRVRGRMCVIMIND LDAGAGRFDEGTQYTVNTQLVNATLMNIADNPTDVQLPGSYDATPLHRVPIIVTGNDF STLYAPLIRDGRMEKFYWDPDRDDKVGIVTGIFSEDELSRQEVEKLIDTFPNQSIDFF SALRARIYDEQIRNFIHEIGIERVSQRVVNSVEGPPQFRKPNFSLSHLIEIGNLMVGE QQQVESSQLVSQYNRSLYSRNQSAPSGAVTPTTQPSSNGASQGTTSNGYQKQQQSNTH LTLETQEQIREILSHGYRIGIEHVDERRFRTGSWQSCITSPITGEQDAISTVESCLGE YSNEYVRLVSIDPKAKRRVVETIIQRPNGKVDSR" gene 4752..4823 /locus_tag="DP116_25250" tRNA 4752..4823 /locus_tag="DP116_25250" /product="tRNA-Gln" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:4784..4786,aa:Gln,seq:ttg) gene 5381..7606 /locus_tag="DP116_25255" CDS 5381..7606 /locus_tag="DP116_25255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317184.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="capsular biosynthesis protein" /protein_id="PRJNA477356:DP116_25255" /translation="MLKSDKYPHPLSQAYTAQLNEEGGLNLGQVGATLRRRALLIAGV TGVIATAAVLKAETDPPVYQGRFEILTKPVTGEGRAVANIPQSLSSQQGIASPESVEA VTTTMQVLESRKMLDKVIEELQDKYKTKYPKLDYDSIVARLQIASEKPNILEVTYTSP DKELVRDVLTKLQKTYVEYSVNEHLEDVNQAISFVRQQELPLENRVRAWQERQRIIRQ RHDLVEPAQKAQEISQQIATLTQQQAENRIQLEQMLATYEDLKRELAQQPGERAGNSV LSENARYQKILDQIQTVDIEIKKGGAKFTSENPSLQNLQAQKANLLPMLAGEENRVKR DYESRIRALQARDKSLGDKIAYFNNYLRNLAVVIREYDDIQRNLKIATEGLTQFSAKK QALQIEQALRSPSWKLLDPKLEEVNEPKAGPGSAKRNLALGTLLGLLLGTGAALVADK LSNVLYSSKDLKEATGLPLLGVVPLRKEIGALAWQETSGGMQQAVKASFFEVFRSLYT NILLLSSDTPIRSLVISSAAKEDGKTTVAIHLALAAAAMGQRVLLVDANLRSPTIHNR VGLMNIQGLTDVIAQDLDWNNVIERSPLEDNLFVLSAGPIPPDSIRLLASQKMQDLMS ELQASFDLVIYDTPPLVGFADANLLAANTNGLILVAGLGKLKRTMFQQALDDLQMSGT PILGVIANKSKDTTPASYSYYQQYYKQSMSNERIGDDDSIDFTHSRSTSSSFRNGRRN S" BASE COUNT 2325 a 1738 c 1730 g 2026 t ORIGIN 1 cgctgcgcgt tcgcccttgg cgtgcgcttg cgcttacgga ggaaacctcc gctcaaactt 61 ctctcaaaat tcaaaattaa gaaaactagt gatagcagaa tatctagcct tcagttgtgc 121 gcgttttcac atgcgtaaca gtttaccata tccccgtagt tgccaatgac caagtatcat 181 tgccatattt cgccaagact cccaacggga caaaggtttg ccttcgccaa tgtagcgcat 241 aacttcaggg tcactgcaca tttcggtgta agcatcaagg tcttcttcac tacatccccg 301 taggatcaga cgttggcttt caagttgagg aatctgcata ggatttttgc atcgtgcttg 361 tttcagggat gtagtagctg gtacctgtag ttttacgatt gtaagaaaat acaaaaatct 421 tacatttttg taaagcattc ttaatataat gttaaattat tctttaataa gaaatttaaa 481 tataattgtt aaaaatcttc aaagaataac ttattacatt cttgatatat tgtgaaacga 541 gctacaaatt acgcaacgtg tcactttgtc tttcacgtaa cacagattgg atgttttgtc 601 aagaattgta ggaaaaccac aattctctgt aagaagcaac ccctgacaca aaatgttttc 661 tatctatcgc tcagtaaaag gtgataggtt tgaagtgagg aaagcagcac gtgacgcgat 721 ttgtaagtaa aaagagtgac ttcttggaag ggaattatgt cttacgcgca aacgaggact 781 cagacgaaat caggctatca agctggggta aaagattaca ggctaacata ttacaccccc 841 gattacactc cgaaagatac agatattttg gcagcattcc gcgtaacacc ccagccagga 901 gttccgcccg aagaagcagg 
cgctgctgtg gctgctgagt cttccactgg tacttggaca 961 accgtatgga cagacttgct caccgactta gatcgctaca aaggtcgttg ctacgatatt 1021 gagccagttc ctggcgatga caaccaattc attgcctacg ttgcctatcc gttggatcta 1081 tttgaagaag gttctgtaac caacatgttg acctctattg taggtaatgt gtttggtttc 1141 aaagccctga aagcactgcg tctggaagac ttgcggattc ctgttgctta cctcaagaca 1201 ttccaaggac ctccacatgg tattcaagta gagcgcgaca aactgaacaa gtatggtcgt 1261 cctctgttgg gttgtacgat taagcccaag ttgggtctgt ctgccaaaaa ctacggacgc 1321 gctgtatacg agtgcttgcg cggtggtttg gacttcacca aagacgacga aaacatcaac 1381 tctgcaccgt tccaacggtg gcgcgatcgc ttcctgttcg ttgcagacgc tatccacaaa 1441 tcacaagccg aaactggtga aatcaaaggt cactacctga atgtgaccgc tcccacctgc 1501 gaagaaatgc tgaaacgggc agagtacgcc aaagaactca aaatgcccat tatcatgcac 1561 gactacctaa ctgcaggctt caccgctaac accacattag ctcgctggtg ccgtgataac 1621 ggtattctgc tgcacatcca ccgtgctatg cacgccgtta tcgaccgtca aaagaaccac 1681 ggtatccact tccgcgtctt ggctaagacc ctgcgtatgt ccggcggtga ccacatccac 1741 accggaaccg tcgtcggtaa gctcgaaggt gagcgcggta tcacaatggg cttcgtcgat 1801 ctactgcgtg agaactacgt tgagcaagac agatctcgcg gtatctactt cactcaagac 1861 tgggcttcta tgcccggagt gatggctgta gcttccggtg gtatccacat ttggcatatg 1921 ccagctctag tggaaatctt cggtgatgac tccgtgctgc aatttggtgg tggtactctc 1981 ggtcacccct ggggtaacgc tcctggcgct accgctaacc gcgtcgccct ggaagcttgt 2041 atccaagctc gtaacgaagg tcgcaacttg gctcgtgaag gtaacgacat tatccgcgaa 2101 gctgctaagt ggtctcctga actggctgct gcttgcgaac tgtggaagga aattaagttc 2161 gagtttgaag cagtcgatac cgtctgatgc aagagtaaaa agttaaaaag taaaagaaaa 2221 taatccttat ttttcattac ttttgacttt tttacttttt acctattcta gggctggggt 2281 caagcatgga tattaagcaa attgcgaagg acacagccaa gacgctgcaa agctacctga 2341 cttatcaggc actaaaaacg gtgttggctc aagttagcga aacaaatcct cctttagcac 2401 tttggctaca acgcttttcc gccgacaaaa ttcaggatgg agaagcatac ataaaggaac 2461 tgtttcaaga aaagccggaa ttggctttgc ggatcatgac tgttagagaa cacatagcgg 2521 aacaagtcac tgaatactta cccgaaatgg ttcgcacagg cattcagcaa gccaatatgg 2581 aacaccgtcg ccagcatctt gagcgaatca 
cgcatataga tacatcagct cccagtcctg 2641 aaccagaaac gcaaacaaca tcagatccaa ataatgaaca gtgaacagtg aacagtgaac 2701 agttaataac tgataactga tgactgataa ctgatgtcaa ccgcaaccca ttatcatcac 2761 ctatgcaaac cctaccaaaa gagcgtcgct acgaaaccct ttcctatttg cctcctctgt 2821 ctgatgctca gattgccaag caaatccagt acattctcaa ccaaggttac attccggcta 2881 ttgagtttaa cgagaattca gagccgacag tatattactg gactctgtgg aaactgcctc 2941 tgtttggtgc aaaatctact caagaagttt taaacgaagt tcaagcttgc cgttctcagt 3001 acggcaataa ctttatccgc gtggtgggtt ttgacaacat caagcagtgc caagttctca 3061 gctttatcgt tcacaagccc aatagcagca gatactaaag ctgataaatt ggaatgaatt 3121 gtcagacaac tgtaagtcat tgattcatct cccagtaaaa gaggtagagt tatctacctc 3181 tattttttac tatttagaga gtgtagaccg cactttttgt aacgtatcca gtagatttgt 3241 atattgaaat tattggcgag atctatgagc tactacattt ctccccgctt tctggataaa 3301 cttgcagtac acatcaccaa aaattacctc gaccttcctg gtgttcgagt ccccctaatt 3361 ttaggtatcc acggacgcaa aggcgagggc aagtcgtttc aatgtgagtt agtctttgag 3421 agaatgggtg ttgaggtgac tcacatatcc ggcggcgaac tcgaaagtcc agatgctgga 3481 gattcagcac gtctaattcg tctgcgctat cgagaaacag cggaactcat tcgcgtgcgc 3541 ggcagaatgt gcgtgatcat gattaatgat ttggatgcag gtgcgggacg ctttgatgaa 3601 ggcactcagt acacagtcaa cactcagttg gtaaatgcca cactgatgaa tattgctgac 3661 aatcctacag atgtgcaact tcccggtagt tatgacgcaa cacctttaca tcgtgtaccg 3721 attattgtaa caggtaatga tttctccacc ctctatgcac ctttaattcg ggatggtcgg 3781 atggagaaat tttactggga tccagaccga gatgataagg ttggtattgt caccgggatt 3841 tttagtgaag atgaactttc acgccaagaa gttgaaaaat taatcgatac attcccaaat 3901 caatcgatag actttttcag cgctttgcgt gcccgaattt atgacgaaca aattcgcaac 3961 ttcatacacg aaataggaat tgagcgcgta tctcaacgtg tggttaatag cgttgaaggt 4021 ccaccacaat ttagaaagcc taatttcagc ttgtctcact taattgagat aggcaacttg 4081 atggttggtg aacaacagca agtcgagagt tctcaactgg tgagtcagta taatcgcagc 4141 ttgtactctc gtaatcaatc cgcaccttct ggtgctgtga caccaacaac tcagccgtca 4201 agtaatggcg caagtcaagg tacaacatcc aatgggtacc agaaacagca gcaatcaaat 4261 actcatttaa ctctggagac acaagaacag atacgtgaaa 
tcttgtctca tggttacaga 4321 ataggtattg agcatgtaga tgagcgacgc ttccgtacag gttcctggca aagttgcatc 4381 acaagcccaa ttaccggcga acaagatgca atatcaactg tggaatcctg tcttggggaa 4441 tatagcaatg agtatgtgcg cttggtaagt atcgatccaa aggcgaagcg gcgggttgta 4501 gaaacaatta ttcagcgccc aaatgggaaa gttgatagtc gctgaactac ccaagccttg 4561 gctctttttg cctgttcggc aaagcccgtg tcgttatagc agcccagtgg cgcaagccat 4621 agatatacgc tcttgcttct agtttcacgg gcgaatgcct cttcgatttg ataaaattct 4681 cctcaagtca acttgcgaag tacttgacac taccccctca attgttgcta tattattgaa 4741 agataaattg atggggcgta gccaagtggt aaggcagcgg gttttggtcc cgccatccct 4801 aggttcgaat cctagcgccc cagttagaaa gagagaaaaa tacaagttta gcatcttcat 4861 tgaaagcact ccaccggcag tgcgtcactt attatgcatg aaacaaacat ttcactctct 4921 cactccctca ctccctctct caagagtgtt tcttcagagt gccccttttg cggctaggat 4981 aattataaat ttctaccggc gagattgtgc gttcttctga ttggcactgc aagttccacc 5041 taatgtgtat acctaagata aaaaagttaa attcatatca ttaggtttta tccggaagag 5101 tattgagtgt ataagaatag tgtacttaag tgcattgctg cggtgttatt agtatttttt 5161 cgggacttca gacagggtta aagtatctat cacttggttt agggtaaaac tgaaattgcc 5221 ctcaagggta tcaaaactgt gccattaact aatgtgttca ccaacgttca ggcgaatgtc 5281 aaacaactgg cacttgagca cattagaaaa atgtatgcca aagtttagct cgaaagcgtg 5341 caactacgga aaaccacaac tcagagaagt tactaaggct atgctgaagt cagacaaata 5401 tcctcacccg ttgtcacaag cctacacagc ccagctaaat gaagaaggtg ggttgaatct 5461 tggtcaagtt ggggcaactc tacgccgtcg cgcattattg attgctggtg taacaggcgt 5521 catagctaca gcagctgtgt tgaaggcaga aacagatcct ccagtatatc aaggtcggtt 5581 tgagatttta acaaagcctg tgactggtga gggtcgggca gtagccaata tccctcaaag 5641 cctgagttcc caacaaggaa tagcatctcc tgagtcagta gaagcagtta cgacgacgat 5701 gcaagtttta gaaagtcgta aaatgctaga caaagttatc gaagagcttc aggataagta 5761 taaaactaaa tatccaaaac tagattacga ctcaatagtc gcccgcttac aaatagcatc 5821 cgaaaaacca aacatcttag aagtcacata cacatctcca gacaaagagc tagttcgcga 5881 tgtcttaact aaacttcaaa aaacttacgt cgagtatagt gtcaacgagc atctagagga 5941 tgtgaatcag gcaattagct ttgttagaca gcaagaactg cctttagaga 
atcgggttag 6001 agcttggcaa gaaagacagc gaattattcg acaaagacat gacttagttg aaccagcaca 6061 gaaagctcaa gaaatttctc aacaaattgc tactttaacc caacagcaag cagaaaatcg 6121 catacagcta gaacaaatgc tagctaccta tgaagatttg aaacgggagt tagcccaaca 6181 gccaggtgaa agggctggga attcggtgtt aagcgaaaac gctcgttacc aaaaaatatt 6241 ggatcaaatc caaacagtag atattgagat caaaaaagga ggagcaaaat tcacaagtga 6301 aaaccccagt ttgcaaaatt tacaggcaca gaaggcaaat ttactcccaa tgctagctgg 6361 ggaagaaaat cgcgtgaaaa gagattatga aagccgtatc cgggccttgc aagcacgtga 6421 taaatctttg ggagacaaaa ttgcttattt caataattac ctcagaaatt tagcagttgt 6481 gatccgtgaa tacgatgata ttcagcggaa cttaaagatt gccactgaag gtctgactca 6541 attttcagct aaaaaacaag cattgcagat tgagcaagct ttgaggtcgc cctcctggaa 6601 gttactcgat cccaaactag aggaagtcaa tgaaccaaaa gctggcccag gtagtgccaa 6661 acgaaactta gctttgggga cactgttagg tttgctgttg ggtacaggtg cagctttagt 6721 tgcagataag ctcagcaacg ttctatactc ttccaaagat ctcaaggaag caactggatt 6781 accgctatta ggagtcgttc ccttaaggaa agaaatagga gcattagctt ggcaagaaac 6841 atctggtgga atgcaacaag cagtcaaagc atcctttttt gaagtttttc gctctcttta 6901 cacaaacatt cttcttctga gctcagatac accgattcgt tcactcgtca tcagttcggc 6961 ggcaaaagaa gatggtaaga cgactgtagc aatacaccta gcgctcgcag ctgcagcaat 7021 ggggcaacga gtactactgg tagatgcaaa tctacgctct cccacaatcc ataatcgtgt 7081 gggtctgatg aatattcagg gattaactga tgtcattgca caagatttag actggaacaa 7141 tgtaattgag cgatcgccct tagaggacaa cctgtttgtt ctgagtgctg gtccaattcc 7201 ccctgactca attagattgc tcgcatctca gaaaatgcag gatttgatga gtgagttaca 7261 agcctctttt gatttggtaa tctacgacac gccaccttta gtgggttttg cagatgctaa 7321 cctactcgct gctaatacca atggactgat actagtcgca ggattaggaa agctaaaacg 7381 taccatgttc cagcaagcat tagacgatct ccaaatgtct ggcacaccga ttttaggagt 7441 gatagctaat aagtccaaag ataccacgcc tgcctcatat agctattacc aacagtatta 7501 caaacagagc atgagtaatg aaagaattgg agatgatgat agtattgact tcactcattc 7561 caggtcaact tcatcttctt ttagaaatgg tagaagaaac tcataaagct tcgtgggatg 7621 tgtggaaacg cagcagattg tgtctgatac aggaagacac aactgttgtt tagctgagag 
7681 aaaacaaggc gcgggggacg gaatttgact cgcaaacaga gcacgagctt aatgcctgca
7741 gcaacgcgct gcgcgtatct cctgcggaga cgctaagcgc caaaggcgca cgctgcgcgt
7801 tcgctcttag cgtgcgctt
//
LOCUS       NODE_4125_length_7751_cov_4.731939 7751 bp DNA linear BCT 25-JUN-2018
DEFINITION  Brasilonema bromeliae SPC951, whole genome shotgun sequence.
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema.
REFERENCE   1 (bases 1 to 7751)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 7751)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7751 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(87..1208) /gene="bchI" /locus_tag="DP116_25260" CDS complement(87..1208) /gene="bchI" /locus_tag="DP116_25260" /EC_number="6.6.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314355.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="magnesium chelatase ATPase subunit I" /protein_id="PRJNA477356:DP116_25260" /translation="MSPTAQATAAARRVVFPFTAIVGQEEMKLALLLNVIDPKIGGVM IMGDRGTGKSTTIRALADLLPEIPVVANDPFNSDPSDPDLMSDEVRQQLQQGADIPVV PKKVQMVDLPLGATEDRVCGTIDIEKALSEGVKAFEPGLLAKANRGILYVDEVNLLDD HLVDVLLDSAASGWNTVEREGISIRHPARFVLVGSGNPEEGELRPQLLDRFGMHAEIR TVKEPALRVQIVEQRSEFDQNPPVFLEQYQSQQEELQQKIINAQKLLPSVTIDYDLRV KISEVCSELDVDGLRGDIVTNRAAKAIAAYEERTEVTIDDIRRVMTLCLRHRLRKDPL ESIDSGYKVQKTFSRVFGVELSEETTQQNGTGQKIGLRN" gene 1335..2009 /locus_tag="DP116_25265" CDS 1335..2009 /locus_tag="DP116_25265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876545.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25265" /translation="MLEYDSLACLPSSEELPDSDDTPVDNELQDLIPSLLKAILAMVW ANRMDWFFGVDMGVYYDPNQPPIVPDGFLSLGVERFFDENLRLSYVLWEEEKVPTLTL EVVSQRYRGEYTTKKDEYAKLGVLYYVIYHPTRRRKPRLEVYKLVNGAYQLHSDNPVW LPEVGLGIGMERGIYQGITREWLYWYNEEGKRILTPEEQAEQAQNRAQVLAERLRAMG VDPDSI" gene 2112..3513 /locus_tag="DP116_25270" /pseudo CDS 2112..3513 /locus_tag="DP116_25270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318373.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ATP-binding protein" gene 3717..7391 /locus_tag="DP116_25275" CDS 3717..7391 /locus_tag="DP116_25275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198907.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25275" /translation="MDADEFNVHINNERSLKQLAWAIEASVGQFKLILARCNYASLRS SSREPPFGRSHLINRLRDICQVEIRVLVLKESARTLYTAIREESGDDVQALMILGLES VRNLEQMLISANQVREEFRNHFPFPVVLWIDDEVHKRFMQFAPDLESWGTTKNFPIAP NELIEFLQAIAEQLLTGDFSLTLEKCSEIKLAWQDLQNSGQVLEPEAKARITYLLGVT EYVDNRLDTALEYYQQSLAFWQQVNDLVWQGKVLSHITFCYYEKARQQEINRPVIASE TKWNEAISPNPRLSTPNWQETKNYLQQSLQVLEAAQRPDLVAHLLDKFGKILRDLEDW QELKKLAERSLLIHEAEGDLLRVSQDYGFLAEVALASQKWQDAKTLAEKALEILSTNQ SVIFHRQEILLILAQAEQNLGEEQAAIIFLKTAIQIGVSDSEPQLFLSLLRNLRSLYL KQKQYLEAFQIKQERLSVEQQFGLRAFIGAGRLQATRQIKAQGIATLQRETQENVAPE ISASGRLLDVERLIERIGRPDYKLIVIHGQSGVGKSSLVNAGLVPGLKKKAIGIQDNL VVPIRVYTNWMEELRRQLREALQEMGRWGDGEAETSALETQAQVSEVETPDLRTTLLK QLRQCETYNLRPVLIFDQFEEFFFVDIEPQQRWLFFHFLRDCLNILSVKIVLSLREDY LHYLLKFNRFRDNSMISIDILSENVLYELGNFSRDDAQSIIQQLTERANFHLEPALIA ELVRDLARELGEVRPIELQVVGAQLQAENITTLAKYQECGTKEELVKRYLDEVVQDCG EENQQTAEFVLYLLTDEKGTRPLKTRVELERDLQKLISVDDVQSGRIPPTPLKKGGYF TSFVRGDLSKLDLILEIFVESGLVVLLKENPANRYQLVHDYLAEFIRQQQQPKLSQVM AELEQEKKQRLQTEEQLKQTEQAKQILSQANDKAKQRIRIGSGVLIASFVVAGIVTLQ AFGLAGKAQKITILERQGIATEKLFQFQEIEALLLALDVGQDLHNFINKENEPAEYPA ASPVLALQTTLDNIHEQNQLKGHTLPVTSASFSPDGKRILTASWDKTARLWDSSGKLV TELKGHTDSVNSASFSPDGKRILTASWDKTARLWQYRTFDELLSEGCQWLNDYLVINP KKLEKLEVCQNKSNLRAAAQFLVKEGEEQATAGNIDEAIATFNKAFQWNPSLKFDPKA KAQEFANKGKAEKLLQKRVR" gene 7487..>7751 /locus_tag="DP116_25280" CDS 7487..>7751 /locus_tag="DP116_25280" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25280" /translation="MHRHAADIMFACEKAVQLAPDDGYIRDSRGLARALTGNTQGAIE DFEAYIPQTDDKEIKSQRQSWVKDLRALKNPFTEEVNGFLRARK" BASE COUNT 2253 a 1451 c 1889 g 2158 t ORIGIN 1 catgagtgct aagtgataag ttttgtcaaa gaagataaaa gaaaaaaaag ataaaggaat 61 gtcttttctt tcttaactct tccttcttaa tttctaagtc ctattttttg tcctgtaccg 121 ttttgttgtg tcgtttcttc tgacagttct acgccaaaga cacggctaaa ggttttttgt 181 actttgtatc cagaatctat tgactctaag ggatctttcc gtagtctatg acgcagacat 241 aaagtcatga cacgacggat atcatcaatt gtaacttcgg tacgttcttc atatgcggct 301 atggctttgg cggcgcggtt agtaacaatg tcaccgcgca agccatcaac atcgagttcg 361 gaacagactt cggaaatctt cacccgcaag tcgtaatcaa ttgtcacaga tggtaaaagc 421 ttttgggcgt taataatttt ctgttgcagc tcttcttgtt gagattgata ctgttctaga 481 aatactggag ggttttggtc gaattctgac cgttgttcaa cgatttgcac gcgcaaagct 541 ggttctttga ctgtacgaat ttctgcgtgc attccaaatc tatctaaaag ttgaggacgc 601 agttctcctt cctctgggtt cccagaaccg acaagcacga aacgggctgg gtgacgtatg 661 gaaattcctt ctcgttctac cgtgttccaa ccactggcgg ctgagtcaag aagcacgtct 721 actaggtgat catctagcaa gtttacttca tctacgtaga gaatacctcg gttggctttt 781 gccaagagtc ctggttcaaa ggctttgaca ccttctgata atgctttttc aatgtcgatg 841 gtaccgcaca ctcggtcttc tgttgctccc aagggcaaat cgaccatttg gacttttttc 901 ggaacgacgg gaatatccgc cccttgctgt agctgttggc ggacttcatc actcatcaaa 961 tcgggatcac taggatcgct gttgaaggga tcgttagcaa caacggggat ttctggaagc 1021 aaatcagcta gcgcccggat agttgtggat ttaccggtac cgcgatcgcc cataatcatc 1081 acaccaccaa ttttcgggtc aatcacgttc aacaatagag ccagtttcat ttcttcctga 1141 ccgacaattg ctgtaaaggg aaataccacg cgacgcgcag cagcggtagc ttgagcagtt 1201 ggactcacta aattacctta tctatatttc ttttattacc gtctttattg tgccataggt 1261 ggggttgtgg aggacaaagg gatgaggtaa tgggggacta tgatcagaat tatgaaataa 1321 ctgccgaacc gcttatgtta gagtacgatt cattagcttg cttgccctcc tctgaggaat 1381 taccagactc tgatgatacg cctgtggata atgaactcca agatttgatt cctagtttac 1441 ttaaagccat actcgctatg gtgtgggcaa accgtatgga ttggtttttt 
ggtgttgata 1501 tgggtgttta ctatgatcca aatcaaccgc caattgtacc agatgggttt ttgagtttgg 1561 gagttgagcg attttttgat gaaaatttgc gtttgagtta tgtgttatgg gaagaagaga 1621 aagttccaac gctaacgctg gaagttgttt ctcagagata tcgtggggaa tacaccacca 1681 agaaagatga gtatgcaaag ttaggggttt tgtactacgt catctatcac ccgactcgtc 1741 gccgcaaacc gcgtttggaa gtgtataagt tggtcaatgg tgcatatcaa ttgcattcgg 1801 ataatcctgt atggttaccg gaagttggtt tgggaattgg tatggaacgg ggaatttatc 1861 aagggattac gcgggaatgg ctgtattggt ataacgagga agggaaacgg attctgacac 1921 cagaagagca ggctgaacaa gctcaaaatc gcgcccaagt cttggcggaa cgcttacgtg 1981 ctatgggagt cgatccagat tctatttaag atcatcttaa aattatcgga aaatcaagca 2041 tcaaagctgt agctatttat atcttggaaa ccaaggatat gaaaaatgaa tagttataga 2101 ttataggctt tatgatgata gacttacgca agttttttga agctacagac cccagcagaa 2161 ctctggtgat taacaacaca caggataaaa agtattatat tgacttttct tctgtgcgtg 2221 gtggagatat tatcttcaaa ctgaagcaga aaatgacatt ttttaagccg aatgacccta 2281 cctgcactct ctttactggg cacattgggt gtgggaaatc tacggagttg cggcggctgc 2341 aactggagtt ggaagcggat ggtttttgtg tcatctattt tgagtctagc gaagacttgg 2401 aattgactga tgtggatatt gctgatgtgt tgctggcgtt aagcgtagct ctgccgtagg 2461 caatcgcccg tcgtgtgagt caaagtttag aaaaactcaa tcttgaggaa cccagcaggc 2521 tgaaagagtt actgcaaggt gctatgaggg ttttaaatgc tgatgtgact ggcgtaaaac 2581 tcaaagttcc taatgttggt gatttcggtg tcacgtctga aaaagaaaag tttactttgg 2641 cttttggaat tggggaaatc acgactaagg ctaagagtga cgccacactg cgggaaaaac 2701 tcaaccagta tctgggacca caaaaaatta aactcttaga cgcaattaat aaggagttgc 2761 tggaacctgc gatcgccaaa ctcaaacagc aaggcaaaaa aggtttagtg gtgattgtag 2821 ataatcttga ccgaattgat aatcgtccca aggcttgggg acgtccacaa caggaatact 2881 tgtttgtgga tcagggtgag tatctcacca agttgaattg tcatgttgtg tatacaatgc 2941 cactgtctct gaagttttcc aatgactatg gaacgctcac acaaagattt ttggaagacc 3001 cgagagtgtt accaatggta cctgtacaat ggtcagatgg cagtgttcat gaggagggaa 3061 tggcgctgat gcaagagatg gtactggcaa gagcttatcc tgacttgcga ccagaccagc 3121 gtgccagtaa tattacgttt gtatttgatc gtaaggcgac gcttgaacac ttgtgtcgaa 
3181 tgagtggtgg tcatgtgcgc gacttgctga ggctgctgaa tacatggatt atggaggaaa 3241 tgtcacttcc tttgactcgt gatactttgg atactgtgat tcgcgctcgt cgcaatgaaa 3301 tgactctgcc gatttctgat gatgagtggg aattgttgcg tcgtgtgaag caaacgaaaa 3361 aagtgagtga tgatcagggg tatcaaaagc tgattcgcag tcggtttgtt tttgaatatc 3421 gcgatctgcc gaaggcagca gcgaagctat cgcggtgagt cttggtttga ggttaatcct 3481 attttggcag aggcgcggga gttgaatggc tgaacaattg ctgtgttttc tataagtttc 3541 cacaaccgcc aggaactaga agttcctggc tcataggcta agtccattaa aatggactga 3601 aaacaaagct acgcgaaagt gattagtcct caccagagga ctttggctat gagactgggg 3661 ttttcaaccc caggcgatcg tgaacaataa tacatcaaag agctaaggaa gcgcagatgg 3721 acgcagatga atttaatgtt cacatcaaca atgagcgctc attgaaacaa ttagcttggg 3781 cgattgaggc ttctgttgga cagtttaagc tgattctggc acggtgcaat tatgctagtt 3841 tgcgaagctc cagcagggag ccgccctttg ggcgatcgca tctcatcaac cgattacgtg 3901 acatttgtca agttgaaatc cgtgtcttgg ttctcaagga atctgcaagg actctttata 3961 ctgccattcg ggaagaatct ggggatgatg tacaagccct gatgattttg ggtttggaat 4021 cagtgcggaa tcttgagcaa atgctgatta gtgcaaatca ggtgcgggag gagtttcgca 4081 atcattttcc tttccctgtg gtgttgtgga ttgatgatga agttcacaag cgattcatgc 4141 agtttgctcc tgatttggaa agttggggaa cgacgaaaaa tttccccatt gctcccaatg 4201 aattaataga gtttttgcaa gcaatagctg agcaattatt gactggtgat ttcagtctga 4261 ctttagaaaa atgttccgag attaaattag cttggcaaga tttgcaaaat tcgggacaag 4321 ttctagaacc tgaagcgaaa gcaagaatta catatttgtt gggagtcaca gaatatgttg 4381 ataatcgtct agatactgct ctggagtatt atcaacagag tttggctttt tggcagcaag 4441 tcaatgattt agtctggcaa gggaaagtcc tgagtcatat cactttctgt tattacgaaa 4501 aagcgagaca acaagaaata aatcgccctg tcattgcgag tgaaacgaag tggaacgaag 4561 caatctcccc taacccaaga ctatcaactc ctaattggca agaaacaaaa aattaccttc 4621 agcaatctct ccaagttttg gaagctgctc aacgtccaga tttagttgct catttacttg 4681 ataagtttgg caaaatcttg cgtgatttgg aagattggca ggagttgaaa aagttagctg 4741 aacgttcttt gctaatccat gaagctgaag gagatttgct gagagtttct caggattatg 4801 gctttttggc tgaagttgct ttggcaagtc aaaaatggca agatgcgaaa actttagcgg 4861 
aaaaagcgtt agagatttta tctacgaatc agtctgtcat ttttcatcgc caggaaattt 4921 tgttgattct agcgcaagca gaacaaaatt taggcgaaga gcaagcagcg ataatttttt 4981 taaaaacagc aatccagata ggcgtatcgg actcggaacc acaacttttt cttagtcttt 5041 taagaaattt acgaagttta tatttaaaac aaaagcaata tctcgaagct tttcaaatta 5101 aacaggaaag gctttcggtt gaacaacaat ttggcttgcg ggcgtttatt ggtgccggaa 5161 gattacaagc tacacgacaa atcaaagcgc aaggtattgc gactttacaa agagaaactc 5221 aagaaaatgt cgctccagaa attagtgcat ctggtcgttt actggatgta gaacgtttga 5281 ttgaacgcat cggtcgccct gattataaat tgatagttat tcatggacaa tcaggagttg 5341 gcaaaagttc tctggttaat gcgggacttg taccaggttt gaaaaagaag gcgatcggca 5401 ttcaagataa tttggttgtg ccgatacgag tttacaccaa ctggatggaa gagttgagac 5461 gccagctaag agaggcgtta caggagatgg ggagatgggg agatggggaa gctgaaactt 5521 ctgctttaga aactcaagcg caagtgtcag aagttgaaac gccggacttg cgaactacgc 5581 tgttaaaaca attacgacag tgtgaaacat ataatcttcg tccggtactg atttttgatc 5641 aatttgaaga atttttcttt gttgatatcg aaccacagca aaggtggctg ttttttcatt 5701 ttttaagaga ttgccttaat atcttatcgg tgaagatagt attgtcgttg cgagaagatt 5761 acttacatta tttgctcaag tttaatcgct tccgcgacaa ttcgatgatt agtatcgata 5821 ttctgagtga gaatgttctt tatgagttag gtaacttctc ccgtgatgat gctcagtcaa 5881 ttattcagca attaactgaa cgggctaatt ttcacttaga acctgcctta attgcagaac 5941 tggtgcggga tttagcgcgg gaattgggcg aagtgcgccc gattgagttg caggttgtgg 6001 gggcgcaact gcaagcggaa aacatcacaa ctctggcaaa atatcaggag tgtggcacaa 6061 aggaagaact ggtaaaacgt tatctggatg aagtggttca ggattgcggt gaggaaaatc 6121 agcaaacggc ggaatttgtg ttgtatttgc tgacggatga aaaaggaact cgcccattga 6181 agactcgtgt tgagttggaa cgggatttgc agaaattgat ctcggttgat gatgttcaat 6241 ctgggcggat ccccccaacc ccccttaaaa aggggggcta ttttacctcg tttgtaaggg 6301 gggatctcag caagttagat ttgattttag agatttttgt agaatctggc ttagttgttt 6361 tgttaaaaga aaaccccgct aatcgctatc aactggtgca tgactattta gctgagttca 6421 tccgccagca acagcaaccg aagttaagcc aggtgatggc ggaactggaa caggagaaga 6481 agcaacgcct gcaaactgaa gaacagttaa aacaaactga gcaggctaag caaattttgt 6541 cgcaagcaaa 
cgacaaggca aagcaacgaa ttcgtatagg ttcaggagtg ttgattgcat 6601 cctttgttgt tgcaggaatt gtcacattac aagcatttgg gcttgcgggg aaggcacaaa 6661 aaattacaat tctagaacga caaggcattg caactgagaa gctttttcag ttccaagaaa 6721 tagaagcatt gcttttggct ttagatgtag ggcaagattt acacaatttt ataaataaag 6781 aaaatgaacc agcagaatat ccagctgcta gccctgtgtt agctttacag acgactcttg 6841 acaatattca cgagcaaaat caactcaaag ggcatacact acctgtcaca agcgccagtt 6901 ttagcccaga tggcaaacgc attctcactg catcatggga caaaacggcg cggctgtggg 6961 atagctctgg taagctcgtc accgaactca aagggcatac agattctgtc aatagcgcca 7021 gttttagccc agatggcaaa cgcattctca cagcatcatg ggacaaaacg gcgcggctgt 7081 ggcaatatag aactttcgat gaactgttat cagaaggttg ccaatggctg aatgattatc 7141 ttgtcatcaa ccctaaaaaa ctggaaaaac tagaagtctg ccaaaataaa tctaacttaa 7201 gagcagcagc gcaatttttg gttaaagaag gtgaagagca agcaacagca ggtaatattg 7261 atgaggcaat tgcaactttc aataaagcct tccagtggaa tcccagttta aaatttgacc 7321 cgaaagcaaa agcacaagag tttgcaaaca aagggaaagc tgagaaactc ttgcaaaaga 7381 gggtaaggta acacaagcga tcgcaaacta cacaaaagcc caacagcttg atccaaaagt 7441 tgaaattaac gctgattttt gggaaaggct ttgctggtac ggtagcgtgc atcgtcatgc 7501 tgctgatatc atgttcgcct gtgagaaagc tgtgcagctt gctcctgatg atggatatat 7561 tcgagatagt cgcggtttag ccagggcgct gacaggtaat actcaaggag caattgagga 7621 ttttgaggca tacattcccc agactgatga taaggagata aaatcacaac gtcaaagctg 7681 ggtgaaggat ttacgcgccc ttaagaatcc gtttacagag gaagtaaacg gattcttaag 7741 ggcgcgtaaa t // LOCUS NODE_4132_length_7725_cov_5.1336387725 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7725) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7725) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7725 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 70..253 /locus_tag="DP116_25285" /pseudo CDS 70..253 
/locus_tag="DP116_25285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002762727.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 344..569 /locus_tag="DP116_25290" /pseudo CDS 344..569 /locus_tag="DP116_25290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014277087.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(740..2689) /locus_tag="DP116_25295" CDS complement(740..2689) /locus_tag="DP116_25295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25295" /translation="MVDEYRFSGSLPEEATTYVTREADGELYEGLKAGKFCYVLNSRQ SGKSSLRVRTMRRLRDAGVECAAIDLSAGGIQNVPPEQWYADLIDTLIESFGLDVEFG DWWSQNQLNSLVTRFRKFLEEILLAEIKENIVIFIDEIDSVLSLNFPTDDFFAFIRAC YNQRVDNPEYNRLTFCLLGVASPSNLISDKKRTPFNIGKAISLKGFQLHEVEPLEKGL HGRYSDSQAVMKEILHWTGGQPFLTQKLCQFMVESETDNPRTVEQVVRSHIIENWESQ DEPEHLRTIRDRILRDEQRASYLLELYQQIRCSEEQSEITADDTQEQSELQLSGLVVR QQNNLRVYNPIYKDIFDQDWIETQLRNLRPYSENFKFWVASGGSDNSRLLRGKALQDA LDWSKDKSLNYQDREFLAASQAKEKEEAIAALEKEAALERERKDREAVEKRNQVLAEA NQTLADANDTLTEANKKAKRRIRVGSIVLVLTLLGAVISGSLAVVTLGRIQEQAKSLS LLSKLSGELQSKNQQENADEARRQLGLAYGLKENYKLKQAVLLSGIAFAYQKLGDKKK ASEEIQNSVKLLRLEEKKSSSLEKQVGVYVLAMQAALLKEQNNNTEAFKTYEQAFKLL LEIKELNPNFLDKNIAGRVKAGAGR" gene complement(2693..4376) /locus_tag="DP116_25300" /pseudo CDS complement(2693..4376) /locus_tag="DP116_25300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319314.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="adenylate cyclase" gene complement(4977..5240) /locus_tag="DP116_25305" CDS complement(4977..5240) /locus_tag="DP116_25305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317369.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25305" /translation="MRVWLACFLVLFALAELFDWVKEFSLPLPIYILGGTFLAVASNY DKLFGSYLNHASEVSPQQILLDDSPSFTISDSVEELQKSETKS" gene 5408..6328 /locus_tag="DP116_25310" CDS 5408..6328 /locus_tag="DP116_25310" /EC_number="2.7.7.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197086.1" /note="catalyzes the DNA-template-directed extension of the 3'-end of a DNA strand; the delta' subunit seems to interact with the gamma subunit to transfer the beta subunit on the DNA; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit delta'" /protein_id="PRJNA477356:DP116_25310" /translation="MTDAFAQLVGQQQAVELLLQAVAKKRIAPGYLFVGSDGVGRSLA ARCFVELLFSPHQNRVRQGNHPDLLWVQPTYLHQGQRLTAIEAAEKGLKRKAPATIRL EQIREITAFVSRSPLSAPRQVVVLDQAETMAEAAANALLKTLEEPGQTTLILIAPSVE SVLPTLVSRCQRIPFYRLDATAMAQVLTQTGHEEVLQHPAVLSVAAGSPGDAIASYQQ LQAIPPELLKTLTKVPSSYREALQLAKQIDKDLDTEAQLWLVDYLQQFYWQHKRQPSM IKQLEQARKSLLCYAQPRLVWECTLLSFVI" gene complement(6448..6765) /gene="trxA" /locus_tag="DP116_25315" CDS complement(6448..6765) /gene="trxA" /locus_tag="DP116_25315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015212294.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="thioredoxin" /protein_id="PRJNA477356:DP116_25315" /translation="MATKKQFNSFEEMLSGADVPVLVDFYADWCGPCHMMAPILEQVN AQLQGRIQIVKIDTEKYPELASQYEIYALPTLVLFKQGEPVDRIEGVLQTPQLVQRLT ALI" gene 6917..7681 /locus_tag="DP116_25320" CDS 6917..7681 /locus_tag="DP116_25320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015197185.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="beta-ketoacyl-ACP reductase" /protein_id="PRJNA477356:DP116_25320" /translation="MELLPENLQLLREKVVIVTGASRGIGRAIALELAKLGANVVVNY ASSSNAAEEVVDTITKAGGSAIALQADVSQADQVEALVNTVIKKLNRVDILVNNAGIT RDTLLLRMKPEDWQAVIDLNLTGVFLCTRAVSKIMLKQRSGRIINITSVAGQMGNPGQ ANYSAAKAGVIGFTKTIAKEFASRGITVNAVAPGFIATDMTSDINAEEIIKYIPLARY GQPEEVAGMVRFLAADSAAAYITGQVFNVDGGMVMA" BASE COUNT 2188 a 1711 c 1546 g 2280 t ORIGIN 1 catataaacc acagccaaaa caagaaaaaa gctatcactg gggaagtaaa ctgcaaactg 61 ctgtaatttc tgtgcaaaaa ggaaggtata acggcacgaa cgttcttgct gagtgatgat 121 gggttgactc aatggcatcc agctttcttg gagttgggga aatctttggc agaatgtgcg 181 gcattcatat cggggatttt gtaagaaata taaacgggag catcccaatt ttgcaaaaag 241 actgggcgat tagaaatcgc ggctacacaa accaagtccg cgtaggcgga cttggtttgt 301 gtagccccag acttctagtc tgtaggcaat ttaaacattg gggatgctcc catataaacc 361 acagccaaaa caagaaaaaa gctatcactg gggaagtaaa ctgcactcac cctattaagg 421 gtaagaagaa aaagacatca ccgggacaat tgcgatcgcc ttgggcagaa tgggaaccct 481 taaccacgga agtccaagaa gttgcagaaa aatttgtgat ggctaattgt tatgacccca 541 aaaaagcaat gcagttgttt gatgtttaat aattccgtgg cagggtgccg ggagctaggt 601 atcgaaatcg gtgaaatgct tatgttttcg tagaaagtgc ttttgtaaaa atattgttac 661 aacgggctgg aacgcttata tgttacagct gacattcacc gggttcttcc aattgtctcc 721 atatcgcccc tcaccgccct caccgccctg cccccgcttt gactcttcct gcaatatttt 781 tatcgagaaa attcgggttc aattctttga tctctaaaag taacttgaat gcttgttcat 841 aagttttgaa tgcttcagta ttgttatttt gttcttttaa taatgcagct tgcatagcca 901 agacataaac ccccacttgc ttttctagag aagagctttt 
tttctcttcc aatcgtaaga 961 gttttacact attttgaatt tcttcactag cttttttctt gtctcccaac ttctgatatg 1021 caaaggcaat accgcttaac agtacggctt gtttaagttt atagttttct tttagtccat 1081 aagctagccc taattgtctt ctggcttcat ctgcattttc ttgctgattc ttgctttgaa 1141 gttctccgct tagtttagat agtagtgaga gacttttagc ctgttcctgg atacgtccaa 1201 gagtaactac tgccaagctt cctgaaatga ctgctcccag caaagtcaaa actaaaacaa 1261 tactcccaac acgaattcgc cgtttcgctt tcttattcgc ttcagttaat gtatcattag 1321 catcagccaa tgtctgatta gcctcagcca atacttgatt ccttttttct actgcttccc 1381 tatccttcct ctcgcgttcc aatgcagctt ctttttctaa tgcagcgatc gcctcttctt 1441 tttcctttgc ctgactagcc gctaaaaact ctctatcttg atagtttaaa cttttatctt 1501 tcgaccagtc caatgcatcc tgtaatgctt ttccgcgcaa cagtcgggag ttatcgctac 1561 ccccagaagc tacccaaaac ttaaaattct ctgaatatgg gcgtagattc cttaattggg 1621 tttcaatcca atcttgatca aaaatatctt tataaattgg gttataaact cttaagttat 1681 tttgttgcct aacgactaat cctgaaagct gtagttcgct ttgctcttga gtatcatctg 1741 ctgtaatttc agattgctcc tctgaacatc ggatttgctg atacaattcc agcagataag 1801 atgcgcgttg ctcatccctc aaaatcctat ctcgaatcgt tcgcagatgt tctggttcat 1861 cttgggattc ccaattttct atgatgtgcg atcgcacaac ctgttcaaca gtccggggat 1921 tatctgtttc tgattccacc ataaactgac acagcttttg cgtcagaaac ggttgtcctc 1981 ccgtccagtg taagatttct ttcatcactg cttgagaatc agaatacctt ccgtgcaacc 2041 ccttctctaa cggctcaact tcatgcagtt ggaatccttt gagagaaata gctttcccaa 2101 tattaaacgg ggtgcgcttt ttatctgaaa tcaaattgct tggtgatgcg actcccaaca 2161 aacaaaaggt gaggcgatta tattcaggat tatcaacacg ctgattgtaa caggcacgga 2221 taaatgcaaa gaaatcgtct gtggggaaat tgaggctgag aacactatca atctcatcaa 2281 taaaaatgac aatattttct ttaatctcag caagcagtat ctcttcaaga aacttccgaa 2341 atcgcgtgac aagcgaattt aactgatttt gtgaccacca atccccaaat tctacatcca 2401 atccaaaact ctcaatcagc gtatcaatca aatctgcata ccattgttct ggtgggacat 2461 tctgaatccc tccagcagaa aggtcaatag cagcacattc tacgccagca tctcgtaatc 2521 ggcgcatggt tcggactcgc aagctggact ttccactttg tctggagttc agcacataac 2581 aaaatttacc tgccttgagt ccttcataca attcgccatc agcttctcgc 
gttacataag 2641 tggtggcttc ttcgggtaga cttccagaaa atcggtattc gtcaaccata gcttacccca 2701 aacgcattga gaagtactga cgatataaat cgcaactggg aacgcaatca ttacctgaga 2761 gtttcactaa ccccaagctg tgcagtttaa acccaacttc cgcattaagt ctcataggtt 2821 catctgccat caccactttt ttataagctg attctaactg aggattgtgt tgtaaattcc 2881 atagttgctg tcgcagatgg tcgctaaaaa ttccttgttc tgtcggcgca aggcttaaaa 2941 gttgttctag agttatttgc tgactcttga gattggctaa agcctgctgt atgagatatg 3001 ggtgtcctcc taccaactgc atcaactgac gaaatccatc ttctcccaac tgtccatcta 3061 attcatactg gttggcaaga gtttgcactt gttgcaggtt aaattcgggt aactcaatcg 3121 ctaacccaac attaaacggt gaatgattgg tatccaaaga tggataaact tcggtggaat 3181 gaaccacaac taaccgcagt ttcttccaaa gattaccaat tttgtctcct tgtttcgctg 3241 tttcatacca actccgcaga agtaagcaaa atgaggagaa aatatcagga tattcaaaca 3301 accgttcaaa attatctata gcaaagacca gaggagtttt gatatctgag agtaaatact 3361 tttgaaagta acgagtacag tttttattca gcccgtagat atcttgccag tattcgttca 3421 ccttaagttg aagttcgaaa ctgtcagaaa catcaacgca taaccactgc aagaaggttt 3481 ttaagctagt taaggtatca ttatcagcta gtttcaaatc taatttagct gtttgataac 3541 cctgctctct tgcatagtcg agcattttct ccaatagtaa agtttttccc atcctctgcg 3601 gtgctttcag gcgaatcagc gcccctggtt gtacaatcgc accaaagctt ttctcctcaa 3661 tgggcggtcg ttcaatatat atgattttat ctggttctgg tgtataatta ccaggtagtg 3721 tttcttttga acgttgggat tgtaaattat tctgagtcat aagttgagga atcagctgtt 3781 gtggaatccc tgctacctga atgagactac aacctaattt ataagcaaac tcataactct 3841 tatctgctcc gagtgcatca taaaaaccca cagcaaattc aattgctgct ttatcttgga 3901 ttgcctgact catgcctatg acataatgaa tgtgttgtgc gatctgccga aggcagcagc 3961 gaagctatcg ccctagcctg atactctgaa taacaagcat taaggacaac acactcaact 4021 tggtcagcaa acagttgaaa caatcctgcc aaagcttgtg catcaacgag ctttatttga 4081 ccagtctcat cttcaaatac taaaccatct tctccagaac catgtccgga aaagtgaatg 4141 atgtgcggtt cgtattctaa aatggctctg tggatatccc tgtagcgtac tgcttctgcc 4201 cgatctattg aatagcgata agccggaggc ttgacgctgc gcgtatcgcg catatgtgct 4261 cgtttcaacc cctccttaat ttcgcgcatc tcctcatcca gacgcaattg acttgtccca 
4321 gagggatttg ctgacatcag caggattttt cggttttgac tgggattatc actcatgaga 4381 aatttagggt gttcaagagt tatgacctac ggattcaaac agtgaacagt tatcagttac 4441 cagttaccag ttataagtta ccagttatca gttatcagtt atcagctttt tgttcactgt 4501 tcactgttca ctggtcactg ttaaagagaa aatgcacatt ataagaccac actgctgaaa 4561 aacagtgtgg tttggttagt tatagtgatt ttacaagtta agaaattaaa tctataatta 4621 gcttggaggc gtcactgttt tttagtgaca tagtaagcta caaaaccaga ttccaggatt 4681 ccggattcga ctcaaatatt ttttcttgcg ctgcaagtgc ttccaaaagc tcacaaattt 4741 cttttttttg atgcaactaa ttctggatga tgctttgggt tatcagcttg gggagcatcc 4801 caaatgtgta agttatatat agcaatccta aatcctgagc cgctgcgggc acgctgcttt 4861 gaacgtgaca aacaagatcc ccgacaactt ttacgaagtc ggggatctgt ggctttcaat 4921 ttatgcgcaa agcgcaggca ctttcacaca aatcaaatag gattgctata gtaccgttaa 4981 ctttttgttt cagatttttg taattcttca acagaatccg aaattgtaaa ggaaggtgaa 5041 tcatcaagca agatttgttg gggtgatact tcgcttgcat gatttaagta agagccaaaa 5101 agcttatcgt aattagaagc aacagctaga aatgttccac ctaatatata tataggaagc 5161 ggtaggctga attctttcac ccaatcaaac agttctgcta gggcaaacag caccaaaaag 5221 caggctagcc aaactctcat gtttttaccc ctaagattta aacacctttt ttaattttga 5281 cgctcccaca ctgaagtgaa aacatggcgt tccgctggtt ttcatcaaca acgtggctga 5341 aatatatcta ttataaagat ggctggctga aaacccaaaa gggatgtaaa acaaagcacg 5401 acagaaaatg actgacgcat ttgcacaact tgtaggacaa cagcaagcag tggaattact 5461 gctacaagct gtagcaaaaa aacgtattgc tccaggatat ttgtttgttg gttcagatgg 5521 tgttggacgg agtttagcag cacggtgctt tgtggaattg ctctttagtc cacatcaaaa 5581 ccgcgtgcgt caaggtaacc acccagattt gttgtgggtg cagccaactt atttgcatca 5641 aggacaacga ctgacagcaa tagaagccgc agaaaaagga cttaagcgca aagcaccagc 5701 tacaattcgc ttagaacaaa ttcgggaaat tactgcgttt gtgagccgtt cgccgttgtc 5761 agcacccaga caagtggttg tgctagacca agcagaaact atggcggaag ctgcagcaaa 5821 tgctttgctc aagactttgg aagaacccgg acagacaacg ctgattttaa ttgcgccttc 5881 agttgagtct gtattgccaa cgttggtgtc tcgttgtcag cgaattcctt tctatcgttt 5941 ggatgcaact gcaatggctc aagttctgac acaaacagga catgaggaag ttttgcagca 6001 
tccagcagtg ttgagtgttg cggctggtag tccgggagat gcgatcgcat cctaccaaca 6061 actccaagcc attccccctg aactactcaa aaccctgacc aaagtccctt catcttaccg 6121 tgaggctttg caattagcta aacaaattga taaagattta gatacagaag ctcaattgtg 6181 gttagttgat tatctgcaac aattttactg gcagcacaag cgtcaaccaa gcatgattaa 6241 acagttagaa caagctcgca aatctttgct ttgctatgct cagccgcgtc ttgtttggga 6301 atgcacactt ttatcatttg ttatttagct atttgttgaa atcagtagct atcatgtact 6361 tccctacacc cttacacccc tacaccccta cacccatctt caagctagta gagacgttac 6421 atataacgtc tctaccttca aagccaatta aatcaatgcc gttaaacgtt gcactaactg 6481 gggtgtttgc agcacaccct caatccgatc cactggctca ccctgcttaa agagtaccag 6541 cgttggtaga gcataaattt catattgact tgctaattct ggatatttct cagtatcaat 6601 tttgacgatt tgtatgcgtc cttggagctg agcgttgact tgctctaaaa ttggagccat 6661 catatgacaa ggtccacacc aatcggcata aaagtccacc aatactggta catcagcacc 6721 agatagcatc tcttcaaagc tgttgaattg ttttttagta gccatgtgac acgcaccact 6781 tttctactgt ttgactattt gatttttcaa tagtagttcc caagagcgct gatctgtaaa 6841 atcctagatt gtttttattg ttgacgcaca atcactttat gataattagc gctgcggatc 6901 ataaagagga atatttatgg aactattgcc agaaaatttg caacttttgc gtgaaaaggt 6961 agttattgtt actggtgctt cacggggaat tgggcgggcg atcgcattag aattagcaaa 7021 actgggagct aatgtcgtcg tcaattatgc cagttccagc aatgcagccg aagaggtcgt 7081 cgatacaatc acaaaagccg gaggaagtgc catagcgctt caagctgacg tttctcaagc 7141 cgatcaggta gaagcgctag tgaacactgt gatcaaaaag ctcaatcgtg ttgatatttt 7201 agttaataac gcaggtatca ctcgcgatac actgcttttg cgaatgaaac cggaagactg 7261 gcaagctgtg atagacctaa acctaacagg tgttttttta tgtactcgcg ctgtcagtaa 7321 aattatgctc aaacaacgtt ccgggcgaat cattaacatt acctctgtcg ctggacaaat 7381 gggtaacccc ggacaagcca actacagcgc cgccaaagca ggtgtcatcg gctttactaa 7441 aaccatcgcc aaagaatttg cttctcgcgg tatcacggtt aacgctgttg ctcccggttt 7501 tatcgccaca gacatgacaa gcgatatcaa cgctgaagag attattaaat atattccgct 7561 agctcgttat ggtcaaccgg aagaagtcgc tgggatggtg cgctttctgg ctgccgattc 7621 cgccgcagct tacatcaccg gacaagtttt taacgttgat ggcggaatgg tgatggcttg 7681 acacttttaa 
aaaatacagg gtgtaggggt ataggggtgt agggg // LOCUS NODE_4141_length_7708_cov_5.1381167708 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7708) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7708) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7708 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(46..432) /locus_tag="DP116_25325" /pseudo CDS complement(46..432) /locus_tag="DP116_25325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182629.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="cupin" gene complement(497..1483) /locus_tag="DP116_25330" CDS complement(497..1483) /locus_tag="DP116_25330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="PRJNA477356:DP116_25330" /translation="MIQFRIQPDSEIPASTQLFNQIQFAIASRQYSPGYKLPSTRALA MQTGLHRNTISKVYRQLEEEGFVESLAGSGIYVRAQGHEGGSKLQSPMLKQYPQAYKV VQQAVDELLSQGCSLNQAREIFLAEIDWRLRCSARVLVTAPNQDIGAGELMVHELEES LKIPVQLVAMEELAAVLDQTSSATVVTHRYFIGEVEAIAAPKAVRVIPLDIQDYAKEF NQFKNLPKDSCLAIVSLSPGWLRAAEVIIHGLRGDEILVMSSQPKDAYKLQAMVKRAQ VIFCTDKPSFAAVQAAMQVVHEDIIRPPKLMCGENFIGSNSINLLKRELGLG" gene 1684..2424 /locus_tag="DP116_25335" CDS 1684..2424 /locus_tag="DP116_25335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dienelactone hydrolase family protein" /protein_id="PRJNA477356:DP116_25335" /translation="MAERAIHTTTVNLSQDHLQIQAYLAQPSEEGSYPGVVVLQEIFG VNSHIRDVTERIAKEGYLAIAPALFQRVAPGFETGYTKEDIEVGKKYAWEQTKASELL SDIQKAVDYLKTLPNVKQNGFGCIGFCFGGHVAYLASTLPDIKATASFYGAGIPNRTP GGGNPTITLTPDIKGIVYTFFGMEDASIPQEQVDQIEAELEKYNIPHRVFRYDGADHG FFCDQRPSYNSIAAADAWEQVKQLFKQL" gene 2596..3498 /locus_tag="DP116_25340" CDS 2596..3498 /locus_tag="DP116_25340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310262.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="30S ribosomal protein S1" /protein_id="PRJNA477356:DP116_25340" /translation="MDSEAKFSQKANSSFTMDDFAKALEIHDYQFQKGQVVHGRVFQL DSDGAYVDIGGKSSAFIPRDEASLRAVTNLSEVLPLNESLEFLIIRDQDAEGQVTLSR RQLEIKQIWERLAQMQENSQTIQVRVINLNKGGVTVDVQGLRGFIPRSHLVERDNLEA LKGQSLTAGFLEVNRTNNKLILSQRIATQSASFSMLEIGQLVSGKVTGIKPFGVFVDL DGTSALLHIKQVSQKFIDNLEKVFQVSQPIKAVIIDLDQGKGRVALSTRVLENFPGEI LENMDEVMASAQARAERARNKLLE" gene complement(3702..5891) /locus_tag="DP116_25345" CDS complement(3702..5891) /locus_tag="DP116_25345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316421.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA/RNA helicase" /protein_id="PRJNA477356:DP116_25345" /translation="MIVGHEEQSSKFITTEPLKVNTEAGEQKVWDAVKSAFSDRNCIG YWRYPIFSKVGEIRKEPDILIADREFGLVVIEVLPVVIDQVVTIDDGILQLQNYHTTE SNPYQQAEHQLRVLIAYTDRESAIWRRVTGRAIAALPLITQEQWQQKSLDQLPNCPPL IFQDQLGKVGLLERIQQTAAVVPGENLEDKDWELLLSVMSGTPVLRKPPRATVSTTTG KTRAIVMDSLRQRLYEIDLQQEHIGKEIPPGPQRIRGIAGSGKTVLLCQKAAMMHLKH PDWDIALVFFTRSLYDLMIGLLDQWIRRFSGGELQYEPKTNLKLRVLHAWGAKEHPGL YSMICDYHGKRRGTVVDTKQRQPNRGLADLCKRLQEEIKIEPMFDAILIDEGQDLVAE DDLKYEDKQAIYWLAYQALQPVSEEKPEERRLIWAYDEAQSLDSIAVPKAKEVFGENL SNLLSKQPQYSGGIKRSEVMRRCYRTPGSILTAAHAIGMGLLRPEGMLAGITNKDDWN RIGYEVKGDFRRVGKPITIHRPSQYSPNPIAELWGNSILEFQTYGSRQEEMTALAENI MHNIVHDGLNPSRDILVVILGSTSEAIELETEVAGFLIDQNIDVYIPTALTLNDLVPQ WPNNDPDKFWHEGGVTVSRINRAKGHEADMVYVVGFDNVARNEDDVNYRNQLFVALTR ARGWASLSGVGHYPMYDEMRQVIASGDTFTFTYKRPPKRDIGDGEAV" gene complement(6070..6321) /locus_tag="DP116_25350" CDS complement(6070..6321) /locus_tag="DP116_25350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131028.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25350" /translation="MVATLHDPDLPDDLYEQLQELATVENRSINTQVITLLRNALSAK RKQAEDHRRQNVAKLLEETRHRHRVNPADLGLPHLQPHY" gene 6529..7224 /locus_tag="DP116_25355" CDS 6529..7224 /locus_tag="DP116_25355" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316419.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Eco47II family restriction endonuclease" /protein_id="PRJNA477356:DP116_25355" /translation="MSYLSYIADDDLKSAVKIVVDCILETQQKAEEAMYKNVIYPFSA IFDGAVQGFDLNDWLTKERARQSQKTVQNQIGYFHQNILGSIPGWRVLPQGFDICNDT RQIFAEIKNKYNTVKGSDKFGIYDYLSNRLNEPDYQGFTAYYVEIIPNTKKPYDRAFT PSDRTTSTRRPLNEKIRQISGQAFYDMATGVPGALSMLFNVLPDVIGEVSGLDRLSEQ QRASYKTLFDRAY" gene complement(7234..>7708) /locus_tag="DP116_25360" CDS complement(7234..>7708) /locus_tag="DP116_25360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316418.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="DNA (cytosine-5-)-methyltransferase" /protein_id="PRJNA477356:DP116_25360" /translation="TINYTVLWSFTLFYESPLKDVPPSEGMKYSPKRSQVLELVPPGG CWRDLPEDIQKSYMMKSYYLTGGRTGMARRISWDEPSLTLTTSPSQKQTERCHPEETR PFTVREYARIQTFPDDWEFMGGVGSKYKQIGNAVPVNFAYHLGRAIVNVLTSGSH" BASE COUNT 2208 a 1662 c 1603 g 2235 t ORIGIN 1 cagcttaagc tttgtgagtt ttcttacaaa gacccctact tttcaaacat cctctaagcg 61 tcctgcttct ccacctatca ggtatataag gtcttcattg aatggattac gaagatgatg 121 gggttgttgg ggtagcccaa atcccatgaa gtctcctggt tcaacttcgt attccatatc 181 tccaatttcc gcaatacctc gacctgacaa gatgtaaacc cattcttctt cattctggtg 241 agcgtgataa atgaaagatt ccttacccgc aggtacacga gcaatcgtca caccaatgcg 301 tttaagaccg acagcacgcc caaggaaccg taaataaatt tctgaattgg gattgagcgg 361 atgatgaaac tcaaacgcat tcattgaatt aatgtctgct gctttcagca gagaatgttg 421 ttcatcagtc acgttgatgt cctcctatca agttggcaaa cggtctgaac accacaataa 481 agcctaaatt tatttattat cccaacccta gctctcgttt aagcaagtta atcgagttac 541 taccaataaa attttcacca cacatcagct tgggtggacg aataatgtct tcatgaacga 601 cttgcatagc agcttgcact gcagcaaaac taggtttatc ggtgcaaaaa atcacttgtg 661 cccgtttaac cattgcttga agtttgtagg catcttttgg ttgcgaactc attacgagta 721 tttcatcacc ccgtaaaccg tgaatgataa cttctgctgc tcgcagccag ccaggactga 781 ggctcactat ggcaagacag ctgtcttttg gtaggttttt aaactggttg aactctttgg 841 catagtcttg tatgtccagg ggaatgacgc gtacggcttt gggtgctgcg atcgcctcca 901 cttccccaat aaaataccga tgtgtaacca ctgtcgctga ggaagtttga tctagcacag 961 cagccaattc ttccatagca accaactgta ctggtatttt gagggattct tccaattcat 1021 gcaccatcaa ttctccagca ccgatgtctt gatttggagc cgtcaccaac actctcgcac 1081 tacaacgtaa acgccaatca atttcagcta aaaatatctc acgcgcctga ttaagcgagc 1141 agccttggga gagcagttcg tcaactgcct gctggacaac tttgtatgct tgaggatatt 1201 gtttgagcat tggcgattgc agcttactac caccctcatg accttgagcg cgaacgtaaa 1261 ttcctgagcc agcaagactt tctacaaatc cctcttcttc caactggcgg taaaccttgc 1321 tgattgtgtt gcggtgtaaa ccagtttgca ttgccaacgc gcgcgtactc ggcagtttat 1381 aacctgggga atattgccga gaggcgatcg caaattggat 
ttgattaaac agctgggttg 1441 atgcggggat ttcactgtct ggctgaatac gaaactgaat cattaccgaa tctcctgcgt 1501 gataatggct gcatacgacc agctgtgact gctaagcgtc aaatatttta cttatatatt 1561 gaatttctct attgaaaaat tctggtgcgg ttgttacttg tagttgataa tttgattgga 1621 cattggcttg agctatcggc atagctatat ctatatagtc acaaagaact ggtaaaaact 1681 attatggcag agcgagcaat acataccaca acagttaatc tttcgcaaga tcatttgcaa 1741 atacaggcat atctggcaca gccttctgaa gaaggttctt acccaggagt tgtagttttg 1801 caagagattt ttggagtcaa ctcccacatt cgggatgtca cagaacgtat tgccaaggaa 1861 ggttatctgg caattgcacc cgcacttttc caaagagttg cccctggatt tgaaactggc 1921 tacaccaagg aagatataga agtgggtaaa aaatacgcat gggagcaaac aaaggcttca 1981 gaactattga gcgacattca aaaagccgtt gactatctga aaactttgcc gaatgtaaag 2041 caaaacggct ttggttgcat cggcttctgt tttggtggtc atgtcgctta tcttgcttcc 2101 actttaccag atatcaaagc caccgcttcc ttctacggtg ctggtatccc caaccgtaca 2161 ccaggaggtg ggaatcccac catcacgctt accccagata ttaaaggcat cgtctacacc 2221 ttttttggca tggaagatgc tagcatccca caggaacaag tagaccaaat tgaggcagag 2281 ttagaaaaat acaacattcc tcatcgtgtg tttcgctacg atggagctga ccacggattt 2341 ttctgcgacc aacgtcccag ttataattct atagccgcag ccgatgcttg ggagcaggta 2401 aaacaactgt ttaaacaact ataactattc aattgattta cgtccgcttt gctttaagct 2461 agtccctagt catgagtcat aacaaatgac taaggacagt ttttataagc aatcatagtc 2521 actcgaatag tgttgtttgg tgttgaatca aacttcagtg gcgttagcta gaatagcaat 2581 ttatctttaa gagtcatgga ttccgaagcc aagttttctc aaaaagccaa ttcgtcattt 2641 acaatggacg actttgccaa agcactggaa atacacgatt atcagtttca aaagggacaa 2701 gtggtgcatg gcagagtatt ccaacttgat tcagatgggg catatgtcga tattggtggc 2761 aaatcgtcag cgtttattcc ccgcgatgag gcttctttga gagcagtaac caatttatcg 2821 gaggtgctgc cactaaatga gagtttagaa ttcttaatca tcagagatca ggatgcagaa 2881 ggtcaagtca ccctttcgcg acggcaatta gaaatcaagc agatttggga acgactggcg 2941 caaatgcaag agaattccca aacaatacaa gtgcgggtaa taaatttaaa taagggtgga 3001 gtgactgttg atgtgcaggg tttacgagga tttattccgc gatcgcactt ggttgagcga 3061 gataatttag aagcactcaa aggtcaatct ctgactgctg gctttttaga 
agttaaccgt 3121 actaacaaca agcttattct ttcccagcgc atcgcaaccc aatctgctag cttcagtatg 3181 ttagaaattg gtcagctagt gtccggaaaa gtaactggta tcaaaccctt tggcgtgttt 3241 gtagatttag atggcacaag cgctttgctt catattaagc aagtgagcca gaaatttatt 3301 gacaatttgg aaaaagtgtt tcaagtcagt caaccaataa aagctgtcat tatagattta 3361 gatcaaggca aaggtcgggt tgctctttcc actagagttc tggaaaattt ccctggtgag 3421 atcctagaaa atatggacga ggtgatggca tcagcccagg cgcgtgcaga acgagcaagg 3481 aacaagctgt tggaataagt tagttgttag ttgttggtta ttttactaac cactcactac 3541 caacaaataa ctctcttgga gttgctgttg taaagtcaat tctgtaaaag caaatttgat 3601 ttcacagacc aaactgccct aactgcacaa cgcaggctct aagagagctt gccagtattc 3661 attggcaaaa aattctgcta tatcggttac agtatccgtt gctaaactgc ttccccatca 3721 ccaatatccc gtttcggtgg ccgcttgtag gtgaacgtga aagtatcacc actagcaatg 3781 acttgccgca tctcatcata catcggataa tgaccgacac cactcagact tgcccaaccc 3841 cgcgccctag ttaaagcaac aaacaattgg ttacggtaat tcacatcatc ttcattgcga 3901 gcgacattat caaagccaac cacataaacc atatcagctt catgtccctt agcgcgattg 3961 atgcgcgaca ccgtaacccc accttcatgc cagaatttat ctgggtcgtt gttgggccat 4021 tggggaacca aatcatttaa ggtcaaagca gtggggatat agacatcaat attttggtct 4081 atcaagaaac ccgcaacttc agtttctaac tctattgctt cggaagtaga acctaaaatg 4141 accactaaga tatcacggct gggattaaga ccatcgtgaa caatattatg cataatattc 4201 tcagccagcg ctgtcatttc ttcttggcga gaaccgtaag tttgaaattc caatatagag 4261 ttaccccaaa gttcagcaat agggttgggt gaatattgag acggtcgatg tatggttatc 4321 ggcttaccaa cacgtcggaa gtctcctttg acttcgtaac caattctgtt ccaatcatct 4381 ttgttagtaa tacctgcaag cattccttct ggacgtagca aacccatacc aattgcatga 4441 gcagctgtga gaattgagcc aggagtgcga taacaacggc gcatcacctc agaacgttta 4501 attccacctg agtattgcgg ttgcttgctc aaaagattgc tcaaattctc gccaaatact 4561 tctttcgctt ttgggactgc tatactatca agactttgtg cttcatcata agcccaaatt 4621 aagcggcgtt cctctggttt ttcctcactg acaggttgta acgcttgata agccaaccaa 4681 taaattgctt gtttatcttc atatttcaaa tcatcttcgg ctactaaatc ttgaccttca 4741 tcaatcaaga tggcgtcgaa catcggctca atttttatct cttcttgtag ccgtttacat 
4801 aagtctgcta aaccccggtt aggttgtctt tgttttgtgt ctacaaccgt tccgcgtctt 4861 ttgccgtggt agtcacagat catactgtac aaacctggat gttcctttgc tccccacgca 4921 tgaagcactc gcagtttgag attagttttc ggctcatact gcaactcacc accactgaaa 4981 cggcgtatcc actgatccag caagccaatc atcaaatcgt agagcgatcg cgtgaaaaac 5041 accaaggcaa tatcccagtc tggatgcttg aggtgcatca tcgccgcttt ttggcatagc 5101 agtactgttt tgcctgaacc agcaatacca cgaattcgct gaggacctgg gggaatttct 5161 ttaccaatat gttcttgctg taaatctatt tcgtatagcc tttgtcgcaa actgtccatg 5221 acaatggcgc gggttttgcc tgtggtcgtc gaaactgtcg cacgaggagg tttacgaagc 5281 acgggtgtcc cactcataac tgacagtagc aattcccagt ctttatcttc caggttttct 5341 cctggaacta ccgcagcagt ttgttgaata cgttccaaca agccaacttt acctaactgg 5401 tcttggaaaa ttagaggagg acagttgggt aattggtcaa ggctcttttg ttgccattgt 5461 tcttgggtaa tcagaggtaa tgcagcgatc gctcttccgg taaccctgcg ccaaatagca 5521 gattcgcggt ctgtatacgc aattaatact cgcagttgat gttcggcttg ttgataaggg 5581 ttagattctg tagtgtgata gttttgcaac tgcaaaatac catcatcaat agtaacgact 5641 tggtcaatga cgactggtaa gacctcaata accactaaac caaattctcg atcagcaatg 5701 agaatatcag gttccttacg aatctctccc accttcgaga aaattggata acgccaataa 5761 ccgatacagt ttctatcaga aaaagcactc ttaacagcat cccaaacttt ctgctcacct 5821 gcttcagtgt taacttttaa cggttcagtc gtgataaatt tgctactttg ttcttcatgc 5881 ccaacaatca tttagtccct gcctgtacta gttcaatttt tttatacaaa agtattttga 5941 gtaatgctgc caagctagca ctggcaatat gatacttacc agtgcgaata cacggaacta 6001 cgtagctggt taagtccagc agatatgcgg agttagttga aattattgta ataatttttt 6061 atatagctat cagtagtgag gttgcagatg aggtaatcct aagtccgctg gattcacgcg 6121 gtggcggtga cgagtttcct ccaagagttt tgcaacattc tgtcgcctgt ggtcttccgc 6181 ttgctttctc ttagctgata aagcatttcg taatagagtg ataacttgag tgttaattga 6241 tctgttctca acagtagcta actcttgtag ctgttcatac aaatcatcag gtaaatctgg 6301 gtcatgaaga gtagcaacta tcaacctgtt agttgaattt attggcaatg gtagcacgat 6361 ggaactggcg tgggttggag gggagagaga gggcgatcgc ccaaggggca gtgacaaatt 6421 tctgctacac aaagtttgaa ctagtcgaac ctattaccct gattgtgctg aggtataact 6481 
gaggattggt tagtatatac aattgttagg actactgaat tattgagcat gagttatctt 6541 tcctatatag ccgatgatga ccttaaatct gccgtcaaaa tagttgttga ctgtattctt 6601 gaaactcaac aaaaagccga agaagcaatg tacaaaaatg tgatatatcc gttttcagct 6661 atttttgatg gtgctgtgca aggttttgac ttaaatgatt ggctgacgaa ggaaagagct 6721 agacaaagtc aaaaaactgt acaaaatcaa ataggttatt tccaccagaa tattttaggt 6781 agtattcctg gctggagagt acttcctcaa ggattcgata tctgtaacga tacacgccaa 6841 atttttgctg aaattaaaaa taagtataac actgttaaag gaagtgataa atttggaata 6901 tacgattatt tatcaaaccg tttaaacgaa ccggattatc aaggttttac agcctattat 6961 gtagaaatta ttcctaatac gaaaaagccc tacgatagag cttttactcc ttcagatcgg 7021 actactagta caagaagacc tttaaatgaa aaaatcagac aaatcagtgg tcaagcattt 7081 tacgatatgg caactggtgt gccaggggca ttaagtatgc tttttaatgt attaccagac 7141 gtgatcggtg aagtttccgg acttgataga cttagtgaac aacagcgagc aagctataaa 7201 accttatttg atagagctta ctgaggtaat aaatcaatgg gatccacttg ttagcacgtt 7261 aactatagct cttcctaagt gataagcaaa atttacagga actgcattac caatctgttt 7321 atattttgag cctacaccac ccataaattc ccaatcatcg ggaaaagttt gaattcgggc 7381 atactccctt actgtaaaag gtctagtttc ttctggatga cagcgttcgg tttgtttttg 7441 tgagggagaa gtagtcagag ttagacttgg ctcatcccaa gaaattctac gtgccatacc 7501 agtacgccca cctgtgagat agtagctttt catcatgtag cttttttgaa tatcttcagg 7561 taaatctcgc cagcagccgc ctggtggaac taactctaaa acttgagatc tctttggact 7621 atatttcata ccttcagaag gtggaacatc tttcaaggga gactcgtaaa acagagtaaa 7681 actccataaa acggtgtaat ttattgtt // LOCUS NODE_4160_length_7663_cov_5.3696117663 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7663) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7663) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7663 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1..523 /locus_tag="DP116_25365" /pseudo CDS 1..523 
/locus_tag="DP116_25365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655735.1" /note="internal stop; incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=2 /transl_table=11 /product="hypothetical protein" gene 557..619 /locus_tag="DP116_25370" /pseudo CDS 557..619 /locus_tag="DP116_25370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182629.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="cupin" gene complement(908..1810) /locus_tag="DP116_25375" CDS complement(908..1810) /locus_tag="DP116_25375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878551.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_25375" /translation="MPYINVRAVEHYYEWIKKPSDTQDKPVMVFIHGWAGSCGYWQGT AHALSEQFDCLLYDLRGFGRSRGQPSVAKASEAVVESIELESPQAESIAIQELTYELE EYADDLAALLDKLQIQRVYIAAHSMGASIATLFLNRYPQRVQRAILTCSGIFEYDEKA FTTFHKFGGYVVKFRPKWLDKIPFVDRMFMARFLHRSIPAAERRAFLQDFLKADYDAA LGTIFTSVSKAAAELMPQEFAQLSVSTLLVAGEYDKIIPAQMGRQAAALNDKVEFVMI PNTAHFPMLEDPATYLKRVREFLQ" gene complement(1887..3227) /locus_tag="DP116_25380" CDS complement(1887..3227) /locus_tag="DP116_25380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111740.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease ATP-binding subunit ClpX" /protein_id="PRJNA477356:DP116_25380" /translation="MSKYDSHLKCSFCGKSQEQVRKLIAGPGVYICDECVDLCNEILD EELLDTNGAASSQPAPKSEPPPKRRARSSNLSLNQIPKPREIKKYLDEHVIGQDEAKK VLSVAVYNHYKRLSLVQSKGSGKGLDDAVELQKSNILLIGPTGCGKTLLAQTLAKILD VPFAVADATTLTEAGYVGEDVENILLRLLQVADLDVEEAQRGIIYIDEIDKIARKSEN PSITRDVSGEGVQQALLKMLEGTVANVPPQGGRKHPYQDCIQIDTSNILFVCGGAFVG LEKVVEQRTGKKSIGFVQPGEGQSKDKRTADTLRYLEPDDLVKFGMIPEFIGRVPMVA VVEPLDEEALMAILTQPRSALVKQYQKLLKMDNVQLEFKPEALRAIAQEAYRRKTGAR ALRGIVEELMLDVMYELPSRKDLTRCTVTREMVEKRSTAELLMHPSTLPKPESA" gene complement(3237..3932) /gene="clpP" /locus_tag="DP116_25385" CDS complement(3237..3932) /gene="clpP" /locus_tag="DP116_25385" /EC_number="3.4.21.92" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130999.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-dependent Clp endopeptidase proteolytic subunit ClpP" /protein_id="PRJNA477356:DP116_25385" /translation="MLVSQSGNYQPNSLSDFKLYSLNSPSNIVPMVVEQSGVGERAFD IYSRLLRERIIFLGTPIDDNVANSIIAQLLFLDAEDSEKDIQLYINSPGGSVTAGMAI YDTIQQIRPDVVTICFGLAASMGAFLLTAATPGKRMSLPDSRIMIHQPLGGAQGQAVD IEIQAREILYHKSKLNQLLAQHTGQPLERVEADTERDFFMSAEEAKNYGLIDQVISRQ NLPSPGENVTILK" gene complement(4200..5672) /locus_tag="DP116_25390" CDS complement(4200..5672) /locus_tag="DP116_25390" /EC_number="5.2.1.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878554.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="trigger factor" /protein_id="PRJNA477356:DP116_25390" /translation="MKVTQEKLPASQIGLEIEITPEKTKETYEKVVKNLASTANIPGF RKGKVPRQILLQRLGTTRIKAAALEEVIQDGIEQALSQEKIQVLGQPQLRSSFEELIN NYEPGKPLVISASVDVEPEPNLGEYTGLQFKAEEVKYDPQQVDKALEDERQQMATLIP VEGRSAQIGDVAVVDFKGVLAASVGEDKTDVPQPIPGGEATDFQLEMQEDKFIPGFVT GIVGMNPGETKEISAQFPDPYVNEELAGKPAVFTVTLKELKEKELPELNDDFAQEVSE YNTLDELRNSLEERFQKEASQKTKSHQQEALLAELLKHVEVXXXDLPETMIQQEVNQM LTQTAMRLSQQGLDVRKLFTQDVIPQLRERSRGEAVERIKRSLALMEVAKRESIEVSA DQIEARVKELMEEYSEKDVDEKRMLEVVENELLTEKIINWLLERSTIELVPEGSLSTA ETETEEVTVDEPVAATETAEQPLPASTETASGDTESTEGQ" assembly_gap 4713..4722 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 6563..7603 /locus_tag="DP116_25395" CDS 6563..7603 /locus_tag="DP116_25395" /EC_number="1.2.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316429.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aspartate-semialdehyde dehydrogenase" /protein_id="PRJNA477356:DP116_25395" /translation="MSKSYRIAILGATGAVGQELLELLESRNFPVSDLRLLASVRSAG RTLPFEGENLPVEPVSEKAFENVDLVLASAGGSISKTWAPKAVAAGAVVVDNSSAFRM NPEVPLVVPEVNPEAAANHKGIIANPNCTTILMTVAVWPLHKVSPVQRLVAATYQSAS GAGAKAMEEVKTQASAILEAKQPVAEVFPYPLAFNVFPHNSPLNDFGYCEEEIKMVNE TRKIFGTQDIRITATCVRVPVLRSHSEAINLEFEAPFSQDLAREILSQAPGVKLVEDW KANHFPMPIEATGQDEVLVGRIRQDISHPCGLELWLCGDQVRKGAALNAIQIAELLVE KNLLKPAVAVSH" BASE COUNT 1989 a 1733 c 1677 g 2254 t 10 others ORIGIN 1 aaaaaaagaa tttcatcctt tttccctttt cccttttcct ttttccttcg ttaatcaaga 61 tgtgggctac gatgctggaa aaaagataca cgggcgtcaa cgacatttgt ccgtggactt 121 gctggggcta gttttacggg ttttagtcac atctgcaagt ctgccagaac gtactggagc 181 taaaaaagta ctaaaacgag ttcatggtat gggcgattgc ctacggcaga gctacgctta 241 acgcgtaaat cgcctgcata cgatttggat ggatgggggg tatagagggg aagattttat 301 gcgctgcatt atggatatgt ttcgctgggt tgttgaaatt gttctcagac ccttagagaa 361 gaagggtttt gtccacttgc caaaacgttg ggttgttgag 
agaacctttg gatggctcaa 421 ttggtgtcgg cgtttgagca aggattatga aagattacca gaaacttctg agacttttat 481 ttatatcgcg atgattcgta ttatggtcag acgactcgca taatttttta ctcgtttcga 541 cttttcaaac atcctcttag atgttggcgt ttttcctcgg cttggtaaac gggtaattag 601 ggatagtgag tcagcttaga tagttgatga gtcagcactt caactttttt ggagcagcaa 661 ccaatctgct gaggattgat gcttactgga gtggttacga cataacaatt caaatgcagc 721 ctccttttct ttttatacta gtacaaacgc cctgattctt tagttacccc acttcattta 781 gtctaaagtc cagtaaatga cattgactga gatatcatag tgaactacta aacaccgacc 841 cttacgggca cggtaattta aataatactt aacagttcac agtcaacact caacagttaa 901 ctgttgacta ttgcaaaaac tctcgcaccc gtttgaggta agtagccgga tcttccaaca 961 ttgggaaatg agcagtattt ggaatcatga caaactcaac tttatcgttg agtgcagctg 1021 cctgacgacc catctgggct ggaataattt tgtcatactc acccgccacc agtagcgtag 1081 acacactcag ttgagcaaac tcctgtggca tcaattcagc tgcggcttta ctcacagatg 1141 taaaaattgt gcctaaagct gcatcatagt ctgctttgag aaaatcttgt aagaaagccc 1201 gacgttctgc cgctggtatg gaacgatgca gaaatcgtgc cataaacatc cggtcaacaa 1261 acggtatttt atccagccac tttggacgaa atttgaccac gtagccaccg aatttatgaa 1321 aagttgtaaa tgctttttcg tcgtattcaa aaataccact acaggtcaaa atcgctcttt 1381 gcactcgttg tgggtagcgg ttgagaaaca aagtagcaat ggacgcgccc atagaatgag 1441 cagcaatata cacgcgttga atctgtaact tatctagcaa agctgccaag tcgtcagcgt 1501 attcctctaa ctcataagtt aactcttgaa ttgctataga ttctgcctgc ggtgactcca 1561 actcaataga ctcaacgaca gcttcactcg ctttggctac gcttggttgc ccacgggaac 1621 gcccaaaccc ccgcaaatcg taaagtaaac aatcaaattg ttctgacaga gcatgagcag 1681 taccctgcca atacccacaa gagcctgccc aaccgtggat aaaaaccatc acaggcttgt 1741 cttgtgtatc tgatggcttt tttatccact cgtagtaatg ctcaacagca cggacgttta 1801 tgtagggcat ttgtcaaaag tcaaaggtga aaagtcaaga gttaagagta ttttactaat 1861 gactaatgac taatgactaa tgactattac gctgattctg gcttaggtag tgtggatggg 1921 tgcatcagta gttccgcagt agaacgcttc tctaccattt cccgcgtgac tgtgcaacga 1981 gtgagatctt tacgcgaagg caactcgtac atcacatcca gcatcaattc ttccacaatg 2041 ccccgcagcg ctctcgcgcc ggttttgcgg cggtatgctt cttgagcaat agctcgcaga 
2101 gcctctggtt taaactctag ctggacgtta tccatcttca gcagcttttg gtactgcttc 2161 accaaggcac tacgcggttg ggtcaaaatt gccatcagcg cctcttcatc cagcggctct 2221 accaccgcta ccattggcac ccgcccaata aattccggaa tcatcccaaa cttcacgagg 2281 tcatccggtt ctaaatagcg gagagtgtcg gctgtacgct tgtctttgga ctgaccttct 2341 cctggttgta caaaacctat tgactttttg ccagttctct gctctacaac tttttctaag 2401 ccgacgaaag caccgccgca gacaaacaag atattgctcg tatcaatctg tatgcagtct 2461 tgataggggt gcttgcgtcc tccttggggt ggtacattgg caactgtccc ctctaacatt 2521 ttcagtaaag cttgctgtac gccttcacca gagacgtctc gggtaattga ggggttctcg 2581 ctcttgcgag caatcttatc tatttcgtcg atgtagataa ttccgcgctg cgcttcttcg 2641 acatccaagt ctgcaacttg caacagtcgc agcaagatat tttctacatc ttctcccacg 2701 taccctgcct ctgttagggt tgtggcatca gcaacggcaa aaggcacatc cagaattttt 2761 gccagagttt gtgctagcaa agttttgccg cagcctgtgg gaccaatcag cagaatatta 2821 gatttttgca gttccaccgc atcatccaaa cctttgccac tgcctttaga ctgaaccagc 2881 gacagccgct tgtaatggtt gtaaactgcg acggaaagta ctttcttagc ttcgtcttga 2941 ccgatgacgt gttcatctag atacttttta atctctcgtg gcttaggtat ttgattcaat 3001 gagagattag aggagcgggc acgccgtttt ggtggtggtt ctgacttggg tgctggctgc 3061 gacgatgcag ccccatttgt gtcgagcaac tcctcatcca gaatttcatt acacaagtca 3121 acgcattcat cgcagatgta gactcccggt ccggcgatta atttacgtac ctgctcctga 3181 gactttccgc aaaacgaaca ttttaaatgg gagtcgtact tagacatacc agcctcttat 3241 ttcagaatgg tgacgttttc ccctggtgag ggaagatttt ggcgggagat gacttgatca 3301 atcaacccgt agttttttgc ctcttcagcc gacatgaaga aatcgcgctc agtatcggct 3361 tcaactcttt ctaacggttg accagtgtgc tgagcgagta actgattcaa cttagactta 3421 tgataaagaa tttctctggc ttgtatctcg atgtcaacgg cttgcccttg tgcaccgcct 3481 aatggttggt gaatcataat ccgggaatca ggaagagaca tccgtttacc aggcgttgct 3541 gctgttaaca agaacgcccc catgcttgcg gctaatccaa aacatatcgt gaccacatca 3601 ggacgaatct gctgaatggt atcatagata gccatacctg ctgtcacgga gccacctgga 3661 gaattaatgt acagttgaat gtccttttct gaatcttcag cgtctagaaa caacagctgg 3721 gcaataattg agttggcaac gttatcgtct atgggcgttc ctaagaagat aatccgttcc 3781 
cttagcaagc gagagtagat gtcgaaggca cgttctccca cgcctgattg ttctaccacc 3841 atcgggacga tgttgctagg actgtttagg gagtagagtt taaagtcact caaactgtta 3901 ggttggtagt ttcccgactg cgatacaagc ataaaagaac gtttgacact aagtgtaaca 3961 ataatttaga cagcaactgc tgcctttttg atgagcctga tttttgatgt ggctatgtgt 4021 taaccattat gccttatata aattcttatc gtgtaaaaca gtatcaaacc aagacggtga 4081 tatacatggt tactgaaatt aaagtttctt aaacttctag cttaggacag ggagatgggg 4141 agagtaaggg aatgagttgg ggaaacttct cactctcttt ctccttatcc ccttataatc 4201 tattgtccct ctgttgactc tgtgtcacca gacgcagttt cagtactggc tggaagaggt 4261 tgttctgcag tttcagttgc tgcgactggc tcatctaccg tgacttcttc agtctcagtt 4321 tctgctgtac tcaaagaacc ttcaggtacc aactcaattg ttgagcgttc tagaagccaa 4381 ttgatgattt tttcagtcaa caattcgttt tctacaacct caagcattct tttctcatca 4441 acatcttttt cagaatactc ctccatcaat tctttgactc tggcttcgat ttgatctgcg 4501 ctcacttcga tggactcgcg tttagcaact tccataagcg ctagagaacg tttaatgcgc 4561 tcaaccgctt caccgcgaga acgttcccgc aactgaggaa tcacatcttg agtaaataac 4621 tttctcacat caagcccctg ctgagaaagc cgcatggctg tttgcgtcag catctgattg 4681 acttcctgtt gaatcatcgt ttcaggcaag tcnnnnnnnn nnacttccac gtgcttgagc 4741 agttcagcta acagggcttc ttgctgatgc gattttgttt tctgtgaggc ttctttttga 4801 aagcgctctt ccaaggaatt acgcaattcg tccaaagtat tatactcact gacttcttgg 4861 gcaaagtcat cattcaattc aggcagctct ttttccttga gttctttgag ggtgacagtg 4921 aaaacagcag gttttcctgc taattcttcg ttaacataag ggtctggaaa ctgagcagag 4981 atttctttgg tttctccagg attcattccc actatccctg tgacaaagcc aggaatgaat 5041 ttgtcttcct gcatctccaa ttggaaatca gttgcttccc cgcctggtat tggctggggt 5101 acgtccgttt tgtcttcacc aacagaagca gctaaaaccc ctttgaaatc taccacagcg 5161 acatcaccta tttgcgccga tcgcccttcc actggaatta atgttgccat ttgttggcgt 5221 tcgtcttcta aggctttgtc tacctgctgt ggatcgtact tgacttcctc ggctttgaat 5281 tgcaagcccg tatattcacc caaatttggt tctggttcca catctacaga tgcagaaatg 5341 accaggggtt ttcctggttc ataattgtta atcaattcct caaatgaaga gcgtagctgt 5401 ggttgaccaa gcacctggat tttttcttgt gagagtgctt gctcaatgcc atcttgaata 5461 acttcttcta 
gcgcagctgc cttgattcga gttgtgccaa ggcgctgcag gagtatctgc 5521 cgaggtacct tgcctttgcg aaacccagga atatttgcag tactcgctaa gtttttaaca 5581 actttttcgt aagtttcctt ggtcttttct ggggtaatct ctatttctag accaatttga 5641 ctggcgggaa gtttttcctg ggtgactttc atgcttcgtc tctattattt gtatttcttt 5701 ttacttttgg tttactgacg ccaaaatttg cgtctttaag ttaaagtttc ctgacttgga 5761 tcaaaggtgc ctgtatggct taattatatc caagtgctcc aggtgccctt ataacagata 5821 tctagtttac ttttgttgtc aaaatccttg acgttttagc gcaaactagt ttcttgttct 5881 caactccaaa ttgtttttta ccaaagggct aacaccgtga tatcctggtt ctcaactagc 5941 acgctttgca cgccaaagtt gtaacgttgg ggcgatcgct gctctgtgca actgagcgca 6001 gcagctgaac ctaactcctg tgttgtgaga ataggttagc ttatcgccct ggtggtctgc 6061 cgtccattga atcatactac actaatcatc aagacagagg tctattacag gcataacacc 6121 tgtctagtac aggtcggccc aaataaagct acccttcggg ttcaccagtc gcctctgtcg 6181 ggaaaccctc ctcatgcgcg ctggattcac cattccaaat caacgaaaag cccgcagaat 6241 caaccttttg acttttaact tttgactttt gacgaacgcc agcggtacta gggctatgca 6301 aaagtttagc tcatccacac gttggaagat gatctactgt ataataaaag tcttaatttg 6361 atggagagtt tagtaataaa atattgtaaa ttgtcatact aatcaggatg agtagcaaaa 6421 taaatgtatt agattaagaa ttacaatgaa tttctcaaaa aatcaaacta ttaaaaggtt 6481 tatttaatgt ctgctattgc catttaagct ttgaaatttt acaagaaatt gcacaaaaca 6541 tctcgtaaag gaggaagtca atttgtctaa atcctacagg atagctattt tgggagcaac 6601 tggtgcagta ggccaagagt tgctggaatt attggaaagc cgtaatttcc ctgtttctga 6661 tttaaggtta ttggcttccg ttcgcagtgc agggcgaaca ctgccttttg agggagaaaa 6721 tttgccagta gaaccagtga gtgaaaaagc ctttgagaat gtggatctgg tgttggcgag 6781 tgcaggtggt tccatctcaa aaacttgggc accaaaagct gtagcggcgg gtgctgtggt 6841 ggttgataac tccagtgcgt ttcgcatgaa tccggaagtt cctttggtgg ttccagaagt 6901 gaatccagaa gcagcagcga accacaaagg tatcattgct aatcccaact gcacgacgat 6961 tttgatgaca gtcgcagtgt ggcccctaca taaagtatca ccagtacagc gcctagtggc 7021 tgcaacttac caatcagcca gtggtgctgg tgctaaggca atggaagaag tgaaaactca 7081 agcaagtgct attttagaag ccaagcaacc tgttgctgaa gtttttccct atccactagc 7141 atttaatgtc ttcccgcaca 
actccccttt gaacgacttt gggtattgtg aggaagaaat 7201 aaaaatggtt aacgaaactc gcaaaatctt tgggactcag gatatcagaa tcactgcaac 7261 ttgtgtacgg gttcccgtac tgcgtagcca ttcggaagcg attaacctag aatttgaggc 7321 accatttagc caagatttgg ccagagaaat tctcagtcag gcaccgggtg tgaaattggt 7381 agaagattgg aaggcaaacc atttcccaat gccaattgaa gccacaggtc aagatgaagt 7441 tttggtaggg agaattcgtc aggatatttc tcacccgtgt ggcttggaac tgtggctgtg 7501 tggtgatcaa gtccgcaaag gcgcagcctt gaatgcaata caaattgctg agttattagt 7561 agaaaaaaat ctgctcaaac cagcagtagc ggttagtcat tagtgagcca gcactgcagg 7621 agggtttccc tccgtaggtg actgccgaaa gggttccaca ggc // LOCUS NODE_4168_length_7643_cov_4.588034 7643 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7643) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7643) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7643 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 511..903 /locus_tag="DP116_25400" CDS 511..903 /locus_tag="DP116_25400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319293.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25400" /translation="MQLEFVPVEEFYFVLTLAVRTLEELAEPDLVEQVRSRLLVECGQ PSTVAPGKQNTFNYVFRVQGVDNSPASGLIVSISDWQDKLRLSSDYGWTLDEQRKPIR TEKYSQRSHFAQQLRSHLQLWLQLPINT" gene complement(975..1478) /locus_tag="DP116_25405" CDS complement(975..1478) /locus_tag="DP116_25405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319292.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-binding protein" /protein_id="PRJNA477356:DP116_25405" /translation="MSIRLYIGNLPKEEIDRQELQAVFAAEGDAVTTKLIKDRKTGKC RGFGFLTVNNDEQADQIIEKYNGQLFKDTAIKLEKALPRTKAEENSEEQPIKVISHPT TSNSSPTPAKENNRREKNSKKSRRGGGGREHTPAAASTDDAIRPDPRWANELEKLKQM LAAQTTT" gene 1935..2117 /locus_tag="DP116_25410" CDS 1935..2117 /locus_tag="DP116_25410" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25410" /translation="MALSAALITKLESHGISAFAVNVKNNNILLRIVQVLFTNRCANL LSVEYKVKSPVTNDSK" gene complement(2139..4316) /locus_tag="DP116_25415" CDS complement(2139..4316) /locus_tag="DP116_25415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879119.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Zn-dependent protease with chaperone function" /protein_id="PRJNA477356:DP116_25415" /translation="MPSHPKLSLEAGLSAIKQGDFQSAIAILEALTQTLANSNIFLQA QIGLIVAYAKSGNIPRAIALCETLTHSQNSEVQRWANRTLEQLTLPRKDSKDFTSLVP FEQNTRITGQKPTQRNNDPNNETSLTPSLTPSLTPSPSPTNPEKSKPLTIHWRQAPRA KTWIQQQKLNLIPFRLLMLGTFIALFWVIRELVVLVMGFINTILVVVPYLQPIQLLYA NPTLLLLLVLLILIGVSPRLLDRILTNFYGLQKLEKDTLNRYSKEAVRVLERRCQQQG WQLPKLRVLPLAAPLALTYGNLPRTARIVVSQGLLEQLCDDEIATIYASQLGHMTHWD FAVMSLVLLVTIPIYKAYQYFSEWGDRTTARIWRWVMAVMASIAYVVWCLLTGTGLWL SQLRLYYSDRLAVDMTGNPNGLVRALLKIAIGIADDIQKQEHTSWQLESLNILAPVGY QQSICLGSIAHNTTFESFLMWDYLNPYRWWLVINNTHPLIGDRLQRLLSIARDWHLET ELNLEKQQPLRVKRQSFFIQVAPFLGIPIGSVFAIVIWLVWQTAYALKFLNLKWIYDN WSFMTGCWLIGFSIGTLIRINSFFPDIKPSTTQTDDRLPNLLSNPAALPSDSINVRLV GKLLGRRGTSNSLGQDFILQSSTASVRLHYLQWLGQQQNPQNLIGRQITVTGWLRRGA TPWIDVQTLQTHSGTTIYSPHPIWSTVLAVVTEAVGAYILLKG" assembly_gap 4348..4357 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 4894..5199 /gene="ureA" /locus_tag="DP116_25420" CDS 4894..5199 /gene="ureA" /locus_tag="DP116_25420" /EC_number="3.5.1.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006457638.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="urease subunit gamma" /protein_id="PRJNA477356:DP116_25420" /translation="MQLSPQEKDKLLIFTAALVAERRKDKGLKLNYPEAVAYISAAIL EGAREGKTVSQLMSEGGKLLKRDDVMDGIPEMIHEVQVEATFPDGTKLVTVHNPIQD" gene 5688..6029 /locus_tag="DP116_25425" CDS 5688..6029 /locus_tag="DP116_25425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015143349.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="urease subunit beta" /protein_id="PRJNA477356:DP116_25425" /translation="MTPGQIIVTKSEPIELNAGREITKIRVSNQGDRPIQVGSHFHFY EVNDGSQDKKGLQFDRNKAYGKRLNIPAGTAIRFEPGDEKEVELVPFGGSREIYGFNG LVNGSLEKQKK" gene complement(6194..7207) /locus_tag="DP116_25430" CDS complement(6194..7207) /locus_tag="DP116_25430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874433.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25430" /translation="MDNTLQQIVEQVEPKEHNQTDTHFANASPTIQKPLIIAGTSDGL LFLDSRQYVELEGHSITALASSPDGLWVIVDRNSVWHRNPEGEWHQIASIDDLQLNCI LPYNGAVLVGTSQAHLIRIADGSTNRINCFETAEGRDEWYTPWGGPPDVRSMAIGQSG ELYVNVHVGGILRSDDQGKSWQPTIDFHADVHEVRTVPSHPDLVLAATAKGLAVSKDR GNSWRFSHENLHAAYARAIAVCEETILMTASTGPSGHKAAIYRHPLDKPGAFEKCERG LPEWFNNNINTGTVATTGNLAAFGTSDGQVFFSDDAGITWKQIAADLASICCVNLMAT SKT" BASE COUNT 2292 a 1586 c 1608 g 2147 t 10 others ORIGIN 1 gttacttcac ttggggagac cccaagaccg tactacctca ccgcactggc tcctcacttg 61 gcgactggcg ctcttgggtc aggcgaaagc aggcataccc gtaagggtca tgagtagcaa 121 tgactgatag ctaataacca tgacgaaaaa cgacggttaa tcttccccat aaaaaaatga 181 gtttacccgt cgtttttggt aaatttaacg agatattaag aaaaatttga gtttatttgc 241 aacagatgag attctcaatc aatggcaaga cgtgaaataa caagaacaag cctgaaagag 301 agtaagtaga cagggggaaa taaattaagc tatgtaaaaa attgtcaaaa cgcctcaaac 361 ctttaccaat gactaatgac tgccttaacg agtgagtttt attagtactg acctacttat 421 ctcaacaaat ggtaagttaa ttgacgacat aagcaatctg acaaaatgat tcaagacatt 481 agtctctagc acactcacaa gagatactct atgcaattag aatttgttcc cgttgaagaa 541 ttttattttg tcctaacttt agcggttcgg actctggaag aattggcaga accagatctt 601 gtcgaacaag tgcgatcgcg actgttggta gaatgtggac aaccctctac tgtcgctcct 661 ggtaaacaaa atactttcaa ctatgtattc cgtgtacaag gtgtagacaa tagccctgca 721 tctggactta tcgtttccat ttcagattgg caagacaagc tgcgcctgag tagcgactat 781 ggatggacgc tagacgaaca acgcaagcct attcgtactg aaaagtatag 
ccagcgatca 841 catttcgccc aacagttgcg atcgcactta caactatggt tacagcttcc cattaatacc 901 taagggatat atagactaaa aattaaaaat caaactaact tgatttttaa tttttaatct 961 ctccttgttg aagtttaagt cgtcgtttgt gcggcaagca tttgcttgag tttttccaac 1021 tcgttagccc atctgggatc tggacgaatt gcatcatccg tgctggcagc tgctggtgtg 1081 tgttctcttc caccgccacc acgacgagac ttcttggagt ttttctccct acggttattt 1141 tcttttgcag gagtgggact agagttacta gttgtaggat gactaatgac tttaattggt 1201 tgctcctcac tattttcctc tgctttcgta cgaggtaatg ctttctcgag tttgatcgca 1261 gtgtctttga acaactgacc attgtacttt tcaataattt ggtctgcttg ttcgtcattg 1321 ttgactgtga gaaaaccaaa accacgacac ttgccggttt ttcggtcttt aatcagttta 1381 gtagtaacag catcgccttc tgctgcaaaa actgcttgca gttcttgacg atctatttct 1441 tctttgggca aattgcctat gtataggcga atggacatga actagacctc cacaggttga 1501 gtgaaaacga ccaggcaaaa aacagtagct tggctctggc tttcatatag atattgttga 1561 ccaattctgc cacaaaccaa tttttctctc cgaactgtta acctatacaa tacagttttt 1621 taccaggata aagcttgctt aatacaaaag cacacaactc taacaacacc tttattctaa 1681 gaattttgaa ttttattgcg tactgcttat aactcgaaag ttcttctaca ctttgttgtt 1741 ccaaacaatt aaacactctt ccctagatta tcacgctatt ttacacgcaa ccagttcata 1801 ttagacattt ctagcaatac cataattgca tagtagcata aaatacactt tttctacagt 1861 tttatgtcag agaaacaaaa accagccagt tactggctga ttaactgctg tgttggggag 1921 aaaaagggta tgtcttggcg ctatcagcag cactgattac caagcttgaa agccacggta 1981 tatcagcttt tgctgttaat gttaaaaata ataacatttt gttgagaatt gtgcaagttt 2041 tgtttacaaa tcgctgtgcg aacttattga gtgtagagta caaagtcaaa tcaccagtga 2101 ctaatgactc taaatgacta atgactaatg actcatgact aaccttttaa caaaatgtat 2161 gctccaaccg cctccgtgac aacggctaaa accgtagacc aaatgggatg aggactgtag 2221 atggttgtac cgctatgagt ttgcagagtt tggacgtcaa tccaaggtgt tgctcctcgc 2281 cgtaaccaac ctgtcaccgt aatttgccga ccaatcaaat tttgaggatt ttgctgttgt 2341 cccagccact gtaggtaatg taatctcacc gaagctgtac tggattgcag gatgaaatct 2401 tgtcccaacg agttacttgt acctcgccga cctaatagct taccgacaag acgcacgttg 2461 atactatcag atgggagggc tgctgggtta gacaagaggt taggcaatcg gtcatctgtt 
2521 tgggtagtgc taggtttaat atctggaaaa aaagaattga tccggattaa tgtaccgatg 2581 ctaaatccaa tcagccagca acctgtcata aaagaccaat tgtcgtatat ccactttaag 2641 ttcaaaaact taagtgcata tgccgtttgc caaacaagcc agatcacaat cgcaaacaca 2701 ctacctatag gaattcctaa aaagggagca acttgaataa aaaaagattg acgcttcacc 2761 ctcaaaggtt gttgtttttc caaattcaat tctgtttcta aatgccaatc gcgggctatt 2821 gataacagac gttgcaggcg atcgcctatc aacggatgag tgttgttgat caccaaccac 2881 cagcgatagg gattgagata gtcccacatc aaaaacgatt caaaagtcgt gttatgagca 2941 atactaccca aacaaatact ttgctggtag cctacaggcg ctagtatatt taagctttct 3001 aactgccaac tggtgtgttc ttgtttttgg atatcatcag caataccaat agcaattttg 3061 agtaaagcac ggacaagacc gttaggatta ccagtcatgt caacagccag gcgatcgctg 3121 taataaaggc gcagttgaga caaccacaat ccagtcccag tcagcaaaca ccaaacgaca 3181 taggcgatac ttgccataac agccataacc cagcgccaaa ttcgtgctgt tgttctgtct 3241 ccccattccg aaaaatactg gtatgcttta tatattggga tcgtgactag tagcaccaaa 3301 gacatgacag caaaatccca atgagtcata tgccctagtt ggctggcgta tattgtggct 3361 atctcatcat cacaaagttg ctctaacagc ccctgactca ccacaattcg cgcagtacgg 3421 ggtaaattcc cataagtcag cgctaaaggt gctgctaatg gcaaaacacg cagttttggt 3481 aactgccaac cttgctgttg acaacggcgt tctagtaccc ggacagcttc tttactatac 3541 ctatttaacg tgtccttctc cagtttttgc aaaccataga aatttgtgag tattcggtct 3601 agcaacctgg gagaaacacc tatcaagatg agtaggacaa gcagcaacaa taaagtggga 3661 ttagcgtata aaagctgtat tggctgtaaa tatggaacta ctacaagaat agtgtttata 3721 aatcccatca ccaacacgac taattcccgt atcacccaaa ataaagcaat aaatgttcct 3781 agcatcagca gccgaaacgg aattaagttc agcttttgct gttgtatcca agtctttgct 3841 cttggtgctt gtcgccaatg aattgtcagt ggtttagatt tttcagggtt cgtgggagaa 3901 ggtgagggag tgagggaggg agtgagggag ggagttaaag aagtttcgtt gtttgggtca 3961 ttatttctct gtgttggttt ttgtcctgtg atcctcgtgt tctgctcaaa aggtaccaaa 4021 ctagtaaaat ctttactatc tttacgagga agtgtcaatt gctctaaagt gcgatttgcc 4081 caccgttgta cttcggaatt ttgactgtga gtgagagttt cgcataaagc gatcgccctt 4141 gggatattac cgctttttgc ataagctaca attaacccga tttgtgcttg taaaaaaatg 4201 
ttactattgg caagtgtttg ggttagtgct tctagaatgg caatcgcact ttggaaatct 4261 ccctgcttaa ttgctgataa ccctgcctcc aaggataatt tgggatgtga aggcataaaa 4321 atcctctaac tctgatattg tttaacannn nnnnnnntct ttcggaaatt caaagtatgc 4381 cctatgccta gtgggaacga ccacggctat caatgaaaaa tcttttttgg aattttgaat 4441 gaatgagcac attttgaatt gaagcgtttg cgcagcgcca aagtctgccg gggggagtat 4501 ccccccggcg agctttgccc cttaggggct agggagtggc ggaagaagca ccaagggatg 4561 actcagtttt ttcgtttttt catccactta actcattcca cggaaaatag tttttttcat 4621 ccgtttgtcc aaagtcaaaa gtaataccag taactagtga ctaatgggac aagtatcgtc 4681 gcatccccta ctggggtaaa attggttaat cagacaagtg taagtagttt aggaaagaat 4741 gtttacttcg gtagcccaaa atttttcaaa tacttttgta ggaacgctga aaataaaatc 4801 accactgcaa agaatactga taaaattttt gacgacataa gaaaattgag ttgaggaaat 4861 acaaactaca agtgtcatta aaggaaaatt tatatgcaac tatcacctca agaaaaagat 4921 aaactgctca ttttcacggc ggctttagta gcagaaagac gtaaggacaa ggggttaaag 4981 ctaaattatc cggaagcagt ggcgtatata tctgctgcta ttttagaagg tgccagagaa 5041 ggaaaaactg tcagtcaatt aatgagtgag ggtggaaagc tgctgaaaag ggatgatgtt 5101 atggatggaa taccagaaat gatccatgaa gtgcaagtag aagcaacttt tcctgatgga 5161 acgaaattag taacggtaca taatccaatt caagactaaa atctctgaag aaatacaggg 5221 aacagggaac tcttaacagg gaacagtgaa actgataact gataactgat aactgataac 5281 tgttttaaga agggagatgg tgtttctttc attcgtagcg ggtggtgtgc cgttatagtg 5341 cgcctgctcc taaaaaagat gcagaatttt agataatgaa ttgacaagaa gggaaattct 5401 aaatgtatac aactaacaaa gtggtgcttt atgaagctcc taactacaag ggaaatcaga 5461 aagaacttgg ggaaggagaa tataatattt acgatttggg aattggtgat aataagctga 5521 gttccttaac agtgccagca ggcatgaaag tcaccatata cgagtacgag gaatttcgag 5581 gtcggagtaa aacctttact agtaacgtcg gtgacctgag aaatgttaaa gttgagggta 5641 aaaactttaa caacgaggct tcctcgatta aagtcgaaaa aatagtaatg actccaggac 5701 aaataatagt tactaaatca gaaccaattg aactcaatgc agggcgagag attaccaaaa 5761 ttagagttag taatcaaggc gatcgcccaa ttcaagttgg ttctcatttc cacttttatg 5821 aagttaatga tggcagtcaa gacaagaaag gtctacaatt tgaccgcaat aaagcttacg 5881 gtaagcgctt 
aaatattcca gcggggacag caatcagatt tgaaccagga gatgaaaaag 5941 aagttgagtt agtcccattc ggaggtagcc gtgaaatcta tggtttcaat ggattagtga 6001 atggatcttt agagaaacag aaaaaatgat tgtagaacgg gtgagtaaat cttaagtatc 6061 gactgtcttg aaagccaagc gattgcaaag aagcaagaga cgccgaaggc gtctgcacct 6121 tgttttttgt gacaaatcat atcgaaatat ccgatactgt aatccactgg gtctttaatt 6181 tttctgagat aatttatgtc ttggaagtcg ccataagatt aacacagcag attgaagcca 6241 aatccgctgc aatttgcttc cacgtaatcc cagcatcgtc cgagaaaaaa acttgaccgt 6301 cactagttcc gaaagcagct aaattccctg tagtagcaac ggttcccgtg ttgatattgt 6361 tattgaacca ctctggcaac cctcgctcac atttctcgaa agctcctggc ttgtcgagtg 6421 ggtgtcgata gatggctgct ttgtgaccac tcggaccagt agaagcagtc atcaaaatgg 6481 tttcttcaca cacagctatg gctcgtgcgt aagcagcatg taggttttca tgtgaaaacc 6541 tccaagaatt ccctctatct ttactcacag ccagcccctt cgccgtagcc gccagcacaa 6601 gatctggatg agatggaaca gtccgcacct catggacatc tgcatgaaag tcgatagtgg 6661 gttgccatga cttaccctgg tcatcagagc gcaaaatgcc tccaacgtgc acattcacat 6721 agagttcacc agattgacct atagccatcg aacgaacatc tggaggacca ccccacggcg 6781 tataccactc atctcgtccc tcagccgtct caaagcaatt gatgcgattc gtgcttccgt 6841 ctgcaatgcg gatgagatga gcttgtgatg tcccaactag aactgcgcca ttgtaaggca 6901 agatgcaatt tagctgtaaa tcatcaattg aagctatttg atgccattcg ccttctgggt 6961 tgcgatgcca taccgagtta cgatcaacta tgacccacaa cccatccgga cttgaggcaa 7021 gagcagtgat agaatggcct tcaagttcaa catattgcct agaatcgaga aacagcaacc 7081 catcagatgt tcctgctatt attaggggtt tttgtatcgt tggggaggca ttggcaaaat 7141 gtgtatctgt ttggttgtgc tcttttggct caacttgctc gactatttgc tgtagagtat 7201 tgtccattgt ttgaacctcc aaacctcatc ggtgctgcca ttatgagcat ctgtcgtagt 7261 gacactccta caccgaccct tcgacttcgc tcagggtaca gtgtaggctt ctggcagatt 7321 ttatctttcc agaacttcgt ttgtggtact ccaactgttt cccactaccg cagtttatag 7381 tcaggtacgt gtcctgcccg acttgaatgg aagctaaatg gctcagtcct tcagcacccg 7441 cttcttaatc tactgtactg aattcgttca accctgaact cagtcataag catgaggata 7501 agtgtccatc atttcgagca gtgatgtcgt aacggtctta ttcaaacaaa tcacattgcc 7561 tgcgctattt tagtctggat 
tcgacttaac tcacttcttg cacctaaccg aataaatccg 7621 taaaaggtga ataaagagcg caa // LOCUS NODE_4182_length_7598_cov_4.980379 7598 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7598) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7598) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7598 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 51..698 /locus_tag="DP116_25435" CDS 51..698 /locus_tag="DP116_25435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139947.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25435" /translation="MLGIVGLSGGINSANAQQDLPVCQPPNSGEYLLLVSSPTAENQK QLRQALPSNTNPSTCRYLKNTVTRIGGFNKIDDANSWARYIKNIVGLSAIVTTRPSSE VAQKPPTVTQKPPTVTQKPPTQTVSYKPERLGEGLAVLVDYYNRPEMANKVREVVRGD VGFVSYGERPYLLAVYTRNQKEAYSTLQKLSERGFFAVVVDSRKVMLLRSSVRLQ" gene complement(733..1206) /locus_tag="DP116_25440" CDS complement(733..1206) /locus_tag="DP116_25440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317432.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="divergent PAP2 family protein" /protein_id="PRJNA477356:DP116_25440" /translation="MQDIGDILDNRVLWVAVGACLMAQALKLVIELLKNRKLNINVLV TTGGMPSAHSALVTALATGVGQTLGWASPEFAVAMVFAIIVMYDAAGVRQAAGKQARI LNQMIDELFHEHPEFDGDRLKELLGHTPVQVIVGSALGITISWLAKSTGVLVVSP" gene complement(1334..2260) /locus_tag="DP116_25445" CDS complement(1334..2260) /locus_tag="DP116_25445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994390.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="polyprenyl synthetase family protein" /protein_id="PRJNA477356:DP116_25445" /translation="MVAADNIQKTPEFVKFDLRAYLKERQKLCEAALDKSISIRYPEK IYEAMRYSLMAGGKRLRPILCLATCEMMGGTIDMAMATACAVEMIHTMSLIHDDLPAM DNDDYRRGKLTNHKVYGDDIAILAGDGLLAYAFEYVVTHTHNVPLERITQVIARLGHA TGATGLVGGQVLDLESEGKTDISLETLNFIHRHKTGALLEACVVCGGIIAGASSEDVQ RLSRYSQNIGLAFQIIDDILDITQTDEELGKTAGKDQQAQKVTYPSLWGLEESKQKAQ ELVEDACAELEPFGDKALPLKELAHFITSRKN" gene complement(2475..3353) /locus_tag="DP116_25450" CDS complement(2475..3353) /locus_tag="DP116_25450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012410125.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD" /protein_id="PRJNA477356:DP116_25450" /translation="METQTAKLLDGKALANKIQQELSAAIKELQPKIGRPPGLAVLMV GDNPASAAYVRGKERACQKVGIDSFGKHFPTETTQAELEEVIAALNNDERVDGILVQL PLPNHLDAVALLHQIVPEKDADGLHPINLGRLVRGEPGLRSCTPAGVMRLLQEYAIPL RGKHAVVVGRSILVGKPLALMLLEADATVSVAHSRSHSLQSITTNADILIAAAGRPGL ITADMVKPGAVVVDVGINRVTDSSGESRLVGDVHFESIAGIAEFITPVPGGVGPMTVA ILLQNTVNSYSQRSHR" gene 3632..4309 /locus_tag="DP116_25455" CDS 3632..4309 /locus_tag="DP116_25455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495083.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TrkA family potassium uptake protein" /protein_id="PRJNA477356:DP116_25455" /translation="MYVLIGGAGLVGLSLAQKLVELGHTISVIDIDPNACRYAREQVG AMAFEGSAVSTEVLLEAGIRKADALAAMLRSDALNLAMVTLAKHYGVPHILSRMRHPD FVEPLRLAGANHIISTVDLAVSTMVNAIEYPQVESMMHFEQGQIEVLKLSIPNNCYVV GRSVAEIAQDSRFPTGSLIIGYQPHPHEDLTIPNGSTVLEPDSTVLIVTKPGSLHQVI DFIEGCK" gene 4319..4891 /locus_tag="DP116_25460" CDS 4319..4891 /locus_tag="DP116_25460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868615.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25460" /translation="MKLYPLFYSAIALIVLTSCTSPEIPPQSQADTKGENQSQKVVNL DENFKTVMGQTIYVPVYSHIYHGDQKKIFNLAATLSIRNTDLTKPIIITSVRYYDSNG KLLKQYLERPIQLAALASTDFVVDRTDTSGGVGANFIVEWVAQTKVFQPVVEAVMIGT DFQQGISWISPGKVIKSQSNNKGSSSVQGS" gene complement(4898..5863) /locus_tag="DP116_25465" CDS complement(4898..5863) /locus_tag="DP116_25465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013320686.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25465" /translation="MRSDEPLSSILLTVAVALGAFQIGHSFSVSGVVAVVVAGLIVGS IGLSSQVSASTRLPLLSFWQSIAFGVNSFIFLLVGIEVDLITLWRTLPAVLLAIVAYQ VGRILCVYPLLAILRWFDRPIPWRWQHVLFLGNIKGSLSMGLALSIPTTVTGREQIIA IVFGTVLLSLLGQGVSLPWIVKRLRLSQFSESRQQVEELQAQLITGKAAQDELDSLLK SGILPKAIYEEMRSAYQVRVAGAEKALRELYNRRPDEFDLKSDGHSKLEAIRRRLLLA EKGALNEAIRKQILSEEIVRSAASSSASLSRLQKIDEQLLTLEDD" gene 5983..6390 /locus_tag="DP116_25470" CDS 5983..6390 /locus_tag="DP116_25470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25470" /translation="MQKVKNLNFSFLYNHNSVVVLTRSGLTLFLLVGLLSCGNLGYLG INAIGANITQIQDLKAQQEDGKTLYVQGKVEKRVPLLNRYAYQINDSTGKIWVVTNQT NLQQGVQVVIKGKVRYQSIPLGDKEYGEFYIDE" gene complement(6477..7376) /locus_tag="DP116_25475" CDS complement(6477..7376) /locus_tag="DP116_25475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323612.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="undecaprenyl-diphosphatase" /protein_id="PRJNA477356:DP116_25475" /translation="MLSLIVATTGCTNGSEVNLDFTNLSWIDVIILGIVQGITELLPI SSTAHMRIVPTLLGLQDPGSAFSAAMQLASLTAVLSYFWKDLKKLTGETVRAISGQDY QSSSFRLMLGLLLGTLPIIVGGLLLKPILNACDSPMRSLVVIGAASIVMSLLLGIAEK RGGRERNFEKITLWDGIWVGVAQAFALIPGVSRSGSTLTAGLFLGMERETAAKFSFLL GLPAVVLAGAVELHTLFKAGLSGAGWLILLIGLTSASISAFIAIWGLLRYLEKHSTLI FIFYRFVMGVFLIVAVISGWLPN" BASE COUNT 2233 a 1706 c 1558 g 2101 t ORIGIN 1 agactgttgt tcagaaattt cagatacaaa atactttagt cagtagttta atgctaggta 61 ttgtcgggtt gagtggtggc ataaattctg ctaatgctca acaagatctt cctgtctgtc 121 aacctcctaa ttccggtgag tatcttttgt tggtcagtag tcccacagca gagaatcaga 181 agcaattacg tcaagcttta ccgtccaata ctaacccaag cacttgtaga tatctcaaaa 241 atacagtcac acgaattggt gggtttaaca agatagatga tgctaacagt tgggcgaggt 301 atatcaagaa tatagttggg ttatctgcta ttgtcaccac tcgaccgtca tcagaagtgg 361 cacagaaacc gcctacggtg acacagaaac cgcctacggt gacacagaaa ccgcctactc 421 agacagtaag ctataaacct gaacgtttgg gagagggttt ggctgtatta gtagattact 481 ataaccgccc tgaaatggcg aacaaagtac gggaagttgt tagaggagac gtaggttttg 541 tttcctacgg agaacgtcct tatttgctgg ctgtttacac gaggaatcaa aaagaagcat 601 acagcacact gcaaaagctg agcgaacgag gtttttttgc cgtagttgta gatagtcgca 661 aagtgatgct actgcgctca tctgtgcgtt tgcagtagcc ctcagcaaaa tacagtcagg 721 ttttttgatt gattatggac tgacaaccaa cacgccggtt gacttagcca accacgagat 781 ggttataccc aacgccgaac caacaatcac ctgaacaggt gtgtgtccca gtaattcttt 841 gaggcgatcg ccatcaaact ctgggtgttc gtgaaacaat tcgtcaatca tctgattaag 901 gatacgggct tgctttccag cagcttgacg aactccagca gcgtcataca tgactataat 961 ggcaaaaacc attgccactg caaattctgg agaagcccaa ccaagagttt gcccgactcc 1021 tgttgcaaga gctgttacca gtgcagaatg agcactaggc atacctccag ttgttactaa 1081 aacatttata ttcagtttac gattcttgag tagctcgatg acaagcttta atgcttgagc 1141 catcaaacaa gctcctacag caacccacaa cactcggttg tctaatatgt cgcctatgtc 1201 ctgcatgcta ttttgataag atgagaagtt atgaattctg aatgatgaat tctcaacaga 1261 gatatgattt atgaatgata aattatataa tgtaaagttc 
ccctgcttgt atcattcatc 1321 attcattgtt agtttaattt ttacgactgg tgataaagtg tgctaattcc tttaggggca 1381 aagctttatc tccaaaaggt tctagttctg cacaagcatc ctctacaagt tcttgagcct 1441 tttgttttga ttcttcgagt ccccaaagac taggatacgt cactttttga gcctgttggt 1501 cttttccagc agttttgccc aattcctcat cggtttgagt gatatctaga atgtcatcaa 1561 taatttggaa agcaagccca atgttttggg aatagcgaga caatcgctgt acatcttctg 1621 atgatgctcc tgctattatg ccaccacaga cgacacaagc ttctaaaaga gcaccggttt 1681 tgtgcctatg aatgaaattc aacgtttcta aggagatatc tgtctttcct tctgattcta 1741 aatcaagcac ttgaccacca accaagcctg tcgctcccgt cgcatgaccc aaacgagcaa 1801 tgacctgtgt gattcgctct agtggtacat tatgagtatg agtgactaca tattcaaatg 1861 cgtaagctaa taagccatcc ccagccaaaa tcgcaatgtc gtcaccataa actttgtgat 1921 ttgtcaactt tccacgacgg taatcatcat tatccattgc tggcaagtcg tcgtgaatga 1981 gcgacattgt gtggatcatc tccacagcac aagcggttgc cattgccata tcaatggtac 2041 cacccatcat ctcacaggta gcaaggcaaa gaataggacg caaacgctta cctccagcca 2101 tgagcgagta gcgcattgct tcgtaaatct tttctggata acggatggaa atagatttat 2161 ccaaagcagc ttcacaaagc ttttgtcgct ctttgagata agctcgtaag tcaaatttga 2221 cgaactctgg agttttttga atgttatcag ctgctaccat tcctaaattc cttaatcttc 2281 tgttatatac gtaacaattt tacgagaaag agtcaccccc ttcgcttaat tcaaaattca 2341 aagttcaaaa ttcaaaatga agaattattt ttttgaattt ttaactttgc atttgagcgt 2401 gttggctgat tgagcggctc cctcacagga gcatcctggc tcatgcgcca ttagtcattg 2461 actatggact attcctaacg atgggagcgt tggctatagc tgtttaccgt attttgtagc 2521 aatatagcta cagtcatcgg accaacacca ccgggaactg gtgtgataaa ttccgcaata 2581 ccagcaatgg actcaaagtg gacatcacca accagacgac tctcaccgct tgaatctgtc 2641 acacgattta ttcccacatc taccacaaca gcgcccggtt tcaccatgtc agcagtaatg 2701 agtcctggac gacctgctgc tgcaattaga atatcagcat ttgtggtgat ggactgcaaa 2761 gagtgcgatc gcgaatgagc cacactcaca gtcgcatctg cctctagcag catcaaagcc 2821 agtggtttac ccaccaatat actgcgtcct accaccaccg catgctttcc cctcaaagga 2881 attgcgtatt cctgcaacag ccgcataact ccagctggag tgcagctacg taaaccaggt 2941 tctcctcgca ccagtcgccc taaattaatt gggtgcagtc cgtcggcgtc 
tttctctgga 3001 actatttgat gcaaaagcgc cacagcatcc aagtggttag gtaagggtaa ttgcacgaga 3061 ataccatcca ccctttcatc attattgagc gcagcaatga cttcttctag ttctgcttga 3121 gtggtttctg tgggaaaatg cttgccaaaa gagtcaatac caactttctg acaagctcgt 3181 tctttcccgc gtacgtaagc cgctgacgct ggattatcgc ccaccatcag cactgctaaa 3241 cccggcgggc gtcctatttt gggttgcaat tctttaatgg cagcagaaag ttcttgctga 3301 attttgttag ctaaagcttt accatcaaga agtttggctg tttgtgtttc cattaaaatt 3361 caagacaagt gcgctacgcc ttggcgttgc tgtctcatgc tttgttcaaa aaacagcaaa 3421 accacattta ttatcttctc agattggttg tctaagaaga ctttagaaat cagaattagt 3481 caatgttcct tgtaaaaatt aggcactgta caaattaact atcatatgca ttaggaaatt 3541 tggttgtttc cagtgttaac gataccaaac tgtttcacct gcatagtagc ttgtaaagta 3601 aagtagaagg caaaagtatc aaaaagcagt tatgtacgtg ctaattggtg gagcaggttt 3661 ggtggggcta agtttagcac agaaattggt agaactagga cataccatat ctgtgattga 3721 cattgaccct aatgcttgtc gttatgcccg tgaacaagtc ggagcaatgg cttttgaagg 3781 cagtgctgtt agtacagaag tgttgttaga agctgggatt cgtaaagccg acgctttggc 3841 agctatgcta cgaagcgatg ctttgaactt agcaatggta actcttgcca agcactatgg 3901 tgttcctcat attctgtctc gaatgcgcca tcccgatttt gtggaacccc tgcgtttagc 3961 tggagccaat catatcatca gtaccgtaga tttggcagtt tcaacaatgg tgaatgctat 4021 tgagtatccg caagtcgaat cgatgatgca ttttgaacag ggacagattg aggttttaaa 4081 actttctatt ccaaacaatt gctatgttgt cggtcgtagt gttgccgaaa ttgctcagga 4141 ttcccggttt cccactggct ctctcattat tggctatcag cctcatcctc acgaggattt 4201 aactattcct aacggtagta cagtactgga acccgattca actgtgctga ttgtgactaa 4261 accagggtca ttacatcaag tcattgattt tattgaaggt tgtaaatgat tcattctaat 4321 gaagctatat ccactttttt attcagcgat cgctcttatt gtactgacat cctgcacatc 4381 accagagatt ccaccgcagt cacaagctga tactaaaggg gaaaatcaat ctcaaaaggt 4441 ggtaaattta gatgaaaatt tcaagacagt gatgggtcaa actatctatg tccctgtcta 4501 ctcccacatc tatcacggcg atcagaaaaa aattttcaat ttagcggcga ctctcagtat 4561 tcgtaatact gatttgacca agcctatcat cattacttct gtgcgctact atgactccaa 4621 tggaaaacta cttaaacagt atttagagcg cccgattcaa cttgctgccc tagcttctac 
4681 ggattttgtt gtagatagaa ctgacactag tggaggtgta ggagctaact ttattgtgga 4741 gtgggtagcc caaacaaaag tttttcagcc ggtggttgaa gcagttatga ttggtactga 4801 cttccaacaa ggaatttcct ggatcagtcc tggcaaggtt attaaaagtc aaagtaataa 4861 taagggctca tcttctgtac agggttcttg aaagcgacta atcatcttct agtgtgagca 4921 attgctcatc aattttttga agccgcgata gcgaagcgct gctgcttgca gcagatcgca 4981 caatttcttc tgaaagaatt tgcttacgta ttgcctcgtt gagtgctcct ttctctgcca 5041 gtagcaaacg gcggcgaatg gcttcgagtt tgctgtgtcc atcactttta agatcaaatt 5101 catctgggcg gcgattgtac aattcccgca aagctttttc tgctcctgcg actcgcacct 5161 gataagctga gcgcatctct tcgtaaatgg cttttggtaa tattcctgac tttaacaaac 5221 tatctaattc atcttgcgct gccttacctg tgattaattg cgcttgtaat tcttcaacct 5281 gctgacgaga ttctgaaaat tgggacaagc gcaaacgttt cactatccac ggcaaactca 5341 ctccctgccc gagcaaggac agcaacactg tcccaaatac tatggcaatg atttgttctc 5401 gtcctgttac tgttgttggg atgctgagtg ctaaccccat agaaagagaa cctttgatat 5461 tgccaagaaa caagacgtgc tgccagcgcc atggaattgg ccggtcaaac caacgtaata 5521 tagctaacag cggatagacg caaagtatac gccccacttg ataagcaacg atcgccagca 5581 acactgcggg cagagttctc cagagagtga ttaagtcaac ttctatacct acgagtagga 5641 aaataaagct attgacacca aaagcgatag actgccaaaa gctgagcaac ggcaatcgag 5701 tagaagctga tacctgactc gaaagtccaa tacttccaac aattaacccc gcaacaacca 5761 cagctaccac accagaaacg ctgaaagagt gcccaatttg aaaagctcct aatgccacag 5821 ccaccgtcag taaaatacta ctgagcggtt catccgagcg aatttggagt tctttgttga 5881 gtgtttttga tactacaaga caaagtgttg atagttactc tctaaatgaa aattgtcaca 5941 agccaagcac tgatgccaaa attggctcaa tagacattga ttatgcaaaa agtgaaaaat 6001 ttaaacttct cgttcttata taatcacaac agtgttgtcg tactcacacg ctcgggatta 6061 acccttttcc ttctagtagg actactcagt tgtggtaact tagggtattt gggcataaat 6121 gccataggag ccaacatcac ccaaattcag gacttgaaag cacaacaaga ggatggtaaa 6181 acactttacg tacaaggtaa ggtagaaaag cgagttccgc ttctcaatcg atacgcgtat 6241 caaattaatg attccacagg aaaaatttgg gttgtcacca atcaaactaa tctacagcag 6301 ggagtacaag tggtgattaa gggtaaagtt cgctatcaaa gtattcctct aggagacaaa 6361 
gaatatgggg aattttacat agatgaatag tcataagtaa aacattatgc cagagaaact 6421 ctggaagctt ctagctcaaa gaagctccca gcaaaaccga gttaatcact aactaatcaa 6481 ttgggcaacc aacctgaaat cacagccact atcagaaaca cacccattac aaagcggtaa 6541 aaaataaaaa tcaaagtgct gtgtttttcc aagtagcgta agagtcccca aattgctatg 6601 aaagcagaaa tactggcaga ggttaagccg ataagaagta tcaaccaacc agccccactc 6661 aaccctgctt tgaaaagggt gtgcaattcc acagccccag ccaaaaccac tgctggtaag 6721 cccaacagaa aagaaaattt ggcggctgtc tcccgttcca ttcctaaaaa tagtccggcg 6781 gtgagggtgg aaccagaacg agaaacgcct ggaattaggg caaaggcttg agcaacccct 6841 acccaaattc catcccagag ggtaattttt tcaaagttac gctcacgtcc tccccgtttt 6901 tctgcaattc ccagcagcaa agacataaca atagacgctg caccaatgac aaccaaactt 6961 cgcatcgggg agtcacaagc gtttagaatt ggcttgagta gtaaacctcc aacaataatg 7021 ggtaaagttc ccagcaacaa gcccagcatc agccgaaaag aactcgattg atagtcttgc 7081 ccacttattg ctctaaccgt ttctcccgtc aattttttta ggtctttcca gaaataactt 7141 aaaacagctg ttaaactggc tagttgcatt gctgctgaaa aagcagagcc aggatcttgc 7201 agtcctaata aagtaggtac tatccgcata tgagcagtac tgcttatcgg taggagttct 7261 gtaatcccct ggacaatgcc taaaataatc acgtcaatcc agcttaagtt agtaaagtct 7321 agattcactt cactaccgtt ggtacagcct gtagtcgcca ctataagaga aagcataatt 7381 taactggttt aatgaatttg tttgcaaata agttacagac aaattaacct gtgcttatgt 7441 cgttgctatg actaatcaat ggcgacagta tgacatgtgg caaactagca ctttttttga 7501 ggttgaccgc atgagaatga cataatttac gtaacctgac agaattgaca ctcccctgcc 7561 taaaggcgag gggattctac attcatcgtc agaacttg // LOCUS NODE_4186_length_7593_cov_5.1382337593 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7593) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7593) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7593 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 206..430 /locus_tag="DP116_25480" CDS 206..430 
/locus_tag="DP116_25480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198155.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_25480" /translation="MNVEKLSISLPPSLVEFIENYKLNKGRKSRSQVIEEALELLRNR ELEEAYREASAEVDSDWDVTVADGLTDETW" gene 417..755 /locus_tag="DP116_25485" CDS 417..755 /locus_tag="DP116_25485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006615904.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system PemK/MazF family toxin" /protein_id="PRJNA477356:DP116_25485" /translation="MKRGDIYYANLSPAVGSEMDKRRPVLIVSNDANNRAATTVTILP ITSNVTRVYPFEVLLNPEDSGLTKPSKVQAQQVRTISKQRIIGEVYGSLNEEMMVLID AALKLHLGLG" gene 808..1074 /locus_tag="DP116_25490" CDS 808..1074 /locus_tag="DP116_25490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006623199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25490" /translation="MLLPPKLQNLPHSLPIEGAVRIELVEGIPIFRASSTVQNRIEEL LTKQHISPLTSDEEHELDLYEEIDDYLSFINRTVRNAFLTQTSV" gene 1084..1536 /locus_tag="DP116_25495" CDS 1084..1536 /locus_tag="DP116_25495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007355952.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HNH endonuclease" /protein_id="PRJNA477356:DP116_25495" /translation="MSARRQIPKDIQTLVRQRANCLCEYCHASEQWQYVEFTIEHIIP LTKNGADAVDNLALACFHCNRQKSNKTTAVDSDSGVEVPLFNPRIDIWSEHFIWSADK LYIIGLTPIGRATVAALALNRERVINIRAADKIIGRHPPVDDSIQAQN" gene complement(1666..2589) /locus_tag="DP116_25500" CDS complement(1666..2589) /locus_tag="DP116_25500" /EC_number="2.5.1.115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="homogentisate phytyltransferase" /protein_id="PRJNA477356:DP116_25500" /translation="MDQVSSRQFQGNWFYSFWKFSRPHTIIGTSLSVLGLYLIAVAVS ERNYSLFPLLGAWVACLCGNVYIVGLNQLEDVAIDKINKPHLPVAAGEFSLTTGRLIV GVTGVLALVVALLQGPFLLGMVAISLAIGTAYSLPPIRLKRFPFWAAVCIFSVRGAIV NLGLFLHYSWVFEQSSSVTAAVWVLTVFVLVFTFAIAIFKDIPDMEGDLLYNIKTLTL QIGKQAVFNLALWVITVCYLGMILVGVLRLAEVNSVFLVMTHLVALVLMWWRSRQVDL QEKSEIARFYQFIWKLFFLEYLIFPIACLLA" gene complement(2648..3490) /locus_tag="DP116_25505" CDS complement(2648..3490) /locus_tag="DP116_25505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_25505" /translation="MSPTLYQQIQQFYDASSSLWEQVWGEHMHHGYYGADGTQKKDRR QAQIDLIEEVIKWTGVQQAENILDVGCGIGGSSLYLAGKFNARTTGITLSPVQAARAT KRASEFGLSARSHFQVADAQAMPFTDNSFDLVWSLESGEHMPDKSKFLQECYRVLKPG GTLIMVTWCHRPVDNLLLTADEEKHLAEIYRVYCLPYVISLPEYEVIARELGLQNIRT ADWSKAVAPFWDVVIDSAFSPMAIWGLLCSGWGTIQAALSLGLMSRGYERGLIRFGLL CGKK" gene 3605..3811 /locus_tag="DP116_25510" CDS 3605..3811 /locus_tag="DP116_25510" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25510" /translation="MKRYRQWLLLAAIRILIAYLFEISYQLSVISEPALQEGFPTEAT GVSASARNHALAVRRAVGIRAASP" gene complement(4013..5209) /locus_tag="DP116_25515" CDS complement(4013..5209) /locus_tag="DP116_25515" /EC_number="2.4.99.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA" /protein_id="PRJNA477356:DP116_25515" /translation="MQQQPAKTNFNPTSSPDKEDLELDSSLTGYDYELPQELIAQNPV VPRDHSKLLVVNSRNTGIKAEPIHCIFRSLPEQLKPGDLLVMNNTKVYPARLHGRKST GAEIEVLLLEERQHNCWLALVKPGKRFTRGAKIIFEARGETNSSSSSPSSLTATVLEI DEATGGRLLQFDLPKGISLVHLLDVFGELPLPPYITASDALGEQYQTVYAKVPGAVAA PTAGLHFTPELLERLSDRGIHQAFVTLHVGVGTFRPVEVENVKTHKMHEEWISVPSST VEQIRATQASGGRIIAVGTTVARALEAAAVKGELQPFSGKTDLFIYPGYQWRVVEGLI TNFHLPRSSLLMLVSALIGRQRLLNIYQQAISSQYRFYSFGDAMLILPEATTGNREQG TGNREQ" gene 5446..5835 /locus_tag="DP116_25520" CDS 5446..5835 /locus_tag="DP116_25520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412259.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25520" /translation="MEYLYYLANASLTLRVVQFLHGKPQIPVSFVTVIHQIDGWVVRV KLKRHVSPQEDGDIRAFLSELGISYEPKMRVQMAFWSLEAGQCPVDVMRRYQVAIVSH GNPEKEEIEAFRQQFVRGLGYCPETLA" gene complement(5874..6437) /locus_tag="DP116_25525" CDS complement(5874..6437) /locus_tag="DP116_25525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126909.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="YraN family protein" /protein_id="PRJNA477356:DP116_25525" /translation="MADHSSYHYLDIGIAGEDLVAEWLQSNGWVILHRRWRYRNGEID IIAQYDGQQQPTKQGEKALQQGKSNTGNETTVVPSLSPSLTPLSPTLAFVEVKTRSRD NWDAGGRNAITKQKQTKLQQTSLMFLAKYPQKADYPCQFDVAIVYCQQISQGLAALTI SEEAITTVSVGGYLLMLEEYIPAAFDT" gene 6452..6967 /locus_tag="DP116_25530" CDS 6452..6967 /locus_tag="DP116_25530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319049.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_25530" /translation="MSKLSNRIWVSTLSLLLWVVICITASVGFASFAPEALALDYNKE TLIGSDFSGRDLTDSSFNQTNLRNSNFSHANLRGVSLFSAKLESANLEAADLTNAILD SARFTKANLTNAVLEGASAANARFDGAIIDGADFTDVLLRKYEQEKLCKVAQGTNPTT GHNTRDTLFCP" gene complement(7007..7423) /locus_tag="DP116_25535" CDS complement(7007..7423) /locus_tag="DP116_25535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319051.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="heavy metal-responsive transcriptional regulator" /protein_id="PRJNA477356:DP116_25535" /translation="MLIQEKKLFLIGQVTALSGVPIRTIRYYESLGLIQSIGRTEGGF RQFSSDVLTRLSFIKCAQSLGLSLQEIGEILQVYDGGKPACDQIHHKLKDKILEIDKQ IEQLLTLREELRGLLSGWNSLSTKPQDTICPIIQNN" BASE COUNT 2277 a 1727 c 1528 g 2061 t ORIGIN 1 cattgggatg ctcccgggag ggagggagtg agggaaggaa ggaggagtca cttttggcga 61 tcgcgtggag gaatcatttt tggcgttaag ctagctgcgc ctccggcgat cgcgcacacg 121 aatcattttt ggcgatgctc ccaaacggtt tgagattaag ctactctatt agtaggaata 181 atttttacta ctaatcagta ggaatatgaa tgtcgaaaaa ctatccattt ccttgccacc 241 atctttagtg gagtttatcg aaaattacaa acttaataaa ggacgcaagt ctcgttctca 301 agtgattgaa gaagcattag aattgctgcg aaaccgagaa ctagaggaag cttaccgaga 361 agcatctgct gaagttgaca gcgattggga tgtaacagta gcagatggtt tgacagatga 421 aacgtggtga tatttattat gcaaacctta gtcctgcggt gggttcagaa atggataaac 481 gccgtccagt gctaatagta agtaatgatg caaataatcg tgctgctacc acagtgacaa 541 ttttaccaat cacttctaat gttacccgtg tttatccgtt tgaggttttg ctcaatccag 601 aggatagtgg tttaacaaag ccttctaaag tacaggcaca gcaagtgcga acaatttcca 661 agcagcgaat tattggcgaa gtgtatggta gtttaaatga agaaatgatg gtattaattg 721 atgctgcatt gaaattgcat ttgggattgg gttgagatat gactataaac aagagcgatc 781 gctattagaa agacgaaaac tacgacaatg ttactcccac caaaattaca aaacttgccc 841 catagtctcc caattgaagg tgcagttcgc atcgaattag ttgagggaat tcctatcttt 901 cgagcttcta gcactgtaca aaatcgtatt gaagaactct tgacaaaaca acacatttct 961 ccactgactt cggatgaaga acacgaactc gacctttacg aagaaattga cgattactta 1021 agttttatta atcggactgt tcgtaatgct ttcttgacac aaacgagcgt ctgattttga 1081 tatatgtcag cacgtcgtca aattcccaaa gatattcaaa ccctagtacg ccagcgtgct 1141 aattgcctgt gtgaatattg tcatgcttct gagcaatggc agtatgtaga atttacgatt 1201 gagcatatta tcccattaac taaaaacgga gcagatgcag ttgataattt ggcgctagct 1261 tgctttcact gtaaccgtca aaaatcaaac aaaacgacag cggttgattc tgattcaggt 1321 gttgaagtac ctctatttaa tcccagaata gatatttgga gtgagcattt tatttggtca 1381 gcagataagc tatatattat tggtttgacc cctattggta 
gagcaacagt agccgcctta 1441 gcattgaacc gagaacgagt cattaatatt cgtgctgcag acaaaattat cggaagacat 1501 cccccagtag atgactcaat tcaagcacaa aattgaaata gcgagacaaa agcaagtagt 1561 gccaagacgt gtagacgcgc agttccaaag gagccgcccc aagggggtga tactcccaga 1621 gggagtcgcc caaggggcga tcgcacatac cctatcacca aatcttcaag ccaaaagaca 1681 agctataggg aaaatcagat actccaagaa aaacaatttc cagatgaact ggtaaaaacg 1741 agcaatttca cttttctctt gcaaatcaac ttgtctactc cgccaccaca tcaaaaccaa 1801 cgccacaaga tgagtcatca ccaaaaaaac agaattcacc tcagccagcc gcaacacacc 1861 aactaaaatc atccccaaat agcaaacagt aatcacccaa agagcaagat taaaaacagc 1921 ttgcttacca atttggagag tcaaagtttt gatattgtag agtaagtcac cttccatatc 1981 agggatgtct ttgaagatgg cgatcgcaaa agtaaaaacc aaaacaaaca ccgtcaacac 2041 ccataccgcc gctgtaactg acgaactctg ttcaaacacc caactatagt gcaaaaacag 2101 tcctaaattt acaatcgctc cgcgcaccga aaaaatacac accgccgccc aaaacggaaa 2161 tcttttcaaa cgaataggtg gcaaagaata agcagtacca atcgccaaac taatcgccac 2221 cattcccaat aaaaacggtc cttgaagcaa cgcaacaacc agcgccaaaa caccagtaac 2281 acccacaatc aatctgccag tcgttaacga aaactcccca gcagcaacag gcaaatgagg 2341 cttattaatc ttgtcaatcg ccacatcctc aagttgattc agccccacaa tatagacatt 2401 gccacacaaa caagcaaccc atgcacctaa aagaggaaat aaagaataat tgcgttcgct 2461 aacagcaaca gcaatcaaat acaaccccaa cacactcaaa ctcgtcccaa taatcgtatg 2521 cggacgcgaa aacttccaaa aactataaaa ccaattcccc tgaaactgtc ttgaagaaac 2581 ctgatccatt ctcttttctc ctcttggcgt ttttgtcttg gcgttcttgg cgtcttgttg 2641 ctaatcgtta cttcttccca cacaacaacc caaaccgaat caacccgcgt tcataacccc 2701 gactcatcag ccccaaagac agcgcagctt gaatagtacc ccaacccgaa cacagcaaac 2761 cccaaatcgc cattggagaa aacgccgaat caatcacaac atcccaaaaa ggagcaacag 2821 ccttcgacca atcagcagtc cgaatatttt gtaaccctaa ttcacgagca atcacctcat 2881 actcaggcaa agatatcaca taaggcaaac agtacacccg ataaatctct gctaagtgct 2941 tctcctcatc cgccgtcagt agtaaattat caactggtcg atgacaccaa gtcaccataa 3001 tcaacgttcc accaggtttg agcacacgat agcactcttg caagaacttg ctcttatccg 3061 gcatatgctc accagattct agtgaccata ctaaatcaaa agagttatca 
gtaaaaggca 3121 ttgcttgagc atcagccact tgaaaatgac ttcttgcact caacccaaac tcagaagcgc 3181 gttttgtcgc tcttgcagct tgtactggac tcagagtaat tcctgttgtt ctggcattaa 3241 attttcctgc cagatacaga gaactcccac caattccaca accgacatct aaaatattct 3301 cagcttgttg gactcctgtc cacttaatca cttcttcaat taaatcaatt tgcgcctgac 3361 gacggtcttt tttctgtgta ccatcagcac cgtagtaacc gtggtgcata tgttcgcccc 3421 aaacctgttc ccacagactg gaagaagcat cgtaaaactg ctgaatttgc tggtaaagtg 3481 ttggactcat aatggaagcg ataatgaaaa cataaataaa cacatacagt aggctaccat 3541 gtgtgttttt aattaaatta aggtgttttt gcttatggcg gaagactacc atctctctat 3601 aactatgaag cgttataggc aatggttgtt attggctgca ataaggatat taatcgccta 3661 cctatttgaa atcagttatc agttatcagt tatcagtgag ccagcgctgc aggagggttt 3721 cccgacagag gcgactggcg taagcgcaag cgcacgcaat catgcgttag ccgtaaggcg 3781 tgccgtaggc atacgcgcag cgtctccgta ggagatacgc gcagcgtgtc gctttgcgac 3841 tcagcgtctc cgttcggcac ggggccgtgg ccgtacccgc tagggctcag aagatacccg 3901 aagggtcatc agtcatcagt tatcagtcat caagcaattg ataactgttc acccaatggg 3961 agagccagtc gcctgggcgt gggaaaacgc cgaatgcgcg cttactcact gttcactgtt 4021 ccctgttccc tgttccctgt tccctattcc ctgttgtcgc ttctggcaag attaacattg 4081 catcaccaaa agaataaaag cgatattgag atgatatggc ttgctgatat atatttaaca 4141 aacgttgccg accaattaaa gcacttacca acatgagcaa actagaacgt ggtaggtgaa 4201 aattggtgat taaaccctct acaactcgcc attggtagcc tgggtagata aacaaatcgg 4261 ttttgccaga aaatggttgc aactcaccct tgactgcagc tgcttctaat gctcgtgcta 4321 ctgttgtacc tacagcaata attcgaccac cagatgcttg agttgcacgg atttgctcta 4381 ccgtggagga aggcactgaa atccattctt catgcatttt gtgggttttt acattctcca 4441 cttccactgg gcgaaatgtt cctacgccaa cgtgtaatgt gacaaaagct tgatgaattc 4501 cgcgatcgct caacctctct aataattctg gcgtaaagtg aagcccagct gtcggagctg 4561 caactgctcc tggaacttta gcataaactg tctgatactg ttcgccaagg gcatcagatg 4621 cagtgatgta aggtggtaga ggaagttcac caaatacatc caacaggtgt actaaagata 4681 tcccttttgg taaatcaaat tgcaacagac gccctccagt cgcctcgtct atttctaaga 4741 ctgtcgcagt gagggaggat ggggaagatg aggaagaatt ggtttctcct cttgcttcaa 
4801 aaataatctt cgctccccgt gtgaaacgtt ttcctggctt gactaaggct aaccaacaat 4861 tatgctggcg ttcttctaac aacaatactt ctatttccgc accagtagat ttacgaccat 4921 gcaagcgtgc tggataaact ttagtattat tcattaccag taaatcgcct ggtttgagtt 4981 gttctggcaa acttcggaaa atacagtgta taggttctgc ttttatacct gtattccggg 5041 aatttactac caaaagcttg gaatgatctc tgggaaccac tgggttttgg gctatgagtt 5101 cttgaggtaa ttcatagtca tacccagtta gagaactatc taattccaag tcttccttgt 5161 caggactaga tgttgggtta aaattggttt ttgctggttg ttgctgcatt tatatttcga 5221 ctgtctgaaa acttgtcttg gctttttttt atagagaatt ccgtgatccc cactgaattc 5281 tccttctata actttgtcat gaatgcaatt agctcacacg gaaacaactg tatgaaaatt 5341 gggtaaacct ccctatctaa aaacgggggc ttttacccta gccagcacta gggtctttat 5401 acggataata agagtgtagg attggctttg atcccaaaca caccgatgga atacttgtat 5461 tacctggcaa atgccagtct aactctaagg gttgttcaat ttttgcacgg taaaccccaa 5521 atacctgttt cgttcgtcac cgtcattcat cagattgatg gttgggtggt tagagtcaaa 5581 ctaaaacgtc acgtctcacc tcaagaagat ggggatattc gggctttcct cagtgaacta 5641 ggaattagct acgaaccaaa aatgcgagtg caaatggcgt tttggagttt ggaagcagga 5701 cagtgtccag ttgacgtgat gcgtcgctac caggtagcga tcgtctctca tggtaaccca 5761 gaaaaagaag aaatagaggc gttccggcaa cagtttgtca gaggtttggg ctattgtcca 5821 gaaacactgg cgtgatgcct ttgacttaca taacctgtta tatgtcagta tagttaagta 5881 tcaaaagctg ctggaatata ttcttctagc atcaacaaat atccacccac agatacagtg 5941 gttatagctt cttcagaaat tgtaagcgca gctaaccctt gtgatatttg ttgacagtag 6001 acaatagcaa catcaaattg acaagggtag tctgccttct gaggatattt agctaagaac 6061 attaaagacg tttgttggag ttttgtttgc ttctgttttg tgatggcgtt tcttccccca 6121 gcatcccaat tatctcggct gcgagttttc acttccacaa acgcgagtgt gggagataag 6181 ggagtgaggg aaggcgagag tgagggaaca acagttgttt cgttccctgt gttactcttt 6241 ccctgttgta gtgctttttc tccttgtttc gttggctgtt gctgtccatc atattgggca 6301 atgatatcaa tttctccgtt acggtaacgc catcgacggt gcaaaatgac ccaaccatta 6361 gattgtaacc attcagctac caggtcttct cctgcgatac ctatatcgag ataatgatat 6421 gaagaatggt ctgccattgg caaaaagtac aatgagtaag ttaagcaatc gaatttgggt 6481 
aagcacactc agtttattac tttgggttgt catttgcatc actgcatctg ttggttttgc 6541 aagttttgct ccagaagctt tagcacttga ctacaataaa gaaactttga tcgggtctga 6601 tttctcagga cgtgacttga cagactccag ctttaatcaa actaacctcc gcaacagtaa 6661 ctttagtcac gctaatttgc gcggcgtcag tctgttttct gcaaagctgg aatcagcaaa 6721 tcttgaagca gctgacctga caaatgctat tttagactca gctcgtttca ctaaggcaaa 6781 tttgacaaat gcggtgttgg agggtgcttc tgctgctaac gccaggtttg atggtgctat 6841 tattgacgga gcagatttta ctgatgtgct gctgcgtaaa tatgagcaag agaaattgtg 6901 caaagtcgcc caaggtacca atccaaccac aggacacaac acgcgcgaca ccttattttg 6961 cccgtagctt aagctacacc accaccaaaa gcgtgaaatc tgttattcaa ttgttctgaa 7021 ttatgggaca gattgtatct tgcggctttg ttgataagct gttccacccg gacagtaatc 7081 ctctcaattc ctctcgcaag gttaataact gttcaatttg tttatcaatt tctaatattt 7141 tatctttcag cttgtgatga atttgatcac aggctggttt tcctccatca taaacttgaa 7201 gaatttcgcc gatttcttgc aagcttaaac ctaagctttg agcacattta ataaaagaaa 7261 gacgagtcaa cacatctgat gaaaactggc gaaatcctcc ttctgttcgt ccgatagact 7321 gaattaaacc caaactctcg taatagcgaa ttgtcctaat aggaactcca ctcaaggcag 7381 ttacctgacc aattaaaaac aactttttct cttgaattaa cacggtaaaa gaccctaaac 7441 ttacagtttt atctctcttt ttagatgtaa actaacaagc aaaaaaaact ttcataagta 7501 agtcaacatt aaaaaatata tattgcgatg gcaacgcctg ccgcttcatt tttcatagac 7561 aaggatggac aaccctccac aatgtctact cgt // LOCUS NODE_4201_length_7544_cov_4.8007747544 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7544) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7544) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7544 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 504..1484 /locus_tag="DP116_25540" CDS 504..1484 /locus_tag="DP116_25540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315028.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="lipolytic protein G-D-S-L family" /protein_id="PRJNA477356:DP116_25540" /translation="MRNLCLLTVGLLTGLVIPALTLVHLSNVSLKNSKVAGNLKYVSK ANLTKKIFSIAERNPPELSHSWLPKPIVSYSNLNTQVVSHKITSGNQLYHERLAALKA GQIYPSLPKQRTESSLISTTKTQLTYEDWKSLLAMEAKAMTQSQGDHRLGILVGDSLS LWFPQEKLPVDRLWLNQGISGDTCGGVLKRVSVFSETRPDFIYIMVGINDLRKGTTDE IILHNHRQIVRKLRQTHPKTIIFIQSILPTRLSAISNTRIRHLNDQLSLIARQEGVYY LNLYDWFADFQGNLRLELTTDGLHLNTEAYDVWRSAIDEAEHSTIRMAGS" gene complement(1704..3470) /locus_tag="DP116_25545" CDS complement(1704..3470) /locus_tag="DP116_25545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745568.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_25545" /translation="MLVTKTLQSGKYTINQEIGRGGFGITFKATHNFLNQVVVIKTLN EKLRQHPDYVKFERQFQDEARRLATCIHPNIVRVSDFFVEDRLPYMVMEYIPGETLAE AYVMPGIPLPEDKAIHYISQIAAALQVVHQNGLLHRDVKPDNIILREGTQEVILIDFG IAREFHNGVKQTHTGIVSEGYSPIEQYLSQATRSPATDVYGLAATLYALLTAQVPIPA LLRDREQMPSPRELQPHLSAAINQAIIRGMATDARFRPQSVQEWLLLLPDNRQSGVHQ SLVTSVVPTINLSNQPYPDVPDVPNVPQQVASVNTPTLPPSGNAKFKKILSSPKVLIA TGIALLGVTSGFGITSLLSKTQPSSSVPQSVVEKNDPNEEKPTVILTPSPVIKETNTV KETNTQQKEDNTSSQVKSAAASTSTRRRRIRRVSTEETPDSSNTTRQSYKRNATRESY RKNATTDSDSSDATRDSDSSNVTTESSQTYSRKSQRRRYQQDSQQEASGPKITSSPSL TERLRAIRESRKTSSESSGNNRPSSRGASESSQPVTQTNNSSVVVPEQPSVAEPKHFD SPDIVVPTQDKALENSQTDHNN" gene complement(3671..5749) /locus_tag="DP116_25550" CDS complement(3671..5749) /locus_tag="DP116_25550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315030.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="outer membrane protein assembly factor" /protein_id="PRJNA477356:DP116_25550" /translation="MRVSSAAIFTLVTLATGNAHQQAFAAPSNTSEQPEKPDNVVVPV TEETPARIETIAPPETVVIGQFSQKSGNVHGGSRKSVVIGTEKEKETREITFSSPSSP SSPSSPSSSPATGNDLVVTASEVNIVGANEELQQAIRSVINTQVGAQTSQSQLQKDVK AILETGLFTDARVNSRSTPTGLSVEYQVQPVVVRSLQLSGANVLSYEVARERFQPQIG KVISPSGLKQVVQDINKWYADKGYKVARVISIRPSAEGILTLNVAEGVVNNIKFRFVN EEGKTVDNKGKPVQGRTKTDFLQQQLKFKPGQVYQENLVKQDLQQLYKTGLFQNVNVA FEGDATKLDVVYELKEIGARSINVGGNYNADQGIVGTLTYRDQNVGGVNDLFGFNLEL GARDFQFDTKFTSPYRQTNPDAFGYSVSAFRKRGLSKTFDGDIKLANGDRVREGRVGG GVSLQRPIEGWNTSLGFNYSRVSLRDRDGKVTPTDQKGNPLSFSGTGIDDLTTVSLTA TKDERDNPFNPTNGSILKLSTEQSIPLGEGQISMNRVKADYSQFMPVKLFESKKPQVF ALNVQAGTVLGDLPPYESFNLGGVNSVRGYGEGDVASGRTYVLASAEYRFPIVQALGG VFFADFASDLGSGESVLGNPAGVRDKPGTGFGYGAGVRFDSPLGLIRADFGLNDQGES RVHFGVGHRF" gene 5905..7278 /locus_tag="DP116_25555" CDS 5905..7278 /locus_tag="DP116_25555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315031.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25555" /translation="MSDSSPSSLSSNIEKDTPNLRLLVMSNGHGEDIIAVRILQELQQ QPNPPEIFALPLVGQGYAYQGLDIRLIGSVRTMPSGGFIYMDGRQFVRDVNGGLLQLT VSQINSIRHWVNTQKKSGHRIAILAVGDIVPLIFAWLSGANYGFVGTAKSEYYVRDDV NLLKRKDKGAWWESFSGSIYHPWERWLMSQQRCKAVFPRDSLTAEILKKWPIPAFDLG NPMMDALEPTIPRPRFYGTHIEKQELARPLVVTLLPGSRPPEAYANWHQMMIAVSALM ALWKEQKFVEYRSETVVFLGAIAPSLHIESIRQILESQGWQPHAQPPIQIPDKDSLTF QQKNAFFILSQNAYNDCLHQGDLAIAMAGTATEQFVGLGKPAITIPGLGPQFTYAFAE AQSRHLGPSVILVEQPAEVAQVVKSLFANPDMLQAIAENGVRRMGKAGAAERIAEYSM KLLWNED" BASE COUNT 2154 a 1562 c 1618 g 2210 t ORIGIN 1 gttaacataa actgtaccta gctgaacaaa aatttttggg ggaattttat atttctataa 61 ttgacttgac aagtgatgat atcatattat tttcaaattt caaaaagctc tcacctatac 121 gatatagctt attcaaaggc gatgaacact tcagttgaaa ttttgggtga atgacttaag 181 tatgtgaaaa ggcgataggc tgaatggaat tcatgctgtg catacaagca ttgcatttta 241 attcctatta ccgcttcaca gaaagcagat ttttctcaga tgagcaacga tatttaatga 301 tgtgggcagt tttcgacaaa aatgaaaact gctaaaagca atatggaaaa aatttgaaga 361 gatggtaaac atcatcccgc aatgctttac agaaatatgt acagaaatga aaaataatga 421 aatagttatc taaaatttta ctttaattca acaagttaat gctgtcagtc agaaaaagcg 481 ctccggagag gataaatgta aagatgagaa atctttgtct gttaacagta ggtctgttga 541 caggattggt aataccagca ttaactcttg tgcatttgtc caatgtctct ttaaaaaata 601 gcaaagtcgc tgggaattta aaatatgttt caaaggcaaa tctcactaag aaaatattct 661 ctattgcaga aagaaatcca ccagagttga gtcactcttg gttgccaaag ccaatagtat 721 cgtatagcaa tctaaacact caagtagtat cccacaaaat tacatctgga aaccaacttt 781 accacgaaag attagccgca ctgaaagctg gtcagattta tccaagctta cctaaacaga 841 ggacagagtc atcgttgata tctacgacga agactcaact cacatatgag gattggaaaa 901 gcttattggc gatggaagcc aaagcaatga ctcaaagtca aggcgatcat cgtctgggaa 961 ttttggtggg tgattcttta agtttgtggt ttcctcaaga aaaactgcct gtagatagat 1021 tgtggctgaa tcagggaata tctggagata cttgcggtgg tgttttaaaa agggtgtccg 1081 tttttagtga aaccagaccg gattttattt atatcatggt tggaattaat gacttgcgaa 1141 agggtacgac agatgaaatc attttacaca 
atcaccgcca aattgttcgc aagttgcgac 1201 agactcaccc aaaaaccata atttttatcc aatcgatttt gccaactcgt ttatctgcaa 1261 tctccaatac ccgcattcgt cacctgaatg atcaactctc ccttattgct agacaagaag 1321 gagtttatta tttaaatctt tatgattggt ttgcagattt tcaaggtaac ttgcgtttag 1381 agttgactac agacgggctt catttgaaca cagaagccta cgatgtatgg cgatctgcaa 1441 tagatgaagc tgaacatagc acgataagaa tggctggaag ctgaatagaa atcgcaaaaa 1501 ttctatctta cgtttttcaa tgttgatctg acttgaaaat agggaacgct taacagggaa 1561 cgcttaacag ggagaaaatg tttcatcggt tctggtatat gcagttcatg atggctactt 1621 attgatgagt acacaatccc ctgccttttg gtgtaattat cgctatcgct tgagtacacg 1681 aaaaaccagg ggagtgtcaa gagctaatta ttatgatcag tttgtgaatt ttccaatgcc 1741 ttgtcttgtg taggaacgac aatatcagga gaatcaaagt gctttggttc cgcaacagat 1801 ggctgttctg gtacaacaac agaagaatta tttgtctggg tcactggttg cgatgattca 1861 gacgcaccac gagaagatgg tcggttattt cctgaagatt ctgaggaagt cttgcgagat 1921 tcgcggattg ctcgtagtcg ttctgtcagt gaaggtgatg aagtaatctt aggcccagaa 1981 gcttcctgtt gtgagtcttg ttggtatcgg cgacgttggg attttctgga gtaggtttga 2041 gacgattctg ttgtcacatt gctgctatcg gaatctcttg tcgcatcgct gctatcggaa 2101 tctgttgtcg cattcttcct ataggattct cttgtcgcat tcctcttata ggattgtctt 2161 gtcgtattgc tgctatcagg agtttcttca gttgaaacac gacggatacg ccgacggcgt 2221 gttgaagtag aagcagctgc agatttcact tgcgaggaag tattatcttc tttctgttga 2281 gtattagttt ctttaacagt attagtttct ttaatgacag gcgatggtgt taggatgact 2341 gtaggttttt cctcgttagg atcattcttt tcgacgactg attgtggaac tgatgagctt 2401 ggttgagttt tactcaacaa actagtgata ccaaaaccac tcgtaacacc caaaagcgct 2461 atacccgttg caatgagtac tttaggtgac gagagaattt tcttgaattt ggcattacct 2521 gaaggtggta gggttggcgt gttcacagat gcgacttgtt gtggcacatt tggcacatct 2581 ggcacgtctg ggtatggctg attagataaa ttgatagttg gcacgactga tgtcacgagt 2641 gattggtgta caccactttg cctattatca ggtaatagca gtaaccactc ttgcactgac 2701 tgaggacgaa agcgagcgtc agttgccata ccacgaatga ttgcttgatt gatagcagca 2761 cttaaatggg gttgtaactc tcggggggaa ggcatttgct cgcgatcgcg cagaagtgct 2821 ggtatcggca cttgtgctgt caacaatgca tacagtgttg 
ctgctaaacc ataaacatca 2881 gttgcaggac tgcgggttgc ttgcgacaga tactgctcaa ttggggaata accctcagaa 2941 acaattcccg tgtgagtttg cttgacgcca ttgtgaaatt ctcgtgcaat tccaaaatca 3001 atcagtatga cctcttgcgt tccttcacga agaatgatat tatccggttt cacatcccgg 3061 tgaagcaagc cattttgatg cacgacctgc aatgcagctg ctatttggct gatgtaatga 3121 attgctttat cctctggcaa aggtatccct ggcatgacat atgcctcagc tagagtctcg 3181 ccaggaatat attccatcac catgtagggt agtctgtctt cgacaaaaaa gtcgctcacc 3241 cggacaatat tcggatggat acaagttgct aatcgtcttg cctcatcttg aaattgacgc 3301 tcaaacttaa cataatcagg atgctgtcgc agcttttcat tgagagtttt aattaccacg 3361 acctgattga ggaaattgtg agttgctttg aacgtaatgc caaagccacc tcgaccgatt 3421 tcctggttta tggtatattt accactctgc aaagttttag taactagcat agagagtcct 3481 gattagtaaa tatgattact actgcttttc ttatagaaaa ccgggaactg taaaagttcc 3541 ggttttgact cctgagtcta acctgaattt tgaggaatat caaaccatgt tcaatactta 3601 gcccacagaa tagtcttatt tttcttcgtt attccttaat agtatactta tcacttataa 3661 ctcagcattt ctaaaagcgg tgaccgacac cgaaatgaac tcggctttct ccctggtcat 3721 tcagcccaaa atcagcccga atcaagccta gaggtgagtc aaagcggact cccgcaccat 3781 aaccaaaacc agtaccaggt ttatcacgaa ctcctgcagg gttacccaac acagactcac 3841 cagaacctaa gtctgaggca aagtcagcaa agaagacacc tcccaatgct tggacaatcg 3901 gaaagcgata ttctgccgac gccaacacat aagtgcgacc actagcaaca tcaccctcac 3961 cataaccgcg tactgaattg acaccaccta agttgaaact ttcataaggt ggtaaatcac 4021 caaggactgt accagcttgg acattcaagg cgaagacttg tggttttttg gattcaaata 4081 actttactgg cataaactgg ctatagtctg cctttacgcg attcatagaa atttgtcctt 4141 ccccaagagg aatggattgt tcggtactga gtttgagaat agaaccattc gttggattga 4201 agggattatc tcgctcgtct ttggtggcgg ttaaggatac agtggttagg tcatcaatac 4261 cagttccact aaaagacaga ggatttcctt tttgatcagt tggggtaact ttaccgtcgc 4321 gatcgcgaag actgacacga ctataattaa agcccaagga agtattccaa ccttcaatcg 4381 ggcgctgcaa actcacacca cccccaactc gaccttcccg cactctatcg ccatttgcca 4441 atttgatgtc cccatcaaaa gttttagaaa gtccccgctt gcggaacgca ctcacactat 4501 aaccaaaggc gtctggattg gtttgccggt agggacttgt aaatttagta 
tcaaattgaa 4561 aatcacgcgc accaagttca agattaaaac cgaaaagatc gttaacgcca ccaacatttt 4621 gatctcggta agttaatgta ccaactatcc cttgatcggc gttataatta ccgcccacat 4681 taatagaacg tgcccctatt tccttaagtt cgtaaacgac atccagtttt gtcgcatctc 4741 cttcaaaagc aacgttgaca ttctgaaata accctgtctt atacagttgc tgtaaatctt 4801 gcttgactaa gttttcttga taaacttgtc ctggcttaaa cttaagctgt tgctgcaaga 4861 aatctgtttt tgtacgtcct tggactggct tacccttgtt atcaactgtt ttgccttctt 4921 cgttcacaaa acgaaacttg atattgttga caacaccttc agcaacattc aaagtcagga 4981 ttccttcagc actgggtcta atcgatataa ctcttgcaac tttataacct ttgtcagcgt 5041 accatttatt aatatcttgg actacttgtt taagaccact gggactgata acttttccta 5101 tttgaggttg aaagcgttcc ctggcgactt cataactcaa cacgtttgcc ccggaaagtt 5161 gtagagaacg cacaacaact ggttgtactt ggtactctac acttaagcca gttggcgtag 5221 aacggctatt aacacgagca tccgtaaata aaccagtctc caaaattgct ttaacatctt 5281 tttgtaactg actttgactg gtttgtgcac cgacttgagt gttgatgaca ctgcgaatcg 5341 cctgctgcaa ttcctcattc gccccgacta tgtttacttc actagcagtg actactaaat 5401 cattaccagt agctggagag gatgagggag acgaggggga tgagggggat gagggagatg 5461 aaaaagttat ctcccttgtc tccttctcct tttcggtacc tatcaccact gatttacgac 5521 taccgccatg cacattacca gatttttgag agaattgtcc aataactact gtctctgggg 5581 gtgcaatcgt ttctatccga gcaggcgttt cttcggtcac tggcaccaca acgttatcag 5641 gtttttcagg ttgctcagaa gtattcgagg gtgcagcaaa tgcttgctga tgagcattgc 5701 cagttgccaa agtcactaaa gtaaaaatcg ccgcagaaga aactcgcata ataaatagta 5761 gtctaatgaa ctgggagtta gttcgtactt tcgtactgag gtttggcaat gggcatagta 5821 ctaacttcca gctacgttgc cgatttatac ccccattgat tcctcttgtg ccaaacgaca 5881 gttattttta aatttttgtt tctcatgagt gattcatcac catcatcgtt aagttctaac 5941 atcgaaaagg atactccaaa cttacggtta ctcgttatga gcaacggaca tggggaagat 6001 attattgctg tccgtatctt gcaagaactt cagcaacaac ccaacccacc cgaaatcttt 6061 gctttgccgt tggtgggaca aggatacgct taccaagggt tggatattcg cctcatcggt 6121 tccgtacgca caatgccgtc gggcggtttt atttacatgg atggacgcca atttgtacga 6181 gatgtcaatg gtggtttgtt acaactcacc gtcagtcaaa ttaactccat acgtcactgg 
6241 gtaaatactc aaaaaaaatc aggtcacagg atagctatat tagcagtggg cgatatcgta 6301 ccactgatat ttgcgtggtt gagtggtgct aactacggct tcgtcggcac agcaaaatct 6361 gagtactatg tacgagatga cgtgaattta ttaaaacgca aagataaagg ggcatggtgg 6421 gaaagttttt ctggctcaat ttaccatcct tgggaacgtt ggttgatgag tcagcagcgt 6481 tgcaaggctg tgtttcccag agactcgctc acagcagaga tattaaaaaa gtggccgatt 6541 ccggcttttg atttgggtaa cccgatgatg gatgctttgg aaccaacaat tccacgtcca 6601 agattttatg gtactcatat agaaaagcaa gaattagctc gacctttagt tgtgactctc 6661 ctccccggtt ctcgtccgcc agaagcatac gctaattggc atcaaatgat gattgctgtc 6721 tctgcattga tggcgctttg gaaagagcaa aagtttgtgg aatacagatc tgaaactgtg 6781 gtttttttgg gtgcgatcgc ccccagttta cacattgaaa gcatacgcca aatccttgaa 6841 tctcaaggct ggcagcccca cgctcaacct ccaattcaaa ttcccgataa ggattccttg 6901 acatttcagc aaaagaatgc attctttata ttatcgcaga atgcttataa tgactgcttg 6961 catcagggag atttggcgat cgcaatggca ggaacagcaa cagagcaatt tgtgggttta 7021 ggcaaacctg cgattactat ccctggactt ggtccccaat ttacctatgc ctttgctgaa 7081 gcacaaagcc gtcatttggg gccatctgtg attttagtag aacaacccgc agaagttgct 7141 caggtggtta agtcactgtt tgccaacccc gatatgttgc aagctattgc agaaaatggc 7201 gtccgacgaa tgggtaaagc aggtgcagcc gaacgaatag ctgagtattc gatgaagttg 7261 ttgtggaatg aagattgatt tgcaatagaa ggaaaagagg atgttttaaa agttttgggc 7321 gaatataatt cgctactaca caagcgaagt taagaagaat tgtaggttgg gttaagcgga 7381 gcgcaaccca acaaaagcga ggtaggtgtt gggtttcgtt cctcaaccca acctacgtct 7441 acgtctgctc gcttcgctca aattcaaaat tatcaattca aaattcaaaa ttaagaattc 7501 tttttttttg aactttgaat tttgaacttt gaattccctt gcgg // LOCUS NODE_4216_length_7507_cov_4.9492757507 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7507) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7507) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7507 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 
complement(221..2353) /locus_tag="DP116_25560" CDS complement(221..2353) /locus_tag="DP116_25560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318294.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="flavin-dependent dehydrogenase" /protein_id="PRJNA477356:DP116_25560" /translation="MQEILYLEISTSDTTAVCHWLDTDFEPGTTVEKVLTPQGFRLRV SNTTISTEIVSEKLPSELSVFVWSVQRTTYLKVFRWADKPFPREQQILQRLTTGIRSR FPHEYPEPPTINLSEQSIFEALAPYYPLTVKYFQKMPNGEYDLKRVYWWEQRWREGVR NPQQPRQVVFSHRSKAGEQRSPGAGESTDTNSSLHPQYDLIYIGGALGVIHAAVMARL GYRVLLIERMPFGRMNREWNISRDEIQSLINLGLLTTAELESIIAREYKDGFNKLFDA NNPRNLKAPVLHTPTVLNVALDSEKWLHMCGQKLRAAGGEIWDETEFIRADVHNTQVV VSVKDLTTQTEKQVSGRLLVDAMGTASPIAWQLNGGRAFDSVCPTVGAVISRGFEPGV WDSQYGDVLYSHGDISRGRQLIWELFPGADEELTIYLFHYHEVHPKNPGSLLEIYEDF FTILPEYRRCDMDKLVWKKPTFGYIPGHFSVGSRDRTVAFDRLIAIGDAASLQSPLVF TGFGSLVRNLERLTTLLDTALKHDLVSFRHLNQIRAFQNNIAVTWMFSKGMMVPTGKF IPPQRINSMLNNFFGLLADEPPQVSDNFIKDRFDWFTFNRLALKAARNNPALLLWIWE LAGAKDLLRWTRNYLDFSRHAFVRALLNGWFPRFLRRIGPMLEIRYPALWLQLLAINY SLTAGIARPQNLVSQVTSKAMLQKPEII" gene 2511..2744 /locus_tag="DP116_25565" CDS 2511..2744 /locus_tag="DP116_25565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455806.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25565" /translation="MNYPIPDSPQEIVALRQKPIDEEMVAAAIAGVIKIVRAQGQSLE DLTAQVLADDSLLDLQQRRWLSQVVAQAWENFS" gene complement(2943..4565) /gene="cimA" /locus_tag="DP116_25570" CDS complement(2943..4565) /gene="cimA" /locus_tag="DP116_25570" /EC_number="2.3.1.182" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="citramalate synthase" /protein_id="PRJNA477356:DP116_25570" /translation="MTANSSHQLWIYDTTLRDGTQREGLSVSIEDKLRIARRLDKLGI PFIEGGWPGANPKDVQFFWHLQEEPLQQAEIVAFCYTRRPHKTAADDPMLQPILAAGT RWVTIVGKSWDLHVTEGLRTTLTENLAMIQDTIEYLRSQGRRVIYDAEHWFDGYKQNP DYALQTLQTAISAGAEWIALCDTNGGTLPHEMTQIVTDVVKSIPGETQLGIHTHNDSD TAVANALAAVMAGVKMVQGTINGYGERCGNANLCSLIPNLQLKLGYKCIEDSQLSELA QTSRFVSEVVNLAPDEHAAFVGRSAFAHKGGLHVSAVERNPLTYEHIQPEQVGNSRRI VISEQSGISNVLAKARTFGIELDKNNPATGQILQRLKALESEGYQFEAAEASFELLMR EALGRRQSFFEIKGFQVHCDLVEGKEATNALATIKVAVKNKDILEAADGNGPVAALDA ALRKALRNFYPHIAEFELTDYKVRILNGHTGTAAKTRVLVESRNSYQRWTTVGVSTNI LEASYQAVVEGLEYGLLLHSQAQAALSTSSGN" gene 5161..5412 /locus_tag="DP116_25575" /pseudo CDS 5161..5412 /locus_tag="DP116_25575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016949086.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="(2Fe-2S)-binding protein" gene 5463..5849 /locus_tag="DP116_25580" CDS 5463..5849 /locus_tag="DP116_25580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310403.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25580" /translation="MGRWTPLILMAGGLAILLGTLLGISFPMGGQNTAQAPAPTTTQR PKESKVLPANIDVNSTAAGNGSAEKKNNIAAQTNQNSETNNKQKQDDDTGAANNQPTG GQSSGNNQSSPDTNQNQNNEPIRALW" gene 5998..>7507 /gene="thiO" /locus_tag="DP116_25585" CDS 5998..>7507 /gene="thiO" /locus_tag="DP116_25585" /EC_number="1.4.3.19" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997669.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycine oxidase ThiO" /protein_id="PRJNA477356:DP116_25585" /translation="MTSDVLIIGGGVISMAIAIELKLRGAKVTVVSRDIHAGATHAAA GMLAPGAERISDEAMQELCMRSLYLYPDWTRKLEEFTSINTGYWSCGILAPVYQDETT EQGEKGAKQEKSERGKETSLTHSSSPTYWLEKEAIHQYQPGLGEEVVGGWWYPEDAQV DNRALVQALLTAAKSLGVEFKEGIVVEGIQQQQRQVVGVQTSAGVLHAQHYVLAAGAW SNSLFPFPVRPVKGQMLSVKVPEFVPDLSLTRVLFGEEIYIVPRRDRRIIIGATVEDV GFTPHNTPAGIQTLLQRAIRLFPQIHNYPIQELWWGFRPATPDELPILGTSPCENLTI ATGHYRNGILLAPITARLLADLILEQKTDPLLSHFHYSRFHTKSSTTPMLTYPTLKTQ NSETAARNLELSTLEDSPLQIAGKTFQSRLMTGTGKYRSIEEMQQSVVKSGCQIVTVA VRRVQTNAPGHEGLAEALDWTKIWMLPNTAGCQTAEEAIRVARLGREMAKLLG" BASE COUNT 2170 a 1723 c 1611 g 2003 t ORIGIN 1 agggggtcgc agtcttggcg gttcccgtcg attgcgaacc cccgaacccg gagggcttcc 61 cgtagggtag caactgccgt tagcgcagcg tgtccgttcg gcacggggcc gtggccgtac 121 ccgctagggc tcaggacata cccggagggc tacgtctcta cattgccgtc ttgatcgcgg 181 ttttggtctt atcccaagcg tattggctaa taactaatga ctaaataatt tctggcttct 241 gaagcatcgc ttttgatgtc acctgagaga cgagattttg cggacgtgca atgcctgctg 301 tgagagaata gttaatcgcc aatagttgca accataatgc tggatatcgt atttccagca 361 ttggaccaat tcgacgaagg aaacgcggga accatccatt cagcaaagcc cggacaaaag 421 catggcgact aaagtcaaga tagtttctag tccatctcag taaatccttg gcacctgcaa 481 gttcccaaat ccacagaagc aatgctgggt tattgcgagc tgctttcagc gccagtcgat 541 tgaaggtgaa ccaatcaaat ctatctttga taaaattatc tgacacttgt gggggttcat 601 ctgccaacag cccaaagaag ttgttgagca tagagttgat tctttgcggt ggtataaatt 661 ttccggtggg aaccatcatc cctttggaaa acatccaagt gacggcaata ttattttgaa 721 aggcgcgaat ttggttcaaa tgccggaaac tcaccaagtc gtgtttgaga gcagtatcca 781 aaagagttgt caggcgttct aagttgcgaa ctaaagaacc aaaaccagtg aaaactaggg 841 gagactggag tgatgctgca tctcctatgg ctattaagcg gtcaaaagca actgtgcgat 901 cgcgactccc cacactaaaa tgccccggta tatacccaaa tgtcggcttt ttccacacca 961 gcttatccat atcacaacgg cgatactctg gcaaaattgt gaaaaagtcc tcgtatatct 1021 ctaataagga accgggattt tttggatgaa cttcgtggta atgaaataaa taaatcgtca 1081 gttcttcatc tgccccagga aacaattccc aaatcaactg tcgtcctcgc 
gaaatatcgc 1141 catgactgta gagaacatct ccgtactgag aatcccatac cccaggctca aatccacgcg 1201 aaatcactgc tcctacagtc ggacaaacac tgtcaaaggc acgaccacca ttcaattgcc 1261 acgcaattgg tgaagcggtt cccatcgcat ccactaacaa tcgcccgctc acttgcttct 1321 ctgtttgagt ggttaaatcc ttgacactaa caacaacttg tgtgttatgc acatctgccc 1381 ggatgaactc tgtttcatcc caaatttcac cgcctgctgc tcgtagcttt tgtccacaca 1441 tatgcaacca tttttcggaa tctaaagcca cgtttagtac tgtgggcgta tgcaaaacag 1501 gtgcttttaa gttcctggga ttattggcat caaataactt attaaatcca tctttatact 1561 cccgcgcaat gattgattcc aattctgctg tcgttagcaa acctaagtta atcaaacttt 1621 gaatctcatc acgggaaata ttccattctc ggttcattcg tccaaagggc atccgttcga 1681 tcagcagcac tcggtaaccc agtcttgcca tgactgctgc atggataacg ccaagcgccc 1741 caccgatgta gatgaggtca tattgggggt gcagggaaga attagtatcg gtgctctccc 1801 ctgctcctgg gctcctctgc tcccctgctt tacttctatg ggagaacact acctgacgcg 1861 gttgctgggg attccgcaca ccttcgcgcc atcgttgctc ccaccagtag actcgcttca 1921 gatcatattc gccattgggc attttctgaa aatatttgac agttagagga tagtatggag 1981 caagtgcctc aaaaatcgat tgctcagata aattaatcgt tgggggttca gggtattcat 2041 ggggaaaccg gcttcttatc ccagtagtga gcctttgcag aatttgttgc tctctgggga 2101 agggtttatc tgcccaacgg aatactttta aatacgtagt tcgttgcact gaccagacga 2161 agacggaaag ttcactaggt aacttttccg aaacaatttc cgtgctaatt gtagtattag 2221 aaactctcag gcgaaagcct tgcggagtca ggactttttc aacagtagtt cccggttcaa 2281 aatcagtatc caaccaatga cacacagctg ttgtatcaga agtggaaatt tccaggtaaa 2341 gaatctcttg catttttttc tcaaatctcc gaatctatgt attgagacag atttcacaaa 2401 gggatagagt atagtaagat gcgccctatc tgtctcatcc ctgtttagta aagaaatatt 2461 aagacttttt cagatttatt gttcattctt tgtttttgta aacacacgcc atgaattatc 2521 ccattccaga cagcccacaa gaaattgttg ccttgagaca aaaaccgatt gatgaagaaa 2581 tggttgctgc tgcgatcgct ggtgtcatca aaattgttcg tgctcaaggt caatccttag 2641 aagatttaac tgcccaagtg ctagccgacg acagcctgct agacttacaa caaagacgct 2701 ggctcagcca agtcgttgct caagcttggg aaaatttctc gtaagagagg tttcgtggtt 2761 agtcagtcgc actttcgtgg ttaggcacaa gttgtgatca aacaactaac aactaacaac 
2821 tcatcataca gtcaactcac aaagccaaat tgccaatcac cttcgtggtg tgagagcttc 2881 aataaattgt ctgagattgg caatcttgac taaaaactca aaactcagca tcaatgagcg 2941 gttcagtttc cactcgaagt actcaaagct gcttgtgctt gagagtgtaa caaaagacca 3001 tattctaaac cttctaccac agcttgataa gaagcctcca aaatatttgt ggaaacgcct 3061 actgtagtcc agcgttgata actattacgc gactctacca aaacgcgggt tttcgctgca 3121 gtcccggtgt gtccattcag aattcgtact ttataatctg tcaactcaaa ctcagctata 3181 tggggataga aattcctcaa agccttgcgt aaagctgcat ccaaagctgc aacaggtcca 3241 tttccatctg cagcttctag aatatcctta ttcttgacag cgacttttat agtcgcaaga 3301 gcgttagtcg cttctttccc ctcaaccaaa tcacaatgaa cttgaaaacc tttgatttca 3361 aaaaaagact ggcgacgacc taacgcttcg cgcatcaaca actcaaaact cgcctccgca 3421 gcctcaaatt gatatccttc actctctaaa gctttgagac gttgcagaat ttgccctgtt 3481 gctggattat tcttatcgag ttcaatacca aaagtgcgag ctttcgctaa aacgttacta 3541 attccagact gttcagaaat gacaatgcgg cgactatttc cgacttgttc cggctgaatg 3601 tgttcataag tcagaggatt gcgttccaca gccgatacat gaagaccacc tttgtgagca 3661 aaagccgaac gtcccacaaa agcagcatgc tcgtcaggag caagattgac aacttcactc 3721 acgaagcgac tcgtttgggc cagttcagat aactgactat cttcaataca cttgtaaccc 3781 agcttcagtt gcaaattggg aattaacgaa caaagattgg cattgccgca gcgttcacca 3841 tatccgttga tcgtaccttg taccatcttc actcctgcca tgacagcagc caaggcgttg 3901 gcaactgcgg tatcagaatc gttgtgagta tgaattccta gctgcgtctc accaggaatt 3961 gattttacca catctgtgac aatttgagtc atttcatggg gtaaagtgcc accattggta 4021 tcgcaaagag ctatccattc agcaccagca gagattgccg tctgtaaggt ctgtaaagcg 4081 taatctggat tttgcttata gccatcaaac cagtgttctg catcgtatat aacgcgacgc 4141 ccttgagaac gaagatactc aatcgtatcc tgtatcatcg ctaaattttc tgtgagggtt 4201 gttctgagtc cttctgtgac gtgtaaatcc caagacttac caacaattgt cacccagcga 4261 gtgccagcag caagaattgg ttgcaacatt ggatcatctg ctgctgtttt gtgaggacgt 4321 cgagtataac aaaaggcaac aatctctgct tgttgaagcg gctcttcttg gagatgccaa 4381 aaaaattgta catccttagg atttgcccca ggccaacctc cctctataaa gggaattccc 4441 agtttatcta gcctccgggc aatgcgtaac ttatcttcta tagacaccga taacccttca 4501 
cgttgagtgc catcccgtaa cgttgtgtca tagatccaaa gttggtgtga ggaatttgcg 4561 gtcataagat gcgaactctt ggagacaact ttcaagctaa tgtgtaagta tagttaagga 4621 tagaacaata aaactaacgc agagtatttc tttattgagg agaactcaga gcacagaatt 4681 tagaacatcc tcatcaaaag attagcagga tttaacaata tcgcttctta agcaaaacca 4741 gagttttgag tctgataagg gagattaaga gaaaaatttt ccggactcat tgcttgaaac 4801 ttagaaactg aagtcaagca tcgtttcttt caggtttttt gcaggattcc ccagaactgt 4861 aaagtagggt gttgagtgta gcgcccctgc tacctaccgc caattgtata gtgccaatat 4921 agatatggac aaggtacttg agttttttcg agtcttgggt tatcgattat atattgtttc 4981 tgtgtttttc tgcatcgaaa tgaattgtta caatacaaca aaaagctaaa gtgatcacaa 5041 atgcccaagg cacaaactca aggtcagaca gccaacgtcg tcgtacttgt ctttgccaga 5101 caacagaaag aatttctttc aaaaacgcct tggcaccata cgcgcgaacg cggagccaat 5161 ctgccgcaaa tattgctgca aaatggtgtt gacctctaca ataacgatgc taaggtagta 5221 aactattgga accttggtag ctgcaatcct tgtgcagttc ttgtagaagg tgaagtgtct 5281 aagctcaagc gcgaaaagcg cggcggtcac tacctcccca ttgttcagta agcaagggtg 5341 cgtttggcat gtcaaaccaa ggctttgggc aatgtgaaac tgacaagatt tgatcaattt 5401 tcggaaccct aggttctcaa agactttgaa catcacaagt ttaagacgga gaaattaacg 5461 atatgggtag atggacacca cttattctta tggctggagg cttagcaatt ttgcttggga 5521 cattgctagg cattagtttt cctatgggtg gacaaaatac tgcacaagca cctgcaccca 5581 caactacaca aagaccgaaa gaatcaaaag ttctcccagc taatatcgat gttaatagca 5641 ctgcggctgg gaatggttct gcagaaaaga aaaataacat tgctgcccag acaaatcaga 5701 acagcgaaac gaataacaag caaaaacaag atgatgacac aggcgctgcc aacaatcagc 5761 caacaggtgg tcaatcatcc ggtaacaatc aatcctctcc cgacacaaat cagaatcaga 5821 acaacgaacc aattcgagcc ttatggtagt aggtcattag ttagaagtta caacttatga 5881 gttacaagtt atgagttatg cgttatgagt tataagtcac gagttattag tccaaagtcc 5941 attgacaatt gactattgac tcttgactaa tcactaatga ctaatgacta atgactaatg 6001 actagtgacg ttttaattat tggtggcggc gtcatcagta tggcgatcgc cattgaatta 6061 aagttacgcg gtgccaaagt gacagtcgtc agccgcgata ttcacgcagg tgctactcat 6121 gctgctgctg ggatgttagc accgggtgcc gaaagaatct cagacgaagc gatgcaagag 6181 ttatgcatgc 
gctctctgta tttataccct gactggacgc gtaagctcga agagtttact 6241 agtatcaata ctggttattg gtcttgcggt atcctggctc cagtttatca ggacgagaca 6301 acggaacaag gggaaaaagg agcgaaacag gaaaagagtg agagaggaaa ggaaacttcc 6361 ctcactcact cctcttcccc tacttactgg ctagaaaaag aagccattca tcaatatcag 6421 ccaggattgg gagaagaagt tgttggaggt tggtggtatc ctgaagacgc acaagtagac 6481 aatagggcat tagtgcaagc actgctaaca gctgctaaat ctcttggcgt tgaattcaaa 6541 gaaggtattg tcgttgaagg aattcaacag cagcaacggc aagttgttgg tgtacaaaca 6601 agtgcaggag tccttcatgc acaacattat gttttggcag caggtgcttg gtcaaactca 6661 ttgtttcctt tccctgtgcg tcctgtcaaa ggtcaaatgc tgagtgtgaa agtaccagaa 6721 tttgtgccag atttatcctt gacacgggtt ttgtttggag aagaaattta catcgtaccg 6781 aggcgcgaca gacgaattat tattggtgcg acagttgaag atgttggttt cactccccac 6841 aacacgccag caggaattca aaccttacta caaagggcta ttcgtctatt tccccaaata 6901 cacaattacc ccattcaaga attatggtgg ggatttcgcc ctgcgactcc agatgaatta 6961 cctattcttg gcaccagtcc ttgtgaaaac ttaaccattg ccacaggtca ttatcgcaat 7021 ggtattcttc tagcacccat aacagcaaga ttgctagctg atttgattct cgaacaaaaa 7081 acagatcctt tactatctca tttccactac tcgcgcttcc acaccaagtc atctactacc 7141 ccaatgctta cttaccccac actcaaaact cagaactcag aaaccgcagc taggaattta 7201 gaactcagca ctttagaaga ttcaccactc caaatagccg gaaaaacttt ccaatctcgc 7261 ttgatgacgg gaactggcaa gtatcgcagc atcgaggaga tgcaacaaag tgttgtcaag 7321 agtggttgtc aaattgtcac tgttgcggta aggcgagtac aaaccaatgc ccctggacat 7381 gaaggtttag ccgaagcatt agattggaca aaaatctgga tgttgcctaa tactgccggt 7441 tgtcaaactg cagaagaggc aattcgtgtg gcacgtttgg ggcgagaaat ggcaaaactt 7501 ttaggac // LOCUS NODE_4260_length_7392_cov_4.7888787392 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 7392) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7392) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7392 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(533..1150) /locus_tag="DP116_25590" CDS complement(533..1150) /locus_tag="DP116_25590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015205828.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_25590" /translation="MKPLFFHSFPQEPALAAAFPASYPFQMRAATPADLIGVAQIIAE SFHSQSGLLKWLFPLLRLGIYEDLRHRFASPAPHHLCLVAVDMTADTASNIVGTVELG VRFSDSWMQAGKSFPYLSNLAVHPKYRRQGAASGLLTACEKVSLSWGFQDLYLHVLEN NCQARQLYFKLGYQEHKVESHLNILFFRRSRQILLRKRLNGNLIV" gene 1540..2829 /locus_tag="DP116_25595" CDS 1540..2829 /locus_tag="DP116_25595" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25595" /translation="MTQSVADETPIPDTSWRRHTDPPSLWIAVTTISLTLHLLLFWLL RSYSYNLLSQRNSSNPIPVEFVEISSQPKAPSKAKPVLFKKPSTTQKSSVTSSPKLAS QGNLTPKSTSSVEDSNAIALTNPTSAKTSQSQVKNTTQQSLKKQVIKQKQPQPNPQPI ATSQPTRQPEPFIAKTPEPPLQPTPEDTPQAQQPQPTPDQTQPEPRLQPTPEAQQPQP TPEPTTLADNPDTQNQGNQQELGEQNQALNPENNTTDSTSTNSSPETGEQAKPANEST ESPSTLPRQSDDQVILGKKTPSPDLGPLVTPEQSPLDEPKVGGLVAFWDIPENLAQPQ DNSRRKELNIVLPDTAGDFQPVDFLVWLTIDEQGTLWLIKVDQEIPQPQRSQYQKYAN EIFKGQKFVPATDKDPATGKQKSGLRYLSVRVKIERP" gene 2874..2944 /locus_tag="DP116_25600" tRNA 2874..2944 /locus_tag="DP116_25600" /product="tRNA-Gly" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:2906..2908,aa:Gly,seq:tcc) gene complement(3038..3844) /gene="pstB" /locus_tag="DP116_25605" CDS complement(3038..3844) /gene="pstB" /locus_tag="DP116_25605" /EC_number="3.6.3.27" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139179.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_25605" /translation="MATDIRNGNQTDTVFRTEKLNVYYGNFLAVRDIWLDIPKNKVTA FIGPSGCGKSTLLRCYNRLNDLIDTFRAEGNVFYQGENLYAPHIDPVVVRRRIGMVFQ RPNPFPKSIYDNITFGPKLNGYKGNMDELVERSLRQAALWDEVKDKLRQSGASLSGGQ QQRLCIARAIAVQPEVILMDEPCSALDPISTLRVEDLLHELKEQYTIIIVTHNMQQAS RVSDKTAFFNVQTSEKGGRTGYLVEYDRTETIFQDPQQQATKEYVSGKFG" gene complement(3895..4785) /gene="pstA" /locus_tag="DP116_25610" CDS complement(3895..4785) /gene="pstA" /locus_tag="DP116_25610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747048.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease PstA" /protein_id="PRJNA477356:DP116_25610" /translation="MTSTSGNNFSPGFTGGLSRSATSPRTLFDKVMTVVAFSCGILAL VPLIAVLSYVIIQGISTLSLDLFTKLPPPPLVKGGGFGNAILGTLVMVGIGALISIPL GVLAAVYLTEFSTGKMSRWIRFATNVLSGVPSIIAGVFAYGIVVSLTGGFSAVAGGFA LSILMLPIIVRTTDEALQLVSDDLRQAAVGLGATEFQTVALVVLPAAVPAIVTGATLS IARAAGETAPLLFTALFSQFWPRGIFEPTASLAVLVFNFAISPFKNLQSLAWAASLIL VLLVLLTSIIARWATSRKIK" gene complement(4805..5755) /gene="pstC" /locus_tag="DP116_25615" CDS complement(4805..5755) /gene="pstC" /locus_tag="DP116_25615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315923.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease subunit PstC" /protein_id="PRJNA477356:DP116_25615" /translation="MNIQAGNRQGTIQPRSELEKSVDRGFILLTRIFALAVAGILLWI AIQVAIQALPAIQKFGASFLVKSAWNPVNNDYGVIPQVYGTLVSAFIGLLIAVPIGIG TAVLLSENLLPSSARTVLVFLVELLAAIPSVVYGIWGIFVLVPILTGIGKWLNAYFGW LPIFSTPPTGPGMFPAGVILAIMTLPIITAISRDALISVPPSLRQAAMGLGATRWETI LKVIIPAAFSGIVSAVMLGLGRAMGETMAVTMIIGNANVINASIFAPANTISSLLANQ FAEANGLQVSALMYAALVLFVLTLIVNILAELIVLRVKRL" gene complement(5858..7042) /gene="pstS" /locus_tag="DP116_25620" CDS complement(5858..7042) /gene="pstS" /locus_tag="DP116_25620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878513.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter substrate-binding protein PstS" /protein_id="PRJNA477356:DP116_25620" /translation="MLFRYLVTNISCLSRTVSVLALSLSLAACGGPQAENGNTTSQAP GGAAKDATASAPGKLDLGGQVRLTGAGASFPAPLYQTWFQALNQKYPNLQVNYQSVGS GAGVEQFTKGTVDFGASDVAMKDEEISKVQNGVLLLPMTAGSIVLAYNVPGVQDLKLP RDVYTGIFLGTIKSWDDPKITAANPGAKLPKQPITIVHRSDGSGTTGVFTKHLSAINP EWKSKVGEGKTVNWPAGVGAKGNEGVTAQIQQTQGAIGYVEYGYAQQNKLTFAALQNK AGQFVAATDESASKALDAVELPANLRAFITDPEGKDSYPIVTYTWLLVYKKYPDAAKA KAVEAMIEYGLTEGQKAASQLGYVPLPEKVLKKVAASADAISPDYKISVGGGSGGASA SK" BASE COUNT 2377 a 1654 c 1430 g 1931 t ORIGIN 1 aaaatctaca aataaaaatc tcttaattta tctttttatg aatttttgta aagaaaagat 61 tttccaatca aaatgataat atttatttat tgtaactagt caagtctatt ttattaggat 121 tacctaacat tttaaggaaa atattcttaa tcacaacctt tgtattctag taaatagtga 181 gtcaagaagt agtaatgctc tcttttgcgg taaatatctt tattttgctt atctaacttt 241 agaataacgc taatagcgaa aaattctcat cttatgattg ctggaaactc aatgtcaaac 301 tacaaataga acacagattt ccagatcatc gaaatcaact tctaagcaga cacaatgtcc 361 tcgtgtcaga tcaagtcgta ttaacttgaa atgtctgtaa tagctattca cggtagagat 421 agaggctgtc tagtggtgaa cccgaggcga atagcacgac tgtctggatc tgttaatcct 481 acaagtggtg ttgtgataca ccatacgcca gaaaagtcat taagtagaaa attcagacaa 541 
ttaagttacc atttaggcgc ttacgcaaga gaatctggcg tgagcgcctg aaaaacaaga 601 tattcaaatg ggattccact ttatgctctt gatatcccaa cttgaaatag agctgccgtg 661 cttgacagtt attttccaaa acatgaaggt ataaatcttg aaacccccaa gagagggaaa 721 ctttttcaca agctgtcagt aacccagaag cagcaccttg cctacggtat tttgggtgta 781 cagctaaatt ggacaggtac ggaaaacttt tacctgcttg catccaagaa tcgctaaaac 841 gcacacccaa ctctacagtt ccaacgatgt tgcttgcagt atcggcagtc atgtcaacag 901 caaccaaaca gagatgatga ggtgcaggag atgcaaaacg atgccttaag tcttcgtata 961 tacccaaacg caataatgga aaaagccact ttaatagacc actttggcta tgaaagcttt 1021 cggcaatgat ttgggcaaca ccaatcaaat cagcgggtgt agccgcacgc atttgaaatg 1081 ggtaagaagc tggaaacgct gcggctaagg ctggttcttg aggaaatgag tgaaaaaaca 1141 agggtttcaa agctagttca gttttgattt tatccaatac aaatattctc aaatatcgta 1201 aaacatcatc aaacctatag caaaactttg tgatccttca acactgatga accgtttact 1261 gtgagcaagt aatttagcat agctttctga agatcccgat tgctaaattg gaggtatgta 1321 tttgttgaat agtatacttt gctattgaat ctcaattgtt gccatatcgc aaactctacc 1381 atgattggca caaaggaaac ttacttgcac atcttaaagc cggtgctctg gcaaaaaact 1441 taacatattg agttcagtga acaaactgtt gttgaagagt tcacaacaag caataaggca 1501 gtgttacact ggctcaggat tgagaacaca gaacttggta tgacacaatc ggttgcggat 1561 gaaactccta ttccagatac atcatggagg cgacacaccg atccacctag tttgtggatt 1621 gctgtaacca caatttcttt aaccctccac ttattgctct tttggctgct gcgctcgtat 1681 tcgtataacc ttttatcaca acgaaattca tcaaatccta tcccagtgga atttgtggag 1741 atttcttcac agccaaaagc gccatcaaaa gcaaaaccag ttttgttcaa aaaaccttcg 1801 acaactcaaa agtcgtctgt aacaagctca ccaaagctcg ctagtcaagg aaatttaacg 1861 ccaaaatcca cctctagtgt agaagatagc aatgcgattg cactcacaaa ccccacaagc 1921 gcaaaaactt ctcaatctca agtcaagaat acaactcaac aatccttaaa aaaacaggtt 1981 atcaagcaaa aacaaccaca accaaatcct caacctattg caacatctca accaactcgc 2041 caacctgaac cattcatagc aaagactcca gaacctccac tacaaccgac tccagaagat 2101 accccacaag cacaacaacc tcagccaacc ccagatcaaa cacaaccaga acctcggtta 2161 cagccgactc cagaagcaca acaaccccag ccaactccag aaccgacaac tttggcggat 2221 aatccagaca 
cacagaatca agggaatcaa caagaattag gcgagcaaaa tcaggcttta 2281 aatcccgaga ataacaccac agattcaact agtactaatt cttcacctga aacaggtgag 2341 caagcaaagc cagctaatga gtctacagaa tcacctagta ctcttcccag acaatctgat 2401 gatcaggtta ttcttgggaa aaaaacacct tcaccagatc ttggtccact agttacacca 2461 gaacagtcgc cactggatga gccaaaagtg ggaggactgg tagcgttctg ggatatacca 2521 gaaaacctag ctcaacccca agacaacagt aggcgaaaag aattaaacat cgttttaccc 2581 gacacagcag gcgattttca gcctgtagat tttttagtgt ggctaaccat tgatgaacaa 2641 gggacgttgt ggttgatcaa agtagatcaa gaaatacccc aaccacaaag aagccaatac 2701 cagaaatatg cgaatgaaat ttttaaaggt caaaaatttg tccccgcaac cgataaagac 2761 cctgcaactg gcaagcaaaa gtcaggatta aggtatctgt ctgtacgtgt aaaaatcgag 2821 cgcccctagc ttgactttgt gcgaaaatag tgttatgttc taatagtgaa tttgcgggcg 2881 tagtttagtg gtaaaactat agccttccaa gctattaatg cgggttcgat tcccgccgcc 2941 cgcttatcta aaaatgaaaa aagttctcag ggtaaccctg aaaacttgaa acttgttgtg 3001 ataaagcatt attgcgacaa aaaagaatgt gtgatcttta accgaattta ccgctgacgt 3061 attccttagt ggcttgctgt tgcgggtctt ggaaaatagt ctctgtgcgg tcatattcta 3121 caagatagcc agtacgacct cctttttcgg aagtttgaac gttaaaaaag gctgtcttat 3181 ctgaaacccg ggaagcttgt tgcatattgt gagtcacaat aatgattgtg tactgctcct 3241 tgagttcatg cagcaagtct tcaacccgta gggtggagat ggggtctaga gcagaacagg 3301 gttcatccat taaaataact tcaggttgaa ctgcgatcgc ccgggcaata cacaaacgct 3361 gttgttgtcc cccagataag gacgcaccac tttgccgtag tttgtctttc acttcatccc 3421 acaaagccgc ttgtctgagc gatcgctcta ccaattcatc catattacct ttataaccat 3481 tcagtttcgg tccaaaggta atattgtcat aaattgactt tggaaaaggg ttcggtcttt 3541 gaaacaccat cccaattcga cgacgcacca caacagggtc aatatgaggt gcatacaaat 3601 tctcaccttg gtaaaataca ttaccttcag cccgaaaagt atcaattagg tcgtttaggc 3661 ggttgtaaca acgtagcaaa gtacttttac cacatccaga aggaccgata aacgctgtaa 3721 ccttattttt cgggatatca agccaaatat cgcgtactgc caaaaagttc ccgtagtaaa 3781 cgttgagctt ctctgtacga aaaacggtgt cagtttgatt tccgtttcta atgtcagttg 3841 ccataaattt agagactcta ttttaacctg aatgaaactt gcgatttgct ttagttattt 3901 aattttgcga ctcgtagccc 
aacgagcaat gatactggtg agtagaacca acaacaccaa 3961 aatcaatgac gccgcccaag ctaatgactg caaattttta aacggagaaa ttgcaaagtt 4021 gaaaactagt accgcaagag atgcagtagg ctcgaagata ccacgaggcc aaaattgaga 4081 gaacaaagca gtaaatagta gaggtgcagt ttctccggcg gcgcgggcga tcgacaaagt 4141 tgccccagtg acaattgctg gtactgctgc tggtagaact actaatgcca cagtttgaaa 4201 ctcagtcgca cctaacccca cagctgcttg tcgcaaatcg tccgatacta actgcaatgc 4261 ttcatcagtg gttcggacaa taattggtag catcaaaata gacagagcaa atcctccagc 4321 tacagctgag aatccgcctg ttaaactgac tacaatacca taagcaaaca caccagcaat 4381 aatagatgga actccactga gaacgttggt agcaaagcga atccaacgag acatcttacc 4441 agtgctaaat tccgtcaaat agactgctgc caaaactcct aagggaatgc taattaaagc 4501 tccaattcct accataacga gcgttcccaa tatggcattc ccaaagcctc ctccctttac 4561 caaaggtggt ggtggcaatt tggtaaataa gtctaggctc aaagtgctta tgccttggat 4621 aataacgtaa gacagcactg cgatcaaagg cacaagagcc aatatcccgc acgaaaaagc 4681 gactactgtc atcaccttgt caaatagcgt ccggggagac gtcgcggagc gagataagcc 4741 gccagtaaag ccgggagaaa aattgttacc agaagtagaa gtcattattt actacaccca 4801 aatgctacag tcgcttgact cgcagaacga ttaactctgc caaaatgttg actataaggg 4861 ttaaaacaaa cagaactaac gccgcataca tcaaagcact gacttgcaga ccatttgctt 4921 cggcaaactg attcgccagc aaagaagaaa ttgtgttagc tggtgcaaat atcgaggcgt 4981 taataacgtt ggcgttaccg atgatcatcg tcacagccat tgtttctccc atcgctcgac 5041 ccaaccccaa catcacagcg cttacaatac ctgaaaaggc agccggaata ataactttca 5101 aaattgtttc ccaacgagtt gctcctaatc ccatagctgc ttggcgtaaa ctcgggggga 5161 cagaaatcag tgcatcacgg gatatagctg taataattgg caaagtcata attgccaata 5221 taacccctgc tgggaacatt cctggtcctg ttggaggggt gctaaaaatt ggtaaccagc 5281 caaagtaagc gttaagccat tttccaatac ctgtcagtat gggaaccaaa acgaaaattc 5341 cccaaatgcc ataaacgaca ctgggaatag ctgctaatag ttccaccaaa aaaaccaaga 5401 ctgttcgtgc agatgatggt agcaaatttt cactcagcaa aacagcagtg ccaatgccaa 5461 ttggtacggc tatcaataga ccaataaatg cacttaccaa agttccatag acttgcggta 5521 taacaccgta gtcattattg actggattcc aagcgctttt tactaaaaag ctagcaccaa 5581 acttttggat cgcaggcagt gcctgaatgg 
caacttgtat cgcaatccat aatagaatac 5641 ctgcgaccgc tagtgcaaaa atccgagtca ggagaataaa gcctcgatca actgactttt 5701 ctaattccga ccggggttga attgtgcctt gacgatttcc ggcttgtata ttcatgaata 5761 acttcaccca tcaactgaaa aatgtgggag cgaaccacac agaagaattc aaaattcacg 5821 catttgccca ttcaaaattt gacaagcaat gccagtgcta cttgctggcg ctagcaccac 5881 cgctaccacc accgacactt attttgtaat ctggactgat agcatcagca ctagcagcaa 5941 cttttttcag cactttttca ggtaaaggaa catatcctag ttgtgaagca gccttttgac 6001 cttctgtcaa gccatattca atcatggctt ccactgcttt ggcttttgct gcatcaggat 6061 atttcttgta aaccaaaagc caagtgtagg taacaattgg gtaagaatct tttccttctg 6121 ggtctgtaat aaaggcgcgg agatttgcgg gcaattctac agcatctaaa gctttagatg 6181 ccgattcatc tgttgctgcg acaaattgac ctgctttatt ttgcaaagca gcaaaagtga 6241 gtttgttttg ttgagcgtaa ccgtactcta catatccaat tgcaccttga gtttgttgga 6301 tttgggctgt cacaccttca ttacctttag caccaactcc tgcgggccag tttacggttt 6361 taccttcacc aactttggat ttccattctg gattaatagc actgaggtgt tttgtgaaca 6421 cacctgttgt accactacca tcagaacgat gcacaatcgt aattggttgc ttaggaagtt 6481 tggcaccagg gttagcagcg gtgattttcg ggtcatccca agacttaatg gtgcctaaga 6541 agatcccagt ataaacatct cgtggaagtt tgagatcctg tactccaggt acattgtaag 6601 ctagcacaat gctaccagca gtcatgggca gcaaaaggac gccgttttgc accttgctga 6661 tttcttcatc cttcatggca acgtcgctag caccaaagtc tacagtacct ttggtaaatt 6721 gctcgactcc agcaccacta ccaactgact gatagttcac ttggagatta ggatatttct 6781 ggtttaaagc ctggaaccaa gtttgataca aaggtgctgg gaaagaggct ccagcgcctg 6841 ttaatctgac ttgtccacca aggtcaagtt ttccaggagc actagcagta gcatcctttg 6901 cagcaccacc aggggcttgg ctagttgtat taccgttttc cgcttgcgga ccaccgcaag 6961 cagctaagct gaggctcagt gctaatactg aaactgttct tgataagcaa gaaatattag 7021 ttacaaggta acgaaaaagc atggatatgt ttttttggtg attttccagc taagcgatca 7081 taatatacaa cttaagagta aagataaggt taagataatc caaacaagta ctttttctca 7141 cgtccaatat cgcaaaatct taatattgtt tacttcatta aattttgtaa ctgtcaattg 7201 ctcattcgga gaacaaatta tacattagta catttgtttt agtgggagat acagaaacac 7261 acgttacaca acaataatcg tttaagaagt ctatccaatt 
caaaaataat aaagtttaat 7321 gaatcagcta aacgtgttca aagaaaaatt tttaacttgt taaaatacag agacaaaaaa 7381 acaagataat ac // LOCUS NODE_4263_length_7388_cov_5.714032 7388 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7388) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7388) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7388 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 87..1127 /gene="cobW" /locus_tag="DP116_25625" CDS 87..1127 /gene="cobW" /locus_tag="DP116_25625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182964.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein CobW" /protein_id="PRJNA477356:DP116_25625" /translation="MHKIPVTVITGFLGAGKTTLIRHLLQNNQGRRIAVLVNEFGEVG IDGELLRSCQVCDDEEDPNSNIVELTNGCLCCTVQEEFLPAMQELLKRRDRIDCMLIE TSGLALPKPLVQAFRWPEIRTHATVDSVITVVDCEALATNQFVGDLEALEVQRQADAS LEHETPIEELFEDQLACADLVLLTKGDRVDDQTQAKVQQWLKQNLSPSVKLIQCQDGK IDCDLLLGFNAAVEDNLDSRPSHHDTEAEHEHDDGINAVQLLLDQAFEPSVLVQRLQT LVQQQEIYRIKGFVAVPKKAMRLVLQGVGNRFDYFYDRPWQPHEPRQTRLVLIGRELD QVGIESLVRQGN" gene complement(1124..2368) /locus_tag="DP116_25630" CDS complement(1124..2368) /locus_tag="DP116_25630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320731.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XRE family transcriptional regulator" /protein_id="PRJNA477356:DP116_25630" /translation="MKQDGSLCNNLKSIRTRLGMSQQDLANIAGVTRQTISGVESGQY APSVAVSLRLAKALGCQVENLFWFEEDLPEVEAVLTKPVKSGQQLRVSLARVGGQWIA YPLVGNDAFRIEMIPADGETVAVSPALREGFPPQVTGEPVRVDGFPGISKLSNPKGEA QSKTGTNKVQVRLLDDTDKLHNTVVIAGCTPVISLWAKATERWHPQLRVHYNFANSMA ALRSLERGEVHIAGMHLYDPQTGEYNIPFAREALGGKNAVLITIGVWEEGLLVAPGNP MGIKTVSDLVDLGATIVNRESGSGSRMLLERKLQEEQVPFHTLKGFDHIVHSHKDVAL SIVSGIVDAGVSTASIATAFGLGFVPLHRARYDLVILKEYLEEAPVQQLLSTLGHRLV QSQLEVLGGYDISKIGEVVANI" gene 3726..4469 /locus_tag="DP116_25635" CDS 3726..4469 /locus_tag="DP116_25635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320714.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molybdenum ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_25635" /translation="MLGCPQSWGWEKLSNALPPEVSREYNIPIKLEFDHSGLLQKRLL SGEHSDIFASVDMENPTALMKANRSSPVVNFIRNKMCAIVKPSLKVTPNNLLDWMLNP EIRLGTSTPNEDTSGDYAQEIFHKAEKVQAGSFNELNRKALRLIGGRNSPVVPKGKNE IAYFITETQQVDIFLSYRTDARLAVLAASSMQIVELPENLAVKANYGMTLMKDARSSG VMLAMYILSQNGQKILTKYGFDSPLILDN" gene 4730..6157 /locus_tag="DP116_25640" CDS 4730..6157 /locus_tag="DP116_25640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase iron-molybdenum cofactor biosynthesis protein NifE" /protein_id="PRJNA477356:DP116_25640" /translation="MKNTQVKINQPLNETVCEHNQKKLEDKKKKSCKKPPQPGAAQGN CSFEGAMVALVPITDVAHLVHGPAACTRNPWENRGSLSSGSELYKIGFSTDLSENDVI FGGEKKLYQAILDIAQRYNPAAVFVYSTCVTALIGDDTDSICKAAAKKIGIPVVYINS PGFLGSKNLGNRIGGQALLEHVVGTAEPEITTPFDINIIGDYNVAGEMWNVLPLFNRL GIRVLSKMTGDARYQEICYAHRAKLNVVICSNVSLGMAKTMQERYGIPYIEESFYGVE NINRCLRNIAATLSESLGENLTSRYLQERTEWLIAEETAALDIALASYRSQLKGKRIV LYSGGVKSWSIILAARDLGMEVVATSDRKSTPEEKAKIKQLLGQDGMVLSKGSPNVLL QVLKDTKADMLVGGASNQYTALKAKIPFLDVNHERHHAYAGYAGMLKMAQELYKALYS PVWEQVRKPAPWWNEKIDQVLSGQV" gene 6347..6748 /gene="nifX" /locus_tag="DP116_25645" CDS 6347..6748 /gene="nifX" /locus_tag="DP116_25645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867597.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="nitrogen fixation protein NifX" /protein_id="PRJNA477356:DP116_25645" /translation="MKVAFTTSDGTHINTHFGLAKDIDVYEVSKDGFNFIETLTFEGD LEEAPHEDKITPKMEAVLDCRIVYVKAIGKPAGNKLMKEGVTPIRAQEYDTIPDILYM LVQSLNGDAPPMLRKALQLVENNLAYSHDEE" BASE COUNT 2279 a 1488 c 1541 g 2080 t ORIGIN 1 ctgataactg cggctggtac ttgataactg ataactgata actgataact gttaactgtt 61 aatgaagaac aaacaacata ttttctatgc ataaaattcc cgttacagtc attacaggat 121 ttcttggcgc aggcaaaacg acgttgattc gccatttact ccaaaacaat cagggacgac 181 gcattgctgt tttggtcaat gaatttgggg aagtcggaat tgatggcgaa cttttgcgtt 241 cctgtcaagt ctgtgacgat gaagaagacc ccaacagtaa cattgttgaa ctcaccaacg 301 gttgcttgtg ctgcacggtg caagaggaat ttttaccagc gatgcaagaa ttgctaaagc 361 gacgcgatcg cattgactgc atgttaattg aaacctctgg actagcacta ccaaaaccat 421 tggtgcaagc attccggtgg ccggagattc gcacgcacgc tacagtggac agcgtcatca 481 ccgtggtgga ctgtgaggct ttagcaacca atcaatttgt gggcgattta gaagccctag 541 aagtacagcg acaagccgat gctagcttag aacacgaaac accgattgaa gaactgtttg 601 aagatcagct tgcgtgtgct gacttagtgt tgcttactaa gggcgatcgc gttgatgacc 661 aaacccaagc taaagtgcag cagtggttaa agcagaactt gtctcctagt gtaaaactca 721 ttcagtgcca ggacggtaaa attgattgcg atctgttact cggtttcaac gctgcggtag 781 aagacaactt ggatagtcgc cccagtcacc acgataccga agccgaacac gaacatgatg 841 acggaattaa tgcggtgcaa ctactactag accaggcatt tgagccgtct gtcttggttc 901 aacgtttgca aacattggta caacagcaag aaatttaccg aataaaggga tttgtcgcag 961 ttccgaaaaa agccatgcgt ctggtattac agggtgttgg taatcgattt gactactttt 1021 atgaccgtcc ttggcaaccc cacgaacccc gtcaaacgcg attagtgttg attggtcggg 1081 aacttgatca ggttggcatt gagtcattgg tgcgacaggg gaactaaata ttagctacaa 1141 cttctccaat cttgctgatg tcataaccac cgagaacctc taattgcgat tggactagcc 1201 gatgtcccaa agtactaagt aactgctgta ctggtgcttc ttccaagtat tccttgagaa 1261 tcaccaagtc gtatcgcgca cgatgcagcg gaacaaatcc caacccaaag gcagtagcta 1321 tagatgccgt actaacacct gcatcaacaa ttcctgagac tatagataaa gcaacatctt 1381 tatggctgtg gacgatatgg tcaaatcctt tgagagtgtg gaatggcact tgttcctctt 1441 
ggagttttcg ttctaaaagc atccgactgc cagaacctga ttcacggtta acaatggttg 1501 ctcctaaatc caccaaatca gaaactgttt taattcccat tgggtttccc ggtgcaacta 1561 ataatccctc ttcccaaacc ccgatggtaa ttaagacagc attcttacct cctaaagctt 1621 cccgcgcaaa gggaatgtta tattccccag tttgcggatc gtacaagtgc atccccgcga 1681 tatggacttc acctcgttct aaactacgca atgctgccat gctgttagca aaattgtaat 1741 gaactcgcaa ctggggatgc caacgttcgg tggctttcgc ccaaagggaa atcacaggcg 1801 tacaaccagc tatcacgact gtattgtgca gtttgtccgt atcatccaaa agtctgactt 1861 ggactttatt tgtacctgtc ttactttgtg cctcaccctt cgggttcgac agtttgctta 1921 tgccggggaa cccgtccacc cttacgggtt cgccagtcac ctgcggaggg aaaccctccc 1981 gcagtgctgg actcaccgca actgtctcac cgtctgctgg aatcatttcg atgcgaaaag 2041 catcgtttcc aaccagggga taagctatcc attgaccccc aactcgtgct agacttactc 2101 gcaattgctg tccactctta actggtttag taagaacggc ctcaacttcc ggcaaatctt 2161 cctcgaacca gaataagttc tcgacttggc agccaagcgc tttggctaaa cgcagtgaca 2221 cagcaactga aggagcatat tgtcccgatt ccacaccgct aatagtctga cgagttacgc 2281 cagctatatt agccagatct tgctgactca tgcccaagcg agttctaatc gacttcagat 2341 tgttgcaaag actaccgtcc tgcttcatga ggctgtatct ctccgttaca aaaagtcgca 2401 gtgggcaaac tttatctacc gaggctatag caaggtcttc tagtctatgc tgttttgcct 2461 tggaagacta gaagaatgct tttacaattt cgatgatcct actgcaagaa tagcatacat 2521 ttagctgaaa tacttgctat tttttgattc tctgtagcaa gaatagtata ctaattgcta 2581 aaatatttat catttttttg ctttttacag caagtgttaa gcgatagagt gcaggcggct 2641 tccgcatcgc cgtcaggcat tcctctagga taaggctttt tgaagagtag ggtaggcaat 2701 cgtcatcgta ggataacagg ctcgtgtcca ataagctaaa gcgtaattat aagccatgtg 2761 cagacaaagt atctgtccta acgctattat tttaggattg tgtacagctt gccacagaat 2821 ctagctttag taagtatttt tgcaagatac atgatattgt tgccgcaaaa aaaaattccc 2881 caacagcaaa tacttttgca agatgtatgc tatgcttgca tcaaggggat aaaaagagcc 2941 aaagcattaa caagtcttcc tacagaaacg gaaaacgtta gaagacttag ctatttccac 3001 cgaaggtgct attgaatcgg ttacgatttt cagtaactct attcttgagg tatttgacag 3061 atcatagttc ctgatgaaga taaaagtttt acctttttac agttcgtctg tggtcttgga 3121 caatcgcaac 
atcttaggca atttcatttg ctgtaattga cttcatatga cattgaaaag 3181 ttttgtttag tgcggctaaa taattagctt tgtgacttct cactgaatat aaaagcttgc 3241 tggatgaggt gcatttaact ttactgggtt aggtgtatcc aactggggga aagggaaatt 3301 tccctttccc atttcctctc attcccacac ttttagagaa ctcccaaaaa agttttgaat 3361 actttcacca aaacaccact tctggtcaca ctttctgata tctacaatca atcaacgaca 3421 aagaagatta tgacttctac acacaagaga aaaaactgac tgacagtgtg tcttcccagg 3481 agttggaaat tgctatttga tagcttttct gccgataggg gatgtggagt aaatacgcta 3541 tagaacacct tctgactcta tacctggtta ctccttttac ccttcgggta tgcctgcgtt 3601 cgccaagggc gaacgcagtc gcctctgtcg ggaaaccctc ctgcagcgct gtctcactgt 3661 ccactcccct atctttcctc ttcatactaa aaacgcattt atgacggcag tcgctacaat 3721 gagagatttt gggatgccca caatcgtggg gctgggaaaa actctccaac gcgctgcctc 3781 cagaagtaag tagggaatac aacattccta taaaacttga atttgaccat tctggcttgc 3841 tgcaaaaacg tcttttgagt ggagaacaca gtgatatatt tgctagcgtt gatatggaaa 3901 atccaactgc tttgatgaaa gctaatagaa gcagtcctgt ggtgaatttc atcaggaata 3961 aaatgtgtgc aattgtcaaa cccagtttaa aggtgacgcc aaataatctt ttagattgga 4021 tgttaaatcc agaaatcaga ttaggaacat caacaccgaa tgaagatact tctggggact 4081 acgcacaaga aatattccat aaagctgaaa aagtacaagc aggtagcttt aacgaactaa 4141 atcggaaagc actgcgcttg ataggtgggc gtaattctcc tgttgttccc aagggaaaga 4201 atgaaattgc atacttcatt actgagacac aacaggtaga tatcttttta agctaccgta 4261 ctgatgctag gttagctgtt ttagcagcat cgagtatgca aatagtggaa ttaccagaaa 4321 atctggcagt caaggctaat tatggtatga ctttgatgaa ggatgcccgc agttctggag 4381 tgatgttagc tatgtatatt ctttcccaaa atggacaaaa aatcttgact aaatatgggt 4441 ttgactcacc tttaattttg gataactaag tgaagatggc tgaaatagac aagaactata 4501 catacaagta ttgagaagtc accttctacc aaataagaat tgagtagcta taagtaattc 4561 cattcatagt ttgcacgaaa acagaatgat tttccgtaaa taaggaggaa aaaataaatt 4621 ttaaaaaatt ggcacaagat gtcaatcaag ctcaaaatca ataataagta ttttacttat 4681 tcctcattcc ttcttcctcc acctcatcct attcaacgca aggaaagaca tgaaaaacac 4741 tcaagtaaaa atcaaccaac cactcaacga gacagtctgc gaacataacc agaagaaatt 4801 ggaggacaag aaaaagaaat 
cttgcaaaaa accaccacaa ccaggtgcag cccaagggaa 4861 ttgttctttt gagggggcga tggttgctct tgtaccaatt actgatgttg ctcatttagt 4921 gcatggtccc gctgcttgta ctcgtaaccc ttgggaaaat cgtggtagtc tctcttcagg 4981 ttctgaatta tataaaatcg gcttcagcac tgacttgagt gagaatgatg tcatttttgg 5041 tggtgaaaag aagctttatc aagctattct cgacattgct caacgctaca atcctgcggc 5101 tgtctttgtc tactctacct gtgtcactgc tttgattggt gatgatacgg atagtatctg 5161 taaagcagct gcaaagaaaa ttggtattcc agttgtctat ataaattcac ctggatttct 5221 tggtagtaag aatttaggta accgcatcgg tggtcaagct ttattagaac atgtggttgg 5281 aactgcagaa ccagaaatta ctacaccatt tgatatcaat atcattggtg actacaacgt 5341 tgctggtgaa atgtggaatg ttttgccact gttcaataga ttgggtattc gtgttctctc 5401 caaaatgacg ggtgatgctc gctatcaaga aatttgctat gctcatcgtg ccaagttaaa 5461 tgtagtgatt tgctcaaatg tatcactagg aatggcaaaa acaatgcagg aacgttatgg 5521 aattccttac attgaagaat ctttctacgg tgtggaaaac ataaatcgtt gcttacgcaa 5581 tattgcagca acattaagcg aatctcttgg ggagaacctc acttcacgct atcttcaaga 5641 acgcacagaa tggctgattg cagaagaaac cgctgcgcta gatattgctt tagcttccta 5701 tcgttcccag ttaaaaggta aacgcattgt tctctatagt ggtggcgtaa aaagttggtc 5761 gattatttta gcagccagag acttaggaat ggaagttgtt gctaccagtg atagaaagag 5821 tacaccagaa gagaaagcca aaattaagca attgctcggt caagatggca tggttttgtc 5881 aaagggaagc cccaatgtat tactacaggt actgaaagat acaaaagctg atatgttggt 5941 tggcggggct agcaatcaat acacagcact gaaagcaaag attccttttt tggatgttaa 6001 ccatgaacgt catcatgcct acgctggtta tgcagggatg ctgaaaatgg cacaggaatt 6061 gtataaagcc ttgtatagtc ctgtgtggga gcaagtcaga aaacctgctc cttggtggaa 6121 tgaaaaaatc gaccaagtcc tttcagggca ggtgtagggg tatatgggaa gaaagggata 6181 aggggcaatt tgggacaaag gaaagaaaga gttaatagac aattatttat tttttcctta 6241 cacccttaca cccttgtttt atgaccatga ttaattcgtt attagttttg gttagcagca 6301 aatgacaaag tacaaacaac aattactaaa tttctggagt caataaatga aagtagcttt 6361 tacgacaagc gacggaaccc atattaacac tcactttggc ttggcaaaag atattgatgt 6421 gtatgaagtc tctaaagatg gatttaattt tattgagact ttaacctttg aaggcgactt 6481 agaagaagct ccccatgaag ataagattac 
accaaaaatg gaagcagttc ttgactgcag 6541 aattgtatat gttaaagcta ttggtaaacc agcaggaaac aagttgatga aggagggtgt 6601 aactcctatt agagcacaag aatatgatac catacccgat attctatata tgctagtaca 6661 aagtttaaac ggtgatgccc caccaatgtt acgtaaagcc ctgcaactgg tagagaataa 6721 cttagcatat tcccatgatg aagaataatg gaattttgaa tttagggttg tcttgcatct 6781 tttttgtacc cattgaacgc ttcaaagaaa gttggatgca attaagtaaa agtagaaatg 6841 gaatgattga cgacagtcct caatctttgg caaagaagaa tattaaccaa gattatcgac 6901 tttaatagca attctatctg atttgcaaga atcgaaaaac ctagaacccc cgacttctat 6961 aagaagtcgg gggttttgtt tctcaaaaat gatttataat cgttatataa aacgtcaaga 7021 atatttatca ttaaatattt tagttagaat attacgtcta attaaaaaat attctgagct 7081 taaaaacgag atcaaaaata gacagctctg tctattttgc caagctaatt aaattattta 7141 ttttgatatt tgttgacatt ttgttatcaa aataacgtta aaaatgtctg attgtagtta 7201 ctatttaaaa agagaaaaga gaaaagaatc tttcttctca aagaactgaa gaaaaaaagc 7261 tttgttctag tattggaatt gcataaggag caatgacacc atgccattga aattattgaa 7321 gtgcgacgaa agcatacccg aacgtgaaaa gcacgtttac atcaaagaaa aaggagaaga 7381 tacaacac // LOCUS NODE_4280_length_7331_cov_5.139225 7331 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7331) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7331) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013).
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7331 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 563..964 /locus_tag="DP116_25650" CDS 563..964 /locus_tag="DP116_25650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864685.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25650" /translation="MRLMIAIILLWLLTACTTSVLAPTSQLVEKAIALELQQTQQQLN QQLDLDFQGFEINRLSITQRKALTVENLPTYHVQGTYNLTFKLPKRKLTQPQKPFEVY LQLQREGKTWRLLVPEDHGKDSKQVWRSYSI" gene 1061..1833 /locus_tag="DP116_25655" /pseudo CDS 1061..1833 /locus_tag="DP116_25655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015126889.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 1727..1736 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(2056..2742) /locus_tag="DP116_25660" CDS complement(2056..2742) /locus_tag="DP116_25660" /EC_number="2.1.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="magnesium protoporphyrin IX methyltransferase" /protein_id="PRJNA477356:DP116_25660" /translation="MNVTDDKTIVKEYFNSTGFDRWRRIYGDGEVNKVQLDIRTGHQQ TVETVLSWLKADNNLAGLSICDAGCGTGSLSIPLAEAEAKVYASDISEKMVGEAKEKA SQILANAENPTFAVQDLETISGSYHTVICLDVLIHYPQQKADEMISHLCSLAQSRIIL SFAPKTFALSLLKKIGSFFPGASKATRAYLHREADVVKILQKNGFSVQRQSMTRTRFY FSRLLEATRQ" gene complement(2837..4078) /gene="nagA" /locus_tag="DP116_25665" CDS complement(2837..4078) /gene="nagA" /locus_tag="DP116_25665" /EC_number="3.5.1.25" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318232.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetylglucosamine-6-phosphate deacetylase" /protein_id="PRJNA477356:DP116_25665" /translation="MTEATHSLPGTSVDIINAKVPGYTDLQILSVNPQGIIEQILPMS KVLKRVPPVDLRVLDIGGDWISLGGVDLQINGALGLAFPDLSTENSRQMLDICRYLWY EGVDAFLPTLVTTSVENIQRALSVIADCINLTHSPSLSANRGYAKILGVHLEGPFLNP QKRGAHSPEYLLPLNLEEVKKVLGNYAHLVKVMTLAPELDPTGEVIPYLRSLGITVSL GHSQATAAEAQRAFELGATMVTHAFNAMPPLHHRESGLLGAAIIHPGVMCGFIADGEH VSPTMLQILLRASTPVFSHEREEIHKQQLFLVSDALAPLGLADGVYPWDTRQISVKNG TARLQDGTLSGTTLPLLVGVQNLVKWGICNVERAIALSTIAPRSAIGLSTILSGASAS QLLRWHLDESRQELLWQRLFS" gene 4271..4789 /gene="purE" /locus_tag="DP116_25670" CDS 4271..4789 /gene="purE" /locus_tag="DP116_25670" /EC_number="4.1.1.21" /EC_number="5.4.99.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411544.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="5-(carboxyamino)imidazole ribonucleotide mutase" /protein_id="PRJNA477356:DP116_25670" /translation="MAPLVGIIMGSDSDLPTMQGAIAVCEEFGIATEVAIISAHRTPE RMVEYAKSAHQRGIKVIIAGAGGAAHLPGMVASLTPLPVIGVPVPSRHLQGVDSLYSI VQMPAGIPVATVAIGNAKNAGLLAVQILATHQAELLNKVQQYRQTLSESVMAKQEKLE ELGYEQYLKQMP" gene 5186..5872 /locus_tag="DP116_25675" CDS 5186..5872 /locus_tag="DP116_25675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746520.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_25675" /translation="MSGLVSGISTGIVAFTATNIDDIVILLVFFSQVNSTFTRRHIIV GQYLGFTALVIASVPGLFGGFILSPNWIGLLGLIPIAIGISSLVNPEEDSEEAAETEQ SEDSTFANFLSPQTYSVAAVTIANGSDNISVYVPLFASSDFGSFLVILVVFFLLIGVW CYTAYKLTNQQGIAEILTQYGNYLVPFVLMGLGAFIVLKSGALNPVKLLLSCLCLIVL VKDDQRMSQK" gene complement(6187..7282) /gene="acsF" /locus_tag="DP116_25680" /pseudo CDS complement(6187..7282) /gene="acsF" /locus_tag="DP116_25680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318251.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="magnesium-protoporphyrin IX monomethyl ester (oxidative) cyclase" assembly_gap 6894..6903 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 2092 a 1551 c 1528 g 2140 t 20 others ORIGIN 1 aggaataaac tttctattga cttcacataa agtaggctac aggatgactg cgcttcatta 61 tatctgaata ttcatgcatg acaactaact ttccgggtag ttttttttat caaaaaactt 121 aaatttcact taagaagttg acttgatcat agatacagat aagttattag taatgcgtat 181 aatccgcatt cttagttatt tgttgttgtg tcaactgcga tgaaacctct actggtaaaa 241 gatttcagct tcataatatg tctgtttact aactgtacat atgtcatcaa gttgctgttt 301 tgcttgaagt tatagtaaat gagaagatac ttgcccagac taagtaatat acgctctgac 361 acaaggtaag atttatttat tttcaaagta aattgtggat acgatgaaaa gcaaatattg 421 ttctggatta gtaagtagaa actgagtatt aatctagact tattaacttt gcgagtcaca 481 aaaagttata taatcacacc agcaacaggc tagactatcg ctttggaata gcagacatga 541 gtatagagga gtcagagtga tgatgcggct aatgatcgcg attattctgc tgtggctatt 601 aactgcgtgt accacgagtg ttctcgcacc aactagccag ttagtggaaa aagcaattgc 661 acttgaatta cagcaaaccc aacagcaact caaccaacaa ttagatttag attttcaggg 721 atttgaaatc aatcgcctat caattactca gcgaaaagct ttgacagttg agaatttacc 781 cacataccac gttcagggaa cctacaacct cacgttcaaa cttccaaaac gaaaattgac 841 acaacctcaa aaaccttttg aagtttacct gcaactccaa 
agagaaggta aaacttggcg 901 attactggtt cccgaagatc atggcaagga ttctaaacaa gtttggcgca gttattctat 961 ctaatgtctg tgtgtaactc taaaaaccat cacaagcatt tgtgcgtgtc tgttaacccc 1021 tcgccaagat tgaggtgata ccttttgaac acaagacaat atgtatttta acttctttat 1081 cggttccagt tttggaatta cggttctagt gctgttgggt tttagcgtac tgcaatggtt 1141 ccatattcct actggcaact ttcttgactg ggtgattgga ggagcaagtt tttggtggtt 1201 gttggtgatt gtgactgtac cgtggaatgt ttattttcag gcaaaagaag tcttggcaga 1261 agggtcacaa tcaactgaaa aaggaattcc agtggatgag aaacaactgg agtacgttaa 1321 agttttggca cggcgatcgc ttgtcgtcgc tcttgcccta cacttatttt cagctattag 1381 tctttacctt ctcgccgcca ctggtattag tgcagtcggc tatgtcagtt ctggtgctgc 1441 tttgctctta actggattac gcccagctgt cagtgcttat gaatatttgt atgcacgatt 1501 ggttatgatt cgccaagaat ggaaatatcc acgtgaagat gttgttgaac tccgttttcg 1561 tttcaacact ctagaagaaa ccgttaagcg tttagaagaa caacttaatc cagaacagcc 1621 ctactcctta gcagcaaatc agcaaactta tttagaagaa acgcgcaaag aattagccag 1681 aattgccgct agtgtagaag aattacgtgc cacaaatcaa atagaannnn nnnnnnggca 1741 agggaagcac ggggggcgat cgcccaactt tccacagatg gactattttt agatcatgtg 1801 cgcgaaatta ttcgtttttt caaaacagcg tgatactaag ttgcattcat acagagaatt 1861 tttcctctct ctttctctct tgctctgtgt tctctgcgcc tctgcggttt ttttttcttt 1921 gttcgcgtaa gcgtatcccc ttgagacata cgcaacttag tatgaataac ctacaccgac 1981 tctttcagta caatgtaggc ttctggcttc atgatcgcag caaaaaataa agcttatctt 2041 gctgcgattt tgaaatcact gacgcgtcgc ttccaataga cgagagaaat aaaagcgagt 2101 tcgagtcata gactgacgct gaacagaaaa accattcttt tgcaaaattt tcaccacatc 2161 agcttcgcga tgcagataag cgcgagtcgc tttactagca cctggaaaga aactccctat 2221 cttctttaat aaactaaggg cgaaagtttt cggcgcaaaa ctgagaataa tccgtgactg 2281 cgctaaagaa caaaggtgag aaatcatctc atctgccttt tgctgagggt agtgaatgag 2341 aacatccaag caaatcaccg tatggtagct accgctaatc gtttctaaat cctgtacagc 2401 aaatgtcgga ttttcagcat ttgcgagaat ttgtgaggct ttttccttgg cttctcctac 2461 cattttttcc gaaatatcgc tggcatagac ttttgcttca gcttccgcaa gcggtatgct 2521 gagactacct gtgccacatc cagcatcgca gatagataac cctgctaaat 
tgttatcagc 2581 tttcagccaa ctgagtaccg tctctactgt ttgctggtgt ccagtacgga tgtctagctg 2641 cactttgttg acttcgccat caccgtagat acgtctccaa cggtcgaagc ctgtagagtt 2701 gaaatattct ttaacaatcg tcttatcgtc agttacgttc atgaaactca atatctatcc 2761 attccaaagc taaaaagtac catgaccagg aaccaccaga aatatttttc tgcctggtgt 2821 acctggaagc aaaatcttaa gagaacaacc tctgccataa aagttcttgt ctagattcat 2881 ccaaatgcca acgcaataat tgactagcac tagcaccaga gagaattgtg gacaagccga 2941 tcgcacttcg tggtgcaatt gttgagagtg cgatcgccct ctccacatta cagattcccc 3001 acttcaccaa attttgcaca cctaccaata aaggtaaagt cgttcctgat aaagttccat 3061 cctgaagtcg tgcagttccg tttttcactg aaatctggcg agtatcccaa ggatacaccc 3121 catcagccaa tcccagagga gcaagggcat cactcacaag gaaaagttgt tgtttgtgta 3181 tctcttccct ttcatgggaa aacacaggag tgctagcgcg caataaaatt tgcagcatcg 3241 ttggcgagac atgttcacca tcagcaatga aaccacacat cacaccagga tggataattg 3301 ctgctcccaa caatccgctt tcacggtggt gtaatggtgg catagcatta aaagcatgag 3361 tcaccattgt tgcacctagt tcaaaagcac gttgcgcttc tgcagcagtc gcttgggaat 3421 gtcctaaact gacagtgata cccaaagaac gtaaatatgg aatcacctca cccgttggat 3481 ctaactctgg tgctagcgtc atgactttca caagatgagc gtaatttccc aaaaccttct 3541 ttacttcctc aagattaagt ggtaagaggt actcaggtga gtgtgcacca cgcttttgag 3601 gatttaaaaa tggtccttct agatgtactc ccaaaatttt cgcataacct ctatttgcag 3661 acaaagaggg cgagtgagtc aagttgatac aatcagcaat cacagaaagc gcacgctgaa 3721 tattttctac cgaagttgtc accagggtag gtaaaaaagc atctaccccc tcataccata 3781 aataacgaca tatgtctagc atttggcgag agttttctgt tgataaatca ggaaacgcca 3841 atcccaacgc gccgttaatc tgtaaatcta caccacctaa tgaaatccag tcgccaccaa 3901 tatccaacac ccgtaagtca actggcggaa cacgtttaag gactttagac attggcagga 3961 tttgttcaat tatgccctgc gggttaactg aaagaatctg caaatctgtg taaccaggta 4021 ctttagcgtt gataatgtct acagaagtac ctggcaagct gtgtgttgct tcggtcatga 4081 ctcagcacca aagatagtga aatctgacct gattttagag gtaagcggtg gcgatgaatc 4141 aaaaatcctg ttgcaaatgt cagcagatac acacagagtg agatgattgt aaatcggaaa 4201 gtcaaattcc aattttgaat tttgaatttt gttcgcgtag cgtgtccgaa ggacatattt 
4261 tgaattgctt atggctcctc ttgtcggcat cattatgggt agtgattcgg atttaccaac 4321 catgcaaggt gcgatcgcag tttgtgaaga atttggtatt gccacagagg tagcaattat 4381 cagtgctcat cgcactccag aacgtatggt ggaatatgct aaatctgctc accaaagagg 4441 tattaaggtc attatcgctg gggcgggtgg tgctgctcac cttccgggaa tggtagcatc 4501 tttaactcct ctgcctgtta ttggtgttcc tgttcctagc cgtcacttac aaggagttga 4561 ttcgttgtat tctatcgtcc aaatgcccgc aggtatacca gtagcaactg ttgcgatagg 4621 caatgctaaa aatgctggac ttttggcagt acaaattctt gcgactcatc aagcagaatt 4681 acttaacaag gtgcagcagt accgtcaaac cttgtctgaa tcagtcatgg caaagcaaga 4741 aaaactagaa gagcttggtt acgagcaata tttgaaacag atgccatgac tagagtcaac 4801 aattaggggt gtgtaggggc gctgcggcct tgcgcccgta cagggtgtaa gggtgtgatt 4861 ggtgggggca atagggcggt catttgcgtt tttgtagagt tattgaaggc atttttacaa 4921 atctttatat agctggattt atttccgcct gtctactaaa aaaattaaaa aaatagatga 4981 tataggactc ctccaaagat ttttgcgaag cttcgtacac ctttattcct tcttcggtgt 5041 taagagttaa gcgttcgggt gttaagagtt aagcgttccc tgttccctgt ttcctgttcc 5101 ctgttccctg ttccctgttc cctgttccct gttccctgtt ccctgcctca atgagtaagt 5161 tcataacctc ttaacggatt gctatatgag tgggttagtc tctggaatta gcacgggaat 5221 agttgccttc accgccacca acattgatga tattgttatc cttttggtct ttttttctca 5281 ggtgaattct acctttactc gtaggcacat tattgttggt caatatttgg ggtttacagc 5341 acttgttatt gccagtgtgc ctggtttatt tggtggattc atcttgtcgc caaactggat 5401 tggattatta ggtttaattc ctattgctat aggtattagt agtttggtaa acccagaaga 5461 agattcagaa gaagcagcag aaacagaaca atcagaagat tcgacttttg ctaattttct 5521 ttcccctcag acttacagtg tagccgctgt cactattgct aatggtagcg ataacattag 5581 cgtttatgta ccattatttg ctagtagtga tttcggaagt tttttggtaa ttctagtcgt 5641 attctttctg ttaataggag tttggtgcta taccgcatac aaactcacaa atcaacaagg 5701 aatagctgaa attttaactc aatacggcaa ttatcttgtg ccttttgttt tgatgggatt 5761 gggtgctttt attgttttaa aaagtggcgc attgaatcca gtcaaacttc tacttagctg 5821 tctatgtttg atagttcttg taaaagacga tcagaggatg tctcagaagt aatatcatga 5881 tcggatgaac acggtgaagt acctcaacaa gcgcttcgct tctggtgttg aggtttccaa 5941 
tctcaaaagc agttctcatt tgaatcatat acactaccat gcataatgta gagacgtagc 6001 atgtgagtcc agcgctgcgg gagggtttcc ctccgcaggc gactggcgtt agcgcagcgt 6061 gtccgcagga catacccgaa gggctacgtc tctacagtcg tgtattgcac gcatatgcgc 6121 aaagcgcacg cccagagggc tatcgagaag cgctataaca acctattggc aagctgtttt 6181 tcgactttaa caaactgctt gtcgagataa ttcagcatta attggcttaa tgaaataaag 6241 ccgtaacaaa tgcccaataa ttgaaagtat caagggtaac ttttgaaaga acttgagaat 6301 ctttggacta ttactgttgt taatcaacgt cagcttgaga ttaatatcag aacactcttc 6361 cagtcgccgg aaaaactctg gatgatttgt attcaatatc acaggaaatg cccgtgcagc 6421 ggtttcattc gtcttttcaa tcacttggat attgtactcg tatggattga ttccaataga 6481 gcgataaaat gacgccctct ccagtactgt caaagtatga gtcgcgaaca cagacagcag 6541 gaagaaccgc acccacaaac gcgctttcca gttgttccag agttggggtt gcgatcgcaa 6601 cagcgcctta aagaaatctc catgtcgatt ctcatcctga caccaactct cgaaatagcc 6661 aaacagagga tagaatttca actctggatg tttctctagg tgacggaaca taataatgta 6721 gcgccagtag ccaattttct cagagaggta cactgtgtaa ataatccact ctggtgggaa 6781 aaaggtgtag gtacgattct ttgtgagata acccaaatcc agcgagagat taaaatccgc 6841 cattgcttta ttgagaaatc ccgcatggcg tgcttcatcc cgcgccataa aatnnnnnnn 6901 nnnccataaa attaaaagct tctgacaaaa gaggatttcg ctgtttcagt ttgcgcgata 6961 actctttgaa cagaagaaac ccagaaaact ctgaagtgca tgatcgttcc aaaaagtcaa 7021 taaaagcgcg gcgcgtttcc ccgtcaatct gttcccaact ttgctgaaat tcagaatcac 7081 gcacgaagtg atggcggtta taatcagccc gcagttcatc cacaatagct ctcaactggg 7141 cttcatccgc tgaaatgtcc aacttcgcta cagcgtcaaa gtcagtcgtg taaaacctgg 7201 gcgttaacaa tgtttcttgc acaggtgctt tcacaccggg cttttgcaac tcaggctctg 7261 gagtttgaag aggcttagcc atgagaatcg ttcaatcaat ttatcttttt aaagctatat 7321 aagcataaat t // LOCUS NODE_4311_length_7264_cov_5.3857687264 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7264) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7264) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7264 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1176) /locus_tag="DP116_25685" CDS complement(<1..1176) /locus_tag="DP116_25685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315517.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cysteine lyase" /protein_id="PRJNA477356:DP116_25685" /translation="MATLSASPTKLLHHRQLFPGLANKAYFNYGGQGPMPQTALDAIS DAQAYIQQIGPFGTEVNSWIIEETKAARNAIASELSVPAETITLTEDVTVGCNIAMWG LEWHRGDHILLSDCEHPGIIATAQEIGRRFGVELTTCPLMATLNEGDPVEVIAQHLRP KTRLVILSHVLWNTGQVLPIDKIAKLCRGKSVKLLIDAAQSVGLLSLNLTELGADFYA FTGHKWLCGPGGTGGLYVRPEARDSLKPTFIGWRGVILDSKAQPVDWQSDGKRYEVAT SNYPLYTALREAIATHQQWGTPEERYQQICRNSEYLWRKLQVLPDIKCLRTSPPESGL VSFQLTNQTPSTSRQLVMFLESQKLLTRTIADPDCVRACVHYLTLESEIDELVEGIQR " gene 1456..3006 /locus_tag="DP116_25690" CDS 1456..3006 /locus_tag="DP116_25690" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="recombinase B" /protein_id="PRJNA477356:DP116_25690" /translation="MLINAELLLQYQRCKRRPFLDVHGDKSRREPESDLLLKLYQDRI AHHRSILTEFTYQRPVYRQGDWKAGVTATLELMEQGAESIYQGVLVTGYSQTHTLLSR PDILVKHRGESRFGDWMYVPVNIELGKRPKQEYQVVAAFHAYVLAMVQQSELEIAWLM LRGKEAGYSVDLLKWMPQMHRVLEEYIQTLETEEAPEVFISRQRCNLCPWYDYCYTVA QSQKHLSLLPGVTPVRYTQLQALEITTVESLAKTYPTQLENLPGFDSAVAPKLILQAK SILENRPFIIPNLPPKQQYVLNSCTVETSVCLDTTIEIDTEATQLLTPRNDTITTAPV EMFFDIEAQPDLNLDFLLGVLVVDRQAKTEKFYSFLAESPEEEELVWQQFLNLVWQYP DAPIYHFCVYELDTVKRLAKLYQTPQSYVLPVLNRFVDIYKILTQNVALPIESYALKA IARWLGFEWRDPEASGMKCIYWYDQWLEMGDRSLLEIIQRYNEDDCLATRSVKDWLVN FLQKNNGL" gene complement(3007..3402) /locus_tag="DP116_25695" CDS complement(3007..3402) /locus_tag="DP116_25695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017713128.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_25695" /translation="MSHLCDTNIISELTRPMPNSGVTAWSGTVTSINLSVITIEEIYY GLTAKPNARIQNWFENFLFTHCQILPITSEIAKCSGELRGFLRTQGKPRTQADIFIAA TAKIHSLTLVTRNIKDFDGCGISTLNPFS" gene complement(3386..3676) /locus_tag="DP116_25700" CDS complement(3386..3676) /locus_tag="DP116_25700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006105149.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prevent-host-death protein" /protein_id="PRJNA477356:DP116_25700" /translation="MKWTLEEAKQQLPSIINATSLEPQLIYTQEELVAAIVDPELFKE FLNWRQKTAKTSLAQVFKELQQLCTEENYSLEIPARSNRDNPFTEDQDESSL" gene complement(3743..4015) /locus_tag="DP116_25705" /pseudo CDS complement(3743..4015) /locus_tag="DP116_25705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317963.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="toxin-antitoxin system HicB family antitoxin" gene complement(4022..4444) /locus_tag="DP116_25710" CDS complement(4022..4444) /locus_tag="DP116_25710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458494.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative toxin-antitoxin system toxin component, PIN family" /protein_id="PRJNA477356:DP116_25710" /translation="MAIKIVVDTSVFISALISSQGSSRELIRRCLKGEYQPLMGNALF SEYESVIQRTEIIAKCPLTSEEISALLTSLMSVSQWIYIYYLWRPNLKDEADNHLIEL AVAGNAQIIATHNVKDFQNAELLFPNLSILKPEKIIRS" gene 4682..5071 /locus_tag="DP116_25715" CDS 4682..5071 /locus_tag="DP116_25715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008273708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25715" /translation="MDPITSAIVLGIAGNFATDAVKAGYKALKDALTKKYGSESELVE AVNKLEQKPNSEARKATLQEEVESAKALDDSVIVQLAQQLLAKVKEQPGGQQVITQTI NNVKYAATSGTANASISNIEEHGQPQA" gene 5117..6757 /locus_tag="DP116_25720" /pseudo CDS 5117..6757 /locus_tag="DP116_25720" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: GeneMarkS+." /pseudo /codon_start=1 /transl_table=11 /product="S-layer homology domain-containing protein" assembly_gap 6723..6732 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <6733..>7264 /locus_tag="DP116_25725" CDS <6733..>7264 /locus_tag="DP116_25725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012163196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="peptidase C14" /protein_id="PRJNA477356:DP116_25725" /translation="PGVRVALELLTQGCRAKRKWSWMPDFLVTSLFGNCADLRGADLR SANLRSANFSSANLSSANLSSANLSSANFSSANLNSANLNSANLSSANFSSANFSSAY LRDANLNSAYLSSANFSSANLINADLRSADLSSANLRSADLSSANLRSANLINADLRS AYLISAHLSSADLSSCL" BASE COUNT 2119 a 1548 c 1508 g 2079 t 10 others ORIGIN 1 tcgttgaatt ccttcaacca gttcgtcgat ttctgattct agagtcaaat agtgaacaca 61 agcacgtacg cagtcaggat cggctattgt tctggttaat aacttttgtg attctaaaaa 121 catcacaagt tgacgactgg ttgatggtgt ttgatttgtc agttgaaacg agactaaacc 181 actttctggt ggagaagttc gcaaacattt gatatcgggt aaaacttgta attttcgcca 241 caggtactca ctattgcggc aaatttgctg gtaacgttcc tctggtgttc cccactgctg 301 atgagttgcg atcgcctccc ttagcgccgt gtaaagtgga tagttggatg tcgctacttc 361 atagcgtttt ccatcgcttt gccaatctac aggctgtgct ttactgtcta aaataacgcc 421 gcgccaaccg ataaacgtag gtttcaaact gtctcgggct tctggtcgca catacaaacc 481 tcccgtacca ccaggaccac acaaccattt gtgacccgta aaggcgtaaa aatccgctcc 541 caattcagtg agatttaaag acagtaagcc tacagattga gccgcatcta tcagaagttt 601 gacggatttt cctctgcata atttcgctat tttgtcaata ggcaaaactt gacctgtatt 661 ccaaagaacg tgactcaaga tcacaagacg agtctttgga cgtaagtgtt gggcaatgac 721 ttcaacagga tcaccttcat ttaaagttgc cattagagga caggtggtga gttcaacacc 781 gaacctccgc ccgatttcct gagctgtcgc tataatacca gggtgttcgc agtcactaag 841 tagtatatga tcgcccctat gccattcaag accccacata gcaatattac agccgacggt 901 gacatcttca gtgagggtga ttgtttcggc tgggacactt aactcagagg cgatcgcatt 961 tcttgcagct tttgtctcct ctattatcca actgttgacc tcagtcccaa aaggacctat 1021 ttgttggata taagcttgcg cgtcagagat agcatccaaa gccgtttgtg gcattggtcc 1081 ttgaccgccg tagttaaaat atgctttatt tgccaaacct ggaaaaagct gtcgatggtg 1141 aagtaacttg gtaggtgatg cagaaagagt agccatcagt gatcaatgag tcatgacata 1201 ttaattattt ttatacaaaa tttaaaatta tttttgtcaa gtgcctgtcc gtagggcata 1261 ctttttcaag tgcatgcctg taataccatt tcactaaatt ctagatacag atcctctccc 1321 ttgtcctcct tgtcctcctt gtcctccttg tcctccttgt cctccttgtc tacctgaatg 1381 tatttgtgga caataacttc aactttagtt 
atcggctcac taccagaaag taataattct 1441 tgttacctta aaagaatgtt aatcaatgct gaactactgc tgcaatacca acgctgtaag 1501 cgccgacctt ttttagacgt tcacggggat aaaagtcggc gagaacctga gagtgacttg 1561 ctgttgaaac tttaccaaga caggattgct catcacagga gtattttgac agagtttacg 1621 tatcagcgac cagtctatcg acagggagac tggaaagcag gagtgacagc aaccttagag 1681 ttaatggaac aaggggctga gtccatttat caaggggtgc tggtcacagg ttattctcaa 1741 acacatacac tactaagccg tccagatatt cttgtcaaac atagaggaga gtcccgcttt 1801 ggagattgga tgtacgttcc agtcaatatt gaactgggta agcgtcctaa acaggagtat 1861 caggttgtag cagcatttca cgcatatgtg ttggctatgg tgcaacaaag tgaattggaa 1921 atagcttggc tgatgctgcg tggtaaagag gcgggttatt ctgtggatct actcaaatgg 1981 atgccacaaa tgcatcgtgt tctggaagag tatattcaaa ctttagaaac agaggaagcg 2041 cctgaagtct ttatctctcg tcaacggtgc aatctttgtc cttggtacga ttattgttat 2101 actgttgccc aatctcaaaa acacctttct ttgttaccag gagtgacacc tgttcgctac 2161 acacaactgc aagcgctgga aatcacaaca gtagaatctt tggcaaaaac ttatccgact 2221 caattagaaa acttgccagg tttcgacagc gcagtcgcac ccaagctgat actacaagca 2281 aaatctattc tggaaaaccg tccgttcatc ataccgaatt tgccaccaaa gcaacaatac 2341 gtcctaaatt cttgcactgt tgaaacttct gtttgcctcg acacaactat agaaattgat 2401 acagaagcta cacagttgct tacccctagg aatgacacca ttactacagc gcctgtagaa 2461 atgttttttg atattgaggc gcaaccagat ttgaatttag attttctgtt aggagttttg 2521 gtagttgata ggcaagctaa aacagaaaag ttttattcct ttttagcaga aagcccagaa 2581 gaagaagaat tggtttggca gcaatttctg aatttggttt ggcaatatcc agatgcacca 2641 atttatcatt tttgtgttta cgaacttgat acagtcaaac gactggcaaa gctttaccag 2701 actccacaaa gttatgtgtt gcccgttctg aatcggtttg tggatatcta taaaatctta 2761 acgcaaaacg tggcattacc tatagaaagt tatgctttga aagcgatcgc acgatggttg 2821 gggtttgagt ggcgtgatcc agaagccagc ggtatgaagt gtatttactg gtatgatcag 2881 tggttagaga tgggcgatcg ctccttactt gaaatcatcc aacgttacaa cgaagacgac 2941 tgcctcgcaa cccgcagtgt gaaagactgg ctggtgaact ttttacaaaa aaacaatggt 3001 ttgtaatcag ctaaatggat tcaatgtgga tataccacaa ccatcaaaat ccttaatatt 3061 tcttgtaaca agtgtcaaag aatgtatttt agctgttgcc 
gcgataaata tatcagcttg 3121 tgttcgcggt ttaccttgag ttctcaaaaa gcctctcaat tcacccgaac acttggcaat 3181 ttctgaggtg atggggagaa tttgacagtg agtgaagaga aagttttcaa accagttttg 3241 aatcctagca ttaggcttgg ctgttaaacc ataataaatt tcctcaattg taataacact 3301 taagttaatg gatgtcacag taccactcca tgctgtcacc cctgaatttg gcattgggcg 3361 agttaattca ctgatgatat tcgtgtcaca aagatgactc atcctgatct tccgtaaaag 3421 gattatcgcg attgcttcgt gctggtattt ctaagctata attttcttcg gtacataact 3481 gttgaagttc tttaaagact tgagctagag atgtttttgc agttttttgc cgccaattta 3541 aaaattcttt aaaaagctca ggatctacaa ttgcagccac taattcctct tgagtataaa 3601 ttagctgagg ttcaagactt gttgcattaa tgattgaagg tagctgttgt ttggcttctt 3661 caagtgtcca tttcattcta aaaatctcca agttttggaa gtattgaata taaagactgt 3721 ttatatcatt gcactacaag ctctattctg tcagagtatc aagtttagcc agtattctta 3781 acccttcttc tgggttgccc ttcgggttcg cagtcgccta cggagggaga ccctcctgca 3841 gcgctgtctc accagtcgca gccattgctt taaatcttgt atgggcatca aattctgcta 3901 aagctatggt agaaagttct tcaatcagct tattaacact tatgcctttg gcttgagcaa 3961 gttctttcaa tctattatgc ttatcgtctg gtaaacgaat agttaaagtt gccatttttg 4021 tttaactcct aataattttt tcgggtttta atattgataa gttaggaaat aacaattcag 4081 cattttggaa atctttgaca ttgtgagtag caataatttg ggcattccct gcaactgcta 4141 attcaattaa gtgattgtca gcttcgtctt ttaaattagg tcgccataaa tagtagatat 4201 aaatccattg actgacgctc atcaatgatg taagtaaagc agaaatttct tcactcgtta 4261 aagggcattt ggcgataatt tctgttcgct gaatgactga ttcatactca gaaaataaag 4321 catttcccat caaaggctga tattcacctt tcaaacagcg tcgaatgagt tctctactgg 4381 agccttgaga gctaatcagc gcactaataa aaacgctggt atcaactaca atttttatcg 4441 ccatgtcatg atgatagcat atacgctatc atccaatata gagttaaccg catcctccta 4501 gataaaactg ataactatac ataagcgcaa ataactgcgt ctaaaaaaaa agaaacattt 4561 ctggaatcat ctgtttttac aatgaatatt aatggtgaaa gctaccctac cgttgcggtt 4621 aataattcaa aaatcataaa aacgacggct tccaccagta cagataagtt aaggagaaac 4681 tatggatcct attacttcag caattgtttt gggcatcgca ggaaattttg ctacagatgc 4741 agtcaaagct ggttacaaag cgctcaaaga tgctctcaca aagaaatatg 
gttcagaaag 4801 cgagttggtt gaggcagtca acaaattgga acaaaagccc aattcagaag ctcgtaaagc 4861 cacgctacaa gaagaagttg aatctgccaa agcgttagat gactctgtta ttgtgcagtt 4921 agcacaacaa ttgcttgcca aggtgaaaga acaaccaggt ggacagcagg ttatcaccca 4981 gactatcaac aacgtcaaat atgctgctac ttctggtact gctaacgcca gtatcagcaa 5041 tattgaggaa cacggtcaac ctcaagcata acgcttgtca gtactttcat caatcaggag 5101 aaggtgagct gctggcatgg ctgaacaaag cgatatcaaa caagcaatta cggaaacaca 5161 atatgcagcc acttctgcaa caggcagtgc tgttattaat atcactaact attattaccg 5221 tgaggacatt aggacagcac ctgttaaccc tgccaaaact tctgctgatg acaacttgcc 5281 atgcccttat cgcggtctat ttcactttgg tcccaatcat actgaattct tctttggacg 5341 cgaagtcttt atagaagaac ttatacaagc aactgcaaag cggagcttta ttcctgtact 5401 aggtgcttcc ggcagcggga aatcttccgt tgtattagca gggctagtgc caaaattgca 5461 acaacagggg aattggttat ttactcattt ccgtcctggt tctgacccgt tttacgcttt 5521 ggctgaagcg ctggttcctc tttatacaga tttgaatgct actcaacaaa tagcccaagc 5581 acgcgaactg gctgactatt tttgcgctcg caaggttctt cttcctgacg taatcaccag 5641 aatccaacgc aatcattcta atcatcggct gctgttgatt gctgaccagt ttgaggaaat 5701 ttatacttta tgtaacaatg aatccacccg tcggcatttt ctagattgct tgctgagtct 5761 cattcaaact tccacaaacc aattgcctac ggtgctagtt gctactatgc gagcggattt 5821 tttaggcagt gcgctggagt accgtccatt tgctgatgtc ctgcaagatg acatcaagct 5881 gggaccaatg aaccgcaagg aactattgga ggtaattgaa aaacctgccc aaaagctagg 5941 agtcacattg gaaggtggac tcatagaacg cattttagat gatgttgaaa atgaaccagg 6001 tacattaccg ttactggaat ttgctttaac tttactgtgg gaacgtcgaa gtgatagaca 6061 actcactcat gcagcgtatg aagcgattgg tgaagtccaa ggtgccctag caactcatgc 6121 cgataagatt tataacaatt tcgatgctac cgaacaacaa caggtgcagc gaattttcat 6181 gcaattggta cgcccaggcg aaggtacaga agatacgcgt cggatagcca caaaatctga 6241 attgggtgca gccagctggt taagagtgag tcaactggca gatgcgcgat tggtcgttac 6301 cagccgaaat agctctggtc aacaaacagt ggaggttgta catgaagcgt taatccgcca 6361 ctggcagcta ttacaaagtt ggattgatga aaatcgcagt aaaattatcc aaaagaatag 6421 aattgaacgc ttggcgactg agtggcaaca gaataagcag cttaatgatt atttacttca 
6481 aggaaaacag ttaaaagaag cgaaggcgtt ccagaaagag gaattgacaa actttgcctt 6541 atcggacttg gcttgtgagt ttattaagaa aagtatcaaa tacgaacgca gcaggagatt 6601 aaaaacaagt agcattttga caattcctct tctgattgta tttgtagttg ttgctttacc 6661 tataagaaaa gcaaacgagc agcaggcttt gaatacaatt caaaatccta aaaatcctgg 6721 agnnnnnnnn nnatcctgga gtccgtgtag cgcttgagtt gctaacccaa ggatgtcggg 6781 caaaaagaaa gtggagttgg atgcctgatt ttctggtaac ttctttgttc ggtaactgtg 6841 ccgacctcag aggtgccgac ctcagaagtg ccaacctcag aagtgccaac ttcagtagtg 6901 ctaacctcag tagtgccaac ctcagtagtg ccaacctcag tagtgccaac ttcagtagtg 6961 ccaacctcaa tagtgccaac ctcaatagtg ccaacctcag tagtgctaac ttcagtagtg 7021 ctaacttcag tagtgcctac ctcagagatg ccaacctcaa tagtgcctac ctcagtagtg 7081 ctaacttcag tagtgccaac ctcattaatg ccgacctcag aagtgccgac ctcagtagtg 7141 ccaacctcag aagtgccgac ctcagtagtg ccaacctcag aagtgccaac ctcattaatg 7201 ccgacctcag aagtgcctac ctcatttctg cccacctcag tagtgccgac ctcagtagct 7261 gtct // LOCUS NODE_4313_length_7258_cov_5.0533117258 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7258) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7258) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7258 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(259..1041) /locus_tag="DP116_25730" CDS complement(259..1041) /locus_tag="DP116_25730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458831.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_25730" /translation="MSKPLTISSSDLIHSLKLSCQIPDVVEAIATQKIIAEAAQEAGI QVSEQELQQEGDRLRFAKKLVKAQDTWNWLKKHHLSLAEFEELIHNKVLATKLASHLF AQHIERFFYENQLNYVAAVTYEVILDDRDLALELFYSLESGEITFQEIAREYIQQPEL RRAGGYQGMRHRKDFRPEIAAAVFAATPPSLIKPISTSKGVYLIWVEEIVQPILDEQL RDKIQQELFDHWLKEQIQTIEINTNLDLSLDLQGSQELRKQA" gene 1035..1268 /locus_tag="DP116_25735" CDS 1035..1268 /locus_tag="DP116_25735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008185263.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25735" /translation="MTRHDSPRNLNFRLETLKQLSVISYQLSVISYQLSVISYQLPVI SYQLSVTSYQLSVSQRCRRVSLRRRLRTRRVTR" gene complement(1372..2934) /locus_tag="DP116_25740" CDS complement(1372..2934) /locus_tag="DP116_25740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314984.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_25740" /translation="MPNTSPNPSSGVPKPQPDPQKNYIQSAKSAESTESVKQTETTDE TQNWFLGTEELLDALPKVWTRSSLYLLVAFAVIVLPWAMFSKVDETESARGRLEPKGA TQKLDSPVGGSVTVVKVKEGDTVKSGQVLLELDSEVLKTELNQVQTKLDGLQNRRKSS ELLKNQLTVSVQTQQQQNQAQLLAKQSQVDQARQNLDALKAVYNLQKEEKLAKVDQVQ QALNSSKAAQKLAEVRFQASQGKVPRYKKAYEDGVMSQERFKEVEQSAQENYERLVQA QSEIAQTQSSLKEQQTSYQRTIQQAQSEIQQAELRFKEEQRNYQSLVHTNQLAQLKLE EQLKELQSQIDSLQSEISQTQSQLRASKIQLQQRVVRSPINGVIFEFPTTKPGAVLQP GQRVAQIAPLQVGVVLKAQIPNQHSGFLKAGMPAKVKFDAYPFQEYGIVSGKVNWISP DSKVQQTLQGNVENFELEITLNQQYIQSGKKRIQFIPGQSATAEVVIRQRRIIDFILD PFQKLHKGGLNV" gene complement(2998..6075) /locus_tag="DP116_25745" CDS complement(2998..6075) /locus_tag="DP116_25745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314985.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="PRJNA477356:DP116_25745" /translation="MSSEFSQQLCQQIATALDVVVSQQELAGCIAQAEILEPAPTKQF WQTTDGIAGVYIVLRGKVRLLDSAENLITTLGTGSCFAEATLFPEQDFLPYIAKASTN LKLCYLKLEILKLLIDKNPSLGDRLLQRAQLWDLLLRCRQNSQIECPPSDVPEILKGL SYFSQHQLDSNQKISLPKDSKLWLVYKGELRHTNGQSLTPGQIITHSDQGDWQTTEPT IAYDLKETQWQTVIEHWQQLAQLVEPEQDSPAPQEQKVRPRKSKAEQITPSGNVIPFP QRESATTKQQPKKSRAYFPTPKVKVGQAWGQLTKSYPFFAQQSAADCGAACLVMIARY WGKRLNLNRLRESANVSRSGASLSALAAAAENVGFATRPVKASFDKLAQQSLPAIVHW EGKHYIVLYEITSKQVIVGDPAIGQRILSPKEFQAGWTGYALLLQPTSVLKETEEDST GIWRFFELLKPHTWVLLEVFFASVLIQAFGLITPLFTQLLFDRVIVQGSTSTLNAVCI GLIIFGLFSIAVNGLRQYLLAHTANRVSIALLVGFIKHTLRLPLSFFESRYVGDIVSR IQENQKIQNFLTGETLSIILDLLTVFIYATLMFWYNWRMALLGLMTVPPFFILTLAST NILRRMSREIFNAGAEQSSYLIQALTGIRSVRSMAIEHTVRWRWEELLNTLVKKSFSA QIIGNNLQITSSVIDTVMHTVLLWYGASLVINQELTIGQLIAFNMLLGNVLSPFKRLS VVWNQFQEIVISVERINDVLDAEPEEDLQNKPRKPLRHLRGHIRFDNVTFRYHPESQT NILENLSFEIQPEQTVAVVGRSGSGKTTLSKLMLGLYPASEGKVLVDGHDVSSITLKS LRQQIGVVDQATFLFGGTIRENIGIAHPEASLEDIIEAARLAGADEFIQQQAMGYDTE 
IGEGGGMLSGGQRQRLAIARALLGNPRFLIFDEATSSLDAESERIIQNNLKTILQGRT SLIIAHRLSTVRNADLILVLDRGVLVESGTHDELIAKKGHYYYLNQQQLAHAG" BASE COUNT 1925 a 1511 c 1563 g 2259 t ORIGIN 1 cgactgtttc gctgattcta tttatttgta attatgaagt tatcttgaag acgaaagcct 61 caattgattt cgagcaaata ttttttctat ctataatttt tcattcttaa aatcacttat 121 attgctaccc ttttgtaaaa agagttttta attatttgaa atcaaacaaa agcctttcgc 181 ccctttctaa caaaaggaga ataagtcgct atatacaaaa tttgtttaca cagacacttt 241 ttttcagaaa ccgaaaaatt aggcttgctt acgcaactct tgcgatcctt gcaaatcaag 301 acttaaatct aaatttgtgt taatctctat cgtttgaatt tgttccttca accaatgatc 361 aaacaattct tgctgaattt tgtcacgtaa ctgctcatcc aagataggtt gaacaatttc 421 ttctacccaa atcaaataaa ctcccttgga agtcgaaatc ggcttgataa gggacggtgg 481 agtggcagca aatacagccg ctgcaatctc tggtcgaaaa tctttacgat gtcgcattcc 541 ttgatatcct ccagcacgtc gaagttcagg ctgttgaata tattcacgag caatttcttg 601 gaaagtgatt tcgcctgatt cgagagaata aaacagttct agggccaagt ctctgtcatc 661 caagataact tcataagtga cagcagcaac gtaattgagt tggttttcat aaaagaaccg 721 ttctatatgc tgtgcaaaca aatggctagc taacttagtg gcaagtactt tgttatggat 781 caattcttca aattcagcca gagagagatg atgttttttg agccagttcc aagtgtcttg 841 agctttgaca agttttttgg caaagcgtag tctatctcct tcttgttgaa gttcttgttc 901 cgagacttga atcccggctt cttgagccgc ctcagcaata attttttggg ttgcgatcgc 961 ttccacaaca tcagggattt gacaagatag tttcagactg tgaatcaggt ctgaggacga 1021 aatggtcaaa ggttttgaca cgacacgatt ctcctaggaa tttgaatttt agactagaaa 1081 cgttaaaaca gttatcagtt atcagttatc agttatcagt tatcagttac cagttatcag 1141 ttatcagtta tcagttacca gttatcagtt accagttatc agttaccagt tatcagttat 1201 cagtgagcca gcgctgcagg agggtctccc tccgtaggcg actgcgaacc cgaagggtta 1261 ccaggtagga aacggactcg tccacccctt gttccctgcg taggtagctg cgtaggtagc 1321 tgcgtaggta gctgttcact cttgttggta gctgttaaag ttgacttttg actaaacgtt 1381 taaaccacct ttgtgcaatt tctgaaatgg atctaggatg aagtcgatga tacgccgctg 1441 gcggataacc acctcagcgg ttgcactttg tccaggaatg aattgaatac gttttttgcc 1501 actttggata tactgctgat tcaaagtgat ttctaactca aagttttcta cattcccttg 1561 
aagtgtttgc tgaactttag agtctggaga aatccaattg acttttcctg acacgatgcc 1621 atactcctgg aaaggatagg catcaaattt gactttggca ggcatacccg cttttaagaa 1681 gccactgtgc tggttaggta tctgagcttt gagtactaca ccaacttgta atggtgcaat 1741 ttgagcaact ctctgacctg gttgtagtac ggctcctggc ttggtagtgg gaaattcaaa 1801 aatcacacca ttaatgggcg atcgcaccac tcgttgctgc aactgaatct ttgaggctct 1861 taattggctt tgagtttggg aaatttccga ttgcagggag tcaatttgtg attgcagttc 1921 tttgagttgt tcctctagtt tgagttgtgc aagttgattt gtatgaacca agctttgata 1981 attgcgttgt tcttctttga agcggagttc agcttgctgg atttctgact gagcttgctg 2041 aattgtcctt tgataactgg tttgttgctc ttttaagctc gactgagtct gagcaatctc 2101 agactgagct tgcacaaggc gttcgtagtt ttcttgtgct gactgttcca cttccttgaa 2161 gcgctcttgt gacatcacac catcttcata ggctttttta tagcgtggga cttttccttg 2221 agaggcttga aaacgaactt ctgctaattt ctgagctgcc ttgctagaat tgagagcctg 2281 ctgcacctgg tctactttcg ccagtttttc ctctttctgt aagttataga cggctttgag 2341 agcatctaaa ttctgtcgcg cctggtccac ttgagactgt tttgctaaga gttgagcttg 2401 gttttgttgt tgctgagttt gcacagaaac cgtcaattga tttttgagta gttctgaact 2461 cttgcgccga ttttgcagtc cgtcgagctt tgtctgaacc tggttaagtt cagttttgag 2521 aacctctgag tctagttcca gtagaacctg acctgatttt acggtgtcgc cttctttgac 2581 cttgacaacg gtcacacttc cacccactgg actatctaac ttttgggttg cacctttggg 2641 ttcaagacgt cctctagcac tctctgtctc atccaccttt gagaacatcg cccaaggtaa 2701 gacaatcact gcaaaagcga ctaacaaata cagcgaagaa cgtgtccaga ctttaggtaa 2761 agcatctaac agttcttcgg ttcccaaaaa ccaattttgt gtctcatctg ttgtttctgt 2821 ctgcttaact gactcagtag attccgcgct tttggctgac tgaatgtagt ttttttgcgg 2881 gtcaggctgg ggtttcggaa ctccagatga gggattagga gatgtgtttg gcatagtttt 2941 cgttggtggt acacgaaatc agtaatcact taagtgatca ctgttaactg taaaaactta 3001 acctgcatga gcaagttgtt gttgattcag gtagtaataa tgtccttttt tggcgattaa 3061 ttcgtcgtgt gtcccgcttt cgaccaagac accccggtct aacactagaa ttaagtcagc 3121 gttgcggaca gtggaaaggc gatgggcaat aatcagactc gtgcgtcctt gaagaatagt 3181 tttcaagttg ttctgaatga ttcgttcgga ttccgcatcc aggctactgg tggcttcatc 3241 gaaaattaaa 
aatcggggat tacctagcaa ggcgcgggcg atcgccagac gttgacgttg 3301 tccaccagaa agcattccac caccttcacc aatttcggtg tcgtaaccca ttgcttgctg 3361 ctgaataaat tcatctgcac cagccaaacg tgctgcttca ataatgtctt ctaaagatgc 3421 ttctggatga gcgataccaa tgttttctcg aattgtaccg ccaaacaaga aagtagcttg 3481 gtctaccaca ccaatttgct gtcgcaggga tttaagagtg atactactaa catcgtgacc 3541 atcgactaag actttgccct cgcttgcagg atatagaccc aacattaact ttgacaaagt 3601 tgtctttcca gaaccactgc gtcccaccac cgcgactgtt tgttccggct gaatttcaaa 3661 gctgagattt tccaggatgt ttgtttggct ttctgggtga tagcggaaag tgacattatc 3721 aaagcgaatg tgaccgcgca agtgacgcaa aggcttgcgg ggtttgtttt gcaagtcttc 3781 ttctggttct gcgtccagga catcgttgat gcgttccaca gaaatgacaa tttcttggaa 3841 ttgattccac accacgctga gtcgcttgaa agggctgagg acgttgccca acaacatatt 3901 aaaagcaatc agttgtccaa tagtgagttc ttgattaatg actagcgagg ccccgtacca 3961 tagcaatacc gtatgcatca cagtgtcgat gacactacta gtaatttgca gattattgcc 4021 aataatctgt gcgctaaagg attttttgac caaagtattc agcaattctt cccaacgcca 4081 gcgcactgta tgttcaattg ccattgaacg cactgaacga atacccgtga gggcttgaat 4141 caggtaactg ctttgttcag cgcctgcatt gaaaatctct ctggacatcc gacgcaagat 4201 atttgtcgaa gctagtgtca ggataaaaaa tggcggtaca gtcatcaacc ctagaagtgc 4261 catccgccag ttataccaaa acattaatgt tgcataaata aacactgtca acaaatccag 4321 aatgattgac agtgtttccc ctgtcaggaa gttttgaatt ttttggtttt cctggatgcg 4381 agagacaata tctccgacgt agcgggactc gaagaacgat agaggtaagc gtagggtatg 4441 tttgataaaa cctaccaaca gtgcaatgct gacacggttt gctgtatgcg ctaagagata 4501 ttgccgtaag ccattcacag cgatgctgaa caaaccaaat atgatcaacc ctatacagac 4561 ggcgtttaat gtagaggtgc taccctggac aatcactcta tcaaaaagca gttgggtaaa 4621 taagggcgtg attagcccaa acgcctgaat cagcactgaa gcaaagaaaa cttccagtag 4681 tacccaagtg tggggtttga gtagctcaaa aaacctccag atgcctgtgc tgtcttcttc 4741 tgtttctttt agtacagatg tgggttgcag caataaggca tatccagtcc aacctgcctg 4801 aaattccttt gggctgagaa tgcgttgacc aatcgcaggg tcaccgacaa tcacttgctt 4861 tgaggtaatt tcataaagga cgatgtagtg tttgccttcc caatggacga tggctggtaa 4921 agattgttgt gctagtttat 
caaagctggc tttgactgga cgggttgcaa agccaacatt 4981 ttcagcggcg gcggctaaag cactaagtga tgcaccagac cggctgacgt tggctgactc 5041 ccgcagtcga ttcagattta ggcgtttgcc ccaatagcgt gctatcatca ccagacacgc 5101 ggcaccacag tcggcagcac tttgctgtgc aaaaaatggg tagcttttgg ttagttgtcc 5161 ccacgcttgc ccgactttca ctttgggagt tggaaagtag gcgcgtgatt tttttggctg 5221 ttgttttgtt gtcgctgact ccctttgggg aaaagggatg acgtttccag aaggagtaat 5281 ttgttctgct tttgatttcc gaggacgtac cttttgctct tgaggtgcgg gtgaatcttg 5341 ttcgggttct accaactgtg ccaattgctg ccagtgttca atcactgttt gccattgtgt 5401 ttctttgaga tcgtaggcga tcgttggctc agttgtttgc caatctccct gatctgagtg 5461 tgtaataatt tgtcctggtg tcaaactttg accatttgta tgtcgcagtt ctcctttgta 5521 caccagccac aatttactat ctttgggtag cgatattttt tgattgctat ccaattggtg 5581 ttgagagaaa taagataaac cttttagtat ttctggtacg tctgatggtg ggcactctat 5641 ttgcgaattt tgccgacatc gcagtagtaa gtcccatagt tgagcgcgtt gcaggaggcg 5701 atcgccaagg ctaggatttt tatctattaa aagttttaat atctctagtt tgagatagca 5761 aagcttgagg ttcgttgatg ctttggcaat atatggtaga aagtcttgtt ctggaaacaa 5821 ggtagcttct gcgaagcaag aacctgtacc tagagttgtg attaagttct cggcgctatc 5881 taatagtctg actttgccac gaagaactat atacactccg gcaataccgt ctgttgtttg 5941 ccaaaactgc tttgtgggtg ctggttcgag gatttctgcc tgtgcaatgc accctgctag 6001 ttcctgttgt gaaacaacca cgtctaatgc tgtagctatt tgttgacaca actgctgcga 6061 aaattctgat gacattatca ttcctccacc acaataatca gtctttggtc tgaactaaat 6121 tgattgacga ccgtaaacta cactctggtt gagggagcag ttgctgagta ccatacatgc 6181 acctttctat ggtgccttta aagcttgcac aagcttgccg ttgagtgaac tggtgtaaca 6241 taatattagt tgacgtcaaa aaagtgggcg catgaatccg ctgcagctgt cactagtccc 6301 gatgcaaagg cttgacagcc acatacttaa ttagtcctgg gaagctaatt aattgtgtgt 6361 ttgacgtaca gcaccttgga agccgctccc ctactgttat ggggtttagg ggggctattt 6421 ccaggttaat acgcactttg gtttccctat cttccggttt ctggggtaac cagaacgaat 6481 agagtactca aagagttgca aacacatcaa gttagagttt tatttttttt aatgaatgag 6541 aaagctataa agacgcactg actaacagag gtataaaaaa tcttattagt aaatggcagt 6601 aagcatccct gagtaaacac actaaaaatc 
atggctgttg gatcctttaa aaacaacagt 6661 atgttgttaa cattatgttt gacagtatac aatcacttca ttcgcaatta cttactcgca 6721 ctccttttaa gtatgctgca agctaaccac tgtatcttaa atcaaattaa gagctgacaa 6781 agacctaaaa actaatgaat atatgactca gcgaagttgt gagcgaactt tgcttgactg 6841 aattctgttt agaaggaact tccagctaac ttcagaatat gataaacggt taactctggt 6901 ttcaaaccag tatcaatccg ttttttccct tggataacta gaattggctg actacaggca 6961 agactaggct tcttatttgc ttatcaagca tctgttagct tgattttcaa tcaggaatat 7021 gagttatgac gtctcgaagt tgcaaaaaaa acaactccaa gaattaatct agcacagtcc 7081 tccaatgaaa ctacccaacc attaattagt aaatttccgt acctcttgtc aagagttctt 7141 cataataaat tcattttttt ccggttcttc ttaaaaactt tatattaggt ttatcaattt 7201 cttaaattaa gctttgtgtc aaaaaatctt tatatgtaaa aaaaacaaaa ttcatatt // LOCUS NODE_4317_length_7238_cov_5.0309067238 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7238) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7238) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7238 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 453..812 /locus_tag="DP116_25750" CDS 453..812 /locus_tag="DP116_25750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017655802.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25750" /translation="MINLISKAVAQINSLFNQLQVKRFFAVVLAGLLVLTTNVANESS AKDLTNRVNQALERNSSDRPQTIGEWTKEGRETEGAPGERAKRIVGQSVEAVKQFGNV YPDTAERTAETVQDNTK" gene complement(1025..2764) /locus_tag="DP116_25755" CDS complement(1025..2764) /locus_tag="DP116_25755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="single-stranded DNA-binding protein" /protein_id="PRJNA477356:DP116_25755" /translation="MTIKDDLQKLLDILPQDLQQILEKHPQRDSLVEVVLDLGRRPEA RFPNQAEYLGETPVTQEQIDDCIKRVGTFGADNRAGIEQTLHRISAIRNRSGKIIGLT CRVGRAIFGTIGMIRDLVETGKSILMLGRPGVGKTTALREIARVLADDFQKRVVIIDT SNEIAGDGDVAHPAIGRSRRMQVAHPEQQHQVMIEAVENHMPEVIVIDEIGTELEALA ARTIAERGVQLVGTAHGNQIENLIKNPTLADLVGGIQAVTLGDDEARRRGTQKTVLER KAPPTFEIAVEMLERQRWVVHDSVADTVDTLLRGRQPSPQVRTVDENGNVNVTRQLTA INGRGQTLGAEESFAAPARQSNGWRATGQMIPLPTLSSDTQKSSGRTEFDRLLDESFN GYDGYDLDSTTRRAGPNGEDLPLHVYPYGVSRSQLEQVVGVLSLPVVLTKDLDSADAI LALRSHVKNHAKLRQMARARQIPIHMIKSSTLPQITRGLRRLLNMDDPEVNDDREMQL FLHSASDDEMDALEEARLAVEQIVIPKGQPVELLPRSPQVRKMQHELVEHYRLKSNSF GEEPNRRLRIYPA" gene complement(2874..3995) /locus_tag="DP116_25760" CDS complement(2874..3995) /locus_tag="DP116_25760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318126.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="4Fe-4S ferredoxin" /protein_id="PRJNA477356:DP116_25760" /translation="MTHLLNPLRSLKQGHWFKLICGASYQHLPAVRSLTLAYTLAGAD CIDVAADPAVIAAAKEGLQAASGLSEVAGQQGFGLHGNSPLLMVSLNDGEDPHFRKAE FNSTECPKECDRPCERICPAQAIVFNSIKNESSGIISQKCYGCGRCIPVCPYDIIYTR SYVSTPGAIAPLILSSGVDAVEIHTNVGRLTEFQRLWQAILPWVERLKLVAISCPDGD GLIDYLHALYNVIAPIPCALIWQTDGRPMSGDIGDGTTQAAVKLGQKVLAAKLPGYVQ LAGGTNSYTVAKLKAMDLLKRAGGVNSSSSSSSSSIAGIAYGSYARVLLSPILEQLEA REVKNNCVKTTVRLEEEPELLAQAVSLAHSLVSQLKSQQ" gene complement(4210..5700) /locus_tag="DP116_25765" CDS complement(4210..5700) /locus_tag="DP116_25765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017310712.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_25765" /translation="MYRNQGQEYKANILVVDDTPDNLRLLSAMLSEQGYEVRKALNGK MALTACQMVLPDIILLDINMPGVNGYEVCQQLKADERTCEVPVIFISALDDVLDKVKA FDVGGVDYITKPFYGAEVILRIQNQINLRLLQTKLQEKNLLLENALDDLKAAQVKQIQ NEKMVALGQLVAGIAHEVNNPISFIYGNLEYVGQYVQELVRVILLYQQEYPHSTPMIE QIVQEIDLNFLMNDLKNLLGAMNRGADRIRQIVLSLQKFSRLDEAQMKSVNIHEGIDS SLMMLQHRLNETTYRPGIVVVKDYGNLPPVTCYASELNQVFMHILNNAIDALKSTQGH SEWEMSLPECGEMNKSLSNRNHQKFHQIGEDEQKLCDQQMWQLEDGQKIRPLHPTNSP TDEVSSISCVPVIRIRTELTDNNTVKISIADNGLGMDESVRSRTFDPFFTTKPVGKGS GLGLAISHQIVVQKHQGQISCNSLPGQGAEFIIEIPVQHPNTELFT" gene complement(5787..6743) /locus_tag="DP116_25770" CDS complement(5787..6743) /locus_tag="DP116_25770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015173925.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25770" /translation="MAQYPNGVWIWNLSEIRADYLDKLVERKVQRVYLKVFDGKYQGQ PTFWSWQCSPEIIQEFKSRQIEVYGWGYHYGTTDIARQIVKVRQALNCGLDGYIVDVE KEVENTSTHTNVDKLLSALRIIVKEGTFGYTTFGNPRLHPNVPWQILDKYCDLAFPQI YFEKFTFLPTTPEEVKDCLDAYKNLGLKKPILPIWGSESDTAKPATVAELQDYLNNYP GSSIWRIPKEGERGEAWNLTYSGFVLPILRRNLRQGRTGDDVMALQKVLNARGYNAGS ADGNFGPQTEAAVRIFQKEAVLTVDGEVGKLTWTALGGKFDA" gene 6959..7168 /locus_tag="DP116_25775" CDS 6959..7168 /locus_tag="DP116_25775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203197.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycogen debranching protein" /protein_id="PRJNA477356:DP116_25775" /translation="MTTIWVNEQLDPSGMIYACIACCNESQAKDCHESFKQNLTQEQK AWGWVARLRTVNSWDEVPVNALKLD" BASE COUNT 1962 a 1605 c 1532 g 2139 t ORIGIN 1 atcttgctct tcggttgaaa gagcctctac atattcaatg actgtctgta aggatacaag 61 cttgctcatc tgtttacaat agatgatgat tttatgttcc tttattttat ctgtcttgat 121 tatctatagc cgttctgact tcgtttaaga cagtcgcaac cccaaaatcc agaaaaaacc 181 aacgtatctt aaccagatat tcaaatatac tccttgagtt gtgtcccagc aaatccgcac 241 atagtcttat gttgtttaca accaaactat gacttctgtt agaagaataa ttacgttttt 301 acagctagtc ttgacttata ttcaggacga ctcattcgct tgaagatagg ttgattttga 361 ctccatttcc ggagccaacc tgtgattgag gcgagtgtgc tccggttctg aaactgtaaa 421 aagtgttgaa aatcaacgaa tggagtcttg taatgataaa tttgatttct aaggcagttg 481 ctcagatcaa ttctctattt aaccaacttc aagtaaaacg ttttttcgct gtagtcctag 541 ctgggttgct agtgctgaca acaaatgttg ctaatgagag tagtgccaaa gacttaacca 601 acagagttaa ccaagcgcta gagcggaact cttccgacag accgcaaact ataggcgaat 661 ggactaagga aggtcgcgaa accgaaggtg ctcctggtga acgcgcgaaa agaattgtgg 721 gacaatcagt agaagctgta aaacagtttg gcaacgtgta cccagatact gctgaaagga 781 ctgctgagac tgtacaggac aacacaaagt aagcagacaa gtaggcgcga ataaagattt 841 tagaagacag gtgacagggt acagggaaac tgtaacttgt cacctattgc tttttaacag 901 ttatcagttt 
gaagaagaat tcagcaattc tgaattcagt attcagaagg aagaaaaaaa 961 gaattacgac ttctgactcc tgactcctga ctcctgactc ctgactcctt cactgttcac 1021 tgttctaagc tggataaatc cgtaaacgcc gatttggttc ttccccaaag ctattagatt 1081 tcagccgata gtgttctacc aactcgtgct gcattttccg cacttgagga gaacggggca 1141 ataattcgac tggctgtcct ttgggaatca cgatttgctc tacagcaagt cttgcttctt 1201 ccaaggcgtc catttcatca tcactggcgc tgtggagaaa cagttgcatt tcgcgatcgt 1261 cgttaacttc cggatcgtcc atattcagca accgccgcaa gcctcgggta atctgcggta 1321 gggtgctgga cttgatcata tgaattggta tttgacgtgc tctagccatt tgccgtagtt 1381 tggcgtggtt tttgacgtgc gatcgcaacg ccaatatagc atcagcacta tcaagatctt 1441 ttgtcaagac tactggtaaa gaaagaaccc caacgacctg ttccaattgg ctgcggctaa 1501 cgccataggg atatacgtgc agtggcaaat cttcgccatt aggtcccgcg cgtctggtgg 1561 tggagtccaa atcataccca tcatacccat taaaagattc atcgagtaag cggtcaaact 1621 cagttcgtcc cgaactcttt tgggtatcac tagacaatgt cggtaggggt atcatttgcc 1681 cagttgcacg ccagccattt gactgtcgtg ctggtgctgc gaaagactcc tcagcaccga 1741 gtgtttgccc acgaccgttg attgcggtaa gttgacgagt cacgttcaca ttgccgtttt 1801 catccacagt tctcacctgc gggctaggct gacgtccccg taggagagta tcaactgtgt 1861 cagctacgct gtcgtggact acccagcgtt gccgttctaa catttcgaca gcaatttcaa 1921 aggtaggagg tgctttgcgt tctaaaacag tcttttgagt ccctcgccgt ctggcttcat 1981 cgtctcccag tgtcacagcc tgaataccac caactaagtc agcaagcgtc ggatttttga 2041 tcaggttttc aatctgatta ccgtgagcag ttcctactaa ctgaacacct cgttcagcaa 2101 tagtacgagc tgctaaggct tccagttctg taccaatttc atcaataacg atgacttctg 2161 gcatatggtt ttccacagct tcaatcatca cttgatgctg ctgttctgga tgagccactt 2221 gcatgcgccg tgagcgaccg atggcggggt gggcaacgtc accatctccg gcgatttcgt 2281 tggaagtgtc aattatgacg actcgcttct ggaaatcatc cgctaaaaca cgggcaatct 2341 cccgtagtgc agtggttttc cccacaccag ggcgtcctag catgagaatt gattttccag 2401 tttctaccaa atcgcggatc atgccaattg tgccgaatat tgcccgacca acacgacagg 2461 ttaagccgat aatcttacca ctgcggttgc ggatggcact aattctgtgc agggtttgct 2521 caattcctgc ccgattatct gcgccaaagg ttcccactct cttgatgcaa tcatctatct 2581 gttcttgcgt aacgggtgtt 
tcgcccagat actctgcttg attgggaaag cgagcttcag 2641 gacgacgacc caaatccaag accacttcta ctaggctatc tcgttgagga tgcttttcta 2701 gtatctgctg caggtcttgg ggcaaaatgt ccaacaattt ttgaagatcg tctttaatcg 2761 tcatgctttt tatggtgacg ttttttagag atacttggac acttgtagta cgagcagagg 2821 atcaccccgc ccagagaaag tgtccacaac gcgctcataa ggaacagtgt tccctactgc 2881 tgtgacttga gctgggaaac gagagaatgt gcaagcgaaa ccgcttgagc gagcaattct 2941 ggttcttctt ctaagcggac agttgtttta acacagttat ttttcacctc ccttgcttct 3001 aattgctcga gaatgggtga cagtagtacc cgggcgtaac taccgtaagc aataccagca 3061 atggaagagg atgaagagga tgaagaggaa ttaactcccc ctgctctctt cagcagatcc 3121 attgccttca acttggcaac agtgtagctg ttggttccac cagctaattg cacatatccc 3181 ggtaatttag ctgccaagac tttttgccct aacttgacgg cggcctgagt ggtaccatca 3241 ccaatatccc cactcatcgg acgaccgtca gtctgccaaa ttaaagcaca cggtatgggg 3301 gcaattacgt tataaagggc gtgaagatag tcaatcagac cgtctccatc tggacaactg 3361 attgcgacta acttcaaacg ctcaacccaa ggtaaaatcg cttgccacaa tctttgaaat 3421 tctgtcaaac gcccaacatt tgtatgaatt tctacggcgt ctactcctga tgacaggatc 3481 aatggtgcga tcgctcctgg cgttgagaca tatgaccttg tataaattat atcatatgga 3541 caaactggta tgcaacgacc acagccgtag cacttttgtg atatgatccc tgatgattcg 3601 ttttttatac tattaaaaac aattgcttgg gctggacaaa tcctttcaca tggtctatca 3661 cattccttgg gacactctgt agaattgaac tcagctttgc ggaaatgtgg atcttccccg 3721 tcgttgaggc taaccatcaa caggggcgag tttccatgca agccaaatcc ttgctgccca 3781 gcaacctctg ataagccaga ggcagcttgc agcccttctt tggctgccgc aatcacagct 3841 ggatcagcgg caacatctat gcagtcagcg ccagccaaag tgtaggctag tgtcaaactt 3901 ctgaccgcag gcagatgttg gtagctggcc ccgcagatga gcttgaacca gtgaccttgt 3961 ttgagggatc gtaaagggtt taacagatga gtcacatctc tattatgcta tgggacggaa 4021 aaatgtaaga ggtgactcac aaaagttttg atgattaatt ttttgtattt caaaggagaa 4081 aatccttctt ttcgagcttt ccgtgagatt ttgttgaaaa gtgatatttt gggagtttga 4141 tgagaatatt gcttgtgtta cgaaggacga caaatgagca ggggaagata tagtgagtaa 4201 ctttaattac taagtaaaaa gctctgtatt tggatgctgg actgggattt cgatgataaa 4261 ctctgctcct tgccctggta aggagttgca 
actaatttgt ccttgatgtt tttgtaccac 4321 aatctgatgg ctaattgcta atcccagtcc actgccttta ccaacaggct ttgtcgtaaa 4381 aaatgggtca aaggttcttg atcgcaccga ctcatccata ccaagtccat tgtcagcgat 4441 actaatcttc actgtattat tatctgtcaa ctctgtacga atacgaatca caggaacaca 4501 acttatggaa ctaacctcat cagttggtga gttggtaggg tgtaatggtc ttatcttctg 4561 accatcttcc aattgccaca tctgttgatc gcaaagcttc tgctcatctt cccctatttg 4621 gtgaaatttt tgatggtttc tattactgag acttttgttc atttctccac attcaggaag 4681 cgacatctcc cactcactat gcccctgtgt tgactttaaa gcatcgatcg cattgttcaa 4741 gatatgcatg aacacttggt taagttcgct agcataacaa gtgactgggg gcaaattgcc 4801 atagtctttg acaacaacaa tgccaggacg atatgttgtt tcgttgagtc gatgctgcaa 4861 catcatcaaa ctactatcta taccttcatg aatgtttacc gacttcatct gggcttcatc 4921 aagccgagag aatttttgta gcgacagtac aatttgccga atgcgatcgg ctcctctatt 4981 catagcacct aataagtttt tcaggtcatt catcaaaaaa ttcaggtcta tctcctggac 5041 gatttgctca atcataggtg tagaatgtgg gtattcttgc tgataaagca aaatcacacg 5101 aaccagttct tgcacgtatt gccctacata ttcaagattg ccatagataa aactgattgg 5161 attattcacc tcatgggcaa ttccggcaac taactgcccc aatgcgacca tcttctcatt 5221 ttgaatctgt ttaacttgag cagcttttaa atcgtcaaga gcattttcca ataaaagatt 5281 tttttcttga agtttagttt gtaacagacg taagttaatc tgattttgaa ttcgcaagat 5341 aacttctgcc ccataaaaag gcttagtgat atagtctaca cctcccacat caaaggcttt 5401 gactttatct aacacatcat ccaaggcact aataaaaatc actgggactt cacaagttct 5461 ttcatcagct ttaagttgct gacaaacctc atacccgttt actcctggca tattaatatc 5521 cagcagaatg atatcaggca aaaccatttg acacgctgtc agtgccattt tgccattcaa 5581 ggctttgcga acttcataac cttgttcact cagcattgct gacaaaagac gtaggttgtc 5641 tggtgtatca tcaacgacta gaatattggc tttatattct tgaccttgat ttctatacat 5701 ttcaaaaaaa ctgagatggc aacttaatag atattcggca agcaagacag atttcaggtt 5761 agtgtgctga aatctgtctg ttaccatcat gcatcaaatt taccaccaag ggcggtccaa 5821 gtcagtttac ccacctctcc atctacagtt aaaacggctt ctttctggaa tattctgaca 5881 gcagcttccg tttgaggacc aaaatttcca tctgcgcttc ctgcattgta ccctcttgca 5941 ttcagcactt tttgcaaagc catgacatca tctcctgtcc 
ttccttgacg aagattacgt 6001 ctaagtattg gtaatacaaa gccagaatat gtcaaattcc aagcttcacc acgttcacct 6061 tccttaggaa tacgccaaat tgatgaacca ggatagttat tcaggtaatc ttgaagttcc 6121 gcgactgttg caggtttggc tgtgtcactt tcggaacccc aaatgggtaa gattggtttt 6181 ttcaaaccca aatttttgta agcatccaga caatctttaa cttcttctgg agtcgtggga 6241 agaaatgtga atttctcaaa atagatttgt gggaacgcta agtcgcaata cttatccaaa 6301 atctgccaag ggacatttgg atggagtcga ggatttccaa aagttgtata tccgaaagtt 6361 ccttccttga cgattatacg caaagcagaa agaagcttgt ctacatttgt atgtgtgcta 6421 gtgttttcta cttctttttc tacatcaaca atatacccat ccagtccaca gttaagagct 6481 tgtctaacct ttacaatttg tctggcaata tcagttgtgc catagtgata tccccatcca 6541 tacacttcaa tctgacgaga tttaaattcc tgaataattt ctggagaaca ctgccaactc 6601 caaaaagtag gttgaccttg atatttacca tcaaaaacct taagatatac acgctgtact 6661 ttgcgttcaa caagtttgtc cagataatca gcgcggatct ctgagaggtt ccaaatccaa 6721 actccgttgg gatattgtgc catattattt agttcttttt ttttgttaca atgtcaagct 6781 agtaaagtat aagccataga tttatagatt gtctgcggca gtatagtcta gatgctttgt 6841 ctgtaactgc ttcaaaaatt tatctattga ttccagaaga ttaaaaaaaa ttattatgca 6901 attgggtcat taaaatggca tattgtttct tctgcttgag ttttggtgag aatcagctat 6961 gacgactatc tgggtaaatg aacaacttga tccatctgga atgatttatg cctgtattgc 7021 ttgctgtaac gagtcccaag ccaaagactg tcatgagtcc tttaagcaaa atttgactca 7081 ggagcaaaag gcatggggtt gggtggcgcg gttgcgaaca gttaattctt gggatgaggt 7141 tccggtcaat gctttaaagt tggattgaag ttgggagttg ggagttagga gttgggagat 7201 gggagttggg agttgggagt tgggagttgg gagttgag // LOCUS NODE_4318_length_7236_cov_4.7029667236 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7236) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7236) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7236 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(330..402) /locus_tag="DP116_25780" tRNA 
complement(330..402) /locus_tag="DP116_25780" /product="tRNA-Val" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(367..369),aa:Val,seq:tac) gene 564..1175 /locus_tag="DP116_25785" CDS 564..1175 /locus_tag="DP116_25785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315800.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metal-binding protein" /protein_id="PRJNA477356:DP116_25785" /translation="MPSGRTHDRITLWALPFVTGVAFWLTNSGNLTLLVTGGFLFGGL MFGPDLDIYSRQFQRWGVFRFIWLPYQKSLRHRSLLSHGPIIGTTLRVVYLSSLGAIV AIFTLLIVEKLWNMQFNWQMTGETVKFTIAHYSMEFLALFVGLEVGAMSHYLSDWGGS AYKRFQKQGVRGLLPRAKMKKRKVTSRGNGSKVKPKNSRRTKS" gene 1411..2241 /locus_tag="DP116_25790" CDS 1411..2241 /locus_tag="DP116_25790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315799.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleoside triphosphate pyrophosphohydrolase" /protein_id="PRJNA477356:DP116_25790" /translation="METNQTERNAETLAAIQKLIDVVAKLRSPDGGCPWDLEQTPQTL TPYVIEEAYEVVDAIKSGDQDAIAEELGDLLLQVVLQAQIASEDQQFSLKEVAEGISQ KLIRRHPHVFGDVSVQSVDDVRRNWEEIKAAEKGEASADTQKLSYKLSRYARKLPPLA ATMKISQKAAAVGFEWENIDGVWDKFNEELTEFKQALAEETPERQQAELGDLLFSLLQ VARWCNLDPEAALQGTNQRFIQRLQKMEANAERPLTDYSLEELETLWQEAKARLAQEE " gene 2307..3617 /locus_tag="DP116_25795" CDS 2307..3617 /locus_tag="DP116_25795" /EC_number="1.2.1.41" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874820.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutamate-5-semialdehyde dehydrogenase" /protein_id="PRJNA477356:DP116_25795" /translation="MTISAENSLNLIAVAKKTRLAALKLAVLSTEAKNQAIEAIAQSL ESAKDEILQANVADCERATTDGIAKPLYKRLQLDEHKLRDAIAGVRDVGKLSDPVGAV QIHRELDQGLILKRVTCPLGVLGVIFEARPEAAIQIVTLAIKSGNGVIFKGGKEATRS CEAIVKAIKQGLSQTAVHPDAVQLLTTREEILELLRLDKYVDLVIPRGSNSFVRFVQE NTRIPVLGHADGICHLYIDKAADLEKAIAITVDAKTNYPAACNAIETLLIHASIAAKF LPKVAEVLGKLNVELRGDERTREIVPKIATATEEDWETEYSDLILSIKVVDSLEDAIA HINNYGSKHTDAIITEDAEAAQTFLALVNAAGVYHNCSTRFADGFRYGFGAEVGISTQ QMPPRGPVGLEGLVTYKYQMTGNGHIVATYTGADAKPFTFKDLG" gene 4489..4854 /locus_tag="DP116_25800" /pseudo CDS 4489..4854 /locus_tag="DP116_25800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013322994.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 4934..6373 /locus_tag="DP116_25805" /pseudo CDS 4934..6373 /locus_tag="DP116_25805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746978.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" gene 6416..>7236 /locus_tag="DP116_25810" CDS 6416..>7236 /locus_tag="DP116_25810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_25810" /translation="MADTAPVMIWMTGTDKLCNYFNKSWLEFTGRALEQELGNGWAEL LHPDDLQLCIETYTTAFDTRRSFTMEHRYRRFDGEYRWLFDTGIPRFHSNGSFAGYIG SGIDITEQKQMLQALQDSEARLRLTLDAARMVTWDWNITTNNVVDYNPMEPLCDLPLG CNEHTFEAFLDAVYFEDRDRVVAAIKDAIDNKKDYEVEFRLVSPDNALHWIGNKGQVY YDQTGKPVRMVGVGMDITDRKQSEQKIREQATWLDVATDAIIVRDLENKIVFWNR" BASE COUNT 2114 a 1402 c 1744 g 1976 t ORIGIN 1 tacgccttga actgaagttc aaggcttata gccaaagtcc gttaaaacgg actggtatgt 61 atattcagta agctttagct tacttgagat ttgagccaag aaatttattt cttggcggac 121 ggaaaagctg gtgcaaaatc ttagacgagc gatcatcgcc ctacatctac attcaacaaa 181 aaacttactt actgaagctt tgaagattat cccttaattg ttgatgaatg ggttggcgta 241 gccagccaca ggaatcgccc caaatgagca attactcact agaaacgacc cagcatagaa 301 atttactgag cttacttaac ttttaaaaat gggcggtact ggactcgaac cagtgacatc 361 ctgcttgtaa ggcaggcgct ctaccaactg agctaactgc ccgctgttgc acacaaaact 421 tatgttagca cacccatttt gaaaaagcga tagtcttata aaaaaatttt tgccttattc 481 tgcatctgtt ggattgaatt gcgactcgga cagcaagata ctggttagca atggcttgga 541 tatttttgaa ttttgaattg attatgccct ctggtcggac tcatgatcgc attactctct 601 gggctttgcc gtttgtgact ggtgttgctt tttggctaac caacagcggt aacttgactt 661 tgttggttac tggtgggttt ctgttcggag ggctgatgtt cggtcctgac ttggatattt 721 actctcgcca gttccaacgc tggggtgttt tccgttttat ttggttacct tatcaaaaaa 781 gtttacgcca tcgttctttg ttatctcacg ggccaatcat tggtacaacg ctgcgtgtcg 841 tttatctgag cagcttggga gcaatagtgg caattttcac tttgttgatt gtcgaaaagc 901 tgtggaatat gcagttcaat tggcagatga caggggaaac tgtaaaattc acgatcgctc 961 actacagtat ggaatttctt gctttatttg tcgggctgga agttggtgct atgagccatt 1021 atcttagtga ctggggaggt tcagcttata agcgttttca aaagcaggga gttcgtggat 1081 tgctgccgcg tgcgaaaatg aagaaacgta aagtgacgag tcgtggtaat gggagtaaag 1141 tcaaaccaaa aaatagccgc cgcacaaagt cataggtaaa ttgtggggta ggatcggagc 1201 ctgctgatta aactcgtttc cagcctcagg ctggaaatgc cagcgacgag gctctgcctc 1261 ctatctgaaa taccggaggc ggagcctcct agtagcgcat 
tccctggctg agccagggaa 1321 cgagaaatat gttgttacac atttgggatg ctcccaagaa taagattttc aaccgcagag 1381 gacgcagagg acgcagagct tttattaatt atggaaacta atcaaactga gagaaacgct 1441 gagactttgg cagcaataca aaaattaatt gatgtggtgg caaagttacg ttctcctgat 1501 gggggttgtc cgtgggattt ggagcaaact cctcaaacgc tgacaccata cgtcatagaa 1561 gaagcttacg aagttgtaga tgcgattaag agtggggatc aagatgcgat cgccgaagaa 1621 ttaggtgatt tactcttaca agttgtcttg caagcacaaa tcgccagtga agatcaacaa 1681 ttttccttga aggaagtcgc cgaaggaatt tcacaaaagc tgattcgccg tcatcctcat 1741 gtttttggtg atgtgtcggt gcaaagtgtt gatgatgtgc ggcgaaattg ggaagaaatt 1801 aaagctgctg agaaaggaga agcttctgcg gatactcaaa aactgagtta taaactcagt 1861 cgttatgcac gaaagcttcc tccgttagca gcgacgatga agatttcgca aaaggcggct 1921 gctgttggat ttgaatggga aaatattgat ggggtgtggg acaagtttaa tgaggaattg 1981 acggaattta agcaggcgtt agctgaggaa acaccagaac gacaacaagc tgagttgggt 2041 gatttgctat tttctcttct acaggttgct cgttggtgta atcttgatcc ggaagctgct 2101 ttgcagggaa caaatcagcg gtttatccag cgtttgcaaa aaatggaggc aaatgctgag 2161 cgtcctctta ctgattacag cttagaggaa ttagaaacgc tgtggcaaga agcaaaagcg 2221 cgacttgcac aggaagagta aaaatctggt ttcgggttat ttggaaccac agataaaaac 2281 agataatttc aacgtagttt tttagtatga ctatttctgc tgagaattca ttaaacctaa 2341 ttgcagttgc taaaaaaacg agacttgctg cactcaagct ggcggttctt tcaactgagg 2401 cgaagaatca agctattgaa gcgatcgccc aatccttaga atctgccaaa gatgaaattt 2461 tacaagcaaa tgttgctgat tgtgaaagag ccactacaga tggaattgct aagcctcttt 2521 ataagcgttt gcagttagat gaacataagt taagagatgc gatcgctggg gtgagagatg 2581 tcggcaaact cagtgatcca gttggtgcag tgcagattca ccgtgaactg gatcaaggct 2641 taatcttgaa gcgagtcact tgtcctttgg gtgtgttagg tgtcattttt gaagcacgtc 2701 cggaagcagc aattcaaatc gtgactcttg ctatcaagtc gggaaatggt gtgattttca 2761 aaggtgggaa agaagcgact cgttcttgtg aagcaatcgt caaggcgatt aaacaaggac 2821 tttctcaaac tgctgttcat ccagatgcgg tgcagttgtt aacaacacga gaagaaattt 2881 tggaacttct aagattagat aaatatgtgg atttagttat tcctagaggt tctaactctt 2941 ttgtgagatt tgtgcaggaa aatactcgta tacctgtatt aggtcatgct 
gatggaattt 3001 gtcatctata tatagataaa gctgcagatt tagaaaaagc gatcgcaatt acagttgacg 3061 ccaaaacaaa ctatcctgcg gcttgtaatg cgattgaaac tttgctaatc cacgctagta 3121 ttgctgcaaa gtttttaccg aaagttgctg aggttttggg aaaactcaat gtggaattga 3181 gaggggatga acgtactcgc gaaattgtgc caaagattgc aaccgcaaca gaagaagact 3241 gggagacaga atactctgat ttgattttgt cgataaaagt tgttgattcg ttggaggatg 3301 cgatcgccca cattaacaac tatggctcca agcacactga tgctattatc actgaagatg 3361 cagaagctgc ccaaactttc ctagcgctgg tgaatgcagc cggagtttat cacaactgtt 3421 ctacccgatt tgctgatggc ttccgctatg gtttcggtgc agaagtcggg attagtactc 3481 aacaaatgcc tcctcgtggt cctgttggtt tagaaggttt ggtaacatac aaatatcaga 3541 tgactggtaa tggtcatatt gtggctactt acacaggtgc agacgccaag ccctttactt 3601 ttaaggattt gggatgaata agcgctcaca aaatactatg atgacatttg tcagacgtta 3661 accaaaaata tagaagattt gatgaacgaa agtgaagata atttgatttg tcaaacacaa 3721 tcatctttgt ctagctgagc ttgaacagga gtaccttttc ttacgacata cagcgctttg 3781 catctaaatg gagtacaccc ctcgctaacc ctcacgcggg tgggatatct ctgtacttca 3841 ccaacttgca atttgctgta ttattattca tccccaattg aagtgaatac gtacagtact 3901 gctataattt gacttataca gaaaagccga atatctcttt aaaaagctac gcgtgaatat 3961 tgtactgtgg attttggcta aagttgcttt gatccatacg ggtgcttctg ccttgaaagt 4021 atacagccaa aaccttgtca gtgatgaaaa ctgacaaacc ataaaactta ttaaaaatct 4081 tatcaggggt aggctgaaaa ccccttgcct taagtacggg gacgccacat gcctacggag 4141 ggaaaccctc ctgcagcagt ggctcatgaa agccacgagg cactgtcttg aaaagttacc 4201 tgcggaggga aaccctcccc cagtactttt cgctttttaa agtgcctgaa tcttctccgg 4261 agttagtatt aggagcgttg tatactaatt gaaaacgagg aggtgatttt attgccaaaa 4321 gtttatggct gtcagcaagt attattgaac cgttcattac ttattcatcc ggttcttgaa 4381 tatctttgtt ctcaagctca caggttaacg aattgtggaa tatattacgg tcgtcaggtt 4441 tggttcaagg agcgtaggta tttaaagaag tttgacctga taaacgagac gtacaaatta 4501 aaagagcgaa ttaaacagtt atgtgaaaag tacggcatca attttattga aacagaagaa 4561 tcgtatacat ccaaggcgtc ttttttagat ggtgacgaac taccgacatt tggtgcaaaa 4621 cccgaagggt ggaaaccgtc aggacgacgc acaaagcgtg gattgtacag aacagggtcg 
4681 ttccagacta tcaacgcgga ctgtaacgga gccgcgaata tattacgcaa agtagccaca 4741 acactgagtt tagaacttgg tcgagtgagt aggggcgctt tgacacgtcc cacccggatc 4801 aaagtctggg tgccagctaa aaagcgaagc acggcggctt tagccccgtg ttgagcatcg 4861 tttagaatcc ccgtctctag cggggagtgt caactactgc tttaagcaaa gctcgtgcat 4921 aatactcaag gcaatgaaag acgaggataa gacaaaaaag cagttgattg aagagttaac 4981 tgcactacgc caaaactttt ctcagttaga agaattagcg gctcaacaac agcagaccca 5041 agaggcgctg cgacagcaac acgaatggga acgacgccaa gttcaaaaag agcaggctct 5101 gaatcgagtg attcaaagta ttcgtaattc tctggattta gaaacaattt ttactaaagc 5161 agcttatgag attactcagc ttgtgagcgc tgatcgagtt gaaattgtgc aatatatccc 5221 tgaacgccaa ctgtgggtga atgtcgctga ttatcgtcgg actccagatt taccaagtgc 5281 tttgggggtc gaaattcccg atgctgataa tgaaatagct acccgactta agcgattaga 5341 agttgttgaa attgaagatg cgagtacttg ttcagatgaa atcaaccgta gctttgccga 5401 aacttatggg ggagcatggt tgctggtacc ggtgcacttt ggctctacta tttggggtag 5461 tcttagcctg aaaagaatta accacccttt atcttggcag caggaggaag tggaattaac 5521 ctgtgcagtg gcggatcaag tggcgatcgc cattcagcaa tccaccctct tgttgcagct 5581 acaaaccgaa cttaccgaac gcaaactggt agaagctact ttgcaagaaa agcagtattt 5641 tattgagcgg attgttgaaa caagccccga tatcatctat gtctacgact tagttgaaca 5701 gcgcaacatt tatataaaca gacaaatgct tgagattctt ggctacgctt cccaagaaat 5761 taaagccatg agagaggcag tattgccaaa tctcgttcat ccagatgaca tgggacgagt 5821 gagcgaacat ctcaaacgct ttgaaactgt taaggatggt gaacttctcg aaatagagta 5881 ccgaatgaga gatgtcaatg gtgagtggcg ttggttgcgt agtcgagaga ctgtgtttgc 5941 gaaaaatgca gacggtttgc ctgtgcagat tttgggcata gccggtgaca ttaccgaacg 6001 caagctagca gaagcagcca tttcctttca agcgcatctg ctatcagcgg tggaacaggc 6061 tgtgattgct acagatttgg acggaagaat tatatactgg aatcgctttg cagaaaaact 6121 ctacggctgg tcagcgttgg aagtcatcgg tcaaaatgtc ttagaagtcg ttactgctga 6181 gacttcacaa gagcaagccg ttgaaattat gtcccaccta cagcgcggtg aatgttggtc 6241 tggggaattt ctggttcggc gtcgagacgg cactttgttc ccaatcttaa ttactgattc 6301 gcccatatac gatgataaag gcgtgctagt tggcattgtt ggcatttcta ttgacatcac 6361 
tgaacgcaaa caggctatta cagcgctgca agaaagtgaa gaacgatttc gcactatggc 6421 agacacagcc ccagtgatga tatggatgac tgggactgat aagctttgta attattttaa 6481 taagagctgg ttagaattta cgggacgcgc actagaacaa gaactaggta atggttgggc 6541 agaacttctc catcccgacg atttacaact ttgcatagag acttacacaa ctgcatttga 6601 tactcgccgc agtttcacta tggaacatcg ctacaggcgt tttgatggtg agtatcgttg 6661 gctatttgat acaggcatcc caaggtttca ttcaaacggt agttttgctg gttacattgg 6721 ttccggcatt gatatcacag agcaaaagca gatgctccaa gcattgcaag acagtgaagc 6781 acgattgagg ttaaccctag acgcagctcg catggtgact tgggactgga acattacaac 6841 caataatgtt gtggattata acccaatgga accgctctgt gatcttccgt taggctgcaa 6901 tgagcataca tttgaagctt ttctcgatgc tgtttatttt gaagatcgcg atcgcgtcgt 6961 tgctgctata aaagatgcta ttgacaataa gaaagattat gaagttgaat ttcgattggt 7021 atcacctgat aatgcgctgc attggatagg aaataaaggg caagtctact atgaccaaac 7081 tgggaaacca gtacgtatgg tcggtgtagg aatggacatc accgatcgca aacaatctga 7141 acaaaaaatc cgcgaacaag cgacttggct tgatgtcgcg acagatgcca ttatagtcag 7201 agatttagaa aacaaaattg tcttttggaa cagaag // LOCUS NODE_4348_length_7159_cov_5.0261827159 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7159) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7159) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7159 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..523) /locus_tag="DP116_25815" CDS complement(<1..523) /locus_tag="DP116_25815" /inference="COORDINATES: protein motif:HMM:PF13620.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25815" /translation="MSQNTKSQNRKKFNFTKWGFILAIVGTVATVATVPEIRGLIRLQ PEVGVVQKQEVELITQTEQGEVLPGVKLRVSSKGAPEMKQTDNNGYAKVQIPSKGDVA VNLSKAGYPTQNFTINLENEQSTTRVINLSKSGDPEVKSLASAPPTSSSNQLGTSAAS SIATKKTNNSYSFD" gene complement(859..1845) /locus_tag="DP116_25820" CDS complement(859..1845) /locus_tag="DP116_25820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_25820" /translation="MANKSNNQETFTLVLSLLITLGLLGFGVWFFRNSLPFASKQQTQ PSANSSEQQAATSQPSATFDTSNLDTSLPNPSVLTIDGSVTIVALIKQLQIAFNPVNP SLPTTYGLPDGSPNGTNKGIQNLRDGKVLIAASSRPLKSDEAQAGLVQVPIARDALAI AVGVNNPYKGELTMEQLKGIFQGKITNWSQVGGPNLPIKVINRSPDSGTYTFFQEVVL LEESFAPDSTNFTTIKKDETTSLLRALGDNGITYSTVSQLENQKTVRIVSINGISPTD QTAIKNSTYPISRVVYLVVPRKTSPGAKQFIDFAISPSGQQIVQRVGFIPLK" gene complement(2075..3523) /locus_tag="DP116_25825" CDS complement(2075..3523) /locus_tag="DP116_25825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314527.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="cobyrinic acid a,c-diamide synthase" /protein_id="PRJNA477356:DP116_25825" /translation="MALVIAGERSGVGKTTVTLTLLASLCQRSTQVQSFKVGPDYIDP MFHQHVTGRPSRNLDPVLTSEEYIKQCFTYNSQVCEYTLIEGVMGLFDGVTPPQPLTF LRGGANDFASTAHIARLLNLPVVLVIDCSRLSGSVAAIAHGYCSFDQEIKIAGVVLNR VGSDRHLSLLKDALEPLQLPILGVLRREENITIPDRHLGLVPTAELPQLQALIDRLAH LGNTCFDWERLLPLLQTRGAGEQGSRGAGEQRSRGAEEQGRSSLPIFSPTPVRVRIAV ARDRAFNFYYQDNLDLLQQLGAELVFWSPLEDPGLPQDVQGMYFGGGFPEVFAQQLAQ NSTTRDAVKTAILSGMPTIAECGGLMYLCEEIVDFEEKSWSMVGVLPTTAVMGGRLTL GYRRAVALQDGLLVPAGTTVYGHEFHRSRLIPTPNSPLFQTYRYDCEESTGYEGWNLL PSLHASYIHQHWGESLDIPKRFLQKCCVFKFT" gene complement(3963..5342) /gene="opcA" /locus_tag="DP116_25830" CDS complement(3963..5342) /gene="opcA" /locus_tag="DP116_25830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876417.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glucose-6-phosphate dehydrogenase assembly protein OpcA" /protein_id="PRJNA477356:DP116_25830" /translation="MTSQAPTIFSLQAPKDVSLTDIEAELSRIWQSYGIAGEDGALPA ATRATTFTLIVYEPEETQYLLAALGLYKGPIDGIFGPQMEAALREVQKTHGLTETGTA TEETLAVLREELSKRHGAATAENGSGSASYTPDASSSPRIADEIAIRNPCRIIALTPI AGEDVGVKAQVSAYCPIQKQSASTLVCCEYITLTGTAAALERVGGMIQALLIGGLPKF LWWKATPDLNNPLFKRLSAVCNNVIVDSCNFNKAETDLLNLQELVESDIPLADLNWRR LSGWQELTAEAYDAPQRRAALVEIDRVNIDYEKGNSVQALMYLGWLASRLQWHPVSYQ KETGDYDITKIQFVAQDQRQIEAELAGVPVGDAGEIPGDLIALRLSSTNPQANCGTLI CSETGGCMRMETQGGAQASGVFQQVTSLSEQKAEVLLSQQVQRWGHEALFEESLAQTA HMLKLEAKS" gene complement(5438..6967) /locus_tag="DP116_25835" CDS complement(5438..6967) /locus_tag="DP116_25835" /EC_number="1.1.1.49" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009343330.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glucose-6-phosphate dehydrogenase" /protein_id="PRJNA477356:DP116_25835" /translation="MVSLLENPLRVGLQQQRMPEPQIIIIFGASGDLTWRKLVPALYK LRQERRVPPEITIVGVARRDWSHEYFREQMRKGMEEAHGGVGAEELWQDFSQGLFYCS GDIDKPESYQKLKNFLGELDEKRGTRGNRMFYLSVAPNFFPEAIKQLGSGGMLEDPYK HRLVIEKPFGRDLASAQSLNRVVQKYCKEEQVYRIDHYLGKETVQNLLVFRFANAIFE PLWNRQFVDHVQITVAETVGVEDRAGYYESSGALRDMLQNHLMQLFCMSAIEPPNSLD ADSIRTEKVKVLQATRLADVQNLHRSAVRGQYSAGWMKGKPVPGYHEEPGVNPNSTTP TYVAVKFLVDNWRWQGVPFYLRTGKRMPKKVSEISIHFREVPSRMFPSAAQQMAGNIL AMRIQPNEGISLRFEVKMPGAAFRTRSVDMDFTYGSFGIQATSDAYDRLFLDCMMGDQ TLFTRADEVEAAWRVVTPILSVWDTPSDPASVPRYEAGTWEPAEAELLINQDGRRWRR L" BASE COUNT 1914 a 1617 c 1588 g 2040 t ORIGIN 1 catcaaaact atatgaatta ttagttttct ttgtagcaat tgaagatgca gcactagttc 61 caagttggtt tgaagatgaa gttggaggtg cagaagcaag tgatttaact tcaggatcac 121 ctgatttgct taagtttatt actctggttg tactttgttc gttttctaaa ttaatcgtaa 181 aattttgagt tgggtaacca gctttactta aattgacagc aacatcacct ttactaggaa 241 tttgaacttt agcataacca ttattatcag tttgtttcat ttcaggcgca cctttagaag 301 agactcgaag ttttaccccc ggtaaaactt ccccttgttc agtttgtgta atgagttcaa 361 cctcttgctt ttgcactaca ccaacttctg gttgcaacct aatcaaaccc ctgatttctg 421 gtactgttgc tactgtagca acagtaccaa caatagctaa aataaagccc catttagtaa 481 agttaaactt ctttctgttt tggcttttgg tgttttggct catggttttg ttctggattg 541 agttgttatt ttacctgtca tacagttagc gataaggcgt acgctcaatt tcttaaccag 601 cgacgaagag cttggctaag atgtcaaatg acgtctaagt atttattatg cagtatcttt 661 cgcagtagtc cgagaaaatc cggataaata aatttttgat tcttaaagat ttatttactt 721 tttgctcagt aaatctatat agcgggttgc aagcgagcga ggtacagtaa aaacaatgag 781 cgagcagtac atactttctt tgaaaagaaa agcatattca ccagacaatc gtgataaata 841 ataaactgct acttcaacct actttaatgg aataaaacct actcgctgga caatttgttg 901 tccactaggg gagatagcaa agtcaataaa ttgttttgcc cctggactgg ttttacgcgg 961 tacaactaaa taaacgactc gactaatagg ataagtacta tttttaatgg cagtttgatc 1021 agtaggagaa attccattta tagaaacaat acgaaccgtt ttttgatttt ctaactgaga 1081 aactgtgctg taagtgatgc cattatcacc 
taaggcacgt aatagcgaag tcgtttcatc 1141 tttcttgatt gtggtgaagt ttgtgctatc aggtgcaaat gactcttcta agagcacaac 1201 ttcctgaaag aatgtatatg taccactatc tggagagcga ttaataactt ttattggtag 1261 gtttggacct cctacttgtg accagttagt gatttttcct tgaaaaatcc ccttcaattg 1321 ttccatagtc aattcccctt tatatgggtt gttgacgcct actgctatag ccaaagcatc 1381 gcgggcaatt ggaacttgta cgagtccagc ttgtgcttca tcagatttta ggggacgtga 1441 actcgctgct atcaacactt tgccgtctct taagttttga atccctttat tggtgccatt 1501 gggactacca tctggtaagc cataagttgt gggcaaggaa ggattcaccg gattaaacgc 1561 aatttgcagc tgcttgatca aagcgactat cgtgacgcta ccatcaattg tgagcacgct 1621 tgggttgggc agactggtat ccaagtttga tgtgtcaaaa gttgcactag gttgggatgt 1681 ggcagcttgc tgttcagaag aattggcaga aggttgagtc tgttgtttgg acgcaaaagg 1741 taagctattc cggaagaacc acacaccaaa ccctaaaagc cctagagtga tgagcagaga 1801 tagcaccaag gtgaaagttt cttgattatt acttttatta gccataagga atatcaatta 1861 ttgggacttt gtaaattagt atggtctagg tggtggttag tattccctag aacgccgctt 1921 cagctatcaa caaatgcgcc gaatgaaccg ccatctctat gctacaaaag tctttacgta 1981 tgtggtgtga tcgcttcttc aatctcaatg tttgcaaatt catcagattc aagctttttg 2041 cgatattcct attgagttaa ggcgttaacg atttttaagt aaatttaaag acacaacact 2101 tttgtaaaaa tcgcttggga atatctagac tctctcccca atgttgatga atgtaggaag 2161 catgtagaga tggaagcaaa ttccatcctt catatcctgt ggattcttcg caatcgtaac 2221 gataagtttg aaacaagggt gaattgggag ttgggattaa ccgggaacga tgaaactcgt 2281 gcccgtaaac ggttgtacct gcaggtacta acaaaccatc ctgtaaagca actgctcgac 2341 gataccctaa ggtaagacgc ccacccatta cggctgttgt gggcaatact cctaccattg 2401 accaagattt ctcctcaaaa tcgacgattt cctcgcatag atacattaat ccgccacatt 2461 cagcaatggt aggcattcca gagagaattg ctgttttcac tgcatcgcgg gtggtgctgt 2521 tttgggctag ttgttgggca aagacttctg gaaaaccacc accaaaatac atcccttgca 2581 cgtcttgtgg taatccaggg tcttctaaag gactccaaaa aaccagttct gcacccaact 2641 gctgtagcaa gtcaagattg tcttggtagt agaaattgaa agcgcgatcg cgcgctacag 2701 caatccttac acgaacaggt gtaggtgaaa agataggcaa agaacttctc ccctgctcct 2761 ctgctcccct gctcctctgc tcccctgctc ctctgctccc 
ctgctcccct gctccccttg 2821 tctgcaacag tggtaacaag cgttcccagt caaagcaagt atttcccaaa tgggcaagtc 2881 ggtcaattaa agcttgtagt tgaggaagtt ctgctgtagg tactaaaccc aggtgtcgat 2941 ctggtattgt aatattctcc tcacgccgta gcaccccaag aatcggtagt tggagaggtt 3001 ctagggcatc tttgaggagg gagagatggc gatcgcttcc cactcgattt aacaccaccc 3061 cagcaatttt gatttcttgg tcaaatgagc aataaccgtg tgcgatcgca gccacagaac 3121 cagacaaccg actgcaatct atcaccaata cgacaggtaa attcagtaac cgcgctatat 3181 gagccgtact agcaaaatca tttgcccctc ccctcagaaa ggtcaggggt tggggtgggg 3241 taactccatc aaacagcccc atgactcctt ctatcagagt atattcacat acttgcgagt 3301 tataagtaaa acattgcttg atgtattctt ctgatgtcag cactgggtct aaattgcgac 3361 tgggacgacc tgtcacgtgc tgatgaaaca tcgggtcgat gtaatctgga ccaaccttga 3421 aagattgtac ttgtgtactt cgttgacaca aagatgccaa aagggtgagc gtgactgtcg 3481 tcttacccac cccgctgcgt tctcctgcaa taactagagc cataaagaag ttaaaagtta 3541 gcaaaaaaga acgatcaaac acagttgtca gaactcagaa ttaattataa acacaggttg 3601 acccttcgat ttccttcgac tgtgctcagg atcaatcctt acaattctgt acaactgggt 3661 ttctcatctt gttcacaagc cctcctacac tagtagggtg tgttgtcgca cagcgcaacg 3721 caccatcaat aattttcggt gcgttaggac taacgtccat aacgcaccct accggattaa 3781 agctgagcta cttgcggttt gaaacgttaa atcaagagga tttcatccgt ggggatattc 3841 tgcgttttgc gttcttctta aagatcttga cttttataca aaattgtgta tatacctcat 3901 ccttatcccc tctccttgat aaggagaggg gtgcccgtga gggcggggtg aggtcttaac 3961 gactaacttt ttgcttccaa cttgagcata tgagctgttt gagcgagact ttcttcaaag 4021 agtgcttcat gaccccagcg ttgcacctgc tgactaagta aaacttctgc tttttgctcg 4081 gaaagtgaag tcacttgttg aaacacacca ctagcctggg caccaccttg tgtttccatc 4141 cgcatacaac cacctgtttc tgaacagatg agagtcccgc agtttgcctg tggatttgtg 4201 gaacttaagc gcaaggcaat taaatcacca ggaatttcac ccgcatcgcc aacagggact 4261 cctgctaact ctgcttctat ttgccgttga tcctgtgcaa cgaactgaat tttggtgatg 4321 tcataatctc cagtttcttt ttgataagaa accggatgcc attgtaaacg acttgccagc 4381 caacccaaat acattagcgc ttgtacggag ttgccttttt cataatctat gttaactctg 4441 tcaatttcta ccagggcagc acgacgttgg ggtgcgtcgt aggcttcagc 
tgttaattcc 4501 tgccatcccg agagacgacg ccaattcagg tcagcgagtg gaatatcaga ttctaccaac 4561 tcttgcaagt tgagcaaatc agtttctgcc ttgttaaagt tgcaggagtc cacaataacg 4621 ttattacaca ctgccgataa tcgcttgaac aaggggttgt tgaggtctgg tgttgctttc 4681 caccagagaa acttgggcaa gccaccaatt aacaatgcct gaatcatgcc accgactcgt 4741 tctaaagcag cagccgttcc tgttaaagta atgtattcgc agcacacgag tgtacttgcg 4801 gactgctttt ggattgggca gtaggcagaa acttgagcct tcacccctac atcttcgcct 4861 gctattggag tcagggcaat aatgcgacag gggttgcgga tggcaatttc atcagcaatt 4921 cggggactgc tgctggcatc tggagtataa gacgcactac cagaaccgtt ttcagctgta 4981 gctgcgccgt gacgtttgga gagttcttcg cgtaagacag caagggtttc ctctgttgct 5041 gttccagtct ctgttaaacc gtgggttttc tgcacttctc gcagtgcggc ttccatctgc 5101 ggaccgaaaa tgccatcaat tggacctttg taaagtccca aagcagccaa tagatactgg 5161 gtttcctcag gttcgtaaac tatcaaagta aatgttgtgg cgcgagtggc agcaggaagt 5221 gcaccatctt caccagcaat accgtaactt tgccaaatgc gactcagttc cgcttcaata 5281 tctgttagcg aaacatcttt gggagcctga agtgaaaaaa ttgtaggagc ttgggaagtc 5341 ataacaattg aagtattaag tattaagtta tgaactatga gttgcaccaa ggataaagga 5401 gagaaaattt tatttctatc ctttaccctt aattgttcta tagtctgcgc cagcggcgac 5461 catcttggtt aattaacaac tccgcttcgg ctggttccca agtgccagct tcatatcggg 5521 gaacggacgc cgggtcactt ggcgtatccc aaacggaaag gattggtgtc accacccgcc 5581 aagcagcttc cacttcgtca gcccgtgtga ataatgtctg gtcgcccatc atacaatcta 5641 aaaacaggcg atcataggca tcggaagttg cttggatacc aaaagaacca taagtaaagt 5701 ccatgtcaac ggaacgggta cggaatgccg ctcctggcat cttgacttca aagcgtagag 5761 aaattccttc attaggctga atccgcattg ccaaaatgtt accagccatt tgttgggcag 5821 cagatggaaa catccgagaa ggaacttcgc ggaagtggat cgaaatctca ctgacttttt 5881 ttggcatccg cttaccagta cgcaagtaga acggaactcc ttgccagcgc cagttatcaa 5941 ccagaaactt caccgccacg taggtggggg ttgtggagtt tggattcact cctggttctt 6001 catgataacc aggcacgggt ttacccttca tccacccagc actatactga ccccgtactg 6061 cagaacggtg gagattttgg acatcagcta gtcgggtggc ttggagcacc ttcacttttt 6121 ccgtacggat gctgtcggca tctaaggaat tgggaggctc aatcgcgctc atgcaaaaaa 
6181 gttgcatcaa gtggttttgc aacatatccc gcagtgcacc ggagctttca tagtatccag 6241 ctctgtcttc aacccccacg gtttctgcta cagtaatctg aacgtgatca acaaattggc 6301 gattccataa gggttcaaaa atcgcgttgg caaaacgaaa caccagtaaa ttctgaactg 6361 tttctttgcc gaggtagtgg tcaatccggt agacttgctc ttctttgcaa tatttctgca 6421 ccactcggtt cagactttgg gctgaagcca agtcccgacc aaagggtttt tcaatgacta 6481 gacgatgttt gtagggatct tccagcattc cccctgagcc aagctgcttg atcgcttctg 6541 gaaagaaatt cggtgcaaca gacaggtaga acatccggtt tccccgtgtt cccctttttt 6601 cgtctaactc acccaagaaa ttcttgagtt tctgataact ttcgggcttg tcgatatcac 6661 cagagcaata gaacagacct tgggagaagt cttgccagag ttcctccgca ccgacaccgc 6721 catgagcttc ttccatgccc ttgcgcattt gttcgcggaa gtattcgtgg ctccagtcgc 6781 gacgtgctac gccaacaatg gtgatttctg gtggaacgcg tcgttcttgt cgcagtttgt 6841 acagtgctgg gactagtttg cgccaagtca ggtcaccgga agcaccaaag ataattataa 6901 tctgaggttc cggcatccgt tgctgttgca gaccaacccg caggggattt tctagcagac 6961 tgaccataac aattttggat ttctgatgga gatttgcaat gagagattgg ggaacttgtc 7021 ccttttgggg attagggaca atagaaattg ttagtcgtta gttatcaagc aatacagttc 7081 agttaaggat aattgtaggt tggggagcca gcgcgaatga cggctttccg ctctcacagc 7141 aagtctggcg tttgaggaa // LOCUS NODE_4362_length_7131_cov_4.415489 7131 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7131) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7131) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7131 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(150..1517) /locus_tag="DP116_25840" CDS complement(150..1517) /locus_tag="DP116_25840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318590.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25840" /translation="MAQNYRSAKLRGKSFKGQDLTGTDFSYCDIRGADFTGATLRGAN FSHAQAGLQRRWAFALVIFALLLSTVSGLLSAVGGSLLGFILVDGNRQNTYVTVISSI ILLIFFLISMRRGVAAACGFLAAAVIGTGLAAVVWAGIVAVAWVGTGTQTRVMELAAV VTVVVTGAVAVIVSATGLVVISGAAVVAGFIAGLLAVALTVSVAGAVAGVVVVAAAKI GGVVAAIPASAVAILTLIVSADVSCRGLYQDDKQSWIGNIALGLVTGHGTSFQESDLT DADFTQARLKNTTFKKSILARTCWFQAKKLNLADIRTTYLKDEKIRQLLVTKELQNQN FDGWNLQGINLQGANITDASLVATVLNESNLQNADFSRANLTRAQLDRTDFRGATLTG AYIGNWGVTPETKLDGIKCEYIFLCVPTKDNPNPHRLPTNWEETFEDGEFAQFIRTSS KFSQI" gene complement(1912..2130) /locus_tag="DP116_25845" CDS complement(1912..2130) /locus_tag="DP116_25845" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25845" /translation="MVNHSVASSSFSKRTIFAPQPIGVLPAQLPTTQVLNVELDEDED IEWTWTTLPTGQQYVSGYTIISKSSDGN" gene 2400..3107 /locus_tag="DP116_25850" CDS 2400..3107 /locus_tag="DP116_25850" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25850" /translation="MFTRIDEILPKVFKISVSTNKNFVFSQFLILDECHMLIHTGHSK WFNAIYELVSSVCNPTEIRYVAFCHLEADECGALNQWLEVCPEAVPLVSPLNRANIDD IAIRKAHVLKNNKSVSLGSKNITLIETPHFPHGWEGCLFYEPQDCVLFCSDLAAHNGH FDTPLTNEDLTESVIKFQRQLGFMVEGKTFTRGIEAIKKLPIKYLATMHGSVIYGDSI VKMLHELQVNFGVPETF" gene complement(3142..3717) /locus_tag="DP116_25855" CDS complement(3142..3717) /locus_tag="DP116_25855" /EC_number="2.3.2.6" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129586.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="leucyl/phenylalanyl-tRNA--protein transferase" /protein_id="PRJNA477356:DP116_25855" /translation="MKYDVATIIQGYAQGYFLMADENDVLGWYGSRDRTLIPLDQRFR YPKSLQRVLNQERFSVAINRDFKAVVAGCANRESTWISPELKEIYWELYQTGWAYSFE TWQGDELAGGILGIVIGGAFIGESMFYRIPEGSKVAMVKLVEMLRKKRFVMFDAQMMN PHLERFGAYGVSDDEYMVLLKKALQRSCSLL" gene 3797..4126 /locus_tag="DP116_25860" CDS 3797..4126 /locus_tag="DP116_25860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318537.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25860" /translation="MTQPIPPITLPPPQDPMLEGKWLQQRLHQWLDEEFIPEAINQII AERAAQIFVRQRMEGENDLGSLVIAIVTEMQSFDFSKSFYGEFAIANAVSDLLLESLG IDKCCGQ" gene complement(4225..4527) /locus_tag="DP116_25865" CDS complement(4225..4527) /locus_tag="DP116_25865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006276731.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="30S ribosomal protein S14" /protein_id="PRJNA477356:DP116_25865" /translation="MAKKSLIEREKKRAKLVAKYAAKREALLEEFREAESPLDKLEIH REIQQLPRNSAPSRHRNRCWLTGRSRGVYRDFGLSRNVLREWAHEGLLPGVVKSSW" gene complement(4627..5316) /gene="nth" /locus_tag="DP116_25870" CDS complement(4627..5316) /gene="nth" /locus_tag="DP116_25870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878420.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="endonuclease III" /protein_id="PRJNA477356:DP116_25870" /translation="MSIRKQSSKKQRALEILVRLKRLYPDATCSLNYSTPVQLLVATI LSAQCTDERVNLVTPGLFSKFPDAATLANADLTELETSVRSTGFYRNKAKNIQAACRM IINEFGGHVPKQMEQMLRLPGVARKTANVVLAHAYGINVGVTVDTHVKRLSERLGLTE HTDPIRIERDLMGLLPQADWENWSIRLVYHGRAICKARSPACDTCKLADLCPSAFNVP QVTVKAKIKEA" gene complement(5316..6419) /gene="rseP" /locus_tag="DP116_25875" CDS complement(5316..6419) /gene="rseP" /locus_tag="DP116_25875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318585.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RIP metalloprotease RseP" /protein_id="PRJNA477356:DP116_25875" /translation="MSFLAGLAAIAVLAVLILVHEFGHFIAARSQGIYVNRFSLGFGP VIWKYQGAQTEYALRAFPLGGFVGFPDDDPDSDIPPEDPNLLRNRPILDRAIVISAGV IANLIFAYFLLVTQVGVVGVNIPQPPQNGILVPELISQPVSVAAKAGLQPGDVILAAD DRTFGTSEQELKAFTEIIKGNLGKQINLTIARGDQKLSVNLTPEANAKGEGSIGVRLS PNQKIVHRRATNPVEALKIGATQFQQILVKTAQGFGQLITNFRETANQVAGPVKIVEI GASWAQNDISNLLFFGALISINLAFINILPLPALDGGQLAFLVIEGLRGKPLPNRVQE GVMQTGLVILLGLGIFLIVKETTQLEWVQRLFQ" gene 6613..7026 /locus_tag="DP116_25880" CDS 6613..7026 /locus_tag="DP116_25880" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25880" /translation="MIDVIVLTKILSPFLPYLLKLGDKAAEEAAKKLGADSWEKAKVI WVKLHPKVEAKPSAQEAVQDVAQAPEDEDALAALRQQLKKLLKEDPTLESELTRLLTE AEAPQSRGNINIAGDVKGVSAYDISGGSITQGNIS" BASE COUNT 2092 a 1668 c 1453 g 1918 t ORIGIN 1 ggtatagggg tgtaggggtg taggggaatg atttattcgt tggttcccac aactttggaa 61 aaagaaacgg taagagtgtt ttactttctt ccccttacac ccttacaccc tcacaccctc 121 acacccttag tttcggtcaa gtcttttgtc taaatttgtg agaatttcga tgatgtgcga 181 ataaattgag caaactcccc atcttcaaat gtctcttccc agttggtggg tagacgatgg 241 gggttcggat tatctttcgt aggtacacac agaaaaatgt actcacactt aatcccatct 301 agtttggttt ccggtgtcac gccccagttc ccgatgtatg cgcctgtaag agttgcacct 361 ctaaaatcag ttctatctaa ttgtgctctt gtcagattgg ctctagaaaa atcagcattt 421 tgtaagttag actcattcag cacagttgca actaaactag catctgtgat atttgctcct 481 tgcaaattga taccttgcag attccagcca tcaaaatttt ggttttgtaa ctcttttgtc 541 accagtaatt gtctgatctt ttcatcttta agataagttg tcctaatatc agccagattc 601 agctttttcg cctgaaacca acaggtacgt gccaggatag attttttgaa ggtagtgttt 661 ttcagtcttg cctgcgtaaa atccgcatca gttaaatcac tttcttgaaa actggtgccg 721 tgtcctgtca ccaaaccaag agcaatattg ccaatccagc tttgcttatc atcttgatat 781 aagccacggc aactcacatc agcagacaca attaatgtta atattgctac tgctgatgct 841 ggtattgctg cgacgacacc acctatcttt gcagcggcta caactactac tcccgctaca 901 gcccctgcga ctgacactgt cagcgctact gccaataacc cagcgataaa tcctgccacc 961 accgccgccc cggatataac cactagccct gttgcagaca ctataactgc cacagcccca 1021 gtgacgacta ccgttaccac agctgccaat tccatcaccc gtgtctgagt gcctgtccca 1081 acccatgcta cagccactat cccagcccat acgactgctg caagcccagt tcctatgaca 1141 gccgctgcca aaaacccaca ggctgctgca actcctcggc gcatagagat gagaaaaaag 1201 attaacaata ttattgaact gatgacagta acataagtat tttgacggtt gccatcaact 1261 agaataaatc ctagcaacga accaccaaca gccgatagca atccagatac tgtcgatagc 1321 agtaacgcaa atatgactaa ggcaaatgcc caacggcgtt gtagtcctgc ttgcgcatgg 1381 ctgaagttgg ctcctctgag ggtggcacct gtaaagtcag cacctcggat atcgcaataa 1441 
gagaagtccg tacctgttaa gtcttgccct ttgaaggatt tgcctctgag cttagcactg 1501 cggtagtttt gagccacgtt taattacaat taaattgggt acctttaact actagttgta 1561 ggatatcagt ttttggcagg tttagataac tgtgatgtgt ctcaatacaa ccacagtgat 1621 aagctgttgc gcattaaaaa tgcacaacag caaggcagaa ggcagacggc agaaggcaga 1681 aggaaagagg gttttaatga ttgattagac tacaagaata tgcaaattaa aagcgcgaca 1741 gcttatacgg acattctgct gggtgatgcg cctggaaaat aaagccaatt tgactgggaa 1801 cctgtatcag ttgcttgtga ctagcagtca acaatttccg ccccagaact ttgaggttat 1861 cttcctgcaa ccagcgtttt gatctgttga aaagaattta tcctcagtca attagttccc 1921 gtcagatgat ttggaaatga tggtgtagcc actaacatac tgttgaccag tcggtaatgt 1981 cgtccaagtc cactcaatgt cctcatcttc atcaagttca acattgagga cttgagtcgt 2041 cggtagttgt gcaggtaaaa caccaatcgg ttgaggtgca aatattgtcc gtttggaaaa 2101 gctagaactg gctactgaat gattcaccat catgaaaacc tccttgcata cttttctagt 2161 ttatcgtccg tccgttgcaa cgggagttag gtgtcattta gcttcttcgc cgtaagcccc 2221 gcaccataac tgtactcagt ttggtgtggg tagttcactg gaccaaaact tttgggagca 2281 tcccaatgca tgcaaaaaaa cagtagtggg agcatcttgc tccctagttt gtgacctcag 2341 cgggctagaa gcccacacta caaaataaaa taaatttaca caattgccca aggataacga 2401 tgtttaccag aattgatgag atacttccca aagtatttaa gatctccgtt tctactaaca 2461 agaactttgt cttcagtcag tttctcatct tagatgagtg tcacatgctc attcacacag 2521 gacactcgaa atggtttaat gccatttatg agctagtatc atcagtatgc aatccgactg 2581 aaatccgtta cgttgccttt tgtcaccttg aagctgatga gtgtggagca ttaaatcaat 2641 ggctggaagt ttgtccagaa gctgttccgc ttgtcagtcc gctaaatcga gctaatattg 2701 acgatattgc gatccgaaaa gcccacgttt tgaagaataa caaaagcgtt tccttaggat 2761 cgaagaacat taccctaatc gagacgcctc atttcccaca tggttgggag ggatgtctgt 2821 tctacgaacc gcaagattgt gtgcttttct gttctgactt agcagctcac aatggtcatt 2881 ttgacacacc actcaccaat gaggatttga cagagtcagt catcaagttt cagcgtcaac 2941 taggatttat ggttgaggga aagacattca ctcgcggaat cgaagctatc aagaagcttc 3001 ctattaagta ccttgctaca atgcacggtt cagtgattta cggagacagc attgtgaaaa 3061 tgctccacga actccaagta aattttggag ttccagaaac tttttgagct gatgacattg 3121 gttaactgat 
cgcaacaaaa ctcaaagtag agaacaagaa cgctgcaaag ctttcttcag 3181 taacaccatg tactcatcat cactgacccc ataagcacca aacctctcca aatggggatt 3241 catcatttga gcatcaaaca tcacaaatcg ctttttccgc aacatttcta ctaacttcac 3301 catcgccacc tttgaacctt ccgggatgcg gtaaaacatc gactccccaa taaaagcacc 3361 cccaatcaca attcctaaaa ttcccccagc cagttcatcc ccttgccaag tttcaaaact 3421 atacgcccaa cccgtctggt aaagttccca ataaatttct tttaactctg gtgaaatcca 3481 agttgattct cggttagcac atccagctac gacagctttg aaatcgcggt taatcgcaac 3541 gctaaaacgc tcttgattca gaacacgctg tagggacttg ggatagcgaa atctttgatc 3601 taaaggaatg agagtgcgat cgcgactccc ataccacccc aagacatcat tctcatcagc 3661 catgagaaaa taaccttgag catagccttg aataatagtc gcaacatcat atttcataga 3721 aataaggaat atacatagtt tgtagtcagc cctaccgcga ctactacaaa cctctttaat 3781 ataaaaacaa gagaaaatga cgcaacccat tccaccgatc accctaccac cacctcaaga 3841 tccgatgctt gaaggaaaat ggttacagca acgcttgcat cagtggctgg atgaagaatt 3901 tatcccagaa gcaatcaatc agataattgc cgaaagggca gcacaaattt ttgtacgcca 3961 acgcatggaa ggagaaaacg acttgggttc tctggttatt gccattgtta cagagatgca 4021 gtcgtttgat ttttccaaaa gtttctacgg tgagtttgct attgctaacg ccgttagcga 4081 cttactctta gaaagtttgg ggattgacaa gtgttgtggg caataaatag gcaataaata 4141 atttatgtcc tttgtcgttc gtcattagtc cttggttctt tgtctaaagc taatgactaa 4201 tgactaatga ctaatgactc accactacca actagactta acaactccgg gtaaaagacc 4261 ttcgtgtgcc cattcccgca gaacattacg agatagccca aagtcgcggt aaactcctct 4321 ggaacgaccc gttaaccagc aacggttacg gtggcggcta ggagcgctat tgcggggtag 4381 ctgttgaatt tcacggtgaa tttccaactt atcaagagga gattctgctt ctctaaattc 4441 ttccagcagt gcttcccgtt ttgcagcgta tttagccacc aatttggcgc gttttttctc 4501 gcgctcaatc aaactctttt ttgccataat gttagtgatt tattgaaaga cagcattttc 4561 cattctatat aatcccacgt ctagtgacca aagtctcttt tagacagggt ctgtaggtct 4621 agattcctaa gcttccttaa tttttgcctt gacagttact tgtggtacgt taaaagcaga 4681 aggacacagg tcagctaatt tacaagtatc acaagcaggc gatcgcgctt tacaaatcgc 4741 acgaccatga taaaccagcc gaatagacca attttcccaa tctgcttgag gcaatagacc 4801 catcaaatct cgttcaatac 
ggatggggtc tgtgtgttca gttaacccca aacgctcact 4861 cagacgtttc acatgagtat ccaccgtcac cccaacatta attccataag catgagcaag 4921 cacgacattt gctgtctttc gcgccacacc tggaagccgc aacatctgtt ccatctgctt 4981 ggggacatga ccgccaaact cattgataat cattcgacaa gcagcttgaa tattctttgc 5041 cttattgcga tagaaccctg tggaacgcac gcttgtttct aactctgtca aatcagcatt 5101 tgccagagtt gctgcatccg gaaatttact aaataagcct ggtgtcacta gattcacccg 5161 ttcatccgta cactgagccg agagaatcgt agccaccagc aactgtacag gagttgagta 5221 gtttaaagag caagttgcat ctggataaag acgcttcagg cgaactagaa tttctagcgc 5281 ccgttgcttc ttagatgact gtttgcggat actcattact ggaataatct ttgcacccac 5341 tccaattggg tagtttcttt gacgatgaga aaaattccta gtcctaaaag tatcaccaaa 5401 ccagtttgca tgacaccttc ttgaacacga ttcggtaagg gtttaccgcg caaaccttca 5461 attaccagaa aggcgagttg tcctccatct aaagctggta aaggcaaaat gttgataaaa 5521 gccaagttaa tgctgatcaa agcaccaaag aacaacaaat tgctgatatc gttctgagcc 5581 cagcttgcac caatctccac aattttaact ggaccagcaa cttggtttgc tgtttcgcgg 5641 aagttagtaa ttaattgtcc aaaaccttga gctgtcttga ccagaatctg ctgaaattgt 5701 gtagcgccaa tttttaatgc ttcgaccggg tttgtcgcgc gacgatgcac gattttctga 5761 ttaggagaaa ggcgcacacc tatgctaccc tctcccttag cattagcttc aggagtcaga 5821 ttgacagaca gtttttgatc gccacgggcg attgtcaaat taatttgctt gcctaaatta 5881 cctttgatga tttctgtgaa agcttttaac tcttgttcag aagtcccaaa agttctgtca 5941 tcagcagcta aaatgacatc tccaggctga agacctgctt tcgcagcaac agaaactggt 6001 tgtgaaatta attctgggac gagaatcccg ttttggggtg gttgtgggat gtttacccct 6061 acaacaccca cctgtgtcac aagcaggaag taggcaaata ttaaatttgc tatcactcct 6121 gcgctaatca caatcgcccg atctaaaata gggcggttac gtaaaagatt tgggtcttca 6181 ggaggaatat cgctatctgg gtcatcgtca ggaaaaccga caaacccacc cagaggaaaa 6241 gcgcggagag cgtattcagt ttgtgctcct tgatacttcc aaatcaccgg accaaagccc 6301 aaagaaaagc gattgacgta aatcccttgg gatcttgctg caataaaatg tccaaactca 6361 tgcaccagga tcaaaacagc caagactgcg atcgctgcta aacccgcaag aaatgacata 6421 gataaagaat taatgaactg cgaaaggcta tacttctttt cattttaagg tgtgtctaga 6481 gaagacgacg ctgcaaaggt cgaaaaagcc 
gcccatcaaa ttgagaaaaa ttggtacaac 6541 ttttccgaaa gcacaactta cgcaggtgta gatttcaatg gtactatcta tgcccaataa 6601 aatctcttac taatgataga tgttattgtt ctaactaaga ttctatctcc ttttctgcct 6661 tacctactca aactaggaga taaagcagca gaggaagcag caaaaaaact tggtgcagat 6721 agttgggaaa aagcgaaagt tatttgggtt aaactccatc ccaaagtgga agctaaacca 6781 tcagcccaag aagcagttca ggatgtagcg caagcgcctg aggacgaaga tgctttagcc 6841 gcactgcgtc agcaacttaa gaagttactc aaggaagatc caacgttaga aagcgaactc 6901 actcgccttc tgacggaagc cgaagcaccc caaagtcgtg gaaacatcaa tattgctgga 6961 gacgttaaag gtgtttctgc atacgacatt tccggaggaa gtatcactca aggaaacatt 7021 agttagagat aagggaacag ggaacaggga acagggaaca ggaaacagtg accagtgaac 7081 agtaaacagt gaacagtaaa caaggggtgg acgagtccgt ttcctacctg a // LOCUS NODE_4387_length_7062_cov_4.3034117062 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 7062) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 7062) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7062 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(164..733) /locus_tag="DP116_25885" CDS complement(164..733) /locus_tag="DP116_25885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017312274.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DivIVA domain-containing protein" /protein_id="PRJNA477356:DP116_25885" /translation="MELNRLEEMILAGFNIPLTRRTIVDEDKLLDQLDEIRLSLPEVF QEAAAIIQQKEEILLEAEEYGQQIVDAAQAKRSQILDESDILRQAQREAAELRRQVQE ECEQMMQDTLEEIDRKRRACQQEIEEIQRQAVAEAEAIEQGADDYADGVLENIEQNLQ DMLRIIRNGRQQLLPEASSEDNSQFPKKK" gene complement(826..1320) /locus_tag="DP116_25890" CDS complement(826..1320) /locus_tag="DP116_25890" /EC_number="2.7.7.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455774.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pantetheine-phosphate adenylyltransferase" /protein_id="PRJNA477356:DP116_25890" /translation="MIAIYPGSFDPVTLGHLDIIQRGSRLFAQVIVAVLRNPNKTPLF TVERRLEQIRLSTKHLTNVEVDAFAGLTVNYAQMRGAQVLLRGLRAISDFEVELQMAH TNKTLSYDIETVFFATSNEYSFLSSSVVKEIAKFGGSVDHLVPPHVALDIYQCYAQNH PVSN" gene 1729..2598 /locus_tag="DP116_25895" CDS 1729..2598 /locus_tag="DP116_25895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prolipoprotein diacylglyceryl transferase" /protein_id="PRJNA477356:DP116_25895" /translation="MLDISTLPLAFEFSSPGPILVKIGPLTIRWYGFLIASAVLIGVW LSQKLAKHRNVNPDLISDLSIWLVLGAIPAARIYYVLFQWSEYVQHPERIIAIWQGGI AIHGAIIGGVTAALIFAKRNQISFWQLADLVSPSLILGQAIGRWGNFFNSEAFGSPTN VPWKLHIPPENRPPELANFEYFHPTFLYESVWDLMVFALLLTLFFRSLSGKPALKTGT LSLVYLAAYSLGRFWIEGLRTDSLMLGPLRIAQVVSLLGIIFGLAGLAWLYIAKRPLP DVISTSQEDRQRR" gene complement(2797..3045) /locus_tag="DP116_25900" CDS complement(2797..3045) /locus_tag="DP116_25900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007355091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25900" /translation="MKQKIIDELLRQARLTFNVALAITAASAIMTLSGVGLVYLNKIP EASLTTTAGILASIGSVQFAKDAKEELREMIDKLPEKS" gene 3429..4808 /locus_tag="DP116_25905" CDS 3429..4808 /locus_tag="DP116_25905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132039.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25905" /translation="MLQSDKVQSSEEGKIRLRNTYKEEGLTIETLAVKAHVSEDTIKR LLGTKNCPNGIERRQVESIAKVLNIKPTDIVNPKDWYPPQIPPEFERLIKDKTESFRG RKFVFDAIEEFFKNNPKGYFTVVGDAGMGKSAIAAKYVLDNPAAICFFNIRAEGMNRP ELFLKKIREQLMSRYQLQDAADTDLSTLLTKASEKITAGERLVIVVDALDEVDQESTG NLLYLPTILPENVYFLLTRRPYNQNEKRLNVSFSVSTKELDLRDYAERSSQDVKEYIW LLLNDAEYKQGLSQWIQKQNHLSTTEFVEEVATKSENNFMYLRCVLPAIADGFYENKP LDELPVGLQGYYENHWQLMGMTTKPLPKNKIKIIYVMCALRSAASREVVAKYSKQNDL SVQEVLDGWAQFLQKQENYQPPRYRFYHESFRDFLHRQDIVQAAGVNLPNISAEVADN ITGGLYGDG" gene 4801..6726 /locus_tag="DP116_25910" CDS 4801..6726 /locus_tag="DP116_25910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496052.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25910" /translation="MGDVRTRLTELSSEEIELFLGDVPRLWLEGRQVKKFCRLLSDFD FIEAKINHPKFGVQALIEDYDLIDDAELSTYPEYDAQTVKALKLIQGALRLSAHILIQ DTNQLAGQLSGRLLHFDAPEIQKLLQQIPQSQTTCLRSLTSSLTTPNEPLIRTLTGHS GSLWSVTVTPDGEQVISGSQDGTIKIWNLNLGNLIYTISAHDDSVDTIAITADGLYVI SGSRDTTIKIWNLKTGQLVRTLRGHYGSVNTVILTPDGSKIISASSDDTLKIWNIKTG EVLHTLIGHTRSVQAVTIVFSKNNKWVISGSYDKTIKVWNLETGKEELALNESHWVGC ITATPDGKRVISALEDGTLTVWKVGTWEKEYILKGHNNSVRTVAVTPDGKRIISGSSY DGTLKIWKVETWENEATFTGHTAWVLAVAVTPNAKQIISASGNNIFSSEFTIKVWSLE KCIEAFSLKAERNTITAHSDSVEVVAFTPDGKYVISAAKDDNFKLWEVGTWENEASFT GESKSALALGDYAVITGYADLIFEQNESALTLVGGYSDYKVFELKAGDVKCTLHSINN TLPFLLATTADGKRQIWSSGDETLKVWDVSARQFIASFTGESEIRCCAIAPDGVTIVA GEASGRLHFLRLEGIEA" gene 6729..>7062 /locus_tag="DP116_25915" CDS 6729..>7062 /locus_tag="DP116_25915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015132037.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_25915" /translation="MKTPPSQAPINSASYPTEFQQVILEKSQNFIGREFIFTAITDFL HRHKRGYFTIVGVPGSGKSAILAKYVTENYHVIYYNAQIVGKNRAEEFLRDVCTQLIE WLHNFPSTP" BASE COUNT 2027 a 1393 c 1513 g 2129 t ORIGIN 1 ggtgggtact gcccaaattt tctttgacat aagttgctct ttatggtata tacttaaact 61 ttctattagt cgtcggtagt catagtctcc tcatcctgag aatacggcat agttgggttg 121 tgaaccgtgg aaagctgaga gatcaagaaa ctagaattta caattatttt tttttaggaa 181 attgagaatt atcctcacta gaagcttctg gtaatagttg ttgacgtcca ttacgaatga 241 ttcgcaacat atcctgaaga ttctgctcaa tattttctag cacgccgtca gcatagtcat 301 cagccccttg ttcaattgct tctgcttctg cgacagcttg acgctgtatt tcttctattt 361 cttgctgaca ggcgcgtcgt ttgcggtcaa tttcttcaag agtgtcttgc atcatctgct 421 cacattcttc ttgaacttgt cggcgcagtt cagcggcttc gcgttgtgct tgtctgagga 481 tatcgctttc atccaagatt tgcgatcgct tggcttgcgc tgcatcaaca atttgttgcc 541 catattcttc tgcttccaga aggatttcct ccttttgttg gatgattgct gctgcttcct 601 gaaaaacttc aggtaaagaa agccgtattt catcaagctg atcaagcagc ttgtcttcat 661 ctactatcgt gcgtcgcgtc agtggaatgt tgaaaccagc aagaatcatc tcctccaagc 721 ggttgagttc catctgtatg tctacacttg ttgttcgcgt agcatctgga ttttcggata 781 ttcctgcagg aaactcttgc gggggaggac tgcttccgtt gtgatttagt tcgatacggg 841 atggttttgg gcgtagcatt ggtaaatatc taaagcaacg tgcgggggga caagatgatc 901 aacagagcca ccaaacttag caatctcttt caccacacta ctacttaaaa aactatattc 961 atttgacgtt gcgaagaaaa cagtttcaat atcataagaa agggttttat ttgtgtgagc 1021 catctgaagt tctacttcaa agtcagaaat tgctcgtaaa ccgcgcaata gcacttgtgc 1081 accacgcatt tgggcatagt ttacggtcaa accggcaaaa gcgtctacct ccacattcgt 1141 gagatgtttt gtacttagac gaatttgttc tagccgtctt tccactgtaa acaggggagt 1201 tttgttcgga ttccgcaata cggcgacaat cacttgagca aacaaacgac tgccgcgttg 1261 gataatgtca aggtgtccta aagtcacagg atcgaaactt cctggataaa tggcaatcat 1321 aatcacaagt gcctcacagt tgtttaggtg attatatcta accaatcgtg atttgtaaca 1381 aagtacaagt tctcagttat aaattcaaga tgttagtctt tgtaagcttt aacgcatttg 1441 ctgccatagt tttatttatc gactttcaaa 
ataaaaattt ggataaatac actacccata 1501 ctgccttctc tactcacagc gatctaaaat caatagtttg gcttttgcaa caatttttga 1561 ctgtcgttcc cctaagccct gttccctgtt ccctgttccc cgttccctgt tccctgttaa 1621 gagtgtttct ttcatcggta acgagttaca atcactaccc gtaaagcgct gttaacaagc 1681 acaacttaaa tctacattac ctttttttag tcttctaggt acttttacat gctggatatt 1741 tccactttgc ccttggcatt tgaattttct tctccaggac cgattctggt gaaaatagga 1801 ccattaacaa tccgttggta tggcttttta attgcttcag cagtattgat tggcgtttgg 1861 ctttcccaaa aactggcaaa gcaccgtaac gttaatccag atttaattag cgatttgtca 1921 atttggctgg ttcttggggc aattccagca gcacggatat attacgtttt gtttcaatgg 1981 tcagaatatg tccagcaccc agaacgcatt attgcgattt ggcaaggagg gatagcaatt 2041 cacggggcaa ttatcggtgg tgttaccgca gcattaatat tcgccaaaag aaaccagatt 2101 tctttctggc aattggctga cttggtgtct ccttcgctga ttttaggtca agcaatagga 2161 cgttggggca atttcttcaa ctctgaggca tttggcagtc cgacaaatgt accttggaag 2221 ctacatattc caccagaaaa ccgtccccca gaactggcga attttgaata tttccatccc 2281 acttttcttt atgaatctgt atgggatctt atggtattcg ccttgctgct gactttattt 2341 tttagaagtt tgtcaggtaa accggctttg aaaacaggta cgctgtctct agtttatttg 2401 gcagcttaca gcttaggacg cttctggata gaaggtctgc ggacagatag tttaatgctt 2461 ggacctttac gaatagcaca agttgtgagt ttactgggaa taatctttgg tttagctggg 2521 ttagcttggc tttacattgc caaacgtcct ttaccagatg tcatctccac ctctcaagaa 2581 gatagacaga ggcggtaaaa gaatggggaa gagtggcttc cccatttttt tgtccgggat 2641 tggtggctta gaggcggtaa gtagctcaac cgaattaaac gtgaaatgtc attgcgagtg 2701 aaacgcagtg gagcgaagca atcgcaaaga ctctatttta tatttttcaa tgttgaccta 2761 cttatcaccg cctcaaagag ttctctcatt caacctctag gatttttccg gcaacttgtc 2821 tatcatctca cgcaattctt cttttgcatc tttggcaaac tgaacactac caatgctggc 2881 aagaattcct gctgttgtag tgaggctggc ttctggtatt ttattcaagt agacaagccc 2941 cacaccagat aatgtcatga tcgcagatgc tgcggttatg gctaaagcaa cgttgaaagt 3001 aagacgtgct tggcgtagta attcatcaat gattttttgt ttcatattgt tgggttggtt 3061 ttgaactaac ttattcgatt atgtagaggc aacaatttaa attcgagagc agataggttg 3121 agtgtagatt tatgatttgc cctatgtttt ggggcagatg 
gggcaagtga ataaaaccta 3181 actgccctac acaggtaaat taaagattag ttatcttaga tcttgtacct cactggtgag 3241 tacgccttga aataaatttt aaggctcata gccaaagtcc gttaaaacgg actgaaatat 3301 atattcagtg agctttagct tacttgacct ttgagcctag aaattcattc ctaggcggac 3361 gagaaagcag gtgcaatatc tgagctatca cattattaaa tacctggatt catcaatatc 3421 ttttatctat gctgcaatcg gataaggttc aatctagcga agaaggcaaa attagactca 3481 ggaataccta taaagaagag ggtttgacta tagaaacgct tgcagtaaaa gcacatgttt 3541 ctgaggacac cattaaacgt ttattaggta caaaaaactg cccgaatgga attgaaagaa 3601 ggcaagttga aagtattgcc aaagttttaa atattaaacc aactgatata gtaaacccaa 3661 aagattggta tccgccgcaa ataccgccag aatttgagcg attaatcaaa gacaaaacag 3721 agtcatttcg tggtagaaaa tttgtctttg atgctattga agagtttttc aaaaacaacc 3781 ccaagggcta cttcactgtt gtgggtgatg caggaatggg gaaaagtgcg atcgccgcca 3841 agtacgtatt agacaaccca gcagcgattt gctttttcaa tattcgcgct gagggcatga 3901 atcgcccgga attgttcctg aaaaaaatcc gcgaacaatt gatgagtcgt taccagttgc 3961 aagatgcagc agatactgat ttatcaactt tgctaaccaa agctagtgaa aaaatcactg 4021 ctggtgaacg tctggtaatt gttgttgatg cactcgacga agttgaccaa gaatcgaccg 4081 gaaatctttt atatttacct actattcttc cagaaaacgt ttactttctg ctgacaagac 4141 gaccatataa ccaaaatgag aaaaggttga atgtttcatt cagcgtttcc accaaggaat 4201 tggacttaag agactacgct gagagaagta gtcaggatgt taaagaatat atttggctat 4261 tgctcaatga tgcggaatat aaacagggtt tgagtcagtg gattcaaaag caaaatcatc 4321 tctctactac cgagtttgtg gaagaagtag ccacaaaaag cgaaaataat tttatgtatt 4381 tgcgctgtgt attgccagcg atcgctgatg gtttttacga aaataaacct ctggatgaat 4441 tacctgtagg tttacaagga tattatgaaa accactggca actcatgggc atgacaacca 4501 agcctttacc taaaaataaa attaagatta tttatgtgat gtgtgcttta cgtagcgcag 4561 cttcccgtga agtggttgcg aaatattcta agcagaatga cttgagtgtg caggaagttc 4621 ttgatgggtg ggcgcaattt ttgcagaagc aggaaaatta tcaaccaccg cgttacaggt 4681 tttatcacga aagtttccga gattttttgc atcgtcagga tattgtgcag gcggcggggg 4741 tgaatttacc aaatatcagc gccgaagtgg cggacaacat tacaggagga ctttacggcg 4801 atgggtgatg tccgaaccag gttaactgaa ctgtcatcgg aagaaataga 
acttttcttg 4861 ggtgatgttc cacgtttgtg gttagaaggt cgtcaagtta agaaattctg ccgtcttttg 4921 agtgattttg actttataga agcaaaaatt aatcatccta aatttggtgt tcaggcgctg 4981 attgaagatt acgatttgat tgatgatgcg gaattatcaa cttacccaga atacgatgcc 5041 caaactgtaa aagcgctgaa attaattcaa ggggcattgc gactttcggc acatatttta 5101 attcaagata caaaccaact agcagggcaa ttgtcagggc gtttgctgca ctttgatgcg 5161 ccagaaattc aaaagttgct gcaacaaata ccgcaaagtc aaactacttg cttgcgtagt 5221 ttgacaagta gcttaactac ccctaatgaa cctttaatac gtaccttaac tggtcatagt 5281 ggttcgctgt ggtctgttac tgttactcct gatggagaac aggtaatttc tggatcgcag 5341 gatgggacta ttaaaatctg gaatttgaac ttaggaaatt taatatatac catctctgct 5401 catgatgact ctgtggatac aatagcaata actgctgatg ggttatatgt aatttccggt 5461 tcacgcgata caactattaa aatctggaat ttgaaaacag gacaattagt acgtacttta 5521 agaggtcatt atggttctgt taacacagtt atactaactc ctgatgggtc aaaaataatt 5581 tctgcttctt ctgacgatac cctcaaaatt tggaacataa aaactggaga agtcctacat 5641 actctaattg gtcatactag gtcggtacaa gctgtcacaa tagtcttttc caagaataac 5701 aagtgggtca tttctggttc gtatgataaa actattaaag tctggaacct agaaactgga 5761 aaagaagaac ttgctctcaa cgagagtcac tgggtagggt gtattacagc gacccctgat 5821 ggaaagcgag taatttctgc tttagaggac gggactctta ccgtctggaa agttgggact 5881 tgggagaagg aatatatctt aaaaggtcat aacaattcgg tacgcactgt tgctgttacc 5941 cctgatggaa agcggataat ttctggttca agctatgatg gaactttaaa aatttggaaa 6001 gtagaaactt gggaaaacga agctactttt actggtcata cagcctgggt tcttgctgtt 6061 gctgtcactc ctaacgcaaa gcagataatt tctgcatcgg gaaataatat tttctctagt 6121 gagtttacca tcaaagtctg gagccttgaa aaatgtatag aagcattttc attgaaagcg 6181 gaacgaaaca caataactgc tcatagcgac tcagtagaag tagttgcttt tacccctgat 6241 ggtaagtatg taatttctgc tgcaaaggac gataatttta aattatggga agtaggaact 6301 tgggagaatg aagctagctt cactggtgaa agtaagtcag cgcttgcttt gggagattac 6361 gctgtaataa ctggttacgc ggatttaata tttgaacaga acgaatctgc acttacttta 6421 gtaggaggtt acagtgatta caaagttttt gaactgaaag ctggcgacgt taaatgtact 6481 cttcatagca tcaataacac tttacctttt cttcttgcaa ccaccgcaga tgggaagcgt 
6541 caaatttggt cttctgggga tgaaacgcta aaagtgtggg atgtatcagc aagacaattt 6601 attgctagtt tcactggaga aagtgaaatc agatgctgcg ccattgcccc agatggcgtc 6661 acaatcgtag caggcgaagc atccggacgc ttgcatttcc tccgcctgga aggcatagag 6721 gcgtaaccat gaaaacccca ccttctcaag cccccatcaa ttccgcctcc tacccaaccg 6781 agtttcaaca agtcattctc gaaaaaagcc aaaattttat cggtcgtgaa tttatcttta 6841 ccgctatcac cgactttctc caccgtcaca agcgcggtta cttcaccatt gttggtgttc 6901 ctggtagcgg caagagtgct atacttgcca agtacgtgac ggaaaattac cacgttattt 6961 attacaatgc ccaaatcgtg ggtaaaaatc gggcagagga atttcttagg gatgtttgta 7021 cgcaattaat cgaatggttg cacaatttcc ccagcacccc ac // LOCUS NODE_4421_length_6988_cov_5.1254876988 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6988 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(642..1997) /locus_tag="DP116_25920" CDS complement(642..1997) /locus_tag="DP116_25920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015214212.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycolate oxidase" /protein_id="PRJNA477356:DP116_25920" /translation="MQVFEGSENKKASLENLTGFDGNHPPNPKLIDSCVHCGFCLSTC PSYRVIGKEMDSPRGRIYLMDGINEGEIPLNKATVEHFDSCLGCLACVTTCPSGVQYD KLISATRHQVERNYPRSLPDKLFRQLIFSLFPNPDVLRVLLVPLFFYQKLGLQKLVRG TQLLKILSPRLAAMESILPEVTLKSFQDNLPDIIPAQGKKRYRVGVILGCVQRLFFSP VNEATVRVLTANGCEVVIPKTQGCCAALPEHQGQTEQAKALARQMIDSFANTGVDFVI INAAGCGHTLKEYGHILEDDPQYREKAKAFAGKVRDAQEFLVSVGLTAKLSPLANQPL SLVYQDPCHLLHGQKISLQPRQLLREIPGVTLREPVDAAICCGSAGIYNLLQPEVAEE LGRQKVQNLLNTGADLITSANPGCSLQIRKHLQSQGKQISIMHPMELLDYSIRGVKLE V" gene complement(2059..3351) /locus_tag="DP116_25925" CDS complement(2059..3351) /locus_tag="DP116_25925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197080.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FAD-binding oxidoreductase" /protein_id="PRJNA477356:DP116_25925" /translation="MKALASCFASIVGTENTVCWEDVPISQQKSILQTIASPTPPSCI VYPHTTEQLAQVITEAHRNKWRVLPCGSGSKLNWGGLVKDADVVVSTERLNQLIEHAV GDLTVTVEAGMKFSRLQEMLVNSRQFLALDPITQDTATIGGIVATADTGSLRQRYGSV RDQLLGITFIRADGQIAKAGGRVVKNVAGYDLMKLFTGSYGTLGVISQVTFRVYPMQE ISGTVVLTGDAKAISQAANVLRGSALTPTQADLLSTQLVSSLELGQGLGLIARFQSIA ESVKEQSNRLLEVGEQLGLKGTIYSAVDEADLWRILREQMHSSFNESAITCKIGVLPT SAVEVLTQADVGWIHVASGLGVLRFEGDNKIDEVLGIRNLCQTNGGFLTILSAPVKVK QQLDVWGYTGNAVQLMRLIKGQFDKECILSPGRFVGGI" gene 3501..3691 /locus_tag="DP116_25930" /pseudo CDS 3501..3691 /locus_tag="DP116_25930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745503.1" /note="frameshifted; internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(3888..4106) /locus_tag="DP116_25935" CDS complement(3888..4106) /locus_tag="DP116_25935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="toxin-antitoxin system HicB family antitoxin" /protein_id="PRJNA477356:DP116_25935" /translation="MATLTIRLPDDKHNRLKELAQAKGISVNKLIEELSTIALAEFDT YTRFKAMAATANPEEGLRILAKLDTLTE" gene complement(4113..4535) /locus_tag="DP116_25940" CDS complement(4113..4535) /locus_tag="DP116_25940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458494.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="putative toxin-antitoxin system toxin component, PIN family" /protein_id="PRJNA477356:DP116_25940" /translation="MAIKIVVDTSVFISALISSKGSSRELIRRCLKREYQPLMGNALF SEYESVIQRSEIIAKCPLTSEEISALLASLMSVSQWISIYYLWRPNLKDEADNHLIEL AVAGNAQIIATHNVKDFQNAELLFPNLSILKPEKIIRS" gene 5086..6015 /locus_tag="DP116_25945" CDS 5086..6015 /locus_tag="DP116_25945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009625969.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AAA family ATPase" /protein_id="PRJNA477356:DP116_25945" /translation="MGYKIVKEKENSTNPEKKDFNINIEVPKWTLNELALSSAIKDQI DEIVAYIKNRDILLDEWEFKKFLKSGNGISINFFGQPGTGKTVTAEAIADKLGVNIIK VNYGELESELVGRTSKNLSELFAIAENSRSLLFFDEADTLLSKRISNLSQAADYGVNS VRATLLTLLEKFNGVIIFATNLFENYDEAFIRRILFNIEFTLPDTTMRIQLWEFHLSP KIPKEISYDKAAEISEGLAGGDIKNITFKLALKLLTKKIESISEDIMKEEIQKYLQTK EQHKKGRSLESSTVTFKSDVKIDSNVLSTDMSN" gene 6195..6554 /locus_tag="DP116_25950" CDS 6195..6554 /locus_tag="DP116_25950" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25950" /translation="MPLDPISWLIIGAIIGAGTVVFWDRIKDWATRVMGFLLDQMNKL VEFLVGGVVFLIKEGKKYYKKLYLYTRDKESQKPYRRESDKVEIKEADIPSDLRDIVP EKTNNEETGLQVATLKY" gene 6754..>6988 /locus_tag="DP116_25955" CDS 6754..>6988 /locus_tag="DP116_25955" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25955" /translation="MSKEENLTDKSQRLFKTTQLGIDKIKRAWEEKNTIFDDWESINE NVLDFNLIEKDIINIHNEIKQDNVVIKKTQYTIS" BASE COUNT 2245 a 1418 c 1270 g 2055 t ORIGIN 1 aagttttatg atattattat ccgatcatat aattaacaaa ttatgatcaa tagctaaatt 61 atgaggctaa gattaagtat tcaaaggcac aacatgacat gacgttatgg tgtcgcacta 121 aagcatccta aaaatttaaa ctttctttac aaacagctta tagaattcat gaaagcttga 181 tgagactaaa acttatctgg atgactctac ttctttgttc tatcgtttca aatgtcagga 241 ttcctggtct tttttatacg cttgtttttc taaatcaaac aattatttag ctaattgctg 301 acgttttgtc tgatcatatt cagcctgtag cataattttt gactcaaaaa tgacattttt 361 ctctgtattc atagtggtgt tttgacgact agtggtctgc ctcattaatt ttaatggttt 421 gtcttgacgt tgcatgcaaa atctctacaa gatttctaga atttcaaaac ccctcacaat 481 taatgacata aaccactata aatctactta ctcagcaact aactgcatat gtttgctgac 541 ctgaaaccta taactagatg acagggtgcc attcacgggc taaagccctg taggtatagc 601 tctttgcata aagctacagg cttgtaacct gtagcttttt tttaaacttc caacttcaca 661 cccctgattg aataatccaa caactccatc gggtgcatga tagaaatctg tttaccctgc 721 gactgcaaat gcttgcgaat ttgcaaacta caacccggat tagcagaagt aattaaatca 781 gcaccagtat tcagcaaatt ctgaactttt tgtctaccca actcctcagc cacctctggt 841 tgcagtagat tgtaaattcc cgcactacca caacatatag ctgcatcaac tggttcgcgc 901 aacgtcacgc ctggaatttc ccgcaataac tgacgcggtt gcaagctaat cttttgtccg 961 tgcaacaaat gacatggatc ttgataaacc aaactcaagg gttgatttgc aagcggtgat 1021 aattttgctg ttaaaccaac actcaccaaa aactcttgcg catctctaac cttccccgcg 1081 aatgctttcg ctttttcccg atattgtgga tcatcttcta aaatatgacc atattctttt 1141 agtgtatgac cgcaaccagc agcattgata atcacaaaat ctacaccagt atttgcaaaa 1201 ctatcaatca tctgccttgc taaagctttg gcttgttctg tttgtccttg gtgttcagga 1261 agcgctgcac aacatccttg agttttagga atcacaactt cacagccatt cgccgttaaa 1321 acccgcacag tcgcttcatt gactggcgag aaaaataacc tttgcacaca tcccaaaatc 1381 accccaactc gataacgttt cttaccctgt gctggaataa tatctggcaa attatcctga 1441 aaagatttca aagtaacttc tggcagaatt gattccatcg ctgctaagcg aggagacaaa 
1501 atcttcaaca actgagttcc acgcaccaat ttctgcaaac ccaacttttg ataaaaaaac 1561 aacggaacca gcaaaaccct taaaacatcg ggattaggaa acaaagaaaa tatcagttga 1621 cgaaacaact tatctggcaa actacgcgga taattccttt caacttggtg acgagttgca 1681 gaaattaact tgtcatattg cacaccagaa ggacaagtcg tcacacaagc cagacacccc 1741 aaacaactat caaaatgttc tactgttgcc ttattcagag gaatctcacc ctcattaatc 1801 ccatccataa gatagatgcg tcctcgcgga gaatccatct ccttgccaat cacccgataa 1861 ctaggacaag tcgagagaca aaacccacaa tgaacacaac tatcaatcaa ctttggattt 1921 ggcggatgat ttccatcaaa accagtcaaa ttctccaaac tagccttctt attctcagaa 1981 ccttcaaaaa cttgcatatt atctcttcct ctcccctctc tttctctgct tcctctgcgc 2041 ctctgcggtt cgtttatcct aaattccacc cacaaaacga cccggactca aaatacattc 2101 cttatcaaac tgccctttaa tcaaacgcat caactgcaca gcattaccag tgtagcccca 2161 tacatccaac tgttgcttaa ccttcaccgg cgctgacaaa atagtaagga aaccgccatt 2221 ggtttgacaa agatttcgta tccccaacac ctcatcaatc ttgttgtcac cctcaaaccg 2281 caacaccccc aaaccactag caacgtgaat ccaacccaca tctgcttgag tcaaaacttc 2341 aacagcagaa gtaggtaaca ctcctatttt gcaagttatt gccgactcat tgaaagaaga 2401 atgcatttgt tctcgcaata tgcgccataa atcagcttca tccactgctg aatatatcgt 2461 cccttttaac ccaagttgtt ccccgacttc caaaagtcgg tttgactgtt ccttcacact 2521 ctcagcaata ctttggaagc gagcaattaa tcccaatcct tgacctaact ccaagctaga 2581 caccagttgt gttgatagca aatcagcttg ggttggtgtc aacgcagaac ctctgaggac 2641 attcgcagct tgagatatag cctttgcatc accagtcagc accaccgttc ccgatatctc 2701 ctgcatagga taaacgcgaa acgtcacttg acttataacc cccaatgtgc catacgaccc 2761 agtaaataac ttcatcaagt cgtagccagc aacatttttc accactctcc cgccagcttt 2821 ggcgatttgt ccatcagcac gtataaaggt gattcccaac agttgatccc ttacactacc 2881 atagcgttgc cgtagggaac ctgtatcagc agttgcaaca ataccgccaa tggttgctgt 2941 atcttgtgtt attgggtcaa gagcaagaaa ttgccgagag tttaccaaca tttcttggag 3001 acgagaaaat ttcattcccg cttcgactgt caccgttaaa tcgccaacgg cgtgttcaat 3061 gagttgattc aggcgttctg tgctgactac gacgtcagca tctttgacta acccgcccca 3121 gttcagttta ctaccactac cgcatggcaa gacgcgccac ttgttgcgat gtgcttctgt 3181 
gatgacttgc gctagctgtt ctgtggtgtg gggatagacg atacaactgg gaggagttgg 3241 ggaagcaata gtctgtaata tagatttttg ctgactgatt ggtacatctt cccaacagac 3301 agtattttct gtgccgacaa tagatgcaaa acaagaggct agcgctttca tttgtttatt 3361 ttagactctc tatatatcaa tacggtttat gaaagaagta cattgttatt agctttggtg 3421 aaagccttgc tgataaatcc tcagtctaca gcttagtgat gtcccaagac aactcctgtc 3481 aaagtctcaa agcgagacta atgagtcgct tacctatttc aacaagtcac tttgttgcat 3541 ctgttgtggg aatgccaaac ttaacggttg agggtagaac ggaacaagaa gcgatgcctt 3601 ggcgttcccc gtaggggaac gccaaggcaa aaccttccct gaaatccaat tagcaacggg 3661 taagtttgtg acgattgagg tgaatccaga aggtgggttg aatgaaactg tatctcaaat 3721 agagaagcca ttttttgaaa cctcaacaga tgaggaatgg gaagcagcac tcatggattt 3781 ggcaaatagt tcctttttta gcaaaacgct gcctctttca gacgaagcca tcagccgcga 3841 gagtatttat tgcgaacgag aagtatagcc aaatctctct acaagctcta ttctgtcaga 3901 gtatcaagtt tagctagtat tcttaagcct tcttctgggt tggcagttgc agccattgct 3961 ttaaatcttg tataggtatc aaattctgct agagctatgg tagaaagttc ttcaattaat 4021 ttattgacac ttatgccttt ggcttgagca agttctttca atctattatg cttatcgtct 4081 ggtaaacgaa tagttaaagt tgccattttt gtttaactcc taataatttt ttcgggtttt 4141 aatattgata agttaggaaa taacaattca gcattttgga aatctttgac attgtgagta 4201 gcaataattt gggcattccc tgcaactgct aattcaatta agtgattgtc agcttcgtct 4261 tttaaattag gtcgccataa atagtaaata gaaatccatt ggcttacgct catcagtgat 4321 gcaagtaaag cagaaatttc ttcactagtt aaagggcatt tagcgataat ttctgagcgc 4381 tgaatgactg actcatactc agaaaataaa gcatttccca tcaaaggctg atattctctt 4441 ttcaagcagc gtcgaatgag ttctcgactg gagcctttag agctaatcag cgcactaata 4501 aaaacgctgg tatcaactac aattttaatc gccatgtcat tatgatagca tatacgctgt 4561 cataaaagtc ccagtcaaaa gactcaaaag cgatcgccga aggcgggcac tctgtgccaa 4621 tgcccgaagg gcacgctacg cgaacgcact cttagcaacc gtgtcaaatc agcgatgtct 4681 ggcgacaaga agcagagctt ctacgcactt ctggtgatgg ggttaatgta ggcgatgtct 4741 ggtgataaat cgctttgcgt ctacccactc ttagcaactg tgtcaaatca gcgatcgcca 4801 gtcgagtatc aataagtcta aacgtgatgg tgcagagcgc agacgccaag ggcgtatcgc 4861 acttcgttgt 
acactcttcg ctgacacaag cattccacta aattcttaac tgatacctta 4921 actggtatat ctggtctggt gcaaatacgg gaaacattgc atactcggat taaggctaaa 4981 taaagggagc aattacagaa gtgtaatcgc tcttgtttat ttacgaaagc cttaagttga 5041 actgagaaat ttttagttaa ttagcttttg agagaataga gaaatatggg ttacaagata 5101 gtaaaagaaa aagagaattc aaccaaccct gaaaaaaagg attttaatat taatatagaa 5161 gttcccaaat ggactttgaa tgaactagct ttgtcatctg ccattaaaga ccaaattgat 5221 gaaattgttg cttatataaa aaatcgagat attttattag atgaatggga attcaaaaag 5281 tttcttaaaa gtggtaatgg aatatctatt aatttttttg gtcagccagg tactggaaaa 5341 actgtgactg ccgaggctat cgctgataaa ttaggagtga atattattaa agttaattat 5401 ggagaattag agtcagaatt ggtgggaaga acgtcaaaaa atttatcaga actgtttgca 5461 atagctgaaa acagcagaag tctgttattt tttgatgaag ctgacacgtt attgagtaaa 5521 agaatttcta atttatcaca agcagcagat tatggtgtca actcagtcag agctacatta 5581 ttaaccttac tagaaaagtt taatggggta ataatttttg ctactaattt atttgaaaat 5641 tacgatgaag cttttataag aagaatttta tttaatatag agtttacttt accagatact 5701 actatgagaa tccaattatg ggaattccat ctctctccca aaattcctaa ggaaatcagt 5761 tatgataaag ctgctgaaat cagtgaaggt ttagctggcg gtgatattaa aaatattact 5821 tttaaattgg ctttaaaatt gttaacaaaa aagatagaat ctatttctga agatataatg 5881 aaagaagaga tccaaaaata tttgcaaacc aaagaacaac acaaaaaagg gcgttcgtta 5941 gaatcttcta cggtaacttt taagagcgat gtcaaaatag attcaaatgt tttatcaaca 6001 gatatgtcaa attgaaaagc tgtatgtaaa cgagtattat caatactact ttctaaacag 6061 cttactgata acaatatctg gtaacaatct gtgaaaaagc gctaaagttc catcaatacc 6121 ctatgaacag ttaccttttc ctacagatgt taagtaaaga aacagtattt ttagataatt 6181 agaggtaaaa atttatgcct ttagatccaa taagttggtt aataattggt gcaattattg 6241 gtgctggtac tgtagtattt tgggacagaa ttaaagactg ggctactcgt gtaatgggtt 6301 ttttacttga tcaaatgaac aagttagttg aatttcttgt agggggtgtt gttttcttga 6361 ttaaagaagg aaaaaaatat tacaaaaaat tatacttata cacccgagat aaagagagtc 6421 aaaagcctta tcgtagggaa tctgataaag tagagataaa agaagctgat ataccttcgg 6481 atctgcgtga tattgtgcct gaaaaaacga ataacgagga aacaggactc caggttgcaa 6541 cactaaagta ctaaagcaag 
ttgctagtta gtgcgtgtaa acggtatctt aatgatacaa 6601 aaagtgcatt ttaaagtact gttaagtaca agtaatttca gttttgtact taaattgagg 6661 cactttttga taactagcaa cttgggtaac tacaaaacta aaaacttgat gatattcaac 6721 ttaaaaactc aattaattct ttaagaggtt gttatgtcaa aagaagaaaa tttgactgat 6781 aagtcccaaa gattatttaa gactacacag cttggtattg ataaaattaa aagggcatgg 6841 gaagaaaaaa acacgatatt tgatgactgg gaatctatta atgaaaatgt tctagatttt 6901 aatttaattg aaaaggacat aataaatatc cacaatgaaa ttaaacaaga taatgttgta 6961 attaagaaga cacaatacac tattagtg // LOCUS NODE_4427_length_6973_cov_5.1902286973 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6973) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6973) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6973 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 554..871 /locus_tag="DP116_25960" CDS 554..871 /locus_tag="DP116_25960" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25960" /translation="MFVSYFTKSTQEVFLALDDTGSIKPPAYRPSLPIGLTHELRKNK IVRLLPYVAMTQLRYFCVSPDPERFFQNFFAQIKINCQSYRHFELRTLSDKKEKEKYS FSL" gene complement(1135..1770) /locus_tag="DP116_25965" CDS complement(1135..1770) /locus_tag="DP116_25965" /EC_number="2.4.2.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019494981.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uracil phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_25965" /translation="MHSQVKVIEHPLIQHKLTLMRKAETTTTTFRVLLKEISLLLTYE ITRDFPVKYEQIKTPLAPMNAPVIALDKKLVIVSIQRAGQGILDGILELIPSATVGHI GLYRDPKTLIPVEYYFKVPQDIDQRDVIVVDPMLATGNSAVAAVERVKSTNPMSIKFL CILAAPEGIEHFTEVHPDIPLYTTAIDDHLDENGYIIPGLGDAGDRLFGTI" gene complement(1995..3254) /locus_tag="DP116_25970" CDS complement(1995..3254) /locus_tag="DP116_25970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875163.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1688 domain-containing protein" /protein_id="PRJNA477356:DP116_25970" /translation="MEMGTAEAQRSQRKEGKELVAYLRSPSAIRDRCEQLFELAVIGE SDYFNCDLTQLPKVAEYVIEVMREQYPDLQIPFHSRWRHFEAGGVQRLSQLDGKLAEL TPEQKAVAKFDLAIISVLLDAGAGENWYYHERETQLDFKRSEGLAVASFHMFCDGTFS SDRQTAPLQVDAQKLQALTEKELADGFGVNANNPLVGIAGRLKLLQKLGQALLSSPHL FGEQNPRPGNLVNFFIQNSYNKQVAAANVLGAVLEGLSEIWSGRIEVAGINLGDTWFH PRVADDGLVPFHKLSQWLTYSLLEPLQELGLETTGLDVLTGLAEYRNGGLCLDLGLIS PKHPEILLQSHSVASEIIVEWRALTVILLDRIAATVRDKLSMSAEELPLVKILQGGTW TAGRKIAAERRKGAVPPIQIESDGTVF" gene complement(3344..4603) /locus_tag="DP116_25975" CDS complement(3344..4603) /locus_tag="DP116_25975" /EC_number="3.5.4.25" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016875164.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GTP cyclohydrolase II" /protein_id="PRJNA477356:DP116_25975" /translation="MPNQKSVSGHIVLTSHPSRFGPKPISIQWGAADPMQRGPVIATL TKQAHRNVIGTHSGGYAIYRALAVASGVLQSDHKADLTNTSPVEYIGPHPSWADPKKI VSLDPFGATIGETFASYYAQGYDIRPTIAITKAHINIRELQEAVTEGRLQVDGKIMKP GGDLVVTKAAIEPVWYLPGIAKRFNITEGELRRALFEQTGGMFPELVTRPDLEVFLPP IGGITVYIVGDVAAITDPDKPLALRVHDECNGSDVFGSDICTCRPYLVHGIEVCVQTA QQGGVGVIVYCRKEGRALGEVTKFLVYNARKRQEGGDRADAYFTRTECVAGVEDMRFQ ELMPDVLHWLGITRIDRMVSMSNMKYNAITQSGIEIVERIPIPEELIPEDARVEIEAK KAAGYYTTGEVLDSDSLSGVKGRSLAD" gene 4957..6177 /locus_tag="DP116_25980" CDS 4957..6177 /locus_tag="DP116_25980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206139.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_25980" /translation="MLHKAVQVRLYPTKEQQTLLAQAFGCSRWWWNYALNKSIEVYKE TGKGLGQVALNALLPKLKKEKDTEWLADCYSQVLQATTLNLTTAYKNFFDQRAGFPRF KSKHGKQSIQYPQNVKIVDGNVKLPGNIGVVKAKIHRPIEGKIKTCTVVKTPSGKYFA SILTEVEGDQPNFTEGKIYGVDLGLKHFAVVTDGEKISKYDNPKHQAHHEKNLKRKQQ KLARKQKGSNSRYKYRKVVAKVYERVSNSRQDFLHKLSYKLVSDSQAVIVENLHLLGM VRNHNLAKAISDCGWGTFINFLAYKLERKGGKLVEIDRWFPSSKLCSNCFYQMSEMSL DVREWTCPHCGTHHDRDGNAATNIRAEGIRMLKADGSAVSAVGGEVRPKMGRKSNLRH SPMSTEAPSAYAAG" gene 6285..6755 /locus_tag="DP116_25985" CDS 6285..6755 /locus_tag="DP116_25985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131353.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="PRJNA477356:DP116_25985" /translation="MDSLFINSLLEDLKNPDETVRDEATRKIWRIWFQQKGVHGLEVI ERSQKFIDSGKITEAEAVLTELVNDQPDFAEAWNRRAFLYYTNCEYRKSLADCQMVVK INPIHFGALHGMGLCFAAMGEYREAIRAFHRALEIQPYSQVNQKLILECTLRLN" BASE COUNT 2027 a 1579 c 1450 g 1917 t ORIGIN 1 ccgccaagac gagccagcgc cgtgcggggg ttccccccgt tgaggcgact ggcgtggcca 61 aggacaccaa gaaaatcaaa aagaagatag gtaatcttgc accgggaagg gagtaggtat 121 gtatttgtca gttttttttc gttcccttga ttgtcaaagt ttggatcgtt cataaagaac 181 tctaaccaag ttaggtgttg ctcgttagtg ttatattttc cgtcactgga gttatggctt 241 ttggataagt tggcgaataa ggcgcttgaa ttggtacttg ttttttattg acatagtttt 301 ttaacaaaac cgattcttta taagatctaa aaaggttgtg ctctcaagct caagaagtga 361 agaatcagat ttgcaatagc tactgcttag cttaatgttt gagtttggga tacaaccata 421 tagtaaataa ctgattcatt catagtaaac acctcaaact attatggtgc ttattataag 481 actacgaatt attagaatat aagtaaagta aaattggttg ctttttgaat cagcttctca 541 gttttatttt gctatgtttg tttcatattt tacaaaaagt acgcaggagg tgtttcttgc 601 tctggacgat acgggtagca ttaaacctcc ggcttatcgc ccttctttac ccataggact 661 tacgcacgag ttacgaaaga acaagattgt gagattgctt ccttacgtcg caatgacaca 721 actacgttat ttttgcgtaa gtccagaccc agagagattc tttcaaaatt tctttgccca 781 aatcaaaata aattgccaat catataggca ttttgagctt cgcaccttgt ctgataagaa 841 ggaaaaggaa aaatattctt tttccctttg aaattttacc cttcgggtat gacctccggt 901 cacgctgcgc tttagccctt tgggcgtgcg ctttgcgcat acgcagagcc ggcacgaggg 961 aaaacgccag gtcctacaac ggggggaacc cccctccggg ttcgccagat gcctacggag 1021 ggagaccctc ctgcagcact ggtctcaccg caacggactg gctcccctcc cgcagcgctg 1081 tctcaccttt tacgttcttt ttaccgtgaa attacataat cgaaatgcac gttcttatat 1141 cgtcccaaac aacctatccc ctgcatctcc caatcctgga ataatgtaac cgttttcatc 1201 taagtggtca tcaatagccg tagtgtaaag aggtatatct gggtggactt cggtgaaatg 1261 ctcaatgcct tctggtgcag caagtatgca aagaaactta atcgacattg gattcgttga 1321 tttaactcgt tcaactgctg cgacagctga gttaccagtt gcgagcattg gatcaaccac 1381 tattacatct cgctgatcta tatcctgagg aaccttaaaa 
taatactcaa cgggaatgag 1441 ggttttaggg tcacggtata atccaatgtg tcctactgtt gctgatggta tgagttcgag 1501 tattccatcc aaaatccctt gtcctgctcg ttggatggaa acaatcacca gttttttatc 1561 caaagcaatg actggggcat tcattggtgc cagaggagtt ttaatctgtt catatttcac 1621 agggaaatct cgtgttattt catacgtcaa cagcaggcta atttctttga gaaggactcg 1681 aaatgtcgtc gtagttgttt ctgctttacg catcagtgtg agtttgtgtt gaatcaatgg 1741 atgctcgatg actttaactt gactgtgcat atgtcttaaa tgttttatta tctgaatgtg 1801 tgcctcataa gacgcagaaa gacgctcctg tggggtgcag tcaacgtgga aatcataact 1861 aattagccga acatgatatt aagcacgaac aacgtaacaa tagcccacag cgatccaacc 1921 gaacagcgca gccgtgccgt aggcatagaa cataagaact caacaaatga ctaatgagta 1981 atgactactg atagttagaa tactgtaccg tcgctttcta tctgtatggg gggtacagca 2041 ccctttctac gttcagccgc aattttacgt ccagctgtcc aagttccacc ttgcaggatt 2101 tttactagtg gtagttcttc ggcactcatg ctcaacttgt cgcgtactgt ggcggcgatg 2161 cgatccaaca gtatcacggt caaggcacgc cattcaacga taatctctga tgcaacagaa 2221 tgagactgga gcaaaatttc cggatgctta gggctaataa gtcccaaatc aagacacaat 2281 cccccattgc gatattctgc taatccagtc aggacatcca aaccagttgt ttccaagcca 2341 agttcttgta gaggttcaag cagagaatag gttagccact gggatagttt gtgaaatggg 2401 actaagccat catcagcaac acgtggatga aaccaagtat cccctagatt tattccagca 2461 acctcaattc gtccagacca aatctcactt aacccttcta aaactgcacc taacacattt 2521 gccgcagcta cttgcttgtt gtatgagttt tgtatgaaaa agtttaccaa attgcccgga 2581 cggggatttt gttcgccaaa tagatgaggt gaagaaagta gggcttgacc caatttttgc 2641 agcaatttta accgtccagc aatgcccacc aagggattgt ttgcattcac accaaagcca 2701 tctgctaatt ctttttctgt taaggcttgc aatttttgag catcaacttg caaaggtgct 2761 gtttgcctat cactagaaaa agttccgtca caaaacatat gaaaactcgc aactgccaaa 2821 ccttcagaac gcttaaaatc tagctgagtc tcccgctcgt gatagtacca attttcccct 2881 gcaccagcat ctagtaaaac actgataatc gccaaatcaa acttagcaac ggctttctgt 2941 tcaggcgtga gttctgctaa cttcccatcc aactgagata gacgctgtac acctccagcc 3001 tcaaaatgtc gccaccggct gtgaaacgga atttgcaaat ctgggtactg ttcccgcatc 3061 acctcaatga catactccgc cacttttggc aactgtgtta aatcgcaatt 
aaagtaatcg 3121 gactcaccaa taaccgccaa ctcaaacaat tgctcacaac gatctcggat agcactaggc 3181 gatcgcaaat aagcaaccaa ctccttcccc tctttcctct gcgatctctg cgcctctgcg 3241 gttcccattt ccattccgat ccactcctaa tattcacccc ataacagaag caagagagtg 3301 aagaagtcaa cagaagttcc cctccctctc tcgcttctga aactcaatct gctaaagaac 3361 gtcccttgac accactcaaa ctgtcggagt ctaacacctc tcctgttgta taataacccg 3421 cagccttttt cgcctctatc tccacccgcg catcttcggg aatcaactct tctggaattg 3481 gtattcgttc cacaatctca attcccgact gagtaatggc gttgtacttc atattactca 3541 ttgataccat acggtcaata cgggtaatgc ccaaccagtg caacacatcg ggcattagtt 3601 cttgaaatcg catatcttcc acgccagcca cacactcagt gcgggtaaag taggcatcag 3661 cgcgatcgcc accttcttga cgcttacggg cgttgtaaac caaaaatttc gttacttctc 3721 ccaaagcacg cccctcttta cgacagtaaa caatgacacc tacgcctccc tgttgtgcgg 3781 tttgtacgca aacttcaatt ccatgtacca gataggggcg acaggtgcaa atatccgacc 3841 caaagacatc agaaccatta cattcatcat gtactcgcaa tgccaaaggc ttatcagggt 3901 cagtaatagc tgctacatcc ccaacgatat acacagtaat acctccgatt ggcggtaaaa 3961 acacctccaa atcaggtcgt gtcactaact ctgggaacat tcctccagtt tgttcaaata 4021 aagcgcggcg taattcccct tctgtgatat tgaatcgttt ggcaattcct ggtaagtacc 4081 aaactggctc aattgctgct tttgtgacta ctaaatcgcc accaggtttc ataatcttgc 4141 catcaacctg taagcgtcct tctgtcaccg cctcttgtaa ttcgcggatg ttgatgtgcg 4201 cctttgtgat agcaatcgtg ggacgaatgt catatccctg cgcgtagtat gatgcaaaag 4261 tctcaccaat agtcgcccca aacgggtcaa gcgagacaat tttctttgga tcagcccaac 4321 ttggatgtgg tccaatatat tctactggag atgtattagt aaggtctgcc ttgtggtctg 4381 actgaagaac tccactagcc accgctaatg cccggtaaat cgcataacca ccagagtgag 4441 taccaattac gttacggtgt gcctgtttag tcagtgtagc aatgactgga ccgcgttgca 4501 ttggatcggc tgctccccat tgaatagaaa tcggtttggg accaaaccga ctgggatgcg 4561 aagtgagaac aatatgcccg gaaacgcttt tttggtttgg catagtcgtt tctttttctc 4621 ttacttttta ttaattctta gtcatttatc ttttctctac aactatcacc agttgcgact 4681 tgtaaaagta tgacatacca atgactctaa acaatgactt gaaatcttga gttaggtatt 4741 aatcgtattg cagactacag aaaatagcaa ttaatttgct attttttaac taattctgcg 
4801 gaattctcca ccgtctttta cggtggggat gaatagcagg gagattggaa ggtgcatcct 4861 ttgtggatgc ccgaccaatc tccaaatatt ttccggatta gctttactgc taagtataga 4921 tagtagtaga ataagtgata ccaaggaggt gattaagtgc tacacaaggc tgtccaagtt 4981 cgtttatacc cgaccaagga acaacaaaca ttgctagcgc aagcattcgg atgctctcgt 5041 tggtggtgga attatgcctt gaataaatct atcgaagttt acaaggagac gggcaaaggg 5101 cttggacaag tagcactcaa cgcacttcta cctaaactca aaaaagagaa agatacagaa 5161 tggttagctg attgctatag tcaggttttg caagctacaa cacttaatct aaccacagcg 5221 tataagaact tttttgatca acgtgcagga tttcctcggt tcaaatctaa gcatggcaag 5281 cagtcgattc aatatcctca aaatgtcaaa atagtagatg gcaatgtaaa gctcccaggc 5341 aatattggag tagtcaaagc caagatacat agaccaattg aaggaaaaat caagacttgt 5401 accgtcgtca aaactccatc tgggaaatac tttgcatcta tcctgaccga agtagaagga 5461 gaccaaccaa actttaccga aggaaagatt tatggtgttg atttagggtt gaagcacttt 5521 gctgttgtta ctgacggcga aaagatttct aaatacgata atcctaaaca tcaagctcac 5581 cacgaaaaga atctcaagcg taaacaacaa aaactagcac gtaaacaaaa aggaagtaat 5641 tcacggtaca aatatagaaa agttgttgcc aaggtatacg aacgggttag caattcgcgg 5701 caagattttt tacataaact tagttataag ttggtcagcg atagccaagc tgtcatagta 5761 gaaaatcttc atcttctagg catggttcgt aatcacaatt tggcgaaagc aatatctgat 5821 tgtggatggg gaaccttcat caacttctta gcctataagc tagaacgcaa gggtgggaag 5881 ttggtcgaga ttgatagatg gttccctagt tccaaactct gctcaaattg tttttatcaa 5941 atgagtgaga tgtcgttaga tgtgagggaa tggacgtgtc cccactgtgg tactcaccat 6001 gaccgagatg gaaatgcagc aaccaatatt agagcagagg gtatcagaat gctaaaggcg 6061 gatggttcag ccgtctctgc tgtaggaggg gaagtaagac caaagatggg gcgaaagtcg 6121 aatcttcggc attcgcctat gagtacagaa gccccgtccg cctatgcggc ggggtagttc 6181 acttcttcac attttactca gtactcaaca ctttttagtg acttctgaga tcagaatatg 6241 cgaaatttga tccagagcaa ctatcaacta acgaccttga attcatggat tctttattta 6301 tcaattcctt acttgaagac ttaaaaaacc ccgatgaaac agtccgggac gaagcaacca 6361 ggaaaatttg gcgtatttgg tttcagcaaa agggagttca tggactggaa gtcattgaac 6421 gcagtcaaaa gtttatagat tctggaaaaa tcactgaagc cgaagctgtg ctaacagaac 6481 
ttgttaacga ccaaccagat tttgctgaag cttggaatcg ccgtgctttt ctttactaca 6541 ctaactgcga gtatcggaag tctttagcag actgtcaaat ggtcgtcaag atcaatccaa 6601 tacatttcgg tgcacttcac ggtatgggct tgtgttttgc cgcaatggga gagtatcgtg 6661 aagctattag agcatttcac cgtgctttgg aaattcagcc ttattctcaa gtcaatcaaa 6721 aattgatttt ggaatgcact ctccgattaa actaactaca acaaatttgg agaaagcaaa 6781 attcaccaca ggctatagca atagcaacca gcaaaaactc tggcttggat aaagtcacct 6841 ttgggatcaa ctgggtagca tatgcacaac aagggagaca atgtccacgc tattgcaaca 6901 gccatctaca aaaactatgg aacagggaac agggaacagg gaacagggaa cagggaacag 6961 ggaacaggga aca // LOCUS NODE_4465_length_6886_cov_5.1171136886 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6886) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6886) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6886 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(696..2111) /locus_tag="DP116_25990" CDS complement(696..2111) /locus_tag="DP116_25990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879580.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25990" /translation="MMKKLISSHPIIWMIGASVFILFGCSSLRHELFKSTAYDLGIFD QAVYLISQGQSPISSFLGFHILGDHAALIFYLLALFYKIYPSVYWLFAVQAGALALAA LPIWHLARHANLTTQQGIAVAAVYLLYPLVFNINLFDFHPEVIAVPALLGAVLAARLE RTGWFCLCIFLTLGCKAVLSITVAAMGVWLLVFEKKRRCGAIALASGIIWFVIATKIV IPFFGTQAASVERHISRYAYLGNSFPEIAKNLLLNPGLVLGQVFSLANLGYLLLLLAP VIWGLSLQGIQPLIGAFPTLAMNLLADYPLQKDLIHQYSLPAVPFLILSIISTLATGR GWLRNKRNITLWSIVAFIALAKYGYFWSIYSDSLDTWQATRQAVALVQTKGSVLTTAE IAPHLTHRPVVELTSANSPPANLAKFDYILLNVTHPGWQSDQEFATNLVESLKKTQLF QLRYHQDGVHLFVKEVFRNGV" gene complement(2163..3437) /locus_tag="DP116_25995" CDS complement(2163..3437) /locus_tag="DP116_25995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872447.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_25995" /translation="MIKGQIFLNKALWTKIVLFPATMWLTSRLLIWITMLLIAPLLPV PAGGNTTTFGWGVFFAWDSEYYRTIASSGYKFINDGKSHTLAFFPLFPLIIRVLMNLG LPFEIAGTLVSNLAFLAALYFLYFWVKEQAGESAARWTTAVVAWCPLSIFAGVIYTEG LYLLLSTAALRAFDKQHYGWTVFWGAMATATRPTGMALITAFALAAWKQRRPPIAYLA SFAAGAGLVFFSIYCAIQFGDPLGFIHAQQKWRPTLGFDWQGWWKMLMQVTVGTLNWK HGGIKDPLHPLLFTMIIGIGSCLWYFRKQLGSQKVDYGFAALVFLLWILAGDPLINAI AIFGSVYLLWQLRAQLTPVTVIYGFCGIGLLLASGSTISLSRLVYGIVSPSVALGVLL SRYPRWGYLMLSFFAILLASFAVRFAQELWVG" gene complement(3549..4832) /locus_tag="DP116_26000" CDS complement(3549..4832) /locus_tag="DP116_26000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311601.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26000" /translation="MAEIQIFIKKVVWKNDFLFPTIIWLVSRFFIWVTMLLVVPHLSA PSQGVTPQFGLGVFDAWDSVHYRSIATSGYEFSDDGKQHNLAFFPLFPLAIWLFMRFG LSFEVAGTLINNLAFFAALYCIYFWVEEHCGTKEARWATAVVAWCPLSMFGTVIYTEG LYLLLSTAALRAFDQKQYYWTLLWGAMATATRPTGLALIPAFLLAAWKQRRPLMAYVA GCGSAMGVLLFSLYCAIQFGNPFGFIHAQRGWRPSLGFDGQGWLNMLLQITVGTSNLE SLSINNYLHLLLFGVVVSYGYCLWRFRKQLNFLAVWGFYGLSICLLILADDWFIYNFL NGVMVCGGTYMLWHSRTQLTPVTVIYALCGISLLLASGGTISLGRLAYGIVSLSVACG VFLSRYPRQGYLSLGLFAILLVRLSVKFAQELWVG" gene 5499..6695 /locus_tag="DP116_26005" CDS 5499..6695 /locus_tag="DP116_26005" /EC_number="1.1.1.267" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456040.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose-5-phosphate reductoisomerase" /protein_id="PRJNA477356:DP116_26005" /translation="MKAITLLGSTGSIGTQTLDIVAQYPDQFRIVGLAAGNNVDLLAS QIRQFRPSIVAICSEEKLPALKEAIKDLAPQPILMAGDAGVIEVARYGDAQTVVTGIV GCAGLLPTIAAIEAGKDIALANKETLIAGGPVVLPLIKKHSVKLLPADSEHSAIFQCL QGVPDGGLRRILLTASGGAFRDWAVEKLAEVTVADAIKHPNWSMGRKITVDSATLMNK GLEVIEAHFLFGLDYENIEIVIHPQSIIHSLIELQDTSVLAQLGWPDMHLPLLYALSW PERIYTNWERLDLVKIGSLTFREPDHQKYPCMKLAYAAGRSGGSMPAVLNAANEQAVA LFLEEKIQFLDIPRCIEWVCDRHERDNRPNPSLDDILTADKWARQEVLKATQHLENRP RIISLR" BASE COUNT 2191 a 1444 c 1374 g 1877 t ORIGIN 1 attataagta tgatgaattt ttggcataaa taatgttttg cattgtttat tttgaagtat 61 cagttacata gaggatactg tatattctac tgctaaaggt tattgaatac agaaaaaaaa 121 gttaattttt taattgattg tctgttacca ttagccctat ggaaaagaat agacgcccaa 181 aaacaagata aagtttctta ttgtaaaata tacattttcc cttaacgcag agtttgctga 241 gaatcatact cttctcagta cagctttacg gacagattgg atgtttcatt tatagtcaag 301 ctgatatttc agctaggtca tttatcagtt attcctaata attcagtgat tatagtagtg 361 aatctacagt cacataatag ttaggattgt tactttattg ttattcaatc tcagacttct 421 gttttcagta aaagtacgga aaatctcaga aatggtacaa cgaaattttg tttaacaatc 481 aaagaatgtg 
tgcctattaa agcatattct ttccgatata acgatgtgat aagatatact 541 gaattaatct ttgctcatct aaaaaatatt tgatgaaata ttttatgaag ctaagtttag 601 atagagctaa acagtattgt aaaacattgc cacaaatacc gcagttaact caaggataat 661 gccagtaagg atgaaatcac tcctatttgc cttgattaaa cgccattgcg gaaaacttct 721 ttcacaaaca aatgcacgcc gtcttggtgg tagcgtagtt gaaataactg agttttcttg 781 agagactcta ccaagttagt cgcaaactct tgatcacttt gccagccagg atgagtgaca 841 ttcaacagta tatagtcaaa ctttgccaaa ttagcaggtg gtgagttggc gcttgtcaac 901 tcgacaactg gtctatgtgt taaatgtggg gcaatctcag cagttgtcaa aacagatcct 961 tttgtttgaa ctaaagcaac cgcttgtcgt gttgcttgcc aagtgtccag tgaatctgag 1021 tagatagacc agaagtatcc gtatttggct aaggcaatga atgcaactat tgaccacaat 1081 gttatatttc gtttattgcg taaccatcct cgacctgtag caagggtaga aataatgctt 1141 aatataagaa atggtactgc tggtagagaa tactgatgga tcaagtcttt ttggagtggg 1201 taatcggcaa gaagattcat agccaaagta ggaaaagcac caatcaaagg ctgtatccct 1261 tggagtgaaa gtccccaaat cacaggcgct agtaacagca gtaaatatcc taggttagcg 1321 agtgaaaaaa cttgtcccaa cacaagtcct gggttgagga gtaaattttt agcaatttct 1381 ggaaatgagt ttcccaaata agcatagcga gaaatgtggc gttccactga cgccgcttgg 1441 gtaccaaaaa aggggatgac aattttggtt gcgatgacaa accagatgat accgcttgcg 1501 agagcgatcg caccacatcg tcgcttcttt tcaaacacca atagccaaac tcccattgct 1561 gcaactgtta tagataatac agctttacat cccagcgtca ggaagataca taaacaaaac 1621 cacccagtgc gttcgagtct agccgctaaa accgccccta acaatgctgg cacagctatc 1681 acttctgggt ggaaatcaaa gagattaata ttgaacacta atggatacag taggtacacc 1741 gccgcgactg ctatgccttg ttgagtagta agattagcat gacgcgctag atgccatatg 1801 ggtaaagcag ctaatgccaa agcgcctgct tgcactgcaa acaaccagta gacgctgggg 1861 tatattttgt aaaataaggc taaaagataa aaaattaaag cggcatggtc accaagaata 1921 tggaaaccaa ggaaagaaga aataggtgac tgtccttggc taattaagta aactgcctga 1981 tcaaaaattc ctaagtcata agcagttgat ttaaatagct catgtcttaa actgctgcac 2041 ccaaataaaa tgaatacact cgccccaatc atccaaatta taggatgaga cgaaatcaat 2101 tttttcatca tcactccctc cctcactccc tcacttcctc acttcctcac tccctccccc 2161 atttacccta cccaaagttc 
ttgggcaaat cgaaccgcaa agcttgcaag taggatggca 2221 aaaaatgaca gcattaggta accccaacga ggatagcgag acaataatac accgagagcg 2281 acacttggag acacaatacc gtacaccaaa cgactcaaag atatagtact cccagatgct 2341 agcaacaaac ctataccaca gaagccatag ataacagtga ctggagtcaa ttgagcgcgt 2401 aattgccata ataggtaaac actaccgaaa atagcgatcg cgttaatcaa agggtcgcct 2461 gctagtatcc ataaaagaaa gactaaagcg gcaaagccat agtccacttt ctgtgaacct 2521 agctgtttac ggaagtacca taaacaagaa ccgatgccaa taatcatcgt aaacaataga 2581 ggatgcagag ggtctttgat cccgccatgc ttccaattca atgtcccaac tgtcacttgc 2641 atcagcattt tccaccaacc ctgccaatca aacccaagcg ttggtcgcca tttctgttgt 2701 gcatggataa atcccaaagg atcaccaaat tgaatcgcac agtatatgct gaaaaaaact 2761 agtccagcac cagcggcaaa agacgcaaga taagcaatag ggggtctgcg ttgtttccaa 2821 gctgctaagg caaacgcggt tatcagtgcc attcctgtcg gacgtgttgc tgtcgccatt 2881 gcaccccaaa agacagtcca accataatgt tgtttatcaa aagctcgcaa agctgctgta 2941 cttaagagta agtatagccc ttcggtgtaa atgactcctg caaatattga taatggacac 3001 caagcaacca cagctgttgt ccaccgcgcc gcgctttcac cagcttgttc cttaacccaa 3061 aagtataaaa agtaaagcgc cgccaaaaaa gccagattgc tgactaacgt ccctgcgatt 3121 tcaaacggca agcccaagtt cattaaaact cggatgatta aaggaaacag gggaaagaag 3181 gcaagggtgt gtgacttgcc atcgttaatg aatttatatc cagagctggc aattgtgcga 3241 taatattcac tatcccatgc aaaaaacacc ccccaaccaa aagtggtggt attgcctcct 3301 gctggtactg gtagcaatgg tgcaatcagc aacattgtta tccagatgag tagtcggcta 3361 gtaagccaca ttgttgctgg gaagaggacg atttttgtcc ataaagcttt gttcaaaaat 3421 atctgacctt taatcatgag tgaggtgtaa agaaggtgta gggatatttt tctgaagaaa 3481 aatgcggaat tctttgttgt taaattttga attgttcaag tcctacacca agtgaaatgt 3541 tttttagctt atcctaccca aagttcttgg gcaaatttaa cagagagtct gacgagtaag 3601 atagcaaaca aacccagact taagtaccct tgacgaggat agcgagataa gaagacacca 3661 caagctacac tcagtgagac aataccgtaa gccaaacgac ccaaggatat cgtgcctccg 3721 gaggcaagta gcaaactgat cccgcacaaa gcataaataa cagtaactgg ggttaattga 3781 gtgcgtgaat gccacaacat ataagtgcca ccacaaacca taactccgtt cagaaagtta 3841 tagatgaacc aatcatctgc tagtattaac 
aagcagattg ataaaccata aaagccccag 3901 acagcaagaa aattcagttg cttacggaag cgccacaaac aataaccgta gctgacaaca 3961 accccaaaaa gtaggagatg caggtagtta tttattgata aagattctaa attggacgtt 4021 ccaacagtga tttgcagtag catatttaac caaccctgtc catcaaatcc cagggagggt 4081 cgccatccgc gttgtgcatg gataaatcca aagggattgc caaattggat tgcacagtac 4141 agactaaaca aaagcactcc cattgcactg ccacacccag caacgtaagc catcaaaggt 4201 ctacgctgtt tccatgcggc taacaaaaat gcaggtataa gcgctaaccc cgtaggacga 4261 gttgccgttg ccattgcacc ccaaaggaga gtccagtagt attgcttttg atcaaaggct 4321 cgcaaagccg cagtgctcaa cagtaagtat agcccctcgg tgtaaataac cgtgccaaac 4381 atggacaatg gacaccaagc cacgacagct gttgcccaac gcgcctcctt tgtgccacaa 4441 tgctcctcta cccaaaagta gatacagtaa agggctgcaa aaaatgccaa gttatttatc 4501 agcgtacctg ccacttcaaa tgacaagcca aaacgcatga aaagccaaat agctaaggga 4561 aataagggga aaaatgcaag attgtgttgt tttccatcat cgctaaattc ataaccagat 4621 gttgcgatcg aacggtaatg tacactatcc catgcatcaa atactcccaa gccaaattga 4681 ggggtaactc cttgcgatgg tgcgcttagg tgtggaacaa ccagcaacat agtaacccaa 4741 ataaagaatc ggctaacaag ccagatgatt gtaggaaaaa gaaaatcatt tttccataca 4801 acttttttga taaatatttg aatttcagcc ataagcttga gtgttattca aactcctcta 4861 tgcattcaca tacttatggc tgtcatttgg tgatttatta gtaacattgc gcacattcca 4921 ctcctccacc aaaatgagtc attacacgaa aagaaacata gctgacaaag ttttcagcta 4981 tttactatca gtagttataa tttagctgat atccatatga accttggcag ttagatttaa 5041 ctatatagtc tttctataca acattttttt ttgcaaagaa aagttgcttt ggagagatgt 5101 ttaactcatc ttgtggcttc tagtgcatgg cagtagaact caagattggc aataaaacat 5161 tgggtagggt gaagttggtt cgttttccca ctttttgaag gtatccccca gcaattttgc 5221 ctcttgataa ctcactcact ccccatcagc cgttagcact gacaatatcc cgttaagttt 5281 agtaacaaat cgaactgaat ttttcgccaa cgcaaattaa cgcacttcgc aaacggattc 5341 gctctgttga agcaccataa acgcttgttg gttacagcgc ttattcttat gattagacca 5401 caaatcctag ggcgggcata aggatatagg ataaatcaat ttatttgtgt atcccaagca 5461 atctatatta atggttgtct gtcaatcacc aaacaattgt gaaagcaata actcttcttg 5521 gttctactgg ttctatcggt actcagactt tagacattgt 
cgctcaatac cccgatcaat 5581 tccggattgt gggcttagca gctgggaaca atgtggattt gttggcttct cagattcggc 5641 agtttcgacc aagtatagta gctatctgct cagaagagaa gttgccagca ctcaaagaag 5701 caatcaaaga ccttgctccc cagccgattt taatggctgg tgatgctgga gttattgaag 5761 ttgcccgcta cggagatgcc caaaccgttg ttacgggtat cgttggttgt gctgggttgc 5821 tacctaccat agcagccatt gaagctggta aagatatcgc cttggcaaac aaggaaacct 5881 taattgctgg aggtccggta gttttacctc tgattaaaaa acacagtgtg aaattactgc 5941 cagcagattc ggaacattct gctatatttc aatgcttaca aggtgttcca gatggaggtt 6001 tacgacgaat tttattaacc gcatctggtg gtgcttttcg tgactgggca gtagagaagt 6061 tagcagaggt cacagttgct gatgctatca aacatcctaa ctggtcaatg gggcgtaaaa 6121 ttactgttga ttctgcgact ttaatgaaca aaggtttgga agtgattgag gctcatttcc 6181 tctttgggtt ggattacgag aatattgaga ttgttattca tccccaaagt attattcact 6241 cactcattga actacaagac acctctgttc tcgcccaact aggttggcct gatatgcact 6301 tacctttact atatgctttg tcttggcctg aacgcatcta caccaattgg gaacgcctag 6361 atttagtcaa aatcggcagt ttaaccttcc gtgaaccaga tcatcagaag tatccttgca 6421 tgaaattagc ttatgcagca gggagatctg gtggttccat gcctgctgtg ttaaatgcag 6481 cgaatgagca agctgtggct ttgtttttag aagaaaaaat tcagttttta gatattcctc 6541 ggtgtatcga atgggtgtgc gatcgccacg aacgcgataa tcgtccaaat ccttctttag 6601 atgacatttt gacagcagat aaatgggcaa ggcaggaagt tttaaaagca actcaacact 6661 tggaaaatcg tccgcgtata atatctttgc gataaaaaac gactaacaac cctagatttt 6721 cattgtattt atgttcagcg ttcagcttac tgctgaacgc taattttttt tattcgtaat 6781 ttttacctta tagcaatgta taagaatata aacatatagt tctgttgttt tttctgagtt 6841 tcgtcattta gttattcatt tataatttat tatactgaga aagaat // LOCUS NODE_4490_length_6837_cov_4.9671196837 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 6837) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6837) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6837 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1706) /gene="nifJ" /locus_tag="DP116_26010" CDS complement(<1..1706) /gene="nifJ" /locus_tag="DP116_26010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872961.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyruvate:ferredoxin (flavodoxin) oxidoreductase" /protein_id="PRJNA477356:DP116_26010" /translation="MKKTFATIDGNEAVARVAYRLNEVIAIYPITPSSSMGEWADTWS SEERPNVWGTIPRVVAMQSEGGAAAAVHGALQTGSLTTTFTSSQGLLLMIPNLYKIAG ELTSAVIHVAARSLATHALSIFGDHSDVMAARATGFALLCSASVQESQDFALIAQAVT LQARVPFMHFFDGFRTSHEIQKVQLLEDSDLQALIDEELVFAHRDRALTPDHPVLRGT AQNPDVYFQSRESINPYYNVCPDIVQQAMDKFGEITGRYYRIFEYHGAPDAEEVIVIM GSGCETVHETVDNLIARGKKVGVVKVRLFRPFDIKRFVEVLPASVKAIAVLDRTKEPG SAGEPLYLDVVTAVHEEWLDRQRGSGRISTPKIIGGRYGLSSKEFTPAMVLAVFENLT QAKSKNHFTIGINDDVSHTSLPFDADFSIEPDNVVRAMFYGLGSDGTVGANKNSIKII GEQTDNYAQGYFVYDSKKSGSITVSHLRFGSQPIRSTFLISQANFIGCHQWVFVERID VLKAAVVGGTFLLNSPYDADTVWEHLPVEVQQQIISKQLKFYVINANQVARESGMGGR INT" gene 2776..3801 /locus_tag="DP116_26015" CDS 2776..3801 /locus_tag="DP116_26015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656195.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_26015" /translation="MYICTLAACGVVAVISSSALAIEVNLTDEFADSTSANELYFNTA ISLPDKAQDYLSNQPVFQDLVSDAVEQQPPNQAQSPLPDKSQYNLFHPTPKNLWRELS TDRPDQTESPFTVDAGHFQIEADFFVYTRDTNSADDTRTESFNLFVPNFKVGLLNNVD LQIIPEVYNVVRTTPKGGSTEERSGFGDITVRVKVNFWGNDVGKTAFAMMPFIKFPTN QNNLGNNSIEGGIIFPLGIALSDRWDLGMMTEFDFNKNEVDSGYNLGFVNSVTLGYAI NSRWSTYFELFTEKTTEKGSDFIATFDTGLKYLLTENIQLDAGVNIGLKQAADDFQPF VGLSMRF" gene 4041..4517 /locus_tag="DP116_26020" CDS 4041..4517 /locus_tag="DP116_26020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741050.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26020" /translation="MKKTLISAAAFALVSAALISAGYATAKTDNNRVFNVNDSVKFPP NSWRIVKHTFRVQIPRNNNTLSQLIIDTPSSVAVSNDIDVLDDKGQKININISVNGRR ILIDFPEKVISNTKLLIEFNKVRQPTVGPASVYSLWAKAVGNDTEIPVGTAQFSTF" gene 4545..5033 /locus_tag="DP116_26025" CDS 4545..5033 /locus_tag="DP116_26025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877479.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26025" /translation="MKKIMIYAVALALATVTLIPVNYAKASADDSQDPHIDGNAQFPP TRWYVVRHTFRVHIPKNSKEISQLSIQVPTNVTLSNDVDDINVEDKNGQRINTNVSVN DKTILLAFTEPVAPDTQLEIDLKNVKRRTGGNSYFYRLFAKFAGASTEVPIGGASFRI GY" gene 5277..>6837 /locus_tag="DP116_26030" CDS 5277..>6837 /locus_tag="DP116_26030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878023.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux RND transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_26030" /translation="MHSLERSQKSTTLRCISGGFLSLLLLIAPTVVLADGGHGDEFQG GSETSQSGGSVEVDASTVKQLGIKVEPVKRQRLSVGIKTTGQIETLPSQKVEVNTPIS GAKVVELLVEPGAVVKKGQPVAVVTSPDLVTLRVESQEKLASGQADLQQAQADLRLAQ QNYEKYQQIAAAEIASAQNKVTFAQEKYDKDKVVADAGALPRRNALESQTQLAEAKAE LTKASSRRDVIDAENKLKRAQASVEVAKKRIQLSNTNYETRLQQLGNSANAKGLLTVT APISGRVADRQVTIGQSFNDVGGKLITIVNDSRVYATANIYEKDLGKVRTGQRVSLKV ASVPNRTFTGRIAVIGSVVEGETRVVPVKAEIDNPGGVLKPGMFAELEVLTNQTSTAI LAIPNSAVVDVNGKKQVYVQNGNAFQATEVTLGQTSGDMVEVKTGLFEGDMIVTQRAP QLYAQSLRGGTKPTEGEKKEAQPAQKTEVKTPNLPVPLWLLLSGGGAAIAAVGFLAGR RTKRQEVPGTPE" BASE COUNT 2007 a 1412 c 1458 g 1960 t ORIGIN 1 gtgttaatcc gtcctcccat gccactttcg cgagcaactt gattcgcatt gatgacataa 61 aacttcagtt gcttgctaat aatttgctgt tgtacttcca ctgggagatg ttcccaaacg 121 gtatcagcat cataaggact gttcagcagg aatgttcctc caacaacagc ggcttttagg 181 acatcaatgc gttctacaaa tacccactga tgacaaccaa taaagttggc ttgtgaaatc 241 aggaaagttg agcgaattgg ttgtgaacca aaacgcaaat gcgatacagt tattgaacca 301 gatttctttg agtcgtaaac aaagtaacct tgagcgtagt tgtcggtttg ttcaccaata 361 attttgattg agtttttatt tgcccccact gtaccatcgg aacccaaccc atagaacatt 421 gcgcgcacaa cgttatctgg ttcaatagaa aagtcggcgt caaatggtaa agaggtgtga 481 ctaacgtcgt cgttaatacc gatggtaaaa tgattttttg attttgcctg agtcaggttt 541 tcaaaaactg ccagcaccat cgctggggta aattctttgg aagaaagacc gtaacgacca 601 ccgataattt taggagtaga aattcttcca ctcccccttt gtctatctaa ccactcctcg 661 tgaacagcag tgacaacatc caaataaagg ggttcacctg cactaccagg ttctttggtg 721 cgatctagga ctgctattgc tttgacacta gcaggtaaaa cttctacaaa ccgtttgata 781 tcaaaaggac ggaatagcct aactttgaca acaccaactt tttttccacg ggcaatgagg 841 ttatctacgg tttcatgcac tgtttcacag ccggaaccca tgatcacgat aacttcttct 901 gcatcagggg caccgtggta ttcaaagatg cggtaatatc gtcctgtgat ttccccgaat 961 ttatccattg cttgttggac aatgtcggga caaacgttgt aatagggatt aatgctttcg 1021 cgggattgaa agtaaacatc ggggttttgg gcggttcccc gcaagactgg atgatctggg 1081 gttaaagcgc 
gatcgcgatg agcaaatacc agttcctcat caatcagcgc ctgaagatcg 1141 ctgtcttcca aaagttgaac tttctgaatt tcatgagaag tgcggaagcc atcgaagaag 1201 tgcataaatg gcactcgtgc ttgaagggtg acggcttggg caatcagagc aaaatcctga 1261 ctttcctgta ctgaagcgga acacaacaaa gcaaaaccag ttgcacgcgc agccatcacg 1321 tcactatgat caccaaaaat tgacagggca tgggttgcta aagaacgtgc tgcaacgtga 1381 ataacggcac tggtgagttc acctgctatt ttgtacaggt tgggaatcat caacagcaat 1441 ccttgagatg atgtgaaggt cgtcgtcaga ctacctgttt gcaatgcccc atgcaccgca 1501 gccgcagccc cgccttcact ctgcattgcc accactcttg gaatagtacc ccatacattg 1561 ggacgttctt cactcgacca ggtatctgcc cattcaccca ttgatgatga gggggtgata 1621 ggataaatgg caatcacttc atttaggcgg taagcaacac gggcaacagc ctcatttcca 1681 tctattgttg caaaggtttt tttcatgatt tctcctctga gattgccgcc tgctggaatt 1741 gcggtcatct attcctactt agtgatgcaa gtttcgtttt cttcttgagc gatcagtagt 1801 acagattagg agcagccgaa cccggagggt gcaagatatc aggtatcaaa tacataatta 1861 gctgctacac tgtttagctc acttcatcga aatgtaatga gcctaaaact tatgacttgc 1921 tggagataaa ataaactgtg tagttgctac ccgtgacgtc aatttttata tcaatatttt 1981 ttcatgctaa gttatctcgt tggctgacta catttcgctc ttcttttgaa ggaacactaa 2041 tcttgctttt ttagtatcgc atcacatcgt tttcgtttgt ttctgttaat tactgatgta 2101 ctcaaacaac aaaatctcta ttctctaagg attttagaaa ttcaagtctt tttatggggg 2161 tataggttcg agaatttaat cagaactttg atgtaggaaa ttggagtata attaataata 2221 caattatggt attaaataaa catttaatgg aattcacaat tatcattaca agcagaaatt 2281 cttctctctg acaaaaacca gtacaacttg tttcactcta cacccaagaa tctgataatt 2341 ttcagtcttt tgttggctga tcgatgcgtt tttagtctct tgtgcgatag tgtcataaat 2401 ggggtcaaag ctaggggaga aagagggaac tacagcacaa aacatataaa atttaacttt 2461 aattgttctg tttaatcttt cgttacatac ttagaaattt tcctgcccac ctacttacac 2521 ctccactttc tgttagtcac tcaatttggg tttactcaac aaactgctga caaccttttg 2581 ttacgggcat agtacatttg cacaaaggta acttcagttt aagcctatga atgcaacttg 2641 gtctgagacg actggcgtgg gcttcaggag taggattttt gctcaaagac agagattatg 2701 aactaaaaat gaaataaata tgagagtatt atgaaataaa gttctcacaa ctaaggagag 2761 agtattttga ctgaaatgta 
tatctgcacc cttgctgctt gtggtgtggt agcagtcatt 2821 tctagcagtg ctttggctat agaagtaaac ttaacagacg agtttgctga tagcacttct 2881 gccaacgagc tttatttcaa tactgctatt tctttaccag ataaggcgca ggattactta 2941 tccaaccaac ctgtattcca agacttggta tctgatgcag ttgaacaaca acctcctaat 3001 caagcacaga gtcctctccc tgacaaaagc cagtacaacc tgtttcaccc tacacccaaa 3061 aacctatggc gggagttgag tacagatcgc cctgaccaaa cagagtctcc ctttactgtg 3121 gatgcaggac atttccaaat agaagcagat ttcttcgtat ataccagaga tactaatagt 3181 gcagacgata cccgtacaga atctttcaat cttttcgttc ccaactttaa agtcggtctg 3241 ttaaataacg tcgatttaca gattatccca gaagtgtaca acgtcgtgcg tactacaccc 3301 aaaggtggct ccactgagga acgctctggc tttggtgata ttacagttcg cgtcaaggtt 3361 aacttttggg gcaatgacgt cggcaaaact gcgtttgcta tgatgccttt tatcaaattc 3421 cctactaatc aaaataattt gggaaataac tcaattgaag gcggcattat ttttcccttg 3481 gggattgcac tgtctgatag gtgggacttg ggtatgatga ccgagttcga ctttaataaa 3541 aatgaagttg attctggata taatttaggt tttgtcaata gcgtcaccct tggctatgca 3601 attaactcca gatggagtac ttattttgag ttatttacgg aaaaaacaac cgagaaaggc 3661 tctgatttca ttgccacttt tgacactggc ttgaagtatc tgctgacgga gaatatccaa 3721 ctagatgcag gtgttaacat tgggttaaag caggctgcgg atgatttcca gccctttgtt 3781 ggcttatcga tgcgttttta gtctcttgtg cgatagcgcc aagtaaggct ccctctggga 3841 gcgtcgtccg atcacttcac aggtgctgtt gccaggaggg gcactcatga caatttatca 3901 caatatgaat gaataagttt tgttaaagag ccaaaatttc atcttaattt catagtgacc 3961 tgtcattcta gttaaataaa ggttcgtgca aacgatgaat ctgcaatcta attgaataat 4021 tgattcattg gacagacaag atgaagaaaa cactgatttc tgctgcggca tttgctctgg 4081 ttagcgcagc tttaatttct gctggctatg caactgctaa aacagataac aacagggttt 4141 tcaatgttaa tgatagtgta aaatttcctc ctaatagctg gcggattgtt aaacatacct 4201 ttcgagtaca aattcctcgg aataacaata ctctctccca gctaattatt gatactccat 4261 cctctgtggc tgttagtaat gatattgatg tgttggatga taagggtcaa aaaattaaca 4321 ttaatatttc tgtcaatggt agaagaatct taatagattt tccggaaaaa gttatttcta 4381 acaccaaact cttaattgaa tttaataaag tcagacaacc aactgttggt cctgcttccg 4441 tttacagctt atgggctaaa gccgtcggta 
acgacacaga gattcccgta ggcacagctc 4501 agttttccac attttaatct ataaattatc acgacaggtg ttgaatgaag aaaataatga 4561 tctacgctgt tgcattggct ctggctactg taaccttaat tcctgttaac tatgcaaaag 4621 ctagcgcaga tgatagtcaa gatcctcaca ttgatggaaa cgcgcaattt cctcctacac 4681 gctggtatgt tgttagacat actttccgag tacatattcc caaaaatagt aaggagattt 4741 ctcagctaag tattcaggtt ccaaccaatg taactttgag caatgatgtt gatgacatta 4801 atgtagaaga taaaaatggt cagagaatta acactaatgt ttctgtgaat gacaaaacta 4861 tactattagc ttttaccgaa ccagttgctc ctgacactca gctagaaatt gacctcaaga 4921 atgtcaaacg acgaacagga ggaaatagtt atttttaccg cctcttcgct aaatttgctg 4981 gcgctagtac agaagttcct ataggaggag ccagttttcg cattggttat taactctatt 5041 actacccagc cattaaaaac ttttagagtt aagctagagt ttaattgcat atttgttgaa 5101 gtcaagccag aaatttcaca gtaatttcat cctcatatga caaactagtg aaaaggcagt 5161 tttgttagtg cggaacattt gctgtgttta cccaatagca agcgactgta acaggacaca 5221 aatgccagct tcaagtcgtt actctagttc cttagtactc aaggacattt ctaataatgc 5281 acagccttga acgttcccaa aaatctacaa cactccgttg tatttctgga ggattcctga 5341 gcttgctcct actaattgct cccacagttg ttttagccga cggtggacac ggagatgaat 5401 ttcaaggagg aagtgaaacc agtcaaagtg gcgggtctgt tgaagtagat gcctcaacgg 5461 ttaaacagct aggaatcaaa gtcgagccag tgaaacgtca gcggctatct gttggtatta 5521 aaaccactgg gcaaattgag accctgccta gtcaaaaagt ggaagtcaat accccaattt 5581 ctggggcgaa agtggttgag ttgttggtgg aacctggtgc agtcgtgaaa aaaggtcaac 5641 ctgtggctgt tgtaaccagt cctgacttgg tgacactgcg cgttgaatct caagaaaaac 5701 ttgcttcagg tcaagctgat ttgcagcaag cgcaagctga cttacggctt gctcaacaaa 5761 actacgaaaa atatcaacag atagctgcag ccgaaatagc ctctgcacag aataaagtga 5821 cctttgctca agaaaagtat gacaaagata aggtagtagc tgatgcgggc gctctccctc 5881 gtcgcaatgc tttagaatcc caaacccaac tagcagaagc caaagcggaa ctgaccaaag 5941 cttcgagccg ccgggacgta attgatgctg aaaataaact taaacgtgct caagcctctg 6001 ttgaagtagc aaaaaaacgt attcaactca gtaatactaa ttatgagact cgattgcaac 6061 aactgggaaa cagcgccaat gctaagggac tgttaacggt gacggctccc atttccggtc 6121 gggttgctga cagacaagtt accattggtc aatcattcaa 
tgatgtaggt ggcaagctga 6181 taacgattgt caatgatagt cgggtttatg ccacagcaaa tatttatgaa aaagatttgg 6241 gcaaggtaag gacaggtcaa cgggtaagtt tgaaggtagc ttctgtgcct aatcgtacct 6301 tcactggacg aatagccgta attggttctg tggtggaagg cgaaacgcgg gttgttcctg 6361 tgaaagccga aatagataac cccggtggag ttctcaagcc agggatgttt gccgaacttg 6421 aagttttgac aaaccaaaca tcgacagcta tattagctat tcctaattca gctgtggttg 6481 atgtcaatgg taagaaacag gtttacgtac aaaatggtaa tgcttttcaa gcaactgaag 6541 tcactttagg tcaaacctct ggggacatgg ttgaggtgaa gactggctta ttcgagggtg 6601 atatgattgt cactcagcgt gcgcctcaac tttatgctca gtctttgcgg ggtggtacta 6661 agccaacaga aggtgagaag aaagaagcgc aacccgcaca gaagacggaa gttaaaacgc 6721 ccaacttgcc agtacccttg tggttgctct tatcgggagg aggggctgct attgctgcag 6781 ttggtttctt agcaggtcgt cgcaccaaac gtcaagaggt tccaggaaca ccagaat // LOCUS NODE_4528_length_6761_cov_4.1702956761 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6761) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6761) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6761 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..473) /locus_tag="DP116_26035" CDS complement(<1..473) /locus_tag="DP116_26035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197299.1" /note="Catalyzes the transfer of electrons from NADH to ubiquinone; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(P)H-quinone oxidoreductase subunit F" /protein_id="PRJNA477356:DP116_26035" /translation="MSHFLLETVWLVPCYALSGALLAVPWSPGIIKRTGPRPAGYINL VMTFLAFVHSALVLPIAWNQAPYEISIPWLNTAGLNLSIDLEISSLSVGAMVVITGLN LLAQIFAVGYMEMDWGWARFYSLLGLFEAGLCALALCNSLFFSYVILEILTLGTCL" gene complement(1017..1217) /locus_tag="DP116_26040" CDS complement(1017..1217) /locus_tag="DP116_26040" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318415.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glucose-inhibited division protein A" /protein_id="PRJNA477356:DP116_26040" /translation="MNRGKIVAIITGAVSIILALGYLLLVQLLDFRGEMKPAPISDSS HQSSVVAYILPANGQAIVEKIY" gene complement(1363..3477) /locus_tag="DP116_26045" CDS complement(1363..3477) /locus_tag="DP116_26045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744631.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acylase" /protein_id="PRJNA477356:DP116_26045" /translation="MNYLHKTFRQKPFRVFAFVFGIIFALVLGCQTSAQITKSTEILW DTYGIPHIYGKDTRSAFQAFGWAQMQSHGNLLLRLYGQARGRAAEYWGEKYLDSDRWV LTAGIPERASSWYNAQNPTFRNYLDAFAAGINAYAKEHGELIDDEVEVVLPVKPEDVL AHLNRVLHFTFIVNPQDISGVAKQESKAGSNGWAIAPEHSASGKAMLLANPHLPWSDL FLWYEAQLSTPDIDAYGATLVGVPVLAIAFNDNLGWTHTVNIYDGWDTYSLKLVDNGY RFDGKVRPFETTTVSLKVKQKDGTLSEQPLVVKRSVHGPVVTQKDGFALALRVAGLDR PGVLEQWWDMARSQNLNQFQSVLKRLQLPMFTVMYADKDGHIMHLFNGDVPIRSQGDF KYWEGIIPGDTSKTLWTKIHPYQDLPRVIDPESGWLQNTNDPPWTTTYPTAIKADNYP AYMAPTGLIFPKDFRTERSIRMLSSDDKISFDEMVEYKHSTRMELADRILDDLIPASR KQGGELARRAADVLEAWDRQANADSKGAVLFAFWAKQMNLDDKSFSKPWSEDSPRTTP DGLADPKGAVATLEAVAGKVEKAYGKLDIAWGDVFRVHVGNKDLPANGGDGSLGIFRV LNFAPGAEGRFQAVAGDSFVAAIEFSQPVKAMALIGYGNATQSGSPHTEDQLQMFANK QLRPVWRDRQDILAHLEERKAF" gene complement(3620..6343) /locus_tag="DP116_26050" CDS complement(3620..6343) /locus_tag="DP116_26050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456276.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TonB-dependent siderophore receptor" /protein_id="PRJNA477356:DP116_26050" /translation="MKYKRLVPSLLLTGVIVGLLTTPTLSEEKSSIGVDKSTASGEKS ARSTQQPRLVTNSRFVKPITEIRSLSEIERPITSAQMLVQSPTPQATPTSGVVEVTGV KANPTKKGVEVILQTPKGQALQLVNRSAGNNFVVDIPNAQLRLPSGDAFTFRSTKPIA GVTEITVTNFDANTIRVTVTGEAGVPTVELFDSPNEGIIFSVASTSQRAQQGQPQTQQ SPAQQPDSQKMPTQPSASGDEPIELVVTGEQDGYNSSVSTTGTKVDTNQRDIPQAIQV VPRQVIQDQQITRVGDAVRNLSGVQVQAGGSRSTFDRFYFRGFSNVNDGVLRNGLRDR IGSRVPSETANLERIEVLKGPGSVLYGQGILGGAVNLITKQPRQDPYYSIEGTFGNFD FYRGAVDLSGPLTSDKTLLYRLNASVESSGTFIDFFDTQRYFIAPVLSWQIGKNTKLT LEAEYLDSQLQDDNGLPAVGTVLPNPNGKIPLNRNINEPDDKDNRTALRVGYNFEHRF SENWQIRNAFQSTFSKVDQKFTSPTALLADNRTLQRGSFFDSQGSTSTNTYTMDTYVV GKFKTGSIQHQLVTGFDLFRQTDFTPDGLNSSQAPLDLFNPVYGRPQGPVSRFDSGIT SQALGIYIQDQITFADNLKLLLGGRFDLVNQKQENFVAATTTFQQDEAFTPRVGIVYQ PIPEISLYASYGKAFQQNYGTTLDNTLFTPERGTQYEVGVKADLSDKVSATLAFYDLT RSNVIVPNPNNTRFSILTGEQRSRGIEFDIAGEVLPGWNIIAAYAYTDAIVTKDTRPL LVDNQLNNIPKHSFSVWTTYTLQSSFLQGLGFGLGLFYVGDRQGDLANTFELPNYLRT DAALFYKRNNFRAAVNIKNLFDITYFESANSNLRVYPGAPLTVQGTIGWEF" BASE COUNT 1858 a 1519 c 1507 g 1877 t ORIGIN 1 agacaggttc ctaaagtcag gatttccaga atcacataac tgaaaaacaa agagttgcat 61 aatgcaaggg cgcataaccc agcttcaaac aatcccaaca gagagtaaaa gcgtgcccaa 121 ccccagtcca tctccatgta gccgacagca aaaatttgtg caagcaaatt taagccggta 181 atgacgacca tcgcgccgac acttaatgag gagatttcta aatcaataga aaggtttaaa 241 ccagcagtat tcagccaagg tatagatatc tcatatggcg cttgattcca ggcgattggt 301 aataccaaag cagaatgaac aaacgcaaga aatgtcatga ccaagttaat gtaacctgct 361 ggtcttggtc ctgtgcgttt aatgattcct ggagaccagg gtacggctaa aagcgcacca 421 cttaaggcat agcagggaac taaccaaaca gtttcaagca gaaaatgact cattaattct 481 cctccaattt gcagcataac aggacgcggt tgtcaatcaa caaccgtttt tttcaacaac 541 gcagatatgc gaaatgaaaa atttaagttt tattttattt tcgttaaatc atagattacc 601 ctatatctat gtcaaagggg aataagttag ctaaatcatt agtctatcat gactctaaga 661 aattcttata gcttgaaaat ttccaatagt tactattgtt tattatctat aaatgcttga 721 ttcaacagta ccagccgcag tgaacaaggg gtgggcgagt ccgtttcctg 
cctggtaact 781 gataactgat aactgttcac tgtttgaata gttactatta ttttacctct atatagcaat 841 ccattgagga gttgtgagaa ttgaaaaact aagatccccg acttctggtg agaagtgggg 901 gatcaagttt gctcacgttc gctcatagcg tgcccgtagc ggctcaggat ttaggactgc 961 catagcagtt ttagaaacaa acacagacgc aattcgttgt gtccgtgtta ctctagttag 1021 tagatttttt ctactattgc ttgcccattt gctggtaaaa tgtatgccac tactgatgac 1081 tggtgcgacg agtcactaat aggagcaggt ttcatttcgc ccctaaagtc tagcagttgt 1141 acaagcagga ggtagccaag tgccaaaatt atggaaaccg cacctgtgat aattgcaact 1201 attttcccac gattcatcaa ttccacctgt aatgttaaga tatgctaata cccctatatt 1261 aaaattgcaa taccacatag tggtgcagca agcgcataaa taaacatgcc atcccagaca 1321 gccaaaaagc cggaaaaaca ggctttttga ccctggttac attcagaatg ccttacgctc 1381 ctctaaatgc gctaaaatat cttgacgatc gcgccaaact ggacgcagtt gtttgttggc 1441 aaacatctgc aattggtctt cagtatgggg cgaaccgctt tgagttgcat taccatagcc 1501 aatcagcgcc atcgccttta ctggttgaga aaactcaata gcagcgacga acgaatcacc 1561 agcaactgct tggaagcgac cttctgctcc tggagcaaaa ttaaggacgc ggaaaatacc 1621 gagacttcca tcgccaccgt tagcaggtaa atccttattg cccacatgca cccggaaaac 1681 atccccccac gcgatatcca gctttccata agctttttcc actttgcccg cgacagcttc 1741 cagtgttgca actgcacctt taggatcggc taaaccgtct ggggtggtgc gtggagagtc 1801 ttcactccaa ggtttgctga acgatttatc atctaagttc atctgttttg cccaaaaggc 1861 aaaaagtaca gcacccttag aatctgcatt ggcttgtcta tcccacgctt ctaggacatc 1921 agctgctcgt cgtgctaatt cacccccctg ctttcgggaa gcgggaatta agtcatccaa 1981 aatgcgatct gctagttcca tccgggtcga atgtttgtat tctaccatct catcgaaaga 2041 aattttgtca tctgaagata gcattcttat agaacgttcc gtccgaaaat ccttgggaaa 2101 aatcaggcca gtcggtgcca tgtaagcagg gtaattatct gctttaatag cagtaggata 2161 cgtcgttgtc caaggtgggt cattagtatt ttgcaaccaa ccgctttctg ggtcaatgac 2221 gcgtggtaaa tcttggtagg gatggatttt cgtccatagt gttttagacg tgtcaccagg 2281 aataatccct tcccagtatt tgaaatcacc ttgagaacga attggaacat caccgttgaa 2341 caggtgcatg atgtgcccat ctttgtctgc atacatcacc gtaaacatag gtagttgcaa 2401 acgcttgaga acagattgaa actggttgag gttttgggaa cgcgccatat cccaccattg 2461 
ctcaaggaca ccaggtcgat caagaccagc aacgcgcaac gccaaagcaa aaccatcttt 2521 ttgtgtcacc acaggaccgt gaacagaacg tttgacgacg aggggttgtt cgcttaacgt 2581 accatctttt tgtttcactt ttaaggagac agttgttgtc tcgaagggac gcactttgcc 2641 atcgaagcga taaccattgt caactaattt gagtgagtac gtatcccaac catcgtaaat 2701 gttgacggtg tgagtccagc ctaaattatc gttgaaggcg atcgccaaca ccggaactcc 2761 aacaagtgtt gcaccataag catcaatatc gggagtcgag agttgtgctt cgtaccacaa 2821 aaataaatcc gaccaaggca aatgtgggtt tgccagcagc atcgctttgc cacttgcaga 2881 atgttcgggt gctatagccc aaccattaga ccctgctttt gattcttgct tggcgacgcc 2941 tgaaatgtcc tgtgggttaa caataaaagt aaaatgcaaa actcggttta ggtgagctag 3001 aacatcttct ggcttgactg gtagcactac ctctacttca tcatcaatca gttcgccatg 3061 ctccttagcg taagcattaa tccccgctgc aaaagcatcc aggtagttgc gaaaagttgg 3121 attttgtgcg ttgtaccaag aactcgcgcg ttctgggatt cctgccgtga gaacccaacg 3181 gtctgagtct aaatatttct ctccccagta ttcggcagca cgtccacgtg cttgaccgta 3241 aagacgcaaa agcaagttac cgtgactttg catctgtgcc caaccaaagg cttgaaatgc 3301 acttcgggta tccttgccgt agatgtgggg tatcccataa gtatcccaga gtatttcagt 3361 ggattttgtt atctgtgcag aagtctgaca tcccaaaact aaagcaaaga tgataccaaa 3421 tacaaatgcg aaaactcgga atggtttttg ccggaaagtt ttatgtaggt agttcatatt 3481 tctatttagg aattaagtta ctcatcactg ttggatatca attttggagg atggattttg 3541 ggagatttag ttgtcagtta agttcaacgt tcctcctctg attcaacaac gcccatttcc 3601 gaaccacaga caatagcaat caaaattccc agccaattgt tccttgtaca gtcaaaggtg 3661 ctcctggata gacacgcagg ttactattag ctgattcaaa atatgtgatg tcaaacaggt 3721 ttttgatgtt cacagcggct cggaaattat ttcgcttgta gaacagagca gcatccgtcc 3781 gcagataatt cggtaactca aaagtgtttg ctaaatctcc ttggcgatcg cccacataaa 3841 acaatcctaa accaaacccc aaaccctgca aaaaggagct ttgcagggtg taggttgtcc 3901 acacgctaaa agagtgtttc gggatgttat tcaactggtt gtctaccagc agtggtctgg 3961 tgtctttggt gacgatggca tcggtgtaag cataggcagc aatgatgttc catccgggta 4021 ggacttcgcc tgcgatgtcg aattcgatgc ctcgactgcg ttgctctcca gtaaggatag 4081 agaaccgcgt attgtttgga ttagggacta tcacattaga gcgggtgagg tcatagaatg 4141 ccaaggttgc 
agaaacctta tcgctcaagt ctgccttgac gccgacttca tattgggtcc 4201 cccgttctgg tgtgaatagc gtgttgtcaa gggtggtgcc gtagttctgc tgaaaggctt 4261 taccatagct ggcgtaaagt gaaatctcgg ggatcggttg atagacaatc ccgacacgag 4321 gcgtgaacgc ttcatcttgt tgaaaggtgg ttgttgcggc gacgaagttt tcttgcttct 4381 gattgacaag gtcaaatcgt ccaccgagga gcagcttcag gttgtcggca aaggtaatct 4441 gatcttgaat gtaaatgcct aaagcttgag atgtgatacc actatcaaac ctgctcacag 4501 gtccttgagg gcgaccgtaa accggattaa acaggtcaag gggagcctga gagctgttca 4561 atccatcagg cgtaaaatca gtttgtctaa acagatcaaa gccagtcacc agttggtgtt 4621 gaatgctccc ggtcttaaat ttacccacaa cgtaggtatc catcgtgtag gtattggtag 4681 aagtacttcc ctggctgtcg aagaaagatc ctctttgtaa ggtgcggttg tctgccagaa 4741 gagcggttgg tgaagtgaat ttctggtcaa ctttggaaaa cgtactttga aacgcattgc 4801 ggatttgcca gttctcactg aagcgatgct caaagttgta gccaactcgc aaagcagtgc 4861 ggttatcttt gtcgtccggt tcgttgatat tgcgattgag cggaattttg ccatttggat 4921 tgggcagcac cgtaccaacc gctggcagtc cattatcatc ttgcagttga ctgtccagat 4981 attccgcttc cagggttagc tttgtattct tgccaatttg ccaggataaa accggagcaa 5041 taaagtaacg ctgggtatcg aagaagtcga taaacgtgcc ggatgattcc accgaagcat 5101 ttaagcgata gagcagggtt ttatcgctag ttaggggacc agaaaggtca accgcaccgc 5161 gataaaagtc gaaattgcca aatgttcctt cgatggaata gtaaggatcc tgacggggtt 5221 gctttgtaat caggttgact gcaccaccca gaatgccttg accgtaaagc acagaccccg 5281 gacctttgag cacttcgatg cgttctaagt tcgcagtttc cgagggaact cggctaccaa 5341 ttcgatctct cagaccattc cgcaaaacac catcattaac attagaaaaa cctcggaaat 5401 agaatcgatc aaatgtgctg cgagatccac ccgcttgcac ttgaacgcca ctcagattcc 5461 tcacggcatc gccgacgcga gtaatttgtt ggtcttggat cacctgtcgc ggcactactt 5521 gaatcgcttg cggaatatca cgctgattcg tatcgacttt cgtcccagtg gtcgatacag 5581 atgaattgta tccatcttgc tcccctgtta ccaccaactc aattggttca tcaccactag 5641 cggatggttg agttggcatt ttctgactgt ctggctgctg tgcaggtgat tgttgtgttt 5701 gaggttgccc ctgttgcgct ctttgtgatg tactcgcgac gctgaaaata ataccttcgt 5761 tgggactgtc aaacaactcg actgtcggta cacccgcctc acctgtcacc gtcactcgaa 5821 tagtattagc atcaaagtta 
gtaaccgtta tttcagtaac acctgcaatt ggcttggtgg 5881 agcgaaatgt gaatgcatcg ccactgggta gccgtagttg agcattagga atatcaacga 5941 caaagttatt gccagcacta cgattcacca gttgtaacgc ctgacctttg ggtgtctgta 6001 aaatcacctc cacgcctttt ttggtgggat tcgccttcac tccggtcact tccacgactc 6061 ctgaggtggg tgttgcctgt ggtgttggtg actgcacaag catttgagca ctggtaattg 6121 ggcgttctat ctctgagagt gaccgaatct ctgttattgg tttaacgaac cgggaattgg 6181 ttactagacg tggttgttga gtagaacgag cgctcttttc accagatgcc gtactcttgt 6241 caactccaat tgatgatttt tcttcactta atgtgggagt tgtgagcaac cccactatta 6301 cgcctgtcag caacagactg ggaacaagcc gcttatattt cattcttttc gttgttactc 6361 cctcacgcca cgaaatctct agttgatgac gccagtatag gacattattg aaattcattt 6421 gcattgagaa agcaagaagc tgaaactata acaaacaaaa gtttacattt agttggtcaa 6481 ctttaaacac actattgcta cattttgaga ctaattcatt cataaaactt catgatgtca 6541 agtcggtatt tagaaaacct tgatggtgca tgttcatttt tacatcatgc tattacctgt 6601 atgaacacag acagcattga ccagtaaaca tttttgggtg agttaacatg atttgactga 6661 gactacacag aatcatgtga gatgaagaaa agataaatct tgttgccatc tgtttttaac 6721 atctatcaat agtaagattt ttgtttatat agttgtaact c // LOCUS NODE_4559_length_6705_cov_5.4154896705 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6705) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6705) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6705 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(49..264) /locus_tag="DP116_26055" CDS complement(49..264) /locus_tag="DP116_26055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26055" /translation="MQVLLNKSLQALTSSTSGSSQVNTAIYTYQPMISKHTCSCCSYT LLRHIDLKGIYWRCSHCYQEMPVYQPL" gene complement(667..1371) /locus_tag="DP116_26060" CDS complement(667..1371) /locus_tag="DP116_26060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016948968.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="beta-carotene ketolase" /protein_id="PRJNA477356:DP116_26060" /translation="MVIALVIIATWLTSLILLLSVDISHFNILTLSLAVLWQAFLYTG LFITTHDAMHGVVFPRNNKINHFIGSLCLTLYGFLPYEKLLKKHWMHHHNPASEKDPD FHDGEHKNFFAWYFYFMKNYSSWGQMLIITIIYNFAYFIVHIPRTNLTFFWAIPALIS SIQLFYFGTFLPHREPKDGYSEPHRAQTISYPVWWSFLTCYHFGYHEEHHESPHVPWW QLPDVHMLKRSKISQN" gene complement(1866..2057) /locus_tag="DP116_26065" CDS complement(1866..2057) /locus_tag="DP116_26065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874499.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="high light inducible protein" /protein_id="PRJNA477356:DP116_26065" /translation="MGNYPTDATEKAYNGSDRNAVKFGFTPQSELWQGRLAMIGFIAY LLWDLNGFSVLRDVLNLIH" gene 2374..3192 /locus_tag="DP116_26070" CDS 2374..3192 /locus_tag="DP116_26070" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017313666.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="PRJNA477356:DP116_26070" /translation="MAPTVLITGASQGIGKATALLFARKGYDLVLTARQLDELERVAQ EVQSLGCLAPLIVPCDVRDSSQVETLVEKALDHYGYIDLLINNAGIFAEGPVEQFSLS DWHQIIDTNLWGYIHTIHALLPHFLQRRTGTIVNISSIGGKVPTPYLVPYSTSKFGVT GLTEALHAELKPKGVHVCGIYPNVIKSRFVEAAVFRGKDEQDAKSRRDQMNSVLEIPG VEKPEDVANAIWDAVKNHKSEVFVGSANLSQAFYRLFPGLLQWVSQQALKNKDQ" gene complement(3217..3675) /locus_tag="DP116_26075" CDS complement(3217..3675) /locus_tag="DP116_26075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874496.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cyclase" /protein_id="PRJNA477356:DP116_26075" /translation="MLHFKYSSVINAPVEVVWKFHERPDVMQLLTPPWQPVQVLRREG GLGRGAITEFRLFLGPLPLRWLASHTEYQEHHLFTDEQISGPFDYWVHRHLFQAENGQ TRLTDDISFSMPGGEPVEFVSGWLVQVQLEAMFRYRHFVTKRECESPLVP" gene complement(3819..4007) /locus_tag="DP116_26080" CDS complement(3819..4007) /locus_tag="DP116_26080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006635525.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CPXCG motif-containing cysteine-rich protein" /protein_id="PRJNA477356:DP116_26080" /translation="MQTTAEYYCAYCGEPNTTFVDFSAGGHQSYVEDCQVCCRPNILY VRIDEETLDVEIDTEYEE" gene complement(4008..5654) /locus_tag="DP116_26085" CDS complement(4008..5654) /locus_tag="DP116_26085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131204.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="radical SAM protein" /protein_id="PRJNA477356:DP116_26085" /translation="MTSSLFTAERLLFTPAIPNTDAVPTIFAFPNEYTVGITSLGYQV VWATLAMRSDVQVSRLFTDTHEQLPRKPELLGFSMSWELDYVNVLNLLESLEIPIRAN ARDNHHPIVFGGGPVLTANPEPFADFFDVILLGDGEILLGNFIEAYKEVRNADRQTQL KALAQVSGVYVPSLYYVEYHASDNGVKSIQPISSEIPAVVQKQTYRGNVLSASTVVTE KAAWENIFMVEVVRSCPEMCRFCLASYLTLPFRTASVDSSLIPAIEKGLSVTNRLGLL GASVTQHPEFEALLDYINQPKYDDIRLSISSVRTNTVTVQLAQTLAKRDTRSLTIAVE SGSEKLRRIINKKLHNDEIIQAAGNAKAGGLSGLKLYGMVGIPGEEPEDLDETVTMMR DIKKAAPGLRLTLGCTTFVPKAHTPFQWFGVNPQAEKRLQFLQKKLKPQGIDFRPESY NWSIIQALLSRGDRRLSHLLELTRDFGDSLGSYKRAFKQLKGQIPDLDFYVHTNWSTD QILPWNHLQGPLPQSTLLKHLADATSHFNLSQRELQPLNN" gene complement(5675..5929) /locus_tag="DP116_26090" CDS complement(5675..5929) /locus_tag="DP116_26090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874493.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26090" /translation="MNWINILGLVAATLTTFSFLPQMLKTWQSKSAKDVSFAMLIFFN LGIFLWLIYGISLNALPIILANAVTLFFNLIILWFKIKYR" gene 6176..6580 /locus_tag="DP116_26095" CDS 6176..6580 /locus_tag="DP116_26095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019492410.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26095" /translation="MELRPYFWHRFIFSVVGVCSLSLPFAQSVQATPAQEVRNFCRKQ ESIFVAAETKNFWVSICGSESPLTYVGVNKKTGKAVRLPLSVNGRDSKGEYFEAVSGD YTYILAESTKGKNLTVAKGSREILREPVIRGW" BASE COUNT 2012 a 1274 c 1421 g 1998 t ORIGIN 1 ggcatgtggc gtccctgggc taaagccaca gggttttcat ctcacccact ataagggttg 61 ataaactggc atttcttggt aacagtgact acaacgccag taaattcctt tcaagtctat 121 gtgacggagc aaagtgtatg agcagcaaga gcaagtatgc ttacttatca tgggttgata 181 ggtatatatg gctgtattta cttggctact cccacttgta gaagaagtca gcgcttgaag 241 agatttattt aataaaactt gcatttttta acctgattgt tgttaacgtt tggctcaatt 301 ttcactcttt aactgacata caacacctct cgaaccaaat agggtgtaga gatttacatc 361 cgggaaggcg tctatggcaa tgtgtctatg gcgtagtcac gtaaccgaaa ataattctct 421 ttagcttaat cttagcatgg aatatgtgat tacgcttttt gtccatgtaa taccttagta 481 gggcgttgtt cctaacaatc tgttttaagt agtccgccct aaccatctgt acgactgggt 541 aacgagtaat tagtttaaag cctcgtctct ttatgggcga aaataggaaa aaatttttcc 601 ttactgaaac tccatcccct aggggattaa gttctgagtt ctgactcctt ttatatactt 661 gatagattag ttctgactaa tttttgaccg cttcaacata tgaacatctg gcaattgcca 721 ccaaggaaca tgaggagact cgtgatgttc ttcgtgatag ccaaaatgat agcaggtaag 781 aaatgaccac caaacaggat atgagatagt ttgcgcacgg tgaggttcac tatacccgtc 841 ttttggctca cggtggggca agaacgtacc gaaataaaat agctgtattg aactgataag 901 tgcaggtatt gcccaaaaga aagtgagatt tgttctgggt atatgaacta tgaaataggc 961 aaagttatag ataatcgtga tgatgagcat ctgcccccaa cttgagtaat ttttcatgaa 1021 ataaaaatac caagcaaaga aatttttatg ttcgccgtcg tgaaaatcgg ggtctttttc 1081 actggcagga ttatgatgat gcatccaatg cttttttaaa agtttttcat aaggtaaaaa 1141 accatagagt gttaaacata gtgaaccaat aaaatgatta attttgttgt ttctaggaaa 1201 gactacccca tgcatagcat catgagttgt aataaataat cctgtgtaga gaaatgcctg 1261 ccaaagcaca gcaagcgata aagtcaagat attaaagtga gagatatcca cagaaagtaa 1321 tagaatcaag ctcgttagcc aagttgcaat aattacaaga gcaatgacca ttcctgtggt 1381 agattcactg tgagtttttc gttgatgacc tgttggtttt tctaattgtt gaatcacggt 1441 tgtactttat 
tttgggcgaa agttcacaaa aataagctca tgacgagaca cactaatagc 1501 tcacacagcc acttcatcag tagtgacagt acttttcaac aagcgtgagg tagagtggca 1561 agatgcgaac cttttccata attccccgac tgtatttcat ccagttgcca gttgctgtat 1621 tgttgtccta caagcaacct tagtttgaga attatgtaga aatgttaact tactttaata 1681 tttacataca attgccatac cgaaaacttt tttggtctgg aggttagctg gtatgggtgt 1741 aagacagcaa aattctagag aaagattcag aacccagaac ccagagtgca caagaggttg 1801 gctgttgtgt ctgtagtgtc agcttatatc tctagctaat ttgtgagttc tgggttcttt 1861 caaatctagt gaatgaggtt gagtacgtca cgcagaacgc taaagccgtt aagatcccaa 1921 agaaggtagg cgataaaacc aatcatggct aagcgacctt gccaaagttc agactgtgga 1981 gtaaacccaa atttaactgc gttgcgatcg cttccgttat atgccttttc agtagcatca 2041 gtaggatagt ttcccattgt taagtttcct aaagtattta aattacattt ttcatagtag 2101 tgatccctct agtgtgcgct atctctctac agtgtcaaag cgatagctag tctaaaggat 2161 gagattttga gtggttcacg taaagatttg taatacaaaa ggttaatagg agtttgatta 2221 accttccaaa tagaggatta ttcttgtcat caggtagtag tcaggtatac caaaaaaatt 2281 tgaaaagaaa aaaattgaaa atctaacgtt agtcagaagg tttattaaat tttgttgctc 2341 aaactatagg aaatagagtt tttaactgaa aatatggctc ctacagtact gattacaggt 2401 gcttcccaag gtattgggaa agcaacagca cttctatttg cacgcaaagg atatgacctg 2461 gtactcactg cacgtcaact tgacgaattg gaaagggtag cacaggaggt gcaaagcctt 2521 ggttgtctag caccacttat agttccttgc gatgttagag actcatcgca ggtggaaaca 2581 ctggtggaaa aggctttgga tcattatggc tatattgatc tattaattaa caatgcaggc 2641 attttcgcag aaggaccagt ggagcagttt tctcttagcg attggcacca aatcatagat 2701 actaatttat ggggatatat tcacacaatt catgcccttt tgcctcattt ccttcaaagg 2761 agaactggaa caattgtgaa tataagttcc attggtggta aagtgcctac tccttactta 2821 gtgccgtact ccaccagtaa gtttggcgtc acaggtttga cagaggcgct acacgcagaa 2881 ttaaagccaa aaggtgttca cgtttgtgga atttacccca atgtgatcaa gagtcgtttt 2941 gtggaagcgg ctgtttttcg tggtaaagat gagcaagatg cgaaatcccg tcgtgatcag 3001 atgaacagcg tcctagaaat tccaggtgtg gagaagcctg aggatgtggc aaatgccatt 3061 tgggacgctg ttaaaaacca caagtctgag gtatttgtgg gttcggcgaa tttgtcgcaa 3121 gcgttttatc gattgtttcc 
aggcttgctt cagtgggttt ctcagcaagc tttgaaaaat 3181 aaagatcaat aaaagctgag aatgtacaga atttacttaa ggtactaatg gtgactcgca 3241 ttcacgtttc gtcacaaaat gtcggtaacg aaacatcgct tctaactgga cttggacaag 3301 ccaaccactg acaaattcca caggttctcc gcctggcata gaaaaggaaa tgtcatcagt 3361 caatctcgtt tgaccatttt ctgcttgaaa taaatgtcga tgcacccaat aatcaaaagg 3421 tccggaaatt tgttcgtcgg taaacaggtg atgttcttgg tattctgtgt gacttgctaa 3481 ccaacgcaaa ggcaatggtc ccaaaaaaag gcgaaactca gtgatagcac cccttccaag 3541 tcccccttcg cggcggagga cttgaacggg ttgccaaggt ggagtcagca gttgcataac 3601 atctggtctt tcgtggaatt tccaaacaac ctccactggt gcattaatga ctgatgaata 3661 tttaaagtgc agcatttcaa cggttttcga ttttgtaagt aataattaat aattgttagt 3721 tgttagttaa gtttcttaat aaccactaac gactaaccat tagttattcc ctatgactta 3781 ttaccaaaaa attttcaatt taaaatcagt agaaaagttc attcctcata ttcagtatct 3841 atctctacat caagagtctc ttcatcaatc cgcacgtaaa gaatattagg acgacagcaa 3901 acttgacaat cttctacgta agattggtgt ccacctgcgc tgaaatcaac gaaggtggta 3961 ttgggttcgc cacaataggc gcagtagtat tcggctgttg tttgcattta gttgtttaac 4021 ggttgaagtt ctctttggga taagttgaaa tgactcgtgg catcagccaa gtgctttagt 4081 agtgtagact gtggtaatgg tccttgcaag tggttccaag gtaatatttg gtctgttgac 4141 caattggtat gaacgtaaaa atctaagtcg gggatttgtc ctttgagttg tttgaaagca 4201 cgtttgtaac tacccaaaga atcaccaaag tcgcgggtga gttctaggag gtgggagagt 4261 cggcgatcgc ctctcgacaa caaagcctga ataattgacc aattataact ttccggacga 4321 aagtctattc cttgaggctt gagttttttc tgtaaaaact gcaacctctt ttctgcttgc 4381 ggattcaccc caaaccactg aaatggtgta tgtgccttgg gaacaaaggt ggtacacccc 4441 agtgttaacc gtagtccagg agcagctttt ttgatatcgc gcatcatcgt tacggtttca 4501 tccaaatcct ctggttcctc accaggaatt cccaccattc cgtagagttt caaaccagat 4561 aatcccccag ctttggcatt tcctgctgct tggatgattt catcattgtg caacttttta 4621 ttgataatcc gtcgtaattt ttcagaacca ctttctactg caatggtgag ggatcttgta 4681 tctcgttttg ccaaagtctg cgccaattgt acagtgactg tgtttgttcg tactgaggaa 4741 atgctcagac gtatatcatc gtactttggc tgattgatgt aatctagtaa agcttcaaat 4801 tccggatgtt gagtaactga agctcccaat 
aatcctaacc gatttgtgac tgataaccct 4861 ttttcaattg ctggaatgag tgaactatca acactcgctg ttctaaaagg taatgtcaga 4921 taacttgcca aacagaagcg acacatttct ggacaacttc tcaccacttc caccatgaag 4981 atattttccc aagccgcttt ttctgtcacc actgttgatg ctgaaaggac atttcccctg 5041 taagtttgct tttgcaccac agcaggaatt tctgaggaaa ttggttgaat tgatttgaca 5101 ccattatctg atgcgtgata ttccacataa tacaaactag gaacataaac accagaaact 5161 tgtgctaatg ctttgagttg ggtttgtctg tcagcgtttc taacttcttt gtaagcttca 5221 ataaaattcc ccagtaaaat ttccccatct cccagcaaaa tcacatcaaa gaaatctgcg 5281 aaaggttcgg ggttagctgt gagtacggga ccaccaccaa aaactatggg atgatgatta 5341 tctcgggcat ttgctctgat gggaatttcc aaagattcca gcagattcaa aacattgaca 5401 taatcgagtt cccacgacat ggaaaagcct aataattccg gctttcttgg gagttgttca 5461 tgagtgtcgg tgaacaaacg actcacctgc acatcagaac gcattgctaa agttgcccat 5521 accacctgat agccaaggct agtaataccc acagtgtact cgttgggaaa agcaaaaatg 5581 gtgggaacag cgtcagtgtt aggaatagca ggggtaaata aaaggcgttc agcagtaaat 5641 aaagaagatg tcacaggtgt ttacaataaa cagattatct atatttaatt ttaaaccata 5701 gaattatgag gttaaaaaat aatgttacgg cgttagcaag aataatcggt aaagcattta 5761 gagaaattcc atagattaac catagaaata tccctaaatt gaagaagata agcatggcaa 5821 atgaaacatc ttttgctgat ttggattgcc acgttttcag catttgcggc aagaaggaaa 5881 atgtagttaa cgttgccgca actaatccta atatgttaat ccaattcatg gagatgcaac 5941 ccttataata tttgtttaag atgatatcag cacaagaagt gcaggagata cgagtgcaag 6001 cgcatattcg acttctggtg gtgatcatgc tcccacactg caattatctt tgcataactt 6061 acaatctgtg ctctaaaagc aggtaagcga ataatatatc aagttttccg taatcacctt 6121 cactcgtgct gtagaactta gtaacaaatt caacagatat taatagtcat caagaatgga 6181 acttcgtcct tacttttggc atcgatttat tttctccgta gttggtgtat gttcgctgtc 6241 acttcccttc gcgcaaagtg ttcaagcaac tccagcacaa gaagtgcgta acttctgtcg 6301 caagcaggag agtatatttg ttgctgcgga gactaagaat ttttgggtaa gtatctgtgg 6361 tagtgaatca ccactgactt atgtgggtgt aaataaaaag actggtaaag ctgtacgact 6421 acctctaagt gtaaatgggc gagactccaa gggggagtac tttgaagctg ttagtggcga 6481 ttatacttat attctggcag aatcgaccaa aggcaaaaac 
ttgactgttg ctaaaggtag 6541 tcgtgaaatt ctgcgcgaac cagtcattag aggttggtaa gcgtcaaaac agctatgact 6601 tgcagtagtt gactaaattc tgaggctata gtttgagcca aagaattatt tgtgaccata 6661 atttctagca atttggattt aggtgctata gcaatcctat atgag
//
LOCUS NODE_4561_length_6703_cov_7.239320 6703 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6703) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6703) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6703 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 448..1758 /locus_tag="DP116_26100" CDS 448..1758 /locus_tag="DP116_26100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864885.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_26100" /translation="MLQTQQVIHNRYQLKEKLGEGAGRQTWLALDLSTQEDIVVKLLT FSDQMQWESLKLFEREAQVLKQLNHPRIPDYRNYFCIDDQLLYFGLVQEYIPGISLKQ LLTKGQVFAEPEVRKIAAHILKILIYLHGLSPPVLHRDVKPSNVLVGKDSRIYLVDFG AVQDRAAREGATFTVVGTYGYAPLEQFGGRATPASDLYALGATLIHLLTGVSPADLPQ KDSRLQFSHLVRLNPGFVRWLEQLTEPNLERRFSSAQQALDALKANHNAIKIASPRLP ESCIWLKKSPTHLQIQLPVPWYKVISSPRNWIVLTGLGLWLFWLSMLISGWVYLFWLA GSLVLGAWFLLPVFVETNIYFNHQQFEIEVKLLGFCIKRQRGNVSEIDNVFKSDSGGY GNKKIPEVTLAVGVEEYSFGRLKPPLSHQVCRWLVDEIKHWLGL" gene 1776..3209 /locus_tag="DP116_26105" CDS 1776..3209 /locus_tag="DP116_26105" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_26105" /translation="MLQEGGQVLCIREATPQESRYQLKEKLGQDASRQTWLAVDLHTQ SQEQVVVKLLALSPQMQWDEHKLFEREAQVLKNLNHPRIPKYRDDFVLEQQPGSRFPW FGLVHSYVPGTSFQQLLDGGHRFSQSQVEKIATEILSILVYLHELDAPVLHRDIKPSN LIWGEDERVYLVDFGAVQDQAVLEGATFTVVGTYGYVPMEQFAGRAVPASDLYAVGTT LIHLLTGTPPADLPYKDSRIQFADKVSVDLGFVNWIGKLTEPNVAERLSTARQALDTL KNKHALTPPLTSRKPTGSRIQIKKSADKIEIKIPRRGRKTFKLFYLIGLMIPFVWQLP QWFNLLSTGLSYHSILYFVPLLALVFFLFQATVLPAFRHTDMYFDREHFEIRWKLFGL CYWRFRGKTLLICRVYEEIVQQGSAPRGVTIESSNKQKFTSSPLATVERHWLIEEIKD WLKLNQGTGNREQGTGNREYVDKRNSI" gene complement(3334..5364) /locus_tag="DP116_26110" CDS complement(3334..5364) /locus_tag="DP116_26110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745551.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="potassium transporter TrkA" /protein_id="PRJNA477356:DP116_26110" /translation="MHQAAQPQVLLDQFLVCGLGGLGQHCVVALKQFEVSVIAIDKKP PEDWEIPHLENLLDTLVIGDCRQIDVLQQAKIQQCRAAIIVTSSEQVNAETALAVRKL NPKTRLVIRSSKKNLNELLSEHLGNFVAFEPTQLPAPAFALAALGTETLGLFHLEGQW LQVVKRTLSPTDRWCNRQLLHELNTHKRHILYHTSSATSSFKAFHKWEPDTRLMPGDT IVYIEAIKPTSTDSQKLVKNTWYNPWHLLTRLKHLNWLTIKQQVTQFLRKPDQNRFKQ VGIICGVIVGVLLLLGTVLYRLTYPKMGLIDAFLTSTMLLLGGFGDLFGGFNFTLPVP FWLRLISLGMTITGTILVGVFYSFLTERLLAARFQLNKRRPPVPQKDHVVVVGLGRMG QGIAEILQEFNQPLVGITLNTDFDMTVLPKMPLITGSLKRFLSKANLQTAKSVVVVTD DEMLNLEVSLMAYDANPESNLVIRTTGLSLSDNLAELLPNAQVLCVYALVAEAFAGAA FGENIINLFRINHKTILVTEYQIEGGDTLNGLLLSEVAYGYGVVPILYQKETGMAKLM PSEDISLAVGDRMIVLATSDGLQRIEQGTTITTSKTWQVHITRALTEEAQFEGANVLV RISGCSLNTARTVMKNLPQTLHLPLYKHQAQRLIIELSKAQVIAHLLPTNSL" gene complement(5669..5959) /locus_tag="DP116_26115" CDS complement(5669..5959) /locus_tag="DP116_26115" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26115" /translation="MGKIDAGLSDYYMVDEVQGTYRGMIIAIAPGHWSVFANFELSEF AKCLQNLASRVHLKSFLKHTRVPKKKKDSPKYDPQHPHVSTAKLLKTAKKSP" gene complement(5974..6651) /locus_tag="DP116_26120" CDS complement(5974..6651) /locus_tag="DP116_26120" /inference="COORDINATES: protein motif:HMM:PF01609.19" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26120" /translation="MRILDGNCLEKTDHRLEVLRTIAAGALPGKSLVVLDPELRLAIN VFGCEDGHAQERSMFSEVLKTVKEGELWIADRNMCTVGFLSGLHRSGANFVIREHKSM PWEAINSLQAVGSIETGDLFEQTVSLSDAGKLLLMRRVVLKLKKPDRNGENEIAIFTM LPTETVTAVVIAQLYRERRSVENLFQTVTENYECQIQTLGYPKAALFSFCLALVAHNI LEPFAGF" BASE COUNT 1907 a 1359 c 1427 g 2010 t ORIGIN 1 caagatctga gaatcctatt tgatttgtga aaatcgcaat cgcggatgct cagattcccg 61 acttcggcga agttgtcggg aatcttgctg tggctttacc cactgggcat acgcgagtgc 121 gtgcccgtag agcttatgat ttaggattgc tataatagta aaatttataa tcaatgtatt 181 tcattctact aacaaagttt atctctaatt tttttccctt gctgctaaat atgaaaaaca 241 attgcctaaa ggagtagtca tgaagtctct cagttctggc agaaatttga taaaaacatc 301 cggaatacgg ttgtttcagt ttgtatatta gttttgtttc tacatagatt tcgatacttc 361 cctttgatgg agattgtctt gtcttgtctt ttttagggat tctagtgtca ttttggtaca 421 acaggagtgt tgaggaatag caacacgatg ctgcaaacgc aacaggttat acataatcgc 481 tatcaactga aagaaaaact gggtgaaggt gcaggtcgtc aaacctggtt agcactggat 541 ttgtcaacac aagaggatat tgtggtgaaa ctgttgacct ttagcgacca gatgcagtgg 601 gaaagcttga aactatttga gcgagaagca caagttttga aacagttgaa ccatcctcgc 661 attcctgact atcgtaatta tttttgcatt gatgaccagt tactctattt tgggttagtt 721 caagaatata ttcccggtat atcactcaag caattactca ctaaaggtca agtgtttgca 781 gaaccagaag ttcgcaaaat tgctgctcat atcctaaaaa ttctgattta tttgcatggt 841 ttgagtcctc ctgtgctgca tcgagacgtt aagcctagca atgtgctagt gggtaaggat 901 tctcgtattt atctggttga ttttggtgca gtacaagatc gcgccgctag agaaggtgct 961 accttcacag tggtaggaac ttatggctat gcaccactcg aacagtttgg cggaagagca 1021 actccagcat ctgatctcta tgctttggga gcaacattga ttcatttgct gacgggtgtt 1081 tctccagcag atttgcctca aaaagattct cgcttgcagt tttctcacct tgtcagactc 1141 aatcctgggt ttgtccgttg gttagaacaa ctgactgaac ctaatctcga aagacgcttt 1201 agtagcgctc aacaagcact tgacgctctc aaagctaatc acaatgctat caaaattgct 1261 agtccacgac tccctgaaag ttgtatttgg ctgaaaaaat cacctactca cttacagatt 1321 caacttcctg tcccttggta caaggtgatc tctagcccaa gaaattggat 
tgtgctaacg 1381 gggttgggac tttggttatt ttggctgtcg atgctcatca gcggttgggt ttatttgttt 1441 tggttggcag gtagtcttgt attgggggct tggtttctat tacctgtttt tgtggaaacc 1501 aatatctact tcaatcatca gcaatttgaa attgaggtga aattattggg tttttgcatc 1561 aaacgacagc gaggaaatgt ctcagagatt gataatgttt ttaaaagtga ctctggcggc 1621 tatggtaaca agaaaattcc tgaggtgact ctcgcagttg gtgtcgaaga gtattcattt 1681 ggaagattga aaccacctct ttctcatcaa gtatgccgtt ggctcgtaga tgaaatcaag 1741 cattggttgg gactttagat tttgggcttt tggttatgct acaagaaggt ggacaagttc 1801 tttgcattcg ggaagcgact ccgcaagagt ctcgttatca acttaaagaa aaattagggc 1861 aagatgctag tcgtcaaact tggttagcag tagatttgca tacacagtct caagaacagg 1921 ttgttgtcaa actgctcgct ttgagtccgc aaatgcagtg ggatgaacat aaactctttg 1981 aacgtgaagc tcaagtactg aaaaatctca atcatccccg aattcccaag tatcgagatg 2041 actttgtttt ggaacagcag cccggttcga gatttccctg gtttgggtta gtgcacagtt 2101 acgttcccgg aacatcattc cagcaactgc tggatggggg tcatcggttt tctcagtcac 2161 aggtggaaaa gatcgcaaca gaaattttga gcattcttgt ttacctgcac gaactcgacg 2221 ctcctgtgct acacagagat attaaaccca gcaatttaat ttggggtgag gatgaacgcg 2281 tttacctagt tgactttggc gcagtacaag accaggcggt gttggaaggt gcgactttca 2341 cagttgtagg aacctacggc tatgtaccaa tggaacagtt tgcaggtcgt gctgttcccg 2401 cctctgactt gtatgctgtg ggtacaacct taattcatct cctcaccgga acgcctcctg 2461 ctgatttacc atacaaagac tctcgcattc agtttgctga caaagtgagt gtcgatttgg 2521 gttttgtgaa ttggattggc aaactgacag aaccaaatgt tgcagaacgg cttagtactg 2581 cacgccaagc gttagacacg ctgaagaaca aacacgcact cactccacca cttacaagtc 2641 gtaagcctac aggtagtcga attcaaatta aaaaatctgc ggacaaaata gaaataaaaa 2701 ttcccagacg cggaagaaaa acgttcaaat tattctatct tatcgggctg atgattcctt 2761 ttgtctggca attaccacag tggttcaatt tactctcaac aggacttagc taccattcga 2821 tattgtattt tgtacctctt ttagcattag tgtttttcct cttccaggca acagtgctac 2881 ctgcttttag acatacagat atgtactttg accgtgaaca ttttgaaatc cgctggaagt 2941 tatttggttt atgttattgg cggtttagag gaaaaactct actgatttgt agagtgtatg 3001 aagagatagt tcaacagggt tctgcacctc ggggcgtgac aattgaatct tctaataaac 
3061 agaagttcac ctctagtcct ttagctactg tggaacgtca ttggttaatt gaggagataa 3121 aggattggtt gaaattaaat cagggaacag ggaacaggga acagggaaca gggaataggg 3181 aatacgtaga taagcgaaat tcaatataag ataagataag acctcaccct caatccctct 3241 ccttattaag gagagggaag ccggaagcag ggtgaggttt tatatttaat ttgacccact 3301 gacttaagga ataactgata actgataaca ttgctaaaga gaattcgttg gaagcaagtg 3361 ggcgattact tgagctttac ttagttctat aataagtcgc tgagcttgat gtttgtagag 3421 tggtaaatgc aatgtttgtg gtaagttttt cattactgtt ctcgctgtgt tgagactgca 3481 tcctgaaatg cgaacaagaa cattcgcccc ctcaaactgt gcttcctctg tcaaagctct 3541 tgtaatatgt acttgccagg tttttgatgt ggtgatagtt gttccctgct caatgcgctg 3601 tagaccatcg cttgttgcta gtactatcat gcgatcgccc acagctaaac taatatcctc 3661 agaaggcata agcttagcca ttcctgtttc tttttggtaa agaattggaa cgactccata 3721 gccataagca acttcagata gcagtaagcc attgagcgta tcacctcctt caatttggta 3781 ttctgtaacc aagatagttt tgtgattaat acgaaacaga tttatgatat tttccccaaa 3841 tgctgctccg gcaaaagctt cagcaactaa agcgtagaca catagtactt gagcattcgg 3901 caagagttcg gctaaattgt cacttaagct aagacctgtg gtacggataa ccaaattgct 3961 ttctggattt gcatcataag ccattaaact gacttccaaa ttcagcatct catcatcagt 4021 caccacaacg acactttttg ctgtctgaag atttgccttg gaaagaaacc tttttaggga 4081 accagttatg agtggcatct ttggtagaac cgtcatgtca aaatctgtgt ttagcgtgat 4141 cccaacaagg ggttggttga attcttgcaa aatctctgcg attccctgac ccatccgacc 4201 gagtcccacg acgacaacat ggtctttttg aggaacgggt ggacgtcgct tattcaactg 4261 aaatctagca gctaaaagtc tttctgtcaa aaaactatag aataccccta caaggattgt 4321 accagtgatt gtcattccca agctaatcaa ccgcaaccaa aacggaactg gaagtgtgaa 4381 attaaaccca ccaaacaaat caccaaatcc gccgagaagt agcatggttg aggtcaaaaa 4441 ggcatctatc aagcccattt tgggataggt caaccgatac aaaaccgttc ctaatagcaa 4501 taaaacacct acaatgactc cacagataat acctacctgc ttaaatcgat tctgatcagg 4561 tttgcgtaaa aattgagtaa cttgctgttt tatagttaac cagttgaggt gtttgagtct 4621 tgttaataag tgccaagggt tataccaagt attttttacc agtttctgag aatcagtgga 4681 agttggtttt attgcctcaa tataaacaat tgtatctccc ggcatcaggc gtgtatctgg 4741 
ctcccattta tggaaggctt tgaatgaaga agttgcacta gatgtgtgat agagaatgtg 4801 acgtttgtga gtattcaatt cgtgcaacaa ctgtctgttg caccaacgat ctgttggtga 4861 taaagtacgt ttgacaactt gcaaccactg tccttctaaa tgaaacaatc ctaatgtctc 4921 tgtaccaaga gcagcaaggg caaaagctgg ggcaggtagt tgagttggtt caaaagcaac 4981 aaagtttcct aaatgttcac tcaataattc attgagattt tttttagatg aacgaatcac 5041 taagcgggtt ttgggattaa gtttgcgcac cgctaaagct gtctctgcat tgacttgctc 5101 actacttgtg actatgatgg cagctcgaca ctgctggatt ttagcttgct gtagaacatc 5161 gatttgacga cagtcaccaa tgactaaggt atccaataaa ttctcaagat gaggaatttc 5221 ccagtcttca ggcggctttt tatcaatagc aatgacgctg acttcaaact gtttgagggc 5281 gacaacacaa tgttgaccta aaccgccgag tccgcaaact agaaattgat ctaagagcac 5341 ctgaggctgt gctgcttgat gcatgtgtat tactggttgt caatatatat ttacaattat 5401 gattgatctt atttgactgg gttgtcgtcc gcctaggact ataagtccta ggcttatagg 5461 cgaagtccat taaaatggac taaaaaactt acccagtccg ttttaacgga cttggacttt 5521 gagccaagaa atttatttct tggcggacga gaattatggt gcaagatctg agtatactta 5581 catcttgcac ctgggaaaag aaatgtcaaa gttattacat attatttgac tgggttctcg 5641 tccgcctagg actataagtc tagccctgtc aaggtgactt ttttgcagtc ttcaaaagtt 5701 tggcagttga gacatgtgga tgttgagggt catactttgg agagtctttc ttcttttttg 5761 gaacacgggt gtgtttaaga aaagacttca aatgaacccg tgaagccaaa ttttggaggc 5821 atttagcaaa ctcagatagt tcaaaattgg caaatacact ccaatgccca ggagcgatcg 5881 caattatcat tcctcgataa gtgccttgta cttcgtcaac catatagtaa tcagacagtc 5941 cagcatcaat ttttcccaca ccatgaacgc ttgctaaaac cccgcgaacg gttctaaaat 6001 gttatgggcg actaatgcaa gacaaaatga gaacaaggca gcttttggat agcccaaagt 6061 ttgaatttgg cattcgtaat tttccgtcac agtttgaaac agattttcaa cactgcgtcg 6121 ttcccggtat aattgtgcaa taactactgc ggttacagtt tccgttggaa gcatcgtaaa 6181 aatggcaatt tcattctcac cattacgatc tggttttttt agcttcaaaa cgactcgcct 6241 catcagcaac aattttccag catcgctcag actaacagtt tgctcaaata aatcccctgt 6301 ctcaatacta ccaactgctt gcaatgaatt gattgcttcc cagggcattg atttgtgttc 6361 ccgaatcacg aagtttgctc cacttcgatg taagccagac aaaaatccaa cagtacacat 6421 gttacggtct 
gcaatccata attcgccttc ttttacagtt tttagaacct cggaaaacat 6481 tgaacgttct tgagcatgac catcttcaca accaaataca ttgattgcta atcttaattc 6541 tgggtcaaga actactaatg actttcctgg taatgctcct gcggcgattg ttcgtaatac 6601 ttcgagacga tggtcagttt tttcaaggca gtttccatca aggattctca ggacttacgc 6661 aactgtcaca ggcgatcgcc acctataact agactaatga cat
//
LOCUS NODE_4572_length_6688_cov_5.020353 6688 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6688) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6688) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6688 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(444..1667) /locus_tag="DP116_26125" CDS complement(444..1667) /locus_tag="DP116_26125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015184153.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfate ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_26125" /translation="MSKWQHTKKSGTYWIEKIETHVVQVWQGIKQWSHRPIKSWLNGH SLQGFVCLFLVGISLSLAMSAYAASSSNTSRGTHSANNASLVAQQKNIELNLVSFSVT KAAHDKIIPKFVEKWKKEHNQNVTFKQSYGASSPQALAVIQGGIEADVVHLSLAPDIL KIETAGLIQPGWEKEFPNNSIVSKSVAAIVTRKGNPKGIKTWADLAKNGVSVITPNPI TSGSARWNFLALWNAATKASGDESKAIEFVTNVYKNVPILPESAREATDAFFKGGKGD ALISYENEVYLKALNGEKPVYVVPDINFSIDNPVAIVDKNVDKHGTREVAKAFIQYLY TPEAQQIFAQTGYRPIDPGVAQAKEFVDKYPQIKTLATTNDYGGWAVIQKKFFSDNAI FAKILRGVNTQTNAR" gene complement(2102..5347) /gene="carB" /locus_tag="DP116_26130" CDS complement(2102..5347) /gene="carB" /locus_tag="DP116_26130" /EC_number="6.3.5.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316955.1" /note="four CarB-CarA dimers form the carbamoyl phosphate synthetase holoenzyme that catalyzes the production of carbamoyl phosphate; CarB is responsible for the amidotransferase activity; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbamoyl phosphate synthase large subunit" /protein_id="PRJNA477356:DP116_26130" /translation="MPRRDDIRKILLLGSGPIVIGQASEFDYSGTQACKALREEGYEV VLVNSNPATIMTDPETANRTYIEPLTPELVEKVIAKERPDALLPTMGGQTALNIAVSL AKSGVLEEYNVELIGAKLPAIEKAEDRKLFNEAMTKIGVKMCPSGTASSLEEAKAIAK QIGTYPLIIRPAFTMGGTGGGIAYNEEEFAEMAQAGIDASPVSQILIDQSLLGWKEFE LEVMRDLADNVVIICSIENLDPMGIHTGDSITVAPAQTLTDKEYQRLRDMAIKIIREI GVETGGSNIQFAVNPVDGDTIVIEMNPRVSRSSALASKATGFPIAKMAAKLAVGYTLD EIKNDITKKTPASFEPTIDYVVTKIPRFAFEKFPGAEEVLTTQMKSVGEAMAIGRTFN ESFQKALRSLETGRAGWGCDKKEKLPSGDQIRAQLRTPNPDRIFAVRHALLTGMSVEE IYELTGIDPWFLDKMQELLEVEKFLKRTSLQQLTKEQMYAVKRQGYSDRQIAYATKTT EDEVRQYRKQLDVIPVYKTVDTCAAEFEAFTPYHYSTYEEETEILPTTKPKVMILGGG PNRIGQGIEFDYCCCHAAFALKDAGYETIMVNSNPETVSTDYDTSDRLYFEPLTKEDV LNIIEAENPVGVIVQFGGQTPLKLSVPLQKALSTRTQIWGTSPDSIDIAEDRERFEKI LQELNITQPPNGIARTYEDALVVAQRIGYPVVVRPSYVLGGRAMEIVYSDAELERYMT YAVQVEPEHPILIDKFLENAIEVDVDAIADHTGRVVIGGIMEHIEQAGIHSGDSACTI PSISLPPIVLEKIRSWTVELARALNVIGLMNIQFAVVGTQGYSPQVYILEANPRASRT VPFVSKATGIPLAKLASLIMSGKTLEELNFTQEVIPDHIAVKEAVLPFNKFPGTDIIL GPEMRSTGEVMGIDKDFGRAFAKAALGAGERFPLSGTVFFTMNDRDKAAAVPVVQEFI NLGFAVMATDGTRRVLQEHGLDVELVLKLHEGRPHVLDAIKNHRIQLIINTPSGEEAQ SDGRLIRRTALAYKIPITTTIAGAKAIVAAIRSLQNTTLDVRIIQEYTTVN" gene complement(5895..6449) /locus_tag="DP116_26135" CDS complement(5895..6449) /locus_tag="DP116_26135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210377.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA starvation/stationary phase protection protein" /protein_id="PRJNA477356:DP116_26135" /translation="MSETQTLLRNFGQVYDNPILLDKSVTEPVCEGFNVVLASFQALY LQYQKHHFVVEGAEFYSLHEFFNESYNEVQEHVHDIGERLDGLGGVPVANLRKLAELC CFEQEPDGVYSSRQMVENDLKAEQAILGVIRRQAGQAESLGDRATRYLYEKILLKTEE RAYHLSHFLAKDSLTLGFVQPASN" BASE COUNT 1772 a 1502 c 1341 g 2073 t ORIGIN 1 ttctaagaat tcttaaaata aatcaacaat aaccataatc attttgtgct ggtacacgtc 61 gtttatttaa cattgacgtt tttgcagtgg cgactatgtg aaactcgcac aggacaaata 121 cttatattaa ataatactta tttttgaaat cctcattctg gaaaatacgc cagagattca 181 cttgttatag cgtgcaagca ggggattcca ttgggcaacg ccgaacctgt tgtgaaggac 241 ttcaataaaa ccacttagag tattttataa aaatttttac ttgttttgct ctgaagttgt 301 cagtcaacag gttcgttgat tatttacgca cacgaaatct ggccaaccgt atcaaaaatc 361 tctgcatata aggattcagt ccaggtgcat agtgacggtt gagtctaaca gccattagcc 421 cttgaatact cgccttgtga cctctaacgt gcgttagtct gtgtattaac accacgcaga 481 atcttggcaa aaatggcgtt gtcactaaaa aactttttct gaatcacagc ccaaccgcca 541 taatcattag ttgttgccaa agttttgatc tgagggtatt tatcaacgaa ctcttttgct 601 tgcgctacac ccggatcaat tggtcggtaa ccagtttggg caaatatttg ttgagcttct 661 ggggtgtaca aatattgaat aaacgctttt gccacttccc gggtgccgtg cttgtccaca 721 tttttatcga caatcgcaac tgggttatca atcgagaaat tgatatccgg aacaacgtat 781 acaggttttt cgccattgag agctttcagg taaacttcat tttcgtagct gatcaaagca 841 tcccctttgc cccccttgaa aaaagcatca gttgcttcac gagcgctttc aggcaagatg 901 ggaacatttt tatagacatt agtgacaaac tctattgcct tagactcgtc gccagaggct 961 ttcgttgctg cattccataa tgcgaggaag ttccagcgtg cgctaccaga ggttatcggg 1021 ttcggcgtaa tcacactgac tccatttttt gctaagtctg cccatgtttt gatccctttg 1081 ggattgcctt tacgagtcac aattgcagct acagacttgc tgacaatact gttgttagga 1141 aattcttttt cccaacctgg ttgaatcaat cctgctgtct caatctttag aatatctgga 1201 gcgagggata aatgtaccac atcagcttct atccctcctt gaatcaccgc aagggcttga 1261 gggctagagg ctccataact ctgcttaaag gtgacatttt ggttatgctc ttttttccac 1321 ttttctacaa atttgggaat aatcttgtcg tgggcagctt tcgtgacaga aaaggaaaca 1381 
aggttaagtt caatattctt ctgttgagca acaaggctgg cattatttgc agaatgagtc 1441 cctcttgaag tattgctact acttgctgca taagcactca tcgccaagct caagctgatc 1501 cctaccaaaa aaaggcacac aaagccttgc agagaatgcc cgttcaacca actttttatc 1561 ggtctatgac tccactgttt aatcccttgc cagacttgaa caacgtgagt ctcaattttc 1621 tcaatccagt acgttcctga tttttttgta tgttgccact tgctcattaa aattctccga 1681 catgattacg gaaactttaa agatttaccg tattttagcg gcattatacg tcagttaagc 1741 aatggtttta ccggagattc atgtttcttt aacttcctcg cgccctcgtc ccgcactata 1801 acgcaatacg gttcagttaa ggatttttgg tgagatttag tgatgtgtcg tgaaagtgca 1861 tgctacgtct ctacattgcc ctgctgatag gaattttagt cttaagctgt tacgctttta 1921 acttgcatat tattttggtt tagtcaatca ttgaaaccct cttcccttct gccttctgcc 1981 gtctgccttc tgccttgctg ttgtgcattt ttaatgcaca acagcttatc accaagcgta 2041 ttgaactata acggtactcc gtgagcgtgc cgggattgct ctggcggacg gcaaggtgct 2101 attaattcac cgttgtgtat tcttggatga tcctcacatc taaggttgta ttttgtaaag 2161 aacggatagc agcgacaatc gcttttgctc cagcaattgt ggtggtgatg ggaattttgt 2221 aagccaaggc tgtgcgacga atgagtctac catcactttg cgcttcttcc ccagagggcg 2281 tgttgataat caattgaatt ctatggttct taattgcatc taagacgtga ggacgacctt 2341 catgcagttt cagcaccaac tctacatcta agccatgttc ttgaagaacg cgacgtgtac 2401 catctgtcgc catcacagca aagcccaaat tgataaattc ctgaacaaca ggaaccgcag 2461 ccgctttatc gcgatcgttc atggtgaaga atactgtacc acttaaggga aaacgctctc 2521 cagcgcccaa tgctgctttg gcaaaagcgc gtccaaaatc cttgtcaata cccatcacct 2581 caccagtaga gcgcatctct ggtcctaaga tgatatcagt accaggaaat ttgttaaatg 2641 gtaaaactgc ttctttaacc gcaatgtggt ctggaataac ttcttgggta aagtttagct 2701 cctccaatgt tttacccgac ataattaaag atgcgagttt tgccagtggt atgccagttg 2761 ctttagacac aaatgggact gtgcgagaag cacgaggatt ggcttctagg atgtaaacct 2821 ggggtgaata accttgcgtg ccaacaacag caaactgaat attcatcaac ccgataacat 2881 tgagcgccct tgccaactct acagtccaac tacggatttt ttctaaaacg attggtggta 2941 aagaaatcga aggtatcgta caggcagagt ctccagagtg aattcccgct tgttcgatgt 3001 gttccatgat accaccaatg acgacgcgtc cggtgtgatc ggcgatcgca tccacatcga 3061 cttcaatggc 
attttctaaa aacttatcaa tcagaattgg atgttccggt tccacctgca 3121 cggcataagt catgtaacgt tctagttcag catcggagta aacgatttcc atcgcccttc 3181 cccctaacac gtaactagga cgcaccacca ctgggtaacc aatgcgttga gcaacaacca 3241 aagcatcttc gtaggtacga gcaataccat tcggcggttg ggtaatattc aactcttgaa 3301 gaattttctc aaaccgttcc cggtcttctg ctatgtcgat agaatctggt gaagttcccc 3361 aaatttgagt gcgagtgctg agagcttttt gtagaggaac ggacagcttt aacggagttt 3421 gcccaccaaa ctggacaatc accccaactg ggttttcggc ttcgataata tttaggacat 3481 cttcttttgt cagcggctca aagtacaggc gatcgctcgt atcataatca gtcgatactg 3541 tctcagggtt cgagttaacc attattgtct catatcccgc gtcttttaaa gcaaaagcag 3601 cgtgacaaca acagtaatcg aactcgatac cttgcccaat gcggttagga ccacctccca 3661 aaatcatcac cttgggtttg gtagttggca gaatctccgt ttcttcttcg taagtcgaat 3721 agtgataagg agtaaatgcc tcaaattccg cagcacaggt atctaccgtt ttgtaaactg 3781 gaatcacatc cagttgtttg cgatattgcc ggacttcgtc ttcggtggtt ttggtggcgt 3841 aggcaatctg gcgatcgcta tatccctgcc gcttcactgc atacatttgc tcttttgtca 3901 attgctgcaa tgatgtccgc ttgagaaact tttccacctc cagcagttcc tgcatcttat 3961 ccaagaacca cgggtcaata cccgttaact cgtagatttc ctcaacactc atccctgtaa 4021 gtaaggcatg acgcacagca aatatacggt cagggtttgg tgtccgcagt tgggcgcgaa 4081 tctgatcacc gctgggtaat ttttcctttt tgtcacaacc ccagccagcg cgtccggttt 4141 ccaaagaacg caatgccttt tgaaaagact cgttgaaagt ccgcccaatc gccattgctt 4201 ctcctaccga cttcatttgt gttgtcagca cttcttccgc accggggaac ttttcaaaag 4261 cgaagcgggg aatttttgtg acaacataat caattgtcgg ctcaaaagat gctggagttt 4321 tctttgtgat atcattttta atctcatcca gcgtataacc aactgcaagc ttcgccgcca 4381 tcttggctat ggggaaacct gtggctttgg aagccaaagc cgaagaacga gagacacgag 4441 ggttcatttc aatgacaatc gtatctccat cgacgggatt aacggcaaac tggatattag 4501 aacctccagt ttccacgcca atctcgcgga taattttaat tgccatatcc cgcagtcgtt 4561 ggtactcttt atcggtgagg gtttgggctg gggcgacggt gatagaatct ccggtgtgga 4621 tacccattgg gtcaaggttt tcgatggaac agataatcac aacgttatct gccaaatcac 4681 gcatcacctc taactcaaat tctttccagc cgaggaggga ttggtcgatg agaatttgtg 4741 aaacgggact ggcgtctata 
cctgcttgcg ccatttctgc aaattcttct tcgttgtaag 4801 caataccgcc tccggtacca cccattgtga aagcgggacg gatgattagg gggtaggtgc 4861 caatttgttt ggcgatcgcc ttagcttctt ccagagatga agctgtccca ctcggacaca 4921 tcttcacccc gatctttgtc atcgcttcgt taaaaagttt cctatcttct gctttctcaa 4981 ttgctggcag tttagcgcca attaactcaa cattgtattc ctctagcact ccacttttcg 5041 ccaaactcac agcaatgttc agggcagttt gcccgcccat cgtcggtagt aaggcgtcgg 5101 gacgttcttt agcgatgact ttttcaacca attccggtgt cagcggttcg atgtaagtac 5161 gattcgctgt ttccggatca gtcataattg tggcgggatt cgagttgacc agcaccactt 5221 cataaccttc ttctcgcagc gctttacaag cttgggtgcc agaatagtca aactcgctgg 5281 cttgtccaat aacgattgga ccagatccga gtagcagaat ctttctgata tcgtcacgac 5341 ggggcatagt ttttggtttg gagtaaaaaa atataaatct tgattatttt aagggtcttc 5401 acccaattag ttactttgtt tattatcaac ttttcttagt tcttttatag acaaaagata 5461 gagggcaggt tacaggttac aggtttttct ctataccatt accctgtcca gtgttctttt 5521 atcctttttt tttagcagtt ttattttatt cattgatcat cagacctatc catttttttt 5581 gcaactcttt tctatttgaa atcgggattt ttctcaataa caagagaatc ggttttgaca 5641 aatatctgat tttcaacatt taatcttttc ttaatgttga gttctggtta aataaaaaaa 5701 gttactaatt attttttata acaaccctaa aaattaaaga ttatattttt ttgttgactt 5761 acattaaaaa tggcagagag ataaataatc tttctctctg ccatttttta tagttgttta 5821 agaactagtg cttacaactt ttgttctagt ctagttaaat cactctctct ttatgagtat 5881 atgagttgag tatgttagtt tgaagcaggt tgaacaaatc ccaaagttaa gctatccttg 5941 gcaaggaagt gagacaagtg ataagctctt tcttctgttt tcaacaggat tttttcgtat 6001 aaatagcgtg tagcgcggtc gcccaaactc tctgcctgtc cggcttggcg gcgaatgacg 6061 ccaagaattg cttgctctgc cttcaggtca ttttctacca tctgacggga agaatagaca 6121 ccatctggtt cttgctcaaa gcaacataat tctgctaatt tgcgtaagtt agcaacaggt 6181 actccaccca atccatccaa gcgttctcca atgtcatgaa catgttcttg gacttcatta 6241 taactttcgt taaaaaactc atgcagagag tagaactctg caccttcaac aacaaaatga 6301 tgcttttggt actgtaagta cagtgcttgg aaactagcta atacaacatt aaatccttca 6361 caaactggct cggttacact tttgtccaaa aggatagggt tgtcatatac ttgaccaaaa 6421 tttcgtaaca aagtttgcgt ttcagacatt 
gtggttctcc tagcttctgg tagatatagt 6481 tgctaacttc ttagtctcta catcttaaca tcgaaaaaat gatcacaaga tcgatctata 6541 tcacgaattt ttgacctttt tatgaaaaaa aagcgataac tttcattcct tttgagagat 6601 tttcacacaa atgttgcaaa atgttctaaa cactattgac actattttgt aaatatgtat 6661 ttataaagtt tgaacataat atagctgt
//
LOCUS NODE_4601_length_6636_cov_4.963227 6636 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6636) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6636) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6636 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 58..1620 /locus_tag="DP116_26140" CDS 58..1620 /locus_tag="DP116_26140" /inference="COORDINATES: protein motif:HMM:PF00656.20,HMM:PF05419.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26140" /translation="MANNWAIVAGVNNYDFLPAAPLKFAVADALAMRSFLCDEVGFKS DQVLFCGDGTEGSKRATKPVLRDILLHQIQRAKNADNLWFFFSGHGIAEHLMTIDGNP RDLKETAISIHFVTDCLRRCKAKNIVLVLDMCRSESRDADERTVESIEGSLRELVKQR EGQQGIITLFSCGRGESSYEVPTLGQGAFTYALLEGLRKTTIVKDLERHLAERVPELH RIHASEKRRKQVPLVIPEPGWKYDEPILSRHATEADVARLKEMAIDAECDGDFPMALR FWEQINLLATKPEDRRRALNKTTDLVQRSQLKEPASPTPTIELPQPLIQHPLDSIPLD SEKGIDYSYLCGLLKTGQWQSADHQTLRVMLKAANRESKGWLDSDSLRTFPRKDLKTI DKLWMTASNGHFGFSAQKKIWEECGRPNDSDKNWNCFCTKLGWKNERMKYSSCSNLRA DLSTSPLGEFPARYTAGASFYRKEKGREIDYVVTTFFHIWGSSQDLVGGGREWFGDWG VRIYSLLSRSEL" gene 1633..1947 /locus_tag="DP116_26145" CDS 1633..1947 /locus_tag="DP116_26145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997948.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26145" /translation="MGVIAQELREFEQARKDYQQALQISIEFGDHYTQALTYHQLGIV AQELREFEEARANYQQALQIQVEFGDRYNQASTYHQLDIVAQALHKFEQARNFANNFV PQ" gene 2078..2308 /locus_tag="DP116_26150" CDS 2078..2308 /locus_tag="DP116_26150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="antitoxin of toxin-antitoxin stability system" /protein_id="PRJNA477356:DP116_26150" /translation="MTQISLTEAQQHLPELIANLKPGEEIQICQNERAIARLIIEPRT TRKPRQPGSAIGTFTIIADDEEHLEDFCDYMR" gene 2305..2691 /locus_tag="DP116_26155" CDS 2305..2691 /locus_tag="DP116_26155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015199290.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_26155" /translation="MKLLIDTHAFLWFVLNDSSLSPIARDLIIDPLNDIFLSPASHWE IAIKISIGKYRIPGQFEHWMNDQIQINELAILPLEVAHSAAVITLPFHHKDPFDRLLI AQSLVETIPIVSADVIFDAYGVTRLW" gene 2904..3146 /locus_tag="DP116_26160" /pseudo CDS 2904..3146 /locus_tag="DP116_26160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130579.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(3375..4727) /locus_tag="DP116_26165" CDS complement(3375..4727) /locus_tag="DP116_26165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010995496.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cell wall hydrolase" /protein_id="PRJNA477356:DP116_26165" /translation="MGRIFISAAHGGKEAGGIDPGSIAGGTTEAKEMIMLRDLIISEL RARNFEVLAVPDDLSAQQTIAWINSRARQKDVALEIHADAASNPSVRGASIFYIANND ERRNHAELLLVGLLRRVPQLPNRGVKPDSATGLGRLAFCRQTSVASLLMQVGFLTSPD DRALLQTRRRDFALGIADGVASWSRTIDPGSAVETDSTYPVFSININGQLYPEKGILI NGNAYIPIDLADRLRIDLSKAPNVRRVTYRRVVFLKAVELRDFHLSVVWDSTSQTLNL RSIAQIPLAQIDKIMSYGHASEVQLQLFLRNNNENAIVQFPDLPKLYREEATIEGVSH DIAFCQMCLETGFLHFGDDVKPEQNNFAGLGAIGGAPNAASFESARLGVRAHIQHLKA YASLEPLVQEIEDPRFRFVTRGIAPSIDQLSGRWSADLEYGNKIMAMLKRLYESAGLF " gene 4885..>6636 /locus_tag="DP116_26170" CDS 4885..>6636 /locus_tag="DP116_26170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016871877.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA primase" /protein_id="PRJNA477356:DP116_26170" /translation="MHIPRLHPDTIEEVKQRADIVDVISEHVVLRKRGKDYVGLCPFH SEKTPSFTASQTKQMYYCFGCQAGGNAIKFLMDLEKRSFAEVVLDLARRYQVSVRTLE PEQRQELQRQLSLREQLYEVLATAAQFYQHALRYSGEKALQYLQSVRQLKEETIQQFG IGYAPAGWETLYRYLVEDKRYPVQLVEKAGLIKPRKEASGYYDVFRDRVIIPIHDVLG RVIGFGGRSLGDEQPKYLNSPETEVFIKGKTLFALDKAKAGISQQDQAVVVEGYFDAI ALHAAGINNAIASLGTALSIEQVRQVLRYTDSKQLVLNFDADKAGTNAAERAIGEIAE LAYKGEVQLKILNLPDGKDADEYLQGREPEDYRKLLANAPLWLDWQIQQMTKDRDLKQ AADFQQVSGQMVKLLKNIANSDTRNYYISHCAEILSLGDTRLISLRVENLLTQIVPAR AFRKPTPVIKQQPVAGQQSLAAGESSLLQIAEALLLQIYLHYPVYRGAIVDVLDERDL QFSLSHHRFLWRQILEVSSGIDDDLISMVQDRCLEFPEEMDLVSHLFHLNEKTQKEII SRHNKLIQAATACMELVL" BASE COUNT 1827 a 1543 c 1585 g 1681 t ORIGIN 1 ggcgggtatt acgctgaagt ttaagcgcag ctcgaaggag aatgggattg gttgagtatg 61 gcgaataatt gggcgatcgt tgctggtgtc aataactatg attttttgcc tgctgctccg 121 ctgaagtttg cggtggcgga tgcgctggcg atgcgatcat ttctctgcga tgaagtgggg 181 tttaagtcgg atcaagtgtt gttttgtggg gatgggacgg aggggagtaa gcgggcgaca 241 aagccagttc tgcgagatat tttattgcat cagatccagc gggctaagaa tgctgataat 301 ctttggtttt tcttcagtgg tcacgggatt gcggagcatt tgatgacgat cgatggcaat 361 cccagagatc ttaaagagac ggcgatttca attcacttcg tgacggattg tctgcggagg 421 tgcaaggcaa agaacattgt gcttgtgttg gatatgtgtc gcagtgaaag tcgggatgcg 481 gatgagagaa cggtagagtc gatcgagggt tctctgcgag agttagtgaa gcagcgggag 541 gggcagcagg gaattattac gctgttttcc tgtgggcgtg gggagagttc ctatgaggtt 601 ccaaccttgg ggcagggagc gtttacctat gcgttgctgg aagggttgcg gaaaacgaca 661 atcgttaagg atctagaacg acatttagcg gaacgggtgc ctgagctaca tcggattcat 721 gcgagcgaaa aacggcgtaa gcaggttccg ctagtgattc cggaaccggg ctggaaatat 781 gatgagccga ttttgagtcg tcatgcgacg gaggctgatg ttgcgcggtt gaaggagatg 841 gcgatcgatg ctgaatgtga tggtgatttt ccaatggcgt tgcggttctg ggagcaaatt 901 aatttgttgg caacaaagcc cgaagaccgc cgcagggcgc tgaataaaac aaccgatttg 961 gttcagcgtt ctcagttgaa agaaccagcg agtcctactc ctacgattga actcccccag 1021 ccactgatcc agcatcccct cgactccatc cccctcgact cagagaaggg 
catagactac 1081 agctacctct gcggcttact caaaactgga cagtggcagt ctgctgatca ccaaaccctc 1141 cgtgttatgc tcaaagcggc taatcgggaa agtaagggct ggctcgattc tgatagccta 1201 agaacatttc ctcgaaaaga ccttaagact atcgataagc tctggatgac ggcaagcaac 1261 ggtcattttg gctttagcgc gcagaagaaa atttgggagg aatgtggtag accaaatgat 1321 tctgacaaga attggaactg cttttgcact aaattagggt ggaagaatga aaggatgaaa 1381 tattcaagct gttccaatct gagggccgat ctttccactt ctccgcttgg agagttccca 1441 gctagatata cggcaggagc aagtttttac cgaaaagaaa aaggtagaga aatagattat 1501 gttgtcacaa cattctttca tatctgggga agttctcaag atcttgttgg aggaggccgt 1561 gagtggttcg gagattgggg cgtaaggatc tattctcttt tgtctcgaag tgaattgtga 1621 acttgagaca atttgggagt gatcgcccaa gaattacgcg agttcgagca agcccgcaag 1681 gactaccaac aagccctcca aatctctatc gaattcggcg atcactacac ccaagccctc 1741 acctaccacc aattaggcat agtcgcccag gaactgcggg agtttgagga ggcgcgcgcc 1801 aactatcaac aagcgctcca aatccaagtc gaattcggcg atcgctataa ccaagccagc 1861 acctaccacc agttggacat tgtcgcccag gcattacaca agtttgagca agcccgaaac 1921 ttcgccaata attttgtccc tcaatgattg cgccgctgtg gctgcgactg gagccgatac 1981 tgccgacaaa accctgattg tcggtgacgt gttacaacta gatgcgatcg ccctttggca 2041 tagcttataa tgatgagagc gatcgcagcc caagcccatg acccaaatca gcctaaccga 2101 agcccagcag catctcccag aactgatcgc caacctcaaa ccgggcgaag aaatccagat 2161 atgccaaaat gagcgggcga tcgcccgcct gatcatcgaa cctcgaacaa cccgcaaacc 2221 tcggcaacct ggaagcgcga tcggaacatt caccatcatt gccgatgacg aagagcatct 2281 agaagacttt tgcgactaca tgcgatgaag cttctcatcg acacccatgc cttcctctgg 2341 tttgtcctaa acgactcctc actcagtccc atcgctcgtg acctgatcat tgatccgctg 2401 aatgatattt tcctcagtcc agcatcccac tgggagattg ccatcaaaat cagcatagga 2461 aaatatcgaa ttcctggtca gtttgaacat tggatgaatg atcaaattca aatcaatgaa 2521 ctggcgattt tacccctcga agtcgctcat tccgctgccg tcataacgct tccttttcat 2581 cacaaagatc cattcgatcg gctgctaatc gctcagtcgc ttgtcgagac tattcccatt 2641 gtcagtgctg atgtaatctt tgatgcctac ggcgttactc gcctttggta gagtccgtgg 2701 aatccttttg agcttggcgg cgatcgccca ggaattacgc gagtttgacc aagcccgcaa 
2761 ggactaccaa caggcgctca aaatcaaaat cgaattcggc gttaagcgta gctctgccgt 2821 aggcaatcgc tactcccaag ccagtaccta cagcacccta cgctacaata gcaaaagcct 2881 cccatctctg gggcgacagt gccatgactc cacaactcca gcaaatcttc cacagcctca 2941 acgaacttac ccagtcagaa cgttggcaag tatttgacta cctcgtcaac cagctcaaaa 3001 acgctctcac ccgttctgac gtccttgatc gctctgcctc ccaacaaccc ccaaaacgct 3061 caccccagga gatttttgca tcgacccaag gcagttggag ccaccaaact cgtgacgaaa 3121 ttgatgctca actcgcatcc cagagggtta aggattgggc atctgaagct tttggtcagt 3181 cgatactctc aaaaagaaaa attgtgtttt caacttcaga tgacgacaca gaatttgcat 3241 gaactttttt gagggatttt cccacaaagt caacacccta ggcggagcct ccatgactgt 3301 attccttgtc agagacaagg aaccagacga cggttgcgga gaacaatttg tttgtggtgc 3361 taggatttca gcacttagaa tagccctgct gactcataaa gccgtttgag cattgccata 3421 attttgttac catattctaa atctgcagac caacgtcctg agagttggtc gattgatggg 3481 gcaataccgc gagtcacaaa gcgaaatctc ggatcttcga tttcctgcac caatggttcc 3541 aaactggcgt aagctttcaa gtgttggatg tgcgcccgaa caccaagtct ggcactttca 3601 aacgacgctg cgtttggtgc accaccaata gcccctaaac ctgcaaagtt attttgttct 3661 ggtttgacat cgtcaccaaa gtgtaagaat ccagtttcta ggcacatttg gcaaaaagca 3721 atgtcatgac tgactccttc tattgtcgct tcttcgcggt agagtttggg caagtcggga 3781 aactggacga tcgcattttc attattattt ctcagaaata actgcaactg cacttctgat 3841 gcatgaccat atgacataat cttgtcaatt tgagcaaggg gaatttgggc gatcgagcgt 3901 aaattcagag tctgacttgt actatcccat acgacagaaa gatgaaagtc gcgcagttca 3961 acggctttga ggaaaacaac tcggcgataa gtcacgcggc ggacattagg cgctttggac 4021 aaatcaatcc gcaagcgatc cgctaagtca atggggatat aggcattacc gttaatgagt 4081 atccctttct ctggataaag ttgtccgtta atattgatac taaataccgg atatgttgag 4141 tctgtttcca cagctgaacc aggatcaata gtacgactcc aagatgcaac tccatcggca 4201 attcctaagg caaagtcgcg gcggcgagtt tgtagcaagg cgcgatcatc tgggctagta 4261 agaaagccaa cttgcatcaa caaagaagca actgatgtct gacgacaaaa cgccaaacgc 4321 cccaaacctg ttgctgagtc tggcttgact ccccgatttg gtaactgagg aacacgacgc 4381 aacagtccca ccagtagcag ttcggcatga tttctgcgtt catcattgtt agcaatgtag 4441 
aaaatgcttg ccccacgtac agaaggattg ctcgcagcat cagcatgaat ttctagggca 4501 acatcttttt gacgagcacg agaatttatc caagcgatag tttgttgggc gctcaagtca 4561 tctggaaccg ccaaaacttc aaaattacgc gctctgagtt cgctgatgat taaatcccgc 4621 agcataatca tttccttggc ttcggttgtc ccaccggcga tggaacctgg atctattcct 4681 cctgcttctt tgcctccgtg agctgctgaa ataaaaatgc gtcccatttt cagtatttcc 4741 tgttatacaa ttaacaatta gcgattctgg agcgtcattc ttggattagc atcaagaata 4801 taatattcat cagaaccgtt agtctatagt caacagtcaa tgatcaatca tttgtaactc 4861 ttgactgttg actcctaatt tagtatgcac atcccccgcc tgcatccgga cacaattgaa 4921 gaagttaaac aacgagctga cattgtcgat gtcatctcag aacacgttgt cttacgcaag 4981 cgcggaaaag actatgtagg tttgtgtccc tttcactcag aaaaaactcc cagtttcact 5041 gccagccaga ctaagcagat gtactactgc tttggctgtc aagctggtgg caatgctatt 5101 aagtttctca tggatttgga gaagcgctct tttgctgaag tggtattaga tttagcgcgg 5161 cgttaccaag tctcagtgag aacactcgaa ccggaacaga ggcaagagtt gcagcgtcag 5221 ttgtctttgc gcgaacaact ctatgaagtt ctcgccacgg cggcacagtt ttatcaacat 5281 gctcttaggt atagtggaga gaaggcatta caatatttgc aatcggtgcg ccaactgaag 5341 gaagaaacaa tacagcagtt cggtatcgga tatgcgcccg caggttggga aacactctac 5401 cgctatttgg ttgaggacaa acgctaccca gtacaactgg tggaaaaagc gggtttgatt 5461 aaaccacgta aggaagcgag tggttattat gatgtgttcc gcgatcgcgt catcattccc 5521 atccacgatg ttctcggacg tgtcattggc tttggtggca gaagtttagg agacgaacaa 5581 cccaaatatt taaattcacc cgaaacagaa gtttttatca aggggaaaac tttatttgct 5641 ttagacaagg caaaagcagg aatttctcag caggatcaag ctgtggtggt ggagggatat 5701 tttgatgcga tcgctctcca cgcagctggg atcaataatg cgatcgcctc ccttggcact 5761 gctcttagca tagaacaagt tcggcaagtc ttacgctaca ccgactctaa acagttagtc 5821 ctcaactttg acgctgataa agctgggaca aacgccgcag aaagagcaat tggagaaatt 5881 gccgaactcg catataaagg tgaagtccag ctcaagattc tcaacttacc tgatggcaaa 5941 gacgctgacg aatatttgca aggacgcgag ccagaagatt atagaaaatt attggcaaat 6001 gcaccacttt ggttagattg gcaaattcag caaatgacaa aagaccgtga cttgaaacag 6061 gctgctgatt tccaacaagt gtctgggcag atggtgaagt tattgaaaaa tatagccaac 6121 agtgatacac 
gtaactatta catttcccac tgcgctgaaa tcctcagttt aggagatacc 6181 agactgatat ccctaagggt agaaaatctt ctgacacaaa ttgttcctgc tagggcattc 6241 cgtaagccaa caccagttat caaacagcaa cctgtagcag gacagcaaag tcttgcagct 6301 ggtgaaagta gtttattaca aatagcagag gctttattgc tgcaaattta cttgcactat 6361 ccagtatatc gtggggctat cgttgatgtt ttagacgagc gagatttgca atttagcctt 6421 tctcatcacc gctttttatg gcggcaaatt ttagaggttt cctctggaat tgacgatgat 6481 ttgatatcaa tggtgcaaga tagatgtcta gagtttcctg aagaaatgga tttagtctct 6541 cacttatttc atttaaatga gaaaacacaa aaagaaatta tttcacgcca caataagttg 6601 attcaagcag ccactgcttg catggaactg gtgctg // LOCUS NODE_4704_length_6440_cov_3.0342996440 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6440) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6440) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6440 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 333..920 /locus_tag="DP116_26175" CDS 333..920 /locus_tag="DP116_26175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869148.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26175" /translation="MAKSALPVPSFPQIFVPQLVRDKKFAPVTAPSGTGDAPVGASEN LFGNVLKHYFGEGVVLAQQKMLPPGHDLHYTTDFLIVEPTTGLHLDVEVDEPISFATG KPTHCIGEDNYRNKCFVDANWVVVRFAEEQVSSQPERCALVIAGAIAKLTGNTTYEQK LRHAGRVDAIKQWTPRQATKLKKSNYRQGYLASRK" gene 1150..1467 /locus_tag="DP116_26180" CDS 1150..1467 /locus_tag="DP116_26180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314072.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26180" /translation="MSNCVTYYTHTNQIDQLVESFGTTLEKLTLADKLALRVTLTYWL FHCEVADLGEYTLNHALEDMMQGYSEDCQINLKEAIAILGGIGKDEAEGLIESLTAQL RCL" gene complement(1792..2091) /locus_tag="DP116_26185" CDS complement(1792..2091) /locus_tag="DP116_26185" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26185" /translation="MSYHLNTTGTSRNPIGSICPLDIVVQVGFGTAKITKNRQTYYDG ETEVGQPKTLAEFEEIAQQEPGNWKLIMEAPLWEAIWERQDTNKWVCIKAGKGFA" gene complement(2598..3098) /locus_tag="DP116_26190" /pseudo CDS complement(2598..3098) /locus_tag="DP116_26190" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314075.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 2622..2631 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(3463..4044) /locus_tag="DP116_26195" CDS complement(3463..4044) /locus_tag="DP116_26195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015164451.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26195" /translation="MNVLVIPEDFRKDQYMLKPMITAMMEALGKPKTKVRVCQDPLLG GINEALKWERISEIIERYKGMVNLFLLCVDRDGKEGRRGVLDKIEQQAANILTGGRLL LAENAWQEIEVWVLAGHDLPADWNWQFIRNEINPKETYFLPFAAQRNVLDAPGEGRKL LAQEAARRYSRIRQLCPEDIAVLENRINSYTAG" gene complement(4041..5294) /locus_tag="DP116_26200" CDS complement(4041..5294) /locus_tag="DP116_26200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012233587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_26200" /translation="MLTKLRLERFKNFKDAELILGALTLLVGTNASGKSNIRDAFRFL HGISRNYNLAEIFGEKWIEGGVLQWRGIRGGTREVTFIDESTTFALAVSFTLVKNDSS QEATYRIEVNPGASGKAPTVIAESLTIAGQSDSVFEAQSHQESGQNNLSVKVSGGTPF DPILSYTNYRPVLSQIAAIAPSPSVRETALAAMSALSSMRFLDLSPDAMRLPSLPGQT VLGDRGENLSSVLQEICENPESKQALLQWVQELTPMDAKDFEFPADFTGKILLQLIEE NSQKTSAISASDGTLRFLAMIAALLGQEPARFYFFEELDNGIHPTRLHLLLELIERKV SQGTIQIVATTHSPQLLRLLSPQSLEYASLTYRFWDKPDAHIKRIVDIPEARRVINEQ DLARLHESGWLEDAVEFLEDEETNE" gene 5841..6398 /locus_tag="DP116_26205" CDS 5841..6398 /locus_tag="DP116_26205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877636.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26205" /translation="MPDQKSPPSSMIRVPVPLIGVVRRLSKLHRQGHTIALLQALEEL MSNFDINNDIDVTAGDKSVKQQEERLLKLESHLADLGSSVETKLEVITLKLELIERAI ASGRLSHNTKPRRLAYPYQQTQVELQPRTNESLAQRLGVTAQGLVAERENKSVKEFLS WSRNRDPMSIAWEWKTSDGLYHPQR" BASE COUNT 1785 a 1309 c 1343 g 1993 t 10 others ORIGIN 1 tagcaccttt ttctggtggt ttggcagaac tgactgatat gagtgagtag gcgcagtctt 61 tcttggaggg gagattcgca ggcgatgacg gggtattctt cttgaatcag tcctgagagt 121 atctggaagc tgtacatacc ttgtccaaag tgcttttagc gtattttctt ccatgtgact 181 gacaattggc atccaatttt agttgtaaaa aatatgcaac atctgttggt ttcaataatg 241 ctggtgtact tgtaaaactg ggtgtaatat tgttccttat ttaaggaaga tatgaagcga 301 caacctatta tttaaagatg tgatgaacaa gcatggcgaa atcagcatta ccagttccat 361 cttttcctca aatatttgtc ccgcagttgg ttcgagataa gaaatttgct cccgtgactg 421 caccttcggg tacgggtgat gcccctgtgg gggcgagtga gaatctattc gggaatgtct 481 tgaagcatta ttttggtgag ggggttgttt tagcacaaca gaagatgtta cctccagggc 541 acgatttaca ttacacgact gactttctga ttgtggaacc cactactggg ctacacctag 601 atgttgaagt ggatgagcct atctcatttg caacgggaaa acctacccac tgtattgggg 661 aggataacta ccggaacaag tgctttgtcg atgctaactg ggtggtggtg cgctttgcgg 721 aggaacaggt ttctagccag ccagagcgtt gtgcgcttgt catagctggt gcgatcgcaa 781 aactgaccgg caatactact tatgaacaga agttgcgcca tgcaggtagg gtagacgcta 841 ttaagcagtg gacacctcgt caggctacga agctgaagaa aagtaattac cgtcaggggt 901 atttggcatc taggaagtga catactccct caccataatc tttgatgtgc gatttgtagc 961 gcgggagctg tagggcaaag cccttcgatg cagggtagga aggaaagctt gtacagtaag 1021 ctttttaacc tttttcaact ggatagcaat cgccacgctg caatagtaaa gtttttcagg 1081 tatttagcaa aagagatcta ggctactact ggtgatattt tcgagaaaaa atttagggag 1141 tttctcaaga tgagcaactg cgttacatat tatacgcata cgaatcaaat tgaccaactt 1201 gtcgagtctt tcggcacaac actagagaaa cttaccctag ctgataagct tgcattgcgg 1261 gtaacattaa catattggtt attccactgc gaagttgctg atttaggtga gtatacttta 1321 aatcatgcct tagaggatat gatgcaagga tattcagaag actgtcaaat taatctcaaa 1381 gaagcgattg 
cgattttagg aggtattggc aaggatgaag cagaagggtt aattgaatca 1441 cttacggctc aactgcgctg tttgtaaaaa atattgaact aacggctttg gcgctaagta 1501 ggaaagcaca aataaacaca actatgttta gaaagattaa ggtattgaaa tcctcttccc 1561 ttgagctccg gctgccttgt ctcaacgaca aatatttgtg ctgacctatt tatcatacgg 1621 gaattagttt aagttctttt aatcgttcct gaatatctac tacctcaact tcccaaaatt 1681 tttcagatgg ttgcacgaag agccgctatc tggtggttcc caatagctag ctgctaggat 1741 tatcccatgt atggatgtgc ttgcaattga agcagctttt cggttgtgtc actatgcgaa 1801 tccttttcct gccttaatac aaacccactt gttagtatcc tgcctttccc agattgcttc 1861 ccataaaggc gcttccatta ttaatttcca atttcctggt tcttgttggg caatttcctc 1921 aaactcagca agtgtttttg gttgacctac ttcagtctcg ccgtcgtagt aagtttgtct 1981 gtttttggtt atcttagccg ttccaaatcc tacttgaact actatgtcta gagggcaaat 2041 actgccgatt ggattgcgag aagtaccagt tgtattgaga tgatagctca ttataaagaa 2101 tccataaata tttcttgata atgagcatta gctattgatt gtagtctaat tatttttagc 2161 ttttctttat aattttttag ccaaacgata cggatattac tcattttttt ctctgactgg 2221 tagcagttga ttgctgaatg atttaatgtg gttatgcacg gtgtaacctg acatatggtt 2281 ggcgaatgca gtttgcatat ctaacatgaa taccagaata aacaagctgc taaatccata 2341 gataaggatg tagagtaagt tgtcgataaa atgaactaaa ctttccattt gcagttgcct 2401 tataaaactg aattgttgtt accgtcagga agtgatttgg acaatttctt ctaatatcat 2461 tcagcgaagc tgctgtggtg tatattttcg gacattttta ttctttctcc ttataggtac 2521 agtatgtaag agttaataaa aaaatggttg agtaaaatct caaccatgat ggtaatattt 2581 ctgctcggga atttgtttta tcgagcttgg cttctgccat cnnnnnnnnn ngattgtact 2641 ttaaaccgta cctgcttacc gatagttacc tcagcaacat tgttgatatt ttgaccagat 2701 ttgtaagctt cgataattgc cttattatta gcttggtact tgatagttct aaactcatca 2761 ggaaaatctt catcatctac ttctaatagt aagttagcca ctttaggcgg gttatctcta 2821 atctcaatca ccctttcctt accgatattt ttgtcattga ttaatccaat ctcgtataaa 2881 tggatgaggg tgcgcttgat ttgttcaagt tgagttttgc ggcgtgaaat tacctggtcg 2941 tggagttcga ctactcgtcc ttttctttcc tcccaggttc caaggtcaag atttaactgg 3001 tcaactaccc aggcgatggc atcgattttg gtttcgatac catcctgaat agtcataagt 3061 tcttgtacaa gttcctctac 
tttaccctca gagcctaaaa tgtattttcc gttccgggcg 3121 agcaatatgt tcactgttcc agtaccaact accatcagtt cgacgacaaa tcttggtttc 3181 taaccagttg tttgtacgaa cgagacaacc tccatcttta aactggatag ctggtttgaa 3241 acagtttccg gaggcgttag atacccagat accaaaaact tgcgcattgg aattgcatag 3301 tcgaagcgtg catgttaagt gacttctact tttcctggcg taagcctccg ttgattagac 3361 atctggtgaa aaatgccaat accactctct tggaagttta aggtgaatag tcaacaagcc 3421 aaagtatcag agtggattgc tatgtaacgg tctgctcatg ctttatcctg ctgtgtaaga 3481 gttaattcgg ttttcgagaa cagctatatc ttcgggacag agttggcgaa ttcggctgta 3541 tcgtcgcgct gcttcttggg ctagtaactt tcgtccttca ccaggagcat ctaaaacatt 3601 tcgttgtgca gcaaagggaa gaaagtaagt ttctttagga ttaatttcat tgcggatgaa 3661 ttgccaattc caatcagcag gtaaatcatg accagctaaa acccaaacct caatttcctg 3721 ccaagcgttt tctgctaaaa gtaacctacc accagttaaa atatttgctg cttgttgctc 3781 aatcttatca agcacccccc ttcttccttc ttttccatcc cggtctacgc aaagtaggaa 3841 taagttaacc atacccttgt aacgctcaat aatttctgaa atccgttccc atttgagagc 3901 ttcattaatg cctcctaaaa gtggatcttg acaaactctg accttggttt tcggttttcc 3961 taatgcttcc atcatcgccg taatcatcgg cttaagcata tattggtctt tacgaaaatc 4021 ttctggaatt acaagtacat tcattcgttt gtttcctcgt cttccagaaa ttccactgca 4081 tcttccaacc atccagattc atggaggcgt gccaagtctt gctcattgat aacacgtctt 4141 gcttctggaa tatcaacaat gcgttttata tgtgcatcag gtttatccca aaagcgatag 4201 gttaatgagg catattctaa tgattgcgga cttaaaagtc tgagcaactg tggtgagtga 4261 gtggttgcta caatttggat tgttccttga gaaactttac gctcgataag ttcaagaagt 4321 aggtgtaagc gggtggggtg aatgccgttg tccagttctt caaaaaaata gaatcttgct 4381 ggttcttgtc ctagtaaagc agctatcatt gctagaaaac gtaacgtccc atctgacgca 4441 ctaatagcag atgttttttg actattttct tcaattaatt gaagcaatat ttttccagta 4501 aaatcagcag gaaactcaaa atctttagca tccattgggg ttagttcttg tacccattgg 4561 agaagtgctt gtttactttc agggttttca caaatttctt gcaatacaga agataggttt 4621 tcacccctgt ctcccaagac tgtttgtcct ggcaaggaag gtagtcgcat ggcatctggg 4681 cttaaatcta aaaaccgcat tgaacttagg gcggacattg cagctagggc tgtttctctc 4741 actgatggtg atggtgcaat cgctgcaatt 
tgtgaaagca cagggcgata attggtataa 4801 gataaaattg ggtcaaaagg agttccaccg cttactttca cggataaatt attctgccct 4861 gactcctgat gagattgcgc ttcaaaaacg ctatcagatt gccctgcaat tgtgagactt 4921 tcggcaataa cagtaggggc tttaccgctt gcaccaggat tcacctcaat ccgataagtt 4981 gcttcttgtg aggaatcatt ttttaccaag gtaaatgaaa ctgctaatgc aaaagtggtg 5041 gattcatcga taaaagtgac ttctcgcgtt ccgccgcgaa taccccgcca ttggagaacg 5101 cctccttcaa tccatttttc accaaaaatt tcagctaggt tgtaattacg agaaatacca 5161 tgaagaaagc gaaaggcatc ccgaatattg ctttttccag aggcatttgt ccctactaaa 5221 agagtcaagg caccgagaat gagttcggca tctttgaaat ttttaaatcg ttccagacgc 5281 agctttgtaa gcataactcc ctcaattaaa ccgccttttg ctggaaaaaa gctgatgtca 5341 gtgcgtttat tgattgctaa atataaatca tacttgacga aaggcaatgg tgatgcacca 5401 actgtggcaa aatgatctac tagaatttgc ttttctctat acttagagag taactatttt 5461 ttaaatatgg gtatttacgt agcagttgtt gaattgtctc ataggtagtg taaatttgcg 5521 ctacccatga aacagggaac ggggatcgtt ccccccggtt tccctactca tctaatcttg 5581 ctgctgttgg cgttgtgccg ctatttctcg tttcacctct tgcacaaggt gaataatttt 5641 atccggtgaa atcatagccc gcttcggtca acgcttgctt taatccctgt tccaattgtt 5701 ttggtgacat caaagtacac tcgcaaattg tgttgatatg taacaagttt gagaaatttg 5761 gacacaatta atactcagtc aattactgta ttccaccaaa ccttttataa tcagcttgtc 5821 gatatcaata atgatatcaa atgccagacc aaaaatcccc accaagctcg atgattcgtg 5881 tgcccgttcc actcattggg gtggttcggc gactatcaaa actacacagg cagggtcata 5941 caattgcctt gctgcaagcc ttagaagaac taatgtcaaa tttcgatatc aataatgata 6001 tcgatgtaac agcaggcgat aaatcagtca aacagcaaga agagaggctt ttgaaactgg 6061 aatcccatct tgcagatctt ggttcttcgg tagaaacaaa actagaggtt attactttga 6121 agctagagtt aatcgaacgg gcaattgcct ctgggagatt aagtcacaac actaaaccgc 6181 gaagacttgc gtacccgtac caacaaacac aagttgaact acagcctagg acaaacgaaa 6241 gccttgccca aagacttggt gtcacagctc aaggtctcgt cgctgaaaga gaaaataaga 6301 gcgttaaaga atttttgagt tggtcccgga accgcgatcc tatgagtatt gcttgggaat 6361 ggaaaacttc ggacggactg taccaccccc aacgctaaag agtatatctc ccgcttcatt 6421 aatcctttcc aaaattggac // LOCUS 
NODE_4716_length_6422_cov_2.8682276422 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6422) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6422) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6422 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..2848) /locus_tag="DP116_26210" CDS complement(<1..2848) /locus_tag="DP116_26210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877652.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26210" /translation="MTSTKSTKPKDKLGRQSLETEQLGVVKGLTPFEDITHLAGIASI SLGGRKNIGALILKKKENIQIRFCFDIQGIHPSLSSEQILPIFENIEGGLKELPERET LTIHIGSFTNDTFRQQELKKIEKECDLQQLTLLIRSERLRARELTQAGVRKNKFLRFW CTYTVLAAENSKSNDPIEKTIKQLQTYWQGFTGEIHKLRHQRIETILQNSFTSGFQMW EQIISNTMGLSAKVLTAGDIWEVIWNQFNRSEVPPIPNPLILDEAGLREEQTSDFHIK HLVLENEQSVPFLDRSWVKLQDKYIGVLQFSEKPQGWVDEYDQLRYLWRVLSKEKISD TEIICQLSKANQGLAKTVLQRITKQSITSSAMSADKGSIDVKANMNIEEAVRAQETIL RGSIPIHTAVVFCIHRHSRHQLDEACRYLSSCFLRPAVVEREVEYAWKTWLQTVPIIW ENLLTKPFNRRLPYFSCEVPGITPLVRTATGDKNGFELIAEEGGTPVHLDLYNQHKNL AVFGTTRAGKSVLVAGLLTQALAQGIPVIALDYPKPDGSSTFTDYTKFMGTEGAYFDI SKESNNLFELPDLRGYEPEVIKERMTDFKDFLKSTLLTMVLGTNPIGVSPTMISNIES IITIAIETFFNDDDIKLRYKFALENGVGTTEWADIPTLKDFYNYCSPGFIKLDSIANN SKEILDALDHIRLRLNFWLNSRVGQSIANPSSFRTDARLLVFALRSLSSEADAAVLAL SAYAAALRRALSSKASIFFLDEAPILFSFESIAELIGRLCANGAKAGIRVILSAQEPE SIFQSKSADKIFANITTRLIGRIQTSAVDPFVNRFKYPSEIISRNSTEAFFPKRESIY SQWLLDDNGKLTFCRYYPAYCLLAAVANNPNEQELRTLFLNKYAKNPMLGMVRFSEAY VQMIRGDELSQEAKQLLEILDNNNGTQLKLVQTIE" gene complement(2967..3320) /locus_tag="DP116_26215" CDS complement(2967..3320) /locus_tag="DP116_26215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314065.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26215" /translation="MLRDREFRTVNRVLGDQPRLGPFPADQIVPWSAIALIMYFVVQG FLQAGWLATGISITWGWATWWTVTSNKAFLGKFVGTPRITRGYKPFVSLLNPRQSEPR KKPAPKRKRNSNSKQ" gene complement(3385..3999) /locus_tag="DP116_26220" CDS complement(3385..3999) /locus_tag="DP116_26220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314066.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26220" /translation="MRTNYDPDYNKDYTSYEQEYGYLVSARRRRTWKENGTIAGLMAV GTGLMIVDLSIHNMVMASLAFATAIVALMPKEANSLLSNFEKRQHKYGINIYTIMFCL VGAVFLLDMSVAPASAQFMNSAEKFFTNEKYFPGIDKKITGFIFAVLRGLFLIYLAIG LIRVVQAARNDEDWQTIARTPIIIAITVVVGDILAGLVVGTGGG" gene complement(4354..4893) /locus_tag="DP116_26225" CDS complement(4354..4893) /locus_tag="DP116_26225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879184.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF192 domain-containing protein" /protein_id="PRJNA477356:DP116_26225" /translation="MEVTRPTTQRGIQAKQAQTSPKSLFFKILNVVGPIAGFIAALSP LPLYFWNHQPQYLPIGATFTSNNQTIKLEVARVAQELSKGLKFRKSIPNNHGMLFVIN KSEPISLWMKDTYIPLDMIFLKSGEIKHIVRNVPPCTTQECPKYNSVYPVNQVIELPA GSVDFLRLNNGEFLKINLK" gene complement(5011..>6422) /locus_tag="DP116_26230" CDS complement(5011..>6422) /locus_tag="DP116_26230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307485.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_26230" /translation="PKPRQNKEDYFHILNFWKEYQENGILIIENIYPWIQENSPKKTE DLLVSEWMKYSLLNLKLYNQNTGKVALLLGATADISSDISAEIPIWKQDLPDVSEIIN SLTQTSLLPPEYTDTDYLTVANAGMGLYISDIVNLLKEVRKTTDFNNSEFVAQSLLKQ KIQLLNRLYDIEFLPPPKVQLGGLELMQESFKKFKRLLTPRAKQYNLKVPKGIMLVGP PGTGKSHSAKACSQNMGIPLIMVDWGNFRSFGNQAEIKLERLLKLADKLSQIILYFDD FDKAFAGDDDLAKRLAGKLLTWMQERQSDVLVIASVNRMEWLPPELTRAGRFDYLFKV DLPNNGERYTIFKLHAARFDERFRNGGDPWDEEQWRRLLKATNRCVGAEIQTIVERAA STIFCQTVIEDTYSQSKLPPLELTIEALLEERRQINPLAIREADRVESMRNKADQQAL PSSPLDESKFAVGNINIFS" BASE COUNT 1674 a 1201 c 1442 g 2105 t ORIGIN 1 cttctatagt ttgtactagc tttaattgcg tgccgttatt gttatccagt atttccagta 61 gttgtttagc ttcttggctt aactcatctc cacgaatcat ttgtacgtaa gcctcagaaa 121 acctcaccat tccaagcatt ggattcttgg catatttgtt gaggaataga gtccgtagtt 181 cttgctcgtt tggattgttt gctactgctg caagtaagca ataagcagga taataacgac 241 agaaagtcag tttgccatta tcatcaagta accactgcga gtaaatactt tctcgcttcg 301 ggaaaaaagc ttcggtacta ttgcgtgaga taatttctga tgggtatttg aatcggttaa 361 caaacgggtc tactgctgat gtttgaattc tccctatcaa ccgtgtagta atgttcgcga 421 atattttgtc agcagattta ctttgaaata tactttctgg ctcttgcgct gataaaataa 481 cacgtatacc agcttttgca ccgttggcac agagtcttcc aatgagttcg gcaattgact 541 cgaaagaaaa gagaatcggt gcttcgtcaa ggaagaagat acttgcttta gatgataagg 601 cacgacgcaa ggcagctgca taagcgctga gtgcgaggac tgctgcatct gcttctgaag 661 aaagcgaacg cagtgcaaat acgagtagtc tcgcatcagt tctaaagcta gatgggttgg 721 caattgattg accaactcgt gaatttaacc agaaatttaa tcgtaaccta atgtggtcta 781 atgcatcaag tatttcctta ctattattag caattgaatc aagcttaata aatcccggtg 841 agcagtaatt ataaaaatct ttaagagttg gtatatctgc ccactcggtt gttcctactc 901 cattttctag ggcaaattta taccgaagtt taatatcatc atcattaaag aatgtttcaa 961 ttgcaattgt aataatactc tcgatatttg atatcattgt gggactgact cctatcgggt 1021 ttgtccctag taccattgtc aggagtgttg atttcaaaaa atccttgaaa tcagtcatgc 1081 gctccttaat aacttcaggt tcataccccc tcaaatcggg taattcaaat aaattgtttg 1141 attctttgct aatatcaaaa 
tacgcccctt ctgttcccat gaatttggta taatcggtaa 1201 atgtggaact cccatcaggc ttgggatagt ccagggcgat aacggggata ccttgcgcca 1261 gggcttgtgt taataatcct gccactagta cagatttccc agccctggta gtaccaaaca 1321 cagctaagtt tttatgttgg ttgtataagt cgagatgtac gggtgttcct ccctcttcgg 1381 caatgagttc aaacccattt ttatcacctg tggcggtgcg tactaagggt gtgattcctg 1441 gtacttcaca agagaaatat ggaaggcggc ggttaaatgg ctttgttagt aaattctccc 1501 aaataatagg tactgtttgc agccaagttt tccatgcgta ttctacttct ctctcaacta 1561 ccgctggtcg taggaagcaa ctagaaagat acctgcaagc ttcatctagt tgatgacgac 1621 tgtgacgatg gatacaaaag acaactgctg tgtggatggg gatagaacct ctaaggatag 1681 tttcttgtgc ccgtacagct tcttcaatgt tcatgtttgc cttgacatca atggaacctt 1741 tatcggcaga catggcgcta gatgtaattg actgtttagt tatgcgttgt aaaactgttt 1801 ttgctagccc ttgatttgct ttggatagct ggcagataat ttcggtgtcg gagatttttt 1861 ctttagaaag tactctccag aggtatcgta gttggtcgta ttcatctacc cacccttgag 1921 gtttctcgct aaattgcaag acaccgatat acttatcttg aagcttaacc caactacggt 1981 cgagaaaggg aactgattgt tcgttttcca gaactaggtg tttgatatga aaatcgctag 2041 tttgttcttc cctaagccct gcttcatcga ggatgagtgg gttgggaatg ggtggtactt 2101 cactacggtt gaattgattc cagataactt cccaaatatc acctgctgta agtacttttg 2161 cgcttaatcc cattgtgttg gagataattt gctcccacat ttggaatcca gaggtgaagc 2221 tattctgcag aatggtttcg attctctggt gacgaagctt gtgaatttct ccagtaaatc 2281 cttgccagta agtttgcagt tgtttaattg ttttctcgat ggggtcgttg cttttgctat 2341 tttcagctgc gagtacggta taggtacacc aaaagcggag gaacttattt ttacgtaccc 2401 ctgcttgagt gagttcgcgt gctcgcaggc gttctgacct aataagtagt gttagctgtt 2461 ggaggtcgca ttctttttcg attttcttta attcttgttg ccggaaggta tcattggtaa 2521 atgagccaat gtggattgtt aaggtttcgc gttcgggtaa ttctttgagt ccgccctcaa 2581 tattttcaaa tataggtagg atttgttcag aacttaggga tgggtgaata ccttggatat 2641 caaagcaaaa tctgatttgg atattttcct tcttcttgag aatgagcgca ccaatatttt 2701 tacgaccacc aagtgagata cttgcaatcc ctgctagatg ggtaatatct tcaaacgggg 2761 taagaccttt tactactcct aattgttcgg tttcgaggga ttgtctgcca agtttatctt 2821 tcggcttggt tgatttagtc gaagtcatgg 
cggtatgata tctgtgttaa ttcttaaaag 2881 gagaaggtta tcagttatca gttagcagtt atcagttatc agttaccagt gtcgtgtccc 2941 cttgttcact gttcactgtt cactgtttac tgtttactgt ttgaattgcg tttgcgctta 3001 ggagcgggtt ttttccttgg ttcgctttgg cgtggattaa gcagtgaaac aaagggctta 3061 tatccacgag tgatgcgggg agtaccgaca aacttaccta agaaggcttt atttgatgtg 3121 actgtccacc aagttgccca tccccaggta attgatattc ctgttgctag ccatcctgct 3181 tgaagaaatc cttgaactac gaaatacatg atcagtgcga tcgcactcca aggaacaatt 3241 tggtcggcgg gaaatggacc gaggcgcggt tggtcaccta agactcggtt gacggtacgg 3301 aattctctat ctctaagcat atttaatgag aataggagta atttgtatca ctcctagagg 3361 gttgattatg gtgctctttt gtgtttagcc gccgccagta ccgactacta atccagctag 3421 gatatcgcca actactacag tgatggcaat aataattgga gtacgggcaa tggtctgcca 3481 atcttcgtca tttctggctg cttggacgac tcggataaga ccgatagcaa gataaatcag 3541 gaacagacct cgcagtactg cgaagataaa gccagttatt tttttatcaa taccagggaa 3601 gtacttttca tttgtgaaga atttttcagc gctgttcatg aattgcgcgc tggctggtgc 3661 aactgacata tccagcagga ataccgcacc taccaagcaa aacatgattg tgtaaatatt 3721 gataccgtac ttgtgctgcc gtttctcaaa gttagaaagc aggctgtttg cttctttggg 3781 cattaatgcg acgatggcag tagcgaatgc gagtgatgcc ataaccatgt tgtggattga 3841 aagatccaca atcatcaatc cggtaccaac agccatcaaa ccggcgattg tgccattttc 3901 tttccaagtg cggcggcgtc tcgctgaaac aagatatccg tattcttgtt cgtaggatgt 3961 atagtcctta ttgtaatcgg ggtcgtagtt tgttctcatg tgagtgtaat tatcgattta 4021 tgagagctaa tttttgagag aacttgtcgg ggagatactc ttttgggcaa agttcaacgg 4081 agaatcggta ttggttaaaa ctcttgaaag ccttgtggct tatacgttta ttctataagt 4141 ggaatgaatt ttgatacatc cgattaattg tttaagtttt gatgttatct ttgcctatac 4201 agtgttgaag ttaatttaaa atctagttga attgacaaaa ctaaaaatgt atctcaacac 4261 tatctttaag caaagtaaaa tccctaaaaa caaggatttt tagggatttt atgcttgatt 4321 ttggctaatg tgttgtttga ttaggagttg aggctatttc agattgattt tgagaaattc 4381 tccattattt aaccttagaa aatctacact acctgcgggt aattcgatga cttggtttac 4441 agggtacact gaattgtatt taggacattc ctgagttgta cagggtggta cattcctgac 4501 gatatgttta atttctccac ttttaagaaa tatcatatct 
aggggaatgt aagtatcctt 4561 catccacaag ctaattggtt ctgatttgtt gataacaaat agcatcccgt ggttgttcgg 4621 tattgacttg cggaatttga gtcctttgga tagttcttgg gcaactctag ctacctctag 4681 tttgatagtt tggttattac tagtaaaggt tgcaccaata ggtagatact ggggttgatg 4741 attccaaaag tagagtggta aaggtgagag ggcagcaatg aaaccggcaa ttggtccaac 4801 gacattgaga attttgaaga ataagctttt tggtgaagtt tgtgcttgct tagcttgaat 4861 tcccctttga gtggttggtc ttgtgacttc cataatttta gatgaataaa atggtaaagt 4921 ggttacaaat tgcaaacatt atgaactgtt taaatctttt tggtatgctt gctttagatg 4981 atgaaacttg aaacttattc ctgacgtttc ctagctgaaa atattaatgt ttccaacagc 5041 aaacttactt tcatcaagag ggctactagg taaagcttgt tggtctgctt tattacgcat 5101 tgactcgact ctatctgctt cccggatagc aagcgggtta atttggcggc gttcttcgag 5161 tagtgcttca attgttaatt ctagaggcgg gagtttgctt tgtgagtaag tatcttctat 5221 tactgtctga cagaatatgg tgctagcagc acgttctaca attgtctgaa tttcagcacc 5281 cacacaccta ttggtggctt taagaagtct gcgccactgt tcttcgtccc aaggatcgcc 5341 accattgcgg aagcgttcgt caaatcgggc agcgtgcagt ttgaagatag tatatctttc 5401 cccgttgttg ggtaggtcta ccttgaagag gtagtcaaac cgtccggcgc gggtgagttc 5461 gggtggtagc cactccattc ggttgactga agcaataact aatacatcag actgtctttc 5521 ttgcatccag gtcaaaagct taccagctag gcgtttcgct aggtcatcat ctccggcaaa 5581 agctttatca aaatcatcaa aataaagtat tatttgtgac aatttatcag cgagttttag 5641 caggcgttca agcttaattt cagcttggtt cccgaaactg cggaagttcc cccaatctac 5701 catgatgagg ggaataccca tgttttggga acaggctttg gcggagtgag atttacctgt 5761 tcctggtggt cccactaaca tgataccttt gggcactttg agattatatt gtttggcgcg 5821 aggggtgagg aggcgcttga attttttaaa tgactcttgc atgagttcta acccacctag 5881 ttggactttg ggtggaggta gaaactcaat atcgtagagg cggttgagca gttggatttt 5941 ttgtttgagc agagattgtg ctacgaattc ggagttattg aaatcggtag tttttctaac 6001 ctccttgagt aggtttacaa tatcagatat gtataacccc atcccagcgt tcgctactgt 6061 gagatagtct gtgtctgtat attctggggg taggaggctg gtttgagtga gagagttgat 6121 gatttcgcta acatcaggta aatcttgttt ccagatagga atctcggcgg agatatcgct 6181 actaatatca gcagttgctc ctaggagtag ggcaactttc ccagtatttt 
ggttataaag 6241 tttcaggttg agtagtgaat atttcatcca ctctgacacc agtaaatctt cagttttttt 6301 tgggctattt tcctgaatcc agggatatat attctcgata attagtatcc cattttcttg 6361 atattctttc caaaaattga ggatatgaaa ataatcttct ttattttgtc ttggtttagg 6421 aa // LOCUS NODE_4726_length_6403_cov_5.465186 6403 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6403) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6403) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6403 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(407..784) /gene="folB" /locus_tag="DP116_26235" CDS complement(407..784) /gene="folB" /locus_tag="DP116_26235" /EC_number="4.1.2.25" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318876.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="dihydroneopterin aldolase" /protein_id="PRJNA477356:DP116_26235" /translation="MDCIHLTGIRSYGYTGYLQEEQVLGQWFEADVRLWVDLSQAAET DAIENTIDYRGTISLVQNLLKTSKFLLIERLAGAIADSILASSDRVVQVQVILNKPAA PIPDFGGTISIELTRTKHETSNS" gene 987..2789 /locus_tag="DP116_26240" CDS 987..2789 /locus_tag="DP116_26240" /EC_number="2.2.1.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651677.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-succinyl-5-enolpyruvyl-6-hydroxy-3- cyclohexene-1-carboxylic-acid synthase" /protein_id="PRJNA477356:DP116_26240" /translation="MVIDFRNINTVWSSIAAQTLKRLGLTCVVICPGSRSTPLAVAFA QQSPDIEAISILDERSAAFFALGQAKATGRPTLVVCTSGTAGANFYPAVIEAKESRVP LLLFTTDRPPELRDCHSGQTITQLKLYGNYPNWQTELAIPSVDMQMLAYLRQTMIHAW ERSQTPVPGPVHLNVPFRDPLAPIPDVEMLYALSLQLLQSEFDTEDFFSGVTTISPSL YLPSSLNLPFQQWLKTQRGIIIAGVGQPQRPEEYCRAIAQLSKTLKFPVLAEGLSPVR NYADENPYIISTYDLILRNQELAKQLAPEVVIQIGEIPTSKELRNWLASTQAQRWIID PSDQNLDPLHGRTTHLRISIEELGRCVTWGENTSSSPSSFSSPPSSSEYLQLWCAIEA KVRATVDQTLTTIEELFESKAAWLLSQTLPPRTPIFIANSMPVRDVEFFWKPTHSQVQ PFFNRGANGIDGTLSTALGIAHRQQSSVMLTGDLALLHDTNGFLMRKKFVGHLTIVLI NNNGGGIFEMLSISKFEPPFEEFFATPQDIDFAQLSATYGVEHELITSWQQLLSRLNP LPSKGIRVLELQTNRKADAQWRRENLGKFAVDLA" gene complement(2894..4975) /locus_tag="DP116_26245" CDS complement(2894..4975) /locus_tag="DP116_26245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455439.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S8 and S53 subtilisin kexin sedolisin" /protein_id="PRJNA477356:DP116_26245" /translation="MSTTFSESSLETAKNLNTGVNPQIFSDTLNSGHSNFYSFNLKGR SSFNLDFEDLSANAHVDLIQDLDGNGVVDDGEVINSSVFSGAKPESINQTLDAGLYYI QISIDEALDTDYKLAVSATPIDYAGDFLENARQITLHSQAKNYSDWVSISDTNDYYKF TLKTTSDFKLGLSGLSEDAQVQLLDGNGNTSYHYVGITNESINRTLDPGTYYVRVNSY DNSETFYKLSLSATPVSTSETIPTPSDGGILTTISNGAQQLIASVITSVFPNSNTQYF KGTLRADNFTYQSTYNRTIYSGNGNVDYGSGGRDLLDLSAFSSTQATIKLADSTGGVM YNPGNGTRLFDTITLSNGKEILFEGIEAIKFADKTINLSVTPNDPLFGQQWNLHMMGV QSAWRFTTGSNNVLIGIEDTGLAGTTANLHPDFRALYTLPNNYLDEMSKFIAHGTEVQ GVIAAASNNGVGMSGINWNSDVFHIDVMGGDAGDYDLVTATQALIDQAKSKSQRLAVN LSLTGGSSPQFEQLIANNLNTALFVVAAGNGNSNSLESPADLAKRYANVVAVGASWGV KDWNNNPTTPGTRISYQGGWGSNKGDGLTLMAPSEYLTTNAIKSSNGFTFDYDQRFNG TSAGVPNVTGVASLVWSLNSNLTATQIKTILSETAYDLGAPGYDTEYGYGFVNADAAA RRAMALARGAA" gene 5526..6320 /locus_tag="DP116_26250" CDS 5526..6320 /locus_tag="DP116_26250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318752.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR00297 family protein" /protein_id="PRJNA477356:DP116_26250" /translation="MFDSINSLNPWLVGVGLNFVLLLVAWIVPKKLLTPAGFFHAWLL GVLIWATLGWEGYVVVMFYFLVGSLITRVGMAQKEAQGIAEKRSGARGPENVWGSALT AALCALGVWATSSGILPTPQNLHPTPATLLLLGYVASFSTKLSDTCASEIGKAYGKST FLITTLQPVPRGTEGAVSLEGTLAGAIASLAQAFLGWTVGLIDLVGVFWCVVAAFIAT NLESVIGATLQSRFSWLTNEVVNIFNTLIGASTAILLAWIWANFIT" BASE COUNT 1851 a 1357 c 1290 g 1905 t ORIGIN 1 gagacaggaa gatagaacag ttttttgaaa tctttactgc aaacaagtat aaaaatacac 61 tccaaaattt aaaatgacac atgttttgta tatatttagg caagctaata actgaaattt 121 aatactatgt atagcagcta tatctgaaag gaactgccaa agtgccaaga tataataaag 181 tatgagaaaa gaataattca accagtcctc aactgattgc tacaaatact gcgttttgct 241 gctagtctag ttaagagtcc aagaagcaaa actctcaagg gcaactcttg gaatccctca 301 gcataccgcg aggttgatga gggaagatgt caaggaaatt aacaagattt acttccttgc 361 tggcaaatac agaagattaa aatggagtct tgacatacag tctcagttat gaattggatg 421 tctcgtgttt agttctagtt aattcaatgc tgattgtgcc gccaaagtct ggaatgggtg 481 cagcaggttt atttaaaata acttgaactt gtaccacgcg atcgcttgat gccaaaatag 541 aatctgcaat agcgcctgct aggcgttcta tcagaagaaa ctttgatgtc ttcagcagat 601 tttgcaccaa actgatagta ccacggtaat cgatagtatt ttctatcgcg tcagtttcag 661 cagcttgtga gagatctacc cataatctga catctgcttc aaaccattgt cctaacactt 721 gttcctcttg caggtaccca gtgtagccat atgagcgaat tcccgtcaaa tgaatgcagt 781 ccataaaaaa tagtcctaag ttagtacttt cagtatttaa cagcccattg gctttcctga 841 ttaatctacg ataggaaggt tggttgaaat ataccagaaa attaggatga tcagataact 901 caccagacaa tcagtttaaa atgacacgct cttattggca tatctgtttg aactgaaaac 961 tcaggacttt tggactttca aggttaatgg taattgattt tagaaacatt aatacagtat 1021 ggtcgtcgat tgctgcccag acattaaaac gcctaggatt gacttgtgtt gtgatttgtc 1081 cgggttcccg ttccacacct ctagcagtcg cctttgccca acaatcacct gatattgagg 1141 caatttccat tcttgatgaa cgttccgcag ccttttttgc cttaggtcaa gcaaaagcaa 1201 ccggacgccc cacactcgtt gtttgtacct ctggaacagc aggagcgaat ttttacccag 1261 cggtgattga ggcaaaagaa agtcgtgtac ctctgttgct gttcaccaca gatagaccgc 1321 ctgagttacg 
agattgccat tctggacaaa ccataaccca gttaaaatta tacggcaatt 1381 atccaaactg gcagacagag ttagctattc cctctgtgga tatgcaaatg cttgcttatc 1441 tgcggcaaac tatgattcat gcatgggaac gttcccaaac tcccgtaccg ggaccagtgc 1501 atctcaatgt tccctttcgc gaccctcttg cacccatccc cgatgtagag atgttgtatg 1561 cactttcatt acaattgtta cagtccgaat ttgacacaga agactttttc tctggggtaa 1621 cgaccatctc tccatctctc tatctcccca gttctttaaa tcttcctttc caacagtggt 1681 taaaaactca acggggaatt atcatcgctg gtgttggtca accgcaaaga ccagaggagt 1741 attgtcgtgc gatcgcccaa ctctccaaaa ccttaaaatt tccagttttg gcagaaggac 1801 tctccccagt cagaaactat gctgacgaaa atccttatat tatttccacc tatgacttga 1861 ttttacggaa tcaggaatta gcaaaacagc tagcaccaga agtcgtgatt caaattggtg 1921 aaatacccac cagtaaagaa ttacgtaatt ggttagcttc tacccaagca caacgctgga 1981 tcattgaccc tagcgaccaa aacctcgacc ctctgcacgg gagaacaact catctacgca 2041 taagtataga agaattgggc agatgcgtaa cttggggaga aaatacttcc tcatccccgt 2101 catctttttc atctccccca tcttcttctg agtatctcca attgtggtgt gcaatagaag 2161 caaaagtcag agcaactgtt gaccaaactt tgacaacaat agaggagtta tttgaaagca 2221 aagcagcttg gttactttct cagactttac caccaagaac acctattttt attgctaaca 2281 gtatgccggt gcgggatgtt gaattttttt ggaaaccgac tcactcacaa gtgcaaccct 2341 ttttcaaccg aggtgcgaat ggtattgatg gcacattatc cacagcttta ggaattgccc 2401 atcgccaaca aagcagtgtc atgttgacag gtgatttagc cttattgcac gatacaaatg 2461 gttttttaat gagaaagaaa tttgtcggac atctaacaat tgtgttaatt aacaacaatg 2521 gtggtggaat ttttgaaatg ttatctattt ccaaatttga gccaccattt gaagaatttt 2581 ttgccactcc ccaagatatt gattttgctc aactatctgc tacctatggt gttgagcatg 2641 aattgataac ttcttggcaa cagttactct caagattaaa cccactacca agcaagggaa 2701 ttcgggtttt agaattgcag acaaatcgta aagcagatgc tcaatggcgt agggagaatt 2761 tagggaaatt tgcagtagat ttagcttaga ttttgaccat attttcttct gacacggagg 2821 gttgggcaaa ccctacaccc ctacaccctt acacccctat tcccctacac ccctattccc 2881 ctagttccgg tcattacgct gcacctcgtg caagagccat tgctcttcta gcagcagcat 2941 cagcattcac aaacccgtaa ccgtactctg tatcataacc tggtgcaccc aagtcgtaag 3001 ctgtttctga cagaatcgtc 
ttgatttgag tagcagtgag atttgagttc agactccaca 3061 ctaacgaagc aactcccgtc acgtttggta ctcctgctga agtaccattg aaacgttgat 3121 catagtcgaa cgtaaaacca ttagaagact tgatagcatt ggttgtcagg tactcagatg 3181 gtgccatgag agtcagacca tctcctttgt tggaacccca ccctccttga taggaaatcc 3241 gtgttccagg tgttgttgga ttgttattcc agtctttgac tccccaagaa gcaccgactg 3301 cgacaacatt ggcatacctc ttcgctaagt cggcaggact ttcaagacta ttgctattac 3361 catttccggc tgcaaccaca aacaaggctg tatttaggtt attagcaatc agttgctcaa 3421 attgtggtga actaccacca gtcaagctga ggttgactgc cagacgctga cttttactct 3481 ttgcctgatc aatcaatgct tgtgtagcag taaccaaatc ataatcccca gcgtcaccac 3541 ccataacatc gatatggaaa acatcagaat tccagttgat accactcatt cccacgccat 3601 tattgctagc cgcagcaatc actccttgaa cttcagtacc gtgggcaatg aacttactca 3661 tttcatccaa gtaattgttc ggcaatgtat agagggcacg aaaatcgggg tgaagattag 3721 cagtagtgcc agccaacccc gtatcttcaa ttccaataag tacattgttg gaaccagtcg 3781 tgaaacgcca cgcactttga acacccatca tatgtaggtt ccattgttgc ccaaacaggg 3841 ggtcgttagg tgtcactgat aagttgattg ttttgtctgc aaatttaatc gcttcaatac 3901 cctcaaacaa aatttctttg ccgttgctta atgttatggt atcaaatagg cgagtaccat 3961 tcccagggtt atacatgaca cctcctgtgg agtcggctaa cttgattgtg gcttgtgtgg 4021 aagaaaatgc agataaatcc aataagtctc gtcctccgct accatagtca acgttgccat 4081 tgccagaata tattgtccga ttgtatgtag actgataagt aaagttatct gcccgaagag 4141 ttcccttaaa atattgcgtg ttgctattag ggaatacaga tgtaataaca gaggcaatca 4201 gctgttgtgc gccgtttgat atggtggtta atatgcctcc atcactggga gtcggtattg 4261 tttcactagt ggatacagga gttgctgaca aacttagctt gtagaaagtt tcactattat 4321 cgtacgaatt tacgcgtaca tagtatgttc ctgggtctaa agtacgattt attgattcat 4381 ttgtgatgcc aacgtaatgg tagcttgtat taccattgcc atctagcaat tgcacctgtg 4441 catcctcact caagccacta agtcctaact tgaaatcact cgtagtcttc agtgtaaatt 4501 tgtagtaatc atttgtatct gatatactta cccaatcact gtagtttttt gcttgggaat 4561 gaagagtaat ttgacgtgca ttttctaaaa aatctccagc atagtcaatg ggcgttgctg 4621 aaacagccaa tttgtaatca gtatctaaag cttcatcaat agagatttgg atataataca 4681 aacctgcatc taaagtttga ttaatcgatt 
caggcttcgc gccactgaaa acagagctat 4741 ttatgacttc accgtcatct accacaccat tgccatctag atcttgaatc aaatctacgt 4801 gtgcattggc gctcaaatct tcaaaatcaa ggttaaaact actacgacct ttcagattaa 4861 agctgtagaa attgctatga ccggaattta atgtatcgct aaatatttga ggattaacgc 4921 ctgtatttaa gtttttagca gtctctagag agctttcaga aaaagtcgtt gacataggtt 4981 aacgagtgta ccatttgtat atattacatc atttataata tcgtattact ttgaaggcaa 5041 aatttatgat tgtgtagaat tttactgaaa aaatagaaaa atattactgc ttcttcatat 5101 aattgataca ttctaagagt gttttctctg aaaaattaac aaaatttgtc tttggaacag 5161 ttgtaatata aaacacgtag tctcgacggg agagaacgag gaaaaaggca aggggctgtt 5221 cctccctttg aagaagatag agaatagaag acttcaggta gtaaaaagat ttttgtcaat 5281 ccgcgttcct caaaattcac catctggaga aaaaatttat tcctgagtct tccctccctg 5341 cattgccgat aagctatggt cttttggaaa attcccagag aactgagata cgtaatgaaa 5401 gaagtacaaa aaaatcactg tctcttctgt gactaaggag ttgtaggtag caatcaaact 5461 tgttctaaat tagttattct attggtactt tgacaaaata ctcctaagaa atatgagttg 5521 gtacaatgtt tgattcaata aactctctaa atccttggct agttggagtc ggcttgaact 5581 ttgtcctact actggtagct tggattgttc ccaaaaagct gctgactcca gctggattct 5641 ttcatgcttg gctacttggt gtcctgattt gggcaacact agggtgggaa ggatatgtag 5701 ttgtcatgtt ctactttctt gttggttctt tgattacgcg cgtaggtatg gcgcagaagg 5761 aagcacaagg tattgcagaa aagcgttctg gtgctagagg tccagaaaat gtctggggtt 5821 ctgctttaac tgcggctttg tgtgcgttag gagtatgggc gacaagttcg ggaattcttc 5881 ctacacctca aaacttacat cccacaccag ccactctatt gttactgggt tatgtggcga 5941 gttttagtac aaagctttct gacacctgtg caagcgaaat aggtaaggct tatggtaaaa 6001 gcacttttct cattactacc ctacaaccag ttccacgggg aacagaaggc gcggttagct 6061 tggagggaac tttggctggt gcgatcgcgt cactcgctca agcattcctc ggctggacag 6121 tgggtttaat agatttggta ggtgtctttt ggtgtgttgt ggcggcgttt attgccacta 6181 atttagaaag tgtcattggt gcaacactgc aatctcgctt tagctggctt actaatgaag 6241 tggtgaatat tttcaacaca ttgataggcg caagtacagc aattttgctc gcctggatat 6301 gggcgaattt cattacctaa tttttttgag aaaatcttga gtactttcta gcaatctcgt 6361 ggttgaattg tcacgctaat tcacacggga gtatggaagc 
ccc // LOCUS NODE_4735_length_6375_cov_5.217247 6375 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6375) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6375) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6375 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 242..847 /locus_tag="DP116_26255" CDS 242..847 /locus_tag="DP116_26255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016869150.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="PRJNA477356:DP116_26255" /translation="MGRPTKEKSLTQQDVIEAAIACLEKEGESALGVNRVARELGIKP PAIYKHLDGNAGLRRAVVLAIWRKYLTYSQEQMAGLSEPNALLRAGGHATRNFARSHP ALYKVMMQFQLQPTEPDAAALIQESLGLLKKSLQLYDLNDSQLIDVMRMVNAAIYGFI SLEQAGLMTLEHSTDTSYEVMLNALIVAIEHIRRDSSDLPP" gene complement(1165..2265) /locus_tag="DP116_26260" /pseudo CDS complement(1165..2265) /locus_tag="DP116_26260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741404.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="penicillin-binding protein 1C" gene 2478..3492 /locus_tag="DP116_26265" /pseudo CDS 2478..3492 /locus_tag="DP116_26265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_042152754.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(3597..4130) /locus_tag="DP116_26270" /pseudo CDS complement(3597..4130) /locus_tag="DP116_26270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872931.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="penicillin-binding protein 1C" gene 4824..5201 /locus_tag="DP116_26275" /pseudo CDS 4824..5201 /locus_tag="DP116_26275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139497.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="ISAs1 family transposase" gene complement(5443..6105) /locus_tag="DP116_26280" CDS complement(5443..6105) /locus_tag="DP116_26280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006515454.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ankyrin repeat domain-containing protein" /protein_id="PRJNA477356:DP116_26280" /translation="MSSLKDELIRAVVRGDATAVADLLAQNANVNTTGGVTPVGESNT LLMWAAAEGYADVVKILLSYGADVNIKNDANYTALMYAAEGGYLESVNALLDYGADIH PRNHYGETVLMSVARFGLTDLILRLIDLGADIHATNKIGDTALYLAVDNGQFYTVKAL ISRRASVNTQNIGGWTPLMMASARGDLEVMELLLEHGADFRPRNRWGATALSEASRAN AS" BASE COUNT 1866 a 1368 c 1378 g 1763 t ORIGIN 1 actaaagaag tacaagtaag tttaagaaac ttttttgcta tttaatttag ctctaggggg 61 aagtcatgac agtgcagaac ttgataaagt taattatatt cacttttagt taatgaagtt 121 aactaaaaga atttatacca tttcacaaaa atcctcatac aagttactcc ccctgctcct 181 tccttcctcc cctgctccct ctgcttgcct aaatgtatca accttaaagt gaaagaatat 241 tatgggtcgt cccacgaaag aaaaatctct gacacaacag gatgtgattg aagctgcgat 301 cgcctgctta gaaaaagagg gagaatctgc tcttggggtg aatcgagtcg cgcgggaatt 361 gggaatcaaa ccacccgcga tttacaagca tctcgatgga aacgcaggac tccggcgagc 421 agtggtattg gcaatttggc gaaaatattt aacgtacagc caagaacaaa tggctggatt 481 gagtgaaccg aatgctttgt taagagcagg tggacatgca actcgtaact ttgcgcgatc 541 gcatccagca ctttacaagg tgatgatgca atttcaatta cagccaaccg aaccggatgc 601 tgctgcactc atacaagaat ctcttggctt actcaaaaag tcactacaac tgtatgattt 661 aaatgacagc caattgattg atgtcatgcg gatggtgaac gctgcgattt acggtttcat 721 tagcttagaa caagcaggat taatgacgct agagcattcg acggatacca gttatgaagt 781 tatgctcaat gctctaattg ttgcgatcga gcatatcaga cgtgacagct cagatcttcc 841 accttgactt tagtacgcct tgaacttaag ttcaaggctc atagccaaag tccgttaaaa 901 cggactggct aataccaagt tgcgttctgt tgtagtagcg ctttacggga gattgcggca 961 gattgcggga agccccctcc ggggtctaca ttccactcgc tgcggacaga ttcgctcttt 1021 gagtgcaact tggtataagt cctttcagtc cgttttaacg 
gacttgaact ttgagccaag 1081 aaatttattt cttggcagac gagaattatg gtgcaagatc tcagacagca cttaatctgc 1141 aaggttgtct gggcgcttat gcttttagtg agaataacca acagaaaaac cccgacgtgt 1201 gggtttcacg ctagctaact ccacttgaaa acttacctta ccgctcattt caccactttt 1261 ggcttccaat gtccatctac cagggcgcaa agtccaaaac acagaatcag atgaatttgt 1321 cgctaacttc tgaccattta accaccactc tacaggcgct gtgggtgttc ctgcaagctt 1381 aaactccaat ctttgctttg ttttctcatc aggatacaac aagaagacat cattctgatg 1441 aggagacaca atctttagtt tgtcagaact catactagat tggttctgcc ttgctaacca 1501 ctcatcatac tcactagact tagtgaaatt gttttgacgc tcgtagtctc caacgtcttg 1561 tgcgtaaaaa tattcttgta ccacagagga acaatctggt gttggtcgta acccagaaat 1621 tgcacaaata ggtagttgta ccataccttt aggaggagga aaacttgctg gttcttgatg 1681 ctcatgcagg tgtagcatga ttcgattcca tagaggtgct gcgcccatga ctccggaaac 1741 ttgacgcatg ggatcgccat cgaaattacc tacccatgtg gcgacggtgt aatcagtcgt 1801 aaatccaact gtccaagtat cgcgaaaatt tgaggaagtc cctgttttga cagcagcaga 1861 aaatggtaaa ctgaggattg agtctacacc aaagactgtt gcacgggcgt ggcgatcgct 1921 caacatatcg gtcatcagtt gccatgttac accagaagtt gggagataag gagatttgtt 1981 agtctgtgtc gtgctgtcta acctagtcac aatataagta ggttgtccct gtcgtgccat 2041 tgtgacataa gcttttgcaa gttcccataa actgacttcg ccactaccga gagtcaatcc 2101 caaaccatag tattctgcgc tttgagtcag atgttcaaac cccaattggt ggagacgatc 2161 caaaaaagtc tgcactccca tcttttccaa gactcgtaca gctggaacat tgagcgaatt 2221 tgctaaagcc aaacgcaccc gcactggtct ttgaaagctt ttactatcag gacaggactt 2281 acgcaactgg cacagcgcga tacgccttta ggcgtctgcc cttcgggcaa tcgtatttct 2341 ctagtaattg tgtactacaa aaaactggcg tacccagagg gcatggctga aattagtaac 2401 actaatcggt tgtgttaagt attcctcaaa tcatgagaaa attagttgag gatgcataaa 2461 ataaatcctg taggttgatg agatttacta aacttaacta ttgccagtat ttgttaagta 2521 atcagattaa ttatactctc acaaatttgg cggagcattt agagcagatc agtcatgata 2581 aaatcaaccg ttatttaaag aacgagaaat taacacctcg tcttctttgg gataatgtaa 2641 aagatttaat agaaagaaat gagaaagcat acttggtttt tgatgacaca gtaattgata 2701 aacgatacgc tattgaagta gagccaagta aacgccaata tagtggcaac 
gagcatggtg 2761 taattcaagg tattgggcga gtaaattgtg tatatgttaa tcatgaaatc ggaaagtttt 2821 gggtagttga ctatcggatt tatgacccag acagagatgg aaaaacaaag atagaacatg 2881 taacagagat gctacaaaac cttgtgtacc ataaaaactt actatttcaa tccgtcttga 2941 tggacacgtg gtatgcgaca aataaattga tgttatatat tgatgggtta gggaaatatt 3001 attattgccc tctgaaacgt aatcgacttg ttgatgatac ggaagagcaa gaagactata 3061 aaagaattga attgttatct tgggatgaaa aacacttaaa attaggaaaa acagttaaaa 3121 taaaaaagtt tcctggcgcg aaaaaagtga aactattccg ggtaactgtc tccaccgaca 3181 gaacggattt tatcgctaca aacgatttat ctcaagattc tacggacgtt gtacaacaag 3241 tgtgtaaggt tcgatggaag gttgaggagt aacacccctg aattaaaaca attgactggc 3301 gttgaatcat gtcaatgccg taagggtcgt attcaaagaa accatattgc ctgcgctatt 3361 ttagtctggc ttcggctcaa agatttagct tataaaactg gtcaaacaat ctatcaaatt 3421 aagcacggat tgttgtcaaa ttatttagtt cagcaactaa aacgtccgga tgttcccatg 3481 tttatcgtct agttgtttgg cgtgggcgcg agagtgcaag cgccattgct tgagaaaatg 3541 gcgataagcc ggaggcttga cgctttgcgt atcgcattgt gccagttgcg taagtcctga 3601 ctataatcag taggactgta gagttttgcg ccaggtatcg catagtgagc agggacatct 3661 gccaaaattg tattaggacg aatcaagttt ttttccaaag ccaattcata tagaaacggc 3721 ttgagagttg aacctggttg acgcaaggct tgtacaccat catttcttcc ctgcttggct 3781 tcattaaaat agtcaggcga accgacataa gccaaaactt ctccactgtg gttatcaatc 3841 accaaagcag ctgcgtcatg gacattgtga acagcaaggg tggaaataac ctgttgtacc 3901 tgggcttcaa caaactgctg caagagacga tctatagtgg tgcgaatggg ggatgatttt 3961 tctagcctct caggctgact cgccagccaa aacaaaaagt gtggtgcagc aagaattcct 4021 cgttggcgag actgaaacga aatttcctca gcgtatgctc gctctgattg ggggcgagtg 4081 atatacccat cttgtatagc agtcgccaca attgttagga catttctctg ttccccgttc 4141 cccattccct gttcccttga gatcaaaata tgatgtccta accaatatgt ccgttgctat 4201 acccaataga tgtcatgatt aattttttcc aaactcaaca agttctttgt ctttatagtg 4261 attaacgtaa aatcacctga ctcgttacta acggtaacga tgagatacag tgtctttaac 4321 cctaccgacc gtataaattc ataacttaag acatctgcta atacaaaacc tcagagagag 4381 gatatttgaa gagtaaaaaa gtttttatat aagtaaggtt tgtcgtctga gttcactccg 
4441 ttgcacctgc ctgctgtaat atttgaactg cactgacgga acgaaacgat cgccttgctt 4501 caccagggca aacgcatcta aattttgaca aagtggtaag tagaggtgcg gtctagatag 4561 tacagatgaa ctgtatgaag agttcggtga gcgatcgccc gtatcacacc ctgcgggtgt 4621 gtcaaagaca cagggcaatc caactccgtt ggattggggc gtaggggcag gtacctcttg 4681 cagctatgca gcgcgggacg ggattgccca aagggcagtg ccaaaggcaa tcacgagttg 4741 gtgtattctc attctcaaag tagaatgaag ccagagcctg tgaaactcaa accgaaaatc 4801 acgattgcgg atcactttaa ggcaggcttg acccagattc ggtttggtca aaattaaata 4861 gtgttggtat ggtagaatct atccggcaag tggatggtga aacaacggtc gaaactcgtt 4921 attttatcag tagtctgtcc caaaatgctc aaaccttcgc caattccgtt cgtagtcatt 4981 ggggaattga aaattcatta cattggatac tggatgtaac cttaaaagaa gatgactgtc 5041 ggattagaaa agataatgca ccagagaatt ttgcttcttt tagccacgta gcagttaatc 5101 tgttaggtca agaaaagcgg gtaaaattag gtatgaaaaa taagcaattt ctagctgcaa 5161 tggataacga atacttggcg agggttctat ccctggctta aagcttatta ttttcatata 5221 atattgagct aaaagtacat ttcagatttt gagtcttgtt agcctagatt caattaaaac 5281 tacacccttt tttattttta tttttaactc aaaatcatgt cctattgtgc taatctaact 5341 attgtttatt gagctttaat cctgaattta gcttttttat tttatttgga aaattcaact 5401 tattctaaat atttcaatct agtttcccga tattagtaac aattaagatg cgttcgccct 5461 gcttgcttca cttaatgccg ttgctcccca acggtttctg ggacgaaagt ccgcgccatg 5521 ctcaagcagg agttccatca cctccaaatc tcctcgtgct gatgccatca ttaagggagt 5581 ccagccgcca atgttctgcg tattcacgga ggctcgccgt gaaatcagtg ccttaacagt 5641 ataaaattgc ccattatcaa ctgccaagta cagcgctgta tcaccaatct tgttggttgc 5701 atgtatatct gcccccaagt caatcagacg caggataaga tctgttagac caaatcgagc 5761 aacagacatc agcacagtct caccgtagtg gtttctcgga tggatatcag ctccataatc 5821 cagcagcgca ttaacgcttt ctaaataccc accctctgcg gcgtacatca gggcagtata 5881 gttagcatca tttttgatat tcacatctgc accataactc agtaggattt taaccacatc 5941 cgcataaccc tctgccgctg cccacatgag taaagtattg ctttctccaa ctggggtaac 6001 tcccccggtt gtgttgacgt ttgcgttttg agctaacaga tctgccaccg ctgtagcatc 6061 tcctcgaaca actgctcgga tcaattcatc cttcagtgaa gacatagttt tctaccccct 6121 
acactgacta tagaacaaac cacatagaca agagagcggt gagggggtcg cagtcttgga 6181 ggccacttcc tacaagtcgg cacagccgga cgccctatgc ctggggcacg gctgcgcaag 6241 agcgggtatg cctacggcac gcctgaccca agagcgccct tacgggtatg cctgccaacg 6301 cgaatgacgg ctaagggact tcctacccta cgggaagcac ctggcgtgag accccaagac 6361 gagccactgc tgcag // LOCUS NODE_4747_length_6350_cov_5.266561 6350 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6350) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6350) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6350 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(198..929) /locus_tag="DP116_26285" CDS complement(198..929) /locus_tag="DP116_26285" /EC_number="2.1.2.2" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216481.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoribosylglycinamide formyltransferase" /protein_id="PRJNA477356:DP116_26285" /translation="MSFSPDSTQQSPALQHSPAALTQPDAVFSNLVSPNISADTLRTQ TPIKLGILASGSGSNFEAVAQAIKDGQLSAQIQVLIYNNPDAYAAVRAAKWGVPAVLL NHREYKSREDLDRQIVQTLQEYDVELVVMAGWMRLVTSVFIDAFPDKIINIHPSLLPS FKGSRAVEQALAEGVKIAGCTVHLVCLEMDSGSILMQAAVPVLPDDTPETLHARIQVQ EHRILPLAISFAGSLLEHRPSSKPI" gene 1170..1826 /locus_tag="DP116_26290" CDS 1170..1826 /locus_tag="DP116_26290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407965.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_26290" /translation="MKNFAEKHEFILNQITDQCFQQRDLSGCDLSGIDLRGVDLSGIN LIGADLRSANLSHAILTGANLSGANLMQANLQEAYLYEVSLCEANLSYADLSHANLCG AFLWRVKLCGSKLWAASLCDADLSEADLTEANLIEASLIQANLVRANLTAAKLCGASL LEANLNQANLTAADLTWANLMEANLNEANLWEANLSEAKLQGAIMPDGTIHQPQIFFF " gene complement(1965..2993) /locus_tag="DP116_26295" CDS complement(1965..2993) /locus_tag="DP116_26295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314384.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26295" /translation="MQIIRDAFGLFGIVESVYERIKKILIPPRAYSWQTLIYLSIFSW LMSSLSLGFVKDLIAFCGWLFLIFGTAWYTTDNPLYVPGTNLPVGALITGFLVSVFAF GYGENVLTSRTIVLWPTISAIITAIPEFFEGSGIDAKTQIPKIEDRQKMIILVGSCMV ISCWLQFYFVTENWLKEYPSLIVDNYQGSAFVTKTVSTEKIPKNGVLILDKLTPKVEE QIRDKPWSEAEQWLLNAGQNVGNLGKQVIQRELAEYDERFLWRVEPRVSNAKNGYKLD LLSIWGGPSSNPRGYYLVKSCRVDPVLTSGSTSSQTVNKTENRNTIAEIDCDTKSSFF AGSPPPQQ" gene complement(3059..3601) /locus_tag="DP116_26300" CDS complement(3059..3601) /locus_tag="DP116_26300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314383.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="filament integrity protein fraC" /protein_id="PRJNA477356:DP116_26300" /translation="MFDIPELPTIFPIGAILFNFLFLLVAIPIEAYVLNTRLKFDKKT SAFYAISINVFSNVIGWVIFFFVEPVLSPNLKSELMSFVFFNRLQTPGIQTLLILTAF IIFFSTFLVKYALLRVLLISLSDFKKAPPEPQVIQRRNSRLASKSKWQNTNIVTTILI ANALSYSLVAIILFIRSVNT" gene 4185..5321 /locus_tag="DP116_26305" CDS 4185..5321 /locus_tag="DP116_26305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653427.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP--corrinoid adenosyltransferase" /protein_id="PRJNA477356:DP116_26305" /translation="MTRNSIGIRTAQLRPSRLTGQIHVYDGAGKGKSQAALGVVLRSI GLGINTPNNSNRVLLLQFLKGPERDYDEDGAIAALQRGFPHLIDQVRTGRAEYFGPEE ITPYDRAEAARGWDVAKGAIASGLYSVVVLDEINPVLDLGLLPVDEVVRTLKSKPQEL EIITTGRAAPQQLLDIADLHSEMKPHLHPTAKAQRIQGIEIYTGAGKGKSTSALGKAL KAIGRGINHPGSTRVLIMQWLKGGTGYTEDAAIAALQQSYPEVVDHQRCGRDAIVWRN SRQELDYVEAERGWEIAKTAIASGLYKTIILDELNPTVDLELLPIEPIVQALLRKPRD TEIIITGRCQNQPPYFDLASVHSEVYCHKHYANHGVELKQGVDF" gene 5596..6144 /locus_tag="DP116_26310" CDS 5596..6144 /locus_tag="DP116_26310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome C" /protein_id="PRJNA477356:DP116_26310" /translation="MSNIVKRQSRQRKQKRKPIGLLVVILAWSLAMGWLLALATHANG ATPTSEVGTVDVVPAQYQLGQQLYIENCSTCHIAIPPAVLPTQTWKNLLEDTQHYGVQ LKPLVDPPRILVWKYLATFSRLQLQDEQTPYRVNNSRYFKALHPKVKLPNPVQIGSCV TCHPSAVDFNFRRLTPEWENSQ" BASE COUNT 1892 a 1303 c 1311 g 1844 t ORIGIN 1 gtcttaattc aaggaatttt gtcttctagt ttttccgggt aaaaaaagta aattttggtt 61 tgatgattat ttagccgaca tcgcaagatt gtagagaaat cttgcttcca gacataatta 121 aaggattttt gtctttcaaa ctggtcgtgt ggcaaagaca atgagtgcaa tgtggggtct 181 tggcaacttt ctaaccgtta gatgggcttg ctgctagggc gatgctccag cagggagcca 241 gcaaagctga tcgccaaagg caaaatccga tgttcctgaa cttgaatcct cgcgtggagt 301 gtttctggtg tatcatcagg caacacaggt actgctgctt gcatcaaaat ggaaccgcta 361 tccatttcta aacaaacaag atgcactgta caaccagcaa ttttcacccc ctcggcaagg 421 gcttgttcta ctgcacgtga acccttaaaa ctaggtaaca aactaggatg gatattgata 481 atcttgtcag gaaacgcatc aataaaaact gacgttacca gccgcatcca accagccata 541 actaccaatt ccacatcata ctcctgcaaa gtttggacaa tttgtctatc caaatcttca 601 cggcttttgt actcacggtg attaagaagc acagcaggga caccccattt tgctgcccgt 661 actgcggcat aggcgtccgg attattgtag attaaaactt gaatttgggc ggacaactgc 721 ccgtctttga ttgcttgggc aacagcctca aaattactgc 
cacttccaga agccagaatt 781 cccaatttta tgggagtttg tgtacgaagt gtgtcagcac tgatattggg agaaacgagg 841 ttagaaaaga cagcgtcggg ctgagtcaaa gctgcgggag agtgttgcaa tgcaggcgac 901 tgctgtgtag aatcagggct aaagctcata ataaatgacc atctgactgg tttgctaaaa 961 tatcaagttt ttgtgacctt tatcagagtc ttttgacata aagaaagact caaagcttgc 1021 ataataaggc ttttcctgat ttggctgaaa tttttaggaa ttttttttag attcttgact 1081 cagcttatat acttctgagg tgtaatctca aaaatgcagt aatttttggt atccgtagtt 1141 tggatatgtt acacctagag gtcaatacca tgaaaaactt tgctgaaaaa cacgaattca 1201 ttctcaatca aattacagat caatgttttc aacaaaggga cttgagtgga tgcgacttaa 1261 gtggtatcga cttgagagga gttgatttga gtggaattaa cttgatagga gcagatttgc 1321 gttctgcaaa tctgagtcat gcaatcctca ctggtgcaaa tctaagtggg gcaaatttaa 1381 tgcaagccaa tctgcaagaa gcttatttat atgaagtttc tttatgtgaa gctaatttga 1441 gttacgctga tttaagtcat gcaaatttat gtggagcttt tttatggcga gtgaagttat 1501 gtggaagtaa gctatgggca gcttctttgt gtgatgctga tttaagcgaa gccgacttaa 1561 ctgaagctaa cttaattgaa gcatcactca ttcaagctaa tttagtaaga gccaatctta 1621 cagctgcaaa gctttgtgga gcaagcttac tagaagccaa tttgaatcaa gctaacttaa 1681 ctgctgctga cttgacatgg gcaaatctca tggaagcaaa tttgaatgaa gcaaaccttt 1741 gggaggcaaa tttatcggag gcaaaactac aaggtgctat tatgcctgac ggaactattc 1801 accaacctca aatcttcttt ttttaattaa gggaaagtgg caatacggtt cggtatagtc 1861 attgttgatc agagaaaccg ggtaacttct gcgagtttat tcaagaaacc cggtaacttc 1921 tgcgagtgct tatccgaatc ctattgtcct gactcctctt gtcatcattg ctgcggtggt 1981 ggtgaacctg caaaaaatga actttttgtg tcgcaatcta tttcggcgat cgtatttctg 2041 ttttctgttt tattaacagt ttggctggag gtgcttccag atgttaacac tggatcaact 2101 cgacaggatt ttactaagta atatccacgg ggattagaac taggaccacc ccagatactt 2161 aggagatcca gtttatatcc attcttggcg ttagatacac gcggttcaac acgccataaa 2221 aatctttcgt catattctgc caactctctt tgtatgactt gcttgcctaa gttacccaca 2281 ttttgtcctg cattgagtaa ccattgttct gcttccgacc aaggtttgtc acgtatttgt 2341 tcttctactt ttggggtaag tttatctaaa attaaaacgc cgttttttgg aattttttca 2401 gttgatactg ttttagtaac aaaggcgctg ccctgataat tgtctactat 
gagactggga 2461 tattctttta accaattctc tgtgacaaaa tagaattgca gccaacaact aattaccata 2521 caactgccaa ctagaataat cattttttgg cggtcttcta tcttaggaat ttgagttttt 2581 gcatctatac cacttccttc aaaaaattct ggaattgcag taatgattgc tgaaatcgtt 2641 ggccaaagaa caattgtcct tgatgtcaaa acattttctc cataaccaaa agcaaaaaca 2701 ctgactaaaa atccagtaat cagtgcccct acaggcagat tagttcctgg aacatataac 2761 gggttatccg tcgtatacca agcagtacca aaaatcagaa ataaccaacc gcaaaatgct 2821 attaagtcct tgacaaatcc cagagataga gatgacatta accaagaaaa aatacttaaa 2881 taaattaatg tttgccaaga gtaggctctc ggcggaatta gtatcttttt tatccgttca 2941 taaacgcttt caacaattcc aaaaagacca aatgcgtccc ttattatttg catattcttc 3001 tcctttaatt aagaattaaa aatgattgta gttcatgatt tatagagctt gaacaaaatt 3061 acgtattcac agagcgaata aataagataa ttgcgactaa actatagctg agtgcattgg 3121 ctataagtat agtagtcact atatttgtat tttgccattt actcttagaa gcaagacggg 3181 aattgcgtcg ttgtataact tgaggttctg gaggagcttt tttgaaatca cttaatgata 3241 ttaacaagac tctcaaaaga gcatacttaa ctaaaaaagt gctaaagaaa atgataaaag 3301 cagttaaaat taataacgtt tgtatcccag gtgtttgtaa gcgattgaaa aaaacaaaac 3361 tcattaattc tgattttaaa ttgggtgaca agactggctc tacaaagaaa aagatgaccc 3421 aaccaatcac attagaaaag acattgatag agatcgcata aaaagcactc gtctttttgt 3481 caaattttag ccttgtattt aagacatatg cttcgatagg aattgccact agtaaaaata 3541 aaaaattaaa caaaattgca ccaatgggaa aaattgtagg aagttcagga atgtcaaaca 3601 taaacaagta atcgctaaag tgtaacttaa gagagtatag ccgcaacact gagcatgatg 3661 ggggcaacta agccattcaa aagatgccct acaggtaccc taagcacaat ttgccgcttt 3721 tgtgcaggag gcacaaaatt cgctcgttca atccgcaaga aaacaacctg ctagcaagta 3781 gtatagcccc tagctgtatc agtttttctg tgtgacagtt tatcttattt cttctagttt 3841 tttagcatct ttttgtttat atatcctctt gttgcttttc ttcatatatt tcaatacctt 3901 atagggtgtc agatgtgaag agtccttaca cccttagact gatcttgtag aaaacgggca 3961 accttgttta ctcgttttaa taagatatgg gcggtttcgt acgatgaagc gacaattcca 4021 gcgtctagaa ccgcaatcat gcctaggttg cccgttttct ataattttca tcaaacccca 4081 taacgctagt tgcttgaacc gaactctccg gcttggcttg cctgcatgaa atctgataaa 
4141 accttgggta agatcaaaga ctcccacgtt aaagctttca aaagatgaca aggaacagca 4201 ttggtattcg tacggcgcaa ttgcgcccct cacggcttac tggtcaaatt cacgtttatg 4261 atggtgcagg taaaggcaag tcacaagcag ctttaggagt cgtattgcgt tctatagggt 4321 tggggataaa tactcccaac aactctaacc gtgttttgct gttgcagttt ttaaaaggac 4381 cagaaaggga ttacgatgag gatggggcta tagcagcctt gcaacgaggt tttccccatc 4441 tgattgacca agttcgcact ggtagagcag aatactttgg accagaagaa attacgcctt 4501 atgacagggc tgaagcggcg agaggttggg atgtggctaa aggtgcgatc gcaagcggtt 4561 tatattcagt tgtcgtcttg gatgaaatta accccgtcct agatttaggt ttgcttccag 4621 tagatgaagt ggtacgcaca ctaaaatcca aaccacagga attagaaatc attactactg 4681 ggcgtgcagc accgcaacaa ttacttgata ttgcggattt acactcagaa atgaaacccc 4741 atctccaccc aaccgcaaaa gcacagagaa tacaaggtat tgaaatttac acaggcgcgg 4801 gcaaagggaa atcgacgagt gccttgggca aagccttaaa agcaattggt agaggaatta 4861 atcatccagg ttctactcgg gtattaatta tgcagtggct caaaggtggc actggttata 4921 ctgaagatgc cgccatagca gctttacagc agtcatatcc agaggtggtg gatcatcaac 4981 gctgtgggcg agatgcgata gtttggcgca actcccgtca agaattggac tacgtggaag 5041 ctgaacgcgg ttgggaaatt gcgaagactg cgatcgcctc cggattgtac aagacgatta 5101 ttctcgatga gcttaatccg acagtggatt tggaactact accgattgaa ccgattgttc 5161 aagctttact tcgtaaaccc cgcgataccg aaatcattat cacaggtcgc tgccaaaatc 5221 aaccaccata cttcgactta gctagcgttc actctgaagt atactgccac aagcactatg 5281 ctaatcatgg tgtagaactc aagcaaggtg tagattttta aatgagtcat tagtcattag 5341 tcattagtca ttagtcaaaa tggcttgact attgactcat tcccaataaa caattatctg 5401 ttttgtgaat atggataaca atacactgga cttaattaca gcaggattgg cgatcgctat 5461 tatcttaggt ggcttcctga tgatgtttac tacgattctg accaccaaac gcaagtgagc 5521 taagccaagt ggacaaacat ttttcttata aaatgtgcaa gaagcgccga tatgcaaaat 5581 gtgtgagtgt gacgaatgtc aaatattgtt aagcgccaat cacgtcagcg aaaacaaaag 5641 cgaaagccca tcggtttgct tgtggtcatc ctggcttgga gtctagctat gggatggctg 5701 ttagctttgg caactcatgc caatggtgct actcctacct cagaggtagg tactgttgat 5761 gtcgtaccag cacaatatca actaggacaa caattataca tagaaaactg ctccacttgt 5821 
cacatagcca taccaccagc ggttttgcct acccaaactt ggaaaaatct tttggaagat 5881 acacaacatt acggcgtaca gttaaagcct ttggtcgatc cgccacgtat tttagtatgg 5941 aagtatctcg caactttttc ccgtcttcaa ctccaagacg agcaaacacc atatcgcgtc 6001 aataactccc ggtatttcaa agctttgcat cctaaagtca agctacccaa ccccgtgcaa 6061 attggcagct gtgtcacctg tcatcccagt gctgttgact ttaactttcg tcgtctgact 6121 ccggagtggg agaactccca gtaaactttt aactagcggt gacgacaacc tgctgcggga 6181 gggtttcccg acagccaggc gactggcgtt agccgtaagg cgtgcgcttt gcgcataccc 6241 ggagggtttc ttgcgctagc aacgcccttt aagaaacact atcccccttc tattttattc 6301 cctgttccct gttccctgtt ccctgttccc tgttccctgt tccctgttcc // LOCUS NODE_4804_length_6231_cov_4.876943 6231 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6231) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6231) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6231 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 4..2308 /locus_tag="DP116_26315" /pseudo CDS 4..2308 /locus_tag="DP116_26315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315129.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(2830..3321) /locus_tag="DP116_26320" /pseudo CDS complement(2830..3321) /locus_tag="DP116_26320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317038.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" gene complement(3424..3795) /locus_tag="DP116_26325" CDS complement(3424..3795) /locus_tag="DP116_26325" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456203.1" /note="involved in start site selection during the initiation of translation; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="stress response translation initiation inhibitor YciH" /protein_id="PRJNA477356:DP116_26325" /translation="MSSSNHKPSENRLVYREFGNDNSAALQRPMSAALERPIEELPPQ QQNLRVQATRAGRKGKTVTVITGFQSKPETLQALLKQLKAQCGTGGTVKDNEIEIQGD HKQKILEIVTKLGYKAKISGG" gene 4297..4845 /locus_tag="DP116_26330" CDS 4297..4845 /locus_tag="DP116_26330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876869.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CAP domain-containing protein" /protein_id="PRJNA477356:DP116_26330" /translation="MFRQTAFGIALSTLVLVSGMSSSYIRGQTATKKSDHNQVLSMSP RQVTSPVTVKSTDLERSVFDQINRYRASKGLPKLLLNAKISRQARLHSLNMANGKAPF SHQGFRRRVAGIPIRCRSAGENLAFNLGYSDPAQEAVTGWLHSSGHLANIKGNYNMTG IGVATNSQGEVYLTQIFLRSGR" gene 5104..6008 /locus_tag="DP116_26335" /pseudo CDS 5104..6008 /locus_tag="DP116_26335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015211048.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 1825 a 1301 c 1421 g 1684 t ORIGIN 1 cttatgaata acaacacccg tatcaaaact attttaattc tggcagcaaa tccgacaagt 61 acctcaaggc tgcgacttga tgaagaggtg cgggaaattg atgaaggatt gcgacgcgct 121 aacaaacgag aagaattcaa gttagagcaa aaatgggcgg tacgtcagcg tgacttctac 181 cgcgccatgt tagattatca gccgcaaatt gttcacttct gcggacatgg tgctgggcaa 241 gatggtattg ttttgcaaga tgaaacagga cagcctgcat taatctccgc cgaagcactc 301 gggagtatgt ttaagctgtt tgccactaaa ggagttgatt gcgtactttt aaacgcttgt 361 tactcccaaa tgcaagcaga agcaattagc caacacgtca attatgtgat tggcatgact 421 cagacagtag gggataaagc agctgttgct tttgctgtcg cgttttatga tgcgatcgct 481 acaggggaag aggtagaatt tgcctatcaa cttggctgct ctcatatgat tggttttttg 541 gaacagcaaa ctccggtact aagaaaaaaa cagtcaatta cccccactgc acctcaaata 601 gaagtcattc ctcctaaccc ctatcaagga ttgtctgcat ttggggaaga agatgcagaa 661 ttcttttttg gacgtgatac atttgtgaat ggcttagtgc aaggaactca ctctcagtcg 721 ttggtagcag tgattggtcc tagtgggagt ggcaagtctt cggtggtgtt tgctggactc 781 attccacagt tacgaagtga aggaaattgg gtgattgaat cttttcgtcc tggtaatcag 841 ccattttatc agcttgcttc tgcgttggtg cgtcaattag aaccagaaat tggtgaaact 901 gaaaaattgc gatcggcggc tggactagca acagatatac agcagggtaa agtgactttg 961 cagcaggtgg tgtctaacat tagagaacgc aactcagata aacgttttct gctgtttgcc 1021 gatcaatttg aagaactcta cactctttgc cagaaggaag aaattgagcg ttttactgat 1081 acgttgctcg aagctatcca tcagaaaatt atcacactgg ttttgacttt acgcgctgac 1141 ttctacggct atgtcctctc ctaccgcccg tttcgggatg cattgcagga gtttacaccc 1201 caattagtaa gttccatgag tcgggaggaa ctacaagcag cgatcgccct acctgcacaa 1261 aagctagaag tgcaacttga ggcacaatta caagagcgaa ttttggatga tgtcggactg 1321 gaaccgggta atttaccact attagaattt gctcttaccc gattatggtc aaagcagcaa 1381 aaccggacgt taactcacca agcatatact gagattggcg gtgtgaaaaa ggcattagct 1441 aatcatgccc agcaggttta tagtcaactt agcccaacag agcaaaagca ggcacagcga 1501 atttttgtac aattagtgcg tccgggggaa ggaacagaag atactcgacg gctggctacc 1561 cgcaaagaag ttggggaaga gaattgggga ttggtaagtt 
atttagcggg ataccaagcc 1621 cgtttagttg tgacgggacg cgatgacaac tcaggagaag atacagtaga ggttgtccat 1681 gaagcggata actctgcgcg agtggatgaa tgctaatcgt cagtttcgcg tttggcagga 1741 acggttaaag gttgcgatgt ttgagtggaa aaataacaat catgactctg gggcgttgtt 1801 gcggggtgtg ccgttaactg tagcagaaga ttggttgcaa aaacgcgctg aggaaatgac 1861 gcaggaagag cgagacttta ttcaggctag cacaagtcaa cgggttcgag acaaacaaga 1921 acgcgatcgc cggacgcaac taacaattat cgcactcagt agcttttctg gggtaactct 1981 aattatagca ggcgtcgcag gcgtgggctg gagcaatgcc gcaatcagtc aaattagttc 2041 tcttactatt acttcagatg cgctgttgaa ctcagatagg gcaaaagctt taaaagcaag 2101 tttaaaagct gttgtgcaaa tgcaacatac acccttggta aatgctgata cccgcacaca 2161 ggtagaactg acattactga atacagttga caatgttgca gcaccgaaca ccctaggagg 2221 acacgcaaaa gctgtttctg gcgtcagctt cagcccggat ggcaaaatcc tcgcttctgc 2281 aagtaatgac aacacggtga aactgtggag atgggatttt gattatttac tcaggcaagg 2341 gtgcgcgttc atgggtgagt atttcaaaac taatcccaat gatgatgatg ctgaaattgg 2401 caatatgtgc cgcgcagtca gtcgttagtt gccattttct ccacactgct ggattaattt 2461 actctcaaga aacacagggg tgtaggggtg taagggtgta ggggtgtaag ggaaaaaggt 2521 gtttctttca tattttgctg gtggctgcga agccgccagt caaaggctga tatcttgtac 2581 cattacgagt aaatcaccaa tgcttgtata aaatagtatt taaatttact caaaaataag 2641 ttccagtgtc agtccatttt aatggacttt gcctatgcag cccaggactt acagtcctag 2701 gcggacaaga acggagtcaa attattgtaa caaatttgac ttttattgac aaggtgcaag 2761 atgtaagcct cttacgggcg tgtttttcgc tgccgtgaaa tatgaaagaa acacatttca 2821 cttaaatctc taaaactttt ttcgacgagc tacaatccca tacatcagag gcaaattcgg 2881 cacatcctct ggcggaaaca tccggcgtcc aggtgcttcc cgcatccgtt caaatatctt 2941 acaaccgttg gcgtaagggt actccttcaa aacctctaga gttaatccag cttccaataa 3001 cgccgtgact atttcaccga taccccattg gaactcatgg gaacggtaag gattgcgaaa 3061 attttcaata cccgtttcat agccccaagg agttagcgcc gtacctgatg aggctacgta 3121 atcacttatt ccatcttccc aggttagagg ctttccttca ctgaagtaag aaaacttgtg 3181 actccaatct tcatcaaata ttatagccac aggatgaaaa tccaccacta tgaagcgtcc 3241 accaggtttg agaacatctg caattccttt tgcccaaagg ttcaaatcag 
atatccaaca 3301 aactgcacca taagacgaga aaacaatcta taaatctcaa aacgagcagg atttttatac 3361 ttttcaaaca taacatttat gtgcgaaaac agcaggtttc tggagttcgg taaacaccac 3421 aaattaacca ccgcttattt tggctttata acctagcttg gtcacaatct ctagtatttt 3481 ctgcttatgg tcgccttgaa tttctatttc gttgtctttg actgtaccac ctgtaccgca 3541 ctgagctttc aattgcttca gtaaggcttg taaggtttct ggtttggact gaaaaccggt 3601 aatcacagtc acagtcttgc ctttgcgtcc ggcgcgagta gcttgcacgc gcagattttg 3661 ctgttgaggg ggtagttcct cgattggtct ttctagggca gcagacattg gtctttgtag 3721 ggcggcggag ttatcgttgc caaattcccg gtagacaaga cggttttcgg agggtttgtg 3781 atttgatgat gacataaggc taaattgaat gtgttagtag cgtgcgctaa ggcgcgcata 3841 ccgcagagaa ccaagagaca aagaggaaaa agagtttata taatgatggt cgttctgtca 3901 ttttagcttg tgcgtgacta tgagtcccat actctccagc ttgaaatcaa caactcaact 3961 tcctgatata caaggacgtt ttggcggtca ctatgtgcct gaaacgctaa tcaaacctct 4021 tgctggatta taaacagcga ttacagcaat accgtagaac gtaatttgat gtatgctatc 4081 acaacctcga tagttaacca aataaatacg gcaacttggg tatcattttt ctaaataact 4141 ttgctctatg atgattgtta tgaaagactc ttgacgcttc tagttatcat gtgccatcat 4201 ttcgtaactt cctgaataca aacaagtata cgggtatatg aagaaacaaa aaaacaaact 4261 ctgcgtctta caaccctaca tactacataa ttctccatgt tccgacaaac tgcttttggc 4321 attgctttaa gtacgcttgt ccttgttagt ggaatgtcat ccagttacat acgaggtcaa 4381 accgctacaa aaaaatcaga ccacaaccag gtgttgtcaa tgtcaccacg tcaagtgacg 4441 tcacctgtta ctgttaaatc tactgattta gaaaggtcag tttttgacca aattaatcga 4501 tatcgagcat ctaagggttt gccaaagctc ttgctaaatg caaaaatatc gcggcaagca 4561 agacttcaca gtctaaacat ggctaatggt aaagctccgt ttagtcatca gggatttaga 4621 aggagggttg ctggtattcc tattcgctgc agaagtgcag gagaaaacct tgcctttaac 4681 ctaggataca gcgatcccgc tcaggaagct gtgactggtt ggttgcacag ttccggacat 4741 ctcgccaaca ttaaaggcaa ttacaacatg actggaattg gtgtagcgac taatagccaa 4801 ggtgaagttt atctgacgca aattttcctt cgtagtggta gataattctc aaaatcggtt 4861 caagcgtgag tgtttaagct atccgctaaa cgctggacag tagtggacaa ccagtacctc 4921 cagtgtagag atagccagtc aaagcgaaag ctacgttgtt tgggtgatga caccagtggg 
4981 tgctttccag cttactgctc tgtcgctagc tattaaatat ctctatttag ctaaggaagt 5041 gtagctagcc taacaagtcc ttacaacatt gacgaggaaa acattacccg aaaggaggct 5101 cttatgagca aagtcttcgt aattgatacc aagaaacgac cgttagatcc aattcatcca 5161 gcacaagcaa gacagctttt aaggaacaaa aaagctgcag ttttgaaaaa attcccattc 5221 acaattattc tcaaagaatc tagagcagat gcactggttc aacctttgag aattaaaatc 5281 gaccctggtt ctaaagcaac tggaatagta ttagtaaacg attctgcaaa cgaggttata 5341 tttgtggcag aacttcaaca tagaggtttt gtgattagag aagctttaac ttctcgtaga 5401 caacttcgta gaagcaggag agcaagaaaa actcgctatc gtcaagcacg gtttaacaat 5461 agaaagcgtc cggaaggttg gttaccgccg agtttgatga gtcgtgtact gctgcagtca 5521 atgcaactag atttaaacta ttggaaacgt tgatctctac tgggttgcca gtagaaacag 5581 gctctggcgg tttgacaaag ttcaatagaa caaaacttgg actagacaaa actcactatt 5641 ttgatgcggc ttgcgttggc ttatcgacac cggacaagct attagtcaaa ggggtcaaac 5701 ctttgatcat caaagcttgc ggacatggca gcagacagat gtctatcaca aataagtttg 5761 gttttcccaa aaggcataaa actaatagta aattccactt tggatttagg actggcgata 5821 tcgtcaaagc tgatgtaccc aaggggaaaa atgcaggcat tcacgttgga cgagtaacta 5881 ctcgaaagac tggacagttt gacattgccg caggcggtaa gacattgcag attatcaatc 5941 ataagtactg caaaatactg caacgacatg atggctattt gtactcgttg tccagtattg 6001 tccattgatt ccgcaattag atgcgatccg atcaccggag gtggcggctt ccgcatcgca 6061 atcctaaacc gtcaaagtgc cgcaacctgc aactctgtgg gggttggaag ctccgaaaga 6121 acccatgaat gtataggatt cctatttgat ttttgaacag aactccgtac agtttatgtt 6181 aagcgttaag agttccctgt tccctgttcc ctgttccctg ttccctgttc c // LOCUS NODE_4812_length_6218_cov_4.801558 6218 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6218) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp.
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6218) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6218 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1321 /locus_tag="DP116_26340" CDS <1..1321 
/locus_tag="DP116_26340" /inference="COORDINATES: protein motif:HMM:PF13424.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26340" /translation="KLDQAIAAYNKAIQLDPNYAKAYYNLGIALSDQKKLDQAIATYN KAIQLAPNYANAYNNLGLALSEQKKLDQAIAAYNKAIQLDPNNTYAYIALGIVLAQQK KPDAANVAFNKAIQLDPAVSSLAYTALGLVLAQQKKPDAAIVVFNKALNLPEYKLETL TTAHSLAHTGLGLVLQQQGKLKEAISEYEKATKIDPNFVYANNNLKEAQRLLSIKSGN VTEASDDRAWLPKNEPSLPILRPVVFITAEFNTRERLGSENGAGIVIKREGNRTLIVT NRHVIFDKDANQQGKNIQVEFFSQPPSGKVRMRRDAKLLFMTPPDDSIDIAVLEVIGN LPKDIQPLPISQNSIHRGMSIRTIGHPFDDVPWAMQAGEISSYSSAKMLISTVKIKPG YSGGPVINSKNNQLLGIVVERDSDTGLAFAYPMSVIKEKLRNLGLAL" gene 1414..2142 /locus_tag="DP116_26345" CDS 1414..2142 /locus_tag="DP116_26345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739669.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26345" /translation="MVCKPWLFTSLLLLTLSNLAAQAQASCPAGNPRSTEYIRRNPPN RCEGIQREPISGNTLRLISIAIRNIPSYGETLLMQIPQINGGNNPQVKLQSLEKYYQL DNPSLSPNGSGYSFSWDTYVLKKENILPDTLRALAYFNLGSESVHIPVILGNTSGKYE FVFFTPSRAKFSTFEILRDGKRVYSSPLNNARSGEIVFNWDGRNASAGRYELHIIANI EPIRQPPQRIERRFVFEHNPNWLK" gene 2148..3452 /locus_tag="DP116_26350" CDS 2148..3452 /locus_tag="DP116_26350" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26350" /translation="MDKKTMARRQKTRQKLTPFFSWAMRGIWQAITTIVAWVVSGVRV FITLIFEKFNQLGKYLVQRKLVLFVLLLFLAFVLLGLAAIIPKTHIFEGNLIVEELTF TYNGDKDKLFLDSMRSIRNFDIEGVINNELTFTGEFQSTSFPQLNQLNKPLKIRLPNR DSKLVIEPVKNQKGSDLDITELRILPGTKVSGFKYNLSQKQHQLAFGLEQSSTSQIQP NELKIYFGEKPIKVVLENYKISGFNQNNLDTQQPLEFILTPKNKELNLELKQKNNIYL SLDESNKNDPNLWFQEKIDTKNVYFQHLDRTGDINDDVTVSTIVEGKIRMVEQEREIK ANQFLMGDEPNTPLNIERIRHLQIVPKKGLEVRISGRTKQIQIGLDKDFPVSRIQGSW LDGFLPRDAIVALFSFSAATITYLLGFLIENASKSGSNPKSS" gene complement(3624..4478) /gene="purU" /locus_tag="DP116_26355" CDS complement(3624..4478) /gene="purU" /locus_tag="DP116_26355" /EC_number="3.5.1.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318467.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="formyltetrahydrofolate deformylase" /protein_id="PRJNA477356:DP116_26355" /translation="MISPTATLLISCPDQRGLVAKIANFIYANGGNIIHADQHTDFAA GLFLTRIEWQLDGFNLPRDLIAPAFNAIAQPLQAKWELHFSDTVRRIAIWVSRQDHCL FDLIWRQRAKEFAAEIPVIISNHPDLKVVAEQFDIDYHHIPIIKENKQEQEAKQLELL HQYKIDLVVLAKYMQIISGEFISKFPQIINIHHSFLPAFVGANPYHKAFERGVKIIGA TSHYVTDHLDAGPIIEQDVVRVSHRDEVEDLIRKGKDLERVVLARAVRLHLQNRVLVY GNKTVVFE" gene 4777..5049 /locus_tag="DP116_26360" CDS 4777..5049 /locus_tag="DP116_26360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318466.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26360" /translation="MTQMLITYLVILVYLIMASCFFNQWLVFFLADEDMDSQQRFYST IALVIATILWPIIVPFAYLELLKFQKKHKDIIDLLINVPKGGSYDE" gene complement(5097..5840) /locus_tag="DP116_26365" CDS complement(5097..5840) /locus_tag="DP116_26365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017323708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26365" /translation="MKLQRTTLILILLALGLGGFVYFHEIKSEPEPKEVKQQQKIFSF KEDDVQSLTVKTQNQTLNLERTDKSEEPKWMMKSPEVAPASNAIVGYLMNLLVEGKSD RIVSISPNQLAEFGLDQPQTTIDVKLKNQKTHQLILGKPDFNRRFLYAQTDPQNKPGS NTDVLLVSTDFENAVNRELSEWKQPIDDGKKPTPNNDKPIDDGKKPTPNNDKPIDNGK KPTPNTDKPIDNGKKPTPNTDKPTPANSK" gene complement(5837..>6218) /locus_tag="DP116_26370" CDS complement(5837..>6218) /locus_tag="DP116_26370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867396.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="ABC transporter" /protein_id="PRJNA477356:DP116_26370" /translation="TPTPTTAQTPQTSASAKPTPTPTATPTPASPTPSPSSGESRMVV LGNSDFAINGLFDKQLNGDVFLNSVTWLSQQDQQPLSISPKEVKNRRINLTAAQALLL ELSSLVILPLIGLVTAAIFWWIRR" BASE COUNT 1854 a 1270 c 1212 g 1882 t ORIGIN 1 aaaactggat caagcgatcg ccgcctacaa caaagccatc caactcgacc ccaactatgc 61 taaggcttac tacaatctgg gcattgcgct gagtgaccag aaaaaactgg atcaagcgat 121 cgccacctac aacaaagcca tccaactggc ccccaactat gctaatgctt acaataatct 181 aggtttagcg ctgtctgagc agaaaaaact ggatcaagcg atcgccgcct acaacaaagc 241 catccaactc gaccccaaca atacttatgc ttacatcgct ctgggtattg tgctggctca 301 acagaagaaa cccgatgcgg caaatgttgc tttcaacaaa gccatccaac tcgaccccgc 361 tgtgagtagt ttggcttaca ccgctctggg tcttgtgctg gctcaacaga agaaacccga 421 tgcggcaatt gttgttttca acaaagcact gaatttacca gagtataagc ttgaaactct 481 aactaccgct catagtttgg ctcatactgg tttgggcttg gtactccagc agcaagggaa 541 attaaaagaa gcaatttcag agtatgagaa agctactaaa attgacccta attttgtcta 601 cgctaacaac aacctcaaag aagcacagcg gttattaagt ataaaaagcg gtaacgtcac 661 tgaagcatca gacgatagag catggttacc caaaaatgag ccatctctac ccattttacg 721 ccctgtggta tttatcaccg ctgagtttaa cacccgcgag agactgggaa gtgaaaacgg 781 tgcgggtatt gtcatcaagc gagaaggaaa tcggacatta atcgtcacca atcgccacgt 841 catttttgat aaggatgcta atcaacaggg taaaaatatt caagtcgaat tttttagtca 901 accaccttct ggtaaagtca gaatgcggcg agatgccaaa ctcctattca tgactcctcc 961 agatgattcc attgatatag ctgttttgga agtcattggc aatttaccaa aagatatcca 1021 acccttaccc atctcccaaa actccattca tcggggaatg tccatccgaa ccataggtca 1081 tcctttcgac gatgttcctt gggcaatgca agcaggagaa attagcagct acagtagcgc 1141 gaaaatgttg atatctacag tcaaaattaa acctggctat tccggtggtc cggttatcaa 1201 ctccaagaat aatcaacttt tgggcattgt tgttgaacgc gactctgata ctggactagc 1261 ttttgcttat ccgatgtctg taattaagga aaaactgcgt aatttgggtc tagcgctatg 1321 aacgacaaaa tctttttagg atttacgcac acgctactaa aaacaggggg ttgggattgc 1381 ttccctccgg tcgcaatgac tatagttaag tcattggtgt gcaaaccctg gttatttacc 1441 agcctactgc tactaactct 
cagtaatctt gctgctcaag ctcaagcttc ctgtccagca 1501 ggtaaccctc gctctacaga gtatatccgt cgcaaccccc ccaaccgttg cgaaggtatc 1561 cagcgagaac ccattagcgg gaataccctg cgtctgattt ccattgccat ccgcaacatt 1621 cctagctatg gcgaaactct cctcatgcaa attccccaaa tcaatggcgg taacaaccca 1681 caagttaagc tgcaatcttt agagaaatat tatcaattag ataacccttc cctgtctccc 1741 aatggttctg gctatagctt cagttgggat acttacgtcc tcaagaaaga gaacatacta 1801 ccagataccc tccgcgcctt agcatatttc aacttgggtt ccgaatcagt ccatatacct 1861 gtgattctcg gaaacacttc tggcaagtat gaatttgttt tctttactcc cagtcgtgct 1921 aaattctcca cctttgaaat cctgcgcgat ggaaaacgag tttacagcag tccgcttaat 1981 aatgctcgaa gcggtgaaat tgtctttaac tgggatggac gcaatgcttc tgctggacgt 2041 tatgaattgc atattattgc taatatcgag ccaattcgtc aacccccgca acgaatagag 2101 cggagatttg tttttgaaca caacccaaac tggctgaaat agctgaaatg gataaaaaaa 2161 caatggcacg tagacaaaag actaggcaaa aacttactcc tttcttcagt tgggcaatgc 2221 ggggaatttg gcaagctata acaactattg tggcgtgggt tgtgtctggt gtaagagttt 2281 ttataacatt aatttttgaa aaattcaatc agctaggaaa gtatttagta cagagaaaat 2341 tagttctttt tgttctatta ctctttctcg cttttgttct ccttggtctt gccgctatta 2401 tccccaaaac tcatatattt gaaggtaacc tgattgttga agaattgaca tttacttata 2461 atggcgacaa agataaactt tttctcgatt ctatgaggag tatccgcaat tttgacattg 2521 aaggtgttat taataacgaa ctaactttca caggtgaatt tcaaagcact tcttttcccc 2581 aacttaatca gttaaataaa ccactaaaaa ttcggcttcc gaatcgtgat agtaagttgg 2641 ttattgaacc tgtaaagaac cagaagggca gtgatctaga cattacagaa ttgcgaattc 2701 taccaggtac aaaagtctct gggtttaaat ataacctttc tcaaaaacaa catcaacttg 2761 cttttggtct ggaacagagt tctacttctc aaatacaacc aaacgaatta aaaatatatt 2821 ttggagaaaa acctattaag gttgttttag aaaactacaa aatatctgga tttaaccaaa 2881 ataatttaga tacacagcag ccattagaat ttatattaac tccaaagaat aaagaactca 2941 atttagagtt aaagcagaaa aacaatatct atctcagcct agatgaatct aacaaaaacg 3001 acccaaatct ctggtttcag gaaaagatag atactaaaaa tgtctatttt caacacctcg 3061 ataggactgg cgatatcaat gatgatgtga ctgtttctac aatcgtggaa ggcaaaattc 3121 gcatggtaga acaggaacgg gaaattaaag 
ccaatcagtt tctcatggga gacgaaccta 3181 atactcctct caacatagaa cgtatacgcc atttacaaat tgtccccaag aagggattag 3241 aagtccgtat ttctggaaga acaaaacaaa ttcaaatcgg tttagataag gattttccag 3301 tttctcgcat tcaaggtagt tggttagatg gttttttacc ccgcgatgct attgttgctc 3361 tattttcttt cagtgctgct accataacct atctgttagg tttcctgatt gaaaatgcct 3421 ctaaatcagg ctcaaatcct aaatcttcct aatgaccgag acaattttag attttagatt 3481 ttagattttg gattaaattc aaaatctaaa atctaaaata aagggtttca tatcatgtcc 3541 ggatgaatac ttataaaaaa cgaagaacct cactcccctc tccttaataa ggagaggggt 3601 gcccggaggg cggggtgagg tcttcattca aatacaactg ttttatttcc atatactaat 3661 acacgattct gcaaatgtag acgcactgct cttgctaaaa caactcgctc caaatccttt 3721 ccttttctaa tcaaatcctc aacttcatca cggtgactga ctcgtaccac atcttgctca 3781 ataattggtc ctgcatctaa atgatcagtc acataatgag aggttgcacc gataattttt 3841 acacctcttt caaaagcttt gtgataagga tttgctccaa caaaagctgg tagaaatgaa 3901 tgatgaatat ttataatttg tgggaattta gagataaatt ctccactaat aatttgcata 3961 tattttgcta agacaactaa atctattttg tactgatgta gtaattctaa ttgcttggct 4021 tcctgttcct gtttgttttc cttgatgatg gggatgtggt gatagtcgat atcaaactgt 4081 tctgctacta cttttaaatc aggatgattg ctgataatga caggtatctc agcagcaaat 4141 tcttttgcac gttgtcgcca aattaaatca aataaacaat ggtcttgacg actcacccaa 4201 atagcaatgc gccggacagt atcagaaaag tgcaattccc atttagcttg caatggttga 4261 gcaatggcat tgaatgctgg tgcaattaag tcacgcggca aattaaaccc atctaactgc 4321 cactcaatac gagtgagaaa taaccctgct gcaaagtctg tatgctgatc cgcatggata 4381 atattaccac cattagcgta gatgaagttg gcaattttcg cgaccaatcc tctttggtcg 4441 ggacaggaaa tgagtagtgt tgcggttgga cttatcatga acacattcta aagactagca 4501 agtttttgtc aactatcttt tcagtataat tacccttggt gaaagtattt tatatacatt 4561 actaatagaa gagaactttc taatttgata gcaggtttta tggaaagtga ataactgctc 4621 ttttttattt tgtctagaat agagctgagt cgtactttaa gctagacgtt atacgaaaca 4681 ttcaaaaaaa atacatgata tatgtgaact agggtcttgt tacaaacgtc aagcaagatt 4741 cctgctgtac ttggttttat ttcaaactat attatcatga cacaaatgct tattacttac 4801 cttgtaattc tagtctactt aatcatggct tcctgttttt 
ttaatcaatg gctggtattt 4861 tttttagccg atgaagacat ggattcacag cagcgcttct attctactat cgctttggtt 4921 atagctacta tcttgtggcc aataattgtt ccttttgctt acttggaatt actgaagttc 4981 caaaaaaagc ataaagatat catcgattta ctgataaacg tacctaaagg tggtagttat 5041 gacgagtgaa gtttcttgaa aataaactat accatcatct caaataataa aagctgttat 5101 ttgctgtttg ctggagttgg tttatcagta ttgggagtgg gttttttccc attatctatg 5161 ggtttatcag tattgggagt gggttttttc ccattatcta tgggtttatc gttattggga 5221 gtgggttttt tcccatcatc tatgggttta tcgttattgg gagtgggttt ttttccatca 5281 tctatgggtt gtttccactc tgacagttct cgattaactg cattttcaaa atctgtagat 5341 actaatagta catcagtatt gctacctggt ttgttctgag gatcagtttg agcatacagg 5401 aaacggcggt taaagtcagg cttacccaaa atcaactgat gtgttttctg attttttagt 5461 ttgacatcga tagttgtttg aggctgatct aagccgaatt ctgcaagctg gttgggtgag 5521 atggatacaa tgcgatcgct ctttccctcc accagcaaat tcatcaaata acctacaata 5581 gcattacttg ctggggcgac ttccggagat ttcatcatcc acttgggttc ttcagacttg 5641 tcagtacgtt ccagatttaa agtctgattt tgagttttga ctgtcaaaga ctggacgtca 5701 tcctctttga aagaaaagat tttttgctgt tgcttgactt cctttggttc aggttcactc 5761 ttaatttcat gaaagtaaac aaaaccacct aaaccaagcg ctagcagtat cagaattaaa 5821 gttgttcgct gtaatttcat ctgcgtatcc accagaaaat agctgccgtc actagcccaa 5881 tcagaggtaa aatcaccaga gatgataact ctaaaagaag ggcttgtgct gctgttaggt 5941 tgatgcgacg gtttttcact tcttttgggc taatggaaag aggttgttga tcctgctgac 6001 tcagccaagt gactgagttg agaaacacgt ctccattaag ttgcttgtcg aacaagccat 6061 tgatcgcaaa atctgaattt cctaacacta ccattcgtga ctcaccggaa gaaggtgagg 6121 gagtcgggct ggctggagtt ggtgttgctg ttggtgtcgg ggtgggttta gcgctagctg 6181 aagtttgagg tgtttgggct gttgttggtg tcggggtg // LOCUS NODE_4817_length_6204_cov_4.6197766204 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6204) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6204) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6204 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(219..2231) /gene="tkt" /locus_tag="DP116_26375" CDS complement(219..2231) /gene="tkt" /locus_tag="DP116_26375" /EC_number="2.2.1.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198159.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transketolase" /protein_id="PRJNA477356:DP116_26375" /translation="MAVATQTLEELCINSIRFLAIDAVEKAKSGHPGLPMGAAPMAFV LWDSFLRFNPKNPKWFNRDRFVLSAGHGSMLQYALLYLTGYDSVTIEDIKQFRQWESK TPGHPENFMTPGVEVTTGPLGQGIANGVGLAMAEAHLAAKFNKPDAKIVDHYTYVILG DGCNMEGVSGEACSYAGHLGLGKLIALYDDNHISIDGSTDIAFTEDVSKRFEAYGWHV QHVAEGNTDLEGLAKAIEAAKAVTDKPSFIKVTTTIGYGSPNKANTAGVHGSALGGDE VKLTRENLGWEYEPFVVPEDALKHWRKAVERGANLEEEWNKAFSDYKAKYSEEAAEFE RYISGKLPDGWDKVLPTYKPEDKGVATRKHSETCLNKLGAVLPELIGGSADLTHSNLT ELKNSGEFQKGQYQNRNVHFGVREHGMGAICNGIALHQSGLIPYGATFLIFTDYMRAA IRLSALSQAGSIWVMTHDSIGQGEDGPTHQPIETLASLRAIPNLTVIRPADGTETSGA YKVAIESAKENNPTLLAFTRQNVPNLAGTSIEGVTKGGYTVVDSEGTPDLILIGTGSE LSLCVTAAEKLTAEGKKVRVVSLPSWELFEAQDAAYKESVLPKAVTKRLSVEAAASFG WHKFVGSEGDTVSIDRFGASAPGAVALEKFGFTVDNVLAKAKALLG" gene complement(2614..3864) /gene="fabF" /locus_tag="DP116_26380" CDS complement(2614..3864) /gene="fabF" /locus_tag="DP116_26380" /EC_number="2.3.1.179" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010997494.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="beta-ketoacyl-[acyl-carrier-protein] synthase II" /protein_id="PRJNA477356:DP116_26380" /translation="MTDFIRKRVVVTGVGALTPIGNTPAEYWEGLISGRNGIGEITLF DASRHDCRIAGEVKNFDPHEYMDRKEAKRMDRFAQFGVSAAIQAVADAQLVINDLNAE QIGVIIGSGVGGIKVLEDQQTIYLDRGPDRCSPFMIPMMIANMAAGLTAIHTGAKGPN SCSVTACAAGSNAIGEAFRLIQGGYAQAMITGGCEAAITPLSLAGFASARTLSTRNND PAHASRPFDKDRDGFVMGEGAGILILEELQHALSRKARIYAEIVGYGMTCDAYHMTSP VPGGEGAARAIQLALKDAGVRPEHVSYINAHGTSTPMNDPTETAAMKKALGEHAYKIA VSSTKSMTGHLLGGSGGIEAVATVLAIANDRIPPTINLENPDPECDLDYVPNTSREAK VEVALSNSFGFGGHNVTLAFKKYV" gene complement(4563..4808) /locus_tag="DP116_26385" CDS complement(4563..4808) /locus_tag="DP116_26385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878949.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl carrier protein" /protein_id="PRJNA477356:DP116_26385" /translation="MSQAETFEKVKKIVVEQLSVKDEQVTPEASFANDLGADSLDTVE LVMALEEEFDIEIPDEAAEKITTVKEAVDYINNKVAA" gene 4994..5905 /locus_tag="DP116_26390" CDS 4994..5905 /locus_tag="DP116_26390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739735.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="heterodisulfide reductase" /protein_id="PRJNA477356:DP116_26390" /translation="MLAHTLKYAYFPGCVAQGACRELHQSTVALTQALDIELIELKKA ACCGSGTFKEDSQMLEDTVNARNIALAEQLNLPLLTHCSTCQGVIGHVDEHLKECQKT NPAYVERVNGLLQKEGCSPYRGTTEVKHLLYALVTDYGLEEIQKRVNRKLTGLKCAAF YGCYLLRAQKSMPYDDPFQPKAMENVFLTVGAEPIYYRGRTQCCGWPLSSYATTQSFK MAGMHIQEAIEAGADCIVTPCPLCHLNLDSRQKEVEKVIGRELGLPILHLPQLIALAV GVSPKELGLDRHIVSTTPVLEKLGFTS" BASE COUNT 1573 a 1509 c 1412 g 1710 t ORIGIN 1 cagcgctgca ggagggtctc cctccgtagg cgactgcgaa cccgaagggt taccaagtag 61 gaaacggact cgtccacccc ttgttcactg ttcactgttc actgttccct gttaagcgtt 121 ccctattaag cgttccctat tccccaaaac tgtaaaacaa aaaaaggtgg gcaataaagc 181 ccacctgaaa agaaagagaa tagatactct gttgctaatt aacccaacaa tgccttagcc 241 ttagctaata cattatcaac tgtgaagccg aatttctcta aagcaaccgc acctggagct 301 gaagcaccaa agcgatcaat actaacagta tcgccttcgc tgcccacaaa cttgtgccaa 361 ccgaaactgg cagctgcttc gacagacaaa cgcttggtaa cggctttagg gagaacagat 421 tctttataag ccgcatcctg tgcttcaaac agttcccacg agggtaacga gacaacacga 481 acttttttac cttcagccgt gagtttctcg gctgcagtta cacagaggct caattctgaa 541 ccagtgccaa ttaagatcaa atctggcgta ccttcagaat ccaccaccgt gtatccaccc 601 ttggtcacgc cctcaatcga ggtacctgcc aagttgggga cattttgacg ggtgaatgct 661 aacagggtgg ggttgttttc ctttgcactc tcgatcgcga ctttgtaagc gccagaggtt 721 tccgtaccgt ctgcgggacg aatcaccgtt aggttaggaa tcgctcgcag ggaagccaaa 781 gtttcaatcg gttggtgcgt cggaccatct tcaccttgtc caatcgaatc gtgagtcatc 841 acccaaatcg agccagcttg agatagggcg gataagcgga tagcagcacg catgtaatct 901 gtgaagatca aaaaggtagc gccgtaggga attaatccag actgatgcag cgcgataccg 961 ttacagattg cgcccatacc atgttcccgc acgccaaagt ggacgttacg gttttggtat 1021 tgccctttct gaaattcgcc ggagtttttc agttcggtaa ggttggagtg agtcaagtca 1081 gccgaaccac caatgagttc aggtaaaacc gcccccagtt tgttgaggca ggtttccgag 1141 tgtttgcggg tggcgactcc tttgtcttct ggtttgtagg tgggtagtac cttatcccaa 1201 ccatcgggta atttgccgct aatgtaacgt tcaaattcag cagcttcttc ggaatatttg 1261 gctttgtagt cgctgaaggc tttgttccat tcttcctcca 
ggtttgcccc gcgctcaaca 1321 gctttgcgcc agtgcttaag agcatcttct ggtactacaa aaggttcgta ttcccaaccc 1381 aagttttcgc gggtgagttt tacttcgtct ccaccaaggg cagatccgtg aacaccagcg 1441 gtgtttgctt tgttgggtga accgtaacca atggtggtgg tgactttaat gaaagaaggt 1501 ttgtcagtaa cagctttggc tgcttcaata gctttggcaa gaccttctaa atcagtgtta 1561 ccttctgcaa cgtgctgcac gtgccaaccg taagcttcaa agcgcttgga aacatcttcg 1621 gtgaatgcta tatctgtaga accatcgatg gagatgtggt tgtcgtcgta cagagcaatg 1681 agcttaccta atcccaggtg tcccgcgtaa gaacaagctt caccggaaac gccttccatg 1741 ttgcagccat cacccaaaat cacgtaggtg taatggtcaa caattttggc atcgggtttg 1801 ttgaactttg cggcgaggtg tgcttctgcc attgccaaac cgactccatt ggcaattcct 1861 tgccccagag gtccggtggt aacttccacg ccaggggtca tgaagttttc ggggtgtccg 1921 ggggttttgg attcccactg acggaattgc ttgatgtcct cgatggtcac gctgtcgtaa 1981 ccagtcaaat acagcagggc atactgcaac attgagccgt gaccggcaga caggacaaag 2041 cgatcgcggt tgaaccactt gggatttttg gggttaaacc gcaaaaagct atcccacagc 2101 acaaacgcca ttggcgcagc acccattggt agtcctgggt gaccagactt tgccttctct 2161 acagcatcaa ttgccagaaa gcggatcgag ttaatacaaa gttcttcgag ggtttgggtt 2221 gcaacagcca taatcagatt gttctttacg acgggttagc agtcttcgag catcaccagg 2281 ggggttgttt atcagttagc agttagcagt tatcaagtac cactcgtacc agttatcagt 2341 taccagtttt cactgttaac tgttcactgt taactgttca ctgtttgaaa catccccggc 2401 agtatcctca tcatcccatt ggtgcttgtt gaccgacaag cggcagatct tgggattttt 2461 gataaaaatt ttgcattcat tttatggggg atatcttcta ttatcggttg ctgaaccgac 2521 cgagtgatca agatttctgc ccaatatctc aatccagaca atgcttgagt ttcatgcttt 2581 tgcgtgaaat ccacgcatat aaaatttcca accctagaca tacttcttaa aggctagtgt 2641 cacattatga ccgccaaaac cgaaagaatt ggatagtgcc acttcgactt ttgcttcccg 2701 gctggtgtta gggacgtaat ccaagtcaca ctcaggatct gggttttcca aattaattgt 2761 cggtggaatc cggtcattgg cgatcgccag tactgttgct actgcttcaa ttccaccaga 2821 accacccagc aaatgacctg tcattgattt ggtggagcta actgctatct tataagcatg 2881 ttcacctaaa gcttttttca ttgcagcagt ttccgtcgga tcattcattg gggtactggt 2941 accatgtgca ttgatgtagc tcacatgttc tggtcttact cctgcgtctt 
tgagtgctag 3001 ttggattgct cttgctgcac cttcgcctcc tggtactgga gaggtcatgt ggtaagcgtc 3061 acaagtcatg ccatagccaa caatttctgc ataaatccta gctttacgac tcaaggcatg 3121 ctgtaattct tctaaaatta agatgcccgc accttcaccc atgacaaatc cgtcacggtc 3181 tttgtcaaat gggcggcttg catgagcggg gtcattattc cgagttgaaa gtgtcctagc 3241 tgaagcaaat cctgctaaag acagaggtgt aattgccgct tcacatcccc cggtaatcat 3301 cgcttgggca tatcccccct gtatgaggcg aaaagcttcc ccgatagcgt ttgagccagc 3361 agcacaggca gttaccgagc aggaatttgg tcctttagca ccagtgtgaa ttgctgtcaa 3421 tcccgctgcc atattggcta tcatcatagg tatcatgaaa ggactacagc gatccggacc 3481 acgatccagg taaattgttt gctggtcttc taaaacttta atgccaccaa cgccagaacc 3541 aataataacg ccaatctgtt ctgcgtttaa gtcgttgatg actaactgcg cgtctgcgac 3601 agcctgaatt gctgctgaga cgccaaattg ggcaaatcga tccatacgtt tggcttcttt 3661 gcgatccatg tattcatgtg gatcgaagtt ttttacctca ccagcaatgc gacaatcgtg 3721 acgagacgca tcaaacaagg tgatttcacc aatcccattg cgtccgctga tcaatccttc 3781 ccaatattcg gctggtgtat taccaatcgg tgtaagcgcg ccaacaccag taacaacaac 3841 gcgtttacgt ataaaatcag tcatgatagt tatgatggct attgatgcag gcagaacaaa 3901 acagtgctga gcaatgagtt ggtgagccag tgcgcccttg gggagccacc cccaagggac 3961 gggttccccg gcttgggggg agtggcgtgg ttcctccttt gggtgccatc tggcgtatct 4021 cctgagccct agtgggtacg gtcacggctg cgccgtatgc gcaaggcgca tacgcttgcg 4081 tgcgcaaagc gcatacccta tgagtgagtt ggtgagttct caagagggac aaagggacaa 4141 ggagattagt cattgctaca taaaacttct ccatagttgt accagacaac tacttggtct 4201 acttcggtgc tattgctaag gcagatagat ccaactatcc caaacgtaac tttagaccta 4261 agcgatccta cgtacgtcca cttcgtcatt ttcaactagt gatttggtgt tagttggctt 4321 gtttgtcacc attttgacta actttcgata attcgagcaa gaaaatttgc tacttattta 4381 agctcttgac tcttcactcc tgacttttta ctcagcaatg ccagtcgcct gccttcggtt 4441 accctcttgg agtttacact cctgggcaca ctctgctgga gagtctcctt tggaggagat 4501 agcaactccc gtcaaagtgc ccgtagcctc aatagcactc agcactcagc aattttttag 4561 gtttatgcgg caactttatt gttaatgtag tccacagcct ctttaacagt tgtaattttt 4621 tcggcggctt catcaggaat ttcgatgtca aactcttctt ccaaagccat aaccagttca 
4681 acggtatcaa gggagtcagc tcctaaatcg ttggcgaaac tagcttctgg tgttacttgt 4741 tcgtctttaa cactcagttg ctcgacaaca atttttttga ccttttcaaa tgtttccgct 4801 tggctcatag atacaattcc tcaaccagtt gctaaaatcc gctgtttatg ggggatatat 4861 ttttgggcat attcatctta tcggaaagcg cgagcacccg tacactgtta agggattttc 4921 tttggtgaaa atctcttgct tcgctcaaaa taaacaagta atgataaaac agttattgac 4981 taactttgat actatgctag cccatacatt gaaatacgca tacttcccag gttgtgttgc 5041 ccaaggagct tgtcgggagc ttcaccaatc aactgttgcc ctaactcaag cgctagacat 5101 cgaactgatt gaactcaaaa aagctgcttg ctgtggttcc ggcacgttta aagaagattc 5161 ccaaatgttg gaagatacgg ttaatgcacg gaacattgct ttagcagaac agttaaatct 5221 accactgcta actcattgca gcacttgtca gggtgttatt ggtcatgttg acgaacacct 5281 taaagaatgc caaaaaacaa atccagcata cgtagaacga gtgaacggct tgctgcaaaa 5341 agaaggctgt tcaccctatc gaggaaccac agaggtcaaa catctcctct atgctttagt 5401 aactgactac ggtttagaag aaattcaaaa acgtgtcaat cgtaagttaa ctggattaaa 5461 atgtgcagct ttttatggct gttatctcct ccgtgcccaa aaatccatgc cttatgatga 5521 ccctttccaa ccaaaagcaa tggaaaatgt ctttcttacg gtaggagcag aacctattta 5581 ttacagaggt cgtacgcaat gttgcggttg gcctctttct agctatgcca cgactcaatc 5641 ttttaaaatg gctggaatgc atattcaaga agccatagaa gctggtgcgg attgtatagt 5701 gactccttgt cctctttgtc acctaaattt ggattctcgt caaaaggagg tagaaaaagt 5761 tattggacga gaactcggtt taccaatact gcatttaccc cagctgattg ctttagcagt 5821 tggtgtcagt cctaaggaac taggtttaga tcggcatatt gtttccacca caccagtttt 5881 ggaaaaatta ggattcacgt cctagagact ttcaacatgg ttcggctcca aaatcgatta 5941 gcgcgaaggg tttcccgact tgagcaaagt ggcgtttgaa cgtaatgaaa cccaacaaag 6001 tcctcaaaat gttgggtttc gttcctcaaa cggacacttt ctttaacggg ggggaaccct 6061 ccgttcgcgc agcgtgccct acgggcatag ctttgagtct cctccaccag acgctgcgcg 6121 tatgcctgcg gcacgcctta cggctaacgc atgattgcgt gcgcttgcgc ttacggagga 6181 aacctcccgc agtcgccaca acgg // LOCUS NODE_4825_length_6191_cov_4.6597136191 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6191) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6191) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..6191 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1067 /locus_tag="DP116_26395" CDS <1..1067 /locus_tag="DP116_26395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017739736.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="energy transducer TonB" /protein_id="PRJNA477356:DP116_26395" /translation="GGGGGGNSPRIATGSGEGTVRSGTGTGTGIGTGSGSGFGSGIGS GSGSGFGSGIGSGIGSGIGSGIGSGTGSGIGSGTGSGIGSGTGSGIGSGTGSGQGSGK TQVATAPKPRTPAPTQLDFTQCVRCDIKYPEKSKQRGLEGRPGIAFDVDKNGNVNNIR LVRPSGHKELDQALIDSAKDFKLNSAAFGRQNVQLFANFAIQGSRSNQEAQQRQRERQ QRLEAQQKRQQEAQQRNREPVANEEENSGRRRRAISTTPEVAPQTTPETGIRRGQDVP STPTESNSEQATPQPPTLTQPLEQNSSSEQNEQQPSGRRKRNLGTSDNQNDLGEQLRR SQNQSQPSEQVPSDSSGNNQ" gene 1370..1552 /locus_tag="DP116_26400" /pseudo CDS 1370..1552 /locus_tag="DP116_26400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006194353.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="FAD-dependent monooxygenase" gene 1948..2412 /locus_tag="DP116_26405" CDS 1948..2412 /locus_tag="DP116_26405" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26405" /translation="MLQKWFKKYLESRPVLTAYNNLEKTQPRQHTHFILFPPNRWFNI TLNHILDRPQWYKGVLHNHPWPNVSLILKGGYWEKTKNGVKWYGPGSIIFRSAYKHHN IWIKDNQEAWTIFIHGPMMKNSFDFVQDGGSVTPEQLGLPIGLVIRGASTQN" gene 2458..3198 /locus_tag="DP116_26410" CDS 2458..3198 /locus_tag="DP116_26410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137486.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c biogenesis protein CcdA" /protein_id="PRJNA477356:DP116_26410" /translation="MLETLQTRLYELEQFANALVANQLTHLSLLSVGVIFIAGLLTSL TPCMLSMLPITIGYIGGYEAKSRLQAVAQSTWFALGLATTLAGLGIIAAFVGKVYGQV GIGLPIIVSIIAILMGLNLLEALPLQLPSFGGTEWISKELPQGVRAYFIGLSFGVVAS PCSTPVLASLLGWVANTQDLILGAVLLISYTAGSVAPLILAGTFTASIKKLLELRRWS GWINPISGALLVGFGVLSLISRIPVGSF" gene 3214..4611 /locus_tag="DP116_26415" CDS 3214..4611 /locus_tag="DP116_26415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316315.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c biogenesis protein" /protein_id="PRJNA477356:DP116_26415" /translation="MTIENSVPKKSIWSAFGQLLRQELLPVLTDLRLAIVMLLIIAIF SVTGTVIEQGQSLAFYQANYPEHPALFGFLTWKVLLTLGLDHVYRTWWFLALLIFFGT SLTACTFTRQLPALKAARRWKFYDEPRQFQKLALSAELDNGSVNSLTQLLQNQRYKVF QEKDDSLYARKGMIGRIGPIVVHIGIVTILLGSIWGAMTGFVAQEMVTSGNTFQVKNI IDAGPWATALPKDWSVRVNRFWIDYTSSGGIDQFYSDMSVLDNQEQEVDRKTIFVNSP LRYRGVTFYQTDWGISAVRVRVNKSPIFQLPMAQLNTNGKGRIWGTWIPTKPDLSEGV SLLAKDLQGTVLIYDGTGKLVDTVRTGMSTSVNGVTLKILDVLGSTGLQIKADPGIPV VYTGFALLMLGVVMSYFSHSQIWALQKDGRLYVGGKTNRAQVAFEREMLEILDKLSLP TQGEEKAVAIEPHSA" gene 4716..6158 /locus_tag="DP116_26420" CDS 4716..6158 /locus_tag="DP116_26420" /inference="COORDINATES: protein motif:HMM:PF05860.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26420" /translation="MSKLGARWGWLLGIAIGGVYIWSAHCASAQITPDATLPNNSIVT PSGNTINITGGTQAGSNLFHSFKEFSIPTGNEAFFNNTVDIANIITRVTGGSRSDIDG FIRANGTANLFLINPNGIVFGPNARLDIKGSFIGSTASSLKFADGTEFSATNSGAPPL LTISVPLGLQFGSNPKEIQVTGPGHELSYQDDIIEYNNGSKINQPIPVLNSSQPGLQV SKGKTLALVGGNVSIEGGILKSPEGRIEIGSVGSNQAVSLVPIEQGWKLGYEAGTSFA DIGFSGKSFLSATGDGGGAIAIAGRNINITSESIVRADTLDNRNGQQISILGDAIVVD RSNIGANSYSSGNGGQVKLEANNITFKNNSGVGSQAAASGKAGDITLIAKNSFVVSNQ SGLGSPTFGVGNGGVINVEANSVLLEKEAGFGASSFGKGGVGEINIKVGDLVMRDSGI GSDSSSVSNGGKININANSFQLERGLGTVT" BASE COUNT 1754 a 1261 c 1451 g 1725 t ORIGIN 1 gtggtggtgg cggcggtggt aacagtcccc gtattgccac cggttcagga gaaggcactg 61 tgagatctgg cacaggtact ggtactggta ttggtactgg tagtggctca ggttttggtt 121 caggtatcgg ttcaggtagt ggctcaggtt ttggttcagg tatcggttca ggtatcggtt 181 caggtatcgg ctcaggtatc ggctcaggta ctggctcagg tatcggttca ggtactggct 241 caggtatcgg ttcaggtact ggctcaggta ttggctcagg taccgggagt ggacaaggaa 301 gtggtaagac acaagttgct acagctccca aaccaagaac ccctgccccc acacagttgg 361 atttcacaca gtgtgtcaga tgtgatatca agtatccaga aaaatctaaa cagcgaggtc 421 tcgaaggtag acctggtatt gcttttgatg ttgacaaaaa tggtaatgtt aacaacatca 481 ggcttgtccg ccctagtggt cacaaagagc tagatcaagc actgatagac tctgcaaaag 541 attttaaatt aaattccgca gcctttggta gacaaaacgt acaattattt gcaaattttg 601 ctatccaagg gtcaaggagt aaccaagaag ctcagcaacg tcaaagagaa agacaacaaa 661 gactagaggc tcaacagaaa agacaacaag aagcacaaca gagaaaccgc gaacctgttg 721 ctaatgagga agaaaattct ggacgtagaa gacgagccat atcaacaact ccagaagtag 781 ccccgcaaac cacgccagaa actggaatca ggcgcgggca agatgtacca agtacaccta 841 cagagtcaaa ttccgaacaa gcaactccac agccaccaac tttaactcaa ccactagagc 901 aaaattcttc ttccgagcag aatgaacagc aaccctccgg tcgccgcaaa agaaatctag 961 gcacatctga caatcaaaat gatttaggag agcaattacg tagaagtcaa aatcaatccc 1021 agccgtcaga acaagtccct tcggatagta gtggaaacaa tcaataaggc tgtataaaaa 1081 gttatgcgat acagatcttg cgtagatgtg gagtaaatac ttcaactgca catctacgcc 1141 actttgcttt 
tgaggtttcg ccagtttgaa atcacgagca gaaatgtctg ctagttgttg 1201 tcctttacaa gcatgagcaa ctgggaggga atttatgcgt tctctctaaa gaaagatgaa 1261 aatatgtgaa cttgcagtaa aacagaatta acgacaggaa attaagatta aagcaaccca 1321 tgttccaaaa gagctataaa gaaaatttaa gtctcaattg gtaaaaatca tggttaagaa 1381 aatcgcaatt attggtgcag gatccagtgg actcctactt gctcactacc tgttgcatcg 1441 tggtgacaaa tatcaagttg acatttatga gcgccgcagc gatcccagaa ctgtctcatt 1501 ctcaaattct agaacctacc ccatctccct aaacgaaagg ggaatgagcg ctaatcccaa 1561 aaacccggtt tcttcgagtt gagcatacga gatatcaagc ttgacgacag agagaaaccg 1621 ggttttttgc cccttcttga ctactgtact ggcgctctag gctcattcaa ataacgcttg 1681 cgcctattgt ttggtttgtt aactctccgt ttagcatttc cggcttgtgg ctgtggactt 1741 ttgaccggag gtaacggaag tctccggtca gaattgatgg agatagaaat gcttattccg 1801 cctgtatgct tgacttgttt ccaacagtat agttatttgt gtcgtgctgc actagttgaa 1861 attacacaaa tgttgtgaca attggtagtt agttacacat ttatactggg aaaactaagg 1921 acagctgatc ttctagaaga catctctatg cttcagaaat ggttcaaaaa atatctagag 1981 tctcgacctg tcctaactgc ctacaacaat ttagaaaaga cacaacctcg acagcatacc 2041 cactttatct tgttcccacc caatagatgg tttaacatca cacttaacca tattttagac 2101 cgaccgcaat ggtataaagg agtactccac aatcatcctt ggcccaacgt cagtctgatt 2161 ttgaagggtg gctattggga gaaaaccaag aacggggtca agtggtatgg accaggctca 2221 attatcttcc ggtctgcata caaacaccac aatatctgga ttaaagataa tcaagaagca 2281 tggacaatat tcatccacgg accaatgatg aaaaacagtt ttgatttcgt gcaggatggt 2341 ggatctgtca ctccagaaca gcttgggttg cctataggtt tggtgatccg tggagcctca 2401 acgcaaaatt aagctaaaat tcaagcatag taagcatttc ctgcttttat atctcccatg 2461 cttgaaaccc tgcaaacccg actttacgaa ctagaacaat tcgctaacgc ccttgtcgcc 2521 aaccaactca cgcacttaag tttgctgagt gttggcgtca tttttatcgc tgggttgctc 2581 actagcctca caccctgtat gctttccatg ctaccgatta caatcggtta tattggtggt 2641 tatgaagcga aaagccgtct gcaagccgtt gctcagtcaa cgtggtttgc tttgggattg 2701 gctactacac tggcaggact tggtattata gcagcttttg tgggaaaagt ctatggtcag 2761 gtaggaattg ggttaccaat tattgtcagt atcattgcta ttctcatggg gctgaattta 2821 ttagaagctt tacccttgca 
attgccctct tttggtggaa cagagtggat ttctaaagaa 2881 ttacctcaag gagtgcgggc ttatttcatt ggtctgagtt tcggtgtcgt cgcctctcct 2941 tgcagtactc ctgttttagc aagcttactt ggttgggttg ccaacacgca agacttaatt 3001 ctaggcgctg ttttgctgat ttcctataca gcaggttctg tcgcaccatt gatattagcg 3061 ggtacgttta cagcttcgat taagaaattg ctagaattgc ggcgttggtc aggttggatt 3121 aacccaataa gcggtgcatt attggtagga tttggtgtac tttctttaat ttctcggatt 3181 cccgtaggaa gtttttaggt aaaaattaga taaatgacta tagaaaattc agttcccaaa 3241 aaatctattt ggtcagcatt tgggcagtta ttgcgacaag agttattgcc cgtgctgaca 3301 gatttacgct tagcaattgt gatgctgtta atcattgcaa tcttcagcgt cactggtaca 3361 gttatagagc aaggtcaatc gttagcgttc taccaagcta actacccaga acacccagct 3421 ttatttggtt ttctcacctg gaaagttctc ttaacactgg gcttagacca tgtttatcga 3481 acttggtggt ttttagcatt actcatcttt tttgggacaa gcttaactgc ttgtactttt 3541 actcgtcagc taccagcttt aaaagctgcc cgtcgatgga aattttatga tgaaccccgt 3601 caatttcaaa agttagcttt gagtgcggaa ctagacaatg gttctgtgaa ttccctcact 3661 caacttttgc aaaatcagcg ctacaaagtt tttcaagaaa aagatgattc tctctacgcc 3721 cgcaaaggta tgatcggacg catcggacct attgtggttc atataggtat tgtgacaatt 3781 cttctgggat ctatttgggg agcaatgact ggttttgttg cccaggaaat ggtgaccagt 3841 ggtaatacct ttcaagtgaa aaatatcata gatgcggggc cttgggcaac agcactaccc 3901 aaagattggt ctgtgcgagt caatcgcttt tggattgact acacttcttc tggtggaatt 3961 gaccagtttt actcagacat gtctgttctg gacaatcaag aacaggaagt tgaccgcaag 4021 acgattttcg tcaacagtcc tctgcgttat cgcggcgtga ctttctacca aactgattgg 4081 ggaatttctg cagttcgagt tcgtgtgaac aaaagcccca tttttcaact accgatggcg 4141 caactaaaca ccaacggtaa aggacgcatc tggggaactt ggattcctac taaaccagat 4201 ttgagtgagg gtgtctcctt actagcaaaa gacttgcagg gaacggtgtt gatttacgat 4261 ggtactggca agttggttga cactgttcgc actggaatgt ccacatctgt taatggtgtg 4321 acgttgaaaa ttctggatgt ccttggtagt actggcttgc aaattaaagc agatccagga 4381 ataccagttg tttatacagg atttgccttg ctgatgctag gtgtggtgat gagttacttt 4441 tctcactccc aaatttgggc gttgcagaaa gatggtcgct tatatgtggg tggaaaaacg 4501 aatcgtgctc aggtcgcttt tgaacgggaa 
atgttggaga ttttggataa gttgagttta 4561 cctacacaag gtgaggagaa agcggttgcg atagaacctc attcggctta attatagaat 4621 aaactccttt atttattgtt tgcacaaata aaattcaacc cataattgtg atttactgag 4681 tattagtgca aataccataa aaactggagg gtgcaatgtc caagttgggt gctcgttggg 4741 gttggctctt aggaattgcg atcggtggtg tgtatatttg gagtgcacat tgcgcttctg 4801 ctcaaattac accggatgcc actctaccca ataattctat cgtcacaccc tctggtaata 4861 cgataaatat cactggcgga actcaagcgg gtagcaactt gttccacagc ttcaaagagt 4921 tttctattcc cactggtaac gaagctttct ttaacaatac ggttgacatt gcgaacatta 4981 ttacccgtgt cacaggtggg tcacgctctg atattgatgg tttcatccga gctaatggca 5041 cagccaactt attcctgata aatcccaatg gcattgtttt tggaccgaat gcacggttag 5101 atattaaagg ctcgtttatt ggtagtactg cgagcagtct aaagtttgcc gatggtacgg 5161 aatttagcgc gaccaattct ggtgcccccc ctctattaac gatcagcgtt ccactggggt 5221 tgcaatttgg ttcaaatcct aaagaaattc aggtgaccgg accagggcac gaattgtcat 5281 atcaagatga tatcatcgag tataataacg gctctaaaat caatcaaccc attcccgttc 5341 ttaacagtag ccaaccagga ttgcaagtga gtaaggggaa aaccttagca ctggtaggag 5401 ggaatgtgtc tattgagggt ggcattctca agtcaccaga aggacgtatt gaaatcggca 5461 gtgttggcag caatcaggct gtgagtctcg tgccgataga acaaggttgg aagctgggtt 5521 atgaagctgg gacgagtttt gcagacatcg ggttttcagg gaaatcgttt ctgagtgcca 5581 caggagatgg tggtggtgcg atcgcaatcg ctggtagaaa tatcaatatt acctctgagt 5641 caatagtacg agctgatacg ctggacaata gaaatgggca acaaattagt atccttgggg 5701 atgcgatcgt tgtcgataga tccaacatcg gcgctaacag ctacagttca gggaatggcg 5761 gacaagtaaa gctggaagcc aacaacatta cgtttaagaa taacagtggc gtaggaagtc 5821 aggcagcagc atcgggaaag gcgggagaca ttacccttat agctaagaat tcttttgttg 5881 tcagtaatca aagtgggcta ggcagcccta cctttggtgt aggtaatggc ggagtgataa 5941 acgttgaagc taattcggtg ctgcttgaaa aggaagctgg atttggtgcg agttcttttg 6001 gtaaaggcgg cgtcggtgaa atcaatatta aagttggcga tttggtaatg cgcgattctg 6061 gtattggtag tgactcctca tcggtaagta atggtggaaa aatcaacatc aacgcaaatt 6121 cttttcaact cgaaagaggg ttgggaactg taacttaaaa tctgtaaata tctggacata 6181 acttccggaa t // LOCUS 
NODE_4909_length_6008_cov_4.319335 6008 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 6008) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 6008) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6008 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(67..139) /locus_tag="DP116_26425" tRNA complement(67..139) /locus_tag="DP116_26425" /product="tRNA-Arg" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(104..106),aa:Arg,seq:ccg) gene complement(239..643) /locus_tag="DP116_26430" CDS complement(239..643) /locus_tag="DP116_26430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006632051.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_26430" /translation="MTLTFNPKIYSELLSKHQPRIIKSEEENEKFIAIVEELLSRPNL TPEEDAVLELLVRLIEDFEDKHYEINASTPHSRLLHLMEARSLEKADLVKILGSRDVT AEVVNGELEISKEQAEALGEFFQVNPSLFLSN" gene complement(627..926) /locus_tag="DP116_26435" /pseudo CDS complement(627..926) /locus_tag="DP116_26435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198241.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system HigB family toxin" gene 1494..1814 /locus_tag="DP116_26440" CDS 1494..1814 /locus_tag="DP116_26440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316594.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26440" /translation="MSIGRAILFIALVIPGLLVAGSSLYSFNTEYTELQKTERYVDKL AKDGRTNNRQLDLAYHRSFVHRMNAFSNGTWGFIGATIAAIGVHGMATTNEETIQDQR KTSK" gene complement(1934..3239) /gene="gid" /locus_tag="DP116_26445" /pseudo CDS complement(1934..3239) /gene="gid" /locus_tag="DP116_26445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008231905.1" /note="TrmFO; Gid; glucose-inhibited division protein; similar to GidA; the gene from Bacillus subtilis encodes a tRNA-methyltransferase that utilizes folate as the carbon donor and bound flavin as reductant; modifies tRNA at position 54 (uridine) of the T-psi loop to form a C5-methyluridine; frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="methylenetetrahydrofolate--tRNA-(uracil(54)- C(5))-methyltransferase (FADH(2)-oxidizing) TrmFO" assembly_gap 3003..3016 /estimated_length=14 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(3315..3404) /locus_tag="DP116_26450" CDS complement(3315..3404) /locus_tag="DP116_26450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011490165.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26450" /translation="MRLDADVVAWFKNRQSRGYQTLINFVLRE" gene 3633..5180 /locus_tag="DP116_26455" CDS 3633..5180 /locus_tag="DP116_26455" /inference="COORDINATES: protein motif:HMM:PF02493.18" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137779.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_26455" /translation="MLLGQILRQRYKILRLLGSGSFGVTYLAEDLDLPDHPLCVVKNL KQMQNPDELQLARGLFDREAKVLYRLGNECSQIPRLFAHFEENGEFYLVQEFIDGHDL SGEIFPGKRLTEAEVTQLLQEILEVLTFVHNKNIIHRDIKPQNLMRRKTDGKIVLIDF GAVKQISALVVNAQGQTNVSVIVGTYGYMPSEQLNGYPKLSSDVYAVGMLGIYALTGI RPQELPKNSETLEVIWRNRASVSPRLADVLDKMVRHNFNKRYQTASEALQALRPSSLP SPSPSPLPPVSPPPTSTPRELKLYQKILIGTGVGVGVLIGVGLLFMSFWRQPTPDLEK PVAYSGQPTPDSQKTVADSKQPTPDSPQTTYDSPQATYDVQLVCPETPIPPLPSTPGR KIGPATVYGFPKNSPLTGKGIIIFPDRKFGGYVRYDGELKNGVYSGCGNLVRGNGERY VGQFDKNSFQGIGKYIYKNGCQYIGEFKNNQFDGQGTWISKDGSPYSGIWHQGKLQGS NKVLSCQ" BASE COUNT 1797 a 1351 c 1159 g 1687 t 14 others ORIGIN 1 tttgagccaa gaaatttatt tcttggcgga cgaaaagtat ggtgcaagat ctgagtatac 61 aaaaaacgag cgcggaggga tttgaacccc cgacccacag gaccggaacc tgttgctcta 121 tccactgagc cacgcgccct taaataagca aacattaaaa tttgctattc taatccattg 181 tagcatagtc ttggaataaa gtgggcgatc tgcaaaggag cgcaagcgaa gctatcgcct 241 aatttgacaa aaacaagctc 
ggattaactt ggaaaaactc ccctaaagcc tcagcttgtt 301 ctttactgat ttctagctca ccattgacta cttcggctgt gacatccctc gaaccaagaa 361 tttttactaa atcagctttt tccagactcc tagcctccat caaatgaagg agtcttgagt 421 gtggcgttga ggcattaatc tcatagtgct tatcttcaaa atcctcaatt agtcttacca 481 atagttccaa cacagcatct tcttctggag tcaggttagg acgagaaagc agttcttcaa 541 caattgcgat aaatttctcg ttttcctctt cactttttat aattcgaggt tgatgtttag 601 acagcaattc gctgtaaatt tttggattaa aagtaagggt catttttcca gttatctttg 661 tcatattcgg catgagttag aacgtatttg atataggtta cctgcttcgc ataattgata 721 ctaacgatta agcgatattt attgccctta atattgaaaa ccgtaaaatt gccaacggct 781 tcagcttgag gataaactgc ctgaacttca acaagactaa cccactctgc ttgacgggca 841 gttgtgtacc agtcatcaag ggcttcacag gaatcagcgt gctttttgca ttattctcgc 901 agctttctcc gactaataac gtgcattaag tgataattat catatttttc taatttctca 961 gcctgagact attctaataa atggagggtc aagaaaatcg ctgaggtctt ggctttgtaa 1021 caaaatcgtc catatcaagt atagcgatcc tatttgattt gtgaaaattg aaagccacag 1081 atccccgaca acttttacga agtcggggat cttcggggat caagtttgct cacgtcagcg 1141 caaagcgtgc ccgcaaggct tatgatttag gattgcgata gatgcataat gtgcagcaaa 1201 atacaagaat aaatcgctgc tcatttttcc ctgatgttcc ggaaggtgag cttttatcag 1261 ttaccagtta acagttagca gttatcagtt attaattgtt cactgttcac tgttcactgt 1321 ttcctgttca ctgttcactg ttttatgctc ttgatattag atttcccagg gtattattcc 1381 cggaatttgg ttttacctca gggtgcacga tttttctaat ttgttcgcca gatactatca 1441 gcaaatgctt gccatgtcag cattaactca aatatattgc ctaacgaggt tgtatgagca 1501 taggcagagc aattctcttt attgccttgg ttattccagg tttattggta gcggggtcat 1561 cgctttactc gtttaacact gaatatacgg agttacaaaa aaccgaaagg tatgtggaca 1621 agttagctaa ggatggaaga accaataata gacagttgga tcttgcctat catcgtagtt 1681 ttgttcatcg catgaatgca ttcagcaatg ggacttgggg ctttattggt gcaacgatcg 1741 cagcgatcgg tgtacacgga atggctacaa ccaatgagga aacaattcaa gaccagcgaa 1801 agacatcaaa atagttatga gtggtgagtt ctgatgaagg attaggggac aagacaattc 1861 aaaattcaaa attcaaaatt caaaattcaa aattcagaat ttagaattag aagtctctcc 1921 ctgactcagc acttcactgc tgaacactgt ttatcaattc 
cttcaagtca gctaaagcgc 1981 gatcgcgata acgtccataa cgctcttgtt tatttttgat tttctcaccc aattctggta 2041 atatcccaaa attagggggc attggttgaa aatgcttagg cgaagcggaa ctgataaact 2101 caaacagtga ccccatcatc gtcgtcttgg gtagagtcaa aggttctttc cccaaagcca 2161 gccgcgctgc attcgttccc gccaaccaac caccggctgc ggctgcggta tatccttcag 2221 ttccaattaa ctgtccagca gctagcaacg ttggacggtt gataaattgc aaactaggct 2281 gcatcagcac aggcgcgttg agaaaggtgt tgcggtgcat gactcccaac cgcacgaact 2341 cagcattttc caaaccagga atcatttgga aaatacgctt ttgctcaccc caacgcagat 2401 ttgtttggaa tcctaccata ttccaaagtt gacctgcttt gtcttcttgc cgcaactgta 2461 tcacagcata agaacgttct ccagtccgac tatccgataa cccgactggc ttaagaggac 2521 cataacgcat tgtatcttct cctcgccgtg caagttcttc aattggaaga caagcttcaa 2581 aaaatttcgc cgtttcccgt tcaaaatcct tcaattccac ttgttcagca gtacgcagtt 2641 cttgccaaaa ctgcaaatac tgctctttgt tcataggaca gtttaaatat gcagcttcac 2701 ctttgtcgta acgagaggcg agaaaagcaa tatcgtggtt aatcgattct cccacaatga 2761 ttgggcttgc ggcatcaaaa aagctgaggt attccatccc tgtgaaacgc tgcaaatctt 2821 ctgccaattc cggacttgtt aaaggaccag ttgccaaaac gacaattcct tcaggaataa 2881 cacgtaatgc ctctcgacgc agttcaatca gaggatgctg agcgagagtt tgcgtcaagt 2941 cctgactaaa ttgtcctcta tcgactgcaa gcgccccacc agcaggaacc gcgtgttcat 3001 cannnnnnnn nnnnnnggcg tagttcttca tgcaaaagac cagcagcgcg atcgcttgcc 3061 attgccccaa aagaattgct acacaccaat tccgccaaat gttccgtgtg atgcgcacca 3121 ctgaagcgat ttgggcgcat ttcccagaga acaaccgata ttccaacaga ggctatttgc 3181 caagctgctt ctgtcccagc tagtccacct ccaataacat gaattgtttg tttttccata 3241 tttttattct gtagcttctg agctgccttt gttggtcatt caccccgagg agactcggac 3301 tggtgtttga gcatttactc ccgaagaacg aagttgatga gggtctgata gcctcgtgac 3361 tgtcggttct tgaaccaggc taccacatcg gcatcaagcc gcatattcac gggaactttc 3421 ggtacccctg gttttctaat tgtatcaacc ctaaagtgaa acagtattgt aaatattagc 3481 caccaatcaa aaccgagtta caaccacggg taattttatt aacttgtatg cttagaatca 3541 aaattagcca aaaatcgaac ttaatttcca cagcctttag gttgtatgtg gcaaaattta 3601 agataattca tcttgttctt ggttaaatag taatgctcct cggtcaaatt 
ctccgccagc 3661 gctacaaaat tctcagactc ttaggaagcg gtagctttgg agttacttat ttagctgaag 3721 atttagattt acccgatcat cctctgtgtg ttgtaaaaaa cctcaagcaa atgcaaaatc 3781 cagatgaatt gcaacttgcc agagggttat ttgacagaga agcaaaggtt ttatatcgct 3841 taggcaatga gtgcagtcaa attccccgac ttttcgccca ctttgaagaa aacggcgaat 3901 tttatctagt acaagaattt atagacgggc acgatttaag tggggaaatt tttcctggta 3961 agaggttaac tgaagctgaa gttacacagc tattacagga aattttagaa gttcttacat 4021 ttgttcacaa taaaaatatc attcaccgag acatcaaacc acaaaattta atgcggcgca 4081 agacagacgg caaaatagta ttgatagact ttggcgctgt caagcaaatt agcgcattag 4141 tagttaatgc tcaagggcaa acaaacgtct ccgttattgt tggtacttat ggctatatgc 4201 ccagcgaaca attgaatggg tatcccaagt taagcagcga tgtttatgca gtggggatgc 4261 tggggattta tgctttaact gggatcagac ctcaagaact gccaaaaaac tccgaaactt 4321 tggaagtcat ttggcgaaat cgggcttcgg ttagccctag actggcagat gtattagata 4381 aaatggtgcg ccataacttc aataaacgtt atcaaacagc atccgaggcg ttgcaagcgc 4441 taagaccatc ttcattaccc tcaccatcgc catcaccgtt accaccggta tcgccaccac 4501 caacctcaac accacgagaa ttaaaacttt accagaaaat attaattggt actggagttg 4561 gggtgggtgt gctaattgga gtgggattgt tatttatgag tttttggaga caacccacac 4621 ccgatctaga aaaacccgta gcttattcag gacaacccac acccgattca caaaaaaccg 4681 tcgctgattc caaacaaccc acacctgact cgccacaaac cacatatgat tcgccacaag 4741 ccacatatga cgtgcaactg gtttgcccag aaacaccaat accacctcta ccaagtacgc 4801 ccggtaggaa aattggtcca gccacagttt atggtttccc aaagaattca ccactgacag 4861 gcaaaggaat cattatcttt cccgatcgca agtttggtgg atatgttcga tatgatggag 4921 aactaaaaaa tggagtatat agcggttgtg gaaacttggt acgtggaaat ggtgagcgtt 4981 acgtaggtca gtttgataag aattcttttc aaggcattgg aaaatatatt tacaaaaatg 5041 gttgtcaata tataggagaa ttcaaaaaca accagttcga tggtcaaggt acatggatat 5101 ctaaagacgg ctcgccttat agtggaattt ggcaccaagg aaaattgcaa ggaagtaata 5161 aggttttaag ctgccaataa tttttaatct tatattatgt caggatacat tatccgagcc 5221 agtgttgaaa taatcacatt acttaggaaa aaattttcta tcaattagga aaagcataat 5281 catcaagcca aaggaatatt gtaatttttt ttttgcaaaa atgtatgata aaatcacaaa 
5341 ctttaaattg gacttattac gttatatttt tagtattttt tacttaaaag aaaactaaac 5401 acaagtagtt caagctaaaa aacatgagat tatgtagagt gtgttaagcg ctgccgcaat 5461 gcttataaac gcatcattcg gcgacctgca acactccaaa accgatttta caagctttat 5521 ttgctcatac ctttgggctg tctaccttct agcttcaagt accagcctgc gatcggtcta 5581 tttatgtctt cggtgtgtgt tgatatttag aggagtgata aatgaatcaa aaattactta 5641 caaagctgag attagttttg tctaaaactt ctttactagg tattactagt ttagctattt 5701 ctctacaacc atccttagcg caaccttcgg ctgttaatcg atcgccaaca acaccatcaa 5761 ccttaccacc tgcaaaaaat ccgatcagcc aaacacctcc agaaaatccc gtcagtccaa 5821 gcccaacacc ttcacaaaat cagacaccta gtccaacacc tccagaaaat cccctcagcc 5881 ctaccccaac accttcacaa aatcagatac ctggtccaac acctccagaa aaccccctca 5941 gcccaacccc aacgccaact ttcacaccat caccaacgcc gactttaaca ccatcaccaa 6001 ccccaacc // LOCUS NODE_4920_length_5986_cov_4.8585405986 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5986) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5986) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5986 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..536 /locus_tag="DP116_26460" CDS <1..536 /locus_tag="DP116_26460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019490348.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="putative thioesterase" /protein_id="PRJNA477356:DP116_26460" /translation="LLPSLNKPFVFFGHSMGAVVSFELARRLYQNYRLTPLHLYVSGH RAPQIPDPEPPIHNLPEPAFLNELRRYNGTPKEVLDNSELMQLLLPILRADFAVLENY VYTPSVPLTCPITAFGGLQDWKVSREDLAGWQQQTNGAFSIQMFPGDHFFVYSSQSLL LQQLCEELHSYARKLNV" gene complement(544..1188) /locus_tag="DP116_26465" CDS complement(544..1188) /locus_tag="DP116_26465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745336.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sigma-70 family RNA polymerase sigma factor" /protein_id="PRJNA477356:DP116_26465" /translation="MDEELLSCLVDEACQHLPGSPERQKLLTRVIRLTSSKLWRETTP YYQDALQQTWLYFCRNICEGTTGQRYDSSLGSVITWLNVYLKRRLQDFYRDHQRQQAR IVSANINQFRLGEASEIVNPVDNLAAEPDIPPILENVRSWAEKDADGELRQTYIQGHP QVNCQVLILKRMPPEVSWKELSAEFNLPVSTLSSFYQRQCLPRLRSFAEKEGFI" gene 1299..2033 /locus_tag="DP116_26470" CDS 1299..2033 /locus_tag="DP116_26470" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207974.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="HAD family hydrolase" /protein_id="PRJNA477356:DP116_26470" /translation="MATIQCRNVTFSNIQAILFDKNGTLEDSEADLRAFAQRGARIID AQIPGTGEPLLMAFGVNGDNLDPAGLIAVASRRETEIAAAAYIAETGRSWFESLAIAR QALDDAEKYLGNTPSPLFVGSLEVLKSLSETELKLGILSAATTQEVRAFVKHHQLGDY IQLSMGVDKGPSKPDPILFLQACQALGVQPSATLMVGDSVGDMQMARHAKAAGCIGIT WVGKAENVKGADVVINRLDEIQVVSE" gene complement(2137..4893) /locus_tag="DP116_26475" CDS complement(2137..4893) /locus_tag="DP116_26475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198507.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase M43" /protein_id="PRJNA477356:DP116_26475" /translation="MKNWITRLLIYITLLYNLLLGANDAKANQLSDVYVQQLNTLPGI EKLAMADNTHQENRKDDFRRFREIIKGADKLEGLFTLYRAKDSGEIYWEIKPEQLNKN YLGIVTLESGVGESGLYSGLPLQDFLFYFQRVNNNLHFVIRNVKFRTEAGQPEERSLA RSFSDSVLYAVNIVCIDPYTKNILIDINDLLMQDFPGLTSLLKYSLQTEYQLQESKSY LGDVNSLPLNLEIDSIYGFSSPEGADLITLPDSRALTLKVHYSFSQLRENNGYIPRFA DDRVGYFITAFQNFSGNIGKEPFVRYINRWHLEPSDPNAALSPPKKPIVFWIENAVPL EYRDAVREGVLMWNKAFEKVGFQNAIEVRQMPDDADWHPADVRYNTIRWFNSLDAGFA RGPMRVNPLTGEILDADIIVDANMVRSIQQNYHTLIEANSDDLYPMSIYAKSGVKMSQ HNATGIQKSIVENRESWSNDSDFYYGMESSFQAAMGALTLSLVQDATPSSDQMKKYVH QYLRSLIAHEVGHTLGLRHNFHGSTMLAPEELNNTEITHTKGLVGSVMDYVPVNIAPQ GVQQGDYFPAIIGPYDEWAIEYGYKRYSHLMLEGITPLTEKPFLEQIALASPQPELSY ATDEDIWDINPLANVWDMSSDVLVYSQWQMDNARMMWQRLDKRSPLKGESYSERRLLF NRIFQYYFRNAILLTKYIGGQSFKRHHALDDTHAHFVPVPLEKQRQILTNLHEYVFDQ DAFRFSSQLLNQLAPSRFSHWGNPIPVYRLDYPIHERILQLQSVILRSLLDGDRLNRL RDLELKTLPGESLTIPELFDTLQKDIWTEVFTSESHMSISSIRRSLQREHLNILLGMV LRTTYVPEDGRTIAWYELRELLKAIDVGVKQHGGKLDIYTLAHLEETSDRITKALNAQ LLSN" gene complement(5102..5800) /locus_tag="DP116_26480" CDS complement(5102..5800) /locus_tag="DP116_26480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318223.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="PRJNA477356:DP116_26480" /translation="MTNSLQSLFQAIAQTRDEQELRLHVMVRIGEYFVAQRWGLFFFD QLPLVETNLPGIVKLALSLEHNPVLRYVVEHHAPVHEELLLPPGAWSMICPRSDHAHV MAGPIVSHGRLVGGVGFTRVRGTSAFNTQELTELSALCLHLSTWLTTTSSQPTEFNSV NINRLTPREIQIAELVAQGRTNAEIGTALWITENSVKQALKRMFRKLEVSSRAELVAR LFSNTRLLASRNVL" BASE COUNT 1749 a 1233 c 1240 g 1764 t ORIGIN 1 ccctcttgcc aagcctcaat aaacctttcg tcttcttcgg tcacagcatg ggcgcagtgg 61 ttagctttga acttgcgcga cgactttatc aaaattatcg cttgactcca ttacatttat 121 atgtatctgg tcaccgcgct ccacaaattc ccgatccaga accacctatt cacaatttgc 181 cagaaccagc attcttaaat gaactacgcc gctataacgg tacgcccaaa gaagtactag 241 ataactccga actcatgcaa ctgcttcttc caatcctgcg ggcagatttt gctgttctgg 301 aaaattacgt ttacactccc tctgttccac taacttgtcc cattactgct ttcgggggct 361 tacaagattg gaaagtcagt cgtgaagatt tggcaggttg gcaacaacag actaatggtg 421 ctttttcaat acagatgttt cctggagatc atttctttgt gtactcatcc cagtcactcc 481 tgctacagca gctttgtgaa gaactgcata gctacgcccg caagctaaat gtgtaaaaaa 541 atgctatata aacccctctt tttcagcaaa acttcgtaaa cgaggaagac actgccgttg 601 ataaaaacta ctcaaagtgg aaacaggtaa attaaattcc gctgatagtt ccttccaact 661 cacttctggt ggcatacgtt tcaaaattaa cacttgacaa ttcacttgcg gatgtccttg 721 aatgtatgtc tggcgcagtt ctccatcagc gtctttctct gcccacgatc taacgttttc 781 taaaatagga ggaatatcag gttcagcagc aaggttatct acaggattaa caatttcact 841 tgcttcaccc aagcgaaact ggttgatatt agcagaaact attctggctt gttgtcgttg 901 atgatcacgg taaaagtctt gcagtctgcg ttttagataa acattcagcc aagttataac 961 actacctaaa gaagaatcgt acctctgacc tgttgttcct tcacaaatgt tgcgacaaaa 1021 atataaccaa gtttgttgca atgcatcctg atagtaagga gtcgtttccc tccagagttt 1081 gcttgaggtt aaacgaatga ctcgcgtcag cagtttttga cgttctggac ttccgggtag 1141 gtgttgacaa gcttcgtcaa ctaagcagga taacagttcc tcatccatag catttgtggc 1201 aaaacagcaa gcattatcag tattgctcag aaactgattc tatagtatta cccctcgtcg 1261 cgatattata tgctatgccc ttttttcaga ggtgtctttt ggcaactatt cagtgtagaa 1321 atgtgacgtt ttccaatatc caagcgattc 
tgtttgacaa aaatggtacg ttagaagact 1381 cagaagctga tttacgtgct ttcgcacaaa gaggagcacg gataattgat gcccaaattc 1441 ctggcactgg ggaaccattg ttaatggcat ttggtgttaa tggtgataac ctcgatccag 1501 cgggtttaat cgcggtggcg agtcgtcgcg aaacagaaat tgctgcagca gcatatattg 1561 cggaaactgg acggagttgg tttgagtcgt tggcgatcgc ccgtcaagct ttggacgacg 1621 ctgaaaaata ccttggcaat acaccttcac ctttgtttgt aggtagtttg gaagtgctaa 1681 agtctttatc ggaaactgaa ttgaaattag gaattctttc agcagcgaca acccaagaag 1741 tccgtgcttt tgtcaagcat catcaattag gtgattacat tcaattatct atgggagtcg 1801 ataaaggacc aagtaaacca gatccaattt tatttttgca agcatgccaa gctttaggag 1861 tccaaccaag tgctacgctc atggtaggag attctgtggg tgatatgcaa atggcgcgtc 1921 atgctaaagc tgctggttgc attggtatta cttgggtggg gaaggcggag aatgtcaaag 1981 gtgcggatgt ggtgataaat cgactggatg aaattcaagt tgttagtgaa tgaatcatag 2041 ccattctgtt gttgagcgat aaataattct ttgctcaaca actttgctaa aaactagcaa 2101 gcgcttttat gtattgaatt ctaaaatcaa acctcatcaa ttagaaagca attgtgcgtt 2161 taaagctttg gtaatgcgat cgctcgtttc ttccaaatga gccagcgtgt aaatatccaa 2221 cttaccaccg tgttgtttaa caccaacatc aatagctttc agcagttcac gcaattcata 2281 ccaagcgatt gtacgaccat cttcagggac ataagttgta cgcaacacca tccctagcaa 2341 aatattcaga tgttcccgtt gcagggaacg gcggatgcta gaaattgaca tatgtgattc 2401 ggaagtgaaa acttctgtcc aaatgtcttt ttgcaatgtg tcaaacagtt ctggtattgt 2461 cagggattcc ccaggcaaag tttttaattc tagatcacgt aaacgattta agcgatcgcc 2521 atctaataaa cttcgcaata tcacactttg aagctgcaaa atacgctcat gaattggata 2581 atcgagacgg taaacaggta tagggttacc ccagtgtgaa aagcgcgatg gtgctaattg 2641 attcagcaat tgtgatgaaa aacgaaaagc atcctgatca aatacatatt catgtaaatt 2701 tgtcaatatc tgacgttgtt tttctagagg aactggtaca aagtgagcat gagtatcatc 2761 taaagcatgg tgacgcttaa aagactgccc accaatgtat ttagtgagca aaatggcatt 2821 acggaaatag tattgaaata ttctgttaaa taatagacgt cgctcactat aactttctcc 2881 tttcaaggga gaacgcttat ctagacgctg ccacatcatg cgggcattat ccatttgcca 2941 ttgagaataa actaacacat cactactcat atcccagaca tttgccagag gattgatatc 3001 ccaaatatct tcatctgtgg cgtaagataa ttctggttga 
ggcgacgcta aagcaatttg 3061 ctctaaaaaa ggcttttcag tcaaaggagt gattccctca agcatcagat gcgagtacct 3121 tttgtaaccg tactcaattg cccattcgtc gtagggacca ataatcgcgg gaaaataatc 3181 gccttgttgt actccttgtg gtgcgatgtt gacaggtaca taatccatca ctgaacctac 3241 caaacctttg gtgtgagtga tttcagtatt gtttaattcc tcgggtgcta acattgtgct 3301 accgtgaaag ttgtggcgta aaccaagggt gtgtccaact tcatgagcga tcagagaacg 3361 taaatattga tgtacgtact ttttcatttg gtcactactt ggtgtagcat cttgcacgag 3421 tgagagcgtt aacgctccca ttgcggcttg aaaagatgac tccataccgt agtagaagtc 3481 agaatcatta gaccaagatt ctctgttttc aactattgat ttttgaattc cagttgcatt 3541 gtgctggctc attttgactc cagactttgc atagatactc atcgggtaga ggtcatctga 3601 atttgcttct atcaatgtgt gataattttg ctgaattgag cgcaccatat tggcatcgac 3661 aatgatgtct gcatctaaga tttccccagt gagtgggtta acgcgcattg gtcctctggc 3721 aaaacctgca tctagagaat taaaccagcg aattgtgttg taacgcacat ctgcggggtg 3781 ccaatcagca tcatctggca tttggcggac ttcaatggcg ttttgaaatc ctactttttc 3841 aaatgctttg ttccacatca aaacgccttc acgaactgca tcgcggtatt ctaacggtac 3901 ggcattttca atccaaaaca caatcggttt tttaggtgga gacaaagcag catttggatc 3961 agatggttca agatgccaac gattgatgta acgtacaaat ggttctttgc caatattgcc 4021 agagaagttc tggaaggcag taataaaata tccaactctg tcgtcagcaa atctgggaat 4081 atagccgttg ttttctctga gttgggaaaa actatagtgt actttgagcg tgagtgccct 4141 gctgtcaggt aaggtgatta aatctgctcc ttctggtgat gaaaaaccat aaatcgaatc 4201 aatctctaga tttaaaggaa ggctgttgac atcacccaaa tatgacttac tttcttgtaa 4261 ctgatattca gtttgcaaag agtattttaa tagagaagtt aatccaggaa agtcctgcat 4321 gagcaggtca tttatatcaa tcagaatatt ttttgtgtat gggtcgatac aaactatatt 4381 aactgcataa agaacagagt cgctaaatga acgagcaagc gatcgctctt ctggttgacc 4441 cgcttctgtg cggaacttaa cattacgaat cacaaagtgc aaattattgt taactcgctg 4501 gaagtagaac agaaaatctt gtaacgggag tccactataa agtccacttt ccccaacgcc 4561 tgactccaaa gttactatac ccaagtaatt tttgttgagt tgttctggct taatttccca 4621 atagatttcg ccagaatctt tagcgcgata aagcgtaaaa agtccctcta acttatctgc 4681 acctttgatt atctcgcgaa atcttcggaa atcatccttt ctgttctctt 
gatgcgtatt 4741 gtcagccatt gctaactttt caattcccgg tagtgtattt aattgctgta cgtaaacatc 4801 tgaaagttgg ttagcttttg cgtcattagc tcctagcaac aggttgtaca ataacgttat 4861 atatattaat aatcttgtta tccagttttt catacttcag cttcccgtgg caaagcagat 4921 tttatctctg catccccctt aagtattaat agattagctg tttttatatt tttaataaag 4981 ttatgtattt taccaaagtt actagaaaag tatgagttaa tgactgacta actgacactt 5041 aagtcagaaa gtagcgcgtg cgtaagtcct aaatttttta ttcttttata gaacgtaggc 5101 gttagagcac atttctagac gccaaaagcc tggtgttgga aaaaagccgt gctactaact 5161 cagcacgaga agaaacttca agcttgcgaa acattcgttt taaggcttgc ttaacagaat 5221 tttctgtaat ccaaagggct gtaccgattt cagcatttgt tcgcccttgt gcgactagtt 5281 cggcaatttg aatttcacgc ggtgttaagc gattaatatt cacagaattg aattcagttg 5341 gttgtgacga tgttgttgtt aaccaggttg acagatgaag gcaaagggca ctcagttcag 5401 taagttcttg agtattaaac gcagaagtac cgcgaacgcg agtaaagcca actccgccca 5461 caaggcgacc atgactaaca attggtccag ccattacatg cgcgtgatca gaacgcggac 5521 aaatcatcga ccacgcccca ggaggtaata ataattcttc gtgaactggg gcatgatgct 5581 caactacgta acgcaagacg ggattatgtt ctaaagataa agctaattta acaatgcccg 5641 gaagatttgt ttcgacaagg ggaagctgat caaagaaaaa aagcccccaa cgttgagcaa 5701 caaaatactc accaattcta accatgacgt gcagtcgtag ttcctgctca tcacgagttt 5761 gagcgatcgc ttgaaataaa gactgcaaag aatttgtcat ctgaatagtt acagaaaaag 5821 taattgtagg ttgggtagag cgtaagcgaa actcaacatt gggcgtattt gttgggtttc 5881 gttccaacgg cccaacctac gcctactacg cctactaatt tgccatctat aagtcaatac 5941 ggttcagtta aggctgaaaa cgttattgag gacacgaaaa gaagga // LOCUS NODE_4929_length_5968_cov_5.1195675968 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5968) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5968) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5968 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 470..2173 /locus_tag="DP116_26485" CDS 470..2173 
/locus_tag="DP116_26485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309196.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_26485" /translation="MDVYCSKQHVNNEKNRFCTQCGEPLPLAVGQVVDNRYRIIRHLG QGGFGRTYLAEDLNQSQQQCVLKEFAPQVEEQQDLEKAKELFEREASVLKKLQHPQIP RFHASLQVQLSNKDFFFLVQDYVEGENYWDLLESRTNQGQTFSEQEVVKLLQQMLPIL SYIHSVNVVHRDLSPDNIILRQSDKLPVLIDFGAVKQLPASKGFWFTQLGGIRTILGK KGYAPEEQLRQGKAFPSSDLYSLAVTALVLLTGQEPQKLYDSYQGHWRWGEQIKVSPK IEAVLKKMVEYKPSDRYQKADDVLKDLQFYNSATKPINTHMTKLKTMVVAPGHKRVKT LVGKLHNKTQMVAQALPLPVWLRPFAVSLVGTSVIVLTLAGTWAVVNAVVQAVSSISI PAISSPKTSRDSDSGEKPSVSNQENRGNQIIRRRQDLEIPEVFFTKMVDQNFYTKNPQ AKGRTLTGSSEDTALRKEWFAIAQDVLNQLERANLSTSARRKLGSYSARDYDNWRRQA QAGQLGNYTIKQLNEDTNKKFDRLFPGQRRDNEKLDKQTLGQIWYALAADQISKVESA N" gene 2240..4489 /locus_tag="DP116_26490" CDS 2240..4489 /locus_tag="DP116_26490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317545.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_26490" /translation="MNPVYCSKGHENPTGSRFCLNCGEKLDNPVNQSIQSGQTLSERY FIVRQLGQGGFGRTYLAEDINRFREPCVLKEFSPQVQTPYVLQKAEELFQREASVLYK LQHPQIPRFRELFRINLAGKEYLFLVQDYVEGQTYRSLLDTRKQQGLRFTEAEVRLLM QQILPVLEYIHSIGVIHRDISPDNLILRSTDQLPILIDFGGVKQVAATVVSQYYQPGA AGGGLPTLLGKVGYSPPEQMQTGLVEPHSDLYALAATVLVLLTGKQPQEFIDEYTLTW RWRREVNLSQSFGMVLDKMLSPKPKERYQSADQVLQALSPSPVSYPPPQQPVPISPPP QTQQTLAVSPTPPIPPTRQESSPYEKPFSTPTVSWWTPGKIIAIAVVIVAAVSVGIWG ANQLFDSSSLSPNDPKLSLQEQQRKEKLDQRRQQLNIDKKFFTSLVNQVFWEKNPTVS NRTLTDKPEDEQLRAQWDSIAEELLAKLTVLSSDSRQKLGSFGQAERDRAKVEVNQIN VGSRSLYDLGDAAFYQQFPEQRGKNFIDQPIGQVWQGFVSDKLNAILSGSAFKKIVFD KEASGSRVSGTLKPGEGKVFIAGLKEEQLMKVKLNANSKILLSVYSPSGKMRFLQDST QRSLSVKLPESGFYEYVIVATGSNTSDYTLAIIAENPAPPPSPTPTPIETVTPTPTPT FSPTPTETPTPTTPTETPTPTPTVTSTPTSTPTSTPTPTPTSTPTPTPTPTSTPTLAP TPTPTPTTP" gene 4900..5376 /locus_tag="DP116_26495" CDS 4900..5376 /locus_tag="DP116_26495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008099557.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_26495" /translation="MVENKFLIRLAKPDDLDVLVEFNRAMARETEDKDLSLDILTAGV ETLLKNPILGFYVVAENQNKVTGSLMVTTEWSDWRNGIFWWIQSVYVHPEFRRQGIYR RLYEFVKEKAINQQQELKICGFRLYVEHQNTVAQKTYEALGMKDTGYMIFEELELG" assembly_gap 5487..5496 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(5744..5965) /locus_tag="DP116_26500" CDS complement(5744..5965) /locus_tag="DP116_26500" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26500" /translation="MRVDLVKPVASTVSSTSTTQKEASQQPTVKKIIPAQGWIFNEKG EVLLVGYDPTKTGPQRSSPAPTSNCAAVK" BASE COUNT 1784 a 1339 c 1273 g 1562 t 10 others ORIGIN 1 ctttgcgacg ccagtcgcct caagtcgggg gacccgccca cggcgctggc tcccctttgc 61 ggtatgcgct aaggcgcacg ctacgcgttc gctcttagcg tgcgcttgcg cttacgtttt 121 tcatataatt aggcgcatct atggcacttt acgggcaagc tatcatacag aattagtata 181 agataagacc tcaccctgcc ctatcgggca tccctctcct tataaaggag agggaaggat 241 ttttgtgacg caaaaagcga gggtcagatt ttgagcgaga tgtgtgtaca ccgtagcttt 301 attaagaaga gggatttcag gtgaagtttt atatttaatt tgacgcactg acttgttctt 361 tctcatgctc ccgagcaaag gagtgcaaaa tatatagaga gagtatgcgt aattgtgact 421 ttccgttttg tagcaactta taaaaacctt ctattcatcc ttcagcatca tggatgttta 481 ttgcagcaaa caacacgtaa acaatgagaa gaaccgcttt tgcactcagt gtggagagcc 541 gttacctctg gcggtaggac aagttgtcga taaccgctat cgaattatac gtcatttggg 601 acagggtggc tttggacgca cttatttggc tgaggattta aatcaatctc aacaacagtg 661 cgtgctgaag gagtttgcac cccaagttga agaacaacaa gatttagaaa aagcgaaaga 721 actgtttgaa cgagaagcga gtgtgctcaa aaaactacag cacccacaaa ttccacgttt 781 ccacgcctcg ctacaagtgc agttgagtaa caaagatttt ttctttttgg tgcaagacta 841 tgtagagggt gagaactact gggatttgtt agaaagtcgc acaaatcagg gacaaacttt 901 tagtgagcag gaagttgtca aactgctgca acaaatgttg cctattttgt cttacattca 961 ctctgttaat gttgttcacc gcgatctttc tcctgataat atcattttac ggcagagtga 1021 taaattacct gtgctgattg actttggtgc agtgaagcaa ttgccagctt ctaagggttt 1081 ttggtttacg caactcggtg gaattcggac aattttggga aagaagggct acgcaccaga 1141 ggaacaatta cgtcagggaa aagcttttcc cagtagcgat ttgtactctt tagcagtcac 1201 ggcgctcgtg ttattaactg gtcaagaacc acaaaaactg tatgattcct atcagggaca 1261 ttggcgttgg ggagaacaaa taaaagtcag ccctaaaata gaagctgtct taaaaaagat 1321 ggtggaatat aaaccaagcg atcgctacca aaaagctgat gatgttctca aagatttaca 1381 gttttataat tctgctacca aacctataaa tactcatatg actaagctga agacgatggt 1441 tgttgctcct gggcacaaac gtgtcaaaac tcttgttggt aagctgcata 
acaaaactca 1501 aatggttgct caggcgttac ctctacctgt ttggcttcgt ccttttgctg tcagtcttgt 1561 gggtacaagt gtgatagttt tgacattagc agggacttgg gcagttgtca atgctgttgt 1621 tcaggctgta tcatctattt ctataccagc aatttcatca ccgaaaactt cacgagattc 1681 agattctggt gaaaaaccga gtgttagcaa ccaagaaaat cgcggtaacc aaattatccg 1741 tcgccgtcag gatctagaga taccagaagt cttttttaca aaaatggtcg atcaaaattt 1801 ttatactaaa aatccgcaag caaaaggacg tactctcacg ggtagctcag aagatacggc 1861 tttacgaaag gaatggtttg cgatcgcaca agatgtgtta aatcaactag aacgagctaa 1921 tctcagcaca tcagcgcgtc ggaaactcgg aagctatagt gcgcgagatt acgataattg 1981 gaggcgacaa gcacaagcag gacagttggg taactatacc atcaagcaac tgaacgagga 2041 cacaaacaag aaatttgata gattgtttcc gggacagcga cgcgacaatg agaaacttga 2101 taaacagaca cttggtcaaa tatggtatgc acttgctgct gaccaaatta gtaaagtaga 2161 atctgcgaat taacctaaaa gtcaaaagtc aaaagttaaa agtacttgac taatgactaa 2221 tgactaatga ctaatgacta tgaatcctgt ttattgctct aaaggacatg aaaatccaac 2281 tggtagccgc ttttgtctaa actgcggcga aaagttggat aacccagtga atcaaagtat 2341 tcaaagcggg caaactttga gcgaacgtta ctttattgtc cgtcaattag gacagggagg 2401 ttttggacgt acttatttag ctgaagatat aaaccgcttt cgcgaacctt gtgttttaaa 2461 agaattttcc cctcaagttc aaacaccata cgttttacaa aaagctgaag aactgtttca 2521 gcgagaagcc agcgttctct acaagctaca acatccccaa atccctcgct tccgggaact 2581 tttccgcatt aacctagctg gtaaggaata cctgtttctg gtacaagact atgtagaagg 2641 gcaaacttac cgttctttgt tagatactag gaagcagcaa ggtttacgat ttacagaggc 2701 ggaagttcgt ttgctgatgc agcaaattct gcctgttttg gaatatatcc attctatagg 2761 agtgattcac cgtgatatct ctcctgacaa cttgatttta cgcagcactg accaactccc 2821 gatattaatc gattttggtg gagttaaaca agtcgctgca acagtcgtct cgcaatatta 2881 tcaacctggt gctgctggtg gtggacttcc aacattattg ggtaaggtag gatactctcc 2941 cccagaacag atgcagactg ggttagtaga gcctcatagt gatttgtatg ctttagccgc 3001 aacagtactg gttttgctga caggtaaaca accccaagaa ttcatagatg aatataccct 3061 cacctggcga tggcgacgag aagttaacct cagccagagt ttcgggatgg tgttggataa 3121 aatgctatcg ccaaaaccaa aagaacgcta tcaaagcgct gatcaagttc tgcaagcact 
3181 aagtccatca cctgtcagct atcctcctcc acagcaacct gttccgattt caccaccacc 3241 acaaacgcaa caaactctag ctgtatcccc aactcccccc attcctccca ctcgacaaga 3301 gtcctcacct tacgagaagc ccttctctac tcccaccgtc agctggtgga cacctggtaa 3361 aatcattgcg atcgcggtgg ttatagttgc tgctgttagt gtgggtattt ggggagcaaa 3421 tcagttattt gattcttcgt ccctgtcgcc taacgatcca aaactttcct tacaagaaca 3481 gcagcgtaag gaaaaattag atcaacgccg tcaacaactg aatattgaca agaaattttt 3541 tactagctta gtcaatcaag tcttttggga gaaaaatcca actgttagca accgtacctt 3601 gacagataag cccgaagatg aacaattacg ggcacaatgg gacagcatag cagaggaatt 3661 gctggcaaag ctgacggtat tgagttcaga ctcacgtcaa aaattgggta gttttggaca 3721 agcagaacgc gatcgcgcaa aagtcgaagt taaccaaatc aacgttggta gtcgttctct 3781 gtatgattta ggcgatgcag cattttacca gcaatttcca gaacaacgcg gcaagaattt 3841 tatcgatcag cccattggac aagtctggca aggttttgtc agcgacaaac tcaatgctat 3901 cctttctggc agtgctttca agaaaatagt ttttgacaaa gaagctagtg gtagcagagt 3961 tagtggtacc ctcaaaccag gggaaggtaa agtttttatc gccggactta aagaagaaca 4021 attgatgaaa gtcaagctca atgcaaattc taaaattttg ctttcagtgt attcccctag 4081 tggaaaaatg agatttttgc aagattcaac acaacgtagt ttatccgtta aacttccaga 4141 aagtggattt tatgaatatg tgattgttgc gacggggtca aatacatcag attatacact 4201 tgcaattatc gcagagaatc ctgctccacc tccatcacct acgccaactc ctatagaaac 4261 cgttacacca acaccgacac cgacattctc acccacacct acagagacac ccacacccac 4321 cacgcctaca gagacaccga cacctacgcc tacagtaacg tctacaccga catccacacc 4381 gacatctacg cctacaccaa caccaacatc tacgcctaca ccgacaccga caccaacatc 4441 cacacccaca cttgcaccta cgcctacacc gactccgaca acaccataag caatacaaaa 4501 tgtgagagag tttttaaaaa gttttttgat accaaaaaac atggaaaatc ccccgaaatc 4561 ccccttaaaa agggggactt ttaccacgcc tttttaaggg ggcaggggga tcaagtaaaa 4621 tatttgatac ttcttatatc atgtttggat gaacacttat aaaaaacgaa aaaccctccg 4681 ggtatctcct gcggagacgc tacgcgaacg ccagtcgcct gcggagggaa accctcccgc 4741 agcgctggtc tcacctcacc ccgccctccg ggcacccctc acgccaggtg ctacaacggg 4801 gggaaccccc gcaacgcact ggctctcctt attaaggaga ggggggtgaa gtatttttgt 4861 
tgtaagtaat cacccaaatt tgacattatc taaacaaaaa tggtggaaaa caagttttta 4921 attcgtttag caaaacctga tgatttagac gtcttggttg aattcaatcg agcaatggca 4981 cgtgaaaccg aagacaaaga cctttcttta gatattctca ctgctggtgt agaaacatta 5041 ttgaaaaatc cgattttggg tttttatgtt gttgcagaaa atcagaataa agtgactggt 5101 tctctgatgg taacaacgga gtggagtgac tggcgaaatg ggatattttg gtggatacaa 5161 agtgtctacg ttcacccaga atttcgtcgt caagggattt atcgaagact ctatgaattt 5221 gtcaaagaaa aagctataaa tcaacaacag gaactaaaaa tatgtggttt tagactttat 5281 gtagaacatc aaaatactgt tgcacaaaaa acttatgaag ccttaggtat gaaagacaca 5341 ggctatatga tcttcgagga acttgagttg ggatgacaag atgctcgctt tggttagcag 5401 ttttcatagc aggttgcaaa aacctcaccc cggttttgtc tcttgccaaa acctcccctc 5461 tccttactaa ggagaggggt gcccgtnnnn nnnnnnggtt ttgtctcttg ccaaaacctc 5521 ccctctcctt actaaggaga ggggtgcccg taagggcggg gtgaggtaac acttatgaat 5581 gcaacacgcg tatgacccca ccccggtttt gtctcttgcc aaaacctccc ctctccttac 5641 taaggagagg ggtgcccgtt agggcggggt gaggtcaaat caacgtggaa tcaaagcttg 5701 aacgttaaat tgacaccaat ggcacagtac ccaacctacg tatctatttc accgcagcac 5761 aattactagt aggtgctggt gaggatcgct gtggaccagt cttagtagga tcataaccaa 5821 ccagcagcac ctcacctttc tcattaaata tccaaccctg tgcaggtatt atctttttga 5881 cagttggctg ttgagatgcc tccttttgtg ttgtacttgt tgaacttacc gtactagcaa 5941 caggcttaac caaatcaacc cgcacatt // LOCUS NODE_4951_length_5935_cov_4.2510205935 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5935) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5935) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5935 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(101..385) /locus_tag="DP116_26505" CDS complement(101..385) /locus_tag="DP116_26505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007353501.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26505" /translation="MQAYKLKGKVDTAGNLVITEPVQIPAGDVEVIVLQAVETAANST VPETESQPETPKRKSRVKAFEGLFENAPPVPPDFDPDQARWEALKEKHNL" gene complement(396..4130) /locus_tag="DP116_26510" /pseudo CDS complement(396..4130) /locus_tag="DP116_26510" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114875.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(4258..4806) /locus_tag="DP116_26515" CDS complement(4258..4806) /locus_tag="DP116_26515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316746.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SH3 domain-containing protein" /protein_id="PRJNA477356:DP116_26515" /translation="MVFVNILKYTLGILLAIAILAGGGFATALYVINRTSMPPAKPIF ANDSPAVRGEDPKVTAAKAAKATSAIQAKAKASAKPTPKPTPSAKSLPPGAYHARITW KQGLILRGEPKQEAERIGGVGFNSRVIVLQESQDKVWQKIRLEGGDEKEAGKQEGWVK AGNTQKVDENDSQQTEQKKEAQ" gene complement(4977..>5935) /locus_tag="DP116_26520" CDS complement(4977..>5935) /locus_tag="DP116_26520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316747.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="AAA family ATPase" /protein_id="PRJNA477356:DP116_26520" /translation="ELQPEDVDLVLEEKRQTIRQTQILDFYPATEQISDIGGLDNLKD WLLRRGGAFTERARQYGLPHPRGLMLVGIQGTGKSLTAKAIAHHWHLPLLRLDVGRLF GGLVGESESRTRQMIQVAEALAPCVLWIDEIDKAFSGLGSKGDAGTTSRVFGTFITWL AEKTSPVFVVATANDIQALPPEMLRKGRLDEIFFVGLPSQEERKAIFNVHLSRLRAHN LKDYDIDRLAYETPDFSGAEIEQTLIEAMHIGFSQNRDFTTDDILESASQIIPLARTA VEQIQKLQEWAAAGRARLASKHNPLSDSPYMPKGFSGGRPSS" BASE COUNT 1536 a 1522 c 1156 g 1721 t ORIGIN 1 gagggtttcc ctccgtaggc gactgcgaac ccgaagggct tcccgcaggg tacttaagtt 61 caaggcgtac tatcggtgaa gcacgagatc tccaagactt tcacaagtta tgtttctcct 121 ttaaagcttc ccatcttgcc tgatctggat caaagtcagg cggaacggga ggagcatttt 181 caaataaacc ctcaaatgct ttcactcttg actttctctt aggtgtttct ggttgtgact 241 cggtttctgg aactgtcgaa ttagcagccg tttcaacagc ctgcaagaca atcacttcca 301 catctccggc tgggatttga acgggttcgg taatcactaa gttacctgct gtatcgactt 361 tccccttgag tttgtaagct tgcataatat ttctcctatg actctgcttc tagagtatcc 421 cgcgctgccg aaattgcctc gcgcaacgat tctggacact ccgcacccat ttctttccgc 481 cagattgcat caaattggct ttctcctaat tggttgagca ttcgtcctaa attattgata 541 tccaagccaa ttcgctcttg gttatgctct aagtcaattt ctaaagtttt aatgtaaatt 601 ttcagcgcct cagtccaatt ttcttcggtt tataacaatc taccccaatt tttcaatgtt 661 cttgaagctg tataccaatc gccgaattgc ccaaaaattt caaaggctct aaagtaatag 721 gtgtgcgcct cctcaaattg ccgttgctcc tcggctacca tgcccagttg gttatagaca 781 gttgcagcgt tgtacaaatc ccccgcattt tctttaatct cgacggcttt gaggtaatag 841 gcacgtgcct cctcaaattg ccgttgctcc tcggctacga tacctagttg gtgatagaca 901 gttgcagcgt tgtacaaatc ttctgcatct tccaaaattt tcagagcttt gaggtaatag 961 gcgcgcgcct cttcaaactg ccgttgcaga tacgcaacat tacccagttg gtgatattct 1021 ataggagcgt tgtacaaatc ccttgcatct tctctaatct tcaaagctct gaggtaatag 1081 gcacgtgcct cctcaaactg ctgttgctcc tgggctacca tacccagttg gtgatactga 1141 cccgcaacgt tgtaaaaatc ccgcacatct tcataaatct tcagagcttt gaggtaatag 1201 gcgcgcgcct ccgcaaactg gcgttgcaga tacgcaacat tacccagttg gtgatattct 1261 ctgggagcgt tgtacaaatc ctttccatct 
tctgtaatct tcaaagcttt gaggtaatag 1321 gcgcgcgcct ccgcaaactg ccgttgcaga cacgcaacat tacccagttg gtgatataca 1381 gtagcaattt tgtcattcac tgaggagtca tttaaagccg ttagttcatc cagaatttct 1441 tgataaattt tctcagccgt ttttaagtca gccttttgaa gtgcttcatt tccatctata 1501 ccccgcaaat acacccagaa attaaaacca cttttaccct ttaccttcgc atctgccaag 1561 tggttgccaa tttggttcag cactcgtttt cgcagtgact taaattctgc ttgacgattc 1621 caccgctcat acacgtctcc caaagcttgc agaattgtct gtgcttcagc ccattcctcc 1681 tgctgttcag caagtcgcaa gttttgcaac agattcggtt cttcaacttg cagcacaaag 1741 cttgccagtt ctgcgttgct gattagttta ttacgagagt tatccgccaa catcgcgtaa 1801 aaaaccagca actttttttc cagttcactg atcacctcct gcgaatttat tgagtttaac 1861 tgttggcgta aataccacgg taatgctggg tgaattttgt agatagtatt acccaaatgt 1921 tccagaatgc ctgctgctac agcttcatta agtaacctga gccaatcagc ttcttgtaag 1981 ttttcctcaa acaccgcctg atatgcttgt ccatattcgt catcaggatt tgatgataaa 2041 tgattgagcc aatgaacacc aactcgctca cagaaaaaac ccaaaaacgg caaatgccga 2101 cgcgcctgtt ccgataactt agaaaacgaa taatccaacg acagcgtgag agatttatct 2161 cgtccttctt cctctgcacc agaaaaagta tctagtcctt tgcgcaaagc ctcaatcaac 2221 tgtactgcac tctgcgtctt taaatgaggc aacaccactc gcaaagacag cggatgtccc 2281 cccaacagtt tcaacaaatc caaatactct gctggcagat tcttcctatc tactcccgcc 2341 gtttccaaaa tcttcgccgc taactcctca acatcctgct tttgcagtcc ccgcaaattc 2401 accagattgt acgcacaatc gagccaagac tcctcgcggc gactggtaat caatacccaa 2461 gatttaccgc cgcgcaactc tttgagaaat cgctgcaaat tctctcgctc atctgcactc 2521 aacaacggct catttcctgt aggaaatccc gcgactggtt caaagttatc ccaaatcagc 2581 agacaaggat ttttcttcag atagttgagt actccttgaa cttgtttctc ttcctgatat 2641 tgggaaaatt tatctcccca aatcttccta cccacttgat tgacgacacg acttagcgtt 2701 gcgccatgtt caaaagaagt aaagaatatt gtagagcgtc cttgagtctc atccagccag 2761 cgcccaaaac cacaagctaa ctcagttttg ccgacgccac ccatgccttg caacaacacg 2821 atattatttt ggcgaaacgc ccgttctaaa cgcaatatat cgtaatctcg cccaataaag 2881 ccgtagcgtc cttcctgtgg aaactccacc aagttggtca caacttcctc gaagtaatct 2941 tcgagatcat cttctgttgg agtaaagggc ttgtaggatt 
cctgctgata caacactggt 3001 actaaccaat cttgcagcgg cttatcgcct ttgggactag ggcgcagttg cttattcaca 3061 agttccttgc gtcccaaagc taccgcgtac tccagactct ctccccgtac taactcgcca 3121 tacaaccgcc ccatgaaatg tttagcagct tgagcataca cagtataagc cattgctacc 3181 acacctttcg cccccaagga taccaagcgt gtcgccactg aagaaaattt ttcttctccc 3241 tcctgagcag atttgcaagc gtttaagata aaaatcggca cgcgacaact ttctaaattc 3301 tgagcgattt gtgctgctgt gatgatttgc ggcgaaccat cattcgcttc aaacactaaa 3361 actccttgcc cagctgcacc caaggtgtgc tgaaatcctg tgctattggg gtcgaaattt 3421 ccatgtccat caaaatgaac gatgtgataa aaccccttat tggcgttcag ttcccgttca 3481 aactgctcaa agctgggcgg acgcaggact tttatattga cttttttctt aatatctgat 3541 aaggctgtta ggagaggacg ggcaatagtt ttgaaggcaa tatcttgttc accgtaggga 3601 cgagcaatca cgagcaaaat attcagcttg tcgctgggta actcacccat ttctgctcgc 3661 acagcgtatt cgctcagact acgagacatt cctgcaagtg aaggtgcgag aaattggcgt 3721 tagcgaagcg gctccggagg agcatcgcct ggtgaaaaaa gcaactccca aggcaaattc 3781 agcacttccg gattatcaga agtcacaaca agttcgcact tatccagccc atcaaaagta 3841 gcagcttgaa aaaattgccg cgtttgctcg ctacttctaa acactaactc aaataactgc 3901 tctccccatg tttgcaactt ctgctcaatc ttggctgcat tgtctggata tataccatag 3961 ggaaagctga gatactcttc caaataccag cgtaaatcta tcaacgcctg cttctcaaaa 4021 ggatgctgaa atgctaccgc tggtgcagaa cgaagcgcag cttgtccgcg ttgccagaaa 4081 agttgaatcg aatctccctc atgatgaatt cgcaaacaat ttatcgccat atctgctcca 4141 ctcgctaggt ttacagtaat acttccttta ctgatgactt cacgtaaatt cagagacttt 4201 ccggagacat ttcatctcta acgagtggtg ttattttttg tttatttttg taaactttta 4261 ttgagcttcc tttttttgtt ctgtctgctg cgaatcgttt tcgtcaactt tttgggtgtt 4321 acctgctttt acccaacctt cttgtttacc agcttctttt tcgtcaccac cttccaagcg 4381 aattttttgc cagactttat cttggctttc ttgcaaaacg ataactcttg aattaaatcc 4441 aaccccacca atacgttcag cttcttgctt tggttctcct cgcaaaatta agccttgctt 4501 ccaagtgata cgcgcatggt aagctcctgg tggtaatgat tttgcagatg gcgttggttt 4561 gggggttggt ttggctgagg ctttagcttt agcctgaatt gcagaggttg ctttggctgc 4621 tttagccgca gttacttttg gatcttcccc tctgactgca ggactatcat 
tggcaaaaat 4681 gggtttggcg gggggcatag aagttcgatt gatgacatac aaagcagtcg caaatccgcc 4741 gccagctaaa atagcgatcg ccaataaaat cccaagcgta tatttcagta tattgacaaa 4801 aaccacagtg acaacctcac actacaaatt tatgaattat aaattatgaa ttctgaaaat 4861 attgagtgtt cataattcat aattccgaac tcattattgc agagttgggg taaaatgccg 4921 tcgcctagcg cgcctcatac cccaacatgc ctctgacatc ttaccctgac tgaagtttag 4981 gaagacggtc tccctccaga aaaccctttg ggcatatatg ggctatcgct caatggattg 5041 tgtttggaag ccaaacgcgc cctaccagcc gccgcccatt cttgcaattt ctgaatttgt 5101 tcgacggcgg ttcgtgctaa tggtataatc tgactagcag attctaaaat atcatcggtt 5161 gtaaagtccc gattttggct aaatccgatg tgcatcgctt caattaatgt ttgctcaatt 5221 tctgcgccag aaaaatcagg tgtttcgtaa gctaatctgt caatatcgta atctttcaag 5281 ttatgggcgc gtagtcgaga taagtgtaca ttaaaaatgg cttttctctc ttcttggctg 5341 ggtaaaccaa caaagaaaat ttcatccaac cgtccttttc gtaacatttc tggtggtaat 5401 gcttgaatat cgttcgcagt cgcaacaaca aacacaggtg aggttttctc ggctaaccag 5461 gtaataaatg taccaaaaac acgactcgtg gttcccgcgt ctcctttgct accaagtccg 5521 gaaaaagctt tatctatctc atctatccac aaaacacaag gcgcaagggc ttcggcgact 5581 tgtatcattt gtcgcgtacg agattcagat tcacccacca aacccccaaa caatcttccc 5641 acatcgagac gcagcaaagg taaatgccag tgatgggcga tcgcctttgc agttaacgat 5701 ttccctgtcc cctgaattcc caccaacatc aaaccacggg ggtgtggtaa accgtactgt 5761 cgcgcccttt cggtaaacgc gccaccccga cgcaaaagcc agtctttgag gttatccaat 5821 ccgccaatat cagaaatttg ctcagtggct gggtaaaagt cgaggatttg ggtttgacga 5881 attgtttggc gcttttcttc caaaaccaag tctacatctt ctggttgcaa ttctc // LOCUS NODE_4960_length_5920_cov_5.5993185920 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5920) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5920) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5920 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..384 
/locus_tag="DP116_26525" CDS <1..384 /locus_tag="DP116_26525" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26525" /translation="GNIQIKADSLSLTNDASLFANSSGQGDPGNIEVTARQIRLNNKA KFNAESASGNGGNINLRVRDLLLLRRGSQISTSAGTDQKGGNGGNITINAPSGFIVSV PNENSDITCRKFGCFFSTMSYSLNQ" gene complement(562..717) /locus_tag="DP116_26530" CDS complement(562..717) /locus_tag="DP116_26530" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206681.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" /protein_id="PRJNA477356:DP116_26530" /translation="MVKVNLNVRVEQAEMEILETYAEQTGRTKTDIIREYIRSLAIRR PHHSLNE" gene complement(1084..1395) /locus_tag="DP116_26535" CDS complement(1084..1395) /locus_tag="DP116_26535" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="threonine dehydratase" /protein_id="PRJNA477356:DP116_26535" /translation="MSGLKQVIQNLFIRLEGFFSVVFKNIFNFFGNVFGFFAKLFGFS NSGYFLEADQVQSIKRETTQQSMKTEASIATETPATSNRRRPNPQMDYYRKMAQEVKK S" gene 1394..1822 /locus_tag="DP116_26540" CDS 1394..1822 /locus_tag="DP116_26540" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26540" /translation="MVVSPVTTYYYFDLQDWSLLMFKGIQAIFFFVDDVVAAATWYSK LLNAPVKYFHTDNEIRGALIEVASIDMFFHVADEKMRPGNAGQVAYWRVDDFSQAVDH AQKHGAKLYRGPLTIEENQAICQMWDPFGNLFGMQGLILV" gene complement(1843..2124) /locus_tag="DP116_26545" CDS complement(1843..2124) /locus_tag="DP116_26545" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26545" /translation="MKTKNPPEGKLTNFVLSSVDIFDLVETWFVNPPRQFPTCFACIN STHRFALYHVRLISCDSHDILPHPLIPSPQAGRGGFGARQNRGGVIRDL" gene complement(2114..2452) /locus_tag="DP116_26550" CDS complement(2114..2452) /locus_tag="DP116_26550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878858.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26550" /translation="MSQKDNESLQIQEINNLKPVHFADLVRAAQLIFDPAKGVSGIYT EVDWKDFGIPDDVVQNLKVLGQEYQYSSPHIPIEEIWSKLTPQTRVWFIQNKNELWRF EEAFPALDED" gene complement(2589..2954) /locus_tag="DP116_26555" CDS complement(2589..2954) /locus_tag="DP116_26555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408190.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26555" /translation="MARKRLIIEMGMGVDQHGQEPTVAAARAVRNAIAHNALPGVWEV AGLSDPNEMIVEVKVAVPYPEQVREEEVLAVLPFGRKTLTVELGGMVVQGKAIPVLND KNDEMLIAVAAVTVFVEND" gene 3452..4288 /locus_tag="DP116_26560" CDS 3452..4288 /locus_tag="DP116_26560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317423.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26560" /translation="MSHDSLYHIRTRTVITSISERPIHQIPFITHSPNGIGIVHRVLP VLLAKVDSLPDGLEAIIATSDLQGVDPANYRLLGHLVSEELENLAQLGKIPSPQTTGV ILAGDLYAKVDKRGGVGDVREVWQAFSRRFRWVAGVGGNHDNIGKTPQEIQAFQNEQG IHYLDGDLTCLDGIRIAGISGIIGKRTKPFRRPEEDFRELIHELLNESPDILILHEGP NDAEAKLIGNQSIRTELTTGSNLLVICGHSHWRVPMTTMPKGVQVLNVDTRVVVMTKM KS" gene 4329..4526 /locus_tag="DP116_26565" /pseudo CDS 4329..4526 /locus_tag="DP116_26565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744414.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="trans-aconitate methyltransferase" gene complement(4530..5873) /locus_tag="DP116_26570" CDS complement(4530..5873) /locus_tag="DP116_26570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314423.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amine oxidase" /protein_id="PRJNA477356:DP116_26570" /translation="MSSHAEVAIVGAGISGLSAAWELHNLGIESLVVLEANSRVGGRT LNHPLVSGGYVEQGGTWAGPTQTALLNLAKEMGVSIKKGKLEGQTFYGFRQKWTMLEA SPTSESDIAQKDFAQAMTTFEALCRTVPLETPWQAPNANILDSMTMGAWIEQNTTTDE ARAWFEGCVRQILSGDPNKVSLLWMLHFVHTAGFNDLLETAEDFSFVGGTQQISLNIA ERLGERVLLNASVTEISGYDDSLVQLICERGVVYAQQVIVAMMPKTVARIQFHPPLPP IHRQLIAEWKTMSWIKFHAIYEKPWWQGQIISGHFLSIDSKIEVIDISPQDGSRGVIV GLLAPDYATLPESKRQELCLAFLGETFGEAARQPSEWVEFDWNNQPWTGGCISALPPG LLTTAGSGLNSPVGRIHWAGTERSSIWANYIEGAIRAGQKAARDVAAKLRSVSTT" BASE COUNT 1698 a 1318 c 1250 g 1654 t ORIGIN 1 ggaaacatac aaattaaggc tgattctctg tctttaacca atgacgcttc actttttgcc 61 aacagttctg gacaagggga tccaggtaac attgaagtga ccgcacgcca aattcgtttg 121 aacaacaaag cgaagttcaa cgctgaatca gcatcaggta atggcggcaa tatcaatctg 181 cgagtcagag acttattact cctacgccgt ggtagccaaa tttccacatc tgcgggtact 241 gatcaaaaag gtggcaatgg cggtaacatt actattaatg ccccatcggg ctttatcgtc 301 agcgttccaa atgaaaacag cgacatcacc tgcagaaaat ttggctgctt cttctcaacc 361 atgtcatatt cgttaaacca gtgatggggg tgccctcgtc gccgtccgcc gcctctcccg 421 ccactaaacg gagtaccgtt atagtggggg acttacggcg atttaacttg aaccgacaaa 481 agccccacgt ctgacccgat agggcagcgt gggatgaatg gcggggcgct gaggagtccc 541 gcccccgacc aataacgact atcactcgtt gagagagtgg tgcgggcgac gaatcgccaa 601 acttctaatg tattcacgga taatatctgt tttggttctt cccgtttgtt ctgcataggt 661 ttctagtatt tccatttctg cttgttctac gcgaacgttt aaattcactt ttaccattta 721 cattgtaatg ataattccat tacaatcata ctagatggca aacccgaaaa tgctggtcat 781 ctctttgcag aaccttggca actcagagta tggggttcta cccagtaaca gacacttctg 841 tgtcgctttc tgtgggtagg taaaactctg acagtacaaa ggtgtgaaca tttttgtatt 901 gttcaactgt gcggaggggc atctcgtgca gtctaaacaa agctcagtta atgtccgacg 961 ggataccggg agtcgctgag aagcccacac tgtacgcttg cgcgacttgt gggagtatgt 1021 cacttaacag cgacaagcga tcgcccccac acttctagct aaacccagtc aacagcggta 1081 gtcctagctc tttttcacct cctgagccat ttttcggtag tagtccattt gaggattggg 1141 acgacggcga ttagaagtcg caggagtttc agtcgcaatt gacgcctcag 
tcttcattga 1201 ttgttgtgtt gtttctcgtt tgatactctg tacttgatca gcttctagaa aatatccaga 1261 attggaaaac ccaaaaagct tagcgaagaa accaaagaca tttccaaaga aattaaagat 1321 gtttttaaaa acaacagaga aaaaaccttc aaggcgaata aaaagattct gaataacttg 1381 ttttaaacca gacatggtag tgtctcccgt aactacatac tactattttg acttacagga 1441 ttggagtctg ttgatgttta agggtattca agccatcttt ttttttgtag atgacgttgt 1501 tgcagcagca acttggtata gtaaacttct aaatgcacca gttaagtatt ttcacactga 1561 taacgaaata cgaggagcgc tcatagaagt cgctagcata gatatgtttt ttcatgtcgc 1621 agatgaaaaa atgcgtcctg gaaacgccgg acaagtagca tactggcgtg tagacgactt 1681 ttctcaagca gtagatcatg ctcaaaagca cggggctaaa ctttatcgag gtccgttgac 1741 aattgaggag aaccaagcta tttgtcagat gtgggatccc tttggcaatc tttttggtat 1801 gcaaggtttg atattggtgt aatatcatgt ccggctaatt gcttataaat cacggatgac 1861 cccaccccgg ttttgtcttg cgccaaaacc gcccctcccc gcttgcgggg aggggattaa 1921 ggggtggggt aaaatatcat gggaatcaca actaatcaac ctgacatgat ataacgcaaa 1981 cctatgtgtt gaattgatgc aagcaaaaca agtgggaaat tgtctaggtg ggttgacaaa 2041 ccaagtttca accaagtcga agatatcaac actggataac acaaagtttg tcaatttccc 2101 ctctggtgga tttttagtct tcatcaaggg ctgggaaagc ttcctcaaac cgccacaact 2161 catttttatt ttgaatgaac caaacacggg tttgtggagt taatttactc caaatttctt 2221 cgatgggtat gtggggagat gaatactgat actcttgacc aagcactttc aaattttgca 2281 caacatcatc aggaataccg aaatctttcc agtcaacttc tgtgtatatt ccagatactc 2341 cctttgcagg gtcgaaaatt aattgtgcag ctctaactaa gtcagcaaag tgtactggct 2401 tgaggttatt tatctcttga atctggagtg attcgttatc tttttgagac atcgtcacta 2461 acgttggtga aggtaagtta tctattggac ttaccgccag agatattcgc ctcgatgtct 2521 ctagtttagc gcaaagcatt agatgaaatg ataactgttt actgttaact gataactgat 2581 ttagtcgctt aatcattttc aacaaaaact gtaactgcag ccacagctat caacatttcg 2641 tcatttttat cattgagtac tggaattgct ttaccctgaa cgaccattcc accaagctca 2701 acagtaagag ttttgcgacc gaaaggcaat acagccaaaa cttcttcttc tcgtacttgt 2761 tctgggtaag ggactgctac cttaacctca actatcatct catttgggtc actcaaacca 2821 gcaacttccc acacaccggg taaagcattg tgtgcgatcg catttctcac agccctcgct 
2881 gctgcaactg ttggttcctg tccatgctga tccactccca tccccatttc aataattaag 2941 cgtttgcgtg ccacagtgac ctctgattgt aaagtttcgt tcttaaactc agcctacaag 3001 attttttgat tcatgtcttg ctctagggac gtcgcataga ttccctcaaa agtgtgttct 3061 ttgatactga agcagcgggg atggaaaatc tcagcaataa gcaagtgcaa caagaggcga 3121 tcgcattggc agaggcaaga gttagactgg tagactggta gatagtaact tatcggtact 3181 atctcaaatg acagagctgc ctttcgtgct tttacagaac aagagtttcc tgtctttagc 3241 acaaataagc gatgactttc aaaaagtata ctagataaca aactagcaaa caaccctgag 3301 tctcttaggc gaaaagttat atttatgtac ttgcaatatt ctaattgtaa ctattgggat 3361 aaactgtctc tctttcaatc ataatagaga tttataaatg acttttctac ccaattagcc 3421 aagcccggat aggcattacc taaatcatac catgtcccat gattctttgt accatatccg 3481 aacaagaact gtcataactt ctatttctga gcgaccgatt catcaaatac catttattac 3541 ccattcaccc aatggtattg gtatagttca tagagttctg cctgttttgt tagcaaaagt 3601 tgactcacta ccagatggtt tggaagcgat cattgccaca agcgatctac agggagttga 3661 tccagccaat taccgccttc ttggtcatct tgtcagtgaa gaactagaaa acttggcaca 3721 gttaggaaaa attccatctc cacaaacaac gggcgtcatt ttagcaggag acctttacgc 3781 aaaagtagat aagcgtggtg gtgtaggaga tgtccgtgaa gtttggcaag cttttagtcg 3841 ccgttttcgt tgggttgctg gggtaggagg aaaccacgac aatattggca aaacaccaca 3901 agaaattcaa gcatttcaaa atgagcaagg tatacattat ttagatggcg acctcacttg 3961 tctagatgga attcgcattg ctggaatttc agggattatt ggtaagagaa ctaagccttt 4021 ccgtcgtccg gaagaagatt ttagagaact tatacacgaa cttctcaacg agtcaccaga 4081 cattttaatt ttgcatgaag gtcccaacga tgcagaagcg aaattgatag ggaatcagtc 4141 aattcgtact gaactaacaa caggaagtaa tttattagtt atttgtggtc attcacactg 4201 gagagtccca atgacaacta tgcctaaagg agtacaagtg ctgaatgtag acactcgggt 4261 tgttgtcatg acaaagatga agtcctagag ctgaagctag tagaaagtca tcccacaagt 4321 ttaatcgctt attaccgtca ccagaagcag tcatagaatg gtatcgcggg acgttgctca 4381 caacatacca gcagcggcta gatgctgaga cttacgagcg atttgtgaaa cggtacacgc 4441 agaagttaat caagtacttg ggtgaggagc gtctgttctt ctttgcgtat aaacgtattt 4501 taatgtgggg aagcttaagc aattgaacac taggttgtag agactgacct cagcttagcc 4561 
gcaacatcac gagcagcctt ttgtccagcg cgaatagcac cttcaatgta gttagcccaa 4621 atactagagc gttctgtacc agcccaatga atccgaccta ccggactatt gagaccagaa 4681 ccagcagtag tgagtaaacc aggtggtaat gcggaaatgc atccacctgt ccaaggttga 4741 ttattccaat caaattccac ccactctgat ggttgtcgtg cggcttcacc aaaagtttct 4801 cccagaaatg caaggcataa ctcttggcgt ttggactcag gtaacgttgc gtaatcggga 4861 gctaacaatc caacaatcac acctctactt ccatcttgcg gcgagatatc aattacctct 4921 attttgctat cgatactgag aaagtgacca ctgattattt gaccttgcca ccaaggtttt 4981 tcatagatag catgaaattt aatccaactc atcgtcttcc actctgcaat cagttgacgg 5041 tgaattgggg gaagaggggg gtgaaattgg atacgagcaa cagtcttggg catcatcgcc 5101 acaatgactt gttgagcata cacaacccca cgttcacaaa taagttgaac aagggagtca 5161 tcgtaacccg atatttcagt cacagacgca ttgagtaaga cccgttcccc taagcgttct 5221 gcgatattta gggaaatctg ttgtgttccg ccaacaaagg aaaaatcctc agcagtctct 5281 aataaatcat tgaatcctgc tgtatggaca aaatgaagca tccataatag tgaaactttg 5341 tttggatcgc cactgagaat ttgccgcaca cacccctcga accaagcccg tgcttcatca 5401 gtcgttgtgt tttgctcaat ccaagccccc attgtcatag aatctaatat gtttgcatta 5461 ggagcttgcc aaggggtttc tagtggaaca gtacggcaca aagcctcaaa tgttgtcatc 5521 gcctgagcga aatctttttg agcaatgtcc gattcagatg tcggagatgc ctctagcatc 5581 gtccatttct ggcgaaaacc atagaatgtc tgaccttcaa gctttccttt cttgatagaa 5641 acccccattt cttttgctag attcagcaac gcagtttgag taggaccagc ccaagtccca 5701 ccttgttcaa cgtaaccccc tgacacaagc ggatgattga gagttcgtcc ccctactcgt 5761 gaattggctt ccagtacaac cagcgattcg atacctaagt tgtgcaattc ccaggctgca 5821 ctcagaccag aaatgcctgc acccacaata gcaacttctg cgtgagaact catggtgata 5881 aactttattt atgcaatata gttgatttat actacaatac // LOCUS NODE_4983_length_5876_cov_5.1766025876 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 5876) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5876) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5876 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 166..1737 /locus_tag="DP116_26575" CDS 166..1737 /locus_tag="DP116_26575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455618.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metallophosphoesterase" /protein_id="PRJNA477356:DP116_26575" /translation="MEFIFDPPIPVKIQKMKERVRWGDPSVVQRGIDQTSMVLDDGNS DNPEFSFIVIGDSGTTEHHGHHPQRKIAELMLAHQKECRFVLHTGDVIYVVGSHEYYP TNFIEPYREFLVGGENPRRIPYDRMVFKVPILPVLGNHDYYDVPWMYRFITGGTRSLR RLLGYKNVEIGWHGSYKGDAYARAFLDYLKRFDSGEELERHLDNHYTAKTDTGYCLRY QPGRFTRLPNRYYTFRYGGIDFFALDSNTFNAPLPLPATKEGEAERRELVHRRHKIEQ EEIQILEACDKLNPDIPSEAEQLDALKAKLYQIDEIKIDIEKQLESDEASVVDFEQLN WLKQRLISSWNTEEVRGRIIYFHHPPYVTEATKWHQAQTLAVRRCLRWVFDQVADTLG SMTQGRPIVDLILNGHAHCLEHLRTTDTGHADSHINCIISGGSGHYPRRQRNEGTELM ETFEHLPGSPYRKVADSFLFVGRNGYDVQMRRPYSFVRIDVRGGCPPKLIVRPFVTER FMGGWCDRHLEPFVI" gene complement(2061..4829) /locus_tag="DP116_26580" CDS complement(2061..4829) /locus_tag="DP116_26580" /inference="COORDINATES: protein motif:HMM:PF14903.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004504173.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26580" /translation="MVFQKRTCLVVLFLLALVSAIAFQLLHISKTENLYPIYNGPGYG YVNQAGELVVKPQFSQAEEFSEGFAPVKVNDKVIVHSPCTMAETLVLSLRGRGECERL HGKWGFVNLKGQVSISPQYDTVGESSSPYSDGIDDDFSPSSFSEGLAGVKLNGKWGFI NQSGKLVIPYQFDKVQRFSGGVATVQVGGLWGVINPEGKWIIQPVDKFPIKFFQGIAK LNLISWNESGQFIGDPNIYWDKSGRRVGDFFNPPKLASEFQEGLAIVEISRWAKQQGR DDPLVVVATSEDSIGSKCGFQDKRGKVVIEPQFDYCQSFASGLAAVQVDKKWGYIDKT GKFIVSPQFDYADRLIEERALVVSDGKIGFIDKTGKIVIKPEFQIDPELAIHYKKEPA AVLKEWKSRLKEFGQPLDLSSIEKWLAQVLKPYSSEVVQRQFSNGLAAVAKDNKCGYI DKTGKFAIQPQFTECKRFDLHGVAQVTRQNGVVGQGGRDEYVYINREGKTFPKYAASL SNFYPTWSKLLTSVMACLLWIVAISCHEFGHAIVAYWGGDRSVKDKGYLSFNPRRYIN PLHSIILPAIFLVTGGIPLPGAAVYIEFDQIRNRLWLSATAAAGPIASILFGLLLVPI FQFSLAWNSPHWFSTFIADFISLQFFIALFNLLPIPPLDGYRILSVWLPPKLQDRTGI ASMIGLFFLCFVLPFISIAAAPFFIILFVGSYLVMRASGISEEFSLDVWHSFHRWYIA LCLLVASLVYLLYKPASIFQLAGLVLENFFPKVALKMYDRALKIDPKSAWSWERKAWT LQRVGTVGDSVEIISACEQAIQLAPYNRFLWRALILELSLSLTWGHKFYEQKLIEAIE RYIKLYPKDGWGWGVKALTLDDFGDSEEALLAWEKNIELDPKDHPYKWRTQMKESTLE RLRSLGKL" gene 5002..5769 /locus_tag="DP116_26585" CDS 5002..5769 /locus_tag="DP116_26585" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015201311.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="PRJNA477356:DP116_26585" /translation="MKLINRKQKYYFLRFTAIIFVILWCLLFDPTGIANAASPSSTVY EQRLLHSPDGIGKIYMGREIAKVMGHTGVLWLERPSRESEEQPSKIISVLNFKPDDVV ADIGAGTGYISFRIAPLLTKAKVLAVDIQPEMLDIIEFLKKENNITNVEPILATLTNP NLPPESVDLALMVDAYHEFEYPREIMEGMIRGLKPGGRIVLVEYRGENPFILIKALHK MSQRQVRKEMQAVGLVWRETKNLLPQQHLMVFEKPSL" BASE COUNT 1707 a 1276 c 1252 g 1641 t ORIGIN 1 cagtagagca ggacttacgc acgaacaacg taactatagt cattgcgacc gaagggaagc 61 aatcgcaacc cgcttttttt cgtagcgcgt gcgtaagtcc tatagaggaa attgggaagt 121 agtgggagaa tagctaaatg actaataact catgaatact gatttatgga attcatattt 181 gatccaccaa ttcctgtaaa aatccaaaag atgaaggaac gggtacggtg gggagatccg 241 agtgttgtcc aacgtggaat tgaccaaacc agcatggttt tagatgatgg taactcggat 301 aatccagaat tctcttttat agttataggt gatagtggca ctacagaaca tcacggacat 361 cacccgcaac ggaaaattgc ggaattgatg cttgctcacc aaaaggagtg ccgttttgtc 421 ctgcatacgg gtgatgtcat ttatgtggtg ggatctcatg agtattaccc gaccaacttt 481 atcgaacctt accgcgagtt tctggtgggt ggcgaaaatc ctagacgcat tccctatgat 541 cgcatggtct ttaaggtgcc aattctgcca gttttgggta atcatgatta ttacgatgtt 601 ccgtggatgt accgctttat caccggaggt acacgatcac tgcgtcgctt acttggctat 661 aaaaatgtcg agataggttg gcatggatct tataagggtg acgcttatgc acgggcgttt 721 cttgattatc ttaaaagatt cgactctggt gaagagttgg aacgtcactt agataaccat 781 tacactgcta agactgacac aggatactgt ttacgttatc aacccggacg tttcacccgt 841 ttacccaacc gctactacac ttttcgttac ggcgggatcg actttttcgc ccttgactcc 901 aatactttta atgcaccatt accactccca gcgacaaaag agggagaagc cgaacgccgc 961 gaattggttc atcgtcggca taagatagaa caagaggaaa tccaaattct ggaagcgtgc 1021 gacaagctga atccagatat tccctcagaa gccgaacaac ttgatgccct caaggcgaaa 1081 ttatatcaaa tcgatgaaat caaaattgac attgaaaaac aacttgaatc tgatgaggcg 1141 tcagtcgtag actttgaaca actgaactgg ctgaagcaaa gactgatttc atcttggaat 1201 actgaggaag ttcgcggacg tattatttat tttcatcatc ctccttacgt cacggaagcg 1261 acgaaatggc atcaagcgca aacgttagca gttcgccgct gtctgcgttg ggtgtttgat 1321 caagtcgcag 
acactcttgg ttcgatgact caaggtcgtc caatagtcga tttgattttg 1381 aatggtcatg cacactgttt ggaacacctg cggacaacgg atactggaca tgctgactcg 1441 catatcaact gcatcatctc tggtggtagc ggtcattacc cccgtcgtca gcgaaacgaa 1501 ggaacagaat tgatggagac ttttgagcat cttccaggta gtccttaccg taaagttgcg 1561 gattcgttcc tttttgttgg tcgcaatgga tacgatgttc aaatgcgacg accatactct 1621 tttgttcgta ttgatgttcg aggtggatgt ccaccgaagc ttattgtcag accatttgtg 1681 actgaacggt ttatgggagg gtggtgcgat cgccacctcg aaccgtttgt gatttagaaa 1741 tgttcagtgt ccagaggtta tttgtggatg atgtaagttt ttgatcagcg gataagacaa 1801 agtatgaact ctacgagaaa acttctatag atttgtcaaa agtagctaga ctcacatctt 1861 gcaccagttg cgttattgca atttttgtaa ggtgggcact gcctagagtt cgctcaaaac 1921 cttattctta cgttgtgacc agtgcccacc ctacagttat tgagaatagt gagccagcgc 1981 agtcttgggg gtttccccca tgagcgactg gcgttagcgc agcgtgaccg aaggtcatac 2041 ccggagggtg caagatatca ctagagcttt cctaaacttc ttaaccgttc tagagtcgat 2101 tctttcatct gcgtccgcca tttataagga tgatccttag gatcaagttc aatgtttttc 2161 tcccaagcta atagtgcttc ttcactgtcc ccaaagtcat ccaatgttaa agcttttaca 2221 ccccatcccc aaccatcttt aggataaagt ttgatatatc tttcaatagc ttctattaat 2281 ttttgctcat aaaatttgtg accccaagta agcgacagac ttagttccaa aattagtgct 2341 cgccataaaa aacgattata aggagcaagt tgaatggctt gttcacaggc ggatattatt 2401 tctacactgt cccctactgt tcctactctc tgtaaagtcc atgctttacg ttcccatgac 2461 caggcgcttt taggatctat tttgagtgcc ctgtcgtaca tttttagtgc tactttggga 2521 aagaagtttt ctagtactaa acctgctaat tgaaaaattg atgctggctt gtacagtaga 2581 taaacaagac tagcaactag cagacataat gctatatacc agcgatgaaa cgagtgccaa 2641 acatccagag aaaattcttc tgaaatgccc gaagcacgca tgacaagata agatccaaca 2701 aagagaataa taaaaaatgg agcagcagca atcgagataa acggtaacac aaagcagaga 2761 aaaaatagtc cgatcatcga tgcgatgcca gtacgatctt gtagcttagg tggcaaccaa 2821 acagaaagaa ttcgataacc atctaatggg ggaattggca gcaagttaaa tagtgcgatg 2881 aaaaactgca aactaataaa gtcagctatg aatgtagaga accagtgagg agagttccag 2941 gcaaggctga actgaaatat gggaactaac agcaatccaa aaagtatact agcgattggt 3001 cctgctgctg ctgtggcact 
taaccacaaa cggttccgaa tctgatcaaa ttcaatatat 3061 actgctgcac caggtaaagg tattccacct gtcaccaaga agattgcagg caggatgata 3121 ctgtgtagag gattgatata tcggcgcgga ttgaagctta aatagccttt gtctttgacc 3181 gagcgatcgc caccccaata tgctacaatc gcatgcccaa attcatgaca gctaatggca 3241 actatccaca gcaagcacgc catcactgaa gtcagcagtt tactccaagt tggatagaaa 3301 ttgcttaagg aagcagcgta ttttggaaat gttttcccct cacgattgat gtaaacatat 3361 tcatccctac ctccttgtcc aacaaccccg ttctgacgag ttacttgagc gactccgtgt 3421 aggtcaaatc gcttgcattc agtaaattga ggttggatag caaatttacc agttttatcg 3481 atgtatccac acttattatc ttttgcgaca gctgcaagcc cattggaaaa ctgacgttgc 3541 accacctcgc ttgaatacgg cttaagcacc tgggcaagcc atttctcgat tgaagacagg 3601 tcaagcggct gaccaaactc tttcaagcgt gacttccatt ccttaagtac agcagcaggt 3661 tctttcttat aatgaattgc caactcagga tctatctgaa actctggttt aataacaatt 3721 ttgcctgttt tgtcaataaa accaatcttg ccatcagaaa cgaccaaagc acgttcttca 3781 atcaggcgat ctgcataatc gaattgtgga ctaacaatga acttgcccgt tttgtcaatg 3841 taaccccatt ttttgtctac ttggacggct gctaatccgg atgcaaagct ttgacagtag 3901 tcgaattgtg gttctataac aactttccct cgtttatcct gaaagccaca tttcgatcct 3961 atggaatctt cagaggttgc taccaccaca agagggtcat cacgtccttg ctgtttagcc 4021 cacctactta tctcaacaat cgcaagtcct tcttgaaact cggacgctag cttgggcgga 4081 ttgaagaagt caccgactcg ccgccctgac ttatcccaat agatgttcgg atcaccaata 4141 aactgtcctg attcattcca agaaatcaag ttaagtttcg ctatcccttg aaaaaactta 4201 attggaaact tgtctacagg ctggataatc catttaccct ctggattgat aacgccccat 4261 aagccaccta cttgaacagt tgctacacca ccagagaatc gttgaacttt atcaaactgg 4321 tatgggatta ctagtttccc actctgattg ataaatcccc acttgccgtt gagtttgaca 4381 cctgccaagc cttctgaaaa agaacttggt gaaaaatcat catcaatacc atcactataa 4441 ggagaacttg actcaccgac cgtatcgtac tgtggggaaa tgcttacctg acctttaagg 4501 ttcacgaatc cccatttacc atgtaaccgt tcacactccc ccctacccct caacgataaa 4561 acaagggttt cggccattgt gcagggggag tgaacaatta ctttgtcatt gacttttaca 4621 ggtgcaaacc cctcagaaaa ttcttctgct tgagaaaact gaggtttaac aaccagttct 4681 cctgcttggt ttacataccc atatccaggc 
ccattatata tgggatataa gttttcagtt 4741 ttggagatgt ggagtagttg aaaggcgatc gcacttacta atgccagcaa aaacagaaca 4801 accaagcaag ttcttttctg gaataccatt agtagaactt ttccttacta ttggtagata 4861 tcaagcactg ttctttactt aaatggtact atatcgcttc atttttatga cgttaactta 4921 tatctcgttc tttagtaacg ttctttagta aactacacct caaaactctt cataatagtt 4981 tataaacacg ctctgagtga catgaaactc atcaaccgta aacagaaata ttatttttta 5041 cgttttaccg caattatttt cgtcatactg tggtgcttgc tttttgaccc cactggtata 5101 gcaaacgctg cttccccatc ctctactgtt tacgaacaac ggctgcttca cagtccagat 5161 ggtattggta aaatttacat gggtcgagaa atagccaaag tcatgggaca cacaggagta 5221 ttatggttag aacgaccaag tcgagagtct gaagaacaac caagcaagat tattagcgtt 5281 ctgaacttca aacctgacga tgttgtggca gatattggtg ctggaacagg ctacataagc 5341 tttcgtattg cccccctctt aacaaaggca aaggtgctag ctgtggatat tcaaccagaa 5401 atgctggata ttattgagtt tttgaagaaa gaaaataata ttaccaatgt tgaacctatc 5461 ttggcaactc tgactaaccc caatttacca ccagaaagtg ttgacctggc tttgatggta 5521 gatgcttatc acgaatttga gtacccacga gagattatgg aagggatgat tcgagggcta 5581 aaaccaggtg gtaggattgt gctggttgaa tatcgaggtg aaaatccctt tattctgatt 5641 aaggctttgc acaaaatgag tcagagacaa gtacgaaaag aaatgcaagc tgttggtttg 5701 gtttggcggg aaactaagaa cttgttaccc caacagcatt tgatggtatt tgaaaaacct 5761 tccctgtgag gagacattgc gcggatgggt aggtttttga aaactctcac ccagtccgca 5821 acgggcgcgg gtgtaagggt gtaagggtgt aagggtgtaa gggtgtaagg gtgtaa // LOCUS NODE_5005_length_5838_cov_4.4508045838 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5838) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5838) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5838 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 295..1314 /locus_tag="DP116_26590" CDS 295..1314 /locus_tag="DP116_26590" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315134.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26590" /translation="MTPSMTDSAVKAAMDIIASSPATTQEKIEMLIEMAQGFQKKPKT AQDLWNAVSLCQQAYDLCDEDNLLWQARAKVGIAVALKAIPDVGEQLLLEAKERFQEA LPILQRTLASTEDSDAQQCPQQLATPVELAEAQMNFGLVLQSLVPFNLAQMTDSIQAY QKALQVFTWQKYPQEYAILHNNIAIAYLSIPLTSEREYLRQGLAVQTFEEALKHIELI SHPREYAMLQNNLGNALQYLPSSHPLENVVRAIAAYDEALKVRNPRDTPLEYANTISN KANALWNLPDNPEKPEAGNLKNLLQARTYYQKAWEIFTQHEQIEQAQVVAQVLQEVET EIDNG" gene 1370..1636 /locus_tag="DP116_26595" CDS 1370..1636 /locus_tag="DP116_26595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315135.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26595" /translation="MTDLLTNPVLKDFFTSLMAGDLNLMTGFVWFLVATALSMIGGAI GGIILAKEDLGYELAALLGGFFGPAGVIPGIILGLIVLNVLRNF" gene 1645..1869 /locus_tag="DP116_26600" CDS 1645..1869 /locus_tag="DP116_26600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017320976.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26600" /translation="MLEIGWFSARLFFKGKLLRDPMYFVRQTTIGISVSALLLFLLAQ AKVSLWIPVIVSSFLTGVLMPFLHKDLKLK" gene complement(1912..2151) /locus_tag="DP116_26605" CDS complement(1912..2151) /locus_tag="DP116_26605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008199529.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26605" /translation="MFIGDFGIKKCSFYQLSVTSYQLSVSCPVVHCSLFTVHCSLFDK PQINLRPEAISKLKFVRLFSRLELKFRPNNSFSQC" gene 2144..3373 /locus_tag="DP116_26610" CDS 2144..3373 /locus_tag="DP116_26610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315137.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26610" /translation="MNTPTKMTLTPKDATEAVKVSQLSPFLLQGAEVILGKTLELGQL ATPVVPSSREMFGPRGACLISETGPLWVSDTGHHRLLGWRNVPTQDSQPADWVIGQPD FNHEGQNAKGSPGRATLSVPTGICACGEGLAVADAWNHRVLIWKKLPEDSNVPADLVL GQTHFTDNEPNRGTQQAAANTLNWCYGVFYHQGKFFVADTGNRRVLIWHQLPEENGQP ADVVLGQHDMMSRDENGGDTASASSMRWCHDITVWEENLVVTDAGNNRVMIWQGIPTE NNASCAVVLGQKTFDSVEMNQGVYFPSANSLSMPYGVAAAGEWLFVADTANSRILGWK KPQSIFSLHGSECHAIAGQIDFQRKGENRDFGLAKRDSLNWSYGIKVCGKTAVIADSG NNRVLLWKITTKELSNS" gene 3609..>5838 /locus_tag="DP116_26615" CDS 3609..>5838 /locus_tag="DP116_26615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315138.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbamoyltransferase HypF" /protein_id="PRJNA477356:DP116_26615" /translation="MSTEEIIRVRGIVQGVGFRPTVYRLAKAYGLQGEVCNDGQGVLI WVSGCEKSIEEFVQKLQKECPPLARIDEVTRKRYEGKSTFGDFVISDSVSSAVKTEIP ADAATCPKCKAEIFDPLNRWYRYPFTNCTHCGPRLSIIRAIPYDRNNTSMAAFAMCAE CRREYNDVENRRFHAQPVACQICGPKAWLERSDNQPITCHMFSMLDDVDAVCTLLQKG EIVAIKGLGGFHLACDATQESAVQKLRSRKQRNQKPFALMARDISVIEEYCTPNTKEK ELLESPAAPVVLIQAKAGDDEEERGSGRIQFKIRLEKFALSGNQRQIPSEGDPHQVLA SPRNFSQNSKFKIQNPKSKIQNQIAPSVAPGQNTLGFMLPYTPLHHLILRRMNRPIVL TSGNLSDDPQCIDNVEAREKLGKIADYFLLHNREIVSRVDDSVVRVVGDKVLTIRRAR GYAPTSINLPSGFESVPHILAMGSQLKNTFCLLRDGKAILSQHIGDLETAATFKSYQD TLNLYLNLFEHQPKAIAVDLHPEYFSTKLGKELAHNQYGHGTAVPLLHIQHHHAHIAA CMAENDININSPPVLGIALDGLGYGEDGTLWGGEFLLADYRQFKRLATFKPVPMIGGT QAMREPWRNTYAQLRSAFTWEDLKQKYRNLEILEFLEQKPLKLLNQLVEKGINAPLTS SVGRLFDAIAAAIGICREQCSYEGQAAIEMEALADTSSLKNKEYLNYNFQFLVSDNIS YID" BASE COUNT 1785 a 1129 c 1291 g 1633 t ORIGIN 1 caatagaaat gttttatctt aaaagagaat agacttaata aattcaagga gtatttatcg 61 gtcttctggc aaaaatactg aacgctgaat atagctgtac cccttttatc ttcaagcctc 121 tgacaactcg ggaaatagcg cgcctcattt tgcaaagagg ggattttcca agacacaact 181 cattatctgc gtaaaacttg cttggttggt acagcgcaaa gacgtaagga atatgcgtct 241 attgggtgtg cgaacttgaa agactcaact ccaacctgct ataggaagat ggtaatgacc 301 cccagcatga cagattctgc ggtgaaggcg gcaatggata ttattgcttc ttccccagca 361 acgacacaag aaaaaatcga gatgctgatt gaaatggcgc agggctttca aaaaaagccg 421 aaaactgccc aagatttatg gaatgcagtt tcgctatgtc agcaagcgta tgatttgtgt 481 gatgaggata atttgctttg gcaagctagg gcaaaggtag gaatagcagt ggctttgaag 541 gctattcccg atgttgggga acaactgttg ttagaggcga aagaacgatt ccaagaagcc 601 ctacctattt tacaacggac acttgcttca acggaggaca gcgacgcaca gcagtgtcct 661 caacagcttg caacacccgt ggaactagca gaagcccaaa tgaattttgg gttagtgttg 721 cagtcgcttg tgccattcaa tttagcccag atgacagata gcatccaggc ttatcaaaaa 781 gctttgcagg tgtttacctg gcagaagtac ccgcaggaat atgcgatttt acacaacaat 841 attgcaattg cttacctctc gatacccctg acatcagaaa gagaatacct gcgtcaaggg 901 ttagcagtac agacatttga 
ggaggcgctg aagcatattg agttaatcag ccatcccagg 961 gaatatgcga tgttgcagaa taatctgggc aacgctttgc aatatctacc gagttcgcat 1021 cctttggaga atgttgtgcg ggcgatcgcc gcttatgacg aagccttaaa agtgcgtaac 1081 cccagagata cacctctgga atacgccaac acaatttcta acaaagccaa cgccttatgg 1141 aatttaccag ataatccgga aaaaccagaa gctggaaatc tcaaaaatct cttgcaagcg 1201 cgtacctatt atcaaaaagc ttgggaaatt ttcactcaac atgaacaaat agaacaagca 1261 caagtagtag cacaggtact gcaagaagtt gagacagaaa tcgataatgg ttagtggttg 1321 gtagttttca aaaaaataac aactaacaat tttcgtgcag gagataagaa tgacagacct 1381 tttgacaaat cctgttttaa aagatttttt cacaagctta atggctggcg acttaaatct 1441 gatgacggga tttgtgtggt ttttagtagc aacagcctta tctatgattg gcggtgcaat 1501 tggcggaatc attttagcca aagaagattt aggctatgag ttagctgcac ttcttggtgg 1561 attctttggt ccggctggcg tcattcctgg aataatctta gggctaatag tcttgaatgt 1621 gttgagaaat ttctaggagg agtcatgtta gagataggat ggttcagcgc taggctattt 1681 tttaagggaa agctactgcg cgaccccatg tattttgtca ggcaaactac aattggcatt 1741 agcgtaagtg cattgttgtt gttcctgctt gcacaagcta aagttagcct ttggatacca 1801 gttattgtct ctagcttttt gactggtgtt ttgatgccat ttctacataa agacctcaaa 1861 ttgaaataaa tttcaggcag aaagctaaaa cattataaag ttttccgtct cttaacattg 1921 actgaagcta ttattaggcc ggaacttcag ttctaggcgg ctgaacaggc gaacaaattt 1981 caacttacta atggcttctg gtctcaaatt tatttgaggc ttatcaaaca gtgaacagtg 2041 aacagtgaac agtgaacagt gaacaacagg acacgacact gataactggt aactggtaac 2101 tgataactga taaaagctgc atttctttat tccaaaatca ccgatgaaca cgccaactaa 2161 aatgactctt acaccaaaag atgcaacaga agcggtaaaa gtttcccaac tttcaccttt 2221 tttgcttcaa ggtgcagaag ttattttagg aaagactctt gaactaggtc aactagcaac 2281 acctgttgta cccagcagta gagaaatgtt tggtcctcgt ggtgcttgtt taatatctga 2341 aactggacca ttgtgggtat cagatacagg acatcatcgt ttattaggat ggcggaatgt 2401 tcccactcaa gatagtcaac ctgctgattg ggtgattgga caaccagact ttaatcatga 2461 aggacaaaat gccaaaggta gtcctggaag ggcaacactc agtgtaccaa ccggaatttg 2521 tgcttgtggt gaaggtttgg cagttgcaga tgcttggaat catcgcgttt tgatttggaa 2581 aaaattgcca gaagatagca atgtacccgc 
cgatttagta ttaggtcaaa cccatttcac 2641 tgacaatgaa ccaaacagag gaactcaaca agcagcagct aatactctca actggtgtta 2701 tggagttttc tatcatcaag gaaagttctt tgttgctgac acaggtaatc gtcgggtttt 2761 aatttggcat caattacctg aagaaaatgg tcaaccggcg gatgttgttt tgggacaaca 2821 cgatatgatg tcccgtgatg aaaatggcgg tgatactgct tcagcatcga gtatgcgttg 2881 gtgtcatgac atcactgttt gggaagaaaa tttagttgtc actgatgcgg gtaacaaccg 2941 tgtgatgatt tggcagggaa ttccaacaga aaataatgct tcttgtgcgg tggtattagg 3001 acaaaaaact tttgattcag ttgaaatgaa ccaaggagtt tattttccta gtgccaatag 3061 tttgagtatg ccttatggtg tcgctgctgc tggtgagtgg ttgttcgtcg cagatacagc 3121 gaattcaagg attttaggct ggaaaaaacc acaatcaatt ttctctttgc acggaagcga 3181 atgtcatgct attgctggac aaatcgactt tcaacgtaaa ggtgagaatc gtgattttgg 3241 gttagcaaaa cgagatagct tgaattggtc ttatggaata aaagtgtgtg gaaagactgc 3301 agtcatagca gattccggaa ataaccgagt tttactttgg aagataacta ccaaagaact 3361 cagcaactcc tgaattagaa cataaaagaa gtttctgtta tgtgttgtgt taaatcaggt 3421 acacagtctc caaatagatt agacctcttg caaaagtgag atcttacgag gttctgttaa 3481 gagttctttg ttaagagttc cctacaactg ccacgaagtc tattgtacaa aacctttatc 3541 taaagatagg aaatcattct gagttctgtc tttgtttcct ttgcatctac gtaattagtt 3601 caaatatcat gtctactgaa gaaatcatca gggttcgcgg tatcgttcaa ggagtgggat 3661 tccgccccac tgtatataga cttgcaaaag catacgggtt gcaaggcgaa gtttgtaatg 3721 acggtcaagg tgtcctcatt tgggtatctg gctgtgaaaa atctatagaa gaatttgttc 3781 aaaaattaca gaaagagtgt ccgccattag cgagaattga tgaagtaacg cgcaagcgtt 3841 atgaaggcaa atctaccttt ggtgattttg tcatttctga cagtgttagc agcgcagtga 3901 aaacagaaat tcctgcggat gctgcgactt gtccaaaatg taaagcggaa atttttgacc 3961 ctctcaaccg ctggtatcgt tatcccttca ccaactgtac tcattgcggt cctcgcttaa 4021 gtattattcg cgccatacct tacgatcgca acaacacaag tatggctgct tttgctatgt 4081 gtgcagagtg tcgtcgggaa tacaacgatg ttgaaaaccg tcgttttcac gcccagcccg 4141 tagcgtgtca gatttgcggt cctaaagcct ggttagaacg ttcagataat caaccaatta 4201 catgtcatat gttttctatg cttgacgatg ttgatgccgt ttgtacctta ttacaaaaag 4261 gcgaaatagt tgccattaaa gggttaggtg gatttcatct 
agcttgcgac gccactcagg 4321 aaagtgctgt gcaaaaacta cgcagccgta aacagcgcaa tcaaaagcct tttgctttaa 4381 tggcgcgaga tatatcagtt attgaagagt actgcacccc caacaccaag gaaaaagaat 4441 tactggaaag tcccgccgcg cctgttgtat taatccaggc gaaagcggga gacgatgaag 4501 aagaaagggg aagtgggaga atacaattca aaattcgtct tgaaaagttt gctctgtcgg 4561 gaaaccaacg ccagatacca agtgagggag accctcatca agtactggct tctcctagga 4621 acttttcgca aaattcaaaa ttcaaaattc aaaacccaaa atccaaaatc caaaatcaaa 4681 ttgcgccctc ggttgcacct gggcaaaaca ctttgggttt catgctacct tacaccccgt 4741 tacatcacct cattctcagg cggatgaatc gcccgattgt tttgacaagt ggtaatcttt 4801 ctgatgatcc gcaatgtatt gataatgtcg aagcgcgtga gaagctagga aagattgcag 4861 attattttct tctccacaat cgggagattg taagccgagt tgatgattcg gtggtgaggg 4921 ttgttggcga taaagtgcta acaatccgtc gtgctagagg atacgcacca acatctatta 4981 atttaccatc tggatttgaa agcgttcctc acattttagc aatgggcagt cagctaaaga 5041 atactttttg tttattgcga gatggaaaag ctattctatc tcaacatata ggagatttag 5101 aaactgcagc cactttcaaa tcttatcagg atacgctcaa tctctactta aatttatttg 5161 agcatcaacc aaaagcgatc gctgttgatt tacaccctga atatttctca acaaaacttg 5221 gaaaagaact tgcacacaat cagtacggac acggcacagc cgtgccccta cttcacatcc 5281 aacatcatca tgctcatatt gctgcttgca tggcagaaaa tgatataaat attaattcac 5341 cacctgtttt aggtattgca ttagatggat tgggttatgg tgaagatgga acactgtggg 5401 gcggagaatt tctgttagca gattatcgcc aatttaagag attggcgaca tttaagccag 5461 taccaatgat tggcggaaca caagcaatga gagaaccctg gcgtaatacc tacgcccagt 5521 taagatctgc ctttacttgg gaagatttaa aacaaaaata cagaaattta gagattttag 5581 aatttctaga acaaaaaccg ctgaaacttc tcaatcagct tgtcgaaaaa gggattaatg 5641 ctcccttgac ttcatcagta gggcgtttat ttgatgcgat cgctgctgca attggtatct 5701 gccgagaaca atgtagctat gaaggacaag cagcaattga aatggaagct ttggctgata 5761 cgagttcttt aaaaaataaa gaatatttaa attataattt tcaatttctt gtttcagata 5821 atatttctta tatagatt // LOCUS NODE_5067_length_5719_cov_5.4427975719 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5719) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5719) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..5719 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(105..926) /locus_tag="DP116_26620" CDS complement(105..926) /locus_tag="DP116_26620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308088.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADPH-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_26620" /translation="MTNPTELLQSRYGEIPFNYDLPWNDFLETLLSHRSIRSYLSDPL PPKTVELLVAAAQSAASSANLQTWSVVAVEDEQRKEELYRLSNNQAQVKQAPLFLVWL ADLGRLAHIADSRGLPHVALEYIELLMKAIVDASVAAQNATLAAESLGLGTVYIGAIR NNTQEVATLLNLPSFVFPVFGLCVGYPNPEVQPAIKPRLPQSVILHRETYKLGEQDEA IAHYNDIMKQFYTQQKMNVPGDWSAQSAERVATLDYLKGCKNLREVLNNFGFKLL" gene complement(1081..1683) /locus_tag="DP116_26625" CDS complement(1081..1683) /locus_tag="DP116_26625" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015139983.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="heme-copper oxidase subunit III" /protein_id="PRJNA477356:DP116_26625" /translation="MNSSITSKELQHSAHEHAHDEEGSKMFGFTVFLLSETFIFLGFF TAYVVYKTTLPNWLPPGVSGLETKDPAINTVILVSSSFVIYLAERALARHDLTKFRQF LALTIAMGSYFLVGQAIEWSHLSFGFTTGVFGGMFYLLTGFHSLHVLTGILLQIIVFG RSFIPGNYDTGHFGINATSVFWHFVDVIWIVLFVLIYIWQ" gene complement(1780..1959) /locus_tag="DP116_26630" /pseudo CDS complement(1780..1959) /locus_tag="DP116_26630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319236.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="cytochrome c oxidase subunit I" gene complement(1969..3098) /locus_tag="DP116_26635" /pseudo CDS complement(1969..3098) /locus_tag="DP116_26635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012599779.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS630 family transposase" gene complement(4097..4444) /locus_tag="DP116_26640" CDS complement(4097..4444) /locus_tag="DP116_26640" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873365.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26640" /translation="MLLDEDSQAKYLLNLLRTAGHDVITVNEAGLGSCPDSTVFNYAR QQGLVVLTRNCDDFEQLHQAEPVHPGILAVYQNSDSSKNMSYQSIVAAISNLEAMSYV LKNQFVILNQWNY" gene complement(4434..4817) /locus_tag="DP116_26645" CDS complement(4434..4817) /locus_tag="DP116_26645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873364.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26645" /translation="MSLQELKEQAFKLSVSDRLALISALIQSLQTDSQTEDWKYLVTR PHPWRRQLYIKGRKLLVSTVWQDMIANKMSAEQAAENWDLPLAAIHEAINYCENNREL LKLEADEERYRLETKGISLEPTTAA" gene complement(4850..5629) /locus_tag="DP116_26650" CDS complement(4850..5629) /locus_tag="DP116_26650" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26650" /translation="MQQRSIKRKQNIMDKEYAPMFVWSPDISKEYKHFKLTSKCPECG VFHWTLKLVKSDGTPVPMKLRLERDTYIECRKCQRRFPICNTSNNHPQNLTEVPTVKV HRSTEKKRVQVANEVINVPQGVTITVKRSRVIEHSIDINWQFSSGIEIELGLQPIIKT VVRGEIEKAQGRKYQESETIEYEIELSGETSNRYSLTWTDIWLTGKTEIQQSNTTRIL PFQFREYSELEVAPASFCRHCGFKLEALNAFCPSCGKKVPV" BASE COUNT 1617 a 1183 c 1159 g 1760 t ORIGIN 1 ccttacaccc tgtccgaagg gcagtgacag gattatatta atagtttctg agcagtttaa 61 gccgctgctc agaattcttt gtgtcttttt gttgctgatt ccttttatag caacttgaag 121 ccgaagttat tcagaacttc acgcaagttc ttgcatcctt ttaaataatc caatgttgcg 181 actcgctcgg ctgattgtgc tgaccaatca ccagggacat tcatcttttg ttgagtgtag 241 aactgtttca tgatgtcatt gtagtgagcg atcgcctcat cttgttctcc cagtttataa 301 gtttctcgat gtagtataac cgactgaggc agtcgtggct taattgctgg ttggacttca 361 ggattcggat aacccacaca caacccaaac acagggaaca caaaagatgg taaattcagt 421 agtgttgcta cctcttgcgt gttgttgcgt attgcgccaa tatatactgt tcctaaacca 481 agggactcag cagcaagggt ggcattttgc gctgctacag aggcgtcaac tattgctttc 541 atcaataatt ctatgtattc caaagcaaca tgaggtagtc cgcgactgtc agcaatatga 601 gccagacgac ctaaatctgc taaccaaacc aagaataaag gagcctgctt gacctgtgct 661 tggttattgg acagcctgta taattcttct ttacgttgtt catcttcaac ggctaccaca 721 ctccaagttt gcagattagc agaactagca gcagattgag cagctgcaac taataactct 781 acggttttcg gtggtagagg atcagataaa taagaccgaa ttgaacggtg agacagtagc 841 gtctctagga aatcgttcca gggaagatcg taattgaagg gaatttcacc gtagcgcgat 901 tgcagcaatt ctgtaggatt tgtcatggtt aacttttaac tccaacatgt tctttgagtc 961 acaaaaaact ttaggaatga gaacacctca actcttgttg aatcgttcag tcaaatgttc 1021 tccaactcgc acgcgtattg gcaatgattg tcaaagctgg actaatctcc cacttttgtt 1081 tcattgccaa atataaataa ggacaaacaa aacaatccag ataacatcaa caaagtgcca 1141 gaacacagag gttgcattta taccaaaatg acctgtgtcg tagttacctg gaatgaaaga 1201 gcgaccaaaa acaataatct gcaacaaaat accagtgaga acgtgtaaac tgtggaagcc 1261 tgttagtaaa tagaacattc caccaaaaac tcctgtggta aagccaaagg aaaggtggct 1321 ccactcgatc gcttgtccta 
ctaaaaagta acttcccatt gcgatcgtta acgcaagaaa 1381 ctggcgaaat tttgtcaagt cgtggcgtgc aagggcgcgt tctgccaaat aaatgacaaa 1441 gctactggaa acaagaatca ccgtatttat tgcaggatct ttggtttcta gtccggaaac 1501 accaggaggt aaccagttag gaagtgttgt tttgtagaca acatatgctg taaaaaaacc 1561 gaggaaaatg aaagtctctg aaagcaagaa cacagtgaag ccgaacattt tgctgccttc 1621 ttcatcatga gcatgctcat gtgcggaatg ctgtaactct tttgaagtga tcgaactgtt 1681 catttgataa attctcacag gttgttagtg gttggtagtt agtggaaaaa caaatgatgt 1741 acaaacgctt tcatcagagc gcgtctgtac tactaatcac taacgctctt gcaccaatgg 1801 ttcagatttt ccgtaggcgt agggttcacc aataacaact ggaatttcgt caaagttctc 1861 tacaggaggt ggtgaagaaa cttgccactc aagtccaatg gctcgccaag gattcttgag 1921 tgcttgctgg tcatgcatcc aagaaccaat catattaata ccaattctct atgaagttgc 1981 acttaataaa gcggaggtga tgtaatccca tccacaaatt gaagcgttaa gcgtagctct 2041 gcctacggca atcgcttcgg atgagataga atcaagaacc tgttttaatt tccgtcttaa 2101 gtcatctaaa tttgtacaat gttcccaaga tagtttgtac ttaatatgtt cccataagcc 2161 ttcgatgggg ttaagctctg gactgttggc gggctgaaaa atgggaataa cattatctgg 2221 ccattttaag tccaaggatt tgtgaaaact gccattgtcc atttgtatca aattgagtgc 2281 gtcaggatat gcttttgaaa aaaggtctaa gaaatcttga aaacacgtac catcaaggtg 2341 agaaaactcc cagaaaaaac tatctccagt tgctggttca accacaccgt acaaataaaa 2401 gttttttcgt tgccaactaa cgatgccttc aggcttaact cctttgaggg taatttgacg 2461 cccagtaagg gttttgagtc ccaatcgact ttcatcttca caccaataac gtactggttg 2521 atctgttcct aaataacgca ataatacttg gattatttct ggcagttttt tttaaatgcg 2581 tcttggactt caggcaaagc cttagcactg cgggggcggg gtactttcaa tttggcattg 2641 agtttgtaac gcacaatttt gtgtacagtc ttgtacgcaa ctacaacctg gtactcttga 2701 gtgagccact cttgaatttg gctataactt ctgaatcctt gaggttgact taagcgttga 2761 tgtagttgag cgatcgctgt ttctggaatt aaacctggct ttcctggggc tgtcttaatt 2821 tcgagtaatg catcaatgcc tccttgctta tatcgtttca accatcggta cactgttgat 2881 tcgtctcgtc caagctgttg cgatagttcc tgtcttgtag cgttagcgtt cgcgcagcgt 2941 gtccctttgg gactcagcgt gccgcaggca ttcgctttgg tcttaatcca atacagcatc 3001 tgcaaccgtt ccttagttgt cgctgttacc 
gcacgccgca atcgatgttg tagctcttcg 3061 aggctctcat ctacttcgac tcgaaagtta tttcccataa ccttagatag acagtttcta 3121 cttctactaa tttagtgcat cttcatggag aattagtata actcctaaca ctaaacaacc 3181 acaatctttt gacttctata acgactcagg ttgaacttaa gatctgtcta cataacagca 3241 gcaattttca ttttggagaa tgattttcac acaagagctt cattttgata agtagaagct 3301 cgcacagaat gaatatagaa ttgcagcaat tatcgttatg ctttttcaac aggtgtttgg 3361 atggaactcc agctcaagcc cctaaatcat gaagttcagt agactatttt agtaatcgca 3421 gtagagataa tgaggtaaag cttgcaagtc atcacaaaaa gtcctaaagg gttggtcaaa 3481 tgctgactga gttggtattt ttcatgcgtg aagtcaagca gaatatgaaa cagaaacttg 3541 tgaattaagt cttcggtagc ctggtgaatc acaaactgcc tgccacagtc ttaacacttg 3601 aaatttagtt tcccgttatg gattttacca ttagaaacta tgtattgaga gccacagttt 3661 ggataggtga ggagtgcttc tgacatcgat gcaattagaa tcaattacac tgtatcctta 3721 cctatttagg actattgatt tcccatatat aaagaatatc atacatcttg ccaatttact 3781 tgcgattgtc aatttttttg caaagagtat tcgtggttac aattaatcaa aaattagaag 3841 attctcaacc aacaattacg tttttcttgt attgtttcga ggttaacagt tttgatctga 3901 atttatatct taataagacc atcaattgtc tatagtttgt gcaaatacct cgcgatgctt 3961 cagaggagcc acccctttgg ggcgatcgca gtaacgattg agttcagcgg cggctaataa 4021 cttttgtatt tcaccagtaa cctccattcc gccactgtaa cgatttgtta tgccgcgagc 4081 tgtgcaattc ccagagttaa taattccatt gattaagaat aacgaactgg tttttaagca 4141 catagctcat cgcctcaaga ttggagattg ctgcaacaat tgactgatag ctcatgtttt 4201 ttgatgagtc cgagttttga tagactgcca aaatacctgg atgaactggt tcagcttgat 4261 gaagttgctc aaagtcatca cagttgcgag tcagcaccac taacccttgt tgcctagcgt 4321 aattaaatac ggttgaatct gggcaacttc ctaaacctgc ttcattgaca gtgattacat 4381 catgccctgc tgtgcgaagc aaattgagca gatactttgc ttgagagtcc tcatcaagca 4441 gcagttgtag gctcaagaga aattcccttt gtttctaaac gatacctttc ctcatctgcc 4501 tctaatttca gcagttcccg attattttcg caatagttaa tagcctcgtg aatagctgcc 4561 aagggtaagt cccaattttc tgctgcttgt tcggctgaca ttttattggc aatcatatct 4621 tgccaaacgg ttgagacaag caacttacgc cctttaatat atagttgtcg tcgccaaggg 4681 tgaggacgtg tgactaaata tttccaatct tctgtttggc 
tatcagtttg taaagattga 4741 atcagggcac taatcagggc aaggcgatcg ctcacagaca atttgaaagc ttgttctttg 4801 agttcttgta gtgacataaa agcgtttctt aacagtgctt atgttcattc tagacaggaa 4861 ccttcttgcc gcaacttggg cagaaagcat ttaaagcttc aagcttaaat ccacaatgac 4921 gacaaaaact agccggagcg acttcaagct cactgtattc acggaattga aaaggtagta 4981 tacgagtcgt attactttgt tgaatttccg tctttcccgt aagccatata tcagtccatg 5041 ttagagagta acggttactc gtttctccac tcagttcaat ttcgtattct attgtttcgc 5101 tttcttgata ttttcgtcct tgtgccttct caatctcgcc acgaaccacc gtttttatta 5161 tgggttgcaa tccaagttcg atttcaattc cagaggaaaa ctgccagtta atgtctatac 5221 tatgctcgat cactcttgaa cgctttacgg taatggtgac tccttgtgga acattaatta 5281 cttcatttgc aacttgcact cttttctttt ctgttgacct atgaactttt acggtgggca 5341 cctccgtaag gttctgtgga tgattgttag atgtgttgca aataggaaat ctacgttggc 5401 attttcgaca ttcaatatag gtatcacgtt ctaatctcaa cttcatggga actggggttc 5461 catcactttt tacaagctta agagtccagt gaaatacacc gcattctggg cacttgctgg 5521 tcaacttgaa atgcttgtac tctttagaaa tatctggaga ccaaacgaac atgggggcgt 5581 attctttatc catgatgttt tgttttcgct ttatgcttcg ttgctgcatc aagcaagaat 5641 ttcaattttt accaaccaaa ctaataggtt atcaagtttg ttttgaggaa taaaaaagat 5701 gtaatgtgaa gaagaagga // LOCUS NODE_5121_length_5602_cov_5.4642155602 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5602) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5602) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5602 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1778) /locus_tag="DP116_26655" CDS complement(<1..1778) /locus_tag="DP116_26655" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314268.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="L-glutamate gamma-semialdehyde dehydrogenase" /protein_id="PRJNA477356:DP116_26655" /translation="MVLQVQTSTYEAKTQEIAKQLLAATQENRSFFAALRDQMRWDDK LLAWAMSNPGLRVQLFRFIDTLPALRSKPEIAAHLQEYLGDETVELPTALKGMLGFAN PDSMPGQVAATTVSTGVETLAHKYISGENIKQALKTIERLRKEKMAFTVDLLGEAVIT EAEAQSYLERYLDLMQQLVQASKSWTPVGLIDEADGQPLPKVQVSVKLTAFYSQFDPL DAKGSEERVSDRIRTLLRRAKELGASVHFDMEQYAYKDLTFSILKKLLIEEEFRSRTD IGMTIQAYLRDSEQDAQNLIDWAKQRGYPITIRLVKGAYWDQETIKAEQKHWQQPVYN DKAATDANFERITQLLLENHQYVYAAIGSHNVRSQAHAIAIAQSLNVPRRSFEMQVLY GMGDKLASSLVDRGYRVRVYCPYGELLPGMAYLIRRLLENTANSSFLRQNMENRPVEE LLAPPVVNSASDASPSLAGGTEGGFSGAPDTDYAEEEERKEAVQAFQNVRQQLGKTYL PLINGEYVQTQQVVDSVNPSNFSEVVGKVGLLSVEQAEQAMQAAKAAFPAWRKTPVKE RAGILRKAADLMEQRRAELSVWIVLEV" gene complement(1998..3686) /locus_tag="DP116_26660" CDS complement(1998..3686) /locus_tag="DP116_26660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015138128.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GMC family oxidoreductase" /protein_id="PRJNA477356:DP116_26660" /translation="MLKDARLISAEQVIETEVCIVGAGAAGITLAREFIGHKTKVCLV ESGGLEFDPQTQALSEALTVGDPFLSLKEIRCRRFGGNSNLWSIKIGKGRKGVRYAPL DEIDFEKRAGLPYSGWPIERSHLEPFYKRAQSVCQAGSFAYDPDIWEDKQAKRLPLNE NQVVTAMFQFGPGDIFNQQYRDELSRVDNITVYLNLNALEIETNESVRTATRLRVASL QGQQFWIAAKVFILAQGGLENPRTLLMSNRQQTAGLGNQHDLVGRFFMDHPLVQGGDF IPADANLFNKMPLYDLRRVNDVPVLGYFKLSQEVMQRENLVNMSTVLFPRPSKRGLKA IESFKAVAQALAHGKLPQVSGQDFLNIIGGIDYVMLANYLAATKNQSLLHGFHRGGWS ELPNNQHRFKVFEVIHQVEQVPNPANRVVLSKERDALGCQKLELHWRWDTDNALNIQR SQEIMAQEIASSGLGEYRIQYKEGLPKLGMPAGTAHHMGTTRMHIDPKQGVVDENCQV HGISNLFIAGSSVFPTGGYANPTLTIVALSIRLADWIKKSLVRADTNLLVSSHC" gene complement(4134..5198) /locus_tag="DP116_26665" CDS complement(4134..5198) /locus_tag="DP116_26665" /inference="COORDINATES: protein motif:HMM:PF13599.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26665" /translation="MTLVNSENFRFYDKKAKMRLISLVKQVTWLGLGFTLVLLLTKGH PSEFMIGLYLCFTFFVTWSKHWWFEKKPENVLTISPPLNHKELSDTENELDCQSNLIE KALHLKSQSHNNSMNFILANLKNAQLADLNLNCAYLSGSNLSHANLSGASLWNSNFST AQLVDTNFSNAILKNANLSSSNLINANLSSADLKEANLSNANLSNADLRDTDLSGANL SNADLSDANLSSGNLTNVYLWSANLNRANLLGANLSSAYMNSTKLRNANLSGALLKDT NLSDAKLQEANLTGTNLCGANLCCANLKNANLSNANLENANLEGANLEGADLSTANLN GANLRCVRINQATQMHKKWL" gene complement(5354..5464) /locus_tag="DP116_26670" /pseudo CDS complement(5354..5464) /locus_tag="DP116_26670" /inference="COORDINATES: protein motif:HMM:PF05199.11" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 1556 a 1272 c 1160 g 1614 t ORIGIN 1 acttccaaaa ctatccaaac cgaaagttca gcgcggcgtt gttccatcaa atcagccgct 61 tttcgtaaaa tcccagcgcg ttcttttaca ggtgttttgc gccaagcagg aaacgcagct 121 ttcgcggctt gcattgcttg ttccgcttgt tcaacactca ggagtccaac cttaccgaca 181 acctcactaa aattcgaggg gttaacagaa tctacaactt gctgcgtctg aacgtactca 241 ccgttgatca gagggagata agtttttcca agttgctgac ggacattttg gaaagcttga 301 acagcctcct tcctctcctc ttcctcagca taatctgtat caggagcacc cgagaacccc 361 ccctcagtcc cccccgcaag cgagggggaa gcatcagagg cggagtttac gactggcggc 421 gctaataatt cctcaacggg gcgattctcc atattttgcc gcaaaaagga actattagca 481 gtattttcca acaagcggcg aattagatac gccattcctg gaagaagttc tccataagga 541 cagtaaactc gaacgcgata acctctatca actaacgatg aagctagttt atcgcccata 601 ccgtagagaa cttgcatttc aaagctacgg cgggggacat ttaagctttg tgctattgct 661 attgcatgag cttgcgatcg cacgttatgg ctaccaatcg cagcatacac atattgatga 721 ttctctagca ataactgagt tattctctca aaatttgcat ctgtcgccgc tttatcgtta 781 taaactggct gttgccaatg cttctgttct gcttttatag tttcttgatc ccaatacgcg 841 cctttcacca aacgaattgt aatcggataa ccgcgctgtt ttgcccaatc aatcaagttt 901 tgtgcatctt gctcgctatc 
gcgcagatag gcttgaattg tcataccaat atctgtacga 961 gagcgaaact cttcttctat taaaagtttt ttcaaaatgc tgaaagtgag gtctttgtat 1021 gcatactgtt ccatgtcaaa atgaacagaa gcgcctagtt ctttggcacg acgtaagaga 1081 gtgcggatgc gatcgctcac ccgttcttca ctacccttag catccaacgg gtcaaactgc 1141 gagtaaaacg ctgttaactt cacagaaacc tggacttttg gtaatggttg tccatcagct 1201 tcgtcaatca acccaactgg cgtccaactc ttcgatgctt gcaccaattg ttgcatcaaa 1261 tctagatagc gttccaaata agactgcgct tctgcttcgg taatcacagc ctcacccaat 1321 aagtctacgg tgaaagccat cttttcctta cgcagtcgct caattgtttt cagcgcctgt 1381 ttaatatttt ctccagaaat atatttatga gctaatgttt ctacacctgt ggaaacagtt 1441 gtcgcagcga cttgtccagg catagaatca ggattagcaa agcccagcat tcccttcaag 1501 gctgtaggta attctactgt ttcatcaccc agatattctt gtaaatgggc tgcaatttct 1561 ggtttactac gtaaagcagg aagagtatca ataaagcgaa acagttgcac ccgtaagccc 1621 ggattgctca tcgcccaagc taacaactta tcgtcccagc gcatctgatc tcgcagtgca 1681 gcaaagaagg aacgattctc ttgagtcgcg gctagaagct gtttagcgat ttcttgggtt 1741 ttggcttcgt aggtgcttgt ttgtacttgt aataccactg ataacaaact ccatattcaa 1801 gacaaggctt cattgtatac agcctctatc ttctattgtg gctttttcat tgtgcgacta 1861 ccacttccta aattggtagt cgctagagga agcctataca agtaaggcta gcgcctgaga 1921 cggtgatgca gtccccaagt atggaaaaac cacgcctagc ttcacaggaa agcgaattaa 1981 gcgtggtctg tataggtcta gcaatgacta gatacgagta aatttgtgtc agcccttact 2041 agggacttct taatccaatc tgccaaacga attgaaagtg caacaattgt cagggtaggg 2101 ttagcatatc ctcctgtagg gaaaacggag ctacctgcta tgaataagtt agagatgcca 2161 tgaacttggc aattctcatc aacaactccc tgtttggggt caatatgcat tcgggttgtt 2221 cccatgtgat gggcggtacc agcaggcata ccaagcttgg gaagaccttc tttgtactgt 2281 attctatatt cacctaaccc agaagaagca atttcttgag ccatgatttc ttgagaacgc 2341 tgaatgttga gggcattgtc agtatcccag cgccagtgaa gttctagctt ttgacaccct 2401 agtgcgtctc gttctttgct caaaacaact cgattagcag gattgggtac ttgctcgact 2461 tgatgaataa cctcaaagac tttaaacctg tgttggttat ttggcaattc tgaccatcca 2521 cctctgtgga atccatgcag caaggactga tttttagtag cagccaggta attggcaagc 2581 atcacgtaat caataccacc gatgatattg 
agaaaatctt gaccagaaac ctgtggaagt 2641 ttgccatgag caagcgcttg agccacagct ttaaaagact caattgcttt gagtcctcgc 2701 ttactaggtc gtggaaacag aacggtgctc atattgacta aattctcacg ctgcataacc 2761 tcttgcgaga gtttgaaata gcctaatact ggaacatcgt tcacacgacg tagatcgtac 2821 agaggcattt tgttaaataa gttagcgtca gcagggataa aatcgccacc ttgaactaag 2881 ggatgatcca taaaaaatct gccgactaaa tcgtgttggt tgcccaagcc agccgtttgc 2941 tgacgattgg acatcaacag cgtacgggga ttttctagcc caccctgggc gagtataaac 3001 accttggcag ctatccaaaa ctgctgtccc tgcaaagaag caacccgtag acgagtggct 3061 gttctaacag actcgttagt ttcaatctca agggcgttta ggttgaggta aacagtaatg 3121 ttatcaaccc tactaagctc gtcccgatac tgctggttga atatatctcc aggtccaaac 3181 tgaaacattg cagtcactac ttggttctca ttcaggggta accgcttcgc ttgtttgtcc 3241 tcccatatat caggatcata ggcaaacgaa cctgcttgac aaactgattg agcacgtttg 3301 tagaacggtt ctaagtgcga tcgctcaatt ggccaaccgc tgtagggtag ccctgctcgc 3361 ttctcaaaat caatttcgtc caaaggggca taccggacgc ccttgcgccc tttaccaatc 3421 tttatcgacc aaaggtttga attgcccccg aaccttcgac accggatctc tttaagtgac 3481 aaaaagggat ctccaacggt taatgcctca gaaagcgctt gagtttgtgg atcaaactca 3541 agtccgccac tctccacaag gcaaactttt gttttatgtc caataaattc acgggctagg 3601 gtgattccag ctgcgcctgc accaacaata caaacttctg tttcaattac ttgttctgca 3661 gagattaaac gagcatcttt gagcatatga aggtagatgt caaaaaaata ggatttttag 3721 cgatttttcc atatatagca cgaactatag atatatataa gaactctatt tgatttttta 3781 acaaaataga cttgccttga aaatagaggc tcaaagcatg caaaatcgtt cgagcatcat 3841 tttcattgtt ccttgtgagc gatcgctcat gatgtctgtt gcaaaaatgt ttgaaatgtc 3901 atgttgagcg ttcgcgtcag cgcaagcgcc cccgaagcgg ggtacaaagc agcgtgtccc 3961 tttgggacga gcgcgcgtac tgcggtagcc gtggcgtgag ccataagcat agctaaagat 4021 ctcttgagat ttttcgctac actgcgtttc gcaccctgcg ggaagccttt gggttgcgtt 4081 tacagaatga catcaacgtg aagctcaccg gaaagaatca aagtttccca aactcaaagc 4141 cattttttat gcatctgtgt tgcctggtta atcctgacac aacgtaagtt agcaccatta 4201 agattggctg tgctaagatc agcaccttcc aaattagcgc cctccagatt agcattctct 4261 aagttggcat tgctgagatt agcgttcttc aaattagcac 
aacaaagatt tgcaccacag 4321 aggttagtac cggtgaggtt cgcctcttga agtttggcat cactcagatt ggtgtcttta 4381 agcaaagcac cgcttaggtt tgcattcctc agttttgtac tgttcatgta agcactacta 4441 agattcgcac ccaaaaggtt agccctgttg agatttgcac tccataaata gacattagtt 4501 aggtttccac tgctgaggtt tgcatcgctc aagtcagcat tgctaaggtt agcgccactc 4561 aaatcagtat ccctcaagtc agcattgctg aggttagcat tactgagatt tgcctctttc 4621 aaatcagcac tactgagatt tgcatttatg agattgctgc tactgaggtt tgcattcttc 4681 aagatggcat tactgaagtt ggtatctact aactgagcag tactgaagtt actattccat 4741 aggctagctc cactcagatt agcatgactt aggttgctac cgctaagata agcacagttg 4801 aggttaagat ccgcaagttg agcattctta aggttggcaa ggataaagtt catactgttg 4861 ttgtgacttt gagatttcag atgaagagcc ttctcaatca aattagattg acaatctagt 4921 tcattttctg tatcgcttaa ttctttatgg ttgagaggtg gagaaattgt taaaacattt 4981 tctggcttct tctcaaacca ccagtgttta ctccaagtaa caaagaaagt aaaacacaag 5041 tataaaccaa tcataaattc agatgggtga ccttttgtca atagcagcac cagtgtaaaa 5101 cctaacccaa gccaagttac ttgcttgact aacgaaatca gtctcatctt cgcttttttg 5161 tcatagaacc taaagttttc actgttaact aaagtcatgt tagatacctc tgtgtctttt 5221 gttaaaatga ctcaattttg ctaagaatga ctagccatct aacctaaaag caaaagctga 5281 aggatatttt tcatgggtta ccgtaagtag taccgtattc catttcatca ccgtttttac 5341 attgtctttt agcctaatcg ccaaagcgac gagaatgtgg gattagatat cctcctgtag 5401 gaaacacaga actacctgct atgaataagt tggaaacgcc atgaacttta aatctttcat 5461 tgaggttgtc attggtcttt aataggaagg actacatata tatctgggtt tatacttttc 5521 ttatagggaa aacgtaacaa aaatatgcaa atcctgtatc gaagttgtga cagattcact 5581 aaaaattgtg aaagtattta tc // LOCUS NODE_5141_length_5560_cov_4.6007275560 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5560) AUTHORS Alvarenga,D.O., Fiore,M.F. 
and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5560) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5560 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 
complement(45..1757) /locus_tag="DP116_26675" CDS complement(45..1757) /locus_tag="DP116_26675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874665.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="flavin oxidoreductase" /protein_id="PRJNA477356:DP116_26675" /translation="MNNPNPRDVQVLPIATNTTVLRARSWTRLRFEIEYALAKGTTSN CYLIEGDKTAIIDPPPETFTQIFLDALQQSVDLKHLDYVILGHFNPNRVATLKALLEL APQLTFVCSLPSAANLSATFPNHNLNIMTMRGKETLDLGKGHVLRCLPIPSPRWPGGL CTYDQQTQILYTNKLFGAHICNEEVFDEDWGVLKEDQRYYFDCLMAPHASHVQAALEK ISELQVRMFGTLHGPLVRYGLVELTQAYQQWSHSQTAREITVALLYASAYGNTATLAQ AIALGLTKGGVAVESINCEFASPEEIRAAFDKAEGFIIGSPTIGGNAPTPIHTALGTV LASGDNSKLAGIFGSYGWSGEAFDFIEGKLRDAGYRFGFETMKVKFKPSDVLLKQCEE TGTDFAQVLRKAKKVRMTQTAATPVEQAVGRIVGSICVVTAKLGEVSTGMMGSWISQA TFNPPGITVAIAKERAVESLMYPGGKFVLNILPEGHQQEYMKHFRKSFAPGEDRFAGF STKVADNGCIILTDSSAYLECSVNKRMECGDHWVVYATVDNGKLLKPDAVTAIHHRKA GNHY" gene complement(1946..3661) /locus_tag="DP116_26680" CDS complement(1946..3661) /locus_tag="DP116_26680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115750.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="diflavin flavoprotein A" /protein_id="PRJNA477356:DP116_26680" /translation="MVALAENVQKRLTVQTVEIAPNTTAIRCLDWDRDRFDIEFGLQN GTTYNSYLIKGDRVVLIDTAHTKFRDLYLNTLKSLINPKAIDYIIVSHTEPDHSGLVE DVLQLAPRATVLASKVALQFLENLVHDPFSKRIVKSGDRVDLGLGHEIEFVSAPNLHW PDTIFSFDRKTQTLYTCDAFGMHYCSDDTFDINLDAIEPDFKFYYDCLMGPNARSLVN AMNKMGDLGKIKIIANGHGPLLYHHLDILRECYQTWSHRQVQAQTIVALYYVSDYGHS ERLAHEIAEGLQKTGLGVEVMDLSSGEIQEIQELAGRSAGIIIGMPPSANVAAHAAIS SLLGVVKKQVVGLFECYGGDDEPIDPLRRKFLDLGIKEAFPAIRIKETPTPDTYQLCR EAGIDLGQLLVRERNIKQIKSIDANLEKALGRISNGLYIITTRKDDVNGAMLASWVAQ ASLQPLGFTVAVAKDRAIDSLMQVGDKFVLNVLEEGNYQELKKHFLKRLVPGGDRFAG VKTQTAKNGSPILTDALAYMECEIVSSIECSDHWLLYCTVADGKVSKPEGLTAVRHRK VGTYY" gene 4172..4630 /locus_tag="DP116_26685" CDS 4172..4630 /locus_tag="DP116_26685" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015111955.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26685" /translation="MEPVTLTAVATAIATLLLKKALEKTGENLGDAAWQQSRKLIEQL RTKNKLPSLTNATQGNEQQRLDYGQAVLELKAAADKDTEIAQGVREVEAAVNKDESQS AKEIQKLAEEIESQPSVVNNFAKLAEEIKAEKGAMVAQQITIGQQTNTYT" gene 4739..>5560 /locus_tag="DP116_26690" CDS 4739..>5560 /locus_tag="DP116_26690" /inference="COORDINATES: protein motif:HMM:PF05729.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26690" /translation="MQDESAKLQREQIYVPLALVQRTKPEKRDKGENSPEAGTRLYEP QYEEKQRFEHEAFLTQILEKAESKTKAKHIALIGEPGAGKTTLLQSIAFWVLEKNLGF PIWISLADLGRSGSLTDLQSYLLNNWLSFAVPPTQLTQAKAELTTQIQQGRVWLLLDG VDEVAVSGVQTLQTLAQQLTGWIAQSRVVLTCRLNVWQADYNALETFETYRLLDFDYP KQVHEFIDNWFGNLTPQRNLTPQPPLLIKERGSDKSEERGSDKNEEPGSKGERLKA" BASE COUNT 1577 a 1197 c 1271 g 1515 t ORIGIN 1 cccttacacc cttaccccct tacaccccta gttttgctca aagcttaata atgattgccc 61 gctttgcgat ggtgaatggc tgtcaccgca tcaggtttaa gtaatttccc gttatcaaca 121 gtcgcataaa caacccaatg gtcgccgcat tccatgcgtt tgttgacaga acattcaaga 181 tatgctgagg aatcagtcag gataatacag ccgttatctg caacttttgt ggaaaaacct 241 gcaaatctgt cttctcctgg tgcaaaagat ttgcggaagt gtttcatgta ttcttgctga 301 tgaccttccg gtaggatatt caggacaaac ttaccgccag gatacatcag ggattctacc 361 gcccgctctt tggcaatggc aacagtaata cctggtgggt taaaggtggc ttgagaaatc 421 caagacccca tcattccagt agaaacttct ccaagctttg ctgtcacaac gcaaatagat 481 cctacaatcc gaccaacagc ttgttcaaca ggagtcgctg cagtttgagt catacgcact 541 tttttggctt ttctcaagac ttgggcaaag tctgtacctg tttcttcaca ctgtttaaga 601 agaacatctg agggtttaaa cttgactttc atagtctcaa acccaaaccg atatcccgca 661 tcacgaagtt tgccttcaat aaagtcaaaa gcttcgccac tccatccata agaaccaaat 721 atcccagcaa gcttgctatt gtcaccgctt gccagtacag tacccaaagc agtgtgaatc 781 ggagttggtg cgttaccacc aatggtagga gaaccaatga taaaaccctc agctttgtca 841 aaagctgcac gaatctcttc aggtgaagca aactcacagt tgatggattc tacagcgact 901 cctccttttg ttaaaccaag ggcgatcgcc tgcgccaaag ttgctgtatt tccatacgcc 961 gaagcataaa gtaaagcaac ggtaatttca cgagcagttt gagaatggct ccattgctga 1021 taagcctgag taagttccac taagccatag cgtaccaagg gaccatgaag agtaccaaac 1081 attctgactt gcagttctga tattttctcc agcgctgctt gtacatgact tgcatgagga 1141 gccatcaagc aatcaaaata gtagcgctgg tcttctttta aaacacccca atcttcatca 1201 aagacttctt cattacagat atgggcacca aatagcttat tagtgtacag tatctgagtt 1261 tgctggtcat aagtacaaag acctccaggc caacgaggac taggaatagg gagacagcgc 1321 aaaacatgac 
ctttacctaa atccagagtt tctttccctc gcatcgtcat gatgttcaag 1381 ttgtggttag ggaaagttgc actcaaatta gcagcactag gaagcgaaca aacaaacgtc 1441 agttgtggtg ctagttctaa cagagctttg agagttgcga ctcggttagg attaaagtga 1501 cctagaatca cgtaatccaa gtgtttcagg tcaacagact gctgtaatgc gtctagaaaa 1561 atttgtgtga aagtttctgg gggtgggtca ataattgcag ttttatcacc ctctattaaa 1621 tagcaattcg atgtggtacc ctttgctagt gcgtattcaa tttcaaaccg tagacgtgtc 1681 caactacgtg cccttagtac tgtcgtgttt gtagcgattg gtaaaacttg tacgtcgcgg 1741 ggattgggat tattcatact tgtcgagtga gggagaggat gagaaggatg aggagagaaa 1801 aagtcaaaag aatttttaac ttttgaccct tacgggttcg ccagttgacg ccaggtgctt 1861 caagtcggga aacccgccca acgcactggc tcctttatgc cggggaaccc gtccaccgca 1921 ctggctcact tttgactttt gacttttaat aataggtgcc taccttgcga tggcgaacag 1981 ctgtgagtcc ttctggttta gagactttac cgtcagcaac tgtacaatat aatagccagt 2041 ggtcgctgca ttcgatgctg ctgacaattt cacattccat gtaggcaaga gcgtctgtga 2101 gaatggggga accgtttttg gctgtttggg ttttgacgcc agcaaatcta tcaccaccag 2161 gaactaaacg cttgaggaag tgtttcttta attcttgata atttccttct tccaaaacgt 2221 taaggacgaa cttatcaccc acttgcatca aagaatctat cgcacgatct ttggcgacag 2281 caactgtaaa tcctaaaggt tgcaaactgg cttgagccac ccaggaagct agcattgcgc 2341 cgttgacatc atcttttcgg gttgtgataa tgtagagtcc gttgctaatc cgaccgagcg 2401 ctttttctaa gtttgcgtcg atggacttga tttgcttgat gttacgttcg cgaacaagca 2461 attgtcctaa gtctatacct gcttctctac acaattggta agtgtctggg gtgggagttt 2521 cttttatgcg gattgctggg aaggcttcct taataccaag gtcgagaaat tttctgcgaa 2581 gtggatcaat aggttcatca tctccgccgt agcactcaaa taatccaacg acttgcttct 2641 tgacgacgcc taagagcgaa ctgatagcag catgagctgc aacgttagca cttggaggca 2701 taccgataat aatgccagct gagcgacctg cgagttcttg aatttcttgg atctcaccac 2761 tgctgagatc catcacttca acgccaagcc cagttttctg gagaccttca gcaatttcgt 2821 gggcaagacg ctcactgtgt ccgtagtcgg agacataata taaggcgact attgtttgtg 2881 cctgaacttg tctgtgactc caagtttggt aacattccct gagaatatca agatggtggt 2941 aaagtagggg tccgtgtcca ttggcgatga ttttaatttt cccaagatca cccattttat 3001 tcatggcgtt cacaagggat 
cgggcgttgg gacccatcaa gcagtcgtag taaaatttga 3061 agtcaggttc aatggcatct agattgatgt caaaggtatc atcgctgcaa tagtgcatcc 3121 caaaggcatc gcaagtatac agggtttggg ttttgcggtc gaagctgaat atcgtatcgg 3181 gccagtgtag gttaggtgca ctgacgaatt cgatttcgtg tccaagacca agatctacgc 3241 gatcgccact cttgacaatc cgcttagaaa acggatcgtg taccaggttt tctaaaaact 3301 gaagtgcaac ttttgatgct aagacggtag ccctaggagc taattgcaga acgtcttcca 3361 ctaaaccgct gtggtctggt tctgtgtggc tgacaatgat gtaatcaatt gccttcgggt 3421 tgatcaggct tttgagcgta tttaaataaa ggtcgcgaaa cttggtgtga gcggtgtcaa 3481 taagaacaac cctgtcacct ttgatgagat atgagttgta tgtcgtaccg ttttgcaatc 3541 cgaattctat atcaaagcga tcgcggtccc aatcaagaca gcgaatcgcc gttgtattag 3601 gagctatttc aacagtttgt acagttaacc gtttttggac gttctctgcg agcgctacca 3661 ttcgtcgcct ccgggcacaa attatcagtc ttctgcctct agttttcgca taattctttt 3721 tgtaaagttg tcaaaaattt tattttttag aaaattcaag atgtagttaa aattgctata 3781 tattggcgtt tttttttact tcaaaattgt tcttcagggt ctttttaata agactgttat 3841 tatatcacaa aaaacttgaa ataatgaatt attttactta caaattgtta tcaaaattgt 3901 cttcgtgtta actaatttgt agcaaaaggc atagattttg caaccgttgt gtttaacagt 3961 tatcagtgat cagttatcag tttcactgtt tactgttccc tcctttttaa ggagggcgag 4021 tttcggatca atccttaaca acaaccgtat tgagttatca gcgtatcttc agccaagagt 4081 gaggtgattt cgcgatcgct aggtttcttc ctgaaaaacc agtagatgct gcgttataaa 4141 gaatacagaa gaatctacta tgaaatgata tatggaacca gtcactttga cggctgttgc 4201 aacggcgatc gccacccttc ttctaaaaaa agcactggag aaaacaggcg aaaatctcgg 4261 tgatgctgct tggcagcaaa gtcgcaagtt aatcgaacag cttcgcacta agaataaatt 4321 accatcgctc accaatgcta cacaaggaaa cgaacagcag cgattagatt atggacaagc 4381 tgtgctggaa ctgaaagcag cagcagacaa agatacagaa attgctcaag gagttaggga 4441 agtagaagca gcagtcaaca aggatgagtc gcaaagtgca aaggaaattc aaaagttagc 4501 agaggaaatt gagtctcagc cctcagtagt caacaatttt gcgaaattag cagaggagat 4561 aaaagcagag aaaggtgcaa tggttgccca acaaattaca attgggcaac aaacgaacac 4621 ctacacttga ctgcagtcgc ccgacgggga gaatcgggta aagtcaactc ccatcaactg 4681 gcaagaaatt tgccgccaat tactggaatc 
ccagcggcga ctcaccagca atcctttgat 4741 gcaagatgaa tctgccaaat tacaaaggga gcagatttat gttcccctag cactggtgca 4801 gcggacaaaa cctgagaaac gcgacaaggg cgaaaattcc ccagaagcag gaacgcgcct 4861 gtatgaacca caatatgagg aaaagcagcg atttgagcat gaagctttct tgactcaaat 4921 cttggaaaaa gcagaaagca aaaccaaggc taagcacata gctttaattg gagaaccagg 4981 tgcaggaaaa acaactctgc tgcaatccat cgcgttttgg gtgttagaga aaaacttggg 5041 attcccaatt tggatttcgc tggcagattt ggggcgaagt ggttctttga cggatttaca 5101 aagttatttg ttgaacaatt ggctttcttt tgctgttcca ccaacacagc tgactcaggc 5161 gaaagcagaa ttgacaacac aaattcagca ggggcgagtt tggttgctgt tggatggggt 5221 ggatgaggtg gctgtatcag gtgtgcagac gttgcaaaca cttgcccaac aactcaccgg 5281 atggatcgcc cagtcacgag tggtgctaac ttgtcggctg aatgtttggc aagcggatta 5341 caatgcgcta gaaacttttg aaacttatcg tctgctagat tttgattatc ctaagcaggt 5401 gcatgagttt attgataatt ggtttggaaa cctcactccc cagaggaacc tcacccccca 5461 gccccctctc cttattaagg agagggggag tgataaaagt gaggagaggg ggagtgataa 5521 aaatgaggag ccggggagta agggggaacg gttgaaggca // LOCUS NODE_5191_length_5493_cov_5.6761685493 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5493) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5493) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5493 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 432..992 /locus_tag="DP116_26695" CDS 432..992 /locus_tag="DP116_26695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130554.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="methylmalonic aciduria and homocystinuria type D protein" /protein_id="PRJNA477356:DP116_26695" /translation="MQYPTVYTCTGGCPIHLVGKRGQGVQISIHVPSRYICANREQIL PDWKKQLFLWVVIVLQQSRYELVESTPEIETEKQRLREKFMRFGCDLAFNLRDRHYLT DLIDPRTGYPLLSRPGKILHDDTAAVKALLHYPVIQNKCRLLVHPNWGMAVYPGILIS QAPPIIIEWVIKSIAPLHDWKIKDEG" gene complement(1007..2002) /locus_tag="DP116_26700" CDS complement(1007..2002) /locus_tag="DP116_26700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015204428.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="capsular biosynthesis protein CpsI" /protein_id="PRJNA477356:DP116_26700" /translation="MTKVLVTGVAGFIGYHLAQRLVKEGVEVIGIDNLNDYYDVNLKK ARLAQLHSQPGFTFQFLELSDRPEVAQLFQNYTFDYVVNLAAQAGVRYSLQNPWVYID SNITGFVNLLEGCRQSQIKHLVFASSSSVYGINTKVPFAVTDNVDHPISLYAATKKAN ELIAHTYSHLYHIPTTGLRFFTVYGPWGRPDMAYFKFVKAIQEGKPIDVYNFGKMQRD FTYIDDVVEGVFRVMLKPPQSNVANIANSNAPYRLYNIGNNNPVELMTFIEVIEKALG KKAVKNLLPMQPGDVPATYADVDDLMRDVGFKPSTPLEQGIHCFLQWYQEYKQPE" gene 2704..>5493 /gene="treY" /locus_tag="DP116_26705" CDS 2704..>5493 /gene="treY" /locus_tag="DP116_26705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007357093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="malto-oligosyltrehalose synthase" /protein_id="PRJNA477356:DP116_26705" /translation="MRIPAATYRIQFNSEFGFDAAKAITNYLVDLGISDLYASPIFKA RPGSSHGYDVVNPTQLNPELGTQEAFESLVQELQQHKLGWLQDIVPNHMGYDSQNKYL MDVLEYGIHSSYVDYFDIPWNAPFARNYEPILAPLLGNFYGECLENGEIQLKYDETGL SVNYFSLQIPLKLESYAIFLTENLEKLAQALGREHPIFVRLLGVLFIVKSLPSETVAQ QRQDQAAFVKGLLWELYTENPEVKTFLDENLQLFNGEPGKPQSFNLLDSLLSQQFFRL CYWKVGAEEMNYRRFFTVNELISVRVENLKVFENTHDLIFKLVSEGKFTGLRIDHIDG LYDPKEYLDRLREKTGETYITVEKILQPGEDLPSNWSVQGTSGYDYLNYVNGIFCKTE NVDKLTEIYSNYTGLTTSFEEMVPDKKHFILERKLAGDIDNITSLLKTIASKHRYGND FTINGLKRALAEVLSRFPVYRTYVTQEDLSEIDRAYVHEVIEATKPHTPFLNHELNFI EKLLLLEYDDNITQAEKDQWLYFVMRVQQYTGPLMAKGVEDTAFYVYNRFISLNEVGG EPSRFGISASEFHDFNQQRQAKWCHAMSTTSTHDTKRCEDSRARLNVLSEIPEEWEKQ VRNWSEINSSHRKTVEQLTMPDRNDEYLFYQTLVGVFPFNEEEFPTCVERVKNYVIKA IREAKVHTGWLRQNSVYENAFTDFVTAVFEPLKDNPFLKEFLPFQRRVAEYGIFNTLS QTLLKITSPGVPDFYQGTELWDFSLVDPDNRRRVNFENRQSYLKAIKEQVKTDILKLI DELLANKEDARIKLFLVVQGLKARTEYLQVFQQGDYLPLEVSGKFKENIIAFARKDGN KTIITIAPRFFTSLIQPGESPLGKEIWHDTSLKLPTEVTSWKDAISEQTIEANGTLLI GEALKYFPVGLLVSQG" BASE COUNT 1713 a 956 c 1197 g 1627 t ORIGIN 1 agttgggagt tgggagttag gagttgggag ttgggagttg ggagttggga gttgggagtt 61 gggagttgag agttgggagt cgagtgttgg gagttgggag ttgggagttg ggagttagga 121 gttgcgagtt aggagttggg agttaggagt tgcgagttgc gagttgcgag ttgcgagttg 181 cgagttggaa gtcgcaactc gcaagttgct cactattcaa atagaactgc catagcagag 241 tgaaaaaaca gaaacttatc ataaaaggta aaataaagat taaagaatcc ttaaagataa 301 gtagaatatc aaaaatagct tgaaagcatc tcttcgtatt agggaaattt catatttttt 361 ttgactattt aagagttatc atcaatacaa aagatcgatg ttaaaaactt gtaaaaacca 421 aataaagttg agtgcaatat ccgacagttt acacttgtac tggaggctgt cccattcatt 481 tggttggcaa aaggggacaa ggtgttcaaa tttcaattca tgttcctagt cggtatatct 541 gtgccaaccg cgagcagata ttaccagatt ggaaaaaaca gcttttcttg tgggtggtga 601 ttgttttaca gcaatcaaga tatgaacttg tggaaagtac accggaaatt gagacagaaa 661 aacaaaggtt gcgggaaaag tttatgaggt ttggctgtga tctggctttt aatctgcgcg 721 atcgccatta tctaacagat ctcatcgacc 
cccgcactgg ctatcctttg ctttcccgtc 781 cagggaaaat tctacatgat gacactgcag cggtgaaggc tttactgcac tatccggtta 841 ttcagaacaa atgccgttta ttagttcacc cgaattgggg aatggcagtt tatcctggaa 901 ttctgatatc acaagctccc ccaatcatca tagagtgggt tatcaagagt atagcccccc 961 tacatgactg gaagataaaa gatgaaggat gaaaaattta tactttctat tcaggttgtt 1021 tgtactcttg ataccattgc aaaaagcaat gtataccttg ttccaggggt gtactgggct 1081 taaatccaac atcacgcatc aggtcatcta catcggcata agtcgcggga acatctcctg 1141 gttgcatcgg taataaattt ttcacagctt tcttgccaag cgccttttct atcacttcaa 1201 tgaacgtcat caattccaca gggttattgt taccaatgtt ataaagcctg taaggagcat 1261 tactatttgc tatgttggct acgttggatt gaggtggttt aagcatcacc cgaaagacgc 1321 cctcgacaac atcgtcaatg tatgtaaaat ctcgctgcat tttaccgaag ttgtagacat 1381 caatgggctt gccttcttgt atcgctttta cgaatttaaa atatgccatg tctggtcttc 1441 cccagggacc ataaactgta aaaaagcgta gcccggttgt tggaatgtga taaagatgac 1501 tataagtatg ggcaatcagt tcatttgctt ttttagttgc agcatatagt gaaataggat 1561 ggtcaacatt atcggtaaca gcgaagggaa ctttggtatt gataccgtaa acagaacttg 1621 aagaggcaaa tactaagtgc ttgatttgac tttggcggca tccttctaaa aggttgacaa 1681 agcctgtaat attgctatct atataaaccc aaggattttg taaagagtag cgtacgcctg 1741 cttgggctgc aagattgacg acataatcaa aggtgtagtt ttgaaagagt tgagcaacct 1801 cggggcgatc gcttaactcc aaaaactgaa acgtaaaccc cggctgcgag tgaagctgtg 1861 ctaaacgagc tttttttagg ttgacatcat aatagtcgtt taagttgtca atccctatca 1921 cctcgacacc ttctttcaca agacgttgtg ctaagtgata accaataaat ccagcaacgc 1981 cagtaaccag taccttcgtc atgaatatcc ctcaaaactc cataacagcc atcatagtca 2041 ttggttgaag tacagttaag ctgtagctag caattcaatt ggggggattt tcacgactga 2101 ggctgatgta gcttttattc tgggtgtgct atttgccgaa acagaaaaat cagatgaaaa 2161 aataaattgt gtggtggtgc aggtggagtg aaggtgagga ctatagcact tttggtgttg 2221 ggtgcaatac acagcatagc ttgagttcta ggcggacaag gttttcttct tgagaatcgt 2281 gcaagatttc agtctaatga acttcaatca aattgatgtc agatttatac ttataataat 2341 ctcagaaaag aattgaaaaa atcttttcta ttatactcat gaaaaatcac aatgtcatag 2401 tctcccatta ccagaaaaaa ttgcgatttg caacaactga 
aagtaagata tgagactttt 2461 ctgcttaacc aagcctcgta attttcatca aaacatctat aaggtttctt actttaatca 2521 aagatagatt tatctactac aggctgcgta ttctctgata agctatccag aggatatgta 2581 tatctgtggt tgtttgactc tttgatagac tagattgaat tgaatattct tctagtcatt 2641 ttagcagcat aaactggata cgaaaaaaac gctcattaga catatacccg agggatgaat 2701 tctatgcgga ttccagcagc aacttatcga attcaattca attcagaatt tggttttgat 2761 gccgcaaaag caattacaaa ctatctggta gatctgggaa tttcggatct ctacgcttct 2821 cctattttca aagcaagacc cggaagtagt catggatatg atgttgttaa ccccactcag 2881 ttaaatccag aattaggaac tcaagaagct tttgaaagct tagtccaaga attacaacag 2941 cataaactcg gttggttaca ggatattgtc cctaatcaca tgggttatga tagccaaaat 3001 aaatatttga tggatgtgtt ggaatacggt atacattcca gctatgttga ttactttgat 3061 attccttgga acgctccttt tgctagaaac tacgaaccaa tattggctcc attactgggc 3121 aacttttatg gagagtgttt agaaaatggc gagattcaac ttaagtatga tgagacagga 3181 ctcagcgtta actattttag tttacagatt cctcttaaat tagaatctta cgccattttt 3241 ttgactgaga atttagagaa attggctcaa gctttaggaa gagaacaccc aatatttgtg 3301 aggttgcttg gtgtcctgtt tatagtgaaa agtcttcctt ccgagacagt agcacagcaa 3361 agacaagacc aagcagcttt tgtcaaagga cttctttggg aattgtacac agaaaatccg 3421 gaagtaaaaa cctttcttga tgaaaatctg caacttttta atggtgaacc aggaaaacca 3481 caaagcttta atcttttaga tagcttactt tctcaacagt tttttcggct gtgttattgg 3541 aaagttgggg ctgaagagat gaattataga agatttttca ctgtgaatga gttgatctca 3601 gtcagagtgg aaaatttaaa agtttttgaa aacactcatg atttgatttt caagctagtc 3661 agcgaaggta agtttacagg cttgcgaatt gatcacattg atggtttata cgatccaaaa 3721 gagtatttag atagattgag agaaaagacg ggagaaacat atattactgt tgaaaaaatt 3781 ttacagcctg gagaagactt accaagtaac tggtcagttc aaggtacatc tgggtatgat 3841 tacctgaatt acgtcaacgg aattttttgt aaaactgaga acgtagataa gttgactgaa 3901 atctactcaa attatacagg gctaacaaca tcttttgagg aaatggttcc agacaaaaag 3961 cattttattc tagagagaaa gttagcgggg gatattgata acatcacttc tcttttaaag 4021 acaattgcga gtaagcatcg gtatggaaat gactttacaa tcaatggttt aaagcgagca 4081 cttgcagaag ttctcagtcg ctttcctgtt taccgaactt acgttactca 
agaggatcta 4141 tctgaaattg atcgcgctta tgttcacgaa gtgattgaag caaccaaacc acacacacca 4201 ttcttgaacc atgaattaaa ctttatcgag aaattactgt tgctggaata tgatgataat 4261 atcactcaag cagagaaaga ccaatggctt tactttgtca tgagagtgca gcagtacaca 4321 ggaccattga tggcgaaagg agtcgaagac actgcttttt atgtttataa ccggtttatt 4381 tctttaaacg aagttggtgg cgaaccgagt cgttttggta ttagtgcttc agaatttcac 4441 gactttaacc aacaacggca ggcgaaatgg tgtcatgcta tgagtacgac ctccactcat 4501 gataccaagc gatgtgaaga tagtagggca aggctgaatg ttctctcaga aattcctgag 4561 gaatgggaaa aacaagttcg caattggagt gaaattaata gttcccacag aaagacagtt 4621 gaacagttaa caatgccaga tagaaacgat gaatatcttt tctatcagac acttgtagga 4681 gtctttccat ttaatgaaga agagttcccc acttgtgttg aacgggtgaa aaattatgtg 4741 atcaaagcta tcagagaagc gaaagttcat actggatggc tgcgacaaaa tagcgtttac 4801 gaaaatgctt ttactgattt tgtcacagca gtttttgaac ccttgaaaga caatccattt 4861 ttaaaagaat ttttgccttt ccaaagacgg gttgctgagt atggtatctt caatactctt 4921 tctcaaacac tactgaaaat tacctcgcct ggagttcccg acttttatca aggaactgaa 4981 ctatgggatt ttagcttagt tgatcctgat aatcgccgtc gggtgaactt tgaaaatcgg 5041 cagtcttatc tcaaagcaat taaggaacag gtaaagacag atattctcaa gctaatagat 5101 gagttacttg ctaacaagga agatgccaga attaaactgt tcctcgttgt ccaaggtttg 5161 aaagcaagga ctgagtattt acaagttttt cagcaaggcg actacttacc tttggaagtg 5221 agtggcaagt ttaaagagaa tatcattgcc tttgcaagga aggatggaaa taagacaatt 5281 attaccattg ctccccgctt tttcacaagc ttaattcaac cgggagagtc cccgctagga 5341 aaagagattt ggcatgatac gagtttgaag ttacctacag aagtcacttc gtggaaagat 5401 gcgatcagcg agcaaactat tgaagcgaat ggcactttgt tgattgggga agcgttgaaa 5461 tacttccctg tgggtttgtt ggttagtcaa ggc // LOCUS NODE_5218_length_5450_cov_4.8630215450 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 5450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5450) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5450 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1095) /locus_tag="DP116_26710" CDS complement(<1..1095) /locus_tag="DP116_26710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315899.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26710" /translation="MFRKFFLLIIISFWGVVMFPMPKQANATTPLSVAVIQNLRNLVQ LMPHKNSKRAARKSDSMSPGDKLSTGRSALADLRFNDGSLARIGEQAVFQFLPQRRNF NLDHGTVLLLIPPGRGQTRISTPNAAAAIRGSALFVRYDQETDTTIVGALTNSGIQVS NKDASQKRELQAGQLIVLVKGELKGLYDFDLRTFYETSTLVRDLNLTLRNGIPNSDPA IASVQAETAAAVAAQSPVAGQDVVENPPFVKPTTSPNLPSNNITRDNSPVPGIIDNQD LVSNPVSSSTPKPRSIPTDSGTPPTNEPTTTTSPIVTPSGPVTPVPVTQPTITAPPVT QPSVPVTQPTITVPPVTQPSVPVTQPTITVP" gene complement(1988..2134) /locus_tag="DP116_26715" /pseudo CDS complement(1988..2134) /locus_tag="DP116_26715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012334317.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="peptidoglycan-binding protein" gene 2151..3122 /locus_tag="DP116_26720" CDS 2151..3122 /locus_tag="DP116_26720" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26720" /translation="MPQLEGQPADFLQPSSPAIEQRLQLPEIDYAQRLKRLSQLLQEK KQASTEVSDSELGIIRARPTKPLEPQKPPKRPERSIGYLQAHVGYFHTNNIFSSTVAP IEDGLIYSGLTLASMPIRLGNRTFLNGSIDGNLMRYINQSKYNYNQVRFNVGIFQQFS SQMYGEIGWSNQQLFYARNGTNFDAGQRFLNEDSFRLSLGRRDPLTSKLMLDSFYELR LSLTDPPSQQDNRDRLIHTAWVSLNYYLEPSLQIGLDYQFGLSDFTRRQREDIYHRLY GHFTYGLSNYTNLSVQGGVTLGSSTDRNIDVDGWFFSINYNLELGRF" gene complement(3156..3974) /locus_tag="DP116_26725" CDS complement(3156..3974) /locus_tag="DP116_26725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315901.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sulfurtransferase" /protein_id="PRJNA477356:DP116_26725" /translation="MTNTPFVVSCEWLLQHLDDAQVVIVDCRFSLADPQLGRQQYQES HINGAYYLDLNQDLSSRVEEHGGRHPLPNPIELANKLSTIGVHSQKTLVVTYDDSRLA FASRLWWLLRYLGHEQVAVLDGGFSAWKKAGYPVTDVIPEPQIVTFSPDLQPQMEVDI SAVKSRKDLPEVALVDSRESERYLGIKEPIDKIAGHIPGAVNYPWQEVTDAAGFLLPQ QQQRQRWLELENAKEFFVYCGSGVTACVNLLSLEVAGIPRGKLYAGSWSDWISY" gene 4081..5046 /locus_tag="DP116_26730" CDS 4081..5046 /locus_tag="DP116_26730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876251.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26730" /translation="MSDLDNFTPKLTADGSFTFFSHEFGESFHSHFGARQESFLKFAV PTQLPLKAHKPTVRLLDVCYGLGYNTAAALQTIWTVNPNCYIEVIGLELNTAVPQAAI AHHLFDNWEYQYTQILTELAFEHQLVQERLKATLLIGDARNSIRFIHQSGFKADAIFL DPFSPPHCPQLWTIEFIKQLSLCLDIDGLLATYSCAAAVRTALLAAGLQIGSTPPVGR KTPGTVAGHREVGKDDGVKENILVCFSPSVFPSSSSPSPFASLSQVEQEHLLTRAAIP YRDPHLCDTGDVILRRRQQEQQTSSLESTSSWRKRWSSKNQLNNM" BASE COUNT 1609 a 1180 c 1082 g 1579 t ORIGIN 1 tggtactgta attgtcggtt gtgtgactgg cacagaaggc tgagtcaccg gtggtactgt 61 aattgtcggt tgtgtgactg gcacagaagg ctgagtcacc ggtggtgctg taattgtcgg 121 ttgtgtgact ggcacaggag ttactggccc agaaggtgta actattggtg atgtggtagt 181 agttggttcg ttggttggcg gcgtccccga atccgtaggt atagacctag gcttgggtgt 241 ggaagatgaa acaggatttg acacaagatc ttggttatcg attataccgg gaactgggga 301 attatctctg gtaatgttgt tactgggtaa attaggggaa gtcgttggct ttacaaaagg 361 tgggttttca acaacgtcct gaccagccac tggcgattgt gcagcaacag cagcagcagt 421 ttcagcttga acgctggcga tcgccggatc tgaatttggt ataccatttc ttaaagtcag 481 atttaaatcc cgaaccaggg tactcgtttc ataaaaagtt cttagatcaa aatcgtataa 541 acctttcaat tcccctttaa ctagaactat cagctgtcct gcttgtagct ctcgtttttg 601 agaagcatct ttattagaaa cttgaatgcc actgtttgtc agtgcaccaa caatcgtggt 661 atccgtttcc tggtcataac gtacaaataa tgctgaaccg cgaattgctg ctgctgcatt 721 tggcgtactt atacgtgttt gtcccctgcc tggtgggatg aggagcagta ctgtcccatg 781 gtcaaggtta aagtttcttc tttgtggcaa gaattgaaaa accgcctgtt ctccgatccg 841 agccaaagac ccgtcgttaa aacgtaagtc tgccaaagca gatcgccctg tagataactt 901 atctcccgga ctcatagaat ccgatttccg tgccgcacgc ttagagttct tgtgcggcat 961 gagttgcacc aaattgcgga ggttttgaat gacagcaaca ctcaaaggag ttgtagcatt 1021 tgcttgtttt ggcatgggga acataacaac tccccagaaa ctaatgatta tgagtagaaa 1081 aaacttacga aacatagttt ggagcctcgg ataattttac tgataccaaa ccagttgaag 1141 acgggttcta atgttgacag tttaagccta ttatctgtta ttgccattgt ttacgtgaat 1201 atgcttaact ctaatttatt atttaatttg attttatctg taattatact gttaatcggt 1261 catgatagtt tattttaaga aaactcgtct 
ttgagaaact atagtgtaag cttcatacat 1321 acctcttaat gatataaatg gctagacact gatttaactt gaattaagaa gtagacatta 1381 cctctgtttt caagtcccga cccttcaggg cggagtcagt aacagtttat ttgcgttttt 1441 catataaaaa tgggtacatt taccaaagac tgtcgctaaa gtctcttgaa attataattt 1501 tttccaatgg aacttttagc acagtgagtg agtctactgt tgccaaaaat gaaagattaa 1561 atatgcaacc caagggctgg ataaatttgt tcttttggat tgtattagct tatagtagtg 1621 caatcttgtt ttacatcaag aaggtagatg cggcgcaggc tgctaataca gaccgatcaa 1681 tgactgaaga tgatctaaaa tcaagaataa catttttagg ggttgcatgg atacccgctc 1741 aattgtggca agagtgtggg caaatatctc gcaaagaagt agaaaaaata aattttggta 1801 aaaaagtagg gcgttctcag gaaaaagagg gcatgttttc tggctctttt gcctctacaa 1861 gctaccctgc aaatacttat gatagaatct caaatttata ttcccagaca ccaccagatc 1921 agacaccgcc accctcacca actcttgaac aaaatccacc gccagtacct gaaggaaatc 1981 aaactccagc accaccacca gcagcaccaa caccaccagc agcaccaaca gcacccacat 2041 caacaccacc cactccagca ccacctccag caccatctcc agcaccacct ccagcaccac 2101 gttctgcacc caccccagca ccacgacctg caccctctac accggataat ttgccacagc 2161 tagaaggtca acccgctgat tttctccaac cttcctcacc agctatagag caacgattgc 2221 aattacctga aattgattac gcacaaagat tgaagaggct atcgcaattg ttgcaggaga 2281 aaaagcaagc gtctaccgaa gttagcgatt ctgaactagg aataataagg gcacgaccaa 2341 caaaaccatt agaaccgcag aaacccccaa agcgtcctga gagatctata ggttatctac 2401 aggctcatgt aggctatttt catacaaata atattttttc ctcaacagta gctcccatag 2461 aagacggctt gatttactct ggattaacac tagcttctat gcctatacgc ttaggaaaca 2521 gaactttttt gaatggatca atcgatggca acttaatgcg ttacattaat caatcaaaat 2581 acaattacaa ccaagtgaga ttcaatgttg gtattttcca gcagttctcg tcgcaaatgt 2641 atggggaaat tggatggagt aatcaacagt tattctatgc aagaaatggt actaattttg 2701 acgctggtca gcgctttttg aatgaagatt ctttccgcct gtctttggga cgacgagacc 2761 ccctcacatc taaattaatg ttagatagtt tctacgaatt gcgtctgagc ttaactgatc 2821 ctcccagtca acaagacaat cgcgatcgac tgattcatac tgcgtgggtt tctttgaact 2881 actacttaga accatcactc caaattggtc ttgattatca gtttggttta tcagacttca 2941 cgcgacgtca gcgagaagac atataccatc ggttatatgg 
ccacttcact tatgggctat 3001 caaattacac aaatcttagt gtacagggag gtgttacttt gggtagctca actgacagaa 3061 atattgatgt tgatggctgg ttcttcagta ttaactacaa tttggaatta ggtcgttttt 3121 gagtcttctc tttgagcgtc ataaatcaca aatgcctaat aactaatcca gtcactccag 3181 ctaccagcat aaagtttacc tctcggaatt ccagcgactt ccagagagag taaattcaca 3241 caagcagtga cgccagaacc acaatagaca aaaaattcct ttgcattttc caactctaac 3301 cagcgttgac gttgctgttg ttgcggaagt aaaaaacctg cagcatctgt cacttcttgc 3361 caaggataat taaccgcacc aggaatatga ccagcaattt tatcaattgg ttctttaatt 3421 cctagataac gttcgctttc tcgtgaatca accaacgcta cttctggtaa atctttccgg 3481 cttttgacgg ccgaaatatc cacttccatc tgtggttgta aatcagggct aaaagtcact 3541 atttgaggtt caggaatgac atctgtaaca gggtaaccag cttttttcca cgcagaaaac 3601 cctccatcta ggacagcgac ttgctcgtgt ccaagataac gtaatagcca ccataaacga 3661 gatgcaaagg caagtcggga gtcatcgtaa gtaacaacta gagttttttg agaatggact 3721 ccgattgttg ataacttatt cgctaattca atcgggttag gtaaaggatg tcttcctcca 3781 tgttcttcga ctcgactgga aagatcctga ttcaagtcta agtaatacgc tccattgata 3841 tgactttctt ggtattgctg tcgtcccagt tgtggatcag ccagagagaa gcgacaatcc 3901 acgatcacaa cttgggcgtc atcaagatgt tgaagtagcc attcacaaga aacaacaaaa 3961 ggggtgttgg tcataaactc agattggtca ttggtcattg tttattgttt attgttagtt 4021 gttaggtgtg ccactctcta ctcaccactc atcactaacg actaaccact atctataact 4081 atgtcagact tagacaattt tacacccaag ctcacagcag atggctcgtt cacttttttt 4141 tctcatgagt ttggcgagtc gtttcacagc cattttggag cgcgtcagga gagttttctc 4201 aagtttgcgg ttcctactca actaccgttg aaagcacaca aaccaacagt gcgcttgttg 4261 gacgtttgtt atggtttggg atataataca gcggctgctt tacaaacaat ttggacagtc 4321 aatcccaact gttatataga agttattggt ttagaactga atacagctgt accacaagct 4381 gcgatcgctc atcacttatt cgacaattgg gaataccaat acacccaaat tttgaccgag 4441 ttagcctttg aacatcaatt ggtgcaagag cgtctcaaag caactctact tattggtgat 4501 gccagaaact cgataagatt tattcaccag tcaggtttta aggctgatgc aatttttctc 4561 gatccgtttt caccacctca ttgtcctcag ttatggacta ttgaatttat caaacaactg 4621 tccctttgtt tagacataga tggcttactt gccacttact cttgtgctgc 
tgctgtacgt 4681 acagcacttt tggcggctgg actccaaatc ggttctacac caccagtggg aagaaagaca 4741 cccggtactg tagctggaca tcgagaagta ggcaaagatg acggagtgaa ggaaaatatc 4801 cttgtctgtt tctctccttc tgtgtttccc tcttcatcct ccccatctcc ctttgcctcg 4861 ctttcccaag tagaacagga acatttactc actcgtgctg ctattcctta tcgcgatcca 4921 catttgtgcg ataccggtga tgtgattttg agacggcgac aacaagagca acaaacttct 4981 tctctagagt ctacctctag ttggcgaaaa agatggtcat cgaaaaatca actaaataac 5041 atgtagcatt tgtcgcttac tttttgtata atcaaaaata caaaatttct gtgatcaaag 5101 tcatcagtat aaatagcaga gtctcactga catgcagagg tagttatagt ttgataatta 5161 acatcagttt cacggctcaa atacgtacaa aactatctca aaagcataag ttaaaacccg 5221 atacacctta ttttgataga aataacatgc gttaataatt tcgttggaag agtaaataaa 5281 aaaatctcag taaaaacccg atttaatcta tttggaaaaa ttgcttatac taaaaacata 5341 taaaacaaaa taaggcagca atagtttcaa ttacatgaga ctataaacat ttaccgtaga 5401 aaaaaaacga gtatactttt ttgtcaaacc ctgtcaacac tcagggaggt // LOCUS NODE_5239_length_5417_cov_5.1930255417 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5417) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5417) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5417 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1535) /locus_tag="DP116_26735" CDS complement(<1..1535) /locus_tag="DP116_26735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317199.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26735" /translation="MSNPSIYTVGGTVQANQNGVYIPRKADEELLGLCRENKFVYILT SRQMGKSSLMVRTAEQLAEEGIQTVVIDLTQVGVQVTAEQWYLGLLTIVEGTLMLDTD VVQWWQERSHLGFTQRLTSFFQEVLLKEVAEPIVIFVDEIDTTLSLDFTDDFFAAIRY LYNARSQKPEFQRLSFVLVGVATPGDLIHDPKRTPFNIGQRVDLTDFTFEEAKPLAQG LGLSTQEAQQVLKWMLEWTGGHPYLSQRLCCIISQQDKKSLSKADVDGLVSSAFFGAM SEQDNNLQFVRDMLTKRAPDKEGVLTVYREIRRGKSVLDEEQSIVKSHLKLSGVVRRE HNVLQMRNRIYQQVFDLRWVNQHLPLKLRDWWERYKPLLPYLAGGLIFSVAMGTMALY ANEQRLRAEENGRKNVEILMQSITSENLFTSGLKLEALLEVLKVRKRLYDRDIEAGNR MKAVAILQQVVYGIREHNRLEGHRSVVRSVAFTPDGKTIASSGDDKTVRLWNLNGQLL KTLT" gene complement(1528..3669) /locus_tag="DP116_26740" CDS complement(1528..3669) /locus_tag="DP116_26740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749118.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_26740" /translation="MSENTRVKTILLLAANPRNTSALRLDEEVREIDEGLRRANKREL FKLEQKWAVRSRDFYRAILDTQPQIVHFCGHGAGEDGIVLEDETGQIAFVQADALASM FKLFAKKGVECVLLNACYSQVQAEAISQHIEYVIGMNQQIGDKAAVNFAVAFYDALAA GEEVEFAYDLGCSQLFGLKENQTPVFKRKLLNHPVAQKTTRQRIFISYKRDVEPDEPV ALEVFQALSQQHEVFIDQRMLVGTRWAERIEGEIRQADFLIVFLSSRSVYSEMVEQEI KMAHEMAQVQGGHPVILPVRLGYREPFQYPLSAYLNNINWAFWQGEEDTPRLIAELTQ AVCGGELPIDKAQAKAILLETSEPSALPRPFASAQPVSLAQPVRLEMPEGTMDSQSAF YVERSLDGIALTTIAHRGVTIAIKGPRQVGKSSLLIRAIEAAVNAGKRVAFLDFQLFD KAALTDAELFFRRFCSWLTYELEMEDRVEEYWKTPLGNSQRCTRYVQRHILKELGKTP LVLAMDEVDKVFDADFRNDFFGMLRSWHNSRATTPIWKQLDLTLVTSTEPYQLIDDLN QSPFNVGQVIELQDFTPEQVIDLNRLHGSPLNSSEERQLILLLGGHPYLVRRALYLVA SQQISTTELFANATAGNSPFGDHLRHHISLLHKKTELIQGLLQVIRQNICDDKRVFWR LRGAGLVREQGRAVLPRCQLYAEYFRENLRE" gene complement(3981..5174) /locus_tag="DP116_26745" CDS complement(3981..5174) /locus_tag="DP116_26745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206136.1" /note="Derived by automated computational analysis using gene prediction 
method: Protein Homology." /codon_start=1 /transl_table=11 /product="cell division protein FtsW" /protein_id="PRJNA477356:DP116_26745" /translation="MNLRRLTLLFDDSVSSWALEARLLRWLTLVWLFVGLVMLFSASY PVADVRHHDGLYYFKRQLLWVLVGFIVFNIVVHSPLNKVLSISHWFVIVLLTLIFATL VPGVGKKAFDASRWIALGPFPIQPSELIKPFLVLQSARLFGQWENLTWRVRFFWLGIF GLVLLGILGQPNLSTTALCGMTIWFIALAAGLPYKYLGGTAIGGVLLAVLSISIKEYQ RKRVLSFLNPWADATGDGYQLVQSLLAVGSGRQWGAGFGLSQQKQFYLPIQDTDFIFA VFAEEFGFVGSVVLLLLIALYATLGLIVALKAKHPVYRLVAIGVTILMVGQSLLHIGV TTGALPTTGLPLPMFSYGGNSMVASLMASGLIIRVARQSSEADVVPLREEGEKRRKRK LFQKK" BASE COUNT 1591 a 1414 c 1051 g 1361 t ORIGIN 1 gtcagggttt tgagtagctg cccgttgaga ttccacaacc gtaccgtctt gtcatcacca 61 gaagaagcga tcgttttgcc atcgggagtg aatgccacac tcctgacaac actgcgatgc 121 ccttccagtc ggttgtgttc ccttatccca taaacaacct gctggagaat ggcaactgcc 181 ttcatgcggt tacctgcttc tatatcccta tcatataggc gttttcttac cttcaatact 241 tctagcaaag cctcaagctt taaacctgag gtaaacaggt tttctgaagt tatactttgc 301 ataagaattt ctacgttttt gcggccattt tcttcagcac gcagacgttg ctcgttagcg 361 tacaacgcca tcgtgcccat tgccacagaa aatattaacc ctcccgcgag atagggtaat 421 aacggcttat aacgctccca ccaatcccgc aacttcaacg gcaaatgttg attcacccac 481 cttaaatcaa acacttgttg ataaattcga ttccgcatct gcaaaacatt gtgctcacgt 541 cgcaccacac ccgaaagctt gaggtgagat ttcactatcg actgttcctc atccaaaacc 601 gacttaccac gacgaatttc tcgataaaca gtcaaaacac cttctttatc tggtgcacgt 661 ttggtgagca tatcgcggac aaactgcaag ttattatctt gctcgctcat tgcaccaaag 721 aaagcactgc tcaccaagcc gtcaacatca gcttttgata aacttttctt gtcctgctgg 781 ctaataatac aacaaagacg ctgcgataaa tacggatgtc cacctgtcca ctccagcatc 841 catttcaaca cctgttgcgc ttcttgagtt gacagcccta gtccttgtgc cagtggtttt 901 gcctcctcaa aagtgaaatc agtcaagtca acgcgctgac cgatattaaa cggtgtacgc 961 ttagggtcat gtatcaaatc gcccggagtt gccaccccaa ctaggacaaa agaaagacgc 1021 tgaaactctg gtttctggga acgagcgtta tacagatacc gaatagcagc aaaaaaatca 1081 tcagtaaaat ctaggctaag cgtcgtatca atttcatcaa caaaaatcac tatcggttcg 1141 gcaacctctt tcagcaaaac ctcttgaaaa aatgaagtca gccgttgcgt 
aaaacccaga 1201 tgtgagcgtt cttgccacca ctgcaccaca tctgtatcca gcatcagagt tccttccaca 1261 atcgtaagta accccagata ccactgttca gcagtcacct gcacgcccac ctgagtcaaa 1321 tcaatcacaa ccgtctgaat tccttcttct gccaattgtt cagcagtccg caccatcagg 1381 ctagatttgc ccatctggcg tgaggtgagg atatacacaa acttattttc ccgacacagc 1441 cccagcaatt cctcatctgc tttgcgggga atgtagacac cattttggtt ggcttgcaca 1501 gttccgccaa cagtgtagat gcttgggtta ctcacgcaag ttctcccgga aatactcggc 1561 ataaagctga cagcgtggca aaactgctcg tccttgttcg cgcaccaaac ccgcaccccg 1621 cagccgccag aagacgcgct tatcatcaca gatgttctga cgaatgactt gcagcaaccc 1681 ttgaatcaat tctgtttttt tgtgcagtaa tgaaatgtgg tgacgcagat gatcaccaaa 1741 aggactatta ccagcagttg cattagcaaa taactcagtc gtagaaattt gctgactcgc 1801 gactaggtaa agcgcccgac gcaccaaata aggatgtccg cccaacagca atatcaattg 1861 tctttcctca ctggagttga gaggtgaacc gtgaaggcgg ttgaggtcaa tcacttgttc 1921 tggtgtaaaa tcttgtaatt caatcacctg tcccacatta aaaggtgatt ggttgaggtc 1981 gtcaatgagt tgataaggtt cggtggaagt caccagcgtc aaatccagct gcttccaaat 2041 gggcgtggtg gcgcggctat tgtgccaact gcgcaacatc ccaaagaaat cattgcgaaa 2101 gtcagcatca aaaaccttat ccacctcatc cattgccaaa accaggggcg tcttgcccaa 2161 ctcttttaaa atatggcgtt gaacatagcg cgtacaacgt tgactgtttc ctagtggcgt 2221 cttccaatac tcctccactc gatcttccat ttccaactca taagtcagcc agctgcaaaa 2281 ccgcctaaag aaaagttcag catctgtgag ggcggctttg tcaaaaagtt gaaaatccaa 2341 aaaagcaact cgcttacctg cattaaccgc agcttcaatt gcgcgaatca acagcgaact 2401 cttaccaact tgtcttggtc ctttgatggc gatggtgact cctcggtgtg caattgttgt 2461 gagggcaatg ccatccaatg aacgttccac ataaaatgca gattgggaat ccatcgttcc 2521 ttctggcatt tccaacctga ctggctgggc taaagaaact ggctgcgctg aagcaaaagg 2581 tcgaggtaac gctgaaggtt cgcttgtttc aagtaaaata gctttggctt gcgccttatc 2641 aatcggtaac tcgccgccgc agacagcctg agtcaactct gctatcagcc gtggtgtatc 2701 ctcttcacct tgccagaaag cccaattgat attgttcagg taagcactca aaggatactg 2761 gaacggttcg cgatacccca aacgtaccgg gagaataaca ggatgtcctc cctgcacttg 2821 cgccatctcg tgcgccatct tgatttcttg ttccaccatc tcgctgtaaa ctgagcgaga 
2881 cgaaaggaaa acgatgagaa aatctgcttg acgaatttcc ccttcaatgc gttcagccca 2941 gcgcgtccca accaacattc tttggtcgat aaaaacttcg tgctgctgtg acagcgcttg 3001 aaaaacctct aaagcaactg gttcatccgg ttcaacatcg cgcttataac taataaagat 3061 acgctggcgc gtcgtcttct gtgcaacggg atgattcagc aattttcttt taaacacagg 3121 ggtttggttt tccttcaaac caaaaagctg agaacaaccc aaatcatagg caaactccac 3181 ctcttcccca gccgctaaag catcgtaaaa cgccacagca aaattaacag cagctttatc 3241 tcctatctgc tgattcatcc caatcacata ctcgatgtgt tgactaatcg cctctgcttg 3301 cacctgggaa taacaagcgt tcaggagaac acattcaacc ccctttttag caaatagctt 3361 aaacatactc gctagcgcat ccgcttgcac aaacgctatc tgccccgtct catcctccag 3421 cacaattcca tcttccccag caccgtgtcc acaaaagtgt acaatttggg gttgagtgtc 3481 gaggattgct cgataaaagt cgcgagaacg aaccgcccat ttttgctcta gcttaaacag 3541 ttctcgctta tttgcccgcc gcaatccctc atcaatttcc cgcacctctt catccagccg 3601 cagcgcagaa gtgtttcttg gattcgccgc caagagcaaa attgttttga cacgcgtgtt 3661 ttcactcata gcgggaatat ggaggaagca tgagaactta gcttattgac aaaatgcaga 3721 atctgtagta atcacacaaa gtcagcacaa tgaaattttt tgctgatact tatacatcac 3781 tatagatgaa accaaaatta acttatgcac aattgaggtt cgccttattt cttttggaat 3841 caaacatata ggtagggtgt gtaatgccgt aggctaacgc accatcccaa gatcttgatg 3901 cgttagacgt tccgtctaac acaccctaca atgactgata catgaaacca aaattaactt 3961 atgcacaact gaggttcgcc ttatttcttc tgaaataact ttcgtttccg cctcttctca 4021 ccctcctctc gcaacggtac cacgtctgct tcactactct gtcgtgcaac ccgaataatc 4081 aaaccagatg ccatcaaact ggcaaccatc gaatttccac cataactaaa catgggcaag 4141 ggcaaaccag ttgttggcag ggcaccagtg gtgacaccaa tgtgaagtaa agattgtccc 4201 accattaaaa ttgtgacacc gatcgccacc aatcgataca ctggatgctt tgccttaaga 4261 gcgacaatta atcctaaagt ggcgtataaa gcaatcagta acaacagcac aacactacca 4321 acaaagccaa actcctcagc aaacactgca aaaataaaat cagtatcctg aattggcaaa 4381 taaaactgct tttgttgaga aagcccaaac ccggcacccc attgtctacc agaacccact 4441 gcgagcaaac tttgcacgag ttgatagcca tccccagttg catcagccca aggattaagg 4501 aatgataata cccgcttacg ctgatattcc ttgatactaa tactgaggac tgccaacaac 4561 
actcctccaa ttgctgttcc tcctaagtac ttgtaaggta atccagcagc aagggcaata 4621 aaccaaatcg tcataccaca cagtgcggtt gtgcttaagt taggctgccc taaaatacct 4681 aaaaggacta gaccaaaaat acccagccaa aagaagcgaa cacgccaagt caggttttcc 4741 cactgaccaa aaagtcgcgc actttgtaat accaaaaaag gtttaattaa ttcagatggt 4801 tgaattggaa acggtcctaa agctatccac cgggatgcgt caaaagcttt ttttccaact 4861 cctggtacta aggtggcgaa aatcaacgtc aaaagcacta tcacaaacca atgagatatg 4921 ctcaaaactt tattcaaagg cgaatgaaca acaatattga atactatgaa acctactagt 4981 acccacaaga gttgacgctt aaagtagtac agtccgtcat gatgacgaac atcagctacg 5041 ggataggatg ccgaaaatag catgaccaat cccacaaaca gccaaaccag tgttaaccag 5101 cgcaataacc gcgcttctaa tgcccagctg gacacggagt catcaaataa taaagtcagg 5161 cggcgtagat tcacgatgaa taaaatttaa attgtaaggt gcgatcgttt ggagttgacg 5221 ttaatgatac caacttgctt tcaaaaggga ttaagttgtt gattcctttt tcaggataaa 5281 acttgagcaa attgagaact cccaccaagc agcagcgatc attacagtac tcaaaccctg 5341 cgacagggat gagaagaata ttaccgttat ataataccta cccaaagctg tccattactg 5401 gaaatatcta ccttggg // LOCUS NODE_5252_length_5404_cov_5.0566465404 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5404) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5404) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5404 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1..352 /locus_tag="DP116_26750" /pseudo CDS 1..352 /locus_tag="DP116_26750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127621.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=2 /transl_table=11 /product="hypothetical protein" gene complement(1355..1573) /locus_tag="DP116_26755" CDS complement(1355..1573) /locus_tag="DP116_26755" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26755" /translation="MRLLTTSYFFCRAWVEVGTNSTLGKENVTIQNALHKLFDKRKAD GSHAKSIMKKVSFDTKTSHLCSAIWFKA" gene complement(1913..2671) /locus_tag="DP116_26760" CDS complement(1913..2671) /locus_tag="DP116_26760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654928.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Crp/Fnr family transcriptional regulator" /protein_id="PRJNA477356:DP116_26760" /translation="MTSLSSAERSLLPMQSPSSFSEASRPFLTWQRILDWAQEHYRCR TFSKDERIPARAGLLYLVQRGAIRMVGTAQVSATASQLTSRRINRTPEEAFLGFVGAG QPFEIVAQSPFTLQAYAHVDQTAVLWMYWHDLDNWPHFRREVMDAFRYQHQRKLLWLS ALGQRRTIDRLLGFLTLLIEEYGEPSMSDTDPDVIRGYCLPFPLTHAQIGSAIGSTRV TVTRLMGKLRQRGLILTQGDNLICLPAESINRGN" gene 2693..2923 /locus_tag="DP116_26765" CDS 2693..2923 /locus_tag="DP116_26765" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26765" /translation="MSAQQALNLTVPLIQKVESVAPTLLLYFLLYNDGSKIYSNLIKF CTILIFLPRNLTSFLRQMTEIEDIQSRVMLLG" gene 2926..3909 /locus_tag="DP116_26770" CDS 2926..3909 /locus_tag="DP116_26770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315051.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutamate acetyltransferase" /protein_id="PRJNA477356:DP116_26770" /translation="MPKKLLQNKYAAIKQLVNSNLAIALSTYANSQPKICIKNINIPL SQGRDDEKIVYISGIALRLAKFEKYPAIEIANGIVSHISANYDKDFNIQVVSPGWIHL EVTHRLLAAWLQSLTLGEARELLVGSEGVGTKAEEERMSIGSFSPLPPAFCPISSSDL TSSSPFSSPNSASLFTAQYAHARCSSLLRLAVNEGLIKLRESNVDEGENVKQNLLLSI FTPNPIPWLNSDEKLRFNHPASYLLINELVRVVDRLEGSDFGDSVNWEKAALDLSRVF ETFWCKCRIFGEVKTASPELAQARIGLVLATQCVLKFVLEEKLDVFALDEV" gene complement(4124..4999) /locus_tag="DP116_26775" CDS complement(4124..4999) /locus_tag="DP116_26775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874373.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Cof-type HAD-IIB family hydrolase" /protein_id="PRJNA477356:DP116_26775" /translation="MHKVSATELASPGIGDTSTFDIQLLVVDIDGTIAGKSNSISTAV KQAIVAVRSRGIQVAIATGRMYRSALRFHQEIGSTLPLLAYQGAWIQDPATQKIHRHL PVSRELAHKLLDYFEQPELRTVLSVHFYINDQLYVREITRETELYAQRSGITPIAVGD LRQVTSNEPTKILALCDDTNIIEQLLGNLRSQYTPAELYLTTSVATFFEATNPSVNKG TGVRYLAEELLGVQRINVMTIGDNFNDVEMLEYAGIGVAMGNAPAAVQAKAQWVAPDV ECDGAAAAIERFLLC" BASE COUNT 1543 a 1136 c 1207 g 1518 t ORIGIN 1 gacaggcaca aagattgctt ctgcaagttc tgaccaaacg gtgaagctgt gggacgtatc 61 ttctggtaaa gaactcaaaa ccctccaagg gcattcagct gaggtgagga gcgtcagctt 121 tagccccgat agcaaaacga ttgcttctgc aagttccgac aagacggtga agctgtggga 181 cacatcttct ggtagagaac tcaaaaccct ccgatattca gcagaagtga ggagcgtcag 241 cttcagccct gatggcaaga tgattgcttc tgcaaattcc gacgggacgg tgatagtgtg 301 gaatttggat actgacaatt tactgtcgct tgggtgcaaa tggctcaaag ataacccgtc 361 cctgccttct gagacattaa aaaaacaaca gatttgtcaa caaacacaaa aaatgtctgt 421 tgcgcctgag tcagcatgga ttgcccaagg tgagctagcg caaattaaaa ctccagatcc 481 cacttctgcc caacaaaaac cttcaactca aacaacaggc tttagttgta attcttatac 541 tcgttctaat cttttgactt acgttgtcaa ggctcgtgat catcgtgtcg gaactggtat 601 ccgttgtgtc aaatttagtg atggtggagg caatactatt ccttcattgg catggtacgg 661 tgaaggtaaa tggggaggta 
agacataccg ccatgtaggt catgcttttt acaataatgg 721 taagcttatc ggttctgctt ccaatattta tgggaatggc gaagatatca acggcaactt 781 tgacaaaaat cttgacgtaa aaatagtcaa tcaatcaaca attcgtgtta aggatgcatc 841 gggtgtatgg aatgaagagt ggaagaagca agattctccc attaaatacg aacctttgcc 901 aatgcctaat acttgcggtg gacattttga tgagtataag gtttctgatt taaaaagcag 961 tcgtcaaggt ggttatggtt tgcgttgtat cctccgtgta ggagctaaaa ataccacttg 1021 gtttggcaat gggaaatggg aagatacaat ttatactcac ttaggaaccc tttcctctaa 1081 gggttacggt gctgatgata tctgttacac tcgagatcaa ttttgtaaca catttgaata 1141 tggtttatta aaattaactc acaactctgg tgactttgag gttactggtt cctggagtga 1201 aaagtggcat tacaagagcg aaaaatttca agggagatag gggcatacag cgcggtaagc 1261 taaaatttgg ggataaacaa ctcattcttg cttctcacgc ggctagtgca attataggca 1321 gtaaaagcga ttcaaaaaac aaggagtcag tgcgttaggc tttaaaccat atggcagagc 1381 acaaatgact tgtttttgta tcaaagctta cttttttcat tatgctttta gcatggctgc 1441 catctgcctt tcttttgtca aacaatttat gcagggcatt ttggatggtc acgttttcct 1501 tccctaatgt ggagtttgtc cccacttcca cccaagccct gcaaaaaaag tagctggtag 1561 tcaaaagtcg catttactac acttctgcct cgaaaatttt tcctcaatgt tctacttaac 1621 ctttgctttt ggggcagata caagggcaat tctttgctcc ttaccaagaa tcatttgttg 1681 tcctcccttc taaatgttct ggtaagggca caagaaattg aacattgagg ctttccaaaa 1741 tggaaagtac ctaattgtgt gcaatctgtt tgtggtggtc aaacagattg tcaatttcag 1801 cagcaagtgg gtatcccttt tgccaactta ctgccagcta gatcagggat atcgaattca 1861 tgaccgaagc caggtttgtt aaaattcgct ctccactgca tccggtgttg gcttagttgc 1921 ctctgttaat tgactctgct ggcaagcaaa tcagattgtc gccttgagta aggattaagc 1981 cacgttgacg caatttaccc attaaacggg tgacggtaac acgagtcgaa ccgatcgcgc 2041 tgccaatttg agcatgggtg agggggaaag gcaggcaata accgcgaatc acatctgggt 2101 cggtatcgct cattgacggt tctccatatt cctcaatcaa caatgtgaga aatcctaaga 2161 gtcggtcaat cgtgcgtcgt tgtcccaagg cacttagcca cagcagtttg cgctggtgct 2221 gatacctaaa ggcatccatg acttcgcggc ggaagtgggg ccagttgtcc aaatcgtgcc 2281 agtacatcca cagtacagca gtttggtcta catgggcata ggcctggagc gtgaatggcg 2341 actgggcaac aatttcaaac ggttgccctg 
caccaacaaa ccctaagaaa gcttcttcgg 2401 gagttctatt gatgcgtcga gatgttagct gacttgcagt tgcacttacc tgggcggttc 2461 ccaccatacg gatcgcaccc ctttgcacca agtacagcaa tccagctcgg gctggaatgc 2521 gctcatcttt gctaaaggtg cggcagcggt agtgttcctg agcccaatca agaattcgtt 2581 gccaagtcaa aaaaggacgt gatgcctcag agaaagagga tggagattgc ataggtaaca 2641 aagagcgttc ggcggaagac aaagacgtca taaaaaggtg caaggagtgt caatgtcggc 2701 acaacaggca ttaaacctaa ctgtaccact catccaaaag gtagaaagcg tagctcctac 2761 tcttttgtta tacttcttac tgtacaatga tggtagtaaa atttactcta atttaattaa 2821 attttgcaca attctaattt ttcttcccag aaatttaact tctttcctta gacagatgac 2881 agaaatcgaa gacattcagt caagagtaat gttactcggt tgattgtgcc aaaaaaatta 2941 cttcaaaata agtatgcagc aataaagcag ttagtaaaca gcaatttggc aattgcttta 3001 agtacttatg ctaattctca gccgaaaata tgcataaaaa atataaatat tcctctatct 3061 cagggtagag atgatgagaa gattgtttat atttcaggta tagctttgcg gctggcgaaa 3121 tttgagaaat atccagctat agagattgcc aatggcattg tttctcatat ttcagcaaac 3181 tatgacaaag atttcaatat tcaagtcgtt tcccctggct ggattcactt ggaagtgact 3241 catcgcctat tagctgcttg gttgcagagt cttactttag gggaagcacg ggagctctta 3301 gtagggagtg agggagtagg gacaaaggca gaggaagaaa gaatgtccat tggttctttt 3361 tccccccttc cccccgcttt ctgccccatt tcctcttctg acctcacttc ctcatctccc 3421 ttttcttccc caaactctgc ttccttattc actgctcaat acgctcatgc acgctgctcc 3481 tcgcttttac ggctggctgt taacgaggga ttgattaaac tgagggaatc aaatgttgat 3541 gagggtgaaa atgtcaagca aaaccttttg ttaagtattt ttactcccaa tcctattcct 3601 tggctcaaca gtgatgaaaa actgcgcttc aatcacccag cttcctatct cctgatcaac 3661 gagttagtac gagtggtaga tagactagag ggttctgatt ttggtgactc agttaactgg 3721 gaaaaagcgg cgttggattt aagtcgagtt tttgaaactt tttggtgtaa atgccggatt 3781 ttcggcgagg tgaaaactgc ctcaccagaa ctcgctcaag ccagaatcgg attggttttg 3841 gctacacagt gtgtgttgaa gtttgtgttg gaagaaaaat tagatgtttt tgctctcgac 3901 gaagtataaa caataaaatc cccaaaaaaa taaacttttt taacaatcat tgactcgtca 3961 tatttcctgg ggtatattat ccaatgtgtg aggagcgaac cagcaggagc accgagacga 4021 aacacggcca gtcgtcggtg ctctttctga ttttatccgt 
tgtaactgaa actcacggtt 4081 atacggacaa aatttcccta tgactaccca ctactaacta cttttaacac agtaaaaatc 4141 tttcaatagc cgctgcagct ccatcacact ccacgtcagg agctacccac tgggctttag 4201 cctgcactgc tgctggtgca ttacccatcg ctacacctat accagcatac tccaacattt 4261 ccacgtcatt gaagttatca ccaatcgtca tgacgttaat tctttgcact cccagtaatt 4321 cttcggctag gtaacgtact ccagtccctt tgttgacact cgggttagtc gcttcaaaaa 4381 atgtggcaac agatgtggtg agatatagtt cggctggggt gtattgactg cgcaaatttc 4441 ccagtaactg ttcaataata tttgtgtcat cgcacaatgc caaaattttt gtgggttcat 4501 tgctggtgac ttggcgtaag tctcccacag caataggggt aattccacta cgttgtgcat 4561 aaagttctgt ttccctggtt atttcacgga cataaagctg atcgttgata tagaagtgaa 4621 cagataagac cgttcgtaac tccggttgtt caaaatagtc tagaagcttg tgtgctaact 4681 ctcgcgaaac aggcaaatgg cgatgaattt tttgggtagc tgggtcttga atccaagccc 4741 cttggtaagc taacaatggc agggttgagc caatttcttg atgaaagcgc aaagcggaac 4801 gatacatacg accagtggcg atcgccactt gaataccccg ggaacgcacc gcaacaattg 4861 cctgcttcac ggctgtgctg atgctgttag attttcctgc gattgtgcca tcgatatcta 4921 caaccagcag ttgaatgtca aaagtagatg tatccccaat gccaggtgat gccaattctg 4981 tggcagacac tttatgcata attgatgagt gaatattgaa gttccaataa gagattaaca 5041 ggctatcacg ctgtagaggc ttgccctgag cgtagccgaa gggcgagtaa gagtaaagag 5101 gactaacaag agaatacaaa tgaattcatc tcgtcatcgt gaacaaattt ttacacagat 5161 atacagatgg aactgtgatt agtatatggt aaatgctata ggaatccggt ttgatttggt 5221 gaattgacaa agtagaggta gggaacgctt aacagggaac agggaacaga aagtgtacct 5281 agcttcgtca aaaataaaat ataagtccta tatatagaat gatttccact tacgagctac 5341 tacttacagc agattgcaac tacatgaggt acagattatt gtaagggcgc aaggcattgc 5401 gccc // LOCUS NODE_5290_length_5357_cov_4.4492645357 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 5357) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5357) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5357 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 387..860 /gene="rplU" /locus_tag="DP116_26780" CDS 387..860 /gene="rplU" /locus_tag="DP116_26780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L21" /protein_id="PRJNA477356:DP116_26780" /translation="MTYAIIETGGKQVKVEAGRFYDIELLCVEPDEKVTIDKVLLVQH NGEVTIGQPLVGGAKVEGTVMRHLRGRKVLVYKMKPKKKTRKKRGHRQEITRLLINSI SLDGSVLASEENAISVTPEASVTTVPAATTETPETVDVVAEPVAENSPAEETPAE" gene 892..1173 /locus_tag="DP116_26785" CDS 892..1173 /locus_tag="DP116_26785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458028.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L27" /protein_id="PRJNA477356:DP116_26785" /translation="MAHKKGTGSTRNGRDSNAQRLGVKRYGSEVVRAGNILVRQRGTK FHPGNNVGIGKDDTLFALVDGVVTFERKGKTRKKVSVYPVAAPEAAVAS" gene complement(1220..1659) /locus_tag="DP116_26790" /pseudo CDS complement(1220..1659) /locus_tag="DP116_26790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873505.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 1466..1475 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 1985..2896 /locus_tag="DP116_26795" CDS 1985..2896 /locus_tag="DP116_26795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314343.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carotenoid biosynthesis protein" /protein_id="PRJNA477356:DP116_26795" /translation="MRQLVIAQRVCLTGHILAKAFGLIGILLVIPNAEIILNSGEVGQ QAMQLSMADGGVVDIVLGTMAVSIYAYRILGWRTWLAFMVPAVLISVGSELLGTSTGF PFGYYSYLSGLGYKIAGLVPFTIPLSWFYVGLSAYLIARSGLRVAQNPSLVRQVGAVA LGALLFTCWDFALEPAMSQTSLPFWFWENSGEFFGTPYQNYAGWFATSALFISVAALL WKNTPIKLERSQLNVPLVVYLSNFAFAAGLSLAAGFAIPVSLGFVLGVVPAVVLWWSS KAASAHVAVAPTTTEVTTVANVKVAVK" gene 2928..4130 /locus_tag="DP116_26800" CDS 2928..4130 /locus_tag="DP116_26800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198411.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="PRJNA477356:DP116_26800" /translation="MLDVLTIASALCFFLLLIQLSATAILLSRLLKGPSRHPPITPQD PTPELLGCVSVVIPTLNEALRIAPLLDGISHQSYEVREMIIVDSKSSDGTPELVKTAQ QKDPRFRLMTDDPLPTGWVGRPWALHYGFLHTSEASEWFLGMDADTQPSPGLVAGLVK TAETQGYDLVSLSPQFILKYPGECWLQPALLMTLLYRFNPAGVNAEQSERVMANGQCY LCRRSVLAAVGGYTSANGSFCDDVTLARNIAASGYKVGFLDGAKVLKVRMYEGAMETW QQWGRSLDLKDASSRAQLWGDLWLLTAVQGLPLLVVFSYLFFSPPSLVWLSPSPSLLL VGLNVFLVVIRFAMLFAIAPSYDRNQAKGGWLFWLSPLADPIAVLRIFLSAFRTPIEW RGRKYSTQ" gene complement(4152..5048) /locus_tag="DP116_26805" CDS complement(4152..5048) /locus_tag="DP116_26805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017652967.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ribonuclease HI" /protein_id="PRJNA477356:DP116_26805" /translation="MSLVSTIKSIYTDGACTGNPGPGGWGVVVYFSDGSIHEMGGASA RTTNNKMEMQAAIAALQFLKDSGQNEAITLHTDSEYLINCVTKWVKSWKRKGWKKSDG NPVLNQDLLEILDDLNSPRVRWEHVRGHAGNIGNERCDAIARAFANGKTLSLQQQDFL LEQKNGLSVSRVSDSETNSTIINTKKQEIITLATDITNMEPQTHSPSSANEELPREIR VTQLRNLVETLRIADEIAEKGYLITSSELADLMDVHASAVTSRGDEWRWRNWLVSRVR REGNQILWELERGDKVESETES" BASE COUNT 1443 a 1161 c 1236 g 1507 t 10 others ORIGIN 1 catgcgcgat tttcacaaat caaataggat tgctatattg cttgtcaaat actttccaac 61 gccgcttatg aaaggagtta tatgaagtcc ggttaattag ttatgattcc cacagtcatt 121 gcaccccacc cccaacccct ccccgcaggc ggggagggga gacaaagcac agctttggcg 181 gggtggggtt cagagggttt aataagtaat caaacggaca tcatattaat attaattatt 241 ttgacaaaaa tgtagtcatt gaggtaaagt aatagattgt gtgtctaact cctgctcgga 301 tcaggtaaag gcaatacgaa agctggaaag ctgtcttaac gagcatacag tccagctacc 361 tgtacggcaa cctcaaggac aattttatga cctacgcaat tattgaaacc ggcggcaagc 421 aggtaaaagt cgaagctggt cgtttttacg acattgaact gctttgtgtt gaaccagacg 481 agaaagttac catagataaa gtactgctag tgcagcacaa cggcgaagtc accataggac 541 agccgcttgt gggaggagca aaggtggaag gcacggtgat gcggcatcta agaggtcgca 601 aagtcctggt ttataagatg aaaccaaaaa agaaaactcg taaaaaacgg ggtcatcgcc 661 aggaaatcac tagattattg attaattcaa ttagcctcga cggttcggtg cttgcttcag 721 aagaaaacgc aatcagcgtc acaccagaag cctcggtgac gacagttcca gctgcaacga 781 ctgaaactcc agagactgtc gatgttgtgg cagaaccagt tgcagagaat tcacctgcag 841 aagaaactcc tgctgaataa tagatagaag taaaggatac aagaggaaag tatggctcat 901 aagaaaggaa caggtagtac acgcaacggt cgtgattcta acgctcaaag actcggtgtc 961 aagcgctacg gtagtgaagt tgttcgggcg ggaaatatct tagtacgtca acgcggtact 1021 aaatttcatc ctggtaacaa cgtcggtatt ggcaaagatg acactctgtt tgctctcgtt 1081 gacggcgtag tcacatttga acgcaagggc aaaacccgta aaaaggttag cgtttatcca 1141 gttgcagcgc cagaggctgc agttgcaagt tagtcattag ttaaacccaa gagtccatag 1201 cgactatgga ctcttgagtt tagatcaact tatgaaccag cgcacctaag gctaaaccca 1261 tcaaagacaa gatagacaac aagcaaaaag tttgtccttc 
attgaagcca gctgtttgaa 1321 agtctatcct tgccaataca gccgcagcag ccattagtaa gatatcaaga aatagtcgga 1381 acctggcgat cattaaaaaa aacaagaaag ccgctaaaac gctaaaacca aaagacctaa 1441 gatttgattt gaataagaaa aaagannnnn nnnnnaagaa aagtctgcaa gcctgatcca 1501 ggaaatcgtt aaacctcccg tgagaagtaa aatggcaatc acagccaaga tccacatata 1561 ccaaggagtg tatattttag atattaacca cccagtggcc gtgtaagcaa gcactagtaa 1621 tgctagggaa aagaaaggcg gtcttttcga gattgacatt ggttgttccc aacataagaa 1681 taagctataa tttcacaact atctaaagca tatacgtact gctttgtcgc aaagcatgaa 1741 tcgcgacact ataagccgtg agaagaaggt caatttagtt gagagataat ggcttgaata 1801 gcttaaacta taaatctatt atcatatttt ttctaccttc tatcaagagg ctgttatcct 1861 ttccataagg aaagcaaggt cagtttgcca aaaattcccg gacaaagtat taaatatttt 1921 cttaaacatt tgtaaactat taccagctaa gtagctgcgt atcatgtctt gttaagggat 1981 tatcatgaga caacttgtta tcgctcagcg tgtatgcctg actggtcata tcttggcaaa 2041 ggcttttgga ctaataggca ttctattggt catacctaat gccgaaatca ttttgaactc 2101 aggagaggtt ggacaacaag ccatgcagtt gagtatggct gatggtggtg tggtagatat 2161 tgttttgggg acaatggccg tctctattta tgcatatcga atattgggat ggcgcacttg 2221 gctggcgttt atggtaccag ctgtattgat atcggtgggc agtgaattac tgggaactag 2281 cactggattt cccttcggat actacagtta cttgagtggc ttaggatata agattgcggg 2341 acttgtacca tttacaattc ccttatcctg gttttatgtg ggtttgtctg cttacctcat 2401 tgctcgtagt ggtctaaggg tagctcaaaa tcccagttta gtgcgccaag tgggtgctgt 2461 agcgctgggt gctttgctct ttacctgctg ggactttgca ttggaaccag ccatgagtca 2521 aacatctctc cccttttggt tttgggagaa ttcaggtgaa ttttttggaa caccgtatca 2581 aaattatgcg ggttggtttg ccactagcgc cctgtttatt agtgtggcag cattgctgtg 2641 gaaaaacaca ccgataaaat tggagcgatc gcaactcaat gtgcccttag tcgtttatct 2701 cagcaatttt gccttcgccg ccggactgag tttagctgct ggatttgcta tcccagtttc 2761 attaggcttt gtcctgggcg tagttcctgc tgtagtactt tggtggagtt caaaagctgc 2821 atctgctcat gttgctgttg caccaacaac cactgaagtg acgactgtag caaacgtcaa 2881 agttgctgtt aagtaaagct ttgatttcca ttgccaggta ttttgctgtg ctagacgtgt 2941 tgacgatcgc aagtgctctt tgtttctttt tactgctcat ccaactaagc 
gcaacagcaa 3001 ttttgctgtc gcgcctctta aaaggaccta gccgccatcc tcccattaca ccccaagatc 3061 ctacaccaga acttttgggg tgtgtgagtg ttgtcattcc taccttaaat gaagctcttc 3121 gtattgctcc cttgctagac gggatctctc accaaagcta tgaagtccga gaaatgatca 3181 ttgtagatag caagtctagt gatggcacac ctgagttagt aaaaaccgca cagcagaaag 3241 atcctcgctt tcgcctgatg acggatgatc ccttaccaac tggttgggtg gggcgtcctt 3301 gggcattgca ttacggcttt ttgcatacct cagaggcgag tgagtggttt ctgggtatgg 3361 atgctgatac acaaccatca cctggtttgg ttgcgggttt ggtgaagaca gcagaaactc 3421 aaggctatga cttggtttct ctctcacccc agttcattct caaatatcca ggagagtgct 3481 ggctacaacc agctttgttg atgactctgc tttaccgatt taatcctgct ggtgtgaatg 3541 cggaacagtc agaacgagtg atggcgaatg gacaatgcta tttgtgccgt cgctctgttt 3601 tagcagctgt gggtggatat acgagtgcga atggttcttt ttgtgatgat gtgactttag 3661 cacggaacat cgctgcctct gggtataaag ttggcttttt agatggggca aaagtactca 3721 aggtacggat gtatgaggga gcaatggaaa cttggcaaca atgggggcga agccttgact 3781 tgaaagacgc ttcctcccgc gctcaacttt ggggagattt atggctcctc acagcagttc 3841 aaggtttacc ccttctagtc gtcttcagct acctcttttt ttctccccct tcccttgtct 3901 ggttgtctcc ctctccctcc ctgcttctag tgggactcaa tgtatttcta gtggtgattc 3961 gctttgctat gctttttgcg atcgcacctt cctacgatcg caatcaagca aaaggtggtt 4021 ggttgttctg gctttcgcct ttggctgatc caatcgcggt gctgcggatt tttctatctg 4081 catttcgtac cccgatagag tggcgagggc gcaagtacag tacccagtga tgagtcctga 4141 gtcctcattg gttaagactc agtttcactc tctaccttat cgccccgttc caattcccag 4201 agaatttgat taccttcacg ccgcacccgc gagacaagcc agtttcgcca gcgccactcg 4261 tctccacgac tggtaacagc gctggcgtga acatccatca agtctgctag ttcagaactt 4321 gtgatcaggt aacctttttc ggcaatttcg tccgcaatgc ggagagtttc caccaagttg 4381 cggagttgtg ttaccctgat ttctcggggt aattcctcat ttgctgaact aggagagtgg 4441 gtttgtggtt ccatattggt tatatctgtt gcaagagtaa tgatttcttg tttctttgtg 4501 ttaattattg tagaatttgt ttccgaatca gatactcttg atacacttag accgttcttt 4561 tgttccaaga gaaagtcttg ttgttgcagg gacaaggttt tgccattggc aaaggcgcgg 4621 gcgatcgcat cacagcgttc gttaccaatg ttacctgcat gaccccgcac atgctcccac 
4681 ctgacccgtg gactattaag atcgtctaag atttccaaga ggtcttggtt cagcacagga 4741 tttccatctg actttttcca gcctttcctt ttccagcttt tcacccattt tgtcacgcag 4801 tttatgagat attcgctgtc ggtgtggagg gtgatggctt cgttttgtcc tgagtctttt 4861 aaaaattgga gggcggcgat cgcagcttgc atctccattt tattgtttgt cgttcgggca 4921 gatgcacccc ccatttcgtg aatagaacca tcactgaaat agacaacaac accccaaccg 4981 ccaggaccag ggttgcctgt gcaagcacca tcagtgtata tacttttgat tgtggaaact 5041 aaagacatag ttaagaatag aaaagcggca aaaagctaac gatgcaaaga tatcagttgc 5101 gaaaaatcac agaaagtatt aaggttagca caaaaacagt tccgcaaata tcgcatctca 5161 tttctatcag ccctttttcc taaccaactg cgtattcttt aaactgccgc agaaatgctg 5221 aatcaaacca attacaatag acttcgtggc agttgtaggg aactcttaac gcttaacagg 5281 gaacgcttaa cagggaatag ggaattctga acgcttaact cttaacagaa cctcgtaaaa 5341 tctcactttt gcaagag // LOCUS NODE_5386_length_5211_cov_5.1227705211 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5211) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5211) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5211 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 56..262 /locus_tag="DP116_26810" CDS 56..262 /locus_tag="DP116_26810" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318379.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26810" /translation="MKFQIECNSFKNNQMCLICNQLFQMSEARLIVCSDEGDGFGDIC PECIGMGPYWIKSQLQHFSSSLST" gene 1230..3089 /locus_tag="DP116_26815" CDS 1230..3089 /locus_tag="DP116_26815" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318380.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycoside hydrolase family 2" /protein_id="PRJNA477356:DP116_26815" /translation="MNLLELENRAVELTSHADMGTTFALKETAYPRPQLQRAHWQSLN GLWKFAFDDQGKCVQPSDLNQWTHHIEVPFAPESTKSGIGDTGFHPNCWYEREFETPP GEGRLLLHFGAVDYRARVWVNDQYIAEHEGGHTPFTLDITHVLNDSGITKVTVWAQDD PHDLAKPRGKQDWQLEPHSIWYPRTSGIWQTVWVERVGTTYIDHIQWTPDFERWEIGC YAALAGDVPVSGVQIKMKLSVGDRMLVNDTYEVFNGEISRRLALSDPGIDDYRNELLW SPEKPTLIDAEVELWYKGELIDQVKTYTAIRTVSIQRDRFMLNGRPYYLRLVLDQGYW HDSFMTAPDDEALRRDVELAKAMGFNGVRKHQKIEDPRFLYWADVLGLLVWEEMPSAY RFTNKAVERMTKEWTEVIKRDINHPCIVAWVPFNESWGVPNLVETQAHRNYVLAMYHL TKTLDPTRPVIGNDGWESTDTDILAIHDYDTNPQHLANRYGPSVKLSDLFDRKRPGGR ILTLDNYPHQGQPVMLTEFGGIAYAPENKPDANKVWGYERSWNISELQMKYAALLETI NNIEIFSGFCYTQFTDTFQEANGLLYADRTPKFPIEAIRAATLSGGLCTPTSC" gene 3068..4168 /gene="galT" /locus_tag="DP116_26820" CDS 3068..4168 /gene="galT" /locus_tag="DP116_26820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="galactose-1-phosphate uridylyltransferase" /protein_id="PRJNA477356:DP116_26820" /translation="MYSHKLLKPDGRQLTLYSRYPITSEIQATSPSNEPVQANPHLRW HPLRGEWVAYASHRQGRTFMPPPEYNPLAPTSNPEFPTELPQGKYDVAVFDNRFPSMT VTAHDPPASIVETLPSNGACEVVVFTQDPNASLGSLELDHLELLLQVWGDRTRVLGAN PQIQYVLPFENKGVEVGVTLHHPHGQIYAYPFVPPVPARMLEQQRRFYQQQGRGLLED LIQKEIADNKRIIYQDEEAIAFVPVCARYPYEVWIAPIEPVGTFYDLSAKQRQGLARA LKTVTLKYDGLWNRPFPYLMAWFSAPTDGEAHREAHLHAEFYPPYRTSERLKYLAGTE LAAGMFANDALPEEKAKELQALVVNIETPVRL" gene 4244..4327 /locus_tag="DP116_26825" /pseudo CDS 4244..4327 /locus_tag="DP116_26825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195986.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" gene 4513..4584 /locus_tag="DP116_26830" tRNA 4513..4584 /locus_tag="DP116_26830" /product="tRNA-Asn" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:4545..4547,aa:Asn,seq:gtt) BASE COUNT 1431 a 1162 c 1275 g 1343 t ORIGIN 1 cgcatatttt ttcattaata tattcaaaat aaaaatttct tcaggatgta aaaatatgaa 61 atttcaaata gaatgtaaca gtttcaaaaa taaccaaatg tgtttaatat gtaatcaact 121 atttcaaatg agcgaagcga gattaattgt ctgcagtgac gaaggtgatg gctttggtga 181 tatttgtcct gagtgcatag gtatgggtcc ttattggatc aaaagccaac ttcaacactt 241 cagcagtagt ctgagtacat agtaggatgc gttaaaaaac tgtaacgccc cgcctgaata 301 tagtgcgtta taacgcttaa acttgtgtca ttgctaactg tgagctaata ccgtttctcc 361 atgaagctgc gctaaataat ttcttagccc cccttgggaa ggggagccag tgcattgcgg 421 tgaggcagcg cagtcttggg ggtttccccc aggagcgact gccgaacccc gaaggggtga 481 atccagcgcg catgaggcgt tctcccgccg taggcgactg gtgttagccg tcaggcgaag 541 gcaggcatac ccgaagggag ccagtacttg atgagggttt ccctcacgct tgttctggcg 601 ttggggttcc tcctttgggt gcacctggcg tgggttgggg ggatctgatt tgttgcatct 661 tcatatagaa ctggtataag atgacttgct tgacgtaccc tactgttgtc aataagtgcc 721 tgaaattagc 
acaggtaaaa tttcaacaaa aatagttatt aatttttcgc attttaactc 781 attaaaaagg ctttgcgact attttggctg caattgctta tcaaaaatcg ctgctacatc 841 gttttctaac aaatattgag tctgaaaact taataaatcc gcctctaaaa tagtttatat 901 aggggcagtt tcttatatta ggtagtaaaa attttatttc attttttcag caaactcgcg 961 ctcacatttc ccccaagtat gccataatgt cacgagccgg aaattaatac ctaatttgac 1021 accaccaata gccttggggg ctatggtggt cgtttgctca tagcaaatct ccaccgttaa 1081 gcatttatta ccaaaaaata gtctgtttgg aggtgtgtga caaagtttaa gtattattac 1141 cagcaatatt caccgaaatg tttggaataa ggcgaatttt tgacaattca ttttgtagaa 1201 gatagtagaa gagttgggaa gagattgcga tgaacttatt agaattggag aatcgtgcgg 1261 ttgaactaac ctctcatgcc gatatgggga cgacttttgc cttaaaagaa acagcatacc 1321 cgcgtccgca gttgcagcgc gcgcattggc aaagtttaaa cggtctgtgg aagttcgcat 1381 ttgatgacca gggcaaatgc gttcaaccta gcgacctgaa ccagtggaca catcatatag 1441 aagttccctt tgctccggaa tctaccaaaa gtggtattgg ggatacagga tttcacccaa 1501 actgctggta cgagcgggaa tttgaaacgc caccaggcga gggcagatta ttattacact 1561 tcggtgctgt ggactatcgc gcccgtgtct gggtgaacga tcaatacata gctgagcatg 1621 aaggcggaca tacccctttc actctcgata ttacccatgt cttgaatgac agtgggataa 1681 cgaaagtgac agtgtgggcg caggatgatc ctcatgacct cgccaagcct agaggaaagc 1741 aagattggca gcttgaaccg catagtattt ggtatcctcg caccagtggt atttggcaga 1801 cagtctgggt tgaacgtgtg ggtacgactt atatagatca catccagtgg actcccgact 1861 ttgagcggtg ggaaattggt tgttacgctg cgctggcggg tgatgtgcct gtctcgggcg 1921 tacaaataaa gatgaaactg agcgttggcg ataggatgct ggtgaacgat acttatgaag 1981 tgttcaatgg ggaaattagc cgccgcctcg ccctttctga ccctggtatc gacgactacc 2041 gcaacgaatt gctgtggagt ccggaaaaac cgacgctgat tgatgccgag gtagaactgt 2101 ggtataaagg cgaactcata gatcaagtga aaacttacac cgcaatccgg actgtgagta 2161 tacagcgcga tcgctttatg ctcaacggtc gtccctacta tttgcggctg gttcttgacc 2221 aaggctactg gcacgatagc ttcatgactg caccagatga tgaagcgttg cggcgcgatg 2281 tagaactggc gaaagcaatg ggctttaacg gagtccgcaa gcaccaaaaa attgaagacc 2341 cccgcttttt atattgggca gacgtgctgg ggttgttagt atgggaggag atgcccagtg 2401 cttaccgatt cacaaataaa 
gcggtagagc gcatgaccaa agagtggaca gaggtcatca 2461 agcgggatat caaccacccg tgtattgtgg cctgggtgcc gttcaatgaa tcttggggag 2521 ttccgaattt ggtcgaaaca caggctcaca ggaactacgt tttagcaatg tatcacttga 2581 ccaagactct tgatccaact cgcccagtga ttggcaacga tggctgggaa agtacagata 2641 ccgacattct cgctatccat gactacgaca ccaatccgca gcacttggcc aatcgctatg 2701 ggccgtccgt caagctatca gatttatttg atcgtaagcg tcccggagga cgcatcttga 2761 cgctggacaa ctatccacat cagggacagc cagtgatgct gaccgagttt ggcggtatcg 2821 cctatgcccc tgagaacaag cccgatgcta acaaagtttg gggatatgag cgcagttgga 2881 atatctccga attacaaatg aaatacgctg ccctgctgga aactatcaat aatattgaga 2941 tattcagcgg gttttgttac acacaattca ccgatacctt tcaagaagca aacgggttgt 3001 tgtacgccga tcgcacgcct aaatttccca tcgaggcgat tcgcgctgca accctttcag 3061 gaggattatg tactcccaca agctgttaaa gccggatggt cgccaactga ccttatacag 3121 tcggtaccct attaccagcg aaatacaagc cacgagtcct agcaacgagc cagtacaggc 3181 aaatccacac ttgcgctggc atcccttgcg gggcgaatgg gtggcttacg cgagtcaccg 3241 tcaagggcgg actttcatgc cgcccccaga atataacccc ctcgcaccca ccagcaaccc 3301 agagtttcca actgaactac cgcaaggtaa gtatgacgtg gcggtattcg ataaccgctt 3361 tccgtcgatg acggtcactg cacacgaccc ccctgccagc atcgtggaaa ccttgccctc 3421 aaatggagcg tgtgaggtcg tggtttttac ccaagacccc aacgcttccc ttggttccct 3481 agaactggat cacctagagt tgctgttgca agtgtggggc gatcgcactc gcgtactcgg 3541 agcaaatccc caaattcagt acgtactacc gtttgaaaac aagggcgtgg aggtaggtgt 3601 aactctgcac catccccacg ggcaaatcta cgcttatcct tttgtaccac ctgttccagc 3661 acgaatgttg gagcaacagc gacgatttta ccagcaacag gggcgcgggt tgctggaaga 3721 tttgatccaa aaggagattg ctgacaacaa acggattatt tatcaggatg aggaagcgat 3781 cgcattcgtc ccagtgtgcg ctcgttaccc gtatgaggtt tggattgctc caattgagcc 3841 agtaggcacc ttttatgatc tgagtgcaaa acaacgtcag ggacttgcca gggcattaaa 3901 aacggtcacc cttaagtatg acggattgtg gaatcgcccg tttccttacc tgatggcttg 3961 gtttagtgca ccaactgacg gtgaggctca tagagaagcc catttacacg ccgaatttta 4021 tcctccctat cggacgagcg agaggcttaa gtatctggcg ggaacggaac tcgcagcagg 4081 gatgtttgcc aatgatgctt tacctgagga 
gaaggccaag gagttacagg ctttggtggt 4141 caatatcgaa acgccagtgc ggttgtgata acttcaccct tgttttacat cctcccctgg 4201 ctattggcgg aaggtatggc tacgccacgc aagctatcag cctatgagtg ctgtaggagg 4261 ggaggtcaga ccaaggcttg gacgaaagtc tcatattcgg cactcgcgtt tgagtacaca 4321 aacctgagct aagtcggtac tcccacaagc gcggtgggta gttcaccttt ctcctgctgt 4381 ttcacaaaaa gtgcaaactg ctaaaggctt gcaaatacta ctgtttcacg ttttccaaaa 4441 aatatccaaa aaaattttcc aaaagtactt gccaaatcct tttcttagga ctattataag 4501 tattcagagc gttcctcagt agctcagtgg tagagcgatc gactgttaat cgattggtcg 4561 caggttcgaa tcccgcctgg ggagttaaaa aagataaaga aagttctggg atattttctt 4621 cttgaattgt cctggaactg tggaatggct ctggtcaaaa ccagttggtc agtgtagcct 4681 ccaacagatg cttttggtgt ccaaagtgca tgccaatcta gagaagcaaa tgagaatgat 4741 caaaaaatgc tcgctgaacc tatttacttt aagagcagta cagcgagcaa aaaatcagag 4801 cagcagagtc aatacagcac gaaatatggt agtgtctttg aaaaatattt gatgagttat 4861 taacgctgcc atgagccaag cttatattga atatgagtgt tgagttaagt agctttcata 4921 ttgtcggtga cagtggacaa aaccaaaaag tgcgatgcgg caagtaactg ttaggccaaa 4981 aacactgttt ataactgagg cttcagcgaa ggaatctatt catgctggtg tcgagcttgt 5041 aggttgtgat ttacgggtat gcagcaagat ttatccacca gcaccagacc gagttgttaa 5101 tcccggtata ccagaacatc gaatttccct tcacattgca aatcaagatt atttagggga 5161 cttccaagga ataaaatcac ccaagtcaat aataataatc attgttaggg g // LOCUS NODE_5392_length_5204_cov_4.634881 5204 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5204) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5204) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5204 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(1..1020) /locus_tag="DP116_26835" CDS complement(1..1020) /locus_tag="DP116_26835" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by 
automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26835" /translation="MNIEEALAVADAAIFAYTGKHLRDQEKLILSASWENQTYEQMAQ RFSWSEKTLKDAGSDLWKSLSKALGEKVTKKNFKAALERRRIENTSAQQSTDNQDWET APDVSVFFGREEELQLLEQWILKERCRLVGIIGFGGIGKTKLSVCLGKGGIGKTDLSL KLARSIGHEFTYIIWRSLLNAPPLTTILTDWIKFLSKQQETRLPETVDEQLRLVLQYL QSERCLLIVDNMETVLQGGVSAGQYRQGYEEYGQLLRAIAQVPHQSCLLLTSREKPKE LDRSPYPVQVLELRGLNYAQGRKIFAQIGSFEGSDEEWQELIAFYNRHVREFAKHSTS GFAQH" gene 1453..2016 /locus_tag="DP116_26840" CDS 1453..2016 /locus_tag="DP116_26840" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26840" /translation="MHKIIAFGFIALNTLLLLPLPTQAQNTFNLSGEWISAGESVTGN SAGCVKYQGGTYTTNGIQYGATYEAPTRITQSGNQLIFNNDNISNSLGNFTRVTRGTV TGNQVRIESTDSPASFNFVFQGKINENGNVITGLSTCSYRGGSATYTSFVVYYRKTFC SAQPISITELRSLQAQKPSLQQTSRLS" gene 2201..3121 /locus_tag="DP116_26845" CDS 2201..3121 /locus_tag="DP116_26845" /inference="COORDINATES: protein motif:HMM:PF06739.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26845" /translation="MKAQWKWHRLFLGSLIATTITTSLFDTNSYRATANSQQKLSNPQ GLSQFQTIGLKKNSGLSTLVKPNVHRFSSKKTLISAGERSPLVTGWIRQFGISGDDLS EKISIDKAGNVYVVGGTSGSLGGANAGNYDAFVTKYDNRGNRLWIRQFGTLGFDVASD VAVDNAGNVYVSGATNGSLVTSRSISGANPSFFTTSNTVAFVTKYDSNGNQLWIQKLG AAGTISDISGFTNGSLGGTNAGSYDAWIGKYDRNGQLLWTQRLGTPAEDKAFGIAVQG NDIYVTGETAGALGAANAGSYDAWLGKFTF" gene complement(3277..3654) /locus_tag="DP116_26850" CDS complement(3277..3654) /locus_tag="DP116_26850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748241.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADPH-dependent 7-cyano-7-deazaguanine reductase QueF" /protein_id="PRJNA477356:DP116_26850" /translation="MEMKYGERNIAEGQLITFPNPRVGRRYNIDITLPEFTCKCPFSG YPDFATIHISYIPNERVVELKALKLYINNYRDRYISHEETANQILDDFVAACDPMEVT VKADFTPRGNVHTVVEVRHQKNA" gene complement(3785..4189) /locus_tag="DP116_26855" CDS complement(3785..4189) /locus_tag="DP116_26855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316312.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26855" /translation="MEKWQEEFWDMIEIVADEVERFFVGMTEMIDSLFEITEEISEQV QNTIATELDQYLQELTDPILEIYSELEDIVVDDIDLGFPYPVEPCQEKNPACIGCRHY HGQVYSGNLLVCGMHPYGWEDENCPDWEQEEL" gene complement(4343..4732) /locus_tag="DP116_26860" CDS complement(4343..4732) /locus_tag="DP116_26860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748243.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26860" /translation="MPGFGDIVKKAFYLGVGLASYAGEKAGGTITELRSQVQKLADEM VARGEMTTDEARRFVEDMMKQAQSAPTSDTTARQTSPSEPRRIEILEEDEEPTTVKVT QPTENVDKLRDQVRQMQEELRRLQRDK" gene complement(4758..4854) /gene="ffs" /locus_tag="DP116_26865" ncRNA complement(4758..4854) /ncRNA_class="SRP_RNA" /gene="ffs" /locus_tag="DP116_26865" /product="signal recognition particle sRNA small type" /inference="COORDINATES: nucleotide motif:Rfam:12.0:RF00169" /inference="COORDINATES: profile:INFERNAL:1.1.1" /note="Derived by automated computational analysis using gene prediction method: cmsearch." /db_xref="RFAM:RF00169" gene complement(4890..5158) /locus_tag="DP116_26870" /pseudo CDS complement(4890..5158) /locus_tag="DP116_26870" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013192243.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 1463 a 1148 c 1057 g 1536 t ORIGIN 1 ctaatgctgt gcaaagccag atgtggagtg ctttgcaaat tcccggacat gtctattata 61 aaaagcaatc aattcttgcc attcttcatc tgaaccctca aaagagccaa tttgggcaaa 121 aatcttgcgt ccttgtgcat aatttagacc cctcagttcc agaacctgca caggataggg 181 actacgatcc agttcttttg gcttctcgcg gctagtgagc aataaacaac tttgatgggg 241 aacttgtgcg atcgccctca gcagttgacc atactcctca taaccctggc gatactgtcc 301 cgcgctaacc ccaccttgca gcacagtttc catattatca actatcagca gacagcgttc 361 cgattgcaga tactgcaaca ccagtctcag ttgctcatcc acagtctcag gtaaacgagt 421 ttcttgttgc ttagataaaa acttaatcca gtcagtcagg atagttgtca gaggtggagc 481 gttgagtaaa gagcgccaga taatataagt aaactcatgc ccaatgctgc gagctaactt 541 tagggaaagg tctgttttgc caataccacc tttacccaag caaacggaca gtttagtttt 601 accaatccca ccaaaaccaa taataccaac cagtcggcag cgttccttaa gaatccattg 661 ctcaagaagt tgcagttctt cttcacgtcc aaagaacact gagacatccg gtgctgtttc 721 ccagtcttgg ttatctgttg attgctgcgc tgaggtgttc tctattcgcc 
gccgttccaa 781 agctgcttta aagttttttt tagtgacttt ttcccctagc gcctttgata gggatttcca 841 taaatcagaa cctgcatctt tgagagtttt ctcagaccat gaaaacctct gtgccatctg 901 ctcataggtt tgattctccc aagaggctga cagtattaat ttttcctgat ccctcaaatg 961 cttccctgta taagcaaata tggcagcgtc tgcaacagcc agcgcttcct caatattcat 1021 gatttcaagt ggttacagaa tacttccacc agcattttac gggaaagtca gtagatatcc 1081 gactttttcc gactttcttc caattctgtt ccgacttttt cgcactgaca tatctcagca 1141 gacctgagag tatagctttg caagcaaaga ttcagattga ttctgatagc gaaaacaggt 1201 tgtttgatgt ccgcttagtt tttactataa gcaggcagtt tgtggtattg acttacttgt 1261 tttgctgact tttaataaca ttttcgctaa gaaaaaagcg taatcggaaa ctcaaacccc 1321 cctattgcat aaaagtgcaa tcaatttgtt ataaacttgc aacttcgtta ttacaaaagg 1381 atagaaaatt atttcatcta ctgttttact gttttgagca aacttaatta agttcagaaa 1441 aggtctatca atatgcacaa aattatagcc ttcggattca ttgcgttgaa tacactcctt 1501 ttactgccct taccaacaca agcccaaaat acgtttaatt taagtggaga atggatttca 1561 gctggagaat cggtcacggg taacagtgct ggatgtgtaa aataccaagg cggtacttat 1621 actaccaacg gcatacaata tggtgctact tatgaagctc ctacgagaat aacacagtca 1681 ggaaatcaac taatattcaa taatgacaac atctctaatt cattgggaaa ttttacgcga 1741 gtaacgagag gaacggtaac tggtaatcaa gttagaattg agagtactga cagtcctgct 1801 agttttaact ttgtatttca aggaaaaatc aatgaaaatg gaaatgtaat cacaggttta 1861 tcgacctgta gttatcgagg tggttccgca acatatacta gctttgttgt ttattataga 1921 aaaacttttt gtagtgctca gcccatcagc ataacggaat tacgctcact tcaggcacag 1981 aagcccagtt tacagcaaac gtcacgcctt agctagaaat catgaaattt gagggattac 2041 ccaattggct tggctccccc acaaaattga taccatacca atcctaaatc attcgtcaac 2101 aaaaagattc ccgacttcgc agaagttgtc gggaatctaa gccttaagac atgacgaata 2161 ggatatctga ataaaaaatt tgtaaccagg agaaaaatca atgaaagctc aatggaaatg 2221 gcacagatta ttcttgggca gtttgattgc aactacaata actacaagtt tatttgacac 2281 caattcctac cgtgctactg ccaatagtca gcagaaactc agcaatcctc aaggactatc 2341 tcaatttcaa actatcggtt tgaaaaaaaa tagtggtttg tctacacttg tcaagccaaa 2401 tgttcataga ttcagcagca aaaagactct catttccgct ggtgagcgat cgccccttgt 2461 
tacaggttgg atcagacaat ttgggatttc cggtgatgac ctttctgaaa agataagtat 2521 agataaagct ggtaatgttt atgttgtagg aggtacatct ggctctttag gaggagccaa 2581 tgctgggaat tatgatgctt ttgtgacgaa gtatgacaac agaggtaacc gactatggat 2641 aagacagttc gggactttgg gttttgacgt agcttctgat gtagctgtcg ataatgctgg 2701 caacgtttat gtttcggggg caactaatgg ctctcttgtc acttctagaa gtattagcgg 2761 tgctaatcca tccttcttca ctacatctaa tactgttgcc tttgtgacga agtatgacag 2821 caatggtaac cagttgtgga tacaaaagtt gggagcggct ggtactattt cagatatttc 2881 tggtttcacc aacggatctt tgggaggaac gaatgctggg agttacgacg cttggatagg 2941 aaagtacgat aggaatgggc agctgttatg gacacaacga ttaggtactc ctgccgaaga 3001 caaagctttt ggcatagctg tccaaggcaa tgatatctat gttacaggag agactgcagg 3061 agcattgggt gcagctaatg caggcagtta tgatgcttgg ctagggaaat tcacttttta 3121 gccatatttg aaattaaaac caaaactagg tacaggggcg acttttctcc tgtaccaact 3181 agaagtaaat cagtaaattc acaaaaaaag tccattgaag tggctaaagg cagtcaggat 3241 actggacatg ggaatgtcag cagcgctatc agatttctac gcgttcttct gatgtcgcac 3301 ttcaaccacc gtatgcacat taccgcgagg tgtgaaatct gccttaacag taacttccat 3361 tggatcacag gcagcaacaa aatcatctag gatttgattg gcagtttctt cgtgggaaat 3421 atagcgatcg cgatagttat taatatatag ttttagcgcc ttcaattcca cgactcgctc 3481 attagggatg tagctaatgt gaatcgtggc aaagtcagga tagccagaaa acgggcattt 3541 acacgtaaat tccggcagag taatgtcaat attgtatcgc cttccaacac gcggattggg 3601 aaatgtgatt agttgccctt ccgcaatgtt gcgttctcca tatttcattt ccatagtcaa 3661 gagtcaaaaa tcaagagtca agaggagaca gcgctgcagg agggtttccc tccgtaggcg 3721 actggcgttc aacagtcaac agtcaaaatt ttttcttttc aactttggac tattgactaa 3781 ctcatcataa ctcttcctgt tcccagtctg ggcaattttc atcttcccaa ccgtagggat 3841 gcataccaca aactaataga tttccactat acacttgacc gtggtaatga cgacaaccga 3901 tgcaagcggg atttttttct tgacagggtt caacaggata aggaaaaccc aggtcgatat 3961 catctacgac aatgtcttcc agttcactat aaatttccaa aattggatca gtcagttcct 4021 gtaaatactg atctaactcg gtggctatag tattttgtac ttgctcgcta atctcttctg 4081 tgatctcaaa aagggagtct atcatctccg tcatgcccac gaagaagcgt tctacttcat 4141 cagccactat 
ttctatcatg tcccaaaact cttcttgcca cttttccata aacccgacgt 4201 ccttacaagt tagttactgg acgcttaatg ctacatggct gggagcattt ttgtacatcc 4261 taaggactat gtttttataa aaattttaca ttgtcaagaa atatttcagc ttcatgctga 4321 aacactcctg ccagccttcc aactacttat cgcgttgtaa cctacgtagt tcttcctgca 4381 tttgtctgac ttgatcgcgt agtttatcaa cattctcagt aggttgagtg acttttacag 4441 tagttggctc ttcgtcttcc tctagaatct caatgcgacg cggttcagaa ggagatgttt 4501 gacgagctgt tgtatcagat gtgggtgcgc tttgggcttg cttcatcata tcttctacaa 4561 agcggcgggc ttcgtctgtt gtcatttcgc cccgcgcaac catttcatct gccagctttt 4621 ggacttgcga tcgcagttcg gttattgtcc cccctgcttt ttctcccgcg taagaagcca 4681 acccgacacc taggtaaaaa gcttttttta caatatctcc aaaaccgggc attgctgctt 4741 ttgcgctcct aaaatggggc gacccggaga ctacgcctgc tataagcatc ccgtattgct 4801 gctaccttcc ggtcctgaca agatttgggc gttacagccg cgtagatccg agtctgtaaa 4861 cagcatagca ttaatttttc gattgtgtgt cattccacga caattttcca ctcaaaaagc 4921 tgataattca cacaatcatt ttgtaccaat ggatcaagag ctacgatagt ttgtgcttct 4981 tccattgagg ctgcctcaaa cagcatcata ccgccacctt tttgcgccca gtagccagtt 5041 ttcgctttgt gtcctttggc aatcaattcc tgaacgcata cgcttgatgg gcacttacat 5101 attgatcaaa cgtggacttc tcaactttgc cttcttctat cttcacaaac cagggcattg 5161 ctttttacca aaactagggg tgtaagggtg tgagggtgta gggg // LOCUS NODE_5407_length_5172_cov_4.426813 5172 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5172) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5172) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5172 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..282 /locus_tag="DP116_26875" CDS <1..282 /locus_tag="DP116_26875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206351.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_26875" /translation="GGLPMSPTEPLQDTSTLSAWVRLRPKAANSAKTTISPQPTAVSN STKVAAATPIVEATGWIVDKNGNIELVAQAPGVNPHSSWQTPASCQNSK" gene complement(291..644) /locus_tag="DP116_26880" CDS complement(291..644) /locus_tag="DP116_26880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006516528.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26880" /translation="MTTVRFQADADLKQAIVTGTIRREPNIDFQSANTAGLEGKTDKE VLAIAALQEKILVTHDRKTMPVEFAEFIMSQTSSGVLIISQNMPISDAIEALILIWEA STAEEWINQIMSLPL" gene complement(641..973) /locus_tag="DP116_26885" CDS complement(641..973) /locus_tag="DP116_26885" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009625295.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26885" /translation="MTASPTVAKRYIEQRDKGFWIEGTRVSLDSVVYAFLNGESPESI AQNFPLLSLEQIYGAIAFYLANRKLIDVYLKEGEKEFEKLQQSLGEKNPSLYEKLKAA LMQKQSKT" gene complement(1103..2617) /locus_tag="DP116_26890" CDS complement(1103..2617) /locus_tag="DP116_26890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015157421.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="HlyD family secretion protein" /protein_id="PRJNA477356:DP116_26890" /translation="MISNSNSDFLPPPVQDNEFLPPMSSWVTMGGLVMVGILGISVIV ASVAKYKVTVKAQALVRPAGELRIVQAATEGQVMHVSVKENQLVKKGTEIATIDDSKL QTKKSQLQSNIQQSQLQIVQINAQISALNNQIAAETDRTKGAVDSAVAELSRRSRDYR DKQITSNKDVEEAEANVKQAEAELQKAQAQLQSTSANFKSSQAALNAAKLKRNRYQSV AKQKALSYNQLEEAQLVVEQQSSAVEVQQAAIEVQLQAIEQQKQAISAARAKWQRAQA ALNPSKAEVAIAQQTIAREQATGKVTLATLDKERQALIQQRFEKSKQLERDARELQQV IIDLNQTTITATSDGIISKLNLRNPGQTVRSGEEIAQIVPSNAPLVVKAAVAPQEKSK LKQGQKVLLRVSSCPYTDYGTLKGVVSQISEDTIKPQANSATATAAATGSSQQGGAVG TFYEVTIQPEVLSLGKGKNQCAIQVGMEGRADIISKEETVLQFFLRKARLIADV" gene complement(2934..5099) /locus_tag="DP116_26895" /pseudo CDS complement(2934..5099) /locus_tag="DP116_26895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877902.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="peptidase domain-containing ABC transporter" BASE COUNT 1333 a 1141 c 1219 g 1479 t ORIGIN 1 ggcggactac ccatgagtcc cactgaacca ttacaagata caagcactct atcagcttgg 61 gtgagattaa gaccaaaagc tgcaaactca gctaaaacaa caatttcgcc acaaccaaca 121 gcagtctcaa atagtactaa agttgctgct gcaactccaa ttgtggaagc gactggttgg 181 atagtggata agaatgggaa tatagaactc gtagcgcaag cacctggtgt caaccctcac 241 agttcttggc aaactcctgc ctcttgtcaa aattcaaaat gattctgaca ttataaagga 301 agagacatta tctgattaat ccattcctcg gcggtagaag cttcccaaat gaggataagt 361 gcttcaattg cgtcactaat tggcatattc tgggaaataa ttaaaactcc tgaactggtt 421 tgtgacatta taaattcagc aaactcaaca ggcatagttt tgcggtcgtg agttacaaga 481 attttttcct ggagtgctgc tatggctaat acctctttgt ctgtcttacc ctcaagccct 541 gctgtattag ctgattgaaa atcaatattt ggttccctac gtatcgtacc agttacaatt 601 gcctgcttga gatcggcgtc cgcttgaaat cgaactgtcg tcatgtcttg ctctgctttt 661 gcataagagc cgctttcaac ttctcataca aagacggatt tttctctccc agagactgtt 721 gcaatttctc aaactctttt tcaccttctt 
tcaaatagac atcaattagc tttcggttgg 781 caagataaaa ggcaatagca ccataaatct gttctaggga gagtaatgga aagttttgag 841 caatactttc aggagactca ccattcaaga aagcgtaaac aactgaatca agagaaacgc 901 gagtcccttc aatccagaag cctttatccc gctgctcaat atatcgtttt gctacagtcg 961 gtgatgctgt cataaaactg tttgctggtt actcaatatt tagatattaa cttgtaccag 1021 acgcgaaata cgaagtctct acacccaaat tcatactgtg cttgagcaac gcccaacaga 1081 gtttgctcga cattcaagaa tgttacacat cagcaatcaa tcttgctttc ctcaaaaaga 1141 attggagtac agtctcttct ttggaaataa tatcagctct gccctccatc cctacttgaa 1201 tagcacactg gtttttgccc ttaccgagag ataagacttc tggctgaatc gtcacctcgt 1261 agaaagtacc aactgcaccc ccttgctgac ttgaacctgt tgcagcagca gttgcagtcg 1321 cactattagc ttgaggttta atcgtatctt cggaaatctg gctgacgaca cctttaagag 1381 tgccgtaatc tgtataagga caggaagaaa cccgcaaaag tactttttga ccttgtttca 1441 acttgctttt ctcctggggt gctactgctg ctttgactac caaaggtgca ttactgggga 1501 caatttgagc gatttcctct ccagaacgca cagtttgacc agggttccgc aagtttagct 1561 ttgatattat accatctgat gtggcagtta tggtggtttg gttgaggtca attatcactt 1621 gttggagttc acgggcatct cgctctagct gtttgctttt ttcaaacctt tgttggatga 1681 gggcttggcg ttctttgtct agtgtcgcga gggtgacttt tcctgtcgct tgttctcggg 1741 caatggtttg ttgggcgatc gccacttctg ctttactagg attaagagca gcttgagcac 1801 gttgccattt cgctctcgca gccgaaatcg cttgtttttg ctgctcaatc gcttgaagtt 1861 gtacctcaat tgctgcttgt tgcacctcaa ccgctgacga ttgctgctca acaacgagtt 1921 gtgcttcttc caattgatta taagaaagag ctttttgttt ggctacgctc tgatatcggt 1981 tccgctttaa tttggctgca tttaacgcag cttggcttga cttgaagttc gcagaggttg 2041 attgtaattg tgcttgtgct ttttgcaatt ccgcctcggc ttgctttacg ttcgcctcgg 2101 cttcttccac atctttgttg ctggtaattt gtttgtctcg ataatcgcga ctgcggcgac 2161 tgagttcagc gactgctgag tcaacagccc ccttagtacg gtcagtttct gcagctatct 2221 ggttattaag agcactaatt tgagcgttga tttgaacgat ttgtaactgg ctctgctgga 2281 tattgctttg tagttggctc tttttggttt gcagcttcga gtcgtcgata gttgcaatct 2341 ccgttccttt tttgacaagt tgattttctt tgacagatac gtgcataacc tgaccttcag 2401 ttgcagcttg cacaatccgc aattcaccag ccggacggac 
gagtgcttga gctttaacgg 2461 tgactttgta tttagcgaca gaagcaacga tgacgctgat tcccaaaatg ccaaccatca 2521 ccagtccgcc catagtaacc caggaactca tcggtgggag aaattcgttg tcttgaactg 2581 ggggaggcag aaagtctgag ttggagttgc taatcataat tcaagttagt ttaactaatt 2641 gcgtggaatt ttccgtctca aaacctcacc ctcgcttttt gcttcgcaaa aatctttccc 2701 tctccttagc aaggagaggg atgtccgtga ggacagggtg aggttttccc agaatcttgg 2761 gtaatttgat cacttgtgtg tacagggtag ttggtaagga gaggggtgcc gtaggcgggc 2821 tgaggtgaga cagcgctgca ggagggtctc cctccgtagg cgactgcgaa cccggagggt 2881 gatacgtatg aatgcaactt agtattagaa tgagaaatag aacctttcat catcaaacca 2941 ttcgttggtt gggcaacgtc gtccaggaag tctaagtgct cgcctctttg atgacgcagt 3001 tgctgtggtg tgccttggat ttttaagcgc ccttcctcca gtaggacaat ccaatcggct 3061 cgctggatga ctctggggcg gtgactaatc ataattgtgg ttttgcctcg tcggtgggaa 3121 agcagttgat ctagtacttg agcttcactt accgggtcga gagcaccagt ggattcatct 3181 aaaatgagta tgggtgggtc agtgactatt gccctggcta tggctaatct ttgcttttgt 3241 cccccagaga gatttgcacc aaattctcct aaaacagttt gatatttgtc gggcaattta 3301 ctaataaatt catctgcacc agcaatctgg caagcttcga caatttgttt aaaagtaacg 3361 ttaggataac taaagcggaa gttgtcaatg atggaacgac tccagaagtg agcttcttga 3421 ggcacaagta ccacctgttg tcgcagacat tccacagaaa ggtcttgctg attgtacatt 3481 ccatagcgga tattgccaga ctgggaggag tataaaccag caatcaattt ggcgagggta 3541 cttttaccac agccagattt accaatcaag gcaatgactt gaccaccagt gatagtcagg 3601 gaaaaatctt ccaggaggtc aactctacct gcatggtgga agttgaggtt ggtgcaggtg 3661 atatctgcgt tagctagtat ctctgcaatc ggctttttca agtcgttttc gtcttctgga 3721 gtcgcatcta tcacttctgt caagcgttgg acgacaattt gagcagtgat aaattcatca 3781 attagcccaa ctacagaccc caaaaagcca agaaaactgc cactcatgcc gttatacgcc 3841 agcagttgtc cgatggataa agtctgatta atcactaagt gcgagccgaa ccagagtaag 3901 gcaatattcg tgaaattgga aagaatccca gtgattgtgc tgctgtaaag tcctagcttc 3961 atggtagtcc agcctaagtt agcaaggcga ccaaaattgg tttgatactc ttgccaagct 4021 tgttctgtgg cttgggtggt tttgaggact tgaacaccgc gaaaggtttc aaccagaaag 4081 ccttggtttt ctgtgccttt gacgatgagg ctgcgggttt tctgacgcaa 
agcggggaga 4141 aaaagcaagt tgacgagagt gacaatgaca aaggcggcga gggaacctag agtgagttgc 4201 caactataga agagcataaa gcccaaggaa accaaagcca caaaaaattg actgggtaaa 4261 ccgaggacaa ttttggaaac tagggcattg atggtatgaa catcggcaat gcggctaaca 4321 acttccccac tgcgacgtgc ttcaaaatat gataggggta agcgcaaaag ttgtcgcccg 4381 tattctagaa tgagtccgta ttgcaaccgt tgaccaaaat gaccgaccag gtgagattgt 4441 accaaaccaa tggtactttg gaataagttc atggcaatga ctccaattgc cacggtggta 4501 agcagttggg tatctcctcg cacgagtacg tcatcggtga ggagttgcat cattaagggg 4561 gaggcgaggg agaggagtcc aatggcaaag ttgatggcga tcgcctgaac caaaagattg 4621 cgatatggca tgacgcgggc aataaagcgc ccaaaaccgc caattttatc ttctggttgc 4681 tcatagaagc ggctatcgtc tggcattagc aacagcataa tgccatcact ccaaccctcg 4741 attaactcct tggaggtgag gtacaagata ccaacggcgg ggtcagagat gatatatttt 4801 ttcccctttt tcccgtataa tactacccag tggtaacctt tccaatgaat gatagctggt 4861 aaggggactt catttaattt ttcaagtagt tctggggtgg cttgaacttg acgagcatgg 4921 aatcccaggg tttgtgcacc tcgtcttaaa cctagcaagg tggttccccg tgatccggtg 4981 cccactgctt ctcgcacgcg gttgagggta aaggtgcgtc tgtagtgttt ggcgactgtg 5041 gcaagacttg ctgcaccaca atcttcttca ctgtgttgca atatgctgtg atatttcatt 5101 gttcaaaaaa tcagtgaaca gtgagaccag tactgcagga gggtttccct ccgtaggtaa 5161 ctggcgaacc cg // LOCUS NODE_5428_length_5135_cov_4.790354 5135 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 5135) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 5135) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5135 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(362..505) /locus_tag="DP116_26900" /pseudo CDS complement(362..505) /locus_tag="DP116_26900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314894.1" 
/note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="histidine kinase" gene complement(675..2075) /locus_tag="DP116_26905" CDS complement(675..2075) /locus_tag="DP116_26905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314891.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="PRJNA477356:DP116_26905" /translation="MLDIETLRQVPLFSKLPSEQLQWLLEEGTEVWREPGEIHRREGD PADHVFILLEGQVRITQKVGNQEILLAMYEPKTLFGELPVLMGQTHFWACGRAVTRSH ILEVPNQVFWEMLSSCTCVMTEILRTMAERLQAVQTISQHREKLVALGTLAAGLAHEL NNPASACRRAVGQLRQTRQVLQPLTVKLNQKQMTCEQKAFVAKLQEDAIARAKTATRL DPLTQSDREDEVIQWLEAHNVSNSWKLAPTLVTAGLDIEWLENVKKNVGELLLGNVLT WTDITLQEWGLLDELDHCTTRISTLVDAVKDYSYMDRAPLQEIDVHEGIESTLTILNH KLKHSHVTVTREYDCEVPLITAYGRELNQVWTNLIDNAIDAIHGQDGQIWIRTSCESD FLLVEIADNGPGIPPEMRSHIFEPFFTTKGVGEGTGLGLHIVYRIVVEQHQGDIRVLS QPGDTRFQVRLPINLS" gene complement(2133..3539) /locus_tag="DP116_26910" CDS complement(2133..3539) /locus_tag="DP116_26910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314894.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_26910" /translation="MLHDSNQMTLFPKLPDDALEEMKQFGKEIQLNVGDVLFSEGDSK YHFYVVLEGQIEVTKQVGVETKVLAIHRHGEFMGELSMLTGSGAIANAHAIAPSRVLQ IDVETFRHILVECSPIADMILSAMAGRTKDVEAQLRQQEKMAALGKMSAGLAHELNNP GAAAQRAASQLRGNFKNLQTLALQLNLLSKDQLKFITDIQNQATEHAIHSPKLDPLTC SDKEDEVTEWLEDHDISNSWKLAPTLVTAGLYTEKLDTVADNIPIDCLEHVLKWLDAT LSSFGLIHEIEQSTTRISELVKAIKGYSYMDQAPLQEIDIHEGIENTLLILHHCLKKG IIVNREFDRTLPKICAYASELNQVWTNLIDNAIYAMKGKGDLTIRTYRENNCLVVEIL DTGSGIPPAIQSRIFEPFFTTKGVGKGTGLGLEIAYRIVVNKHHGDIRFESQPGHTRF RVHLPIQAENRSCQGVSE" gene complement(3589..3975) /locus_tag="DP116_26915" CDS complement(3589..3975) /locus_tag="DP116_26915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314895.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_26915" /translation="MRLLLDTHTFIWFVIDSPRLSIVVRGLIEDENNEKLLSIVSVWE IAIKQSTGKLSFGVPFQEFVEQQLSLNSIDLLNINLDHLAVVATLPLQHRDPFDRLLI AQSIVEKIPILSIDSAFDAYPIERLW" gene complement(3975..4205) /locus_tag="DP116_26920" CDS complement(3975..4205) /locus_tag="DP116_26920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358081.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system prevent-host-death family antitoxin" /protein_id="PRJNA477356:DP116_26920" /translation="MQEITLAEASKNLSELIEAAMSGEEVVITKDSQPVVKLTPVTPV KKRRPAKAGSAKGLITISDDFDEPLEDFKDYM" gene complement(4304..4879) /locus_tag="DP116_26925" CDS complement(4304..4879) /locus_tag="DP116_26925" /inference="COORDINATES: protein motif:HMM:PF13302.5" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="N-acetyltransferase" /protein_id="PRJNA477356:DP116_26925" /translation="MPHNNSTMMKLITPETTLETSRLLLEPLVASHAVHIYRQLKDER LYQFIPQEPPISLQVLQTRYRALSSRLSPDGQEAWLNWAVSLRQFETYIGTLEATVYA NQTAAIAYMIFPPFWQQGYAKEGCLQLLNHLFNDYQVNIVAAEIDTRNVASIELIKSL GFKRVSTKENADFFKGCVSHEYRYECMSSPT" gene complement(4881..4964) /locus_tag="DP116_26930" /pseudo CDS complement(4881..4964) /locus_tag="DP116_26930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314896.1" /note="internal stop; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system prevent-host-death family antitoxin" BASE COUNT 1452 a 1119 c 1016 g 1548 t ORIGIN 1 ggcacccctc tccttaataa ggagaggggt gcccggaggg cggggtgagg ttcttcgtta 61 tgataaaaaa acaagatccc cgacaacttt tacgaagtcg gggatctcaa tcaataggag 121 ttcattttca tttctgtttt taaaaacgct gtaatttttt tcatattgtc tgtctggaat 181 gactcataga gatgatgtag ataaatttgt actttaactc agaggatgaa aatatgacga 241 cgactccaaa acaaaatcca caaccaaaag ataaaccaaa aaagaccgag ccaggtctta 301 tcagttccga aaagaaaaag cccagtccgt aaaaatttta ctcagatacc ccttgaaatg 361 attaattcac agcttgaata ggcaaacgta ctcgaaaaga tgtatgacca ggttgggatt 421 ccaagtcaac gtcaccgtga tgtttgttaa caacaatgcg ataggcaatt tctaaaccca 481 acccagtgcc ttttccaact tcttttcgcg gcggacatag tgtgaattat ttatgccgtt 541 ctacttatgt acaattcatt aacatagaca gacaaagaaa gttttggatg tgccatcaat 601 ccaaaccagg cgaacacgca gccaaaaaaa gcagcgagtg actattgcca aatgttttct 661 caacaatcaa taattcatga caaattaatt ggtaaccgca cttgaaacct agtatctccc 721 ggttgagaca acacccgaat atcaccctga tgctgttcta caacaatgcg gtagactatg 781 tgcaaaccta acccagtccc ctcaccaact cctttggttg taaaaaatgg ctcaaaaata 841 tgagaccgca tctctggagg aattcccgga ccattatcag caatttctac gagtaaaaag 901 tcgctttcgc aacttgtgcg aatccagatt tgaccatcct gtccatgaat agcatcaatg 961 gcgttatcaa ttaaatttgt ccagacttga ttgagttctc 
taccgtatgc agtgattaac 1021 gggacttcgc agtcatactc tcgcgtgacc gtcacatgac tatgctttaa cttgtgattc 1081 aaaatggtga gagtactttc tataccctcg tgtacgtcta tttcctgtag gggtgcccga 1141 tccatgtagg agtaatcttt gacagcatca actagagtag agattcgggt tgtacaatgg 1201 tccagttcat ctaacaaacc ccattcttgt aacgttatat ctgtccaagt taggacatta 1261 cccaataaca attcaccgac atttttctta acattctcta gccattcgat atcgagtcct 1321 gctgtcacta gtgtgggagc cagtttccag ctattgctaa cgttgtgtgc ttctagccat 1381 tggattacct catcttcgcg atcgctctgc gttaatggat cgagtctggt tgctgtttta 1441 gcacgggcga tcgcatcttc ctgcaattta gcaacaaacg ctttttgctc gcaagtcatt 1501 tgcttttgat tcagttttac cgtcaatggc tgcaacacct gcctagtttg gcgtaattgt 1561 cccacagccc tacgacaagc tgaagcaggg ttattcaact cgtgtgccaa tcctgctgcc 1621 aaagtgccta atgctactag tttttctctg tgctgtgata tcgtctgtac tgcttgcaac 1681 cgttctgcca ttgtgcggag aatctcagtc atgacgcaag tacaacttga aagcatttcc 1741 caaaatacct ggtttggcac ttccaaaata tggctacgag ttacagcacg accgcaagcc 1801 cagaaatgtg tctgacccat taacacgggt aattcaccaa acagcgtttt tggttcgtac 1861 attgccaata aaatttcttg attaccaact ttctgcgtaa tccgtacttg accttctagc 1921 aatataaaaa catggtcagc agggtcacct tcacggcgat gaatttctcc aggttctcgc 1981 cagacctctg taccctcctc aagtaaccat tgcaactgtt cacttggtaa tttggaaaag 2041 agaggaactt ggcgtaaagt ttctatatca agcatgatac accttgaatt tgtaagtatt 2101 gaattaatag ctaaaatatt gttttacatt acttattctg aaaccccttg acatgaacga 2161 ttctctgctt gaataggtaa atgcactcga aagcgcgtat gaccaggttg ggattcaaag 2221 cggatgtcac catgatgttt attaacgaca atacgatagg caatttctaa acccaagcca 2281 gtgccttttc ctactccttt tgttgtaaaa aatggttcaa aaatccgaga ttgaattgct 2341 ggtggaatac ctgaacctgt atcaagaatt tctacaacaa gacagttatt ttctctataa 2401 gtacgaatgg tcaaatcgcc tttgcctttc atggcatata tggcattgtc aattaaattt 2461 gtccacactt ggttcagttc actggcataa gcacaaattt ttggcagagt gcggtcaaat 2521 tctcgattaa caataattcc ttttttcagg caatgatgga ggatgagtaa agtattttca 2581 attccctcat ggatatctat ttcttgtaga ggggcttgat ccatataaga atagccttta 2641 atggctttta ctaactcaga aatacgggtg gtactttgct caatttcatg 
aatcaaccca 2701 aaagatgaaa gtgtggcgtc aagccatttg aggacatgtt caagacagtc aatggggata 2761 ttatcagcaa cagtatccag cttttcggta tatagcccag cggtcactaa agtgggggca 2821 agtttccaac tattactaat atcatggtct tccaaccatt cagttacttc gtcttcttta 2881 tcgctacaag tcagagggtc aagtttagga gaatggatgg cgtgttctgt tgcttggttt 2941 tgaatatcag taatgaattt tagttggtct ttagaaagta aattgagttg taaagcaaga 3001 gtttgtaagt ttttaaaatt cccgcgcaat tgactagctg cccgttgcgc cgccgcccct 3061 ggattattta attcgtgtgc taaacctgct gacattttac ctagcgctgc cattttttct 3121 tgctgtcgta gttgcgcttc tacgtctttg gttcgtcctg ccatagcgga gagaatcatg 3181 tcagcaatcg gcgaacattc aacgaggata tgtctgaatg tttcaacgtc tatttgtaaa 3241 acacgactgg gtgcgatcgc atgagcatta gcaatagcac ccgaacccgt cagcatcgaa 3301 agttcaccca tgaactcacc atgacgatga atagccagca ctttcgtttc aacgcccact 3361 tgcttagtga cttcaatttg accttctaag acaacataaa aatgatactt tgagtcgcct 3421 tcactaaaca gcacatcgcc cacattcagt tgtatttctt taccaaactg cttcatttcc 3481 tccaaagcat catctggcaa tttaggaaac agcgtcattt gatttgagtc gtgcagcatg 3541 gcaactcctt atatgaaatt atctttaatt agatttaata attgctacct accacagccg 3601 ttcaatagga taggcatcaa aggctgaatc aatactcaaa ataggtattt tttcgacaat 3661 agactgtgca atcaaaagtc ggtcaaatgg gtctctatgt tgaagcggta gggttgcaac 3721 aacagcaagg tggtcaagat tgatgtttag caaatcgata ctattaagac taagttgctg 3781 ttctacaaat tcctgaaatg gcaccccaaa acttagttta ccggtgcttt gtttaattgc 3841 tatttcccaa acactgacta tactgagtag tttttcgtta ttttcgtcct caattagccc 3901 tcgtactaca atactgaggc gagggctgtc gatgacgaac caaataaagg tatgagtgtc 3961 cagcagtagc ctcattacat atagtcctta aagtcttcta aaggttcgtc aaaatcatct 4021 gagattgtga ttaacccttt agcactccct gctttagcag gacgacgttt tttaacaggc 4081 gtaactggag ttagtttgac aacaggctga ctgtcttttg taataactac ttcttcacca 4141 ctcatagcgg cttcaattaa ttcagacaaa tttttagatg cttccgcgag agttatttct 4201 tgcatagcta tcacctctgg gttcctaatt catacttaca tcaaaaattg gttatttgat 4261 gcaacaactt cagtttagtt tatcgaactg gcagatgagt gagttaagta ggagacgaca 4321 tacactcata tcgatactca tggctgacac atcctttaaa aaagtccgcg ttttcttttg 
4381 tagatacgcg cttaaatcca agtgatttaa tgagttcaat ggatgcgacg ttacgagtgt 4441 ctatctcggc tgccacaatg ttgacttgat aatcgttgaa aagatgattt agcagttgta 4501 aacatccttc ctttgcataa ccttgttgcc agaatggtgg gaaaatcatg taggcgattg 4561 cagctgtttg gttggcatat actgttgctt ccaatgttcc aatatatgtt tcaaattggc 4621 gaagactcac agcccaattc agccatgctt cttgcccgtc aggagaaagt cttgaagata 4681 gcgcacgata tcgagtctgc aaaacttgca atgatatggg aggctcctgt gggataaact 4741 gataaagcct ctcatccttc aattgcctgt agatatgtac tgcatgagat gccaccagtg 4801 gctcaagcaa cagccgtgaa gtctcaagtg ttgtttcagg tgtaatcaat ttcatcattg 4861 tactgttgtt atgaggcatt aatgataatt tcttcgccac tgatgactgc ctcaattaat 4921 tctggcaaat ttttagatgc ctcaacgaga gttatttatt gcatggttgt cacctcttat 4981 tcctaactaa atgttgacat caattcagtt cactgttcac ccttcgggta tgcgcaaagc 5041 gcacgccaag ggcgttcgcg cagcgtctgg tggaggagat acgccagtcg cctacggcgg 5101 gaaaacgcct catgcgcgct ggactcactg ttcac // LOCUS NODE_5532_length_4988_cov_5.0881824988 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4988 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 776..973 /locus_tag="DP116_26935" CDS 776..973 /locus_tag="DP116_26935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315307.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26935" /translation="MFATLIVAWILYTVLVKVVRTTTRTAFISATTVVILHSGLGISP QDIWHQIIQLLQAFSQVLRVR" gene complement(1203..1703) /locus_tag="DP116_26940" CDS complement(1203..1703) /locus_tag="DP116_26940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129134.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl-CoA thioesterase" /protein_id="PRJNA477356:DP116_26940" /translation="MSEEKSQQPQLKSTTAIDSPTSGSFGNWFEYSIRVYPHHTDYAG IVWHGSYIAWLEEARVACLRAIGIEYADLVALGCDLPVVELSVRYHRQLQLGATAVVR TRMAEVTGVRINWDYAIESPDKQELYVTAQVTLVAVDRERGKIMRQLPTTVMDALVRL SGSKNN" gene complement(2561..2905) /locus_tag="DP116_26945" CDS complement(2561..2905) /locus_tag="DP116_26945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315304.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1815 domain-containing protein" /protein_id="PRJNA477356:DP116_26945" /translation="MFLRLAQQHRQFVQDLVMNLQALAIVLERRGYPASCYTCGDQMN SASFMVSLGENHLIRFLVSDYGITWTEMRDDRELMKLEGAEAVNQLQELANIVKHPVP TSVGNKLLAKRC" gene complement(4131..4346) /locus_tag="DP116_26950" CDS complement(4131..4346) /locus_tag="DP116_26950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26950" /translation="MGEAKRRKTTLGEKYGQDTTRILPWIPISKTQAEKFVKWSNRLT WIGIGGLVAAWVIVRFIGPAFGWWEVV" gene complement(4430..>4988) /locus_tag="DP116_26955" CDS complement(4430..>4988) /locus_tag="DP116_26955" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315414.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="ATP-dependent DNA helicase" /protein_id="PRJNA477356:DP116_26955" /translation="PNTPEFQPAFLHKVMTLVCLSATAPGLTVVLVGDVPLKAQVGTI LASEFGSRVQVEKTCLDENGILVSGWEFWREHQKVLPAPQLLAIATLPLPSLENPLVA GRVAHYKSLHQDWFRLYLLPIAMNELQRAIAPLRESKGIVALLDSRVIIRSYGAQVLA ALSPLARINYLDPNLFSSVTEEDSA" BASE COUNT 1397 a 1087 c 1025 g 1479 t ORIGIN 1 attatactga gaaagaataa attatttttc cttgcataag ttttctctat agaatttctg 61 aaaatcctat caaaagtaaa aatcaaaatt taatattttt ttgaaagaag gttttgcggc 121 atactaattt ttcaaatttt ttattgtata gtgagcgcag aacaactaca ggaaaaagct 181 caagtaaact caacattctc agcaaaacaa tgctgagcaa tgcagagctt tttttctgtg 241 gaggttccac gtcagttgct actttagtgg tgccaactgg ggttgcggaa tactgaattt 301 aaggctttca catctgtact ctttgagcag taagttgatc cgatactgcg gactatggtt 361 gtggaattcc acgtcaaacg tcactcgcaa caacggggaa aacctccctc cgggtgtgcg 421 cggagcgcac gccttacggc gaacgccagt cgcctgcgga ggagccagtg cggtgaggca 481 gtacggtctt ggggtctccc caagtgaagt aactgccgaa cccgtaaggg tcttggggag 541 ccagtgcggt ggacggcttc gccggcataa agcatctggc gtgtctcccc aagtggagca 601 tctggcgttg gaaaccctcc cgcagcgctg gactcagtgc aaggcagtgg ctcctcaacg 661 ctcagcacta ctgaattcct gaagagagac tttatgtctg tcacggttac actcataaaa 721 ttttctctct tctaatagct tcattcgtaa gactctattt aaaaaaggtg tatctatgtt 781 tgctaccctg attgtggctt ggattctgta tacagtatta gtaaaagtcg tgagaacgac 841 aacgagaact gcgtttatta gtgctactac tgttgttatc ttgcattctg gtcttggaat 901 tagccctcaa gatatttggc atcaaataat acagctactt caagccttct ctcaagtcct 961 tagggttagg taatatgact tttggatgtt gcccgtcatg 
agtcatcagt tatttgttct 1021 ctgttcgacc acgttcgcat atctcgtttg gaggcgctga gtcccaaggg gacacgctgc 1081 gcgaacgcgt atggctgcaa cataggcgaa gcgttagcga gtgagacggg cgcgcctcct 1141 ctagaaatca ggctgccgtt tcattaaacc ggatttttca cttctttaat aaaacggatg 1201 agttaattat tttttgagcc tgaaagccgt accagggcat ccataacagt agtcggtaac 1261 tgacgcataa tcttacctct ttctcgatct actgccacta aagtcacctg agcagtaacg 1321 tataattctt gcttatctgg tgattcaatt gcataatccc agttgatacg gacgccagtc 1381 acctcagcca tgcgcgttct tacaaccgct gttgcaccca actgaagctg acggtgatag 1441 cgcacagaga gttctacaac tggtaaatca cagcccaaag ctaccaaatc agcatattcg 1501 ataccgatcg ctcgcaagca tgcgactcgt gcttcttcca accaagctat gtaggaaccg 1561 tgccagacaa tacctgcata gtctgtatga tgcgggtaga ctcttatgga atattcaaac 1621 caatttccaa aggaaccgct tgttggactg tcaattgcag tggttgattt cagttgtggt 1681 tgttgtgatt tttcctctga catcttaaat tttttggaga caaggataaa tgttccaaat 1741 ctggaatagc ataaagtgac acgtcttgtg caatttcttc ataagttgac actccgacga 1801 ctatcaaact attatccaag gtaaagaatc tgaggttcta tgtcgaacct caaacttggt 1861 gccttctgca ctcgtgtatt tccttaggtc aggaaatttt aagttttggt tattagattc 1921 taaaaaaact ggggatcaat ctgctattgc cttgttgtgg gttgtagatg atggcaggac 1981 gcagctcttt gataagtttt ttttatacct tgctaccctt aggcaagtgg acgaaaaaag 2041 ctcaagcttt acgacaagca caaatcgctt tgattataga aactaaactg acacacactc 2101 aacactaagc cttgggtgac agcctcagtc actcctacta ttaggcacta tttgttttta 2161 ttggaaacgg tttgtcattc ttttcgctgg cttgctgatt gacattgata cgctgctggc 2221 aattgttaag atgctgatta agactccaca ggcataacga gccaatatca caatcaaaat 2281 gacagcaatg tcaatcaagt tctgagatca cctatgaaag ttatgagcct ttaactagcg 2341 taataaaccg ctagcaacat atgaaacaaa caccgttaag agttaagagt taagcgttcg 2401 ggtgttaaga gttaagcgtt ctctgttccc tgttaagagt tctctgttcc ctgttaagag 2461 ttccctgttc cctgtgtttc ttaagagtta cgagttatta gccactgtta cgagttctta 2521 actgttaact tttaactctc aactcttaac tttttactgt ctaacaccgc ttagctaaaa 2581 gtttattacc tacgctggtt ggtacaggat gcttgactat attggcaagt tcctgtagct 2641 ggttgactgc ctctgcaccc tctagcttca ttaattcgcg gtcatcccgc 
atttctgtcc 2701 aagtgatccc gtaatcagag actaaaaacc gaatcagatg attttctccc aggctaacca 2761 taaatgaagc actattcatc tggtcaccgc aggtgtagca ggacgcagga tatcctcgcc 2821 tctctaagac aattgctaaa gcttgcaagt tcatcactaa gtcttgcacg aattgccgat 2881 gttgttgtgc tagtctcaaa aacatttttc cctccaaaca ccacagttat ttgcttcctt 2941 ttatattttc tttagattct tgcaccaatt ggtattgtaa actatccatt acaattcaca 3001 gatatatatt tatgaattgg ctacctagct tagggtagtc gatgtaagtt ttgcgtgccg 3061 tatcaaagta ttcagtttgt agatctgaaa tatcggatag aattgctaac tttgtgtttt 3121 ttttacagat taccgtcagg ataaatggta agttgaaaca attccgctat aagattaact 3181 catatttggc acaagtcaag tcgagaaaat gccgattttg tacttttatc cgctaatttt 3241 aggtagactc attacccttt gtcccaacaa ggcagacata cgtcttctca gcctgtttta 3301 agaggtgatt atctcgtcaa acgtctacac accagttaag taatgtaaag atcttaacct 3361 attggctgca acttctccat agaatctggc aaaactggta gatgctcgta gctgtttttt 3421 gattctcgct taaggttagt acttttttgt acgactcagg gctgagtctt tttactaaat 3481 aatcagtact tagtagtcat acagtgcggt agataaacca ctgcgttgct gtcagcagtg 3541 cgttgtcaag agttgttgct cggctgaaac tgcccgtcca aactatgtct tgcggataag 3601 cacttgcgct cttgagagtt tccccagttg ttctaagcag cgtctttaga ggagatagca 3661 acagctgtcc cattgtggta agtgacgtat caagctctgc aggaaaaagc caagtcgggg 3721 ggtgactatc tgtaaaggga aataccctga tccacaatcc caagtgtaaa acctaaatcg 3781 tttgacttaa cactgacaat aactgaactc aagataacca tgttgccatt tcggcagata 3841 tttttctgac tgtctttgag ttgctattta tactttttaa aagagcaatt cgagagaata 3901 acgagtcgct ttttaaaaag tgtatgctct gctaggttcg agatatacta atatgaatag 3961 gttttttaca tataatttac tctcataact tttatcagta tactgtatat actaaagtca 4021 gttaaattaa ccgaaataaa ttaacaaata aaaagaataa cacagccagg aaaacctagc 4081 tgttgtgttg acactttcaa tcgttgtcag aaacttatcc aagctcgggt ttagacaact 4141 tcccaccaac cgaaagctgg accgatgaaa cgaactatca cccaagcggc gaccaaacca 4201 ccaataccaa tccaggtgag ccgatttgac cattttacga atttttccgc ttgggttttg 4261 gagataggaa tccacggcaa gatcctggtt gtatcttgac cgtatttttc tcccagtgta 4321 gttttacgac gctttgcttc acccataatc taaaataaat ccctgaatac tgctgatgat 
4381 agcgtttgct tgaagcaata gctaaaaaat actgaataat ttagatgact taagctgaat 4441 cttcctctgt gacagaggaa aacagattgg gatcaaggta gttaatacgt gcgaggggac 4501 tgagggcagc caaaacttga gcaccatagc tacggataat cacgcgacta tctagcaaag 4561 caacgatgcc tttactttcg cgcagtgggg cgatcgcccg ttgcaattca ttcatagcaa 4621 ttggcaacaa gtacaaacga aaccaatctt gatgcaaact cttgtagtgc gccactcgac 4681 ctgccactaa ggggttttct agagatggca agggtaaagt cgcgatcgcc aaaagctggg 4741 gagcaggcaa aaccttttga tgctcacgcc aaaattccca accactcacc aaaataccat 4801 tttcatccaa acaagttttt tccacctgta cccgtgaacc aaactcagag gcaagaattg 4861 tccccacttg ggcttttagc ggtacatccc ccacaagcac aactgtcaat cctggtgctg 4921 tcgcgcttaa gcaaaccagt gtcataactt tatgaagaaa cgctggctga aactctggcg 4981 tgttgggt // LOCUS NODE_5539_length_4983_cov_4.9691564983 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4983) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4983) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4983 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 185..1009 /locus_tag="DP116_26960" CDS 185..1009 /locus_tag="DP116_26960" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206599.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26960" /translation="MSYDNVCKILAEKYPTDFARWLLAVEPRTIEVLKTELSIEPIRA DSLTFLQTENRILHIEFQTITRSKTPISFRMLDYSVRLIRQYKVPVTQVVIFLQQTSD EIAFTEQYRSETTTHRYRVLRMWEQDSVQFLNNPALLPLAPLTQTNSPQALLSQVAQS VARISDRETRQNIAAYTEILAGLKFEKDSIRQFLREDVMQESVIYQDILQKGEQKEAL RFCLSLLNERFGEIDSSILERVQVFNKEQLEALGRALFRMSSIADLLTWLDEQESN" gene complement(1108..1401) /locus_tag="DP116_26965" CDS complement(1108..1401) /locus_tag="DP116_26965" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008274494.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26965" /translation="MSKTNFSGDMSPEERHRKLVEMADEEIDYSDIAPLDEDFWKNAE LSLPQKKEGIYIKLNPRTLQWFRSRGKGYQAMIDSVLTSYVEHQEKLGVKSQE" gene complement(1376..1675) /locus_tag="DP116_26970" CDS complement(1376..1675) /locus_tag="DP116_26970" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319029.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="BrnT family toxin" /protein_id="PRJNA477356:DP116_26970" /translation="MSFEWNDAKDKQNIEKHGIRFEEAKRVFNDPFAITREDCRFDYG EIRKVTLGEIPLSTLATSIVVVVVHTERDGHTRIISARKANKKERAYYEQNKLLG" gene complement(1829..3226) /locus_tag="DP116_26975" CDS complement(1829..3226) /locus_tag="DP116_26975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952833.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_26975" /translation="MPTLLADAQNLLSDLLKRYSSRVDYLMIRLEEAEGTDIFLRGDK VESLSEGISIGGQIRACYKGGWGFSSFNQLATIRDRIEEAIAAAKIVGDEETVLAPID IVQAVCSMPLTGSDPRQISLKKKKELCDRYTQLLKSVDSRITTTSVRYGDSAQKVILA TSEGTFIEQSWVDMEMRFAATARNGETVQTGRETTGSRKAFEDLTALDEQVKEAAQRA VAALSLPSVKGNIYTVVIDPILSGLFVHEAFGHLSEADMAYENPDLLEAMTIGRQFGP KELQIFDGAAPQGHRGSYFYDDEGTPATTTQLIEDGILTGRLHSRETAGKLDEAPTGN ARCLSYHYPPIVRMTNTWIKPGKTPVEDLFSGIKEGVYARNWLGGMTNGEMFTFSAGE AWMIRNGKIAEPVKDVTLSGNVFQTLADIEAIGDDFYWDESGGCGKGGQNGLAVGCGG PSLRIRDVVVGGEAA" gene 3614..3961 /locus_tag="DP116_26980" CDS 3614..3961 /locus_tag="DP116_26980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016860775.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Dethiobiotin synthetase" /protein_id="PRJNA477356:DP116_26980" /translation="MNYETACKFLIDQTLYNEENPDALLIRLQRGQPPIPGQITSILL ALKVVFEGVKDATVIDKKLAYALYLLSIRTQQLFAAGRKAGVQWSPLLLQDLLRIATA TESIFSGEWQNLH" gene complement(4103..4983) /locus_tag="DP116_26985" /pseudo CDS complement(4103..4983) /locus_tag="DP116_26985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999243.1" /note="too many ambiguous residues; incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=3 /transl_table=11 /product="dehydrogenase" assembly_gap 4835..4844 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 1347 a 1165 c 1025 g 1436 t 10 others ORIGIN 1 atatttaccc acgattctcg ctttacccca ccccggtttt gtctcttgcc aaaacctccc 61 ctccccttgc taaggggagg ggattgaggg gtggggttaa atcagcgtgg attgaagaag 121 aaataaatct tatagatagc aactaaataa taaaattcta tcagtgtcaa ttagattttc 181 atttgtgagc tacgacaacg tttgtaaaat cttagctgaa aaatacccaa ccgattttgc 241 gcgttggttg ctagcggttg aaccccgaac catcgaagtt ttaaaaacag aattgagcat 301 tgaaccaatt cgcgctgatt cgttgacatt tctgcaaaca gaaaaccgca tcttacacat 361 cgaatttcaa actatcacaa ggtctaaaac accgatttct tttcggatgt tggattattc 421 tgtcagattg atacgtcagt ataaagtccc tgtcacgcaa gtcgtgattt ttttacaaca 481 gacaagtgat gaaattgcct tcactgaaca atatcgcagt gaaacgacaa ctcatcgcta 541 tcgcgttttg cggatgtggg aacaagattc agtgcaattt ctcaataatc ctgcactctt 601 accgttagca ccgttaacgc aaacaaactc accccaagca ttattatcgc aggttgccca 661 gagcgttgct agaatttcgg atagggagac gaggcagaat attgcagctt atacagagat 721 attagcgggt ttgaaatttg aaaaagattc aattcggcaa tttttacggg aggacgttat 781 gcaagagtca gtgatttatc aggatatttt gcagaaggga gaacaaaaag aagcacttag 841 attctgtctg tctttactta atgaacgttt tggtgaaatt gattcgtcaa ttcttgagcg 901 agttcaagtt ttcaataaag aacagctaga agctttaggt agagcgcttt ttagaatgtc 961 atcaattgct gatttactga cttggttaga cgagcaggaa agcaattaag taagtgggtt 1021 taatgaaata taagatgtag tgcgggctgt cctgcccgcg agcgagacgc tcgcactact 1081 gctatctacg aattaattac cttgatatta ctcctggctc tttaccccga gcttttcctg 1141 atgctcaacg taggaagtca gcaccgaatc aatcatcgct tgatacccct tccctcgact 1201 gcgaaaccat tggagcgttc gagggttaag ttttatgtaa atcccctcct tcttctgggg 1261 taacgaaagc tcggcatttt tccagaagtc ttcatcaagc ggtgcaatat cggagtaatc 1321 gatttcttcg tccgccattt cgacgagctt tctgtgacgt tcctcggggg acatatcacc 1381 cgagaagttt gttttgctca tagtacgccc tctctttctt gttcgctttg cgagccgaga 1441 tgatccgagt atgcccatca cgttcggtgt gaacgactac caccacgatg ctcgtagcaa 1501 gtgtgcttaa 
cgggatctca ccaagggtga ctttccgtat ctcaccatag tcaaatcggc 1561 aatcttccct ggtgattgca aaaggatcat tgaataccct cttggcttcc tcgaagcgta 1621 tgccgtgttt ctctatgttc tgcttgtctt ttgcgtcgtt ccactcgaaa gacatacatc 1681 aaaatatata tgattatata tattttcttg aaaacagggc aatcttgcat aactgtgggc 1741 tgctgcctca atataattat ggaaggcaga gcctcctgta tggcactccc acgcagagca 1801 tgggagcgag agagcgagca aagaacctct aagccgcttc tccccccacc accacatcac 1861 gtatccttaa actaggacca ccgcaaccta ctgctaaacc gttttgtcca cctttaccgc 1921 aaccgccaga ctcatcccag taaaagtcat ccccgatcgc ctcaatatcc gccaaagtct 1981 gaaaaacatt ccctgaaagc gtcacatcct tcacaggttc agcaatctta ccgtttctaa 2041 tcatccacgc ttccccagcg ctgaaggtga acatttcccc attcgtcatt ccacccaacc 2101 agttacgggc gtaaactcct tcttttatac cgctaaataa atcctcgaca ggcgtttttc 2161 ctggtttaat ccaggtattt gtcatccgta caattggggg atagtgataa ctgagacagc 2221 gtgcattgcc cgttggcgct tcgtccaatt tgccagccgt ttcacgagaa tgtaaacgtc 2281 ctgtcaaaat tccatcttct ataagttggg ttgtagttgc aggtgtgcct tcatcgtcgt 2341 aaaaatagct acctcgatgt ccttggggcg cagcaccatc aaaaatctgc aattcttttg 2401 gtccaaattg acgcccaatc gtcattgctt ccaacaaatc tgggttttcg tatgccatat 2461 ctgcttcgga aagatgtcca aaggcttcat gaacaaataa cccagaaaga atcgggtcta 2521 tcaccacagt ataaatatta cctttcacag atggtaaaga taacgctgca actgcccttt 2581 gtgctgcttc tttaacttgt tcatctaaag ccgtcaaatc ttcaaaagct ttgcgcgaac 2641 ctgtggtttc tcttccagtt tgcacagttt cgccgtttct cgctgtggct gcaaagcgca 2701 tttccatatc cacccaagat tgctcaataa aagtcccttc gctggtggcg aggataactt 2761 tttgggcgct gtctccatag cggacagagg tggtggtaat ccggctgtca acgcttttga 2821 gtaattgtgt gtagcgatcg cacaattctt ttttcttttt caacgagatt tgccgaggat 2881 cgcttccagt taagggcata ctgcatactg cttgcactat gtcaattgga gctaggacag 2941 tttcttcatc accaacaatt ttggcggcgg cgatcgcttc ttcaatgcga tcgcgaattg 3001 tcgccagttg gttaaaacta ctaaaccccc atccaccctt gtaacatgcg cgaatctgtc 3061 caccaatgga gataccttca ctaaggcttt ctaccttgtc gccacgcaag aagatatcgg 3121 taccttcagc ctcttcaagg cgaatcatca gataatctac acgggatgag tagcgcttga 3181 gtaagtcaga aagtaaattt 
tgagcgtcag caagtaaagt cggcatttgt cggacaagaa 3241 caatgaggaa ctgccttcat tgtgcaatta tgccagtatt agctgccact gaacgaggga 3301 actgagatct tgcatcatta cgaataaacc actaatactt atatgaaata gtgtttaaat 3361 atactaaata gttccttcca ttttttttca gtccatttga atggacttcg cttattagcc 3421 tgggacttac agtcctaggc ggacgagaac aaagtcaagc agcctgtaat taatttgaca 3481 tatttgttcc aggtgcaaga tgtaagggaa ttctttttat atgacgcgtt tgtaaggcta 3541 tcaaggtaaa atgttaattc gtgagtcaaa tgttgcaata ggtaggcaaa ggtgtgatgg 3601 atgcccctcg aaaatgaatt acgaaaccgc ttgcaaattt ttgattgacc aaacacttta 3661 taacgaggaa aacccagatg ccttgttgat acgtctacag cggggacaac caccaatacc 3721 tggtcagatt acctcgattt tgttggcatt gaaagtggtg tttgaaggtg taaaagatgc 3781 tactgttata gacaaaaaat tagcttacgc cttatatcta ttaagcatta ggactcaaca 3841 actttttgca gcagggcgca aggcaggcgt tcaatggtca cccttactac tgcaagattt 3901 gctgagaatc gccactgcaa ctgaatctat cttttctggt gagtggcaaa atttacatta 3961 acagttaaca gttaacagtt atcagtctat aggattcctt tttgattttt gcgaagctag 4021 gtactgttcc ctgttccctg ttccctgttc cctgttccct gttccctgtt ccctgttccc 4081 tgttaagcgt gtttcttaag agtcagcctg atggaatcat tgcgactttt atgaccttac 4141 gatctctcat atcacaaaac acctgttcta aatcctgcaa tggacggtgt tcgctgatga 4201 gtaactcaaa gggaatcttc tgacttgcaa tcagtgacag cgccgcccga acgtactctg 4261 gagtattatg aaaaacccct ttgagagtga gttcgctgta gtgtagctgt tctgtattca 4321 cgctaatagt tgtatctcgt ggacaaccac caaataggtt aacagttgca ccaggacgag 4381 cacaagcaat tgcagtttcc cagacacttg gtacaccagt tgcttcaatc accacatctg 4441 caccccatcc ctgtgtcagt tctttcacta aactaggaat ctctgggaat tgatgataat 4501 taaaggtctt tgctgcaccc agtttttctc caatttctag cctttggtca ttgcctccaa 4561 agagtaacac ttcagctttg acatcatgtg ctaatttagc cacaaacatc agcccaattg 4621 ccccgtctcc taacaccacg acaaaatcac ctggtttgac gttggaacgg gcgactccat 4681 gtagcacaca agccagaggt tctgtcattg ctgccaattc ataaggcaag tcatcaggaa 4741 ttggcaacat gttatgttgc acaatcggtg cgggaatttt taagtattgg gcaaaagtgc 4801 cgttattcca tgtcaaattt ggacagaggg aatannnnnn nnnntcttga cgttgacaaa 4861 aaaagcaatt catgcatgga gcggaattat 
ttgcgaccag gcgatcgccc acttgccaac 4921 ttgtgacacc ctctccgaca gcgacaattt ccccagcagc ttcatgacca aacagtgttg 4981 gtg // LOCUS NODE_5565_length_4956_cov_5.6345644956 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4956) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4956) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4956 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1..684 /locus_tag="DP116_26990" /pseudo CDS 1..684 /locus_tag="DP116_26990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017651814.1" /note="frameshifted; incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=3 /transl_table=11 /product="cysteine synthase A" gene 700..1401 /locus_tag="DP116_26995" CDS 700..1401 /locus_tag="DP116_26995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317915.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="molecular chaperone DnaJ" /protein_id="PRJNA477356:DP116_26995" /translation="MADKTNHYQTLKVNPNASQAEIKQAYRRLVKMFHPDSNQGTADP EQIIRINAAYEVLGDVQNRQNYDQKLRQRSQETTTNRQQRTAAAQQQYQATKKRGKDA DEQVEEWLQRVYQPVNRLVFSILASLREQIEKLAADPFDDILLEEFQDYLHACQADFK QAHLTFRSLPNPPSLARAAAHLYYCLNQIGDGLDQLAYFPLNYDESYLHAGQEMFRTA TNMYREAQESLGAYV" gene complement(1545..2816) /locus_tag="DP116_27000" CDS complement(1545..2816) /locus_tag="DP116_27000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315409.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PrsW family intramembrane metalloprotease" /protein_id="PRJNA477356:DP116_27000" /translation="MTGLHRYKSFLRKVNHNTASDSQHTEDIVLSNKPIVIGRDPSCC DIVLDSTKYQEVSRQHLKISPLSSGSPSDLPQWEISDLESRNGIYINGQRLQGSQILQ VHDQIKLGRKGPEFVFECEQISFVGVSDIFPVVSRKLDLHQKGFLVPGIVTVIFVVAM LATQNSNNFLYIFAAYLALASHYVIHKLCHKHKPWWLLVSLGLATGLPLLTAHHELTA IFSDILPGDLFKHNENILAITIKEFLKAGLLEELFKVLPVILVYFIGRLLQSPKRELI GVWEPMDGILLATASATGFALVESMMKVHEETQSKGSFAGLTLLIPLILGDISGQVAY SGYFGYFIGLGAMKPKKLWRFIGIGYLTSAALHTIGAIISELQKEHKLDFLIGNVLLA LIGSLAYIFLMAAILKANQLSPNHSHKSANG" gene 3322..4194 /locus_tag="DP116_27005" CDS 3322..4194 /locus_tag="DP116_27005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459962.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="6-pyruvoyl tetrahydropterin synthase" /protein_id="PRJNA477356:DP116_27005" /translation="MQCIISRRAQFSASHRYWLPELSEAENIKKFGACSRFPGYGHNY VLFVSLVGELNEYGMILNLSNVKHVINRKVTSILDFSYLNDVWSEFQQTLPTTENIAR VIWQRLAPDLPLVRIQLFEHPELWADYLGNGMEAYLTVSTYFSAAHQLAHPRLSSEEN FKIYGKCANPNGHGHNYHLEVTVKGEIDPRTGMIIDLGALNLAIKEYVIELFDHSFLN KDVPYFAEVVPTAENIALCISQQLTSPIQKLGATLYKVKLMETPNNSCEIYCTESDSN SVNTAQNESVLVRV" gene complement(4266..4601) /locus_tag="DP116_27010" CDS complement(4266..4601) /locus_tag="DP116_27010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012413009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="XisI protein" /protein_id="PRJNA477356:DP116_27010" /translation="MDKLDEYRTKVKQLLTKYVQYKPSYGDVEVEQIFDTERDHYQII SIGWNNQKRVYGPIMHLDIKNDKIWIQQNTTEVNIASELMEMGIPKQDIVIGFHTPKM RQLSGFAVE" gene complement(4589..4786) /locus_tag="DP116_27015" CDS complement(4589..4786) /locus_tag="DP116_27015" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012413010.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27015" /translation="MKRQFINYRTALRTEDPNITINLAVPLAIYNDFFNLPFTQTVVS ENQLKLIIYNVDTEEIVQWIN" BASE COUNT 1489 a 1039 c 1051 g 1377 t ORIGIN 1 ctgaacgaag ggcaatgttg cgggcatatg gcgcgcaatt agaactcact acaggaactg 61 aaggcatgag tggggcaatt cgacgggcgc agcaaatcgt tgatacgact cttgatgctt 121 acatgttaca gcagttccgt aacccagcca acgccaaaat acaccgggac acgacagcgg 181 aggaaatctg ggaagatacc tcaggacagg tcgatatagt agttgctgga gtcggaacag 241 gtggtacaat tacaggtgta gctgaagtta tcaaagcacg caaaccaagt tttcaggcga 301 taagccggag gcttgacgct ccgcgtatcg cagttgaacc cgcaaatagc ccagtgttat 361 ctgggggaag accaggacct cacaaaattc aaggaattgg tgctgggttc gttcctcaag 421 tgctgaatgc caacattata gatgaagtca tttcggtcac tgatgaagag gcgatcgctt 481 acgggcggcg tctggcaaaa gaagaaggac tgctgtctgg tatttccagc ggggccgctt 541 tgtgtgctgc tattcgtgtg gctcaacgca aagagaactc gggacgcaaa atccgtgatg 601 attcagccta gttttggaga aaggtatttg agtacaccgt tgttccaaga cttggagcca 661 aaggtagcag ttagcatcag ctaacggtac attgaaagaa tggcagataa gacgaaccac 721 taccaaacac tcaaagttaa tccgaacgca agccaagcgg agattaagca agcttatcgt 781 cgcttggtca agatgtttca tcctgatagc aatcagggca cagcagatcc tgagcaaatt 841 atccgtatca atgcagcata tgaggtctta ggcgacgttc aaaatcgcca aaattacgac 901 caaaaactgc gtcaacgctc tcaagaaacc actaccaata ggcaacagcg tacagcagct 961 gctcaacagc aataccaggc gacaaaaaaa agaggaaaag atgcggatga acaagtagag 1021 gaatggctgc agcgagttta tcagccagtg aatcgcctgg tatttagcat tctggcttct 1081 ctacgggagc aaatcgagaa gctagctgct gacccctttg atgatatcct cttggaagaa 1141 tttcaggatt acttacacgc ctgtcaagct gacttcaagc aagcgcacct gacttttcgt 1201 tctttaccaa atccacctag tttagcaaga gcagcagccc atctctacta ttgtctcaat 1261 caaataggcg atgggctaga ccagttagcg tatttccctc ttaactatga cgagagttac 1321 ttgcacgctg ggcaagagat gtttcgcact gccacgaata tgtatcgtga ggcacaagaa 1381 tctctaggag cctatgtgta gtggttaatg gttagtgatc ggatttaccc actgtacaaa 1441 tgcacccact tacggcgttt tttatttgca tcttttattt gcatcaatta taaaccatat 1501 
ggtagaggaa gacaatgtta tacgccctac atctcgcttg tgttctaccc attggcagat 1561 ttgtgtgagt gatttggcga aagttgattg gctttgagga ttgctgccat taaaaatata 1621 taggctagag agccgatcaa agctaagagt acgttaccga tgagaaaatc tagcttatgc 1681 tccttttgca actcagagat aatcgctcca atggtatgta gagctgctga agtcaggtag 1741 ccaattccta taaaccgcca aagtttttta ggtttcattg cacccaatcc aatgaagtag 1801 ccaaagtagc cactatatgc cacctgtcca gaaatatctc ctaagatcaa gggaattaaa 1861 agtgttaatc ccgcaaaact acctttgctt tgagtttctt catgtacttt catcatactc 1921 tctactaatg caaatccagt agcagatgct gtcgctaaca gaattccgtc cataggttcc 1981 cataccccaa tgagttcgcg cttgggtgat tggaggagtc tgcctataaa gtaaactaat 2041 atcactggca gcactttaaa caattcttct aacaagcctg cttttaaaaa ttctttaata 2101 gttatagcaa gaatgttctc gttatgttta aataaatctc ctggtaaaat gtcagaaaaa 2161 atagctgtca attcatgatg ggcagttaaa agcggtaaac cagtagccaa acctaagctc 2221 actagcaacc accagggctt atgtttatgg cagagtttgt gaataacata gtggcttgca 2281 agagctaggt aagctgcgaa gatgtagagg aagttattgc tattttgagt agccaacatt 2341 gctactacga atatcactgt gacaatgccc ggaacaagga aacctttctg atgtaaatct 2401 agctttcttg agacgacagg aaatatatcg ctgacgccaa caaagctaat ctgttcacac 2461 tcaaaaacga attctggacc cttgcgacca agtttgatct ggtcatgtac ttgcaaaatt 2521 tgagaacctt gtaagcgctg accattaata taaataccat tacggctttc caaatcggat 2581 atttcccatt gaggcagatc actgggagat ccagacgaga ggggagaaat ttttaaatgc 2641 tgacgggaca cttcctggta tttcgtggaa tctaaaacaa tgtcacaaca actaggatct 2701 cgtccaatta ctattggttt attagataaa actatgtcct cagtgtgttg agagtcgctt 2761 gctgtattat ggttgacttt tcttaaaaag cttttgtaac ggtgaagtcc tgtcatcaaa 2821 ttactcctaa acttgtgttt taatcaataa aaatgctacg aaaaccctgt tttgtgtttg 2881 agagcctcag aaagatttgc aagcactata tactttgaga acttaggcat taacaaaagt 2941 ttgattttca cagaacttaa caatgaagaa gcttttcttc tatattctta ataggtgtca 3001 gcaaaatcat tttatgcaga agtcacgtta ggctcacaaa ctagaagtct gccgtagata 3061 aacaaactgc aaagcgccca aactggatca atccaagccg ggcgtgttgc agtaagagta 3121 aatgtcgtct aggttgtgcc aaacctgccg acaaagataa cctgtctgaa tcgataagtg 3181 ggtgcatttg 
tactgtgctt aagtccccag tgaatccagg aatattgacg aacaaccaac 3241 tattaacaac ttacaccaga attagtttat gctggaagca gtaaaattta agatatttta 3301 aattttactc ccagcgtgct catgcaatgt attataagtc gtcgggctca gttttcagca 3361 agtcatcgct attggttgcc agaactgagt gaagcagaaa acattaagaa atttggtgct 3421 tgttcaagat ttcccggata tggacacaat tatgtcttat ttgtctccct cgttggggaa 3481 ttaaatgaat atggcatgat tctcaacttg tccaacgtga aacacgttat taaccgaaaa 3541 gtgacgagta tactggactt ttcctatctc aacgatgtat ggtcagagtt tcaacaaact 3601 ctacccacca cagagaacat tgcacgggtc atttggcaac ggcttgcacc cgatttacct 3661 ttagtgcgta tacagctttt tgaacatccc gaactttggg ctgattattt aggaaacggc 3721 atggaagcat atctcactgt tagcacttat tttagcgccg cccatcagct agctcatcct 3781 cgtcttagca gcgaagaaaa cttcaagatt tacggtaagt gcgctaatcc caacggtcat 3841 gggcacaact atcatttaga agtgactgta aaaggggaga ttgatccacg cactgggatg 3901 attattgatt tgggcgcttt gaatctagct ataaaagagt acgtcattga gctatttgac 3961 cacagcttct taaacaaaga cgttccatac tttgctgaag ttgttcccac tgctgagaat 4021 attgcacttt gtattagtca gcagctgact tcgcccatcc aaaaattagg agcgacacta 4081 tacaaagtaa aattaatgga aactccgaat aactcctgcg aaatttattg tactgagtca 4141 gattccaact cagttaacac tgcacagaat gaatcagtgt tagtacgcgt gtagcgagtc 4201 aagaactact cacaagcatt taaaaggggc gactatttac tttttaatag tcgccctttt 4261 aaagcttatt cgacagcaaa acccgatagc tgacgcattt tcggtgtatg aaacccaata 4321 acaatatctt gtttaggtat gcccatttcc atcaattctg aggctatatt aacttcagtt 4381 gtattttgct gaatccaaat tttatcattc ttgatatcaa gatgcattat cggaccataa 4441 acccgctttt gattgttcca tccaatgcta ataatttgat agtggtcacg ttcggtatca 4501 aagatttgtt cgacttcaac atccccgtag ctaggtttgt actgtacata tttggttagc 4561 agttgcttaa ctttcgtccg gtattcatct aatttatcca ttgcacaatt tcctccgtat 4621 caacgttgta aatgataagt ttcaattggt tttcgcttac tacggtttgg gtaaacggca 4681 aattaaagaa atcgttataa atcgcgagcg gaacagctaa attaatggtt atatttgggt 4741 cttcagtacg caacgcagtg cgatagttga taaattgccg cttcatcaaa tccggaaaaa 4801 ctgcggtgcc tcattacaag taacgggttc taagcctccg agttcttggc ggaaaaatcg 4861 ccccaattgt caactgctat 
acaaaaatga caacctttcg tggggttggg caaaccctcg 4921 acaaggatgg acaacccttg tatttttgcg gcgtat // LOCUS NODE_5584_length_4925_cov_4.3640664925 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4925) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4925) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4925 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 265..879 /locus_tag="DP116_27020" CDS 265..879 /locus_tag="DP116_27020" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131497.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DedA family protein" /protein_id="PRJNA477356:DP116_27020" /translation="MLEWITNTINSLGYWGIALLMFLENVFPPIPSELIMPLAGFTAR ATPEKLNVIGVFFAGLLGSVLGALVWYYPGKYLGERRLQVWADKYGKWLSISSKDIVK AKGWFDQQGGKAVCIGRLVPGVRTLISVPAGMSHMHLLPFLIYSTLGSAVWVGLLTYS GYALGSQYELVDKYLAPVSKIVAVVLILAFVVWVMRRKRKNRRQ" gene 1146..2291 /locus_tag="DP116_27025" CDS 1146..2291 /locus_tag="DP116_27025" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015119880.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chromosome segregation protein ParM" /protein_id="PRJNA477356:DP116_27025" /translation="MTDQPSAATPMNAAAIPMNRVSASTPINAAPPQPLAGIPGKKIL SIDLGRTSTKTCISRESNNVTFVSANVKEMSMEQVRGGVFEARATDPLMDLWLEYQGN GYAVGQLAADFGANLGVGQSKVESALIKVLACAGYFKLKDDISVVLGLPYLSQEQFEK EKAQIISQLNGPHVMNFRGESVSLNINKVWVMPEGYGSLLWCEAQPKKAAAMPDLTKV SVAIVDIGHQTTDCLMVDNFRFARGASKSEDFGMSKFYELVAAEIEGADSQSLSLIAA VNRAKGDRFYRPRGASKPTNLDDFLPNLIEMFSREICNRVLAWLPERVTDVILTGGGG EFFWEDVQRLLKEAKINAHLASPSRQANALGQYIYGEAQLSNARART" gene 2297..2815 /locus_tag="DP116_27030" CDS 2297..2815 /locus_tag="DP116_27030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216275.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="plasmid segregation centromere-binding protein ParR" /protein_id="PRJNA477356:DP116_27030" /translation="MFQWSKKVVKSVTFEPGVADESLLALVESHLEKDPQKTFSDLCK EALWQSLCVPESVRPAPKPAAATATATQGVEQQIAALQSQMADLEERFFAKESNRLEI MEQQLIQLSQQVAQLAIMLNNVSTSSPPTPQQVSTLEVINHTSSPTHAAATYSQEIDP VISRISQFLDDF" gene complement(2892..3404) /locus_tag="DP116_27035" CDS complement(2892..3404) /locus_tag="DP116_27035" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009947435.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27035" /translation="MAVAIVALVLCFTTVVGLPTSADASVSPSPQSVVIKDIFSKGVV KQTESDEYVEITNQKPAIIDLSGWRLNSEDGRQNFYFPKGTLLTPGKSLRVYTNEIHP ETGGFSFGIKRAVWNNKGSVGLLYDAQGNLVDSFSYGNKKTEITAVSKQPQTLQSTPP KSLTQNNKNS" gene 3552..4511 /gene="pip" /locus_tag="DP116_27040" CDS 3552..4511 /gene="pip" /locus_tag="DP116_27040" /EC_number="3.4.11.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317691.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prolyl aminopeptidase" /protein_id="PRJNA477356:DP116_27040" /translation="MERTRKLYPPIEPYKKGNLKVSDLHTIHFEESGNPQGKPIVLLH GGPGGGCPPFYRQYFNPEKWRLVMFDQRGCGQSTPHAELRENTTWDLVSDIEKLREHL GIEKWVVFGGSWGSTLALAYSQTHPSRCTGLILRGIFMLRRKELRWFYQEGASYIFPD AWEEYLKPIEPAERNDMLTAYHKRLTSPDSFTRLEAARAWSIWEASTSRLFLDTELMQ KFSANEFAEAFARIECHYFIHQGFFETENQLLLNVERIRHIPAVIVQGRYDVVCPMIS AWELHRAWEEAEFVVVPDAGHSMSEPGILSALISATDKFANFQ" gene 4647..>4925 /locus_tag="DP116_27045" CDS 4647..>4925 /locus_tag="DP116_27045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459833.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phosphoglucomutase/phosphomannomutase family protein" /protein_id="PRJNA477356:DP116_27045" /translation="MSVAANVVKFGTDGWRGVIGDEFTFERVALVAPLAAKVLLDTYG KTTGNRTIIVGHDRRFMAEDFAQKVADVVCAAGFDVLLTDTYAPTPAFS" BASE COUNT 1320 a 1062 c 1120 g 1423 t ORIGIN 1 acaacggggg gaacccccgc aacgcactgg ctcctctgtc gggaaagccc ttcgggtatg 61 cctgcggcac gcggcttggg cgaacgcagt cgcctctgtc gggagaccct cctgcagcgc 121 tgtctcaccg tcattcgcgc tgtctcaccg ctccgcgtct acgtttttcc gttactcctg 181 cgtaagtcct atagacaaag caagttttat ttagtaacga caaatgacta caaaacttcc 241 ctcacaactt caggaataca cttgatgctt gaatggataa ccaatactat taactctctg 301 ggctactggg gaattgctct actgatgttt ctcgaaaatg tctttcctcc aattccgtca 361 gaacttatta tgccattggc gggatttaca gcaagggcaa ccccagaaaa gcttaatgtc 421 atcggcgtat tttttgcagg actgctgggt tctgtcctgg gcgcactcgt ctggtactat 481 cccggcaaat atttgggaga aagacgcttg caagtttggg ctgacaaata cggtaagtgg 541 ttgagtatat ctagcaagga tattgtcaag gcaaagggct ggtttgacca gcaaggcggc 601 aaagcagttt gtattggtcg ccttgtaccg ggagtccgca ctttgatttc cgtacccgct 661 ggcatgagtc atatgcacct tctaccattt ttaatctact caactttagg tagcgctgta 721 tgggtcggtt tgctaacata ctcaggatat gctttgggta gtcagtatga actcgtggac 781 aagtatctgg ctcctgtttc taaaattgtg gctgtagttc tgatcctggc atttgtcgtt 841 tgggtgatga ggcgtaagcg aaaaaacagg agacagtaaa aattattgtt tctaattagc 901 gttggaaaaa taagtattct ctgtgtaaaa gcaagcgtta gcaattctct aactctgctt 961 gttagctttg tgattctatg gtatcaagtg aaacttacac gctatctatg tttatcgctc 1021 aacaattaag tggttctttg tatattgctt gctacagttg acgcaacaga tgttttctgg 1081 tacacccatt tttgtaagat gagcatgata acttgtgata gctaagttca gtattaggag 1141 cttttatgac agaccaacct tccgccgcca ctcccatgaa cgccgctgct attccaatga 1201 atagagtttc ggcatctacc ccgataaatg ctgctccccc tcaaccactt gctggtattc 1261 cggggaaaaa gattctcagt attgatttag gtagaacttc aacaaagact tgtattagcc 1321 gcgagtctaa taatgtgacg ttcgtctccg ctaacgtgaa agagatgtca atggaacaag 1381 tgcgtggagg tgtttttgaa gcccgcgcga ctgacccatt gatggatctg tggctggagt 1441 atcaaggcaa 
tggatacgct gttggtcaat tggcagcaga ttttggggct aacctaggag 1501 tgggtcaatc taaggtcgaa tccgcactga tcaaagtctt ggcttgcgct ggctatttta 1561 agctcaaaga tgatatctct gtggtgctgg gtctgcctta cctttctcaa gaacaatttg 1621 agaaggaaaa agcacagatc atcagccaac tcaatggtcc tcatgtgatg aactttcgcg 1681 gagaatccgt atcgttgaat atcaacaagg tttgggtcat gccagaaggc tatggcagct 1741 tactatggtg tgaggctcaa ccaaagaaag ctgcagcaat gcctgatctt accaaagtct 1801 ccgtggcaat tgttgatatt ggacatcaaa caaccgattg tttgatggtt gataatttcc 1861 gttttgcccg aggtgcgtca aagagtgaag acttcggtat gagcaagttt tatgaactgg 1921 ttgctgccga aattgagggc gcagatagcc aatctctatc tttaattgct gcagtcaacc 1981 gagccaaggg cgatcgcttc taccgtccca gaggtgccag caagcctacc aatttagatg 2041 actttctccc caatctgata gaaatgtttt ctcgcgaaat ctgtaaccgc gtgctagcat 2101 ggctaccaga gcgcgtcacc gatgtaattc tcaccggagg aggtggagaa ttcttctggg 2161 aagatgttca acgcttgctc aaagaagcaa aaattaacgc ccatttagct tcaccgtctc 2221 ggcaagcaaa tgctttaggg caatatattt atggagaggc acagttatcc aacgctcgcg 2281 ctagaactta acactgatgt tccaatggtc aaaaaaagta gtgaaatccg ttacgtttga 2341 gccaggggtg gctgacgaaa gcttgttagc tcttgtggaa agtcatttgg agaaagatcc 2401 ccaaaagacc ttcagcgacc tttgtaaaga agccttatgg caatctttgt gcgtaccgga 2461 atctgtacga ccagccccga aaccagcagc agcaacagca acagcaacac aaggggtgga 2521 acaacaaatt gctgcactgc aaagtcagat ggctgacctt gaggaacgtt tttttgctaa 2581 ggaatctaat cggttggaaa ttatggaaca gcaactgata cagctgagcc aacaagtagc 2641 tcagttggct attatgctga ataatgtttc aaccagttcc cctccaaccc ctcaacaggt 2701 ctcaacctta gaagttatca atcatacttc tagtcctact catgctgctg ccacttattc 2761 tcaagaaatt gatccagtta tcagccgcat tagtcaattt ctggatgatt tctaaggaaa 2821 tttcagggtg tgaatgtatt tgactgctag aaaacctcct taatgagaat tctcttaagg 2881 aggttttcta tttatgaatt tttattgttt tgtgttaagg acttaggagg cgtactctgt 2941 agtgtttgtg gctgctttga gacggcagtt atttcggttt tcttattacc atagctaaag 3001 ctatccacta gattcccttg tgcatcatac agtagcccca cactcccttt attgttccaa 3061 actgctcgct tgatgccaaa actgaaaccc ccggtttctg ggtgtatctc attggtatac 3121 actcgcaaac tttttcctgg 
tgtcaatagg gttcctttgg ggaaatagaa gttttgccga 3181 ccatcctcac tattgagtct ccaccctgat agatctatta ttgctggttt ctgattggta 3241 atttcaacat attcatctga ttccgtctgt tttacgaccc ctttagagaa aatgtcctta 3301 ataactactg actgcggact agggctgacg ctagcatcag cgctcgtcgg caagcctact 3361 actgttgtga agcacaaaac caaagcaaca atcgcaacag ccatccgtga gaaaagtttt 3421 ttcataacat ccttgcttgc aaagccatcg ccttcaatcc ctctcttcag cataaatcat 3481 gggatcttgt atgaaaaatc atccaaaatt gaagtaattt aatatacaag tccctgaata 3541 taggcggtaa aatggaaaga actcgaaaac tttacccacc aatcgaacct tacaaaaaag 3601 gcaacttaaa ggtttctgac ctccacacca ttcattttga agaatcaggt aacccacagg 3661 gcaagccaat tgttttgcta catggaggac ctggaggtgg atgtccgccc ttttatcgac 3721 aatatttcaa tccagaaaaa tggcgtcttg taatgtttga ccagcgtggt tgtggtcaaa 3781 gtacgcctca tgcagaacta cgggagaata ccacttggga cttggttagt gatattgaaa 3841 agctgcgaga acatctgggt atagaaaagt gggttgtctt cggtgggagt tggggaagca 3901 ctctagcatt agcttacagt caaactcatc cctctcgatg cacaggactc attttacgtg 3961 gcatatttat gttaaggcgc aaagaattgc gatggttcta tcaagaggga gctagttata 4021 tttttcctga tgcttgggaa gaatatctca aaccaattga gcctgcggaa cgtaacgata 4081 tgctcacagc ttatcacaaa cgcttaacca gccctgattc attcactcgg ttggaagcag 4141 cccgtgcttg gtcaatttgg gaagcgagta caagtcgatt gtttctagac acagaactga 4201 tgcaaaagtt tagcgcaaat gagtttgcag aagcctttgc acgaattgag tgtcattatt 4261 ttattcatca aggatttttt gaaacagaaa atcaattact tttaaatgtt gagcgcatcc 4321 gccatattcc tgctgtgatt gttcagggac gttacgatgt tgtttgtccg atgatatcgg 4381 cttgggaatt acatcgtgct tgggaggaag cagaatttgt tgttgtccct gatgctggac 4441 attcaatgag tgaaccagga attctcagcg ctttgatttc ggcaacagat aagtttgcaa 4501 actttcaatg agtcaggaac acatttagtt atttgtgtgt catcaatagt atcataatta 4561 ctattacact ttttatacta aaagtcttgt taccgggtgc cctgtgtgaa tgttagaatc 4621 cactgttaga aagtagggat acctcaatgt cagtagcagc taacgtagtc aaatttggta 4681 cagacggctg gcggggcgtc attggcgatg agttcacctt tgagcgtgtc gccttggttg 4741 cgccacttgc cgcaaaagtt ttacttgata cgtacggaaa aactacaggt aaccgtacaa 4801 ttattgtggg acacgaccga cggtttatgg 
cagaagattt tgctcagaaa gtcgcggatg 4861 ttgtctgtgc tgctggattt gatgtattac tcacggatac ttatgcccca actccagctt 4921 ttagt // LOCUS NODE_5682_length_4786_cov_4.5901504786 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4786) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4786) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4786 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 133..883 /locus_tag="DP116_27050" /pseudo CDS 133..883 /locus_tag="DP116_27050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011563505.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="short-chain dehydrogenase" assembly_gap 693..702 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 948..1967 /locus_tag="DP116_27055" CDS 948..1967 /locus_tag="DP116_27055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006512397.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="catalase" /protein_id="PRJNA477356:DP116_27055" /translation="MTISKLSASEQETITDEVLAANLKTQQEKGPDLRQVHPKSHGLV WGEFIVEKNIPDALKVGVFCAPQTYPIWVRFSNGGEAEKRGQFKPDKEPDVRGLAIKL MNVEGEKVLDDEEKTQDFLSINHPVFFVRDVQGYVDLAKVASGQADPELAQAMQPSFA ILQKMTSKKISNPLFIQYWSTTPYKLGSQAIKFSIKSQQIETVPDSIPDSQNYLREAI VKYLTDGGKEAIFDFLIQLYVDEEKTSIENPMQEWKEQDSPFIKVATIRIPRQKFDFD ERKRLDEGLSFNPWHTLPEHEPIGSVNLARKKIYQELAKYRREQIEKRLREPQPYASV QDEPS" gene complement(2153..3754) /locus_tag="DP116_27060" CDS complement(2153..3754) /locus_tag="DP116_27060" /EC_number="2.3.3.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137916.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2-isopropylmalate synthase" /protein_id="PRJNA477356:DP116_27060" /translation="MNTTQERIIIFDTTLRDGEQCPGATLNIDEKLTIAKQLARLGVD VIEAGFAFASPGDFEAVTKIAQLVGTQDGPVICSLARARHEDIQTAAEAIKPAAKGRI HTFIATSDIHLKYKLKKTKSEVLAIAEEMVAYAKTFTNDVEFSPEDAGRSEPEFLYQV LERAIAAGATTVNIPDTVGYTTPTEFGALIKGIKENVPNIDQAIISVHGHNDLGLAVA NFLEAVKNGARQLECTINGIGERAGNAALEELVMGLHVRRQYFNPFFGRPSDCEESLT NIDTRQIYKTSRLVSNLTGMLVQPNKAIVGANAFAHESGIHQDGVLKNKLTYEIMDAQ LVGLTENQIVLGKHSGRNAFRTRLRELGYELTETELNKAFVRFKEVADKKKEISDWDL EAIVIDEIQQAPDLFRVELVQVSCGSNARPTATVTLRTPEGEELIDAAIGTGPVDAVY KAINRVVNVPNQLIEFSVQSVTAGIDAIGEVTIRLKHEGRVFSGHAANTDIIVASAQA YVNALNRLYASLQNKQQQVVAKNLS" gene complement(3838..4368) /locus_tag="DP116_27065" CDS complement(3838..4368) /locus_tag="DP116_27065" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114631.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NYN domain-containing protein" /protein_id="PRJNA477356:DP116_27065" /translation="MGFTMNRLSIFVDGNNMFYAQQKNGWFFDPRRVLEYFRHEQSDT TLINAFWYTGLKDLQDQRGFRDALISLGYTVRTKILKEYYDDSSGRYSQKANLDIEIV VDMFNTVDQYDRVVLFSGDGDFERAIELLRGKNTHITVVSTEGMIARELRNATDRYID LNDIRDRIEKVDSQLP" BASE COUNT 1393 a 1021 c 928 g 1434 t 10 others ORIGIN 1 gggtattgaa atcaataagc catctgtcgc tctatcagca tttttaggaa ctttgaattc 61 caacaatgtc agtctaattc ttcaaagtgg tcttctataa gagccacttt tctaaaatca 121 ggagagaact ctatgcaaag taacgacaaa aaaattgccc tagtcacggg cgcaaataaa 181 gggcttggtt ttgaaatcag ccgccaacta gccaaacaag aaattacggt actgataggt 241 gcccgtgatg aatctaaagg tgctgaagct gcagagaaat tacaggccca agggcttgat 301 gtgcaatcca tcaaacttga tgtcacagat atctcctcta tcgccactgc tgccaaaaat 361 attgaggagc atttcggtaa gctcgatatt ttggtcaaca atgcaggcat tttccttgac 421 tcaggaacga atccgagcga ccttgacctc aatattctaa aacagacatt tgagacaaat 481 gtattcggtg cttttgcagt attgcaagca acgctacctc tgattcgtaa gagtaatgct 541 ggtcgcatcg tgaatatgtc gagtactctt ggatccctga ctgacaccct tgatccgaac 
601 tcacactatt tcggattgcg agggctagct tatcaagcga ctaaagcggc gctcaatgaa 661 atcaccgctc agttttctaa agaattggcc tgnnnnnnnn nnggctgata caccaattaa 721 ggtcaattct gcttgccccg gttgggtgca gacagacatg ggtagtgcag atgcaccagg 781 aacagtagaa gaaggtgctg acactcctgt gtggcttgct acacttccag aagatggtcc 841 aacaggtggt ttctttaatt ctcgcaaacc aattccttgg taatttctta cataggtggg 901 tactgcccat ctttttcatt cattgcataa attgaaagga gaaaagtatg actatttcaa 961 aattatctgc ttcagagcaa gagacaatca cggatgaagt cttagctgct aacctaaaaa 1021 ctcagcaaga aaagggacca gatttacgcc aagtacaccc gaaaagccac ggtttggttt 1081 ggggagaatt tatagttgag aagaacattc ctgatgcact aaaagttggt gtcttctgtg 1141 cacctcaaac ctatcccatc tgggttcgtt tttccaatgg tggagaagca gaaaaacgcg 1201 gtcaattcaa accagataaa gaacctgatg ttcgcggctt ggcaataaaa ttgatgaacg 1261 ttgagggcga aaaagttctt gatgatgaag agaaaaccca ggattttctg agcataaatc 1321 atccagtctt tttcgttcga gatgtgcaag gatatgttga tcttgccaaa gtagcaagtg 1381 gacaagctga tccagaatta gcacaagcca tgcagccttc ttttgcaata ctccagaaaa 1441 tgactagtaa aaaaatcagt aatccgcttt ttatccaata ttggagtaca acaccttaca 1501 aattaggctc ccaagctatc aaattctcca tcaaatctca acaaatagaa acagttcccg 1561 actcaatacc cgactcacaa aattatctac gtgaagcaat tgtcaagtat ctgactgatg 1621 gagggaaaga agccattttt gattttctca ttcagcttta tgtagatgaa gagaaaactt 1681 caatcgaaaa tccaatgcag gagtggaaag aacaagattc acctttcatt aaggtggcta 1741 ctatcaggat tcctagacaa aaatttgact ttgatgagcg aaagcgattg gatgagggtt 1801 tgtctttcaa tccttggcat actttacctg aacatgaacc aattggtagc gtgaatttag 1861 ctcggaaaaa aatttaccaa gaacttgcta aatatcgacg cgagcaaatc gaaaagcgtt 1921 tgagagaacc gcaaccctac gcttctgttc aggatgaacc tagttaaata gacttcttgc 1981 aaaagcaatc ataactcaga tcttgcacta ttgttattta gctccgactg aggagacaga 2041 acctcccaac cctcgttacc aggctgaagg aggcagagag agcctcccaa ccctcgttac 2101 caggctcagc ctggtaacga ggttcatccc ctacacccaa gcaaccccta gttcatgaca 2161 aattctttgc aacgacttgc tgttgtttat tttgcaaaga tgcatacagc ctattcaatg 2221 catttacgta agcttgagcg gatgctacga tgatatctgt gtttgccgca tgaccagaaa 2281 acactctacc 
ttcatgcttg agacggatag tgacttcacc aatagcatca atacccgctg 2341 tgactgactg tacagaaaac tcaattaact ggttcggtac gttgacaaca cgattgatgg 2401 ctttgtaaac tgcatctact ggtcctgtac caattgcagc atcgattaat tcttcgcctt 2461 ctggagtccg gagtgtgact gttgcagtgg gacgagcatt actaccacag gaaacttgca 2521 ctaattctac acggaacaaa tcgggtgctt gttggatttc atcaataaca attgcttcta 2581 aatcccaatc agaaatttct ttctttttgt ctgcgacttc tttgaatctg acaaatgctt 2641 tatttaattc agtttctgtg agttcatatc ccagttctct caaacgtgtc cggaaagcat 2701 ttctccctga atgtttgccc aagacaattt gattttctgt caagccaacg agttgggcat 2761 ccataatttc ataggtgagc ttatttttta gcacaccatc ttgatgaatt ccagactcgt 2821 gagcaaaagc attcgcgccc acaattgctt tattcggttg aacgagcatt cctgtcaagt 2881 tagaaacgag acgggatgtt ttgtaaattt gtctggtgtc aatatttgtt aaagattctt 2941 cgcaatcaga tggacgtcca aagaaggggt tgaaatattg ccgacggacg tgcaaaccca 3001 tcaccaattc ttctaacgct gcatttcctg cacgttcacc aataccattg atagtacatt 3061 ctaactgtct ggcaccattt ttcacagctt ctaagaagtt ggcgacggcc aaacctaaat 3121 cattgtgtcc atgaacagaa ataattgctt gatcaatgtt aggaacgttt tctttaatac 3181 ctttaatcag cgcgccaaat tcagtgggtg tggtgtaacc tactgtgtcc ggaatattaa 3241 ctgttgttgc tccggcggcg atcgctcgct ctaaaacttg ataaagaaac tctggctcag 3301 accgtcctgc atcttcagga gaaaattcta catcattagt gaatgttttg gcataagcca 3361 ccatttcttc agcaattgcc agaacttctg attttgtctt tttcagtttg tacttgaggt 3421 gaatatcgct tgttgcaata aatgtatgaa ttctcccttt tgcggctggc ttgattgctt 3481 ctgccgcagt ttgaatatct tcgtgtcgcg ctcttgctaa actacaaatc acaggaccat 3541 cttgtgtccc cacaagttgg gcaatcttag tcacagcctc aaaatctcca ggactggcaa 3601 aggcaaaccc tgcttcaatc acatccacac ccagacgtgc cagttgctta gcaatagtca 3661 gtttttcatc tatgttcagt gttgctcccg gacactgttc accgtctcga agtgtcgtgt 3721 caaagatgat gattctctct tgtgtcgtgt tcatttgcgg cttgttgttg acttcttagt 3781 tgtgacttaa aatacctttt accttagttt ctttgctatt ccttttgtaa ataattgcta 3841 aggtaactga gaatctactt tttctattct gtcccgaata tcatttaaat ctatatatct 3901 atcagtagcg ttacgtagtt ctctggctat cattccttct gttgatacta ctgtaatgtg 3961 cgtatttttt ccgcgtaata 
gttctattgc tctttcaaaa tctccatccc cactaaataa 4021 aacaactcgg tcgtactgat ctactgtatt aaacatatct acaacaattt ctatatctaa 4081 attagctttt tgtgagtaac gaccagatga atcatcataa tattctttta aaatttttgt 4141 tctaactgta taacctaaac taatgagagc atctctaaat cctcgttgat cttgtaggtc 4201 ttttaagcca gtgtaccaaa aagcattgat taatgttgtg tctgattgtt cgtgtctgaa 4261 gtattctaag actcttcgtg ggtcaaaaaa ccagccattt ttctgttgag catagaacat 4321 attgtttccg tctacaaaaa tagacagacg attcatcgta aaacccatag taaagttaca 4381 cctaataata tataagcaat ttagatttaa caatagatga tagcaggaat tagaaaatat 4441 taattctggt ggtatctata aattatatat tgtcatatat agcttttatc taacctctcg 4501 catatcagaa aatcgactga agcattttag tagtcgctgt cctccagttt agtaatgtta 4561 acttttttct ctcttggctg gtaagcgtgc ccctataatc aaaatctacc aataaaatta 4621 acgtacaagc attctttgct ctataccagt caagtttact cctaaagagg tatcgaaaag 4681 tttaagaatt aggtcgcgaa cctcggagca gactggctgt atcctccaga catgctttgt 4741 gctggcgtag cctttcaaga aacacagggg tgtaggggtg tagggg // LOCUS NODE_5691_length_4774_cov_5.5941944774 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4774) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4774) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4774 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 277..927 /locus_tag="DP116_27070" CDS 277..927 /locus_tag="DP116_27070" /EC_number="2.4.2.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009455437.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="uracil phosphoribosyltransferase" /protein_id="PRJNA477356:DP116_27070" /translation="MTLQLRVYVPPHPLIKHWLAVARDAATPSVLFRSAMTELGRWLT YEAAREWLPTQEATVQTPLDSSPATLIDPTVPMAVVPILRAGLGLLEGAQTVLPLASI YHLGLVRDEKTLQPSCYLNKLPEKFPPQTRVLVTDPMLATGGSIMNAMAELTQRGVDP ALTRIICVVAAPPALQKLNDAFPGLIVYTATIDETVNNQGFIIPGLGDAGDRIFGT" gene 1004..1363 /locus_tag="DP116_27075" CDS 1004..1363 /locus_tag="DP116_27075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318546.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27075" /translation="MSQQDNFASGFFAGAIFGGVVGGIIGTLVATRRDPEEIVEEEPQ TTTNSTEAKKAARRRQMRASENENMEMEVARRSLEDKIAQLNATIDEVRQQLGNVNGN SEQLIHERSSSRSEELR" gene 1517..1789 /locus_tag="DP116_27080" CDS 1517..1789 /locus_tag="DP116_27080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872802.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YggT family protein" /protein_id="PRJNA477356:DP116_27080" /translation="MNLLITTLSTFIQIYSTLLIIRVLLTWFPSINWYNQPFAALSQI TDPYLNLFRSIIPPLGGIDFSPILAFLLLNLLSSLLSSLSTIPLVG" gene complement(1918..2334) /locus_tag="DP116_27085" CDS complement(1918..2334) /locus_tag="DP116_27085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016879217.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27085" /translation="MKVILISCIIAVVSWVFTPSAVALTQIKLFDISYHKCPSEIGKG SVTSGTTMAANCFIVTGKAENGTYKTVYDADIFGRIYDANNNSVMENRNRLGSIQEIP PGVSDFELRISVAANQPEPLKLKQFKAAGFSAQVRR" gene 2619..3236 /locus_tag="DP116_27090" CDS 2619..3236 /locus_tag="DP116_27090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318543.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27090" /translation="MSPDEMKAVLQAAFDRCDIASCPLCDTQKEILLQVLEQIKGSHT GVSDIANPLDELTKEELQAFLEFVKAQEEQNRSWKAQLLNDWLNENDSGGVQFIRDRY GLLWLNRIEPYHFDKYSTEDVLKLKLGDRIEICNAVWEWVQDNGPCTPEWYSCIVIKV DTIGDGDSSSTNGIVRFYNGAEFEIQGMYEWNRVYWRRGSEGVRP" gene complement(3261..3515) /locus_tag="DP116_27095" CDS complement(3261..3515) /locus_tag="DP116_27095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198566.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27095" /translation="MLSHAQNLADNWEFTELWIDPTASPPYVLILLSDSSGKSCIYDP SQNYQLVFNSNTYLEAKLWLLEDEYERVEGRLRASVSATP" gene complement(3487..3816) /locus_tag="DP116_27100" /pseudo CDS complement(3487..3816) /locus_tag="DP116_27100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198565.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(3830..4702) /locus_tag="DP116_27105" CDS complement(3830..4702) /locus_tag="DP116_27105" /EC_number="1.1.1.25" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318542.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="shikimate dehydrogenase" /protein_id="PRJNA477356:DP116_27105" /translation="MITGKTKLLGVIGYPVEHSLSPVMHNAALTQLGLNYVYVPFPIE PQNLEVAVQGFAAIDVVGFNITIPHKQAILPLLCEISPIAKAVGAVNTVSRKDNKWIG TNTDVEGFLAPLRTMYNNHRLSQRVAVILGNGGAARAVVAGCAELGFDKIFVVGRNLQ KLEEFQKSWDHFRESQNLHVHTWDSLPKLIPQANLLVNTTPIGMYPKVEESPLSVEEM GNLPAGAIAYDLIYTPNPTKFLRQAQQVGAIAIDGLEMLVQQGAAALKIWLQQDTVPV DVMRQALRQHLGLS" BASE COUNT 1385 a 1030 c 1016 g 1343 t ORIGIN 1 tccctgttcc ctgttccctg ttccctattc cctattccct attacaaaaa agagggaaaa 61 gcctgctacc gctggccaat cccgtctgcc gatccttaaa gagaggagaa cactaaaaat 121 acaatataac tttgattgag aatatatgtc aactactaat gataaaaagt atcaataaat 181 ttttgttgag ctaattatag cagtgagttg catacaagaa aaagtattat gatgtttcag 241 ttgttatacc atagagaaaa agccccaaaa ccggctatga cgctacaact gcgtgtttat 301 gtaccacccc atcccctgat caagcactgg ttagcagttg cccgtgatgc tgctacaccc 361 tccgtattat ttcgcagtgc gatgactgag ttgggacgtt ggttgactta cgaagcggct 421 agagagtggt tgccaactca agaggcaaca gtgcaaaccc ccttggatag cagcccagca 481 actttaatag atccaacggt tccaatggca gtggtcccga ttctgcgagc tgggctagga 541 ttgctagagg gagcacaaac agtgttgcct ttggcgtcga tttatcatct tggcttagtg 601 cgagatgaaa aaacactcca gccgtcttgt tacttaaaca aattgccaga aaaattccca 661 ccgcaaacgc gagtgttagt gaccgatcca atgttagcaa ctggagggtc aatcatgaat 721 gcaatggcag aattaacaca acggggtgtt gacccagccc taacgcggat tatctgtgtg 781 gtagcagcac cgccagcttt acaaaaactg aatgatgctt ttccaggttt aattgtttac 841 acagctacca ttgatgaaac ggttaacaat cagggattta tcataccagg attaggagat 901 gcaggcgatc gcatctttgg gacataagct agtatgcagc taagacagca tgaggtgcga 961 aaacaaccca acagatgcag cgtttcttca aggcagtaaa attatgagtc agcaagataa 1021 tttcgcaagt ggtttttttg ccggagcaat tttcggtggt gtggttggtg gtatcattgg 1081 tacgcttgtt gccacaagac gtgatccaga agaaatcgta gaggaagaac ctcaaacgac 1141 caccaattca acagaagcga aaaaagcagc cagaaggcgt cagatgagag cttcggaaaa 1201 tgagaatatg gaaatggaag tggcgcggcg atcgcttgaa gataaaatcg cccagctcaa 1261 tgccacaata gatgaagtgc gacagcagct agggaatgtg aatggcaatt 
cagagcagct 1321 cattcacgaa cgttcttcgt cgcgtagcga ggagttacgt taacgttctt agcattgaac 1381 tctagcaaat gccaactaca aaggcaagag tattcatcat ggaccagatg cagaatctgc 1441 aaccgccatt tgaattagcc tcatttaaaa tcgaagataa gacccatttg acacttagta 1501 gcaaatcaag caatccatga atttactgat taccacactg agcacattca tacagattta 1561 tagtactcta ctgattatta gagtcctctt gacatggttt cccagcatca actggtacaa 1621 tcaaccattt gctgcattga gccagataac cgatccttat ctaaatctgt tccgttccat 1681 cattccccca ttgggcggta ttgatttttc cccgatcttg gcgtttctct tactcaacct 1741 attaagcagt ttgctctctt ctctaagcac cattccatta gttggttaag tataaactcc 1801 acatatattg actttcgtag agacgttgca tgtgaggcag tgcgttgggc gggttccccg 1861 acttgaagca actgccgaac ccgaagggca acgtctctac atattcatat tttccaacta 1921 cctacgcact tgagcactga atccagccgc cttgaactgc ttcaacttca aaggttcagg 1981 ctggtttgca gccacagaaa tcctcaattc aaaatcactg acacctggtg gaatttcctg 2041 gatagaacca agacggttac ggttttccat cacagagtta ttgttagcat cataaatgcg 2101 cccaaaaata tccgcatcgt aaactgtttt gtacgtccca ttttcagctt tgcctgtgac 2161 aataaagcaa tttgctgcca tagttgtacc actcgtcaca gatccttttc ctatttccga 2221 tggacatttg tgataggaaa tgtcaaatag tttaatctgt gtcagcgcta cggcgcttgg 2281 cgtaaacacc cacgatacaa ctgcgatgat acaggaaatc aaaatcacct tcaagaatcc 2341 tgagtactga atcctgaatt ttgagttatt ccttgtaaat aaggacacag aatttaataa 2401 cgatagtagg ctatttttct tttgcatatc tccaataaca agaataacga aacaatgtac 2461 caaaaaaaga gctgcgacta acacaaacac aggcgaaaag ctgtttagta gtattttcaa 2521 cctcagctta gccgaaatta gctcttaaga tattagctag ttctttttaa cttttgtaat 2581 aattatgaaa tatgaaactc tcaattgcta tatagcttat gagtccagat gagatgaaag 2641 cagtcttgca agcagcgttt gatcgttgtg atatcgcaag ctgtcccctc tgtgatacgc 2701 agaaagagat attactgcaa gtcttagagc aaatcaaggg gtctcatact ggtgtgtctg 2761 atatcgctaa tcccctagac gaactgacaa aagaggaatt gcaagcattt ttagagttcg 2821 tcaaagcgca agaagaacaa aatcgctcat ggaaagcgca attactgaac gactggctga 2881 atgaaaatga ctctggggga gtacaattta ttcgcgatcg ctacggattg ctttggctga 2941 atcgtattga accatatcac tttgacaagt attctactga ggatgtcctc aagctgaaac 
3001 tgggtgatcg cattgaaatt tgcaacgctg tttgggaatg ggtacaagat aatggtcctt 3061 gcaccccaga atggtattcc tgtattgtca ttaaagttga tacaataggc gacggcgatt 3121 cttcctcaac caacggcatc gtccgcttct acaacggtgc tgagtttgaa attcaaggta 3181 tgtacgaatg gaatcgcgtt tattggagaa gagggagtga gggagttcgt ccttagcata 3241 aaaaggggtc ggaattttaa ttatggagtt gctgaaactg aagctcttag tcgtccttct 3301 actcgttcgt attcatcttc aagtagccaa agttttgcct caaggtaggt gttgctattg 3361 aacactaact gataattttg acttggatcg taaatacagc ttttaccaga actgtcgctc 3421 aataaaatca aaacgtaggg aggcgaggcg gttggatcta tccatagctc tgtaaactcc 3481 cagttatcag ctaggttttg tgcgtgggat aacatgatac ttgtttcctt ttactttcat 3541 ctgtgcgtaa taaagtggaa ttttactacc gtcaatactg agatgatatc caacaggttc 3601 gttaaagtat agtccatagc gttcatgtcc acgaatgact cctgtaaatt cccctttttc 3661 tatgattgct tgagtaacca tagacattgt ggcttcatca ttaaacacgt gagcgcttca 3721 ttctttttga agtagctgtt gcatttgggg agtatctggg agatgcttgg caacatgttt 3781 aatatcgaga gttagttctc gctcaattag actcattcaa aggtcgaact tacgataaac 3841 ccaaatgttg tcgcaaagct tggcgcatta cgtctacggg aacagtgtcc tgttgtaacc 3901 agatttttaa tgctgctgcg ccttgttgaa cgagcatttc cagtccatca atcgcgatag 3961 cgcccacctg ctgcgcttgt ctgagaaatt ttgttgggtt aggggtgtat atcaaatcgt 4021 aggcgatcgc ccccgctggc aaattcccca tttcctctac actcaaaggt gattcctcaa 4081 ccttgggata cataccaata ggggttgtat ttactagcag gtttgcttgt ggaatgagtt 4141 ttggcaatga atcccaagtg tggacgtgta aattctgcga ctcacggaaa tgatcccaac 4201 tcttctgaaa ttcttctaac ttttggaggt tgcgcccaac aacaaaaatc ttgtcaaaac 4261 ctaattcagc acaacctgca acaactgctc ttgctgcacc accattgcct aaaatcacag 4321 ctactctctg gctcaatcgg tgattattat acattgttcg caaaggagca agaaatcctt 4381 ccacatcagt gtttgtacca atccatttgt tgtccttgcg acttaccgta ttcactgctc 4441 ctacagcttt agcgatcggt gaaatttcgc aaagcaaagg caatattgct tgtttatgtg 4501 gaattgtaat gttaaaacca acaacatcga tagcagcaaa gccttggaca gcgacctcta 4561 aattctgtgg ttctatcgga aaagggacat agacataatt tagtcccaac tgcgtaagtg 4621 cagcattgtg catcactggt gagagtgaat gttctaccgg atacccaatc actccaagta 4681 
atttagtttt gcctgtaatc ataggtttgt catgagttga ttatagtggg tgagatgaaa 4741 accctgtggc tttagcccag ggacgccaca tgcc // LOCUS NODE_5749_length_4690_cov_4.4308524690 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4690) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4690) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4690 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..190) /locus_tag="DP116_27110" CDS complement(<1..190) /locus_tag="DP116_27110" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="uroporphyrinogen-III C-methyltransferase" /protein_id="PRJNA477356:DP116_27110" /translation="MQTNLALSSSSRPLTGKTIVVTRAAGQSSQFTQVLASFGANVIE MPTLEIGPPSSWEGLDNAI" gene complement(194..967) /gene="cobA" /locus_tag="DP116_27115" CDS complement(194..967) /gene="cobA" /locus_tag="DP116_27115" /EC_number="2.1.1.107" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208282.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uroporphyrinogen-III C-methyltransferase" /protein_id="PRJNA477356:DP116_27115" /translation="MAEQRGKVYIVGAGPGDIGYLTLKAYKLLSSAEVLVYDALVDAE LLQCVPSDCLKLNVGKRGGQPSTPQAEINELLVKYCQHRKQVIRLKSGDPFIFGRCSS EIEALKAAGCEFEVIPGISSAIAAPLLAGIPLTDPVMSRCFAVFTAHDPDALDWEALS RLETLVILMGGQHLAEILHRLVRQGRSRLTPIAIIRWAGTPQQTIWTGTLENILEQTS GVSLSPAVIVIGEVVGLRGYLQPEKISLENSSVADTWHT" gene complement(971..1438) /locus_tag="DP116_27120" CDS complement(971..1438) /locus_tag="DP116_27120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319039.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="PRJNA477356:DP116_27120" /translation="MFDNFQPISIIAITAVISSIALLGYFSWKTLITSNVFQKAINLY QQEDYKGAEAAFRQVIALNSTNDVVHLLLGDTLIQQGKIEEARQQFQEVIDRAPKKVD AYLRLSNVLMQQEKKEDAIATLQKARDLCQAQRQPEKAEQINRILQKMTKNNS" gene 1936..2829 /locus_tag="DP116_27125" CDS 1936..2829 /locus_tag="DP116_27125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358372.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="serine/threonine protein kinase" /protein_id="PRJNA477356:DP116_27125" /translation="MSHLIGVTLRNRYKIFQCLGGGGFGDTYLARDLDLPGQPLCVVK HLKPKDSSPAVLAIAKRLFETEAQTLYRLGNAHNQIPTLFAHFEENGEFYLVQEFVDG HDLKKEIILGSSKSEQVVFKLLKDILEVLAFVHQQNVIHRDIKPQNLMRRRKDGKIVL IDFGAVKEISALKVNPQGDSSLTVPIGTPGYVPIEQHYSRPQFSSDIYAVGMIGFGAL TGLHPKQLPRDANNGELYCALFRDRINVSPSFAQILDTMVRNDYRLGGSASSSGRAAL SVFPGITWEQGKCINKFKKNI" gene 2830..>4690 /locus_tag="DP116_27130" CDS 2830..>4690 /locus_tag="DP116_27130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27130" /translation="MNQQFHQSYKYQVGGSLPNDAPTYVQRQADVDFYTALKAGEFCY VLNSRQMGKSSLKIRTMRRLQAEGFACVDIDITSIGTAELTSEQWYAGIINGIINSLE LYEKFDLNTWWSTEGLLSPVNRLSKFFETVLLQLVSKNIVIFVDEIDSVLSLNFERDD FFALIRECYNNRATKPDYQRLTFALLGVATPSDLIQDKRRTPFNIGRAIQLTGFQLQE AKPLAGGLAVKTNNPKAAITAILDWTGGQPFLTQKLCQMILDADAAIPENRVAEWVEE LVQTRVIYNWESQDEPEHLKTIRDRILRGGGQLTGRLLGLYQEILQHEEIADDNSLEQ TQLRLSGLVVRRESKLRVSNRIYAAVFNQIWLQQALKNLRPYAQTLRAWVDSRYQDES RLLRGQALQDALAWAAGKSLSNEDYEFLSASQKLDKREIEIALEVKEEEGRILAQANQ TLTKAQQKAKRRIRIGGAILVLCLIGAVIAGIFATNAFKQAQEAQEGTQLERAGAAAL DQFEKQHQEIEALLVAMDAGQKLKQLVKNGRPLEKYPAASPILTLQTIVDNIHERAQL KGHQSTVTSAGFSPDGQRIVTASDDGTAKVWRVGGLDDLLARGCDWLQDYFVTH" BASE COUNT 1399 a 915 c 1043 g 1333 t ORIGIN 1 cgatcgcatt atccaaaccc tcccaactcg aaggcggacc aatttccaat gttggcattt 61 caatcacatt tgcaccaaat gaagcaagca cttgggtaaa ctggcttgat tgtccagcag 121 cgcgtgtgac tacaattgtt ttaccagtca gggggcgtga agaggaagaa agggcgaggt 181 tagtctgcat agtttaggtg tgccaagtgt cagctactga ggaattctct aatgatattt 241 tctcaggttg cagataacca cgtagcccca caacttcacc aatcacaatc accgctggcg 301 ataacgacac accggatgtt tgctcaagta tattttctaa agtgcctgtc cagattgttt 361 gttgtggagt tcctgcccat ctgataatgg ctatgggagt taagcgcgat cgtccttgtc 421 gcacgaggcg gtgtaaaatc 
tctgctagat gctgtcctcc cattaaaata accaatgtct 481 ctaaccgtga cagcgcctcc caatccaaag catctggatc atgagcagta aacacagcaa 541 aacatcgact cataactgga tcagtcagag gtattcccgc cagcaacggt gcggctattg 601 ctgaggagat tcctgggata acttcaaatt cacaaccagc tgctttcaac gcctcaattt 661 cagaactaca gcgaccaaaa ataaatggat caccagactt gaggcgaata acttgctttc 721 tgtgctgaca atactttacc agtaactcgt taatttcagc ttgcggtgta cttggttgac 781 caccccgttt ccccacattc agcttcaagc aatcagaggg tacacattgc aacagctcag 841 catctaccag agcatcgtaa accaaaacct cagcacttga tagcagcttg taagccttga 901 gcgttagata tcctatgtct ccaggtcccg cacctacaat gtagactttg cccctttgtt 961 cagccatgag ttacgaatta ttcttagtca ttttttgaag gatacgatta atttgttctg 1021 ctttttctgg ttgacgctgt gcttgacata aatctctcgc tttttgcaga gttgcgatcg 1081 catcttcttt tttttcttgt tgcattaaaa catttgacag acgcaaataa gcatcaactt 1141 ttttcggagc acggtcaatc acttcttgaa actgttgcct tgcttcttct attttacctt 1201 gctgtatcaa agtatctcct agcagcaagt gaacaacgtc attagtcgaa ttcagagcaa 1261 tgacttgacg aaaagctgct tcagcacctt tgtaatcttc ttgctgataa agattgattg 1321 ctttttgaaa aacatttgac gttatcaaag ttttccagga gaaataaccc aaaagtgcaa 1381 tactagaaat tacggcagtt atggcaataa tagatattgg ttgaaagttg tcaaacatag 1441 ttcatatata aaagagaatt tttgcaaaaa cctgtattta ggatttagaa ccacagatta 1501 acacagatta acacagatga attatctctg tgactcggca ccatcccaaa tttgaaaaaa 1561 tagaaagcag aagcacacaa actaactcca gtcccctact gttataattt ttactattaa 1621 ctaagtataa taagactttt tggaaaatta aatgtatcta ttttaacaag tatatacacg 1681 atgacagatt gagcaacaac atgatgattg ttacaaaaat cgaaataaaa tcagctactg 1741 tttgttttgg ctatcaactc gacgtaatta cttataatag ttaaaacaag ggctgtttac 1801 tcgaaagttt atactagagt attttcatct ggtgcgggta aaaatgaaag aaataaaaaa 1861 ttttataccg attttatctt tgtattgctt tctatctttt gatcagttat cagtccagaa 1921 ggtaggagat acgaaatgag tcatctaatc ggcgtaacac tccgtaatcg atacaaaata 1981 ttccagtgtc tggggggtgg gggttttggc gatacttact tggctcgtga tttagattta 2041 ccaggacaac ctttgtgtgt cgtcaaacat ctcaagccaa aggactcaag tcctgctgtt 2101 ttggcgatcg ccaaaaggtt attcgagaca 
gaagcacaaa ctctatatcg cttaggcaat 2161 gcacacaacc agattcccac attgttcgcc cactttgaag agaatgggga attttactta 2221 gttcaggaat tcgttgatgg gcatgacttg aaaaaagaaa ttattctggg cagttccaag 2281 agtgagcaag tagtctttaa acttttgaag gacatcttgg aagttttggc atttgttcat 2341 caacaaaacg tcatccatcg ggatattaag ccacagaatt taatgcggcg acgaaaagat 2401 ggaaaaattg tactcattga ctttggggca gttaaggaaa tcagtgcttt gaaagtcaac 2461 cctcaaggtg acagtagctt gacagtgcca attggtactc ctggctatgt gcctatcgaa 2521 caacactact ctcgtcccca gttcagtagc gatatctatg cagtgggaat gattggtttt 2581 ggtgcattga ctgggttaca ccctaaacag ttaccgcgag atgccaataa tggtgaattg 2641 tattgtgctt tgtttcggga caggattaat gttagcccca gttttgcaca aattttagat 2701 acgatggtgc gcaacgacta tcgtctagga ggctctgcct cctctagcgg cagagccgct 2761 ttgtctgtat tcccaggtat aacctgggaa caaggaaaat gtataaataa atttaaaaaa 2821 aacatctaaa tgaatcaaca atttcaccaa tcatataaat atcaagtagg gggcagtctg 2881 ccaaatgatg ctcccactta tgtgcagcga caggcagatg ttgatttcta tacagctttg 2941 aaggcaggag aattctgcta tgtcctcaac tcccggcaaa tgggcaagtc tagcctgaag 3001 atacgaacga tgcgaaggtt gcaggcggag ggttttgcct gtgttgatat tgatattacg 3061 agcattggca ctgcggaact gacttctgaa caatggtatg cggggataat taatggtatt 3121 attaatagtc tggaactcta tgaaaagttt gatttaaata cttggtggtc aactgaaggt 3181 ttgctgtctc cagtgaaccg attaagcaag ttttttgaaa ctgtgttatt gcagttggtg 3241 tctaaaaata tagtgatttt tgttgatgaa atcgacagtg ttttgagtct gaattttgag 3301 agagatgatt tttttgcttt gattcgtgaa tgctataaca atcgagcgac aaaaccagat 3361 tatcaacgac tcacttttgc gctgcttggg gttgccactc ccagtgattt aattcaagat 3421 aaacgacgca ctccttttaa tattggtaga gcgattcaac taactggttt tcagttacaa 3481 gaagctaaac cgttggcggg tggtttggcg gtgaaaacga ataatccaaa ggcagctata 3541 acagcaattt tggattggac tggggggcaa ccttttctga ctcaaaaact ttgccagatg 3601 attcttgatg ctgatgctgc tattccagaa aatagagttg ctgagtgggt agaggagttg 3661 gtgcaaacta gagtcattta taattgggaa tctcaggatg aacctgagca tttaaaaacg 3721 attcgtgacc gaattttacg aggtggtgga cagcttacag gtaggttgtt agggctgtat 3781 caagaaattc tacaacacga agaaattgca gacgataata 
gtttagaaca aacccaattg 3841 cggctttccg gattggtagt cagacgtgaa agcaagctga gagtttctaa ccgtatttat 3901 gctgctgttt ttaaccaaat ttggctgcaa caagcactga aaaacttgcg tccttatgca 3961 caaacactta gagcttgggt agattcacgt tatcaggatg agtcgcgttt gttgcggggt 4021 caagcgttgc aggatgcctt agcatgggcg gctggtaaaa gtttgagcaa tgaggattat 4081 gagtttttat ctgctagtca aaagttagat aaaagagaaa ttgaaattgc tttagaagtt 4141 aaagaagaag aaggacggat attagcacaa gcgaatcaga ctttaaccaa agcgcaacaa 4201 aaagctaagc ggcggattcg cattggcggt gcgatactag tgttatgctt aattggggca 4261 gttatcgctg ggatatttgc aactaatgcg tttaaacaag cgcaagaggc acaagaaggg 4321 acacagttag aacgagcagg ggctgcagct ttagaccagt ttgagaaaca acatcaagaa 4381 atagaagcat tactggtagc gatggatgct ggacaaaagt tgaaacaact ggtgaagaac 4441 ggacgacctt tagaaaaata tcctgctgct agccccatcc tcactttaca aacaattgtt 4501 gacaacatcc acgaacgggc acaactcaaa gggcatcaga gcacggtcac gagtgcagga 4561 tttagcccgg atggtcaacg cattgtcact gcatcagatg acggcactgc caaggtgtgg 4621 cgggttggcg ggttggatga tttgctggcg cggggttgcg actggctgca ggattatttt 4681 gtcacccatc // LOCUS NODE_5778_length_4644_cov_4.9986934644 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4644) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4644) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4644 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..2438) /locus_tag="DP116_27135" CDS complement(<1..2438) /locus_tag="DP116_27135" /inference="COORDINATES: protein motif:HMM:PF13493.4,HMM:TIGR00229" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27135" /translation="MLKVTRSRLQQYSVAGLSVGLALLLMLALDPWLAMTKTPFLLFF GAVIVSAWYGGLRAGLLSTFVSVLISNYFFLQPIYSLALDLPNTIRLALFALQGVLFS VLCEALRNAKRRAEVSLESLQAADVRYRRIVETAGEGIWLFDAQLHTEYVNPQLAQML GYNVEQMLQRPILDFMDQAAQVEAHQYIAQLKQGMQQRFDFRFQCRDGSNLWAMVSTT PILGEWGEFQGGLAMISNLSQRKQIEESLQVREEQLRLFVEHAPAAIAMLDNQMRYIL VSQRWLTDFQLQNQNIIGRSHYEVFPEIPDSWREIHALCLAGEVRKSEKELFPRADGS IDWVKWEIHPWRNRVDEIGGIIIFTEVITERVQAEEAVQAANNRITMILESITEAFVA FDREWRYTYVNQEAARLLQKPPEELLGKQLWKDVYPELIGKTFYQEAHRAICAPRSSE AIAQQVPIEFEEFCEVLHRWLEIRMYPSSEGLAVYFRDVTERKKGKEELQQSEAKFRR LFESNLVGVAFWNVEGFITNANDAYLRIVGYSRQEFDALGRIDWRSLTPPEYNDVDNR ALEEAFNTRVSSIFEKEYVQRDGTRVPVVLGIALMDDSQVDGVAFVLDISKRKQAEQE RDRLLQAEQKARATAEAANRMKDEFLAVLSHELRSPLNAMLGWLTIMRTENLDEATTA RALETIERNARAQAQLVEDLLDVSRIIQGKLRLNVRTVDLLPAIEAAIDTVNPAALAK NIRLQPVLDPDAGPVFGDSDRLQQIVWNLLSNAVKFTPKGGRVQIRLERVHSHVEIVV SDTGKGISPEFLPYV" gene complement(2628..2978) /locus_tag="DP116_27140" CDS complement(2628..2978) /locus_tag="DP116_27140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130761.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF952 domain-containing protein" /protein_id="PRJNA477356:DP116_27140" /translation="MSTILHITQREQWKQAKLLGTYQGDTLDSEGFIHCSTPTQIIKV ANTFFRNQKGLVLIFINSEKVQPEIRYEGVEEDELFPHIYGVLNIDAVFKVSDFEPGE DGLFLLTHEILTMK" gene complement(3045..4559) /locus_tag="DP116_27145" CDS complement(3045..4559) /locus_tag="DP116_27145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015140694.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27145" /translation="MIPRVRAPELPQNYPWLNVDKPLSLQELRGRVVILDFWTYCCIN CLHILPTLKYLEQKYKDSLTVIGVHSAKFDNEKETENIRQAILRYNIEHPVLVDSGFR VWQEYAVRAWPTLMIIDPQGYVISYVSGEGNKDVLDELIQKLVSQHQEKGTINLQELN LTLEKQRKPLITPLAFPGKVLATQAGLFIADSGHHRVVMSSFDGEIINLIGSGQSGLT DGSFSEAQFFAPQGMVFDKENQILYVADTENHALRRVDLKHQVVETIAGTGEQSRNIR PHSGIGRKTALNSPWDLQLCENSLFIAMAGPHQIWEMDLETNIVRTYAGMGAEACVDG VLTESTFAQPSGITTDGQELYIADSEISSIRGVGLVEPRQVRTVCGSGDLFGFGDVDG QGFDVRLQHCLGVEYAENQLWVADTYNHKIKLVNPHTGDCQTILGDGLVGLQDGEGKN TRFSEPSGLSVIGSDLFIADTNNHAIRRVDLDTLTVTTLNFPNLCAPDVCVPHE" BASE COUNT 1298 a 1112 c 919 g 1315 t ORIGIN 1 acataaggaa gaaactcagg gcttattccc ttaccagtat cagatacaac aatttccaca 61 tgagagtgga cgcgttccaa gcgaatctgg acccgtccac cttttggggt aaactttaca 121 gcattcgata gcagattcca gacaatttgc tgcaagcggt ctgaatcgcc aaaaactggt 181 ccagcatctg ggtctagcac tggttgaagt cggatgtttt ttgccagggc ggcaggattt 241 accgtatcaa ttgctgcctc aattgcgggt aatagatcca cagtacggac gttcaggcgt 301 aattttcctt ggataatgcg ggagacatcc agcaaatctt cgactagttg tgcttgtgct 361 ctagcattgc gttcaatggt ttccagcgcg cgggctgtgg ttgcttcatc aagattttcc 421 gtacgcatga tagtcagcca accaagcata gcattgaggg gagaacgcaa ctcatgagaa 481 agaacagcaa ggaactcatc cttcatgcga tttgctgctt ctgctgtcgc ccgcgctttt 541 tgctcagctt gtaaaaggcg atcgcgctct tgttctgctt gtttgcgttt gctaatgtcc 601 aacacaaatg cgactccatc aacttgggaa tcgtccatca aagctatacc taagacaact 661 ggtactcgtg taccgtctcg ttgaacatac tctttctcaa aaatactgga aactctggta 721 ttaaaggctt cctcaagcgc tcggttatcc acatcgttgt actctggtgg agtcagtgaa 781 cgccagtcaa ttctccctaa agcatcaaac tcctgacggc tgtagccaac gatacgcaaa 841 taagcatcgt tagcattggt tataaagcct tcgacattcc agaaagcaac tcccacaaga 901 ttagactcaa acaaacgtct aaacttcgct tcactttgtt ggagttcttc cttccccttt 961 ttgcgctcag tcacatcgcg aaagtaaact gcaagccctt cagaactggg atacatccta 1021 atttccagcc accggtgtag tacctcacag aactcctcaa attcaatggg aacttgttgg 1081 gcgatagctt cgctgctgcg cggagcgcag atcgcccgat gagcttcttg 
ataaaaagtt 1141 ttgccgatta attccgggta aacatctttc cataactgct tacccagcag ttcttcgggt 1201 ggtttttgca aaagtcgtgc ggcttcctga ttcacgtagg tatagcgcca ctcacgatca 1261 aaggcaacaa atgcttctgt tatgctttca agaatcatcg tgatgcggtt gttagccgct 1321 tgcacagctt cttctgcctg tacacgttca gtgatgacct ctgtaaaaat aatgatgcca 1381 ccaatttcat caactcggtt tcgccaagga tggatttccc atttcaccca atcgattgag 1441 ccatcagcac gggggaataa ttctttttca gatttccgaa cctccccagc caaacacaaa 1501 gcatgaattt ctcgccaaga gtcgggaatt tcgggaaaca cttcgtaatg agagcgacca 1561 ataatgtttt gattttgcaa ctgaaaatct gtcagccatc gttgactcac aagaatatat 1621 cgcatctggt tgtcaagcat ggcaatagct gctggcgcat gctcaacaaa caaccgcagc 1681 tgttcttctc tgacttgtag cgactcttca atctgcttgc gctgactcag gttgcttatc 1741 atcgcgagtc cgccttgaaa ttctccccac tcacccaaaa ttggagtggt tgaaaccatc 1801 gcccataaat tagagccatc ccgacactga aaccgaaagt cgaatcgctg ttgcatcccc 1861 tgcttcagtt gggctatgta ctggtgtgct tccacctgag ctgcttgatc cataaagtcc 1921 aaaatcggac gttgcagcat ttgctccacg ttatatccca acatctgcgc cagttgcgga 1981 ttcacatact ccgtatgcaa ctgagcatca aataaccaga taccttcacc cgccgtctcc 2041 acaatgcgac gataccgtac gtcagccgct tgtagacttt ctaagctcac ttcagcccgt 2101 cgcttggcat ttcgtagcgc ctcgcatagc acactaaaga ggactccctg aagtgcgaac 2161 agcgcaagcc ttatagtgtt gggtagatcg agcgccagcg aatagattgg ctgtagaaag 2221 aaatagttgc taatcagaac agacacgaag gtagacaata accccgctct caagcctcca 2281 taccatgcac tcactataac cgccccaaaa aatagcagaa aaggggtctt agtcatagca 2341 agccaaggat ctagcgccag cataagcaat agtgccagac cgacactaag tccagcaaca 2401 ctgtattgct gtagccgaga gcgagtaact tttagcattc aattgtagat agttgtcttt 2461 agtagctagc tcaaacttcg cttaatttac tgtagatttc ggcttttgac atttacctta 2521 agagtgattt gttgaatgtc ttgtgttctg ctcactccat ttcattaagc taccagcaga 2581 ttcccagagc tactttttag acgttttcga gtgcgactat tgagctacta tttcatagtt 2641 aatatttcat gagtcaatag aaacaagcca tcttctccag gttcaaagtc gctcaccttg 2701 aacacagcat caatattcaa aacaccataa atatgaggaa agagttcgtc ttcctcaaca 2761 ccttcatagc gaatttccgg ctgaactttt tcagaattga tgaaaataag taccaatcct 
2821 ttttgattgc gaaaaaatgt atttgcgact ttaatgattt gtgttggggt tgaacagtgg 2881 ataaagcctt cagagtcaag ggtgtcacct tgatatgttc cgaggagttt ggcttgtttc 2941 cattgttctc gttgagtgat gtgtaggata gtgctcatta agattcaatt tgacattttc 3001 aacatagaga taagaagtat ttagcaacac atttaagttg tcatttactc atgtggaaca 3061 catacatctg gggcgcacaa attaggaaag tttaatgttg tcaccgttaa cgtatccaaa 3121 tctacgcgac ggatagcatg gttattcgta tcagcaataa acaaatcaga accgatcaca 3181 ctcagtcccg aaggttcaga aaaacgagta tttttacctt caccatcttg caaaccaacc 3241 aaaccatcac ctaagattgt ttgacaatcg ccagtgtggg ggttaacaag tttaattttg 3301 tggttataag tatcagctac ccataactga ttctcagcat attctactcc caaacagtgc 3361 tgtaacctaa catcgaaacc ttgtccatca acgtcaccaa aaccaaacaa atcaccacta 3421 ccgcaaacgg ttcttacttg tcgcggttca acgagtccca caccgcgaat tgagctaatt 3481 tcactgtcgg caatgtataa ttcttgtcca tcagtggtga taccgctagg ctgagcaaaa 3541 gtagattcag tcagtacgcc atctacacaa gcttctgctc ccataccagc ataggttctc 3601 acaatgttag tttctaagtc catttcccaa atttgatgag gaccagccat cgcaataaac 3661 aagctatttt cacatagctg taaatcccaa ggagaattta gcgctgtttt tcgtccaata 3721 ccagagtgag gacgaatatt acggctttgt tcacctgttc ctgcaatcgt ttctacgact 3781 tggtgtttta aatcaactcg ccgcagagca tgattttctg tatccgcaac atacaaaatc 3841 tgattttctt tatcaaatac cattccctgc ggtgcgaaaa actgagcttc actaaaagaa 3901 ccatcggtta atccagattg tccacttcca attaaattta taatttcccc atcaaagcta 3961 ctcatgacaa cacggtgatg tccagagtca gcaatgaaca aacccgcttg tgtcgccagg 4021 actttaccag gaaaagctag gggtgtgatt aatggttttc gttgtttttc taaagtcagg 4081 ttgagttctt gcaaattaat cgtgcctttt tcttggtgtt ggctaactaa cttttgaatg 4141 agttcatcta aaacatcttt attcccttca ccagacacgt aactaatcac gtaaccttgt 4201 ggatcaatta tcatcaaagt gggccaagcc cgcacagcat attcttgcca aactcgaaat 4261 ccactatcta ctaaaactgg atgttcgatg ttatagcgga ggatagcttg gcggatattc 4321 tctgtttcct tttcattgtc aaatttggca ctgtgtacgc caataactgt gaggctatct 4381 ttatactttt gttcgagata tttcagggtt ggcaagatat gcagacagtt gatacaacaa 4441 tatgtccaga aatctaaaat gacaactcta cccctgagtt cttgaagaga caaaggtttg 4501 
tcaacattca gccaaggata attttgtggt aattctggtg ctctaacacg cggaatcata 4561 aacaattcaa atttcaaaat tcaaaattta aaattcgtct tggaaagttt gctacggagg 4621 gaaaccctcc tggcaacttt ccgc // LOCUS NODE_5793_length_4620_cov_4.5780944620 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4620) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4620) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4620 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 171..1754 /locus_tag="DP116_27150" CDS 171..1754 /locus_tag="DP116_27150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011317909.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="B12-binding domain-containing radical SAM protein" /protein_id="PRJNA477356:DP116_27150" /translation="MKALLIYPQFPQSFWSYDRFMEIAGLKAVLPPLGMITVAALLPK DWEIRFYDRNVNLETEADWEWCDLVILSAMLVQKPDFHALIQKAVRLGKKVAVGGPYP TSIPQDALDSGAHYLVLDEGELTVPQFLEALKGKKEQGIFRSLEKPDVTQSPMPRFDL LQRDAYLMMAIQFSRGCPFNCEFCDIIVLYGRKPRTKEPHQTIAELQALYDLGWRGSL FIVDDNFIGNQRNVKRFLRELIPWMKQHDYPFTFITEASVNLAEDDELLQLMNEAGFY AVFLGIETPDQDSLQVTQKLQNTRNPLIEACRKINEAGMLIYAGFILGFDGERPGAGE RIQAFVEQTSIPQPMLGILQALPNTALWNRLQKEQRLVEGIGVTEVGDQNSLMNFVPT RPLAEIAREYAEGFWTLYEPRNYLRRCFQQCLSIGSLAKRKQTMQFSPGKGLRLVAQL IWHQGLRRPEIRGQFWQQLWTILLKKPQVLNMYLGLCAAGEHFWEYRALARERITQQL GYDPLRVSVLREQEPMLIK" gene 1841..3289 /locus_tag="DP116_27155" CDS 1841..3289 /locus_tag="DP116_27155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009459939.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="recombinase RecQ" /protein_id="PRJNA477356:DP116_27155" /translation="MNPSATTSWHEVRAAFKKIWGYEDFRPPQGEIIQSLLAKKDALI IMPTGGGKSICFQLPALLQTGLTLVVSPLVALMENQVQELRERNLPAALLHSELPSDK RRMTLQALERQQLRLLYLSPETLLSKPVWERLCQPELVINGLILDEAHCLVQWGDTFR PAYRRLGAVRPALLKSKPSGTNIALAAFTATADPLAQTQIREILQLQQPAVFRINPYR SNINPSIRIVWTPRGRKQQFLKFLSDKRQQTGLVYVRTRRDSEELAAWLLQLGYVTAG YHAGLGAEERREIEAQWLSGKMPFVICTCAFGMGINKSDVRWVVHFHPPLLISEYVQE IGRAGRDGKPAQALMLISEPTGWLDPQDKQRQKFFEENLQQQQLSAQQVLKKLPRTGE VNAVAREFHNGAIALSLLHSTGQLKWLDPFHYSIVPGAKTQSVTQSHAAKQMNQYLTT RDCRWRFLLTAFGFEEEIKNWRCGHCDNCCRK" gene 3465..4028 /locus_tag="DP116_27160" CDS 3465..4028 /locus_tag="DP116_27160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010467708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_27160" /translation="MNENQTSEPQHSIVSISVRVLSPADAESYRFVRLLALHEQPPAF GSLPEDEPNLSEIAARLAESDERCFFGAFQDNQLIGTVRISRYCAPNEKHRAHLGGLY VLPAFRRNGCGRSLVRQALNWAANARSIRRVNLTVVTQQKAAICLYQSLGFRIYGTEQ ETFSKAGRFYDEHLMTLELTSDNDRNA" gene complement(4156..>4620) /locus_tag="DP116_27165" CDS complement(4156..>4620) /locus_tag="DP116_27165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137905.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phenylalanine--tRNA ligase subunit alpha" /protein_id="PRJNA477356:DP116_27165" /translation="SVQIRYMETEEPPIRVVAPGRVYRRDNVDATHSAVFHQIELLAI DEGLTFTDLKGTIKVFLEAMFGELPIRFRASYFPFTEPSAEVDLQWNGRWLEVMGCGM VDPNVLKAVGYDPEVYTGFAAGFGVERFAMVLHQIDDIRRLYASDLRFLRQF" BASE COUNT 1299 a 1057 c 1065 g 1199 t ORIGIN 1 tctatatacc taatttgtag tcatttgact aacagcatca aatactactt gcaggcgtta 61 atttgcttgg ggcaacattc cagttcgacg agccgattgc gaatctctag gagatgcttt 121 cggctattga tgggtgttcg tggtactcac agttgtgcaa aaaggatttt atgaaagcac 181 tattgattta ccctcagttt ccccagtcct tttggtctta tgatcgcttc atggaaatcg 241 ccggactcaa agccgtcttg cctccactgg ggatgattac agtagcagcc cttctaccca 301 aggactggga aattagattt tacgatcgca acgttaatct tgaaacagag gctgattggg 361 agtggtgtga cctagtaatc ctttctgcaa tgctggtgca gaaaccagat ttccatgccc 421 taattcaaaa agcggtgcgg ttaggcaaaa aagtggcagt cgggggtcct taccccacct 481 caatcccgca agatgctctt gactctggag cgcattatct ggttttggat gaaggggagt 541 tgacagttcc acagtttcta gaagcgctca aaggcaaaaa agagcaagga atctttcgct 601 ccctggaaaa acctgatgtc acccaaagcc cgatgccgcg ttttgacctg ctgcaacggg 661 atgcctactt gatgatggct atccaatttt ctcgcggttg ccccttcaac tgcgagtttt 721 gcgacatcat tgtcctctac ggtcggaaac cacgcaccaa ggagcctcac cagaccatag 781 ccgagttaca agctctttat gatttaggct ggcgagggtc actcttcatc gttgatgaca 841 actttattgg aaatcagcgt aacgtcaaac gcttcttacg agaattaatt ccttggatga 901 agcagcacga 
ctaccccttc accttcataa ctgaagcttc tgtgaatttg gcagaagatg 961 atgaactgtt gcaattaatg aatgaagcag gcttctatgc agtttttctc ggcatagaaa 1021 ctcctgacca agacagcctg caagtaacac aaaaactgca aaatactcgc aatccgctca 1081 tcgaagcctg tcgcaagatc aatgaagcag ggatgctaat ctatgcaggg tttatccttg 1141 gttttgatgg agaacgccca ggagcaggag aacgaattca agcttttgtt gaacaaacca 1201 gtattcctca accgatgctg ggcatccttc aagctttgcc caatactgct ctatggaacc 1261 gtctgcaaaa agagcagcgt ttagtagagg gtattggcgt cactgaggtg ggagaccaga 1321 attccttgat gaattttgtc cccacccgcc ccttagctga aattgctagg gagtatgcag 1381 aaggcttctg gacgttgtat gaacccagaa actatctcag acgctgtttt cagcaatgtc 1441 tcagtattgg ctcgctagca aaacgaaagc aaaccatgca attttctcca ggtaaggggt 1501 tgcggctcgt tgctcagtta atctggcatc agggcttacg gcgacctgaa attcgtgggc 1561 agttctggca acaactatgg acgattctgc tgaaaaagcc tcaagttctc aatatgtatt 1621 tggggctatg cgctgctgga gaacattttt gggagtaccg cgctttagct agggaacgga 1681 ttactcaaca actagggtac gatccactga gagtctctgt gctaagagag caagaaccaa 1741 tgctcatcaa ataaaatcaa ataaaatact gataatatta ctgagccagt agcttaaaac 1801 cagaggatag attcaatatc gtccctgtat actatggctc atgaatccat ctgcaacaac 1861 atcttggcac gaagtccgcg ctgcatttaa aaaaatctgg ggttatgaag atttccgtcc 1921 accgcaggga gaaattatcc agagtctgtt agcaaaaaaa gatgcgctga ttatcatgcc 1981 cacaggtggg ggaaagtcaa tttgttttca acttcccgca ctgctacaaa ctggattaac 2041 gcttgttgtt tcgcctttgg tggcgctgat ggaaaaccaa gtgcaggaac ttcgggaacg 2101 caatttaccc gctgcacttt tgcacagtga attaccatct gacaagcggc gcatgacttt 2161 acaagctttg gaacgacaac aactcagatt actttacttg tcgccagaaa ctttactcag 2221 caagccagtg tgggaaaggt tgtgccagcc ggaacttgtg attaatggat tgatactaga 2281 tgaagcccat tgtttggtac agtggggaga cacttttaga ccagcttatc gaagattggg 2341 ggcagtacga ccagcgctac tcaaatcaaa accatcggga acaaatatag cactagccgc 2401 ttttaccgcc accgctgacc ctttagccca aactcaaatt cgagaaattc ttcagttaca 2461 gcaaccagca gtttttcgta ttaatcctta tcgttctaat attaatcctt ctattcgtat 2521 agtttggact ccaagaggca gaaagcaaca attcttaaaa tttctttcag ataaacgaca 2581 acaaactggg ctagtttacg 
ttcgcacaag gcgagatagc gaagagttgg ctgcatggct 2641 gctacagttg ggttacgtta cagcaggtta tcatgcagga ttgggcgcag aggaacgtcg 2701 tgagatagaa gcacaatggc tgagtggaaa aatgccgttt gtcatctgta cgtgtgcttt 2761 tggtatgggg ataaataagt ctgatgtccg ttgggtggtt cattttcacc cgccattgtt 2821 gatttcagag tacgtgcagg aaattggacg cgctggacga gatggaaaac cagcccaagc 2881 actgatgttg atcagtgaac ccacagggtg gttagatcca caagataagc aaaggcaaaa 2941 gttttttgag gagaatttgc agcagcaaca gctatccgcg cagcaagttc tgaaaaaatt 3001 gccaagaaca ggggaagtga atgctgtagc gcgagagttt cataatggtg cgatcgccct 3061 ctctctacta cacagcaccg gacaactcaa gtggcttgat ccctttcact acagtatcgt 3121 tccaggggca aaaacccaaa gcgttacaca atcacacgct gcaaagcaga tgaatcaata 3181 tcttacgact agggattgtc gctggcgttt tttgttaact gcttttggtt ttgaggaaga 3241 aattaagaac tggcgttgcg gtcactgcga taattgctgc cgaaaataac tttcttggca 3301 gaagcagaaa aaaatgtcta ccgcgctgag tggcaagagt ttatacagaa tctttgaaat 3361 ttggctaaag tacaacatca ccagaaaagt taggagaatc gattacaatg caacagttac 3421 ctgactccgg ttagcttcac cgttaggcgt ttctaacagt ctgtatgaat gaaaatcaaa 3481 ccagcgagcc tcaacacagc attgtttcca tcagcgtacg agtgctttct cccgctgatg 3541 ctgaatcata tcgttttgtg cgtctgcttg cgcttcatga acagccccca gctttcggct 3601 cgttgccaga ggatgaacca aatctttctg agatagctgc aagactcgca gagagcgacg 3661 agcgttgctt tttcggagcg tttcaagaca atcaactcat tggcaccgtc cggatttctc 3721 gctattgcgc acccaacgag aagcaccgcg ctcatcttgg ggggctatat gttttgcccg 3781 cattccgtcg caacggttgt ggtagatcgc ttgtcaggca agctttgaat tgggcagcga 3841 acgcgcgaag cattaggaga gtcaacctaa ctgtcgtgac ccaacaaaaa gcggcaatct 3901 gtctttacca atcgcttggc ttccgcatct acggtactga gcaagagaca ttctcgaaag 3961 ctggacgttt ctacgacgaa cacctgatga cgttggaact cacttcggat aatgaccgca 4021 acgcctaaca acagtatgaa ccagaagctt aagtgatgtt attgagcgta tggcaagaga 4081 aaagctaccc aaaaatgggt agtaatgggt agctttgtca atagattaag ttacagttcc 4141 taactcctaa cttttttaaa actgccttaa aaagcgtaaa tcactcgcat aaaggcgacg 4201 aatgtcatca atttgatgta aaaccatcgc aaagcgttct acaccaaaac cggcggcgaa 4261 tcctgtgtac acttctggat cgtatcccac 
ggctttgagc acatttggat cgaccatacc 4321 gcaacccata acttccagcc agcgaccatt ccactgcaaa tcaacttcag cagaaggttc 4381 tgtaaacggg aaataactgg cgcggaagcg aataggtaac tcaccaaaca tcgcttccaa 4441 aaatacctta attgtgcctt taagatcagt aaacgtcagt ccttcgtcaa ttgccaaaag 4501 ttctatttgg tggaaaactg ctgagtgagt cgcatctacg ttatctcttc ggtagactcg 4561 ccctggtgca actacccgga tggggggttc ttcagtttcc atataacgaa tttgtactga // LOCUS NODE_5804_length_4608_cov_5.8994074608 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4608) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4608) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4608 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(49..927) /locus_tag="DP116_27170" CDS complement(49..927) /locus_tag="DP116_27170" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411368.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27170" /translation="MAKAADTGGKRLISLAPDAWVQWVTQRPEVVAKEILGSEFQWIS RETDVLVKAYSATHGDFLVLNELQLRYTAHMPLRMRAYAALAQERYRLPTYPVLINIL PPPSTLTVISSYEQEFLGLRAIQDYRVINLWEVDAEIVFQQPLPSLLPFVPILRGGGE ASVVQRALQVLRTDPQLNQLESLLAFFASFVLETPLVQQIMRWDMAVLRESPWYQEIL TEGEERGLQQGLQQGLQQGVQRQLIRVLQRRFGEIPQEVKARLEGESVEQLESFMDSA IAVSSLEEFLTILSTC" gene complement(1022..2326) /locus_tag="DP116_27175" CDS complement(1022..2326) /locus_tag="DP116_27175" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653206.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="PRJNA477356:DP116_27175" /translation="MTKSRERKPRVKQASALVGSSSVSMVEILSMALEALWNNKLRTL LTMLGVIIGITAVIAVTAIGQGTQKSTEQQLQSLGTDILQVQSGAARSGGVRQGSGSA TTLTLEDAQAIAQEVLGVDRVSAFLQQNAQVVYAGNNDSMTIIGTDINYPYVRNTHPQ TGRFFNQEEVDSAKAVAILGPTARDELLGTGSNAEGAQIRIAGETYDVIGVMETKGAQ GPQNPDEQIYIPLTNMSARLVGNNAVKGLAIRGIYVKVKSQDLLDAAQYQTTNLLRVR HGIFPEKGETDDFRTVNSADIIETLTSTSKLFTVMIVAVAGISLIVGGIGIANIMLVS VVERTREIGIRKAIGATGTAILTQFLAEAVVIAAMGGVVGICLGVAIAFAASNLFKFP FVISPWSILFGFGLTFVIGLLAGVIPARNAARLDPITALRSD" gene complement(2354..3892) /locus_tag="DP116_27180" CDS complement(2354..3892) /locus_tag="DP116_27180" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950154.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_27180" /translation="MTLDSPQPTTFKPLGKGLVKDKLSKWLMGLLILSSLTGGGYLVY HQTVISSAQEARSKMQTVPVQRETLPITISANGIIEAKQSTNVSPKSSGRLKSLLVDE GDSVKAGQILAYMDDSNLQGQLIQARGSLAAAQANLQKAIAGNRPQDIAQAQAQLEEA QANLQKGQAGNRPQDIAQAQARLKSAQASFTKAQDDFQRNQQLYNAGAISLQIVNQKR ADRDSAQAAVNEAQQALALQKAGSRTEDIEQLKAVVEQRQQALALLKAGNRKEDIDSA RAQVIQQQGSLKTIQTQIQDMVIRAPFSGIVTSKYSDPGDFVTPTTSGSSVSGATSSS ILSLASNYQVVANVAETDISKIKVGQPVTITADAYPDKTFNGKVAQIAAVASVTSNVT SVKVRVDLSDPETLLLPSMNVDVKFNAGNLNNVLVVPTVAIARQENGTGVRVLRENGK TRFVPIQTGLTVGNKTEVRSGLQGNEKVLVSAPPGSRDSNRGGRSGRMGGGYGGMGGR RGGF" gene complement(4050..4334) /locus_tag="DP116_27185" CDS complement(4050..4334) /locus_tag="DP116_27185" /inference="COORDINATES: protein motif:HMM:PF01292.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27185" /translation="MLVVPISGIFLSNTGGHEIPFFFVTLPNWFEENRSVGAIREASS LRLIAHSLHFWLSYTLLALVILHIIEQRQFLRRTWKRTFKDNSTKQITKL" gene complement(4370..>4608) /locus_tag="DP116_27190" CDS complement(4370..>4608) /locus_tag="DP116_27190" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27190" /translation="NHTSIRIPGIENIDGVLTFQNPNNSSTYQKSCSEIQVEHGDLLS ATCVRIDGSRNHTSIRIPGIENIDGVLTFQNPNN" BASE COUNT 1162 a 1140 c 994 g 1312 t ORIGIN 1 atatcccacc tcatgatttg ttggactaaa ggcgtctcta acacaaagct agcaagtaga 61 taaaattgtc agaaattcct ctaaagaact cacagcgatt gcactatcca tgaaactttc 121 taattgttcc acactctcgc cctcaagcct tgcttttact tcttgaggaa tttcgccaaa 181 gcgtcgttgc aatactcgta ttaactgccg ttgtacacct tgctgaagtc cttgctgaag 241 tccttgctga agtccacgtt cttcaccttc agtcagaatt tcctgatacc aaggcgattc 301 tcgcaacact gccatatccc acctcatgat ttgttggact aaaggcgtct ctaacacaaa 361 gctagcaaaa aatgccagca atgattctaa ctgattcaac tgtggatcgg ttcgtagtac 421 ctgtagtgct cgttgtacaa ctgatgcctc acctcctcct cgcaagatgg gcacaaacgg 481 taataacgaa ggtagtggtt gctgaaacac tatctcagca tccacttccc acagattaat 541 gacgcgataa tcttgaatag cacgtaatcc caaaaattct tgctcgtaac tactgataac 601 agttaacgtc gatggaggtg gcaagatgtt gatcagcact gggtaagttg gcagtcggta 661 gcgttcttgt gctaaggctg cataggctct catacgtaga ggcatatgtg ccgtgtaacg 721 caactgcaat tcgttgagta cgagaaaatc cccgtgggta gcactgtatg ccttcaccaa 781 cacatctgtt tcgcggctaa tccactgaaa ctcagaaccc aaaatttctt tcgccacgac 841 ttcgggacgc tgtgtcaccc attgtaccca tgcatcagga gctaaactaa taagtcgctt 901 accacctgta tctgctgctt ttgccacagg tttttttata ccgttactag ttcttaagaa 961 tatatcataa tgtgttaagc aaaatgtgat acgcgcaagc caaagccaga catgaattct 1021 cctaatcact cctcagcgct gtgattgggt caagtctagc agcattgcga gctgggatga 1081 cgccagccag cagtccaatc acaaacgtca aaccaaagcc gaacaggatc gaccagggcg 1141 aaatgacgaa gggaaatttg aacaggtttg aggctgcgaa ggcgatcgca actccaaggc 1201 atataccaac aacccctccc attgctgcaa tcaccactgc ctcagctaaa aactgagtca 1261 agatggcagt accagtcgct ccaatcgctt tacgaatccc aatttcgcgt gtccgttcga 1321 ccactgacac aagcatgata ttggcaatac caatgccgcc aacaattaaa gaaattccgg 1381 caactgccac tatcatcacg gtaaaaagct tagatgtact ggtgagcgtt tcgatgatat 1441 cggctgaatt caccgtccta aaatcatccg tttctccttt ttcgggaaaa atgccgtgtc 
1501 gtacccgtaa cagattcgta gtttgatact gggcagcatc tagcaaatcc tggcttttga 1561 ctttgacata aattcctctg atggcaagtc ctttgacagc gttgtttccc actaaccggg 1621 cagacatatt ggtaagcggg atataaattt gttcatccgg gttttgtgga ccttgcgctc 1681 ccttcgtctc cataacgcca attacgtcat aggtttctcc ggcgatgcgg atttgtgcgc 1741 cttctgcgtt gcttcctgtt cccagcagtt catctcgtgc ggtgggaccc aatatcgcca 1801 cagctttcgc cgaatcgact tcctcctgat tgaaaaatct acctgtttga ggatgcgtgt 1861 ttcgcacata tggataatta atatcagttc caataatggt catcgagtcg ttgtttccgg 1921 cataaacgac ttgggcattc tgttgcagga aagcagacac ccggtcaaca cctaaaactt 1981 cttgcgcgat cgcctgagca tcttccaaag ttaaagttgt tgcactccca ctgccttgcc 2041 gcactccgcc acttctggct gcacccgatt gtacttgcaa aatgtcagtt cccagtgatt 2101 gtaattgttg ctcggttgac ttttgggtgc cttgaccgat agccgtaacg gcgattactg 2161 ccgtaatgcc aatgatgacg cctaacatgg ttagcagggt gcggagttta ttattccaca 2221 gtgcctctaa agccatcgac agaatttcaa ccatcgacac cgatgaactc ccaaccaaag 2281 cagatgcctg tttaacccgt ggtttgcgct cacgtgattt tgtcattgct aaacctcgct 2341 tgtaaaatca agtctaaaaa ccaccgcgac gaccacccat accgccataa ccaccgccca 2401 tcctgccaga acgaccacct ctattactat ccctggaacc cggtggcgca ctcaccagaa 2461 ctttttcatt tccctgtaaa ccggagcgca cttcggtttt gttacccacg gtcaatccgg 2521 tttgaattgg cacaaatcga gttttaccat tttccctaag tacccgcact ccggtgccat 2581 tttcctggcg ggcgatcgcc accgttggta caacaagcac attatttaag ttccctgcat 2641 tgaatttcac atcaacattc atgcttggaa gcagtagagt ttcaggatcg ctaagatcca 2701 ctctcacttt tacgctggtg acgttggatg ttacacttgc aacagcggca atctgggcaa 2761 ctttcccatt aaaggttttg tcgggatacg catcggctgt gattgttacc ggctgtccta 2821 ccttaatttt gctaatatcg gtttcggcaa catttgccac aacttgataa tttgatgcca 2881 gcgacagaat cgaagaagag gttgctcccg aaacagaact cccggatgtt gtaggtgtca 2941 cgaaatcacc cggatcggaa tacttggaag tcacaatccc actaaacggg gcacgaatga 3001 ccatatcttg aatttgtgtt tgaatcgttt tcaggctgcc ctgctgctga attacctgag 3061 cgcgtgcgga gtcaatatct tccttgcggt ttcccgcttt cagcagtgct aaagcttgct 3121 gtctttgctc caccaccgcc tttaattgct cgatatcttc ggtgcgtgat cctgcttttt 3181 
gcaatgccag cgcttgctgc gcttcattta ccgcagcttg ggcgctgtcc ctgtcagcac 3241 gtttttggtt tacaatctga agggaaatcg caccagcatt gtatagttgc tgattgcgct 3301 ggaaatcatc ttgagctttt gtgaaacttg cttgagcgct ttttaaacgt gcttgtgctt 3361 gagcaatatc ttgagggcgg ttacctgctt gtcctttttg cagattcgct tgggcttctt 3421 ctaattgtgc ctgtgcttga gcaatatctt gagggcgatt ccctgctatt gctttttgta 3481 agttcgcctg ggcagcagcc agagatcccc gtgcttgaat cagttgccct tgcaggtttg 3541 agtcatccat gtatgccagg atttgccctg ctttcacaga gtcaccttca tccaccagta 3601 agctttttag acgtccagag cttttggggc tgacgttggt cgattgtttg gcttcaatga 3661 tgccgttggc tgagatagtg atgggtaagg tttctcgctg gactggaact gtctgcattt 3721 tacttctggc ttcctgagcg gaggaaatca ccgtctgatg gtaaacaagg tatccacccc 3781 ccgtcagtga actaagaatg agtaacccca tcagccactt gcttagcttg tctttgacca 3841 agccttttcc taaaggcttg aatgtggtag gttggggaga atcaagtgtc ataggctgct 3901 ttccaccttg agtgtaaaca aagttattac gttcgtgctt tgtgcgcctc ccggagggag 3961 tatcgctttt gcttcgttgt attgaaacta caaggagttg atacttttaa ttacagcagc 4021 acaacatgac agcttgccct ttgtgtaaat tacaattttg taatttgttt tgtactgtta 4081 tccttgaaag ttcgtttcca ggttctacgg agaaactgac gttgctcaat tatatgcagg 4141 ataaccagtg ccagcaaagt gtacgatagc cagaagtgca gactgtgagc gataagccgg 4201 aggcttgacg cttcgcgtat cgcaccaaca gaacgatttt cctcaaacca attgggcaac 4261 gtcacaaaaa agaacggaat ttcatgccca cctgtgtttg ataaaaaaat gccactgatt 4321 ggaacaacca gcattaggag ataaagcagt aaatggaggg aaaaggtctt tagttgttag 4381 gattctgaaa cgtcaatact ccatcaatat tttcaattcc cggaatgcga atagatgtat 4441 ggtttcgact gccatctatt ctgacacaag ttgctgataa caaatctcca tgctcaactt 4501 gaatctcaga acaacttttt tgataggtac tagagttgtt aggattctga aatgtcaata 4561 ctccatcaat attttcaatt cccggaatgc gaatagatgt atggtttc // LOCUS NODE_5834_length_4575_cov_4.0314164575 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4575) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4575) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4575 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..645 /locus_tag="DP116_27195" CDS <1..645 /locus_tag="DP116_27195" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007356606.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_27195" /translation="VFERFRQADSSTTRSFGGLGLGLSIVRNLVELHGGSVHVESPGE GQGATFTVKLPLSPISSKIQPRVHPTVGDPLPFDCTPQLDGLRILVVDDEVDARELLI QILVECGAEVVAVGSADEVIAALKEQTSDSRFDILISDIGMPEQDGYALLRRVRALES NEGGRIPAIALTAYARAEDRKAAFLAGFQSHVAKPVEPGELIAVIGNLTGRSMS" gene 695..883 /locus_tag="DP116_27200" CDS 695..883 /locus_tag="DP116_27200" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872761.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4089 domain-containing protein" /protein_id="PRJNA477356:DP116_27200" /translation="MEGKELNVGEYVDLMALLLDLQLKDEFRGGVVANFERIMAIAQV VNEFPLPETIEAASVFEP" gene 880..2280 /locus_tag="DP116_27205" CDS 880..2280 /locus_tag="DP116_27205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456982.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="AtzE family amidohydrolase" /protein_id="PRJNA477356:DP116_27205" /translation="MSLELVDAVAIASAVREGKVSAVEVTKAALQRIAARDNELNCFT TITAETALADAERIDREISQGKNPGCLAGVPFAVKNLFDIAGLTTLAGAKINAENSPA TQDATAVARLKQAGAVLVGALNMDEYAYGFVTENFHYGVTHNPHDLKRIAGGSSGGSA AAVAAGLVPLTLGSDTNGSIRVPAALCGVFGLKPTYGRLSRAGVALFSSSLDHVGPFA RSVRDIATAFDVVQGEDERDPVCTKRPRVELSDATSRQNIEDLRIGIADDYFTKGASP EAIDAVQKVADALGVTQYVTIPEAHRARAAAFVITACEGANLHFEKLQLRPQDFDPAT RDRFLAGALIPSTWYIQAQRFRRWYRDKVREIFQKVDVILAPTTPISAPLISQETMIL DGEEILVRPHLGLFTQPLSFIGLPVLSVPIQRPNVLPLGVQLIAAPYNEALILRVASV LEAKGVISAPIVPLKL" gene complement(2302..2964) /locus_tag="DP116_27210" CDS complement(2302..2964) /locus_tag="DP116_27210" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27210" /translation="MSFLHKFSIAVLSTVYVAVGTINKAGAVTLNFDELGSPTPVDGL TVKGVTFDFKINGVDSSEAIYNQSFPPNFPSNLFVNLQAPLLEGNAKGILTLDFTAPI SALQFAVGVETSGTLTSGLTVELFDTGLKSLGITPVDTSSLAFLSEGLFKYNGVPITR AVLDFDETKLGFDPSVAPRFSLDNLTYTAVPETSSLFGLLALGALGAGVRLLRKQQQK AL" gene complement(3185..3865) /locus_tag="DP116_27215" CDS complement(3185..3865) /locus_tag="DP116_27215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113100.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_27215" /translation="MPLTILIVDDDLGTRLSISDYLELSGYSVMTADDGQEALTIVEE HHPDLIVTDIIMPRMNGYDLVRRVRQQPGFRLLPVILLTARTKTQERILGYQSGCDLY LPKPFELEELAAAIRNLLERSQIIQSEYGFSHQENFVTHTPTKSPDTNNFDVTGIQKP HIFDLTAREQEVLELLTHGLSNAEMGQHLHLSPRTVEKYVSSLLRKTETSNRAELVRF AITHGLVK" gene 4359..>4575 /locus_tag="DP116_27220" CDS 4359..>4575 /locus_tag="DP116_27220" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009457839.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YbaB/EbfC family nucleoid-associated protein" /protein_id="PRJNA477356:DP116_27220" /translation="MTGKGQGFGFGLGKMRELADAFKKAQQVQEGAKRLQEELEQMEI QGESGGGLVKVVVSGNQEPRRVEISPNA" BASE COUNT 1292 a 844 c 1102 g 1337 t ORIGIN 1 gtttttgagc gcttccgtca ggctgatagc tcaaccactc ggtcttttgg cggtttggga 61 ttggggttat ccattgtgcg taacttagtt gagttgcatg gtggttcagt tcatgttgaa 121 agtccggggg aaggacaagg agcaacgttt actgtgaagc taccgctgag tcctattagc 181 agcaagatac aaccacgggt gcatcctact gtgggggatc ctttaccgtt cgattgcact 241 ccccagttgg atggtttgcg aatattagtg gtggatgatg aggtggatgc tcgtgagttg 301 ctgattcaaa ttcttgtaga atgtggagct gaggttgtag cagtgggaag tgcagatgag 361 gtgattgcgg cgctgaagga gcaaacgtct gactcacgtt ttgatattct tattagtgat 421 attggtatgc cagaacagga tggttatgct ctgctgcgac gagtgagagc gcttgagtct 481 aatgaaggtg gacggattcc ggctattgca ttgactgctt atgctagggc tgaggatcgc 541 aaggcagctt ttttagcagg gtttcaatct catgttgcta agcctgttga accgggagaa 601 ttgattgcgg tgattggaaa tttgactggg cgaagtatga gttaacgaac cgcagaggcg 661 cagaggacgc agaggaagag gaagaggagg agagatggaa ggtaaggagt tgaatgtggg 721 tgagtatgtg gatttgatgg ctttgttgtt ggatttgcag ctgaaggatg agtttcgagg 781 tggggtggtg gcgaattttg agagaattat ggcgatcgcc caagttgtga atgagtttcc 841 tttgcctgag acaattgaag ctgcttctgt ttttgagcca tgagtttaga actagttgat 901 gctgttgcga tcgcttctgc tgtgcgagaa ggtaaagtga gtgcggtgga agtgactaaa 961 
gctgctttac aaagaattgc cgcgcgggat aatgaactta attgttttac aactataact 1021 gctgagactg ctttggcaga tgcagaaaga attgacaggg aaatttctca aggtaaaaat 1081 cctggttgtt tggctggtgt cccttttgct gtcaaaaatc tttttgatat cgctgggtta 1141 acgactcttg cgggtgcaaa aattaatgcg gaaaattctc cagcaactca ggatgcaaca 1201 gcagtcgcaa ggttgaagca agcgggtgct gtacttgtag gtgctttgaa tatggatgag 1261 tacgcctatg gatttgtgac ggaaaatttt cactatggtg tgacacataa tccgcatgat 1321 ttaaagcgga tagctggtgg ttcatcgggt ggttcggcgg cggcggttgc agctggtttg 1381 gttccactga cgctgggttc tgatacgaat ggttctattc gcgtccctgc tgctttgtgt 1441 ggtgtttttg gtttgaagcc gacttatggg cgattgtcgc gtgctggggt agcattattt 1501 tccagcagtt tagaccatgt tggaccattt gctcgttcgg tgcgggatat tgcgacagcg 1561 tttgatgtgg ttcagggaga agatgagaga gatccagttt gtacaaaacg tcctcgtgta 1621 gaactttcgg atgcaacgtc aagacaaaat atagaagatt tacgtattgg tattgcagat 1681 gattatttta caaaaggcgc aagtcctgaa gcaatagatg cagtacaaaa agttgctgat 1741 gctttgggtg tcacccagta tgtcacaata cctgaagcgc atcgcgcgcg tgcagcagcg 1801 tttgtcatca cagcttgtga gggggcaaat ttgcactttg agaaactgca attgcgtccc 1861 caagattttg atccagcaac acgcgatcgc tttctcgctg gtgctttgat acccagtacc 1921 tggtacattc aagcacaacg ctttcgcaga tggtatagag ataaagttcg agaaatcttt 1981 caaaaggtag atgtcattct tgcgcctacg acaccaattt cggcaccact cataagtcaa 2041 gaaaccatga ttttagatgg agaagaaatt cttgtccgtc ctcatttggg gttgtttact 2101 caaccattat ctttcattgg gttgcccgtt ttgtcagtac caattcaacg tccaaacgtt 2161 ctaccccttg gcgtacagtt aatagctgca ccatataatg aagcgctgat tttacgagta 2221 gcatccgtgc tggaagcaaa gggtgtaatt tcagcgccaa tagtgccact aaagctatga 2281 ctaaagctat gaatttatga gttacaatgc tttttgctgc tgtttacgta ataagcgcac 2341 acctgcaccc aaagcaccta aagctagcaa accgaagagc gatgacgttt caggtactgc 2401 ggtgtatgtg aggttgtcga gagaaaatcg cggagctaca ctcggatcaa aacctaattt 2461 tgtctcatca aaatccagca cagcccttgt tattggaact ccgttatatt tgaacagtcc 2521 ctcagacaag aaagctaagc tgctggtatc cacgggtgta attcctaaag attttaatcc 2581 tgtatcaaaa agttctacag ttagaccaga agttaaggta ccacttgttt ccactcctac 2641 agcgaactgg 
agtgctgaaa tgggtgcagt gaaatcaagt gttaaaatac cttttgcatt 2701 cccttccaaa agtggagctt gaagattaac aaataagttg ctgggaaagt ttggagggaa 2761 agattgattg tagatggctt cgctagagtc aacaccgttt attttgaaat caaatgttac 2821 ccctttgaca gtaagcccat ccaccggtgt aggagaacct agctcatcaa agtttaacgt 2881 cactgctcca gctttgttta ttgtaccaac tgcaacataa accgtactga gtacagcgat 2941 ggaaaattta tgaagaaaag acataaacct tggtactcat actaagatac tcaaagttta 3001 gtactgttct tcataaaatt aatatgatac atattttgat gttttctgaa ggatttacat 3061 atttactaaa tgaactctca gtcaacaaac ccaagatata aattcaatag tagaaagtat 3121 caagataagc aaaaaataat actcccactg aatgattcgt ttattatgta cagataaatg 3181 tgagctattt tacaagacca tgcgtaatgg caaaacgcac caactcggct cggttgctgg 3241 tttcagtttt tctcaataaa ctactaacgt atttctccac agttcttgga cttagatgca 3301 ggtgctgacc catttcagca ttagaaagac catgagtcaa aagttccaga acttcttgct 3361 ctctagctgt taagtcgaag atgtggggtt tttgaatacc agtcacatca aagttattgg 3421 tgtcaggcga ttttgtggga gtatgagtga cgaaattttc ttgatgagaa aaaccgtact 3481 ctgattgaat aatttgcgat cgctctaaaa gattgcgaat tgctgcagcc aactcttcta 3541 gttcaaaagg tttgggcaaa tacaaatcgc atcctgactg atagcctagg attctttcct 3601 gagtttttgt tctcgctgtt agcaatatga caggtaacaa ccggaaccct ggttgttgac 3661 gcactcgcct cactaagtca tatccattca ttcgtggcat gattatgtca gtgacaatca 3721 agtcaggatg atgttcctcc acgattgtca aagcctcttg accatcatcc gctgtcatta 3781 ctgagtagcc agatagttca agataatcgc tgatagataa acgagtgccc aggtcgtcat 3841 ctactatgag gatcgtcaag ggcatggtca gtagacccct aagatttttt agtcttgagt 3901 ttgtttttat aagcactata agtaaacaca ggtaagaata gtactctcat ttttcatact 3961 atgacaggat taacaatagg taactaccat taagttgatt tataaaatta atattaaatc 4021 cctcttaagg ttaacaactt aagctcacca tcaactggct actttaaata ttatatattg 4081 ctgtatgatt tgctacaata tagtcaagaa tattccattg aatagaatgt gagttgaaaa 4141 aatctagaca tgacaaggta agtcaacgat tgaacccaac acaaacgtaa taaaatctat 4201 ttctaagtgc tttgaatgtt ctacataaca gtaagttccg tcttgcgatt tcttccttgt 4261 ttgtaaacaa agctatggct aatctctctc tacaatttac tcacgacatc tataattgaa 4321 tttgatttgt cattcaaagc 
acacttggac aagagattat gacaggtaaa ggacagggat 4381 ttggctttgg cttgggaaaa atgagagaac tggctgatgc ctttaaaaaa gcgcaacaag 4441 ttcaagaggg cgcaaagcga ctccaagaag aattggaaca gatggagatt caaggagaga 4501 gcggcggtgg tttagtcaaa gttgttgtta gtggtaatca agaaccgagg cgagtggaaa 4561 tctctccaaa cgcat
//
LOCUS NODE_5951_length_4435_cov_5.298402 4435 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4435) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4435) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4435 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 219..1646 /locus_tag="DP116_27225" CDS 219..1646 /locus_tag="DP116_27225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318960.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidase S8" /protein_id="PRJNA477356:DP116_27225" /translation="MFTRISSNSSLFNQGQNITSLSSVNTFDYNNNYNVNLNGHSSFN SAINDLENNAEEQLFNDNGYSFTSGYGLINAAAAVAKAAGQTTFADVPKLGGNNWGAD LVKAPEAWAKGYTGQGVVVAVLDTGVDFNHDDLKDNIWTNSKEITGNGIDDDGDGFID DVHGWNFVDNNNDVSDKFGHGTHVSGTIAGEKNDIGVTGIAYGAKIMPVKVLNDEGSG SYTSISNGIYYAVDHGANVINLSLVGGSSDSTLEKAVEYASSKGVIVVMAAGNDSGFQ PGYPARYADKWGIAVGAVNKNNNMADFSNKAAMNSLTYVTAPGVNVYSTIPGNKYASY NGTSMATPHVAGVVALMLSANHSLTDAQVRQIVAETAVHDTQVPNFSLGNFVTKSSAT ISSFSVTDYHDGSYGSQDQLDDSNTVQFSGFSVSSTLSSQFSSYQTLLNNLNYSITDY DDTTVSENILKQRQEMLEENFRMVS" gene complement(1708..3204) /locus_tag="DP116_27230" CDS complement(1708..3204) /locus_tag="DP116_27230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016863090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hemolysin D" /protein_id="PRJNA477356:DP116_27230" /translation="MRYSLAANTATARKTKQRFAKPDEHLSYELGKAVQELPPLYTRL LAGTLSVVVLGTIAWANFSRIDEVATAPGELIASTQVRPVTSIGNGTIVKVNVKEGDR VIKGQTLIQRDPDLQRVDVTRLASSTKLIQEDLRRLDAERTGGKIAGTQLQDQLLNSR LRDYQARQAASVAEANRQQALINQAKVRQTRLQENLVNARTSLTNAKTNLVNAQNIRR KVETGLAIAQQREQSLKTLVTPGAIPRLDYLDAQERLNRATTEITRANDEVVNSQNKV TEAQDKVTSLEKDIAAQAEEIRQAQEAYQAARSQAQRIESERQSEIITQINKRKEELT TVQGQLEQAQKQQDMETIKAPVTGTIYRIKATTGPVQAGEELVSILPEGQELLLEVKV LNRDIGFIREGMKAKVKMATFPFQEFGIINGEVVQVSPNAIVDKEMGLVFPTRIKLNK HSVMAQGQEVAFTPGMTANGEIVTRKKSILTFLTEPITRRFSEAFSVR" gene complement(3488..4363) /locus_tag="DP116_27235" CDS complement(3488..4363) /locus_tag="DP116_27235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318958.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="SDR family NAD(P)-dependent oxidoreductase" /protein_id="PRJNA477356:DP116_27235" /translation="MFLVTGATGGIGRTVVRLLREQEKQVRAFVRLTSRYSELEHRGA NIFIGDLRQEKDIQKACQGVQYIISAHGSGGDALALDYRANIELIDSAKANQVQHFVF ISVLGADRGYEDAPVFKAKRAVERYLETSGLNYTILRPSGLASDLLPLAERFRETGFY LLVGDPKNRSSIVSTDDLARIVVDSVTVEAAQNQILSVGGPEILTREDIPKIISRIFN REPLIINPPLLAVDGLRGALGFFNPQAQKALGTFRTLLANEFFCTREEIAKLETTFNF ELETLESFLRRYLAI" BASE COUNT 1207 a 935 c 951 g 1342 t ORIGIN 1 acatagtaat tattatctat agcaatccta aatcattcgt aaaaggtgag aaacccggta 61 tctcaaagat accgggtttc tagagattcg caatttttac aaatcacata ggattgctat 121 aaagccaaat gtttgtgaaa aaaaaccctt attaaagaga agtattttgt aagttttttg 181 gtcaaaaact agagatttta ttaaggaaaa tcatactaat gttcactcgt attagtagta 241 acagttctct ttttaatcaa gggcaaaata ttacttctct atcctcagtc aacacatttg 301 attataacaa caactataac gtcaacctta acggtcatag tagctttaac tcagcgataa 361 atgacctaga aaataatgct gaagaacaat tgtttaatga caatgggtac agcttcacct 421 ccggctatgg cttaataaat gcagcagcag cagtggctaa agcagctggt caaactacct 481 ttgctgatgt tcctaagctt ggtggtaaca attggggagc cgaccttgtg aaggctccag 541 aagcctgggc aaaaggatac actggtcagg gtgtggtcgt tgctgttctg gatactgggg 601 ttgacttcaa ccacgatgat ttaaaagata atatctggac aaattcaaag gaaattactg 661 gcaatggcat agatgatgat ggagatggtt tcattgacga cgttcacggc tggaactttg 721 ttgataacaa caacgatgtt tcagataaat tcggtcatgg aacccatgtc tctggaacga 781 ttgctgggga aaaaaatgat attggtgtga ctggtattgc ctacggtgct aaaattatgc 841 cagtcaaagt ccttaatgat gaaggttcag gctcctatac ttcgatttct aatggcattt 901 actatgccgt agatcatgga gctaacgtga ttaatcttag tcttgttggt ggttcttctg 961 acagcactct agagaaagct gttgaatatg ccagcagcaa aggggtgatt gttgttatgg 1021 cagctggcaa tgatagtgga tttcaaccag gctatccagc ccgctatgca gacaaatggg 1081 gaattgcagt tggagcagtt aataaaaata ataacatggc tgatttctcc aacaaagcgg 1141 caatgaattc actcacttac gtgacagccc caggagttaa tgtctattct acaattccgg 1201 gtaataagta tgcttcctat aatggcacgt ctatggcgac tcctcacgtt gctggcgtag 1261 ttgctctcat gctcagtgct aaccacagtt tgactgatgc 
tcaagtgcgt cagattgttg 1321 cagaaacagc agtacacgac acacaagttc caaactttag cctgggcaac tttgtaacca 1381 agagcagtgc tacaatttct agcttcagcg ttacagacta ccatgatggt tcttatgggt 1441 cacaagacca gttggatgat agcaatactg tgcaattctc tggctttagt gtcagttcta 1501 cactgagttc acaattctca agctaccaaa cactactaaa caaccttaac tatagtatca 1561 ctgattatga tgacaccact gtttctgaaa atatactcaa gcaacgccaa gaaatgttgg 1621 aggagaattt ccgaatggta agttaagact aatgaattga aggtggatcg taaggcacac 1681 cgccttgcac catcacaccc tccaaaccta cctcacagaa aatgcctcac tgaatcgacg 1741 agtaattggc tcagttaaga atgtcaaaat tgactttttg cgagtgacaa tctcaccatt 1801 tgcagtcatc ccaggagtaa atgcaacttc ttgtccctgc gccattacgg agtgtttatt 1861 cagcttaatt ctggtgggga aaactaagcc catttctttg tcaacaatag cattgggact 1921 gacttgcacg acttcaccat taatgatgcc aaattcttga aatgggaaag ttgccatttt 1981 gacctttgct ttcattccct cgcgaataaa accaatatct cggttgagga ctttcacctc 2041 taacaaaagt tcttgccctt ctggtaatat agatactaac tcttcacccg cttgcactgg 2101 tcctgttgtc gctttgattc tatagatagt acctgtgact ggagctttga tagtttccat 2161 atcttgctgc ttctgggctt gctcaagttg accttgaacg gtagttagtt cttctttgcg 2221 tttgttgatc tgggtgataa tttcactttg acgctctgat tctatacgct gtgcttgact 2281 gcgagctgct tgatatgctt cttgggcttg gcgaatttcc tctgcttgag ctgcaatatc 2341 cttttctaag gatgtgactt tatcttgagc ctctgtcact ttattttggg agtttaccac 2401 ctcatcattt gcgcgggtaa tttccgtagt cgctcgattg agtctttctt gggcgtccag 2461 ataatcgagt cgaggaatag cgccaggagt cactagggtt ttgaggcttt gttccctttg 2521 ttgagcaatt gctaagccgg tttcaacttt cctacggata ttttgggcgt taaccaagtt 2581 tgttttggcg ttggtaaggc tagttcttgc gtttaccaga ttttcttgta accgagtctg 2641 gcgaactttc gcctggttaa tcagcgcctg ttggcgattt gcttccgcaa cggatgcagc 2701 ttgacgcgct tgataatctc gtaaacgaga gtttaacaat tgatcttgaa gttgtgttcc 2761 agcaattttc ccacccgtac gttctgcatc taaacgtcgc aaatcttctt gaattaattt 2821 agtagatgag gctaagcgag tcacatcgac tcgttgcaag tctggatcac gttgaatcag 2881 ggtttgacct ttgatgacgc gatcgccttc tttaacatta accttgacaa tcgttccatt 2941 acctatcgat gtcactggtc gtacttgtgt ggaagcgatt aactccccag 
gtgctgtggc 3001 gacttcatca atccgcgaaa aattcgccca ggcgattgtt cccagtacca ctacactcag 3061 tgttcccgct aacaatctag tgtacagtgg tggtaattcc tgtactgctt tgcccagttc 3121 ataggataag tgttcatcag gttttgcaaa tcgctgtttt gttttacgtg ctgtagccgt 3181 atttgcagct agggaatatc tcataagaat ttagattttg gattttagat tataggtgag 3241 tagtcagtgc taaacagtta tcagttatca gttgtgaaaa aacaataact tgataactga 3301 taaccaaatc tttctatggc acttcgtgcc cttcaggtgg gcactgcaca caacctaaaa 3361 aataaggatt tgagcgcacg gtgaatccag cgctgcagga gggtttcccg acagaggcga 3421 ctggtgaacc cgaagggtta tcaatgaaca ataacttgat aactgataac tgttcactgt 3481 ttactgatta aatcgccaga taacgtcgca agaaactttc tagtgtttcc agctcaaaat 3541 tgaaggtagt ttccaatttc gcaatttctt ctcttgtaca gaaaaattcg ttagctagta 3601 atgtacgaaa cgttcccaaa gctttttgtg cttggggatt gaaaaaacct aatgcacccc 3661 gcagcccatc tacagctaag agtggcgggt taattatcag tggctctcgg ttaaaaatgc 3721 gactaataat tttgggaata tcctctcgcg ttaaaatctc tggtccccca actgataata 3781 tctgattttg agccgcttca actgtcactg aatctactac tatccttgcc aaatcatctg 3841 tactaacaat agagctacgg tttttggggt cgccaacaag caggtaaaac cccgtttccc 3901 gaaaacgttc tgctaatggt agtaagtcgg atgctaatcc agatgggcgt agaatagtgt 3961 agttcaagcc actagtttcc aggtatcgct ccactgctcg ttttgcctta aacacaggag 4021 catcttcgta tcccctgtct gctcctagca cggaaataaa aacaaagtgc tggacttgat 4081 tagcttttgc ggaatctata agttcgatgt tagcgcgata gtctaaagct aaggcatctc 4141 caccagaacc gtgagcgctg ataatgtact gtacgccctg acaagctttt tgaatatctt 4201 tttcttggcg caaatcacct ataaagatgt tagctcctcg gtgttctaac tcgctgtagc 4261 gcgaagtgag acggacaaat gctcgcacct gcttttcctg ttcacgtaaa agtcgcacaa 4321 cggtgcgacc tattccccct gtggctccag tgactagaaa catatcaatt ttggattttg 4381 gatgcgtgga ttttggatga gtgttgactc atgactaaaa ctaggggtgt agggg
//
LOCUS NODE_6064_length_4295_cov_4.600472 4295 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4295) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4295) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4295 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 322..1158 /locus_tag="DP116_27240" CDS 322..1158 /locus_tag="DP116_27240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017747318.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PEP-CTERM sorting domain-containing protein" /protein_id="PRJNA477356:DP116_27240" /translation="MIIKKITGLFAATAAITSVFSAVTPAGAATFTWENLTPYEVKDK STDDSGFQSRIPEFQQYVQQERIAIPEDKLNKLDPTKLSLKNDHNVRVWFLNEGAGYK NQLAYEAINGSDYQKGLIFENISCNTSNGANSACQIGEDNGVLNIGDYVDLGTKAAGT KLNFFLKADGFNNPNGYVYGADATQNPDGLDHLVAYEMDGYLLVGFEDLYGPEGFRSN GNGVLAADRDFNDVVFLVDLGRDNIATVPESASAIALLGIGGVAMLQQRRRRQKQAKE IA" gene 1210..4170 /locus_tag="DP116_27245" CDS 1210..4170 /locus_tag="DP116_27245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874371.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA polymerase I" /protein_id="PRJNA477356:DP116_27245" /translation="MSQTTPKNQLASDSLTTTSPTFILVDGHSLAFRSYFAFAKGRDG GLRTKTGIPTSVCFGFLKSLLEVMATQQPQAMAIAFDLGLPTFRHEAYDTYKANRPET PEDFVPDLKNLHELLDALNLKIVTAARYEADDVLGTLAQQATAAGYQVKIVSGDRDLF QLIDTEKQISVLYFSPDALKVSKASLTEFGPEQVKEKLGIRPSQIVDYKALCGDKSDN IPGVRGIGEKTAVRLLNTYESVKRIYDALHEIKGTLHTKLETGKEDAEKSRELAEIVL NVPLEVDLENCILKGFDISALTPILEKLEFKSFLGKINELQQRFGGKVEQTPETETNE SISTQLKPSTNNEDDDLWFFTADDTANQQQPPTSAIKPRIINTPEQLTELVKLLQKFT DTQHPVAWDTETTDLEPRDADLVGIGCCWGTEPDEMAYIPLGHKVGDNLNKDFALEAL RPILESADYPKALQNAKFDRLILRCQGIKLAGVVFDTMLASYVLNPDSNHNLSDLGQR YLGLTAKSYADLVPKGKTIADLDIPSVADYCGMDVYVTFQLVAKLREELENPALYKLL VDVEQPLEPVLAEMEDQGISINTAYLQELSQQLEIDLAKFEEEVFNIAGEKFNLGSPK QLSQILFDKLGLNTKVSRKIQTGYSTDAATLERLREVDETGIIDAIVEYRTLSKLKST YVDSLPKLVRSDTHRVHTSFNQTVTSTGRLSSSDPNLQNIPIRTAFSRQIRKAFVPES GWLMVAADYSQIELRILAHLSQEPVLVEAYQQNEDIHTVTARLVFEKENVTSEERRLA KTINFGVIYGMGSLRFSRSTGVDKANANEFIKRFNSRYPKVFEYLEKVKKQAISQGYV ETILGRRRYFDFTSNNIRKYKGNKPEDIDLKKLGNLGAEDAGLLRAAANAPIQGSSAD IIKIAMVKLHEVLQSYQTRLLLQVHDELVFEVPPEEWGELQPRIKDEMENAVSLSVPL VVDVRVGDNWMETK" BASE COUNT 1343 a 833 c 935 g 1184 t ORIGIN 1 tcggaaaata ctcaagtcac tcaatcaaaa cctaagatat aaaagaatga actgatgttc 61 cattactaaa aaaagatttg gattcaaaac taatatagtt cagagcagaa taaatttcag 121 ttgatgtagg cagtcgcaac cccagaaaac ctttcagttg gtgttgggta gagttcctca 181 aacggacagt tcgttctagt taggggaagt gttgctgtgt aattgtctgc ccaactaata 241 tcgttttatt ttttctctaa ctgatcgata ttagaaccta aacaacagag tattttccta 301 gacaaaaaat aaagggttta tatgattata aaaaaaataa cgggactttt tgctgcaact 361 gctgctatca caagtgtatt ttctgctgtc actcctgcag gcgctgccac atttacttgg 421 gagaacctga caccctatga agttaaagac aaatcaaccg atgattcagg attccaatct 481 cgaattcccg agtttcagca atatgtgcaa caggaaagaa ttgcaattcc tgaagacaaa 541 ttaaataaac tagatcccac gaagctgagt ctaaaaaatg atcacaatgt tcgcgtttgg 601 ttcttaaatg agggtgctgg ctataagaat cagttggctt atgaagctat caacggttct 661 gactaccaga aaggattaat ttttgaaaac atctcttgta atacatcaaa tggcgcgaat 
721 tcagcgtgtc agataggaga agataatggc gttttaaata tcggagacta tgttgattta 781 ggcacgaaag cagctggaac taaactcaat ttctttctga aagcagatgg gtttaacaat 841 cctaacggct atgtttacgg cgcagatgca actcaaaatc cagacggact ggatcatcta 901 gtggcttatg aaatggatgg ttatctgttg gtgggttttg aggatttgta tggaccagaa 961 ggttttagat ctaatggaaa tggtgtttta gcagctgatc gcgacttcaa cgacgttgtt 1021 ttcttggttg atttaggtag ggacaacata gcgactgttc cagaatctgc aagtgcgatc 1081 gcacttttgg gtataggagg agttgctatg ttacaacagc gtcgccgtcg ccaaaagcaa 1141 gccaaggaaa tagcttaatt ttctctacag atgggattcg tagcccaagt cgctggctaa 1201 aataggttta tgtcccaaac aacaccaaaa aaccagttgg cttccgattc cttaacgact 1261 acttctccca catttattct agtggatggg cactccctcg cctttcgttc gtacttcgct 1321 tttgctaaag gaagagacgg cggattacgg acaaagacgg gaattcccac aagcgtgtgt 1381 tttggctttc tcaaatcttt gttggaggtg atggcgactc aacagccaca agcaatggcg 1441 atcgcctttg atttgggttt gccaactttt cgccacgaag catatgatac atacaaagca 1501 aaccgcccag aaacaccaga agactttgtt ccagatctga aaaatctaca tgagttgcta 1561 gatgctttga atctgaaaat tgtcacggct gctcgttacg aagcagatga tgttttagga 1621 actttagcac aacaagcaac ggctgctgga tatcaagtca aaattgtcag tggcgatcgc 1681 gatttatttc aactcatcga cactgaaaaa caaatcagtg ttctgtattt tagcccagac 1741 gccttaaaag tttctaaagc aagtcttact gaatttggtc cagaacaagt taaagaaaaa 1801 ctaggtatta gaccatcaca gatcgttgat tataaagctc tttgtggtga caaatcagat 1861 aatatccctg gtgtcagggg aattggcgaa aaaacagccg tgcgcttgct gaacacgtac 1921 gaatctgtga agcgaattta tgatgcatta catgaaatca aaggtacact tcacacaaag 1981 ctagaaactg ggaaagagga tgctgaaaag tctcgtgaat tagcggagat tgtccttaat 2041 gttcctttgg aagttgattt agaaaactgc attctaaaag ggtttgatat cagcgccctg 2101 acacctattt tagaaaagct agaattcaaa tcttttttag ggaaaattaa cgaacttcag 2161 caacgttttg gtggaaaagt tgaacaaaca ccagaaacag aaacaaacga atcaatcagc 2221 actcaactta agccaagcac aaataacgag gatgatgatt tgtggttttt cactgctgat 2281 gatacagcaa atcaacagca accccctact tctgcaatta aaccacgcat cattaacacc 2341 ccagaacaac tcaccgaatt ggttaagttg ctgcaaaaat tcaccgacac tcagcatccc 2401 gttgcttggg 
atactgaaac gactgattta gaaccacgag acgctgactt agtgggaatt 2461 ggctgctgct ggggaacgga accagatgag atggcttaca taccacttgg tcacaaagtc 2521 ggggataatt taaacaaaga ttttgcctta gaagcactac gtccaatttt agaaagtgct 2581 gattatccca aagctttgca aaatgccaaa tttgaccgct taattctacg gtgtcaagga 2641 attaagttgg cgggagttgt ctttgacacg atgctagcaa gttatgtcct aaatcctgat 2701 agtaatcata atctgagtga cttgggtcag cgttacttgg gtttaacggc aaaaagttat 2761 gctgatttgg ttccaaaagg gaaaaccatc gctgatttgg atattcccag cgtcgcagat 2821 tactgcggta tggatgttta tgtcacattc caactcgtgg caaaattgcg agaggaactg 2881 gaaaatccag ctttgtacaa actactcgtg gatgtggaac agccactaga accagtttta 2941 gctgagatgg aagaccaagg aatctccata aatacagcgt acctgcaaga actttcacag 3001 cagttagaaa tagatttagc gaagtttgaa gaggaagttt ttaacattgc tggagaaaaa 3061 ttcaacttgg gttcccccaa acaattgagc cagatattgt ttgacaaatt aggattaaat 3121 acgaaagttt cccgtaaaat tcaaacgggt tattctacag atgctgcgac attggaaaga 3181 ctgcgagaag ttgatgaaac tggcattatt gatgcgattg ttgagtatcg taccttatcg 3241 aaattgaaat caacttatgt agattctctg ccaaaattgg tgcgctcaga tactcacaga 3301 gtacacacga gttttaacca aacggtaaca tctactggta gattgtcttc ctctgatccg 3361 aatttacaaa atatccccat tcgcactgct tttagtcgcc agattcgcaa agcatttgtg 3421 ccagaatctg gttggttgat ggttgctgct gattactcac aaattgagtt acggattttg 3481 gctcatttga gtcaagaacc tgtcttagtt gaagcttatc aacaaaatga ggacattcac 3541 actgtaacag cgcggctcgt ttttgaaaaa gaaaatgtca catcagaaga acgacggtta 3601 gcaaaaacca tcaactttgg tgtgatttat ggaatgggtt ctctaagatt ttcgcgctca 3661 actggtgtgg ataaagccaa tgcgaacgag tttattaaac gatttaacag tcgctatccc 3721 aaagtgtttg aatatttaga gaaagtgaaa aagcaggcta tatcccaagg ttatgtcgaa 3781 acaattttgg ggcgtcgtcg ttatttcgat tttaccagca acaacatacg caaatacaaa 3841 ggcaacaaac cagaagatat tgacttgaaa aaacttggta atttaggtgc tgaagatgcg 3901 gggttactgc gtgctgctgc caatgcaccg attcaaggtt ccagcgctga tattattaaa 3961 attgcgatgg taaagctgca tgaagttttg caaagttatc aaacgcgtct gttgttacag 4021 gtacacgacg aattagtgtt tgaagttcca ccagaggaat ggggagaatt acaaccgcgt 4081 attaaagatg aaatggaaaa 
tgcggtgtcg ttgagtgtac cgttggttgt ggatgttcgc 4141 gtgggcgaca attggatgga gacgaagtag gacttgaacc tcaccccccg cccctttccg 4201 atgttttggc tacggtgtac acacaagtca tcgaattacc caaaattcgg aacctcaccc 4261 tgccctgtcg ggcatccctc tccttattaa ggaga
//
LOCUS NODE_6074_length_4282_cov_4.815235 4282 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4282) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4282) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4282 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(243..1091) /locus_tag="DP116_27250" CDS complement(243..1091) /locus_tag="DP116_27250" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27250" /translation="MMINRLAKVISAATVLAMAVSLAPQSASAQTVSSTPQPAPAQQQ PVSVLEELPPGRVSSPVNGFPEDTLEFNINTSAPNSSAPNNAKIFYGAIQNPVYIDEP SPGIDNSTETRKEYKFNPGDLKVSPVEISDELRDVLNQSRDDQGPSSFENKYGNTVVK YESRLEDNSNPKNFVNFAFYAPATEPFTNLNSLSVFNASNLTPFLNSQGQVRFPKYLS PLNGPLLDPQPNTTLLSLTPIPDNVTKVPEPAATASLLGFGIVSTALLRKRNKRLETS LSNGQR" gene 1541..2398 /locus_tag="DP116_27255" CDS 1541..2398 /locus_tag="DP116_27255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017742412.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27255" /translation="MKTDSIFYRLFTTFPNAFFELINLQSNEANAYNFASVELKQTAF RIDGVFLPVADASTRPIYFVEVQFQKDAEFYARLFSEIFLYLRLYAPTKAWRAVVIFP RRSIEPTKVEPYQVLLESQLVTRLYLNELGDRAEQSLGVGIIKLVVENEKQTPALAKN LIARTRTELTNAALVQQVLDLIETIVLYKLPRISRQELVRMFGLGDFDIKTTRFYEEV REEVRQEQALELIMRQLRRRIGNMDQQLQERISQLSIEQLENLAEALLDFSSQADLAT WLQDQSNQV" gene 2459..3595 /gene="gcvT" /locus_tag="DP116_27260" CDS 2459..3595 /gene="gcvT" /locus_tag="DP116_27260" /EC_number="2.1.2.10" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874719.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycine cleavage system aminomethyltransferase GcvT" /protein_id="PRJNA477356:DP116_27260" /translation="MANQEEIILSLARSPLFQLAQELKARLTNFGGWEMPVQFVGITK EHEAVRNAAGMFDISHMGKFTLQGKNLISQLQFLVPSDLSRLQPGQAQYTVLLNPNGG IIDDIIFYYQGENATGEQRGVMIVNAATTDKDKAWLLQHLDQNEVIFQDLSPKKVLIA VQGPKAVSILQTFVQEDLTSVKAFGHLEGTVLGKPAFLARTGYTGEDGFEVMLDLDVG VELWRSLLNAGVIPCGLGARDTLRLEAAMALYGQDIDDTTTPLEAGLGWLVHLDTKGD FIGRSVLEQQKAVGVQRRLVGLQMPGRNIARHGYQVLSTGVVVGEVTSGTLSPTLGYP VALAYVPTELATVNQQLEVEIRGKAYPAVVVKRPFYRSKSRVSH" gene 3670..4056 /gene="gcvH" /locus_tag="DP116_27265" CDS 3670..4056 /gene="gcvH" /locus_tag="DP116_27265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874718.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycine cleavage system protein GcvH" /protein_id="PRJNA477356:DP116_27265" /translation="MPFEYPEDLRYLDTHEYVRLENEIATIGITDFAVDQLGDIVFLD LPEVGDAVSKEETFGTIESVKAVEELNSPVTGTVIERNEVLLESPEQLADDPYGEGWL LKVRVNKSGEIDDALTANEYSAQVEG" BASE COUNT 1254 a 860 c 971 g 1197 t ORIGIN 1 gttgaacggc agtgaaaccc aacatgaaca agaatcaagt gttgggtttc gttcctcaaa 61 cgccagacgc ctacggaggg tttccctcct ggagcttacg ctccccaacc tacagatatt 121 ataagtgttt aactttaaga actctttccc ccctttttaa gggggtaggg gggatcgata 181 agtgcttaaa attacagcaa atcacttttc aaacaacttc ttagctttct tggacatctt 241 cactatcttt gaccgttaga tagacttgtt tccagacgtt tattacgttt gcgcaataat 301 gctgtactca caattccaaa acccagtaaa cttgctgtag cagcaggttc tggaaccttg 361 gttacattat caggaatagg agtcaaggac aaaagagtag tattaggctg aggatcaagc 421 agaggcccat tcaaagggct aaggtatttt ggaaatctaa cttgcccttg agaattcaaa 481 aatggagtta aatttgatgc attgaaaaca gatagagaat ttagattagt gaatggctca 541 gtggcaggag cataaaatgc aaagttaacg aagttttttg gatttgagtt atcttctaac 601 ctagattcat atttgactac agtattgcca tatttattct cgaagctact ggggccttga 661 tcgtctctgc tttgattgag tacatcccgc aattcgtctg aaatctctac tggagaaact 721 ttcaaatctc ctggattaaa tttatactct 
ttacgagttt cggttgagtt atcaattcca 781 ggggaaggtt catcaatata cactgggttt tgaattgctc catagaatat ttttgcgttg 841 ttaggagcag agctgttagg agcagaggtg ttaatattaa attccagtgt atcttcaggg 901 aaaccattta caggagaaga gactctgcct gggggaagtt cttctagaac agagacaggt 961 tgttgttgag caggtgcagg ttgtggggtt gaagaaactg tttgagcaga tgcagattgt 1021 ggggctaaag aaactgccat cgctaataca gtagccgctg atataacttt tgccaatcta 1081 ttaatcatca ttcaataacc tctcaaattg ctttacgatg tgtgcaatta atgagatagt 1141 tgttaaagta tgcacccgca ttctttaaaa ctaaccctgt ggaatggaaa ctctaagtta 1201 ataacatctt tcggaaagaa tttcctgagc aaaattaaag catagtcaac agtgtttctg 1261 ctagatgtta tgtcaattat tcatgaaaaa gacatactat cttgatattg cctttaccac 1321 agtgtcctcc gtggggactc cgagttcccc tctccttaat aaggagaggg gtgcccgata 1381 gcgtagcgtg ccgttaggca tagggcgggg tgaggtgaga cctgtgaatg caacgcgcgt 1441 atcgcctcat gaacggtact ccgcaccatc gcaaagcacg catttagttg gtattgaaaa 1501 caggcggcat aaaatacaaa gaagacttaa ttaagcccaa atgaaaactg actccatttt 1561 ttaccgccta ttcacaacct ttccgaatgc tttctttgaa ttaattaacc tccaatctaa 1621 tgaggcaaat gcctacaatt tcgcttccgt agaattaaaa caaacagcat ttcgtattga 1681 tggagtcttt cttcccgttg ctgatgctag tactcgcccg atttattttg ttgaggttca 1741 gtttcaaaaa gatgccgaat tttacgcccg tttattttca gaaatatttc tgtatttgcg 1801 actttacgca ccaactaagg cttggagagc agtggtaatc tttcctcgcc gtagcatcga 1861 accaacaaaa gttgaaccct atcaagtctt actcgaaagt caacttgtca cgcgattata 1921 tctgaatgag cttggggata gggctgaaca atctttggga gttggtatca ttaaattagt 1981 agttgaaaat gaaaaacaaa ctcccgcatt agccaaaaac ttgattgcca ggacacgcac 2041 agagctaact aatgcagcgc ttgtgcagca agtgctagat ttgatagaaa caattgtact 2101 atacaaatta ccgcgcatca gtcgtcagga gttggtgagg atgtttggat tgggtgattt 2161 tgatatcaaa acaaccagat tttatgagga agtccgtgaa gaagtcagac aggaacaagc 2221 tttagagtta attatgcgtc agcttcgacg ccgtattggt aatatggatc agcaattgca 2281 agagcgtatt agtcagttat ctatcgaaca actggagaat cttgccgaag cactattgga 2341 tttttcaagt caagcagatt tagcgacttg gttgcaagat caatcaaatc aggtttaagt 2401 acacaataga taccaaaagt gaaatactac agacaaatgt 
atctaaacaa gtagacctgt 2461 ggctaatcaa gaagaaatta tcctgtcctt agcgcgatcg cctctatttc aacttgcaca 2521 agaactcaaa gcacgactca ccaattttgg tggttgggaa atgcccgtgc aatttgtggg 2581 tatcaccaag gaacacgaag ctgtcagaaa tgcggcagga atgttcgata tttcccacat 2641 gggtaaattt actctgcaag ggaagaacct gatttcccaa ctccagtttt tagttccttc 2701 agatttaagc cgcttgcaac ccggtcaagc tcaatacact gtcttattaa atcccaatgg 2761 tggtatcatt gacgacatca tcttttacta ccaaggcgag aacgccactg gtgaacaacg 2821 aggagtaatg attgtcaatg cggctaccac tgataaagat aaagcatggc tcttgcaaca 2881 tcttgaccaa aatgaggtga tattccaaga cctttcaccg aaaaaagtct taattgccgt 2941 gcaaggacca aaagccgtaa gcattcttca gacttttgtg caagaagatt taacatctgt 3001 taaagcattt gggcacttgg agggaacagt actaggtaaa ccagcatttc ttgcccgcac 3061 aggttacacc ggggaagatg gctttgaggt gatgttagat ttagatgtgg gggtagaatt 3121 gtggagaagt ctgctcaatg ctggcgtcat tccctgcgga ctcggtgcga gagacaccct 3181 caggctagaa gcagcaatgg cactttatgg gcaggatatt gacgacacga ctaccccttt 3241 agaagctggt ttaggctggc tagttcatct tgatacaaaa ggagacttta tcggtcgctc 3301 agttttggaa cagcaaaaag ctgttggagt tcaacgccga ctagtaggtt tgcaaatgcc 3361 agggcgcaac atcgcccgtc atggctacca agtgctatca acaggtgtcg tcgttggaga 3421 agtgactagt ggtacactat caccaacact tggttatcct gtagctttag cctatgtccc 3481 cactgaacta gcgactgtta atcagcagct agaagtggaa atccgtggca aagcttaccc 3541 agcggttgtg gtaaaacgtc cgttttatcg atccaaaagc cgtgtcagtc actaagtgct 3601 gagtaatggg tactatagct taaatcgcta tcgtagtggt aatttttttg atgtggacga 3661 ggaagtggta tgccttttga atatcctgaa gatttgagat acctagatac tcatgaatac 3721 gtgcgtcttg aaaacgaaat tgccaccatt ggcattactg actttgccgt agatcaattg 3781 ggtgacattg tgtttttgga tcttccagaa gttggtgacg ctgtctccaa ggaagaaacc 3841 tttggcacaa ttgaatccgt gaaagctgtg gaagaactta attccccagt tactggtaca 3901 gttatagaac gcaatgaagt tttgctagaa tcaccagaac aattggcaga cgatccttat 3961 ggcgaagggt ggttgctgaa agtacgcgtc aacaagtctg gtgaaattga tgatgccttg 4021 actgcgaatg aatacagcgc tcaggttgaa ggatagaaca gaacaagggg ggagtaaggg 4081 agtgagggaa ataaggcaga ggtagcaatt ctcgcttctt ctgcttggct 
cccaagtaga 4141 cgaatccaac gtagggtggg cactgcctat cttcctatat tattggagtg ccgtgagcgg 4201 gcagtgccca ccctacaaat taaaattttc tacttatcat tcctgaagaa ccccaccccg 4261 ccaaagctgc gcttaggctc cc
//
LOCUS NODE_6091_length_4264_cov_5.252316 4264 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4264) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4264) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4264 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 285..956 /locus_tag="DP116_27270" CDS 285..956 /locus_tag="DP116_27270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016870255.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbonic anhydrase" /protein_id="PRJNA477356:DP116_27270" /translation="MSIKHLISGLNEFHDNYFVSHRELFQELSHGQTPEVLLITCSDS RIDPNLITQTQPGELFVIRNIGNIIPPYGPLNNGEGAGVEYAVQALNIKDIIICGHSH CGAMKGLLQIGNLAQQMPLVYDWLKHYAEPTRRLVLDNYKHCPTEKLLKIAIEQNVLI QIENLKTYPIICSKLHSGQITLHAWIYEIESGEVFAYDVSKKQFELLNRSFLVPNSLI GVHSE" gene 1074..2357 /locus_tag="DP116_27275" CDS 1074..2357 /locus_tag="DP116_27275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012407071.1" /note="involved in light-induced Na+-dependent proton extrusion; Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="proton extrusion protein PcxA" /protein_id="PRJNA477356:DP116_27275" /translation="MKILFFNKNPGFIGQKIQEYLQAGNKRLLGTPERALSEAYQAAH RIKNIEREHFNGQKITPHLSNYTESVRAYWQVSLNKNLMIIKAKLAEFRISCSLLDIS SSAFLEKLSFIDEVTLKYNLEQEINKNGITPVSQPLQMNRDEVNNQSDSSGVDNIKVD PVFKKTGVFPRSIGRTLNRIKADFLPQAEEQFVRDFQIFKNRTRTAAKFLAMLVIVPI LTQSLSKEFLVSPIVERVRGENVSQLFLNSEMEEEALTELKTFEQNLKFQSLISQAPP LSPEVIEEKVKQKVHTIAEEFREKGNNAISNVFADLLSLIAFALVIVTSKREIVIVKS FIDDVIYGLSDSAKAFLIILFTDIFVGFHSPHGWEVILEGFAEHLGLPAEKSAISLFI ATFPVILDTIFKYWIFRYLSRLSPSALATMKEMNE" gene 2722..3321 /locus_tag="DP116_27280" CDS 2722..3321 /locus_tag="DP116_27280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131062.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DedA family protein" /protein_id="PRJNA477356:DP116_27280" /translation="MHFDLVQLIKSLGYFGVWAIVFAESGLLIGFFLPGDSLLFTAGF VASQGLLNIWVLIIGAFICAVLGDNVGYMTGHKFGRKLFQKEDSWLFHKKHLIKTTNF YQIHGKKTLVLARFVPIVRTFAPIVAGIGAMHYRTFMSYNLVGGFLWTFGITLLGFFL GKSLPAEQLDKYLLPIIGLIIVVSLLPSILHIIKENKKN" gene 3391..3921 /locus_tag="DP116_27285" CDS 3391..3921 /locus_tag="DP116_27285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016861035.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27285" /translation="MNLYLFIENFVFLIISTFFCVVVGWAWRNSKPFSLPEPLPGWFK IWFGTVQVIGLIVPLVVILVWGVWFGDNRILTVLVPYLVMLGLQILAEILTLRQFHTV VWVMVPYLYLPYRFWQLYEGWKFLSAETDLTWVRNLLVVQVVLWVVNYALDVTQLPRL LRWEVKEESDTSALVP" gene complement(4028..4249) /locus_tag="DP116_27290" CDS complement(4028..4249) /locus_tag="DP116_27290" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27290" /translation="MKEGGNAIEGLIVSVPWFREAPQSKNFAQKAAQQWGGKISWRTA TSYDATQALIQALSSKARSRNSFAKIAKG" BASE COUNT 1266 a 775 c 853 g 1370 t ORIGIN 1 caaagtgcca cgctgcgcga acagtactgg ctcccctaca cccctttctt ggtcaataag 61 ttggaaatca tgaccaattc actaaacttt tgaatgggaa tagagaatcc cacccatagg 121 ttttagactt ttgatgacaa gactgcttga ctcttttgtt gcgggcttta aaaaagtcta 181 tcatctttaa aaaaatcaaa aaaatgacag tttttttccg aaaaataggt taaaaaatat 241 gaagtttcaa aataaagatt tctttccata ggaggaacta gattgtgtct ataaaacact 301 taatcagtgg tttaaatgag tttcatgaca actattttgt cagccataga gagcttttcc 361 aggaattatc tcatggtcaa actcctgagg ttttacttat cacttgttct gactctcgta 421 ttgaccctaa tttaatcact caaacccagc caggggagtt atttgttatt cgtaatattg 481 gtaatattat tccaccttat ggaccactta ataatggtga aggtgcaggt gtagaatatg 541 ctgttcaagc attgaatatt aaagatatta ttatttgcgg tcattcccac tgtggagcga 601 tgaaaggact attacaaata ggtaatcttg ctcagcaaat gccactagta tacgattggt 661 tgaagcacta tgccgagcct acccgtcgtc ttgtcctaga caactacaag cattgtccca 721 ccgagaaact attaaaaatt gcaattgaac agaacgtctt aatacagata gaaaatctga 781 aaacttaccc gataatttgc tcgaaacttc acagtggtca aataactctt catgcttgga 841 tttatgaaat tgagagtgga gaagtctttg cttatgatgt tagcaaaaaa caatttgaac 901 ttctcaatcg ctcattcctt gtaccaaact ctcttatagg tgtacactca gaataagata 961 cccatctaat ggctaataag ttataagtat caattatagt ttattagcca ttattgattt 1021 ttcaattaaa gaaaacaaaa atagttaaaa atattacaaa ttaatagtgt aacatgaaaa 1081 ttttgttttt caacaaaaat ccaggattca tcggacaaaa aattcaagaa tacttacagg 1141 ctggtaacaa acggttactg ggtacaccag aaagagcact atcagaagct taccaggctg 1201 ctcatagaat taaaaatatt gaaagagaac attttaatgg tcaaaaaatt acgccacact 1261 taagcaatta cactgaaagt gtcagggctt attggcaggt ttctctcaac aagaacttaa 1321 tgattatcaa ggcaaaatta gcagagtttc gtatcagctg ttctctcctc gatatatcaa 1381 gttctgcttt tttagaaaaa ttaagtttta ttgatgaggt taccctcaaa tataatttag 1441 aacaagaaat caataagaac gggataacac cagtctctca accattacaa atgaaccgtg 1501 
acgaagttaa caaccagtca gattcatctg gtgtagacaa tataaaagtt gaccctgttt 1561 ttaagaaaac aggtgtattt cctaggtcaa ttggaagaac actcaataga attaaagcag 1621 attttttgcc acaagcagaa gaacaatttg ttagagactt ccaaattttt aaaaacagaa 1681 caagaactgc tgcaaaattt ttggcaatgc tagttattgt gccaatttta actcaatctc 1741 tttctaaaga attcttagta agtccaattg tagagcgggt cagaggtgaa aatgtcagtc 1801 aactgttcct taactctgaa atggaagaag aagctctcac agagttaaaa acttttgaac 1861 aaaatctaaa gtttcagagc ctgattagtc aagcaccacc actttctcca gaagtcatag 1921 aggaaaaagt aaaacagaaa gtccatacaa ttgctgagga atttcgggag aaaggcaata 1981 acgctattag caatgttttt gctgatttgc tatcacttat agcttttgca ttggtaattg 2041 ttacaagcaa aagagaaatt gtgattgtga aatcttttat cgatgatgtt atctatggtt 2101 tgagtgacag tgctaaggca ttcttgatta ttctattcac agatatattt gtcggattcc 2161 actcaccaca tggttgggag gtgattcttg aaggatttgc ggaacatttg gggcttccag 2221 cagagaaaag cgcaatatct ctttttattg ccactttccc tgtgattttg gacactattt 2281 ttaaatactg gatattccgc tatctgagtc ggttgtctcc ttcagctttg gcgacaatga 2341 aagaaatgaa cgagtaataa gctgtgaccg atatggcgtt gcacacttac tcctaagaaa 2401 cacgcttaac agggaacagg gaacagggaa cagggaacag ggaacaggga tggtgtttct 2461 taaggctgtt gctagcgggt agtcgctaag gtcttggggg tttcggcttg agttaatgta 2521 ctactgagat tcggggcgta atcctgatcg actgcgtcac tttcctattt tctctagtga 2581 cgctgctact cgtagtttcc cccaaccctg tgatttgggt gtatgcaatg tccaagggct 2641 gcaaatgcgt gacagccaaa aaaaattctg atattcctct gaattttgtg ttaaatagat 2701 tattgattta gaggatattc tatgcatttt gatttagtgc aactcataaa atctttaggc 2761 tactttggag tatgggcaat tgtctttgct gagtctggct tactcattgg cttttttcta 2821 ccaggtgata gtttgctgtt cactgctgga tttgtagcat ctcagggatt actcaacatc 2881 tgggttctga tcattggcgc ttttatttgt gcagtccttg gtgataacgt tggctatatg 2941 actggacata agtttggtcg caaattattt caaaaagaag attcatggtt atttcataaa 3001 aaacatttga taaaaacaac aaatttttat caaatacatg gtaagaaaac acttgtctta 3061 gctcgatttg tgccaatcgt acggactttt gcgccgattg tcgctggcat tggtgccatg 3121 cattatcgca catttatgtc ctataacttg gttggcgggt ttctttggac attcggcatc 3181 actctgttag 
gatttttctt aggaaaatcc ctaccagctg aacagctaga taagtatttg 3241 ttacccatta ttggattaat tattgttgtt tctttattac catcaattct tcatattatt 3301 aaagaaaaca agaagaactg acaaagaagg taatggtggt tagttgagtt acttggtcaa 3361 gaggatatca gggtcatgac ttcttcaaac ataaacttat acctgtttat cgaaaacttt 3421 gttttcctga taatttctac atttttttgt gttgttgtag gctgggcgtg gagaaattcc 3481 aaacccttca gccttcccga accactccct gggtggttta aaatttggtt cggtacggtg 3541 caagttatag gattgattgt tccccttgtg gtgatacttg tatggggtgt gtggttcggt 3601 gacaatcgca tcctcactgt gcttgttccc tacttagtca tgctaggatt gcaaatctta 3661 gctgagattt tgacactcag gcagtttcac acagttgtat gggtgatggt tccctactta 3721 tacttgcctt atcgcttttg gcagctttat gaggggtgga agttcctcag tgctgaaact 3781 gatctcactt gggtacgaaa tctgttggtg gtacaagtcg tcctatgggt tgtgaactat 3841 gccttggatg tgacacagct accaaggctt ctacgttggg aagtgaaaga agagtccgac 3901 acgagtgctt tagttccgta aaatcttaaa ctgaccattc ttaatcttaa ccaagatggg 3961 ttttatctgt atctctcctt gtggagtaaa ttggattgta ttcccggagg tttcttgggg 4021 tgaaaggtta acctttcgca atctttgcaa aactgttgcg cgatcgcgct ttggaggata 4081 aagcttgaat caaagcttga gtggcgtcat aactggtggc tgtgcgccag ctaatctttc 4141 ctccccattg ctgtgctgct ttttgggcaa agttttttga ttgtggtgct tcccgaaacc 4201 aaggcacact tacaatcaag ccttcaattg catttccacc ttctttcaag gttcttctct 4261 attc // LOCUS NODE_6111_length_4252_cov_2.862283 4252 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4252) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4252) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4252 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 1..1071 /locus_tag="DP116_27295" /pseudo CDS 1..1071 /locus_tag="DP116_27295" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876301.1" /note="frameshifted; 
incomplete; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=2 /transl_table=11 /product="hypothetical protein" gene 1085..1318 /locus_tag="DP116_27300" /pseudo CDS 1085..1318 /locus_tag="DP116_27300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013334229.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(1301..1501) /locus_tag="DP116_27305" CDS complement(1301..1501) /locus_tag="DP116_27305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015216371.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27305" /translation="MHSTYCSHTELPVISYQLSVIVVSFWSLFTGHCSLVTVHWSLFT GHWSLVTVHWSLVTGHCSLFEW" gene 1496..>4252 /locus_tag="DP116_27310" CDS 1496..>4252 /locus_tag="DP116_27310" /inference="COORDINATES: protein motif:HMM:TIGR00229" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876297.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_27310" /translation="MQRPQRETVTLNDILITEDLSRRPPRSPNLLSENQALHTLARQL VQPEMMLQSIVDIALELCSAGSAGVSLLEVLPSGEEIFRWNVLAGTLKQYVGGSTPRN FSPCGVCLERGAPVLFSHPERYFTYFQEAKTPVVEGLVLPLQADNHALGTIWIMSHYE QRHFDAEDVRVMTSLADFTAAALLLKQRQTRELLAANAALEAEVVERKLAEERLRALI ENLPVGAAFVVDRDLRYLLAEGQALSAAGFKSEDFVGKTIFEVLPPELTTYYEGLYRQ GLAGESFEHEHNAHDRSFISCGTPLRSADGEVYAVLAVSYDITDRKQTEDAIAADLKD TQLLHDLSTRLVTEGNIQALYQEIMAAAIALTRADAGTVQILDEATQELVLLATQGFE RNLTDHFYRVNASSNTSCGIALFTGNRTFVDFDVPQSEDLDGSGQMHVEAGYFSAQSS PLIARSGKPIGMVSTHWRKHHRPSDRELRFLDLLARQAADLIEQRQAQVALRESERRL CATQNNAGIGIHELDEMGRYLRMNETFTRLSGYTLADFANRTIFDGIPSEDRDKAREH YSRLVRGEIDSYVDERTYVTRDGCRAWVEVLTTAVRDDEGRFLYAVRAIHDVTQRRNA EEALRESEAKYRTLFNSIDAGFCLIEVLFDESGKASDYRFLEANPAFEKQTGLVDVIG KTVRELVPSHEAHWFEIYDRIALTGVPERFENAAQELGRFYDVYAFRMGEPQERKVAV LFNDISERKRRETNLAFLAEVSQDLAHLTNIEATMNALGAKIGRHLNLGRCIFCEVIE AEDKVIMSYDWHCPELPSAVGEVPISQFVSEEFRRASRAGTTIVVADVHKSSLVEAWQ VEDFFDIAGLVCVPLLRDGQWRFALVVHNVAPRDWRDDEIELLREITTRIWTRLERAR AEESL" BASE COUNT 1065 a 1071 c 1160 g 956 t ORIGIN 1 ggagaaaaaa gaggatgggg ggatgggggg gtcgcttcgc tcgggagcag aggggaatcg 61 tgattcaaga tcctcccctc tgcccctccg cacctctgcc cctctgcctt caaaagcccg 121 tattcttttg gtggatgaca acgcggatat gcgcgcctac ctgaagcggc tattgagtga 181 acgatggcag gtggagacag cccctaatgg cgtgatcgcc ctcacccaaa tccagcaaca 241 cccacccgac ttggtgctaa ctgatgtcat gatgccagag atggatgggt tgcaattgct 301 ggctgcacta cgggctgatc ctcaaacaaa aagtattcca atcatcttac tgtcagcacg 361 agcaacagaa gaggcaacgc tcgaaggttt gacaacagga gcggatgatt acttgatcaa 421 acccttctct gctcgtgaac tgatggcaag agttgaaact catctccaat tggcatggct 481 gcgctttgag cgatccgcga accgcttcaa gaatgagttt ctcatgaccg tcacccatga 541 attgcaagct cccttagtaa cgattctcgg atgggcacgc ttgttacaaa ccaaatccat 601 aaatttggaa acaatggcac gagcacttgc aaccatcgag cgcaatgcga cgattgaagc 661 aaaactaatt aaagatttgc tggatgtttc gagcattctc tctggcaagt ttcaattgaa 721 gcctcaactc gtcgatttgg tttcgctggt acaaaacgtg attgaaacat 
ttcgtaaagc 781 ggcccaggca aagagcattc aactggtcga gacgatatcg aacgttacac aaattgatat 841 tttagcagac ggcgatcgcc tcaaacaaat catcaccaac ttgctggaca atgccatcaa 901 atttacgcct aaggggggaa gcgtagaaat tcaggtgact actgatgccc aagattcaca 961 gccccctaaa tccacgccag ttcctcctcg acgggaaccg ccaagatcgg actggctccc 1021 caattccccc caaactatag cctttgcggg cattccagaa aggggggtta ggggggcaac 1081 atttgctcaa attaccgtaa ccgatacagg actcggcatt cctccagaat ttcttcccta 1141 tgtctttgat cgctttactc aagcagaagt gccgagccgt cattcgccgg gaggggtggg 1201 cattgggcta gcgatcgccc gtctgttggt ggaattacat tatggtacga ttgaggtagc 1261 aagcgccgga gttggaagag gcgcaacctt tacagttcga ttaccattca aacagtgaac 1321 agtgaccagt gaccagtgac cagtgaacag tgaccagtga ccagtgacca gtgaacagtg 1381 accagtgaac agtgaccagt gaacagtgac cagtgaacag tgaccaaaaa ctgacaacga 1441 taactgataa ctggtaactg ataactggta actcagtgtg cgagcaataa gtgctatgca 1501 acgacctcaa agagaaaccg ttaccttaaa cgacatttta attactgagg acctatcacg 1561 gcgaccgcct cgttcgccga atttgctctc ggaaaatcag gcgttgcata cattagcacg 1621 gcagttggtg cagccggaaa tgatgcttca aagtatagtt gacattgcgc tggagttatg 1681 cagtgccgga agcgccggag tcagtttact cgaagtgcta cctagtgggg aagaaatctt 1741 tcgctggaat gtgttagcag gaacattaaa acagtacgta ggcggcagta caccccgcaa 1801 ctttagcccc tgtggagttt gcttggagcg gggtgcccct gtgctttttt cccatcctga 1861 acggtatttc acctactttc aggaagcgaa aaccccagtt gtcgaaggct tggtgttacc 1921 tcttcaggcg gataaccatg ctctcggcac catttggatt atgtcgcatt atgagcagcg 1981 gcactttgac gcggaagatg tgcgggtgat gacgagtctg gcagacttta cagccgcagc 2041 actcctgctg aagcagcgac aaactagaga attgctagct gccaacgctg ctttagaagc 2101 ggaagttgtg gagcgcaagc tggcggaaga acgtttgcgc gctttgatcg agaatctacc 2161 ggttggggct gcgtttgtcg tcgatcgcga tctgcgctac ttgcttgctg aaggccaagc 2221 gttgtccgct gctggattca aaagcgaaga ttttgtcgga aagacgattt tcgaggtgct 2281 gccgcccgaa ttaacgacgt attacgaggg attgtaccgt caggggcttg ctggcgagtc 2341 gtttgagcat gagcacaacg cacatgatcg ctcatttatc tcatgtggaa cgccgctgcg 2401 ttctgccgat ggcgaagttt atgcagtgct tgccgtttct tatgacatca ctgatcgcaa 2461 
acaaaccgaa gacgccatcg cagcagacct taaagatacg caactgttgc acgacctgag 2521 tacacggctc gtcaccgaag gcaacattca ggcgctttat caagagatta tggcggcggc 2581 gatcgccctc acgcgagcag atgccggaac ggtgcaaatt ttggatgaag caacgcaaga 2641 gttggtgctg ctcgctaccc aggggtttga gcgaaatctg accgatcatt tctatcgcgt 2701 taacgccagt tcaaacacgt cttgcggtat tgccctcttc acaggcaatc gcacgttcgt 2761 tgatttcgat gtgccccaga gcgaagacct tgatggatct gggcaaatgc acgttgaagc 2821 tgggtatttt tcggcacagt ccagcccgtt gattgcccgt tcgggcaaac ccatcggcat 2881 ggtttcaacc cactggcgca aacaccatcg accgagcgat cgcgaactgc ggtttctcga 2941 tttacttgcc cgtcaagccg ccgacttgat tgagcagcga caagcccaag tcgccctgcg 3001 cgagagcgag cggcggcttt gcgccaccca gaacaacgcc ggcatcggca ttcacgaact 3061 cgacgagatg gggcgttatc tgcggatgaa cgagacgttc acccgattga gcggctacac 3121 gctcgcggat ttcgccaacc gcacgatctt cgatggcatc cccagcgagg acagggataa 3181 ggcgcgcgaa cattacagtc ggctggtgcg cggcgaaatc gacagctatg tcgatgagcg 3241 aacctatgtc acgagggatg gatgccgcgc gtgggttgag gtgctgacga ctgcggtgcg 3301 ggacgacgaa gggcgatttc tctatgcggt gcgtgcgatt cacgacgtga cacagcgtag 3361 gaatgcagaa gaagcgttgc gcgaatcgga agcaaaatac cgcacgctgt tcaactcaat 3421 agacgcaggc ttttgcctga tcgaagtttt gtttgatgaa agtggaaaag catctgatta 3481 tcgctttctt gaagccaatc cagcctttga gaaacagacc ggactggtgg acgtgattgg 3541 caaaacagtg cgcgaactag tgccttctca cgaggcacac tggtttgaaa tttacgacag 3601 aatcgcgcta acaggggtac cagagcggtt tgaaaacgct gcccaggaac ttgggcggtt 3661 ttatgatgtt tacgccttcc ggatgggtga accacaagag cgcaaggtgg cggttctctt 3721 caacgacatc agcgaacgca aacgccgcga aacaaatctc gcctttttag ccgaagtcag 3781 ccaagacctc gcgcatttga cgaacattga agcgacgatg aatgcgctcg gcgcgaaaat 3841 cggcaggcat ctgaatctcg gacgctgcat tttttgcgaa gtcatcgaag ccgaagacaa 3901 ggtgatcatg agctacgatt ggcattgccc ggaattgcct agcgctgtcg gtgaagtccc 3961 gatctcgcag ttcgtcagcg aagaatttcg gcgagcgagc cgcgccggaa caaccatcgt 4021 cgtcgctgat gttcacaaaa gttcgctggt tgaagcatgg caggttgaag atttttttga 4081 cattgctggt ttggtctgcg tcccgcttct tagagacggg caatggcgat ttgcgttggt 4141 ggttcacaat 
gtcgcgccgc gcgattggcg agacgacgaa attgaattgc tgcgcgaaat 4201 cacgacgcgc atctggacgc ggcttgaacg cgcccgcgcc gaagagtcat tg // LOCUS NODE_6119_length_4243_cov_4.297755 4243 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4243) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4243) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4243 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(229..3531) /locus_tag="DP116_27315" CDS complement(229..3531) /locus_tag="DP116_27315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874650.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit X" /protein_id="PRJNA477356:DP116_27315" /translation="MGVKASGGSSVARPQLYQTLAVSTISQAEQQDRFLGVGELSELA RYFASGVKRLEIAQTLTENSEIIVSRAANRIFVGGSPMSFLEKPREPEVVMAGAPASV RDTMRLGTVTYVESRGGFLENLRSVFNSSPSGPTPPGFRPINVARYGPANMAKSLRDL SWFLRYATYAIVAGDPSIIAVNTRGLREIIENACSGEATIVALQEIKLAALSYFRQDA EAKDIVSQYMDVLITEFKAPSPSNKLRQRPSSDQQGLELPQIYFNAAERRPKFVMKPG LSALEKNEVVKAAYRQIFERDITRAYSQSISYLESQVKNGTISMKEFVRRLGKSPLYQ KQFYQPFINSRALELAFRHFLGRGPSSREEVQKYFDIVSRGGLSALIDALVDSQEYSD YFGEETVPYIRGLGQEAQECRNWGPQQDLFNYSAPFRKVPQFITTFAAYNQPLPDQHP YGSGNDPLEIQFGAIFPKETRNPSTRPAPFGKDTKRILIHSGPGINNQNSNPGARGEN PGTLGPKVFKLDQLPGTRGKKTPTGLSVKYSESSTQAVIRAAYLQVFGRDVYDGQRLK VQEIKLENGDISVREFIRALAKSDLFRKMYWTSLYVMKAIEYIHRRLLGRPTYGRQEN NKYFDIASKKGFYAVVDAIIGSVEYSEAFGEDTVPYERYLTPGGVASRKLRVGSIRED ITPKIEKEETPLFVELGSVKGVRTEPEIQFRINQGVTKKREQTKVFKLVAGTNDKVAV GIVIGAAYRQIFERNIEPYILKNEFTVLQSKLGNGEITVKEFIEGLGCSSLYQKEFYT PYPNTKVIELGTKHFLGRAPLDQAEIRKYNQILATQGIRAFIRAMVNSPEYLEAFGED TVPYNRFPTLPAANFPNTQKLYNQLTKQNQDIVVPSFETAQPRIRTTADTAATSSEID NGKPQPVEVGRSFNSSQGQLVEADVDTTRRNPARIYRMTANANQAEMEQVLNAIYCQV MDVYGDQVPDHFHHPELESRLRNGEISVKEFVSELASSEIYRQRFCVSYPNTKVVEFL FRHLLGRAPATQVEIHHYTNLLADSGIKAAVEEIVNSSEYAQYFGENVVPYQRFPFLS AGNYLGSVKAAD" BASE COUNT 1090 a 995 c 975 g 1183 t ORIGIN 1 ttgtgagggt gtagggggaa aagaagaatg agagaggggg cagggtgaag accagaaggg 61 gagaaaccaa tccacgtgaa ctaatatttc tccgcactac aaccgctccc tactcgtggg 121 cttcttagtt ttttctccta cacccttata cccccactcc cctacacccc ttcttagtca 181 ataatcaagt atccaagagt ttacgctctt gaatattgac tattgcactt aatcagctgc 241 cttgacgcta ccaaggtagt taccagcact taagaatggg aaccgctggt atggcaccac 301 attctcaccg aaatactgag catactcaga gctattgaca atctcctcga cagcagcttt 361 tataccgcta tctgccagta agttggtgta atggtgaatt tccacctgag ttgctggcgc 421 acgtcccaac aggtgacgga agaggaactc aacgactttg gtgttgggat aggaaacgca 481 gaaacgctgg cgatagattt ctgaactcgc aagttcactc acaaattctt tgacagaaat 541 ttcaccattg cgcagtctgc tttccaactc aggatggtgg aagtgatcag gaacttgatc 
601 gccatacaca tccatcactt gacagtaaat agcgtttaga acctgctcca tctcagcttg 661 gtttgcattt gctgtcatgc ggtaaatacg agctgggttg cggcgggttg tatccacatc 721 agcttcaacc aactgtccct gactggagtt aaaagaacga ccgacttcaa caggctgcgg 781 cttaccattg tcaatttcac tggatgtcgc tgctgtgtct gcggttgtcc ttatgcgcgg 841 ttgcgctgtc tcaaagctgg ggacaacaat atcctgattt tgcttagtga gctggttgta 901 gagcttttga gtgttcggga agttcgccgc tggcaaggtt gggaagcggt tgtaaggtac 961 tgtatcttca ccaaacgctt cgagatattc cggactgttg accattgccc ggataaaggc 1021 gcgaataccc tgagttgcaa gtatttgatt gtacttacgt atttccgcct gatcaagtgg 1081 ggcacgtccc aggaagtgct tggttcctag ttcgatgacc ttggtgttgg ggtatggggt 1141 gtagaactct ttctggtata ggctagaaca acctaaacct tcaataaatt ctttaacggt 1201 gatttcacca ttacccagct tgctttgaag cactgtgaat tcgtttttga gaatgtaggg 1261 ttcaatattg cgctcgaaaa tctgacggta agcagcacca ataacgattc ccaccgcaac 1321 tttgtcgttg gtaccagcta ccaacttgaa gactttcgtt tgttcgcgct tcttggtgac 1381 gccttggtta atgcggaact gaatctcggg ttctgtccgc actcccttaa cactaccaag 1441 ttcgacaaat agtggagttt cttccttctc aattttcggg gtgatatctt cgcggatgct 1501 accaacgcgc agtttccgcg acgctacacc accaggagtc agataccgtt cataaggaac 1561 tgtatcttca ccaaatgcct cactgtactc tacgctacca atgatggcgt caacaactgc 1621 gtagaagccc ttcttggagg cgatgtcaaa gtacttgttg ttttcttgac gaccgtaggt 1681 aggacgacct aacaagcgac ggtggatgta ctcaatcgcc ttcatgacat acagcgatgt 1741 ccagtacatc ttccggaata aatctgactt agccaaagcg cggataaact cccgtacgga 1801 gatatcaccg ttttccagct taatttcttg cactttcagc cgctgaccgt cgtaaacatc 1861 gcgaccaaaa acttgcaggt aagccgcccg aatcaccgct tgggttgagc tttccgagta 1921 cttgacactg agaccagttg gtgtcttttt acctcttgta ccaggcagtt gatccaactt 1981 gaacaccttg ggacctaggg taccagggtt ctcgccccgc gctccaggat tgctgttttg 2041 gttattaatt cctggtccag agtgaatcag gatgcgcttg gtatctttac caaaaggagc 2101 aggacgagta ctggggtttc tggtttcttt cgggaaaatt gccccaaact gaatttctaa 2161 ggggtcgtta ccagaaccat aaggatgttg gtctggtagt ggttggttat aagcagcaaa 2221 agttgtgata aactgtggaa ccttgcggaa aggagcactg tagttaaaca ggtcttgctg 2281 tggcccccag 
ttacgacatt cttgtgcttc ttgaccaagt ccccgaatgt aaggaactgt 2341 ttcttcaccg aagtaatcac tgtattcctg agagtctacc agtgcatcga ttaaagctga 2401 cagaccgcct cgggaaacga tgtcaaagta cttttgtact tcttctcggc tacttggtcc 2461 gcgtcccaag aagtgacgga aagcaagctc tagggcacgg ctgttgataa acggctgata 2521 aaattgtttt tggtaaagcg gagatttgcc aaggcgacgg acaaattctt tcatggagat 2581 agtgccgttc ttcacctgag attccaggta agaaatcgac tgactgtaag cacgggtaat 2641 atcgcgctca aaaatttgtc tatatgccgc ttttacaact tcgtttttct ccagtgctga 2701 caacccaggc ttcatgacaa acttgggacg ccgttctgcc gcattgaagt aaatttgagg 2761 cagttctaag ccctgttgat cgcttgaagg acgttgacgc agtttattag aaggagaggg 2821 tgctttgaat tctgtaatca aaacatccat gtactgagac acaatgtcct ttgcctcagc 2881 atcctgacgg aaataggaaa gcgccgcaag tttgatttcc tgcaaagcaa caattgtggc 2941 ttcaccggaa caggcgtttt caatgatttc tctcaaacca cgcgtgttga cagcgatgat 3001 gctggggtca ccagccacaa tcgcgtaggt agcgtagcgc aaaaaccacg acaagtcccg 3061 caggctctta gccatgtttg ctggaccata acgggcgacg ttgattggtc ggaaaccagg 3121 aggagtcgga ccactcggtg aactgttaaa gactgaccgc aaattttcta agaaaccacc 3181 acgactttct acgtaagtga cagttcctag tctcatggta tctctaacag aagcgggtgc 3241 gccagccatc accacttctg gttccctggg cttttctagg aaagacattg gcgaaccacc 3301 aacaaaaatc cggttggcag cacgagagac aataatctcc gaattttccg tgagcgtctg 3361 ggcgatttct agacgcttga caccagatgc aaaatacctt gccagttcac ttaactcgcc 3421 aactcccaga aagcggtctt gttgctctgc ttgggaaatt gttgatactg ctagggtttg 3481 atatagttgc ggacgcgcaa ctgagcttcc accacttgcc ttaacaccca tcggatttgt 3541 aaaactccta tcatattttt tttgtgttaa gtctgggttg cttttaggct tgacacatcg 3601 gtgcagttat gcgttaagac tgctgccctc aagactgcca aaaaaaatta acccagtcag 3661 cttttagatt agaacgtttt ggcatttctc tgtggcttta ttaagaagtg tgtagaaagt 3721 aaagccaaga tgtatcaact ccaaaagcag atgtttaaac ttctgtgaga atatatttaa 3781 gatttgtatc atttcttgat aaaacgagat ttcttctttt gacatcaaat ccatccggtt 3841 gattggttat cattcacagt ttctacctca ccccgcctac ggcacccctc tccgaactcg 3901 cggagagggg ctgggggtga ggttttgacg acgaacacca cacagatgct cacaagtcaa 3961 tatatgtgtc agtatctgtg 
gtttcataca atcacttgaa aaaaattaca agagagttta 4021 ataaaaaatg atgacactgg tttggcgatc gcccttcttt gtccgaacgc ttcaactaat 4081 gcctcgttcc cagtttctga ctgataatac ttattagtac tcacagtctc ccttactcgc 4141 aatcgctaat tttagatttg gagttgatca tgaaacaggg tttgcattta ctgctcggaa 4201 aaacttatct aaacctagtt aaaccagatt tagcgttctc ttc // LOCUS NODE_6253_length_4107_cov_5.197433 4107 bp DNA linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4107) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4107) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4107 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..2897 /locus_tag="DP116_27320" CDS <1..2897 /locus_tag="DP116_27320" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496127.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27320" /translation="AAFSDALTVYTQQAFPQEWATTQNCLGVAYVDRIRGEKAQNIEL AITAFSAALSVYTRDAFPQDWAMTQGNLGNAYRQRILGEKAQNIELAITACDDALTVY TQHDFPQDWAGTKNNLGNAYCARILGEKAENIELAIAACDDALTVITQQAFPQDWAKT QSNLATAYGERIRGDKAKNIELAIAACDDALSVYTQQDFPQDWARMQNNLGAAYVYKI RGDKAENIELAINACSAALSVRTQQAFPLDWARTQNNLGAAYVYRIKGDKAENIELAI AAFSAGLSVYTQQAFPQDWARMQACLGVAYCQRIRGDKAENIELAIAACDDALSVYTQ QAFPQDWAMTQNNLGLAYVERITGEKAQNIELAIAACSAALTVRTREAFPQNYAETLF NLGMAYQNANQLTSAYNTFKSAIDTVESLREEILSGEETKRKQAEEWNQLYSRMVEVC LKLVKITEAIEYVERSKTRNLVELILNRDLKTIFLPEVVTRLETYRDEITTGQYQIQN GKAENSKVLAQRLQELRQQRNELQNCYLRVGYDFKFDSFQATLDERTAIIEWYILIDK ILAFIVIPKGEVTVWQSQPEDQEAFRNWVNQYLQNYDNQKKQWQNSLGEELKKLASIL HIDEILTKIPNHCDQLILIPHRFLHLFPLHALPITSQNSKFKIQNYEDLPCLVDLFPR GVGYAPSCQLLQQVQKRERLDFQSLFAIQNPTEDLSFTDLEVESILSYFPGHQLLPKK QATKAALSEAATQLKEVNYLHFSCHGFFNLNSPQNSCLVLADAYVSPIPANASRERYL KVPDDKTIDLSKCLTLGNLFELNERGELIFDFSQSRLVVLSACETGLIDFQNTSDEYI GLPSGFLYAGSSSVVSSLWTVNDLSTSFLMIKFIQILKSATDMSVPLAMNQAQRWLRD ATKEELQEWMIQLPLDSTKKGQIRRQINKMTGEKPFDSPDHWAAFTAVGK" gene 2908..3246 /locus_tag="DP116_27325" CDS 2908..3246 /locus_tag="DP116_27325" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27325" /translation="MSDYEIIVNNFVHVLTTQSSLFSKEDRDELIQLIEEQPDEIQSL SNAISDWCSEHPEVDEALAEIEELTERAPGEKRPTNIPKYELDKKNIINAIQQSSSSA KKVDKPTPNN" gene 3326..>4107 /locus_tag="DP116_27330" CDS 3326..>4107 /locus_tag="DP116_27330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208325.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27330" /translation="MDYLSTVKVAYPKLTLYAFHLKHSLAQKPKTPVENANHFWQKCQ QLGKQLGVPRLETLPALIEKENNKKTSITGEILPERILTFTAIKHNKNLHLSGEANPL EIHDTYALDLTLRYPHPEVQLTDLRGLNPDDCLLPKNINASLGQTLVFFAQPVERIDD EQAFADACVAALLSEETVRKVNIYCQHQGQLLGSPIFEYNNDADSPEEQCHLLIWLNT HSETTKLEEKGEYYYPFIDLLLCRSKIIYARSEAIWCYEQARS" BASE COUNT 1335 a 848 c 804 g 1120 t ORIGIN 1 tcgctgcttt ttctgatgct ttgactgtct acacccagca agcctttccc caagaatggg 61 caacaacgca aaattgtctc ggtgttgcct acgttgatag aatcagagga gagaaagcac 121 aaaatataga attggcgatc actgcttttt ctgctgcttt aagtgtctac actcgcgacg 181 cttttcccca agattgggca atgacgcaag gtaatctcgg taatgcttac cgtcaaagaa 241 ttctaggaga gaaagcacaa aatatagaat tggcgatcac tgcttgtgat gatgctttaa 301 ctgtctacac ccagcatgac tttccccaag attgggcagg aacgaaaaat aatctcggta 361 atgcttactg tgcaaggatt ctaggagaga aagcagaaaa tatagaattg gcgatcgctg 421 cttgtgatga tgctttgact gtcataaccc agcaagcctt tccccaagat tgggcaaaaa 481 cgcaaagtaa tctcgctact gcctacggag aaagaatcag aggagacaaa gcaaaaaata 541 tagaattggc gatcgctgct tgtgatgatg ctttgagtgt ctacacccag caagactttc 601 cccaagattg ggcaagaatg caaaataatc tcggtgctgc ctacgtttat aaaatcagag 661 gagacaaagc agaaaatata gaattggcga tcaatgcttg ttctgctgct ttgagtgtca 721 gaacgcagca agcctttccc ctagattggg caagaacgca aaataatctc ggtgctgcct 781 acgtttatag aatcaaagga gacaaagcag aaaatataga attggcgatc gctgcttttt 841 ctgctggttt gagtgtctac acccagcaag cctttcccca agattgggca agaatgcaag 901 cttgtctcgg tgttgcctac tgtcaaagaa tcagaggaga caaagccgaa aatatagaat 961 tggcgatcgc tgcttgtgat gatgctttga gtgtctacac ccagcaagcc tttccccaag 1021 attgggcgat gacgcaaaat aatctcggtc ttgcctacgt tgaaagaatc acaggagaga 1081 aagcacaaaa tatagaattg gcgatcgctg cttgttctgc tgctttgact gtcagaaccc 1141 gcgaagcctt tccccaaaac tatgcagaga ctttatttaa cctggggatg gcttaccaaa 1201 atgcaaacca gttgacctca gcttacaata cctttaaatc tgctattgac acagtagaat 1261 ctttgcggga ggaaatacta tctggagagg aaaccaagcg caaacaagcg gaagaatgga 1321 atcaacttta tagccgcatg 
gtagaagttt gcctgaaatt ggttaagata accgaagcga 1381 ttgaatatgt tgaacgtagt aaaacccgca atttagttga actgatcctc aaccgtgact 1441 tgaaaaccat cttcctccca gaagtagtta ctcgactaga aacatacaga gatgaaataa 1501 ctacaggaca atatcaaatc caaaatggca aagctgaaaa ttcaaaagtc ttagcacaac 1561 gtctccaaga attgcgacag cagcgtaatg aattgcaaaa ctgctactta cgcgttggtt 1621 atgatttcaa atttgactca ttccaggcaa ctttggatga gcgtacagcc ataatcgagt 1681 ggtatattct cattgataaa attctggcgt ttattgtcat acctaaagga gaggtcactg 1741 tttggcaatc tcaaccagaa gatcaagaag cttttaggaa ttgggtaaat caatatttgc 1801 aaaactacga caatcaaaaa aaacagtggc agaatagctt aggggaagaa ctcaaaaaat 1861 tagcttcgat tctgcacatt gatgaaatat taaccaaaat accaaaccat tgcgaccaac 1921 tgatcctaat ccctcatcgg ttcctgcact tattccccct tcatgcactc ccaataacaa 1981 gtcaaaattc aaaattcaaa attcaaaatt atgaggattt gccttgtctt gtggatttat 2041 tccctcgtgg tgtaggttat gcacccagtt gtcaacttct gcaacaagtg caaaagcgag 2101 aacgtcttga ttttcaatcc ctgtttgcga ttcaaaaccc cacagaagac ctctctttta 2161 ctgacttgga agtagaaagc atattatctt actttcccgg acatcaacta ttacccaaaa 2221 aacaagccac aaaagctgcc ttatctgaag cagcaacaca attaaaagaa gtgaattatc 2281 tccacttttc ttgtcacggt ttcttcaact taaattctcc ccaaaattct tgtttagtgc 2341 tagcagatgc ctatgtttct cccattcccg ctaatgctag ccgcgaaaga tatctaaaag 2401 tacctgatga caaaactata gatttaagta aatgcctgac attgggtaat ttgtttgagc 2461 taaatgagcg aggtgagcta attttcgact ttagtcaatc tcgtctcgtg gttctttcag 2521 cctgcgaaac aggcttaatc gacttccaga acaccagcga tgaatatatc ggtttaccca 2581 gtgggtttct ctatgcaggt agtagcagtg ttgtaagtag tttatggacg gtaaacgatt 2641 tatcaacatc ctttttgatg attaagttca ttcaaatttt gaagtctgct acagatatgt 2701 cagtaccact tgctatgaat caagcgcagc ggtggttgcg ggatgctaca aaggaagagt 2761 tacaggaatg gatgattcaa ttgcctttag atagcaccaa gaaaggacag atacgtcgtc 2821 aaattaataa gatgactgga gagaaacctt tcgactctcc tgatcattgg gctgcattta 2881 ctgctgttgg caaataggag aagagtcatg tctgattatg aaattatcgt taataatttt 2941 gtccatgtgt tgacaactca atcctcattg ttttccaaag aagataggga tgaattaatt 3001 caattgattg aggaacaacc tgatgaaatc 
caatctctct ccaatgctat ttccgattgg 3061 tgttcagaac atcctgaagt agatgaggct ttagcagaaa ttgaagaatt aacagaaaga 3121 gcaccagggg agaagcgacc tacaaatatt cccaaatacg aacttgataa aaagaatata 3181 atcaatgcta ttcaacaaag ttcgtcatct gctaagaagg tagacaaacc aactcctaat 3241 aattgatatt ttgtgaaaat ttagatcccc gacttcttat agaagtcggg gatctggaca 3301 ccgcgacatt tatcatcatt ctaaaatgga ttatctttca acagtaaagg tagcttatcc 3361 caaactaact ctctatgctt ttcatctgaa gcatagcttg gcacaaaaac ccaaaacccc 3421 tgtagaaaat gccaaccatt tctggcaaaa atgccagcaa cttggtaaac aacttggtgt 3481 gccgagatta gaaactttac ctgcactgat tgaaaaagaa aataataaaa aaactagcat 3541 cactggggaa atacttcccg aacgtatttt aactttcacc gccattaaac ataacaagaa 3601 tctacactta agcggcgaag ctaatcccct agaaattcac gatacctacg cgcttgattt 3661 aactttacgt tatccccacc cagaggtaca actcactgat ttaaggggac taaatccaga 3721 tgattgctta cttccgaaaa atatcaatgc ttctctagga caaactttag tattttttgc 3781 tcaacctgta gaaagaatag atgatgagca agccttcgct gatgcttgtg ttgcagcatt 3841 actatcagaa gaaactgtcc gaaaagtcaa catttattgt cagcatcaag ggcaactatt 3901 aggtagtcca atatttgaat ataacaatga tgctgactct cctgaagaac agtgtcattt 3961 attaatttgg ttaaataccc attccgaaac tacaaaatta gaagagaaag gagaatatta 4021 ttatcctttc attgaccttc tcctttgccg cagtaaaatt atttacgctc gttctgaagc 4081 gatatggtgt tatgagcaag ccagatc // LOCUS NODE_6285_length_4074_cov_4.7240614074 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4074) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4074) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4074 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..2316) /locus_tag="DP116_27335" CDS complement(<1..2316) /locus_tag="DP116_27335" /inference="COORDINATES: protein motif:HMM:PF05860.11" /note="Derived by 
automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="filamentous hemagglutinin" /protein_id="PRJNA477356:DP116_27335" /translation="MLGNCCLLGGAMSFRNTCWFLGIAVGGAFALAANSASAQITPDA TLPNNSTVTINGNTFNIDGGTTAGRNLFHSFQQFSVPTNGTASFNNAADIQNIFSRVT GTSVSNIDGIIKALGTANLFLINPNGIIFGKDASLNVGGSFVGTTANAIQFGNQGFFS ATNPNNPALLTVNPSALFFNQIAAASITNNSVALAGKDPAGLEDALGLRVPDGKSLLL VGGNVNMDGGRLNAYGGRVELGGLAASGNVGLNVDGEKLSASFPTESALADVSLTNNA GVYVYASGGGNIIINARNLDMNESRLRAGIGPNLGSDGTVAGDIVLNAKGKITVVTSS IFNYVSSGAEGKAGNIQISADSLSLTNNAYLSASTDGKGDAGNVIIDVRGNVSFDNSS ANTTVFPNGEGKGGNIQIKADSLSLTKAFLFTNTYGRGDAGNVIIEARRNVFFDNSSA DTSVGSNVEAKGGNIQIKADSLSLNKASLSASTFGKGDAGDVIIDARGNVSFDNSSAS TTVSFNVEGKGGNIQIKADSLSLNKAFLFTNTFGKGDAGDVIIDVRGNVSFDNSNAST SVGDTGEGKGGNVQIKADSLSLTNNAVLFASSSGKGDAGNVIIDVRGNVSFDNSNAYT SPGFTGEGKSGNIQIKADSLSLTNAYLSAFTLRKGDGGNVIIDARGNVSFDNSYAFTS AGFTGEGKGGNIQIKADSLSLTNNASLDAKSFGQGDAGNIEVTARQIRLDNSASFNAE SASGNGGNINLRVTDLLLLRRGSQISTSAGTD" gene complement(2788..3678) /locus_tag="DP116_27340" CDS complement(2788..3678) /locus_tag="DP116_27340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016868381.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="PRJNA477356:DP116_27340" /translation="MTVTTQQQATTSFEKLFWTWQGHKIQYTVMGTGRPLVLVHGFGA SIGHWRKNIPVLAAAGYRVFALDLLGFGGSDKAPLNYTVELWVELLKDFWTAHISEPA VFIGNSIGALLSLTVVAEHPEIAAGGVLINCAGGLSHRSHELNPPLRIAMGAFNSLVR SRITGALVFNRIRQKAQIRRTLYQVYRNREAVTDELVDLLYAPSCDPGAQQVFTSILT APPGPSPTELLPKVERPLLVIWGADDPWTPITGAKIYEKARENGKDIQIVPIPGAGHC PHDEVPDVVNQQISDWLSSL" BASE COUNT 1142 a 950 c 830 g 1152 t ORIGIN 1 atcagtaccc gcagatgtgg aaatttggct accacggcgt aggagtaata agtctgtgac 61 tcgcagattg atattgccgc cattacctga tgctgattca gcgttgaaac tcgctgagtt 121 gtccaaacga atttggcgtg cggtcacttc aatgttacct gcatcccctt gtccaaaact 181 cttggcatca agtgaagcgt tattggttaa agacagagaa tcagctttaa tttgtatgtt 241 tccacctttt ccttctccag tgaagcctgc tgaggtgaaa gcatagctgt tatcaaaaga 301 aacattacca cgggcatcaa ttatcacatt ccctccatcc ccttttcgta aggtgaaggc 361 agaaagataa gcgttggtta aagacagaga atcagcctta atttgtatgt ttccactttt 421 tccttctcca gtaaagcctg gtgaggtata agcattgctg ttatcaaaag agacattacc 481 acggacatca attatcacat tccctgcatc cccttttcca ctgctgctgg caaaaagaac 541 agcgttattg gttaaagaca gagaatcagc cttaatttgt acgtttccac cttttccttc 601 tccagtgtcg cctactgagg tggaagcatt gctgttatca aaagagacat taccacggac 661 atcaattatc acatcccctg catccccttt tccaaaggtg ttggtaaaaa gaaaagcttt 721 atttaaagac agagaatcag ccttaatttg tatgtttcca ccttttcctt ctacattgaa 781 gcttactgta gtggaagcac tgctgttatc aaaagaaaca ttaccacggg catcaattat 841 cacatcccct gcatcccctt ttccaaaggt gctagcagaa agagaagctt tatttaaaga 901 cagagaatca gccttaattt gtatgtttcc accttttgct tctacattgg agcctactga 961 ggtgtcagca ctgctgttat caaaaaatac attacgacgg gcttcaatta taacatttcc 1021 tgcatcccct cgtccatagg tgttggtaaa aagaaaagct ttggttaaag acagagaatc 1081 agccttaatt tgtatgtttc caccttttcc ttctccattg gggaatactg tggtgttagc 1141 actgctgtta tcaaaagaga cattaccacg gacatcaatg atcacattcc ctgcatcccc 1201 ttttccatcg gtgctggcag aaagataagc gttattggtt aaagacagag aatcagcgct 1261 aatttgtatg tttccagctt ttccttctgc acctgatgac acataattaa 
aaatgctgct 1321 agttacgact gttattttcc cttttgcatt cagcacaata tctcccgcca cagtgccatc 1381 agaacctaaa ttaggtccaa tgccagcacg gagacgactt tcgttcatgt ctaaatttcg 1441 agcgttgatt ataatgttac cgcctcctga tgcgtataca tacacaccag cgttgttagt 1501 aagtgataca tcagctaagg cactttcagt tggaaagcta gcgctcaatt tttcgccatc 1561 cacatttagc cctacatttc cagaggctgc caatcctcct aactcaactc ttccaccata 1621 agcattcagc cgaccaccat ccatattgac attgccaccc actagtagca aacttttacc 1681 atcaggtaca cgtaaaccta atgcatcttc taatcctgct ggatcttttc cggctagtgc 1741 aactgagtta tttgtaattg atgcagcagc tatttgattg aaaaataaag ctgaaggatt 1801 taccgttagc agtgcaggat tatttggatt ggtagcacta aaaaaacctt gattaccaaa 1861 ttgaatggca tttgctgtcg ttcccacgaa cgaaccacca acattcaggc tggcatcttt 1921 accaaagata atcccatttg gattaatcaa aaacagatta gctgtaccta acgctttaat 1981 tattccatca atatttgaaa ctgacgtacc tgttacccta ctaaagatgt tctgaatgtc 2041 cgcagcatta ttaaaagaag ccgtaccgtt agttggtaca gaaaactgtt gaaagctgtg 2101 gaacaagttg cgtcctgcgg tagttccccc atcgatattg aaggtgttgc catttatggt 2161 gacagtggaa ttatttggta gagtggcatc aggtgtaatt tgggcagaag cagaatttgc 2221 agccaaggcg aacgcacccc ccaccgcaat ccctaaaaac caacaagtat ttctaaagga 2281 catcgcaccc ccaagtaaac agcagttacc caacatacag tgtaccagtg acaagtaaac 2341 catatttgcg tagacggtta attgtccttg gtgaagtctt cttttggtca aattagtgct 2401 tgctttcggc agagaacgcc attttttgcg gaaattcgtc tatcgctcat tttacaaaaa 2461 tttatctaat atcatgtccg cttaaatagt tgtcattgcg aacgcagcga tagcggagtg 2521 aagcaatccc agactcttgc gattgctacc ctgcgggaag ccccctccgg ggtctacgtt 2581 tcactccgtt ccactcgcaa tgacatatca taagtatttt gccgcacatg atatgatact 2641 aagttgcgtt ctaatggaga atttactgtg gctgagattg cttcgcttca ctccgttacg 2701 ctcgcaatga cagattcgct ctttgagtgc aacttagtat aagtcatcaa gaaacccggt 2761 tttataaaac cgagtttctc tttctatcta caaagatgat aaccagtccg aaatctgctg 2821 atttacaaca tctggtactt catcatgagg acaatgacca gcacctggaa ttggtacaat 2881 ttggatatct ttgccattct cccgcgcttt ttcgtatatt tttgccccag tgattggtgt 2941 ccaagggtca tctgcacccc atatcaccag caatgggcgc tcaactttgg gcaaaagttc 
3001 tgttggagaa ggacctggag gtgcagtgag aatagaggtg aaaacttgtt gcgctcctgg 3061 atcacaagaa ggagcataga gtaagtctac caattcatcg gtgacggctt cccggttacg 3121 gtaaacttgg tatagagtac ggcgaatttg agctttttga cggatgcggt tgaagacaag 3181 tgctccggtg atgcgcgatc gcaccaaaga gttaaaagcc cccattgcaa tacgtagtgg 3241 tgggttcaac tcgtgcgatc gatggctcaa cccacctgca cagttaatca aaacgccacc 3301 tgcagctatt tccggatgtt ccgccaccac cgttaagctc aaaagtgcac caatggagtt 3361 cccgataaat acagcaggtt cactaatgtg tgctgtccaa aaatccttga gcagttccac 3421 ccacagctct actgtataat ttaacggagc tttatcagaa ccaccaaacc ccaaaaggtc 3481 aagagcaaaa actcggtagc cagcggcggc taaaacgggg atatttttcc gccaatgtcc 3541 aatagaagca ccaaagccat gaaccagtac aagggggcgt ccagtaccca tgacagtgta 3601 ctgaattttg tgaccttgcc aagtccaaaa gagtttttca aagctagttg ttgcttgctg 3661 ctgcgttgtt acagtcatta ttaagattct taacgttttt cctcatatat tcattttctc 3721 tgatttcaaa ggattaggaa caacgttcta agtacggttt tgcgtaaaaa cgagcgtgtt 3781 tttaaacgca ccaggagcgc agagtcagcg cagaggggag ccagcgcgtt gcgggggttc 3841 cccccgttgt agcgactggc gtcgcagagt ctttgtatag agaattcgcg ctcgtattga 3901 tgaaatgttg tactaagtac tggattgcaa ttgcccaact tcgtctggag gggataaatc 3961 taagattgag tcatgtcagc cgggctaagt tctcgccgcc actgtttgag aatcagtgct 4021 gaccaaaaag taaatcccaa tatagccaaa ccagtcaacc catcagtcca gata // LOCUS NODE_6290_length_4070_cov_4.9322544070 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 4070) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 4070) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4070 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(112..639) /locus_tag="DP116_27345" CDS complement(112..639) /locus_tag="DP116_27345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017317354.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4112 domain-containing protein" /protein_id="PRJNA477356:DP116_27345" /translation="MSEIPPSYPIIEPDVKAPTLRRLRQLSRLLDRAITVPGTQVSIG LDPIIGLIPVGGDFLGVMLSAYIVLEAARLGAPAATLSRMMINIIIDGLVGAIPIAGD LFDFAWKANERNVKLLEDHLRFPRQRKSADKWFVFGVFIVLFIVAIALVAFTVMFIRL LGSLLVMLQGLLSGG" assembly_gap 647..656 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 922..1380 /locus_tag="DP116_27350" CDS 922..1380 /locus_tag="DP116_27350" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877970.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27350" /translation="MTQELTSQWLAEIKALKQQIAELQAEQEAGWQSGEKWRKLYNIE AEQRRTDAQMAQQTIASLKAEIQQIKGIEEARLDDPTAATAIEQQVEQLKSVEELQAK LILVTKERDRLLQALKTEQDNHAQTRKSLTTALADAIDGLTKEREEREGK" gene complement(1522..2613) /locus_tag="DP116_27355" CDS complement(1522..2613) /locus_tag="DP116_27355" /EC_number="4.2.3.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009454732.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="threonine synthase" /protein_id="PRJNA477356:DP116_27355" /translation="MTLSLSAAKSHRQPWLGLIEQYREYLPVNEKTPVVTLQEGNTPL IPVPSIAELVGRQVRVFVKYDGLNPTGSFKDRGMTLAISKAKEAGAKAVICASTGNTS AAAAAYARRGGMRAFVLIPDGYVALGKLAQALLYGAEVLAIKGNFDQALEIVREMAET YPVTLVNSVNPYRLEGQKTGAFEIVDVLGNAPDWLCIPVGNAGNITAYWMGFCQYHQA QKCDRLPKMMGFQAAGAAPLVTGQPVAHPETIATAIRIGKPASWEKAVAAQQASAGKF HAVTDAEILDAYRLLASEEGIFCEPASAASVAGLLQVKDQVPTGATVVCVLTGNGLKD PDTAIKHSHSKFKQGIEPELLAVAEAMGF" gene 3042..3224 /locus_tag="DP116_27360" CDS 3042..3224 /locus_tag="DP116_27360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877973.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27360" /translation="MSTSSVSEKATQSVLASKLNKCYYQAEQLTKFMHLQAEVDCLLE QLQSMKTEKSITNKEE" BASE COUNT 1160 a 861 c 948 g 1091 t 10 others ORIGIN 1 tcagttatca gtacttgtta ctcataacct tgataactgt tcactgtgaa atccacctca 61 gcaaacttga taactgttca ctgcggctgg tactgttcac tgttcactgt tttaaccgcc 121 acttaacaat ccctgtagca ttacaagcag tgatccaagc agtctaatga acattacggt 181 aaatgcgact aatgcaatag caactatgaa caatacaata aaaaccccaa atacaaacca 241 cttgtctgca cttttcctct gcctaggaaa cctcagatga tcttccagta gcttaacatt 301 acgctcgttc gctttccagg cgaaatcaaa caagtctcct gctatgggta ttgcacctac 361 taagccatca ataatgatat tgatcatcat tctacttaaa gttgccgcag gcgctcctag 421 tcgtgcagct tcaaggacaa tgtacgctga aagcatgact cctaaaaaat caccaccaac 481 aggtattaat cctatgattg gatctagacc aatactaacc tgcgttcctg gaacggtaat 541 agctctatcg agtaagcggc ttaactgacg cagccttctt aaagttggtg ctttaacatc 601 aggttcaatt atggggtagg aagggggaat ttcagacatt agggcgnnnn nnnnnngagt 661 tttttaaacc gttaaatatt ttaactctct aattgggaat gggaatactt gatctttggt 721 attgtaaaac ttcgttaaaa tagtacagat ggtgtaaaaa aaaataccat ctcaaacagc 781 caaaaagaaa ggctttttgt aaagtgcgtg cctgtagggc atatttttaa tttttagaga 841 agtcaccacc cggggttttc tcggctgaaa atttcggtga 
tacgtgaatt ttgaattgga 901 acgaagtggt agtaggcatt tatgactcaa gaattaacat cacagtggtt ggcagaaatt 961 aaagccctca aacagcagat tgcggaactt caagccgaac aggaggcggg gtggcaaagt 1021 ggcgaaaaat ggcgtaaact ttacaacatt gaagcagaac aacgccgtac agatgctcag 1081 atggcgcagc agacaatcgc gtctttaaaa gcagaaatac agcaaattaa aggtattgag 1141 gaagcacgac ttgatgatcc cactgctgca acagcgattg aacaacaagt cgaacaacta 1201 aaatctgtgg aagaattgca agcaaaactg attttggtta ctaaagaacg cgatcgcctg 1261 ctacaagctt tgaaaacaga acaagacaac cacgcccaaa cgcgcaaaag tttaaccaca 1321 gctcttgctg atgctataga tggtttgaca aaggagaggg aagagaggga ggggaaatag 1381 gctggggagt aggggaagaa attcaaaatt ttgaattttg cggaaagtgg gcggtgtcgg 1441 tttggatccg gtccaacttt ccaagacgaa tgagcgaatt ttgaattgat ctgtctcctt 1501 gtctccctgt gctttagtct tctaaaatcc cattgcctct gcgactgcta acagttctgg 1561 ctcaatgcct tgtttaaatt tgctatggct gtgtttgata gctgtatctg gatctttgag 1621 accattacca gtcaggacgc aaacgactgt cgcgcctgtg ggaacttggt ctttcacctg 1681 taacaaacca gcgacggaag cggcgctagc gggttcgcag aaaattcctt cttcggatgc 1741 caaaaggcga taggcatcaa gtatttccgc atcggtgacg gcgtggaatt tgcccgcgct 1801 tgcttgttgg gctgctaccg ctttttccca gctggcgggc ttgccaatac gaattgctgt 1861 agctatggtt tctggatgtg cgactggctg tcctgtcaca aggggagctg cacctgctgc 1921 ttggaatccc atcatcttag gtaggcgatc gcacttttgg gcttgatgat attgacaaaa 1981 acccatccaa tatgctgtga tatttcccgc attccccact gggatgcaca gccagtctgg 2041 agcattaccc aaaacatcaa caatttcaaa cgcccctgtt ttttgacctt ctaagcggta 2101 aggattgaca gagtttacca aagtgactgg ataagtttcg gccatttcgc gcacaatttc 2161 tagcgcttgg tcaaaattcc ctttgattgc caggacttct gccccataca gcaacgcctg 2221 tgccaacttg cccagcgcca cgtagccatc gggaattaag acaaaggcac gcattcctcc 2281 acggcgagca taagctgctg ctgctgctga ggtgttgccc gtgctggcac agataaccgc 2341 ttttgctcca gcttcctttg ccttagaaat tgccaatgtc atacctcggt ctttaaagct 2401 accagtgggg ttaagaccat cgtatttaac aaagacacgc acttgtctac caacaagttc 2461 tgcaattgag ggaacaggta ttaatggtgt attgccctcc tgtagagtaa caactggtgt 2521 tttttcgttg acaggcaagt actcgcgata ttgttctata agtccaagcc 
agggttggcg 2581 atgagattta gcagcagaca ggctcaaagt cacggtattt aaaagctctt agctccaatt 2641 gaaaacagta tctggtattt attttgcacc ttcagtgcta tgcaagcgca agggggtgca 2701 tctaggatct atggattggt gaaattgttc tttggaacaa aactgtagga tgtgcagtct 2761 cctaaaatac aattctaaag gatgtttgtt aaagtcaata ttgtcatatt gaagtgcggc 2821 aaagtgaaag ataactccat cagcaaggag ttgctgattt ggcgtaacat gcttgtacac 2881 gttgccgcta cggtaatagc acgacagatg aatacttttc aaacatcctt tgagagatgt 2941 taaatttgtg cgagaaaaat atacaattat ttaatgattc ttaaaactcg cctattataa 3001 tatctctaat ggtgtgagtt gatttatgga tagatacgtc aatgtctaca tctagtgtgt 3061 cagaaaaagc aacccaaagt gtcttggcaa gcaaactaaa caagtgttac taccaagctg 3121 agcagttaac aaaatttatg catttgcagg cagaagtcga ttgcttgcta gagcagttgc 3181 aaagtatgaa gacggaaaaa tcaataacaa ataaggagga ataggttagt tttcggtaac 3241 ggtgctagcc ccatataagt aggtcagtgt gaatatttat cgttgagaca aggcagaagg 3301 cagacggcag aaggcagaag ggaagagggt tttagctaag tttacttttc gttacatact 3361 tctgtttatt tgcacctttc tacttagtca caatatcatg acgagcacgc aactgaaagc 3421 ggtggttgtt gtgcgagcaa atttttagac cagaactgtg ggcagtctct acgagcgaag 3481 ttagggaatg ccctagtacg agtctagcga gacacatgat tgcgtctgtt gctttcttgc 3541 gatggcaacg cctgccctac gccccaatcc agcggagccg gattgccttg tgctgaaagc 3601 acaggggcag ggttttcctc cagaaaaccc tccgggcgtg atacgggcga tgctccggag 3661 gagccgcccc aagggggcga tcgcagttca tagttgatct tacttttcaa atcagtataa 3721 ataaattgac attatcggat aaaaaaagcg tgcgagaact gaccagattc ttgcagcaat 3781 gatgcaaacg caagtgcgtg ctctttatac ttagtgccgc accttgctaa atttttggag 3841 ctgtttctac tttgtgctta tccgggggcg ttacggatgc tgcgggcgtt ggcgaaatga 3901 ggtaattgca tccggtggat aatacacgag attgttcgtg cgtgaagtac gatgcaccag 3961 atctaaaagc gacacagaaa gcgtttttta cttccaactt ccgacttcca acttctgact 4021 tccgaccaag aaggggtgta aggggagcca gtgcgttggg gagccagtac // LOCUS NODE_6488_length_3878_cov_5.0980913878 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3878) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3878) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..3878 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(164..1606) /locus_tag="DP116_27365" CDS complement(164..1606) /locus_tag="DP116_27365" /EC_number="6.1.1.17" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017319000.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutamate--tRNA ligase" /protein_id="PRJNA477356:DP116_27365" /translation="MTVRVRIAPSPTGNLHIGTARTAVFNWLFARHHGGQFILRIEDT DLERSRPEYTENILQGLRWLGLTWDEGPFFQTKRLEIYKQAVQTLFEKGLAYRCYTTP EELEALREAQKARNEAPRYDNRHRNLTPEQEAAYKAEGRNFVIRFKIDDDREIVWNDL VRGKMVWRGSDLGGDMVIARAAADGIGIPLYNFAVVVDDMDMQITHVIRGEDHIANTA KQILLYEAFGAKVPEFAHTPLILNQEGRKLSKRDGVTSIFDFKQMGFTAEAMVNYMTL LGWSAPDSTQEIFTLEEAAKQFGFERVNKAGAKFDWAKLDWINSQYIHNMSVDKLTDL LIPAWEEAGYQLTGGRERAWLDQLVTLIGPSLTRLVDAVAMSKLFFVETVEFSDEATA QLKQEGSAATLKSVIAALENHQLTETSAQEIIKQVVKEQNVKKGLVMKALRAGLTGDL HGPDLIQSWLLLNQIGLDKPRLIEAVKEAS" gene complement(1738..1811) /locus_tag="DP116_27370" tRNA complement(1738..1811) /locus_tag="DP116_27370" /product="tRNA-Asp" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(1775..1777),aa:Asp,seq:gtc) gene complement(2009..3208) /locus_tag="DP116_27375" CDS complement(2009..3208) /locus_tag="DP116_27375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315519.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27375" /translation="MQFVKKTFKFPRILFFFIPLLILISIFIINVPAPNYPITEINFI GSASFPTGYTFKKTPMGGFSGITYDTKKQLFYTISDDRSEKAAARFYTLKIDLSNGTL TNEKVVPVGVTTLLNESRQTFPRGTIDSEGIALTKKETVYISSEGDNLKQITPFLKEF SLSSGQVLRTLPIPNKFLPDKSRQQGIRNNLAFESLTITPNNKSLFTATENALIQDGP EAKPKISSPCRILQYNLLTQQPEKEFLYQTEAVAPILNIFSRLSPQFSSGLTDLAALD NQSHFISLERTFTGFGFSIALFQISLEGADDIHNIDSLLAVDINKIKPTKKELLLDLR KLDVALDNIEGLTLGPKLPDGQLSLILVSDNNFNRLQRTQILAFKLKMEPPLIRLFRR FVPNFNR" gene complement(3631..>3878) /locus_tag="DP116_27380" CDS complement(3631..>3878) /locus_tag="DP116_27380" /inference="COORDINATES: protein motif:HMM:PF00805.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_27380" /translation="ADLSSADLSSADLSSADLSSADLNCSTTFTKEPICTNLKGVKNL TFQQVKAAKNWEQACYDPELRKQLNLPPENPKYCAGE" BASE COUNT 1090 a 820 c 829 g 1139 t ORIGIN 1 gagacaggtg agacagcgcg aatgacggct ctccctcact tggcgactgc gaacccgata 61 gccgtaaggc gtggcgttag cgatagggcg aacccgaagg gttatcagtt cactgttcac 121 tgttcactga taactgttga gcacgttccc aaaggataat tacttaactc gcttctttta 181 ccgcctcaat caaacgtggc ttgtctaaac caatttgatt caaaagtaac cacgattgga 241 tcaagtcagg accatgcaaa tctcctgtta agccagcgcg aagtgctttc atcaccaagc 301 ctttcttgac gttttgttct ttaacgactt gcttaataat ttcttgagcg ctagtttcag 361 tgagttgatg attctccaaa gcagcgatca cactcttgag ggtagcagca gaaccttctt 421 gttttagttg agcagttgct tcgtcactaa attcaactgt ttctacaaaa aacagtttgc 481 tcatagccac tgcatcgaca agacgagtta aacttggacc aattaaagtg acaagctgat 541 ccaaccaagc acgttcacgt cctcctgtca attgatatcc tgcttcttcc caagcaggta 601 taagcaaatc agtcagctta tccactgaca tattgtgaat atactgactg tttatccaat 661 ccagcttcgc ccagtcaaat tttgccccag ctttattaac acgctcaaag ccaaattgct 721 tggctgcttc ttccaaggtg aaaatttcct gagtcgagtc tggggctgac caacccagca 781 atgtcatata attcaccata gcttcagcag 
taaagcccat ttgcttaaag tcaaaaatgg 841 atgtgactcc atctcgcttg gaaagcttgc gtccctcttg attcaaaatc aagggtgtat 901 gggcaaactc tggcactttt gcgccaaaag cttcatacaa caaaatttgc ttagcggtgt 961 tggcgatgtg gtcttctccc cgaatgacat gggtaatttg catatccatg tcatccacga 1021 caacagcaaa attgtacagt gggataccaa taccatctgc tgcggcgcga gcaataacca 1081 tatcaccgcc taaatcgcta cctcgccaaa ccatctttcc ccgtaccagg tcattccaga 1141 caatttcccg gtcgtcatca attttgaaac gaatgacaaa gttacgccct tcggctttat 1201 atgcggcttc ttgttctggt gtaagattac ggtgacggtt atcatagcgc ggagcttcat 1261 ttctggcttt ctgagcctca cgcagcgctt ctaattcttc tggcgtggtg tagcagcgat 1321 atgctaatcc tttctcaaag agggtttgaa ccgcttgttt gtaaatttcc agacgcttcg 1381 tttggaaaaa tggtccctca tcccatgtca gtcccagcca acgcagacct tggagaatat 1441 tttcggtgta ttcaggacgc gatcgctcca agtctgtgtc ttcaatgcgc aggataaact 1501 gaccaccgtg gtggcgggca aacaaccagt taaatacagc tgtccttgct gttccaatgt 1561 gtaaattccc agttggactg ggggctatac ggactctaac agtcacgggc tttttctctc 1621 ttgtctatat ttgtaaaaca cgactcattt agcttaatat taagccttaa gcaatgagcg 1681 ctgagttgtg aataactcac gactcacgac gcggaactca gcaatcaaga ctttcaacgg 1741 gactgacggg gctcgaaccc gcaacttccg ccgtgacagg gcggtgctct aaccaattga 1801 actacagtcc cttaattggt cactttattc attatgacta tttagctgaa cattgtcaag 1861 cttttttttt aaattcgtta ttagttattg agtgatgatt cacagggaac agggaacagg 1921 gaacagggaa cacccgaacg cttaactctt aacgcttgac atacactgta cgaagcttcg 1981 caaaaatctg cggaggaatc ctatattatc aacgattgaa attagggaca aaacgccgaa 2041 ataacctgat aagcggtggt tccatcttaa gtttaaaggc gagtatctgg gttcgctgaa 2101 gacgattaaa gttattatcg cttacaagga ttaatgaaag ctgaccatca ggtagtttgg 2161 gacctagagt taagccctca atattatcta gcgctacatc taattttctc aaatccaaaa 2221 gtagctcttt cttggtgggt ttaattttgt taatatcaac agctaaaaga ctatcaatat 2281 tatgaatgtc atcagcacct tctagagaaa tctgaaacag agcaatagaa aacccaaagc 2341 cagtaaaagt ccgctctaaa ctgataaaat gactttgatt gtcaagcgcg gctaaatcag 2401 ttaaaccact tgaaaattga ggagacaatc tcgaaaaaat attcaagatt ggtgcgactg 2461 cttcagtttg gtaaagaaat tctttctctg gttgctgagt 
gagcaaattg tattgcaaaa 2521 tgcggcaagg actgctgatt tttggtttcg cttctggacc atcttgaatc agagcatttt 2581 ctgtggctgt aaacaaagac ttgttattag gtgtgatagt cagagattca aatgctaagt 2641 tgttacggat accttgttga cgacttttat ctggcaaaaa cttgttggga attggtagtg 2701 ttctcaacac ttgtccagaa gagagtgaaa attctttaag aaaaggtgtt atctgtttga 2761 gattatcacc ttcagaagaa atataaacag tttctttctt agttaaggcg ataccttctg 2821 aatcgatagt accacgggga aaagtttgac gactttcatt taagagtgtc gtgacaccaa 2881 cggggacaac tttctcattt gttagtgttc cattgctcag gtcaattttt aatgtgtaga 2941 agcgagccgc agctttttcg ctgcgatcat cagaaatcgt gtaaaaaagt tgttttttgg 3001 tatcgtaggt tattccagaa aaccctccca tgggagtttt tttgaaagtg taaccagtgg 3061 gaaaactcgc agaaccaata aaattaattt cagttatagg ataatttggg gcaggtacat 3121 taatgataaa gatgctgata agaattagaa gaggaataaa aaagaaaaga attcgtggaa 3181 acttaaatgt ctttttgacg aactgcattt tgagtcattg ctaagctggg aaaatcacaa 3241 acacgatatc atatcatgtc ttgatgatta cttatcagaa aattacgtta gggcagttac 3301 gtaaacgcta tgggagggga acttcaactg attgtagagt ttcctgatag acagcctgtt 3361 gtgctgactg gacttgcaga attagatgag gattctgaga cagaagaaga tgcgtatctg 3421 cagacgtaag attctcgctc tcatgtctgt cgctatttgg cgagatttaa cagttccttt 3481 gtaacttggc tggctgagtt gtgctcttag cgtttctcat tccggctact cacctgtata 3541 ctaccgttta cattgtgtac acaagtgact atacctcatt gaccttcatt tgtatacaat 3601 atgtagacat tagtagacaa gagaaaatcc ttattcgcct gcacaatact ttggattttc 3661 tgggggcaaa ttcagttgct tgcgtaattc cgggtcatag caggcttgtt cccaattctt 3721 agcagctttt acctgctgga aggtcaagtt tttaactccc ttaaggttgg tacagatggg 3781 ttcctttgtg aacgttgtgg aacagttaag gtcggcacta ctgaggtcgg cactactgag 3841 gtcggcacta ctgaggtcgg cactactgag gtcggcac // LOCUS NODE_6489_length_3878_cov_4.0753343878 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3878) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3878) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3878 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 68..760 /locus_tag="DP116_27385" CDS 68..760 /locus_tag="DP116_27385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318634.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transporter" /protein_id="PRJNA477356:DP116_27385" /translation="MTQLWTAFTEAVIAFVATNIDDILILVLFFSQVNANFRRRHIVF GQYLGFTAIIIASLPGFFGGLFIPREWIGFLGLLPIAIGLKLLVNKEQETTQVQTVTT DFQPSSHSNPILSFLLSILHAQTYQVAAVTLANGGDNISIYIPLFAGKSFASLGVTLS VFFVLVGVWCAVAYLLARQRAIAFGRASLNANILSRYGRAVVPFVLICLGLFIMYERG TFSLILHSYRGN" gene complement(737..1540) /locus_tag="DP116_27390" CDS complement(737..1540) /locus_tag="DP116_27390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017746518.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sulfite exporter TauE/SafE family protein" /protein_id="PRJNA477356:DP116_27390" /translation="MHYSLLPVFSFFIGIVVGLTGIGGASLITPMLIFLFQVPPSIAV SSDVVSATLMKFVGGYQHWQQKTLDVQVVKWLAFGSVPGSLFGVGILHFVKLTGEQNL DDILLHLVGMMILFMTLLALAQLLVLWFFPDFKLPELPKFDLTTKLGCAIAISVGAVL GCLVGLTSVSSGSMFALVLIAFFQLETRKLVGTDISQAAILLFFTSLGHLTLGTVDWS LVLPIWLGSVPGVLLGAKLCQLAPQRPLRFIIYTILLMASYKLVSPVGV" gene complement(1798..2862) /locus_tag="DP116_27395" CDS complement(1798..2862) /locus_tag="DP116_27395" /inference="COORDINATES: protein motif:HMM:PF14315.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27395" /translation="MKAKTWNLVCFALLLAVLIVGWGQVTAKIPNQKAPSQYAKITVV KTNYQGWRDSWVLSNGQVEAIIVPAIGRVMQFRLLDGEDIFWENPKVFGKAPNPAPEK WVNFGGDKTWPAPQSDWTKITGRDWPPPTGFDSIPVQAKVDGSEVTLISPIDPLYGIR TYRRIRLEPQKPVMTISTTYEKVKAKPKDVAVWVITQLRHPVGVYAALPQASIFPQGY NRQSEELPANLKVENGMLSLTRDPKKSHKIGSDASTLLWVGEKVVVRIDSPREPALSY PDQESSAEIYTNSDPDAFVELEFLSPLKTLQIGERIDLTTTYTLLRRTTPNAEEEARK IIGRSEVTVPNPRNIAKATR" gene complement(2859..3791) /locus_tag="DP116_27400" CDS complement(2859..3791) /locus_tag="DP116_27400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006632700.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SMP-30/gluconolactonase/LRE family protein" /protein_id="PRJNA477356:DP116_27400" /translation="MILRIATCSLLLSILTVQQSQGKAIADLQTIMSSNTKLEKLADG LKFTEGPVWHPLGFLLFSDIPANTIYKWTTDSKTSIYRRPAGNPNGNTFDREGRLITA EHDRRLVRTEKNGQITVLAERYQGKRLNSPNDVVVKSDGSIYFTDPPYGISKEKEELG FYGIYRLKPDGTLTLLSKDMVRPNGLAFSPDEKKLYVSDSEKGHIRVFQVNSDGTLTN TRVFAELTGPKDKGVPDGMKVDVQGNIYCSGSEGVWVFSPTGQLLGKILVPEVVTNLA WGNKDYKTLYITAGQGIYRIRLLVAGVQPPKDKQ" BASE COUNT 1057 a 861 c 805 g 1155 t ORIGIN 1 tgtatatcat gattttattt tttataattt ataggctttc taacatcata ttttcttatc 61 aatgaccata actcaacttt ggacggcttt cacggaagct gtcattgcct tcgttgctac 121 caatattgat gatatcctga tcctggtgct atttttctca caagtaaatg ccaattttcg 181 gcgacgacat attgtatttg gtcagtacct tggttttaca gctatcatca ttgctagcct 241 accaggattc tttggtggct tgttcatccc gcgagaatgg attggattcc tgggattact 301 acccatagca attggtttaa agctattggt gaataaagaa caagaaacta cgcaagttca 361 aacagtaacc actgattttc agccctcctc acatagtaac cccatactgt cttttctctt 421 gagtatttta cacgctcaaa cttatcaagt cgcagcggta acacttgcta atggcggtga 481 taatattagt atttatatac ccttatttgc tggtaaaagc tttgctagtt taggagtgac 541 tttaagtgta ttttttgtct tggtaggggt ttggtgtgca gttgcttacc tattggctcg 601 tcaaagagcg attgccttcg gcagagcttc gcttaacgct 
aacattttaa gccgctacgg 661 tagagccgtt gttccttttg tactaatttg ccttggtctt tttatcatgt atgaaagagg 721 aacattcagc ttgatattac actcctacag gggaaactaa cttgtaactt gccatgagca 781 agattgtgta aatgataaac cgcagtggac gttgaggcgc aagttggcaa agttttgcac 841 caagcagtac cccaggtaca gagcctaacc atattggtaa gactaaactc cagtcaactg 901 tcccaagggt gagatgtcct aatgaagtga aaaataataa aattgcggct tgtgaaatat 961 ctgtacccac taacttacgg gtctctagct ggaaaaaggc tatcagcacc aatgcaaaca 1021 tcgaaccgga ggagacactc gttaaaccaa ccaaacagcc tagaactgct cccacgctta 1081 ttgcgatcgc acaaccaagc tttgttgtca aatcaaactt tggtagctct ggtaatttaa 1141 aatcagggaa aaaccataat accagcagtt gtgcaagtgc taacaaagtc ataaacaaaa 1201 tcatcattcc aaccaagtgc agcaggatgt cgtctaggtt ctgctctcct gttagtttga 1261 cgaagtgcaa aattcctacc ccaaacagtg agccaggaac gctgccaaat gccagccatt 1321 ttaccacttg tacgtcgaga gttttctgct gccagtgctg atagccgccg acaaacttca 1381 tcagcgtggc agacacaaca tcagaactca cagcaatgga aggaggaacc tggaataaaa 1441 aaatgagcat tggcgtgatc aaagatgctc cgccaatgcc tgttaagccg acgacaatac 1501 caataaagaa gctgaatacg ggtaataatg agtagtgcat attgctgttc tacgaaaggg 1561 ttgaaggtgt tttatctctg accagaagtc agcttggtag atccaaaccc cttttgcttg 1621 acaaatacaa taattctata tgcttaccgt actttaactc aaaaagtgag cgaacgtcta 1681 gcgtaatggc acagatgaac aagatattat tatttacaca gataataaat gagttcttga 1741 gttcttttgt gctgtgttaa tcggatgtag tcaaaacccg ttggtaagag ggaagagtta 1801 tctagtagct ttcgctatat ttcttgggtt aggaacggtt acttcagatc gtccaataat 1861 tttcctggct tcttcttcag cattcggggt cgtgcgtcga agtaaagtat atgtagtggt 1921 aaggtcgatt ctttccccaa tttgcagggt ttttagtgga ctaagaaatt caagctccac 1981 gaaagcatct ggatctgaat tggtataaat ttcagcactg ctttcctggt cgggataact 2041 aagcgcaggt tctctcggtg agtcaatgcg gactacaact ttttctccaa cccataacaa 2101 cgtacttgca tcagagccta tcttgtggga ttttttcgga tcacgtgtca atgacagcat 2161 cccgttttcc acctttaagt tggctggtaa ctcctcggat tgtctgttat acccttgagg 2221 aaaaatcgaa gcttggggta gggcagcgta tactcccact ggatgacgca actgtgtaat 2281 cacccatacc gcaacatctt ttggcttggc tttgaccttt tcataagttg 
tggaaatagt 2341 catcacaggt ttttgtggct cgagtcgaat cctgcggtaa gtccgaatgc cgtaaagtgg 2401 atctatggga gaaatgagcg tcacttcgct accgtcaacc tttgcctgaa ctggtatgga 2461 atcaaaacct gtcggtggag gccaatctcg ccccgtgatt tttgtccagt cagactgagg 2521 agcaggccaa gttttatcac ccccgaaatt aacccatttt tctggcgcgg gattgggggc 2581 tttgccaaac actttcggat tttcccagaa tatatcttct ccatctagta atcgaaactg 2641 catgactcgc cctattgcgg gcactataat tgcttctact tgaccattac tcaaaaccca 2701 ggagtctcgc cagccttggt aattagtctt tactaccgta atttttgcat actgtgatgg 2761 agctttttgg ttcgggattt tcgcagtgac ttgtccccaa cccacaatca aaaccgctag 2821 taataaagcg aagcaaacta gattccacgt tttagccttc attgtttatc cttcggcggt 2881 tgcactcccg ccaccaagag gcgaatacga taaataccct gacctgcagt gatgtaaagt 2941 gttttgtagt ctttattacc ccaagccaaa tttgtgacta cctctggcac gagaatttta 3001 cccagaagtt gacctgttgg cgagaaaacc cacacccctt ctgaaccgct acagtagata 3061 ttgccttgga catctacctt catgccatcg ggtacacctt tgtctttcgg tccagttaac 3121 tctgcaaaaa ccctcgtgtt agttaatgta ccatctgaat taacctggaa aacacggatg 3181 tgtccttttt cagaatcact gacgtagagt tttttttcat caggagagaa cgcgagtcca 3241 ttaggacgca ccatatcttt gctgagcaaa gtcaaggttc catctggttt caagcgatag 3301 ataccataaa agccaagttc ttccttctcc ttacttatac catagggcgg gtcagtgaag 3361 taaatgctac catctgattt gactacaacg tcgttgggac tattgagacg tttaccctgg 3421 tagcgttcag ccaaaacggt tatttgaccg tttttttctg tgcgtacaag ccggcggtca 3481 tgttcagcag taattaagcg tccttcccta tcgaacgtat tgccgttggg attgccagca 3541 ggacgacgat agatggatgt tttactgtca gttgtccact tatagatagt attagcggga 3601 atatcactga agagtaaaaa gccaagcgga tgccagacgg gtccttctgt aaactttaaa 3661 ccatcagcca acttttctag cttggtgttg ctgctcataa tggtttgtaa atcagcgatc 3721 gcttttccct gactctgttg tacggtcaaa atgcttagta ataaactaca ggtagctatt 3781 cgtaaaatca attttttcac aatcacttac tcaccttaag tcaactcaga tcttgcacca 3841 tacttttcgt ccgccaagaa ataaatttct tggctcaa // LOCUS NODE_6493_length_3877_cov_4.9631083877 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 3877)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3877)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
source 1..3877 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(252..1718) /locus_tag="DP116_27405" CDS complement(252..1718) /locus_tag="DP116_27405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411587.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="two-component sensor histidine kinase" /protein_id="PRJNA477356:DP116_27405" /translation="MRLRSFRLRIALLSAGLAGSALVGFGAVSWFQIYNAQIARLDAQ LFNQLLRATRSAQRERLQFYGDSLPYAFGTNTKIPIAVLVRDANGNTLYQSDEVPIDK EVNRLLLGRLELTPLPPFPKEPPPKPESTDPPVKPPPLLRSRRPPEFVTQQTTTGLWR IGALKFPNVQVAIAVSLQAVNQEMGTIRNIFLVSISGALLLVAFGAWFVSGGALRPIR QLTGVIQQVTVKGLDQRIPIGTTDVEFVELIQVFNQMLERLERSFTQASRFSADAAHE LKTPLTILQGELERTLQQVDSGSEVQQRLSNLLDEVRRLSGITRKLLLLSLADAGQMK LFLVEVDMSELLMEMLEDVELLAPHLSVQTDITDGLRVQGDRDLLIQVLQNLFSNAIK YNLANGWIQIHAKKTETTLHITIANASKEIPVGDRSRIFDRFHRGDPARTRKVEGIGL GLSLAREIARAHRGDLTLDSTSDGQTAFTLTLPMTLDN" assembly_gap 1721..1730 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1734..2414) /locus_tag="DP116_27410" CDS complement(1734..2414) /locus_tag="DP116_27410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015113589.1" /note="response regulator in two-component regulatory system with CusS; regulates the copper efflux system; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_27410" /translation="MNVLFVEDEAKIANFVQAGLKEQGFVVDYCDNGDEGYLRALDNE YDVIVLDIMVPGKDGLWILKQLRRSGRNAPVILLTARNELDDRLSGLNLGADDYIAKP FFVEELAARIHAVVRPSVGDRQNLLSVGSLKLDRITRKVTCNQQAIELTSREFNLLEY LMRSPGRVFTRTQILEHVWGYDFNPNTNVVDVCIQRIRKKIDSIDRANWIESIRGVGY RFCKPDLI" gene 3151..>3877 /locus_tag="DP116_27415" CDS 3151..>3877 /locus_tag="DP116_27415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740408.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27415" /translation="MVQTILILAANPKDTLRVRLEEEVREIDAGLKRARQRDQFVLEQ KWAVRPRDIQRAMLDINPQIVHFSGHGAGDEGLVFEDETGQAKLVNSEALARLFNLFA NNVECIVLNGCYSEVQADAISQHIKYVIGMRQAIGDKAAIEFAVGFYDALGAGRSIEF AYNSGCVAIRLAGISEELTPTLKKKPNIGDTILRHNSRYNRTVESSPFSSSFFRVPGI LTSIITIIVTGVLISFFTSHMNSK" BASE COUNT 1129 a 830 c 845 g 1063 t 10 others ORIGIN 1 gaagctctat tccatgaagc tttattctat gcatcttcat agagaattgg tataagatca 61 tcaacaatcg ctaatactaa aataacgttt tgaatcatta aatgacctag aatagaaatt 121 gtccacttga tattagacct taacccttgt gtgaaagcgc ggtggaaggt tatccgtggg 181 gtaaaaccgc caaagaaggg tgagtccagg gttagagcgc ggttagaaga ttattggtaa 241 aaagtaatcc cttaattgtc taacgtcatc ggcaaagtga gagtaaacgc cgtctgccca 301 tccgatgtag aatcaagggt gagatcgcca cgatgtgccc gagcaatttc acgggctaag 361 ctgagtccca gtccgattcc ttctaccttg cgggttcgag caggatcacc gcgatggaag 421 cggtcaaaaa tgcgcgagcg atcgcccacc ggaatctctt ttgaagcatt ggcgatcgta 481 atatggagag ttgtttcagt tttctttgca tgaatctgta tccagccgtt ggcgagattg 541 tatttgatgg cattactgaa caggttttgc agaacttgaa tgaggagatc gcgatcgccc 601 tgtacccgta gcccatcagt aatgtcagtt tgcacactca ggtggggagc caggagttcc 661 acatcttcta acatctccat gagcaactca gacatatcca cctcaaccag aaacaacttc 721 atttgtcccg catctgccaa agacaacagc aaaagtttcc gcgtaatacc actgaggcga 781 cgcacttcat ccaacaaatt acttaagcgc tgttggactt cacttccaga 
gtcaacctgt 841 tgcagcgttc gttccagttc accttgcaga atcgtcagtg gggttttcag ttcgtgagct 901 gcgtcagcac tgaagcggga agcttgtgta aaactgcgtt ccaggcgttc cagcatttgg 961 ttaaacacct gaatcagttc gacaaattca acatccgttg tcccaatcgg gattcgctga 1021 tctagccctt taacagtcac ctgttgaata acgcccgtta attgccggat gggacgcaaa 1081 gcaccaccag aaacaaacca tgccccaaaa gcaactagca aaagcgctcc cgaaattgaa 1141 actaggaaga tattgcgaat ggtacccatc tcttgattaa cggcttgcag actaacggcg 1201 atcgcaacct gaacattggg aaatttaagt gccccaatgc gccaaagtcc tgttgttgtc 1261 tgttgagtga cgaattcagg tgggcgtcgt gagcgaagca acggtggtgg cttgacaggt 1321 ggatctgtac tctcaggttt tggcggtggt tctttaggaa acggtggcaa aggtgtcaac 1381 tcaagacgcc caagcaacag acgattcacc tccttatcga tgggcacttc gtcagattga 1441 taaagtgtat tgccgtttgc atctcgcacc agcacagcaa tagggatttt tgtatttgtt 1501 ccgaaagcat aaggtaatga atctccgtag aactgcaatc gctctcgctg ggcagaacga 1561 gttgcccgca gcaactgatt gaacagttgt gcatcaaggc gagcgatctg agcattgtaa 1621 atctggaacc aggagactgc accaaaccct actaatgcgc ttccagccaa acccgcagat 1681 aatagggcaa tccggagtcg aaacgaacgc agtctcatat nnnnnnnnnn gtctcatatt 1741 aaatccggtt tgcaaaatcg gtatccaacc cctcgaatgc tttcaatcca atttgctcga 1801 tcaatagagt caattttttt acgaatgcgc tgaatacaaa catcaacgac attggtgttt 1861 gggttaaagt cgtagcccca aacgtgttcc aggatttggg tacgagtgaa aacccgtccg 1921 ggagagcgca taagatactc cagaagatta aactcgcggc tggtaagttc tattgcttgt 1981 tgattgcaag tcacttttcg cgtgatccga tcaagcttaa gcgagccaac cgaaagcaga 2041 ttttggcgat cgcccacact cgggcgcaca acagcatgaa ttcgagccgc taactcttcc 2101 acaaaaaacg gtttggcaat atagtcatca gctcccaaat tcaagccgga caggcgatca 2161 tccagttcat tgcgagctgt caacaaaatc acaggagcat ttcgacctga acgccgcagt 2221 tgtttgagaa tccatagtcc atctttccct ggtaccatga tgtcgagcac aatcacatcg 2281 tattcattgt ctaatgcccg cagatagcct tcatcaccgt tgtcgcagta gtctacgaca 2341 aatccctgct ccttcagtcc agcctggaca aaattagcaa tttttgcttc gtcttcgaca 2401 aacagaacat tcacaactat tcattactcg acctattgct ccattctaga ttcattacca 2461 tttggttcaa attacaaaag tgtaattcag aatcaggcaa gctccccaag ctcctattga 
2521 aagcacagat caacattgga gagaacctcg cattgccaca caaatcaagt aatttaaaaa 2581 aagaattagt tatctaaatt actaatataa attaaatata ggttttttaa aatactattt 2641 tatactaatt atttgcttta ttcatggttt atacgtgctc taagagatga gggtttaact 2701 actagcagcg aagcactagt aaggttctgg catatacaat tttcggcaga atgcttgtga 2761 attttattac aacatatctc tacttttcaa acatcctctt aagagtcatg tgattttttc 2821 ggaaacgcca tttttacttt tgtagcttta tccatgcgaa aatcaattac atgagaagca 2881 aaacaaataa agaaatacac attaacattt caaatgactt taaaaagtag tttcggtaaa 2941 ccaaaaagtt ttttataagt cgtgccatgc tgacatttca gctaaataca gggatctcaa 3001 aaaacaacct acctgtgtag agacgcttca aacagcgcgt ctatacatgg gttggtggta 3061 aataaattat cttaactgaa ccgtattggt ttttacgttg gttttcaaaa gtttgtggtt 3121 tattgagttt atatagctgg agttgagtaa atggtgcaaa caattttaat tttggcagca 3181 aatcctaaag acacactacg ggtgcgttta gaagaggagg tgcgtgaaat cgatgcagga 3241 cttaaacgag ctaggcagcg tgatcagttt gtcttagagc agaagtgggc agtgcgaccg 3301 agggacatcc agcgggcaat gttggatatc aatccccaga ttgttcattt ttctgggcat 3361 ggggcgggag atgagggtct ggtatttgag gatgaaactg ggcaagcaaa gctagttaac 3421 tcagaagcgc tagcgagact gtttaatctg ttcgcgaata atgttgagtg tattgttctt 3481 aatggctgtt actctgaggt gcaggctgat gcaatctctc aacatattaa gtatgtgatt 3541 ggcatgaggc aggcaattgg agataaagcc gcaattgaat ttgctgtagg tttctacgat 3601 gcactaggag cagggcgatc cattgaattt gcttataatt ctggttgcgt tgcaattaga 3661 ttagcaggta tttcagagga actaaccccg actctaaaga agaagccaaa tattggagat 3721 acgatactta gacataattc tcgatacaat agaacggtgg aatcttcacc tttttcatca 3781 tcttttttta gagttcctgg tattctgaca agtattatta ctataatagt cacaggagtc 3841 ttaataagtt tttttacatc ccatatgaat tcaaagc // LOCUS NODE_6505_length_3862_cov_17.5292883862 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE   1  (bases 1 to 3862)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3862)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3862
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 191..1099 /locus_tag="DP116_27420" CDS 191..1099 /locus_tag="DP116_27420" /inference="COORDINATES: protein motif:HMM:PF13876.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27420" /translation="MSQHYIGIKQVFAWPEEKSGEAGYHVVYRYGQPDAYHSWSPKAE FERFYLPQGNDPTRVTQAMVDGFVAKCEPATVGGKMTVCSVELANGTLYAESSACVDP ASYDEAVGKGICLDRAKDRVWHLLGFALQWARRGLPPAAAAIVAVMLLAGHVAAADLG SPIPIDKQVAELRADVDALKRDVAEIKGKPATASPPTWAGVRDRVAAGETVWIAVGVA PRLGDVALPARADVHPARHKCYRDAAGVERFECEDGRPSLVASRRVATAAPVMGPTFA NPFGGPPSGCPGGNCAAPAGRLGRRW" gene 1198..1617 /locus_tag="DP116_27425" CDS 1198..1617 /locus_tag="DP116_27425" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27425" /translation="MNPLDVIRKIEESLKTAADLASGAGADTLDAAAGVLDKISHATH DFADFLRNGGHSPTPAPADGSPASFAAAPTSAEVVACCDRLDALKAKCTAAAAPRSFG ASQPVGANPVVDALIAQLVAATIEAVTGWFRRKFPQA" gene 1966..3216 /locus_tag="DP116_27430" CDS 1966..3216 /locus_tag="DP116_27430" /inference="COORDINATES: protein motif:HMM:PF00589.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27430" /translation="MYYRDDAGPHRKKVAPTRNQAEQVAARVNAQLASREPTLLTFTP VGVAELRRQFLDHHEHVLHSAVGTLRRYRSATQHLDDFVRTLPRLPQAHEVRAEAFAA YLRRIEVAPNGHPNASRRKLLANGVRYVLETCRAMYTYAVRWRYLPPYAGNPFAALPL DRMKAEDAKPVFVFTADAELAFLRACTAWAFPIHFTLAKTGLRVGELVHLLIEDVDLA DGWLHVRNKVELGWRVKTGQERVVPLLPEVVAVLRTVVGMRTRGPVFLRERFHARRAS VAGDRRELGKALRDRRGAAGRPLTRAEEAALAGKLWWDAGAVRADKVRQVFVRGMAAI GRPDATCPKSWRHTFATLLQDANVDPLIRQQVLGHKPTLGAGLGMTAHYTHTRPETRR EQVERALRRWPASLAYAAGRPGSR" gene 3213..3416 /locus_tag="DP116_27435" CDS 3213..3416 /locus_tag="DP116_27435" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27435" /translation="MTEPGRRGEHSDTPDGRPTGAAPENRAERAVPARQPGADTSVTG ATVSDRSAGTNRDGPSVGWLDPA" BASE COUNT 532 a 1395 c 1446 g 489 t ORIGIN 1 ggcggttcgt catccccgag gcggcctacg ccgggccggt cggcgggtgg ttcgcagtgc 61 gggccgtcac cgacgagggc ggggtcatcc ccaccccgac cgcctgaccc ggccacgcgg 121 ggacgccagc cggggctaga gccggccggc cgggtgggac gcccggcacc ttcaccctgg 181 gagagtcgca gtgagtcagc actacatcgg gatcaagcag gtgttcgcgt ggccggagga 241 gaagagcggg gaggccggct accacgtcgt gtaccgctac ggccagccgg acgcctacca 301 ctcatggagt ccgaaggccg agttcgagcg gttctacctc ccgcagggca acgacccgac 361 ccgcgtcacc caggcgatgg tggacgggtt cgtcgccaag tgcgagccgg ccaccgtcgg 421 cggcaagatg acagtctgct cggtcgagct cgccaacggc accctgtacg ccgagtcgtc 481 ggcgtgcgtg gacccggcca gctacgacga ggcggtcggc aagggcatct gcctggaccg 541 ggcgaaggac cgggtgtggc acctgctcgg gttcgccctc cagtgggcgc gtcgcgggct 601 gcccccggcg gctgccgcca tcgtggcggt gatgctgctg gccggtcacg tcgccgcggc 661 cgacctcggc tcgcccatcc cgatcgacaa gcaggtggcc gagctgcggg ccgacgtgga 721 cgccctcaag cgggacgtgg ccgagataaa gggcaagccg gccaccgcca gcccgccgac 781 ttgggccggg gtccgcgacc gggtggccgc cggggaaacc gtctggatcg ccgtcggcgt 841 ggcaccgcgg 
ctgggcgacg tggccctgcc ggcccgggcg gacgtccacc cggctcggca 901 caagtgctac cgggacgccg ccggcgtcga gcggttcgag tgcgaggacg gccgcccgtc 961 gctggtggcg agccgccggg tggcgaccgc cgccccggtg atggggccga cattcgccaa 1021 ccccttcggc ggcccgccgt cgggctgccc cggcgggaac tgtgccgccc cggccgggcg 1081 gctcggccgc cgctggtgac cgccgccgct cgggggcggg cgctcaccgg ctgcccgtcg 1141 ccgccctgcc tcggttactg ccggtggtgc gttcgacttc atcccctgga gcctcaagtg 1201 aacccgctcg acgtgatccg caagatcgaa gagagcctga agacggccgc cgacctggcc 1261 tcgggtgccg gggccgacac cctggacgcg gccgccggcg tcctggacaa gatcagccac 1321 gccacccacg acttcgccga tttcctccgc aacggcggcc acagcccgac gccggccccg 1381 gccgacggca gcccggcctc gttcgccgcc gccccgacct cggccgaggt ggtggcctgc 1441 tgcgaccggc tggacgccct caaggcgaag tgcacggccg ccgccgcccc gcggtcgttc 1501 ggggccagcc agcccgtcgg ggcgaacccg gtggtggacg ccctcatcgc ccagctcgtg 1561 gccgccacga tcgaggccgt caccggctgg ttccgtcgga agttcccgca ggcgtaaggc 1621 aagccggcgg gtaaacggag ggctgattac tccgcaaacc gttgaccaat aacactctgc 1681 ggcaggcgtg taggtgtctc gaaaaccgat ttgcccgaaa gggtaacgag ggttcgaatc 1741 cctccctctc cgttaccaag tccgctcctt gatggacttg tgacgcaaag ccagtaacgg 1801 caacgccccg cgtcacggct cccccgacgg tcctgcccgc ctggtgcacg gttggtgcac 1861 gagccgtgca ccggggggag tcccgtgccg ccgaagaagt cccgaccctc tccgcgggtc 1921 cgcatcggaa gggtcagcgt ctacgaacac cacggggcgt ggtgggtgta ctaccgggac 1981 gacgccggcc cgcaccgcaa gaaggtcgcc ccgacccgca accaggccga gcaggtggcc 2041 gcccgggtca acgcccagct cgccagccgc gagccgaccc tcctgacgtt caccccggtc 2101 ggcgtggccg agctgcggcg gcagttcctc gaccaccacg agcacgtcct gcactcggcg 2161 gtgggcacgc tgcggcggta ccgctcggcc acccagcacc tcgacgactt cgtccgcacc 2221 ctgccgaggc taccccaggc ccacgaggtc cgggccgagg cgttcgcggc atacctgcgg 2281 cggatcgagg tggcgccgaa cggccacccg aacgcctcta ggcggaagct gctggccaac 2341 ggcgtgcggt acgtgctgga gacgtgccgg gccatgtaca cctacgccgt caggtggcgg 2401 tacctgccgc cgtacgccgg caacccgttc gcggcgctgc ctctcgacag gatgaaggcc 2461 gaggacgcca agccggtgtt cgtgttcacc gccgacgccg agttggcgtt cctacgggcc 2521 tgcaccgcct gggccttccc 
catccacttc acgctcgcca agaccgggct gcgggtcggg 2581 gaactggtcc acctgctcat cgaggacgtg gacctggccg acggctggct gcacgtccgc 2641 aacaaggtgg aactgggctg gcgggtgaag acgggccagg agcgggtggt gccgctcctg 2701 ccggaggtgg tcgccgtcct ccgcacggtg gtcgggatgc ggacccgcgg cccggtgttc 2761 ctgcgagaac ggttccacgc ccgcagggcg tccgtcgccg gcgaccggcg ggagctgggc 2821 aaggcgctcc gcgaccggcg gggggccgcc gggcggccgc tcacccgcgc cgaggaggca 2881 gcgttggcgg gcaagctctg gtgggacgcc ggggcggtgc gggcggacaa ggtccggcag 2941 gtgttcgtgc gagggatggc ggccatcggc cggccggacg ccacctgccc caagagctgg 3001 cggcacacgt tcgccaccct gctccaggac gccaacgtgg acccgctgat ccgccagcag 3061 gtgctggggc acaagccgac gctcggagcc gggctgggga tgacggcgca ctacacccat 3121 acccggccgg agacccgccg ggagcaggtg gagcgggcgc tgcggcggtg gccggcgtcg 3181 ctggcgtacg cggcggggcg gcccggttcc agatgaccga gccgggtcgg cggggtgagc 3241 actcggacac tccggacggc cgcccgaccg gggcggctcc ggagaaccgg gcggaacggg 3301 ccgtgccggc acgccagccc ggtgccgaca cgtccgtgac cggggcgaca gttagtgatc 3361 ggtcggcggg aaccaaccgg gacgggccga gcgtgggatg gctcgacccg gcgtgacggc 3421 ggccgaggtg ccgcgccaaa cttcgcggac tgaccgtccg gtgacgggcg gtggtggcac 3481 gcggctgggg tcggcgacgg tcgcccaggt atggtgaccc cgcggaacgg gcgaatgtca 3541 tagtggtcgg agtcgatccc gtcgtggcgg gttggccacg gggtggtcgt ccggtccaac 3601 ctccggccgc cgtccggtcg tcgacggccc ccgggcggat cgatcgcggt gacgggtggg 3661 acggttcggc gggcgccccc gccggccgcg ggtcagcccg gggccgggaa gaacgccagg 3721 caggtgccgt cggccggccc ccgggcggtc gcctgccgga ggaggtcggc cgccaactcg 3781 tcgtacgacc ggtccggccg gtggtacgcc tcgaccgcca gggccagccc gcggagccgg 3841 ttcacctcct ccaccgcggc cg // LOCUS NODE_6526_length_3843_cov_5.3991553843 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE   1  (bases 1 to 3843)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3843)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.11.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 17x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/25/2018 16:41:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,693
            CDS (total)                       :: 5,639
            Genes (coding)                    :: 5,207
            CDS (coding)                      :: 5,207
            Genes (RNA)                       :: 54
            rRNAs                             :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 2, 2 (16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted)       :: 142 of 432
            Pseudo Genes (incomplete)         :: 255 of 432
            Pseudo Genes (internal stop)      :: 71 of 432
            Pseudo Genes (multiple problems)  :: 88 of 432
            CRISPR Arrays                     :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3843
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(59..616) /locus_tag="DP116_27440" CDS complement(59..616) /locus_tag="DP116_27440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411045.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glutathione S-transferase family protein" /protein_id="PRJNA477356:DP116_27440" /translation="MLKLYGGARSRASIVQWYLEELEIPYEFVKLDMQAGEHRQPQYL AINPIGKVPAIVDGDFQVWESGAILLYLADKYGKSAHSPEERAELAQWVLFANATLGP GIFVEASREREMPVLMTALNEIFERQPFLLGKEFTVADVAVGSILSYIPIMLKLDLSQ YPSVLNYMKQMSERPAFQKSIGGRR" gene complement(703..1167) /locus_tag="DP116_27445" CDS complement(703..1167) /locus_tag="DP116_27445" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196314.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27445" /translation="MTTVTSYSNAPDLAADDYIVIGLATCFVKEDGEVYQVEVIEPIP SASLETLAKGIPTSFKLALATTLGSVLDEDKPEIPPEFPQTAQFSDDFVERAIAAART YKRRETAKSLIPLGTTYTDFKYSTERKRVLNVSRVVTKEDNVKQHPNTHKVL" gene 1621..2346 /locus_tag="DP116_27450" CDS 1621..2346 /locus_tag="DP116_27450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316975.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27450" /translation="MNIHSSFPFGGEQNHGNPFVSNTTKVRQQQGTNGSSKSTFGDSY LRSDALTKARQGHYTEALALLNQLIHRHPHNAIDYNNRGLIYFQSGETQKAFCDYNTA LQLNPKLASAYNNRANYYAACGELIEAIADYDRALDLNPSHVRAWINRGITLRDLGQY EEAIENFEVALLFEQLNGHVLAERGRTYHLWGDWNCAIADYRRALTELPPTSFKTGEP GSHLRLQVENWLHELLPTQHPAW" gene complement(2542..2946) /locus_tag="DP116_27455" CDS complement(2542..2946) /locus_tag="DP116_27455" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316974.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27455" /translation="MNAFQPSRLPLQPAQQRRVTPRPKRRLRQRSYQVMALETTAKIA VNLVISTAAVSALMQLLPYHWSQQEKLRTIRIDVNQMEERVYQLQGEFSRNFDPRQAK VIMQEQGYRFDPNQRQVVFPKDTIEHKLPESN" gene complement(3184..3321) /locus_tag="DP116_27460" CDS complement(3184..3321) /locus_tag="DP116_27460" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015127716.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_27460" /translation="MSVLNTLILAQVGTSISDTQVYIALLVALIPGFLAWRLSTTLYN S" gene 3634..>3843 /locus_tag="DP116_27465" CDS 3634..>3843 /locus_tag="DP116_27465" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316972.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="carbohydrate kinase" /protein_id="PRJNA477356:DP116_27465" /translation="MNFYLGIDFGTSGARAVVIDEQACIQAERRFPFEKSEISDLASS WERGLFSLLEQIPQQLRREIRAIAIN" BASE COUNT 1103 a 793 c 850 g 1097 t ORIGIN 1 ctttacctga ctttgaggct taactgaact gtattggaat atgagcttaa gcgagtgatt 61 acctgcgtcc accaatactc ttttgaaaag ctggacgctc agacatctgt ttcatgtagt 121 tcaatacaga tgggtattgg cttaggtcta acttgagcat gataggaatg taagagagaa 181 tagaacctac agccacgtca gcaactgtga attctttgcc cagtaagaaa ggttgacgct 241 caaaaatttc atttaaagca gtcatcaaaa ccggcatttc tcgctctcgg cttgcttcca 301 cgaaaattcc tggtcccaaa gtcgcattgg caaacaacac ccattgagct aattcagctc 361 gttcctctgg agagtgggca gatttaccat acttgtcggc taaatacagc aaaattgccc 421 cagattccca aacctggaaa tctccatcca caattgctgg gactttccct attgggttaa 481 tagccagata ctgaggctga cgatgctcac cggcttgcat atcaagcttg acaaattcgt 541 agggaatttc tagttcctct agataccact ggacaattga tgcacgacta cgtgcgccgc 601 cataaagttt tagcatgagc aaatcagtaa acagtttaca accatcagtc atcagtaatt 661 atattgttat ttctgagact tgaggtaaat aactataact gattaaagaa ctttatgagt 721 attggggtgt tgtttgacgt tgtcttcctt cgtgacaact cgcgaaacat ttagcacccg 781 cttgcgttca gtagaatatt taaaatctgt ataagttgtg cccagaggaa tcagagattt 841 agcagtttca cgacgcttgt aggtgcgagc tgcggcgatc gcccgttcca caaaatcatc 901 actaaattga gccgtctggg gaaattctgg aggaatctca ggtttgtctt cgtccagcac 961 agatcccaaa gttgtcgcca aagcaagctt aaaggaggta ggaatacctt tagcaagcgt 1021 ttccagtgat gcagagggaa ttggttctat aacttcgact tgataaactt ctccatcttc 1081 tttgacaaag caagttgcca acccgatcac aatgtaatca tcggctgcta aatcaggagc 1141 gttggagtag gaggtgacag ttgtcataag aatattttat cacaaattcc aaaaagaaaa 1201 atccgaattc cgcaacgcag tatcaaagac tctttttgtt cttttgtgct ctgagacttg 1261 agttggggag ttttttttca tcgggacatt ctggcatttt atcaatttgt tgcccacgct 1321 acttgaatcg tctactaaat tgatatttat attatctgga atttaccttg gggaagacgt 1381 atcaattatt agattaaagc atcaattgag tatgctttac gtttacctgg ttaaagacgg 1441 tcaagcacaa tatggctttg atcgccgtaa actttcacgc tataagggtt tatggcattt 1501 
tcttccatac ttatgcccca tgactcatag acaccttaac aagtgacaca gttagaaaac 1561 cacagtagca agtaacctta taagttgatc tcacagagca agcagctcag gtaggaatac 1621 atgaacattc attcatcttt cccttttggt ggcgaacaaa atcacggcaa cccatttgtc 1681 tcaaacacta caaaagtacg acaacagcaa gggactaacg gtagcagtaa atctactttt 1741 ggggacagct acttgcgctc tgatgctcta acaaaagccc gacaaggtca ttacactgag 1801 gcgcttgccc tgttgaatca attaattcat cgtcatccac acaatgctat tgattacaac 1861 aatcgagggc taatttattt tcaaagtggt gaaacgcaaa aagcgttttg cgattataac 1921 acagcattgc aacttaatcc gaaattagcc agtgcttata acaatcgggc aaattactat 1981 gcagcttgtg gggaattgat agaagcaatt gctgattatg atcgggcttt agatctgaac 2041 ccaagccacg ttagggcgtg gataaaccga ggcataacct tgcgcgattt agggcaatac 2101 gaggaggcga ttgagaattt tgaggttgct ctgctgtttg agcaattgaa tggtcatgtt 2161 ctagcagaac ggggtaggac ttatcatctt tggggagatt ggaattgcgc gatcgccgat 2221 tatcgtcgcg ctttgactga actgcctcct actagcttca aaaccggaga gccgggttca 2281 cacttacgtt tacaagtcga aaattggtta cacgagttgc taccaacgca gcatcctgcc 2341 tggtagacat tgccttgatc gtcactggca tagtctacgt atgaaacaca gaatgagaaa 2401 catcctgata acggtgtaat ggctaacaga tgataactat atagctccta gatagtaagt 2461 gactagctat gaactatcag ctcttagcct tactaccaaa gtcgaaattt ctttggtcga 2521 gtagcaatga caaattagaa attagtttga ttctggtagt ttatgttcta tagtatcttt 2581 tgggaacacg acctgacgct gattagggtc aaatctatag ccctgttctt gcataatcac 2641 cttagcttgg cgcggatcga agttacggct aaattctccc tgcagctgat agacacgttc 2701 ttccatctga ttaacatcaa tacggatagt tcgtaatttt tcttgctgag accaatgata 2761 aggtagaagc tgcataagag cagagactgc tgctgttgaa atgacaaggt taacagctat 2821 ctttgctgta gtttccagtg ccatcacttg ataagagcgt tgacgaagac gtcgcttcgg 2881 tcgaggggtg actcgacgct gttgtgctgg ttgtaacggc agtctagagg gttgaaatgc 2941 gttcatcgtg ataaaaagaa cctcgctaaa tgagtagacg catgccaaat gcacacctgc 3001 gtcttttgca gtttcgttta gctgccatat tacttcaatt ttttcgtact tggtacactg 3061 atcatctgtc aagtaaaaaa cctggcattt atacgtgtac agtctcaaac aatttacgat 3121 attaaaccgc aaaagtccga gaactgtacc ccgtactacc aggtaactca gctcttcaga 3181 atcttatgaa 
ttgtaaagtg ttgttgacaa tcgccaagcc agaaagcctg gaatcagcgc 3241 tacaagaaga gcaatgtaca cttgggtatc tgagatagaa gttcctacct gagcgaggat 3301 cagagtattc aggacagaca tgatttgaaa cctcccaata tcaacagatt ttgaatactt 3361 cgtttactac ttttccctgt tttgccaaga ttggggagta ggttgttaca agatgaaaca 3421 ataactagtc attagtgagc cagcactgca gagagaagtg ccttgcggag ccagcgccgt 3481 gcggaggttc cctccgttga ggcgactggc gtcgggttcc ccgcgttgtg gcaacttcgg 3541 agagggtttc cctccgtagg tgactggcga acccgtaagg gtcattagtt attagtcatt 3601 agtcatgata aattataagt aaattttata tttatgaatt tttatttggg aattgacttt 3661 ggcacgtccg gtgcaagagc tgtggtgatt gacgagcaag cttgtattca agcggagagg 3721 aggtttccgt ttgagaagtc agaaatatct gacttagcaa gttcttggga gagaggtttg 3781 tttagcttgc ttgagcaaat ccctcaacaa ttgcggcgag aaataagagc aattgcgatc 3841 aac // LOCUS NODE_6568_length_3795_cov_4.5216583795 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3795) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3795) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3795 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 299..372 /locus_tag="DP116_27470" tRNA 299..372 /locus_tag="DP116_27470" /product="tRNA-Met" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:333..335,aa:Met,seq:cat) gene complement(442..2643) /locus_tag="DP116_27475" CDS complement(442..2643) /locus_tag="DP116_27475" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318327.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="amylo-alpha-1,6-glucosidase" /protein_id="PRJNA477356:DP116_27475" /translation="MVDLDTREWLLTNGLGSFASGTVCDIRTRTYHGWLFAATNPPSG RTLVLSHLEASLELSDRVVALGTNFWGSCQIEPKGYQLLRLFDINPVPKWIWGEDDWQ LTRHLVMPYGLGSRRGLGEPVRSWGLPKWSNWRGFPHERLPNPEGVEKVGGEREGSPH LWTPTSPPFQFCHRILVQYRYEGTQTAKLKLRVLIGDRDFHHQQKVIPELHFSQLLGQ QQVCLQAISSHNFGTPWHLRWTQGNYQQDPFWYWDYKLPEETLRGLGDREDLYSPGYL TVTLNGGDAVTLEARVGFPSELQTPLTCETFAEVVEAEQERLSQIFGWENWQTEVGRD HTLISPSSSSTPSHLWRRLLQAADQFIVYRASIAGPTVIAGYHWFNDWGRDTLIALPG LTLTPQRFDLAKGLLQTFGRYCSHGLIPNAFPDADGEPFYNSIDAALWWIETLGLYLE ATQDWQFLAEQYPVVQQIYKAFIGGTRYNIQADVTDGLVSWYAPAVALTWMDAVVDGQ PVTPRRGKPVEINALWYSALCWASRWAEILSEQEAMLDRGRLAKQALRYTQLAQQVKA SMQQYWNPQLGYLYDVIEPDDGRNSQIRPNAVLALSLSHCAFSQQQGQKILDLARSCL LTPYGLRSLAPTDPEYIGKYKGNPEKRDHAYHQGTVWSWLIGPFIRGWERFYPGLPLP FDWQPLLKHFLSDACLGSISEIFEGDEPHTPRGAVAQAWSVAEVIRHLKKW" gene 2965..3576 /locus_tag="DP116_27480" CDS 2965..3576 /locus_tag="DP116_27480" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017653603.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peroxiredoxin" /protein_id="PRJNA477356:DP116_27480" /translation="MPLSYASEGCLRVGQQAPDFTATAVVDQEFKTIKLSDYRGKYVV LFFYPLDFTFVCPTEITAFSDRHEEFKNLNTEILGVSVDSEFSHLTWIQTDRKSGGVG DLNYPLVSDIKKEISAAYNVLDPSEGIALRGLFIIDKEGVLQQATINNLAFGRNVDET LRILQAIQHVQSHPDEVCPAGWQPGDKTMVPDPVKSKVYFAAV" BASE COUNT 1055 a 904 c 841 g 995 t ORIGIN 1 caagttctga cgatgaatgt agaatcccct cgcctttagg caggggagtg tcaatgctta 61 ttctatagag ggagtttggc tccctttttt gttgtttgtt atggggaact cttaacaggg 121 aacagggaac tcttaacagt gaatactgac caaggttatg gaaaacaaat actggtaact 181 gataactgat aactgctata gggatacatg ggaaagtgaa ttgattttta gctgtcaatt 241 cttgagtatt gacttcaaca tttttgtatg ctacattaaa taatcaaatc atgtctatgg 301 ctcagtagct cagttggtta gagtagggga ctcataagcc cttggtcgtg tgttcgagtc 361 acacctgagc cattattgcg agtgttaaac aatgaatagc cagtgctaag taaaagtaac 421 ttagcaccta gccgttctaa gttaccactt cttcaaatgt cgaatgactt ctgcaacaga 481 ccaagcttga gctaccgctc ctctaggtgt gtgcggctca tcaccttcaa aaatttcgga 541 aatagagcca agacaggcat cagatagaaa gtgttttagc aggggttgcc aatcaaaagg 601 caatggaagt cccggataaa aacgctccca accacggatg aaaggaccta ttaaccaact 661 ccacaccgtg ccttggtgat aagcgtgatc gcgtttttct gggttaccct tatatttgcc 721 aatgtattct ggatctgttg gagccagact gcgaagacca taaggagtga gcaagcaaga 781 acgagcaaga tcaagaatct tttgcccttg ctgttgagag aacgcacaat gtgagagtga 841 aagtgctaaa acggcatttg ggcgaatctg agaatttcgc ccgtcgtctg gctcaatcac 901 atcgtacaag taacccagtt gaggattcca gtactgttgc atagaagctt tcacttgttg 961 tgccagctga gtgtaacgca aagcctgctt agcaagacgc cctcgatcaa gcatagcttc 1021 ctgttcactt aatatttctg cccaccgact tgcccaacat agcgctgaat accatagagc 1081 attaatttcc actggcttac cacgacgggg agtgacaggc tgcccgtcta ctaccgcatc 1141 catccaggtc agagctacag caggagcata ccaactcact agcccatcgg taacatcagc 1201 ctgaatatta tagcgtgtcc ctccgataaa agctttgtag atttgctgca ccactggata 1261 ttgctctgcc aaaaactgcc agtcttgtgt cgcttccaag taaagcccta aagtttcaat 1321 ccaccatagt gccgcatcaa tactgttata aaacggttcg ccatccgcgt caggaaatgc 1381 attgggaatt aaaccgtgag 
aacagtaacg tccaaaagtt tgcagcagtc cttttgccaa 1441 atcaaaacgc tgtggagtca gcgtcaaccc tggtaaggca attaaggtat cgcgccccca 1501 gtcattaaac cagtgatatc cagcaatgac ggtgggacca gcaatagaag cacgataaac 1561 tataaactga tctgcggctt gtagtagtcg ccgccagagg tgcgatggag tcgaggaaga 1621 tgagggagat ataagagtat gatctcgccc tacttctgtt tgccaatttt cccagccaaa 1681 aatttgagac aaccgttctt gttctgcttc gacaacttcc gcaaacgttt cacaagtcag 1741 gggagtttgc agttcactag gaaaacccac tcgtgcttcc aaagtcaccg catctcctcc 1801 attaagtgta actgttaagt aaccagggct atagaggtct tcgcgatcgc ccaacccccg 1861 cagagtttcc tcagggagtt tataatccca gtaccaaaat gggtcttgtt gatagtttcc 1921 ttgcgtccag cgcaaatgcc aaggtgttcc aaagttgtga gaactgattg cttgcaggca 1981 aacttgctgt tgtcccagca attgcgagaa gtgtaattct ggaataactt tttgttgatg 2041 gtgaaagtcg cgatcgccta taagcactcg cagcttcaat ttagctgttt gtgtaccttc 2101 atagcgatat tgaaccaaga tacgatggca aaactgaaat gggggagatg tgggagtcca 2161 caagtgaggg cttccttccc tctctcctcc tactttctcc accccttcgg ggttcggcag 2221 tcgctcatgg gggaaaccac gccagttgct ccacttgggg agaccccaag accgcactgg 2281 ctccccaaga cctcggcgac tacccaaccc atacggcatc accaaatgtc tagtcaactg 2341 ccagtcgtcc tcaccccata tccactttgg aactgggtta atatcaaaaa ggcgcagcag 2401 ttggtagccc tttggctcta tctgacaact gccccagaaa tttgtcccca atgccacaac 2461 ccgatctgat agttctaggc tagcttctag gtgcgaaagc accagagtac gcccagaagg 2521 tgggttagta gcggcaaaca accagccgtg ataagtgcga gtccggatat cgcaaactgt 2581 accactggca aaacttccta agccattcgt tagtaaccat tctcttgtat ctaaatccac 2641 gattgcctac tcaaaatcgt tatgagtatg atatattcgt aacagaacgt aatcaaactg 2701 taagtccagc cccgcaggta gtgagatttg ttctgtcgga cagtctccct gctcaagacg 2761 gtctttgcgc ccgattcgtg cgtgggcata cccgtatctc ctgcgaagac gctgggcgga 2821 cggcgtagca gtgccgcatt tatagggctt ttccactaat tgcggactgg ctcttgcgaa 2881 gggcgatgac gagaacgggt gagattacct gacccgaact ccaacaatac tagacagata 2941 ataaattgaa ggagaattag gttaatgccc ctcagttatg catcagaagg atgcctccgt 3001 gtaggtcaac aggctcccga ctttacagca acggctgtgg tagatcagga atttaaaact 3061 ataaaacttt ccgactatcg cggcaagtat 
gtcgtactgt ttttctatcc tttagacttt 3121 acctttgttt gtcccactga aattacagcc tttagcgatc gccacgaaga atttaagaat 3181 cttaacactg aaatccttgg tgtttccgtt gatagtgagt tttctcactt aacatggatt 3241 caaacagatc gcaagtctgg tggcgtcggt gacctcaact accccttagt ttccgacata 3301 aaaaaagaga ttagcgcagc ttacaacgtg cttgacccat ctgagggtat tgccttgcgc 3361 ggtttgttca ttattgacaa agaaggtgtc ttacaacaag caaccatcaa caacctagct 3421 tttggtcgca acgttgatga gaccctgcgg atcttgcaag ccattcaaca cgttcagtcc 3481 catccggacg aagtttgccc tgctggctgg caaccaggcg acaagacaat ggttcccgac 3541 ccagtgaagt ccaaagttta cttcgctgct gtttaagttt caaaggtaat acagtagtag 3601 tagcaggctg actgctacta taaactcact caaccattac cacagataaa cacagacaaa 3661 tcaaattcgt aatcaaataa tccgcagtct attttatcat gaattgcgag ttaaaaatta 3721 ctcattattt ggttaaactc cgtgtatatc tgtggtttgt tatttaagaa tttaataata 3781 tcaaaaaaaa atggt // LOCUS NODE_6581_length_3787_cov_5.2098073787 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3787) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3787) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3787 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 145..1608 /locus_tag="DP116_27485" CDS 145..1608 /locus_tag="DP116_27485" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017311744.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="alpha-amylase" /protein_id="PRJNA477356:DP116_27485" /translation="MEIQTPDWVKHAVFYQIFPDRFARSKQPRKRLLQNAAWEDWEAM PTLQGYKGGDLWGILEQLDYIQDLGINAIYFTPIFQSASNHRYHTHDYYQVDPMLGGN PAFKELLDAAHARNIKVVLDGVFNHSSRGFFFFHDVLENGPHSPWVDWFKIEDWPVSP YNGEFPANYVGWDGNRALPVFNHDNPEVREYIMEIAEYWVKFGIDGWRLDVPFEIKTP GFWQEFRQRVKAINPDAYIVGEVWGDSRQWLDGTQFDGVMNYLFAGPTIGFTAGDRVV MEQVQSRDYKPYPPLFAAEYAEKIERLLKLYPWEIQLTQLNLLASHDTARLLTIAGGD KASGELATLLLLTFPGASSIYYGDEVGLPGAIDPDSRRGFPLEANWDREIFQTHRELI AIRHAYPCLRTGDYKVLYAQGALYIFARVLATEELIVAVNVGTAEAKGNVDITSLNTQ PNKLLYGAAEVEWSVEGESRNLTLTVPPRSGSIIGVG" gene complement(1700..2119) /locus_tag="DP116_27490" CDS complement(1700..2119) /locus_tag="DP116_27490" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998778.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="fasciclin domain-containing protein" /protein_id="PRJNA477356:DP116_27490" /translation="MADLVETAINAGNFNTLVKAVEAAELVEILKSPGSYTVFAPTDD AFNNLPPGTLDSLLQDIPKLKKILMYHVAYGDVRSDDLIQIDHAETLEGSIVAIDSAD GKVKVNDANVLKTDIITDNGVIHVIDAVLMPAMVAGK" gene 2271..2507 /locus_tag="DP116_27495" CDS 2271..2507 /locus_tag="DP116_27495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456735.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit omega" /protein_id="PRJNA477356:DP116_27495" /translation="MLKRSKFDTTQSQVMHRAEDLIGAASNRYRITVQVANRAKRRRY EDFDNADDVMMKPVLRAIIEMSDELTQPEIIGEV" gene 2680..3219 /locus_tag="DP116_27500" CDS 2680..3219 /locus_tag="DP116_27500" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27500" /translation="MIRGVGIVLISLLLLSSSQRADAQRVSPGDVWQAVYQQLPDLPR ENQYISKETGKVAENNTLVSRMIWYHSYLKGRAPNYRLDWKLTLADYMSANEVMYETT YPGKETLRKNPFDGDRAAIARLNRRQRDALVQALVNVFSPKSQNTPASTPTPSSSQPE DTTPTPRRVPKQGGAQLLK" gene 3272..3637 /locus_tag="DP116_27505" CDS 3272..3637 /locus_tag="DP116_27505" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011318807.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1818 domain-containing protein" /protein_id="PRJNA477356:DP116_27505" /translation="MERVLKSGLGWRIGWNPAAPEFKGLVGTDDWAIELTEAELNDFC RLLTQLADTIRQIATELMDEEKITCEAESDLLWMEVEGYPHAYSLRFILKTGRCAEGK WDDSAVGGLLKASQMLKVF" gene 3690..3763 /locus_tag="DP116_27510" tRNA 3690..3763 /locus_tag="DP116_27510" /product="tRNA-Pro" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:3724..3726,aa:Pro,seq:ggg) BASE COUNT 1036 a 781 c 905 g 1065 t ORIGIN 1 ccctgttccc tgttccctgt tccctgttcc ctgttccctg ttaagcgtgt ttcttgagag 61 taaccgtgaa attttctacc acaagtgtga gtcgtaagta ccagagatct gcaacgattg 121 ctgtggagaa atagagagtg attcatggag attcaaacac cagactgggt taaacacgct 181 gttttttacc aaatctttcc agatcggttt gccagaagca aacaaccccg caaacggctt 241 ttacaaaatg ctgcttggga agattgggaa gctatgccta cactccaagg ttataaggga 301 ggagatttgt ggggcattct agagcagtta gattatatac aagacttggg aattaacgcc 361 atttacttca caccgatttt ccagtcggct agcaatcacc gctatcacac acacgactat 421 tatcaagtag atccaatgct ggggggtaac ccggctttta aggaattgct agacgctgct 481 cacgcgcgga atatcaaagt cgttctggat ggggtattta accattctag tcgtggattt 541 ttctttttcc acgatgtttt ggaaaatggt cctcattcac cttgggtaga ttggttcaaa 601 atagaagact ggcccgtttc tccttataat ggtgagtttc ctgctaatta tgtgggttgg 661 gatggtaatc gagcgctgcc agtgtttaat catgataacc cggaagtgcg agagtatatc 721 atggaaatcg ccgaatattg ggttaaattc gggattgacg gctggcggtt ggatgtgcca 781 tttgagatta aaactcctgg tttttggcaa 
gagttccgtc agcgagtcaa agctattaat 841 cccgatgctt atattgttgg cgaagtttgg ggagactccc gtcaatggtt ggatgggact 901 caattcgatg gggtgatgaa ttatttgttt gcaggaccga ctattggttt tactgcgggc 961 gatcgcgttg ttatggaaca agtacaaagc cgcgactaca aaccctaccc acccttgttc 1021 gccgctgagt atgctgagaa aattgaacga ctcttgaaac tttacccttg ggagattcaa 1081 ctgactcaac taaatctact tgcgagtcac gatacagcac ggttgctgac tattgctggt 1141 ggtgataaag caagtggaga attagcaact ttactgcttc tgacctttcc tggtgcttcc 1201 agtatctatt atggagatga agttggttta ccaggagcaa tagatccaga ctctcgtcgt 1261 ggttttccac tagaagctaa ttgggacagg gaaattttcc agactcatcg cgaattaatt 1321 gccataagac atgcttaccc atgtttgcgt acaggcgatt acaaagttct atatgctcaa 1381 ggagcacttt acatctttgc gcgagttttg gcaacagagg aattgattgt tgcggttaac 1441 gttggtactg ctgaggcaaa agggaatgtt gacatcacaa gtttgaacac tcaacctaat 1501 aaacttttgt atggcgcagc agaggtggag tggagtgttg agggagaatc taggaatctg 1561 actttaactg ttcctccacg ctcaggctcc attataggtg tagggtgaat gaagaaaaat 1621 tcagctatac gaattcagaa taactctcgc gtttgaggtg acttctccta cccttagagg 1681 ggtgcagtag attgcacccc tactttccag caaccattgc aggcattaac actgcatcaa 1741 taacatgaat gacgccattg tccgtaataa tgtcagtttt taggacattg gcgtcattca 1801 ctttcacttt gccatcagca gaatcaattg ccacaattga cccctctagt gtttcagcat 1861 ggtcaatttg gattaaatcg tcagacctaa catccccgta ggctacatga tacatcagaa 1921 tctttttcaa tttggggata tcttgcagca atgaatctaa agttcctggt ggtaggttgt 1981 taaatgcgtc atccgtgggt gcaaagactg tataggaacc aggacttttc aaaatctcta 2041 cgagttctgc agcttcaaca gccttcacta gcgtgttgaa atttccggcg ttgatagcag 2101 tttcaacaag gtcagccatg agataatgtt tgatggttac aaaatactta tagagcgata 2161 aaaaagtatc aacctctatc aacagaaagg ttgcttgtaa ctaatcatgg ggcagtaagc 2221 atgttggtgc tgctaaactg aagagaatga cttcactgaa ccagtaattt atgctaaagc 2281 gttctaagtt cgacacgact caatcgcaag ttatgcaccg tgctgaggat ctcattggtg 2341 cagcctcaaa tcgctaccgc attacagttc aagtggcaaa tcgtgcgaag cgtcggcgtt 2401 atgaagactt tgacaatgca gatgatgtaa tgatgaagcc agtactgagg gcaattattg 2461 aaatgtctga tgaactgact caaccagaaa ttattggtga 
agtctagagt gatgtctacc 2521 ggaaagaagc aggagaaaag gaagcaggtg agacagcgct gcgggagggt ttccctcgtg 2581 ctggctctgc gaacccgaag ggggaagaaa atactccctc atctccctga ctcccttact 2641 tccttactct ccctacctcg ccatctcccc ttccgggaca tgattagggg agtagggata 2701 gtactaatta gcttacttct tttgagttct tcccaaagag ctgacgctca aagagtcagt 2761 cctggtgatg tttggcaagc agtttatcaa caattacccg acttaccccg agaaaatcag 2821 tacattagca aagaaacagg aaaagttgca gaaaataaca ccttggtcag tcgtatgata 2881 tggtatcact cctatttgaa aggacgtgca ccaaattatc gactggattg gaagctgact 2941 ttggctgatt acatgagtgc caatgaagtg atgtatgaaa cgacttatcc tgggaaagag 3001 actctaagaa aaaatccttt tgatggcgat cgcgctgcta ttgcacggct aaaccgtcgc 3061 cagcgggatg cgttagtgca agctttggta aatgttttta gtccgaaatc tcaaaataca 3121 ccagcttcta cccccactcc ctcctcatca cagccagagg acacaacccc aactcccaga 3181 cgagtaccca aacaaggagg tgcgcagttg ttaaagtaaa aagtcaaaag attttttcac 3241 tttttacttc tgatttttta ctttttcaac tatggaacgc gttctcaaaa gtggacttgg 3301 ttggcgtatc ggctggaacc cagcagcgcc agagttcaaa ggtttagtgg gtacagatga 3361 ttgggcgatc gagttaactg aagctgagtt gaatgatttt tgtcggctac taacgcaact 3421 agcagacacc atcaggcaaa ttgcaactga attaatggat gaggaaaaga ttacctgtga 3481 agctgaaagc gatttattat ggatggaagt ggaaggttat cctcatgcct atagtctgcg 3541 ttttatcctg aaaacagggc gttgtgcaga aggtaaatgg gatgattccg ctgttggtgg 3601 tttgttgaaa gcctcccaga tgctcaaagt tttttaaatc tagaccttga taattgtggt 3661 ttttgatgtt atacttggaa agttgactac ggggcgtagc gcagcttggt agcgcgccac 3721 tttggggtag tggaggtcgt gggttcgaat cccgccgctc cgattgaagg ttcaatcact 3781 gtggctg // LOCUS NODE_6622_length_3742_cov_4.1299163742 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3742) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3742) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3742 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(93..1028) 
/locus_tag="DP116_27515" /pseudo CDS complement(93..1028) /locus_tag="DP116_27515" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862954.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene 1401..2082 /locus_tag="DP116_27520" /pseudo CDS 1401..2082 /locus_tag="DP116_27520" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411735.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" assembly_gap 1755..1764 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(2146..2400) /locus_tag="DP116_27525" CDS complement(2146..2400) /locus_tag="DP116_27525" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411827.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27525" /translation="MKLDSSTSDCAIEGFWHNLDWLRKSFSTESPDDFQAYSWVKLLE LPNPYSFDEALLLCHVSRQEWLAWIPDHGEAQLHISQFSH" gene 2613..3350 /locus_tag="DP116_27530" CDS 2613..3350 /locus_tag="DP116_27530" /EC_number="1.3.7.5" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458964.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phycocyanobilin:ferredoxin oxidoreductase" /protein_id="PRJNA477356:DP116_27530" /translation="MTTTPLPSLREQQHPLIRQLADCIEAVWHKHLDLSPYHLPNEFG YVEGRLEGEKLTIENQCYQTPQFRKMHLELAKVGNMLDILHCVMFPRPEYELPMFGCD LVGARGQISAAIADLSPVNSQYTLPKSYTSALAALPQLNFSHPRELPEWGDIFSDFCV FVRLSSPEEENLFLSRTREFLEIHCMQATASQPVPSEQAMLSLAGQRNYCTKQQQNDK TRRVLEKAFGSDWAEQYMTTVLFDMPS" BASE COUNT 1000 a 794 c 813 g 1125 t 10 others ORIGIN 1 gcaggtggag tcggggaaac aggggaagca ggtggagaaa taactccttc actctctccc 61 tccctccttt ttcttgtccc ctcatttcct acctgaccaa aattactcag attttccgca 121 gccttacgtc gttcacgctc aatttgtcgt ggcaaatcac ctgggagttg agatttctta 181 gtcagggctg tatcaaaatt cttaatcgct tcctgcatca gttgcgagtt tttctcctta 241 ttggcttttg cccgaaggat ttgagcttta agataataca actctgggtt atctgttgtc 301 acctttaaag cacggttgac atagtctagt gcctgagagt actgttttaa atcccgataa 361 gcaagagcga taccccgatc tgccaaatac tggggagcag cattcttttc tagcttttga 421 attgcctgtt ctgggttagc aaagggcaaa ttgacagcta gcattaaatc catgtatcct 481 ttgactaagt ttagttctgg atcgttagga gaaacagctt cagctttgtc caaatgttga 541 tagacttgct gtagtctagt cagggcttgt ggtgcaccct tgactgtccc ctcacgttgt 601 agaatgacag acccttctaa aaatagacca acagcagcat acagatttcc gcgtaaggga 661 tcactagaaa ttaaattttg tgcggtttgt gatgtttttt tgctgtaagt gtctagtgtg 721 gccaaatcct tattcgtata cgctaaagat gctttcatag cataggctaa aggttccttt 781 ggctctgtag ataatgcttg ttgtaagtaa cgatctccag cttgatagtc tccttgttgg 841 aaaatggctt tgaatgctgc ctctgttttg ttgccaatat tacgaggctc tttcatccga 901 aagggatctg cagcccaaga aggatttgcc cagatgttca gtgcaatggt agctgaaaaa 961 gccactataa tagagcgata aaatttggca ggtacaattt tttgcaaacc tgggaaccgt 1021 gcatccattt tttgcctcag tatatttacg tcttatattt gttttttaga tgacttttaa 1081 ttcacggaat gtggatctgt ttcaggcaga attttgtata ctgtttgact ttggtaattt 1141 ttctaagttc ccaagcgtgt gttttggcta ttgttcaaga ctcattttgc cacaatgaat 1201 gaaagtaggg gtgacggttt tcgccgttag cttggctcca ttctaaccag tggtgcgtaa 1261 ttttggtcca gatgcagcat gtttcttgct gcaatagaga cgttgctagc ctcttgaatc 1321 cttgtaaatt 
tttcggaatt ctacagtagc tggcagtgta gttaataaat atgttgctga 1381 ctttttggta aacttaacaa atgctctatc tgagaaacct aatttatcat cccacggctt 1441 gcccgacagc tattctcaaa tcaatcaact tggaattagc atcccaacaa ctgggtttga 1501 ttattggacc gagtggttct ggcaaaagta cgttactcga aattttgtct ggacttgccg 1561 aaccgacatc aggttcactc ttctggcgcg agcaagaact gacagctgag cagatgcaac 1621 aactggctgg cttggtcttt cagtttcctg agaggcattt ttgcggcggt actctcttag 1681 aagaattgcg cttgggacat ccagagttag gaaaagagcg agttacgcaa gcacttggag 1741 aagtaggctt agagnnnnnn nnnncattta tcgcttagta catcacctca tgctttgagt 1801 ggaggtcagc agcggcgttt agctttggcg gtgcagttga ttcgccagcc acatctatta 1861 ttgttagacg aaccgacagc tgggttggat tggtcaatgc gtcggcaact ggtaaattta 1921 ttagcgaaac tgaaaaaaaa ttggacactg ctggttgtga cacacgatgc aggagatttg 1981 ttaccaatcg cagaccgttg ttggactctc aaccacggtg aactcaaatc agttgatcct 2041 ttagcattgg gtgcccagaa agaacctcaa ccagcagtgt gatttagcag tggggatgag 2101 ggacaacaaa agcagaccat cttgtgcagg gggaataatt cttgactaat gactgaattg 2161 actgatgtgc aattgtgctt ccccatgatc tggaatccag gctaaccatt cttgacgaga 2221 aacgtggcac agtagcagtg cttcgtcaaa actgtagggg ttgggaagtt ccaacagctt 2281 tacccaacta taagcctgga aatcatctgg tgactctgtg gagaaggact tccttaacca 2341 atccagattg tgccagaagc cttctatggc acaatccgat gtacttgaat ctaatttcat 2401 ggctctgctt agccttcgct aatggactaa gacaaaatta tttctgtaac taagaataca 2461 aaataactcc tatttttgaa aagtgagtgt tttataactt tacatttgtc agttgcaagt 2521 gtttggttgt gttgcgagct acttcgggtg agactttgtg gacggtagac taataactga 2581 atcacattac ccagtaacaa atcccataag atatgacaac aactccttta ccatcgctgc 2641 gcgagcaaca acatccgctg attcgtcaac tagcagattg tatagaagca gtgtggcata 2701 agcacttgga cttatcacct tatcatttgc ctaatgaatt tggatatgtg gaaggtcggc 2761 tagaaggcga gaagctgacg attgaaaacc agtgctatca aacaccccag ttccggaaaa 2821 tgcacctgga actggcaaag gtgggaaata tgctggatat attgcactgc gttatgtttc 2881 cccgtccaga gtatgaacta ccaatgtttg gttgtgattt agttggggcg aggggtcaaa 2941 taagtgcggc gatcgcagac ctttctccag tcaactccca gtacactctc ccaaaatcat 3001 atacttctgc actggctgca 
ctccctcagt taaacttttc ccacccaaga gaattacctg 3061 agtggggaga tattttttca gacttttgcg ttttcgttcg cctcagttct cctgaagaag 3121 aaaatctatt tctctctcgc actcgggaat ttttagaaat acattgcatg caagccacag 3181 catcccaacc cgttccatca gaacaggcga tgctaagtct ggctggacaa cgtaactact 3241 gcactaaaca gcagcagaac gacaaaaccc gtcgcgtact ggaaaaagct tttggttcag 3301 attgggcaga acaatatatg acgacagttt tgtttgatat gccaagttag tccaaactca 3361 acagtcaaaa acttcttgtc ttccctaccg cccccatctc cctatctgta ggttctgcta 3421 cgttttcaag caacttacaa atgtagaatt catttaactt aagcaaaatc gagtcacaaa 3481 atccttcaat gtaagtttca tttccaaaca gcttgtctat gatcttcaga gttaagttga 3541 aatcagctgc cgttcactca aagccaatca agttacctct ttagtcatac tttttttcta 3601 acacacaatg tagttgttac ccagttttgt ttgctatcta tttcagatag acaaaccaaa 3661 ctgttgtaac tgtgttagga ctcacgcaag aactagtcgt aaaactctta ttcctatcct 3721 agggggtgta ggggtgtagg gg // LOCUS NODE_6641_length_3725_cov_4.9735693725 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3725) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3725) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3725 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(91..903) /locus_tag="DP116_27535" CDS complement(91..903) /locus_tag="DP116_27535" /EC_number="3.6.1.22" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015188906.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="NAD(+) diphosphatase" /protein_id="PRJNA477356:DP116_27535" /translation="MHRTFIPGIAPPLLQSEPAWWFAFVGNKLLVRAEAKVSEIPNLI SLTEIGLEPVRTQFLGTLDGQPCYSAELPKDAIAPDGMVLQGLRELYGTLDEQLYALS GRAIQIVEWDRTHQYCGHCATGTTQLSHERAKRCPKCGLVNYPRLSPAIIVLVSRGEE LLLARAPRFPPGMYSVLAGFVEPGESLEETVVREVREEVGIEVKDIRYFGSQPWPFPN SLMIGFTATYASGDIVIEPQELVDAAWFSKDNLPQIPPKLSIARKLIDSFVL" gene 1273..2517 /locus_tag="DP116_27540" CDS 1273..2517 /locus_tag="DP116_27540" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019507856.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="PRJNA477356:DP116_27540" /translation="MATKRITFRLYPNKAQTNKMHYWRRLHKDLYNACVEHRKTSYKK FGKSVDYFDQQNCLPEFKEYWEEYKELGSHALQDTVKRVDFAFKRFFKLKSGYPKFKS SRYYKGWTYPCSSGWKACTNGKNGYLKISNLGNLKMRGQARDWGKPKTCTIIFKQGKW YASITVDCVPTRPQTDTGAVGLDFGTHHAIADSNGNVIENPRFVKVAQTKINQIAKTS RRKRPPSKGVKASRRWRKANKSVAKIQSKVARQRQDWQHKVSTQIVSCNSLVATEKLN LKGMTRSSLGKNKRQKSGLNRSLLDVAIGNLKELIKYKITEAGGFYIEIPTVKVKPSQ TCPNCGHQKKKTLAERVHYCEKCDYQCDRDVAAAMVMLNYARGQELSSTDADESTSTW CGSFKQVAQMKRQKQLAQPLGG" gene 2567..3169 /locus_tag="DP116_27545" CDS 2567..3169 /locus_tag="DP116_27545" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015203585.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="2OG-Fe(II) oxygenase" /protein_id="PRJNA477356:DP116_27545" /translation="MNYFSQHQDAFPITYLNDLRGEILACPYLATNNLNRDFVDTKGF SVVFQRSEIAEVERRFPFFKPYLDQALQPTCNAFYLNPLLLKEGSRVDPHIDRSLRSY CKTIEPPVVVSVLYVQVPSNLQGGELLLRRHKQQVGQIKPQANTLLYFQGDLTHSVNA VKTTGTRLSLVCEQYSLSEIELQDIPAFTVESRVVKVKRR" gene complement(3217..3570) /locus_tag="DP116_27550" CDS complement(3217..3570) /locus_tag="DP116_27550" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27550" /translation="MHEISPQTVQAAQKHALQSLEHGKSVEEVGKRLQSDDQTKPEIG QSIEASGKTIQKQAQESLEKAQQLKEDPSVEVFSESAQAHINASQNHIEAVKEFQKQV RTHLDDHDRSKSKHE" BASE COUNT 1057 a 845 c 821 g 1002 t ORIGIN 1 gggaacaggg aacagggaac agggaacagg gaacagggaa cagggaacag ggaaatcccc 61 tcagcgtggc ggaacgccat agggacttgg tcagagtaca aaggagtcga taagcttgcg 121 agcaatactc aacttaggag gaatctgcgg taagttgtcc ttactaaacc aggcagcatc 181 taccaattct tgcggttcga tgacgatgtc accacttgca taagtcgctg tgaatccaat 241 cataagtgag ttgggaaatg gccagggttg cgagccaaaa tagcgaatat ctttcacctc 301 aatccctact tcctcgcgga cttcacgcac caccgtctct tccaacgatt ctcccggttc 361 cacgaagccc gctagcacgc tatacatccc cggtggaaac cgaggggcgc gagctaacaa 421 aagttcttcg ccgcgagaaa cgagaacgat aatcgcaggc gagaggcgag gataattcac 481 taatccacac ttggggcaac gtttggcacg ctcatgggat aattgggttg tcccagtcgc 541 acagtgcccg cagtactggt gagtgcggtc ccattccacg atttggatgg cgcgaccact 601 cagcgcatac aactgttcat ccaacgttcc gtacaattca cgtaatccct gtaagaccat 661 cccatcgggg gcgatcgcat ccttgggcaa ttctgccgag taacacggtt gaccatctag 721 ggtgccaagg aattgggttc gcactggctc caagccaatt tctgttagac tgattagatt 781 aggaatttcg ctgacctttg cttcggcgcg aaccagcaat ttattgccaa caaacgcgaa 841 ccaccaggcg ggttcagact gtagtagggg tggagcaatg ccagggatga aggttcgatg 901 catatcagta gaatgcgatt accttgatcg tatcaccgcc gctctcaccc aaaaccatac 961 aataataata 
taccctttcc taagtaggat tatttatcaa aatctgaaca gaatcggcta 1021 tccttcccca ccccgcctga cggctgaggg tggggaagga tagccggtca attaaggggg 1081 ttcaacgccc ccttaattga caatttccgt attaactcac gaagaacatc tgactgagtt 1141 ctctctgtca actgacaaaa ttgttgtaaa tgctcaaatt ctctgtctga tattcgcgtt 1201 gtgacttgtt tcatagttgt caatatgctt tcaaaatgct actattttga cattagcaga 1261 ggtcgatatg ctatggctac taagcgaatt actttccggc tctatccaaa taaagctcaa 1321 accaataaga tgcactattg gcgacgactt cataaagatt tgtacaatgc ttgtgttgag 1381 catcgcaaaa cctcttacaa aaagtttggc aaatcggttg attacttcga ccaacaaaat 1441 tgtttaccag agttcaaaga gtattgggaa gagtataaag agcttggtag tcatgcattg 1501 caagacactg ttaagcgcgt tgattttgcg tttaagcgct tttttaaact caagtctgga 1561 tatcctaaat tcaaatcaag tcgttactac aaaggatgga catatccttg ttcatcaggg 1621 tggaaagctt gcacaaatgg aaaaaacggt tatctaaaaa tatctaattt aggtaactta 1681 aaaatgcgtg gtcaagctag agactggggt aaacctaaaa cttgcacaat tatatttaaa 1741 caagggaaat ggtatgcttc aatcactgtt gattgtgttc caacccgtcc ccaaactgat 1801 acaggtgctg ttggtttaga ttttggtact caccatgcta ttgcagattc taatggcaat 1861 gtgattgaga atcccagatt tgttaaagtt gctcaaacca aaattaatca aatagctaaa 1921 accagtcgta gaaaaagacc tccgagtaaa ggcgttaaag cttcacgacg ttggagaaaa 1981 gcaaataaat cagtagccaa aatacaatcc aaggttgcca gacaaaggca agattggcaa 2041 cataaagtct ctacacaaat agttagctgt aatagcttgg tagcgactga aaagttaaac 2101 cttaaaggga tgactcgctc ttctttaggc aagaataaac gtcaaaagtc aggacttaat 2161 cgttcgttgc ttgacgtggc aattggcaac cttaaagagt taatcaagta caaaatcacc 2221 gaagctggtg gtttttatat cgagattccc acggtcaaag ttaagccatc tcagacttgc 2281 ccgaattgtg gtcatcaaaa gaagaaaaca ctagcagaaa gagtacatta ttgcgaaaaa 2341 tgcgactatc aatgcgatag ggatgtagca gcagcaatgg taatgctcaa ctatgcaagg 2401 gggcaggaac tgtcctctac agatgctgat gagtcaacct ctacttggtg cggaagcttc 2461 aagcaagttg ctcagatgaa gcgtcagaaa cagctagctc agccgttagg cggctagctg 2521 tagttcattc tctgttcaac ttgaaactcc taaactcctt gaacacttga attacttttc 2581 tcagcaccaa gacgccttcc caatcactta cctcaacgac ttgcgaggag agattttagc 2641 gtgcccctat cttgcaacca 
acaaccttaa ccgcgacttt gttgacacca aaggcttttc 2701 tgtggtattc cagcgttcag aaattgcgga agtggaacgg cggtttcctt tcttcaagcc 2761 ctatctcgac caggcgctac agcccacctg caacgctttc tatcttaacc ccttgctgct 2821 taaagaaggt tcccgcgtcg atccacatat tgatcgctcc ctgcgatctt actgcaaaac 2881 gattgagcct cctgtggtgg ttagtgttct ttatgtgcaa gtaccgtcaa acttgcaagg 2941 cggagaactt ctgctgcgcc gccataaaca gcaagtcggg cagattaaac cacaagctaa 3001 caccttactg tattttcaag gcgatctgac tcattccgtg aatgctgtta aaaccacagg 3061 gactcgtcta agcttagttt gtgaacaata cagcctgagc gaaattgaac tgcaagatat 3121 tccagcgttc acagtggagt ctagggtagt gaaggtaaag cggaggtaag aaattttcca 3181 aaccagaggt tatgtccgaa gtgcgattcg ctgtgtttat tcatgcttgg atttactgcg 3241 atcgtgatca tctagatgag ttcgcacttg cttttgaaat tctttaacag cctcaatatg 3301 attttgagaa gcatttatat gtgcctgcgc agattcagaa aatacttcca ccgaaggatc 3361 ctcttttaat tgctgcgcct tttccaaaga ttcctgcgcc tgcttctgaa tagtttttcc 3421 agacgcttca atgctttgac cgatttccgg cttagtttgg tcgtcgcttt gtagccgctt 3481 accaacctct tctaccgatt taccatgctc aagtgattgc aaggcgtgtt tctgtgcggc 3541 ttgcacggtc tgcggtgaaa tctcatgcat aacggtctta cctacttatc agttatcagt 3601 gtgatagcgt tttcttaggg acttcctcat ctaccgtaag ctatggttat accgtttcac 3661 tttaaggttg ataagctgtt acgcttttaa cttgcatatt attttggttt agtcaatcat 3721 tgaaa // LOCUS NODE_6642_length_3725_cov_4.2452323725 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3725) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3725) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3725 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(232..573) /locus_tag="DP116_27555" CDS complement(232..573) /locus_tag="DP116_27555" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007354629.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27555" /translation="MARLYADEQFPYEVVEHLRDLGHDVVTVQEAGKANLKIPDDEVL AFASSNERVVLTLNRRDFKRLHRYVPSHAGIIVCTDDVDRSGLAKRINAAILAGEPLA GKLVSVVRPAR" gene complement(573..896) /locus_tag="DP116_27560" CDS complement(573..896) /locus_tag="DP116_27560" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011320460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27560" /translation="MTLQKLEPELLALTPNEKAQAIQILAQSLGNPWRGIEKTPGVCG GDACISGTRIPIWVLVNARNLGISEAQLLKDYPTLSATDLANAWVYATVYPEEITTAI RENEE" gene 1137..2948 /locus_tag="DP116_27565" CDS 1137..2948 /locus_tag="DP116_27565" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198200.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27565" /translation="MYEAFIDLDELIVRCRDKQARKFIKEAVACYRAGAYRSCIVATW NAVVFDFLHKLTELKLLEDKEASNLLEQFEKLSSEKKVKELWQFESDIPKKALNPFEL ISTVEKSDIERLFEDRSRCAHPSMTSLEEPFEATAELARYHLRSAITHLLERPPVQGR AARERIFQDIKSEYFPTDSELAIKYFQKSPLARARLALIKDVVRGLTINLLTENLPED ERERQFSAIHAISSMYPEQTREILNEKLSDIILNKVVDTNWEKVIIYLADVKIWDTLT EPCQLKAVAFIEKLNIFDTSRYYLCEQNVYIFFKAAQISFLKEAIYVKLQLLTLKQLL SLKEFCQEKLQNSSINEVMESLLEKAIPQASFNELVSMISKDNNSWNEKIYLYLMEKI KEASLKGIFCNLSEITQEEKLLKITEQRLLYLLENASLEKLLQVSKYYVYKLSGSNLE SAIELLKTSIIKLSQQVEFDKLILMKSNYSNEFLDDLLKPILIENLPKIVSNFRLSDS YDDAAFNANILLEIADSLSPAEWESILKAFCENDQIYDSRGCPSIFESLFNKSRKLNN CVEPYWLSFRENLNTFGSSYKKINKLKQLIDSYLPIK" gene complement(2982..3104) /locus_tag="DP116_27570" /pseudo CDS complement(2982..3104) /locus_tag="DP116_27570" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129985.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system RelE/ParE family toxin" gene complement(3088..3378) /locus_tag="DP116_27575" CDS complement(3088..3378) /locus_tag="DP116_27575" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015129986.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system Phd/YefM family antitoxin" /protein_id="PRJNA477356:DP116_27575" /translation="MINISRDIHSLSSFKRNTLEFLEQMKQTGKPVVLTVNGKAELVV QDAESYQKLLDALEKLEAIAGINKGLEDVEAGRTTTLSEFEQEMRQKYGISS" BASE COUNT 1187 a 755 c 651 g 1132 t ORIGIN 1 atcaacaatt gcaacgccgc ccccgcccgc tgtgctaata atttcgccaa ggctaaacgg 61 attgcaaagt taattctaaa cgcttcccca ccagagtaag tttcataagc ccgcatcccg 121 gagatcgcaa gacattgtta aagccgtggc gatagcttcg ctgcagcata gcggcagatc 181 gctcttaact tccgcaggta agcatacact atcgtctaca cacaatcttg ttcatctggc 241 aggacgcacc acactaacca acttaccagc caaaggttcc cctgccaaaa tagctgcatt 301 aattcgcttt gctaatccac ttctatctac atcatccgta caaacaataa taccagcatg 361 actgggtaca tagcgatgca aacgtttgaa gtctctgcgg tttagggtta aaacaacacg 421 ctcgttacta cttgcaaatg cgagaacttc atcatcaggt atttttaaat tagcctttcc 481 tgcttcttgg acggtaacaa catcatgccc caaatcgcgt aaatgctcca caacctcata 541 gggaaactgt tcatctgcat acaaacgagc cattattctt cattctcccg aatggctgtt 601 gttatttcct ctggatatac tgtcgcataa acccaggcgt tggctaagtc tgtcgcagat 661 aaagttggat agtctttcaa aagttgagct tcgctaatac ctaaattacg agcattaact 721 aacacccaaa taggaatgcg ggttccacta atacaagcat ctccaccaca tacacctggg 781 gttttctcga ttccacgcca aggatttcct aaactttgag ctaatatctg aatcgcctgt 841 gctttttcat ttggtgttaa ggcaagtagt tctggttcca acttttgaag cgtcatagca 901 ttactaccaa cctaaaaaga ctataactat atttgaatat tctaacccat gctcaatatg 961 cttctccacc agagtaagtc tcataagccc gtgtcccgga gatagcaaga cattgttaaa 1021 gccgtagcga tcgctttgct gcagcatagc ggcagatcgc taggaacttc cgcaggtggg 1081 cgtacatcta agttaccata ctgaacaaat ggtaaaaaat cagtcgttct cacattatgt 1141 atgaagcttt cattgattta gatgaattaa tagtacgttg tagagataaa caagcaagga 1201 agtttatcaa agaagcagta gcttgttata gagcaggagc ataccgttct tgtattgtgg 1261 caacatggaa tgcagtagta tttgattttc ttcataaact tacagaatta aagcttttag 1321 aagataaaga agcatcaaac ttattggaac aatttgaaaa actgagttcc gaaaaaaagg 1381 tcaaagagct ttggcagttt gagtcagata ttcccaaaaa agcgctcaac ccatttgaat 1441 taatctcgac 
tgttgagaag tcagatattg aaagattatt tgaagataga agccgatgcg 1501 ctcatccctc aatgacatct ttggaggaac catttgaggc tacagcagaa ttagcacgct 1561 atcacttaag aagtgctatt acacaccttt tagaaagacc accagttcaa ggtcgtgcag 1621 cccgtgaaag aattttccaa gatattaagt ctgagtattt cccaacagac tcagaactgg 1681 cgataaagta ttttcaaaaa agtccgttag ctcgcgctcg cttggctctt attaaagatg 1741 ttgttagagg attaacaata aacttgttaa cagaaaatct tcccgaagac gaaagagaac 1801 gtcaattctc tgcaattcat gcaatttcaa gtatgtatcc tgaacaaaca agagaaatat 1861 tgaatgagaa attatctgat attattctga acaaagtcgt tgatacaaat tgggagaaag 1921 taattatcta cttagcagat gttaaaattt gggacactct tactgagccg tgccaattaa 1981 aagcggtagc ctttatcgaa aagcttaaca tatttgacac atcaagatat tatctatgcg 2041 agcaaaatgt ttatatattc tttaaagctg ctcaaataag ttttttaaaa gaagctatat 2101 acgttaagct tcaactgctt actctgaaac aactactttc tctcaaggaa ttttgccaag 2161 aaaaactaca aaatagctca atcaatgaag ttatggaatc tctactagaa aaagctattc 2221 cccaagctag ttttaatgaa ttagtttcaa tgatatcaaa ggacaataat tcttggaatg 2281 agaaaattta tttgtattta atggaaaaaa ttaaagaagc ctctctcaaa ggaattttct 2341 gcaacttatc agaaatcaca caagaagaaa agttactaaa aatcactgaa caacggctgc 2401 tctatttact agagaatgct tctctagaaa agttacttca agtgagtaaa tattatgtct 2461 ataaattatc tggcagcaat ctagaaagtg cgattgaatt actgaagact tcaattataa 2521 aactttctca acaagtagaa tttgataaat taatactcat gaaatcaaat tatagtaacg 2581 aattcctcga tgacttatta aaacctatct tgatagaaaa tcttcctaaa atagtaagta 2641 attttagatt atcagactca tatgatgatg ccgctttcaa tgctaatata ttacttgaga 2701 tagccgattc tttaagtcct gcagaatggg aatctattct aaaagcgttc tgcgaaaatg 2761 atcagatata cgattcacgt ggttgtccta gcatttttga atctctcttc aataaatcta 2821 gaaagcttaa taattgtgta gagccttatt ggttatcttt cagagaaaat ctaaatacgt 2881 ttggcagtag ttataaaaag attaataaat tgaaacagct aattgattct taccttccga 2941 ttaagtgatt tttttgctgt ctagtacctt acatcatcaa atatagtatc catcaattcc 3001 cgaaaccact tatccgcata aacaggattt tgctctctca accaagaata agcagcctca 3061 atttctgcat ttgctgttcg agtcattcta acttgaaatg ccatactttt gtcgcatttc 3121 ttgttcaaat tcgctcaagg 
tagtggttct acctgcttcg acatcttcca atcccttgtt 3181 gattccagca atagcttcca acttctccaa agcatcaagc agcttttggt aagattcagc 3241 atcttggacg acaagttcgg ctttaccatt aactgtcagc actacaggtt ttcccgtctg 3301 cttcatctgt tctaggaatt ccagcgtatt gcgtttaaag ctggagagtg aatgtatatc 3361 tcggctgatg tttatcattg ggttgattat gcactaaatt tgcacttaat ttaatgctat 3421 agcaatccta aatcatttgt aaaattctct cttctttctt ttcttggcgt tcttggcgtc 3481 ttggcggttc gatattttta caactcaaat aggattgcta tatttcaccc aaaagctttt 3541 tagatgtttg gtgcagtctt aaatcaacaa tcgcacctgt gaaccctgct gagttttatt 3601 cacttcaatt cgggcttgaa aagcttcttt gaggtgaggc atatgagtta cagtgaggat 3661 acaagcaaaa tcagaggcga tcgcattaat cgctgcaatc aggcgatcgc atccttctgc 3721 atctt // LOCUS NODE_6703_length_3669_cov_5.2291093669 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3669) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3669) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3669 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 735..2297 /locus_tag="DP116_27580" CDS 735..2297 /locus_tag="DP116_27580" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017748822.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27580" /translation="MQVYTLPGQKEIEKVAERFRTDSQQVTTTDLAINSAKQLTQLIL APVADKLPGKRLVIVADGGLQTIPFGALADITSNKYQPLMINHEIVNLPSASTIAIQR QKLANRQSAPKAIAILADPVYSATDTRVTGKPENTQLAPEIQFERSALNRSARSLKRN GLPRLANTATEAKGILKLIPAATSLEALSFDANYDWATSKTLNQFRILHFATHGFVNQ EQPELSGIVLSLVDKKGKPIRGYLRLGDLFNQDYPAELIVLSACETGLANQPDKADDQ EFKRAVIGWDDRVPMLSRQYPWSAIGRVQGLTTKGEDYHCTGTLISEDVVLTNSHCVI DPETHQASQKILFLPNLINGKVADESDIAQVQNVIYGTDFTKTKLENQTDDWALLKLD KPIGLKYGYLGWKSLPSSTLTKNRNKYIFVGYSGDFPNTNKEKYRFFTAGKGWTASVQ VGCSIVGEEGNVLLHDCDTTGGSSGGAIIGVIGNQPYIIGLNNAEIKTRDGRGIINLG VKIDFLDRLVGK" gene complement(2501..3445) /locus_tag="DP116_27585" CDS complement(2501..3445) /locus_tag="DP116_27585" /inference="COORDINATES: protein motif:HMM:PF10503.7" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27585" /translation="MSLVTACESMQAQQMTSRQGKVVYGESNGELTDQGYARTYDIYT PKSYSPSRPMPLVLVFHGDEGSGRSISNVSQFNTLAEQKGFLVAYPDGIDQRWSLRKT SKRNIDDVSFVKNFINHLEQVRNIDSRRIYATGFSRGGILTQALTCQLSDRIAAFASV AGSLPRRLKQTCQPQMPVSMLMINGTNDQSVHYQGDEKTQKGALVSIPEAVNFWRDHN QCPAYTAKNVAFVAGNQSDAPAGSRARRDRSKVKTYSYSGCSGGSEVVQLAVVDGGHL WYGGTSSDKDVNEFNKDLGLDSTQTIWNFFERHSLPFV" BASE COUNT 1193 a 868 c 685 g 923 t ORIGIN 1 ataaacaaac tgccctcaaa tacttcaacg attctctgcc tttgagtaaa caagtcggcg 61 acaaagcaca acaagcaaca accctttaca acctcgcgta cctagaacgc gatcgcaaca 121 acctgcaagc agcccgcacc aacgtagaaa ccgcaatcaa aattatcgaa gaattacgca 181 ccaaaatcga caggcaagaa ctccgcactt cttactttgc tacagtaacg ggttactata 241 aattctacat cgacctgctg atgcaactgc acaaaaaaga cccatcccaa ggatacgatg 301 cattagccct acacatcagc gaacgctccc gcgccagaag cttgatagaa ctattaactg 361 aagcaaatgc caaaattctc aaaggcgcaa acccagaact cgtacaacaa gaacgcaact 421 tactacagca aattgatgca acagagacac tgcggcaaaa tttagcaaat tcaccaaata 481 aaaacgacct catcaccaaa gcagctattc aaagacacac 
cacagaaatt gaaaatctct 541 tcagccaaca ccaggaagtg caagccaaaa tccgtacgac tagcccagaa tatgcaaaac 601 tgacaaaccc aaaccccgaa aaggacattc tcaaattacc gcaaatccaa caacaacttg 661 ataaagacac tttgctattg caatattctt tgggtgaaga acgcagttat ttgtgggcag 721 ttactcccac ttcaatgcaa gtttacactc tccccggaca aaaggaaata gaaaaagttg 781 cagagcggtt ccgtacagat tcgcagcaag taacaactac tgacttagcg attaacagcg 841 caaaacaact cactcaactg attctcgcac ctgttgctga taaattacca ggaaagcgtt 901 tggtcattgt tgctgatggt gggctgcaaa ccattccctt tggggcatta gctgatatta 961 cttcaaataa ataccaacca ctgatgataa accatgaaat tgtcaattta ccctcagctt 1021 caaccattgc catccaacgc caaaaactcg ccaaccgcca aagcgcaccc aaagccatag 1081 cgattctcgc cgacccggta tacagtgcta ccgacacacg agtgacaggt aaaccggaaa 1141 atactcaact cgcccccgaa atccaatttg aacgttctgc ccttaaccgt tccgccagaa 1201 gcctcaaacg caatggtttg cccaggctag ccaacacagc aacagaagca aaaggaattt 1261 taaaactgat accagcagca actagcctgg aagcattgag ctttgatgct aattacgact 1321 gggcaacaag caaaaccctc aatcaattcc gtatcctgca ttttgccacc cacggcttcg 1381 tcaatcaaga gcaaccagag ttatccggga ttgtactgtc attggttgat aaaaaaggca 1441 agccaatcag aggatacttg cgcttgggag atttgtttaa ccaagactac ccagcagagt 1501 taatcgtctt gagcgcctgc gaaaccggac ttgcgaatca accagataaa gctgatgatc 1561 aagaatttaa acgagcagtt attggttggg atgaccgagt tcctatgtta agccgacaat 1621 atccttggtc ggcaattggt agagtacagg gattaaccac caaaggcgag gactatcatt 1681 gcacaggtac attaatcagt gaagatgtcg ttttaaccaa ttcacactgc gtcattgatc 1741 ctgaaacaca ccaagctagt caaaaaatct tgtttctccc aaatcttatc aatggcaaag 1801 tcgcggatga atcggatatt gcccaagttc aaaatgtcat ttatggcact gatttcacta 1861 aaactaagtt agagaatcaa actgacgact gggcgctttt aaaactcgat aaacctatcg 1921 gcttaaaata cggatatctc ggttggaaat ctttaccctc atcaactctc accaaaaacc 1981 ggaataaata tattttcgtt ggttattctg gtgatttccc taacacaaac aaagaaaagt 2041 atcgattttt taccgcaggt aagggatgga cagcaagcgt gcaagtaggt tgcagcatcg 2101 tgggtgaaga aggcaatgtg ttgttacacg actgcgatac cactggtggt tcttccggag 2161 gagcaattat tggcgtcata ggtaatcaac cgtacataat tggtcttaat 
aatgcagaaa 2221 tcaaaactcg tgatggaaga gggataatca acctgggtgt gaagattgat ttcttagata 2281 gattggtagg aaaataagta cagttctaaa tcatttgtga acaacaagat tcttgacttc 2341 taaaagaagt cgtaggttgg gttgaacgga gtgaaaccca acaatatcaa acgtgtgttg 2401 ggtttcgctt acgctctaac gccactttgc tcaagtcggg aaacccgccc accgcaagtc 2461 gctccccaac ctaccattct acttcttccc tacaccccta ttaaacaaaa ggtagactat 2521 gacgctcaaa aaaattccaa atcgtctgag ttgagtctaa accgagatct ttattgaact 2581 cattcacatc tttatcacta gaagtaccac cgtaccagag atgtccgcca tcgactacag 2641 ctaattgtac aacctcggaa ccaccgctac aaccggagta ggaataggtt ttaacttttg 2701 agcgatcgcg ccttgcgcgg ctccctgctg gagcatcgct ctgattacca gccacgaatg 2761 ctacattctt cgcagtatat gcaggacatt ggttgtgatc tcgccaaaag ttgactgctt 2821 ccggaataga aactagcgct cctttttgag ttttttcatc accctgataa tgtacagatt 2881 ggtcgtttgt gccgttaatc atcaacattg atacaggcat ttgtggttga catgtttgtt 2941 taagtcgcct tggcagagaa ccagcgacag aggcaaaagc tgctattcta tcagatagtt 3001 gacaagtcaa agcttgggtt aatattccac ctctagaaaa accagtagca tagattctac 3061 ggctatcgat atttctaact tgttccaggt gattaataaa attcttgaca aaagaaacat 3121 catcaatatt tcttttgctt gtctttctta agctccatct ctggtcaatt ccatcaggat 3181 aagcgacaag aaaccctttt tgttctgcta atgtattaaa ttgagaaaca ttactaattg 3241 aacgtccact accttcgtct ccatgaaata ccaaaactaa tggcatagga cgactaggag 3301 aataggactt tggcgtgtaa atatcgtaag tccgggcgta accttggtca gttaattcac 3361 cattactctc accataaaca acttttccct gtcgtgaagt catttgctgc gcttgcattg 3421 attcgcacgc tgtgactaaa ctcattgaga gtaaacaaat aaagactcgt ttaagcaatt 3481 gaaaactgaa attgacattc acggttaatt ctatttaata aaattataag catatttttt 3541 ttagtattat gagtaatagg atacacctat tcactcacgg ttgacctcat acaaatggct 3601 gaatctttct aaataacaaa cagtcaaact ctaactacag atagaagatg tctctttccg 3661 cacaaaggg // LOCUS NODE_7013_length_3400_cov_5.8831093400 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3400) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3400) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3400 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 441..926 /locus_tag="DP116_27590" CDS 441..926 /locus_tag="DP116_27590" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_681747.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="allophycocyanin" /protein_id="PRJNA477356:DP116_27590" /translation="MSIVTKSIVNADAEARYLSPGELDRIKGFVTTGEKRLRIAQTLT ENRERIVKQAGDQLFQRRPDVVSPGGNAYGQEMTATCLRDLDYYLRLVTYGVVAGDVT PIEEIGLVGVREMYKSLGTPIEGVAEGVRGLKSAATSLLSGEDATEAGSYFDYVIGGL Q" gene 992..1480 /gene="apcB" /locus_tag="DP116_27595" CDS 992..1480 /gene="apcB" /locus_tag="DP116_27595" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015206134.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="allophycocyanin subunit beta" /protein_id="PRJNA477356:DP116_27595" /translation="MAQDAITSVINSADVQGKYLDTSALEKLKSYFSTGELRVRAATT IAANASAIVKEAVAKALLYSDITRPGGNMYTTRRYAACIRDLDYYLRYATYAMLAGDP SILDERVLNGLKETYNSLGVPVGATVQAIQAIKEVTASLVGPDAGKEMGVYLDYISSG LS" gene 2046..2252 /locus_tag="DP116_27600" CDS 2046..2252 /locus_tag="DP116_27600" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015114200.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_27600" /translation="MARLFKITACVPSQTRIRTQRELQNTYFTKLVPYENWFSEQQRI QKAGGKIVKVELATGKRQTNTGLS" gene complement(2456..>3400) /locus_tag="DP116_27605" CDS complement(2456..>3400) /locus_tag="DP116_27605" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015217690.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27605" /translation="LTMIIIIDLGDFIPWKSLSLKNSIFRCLKAGTNIEFPEQGTPQG GVVSPLLANIALNGIEEIHPSVRYADDMIFFLKPGDNEKNILAKICNFLARRGMKISE RKTKITAATDGFDFLGWEFKVQSNGKFRCYPSKDNFRAFRKKIKNVVNSSNYGANEKA QKLAPIVRGWRNYHRFCKMDRARDNLTHIQLRASRKFNKESKQSRHSTKELLKKAFPK VPYSENKHINVKGEKSPYDGDLVYWSNRNSKLYDGITSKILKKQNHTCGHCGCKVLSD ERTHLHHIDGNHQNWKDKNLLAIHESCHDYHHMGKGKP" BASE COUNT 888 a 721 c 760 g 1031 t ORIGIN 1 ccccctacac cctcacaacg ccaggtgcta agaaagcggg aagccgcctc cggcgtctac 61 aagtcgggaa acccgcccac agcactggct ccccttatac ccttacaccc tatccgtagg 121 acattgactc gcaaaaggcg ttctaacctt tccctagatt agctttcaga ttgtcggtag 181 agcagtcaac atgaaaagca atgttacaaa gtgttaagag cagtcataat tgctcaacag 241 aatgcccgaa aataattccg gaaggtaact gagtttatga gctgatgtac tagccattac 301 gcaagcccac tcgtttaagt actagttgca gcagtttttt ggctgttgta tctaaaaaat 361 gagaaaataa cctcagttaa ccaatcaaag ccgcaccagt ttaagattct ggttttattt 421 agtactggag gaatccatta atgagtatcg tcacgaagtc catcgtgaat gctgatgcag 481 aagcgcgcta cctcagtcct ggcgaattag accggattaa gggatttgtt actactggtg 541 aaaagcgtct ccgcattgct caaactctca ccgaaaaccg cgagcgtatt gtgaagcaag 601 ctggtgatca actgttccaa agacgtcctg atgttgtttc tcctggcggt aatgcttacg 661 gtcaagaaat gaccgctact tgtttgcgcg acctggatta ctacctgcgt ctcgtcacct 721 acggcgttgt agctggtgat gtcaccccca ttgaagaaat tggtcttgtg ggtgttcgcg 781 aaatgtacaa gtcccttggt actccaattg aaggtgttgc tgaaggtgtc cgtggtttga 841 agagcgctgc tacctcactg ttgtctggtg aagacgctac 
tgaagctggc tcttacttcg 901 actacgttat cggtggcttg caatagggtg aagttatttc cctgctgaaa ctgaagtaat 961 gcaataaagt tggaaataag gaatcaacaa catggctcaa gacgcaatta cctctgtcat 1021 taactctgca gacgttcaag gtaagtactt ggatacttct gctctcgaaa agctcaagag 1081 ctacttctca actggcgaat tgcgcgtacg tgctgctacc accattgctg ctaacgcatc 1141 tgcaattgtc aaagaagctg tggctaaggc tctgctatac tctgacatta cccgtcccgg 1201 cggtaacatg tacaccactc gtcgctatgc tgcttgcatc cgcgacttgg attactacct 1261 ccgttatgct acctacgcta tgttggctgg cgacccatcc atcttggatg agcgtgtact 1321 caatggtttg aaagaaacct acaactcctt aggtgtaccc gttggtgcta ccgtgcaagc 1381 tattcaagct atcaaagaag tgactgctag cttggtaggt cccgacgctg gtaaggaaat 1441 gggtgtttac ttagattaca tttcctctgg cttaagctaa aagctaggtt gcacttgaga 1501 acttaagtgc ggctttatta agtaaggtct ggaaatcgat catgagtgct agaactttta 1561 atcgcttatc ttgctaaata cgtatgtatg atgagtgtta gaagtttagt attgatgctt 1621 gatttccagc ctttgagtga ttagagaaaa gtttgcactg acaattttga acagaaaata 1681 ctgagcacag ccctcaccta aaaatactaa gaagaaatct acattttagg tggggtaagc 1741 gaagccaata taaaaagcga agcaacattg tagtataata gatcaatggt aatttactaa 1801 aaggcactgt aaaagcacac agatgctgaa gacatagacc tgatggtaat gcacagattt 1861 tcattaagcc ttatcgcgct ggagtgttct aggacaaaag tttccacgac tcacgcagga 1921 caccagtttt tccggcgacg ttaggacgag ttcttagcgt agaaatcctc aactataatc 1981 ttctatttag ttgggatacc ttcaaataaa aaagtttcca catcaatata gggagatatc 2041 aaaacatggc aaggttgttt aaaattactg cttgtgttcc tagccaaact cgaattcgta 2101 ctcagcgtga actgcaaaac acctatttca ctaagcttgt tccttacgaa aactggttca 2161 gtgaacaaca acggattcaa aaagcaggcg gcaaaattgt caaggtagag ttggctactg 2221 gtaagcggca aactaatact ggtttgagtt aatgtcactt atatagctga atacgacttg 2281 tagggagtta tcaagaggtc aagtagacct cttttttgtt tacacgcgat taaaaacctg 2341 attattccac taccttaatg gaggtcgctt gttagagtcg agaggcggta ttatctgccg 2401 tccctctctg ttagatccga gcgtgcgtat ttctatgcac tcggctcccg atgtttcagg 2461 gtttcccctt gcccatgtgg tggtagtcgt ggcaactttc gtgtattgct aggaggtttt 2521 tatccttcca gttttgatgg ttgccgtcga tgtgatgtag atgggttctt 
tcatctgaaa 2581 gtactttgca tccacaatga ccacatgtat ggttttgctt cttgagtatc tttgaggtta 2641 tcccatcgta tagcttgcta ttgcgattgc tccagtaaac taagtcgcca tcgtaggggg 2701 atttttcacc ttttacattg atgtgcttgt tttctgagta tgggactttg gggaatgctt 2761 ttttaagcag ttcctttgtg gagtgtctac tctgcttgga ttctttgttg aactttctgg 2821 aagctctaag ttggatgtga gtgaggttgt ctctagctct gtccatcttg cagaatctgt 2881 ggtagttcct ccatcctctg acgattggcg ctagtttttg cgctttctca ttggctccat 2941 agttggaaga gttgacgacg tttttaattt tctttcggaa agctcggaaa ttgtccttag 3001 aagggtaaca tctgaatttc ccgttagatt gaactttgaa ctcccaacca aggaaatcaa 3061 agccgtctgt cgctgctgta attttggttt ttctttcact gattttcatc cctcttcttg 3121 caagaaagtt gcatatcttg gcaagtatgt ttttctcatt gtctcctggt tttaggaaga 3181 atatcatgtc atctgcgtat ctcacagaag gatgtatttc ttctattccg ttgagggcaa 3241 tgttagctaa taagggactt accacaccgc cttgaggcgt tccttgttct gggaattcta 3301 tattggttcc tgctttgagg catcggaaga tgctattctt aaggcttagg gacttccaag 3361 gaataaaatc acccaagtca ataataataa tcattgttag // LOCUS NODE_7129_length_3308_cov_5.5336613308 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3308) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3308) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3308 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..420) /locus_tag="DP116_27610" CDS complement(<1..420) /locus_tag="DP116_27610" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408013.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="PEP-CTERM sorting domain-containing beta-propeller repeat protein" /protein_id="PRJNA477356:DP116_27610" /translation="MKLVKNLSIAILGAGFMVFATAAQAFSLELTYDKSIGSPGFGPG ELFVPQGITKDSQGNILITNGRGVNPDGTPNFNIGNKVEKFSPSGEYIGAIGAGGTGP GQFDEPSALEISPVTGDLYVGDVYNNRINQFDSQGNFI" gene 675..1037 /locus_tag="DP116_27615" CDS 675..1037 /locus_tag="DP116_27615" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412503.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_27615" /translation="MPLSQESVSSCPVGTLLNLLSGPWTFYILWILRNNGPTRFGALK RQIEGISSKVLTERLRMLEEAEILYRTYEPTIPPQVTYGLTERSQDLIVVLDQLAAIA HKWYPQERKDIDLPVEKR" gene 1383..1514 /locus_tag="DP116_27620" CDS 1383..1514 /locus_tag="DP116_27620" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017307411.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27620" /translation="MHTSITDDIKKRFHAACAIRGLKMSQVITELVEMWLKTNEVIH" gene 1531..>3308 /locus_tag="DP116_27625" CDS 1531..>3308 /locus_tag="DP116_27625" /inference="COORDINATES: protein motif:HMM:PF13432.4" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019497717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27625" /translation="MKKILILSANPINTNNLRLDEEVREIQAAWERALNRENFELINK GAVRIDELRRTLLDHKPQIVHFSGHGTGTDGLVLENNSGEAQIVSTQSLSRFFELSKE QVECVLLNACYSETQAEAIFQHIDCVIGMGQSIPDNSAILFSKGFYDAIFGGRNYADG FKFGCNNIDLNNLSEFNTSTIKIRHRDFSPEDSITTQPYAPLFFIGTVVFGLIISLLG LFRLVLNDYGLITRILLGLGLITFWLCCAYIYYPSSQIAGFNINFFRRFKQYKKLRRL ALAGMVIIPLLTITGFYILELPTKDIIVLLADFETSADQKNYRVTQNIFNKLDNAFQR YPDVKVQRLNKIITNRQDALSEGKSHKASIVIWGNYGVTNTKTQFEPHFEVLRNPKKY LPKIGSLEQTASISELNNFKLQLRLSKQMSYLTHFTVGLIRYTVSDWKQAINSFSEAL KQGNDSIKVLNQKIVYLYRGNSYLYKKEYDRALADYNQAIKLDQNYAQAYNSRGISYT NKKDYDRAIADYNQAIKLEPNSAQAYIGRGVSYNNKKEYDRALADYNQAIKLEPNYAQ AYNDRGNSYLYKKEYDRALADYNQAI" BASE COUNT 1036 a 645 c 636 g 991 t ORIGIN 1 aataaagtta ccctgagaat cgaattgatt gatgcggttg ttgtaaacat cacctacata 61 cagatcccct gttaccggag agatttctag agctgatggc tcgtcaaact gtccgggtcc 121 tgtgccgcct gcgccaatgg ctccaatata ctcaccgcta ggactaaatt tttctacttt 181 gttaccgatg ttgaagttag gagtaccatc cgggttaaca ccgcgtccgt tagttatgag 241 gatattccct tggctatctt tggttatgcc ttggggaaca aatagctctc caggaccaaa 301 gcccgggcta ccaatagact tatcgtaagt tagctctaga gagaaagctt gagcggctgt 361 tgcaaatacc atgaatccag caccaagaat ggcaattgac aaatttttaa ctaatttcat 421 gggttgaact cctgtgtgtg taacttatag acgcaagcgt atagcagtgc atatcgacga 481 gcgataccac cggcaagggc aaagccctac gcaccacttc ctcagtcaaa gactgtgaag 541 tttttactga tttagcctat ggtgtggtaa gtggaagact tgtatctttc ttttagttac 601 tataagttac cagttgaaaa agtaaaagta ggtactttaa agtaactaga gtacgcagga 661 gaaactagat aattatgcct ctgtctcagg aatcagtcag tagttgcccg gtaggtactt 721 tgttaaacct gcttagcgga ccttggactt tctacattct ttggatttta cgaaacaatg 781 gacccactcg ctttggtgct ttaaaacgtc agatcgaagg gatctcatca aaagtcttga 841 ctgagagatt gcggatgctt gaggaggcag agatactcta ccgcacctac gaaccaacta 901 tcccgccgca agtaacctac ggcttaactg aacgatcgca agacctgatt gtagttctgg 961 atcaacttgc ggcgatcgcg cataaatggt atccacaaga acgcaaggat atcgatttac 1021 cagtagaaaa aagataaaat atatatcata actcctgttt 
tcaatcaagc ataaaaagca 1081 gcttttacac aatgactttg gatatgtcta atgtgtctta gcgcaaaatc gccaaaatct 1141 gggaaattat ccggttggct tgggataaat aaccaccacc aaatcgttgg aagtctatag 1201 gactcctatt tgatttttgc gaagctaggt acatgtttat tccttcttcc ctgttccctg 1261 cgatgcactg agcttgtcga agtgttccct gttaagagtt ccctgtcctg actttgttag 1321 ttcacaaatc aaaccggatt cttatacaat tttttataca agctttcaaa tctgtaaaac 1381 ttatgcacac tagtatcact gacgacatca aaaaacggtt tcacgcagcc tgtgctatcc 1441 gtgggctgaa gatgagccaa gtgataactg aactagttga gatgtggtta aaaacaaacg 1501 aagttattca ttgagtgatc ccaaagcctg atgaagaaga ttttaatttt atcagctaat 1561 cctataaata caaataactt gcgcttggat gaagaagtac gagaaattca agcagcgtgg 1621 gaacgcgccc tcaatagaga aaattttgaa ttgattaata aaggcgcagt tcgtattgat 1681 gagttgcgtc gtacattgtt agatcataaa ccacaaatag tacatttttc cggtcacggt 1741 actgggacag atggcttagt tctggaaaat aactcaggtg aagcgcagat agtaagtacg 1801 caatcactct cacgtttttt tgagttatca aaagagcaag tagagtgtgt tttactcaac 1861 gcttgctatt ccgagactca agctgaagca atttttcaac atattgattg tgtcattggt 1921 atgggacaat ctattccaga taattctgct atcctttttt ccaaaggatt ttatgatgct 1981 atttttggtg gcagaaacta cgcagatgga tttaaatttg gttgtaataa tatagattta 2041 aataatcttt cggaatttaa tacttccacc ataaaaatta gacatagaga ttttagtccg 2101 gaagactcaa tcacaactca accatatgca cccttgttct ttattggcac ggtagtattt 2161 ggtttgataa ttagcttact aggacttttc cggttagtac ttaacgatta tggattaata 2221 actcgtattt tactaggtct tggtctgatt actttttggc tctgttgtgc atacatctac 2281 tatccctcaa gtcaaattgc tggttttaat atcaatttct ttagacgatt caaacaatat 2341 aaaaaattac ggcgtttggc tctagcaggg atggttatta tacctctttt gactataact 2401 ggattttata tattggaact tccaaccaag gacattattg ttcttctcgc tgattttgaa 2461 acttctgccg atcaaaaaaa ctatcgtgtg actcaaaata tctttaataa gctagataat 2521 gcctttcaac gatatcctga tgtcaaagta caacgtttaa acaaaataat tactaatcgt 2581 caagatgccc tgagtgaagg aaaaagtcac aaagctagta ttgtcatctg gggtaattat 2641 ggcgtaacaa atacaaagac acagtttgag ccacactttg aagtattgcg aaacccaaaa 2701 aaatacttac ctaaaattgg ttcattagaa cagacggcta gcatctctga 
attaaacaat 2761 tttaaattgc aattacgtct ttcaaaacag atgagttatt tgactcattt tactgtggga 2821 ttaattcgtt acacagttag tgattggaaa caggctataa actcttttag cgaagctttg 2881 aaacagggta atgattcaat taaagtgtta aaccagaaaa tagtctacct ttaccgagga 2941 aattcttact tatataaaaa agagtatgac cgcgctcttg ctgattataa ccaagctatt 3001 aagcttgacc agaactatgc ccaagcttac aatagccgag gaatttctta caccaataaa 3061 aaagactatg accgtgctat agctgattat aaccaagcta ttaaacttga gccgaactct 3121 gcccaagctt acatcggccg aggagtttct tacaacaata aaaaagagta tgaccgcgct 3181 cttgctgatt ataaccaagc tattaaactt gagccgaact atgcccaagc ttacaacgac 3241 cgaggaaatt cttacttata taaaaaagag tatgaccgcg ctcttgctga ttataaccaa 3301 gctattaa // LOCUS NODE_7131_length_3306_cov_4.5059983306 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3306) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3306) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3306 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..708 /locus_tag="DP116_27630" CDS <1..708 /locus_tag="DP116_27630" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741057.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="efflux RND transporter periplasmic adaptor subunit" /protein_id="PRJNA477356:DP116_27630" /translation="FTGRIAVVGSVVEGETRVVPVKAEINNPGGVLKPGMFAQLEVLT DQTSAATLAIPKSAVVEANNKTIVYVQNGNAFKSTEVTLGQTSGDLVEVTQGLSQGNS IVTQRAPQLYAQSLRGGSKSKEGEQKEEGNSHKEGEQKEEGNSHSQETKAEAHNFGLP LWLIAALGGTAIATGAFVTGRSRSSRRTRSQMVAVENIEYDATHETEIHTDNHKQPTL STSAKQDEERENPHQPH" gene 844..>3306 /locus_tag="DP116_27635" CDS 844..>3306 /locus_tag="DP116_27635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877592.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="CusA/CzcA family heavy metal efflux RND transporter" /protein_id="PRJNA477356:DP116_27635" /translation="MLNSIVKWSIAQRWLVVFASILISLWGFRVLTQMPLDVFPSFSP PQVEIMTEAPGLAPEEVESLVTRPIESSINGTPGLESLRSSSAVGLSAVRAIFSWDTD IYRARQLVTERLQQARSLLPQGVEDSEILPVSSALGWTVKYAFTSETTPLMDVWRIVN WQVKNRLLAVRGVSNVVIFGGDERQYQVLVNPAKLRAFNVTLDDVTKAAAAANANAPG GFLITPDQETLVRGVGRIQSIEQLKKSVIKAKNGTPLLLEQVADVQIGAALKRGDGSF AGKKAVILTINKQPTADTPSVTKAAEAAMEEIKAGLPKDVKITTTFRQEDFIEASIKN VEEALRDGTIIVSVILILFLMNWRTVVISLSALPVSLLLGMMILNWTGQGINTMTLGG LVVAIGSVVDDAIVDMENVYRRLRENQLAGNPVPPFQVVFNGSVEVRVSVLFATIIIA VVFAPIFGLSGVEGRIFTPMGLAYLLSIAASTLVALTLTPAMCALLLVNTRLPSTETW VERQAHRLYRPALKFSMRRPKIILATAMAGIVAAIVILLGLGQVFLPEFQDRALVIAT LLMPGESLDATNQVGLAIEEALKKNPRIETVQFRSGRAQGDTEVAGVNFGEMDVQLSE EGAKDRDKTIEMIRSEFEKIPGVAPNIGGFISHRMDEVLSGVRSAIAVKIFGLDLEQL RTLGKQVQSAMGKVQGLTDLQLEPQVPMKQVQIQFDRDAAARYGLTIGELSETIETAL NGRVVSQVLEQQQTFNMIVWLQESYRNNIEIIRDLLVDTPNTQKIPLAQVAKIDYGTG PNTINRENVSRYIVVSSNVGGRD" BASE COUNT 922 a 695 c 801 g 888 t ORIGIN 1 tttactggac gaattgccgt cgttgggtca gtggtagaag gcgaaacgcg ggttgttcct 61 gtgaaagccg aaataaataa ccccggtgga gttcttaaac cagggatgtt cgcccaatta 121 gaagttttaa cagaccaaac atcagcagct accttagcaa ttcccaaatc tgctgtggta 181 gaggcaaata ataaaacaat agtttacgta caaaatggta atgcctttaa gtcaactgaa 241 gtgacattag gtcaaacttc tggggatttg gtagaagtca 
ctcaaggtct atctcaggga 301 aattcaattg ttacccaacg tgcgccgcaa ctttatgcac agtctttgcg gggtggaagt 361 aaatctaagg aaggagaaca gaaggaagaa ggtaattcgc ataaggaagg agaacagaag 421 gaagaaggta attcgcattc acaagagact aaagctgaag ctcacaactt cggcctacct 481 ttgtggttaa ttgcagcatt aggaggaacg gcaattgcga ctggtgcttt cgtgacaggt 541 cgttcccggt ctagtcgtcg tacccgttcc cagatggtag cggtggagaa cattgagtac 601 gacgcaactc atgaaacaga gattcatacc gataaccaca agcagccaac cttatctact 661 tccgctaaac aggatgagga gcgtgaaaac cctcaccagc ctcactaatg ggaaatgact 721 atgacgagtg ctgttaccaa gtggatagtt gttgaggaag gcaaaggagg agaagtaagt 781 tttgaagatt gattttattt tatcctctaa atcccttgtg actctaccta aacaatcaat 841 aatatgctta attccatagt taaatggtct attgcccaac gctggctggt ggtttttgcc 901 tcaattttaa ttagcttgtg gggctttcgc gtcctgactc aaatgccttt ggatgtgttt 961 cccagctttt ctccgccgca agtcgaaatt atgactgaag ccccaggact tgcaccagaa 1021 gaagtagagt cgttggtgac tcgtcccata gaaagttcaa taaacggaac tcccggactc 1081 gaatcgttac gctcttcttc agctgtaggt ctttctgctg tgagagccat tttcagctgg 1141 gatacggaca tttatcgtgc ccgtcaattg gtgacggagc ggttgcaaca agcacgcagc 1201 ctgctaccgc aaggggtaga agattcagaa attcttcccg ttagttctgc tttaggatgg 1261 actgtcaaat acgccttcac ctccgaaacc actcctttaa tggatgtatg gcgcattgtc 1321 aactggcagg taaaaaaccg cctgctggct gtccgtggtg tcagtaatgt ggtgatattt 1381 ggtggggatg agcgtcagta tcaagtgctg gttaatccag cgaagctaag agcgtttaat 1441 gttactcttg atgatgtcac caaagccgca gctgccgcca acgccaatgc gccaggagga 1501 tttttaataa ctcctgacca agagactttg gttcgaggag tagggcggat tcaatctatt 1561 gagcagctaa aaaaatcagt cattaaagcc aaaaatggta cgccactgct tctagagcaa 1621 gtggctgacg tacaaattgg tgcggcgctc aaacgcggtg atggtagttt tgcgggtaaa 1681 aaagcagtga ttttgaccat caacaaacag ccaactgcag atacaccatc agtcacaaag 1741 gctgctgagg cagcaatgga ggaaattaaa gctggtctgc ccaaagatgt caaaattaca 1801 actaccttcc gccaagaaga ttttattgaa gcctctatta agaatgttga agaagccttg 1861 cgcgatggca ctatcatcgt ttccgtcatt ttgattctgt ttttgatgaa ttggcgcacg 1921 gtagttatta gtttaagcgc cctccctgtg tctttgttgc tagggatgat gattctcaac 
1981 tggacaggac aaggcattaa caccatgact ttgggcggac tggtggtagc cattggttcg 2041 gtggtagatg atgcaattgt tgatatggaa aacgtctacc gccgcttacg ggaaaaccaa 2101 ctcgctggaa atcctgtccc accttttcaa gttgttttca atggttcggt ggaagtgcgc 2161 gttagcgttc tctttgccac aattattatt gccgtcgtct tcgcaccaat tttcggcctc 2221 tctggtgtgg aaggtcgaat ttttacacca atgggtttag cgtatctgct gtcaattgct 2281 gcttccactt tggtagcgct gacattaact ccagcaatgt gtgccctact gttggtgaat 2341 acaaggttac ccagtacaga aacttgggta gaaagacaag ctcatcggct ttaccgtcca 2401 gctttaaaat tttctatgcg tcgccccaag attattttgg caacagcgat ggctgggatt 2461 gtagctgcaa tagtaattct acttggttta ggacaagttt ttttaccaga gtttcaagat 2521 cgcgccctag tgattgctac tcttcttatg cctggtgaat ctttagatgc aaccaatcaa 2581 gtggggttag cgatagaaga agctctcaaa aagaaccctc ggattgaaac tgttcaattc 2641 cgctctggac gggctcaagg tgatactgaa gtggctggtg tcaactttgg agaaatggat 2701 gtgcaattga gtgaagaggg ggcaaaagat agggacaaaa ctattgaaat gattcgctca 2761 gaatttgaga aaattcccgg cgtagcacct aatattggtg gttttatttc tcaccggatg 2821 gatgaagtac tgtccggggt aagaagtgct atagctgtga aaatctttgg tctcgacttg 2881 gaacaactcc gtacccttgg caaacaagtg caatcagcta tgggtaaggt tcaaggtcta 2941 acagacttgc aattagagcc acaagtacca atgaaacagg tgcaaattca gtttgaccgc 3001 gatgcggctg ctcgctatgg tctaactatt ggagaattat cagaaacgat tgaaacggct 3061 ctcaatggaa gagtcgtttc tcaagtctta gaacagcaac aaaccttcaa catgattgtt 3121 tggttgcaag aaagctaccg taacaacata gaaatcatcc gtgatttatt agttgataca 3181 cccaacactc agaaaattcc cttggctcaa gtcgcaaaaa ttgactacgg tactggtccc 3241 aataccatca accgcgaaaa cgtctcccgg tatattgttg tatctagcaa cgtgggtggg 3301 cgggat // LOCUS NODE_7132_length_3304_cov_5.5857193304 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 3304) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3304) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3304 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 929..3157 /gene="psaB" /locus_tag="DP116_27640" CDS 929..3157 /gene="psaB" /locus_tag="DP116_27640" /EC_number="1.97.1.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017315024.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I core protein PsaB" /protein_id="PRJNA477356:DP116_27640" /translation="MAVKYPKFNQDLAQDPTTRRIWYAIATGNDFESHDGITEENLYQ KIFATHFGHVAIIFLWASSLLFHVAWQGNFEQWMQDPVHIRPIAHAIWDPHFGKPAID AFTQGGANYPVNIAYSGVYHWWYTIGMRTNNDLYIGSLFLLLLAAVFLFAGWLHLQPK YRPSLAWFKSAEPRLNHHLAGLFGVSSLAWAGHLVHVAIPESRGIHVGWNNFLTTLPH PQGLGPFFSGNWGAYAANPDTGEHIFGTSQGAGTAILTFLGGFHPQTESLWLTDMAHH HLAIAVIFIIAGHQYRTNFGIGHSIKEMLNAKNFFGIQTEGQFNLPHQGLYDTYNNSL HFQLSIHLAALGTALSLVAQHMYALPPYAFIAKDYTTQACLYTHHQYLAVFFMVGAFA HAGIFWIRDYDPEQNKGNVLDRVIKHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAF GTPEKQILIEPVFAQFIQAAQGKALYGMNVLLSNPDSLAHTAWPNHANLWLPGWLDAI NNGTNSLFLTIGPGDFLVHHAFALGLHTTTLICVKGALDARGTKLMPDKKDFGFTFPC DGPGRGGTCQTSSWEQSFYLAIFWALNTVGWATFYFHWKHLGIWQGNVAQFNESSTYL MGWLRDYLWANSAQVMNGYNPFGTTNLAVWDWMFLFGHLVWATGFMFLIAWRGYWQEL IETLVWAHERTPLANLVRWKDKPVALSIVQGWLVGLTHFTVGYILTYAAFLIAGNAGK FG" BASE COUNT 840 a 762 c 739 g 963 t ORIGIN 1 gatgctcgca ttaaattgaa gcgactttat ccctcaattc aaacttgatg acctagtagg 61 gtgaggatgg acaaaaatac ctcaagcata gaggatggaa atacctcaaa caaatgacat 121 ttcgcctacg tgttcctcga ccataacatg gctgtgctga cccagttgag caagaaaatt 181 ggaaaaaaaa ctcgatttga ggctcaaatt tcttcaagct cgatatccaa gcgctgaaat 241 cgaagtttgg agcatggacg agcaacgcta ccggacttca tccaatcttg tgtagagttt 301 gggtatcatt cggcgaacaa ccaattgctt ctgtcgtaca gaggtataaa tggatgtggt 361 tatatgggga cgcactaatc tttacaaaac gcgcaacccg ataacaactg gagtttggac 421 tcaaagccag agaatacatt tttaaataat tgattttcca attatttagg atctggctag 481 cgtattcatc tagcagtgca gatggtggtg aaaccttggc aatagaagct tgaaaacttc 541 atttattata tgcactttct tcctacctca gactctatta acaatctgta atattttggt 601 
tgtattgagg atgacaagca aatttcatac taatctcttt ctttagtaag agatgcaatg 661 gttgttttag tagataattt gtttatgact catataaagt tgaaagcatc ataccaactg 721 aggatgaaaa ggcaggaaga aaactttatt tttgcatgca ttggcatgtt tccaagaaaa 781 cgagttttct cctcatactt acgcttcttt ttttatgaga ttgacctccg attaaggctg 841 aaagactgtt ttttcaattt gctgtgttgc tagctgatta aaactagcaa cctgaataag 901 cgtttagggg aggatgttcg taaaatacat ggcagtaaaa tatcctaaat ttaaccagga 961 tcttgcacag gatccgacga cacgtcggat ttggtacgcg atcgcaacag gaaatgattt 1021 tgaaagccac gatggcatca cagaagaaaa tctttaccaa aagatttttg caactcactt 1081 cggtcatgtg gcaatcatat tcctgtgggc atcaagcttg ttgttccacg tagcctggca 1141 aggtaacttt gaacagtgga tgcaagatcc tgttcatatt cgcccaatcg cccatgcgat 1201 atgggacccc cacttcggta aaccagcaat agatgctttt acccaaggcg gcgctaacta 1261 tccagttaac attgcttact ctggtgttta tcactggtgg tacaccatcg gtatgcggac 1321 gaacaatgat ctctacatag gttcattgtt cctcctgctg ttagcagcag tgttcctgtt 1381 tgcaggttgg ttgcacttac agcccaagta ccgtcctagc ttagcttggt ttaagagtgc 1441 agaaccccgt ctgaaccacc acctagcagg tttgtttggt gttagctctc tggcatgggc 1501 aggacatttg gttcacgtag caatccccga atctcgtggt attcacgttg gctggaataa 1561 cttcctgacg actctaccac atccgcaggg attgggacca ttcttctctg gtaattgggg 1621 tgcctacgct gcgaacccag atacgggtga gcatatattt ggtacatctc aaggtgcagg 1681 aactgcaatt ctgacattct tgggtgggtt ccatccacag actgaatcgc tgtggctgac 1741 ggatatggca caccaccact tggctatagc agtcatcttt atcatcgccg gtcaccaata 1801 ccgcactaac tttgggattg gtcacagcat caaagagatg ctcaacgcca agaacttctt 1861 tggtatccaa actgaaggtc aattcaacct gcctcaccaa ggactgtacg atacctacaa 1921 caactctctg cacttccagt tgtctattca cctggcagcg ctaggtactg ctctttcctt 1981 ggtggcgcag catatgtacg ctttgcctcc ttatgccttc atcgctaagg actacacgac 2041 ccaggcgtgt ctctacactc accaccagta cctcgctgtg ttcttcatgg tcggtgcttt 2101 cgcccacgct ggaattttct ggatacgtga ttacgatcca gagcaaaata agggtaacgt 2161 acttgaccgg gtcataaagc acaaagaagc gattatctct cacctgagtt gggtttctct 2221 gttcctaggc ttccacactc tgggattgta cgtccacaac gacgtagtcg ttgcttttgg 2281 aactcctgag 
aagcaaatct tgattgaacc agtgtttgct cagtttatcc aagctgctca 2341 gggtaaggcg ctgtatggca tgaatgtact gttatctaat ccagatagcc ttgcacacac 2401 tgcttggcct aaccacgcaa acttgtggct accaggttgg ctggatgcga ttaacaacgg 2461 tactaactcc ctgttcctaa ccattggtcc tggcgatttc ttggttcacc acgcctttgc 2521 tctgggtttg cataccacca cgctcatttg tgttaagggt gcgttggatg ctcgtggtac 2581 caagttaatg cccgataaga aggatttcgg ttttactttc ccctgtgacg gtccaggtcg 2641 gggtggtact tgccaaactt cctcttggga acagtcgttt tacctcgcta tattctgggc 2701 acttaacact gttggatggg ccaccttcta cttccactgg aagcatttag gaatctggca 2761 aggtaatgtg gctcagttca atgaatcttc tacatatctc atgggctggc tgcgagatta 2821 tctctgggct aactccgctc aagtcatgaa cgggtacaac cccttcggta caactaacct 2881 agctgtttgg gattggatgt tcctcttcgg gcacctagtt tgggcaacgg gtttcatgtt 2941 cctcatcgct tggagaggtt actggcaaga gttgattgaa actctagttt gggcacacga 3001 acgcactcca ttggcaaatt tagtccgctg gaaggataag cctgttgctt tgtccatcgt 3061 ccaaggttgg ttggtgggtc tgactcactt cacagttgga tacatcttga cctacgcagc 3121 attcctcatt gccggaaacg ctggtaaatt cggttaaagt ttttgatttt gatatcttct 3181 gacctttagc ttcccctcta tctggatggt acagagggga atttcttttg ctctcaaaca 3241 aggcgcaaag tgcaaaatcc cctagggagt gttaacgctg ttccgcgaaa ttcaagagtc 3301 cgcc // LOCUS NODE_7235_length_3226_cov_5.0132453226 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3226) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3226) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3226 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 176..1345 /locus_tag="DP116_27645" CDS 176..1345 /locus_tag="DP116_27645" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015202289.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sensor histidine kinase KdpD" /protein_id="PRJNA477356:DP116_27645" /translation="MYSSYTVSSNGTYLRPARRGKHKIFIGMAPGVGKTYRMLDEACR QKKDGIDVVIGWLETHDRPETDAKAQGLEVIPRNKIEQGGLIFTQMNTDAIIARQPQL VLIDELAHTNIPGAKHNRRYQDVETILAAGIDVYSTVNIQHLESLCNQVTQMTGIVVQ ERIPDSLLEAADQVVVVDVTPETLKERLLEGKIYPTRKIEPSVQNLFQRSNLVALREL ALRQVADNIEKKEIQQAARLNSNAKAVSANVYCIHERILVCVSCAPNSVRLIRRGAIF ADYMNAPLYVLFVNNPDHFMTKVEALHIETCKQICQEFKGEFLQVSGQNVAQEIARVA KLYRITQVVLGQTRRSKWQMIFTESLIHQLLRYAERQASGLSLNQIDIHIISSDK" gene complement(1551..1796) /locus_tag="DP116_27650" CDS complement(1551..1796) /locus_tag="DP116_27650" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007355091.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27650" /translation="MPHSDSPIELSIARERLRQARYSFNLAIISTAVSAFVSLTGAGL LLRGKANEGAVTAACGMIASVRCVELAKDANDRLDEI" gene 2083..>3226 /locus_tag="DP116_27655" CDS 2083..>3226 /locus_tag="DP116_27655" /inference="COORDINATES: protein motif:HMM:PF00805.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="PRJNA477356:DP116_27655" /translation="MTDPSQNRKKKKITIIASSQGVERAENALIRLGFDSKSNFAQSQ LLARNTVTKFFQREPIQLDSFKRICEALELEWGEIAEIPSEEEQSKRIERKYSTSLET NEEVGQMQTLRRQVTVIDTQSKIIKAEIILKGDINSVHNFKIIQLILQEHSGDTITIT DIQEGSIRLIVEGSQEDIERLVSRIKSGELTEVNGFPVEDAQILSEISDDDENNELED KWRLVQEIVSQRVKDRKLIGADLSDADLSDADLSGADLSDADLNGADLSDADLSGADL SGANLSGAIINKETRIDRKWRLIWEIVNHAAVDRDLSNANLSGANLSGANLSGANLSE ANLISANLSEANLSEANLSRANLSLAYLKGAIINKETRIDRKWRLIW" BASE COUNT 1014 a 657 c 709 g 846 t ORIGIN 1 tcaggtagga aacggactcg tccacccctt gttcactgtt tactgttcac tgttcactgt 61 tcactgattg aagggcgctt attcttaccc gttcaaaata aattattaac ttttcctgaa 121 cttttgagta cagaacgcaa gtaagatggt atttagcaat gttaatgcgc agataatgta 181 cagttcttat actgtttctt ctaacggtac ctacttacgt cctgctcgtc gcggcaagca 241 caagattttt attggtatgg ctcctggggt tggcaaaacc tacaggatgc tagatgaagc 301 atgccgacaa aaaaaagatg gtatagatgt cgttattgga tggctggaaa ctcacgatcg 361 cccagaaaca gatgctaagg cacaaggttt agaagtcatt ccacgaaata agatagaaca 421 gggcgggtta atcttcaccc agatgaatac tgatgctatt atcgctcgcc aacctcagct 481 tgttttaatt gatgaactag cacacaccaa cattccagga gcaaaacaca acagacgcta 541 ccaggatgta gagacaattt tggcagcagg tattgatgtt tattctactg tcaacataca 601 acacttggag agtctgtgca atcaagtcac ccagatgacg ggtatcgttg tgcaggagcg 661 tattccagac agcttactgg aagctgcgga ccaagtggtt gttgtggatg tcacaccaga 721 aacattaaaa gagcgattgt tggaaggtaa aatttatcca acaagaaaaa ttgagccatc 781 cgtacaaaat ttattccaac gttccaacct ggtggctctg cgagaactag cgctgcggca 841 ggtagctgat aacatagaaa agaaagaaat tcagcaagcc gctcgactaa actccaacgc 901 taaagctgtc agcgcaaatg tttactgtat ccacgaacga atactggttt gcgtttcttg 961 tgcgccaaac tctgtgcgat tgattcgacg cggagctata tttgctgatt atatgaacgc 1021 accgctctac gttttgtttg tgaacaatcc agatcatttt atgacaaaag tagaagcact 1081 gcatattgaa acttgcaagc aaatctgcca agaatttaag ggtgagtttt tgcaagtttc 1141 tggtcaaaat gtagcacaag aaattgctcg tgttgcaaag ttataccgaa ttactcaggt 1201 agtattaggt caaactcgtc 
gctcaaaatg gcaaatgata tttacagagt ctctgattca 1261 ccaactgctg cgatacgcgg agcgtcaagc ctccggctta tcgctcaacc aaattgatat 1321 tcatattatc tcatctgata aataacgaaa accaagtaaa tcaagacgga acctcaccct 1381 gccctatcgg gcatccctca cgccaggtgc ttcaagtcgg caaagccgcc caacgcactg 1441 gctctcctta ctaaggagag ggaaagattc tagcgcagct aaaagcgagg gtgaggttgt 1501 aaaaatctta ttcccgcgac agaatcaaga gttttccatt tcttcacaga tcaaatttca 1561 tcaagtctat cattagcatc tttagcaagc tcaacacatc ggacactcgc aatcattcca 1621 caagcagcag tcactgcacc ttcatttgct tttcctctca gcagaagccc agcacctgta 1681 agactaacaa aagcagacac tgcggtagaa attattgcaa gattgaaact ataacgtgct 1741 tgtcgcaggc gttcacgagc aatacttagt tctatcggag agtcagagtg agggattggc 1801 ttgttattca tagatttcgc agagcaaata aacttcatac ttctaatgtg cagccaaacg 1861 cctaaaatct tgactacgca acgattgtgt tgttaatcac ctatgacctt tctatgattg 1921 tcttatgcgc tacttttgat tgcacaagcg ttaagcgaag ctctgccgta ggcaatcgcc 1981 tttaacatgg aagaagtcag aaggaacaag gaagatagaa gttggtcttc ttccttgttt 2041 cttccatctt cttccttcgc ttaaaaaaac tctggtcaaa ccatgacaga tcctagccaa 2101 aacagaaaaa agaagaaaat aactatcatc gcgtcttcac aaggtgtaga gagagcagaa 2161 aatgctttga taaggctagg atttgattca aaaagtaact ttgcacagtc tcaattatta 2221 gctcgtaata cagttacaaa attttttcag cgtgaaccaa ttcaacttga ttcctttaaa 2281 agaatatgtg aggcattgga actggagtgg ggagaaatag cagagatacc atcagaagaa 2341 gaacaatcta agcgaataga aagaaaatat agcactagct tagaaactaa tgaggaggtg 2401 gggcaaatgc aaacacttcg ccgtcaagtc actgtaatag atacacaaag taaaataatt 2461 aaagcagaaa ttatattaaa aggcgatatc aattcagttc acaacttcaa aattattcaa 2521 ttaattttac aagaacattc aggagacact ataacaatta ctgatattca agaaggcagt 2581 atcagattaa ttgtagaagg ttctcaagaa gatatcgaac ggcttgtatc tcgcatcaaa 2641 tcaggagaac taacagaagt aaacggtttt ccggttgagg atgctcaaat tttgagcgaa 2701 atttcagatg atgatgaaaa taacgaatta gaggataaat ggcgtctagt gcaagagatt 2761 gttagtcagc gagttaagga tcgaaagttg attggcgctg acttgagtga tgctgactta 2821 agtgatgctg acttgagtgg tgctgacttg agcgatgctg acttgaatgg tgctgacttg 2881 agcgatgctg acttgagtgg tgctgacttg 
agtggtgcca acctgagcgg tgcaattatt 2941 aacaaggaaa cgcggattga tcgcaagtgg cgtttaattt gggagattgt taatcacgca 3001 gctgtagatc gagatttaag taatgccaat ctgagcggtg ccaacctgag cggtgccaac 3061 ctgagcggtg ccaacctgag cgaggccaat ctgattagtg ccaacctgag cgaggccaac 3121 ctgagcgagg ccaacctgag ccgtgccaac ctgagccttg cttacctaaa gggtgcaatt 3181 attaacaagg aaacgcggat tgatcgcaag tggcgtttaa tttggg // LOCUS NODE_7526_length_3022_cov_4.0660603022 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 3022) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 3022) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3022 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(131..379) /locus_tag="DP116_27660" CDS complement(131..379) /locus_tag="DP116_27660" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182534.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="phycobilisome linker polypeptide" /protein_id="PRJNA477356:DP116_27660" /translation="MVYQIVSGRSNNSITANRSFRFEVIGLHQNEVTDNNNYAIRSSA SVFITVPFSRLNQELQRINRLGGKIVNIEPLTPETDKA" gene complement(454..1320) /locus_tag="DP116_27665" CDS complement(454..1320) /locus_tag="DP116_27665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015182533.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_27665" /translation="MAITTAASRLGTSAFTNAAPIELWSNASQTDISAVIAAVYRQVL GNDYLLKSERLTGAESLLLNGSISVREFVRQVAKSQLYKAKFFSNSFHSRFTELNYKH FLGRAPYDESEIAYHLDLYQTKGYDADIDSYIDSTEYEVNFGENIVPYYRGFTTQPGQ KTVGFTRIFQLYRGYATSDRSQIPGNSPRLASELAKNSASTVVAPALSNNGFAYRPPL RGETPSSTFGGSQAFGTGRLYRVEVASISKPGYPKVRRVNQAVVIPYEQLSDYFQRVQ RQNGKIASITPL" gene complement(1658..2473) /locus_tag="DP116_27670" CDS complement(1658..2473) /locus_tag="DP116_27670" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006618346.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I reaction center subunit XII" /protein_id="PRJNA477356:DP116_27670" /translation="MAALGEASRLGISPFADAEKVELRPVRTEADVQAVIWATYRQVL GNEHLMQSERLLSAESLLRQGQITLRDFVRAIAQSELYRQKFFYSNSQVRFIELNYKH LLGRAPYDESEIAYHVDLYNSQGYEADINSYIDSVEYQQSFGDSIVPYYRGFQTVVGQ KTVGFPHFFQLYRGYANSDRAQSKPKGQLTWDLAKNLVSPIYPASAGTLTGTSTGRRG GNTYRIRLTQAASPNSTVIRQSISEVVVPFEQLSDRLQQLNRQGRKIISITLS" gene 2622..2810 /locus_tag="DP116_27675" CDS 2622..2810 /locus_tag="DP116_27675" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27675" /translation="MPVFEAFQADYAKASTHLQTLGSVLWNYVTKFGENNLVITLVWM HNYQIIFLLHHTKLKDSR" gene complement(2807..2908) /locus_tag="DP116_27680" /pseudo CDS complement(2807..2908) /locus_tag="DP116_27680" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015221642.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="phycocyanin subunit alpha" BASE COUNT 899 a 617 c 638 g 868 t ORIGIN 1 aatcataact aattaaccgg acacgatatg aaaagttttg ggcgaatata attcgctact 61 acacaggcaa agtctgcctg cgtagactaa taatttttaa tttgacctgc aaatgaaatg 121 agaagttgtc ctaagcttta tcagtttcag gcgtcaaggg ttcaatatta acgattttgc 181 caccgagtcg attaatccgc tggagttctt gatttaatcg gctaaacggt accgtaataa 241 acacactggc actggagcga atggcataat tgttattatc cgtgacctcg ttttgatgga 301 gaccaatcac ttcaaaccgg aaactccgat tcgctgtgat tgagttgttg cttctgccgc 361 taacaatttg ataaaccatg ataatatccc ggaaattaaa ggaaatttaa attccccctt 421 ttcgggggat tgacaaattt tataaatcat gatttaaaga ggcgtaatac tggcaatctt 481 gccattttgg cgttgaaccc tttgaaagta atctgatagc tgttcgtagg ggataacgac 541 tgcttggttg acgcggcgaa ccttgggata tcctggttta gaaatacttg ccacttcaac 601 tcgatagagt cgtcctgtgc caaatgcttg cgagcctcca aatgtgctgc tgggagtttc 661 ccctcttagc ggaggacgat aggcaaagcc attattgcta agcgcaggag caactaccgt 721 tgatgcacta tttttcgcca gttcactagc tagacgaggg gaattgcctg ggatttggga 781 gcgatcgctc gtcgcatacc cacgatacaa ctgaaaaatc cgagtaaagc ccacggtttt 841 ctgtccaggt tgagtcgtaa atccccgata gtagggtacg atattttcac caaaattaac 901 ttcgtactca gttgagtcaa tataggaatc aatgtcagcg tcgtatcctt tggtttggta 961 caaatccagg tgataggcta tctctgattc atcataagga gcacgaccta aaaaatgttt 1021 gtagttcaac tccgtgaaac ggctgtggaa actgttggag aaaaacttag ctttatacag 1081 ttgggatttc gccacctgac ggacaaattc tcgcacactt atcgagccat tcaacagcag 1141 agattctgca 
ccagtcaggc gttcggattt taataaataa tcattgccta aaacttgacg 1201 ataaacagcc gcgatcacag cagaaatatc tgtctggctg gcattagacc ataattcaat 1261 gggggcagca ttagtaaaag cagaggttcc gagtcgagag gcggctgttg taatagccat 1321 ctaaaaatct ccttgattaa aaacttttat tgaataaaga acttagaact cagttgtcag 1381 aactcagaat aaatagaaag attaatatcg gtataacttc tgtacaactt ttttaggttt 1441 taggttgcaa gtttcaggtt taagaagagt tactctcttc tcgttagaag aataccgaat 1501 ttaacgatta gcaacaagac gccaaggacg ccaagaaaat caaaaagaag ataggtaatc 1561 ttgcacgcct gatgggagta atgaacaaca atctttctcc tacctgatgc ctggtatttt 1621 ttttcctgca ttctgcgttc tgtgttctgc gttcttctta agaaagagtg atgctaataa 1681 tcttgcgtcc ctggcgattg agttgttgca atcgatctga caactgttca aaagggacaa 1741 cgacttcact aatactctgg cggatgactg tagagttagg agaagccgcc tgcgttaggc 1801 gaattcgata agtgttacca ccacgacgac ctgttgaagt gccagttaag gtgccagcag 1861 aagcaggata gatgggggat accaggtttt ttgctaaatc ccaagttaat tgtcctttgg 1921 gcttgctttg ggcgcgatcg ctattcgcgt aacctcgata caactggaaa aaatgcggaa 1981 acccgactgt tttttgtcct accacggttt ggaagccccg atagtaaggc acaatggaat 2041 ctccaaagct ttgttgatac tcgacagaat caatgtagga attaatatca gcttcgtaac 2101 cctgagaatt gtacaggtca acgtggtagg caatttctga ctcgtcgtaa ggagcacgac 2161 ccaaaagatg cttataattc aattcaatga agcgaacttg agagttggaa tagaaaaact 2221 tttgtcgata tagttctgat tgagcgattg cacgcacaaa atcccgtagt gtaatctgtc 2281 cctgacgcaa gagtgattct gcactcaaaa gacgctcact ttgcatcaga tgttcattgc 2341 ccaaaacttg tctataggtc gcccaaatga ccgcttggac atcagcttct gtcctcacag 2401 gacgcagttc aactttttca gcatctgcaa atgggctaat tcctaatcgg cttgcttctc 2461 ccaacgctgc cataattcca cctcttaaat ctcgattaca caactgcgaa tctggttaaa 2521 aagacatttt aaaaaaaacc cttgcgtagt cattgaaaag ccagaaaact ccaacagcat 2581 aagtatctca gtcagtttga attgacagtt aacacataat gatgccagtg tttgaagcct 2641 ttcaggctga ctacgcaaaa gcctcgacgc atttgcaaac tttaggttca gttttgtgga 2701 actatgtaac gaagtttggt gaaaataatc tggtaataac attagtttgg atgcacaatt 2761 accaaattat tttcttgctg catcacacaa aactgaaaga ctcccgttag gtcaaagcat 2821 tgatcacata atcaatgtag 
gagttcgctt caacagcggg gtcacctgat aaaccatgat 2881 ttgccttgat gtgtttgagg gcttcgatat tttcttcaag taaatctagt atttgactga 2941 aaaaataagt ctgtaacctt taccatcagt catgtgtgtg ctcaaaagat attttttaaa 3001 ctgagttatt aatatttagt at // LOCUS NODE_7570_length_2995_cov_2.6231292995 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2995) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2995) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2995 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..348) /locus_tag="DP116_27685" CDS complement(<1..348) /locus_tag="DP116_27685" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27685" /translation="MTTIANAYHVADSDAPIFFPVSRAKFILMIVVTLGWYAFYCSYR NWRYVQIERRRNVSAALRALFSILFMYSLAEEIRIMAHKYDVESDVKPVFIGAGFAFI LICAQLGRFYAPLY" gene 772..844 /locus_tag="DP116_27690" tRNA 772..844 /locus_tag="DP116_27690" /product="tRNA-Ala" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:805..807,aa:Ala,seq:tgc) gene 871..1221 /locus_tag="DP116_27695" /pseudo CDS 871..1221 /locus_tag="DP116_27695" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: GeneMarkS+." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 1056..1065 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" misc_feature 1169..2991 /note="possible 23S ribosomal RNA but 16S or 23S rRNA prediction is too short" assembly_gap 2762..2771 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 898 a 620 c 795 g 662 t 20 others ORIGIN 1 atataaaggc gcatagaatc ttcccagttg cgcgcaaatt agaatgaacg cgaaacctgc 61 gccaataaaa acgggcttga cgtccgattc cacatcatat ttgtgcgcca tgatgcgaat 121 ttcttccgcc agggaataca taaataggat cgagaataat gcgcgtaacg ccgcactgac 181 gtttcgtcgt ctctcaatct gcacgtagcg ccagtttcgg tatgagcaat aaaatgcata 241 ccagccgagc gtgacgacaa tcatcagaat aaacttcgct ctggacactg gaaagaagat 301 tggtgcatcc gaatccgcaa cgtggtaagc gtttgcgatg gttgtcattt tggcaccaga 361 ttcggtgata agtgatttgc gacagtttct gcaattccca tttgatgcgg gttgcggcgt 421 attgcaatgc tgacaagtca gttctttcaa tcagaatatt ccatataata tgaatgcgca 481 gggctttttt tcaactggga gcaagtgctg gaagccgcac accatcaggg tttcagaggg 541 tactaaattg acttccttaa cgcgaacaaa tcgaccctta aaactttgtt tgcatgccca 601 cacttttcag attttcttgt attatagaga agtagtcatc actgggcttc ctaatcgcgc 661 gtcaagccga ttgagttcga agccgatagt gatgtgaaga aagttcaaaa tgtacgctcg 721 aagctaacga gcacaggcaa 
cgagcctagt acaattagaa cccgcatgga tgggggcttg 781 gctcagctgg tagagcgtct cctttgcacg gagaaggtca gcggttcgac tccgctagcc 841 tccagaattc agtcggatag cgcttcttaa ttgaagctcg actccgaaag cgacaagaaa 901 aactctgaag taagattcag aaaaaacctt gtaaaagcgc tgattcatct gatagaattg 961 atgactcgcg aaggtttgga aaaaccaaac gcgacagccc aaccggaaaa gggcattaaa 1021 caaaaccggg tttgtaccct gaaaactgca tattgnnnnn nnnnngtccg tatcctagaa 1081 gttcttcgga acttcaaaga tatccacaga attaatttgt ctggaatcct cgacaatgtc 1141 ctgttgttcg agaaaaagat ccagactttg gtcaagctac aaagagcgca cggtggatac 1201 ctaggcgtaa gcagccgatg aaggacgcag taaacagcga aatgccacgg gaagtcgtta 1261 cagactaaga tccgtgggtc tccgaatggg gcaaccccac cagagttatg tctggtgacg 1321 tctatctgaa tacataggat agaacgaggt aagtccggga actgaaacat cttagtaccg 1381 ggaagaaaat aaaacaaaac agtgattccg ttagtagcgg cgagtgaaag cggaacagcc 1441 caaacctaat taacgcaagt tatttagggg ttgcgggact ccatcatgag atgggatgac 1501 gatagttgaa gtgtttggga aggcacacca tagacggtga cagtccggta aacgaaattg 1561 tctgaacctc aggagtatcc caagtaatac gggacacgcg aaaccctgta tgaatctgcc 1621 gggaccatcc ggtaaggcta aatactgact tacgaccgat agtgaacaag taccgcgagg 1681 gaaaggtgaa aagaaccccg gtgaggggag tgaaatagaa catgaaaccg tgtgcttaca 1741 agcaattgga gcacttttag atagtgtgac agtgtgcctg ttgaagaatg agccggcgag 1801 ttaccttatg tagttggtta agcagttaat atgcgaagcc acagggaaac cgagtctgat 1861 aagggcgaca tataattaca tgaggtagac ccgaacccgg gtgatctata tatggccagg 1921 atgaagcgtg tgtaaaaatg cgtagaggtc cgaaccaact gatgttgaaa aatcagtgga 1981 tgagctgtgt ataggggtga aatgccaatc gaacccggag ctagctggtt ctccccgaaa 2041 ggcgttgagg caccgcgtgg gacgtatctt acagggggta gagcactagt tcggtgcggg 2101 cggagaaaat ccgtaccaaa tcgagttaaa ctccgaatac ctgtaacgta atatcctgca 2161 gtgagagtgc gagagataag ttccgtactc aagagggaaa cagcccagac taccgactaa 2221 ggtcccaaag tacatgctaa gtgattaagg aggtggagtt gcatagacaa ccaggatgtt 2281 ggcttagaag cagccaccat tcaaagaaag cgtaatagct cactggtcaa gcgactctgc 2341 gccgaaaata tacggggcta agcatgtcac cgaaatcgta gactctattt atttagagtg 2401 gtaggggagc gttgtgttgt agggtgaagc 
atgagcgatg agcacatgtg gacgaagcac 2461 aagtgagact gtcggcttaa gtagcgaaaa cattggtgag aatccaatgc accgaaagac 2521 taagggttcc cacggcaggt tcgtccacgt ggggttagtc gggagctaag gcgaggccga 2581 gaggcgtagc cgatgcacaa cgggttaata ttcccgtaca accgtaatat cgttaggagc 2641 taagtgggga cgcaggaggc tatatggagc ggcttaatgg attgccgtcc aaaggcgtag 2701 ctagttgatg taggcaaatc cgcattgacg ttataaagtg aggccgagta ggaaactgtc 2761 tnnnnnnnnn naattactta cgagtgaaat ttcatagatg ccacgctgcc gagaaaagcc 2821 gcgaaagcca gatattaagg ttcccgtacc cgaaaccgac acaggtagtc aggtagagaa 2881 taccaaggtg cgcgagataa ctctctctaa ggaactcggc aaaataacct cgtaacttcg 2941 gaagaagagg tgtcatgata gggtgtaaga cttcgcgtct gaagcctgag atgat // LOCUS NODE_7584_length_2988_cov_5.1902492988 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2988 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 225..836 /locus_tag="DP116_27700" CDS 225..836 /locus_tag="DP116_27700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016864020.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_27700" /translation="MTQAQYKLVTFEEFAKWKPEDGHYELHGGVIVKMPQPLGGHEEI TGFLSIQLGVQCYQLGLPYLIPKTALVKPPESESAYSPDVLLVNRENLINEELWKNES TVTQAASIPLVVEVVSTNWRDDYLKKYADYEEMGIPEYWIVDYAANGGREFIGKPKQP TILICSLDEGEYQISKFRGDERIQSPTFPQLNLTVQQIFEAGK" gene 954..2804 /locus_tag="DP116_27705" CDS 954..2804 /locus_tag="DP116_27705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749041.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27705" /translation="MSDSLPLREKYLALIDEIIQTTLKGKISSQEQVYQMLIKGVTAD TGEVFELALSDRLNSTQHQVDTQKDELKQAKATRSLRAIKTIQSQWQRAQQQNKATEA IVSGFREVTTAEVGNRLAAFLRTLDFNRKNPLNLQQIQQVSKSLQQFAQADSDFQEIS DGINRGIVSWQRLQDHLVSWMYEQSRESLGFGGVPGERGPWATWAKQVNSEFPKTLFR YLALEKSAIEFAQQHSSVSLSHWVEMAVILQYLQRGLVNWFDQQPYNVKAGSKLSIST FLTFAVIWSQLANGFGSASSIYSAGCSQVMLQILRTFAQRSYFPLYGGIFASFSGQYL RDALNYLDEPLRQVEGTQEKARILTLLGSSQRAFGQYKRSVEFHQQALEIARSAGDRA CEIANLNHLSRTHVAQIMYAEAIDFSQRALILSRQTGDRTGEANALVNLGYSEVMQAQ QLEQLEPEVYETAINYLQQGLTLSERLGDVQSKALCFSSLGVAYVIIEQPQTAIKYLE DGFQAAQISGDLYLQGLNLSNLGEAYYYLPDQEKAIYAGCLGMYLLNQIASNEWQKSA SWLIILQGQIGEESFRNFLQKNRSKIIAVIGVDGYDYIPQLLEEYRNPKN" BASE COUNT 924 a 609 c 665 g 790 t ORIGIN 1 aaaggtgtga ctgctgacac gggggaagtt tttgaactgg ctttgagcga tcgccggagg 61 cgcagctagc ttaacgctcc gctgctgcgc tcagaagcag aacgtccaat gccgacgcgg 121 ctaaacgtaa ttccccatac tatataatta tttagtaaca aacaaaaaaa ataactatta 181 aatttcaaac ctcatgtacg cctaaaaaat aaaaagaaaa ataaatgact caagctcaat 241 acaaactagt cacatttgaa gaatttgcaa aatggaaacc agaggatggg cattacgaac 301 tgcatggtgg agtcattgtt aaaatgccgc aaccattagg aggacatgaa gaaatcacag 361 gttttttatc gatacaacta ggtgtccaat gttatcaatt aggtcttcct tacctcatcc 421 ccaaaacagc attggtcaaa ccaccagaaa gtgaatcagc ttattcacca gatgtactgt 481 tagtaaaccg tgaaaatctt atcaacgaag aactatggaa aaacgaatcg actgttactc 541 
aagctgcatc tattccttta gtggttgaag tcgtcagtac caactggcgt gatgattatc 601 tcaaaaaata tgctgattat gaagaaatgg gaattcctga atattggatt gttgattatg 661 cagctaatgg tggcagagaa tttattggca agcctaagca accaactatc ttgatttgct 721 ctttggacga gggtgagtat caaattagca agtttcgagg cgatgagcgc atacagtcac 781 caacattccc acaattgaat ttgacagttc aacaaatttt tgaagctggg aaatgatcaa 841 gcaagaactc accaactcac gagtcgtttg agttgaaatc ttggcacggt cttgttctgc 901 aacttaaccc agacaggaat taaaattgac atatccctat tctggttgct aaagtgtctg 961 actctctgcc gttgcgcgaa aagtatctcg ccctcatcga cgaaattatc caaaccaccc 1021 tcaagggcaa gattagctct caagagcagg tttatcaaat gttgatcaaa ggtgtgactg 1081 ctgacacggg ggaagttttt gaactggctt tgagcgatcg cctcaattcc acccaacacc 1141 aagtagacac ccaaaaagac gaactcaagc aagccaaagc aacccgcagt ctcagggcaa 1201 tcaaaacaat tcaaagtcaa tggcaacgcg ctcaacagca aaataaagcc actgaagcta 1261 ttgtctcagg atttagagaa gtgacaacag cggaagtcgg gaatcgtctg gctgcattct 1321 tacgcactct cgacttcaac cgcaaaaatc cactgaattt gcaacaaatc cagcaagtct 1381 caaagtcttt acaacaattt gctcaagctg attcggattt tcaggaaatc tcagatggta 1441 taaaccgtgg catagttagt tggcaaagac tgcaagacca cttagtcagc tggatgtacg 1501 agcaaagccg ggaatcactg ggatttggtg gtgtaccggg agaacgtggt ccttgggcaa 1561 cttgggcgaa gcaagtgaat agtgagtttc ccaaaacact gtttcgctac ttagctttgg 1621 agaagtctgc aattgaattt gcacaacagc atagcagtgt cagcctgagt cattgggtgg 1681 agatggcagt tattttgcag tacttgcaac gggggttagt caactggttt gaccaacagc 1741 cttacaatgt taaagcaggc tctaagttgt cgatttcaac tttcttgaca tttgcggtga 1801 tttggagtca actggcaaat ggttttggta gtgcgtcgtc aatatacagc gctggttgtt 1861 ctcaagtcat gctgcaaatt ctgcgaacct ttgcacagcg ttcatacttt cccttgtacg 1921 ggggaatttt cgcctctttt tctggtcagt acttacgaga tgcgctgaat tatttggatg 1981 aaccgttgcg tcaagtcgaa ggaactcaag aaaaagcacg gattttgaca cttttaggct 2041 cttctcaacg agcattcggg caatataagc gttctgtgga gtttcatcag caggctttgg 2101 agatagcgcg cagtgcaggc gatcgcgctt gtgaaattgc caatctcaac cacctcagcc 2161 gtacccacgt tgcccaaata atgtatgctg aagcaattga ctttagtcaa cgggcattga 2221 tattgagtcg 
gcaaacaggc gatcgcacag gtgaagcaaa tgccctggta aacttgggat 2281 acagcgaagt catgcaagcc cagcaactgg aacaactaga accagaggtt tatgaaacag 2341 caattaacta cttacagcaa ggtttaacac tctcagaacg actgggcgat gttcaaagta 2401 aggctttgtg ttttagtagt ttgggtgtag cctatgtcat catcgaacaa cctcaaacag 2461 ctattaaata tttggaagac ggcttccaag cagcacaaat ttctggtgat ttgtatcttc 2521 aaggtttgaa cttatcgaat ttaggcgaag cttattatta tttacccgac caagaaaaag 2581 caatttatgc tggctgtttg ggaatgtatt tattaaacca aattgcttca aatgaatggc 2641 aaaaatccgc gagttggctc ataattttac aaggacaaat aggggaagaa tcattcagaa 2701 atttcttgca gaaaaatcgt tctaaaatca ttgctgtcat aggtgtagat gggtatgatt 2761 acatcccgca gctattggaa gaatatagga atccgaagaa ctgatttgtc aaaacttaag 2821 ggtgtaaggg ggtaagggtg taggggtgta ggggaacctg atacagcaga attcataagt 2881 cataagtcag aagtattata gaatggactt tttagagatt tgaaatggtt gccctattga 2941 cgccgtgatg tactagattc tttcccttac acccctaaac ccctacac // LOCUS NODE_7836_length_2842_cov_4.1019022842 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2842) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2842) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2842 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 472..1368 /locus_tag="DP116_27710" CDS 472..1368 /locus_tag="DP116_27710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015208400.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferredoxin:protochlorophyllide reductase (ATP-dependent) iron-sulfur ATP-binding protein" /protein_id="PRJNA477356:DP116_27710" /translation="MTKSYKRQTTVKLAVYGKGGIGKSTTSCNISVALAKRGKKVLQI GCDPKHDSTFTLTGFLIPTIIDTLQEKDYHYEDVWPEDVIYKGYGGVDCVEAGGPPAG AGCGGYVVGETVKLLKELNAFDEYDVILFDVLGDVVCGGFAAPLNYADYCMIVTDNGF DALFAANRIAASVREKARTHPLRLAGLIGNRTSKRDLIDKYVETVPMPVLEVLPLIED IRVSRVKGKTLFEMAESDPCLNYVCDYYLNIADQILARPEGVVPHDAPDRDVFALLSD FYLNPGKQQVPNPEEELDLMIV" gene 1398..2129 /locus_tag="DP116_27715" CDS 1398..2129 /locus_tag="DP116_27715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867077.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27715" /translation="MAFFDSFTESLKQKWLQFFQVNRDWISLHMEVESVYTPDGGSRP PSYLILGVINALEPKLAQLMMPFSKLNPDADTLVEVLGLNFDPDLALGNRALPKLRIE KHVEESDFSSENLSDETLTNSPRVLNPSRDNVTQAKEFQGTQHGIDNSSSNQKISNSE FYKAQNSGDEFSNLSFDDLKEADSKTFTATAADQSGFADALSDSWGSETVTQKGEPDN ETLLGEEKSSGSLNESEIDRLFPKT" gene 2286..>2842 /locus_tag="DP116_27720" CDS 2286..>2842 /locus_tag="DP116_27720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017744459.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ferredoxin:protochlorophyllide reductase (ATP-dependent) subunit N" /protein_id="PRJNA477356:DP116_27720" /translation="MTLAQPESLTFECETGNYHTFCPISCVAWLYQKIEDSFFLVIGT KTCGYFLQNAMGVMIFAEPRYAMAELEEGDISAQLNDYEELKRLCVQIKRDRNPSVIV WIGTCTTEIIKMDLEGLAPKLEGEIGIPIVVARANGLDYAFTQGEDTVLAAMATRCPD KAPVVEAEKNASTTGALSGQRVAIAA" BASE COUNT 795 a 597 c 638 g 812 t ORIGIN 1 atatgtaata tgtaataatt ttttacattg gatctttgcc cagagggata aggtttcata 61 taagcctttc acgcttagca caaagcgcaa gcgacgtatg aaagaaacac catccctcaa 121 tacggtttga ttaagaaaga gatccccgca ggcgttctga tcttatccga accgtatagc 181 accattcccc ttcttccctg ttccctgtta agcgttccct gttaagcgtt ccctgttccc 241 tctatttcac atacgtcttt ttaataaaaa ctatccaaaa taatcaacag tcaaattctt 301 ctcattaaac catcactcat ttcatagaac cctaatatac ttgacagccc cctttatttt 361 ctgggtaact tacacaataa tgacagtcat gagacctcac atatcgccat ctcaggctag 421 gtgaaggttc accctgagcc cagccgaagg gtaagtgttt aagccacgag tttgacaaaa 481 agctataaga ggcaaacaac agtgaaacta gcagtttacg gaaaaggcgg tatcgggaaa 541 tccacaacaa gctgtaatat ctccgtcgcc ctagccaagc gcggtaagaa agtgttgcag 601 attggttgcg atcccaaaca cgatagcacc ttcaccctga ccgggttctt gattccaacg 661 attatcgaca ctctccaaga aaaggactac cactacgaag atgtgtggcc cgaagatgtc 721 atctataagg gttacggtgg tgtggattgt gttgaagcag gtggtccccc agcaggtgct 781 ggatgcggcg gttatgtggt aggcgaaact gtgaaactct taaaggaact caacgccttt 841 gatgagtacg atgtgatttt gtttgacgtt cttggtgacg ttgtttgcgg tggttttgca 901 gcacccctca actacgctga ttactgtatg attgtgactg acaatggctt tgatgccttg 961 ttcgctgcta atcgtattgc ggcttcagtt agggagaaag cccggactca cccgctgcgt 1021 cttgctggct taattggtaa ccgcacctca aagcgcgact tgattgacaa gtacgtggaa 1081 acagttccga tgccagtttt agaagttttg cctttgattg aagatattcg tgtttcccgc 1141 gtcaaaggta agactttgtt tgaaatggca gaatctgacc cctgtttaaa ttacgtttgc 1201 gattactatc tcaacatcgc tgaccaaatt ttggctcgtc cagaaggtgt tgtaccacat 1261 gatgccccag atcgcgatgt gttcgctttg ttatccgatt tctatttaaa tccgggtaaa 1321 caacaagttc ctaatccaga agaagaatta gacttgatga ttgtataaat 
catcttaatt 1381 ccaggatggg ggacaatatg gctttttttg atagttttac agaatcttta aagcagaagt 1441 ggttgcaatt tttccaggtg aatcgtgact ggatttcctt gcacatggag gtagaatctg 1501 tttacactcc cgatggcggg agtcgtccac cttcttacct catcttggga gttattaatg 1561 cgttggaacc caagctggcg cagttaatga tgccattttc caaattgaat cctgatgctg 1621 acaccttggt tgaggtgttg gggttaaatt tcgatcctga tcttgctctt ggtaaccgtg 1681 ctcttccgaa gttacgtata gaaaaacacg tggaagagtc cgatttctca agtgaaaatt 1741 taagtgatga aacactgaca aattctccaa gggtactcaa tccaagcagg gataatgtta 1801 ctcaagcaaa ggagttccaa ggaactcaac acgggatcga taattcctca tctaaccaga 1861 aaatttcaaa tagtgagttt tacaaagcac aaaactcagg tgatgaattt agcaatcttt 1921 cctttgatga tttgaaggaa gcagatagca aaacttttac agcaaccgca gcagatcaga 1981 gtggctttgc tgatgctttg tcagattctt ggggtagcga aaccgtaacc caaaagggag 2041 aaccggataa cgaaacgctt ttaggggaag aaaaatcttc tggttctttg aatgaatcag 2101 agattgatcg tctgttcccc aagacttaac ttaacttaaa ttgaacttgt gttgaatctg 2161 ggtgtggctt gtgtcatgcc atagattatg gcagtttgaa gggaagtgtt cattcttcca 2221 catatagtct gccttacttt taactgcgct tttagcttaa aaaatcaagg ggagaaaata 2281 ctaaaatgac tcttgctcaa ccagaaagtt taacttttga gtgcgaaact ggaaattacc 2341 acactttttg cccgattagc tgcgtcgctt ggctttacca aaaaattgaa gatagcttct 2401 tcttggtgat tggtacaaaa acttgtggtt atttcttgca aaacgcaatg ggggtcatga 2461 tttttgctga accccgttat gcaatggcag agttggaaga aggagatatt tccgcacaac 2521 tgaatgatta tgaagagttg aagcggctgt gtgtgcagat aaagcgcgat cgcaatccca 2581 gtgtcattgt ctggattggc acctgcacca cggaaatcat caaaatggat ttggaaggtt 2641 tagcacccaa gctagaaggc gaaattggta ttcccattgt tgtcgcacgc gctaacggtt 2701 tggactacgc cttcactcaa ggagaagata ccgtgttagc tgcaatggcg acacgttgtc 2761 ctgataaagc gcctgtggta gaagcggaga aaaacgcttc taccacaggc gctttatcag 2821 gacaacgtgt cgccattgca gc // LOCUS NODE_7904_length_2810_cov_5.0606172810 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 2810)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2810)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2810
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(181..265) /locus_tag="DP116_27725" tRNA complement(181..265) /locus_tag="DP116_27725" /product="tRNA-Ser" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(229..231),aa:Ser,seq:tga) gene complement(342..1271) /locus_tag="DP116_27730" CDS complement(342..1271) /locus_tag="DP116_27730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131033.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TIGR04168 family protein" /protein_id="PRJNA477356:DP116_27730" /translation="MTSQKTNSKNLKIAVVGDVHDQWEEEDGIALKHLGVDLLLFVGD FGNESVEVVRAIASLDIPKAAVMGNHDAWYTATEWGRKKCPYDRTKEDWVQQQLDLLG ETQVGYGKLDFPDLNLTVVGGRPFSWGGPEWKYTDFYQQWYGVTSFEESTARIVSATK SAAYQTIIFIGHNGPTGLGDRPEDPCGKDWHPIGGDFGDPDLAEAISQTITAGKTIPL VTFGHMHHSLRHTKKEPRKRIFRSPEGIVYLNAASVPRIVEHGSHKQRNFSIVLLEDG VVSQVFLVWLGKDFSVVYEEILYQKPSPVVQTA" gene 1627..2601 /locus_tag="DP116_27735" CDS 1627..2601 /locus_tag="DP116_27735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012412656.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="quinolinate synthase" /protein_id="PRJNA477356:DP116_27735" /translation="MFTTALAQREKTEQGDLPLDLFAAIETLKQELNAVILAHYYQEP DIQDIADFIGDSLQLAKAAADTNADVIVFAGVHFMAETAKILNPDKMVLLPDLNAGCS LADSCPAEAFAAFKAAHPNHLVVSYINCSADIKAMSDIICTSSNAVKIVQQIPKYQPI IFAPDRNLGRYVMEKTGRDLVLWEGSCVVHETFSEKKIVQLKIAHPQAEAIAHPECET SVLRHANFIGSTAALLKYCEQSPCQEFIVATEPGIIHQMQKRAPYKRFIPAPPLNNCA CNECPFMRLNTLEKLYCVMKNRSAEIILPENIRITALRPIQRMLEMSV" BASE COUNT 814 a 646 c 576 g 774 t ORIGIN 1 tgaggctagg ggactctctc tattctatgt cctaaccttc ttggcaactg ctataaaaaa 61 cgcggaggaa gctataaaat tttaaggcaa acaacttaaa tctcgcctaa atattattac 121 aggtgaccaa atgcctaatt tgtgctgtta cgcccaatca agtcatcaca aatctttcaa 181 cggagagggg gagattcgaa ctcccggaac ctttcggttc atccgatttc aagtcggacg 241 caatcgacca ctctgccacc tctccagtaa agctgtcagc gtttagctgt cagcggtcag 301 ccaaaaataa agccaaccag atacgaatta tatctcattc tctatgcagt ttgcactact 361 ggactgggtt tttgataaag aatttcttca tatacaacgc tgaaatcctt tccaagccag 421 acgaggaaga cttgagagac aacaccatct tccaggagga caatggagaa gttacgctgt 481 ttatggctac catgttccac aatcctgggt acacttgcgg cgttcaagta aactatcccc 541 tctggacttc taaaaatgcg ttttcgcggc tctttctttg tgtgtcgcag gctgtggtgc 601 atatgaccaa atgtgacgag gggaatggtt ttaccagctg tgatggtttg agatatcgcc 661 tctgctaaat ctggatcgcc aaagtcacca cctatgggat gccaatcttt tccgcaggga 721 tcttctgggc gatcgcccaa ccctgtagga ccattgtgac ctataaaaat tatcgtctgg 781 taagccgcac tttttgtggc tgacacaatg cgggctgtgg actcttcaaa actggtcaca 841 ccataccact gttggtagaa atcagtatat ttccactctg gtccacccca gctaaaggga 901 cgacctccaa caaccgttaa atttagatca ggaaaatcta gcttaccgta accgacttgg 961 gtttcaccca ataaatctag ttgctgctgt acccagtctt ccttggtgcg atcgtaagga 1021 cacttcttac gtccccattc tgtggcagtg taccaggcat cgtggtttcc catgactgct 1081 gctttgggaa tgtcaagaga tgcgatcgct cttaccactt ccaccgactc attgccaaaa 1141 tctccaacaa acagtagtaa gtcaacacca agatgcttga gtgctatgcc atcttcttct 1201 tcccattggt cgtgaacatc tcctactaca gcaattttca aattttttga gtttgttttc 1261 tgactggtca tgccactttc cttgccatct 
gtttccagca taggaaaatt acaggatttt 1321 gaccaagaga gaacgcacaa ctcagaaaaa taaaagaagt ttgaatatca cggattgctg 1381 gatgagttca taagaagtgc aatgtaggaa tgtacaagcc cccaggaaat ctttgatgca 1441 tctggccaac acgctctctt cgcgggtcac caagccccca taagaagtgc ccaagctgag 1501 ttctgcgttc tgagttcttc ttcacacaat ttcacagagg agcattaccc ctcaaacatt 1561 tttggtaaaa actactataa ttaaatccac cagacaaaaa cttgtaacat tcagcaacgt 1621 ctaactgtgt ttacgactgc acttgctcaa cgagaaaaaa ccgaacaggg tgatctacca 1681 ctagatttat ttgcagcaat tgagactctt aaacaagaac tcaatgcggt tatcctggca 1741 cattactatc aagaacccga cattcaagat atagcggatt ttattgggga ttcgctacag 1801 cttgccaaag cagcagctga tacaaatgca gatgtcattg tctttgctgg tgtccatttc 1861 atggcagaaa cggcaaagat tcttaatcct gacaagatgg ttcttttacc agatttaaat 1921 gctggttgtt ctttagctga tagttgtcca gcagaggcgt ttgcagcttt taaagcggcg 1981 cacccaaacc atttggtggt ctcttacatc aactgctctg ctgatattaa ggcaatgagc 2041 gatattatct gcactagttc caacgctgtc aaaattgtgc agcagattcc caaatatcag 2101 ccaattattt ttgccccaga ccggaattta ggacggtatg ttatggaaaa aactgggaga 2161 gatttagtgc tgtgggaagg aagttgcgtt gtccatgaaa ccttttctga aaagaaaatt 2221 gtgcagttaa aaatcgcaca cccacaagca gaggcgatcg cccatccaga atgtgaaact 2281 agtgtcttgc gccacgcaaa ttttatcgga tctacagcgg ctttacttaa gtattgtgaa 2341 caaagccctt gccaagaatt tatcgttgct acagagcctg gaatcattca tcaaatgcag 2401 aaacgagcgc cttacaagcg ctttatacca gcaccgccat taaataactg cgcgtgtaac 2461 gagtgtcctt ttatgcggtt aaataccttg gaaaagctct actgtgtgat gaaaaatcgc 2521 tcagccgaaa tcatcttgcc agaaaatatt cgcataacag cattgcgacc aattcaacga 2581 atgctagaga tgagtgtata aaactcttat gaaaataact acttaggcac tgctagagac 2641 ggttagtgtc agttccaagg aaactttata gctcaactgg tgcaaacgtt tactcagtaa 2701 attagtgagt actcagtgta tgccatccat cagtttttgg atggaggttt gcagtaaaaa 2761 ggtaaaaacg aacaaaaggg tgttgcttaa actagcaaca cccttttgta // LOCUS NODE_7981_length_2766_cov_3.9336042766 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION
VERSION
DBLINK      BioProject: PRJNA477356
            BioSample: SAMN09468270
KEYWORDS    WGS.
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 2766)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2766)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
source 1..2766 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(52..258) /locus_tag="DP116_27740" /pseudo CDS complement(52..258) /locus_tag="DP116_27740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017745916.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(427..1617) /locus_tag="DP116_27745" CDS complement(427..1617) /locus_tag="DP116_27745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015131094.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 1 protein" /protein_id="PRJNA477356:DP116_27745" /translation="MKILVLSWEFPPRIVGGIARHVGELYPELVKLGHEIHLITPEFG QAPMYEIVEGVKVHRVPVASGNDFFHWVVNLNQSFGHHGGKLMLEEGPFDIIHAHDWL VGDAAIALKHSFKVPLIATIHATEYGRYNGVHTDIQRYINAKEELFAFNAWRIIVCTD YMRREVERALHSPWNKIDVIYNGIRPEKKQHHQDFHALDFRRQFAEDGEKIVYYVGRM TYEKGVPLLLSAAPKVLSQMGGYVKFVIIGGGNTDHLKKQARDMGIWNKCYFTGFISD EYLDKFQTIADCAVFPSLYEPFGIVALESFASRVPVVVSDTGGFPEVVQHTKTGVVTY KNNPDSLAWGILEVLKNPGYRQWLVDNAYEDLERRFSWSKLAKQTEDVYKRVVEERSQ VTWL" gene complement(1814..2440) /locus_tag="DP116_27750" CDS complement(1814..2440) /locus_tag="DP116_27750" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015191048.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27750" /translation="MWIHRNALALQVEHLALETRHSRWPHRFWAVCLSLSLLFSFTSS SSSLNRAFYFLLPDSFNPLVYVSLPQQWGHVLQFRPMLAQIPPDASISATTYLVPHLS SRREIIRLPALELRNDARQVMKVDYAIADLWQLKQYQPAFKEERNLLRELVQLFDQLT KTQEYGVIDFRDGVILLQKAATPDPQATAAWVAFRQQLERELTQNRAR" BASE COUNT 810 a 610 c 529 g 817 t ORIGIN 1 acaacaataa attacaccgt tttatggagt tttactctgt tttacgagtc tccctagagg 61 aaccatcatc gggaaacttt gctttgcttg tgtgacgcga tacgcctgcg gcgttaagca 121 aagcttatcg cgttctgctg gcgataaagg taaccagtct tgtgcttgat gcgatcgcct 181 taaactttcg attaactcgt ctttccagcc tacttccgtt gaactgttgt gaaatgtatt 241 gatatctgtg tttgtcattt cccaaactct taacaggtga tagggatttt ttgattatcc 301 caaatttggt actccatccc ttgttgatac tcacactcat taaaggattc taggcgttca 361 tatgaatgat aaaggacagc taaaatttag ctgcccttga tgttaaaaat caaatatttt 421 aattatttat agccaggtca cttgcgatcg ctcttccacc acccgcttat acacatcctc 481 cgtctgctta gctaatttag accaactgaa gcgtctttct aaatcttcat aagcgttatc 541 aaccagccat tgtcgataac ctggattttt caacacttcc aaaatacccc aagccagaga 601 atcgggattg tttttgtaag tcactacacc tgttttggta tgttgcacca cctctggaaa 661 accacctgta tcagaaacca caacgggaac gcgagaggca aagctttcta atgcaacaat 721 accaaagggt tcataaaggc tggggaatac agcacaatcg gcaatggttt ggaatttatc 781 taagtactca tcagatataa aaccagtaaa atagcactta ttccaaattc ccatatccct 841 agcttgcttt ttgagatggt cggtattacc gccaccgata ataacaaatt taacgtaacc 901 ccccatttgg gacaacactt tgggagcagc actcagcagt agagggacac ctttttcata 961 ggtcatacga ccgacatagt aaacaatttt ctcaccatct tcggcaaatt gacggcgaaa 1021 atccagagcg tgaaaatctt gatggtgctg ttttttttct ggtcggatac cgttatagat 1081 aacatcaatt ttgttccacg gagagtgtaa cgcccgttct acttcccgcc gcatatagtc 1141 agtgcaaacg ataattcgcc aagcgttgaa agcaaaaagt tcttctttgg cattaatata 1201 acgttgaata tcagtgtgaa caccgttgta gcgtccgtat tcagttgcgt gaatagtggc 1261 aatgaggggg actttgaagc tgtgcttgag ggcgatcgcc gcatccccaa caagccaatc 1321 atgcgcgtga ataatatcaa acggtccttc ctccagcatc aacttaccgc cgtgatgccc 1381 aaagctttgg 
ttcagattaa caacccaatg aaaaaagtca ttaccagaag ccacaggtac 1441 acgatgaacc ttgactcctt caacgatttc atacatcggt gcttgaccaa actctggtgt 1501 aatcaagtgg atttcatgtc ctaacttgac cagttccgga tataactctc ccacatgacg 1561 agcaattccc ccaactattc gcggcggaaa ctcccaactt aacaccaata tcttcataga 1621 cttatatgcc caacccttta caaatctata atcttctaca tttatctcaa gatacaagtt 1681 tgactactat aacaagtccc aaaatggaat ttttttgatt ttttttaaca ttgttccgaa 1741 aaaagtagtg gtagaaagac tttcaaaaga aaagtcagaa ctcaaaattg ttttttggct 1801 cttggtttct ttattatctt gccctgtttt gtgttaattc tctctcaagc tgttggcgaa 1861 aagctaccca agctgctgtg gcttgtggat caggggttgc agctttttga agtaaaataa 1921 cgccatctct aaagtcaatc actccgtatt cctgagtctt cgtcagctgg tcaaacagtt 1981 gtacgagttc ccgtaaaaga ttacgctctt ccttaaatgc tggttgatac tgttttaact 2041 gccacagatc tgcgatcgca taatcaactt tcatcacctg acgtgcatcg tttcgcagtt 2101 ccagtgcagg taagcgaatg atttcccggc gactagaaag atgagggaca aggtaagtgg 2161 ttgcagagat actcgcatct ggtggaattt gtgccagcat tgggcgaaac tgcaacacat 2221 gaccccattg ttgaggtaaa gacacataca ccaaaggatt aaacgaatct ggtaataaaa 2281 aataaaacgc tctatttaag ctagaactac tggaagtgaa ggaaaatagt aaagaaagac 2341 tcagacaaac tgcccaaaaa cggtgaggcc agcgcgagtg acgggtttcc aacgccagat 2401 gctccacttg gagagccagt gcgttgcggt gaatccagat gccaagtgag ggaaaccctc 2461 ctcatgcgca ctggactcac tgttcactgt tcactgttcc ctgttcactg attttggata 2521 ctaaattcta caaagttttc aaaatatagc agatctaata aatgatttgt gagaacctct 2581 atttatgtct tttctgacca aagattgcgg ggtacctttc gggcgtctac gtgtcatttg 2641 tggtttttga aaaaaatatc acaactccaa tcgaatgatt ctatgaaatc tctatttata 2701 gttttataaa aaatatttgt aaaatcatta gcttttttta tttggaaaac atagtaatta 2761 ttatct // LOCUS NODE_8005_length_2751_cov_2.5619442751 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE      Brasilonema bromeliae SPC951
  ORGANISM  Brasilonema bromeliae SPC951
            Bacteria; Cyanobacteria; Nostocales; Scytonemataceae;
            Brasilonema.
REFERENCE   1  (bases 1 to 2751)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2751)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2751
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..>2751 /locus_tag="DP116_27755" CDS <1..>2751 /locus_tag="DP116_27755" /inference="COORDINATES: protein motif:HMM:TIGR00229" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016878090.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="histidine kinase" /protein_id="PRJNA477356:DP116_27755" /translation="RQTQDLLAANSRLEAEVTERKQTEIALRESEERSRALIENLPGG AMFVVDRNLRYLVAEGEALSAAGFKPEDLVGRTIFEVLPPELAAYYEGLYRRGLAGEP FEHEHKAHNRSFISRGTPLRSTDGEVYAVLAVSYDISECKRVEDERALAEAAIASDLQ DTQLLRDLSARLTTEADIQVLYDEIMATAIALMRADAGSMQFLDETTQELVLIATQGF SQAMTDDCQRLNASSNTSCGRALAAGERTFIDFDVPDTDDPDGSLRLLINAGYFSGQS TPLISRSGNLIGMVSTHWRRHHRPSDRELRFLDLLVRQAADLIEQRQAAAALRESEEK YRTLVSATSKMVWTADAKGNIIAEVPGWEQFTGQTFDEYKNFGWLAVLHPDDVEQTRQ IWQTAIAHHTTAIAEYRLRTMAGDYRRVAVRGEPLLNADGNLRGWVGTITDIEDMRRT TEAEQAALHQLRESEEKYRTLFDSIDEGFSLLEIIYNQQGAAIDVRYRDTNRAFERHT GIKDAEGKTLRELIPNIESSYWINTWNRVARTGEPERLENYVQETDHWFNVYTSRIGG EGSNMVAVVFNDITERKRTETNLAFLAEVSQDLTRLTNIDETMDAIGAKIGGHFNVVR VVFAEISEDQQTGRVSHEWHQPDLPDMKGSYATKDYFSPELELLHRAGEIAVVRDTAK DEQIDGERYAALGVGAFVGVPLIRQDKWRFYFSLLDSKPHDWRDDEIELLRELTTRIW TRLERARAEEDLAQSEEKYRTLFDSIDEGFVLFELIYDENEQVVDHRYLEVNQVFERQ TGLENAAGKLASEIAYVEPCWLETYDRVVQTGEPVRFENYSQATGRWYATYASRVGKA GSRQVIAIFDDITDRKRAEEAMRVFFSNVSHEFRTPLTLLLSSIQETSNDPAHPLSSA QRS" BASE COUNT 665 a 730 c 768 g 588 t ORIGIN 1 gcgacaaacc caggacttgc tggcagcaaa ttccagattg gaagctgagg tcactgagcg 61 caagcagaca gagattgctt tgcgcgaatc cgaagaacgt tcgcgcgctt tgattgagaa 121 tctgccaggt ggggctatgt ttgtcgtcga tcgcaatctc cgctacttgg tagctgaagg 181 agaagcgttg tccgctgctg gattcaagcc ggaagatctc gttgggcgga caattttcga 241 ggtgctacca cccgaattgg cagcatatta tgaggggctt taccgtcggg ggctggctgg 301 tgagccgttt gagcatgagc acaaggctca caatcgctca 
ttcatctcac gtggaacgcc 361 gctacgttcg accgacggtg aagtttatgc agtgcttgct gtttcctatg atattagcga 421 atgcaagcgc gtcgaagacg aacgcgcatt agccgaagct gcgatcgcaa gcgacttaca 481 agatacacaa ctgttgcgtg acctgagcgc acgactcacc actgaagccg atattcaggt 541 gctttatgac gagattatgg caaccgcgat cgccctcatg cgggcagatg ctggaagcat 601 gcaatttctt gatgaaacga cgcaagagtt agtgctgatc gctacgcagg gctttagtca 661 agcgatgacc gatgattgcc agcggcttaa cgcgagttca aacacgtctt gtggcagggc 721 attggcagcg ggcgagcgga cttttattga ttttgacgtg cctgatactg atgaccccga 781 tggctctctg cggctgctga taaacgcggg ctatttctct ggtcagtcca ctcccctaat 841 cagccgttcg ggtaacctga tcggcatggt ttccactcac tggcgcagac accatcgacc 901 gagcgatcgc gaactgcggt ttctcgattt actggttcgt caagccgccg acttgattga 961 gcagcgacag gcagcagcag ccctgcgcga gtcggaagaa aaatatcgca cgttagtttc 1021 cgccacttca aaaatggttt ggacggcgga cgcaaaaggc aacattattg ctgaagttcc 1081 gggttgggaa cagttcactg ggcaaacctt cgacgaatac aaaaattttg gctggcttgc 1141 tgtcctacat cctgacgacg tggaacaaac taggcaaatc tggcaaacgg cgatcgcaca 1201 tcatacgacc gcgatcgccg aatatcggct tcgcacgatg gcgggcgatt accgtcgtgt 1261 agctgtgcgc ggcgaacccc tattgaacgc tgacggaaat ttgcgcggtt gggttggcac 1321 catcaccgat atcgaagata tgcgtcgtac cacagaagcc gaacaagccg cgctccacca 1381 attgcgcgag agcgaagaaa aataccgcac gctgttcgat tcaattgacg aaggattttc 1441 cctgctcgaa atcatttaca accagcaggg cgcggcgatc gacgtgcgct accgcgacac 1501 gaaccgcgct ttcgagcggc atacgggcat caaggatgcc gaggggaaga ccctgcgcga 1561 attgattccg aatatcgaat catcgtactg gattaatact tggaaccgag ttgcgcgaac 1621 gggcgaacca gagcgtcttg agaattacgt gcaggaaaca gaccattggt tcaacgttta 1681 tacctcgcgc atcggcggcg aaggcagcaa catggtcgcc gttgtcttca acgacatcac 1741 cgagcgcaaa cgcacggaaa cgaatctcgc ctttttagcc gaagtcagtc aagacctcac 1801 gcgcctgacg aacattgacg aaacgatgga tgcgatcggc gcgaagatcg gcgggcattt 1861 caatgtcgtt cgcgtcgttt tcgccgaaat cagtgaagac caacagacgg gtcgcgtcag 1921 ccacgaatgg catcagccgg atttgccgga tatgaaaggt tcttacgcga caaaagacta 1981 tttctcgcct gaattggaac ttctccaccg cgccggagaa atcgccgtcg tccgcgatac 
2041 cgcgaaagac gagcagattg acggggagcg ttatgccgcg ctgggcgtag gcgcgttcgt 2101 cggcgtgccg ctcatccgcc aggacaagtg gcgtttctat ttcagtctgc tcgattccaa 2161 accgcatgac tggcgcgacg acgaaatcga gttgctgcgg gaattgacca cgcgcatctg 2221 gacacgcctc gaacgcgccc gcgccgaaga agacctcgcc cagtcagagg aaaaataccg 2281 cacgctgttc gattcgattg acgaaggctt tgtccttttc gagttgattt atgacgaaaa 2341 cgaacaggtg gttgatcacc gttatctgga agtcaaccaa gttttcgagc ggcagacggg 2401 actcgaaaac gcggcgggca aactcgccag cgagattgcc tacgtcgaac cgtgttggct 2461 cgaaacctat gaccgagtgg tgcaaacggg cgagccggtg cgctttgaaa attacagcca 2521 agcaaccgga cgctggtatg cgacttacgc ttcgcgcgtc ggcaaagcgg gcagccgtca 2581 agtcatcgcc attttcgacg acatcaccga tcgcaaacgt gccgaagaag ccatgcgagt 2641 atttttcagt aatgtgagtc atgagtttcg cactccgctg accctgttgt tgagttcaat 2701 tcaagagacc tcgaatgatc ctgcccatcc cttgagttct gcccagcgat c // LOCUS NODE_8039_length_2734_cov_5.4176932734 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2734) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2734) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
            3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2734
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            <1..198
                     /locus_tag="DP116_27760"
     CDS             <1..198
                     /locus_tag="DP116_27760"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017317199.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27760" /translation="YVTSVVFSPDGKTLASGSSDNTVKLWNMDLDDLLRRGCAWAHDY LKNNPNVKDDRHLCDDVLKRK" gene complement(270..474) /locus_tag="DP116_27765" /pseudo CDS complement(270..474) /locus_tag="DP116_27765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197472.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="CopG family transcriptional regulator" gene complement(642..980) /locus_tag="DP116_27770" CDS complement(642..980) /locus_tag="DP116_27770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017743966.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF86 domain-containing protein" /protein_id="PRJNA477356:DP116_27770" /translation="MRDDRLYLSNIFECIERIESYTRDGKEVFLQTTIIQDAVIRNFE IIGEATKRLSPEIRAVYPDVPWQQVAGFRDVLIHDYLKVNLNRVWGVIEQNLPQLKAT IEAILQELGK" gene complement(977..1267) /locus_tag="DP116_27775" CDS complement(977..1267) /locus_tag="DP116_27775" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115108.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase subunit beta" /protein_id="PRJNA477356:DP116_27775" /translation="MGIDEILKAYREEILRIAALYGAYNVRVFGSVARGEARLDSDVD FLVELEPQRTLLDQIALMQSLEELLGRKVDVTEPETLHELIRDKVLREAVAL" gene complement(1377..1626) /locus_tag="DP116_27780" /pseudo CDS complement(1377..1626) /locus_tag="DP116_27780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874181.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="UPF0175 family protein" gene complement(1704..1835) /locus_tag="DP116_27785" /pseudo CDS complement(1704..1835) /locus_tag="DP116_27785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019497445.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="XisI protein" gene 1869..2162 /locus_tag="DP116_27790" CDS 1869..2162 /locus_tag="DP116_27790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008054503.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase subunit beta" /protein_id="PRJNA477356:DP116_27790" /translation="MKLKEILQEKRAEIISIAAKHGAYNIRIFGSVARGEETPSSDID FLIDYDLNKITPWFPGGLVQELETLLNRKVDVVTTNSLHYFIQDKVLKEAISL" gene 2159..2500 /locus_tag="DP116_27795" CDS 2159..2500 /locus_tag="DP116_27795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876002.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF86 domain-containing protein" /protein_id="PRJNA477356:DP116_27795" /translation="MKDSRIYLIHIRDCLNRIKQYTLEGKEVFFNDIKTQDAVLRNIQ VMCESVQKLPTDWKNAYPETEWSNIAGFRNRLTHEYLSVELDIVWNVIENYLPSLERT IEAMAQEFWNI" BASE COUNT 824 a 602 c 506 g 802 t ORIGIN 1 tatgtcacaa gtgtcgtgtt cagcccggat ggtaagactc tagcttctgg tagttctgac 61 aacacagtca aattgtggaa tatggatttg gatgatttac tcagacgtgg ttgcgcttgg 121 gcgcatgatt atttgaagaa taaccctaat gttaaggatg accgacacct ttgtgatgat 181 gttcttaaaa gaaagtagtg aggaggtaga acctcctact gctggttccc aggcaaagcc 241 cgcgaacgat aaaaattagc aagcagaaat tacgaatgcc actcaagccg ttgagcaatc 301 cataccttaa tgatggactg tcgagtgaca ctcccgccgc gccgcctcac gatcaagtgc 361 ttcaatcatc catatcggga aatcgacgtt gatccgccgc tgctcataac ctgggcggcg 421 aacggtggaa agatcaaggt atgaagttac gtcaagccaa gggcttatcg ctatggcatg 481 atatcatccg gcgtctacaa ggctaataag ccaagtccgt tttaacggac tcaaagactt 541 attcagtccg ttttaacgga cttgaacttt gagccaagaa atttatttct tggcggacga 601 acattatggt gcaagatctg agctagcagg aaagcctgat attacttccc caattcttgc 661 aaaatcgctt caatagttgc ctttagttgg ggcaaattct gttcaatcac gccccaaact 721 cgatttaaat tcaccttcaa ataatcatga atcagtacat ctctaaaacc agctacttgc 781 tgccaaggta catctgggta aactgctctt atttcgggag ataaccgctt cgttgcttcc 841 ccaataattt caaaatttct aattaccgca tcttgaatta ttgtggtttg caaaaatacc 901 tctttgccat cacgagtata agactctatg cgctcgatac attcaaagat gttgctcaag 961 tacagccgat catctctcac aacgccacag cttcccgcaa cactttatct ctaatcaact 1021 catgtagagt ttctggttca gttacatcca ctttacgtcc tagcaattct tccaaagact 1081 gcatcaaagc aatttggtct aacaaggttc gttgtggttc aagttccacc agaaaatcta 1141 catcactgtc tagtcttgct tctcctctcg ccacagaacc aaacacccgg acattatacg 1201 ctccgtacaa agccgcaatt ctcaaaattt cttcccgata agctttaagt atttcatcaa 1261 tgcccatagc ctaacccaaa atgatttcaa caaatctcta cctctagttt ggcattttca 1321 ggcatcttct ggtctttggt gtaaacgcaa ctcaaacaaa cctgccctaa taacaatcac 1381 aacctaccca attcccgcaa atttttcaaa tcgagttcaa aatcttctat ttcataagaa 1441 tacagaggaa ttttgcgttt 
ttgaagcaac tgttgaaact gtattttatc catttctgcc 1501 aacttgctcg catatccaag gggaagtttt ccagatccaa acagcaacag agcaacttct 1561 tgtctaaatt cggcttctgt cattccacgc ttcacgtaaa gtatcatctg atatgacgac 1621 gctcattttt cctctattgc taaaaatttc agtagattaa ataacgcgaa cattacagcc 1681 ctaatattaa cctacaatat tcttcaagtt ccaaaccccg taaactgacg cacatagggt 1741 tcgtgaaatg ccaaaataat atcttgcttg ggtactccca tttcaactaa tcgattggca 1801 attccttctt ctgtaccatc gtgttgaatc catattcaaa taagctgtag tgttatgaga 1861 gattgaaaat gaaattaaaa gaaatcctcc aagaaaaacg agcagaaatt atcagtattg 1921 ctgctaaaca tggtgcttat aatatccgaa tttttggtag cgtagctaga ggagaagaaa 1981 caccaagcag tgatattgat tttttgattg attatgactt gaataaaatt acaccttggt 2041 ttcctggtgg cttagttcaa gagttagaaa ccttgttgaa tcggaaggtt gatgtggtaa 2101 ctacgaattc tttgcattat tttattcaag ataaagtctt aaaagaggct atttctttat 2161 gaaagacagt cggatttatc ttattcacat tcgcgattgt ctcaacagaa ttaagcaata 2221 tacacttgaa gggaaagaag tattttttaa tgacattaaa acccaagatg cagttttaag 2281 aaatattcag gtgatgtgtg agtctgtaca aaaattacca actgattgga aaaatgcata 2341 ccctgaaaca gaatggagta atattgctgg ctttcgtaat agattaaccc acgaatatct 2401 aagtgtagag cttgatattg tttggaatgt cattgaaaac tatttacctt ctctagaaag 2461 aacgattgaa gcaatggcac aggagttttg gaatatttag ttgagaagta ctccgaaact 2521 gaaaaatcac cttaatgaag tgcgataagc cggaggcttg acgctacgcg tatcgcgcag 2581 cgtgcgtctt cgcgcactcg cctgtttgta tctgttcttg cactattgtc atttagctat 2641 gactcaggag gtagaacctc ctactgctag ttcccaggct ataaatccag aagaacccca 2701 ccccggtttt gtcttgcgcc aaaaccgccc ctcc // LOCUS NODE_8160_length_2669_cov_4.7364192669 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2669) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Comparative genomics of Brasilonema spp. strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2669)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso
            Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo
            14884-900, Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2669
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(186..1106)
/locus_tag="DP116_27800" CDS complement(186..1106) /locus_tag="DP116_27800" /EC_number="1.1.1.94" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877313.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NAD(P)H-dependent glycerol-3-phosphate dehydrogenase" /protein_id="PRJNA477356:DP116_27800" /translation="MTHPKSVAILGAGAWGTALATLAKTNGHQVRMWSRQGSQTLAEI VQGADVVLSAVSMKGVRAVTSQLQSLAISPETIFVTATKGLDPQTTCTPSQIWQETFP NHPVVVLSGPNLSKEIQQELPAATVVASSVMAAAEFVQVIFSSDRFRVYTNCDILGVE LGGTLKNVMAIAAGVCDGLHLGTNAKAALLTRGLTEMVRIGVHWGAKLETFYGLSGLG DLLATCNSPLSRNYQVGYQLALGKTLAEILAHIEGTAEGINTTQVLLRRARQQNIPMP ITEEVYRLLEGEVTPRQALLELMLRDMKPE" gene complement(1225..2115) /gene="lipA" /locus_tag="DP116_27805" CDS complement(1225..2115) /gene="lipA" /locus_tag="DP116_27805" /EC_number="2.8.1.8" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006197693.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="lipoyl synthase" /protein_id="PRJNA477356:DP116_27805" /translation="MTVKPDWLRVKAPQWERVGNVKEILRDLTLNTVCEEASCPNIGE CFKAGTATFLIMGPACTRACPYCDIDFEKKPKPLDPTEPARLAEAVRRLKLNHVVITS VNRDDLPDGGASQFIRCVEAVRTISPHTTIEVLIPDLCGHWEALELILQAQPEVLNHN TETVPRLYRRVRPQGNYERTMELLQRSRQIAPWIYTKSGIMVGLGETDEEVRQVMRDL RAVDCDILTIGQYLQPSQKHLKVDDFITPEQFAAWKAFGEELGFLQVVSSPLTRSSYH AEEVRGLMERYPRQKLMTND" gene 2334..2407 /locus_tag="DP116_27810" tRNA 2334..2407 /locus_tag="DP116_27810" /product="tRNA-Pro" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:2368..2370,aa:Pro,seq:cgg) BASE COUNT 698 a 621 c 584 g 766 t ORIGIN 1 gggttcgcca gtcacctgcg ccagggttac cctcccgcag tgctggactc accgcactgg 61 ctcctcactt ggcgactgcg ttcgcccaag ccgcgtgccg caggcatacc cgaagggcga 121 gggagttatt tttcccgctg cacccagctt tccttacccc ctgtttccct gtacagaatg 181 ctttcctact ccggcttcat atctcgcaac atcagttcta aaagtgcttg tcggggtgta 241 acttcacctt cgagtaagcg ataaacttcc tcggtaattg gcataggaat attctgctgc 301 ctagctcgcc gcagcaaaac ttgagtggtg ttgattccct cagcagttcc ttctatatga 361 gcgagaattt cggctagtgt tttaccaaga gccaactggt agccaacttg atagttacga 421 ctcaagggac tgttacaggt tgctaacaga tcgcccaagc cggacaaacc gtaaaatgtt 481 tctaattttg caccccaatg gacaccaata cgaaccattt ctgttaatcc acgggtaagt 541 aaagctgctt tggcattggt tcccaagtgc aagccatcac agacaccggc ggcgatcgcc 601 atcacatttt ttagtgttcc acccagctct actcccaaga tatcacaatt ggtgtaaacg 661 cgaaaacgat cagaagaaaa tatcacttgt acaaattctg ctgcagccat aacactgctt 721 gctacaaccg ttgcagcggg taactcctgt tgtatttctt tagataaatt ggggcctgag 781 agaacaacta ctgggtggtt ggggaaagtt tcttgccaaa tttgtgatgg tgtacaagtc 841 gtttgggggt ctaggccttt tgtcgctgtg acaaaaattg tctctgggga aatggctaaa 901 gactgtaatt gagaagttac agctctcaca cccttcatag aaacagcaga aaggacaaca 961 tcagcacctt gtacaatctc tgccagcgtt tgagaaccct gacgcgacca catgcgcact 1021 tgatgaccat tcgtctttgc aagagttgca agagccgtcc cccaagcacc tgcacccaga 1081 attgctactg attttggatg agtcattagt cattagtcat tagtcattag tcattagtca 1141 ttggtcatta 
gtcattagtc attagtcatt ggtcattggt cattagtcat tagtcattgg 1201 tcattagtca ttagtcattg gtcattagtc attagtcatt agtttttgcc gggggtaacg 1261 ttccatcaat cccctcactt cttctgcatg atatgagctt cttgtcaatg gtgaggaaac 1321 aacttgtaaa aatccgagtt cttcaccaaa agctttccaa gcagcaaatt gctctggcgt 1381 gataaaatca tcgactttca aatgcttttg gctgggttgg agatattgcc caatagtcaa 1441 aatgtcgcaa tctacagctc gtaaatcccg catgacttgg cggacttcct catcagtttc 1501 accaagtccc accataatac cggatttggt gtaaatccaa ggagcaattt gacgggaacg 1561 ttgcaataat tccattgtcc gctcatagtt tccttgggga cgtactcgcc gatataaacg 1621 aggaacggtt tctgtgttgt ggttgagaac ttctggttgt gcttggagaa tcaactccag 1681 cgcctcccaa tgaccacaca aatcaggaat gagtacctca attgttgtgt ggggagagat 1741 agtacgaaca gcttcaacac atctgataaa ctgcgaagcc ccaccatctg gtaagtcatc 1801 tcggttcaca gaggtgatga ccacatgatt gagtttcagg cggcgtactg cttcagccag 1861 tcgtgctggt tccgtggggt ctaggggttt gggttttttt tcaaagtcaa tatcgcagta 1921 gggacacgca cgcgtgcaag ctggtcccat aattaggaat gtggcggttc cagctttgaa 1981 gcactcacca atatttggac aggacgcttc ctcacaaacc gtattgaggg ttaaatcccg 2041 caaaatttct ttaacgttac caacacgctc ccattgaggc gcttttaccc gcaaccagtc 2101 tggtttaaca gtcacaatcc gtttctacca ctgaatttgt acattgtaaa tcctagcaag 2161 cttcatcgac tggtggatga atttgcatac gactttctca tttctcaatg atttgtgata 2221 ttatatatat acttttgcgt gcattaaatt tgcacacagc taaaggctaa aagactagct 2281 atgactggtc ttattcattt tttgaacttt gaattttgaa tttcttcttg tgtcgggatg 2341 tagcgcagct tggtagcgca cttcgttcgg gacgaagggg ccgctggttc gaatccagtc 2401 atcccgattt tttttagatt aaataattgt tgtccagttt taaggaatgg taatggcgca 2461 gacctattga ttcggctcca agattatggt gtgatctgtc catttaatag cttatccaca 2521 catctaaacc attaccggca agcgctttgc taagaaagac taaaatatgg tcaacctaga 2581 agctgaagaa cgcaaacaat taataactct cctcatacca attctatgtg aaggtgcata 2641 gaactgtaga ataaagaagg gaaaagatt // LOCUS NODE_8243_length_2621_cov_6.6597822621 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2621) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2621) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..2621 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(112..819) /locus_tag="DP116_27815" CDS complement(112..819) /locus_tag="DP116_27815" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27815" /translation="MTTGATLNSICDIAWVFTYVAIVWRGFKDRSLGMPMVALSANIS WEAIYSFIYIPPSNALLYASIAWFLFDIPIVWQCLLYGYKDFPPQITKNNFRLIFLAA IAIAFPIVFGVASELNDTKGVYTGFGQNCMMSILFVCMLLRRNDISGQSIYIAIYKWF ATLFAFLGSSFDAPGDVNKILNLQTLLTEIFVDNTYPTTPLIKILYATTFIFDVLYII LLYRKCTENKINPWARY" gene complement(1177..2403) /locus_tag="DP116_27820" CDS complement(1177..2403) /locus_tag="DP116_27820" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017715261.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase" /protein_id="PRJNA477356:DP116_27820" /translation="MFTLPSPTTVNISSRQLPVDTTIVRRKTRPVRVGHVTIGGGHPV VVQSMINEDTLDVEGSVAAIVRLHKIGCEIVRVTVPSLAHAKALSEIKQKLVETYLPV PLAADVHHNGMKIALEVAKYVDKVRINPGLYVFEKPKPSRTEYTQAEFDEIGEKIRET LEPLVVSLRDQDKAMRIGVNHGSLAERMLFTYGDTPEGMVESALEFIRICESLDFHNL VISLKASRVPVMLAAYRLMVQRMDELGIDYPLHLGVTEAGDGEYGRIKSTAGIGTLLA EGIGDTIRASLTEAPEKEIPVCYSILQALGLRKTMVEYVACPSCGRTLFNLEEVLHKV REATKHLTGLDIAVMGCIVNGPGEMADADYGYVGKQPGYISLYRGKEEIKKVPESQGV EELINLIKQDGRWVDP" BASE COUNT 780 a 584 c 538 g 719 t ORIGIN 1 gcaataatcg cgaattatct ttttcccttt atttttgcaa aattgggatg ctcccgtttc 61 gttgtcctcg ctcattggag gcaattcagt tgaagccgga aggcgagtaa attagtacct 121 tgcccaagga ttaatcttgt tctccgtaca tttgcgatac aaaagtatga tgtaaagaac 181 atcaaaaatg aaagtggtgg catataaaat tttgatcagc ggtgttgtgg gataggtgtt 241 gtctacaaaa atttctgtta agagggtttg aagatttaaa attttattaa catcaccagg 301 tgcatcaaaa gatgagccta aaaaggcgaa caatgtagca aaccatttgt aaattgctat 361 gtaaatagat tgtccactga tgtcattgcg ccgaagaagc atacatacaa agagaattga 421 catcatacag ttctgtccaa aaccagtata gactcccttt gtatcattca actcggaagc 481 aactccaaaa acaataggaa aagcaatagc tatagcagct aaaaaaatta acctgaaatt 541 attctttgtt atttgaggcg gaaaatcttt gtatccataa agaaggcatt gccaaacaat 601 tggtatatcg aagagaaacc aagcaatact agcgtatagt aaagcgttac taggtggtat 661 ataaatgaat gaataaatag cttcccacga aatgtttgca ctaagagcca ccattggcat 721 ccccaatgaa cggtctttga agccgcgcca aacaatagcc acatatgtga atacccaagc 781 aatgtcacaa atagagttca aagtcgcgcc tgtagtcata acctatacct cccaatccta 841 aataacatct gctaaaactc tactaggtgt atatgcgcta atagtaactc ctaccaaaaa 901 tgaaccattt tctagcaaag tttcaggtgt ttgtgttcca taaaaaagac ctcatttgat 961 cgtaatttat tccagggtag tcgtgttaga agagttgaag tcgccaatca caactatcaa 1021 gtcctgcttc agttcacaaa gttgaactta gtttgttgaa gttactgagt gagaccgagt 1081 atacttttga agttgaggaa gtattaattt agattcatga ggctacactt cctcttctaa 1141 ccacgactaa ctacatatcc aaattttcag cccggtttaa ggatctaccc 
aacgcccatc 1201 ttgtttaatc agattaatca actcttcgac gccttgagat tcggggactt ttttgatttc 1261 ttctttgcca cgatacaaag agatataacc gggctgctta ccaacataac cgtagtcagc 1321 atctgccatt tctccaggac cgttgacaat gcaacccatg acggcgatat ctagacctgt 1381 aagatgtttg gttgcttccc gcactttgtg cagcacttct tctaggttaa acaaagtacg 1441 accgcaggaa gggcaagcaa catactccac catcgttttc cttaaaccca gtgcttgcaa 1501 aatgctgtag cacacaggaa tttctttttc tggggcttcc gtaagcgacg cccgaattgt 1561 atctccaatc ccttcagcta aaagtgtgcc aatcccagcg gtggatttga tgcgtccgta 1621 ttctccatca ccggcttctg ttacgcccag atgtaggggg taatctatac ctagttcatc 1681 catccgctgt accatcaggc gataagcggc tagcatgact ggcacccgtg acgccttgag 1741 agagatgacg aggttgtgga agtctagaga ttcacaaatg cgaatgaatt ccaatgcaga 1801 ttctaccata ccctctggag tatcgccata ggtaaagagc atccgctcag caagggaacc 1861 atggttgacg ccaatccgca tcgctttgtc ttgatcgcgc agggaaacca caaggggttc 1921 tagagtttca cggattttct cgccaatttc gtcaaattcg gcttgggtgt attcagttct 1981 gctaggtttt ggcttttcaa atacatacaa acctgggtta atccgcactt tatcaacata 2041 ctttgccact tccaaggcaa tcttcatgcc attgtggtgt acatcagctg ctagaggcac 2101 aggtaagtaa gtttcgacca atttctgctt gatttcactc aaggctttag catgagccag 2161 acttgggacg gtgacacgca caatctcaca gccaattttg tgaaggcgaa caatagcagc 2221 taccgaacct tctacatcca aagtgtcttc attaatcatg gactgcacca ccacagggtg 2281 accaccacca atggtgacat gaccaacgcg aacgggacgt gttttacgac gaacaatggt 2341 ggtatcaacg ggaagttgac gagaagaaat gttgacagta gttggagatg gaagagtgaa 2401 catagatttt tgcccttgtg tgtcaaaatt tagctcaagt tcaacacgaa tggcacgctc 2461 gataatagca aatccttgag gtatcaggtt ttagggtgtg gggaattaag aaaaatgttg 2521 cttgtgctaa cgccaggaat tgaatgtctt ttaacactac acccttctcc cccttgcccc 2581 taatccccaa cgccagtcgc ctcaagtcgg gaaacccgcc c // LOCUS NODE_8244_length_2620_cov_4.9742692620 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2620) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2620) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2620 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(34..363) /locus_tag="DP116_27825" CDS complement(34..363) /locus_tag="DP116_27825" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999793.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27825" /translation="MIRHSHEASESWKNLNWKEFRKDLFRLQCRVFKAIREGNKRKAL SLQKLILKSKAARFLAIRQISQLNAGKKTAGIDGKKSLTSEERFELEKLLKASSNNWY HQKLRSN" gene complement(1157..2479) /locus_tag="DP116_27830" CDS complement(1157..2479) /locus_tag="DP116_27830" /EC_number="3.5.2.3" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318129.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="dihydroorotase" /protein_id="PRJNA477356:DP116_27830" /translation="MSSPTSLLIRRARIVLPNGESVVGDVLTSDRTIVEVAPEISATP STTQIDAEGLTLLPGVIDPQVHFREPGLEHKEDLFTASCACAKGGVTSFLEMPNTRPL TTTQQALDDKLQRASQKCLVNYGFFIGATADNLPDLLLAKPSCGIKIFMGSMHGQLLV DQDAALEAIFSKGERLIAVHAEDQARINERRKEFAGIHEVAVHSQIQDNQAALLATQQ ALRLSKKYQRRLHILHMSTAEEADLLRQDKPSWVTAEVTPQHLLLNTSAYEKIGTLAQ MNPPLRSPHDNQVLWQALRDGIIDFIATDHAPHTLEEKAQEYPNTPSGMPGVETSLPL MLTAAMQGQCTVAQVSDWMSRAVAKAYGIPNKGAIAPGYDADLVLVDLDTYRPVLREE LLTKCGWSPFEGWNLTGWPVISIVGGQVVYEKGKVHTEVRGQALTFSH" BASE COUNT 689 a 611 c 537 g 783 t ORIGIN 1 agaccaacgg gtctggtgac aggctcgttg agtttaatta ctacgtaact tttggtgata 61 ccaattatta ctcgatgctt tcaacagttt ttctaattca aaacgttcct cagaggtgag 121 agactttttg ccgtcgatcc ctgctgtctt ctttccagcg tttagctgcg atatttgacg 181 aatcgccaaa aatctagctg ctttggattt caaaataagc ttttgaagtg acagggcttt 241 acgcttgttt ccttctcgaa tagctttgaa caccctacat tgcaggcgaa ataaatcctt 301 ccgaaattcc ttccagttta gatttttcca agattcacta 
gcctcgtgac tgtgcctaat 361 catactctac tccgttagat gtttttttga ataccttgca gcagttacgc cgcatcctac 421 ccgaatcaaa ggagttccgt atctcgtctt acctacctgg gattcgaccg tccccaagac 481 ctttgattcg tttctattcg ttcccttaga tgattatttg ttccgttagg tgtagccatt 541 ttaaccatca gaaacccctg attcttgcag ctattccgga aagatgctgc gggtcttact 601 ctgataaagt cgggttctaa ggatatgtcc cgcttacgtg taggttactt ttctaggctc 661 tgtttcaccg tgggaactcc ctgttagcgc aaagctctgc gggggaagct tcccccgcaa 721 gactttgcac cagtgtccac ccataacagt agtctgattg tgccctgttc ccagcttcac 781 tctgtaagaa tcgagcttac tcggtgtggg caccctctgg agtcacgcat gaaccttgcg 841 ggtggggctt tcacccacat caatctaaga gttcagtgtc tagttaacta gagttcactg 901 gcttagttat atacggactg attaccgaga ttcactaagc aacgaatcgc actcccagac 961 acattgaggt cttcaatcac tactgtgcca cagagcaaag cagaataatt tgactaatgg 1021 ggattgctga atttggtcat gaattgtgtt tgtcaagaga tttttaattt tccccaaccc 1081 ctgtagagac gttgcatgca acgtctctac aaaatcttga ttttctcaat tcatatcaag 1141 atttcagcaa cgccgactaa tgactaaaag ttaaagcttg accccgcacc tctgtatgta 1201 ctttgccttt ttcatacaca acctgaccac ctactatact tatcacgggc catccagtta 1261 aattccagcc ttcaaaagga ctccaaccac acttggtcaa caattcttca cgtaaaacag 1321 ggcgataagt gtccaagtct acaagaacta aatcagcatc ataaccagga gcgatcgccc 1381 ccttatttgg aataccatat gccttagcta cagctcgtga catccagtca ctcacttgcg 1441 caacggtaca ttgtccttgc attgcagcgg ttaacatcaa aggtagagaa gtttcgacac 1501 caggcattcc agagggagtg tttggatact cttgagcttt ctcttctaaa gtatgtggtg 1561 cgtgatctgt ggcgatgaaa tcaataatgc catcgcgtaa agcttgccaa aggacttggt 1621 tatcgtgggg cgatcgcaaa ggtggattca tctgcgccaa ggtgccaatc ttctcatatg 1681 cactggtatt gagcaacaaa tgctggggcg tcacttctgc tgtcacccaa cttggtttat 1741 cctgacgcag taagtctgct tcttctgctg ttgacatatg cagaatatgt agacgacgtt 1801 gatatttttt agaaagtctt aatgcttgct gcgttgctag tagcgctgct tgattatctt 1861 gaatttggga atgaactgct acttcatgaa ttccagcaaa ttccttgcga cgttcgttaa 1921 ttctcgcttg gtcttcagca tgaacagcaa tcaaacgttc gcctttagaa aatatagcct 1981 ccaaggcagc gtcttggtca accagtaact gtccatgcat tgaacccatg aaaattttga 
2041 ttccacaact tggctttgcc aaaagcaaat caggcaagtt gtctgccgtt gccccgataa 2101 aaaagccata attaaccaag cacttttgtg aggctcgttg tagcttatca tcaagagctt 2161 gctgggtagt cgttaaagga cgcgtattcg gcatttctaa aaaagaagtg actccccctt 2221 tggcacaagc acaactagcg gtaaataagt cttccttatg ttccagtcct ggttcacgaa 2281 agtggacttg tggatcaatt actcctggca acaaagttaa cccttctgcg tcaatttgtg 2341 tagttgaggg tgtggcagaa atctctggtg cgacttcgac tattgtgcga tcgctcgtca 2401 gcacatcccc aacaaccgat tcaccattgg gtaaaactat tcgagcgcga cgaatcagca 2461 aactggtagg agatgacatg aatcaactgg gttaaaaaaa caactgagcg tgacagataa 2521 ctatttatat aggttaaact tagttggact cttcgcggat attttgtatt ttttgttaca 2581 aaaaacttta tgatttttaa taaacaaaga aatctggtta // LOCUS NODE_8287_length_2596_cov_4.8181822596 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2596) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2596) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2596 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 54..275 /locus_tag="DP116_27835" CDS 54..275 /locus_tag="DP116_27835" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27835" /translation="MTKELGIKVSACHINRLLKDMGLSTRPKDNTTKKEISSQQNSSI TIGDLPSHASPNLLWQLSLMKTSVANNYG" gene 533..1330 /locus_tag="DP116_27840" CDS 533..1330 /locus_tag="DP116_27840" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009458831.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_27840" /translation="MSEVLTVSKDEIIHQIKLYCQIPSVVEGILTRKIIARAAGEAGI KVESSELQQAADGLRLMNKLTSADATWEWLQKHSLSLDEFEELVYSTVISSKLAQHLF AEKVEPFFVEHQLDYAQVVMYEVILDDFDLAMELFYALAEGEISFPEVAHKYIQDTEA RRSGGYKGILSRTDFKPEISAAVFAATGPQILKPIVTSKGVHLIFVEEIIQPQLNEML REKILSNLFSQWLKQQVEQLKVEIVLNSKSLDVDSLYQQQGIGTSQA" gene complement(1263..1598) /locus_tag="DP116_27845" CDS complement(1263..1598) /locus_tag="DP116_27845" /inference="COORDINATES: protein motif:HMM:PF12167.6" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27845" /translation="MKDDRGRLRLRFSFEGKRYSIALGVENTKANFKLAQAKADELRV DIVFKRFDAKRLEKYKLITTFAPKQAHTLTLTDIWLKYLDYKRALVKPGTYQYLAVDK DCLHLMILS" gene 1663..1737 /locus_tag="DP116_27850" tRNA 1663..1737 /locus_tag="DP116_27850" /product="tRNA-Gly" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:1695..1697,aa:Gly,seq:ccc) BASE COUNT 712 a 539 c 559 g 786 t ORIGIN 1 tattttaact cctattcatt ttaccagttt tacgcttttg aacgcggatt ggtatgacaa 61 aagaattggg aattaaagtt agcgcttgcc acattaaccg cttgcttaag gatatgggac 121 tttccactcg accaaaagac aacacaacta agaaagaaat ctcaagccaa caaaattcca 181 gcattacgat tggtgattta ccatcgcacg catcacctaa tttattgtgg caattgagtt 241 tgatgaaaac cagtgttgca aataattatg gctgaaaaaa gtcgggaagt gcaggtttgg 301 gcatttacaa ttgcagatta ctacgtgtag gaatgccagt caaaatcaaa tttgatgctt 361 atcctttttc agattatgga gtgttgtctt gcactagcaa aggaaggaga aaaggtaaaa 421 ctaaaaaaat tttgctcctt cttttgcttg tttactttta acttgtcccc ttgttctctt 481 gtcagcattt cacaattaaa ctcgcattaa tatattacta aggaatgata ctatgtcaga 541 agtgctaact gtttccaagg acgaaattat tcaccaaatc aagctttact gtcaaattcc 601 ctctgtggtt gaaggtattc ttactcgcaa gataattgcg cgtgcagctg gggaggctgg 661 tattaaagtg gagtcgtcag aacttcagca agcagcagac ggtttgcggt taatgaacaa 721 gctgactagc gctgatgcta cttgggagtg 
gctacaaaaa catagtctgt ccttagacga 781 gtttgaagaa ttagtttatt ccaccgtcat ttcctcaaag ttggctcaac atttgttcgc 841 cgagaaagtt gaaccctttt ttgtggaaca ccagcttgat tatgcccagg ttgtcatgta 901 tgaagtaatc ttggatgatt ttgacttagc gatggaactc ttttatgcac ttgctgaagg 961 tgaaataagt tttcctgaag tcgcccacaa atacattcaa gatactgaag ctcgccgctc 1021 tggaggatat aaaggaatac taagtcgcac agatttcaaa ccagaaatat cagccgctgt 1081 atttgcagca actgggcccc agattcttaa accaattgtt acttctaagg gagtccattt 1141 gatttttgtt gaggaaatta ttcagccaca attaaacgag atgctgcgcg aaaaaattct 1201 ctcaaacttg ttttctcaat ggctaaagca acaagttgag caattgaaag ttgaaattgt 1261 tcttaactca aaatcattag atgtagacag tctttatcaa cagcaaggta ttggtacgtc 1321 ccaggcttaa ctagtgcgcg tttgtagtcg aggtacttta gccagatgtc cgtgagtgtg 1381 agcgtgtgtg cttgttttgg tgcaaaggtc gtgatgagtt tgtacttctc taacctttta 1441 gcatcgaacc gcttaaagac aatatccact ctcaactcat ccgccttcgc ctgtgcaagt 1501 ttaaaattcg cctttgtatt ctcaacacct agcgcaatcg agtacctctt cccctcaaaa 1561 ctaaatctca ggcgcaatct cccccggtcg tctttcacct aactgtcatt ggtcggaata 1621 tttggtagga ttgactatat tcgagagcgc taaccctgac ttgcgggtgt agttcagtgg 1681 tagaacgtca gcttcccaag ctgaatgtcg tgggttcgag tcccatcacc cgcttcaaaa 1741 aaaatcctca aatcgtaagc ttttaaaggg ttacagccgt ttgctcaccc cttgctcaca 1801 tcgagaatag cctttattgg gtgcaactat tgggtcatct aaggtctttt tgggtgtaat 1861 tttggtgtaa agctggtgta aaacttttgt actaatagag attgcgccat gaactcagaa 1921 tgtactactg gcaaagcttc taagggtacc gtccctacat cattggcggc gtgtggaacg 1981 gtaaggatgc accgcaagaa ttggtggatg actccattca accatgactc gccgtcctat 2041 tttgcaagag gggcaatcct acacccgatc actcctattt tgagatgcct tatgaggcag 2101 acgaaatatt ggcggaattg ggctatgctt tggtgaaaac acgccttgat ttaccatgta 2161 gcgatcgcca ttgagaacgt ttgcaagaat tgcgacagct gtttttgtac gtacggacaa 2221 ttcttggcat atggtgagtt tagtaacttc tgaagcagtt gaatgccgat tggctgcaca 2281 ggttgctttt ttcagaattt gaaattgttt ggtggttttt aagtcccaga ggggagaggg 2341 cgagcatgta gcactgttgc cttgggtctc gccttttgta aaactgggca accctctcta 2401 cgagtaaacg cgagtgcgcc ttacagaact ttaggagacg 
cattgtccac gcactttgtc 2461 ccctaaaaaa ttgggcagcc ttattactac gggtactcgt ctttacgggg ttataccatt 2521 tcacgaaatt cgtgatacaa attctttttc cctacttccc ctactgccta tttgtatcaa 2581 ccttaaagtg aaacgg // LOCUS NODE_8562_length_2455_cov_3.9558332455 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2455) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2455) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2455 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..618 /locus_tag="DP116_27855" CDS <1..618 /locus_tag="DP116_27855" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017716254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27855" /translation="LNAHSNSVTALAVTPDGKRLISASVDKTLKVWDLADGKEVFTLT GHSNSVTAVALTPDGKRLISASVDKTLKVWDLADGKQLFTLNTHRPWELAVAVTPDGK RVISGSVDNTLKVWDLADGKQLFTLNAHSNSVKAVAITPDGKRVISASDDKTLKVWDL ADGKVITNFTAESPVLCCAVTPDGMIITAGDDSGQIHFLQLEAGL" gene 615..1811 /locus_tag="DP116_27860" CDS 615..1811 /locus_tag="DP116_27860" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016950829.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="PRJNA477356:DP116_27860" /translation="MNITPRPSTASQKEFQQLIHQKSSNFVGREFAFAAITDFLNHQL CGYFTIVGTPGSGKSAILAKYVMENPSVVYYSAEVEGKNCAEEFLITVCTQLTLRLRS GLKPSGVEALIREMGDTNVSLPDNATEGSWFLSLLLQKVSDLLEPDQRLIIAIDGCDQ IDLNNQSPDSNLFYLPRYLPERVYFLLTRRPFKSEKSGLLIETPAQILDLEAYSKQNR EDVQTYIQQYISVTPSFLKNARWINNHSISEQEFCQQLTQQLTVKSENNFMYLNHILK GISQGFYRDSPSVALAIPFQFEPLPPGLEAYYKNHYQRIKGKGLSPVGNRVLKCLAQL VQPLSVELIAQIIDEDEYEVEKVLENWLEFLHQEIRGEETFYSFYHTSFANWLGKELN LVSYSS" gene 1964..2209 /locus_tag="DP116_27865" CDS 1964..2209 /locus_tag="DP116_27865" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017309847.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF433 domain-containing protein" /protein_id="PRJNA477356:DP116_27865" /translation="MLGLDRITFDARIMAGQACIRGMRIPVSLVVNLVANGKPVEEIL EEYPDLEPEDIRQSLLYAAWLTQERVYSFTNAEKQAS" BASE COUNT 720 a 540 c 520 g 675 t ORIGIN 1 ctcaacgctc atagcaactc ggtaacggca ctcgccgtca ccccagacgg caagcggctg 61 atttctgctt ccgttgacaa aacactcaaa gtttgggatt tggctgatgg taaagaagtt 121 ttcacactca cgggtcatag caactcggta acggcagtcg ccctcacccc agacggtaag 181 cggctgattt ctgcttccgt tgacaaaaca ctcaaagtct gggatttggc tgatggcaaa 241 caacttttca cactcaacac tcatagaccg tgggaattag cagtcgccgt caccccagac 301 ggcaagcggg tgatttctgg ttccgttgac aacactctca aagtctggga tttggcagat 361 ggcaaacaac ttttcacact caacgctcat agcaactcgg taaaggcagt cgccatcacc 421 ccagacggca agcgggtgat ttctgcttcc gatgacaaga cactcaaagt ctgggatttg 481 gctgatggca aggttattac caattttact gctgagagtc cagtattgtg ttgtgctgtt 541 acaccggatg gaatgataat tacggcaggg gatgattcag gacagattca tttcctccag 601 ttggaagcag gtttatgaac atcacacctc gcccttcaac cgcttcccaa aaagaatttc 661 agcaactgat tcaccaaaaa agcagcaatt ttgttggtcg tgaatttgcc tttgctgcta 721 ttaccgactt cctcaaccac caactctgtg gttactttac cattgtgggc acacctggca 781 gtggcaaaag tgccattctc gccaagtatg tgatggaaaa tccttctgtt gtttattaca 841 gtgcggaagt tgaaggaaaa aattgtgcgg aggaatttct cataactgtt tgcactcagt 901 taacccttcg gctacgctca gggttaaagc cgagcggagt cgaggcttta attagggaaa 961 tgggggatac aaatgtgtcg cttcctgata acgcgactga gggaagttgg tttctctcac 1021 ttttacttca gaaagtgagc gatttactag aaccagatca gcgtttgatt attgctattg 1081 atgggtgtga tcaaatcgac ctcaacaacc aatctccaga ctcaaatcta ttttaccttc 1141 ctagatatct tccagagcga gtttattttc tgctcacccg tcgccccttc aagagcgaaa 1201 aatctggttt attaatcgaa acccctgctc aaattttgga tttagaagca tattccaaac 1261 aaaaccgaga agatgtgcag acatacatac aacaatatat ttcagtcaca ccctcttttt 1321 taaagaacgc tcggtggata aacaatcact ctatcagcga acaagaattc tgccaacagc 1381 tcacacaaca gctaacagtc aagagcgaaa acaatttcat gtatctcaac cacatcttaa 1441 aaggaatctc ccaaggcttt tatcgcgata gcccaagcgt ggcgttagcc 
atacctttcc 1501 aatttgaacc actcccccca ggattagagg cgtattacaa aaatcattac cagcgcatca 1561 agggtaaagg tttgtctcca gtgggtaatc gtgtattaaa atgtttagca cagcttgtgc 1621 aacccctttc agtagaatta atcgcccaaa taattgatga agatgaatac gaagtagaaa 1681 aagtgctgga aaattggctg gaatttttac atcaggaaat tagaggagaa gaaacttttt 1741 atagtttcta tcataccagc tttgcgaact ggttaggcaa ggaattaaat ttagtttcgt 1801 acagctcatg aactcgtaga aactctcata tggtggtggt gggcactgcc actaaaaccc 1861 tcataatggt gagcaggatc tgactggcag tgcccaccct acattactgg ttaggtaagg 1921 aattaaattt agtttagaca acgtaatttt ggagagttga tttatgttag gtttagacag 1981 aattaccttc gatgcgcgca tcatggctgg acaagcttgt attcgcggaa tgagaattcc 2041 tgtttcgtta gttgtcaatt tagtagcgaa tgggaaacct gtagaggaaa ttttagaaga 2101 ataccctgac ctagaaccag aggatattcg ccaatctcta ctttatgctg cttggttgac 2161 tcaagagcgg gtttattctt tcacaaatgc tgaaaaacaa gcatcatgaa atttttagca 2221 gatatgggaa tttcgctacg cacagtagca aggattgttg ctaacaattc cacatatgta 2281 gaactttcgg atgcaacgtc tctacatcaa tgggaatttt taaattcata tcaagatttc 2341 agcaacaaca gtttttcagt ccattaaaat ggacttcgcc tataagccta ggacttatag 2401 tctagccctg tcaaggtgac aaattatcag ataataatgc catgcgattg cgcga // LOCUS NODE_8588_length_2445_cov_16.7175732445 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2445) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2445) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2445 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 31..>2445 /locus_tag="DP116_27870" rRNA 31..>2445 /locus_tag="DP116_27870" /product="23S ribosomal RNA" BASE COUNT 733 a 521 c 747 g 444 t ORIGIN 1 agttaacact ccaaacaaaa acaccaaaga ttgtagtcaa gcaagaaaag gcgaatggtg 61 gatacctagg cacacagagg cgaagaagga cgtggtgacc gacgaaaagc tccggggagc 121 tggaagcaag catagatccg gagatgtccg aatggggcaa 
ccccaagtac tgcctgttga 181 atatatagac aggaaagagc caacccagcg aactgaaaca tcttagtagc tggaggaaga 241 gaaatcaaaa gagattccct aagtagtggt gagcgaaagg ggaagagcct aaaccaaaaa 301 gcttgctttt tggggtagtg ggacagcaac aaggaatcca gaggttagac gaagcagcga 361 aaaactgtac cagagaaggt gagagtcctg tagtcaaaaa ctaaaagata ctagctgtat 421 cccgagtagc atggggcacg tgaaatccca tgtgaatcag cgaggaccac ctcgtaaggc 481 taaatactac tgtgtgaccg atagcgaaac agtaccgcga gggaaaggtg aaaagaaccc 541 caatgagggg agtgaaatag aacatgaaac catgagccca caagcagtgg aagtccgatt 601 caacggatga ccgcgtgcct gttgaagaat gagccggcga gttaaaagca ctggtaggtt 661 aaggtgaaag actgtagcca aagcgaaagc gagtctgaat agggcgcgaa tcagtgtttt 721 tagacccgaa cccgggtgat ctaaccatgt ccaggatgaa gcttgggtaa caccaagtgg 781 aggtccgaac cgaccgatgt tgaaaaatcg gcggatgagg tgtggttagg ggtgaaatgc 841 caatcgaacc cggagctagc tggttctccc cgaaatgtgt tgaggcgcag cggttgtgat 901 tatactctgg gggtaaagca ctgtttcggt gcgggcggcg aaagctgtac caaatcgaga 961 caaactcaga ataccagaag aacacacaac cagtgagacg gtgggggata agcttcatcg 1021 tcaagaggga aacagcccag accaccagct aaggtcccca aatcatcgct aagtggcaaa 1081 ggaggtggga gtgcacagac aaccaggagg tttgcctaga agcagccacc cttaaaagag 1141 tgcgtaatag ctcactggtc aagcgctcct gcgccgaaaa tgaacgggac taagcgatgt 1201 accgaagctg tgggattact ttgtaatcgg taggggagcg ttccgtgtta ggaagaagca 1261 ctagcggtaa gcaggtgtgg acgaaacgga agtgagaatg tcggcttgag tagcgaaaac 1321 attggtgaga atccaatgcc ccgaaatccc aagggttcca gagccaggtt cgtccactct 1381 gggttagtcg ggacctaagg cgaggcggaa acgcgtagtc gatggacaca gggtcaacaa 1441 tccctgacta ttcagtggga gcatttgcag tgcgcatgaa agtaagctac accctgactg 1501 gattgggaga cttctacgga ggtcgagtag tgaggatagt gtcaagaaaa gctgtggatg 1561 tgatgaaagc tgagtacccg tacccgaaac cgacacaggt gggagggtag agaataccaa 1621 ggggagcgag ataactctct ctaaggaact cggcaaaata gccccgtaac ttcggaagaa 1681 ggggtgccca cgcaagtggg ttgcagtgaa gaggcccagg cgactgttta ccaaaaacac 1741 aggtctccgc caactcgtaa gaggaagtat gggggctgac gcctgcccag tgccggaagg 1801 ttaaggaagc tggtcagcca ataggtaaag ctggcgactg aagccccggt gaacggcggc 
1861 cgtaactata acggtcctaa ggtagcgaaa ttccttgtcg ggtaagttcc gacccgcacg 1921 aaaggcgtaa cgatctgggc actgtctcgg agagagactc ggcgaaatag gagtgtctgt 1981 gaagatacgg actacctgca cctggacaga aagaccctat gaagctttac tgtagcctgg 2041 aattgggtcc gggcttggct tgcgcagaat aggtgggagg cattgaagca ttccttgtgg 2101 ggagtgtgga gccaacagtg agataccact ctggcgaagc taggattcta acccgtttcc 2161 gtgatccgga aaggagacaa tttcaggtgg gcagtttgac tggggcggtc gcctcctaaa 2221 aggtaacgga ggcgcacaaa ggttccctca gcacggttgg aaatcgtgct ttgagtgtaa 2281 aggcaataag ggagcttgac tgcaagagca acaactcaag cagggtggaa acacggtctt 2341 agtgatccga cggcgcagcg tggaatggcc gtcgctcaac ggataaaagt tactctaggg 2401 ataacaggct gatctccccc aagagtccac atcgacgggg aggtt // LOCUS NODE_8679_length_2402_cov_3.7609712402 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2402) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2402) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2402 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(160..510) /locus_tag="DP116_27875" CDS complement(160..510) /locus_tag="DP116_27875" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137800.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="four helix bundle protein" /protein_id="PRJNA477356:DP116_27875" /translation="MSYRNQFIWQRAVQLAINCYKFTRLFPKSELYGLTSQIRRSSVS VASNIAEGYGRRSKQEYIQFLHIALGSLRELDTQLIIAKEVDLAEIDLFTPIMNEVEE MQSILVATINKLKG" gene complement(569..2276) /locus_tag="DP116_27880" /pseudo CDS complement(569..2276) /locus_tag="DP116_27880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862863.1" /note="frameshifted; internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="ISL3 family transposase" BASE COUNT 661 a 496 c 550 g 695 t ORIGIN 1 ccctacacga taatcattat tggcaaaaaa actcaaatat caaaaatgaa ggataaactg 61 gatcttggca gaatttggtg atcttggctg gggtaaggca gagggcagag ggcagagggc 121 agaaggagga aaaaggaaga aggaggaaga aataaaattt taacctttta gtttgtttat 181 ggtagcaact agtatacttt gcatttcctc aacttcattc ataatgggag tgaaaaggtc 241 aatttctgct aaatctactt ctttggcaat gattaattgc gtgtcaagct ctcttaaaga 301 gcctaatgca atatgtaaaa actgtatgta ttcttgtttt gaacgtctac cataaccttc 361 agctatgttg gatgctacag atacggatga acgtcgtatt tgacttgtta aaccatacaa 421 ttcagattta gggaatagac gcgtaaattt gtagcaatta atagcaagtt gaactgctct 481 ttgccatata aattgatttc tataactcat aagttaagtt tgaaggtaga agaaaagaag 541 atgtttgaag tattaagaat tatacctctc aggtcgccaa caaaaatcgt cgctcaagta 601 agtctatctt ggcgcgacca tacatctgcc gctttaacat tttcagtcga ttaatatgcc 661 cttcaacagg gccattgctg actggcatag ttacacctgc ttttacagca tcgtagtcag 721 acttcttacc aacagcaaag gagtgcagca aagaaaccga gctgttttta gctttgttta 781 accaagcatc gaactctgac ggcaggcgtt gacgcacaag agatgcaaac tgttgagcta 841 gttcaatggc tgactccaaa tcagaatggg ctgtttgtag tcgagcgatc gcttcacgct 901 ccgaccggct ctgttaattc cggtcgtcgc aggactaaag ctgtgacgcg actgggggtg 961 agaggacgat gggaacaaga gcttaccctg ggggaagcgt tttttcttga accttgtcgc 1021 ggttcaaatc cgggcaagct cctagttgcc gagtgaagcg aatgaccgtg gcataactac 1081 cgatatagcc cgcctgtcga atttcttcaa acagttcttg ggtgttgtag ttcccgctat 
1141 tccagcggct gaggaggtaa tcatgataag ggttgagaag actgagacct tggtcgctac 1201 gttggcgacg ttcggtaaaa gttgagctac gcaagtaatt atatacggta tttttagaaa 1261 ctcccagctc ttgagcgatt gcttgtactg ataagccaat agaccgtaag ctccaaactt 1321 gctcatggat gtccctccgt ctggcccgtg ccttggccga ttgaactttt cgcatttaag 1381 atgtgttttg gggaaacctt ggcacaactg ggcagggatt agcctcagca attagggaaa 1441 cgttgtgtgc tgtttccacc tgttttagca ctttagtatc agtactaaag acttgttttt 1501 ccacttcttt aagtgccgaa gcgtgagtac taaagacttg ctcaagcgtt tgagataagt 1561 tctgcaataa gtgaaaacgg tctgcaactt gaatggcttc tggcgcgcct tgacgaatac 1621 cactttcata agtttttgac cggtctcgtg agacgacttt gacaccaggg tgagctttta 1681 accattctgc caaggtttca gccttggcat cttttagtag agcaattggt cggctgcgtt 1741 ccagatcaat tagtgctgtg ccgtaagttt tacatttacg aaagcagaaa tcgtctaccc 1801 caagagtatg tggcgttacg attggtggta gtgggattga gcggactaaa tttaatagtg 1861 tattgcgaga aacttttatc cccaattgct ccgagagtct tacccctgct gcaccaccat 1921 tagaagagcg caatcgcact tagtcgttga gctaaacgta gagttcttct cgcccaaggt 1981 gcggttacat tggtcaacct ttctgtaaaa atgcgccgtt tacacaaact attgatgcaa 2041 aaaaacttcc gcacccgtaa ctgtaaggta atgctgtaat cagcccaggg taagtctgct 2101 aagaagcgtt catagcgact atgaatttta tgagttggtt ggttacaaac tggacaatta 2161 actactctac tgatggcgga aacaatcaac caaatcttcg tctttatctc gtcaagttgg 2221 caattctcaa gtttcagatt ggttgaatct ggtaataggt gagttagcac cgacataggg 2281 caagcttcgg cttattcctc aaatgtttga tggagtgaag tttgaagtat ctcaattata 2341 tctttcttcc ttctgccttc tgccctctgc cctctgcctt cccccaagcg agatcaccaa 2401 aa // LOCUS NODE_8701_length_2392_cov_30.9084302392 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2392) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2392) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2392 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 587..2284 
/gene="ltrA" /locus_tag="DP116_27885" CDS 587..2284 /gene="ltrA" /locus_tag="DP116_27885" /EC_number="2.7.7.49" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006634177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="group II intron reverse transcriptase/maturase" /protein_id="PRJNA477356:DP116_27885" /translation="MSKTLSNQMVEWNKVNWRKLERIVFKLQKRIYQASLRGDSKAVR KLQKTLMTSWSAKMLAVRKVTQDNKGKSTAGVDGVKSLNPKQRIELVTTMNLNQKVKP TRRVWIPKPGKAEKRPLGIPTMHDRATQALVKLALEPEWEAKFEENSYGFRPGRSTQD AMQMIFNCINQVHKYVLDADISKCFDKINHEKLLNKINTFPKLRRTLKAWLKSGVLDN GYYSETFAGTPQGGVISPLLANIALHGLEGIVDEFIRGIKYRKLNSMQCVRYADDFVI LTNNKANLEIVQQKVSDWLAEMGLELNTSKTRITHTDDGFDFLGFNFKHYNVGKYKSG KNGNGKLLGFKTFIKPSRESIQEHYKHLANQVDRHKASPQAVLINVLNPIIKGWVNYY STAVSAEVFKDLDDLLFKKLSRWAKRRHPKKSWNWVQNKYWHTVGGNNWVFSDKVDEK TVSLYDHQNKKIVRHVKVKGTATPFDGNLKYWSIRKGSNPLVPTRVATLLKAQKGKCK HCGLYFREDDITEVDHIIPRSKKGKDTYTNLQLLHRHCHDTKTANDLLTDWDFIEWQ" BASE COUNT 838 a 478 c 542 g 534 t ORIGIN 1 agtgcggtac gttgcttata aactgtaccc cataagggtt acaggggatt aagcggtgaa 61 ggtaacttca cataatttga taacaatgta ggtgagaatc ctacccctca agtgctgagg 121 tgaaagcact tcccctacag tagcgactag tcatcaatct gtaaagcggg gatgaaaggc 181 ggaaaactgg gagcctaaac ctggtagacc gtcgagcaag gtacacatga gtaaggctag 241 ggcttgaaga cacgaatgaa acatcgtaac ccatgaagta gcgaaataag gctttctgcg 301 ctaccaccaa aaggtgaagg attaaggggc tggcgaatta gccatcgacc agcaagcact 361 cccaaagaaa cccggtgtaa agtgtcactc taaatgtacg tagtattgtt atcagaaacg 421 tataaacccg ctgatttgct ctcaggaggt cgaatacctg agaaggtgct aaccggatgc 481 agcagcggga gagattgaga aagaagcgaa cgccctactg taatggtaag gatatgctga 541 cgttctcacg atgatttgaa aggtacagtc tttacgggaa gtataaatgt ctaaaacact 601 gagtaatcag atggtggaat ggaacaaagt caactggcgc aagctagaac ggattgtgtt 661 caagcttcaa aaaagaatat accaagcatc tctacgtgga gactctaaag cggttcgtaa 721 gctccaaaag acgctgatga cttcatggtc tgccaaaatg ctagccgtcc gcaaagtgac 781 ccaagataat aaagggaaat caactgccgg agtagatgga 
gtaaaatcct taaaccccaa 841 gcaaaggatt gaacttgtta caacgatgaa ccttaaccaa aaagttaagc caacccgacg 901 ggtatggata cctaaaccag ggaaagccga aaagcgcccg ttaggaatcc ccacaatgca 961 cgatagagct actcaggcgc ttgtcaaact agcattagag ccagaatggg aagccaaatt 1021 tgaggaaaac agctatggat tcagaccagg aagaagcacc caagacgcca tgcaaatgat 1081 attcaactgc attaaccaag tacataaata cgtgctcgat gcggacatat caaaatgctt 1141 tgacaaaatc aatcacgaga aactactgaa taaaatcaac acattcccta aattacgacg 1201 cacattaaaa gcgtggttaa aatcgggggt acttgataac ggatactaca gtgaaacctt 1261 tgcgggtact ccccaaggcg gagtaatatc gccactacta gccaacattg ctctgcatgg 1321 tttggaaggt atagtggatg aatttatacg cgggattaag taccgtaagt taaactctat 1381 gcaatgcgta aggtacgcag atgattttgt aatacttact aacaataagg caaacctgga 1441 aatcgtacag cagaaagtaa gtgattggtt agctgagatg ggtctagaat taaacaccag 1501 caaaacaagg attacccaca ccgatgacgg ttttgatttt cttggattca actttaaaca 1561 ctataatgtg ggcaaatata aatccggaaa aaatggtaat ggaaaattgc tggggtttaa 1621 aacattcatc aagcctagca gagagtcaat tcaagaacat tacaagcatc tcgctaatca 1681 agtagaccga cataaagcct caccacaagc ggtactaata aatgtactca accccatcat 1741 aaaaggttgg gtaaattatt actctacggc ggtgagcgct gaagtgttca aagatttaga 1801 tgacttactg ttcaagaaac tttctagatg ggcaaaacgt cgccacccca aaaaatcctg 1861 gaactgggta cagaataaat actggcacac agtcggtgga aataactggg tattttcaga 1921 taaagtcgat gaaaaaactg tatccttata tgaccaccag aacaagaaaa ttgttcgaca 1981 tgtgaaagta aaaggtacag ctacaccatt tgacggaaat ctgaaatatt ggagcatacg 2041 aaaagggtca aatcccctag taccaacaag agtagctaca ctactcaagg cacaaaaggg 2101 gaaatgcaag cactgtggct tatactttag agaagatgat ataactgaag ttgatcatat 2161 cattccccgg tcaaagaagg gaaaagatac gtacaccaat ttacagttat tacacagaca 2221 ttgccacgat accaaaaccg ccaatgattt actcactgat tgggacttca tagaatggca 2281 gtgagatgta cccatgacaa gggtcaaacc attgaggagc cgtgtgatgt gaaagtatca 2341 agcacggttt tgaagaccaa cgggtctggt gacaggctcg ttgagtttaa tt // LOCUS NODE_8702_length_2392_cov_4.0945662392 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun 
sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2392) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2392) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES 
Location/Qualifiers source 1..2392 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 160..706 /locus_tag="DP116_27890" /pseudo CDS 160..706 /locus_tag="DP116_27890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015209522.1" /note="frameshifted; incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" gene 1004..1816 /locus_tag="DP116_27895" CDS 1004..1816 /locus_tag="DP116_27895" /inference="COORDINATES: protein motif:HMM:PF03412.13" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27895" /translation="MNLRWEELPFCLLTPEQQIQLRNQAEIGRYQTGNIIWSTDEPGS QFLIISGNVRLREEGNPKSSTLKAGDWFGDLLELSGNFKAVASSKDVEVVRWNAALWQ QASSLELNSFWQQERSRYQPQDPNSPQPVSGYPFILSPNTAAACLTMAAQYLQNPIQL EFVQRQLRGQHPNDVMEAAEKLGLQLRQLQVTWNDLRSLSFPALLLWQPPLHPPLSKG RAVSFAQLRILSRVSARNFRESAIEKLLLQRVNLSAIRSASSQRLIARLMSK" gene 1877..2344 /locus_tag="DP116_27900" CDS 1877..2344 /locus_tag="DP116_27900" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017290436.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27900" /translation="MKQPCLSKGGKEQEWIVVYGMKGNRLIIANPTNLSQTCESVPQK MLEESWDGQLWQVEPIQKQEKFNLSWFVPAIWRYRKLLAEVLLLSFALQLLGLATPII TQFIIDRALVYGNRGTLDVMAIALLIVALFEATLGILRLFIFTHTANRLDLSL" BASE COUNT 680 a 489 c 550 g 673 t ORIGIN 1 accccttacc cctccccgca agcgaggagg ggaaaaaata cctccctctc tctgcaagca 61 agcagggtaa aaaatacctc cccctctctg cttgcggaga gggggttggg gggtgaggtt 121 ttttcatctg tggtggtgta ttttgattgc aaagaggtca tctgggtagg tcagttaaaa 181 caagctggct tattgattat cttatgtgcg ctcttgttgt tttcacaacc cctacccgtg 241 tggggagcac ctgcacaaat agaacgtact cctttaactc cagaattatt gcaagaacga 301 ctgcatacac ccattcttcg tgaaggcaat gttgtcgtag atttacggga gatggtgata 361 tatttacgac cagaaaatgc aatttttcgg gatatgttct atcaaatttt acgaaaggaa 421 ttgcaaaaag caggttcaaa acctttgggt ttagatttaa gtcgttctct gattcaggga 481 gatttttttg gtagtgattt gggtttgaga acgcctttgt atgcaaagcg cgatcgcccc 541 cattttcacc tcaactgaac aagaacaact ggaacgtctc cgcgaagttt gcttgcagtc 601 actcgcctta gctttaccta gttccaaaga ctgtaaatcg cttttagcaa aatcaactgc 661 atcgaccgaa atcagtgttt tccgtggttc gctgacactg ctagagagcc agtccagtaa 721 aagtagttga caaaaaaaca caaatataaa tgtgcaaaag ctaataatta acgagaatgt 781 aattgagggt aaggagcaac gtgaaaaagt tggattgtgt ccttgaaaag cgatcgctaa 841 aagtcttcag tagctttgcc acatttagta cataaaaata gttgtaaagg tagcataaat 901 tacggattga caatcaaaaa aagatgagat ttttgagcaa tggttagaag agaaaattca 961 aaatattaat gtcaagcttc aggtgaaatt ttagtgataa atcatgaact tacgctggga 1021 agagttgcct ttttgcttgc taactcctga gcagcaaatc cagttgagaa atcaagcaga 1081 aattggtcgt tatcaaactg gaaatataat ttggtctaca gatgaacctg gaagccaatt 1141 tttaattatt tcgggaaatg tacgtcttag agaagaggga aaccctaagt catcaacttt 1201 aaaagcagga gattggtttg gggatttatt agaactttct ggaaatttta aagctgtggc 1261 atctagcaaa gatgtggagg tggtacgctg gaatgcagca ctgtggcagc aagcttcttc 1321 tttagaatta aatagctttt ggcagcaaga gcgatcgcgc tatcaacctc aagatcctaa 1381 ttcaccccaa ccagtatcgg gctatccatt tatcttgagt cccaacactg 
cggctgcttg 1441 cttaacaatg gcagctcaat atctacaaaa tcccatccag ttggagtttg tacagcgtca 1501 actccgggga cagcacccta atgatgtgat ggaagctgct gaaaagttgg ggttgcaact 1561 gcgacagcta caagtgactt ggaatgattt gcgctcatta tctttccctg ctttgctgct 1621 ttggcaaccc ccccttcatc ccccccttag caagggcagg gctgtttcat tcgctcagct 1681 aaggattttg tcaagagtat cagccagaaa ttttagggag tcggcaatcg aaaaattact 1741 acttcaacga gtaaatttaa gtgcgatacg ctctgcgtca agccagaggc ttatcgccag 1801 actgatgagc aaataaaatt tttgaactac ctgatttttt tggttgatac gcttgtaaat 1861 tcgggtcttt tagggaatga aacagccctg ccttagcaag ggggggaaag agcaagagtg 1921 gattgttgtg tatggcatga agggtaatcg cttaattatt gccaatccta ccaacctcag 1981 tcaaacttgc gaaagcgttc cccaaaagat gcttgaggaa agttgggatg gacaactttg 2041 gcaagtcgaa ccaatccaaa agcaggagaa attcaacctc agttggtttg tccctgcgat 2101 ttggcgctac cgcaagttgc tagcagaagt cctgttactt tcttttgcgc tgcaattgtt 2161 ggggttagca acgccgatta tcactcagtt tatcatcgat agagcattgg tgtacggaaa 2221 ccgtggcact ttggatgtga tggcgatcgc cctcctgatt gttgctcttt ttgaggcaac 2281 tttgggcatc ttacgcctat tcatctttac tcacactgcc aaccgcctcg acttgagttt 2341 atagtgggtg agatgaaaac cctgtggctt tagcccaggg acgccacatg cc // LOCUS NODE_8825_length_2344_cov_4.8942772344 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2344) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2344) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2344 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..955) /locus_tag="DP116_27905" CDS complement(<1..955) /locus_tag="DP116_27905" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010998215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase" /protein_id="PRJNA477356:DP116_27905" /translation="MIEAEVHLSLHNFLRSQAGFPSWPHHLTMARLVARALRLGRSAL IQVGAACGYQGRYRTSFVASALMWHGPVIIVAPESTQQRLTKVEIPRLQQWLQVNKAI RTGDAWPGGEFQGVFLISPAAWLKAQLSKNDNFPRDIPTIIDGVDDLEDWVRSELTVS LEGHDWEQLMLAQPAQAEVIRSARAQLTHELFKHPANPYECYLISPAEEEILTHLYSV LDISSLPDTWKQFAQQFQVRNENFSPSSSPSLLWATIARRQGLFSLHCVPIELGKILS PIWQRQPVVFIGSAVEPETEAPLFRQRLGLEDELTCLKFSSD" gene 1125..2090 /locus_tag="DP116_27910" CDS 1125..2090 /locus_tag="DP116_27910" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011319880.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase M48" /protein_id="PRJNA477356:DP116_27910" /translation="MPTYTGISSEAFRHPLDRQAEQALRNLPGFDFLARKFVEFFVER PQLIYLMGNTIQVGPRQYSTVYQIFRECVRDLDIYPEPALFVEQNPLANSYALGQEHP YIVINTGTLDLLSEAEIRAVLAHELGHIKCGHTILIQMAMWAMSAASAIGELTFGIGN FVTQGLIYAFYEWRRKAELSADRAAFLVTDDLNSVMTSMMKVSGGSIKYANECSLQEF IRQSEDYQALDADGLNQIYKFLIYNGARGTMLSHPFAVERIHYLRQWANSEEYQQIKK GNYQRATAAGAVNVESQTPKNQEVETLQRQIEELQREIDRMKKSE" BASE COUNT 669 a 517 c 549 g 609 t ORIGIN 1 tatctgagga aaacttgagg caagttaact catcttccaa ccccaagcgc tggcgaaata 61 aaggagcttc agtttctggt tctacagcac taccaataaa caccacaggc tgacgctgcc 121 aaataggcga tagtattttt cctaattcaa tggggacaca gtgcaaagaa aataaacctt 181 ggcgacgggc aatagttgcc cagagtagag atggggaaga tgagggagag aaattttcgt 241 ttctcacctg aaattgctga gcgaattgtt tccaggtatc cggtaggctg gatatatcta 301 aaaccgaata cagatgggtc aaaatctctt cttccgccgg agaaatcaga taacattcgt 361 agggatttgc agggtgctta aacagttcgt gtgtgagttg cgcccgtgca gaacgaataa 421 cttcagcttg ggctggctgg gcaagcatga gctgctccca atcatgtcct tctaagctca 481 cagtgagttc agaacgtacc caatcttcca aatcatccac tccatcaata attgtgggga 541 tgtcacgagg aaaattatca ttcttactca gttgtgcttt taaccaagct gcgggagaga 601 tcaaaaagac tccttggaac tcaccaccag gccaagcgtc acctgttctg attgctttat 661 taacttgcag ccactgttgt aggcggggta tttccacttt 
cgtcaatcgt tgctgtgtgg 721 attcaggggc aacaataatc acaggaccgt gccacattaa tgccgaagcc acaaagctcg 781 tgcgataccg cccctgataa ccgcaagctg cgccgacttg aatcagggcg ctacgtccta 841 ggcgcaaggc acgagccact aaccgtgcca ttgttaaatg atgtggccat gaagggaaac 901 ctgcctgcga tcgcaggaag ttatgtaatg acaaatgaac ttctgcttca atcacacgct 961 tttaatccca tacaaagtaa agtgattttt gagttgctct tagccccatt actattatgt 1021 tgtcagtttc aggagtgaat tgttatgtgt taattgttgg ttgttcattg ctgattgcta 1081 tggtaaccat taaccactaa gtagtatcca aatacccacc aattatgcct acttacacag 1141 gaatctccag cgaagctttc aggcatcctt tagatcgcca agcagaacaa gccttgcgca 1201 atttaccagg atttgatttt cttgcccgta aatttgtgga atttttcgtc gaacgccccc 1261 agttaatcta tttaatgggc aacaccatcc aagtcgggcc acgccaatat tccactgttt 1321 accagatctt tcgggaatgt gtgcgggatt tggatattta tccagaacct gcactgtttg 1381 ttgagcaaaa tcccctagca aacagctacg cgctgggaca agagcatcct tatatagtga 1441 taaatacagg gacattggac ttgctaagcg aagccgaaat tagggcagtg ctagcccatg 1501 agctggggca tattaaatgt ggtcatacta ttctaattca aatggcgatg tgggcaatga 1561 gtgctgcttc tgccataggg gagttgactt tcggtatagg taattttgtc actcaaggtt 1621 tgatttacgc gttttatgaa tggcggcgta aagccgagtt atcggcagat cgtgcggcat 1681 ttttagtgac agatgactta aactctgtca tgacttctat gatgaaagtc tctggaggca 1741 gtatcaaata tgctaacgaa tgcagtttgc aagaatttat ccgtcagtca gaagactacc 1801 aagcactcga tgcagatgga ctgaatcaaa tatacaaatt cttgatttat aacggcgctc 1861 gtggtacaat gcttagccat ccttttgctg tagaacgcat acactacctg cggcagtggg 1921 caaactcaga agaatatcag caaattaaga aaggaaatta tcagcgagca acggctgcgg 1981 gtgcagtcaa tgtagaatca cagactccca aaaatcaaga agtagaaact ttgcaacgac 2041 aaattgaaga attgcaaaga gaaattgatc gcatgaaaaa gtctgagtaa cagtgtattg 2101 atgcgactcg tttgctcgca atgaataaga attcctacct tcaaactgta gcttgctttt 2161 cgcaagtgcg agtaggaagt gaagttaagg taggaaacat atcaacatca gtagaatgcg 2221 gtgagggggt cgcagtcttg gacgccacgt gcttcaagcc gggaaacccg tccaccgcag 2281 tggctccgtt tcccgtcgat tgcgtaagcg caagcgcacg ccaagggcga acgcgcagcg 2341 tctc // LOCUS NODE_8829_length_2343_cov_3.6687062343 bp DNA 
linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2343) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2343) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2343 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..>2343) /locus_tag="DP116_27915" CDS complement(<1..>2343) /locus_tag="DP116_27915" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872394.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_27915" /translation="GEILELKNTINTMVDQLNSFASEVTRVAREVGTEGKLGVQAEVR GVAGTWKDLTDNVNLMAGSLTAQVRNIAEVTTAIANGDLSKKITVDVRGEILELKNTI NIMVDQLSSFASEVTRVAREVGSEGKLGVQADVKGGAGTWKDLTDSVNFMAGSLTAQV RNIAEVTTAVANGDLSKKITVDVKGEILELKNTINTMVDQLNSFAGEVTRVAREVGAE GKLGVQAEVRGVAGTWKDLTDSVNFMAGSLTAQVRNIAEVTTAVATGDLSKKITVDVK GEILELKNTINTMVDQLSSFASEVTRVAREVGTEGKLGVQADVPGVAGTWKDLTDSVN FMAGSLTAQVRNIADVTTAIANGDLSKKITVQVKGEILELKNTINIMVDQLNSFASEV TRVAREVGSEGKLGVQADVRGVAGTWKDLTDSVNFMAGSLTAQVRNIAAVTTAVATGD LSKKITVDVKGEILELKNTVNTMVDQLNSFASEVTRVAREVGTEGKLGVQAEVRGVAG TWKDLTDSVNSMAGSLTAQVRNIAEVTTAVANGDLSKKITVDVKGEILELKNTINTMV DQLNSFAGEVTRVAREVGTEGKLGVQAYVRGVAGTWKDLTDNFNLMAGNLTAQLRNIA EVTKAVANGDLSKKITVDVKGEILELKNTINTMVDQLSSFASEVTRVAREVGTEGKLG GQAQVVGVGGTWKDLTDNVNSMAGNLTAQVRGIAKVVTAVANGDLKRKLTLDAKGEIE TLAETINEMIGTLATFANQVTTVAREVGIEGKLGGQAKVPG" BASE COUNT 542 a 737 c 418 g 646 t ORIGIN 1 cccggcactt tcgcttgtcc gcccaactta ccttcaattc ccacctcacg cgccacggtt 61 gtcacctggt tggcaaacgt cgccagcgtc ccaatcatct cattaattgt ttctgccaaa 121 gtttcaattt ctcctttggc atccagcgtg agtttccgct tcaagtcacc attcgcaacc 181 gccgtcacaa ctttggcaat acctcgcact tgtgcggtta agttacccgc catcgagttg 241 acattatcag ttaaatcttt ccaggtaccc cctacgccga caacctgtgc ttgtccaccc 301 agcttacctt cagttcccac ctcacgtgca acccgcgtca cttcagaagc gaaagacgaa 361 agttgatcca ccatcgtgtt gatagtgttc ttgagttcta agatttcgcc tttgacatca 421 acagtaattt tcttcgacaa gtcaccattt gccaccgcct tcgtcacttc ggcaatattc 481 cgcaattgtg ccgtcaagtt acctgccatt aagttgaagt tgtcggtcaa atctttccaa 541 gttccggcaa cacctctgac ataagcttgc acgcccagct taccttcagt tcccacctca 601 cgcgcaaccc gcgtcacttc acctgcgaag gagttgagtt gatcaaccat cgtgttgatg 661 gtgttcttga gttccagaat ttcgccttta acatcaacag taattttctt cgataagtca 721 ccattcgcca ccgccgttgt gacttcagca atgttccgca cctgcgctgt caaagaaccc 781 gccatcgaat tcacactgtc ggttaagtct ttccacgttc ccgcaacgcc tcgaacttcg 841 gcttgcacac ccagttttcc ttcagttccc 
acttcacgcg caacccgcgt cacttcagaa 901 gcaaaggaat tgagttgatc caccatcgta ttcacggtgt ttttcaactc cagaatttca 961 cctttgacat caacagtaat tttcttcgac aagtcaccag tcgcaaccgc cgtcgtcaca 1021 gcagcaatgt tccgtacctg cgccgtcaag gaacctgcca tgaagttcac gctgtcggtc 1081 aaatctttcc aagttccagc aacaccgcgc acatctgctt gtacacccag ctttccttca 1141 gaacccactt ccctagcaac ccgcgtcacc tcagaagcga aagagttgag ttgatccacc 1201 ataatattga tggtgttttt caactccaaa atttcccctt tcacctgtac agtaattttc 1261 ttggacaagt caccgtttgc gatcgctgtt gtcacatcgg caatattccg cacctgcgct 1321 gttaaggaac ccgccatgaa gtttacactg tcggtcaagt ctttccacgt cccagcaaca 1381 cctggcacat ctgcttgtac acccagtttg ccttcagttc ccacttcacg cgcgactcgc 1441 gtcacctcag aagcaaacga actgagttga tccaccatcg tgttgatggt atttttcaac 1501 tctagaattt cgcctttgac atcaacagtg attttcttcg ataagtcacc agttgcaacc 1561 gccgtcgtca cttcggcaat gtttcgcacc tgcgctgtca aagaacccgc catgaagttc 1621 acactgtctg tcaaatcctt ccacgtacca gcgacacctc gtacttctgc ttgtacgccc 1681 agctttcctt ctgcacccac ctcacgcgca acccgcgtca cttcacccgc gaaagagttg 1741 agttgatcaa ccatcgtatt gatagtattt ttcaactcca gaatttcacc tttgacatca 1801 acagtaattt tcttcgataa gtcaccattc gccaccgctg ttgtcacttc cgcgatgttc 1861 cgtacctgcg ccgtcaaaga acccgccatg aagttcacac tgtcggtcaa atctttccac 1921 gtaccagcac cacctttaac atctgcttgt acgccgagtt ttccttcaga accgacttca 1981 cgcgcaactc gcgtcacctc agaagcgaag gaactgagtt ggtccaccat gatattgatg 2041 gtgtttttga gttctagaat ttctcctctg acatcaacag tgattttttt ggagagatca 2101 ccgttggcga tcgccgtcgt cacttccgca atattccgta cctgcgccgt caaagaaccc 2161 gccatcaagt tcacattatc ggtcaagtcc ttccacgtcc cagcaacacc tcgcacttct 2221 gcttgtacac ccagcttacc ttcggttccc acctcacgcg caacccgcgt cacctcagaa 2281 gcaaaagaat tgagttgatc taccatcgta ttgatagtat ttttcaactc caaaatttca 2341 cct // LOCUS NODE_8858_length_2330_cov_3.5296702330 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2330) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2330) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2330 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..817 /locus_tag="DP116_27920" CDS <1..817 /locus_tag="DP116_27920" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017292239.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27920" /translation="FLHPTIIPPFNTAIVNGFNYLFSEKQKLGSWDSYLRMREIIIQV NQQYKNILSSDLGAFAGLLFEIGSNKLIINNKQVIDETQRSKVEKLMAKRHTEVSFER EEEDLHTEMQYHLLKIGQSLGYAVIAAANDRSKSFAENKFSFFCLPCLPAMEIDKDTL NTINLIDVLWLEKSTNQIISGFEVEKSTSIYSGILRLTDLAISFPNNLKTLFLVVPNI REKEVIMQLKRPCIQQQNISIKYILFSDLRQHCEAICKFGEDYTIMSKISKVA" gene complement(883..1092) /locus_tag="DP116_27925" /pseudo CDS complement(883..1092) /locus_tag="DP116_27925" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017292251.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(1083..1808) /locus_tag="DP116_27930" CDS complement(1083..1808) /locus_tag="DP116_27930" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019496733.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27930" /translation="MVTAVGEPSNDVFFSYCIEGICYKAIEESFETSSLVSEDTEELE QNQKKDLLNCIYQVIPQAEQRPFYLFAIDTTPYKRPYARTLIERGYIYQPNTIKGNKP INIGHSYSIVSLLPEKDSKQTATWSIPLSGKRVPISSNGVSVGSEQINNIMSCPQVPW SGKLSVLAADSTYSQRSFLVEQAQHKNLVLIARTRSHRVFYQSPSIQETPKKRGCPKK YGERFSLADSSTWHEPVSQIGGS" assembly_gap 2030..2039 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 730 a 435 c 463 g 692 t 10 others ORIGIN 1 tttcttacac ccaacgatta tccctccctt caacactgct atagtcaacg gttttaacta 61 tctctttagc gagaagcaaa aattaggttc ttgggatagt tacttgcgga tgcgagaaat 121 tattatccaa gtaaatcagc aatataaaaa tattctttcc agtgacctgg gtgctttcgc 181 tgggcttcta tttgagattg gctcaaataa gctaattatt aataataaac aagttattga 241 tgaaacgcag agaagcaaag tagaaaaact aatggcgaaa cgtcatacag aagtttcgtt 301 tgaaagagag gaagaagatt tacatactga gatgcaatat catctcctga aaataggtca 361 atcgttagga tatgctgtga tagcagcagc aaacgaccgc tcaaaatctt ttgcagaaaa 421 taagttctct ttcttttgtc ttccttgtct tccggcaatg gaaatagata aagatacttt 481 gaacacaatc aatttaattg atgttctatg gttagaaaaa agtactaatc aaattattag 541 tggttttgag gttgagaaga gtacttctat ttactcagga attttaagac tgacagactt 601 ggcaatttct tttcctaata atttaaagac actattttta gtagttccaa atatacgaga 661 gaaagaagta attatgcaac tcaaaagacc ctgtattcaa cagcaaaaca tctcaattaa 721 atacattcta ttttctgact tacgtcagca ttgtgaggca atttgtaaat ttggtgaaga 781 ctatactata atgtcgaaga tatcgaaggt agcttaatac aagctgattt tttaagttct 841 ttactcctat tcgattagag attaagtaat aaattcaaaa atttacctga cattagaaag 901 ttgccgttga acgaaccaga taattatggg tggtatcagt atccactgaa gcaatggaag 961 caaggcagta ccgaagaatg gaagcatcag catgatatta gcatattccc atctgttaag 1021 cactccagta gctagagctt caaaaataat cgtaattata acaccaacta aaataaagct 1081 acttaactgc caccaatttg gctgaccggt tcatgccaag tggaagaatc tgcaagacta 1141 aaacgttcac cgtacttttt tgggcaacca cgtttttttg gggtttcttg tatagaagga 1201 gactgataga agacacgatg gctacgggtt ctggcaatga gcactaaatt tttgtgctgg 1261 
gcttgttcta ctaaaaatga acgttgacta taagtactat ccgcagctaa aacagataat 1321 ttcccagacc acggaacttg agggcaggac ataatattgt taatttgctc acttcccaca 1381 ctgacaccat ttgaagatat tggtacccgt tttccagaca aaggaatcga ccaagttgca 1441 gtctgctttg aatctttctc tggtaacagt gaaacgatag agtaagaatg acctatgttg 1501 atcggtttgt tgccttttat tgtgttaggt tgataaatat accctcgctc aattagagtt 1561 ctagcataag gacgtttgta aggcgttgta tcaattgcaa ataagtaaaa gggtcgttgt 1621 tcagcttgag gaattacctg atagatgcaa ttaagtaaat cttttttctg attttgctct 1681 agctcctctg tatcttctga aactagacta ctggtttcaa atgattcttc aattgcttta 1741 taacaaatgc cctcaataca gtaactaaaa aagacatcat tggatggttc acccactgct 1801 gttactatat tgcacccaac tgagaaccgc tatagtcagc tagcggtcca gaagttccac 1861 aactgcctca aaaagtccac agtacttttt tatttcctca tcaaaacgag caatggtttc 1921 atctaagcta tcaaactgac acaacagttc agccagaaca aatctttgat gcggtcgcac 1981 tcttccttcg gctgaggtaa ctttgactta ttgctgggcg aatcatttgn nnnnnnnnna 2041 ccactgtaac tggttgtaga gaaacgctaa ggaaaaatcg tatagaccat tatgcttacg 2101 aagaggcagc gtagactgtg ggagttcaat aacttctggg agactaggag gttcgggaat 2161 agctgcggga cagtgatgcc ggaacagttg aatacggtag cggggcgctc ggatttcctt 2221 tactagctgc aatggatatg tgttcgcaag cgtctcgcta attaggttcg ctccgaacgg 2281 gagcagccag tcgagctttg tccggtcagg ctccttgtca acgcgaagac // LOCUS NODE_8917_length_2303_cov_4.9991102303 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2303) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2303) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2303 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 606..800 /locus_tag="DP116_27935" CDS 606..800 /locus_tag="DP116_27935" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008055515.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27935" /translation="MPLVSIPKSYLVSEDEESIILDLPQSVLASLQRDYGKIQKAKGI LQHQKEAMFAHLDAVREEWE" gene 797..1183 /locus_tag="DP116_27940" CDS 797..1183 /locus_tag="DP116_27940" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002789757.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_27940" /translation="MNYLYDTNIFIYYLAGDDTVSELFSETFLNKNYVVISPIVRIEL LCFSGLSDDEAEVIEDLLSQFDSILISRKIENQTIALKRKHKIKLPDAVIAATALCQQ AVLVTRNFHDFQDIAELKLENPFGDE" gene complement(1280..2134) /locus_tag="DP116_27945" CDS complement(1280..2134) /locus_tag="DP116_27945" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019491489.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="sulfurtransferase" /protein_id="PRJNA477356:DP116_27945" /translation="MSPLNYAHPEVIVETEWVADHLHDPKVRLVEVDMDATAYDSGHI PGAIFWNGFTTILQPDFRFNFDKKALEELLGRSGIANDTTVVIYGNHNAIAPIIFWFL KIFGHDDVRVMNGGRKKWVVEERPLLTETPSITTTTYTQQDGNYSIRALQEQVQASIG KSDRVLLDVRTPEEYRGEWFSTKPPEGTERAGHIPGAVHIPYESALNEDETFKSAEEL YALYSNKGVTPDKEVITYCAIGGRSSHTWFVLKYLLGYQNVRNYDGSWNEWGKQPDTP VEVQSLVI" BASE COUNT 656 a 482 c 498 g 667 t ORIGIN 1 cctcaccctg ccctgtcggg catccctctc cttataaagg agagggaaag atttttacgc 61 agtaaaaagc gagggtgagg ttgtctcgtt ccttgtctct ggcaaggaat gcagctgctt 121 aggctctgcc tccttttgtt tgctactaga ggcagagcct ctagaagttc gttcccaggc 181 tccagcctgg gaacgagagt gatagatatt cagtgagcta atcgcatact tggactctta 241 gcctcagaat ttatttcttg gcggacgaga atgatgtcgg aggatttgag cgataagcca 301 ccagacttcg gcgtgagcgc tcagtgcttc gacaggctca gcaaccatcg aaggcttgag 361 gctacaaagt atcgcacaga gtaccagttg agcgatcgcg caaaataccc gcttttaggt 421 ggcgatagct tcgctgctgc ctaagcgcaa agcgcacgct acgcgaacgg cagatcgctc 481 tttaattcaa agccaagatt ctaaagataa cgttcaattt aaaaaaacat ttttaaagtt 541 tgatagggag agcttttttg taataatggg tctacaacag taaagtaaac acaaaagtga 601 tagttatgcc cctggtaagt attcctaaga gttatctcgt gtctgaagat gaggaatcga 661 tcatattgga tttaccacag tcagttctag cctcattgca acgggattat ggaaagattc 721 agaaggctaa aggaattctt cagcatcaga aagaggcaat gtttgctcat ctagatgctg 781 tgcgtgaaga atgggaatga actatctcta tgatacaaat atttttattt actatttggc 841 gggagatgat acagtttctg aattattttc agaaactttt cttaataaaa attacgttgt 901 aatctctccg attgtacgga tcgagttact ttgtttttct ggtttgtctg atgacgaggc 961 agaggttatt gaagatttat taagtcagtt tgattcaatt ctgatttcta ggaaaattga 1021 gaatcaaaca attgctttaa agcggaaaca caaaatcaag ttaccagatg cggtgattgc 1081 agcaacagct ttatgtcagc aagcagtttt ggtaactcgc aattttcacg atttccagga 1141 tattgcagaa ttaaagttag agaatccttt tggtgatgaa taaaaagttt ggggagatgt 1201 taatctttaa gagaaaacgc atatcgatgt attaacaata caccgacttg cgttacagtg 1261 tggatatcca aaaagcagat taaatcacta gtgattgtac ctccacaggc gtatcaggtt 1321 
gcttccccca ctcattccaa gaaccgtcgt aattgcgtac attctgatag cccagcaaat 1381 acttcaacac aaaccaagta tgtgaggaac gtcccccaat agcgcagtac gttatcactt 1441 ctttgtcagg agtaacgcct ttgttgctgt ataaagcgta tagttcttcg gctgatttaa 1501 atgtctcatc ttcattcagg gctgattcat atggaatgtg aacagcacct ggaatgtgcc 1561 ctgcacgttc tgttccttct ggtggcttcg tgctgaacca ttcgccgcga tactcttcgg 1621 gtgtgcgtac atccagcaaa acacgatctg acttgccaat actcgcttgc acttgctctt 1681 gcagcgcacg tatgctataa ttaccatcct gttgtgtgta tgttgttgtc gtaatactgg 1741 gtgtttcagt caacagaggg cgttcttcta caacccattt tttgcgacca ccattcatga 1801 ctcgcacatc atcatgccca aatattttta aaaaccaaaa gataataggg gcgatcgcat 1861 tatgattacc ataaataacg actgtcgtat catttgcgat acctgaacgt cctagcagtt 1921 cttccaacgc ttttttatca aaattgaaac ggaaatcggg ttgcaggatg gtggtaaatc 1981 cgttccaaaa gatcgcgcca ggaatgtgac cagagtcata agctgttgca tccatatcca 2041 cttccacaag acgcactttg gggtcatgca aatggtcagc aacccactca gtctctacaa 2101 taacttcagg atgagcgtaa tttaacggtg acatcaggcg ttactccttg ctgtgtaaat 2161 gttgatgacg ctagagtaaa attgattccc tgactacgcc tagtcctcga acgagtacac 2221 cgagtaactt tttctgtaac ttatagatca atacagttca gttaagcctc aaagtcaggt 2281 aaagtaagca ttattcgcaa gca // LOCUS NODE_9019_length_2259_cov_2.0626132259 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2259) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2259) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2259 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(47..121) /locus_tag="DP116_27950" tRNA complement(47..121) /locus_tag="DP116_27950" /product="tRNA-Thr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(87..89),aa:Thr,seq:ggt) gene complement(217..299) /locus_tag="DP116_27955" tRNA complement(217..299) /locus_tag="DP116_27955" /product="tRNA-Tyr" 
/inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(263..265),aa:Tyr,seq:gta) gene complement(303..374) /locus_tag="DP116_27960" tRNA complement(303..374) /locus_tag="DP116_27960" /product="tRNA-Thr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(340..342),aa:Thr,seq:tgt) gene 617..1141 /locus_tag="DP116_27965" CDS 617..1141 /locus_tag="DP116_27965" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_27965" /translation="MLRIAEEWKPESPEEESQRAYFFLHMVGAQCMEHLEHVLEESPR PLGVISTEKVLHAVKFICCVSSHLAVLEQADGAPQIWMKEWLLHVHKQIEEMIPEHSM FELNSFFANLDIDEICRYATEQICLILTVRRHEFQDILWDMIEEDKEFRNEILVTSFK ESTDALREHAALFP" gene complement(1174..1851) /locus_tag="DP116_27970" /pseudo CDS complement(1174..1851) /locus_tag="DP116_27970" /inference="COORDINATES: protein motif:HMM:PF00588.17" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="23S rRNA (guanosine(2251)-2'-O)-methyltransferase RlmB" assembly_gap 1259..1268 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1839..2259) /locus_tag="DP116_27975" /pseudo CDS complement(1839..2259) /locus_tag="DP116_27975" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="too many ambiguous residues; incomplete; missing start; Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/pseudo /codon_start=2 /transl_table=11 /product="hypothetical protein" assembly_gap 1856..1865 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 559 a 581 c 527 g 572 t 20 others ORIGIN 1 gatgtattca gggaccaaag tgtctataaa acgcgcttgg taaaaatgga gcccatgacg 61 agaatcgaac tcgtgacctc ccccttacca agggggtgct ctaccgctaa gctacatggg 121 catgcggtac cgactcatta tggctcaatt gagcctatta gtcgcacctc ctaaggttgc 181 ctttagaaat caaggtctat ttagggggaa caagtttggg cagaactgga ttcgaaccag 241 tgtaggcttg cgccaacgag tttacagccc gtctccttta gccactcgga catctgccca 301 taagccggtg atgggattcg aacccgcaac cgacggttta caaaaccgtt gctctaccat 361 tgagctacac cggcataacg ccctgtgggg ggcgcgaacg ggaattctac tagtttcgct 421 taaaccctgt caatggcgca tgatgcggct tcaggtcaag cgcttgcttc tggaataatt 481 ttttcttgcg gcgaaatctc aaatggcacg cacacagcct gtcgtgaaaa ttttgatcga 541 cttagataac tttgcgtatc aataagattt atgttgcaga gacctacgca tcacgccaca 601 caattaagat ggcagcatgc tgcgcatcgc tgaagaatgg aaacccgagt cgcccgaaga 661 agagtctcag cgcgcctact tctttttgca catggttggt gctcaatgca tggagcatct 721 ggaacatgtt ctggaagaat cgccgcgacc acttggcgtg atatcaactg aaaaagtttt 781 gcacgcagtc aagtttattt gctgcgtatc gagtcaccta gccgttctcg agcaggcaga 841 cggtgcccct cagatttgga tgaaagagtg gctgcttcat gtgcacaagc aaatcgaaga 901 gatgattcca gagcattcga tgttcgaatt gaactcgttt tttgcgaatt tggatataga 961 cgaaatttgt cgttacgcga ctgaacaaat ttgtctaatt ctcacggtgc gtcgacacga 1021 atttcaagac attctctggg acatgatcga agaggacaaa gaatttcgca acgagattct 1081 agtcacatcc tttaaggaat cgacagacgc gcttcgcgag cacgcggcgc tttttcctta 1141 gccgcgcttt gacctacgac tgcaccaatc cctttacctg tagctgtctg acaacttcgt 1201 agaaaacaat gccggcagcg acagaagcat taagagactc ggtttttcca agcattggnn 1261 nnnnnnnnag agcatgtcgc aattctcttt gaccaatcgg ctcattcccg taccttcgct 1321 gccaactact agtgcaagcg gtcgcaccag gtctgtcttt gtataaagat caccctggtt 1381 agaatctaac cctgcaatcc agaatcctga ttctttcagc tcttcaagtg cctgcacaag 1441 attgtgaatg cgcactatag gtagtgacgc gagagcgccg gcgcttgtct ttgcaacgat 1501 accggtaaga 
cctgcagcgc gcctggacgg tagaagcaaa cctttcaaac cagctgattc 1561 agccactctt atgatggcac caagattgtg tggatcttct ataccatcaa caattgcgac 1621 aacacttcca tctaatgaat caggtgatgc ctttttcaat tcagtcaaga agtcgctcaa 1681 ttcccagaat tcagcagcac tgatctgcgc cgcgacgccc tggtgaagtt ggctcggacc 1741 aactaacgcg tctaatcttc gtcgatcgca aacaacgact ggaatgcgtt gggactttgc 1801 aagttgcttg atacgttcaa ctttgcgatc gggatgatct aaagtgacca aatagnnnnn 1861 nnnnngccat gaacggattc gccagcagcg gattcgccag cagcggattc gctagcagcg 1921 gattcgccat catcggattc gccagcagct gattcgccag cagcggattc gccattatcg 1981 gctacgtcat catcatcgtc gtccagatca aaatcgctgc cgtcgttttc gctatcctct 2041 aaaaacgcga gcactgcgtt tttaccgaaa atcatttcgg tgccatcttc cgtgcgaatc 2101 tcgtcatcat ttatatgtgt gtttttcgat tcgcgtccgc ttcctgcgcc actgcgagct 2161 ccatcctgac gtgttctagc ttgaggggca cggtttcgag aatcaaagcc gccgtcgaag 2221 cgtcttccac gttcaccctc aaaacgattt tgtccgcga // LOCUS NODE_9040_length_2253_cov_5.1169242253 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2253) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2253) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2253 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 455..1174 /locus_tag="DP116_27980" CDS 455..1174 /locus_tag="DP116_27980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009453726.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_27980" /translation="MSEISIVLIEDHDLTRMGLKAALQSSSTLRVIGEAPNGTKGLKL LETAKPDVAVVDLGLPDIDGIELTRRFREFQKQTGDSATKILVLTMDHSEDAVLAAFA AGADSYYMKDTSIDKLTEAIQATHTGNSWIDPAIANVVLQQMRQGIPETQQSDQPKTV KIEALPSEYEQVLETYPLTQRELEILELIVAGCSNGQIAERLYITVGTVKTHVRNILN KLCADDRTQAAVRALRSGLVA" gene 1475..2000 /locus_tag="DP116_27985" /pseudo CDS 1475..2000 /locus_tag="DP116_27985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006198162.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="transposase" BASE COUNT 712 a 441 c 466 g 634 t ORIGIN 1 gaagtatact tatcattttt ctacatgaaa agttttgttt gtcaattaag tttaattatg 61 atttgataaa aaaaatcatt tgttgttatt tattcgtttt agactggtaa caaataacta 121 actctagctt acttgtcagg aaagacacaa actgtagtta aatctcataa gtttcatttg 181 tctagtcaat cggttttgca tcctcataag gagtatcagt cgaaatttgt acaattatgc 241 gactaacttt gaaaattttt gagaaattac ccatatatcc caagctagat atttgtctgt 301 ataaaataac aagaagataa aaaactcctt atttctatcg gtggatgcaa gtttgataaa 361 atatcatcta tctttggttt tggaaaataa aaaaaacaat ctgcaaattg tcagagtttg 421 acagatgcaa ttccctagag tagcgagtgt aactatgagt gaaattagta ttgttttaat 481 tgaggatcat gacctgacta gaatggggct aaaggctgca ttacagtcga gcagcacact 541 tagagtgatt ggtgaagcac caaacggaac gaaaggacta aaacttttgg aaacagctaa 601 gccagatgta gctgttgtgg atcttggttt gcccgatatt gatgggattg aactcacccg 661 aagatttaga gagttccaaa aacagacggg tgattcagca acgaaaatcc ttgttttaac 721 catggatcat tcagaggatg cagtacttgc tgcttttgca gccggagctg actcctatta 781 tatgaaggat acaagcattg ataagttgac ggaagcaatt caagcaactc atacaggaaa 841 ctcctggatt gatccggcga tcgccaacgt agtattgcag caaatgcggc aaggtattcc 901 agaaactcag caatctgatc agcctaaaac tgtcaaaatc gaagcgctgc catcagagta 961 cgagcaagtt ttggaaactt atccactcac acaacgagaa ctagagattc tagagttgat 1021 tgtagcagga tgtagtaacg gacagattgc tgagagacta tacatcacag 
ttggtaccgt 1081 taaaactcac gttcgcaaca tcctaaataa actttgtgcc gatgatcgta cccaagctgc 1141 tgttcgagct ttacgttccg gcttagtagc gtaaaccata tgcgtatttg gattgagtct 1201 ggtcagttat cacattagcc caatctaaat cacacactca gcaccctact ctcaagagga 1261 aatgcactgt agagttttgc caatcccagt cacaatctga gcacagattg aatgattaac 1321 aacaggtcca acactctaca ggtctaccga aattattcgc cgcagtgccc ctagcttcta 1381 gctatggggg tgtagcggca ggggaattta ttccccgtcc taaaaaccat cctgacgatt 1441 attagaattg ctgtaggccg aacgcaagct taaattgctg caacagcgcg tttccagaaa 1501 acgtattggc tcgaataatt ggcacaaagc acaaaagaaa gttgctttat tgcatgagta 1561 cgttgccaac agtcgtaaag actttcatag aaagctgtct catcaaatct gtaacgatgc 1621 gggaatggta tttgttgaag atttaaacct tgtcgcgtta tctcgtggaa tgttaagtaa 1681 acattgccta gatgccggat ttggtcagtt cttcaacatt cttgaacaaa cttgtttcaa 1741 acgcgatgtc tattttcaga aagtagatgc acgaaaaaca agtcagattt gcccaaactg 1801 cggaaccgag acaggtaaaa aagaattgtc agagcgtact catgtttgtt cacattgcgg 1861 ctatacaacc gatagagacg ttgcagccgc tcaaatagtc gcaatacgcg gacttgcagc 1921 cgtagggcat acggtcaaga tgcgcggctt cgggtaaatt cattggaatc cccgtgacgc 1981 aagaatcccc ttgcctttag gcatggggag tgtcaatttg cacttcctct aacgtgcgtg 2041 ctaggtttca tatcagtcat ccgaacagtt tcttgagtgg tgaaacaatc caactcttga 2101 agttttgcaa aaattctttg aaattaaaaa gagctaaagc tctacagtcc acaattaggt 2161 tgtgctgctc tttaagaaat aagaaataga tattgtgtct taactcacat cttgcacctg 2221 gaaatatgga tgtcaaaacc aaaactacgt gca // LOCUS NODE_9095_length_2228_cov_10.5531522228 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2228) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2228) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2228 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(2..2227) /gene="psaB" /locus_tag="DP116_27990" CDS 
complement(2..2227) /gene="psaB" /locus_tag="DP116_27990" /EC_number="1.97.1.12" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017654078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="photosystem I core protein PsaB" /protein_id="PRJNA477356:DP116_27990" /translation="MATKFPKFSQDLAQDPTTRRIWYAIATGNDFESHDGMTEENLYQ KIFATHFGHVAIIFLWASSLLFHVAWQGNFEQWIKDPLHIRPIAHAIWDPHFGKPAIE AFTQGGASYPVNITYSGIYHWWYTIGMRTNNELYIGSVFLLVLSSVFLFAGWLHLQPK YRPSLAWFKSAEPRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLTTLPH PEGLTPFFTGNWAVYAQNPDTANHVFGTSQGTGTAILTFLGGFHPQTESLWLTDMAHH HLAIAVIFIIAGHMYRTNFGIGHSIKEMLNAKNFFGSKTEGQFNLPHQGLYDTYNNSL HFQLSIHLAALGTALSLVAQHMYAMPPYAFIAKDYTTQAALYTHHQYIAVFLMLGAFA HAAIFWVRDYDPEQNKGNVLDRVLKHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAF GTPEKQILIEPVFAQFIQASHGKVLYGLNVLLSNPDSIAATAWPNYANVWLPGWLDAI NNGTNSLFLTIGPGDFLVHHAFALAIHTTVLVLVKGALDARGSKLMPDKKDFGYAFPC DGPGRGGTCDISAWDAFYLATFWALNTVGWVTFYWHWKHLGIWQGNVAQFNESSTYLM GWFRDYLWANSAQLINGYNPYGVNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELI ETLVWAHERTPLANLIRWKDKPVALSIVQARLVGLTHFAVGYVFTYAAFLIASTAGKF G" BASE COUNT 596 a 538 c 579 g 515 t ORIGIN 1 atcaaccaaa cttaccagca gtagaggcaa tcaggaatgc tgcgtaggta aagacgtagc 61 ccacagcgaa gtgagttaag cccaccaaac gagcttgaac gatcgacaga gcaacgggct 121 tgtctttcca gcgaatcaag ttcgccaaag gagtacgctc gtgtgcccaa accaaagttt 181 caatcaactc ttgccagtaa cctctccaag agatcaggaa catgaagcct gtcgcccaaa 241 ctaggtgtcc gaacaggaac atccaagccc acacagacag gttgttcacg ccgtaggggt 301 tataaccgtt gattaattga gcggagttcg cccacaggta atcacggaac cagcccatca 361 ggtatgtaga agactcgttg aactgtgcga cgtttccttg ccaaatacct agatgcttcc 421 agtgccaata gaacgtcacc caacctacgg tgttcaatgc ccagaacgtc gctaggtaga 481 acgcgtccca agcggagatg tcgcaagtac caccacgacc tggaccgtcg caggggaagg 541 cgtaaccgaa gtccttttta tcgggcatca gtttggaacc acgcgcatcc aaagcacctt 601 ttaccagtac cagaactgtt gtgtggatcg ccaacgcaaa agcatggtga accaagaagt 661 cgccaggacc gatggttaag aacagggagt tggtgccatt 
gttgatggca tccagccagc 721 ctggtaacca aacgttagcg tagttaggcc aagcagtggc tgcaatgcta tctggattag 781 ataacagaac gttcaagccg tagagcactt ttccgtgaga agcctgaatg aattgtgcaa 841 acactggctc aatcaggatt tgcttctctg gcgtgccaaa tgctaccact acgtcattgt 901 ggacgtacag acccaaggtg tggaagccca agaacaacga gacccaactc aggtgcgaga 961 tgattgcttc tttatgcttg agaacacggt ctagtacgtt gcctttgttt tgctcggggt 1021 cgtagtcacg tacccagaat attgccgcgt gagcaaaagc accgagcatc aagaacacag 1081 ctatgtattg gtgatgcgta tacaaagctg cttgggttgt gtagtcctta gcgatgaacg 1141 cgtaaggagg catcgcgtac atatgctgcg ctaccaagga aagagcggtt cctaacgctg 1201 ccaggtgaat agacaactgg aagtgcaagg agttgttata tgtgtcgtac agtccttggt 1261 gaggcaggtt gaactgacct tcggttttgc tgccaaagaa attcttggcg ttgagcattt 1321 ctttgatgct gtgaccaatg ccgaagttcg tccggtacat atgaccggca atgatgaaga 1381 tcaccgcgat cgccaggtgg tggtgagcca tatcggtcag ccacagtgat tcagtctggg 1441 gatggaaacc acccaagaat gttagaatcg ctgtccctgt accttgagat gtaccaaaca 1501 catggtttgc tgtgtccggg ttctgagcgt aaacagccca gtttcctgta aagaatggtg 1561 tcaaaccttc tgggtgcggc agggtggtta ggaagttgtc ccaacctacg tgctgtccac 1621 gagattcggg aatggcaacg tgaattaagt gacctgccca agccaaagaa ctcacaccga 1681 acaaacctgc taagtggtgg ttcaaacgtg gttctgcact cttaaaccaa gccaagctcg 1741 gacggtactt gggctgtaag tgcaaccaac cagcgaacaa gaagacagac gatagtacta 1801 acaggaacac tgaacctata tacagctcat tgttcgtccg cataccaatg gtgtaccacc 1861 agtggtagat gccggagtag gtgatgttga ctggatagct cgcgccgcct tgggtaaacg 1921 cttctattgc tggtttcccg aagtgggggt cccagattgc gtgagcgatt ggacggatgt 1981 gaagaggatc ttttatccac tgttcaaagt taccttgcca ggccacatgg aacagcaggc 2041 tcgatgccca taggaagatg attgccacat gaccgaagtg agtcgcgaaa atcttttggt 2101 aaagattttc ttctgtcatg ccatcgtggc tttcaaaatc attccccgtt gcgatcgcat 2161 accagatccg acgcgtcgtc ggatcctgtg ctagatcctg gctaaatttt ggaaattttg 2221 ttgccata // LOCUS NODE_9247_length_2175_cov_10.9910382175 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2175) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2175) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..2175 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(89..1630) /locus_tag="DP116_27995" CDS complement(89..1630) /locus_tag="DP116_27995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007306708.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RNA-dependent DNA polymerase" /protein_id="PRJNA477356:DP116_27995" /translation="MIGRIVKYSESWKNLNWKKFRKNLFRLQCRVFKAIQAGDKRKAR FLQKLILKSRAARYLAIRQVTQLNAGKKTAGIDGKKSLNYKERFDLEKMLEAFSNNWE HQGLRSIPIPKKDGSTRMLKIPTIADRAWQCLAKYAIEPAHEALFHAKSYGFRPGRSA HDAQRILFNSLNSRCNGINKRVIELDIEKCFDRISHTTIMDNLIAPQAIKKCIFRCLK AGVNPEFPEQGTPQGGVISPLLANIALNGIESITSYKNGKVIVEPSIRYADDMIIILR PQDNADKVLEEIDKFLASRGMKISEKKTKVTAATDGFDFLGWHCKVLSNGKFRTIPSE ENFQKFRAKVKAIINCSNYGAKEKAQKLAPIVRGWRNYHRFCNMNASKHSLWFINHRT WKVFRKETKMNGESTNKLIEKAFPSVPYSENTHVNVKGNKSPYDGDINYWSERDSKLY HGETSKALKRQNHTCSHCGLGNIDNERWNLHHIDGNHDNWKAKNLTAVHESCHDYIHM GKRAT" BASE COUNT 455 a 487 c 462 g 771 t ORIGIN 1 atgttagagt cgatggaggc tattatgccc cgcacctctc tgttagatcc ggacgtgcgt 61 atttctacgc atccggctcc cgatattctt aggttgccct tttgcccatg tgtatataat 121 catgacaaga ctcgtgtact gctgttaagt ttttggcttt ccagttgtca tggtttccgt 181 cgatgtggtg tagattccat cgttcattat cgatgttccc taatccgcag tgactgcatg 241 tatggttttg ccttttgagg gctttagagg tttccccgtg gtataactta ctatcacgtt 301 cgctccagta gtttatatct ccgtcatagg gtgatttatt acctttgacg tttacgtggg 361 tgttttcgga gtatggaact gatgggaatg ctttctctat caacttattt gttgattcgc 421 cattcatttt tgtctctttc ctgaatacct tccatgttct gtggttgatg aaccatagcg 481 agtgcttaga tgcattcatg ttacagaacc tgtggtagtt tctccatcct ctaactatag 541 gagctagctt ttgggctttt tccttagcac cataattcga gcagttgatg atagctttta 601 ctttcgcacg gaatttttgg aagttttcct ctgagggaat 
ggttctaaat tttccgttgc 661 ttagtacttt gcagtgccag ccgagaaagt cgaatccatc tgtcgcagcg gttactttgg 721 tcttcttctc gcttattttc atccctctac tggctagaaa cttgtcaatt tcctcaagta 781 ctttatccgc attgtcttgt ggtctgagta ttattatcat gtcatcggca taccggattg 841 atggttctac aatgaccttt ccatttttat aactggtaat actctctatt ccgtttaatg 901 cgatgttagc taatagcgga cttatcactc ctccctgcgg tgttccctgt tcgggaaatt 961 ctggatttac tcctgcttta aggcagcgaa atatacattt ctttatagcc tgtggggcga 1021 tgaggttatc cattatggtg gtatgactta tcctgtcaaa acacttttca atgtcgagtt 1081 ctatcactcg cttattgatc ccattacatc ttgaattaag gctgttgaag aggattcttt 1141 gcgcatcgtg tgctgaacgt cctggtctaa aaccgtagct cttggcgtgg aatagtgctt 1201 cgtgtgctgg ttctattgcg tattttgcta ggcattgcca cgctctgtcc gcgatagtgg 1261 ggattttgag cattctggtg cttccgtcct tcttggggat gggaatacta cgtaatcctt 1321 ggtgttccca attattactg aatgcttcca gcattttttc gaggtcaaat cgttccttgt 1381 aattaaggga tttcttccca tcgattcctg ctgttttctt accagcattt agctgtgtta 1441 cttgccggat tgctaagtat ctagctgcgc gagatttcag aataagtttt tggaggaacc 1501 tagctttacg cttatctcct gcttgaatgg ctttaaacac tcgacattgt aggcggaata 1561 ggttcttccg aaatttcttc cagttgaggt tcttccaaga ttcgctatat tttactatgc 1621 gtccaatcat gctcttactt tattaagtgt tctctgaata ccttgcagca attacgccgc 1681 atcctacccg aattaagaga gttccgtatc tcgtcatacc taccttgggt tcgacctccc 1741 aaagactctt aacccgtttt tattcgttcc ctcagacgat gatttattcc gttaggtgta 1801 gccaatttga ctatcagaag ccccggattc ttaccaatac ctcacagata gcttggcagg 1861 ctttaactct gatgaatgtc ggagtgtttg gttgttatgc tccgcataaa cactaagtca 1921 cttttctagg ctcggtttca ccgtgggaat tccctgttaa cgccagtgtc agcccataac 1981 gaccgtctga ttgcgtcctg ttcccagctt cactctgtag aaaccgagtc tactcggtgt 2041 gggcaggtaa ggagtcacac atgaatcttg tggataggac tttcacctat atccttctga 2101 gagttcaacc attgttggtt gactagctta tgtgcggact gattaccgtg attcagctag 2161 taacgaatcg cacct // LOCUS NODE_9261_length_2170_cov_5.4510642170 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2170) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2170) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..2170 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 172..1551 /locus_tag="DP116_28000" CDS 172..1551 /locus_tag="DP116_28000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006196116.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase family protein" /protein_id="PRJNA477356:DP116_28000" /translation="MISQEISIADIIRKQRAYFNTGQTKDINFRLEQLKTLKQAVTEQ KDAITNALKADLNKPEFESYATEIGVIQEISHAIKHIKIWTKPKKAPVSLQFFPASAR IYPEPLGVVLIIGPWNYPFQLMISPLVGAIAAGNCAILKPSELTPNTSRLLNEMIKQY FEPAYITVVEGGVQTSQQLLAEKFDHIFFTGGTAVGKIVMEAAAKNLTPVTLELGGKS PCIVDTDINIEHTARRITWGKFLNCGQTCIAPDYLLVDKSIKENLLNELHKCLQEFYG DNPAKSSDYPRIVNQKHFDRLAHFLKEGKVRIGGETNPSELYIAPTVLEDVSLTGSVM QEEIFGPILPVIEYTDITEAIDLINSKSKPLALYLFSNNKNLQQQVLQQTSSGGVCIN DTVIQVAVSSLPFGGVGDSGIGKYHGKSSFDTFSHYKSVLHNPFWLDLKWRYAPYKDK LSTLKRLIG" gene 1632..>2170 /locus_tag="DP116_28005" CDS 1632..>2170 /locus_tag="DP116_28005" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28005" /translation="MSDNYIAQQKKMIQQVKDSISQGQSQLSIVEMRSDYENDDAICL ASLTFPSKDIAQTISKEIIAPLQSLDSHHFYYSSECLHITIKSIRTVHNPPLFKNEDV VKVHELFTQIVPNFKSFTFSLEGLILFPTSVSLVGYCDETLRKLVQALDLGLREIGVP DIDTPLPKGEGILGSLTRR" BASE COUNT 719 a 427 c 409 g 615 t ORIGIN 1 ctgcgggagg gtttccgaca gccaggcgac tggcgaaccc ggagggtaat acgtatgaat 61 acaacgcgcg tatcaataag aatttcatat tagtctaaga tagaaaaaag ctttcttctg 121 actttcaatt actgaaaaat tgattcaact tagctaacga gaacaaaaac aatgatttct 181 caagaaatat caattgccga tatcatccgc aaacaacgtg cttactttaa tactggtcag 241 acaaaagaca ttaattttcg acttgaacaa ctaaaaactt taaaacaagc agtcactgag 301 cagaaagacg caattaccaa cgcactaaaa gcagacttaa ataaaccaga atttgagagt 361 tatgcaacag aaattggagt catccaagaa attagccatg ctatcaagca tatcaaaatt 421 tggacgaaac ctaagaaagc acctgtttcc ctgcaattct ttcctgcatc ggcacgcatt 481 tatccagaac cgctaggagt ggttttaatt attggtcctt ggaattatcc atttcaatta 541 atgatttcac cattggtagg tgctattgca gcaggaaact gtgcaattct taaaccctca 601 gaactgacac caaatacgtc tcgtcttctc aatgagatga tcaaacaata tttcgagcca 661 gcttatatta cagtggtcga aggaggtgtg caaacaagtc aacaattact agcagaaaaa 721 tttgaccata tctttttcac tggtgggacg gctgtaggaa aaatagtcat ggaagcagcc 781 gctaaaaacc tgacacccgt caccttagaa ttgggtggaa agagtccttg tatcgtagat 841 actgacatta acatcgaaca caccgcaaga cgcattactt ggggcaagtt tctcaactgc 901 ggacaaactt gtatcgcacc agactatctt ttggtggata aaagcatcaa agaaaattta 961 ttaaatgaac ttcataaatg cttgcaagaa ttttatggag acaatcctgc aaagagttct 1021 gattatccaa ggattgtcaa tcaaaaacat tttgacagat tggctcattt cctgaaagag 1081 ggtaaagttc ggattggcgg agaaaccaat ccttcagaac tttatattgc cccaacagtg 1141 cttgaggatg tctccttaac cggctcggta atgcaggaag aaatttttgg tccaattctt 1201 ccagtgatag aatacacgga cataacagaa gcaattgact taattaactc taaatcgaaa 1261 cccttggctt tatatttatt ttctaacaac aaaaatctgc aacaacaagt cttgcaacaa 1321 acctcgtccg gtggagtttg tattaacgac acagtcatac aagttgctgt ttcatcttta 1381 ccatttggcg gtgttggtga tagtggaatt 
ggcaagtacc acgggaaatc cagttttgac 1441 accttttccc attacaaaag tgtcttgcac aatcccttct ggttggattt aaaatggcga 1501 tacgctccct acaaagacaa gttatctaca ctgaagcgac ttattggtta gctattttag 1561 ctgttaaagt aagattgtct tcttaacgtc cagctatggg cgtttctttg taagagaaca 1621 agtacttcat aatgagtgac aactacatcg cacagcagaa aaaaatgatc caacaggtga 1681 aagatagtat ctctcaaggg caaagtcaat taagcattgt tgagatgcga agtgactatg 1741 aaaacgatga cgcaatttgt ctagcatcat taacttttcc gtcaaaagac attgcccaaa 1801 caatatcaaa agaaatcatc gctcccttac aaagcttaga ttctcaccat ttttattatt 1861 cgagtgaatg cctgcatata actattaaaa gtatacgaac tgtccacaat cctccgcttt 1921 ttaaaaacga agatgtggta aaagtccatg aattatttac tcaaatagtt ccaaatttta 1981 aatcattcac ttttagttta gagggtttaa ttttatttcc aacgagtgtt tctcttgtag 2041 ggtattgtga tgagacactg agaaagttag ttcaagcgtt agatttgggc ttaagagaaa 2101 taggcgtacc agatattgac actcccttgc ctaaaggcga agggattctt ggttcgttga 2161 ctcgccgtta // LOCUS NODE_9274_length_2168_cov_5.0000002168 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2168) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2168) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2168 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 918..1988 /locus_tag="DP116_28010" CDS 918..1988 /locus_tag="DP116_28010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862210.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="Photosystem Q(B) protein 1" /protein_id="PRJNA477356:DP116_28010" /translation="MNTTVRRRKEFDFSRLWHLFCAWITSTENRLYIGWFGVLMIPTL LVAVTCFILAYLIAPGVDMEGIREPVMGSLKDGNNLITAAVVPTSAAIGLHLYPIWEA ASLDEWLYNGGPYQLIVLHFLIGIWCYLGRLWELSYRLGSRPWIAVAFSAPAMAATAV LLIYPIGQGSFSDGLPLGIAGTFHFMMAFQANHNILMHPFHMLGVAGVFGGALLSSLH GSLVTSTLLRRTNENEPPEAGYKFGQAKVTYSFLAGHYGFLGRLLIPSFSSQNHRAFH LLLAALPTVGIWFAALGVCVMAFNLNGFNFNHSLLDSRGAVITTEADLLNRATLGLQA MHAPNTHHFPIILSGSEPIPVS" BASE COUNT 525 a 466 c 454 g 723 t ORIGIN 1 ggcatgtggc gtccctgggc taaagccaca gggttttcat ctcacccact ataattctaa 61 tttttcgttc ttccatctct gttaacttct gattgtgcag aacttgtgca ttttggattc 121 attgctgtta ctggttatat ttgagaccaa ctaagcgcag cgtttcttct aagttcttgt 181 ctatgtactc tactatgtta gtcatgtttt tctggtaaaa aagctcattt tatcttcttt 241 tcccacaaaa tttttagatt tcggagttgt ctattgagtt acagcccaag ctacaattta 301 cctcaaatgt ggaactagat ttagcttcag tttaaaaact catttcgtag ctgaggcagg 361 caagagtttt cccagtttgc ttggtggact attaggaagt ggcgtccaag actgcgatcc 421 ccgaactgct gcgcgtctag tttcatatgc gcgtctagtt tcataaatat taagcgcatt 481 ttcattgaga attggtatca gtgcggtttg ttgccaaaca ttgcactact acaacagtca 541 cagcgccggg cgatcgcttc agtcaaaccc ttgcccattc tatcatttaa ccgattttag 601 gcttaactga actgtcttgt tctataatta aaatactcgg acaaattctg aaatttagtg 661 agccagcgtg gtcttgggga gccactgcgt tgggcggctc tgccgacttg tagcatgtgg 721 cgtggtttcc cccatgagcg actggcgaac cccgaagggg taatctgtaa cctccgcaac 781 agaaaacgcg ttcagactgc taaatttaac atttcaaccc taaaaaaacg ttctgaaaaa 841 tacaattgtg agttctggaa tttcagaatc tttttgtcag aatcccgagt attctcttca 901 aataggacgg agatacaatg aataccactg ttcggcgtcg aaaagaattc gatttctcca 961 gactgtggca tcttttttgc gcgtggataa ccagtactga aaatcgtctc tacatcggct 1021 ggtttggtgt cctcatgatt cccactctgc ttgttgccgt cacttgcttc atcctggctt 1081 atcttatcgc tcctggtgta gatatggaag gcattagaga gccagtcatg ggttctctga 1141 aggatggcaa caatttaatc acagctgctg tggtaccaac ttctgccgct attggtttgc 1201 atttatatcc aatttgggaa gctgcttctc tcgatgagtg gctctacaac ggcggtcctt 
1261 atcagttaat tgtactgcat tttctcattg ggatttggtg ttacctcgga cgactttggg 1321 aattaagcta tcgcctcggt tcgcgtcctt ggattgctgt tgctttctct gcacctgcta 1381 tggcagcaac agcagtttta ctgatttatc ctattggtca aggtagtttt tctgatggac 1441 ttcctttggg tattgcaggc actttccatt ttatgatggc attccaagca aaccacaata 1501 tcctgatgca tcccttccat atgttggggg ttgcaggagt ctttggcggt gcacttttga 1561 gttctttgca cggttcattg gtcacctcaa cacttcttag gcgaactaac gaaaatgaac 1621 ctccagaagc agggtataaa tttggtcaag cgaaagttac ctacagcttt ttggctggtc 1681 actatggttt tctgggtcgt ttgctgatac cctcttttag tagtcaaaat catcgtgcat 1741 tccatctgct tttagctgct ttaccaacag tcggtatatg gtttgctgcg ttgggtgtat 1801 gcgtgatggc gtttaaccta aatggattca acttcaatca ttccctctta gatagtcgag 1861 gcgcagtgat cacaacagaa gcagatttgc tcaatcgagc gactcttgga ctgcaagcaa 1921 tgcatgctcc taacacccat catttcccaa taattctatc tggtagtgaa cctattccag 1981 tcagttaaag agttctcctc cagggcacag ggcattggtt ttgtaccata tgtcctatgc 2041 tctgtttttt gtcatgagta cattacaagg ttgctaaatt tgataagtgc aattaatata 2101 gtggttttta aaaaattata actatattta aaattgcaat agtaattttt tattatgaaa 2161 cattttca // LOCUS NODE_9535_length_2074_cov_4.9316492074 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2074) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2074) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2074 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1491) /locus_tag="DP116_28015" CDS complement(<1..1491) /locus_tag="DP116_28015" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28015" /translation="MSPKVYAPNVHLFAFVLQKAEEQEKDWLWKSCDEIIDKILHKNF NLTQLLDLEKEPDNPTVSLIKADERTEVTPIEGAVTFDNSSNNSSKISIDGFAYPLRL YDSYGLWLNLRRPEKENGQKTDDVDIQLLRQLNPENCLLLNGNNNFIGETLIITAWLS SEQDPQNQKALKNLADECLKAFFPQGYTIPPFARSDTLFGSPIFEYGLFSQLTHYRHI LVWFFSHPEAEEKFVFYQEQLLDLFFFRAKVVKAFQQSRQVYYDTNQEYSKIEEEIKT LEKFGETQQLTAEELEQLSDKLKTLPKSALDYTNLLRYLEAYQNTIAINTRNYAERLQ QIRGIIKDEDISFLERFSLENCAYFQEQIQAELGYFHHGSSLLDKAIASIRGRVAIDR AKSDRIAQDKKEIGDRNLQITILAVGSAIGSAGIMASSYALVTQEDPLLPPFSTSIPH RFSLSICYSLLFGVVLGMLIWGGDKIWQSFDTPLPKGEGILHSSSEL" gene complement(1488..1832) /locus_tag="DP116_28020" CDS complement(1488..1832) /locus_tag="DP116_28020" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28020" /translation="MEVNQPNITIEAFLQVLEQQGFVFDEKSREDLLTIKQTLAELEN QPLSAAVDAITAWCREHPDVADAVRFAAREITIKKRNPANQEGTLINQFPDYQKIIDE RQKNPQPEPPKK" gene complement(1844..>2074) /locus_tag="DP116_28025" CDS complement(1844..>2074) /locus_tag="DP116_28025" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28025" /translation="LESGRAAPTAFPGRTWKRDLKQDLKRDPCSQSLTGNTHWEALPP LESGRAAPTAFPGRTWKRGYEVTRFERDLKSY" BASE COUNT 573 a 419 c 419 g 663 t ORIGIN 1 aagttctgac gatgaatgta gaatcccctc gcctttaggc aggggagtgt caaaagactg 61 ccaaatttta tcccctcccc aaatcaacat tcctaaaaca actccgaaaa ggagactgta 121 acaaatggat aagctgaagc gatggggtat ggatgtggag aaaggaggta acagaggatc 181 ttcctgtgtg acgagagcat aactagatgc cattattcca gcagaaccga tcgcactacc 241 tacggcgaga atcgtgattt ggaggttgcg atcgcctatt tctttcttat cctgtgcaat 301 gcgatcgctc ttcgccctat ctatcgccac acgcccccgg atagaagcta tcgctttatc 361 cagcaagctc gaaccatgat gaaaatatcc caactctgct tgaatttgtt cctgaaaata 421 agcacagttt tccagactaa atctttctaa aaagctgata tcttcatctt taatgatacc 481 acgaatttgt tgcaatctct cagcatagtt tcgagtattt atcgcaattg tattttggta 541 tgcttccaga taccgcaata aattggtata gtctaaagca ctttttggaa gtgtttttaa 601 tttatcgctt aattgctcta attcttctgc tgttaattgc tgagtctctc caaacttttc 661 taatgtttta atttcctctt ctattttgct atattcttga tttgtgtcat aataaacctg 721 acggctttgt tgaaaagctt tcaccacttt ggcgcgaaag aaaaataaat ctaataattg 781 ctcttgataa aaaacaaatt tctcctcagc ttctggatgg ctgaaaaacc aaactaaaat 841 atgacggtaa tgtgttaatt gactaaaaag tccgtattca aaaatggggc taccaaagag 901 agtgtcggaa cgagcaaatg gaggtatggt atagccttga ggaaagaaag ctttgagaca 961 ttcatcagca agatttttca aagctttttg attttgaggg tcttgttctg atgacaacca 1021 agcagtaatg atgagagttt caccgataaa gttgttatta ccatttaaga gcaaacaatt 1081 ctctggattg agttgacgca gaagctgaat atcgacatca tcagtttttt gaccattttc 1141 tttttctgga cggcgcagat ttaaccacag tccgtaacta tcataaagtc ttagcggata 1201 ggcaaatccg tcaatagata ttttggaaga attattggaa gaattatcga aagtcaccgc 1261 gccttcaatc ggggtaactt ctgttcgttc atcggctttg atcaaagaga cagtcgggtt 1321 atctggctcc ttttctaaat ccagcaattg cgtcaagttg aaatttttat gcaggatttt 1381 gtcgatgatt tcatcacaac ttttccaaag ccaatctttc tcctgttctt ctgctttttg 1441 cagaacaaag gcaaataaat gaacattggg agcatagact ttaggactca ttttttggga 1501 
ggttctggct gtggattttt ttgtcgctca tcaatgattt tctgataatc tgggaactga 1561 ttgattaaag ttccttcttg gtttgccgga ttgcgctttt tgatagtgat ttctctagca 1621 gcaaatcgga cagcatctgc aacatctgga tgttctctac accaagcggt aattgcatca 1681 acggctgctg aaagtggttg attttctaat tctgccaggg tttgcttgat tgtcaacaaa 1741 tcttcgcggc ttttttcatc aaagacaaat ccttgctgtt ctaagacttg caaaaaagcc 1801 tcaattgtga tgtttggttg attaacttcc ataggtcata ccttcagtag gatttcaaat 1861 ctcgttcaaa tctcgtaacc tcgtaacctc gtttccaggt tctacctgga aatgcagttg 1921 gggcggctct gccgctttct aagggaggca gagcctccca gtgagtattc ccagtcagag 1981 actgggaaca aggatctcgt ttcaaatctt gtttcaaatc tcgtttccag gttctacctg 2041 gaaatgcagt tggggcggct ctgccgcttt ctaa // LOCUS NODE_9610_length_2048_cov_3.3000502048 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2048) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2048) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2048 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(1..1676) /gene="groL" /locus_tag="DP116_28030" /pseudo CDS complement(1..1676) /gene="groL" /locus_tag="DP116_28030" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016865329.1" /note="frameshifted; incomplete; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="chaperonin GroEL" assembly_gap 577..586 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" BASE COUNT 460 a 528 c 448 g 602 t 10 others ORIGIN 1 gcatgggtgg catgggtggc cacccatgcc gcccatgccg cccatgccgc ccatatcagg 61 ggcaccgcca gcgggttttt tctctggttt ttcgaccacg actgcttcag tggttaaaac 121 cattccagcg atggaagcgg cgttttgcaa agctgaacgc actacttttg cgggatcaat 181 aataccagca gcaatcaagt cctcaaattc cccagtagcg gcgttgtaac cgatgttgaa 241 atcagtgtcc agcactctgg cgacaataac agacccttca acaccagcat tatctgcaat 301 ttgacgcaag ggggcttcga gcgatcgcgc cacgatatca gccccaatct gttcttctgc 361 ctcaagagtg tttttaacct cttgtacttt cttcgccaaa tgaatcagtg ttgttccacc 421 accaggaacg ataccttctt ccacagctgc tttggtagca ttgagggcgt cttcaatccg 481 cagtttgcga tctttgagtt cggtttctgt tgccgcaccc acttttatca ccgcaactcc 541 accagccagt ttggcaatgc gctcttgcag tttttcnnnn nnnnnngttt ttctctatcg 601 tagtctgaat cggtttcttc tagttgttta cgaatttgac caattcgctt ttgcacgtcc 661 gcttttgtgt tgtcaccagc agcaacgatg gtggtgtttt ctttgtcaat cgtgattttc 721 cgggcagttc ccagggtttc caaagaagcg gtatccaagc ttaagccaat ttcttcagaa 781 atcagctgtc cgttggtgag aatggcaata tcttgcaaca atgctttgcg gcgatcgcca 841 aacccagggg ctttgatagc ggcgacagca agcactcccc gcgctttgtt tacaaccaaa 901 gttgctaaag cttccccttc tacatcttca ctgataatca gcaagggttg acctagacgg 961 gcgacctttt ccaaaactgg aattaactct tggatgttgc tgattttctt atcagtaatc 1021 aggatgcggg cattttcaaa ttctaccgtc aaccgttcgt tgttggtgat gaagtagggg 1081 gagatgtaac ctctgtctat ctgcatccct tctacaactt ctaaatccgt tgtcagggac 1141 ttagattctt caacagtgat cacaccgtct ttggtgactt tctccatcgc ctcagctagc 1201 attttgccga cttcttcgtc gtttccagcg gagactgtgg caacttgggc gatcgccgca 1261 ccttctactg gctttgctac tgctgcaatt tcctttacca acgcctcaac ggttttgtca 1321 attccccgct tcagggcaat agggttggta cctgcggcaa cattcttcaa accctcgcga 1381 atcaacgctt gcgccaacac agtagcagtc gtggtaccat ctccagccac atctttggtt 1441 tttgacgcca cttcctgaac cagttttgcg ccagtgtttt ctaacgggtc ttctagttca 1501 atttctttgg 
caacggtgat accatcgtta acaatttggg gtgctccaaa ttttttctcc 1561 aaaagaacat tgcgaccttt tggtcctaag gtaattttca cagcatcggc aagagcgttg 1621 atgcctcgtt ctaaagcccg ccgcgaatct tcattaaatg caataatttt tgccatgttt 1681 tgtgttctct agcgcttcca atagacaatt tagcactcac tggtcaagag tgctaatcac 1741 ttttagacaa agctacccaa aaatcgtctt gaagttgggg tgtaagggtg taggggtgta 1801 agggtgtaag ggtgtaaggg tgtaagggtg taggggtgta gggaataaat cactgggtcg 1861 aggtaccctt attggttccg gggctttcct gtacaaaaaa ttaggaattg ggatttggga 1921 tttggggaag agaaacccaa aacttggatc taggcgtcaa ctcccaactt ccccctaact 1981 cccaactcct aactcctaac tcctagtaat agtttctctt ccccccttac acccttacac 2041 ccttacac // LOCUS NODE_9615_length_2047_cov_2.7434742047 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 2047) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 2047) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2047 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(390..794) /locus_tag="DP116_28035" CDS complement(390..794) /locus_tag="DP116_28035" /inference="COORDINATES: protein motif:HMM:PF01381.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="PRJNA477356:DP116_28035" /translation="MQVGETSRQRLAELLKELRGERSQRSFAKLLGVSNQAVQYWEKE RTWPDDNNLERIAELKGWTLLQLQVYLEGEQERSSVADEDVNRVNDSELQQQSRSVQQ LLEEVRMLPFQAAVQVAKVALETMEMMAAKSS" gene 937..2037 /locus_tag="DP116_28040" CDS 937..2037 /locus_tag="DP116_28040" /inference="COORDINATES: protein motif:HMM:PF02195.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28040" /translation="MSTSDQIKIQEVSLNLLTPHPYYFSIYGANEDVSDLVELIAERN WVEPLVVTPNYVIVSDHRPWKVCQILGKDCIPVVFRQFPDEIALLKALLLENASRKKT IEQRIKEGMAWESIEKHKAKQRQGRRTDLTNIPETFPECSTGDSRDAIGSRIGLSGRS YSKGRSVVKRIDSLLQEGNEKRAQFLLSVLNKSIDTAQKLIQMSVAEQNAIAQLIEKG KARSTTTAIRMLREQSQPTVQVKTQSCWNCQHRLESIDNQSIYCNKFGILNLIYKSGD ERGRECPEWRDRTSPPAPLKNPTCTFQVLLPLEWQDRLEETAASVGMDATTWITNLIG ASLYETLDYEDDQESSQESGARIQNEFCASAG" BASE COUNT 616 a 459 c 433 g 539 t ORIGIN 1 gtctaataag gtaaccgttc agggggggag tgcccttgac gatacccatt gcctaattag 61 tcggtgaaac ccaaatatta tatgaatatg aaccgtgtaa actaaagtta tagaaatcgg 121 tgactttagt ttttcatact gggttgacca gtattggttg cacggcacaa ctcactatct 181 cctcggcttt aaaagcctct ggagaatact tgcggataat ttgcatagat ttttagaagc 241 ggttacaacc atcgatattt ctttgattaa agcttgtttc gtccgctccc actacctccg 301 actgctggac gtaaagtttt gctttattgt agaagtaggt acttgacaaa atgagagaat 361 agcgatactc ctgctcttgt gcagtcaact caagaactct tggcagccat catctccatc 421 gtctctaacg ctaccttagc cacttgaaca gcagcctgaa acggcagcat ccgcacctct 481 tctaaaagct gctgaacaga gcgcgattgc tgttgaagct cactatcatt cacacggttt 541 acatcttcat ctgcaacgct tgatctctct tgttctcctt ctagatagac ttgtagctgc 601 aacagggtcc atcccttcag ttcagcaatc ctttccagat tattatcatc aggccaagtg 661 cgctctttct cccaatactg tacagcctga ttactgacac caagcagctt ggcaaaactg 721 cgttgacttc tttcacctcg cagttctttg agtaactcag ccagcctttg tcttgatgtc 781 tctcctactt gcatcttcca tacctctatc tactacaaaa atatcacaaa 
tttttcctga 841 aaacaaattt tttacttgac aaattgacac ttgacaataa atagtaaaat aaaacacgtt 901 aaataaaaca cagagtatta atgaagttag aggtcaatgt ccactagtga tcaaattaaa 961 attcaagagg tatccctcaa tcttctaacc cctcatccat actacttcag catttatggt 1021 gcaaatgaag atgtatcaga cttggtggaa cttatagccg aaaggaactg ggtggagccg 1081 cttgttgtaa ctccaaatta cgtaatcgtt tcagatcaca gaccttggaa ggtgtgtcaa 1141 atattgggca aagattgtat ccctgtagtt ttcagacagt ttccagacga gattgcctta 1201 ctcaaagcac tactgctaga aaatgccagt cgtaagaaga ccatagagca gagaatcaaa 1261 gagggtatgg cttgggaatc aatagaaaaa cataaggcaa agcaaaggca aggaaggcgg 1321 acagacttga ccaacattcc ggaaactttt ccggaatgtt caacaggaga ctcgcgtgat 1381 gcaatcggtt cacggattgg tctttctgga agatcctact caaaaggacg ttcggtcgtt 1441 aagcgtatcg attctttact tcaagaaggc aacgagaaga gagctcagtt tttgctgtca 1501 gttttgaata agagtatcga tactgcccag aagctaattc aaatgagcgt tgcagagcaa 1561 aatgcgatcg cacagttgat agagaaaggc aaagctcgca gcactaccac agctatccgt 1621 atgctgcgtg agcagagtca accgacagtg caggtaaaaa cacagtcctg ttggaactgc 1681 caacatcgat tagagtcgat agacaaccaa agtatttact gcaataagtt cggcatcctc 1741 aacctgatat acaaatcagg ggatgaacgt ggtcgagagt gcccagagtg gagagacaga 1801 acctcaccac cagcaccatt gaaaaacccc acctgcacgt tccaagtctt actaccgctt 1861 gagtggcaag accgactaga agaaacggca gcatcagtcg gcatggatgc aaccacctgg 1921 attactaact tgattggggc tagtctgtat gaaaccttgg actatgaaga tgaccaagaa 1981 agtagtcagg agtcaggagc cagaattcag aatgaattct gtgcgagtgc tggatgactt 2041 gggggtt // LOCUS NODE_9783_length_1993_cov_3.2688341993 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1993) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1993) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1993 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 56..1324 /locus_tag="DP116_28045" /pseudo CDS 56..1324 
/locus_tag="DP116_28045" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656159.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="MFS transporter" gene complement(1377..1652) /locus_tag="DP116_28050" /pseudo CDS complement(1377..1652) /locus_tag="DP116_28050" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010993956.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="mechanosensitive ion channel family protein" BASE COUNT 541 a 459 c 414 g 579 t ORIGIN 1 tgcgctcttt attcaccttt tacggattta ttcggttagg tgcaagaagt gagttgagtt 61 agctggaaaa aatgctgcgg ttgtgctttc agtcgctttg actctgcgtg tcactgcttt 121 tgtgttgctg tctccccttg caggtgcgat cgcagaccga ctcgaccgaa agaaaatcat 181 ggttgttacc catatattca gaatgctaat tgtgggtctg ttgcctttcg tgacgcagat 241 ttggcaagtt tgcgtattga tatttgcgct taacgtcttt aatgctttct tcaccccaac 301 ttaccaagcc acaattccat tagttacggg tgagaatgac tatccacaag ccatcgctct 361 ttcggctgct acttttcaat tacttggtgt acttggccct ggtatcgctg gaagtgtcgc 421 agccttcatt ggtgcaagac aagtcttctt cttggatgct ctcagctttg cgatcgcggc 481 aatcttaatc ttcaccctac caggtcagct cgttgttgca caaaatcaac agccttccag 541 gacaacacgc cgaacatggg gagacattaa agacggtaca actcgcctcc tttcagatac 601 acctattcgt tacgcgctag cgatgcagtt agtcgcatca attgccggag cgcaaatctt 661 agtaaatact gttggctatg ttcaaggtac actcaaatta ggagaagtgc agtacggatg 721 ggtgatggcg gcttttggga ttggtgcaac tctctgtgca gttatttttg gtacgttcaa 781 tagaagtttg ccacggacaa catttgtttt gattggtgca actttgataa cattagcttt 841 gttgcctgcc aactacgcaa atttagcagc actgatgctt ttatggtttg tggcgggtgc 901 agggcaaagt ttagtgaatt tacccaccca gacattaatt gcagaccgca tcccatctac 961 tttacaagga cgagtttatg gcgcacactt tgcctggagt catctttggt gggcaatctc 1021 
ataccctctt gccggttggt tgggaagtaa ctttgctgaa cgcgagttcc tttacggtag 1081 cttagtgggc ttgatgctgc tagtggtagt gcaaattacc ctctcacctc aagtgcatga 1141 acacgaacat ctccactatc ctagtgtgca tgaacataaa cacatccacg atgaacacca 1201 tcagcataac cacgatgagg ggatggctat aggcgaatct cacaaccatc ttcacgaaca 1261 cacgacaatt atctatcaca accatcctca cacaagggat attcaccatc gccatagtca 1321 ttaaattgct gttgccaata tcttgagtag atattggtta aaacattatt acaggactaa 1381 gatgaagcac tatcaccatc ttttagaccg aatgctgctc gacacttttg cggatgacgc 1441 ggcactaaaa ttctcacatc ggggtctggt aatgcgatat tcgctacttt caatgcagca 1501 actatcgcct gtcttacaga tgatgttgta gccacaaaat tagaacgccg taaatctgtc 1561 caaaagcaag tttcaactac aacgtcatct agtcctaagt ctcgtattcg cacagaaact 1621 cccggctcat caagtacctc ctctgttgct tctagttaac cggattcaaa catttttgac 1681 gatgaaacaa gcctaaaacc atcactgggc aagaaaaata catcaaatgc tcaaaaacgc 1741 ctaaacccca aatatagcag ttgccaggta gattaggaca tgaactaatg agaaaatagg 1801 gacaccacga tactttcaag cctgttccct gttgcctgct atatatcaat aaaccaacca 1861 aaaccatgtc aaattttttt ggggtatgcg tttgaggtta ttttgtctaa gtcccattaa 1921 atgagaaatg atggtttggt ttagggactt ccaaggaata aaatcaccca agtcaataat 1981 aataatcatt gtt // LOCUS NODE_9796_length_1988_cov_3.1841701988 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1988) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1988 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..574) /locus_tag="DP116_28055" CDS complement(<1..574) /locus_tag="DP116_28055" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019245620.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28055" /translation="MKVLELFPRRALLALAVAAAVAVPASALDDAVSQEERAIREKEA KADDLPRVKLSTTEGDMVVELYENEAPNTVANFVSLVEKGFYNGLTFHRVLEDFMAQG GDPTGTGSGGPGYNIPCECYQPNARKHFPATLSMAHAGRDTGGSQFFITFVQTKHLDG KHTVFGRVIEGQEVLDKLQRIDPSSPSLGTK" gene complement(717..1298) /locus_tag="DP116_28060" CDS complement(717..1298) /locus_tag="DP116_28060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010509924.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidylprolyl isomerase" /protein_id="PRJNA477356:DP116_28060" /translation="MSYRTPDVSQEEQTIRAAEAKADDLPRVSLSTNKGDMVLELYEN EAPNTVANFVSLVEKGFYNGKIFHRVLEDFMAQGGCPEGSGRGGPGYAIPCECYLPNA RKHFAATLSMAHAGRDTGGSQFFITFGQTSHLDGKHTVFGRVIQGEEVLGKLQRVDPS RPVPGVTPDKIVTAKVLRKRDHEYQPKTLPSRR" gene complement(1295..1774) /locus_tag="DP116_28065" /pseudo CDS complement(1295..1774) /locus_tag="DP116_28065" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: GeneMarkS+." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 1336..1345 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1778..>1988) /locus_tag="DP116_28070" CDS complement(1778..>1988) /locus_tag="DP116_28070" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28070" /translation="DDPQVGFGQPVALFDRLRGGGRLAHGAAAGREPEREPRPDAIVR VSHHRDQRTVQARSSRRQGQSRKLC" BASE COUNT 323 a 628 c 677 g 350 t 10 others ORIGIN 1 gcttggtccc cagactcggc gagctcgggt cgatgcgttg cagcttgtcg agcacttcct 61 ggccttcgat cacgcggccg aacaccgtgt gcttgccgtc gagatgcttc gtctggacga 121 aggtgatgaa gaactggctg ccgccggtgt cgcgtcccgc gtgggccatg ctcagcgtcg 181 cggggaagtg tttgcgggcg ttgggctggt agcattcgca cggaatgttg tagccgggac 241 cgccggagcc ggtgcccgtg ggatcgcccc cctgggccat gaagtcttcc aacacgcggt 301 ggaacgtcag gccgttgtag aagcccttct ccacgagcga gacgaagttg gcgacggtgt 361 tcggcgcttc gttctcgtac agctcgacga ccatgtcccc ctccgtcgtg gagagcttca 421 ctcgcgggag gtcgtcggcc ttggcctcct tctcgcggat cgcgcgttcc tcttgcgaca 481 ccgcgtcgtc gagggccgag gccggaaccg ccaccgccgc cgcgaccgcc agcgccaaga 541 gcgcccgccg cgggaagagc tcgagaactt tcatcgccag gctccagagg aagaaggacg 601 acgccgtgac gacccgcgct cgcgcgcgaa ccggccgccg acgtgtggca aagcgaaccg 661 gccggcgatg aacggtcatc gtcggccggg ggagagagtc gagacgttcg cgacgattag 721 cgccgcgacg ggagcgtctt cggctggtac tcgtggtcac gcttccgcag caccttggcg 781 gtgacgatct tgtcgggcgt gacgccgggc accgggcggc tcgggtcgac ccgctgcagc 841 ttgccgagca cctcttcgcc ctggatcacc cggccgaaga cggtgtgctt gccgtcgagg 901 tgcgacgtct gaccgaacgt gatgaagaac tggctgccgc cggtgtcgcg acccgcgtgg 961 gccatgctca gcgtggccgc gaaatgcttg cgggcgttcg gcaggtagca ctcgcagggg 1021 atcgcgtagc ccgggccgcc gcggccgctg ccttcggggc agcccccctg ggccatgaag 1081 tcttccaaca cgcggtggaa gatcttgccg ttgtagaagc ccttttcgac gagcgagacg 1141 aagttggcga ccgtgttcgg cgcttcgttc tcgtacagct ccagcaccat gtcgccctta 1201 ttggtggaga gcgacacgcg gggaaggtcg tcggccttgg cctcggcggc gcggatcgtt 1261 tgctcttcct gggagacgtc gggggtgcgg tagctcatga ggggcctttc gcagcgggaa 1321 gggagacggc gcacgnnnnn nnnnngatcg aaggcaatct agcccggtcg cgagggatcg 1381 taaatcggcc ggcggccgcc cgacgggccg cgcgaccgtc acgcccggcg cgatcgcttc 1441 ctccggcaag tccggcgtgg cccgcggagt cgattccgtc gtcccggacg ggctcgggaa 1501 
aaaacgcgaa aaaaattcgc gggccgagaa aaacgccggc cgggccgccc tctccgcgca 1561 aatgccgccg gaaaacttct gaatttgccc ggaccaaatt gggcagtttt tgctcaccgc 1621 gtgaccagaa tccgtcattc gtcacgacac cgctgattct tgagcggtga tgacccatca 1681 aaaccagaac aaatgcccgt ccatcggtcg gatttgaggg cgtaatgccg ctccgtttgg 1741 ggagggcacc aaccaaggca tcgaatttgc acatgaatca gcacaatttg cgactttgtc 1801 cctggcgtct ggaacttcgg gcttggacgg ttctctggtc gcggtggtgg ctcacgcgga 1861 cgatcgcgtc cgggcgcggc tcgcgctctg gctctcggcc tgcggctgcg ccgtgcgcga 1921 ggcgaccacc gccgcgcaag cgatcgaaga gtgcaaccgg ttgcccaaag ccaacttggg 1981 gatcatcg // LOCUS NODE_9845_length_1973_cov_5.6157461973 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1973) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1973) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1973 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 347..565 /locus_tag="DP116_28075" CDS 347..565 /locus_tag="DP116_28075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876470.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28075" /translation="MLEIIINKSVEIILGLLLEKFGEWLLNERNLKRLNHLLKKQILL LYFYWVLLKTPQESLPKQPQQHDIQISE" gene 641..1744 /locus_tag="DP116_28080" CDS 641..1744 /locus_tag="DP116_28080" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017316440.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_28080" /translation="MSKSSKSDKILVVDDSPDNVFLVKTILQEEGYTVSTAANGASAL AQLETSTCDLVLLDLMMPGMDGYEVTKRIRQNTTLPFIPILLITAHDSPNVAHGLDLG ADDFIRKPVTVDELLARVRSLLRLKHSIDERDEIARLRQDFASRLTHDLRTPLVAADR MLLLFADGALGQLSPQMQEVIEIMARSNSNLLAMVNTLLEVYRFEAGRKTLAFQPVNL SKLLQEVSQELSPLADQKHLTVNLDLGENSGSSKVMGDRLELHRLFTNLIGNAIKFTD TGSITIRITPATPSENCITVEIADTGRGISPEEQLTLFERFRQGSHNHSGSGLGLYLS RRIVEAHQGTIHVNSEVGKGSVFIVNLPIKQSS" BASE COUNT 577 a 439 c 423 g 534 t ORIGIN 1 cgccacgcta gtccctaaga gggacgctgc gcaaacgcta tcacgtcccc tccccgagct 61 cgcggggagg ggcggttttg gcgcaagaca aaaccggggt ggggtaaagc gagagttgtg 121 agcaaacata agggtatgac gtaagtaata acaagggttt caggttaagt tgacacgtat 181 aagactgccc accctacaca ggatgtaaca tgctaagtcg tatctgaaaa ttacttgcaa 241 attagcacat atcaggacaa gtcaagctaa gtcactgcaa gtcaaataaa taactacgaa 301 agaaaattgc gacagttaac tcagtcaaat aactgaggcg tacgcaatgc tagaaataat 361 tattaacaaa tctgttgaaa ttattcttgg tctgttacta gagaagtttg gggaatggct 421 gctaaatgag cgcaacctca aacgcctcaa ccacttgctg aaaaagcaga ttctcttact 481 ctacttttat tgggttttac tcaaaacacc gcaagaatct ttacctaagc aaccacaaca 541 acatgatatc caaataagtg agtagtattg gtaaggggga aaatagcaaa ggctgaagct 601 atttttcttt cttgtggaat gagcgtctct tgtgtgctat atgtctaaat cttctaagtc 661 tgacaagatt ttggttgtag atgattctcc tgataatgta tttttggtga aaaccatttt 721 gcaggaagaa gggtacacag ttagcacggc tgcaaatggt gcttcagcat tagcacaact 781 cgaaacatcg acttgtgatt tagtgttgct ggatcttatg atgccgggaa tggatggtta 841 tgaagttact 
aagcggattc ggcaaaatac gactttgcca tttatcccaa ttctcctgat 901 taccgcacat gactcaccta atgttgctca tggattagat ttgggcgcag atgattttat 961 ccgcaagcct gtgactgtgg atgaattgtt ggcacgagtg cgatcgctcc tccgtctcaa 1021 gcacagtata gatgaacgcg atgaaattgc ccgtctacgt caagattttg cctctcgcct 1081 cacccacgac ttacgcacac ccttggtagc agcagatcga atgctgttgc tgtttgctga 1141 cggtgctttg ggacaactgt caccacaaat gcaggaagtg atagagatca tggctcgcag 1201 taacagtaat ctgctagcta tggtaaacac cttactagaa gtttatcgct ttgaagcagg 1261 tcgcaaaact ctagcgtttc aaccagtgaa tctcagtaag ttactacaag aagtttcaca 1321 agaattgagt cctttagctg accaaaagca ccttacagtc aatttagatt taggtgagaa 1381 ctcagggagt agtaaagtga tgggcgatcg cctggaactc cacagactat tcaccaacct 1441 gataggtaac gctatcaaat ttactgacac cgggtcaata accatccgca tcactccagc 1501 gactccttca gagaattgca taacagttga aatcgcagat acaggtcgag gtatttctcc 1561 tgaagaacaa ctcactttat ttgaacgatt tcgccaagga agtcataatc attctggtag 1621 tggcttagga ctgtaccttt cgcgacgaat tgtcgaagca catcaaggca cgattcacgt 1681 gaattcagaa gtcggcaaag ggagtgtatt cattgtgaat ttgccaatta aacagtcatc 1741 atagaaagta tgagttattt taacaatgcc aatttcatac acccttttta tagcccgcgc 1801 aacccaacaa gttttatgtt ttttgcgcta actgatacca agttgcagct caattgcctc 1861 acctcatccc ctctccttaa tcctcacctt tatcccctct ccttaatcct cacctctatc 1921 ccctctcctt aataaggaga ggggtgcccg atagcgtagc gtgccgttag gca // LOCUS NODE_9906_length_1955_cov_2.7273681955 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1955) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1955) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1955 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..739 /locus_tag="DP116_28085" CDS <1..739 /locus_tag="DP116_28085" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877480.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28085" /translation="NMGDSHDGNSKLEPANATAKLTLPDKITPKKPVPLAIDVKDKEG RPIANFDTFQEKLMHLIVVSDDFQSFNHIHPTYKGNGRFEVQADFLHSGNYTLFSDYK VAGKAEQVSVLKAQVPGKSPAASEIDLATTKTLGDTKANLKLSQPTIKAGQEVHLIFN LQDAANNQPLKDLKPYLGERGHLVILKQSSPLTEADYIHAHALKNTPAGEIHFITSFP QAGKYKMWGQFNRNGKIITADFWVDVQ" gene complement(1136..1460) /locus_tag="DP116_28090" /pseudo CDS complement(1136..1460) /locus_tag="DP116_28090" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016866412.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 604 a 425 c 379 g 547 t ORIGIN 1 taatatgggt gattctcatg atggtaattc caaattagaa cctgcgaacg ctaccgcgaa 61 gctcaccctc cccgacaaaa tcactcctaa aaaacctgta cctttagcaa ttgatgtcaa 121 agacaaagaa ggaagaccaa ttgctaattt cgatactttc caagaaaaat tgatgcattt 181 aatcgtcgtg agtgatgatt ttcagtcttt taatcacatc catcccacat acaaaggaaa 241 tggacgcttt gaagttcaag ctgattttct ccactctggt aattacacac ttttcagtga 301 ttacaaagta gcaggaaaag cagaacaagt ttcggtctta aaagcacaag ttccaggaaa 361 aagtccagct gcatcagaaa ttgatttagc tactacaaag actcttggtg atactaaggc 421 taatctaaaa ttatctcaac ctacaatcaa agctggacaa gaagttcatc taattttcaa 481 cttacaagac gctgctaaca accaaccact gaaagatttg aaaccttatt taggagaaag 541 aggacattta gtcatcctca agcagtcatc accattgaca gaagcagact acattcatgc 601 tcatgcattg aaaaacactc cagcaggaga aattcatttt atcaccagtt ttcctcaagc 661 aggaaaatat aaaatgtggg gtcagtttaa tcgtaatggc aaaatcatca ctgctgattt 721 ttgggtagac gtgcaatagt tatctcaaat ttcatcttga tttcatcatg gtcatatact 781 taagtttcgc cgtctcaatt tatgctttga tagtcattag aggcttttta gttgtgccac 841 tatggtatga ctgaaagtct ggttttatat taagaatgag tgaaatgaga taatcagcta 901 ttctcagaat caatctaatt tgatgaggag attaaccgtg aaatttcatc ttaatttcat 961 tttaattatc tataataaat 
attaaaaaca tgctcctcat gtttcacaat cctgacgggc 1021 ttatagtgat actgtaagcc tgttttctta ctggagttta gactaaccga tgacaaaatg 1081 actcagaagc actgagttga caagctacac agctgaaaat ataaaaatta aaaggtcaag 1141 ctgaccttgc tggttctttc ttgggtttag tagtagcctt tttaactata ggataacgaa 1201 ttctacgcac acggggtttg ccagtagtcc acccagggga ctttccacgg ggtttgggtg 1261 caacagcagg tgtaccaatc gcccctaaaa ctccacccat agcctgagca acccttcctg 1321 aagtcagttt agtctgtggc ttttgccagg gtagaggatt gtcagccacc cttcgggaac 1381 gcccgtcgcc tacggcggga aacccgcctg cgccgtgctg gtctgacaat atcacgagct 1441 ttccctagtt gccaggtcat cagtggtaac aagtcactcc aagactcaca ctgttctgga 1501 gtacttaatt tcggaagagt ccaatgcaag cttgcgagcg caaatctata ccaatgatca 1561 acagtaaaac gtcgcaggta gagtcgccaa aattcagaca attcaggcat gtccattcct 1621 acccatgcaa gccataaagg tttaaattga ttaactgcat cttcattcaa acgctgcact 1681 tgaatcaaat ccatttttat cttggcagat tgttgaaagt gaaaattacg ccacaaatgc 1741 aatcgtagca accctaactt tggatctttg agttctacaa attgatcagc tgtccaccaa 1801 gtttggctgt cgttgagctt gaatttatcc ccttgacgaa gctggcacaa ccgtactctg 1861 catcccacaa cgataaagga cgtgttggca gtttttcacc gggaaaggca gagggcagaa 1921 ggcaggaggc agaaggaatg tctcctgccc tctgc // LOCUS NODE_9958_length_1937_cov_3.8065891937 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1937) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1937) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1937 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 408..1829 /locus_tag="DP116_28095" CDS 408..1829 /locus_tag="DP116_28095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017318295.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28095" /translation="MNEADTPNNGDFVQPQDSNSLQPKQASCPFYSVSKSENTATAKL PLLAPAENQQCESQILSEENTPQQDWVADPDSEKQELVDAEFKKLLGLNEELRSANND LYDKVEELKSVLTESEMALQWQKKRSSVTESMLNQQAQELTASQDQVNSLFQQLETAV QTVQRQESLISNYKTQLEISQQRLAQLERECVLMKSSYNDQSHQLLQSESACRELRTR LMRQQRQTLQFKAALEKCLETPVPSYDSLENNDTSSSQTRPFRHSSVLFSQAQPIQPW SVESESLTNSVDNACKGTPALPINQWDNSTPNSSSTCDLDTEETAIQTQSVDTPEPEI QTVSSLGLTSLEEQLDGVIQMFFVANPPVSTSPEPPVEKDANSTQAGEAIWETSVIPL KDEPKITETLATNDSPEETEDFWVNVAQSSSLELQKTADSPEPFTDHATNDPSPLLYP QRPPKKRKSLASVELPNFQQKSH" BASE COUNT 614 a 438 c 418 g 467 t ORIGIN 1 cggagccact tctgtgcggg ggttcccccc gttgaagaat gtggcgtcag gcgacccgcg 61 ttcccggagg gtttcttgcg ctagttaaag gctcatatcc caaaaaggcg atattgcgat 121 ataggttgag caattctcac gcgttcacct ataacaagcg cgagtctcgt ggttgcagta 181 ctctcgcgct tttcacatag cagcatcaaa aacaagaata ttttacaact ttagcgtgtt 241 tttgcagaca gtagcgagga aaagctacaa ttcactcatg tttatttatg tattttagaa 301 ggagcaatac ttgccttcta tagtaaattt tggtaagaca ggatttatat ataatgctac 361 agcagtctgt ggttcttttt ttatagctgg tgggtgaaga gagagtgatg aatgaagctg 421 acaccccaaa taatggggat tttgttcaac cccaagatag caattcactg caaccaaagc 481 aggcgagttg tccattttac tcagttagca agagtgaaaa cactgcaact gcgaaactgc 541 cactactggc accagcagaa aaccagcagt gtgaaagcca aattctatca gaagaaaata 601 caccccagca agactgggtt gcagaccctg acagcgaaaa acaagaactc gtagacgctg 661 agttcaagaa attgttggga ttaaatgaag aattacgttc tgccaacaat gatttgtacg 721 acaaagttga ggagctaaaa agtgtcttaa ctgagtcgga gatggctttg cagtggcaga 781 aaaaacgctc aagtgtcaca gagtcaatgc ttaaccaaca agctcaagaa ctgactgcgt 841 ctcaagacca agtaaattca ttatttcaac aattggaaac tgctgtacaa actgttcagc 901 gccaagaaag tttgatctca aactataaaa cgcagctaga aatcagtcaa caacgccttg 961 cacagttaga acgggaatgt gtgttgatga aatctagcta caacgatcag tcccaccaac 1021 tattgcaatc agaaagtgct tgtcgggagc tacgaacaag gctcatgcga caacaacgcc 1081 agactctgca gttcaaagct gctctggaaa agtgcctgga gacacctgtt cctagctacg 1141 actcccttga gaataacgac 
acaagcagta gtcagacaag accattcaga cactctagcg 1201 ttttgtttag ccaagcacaa ccaattcaac cttggtcggt ggaatcagaa tccttgacaa 1261 atagtgtaga taatgcttgt aagggaactc cagcgttacc tataaatcaa tgggataact 1321 ccacaccaaa ttcatcatcc acttgcgatt tagatacaga agaaacagct atacaaacgc 1381 aatcagttga cacaccagaa ccagaaattc aaacagtttc atctttgggt ttgacaagct 1441 tagaagagca attggatggc gttatccaaa tgttttttgt tgccaatcct cccgtatcga 1501 catctccaga gcctcctgtt gaaaaagatg cgaatagcac ccaggcaggt gaagctattt 1561 gggaaacttc tgtaataccc ctcaaagatg aaccaaaaat tactgaaaca ttggcgacta 1621 acgattctcc agaagaaact gaggacttct gggtaaacgt agcacaatca tcatccctcg 1681 aattacaaaa aactgctgac tctccagaac ccttcaccga ccacgctact aacgaccctt 1741 cgccactgct ttatccgcag cgtccaccca aaaagcgaaa gtctctagca tctgtggagc 1801 taccaaattt tcaacaaaag agtcattagt cattagtcat tagtcattag tgatgtgtag 1861 agacgtagca tgtgaggcag gtgctaacag cgggttcccc gacttgtaga cgccggaggc 1921 ggtgaggggg tcgcagt // LOCUS NODE_9982_length_1931_cov_2.2361411931 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1931) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1931) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1931 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1046 /locus_tag="DP116_28100" CDS <1..1046 /locus_tag="DP116_28100" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017656161.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="MerR family transcriptional regulator" /protein_id="PRJNA477356:DP116_28100" /translation="VRHYHTLGLLPPVQRSEGNYRLYTQQDVQRLQRVIALKQQGFQL SHIRQLLDSHLQETIDPTLMAQLQQQYRSVIQQITRLRQTASALEGLLGQDRHCQITQ AEALAQLKQLEVDAQEGLGKLDQLWINLDAETTTHPEAFQESLQRLLPDLSHYSEITV HLLHQLVLACGDVSLVNFVRLSRDAISAAWDALKFGCSVVTDVPAVAATLDHTRLAHL GCPVEALIDDPHITGASEAEQAFWDHHLWQKRLQQIPKGCVLVIGYAPSVLLTTCQLI EQQQIQPAFVIGMPIGFSHAPAAKRRLMETQIPYITIQGSLGGGLLAAMTLNALVETL IEKPDCHCYLTRP" gene complement(1175..1585) /locus_tag="DP116_28105" CDS complement(1175..1585) /locus_tag="DP116_28105" /inference="COORDINATES: protein motif:HMM:PF13358.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28105" /translation="MAMTRLYARAFKGKRAHGARPDKRGKNVTVIGAIAYGRATLNAL RGIVAGMTFKGGTDKMAFQTYVEQVLVPNLWEGACVVMDNFSSHKVAGIQEAIEAAGA HLVYLSPYSPDFSQARKLLVKDERVSAFHRSPYI" gene complement(1551..>1931) /locus_tag="DP116_28110" CDS complement(1551..>1931) /locus_tag="DP116_28110" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28110" /translation="YLDTVRIEPLPHKSGNAPKIKEEHYSVLQKLVEENNDSTLEELC VQMELKSQVKISRSGMGKTLQKLKLTRKKTLHAAEQDREIIRTFDANNLVFVDESGVK LAGGSFPRQTLRQHSNDETLCQSI" BASE COUNT 501 a 471 c 414 g 545 t ORIGIN 1 tggtacggca ctatcatacg ctagggttgc taccacccgt tcaacgctca gaaggcaact 61 accgccttta tacccagcaa gatgtacaac ggctgcaacg ggtaatagca ctgaaacaac 121 aagggtttca gttatcccac attcgccagc tattagacag ccatttacag gaaacgattg 181 atccgacttt gatggctcaa ttgcagcagc aatatcgttc cgttattcag cagattaccc 241 ggttgcgcca aaccgcatcc gcattggagg gattgctggg gcaagatcgt cattgtcaga 301 tcacccaagc tgaagctctg gcacaactaa agcaattgga agttgatgcc caggaggggt 361 tgggtaagct agatcaactt tggataaacc tggatgcaga gacaaccacc caccccgaag 421 cgtttcaaga atctttgcag cggctgctcc ccgatttatc tcactattct gaaatcacgg 481 tacatctgtt gcatcaactc gtgctggcat gtggagatgt tagtctggtc aatttcgttc 541 gcttgagtcg agatgcgatc tctgctgcat gggatgcgct caaatttggg tgttcggtag 601 tgacagatgt gccagccgta gctgccacgc tggatcacac gcggttggct catttaggat 661 gtcccgttga agcactgatt gatgaccctc acatcacagg ggcaagtgag gcagaacagg 721 cattttggga tcatcacctc tggcagaaac gactacaaca aattccaaag gggtgtgtac 781 tcgtgattgg atacgcccct tcggtactgc tcactacctg tcagttgatt gagcagcagc 841 aaattcaacc agcctttgta attggaatgc cgatcggatt tagtcatgcc ccagcagcaa 901 agcgacgatt aatggaaaca caaattcctt acatcacgat tcaaggcagt ctcggtggtg 961 gacttttagc agcgatgacg ttaaacgcat tggttgaaac attgattgaa aagccagatt 1021 gccattgcta tctcactcgt ccgtagtatt agtagcactt aaatatagcg gttctcattt 1081 gggtgcaata caataaccgc aatgtttaaa ccagccaata atgtcactta aatgaatagt 1141 ttcaaaagcc tctgtaatcg ctttatctaa atcctcatat gtacgggctg cgatggaacg 1201 cagaaactct ttcatctttg accaacaatt ttcgcgcttg agaaaaatca ggcgaatacg 1261 gtgataaata aaccaagtgt gcgcccgctg cctcaatcgc ttcttgaatt cctgcaactt 1321 tatgcgagct aaaattatcc attactacac aagccccttc ccaaagatta ggaaccaaga 1381 cttgttcaac ataagtttga aatgccattt tatccgttcc gcctttaaag gtcattcctg 1441 caacgattcc acgtaaggcg 
ttaagcgtag ctctgccgta ggcaatcgcc ccaatcactg 1501 tcacattttt tccccgctta tctgggcgag caccatgtgc cctttttccc ttaaatgctc 1561 tggcataaag tctcgtcatt gctatgttga cgtaaagtct gccgggggaa gcttcccccg 1621 gcgagcttta caccggattc gtcaacaaac accaagttgt tcgcgtcaaa tgtacggata 1681 atttccctgt cttgctccgc cgcgtgtaag gttttttttc gggtcaattt cagcttttgt 1741 aacgttttac ccattcctga gcgactaatt tttacctggc tttttaactc catttgtacg 1801 cacaattctt ccagagtgct atcgttattc tcttctacca gtttttgtaa aactgaataa 1861 tgttcctctt ttattttagg tgcattccca cttttatgtg gtaatggttc tatacgaact 1921 gtgtctaaat a // LOCUS NODE_10033_length_1915_cov_3.9575271915 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1915) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1915) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1915 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 109..294 /locus_tag="DP116_28115" CDS 109..294 /locus_tag="DP116_28115" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28115" /translation="MYNNNKDDKKNISTDIWRSIDFQIVKHRIDKINAKLEAVLGKYN SQFRGDDLFHQNQDWKD" gene 364..810 /locus_tag="DP116_28120" CDS 364..810 /locus_tag="DP116_28120" /inference="COORDINATES: protein motif:HMM:PF00583.23" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="GNAT family N-acetyltransferase" /protein_id="PRJNA477356:DP116_28120" /translation="MSEEIQTFTTEYLDDCANLYVEVFNSEPWNEQWTIETARLRLFE ILNTPGFIGFVLRQDEVLGFVAGYCEQMQKGKGFYLKEICTHHDKQRRGIGTKLLNQL MNTLTGMEVTAIYLVTMKDGQAEAFYTKNGYQRSQKLIVMSKRFYK" assembly_gap 1021..1030 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1156..1725) /locus_tag="DP116_28125" CDS complement(1156..1725) /locus_tag="DP116_28125" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019495094.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28125" /translation="MNAISVVLKEGSKGAEVTKLQEGLKKLNFYSGVVDGIFGAKTKE AVINFQKSQQLVADGIVGEKTWSKLNAALQPSDPRSNFRVINVQEFISSNRIKVSTPK AVAVQLFSKREDEEGRRSEDIVVEYPTRETAEIVHAIVGLADDSIRGIRQRIELKQKQ NKWEIVWVGEQYKCQPNRGSQDWSSSPCL" BASE COUNT 597 a 388 c 354 g 566 t 10 others ORIGIN 1 tatattaggc taattatata tcaaaagaga gaaccgaata gaattgtaat ttgtaaaaat 61 tgtgagtaaa gcaaaacaat aatcaagact ttattttcca taattaccgt gtataacaac 121 aataaagacg ataaaaaaaa tatcagcaca gacatttgga ggagtataga tttccaaata 181 gttaaacaca gaattgataa aattaacgct aagcttgagg ctgttctggg taagtataac 241 tctcaattta ggggcgatga cttatttcac cagaatcaag actggaaaga ttaacatttt 301 taaagctttg cttggtaaac tagatgtagc ttagttcatg accaactaaa ctccagtttt 361 ttgatgtcag aagaaattca aacctttacg actgaatatt tggatgactg tgccaatctg 421 tatgttgaag tgtttaacag tgaaccttgg aacgagcaat ggactatcga aactgcaaga 481 ttgcgcttgt ttgaaattct caatacacct gggtttatcg gatttgtctt acgacaagat 541 gaagtgctgg gattcgtcgc tggttattgt gagcaaatgc aaaagggtaa aggcttttat 601 ctcaaagaaa tctgtaccca ccatgacaaa cagcgtcggg gaattggaac gaagttactc 661 aaccaattaa tgaataccct aaccgggatg gaggttactg caatttatct ggtaactatg 721 aaagatggac aagcggaagc tttttacact aaaaatggct atcaaagaag ccagaaatta 781 attgtaatgt caaaacggtt ttataaataa ccattaatac aagcatataa cagctaggtt 
841 aggaacaagt cctaacccag tctcggtcgt taatatatgg aaaccaataa agtctcctta 901 tttcaaccga aactcatcat atcatgtccg gttaattgct tataaatccc gcacaacccc 961 accccggttt tgtcttgcgc caaaaccgcc cctcccctta ccaaggggag gggtaagggg 1021 nnnnnnnnnn cccctccccg caagcgggga ggggattaag gggtggggtg cagatatcgc 1081 ggaaatcaca actaatcaac tcaaactcat caaactgaat acctaccttt tttgagaatt 1141 tggcgaaggt gattgttata agcaaggact gctagaccag tcttgggatc ctcgatttgg 1201 ttgacacttg tattgctcac ctacccagac aatctcccac ttattttgtt tctgttttaa 1261 ttcgatgcgc tgtcttattc cgcgaattga atcatcagct aatcctacta tggcatgcac 1321 aatttcagct gtttctcttg taggatattc aactacgatg tcttctgacc gtcttccctc 1381 ttcgtcttcc cgcttgctaa acaattgtac agctactgct tttggagtag agactttaat 1441 ccggttagaa gagataaatt cttgtacatt aattacccga aaattggaac gtggatcact 1501 tggttgcaac gctgcattta atttagacca agttttttcg cccacaatac cgtcagcaac 1561 gagttgttgt gatttttgaa aattgatgac agcctcctta gtttttgctc caaaaatacc 1621 gtcaacaaca cctgaataaa aattgagttt tttcaaacct tcttgtagct tagtaacttc 1681 tgcacctttt gaaccttcct taagtacaac ggaaatagcg ttcataattt gtcctttctc 1741 taaactcaac tacgtttgtt gctaattgat cttcatgttt tgtgtgaaga tatacttaaa 1801 atcacttttg aatcagccga ttccggacgc acaaatataa aaatttaaaa cttattacat 1861 cccctacaac gccaggtgct tcaagtcggg aaacccgccc aacgcactgg ctccc // LOCUS NODE_10046_length_1912_cov_5.0500811912 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1912) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1912) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1912 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(51..1223) /locus_tag="DP116_28130" CDS complement(51..1223) /locus_tag="DP116_28130" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006195549.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pyridoxal phosphate-dependent aminotransferase" /protein_id="PRJNA477356:DP116_28130" /translation="MESLISRMEAVQSPIIPVVAELIKSCPGTISLGQGVVSYSPPAE AIELLPKFLAEPTNNLYKAVEGILPLQTAIAAKLQAFNGIEINEDNCIVVTAGSNMAF MNAVLAITSVGDEVILNTPYYFNHEMAITMAGCRPVLVKCDENYQLRPEAIASAITPK TRAVVTISPNNPTGAVYSEQALWEVNQICRNHGIYHISDEAYEYFTYNGVKHISPGAF AGSSKYTISLYSLSKAYGFASWRIGYMVIPKHLLVPVKKVQDTILICPPVISQYAALG ALQAKEEYLKDHIGAIAQVRQVVLDSLNSLQDLCTIAPADGAFYFFLKVHTQLDSFEL VKRLIREHQVAVIPGTTFGMDDGCYLRVAYGALQKQTAKEGIERLVRGLQKICKVH" gene 1447..>1912 /locus_tag="DP116_28135" CDS 1447..>1912 /locus_tag="DP116_28135" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877321.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PD-(D/E)XK nuclease family protein" /protein_id="PRJNA477356:DP116_28135" /translation="MQSTQTQLLRLSQGQLNLLERCPRQFQHTYLEQLHSPLDPEHEE RQTLGSRFHLLMQQREMGLPIDTFLQEDTQLQSWMSAFANAAPEILNSTSNSQTFRES EHYRTLQVQDYLLTVIYDLLIADNQQAQILDWKTYPKPPNKRKLEQNWQTRLY" BASE COUNT 571 a 443 c 387 g 511 t ORIGIN 1 ccctataccc ctacaccctt aaacccctac acccctgtgt ttcttgaaag ttaatgcacc 61 ttgcatattt tttgcaaacc cctcaccaag cgctcaattc cttcttttgc tgtctgtttt 121 tgtagcgcac cgtaggcgac gcggagatag catccatcat ccatcccaaa ggttgtacca 181 ggaatcactg cgacttgatg ttcgcggatg agtcgtttga caagttcaaa gctatccaat 241 tgagtatgaa ctttcaggaa aaaatagaaa gcaccatcag caggagcaat agtacacaag 301 tcttgtaagc tgttgaggga atcaagtacg acttgtctta cttgggcgat cgccccaata 361 tgatccttca aatattcctc tttcgcctgc aacgccccca aagctgcata ctgggaaatc 421 acaggcggac aaatgagaat cgtatcttga acttttttga cggggacaag caggtgttta 481 gggataacca tgtaaccaat acgccagctt gcaaaaccat aagccttaga aagactataa 541 agagaaatcg tgtacttgct acttccggcg aatgcaccgg gagaaatatg tttcactccg 601 ttgtaagtaa agtattcata ggcttcatcg ctgatgtggt agataccatg attgcgacaa 661 
atttgattga cttcccacaa cgcttgttct gaatacacag cccctgtagg attattaggg 721 gaaattgtca cgactgcgcg tgttttggga gtgatagcag aggcgatcgc ctctggacgt 781 agctggtaat tttcatcaca tttcacaagc accggacgac aacccgccat agtaatcgcc 841 atttcatggt tgaaatagta aggcgtattg agaataactt catctcccac cgaagtgata 901 gcaagaacag cattcataaa cgccatattg ctacctgcgg taacaacaat gcagttgtcc 961 tcatttattt caataccgtt aaaagcttgt aattttgcag ctattgcagt ttgtaaaggg 1021 agaattcctt caacagcttt gtataaatta ttagtaggtt cagcaaggaa tttgggcaaa 1081 agttctatgg cttcggctgg tggactatag gaaacaacac cctgtcccag agagattgta 1141 ccaggacagc ttttaatcaa ttccgcgaca actggaataa taggcgactg caccgcctcc 1201 atccgagaga taagagattc catacggtat cacatttaag ctaagttcaa ttttctctca 1261 tatggcgctt gtatttgagt tgtgaaatca aagattggat ttagatgagg aggatgacga 1321 gagaaggagg gaaggaagga gggaaggata ttttcccaca ctcccacact cccacttttt 1381 ctgagatagg tattcacgcg aatcatttag ggctgtatac aatgcgatcg cctctaaaat 1441 agaactatgc aatccactca aactcaactc ttgcgacttt cgcaaggaca acttaaccta 1501 ctagaacgtt gtccccgtca atttcaacac acctacctag aacaactcca ttctccctta 1561 gatccggaac atgaagaacg gcaaacttta ggaagccgct ttcacttgct gatgcagcag 1621 cgagaaatgg gtttacccat tgatactttt ctacaagagg atacccaact gcaaagctgg 1681 atgtccgctt tcgcgaatgc agcaccagaa attttaaact ctacttccaa tagccaaact 1741 tttcgtgaaa gcgaacacta ccgtactctg caagttcaag attatttgct cacggttatt 1801 tatgatttat tgattgcaga taatcagcag gcacaaattc ttgactggaa aacttatccc 1861 aaaccgccca ataaacgcaa gttggaacaa aactggcaaa cacgtcttta tc // LOCUS NODE_10181_length_1870_cov_5.3085401870 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1870) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1870) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1870 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(435..785) /locus_tag="DP116_28140" CDS 
complement(435..785) /locus_tag="DP116_28140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016867613.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28140" /translation="MLTEEILVQKFTAVLNDRCPELSGLLEYCHVELVNSYWGKPPKL SQYFVVYSCDQLFPSVNAYKDILRGVAENLGISEAICMNATRILRDPASTLKQKNPIL WLELQWVVTLYLEW" gene complement(873..1694) /gene="nifH" /locus_tag="DP116_28145" CDS complement(873..1694) /gene="nifH" /locus_tag="DP116_28145" /EC_number="1.18.6.1" /inference="COORDINATES: similar to AA sequence:RefSeq:YP_009055.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitrogenase iron protein" /protein_id="PRJNA477356:DP116_28145" /translation="MRKIAIYGKGGVGKSTTTQNTVAGLVDMGRKVMVVGCDSKADST RLLLGGLHQKIVLDTLRKEEDVNLEDFRKEGWGKTLCVESGGPEPGVGCTGRGVLTSI GLLEQLGAYDEEVRLDYTFYDGLGNVVCGGFVMPIREGKAQEVYIITSGEIMAMYATN NICRSIQKYASIGSIRLGGLICNSRNVDQQDNVIQALAEQLGTQMIYSMPRNNMVQRA EFYSKTVIAYAPECQQAQHYRNLAKAIDQNTNFVIPKSMSNDQLEKLLAKFWLFD" BASE COUNT 475 a 440 c 414 g 541 t ORIGIN 1 gtgttgtatc ttctcctttt tctttgatgt aaacgtgctt ttcacgttcg ggtattttca 61 catcgcattt caataatttc aatggcatga cttattgctc ctttggctat ttcaatacta 121 gaacaaagct tttttttctt ctgtttgaga actggacttt acagcagttc tgtgtaaggt 181 gggcagttcc ttcaacggga cgccacatcc ttcaagtcgg cgaagcgttt cccttcggag 241 tggctcggga acccccgcaa cggactgctc actgatatct caaatgttca ttagataagc 301 ttttatgggc aatgcccacc ttacgatttg tggtgagtca gcgctgcggg agggtttccc 361 tccgcaggcg actgcgaacc cggagggtct aatcatagaa ttagcgcact aaactagtgc 421 cagttgcgta agttctacca ttctagatac agtgttacca cccactgaag ttccaaccac 481 aggatagggt ttttttgctt caacgttgag gcgggatcgc ggagaatgcg tgtggcgttc 541 atgcaaatag cttcggaaat acccaggttt tcggcaacgc ctctaagaat gtctttgtag 601 gcattgactg agggaaacag ctggtcacag gaatatacaa caaaatattg agaaagtttt 661 ggaggcttgc 
cccagtagga atttacgagt tcgacatgac agtactcaag caacccactg 721 agttccggac agcgatcgtt gagcacggcg gtaaattttt gaactaaaat ctcttcagtc 781 aacataatca gtcatactcc ttcaatcagg taaacaagtt ttggcgtagt tgaaaacgcc 841 gatttatcag agaatgatag ggttgcattg tcctagtcaa acaaccaaaa cttcgccagc 901 agtttctcaa gctgatcatt agacatagat ttcggaatca caaaatttgt gttttgatca 961 atcgcctttg ccaagttgcg atagtgctgt gcctgttgac actcaggagc atatgcaatc 1021 acagtcttgc tgtagaactc tgcgcgttgt accatgttgt ttctgggcat ggagtagatc 1081 atttgagtac ccagttgctc tgctagagcc tgaatcacat tgtcttgttg atcaacattg 1141 cgagagttgc agatcaatcc tcccagacgt atactaccga tagaagcata tttttgaatg 1201 cttcggcaga tattgttagt cgcatacatt gccatgattt ccccagaagt gataatgtaa 1261 acttcttgag ctttgccttc acgaatgggc atgacaaatc caccacacac cacgttaccg 1321 agaccatcat agaaagtata gtcaagtctg acttcttcat cgtaagcacc aagttgttcc 1381 agcaacccaa tcgatgtcag aacgcctcga cctgtgcatc ccacaccagg ttccggtcca 1441 cccgactcta cacaaagcgt cttcccccat ccttccttgc ggaagtcttc taaattcaca 1501 tcttcttctt tacgcagtgt gtccagtaca atcttctggt gcaatcctcc caagagcagt 1561 cgggtagagt ctgctttcga gtcacatcct acaaccatta ccttacgacc catatcgacc 1621 agtccagcaa ctgtattttg agtcgttgtg gacttgccaa ctcccccttt tccgtaaatc 1681 gcaatcttcc gcataaaaac ctcgcttatc gggttaaaac taatcagaag tgagggattt 1741 ttgtgtaaac aattttgtgt tttctgttgt ttcaggatta gcaattctga gcaagaaagg 1801 ggtgtagggg tatgggggtg taaggggagc cagtgcgttg ggcgggtttc ccgacttgaa 1861 gcacctggcg // LOCUS NODE_10233_length_1854_cov_3.7398551854 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1854) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1854)
  AUTHORS   Alvarenga,D.O., Fiore,M.F. and Varani,A.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1854
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            73..327
                     /locus_tag="DP116_28150"
     CDS             73..327
/locus_tag="DP116_28150" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28150" /translation="MKNTTLNSQHHQPELTNSQHTLTCCVNSGGSHLSELSDSELLLV VGGGNWNQPPQKERFLNLFWAAVNWANFNGPTSNRGNQLG" gene complement(776..>1854) /locus_tag="DP116_28155" CDS complement(776..>1854) /locus_tag="DP116_28155" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016874756.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="carbohydrate kinase" /protein_id="PRJNA477356:DP116_28155" /translation="IAINGTSSTVLLCDAAGKPVDVPLLYNDARGAVVSEELRSIAPA NHTVLSATSSLAKLLWMSRLPSFPKARYFLHQADWLAFLLHGQLGITDYHNALKLGYD VEELKYPEWLENLKIKIHLPKVITPGTPIGELRSEIADQFEFRRDCIVCAGTTDSIAA FLASGAKLPGEAVTSLGSTLVLKLLSHTRVEDSRYGIYSHRLPPLDKGGTEGGLWLTG GASNTGGAVLQQFFTKSELESLSCEIDESKVSELDYYPLLKSGERFPINDSNLLPRLE PRPASNVEFLHGLLESIARIEARGYELLQELGADRLTQVYTAGGGAANPTWRAIRKRI LQVPMVQSVNMEAAYGTALLAMRG" BASE COUNT 501 a 470 c 370 g 513 t ORIGIN 1 ttttgacaaa gagtgttttt ttgtgttagg aattgcttcg gtaatattta taacagatgc 61 aacggcaaat aaatgaaaaa cacaactcta aactcacaac accaccaacc cgaactaaca 121 aactcacaac acacactgac ctgttgtgtg aattctggtg gttcgcacct gagtgaactc 181 tccgattcgg aactgctctt ggttgttggt ggtgggaact ggaatcagcc tcctcaaaaa 241 gaaagattcc taaacctttt ttgggcggct gttaactggg ccaacttcaa tgggccaact 301 tcaaatcgtg gcaaccagtt gggatagggg tactctgcat tctaccaagg agtgcatcaa 361 atgcacacgg ttctgttagc ttggataaac ttctctacca caactgccac ttccgggcac 421 ttgaatacct agcactcgct gtgctcaaag tcaaaagtca aaagtcaaaa gtcaaaaaag 481 actcttactt ttgacttttg atttttgatt tttgaattcc ccgcaggggc tgcttcccca 541 attcgacttg ctggctggtt tggcttttgg tttatatctt tggcaaaaac ctcgatactc 601 agatcttgca tctgcttttt cgtccaccaa gaaataaatt tcttggctca gagctactgt 661 aagctaaagc tcactggata tacttcagtc cgttttaacg 
gactttggct atgagccttg 721 aacttgagtt caaggcgtac tcaccagtga ggtgcaagat ctgaggatac cgttgctatc 781 ccctcatcgc caacaacgcc gtcccataag ccgcttccat attcacagac tgcaccatag 841 gaacttgcaa aatccgcttt ctaatcgccc tccaagtagg attcgccgca ccaccaccag 901 cagtataaac ttgggttaat cggtcagctc ccaattcttg cagtaattca tatcctcttg 961 cttctatccg agcaatgctt tctagcaacc catgcaaaaa ttctacatta ctagctggac 1021 gcggttccag tcttggcaat aaatttgagt catttatcgg aaagcgttcc ccagatttca 1081 acaacggata ataatccaac tcgctgactt ttgactcatc gatttcacaa ctcaagcttt 1141 ctaattcact tttggtaaaa aattgctgta gcactgcacc tcctgtattg gaagcaccgc 1201 cagtcagcca taaccccccc tctgtccccc ccttgtcaag agggggaagg cgatggctat 1261 aaattccgta cctcgaatct tccacgcggg tatgactcaa cagcttcagc accagtgttg 1321 aaccaagcga agtcactgct tcaccaggga gtttggcacc actggcaaga aaagctgcaa 1381 tactgtcagt tgtcccagca cacacgatac aatctcgacg aaactcgaat tgatccgcaa 1441 tctcagagcg taattcacca atcggagttc caggagttat tacttttggt agatgaattt 1501 ttatttttaa gttttctagc cattctgggt atttcaactc ttccacgtca taacccagct 1561 ttaatgcatt gtggtagtca gtaataccca actgtccatg caggagaaat gctagccaat 1621 ctgcttgatg cagaaaatat ctggctttag gaaaagaggg caatcgcgac atccacagaa 1681 gtttagcaag gctggaagta gcacttaata ctgtatgatt tgccggcgct atgcttctta 1741 actcttcgct caccactgct ccccgtgcgt cgttgtacag caggggtacg tctacaggtt 1801 tccccgcagc atcacacagc agaactgtgg aagacgtacc gttgatcgca attg // LOCUS NODE_10360_length_1822_cov_3.1878891822 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1822) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1822) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2018) School of Agricultural and Veterinarian
            Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof.
            Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900,
            Brazil
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1822
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            180..1766
                     /locus_tag="DP116_28160"
     CDS             180..1766
                     /locus_tag="DP116_28160"
                     /EC_number="5.3.1.9"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017653850.1"
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glucose-6-phosphate isomerase" /protein_id="PRJNA477356:DP116_28160" /translation="MDATALWQRYQKWLYFHEELGLYLDVSRMRFDDAFVEKLQPKFE KAFADMAELEKGAIANPDEDRMVGHYWLRNPDLAPTPELTQEIVDSIEQIEVFAEKVH TGGIHPPKAPRFTDIISIGIGGSALGPQFVAEALASDFPSLGIHFIDNSDPAGIDRIL NHLRNRLASTLVLVISKSGGTPEPRNGMIEVKKAYAGQNLDFANYAIAITMPGSKLDE QAKSESWLARFPMFDWVGGRTSELSAVGLLPAALQGVDILAMLEGAKQMDDATRIPDI KQNPAALLALSWYYAGNGRGEKDMVVLPYKDSLLLFSRYLQQLVMESLGKEKDLEGNV VHQGIAVYGNKGSTDQHAYVQQLREGVPNFFATFIEVLEDRNGPSSELEPGVTMGDYL SGFLQGTRQALYENQRDSITVTIPQVNSRTVGALIALYDRAVGLYASLVNINAYHQPG VEAGKKAAAAILELQKQVVKVLQTEKTALSIEEIAQKAGATDKIEAIYKIVRHLQANQ RGVVLQGDLGQPSSLKISVS" BASE COUNT 517 a 411 c 437 g 457 t ORIGIN 1 cttctcccct tctcctcatc tccctttgtt ttatgagaaa attctggcga ggtcatccct 61 atgccagatg tttagggtga agaatcaagc caacacttaa aactgtataa gatagacagg 121 gattgctagc accttggctt aaacttgttt ttgaaaatca ctttacctga gattccccta 181 tggacgctac ggcactttgg caacgatacc agaaatggtt atatttccat gaagaattgg 241 gactatacct cgatgtgagc cggatgcggt ttgatgacgc ctttgtagag aagttgcagc 301 cgaagtttga aaaagctttt gcagatatgg cggaacttga gaagggggcg atcgccaatc 361 cagatgaaga ccgcatggtt ggacactact ggctgcgaaa tcctgattta gccccaacgc 421 cagaactcac acaagaaatc gtagacagca tagagcaaat cgaagtattt gccgaaaaag 481 tccacacggg tggaattcat cccccaaaag cgcctcgctt cactgatatt atctctattg 541 gtattggtgg ttctgcctta ggtccccaat tcgttgcgga agcgttagcc tcagacttcc 601 cgtcgctggg aattcacttc attgacaatt ccgatccagc aggaattgac cgcattttga 661 atcatttgag aaaccgtctc gccagcactt tggtattggt gatctccaag tctgggggaa 721 cgccagaacc gcgtaatggt atgattgaag ttaaaaaagc ttacgctgga cagaatttgg 781 actttgctaa ttatgcgatc gccattacca tgccaggtag taagctggac gaacaagcaa 841 aatccgaaag ctggctcgca agatttccca tgtttgactg ggtgggagga cgcacctcag 901 aattgtctgc tgtggggcta ttaccagcgg cgttgcaagg cgttgatatt ctcgccatgc 961 tagagggtgc aaaacagatg gatgacgcca cccgcatccc tgatatcaaa cagaacccag 1021 
ccgcattgtt agctttgtcc tggtattatg ctggcaatgg acgaggcgaa aaagacatgg 1081 ttgtcctacc ctacaaagac agcttacttc ttttcagtcg gtatttgcaa cagctggtga 1141 tggaatcctt gggtaaggag aaagacttag aaggcaacgt cgttcaccaa ggtatcgccg 1201 tttatggcaa caaaggctca accgaccaac acgcttatgt tcagcagttg cgtgagggtg 1261 tgccaaattt ctttgccaca ttcattgaag ttttggaaga ccgcaacgga ccatcttcag 1321 aacttgaacc tggagttaca atgggcgact atctctctgg ttttttgcaa ggaactcgac 1381 aagcgcttta tgaaaaccag cgcgactcga ttactgtgac tattcctcaa gtgaattccc 1441 ggacagttgg tgcgctcatc gctttatatg atcgcgctgt cggcttatac gccagcttag 1501 tgaacatcaa cgcctaccat caaccaggcg tagaagctgg taaaaaagct gcggctgcaa 1561 ttctggagtt gcaaaaacaa gtcgtaaaag ttttgcaaac agaaaaaaca gccctctcta 1621 tagaggaaat tgctcaaaaa gcaggtgcaa cagacaaaat cgaggcaatt tataagattg 1681 tgcgtcatct acaggcaaat cagcgcggag tcgtgttgca aggagatcta ggacaaccca 1741 gcagcttgaa gatttctgtt agctaattaa aacagtcaat agtgaacagt gaacagtaaa 1801 cagtaaacaa ggggtggacg ag // LOCUS NODE_10379_length_1817_cov_3.7905791817 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1817) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1817) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
            Information about the Pipeline can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1817
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(<1..>1817)
                     /locus_tag="DP116_28165"
     CDS             complement(<1..>1817)
                     /locus_tag="DP116_28165"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016872394.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=3 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_28165" /translation="QAKVPGAAGTWKDLTDNVNELAATLTTQLRAIAEVATAVTKGDL TRSISVEAQGEVAMLKDYINQMIANLRETTQKNTEQDWLKTNLAKFTRMLQGQRDLET VSKLILSELAPLVGASHGVFFIMENADKIQYLKLLTSYAYRERKQLANRFYLGEGLVG QSALEKERILLTEVPQDYVKIGSGLGEATPLNVVVLPVLFEGQVTAVIELASFRRFNE IHLSFFDQLTESIAIVLNTIAASMRTEELLKQSQSLAHELQSQQNELRETNKRLEEQA NSLQASEDLLRKQQEELQETNAELEERSELLALQNKEVERKNEEIEQASLDLKEQAEQ LALSSKYKSEFLANMSHELRTPLNSLLILAKLLTDNVEKNLTAKQVEYSRTIYSSGND LLTLINDILDLAKIESGTMSIDIDQMLLTELQEHIERTFRQVAIDKALNFTINFAPEL PRSIYTDPKRLQQVLKNLLANAFKFTDRGEVSLQVFVTTQGWSHDQESLNRAQTVIAF AVSDTGIGIASDKQKIIFEAFQQADGTTSRRYGGTGLGLSVSREITRLLGGEIKLHSR LGEGSTFTLYLPQEGKQINSKLEVSPSLPHSVTPSSPRP" BASE COUNT 430 a 408 c 405 g 574 t ORIGIN 1 aggccgtgga gatgagggag taacggagtg agggagtgag ggagagactt ctaattttga 61 attgatttgt ttcccttcct ggggtaagta cagtgtaaaa gtactacctt ccccaagacg 121 actgtgaagt ttaatttctc caccaagaag gcgggtgatt tcgcgactca ctgataagcc 181 taaccccgta ccgccatatc tgcgacttgt ggtgccatca gcttgttgaa acgcctcgaa 241 aattatcttt tgcttatctg aagcaatgcc aatgcctgta tcgctgactg caaatgctat 301 cacagtttga gcacgattca aactttcctg gtcatgactc cagccttgtg ttgtcacaaa 361 cacctgcaag ctgacttctc cacgatctgt aaatttaaaa gcattggcga ggagattttt 421 caagacttgt tgtaagcgct ttggatcggt gtagatgctt cttggcagtt cgggagcgaa 481 atttatcgtg aaattgagtg ctttatcaat cgcgacttgt cgaaaagttc gctcaatgtg 541 ttcttgtaat tctgtaagca gcatctggtc tatgtcaatt gacatcgttc cagattcgat 601 ttttgctaga tccaaaatgt cattgatcag cgtcaaaagg tcgtttcctg acgagtaaat 661 cgtacggcta tactcaactt gttttgcagt caggtttttc tcaacgttat ctgttagcaa 721 tttcgccaaa atcaataagc tatttagcgg tgtccgcaat tcatgggaca tattcgccaa 781 aaactcagac ttgtattttg acgagagtgc tagttgttcg gcttgctctt ttaaatctag 841 gcttgcttgt tcaatttctt catttttacg ctcaacttcc ttgttttgca aagctaataa 901 ttccgatctt tcttctagtt ccgcgttcgt ctcttgtaat tcttcttgct gcttcctcaa 961 taaatcctcg gaggcttgca gggaatttgc ctgttcttct agacgcttgt tcgtttccct 1021 
gagttcgttt tgctgacttt gtaactcgtg tgctaaagac tgcgactgtt tgagtaactc 1081 ttcagtccgc atactggcgg cgatggtgtt taatacaatc gctatacttt cagtcagttg 1141 gtcgaagaat gacagatgaa tttcgttaaa acgtcggaat gaggctagtt caatcactgc 1201 tgtgacttgt ccttcaaata ggacaggtaa gacaacaaca ttgagcggag ttgcttcgcc 1261 taagccagaa ccaattttga cataatcttg cggtacctcc gtcagcagaa tacgttcttt 1321 ttctaaagct gattgtccga ccaaaccttc acccaagtag aagcggttcg caagttgctt 1381 gcgctcacgg taagcatagc tagtgagtaa ttttaaatac tgtattttat ccgcgttttc 1441 catgataaag aacacgccat gcgaagcccc cacaagcggt gctaattcgg agagtattag 1501 tttagacacc gtttctaagt cacgctgacc ttgaagcatg cgggtgaact tggcgaggtt 1561 cgttttcaac cagtcttgtt cagtgttttt ctgcgttgtc tcgcgcaggt tggcaatcat 1621 ttggttgatg tagtctttaa gcatggcgac ttcaccttgc gcctcgacgg aaattgaccg 1681 cgttaagtcg cctttggtga cagcagttgc gacttcggcg atcgcccgca actgagtcgt 1741 cagggtagca gccagttcat tcacattatc tgtcaaatct ttccaagttc ccgcagcccc 1801 cggcactttc gcttgtc // LOCUS NODE_10675_length_1736_cov_6.0178471736 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1736) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1736) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). 
            Information about the Pipeline can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: SPAdes v. 3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1736
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            complement(36..1618)
                     /locus_tag="DP116_28170"
                     /pseudo
     CDS             complement(36..1618)
                     /locus_tag="DP116_28170"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006196382.1"
                     /note="frameshifted; Derived by automated computational
                     analysis using gene prediction method: Protein Homology."
/pseudo /codon_start=1 /transl_table=11 /product="RNA-dependent DNA polymerase" BASE COUNT 341 a 389 c 398 g 608 t ORIGIN 1 cgtgcccgtt tccgtgcata cggctcccga tgttcttacc ttgcggtttt gctcatgtgg 61 atgtagtcat gacagctttc gtgaaccgct agaaggtttt tatgcttcca attgttgtgg 121 ttgccgtctt tgtggtgtaa atgtactcgt tcatcgctaa gtatctttaa gccacagtat 181 ccacatgcat ggttttgccg ttttagggca atagaacggt ttccccgtcg tagagcttgc 241 tattacgttc gctccagtat gggatatctc cgtcgaaagg tgatttattt cctctgacgt 301 tgacgtgttt gttttcggag tagggtacgt ctggaaatgc cttgtctagt aatttcttac 361 tggagtatcg gtcattcttt gcttccttgt tgaacacctt gtatgctctt gtttcgatgt 421 ggtatagcga gttacgcgac ccatccatct tacagtggcg gtggtagttt ctccatcctc 481 taaccaaggg agctaatctc ttagcttttt cctcggaacc ataatttgag cagttgatga 541 tggcttttat tttctgacga aatgctttga agttatccac tgagggcgtg cttctaaatt 601 ttccgttgtt ctggacttta aagttccagc cgaggaagtc aaacccatct gtcgttgcgg 661 taattttggt tttctttcca ctgatattca ttcctctgtc ggcgagaaac tggcttatct 721 ttccaagtac ttcatttgcg tcgtcttcag gtcgaagtat tattaccatg tcatccgcgt 781 atcggattga tggttcggtg atatctcccg gcggtgtatt gggtgtaatt ttacaacctc 841 ttttacagca ccgatgatat ctgtggatac tttcgattcc gttaagcgct atgttggcta 901 aaagtggact tattactcct ccttgtgggg ttccctgttc tgggaactct ggattgaccc 961 ctgccttaag gcagcggaaa atacctctgt tcataccctg tggggcgatg aggttatcca 1021 ttatggcgga gtggtttatc ctatcgaagc atttttcaat gtcgagttct atgactcgtt 1081 tatttattcc gttacaggag gagcgtagat tgttgaatag aattctctgt gcatcgtgcg 1141 cagagcgtcc tgctctaaat ccatagcttc tttcgtggaa agttgcctcg tgtgctggtt 1201 ctaaggcgta ttttgctagg cactgccatg ctctatccgc tatagttggg actttaagaa 1261 ttctggtttt cccgtccttc ttgggtattg gtatctgtct taatttgcta tgatgccaat 1321 taaggtaatc tttcttgagg agttcctcaa gtgcgaagcg ttcttcatga ttgagggacg 1381 ctttgccgtc gatacccgca gtctttttac cagcatttag ctgagttacc tgacgaattg 1441 ccagtaatct tgctgctcgg gatttcagta tcagtttctg gagtgaccgc gctttccgca 1501 tgtctcctgc ttgaactgct ttaaataacc gcacttgtag acggaataat tcacggcgaa 1561 atttcttcca ctttaaattc ttccaagatt cactaggttt 
ttcactgtgt ccaatcatgt 1621 gctaactcct gtggttattc tctgaacacc tcgcgaaaat tacttcgcgt cctacccgag 1681 gtgtggggtt tccgtcgctc gtctgaccta ctgagggttc gacctttcct cagacc // LOCUS NODE_10765_length_1710_cov_4.0447131710 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1710) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1710) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1710
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            <1..>1710
                     /locus_tag="DP116_28175"
     CDS             <1..>1710
                     /locus_tag="DP116_28175"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012411915.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28175" /translation="DAFPVDWAGTQNNLANAYSNRIRGEQAENLEVAIACYQDALKVF TFDVFPYEWARTQNNLGLAYHDRIRGERAQNLERALACYQQALKVRTFDAFPQKYAET LFNIGTVYQEAKQFDLAYTIFESAIATVESLREEIVSGQESKRKQAEEWNKLYRRMVE VCLELDKITEAIEYVERSKTRNLVEQILIRDQKTIFPPEVVTQLETYRDEIATGQAQI QNGKAENPKVLAQKLQKLRQQRNELQNRYLPIGYGFKLDSFQGSLDEHTAIIEWYILN DKILAFIATKQKEVTVWQSQTEDQQALYDWVNQYLQNYDEQKDQWRNNLGEEIKKLAS ILHIDEILTQIPKHCDKLILIPHRFLHLFPLHAIPINQNSENSSCLLDLFAGGVSYAP SCQLLQQVQQRKRPDFQFLFAIQNPTEDLNYTDLEVQVIQSYFNTANILKKTAATLTA INNTDLNTYHCTHFSCHGQFNLTNPDKSALILANALVADAPTKPDSERYLNLRLGETH DLDKCLTLEKIFSLKLEKCRLVTLSACETGLIDYSNTSDEYIGLPSGFLLAGSPSVVS SLWT" BASE COUNT 575 a 362 c 316 g 457 t ORIGIN 1 tgacgctttt cctgttgatt gggcaggtac gcaaaataat ctcgctaatg cttacagtaa 61 cagaattaga ggggagcagg ctgagaattt agaggtagcg atcgcttgtt accaagatgc 121 tctaaaagtt tttacttttg acgtttttcc ctatgaatgg gcaaggacgc aaaataatct 181 tggtcttgct tatcatgaca gaataagagg ggagcgggca cagaatttag agcgggcgct 241 cgcttgttac caacaagctc taaaagtcag aacttttgac gcttttcccc aaaaatatgc 301 agaaacttta tttaatattg ggacagttta ccaagaagca aaacaatttg atttagctta 361 cactatcttt gaatccgcta tagccacagt agaatctttg cgggaggaaa tagtttctgg 421 gcaggaaagc aaacgcaaac aagcggaaga atggaacaaa ctttatcgcc gcatggtaga 481 agtttgccta gaattagata aaatcacgga agcaattgaa tatgttgaac gtagtaaaac 541 tcgcaattta gttgaacaaa tactcatccg tgaccagaaa accatctttc ctccagaagt 601 agtcactcaa ttagaaacat acagggatga aatagctaca ggacaagctc aaatccaaaa 661 tggcaaagct gaaaatccca aagtccttgc acaaaaactt caaaagttac gacagcagcg 721 aaatgaattg caaaaccgct acttacccat tggttatggt ttcaaacttg actcattcca 781 aggttctttg gatgagcata cagccattat cgagtggtat attctcaacg ataaaattct 841 ggcgtttatt gccacaaaac aaaaagaggt aactgtttgg caatcccaaa cagaagacca 901 acaagctttg tatgattggg taaatcaata tttgcagaat tacgacgagc aaaaagacca 961 atggcgaaat aacttagggg aagaaatcaa aaaattagct tcgattctgc acattgatga 1021 aatattaacc cagataccaa agcactgcga taaactcatc ctaattcccc atcgattcct 
1081 gcatttattc ccccttcatg caatcccaat aaatcaaaat tcggaaaatt cgtcttgtct 1141 tctagattta tttgctggtg gtgtcagcta tgcacccagt tgtcaactac tgcaacaggt 1201 acaacagcgc aaacgtcctg attttcaatt tctatttgca attcaaaatc ccacagaaga 1261 cctgaattac accgatttag aagtacaagt tattcaaagc tactttaata ctgccaacat 1321 cctgaaaaaa acagcagcga cactcacagc tattaataat actgacttaa acacttacca 1381 ctgcacccac tttagctgtc acgggcaatt taacttaaca aatcccgata aatcagccct 1441 aattttggca aatgcactcg ttgctgacgc accgacaaaa cccgactctg aacgttatct 1501 aaatttgcga ttgggtgaaa cccatgattt agacaaatgc cttaccttag aaaaaatctt 1561 cagtctcaaa ttagaaaaat gtcgcctcgt aactctttca gcgtgcgaaa ctggattgat 1621 tgattatagc aataccagtg acgaatacat tggtttacct agcggttttc tgttagcagg 1681 tagccccagt gtcgttagtt ctctctggac // LOCUS NODE_10862_length_1688_cov_5.5854261688 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1688) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1688) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1
            Genome Representation :: Full
            Expected Final Version :: No
            Genome Coverage :: 17x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 06/25/2018 16:41:27
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 5,693
            CDS (total) :: 5,639
            Genes (coding) :: 5,207
            CDS (coding) :: 5,207
            Genes (RNA) :: 54
            rRNAs :: 1, 2, 2 (5S, 16S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 2, 2 (16S, 23S)
            tRNAs :: 45
            ncRNAs :: 4
            Pseudo Genes (total) :: 432
            Pseudo Genes (ambiguous residues) :: 58 of 432
            Pseudo Genes (frameshifted) :: 142 of 432
            Pseudo Genes (incomplete) :: 255 of 432
            Pseudo Genes (internal stop) :: 71 of 432
            Pseudo Genes (multiple problems) :: 88 of 432
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1688
                     /organism="Brasilonema bromeliae SPC951"
                     /mol_type="genomic DNA"
                     /strain="SPC951"
                     /isolation_source="leaves"
                     /host="bromeliads"
                     /type_material="type strain of Brasilonema bromeliae"
                     /db_xref="taxon:385972"
                     /country="Brazil: Sao Paulo, Sao Paulo"
                     /collection_date="2004"
     gene            76..755
                     /locus_tag="DP116_28180"
                     /pseudo
     CDS             76..755
                     /locus_tag="DP116_28180"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_002758294.1"
                     /note="frameshifted; too many ambiguous residues; Derived
                     by automated computational analysis using gene prediction
                     method: Protein Homology."
                     /pseudo
                     /codon_start=1
                     /transl_table=11
                     /product="WD40 repeat domain-containing protein"
     assembly_gap    140..149
                     /estimated_length=10
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            825..1088
                     /locus_tag="DP116_28185"
     CDS             825..1088
                     /locus_tag="DP116_28185"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016863086.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PRJNA477356:DP116_28185"
                     /translation="MKPSNLRTKLLTEINLIPEEKLEELYNFIHYFRVGVEASQGTSE
                     QIMQFAGCWDDMSDEIFSDFNEEINTRRQQAFLGRRNDETSLG"
     gene            1069..1476
                     /locus_tag="DP116_28190"
     CDS             1069..1476
                     /locus_tag="DP116_28190"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016863085.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="type II toxin-antitoxin system VapC family toxin"
                     /protein_id="PRJNA477356:DP116_28190"
                     /translation="MKLALVDTNILSLFFRNQPLVVENFNTYIKEYGKINISIITYYE
                     IVSGLKHRDAQKQLTSFLEFASHNIILPLTTDSTTISGDIYASLRKKGTPVDDIDILI
                     AGIAIANDLILVTNNRRDFEKIEGLEIEDWTQA"
     gene            complement(1537..>1688)
                     /locus_tag="DP116_28195"
     CDS             complement(1537..>1688)
                     /locus_tag="DP116_28195"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016874181.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=3 /transl_table=11 /product="UPF0175 family protein" /protein_id="PRJNA477356:DP116_28195" /translation="LPLGYASKLAEMDKIQFQQLLQKRKIPLFSYEIEDFELDLKNLR DLGRL" BASE COUNT 563 a 323 c 322 g 470 t 10 others ORIGIN 1 ctccttcttt tcgtgtcctc aataacgttt tcagccttaa ctgaaccgta ttgggatata 61 aacagtggca aggaaattaa aacccttcaa gggcatacaa acactgttga tagcgtcagt 121 tttagcccgg atggtaaaan nnnnnnnnnc ttcaagggca tacaaacact gttgatagcg 181 tcagttttag cccggatggt aaaactcttg cttccgggtc tgctgacaag acagcgaaac 241 tgtgggatat aaacagtggc aaggaaatta aaacccttca agggcataca aacactgttg 301 atagcgtcag ttttagcccg gatggtaaaa tccttgctac cgcatcagtg gacaagacag 361 cgaaactgtg ggatgtcaaa agtggtaagg aaattaaaac ccttaaaggt catacagact 421 tgtttacaag cctcagcttt agtcccgaca gcaaaaccct tgcttccgca tcacatgaca 481 atacggtgaa attgtggaat gttaacactg gtagggagat tcaaactctc aaagaagaca 541 aaggtaattt taacagctcc tatattcaaa gcctcagctt tagtccggat ggtaaaaccc 601 ttgcttctgc gtccagtgat aatactataa aactgtggaa ttttgattta gataatttgc 661 tggtgcgggg gtgtggcttg atacactact acctgcaaaa taaccgcgat gtcagcccag 721 aagatcaacg cttgtgtgac gatatcaagc gttgagggtc aatagcgatg ttaattcaat 781 taaaatgaaa taaacttgat acaccacacc ctatacagta caacatgaaa ccatcaaatt 841 tacgtaccaa acttttgaca gaaatcaatc ttattcccga agaaaagctg gaagaattat 901 ataattttat tcattatttt cgagtcggtg tagaagcatc tcaaggtaca tccgaacaaa 961 ttatgcagtt tgcaggttgt tgggatgata tgtcagatga aatattttcc gacttcaatg 1021 aagaaataaa cacacgtcgc cagcaggctt ttttggggag aagaaacgat gaaactagcc 1081 ttggttgata ccaacatttt atccttgttt ttccgcaatc aacccttggt agttgaaaac 1141 tttaatacat atattaaaga atatggcaaa atcaacatca gcatcatcac ctattacgaa 1201 atagttagcg gattgaagca tcgtgacgca cagaaacaat taacatcttt tttagaattt 1261 gcttcacata acataatttt gcctctaact acagattcaa caacaatttc tggtgatatt 1321 tatgccagtt taagaaaaaa gggaacacca gtagatgata tagatattct cattgctgga 1381 attgcgatcg ctaacgattt gattcttgta actaataatc ggagagattt cgagaaaatc 1441 gaaggcttag aaattgaaga ctggactcag gcataagtaa atctaattga ccaacagcag 1501 caaggctagt 
gattagtgaa gtatcactaa taacaatcac aacctaccca aatcccgtaa 1561 atttttcaaa tcgagttcaa aatcttctat ttcataagaa aacagaggaa ttttgcgttt 1621 ttgaagcaat tgctgaaact gtattttatc catttctgcc aacttgctcg catatccaag 1681 gggaagtt // LOCUS NODE_10972_length_1665_cov_2.1807451665 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1665) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1665) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1665 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..271 /locus_tag="DP116_28200" CDS <1..271 /locus_tag="DP116_28200" /EC_number="6.1.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019371038.1" /note="catalyzes a two-step reaction, first charging a serine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="serine--tRNA ligase" /protein_id="PRJNA477356:DP116_28200" /translation="LEVWLPGQNRYREISSCSNCGDFQARRMGARYRVTGEKGGKYVH TLNGSGVAVGRALVAVMENYQQEGGMVAVPEALIPYMGGATRLVP" gene 271..>857 /locus_tag="DP116_28205" CDS 271..>857 /locus_tag="DP116_28205" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007714516.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="5'/3'-nucleotidase SurE" /protein_id="PRJNA477356:DP116_28205" /translation="MRILLTNDDGIYAPGLKVLEAIARQLSDDIWVVAPQEEQSGAGH SLTLSRPVRVRSHDDRRFSVSGTPTDAVMMALGVCMDGLKPDLILSGVNRGANLAEDV TYSGTVSAAMEGTLAGYTSIALSQVYAREGMGDTVPFAAAETWGARVLRPLINAPMPP RTAVAVVRKLALGLVAAHAQKVIHRDLKPANVLYD" assembly_gap 858..867 /estimated_length=10 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <868..1169 /locus_tag="DP116_28210" CDS <868..1169 /locus_tag="DP116_28210" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008833525.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="5'/3'-nucleotidase SurE" /protein_id="PRJNA477356:DP116_28210" /translation="MPPRTLINVNFPALAPENVRGVKVVEQGFHDYGRSKIVKGTDPR GYPYYWFGLGGSEQTPGHATDLEAIVEGYVTVTPLHLDLTHYASMSVLDEALRAV" gene complement(1273..>1665) /locus_tag="DP116_28215" CDS complement(1273..>1665) /locus_tag="DP116_28215" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010217087.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="PRJNA477356:DP116_28215" /translation="VTQEVAAAKAVPELADFATRLEKALGDLQASTMWLMQNGLTNPD NAGAASVPYMHLMGIVAVGLMWLRMATAAQKLLAAGEGDAAFLNAKLMTARFYAERIM PDTGALRRKLEGGAESLMALPVEAFDVA" BASE COUNT 280 a 543 c 549 g 283 t 10 others ORIGIN 1 cctcgaagtc tggctgccgg ggcagaaccg gtatcgcgag atttcgtcct gttcgaactg 61 cggcgatttc caggcacggc gcatgggtgc gcggtatcgt gtcaccggcg aaaagggcgg 121 caagtacgtc cacacgctta acggatcggg ggtcgcggtc ggtcgcgcgt tggtggcggt 181 catggagaat taccagcagg agggcggcat ggtcgcggtt ccggaagccc tgatcccgta 241 tatgggcggt gcgacgaggc ttgttccctg atgcgcatcc tgttgaccaa cgacgacggc 301 atctacgcgc ccggcctgaa agtgctggag gcgatcgcgc ggcaactgtc cgacgatatc 361 tgggtcgtag cgccacaaga ggagcaatcc ggtgcgggcc attcgctgac gctcagccgc 421 ccggtgcggg tccgtagcca tgacgaccgg cgcttctcgg tcagcggcac gccgaccgac 481 gccgtgatga tggcgctcgg cgtgtgcatg gacgggctga agccggacct gatcctgtcg 541 ggcgtgaacc gtggtgccaa tctcgccgaa gacgtgacct attcgggcac cgtctctgcc 601 gcgatggagg gaacgctcgc gggctacacc tcgattgcgc tcagccaggt ctatgcgcgc 661 gaaggcatgg gcgacaccgt gccgttcgcc gccgccgaaa cctggggcgc gcgcgtgctg 721 cgcccgctga tcaacgcgcc gatgccgccg cggaccgcgg tcgcggtggt gcgcaagctg 781 gcgctggggc tggtggcggc gcacgcgcag aaggtgatcc accgcgacct caagccggcc 841 aacgtgctgt acgacgannn nnnnnnncga tgccgccgcg gaccctgatc aacgtcaact 901 tcccggccct cgcgccagaa aacgtgcgcg gggtgaaggt cgtcgaacag ggtttccacg 961 attacggtcg ctccaagatc gtgaagggta cggacccgcg cggttatccc tattactggt 1021 tcgggcttgg cggcagcgag cagacgccgg gccatgcgac cgacctcgag gcgatcgtcg 1081 aaggctatgt gacggtgacg ccgttgcacc tggatctgac gcactatgca tcgatgtccg 1141 tgctcgatga ggcgttgcgg gcggtttaac ccattatccg cagcccctga gctcgtcgaa 1201 gggccgtcgc cttatgcgca atgccccgcg cggaccaggc ttcgacaagc tcagccctgc 1261 ggacaaggac ggttaggcaa catcaaacgc ctccactggc agcgccatca ggctctccgc 1321 gccgccttcg agcttgcggc gaagcgcgcc cgtgtcaggc atgatccgct cggcatagaa 1381 ccgtgccgtc atcagcttgg cgttgaggaa tgcggcatcg ccctctcccg ccgccagcag 1441 
cttctgcgcc gccgtcgcca tgcgcagcca catcaggcca accgcgacga tgcccatcaa 1501 atgcatgtac ggcaccgacg ccgcaccggc attgtcggga ttggtcaggc cgttctgcat 1561 cagccacatg gtcgacgcct gaagatcgcc cagcgccttt tcgagccgcg tggcgaagtc 1621 cgccagttcc ggcacagcct tcgccgccgc gacctcctgt gtcac // LOCUS NODE_11030_length_1653_cov_3.1376721653 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1653) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1653) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1653 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..131 /locus_tag="DP116_28220" CDS <1..131 /locus_tag="DP116_28220" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=3 /transl_table=11 /product="copper oxidase" /protein_id="PRJNA477356:DP116_28220" /translation="VRALGKGGHNLAAEVERIAATSGEVPAPNASTESDMSSMEGM" gene 134..1471 /locus_tag="DP116_28225" CDS 134..1471 /locus_tag="DP116_28225" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015162943.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="copper oxidase" /protein_id="PRJNA477356:DP116_28225" /translation="MKTHRNMNMNSSTADSHADMGMKPKATRAQLLAVTLLTLLALAF GVLIAALYGNFTMSARNMQHGSTPGMNISNQSTPGMNMDNNNESMPGMNMGGTKSPAP VSSLPPAPISSVTQTNGLVMPPGMIMTSDMSMEAMEDMAAVDLTKITYTASVDARGDQ VLQPKLENGVKVFNLDVSLIKWNILPNVQVAAYAFNRQVPGPRIQVTEGDRVRMVVKN NLPEPTTVHWHGMILPNNMDGPADVTQKPIQPGASYTYEFTVKQPGTYFYHSHKDIDR QQTLGMYGAFIVDPKNKPQTLAYNQDVVVQLQEWTVKQGYTFPAMPMEGLLPNFFTIN GKAYPATTTINAKVGEKIRFRFIGSNNAFIHPMHIHGGPFKIIETDGSPVPVAAQIEK DTINVAPGERYDVIWTAREKGKWLLHCHIAHHATNNNVEVQGAGGLTMIINVT" BASE COUNT 488 a 367 c 383 g 415 t ORIGIN 1 ctgtacgggc attgggtaaa ggtggtcata atttagctgc cgaagtagaa cgaattgcag 61 caacttctgg tgaagtgcct gcacccaatg cttccactga gtctgatatg tccagcatgg 121 aaggaatgta agaatgaaaa cccatagaaa tatgaacatg aattcgtcta ctgccgattc 181 gcacgcggat atgggaatga aacccaaagc tacgcgagcg caacttctgg cagtaacgct 241 gttaacacta ttagccctag cattcggtgt tcttatcgcc gcactctatg gcaactttac 301 catgagtgct aggaatatgc agcatgggtc aacgccagga atgaatataa gcaaccagtc 361 aacgcctggg atgaatatgg ataacaacaa cgagtcaatg ccaggaatga atatgggtgg 421 cacgaaatct ccggctccag tgtcatcgct accacctgct cctataagca gtgtcactca 481 gacgaatggt ttagttatgc cgccagggat gattatgacc tctgatatga gcatggaagc 541 aatggaagat atggctgcag tagacttgac aaaaattacc tatactgctt ctgttgatgc 601 acgtggggat caggttctcc agcctaagct agagaatggt gtgaaggttt ttaacctcga 661 tgtttcttta attaagtgga atattttacc aaatgtccag gtagctgctt atgccttcaa 721 ccgtcaagtt ccaggaccgc gtattcaggt tactgaaggc gatcgcgtgc gaatggtcgt 781 caaaaacaac ttaccagaac caacgacagt tcattggcat ggcatgattt tacctaacaa 841 catggatggt ccagccgatg ttacccaaaa accaattcaa ccgggtgcaa gctataccta 901 cgagttcact gtcaagcaac caggtactta cttctaccac tcccacaaag atatagaccg 961 ccagcaaact ctagggatgt atggtgcgtt catcgttgat cctaagaata aacctcaaac 1021 tcttgcttac aaccaagatg ttgtggttca gcttcaggag tggacagtaa agcaaggtta 1081 caccttccca gcaatgccaa tggaggggtt actaccgaac ttttttacaa taaacggtaa 1141 agcttaccct gctactacaa ctatcaatgc caaggttggt gagaaaatcc 
gcttccgctt 1201 tatcggttcc aataatgcct ttatccaccc aatgcacatt catggtggtc cgtttaagat 1261 catcgaaaca gatggaagcc ctgtacctgt tgctgcccaa attgaaaaag atactatcaa 1321 cgtggctcct ggtgaacgtt acgatgtgat ttggacagcc cgtgagaaag gtaagtggtt 1381 gctgcattgt cacattgccc atcatgcaac aaacaacaac gttgaggtac aaggcgcagg 1441 tggtttaaca atgattatca acgttacctg atttgagaaa actgacggta agtaactaaa 1501 ccaaacacta ctgtattatg ggtatggttc acgagcgcat ttgtctaatt ttggtgattt 1561 tagcgttgta gggtaaacac ggtagtaccg caaggcggaa gtcaaaagtc aaaagtcaaa 1621 agtgagccag tgcggtggac gggttccccg gca // LOCUS NODE_11151_length_1627_cov_4.6030531627 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1627) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1627) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1627 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 166..732 /locus_tag="DP116_28230" CDS 166..732 /locus_tag="DP116_28230" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015130788.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="fasciclin domain-containing protein" /protein_id="PRJNA477356:DP116_28230" /translation="MKNRQNQHLFGKTLGIALATASLVISTPTFAQSPSAAPTKPAMS GSQTTGAGTVVDVASSNPSFKTLVKAVKAAGLVETLSGSGPFTVFAPTDAAFNKLPKA TLQKLLKPENKETLTKILTYHVVSGAVDSKSLKSGAVNTVEGSPVDVKVGKGVTVGKA KVTKPDIKASNGVIHAIDTVLLPPDVKL" gene complement(1042..1326) /locus_tag="DP116_28235" CDS complement(1042..1326) /locus_tag="DP116_28235" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019503292.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28235" /translation="MENTLFDKIFRDSDGQIVVAQPPNPPIIVWAVASLLKLIFTSGE INTVLDAVAFGSLFTWAWEELFQGVNYFRRALGLIVLIGVVGSKVYPIRF" BASE COUNT 482 a 362 c 349 g 434 t ORIGIN 1 ctacacccct acacccctag ttttggtaaa tccgtaccaa gttgatttcg agatataact 61 acaggagtag caactgtagt atgacaaaat gttacacttt ataaagatca aaggaatttc 121 gtaacaaaat attcacatct ttaacattac gcaaagttaa gaaccatgaa gaaccgacaa 181 aatcaacacc tttttggcaa aacacttggt atcgcgctcg cgacagccag tcttgtgatc 241 agcactccaa ctttcgccca gtctccctca gctgcaccga ccaaaccagc gatgtcaggg 301 tctcaaacaa ctggtgcagg tacagttgtc gatgttgcta gctcgaatcc ttctttcaaa 361 accctggtaa aggctgtgaa agcagcaggt cttgtagaga ccttatctgg tagtggccca 421 tttactgtgt ttgcacccac ggatgctgca tttaataagc taccaaaggc aactttgcaa 481 aaactactca agccagaaaa caaggaaacc ttgaccaaga ttttgaccta tcatgtcgta 541 tctggtgcag ttgattccaa aagccttaag tcaggtgcgg ttaataccgt tgaaggtagc 601 ccagttgacg ttaaagttgg caagggtgta acggtgggta aggcaaaagt caccaaacca 661 gatatcaaag ctagtaatgg cgtgattcat gcgatcgata cagtgcttct tcctcctgat 721 gtgaagctgt agtgtttccc taacaatata gagagtcata aatctcggct tggagttcag 781 cctttccccc gtgttcccaa aaaaagatag gctagaggcg tagatacagg ttgtattgcg 841 tacgaagtgc gttgtaacct acgccatatt taccttaatt gttctgacgt tccttgagtt 901 gcagtaatat ttgcgcgtgc atttcgcggg taattgggta aaaatacgtt aagactaagc 961 cactaactaa tgcgatagct 
tcgcgctctg catagcggca gatcgtgttt acaccaatct 1021 cgaacaaaaa ttcgttgttt ttcaaaacct tatgggataa acctttgatc caacaacccc 1081 aatcagcacg ataagaccta gggctctgcg aaaataatta acgccttgaa ataattcctc 1141 ccatgcccac gtaaacaagg agccaaaagc cacagcgtct aacactgtat tgatttcgcc 1201 gcttgtaaaa attagtttga gtaaactagc tacagcccag acaataattg gtgggtttgg 1261 tggctgagcc acaacaatct gaccgtcgct gtcgcggaag attttatcga ataatgtatt 1321 ttccattgtt tggatgcccg cgattgctgc gcggtttatg ggatctataa ccaaatctac 1381 atcataaact catgtggagt cagaagaatt taacgtgact tgggaagata tttgcaacgc 1441 gcacgggtgc gaggtcttag cactggatag gtttgtgcta atggcgacgt gaggtgaagt 1501 tgagcgagat taattttcct gacccagttg tctaaaaaac agtaacaagc aagcttagat 1561 tcaatccaga actattgaca ctcccctgcc taaaggcgag gggattctac attcatcgtc 1621 agaactt // LOCUS NODE_11329_length_1586_cov_5.1939911586 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1586) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1586) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1586 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(36..200) /locus_tag="DP116_28240" /pseudo CDS complement(36..200) /locus_tag="DP116_28240" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006616249.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="transposase" gene complement(286..1505) /locus_tag="DP116_28245" /pseudo CDS complement(286..1505) /locus_tag="DP116_28245" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010999795.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=3 /transl_table=11 /product="RNA-dependent DNA polymerase" BASE COUNT 353 a 339 c 340 g 554 t ORIGIN 1 tattttaact cctattcatt ttaccagttt tacgcttttg aacgcggatt ggtattagag 61 caagtcttgc tggacggaaa ccatcggtca accacaagca atcttgaacc atacaactca 121 cacttgtagg tcaactggcg acgaaactca aaaaagctca tatcagcaat tgccttagcc 181 aatttgtgat tagccatcag ttagagtcga gaggggatat tattccccgt ccctctctgt 241 tagatccgtg cgtgccactt tcgttgcaca cggctcccga tattcttaag ggataaccct 301 tttactcatg tggatataat cgtgacagga ttcatgtatt gctgttagat tcttactctt 361 ccagttgtca tggtttccgt ctatgtggtg taggtttact cgttcttcac ttataaattt 421 taagccacaa tgaccacatg tatgattctg gcgttttagt gccttagagg tttctccgtt 481 atataattta ctgttgcgtt cgctccagta gggcatgtct ccatcgaaag gtgatttttt 541 tccttcaaca ttgacatgtt tgttttcgga gtaaggaact tcaggaaacg ctctttttat 601 caactttgta gttgattctt tgtcgtttct tttttcctta ttaaacacct tccacgttct 661 tttgttggtg tgccaaagcg agagccttga cccatccatt ttacagaagc ggtggtagtt 721 cctccatcct ctaattattg gcgctagctt ttcagctttt acattagaac cataattcga 781 gttgttgaca atagctttta ccttctgacg gaattttttg aagttttcca ccgagggcgt 841 acttctaaat ctcccgttac tttgcacttt gaagtgccag ccaaggaagt caaaaccatc 901 tgtcgcagcg gttactcttg tcttcttttc gctaagcttc attcctttgt tggctaggaa 961 tttgtcgatt ttgtcaagta tctctgttgc atcgtcttct ggtcggagta agataaccat 1021 atcgtccgcg tagcggattg atggttctac gatatttttt tccggggtat tttcagaaat 1081 tttgaacccg tttttatgat atcggtggat actttcgatt ccgttgagag cgatgttggc 1141 taatagcgga cttaccacgc caccttgcgg tgtaccttgc tcgggaaact ctgggtttgt 1201 tccgcttttg aggcagcgga atatgccaga tttaatgcct ttaggggcaa taaggttttc 1261 catgattgtg gtgtgactta tcctgtcaaa 
gcatttttca atatcgagtt caatcactcg 1321 tttattgatt ccattacagc atgagcgtag atttataaac aggattttct gggcatcatg 1381 tgccgaacgt cctggtctga acccgtagct tctttcgtgg aatgttgcaa cgcgcaagct 1441 ggttctagtg cgaactttac gaggcattgc caagctcggt ctgccatagt tggaattttt 1501 agcattctgg tactgccgtc ctttttgggg atggctagtg cggtacgttg cttataaact 1561 gtaccccata agggttacag gggatt // LOCUS NODE_11455_length_1558_cov_2.5861611558 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1558) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1558) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1558 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(1..1266) /locus_tag="DP116_28250" /pseudo CDS complement(1..1266) /locus_tag="DP116_28250" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877661.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 331 a 301 c 320 g 606 t ORIGIN 1 gtcaggacca ccatgcttga tcatgtttgt ccattcctcg gcgatatagg ggtgaacttt 61 aacggggata tcgcctgctg cttgtattgg ttgttctttg ggtgtgggtg cgggtgcggc 121 ggtaggctgt tcgttaccga agatttgttc taggttaagc ttttgggcgt tttgggcaat 181 aaacttcatt tggttgatat cgtagtcaga gatattcatt ttactgactt ctttggtaat 241 gccagcttcg gttttagtca gatcaaaccg gacaagttcc tgagtacctc tttcctcggt 301 ggttttaaat agttgcaggc tggctttttt tccttgagga tcacgggtgc gtttaatggt 361 aaattctcct acggcgagtt catccttccc tgctgtcagg agtatgttgt tgagagtatc 421 cagcatctcg ttttgtttga atacctcaag agttttgata gtaccagcag gcgcaagaga 481 accaagtgta tttgccacat cccggatatc tgaagaattc aagtcaggta gtttaccctt 541 atcctggagg tgttcgttta ccatcaaaaa ttcctgacgt tctaccccta acatttcttt 601 aggctgtttg gtgattttgg gttctttgtt tgccttaccg ggtttgaact ccatcaagga 661 gttgttccaa ccctgaagtt cgtcactgcg gcggtggatg ctgatggtat ccccttcttg 721 ccgaattaca aaagcatcac ttctgtaaat acgagaacca tcttgctcga tggttttata 781 ttttttcaac atttcggttg cagcttgggc aatatctttg ttttcgccgt tgtaacgttc 841 tttttctgct cgtttgtttt ttatctcgta tactggtact tctacttgac gtgcccactg 901 ttgcgcttgc ggttctaccc cctcggagtt ggctgcttgt tggtaagtat attgaccagt 961 atattgttcc tcttcctcca aattttctct tttcggagtt ctgcttgggc tttcttcttc 1021 aacaggtttc ttattattag taattttttc ttcggaagta taattttcct gattttctac 1081 ctcttcctga aggttgttga tgacagattc ttcaaacttc tcttcggctt tgtctgatat 1141 gattaatact atggagttag tattttcaat atttgaaggt tggttatcta tctgattact 1201 tggttttgat agaaatgttg caccagtatt atctattaat tctgtttcct cattattctg 1261 aatgaccaca ataattttct catcttcatc tggtgcatca tcaaaatctg gaggattgct 1321 aatatctata tcttctagaa acacaactgt ctcttctgac ggcggtgcat cttctattgg 1381 agtatattct aatgattgaa tatctttggc atcagtagtc attaaatatg ttgcatcttc 1441 ttccattttg ttattattta acacttgcga tgacatttta tcatttgcta ctatggatag 1501 atgttttagt tggtttatta gaattttaaa aagctgatta tttacgttcc gttcatac // LOCUS NODE_11744_length_1505_cov_2.4717241505 bp DNA 
linear BCT 25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1505) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1505) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v.
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1505 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..>1505 /locus_tag="DP116_28255" CDS <1..>1505 /locus_tag="DP116_28255" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877602.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="CusA/CzcA family heavy metal efflux RND transporter" /protein_id="PRJNA477356:DP116_28255" /translation="VFPSFAPPQVEIQTEAPGLAPEELESLVTLPIESAINGTPGITA VRSSSAAGISVVKVIFNWGTDIYQARQLVTERLQQAQSKLPSGVETPQISPTSSPIGT VLQYAFTSQSTPLMEVRRIVDWQVTNRLLAVPGVSQVVAYGGDSRQYQVLVDHQKLKA FNITLQDIVEAASAANVNAPGGYFITPDREKLIRGIGRIESIEELQQSVITSRNGTPI KLSDVTDVQIGAAIKRGDGSFNGQKAIIVMINKQPQADTPTVTRAIEGAMEEIKAGLP KDVQVHPTFRQENYIDSSIENVREALVEGSIIVAIILIPFLMNWRNLAICLTALPLSL LIGVLALNWLGQGLNTMTLGGLAVAIGSAVDDAIVDAENVFRNLRENKYSPNPRPVLD VVFDGCQEVRDSVFGATIITIVVFSPIFALSGVEGSIFIPMGLGYLAAVLASSVTALT VTPALCAILLPYGNLPETEPWVARFFKKLYHPLLTHILHLEIWMSKPKLRA" BASE COUNT 405 a 321 c 337 g 442 t ORIGIN 1 tgtcttcccc agctttgcac ccccccaagt cgaaattcaa actgaagcac cgggacttgc 61 tcccgaagaa ttggaatctt tggtaacttt accaattgaa agtgcaatta acgggactcc 121 gggcatcact gcggtacgct cttcgagtgc ggcgggaatt tctgttgtca aagtcatttt 181 taactgggga accgatatct atcaagctcg ccagctagta acagagcgat tgcaacaagc 241 tcaaagtaag cttccatccg gggtagaaac gccacaaatt tcccctacca gttcccctat 301 tggcactgta ctacagtatg cctttacttc tcaaagcact cctttaatgg aagtgcggcg 361 tattgttgat tggcaagtga caaaccgcct tttggctgtc cccggcgtta gccaggtagt 421 agcgtacggt ggcgatagtc gtcaatatca agtattggtt gaccaccaga aattaaaagc 481 atttaatatt actttacaag atatagtaga agcagcttct gctgccaatg tcaatgctcc 541 tggtggctat tttatcaccc ctgaccgaga aaagttaatt cggggtattg ggcggattga 601 atctatcgaa gaattacagc aatcagtcat tacctcccgc aatggtacgc ctattaagtt 661 atccgatgtc actgatgtgc aaattggtgc agctattaaa cggggcgatg gcagttttaa 721 cggtcaaaag gcaattattg tcatgattaa taaacagccg caagctgata ctcccactgt 781 tacccgtgcc atagaagggg cgatggaaga gattaaagca ggcttaccta aagacgttca 841 ggtacaccca acatttcgtc aagaaaacta tatcgattct tctattgaaa atgttagaga 901 agctttagtt gaaggcagca ttattgttgc gattattctc attccgtttt tgatgaattg 961 gcgcaacttg gctatttgtt taactgctct tcccttgtct ttactaatag gagtactggc 1021 actgaattgg ttgggacaag gtttaaatac gatgactttg ggagggttag cagtagcaat 1081 tggttcagca gttgatgatg cgattgtcga 
tgctgaaaat gtatttcgta acctgcgaga 1141 aaataaatac tctcccaacc cgcgtcctgt tctagatgtt gtatttgacg gttgtcagga 1201 agtgcgcgat tcagtatttg gagctacgat aattacaata gttgtcttct ctccaatttt 1261 tgctttgagt ggcgtagaag gtagcatttt tattccaatg gggttgggat atctagcagc 1321 agttctcgct tctagcgtga cagcattgac ggtgactcca gctttatgtg caattctttt 1381 gccttacggt aacttgccag aaaccgaacc ttgggttgcg agatttttta aaaagctcta 1441 tcatcccctg ttaactcaca tcttgcacct ggaaatatgg atgtcaaaac caaaactacg 1501 tgcaa // LOCUS NODE_11957_length_1466_cov_4.9085751466 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1466) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1466) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1466 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(169..1113) /locus_tag="DP116_28260" /pseudo CDS complement(169..1113) /locus_tag="DP116_28260" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017308356.1" /note="internal stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="Red carotenoid-binding protein" gene complement(1193..1402) /locus_tag="DP116_28265" /pseudo CDS complement(1193..1402) /locus_tag="DP116_28265" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015196839.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="response regulator" BASE COUNT 415 a 347 c 331 g 373 t ORIGIN 1 ccccacaccc cacaccctgt ttgctgtaac gaaatgtaaa atcgacaaac agtttagaaa 61 taggtcattt caaagttttt acaaaactca acatacatta ttttaagccg ctcgcgatcg 121 cgagcggctt aaaacagaaa gctcatcact catcactctc aagctcttct accttatgaa 181 gcccaagttc agcagttcct ggggagacgc cagcaagtca attgccacaa agaaaatctt 241 gccttgagaa tcgagcaaga accgctatgc taggtttatg ccaacactgt cactaaacca 301 gggagtttgc actttacccg tcactttaat ctgggtaaac cctccttccg caggctcaga 361 aaccccctgc tcaggtatca gcttgagccc gtagcattct tcacgcatat acgcgaggat 421 ggattcatga ccgacaattg gttcttggaa aggtgattgc agggcaccat cttcagcaaa 481 taaagccaca gccgcttgga agtcaaaagc gttcatattc tccatgtagc tcaggactgt 541 caagttgtcg atgccctgaa tgctgacctt agctcgtggt gcgatatctt taggtggaac 601 cacaggttct ttgacttttt gagtgccaac agttggggca taccccatgt tgatcacaat 661 atcctgtaat acggcgagtt gctgaccccc ttcaagttga cggatagctt ggagcacact 721 tgatgctttc gcagacagtt tatagccttc tggaatcgga gcaacgattc cctctttcat 781 ccactcgctt aactgatacc agaagcccaa cttgacattc gtgccgaaag acgaataggt 841 acggcagatg tccgtgtcag tatggttaac gagatcgcac ataacttgcg tttgctccaa 901 atcaggcatc tgcttgattt gagtcagggt tttttctgca aagaccatgt tgactacctg 961 catagcagcg gaagtgattg taactcccat ctcggtatag gcaaaccaaa gtaaagccaa 1021 ctgatcctca gcactgagtt gattaaatga ttcaacggtc gctggaaccg catcagcaac 1081 ttgagtgtca ggaaaaatgg agcgtgcaga caagggtagt aaatgacatt tttcaaaatc 1141 tccagaacgt ttaggacatt gatttgttgg tagtgtttca aaaatagtga tgctaacttg 1201 ccaatgttgg atataactgc ttttcgagac ttagctcacc atgtttcggg 
acattaccca 1261 aaatgcgttg agcgatcagg tatgaggact cttggcaaac cactaaacag gcaagcacag 1321 caaggaactc atcaaaattc aggggtttgg tgaaccacag gtcaaaccca gcacacaaag 1381 cacgttgaag catcttttgg ttcttggcaa cttttggtga tctcgcttgg gggaaggcag 1441 agggcagagg gcagaaggca gaagga // LOCUS NODE_12021_length_1455_cov_4.6450001455 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1455) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1455) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1455 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..1114 /locus_tag="DP116_28270" CDS <1..1114 /locus_tag="DP116_28270" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015078275.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="IS4 family transposase" /protein_id="PRJNA477356:DP116_28270" /translation="LFRVNEIYLLAGDEVVVSKSGKQTYGLDRFFSSLASQPISGLSF FVLSLVSVERGQSFPIQIEQVIKSDTEINSTWAIKTVKAQEKRGRGRPKGSQNKNKTE VILTSELLLIKKMIHSLFKLVANFIPLTYLVLDGHFGNNNALQMARQVNLHIISKLRH DSALYIPYENPDSNSRSRRKYGDKLDYRNLPDKYLCQSTINKDIQTDVYQATFLHKEF AQALNVVILVKINLKTNTRSHVILFSSDRKLSFDKIIDYYKLRFQIEFNFRDAKQFWG LEDFRNLSKTAVTNAANLAFFMVNLSHHILADFRLFNPESGIIDLKAHYRGFRYVREI LKMLPEIPEPILLTQIFAKLTSLGRIHTASTGVEPS" gene complement(1132..1401) /locus_tag="DP116_28275" /pseudo CDS complement(1132..1401) /locus_tag="DP116_28275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015210056.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="amine oxidase" BASE COUNT 469 a 284 c 273 g 429 t ORIGIN 1 tttgttccgt gtaaatgaaa tatacttgct ggcaggagat gaggttgtag tcagtaaatc 61 aggtaaacag acttatggat tagatagatt cttttctagc cttgccagtc aaccgatatc 121 aggactatct ttctttgtat tatcgttagt gagtgttgaa agggggcagt catttcctat 181 tcagatagaa caggtaatca agagtgatac agaaatcaat agcacgtggg caatcaaaac 241 agtaaaagcc caagaaaaac ggggacgggg acgaccaaaa gggagtcaga acaaaaataa 301 aacagaagtg atattaacat ctgaattatt gctaattaaa aagatgattc attcactatt 361 caagttagta gctaacttta ttcccctgac atacttggta ttagatggtc atttcggtaa 421 caataatgct ttacagatgg cacgacaagt taacttgcac ataatttcca aactccgcca 481 tgattcagcg ttatacatcc cttatgaaaa tcctgactcc aatagtcgct ctcgtcgtaa 541 atacggtgac aagctggact atcgtaatct accagacaaa tatttatgcc aaagtaccat 601 taacaaagat attcaaactg acgtttatca agccactttc ctgcacaaag aatttgctca 661 ggctctcaat gtagtaattt tggtcaaaat caatcttaaa actaatactc gtagtcatgt 721 aattctcttc tctagtgacc ggaaattatc gtttgataaa atcattgact attacaagct 781 tcgtttccaa atcgagttta actttcgtga tgccaagcag ttttggggat tagaagattt 841 taggaatttg agcaaaactg cagtgactaa tgctgctaat ttagcatttt 
ttatggttaa 901 tttatctcat catattctag ctgactttcg cctctttaat cccgaatccg gcattattga 961 ccttaaggct cactatcgtg gctttcgata tgtccgtgaa atcttaaaaa tgcttcctga 1021 aattcctgag cctattttat taacccagat ttttgccaag cttacttctt taggtcgtat 1081 tcataccgct tctacgggcg ttgaaccctc gtaaattggc aaaggtattg taaaatagta 1141 aagcattcct atagctgctg ctgccgaaca ctgttctcca ggagcaaaca agcccaccaa 1201 taacattggt tcaaaagatt cgcggtagag tcgtgcagaa acgccaaaat ctttaaacaa 1261 ttcacgggca gtgataaaat cataacgtcg ccaagcttca tcagaattat caaaatcaat 1321 aatcgcgtaa agtaaaggca gggcgctgag gcgatcaagg agtggcaaac gtttaaactg 1381 tgtataaagg aaagttccta gaacctatcc cactaatgcc gaaatatggt aatctacttc 1441 tcgtatgaga agggt // LOCUS NODE_12067_length_1445_cov_4.2482011445 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1445) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1445) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1445 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..1111) /locus_tag="DP116_28280" CDS complement(<1..1111) /locus_tag="DP116_28280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016872394.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hybrid sensor histidine kinase/response regulator" /protein_id="PRJNA477356:DP116_28280" /translation="MSTEQLTRENDNLDLKQLLKTLVAVKKGDFSARMPLDQTGMAGK ISDTLNDIIEQNERLTAELQRISHVVGKDGKISERASLGNVRGSWSVCVDSVNTLVTD LVQPTAETARVIRAVANGDLSQAIAPEIEGRPLKGEFLQTAQMVNTMVGQLNSFASEV TRVAREVGTEGKLGVKAEVPGVAGTWKDLTDSVNLMAGNLTAQVRNLAEVTTAVANGD LSKKITVDVKGEILELKNTINTMVDQLNSFASEVTRVAREVGTEGKLGVQAQVRGVAG TWKNLTDSVNLMAGNLTAQLRNIAEVTTAVANGDLSKKITVDVKGEILELKNTVNIMV DQLNSFASEVTRVAREVGADGKLGGQAQVRGVAGTW" BASE COUNT 376 a 381 c 258 g 430 t ORIGIN 1 tccaagttcc agcaacaccc cgtacttggg cttgaccacc gagtttacca tctgcaccca 61 cctcacgcgc aacccgcgtc acttccgatg caaaagaatt gagttgatcc accataatat 121 tcacggtgtt cttcaactcc agaatttcac ctttgacatc aacagtaatt ttcttcgata 181 agtcgccgtt cgccaccgct gtcgtgactt cagcaatatt acggagttgt gccgtcaaat 241 ttcctgccat caagttgacg ctgtcggtca agtttttcca agtcccagca acaccgcgca 301 cttgtgcttg tacgcccagc ttaccttctg ttcccacttc cctagcaacc cgcgtcactt 361 cagaagcaaa agagttgagt tgatcgacca tcgtgttgat agtatttttc aactccaaaa 421 tttcaccttt gacatcaaca gtaattttct tcgagaggtc gccatttgcc acagcggttg 481 ttacttcagc aagattacga acttgtgcgg ttaagttacc cgccattaaa ttcacactgt 541 ccgtcaagtc cttccaagtt ccagccacac ctggaacttc tgctttcact cccagtttgc 601 cttcagttcc cacttccctt gcaactcgtg tgacctcgct agcaaacgaa ttcaactgac 661 ccaccattgt attaaccatt tgggcagttt ggagaaattc tcctttgagg ggtctacctt 721 caatttctgg tgcaatcgct tgagataaat caccatttgc cacagcccgg atcacacgcg 781 ccgtttccgc tgtcggctga accaaatctg tcactagggt gttgacagaa tcaacacaca 841 ctgaccaaga accacgaaca tttcccaaag acgcgcgttc actaattttg ccatctttac 901 caacaacatg actaatcctc tgtagctctg ctgttagccg ctcattttgt tcaataatat 961 cgttgagcgt atcagagatt tttcctgcca taccagtttg gtctaaaggc atacgggcag 1021 aaaagtcacc ttttttaaca gctaccagcg ttttcagtag ctgttttaaa tctaaattat 1081 cgttttctct ggttaactgt tcagtggaca tggtggcggg gcaataatat ctatattaat 1141 tatctttgct attgcttgct atcagttgca attgcgttct ttacacaaag ggcataccaa 1201 attatgacgc aaaaaagaaa 
ctttggtttt caattaactt gttgtgccaa aggagatatc 1261 gtataaaata taacgattat caagctcttt gagaaagata tgttctagaa atctatctga 1321 gggatgaaaa gtatcgatca gtccgctgga ttatttataa ttcctgcgtt ccctcacctc 1381 accctcatcc cctccgctta attcaaagta tgcctacggc acgctgcgct atcaaaattc 1441 aaagt // LOCUS NODE_12082_length_1443_cov_6.0540351443 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1443) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1443) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1443 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..770 /locus_tag="DP116_28285" CDS <1..770 /locus_tag="DP116_28285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017741294.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="RNA-dependent DNA polymerase" /protein_id="PRJNA477356:DP116_28285" /translation="FVSSITFSALTEPYWVIILRPEDNADEILGKINQFLADRGMNIS EKKTKITASTDGFDFLGWRFKVQSNGKFRCVPSEDNYKAFRQKVKAIVNCSNYGSKVK AEKLAPLVRGWRNYHKFCDMDKYSLYHIQLRTFRVFNKEAKNNRYTSKKLLDKAFPAV PYSENKYVNVKGNKSPYDGDLTYWSERNSRLYDGETSRALRKQNHACGHCGLKMLSEE RVHLHHIDGNHNNWKKKNLLAVHESCHDYIHVSKTAR" gene 939..1409 /locus_tag="DP116_28290" CDS 939..1409 /locus_tag="DP116_28290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017749009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28290" /translation="MEAQKQTISRLQQAVSAALARQQRAASALNPSHAEVAIAESRIA QEKASGEVSFATLDKERKALIQQQIEIHKQRERDARELQQVEIDLSQTAITATATGII SKLNLRNSGQTVRPGEEIAQIVPSDAPIVVKAAVALEDKSKLKEGQNVLVVCQP" BASE COUNT 488 a 322 c 324 g 309 t ORIGIN 1 ttttcgtgtc ctcaataacg ttttcagcct taactgaacc gtattgggtt ataatactca 61 gacccgaaga caatgcagat gaaatacttg gaaaaataaa ccagttcctt gcagacagag 121 gaatgaatat cagcgagaaa aagacaaaga ttactgcatc gacagatggt tttgatttcc 181 tcggttggcg cttcaaagtg caaagcaacg gaaaatttag atgtgtccct tcggaggaca 241 attacaaagc attccgtcag aaagtaaaag ccatcgttaa ctgctcgaac tatggttcca 301 aggtaaaagc tgaaaaacta gctcccttgg ttagaggatg gaggaactat cataagttct 361 gtgacatgga taagtactct ctataccata tccaactcag aacatttaga gtgttcaaca 421 aggaagcaaa gaacaaccga tacactagta agaaactact agataaggcg ttcccagcag 481 tcccatactc cgaaaacaaa tacgtcaacg ttaaaggaaa taaatcccct tatgacggag 541 acctcaccta ctggagtgag cgaaatagca gactctacga cggtgaaacc tctagagctt 601 taagaaagca aaaccatgca tgtggtcact gtggtttaaa gatgcttagt gaagaacgag 661 tgcatctaca ccatatcgat ggtaatcaca acaactggaa gaagaaaaat ctgctagctg 721 tacatgaaag ctgtcacgat tatatccacg tgagcaaaac cgcaaggtaa gaacatcggg 781 agccgtatgc acggaaacgg gcacgtacgg ttctaacaga gagggacggg ggataatacc 841 ccctctcgac tctacctcag agtgtagcaa aagttcgagc tctttcttgg aatcaattcg 
901 aggaagcaca actcgctgtt gaccaacaag aacaagcagt ggaggcgcaa aaacaaacaa 961 tttcgcgtct acaacaagca gtttcggcag ctttagctag acagcaacgc gccgcttctg 1021 ccctgaatcc gagtcatgca gaagtggcga tcgccgagtc gcggattgcc caagaaaaag 1081 cttcggggga agtcagcttc gctacattag acaaagaacg taaagccctc atccagcaac 1141 aaattgaaat tcacaaacaa cgagaacgcg acgcccgcga actccaacaa gtggaaatcg 1201 acctcagcca gactgccatt actgccacag caaccggtat tatctccaaa ctaaatctgc 1261 gaaattctgg tcaaactgta cgcccaggag aggaaatcgc gcaaatagtc cctagtgatg 1321 cccctatagt agttaaggca gcggtagcac ttgaggataa aagcaagttg aaagaaggac 1381 agaacgtact agtagtctgt cagccttgaa ttgaggggtg attcggttaa taatttgagt 1441 aat // LOCUS NODE_12162_length_1430_cov_2.5287271430 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1430) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1430) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1430 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..815) /locus_tag="DP116_28295" rRNA complement(<1..815) /locus_tag="DP116_28295" /product="16S ribosomal RNA" BASE COUNT 348 a 374 c 325 g 383 t ORIGIN 1 cagtaggggt cgatacccac tacgcctagt atccatcgtt tacggccagg actactgggg 61 tatctaatcc cattcgctcc cctggctttc gttcctcagc gtcagttatg gcccagttag 121 tcgcctacgc cactggtgtt cttcccaata tctacgcatt tcaccgctac actgggaatt 181 ccactaacct ctaccacact ccagtctgcc agtatccaat gcactcccga ggttgagccc 241 cgatctttaa catcagactt aacaaaccgc ctacgaactc tttacgccca ataaatccgg 301 acaacgcttg catcctacgt cttaccgcgg ctgctggcac gtagttagcc gatgcttcct 361 ttgctggtac cgtcattttt ttcgttccag ccgacagaac tttacacccc aaagggcttc 421 ttcgttcacg 
cggtgtcgct gtgtcagggt ttcccccatt gcacaaaatt ccccactgct 481 gcctcccgta ggagtatggg ccgtgtctca gtcccattgt ggctgatcat cctctcagac 541 cagctaccga tcgtcgcctt ggtaggcctt taccctacca actagctaat cggacgcaga 601 ctcatcttaa gacggattac tcctttcata tacgctcgtg tccgagcaca tacatatgcg 661 gtattagcaa tcctttcgga ctgttatccc ccatctcaag gcagatttct acgcgttact 721 cacccgtccg ccactgggta ttgctacccc gttcgacttg catgtgtgaa gcacaccgcc 781 agcgttcgtc ctgagccagg atcaaactct ccgttgtcaa aaaccatcag gagtactgat 841 gatcgttaaa cgtttagttt gcctactcaa agcttttgac aggcgccttg atggctattc 901 agttatcagg gttcaacgac ctgtccaccg agcgattcgc tcagttcagt gacaggcaac 961 tagtaaataa taccaatcga cccaaaaggt gtcaacacga aaaaagcccc aagaatggcg 1021 tttttactat gttgttgcaa ttcatcatat atacaatgcc tttagaatta catatttgca 1081 gccgctcgca gggtagttag aagcgcatgg aaagccattc ttcgcgattg gagaaattgc 1141 agagatttgg aaaaaacttt tttgatcgat ttggaataag cgatatcaca tcttgtattt 1201 gctagaaatg tgcgatcgag gtccgcacta cgagcgggat tctatataaa ggaagaagaa 1261 agtcgatagc ggaatgttga tgcaggaagc atcagcggag taatgagagc agcatgcagt 1321 gggggtcttg gacgctggtt ggatttatcg gatcgttgat tactagataa attaggacgg 1381 tggagtggtg ggaagtggca tgaattgtcg gttgctgcac ggtagatgga // LOCUS NODE_12294_length_1408_cov_3.8270511408 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1408) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1408) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1408 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 19..1212 /locus_tag="DP116_28300" CDS 19..1212 /locus_tag="DP116_28300" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009456151.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28300" /translation="MTSLSPHKKAKALKPSSRRPAKELCSECGLCDTYYIHYVKEACA FINQQIDKLEEQTHTRSRNFDNSDELYFGVHQDMMAARKKQPVPGAQWTGIVSTIAIE MLNRGIVEGVVCVQNTKEDRFQPMPIIARTPEEILAARVNKPTLSPNLSVLEQVEKSG MKRLLVIGVGCQIQALRAVEKQLGLEKLYVLGTPCVDNVNRAGLQKFLETTSKSPDTV VHYEFMQDFRVHFKHEDGSTETVPFFGLKTNKLKDVFAPSCMSCFDYVNSLADLVVGY MGAPFGWQWIVVRNEQGQQMLDLVTDQLETGPVMSKGDRTAAVQQSIPAYDKGVTLPM WAAKLMGVVIEKIGPKGLEYARFSIDSHFTRNYLYVKRNHPEKLDSHVPEFAKRIVEQ YKLPE" BASE COUNT 434 a 315 c 308 g 351 t ORIGIN 1 taataaactg ctcttaaaat gacatccctg tctcctcaca aaaaagccaa agccctcaaa 61 cccagtagcc gtcgccctgc taaagaactc tgtagcgagt gtggtctgtg cgatacatac 121 tatattcatt atgtcaagga agcctgtgct tttattaatc agcaaataga taaactcgaa 181 gaacaaacgc acactcgctc tcgaaatttc gacaattctg atgaattgta ctttggtgtg 241 catcaagaca tgatggcggc gcggaaaaag cagcctgttc ccggtgctca atggacagga 301 attgttagta ccatcgccat cgaaatgctg aatcgcggta ttgttgaggg tgtcgtttgt 361 gtgcaaaaca ccaaagaaga ccgctttcaa ccaatgccca tcattgcccg taccccagag 421 gaaatactgg cagcacgagt caataaacca actctctcac caaatttatc cgtcttggaa 481 caagtggaaa aatcaggaat gaagcggcta ttggtgattg gtgttggttg ccaaattcag 541 gcattacgag ccgtcgaaaa acaactaggc ttagaaaaac tttacgtttt aggcacacct 601 tgtgtcgata atgtcaaccg tgcaggactg caaaaattct tggaaacaac cagcaaatca 661 cctgacacgg tagtgcatta cgaattcatg caagacttcc gggttcattt caaacacgaa 721 gatggctcaa ccgaaacggt gcctttcttt ggactaaaaa ccaataaact taaagatgta 781 tttgccccct cttgcatgag ttgctttgac tacgttaatt ctctggcgga tttggtcgta 841 ggttacatgg gggcaccttt tgggtggcaa tggattgttg tcagaaatga acaaggtcaa 901 cagatgctgg acttggtgac agaccaacta gaaactgggc cagtgatgtc aaagggcgat 961 cgcacagccg cagtccaaca aagcataccc gcctacgata aaggtgtcac cctccccatg 1021 tgggcggcaa aattaatggg tgtcgtgatc gaaaaaattg gtcctaaagg tttagaatac 1081 gcccgttttt caattgattc tcactttacc cgcaactatt tgtatgtgaa gcgaaatcat 1141 ccggagaaat tagactcaca cgttccagaa tttgccaagc gtatcgttga gcagtataag 1201 ttaccagagt aaaaagacaa 
gagtagccga ctcagaacct agctataatt ctcgatagtc 1261 tcttacccga acagccttaa tgagttcact aaaagtgagt tgcaatgggt gatgaagacg 1321 tagaatttga tctgggcaaa aacaattcat ccttatggca acaaaatttc caaaatttag 1381 ccaggatcta gcacaggatc cgacgacg // LOCUS NODE_12375_length_1395_cov_21.5820901395 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1395) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1395) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1395 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(6..1076) /locus_tag="DP116_28305" CDS complement(6..1076) /locus_tag="DP116_28305" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015152474.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IS701 family transposase" /protein_id="PRJNA477356:DP116_28305" /translation="MPTQYQKDNLEAMLALFLEAQGHPLPEHSQTKSPSAISRFLNIN PWSTREMIRVVRSVALQTVFQALASSGKGRKPFLQVIIDLTTLEKRGKFQEFGDLIRV YNGKRGLHLVVVYLVIGKCRIPWNFRVWRGKGTPSPAKLGLKLVKRLPQSLTQHFQTI ILADTAFGSVEFLEGVRRLKYHAITGVAISRKLSDGRVLRHLHQQGQQVRLVGLKFPV TVYWYYLKRANGKLEKRFVLSTRPMKASTLKWWGKRRWQIEGWFKIAKHRFGLHRFGQ GTLLGMYRWLILSLTAYLIAHWTHLHLQLASPPNWGHAAQTALESIFPHVVVYLLLLD IERLAPLARSFGFDIHISRCKM" BASE COUNT 414 a 290 c 283 g 408 t ORIGIN 1 ttaactcaca tcttgcacct ggaaatatgg atgtcaaaac caaaactacg tgcaagggga 61 gcaaggcgtt caatatcaag taaaagaaga tacacaacta catgtggaaa aatagactca 121 agggcagttt gtgcagcgtg accccaatta ggtggtgatg ctagctgaag gtgaagatga 181 gtccaatgag cgatgaggta cgcagttagt gatagaatca accaacgata catacccagg 241 agagttcctt gcccaaaacg atgtagaccg aagcgatgct ttgcaatttt aaaccaaccc 301 tcgatttgcc acctccgctt accccaccac ttaagagtag aagctttcat gggacgggta 361 gacaagacga aacgcttttc tagtttcccg ttggcacgct ttaagtaata ccaataaacg 421 gtcacaggga acttcaaacc aactaaacga acttgttgcc cctgttgatg taaatgtcgt 481 aaaactctgc catcagataa cttgcgacta atagctacgc cagtaatagc atgatattta 541 aggcgacgca caccttcaag aaattcgaca ctgccgaaag ctgtatcagc taaaataatt 601 gtctgaaaat gttgagttaa agattgaggt aaacgtttaa ccagtttgag tccaagcttt 661 gcgggggaag gagttccttt acctcgccaa acacgaaaat tccaaggaat gcggcacttt 721 ccaattacta aataaactac gactagatgt agaccacgct taccattata aactctaatt 781 aaatcaccaa actcttggaa tttaccccgc ttctctaaag ttgtgaggtc aattattact 841 tgtaaaaatg gttttcttcc tttgccagat gatgctaacg cctgaaaaac tgtttgtaat 901 gctacagagc gaacaacacg aatcatctcc ctggttgacc aaggattaat atttaagaat 961 cggctgattg cacttgggga ttttgtttga ctgtgttccg gtaatggatg tccttgtgct 1021 tctaagaata acgccagcat agcttctagg ttatcttttt ggtactgcgt cggcatcaat 1081 tcttgcaagg tgtaaactag atcttgggcg tgggcaagca ttgtcttgtt aatctatgaa 1141 tacatttcac gccctttgtc tcatgttttg aaccaactaa gcaaatccaa tttccgagtg 1201 cgagcgcact gttgttgagg cagaatgcat catcctagca tcttgtagtt tttgtcgtta 
1261 tctatgtata tgtgcattca aattagttat attattaacc ctattgtact gaatccagac 1321 tttttttcta ctgttatctt ttgcgctctt tattcacctt ttacggattt attcggttag 1381 gtgcaagaag tgagt // LOCUS NODE_12537_length_1369_cov_20.7511421369 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1369) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1369) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1369 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 128..1212 /locus_tag="DP116_28310" CDS join(128..293,293..1212) /locus_tag="DP116_28310" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_076612113.1" /ribosomal_slippage /note="programmed frameshift; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="ISKra4 family transposase" /protein_id="PRJNA477356:DP116_28310" /translation="MVKTMTPEDRKLLEAHIKEIAKILSKNTPPEKIETFEGIETSVR DQVLEHVSPQINLFFVAEKTGTESGRIRTIRSCVGRIKITQKQASRLGIEPYRRLSPM LEKCCLLLAANESFQDAENDLKVLTGVEVGHSTHHRQVQKVDLSPPNIKQKLTEVCLD GGKVRLRSEEKGKSAYWKEYKTGRLQGIYYGAFFQDNFSLINWVNTQNLARTIYCLGD GHDGVWNLFAQIANDHTRQEILDWYHLKENLYKVQATKKFLEQIEADLWQGCVEEAMN KLKKTKYVGVTNFISYLRKHRHRLVNYMYFQAEQLSSIGSGAVESAVKQIDKRLQIVG AQWKYENLPQILQLRCAYLNGHLAPQF" BASE COUNT 479 a 244 c 299 g 347 t ORIGIN 1 gggagcatcc caattttgca agaagacgaa ataaaggaaa aatgattcgc gagttttcat 61 cgcgaaaaaa ggggggggac tatacggatg tggcacataa ctgcgaaaat tatggcagtc 121 aggaagcatg gttaaaacaa tgacgccaga agatagaaaa cttttagaag cccacattaa 181 agagatagcc aaaatccttt ctaaaaacac cccacccgaa aaaatcgaga cattcgaggg 241 catcgaaaca tctgtacgcg accaggtttt ggaacatgtt agcccccaaa tcaccttttt 301 tttgtcgcag aaaagacggg aacagaatct gggcgcatcc gaacgataag aagttgtgtt 361 ggaaggataa aaataactca aaagcaagcg tcacgtttag gaattgagcc ataccggcga 421 ttaagtccga tgctagaaaa atgctgtttg ttattggcag caaatgagtc attccaggac 481 gcagagaatg accttaaagt tcttactggc gtagaagttg gtcacagcac tcatcatcgt 541 caagttcaga aagtagactt atccccaccc aatattaaac aaaaactgac agaagtctgc 601 cttgatggcg ggaaagtacg tttacgttca gaggaaaaag gtaaatccgc gtactggaaa 661 gagtataaaa caggacgatt acaaggaata tattacgggg cattctttca agataatttc 721 tctctcataa attgggttaa tactcaaaac cttgctcgaa ccatttattg cttgggtgat 781 ggacatgatg gtgtgtggaa tttatttgca caaatcgcaa acgaccatac ccgacaagaa 841 attcttgatt ggtatcattt gaaggaaaac ctttataaag ttcaggcgac aaagaagttt 901 ttagagcaaa ttgaagcaga cttatggcag ggatgtgtag aagaagcgat gaataaactg 961 aagaaaacga agtatgttgg tgtgactaac tttataagtt atctacgtaa acataggcat 1021 cgcttagtca attatatgta tttccaagca gagcagttaa gctcgattgg ttcaggtgca 1081 gttgagtcag cagtcaagca aatcgacaag cgactacaaa tagttggcgc acagtggaag 1141 tatgaaaatt taccacaaat tcttcaactg cgttgtgcct atttaaatgg acacttagct 1201 ccacaattct aaaacataaa gtaaacctgg ttaaaatgtc aatcgaactt 
tattcgcgag 1261 ttataatcgc gaataatttg ggtttgtctt tatgcctcga ttaacatcga atgatactga 1321 acaagataaa ttggctcgtt ttgaccgtat taaaaaaaga agtcagaag // LOCUS NODE_12711_length_1341_cov_1.7363921341 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1341) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1341) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1341 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(29..523) /locus_tag="DP116_28315" CDS complement(29..523) /locus_tag="DP116_28315" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008239375.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DUF4385 domain-containing protein" /protein_id="PRJNA477356:DP116_28315" /translation="MPPEKARKPSYLDFDRAKYAWKAGVDYRAHPEKYRVGKGEQGVL ICEPYKSEIVAHWQFKTPQIARKSSRTIFKMFKDYLAEGDFVGADMARKFLQMGFTRA RRYTNWKGGKKYDKEHDYALNEKGTGDPEKAESAEIFFKAWKKAEAVAEYAERKQEWR EQYG" gene complement(613..685) /locus_tag="DP116_28320" tRNA complement(613..685) /locus_tag="DP116_28320" /product="tRNA-Phe" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:complement(650..652),aa:Phe,seq:gaa) gene complement(782..1285) /locus_tag="DP116_28325" CDS complement(782..1285) /locus_tag="DP116_28325" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28325" /translation="MQAQSPGQLQGQPMQQGGVPTAQTPGQPGVGTGFTPQMATQFVT VWLQRAMDYNMQTCQTSHKEAFDWMTPNTVQAFRQGYWTPDIEQAVMSGKVSAAFQPI SVQATYMNPDGGIIVAMTGSLVMQSAGTSPATQPLAADFLVKQDNGNVRVVGINIHPT AQQGSPY" BASE COUNT 297 a 365 c 326 g 353 t ORIGIN 1 aaccatattg ctctcgcaag ctgatccgct aaccatattg ctctcgccat tcctgctttc 61 gctcagcgta ctccgcaaca gcttctgcct ttttccacgc tttgaagaat atttcggcgc 121 tttctgcttt ctctggatcg ccggtgcctt tctcattcag ggcgtagtca tgctctttat 181 cgtacttctt gccacccttc caatttgtgt atcgccgcgc acgcgtgaat cccatctgca 241 gaaatttgcg tgccatgtcg gcgccgacaa agtctccttc tgctagatag tctttgaaca 301 ttttgaatat cgtacgactg cttttgcggg cgatctgagg agtcttgaac tgccagtgcg 361 cgacaatctc tgacttgtac ggttcgcaga tcaacacgcc ctgctccccc ttaccaacgc 421 gatatttctc ggggtgcgct cggtagtcga ctccggcttt ccaagcgtac tttgctcggt 481 caaaatcaag gtaggatggc ttgcgtgctt tttctggtgg catgacgtaa tcgaccgaaa 541 aaacattacg aaagtgccaa aacgtctaat aaaaacgacc ccccaatttt ttggagagcc 601 gtttaaatga tttgccgaga gccaggattg aactggcgac acgaggattt tcagtcctct 661 gctctaccga ctgagctacc tcggcaaggg gctctgaatt gctttcgcaa ttgaggcgga 721 aacccaatat accacaacga attcgtgcgt aataacgcag 
gttaagaact caattaaaga 781 tttagtacgg agagccctgc tgcgcggtcg gatgaatgtt aatgccaact acacgaacat 841 ttccattgtc ttgcttgaca agaaaatccg cagccaaggg ctgagttgct ggtgatgtgc 901 ccgccgattg catgacaagc gacccggtca ttgcaacgat aattccgccg tctggattca 961 tgtaagtcgc ttgcaccgag atcggttgaa aagccgcaga caccttccca gacatcacgg 1021 cttgctcgat gtcgggagtc caatagccct ggcgaaaagc ttgcacagta ttaggcgtca 1081 tccagtcaaa cgcttctttg tggctggtct gacaggtctg catgttgtaa tccatggcac 1141 gttgcagcca aacggtgaca aattgagtag ccatctgtgg tgtaaaacca gtgcctactc 1201 ccggctgacc tggcgtttga gctgtaggca ctccaccttg ctgcataggc tgtccctgca 1261 actgcccggg cgactgagcc tgcatgccct gctgcgagtg gagaggcggc tcgtgcgtct 1321 tctgctcgtc acctgtaaat g // LOCUS NODE_12975_length_1300_cov_4.6144581300 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1300) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1300) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1300 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(64..1158) /locus_tag="DP116_28330" CDS complement(64..1158) /locus_tag="DP116_28330" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016876253.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="two-component system response regulator" /protein_id="PRJNA477356:DP116_28330" /translation="MMQQQTQIAPLLTNILVSETHDLFVSKTVQNQGSTVNSFDSDVP RILVVDDHAASRMTAVALLAMEGYQVIEADSGYTAIKLVIQKQPDLILLDVMMPEMDG YQVCELLKQNEHTRLIPVIFITALNDRRSRIRGIETGADDFLTKPFDRVELAARVKSL ITQKRLNEDLDHAEQVLFSIARAIESRDPNTGNHCERLVHLGKTFGEHLNLTRNQIRD LMWGGFLHDIGKVGIPDAVLLKKGKFTAEEWDIMKQHVLIGEKICQPLRSMRGVIPII RHHHERWDGSGYPDGLIKDDIPYLAQVFQVIDIFDALINERPYKRAFTPEEALAVMVD ETAKGWRNPKLMQQFIEFICCFKDWQLSPS" BASE COUNT 359 a 299 c 227 g 415 t ORIGIN 1 cccttacacc cccactcccc tattccccta gttctggtca caaaagccgt aatagttcaa 61 aacttagcta ggcgataact gccaatcttt gaaacaacaa ataaactcta taaactgctg 121 catgagtttt ggattacgcc atccctttgc tgtttcatct accatcacag caagggcttc 181 ttctggcgtg aaagctcttt tatatggtct ttcattaatc aaagcatcaa aaatatctat 241 tacctgaaat acttgcgcta aataaggaat atcatctttt ataagtccat caggatagcc 301 tgagccatcc caacgttcgt ggtgatggcg gataatggga attacacccc gcatactgcg 361 tagtggctgg cagatttttt ctccaatcaa aacgtgttgc ttcataatat cccactcttc 421 ggcagtgaat ttgccttttt tgagcagcac tgcgtcggga atacccactt taccgatgtc 481 atgaagaaat cctccccaca tcaaatcccg aatttggttg cgtgtgagat tgagatgttc 541 tccaaaagtt ttccccaaat gtaccaaacg ttcgcagtgg ttaccagtat taggatcacg 601 actttcaatt gcccttgcaa tggaaaacag cacttgttcg gcgtgatcta agtcttcatt 661 caaacgcttt tgtgttatca aagattttac tcgcgccgct aactctacac ggtcaaaggg 721 tttggtgaga aaatcatctg ccccagtttc aattcctcga atgcgcgatc gcctgtcatt 781 taaagcagta atgaaaatta caggaatgag tctggtatgt tcattctgct taagtaactc 841 gcacacctga tatccatcca tttctggcat catcacatcc aataaaatca aatctggttg 901 tttctgtatt accagcttta tcgccgtata accactgtct gcttctatga cttggtaacc 961 ttccattgcc aaaagggcga cagcagtcat tcgactggca gcatggtcat ctactaccaa 1021 aattcttggc acatctgaat caaagctatt cactgtcgaa ccctgatttt gtaccgtctt 1081 agagacgaat aaatcatgag tctcactaac aagaatattc gttaacaaag gtgcgatttg 1141 tgtttgttgt tgcatcatgc cccttggctc tagccaatga ataattttat gcttttttgc 1201 ttaacaaggc atttttttta aacaagtgat attagatttt 
tttattgtat actttagttt 1261 tttgttatat catcaagtaa ctcctgttga aaatgttaac // LOCUS NODE_13267_length_1263_cov_4.5389071263 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1263) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1263) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1263 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 163..291 /locus_tag="DP116_28335" /pseudo CDS 163..291 /locus_tag="DP116_28335" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015817401.1" /note="incomplete; partial in the middle of a contig; missing start and stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="NAD-dependent deacylase" gene complement(386..>1263) /locus_tag="DP116_28340" CDS complement(386..>1263) /locus_tag="DP116_28340" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016952942.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="ferredoxin:protochlorophyllide reductase (ATP-dependent) subunit N" /protein_id="PRJNA477356:DP116_28340" /translation="SKLLNFGKKKEEVANEESEYVDHPPLVLFGSLPDPVVTQLTLEL KKQGIKVSGWLPAKRFTELPVLEEGYYVAGVNPFLSRTATTLMRRRKCKLIGAPFPIG PDGTRAWIEKICSVFGITPKGLDEREAQIWESLEDYIKLIRGKSVFFMGDNLLEVSLA RFLVRCGMTVPEIGIPYMDKRYQDAELKLLEKTCKEMGVPLPRIVEKPDNYNQVQRIY DLKPDLVITGMAHANPLEARGINTKWSVEFTFAQIHGFTNARDILELVTRPLRRNNSL KDLGWDKLVKEEAKI" BASE COUNT 342 a 339 c 212 g 370 t ORIGIN 1 aggtgattga ctcatatata ttgtcattca taaattaaga aatagaaaaa tgaaaagttt 61 agatattttt aagcagcttt cccgttcact tctcattgta gacatgattt accaacagag 121 ggtcgcgtgt taaaacttga aagtagacag cgatcgctct tttcttcctc tttctctttc 181 tctttctctt tctctttctc tttctcttcc tctttctctt cctctttctc ttcctctttc 241 tctttttctt tctcttcctc ttcctcttcc tcttcctctt cctcttcctc tctgcgtcct 301 ctgcgcctct gcggttcaat aaaaaaaagt gcgccacgac aaacgccaac gcacccaaca 361 aaattcattc caacaaaagg taaaattaaa tcttcgcttc ttccttcacc aacttatccc 421 aacccaaatc tttcaaacta ttattccgac gtagcggacg agtcaccaac tccaaaatat 481 cccgcgcatt agtaaaaccg tgaatctgag caaaagtaaa ctccacagac cacttagtat 541 taatccctcg tgcttccaac gggttagcat gagccatacc agtaataacc aaatctggct 601 tcaaatcgta aatccgctgc acttgattgt agttgtctgg cttttccaca atcctaggca 661 gaggtacgcc catttccttg caagtcttct ccagcaactt caattcagca tcttggtagc 721 gcttatccat gtaagggata ccaatttccg gaacagtcat accgcagcgt accaagaacc 781 gtgctaggga aacttccaac aagttatcac ccatgaaaaa cacagacttg ccacgaataa 841 gttttatgta gtcttccaaa ctttcccaaa tttgtgcttc ccgttcatcc aaacctttag 901 gagtaatacc aaacaccgag cagattttct caatccaagc gcgggtacca tctggaccaa 961 tcgggaaggg tgcgccaatc 
agcttacact tgcggcgacg catcaaggta gtcgcagttc 1021 ggctaagaaa ggggttgaca ccagcgacat aatacccttc ttcaaggact ggtagttcgg 1081 tgaagcgctt ggcgggtagc caaccagaaa ccttgatacc ttgcttcttc aattccagtg 1141 ttaactgagt caccacggga tcaggaaggg aaccaaacaa cacaagtggt ggatgatcca 1201 cgtactcaga ttcttcgttg gcgacttctt ctttcttctt accaaagtta agcagtttag 1261 aaa // LOCUS NODE_13509_length_1231_cov_84.5093541231 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1231) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1231) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1231 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(183..>1231) /locus_tag="DP116_28345" CDS complement(183..>1231) /locus_tag="DP116_28345" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012408342.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=3 /transl_table=11 /product="IS4 family transposase" /protein_id="PRJNA477356:DP116_28345" /translation="SRDSMRDVLKNLIDGLSEAWVKVGKYWRVPCKSAITQARQRLGA KVMSQLFHQLVRPMATDETLGAFLNGLRIVVIDGTCFDVPDSDENARVFGRPSSRPKT QAAFPKVRLVILVEAGTHLIFDALMCPYRIGERVRALRLLRSVTSGMLLMWDRGLHSY AMVNATVTKGCDYLGRIPANVKFLAEKPLADGSYLSWIYPSLKLRKKGCEPRVVRVIE YTIEHPENPEEQVRYRLITSLLDIEKFPAELLATQYHQRWEVENTIDELKIHLLGRKT HVRSQKPREVVQEVYGWLLGHWVVRVLMFQAATTAGVPPLRLSFTGTLRVIRRAVPKF QRLKSEEFPLFSIG" BASE COUNT 342 a 293 c 231 g 365 t ORIGIN 1 caatacggtt cagttaaggc tgaaaacgtt attgaggaca cgaaaagaag gagcttcaac 61 tcgtactcca gtacctcggt gttttggttt tttagaacga aactttgata caggtttttt 121 cacaactctg ggattatttc gctgaacccg ttcgggcaac aatgtgtcca gaatctctat 181 aattaaccaa tcgaaaaaag ggggaattct tcgcttttaa gacgttgaaa tttaggtaca 241 gcacgacgga taactcgtaa tgttccagta aaactcagac gtaaaggtgg aactccagca 301 gtggttgcag cttgaaacat caatactctg actacccaat gtcctaatag ccaaccgtat 361 acctcttgca ccacctcgcg tggtttttga gaacggacat gggtttttcg tcctaaaagg 421 tgtatcttca actcatcaat cgtattttct acttcccatc gctgatgata ttgtgtcgct 481 aataactccg caggaaattt ttcaatatct agtaagcttg taatcaagcg gtatctaact 541 tgttcttctg ggttttcagg atgctcaatt gtatactcaa tcactcgtac aactcttggc 601 tcgcaacctt tttttcgtag cttaagcgat gggtaaatcc atgataaata tgagccgtct 661 gccaacggtt tttctgcaag gaatttaacg ttagcgggta ttctccctaa ataatcgcac 721 cctttagtaa cagtggcgtt gaccattgca taggaatgta agcctctgtc ccacatcaac 781 agcatccctg atgttaccga gcgtaataat cttagtgccc gtactcgttc tccaatacga 841 tatggacaca tcaatgcatc gaagattaag tgtgtcccag cctcaaccaa aattaccaat 901 cgtactttag gaaatgctgc ttgtgtctta ggacgactac taggacgacc aaaaactctt 961 gcattttcat cgctgtctgg aacatcaaaa cacgtaccat caatgactac aattcgtagt 1021 ccatttagaa atgcccccaa tgtttcatcg gttgccattg gtcgtaccaa ctgatgaaac 1081 aattggctca tcacttttgc tcccaatcgc tgcctagctt gtgtgatggc tgatttacac 1141 ggcactcgcc aatatttacc aactttaacc catgcctccg atagaccatc tatgagattt 1201 ttcagcacat cacgcatcga atctctcgac c // LOCUS 
NODE_13948_length_1178_cov_6.7088161178 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1178) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1178) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1178 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..366 /locus_tag="DP116_28350" CDS <1..366 /locus_tag="DP116_28350" /inference="COORDINATES: protein motif:HMM:PF13358.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="IS630 family transposase" /protein_id="PRJNA477356:DP116_28350" /translation="GLVAAMTVPGSTNTEVFLTYVTQVLAPQLWKGAIVVMDNLKVHH AERVKIAIESVGALVKFLPPYSPDLSPIELCWSKLKQFLRSCEARTLELLDRAMADAV NCITEDDAFGWFNHCGLFT" gene 336..548 /locus_tag="DP116_28355" CDS 336..548 /locus_tag="DP116_28355" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28355" /translation="MVQPLWSIYLKIAVNVAAEIGSRKENIDFRVHWQNFLKTQKRCC RKLERSSRQVCYLENGVIKYIQVGKR" BASE COUNT 321 a 242 c 271 g 344 t ORIGIN 1 gggctggttg cagcgatgac tgtgcctgga agcaccaata ccgaggtgtt tcttacatat 61 gtaactcagg ttttggcacc tcaattgtgg aaaggggcta ttgtggttat ggataatctg 121 aaagttcacc acgcggagcg tgtaaaaatt gccattgagt ctgttggtgc attagtcaag 181 tttttacccc cttactctcc tgatttatcc cccatagaac tttgttggtc aaaattgaag 241 caatttctcc gttcttgtga agcacgcaca ctggaattac tcgaccgagc aatggctgat 301 gctgtaaatt gtattaccga agatgatgcc ttcggttggt tcaaccactg tggtctattt 361 acctgaaaat tgctgtaaat gtagctgctg agattggtag tcgtaaagaa aatattgatt 421 ttcgcgtaca ttggcaaaac tttttaaaaa cacagaaacg ctgttgtcgc aaacttgaac 481 gttcttcacg acaagtctgt tacttagaaa atggtgttat taaatacatc caagtaggta 541 aacgttaaaa acctgacaat tgtttcaaat tgtaaagtta tattccgcag aatttgacag 601 ccttattcta caaggattat ttgttaggca tgattaaaaa tcaaaacccc tcacgctagc 661 aaactgatag gggttacaag atttttcatt tcatgtagtt tcttacccgc tcgatagcgg 721 ttacttcgcc atgcctacgc aaaatctcga catacttgat tattgcccat cctgcggtgg 781 ctacaacttt atagaatcgt tagaacctga aaccagtctc cactatgggc gtctgacctg 841 taacgagtgt ggacgctttg tcaaatggtt gcggaagccc aggaaaacac ccgacatatc 901 agtagcaagg cttttgaaca gcactggctt ggaaccgtgg gaaaagaaat ttttgaaagc 961 tgtcgtgtgt cgcaacttga ccaaagctga agtgattgtg gttcaagcaa tagcaaataa 1021 agttaacccc gcgacggtcg ggcggggtag tgggtagtga tagttgttgg gttatgcggc 1081 ttgagcatct gacggaattt 
ctaacttttc ctccaatgcc gcagtgtatt agcgatacgc 1141 gttcgcgcag cgtgtcccct tgggactcag cgtcaagc // LOCUS NODE_14160_length_1154_cov_3.2757051154 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1154) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1154) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1154 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 98..307 /locus_tag="DP116_28360" CDS 98..307 /locus_tag="DP116_28360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137529.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28360" /translation="MSTPEIEARLAILEAEVARLKQQLPVSSIPWWQTILGSFANDPA YDEAMQLGQQYRQSLRPDSDSSGDI" gene 309..731 /locus_tag="DP116_28365" CDS 309..731 /locus_tag="DP116_28365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006668553.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type II toxin-antitoxin system VapC family toxin" /protein_id="PRJNA477356:DP116_28365" /translation="MYLLDTDHISVFDRGGASAQPLLAKIARINPNEVATTIVTYEEQ MRGWLSYIAKADSIEGQIAAYRKLENQLVNYRNIFVISFDDKAGQIFQNLRKSYPRLG SMDLKIAAIALAHQATLLTRNLKDFGQIENLHVEDWTA" BASE COUNT 321 a 284 c 239 g 310 t ORIGIN 1 ttggatttgg actggagagc cagcacaagc gcgatcgcca acctacctta gtccagcaaa 61 atgacataat tagaacaaca gacgctcaaa aaatcctatg tcaacccccg aaatcgaagc 121 cagactagca attttagaag ctgaggtggc tcgcctcaag caacaactcc ccgtttcttc 181 tattccttgg tggcaaacga ttttaggcag ctttgccaac gacccagctt acgacgaagc 241 gatgcaatta gggcaacaat accgtcaatc ccttcgccct gattctgact cctctggcga 301 catttgatat gtatcttttg gataccgacc atatcagcgt ttttgaccgt ggtggagcct 361 cagcccagcc acttcttgct aaaatagccc gtatcaatcc caatgaagtt gctaccacca 421 tagtgactta tgaggaacaa atgcggggtt ggctcagtta catcgccaaa gccgattcca 481 tagaagggca aatcgctgcg tatcgcaaat tagaaaatca acttgttaat taccgaaaca 541 tttttgtgat cagttttgat gacaaagcag ggcaaatctt tcaaaatctt cgcaaatcct 601 atcctcgact aggttcaatg gatctcaaaa ttgcggcgat cgcccttgcc catcaagcaa 661 ctctgctcac ccgtaacctg aaagacttcg gacagattga gaatcttcat gtggaagatt 721 ggacagctta agtcaattca tccaaactca attactcgca gcttttccct tcaggcgtta 781 agcgtagctg cgcctccggc aatcgccctg attctagata actgtaaaat taagcaagag 841 cgatggtttc gccaaagcct tctctgccag aggctccgcg aacggctaac gcaaagaagt 901 acaaaacaat gacacggatt ccattaacat tgttagtttt acttcagtga tactggtttt 961 attatagcga tcgcacctcc cctgtcctgt ggttacttcg cagttgcgct tagaaaaggg 1021 aaagtgctgc atctttcctt ggatgggtac 
tgtcgcttac cgtaccttgg aaagattgct 1081 gaactgtttt tgccgtgaat cgttgggaat tagtagtatc agtggagata atccttattt 1141 tttgacactt aaat // LOCUS NODE_15586_length_1013_cov_4.3997911013 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 1013) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 1013) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1013 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..826) /locus_tag="DP116_28370" CDS complement(<1..826) /locus_tag="DP116_28370" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012411915.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28370" /translation="MNEQRQQDYLNLIQLLLNCRSDDEIREILPANQELLDVDFLQTV EAEAQKFSQQGDENTANRLQGLAMQLGEALNLTPQPPSLVGKEENSKPLKKQEEEWKR GLQSLSEEERQTYFQFLMQVLQATADSSGNSQVVYPLLANNTDKLDGVLVEILRRWGT NRLREAQADVAKSIAGYVVNFSNLIAQFPLGNKVSNIEIAITGYEVALTVYTQQAFPV DWAGMQNNLANAYSIRIAGERAENLERVIAYYQQALKVRTFDAFPVDWAGTQNNLAN" BASE COUNT 246 a 253 c 214 g 300 t ORIGIN 1 cattagcgag attattttgc gtacctgccc aatcaacagg aaaagcgtca aaagttctga 61 ctttcagagc ttgttggtaa taagcgatca ctcgctctaa attctcagcc cgctcccctg 121 cgattctgat actgtaagca ttagcgagat tattttgcat ccctgcccaa tcaacaggaa 181 aagcttgctg ggtgtagact gtcagcgcga cttcatagcc agtgatggca atttctatat 241 tgctgacttt gttacccaag ggaaattgcg ctattagatt gctaaaattt actacatatc 301 cagcgatgga tttcgccaca tctgcttggg cttccctcag tctatttgtc ccccaacggc 361 gcaatatttc tactaacaca ccgtcgagtt tgtctgtatt gtttgccagc aacgggtaaa 421 ctacctggga attaccacta ctgtctgcgg ttgcttgtag tacctgcatc aagaattgga 481 aatatgtctg tctctcttcc tcactgagag attgcaaacc tcgcttccat tcctcctcct 541 gctttttgag aggctttgaa ttctcctcct tccctactag ggaagggggt tggggggtta 601 ggttcaacgc ttcccctagt tgcattgcca aaccttgcaa ccgattagct gtattctcat 661 ctccctgctg cgaaaacttc tgcgcctccg cttctaccgt ctgcaagaag tcaacatcca 721 gtaattcttg gtttgctggc aaaatctctc ggatttcatc gtcactacga caattcaaca 781 ggagttgaat gaggttgaga tagtcttgct gacgctgttc gttcataggt aagggcgtga 841 gagtcgcaca tattttatta tcagagaaat tcaacaccag cgccctagat tccggaaacg 901 gatagacttt catcacaggt gatgtgttcc cgacgacgat ttagatcttg caccttcgcc 961 tgagtacgcc ttgaactgaa gttcaaggct tatagccaaa gtccgttaaa acg // LOCUS NODE_16238_length_959_cov_2.021018959 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. 
REFERENCE 1 (bases 1 to 959) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 959) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..959 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" 
/country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(47..826) /locus_tag="DP116_28375" /pseudo CDS complement(47..826) /locus_tag="DP116_28375" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015181776.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="heavy metal translocating P-type ATPase" BASE COUNT 253 a 257 c 211 g 238 t ORIGIN 1 ggcatgtggc gtccctgggc taaagccaca gggttttcat ctcacccact ataaccttcg 61 catagcttcg ctcaaatcgt tcgataaatt gctgcgaggg tggtgcttcg ttttgagctt 121 gctggacgag acggatgact cgctgaatta agctactttc aggcggttga tgaatcttga 181 gccgcagcgc accactgcca ttgatcgtgc ctgcaaaaac ttcatcacca acagttttct 241 ctacaggaat cgattctcct gtgattgaag cctggttcaa cgtactgaat ccttccatca 301 ccaaaccatc ggtcggcact aattctcccg gtttgaccaa gacttgatcg cctatcctta 361 gctcagaaat cggtatagtt cgttcctgcc cgttccgcat cactcgcgct gtatcggtgg 421 ttaaactcat caacccctga atactccgct ccgttcgctg catggcatat ccttcaagtg 481 cgccactaat cgcaaaaatc aggattagga cagcaccatc cacaatcatg taatattccc 541 gccgccatag ccccagacct gctgctccca gggctgccac aatcatcagc aagtctacat 601 caagttcttt ttctttaaat agggttgtca agccttcacg cgcactctca aacccaccaa 661 tcacgtaggc agcggtcaga atgaagaggg cggctccaat ccagccaaag ttgagcgttt 721 gccagcccag gaaaaccaga accccacaag agatcgccgc gactgcatcg gggtgttcct 781 tcagaaggtt aaaaaagcga aggcgggaag gggaagcaga aaccatttct gaaaatgtcc 841 aaattgatat ctttctatct tacacattga cattagtgtc aatgtcaatg tttcgagtaa 901 actgagagta tgaaaattga acatctgacg attaaagaac tgacggatac ggttggggg // LOCUS NODE_16456_length_941_cov_5.786682941 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. 
SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 941) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 941) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..941 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" 
/strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(127..393) /locus_tag="DP116_28380" CDS complement(127..393) /locus_tag="DP116_28380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994930.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AT hook motif protein" /protein_id="PRJNA477356:DP116_28380" /translation="MNQESEFPFERARQVTPEENQKFRDAIADQFGITLRKRGRPAKD EEEKYEPISIRFHPKIIAWAKEEAEKRGIGYQTVINEALLEKIG" gene complement(377..652) /locus_tag="DP116_28385" CDS complement(377..652) /locus_tag="DP116_28385" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994929.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28385" /translation="MRFEWDENKAESNFLKHGIQFEEALTVFADPYLLFTEDSSHSQG EEREWAIGEAEDGSIVVVVFTMRGERIRIISARKATKRECQQYESGI" BASE COUNT 231 a 202 c 188 g 320 t ORIGIN 1 tcataattat tttatttttc tttaacaact cattgcgttt atggatcgga cgagtgctgt 61 aagttgagag tgcgatcgca cttgtttcag ctttatttgg gtttcaagat agctaaagtg 121 gtgctatcag cctattttct ccaatagtgc ttcgttgata actgtttggt agcctattcc 181 tcgtttttct gcctcttcct ttgcccaggc tatgatttta ggatgaaatc tgatggaaat 241 cggctcatac ttttcctctt catccttagc tggtcgtccg cgcttcctga gtgtaatccc 301 aaactgatcc gcgatcgcat ctctgaattt ttggttttct tctggtgtaa cttgtcttgc 361 tctttcaaaa ggaaactcag attcctgatt catattgttg acactccctt tttgtcgctt 421 ttcttgcact aattatcctt attcgctcac ctcgcatggt aaaaacaacc acaacaattg 481 agccatcttc cgcttcacca atcgcccatt ccctttcttc tccttgagaa tgactagagt 541 cctccgtaaa tagaagatag ggatcggcaa aaactgttag agcttcctca aattgaatac 601 catgttttaa aaaatttgac tcagctttat tctcgtccca ctcaaaacgt atttatagta 661 tataccaaaa 
atatttgcct ttgctatcct acaccgactt gaactttggg acgaagtgca 721 tagaagcgat agcttcgctg cagcatagcg gcagatcgcg tttgttgggt tgatgggaga 781 gtgcgttagc gaagctgaac gaagttctag cttgtgcgac gtgagtgcga tcgcatttat 841 tagggttggg aactgtaact taaaatctgt aaatatctgg acatagcttc cggaatgggt 901 acataatgaa gtgtagtcct tacaagttgc acttgaaatg g // LOCUS NODE_16479_length_939_cov_4.254525939 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 939) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 939) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..939 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(81..752) /locus_tag="DP116_28390" /pseudo CDS complement(81..752) /locus_tag="DP116_28390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016873963.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." 
/pseudo /codon_start=1 /transl_table=11 /product="protoporphyrin IX magnesium chelatase" BASE COUNT 267 a 218 c 193 g 261 t ORIGIN 1 agtttcaaat tagatccccc caacccccct tccaaagggg ggctaatatt gccctgttta 61 taagcagggc gaggagggat ctgctgatac cattccaggt atgcccgtgg tgattcaaaa 121 aatccttgat agtcgggatg taacaacccc atattcgggg tttcgactgg cgggggaatt 181 tcacctacct taagacttaa atatttttct gccaatgtcc agaataaagc agcaacgttt 241 tctggtccac cagcattcca gtaaccatat ataataagcc agttgcgtaa gtcttgcact 301 ttttgcactg gcacatattt caatagtttt ggtcctattt ttaaaaagct aatataacct 361 gcgagtttat cttcctctcg tccgttgctg aatttgtcta agataaattt aactggtttg 421 ggcattcctt tgggtttatc gccgatggcg aaagcaccca gtttggttaa actcagcaat 481 tccaaggctg actcaaacac gaggcggatg gggatatggg agatgcgatc gcgcaaccac 541 aatacctgat cataatcaaa cagcaagcta ccaaaaaaca catcagcatc ttttagtgcg 601 gcttccacct cagtgcgttt ggtggtgatg tcgcgatcgc taaacacaca aatatccaac 661 tcactgacac gagaagtcgc caaaaaagct gccttccgat acaagtcggc gttgaatgat 721 tcaaatccag caattaagac gatacgtttc atgtgttgag atagaaaggt tagggttttc 781 tatctctata ttgtgaccga taattccgaa cagggtacaa gcccccaaca gttatcagtt 841 atcagttacc agttatcagt taccagttac cagttatcag ttaccagtta ccagttatca 901 gttatcagtt atcaggtagg aaacggactc gtccacccc // LOCUS NODE_16885_length_908_cov_4.723329908 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 908) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 908) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..908 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 253..714 /locus_tag="DP116_28395" CDS 253..714 /locus_tag="DP116_28395" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006528005.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="DNA-binding protein" /protein_id="PRJNA477356:DP116_28395" /translation="MTRHLVNPLVPTPEAAVLAEEALNTLGNFLENPPEHVTLSLVEP ELGGATITMPGAVFRLLVDVLKHVKDGHPITILPTQAELTTQQAADILDVSRPYLVKL LDEGKIPSRKVGVYRRVLAADVLRYKQTTDRLRHEALDELVKEGQDLGMGY" gene 720..845 /locus_tag="DP116_28400" /pseudo CDS 720..845 /locus_tag="DP116_28400" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018003706.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="PIN domain-containing protein" BASE COUNT 204 a 234 c 244 g 226 t ORIGIN 1 ataattaatt acacaatgtc attgcgaatg gagcgaagcg aaatgaagca atcgcaaggg 61 ctgggattgc ttcgcttcgc tcgcaatgac tgtaaatatt tgtgttcatc tacttattgc 121 ctgttgcctt gtttgctggt aactgataac tgataactga taactgattc tttgcgccca 181 tttcgactgt tgcgtccatt tcgcgtgtgt gctatggtaa acgaagaatt tgtgtctcgg 241 taaagagcct ctatgacccg acaccttgtc aacccgcttg ttcctacccc cgaagccgcc 301 gtgctggcag aggaggcgct gaatacgctc gggaatttcc ttgagaatcc gccagagcat 361 gtcacgcttt ccctggtcga gccggagctt ggcggtgcga ctatcacgat gccgggggcg 421 gtctttcggc tcctcgtgga cgtcttgaag catgtaaagg acgggcatcc catcaccata 481 ctcccgaccc aggcagagct tacgacgcaa caggcggctg atatactgga cgtgtctcgt 541 ccttatctgg tgaaacttct ggacgaagga aagattccga gccgtaaagt tggcgtgtat 601 cgacgggttc ttgccgctga tgtgcttcgc tacaagcaga ccaccgaccg actccggcat 661 gaggcgctcg acgagcttgt caaagaagga caagacctcg ggatgggata ttgactgaga 721 tgttggtcgg cgtcgcggtc ctggacgctt gcgtcctcta ccccgcaccg ttacgagacc 781 tgttcatgcg tcttgcgacg actgggctat ttcacgcacg ctggactatc gagatccacg 841 aagaagagtc agaagtgcga ttcgttgtta actgaatcac ggtaatcagc ccgtacataa 901 gttaacca // LOCUS NODE_17070_length_896_cov_3.500595896 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. 
ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 896) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 896) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers 
source 1..896 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(511..>896) /locus_tag="DP116_28405" CDS complement(511..>896) /locus_tag="DP116_28405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015115755.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="Uma2 family endonuclease" /protein_id="PRJNA477356:DP116_28405" /translation="DQNQLGEVFDSSTGFTLPSGADRSPDVSWVEKSRWDALTKDQRE KFIPLCPDFVIEILSPNDSLKKTQKKMQEYIENGCRLGWLINRKKQEVEIYRPGQEVE VLKFPQTLGGESIIPGFVLNMQRIC" BASE COUNT 290 a 175 c 199 g 232 t ORIGIN 1 gggagagaag gagggaagga gtgagggagt cagggagagg gcttgacgtt tttcttcaaa 61 gttaaaaata tgcagattaa aaggttaaaa atatgcagat taaaagcgcg acagcttcca 121 cactacgaac ttttaccgaa aaactgggtc taaagcctcg cccttctagg gcgactttta 181 tatattcggg aacaaaaacg tcatgttgtg cgacaataat cataagcagc ggtaattcgc 241 aataaaaaac gaaggcaata aaaaacgaag gcaataaaaa acgaagtagg tctggcggtc 301 aactcctgaa ccgaaaccct ggtagtagaa atcctaaatg ttgtacaagt gggagtcaaa 361 ggaaagcttt gtaaaaacct tatcaaacgc ggtcgggcag tccatgtacg cttgtggacg 421 tatccaaacg gggataccta aatgcttgca tttaggtatc tagggaggac ggatgaagca 481 agaatcctca gccgcgacgg ctgaggagtg tcaacaaatt cgttgcatat tgagtacaaa 541 accaggtata atactttccc cgccaagagt ttgaggaaat tttaaaactt ccacctcttg 601 tcctgggcga taaatttcta cttcctgttt ttttcggttt atcaaccaac ctagacgaca 661 accattttcg atatactctt gcatcttctt ctgagttttt ttcaagctgt cattaggtga 721 gagtatctcg atgacaaaat caggacatag aggaataaat ttttctcttt ggtctttagt 781 caaagcatcc caacgagatt tttccaccca agaaacatct ggagaacggt cagcccctga 841 aggtaaagta aatccagttg atgaatcaaa aacttcgccc aattgatttt ggtcgt // LOCUS NODE_17095_length_894_cov_5.235995894 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema 
bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 894) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 894) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 
3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..894 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..119 /locus_tag="DP116_28410" CDS <1..119 /locus_tag="DP116_28410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017740266.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="DNA-binding response regulator" /protein_id="PRJNA477356:DP116_28410" /translation="ELHLSEGTVRNYVSQIFSQLDVRDRVQAVLWVQQHLLD" gene 200..478 /locus_tag="DP116_28415" CDS 200..478 /locus_tag="DP116_28415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007358468.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glutaredoxin family protein" /protein_id="PRJNA477356:DP116_28415" /translation="MRIILYSKPGCHLCEGLQEKLEQIQNLKFELEIRDITTREDWFQ AYQYEVPVLFLANPRTTEGAEEKKDLPLPRPSPRATVQQLENMLQKYL" BASE COUNT 287 a 157 c 183 g 267 t ORIGIN 1 aagaacttca cctgagtgaa ggaacagtga gaaattatgt ttctcaaatt ttcagtcagt 61 tagatgttcg cgatcgagtt caggctgtgc tgtgggtaca acaacatttg ctcgactaat 121 atcttgctcg atgggatgaa atgaaagtga gtatggtagt aaagatgaag tcaaacagag 181 tctattttgg ttgaagcaaa tgcgaatcat cttatatagc aagcccggtt gtcatttgtg 241 tgagggtttg caggaaaaac tagaacagat acaaaacctc aagtttgagc tagaaatccg 301 ggatattaca actcgtgaag attggtttca agcttaccaa tacgaggtgc cagtgctttt 361 tttagcaaac cccagaacta cagaaggcgc agaggagaaa aaggacctac ccttacctcg 421 tccttctcct agggcaacag tacaacaatt ggagaatatg cttcagaagt atttataact 481 taggtgtaag cgaatatact agaccccaaa ccgctctttg tcgtttcaat ttcttcttac 541 ccagagagcg acacggtgtt atttttacat taaaaagcgc ttgagatttt ttttagatca 601 agcgcttttt tatataattg tataccaatt cacttttttg tgcaacattt gaaactcgca 661 acactctgct ggtgagacaa attgcgaaga taacttgtgg caaacatatg gtgaagtggt 721 atcactcctt gcattaataa agattaaact cataccatga aacgcggaat tagatgagtg 781 acagtttatc aactgaccct agtagatgag ttgttagttt tcaattatta cgggaattac 841 ttacgatact ttttttaacc taaaagtcaa gataagtatg aaaccgtata agta // LOCUS NODE_17184_length_888_cov_3.596639888 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 888) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 888) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..888 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(38..835) /locus_tag="DP116_28420" /pseudo CDS complement(38..835) /locus_tag="DP116_28420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017314612.1" 
/note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="acetyl-CoA C-acyltransferase" BASE COUNT 207 a 268 c 195 g 218 t ORIGIN 1 cccccacacc cctatttccc tacacccctt tgaattgtca aagtcgctcc agtgcaaacg 61 ccgtaccacc gccagtacca tgacacaccg ctgccagccc taactgccca ttcctgtgct 121 gcaaagcgtt gagcagcgtg accataattc gcgccccaga agcgccgatg ggatgcccta 181 gtgcgatcgc ccccccaaag atattgagct tttcgtaagg aattcccagc acccggttaa 241 acagcacgtt gcttagagca aaagcttcgt tgttctcaaa caagtcaaaa tcgtagattc 301 tcatcttcag cttttctaaa agcttattca ctgcgagtat gggaacttca gggaagcgcc 361 aagcttctcc accgatccac gatcctccaa tcacccgtgc tagaggtacc agtccgtaac 421 gttccaccgc cgttcggctt gccaagacaa gagctgctgc tccatctgat atctgactgc 481 tgttaccagc agtgaacacg ccatcctctg tgaaagcaga cttgagtcgg gtcaggctct 541 ctaaggtggt ttccgggcga atgccttcat ctttttccac cagttgcaaa cctttcttcc 601 cttcgacctc aatcggtaca atttccttgt tgaagcaacc ttgctcggtg gcgagtgttg 661 cccgctgctg tgaatataag gcgacttggt ccaagtcagc ccgcgtcacc tggtaagctg 721 ctgccagccg ttcagcttgc tctcccattg cttcagcggt ggtaggatcg gtgagaccgt 781 cgtaaagtaa aatatccgtg agttgctctg gcgcacccaa caggaatttg taaccgcttg 841 taaaaatcta cgcatatctg ggcaacaatc ctcaaatcta cgtagagg // LOCUS NODE_17869_length_845_cov_3.393671845 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 845) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 845) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..845 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..510) /locus_tag="DP116_28425" CDS complement(<1..510) /locus_tag="DP116_28425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_010994161.1" 
/note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28425" /translation="MLSQDFSGQNLRGRSFKNQNLEGANFSGADIRSADFCGANLTGA NFSHAKAGLQKRWAISLVVVSWLLSGISGLFSSFAGAIVMDLLFDDSKNHLEQTSYQI AGLGAITIIVIFFLVTIRQGIQGGFAVAGAVVVAGAVAGAVAGAVAGAGAGAGAGARA VAFVVAIAVA" BASE COUNT 226 a 213 c 178 g 228 t ORIGIN 1 ggctacggct atggctacga cgaaggctac ggctctggct ccggctccgg ctccggctcc 61 ggctccggct acggctccgg ctacggctcc ggctacggct ccggctacga ctacggctcc 121 ggctacggcg aagccccctt gtattccttg gcgaattgtg acgagaaaaa atataacaat 181 aatagtaata gcacccaagc ccgcgatctg ataacttgtt tgctctaaat ggtttttact 241 atcgtcaaat agtaagtcca tcacaatagc gccagcaaaa ctagagaaaa gtcctgatat 301 tcccgacagc aaccacgaaa caacaaccag ggatatagcc caacgctttt gcagtcccgc 361 tttggcatga ctgaagttag cacctgtgag gttggcaccg caaaaatcag cgcttcggat 421 atctgcaccg ctaaagttcg ctccctcaag gttttgattt ttaaacgatc gccctctaag 481 attttgccct gagaaatctt gtgatagcat aaatcacgca ttgataaagt gctgttatga 541 tgtcgtacaa aacccgcact ttattttcca ggttttttag gtaattagga gttacgcatt 601 gacaaaaaat cttattcatg tatgctaagt aaatgtaggg tgtgttgtcg cgcagcgcaa 661 cgcaccttca agaattcaag gtgcgttagc cttcggcata acgcacccta cggactacga 721 acaatttata tttcttcata tacattgatt ttttttcgcc aacttactta gctctgccgt 781 aggcaatcgt ctcacaacta aatttgtgac aaaatagttt tgttgtccta cagccaatgc 841 aatcg // LOCUS NODE_18061_length_834_cov_6.173299834 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 834) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. 
strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 834) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..834 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..235 /locus_tag="DP116_28430" CDS <1..235 /locus_tag="DP116_28430" 
/inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28430" /translation="CRSGCAGLPPSPRRHAVFTVEAGGSDGAGTVTAVGAGVGRLGLL SARVTNTNPLFGAEPGVGVTMCGRLAGIVPLRT" gene 369..554 /locus_tag="DP116_28435" CDS 369..554 /locus_tag="DP116_28435" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013630851.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28435" /translation="MTLTDFYNAVAKQADTDKTAIGVADTKRVLAVAFSVLHTLPTAD VLDVIAKGLATAAKKAK" gene 584..>834 /locus_tag="DP116_28440" CDS 584..>834 /locus_tag="DP116_28440" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28440" /translation="MTGYGSAVPAQHSALSTRHYVFSAPPLPLPLGPRSGASPGPVGR RRAGAVGDGVPAGAATGPAGGPAGDGDGAALPAPGRAGRR" BASE COUNT 120 a 282 c 297 g 135 t ORIGIN 1 gtgccgttca ggctgtgcag gtcttccacc gtcaccccgc cgccatgccg tgttcaccgt 61 ggaggccggc ggctcggacg gggccggcac ggtgacggcc gtgggcgccg gggtgggtcg 121 gttgggcttg ctcagtgccc gggtgacgaa caccaacccg ctcttcgggg ccgagccggg 181 ggttggggtg accatgtgcg gccgcctggc gggaatcgtt ccgctgagaa catagcccac 241 gattgccggt cgcgccggcc gcgccgactg agtcaggaaa actgttcccg gtgtcgggat 301 ttctgctgac gcctcaccaa tcagtcggcg atgatcggac aatcccctat ccctcacgag 361 gccgaccgat gaccctgacc gacttctaca acgcggtggc caagcaggcc gacaccgaca 421 agacggccat cggggtggcc gacaccaagc gggtgctggc cgtcgccttc agcgtgctgc 481 acacgctgcc caccgccgac gtgctggacg tgatcgccaa agggctggcg acggcagcga 541 aaaaggccaa gtagtgctga gtgctgagtg ctgagtgctg agaatgacgg ggtacgggtc 601 ggcggtcccg gctcagcact cagcactcag cactcggcac tatgtattct ctgctccgcc 661 ccttcctctt ccgcttggac cccgaagcgg cgcatcacct ggccctgtcg gccgccgccg 721 ggctggggcg gtcggcgacg gtgtgccggc tggtgcggcg actggcccgg ccggcggccc 781 ggccggtgac ggcgatgggg ctgcacttcc cgcacccggt cgggctggcc ggcg // LOCUS NODE_19068_length_783_cov_2.421703783 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 783) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 783) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..783 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene complement(<1..475) /locus_tag="DP116_28445" CDS complement(<1..475) /locus_tag="DP116_28445" /inference="COORDINATES: protein motif:HMM:TIGR01212" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="TIGR01212 family radical SAM protein" /protein_id="PRJNA477356:DP116_28445" /translation="MPIDAGFTCPNRDGVRAFGGCTFCDEKGSGAPTIDQKLNVREQL VMGMERIVGRYKAKKFLGYFQAFTNTYAPEEVLRNLYDVCFEFEDVVGLCIGTRPDCL PDNILDLLAEYDQKTFVWTEVGLQTIHDRTLDLINRGHSAAEFFDALDRAKQRNLK" BASE COUNT 185 a 190 c 181 g 227 t ORIGIN 1 tttttagatt gcgctgtttc gctcggtcca atgcatcaaa aaattccgcc gctgaatgtc 61 cgcgattgat caggtcaagt gttcggtcgt gaatggtctg caatccgact tcagtccata 121 caaaagtttt ttggtcgtac tccgctaaca aatcaagaat gttgtcaggc aagcagtccg 181 gtcttgtgcc gatgcacaat ccgactacat cttcgaactc gaagcaaaca tcgtataaat 241 tacgcagcac ttcttctggc gcatatgtgt ttgtgaatgc ttgaaaatat ccaagaaatt 301 tcttcgcttt gtatctgcca acgatgcgct ccattcccat cacgagctgc tcgcgcacat 361 tcagcttctg gtcgattgta ggcgcacccg accccttttc gtcgcagaat gtgcagccgc 421 cgaaagcacg aacgccatct ctattaggac aagtaaatcc agcgtcgatt ggcacgcgat 481 atacacgagt tccgaaatat tccttcaaat agttgttgaa agttcggtat ctgtacgaat 541 gtgtttcggc tgtttttgat tcgagctgcg tattcatagg cgcaatgggc ttacttcggc 601 ggagatatgc tgtggaaaac gtccatacta ctacgagttg ccagtaataa gtaatgcgtt 661 gaagttagtt gacaccgttt ggtgcggttt gttattctgc ttgccgatgc tctctgcgtc 721 tagattatgc ccccatcgtc tagaggccta ggacacctcc ctttcacgga ggtaacgggg 781 att // LOCUS NODE_22774_length_636_cov_4.877797636 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 636) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 636) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..636 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene 211..561 /locus_tag="DP116_28450" CDS 211..561 /locus_tag="DP116_28450" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016862862.1" /note="Derived by automated 
computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="four helix bundle protein" /protein_id="PRJNA477356:DP116_28450" /translation="MSYRNQFIWQRAVQLAINCYKFTRLFPQSELYGLTNQIRRSAVS VASNIAEGYGRRSKQEYVQFLHIALGSLRELDTQLIIAKEVDLAEKNLFTPLLNEVEE MQSILVASLNKLKV" BASE COUNT 195 a 134 c 116 g 191 t ORIGIN 1 gcaggtgtaa ctatgtcagt cagcaatggg cccgttgaag ggcatattaa tcgactgaaa 61 atgttaaagc ggcaaatgta tggtcgtgcc aagatagacc ttcttgagcg acgattcttg 121 ttagcaacct aaagattgag aaggcagcta tgttgccttt tttcggtata atctttgtac 181 ttcaaacttc tctagacttc aaacttaatt atgagctaca gaaatcaatt tatctggcaa 241 agggcagttc aacttgccat taattgctat aagtttacgc gcctattccc tcagtcagaa 301 ttgtatggtt taaccaacca aatacgtcgt tccgcagtat ctgtagcatc caacatagcc 361 gaaggatatg gaaggcgctc aaaacaagaa tacgtacaat ttttgcatat tgctttaggt 421 tctttaagag aactcgatac acaattaatc attgctaaag aagtagattt ggcagagaaa 481 aaccttttta ctcccctatt aaatgaagtt gaggaaatgc aaagtatatt agttgcaagt 541 ttaaacaaac tcaaagtttg aatctataat ttccttcttc ctccttctgc cttctgccct 601 ctgccctctg ccttccccca agcgagatca ccaaaa // LOCUS NODE_23821_length_605_cov_2.720000605 bp DNA linear BCT25-JUN-2018 DEFINITION Brasilonema bromeliae SPC951, whole genome shotgun sequence. ACCESSION VERSION DBLINK BioProject: PRJNA477356 BioSample: SAMN09468270 KEYWORDS WGS. SOURCE Brasilonema bromeliae SPC951 ORGANISM Brasilonema bromeliae SPC951 Bacteria; Cyanobacteria; Nostocales; Scytonemataceae; Brasilonema. REFERENCE 1 (bases 1 to 605) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Comparative genomics of Brasilonema spp. strains JOURNAL Unpublished REFERENCE 2 (bases 1 to 605) AUTHORS Alvarenga,D.O., Fiore,M.F. and Varani,A.M. TITLE Direct Submission JOURNAL Submitted (25-JUN-2018) School of Agricultural and Veterinarian Sciences, Sao Paulo State University (UNESP), Via de Acesso Prof. 
Paulo Donato Castellane s/n, Jaboticabal, Sao Paulo 14884-900, Brazil COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/25/2018 16:41:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,693 CDS (total) :: 5,639 Genes (coding) :: 5,207 CDS (coding) :: 5,207 Genes (RNA) :: 54 rRNAs :: 1, 2, 2 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 2, 2 (16S, 23S) tRNAs :: 45 ncRNAs :: 4 Pseudo Genes (total) :: 432 Pseudo Genes (ambiguous residues) :: 58 of 432 Pseudo Genes (frameshifted) :: 142 of 432 Pseudo Genes (incomplete) :: 255 of 432 Pseudo Genes (internal stop) :: 71 of 432 Pseudo Genes (multiple problems) :: 88 of 432 CRISPR Arrays :: 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..605 /organism="Brasilonema bromeliae SPC951" /mol_type="genomic DNA" /strain="SPC951" /isolation_source="leaves" /host="bromeliads" /type_material="type strain of Brasilonema bromeliae" /db_xref="taxon:385972" /country="Brazil: Sao Paulo, Sao Paulo" /collection_date="2004" gene <1..337 /locus_tag="DP116_28455" CDS <1..337 /locus_tag="DP116_28455" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." 
/codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="PRJNA477356:DP116_28455" /translation="LWGGAGAARVPTGYLVKGTSGRGDDHPSPMSAPGIIPVQTSIVQ TGIFRSTHPTSLHTRPYAVRLCNRRWEVRVRFGGLVREPGACRIALGCGAGSNAGTAR KQRARVAQW" gene 317..388 /locus_tag="DP116_28460" tRNA 317..388 /locus_tag="DP116_28460" /product="tRNA-Thr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:349..351,aa:Thr,seq:tgt) gene 457..539 /locus_tag="DP116_28465" tRNA 457..539 /locus_tag="DP116_28465" /product="tRNA-Tyr" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:491..493,aa:Tyr,seq:gta) BASE COUNT 122 a 166 c 192 g 125 t ORIGIN 1 cctttggggt ggggcgggag cggcgcgggt gcccacgggg tacctcgtga aagggacgtc 61 gggccgcggg gatgatcacc caagcccaat gagcgccccg ggaataatac ccgttcagac 121 aagtatcgtc cagactggta ttttccgttc cacccacccg acgtcacttc acacgcgtcc 181 gtacgccgtc cggctttgca accgccggtg ggaagtccgg gtccggtttg gtgggcttgt 241 gcgcgaaccc ggagcttgta gaatcgctct tggatgtggt gccggctcga acgcgggcac 301 cgcacgaaag cagcgtgctc gcgtagctca gtggtagagc agcggttttg taaaccgctg 361 gtcgtgggtt caactccctc cgcgagcttt ccgtttccgt ccgcggtttg tataacggca 421 taggtgcgga gggccgcacc gaagtgaggg aaaaacgggt aggtggcaga gtggtcaatt 481 gcaccagact gtaaatctgg ctcctaacgg attcgggggt tcaaatccct ccctacccac 541 tgttcagaag gaaattcgaa atacgaagca cgaaatccga aatgaaggca atttcgctgc 601 tgttt //